CSPI04G09420 (gene) Cucumber (PI 183967) v1

Overview
Name: CSPI04G09420
Type: gene
Organism: Cucumis sativus var. hardwickii cv. PI 183967 (Cucumber (PI 183967) v1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Location: Chr4: 7366651 .. 7367533 (+)
RNA-Seq Expression: CSPI04G09420
Synteny: CSPI04G09420
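
As a rough illustration, the Location field can be parsed to recover the chromosome, strand, and span of the feature. The sketch below assumes the coordinates are 1-based and inclusive (the record does not state the convention); `parse_location` is a hypothetical helper name, not part of any database API.

```python
import re

def parse_location(text):
    """Parse a location string such as 'Chr4: 7366651 .. 7367533 (+)'."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "chrom": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Assumption: 1-based, inclusive coordinates, so span = end - start + 1.
        "length": end - start + 1,
    }

loc = parse_location("Chr4: 7366651 .. 7367533 (+)")
print(loc["chrom"], loc["strand"], loc["length"])
```

Under that assumption the feature spans 883 bases on the forward strand of Chr4.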
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGACCTAGTCAAAAGTAGCGTGTTGAACGAGGAGATGAGAAGAAAGTCTCAAAGTTCTTGTTCACAGTCAGATGTTTTGGTTAGTGAAAAGAGGGGGAGGAGTAAAAGTAAAGGTTTGAGAGGTATATAATAATATAAGCAAAAGCAAAAGTGACTGATTTGCAAATGTTGAGTGTTACTCTTGCCATAAAAAGAGGCATATAAAGAAGTATTGTCGAAAATTGAAAAGAGACCGTAAAAATCATAAGGGCAAGGAAAAGAAGAATGAGGATGATAGTGATACTGATACAATCACTGTAGCCACTGAAGATTTTTTCGTCTTGTCTGATGGTGATGTTGTAAATCTTGCCACACAACAGAGCAGTTGGGTGATTGATAGTGGTGCCTCAATTCATGCTACTTCAAAGAGGGAATTTTTTGCATCCTACACTCCTGGTGATTTTGGCAGTGTTATGATGGATAATGACGGATCAACAAATACAGTTGGCATCGGAGATGTACACTTGAAAAACAGAAATGGTTCTAGGCTGATTTTGAAAAATGTGAAACATATTCCTGATATTCGCATGAACTTGATTTCTATATGTAAGCTTGATGACGAAGGTTTCTACAATACCTTCAACAATGGCATATGGAAGCTTACTAAAGGTTCAATGGTTATAGCAAAGGGACAAAAATTTTCTTCACTGTACTATATGAATGCAAAAATCATGGATTCTGATATAAATACAGGGAATGATGAAGCAAATGTTGAGCTTTGGCATAAGAGACTTAGCCATATAAGTGAAAAGGGTTTAAAGATTTTAACCAAGAAAAATCATCTTCTTGATTTAAAGAGTACACCTCTAAAACAGTGTCCTCATTGTTTGGCAGGAAAGTAG

mRNA sequence

ATGGACCTAGTCAAAAGTAGCGTGTTGAACGAGGAGATGAGAAGAAAGTCTCAAAGTTCTTGTTCACAGTCAGATGTTTTGAGGCATATAAAGAAGTATTGTCGAAAATTGAAAAGAGACCGTAAAAATCATAAGGGCAAGGAAAAGAAGAATGAGGATGATAGTGATACTGATACAATCACTGTAGCCACTGAAGATTTTTTCGTCTTGTCTGATGGTGATGTTGTAAATCTTGCCACACAACAGAGCAGTTGGGTGATTGATAGTGGTGCCTCAATTCATGCTACTTCAAAGAGGGAATTTTTTGCATCCTACACTCCTGGTGATTTTGGCAGTGTTATGATGGATAATGACGGATCAACAAATACAGTTGGCATCGGAGATGTACACTTGAAAAACAGAAATGGTTCTAGGCTGATTTTGAAAAATGTGAAACATATTCCTGATATTCGCATGAACTTGATTTCTATATGTAAGCTTGATGACGAAGGTTTCTACAATACCTTCAACAATGGCATATGGAAGCTTACTAAAGGTTCAATGGTTATAGCAAAGGGACAAAAATTTTCTTCACTGTACTATATGAATGCAAAAATCATGGATTCTGATATAAATACAGGGAATGATGAAGCAAATGTTGAGCTTTGGCATAAGAGACTTAGCCATATAAGTGAAAAGGGTTTAAAGATTTTAACCAAGAAAAATCATCTTCTTGATTTAAAGAGTACACCTCTAAAACAGTGTCCTCATTGTTTGGCAGGAAAGTAG
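
The gene sequence and the mRNA sequence above differ only by the spliced-out intron, so the intron can be located by comparing the two. The following is a minimal sketch, assuming exactly one intron and otherwise identical sequences; `infer_single_intron` is a hypothetical helper, and the short sequences in the usage lines are made-up toy data, not the record's sequences. Where the bases at the two exon/intron junctions happen to match, the inferred boundary can shift by a few bases.

```python
def infer_single_intron(gene, mrna):
    """Return (start, end, intron) using 0-based, end-exclusive gene offsets.

    Assumes the mRNA equals the gene sequence with one internal segment removed.
    """
    # Longest common prefix: exon 1 (plus any chance matches at the junction).
    prefix = 0
    while prefix < min(len(gene), len(mrna)) and gene[prefix] == mrna[prefix]:
        prefix += 1
    # Longest common suffix, capped so it cannot overlap the prefix in the mRNA.
    suffix = 0
    max_suffix = len(mrna) - prefix
    while suffix < max_suffix and gene[-1 - suffix] == mrna[-1 - suffix]:
        suffix += 1
    if prefix + suffix != len(mrna):
        raise ValueError("sequences do not fit a single-intron model")
    start, end = prefix, len(gene) - suffix
    return start, end, gene[start:end]

# Toy example (hypothetical sequences): two exons around a GT...AG intron.
toy_gene = "ATGGCCTTG" + "GTAAGTCCAG" + "AGGCAT"
toy_mrna = "ATGGCCTTG" + "AGGCAT"
print(infer_single_intron(toy_gene, toy_mrna))
```

Capping the suffix length matters: without it, a base shared by the end of exon 1 and the end of the intron would be counted twice and the check would fail.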

Coding sequence (CDS)

ATGGACCTAGTCAAAAGTAGCGTGTTGAACGAGGAGATGAGAAGAAAGTCTCAAAGTTCTTGTTCACAGTCAGATGTTTTGAGGCATATAAAGAAGTATTGTCGAAAATTGAAAAGAGACCGTAAAAATCATAAGGGCAAGGAAAAGAAGAATGAGGATGATAGTGATACTGATACAATCACTGTAGCCACTGAAGATTTTTTCGTCTTGTCTGATGGTGATGTTGTAAATCTTGCCACACAACAGAGCAGTTGGGTGATTGATAGTGGTGCCTCAATTCATGCTACTTCAAAGAGGGAATTTTTTGCATCCTACACTCCTGGTGATTTTGGCAGTGTTATGATGGATAATGACGGATCAACAAATACAGTTGGCATCGGAGATGTACACTTGAAAAACAGAAATGGTTCTAGGCTGATTTTGAAAAATGTGAAACATATTCCTGATATTCGCATGAACTTGATTTCTATATGTAAGCTTGATGACGAAGGTTTCTACAATACCTTCAACAATGGCATATGGAAGCTTACTAAAGGTTCAATGGTTATAGCAAAGGGACAAAAATTTTCTTCACTGTACTATATGAATGCAAAAATCATGGATTCTGATATAAATACAGGGAATGATGAAGCAAATGTTGAGCTTTGGCATAAGAGACTTAGCCATATAAGTGAAAAGGGTTTAAAGATTTTAACCAAGAAAAATCATCTTCTTGATTTAAAGAGTACACCTCTAAAACAGTGTCCTCATTGTTTGGCAGGAAAGTAG

Protein sequence

MDLVKSSVLNEEMRRKSQSSCSQSDVLRHIKKYCRKLKRDRKNHKGKEKKNEDDSDTDTITVATEDFFVLSDGDVVNLATQQSSWVIDSGASIHATSKREFFASYTPGDFGSVMMDNDGSTNTVGIGDVHLKNRNGSRLILKNVKHIPDIRMNLISICKLDDEGFYNTFNNGIWKLTKGSMVIAKGQKFSSLYYMNAKIMDSDINTGNDEANVELWHKRLSHISEKGLKILTKKNHLLDLKSTPLKQCPHCLAGK*
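
The protein sequence is the codon-by-codon translation of the CDS, with `*` marking the stop codon. A minimal sketch of that translation follows, assuming the standard genetic code; `translate` is a hypothetical helper written for this illustration, and only the first and last few codons of the CDS above are used as input.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (first base slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a CDS in frame 0; '*' denotes a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

print(translate("ATGGACCTAGTCAAAAGTAGC"))  # first seven codons of the CDS
print(translate("GGAAAGTAG"))              # last three codons, including the stop
```

The two fragments translate to `MDLVKSS` and `GK*`, matching the start and end of the protein sequence listed above.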
Homology
BLAST of CSPI04G09420 vs. ExPASy Swiss-Prot
Match: P10978 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum OX=4097 PE=2 SV=1)

HSP 1 Score: 182.2 bits (461), Expect = 7.5e-45
Identity = 99/243 (40.74%), Positives = 147/243 (60.49%), Query Frame = 0

Query: 14  RRKSQ-SSCSQSDVLRHIKKYCRKLKRDRKNHKGKEKKNEDDSDTDTITVATEDFFVLSD 73
           R KS+  +C   +   H K+ C   ++ +    G  +KN+D++            F+  +
Sbjct: 224 RSKSRVRNCYNCNQPGHFKRDCPNPRKGKGETSG--QKNDDNTAAMVQNNDNVVLFINEE 283

Query: 74  GDVVNLATQQSSWVIDSGASIHATSKREFFASYTPGDFGSVMMDNDGSTNTVGIGDVHLK 133
            + ++L+  +S WV+D+ AS HAT  R+ F  Y  GDFG+V M N   +   GIGD+ +K
Sbjct: 284 EECMHLSGPESEWVVDTAASHHATPVRDLFCRYVAGDFGTVKMGNTSYSKIAGIGDICIK 343

Query: 134 NRNGSRLILKNVKHIPDIRMNLISICKLDDEGFYNTFNNGIWKLTKGSMVIAKGQKFSSL 193
              G  L+LK+V+H+PD+RMNLIS   LD +G+ + F N  W+LTKGS+VIAKG    +L
Sbjct: 344 TNVGCTLVLKDVRHVPDLRMNLISGIALDRDGYESYFANQKWRLTKGSLVIAKGVARGTL 403

Query: 194 YYMNAKIMDSDINTGNDEANVELWHKRLSHISEKGLKILTKKNHLLDLKSTPLKQCPHCL 253
           Y  NA+I   ++N   DE +V+LWHKR+ H+SEKGL+IL KK+ +   K T +K C +CL
Sbjct: 404 YRTNAEICQGELNAAQDEISVDLWHKRMGHMSEKGLQILAKKSLISYAKGTTVKPCDYCL 463

Query: 254 AGK 256
            GK
Sbjct: 464 FGK 464
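
Each HSP header reports its statistics as a fraction followed by a percentage, for example 99/243 (40.74%) for identity. The sketch below shows how those percentages follow from the fractions; `parse_hsp_stats` is a hypothetical helper, and the regular expression is an assumption about this page's line layout rather than a general BLAST parser.

```python
import re

def parse_hsp_stats(line):
    """Extract 'Label = n/d' pairs and recompute each percentage."""
    stats = {}
    for label, num, den in re.findall(r"(\w+)\s*=\s*(\d+)/(\d+)", line):
        n, d = int(num), int(den)
        stats[label] = (n, d, round(100 * n / d, 2))
    return stats

line = "Identity = 99/243 (40.74%), Positives = 147/243 (60.49%), Query Frame = 0"
for label, (n, d, pct) in parse_hsp_stats(line).items():
    print(f"{label}: {n}/{d} = {pct}%")
```

The denominator is the alignment length including gap columns, which is why it can exceed the number of aligned query residues.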

BLAST of CSPI04G09420 vs. ExPASy Swiss-Prot
Match: P93293 (Uncharacterized mitochondrial protein AtMg00300 OS=Arabidopsis thaliana OX=3702 GN=AtMg00300 PE=4 SV=1)

HSP 1 Score: 65.5 bits (158), Expect = 1.0e-09
Identity = 33/89 (37.08%), Positives = 49/89 (55.06%), Query Frame = 0

Query: 170 NNGIWKLTKGSMVIAKGQKFSSLYYMNAKIMDSDIN---TGNDEANVELWHKRLSHISEK 229
           + G+ K+ KG   I KG +  SLY +   +   + N   T  DE    LWH RL+H+S++
Sbjct: 25  SEGVLKVLKGCRTILKGNRHDSLYILQGSVETGESNLAETAKDE--TRLWHSRLAHMSQR 84

Query: 230 GLKILTKKNHLLDLKSTPLKQCPHCLAGK 256
           G+++L KK  L   K + LK C  C+ GK
Sbjct: 85  GMELLVKKGFLDSSKVSSLKFCEDCIYGK 111

BLAST of CSPI04G09420 vs. ExPASy TrEMBL
Match: A0A5D3CVK2 (Putative retrotransposon OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold2754G00140 PE=4 SV=1)

HSP 1 Score: 398.3 bits (1022), Expect = 2.5e-107
Identity = 217/303 (71.62%), Positives = 234/303 (77.23%), Query Frame = 0

Query: 1   MDLVKSSVLNEEMRRKSQSSCSQSDVL--------------------------------- 60
           MDLVKSSVLNEEMRRKSQSS  QSDVL                                 
Sbjct: 40  MDLVKSSVLNEEMRRKSQSSFVQSDVLVTERRGRSKSKGSRGNSRSKSKSDRFANVECHY 99

Query: 61  ----RHIKKYCRKLKRDRKNHKGKEKKNEDDSDTDTITVATEDFFVLSDGDVVNLATQQS 120
                HIKKYCRKLKRD KNHKGKEKKN+D+SDTDTI VATE+F++LS+GDVVNLA QQS
Sbjct: 100 CHEKGHIKKYCRKLKRDSKNHKGKEKKNDDESDTDTIIVATENFYILSNGDVVNLAIQQS 159

Query: 121 SWVIDSGASIHATSKREFFASYTPGDFGSVMMDNDGSTNTVGIGDVHLKNRNGSRLILKN 180
           SWVIDSGAS++ATSK +FFASYTP DFGSV M NDGS N VGIGDVHL NRNGSRLILKN
Sbjct: 160 SWVIDSGASVNATSKGQFFASYTPSDFGSVRMGNDGSANVVGIGDVHL-NRNGSRLILKN 219

Query: 181 VKHIPDIRMNLISICKLDDEGFYNTFNNGIWKLTKGSMVIAKGQKFSSLYYMNAKIMDSD 240
           VKHI DIRMNLIS  KLDDEGF NTF+NGIWKLTKGS+VIA+G KFSSLYYM+AKI+DSD
Sbjct: 220 VKHISDIRMNLISTGKLDDEGFCNTFDNGIWKLTKGSIVIARGHKFSSLYYMDAKIIDSD 279

Query: 241 INTGNDEANVELWHKRLSHISEKGLKILTKK-----------NHLLDLKSTPLKQCPHCL 256
           INT NDE N+ELWHKRLSH+SEKGLKILTKK           NHL DLKSTPLK+CPHCL
Sbjct: 280 INTVNDEVNIELWHKRLSHMSEKGLKILTKKNHLPDLKSTPLNHLPDLKSTPLKRCPHCL 339

BLAST of CSPI04G09420 vs. ExPASy TrEMBL
Match: A0A5D3BKF7 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold429G00180 PE=4 SV=1)

HSP 1 Score: 349.4 bits (895), Expect = 1.3e-92
Identity = 189/255 (74.12%), Positives = 197/255 (77.25%), Query Frame = 0

Query: 1   MDLVKSSVLNEEMRRKSQSSCSQSDVLRHIKKYCRKLKRDRKNHKGKEKKNEDDSDTDTI 60
           MDLVKSSVLNEEMRRKSQSS  QSD L                 +G+ K           
Sbjct: 163 MDLVKSSVLNEEMRRKSQSSSVQSDFL-------------VTERRGRSK----------- 222

Query: 61  TVATEDFFVLSDGDVVNLATQQSSWVIDSGASIHATSKREFFASYTPGDFGSVMMDNDGS 120
                     S G  VNLA QQSSWVIDSGAS+HATSKREFFASYTPGDFGSV M NDG 
Sbjct: 223 ----------SKGPRVNLAIQQSSWVIDSGASVHATSKREFFASYTPGDFGSVRMGNDGP 282

Query: 121 TNTVGIGDVHLKNRNGSRLILKNVKHIPDIRMNLISICKLDDEGFYNTFNNGIWKLTKGS 180
           TN VGIGDVHLKNRNGSRLILKNVKHIPDI MNLIS  KLDDEGF NTF+NGIWKLTKGS
Sbjct: 283 TNVVGIGDVHLKNRNGSRLILKNVKHIPDIHMNLISTGKLDDEGFCNTFDNGIWKLTKGS 342

Query: 181 MVIAKGQKFSSLYYMNAKIMDSDINTGNDEANVELWHKRLSHISEKGLKILTKKNHLLDL 240
           MVIA GQKFSSLYYM+AKI+D DINT NDEANVELWHKRLSH+SEKGLKILTKKNHL DL
Sbjct: 343 MVIASGQKFSSLYYMDAKIIDYDINTVNDEANVELWHKRLSHMSEKGLKILTKKNHLHDL 383

Query: 241 KSTPLKQCPHCLAGK 256
           KSTPLK+CPHCLAGK
Sbjct: 403 KSTPLKRCPHCLAGK 383

BLAST of CSPI04G09420 vs. ExPASy TrEMBL
Match: A0A5D3C706 (Putative retrotransposon OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold606G00910 PE=4 SV=1)

HSP 1 Score: 349.0 bits (894), Expect = 1.7e-92
Identity = 181/241 (75.10%), Positives = 199/241 (82.57%), Query Frame = 0

Query: 15  RKSQSSCSQSDVLRHIKKYCRKLKRDRKNHKGKEKKNEDDSDTDTITVATEDFFVLSDGD 74
           R +   C       HIK YCRKLKRD KNHKG+EKKN++DS+ DTI +A EDF++LS+GD
Sbjct: 60  RFANVECHYCHEKAHIKNYCRKLKRDHKNHKGREKKNDNDSNADTIIIAIEDFYILSNGD 119

Query: 75  VVNLATQQSSWVIDSGASIHATSKREFFASYTPGDFGSVMMDNDGSTNTVGIGDVHLKNR 134
           VVNLATQQ+S VIDSGAS HATSKRE  ASYTPGDFG+V M NDGSTN VGIG VHLKN 
Sbjct: 120 VVNLATQQNSCVIDSGASAHATSKREVSASYTPGDFGNVRMGNDGSTNAVGIGGVHLKNI 179

Query: 135 NGSRLILKNVKHIPDIRMNLISICKLDDEGFYNTFNNGIWKLTKGSMVIAKGQKFSSLYY 194
           NGSRLILKNVKHIPDIRMNLIS  KLD+EGF NTF+NGIWKLT+G MVIAKGQK S LYY
Sbjct: 180 NGSRLILKNVKHIPDIRMNLISTSKLDNEGFCNTFDNGIWKLTEGFMVIAKGQKISLLYY 239

Query: 195 MNAKIMDSDINTGNDEANVELWHKRLSHISEKGLKILTKKNHLLDLKSTPLKQCPHCLAG 254
           ++AKI+DSDINT N EANVELW  RLSH+SEKGLKIL KKNHL DLKS PLK   H LAG
Sbjct: 240 VDAKIIDSDINTVNGEANVELWQMRLSHMSEKGLKILIKKNHLPDLKSAPLKWFAHYLAG 299

Query: 255 K 256
           K
Sbjct: 300 K 300

BLAST of CSPI04G09420 vs. ExPASy TrEMBL
Match: A0A5A7TFU1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold35G00580 PE=4 SV=1)

HSP 1 Score: 347.8 bits (891), Expect = 3.8e-92
Identity = 188/255 (73.73%), Positives = 196/255 (76.86%), Query Frame = 0

Query: 1   MDLVKSSVLNEEMRRKSQSSCSQSDVLRHIKKYCRKLKRDRKNHKGKEKKNEDDSDTDTI 60
           MDLVKSSVLNEEMRRKSQSS  QSD L                 +G+ K           
Sbjct: 163 MDLVKSSVLNEEMRRKSQSSSVQSDFL-------------VTERRGRSK----------- 222

Query: 61  TVATEDFFVLSDGDVVNLATQQSSWVIDSGASIHATSKREFFASYTPGDFGSVMMDNDGS 120
                     S G  VNL  QQSSWVIDSGAS+HATSKREFFASYTPGDFGSV M NDG 
Sbjct: 223 ----------SKGPRVNLVIQQSSWVIDSGASVHATSKREFFASYTPGDFGSVRMGNDGP 282

Query: 121 TNTVGIGDVHLKNRNGSRLILKNVKHIPDIRMNLISICKLDDEGFYNTFNNGIWKLTKGS 180
           TN VGIGDVHLKNRNGSRLILKNVKHIPDI MNLIS  KLDDEGF NTF+NGIWKLTKGS
Sbjct: 283 TNVVGIGDVHLKNRNGSRLILKNVKHIPDIHMNLISTGKLDDEGFCNTFDNGIWKLTKGS 342

Query: 181 MVIAKGQKFSSLYYMNAKIMDSDINTGNDEANVELWHKRLSHISEKGLKILTKKNHLLDL 240
           MVIA GQKFSSLYYM+AKI+D DINT NDEANVELWHKRLSH+SEKGLKILTKKNHL DL
Sbjct: 343 MVIASGQKFSSLYYMDAKIIDYDINTVNDEANVELWHKRLSHMSEKGLKILTKKNHLHDL 383

Query: 241 KSTPLKQCPHCLAGK 256
           KSTPLK+CPHCLAGK
Sbjct: 403 KSTPLKRCPHCLAGK 383

BLAST of CSPI04G09420 vs. ExPASy TrEMBL
Match: A0A5C7IN93 (CCHC-type domain-containing protein OS=Acer yangbiense OX=1000413 GN=EZV62_005513 PE=4 SV=1)

HSP 1 Score: 321.2 bits (822), Expect = 3.8e-84
Identity = 171/292 (58.56%), Positives = 205/292 (70.21%), Query Frame = 0

Query: 1   MDLVKSSVLNEEMRRKSQSSCSQSDVL--------------------------------- 60
           MDL KSSVLNEEMRRKSQ S SQS+VL                                 
Sbjct: 172 MDLAKSSVLNEEMRRKSQGS-SQSEVLVTEKRGRSKSRGPKNRDRSKSKSNKFANVECYH 231

Query: 61  ----RHIKKYCRKLKRDRKNHKGKEKKNEDDSDTDTITVATEDFFVLSDGDVVNLATQQS 120
                HIKKYCR+LKRD KN KGKEKK +D +D D ++  T+DF V+ D DVVNLA  ++
Sbjct: 232 CGQKGHIKKYCRQLKRDHKNEKGKEKKTDDSNDGDRVSAVTDDFLVVYDDDVVNLACHET 291

Query: 121 SWVIDSGASIHATSKREFFASYTPGDFGSVMMDNDGSTNTVGIGDVHLKNRNGSRLILKN 180
           SWVIDSGASIHATS+R+FFASYT GDFG V M N+G    VG+GDV L+  NG  L+LKN
Sbjct: 292 SWVIDSGASIHATSRRDFFASYTSGDFGDVKMGNNGVAKAVGMGDVCLETNNGMMLLLKN 351

Query: 181 VKHIPDIRMNLISICKLDDEGFYNTFNNGIWKLTKGSMVIAKGQKFSSLYYMNAKIMDSD 240
           VKHIPDIR+NLIS  KLDDEGF NTF++G WKLTKGSM++A+G+K SSLY+M AK+ D  
Sbjct: 352 VKHIPDIRLNLISAGKLDDEGFCNTFSDGHWKLTKGSMIVARGKKCSSLYFMQAKVSDCI 411

Query: 241 INTGNDEANVELWHKRLSHISEKGLKILTKKNHLLDLKSTPLKQCPHCLAGK 256
           INT ++E+  ELWH+RL H+SEKGL +L KKN L  +K+ PLK+C HCLAGK
Sbjct: 412 INTVDNESTTELWHRRLGHMSEKGLMVLAKKNLLSGMKNAPLKKCAHCLAGK 462

BLAST of CSPI04G09420 vs. NCBI nr
Match: KAA0047570.1 (putative retrotransposon [Cucumis melo var. makuwa] >TYK14964.1 putative retrotransposon [Cucumis melo var. makuwa])

HSP 1 Score: 398.3 bits (1022), Expect = 5.1e-107
Identity = 217/303 (71.62%), Positives = 234/303 (77.23%), Query Frame = 0

Query: 1   MDLVKSSVLNEEMRRKSQSSCSQSDVL--------------------------------- 60
           MDLVKSSVLNEEMRRKSQSS  QSDVL                                 
Sbjct: 40  MDLVKSSVLNEEMRRKSQSSFVQSDVLVTERRGRSKSKGSRGNSRSKSKSDRFANVECHY 99

Query: 61  ----RHIKKYCRKLKRDRKNHKGKEKKNEDDSDTDTITVATEDFFVLSDGDVVNLATQQS 120
                HIKKYCRKLKRD KNHKGKEKKN+D+SDTDTI VATE+F++LS+GDVVNLA QQS
Sbjct: 100 CHEKGHIKKYCRKLKRDSKNHKGKEKKNDDESDTDTIIVATENFYILSNGDVVNLAIQQS 159

Query: 121 SWVIDSGASIHATSKREFFASYTPGDFGSVMMDNDGSTNTVGIGDVHLKNRNGSRLILKN 180
           SWVIDSGAS++ATSK +FFASYTP DFGSV M NDGS N VGIGDVHL NRNGSRLILKN
Sbjct: 160 SWVIDSGASVNATSKGQFFASYTPSDFGSVRMGNDGSANVVGIGDVHL-NRNGSRLILKN 219

Query: 181 VKHIPDIRMNLISICKLDDEGFYNTFNNGIWKLTKGSMVIAKGQKFSSLYYMNAKIMDSD 240
           VKHI DIRMNLIS  KLDDEGF NTF+NGIWKLTKGS+VIA+G KFSSLYYM+AKI+DSD
Sbjct: 220 VKHISDIRMNLISTGKLDDEGFCNTFDNGIWKLTKGSIVIARGHKFSSLYYMDAKIIDSD 279

Query: 241 INTGNDEANVELWHKRLSHISEKGLKILTKK-----------NHLLDLKSTPLKQCPHCL 256
           INT NDE N+ELWHKRLSH+SEKGLKILTKK           NHL DLKSTPLK+CPHCL
Sbjct: 280 INTVNDEVNIELWHKRLSHMSEKGLKILTKKNHLPDLKSTPLNHLPDLKSTPLKRCPHCL 339

BLAST of CSPI04G09420 vs. NCBI nr
Match: TYJ98688.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa])

HSP 1 Score: 349.4 bits (895), Expect = 2.7e-92
Identity = 189/255 (74.12%), Positives = 197/255 (77.25%), Query Frame = 0

Query: 1   MDLVKSSVLNEEMRRKSQSSCSQSDVLRHIKKYCRKLKRDRKNHKGKEKKNEDDSDTDTI 60
           MDLVKSSVLNEEMRRKSQSS  QSD L                 +G+ K           
Sbjct: 163 MDLVKSSVLNEEMRRKSQSSSVQSDFL-------------VTERRGRSK----------- 222

Query: 61  TVATEDFFVLSDGDVVNLATQQSSWVIDSGASIHATSKREFFASYTPGDFGSVMMDNDGS 120
                     S G  VNLA QQSSWVIDSGAS+HATSKREFFASYTPGDFGSV M NDG 
Sbjct: 223 ----------SKGPRVNLAIQQSSWVIDSGASVHATSKREFFASYTPGDFGSVRMGNDGP 282

Query: 121 TNTVGIGDVHLKNRNGSRLILKNVKHIPDIRMNLISICKLDDEGFYNTFNNGIWKLTKGS 180
           TN VGIGDVHLKNRNGSRLILKNVKHIPDI MNLIS  KLDDEGF NTF+NGIWKLTKGS
Sbjct: 283 TNVVGIGDVHLKNRNGSRLILKNVKHIPDIHMNLISTGKLDDEGFCNTFDNGIWKLTKGS 342

Query: 181 MVIAKGQKFSSLYYMNAKIMDSDINTGNDEANVELWHKRLSHISEKGLKILTKKNHLLDL 240
           MVIA GQKFSSLYYM+AKI+D DINT NDEANVELWHKRLSH+SEKGLKILTKKNHL DL
Sbjct: 343 MVIASGQKFSSLYYMDAKIIDYDINTVNDEANVELWHKRLSHMSEKGLKILTKKNHLHDL 383

Query: 241 KSTPLKQCPHCLAGK 256
           KSTPLK+CPHCLAGK
Sbjct: 403 KSTPLKRCPHCLAGK 383

BLAST of CSPI04G09420 vs. NCBI nr
Match: KAA0065636.1 (putative retrotransposon [Cucumis melo var. makuwa] >TYK07205.1 putative retrotransposon [Cucumis melo var. makuwa])

HSP 1 Score: 349.0 bits (894), Expect = 3.5e-92
Identity = 181/241 (75.10%), Positives = 199/241 (82.57%), Query Frame = 0

Query: 15  RKSQSSCSQSDVLRHIKKYCRKLKRDRKNHKGKEKKNEDDSDTDTITVATEDFFVLSDGD 74
           R +   C       HIK YCRKLKRD KNHKG+EKKN++DS+ DTI +A EDF++LS+GD
Sbjct: 60  RFANVECHYCHEKAHIKNYCRKLKRDHKNHKGREKKNDNDSNADTIIIAIEDFYILSNGD 119

Query: 75  VVNLATQQSSWVIDSGASIHATSKREFFASYTPGDFGSVMMDNDGSTNTVGIGDVHLKNR 134
           VVNLATQQ+S VIDSGAS HATSKRE  ASYTPGDFG+V M NDGSTN VGIG VHLKN 
Sbjct: 120 VVNLATQQNSCVIDSGASAHATSKREVSASYTPGDFGNVRMGNDGSTNAVGIGGVHLKNI 179

Query: 135 NGSRLILKNVKHIPDIRMNLISICKLDDEGFYNTFNNGIWKLTKGSMVIAKGQKFSSLYY 194
           NGSRLILKNVKHIPDIRMNLIS  KLD+EGF NTF+NGIWKLT+G MVIAKGQK S LYY
Sbjct: 180 NGSRLILKNVKHIPDIRMNLISTSKLDNEGFCNTFDNGIWKLTEGFMVIAKGQKISLLYY 239

Query: 195 MNAKIMDSDINTGNDEANVELWHKRLSHISEKGLKILTKKNHLLDLKSTPLKQCPHCLAG 254
           ++AKI+DSDINT N EANVELW  RLSH+SEKGLKIL KKNHL DLKS PLK   H LAG
Sbjct: 240 VDAKIIDSDINTVNGEANVELWQMRLSHMSEKGLKILIKKNHLPDLKSAPLKWFAHYLAG 299

Query: 255 K 256
           K
Sbjct: 300 K 300

BLAST of CSPI04G09420 vs. NCBI nr
Match: KAA0040427.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa])

HSP 1 Score: 347.8 bits (891), Expect = 7.9e-92
Identity = 188/255 (73.73%), Positives = 196/255 (76.86%), Query Frame = 0

Query: 1   MDLVKSSVLNEEMRRKSQSSCSQSDVLRHIKKYCRKLKRDRKNHKGKEKKNEDDSDTDTI 60
           MDLVKSSVLNEEMRRKSQSS  QSD L                 +G+ K           
Sbjct: 163 MDLVKSSVLNEEMRRKSQSSSVQSDFL-------------VTERRGRSK----------- 222

Query: 61  TVATEDFFVLSDGDVVNLATQQSSWVIDSGASIHATSKREFFASYTPGDFGSVMMDNDGS 120
                     S G  VNL  QQSSWVIDSGAS+HATSKREFFASYTPGDFGSV M NDG 
Sbjct: 223 ----------SKGPRVNLVIQQSSWVIDSGASVHATSKREFFASYTPGDFGSVRMGNDGP 282

Query: 121 TNTVGIGDVHLKNRNGSRLILKNVKHIPDIRMNLISICKLDDEGFYNTFNNGIWKLTKGS 180
           TN VGIGDVHLKNRNGSRLILKNVKHIPDI MNLIS  KLDDEGF NTF+NGIWKLTKGS
Sbjct: 283 TNVVGIGDVHLKNRNGSRLILKNVKHIPDIHMNLISTGKLDDEGFCNTFDNGIWKLTKGS 342

Query: 181 MVIAKGQKFSSLYYMNAKIMDSDINTGNDEANVELWHKRLSHISEKGLKILTKKNHLLDL 240
           MVIA GQKFSSLYYM+AKI+D DINT NDEANVELWHKRLSH+SEKGLKILTKKNHL DL
Sbjct: 343 MVIASGQKFSSLYYMDAKIIDYDINTVNDEANVELWHKRLSHMSEKGLKILTKKNHLHDL 383

Query: 241 KSTPLKQCPHCLAGK 256
           KSTPLK+CPHCLAGK
Sbjct: 403 KSTPLKRCPHCLAGK 383

BLAST of CSPI04G09420 vs. NCBI nr
Match: TXG70578.1 (hypothetical protein EZV62_005513 [Acer yangbiense])

HSP 1 Score: 321.2 bits (822), Expect = 7.9e-84
Identity = 171/292 (58.56%), Positives = 205/292 (70.21%), Query Frame = 0

Query: 1   MDLVKSSVLNEEMRRKSQSSCSQSDVL--------------------------------- 60
           MDL KSSVLNEEMRRKSQ S SQS+VL                                 
Sbjct: 172 MDLAKSSVLNEEMRRKSQGS-SQSEVLVTEKRGRSKSRGPKNRDRSKSKSNKFANVECYH 231

Query: 61  ----RHIKKYCRKLKRDRKNHKGKEKKNEDDSDTDTITVATEDFFVLSDGDVVNLATQQS 120
                HIKKYCR+LKRD KN KGKEKK +D +D D ++  T+DF V+ D DVVNLA  ++
Sbjct: 232 CGQKGHIKKYCRQLKRDHKNEKGKEKKTDDSNDGDRVSAVTDDFLVVYDDDVVNLACHET 291

Query: 121 SWVIDSGASIHATSKREFFASYTPGDFGSVMMDNDGSTNTVGIGDVHLKNRNGSRLILKN 180
           SWVIDSGASIHATS+R+FFASYT GDFG V M N+G    VG+GDV L+  NG  L+LKN
Sbjct: 292 SWVIDSGASIHATSRRDFFASYTSGDFGDVKMGNNGVAKAVGMGDVCLETNNGMMLLLKN 351

Query: 181 VKHIPDIRMNLISICKLDDEGFYNTFNNGIWKLTKGSMVIAKGQKFSSLYYMNAKIMDSD 240
           VKHIPDIR+NLIS  KLDDEGF NTF++G WKLTKGSM++A+G+K SSLY+M AK+ D  
Sbjct: 352 VKHIPDIRLNLISAGKLDDEGFCNTFSDGHWKLTKGSMIVARGKKCSSLYFMQAKVSDCI 411

Query: 241 INTGNDEANVELWHKRLSHISEKGLKILTKKNHLLDLKSTPLKQCPHCLAGK 256
           INT ++E+  ELWH+RL H+SEKGL +L KKN L  +K+ PLK+C HCLAGK
Sbjct: 412 INTVDNESTTELWHRRLGHMSEKGLMVLAKKNLLSGMKNAPLKKCAHCLAGK 462

BLAST of CSPI04G09420 vs. TAIR 10
Match: ATMG00300.1 (Gag-Pol-related retrotransposon family protein )

HSP 1 Score: 65.5 bits (158), Expect = 7.3e-11
Identity = 33/89 (37.08%), Positives = 49/89 (55.06%), Query Frame = 0

Query: 170 NNGIWKLTKGSMVIAKGQKFSSLYYMNAKIMDSDIN---TGNDEANVELWHKRLSHISEK 229
           + G+ K+ KG   I KG +  SLY +   +   + N   T  DE    LWH RL+H+S++
Sbjct: 25  SEGVLKVLKGCRTILKGNRHDSLYILQGSVETGESNLAETAKDE--TRLWHSRLAHMSQR 84

Query: 230 GLKILTKKNHLLDLKSTPLKQCPHCLAGK 256
           G+++L KK  L   K + LK C  C+ GK
Sbjct: 85  GMELLVKKGFLDSSKVSSLKFCEDCIYGK 111

The following BLAST results are available for this feature:

ExPASy Swiss-Prot
Match Name | E-value | Identity (%) | Description
P10978 | 7.5e-45 | 40.74 | Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum...
P93293 | 1.0e-09 | 37.08 | Uncharacterized mitochondrial protein AtMg00300 OS=Arabidopsis thaliana OX=3702 ...

ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A5D3CVK2 | 2.5e-107 | 71.62 | Putative retrotransposon OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffol...
A0A5D3BKF7 | 1.3e-92 | 74.12 | Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cucumis melo var....
A0A5D3C706 | 1.7e-92 | 75.10 | Putative retrotransposon OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffol...
A0A5A7TFU1 | 3.8e-92 | 73.73 | Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cucumis melo var....
A0A5C7IN93 | 3.8e-84 | 58.56 | CCHC-type domain-containing protein OS=Acer yangbiense OX=1000413 GN=EZV62_00551...

NCBI nr
Match Name | E-value | Identity (%) | Description
KAA0047570.1 | 5.1e-107 | 71.62 | putative retrotransposon [Cucumis melo var. makuwa] >TYK14964.1 putative retrotr...
TYJ98688.1 | 2.7e-92 | 74.12 | Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. m...
KAA0065636.1 | 3.5e-92 | 75.10 | putative retrotransposon [Cucumis melo var. makuwa] >TYK07205.1 putative retrotr...
KAA0040427.1 | 7.9e-92 | 73.73 | Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. m...
TXG70578.1 | 7.9e-84 | 58.56 | hypothetical protein EZV62_005513 [Acer yangbiense]

TAIR 10
Match Name | E-value | Identity (%) | Description
ATMG00300.1 | 7.3e-11 | 37.08 | Gag-Pol-related retrotransposon family protein
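
The summary rows above pair each hit with an E-value and a percent identity, which makes it straightforward to rank or filter hits. The sketch below is illustrative only: the `hits` list copies a few rows from the summary, `significant_hits` is a hypothetical helper, and the 1e-20 cutoff is an arbitrary assumption rather than a recommended threshold.

```python
# (match name, E-value, percent identity), copied from the summary rows above.
hits = [
    ("P10978", 7.5e-45, 40.74),
    ("P93293", 1.0e-09, 37.08),
    ("A0A5D3CVK2", 2.5e-107, 71.62),
    ("A0A5C7IN93", 3.8e-84, 58.56),
    ("ATMG00300.1", 7.3e-11, 37.08),
]

def significant_hits(rows, max_evalue=1e-20):
    """Keep hits at or below the E-value cutoff, most significant first."""
    kept = [row for row in rows if row[1] <= max_evalue]
    return sorted(kept, key=lambda row: row[1])

for name, evalue, identity in significant_hits(hits):
    print(f"{name}\t{evalue:.1e}\t{identity:.2f}")
```

E-values depend on the size of the database searched, so values from different databases are not directly comparable: the same 65.5-bit alignment is reported with 1.0e-09 against Swiss-Prot and 7.3e-11 against TAIR 10.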
InterPro
Analysis Name: InterPro Annotations of Cucumber (PI 183967) v1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR025724 | GAG-pre-integrase domain | PFAM | PF13976 | gag_pre-integrs | coord: 192..255; e-value: 6.3E-12; score: 45.2
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 1..25
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 37..56
None | No IPR available | PANTHER | PTHR34676 | FAMILY NOT NAMED | coord: 40..193
None | No IPR available | PANTHER | PTHR34676:SF12 | ZINC FINGER, CCHC-TYPE, RIBONUCLEASE H-LIKE DOMAIN, GAG-PRE-INTEGRASE DOMAIN PROTEIN-RELATED | coord: 40..193
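
The `coord` values in the InterPro rows are positions on the protein sequence, so each annotated region can be cut out directly. The sketch below assumes the coordinates are 1-based and inclusive (the record does not say so explicitly); `region` is a hypothetical helper, and the protein string is copied from the Protein sequence section with the trailing `*` removed.

```python
PROTEIN = (
    "MDLVKSSVLNEEMRRKSQSSCSQSDVLRHIKKYCRKLKRDRKNHKGKEKKNEDDSDTDTITVATEDFFVLSDGD"
    "VVNLATQQSSWVIDSGASIHATSKREFFASYTPGDFGSVMMDNDGSTNTVGIGDVHLKNRNGSRLILKNVKHIP"
    "DIRMNLISICKLDDEGFYNTFNNGIWKLTKGSMVIAKGQKFSSLYYMNAKIMDSDINTGNDEANVELWHKRLSH"
    "ISEKGLKILTKKNHLLDLKSTPLKQCPHCLAGK"
)

def region(seq, start, end):
    """Return seq[start..end], treating both bounds as 1-based and inclusive."""
    if not 1 <= start <= end <= len(seq):
        raise ValueError(f"coordinates {start}..{end} fall outside the sequence")
    return seq[start - 1:end]

# Assumption: 'coord: 192..255' (PF13976) uses 1-based inclusive positions.
gag_pre_integrase = region(PROTEIN, 192, 255)
print(len(gag_pre_integrase), gag_pre_integrase)
```

The same helper applies to the disorder predictions (1..25, 37..56) and the PANTHER family match (40..193).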

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
CSPI04G09420.1 | CSPI04G09420.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0015074 | DNA integration
molecular_function | GO:0003676 | nucleic acid binding