CSPI04G20730 (gene) Cucumber (PI 183967) v1

Overview
NameCSPI04G20730
Typegene
OrganismCucumis sativus var. hardwickii cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionTransposon TX1 uncharacterized 149 kDa protein
LocationChr4: 18992295 .. 18993560 (+)
RNA-Seq ExpressionCSPI04G20730
SyntenyCSPI04G20730
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGAAAAGCTAAAAGGACTGAAAGCCATCCTAAAGAGTTGGAATAAGGAGACTTTTGGTAAGATTTTCTCCCAAAAACAGGTGCTGATTGATAAGATTAACTATCTTGACTCACTTGAAGAGTCAAGTTGTCTCAACGAGGAAAATGTGAAGGAAAGAGAAAATTGTAGAGGGGCTCTGCTTGATTTGATTGTGAAAGAGCAAAAGTTGTGGATTCAGAAGTCGAAGCTTCATTGGCTTAGAGAGGGGGAGGAGAACTCAAGCTTCTTCCACATATGGGTTTCGGCTCGTAAAAGTAAAAGTATTCTTTCTTCCTTGGTTAGTATCGAAGGGAAGACTCTTGTCACAGAGAAGGAAATTGTGGATGAGATCCTTAGTTTCTTTTCAAATTTATATGGCACAAGGATCTCCTCGCCGTTTATTTGTGACATTCTTAATTGGAGAGGCCTTAGCTTACAGGATTCGAGTTTACTTGAGGTTCCCTTTACCGAAAAAGAAATTAGAGAAGTTGTATTTGAGATGGGTTGTCTCAAGTCCCCTGGCCCTGATGGCTTGACTGGAGAGTTTTATAAAAAGTCATGGAACATTTTGAAGTCCGACCTCGTAAGGGTGTTCCAAGATTTTTTTAAAAACGGAATTATTAACAGAAGATGTAATGAGACTTATATTTATCTCATCCCCAAAAAGAAAGAGGCGGCCCGTGTCAGTGACTTCAGACCCATTAGCTTAATTACCTCCTTGTATAAAGTTATCTCCAAGGTGCTTCCAACAAGACTTAAAAAAGTTCTTCCTTCGATAATTAATGATTCTCAAATGGCTTTTGTGGAAGGAAGGCAAATCCTTGATGCTATTCTAACTGCTTCTGAGGCTGTTGACGAATGGTCTTTAAGAGGCAGAAAAGGTGTTCTTTTAAAGCTCGATTTGGAGAAAGCTTATGATAAGGTGGATTGGTCTTTTCTTGATATGGCCATGAAACTTAAAGGCTTTGGTAAGAGATGTAGGAAGTGGATATGGGGATGCTTGTCGACAACTAATTTTTCCATAATTGTCAACGGCAGGCCTAGAGGAAAGATTATTGCTAAAAGGGGCATTCGTCAAGGTGATCCTCTTGCCCCTTTTCTTTTTACGATAGTGGGAGATGCTCCAAGTTGCCTTATTCACTACTGTAATGAGAAAAGGAGTTTAAAAGGCTTTCATTTTGAGAACCTGTCAGAGGATTTAACCCATCTTCAGTATGCAGACGACACTCTTCTTTCTTCTTCCTAG

mRNA sequence

ATGGAAAAGCTAAAAGGACTGAAAGCCATCCTAAAGAGTTGGAATAAGGAGACTTTTGGTAAGATTTTCTCCCAAAAACAGGTGCTGATTGATAAGATTAACTATCTTGACTCACTTGAAGAGTCAAGTTGTCTCAACGAGGAAAATGTGAAGGAAAGAGAAAATTGTAGAGGGGCTCTGCTTGATTTGATTGTGAAAGAGCAAAAGTTGTGGATTCAGAAGTCGAAGCTTCATTGGCTTAGAGAGGGGGAGGAGAACTCAAGCTTCTTCCACATATGGGTTTCGGCTCGTAAAAGTAAAAGTATTCTTTCTTCCTTGGTTAGTATCGAAGGGAAGACTCTTGTCACAGAGAAGGAAATTGTGGATGAGATCCTTAGTTTCTTTTCAAATTTATATGGCACAAGGATCTCCTCGCCGTTTATTTGTGACATTCTTAATTGGAGAGGCCTTAGCTTACAGGATTCGAGTTTACTTGAGGTTCCCTTTACCGAAAAAGAAATTAGAGAAGTTGTATTTGAGATGGGTTGTCTCAAGTCCCCTGGCCCTGATGGCTTGACTGGAGAGTTTTATAAAAAGTCATGGAACATTTTGAAGTCCGACCTCGTAAGGGTGTTCCAAGATTTTTTTAAAAACGGAATTATTAACAGAAGATGTAATGAGACTTATATTTATCTCATCCCCAAAAAGAAAGAGGCGGCCCGTGTCAGTGACTTCAGACCCATTAGCTTAATTACCTCCTTGTATAAAGTTATCTCCAAGGTGCTTCCAACAAGACTTAAAAAAGTTCTTCCTTCGATAATTAATGATTCTCAAATGGCTTTTGTGGAAGGAAGGCAAATCCTTGATGCTATTCTAACTGCTTCTGAGGCTGTTGACGAATGGTCTTTAAGAGGCAGAAAAGGTGTTCTTTTAAAGCTCGATTTGGAGAAAGCTTATGATAAGGTGGATTGGTCTTTTCTTGATATGGCCATGAAACTTAAAGGCTTTGGTAAGAGATGTAGGAAGTGGATATGGGGATGCTTGTCGACAACTAATTTTTCCATAATTGTCAACGGCAGGCCTAGAGGAAAGATTATTGCTAAAAGGGGCATTCGTCAAGGTGATCCTCTTGCCCCTTTTCTTTTTACGATAGTGGGAGATGCTCCAAGTTGCCTTATTCACTACTGTAATGAGAAAAGGAGTTTAAAAGGCTTTCATTTTGAGAACCTGTCAGAGGATTTAACCCATCTTCAGTATGCAGACGACACTCTTCTTTCTTCTTCCTAG

Coding sequence (CDS)

ATGGAAAAGCTAAAAGGACTGAAAGCCATCCTAAAGAGTTGGAATAAGGAGACTTTTGGTAAGATTTTCTCCCAAAAACAGGTGCTGATTGATAAGATTAACTATCTTGACTCACTTGAAGAGTCAAGTTGTCTCAACGAGGAAAATGTGAAGGAAAGAGAAAATTGTAGAGGGGCTCTGCTTGATTTGATTGTGAAAGAGCAAAAGTTGTGGATTCAGAAGTCGAAGCTTCATTGGCTTAGAGAGGGGGAGGAGAACTCAAGCTTCTTCCACATATGGGTTTCGGCTCGTAAAAGTAAAAGTATTCTTTCTTCCTTGGTTAGTATCGAAGGGAAGACTCTTGTCACAGAGAAGGAAATTGTGGATGAGATCCTTAGTTTCTTTTCAAATTTATATGGCACAAGGATCTCCTCGCCGTTTATTTGTGACATTCTTAATTGGAGAGGCCTTAGCTTACAGGATTCGAGTTTACTTGAGGTTCCCTTTACCGAAAAAGAAATTAGAGAAGTTGTATTTGAGATGGGTTGTCTCAAGTCCCCTGGCCCTGATGGCTTGACTGGAGAGTTTTATAAAAAGTCATGGAACATTTTGAAGTCCGACCTCGTAAGGGTGTTCCAAGATTTTTTTAAAAACGGAATTATTAACAGAAGATGTAATGAGACTTATATTTATCTCATCCCCAAAAAGAAAGAGGCGGCCCGTGTCAGTGACTTCAGACCCATTAGCTTAATTACCTCCTTGTATAAAGTTATCTCCAAGGTGCTTCCAACAAGACTTAAAAAAGTTCTTCCTTCGATAATTAATGATTCTCAAATGGCTTTTGTGGAAGGAAGGCAAATCCTTGATGCTATTCTAACTGCTTCTGAGGCTGTTGACGAATGGTCTTTAAGAGGCAGAAAAGGTGTTCTTTTAAAGCTCGATTTGGAGAAAGCTTATGATAAGGTGGATTGGTCTTTTCTTGATATGGCCATGAAACTTAAAGGCTTTGGTAAGAGATGTAGGAAGTGGATATGGGGATGCTTGTCGACAACTAATTTTTCCATAATTGTCAACGGCAGGCCTAGAGGAAAGATTATTGCTAAAAGGGGCATTCGTCAAGGTGATCCTCTTGCCCCTTTTCTTTTTACGATAGTGGGAGATGCTCCAAGTTGCCTTATTCACTACTGTAATGAGAAAAGGAGTTTAAAAGGCTTTCATTTTGAGAACCTGTCAGAGGATTTAACCCATCTTCAGTATGCAGACGACACTCTTCTTTCTTCTTCCTAG

Protein sequence

MEKLKGLKAILKSWNKETFGKIFSQKQVLIDKINYLDSLEESSCLNEENVKERENCRGALLDLIVKEQKLWIQKSKLHWLREGEENSSFFHIWVSARKSKSILSSLVSIEGKTLVTEKEIVDEILSFFSNLYGTRISSPFICDILNWRGLSLQDSSLLEVPFTEKEIREVVFEMGCLKSPGPDGLTGEFYKKSWNILKSDLVRVFQDFFKNGIINRRCNETYIYLIPKKKEAARVSDFRPISLITSLYKVISKVLPTRLKKVLPSIINDSQMAFVEGRQILDAILTASEAVDEWSLRGRKGVLLKLDLEKAYDKVDWSFLDMAMKLKGFGKRCRKWIWGCLSTTNFSIIVNGRPRGKIIAKRGIRQGDPLAPFLFTIVGDAPSCLIHYCNEKRSLKGFHFENLSEDLTHLQYADDTLLSSS*
Homology
BLAST of CSPI04G20730 vs. ExPASy Swiss-Prot
Match: P14381 (Transposon TX1 uncharacterized 149 kDa protein OS=Xenopus laevis OX=8355 PE=4 SV=1)

HSP 1 Score: 161.4 bits (407), Expect = 2.3e-38
Identity = 114/418 (27.27%), Postives = 202/418 (48.33%), Query Frame = 0

Query: 7   LKAILKSWNKETFGKIFSQKQVLIDKINYLD---SLEESSCLNEENVKERENCRGALLDL 66
           LK + + + K   G+  ++ + L  ++  L+   S  E   L  E ++ +E    AL ++
Sbjct: 293 LKLLCQEYTKSVSGQRNAEIEALNGEVLDLEQRLSGSEDQALQCEYLERKE----ALRNM 352

Query: 67  IVKEQKLWIQKSKLHWLREGEENSSFFHIWVSARKSKSILSSLVSIEGKTLVTEKEIVDE 126
             ++ +    +S++  L + +  S FF+     + ++  ++ L + +G  L   + I D 
Sbjct: 353 EQRQARGAFVRSRMQLLCDMDRGSRFFYALEKKKGNRKQITCLFAEDGTPLEDPEAIRDR 412

Query: 127 ILSFFSNLYGTRISSPFICDILNWRGL---SLQDSSLLEVPFTEKEIREVVFEMGCLKSP 186
             SF+ NL+     SP  C+ L W GL   S +    LE P T  E+ + +  M   KSP
Sbjct: 413 ARSFYQNLFSPDPISPDACEEL-WDGLPVVSERRKERLETPITLDELSQALRLMPHNKSP 472

Query: 187 GPDGLTGEFYKKSWNILKSDLVRVFQDFFKNGIINRRCNETYIYLIPKKKEAARVSDFRP 246
           G DGLT EF++  W+ L  D  RV  + FK G +   C    + L+PKK +   + ++RP
Sbjct: 473 GLDGLTIEFFQFFWDTLGPDFHRVLTEAFKKGELPLSCRRAVLSLLPKKGDLRLIKNWRP 532

Query: 247 ISLITSLYKVISKVLPTRLKKVLPSIINDSQMAFVEGRQILDAILTASEAVDEWSLRGRK 306
           +SL+++ YK+++K +  RLK VL  +I+  Q   V GR I D +    + +      G  
Sbjct: 533 VSLLSTDYKIVAKAISLRLKSVLAEVIHPDQSYTVPGRTIFDNVFLIRDLLHFARRTGLS 592

Query: 307 GVLLKLDLEKAYDKVDWSFLDMAMKLKGFGKRCRKWIWGCLSTTNFSIIVNGRPRGKIIA 366
              L LD EKA+D+VD  +L   ++   FG +   ++    ++    + +N      +  
Sbjct: 593 LAFLSLDQEKAFDRVDHQYLIGTLQAYSFGPQFVGYLKTMYASAECLVKINWSLTAPLAF 652

Query: 367 KRGIRQGDPLAPFLFTIVGDAPSCLIHYCNEKRSLKGFHFENLSEDLTHLQYADDTLL 419
            RG+RQG PL+  L+++  +   CL+     ++ L G   +     +    YADD +L
Sbjct: 653 GRGVRQGCPLSGQLYSLAIEPFLCLL-----RKRLTGLVLKEPDMRVVLSAYADDVIL 700

BLAST of CSPI04G20730 vs. ExPASy Swiss-Prot
Match: P08548 (LINE-1 reverse transcriptase homolog OS=Nycticebus coucang OX=9470 PE=4 SV=1)

HSP 1 Score: 139.8 bits (351), Expect = 7.0e-32
Identity = 113/422 (26.78%), Postives = 189/422 (44.79%), Query Frame = 0

Query: 3   KLKGLKAILKSWNKETFGKIFSQKQVLIDKINYLDSLEESSCLNEENVKERENCRGALLD 62
           K   L+A LK   +E    +    + L          EE S       KE    R  L +
Sbjct: 297 KFIALQAFLKKTEREEVNNLMGHLKQL--------EKEEHSNPKPSRRKEITKIRAELNE 356

Query: 63  LIVKEQKLWIQKSKLHWLREGEENSSFFHIWVSARKSKSILSSLVSIEGKTLVTEKEIVD 122
           +  K     I KSK  +  +  +           ++ KS++SS+ +   +      EI  
Sbjct: 357 IENKRIIQQINKSKSWFFEKINKIDKPLANLTRKKRVKSLISSIRNGNDEITTDPSEIQK 416

Query: 123 EILSFFSNLYGTRISS----PFICDILNWRGLSLQDSSLLEVPFTEKEIREVVFEMGCLK 182
            +  ++  LY  +  +        +  +   LS ++  +L  P +  EI   +  +   K
Sbjct: 417 ILNEYYKKLYSHKYENLKEIDQYLEACHLPRLSQKEVEMLNRPISSSEIASTIQNLPKKK 476

Query: 183 SPGPDGLTGEFYKKSWNILKSDLVRVFQDFFKNGIINRRCNETYIYLIPKK-KEAARVSD 242
           SPGPDG T EFY+     L   L+ +FQ+  K GI+     E  I LIPK  K+  R  +
Sbjct: 477 SPGPDGFTSEFYQTFKEELVPILLNLFQNIEKEGILPNTFYEANITLIPKPGKDPTRKEN 536

Query: 243 FRPISLITSLYKVISKVLPTRLKKVLPSIINDSQMAFVEGRQILDAILTASEAVDEWS-L 302
           +RPISL+    K+++K+L  R+++ +  II+  Q+ F+ G Q    I  +   +   + L
Sbjct: 537 YRPISLMNIDAKILNKILTNRIQQHIKKIIHHDQVGFIPGSQGWFNIRKSINVIQHINKL 596

Query: 303 RGRKGVLLKLDLEKAYDKVDWSFLDMAMKLKGFGKRCRKWIWGCLSTTNFSIIVNGRPRG 362
           + +  ++L +D EKA+D +   F+   +K  G      K I    S    +II+NG    
Sbjct: 597 KNKDHMILSIDAEKAFDNIQHPFMIRTLKKIGIEGTFLKLIEAIYSKPTANIILNGVKLK 656

Query: 363 KIIAKRGIRQGDPLAPFLFTIVGDAPSCLIHYCNEKRSLKGFHFENLSEDLTHLQYADDT 419
               + G RQG PL+P LF IV +  +  I    E++++KG H    SE++    +ADD 
Sbjct: 657 SFPLRSGTRQGCPLSPLLFNIVMEVLAIAI---REEKAIKGIHIG--SEEIKLSLFADDM 705

BLAST of CSPI04G20730 vs. ExPASy Swiss-Prot
Match: O00370 (LINE-1 retrotransposable element ORF2 protein OS=Homo sapiens OX=9606 PE=1 SV=1)

HSP 1 Score: 135.2 bits (339), Expect = 1.7e-30
Identity = 88/331 (26.59%), Postives = 158/331 (47.73%), Query Frame = 0

Query: 94  VSARKSKSILSSLVSIEGKTLVTEKEIVDEILSFFSNLYGTRISS----PFICDILNWRG 153
           +  ++ K+ + ++ + +G       EI   I  ++ +LY  ++ +        D      
Sbjct: 381 IKKKREKNQIDTIKNDKGDITTDPTEIQTTIREYYKHLYANKLENLEEMDTFLDTYTLPR 440

Query: 154 LSLQDSSLLEVPFTEKEIREVVFEMGCLKSPGPDGLTGEFYKKSWNILKSDLVRVFQDFF 213
           L+ ++   L  P T  EI  ++  +   KSPGPDG T EFY++    L   L+++FQ   
Sbjct: 441 LNQEEVESLNRPITGSEIVAIINSLPTKKSPGPDGFTAEFYQRYKEELVPFLLKLFQSIE 500

Query: 214 KNGIINRRCNETYIYLIPKK-KEAARVSDFRPISLITSLYKVISKVLPTRLKKVLPSIIN 273
           K GI+     E  I LIPK  ++  +  +FRPISL+    K+++K+L  R+++ +  +I+
Sbjct: 501 KEGILPNSFYEASIILIPKPGRDTTKKENFRPISLMNIDAKILNKILANRIQQHIKKLIH 560

Query: 274 DSQMAFVEGRQILDAILTASEAVDEWS-LRGRKGVLLKLDLEKAYDKVDWSFLDMAMKLK 333
             Q+ F+ G Q    I  +   +   +  + +  V++ +D EKA+DK+   F+   +   
Sbjct: 561 HDQVGFIPGMQGWFNIRKSINVIQHINRAKDKNHVIISIDAEKAFDKIQQPFMLKTLNKL 620

Query: 334 GFGKRCRKWIWGCLSTTNFSIIVNGRPRGKIIAKRGIRQGDPLAPFLFTIVGDAPSCLIH 393
           G      K I         +II+NG+       K G RQG PL+P LF IV +    L  
Sbjct: 621 GIDGMYLKIIRAIYDKPTANIILNGQKLEAFPLKTGTRQGCPLSPLLFNIVLEV---LAR 680

Query: 394 YCNEKRSLKGFHFENLSEDLTHLQYADDTLL 419
              +++ +KG       E++    +ADD ++
Sbjct: 681 AIRQEKEIKGIQLG--KEEVKLSLFADDMIV 706

BLAST of CSPI04G20730 vs. ExPASy Swiss-Prot
Match: P11369 (LINE-1 retrotransposable element ORF2 protein OS=Mus musculus OX=10090 GN=Pol PE=1 SV=2)

HSP 1 Score: 131.3 bits (329), Expect = 2.5e-29
Identity = 92/327 (28.13%), Postives = 157/327 (48.01%), Query Frame = 0

Query: 98  KSKSILSSLVSIEGKTLVTEKEIVDEILSFFSNLYGTRISS----PFICDILNWRGLSLQ 157
           + K +++ + + +G      +EI + I SF+  LY T++ +        D      L+  
Sbjct: 392 RDKILINKIRNEKGDITTDPEEIQNTIRSFYKRLYSTKLENLDEMDKFLDRYQVPKLNQD 451

Query: 158 DSSLLEVPFTEKEIREVVFEMGCLKSPGPDGLTGEFYKKSWNILKSDLVRVFQDFFKNGI 217
               L  P + KEI  V+  +   KSPGPDG + EFY+     L   L ++F      G 
Sbjct: 452 QVDHLNSPISPKEIEAVINSLPTKKSPGPDGFSAEFYQTFKEDLIPILHKLFHKIEVEGT 511

Query: 218 INRRCNETYIYLIPK-KKEAARVSDFRPISLITSLYKVISKVLPTRLKKVLPSIINDSQM 277
           +     E  I LIPK +K+  ++ +FRPISL+    K+++K+L  R+++ + +II+  Q+
Sbjct: 512 LPNSFYEATITLIPKPQKDPTKIENFRPISLMNIDAKILNKILANRIQEHIKAIIHPDQV 571

Query: 278 AFVEGRQILDAILTASEAVDEWS-LRGRKGVLLKLDLEKAYDKVDWSFLDMAMKLKGFGK 337
            F+ G Q    I  +   +   + L+ +  +++ LD EKA+DK+   F+   ++  G   
Sbjct: 572 GFIPGMQGWFNIRKSINVIHYINKLKDKNHMIISLDAEKAFDKIQHPFMIKVLERSGIQG 631

Query: 338 RCRKWIWGCLSTTNFSIIVNGRPRGKIIAKRGIRQGDPLAPFLFTIVGDAPSCLIHYCNE 397
                I    S    +I VNG     I  K G RQG PL+P+LF IV +    L     +
Sbjct: 632 PYLNMIKAIYSKPVANIKVNGEKLEAIPLKSGTRQGCPLSPYLFNIVLEV---LARAIRQ 691

Query: 398 KRSLKGFHFENLSEDLTHLQYADDTLL 419
           ++ +KG       E++     ADD ++
Sbjct: 692 QKEIKGIQIG--KEEVKISLLADDMIV 713

BLAST of CSPI04G20730 vs. ExPASy Swiss-Prot
Match: Q03274 (Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 (Fragment) OS=Popillia japonica OX=7064 PE=4 SV=1)

HSP 1 Score: 85.1 bits (209), Expect = 2.1e-15
Identity = 72/246 (29.27%), Postives = 119/246 (48.37%), Query Frame = 0

Query: 179 SPGPDGLTGEFYKKSWNILKSDLVRVF-QDFFKNGIINRRCNETYIYLIPKKKEAARVSD 238
           +PG DGLT +       I ++ L R F Q     G +          LIPK  +    S+
Sbjct: 22  APGSDGLTVQ------AITRTRLPRNFVQLHLLRGHVPTPWTAMRTTLIPKDGDLENPSN 81

Query: 239 FRPISLITSLYKVISKVLPTRLKKVLPSIINDSQMAFVEGRQILDAILTASEAVDEW--S 298
           +RPI++ ++L +++ ++L  RL+  +   ++ +Q  +      +D  L  S  +D +  S
Sbjct: 82  WRPITIASALQRLLHRILAKRLEAAVE--LHPAQKGYAR----IDGTLVNSLLLDTYISS 141

Query: 299 LRGRKGV--LLKLDLEKAYDKVDWSFLDMAMKLKGFGKRCRKWIWGCLSTTNFSIIVN-G 358
            R ++    ++ LD+ KA+D V  S +  A++  G  +    +I G LS +  +I V  G
Sbjct: 142 RREQRKTYNVVSLDVRKAFDTVSHSSICRALQRLGIDEGTSNYITGSLSDSTTTIRVGPG 201

Query: 359 RPRGKIIAKRGIRQGDPLAPFLFTIVGDAPSCLIHYCNEKRSLKGFHFENLSEDLTHLQY 418
               KI  +RG++QGDPL+PFLF  V D   C +      +S  G       E +  L +
Sbjct: 202 SQTRKICIRRGVKQGDPLSPFLFNAVLDELLCSL------QSTPGIGGTIGEEKIPVLAF 249

BLAST of CSPI04G20730 vs. ExPASy TrEMBL
Match: A0A438IK87 (Transposon TX1 uncharacterized 149 kDa protein OS=Vitis vinifera OX=29760 GN=YTX2_766 PE=4 SV=1)

HSP 1 Score: 394.0 bits (1011), Expect = 7.6e-106
Identity = 191/420 (45.48%), Postives = 285/420 (67.86%), Query Frame = 0

Query: 1   MEKLKGLKAILKSWNKETFGKIFSQKQVLIDKINYLDSLEESSCLNEENVKERENCRGAL 60
           M KLK +K+ LK WN  TFG +  +K++++  ++ +D +E+   LN + V ER   R  L
Sbjct: 30  MRKLKFVKSKLKEWNIMTFGDLKERKKLILTDLSRIDLIEQEGNLNSDLVLERTLKRREL 89

Query: 61  LDLIVKEQKLWIQKSKLHWLREGEENSSFFHIWVSARKSKSILSSLVSIEGKTLVTEKEI 120
            D+++KE+  W QKS++ W++EG+ NS FFH   + RKS+  + SL+S  G+TL   ++I
Sbjct: 90  EDVLLKEEVQWRQKSRIKWIKEGDCNSKFFHRVATGRKSRKFIKSLISERGETLNNIEDI 149

Query: 121 VDEILSFFSNLYGTRISSPFICDILNWRGLSLQDSSLLEVPFTEKEIREVVFEMGCLKSP 180
            +EI++FF NLY   +   +  + ++W  +S +    L+ PFTE+E+R  VF++   K+P
Sbjct: 150 SEEIVNFFGNLYSKPVGESWRVEGIDWVPISGESGVWLDRPFTEEEVRRAVFQLNKEKAP 209

Query: 181 GPDGLTGEFYKKSWNILKSDLVRVFQDFFKNGIINRRCNETYIYLIPKKKEAARVSDFRP 240
           GPDG T   Y++ W+++K DL+RVF +F  NG+IN+  N T+I L+PKK ++ ++SD+RP
Sbjct: 210 GPDGFTIAVYQECWDVIKEDLMRVFLEFHTNGVINQSTNATFIALVPKKSQSVKISDYRP 269

Query: 241 ISLITSLYKVISKVLPTRLKKVLPSIINDSQMAFVEGRQILDAILTASEAVDEWSLRGRK 300
           ISL+TSLYK+I+KVL  RL+KVL   I+DSQ AFVEGR ILDA+L A+E VDE    G +
Sbjct: 270 ISLVTSLYKIIAKVLSGRLRKVLHETISDSQGAFVEGRHILDAVLIANEVVDEKRRSGEE 329

Query: 301 GVLLKLDLEKAYDKVDWSFLDMAMKLKGFGKRCRKWIWGCLSTTNFSIIVNGRPRGKIIA 360
           G++ K+D EKAYD VDW FLD  ++ KGF ++ R WI GCLS+++F+I+VNG  +G + A
Sbjct: 330 GIVFKIDFEKAYDHVDWGFLDHVLQRKGFSQKWRLWIRGCLSSSSFAILVNGNAKGWVKA 389

Query: 361 KRGIRQGDPLAPFLFTIVGDAPSCLIHYCNEKRSLKGFHFENLSEDLTHLQYADDTLLSS 420
            RG+RQGDPL+PFLFT+V D  S ++    E    +GF        ++ LQ+ADDT+  S
Sbjct: 390 SRGLRQGDPLSPFLFTLVADVLSRMLFRAEETGLTEGFSVGRDRTRVSLLQFADDTIFFS 449

BLAST of CSPI04G20730 vs. ExPASy TrEMBL
Match: A0A438FWU5 (LINE-1 retrotransposable element ORF2 protein OS=Vitis vinifera OX=29760 GN=LORF2_70 PE=4 SV=1)

HSP 1 Score: 393.3 bits (1009), Expect = 1.3e-105
Identity = 190/420 (45.24%), Postives = 285/420 (67.86%), Query Frame = 0

Query: 1    MEKLKGLKAILKSWNKETFGKIFSQKQVLIDKINYLDSLEESSCLNEENVKERENCRGAL 60
            M KLK +K+ LK WN  TFG +  +K++++  ++ +D +E+   LN + V ER   R  L
Sbjct: 1010 MRKLKFVKSKLKEWNIMTFGDLKERKKLILTDLSRIDLIEQEGNLNSDLVLERTLKRREL 1069

Query: 61   LDLIVKEQKLWIQKSKLHWLREGEENSSFFHIWVSARKSKSILSSLVSIEGKTLVTEKEI 120
             D+++KE+  W QKS++ W++EG+ NS FFH   + R+S+  + SL+S  G+TL   ++I
Sbjct: 1070 EDVLLKEEVQWRQKSRVKWIKEGDCNSKFFHRVATGRRSRKFIKSLISERGETLNNIEDI 1129

Query: 121  VDEILSFFSNLYGTRISSPFICDILNWRGLSLQDSSLLEVPFTEKEIREVVFEMGCLKSP 180
             +EI++FF NLY   +   +  + ++W  +S +    L+ PFTE+E+R  VF++   K+P
Sbjct: 1130 SEEIVNFFGNLYSKPVGESWRVEGIDWVPISGESGGWLDRPFTEEEVRRAVFQLNKEKAP 1189

Query: 181  GPDGLTGEFYKKSWNILKSDLVRVFQDFFKNGIINRRCNETYIYLIPKKKEAARVSDFRP 240
            GPDG T   Y++ W+++K DL+RVF +F  NG+IN+  N T+I L+PKK ++ ++SD+RP
Sbjct: 1190 GPDGFTIAVYQECWDVIKEDLMRVFLEFHTNGVINQSTNATFIALVPKKSQSVKISDYRP 1249

Query: 241  ISLITSLYKVISKVLPTRLKKVLPSIINDSQMAFVEGRQILDAILTASEAVDEWSLRGRK 300
            ISL+TSLYK+I+KVL  RL+KVL   I+DSQ AFVEGR ILDA+L A+E VDE    G +
Sbjct: 1250 ISLVTSLYKIIAKVLSGRLRKVLHETISDSQGAFVEGRHILDAVLIANEVVDEKRRSGEE 1309

Query: 301  GVLLKLDLEKAYDKVDWSFLDMAMKLKGFGKRCRKWIWGCLSTTNFSIIVNGRPRGKIIA 360
            G++ K+D EKAYD VDW FLD  ++ KGF ++ R WI GCLS+++F+I+VNG  +G + A
Sbjct: 1310 GIVFKIDFEKAYDHVDWGFLDHVLQRKGFSQKWRLWIRGCLSSSSFAILVNGNAKGWVKA 1369

Query: 361  KRGIRQGDPLAPFLFTIVGDAPSCLIHYCNEKRSLKGFHFENLSEDLTHLQYADDTLLSS 420
             RG+RQGDPL+PFLFT+V D  S ++    E    +GF        ++ LQ+ADDT+  S
Sbjct: 1370 SRGLRQGDPLSPFLFTLVADVLSRMLFRAEETGLTEGFSVGRDRTRVSLLQFADDTIFFS 1429

BLAST of CSPI04G20730 vs. ExPASy TrEMBL
Match: A5BV95 (Reverse transcriptase domain-containing protein OS=Vitis vinifera OX=29760 GN=VITISV_026478 PE=4 SV=1)

HSP 1 Score: 393.3 bits (1009), Expect = 1.3e-105
Identity = 190/420 (45.24%), Postives = 285/420 (67.86%), Query Frame = 0

Query: 1    MEKLKGLKAILKSWNKETFGKIFSQKQVLIDKINYLDSLEESSCLNEENVKERENCRGAL 60
            M KLK +K+ LK WN  TFG +  +K++++  ++ +D +E+   LN + V ER   R  L
Sbjct: 897  MRKLKFVKSKLKEWNIMTFGDLKERKKLILTDLSRIDLIEQEGNLNSDLVLERTLKRREL 956

Query: 61   LDLIVKEQKLWIQKSKLHWLREGEENSSFFHIWVSARKSKSILSSLVSIEGKTLVTEKEI 120
             D+++KE+  W QKS++ W++EG+ NS FFH   + R+S+  + SL+S  G+TL   ++I
Sbjct: 957  EDVLLKEEVQWRQKSRVKWIKEGDCNSKFFHRVATGRRSRKFIKSLISERGETLNNIEDI 1016

Query: 121  VDEILSFFSNLYGTRISSPFICDILNWRGLSLQDSSLLEVPFTEKEIREVVFEMGCLKSP 180
             +EI++FF NLY   +   +  + ++W  +S +    L+ PFTE+E+R  VF++   K+P
Sbjct: 1017 SEEIVNFFGNLYSKPVGESWRXEGIDWVPISGESGGWLDRPFTEEEVRRAVFQLNKEKAP 1076

Query: 181  GPDGLTGEFYKKSWNILKSDLVRVFQDFFKNGIINRRCNETYIYLIPKKKEAARVSDFRP 240
            GPDG T   Y++ W+++K DL+RVF +F  NG+IN+  N T+I L+PKK ++ ++SD+RP
Sbjct: 1077 GPDGFTIAVYQECWDVIKEDLMRVFLEFHTNGVINQSTNATFIALVPKKSQSVKISDYRP 1136

Query: 241  ISLITSLYKVISKVLPTRLKKVLPSIINDSQMAFVEGRQILDAILTASEAVDEWSLRGRK 300
            ISL+TSLYK+I+KVL  RL+KVL   I+DSQ AFVEGR ILDA+L A+E VDE    G +
Sbjct: 1137 ISLVTSLYKIIAKVLSGRLRKVLHETISDSQGAFVEGRHILDAVLIANEVVDEKRRSGEE 1196

Query: 301  GVLLKLDLEKAYDKVDWSFLDMAMKLKGFGKRCRKWIWGCLSTTNFSIIVNGRPRGKIIA 360
            G++ K+D EKAYD VDW FLD  ++ KGF ++ R WI GCLS+++F+I+VNG  +G + A
Sbjct: 1197 GIVFKIDFEKAYDHVDWGFLDHVLQRKGFSQKWRLWIRGCLSSSSFAILVNGNAKGWVKA 1256

Query: 361  KRGIRQGDPLAPFLFTIVGDAPSCLIHYCNEKRSLKGFHFENLSEDLTHLQYADDTLLSS 420
             RG+RQGDPL+PFLFT+V D  S ++    E    +GF        ++ LQ+ADDT+  S
Sbjct: 1257 SRGLRQGDPLSPFLFTLVADVLSRMLFRAEETGLTEGFSVGRDRTRVSLLQFADDTIFFS 1316

BLAST of CSPI04G20730 vs. ExPASy TrEMBL
Match: A0A2P6QQZ3 (Putative RNA-directed DNA polymerase OS=Rosa chinensis OX=74649 GN=RchiOBHm_Chr4g0393411 PE=4 SV=1)

HSP 1 Score: 392.9 bits (1008), Expect = 1.7e-105
Identity = 192/417 (46.04%), Postives = 280/417 (67.15%), Query Frame = 0

Query: 1   MEKLKGLKAILKSWNKETFGKIFSQKQVLIDKINYLDSLEESSCLNEENVKERENCRGAL 60
           M KLK +K  LK W++ETFG I  +K+V+  +IN LD  E S+ +     +ERE  RG L
Sbjct: 30  MRKLKNVKGKLKIWSRETFGDIGKEKKVVEARINELDEEERSAGICVVKKREREVLRGRL 89

Query: 61  LDLIVKEQKLWIQKSKLHWLREGEENSSFFHIWVSARKSKSILSSLVSIEGKTLVTEKEI 120
            +L ++E+  W Q++KL W +EG+ N+ FFH  V+ R+ ++++  L    G  +  E  I
Sbjct: 90  EELALREEIFWRQRAKLKWAKEGDNNTRFFHKLVNGRRKRNVIEKLELSNGIVVEEEDLI 149

Query: 121 VDEILSFFSNLYGTRISSPFICDILNWRGLSLQDSSLLEVPFTEKEIREVVFEMGCLKSP 180
            +EI+ F+ NLY ++    F  + L+W  +S++ +  LE PF E+EI+  VFE   +KSP
Sbjct: 150 EEEIIRFYKNLYSSKEEVSFGIEGLHWNPISVEKARWLETPFEEEEIKRAVFECDTVKSP 209

Query: 181 GPDGLTGEFYKKSWNILKSDLVRVFQDFFKNGIINRRCNETYIYLIPKKKEAARVSDFRP 240
           GPDG +    +++W ++K +++ V  +F  NG++N+  NETYI LIPKK  + +V D+RP
Sbjct: 210 GPDGFSFAVLQRNWEVVKREVLDVMAEFHTNGVVNKVTNETYICLIPKKANSLKVGDYRP 269

Query: 241 ISLITSLYKVISKVLPTRLKKVLPSIINDSQMAFVEGRQILDAILTASEAVDEWSLRGRK 300
           ISLITSLYK+I+KVL  RL++VL   I+ +Q AF++GRQILDA+L A+E VDE   + ++
Sbjct: 270 ISLITSLYKIIAKVLAWRLREVLSDTISGAQGAFIKGRQILDAVLVANEVVDETRKKKKE 329

Query: 301 GVLLKLDLEKAYDKVDWSFLDMAMKLKGFGKRCRKWIWGCLSTTNFSIIVNGRPRGKIIA 360
           G++ K+D EKAYD V+W+FLD AM+ KGFG R RKWI GCL + NFSI +NGRPRGK  A
Sbjct: 330 GLVFKIDFEKAYDHVEWNFLDYAMESKGFGDRWRKWIGGCLRSANFSIQINGRPRGKFGA 389

Query: 361 KRGIRQGDPLAPFLFTIVGDAPSCLIHYCNEKRSLKGFHFENLSEDLTHLQYADDTL 418
            RG+RQGDPL+ FLFT+V D    L+    + R ++G        ++THLQ+ADDT+
Sbjct: 390 SRGLRQGDPLSSFLFTLVVDVLDRLMERARDCRLIEGLVVGREEVEITHLQFADDTI 446

BLAST of CSPI04G20730 vs. ExPASy TrEMBL
Match: A0A5E4GN72 (PREDICTED: RNA-directed DNA polymerase (Fragment) OS=Prunus dulcis OX=3755 GN=ALMOND_2B014918 PE=4 SV=1)

HSP 1 Score: 392.5 bits (1007), Expect = 2.2e-105
Identity = 193/415 (46.51%), Postives = 275/415 (66.27%), Query Frame = 0

Query: 3   KLKGLKAILKSWNKETFGKIFSQKQVLIDKINYLDSLEESSCLNEENVKERENCRGALLD 62
           +L+ +K  +K WNKE FG + S K+    +I  LD +E    L+    KERE+    + D
Sbjct: 212 RLRTIKQKIKVWNKEVFGDLVSAKKEAEARIAALDLMEGQGGLDNTLRKEREDLYFMVSD 271

Query: 63  LIVKEQKLWIQKSKLHWLREGEENSSFFHIWVSARKSKSILSSLVSIEGKTLVTEKEIVD 122
           L+ KE+  W Q+ K+ W R+G+ N+ FFH   S R+ ++ +  L    G  +V+E EI  
Sbjct: 272 LVHKEEVKWRQRGKIQWARDGDSNTKFFHRIASGRRKRNFIQKLEVAGGGVVVSEGEIEL 331

Query: 123 EILSFFSNLYGTRISSPFICDILNWRGLSLQDSSLLEVPFTEKEIREVVFEMGCLKSPGP 182
           EI++FF NLY + + + +  + LNW  +S++++  L+ PF E+E++  VF+ G  KSPGP
Sbjct: 332 EIINFFKNLYSSNVEAGWCLEGLNWNAISVEEAEWLDRPFEEEEVKRAVFDCGIDKSPGP 391

Query: 183 DGLTGEFYKKSWNILKSDLVRVFQDFFKNGIINRRCNETYIYLIPKKKEAARVSDFRPIS 242
           DG +   ++  W+I+K DL++V  DFF  GIIN   NET+I LIPKKKE+ +VSDFRPIS
Sbjct: 392 DGFSMLLFQSCWDIVKEDLMKVMADFFNCGIINAITNETFICLIPKKKESVKVSDFRPIS 451

Query: 243 LITSLYKVISKVLPTRLKKVLPSIINDSQMAFVEGRQILDAILTASEAVDEWSLRGRKGV 302
           L+TSLYK++SKVL +RL++VL S I+  Q AFV+GRQILDA L A+E V+E     + G+
Sbjct: 452 LVTSLYKMVSKVLASRLREVLGSTISSYQSAFVQGRQILDAALIANEVVEESRRLNKSGM 511

Query: 303 LLKLDLEKAYDKVDWSFLDMAMKLKGFGKRCRKWIWGCLSTTNFSIIVNGRPRGKIIAKR 362
           + K+DLEKAYD V+W F+D  +  KGFG R R WI GCL T NFS+++NGRPRGK  A R
Sbjct: 512 VFKIDLEKAYDHVEWRFVDEVLIRKGFGDRWRSWIRGCLETANFSVMINGRPRGKFRASR 571

Query: 363 GIRQGDPLAPFLFTIVGDAPSCLIHYCNEKRSLKGFHFENLSEDLTHLQYADDTL 418
           G+RQGDPL+PFLFT+V D  S ++    +     G    N   +++HLQ+ADDT+
Sbjct: 572 GLRQGDPLSPFLFTLVMDVLSRIMEKAQDADEFHGLSPGNGMVEISHLQFADDTI 626

BLAST of CSPI04G20730 vs. NCBI nr
Match: RVW97045.1 (Transposon TX1 uncharacterized 149 kDa protein [Vitis vinifera])

HSP 1 Score: 394.0 bits (1011), Expect = 1.6e-105
Identity = 191/420 (45.48%), Postives = 285/420 (67.86%), Query Frame = 0

Query: 1   MEKLKGLKAILKSWNKETFGKIFSQKQVLIDKINYLDSLEESSCLNEENVKERENCRGAL 60
           M KLK +K+ LK WN  TFG +  +K++++  ++ +D +E+   LN + V ER   R  L
Sbjct: 30  MRKLKFVKSKLKEWNIMTFGDLKERKKLILTDLSRIDLIEQEGNLNSDLVLERTLKRREL 89

Query: 61  LDLIVKEQKLWIQKSKLHWLREGEENSSFFHIWVSARKSKSILSSLVSIEGKTLVTEKEI 120
            D+++KE+  W QKS++ W++EG+ NS FFH   + RKS+  + SL+S  G+TL   ++I
Sbjct: 90  EDVLLKEEVQWRQKSRIKWIKEGDCNSKFFHRVATGRKSRKFIKSLISERGETLNNIEDI 149

Query: 121 VDEILSFFSNLYGTRISSPFICDILNWRGLSLQDSSLLEVPFTEKEIREVVFEMGCLKSP 180
            +EI++FF NLY   +   +  + ++W  +S +    L+ PFTE+E+R  VF++   K+P
Sbjct: 150 SEEIVNFFGNLYSKPVGESWRVEGIDWVPISGESGVWLDRPFTEEEVRRAVFQLNKEKAP 209

Query: 181 GPDGLTGEFYKKSWNILKSDLVRVFQDFFKNGIINRRCNETYIYLIPKKKEAARVSDFRP 240
           GPDG T   Y++ W+++K DL+RVF +F  NG+IN+  N T+I L+PKK ++ ++SD+RP
Sbjct: 210 GPDGFTIAVYQECWDVIKEDLMRVFLEFHTNGVINQSTNATFIALVPKKSQSVKISDYRP 269

Query: 241 ISLITSLYKVISKVLPTRLKKVLPSIINDSQMAFVEGRQILDAILTASEAVDEWSLRGRK 300
           ISL+TSLYK+I+KVL  RL+KVL   I+DSQ AFVEGR ILDA+L A+E VDE    G +
Sbjct: 270 ISLVTSLYKIIAKVLSGRLRKVLHETISDSQGAFVEGRHILDAVLIANEVVDEKRRSGEE 329

Query: 301 GVLLKLDLEKAYDKVDWSFLDMAMKLKGFGKRCRKWIWGCLSTTNFSIIVNGRPRGKIIA 360
           G++ K+D EKAYD VDW FLD  ++ KGF ++ R WI GCLS+++F+I+VNG  +G + A
Sbjct: 330 GIVFKIDFEKAYDHVDWGFLDHVLQRKGFSQKWRLWIRGCLSSSSFAILVNGNAKGWVKA 389

Query: 361 KRGIRQGDPLAPFLFTIVGDAPSCLIHYCNEKRSLKGFHFENLSEDLTHLQYADDTLLSS 420
            RG+RQGDPL+PFLFT+V D  S ++    E    +GF        ++ LQ+ADDT+  S
Sbjct: 390 SRGLRQGDPLSPFLFTLVADVLSRMLFRAEETGLTEGFSVGRDRTRVSLLQFADDTIFFS 449

BLAST of CSPI04G20730 vs. NCBI nr
Match: RVW64408.1 (LINE-1 retrotransposable element ORF2 protein [Vitis vinifera])

HSP 1 Score: 393.3 bits (1009), Expect = 2.7e-105
Identity = 190/420 (45.24%), Postives = 285/420 (67.86%), Query Frame = 0

Query: 1    MEKLKGLKAILKSWNKETFGKIFSQKQVLIDKINYLDSLEESSCLNEENVKERENCRGAL 60
            M KLK +K+ LK WN  TFG +  +K++++  ++ +D +E+   LN + V ER   R  L
Sbjct: 1010 MRKLKFVKSKLKEWNIMTFGDLKERKKLILTDLSRIDLIEQEGNLNSDLVLERTLKRREL 1069

Query: 61   LDLIVKEQKLWIQKSKLHWLREGEENSSFFHIWVSARKSKSILSSLVSIEGKTLVTEKEI 120
             D+++KE+  W QKS++ W++EG+ NS FFH   + R+S+  + SL+S  G+TL   ++I
Sbjct: 1070 EDVLLKEEVQWRQKSRVKWIKEGDCNSKFFHRVATGRRSRKFIKSLISERGETLNNIEDI 1129

Query: 121  VDEILSFFSNLYGTRISSPFICDILNWRGLSLQDSSLLEVPFTEKEIREVVFEMGCLKSP 180
             +EI++FF NLY   +   +  + ++W  +S +    L+ PFTE+E+R  VF++   K+P
Sbjct: 1130 SEEIVNFFGNLYSKPVGESWRVEGIDWVPISGESGGWLDRPFTEEEVRRAVFQLNKEKAP 1189

Query: 181  GPDGLTGEFYKKSWNILKSDLVRVFQDFFKNGIINRRCNETYIYLIPKKKEAARVSDFRP 240
            GPDG T   Y++ W+++K DL+RVF +F  NG+IN+  N T+I L+PKK ++ ++SD+RP
Sbjct: 1190 GPDGFTIAVYQECWDVIKEDLMRVFLEFHTNGVINQSTNATFIALVPKKSQSVKISDYRP 1249

Query: 241  ISLITSLYKVISKVLPTRLKKVLPSIINDSQMAFVEGRQILDAILTASEAVDEWSLRGRK 300
            ISL+TSLYK+I+KVL  RL+KVL   I+DSQ AFVEGR ILDA+L A+E VDE    G +
Sbjct: 1250 ISLVTSLYKIIAKVLSGRLRKVLHETISDSQGAFVEGRHILDAVLIANEVVDEKRRSGEE 1309

Query: 301  GVLLKLDLEKAYDKVDWSFLDMAMKLKGFGKRCRKWIWGCLSTTNFSIIVNGRPRGKIIA 360
            G++ K+D EKAYD VDW FLD  ++ KGF ++ R WI GCLS+++F+I+VNG  +G + A
Sbjct: 1310 GIVFKIDFEKAYDHVDWGFLDHVLQRKGFSQKWRLWIRGCLSSSSFAILVNGNAKGWVKA 1369

Query: 361  KRGIRQGDPLAPFLFTIVGDAPSCLIHYCNEKRSLKGFHFENLSEDLTHLQYADDTLLSS 420
             RG+RQGDPL+PFLFT+V D  S ++    E    +GF        ++ LQ+ADDT+  S
Sbjct: 1370 SRGLRQGDPLSPFLFTLVADVLSRMLFRAEETGLTEGFSVGRDRTRVSLLQFADDTIFFS 1429

BLAST of CSPI04G20730 vs. NCBI nr
Match: CAN75040.1 (hypothetical protein VITISV_026478 [Vitis vinifera])

HSP 1 Score: 393.3 bits (1009), Expect = 2.7e-105
Identity = 190/420 (45.24%), Postives = 285/420 (67.86%), Query Frame = 0

Query: 1    MEKLKGLKAILKSWNKETFGKIFSQKQVLIDKINYLDSLEESSCLNEENVKERENCRGAL 60
            M KLK +K+ LK WN  TFG +  +K++++  ++ +D +E+   LN + V ER   R  L
Sbjct: 897  MRKLKFVKSKLKEWNIMTFGDLKERKKLILTDLSRIDLIEQEGNLNSDLVLERTLKRREL 956

Query: 61   LDLIVKEQKLWIQKSKLHWLREGEENSSFFHIWVSARKSKSILSSLVSIEGKTLVTEKEI 120
             D+++KE+  W QKS++ W++EG+ NS FFH   + R+S+  + SL+S  G+TL   ++I
Sbjct: 957  EDVLLKEEVQWRQKSRVKWIKEGDCNSKFFHRVATGRRSRKFIKSLISERGETLNNIEDI 1016

Query: 121  VDEILSFFSNLYGTRISSPFICDILNWRGLSLQDSSLLEVPFTEKEIREVVFEMGCLKSP 180
             +EI++FF NLY   +   +  + ++W  +S +    L+ PFTE+E+R  VF++   K+P
Sbjct: 1017 SEEIVNFFGNLYSKPVGESWRXEGIDWVPISGESGGWLDRPFTEEEVRRAVFQLNKEKAP 1076

Query: 181  GPDGLTGEFYKKSWNILKSDLVRVFQDFFKNGIINRRCNETYIYLIPKKKEAARVSDFRP 240
            GPDG T   Y++ W+++K DL+RVF +F  NG+IN+  N T+I L+PKK ++ ++SD+RP
Sbjct: 1077 GPDGFTIAVYQECWDVIKEDLMRVFLEFHTNGVINQSTNATFIALVPKKSQSVKISDYRP 1136

Query: 241  ISLITSLYKVISKVLPTRLKKVLPSIINDSQMAFVEGRQILDAILTASEAVDEWSLRGRK 300
            ISL+TSLYK+I+KVL  RL+KVL   I+DSQ AFVEGR ILDA+L A+E VDE    G +
Sbjct: 1137 ISLVTSLYKIIAKVLSGRLRKVLHETISDSQGAFVEGRHILDAVLIANEVVDEKRRSGEE 1196

Query: 301  GVLLKLDLEKAYDKVDWSFLDMAMKLKGFGKRCRKWIWGCLSTTNFSIIVNGRPRGKIIA 360
            G++ K+D EKAYD VDW FLD  ++ KGF ++ R WI GCLS+++F+I+VNG  +G + A
Sbjct: 1197 GIVFKIDFEKAYDHVDWGFLDHVLQRKGFSQKWRLWIRGCLSSSSFAILVNGNAKGWVKA 1256

Query: 361  KRGIRQGDPLAPFLFTIVGDAPSCLIHYCNEKRSLKGFHFENLSEDLTHLQYADDTLLSS 420
             RG+RQGDPL+PFLFT+V D  S ++    E    +GF        ++ LQ+ADDT+  S
Sbjct: 1257 SRGLRQGDPLSPFLFTLVADVLSRMLFRAEETGLTEGFSVGRDRTRVSLLQFADDTIFFS 1316

BLAST of CSPI04G20730 vs. NCBI nr
Match: PRQ36601.1 (putative RNA-directed DNA polymerase [Rosa chinensis])

HSP 1 Score: 392.9 bits (1008), Expect = 3.5e-105
Identity = 192/417 (46.04%), Postives = 280/417 (67.15%), Query Frame = 0

Query: 1   MEKLKGLKAILKSWNKETFGKIFSQKQVLIDKINYLDSLEESSCLNEENVKERENCRGAL 60
           M KLK +K  LK W++ETFG I  +K+V+  +IN LD  E S+ +     +ERE  RG L
Sbjct: 30  MRKLKNVKGKLKIWSRETFGDIGKEKKVVEARINELDEEERSAGICVVKKREREVLRGRL 89

Query: 61  LDLIVKEQKLWIQKSKLHWLREGEENSSFFHIWVSARKSKSILSSLVSIEGKTLVTEKEI 120
            +L ++E+  W Q++KL W +EG+ N+ FFH  V+ R+ ++++  L    G  +  E  I
Sbjct: 90  EELALREEIFWRQRAKLKWAKEGDNNTRFFHKLVNGRRKRNVIEKLELSNGIVVEEEDLI 149

Query: 121 VDEILSFFSNLYGTRISSPFICDILNWRGLSLQDSSLLEVPFTEKEIREVVFEMGCLKSP 180
            +EI+ F+ NLY ++    F  + L+W  +S++ +  LE PF E+EI+  VFE   +KSP
Sbjct: 150 EEEIIRFYKNLYSSKEEVSFGIEGLHWNPISVEKARWLETPFEEEEIKRAVFECDTVKSP 209

Query: 181 GPDGLTGEFYKKSWNILKSDLVRVFQDFFKNGIINRRCNETYIYLIPKKKEAARVSDFRP 240
           GPDG +    +++W ++K +++ V  +F  NG++N+  NETYI LIPKK  + +V D+RP
Sbjct: 210 GPDGFSFAVLQRNWEVVKREVLDVMAEFHTNGVVNKVTNETYICLIPKKANSLKVGDYRP 269

Query: 241 ISLITSLYKVISKVLPTRLKKVLPSIINDSQMAFVEGRQILDAILTASEAVDEWSLRGRK 300
           ISLITSLYK+I+KVL  RL++VL   I+ +Q AF++GRQILDA+L A+E VDE   + ++
Sbjct: 270 ISLITSLYKIIAKVLAWRLREVLSDTISGAQGAFIKGRQILDAVLVANEVVDETRKKKKE 329

Query: 301 GVLLKLDLEKAYDKVDWSFLDMAMKLKGFGKRCRKWIWGCLSTTNFSIIVNGRPRGKIIA 360
           G++ K+D EKAYD V+W+FLD AM+ KGFG R RKWI GCL + NFSI +NGRPRGK  A
Sbjct: 330 GLVFKIDFEKAYDHVEWNFLDYAMESKGFGDRWRKWIGGCLRSANFSIQINGRPRGKFGA 389

Query: 361 KRGIRQGDPLAPFLFTIVGDAPSCLIHYCNEKRSLKGFHFENLSEDLTHLQYADDTL 418
            RG+RQGDPL+ FLFT+V D    L+    + R ++G        ++THLQ+ADDT+
Sbjct: 390 SRGLRQGDPLSSFLFTLVVDVLDRLMERARDCRLIEGLVVGREEVEITHLQFADDTI 446

BLAST of CSPI04G20730 vs. NCBI nr
Match: BBN69746.1 (VIRB2-interacting protein 2 [Prunus dulcis])

HSP 1 Score: 392.5 bits (1007), Expect = 4.6e-105
Identity = 193/415 (46.51%), Postives = 275/415 (66.27%), Query Frame = 0

Query: 3   KLKGLKAILKSWNKETFGKIFSQKQVLIDKINYLDSLEESSCLNEENVKERENCRGALLD 62
           +L+ +K  +K WNKE FG + S K+    +I  LD +E    L+    KERE+    + D
Sbjct: 32  RLRTIKQKIKVWNKEVFGDLVSAKKEAEARIAALDLMEGQGGLDNTLRKEREDLYFMVSD 91

Query: 63  LIVKEQKLWIQKSKLHWLREGEENSSFFHIWVSARKSKSILSSLVSIEGKTLVTEKEIVD 122
           L+ KE+  W Q+ K+ W R+G+ N+ FFH   S R+ ++ +  L    G  +V+E EI  
Sbjct: 92  LVHKEEVKWRQRGKIQWARDGDSNTKFFHRIASGRRKRNFIQKLEVAGGGVVVSEGEIEL 151

Query: 123 EILSFFSNLYGTRISSPFICDILNWRGLSLQDSSLLEVPFTEKEIREVVFEMGCLKSPGP 182
           EI++FF NLY + + + +  + LNW  +S++++  L+ PF E+E++  VF+ G  KSPGP
Sbjct: 152 EIINFFKNLYSSNVEAGWCLEGLNWNAISVEEAEWLDRPFEEEEVKRAVFDCGIDKSPGP 211

Query: 183 DGLTGEFYKKSWNILKSDLVRVFQDFFKNGIINRRCNETYIYLIPKKKEAARVSDFRPIS 242
           DG +   ++  W+I+K DL++V  DFF  GIIN   NET+I LIPKKKE+ +VSDFRPIS
Sbjct: 212 DGFSMLLFQSCWDIVKEDLMKVMADFFNCGIINAITNETFICLIPKKKESVKVSDFRPIS 271

Query: 243 LITSLYKVISKVLPTRLKKVLPSIINDSQMAFVEGRQILDAILTASEAVDEWSLRGRKGV 302
           L+TSLYK++SKVL +RL++VL S I+  Q AFV+GRQILDA L A+E V+E     + G+
Sbjct: 272 LVTSLYKMVSKVLASRLREVLGSTISSYQSAFVQGRQILDAALIANEVVEESRRLNKSGM 331

Query: 303 LLKLDLEKAYDKVDWSFLDMAMKLKGFGKRCRKWIWGCLSTTNFSIIVNGRPRGKIIAKR 362
           + K+DLEKAYD V+W F+D  +  KGFG R R WI GCL T NFS+++NGRPRGK  A R
Sbjct: 332 VFKIDLEKAYDHVEWRFVDEVLIRKGFGDRWRSWIRGCLETANFSVMINGRPRGKFRASR 391

Query: 363 GIRQGDPLAPFLFTIVGDAPSCLIHYCNEKRSLKGFHFENLSEDLTHLQYADDTL 418
           G+RQGDPL+PFLFT+V D  S ++    +     G    N   +++HLQ+ADDT+
Sbjct: 392 GLRQGDPLSPFLFTLVMDVLSRIMEKAQDADEFHGLSPGNGMVEISHLQFADDTI 446

BLAST of CSPI04G20730 vs. TAIR 10
Match: AT1G43760.1 (DNAse I-like superfamily protein )

HSP 1 Score: 114.4 bits (285), Expect = 2.2e-25
Identity = 78/262 (29.77%), Postives = 135/262 (51.53%), Query Frame = 0

Query: 2   EKLKGLKAILKSWNKETFGKIFSQKQVLIDKINYLDSL-----EESSCLNEENVKERENC 61
           E LK  K   K  N++ FG I  + +  +D +  + S       +S    E   +++ N 
Sbjct: 367 EHLKAAKKCCKLLNRQGFGNIQHKTKEALDSLESIQSQLLTNPSDSLFRVEHVARKKWNF 426

Query: 62  RGALLDLIVKEQKLWIQKSKLHWLREGEENSSFFHIWVSARKSKSILSSLVSIEGKTLVT 121
             A L      +  + QKS++ WL++G+ N+ FFH  + A ++K+++  L   +   +  
Sbjct: 427 FAAAL------ESFYRQKSRIKWLQDGDANTRFFHKVILANQAKNLIKFLRMDDDVRVEN 486

Query: 122 EKEIVDEILSFFSNLYG------TRISSPFICDILNWRGLSLQDSSLLEVPFTEKEIREV 181
             ++ + I++++++L G      T  S   I DI  +R      S L  +P ++KEI   
Sbjct: 487 VTQVKEMIVAYYTHLLGSDSDILTPDSVQRIKDIHPFRCNDTLASRLSALP-SDKEITAA 546

Query: 182 VFEMGCLKSPGPDGLTGEFYKKSWNILKSDLVRVFQDFFKNGIINRRCNETYIYLIPKKK 241
           VF M   K+PGPD  T EF+ +SW ++K   +   ++FF+ G + +R N T I LIPK  
Sbjct: 547 VFAMPRNKAPGPDSFTAEFFWESWFVVKDSTIAAVKEFFRTGHLLKRFNATAITLIPKVT 606

Query: 242 EAARVSDFRPISLITSLYKVIS 253
              ++S FRP+S  T +YK+I+
Sbjct: 607 GVDQLSMFRPVSCCTVVYKIIT 621

BLAST of CSPI04G20730 vs. TAIR 10
Match: ATMG01250.1 (RNA-directed DNA polymerase (reverse transcriptase) )

HSP 1 Score: 65.9 bits (159), Expect = 9.2e-11
Identity = 29/68 (42.65%), Postives = 40/68 (58.82%), Query Frame = 0

Query: 349 IVNGRPRGKIIAKRGIRQGDPLAPFLFTIVGDAPSCLIHYCNEKRSLKGFHFENLSEDLT 408
           I+NG P+G +   RG+RQGDPL+P+LF +  +  S L     E+  L G    N S  + 
Sbjct: 13  IINGAPQGLVTPSRGLRQGDPLSPYLFILCTEVLSGLCRRAQEQGRLPGIRVSNNSPRIN 72

Query: 409 HLQYADDT 417
           HL +ADDT
Sbjct: 73  HLLFADDT 80

BLAST of CSPI04G20730 vs. TAIR 10
Match: AT4G20520.1 (RNA binding;RNA-directed DNA polymerases )

HSP 1 Score: 59.3 bits (142), Expect = 8.6e-09
Identity = 33/76 (43.42%), Postives = 48/76 (63.16%), Query Frame = 0

Query: 258 RLKKVLPSIINDSQMAFVEGRQILDAILTASEAVDEWSLRGRKGV----LLKLDLEKAYD 317
           RLK ++ ++I  +Q +F+ GR   D I+   EAV   S+R +KGV    LLKLDLEKAYD
Sbjct: 4   RLKPLMTNLIGPAQASFIPGRVSTDNIVFVQEAVH--SMRRKKGVKGWMLLKLDLEKAYD 63

Query: 318 KVDWSFLDMAMKLKGF 330
           ++ W +L+  +   GF
Sbjct: 64  RIRWDYLEDTLISAGF 77

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
P143812.3e-3827.27Transposon TX1 uncharacterized 149 kDa protein OS=Xenopus laevis OX=8355 PE=4 SV... [more]
P085487.0e-3226.78LINE-1 reverse transcriptase homolog OS=Nycticebus coucang OX=9470 PE=4 SV=1[more]
O003701.7e-3026.59LINE-1 retrotransposable element ORF2 protein OS=Homo sapiens OX=9606 PE=1 SV=1[more]
P113692.5e-2928.13LINE-1 retrotransposable element ORF2 protein OS=Mus musculus OX=10090 GN=Pol PE... [more]
Q032742.1e-1529.27Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 (Fra... [more]
Match NameE-valueIdentityDescription
A0A438IK877.6e-10645.48Transposon TX1 uncharacterized 149 kDa protein OS=Vitis vinifera OX=29760 GN=YTX... [more]
A0A438FWU51.3e-10545.24LINE-1 retrotransposable element ORF2 protein OS=Vitis vinifera OX=29760 GN=LORF... [more]
A5BV951.3e-10545.24Reverse transcriptase domain-containing protein OS=Vitis vinifera OX=29760 GN=VI... [more]
A0A2P6QQZ31.7e-10546.04Putative RNA-directed DNA polymerase OS=Rosa chinensis OX=74649 GN=RchiOBHm_Chr4... [more]
A0A5E4GN722.2e-10546.51PREDICTED: RNA-directed DNA polymerase (Fragment) OS=Prunus dulcis OX=3755 GN=AL... [more]
Match NameE-valueIdentityDescription
RVW97045.11.6e-10545.48Transposon TX1 uncharacterized 149 kDa protein [Vitis vinifera][more]
RVW64408.12.7e-10545.24LINE-1 retrotransposable element ORF2 protein [Vitis vinifera][more]
CAN75040.12.7e-10545.24hypothetical protein VITISV_026478 [Vitis vinifera][more]
PRQ36601.13.5e-10546.04putative RNA-directed DNA polymerase [Rosa chinensis][more]
BBN69746.14.6e-10546.51VIRB2-interacting protein 2 [Prunus dulcis][more]
Match NameE-valueIdentityDescription
AT1G43760.12.2e-2529.77DNAse I-like superfamily protein [more]
ATMG01250.19.2e-1142.65RNA-directed DNA polymerase (reverse transcriptase) [more]
AT4G20520.18.6e-0943.42RNA binding;RNA-directed DNA polymerases [more]
InterPro
Analysis Name: InterPro Annotations of Cucumber (PI 183967) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR000477Reverse transcriptase domainPFAMPF00078RVT_1coord: 226..420
e-value: 1.5E-37
score: 129.3
IPR000477Reverse transcriptase domainPROSITEPS50878RT_POLcoord: 207..421
score: 15.804279
NoneNo IPR availablePANTHERPTHR33116REVERSE TRANSCRIPTASE ZINC-BINDING DOMAIN-CONTAINING PROTEIN-RELATED-RELATEDcoord: 8..396
NoneNo IPR availablePANTHERPTHR33116:SF38OS01G0158850 PROTEINcoord: 8..396
NoneNo IPR availableCDDcd01650RT_nLTR_likecoord: 220..418
e-value: 1.62326E-42
score: 147.053
IPR043502DNA/RNA polymerase superfamilySUPERFAMILY56672DNA/RNA polymerasescoord: 178..420

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CSPI04G20730.1CSPI04G20730.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0090304 nucleic acid metabolic process
biological_process GO:0048583 regulation of response to stimulus
molecular_function GO:0016740 transferase activity