CSPI05G18960 (gene) Cucumber (PI 183967) v1

Overview
Name: CSPI05G18960
Type: gene
Organism: Cucumis sativus var. hardwickii cv. PI 183967 (Cucumber (PI 183967) v1)
Description: Ty3/gypsy retrotransposon protein
Location: Chr5: 20133334 .. 20138295 (+)
RNA-Seq Expression: CSPI05G18960
Synteny: CSPI05G18960
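The location field above can be turned into a chromosome, coordinates, strand, and span length. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual GFF-style convention, which this record does not state explicitly). The function names `parse_location` and `span_length` are hypothetical helpers introduced for illustration.

```python
import re

def parse_location(text):
    """Parse a string such as 'Chr5: 20133334 .. 20138295 (+)'.

    Returns (chromosome, start, end, strand).
    """
    pattern = r"\s*([^:\s]+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

def span_length(start, end):
    # Assumption: both coordinates are 1-based and inclusive.
    return end - start + 1

chrom, start, end, strand = parse_location("Chr5: 20133334 .. 20138295 (+)")
print(chrom, strand, span_length(start, end))
```

Under the inclusive-coordinate assumption the gene spans 4962 bp on the plus strand of Chr5.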
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGTTATACTCTCTAGGTGTAACCGAAGTGGACTGGAAGAATTTGATTCTGACTTTTACAGACCAACAAAGAAGATTGTTATAAGGGGGGATCCAAGCCTGACAAAAGCAAGGGTCAGTTTAAAAAATCTCATGAAATCTTGGGGAGAAGAGGATCAGGGATTTCTAGTGGAATGTCGTATGTTGGAAAGAAGAGAGTCATCGGAGGAGGAAGATTCGATTGAGGAAGTGTTGACTATAGAAGAATCAATGGCAGTTGTGTTGGAAAGATTTGAGGATGCGTTCGAATGGCCCGAAACATTACCTCCACGTAGATTGATAGAACATCATATCCATCTAAAGAAAGGAACCGACCCAGTAAATGTTCGTCCTTACCGCTATGCATATCAACAAAAGACAGAAATGGAGAGATTAGTGGAAGAGATGCTTCAGGGGTAATAAGGCCGAGTACGAGCCCATATTCCAGTCCAGTGTTGCTGGTGAGAAAGAAGGATGGGAGCTGGCACTTTTGTGTTGATTACAGAGCTCTAAATAATGTAACAGTACCAGACAAGTTCCCAATACCAGTGATTGAGGAGTTGTTTGATGAGTTACATGGAGCTACTATGTTTACTAAGATAGATCTTAAATCAAGGTATCATCAAATTAGAATGTGTGCATGGAGCTACTATGTTTACTAAGATAGATCTTAAATCAGTGTATCATCGAATTAGAATGTATGCAGATGACATTGAAAAGACACCATTCAGAACCCATGAGGGTCACTATGAGTTTATGGTGATGCCATTTGGATTGACAAATGCACCGTCTACTTTTCAATCGTTGATGTATATTCAGGCCATATCTCCGGAAATTTGTCTTAGTGTTTTTTGATGATATCTTGATCTATAGCAAAGGGTTGGAGGATCATTTAAATCATATGAGAGCTTTGTTAGAAGTGTTGAGGAAGAATGAATTATATGCAAATAAGAAGAAATGCAGCTTTGCTCAATTTCGGGTGGATTACTTGGGGCATATTATTTCAAGAGAGGGAGTGGAAGTGGATCCTGAGAAAATCAGAGCTATAAAGGAATGGCCAATTCCAGCTAATGTGAGGGAGGTTCGAGGATTTCTTGGGTTGACCGGATATTATCGTAAATTTGTTCAAAATTATGGGACAATTGCAGCTTCTCTATCACAGCTGTTGAAGATAGGGGGGTTTAAATGGACAGAGGAAGCTCAAGTGGCTTTTAATAGGCTACAACAAGCGATGATGTCTCTTCCTGTATTAGCTCTACCAGATTTCAGTGTGCCATTTGAAATCCAAACTGATGCCTCAAGGTATGGATTAGGAGCTGTTTTGGTGCAGAATCGGCGGCCAATTGCTTATTATAGCCATACATTGGCAATGAGAGATAGGGCTAAACCTGTATATGAAAGGGAGCTGATGGCAGTAGTTATGGCTGTACAAAGGTGGCGTCCATACCTATTGGGGAAGAAGTTCCTAGTCAAAACTGATCAACGGTCTTTACAGTTTTTATTGGAGCAAAGGGTGATACAGCCTCAGTATAAAAAATGGATATCCAAACTTCTCGGATGGTTTATAAGCCGGGGTTGGAAAATGAAGCAGCAGATGCATTATCTAGAGTACCAGCAACAGTGCATTTGAATCAGCTAACAGCTCCAAATGTGATTGATATAGAAGTGATTAGGGCAGAGGTGGATAAAGATGAGAAGTCGAAGATCATTAAGCAGAAAATAGCTGAGGCAGCAGAGGCAGAGAATAGTAAGTATTCAGTGAAACAAGGCATGTTGGTGTATAAGGATCAGATGGTTTAATCTAAAACATCCAAATTGATTCCAACAATTTTGCATACTTACCATGATTCAGTATTTGGTGGTCATTCGGGGTTCTTGTGAACCTATAAGAGGTTGGCTGAAGAGTTATACTGGGAAGGAATGAAGCAGGAAGTGAAGAAATATTGTGAGGAGTGTATGATCTGTCAACGCAATAAGACACT
GGCTCTTTCACCAGCTGGCTTATTAACTCCTCTAGAAGTGCCAAATAGAGTGTGGGAGGATATATCAATGGATTTCATGGAAGGATTACCTAAGGCAGGAGGTTATGAAGTGATATTTGTAGTGGTTGACCGATTTAGTAAATATGGACATTTCATTCCTGTGAAACATCCATATACAGCCAAAGTCGTGTCTGAAGTATTTGTTAAGGAAATAGTACGGTTACATGGATATCCCAAATCCATAGTATCAGATAGAGATAAGGTGTTATTAAGTCACTTTTGGAGAGAACTATTTCATCTGGCAGGAACAAAACTGAACCATAGCACAACGTATCATTCAAGGGAAGGAAGTTATTGTGCTCGTTGATTGTGGGGCAACGCACAACTTTATCTCAGACAAACTAGTGGCGACATTGCAGCTACCCACTAAAGATACTTCCAATTATGGAGTAATTCTGGGATCAGGAACTGCGATTAAAGGCAAGGGAGTTTGTGAACAAGTGAAGCTTAACCTCAACGGATGGATGATCACAACAGACTTTCTACCTCTGGAATTGGGGGGAGTGGACGTGATACTTGGGATGCAGTGGCTTTACTCATTGGGTGTAACGGAAGTTGATTGGAAGAAGTTAGTCATGACGTTCTCTCATAACAACAAGAGAGTAATAATTAAAGGGGATCCCAGCCTAACCAAGACTCAAGTCAGCCTGAAGAATTTAACTAAATCATGGAACTGACTTGGACTTGGGATATCTAGTGGAGTGTAGGGCGTTGGAGATCCGAATCACAGAGATAGGACCGCAAATAGGGGAGGGACCCATGGTTGTACCAGAAAGAGTTCAAGGAGTATTGAGGCAGTATGAAGATGTGTTTGATTGGCCTGAAGAATTACCATCGGAAAGGACTAGTGAACATCATATACACATCAAAGGTGGCACAGAACCAATCAATGTCAGACCCTATAGATATGCATTTCAGCAAAAGGAGGAGATGGAGAGGCTAGTGGATGAAATGCTGTCATCGACGATCATCCGTCCTAGTACCGGCCCATACTCGAGTCCAGTCCTTTTGGTGAAAAAAGAGATGGGAGTTGGCGTTTTTGCATGGATTACCAAGCACTCAATAACGTGACCGTCCCAGATAAGTTTCCTATTCCTGTAGTTGAGGAACTCTTTGATGAATTTAATGGGGCCAGCTTGTTTTCTAAAATTGATCTCAAATCCGGTTATCACCAAATTAGAATGAGTAGTCAGGACATTGAGAAAACGACTTTTAGGACTCATGAAGGGCATTATGAATTCTTAGTAGTGCCATTGGACTCACAAATGCCCCGGCAACGTTTCAGTCTTTGATGAACAATATATTCAGAGCATACTTGAGAAAGTTTGTCTTAGTATTTTTTGATGATATCCTGATATATAGTAGGGGATTGAAAGAGCATTGTCAACATATAGAGCTGGCATTAGAAGTTTTGAGAAGTCATAGACTGTTTGCCAACAAGAAAAATTGTAGCTTTGCCTATCCTAAGCTGGAATATTTGGGGCACATTTTGTCTGGAAGGGGAGTGGAGGTGGATCTTGAAAAGATCAGATCAATTAAACAATGGCCGATTCCCACAAATGTTCGAGAAGTGAGAGGATTCCTGGGGTTGACCGGGTACTATCGTAGATTTGTACAACATTAGGGTCAATTGCTGCTCCTTTAACTTAGTTACTGAAGTTGGGGTCATTTAAGTGGAATCCGGAAGCACAAGAAGCGTTTGAAAAGTTGCAACAAGCAATGATGACCCTTCCTATTTTGGCACTACCAGATATCAACGCACCATTTGAGGTAGAGACTGATGCATCCGAATACAGGGTAGGAGCTGTCCTAGTGCAAAACAAGAGGCCGATTGCATTCTACAACCACACATTAGCTATGAGAGGTTGTGCTAGACCCGTCTATGAGAGGGAATTAATGGCGGTTGTGTTAGCAGTACAGCATTAGAGGCCATATTTATT
AGGCGGAAAGTTCATAGTTAAAACGGATCAATGATCGCTTAAGTTCTTACTAGAACAGAGGGTAATACAACCACAATACCAAAAATGGATTGCAAAGTTGCTGGGCTATTCATTCGAGGTGGTCTATAAACCTGGCTTAGAAAACAAGGCTGCTGATGCCTTATCCCGAGTACCCCTTGTGGCAGAAATTAACCAACTAACGGTCCATACGTTGATTGATCTGAAGGTTATAAAGGAGGAGGTAATGAAAGACGAATTCTTGAAAGAGATCTGCAGATTGCAGGGAGGAGAGGAGGTAAAGAATTACTCATGGTACCATGAGATTCTCTGATACAAAGGCAGGTTGGTTACTGCGAAAGGCTCTGCCTTGATCTCGACCATTATGCACACATACCATGATTCTGTTCTTAGGGGACACTCCGGGTTCTTAAGAACGTATAAGAGGCTAACAGAAGAACTCTTTTGGGTCGGCATGAAATCGGAAGTATAGAAGTATTGTGAAGAGTGCAATATCTGTCAGCGAAATAAAATGTTAGCACTAACACCAGCAGGGTTGTTGCATCCGCTGGAGATACCACAGAGGATGTGGGAAGACATCTCTATGGACTTTATTGAGGGATTACCAAAATCTTTTGGGTATGAGGTAATTTTTGTGGTGGTGGATCGGTTAAGTAATTACGACCACTTCTTATGTCTAAAGCACCCCTTTGATGCCAAGACCGTAGCTGAATTATTCGTTAAGGAAATAGTAATATTGCACGGCTTTCCAAAATCGATCGTGTCTGATTGTGACAAGATTTTCTTGAGTAACTTTTGGAAAGAACTGTTTCGCTTGGCAGGCACAAAATTGGACAGAAGCACGACTTATCACCCCCAAACGGATGGGTAGACTGAGGTGGCTAACAGATCAGTGGAAATTTACCTACGCTGTTTCTGTGGTGAAAGACCAAAAGAGTGGATGA

mRNA sequence

ATGGTTATACTCTCTAGGTGTAACCGAAGTGGACTGGAAGAATTTGATTCTGACTTTTACAGACCAACAAAGAAGATTGTTATAAGGGGGGATCCAAGCCTGACAAAAGCAAGGGTCAGTTTAAAAAATCTCATGAAATCTTGGGGAGAAGAGGATCAGGGATTTCTAGTGGAATGTCGTATGTTGGAAAGAAGAGAGTCATCGGAGGAGGAAGATTCGATTGAGGAAGTGTTGACTATAGAAGAATCAATGGCAGTTGTGTTGGAAAGATTTGAGGATGCGTTCGAATGGCCCGAAACATTACCTCCACGTAGATTGATAGAACATCATATCCATCTAAAGAAAGGAACCGACCCAGTAAATGTTCGTCCTTACCGCTATGCATATCAACAAAAGACAGAAATGGAGAGATTAGTGGAAGAGATGCTTCAGGGAGCTCTAAATAATGTAACAGTACCAGACAAGTTCCCAATACCAGTGATTGAGGAGTTGTTTGATGAGTTACATGGAGCTACTATGTTTACTAAGATAGATCTTAAATCAAGGCCATATCTCCGGAAATTTGTCTTAGTGTTTTTTGATGATATCTTGATCTATAGCAAAGGGTTGGAGGATCATTTAAATCATATGAGAGCTTTGTTAGAAGTGTTGAGGAAGAATGAATTATATGCAAATAAGAAGAAATGCAGCTTTGCTCAATTTCGGGTGGATTACTTGGGGCATATTATTTCAAGAGAGGGAGTGGAAGTGGATCCTGAGAAAATCAGAGCTATAAAGGAATGGCCAATTCCAGCTAATGTGAGGGAGGTTCGAGGATTTCTTGGGTTGACCGGATATTATCGTAAATTTGTTCAAAATTATGGGACAATTGCAGCTTCTCTATCACAGCTGTTGAAGATAGGGGGGTTTAAATGGACAGAGGAAGCTCAAGTGGCTTTTAATAGGCTACAACAAGCGATGATGTCTCTTCCTGTATTAGCTCTACCAGATTTCAGTGTGCCATTTGAAATCCAAACTGATGCCTCAAGGTATGGATTAGGAGCTGTTTTGGTGCAGAATCGGCGGCCAATTGCTTATTATAGCCATACATTGGCAATGAGAGATAGGGCTAAACCTGTATATGAAAGGGAGCTGATGGCAGTAGTTATGGCTGTACAAAGGTGGCGTCCATACCTATTGGGGAAGAAGTTCCTAGTCAAAACTGATCAACGGTCTTTACAGTTTTTATTGGAGCAAAGGAGGGTAATACAACCACAATACCAAAAATGGATTGCAAAGTTGCTGGGCTATTCATTCGAGGTGGTCTATAAACCTGGCTTAGAAAACAAGGCTGCTGATGCCTTATCCCGAGTACCCCTTGTGGCAGAAATTAACCAACTAACGGTCCATACGTTGATTGATCTGAAGGTTATAAAGGAGGAGGTAATGAAAGACGAATTCTTGAAAGAGATCTGCAGATTGCAGGGAGGAGAGGAGGTAAAGAATTACTCATGGGGACACTCCGGGTTCTTAAGAACGTATAAGAGGCTAACAGAAGAACTCTTTTGGGTCGGCATGAAATCGGAAATACCACAGAGGATGTGGGAAGACATCTCTATGGACTTTATTGAGGGATTACCAAAATCTTTTGGGTATGAGGTAATTTTTGTGGTGGTGGATCGGTTAAGTAATTACGACCACTTCTTATGTCTAAAGCACCCCTTTGATGCCAAGACCGTAGCTGAATTATTCGTTAAGGAAATAGCACAAAATTGGACAGAAGCACGACTTATCACCCCCAAACGGATGGGTAGACTGAGGTGGCTAACAGATCAGTGGAAATTTACCTACGCTGTTTCTGTGGTGAAAGACCAAAAGAGTGGATGA

Coding sequence (CDS)

ATGGTTATACTCTCTAGGTGTAACCGAAGTGGACTGGAAGAATTTGATTCTGACTTTTACAGACCAACAAAGAAGATTGTTATAAGGGGGGATCCAAGCCTGACAAAAGCAAGGGTCAGTTTAAAAAATCTCATGAAATCTTGGGGAGAAGAGGATCAGGGATTTCTAGTGGAATGTCGTATGTTGGAAAGAAGAGAGTCATCGGAGGAGGAAGATTCGATTGAGGAAGTGTTGACTATAGAAGAATCAATGGCAGTTGTGTTGGAAAGATTTGAGGATGCGTTCGAATGGCCCGAAACATTACCTCCACGTAGATTGATAGAACATCATATCCATCTAAAGAAAGGAACCGACCCAGTAAATGTTCGTCCTTACCGCTATGCATATCAACAAAAGACAGAAATGGAGAGATTAGTGGAAGAGATGCTTCAGGGAGCTCTAAATAATGTAACAGTACCAGACAAGTTCCCAATACCAGTGATTGAGGAGTTGTTTGATGAGTTACATGGAGCTACTATGTTTACTAAGATAGATCTTAAATCAAGGCCATATCTCCGGAAATTTGTCTTAGTGTTTTTTGATGATATCTTGATCTATAGCAAAGGGTTGGAGGATCATTTAAATCATATGAGAGCTTTGTTAGAAGTGTTGAGGAAGAATGAATTATATGCAAATAAGAAGAAATGCAGCTTTGCTCAATTTCGGGTGGATTACTTGGGGCATATTATTTCAAGAGAGGGAGTGGAAGTGGATCCTGAGAAAATCAGAGCTATAAAGGAATGGCCAATTCCAGCTAATGTGAGGGAGGTTCGAGGATTTCTTGGGTTGACCGGATATTATCGTAAATTTGTTCAAAATTATGGGACAATTGCAGCTTCTCTATCACAGCTGTTGAAGATAGGGGGGTTTAAATGGACAGAGGAAGCTCAAGTGGCTTTTAATAGGCTACAACAAGCGATGATGTCTCTTCCTGTATTAGCTCTACCAGATTTCAGTGTGCCATTTGAAATCCAAACTGATGCCTCAAGGTATGGATTAGGAGCTGTTTTGGTGCAGAATCGGCGGCCAATTGCTTATTATAGCCATACATTGGCAATGAGAGATAGGGCTAAACCTGTATATGAAAGGGAGCTGATGGCAGTAGTTATGGCTGTACAAAGGTGGCGTCCATACCTATTGGGGAAGAAGTTCCTAGTCAAAACTGATCAACGGTCTTTACAGTTTTTATTGGAGCAAAGGAGGGTAATACAACCACAATACCAAAAATGGATTGCAAAGTTGCTGGGCTATTCATTCGAGGTGGTCTATAAACCTGGCTTAGAAAACAAGGCTGCTGATGCCTTATCCCGAGTACCCCTTGTGGCAGAAATTAACCAACTAACGGTCCATACGTTGATTGATCTGAAGGTTATAAAGGAGGAGGTAATGAAAGACGAATTCTTGAAAGAGATCTGCAGATTGCAGGGAGGAGAGGAGGTAAAGAATTACTCATGGGGACACTCCGGGTTCTTAAGAACGTATAAGAGGCTAACAGAAGAACTCTTTTGGGTCGGCATGAAATCGGAAATACCACAGAGGATGTGGGAAGACATCTCTATGGACTTTATTGAGGGATTACCAAAATCTTTTGGGTATGAGGTAATTTTTGTGGTGGTGGATCGGTTAAGTAATTACGACCACTTCTTATGTCTAAAGCACCCCTTTGATGCCAAGACCGTAGCTGAATTATTCGTTAAGGAAATAGCACAAAATTGGACAGAAGCACGACTTATCACCCCCAAACGGATGGGTAGACTGAGGTGGCTAACAGATCAGTGGAAATTTACCTACGCTGTTTCTGTGGTGAAAGACCAAAAGAGTGGATGA

Protein sequence

MVILSRCNRSGLEEFDSDFYRPTKKIVIRGDPSLTKARVSLKNLMKSWGEEDQGFLVECRMLERRESSEEEDSIEEVLTIEESMAVVLERFEDAFEWPETLPPRRLIEHHIHLKKGTDPVNVRPYRYAYQQKTEMERLVEEMLQGALNNVTVPDKFPIPVIEELFDELHGATMFTKIDLKSRPYLRKFVLVFFDDILIYSKGLEDHLNHMRALLEVLRKNELYANKKKCSFAQFRVDYLGHIISREGVEVDPEKIRAIKEWPIPANVREVRGFLGLTGYYRKFVQNYGTIAASLSQLLKIGGFKWTEEAQVAFNRLQQAMMSLPVLALPDFSVPFEIQTDASRYGLGAVLVQNRRPIAYYSHTLAMRDRAKPVYERELMAVVMAVQRWRPYLLGKKFLVKTDQRSLQFLLEQRRVIQPQYQKWIAKLLGYSFEVVYKPGLENKAADALSRVPLVAEINQLTVHTLIDLKVIKEEVMKDEFLKEICRLQGGEEVKNYSWGHSGFLRTYKRLTEELFWVGMKSEIPQRMWEDISMDFIEGLPKSFGYEVIFVVVDRLSNYDHFLCLKHPFDAKTVAELFVKEIAQNWTEARLITPKRMGRLRWLTDQWKFTYAVSVVKDQKSG*
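The relationship between the coding sequence and the protein sequence above can be checked by translating codons with the standard genetic code. This is a minimal sketch, assuming the standard nuclear code applies; `translate` and `CODON_TABLE` are names introduced here for illustration and are not part of the database.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence in frame 0; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))

# First 30 nucleotides of the CDS listed above.
cds_prefix = "ATGGTTATACTCTCTAGGTGTAACCGAAGT"
print(translate(cds_prefix))  # MVILSRCNRS
```

The output matches the first ten residues of the protein sequence, and the final codons of the CDS (`AGTGGATGA`) translate to `SG*`, matching its end.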
Homology
BLAST of CSPI05G18960 vs. ExPASy Swiss-Prot
Match: P20825 (Retrovirus-related Pol polyprotein from transposon 297 OS=Drosophila melanogaster OX=7227 GN=pol PE=4 SV=1)

HSP 1 Score: 214.5 bits (545), Expect = 3.3e-54
Identity = 123/357 (34.45%), Positives = 188/357 (52.66%), Query Frame = 0

Query: 147 LNNVTVPDKFPIPVIEELFDELHGATMFTKIDLKS------------------------- 206
           LN +T+PD++PIP ++E+  +L     FT IDL                           
Sbjct: 270 LNEITIPDRYPIPNMDEILGKLGKCQYFTTIDLAKGFHQIEMDEESISKTAFSTKSGHYE 329

Query: 207 -----------------------RPYLRKFVLVFFDDILIYSKGLEDHLNHMRALLEVLR 266
                                  RP L K  LV+ DDI+I+S  L +HLN ++ +   L 
Sbjct: 330 YLRMPFGLRNAPATFQRCMNNILRPLLNKHCLVYLDDIIIFSTSLTEHLNSIQLVFTKLA 389

Query: 267 KNELYANKKKCSFAQFRVDYLGHIISREGVEVDPEKIRAIKEWPIPANVREVRGFLGLTG 326
              L     KC F +   ++LGHI++ +G++ +P K++AI  +PIP   +E+R FLGLTG
Sbjct: 390 DANLKLQLDKCEFLKKEANFLGHIVTPDGIKPNPIKVKAIVSYPIPTKDKEIRAFLGLTG 449

Query: 327 YYRKFVQNYGTIAASLSQLLKIGGFKWTEEAQV--AFNRLQQAMMSLPVLALPDFSVPFE 386
           YYRKF+ NY  IA  ++  LK      T++ +   AF +L+  ++  P+L LPDF   F 
Sbjct: 450 YYRKFIPNYADIAKPMTSCLKKRTKIDTQKLEYIEAFEKLKALIIRDPILQLPDFEKKFV 509

Query: 387 IQTDASRYGLGAVLVQNRRPIAYYSHTLAMRDRAKPVYERELMAVVMAVQRWRPYLLGKK 446
           + TDAS   LGAVL QN  PI++ S TL   +      E+EL+A+V A + +R YLLG++
Sbjct: 510 LTTDASNLALGAVLSQNGHPISFISRTLNDHELNYSAIEKELLAIVWATKTFRHYLLGRQ 569

Query: 447 FLVKTDQRSLQFLLEQRRVIQPQYQKWIAKLLGYSFEVVYKPGLENKAADALSRVPL 454
           FL+ +D + L++ L   +    + ++W  +L  Y F++ Y  G EN  ADALSR+ +
Sbjct: 570 FLIASDHQPLRW-LHNLKEPGAKLERWRVRLSEYQFKIDYIKGKENSVADALSRIKI 625
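The percentages on each HSP summary line are the matched counts divided by the alignment length. The sketch below parses such a line and recomputes the values; `parse_hsp_stats` and `percent` are hypothetical helper names, and the regular expression deliberately tolerates a misspelled "Positives" label, which occurs in some report exports.

```python
import re

# Matches e.g. "Identity = 123/357 (34.45%)"; "Posi?tives" also accepts
# the label with its first "i" missing.
HSP_STATS = re.compile(r"(Identity|Posi?tives)\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)")

def parse_hsp_stats(line):
    """Return {'identity': (matches, length, pct), 'positives': (...)}."""
    stats = {}
    for label, num, den, pct in HSP_STATS.findall(line):
        key = "identity" if label == "Identity" else "positives"
        stats[key] = (int(num), int(den), float(pct))
    return stats

def percent(num, den):
    # Percentage rounded to two decimals, as printed in the report.
    return round(100.0 * num / den, 2)

line = "Identity = 123/357 (34.45%), Positives = 188/357 (52.66%), Query Frame = 0"
for name, (num, den, reported) in parse_hsp_stats(line).items():
    print(name, num, den, reported, percent(num, den))
```

For the first Swiss-Prot hit, the recomputed values agree with the reported 34.45% identity and 52.66% positives.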

BLAST of CSPI05G18960 vs. ExPASy Swiss-Prot
Match: Q8I7P9 (Retrovirus-related Pol polyprotein from transposon opus OS=Drosophila melanogaster OX=7227 GN=pol PE=4 SV=1)

HSP 1 Score: 208.8 bits (530), Expect = 1.8e-52
Identity = 134/445 (30.11%), Positives = 213/445 (47.87%), Query Frame = 0

Query: 118 DPVNVRPYRYAYQQKTEMERLVEEMLQGA------------------------------- 177
           DP+  + Y Y    + E+ER ++E+LQ                                 
Sbjct: 122 DPIYAKSYPYPVNMRGEVERQIDELLQDGIIRPSNSPYNSPIWIVPKKPKPNGEKQYRMV 181

Query: 178 -----LNNVTVPDKFPIPVIEELFDELHGATMFTKIDLKS-------------------- 237
                LN VT+PD +PIP I      L  A  FT +DL S                    
Sbjct: 182 VDFKRLNTVTIPDTYPIPDINATLASLGNAKYFTTLDLTSGFHQIHMKESDIPKTAFSTL 241

Query: 238 ----------------------------RPYLRKFVLVFFDDILIYSKGLEDHLNHMRAL 297
                                       R ++ K   V+ DDI+++S+  + H  ++R +
Sbjct: 242 NGKYEFLRLPFGLKNAPAIFQRMIDDILREHIGKVCYVYIDDIIVFSEDYDTHWKNLRLV 301

Query: 298 LEVLRKNELYANKKKCSFAQFRVDYLGHIISREGVEVDPEKIRAIKEWPIPANVREVRGF 357
           L  L K  L  N +K  F   +V++LG+I++ +G++ DP+K+RAI E P P +V+E++ F
Sbjct: 302 LASLSKANLQVNLEKSHFLDTQVEFLGYIVTADGIKADPKKVRAISEMPPPTSVKELKRF 361

Query: 358 LGLTGYYRKFVQNYGTIAASLSQLLK--IGGFKWTEEAQV----------AFNRLQQAMM 417
           LG+T YYRKF+Q+Y  +A  L+ L +      K ++ ++V          +FN L+  + 
Sbjct: 362 LGMTSYYRKFIQDYAKVAKPLTNLTRGLYANIKSSQSSKVPITLDETALQSFNDLKSILC 421

Query: 418 SLPVLALPDFSVPFEIQTDASRYGLGAVLVQN----RRPIAYYSHTLAMRDRAKPVYERE 462
           S  +LA P F+ PF + TDAS + +GAVL Q+     RPIAY S +L   +      E+E
Sbjct: 422 SSEILAFPCFTKPFHLTTDASNWAIGAVLSQDDQGRDRPIAYISRSLNKTEENYATIEKE 481

BLAST of CSPI05G18960 vs. ExPASy Swiss-Prot
Match: P04323 (Retrovirus-related Pol polyprotein from transposon 17.6 OS=Drosophila melanogaster OX=7227 GN=pol PE=4 SV=1)

HSP 1 Score: 205.7 bits (522), Expect = 1.5e-51
Identity = 129/403 (32.01%), Positives = 204/403 (50.62%), Query Frame = 0

Query: 147 LNNVTVPDKFPIPVIEELFDELHGATMFTKIDLKS------------------------- 206
           LN +TV D+ PIP ++E+  +L     FT IDL                           
Sbjct: 271 LNEITVGDRHPIPNMDEILGKLGRCNYFTTIDLAKGFHQIEMDPESVSKTAFSTKHGHYE 330

Query: 207 -----------------------RPYLRKFVLVFFDDILIYSKGLEDHLNHMRALLEVLR 266
                                  RP L K  LV+ DDI+++S  L++HL  +  + E L 
Sbjct: 331 YLRMPFGLKNAPATFQRCMNDILRPLLNKHCLVYLDDIIVFSTSLDEHLQSLGLVFEKLA 390

Query: 267 KNELYANKKKCSFAQFRVDYLGHIISREGVEVDPEKIRAIKEWPIPANVREVRGFLGLTG 326
           K  L     KC F +    +LGH+++ +G++ +PEKI AI+++PIP   +E++ FLGLTG
Sbjct: 391 KANLKLQLDKCEFLKQETTFLGHVLTPDGIKPNPEKIEAIQKYPIPTKPKEIKAFLGLTG 450

Query: 327 YYRKFVQNYGTIAASLSQLLK--IGGFKWTEEAQVAFNRLQQAMMSLPVLALPDFSVPFE 386
           YYRKF+ N+  IA  +++ LK  +       E   AF +L+  +   P+L +PDF+  F 
Sbjct: 451 YYRKFIPNFADIAKPMTKCLKKNMKIDTTNPEYDSAFKKLKYLISEDPILKVPDFTKKFT 510

Query: 387 IQTDASRYGLGAVLVQNRRPIAYYSHTLAMRDRAKPVYERELMAVVMAVQRWRPYLLGKK 446
           + TDAS   LGAVL Q+  P++Y S TL   +      E+EL+A+V A + +R YLLG+ 
Sbjct: 511 LTTDASDVALGAVLSQDGHPLSYISRTLNEHEINYSTIEKELLAIVWATKTFRHYLLGRH 570

Query: 447 FLVKTDQRSLQFLLEQRRVIQPQYQKWIAKLLGYSFEVVYKPGLENKAADALSRVPL-VA 495
           F + +D + L +L   +     +  +W  KL  + F++ Y  G EN  ADALSR+ L   
Sbjct: 571 FEISSDHQPLSWLYRMKDP-NSKLTRWRVKLSEFDFDIKYIKGKENCVADALSRIKLEET 630

BLAST of CSPI05G18960 vs. ExPASy Swiss-Prot
Match: Q99315 (Transposon Ty3-G Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=TY3B-G PE=1 SV=3)

HSP 1 Score: 196.4 bits (498), Expect = 9.3e-49
Identity = 173/672 (25.74%), Positives = 270/672 (40.18%), Query Frame = 0

Query: 101  LPPRRL------IEHHIHLKKGTDPVNVRPYRYAYQQKTEMERLVEEMLQG--------- 160
            LPPR        ++H I +K G     ++PY    + + E+ ++V+++L           
Sbjct: 572  LPPRPADINNIPVKHDIEIKPGARLPRLQPYHVTEKNEQEINKIVQKLLDNKFIVPSKSP 631

Query: 161  ----------------------ALNNVTVPDKFPIPVIEELFDELHGATMFTKIDLKS-- 220
                                   LN  T+ D FP+P I+ L   +  A +FT +DL S  
Sbjct: 632  CSSPVVLVPKKDGTFRLCVDYRTLNKATISDPFPLPRIDNLLSRIGNAQIFTTLDLHSGY 691

Query: 221  -----RPYLR---------------------------------------KFVLVFFDDIL 280
                  P  R                                       +FV V+ DDIL
Sbjct: 692  HQIPMEPKDRYKTAFVTPSGKYEYTVMPFGLVNAPSTFARYMADTFRDLRFVNVYLDDIL 751

Query: 281  IYSKGLEDHLNHMRALLEVLRKNELYANKKKCSFAQFRVDYLGHIISREGVEVDPEKIRA 340
            I+S+  E+H  H+  +LE L+   L   KKKC FA    ++LG+ I  + +     K  A
Sbjct: 752  IFSESPEEHWKHLDTVLERLKNENLIVKKKKCKFASEETEFLGYSIGIQKIAPLQHKCAA 811

Query: 341  IKEWPIPANVREVRGFLGLTGYYRKFVQNYGTIAASLSQLLKIGGFKWTEEAQVAFNRLQ 400
            I+++P P  V++ + FLG+  YYR+F+ N   IA  + QL      +WTE+   A ++L+
Sbjct: 812  IRDFPTPKTVKQAQRFLGMINYYRRFIPNCSKIAQPI-QLFICDKSQWTEKQDKAIDKLK 871

Query: 401  QAMMSLPVLALPDFSVPFEIQTDASRYGLGAVL--VQNRRP----IAYYSHTLAMRDRAK 460
             A+ + PVL   +    + + TDAS+ G+GAVL  V N+      + Y+S +L    +  
Sbjct: 872  DALCNSPVLVPFNNKANYRLTTDASKDGIGAVLEEVDNKNKLVGVVGYFSKSLESAQKNY 931

Query: 461  PVYERELMAVVMAVQRWRPYLLGKKFLVKTDQRSLQFLLEQRRVIQPQYQKWIAKLLGYS 520
            P  E EL+ ++ A+  +R  L GK F ++TD  SL   L+ +     + Q+W+  L  Y 
Sbjct: 932  PAGELELLGIIKALHHFRYMLHGKHFTLRTDHISL-LSLQNKNEPARRVQRWLDDLATYD 991

Query: 521  FEVVYKPGLENKAADALSRV----------PLVAE-------INQLTVHTLIDLKVIKEE 580
            F + Y  G +N  ADA+SR           P+  E        + L    LI +K + + 
Sbjct: 992  FTLEYLAGPKNVVADAISRAVYTITPETSRPIDTESWKSYYKSDPLCSAVLIHMKELTQH 1051

Query: 581  VMKDEFLKEICRLQGGEEV-----KNYS-------------------------------- 600
             +  E +      Q   E+     KNYS                                
Sbjct: 1052 NVTPEDMSAFRSYQKKLELSETFRKNYSLEDEMIYYQDRLVVPIKQQNAVMRLYHDHTLF 1111

BLAST of CSPI05G18960 vs. ExPASy Swiss-Prot
Match: P0CT41 (Transposon Tf2-12 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=Tf2-12 PE=3 SV=1)

HSP 1 Score: 193.0 bits (489), Expect = 1.0e-47
Identity = 157/621 (25.28%), Positives = 261/621 (42.03%), Query Frame = 0

Query: 101  LPPRRLIEHHIHLKKG--------TDPVNVRPYRYAYQQKTEMERLVEEMLQGALNNVTV 160
            LPP ++   +  + +G        +  +N  P  +  +++  +  +V+      LN    
Sbjct: 420  LPPGKMQAMNDEINQGLKSGIIRESKAINACPVMFVPKKEGTLRMVVD---YKPLNKYVK 479

Query: 161  PDKFPIPVIEELFDELHGATMFTKIDLKSRPYL--------------------------- 220
            P+ +P+P+IE+L  ++ G+T+FTK+DLKS  +L                           
Sbjct: 480  PNIYPLPLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPY 539

Query: 221  ---------------------RKFVLVFFDDILIYSKGLEDHLNHMRALLEVLRKNELYA 280
                                    V+ + DDILI+SK   +H+ H++ +L+ L+   L  
Sbjct: 540  GISTAPAHFQYFINTILGEAKESHVVCYMDDILIHSKSESEHVKHVKDVLQKLKNANLII 599

Query: 281  NKKKCSFAQFRVDYLGHIISREGVEVDPEKIRAIKEWPIPANVREVRGFLGLTGYYRKFV 340
            N+ KC F Q +V ++G+ IS +G     E I  + +W  P N +E+R FLG   Y RKF+
Sbjct: 600  NQAKCEFHQSQVKFIGYHISEKGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNYLRKFI 659

Query: 341  QNYGTIAASLSQLLKIG-GFKWTEEAQVAFNRLQQAMMSLPVLALPDFSVPFEIQTDASR 400
                 +   L+ LLK    +KWT     A   ++Q ++S PVL   DFS    ++TDAS 
Sbjct: 660  PKTSQLTHPLNNLLKKDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLETDASD 719

Query: 401  YGLGAVLVQNR-----RPIAYYSHTLAMRDRAKPVYERELMAVVMAVQRWRPYLLG--KK 460
              +GAVL Q        P+ YYS  ++       V ++E++A++ +++ WR YL    + 
Sbjct: 720  VAVGAVLSQKHDDDKYYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHYLESTIEP 779

Query: 461  FLVKTDQRSL-QFLLEQRRVIQPQYQKWIAKLLGYSFEVVYKPGLENKAADALSR----- 520
            F + TD R+L   +  +      +  +W   L  ++FE+ Y+PG  N  ADALSR     
Sbjct: 780  FKILTDHRNLIGRITNESEPENKRLARWQLFLQDFNFEINYRPGSANHIADALSRIVDET 839

Query: 521  -----------VPLVAEI-------NQLTVHTLIDLKVIKEEVMKDEFLKEICRLQGG-- 580
                       +  V +I       NQ+      D K++     +D+ ++E  +L+ G  
Sbjct: 840  EPIPKDSEDNSINFVNQISITDDFKNQVVTEYTNDTKLLNLLNNEDKRVEENIQLKDGLL 899

Query: 581  -----------------EEVKNY----SWGHSGFLRTYKRLTEELFWVGMKSEI------ 582
                               +K Y       H G       +     W G++ +I      
Sbjct: 900  INSKDQILLPNDTQLTRTIIKKYHEEGKLIHPGIELLTNIILRRFTWKGIRKQIQEYVQN 959

BLAST of CSPI05G18960 vs. ExPASy TrEMBL
Match: A0A5D3CU05 (Ty3/gypsy retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold832G00630 PE=4 SV=1)

HSP 1 Score: 738.0 bits (1904), Expect = 3.2e-209
Identity = 397/745 (53.29%), Positives = 486/745 (65.23%), Query Frame = 0

Query: 19   FYRPTKKIVIRGDPSLTKARVSLKNLMKSWGEEDQGFLVECRMLERRESSEEEDSIEEVL 78
            F+   KK+VIRGDPSLTKARVSLKNLMKSWG +DQGFLVECR +E     E E   E+  
Sbjct: 621  FHHQGKKVVIRGDPSLTKARVSLKNLMKSWGADDQGFLVECRTIECGPLEEHEQDREQGE 680

Query: 79   TIEESMAVVLERFEDAFEWPETLPPRRLIEHHIHLKKGTDPVNVRPYRYAYQQKTEMERL 138
               E +A +L+RF   FEWP TLPP+R I+HHI+LK G DPVNVRPYRYA+ QK EMERL
Sbjct: 681  INAEPIAALLQRFARVFEWPSTLPPQRGIDHHIYLKSGADPVNVRPYRYAHHQKEEMERL 740

Query: 139  VEEMLQG-------------------------------ALNNVTVPDKFPIPVIEELFDE 198
            V+EML                                 ALNNVT+PDKFPIPVIEELFDE
Sbjct: 741  VDEMLTSGIIRPSKSPYSSPVLLVRKKDGSWRFCVDYRALNNVTIPDKFPIPVIEELFDE 800

Query: 199  LHGATMFTKIDLKS---------------------------------------------- 258
            L GA++F+KIDLK+                                              
Sbjct: 801  LKGASVFSKIDLKAGYHQIRMCPEDIEKTAFRTHEGHYEFLVMPFGLTNAPSTFQALMNQ 860

Query: 259  --RPYLRKFVLVFFDDILIYSKGLEDHLNHMRALLEVLRKNELYANKKKCSFAQFRVDYL 318
              +PYLR+FVLVFFDDIL+YS+G+E+H+ H+  +L +L++ ELY N +KCSFA+ R+ YL
Sbjct: 861  VFKPYLRRFVLVFFDDILVYSRGMEEHVQHLEVVLGLLQEKELYVNMEKCSFAKPRISYL 920

Query: 319  GHIISREGVEVDPEKIRAIKEWPIPANVREVRGFLGLTGYYRKFVQNYGTIAASLSQLLK 378
            GH IS +G+E DPEKIRA+ EWP PANVREVRGFLGLTGYYR+FV+NYGTIAA L+QLLK
Sbjct: 921  GHFISEQGIEADPEKIRAVSEWPTPANVREVRGFLGLTGYYRRFVKNYGTIAAPLTQLLK 980

Query: 379  IGGFKWTEEAQVAFNRLQQAMMSLPVLALPDFSVPFEIQTDASRYGLGAVLVQNRRPIAY 438
             G +KW EE + AF +L++AMM+LPVL +PDFS+PFEI++DAS +G+GAVL Q R+P+AY
Sbjct: 981  KGAYKWGEEEETAFGKLKRAMMTLPVLTMPDFSLPFEIESDASGFGMGAVLTQCRKPVAY 1040

Query: 439  YSHTLAMRDRAKPVYERELMAVVMAVQRWRPYLLGKKFLVKTDQRSLQFLLEQRRVIQPQ 498
            +S TL+MRDR++PVYEREL+AVV+AVQRWRPYLLG+KF VKTDQRSL+FLLEQ RV+QPQ
Sbjct: 1041 FSKTLSMRDRSRPVYERELIAVVLAVQRWRPYLLGRKFTVKTDQRSLKFLLEQ-RVVQPQ 1100

Query: 499  YQKWIAKLLGYSFEVVYKPGLENKAADALSRVPLVAEINQLTVHTLIDLKVIKEEVMKDE 558
            YQKW+AKLLGYSFEVVY+PGLENKAADALSR+   A +NQ+T   +ID+++IKEE   D 
Sbjct: 1101 YQKWVAKLLGYSFEVVYQPGLENKAADALSRITPTARLNQITAPAMIDVEIIKEETRHDP 1160

Query: 559  FLKEICRL--QGGEEVKNYS---------------------------------WGHSGFL 618
             L+EI RL  + G E+ +Y+                                  GHSGFL
Sbjct: 1161 ALQEIIRLIEEQGMEIPHYTLQQGVLKFKGRLVVSSKSTLLPTILHTYHDSVFGGHSGFL 1220

Query: 619  RTYKRLTEELFWVGMKS-----------------------------EIPQRMWEDISMDF 621
            RTYKRLT E++W GMK                              EIP  +W DISMDF
Sbjct: 1221 RTYKRLTGEIYWKGMKKDVMRYCEECAICQRNKSSALTPAGLLMPLEIPDAIWSDISMDF 1280

BLAST of CSPI05G18960 vs. ExPASy TrEMBL
Match: A0A5A7T4Y0 (Ty3/gypsy retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold379G001090 PE=4 SV=1)

HSP 1 Score: 738.0 bits (1904), Expect = 3.2e-209
Identity = 397/745 (53.29%), Positives = 486/745 (65.23%), Query Frame = 0

Query: 19   FYRPTKKIVIRGDPSLTKARVSLKNLMKSWGEEDQGFLVECRMLERRESSEEEDSIEEVL 78
            F+   KK+VIRGDPSLTKARVSLKNLMKSWG +DQGFLVECR +E     E E   E+  
Sbjct: 621  FHHQGKKVVIRGDPSLTKARVSLKNLMKSWGADDQGFLVECRTIECGPLEEHEQDREQGE 680

Query: 79   TIEESMAVVLERFEDAFEWPETLPPRRLIEHHIHLKKGTDPVNVRPYRYAYQQKTEMERL 138
               E +A +L+RF   FEWP TLPP+R I+HHI+LK G DPVNVRPYRYA+ QK EMERL
Sbjct: 681  INAEPIAALLQRFARVFEWPSTLPPQRGIDHHIYLKSGADPVNVRPYRYAHHQKEEMERL 740

Query: 139  VEEMLQG-------------------------------ALNNVTVPDKFPIPVIEELFDE 198
            V+EML                                 ALNNVT+PDKFPIPVIEELFDE
Sbjct: 741  VDEMLTSGIIRPSKSPYSSPVLLVRKKDGSWRFCVDYRALNNVTIPDKFPIPVIEELFDE 800

Query: 199  LHGATMFTKIDLKS---------------------------------------------- 258
            L GA++F+KIDLK+                                              
Sbjct: 801  LKGASVFSKIDLKAGYHQIRMCPEDIEKTAFRTHEGHYEFLVMPFGLTNAPSTFQALMNQ 860

Query: 259  --RPYLRKFVLVFFDDILIYSKGLEDHLNHMRALLEVLRKNELYANKKKCSFAQFRVDYL 318
              +PYLR+FVLVFFDDIL+YS+G+E+H+ H+  +L +L++ ELY N +KCSFA+ R+ YL
Sbjct: 861  VFKPYLRRFVLVFFDDILVYSRGMEEHVQHLEVVLGLLQEKELYVNMEKCSFAKPRISYL 920

Query: 319  GHIISREGVEVDPEKIRAIKEWPIPANVREVRGFLGLTGYYRKFVQNYGTIAASLSQLLK 378
            GH IS +G+E DPEKIRA+ EWP PANVREVRGFLGLTGYYR+FV+NYGTIAA L+QLLK
Sbjct: 921  GHFISEQGIEADPEKIRAVSEWPAPANVREVRGFLGLTGYYRRFVKNYGTIAAPLTQLLK 980

Query: 379  IGGFKWTEEAQVAFNRLQQAMMSLPVLALPDFSVPFEIQTDASRYGLGAVLVQNRRPIAY 438
             G +KW EE + AF +L++AMM+LPVL +PDFS+PFEI++DAS +G+GAVL Q R+P+AY
Sbjct: 981  KGAYKWGEEEETAFGKLKRAMMTLPVLTMPDFSLPFEIESDASGFGMGAVLTQCRKPVAY 1040

Query: 439  YSHTLAMRDRAKPVYERELMAVVMAVQRWRPYLLGKKFLVKTDQRSLQFLLEQRRVIQPQ 498
            +S TL+MRDR++PVYEREL+AVV+AVQRWRPYLLG+KF VKTDQRSL+FLLEQ RV+QPQ
Sbjct: 1041 FSKTLSMRDRSRPVYERELIAVVLAVQRWRPYLLGRKFTVKTDQRSLKFLLEQ-RVVQPQ 1100

Query: 499  YQKWIAKLLGYSFEVVYKPGLENKAADALSRVPLVAEINQLTVHTLIDLKVIKEEVMKDE 558
            YQKW+AKLLGYSFEVVY+PGLENKAADALSR+   A +NQ+T   +ID+++IKEE   D 
Sbjct: 1101 YQKWVAKLLGYSFEVVYQPGLENKAADALSRITPTARLNQITAPAMIDVEIIKEETRHDP 1160

Query: 559  FLKEICRL--QGGEEVKNYS---------------------------------WGHSGFL 618
             L+EI RL  + G E+ +Y+                                  GHSGFL
Sbjct: 1161 ALQEIIRLIEEQGMEIPHYTLQQGVLKFKGRLVVSSKSTLLPTILHTYHDSVFGGHSGFL 1220

Query: 619  RTYKRLTEELFWVGMKS-----------------------------EIPQRMWEDISMDF 621
            RTYKRLT E++W GMK                              EIP  +W DISMDF
Sbjct: 1221 RTYKRLTGEIYWKGMKKDVMRYCEECAICQRNKSSALTPAGLLMPLEIPDAIWSDISMDF 1280

BLAST of CSPI05G18960 vs. ExPASy TrEMBL
Match: A0A5A7UAE4 (Ty3/gypsy retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold175G001550 PE=4 SV=1)

HSP 1 Score: 735.3 bits (1897), Expect = 2.1e-208
Identity = 395/745 (53.02%), Positives = 487/745 (65.37%), Query Frame = 0

Query: 19   FYRPTKKIVIRGDPSLTKARVSLKNLMKSWGEEDQGFLVECRMLERRESSEEEDSIEEVL 78
            F+   KK+VIRGDPSLTKARVSLKNLMKSWG +DQGFLVECR +E     E E   E   
Sbjct: 623  FHHQGKKVVIRGDPSLTKARVSLKNLMKSWGADDQGFLVECRTIECGPLEEYEQDREPGE 682

Query: 79   TIEESMAVVLERFEDAFEWPETLPPRRLIEHHIHLKKGTDPVNVRPYRYAYQQKTEMERL 138
               E +A +L+RF   FEWP TLPP+R I+HHI+LK G DPVNVRPYRYA+ QK EMERL
Sbjct: 683  MNAEPIAALLQRFARVFEWPSTLPPQRGIDHHIYLKSGADPVNVRPYRYAHHQKEEMERL 742

Query: 139  VEEMLQG-------------------------------ALNNVTVPDKFPIPVIEELFDE 198
            V+EML                                 ALNNVT+PDKFPIPVIEELFDE
Sbjct: 743  VDEMLTSGIIRPSKSPYSSPVLLVRKRDGSWRFCVDYRALNNVTIPDKFPIPVIEELFDE 802

Query: 199  LHGATMFTKIDLKS---------------------------------------------- 258
            L GA++F+KIDLK+                                              
Sbjct: 803  LKGASVFSKIDLKAGYHQIRMCPEDIEKTAFRTHEGHYEFLVMPFGLTNAPSTFQALMNQ 862

Query: 259  --RPYLRKFVLVFFDDILIYSKGLEDHLNHMRALLEVLRKNELYANKKKCSFAQFRVDYL 318
              +PYLR+FVLVFFDDIL+YS+G+E+HL H+  +L +L++ ELY N +KCSFA+ R+ YL
Sbjct: 863  VFKPYLRRFVLVFFDDILVYSQGMEEHLQHLEVVLGLLQEKELYVNMEKCSFAKPRISYL 922

Query: 319  GHIISREGVEVDPEKIRAIKEWPIPANVREVRGFLGLTGYYRKFVQNYGTIAASLSQLLK 378
            GH IS +G+E DPEKIRA+ EWP PANVREVRGFLGLTGYYR+FV+NYGTIAA L+QLLK
Sbjct: 923  GHFISEQGIEADPEKIRAVSEWPTPANVREVRGFLGLTGYYRRFVKNYGTIAAPLTQLLK 982

Query: 379  IGGFKWTEEAQVAFNRLQQAMMSLPVLALPDFSVPFEIQTDASRYGLGAVLVQNRRPIAY 438
             G +KW EE + AF +L++AMM+LPVL +PDFS+PFEI++DAS +G+GAVL Q R+P+AY
Sbjct: 983  KGAYKWGEEEEAAFGKLKRAMMTLPVLTMPDFSLPFEIESDASGFGVGAVLTQCRKPVAY 1042

Query: 439  YSHTLAMRDRAKPVYERELMAVVMAVQRWRPYLLGKKFLVKTDQRSLQFLLEQRRVIQPQ 498
            +S TL++RDR++PVYEREL+AVV+AVQRWRPYLLG+KF VKTDQRSL++LLEQ RV+QPQ
Sbjct: 1043 FSKTLSIRDRSRPVYERELIAVVLAVQRWRPYLLGRKFTVKTDQRSLKYLLEQ-RVVQPQ 1102

Query: 499  YQKWIAKLLGYSFEVVYKPGLENKAADALSRVPLVAEINQLTVHTLIDLKVIKEEVMKDE 558
            YQKW+AKLLGYSFEVVY+PGLENKAADALSR+   A++NQ+T   LID++++KEE  +D 
Sbjct: 1103 YQKWVAKLLGYSFEVVYQPGLENKAADALSRITPTAQLNQITAPALIDVEILKEETRQDP 1162

Query: 559  FLKEICRL--QGGEEVKNYS---------------------------------WGHSGFL 618
             L+EI RL  + G E+ +Y+                                  GHSGFL
Sbjct: 1163 ALREIIRLIEEQGMEIPHYTLQQGVLKFKGRLVVSNKSTLLPTILHTYHDSVFGGHSGFL 1222

Query: 619  RTYKRLTEELFWVGMKS-----------------------------EIPQRMWEDISMDF 621
            RTYKRLT E++W GMK                              EIP  +W DISMDF
Sbjct: 1223 RTYKRLTGEIYWKGMKRDVMRYCEECAICQRNKSSALTPAGLLMPLEIPDAIWSDISMDF 1282

BLAST of CSPI05G18960 vs. ExPASy TrEMBL
Match: A0A5D3DM31 (Ty3/gypsy retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold266G002040 PE=4 SV=1)

HSP 1 Score: 732.3 bits (1889), Expect = 1.7e-207
Identity = 393/745 (52.75%), Positives = 485/745 (65.10%), Query Frame = 0

Query: 19   FYRPTKKIVIRGDPSLTKARVSLKNLMKSWGEEDQGFLVECRMLERRESSEEEDSIEEVL 78
            F+   KK+VIRGDPSLTKARVSLKNLMKSWG +DQGFLVECR +E     E E   E   
Sbjct: 864  FHHQGKKVVIRGDPSLTKARVSLKNLMKSWGADDQGFLVECRTIECGPLEEYEQDREPGE 923

Query: 79   TIEESMAVVLERFEDAFEWPETLPPRRLIEHHIHLKKGTDPVNVRPYRYAYQQKTEMERL 138
               E +A +L+RF   FEWP TLPP+R I+HHI++K G DPVNVRPYRYA+ QK EMERL
Sbjct: 924  MNAEPIAALLQRFARVFEWPSTLPPQRGIDHHIYVKSGADPVNVRPYRYAHHQKEEMERL 983

Query: 139  VEEMLQG-------------------------------ALNNVTVPDKFPIPVIEELFDE 198
            V+EML                                 ALNNVT+PDKFPIPVIEELFDE
Sbjct: 984  VDEMLTSGIIRPSKSPYSSPVLLVRKRDGSWRFCVDYRALNNVTIPDKFPIPVIEELFDE 1043

Query: 199  LHGATMFTKIDLKS---------------------------------------------- 258
            L GA++F+KIDLK+                                              
Sbjct: 1044 LKGASVFSKIDLKAGYHQIRMCPEDIEKTAFRTHEGHYEFLVMPFGLTNAPSTFQALMNQ 1103

Query: 259  --RPYLRKFVLVFFDDILIYSKGLEDHLNHMRALLEVLRKNELYANKKKCSFAQFRVDYL 318
              +PYLR+FVLVFFDDIL+YS+G+E+H  H+  +L +L+  ELY N +KCSFA+ R+ YL
Sbjct: 1104 VFKPYLRRFVLVFFDDILVYSRGMEEHFQHLEVVLGLLQAKELYVNMEKCSFAKPRISYL 1163

Query: 319  GHIISREGVEVDPEKIRAIKEWPIPANVREVRGFLGLTGYYRKFVQNYGTIAASLSQLLK 378
            GH IS +G+E DPEKIRA+ EWP PANVREVRGFLGLTGYYR+FV+NYGTIAA L+QLLK
Sbjct: 1164 GHFISEQGIEADPEKIRAVSEWPTPANVREVRGFLGLTGYYRRFVKNYGTIAAPLTQLLK 1223

Query: 379  IGGFKWTEEAQVAFNRLQQAMMSLPVLALPDFSVPFEIQTDASRYGLGAVLVQNRRPIAY 438
             G +KW EE + AF +L++AMM+LPVL +PDFS+PFEI++DAS +G+GAVL Q R+P+AY
Sbjct: 1224 KGAYKWGEEEEAAFGKLKRAMMTLPVLTMPDFSLPFEIESDASGFGVGAVLTQCRKPVAY 1283

Query: 439  YSHTLAMRDRAKPVYERELMAVVMAVQRWRPYLLGKKFLVKTDQRSLQFLLEQRRVIQPQ 498
            +S TL++RDR++PVYEREL+AVV+AVQRWRPYLLG+KF VKTDQRSL++LLEQ RV+QPQ
Sbjct: 1284 FSKTLSIRDRSRPVYERELIAVVLAVQRWRPYLLGRKFTVKTDQRSLKYLLEQ-RVVQPQ 1343

Query: 499  YQKWIAKLLGYSFEVVYKPGLENKAADALSRVPLVAEINQLTVHTLIDLKVIKEEVMKDE 558
            YQKW+AKLLGYSFEVVY+PGLENKAADALSR+   A++NQ+T   LID++++KEE  +D 
Sbjct: 1344 YQKWVAKLLGYSFEVVYQPGLENKAADALSRITPTAQLNQITAPALIDVEILKEETRQDP 1403

Query: 559  FLKEICRL--QGGEEVKNYS---------------------------------WGHSGFL 618
             L+EI RL  + G E+ +Y+                                  GHSGFL
Sbjct: 1404 ALREIIRLIEEQGMEIPHYTLQQGVLKFKGRLVVSNKSTLLPTILHTYHDSVFGGHSGFL 1463

Query: 619  RTYKRLTEELFWVGMKS-----------------------------EIPQRMWEDISMDF 621
            RTYKRLT E++W GMK                              EIP  +W DISMDF
Sbjct: 1464 RTYKRLTGEIYWKGMKRDVMRYCEECAICQRNKSSALTPAGLLMPLEIPDAIWSDISMDF 1523

BLAST of CSPI05G18960 vs. ExPASy TrEMBL
Match: A0A5D3C5N7 (Ty3/gypsy retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold453G00350 PE=4 SV=1)

HSP 1 Score: 732.3 bits (1889), Expect = 1.7e-207
Identity = 393/745 (52.75%), Positives = 485/745 (65.10%), Query Frame = 0

Query: 19   FYRPTKKIVIRGDPSLTKARVSLKNLMKSWGEEDQGFLVECRMLERRESSEEEDSIEEVL 78
            F+   KK+VIRGDPSLTKARVSLKNLMKSWG +DQGFLVECR +E     E E   E   
Sbjct: 623  FHHQGKKVVIRGDPSLTKARVSLKNLMKSWGADDQGFLVECRTIECGPLEEYEQDREPGE 682

Query: 79   TIEESMAVVLERFEDAFEWPETLPPRRLIEHHIHLKKGTDPVNVRPYRYAYQQKTEMERL 138
               E +A +L+RF   FEWP TLPP+R I+HHI++K G DPVNVRPYRYA+ QK EMERL
Sbjct: 683  MNAEPIAALLQRFARVFEWPSTLPPQRGIDHHIYVKSGADPVNVRPYRYAHHQKEEMERL 742

Query: 139  VEEMLQG-------------------------------ALNNVTVPDKFPIPVIEELFDE 198
            V+EML                                 ALNNVT+PDKFPIPVIEELFDE
Sbjct: 743  VDEMLTSGIIRPSKSPYSSPVLLVRKRDGSWRFCVDYRALNNVTIPDKFPIPVIEELFDE 802

Query: 199  LHGATMFTKIDLKS---------------------------------------------- 258
            L GA++F+KIDLK+                                              
Sbjct: 803  LKGASVFSKIDLKAGYHQIRMCPEDIEKTAFRTHEGHYEFLVMPFGLTNAPSTFQALMNQ 862

Query: 259  --RPYLRKFVLVFFDDILIYSKGLEDHLNHMRALLEVLRKNELYANKKKCSFAQFRVDYL 318
              +PYLR+FVLVFFDDIL+YS+G+E+H  H+  +L +L+  ELY N +KCSFA+ R+ YL
Sbjct: 863  VFKPYLRRFVLVFFDDILVYSRGMEEHFQHLEVVLGLLQAKELYVNMEKCSFAKPRISYL 922

Query: 319  GHIISREGVEVDPEKIRAIKEWPIPANVREVRGFLGLTGYYRKFVQNYGTIAASLSQLLK 378
            GH IS +G+E DPEKIRA+ EWP PANVREVRGFLGLTGYYR+FV+NYGTIAA L+QLLK
Sbjct: 923  GHFISEQGIEADPEKIRAVSEWPTPANVREVRGFLGLTGYYRRFVKNYGTIAAPLTQLLK 982

Query: 379  IGGFKWTEEAQVAFNRLQQAMMSLPVLALPDFSVPFEIQTDASRYGLGAVLVQNRRPIAY 438
             G +KW EE + AF +L++AMM+LPVL +PDFS+PFEI++DAS +G+GAVL Q R+P+AY
Sbjct: 983  KGAYKWGEEEEAAFGKLKRAMMTLPVLTMPDFSLPFEIESDASGFGVGAVLTQCRKPVAY 1042

Query: 439  YSHTLAMRDRAKPVYERELMAVVMAVQRWRPYLLGKKFLVKTDQRSLQFLLEQRRVIQPQ 498
            +S TL++RDR++PVYEREL+AVV+AVQRWRPYLLG+KF VKTDQRSL++LLEQ RV+QPQ
Sbjct: 1043 FSKTLSIRDRSRPVYERELIAVVLAVQRWRPYLLGRKFTVKTDQRSLKYLLEQ-RVVQPQ 1102

Query: 499  YQKWIAKLLGYSFEVVYKPGLENKAADALSRVPLVAEINQLTVHTLIDLKVIKEEVMKDE 558
            YQKW+AKLLGYSFEVVY+PGLENKAADALSR+   A++NQ+T   LID++++KEE  +D 
Sbjct: 1103 YQKWVAKLLGYSFEVVYQPGLENKAADALSRITPTAQLNQITAPALIDVEILKEETRQDP 1162

Query: 559  FLKEICRL--QGGEEVKNYS---------------------------------WGHSGFL 618
             L+EI RL  + G E+ +Y+                                  GHSGFL
Sbjct: 1163 ALREIIRLIEEQGMEIPHYTLQQGVLKFKGRLVVSNKSTLLPTILHTYHDSVFGGHSGFL 1222

Query: 619  RTYKRLTEELFWVGMKS-----------------------------EIPQRMWEDISMDF 621
            RTYKRLT E++W GMK                              EIP  +W DISMDF
Sbjct: 1223 RTYKRLTGEIYWKGMKRDVMRYCEECAICQRNKSSALTPAGLLMPLEIPDAIWSDISMDF 1282

BLAST of CSPI05G18960 vs. NCBI nr
Match: KAA0037196.1 (Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa])

HSP 1 Score: 738.0 bits (1904), Expect = 6.6e-209
Identity = 397/745 (53.29%), Positives = 486/745 (65.23%), Query Frame = 0

Query: 19   FYRPTKKIVIRGDPSLTKARVSLKNLMKSWGEEDQGFLVECRMLERRESSEEEDSIEEVL 78
            F+   KK+VIRGDPSLTKARVSLKNLMKSWG +DQGFLVECR +E     E E   E+  
Sbjct: 621  FHHQGKKVVIRGDPSLTKARVSLKNLMKSWGADDQGFLVECRTIECGPLEEHEQDREQGE 680

Query: 79   TIEESMAVVLERFEDAFEWPETLPPRRLIEHHIHLKKGTDPVNVRPYRYAYQQKTEMERL 138
               E +A +L+RF   FEWP TLPP+R I+HHI+LK G DPVNVRPYRYA+ QK EMERL
Sbjct: 681  INAEPIAALLQRFARVFEWPSTLPPQRGIDHHIYLKSGADPVNVRPYRYAHHQKEEMERL 740

Query: 139  VEEMLQG-------------------------------ALNNVTVPDKFPIPVIEELFDE 198
            V+EML                                 ALNNVT+PDKFPIPVIEELFDE
Sbjct: 741  VDEMLTSGIIRPSKSPYSSPVLLVRKKDGSWRFCVDYRALNNVTIPDKFPIPVIEELFDE 800

Query: 199  LHGATMFTKIDLKS---------------------------------------------- 258
            L GA++F+KIDLK+                                              
Sbjct: 801  LKGASVFSKIDLKAGYHQIRMCPEDIEKTAFRTHEGHYEFLVMPFGLTNAPSTFQALMNQ 860

Query: 259  --RPYLRKFVLVFFDDILIYSKGLEDHLNHMRALLEVLRKNELYANKKKCSFAQFRVDYL 318
              +PYLR+FVLVFFDDIL+YS+G+E+H+ H+  +L +L++ ELY N +KCSFA+ R+ YL
Sbjct: 861  VFKPYLRRFVLVFFDDILVYSRGMEEHVQHLEVVLGLLQEKELYVNMEKCSFAKPRISYL 920

Query: 319  GHIISREGVEVDPEKIRAIKEWPIPANVREVRGFLGLTGYYRKFVQNYGTIAASLSQLLK 378
            GH IS +G+E DPEKIRA+ EWP PANVREVRGFLGLTGYYR+FV+NYGTIAA L+QLLK
Sbjct: 921  GHFISEQGIEADPEKIRAVSEWPAPANVREVRGFLGLTGYYRRFVKNYGTIAAPLTQLLK 980

Query: 379  IGGFKWTEEAQVAFNRLQQAMMSLPVLALPDFSVPFEIQTDASRYGLGAVLVQNRRPIAY 438
             G +KW EE + AF +L++AMM+LPVL +PDFS+PFEI++DAS +G+GAVL Q R+P+AY
Sbjct: 981  KGAYKWGEEEETAFGKLKRAMMTLPVLTMPDFSLPFEIESDASGFGMGAVLTQCRKPVAY 1040

Query: 439  YSHTLAMRDRAKPVYERELMAVVMAVQRWRPYLLGKKFLVKTDQRSLQFLLEQRRVIQPQ 498
            +S TL+MRDR++PVYEREL+AVV+AVQRWRPYLLG+KF VKTDQRSL+FLLEQ RV+QPQ
Sbjct: 1041 FSKTLSMRDRSRPVYERELIAVVLAVQRWRPYLLGRKFTVKTDQRSLKFLLEQ-RVVQPQ 1100

Query: 499  YQKWIAKLLGYSFEVVYKPGLENKAADALSRVPLVAEINQLTVHTLIDLKVIKEEVMKDE 558
            YQKW+AKLLGYSFEVVY+PGLENKAADALSR+   A +NQ+T   +ID+++IKEE   D 
Sbjct: 1101 YQKWVAKLLGYSFEVVYQPGLENKAADALSRITPTARLNQITAPAMIDVEIIKEETRHDP 1160

Query: 559  FLKEICRL--QGGEEVKNYS---------------------------------WGHSGFL 618
             L+EI RL  + G E+ +Y+                                  GHSGFL
Sbjct: 1161 ALQEIIRLIEEQGMEIPHYTLQQGVLKFKGRLVVSSKSTLLPTILHTYHDSVFGGHSGFL 1220

Query: 619  RTYKRLTEELFWVGMKS-----------------------------EIPQRMWEDISMDF 621
            RTYKRLT E++W GMK                              EIP  +W DISMDF
Sbjct: 1221 RTYKRLTGEIYWKGMKKDVMRYCEECAICQRNKSSALTPAGLLMPLEIPDAIWSDISMDF 1280

BLAST of CSPI05G18960 vs. NCBI nr
Match: TYK13876.1 (Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa])

HSP 1 Score: 738.0 bits (1904), Expect = 6.6e-209
Identity = 397/745 (53.29%), Positives = 486/745 (65.23%), Query Frame = 0

Query: 19   FYRPTKKIVIRGDPSLTKARVSLKNLMKSWGEEDQGFLVECRMLERRESSEEEDSIEEVL 78
            F+   KK+VIRGDPSLTKARVSLKNLMKSWG +DQGFLVECR +E     E E   E+  
Sbjct: 621  FHHQGKKVVIRGDPSLTKARVSLKNLMKSWGADDQGFLVECRTIECGPLEEHEQDREQGE 680

Query: 79   TIEESMAVVLERFEDAFEWPETLPPRRLIEHHIHLKKGTDPVNVRPYRYAYQQKTEMERL 138
               E +A +L+RF   FEWP TLPP+R I+HHI+LK G DPVNVRPYRYA+ QK EMERL
Sbjct: 681  INAEPIAALLQRFARVFEWPSTLPPQRGIDHHIYLKSGADPVNVRPYRYAHHQKEEMERL 740

Query: 139  VEEMLQG-------------------------------ALNNVTVPDKFPIPVIEELFDE 198
            V+EML                                 ALNNVT+PDKFPIPVIEELFDE
Sbjct: 741  VDEMLTSGIIRPSKSPYSSPVLLVRKKDGSWRFCVDYRALNNVTIPDKFPIPVIEELFDE 800

Query: 199  LHGATMFTKIDLKS---------------------------------------------- 258
            L GA++F+KIDLK+                                              
Sbjct: 801  LKGASVFSKIDLKAGYHQIRMCPEDIEKTAFRTHEGHYEFLVMPFGLTNAPSTFQALMNQ 860

Query: 259  --RPYLRKFVLVFFDDILIYSKGLEDHLNHMRALLEVLRKNELYANKKKCSFAQFRVDYL 318
              +PYLR+FVLVFFDDIL+YS+G+E+H+ H+  +L +L++ ELY N +KCSFA+ R+ YL
Sbjct: 861  VFKPYLRRFVLVFFDDILVYSRGMEEHVQHLEVVLGLLQEKELYVNMEKCSFAKPRISYL 920

Query: 319  GHIISREGVEVDPEKIRAIKEWPIPANVREVRGFLGLTGYYRKFVQNYGTIAASLSQLLK 378
            GH IS +G+E DPEKIRA+ EWP PANVREVRGFLGLTGYYR+FV+NYGTIAA L+QLLK
Sbjct: 921  GHFISEQGIEADPEKIRAVSEWPTPANVREVRGFLGLTGYYRRFVKNYGTIAAPLTQLLK 980

Query: 379  IGGFKWTEEAQVAFNRLQQAMMSLPVLALPDFSVPFEIQTDASRYGLGAVLVQNRRPIAY 438
             G +KW EE + AF +L++AMM+LPVL +PDFS+PFEI++DAS +G+GAVL Q R+P+AY
Sbjct: 981  KGAYKWGEEEETAFGKLKRAMMTLPVLTMPDFSLPFEIESDASGFGMGAVLTQCRKPVAY 1040

Query: 439  YSHTLAMRDRAKPVYERELMAVVMAVQRWRPYLLGKKFLVKTDQRSLQFLLEQRRVIQPQ 498
            +S TL+MRDR++PVYEREL+AVV+AVQRWRPYLLG+KF VKTDQRSL+FLLEQ RV+QPQ
Sbjct: 1041 FSKTLSMRDRSRPVYERELIAVVLAVQRWRPYLLGRKFTVKTDQRSLKFLLEQ-RVVQPQ 1100

Query: 499  YQKWIAKLLGYSFEVVYKPGLENKAADALSRVPLVAEINQLTVHTLIDLKVIKEEVMKDE 558
            YQKW+AKLLGYSFEVVY+PGLENKAADALSR+   A +NQ+T   +ID+++IKEE   D 
Sbjct: 1101 YQKWVAKLLGYSFEVVYQPGLENKAADALSRITPTARLNQITAPAMIDVEIIKEETRHDP 1160

Query: 559  FLKEICRL--QGGEEVKNYS---------------------------------WGHSGFL 618
             L+EI RL  + G E+ +Y+                                  GHSGFL
Sbjct: 1161 ALQEIIRLIEEQGMEIPHYTLQQGVLKFKGRLVVSSKSTLLPTILHTYHDSVFGGHSGFL 1220

Query: 619  RTYKRLTEELFWVGMKS-----------------------------EIPQRMWEDISMDF 621
            RTYKRLT E++W GMK                              EIP  +W DISMDF
Sbjct: 1221 RTYKRLTGEIYWKGMKKDVMRYCEECAICQRNKSSALTPAGLLMPLEIPDAIWSDISMDF 1280

BLAST of CSPI05G18960 vs. NCBI nr
Match: KAA0050511.1 (Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa])

HSP 1 Score: 735.3 bits (1897), Expect = 4.3e-208
Identity = 395/745 (53.02%), Positives = 487/745 (65.37%), Query Frame = 0

Query: 19   FYRPTKKIVIRGDPSLTKARVSLKNLMKSWGEEDQGFLVECRMLERRESSEEEDSIEEVL 78
            F+   KK+VIRGDPSLTKARVSLKNLMKSWG +DQGFLVECR +E     E E   E   
Sbjct: 623  FHHQGKKVVIRGDPSLTKARVSLKNLMKSWGADDQGFLVECRTIECGPLEEYEQDREPGE 682

Query: 79   TIEESMAVVLERFEDAFEWPETLPPRRLIEHHIHLKKGTDPVNVRPYRYAYQQKTEMERL 138
               E +A +L+RF   FEWP TLPP+R I+HHI+LK G DPVNVRPYRYA+ QK EMERL
Sbjct: 683  MNAEPIAALLQRFARVFEWPSTLPPQRGIDHHIYLKSGADPVNVRPYRYAHHQKEEMERL 742

Query: 139  VEEMLQG-------------------------------ALNNVTVPDKFPIPVIEELFDE 198
            V+EML                                 ALNNVT+PDKFPIPVIEELFDE
Sbjct: 743  VDEMLTSGIIRPSKSPYSSPVLLVRKRDGSWRFCVDYRALNNVTIPDKFPIPVIEELFDE 802

Query: 199  LHGATMFTKIDLKS---------------------------------------------- 258
            L GA++F+KIDLK+                                              
Sbjct: 803  LKGASVFSKIDLKAGYHQIRMCPEDIEKTAFRTHEGHYEFLVMPFGLTNAPSTFQALMNQ 862

Query: 259  --RPYLRKFVLVFFDDILIYSKGLEDHLNHMRALLEVLRKNELYANKKKCSFAQFRVDYL 318
              +PYLR+FVLVFFDDIL+YS+G+E+HL H+  +L +L++ ELY N +KCSFA+ R+ YL
Sbjct: 863  VFKPYLRRFVLVFFDDILVYSQGMEEHLQHLEVVLGLLQEKELYVNMEKCSFAKPRISYL 922

Query: 319  GHIISREGVEVDPEKIRAIKEWPIPANVREVRGFLGLTGYYRKFVQNYGTIAASLSQLLK 378
            GH IS +G+E DPEKIRA+ EWP PANVREVRGFLGLTGYYR+FV+NYGTIAA L+QLLK
Sbjct: 923  GHFISEQGIEADPEKIRAVSEWPTPANVREVRGFLGLTGYYRRFVKNYGTIAAPLTQLLK 982

Query: 379  IGGFKWTEEAQVAFNRLQQAMMSLPVLALPDFSVPFEIQTDASRYGLGAVLVQNRRPIAY 438
             G +KW EE + AF +L++AMM+LPVL +PDFS+PFEI++DAS +G+GAVL Q R+P+AY
Sbjct: 983  KGAYKWGEEEEAAFGKLKRAMMTLPVLTMPDFSLPFEIESDASGFGVGAVLTQCRKPVAY 1042

Query: 439  YSHTLAMRDRAKPVYERELMAVVMAVQRWRPYLLGKKFLVKTDQRSLQFLLEQRRVIQPQ 498
            +S TL++RDR++PVYEREL+AVV+AVQRWRPYLLG+KF VKTDQRSL++LLEQ RV+QPQ
Sbjct: 1043 FSKTLSIRDRSRPVYERELIAVVLAVQRWRPYLLGRKFTVKTDQRSLKYLLEQ-RVVQPQ 1102

Query: 499  YQKWIAKLLGYSFEVVYKPGLENKAADALSRVPLVAEINQLTVHTLIDLKVIKEEVMKDE 558
            YQKW+AKLLGYSFEVVY+PGLENKAADALSR+   A++NQ+T   LID++++KEE  +D 
Sbjct: 1103 YQKWVAKLLGYSFEVVYQPGLENKAADALSRITPTAQLNQITAPALIDVEILKEETRQDP 1162

Query: 559  FLKEICRL--QGGEEVKNYS---------------------------------WGHSGFL 618
             L+EI RL  + G E+ +Y+                                  GHSGFL
Sbjct: 1163 ALREIIRLIEEQGMEIPHYTLQQGVLKFKGRLVVSNKSTLLPTILHTYHDSVFGGHSGFL 1222

Query: 619  RTYKRLTEELFWVGMKS-----------------------------EIPQRMWEDISMDF 621
            RTYKRLT E++W GMK                              EIP  +W DISMDF
Sbjct: 1223 RTYKRLTGEIYWKGMKRDVMRYCEECAICQRNKSSALTPAGLLMPLEIPDAIWSDISMDF 1282

BLAST of CSPI05G18960 vs. NCBI nr
Match: TYK24654.1 (Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa])

HSP 1 Score: 732.3 bits (1889), Expect = 3.6e-207
Identity = 393/745 (52.75%), Positives = 485/745 (65.10%), Query Frame = 0

Query: 19   FYRPTKKIVIRGDPSLTKARVSLKNLMKSWGEEDQGFLVECRMLERRESSEEEDSIEEVL 78
            F+   KK+VIRGDPSLTKARVSLKNLMKSWG +DQGFLVECR +E     E E   E   
Sbjct: 864  FHHQGKKVVIRGDPSLTKARVSLKNLMKSWGADDQGFLVECRTIECGPLEEYEQDREPGE 923

Query: 79   TIEESMAVVLERFEDAFEWPETLPPRRLIEHHIHLKKGTDPVNVRPYRYAYQQKTEMERL 138
               E +A +L+RF   FEWP TLPP+R I+HHI++K G DPVNVRPYRYA+ QK EMERL
Sbjct: 924  MNAEPIAALLQRFARVFEWPSTLPPQRGIDHHIYVKSGADPVNVRPYRYAHHQKEEMERL 983

Query: 139  VEEMLQG-------------------------------ALNNVTVPDKFPIPVIEELFDE 198
            V+EML                                 ALNNVT+PDKFPIPVIEELFDE
Sbjct: 984  VDEMLTSGIIRPSKSPYSSPVLLVRKRDGSWRFCVDYRALNNVTIPDKFPIPVIEELFDE 1043

Query: 199  LHGATMFTKIDLKS---------------------------------------------- 258
            L GA++F+KIDLK+                                              
Sbjct: 1044 LKGASVFSKIDLKAGYHQIRMCPEDIEKTAFRTHEGHYEFLVMPFGLTNAPSTFQALMNQ 1103

Query: 259  --RPYLRKFVLVFFDDILIYSKGLEDHLNHMRALLEVLRKNELYANKKKCSFAQFRVDYL 318
              +PYLR+FVLVFFDDIL+YS+G+E+H  H+  +L +L+  ELY N +KCSFA+ R+ YL
Sbjct: 1104 VFKPYLRRFVLVFFDDILVYSRGMEEHFQHLEVVLGLLQAKELYVNMEKCSFAKPRISYL 1163

Query: 319  GHIISREGVEVDPEKIRAIKEWPIPANVREVRGFLGLTGYYRKFVQNYGTIAASLSQLLK 378
            GH IS +G+E DPEKIRA+ EWP PANVREVRGFLGLTGYYR+FV+NYGTIAA L+QLLK
Sbjct: 1164 GHFISEQGIEADPEKIRAVSEWPTPANVREVRGFLGLTGYYRRFVKNYGTIAAPLTQLLK 1223

Query: 379  IGGFKWTEEAQVAFNRLQQAMMSLPVLALPDFSVPFEIQTDASRYGLGAVLVQNRRPIAY 438
             G +KW EE + AF +L++AMM+LPVL +PDFS+PFEI++DAS +G+GAVL Q R+P+AY
Sbjct: 1224 KGAYKWGEEEEAAFGKLKRAMMTLPVLTMPDFSLPFEIESDASGFGVGAVLTQCRKPVAY 1283

Query: 439  YSHTLAMRDRAKPVYERELMAVVMAVQRWRPYLLGKKFLVKTDQRSLQFLLEQRRVIQPQ 498
            +S TL++RDR++PVYEREL+AVV+AVQRWRPYLLG+KF VKTDQRSL++LLEQ RV+QPQ
Sbjct: 1284 FSKTLSIRDRSRPVYERELIAVVLAVQRWRPYLLGRKFTVKTDQRSLKYLLEQ-RVVQPQ 1343

Query: 499  YQKWIAKLLGYSFEVVYKPGLENKAADALSRVPLVAEINQLTVHTLIDLKVIKEEVMKDE 558
            YQKW+AKLLGYSFEVVY+PGLENKAADALSR+   A++NQ+T   LID++++KEE  +D 
Sbjct: 1344 YQKWVAKLLGYSFEVVYQPGLENKAADALSRITPTAQLNQITAPALIDVEILKEETRQDP 1403

Query: 559  FLKEICRL--QGGEEVKNYS---------------------------------WGHSGFL 618
             L+EI RL  + G E+ +Y+                                  GHSGFL
Sbjct: 1404 ALREIIRLIEEQGMEIPHYTLQQGVLKFKGRLVVSNKSTLLPTILHTYHDSVFGGHSGFL 1463

Query: 619  RTYKRLTEELFWVGMKS-----------------------------EIPQRMWEDISMDF 621
            RTYKRLT E++W GMK                              EIP  +W DISMDF
Sbjct: 1464 RTYKRLTGEIYWKGMKRDVMRYCEECAICQRNKSSALTPAGLLMPLEIPDAIWSDISMDF 1523

BLAST of CSPI05G18960 vs. NCBI nr
Match: TYK06572.1 (Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa])

HSP 1 Score: 732.3 bits (1889), Expect = 3.6e-207
Identity = 393/745 (52.75%), Positives = 485/745 (65.10%), Query Frame = 0

Query: 19   FYRPTKKIVIRGDPSLTKARVSLKNLMKSWGEEDQGFLVECRMLERRESSEEEDSIEEVL 78
            F+   KK+VIRGDPSLTKARVSLKNLMKSWG +DQGFLVECR +E     E E   E   
Sbjct: 623  FHHQGKKVVIRGDPSLTKARVSLKNLMKSWGADDQGFLVECRTIECGPLEEYEQDREPGE 682

Query: 79   TIEESMAVVLERFEDAFEWPETLPPRRLIEHHIHLKKGTDPVNVRPYRYAYQQKTEMERL 138
               E +A +L+RF   FEWP TLPP+R I+HHI++K G DPVNVRPYRYA+ QK EMERL
Sbjct: 683  MNAEPIAALLQRFARVFEWPSTLPPQRGIDHHIYVKSGADPVNVRPYRYAHHQKEEMERL 742

Query: 139  VEEMLQG-------------------------------ALNNVTVPDKFPIPVIEELFDE 198
            V+EML                                 ALNNVT+PDKFPIPVIEELFDE
Sbjct: 743  VDEMLTSGIIRPSKSPYSSPVLLVRKRDGSWRFCVDYRALNNVTIPDKFPIPVIEELFDE 802

Query: 199  LHGATMFTKIDLKS---------------------------------------------- 258
            L GA++F+KIDLK+                                              
Sbjct: 803  LKGASVFSKIDLKAGYHQIRMCPEDIEKTAFRTHEGHYEFLVMPFGLTNAPSTFQALMNQ 862

Query: 259  --RPYLRKFVLVFFDDILIYSKGLEDHLNHMRALLEVLRKNELYANKKKCSFAQFRVDYL 318
              +PYLR+FVLVFFDDIL+YS+G+E+H  H+  +L +L+  ELY N +KCSFA+ R+ YL
Sbjct: 863  VFKPYLRRFVLVFFDDILVYSRGMEEHFQHLEVVLGLLQAKELYVNMEKCSFAKPRISYL 922

Query: 319  GHIISREGVEVDPEKIRAIKEWPIPANVREVRGFLGLTGYYRKFVQNYGTIAASLSQLLK 378
            GH IS +G+E DPEKIRA+ EWP PANVREVRGFLGLTGYYR+FV+NYGTIAA L+QLLK
Sbjct: 923  GHFISEQGIEADPEKIRAVSEWPTPANVREVRGFLGLTGYYRRFVKNYGTIAAPLTQLLK 982

Query: 379  IGGFKWTEEAQVAFNRLQQAMMSLPVLALPDFSVPFEIQTDASRYGLGAVLVQNRRPIAY 438
             G +KW EE + AF +L++AMM+LPVL +PDFS+PFEI++DAS +G+GAVL Q R+P+AY
Sbjct: 983  KGAYKWGEEEEAAFGKLKRAMMTLPVLTMPDFSLPFEIESDASGFGVGAVLTQCRKPVAY 1042

Query: 439  YSHTLAMRDRAKPVYERELMAVVMAVQRWRPYLLGKKFLVKTDQRSLQFLLEQRRVIQPQ 498
            +S TL++RDR++PVYEREL+AVV+AVQRWRPYLLG+KF VKTDQRSL++LLEQ RV+QPQ
Sbjct: 1043 FSKTLSIRDRSRPVYERELIAVVLAVQRWRPYLLGRKFTVKTDQRSLKYLLEQ-RVVQPQ 1102

Query: 499  YQKWIAKLLGYSFEVVYKPGLENKAADALSRVPLVAEINQLTVHTLIDLKVIKEEVMKDE 558
            YQKW+AKLLGYSFEVVY+PGLENKAADALSR+   A++NQ+T   LID++++KEE  +D 
Sbjct: 1103 YQKWVAKLLGYSFEVVYQPGLENKAADALSRITPTAQLNQITAPALIDVEILKEETRQDP 1162

Query: 559  FLKEICRL--QGGEEVKNYS---------------------------------WGHSGFL 618
             L+EI RL  + G E+ +Y+                                  GHSGFL
Sbjct: 1163 ALREIIRLIEEQGMEIPHYTLQQGVLKFKGRLVVSNKSTLLPTILHTYHDSVFGGHSGFL 1222

Query: 619  RTYKRLTEELFWVGMKS-----------------------------EIPQRMWEDISMDF 621
            RTYKRLT E++W GMK                              EIP  +W DISMDF
Sbjct: 1223 RTYKRLTGEIYWKGMKRDVMRYCEECAICQRNKSSALTPAGLLMPLEIPDAIWSDISMDF 1282

BLAST of CSPI05G18960 vs. TAIR 10
Match: ATMG00860.1 (DNA/RNA polymerases superfamily protein)

HSP 1 Score: 151.0 bits (380), Expect = 3.2e-36
Identity = 70/131 (53.44%), Positives = 94/131 (71.76%), Query Frame = 0

Query: 207 LNHMRALLEVLRKNELYANKKKCSFAQFRVDYLG--HIISREGVEVDPEKIRAIKEWPIP 266
           +NH+  +L++  +++ YAN+KKC+F Q ++ YLG  HIIS EGV  DP K+ A+  WP P
Sbjct: 1   MNHLGMVLQIWEQHQFYANRKKCAFGQPQIAYLGHRHIISGEGVSADPAKLEAMVGWPEP 60

Query: 267 ANVREVRGFLGLTGYYRKFVQNYGTIAASLSQLLKIGGFKWTEEAQVAFNRLQQAMMSLP 326
            N  E+RGFLGLTGYYR+FV+NYG I   L++LLK    KWTE A +AF  L+ A+ +LP
Sbjct: 61  KNTTELRGFLGLTGYYRRFVKNYGKIVRPLTELLKKNSLKWTEMAALAFKALKGAVTTLP 120

Query: 327 VLALPDFSVPF 336
           VLALPD  +PF
Sbjct: 121 VLALPDLKLPF 131
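
The percentages in each HSP header line are the match counts divided by the alignment length. The following is a minimal sketch for checking that arithmetic; the function name `recompute_percentages` and the regular expression are illustrative assumptions and are not part of the database or of any BLAST tool.

```python
import re

# Matches the statistics line of an HSP header as it appears on this page.
# "Pos\w+" is deliberately loose so that label spelling variants still match.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Pos\w+ = (\d+)/(\d+) \(([\d.]+)%\)"
)


def recompute_percentages(line):
    """Return (identity %, positives %) recomputed from the raw counts."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    ident, ident_len, _, pos, pos_len, _ = match.groups()
    identity_pct = round(100 * int(ident) / int(ident_len), 2)
    positives_pct = round(100 * int(pos) / int(pos_len), 2)
    return identity_pct, positives_pct


if __name__ == "__main__":
    # Values copied from the TAIR 10 hit above (70/131 and 94/131).
    header = "Identity = 70/131 (53.44%), Positives = 94/131 (71.76%), Query Frame = 0"
    print(recompute_percentages(header))
```

Applied to the TAIR 10 header, the recomputed values agree with the reported 53.44% identity and 71.76% positives.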

The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
P20825 | 3.3e-54 | 34.45 | Retrovirus-related Pol polyprotein from transposon 297 OS=Drosophila melanogaste...
Q8I7P9 | 1.8e-52 | 30.11 | Retrovirus-related Pol polyprotein from transposon opus OS=Drosophila melanogast...
P04323 | 1.5e-51 | 32.01 | Retrovirus-related Pol polyprotein from transposon 17.6 OS=Drosophila melanogast...
Q99315 | 9.3e-49 | 25.74 | Transposon Ty3-G Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 20...
P0CT41 | 1.0e-47 | 25.28 | Transposon Tf2-12 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24...

Match Name | E-value | Identity (%) | Description
A0A5D3CU05 | 3.2e-209 | 53.29 | Ty3/gypsy retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E567...
A0A5A7T4Y0 | 3.2e-209 | 53.29 | Ty3/gypsy retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C2...
A0A5A7UAE4 | 2.1e-208 | 53.02 | Ty3/gypsy retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C2...
A0A5D3DM31 | 1.7e-207 | 52.75 | Ty3/gypsy retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E567...
A0A5D3C5N7 | 1.7e-207 | 52.75 | Ty3/gypsy retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E567...

Match Name | E-value | Identity (%) | Description
KAA0037196.1 | 6.6e-209 | 53.29 | Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]
TYK13876.1 | 6.6e-209 | 53.29 | Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]
KAA0050511.1 | 4.3e-208 | 53.02 | Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]
TYK24654.1 | 3.6e-207 | 52.75 | Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]
TYK06572.1 | 3.6e-207 | 52.75 | Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]

Match Name | E-value | Identity (%) | Description
ATMG00860.1 | 3.2e-36 | 53.44 | DNA/RNA polymerases superfamily protein
InterPro
Analysis Name: InterPro Annotations of Cucumber (PI 183967) v1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
None | No IPR available | GENE3D | 3.10.10.10 | HIV Type 1 Reverse Transcriptase, subunit A, domain 1 | coord: 146..155; e-value: 2.4E-8; score: 35.5
None | No IPR available | GENE3D | 3.10.10.10 | HIV Type 1 Reverse Transcriptase, subunit A, domain 1 | coord: 101..145; e-value: 2.0E-5; score: 26.3
None | No IPR available | PANTHER | PTHR24559:SF324 | TRANSPOSON TY3-I GAG-POL POLYPROTEIN-LIKE PROTEIN | coord: 188..435; coord: 146..180
None | No IPR available | PANTHER | PTHR24559 | TRANSPOSON TY3-I GAG-POL POLYPROTEIN | coord: 188..435; coord: 146..180
None | No IPR available | CDD | cd09274 | RNase_HI_RT_Ty3 | coord: 336..452; e-value: 7.14176E-44; score: 150.721
None | No IPR available | CDD | cd01647 | RT_LTR | coord: 147..243; e-value: 1.17192E-32; score: 121.934
IPR043128 | Reverse transcriptase/Diguanylate cyclase domain | GENE3D | 3.30.70.270 |  | coord: 156..181; e-value: 2.4E-8; score: 35.5
IPR043128 | Reverse transcriptase/Diguanylate cyclase domain | GENE3D | 3.30.70.270 |  | coord: 252..342; e-value: 4.9E-27; score: 95.8
IPR043128 | Reverse transcriptase/Diguanylate cyclase domain | GENE3D | 3.30.70.270 |  | coord: 185..248; e-value: 7.2E-22; score: 79.6
IPR000477 | Reverse transcriptase domain | PFAM | PF00078 | RVT_1 | coord: 187..243; e-value: 7.8E-10; score: 38.6
IPR036397 | Ribonuclease H superfamily | GENE3D | 3.30.420.10 |  | coord: 518..602; e-value: 5.0E-8; score: 34.4
IPR041577 | Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain | PFAM | PF17919 | RT_RNaseH_2 | coord: 305..399; e-value: 2.8E-31; score: 107.5
IPR043502 | DNA/RNA polymerase superfamily | SUPERFAMILY | 56672 | DNA/RNA polymerases | coord: 92..437
IPR012337 | Ribonuclease H-like superfamily | SUPERFAMILY | 53098 | Ribonuclease H-like | coord: 523..583

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
CSPI05G18960.1 | CSPI05G18960.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0044260 | cellular macromolecule metabolic process
biological_process | GO:0006807 | nitrogen compound metabolic process
biological_process | GO:0044238 | primary metabolic process
molecular_function | GO:0003676 | nucleic acid binding