CSPI05G14310 (gene) Cucumber (PI 183967) v1

Overview
NameCSPI05G14310
Typegene
OrganismCucumis sativus var. hardwickii cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionGag/pol protein
LocationChr5: 14692872 .. 14695259 (+)
RNA-Seq ExpressionCSPI05G14310
SyntenyCSPI05G14310
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGACCAAGAGATCTTTTACTGGAAAAGGTCTAAGAGCCAAAGGACCCTTAGAGCTCATACATTCGGACCTTTGTGGACCAATGAATGTCAAAGCTCGAGGTGGATATGAATATTTCATTAGCTTCATTGATGATTATTCAAGGTATGGTCATATTTACCTAATACATCATAAGTCTAATAGTCTTGAAAAGTTCAAAGAATATAAGGCTGAAGTAGAAAACGAATTAGGTAAAACAATAAAAATACTTCGATCAGATCGAGGTGGAGAGTATATGGACTTACGATTCCGAGACTATTTAATAGAAAATGGAATCCAGTCACAACTCTCTGCACCTAGTACACCTCAACAGAACGGTGTATCAGAAAGAAGAAACCGGACCTTGTTAGACATGGTTCGCTCTATGATGAGTTTTTCTCAGATGTCAGATTCTTTTTGGGGATATGCTTTAGAAACAGCTGCTTATATTTTGAATAATGTTCCCTCTAAAAGTGTTTCAGAAACACCTTATGAGCTATGGAAAGGGCGTAAAGGAAGTTTACGTCATTTTAGAATTTGGGGTTGTCCAGCACACGTGTTGGTACAAAATCCAAAGAAATTGGAACATCGTTCAAAATTATGCTTTTTTATAGGTTATCCAAAAGAATCAAGAGGTGGTTTGTTTTATGATCCTCAAGAAAATAAAATATTTGTGTCAACAAATGCCACATTCTTAGAGGAAGACCACATCAGGGATCATCAACCTCGTAGTAAACTAGTATTAAAAGAAATTTCCAAAAGTGCTATAGATAAACCTAGTTCATCCACTAAGGTAGTTGATAAGACTAGGAAATCTGGTCAATCACATCCTTCTCAACAGTTGAGAGAGCCTCGACGTAGTGGGAGGGTTGTTCATCAGCCTGATCGCTATTTGGGTTTAATTGAAACTCAAGTCGTCATACCTGACGATGGCATAGAGGATCCATTAACCTATAAACAGGCAATGAAAGATGTAGATCGTGACCAATGGATCAAAGCCATGGACCTCGAAATGGAGTCTATGTACTTTAATTCTGTCTGGACTCTAGTAGATCAACCAAATGACGTAAAACCTATTGGTTGTAAATGGATCTACAAGAGAAAACGAGACCATGCCGGTAAAGTACAGACTTTCAAGGCTCGACTTGTGGCAAAGGGTTATACCCAGAGAGAGGGAGTAGACTATGAGGAAACTTTCTCTCCTGTTGCCATGTTAAAGTCAATTAGAATACTCTTATCCATCGCCACTTTTTATGATTATGAAATTTGGCAGATGGATGTCAAGACAGCTTTTTTGAATGGTAATCTTGAAGAGAGTATCTATATGTCTCAACCAGAGGGGTTTATAGAACAAGATCAAGAACAAAAGGTTTGTAAGCTTAAAAAATCCATTTATGGATTAAAACAAGCTTCTAGATCCTGGAATATAAGATTTGATACTGCGATCAAATCTTATGGCTTTGAACAAAATATTGACGAGCCTTGTGTTTACAAAAAGGTCGTCAATTCCATTATAGCATTTTTAGTCTTATATGTAGATGATATTCTACTTATTGGAAATGACGTAGGATATCTTACTGATATCAAGAAATGGCTAGCTATGCAATTTCAAATGAAAGATCTGGGAGATGCACAATACGTTCTCGGAATCCAAATTGTTCGAAACCGTAAGAACAAAACACTAGCCATGTCTCAAGCATCTTACATAGACAAAATGTTGTCTAGATATAAAATGCAGAATTCCAAAAAGGGTCTGCTGCCGTACAGATATGGAATTCATTTGTCAAAGGAACAATGTCCTAAGACACCTCAAGAAGTTGAGGATATGAGAAATATTCCCTATGCTTCCGCTGTTGGAAGTTTAATGTATGCAATGTTATGTACTAGACCTGACATTTGCTACTCAGTAGGGATGGTCAGTAGGTATCAATCCAATCCTGGACGTGATCACTGGACAGCCGTTAAAAACATTCTAAAATATCTTCGAAGAACAAAAGACTACATGCTCATGTATGGTACAAAGGATCTGATCCTTACTGGATACACTGATTCAGATTTCCAAACTGATAAAGATGCTAGAAAGTCTACATCAGGATCAGTATTTACTCTAAATGGAGGAGCAGTAGTTTGGAGAAGCATAAAGCAAACTTGTATAGCTGATTCCACAATGGAAGCTGAATACGTAGCGGCTTGTGAAGCAGCAAAAGAAGCAGTATGGCTAAGAAAATTCTTGACAGATTTGGAAGTCGTTCCAAATATGCATCTACCAATCACTTTATACTGTGACAACAGTGGTGCAGTTGAAAATTCAAGAAACATAAGGATCGCAGACCAGATTTCCCATCCAAAAGGACAAGAAAAGAACAAATGA

mRNA sequence

ATGACCAAGAGATCTTTTACTGGAAAAGGTCTAAGAGCCAAAGGACCCTTAGAGCTCATACATTCGGACCTTTGTGGACCAATGAATGTCAAAGCTCGAGGTGGATATGAATATTTCATTAGCTTCATTGATGATTATTCAAGGTATGGTCATATTTACCTAATACATCATAAGTCTAATAGTCTTGAAAAGTTCAAAGAATATAAGGCTGAAGTAGAAAACGAATTAGGTAAAACAATAAAAATACTTCGATCAGATCGAGGTGGAGAGTATATGGACTTACGATTCCGAGACTATTTAATAGAAAATGGAATCCAGTCACAACTCTCTGCACCTAGTACACCTCAACAGAACGGTGTATCAGAAAGAAGAAACCGGACCTTGTTAGACATGGTTCGCTCTATGATGAGTTTTTCTCAGATGTCAGATTCTTTTTGGGGATATGCTTTAGAAACAGCTGCTTATATTTTGAATAATGTTCCCTCTAAAAGTGTTTCAGAAACACCTTATGAGCTATGGAAAGGGCGTAAAGGAAGTTTACGTCATTTTAGAATTTGGGGTTGTCCAGCACACGTGTTGGTACAAAATCCAAAGAAATTGGAACATCGTTCAAAATTATGCTTTTTTATAGGTTATCCAAAAGAATCAAGAGGTGGTTTGTTTTATGATCCTCAAGAAAATAAAATATTTGTGTCAACAAATGCCACATTCTTAGAGGAAGACCACATCAGGGATCATCAACCTCGTAGTAAACTAGTATTAAAAGAAATTTCCAAAAGTGCTATAGATAAACCTAGTTCATCCACTAAGGTAGTTGATAAGACTAGGAAATCTGGTCAATCACATCCTTCTCAACAGTTGAGAGAGCCTCGACGTAGTGGGAGGGTTGTTCATCAGCCTGATCGCTATTTGGGTTTAATTGAAACTCAAGTCGTCATACCTGACGATGGCATAGAGGATCCATTAACCTATAAACAGGCAATGAAAGATGTAGATCGTGACCAATGGATCAAAGCCATGGACCTCGAAATGGAGTCTATGTACTTTAATTCTGTCTGGACTCTAGTAGATCAACCAAATGACGTAAAACCTATTGGTTGTAAATGGATCTACAAGAGAAAACGAGACCATGCCGGTAAAGTACAGACTTTCAAGGCTCGACTTGTGGCAAAGGGTTATACCCAGAGAGAGGGAGTAGACTATGAGGAAACTTTCTCTCCTGTTGCCATGTTAAAGTCAATTAGAATACTCTTATCCATCGCCACTTTTTATGATTATGAAATTTGGCAGATGGATGTCAAGACAGCTTTTTTGAATGGTAATCTTGAAGAGAGTATCTATATGTCTCAACCAGAGGGGTTTATAGAACAAGATCAAGAACAAAAGGTTTGTAAGCTTAAAAAATCCATTTATGGATTAAAACAAGCTTCTAGATCCTGGAATATAAGATTTGATACTGCGATCAAATCTTATGGCTTTGAACAAAATATTGACGAGCCTTGTGTTTACAAAAAGGTCGTCAATTCCATTATAGCATTTTTAGTCTTATATGTAGATGATATTCTACTTATTGGAAATGACGTAGGATATCTTACTGATATCAAGAAATGGCTAGCTATGCAATTTCAAATGAAAGATCTGGGAGATGCACAATACGTTCTCGGAATCCAAATTGTTCGAAACCGTAAGAACAAAACACTAGCCATGTCTCAAGCATCTTACATAGACAAAATGTTGTCTAGATATAAAATGCAGAATTCCAAAAAGGGTCTGCTGCCGTACAGATATGGAATTCATTTGTCAAAGGAACAATGTCCTAAGACACCTCAAGAAGTTGAGGATATGAGAAATATTCCCTATGCTTCCGCTGTTGGAAGTTTAATGTATGCAATGTTATGTACTAGACCTGACATTTGCTACTCAGTAGGGATGGTCAGTAGGTATCAATCCAATCCTGGACGTGATCACTGGACAGCCGTTAAAAACATTCTAAAATATCTTCGAAGAACAAAAGACTACATGCTCATGTATGGTACAAAGGATCTGATCCTTACTGGATACACTGATTCAGATTTCCAAACTGATAAAGATGCTAGAAAGTCTACATCAGGATCAGTATTTACTCTAAATGGAGGAGCAGTAGTTTGGAGAAGCATAAAGCAAACTTGTATAGCTGATTCCACAATGGAAGCTGAATACGTAGCGGCTTGTGAAGCAGCAAAAGAAGCAGTATGGCTAAGAAAATTCTTGACAGATTTGGAAGTCGTTCCAAATATGCATCTACCAATCACTTTATACTGTGACAACAGTGGTGCAGTTGAAAATTCAAGAAACATAAGGATCGCAGACCAGATTTCCCATCCAAAAGGACAAGAAAAGAACAAATGA

Coding sequence (CDS)

ATGACCAAGAGATCTTTTACTGGAAAAGGTCTAAGAGCCAAAGGACCCTTAGAGCTCATACATTCGGACCTTTGTGGACCAATGAATGTCAAAGCTCGAGGTGGATATGAATATTTCATTAGCTTCATTGATGATTATTCAAGGTATGGTCATATTTACCTAATACATCATAAGTCTAATAGTCTTGAAAAGTTCAAAGAATATAAGGCTGAAGTAGAAAACGAATTAGGTAAAACAATAAAAATACTTCGATCAGATCGAGGTGGAGAGTATATGGACTTACGATTCCGAGACTATTTAATAGAAAATGGAATCCAGTCACAACTCTCTGCACCTAGTACACCTCAACAGAACGGTGTATCAGAAAGAAGAAACCGGACCTTGTTAGACATGGTTCGCTCTATGATGAGTTTTTCTCAGATGTCAGATTCTTTTTGGGGATATGCTTTAGAAACAGCTGCTTATATTTTGAATAATGTTCCCTCTAAAAGTGTTTCAGAAACACCTTATGAGCTATGGAAAGGGCGTAAAGGAAGTTTACGTCATTTTAGAATTTGGGGTTGTCCAGCACACGTGTTGGTACAAAATCCAAAGAAATTGGAACATCGTTCAAAATTATGCTTTTTTATAGGTTATCCAAAAGAATCAAGAGGTGGTTTGTTTTATGATCCTCAAGAAAATAAAATATTTGTGTCAACAAATGCCACATTCTTAGAGGAAGACCACATCAGGGATCATCAACCTCGTAGTAAACTAGTATTAAAAGAAATTTCCAAAAGTGCTATAGATAAACCTAGTTCATCCACTAAGGTAGTTGATAAGACTAGGAAATCTGGTCAATCACATCCTTCTCAACAGTTGAGAGAGCCTCGACGTAGTGGGAGGGTTGTTCATCAGCCTGATCGCTATTTGGGTTTAATTGAAACTCAAGTCGTCATACCTGACGATGGCATAGAGGATCCATTAACCTATAAACAGGCAATGAAAGATGTAGATCGTGACCAATGGATCAAAGCCATGGACCTCGAAATGGAGTCTATGTACTTTAATTCTGTCTGGACTCTAGTAGATCAACCAAATGACGTAAAACCTATTGGTTGTAAATGGATCTACAAGAGAAAACGAGACCATGCCGGTAAAGTACAGACTTTCAAGGCTCGACTTGTGGCAAAGGGTTATACCCAGAGAGAGGGAGTAGACTATGAGGAAACTTTCTCTCCTGTTGCCATGTTAAAGTCAATTAGAATACTCTTATCCATCGCCACTTTTTATGATTATGAAATTTGGCAGATGGATGTCAAGACAGCTTTTTTGAATGGTAATCTTGAAGAGAGTATCTATATGTCTCAACCAGAGGGGTTTATAGAACAAGATCAAGAACAAAAGGTTTGTAAGCTTAAAAAATCCATTTATGGATTAAAACAAGCTTCTAGATCCTGGAATATAAGATTTGATACTGCGATCAAATCTTATGGCTTTGAACAAAATATTGACGAGCCTTGTGTTTACAAAAAGGTCGTCAATTCCATTATAGCATTTTTAGTCTTATATGTAGATGATATTCTACTTATTGGAAATGACGTAGGATATCTTACTGATATCAAGAAATGGCTAGCTATGCAATTTCAAATGAAAGATCTGGGAGATGCACAATACGTTCTCGGAATCCAAATTGTTCGAAACCGTAAGAACAAAACACTAGCCATGTCTCAAGCATCTTACATAGACAAAATGTTGTCTAGATATAAAATGCAGAATTCCAAAAAGGGTCTGCTGCCGTACAGATATGGAATTCATTTGTCAAAGGAACAATGTCCTAAGACACCTCAAGAAGTTGAGGATATGAGAAATATTCCCTATGCTTCCGCTGTTGGAAGTTTAATGTATGCAATGTTATGTACTAGACCTGACATTTGCTACTCAGTAGGGATGGTCAGTAGGTATCAATCCAATCCTGGACGTGATCACTGGACAGCCGTTAAAAACATTCTAAAATATCTTCGAAGAACAAAAGACTACATGCTCATGTATGGTACAAAGGATCTGATCCTTACTGGATACACTGATTCAGATTTCCAAACTGATAAAGATGCTAGAAAGTCTACATCAGGATCAGTATTTACTCTAAATGGAGGAGCAGTAGTTTGGAGAAGCATAAAGCAAACTTGTATAGCTGATTCCACAATGGAAGCTGAATACGTAGCGGCTTGTGAAGCAGCAAAAGAAGCAGTATGGCTAAGAAAATTCTTGACAGATTTGGAAGTCGTTCCAAATATGCATCTACCAATCACTTTATACTGTGACAACAGTGGTGCAGTTGAAAATTCAAGAAACATAAGGATCGCAGACCAGATTTCCCATCCAAAAGGACAAGAAAAGAACAAATGA

Protein sequence

MTKRSFTGKGLRAKGPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKSAIDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRVVHQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFDTAIKSYGFEQNIDEPCVYKKVVNSIIAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLPYRYGIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDHWTAVKNILKYLRRTKDYMLMYGTKDLILTGYTDSDFQTDKDARKSTSGSVFTLNGGAVVWRSIKQTCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMHLPITLYCDNSGAVENSRNIRIADQISHPKGQEKNK*
Homology
BLAST of CSPI05G14310 vs. ExPASy Swiss-Prot
Match: P10978 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum OX=4097 PE=2 SV=1)

HSP 1 Score: 625.5 bits (1612), Expect = 8.0e-178
Identity = 336/806 (41.69%), Postives = 497/806 (61.66%), Query Frame = 0

Query: 5    SFTGKGLRAKGPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEK 64
            SF     R    L+L++SD+CGPM +++ GG +YF++FIDD SR   +Y++  K    + 
Sbjct: 469  SFQTSSERKLNILDLVYSDVCGPMEIESMGGNKYFVTFIDDASRKLWVYILKTKDQVFQV 528

Query: 65   FKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLSAPSTPQQNGVSERR 124
            F+++ A VE E G+ +K LRSD GGEY    F +Y   +GI+ + + P TPQ NGV+ER 
Sbjct: 529  FQKFHALVERETGRKLKRLRSDNGGEYTSREFEEYCSSHGIRHEKTVPGTPQHNGVAERM 588

Query: 125  NRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVS-ETPYELWKGRKGSLRHF 184
            NRT+++ VRSM+  +++  SFWG A++TA Y++N  PS  ++ E P  +W  ++ S  H 
Sbjct: 589  NRTIVEKVRSMLRMAKLPKSFWGEAVQTACYLINRSPSVPLAFEIPERVWTNKEVSYSHL 648

Query: 185  RIWGCP--AHVLVQNPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEED 244
            +++GC   AHV  +   KL+ +S  C FIGY  E  G   +DP + K+  S +  F  E 
Sbjct: 649  KVFGCRAFAHVPKEQRTKLDDKSIPCIFIGYGDEEFGYRLWDPVKKKVIRSRDVVF-RES 708

Query: 245  HIRDHQPRSKLVLKEI------SKSAIDKPSSSTKVVDKTRKSGQ--------------- 304
             +R     S+ V   I        S  + P+S+    D+  + G+               
Sbjct: 709  EVRTAADMSEKVKNGIIPNFVTIPSTSNNPTSAESTTDEVSEQGEQPGEVIEQGEQLDEG 768

Query: 305  ----SHPSQ---QLREPRRSGRVVHQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDR 364
                 HP+Q   Q +  RRS R   +  RY       V+I DD   +P + K+ +   ++
Sbjct: 769  VEEVEHPTQGEEQHQPLRRSERPRVESRRYPS--TEYVLISDD--REPESLKEVLSHPEK 828

Query: 365  DQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGY 424
            +Q +KAM  EMES+  N  + LV+ P   +P+ CKW++K K+D   K+  +KARLV KG+
Sbjct: 829  NQLMKAMQEEMESLQKNGTYKLVELPKGKRPLKCKWVFKLKKDGDCKLVRYKARLVVKGF 888

Query: 425  TQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEG 484
             Q++G+D++E FSPV  + SIR +LS+A   D E+ Q+DVKTAFL+G+LEE IYM QPEG
Sbjct: 889  EQKKGIDFDEIFSPVVKMTSIRTILSLAASLDLEVEQLDVKTAFLHGDLEEEIYMEQPEG 948

Query: 485  FIEQDQEQKVCKLKKSIYGLKQASRSWNIRFDTAIKSYGFEQNIDEPCVY-KKVVNSIIA 544
            F    ++  VCKL KS+YGLKQA R W ++FD+ +KS  + +   +PCVY K+   +   
Sbjct: 949  FEVAGKKHMVCKLNKSLYGLKQAPRQWYMKFDSFMKSQTYLKTYSDPCVYFKRFSENNFI 1008

Query: 545  FLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQA 604
             L+LYVDD+L++G D G +  +K  L+  F MKDLG AQ +LG++IVR R ++ L +SQ 
Sbjct: 1009 ILLLYVDDMLIVGKDKGLIAKLKGDLSKSFDMKDLGPAQQILGMKIVRERTSRKLWLSQE 1068

Query: 605  SYIDKMLSRYKMQNSKKGLLPYRYGIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAML 664
             YI+++L R+ M+N+K    P    + LSK+ CP T +E  +M  +PY+SAVGSLMYAM+
Sbjct: 1069 KYIERVLERFNMKNAKPVSTPLAGHLKLSKKMCPTTVEEKGNMAKVPYSSAVGSLMYAMV 1128

Query: 665  CTRPDICYSVGMVSRYQSNPGRDHWTAVKNILKYLRRTKDYMLMYGTKDLILTGYTDSDF 724
            CTRPDI ++VG+VSR+  NPG++HW AVK IL+YLR T    L +G  D IL GYTD+D 
Sbjct: 1129 CTRPDIAHAVGVVSRFLENPGKEHWEAVKWILRYLRGTTGDCLCFGGSDPILKGYTDADM 1188

Query: 725  QTDKDARKSTSGSVFTLNGGAVVWRSIKQTCIADSTMEAEYVAACEAAKEAVWLRKFLTD 779
              D D RKS++G +FT +GGA+ W+S  Q C+A ST EAEY+AA E  KE +WL++FL +
Sbjct: 1189 AGDIDNRKSSTGYLFTFSGGAISWQSKLQKCVALSTTEAEYIAATETGKEMIWLKRFLQE 1248

BLAST of CSPI05G14310 vs. ExPASy Swiss-Prot
Match: P04146 (Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3)

HSP 1 Score: 421.0 bits (1081), Expect = 3.0e-116
Identity = 270/885 (30.51%), Postives = 447/885 (50.51%), Query Frame = 0

Query: 14   KGPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVE 73
            K PL ++HSD+CGP+         YF+ F+D ++ Y   YLI +KS+    F+++ A+ E
Sbjct: 478  KRPLFVVHSDVCGPITPVTLDDKNYFVIFVDQFTHYCVTYLIKYKSDVFSMFQDFVAKSE 537

Query: 74   NELGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDMVR 133
                  +  L  D G EY+    R + ++ GI   L+ P TPQ NGVSER  RT+ +  R
Sbjct: 538  AHFNLKVVYLYIDNGREYLSNEMRQFCVKKGISYHLTVPHTPQLNGVSERMIRTITEKAR 597

Query: 134  SMMSFSQMSDSFWGYALETAAYILNNVPSKSV---SETPYELWKGRKGSLRHFRIWGCPA 193
            +M+S +++  SFWG A+ TA Y++N +PS+++   S+TPYE+W  +K  L+H R++G   
Sbjct: 598  TMVSGAKLDKSFWGEAVLTATYLINRIPSRALVDSSKTPYEMWHNKKPYLKHLRVFGATV 657

Query: 194  HVLVQNPK-KLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPR 253
            +V ++N + K + +S    F+GY  E  G   +D    K  V+ +    E + +     +
Sbjct: 658  YVHIKNKQGKFDDKSFKSIFVGY--EPNGFKLWDAVNEKFIVARDVVVDETNMVNSRAVK 717

Query: 254  SKLVLKEISKSAIDK--PSSSTKVV------------------DKTRKSGQSHPS----- 313
             + V  + SK + +K  P+ S K++                  D      ++ P+     
Sbjct: 718  FETVFLKDSKESENKNFPNDSRKIIQTEFPNESKECDNIQFLKDSKESENKNFPNDSRKI 777

Query: 314  ---------------QQLREPRRSGRVV------HQPDRYLG------------LIETQV 373
                           Q L++ + S +         + D +L               ET  
Sbjct: 778  IQTEFPNESKECDNIQFLKDSKESNKYFLNESKKRKRDDHLNESKGSGNPNESRESETAE 837

Query: 374  VIPDDGIEDP---------------------LTYKQ--------------AMKDV----- 433
             + + GI++P                     ++Y +                 DV     
Sbjct: 838  HLKEIGIDNPTKNDGIEIINRRSERLKTKPQISYNEEDNSLNKVVLNAHTIFNDVPNSFD 897

Query: 434  ------DRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFK 493
                  D+  W +A++ E+ +   N+ WT+  +P +   +  +W++  K +  G    +K
Sbjct: 898  EIQYRDDKSSWEEAINTELNAHKINNTWTITKRPENKNIVDSRWVFSVKYNELGNPIRYK 957

Query: 494  ARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEES 553
            ARLVA+G+TQ+  +DYEETF+PVA + S R +LS+   Y+ ++ QMDVKTAFLNG L+E 
Sbjct: 958  ARLVARGFTQKYQIDYEETFAPVARISSFRFILSLVIQYNLKVHQMDVKTAFLNGTLKEE 1017

Query: 554  IYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFDTAIKSYGFEQNIDEPCVY-- 613
            IYM  P+G         VCKL K+IYGLKQA+R W   F+ A+K   F  +  + C+Y  
Sbjct: 1018 IYMRLPQGI--SCNSDNVCKLNKAIYGLKQAARCWFEVFEQALKECEFVNSSVDRCIYIL 1077

Query: 614  -KKVVNSIIAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNR 673
             K  +N  I +++LYVDD+++   D+  + + K++L  +F+M DL + ++ +GI+I    
Sbjct: 1078 DKGNINENI-YVLLYVDDVVIATGDMTRMNNFKRYLMEKFRMTDLNEIKHFIGIRI--EM 1137

Query: 674  KNKTLAMSQASYIDKMLSRYKMQNSKKGLLP----YRYGIHLSKEQCPKTPQEVEDMRNI 733
            +   + +SQ++Y+ K+LS++ M+N      P      Y +  S E C           N 
Sbjct: 1138 QEDKIYLSQSAYVKKILSKFNMENCNAVSTPLPSKINYELLNSDEDC-----------NT 1197

Query: 734  PYASAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDHWTAVKNILKYLRRTKDYMLMYG 779
            P  S +G LMY MLCTRPD+  +V ++SRY S    + W  +K +L+YL+ T D  L++ 
Sbjct: 1198 PCRSLIGCLMYIMLCTRPDLTTAVNILSRYSSKNNSELWQNLKRVLRYLKGTIDMKLIF- 1257

BLAST of CSPI05G14310 vs. ExPASy Swiss-Prot
Match: Q9ZT94 (Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana OX=3702 GN=RE2 PE=4 SV=1)

HSP 1 Score: 379.4 bits (973), Expect = 9.9e-104
Identity = 273/923 (29.58%), Postives = 424/923 (45.94%), Query Frame = 0

Query: 3    KRSFTGKGLRAKGPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSL 62
            K  F+   + +  PLE I+SD+     + +   Y Y++ F+D ++RY  +Y +  KS   
Sbjct: 489  KVPFSNSTITSSKPLEYIYSDVWS-SPILSIDNYRYYVIFVDHFTRYTWLYPLKQKSQVK 548

Query: 63   EKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLSAPSTPQQNGVSE 122
            + F  +K+ VEN     I  L SD GGE++ L  RDYL ++GI    S P TP+ NG+SE
Sbjct: 549  DTFIIFKSLVENRFQTRIGTLYSDNGGEFVVL--RDYLSQHGISHFTSPPHTPEHNGLSE 608

Query: 123  RRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVS-ETPYELWKGRKGSLR 182
            R++R +++M  +++S + +  ++W YA   A Y++N +P+  +  ++P++   G+  +  
Sbjct: 609  RKHRHIVEMGLTLLSHASVPKTYWPYAFSVAVYLINRLPTPLLQLQSPFQKLFGQPPNYE 668

Query: 183  HFRIWGCPAHVLVQ--NPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLE 242
              +++GC  +  ++  N  KLE +SK C F+GY       L       +++ S +  F E
Sbjct: 669  KLKVFGCACYPWLRPYNRHKLEDKSKQCAFMGYSLTQSAYLCLHIPTGRLYTSRHVQFDE 728

Query: 243  E---------------------------------------------DHIRDHQPR----- 302
                                                           H+ D  PR     
Sbjct: 729  RCFPFSTTNFGVSTSQEQRSDSAPNWPSHTTLPTTPLVLPAPPCLGPHL-DTSPRPPSSP 788

Query: 303  SKLVLKEIS-----KSAIDKPSSSTKVV---DKTRKSGQSHPSQQ-------LREPRRSG 362
            S L   ++S      S+I  PSSS       +  + + Q H +Q        L  P  + 
Sbjct: 789  SPLCTTQVSSSNLPSSSISSPSSSEPTAPSHNGPQPTAQPHQTQNSNSNSPILNNPNPNS 848

Query: 363  RVVHQPDRYLGLIETQVVIP---------------------------------------- 422
               + P++   L ++ +  P                                        
Sbjct: 849  PSPNSPNQNSPLPQSPISSPHIPTPSTSISEPNSPSSSSTSTPPLPPVLPAPPIIQVNAQ 908

Query: 423  ------------DDGI----------------EDPLTYKQAMKDVDRDQWIKAMDLEMES 482
                         DGI                 +P T  QAMKD   D+W +AM  E+ +
Sbjct: 909  APVNTHSMATRAKDGIRKPNQKYSYATSLAANSEPRTAIQAMKD---DRWRQAMGSEINA 968

Query: 483  MYFNSVWTLV-DQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETF 542
               N  W LV   P  V  +GC+WI+ +K +  G +  +KARLVAKGY QR G+DY ETF
Sbjct: 969  QIGNHTWDLVPPPPPSVTIVGCRWIFTKKFNSDGSLNRYKARLVAKGYNQRPGLDYAETF 1028

Query: 543  SPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCK 602
            SPV    SIRI+L +A    + I Q+DV  AFL G L + +YMSQP GF+++D+   VC+
Sbjct: 1029 SPVIKSTSIRIVLGVAVDRSWPIRQLDVNNAFLQGTLTDEVYMSQPPGFVDKDRPDYVCR 1088

Query: 603  LKKSIYGLKQASRSWNIRFDTAIKSYGFEQNIDEPCVYKKVVNSIIAFLVLYVDDILLIG 662
            L+K+IYGLKQA R+W +   T + + GF  +I +  ++       I ++++YVDDIL+ G
Sbjct: 1089 LRKAIYGLKQAPRAWYVELRTYLLTVGFVNSISDTSLFVLQRGRSIIYMLVYVDDILITG 1148

Query: 663  NDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQ 722
            ND   L      L+ +F +K+  D  Y LGI+    R  + L +SQ  Y   +L+R  M 
Sbjct: 1149 NDTVLLKHTLDALSQRFSVKEHEDLHYFLGIE--AKRVPQGLHLSQRRYTLDLLARTNML 1208

Query: 723  NSKKGLLPYRYGIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPDICYSVGMV 782
             +K    P      L+     K P   E      Y   VGSL Y +  TRPD+ Y+V  +
Sbjct: 1209 TAKPVATPMATSPKLTLHSGTKLPDPTE------YRGIVGSLQY-LAFTRPDLSYAVNRL 1268

Query: 783  SRYQSNPGRDHWTAVKNILKYLRRTKDY-MLMYGTKDLILTGYTDSDFQTDKDARKSTSG 788
            S+Y   P  DHW A+K +L+YL  T D+ + +     L L  Y+D+D+  D D   ST+G
Sbjct: 1269 SQYMHMPTDDHWNALKRVLRYLAGTPDHGIFLKKGNTLSLHAYSDADWAGDTDDYVSTNG 1328

BLAST of CSPI05G14310 vs. ExPASy Swiss-Prot
Match: Q94HW2 (Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana OX=3702 GN=RE1 PE=2 SV=1)

HSP 1 Score: 360.9 bits (925), Expect = 3.7e-98
Identity = 262/918 (28.54%), Postives = 416/918 (45.32%), Query Frame = 0

Query: 3    KRSFTGKGLRAKGPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSL 62
            K  F+   + +  PLE I+SD+     + +   Y Y++ F+D ++RY  +Y +  KS   
Sbjct: 510  KVPFSQSTINSTRPLEYIYSDVWS-SPILSHDNYRYYVIFVDHFTRYTWLYPLKQKSQVK 569

Query: 63   EKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLSAPSTPQQNGVSE 122
            E F  +K  +EN     I    SD GGE++ L   +Y  ++GI    S P TP+ NG+SE
Sbjct: 570  ETFITFKNLLENRFQTRIGTFYSDNGGEFVAL--WEYFSQHGISHLTSPPHTPEHNGLSE 629

Query: 123  RRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVS-ETPYELWKGRKGSLR 182
            R++R +++   +++S + +  ++W YA   A Y++N +P+  +  E+P++   G   +  
Sbjct: 630  RKHRHIVETGLTLLSHASIPKTYWPYAFAVAVYLINRLPTPLLQLESPFQKLFGTSPNYD 689

Query: 183  HFRIWGCPAHVLVQ--NPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLE 242
              R++GC  +  ++  N  KL+ +S+ C F+GY       L    Q +++++S +  F E
Sbjct: 690  KLRVFGCACYPWLRPYNQHKLDDKSRQCVFLGYSLTQSAYLCLHLQTSRLYISRHVRFDE 749

Query: 243  E-----------DHIRDHQPRSKLVL---------------------------------- 302
                          +++ +  S  V                                   
Sbjct: 750  NCFPFSNYLATLSPVQEQRRESSCVWSPHTTLPTRTPVLPAPSCSDPHHAATPPSSPSAP 809

Query: 303  ---KEISKSAIDKPSSST----------------KVVDKTRKSGQSHPS----------- 362
                ++S S +D   SS+                     T+   Q+H S           
Sbjct: 810  FRNSQVSSSNLDSSFSSSFPSSPEPTAPRQNGPQPTTQPTQTQTQTHSSQNTSQNNPTNE 869

Query: 363  ------QQLREPRRSGR---------------------VVHQPDRYLGLIETQVVIP--- 422
                  Q L  P +S                       ++H P     ++      P   
Sbjct: 870  SPSQLAQSLSTPAQSSSSSPSPTTSASSSSTSPTPPSILIHPPPPLAQIVNNNNQAPLNT 929

Query: 423  -------DDGI----------------EDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNS 482
                     GI                 +P T  QA+KD   ++W  AM  E+ +   N 
Sbjct: 930  HSMGTRAKAGIIKPNPKYSLAVSLAAESEPRTAIQALKD---ERWRNAMGSEINAQIGNH 989

Query: 483  VWTLV-DQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAM 542
             W LV   P+ V  +GC+WI+ +K +  G +  +KARLVAKGY QR G+DY ETFSPV  
Sbjct: 990  TWDLVPPPPSHVTIVGCRWIFTKKYNSDGSLNRYKARLVAKGYNQRPGLDYAETFSPVIK 1049

Query: 543  LKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSI 602
              SIRI+L +A    + I Q+DV  AFL G L + +YMSQP GFI++D+   VCKL+K++
Sbjct: 1050 STSIRIVLGVAVDRSWPIRQLDVNNAFLQGTLTDDVYMSQPPGFIDKDRPNYVCKLRKAL 1109

Query: 603  YGLKQASRSWNIRFDTAIKSYGFEQNIDEPCVYKKVVNSIIAFLVLYVDDILLIGNDVGY 662
            YGLKQA R+W +     + + GF  ++ +  ++       I ++++YVDDIL+ GND   
Sbjct: 1110 YGLKQAPRAWYVELRNYLLTIGFVNSVSDTSLFVLQRGKSIVYMLVYVDDILITGNDPTL 1169

Query: 663  LTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKG 722
            L +    L+ +F +KD  +  Y LGI+    R    L +SQ  YI  +L+R  M  +K  
Sbjct: 1170 LHNTLDNLSQRFSVKDHEELHYFLGIE--AKRVPTGLHLSQRRYILDLLARTNMITAKPV 1229

Query: 723  LLPYRYGIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPDICYSVGMVSRYQS 782
              P      LS     K     E      Y   VGSL Y +  TRPDI Y+V  +S++  
Sbjct: 1230 TTPMAPSPKLSLYSGTKLTDPTE------YRGIVGSLQY-LAFTRPDISYAVNRLSQFMH 1289

Query: 783  NPGRDHWTAVKNILKYLRRTKDY-MLMYGTKDLILTGYTDSDFQTDKDARKSTSGSVFTL 788
             P  +H  A+K IL+YL  T ++ + +     L L  Y+D+D+  DKD   ST+G +  L
Sbjct: 1290 MPTEEHLQALKRILRYLAGTPNHGIFLKKGNTLSLHAYSDADWAGDKDDYVSTNGYIVYL 1349

BLAST of CSPI05G14310 vs. ExPASy Swiss-Prot
Match: P25600 (Putative transposon Ty5-1 protein YCL074W OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=TY5A PE=5 SV=2)

HSP 1 Score: 149.4 bits (376), Expect = 1.7e-34
Identity = 103/314 (32.80%), Postives = 149/314 (47.45%), Query Frame = 0

Query: 431 MDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFDTAIKS 490
           MDV TAFLN  ++E IY+ QP GF+ +     V +L   +YGLKQA   WN   +  +K 
Sbjct: 1   MDVDTAFLNSTMDEPIYVKQPPGFVNERNPDYVWELYGGMYGLKQAPLLWNEHINNTLKK 60

Query: 491 YGFEQNIDEPCVYKKVVNSIIAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDA 550
            GF ++  E  +Y +  +    ++ +YVDD+L+          +K+ L   + MKDLG  
Sbjct: 61  IGFCRHEGEHGLYFRSTSDGPIYIAVYVDDLLVAAPSPKIYDRVKQELTKLYSMKDLGKV 120

Query: 551 QYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLPYRYGIHLSKEQCPKTPQ 610
              LG+ I     N  + +S   YI K  S  ++   K    P    +  SK     T  
Sbjct: 121 DKFLGLNI-HQSSNGDITLSLQDYIAKAASESEINTFKLTQTP----LCNSKPLFETTSP 180

Query: 611 EVEDMRNIPYASAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDHWTAVKNILKYLRRT 670
            ++D+   PY S VG L++     RPDI Y V ++SR+   P   H  + + +L+YL  T
Sbjct: 181 HLKDI--TPYQSIVGQLLFCANTGRPDISYPVSLLSRFLREPRAIHLESARRVLRYLYTT 240

Query: 671 KDYMLMYGT-KDLILTGYTDSDFQTDKDARKSTSGSVFTLNGGAVVWRSIK-QTCIADST 730
           +   L Y +   L LT Y D+      D   ST G V  L G  V W S K +  I   +
Sbjct: 241 RSMCLKYRSGSQLALTVYCDASHGAIHDLPHSTGGYVTLLAGAPVTWSSKKLKGVIPVPS 300

Query: 731 MEAEYVAACEAAKE 743
            EAEY+ A E   E
Sbjct: 301 TEAEYITASETVME 307

BLAST of CSPI05G14310 vs. ExPASy TrEMBL
Match: E2GK51 (Gag/pol protein (Fragment) OS=Bryonia dioica OX=3652 PE=4 SV=1)

HSP 1 Score: 1443.7 bits (3736), Expect = 0.0e+00
Identity = 700/791 (88.50%), Postives = 750/791 (94.82%), Query Frame = 0

Query: 1    MTKRSFTGKGLRAKGPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSN 60
            MTKRSFTGKGLRAK PLEL+HSDLCGPMNVKARGGYEYFISFIDD+SRYGH+YL+HHKS 
Sbjct: 485  MTKRSFTGKGLRAKVPLELVHSDLCGPMNVKARGGYEYFISFIDDFSRYGHVYLLHHKSE 544

Query: 61   SLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLSAPSTPQQNGV 120
            S EKFKEYKAEVENE+GKTIK LRSDRGGEYMD +F+DYLIE GIQSQLSAPSTPQQNGV
Sbjct: 545  SFEKFKEYKAEVENEIGKTIKTLRSDRGGEYMDSKFQDYLIEFGIQSQLSAPSTPQQNGV 604

Query: 121  SERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSL 180
            SERRNRTLLDMVRSMMS++Q+ DSFWGYALETA +ILNNVPSKSV ETPYELWKGRK SL
Sbjct: 605  SERRNRTLLDMVRSMMSYAQLPDSFWGYALETAIHILNNVPSKSVLETPYELWKGRKSSL 664

Query: 181  RHFRIWGCPAHVLVQNPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEE 240
            R+FRIWGCPAHVLVQNPKKLE RSKLC F+GYPKESRGGLFY PQENK+FVSTNATFLEE
Sbjct: 665  RYFRIWGCPAHVLVQNPKKLEPRSKLCLFVGYPKESRGGLFYHPQENKVFVSTNATFLEE 724

Query: 241  DHIRDHQPRSKLVLKEISKSAIDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRVVHQP 300
            DH R+HQPRSK+VLKE+ K+A DKPSSSTKVVDK   S QSH SQ+LR PRRSGRVVHQP
Sbjct: 725  DHXRNHQPRSKIVLKEMFKNATDKPSSSTKVVDKANISDQSHTSQELRVPRRSGRVVHQP 784

Query: 301  DRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPN 360
            +RYLGL+ETQ++IPDDG+EDPLTYKQAM DVDRDQWIKAM+LEMESMYFNSVWTLVD P+
Sbjct: 785  NRYLGLVETQIIIPDDGVEDPLTYKQAMNDVDRDQWIKAMNLEMESMYFNSVWTLVDLPS 844

Query: 361  DVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSI 420
            DVKPIGCKWIYKRKRD AGKVQTFKARLVAKGYTQ+EGVDYEETFSPVAMLKSIRILLSI
Sbjct: 845  DVKPIGCKWIYKRKRDQAGKVQTFKARLVAKGYTQKEGVDYEETFSPVAMLKSIRILLSI 904

Query: 421  ATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSW 480
            ATFY+YEIWQMDVKTAFLNGNLEESIYM QPEGFI QDQEQKVCKL+KSIYGLKQASRSW
Sbjct: 905  ATFYNYEIWQMDVKTAFLNGNLEESIYMVQPEGFIAQDQEQKVCKLQKSIYGLKQASRSW 964

Query: 481  NIRFDTAIKSYGFEQNIDEPCVYKKVVNSIIAFLVLYVDDILLIGNDVGYLTDIKKWLAM 540
            NIRFDTAIKSYGFEQN+DEPCVYKK+VNS++AFL+LYVDDILLIGNDV YLTD+KKWL  
Sbjct: 965  NIRFDTAIKSYGFEQNVDEPCVYKKIVNSVVAFLILYVDDILLIGNDVEYLTDVKKWLNT 1024

Query: 541  QFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLPYRYGIHL 600
            QFQMKDLG+AQY+LGIQIVRNRKNKTLAMSQASYIDK+LSRYKMQNSKKG LP+R+GIHL
Sbjct: 1025 QFQMKDLGEAQYILGIQIVRNRKNKTLAMSQASYIDKVLSRYKMQNSKKGQLPFRHGIHL 1084

Query: 601  SKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDHWTAV 660
            SKEQCPKTPQEVEDMRNIPY+SAVGSLMYAMLCTRPDICYSVG+VSRYQSNPGRDHWTAV
Sbjct: 1085 SKEQCPKTPQEVEDMRNIPYSSAVGSLMYAMLCTRPDICYSVGIVSRYQSNPGRDHWTAV 1144

Query: 661  KNILKYLRRTKDYMLMYGTKDLILTGYTDSDFQTDKDARKSTSGSVFTLNGGAVVWRSIK 720
            KNILKYLRRT++YML+YG KDLILTGYTDSDFQ+DKDARKSTSGSVFTLNGGAVVWRS+K
Sbjct: 1145 KNILKYLRRTRNYMLVYGAKDLILTGYTDSDFQSDKDARKSTSGSVFTLNGGAVVWRSVK 1204

Query: 721  QTCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMHLPITLYCDNSGAVENSRNIR 780
            QTCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMHLPITLYCDNSGAV NS+  R
Sbjct: 1205 QTCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMHLPITLYCDNSGAVANSKEPR 1264

Query: 781  IADQISHPKGQ 792
                 SH +G+
Sbjct: 1265 -----SHKRGK 1270

BLAST of CSPI05G14310 vs. ExPASy TrEMBL
Match: A0A5A7TZD0 (Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold1163G00090 PE=4 SV=1)

HSP 1 Score: 1358.2 bits (3514), Expect = 0.0e+00
Identity = 665/794 (83.75%), Postives = 725/794 (91.31%), Query Frame = 0

Query: 1    MTKRSFTGKGLRAKGPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSN 60
            MTKR FTGKG RAK PLELIHSDLCGPMNVKARGG+EYFISFIDDYSRYG++YL+ HKS 
Sbjct: 391  MTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSE 450

Query: 61   SLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLSAPSTPQQNGV 120
            +LEKFKEYK EVEN L K IKILRSDRGGEYMDLRF+DY+IE+GIQSQLSAP TPQQNGV
Sbjct: 451  ALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGV 510

Query: 121  SERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSL 180
            SERRNRTLLDMVRSMMS++Q+  SFWGYA+ETA +ILNNVPSKSVSETP+ELW+GRK SL
Sbjct: 511  SERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSL 570

Query: 181  RHFRIWGCPAHVLVQNPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEE 240
             HFRIWGCPAHVLV NPKKLE RS+LC F+GYPKE+RGGLF+DPQEN++FVSTNATFLEE
Sbjct: 571  SHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEE 630

Query: 241  DHIRDHQPRSKLVLKEI---SKSAIDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRVV 300
            DH+R+H+PRSKLVL E    S   +D+   S++ VD+T  SGQSHPSQ LR PRRSGRVV
Sbjct: 631  DHMRNHKPRSKLVLSEATDESTRVVDEVGPSSR-VDETTTSGQSHPSQSLRMPRRSGRVV 690

Query: 301  HQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVD 360
             QP+RYLGL ETQVVIPDDG+EDPL+YKQAM DVD+DQW+KAMDLEMESMYFNSVW LVD
Sbjct: 691  SQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVD 750

Query: 361  QPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRIL 420
             P  VKPIGCKWIYKRKRD AGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRIL
Sbjct: 751  LPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRIL 810

Query: 421  LSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQAS 480
            LSIATFYDYEIWQMDVKTAFLNGNLEESI+MSQPEGFI Q QEQKVCKL +SIYGLKQAS
Sbjct: 811  LSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQAS 870

Query: 481  RSWNIRFDTAIKSYGFEQNIDEPCVYKKVVNSIIAFLVLYVDDILLIGNDVGYLTDIKKW 540
            RSWNIRFDTAIKSYGF+QN+DEPCVYKK+    +AFLVLYVDDILLIGNDVGYLTD+K W
Sbjct: 871  RSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAW 930

Query: 541  LAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLPYRYG 600
            LA QFQMKDLG+AQYVLGIQI+R+RKNKTLA+SQA+YIDK+L RY MQNSKKGLLP+R+G
Sbjct: 931  LAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHG 990

Query: 601  IHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDHW 660
            +HLSKEQ PKTPQEVEDMR IPYASAVGSLMYAMLCTRPDICY+VG+VSRYQSNPG DHW
Sbjct: 991  VHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHW 1050

Query: 661  TAVKNILKYLRRTKDYMLMYGTKDLILTGYTDSDFQTDKDARKSTSGSVFTLNGGAVVWR 720
            TAVK +LKYLRRT+DYML+YG KDLILTGYTDSDFQTDKD+RKSTSGSVFTLNGGAVVWR
Sbjct: 1051 TAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWR 1110

Query: 721  SIKQTCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMHLPITLYCDNSGAVENSR 780
            SIKQ CIADSTMEAEYVAACEAAKEAVWLRKFL DLEVVPNM+LPITLYCDNSGAV NS+
Sbjct: 1111 SIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSK 1170

Query: 781  NIRIADQISHPKGQ 792
              R     SH +G+
Sbjct: 1171 EPR-----SHKRGK 1178

BLAST of CSPI05G14310 vs. ExPASy TrEMBL
Match: A0A5A7UYE8 (Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold430G001570 PE=4 SV=1)

HSP 1 Score: 1358.2 bits (3514), Expect = 0.0e+00
Identity = 665/794 (83.75%), Postives = 725/794 (91.31%), Query Frame = 0

Query: 1    MTKRSFTGKGLRAKGPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSN 60
            MTKR FTGKG RAK PLELIHSDLCGPMNVKARGG+EYFISFIDDYSRYG++YL+ HKS 
Sbjct: 265  MTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSE 324

Query: 61   SLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLSAPSTPQQNGV 120
            +LEKFKEYK EVEN L K IKILRSDRGGEYMDLRF+DY+IE+GIQSQLSAP TPQQNGV
Sbjct: 325  ALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGV 384

Query: 121  SERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSL 180
            SERRNRTLLDMVRSMMS++Q+  SFWGYA+ETA +ILNNVPSKSVSETP+ELW+GRK SL
Sbjct: 385  SERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSL 444

Query: 181  RHFRIWGCPAHVLVQNPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEE 240
             HFRIWGCPAHVLV NPKKLE RS+LC F+GYPKE+RGGLF+DPQEN++FVSTNATFLEE
Sbjct: 445  SHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEE 504

Query: 241  DHIRDHQPRSKLVLKEI---SKSAIDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRVV 300
            DH+R+H+PRSKLVL E    S   +D+   S++ VD+T  SGQSHPSQ LR PRRSGRVV
Sbjct: 505  DHMRNHKPRSKLVLSEATDESTRVVDEVGPSSR-VDETTTSGQSHPSQSLRMPRRSGRVV 564

Query: 301  HQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVD 360
             QP+RYLGL ETQVVIPDDG+EDPL+YKQAM DVD+DQW+KAMDLEMESMYFNSVW LVD
Sbjct: 565  SQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVD 624

Query: 361  QPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRIL 420
             P  VKPIGCKWIYKRKRD AGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRIL
Sbjct: 625  LPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRIL 684

Query: 421  LSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQAS 480
            LSIATFYDYEIWQMDVKTAFLNGNLEESI+MSQPEGFI Q QEQKVCKL +SIYGLKQAS
Sbjct: 685  LSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQAS 744

Query: 481  RSWNIRFDTAIKSYGFEQNIDEPCVYKKVVNSIIAFLVLYVDDILLIGNDVGYLTDIKKW 540
            RSWNIRFDTAIKSYGF+QN+DEPCVYKK+    +AFLVLYVDDILLIGNDVGYLTD+K W
Sbjct: 745  RSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAW 804

Query: 541  LAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLPYRYG 600
            LA QFQMKDLG+AQYVLGIQI+R+RKNKTLA+SQA+YIDK+L RY MQNSKKGLLP+R+G
Sbjct: 805  LAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHG 864

Query: 601  IHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDHW 660
            +HLSKEQ PKTPQEVEDMR IPYASAVGSLMYAMLCTRPDICY+VG+VSRYQSNPG DHW
Sbjct: 865  VHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHW 924

Query: 661  TAVKNILKYLRRTKDYMLMYGTKDLILTGYTDSDFQTDKDARKSTSGSVFTLNGGAVVWR 720
            TAVK +LKYLRRT+DYML+YG KDLILTGYTDSDFQTDKD+RKSTSGSVFTLNGGAVVWR
Sbjct: 925  TAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWR 984

Query: 721  SIKQTCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMHLPITLYCDNSGAVENSR 780
            SIKQ CIADSTMEAEYVAACEAAKEAVWLRKFL DLEVVPNM+LPITLYCDNSGAV NS+
Sbjct: 985  SIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSK 1044

Query: 781  NIRIADQISHPKGQ 792
              R     SH +G+
Sbjct: 1045 EPR-----SHKRGK 1052

BLAST of CSPI05G14310 vs. ExPASy TrEMBL
Match: A0A5A7T2V9 (Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold56G00760 PE=4 SV=1)

HSP 1 Score: 1334.3 bits (3452), Expect = 0.0e+00
Identity = 654/794 (82.37%), Postives = 718/794 (90.43%), Query Frame = 0

Query: 1    MTKRSFTGKGLRAKGPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSN 60
            MTKR FTGKG RAK PLELIHSDLCGPMNVKARG +EYFISFIDDYSRYG++YL+ HKS 
Sbjct: 391  MTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGSFEYFISFIDDYSRYGYLYLMEHKSE 450

Query: 61   SLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLSAPSTPQQNGV 120
            +LEKFKEYK EVEN L K IKI RSDRGGEYMDL F+DY+IE+GIQSQLSAP TPQQNGV
Sbjct: 451  ALEKFKEYKTEVENLLSKKIKIFRSDRGGEYMDLIFQDYMIEHGIQSQLSAPGTPQQNGV 510

Query: 121  SERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSL 180
            SERRNRTLLDMVRSMMS++Q+  SFWGYA+ETA +ILNNVPSKSVSETP+ELW+GRK SL
Sbjct: 511  SERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSL 570

Query: 181  RHFRIWGCPAHVLVQNPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEE 240
             HFRIWGCPAHVLV NPKKLE RS+LC F+GYPKE+RGGLF+DP+EN++FVSTNATFLEE
Sbjct: 571  SHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPKENRVFVSTNATFLEE 630

Query: 241  DHIRDHQPRSKLVLKEI---SKSAIDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRVV 300
            DH+R+H+PRSKLVL E    S   +D+   S++ VD+T  SGQSHPSQ LR PRRSGRVV
Sbjct: 631  DHMRNHKPRSKLVLSEATDESTRVVDEVGPSSR-VDETTTSGQSHPSQSLRMPRRSGRVV 690

Query: 301  HQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVD 360
             QP+RYLGL ETQVVIPDDG+EDPL+YKQAM DVD+DQW+KAMDLEMESMYFNSVW LVD
Sbjct: 691  SQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVD 750

Query: 361  QPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRIL 420
             P  VKPIGCKWIYKRKRD AGKVQTFKARLVAKGYT++EGVDYEETFS VAMLKSIRIL
Sbjct: 751  LPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTRKEGVDYEETFSSVAMLKSIRIL 810

Query: 421  LSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQAS 480
            LSIA FYDYEIWQMDVKTAFLNGNLEESI+MSQPEGFI Q QEQKVCKL +SIYGLKQAS
Sbjct: 811  LSIAKFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQAS 870

Query: 481  RSWNIRFDTAIKSYGFEQNIDEPCVYKKVVNSIIAFLVLYVDDILLIGNDVGYLTDIKKW 540
            RSWNIRFDTAIKSYGF+QN+DEPCVYKK+    +AFLVLYVDDILLIGNDVGYLTD+K W
Sbjct: 871  RSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAW 930

Query: 541  LAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLPYRYG 600
            LA QFQMKDLG+ QYVLGIQI+R+RKNKTLA+SQA+YIDK+L RY MQNSKKGLLP+R+G
Sbjct: 931  LAAQFQMKDLGEGQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHG 990

Query: 601  IHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDHW 660
            +HLSKEQ PKTPQEVEDMR IPYASAVGSLMYAMLCTRPDICY+VG+VSRYQSNPG DHW
Sbjct: 991  VHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHW 1050

Query: 661  TAVKNILKYLRRTKDYMLMYGTKDLILTGYTDSDFQTDKDARKSTSGSVFTLNGGAVVWR 720
            TAVK ILKYLRRT+DYML+YG KDLILTGYT+SDFQTDKD+RKSTS SVFTLNGGAVVWR
Sbjct: 1051 TAVKIILKYLRRTRDYMLVYGAKDLILTGYTNSDFQTDKDSRKSTSRSVFTLNGGAVVWR 1110

Query: 721  SIKQTCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMHLPITLYCDNSGAVENSR 780
            SIKQ CIADSTMEAEYVAACEAAKEAVWL+KFL DLEVVPNM+LPITLYCDNSGAV NS+
Sbjct: 1111 SIKQGCIADSTMEAEYVAACEAAKEAVWLKKFLHDLEVVPNMNLPITLYCDNSGAVANSK 1170

Query: 781  NIRIADQISHPKGQ 792
              R     SH +G+
Sbjct: 1171 EPR-----SHKRGK 1178

BLAST of CSPI05G14310 vs. ExPASy TrEMBL
Match: A0A5D3CZY3 (Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold1032G00460 PE=4 SV=1)

HSP 1 Score: 1309.3 bits (3387), Expect = 0.0e+00
Identity = 640/790 (81.01%), Postives = 708/790 (89.62%), Query Frame = 0

Query: 1   MTKRSFTGKGLRAKGPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSN 60
           MTKR FTGKG RAK PLELIHSDLCGPMNVKARGG+EYFISFIDDYSRYG++YL+ HKS 
Sbjct: 1   MTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSE 60

Query: 61  SLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLSAPSTPQQNGV 120
           +LEKFKEYK EVEN L K IKILRSDRGGEYMDLRF+DY+IE+GIQ QLSAP TPQQNGV
Sbjct: 61  ALEKFKEYKNEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQIQLSAPGTPQQNGV 120

Query: 121 SERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSL 180
            ERRNRT+LDMVRSMMS++Q+  SFWGYA+ETA +ILNNV SKSVSETP+ELW+GRK SL
Sbjct: 121 LERRNRTMLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVSSKSVSETPFELWRGRKPSL 180

Query: 181 RHFRIWGCPAHVLVQNPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEE 240
            HF+I GCPAHVLV NPKKLE RS+LC F+GYPKE+RGGLF+DPQ+N++ VSTNATFLEE
Sbjct: 181 SHFKILGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQKNRVLVSTNATFLEE 240

Query: 241 DHIRDHQPRSKLVLKEI---SKSAIDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRVV 300
           DH+RDH+P++KLVL E    S   +D+   S++ V++T  SGQSHPSQ LR PRRSGR+V
Sbjct: 241 DHMRDHKPQNKLVLNEAIDESTRVVDEVGPSSR-VNETTTSGQSHPSQSLRMPRRSGRIV 300

Query: 301 HQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVD 360
            QP+RYLGL ETQVVIPDDG+EDPL+Y QAM DVD+DQW+KAMDLEMESMYFN +W LVD
Sbjct: 301 SQPNRYLGLTETQVVIPDDGVEDPLSYNQAMNDVDKDQWVKAMDLEMESMYFNLMWELVD 360

Query: 361 QPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRIL 420
            P  VKPIGCKWIYKRKRD AGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRIL
Sbjct: 361 LPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRIL 420

Query: 421 LSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQAS 480
           LSIATFYDYEIW+MDV TAFLNGNLEESI+MSQPEGFI Q QEQKVCKL +SIYGLKQAS
Sbjct: 421 LSIATFYDYEIWKMDVNTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQAS 480

Query: 481 RSWNIRFDTAIKSYGFEQNIDEPCVYKKVVNSIIAFLVLYVDDILLIGNDVGYLTDIKKW 540
           RSWNIRFDTAIKSYGFEQN+DEPCVYKK+    + FLVLYVDDILLIGNDVGYLTD+K W
Sbjct: 481 RSWNIRFDTAIKSYGFEQNVDEPCVYKKINKGKVVFLVLYVDDILLIGNDVGYLTDVKAW 540

Query: 541 LAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLPYRYG 600
           LA QFQMKDLG+AQYVLGIQI+R+RKNKTLA+SQA+YIDKML RY MQNSKKGLLP+R+G
Sbjct: 541 LAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKMLVRYSMQNSKKGLLPFRHG 600

Query: 601 IHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDHW 660
           +HLSKEQCPKTPQEVEDMR IPYASAVGSLMY + CTR +ICY+V +VSRYQSN G DHW
Sbjct: 601 VHLSKEQCPKTPQEVEDMRRIPYASAVGSLMYVIFCTRLEICYAVRIVSRYQSNLGLDHW 660

Query: 661 TAVKNILKYLRRTKDYMLMYGTKDLILTGYTDSDFQTDKDARKSTSGSVFTLNGGAVVWR 720
           TAVK ILKYLRRT+DYML+YG KDLILTGYTDSDFQT+KD+RKSTS SVFTLNGGA+VWR
Sbjct: 661 TAVKIILKYLRRTRDYMLVYGAKDLILTGYTDSDFQTNKDSRKSTSRSVFTLNGGAIVWR 720

Query: 721 SIKQTCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMHLPITLYCDNSGAVENSR 780
           SIKQ CIADSTMEAEYVAACEAAKEAVWLRKFL DLEVVPNM+LPITLYCDNSGAV NS+
Sbjct: 721 SIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSK 780

Query: 781 NIRIADQISH 788
             R   +  H
Sbjct: 781 EPRSHKREKH 789

BLAST of CSPI05G14310 vs. NCBI nr
Match: ADJ18449.1 (gag/pol protein, partial [Bryonia dioica])

HSP 1 Score: 1443.7 bits (3736), Expect = 0.0e+00
Identity = 700/791 (88.50%), Postives = 750/791 (94.82%), Query Frame = 0

Query: 1    MTKRSFTGKGLRAKGPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSN 60
            MTKRSFTGKGLRAK PLEL+HSDLCGPMNVKARGGYEYFISFIDD+SRYGH+YL+HHKS 
Sbjct: 485  MTKRSFTGKGLRAKVPLELVHSDLCGPMNVKARGGYEYFISFIDDFSRYGHVYLLHHKSE 544

Query: 61   SLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLSAPSTPQQNGV 120
            S EKFKEYKAEVENE+GKTIK LRSDRGGEYMD +F+DYLIE GIQSQLSAPSTPQQNGV
Sbjct: 545  SFEKFKEYKAEVENEIGKTIKTLRSDRGGEYMDSKFQDYLIEFGIQSQLSAPSTPQQNGV 604

Query: 121  SERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSL 180
            SERRNRTLLDMVRSMMS++Q+ DSFWGYALETA +ILNNVPSKSV ETPYELWKGRK SL
Sbjct: 605  SERRNRTLLDMVRSMMSYAQLPDSFWGYALETAIHILNNVPSKSVLETPYELWKGRKSSL 664

Query: 181  RHFRIWGCPAHVLVQNPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEE 240
            R+FRIWGCPAHVLVQNPKKLE RSKLC F+GYPKESRGGLFY PQENK+FVSTNATFLEE
Sbjct: 665  RYFRIWGCPAHVLVQNPKKLEPRSKLCLFVGYPKESRGGLFYHPQENKVFVSTNATFLEE 724

Query: 241  DHIRDHQPRSKLVLKEISKSAIDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRVVHQP 300
            DH R+HQPRSK+VLKE+ K+A DKPSSSTKVVDK   S QSH SQ+LR PRRSGRVVHQP
Sbjct: 725  DHXRNHQPRSKIVLKEMFKNATDKPSSSTKVVDKANISDQSHTSQELRVPRRSGRVVHQP 784

Query: 301  DRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPN 360
            +RYLGL+ETQ++IPDDG+EDPLTYKQAM DVDRDQWIKAM+LEMESMYFNSVWTLVD P+
Sbjct: 785  NRYLGLVETQIIIPDDGVEDPLTYKQAMNDVDRDQWIKAMNLEMESMYFNSVWTLVDLPS 844

Query: 361  DVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSI 420
            DVKPIGCKWIYKRKRD AGKVQTFKARLVAKGYTQ+EGVDYEETFSPVAMLKSIRILLSI
Sbjct: 845  DVKPIGCKWIYKRKRDQAGKVQTFKARLVAKGYTQKEGVDYEETFSPVAMLKSIRILLSI 904

Query: 421  ATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSW 480
            ATFY+YEIWQMDVKTAFLNGNLEESIYM QPEGFI QDQEQKVCKL+KSIYGLKQASRSW
Sbjct: 905  ATFYNYEIWQMDVKTAFLNGNLEESIYMVQPEGFIAQDQEQKVCKLQKSIYGLKQASRSW 964

Query: 481  NIRFDTAIKSYGFEQNIDEPCVYKKVVNSIIAFLVLYVDDILLIGNDVGYLTDIKKWLAM 540
            NIRFDTAIKSYGFEQN+DEPCVYKK+VNS++AFL+LYVDDILLIGNDV YLTD+KKWL  
Sbjct: 965  NIRFDTAIKSYGFEQNVDEPCVYKKIVNSVVAFLILYVDDILLIGNDVEYLTDVKKWLNT 1024

Query: 541  QFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLPYRYGIHL 600
            QFQMKDLG+AQY+LGIQIVRNRKNKTLAMSQASYIDK+LSRYKMQNSKKG LP+R+GIHL
Sbjct: 1025 QFQMKDLGEAQYILGIQIVRNRKNKTLAMSQASYIDKVLSRYKMQNSKKGQLPFRHGIHL 1084

Query: 601  SKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDHWTAV 660
            SKEQCPKTPQEVEDMRNIPY+SAVGSLMYAMLCTRPDICYSVG+VSRYQSNPGRDHWTAV
Sbjct: 1085 SKEQCPKTPQEVEDMRNIPYSSAVGSLMYAMLCTRPDICYSVGIVSRYQSNPGRDHWTAV 1144

Query: 661  KNILKYLRRTKDYMLMYGTKDLILTGYTDSDFQTDKDARKSTSGSVFTLNGGAVVWRSIK 720
            KNILKYLRRT++YML+YG KDLILTGYTDSDFQ+DKDARKSTSGSVFTLNGGAVVWRS+K
Sbjct: 1145 KNILKYLRRTRNYMLVYGAKDLILTGYTDSDFQSDKDARKSTSGSVFTLNGGAVVWRSVK 1204

Query: 721  QTCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMHLPITLYCDNSGAVENSRNIR 780
            QTCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMHLPITLYCDNSGAV NS+  R
Sbjct: 1205 QTCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMHLPITLYCDNSGAVANSKEPR 1264

Query: 781  IADQISHPKGQ 792
                 SH +G+
Sbjct: 1265 -----SHKRGK 1270

BLAST of CSPI05G14310 vs. NCBI nr
Match: KAA0025945.1 (gag/pol protein [Cucumis melo var. makuwa] >KAA0026303.1 gag/pol protein [Cucumis melo var. makuwa] >KAA0035786.1 gag/pol protein [Cucumis melo var. makuwa] >KAA0040492.1 gag/pol protein [Cucumis melo var. makuwa] >KAA0041262.1 gag/pol protein [Cucumis melo var. makuwa])

HSP 1 Score: 1358.2 bits (3514), Expect = 0.0e+00
Identity = 665/794 (83.75%), Postives = 725/794 (91.31%), Query Frame = 0

Query: 1    MTKRSFTGKGLRAKGPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSN 60
            MTKR FTGKG RAK PLELIHSDLCGPMNVKARGG+EYFISFIDDYSRYG++YL+ HKS 
Sbjct: 391  MTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSE 450

Query: 61   SLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLSAPSTPQQNGV 120
            +LEKFKEYK EVEN L K IKILRSDRGGEYMDLRF+DY+IE+GIQSQLSAP TPQQNGV
Sbjct: 451  ALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGV 510

Query: 121  SERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSL 180
            SERRNRTLLDMVRSMMS++Q+  SFWGYA+ETA +ILNNVPSKSVSETP+ELW+GRK SL
Sbjct: 511  SERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSL 570

Query: 181  RHFRIWGCPAHVLVQNPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEE 240
             HFRIWGCPAHVLV NPKKLE RS+LC F+GYPKE+RGGLF+DPQEN++FVSTNATFLEE
Sbjct: 571  SHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEE 630

Query: 241  DHIRDHQPRSKLVLKEI---SKSAIDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRVV 300
            DH+R+H+PRSKLVL E    S   +D+   S++ VD+T  SGQSHPSQ LR PRRSGRVV
Sbjct: 631  DHMRNHKPRSKLVLSEATDESTRVVDEVGPSSR-VDETTTSGQSHPSQSLRMPRRSGRVV 690

Query: 301  HQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVD 360
             QP+RYLGL ETQVVIPDDG+EDPL+YKQAM DVD+DQW+KAMDLEMESMYFNSVW LVD
Sbjct: 691  SQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVD 750

Query: 361  QPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRIL 420
             P  VKPIGCKWIYKRKRD AGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRIL
Sbjct: 751  LPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRIL 810

Query: 421  LSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQAS 480
            LSIATFYDYEIWQMDVKTAFLNGNLEESI+MSQPEGFI Q QEQKVCKL +SIYGLKQAS
Sbjct: 811  LSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQAS 870

Query: 481  RSWNIRFDTAIKSYGFEQNIDEPCVYKKVVNSIIAFLVLYVDDILLIGNDVGYLTDIKKW 540
            RSWNIRFDTAIKSYGF+QN+DEPCVYKK+    +AFLVLYVDDILLIGNDVGYLTD+K W
Sbjct: 871  RSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAW 930

Query: 541  LAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLPYRYG 600
            LA QFQMKDLG+AQYVLGIQI+R+RKNKTLA+SQA+YIDK+L RY MQNSKKGLLP+R+G
Sbjct: 931  LAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHG 990

Query: 601  IHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDHW 660
            +HLSKEQ PKTPQEVEDMR IPYASAVGSLMYAMLCTRPDICY+VG+VSRYQSNPG DHW
Sbjct: 991  VHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHW 1050

Query: 661  TAVKNILKYLRRTKDYMLMYGTKDLILTGYTDSDFQTDKDARKSTSGSVFTLNGGAVVWR 720
            TAVK +LKYLRRT+DYML+YG KDLILTGYTDSDFQTDKD+RKSTSGSVFTLNGGAVVWR
Sbjct: 1051 TAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWR 1110

Query: 721  SIKQTCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMHLPITLYCDNSGAVENSR 780
            SIKQ CIADSTMEAEYVAACEAAKEAVWLRKFL DLEVVPNM+LPITLYCDNSGAV NS+
Sbjct: 1111 SIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSK 1170

Query: 781  NIRIADQISHPKGQ 792
              R     SH +G+
Sbjct: 1171 EPR-----SHKRGK 1178

BLAST of CSPI05G14310 vs. NCBI nr
Match: KAA0059226.1 (gag/pol protein [Cucumis melo var. makuwa])

HSP 1 Score: 1358.2 bits (3514), Expect = 0.0e+00
Identity = 665/794 (83.75%), Postives = 725/794 (91.31%), Query Frame = 0

Query: 1    MTKRSFTGKGLRAKGPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSN 60
            MTKR FTGKG RAK PLELIHSDLCGPMNVKARGG+EYFISFIDDYSRYG++YL+ HKS 
Sbjct: 265  MTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSE 324

Query: 61   SLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLSAPSTPQQNGV 120
            +LEKFKEYK EVEN L K IKILRSDRGGEYMDLRF+DY+IE+GIQSQLSAP TPQQNGV
Sbjct: 325  ALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGV 384

Query: 121  SERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSL 180
            SERRNRTLLDMVRSMMS++Q+  SFWGYA+ETA +ILNNVPSKSVSETP+ELW+GRK SL
Sbjct: 385  SERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSL 444

Query: 181  RHFRIWGCPAHVLVQNPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEE 240
             HFRIWGCPAHVLV NPKKLE RS+LC F+GYPKE+RGGLF+DPQEN++FVSTNATFLEE
Sbjct: 445  SHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEE 504

Query: 241  DHIRDHQPRSKLVLKEI---SKSAIDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRVV 300
            DH+R+H+PRSKLVL E    S   +D+   S++ VD+T  SGQSHPSQ LR PRRSGRVV
Sbjct: 505  DHMRNHKPRSKLVLSEATDESTRVVDEVGPSSR-VDETTTSGQSHPSQSLRMPRRSGRVV 564

Query: 301  HQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVD 360
             QP+RYLGL ETQVVIPDDG+EDPL+YKQAM DVD+DQW+KAMDLEMESMYFNSVW LVD
Sbjct: 565  SQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVD 624

Query: 361  QPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRIL 420
             P  VKPIGCKWIYKRKRD AGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRIL
Sbjct: 625  LPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRIL 684

Query: 421  LSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQAS 480
            LSIATFYDYEIWQMDVKTAFLNGNLEESI+MSQPEGFI Q QEQKVCKL +SIYGLKQAS
Sbjct: 685  LSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQAS 744

Query: 481  RSWNIRFDTAIKSYGFEQNIDEPCVYKKVVNSIIAFLVLYVDDILLIGNDVGYLTDIKKW 540
            RSWNIRFDTAIKSYGF+QN+DEPCVYKK+    +AFLVLYVDDILLIGNDVGYLTD+K W
Sbjct: 745  RSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAW 804

Query: 541  LAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLPYRYG 600
            LA QFQMKDLG+AQYVLGIQI+R+RKNKTLA+SQA+YIDK+L RY MQNSKKGLLP+R+G
Sbjct: 805  LAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHG 864

Query: 601  IHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDHW 660
            +HLSKEQ PKTPQEVEDMR IPYASAVGSLMYAMLCTRPDICY+VG+VSRYQSNPG DHW
Sbjct: 865  VHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHW 924

Query: 661  TAVKNILKYLRRTKDYMLMYGTKDLILTGYTDSDFQTDKDARKSTSGSVFTLNGGAVVWR 720
            TAVK +LKYLRRT+DYML+YG KDLILTGYTDSDFQTDKD+RKSTSGSVFTLNGGAVVWR
Sbjct: 925  TAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWR 984

Query: 721  SIKQTCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMHLPITLYCDNSGAVENSR 780
            SIKQ CIADSTMEAEYVAACEAAKEAVWLRKFL DLEVVPNM+LPITLYCDNSGAV NS+
Sbjct: 985  SIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSK 1044

Query: 781  NIRIADQISHPKGQ 792
              R     SH +G+
Sbjct: 1045 EPR-----SHKRGK 1052

BLAST of CSPI05G14310 vs. NCBI nr
Match: KAA0035907.1 (gag/pol protein [Cucumis melo var. makuwa])

HSP 1 Score: 1334.3 bits (3452), Expect = 0.0e+00
Identity = 654/794 (82.37%), Postives = 718/794 (90.43%), Query Frame = 0

Query: 1    MTKRSFTGKGLRAKGPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSN 60
            MTKR FTGKG RAK PLELIHSDLCGPMNVKARG +EYFISFIDDYSRYG++YL+ HKS 
Sbjct: 391  MTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGSFEYFISFIDDYSRYGYLYLMEHKSE 450

Query: 61   SLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLSAPSTPQQNGV 120
            +LEKFKEYK EVEN L K IKI RSDRGGEYMDL F+DY+IE+GIQSQLSAP TPQQNGV
Sbjct: 451  ALEKFKEYKTEVENLLSKKIKIFRSDRGGEYMDLIFQDYMIEHGIQSQLSAPGTPQQNGV 510

Query: 121  SERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSL 180
            SERRNRTLLDMVRSMMS++Q+  SFWGYA+ETA +ILNNVPSKSVSETP+ELW+GRK SL
Sbjct: 511  SERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSL 570

Query: 181  RHFRIWGCPAHVLVQNPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEE 240
             HFRIWGCPAHVLV NPKKLE RS+LC F+GYPKE+RGGLF+DP+EN++FVSTNATFLEE
Sbjct: 571  SHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPKENRVFVSTNATFLEE 630

Query: 241  DHIRDHQPRSKLVLKEI---SKSAIDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRVV 300
            DH+R+H+PRSKLVL E    S   +D+   S++ VD+T  SGQSHPSQ LR PRRSGRVV
Sbjct: 631  DHMRNHKPRSKLVLSEATDESTRVVDEVGPSSR-VDETTTSGQSHPSQSLRMPRRSGRVV 690

Query: 301  HQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVD 360
             QP+RYLGL ETQVVIPDDG+EDPL+YKQAM DVD+DQW+KAMDLEMESMYFNSVW LVD
Sbjct: 691  SQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVD 750

Query: 361  QPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRIL 420
             P  VKPIGCKWIYKRKRD AGKVQTFKARLVAKGYT++EGVDYEETFS VAMLKSIRIL
Sbjct: 751  LPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTRKEGVDYEETFSSVAMLKSIRIL 810

Query: 421  LSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQAS 480
            LSIA FYDYEIWQMDVKTAFLNGNLEESI+MSQPEGFI Q QEQKVCKL +SIYGLKQAS
Sbjct: 811  LSIAKFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQAS 870

Query: 481  RSWNIRFDTAIKSYGFEQNIDEPCVYKKVVNSIIAFLVLYVDDILLIGNDVGYLTDIKKW 540
            RSWNIRFDTAIKSYGF+QN+DEPCVYKK+    +AFLVLYVDDILLIGNDVGYLTD+K W
Sbjct: 871  RSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAW 930

Query: 541  LAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLPYRYG 600
            LA QFQMKDLG+ QYVLGIQI+R+RKNKTLA+SQA+YIDK+L RY MQNSKKGLLP+R+G
Sbjct: 931  LAAQFQMKDLGEGQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHG 990

Query: 601  IHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDHW 660
            +HLSKEQ PKTPQEVEDMR IPYASAVGSLMYAMLCTRPDICY+VG+VSRYQSNPG DHW
Sbjct: 991  VHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHW 1050

Query: 661  TAVKNILKYLRRTKDYMLMYGTKDLILTGYTDSDFQTDKDARKSTSGSVFTLNGGAVVWR 720
            TAVK ILKYLRRT+DYML+YG KDLILTGYT+SDFQTDKD+RKSTS SVFTLNGGAVVWR
Sbjct: 1051 TAVKIILKYLRRTRDYMLVYGAKDLILTGYTNSDFQTDKDSRKSTSRSVFTLNGGAVVWR 1110

Query: 721  SIKQTCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMHLPITLYCDNSGAVENSR 780
            SIKQ CIADSTMEAEYVAACEAAKEAVWL+KFL DLEVVPNM+LPITLYCDNSGAV NS+
Sbjct: 1111 SIKQGCIADSTMEAEYVAACEAAKEAVWLKKFLHDLEVVPNMNLPITLYCDNSGAVANSK 1170

Query: 781  NIRIADQISHPKGQ 792
              R     SH +G+
Sbjct: 1171 EPR-----SHKRGK 1178

BLAST of CSPI05G14310 vs. NCBI nr
Match: KAA0033121.1 (gag/pol protein [Cucumis melo var. makuwa] >TYK17112.1 gag/pol protein [Cucumis melo var. makuwa])

HSP 1 Score: 1309.3 bits (3387), Expect = 0.0e+00
Identity = 640/790 (81.01%), Postives = 708/790 (89.62%), Query Frame = 0

Query: 1   MTKRSFTGKGLRAKGPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSN 60
           MTKR FTGKG RAK PLELIHSDLCGPMNVKARGG+EYFISFIDDYSRYG++YL+ HKS 
Sbjct: 1   MTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSE 60

Query: 61  SLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLSAPSTPQQNGV 120
           +LEKFKEYK EVEN L K IKILRSDRGGEYMDLRF+DY+IE+GIQ QLSAP TPQQNGV
Sbjct: 61  ALEKFKEYKNEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQIQLSAPGTPQQNGV 120

Query: 121 SERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSL 180
            ERRNRT+LDMVRSMMS++Q+  SFWGYA+ETA +ILNNV SKSVSETP+ELW+GRK SL
Sbjct: 121 LERRNRTMLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVSSKSVSETPFELWRGRKPSL 180

Query: 181 RHFRIWGCPAHVLVQNPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEE 240
            HF+I GCPAHVLV NPKKLE RS+LC F+GYPKE+RGGLF+DPQ+N++ VSTNATFLEE
Sbjct: 181 SHFKILGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQKNRVLVSTNATFLEE 240

Query: 241 DHIRDHQPRSKLVLKEI---SKSAIDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRVV 300
           DH+RDH+P++KLVL E    S   +D+   S++ V++T  SGQSHPSQ LR PRRSGR+V
Sbjct: 241 DHMRDHKPQNKLVLNEAIDESTRVVDEVGPSSR-VNETTTSGQSHPSQSLRMPRRSGRIV 300

Query: 301 HQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVD 360
            QP+RYLGL ETQVVIPDDG+EDPL+Y QAM DVD+DQW+KAMDLEMESMYFN +W LVD
Sbjct: 301 SQPNRYLGLTETQVVIPDDGVEDPLSYNQAMNDVDKDQWVKAMDLEMESMYFNLMWELVD 360

Query: 361 QPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRIL 420
            P  VKPIGCKWIYKRKRD AGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRIL
Sbjct: 361 LPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRIL 420

Query: 421 LSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQAS 480
           LSIATFYDYEIW+MDV TAFLNGNLEESI+MSQPEGFI Q QEQKVCKL +SIYGLKQAS
Sbjct: 421 LSIATFYDYEIWKMDVNTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQAS 480

Query: 481 RSWNIRFDTAIKSYGFEQNIDEPCVYKKVVNSIIAFLVLYVDDILLIGNDVGYLTDIKKW 540
           RSWNIRFDTAIKSYGFEQN+DEPCVYKK+    + FLVLYVDDILLIGNDVGYLTD+K W
Sbjct: 481 RSWNIRFDTAIKSYGFEQNVDEPCVYKKINKGKVVFLVLYVDDILLIGNDVGYLTDVKAW 540

Query: 541 LAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLPYRYG 600
           LA QFQMKDLG+AQYVLGIQI+R+RKNKTLA+SQA+YIDKML RY MQNSKKGLLP+R+G
Sbjct: 541 LAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKMLVRYSMQNSKKGLLPFRHG 600

Query: 601 IHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDHW 660
           +HLSKEQCPKTPQEVEDMR IPYASAVGSLMY + CTR +ICY+V +VSRYQSN G DHW
Sbjct: 601 VHLSKEQCPKTPQEVEDMRRIPYASAVGSLMYVIFCTRLEICYAVRIVSRYQSNLGLDHW 660

Query: 661 TAVKNILKYLRRTKDYMLMYGTKDLILTGYTDSDFQTDKDARKSTSGSVFTLNGGAVVWR 720
           TAVK ILKYLRRT+DYML+YG KDLILTGYTDSDFQT+KD+RKSTS SVFTLNGGA+VWR
Sbjct: 661 TAVKIILKYLRRTRDYMLVYGAKDLILTGYTDSDFQTNKDSRKSTSRSVFTLNGGAIVWR 720

Query: 721 SIKQTCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMHLPITLYCDNSGAVENSR 780
           SIKQ CIADSTMEAEYVAACEAAKEAVWLRKFL DLEVVPNM+LPITLYCDNSGAV NS+
Sbjct: 721 SIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSK 780

Query: 781 NIRIADQISH 788
             R   +  H
Sbjct: 781 EPRSHKREKH 789

BLAST of CSPI05G14310 vs. TAIR 10
Match: AT4G23160.1 (cysteine-rich RLK (RECEPTOR-like protein kinase) 8 )

HSP 1 Score: 292.0 bits (746), Expect = 1.5e-78
Identity = 163/474 (34.39%), Postives = 267/474 (56.33%), Query Frame = 0

Query: 319 EDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHA 378
           ++P TY +A + +    W  AMD E+ +M     W +   P + KPIGCKW+YK K +  
Sbjct: 84  KEPSTYNEAKEFL---VWCGAMDDEIGAMETTHTWEICTLPPNKKPIGCKWVYKIKYNSD 143

Query: 379 GKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFL 438
           G ++ +KARLVAKGYTQ+EG+D+ ETFSPV  L S++++L+I+  Y++ + Q+D+  AFL
Sbjct: 144 GTIERYKARLVAKGYTQQEGIDFIETFSPVCKLTSVKLILAISAIYNFTLHQLDISNAFL 203

Query: 439 NGNLEESIYMSQPEGFIEQDQE----QKVCKLKKSIYGLKQASRSWNIRFDTAIKSYGFE 498
           NG+L+E IYM  P G+  +  +      VC LKKSIYGLKQASR W ++F   +  +GF 
Sbjct: 204 NGDLDEEIYMKLPPGYAARQGDSLPPNAVCYLKKSIYGLKQASRQWFLKFSVTLIGFGFV 263

Query: 499 QNIDEPCVYKKVVNSIIAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVL 558
           Q+  +   + K+  ++   +++YVDDI++  N+   + ++K  L   F+++DLG  +Y L
Sbjct: 264 QSHSDHTYFLKITATLFLCVLVYVDDIIICSNNDAAVDELKSQLKSCFKLRDLGPLKYFL 323

Query: 559 GIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLPYRYGIHLSKEQCPKTPQEVED 618
           G++I R+     + + Q  Y   +L    +   K   +P    +  S      +  +  D
Sbjct: 324 GLEIARSAAG--INICQRKYALDLLDETGLLGCKPSSVPMDPSVTFSAH----SGGDFVD 383

Query: 619 MRNIPYASAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDHWTAVKNILKYLRRTKDYM 678
            +   Y   +G LMY  + TR DI ++V  +S++   P   H  AV  IL Y++ T    
Sbjct: 384 AK--AYRRLIGRLMYLQI-TRLDISFAVNKLSQFSEAPRLAHQQAVMKILHYIKGTVGQG 443

Query: 679 LMYGTK-DLILTGYTDSDFQTDKDARKSTSGSVFTLNGGAVVWRSIKQTCIADSTMEAEY 738
           L Y ++ ++ L  ++D+ FQ+ KD R+ST+G    L    + W+S KQ  ++ S+ EAEY
Sbjct: 444 LFYSSQAEMQLQVFSDASFQSCKDTRRSTNGYCMFLGTSLISWKSKKQQVVSKSSAEAEY 503

Query: 739 VAACEAAKEAVWLRKFLTDLEVVPNMHLPITLYCDNSGAVENSRNIRIADQISH 788
            A   A  E +WL +F  +L++   +  P  L+CDN+ A+  + N    ++  H
Sbjct: 504 RALSFATDEMMWLAQFFRELQL--PLSKPTLLFCDNTAAIHIATNAVFHERTKH 543

BLAST of CSPI05G14310 vs. TAIR 10
Match: ATMG00810.1 (DNA/RNA polymerases superfamily protein )

HSP 1 Score: 97.8 bits (242), Expect = 4.1e-20
Identity = 75/236 (31.78%), Postives = 117/236 (49.58%), Query Frame = 0

Query: 513 FLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQA 572
           +L+LYVDDILL G+    L  +   L+  F MKDLG   Y LGIQI  +     L +SQ 
Sbjct: 2   YLLLYVDDILLTGSSNTLLNMLIFQLSSTFSMKDLGPVHYFLGIQIKTHPSG--LFLSQT 61

Query: 573 SYIDKMLSRYKMQNSK--KGLLPYRYGIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYA 632
            Y +++L+   M + K     LP +    +S  + P    +  D R+I     VG+L Y 
Sbjct: 62  KYAEQILNNAGMLDCKPMSTPLPLKLNSSVSTAKYP----DPSDFRSI-----VGALQYL 121

Query: 633 MLCTRPDICYSVGMVSRYQSNPGRDHWTAVKNILKYLRRTKDY-MLMYGTKDLILTGYTD 692
            L TRPDI Y+V +V +    P    +  +K +L+Y++ T  + + ++    L +  + D
Sbjct: 122 TL-TRPDISYAVNIVCQRMHEPTLADFDLLKRVLRYVKGTIFHGLYIHKNSKLNVQAFCD 181

Query: 693 SDFQTDKDARKSTSGSVFTLNGGAVVWRSIKQTCIADSTMEAEYVAACEAAKEAVW 746
           SD+      R+ST+G    L    + W + +Q  ++ S+ E EY A    A E  W
Sbjct: 182 SDWAGCTSTRRSTTGFCTFLGCNIISWSAKRQPTVSRSSTETEYRALALTAAELTW 225

BLAST of CSPI05G14310 vs. TAIR 10
Match: ATMG00820.1 (Reverse transcriptase (RNA-dependent DNA polymerase) )

HSP 1 Score: 82.0 bits (201), Expect = 2.3e-15
Identity = 40/103 (38.83%), Postives = 62/103 (60.19%), Query Frame = 0

Query: 319 EDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHA 378
           ++P +   A+KD     W +AM  E++++  N  W LV  P +   +GCKW++K K    
Sbjct: 26  KEPKSVIFALKD---PGWCQAMQEELDALSRNKTWILVPPPVNQNILGCKWVFKTKLHSD 85

Query: 379 GKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIA 422
           G +   KARLVAKG+ Q EG+ + ET+SPV    +IR +L++A
Sbjct: 86  GTLDRLKARLVAKGFHQEEGIYFVETYSPVVRTATIRTILNVA 125

BLAST of CSPI05G14310 vs. TAIR 10
Match: ATMG00710.1 (Polynucleotidyl transferase, ribonuclease H-like superfamily protein )

HSP 1 Score: 45.8 bits (107), Expect = 1.9e-04
Identity = 26/82 (31.71%), Postives = 46/82 (56.10%), Query Frame = 0

Query: 125 NRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVS-ETPYELWKGRKGSLRHF 184
           NRT+++ VRSM+    +  +F   A  TA +I+N  PS +++   P E+W     +  + 
Sbjct: 2   NRTIIEKVRSMLCECGLPKTFRADAANTAVHIINKYPSTAINFHVPDEVWFQSVPTYSYL 61

Query: 185 RIWGCPAHVLVQNPKKLEHRSK 206
           R +GC A++   +  KL+ R+K
Sbjct: 62  RRFGCVAYIHC-DEGKLKPRAK 82

BLAST of CSPI05G14310 vs. TAIR 10
Match: ATMG00240.1 (Gag-Pol-related retrotransposon family protein )

HSP 1 Score: 45.4 bits (106), Expect = 2.4e-04
Identity = 25/72 (34.72%), Postives = 39/72 (54.17%), Query Frame = 0

Query: 634 TRPDICYSVGMVSRYQSNPGRDHWTAVKNILKYLRRTKDYMLMY-GTKDLILTGYTDSDF 693
           TRPD+ ++V  +S++ S        AV  +L Y++ T    L Y  T DL L  + DSD+
Sbjct: 6   TRPDLTFAVNRLSQFSSASRTAQMQAVYKVLHYVKGTVGQGLFYSATSDLQLKAFADSDW 65

Query: 694 QTDKDARKSTSG 705
            +  D R+S +G
Sbjct: 66  ASCPDTRRSVTG 77

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
P109788.0e-17841.69Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum... [more]
P041463.0e-11630.51Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3[more]
Q9ZT949.9e-10429.58Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana O... [more]
Q94HW23.7e-9828.54Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana O... [more]
P256001.7e-3432.80Putative transposon Ty5-1 protein YCL074W OS=Saccharomyces cerevisiae (strain AT... [more]
Match NameE-valueIdentityDescription
E2GK510.0e+0088.50Gag/pol protein (Fragment) OS=Bryonia dioica OX=3652 PE=4 SV=1[more]
A0A5A7TZD00.0e+0083.75Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold1163G000... [more]
A0A5A7UYE80.0e+0083.75Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold430G0015... [more]
A0A5A7T2V90.0e+0082.37Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold56G00760... [more]
A0A5D3CZY30.0e+0081.01Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold1032G004... [more]
Match NameE-valueIdentityDescription
ADJ18449.10.0e+0088.50gag/pol protein, partial [Bryonia dioica][more]
KAA0025945.10.0e+0083.75gag/pol protein [Cucumis melo var. makuwa] >KAA0026303.1 gag/pol protein [Cucumi... [more]
KAA0059226.10.0e+0083.75gag/pol protein [Cucumis melo var. makuwa][more]
KAA0035907.10.0e+0082.37gag/pol protein [Cucumis melo var. makuwa][more]
KAA0033121.10.0e+0081.01gag/pol protein [Cucumis melo var. makuwa] >TYK17112.1 gag/pol protein [Cucumis ... [more]
Match NameE-valueIdentityDescription
AT4G23160.11.5e-7834.39cysteine-rich RLK (RECEPTOR-like protein kinase) 8 [more]
ATMG00810.14.1e-2031.78DNA/RNA polymerases superfamily protein [more]
ATMG00820.12.3e-1538.83Reverse transcriptase (RNA-dependent DNA polymerase) [more]
ATMG00710.11.9e-0431.71Polynucleotidyl transferase, ribonuclease H-like superfamily protein [more]
ATMG00240.12.4e-0434.72Gag-Pol-related retrotransposon family protein [more]
InterPro
Analysis Name: InterPro Annotations of Cucumber (PI 183967) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR001584Integrase, catalytic corePFAMPF00665rvecoord: 15..115
e-value: 4.5E-11
score: 42.9
IPR001584Integrase, catalytic corePROSITEPS50994INTEGRASEcoord: 12..177
score: 23.973724
IPR036397Ribonuclease H superfamilyGENE3D3.30.420.10coord: 10..186
e-value: 3.5E-41
score: 142.7
IPR013103Reverse transcriptase, RNA-dependent DNA polymerasePFAMPF07727RVT_2coord: 350..593
e-value: 4.1E-74
score: 249.2
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 264..298
NoneNo IPR availablePANTHERPTHR45895FAMILY NOT NAMEDcoord: 11..265
coord: 319..677
NoneNo IPR availableCDDcd09272RNase_HI_RT_Ty1coord: 685..778
e-value: 4.27516E-41
score: 145.304
IPR012337Ribonuclease H-like superfamilySUPERFAMILY53098Ribonuclease H-likecoord: 11..171
IPR043502DNA/RNA polymerase superfamilySUPERFAMILY56672DNA/RNA polymerasescoord: 350..769

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CSPI05G14310.1CSPI05G14310.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0015074 DNA integration
biological_process GO:0006508 proteolysis
molecular_function GO:0008234 cysteine-type peptidase activity
molecular_function GO:0003676 nucleic acid binding
molecular_function GO:0008270 zinc ion binding