Tan0009842 (gene) Snake gourd v1

Overview
NameTan0009842
Typegene
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionGag/pol protein
LocationLG03: 5978916 .. 5981614 (-)
RNA-Seq ExpressionTan0009842
SyntenyTan0009842
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideCDSexon
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGCGAACTAAGAGAATAAAAGTTTCTCTAAAAGAAAGTGCCCATCTTTGGCATCTAAGGTTAGGCCACATAAATCTCAATAAGATTGAGAGACTAGTAATGAGTGGACTTCTAAGCGAGTTGGAAGAAAACTCTTTACCGGTATGTGAGTCATGCCTTGAAGGCAAAATGACCAAACGTCCTTTTAGTGGAAAAGGGTATAGAGCCAAGGAGCCCTTTGAGCTTATACATTCTGACCTCTGTGGTCCGATGAATGTTAAAGCACGAGGAGGTTACGAATACTTCGTATCTTTCATAGATGACTATTTGAGGTATGGGTACATTTACCTAATGTATAGGAAGTCTGAAACACTTGAAAAGTTCAAGGAGTTCAAGACTGAGGTTGAGAACCTGTTAGGTAAAACTATTAAAACACTTCGATCTGATCGAGGTGGAGAGTATATGGACACTGAATTCCAGGACTATATGATAGAACATGGAATTACATCCCAACTCTCAGCACCTAGTATGCCACAACAGAATGGTGTATCGGAGAGGAGAAACAGAACCCTGTTGGACATGGTTCGGTAGATGATGAGCTACGCTCGTCTCCCTGATTCTTCTTGGGGTTACGCAGTGGAGATTGCGGTATACATTTTGAACACAGTTCCGTCGAAAAGTGTTTGTGAAACACCTTTTGAACTTTGGCATGGTCGTAAAATCAGTTTACGTCATTTCAGAATTTGGGGATGCCTGACCCATGTGTTGGTGTCAAACCCGAAAAAGTTGGAATCCCGTTTAAAGTTGTGCCTCTTTGTAGGTTACCCAAAAGAGACTAGGGGTGATATGTTTTTCGATCCTAAGGATAATAGGGTGCTTGTGTCGACAAATGCCACTTTCCTTGAGGAAAACCACATCAGGGATCACTTACCAAGGAGTAAGATTGTGTTGAATGAAATGGACAGTTCATCAGCAAGAGTTGCTGATGGGGCTAGTACGTCAACAAGTGTTGTTGATCCTAGCTCGTCTAGTAAAGTCCGTTCTCAAGAGTTGAGAATGCCTCGACGTAGTGAGAGGGTTGTGAGACAGCTTGAACGTTACATGGGTTTGGCTAAAACCCTGGTCGTCACCCCTGATGATGACTGTGAGGATCCATTGACCTATGATCAGGCAATGGCAAAGGTTGACAAAAATGAATGGATTAAAGCTATGGATCAGGAAATGGAGTCAATGTACTTCAATTCCATCTGGGAGCTTGTGGACCAACCAGATGGGGTTAAACCTATTGGTTGCAAGTGGATCTACAAGCGTAAACGTGACGTAGATGAGAAGGTGCAAACCTTCAAAGCACGACTAGTGGCAAAGAGTTTTACCCAGGTGGAAGAAGTTGACTATAAGGAAACCTTTTCACCTGTTGCCATGGTAAAGTCGATTAGAATCCTTCTAGCTATTGCCGTATATTATGACTATGAGGTATGGAAAATGGATGTCAAGACCGCTTTTATGAATGACAACCTTGACGAAACCATCTACATGGACCAACCCAAGGGGTTCATTGCCAAAGGCCAAGAGCAAAAGGTTTGCCGGCTTCAAAGGTCTATTTATGGACTGAAACAAGCCTCGAGGTCTTGGAATATAAGGTTTGATGAGGCGATCAAATCTTATGGCTTTAATCAAAATGTTGACGAGCCTTGTGTCTACAAGAAAATCGTTAACAAAACTATCGCATTTCTAATTTTGTATGTGGATGATATCCTTCTCATTGGGAATGAGGTAGGATTTCTTACCGACATTAAGAAATGGCTAGCTTCATAATTCCAAATGAAAGATTTGGGAGAGGCACAATATGTTCTAGGTATCCAGATAGTCCGGAGCTAAAAGAACAGAACGCTAGCCATGTCTCAGGCATCTTACTTTGACAAGATGTTGTCTAGGTATAAGATGCAGAACTAAGAAGGGTTTGCTGCCTTTCAAGGCATGAGGTTCAATTGTCTTAGGATCAATGTCCTAAGACACCTCAAAAGGTTGAGGATATGAGACGAATCCCCTATGCTTCAGCTGTAAGGAGCCTCATGTATGCCATGCTGTGTACTAGGCCTGACATCTGTTATGCAGTTGGGATTGTCAGTAGGTATCAATCCAATCAAGGATTAGATCACTGGACAATCGTAAAGGCAATCCTCAAGTATCTTAGGAGAATGAGGAACTACAACCTTGTGTATGACAGAGGGGATTTGATCCTTACGGTATACACAGATTCTGACTTTCAGACCGATAAGGATTCTAGGAAATCCACTTCGGGGTCAACCTTCATTCTGAATGGAGGAGCTGTAGTGTAGTGAAGCATCAAGCAGGGATGCATCGCTAATTCCACGATGGAAGCCGAGTATGTTGTGGCTTGTGAAGCTGAAAAGGAAGCTGTTTGGCTAAGGAAATTCATGATGGATTTGGAAGTTGTTCCAAATATGAACTTATCGATCACGTTGTTTTGTGACAACAGTGGTGCAGTAGCCAACTCGAGAGAACCTCGGAGTCATAAAAGGGGCAAGCACATAGAGCGTAAGTATCACCTGATACGGGAGATTATGCACCGTGGAGACGTGACAGTCACGCAGATAGCTTCGGAGCACAACGTTGCTGATCCATTTATAAAGGCCCTCACGGCTAAGGTGTTTAAGGGTCACCTAGAGAGTCTAGGTCTTCAAGTGCTTCCTGACTAG

mRNA sequence

ATGCGAACTAAGAGAATAAAAGTTTCTCTAAAAGAAAGTGCCCATCTTTGGCATCTAAGGTTAGGCCACATAAATCTCAATAAGATTGAGAGACTAGTAATGAGTGGACTTCTAAGCGAGTTGGAAGAAAACTCTTTACCGGTATGTGAGTCATGCCTTGAAGGCAAAATGACCAAACGTCCTTTTAGTGGAAAAGGGTATAGAGCCAAGGAGCCCTTTGAGCTTATACATTCTGACCTCTGTGGTCCGATGAATGTTAAAGCACGAGGAGGTTACGAATACTTCGTATCTTTCATAGATGACTATTTGAGGTATGGGTACATTTACCTAATGTATAGGAAGTCTGAAACACTTGAAAAGTTCAAGGAGTTCAAGACTGAGGTTGAGAACCTGTTAGTGGAGATTGCGGTATACATTTTGAACACAGTTCCGTCGAAAAGTGTTTGTGAAACACCTTTTGAACTTTGGCATGGTCGTAAAATCAGTTTACGTCATTTCAGAATTTGGGGATGCCTGACCCATGTGTTGGTGTCAAACCCGAAAAAGTTGGAATCCCGTTTAAAGTTGTGCCTCTTTGTAGGTTACCCAAAAGAGACTAGGGGTGATATGTTTTTCGATCCTAAGGATAATAGGGTGCTTGTGTCGACAAATGCCACTTTCCTTGAGGAAAACCACATCAGGGATCACTTACCAAGGAGTAAGATTGTGTTGAATGAAATGGACAGTTCATCAGCAAGAGTTGCTGATGGGGCTAGTACGTCAACAAGTGTTGTTGATCCTAGCTCGTCTAGTAAAGTCCGTTCTCAAGAGTTGAGAATGCCTCGACGTAGTGAGAGGGTTGTGAGACAGCTTGAACGTTACATGGGTTTGGCTAAAACCCTGGTCGTCACCCCTGATGATGACTGTGAGGATCCATTGACCTATGATCAGGCAATGGCAAAGGTTGACAAAAATGAATGGATTAAAGCTATGGATCAGGAAATGGAGTCAATGTACTTCAATTCCATCTGGGAGCTTGTGGACCAACCAGATGGGGTTAAACCTATTGGTTGCAAGTGGATCTACAAGCGTAAACGTGACGTAGATGAGAAGGTGCAAACCTTCAAAGCACGACTAGTGGCAAAGAGTTTTACCCAGGTGGAAGAAGTTGACTATAAGGAAACCTTTTCACCTGTTGCCATGGTAAAGTCGATTAGAATCCTTCTAGCTATTGCCGTATATTATGACTATGAGGTATGGAAAATGGATGTCAAGACCGCTTTTATGAATGACAACCTTGACGAAACCATCTACATGGACCAACCCAAGGGGTTCATTGCCAAAGGCCAAGAGCAAAAGGTTTGCCGGCTTCAAAGGTCTATTTATGGACTGAAACAAGCCTCGAGGTCTTGGAATATAAGGTTTGATGAGGCGATCAAATCTTATGGCTTTAATCAAAATGTTGACGAGCCTTGTGTCTACAAGAAAATCGTTAACAAAACTATCGCATTTCTAATTTTGTATGTGGATGATATCCTTCTCATTGGGAATGAGGATCAATGTCCTAAGACACCTCAAAAGGTTGAGGATATGAGACGAATCCCCTATGCTTCAGCTGTAAGGAGCCTCATGTATGCCATGCTGTGTACTAGGCCTGACATCTGTTATGCAGTTGGGATTGTCAGTAGGTATCAATCCAATCAAGGATTAGATCACTGGACAATCGTAAAGGCAATCCTCAAGTATCTTAGGAGAATGAGGAACTACAACCTTGTGTATGACAGAGGGGATTTGATCCTTACGGGATGCATCGCTAATTCCACGATGGAAGCCGAGTATGTTGTGGCTTGTGAAGCTGAAAAGGAAGCTGTTTGGCTAAGGAAATTCATGATGGATTTGGAAGTTGTTCCAAATATGAACTTATCGATCACGTTGTTTTGTGACAACAGTGGTGCAGTAGCCAACTCGAGAGAACCTCGGAGTCATAAAAGGGGCAAGCACATAGAGCGTAAGTATCACCTGATACGGGAGATTATGCACCGTGGAGACGTGACAGTCACGCAGATAGCTTCGGAGCACAACGTTGCTGATCCATTTATAAAGGCCCTCACGGCTAAGGTGTTTAAGGGTCACCTAGAGAGTCTAGGTCTTCAAGTGCTTCCTGACTAG

Coding sequence (CDS)

ATGCGAACTAAGAGAATAAAAGTTTCTCTAAAAGAAAGTGCCCATCTTTGGCATCTAAGGTTAGGCCACATAAATCTCAATAAGATTGAGAGACTAGTAATGAGTGGACTTCTAAGCGAGTTGGAAGAAAACTCTTTACCGGTATGTGAGTCATGCCTTGAAGGCAAAATGACCAAACGTCCTTTTAGTGGAAAAGGGTATAGAGCCAAGGAGCCCTTTGAGCTTATACATTCTGACCTCTGTGGTCCGATGAATGTTAAAGCACGAGGAGGTTACGAATACTTCGTATCTTTCATAGATGACTATTTGAGGTATGGGTACATTTACCTAATGTATAGGAAGTCTGAAACACTTGAAAAGTTCAAGGAGTTCAAGACTGAGGTTGAGAACCTGTTAGTGGAGATTGCGGTATACATTTTGAACACAGTTCCGTCGAAAAGTGTTTGTGAAACACCTTTTGAACTTTGGCATGGTCGTAAAATCAGTTTACGTCATTTCAGAATTTGGGGATGCCTGACCCATGTGTTGGTGTCAAACCCGAAAAAGTTGGAATCCCGTTTAAAGTTGTGCCTCTTTGTAGGTTACCCAAAAGAGACTAGGGGTGATATGTTTTTCGATCCTAAGGATAATAGGGTGCTTGTGTCGACAAATGCCACTTTCCTTGAGGAAAACCACATCAGGGATCACTTACCAAGGAGTAAGATTGTGTTGAATGAAATGGACAGTTCATCAGCAAGAGTTGCTGATGGGGCTAGTACGTCAACAAGTGTTGTTGATCCTAGCTCGTCTAGTAAAGTCCGTTCTCAAGAGTTGAGAATGCCTCGACGTAGTGAGAGGGTTGTGAGACAGCTTGAACGTTACATGGGTTTGGCTAAAACCCTGGTCGTCACCCCTGATGATGACTGTGAGGATCCATTGACCTATGATCAGGCAATGGCAAAGGTTGACAAAAATGAATGGATTAAAGCTATGGATCAGGAAATGGAGTCAATGTACTTCAATTCCATCTGGGAGCTTGTGGACCAACCAGATGGGGTTAAACCTATTGGTTGCAAGTGGATCTACAAGCGTAAACGTGACGTAGATGAGAAGGTGCAAACCTTCAAAGCACGACTAGTGGCAAAGAGTTTTACCCAGGTGGAAGAAGTTGACTATAAGGAAACCTTTTCACCTGTTGCCATGGTAAAGTCGATTAGAATCCTTCTAGCTATTGCCGTATATTATGACTATGAGGTATGGAAAATGGATGTCAAGACCGCTTTTATGAATGACAACCTTGACGAAACCATCTACATGGACCAACCCAAGGGGTTCATTGCCAAAGGCCAAGAGCAAAAGGTTTGCCGGCTTCAAAGGTCTATTTATGGACTGAAACAAGCCTCGAGGTCTTGGAATATAAGGTTTGATGAGGCGATCAAATCTTATGGCTTTAATCAAAATGTTGACGAGCCTTGTGTCTACAAGAAAATCGTTAACAAAACTATCGCATTTCTAATTTTGTATGTGGATGATATCCTTCTCATTGGGAATGAGGATCAATGTCCTAAGACACCTCAAAAGGTTGAGGATATGAGACGAATCCCCTATGCTTCAGCTGTAAGGAGCCTCATGTATGCCATGCTGTGTACTAGGCCTGACATCTGTTATGCAGTTGGGATTGTCAGTAGGTATCAATCCAATCAAGGATTAGATCACTGGACAATCGTAAAGGCAATCCTCAAGTATCTTAGGAGAATGAGGAACTACAACCTTGTGTATGACAGAGGGGATTTGATCCTTACGGGATGCATCGCTAATTCCACGATGGAAGCCGAGTATGTTGTGGCTTGTGAAGCTGAAAAGGAAGCTGTTTGGCTAAGGAAATTCATGATGGATTTGGAAGTTGTTCCAAATATGAACTTATCGATCACGTTGTTTTGTGACAACAGTGGTGCAGTAGCCAACTCGAGAGAACCTCGGAGTCATAAAAGGGGCAAGCACATAGAGCGTAAGTATCACCTGATACGGGAGATTATGCACCGTGGAGACGTGACAGTCACGCAGATAGCTTCGGAGCACAACGTTGCTGATCCATTTATAAAGGCCCTCACGGCTAAGGTGTTTAAGGGTCACCTAGAGAGTCTAGGTCTTCAAGTGCTTCCTGACTAG

Protein sequence

MRTKRIKVSLKESAHLWHLRLGHINLNKIERLVMSGLLSELEENSLPVCESCLEGKMTKRPFSGKGYRAKEPFELIHSDLCGPMNVKARGGYEYFVSFIDDYLRYGYIYLMYRKSETLEKFKEFKTEVENLLVEIAVYILNTVPSKSVCETPFELWHGRKISLRHFRIWGCLTHVLVSNPKKLESRLKLCLFVGYPKETRGDMFFDPKDNRVLVSTNATFLEENHIRDHLPRSKIVLNEMDSSSARVADGASTSTSVVDPSSSSKVRSQELRMPRRSERVVRQLERYMGLAKTLVVTPDDDCEDPLTYDQAMAKVDKNEWIKAMDQEMESMYFNSIWELVDQPDGVKPIGCKWIYKRKRDVDEKVQTFKARLVAKSFTQVEEVDYKETFSPVAMVKSIRILLAIAVYYDYEVWKMDVKTAFMNDNLDETIYMDQPKGFIAKGQEQKVCRLQRSIYGLKQASRSWNIRFDEAIKSYGFNQNVDEPCVYKKIVNKTIAFLILYVDDILLIGNEDQCPKTPQKVEDMRRIPYASAVRSLMYAMLCTRPDICYAVGIVSRYQSNQGLDHWTIVKAILKYLRRMRNYNLVYDRGDLILTGCIANSTMEAEYVVACEAEKEAVWLRKFMMDLEVVPNMNLSITLFCDNSGAVANSREPRSHKRGKHIERKYHLIREIMHRGDVTVTQIASEHNVADPFIKALTAKVFKGHLESLGLQVLPD
Homology
BLAST of Tan0009842 vs. ExPASy Swiss-Prot
Match: P10978 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum OX=4097 PE=2 SV=1)

HSP 1 Score: 396.4 bits (1017), Expect = 7.1e-109
Identity = 261/913 (28.59%), Postives = 415/913 (45.45%), Query Frame = 0

Query: 13   SAHLWHLRLGHINLNKIERLVMSGLLSELEENSLPVCESCLEGKMTKRPFSGKGYRAKEP 72
            S  LWH R+GH++   ++ L    L+S  +  ++  C+ CL GK  +  F     R    
Sbjct: 421  SVDLWHKRMGHMSEKGLQILAKKSLISYAKGTTVKPCDYCLFGKQHRVSFQTSSERKLNI 480

Query: 73   FELIHSDLCGPMNVKARGGYEYFVSFIDDYLRYGYIYLMYRKSETLEKFKEF-------- 132
             +L++SD+CGPM +++ GG +YFV+FIDD  R  ++Y++  K +  + F++F        
Sbjct: 481  LDLVYSDVCGPMEIESMGGNKYFVTFIDDASRKLWVYILKTKDQVFQVFQKFHALVERET 540

Query: 133  -------------------------------------------------KTEVENL---- 192
                                                             +T VE +    
Sbjct: 541  GRKLKRLRSDNGGEYTSREFEEYCSSHGIRHEKTVPGTPQHNGVAERMNRTIVEKVRSML 600

Query: 193  ------------LVEIAVYILNTVPSKSVC-ETPFELWHGRKISLRHFRIWGC--LTHVL 252
                         V+ A Y++N  PS  +  E P  +W  +++S  H +++GC    HV 
Sbjct: 601  RMAKLPKSFWGEAVQTACYLINRSPSVPLAFEIPERVWTNKEVSYSHLKVFGCRAFAHVP 660

Query: 253  VSNPKKLESRLKLCLFVGYPKETRGDMFFDPKDNRVLVSTNATFLEENHIRDHLPRSKIV 312
                 KL+ +   C+F+GY  E  G   +DP   +V+ S +  F  E+ +R     S+ V
Sbjct: 661  KEQRTKLDDKSIPCIFIGYGDEEFGYRLWDPVKKKVIRSRDVVF-RESEVRTAADMSEKV 720

Query: 313  LN-----------------EMDSSSARVADGASTSTSVVDPSSSSKVRSQELRMP----- 372
             N                   +S++  V++       V++         +E+  P     
Sbjct: 721  KNGIIPNFVTIPSTSNNPTSAESTTDEVSEQGEQPGEVIEQGEQLDEGVEEVEHPTQGEE 780

Query: 373  -----RRSERVVRQLERYMGLAKTLVVTPDDDCEDPLTYDQAMAKVDKNEWIKAMDQEME 432
                 RRSER   +  RY      L+     D  +P +  + ++  +KN+ +KAM +EME
Sbjct: 781  QHQPLRRSERPRVESRRYPSTEYVLI----SDDREPESLKEVLSHPEKNQLMKAMQEEME 840

Query: 433  SMYFNSIWELVDQPDGVKPIGCKWIYKRKRDVDEKVQTFKARLVAKSFTQVEEVDYKETF 492
            S+  N  ++LV+ P G +P+ CKW++K K+D D K+  +KARLV K F Q + +D+ E F
Sbjct: 841  SLQKNGTYKLVELPKGKRPLKCKWVFKLKKDGDCKLVRYKARLVVKGFEQKKGIDFDEIF 900

Query: 493  SPVAMVKSIRILLAIAVYYDYEVWKMDVKTAFMNDNLDETIYMDQPKGFIAKGQEQKVCR 552
            SPV  + SIR +L++A   D EV ++DVKTAF++ +L+E IYM+QP+GF   G++  VC+
Sbjct: 901  SPVVKMTSIRTILSLAASLDLEVEQLDVKTAFLHGDLEEEIYMEQPEGFEVAGKKHMVCK 960

Query: 553  LQRSIYGLKQASRSWNIRFDEAIKSYGFNQNVDEPCVY-KKIVNKTIAFLILYVDDILLI 612
            L +S+YGLKQA R W ++FD  +KS  + +   +PCVY K+        L+LYVDD+L++
Sbjct: 961  LNKSLYGLKQAPRQWYMKFDSFMKSQTYLKTYSDPCVYFKRFSENNFIILLLYVDDMLIV 1020

Query: 613  G----------------------------------------------------------- 672
            G                                                           
Sbjct: 1021 GKDKGLIAKLKGDLSKSFDMKDLGPAQQILGMKIVRERTSRKLWLSQEKYIERVLERFNM 1080

Query: 673  ----------------NEDQCPKTPQKVEDMRRIPYASAVRSLMYAMLCTRPDICYAVGI 711
                            ++  CP T ++  +M ++PY+SAV SLMYAM+CTRPDI +AVG+
Sbjct: 1081 KNAKPVSTPLAGHLKLSKKMCPTTVEEKGNMAKVPYSSAVGSLMYAMVCTRPDIAHAVGV 1140

BLAST of Tan0009842 vs. ExPASy Swiss-Prot
Match: P04146 (Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3)

HSP 1 Score: 278.5 bits (711), Expect = 2.1e-73
Identity = 245/1003 (24.43%), Postives = 400/1003 (39.88%), Query Frame = 0

Query: 6    IKVSLKESAHLWHLRLGHINLNKIERLVMSGLLSELE-----ENSLPVCESCLEGKMTKR 65
            I    K +  LWH R GHI+  K+  +    + S+       E S  +CE CL GK  + 
Sbjct: 407  INAKHKNNFRLWHERFGHISDGKLLEIKRKNMFSDQSLLNNLELSCEICEPCLNGKQARL 466

Query: 66   PFSGKGYRA--KEPFELIHSDLCGPMNVKARGGYEYFVSFIDDYLRYGYIYLMYRKSETL 125
            PF     +   K P  ++HSD+CGP+         YFV F+D +  Y   YL+  KS+  
Sbjct: 467  PFKQLKDKTHIKRPLFVVHSDVCGPITPVTLDDKNYFVIFVDQFTHYCVTYLIKYKSDVF 526

Query: 126  EKFKEFKTEVE---NL-------------------------------------------- 185
              F++F  + E   NL                                            
Sbjct: 527  SMFQDFVAKSEAHFNLKVVYLYIDNGREYLSNEMRQFCVKKGISYHLTVPHTPQLNGVSE 586

Query: 186  --------------------------LVEIAVYILNTVPSKSVCE---TPFELWHGRKIS 245
                                       V  A Y++N +PS+++ +   TP+E+WH +K  
Sbjct: 587  RMIRTITEKARTMVSGAKLDKSFWGEAVLTATYLINRIPSRALVDSSKTPYEMWHNKKPY 646

Query: 246  LRHFRIWGCLTHVLVSNPK-KLESRLKLCLFVGY-------------------------- 305
            L+H R++G   +V + N + K + +    +FVGY                          
Sbjct: 647  LKHLRVFGATVYVHIKNKQGKFDDKSFKSIFVGYEPNGFKLWDAVNEKFIVARDVVVDET 706

Query: 306  -----------------PKETRGDMFFDPKDNRVLVST----------NATFLEEN---- 365
                              KE+    F  P D+R ++ T          N  FL+++    
Sbjct: 707  NMVNSRAVKFETVFLKDSKESENKNF--PNDSRKIIQTEFPNESKECDNIQFLKDSKESE 766

Query: 366  ---------------------------HIRDHLPRSKIVLNEMDSSSARVADGASTSTSV 425
                                        ++D    +K  LNE  S   +  D  + S   
Sbjct: 767  NKNFPNDSRKIIQTEFPNESKECDNIQFLKDSKESNKYFLNE--SKKRKRDDHLNESKGS 826

Query: 426  VDPSSSSKVRSQE-LR---------------MPRRSERV-----VRQLERYMGLAKTLVV 485
             +P+ S +  + E L+               + RRSER+     +   E    L K ++ 
Sbjct: 827  GNPNESRESETAEHLKEIGIDNPTKNDGIEIINRRSERLKTKPQISYNEEDNSLNKVVLN 886

Query: 486  TPDDDCEDPLTYDQAMAKVDKNEWIKAMDQEMESMYFNSIWELVDQPDGVKPIGCKWIYK 545
                  + P ++D+   + DK+ W +A++ E+ +   N+ W +  +P+    +  +W++ 
Sbjct: 887  AHTIFNDVPNSFDEIQYRDDKSSWEEAINTELNAHKINNTWTITKRPENKNIVDSRWVFS 946

Query: 546  RKRDVDEKVQTFKARLVAKSFTQVEEVDYKETFSPVAMVKSIRILLAIAVYYDYEVWKMD 605
             K +       +KARLVA+ FTQ  ++DY+ETF+PVA + S R +L++ + Y+ +V +MD
Sbjct: 947  VKYNELGNPIRYKARLVARGFTQKYQIDYEETFAPVARISSFRFILSLVIQYNLKVHQMD 1006

Query: 606  VKTAFMNDNLDETIYMDQPKGFIAKGQEQKVCRLQRSIYGLKQASRSWNIRFDEAIKSYG 665
            VKTAF+N  L E IYM  P+G         VC+L ++IYGLKQA+R W   F++A+K   
Sbjct: 1007 VKTAFLNGTLKEEIYMRLPQGISC--NSDNVCKLNKAIYGLKQAARCWFEVFEQALKECE 1066

Query: 666  FNQNVDEPCVY---KKIVNKTIAFLILYVDDILL-------------------------- 711
            F  +  + C+Y   K  +N+ I +++LYVDD+++                          
Sbjct: 1067 FVNSSVDRCIYILDKGNINENI-YVLLYVDDVVIATGDMTRMNNFKRYLMEKFRMTDLNE 1126

BLAST of Tan0009842 vs. ExPASy Swiss-Prot
Match: Q9ZT94 (Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana OX=3702 GN=RE2 PE=4 SV=1)

HSP 1 Score: 202.6 bits (514), Expect = 1.5e-50
Identity = 156/599 (26.04%), Postives = 257/599 (42.90%), Query Frame = 0

Query: 229  HLPRSKIVLNEMDSSSARVADGASTSTSVVDP--------SSSSKVRSQELRMPRRSERV 288
            H+P     ++E +S S+     +STST  + P          +++       M  R++  
Sbjct: 865  HIPTPSTSISEPNSPSS-----SSTSTPPLPPVLPAPPIIQVNAQAPVNTHSMATRAKDG 924

Query: 289  VRQLERYMGLAKTLVVTPDDDCEDPLTYDQAMAKVDKNEWIKAMDQEMESMYFNSIWELV 348
            +R+  +    A +L         +P T  QAM     + W +AM  E+ +   N  W+LV
Sbjct: 925  IRKPNQKYSYATSLAAN-----SEPRTAIQAM---KDDRWRQAMGSEINAQIGNHTWDLV 984

Query: 349  -DQPDGVKPIGCKWIYKRKRDVDEKVQTFKARLVAKSFTQVEEVDYKETFSPVAMVKSIR 408
               P  V  +GC+WI+ +K + D  +  +KARLVAK + Q   +DY ETFSPV    SIR
Sbjct: 985  PPPPPSVTIVGCRWIFTKKFNSDGSLNRYKARLVAKGYNQRPGLDYAETFSPVIKSTSIR 1044

Query: 409  ILLAIAVYYDYEVWKMDVKTAFMNDNLDETIYMDQPKGFIAKGQEQKVCRLQRSIYGLKQ 468
            I+L +AV   + + ++DV  AF+   L + +YM QP GF+ K +   VCRL+++IYGLKQ
Sbjct: 1045 IVLGVAVDRSWPIRQLDVNNAFLQGTLTDEVYMSQPPGFVDKDRPDYVCRLRKAIYGLKQ 1104

Query: 469  ASRSWNIRFDEAIKSYGFNQNVDEPCVYKKIVNKTIAFLILYVDDILLIGNEDQCPK--- 528
            A R+W +     + + GF  ++ +  ++     ++I ++++YVDDIL+ GN+    K   
Sbjct: 1105 APRAWYVELRTYLLTVGFVNSISDTSLFVLQRGRSIIYMLVYVDDILITGNDTVLLKHTL 1164

Query: 529  -------TPQKVEDM--------RRIP--------------------------------- 588
                   + ++ ED+        +R+P                                 
Sbjct: 1165 DALSQRFSVKEHEDLHYFLGIEAKRVPQGLHLSQRRYTLDLLARTNMLTAKPVATPMATS 1224

Query: 589  ----------------YASAVRSLMYAMLCTRPDICYAVGIVSRYQSNQGLDHWTIVKAI 648
                            Y   V SL Y +  TRPD+ YAV  +S+Y      DHW  +K +
Sbjct: 1225 PKLTLHSGTKLPDPTEYRGIVGSLQY-LAFTRPDLSYAVNRLSQYMHMPTDDHWNALKRV 1284

Query: 649  LKYLRRMRNYNLVYDRG------------------DLILTG------------------- 708
            L+YL    ++ +   +G                  D + T                    
Sbjct: 1285 LRYLAGTPDHGIFLKKGNTLSLHAYSDADWAGDTDDYVSTNGYIVYLGHHPISWSSKKQK 1344

Query: 709  CIANSTMEAEYVVACEAEKEAVWLRKFMMDLEVVPNMNLSITLFCDNSGAVANSREPRSH 715
             +  S+ EAEY        E  W+   + +L +   ++    ++CDN GA      P  H
Sbjct: 1345 GVVRSSTEAEYRSVANTSSELQWICSLLTELGI--QLSHPPVIYCDNVGATYLCANPVFH 1404

BLAST of Tan0009842 vs. ExPASy Swiss-Prot
Match: Q94HW2 (Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana OX=3702 GN=RE1 PE=2 SV=1)

HSP 1 Score: 198.0 bits (502), Expect = 3.7e-49
Identity = 136/509 (26.72%), Postives = 216/509 (42.44%), Query Frame = 0

Query: 311  AMAKVDKNEWIKAMDQEMESMYFNSIWELVDQPDG-VKPIGCKWIYKRKRDVDEKVQTFK 370
            A+  +    W  AM  E+ +   N  W+LV  P   V  +GC+WI+ +K + D  +  +K
Sbjct: 959  AIQALKDERWRNAMGSEINAQIGNHTWDLVPPPPSHVTIVGCRWIFTKKYNSDGSLNRYK 1018

Query: 371  ARLVAKSFTQVEEVDYKETFSPVAMVKSIRILLAIAVYYDYEVWKMDVKTAFMNDNLDET 430
            ARLVAK + Q   +DY ETFSPV    SIRI+L +AV   + + ++DV  AF+   L + 
Sbjct: 1019 ARLVAKGYNQRPGLDYAETFSPVIKSTSIRIVLGVAVDRSWPIRQLDVNNAFLQGTLTDD 1078

Query: 431  IYMDQPKGFIAKGQEQKVCRLQRSIYGLKQASRSWNIRFDEAIKSYGFNQNVDEPCVYKK 490
            +YM QP GFI K +   VC+L++++YGLKQA R+W +     + + GF  +V +  ++  
Sbjct: 1079 VYMSQPPGFIDKDRPNYVCKLRKALYGLKQAPRAWYVELRNYLLTIGFVNSVSDTSLFVL 1138

Query: 491  IVNKTIAFLILYVDDILLIGNE-----DQCPKTPQKVE-------------DMRRIP--- 550
               K+I ++++YVDDIL+ GN+     +      Q+               + +R+P   
Sbjct: 1139 QRGKSIVYMLVYVDDILITGNDPTLLHNTLDNLSQRFSVKDHEELHYFLGIEAKRVPTGL 1198

Query: 551  ----------------------------------------------YASAVRSLMYAMLC 610
                                                          Y   V SL Y +  
Sbjct: 1199 HLSQRRYILDLLARTNMITAKPVTTPMAPSPKLSLYSGTKLTDPTEYRGIVGSLQY-LAF 1258

Query: 611  TRPDICYAVGIVSRYQSNQGLDHWTIVKAILKYLRRMRNYNLVY---------------- 670
            TRPDI YAV  +S++      +H   +K IL+YL    N+ +                  
Sbjct: 1259 TRPDISYAVNRLSQFMHMPTEEHLQALKRILRYLAGTPNHGIFLKKGNTLSLHAYSDADW 1318

Query: 671  --DRGDLILTG-------------------CIANSTMEAEYVVACEAEKEAVWLRKFMMD 715
              D+ D + T                     +  S+ EAEY        E  W+   + +
Sbjct: 1319 AGDKDDYVSTNGYIVYLGHHPISWSSKKQKGVVRSSTEAEYRSVANTSSEMQWICSLLTE 1378

BLAST of Tan0009842 vs. ExPASy Swiss-Prot
Match: P25600 (Putative transposon Ty5-1 protein YCL074W OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=TY5A PE=5 SV=2)

HSP 1 Score: 77.4 bits (189), Expect = 7.3e-13
Identity = 59/243 (24.28%), Postives = 91/243 (37.45%), Query Frame = 0

Query: 415 MDVKTAFMNDNLDETIYMDQPKGFIAKGQEQKVCRLQRSIYGLKQASRSWNIRFDEAIKS 474
           MDV TAF+N  +DE IY+ QP GF+ +     V  L   +YGLKQA   WN   +  +K 
Sbjct: 1   MDVDTAFLNSTMDEPIYVKQPPGFVNERNPDYVWELYGGMYGLKQAPLLWNEHINNTLKK 60

Query: 475 YGFNQNVDEPCVYKKIVNKTIAFLILYVDDILLIGNE----DQCPKTPQKVEDMRRI--- 534
            GF ++  E  +Y +  +    ++ +YVDD+L+        D+  +   K+  M+ +   
Sbjct: 61  IGFCRHEGEHGLYFRSTSDGPIYIAVYVDDLLVAAPSPKIYDRVKQELTKLYSMKDLGKV 120

Query: 535 ------------------------------------------------------------ 590
                                                                       
Sbjct: 121 DKFLGLNIHQSSNGDITLSLQDYIAKAASESEINTFKLTQTPLCNSKPLFETTSPHLKDI 180

BLAST of Tan0009842 vs. NCBI nr
Match: KAA0048404.1 (gag/pol protein [Cucumis melo var. makuwa])

HSP 1 Score: 1045.4 bits (2702), Expect = 2.2e-301
Identity = 552/902 (61.20%), Postives = 618/902 (68.51%), Query Frame = 0

Query: 2    RTKRIKVSLKESAHLWHLRLGHINLNKIERLVMSGLLSELEENSLPVCESCLEGKMTKRP 61
            + KR+K+S KE+AHLWHLRLGHINLN+IERLV +GLLSELEENSLPVCESCLEGKMTKRP
Sbjct: 438  QNKRLKISPKENAHLWHLRLGHINLNRIERLVKNGLLSELEENSLPVCESCLEGKMTKRP 497

Query: 62   FSGKGYRAKEPFELIHSDLCGPMNVKARGGYEYFVSFIDDYLRYGYIYLMYRKSETLEKF 121
            F+GKG+RAKEP EL+HSDLCGPMNVKARGG+EYF++F DDY RYGY+YLM  KSE LEKF
Sbjct: 498  FTGKGHRAKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKF 557

Query: 122  KEFKTEVENLL------------------------------------------------- 181
            KE+K EVEN L                                                 
Sbjct: 558  KEYKAEVENALSKTIKTFRSDRGGEYMDLKFQNYLMECGIVSQLSAPGTPQQNGVSERRN 617

Query: 182  ------------------------VEIAVYILNTVPSKSVCETPFELWHGRKISLRHFRI 241
                                    V+ AVYILN VPSKSV ETP +LW+GRK SLRHFRI
Sbjct: 618  RTLLDMVRSMMSYAHLPNSFWGYAVQTAVYILNCVPSKSVSETPLKLWNGRKGSLRHFRI 677

Query: 242  WGCLTHVLVSNPKKLESRLKLCLFVGYPKETRGDMFFDPKDNRVLVSTNATFLEENHIRD 301
            WGC  HVL +NPKKLE R KLCLFVGYPK TRG  F+DPKDN+V VSTNATFLEE+HIR+
Sbjct: 678  WGCPAHVLENNPKKLEPRSKLCLFVGYPKGTRGGYFYDPKDNKVFVSTNATFLEEDHIRE 737

Query: 302  HLPRSKIVLNEMDSS----SARVADGASTSTSVVDPSSSSKV-RSQELRMPRRSERVVRQ 361
            H PRSKIVLNE+       S RV +  S  T VV   SS++  + Q LR PRRS RV   
Sbjct: 738  HKPRSKIVLNELSKETTEPSTRVVEEPSALTRVVHVGSSTRTHQPQSLREPRRSGRVTNL 797

Query: 362  LERYMGLAKTLVVTPDDDCEDPLTYDQAMAKVDKNEWIKAMDQEMESMYFNSIWELVDQP 421
              RYM L +TL V  D D EDPLT+ +AM  VDK+EWIKAM+ E+ESMYFNS+W+LVDQP
Sbjct: 798  PIRYMSLTETLTVISDGDIEDPLTFKKAMEDVDKDEWIKAMNLELESMYFNSVWDLVDQP 857

Query: 422  DGVKPIGCKWIYKRKRDVDEKVQTFKARLVAKSFTQVEEVDYKETFSPVAMVKSIRILLA 481
            DGVKPIGCKWIYKRKR  D KVQTFKARLVAK +TQVE VDY+ETFSPVAM+KSIRILL+
Sbjct: 858  DGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEETFSPVAMLKSIRILLS 917

Query: 482  IAVYYDYEVWKMDVKTAFMNDNLDETIYMDQPKGFIAKGQEQKVCRLQRSIYGLKQASRS 541
            IA Y+DYE+W+MDVKTAF+N NL+ETIYM QP+GFI  GQEQK+C+L RSIYGLKQASRS
Sbjct: 918  IAAYFDYEIWQMDVKTAFLNGNLEETIYMQQPEGFIIPGQEQKICKLNRSIYGLKQASRS 977

Query: 542  WNIRFDEAIKSYGFNQNVDEPCVYKKIVNKTIAFLILYVDDILLIGNE------------ 601
            WNIRFD AIKSYGF+Q VDEPCVYK+I+NK++AFL+LYVDDILLIGN+            
Sbjct: 978  WNIRFDTAIKSYGFDQIVDEPCVYKRIINKSVAFLVLYVDDILLIGNDIGLLTDIKQWLA 1037

Query: 602  ------------------------------------------------------------ 661
                                                                        
Sbjct: 1038 TQFQMKDLGEAQFVLGIQIFRDRKNKMLALSQASYIDKIVVKYSMQNSKRGLLPFRHGVT 1097

Query: 662  ---DQCPKTPQKVEDMRRIPYASAVRSLMYAMLCTRPDICYAVGIVSRYQSNQGLDHWTI 715
               +QCPKTPQ VE+MR IPYASAV SLMYAMLCTRPDICYAVGIVSRYQSN GL HWT 
Sbjct: 1098 LSKEQCPKTPQDVEEMRHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLAHWTA 1157

BLAST of Tan0009842 vs. NCBI nr
Match: KAA0035879.1 (gag/pol protein [Cucumis melo var. makuwa] >KAA0044276.1 gag/pol protein [Cucumis melo var. makuwa] >KAA0051221.1 gag/pol protein [Cucumis melo var. makuwa] >KAA0051893.1 gag/pol protein [Cucumis melo var. makuwa] >TYK00551.1 gag/pol protein [Cucumis melo var. makuwa])

HSP 1 Score: 1045.4 bits (2702), Expect = 2.2e-301
Identity = 552/902 (61.20%), Postives = 618/902 (68.51%), Query Frame = 0

Query: 2    RTKRIKVSLKESAHLWHLRLGHINLNKIERLVMSGLLSELEENSLPVCESCLEGKMTKRP 61
            + KR+K+S KE+AHLWHLRLGHINLN+IERLV +GLLSELEENSLPVCESCLEGKMTKRP
Sbjct: 439  QNKRLKISPKENAHLWHLRLGHINLNRIERLVKNGLLSELEENSLPVCESCLEGKMTKRP 498

Query: 62   FSGKGYRAKEPFELIHSDLCGPMNVKARGGYEYFVSFIDDYLRYGYIYLMYRKSETLEKF 121
            F+GKG+RAKEP EL+HSDLCGPMNVKARGG+EYF++F DDY RYGY+YLM  KSE LEKF
Sbjct: 499  FTGKGHRAKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKF 558

Query: 122  KEFKTEVENLL------------------------------------------------- 181
            KE+K EVEN L                                                 
Sbjct: 559  KEYKAEVENALSKTIKTFRSDRGGEYMDLKFQNYLMECGIVSQLSAPGTPQQNGVSERRN 618

Query: 182  ------------------------VEIAVYILNTVPSKSVCETPFELWHGRKISLRHFRI 241
                                    V+ AVYILN VPSKSV ETP +LW+GRK SLRHFRI
Sbjct: 619  RTLLDMVRSMMSYAHLPNSFWGYAVQTAVYILNCVPSKSVSETPLKLWNGRKGSLRHFRI 678

Query: 242  WGCLTHVLVSNPKKLESRLKLCLFVGYPKETRGDMFFDPKDNRVLVSTNATFLEENHIRD 301
            WGC  HVL +NPKKLE R KLCLFVGYPK TRG  F+DPKDN+V VSTNATFLEE+HIR+
Sbjct: 679  WGCPAHVLENNPKKLEPRSKLCLFVGYPKGTRGGYFYDPKDNKVFVSTNATFLEEDHIRE 738

Query: 302  HLPRSKIVLNEMDSS----SARVADGASTSTSVVDPSSSSKV-RSQELRMPRRSERVVRQ 361
            H PRSKIVLNE+       S RV +  S  T VV   SS++  + Q LR PRRS RV   
Sbjct: 739  HKPRSKIVLNELSKETTEPSTRVVEEPSALTRVVHVGSSTRTHQPQSLREPRRSGRVTNL 798

Query: 362  LERYMGLAKTLVVTPDDDCEDPLTYDQAMAKVDKNEWIKAMDQEMESMYFNSIWELVDQP 421
              RYM L +TL V  D D EDPLT+ +AM  VDK+EWIKAM+ E+ESMYFNS+W+LVDQP
Sbjct: 799  PIRYMSLTETLTVISDGDIEDPLTFKKAMEDVDKDEWIKAMNLELESMYFNSVWDLVDQP 858

Query: 422  DGVKPIGCKWIYKRKRDVDEKVQTFKARLVAKSFTQVEEVDYKETFSPVAMVKSIRILLA 481
            DGVKPIGCKWIYKRKR  D KVQTFKARLVAK +TQVE VDY+ETFSPVAM+KSIRILL+
Sbjct: 859  DGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEETFSPVAMLKSIRILLS 918

Query: 482  IAVYYDYEVWKMDVKTAFMNDNLDETIYMDQPKGFIAKGQEQKVCRLQRSIYGLKQASRS 541
            IA Y+DYE+W+MDVKTAF+N NL+ETIYM QP+GFI  GQEQK+C+L RSIYGLKQASRS
Sbjct: 919  IAAYFDYEIWQMDVKTAFLNGNLEETIYMQQPEGFIIPGQEQKICKLNRSIYGLKQASRS 978

Query: 542  WNIRFDEAIKSYGFNQNVDEPCVYKKIVNKTIAFLILYVDDILLIGNE------------ 601
            WNIRFD AIKSYGF+Q VDEPCVYK+I+NK++AFL+LYVDDILLIGN+            
Sbjct: 979  WNIRFDTAIKSYGFDQIVDEPCVYKRIINKSVAFLVLYVDDILLIGNDIGLLTDIKQWLA 1038

Query: 602  ------------------------------------------------------------ 661
                                                                        
Sbjct: 1039 TQFQMKDLGEAQFVLGIQIFRDRKNKMLALSQASYIDKIVVKYSMQNSKRGLLPFRHGVT 1098

Query: 662  ---DQCPKTPQKVEDMRRIPYASAVRSLMYAMLCTRPDICYAVGIVSRYQSNQGLDHWTI 715
               +QCPKTPQ VE+MR IPYASAV SLMYAMLCTRPDICYAVGIVSRYQSN GL HWT 
Sbjct: 1099 LSKEQCPKTPQDVEEMRHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLAHWTA 1158

BLAST of Tan0009842 vs. NCBI nr
Match: KAA0031826.1 (gag/pol protein [Cucumis melo var. makuwa] >KAA0032384.1 gag/pol protein [Cucumis melo var. makuwa] >KAA0039313.1 gag/pol protein [Cucumis melo var. makuwa] >KAA0043789.1 gag/pol protein [Cucumis melo var. makuwa] >KAA0048789.1 gag/pol protein [Cucumis melo var. makuwa])

HSP 1 Score: 1045.4 bits (2702), Expect = 2.2e-301
Identity = 552/902 (61.20%), Postives = 618/902 (68.51%), Query Frame = 0

Query: 2    RTKRIKVSLKESAHLWHLRLGHINLNKIERLVMSGLLSELEENSLPVCESCLEGKMTKRP 61
            + KR+K+S KE+AHLWHLRLGHINLN+IERLV +GLLSELEENSLPVCESCLEGKMTKRP
Sbjct: 439  QNKRLKISPKENAHLWHLRLGHINLNRIERLVKNGLLSELEENSLPVCESCLEGKMTKRP 498

Query: 62   FSGKGYRAKEPFELIHSDLCGPMNVKARGGYEYFVSFIDDYLRYGYIYLMYRKSETLEKF 121
            F+GKG+RAKEP EL+HSDLCGPMNVKARGG+EYF++F DDY RYGY+YLM  KSE LEKF
Sbjct: 499  FTGKGHRAKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKF 558

Query: 122  KEFKTEVENLL------------------------------------------------- 181
            KE+K EVEN L                                                 
Sbjct: 559  KEYKAEVENALSKTIKTFRSDRGGEYMDLKFQNYLMECGIVSQLSAPGTPQQNGVSERRN 618

Query: 182  ------------------------VEIAVYILNTVPSKSVCETPFELWHGRKISLRHFRI 241
                                    V+ AVYILN VPSKSV ETP +LW+GRK SLRHFRI
Sbjct: 619  RTLLDMVRSMMSYAHLPNSFWGYAVQTAVYILNCVPSKSVSETPLKLWNGRKGSLRHFRI 678

Query: 242  WGCLTHVLVSNPKKLESRLKLCLFVGYPKETRGDMFFDPKDNRVLVSTNATFLEENHIRD 301
            WGC  HVL +NPKKLE R KLCLFVGYPK TRG  F+DPKDN+V VSTNATFLEE+HIR+
Sbjct: 679  WGCPAHVLENNPKKLEPRSKLCLFVGYPKGTRGGYFYDPKDNKVFVSTNATFLEEDHIRE 738

Query: 302  HLPRSKIVLNEMDSS----SARVADGASTSTSVVDPSSSSKV-RSQELRMPRRSERVVRQ 361
            H PRSKIVLNE+       S RV +  S  T VV   SS++  + Q LR PRRS RV   
Sbjct: 739  HKPRSKIVLNELSKETTEPSTRVVEEPSALTRVVHVGSSTRTHQPQSLREPRRSGRVTNL 798

Query: 362  LERYMGLAKTLVVTPDDDCEDPLTYDQAMAKVDKNEWIKAMDQEMESMYFNSIWELVDQP 421
              RYM L +TL V  D D EDPLT+ +AM  VDK+EWIKAM+ E+ESMYFNS+W+LVDQP
Sbjct: 799  PIRYMSLTETLTVISDGDIEDPLTFKKAMEDVDKDEWIKAMNLELESMYFNSVWDLVDQP 858

Query: 422  DGVKPIGCKWIYKRKRDVDEKVQTFKARLVAKSFTQVEEVDYKETFSPVAMVKSIRILLA 481
            DGVKPIGCKWIYKRKR  D KVQTFKARLVAK +TQVE VDY+ETFSPVAM+KSIRILL+
Sbjct: 859  DGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEETFSPVAMLKSIRILLS 918

Query: 482  IAVYYDYEVWKMDVKTAFMNDNLDETIYMDQPKGFIAKGQEQKVCRLQRSIYGLKQASRS 541
            IA Y+DYE+W+MDVKTAF+N NL+ETIYM QP+GFI  GQEQK+C+L RSIYGLKQASRS
Sbjct: 919  IAAYFDYEIWQMDVKTAFLNGNLEETIYMQQPEGFIIPGQEQKICKLNRSIYGLKQASRS 978

Query: 542  WNIRFDEAIKSYGFNQNVDEPCVYKKIVNKTIAFLILYVDDILLIGNE------------ 601
            WNIRFD AIKSYGF+Q VDEPCVYK+I+NK++AFL+LYVDDILLIGN+            
Sbjct: 979  WNIRFDTAIKSYGFDQIVDEPCVYKRIINKSVAFLVLYVDDILLIGNDIGLLTDIKQWLA 1038

Query: 602  ------------------------------------------------------------ 661
                                                                        
Sbjct: 1039 TQFQMKDLGEAQFVLGIQIFRDRKNKMLALSQASYIDKIVVKYSMQNSKRGLLPFRHGVT 1098

Query: 662  ---DQCPKTPQKVEDMRRIPYASAVRSLMYAMLCTRPDICYAVGIVSRYQSNQGLDHWTI 715
               +QCPKTPQ VE+MR IPYASAV SLMYAMLCTRPDICYAVGIVSRYQSN GL HWT 
Sbjct: 1099 LSKEQCPKTPQDVEEMRHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLAHWTA 1158

BLAST of Tan0009842 vs. NCBI nr
Match: KAA0025945.1 (gag/pol protein [Cucumis melo var. makuwa] >KAA0026303.1 gag/pol protein [Cucumis melo var. makuwa] >KAA0035786.1 gag/pol protein [Cucumis melo var. makuwa] >KAA0040492.1 gag/pol protein [Cucumis melo var. makuwa] >KAA0041262.1 gag/pol protein [Cucumis melo var. makuwa])

HSP 1 Score: 1032.3 bits (2668), Expect = 1.9e-297
Identity = 544/895 (60.78%), Postives = 616/895 (68.83%), Query Frame = 0

Query: 2    RTKRIKVSLKESAHLWHLRLGHINLNKIERLVMSGLLSELEENSLPVCESCLEGKMTKRP 61
            + KR ++S   + +LWHLRLGHINL++I RLV +GLL++L++ SLP CESCLEGKMTKRP
Sbjct: 336  QNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRP 395

Query: 62   FSGKGYRAKEPFELIHSDLCGPMNVKARGGYEYFVSFIDDYLRYGYIYLMYRKSETLEKF 121
            F+GKGYRAKEP ELIHSDLCGPMNVKARGG+EYF+SFIDDY RYGY+YLM  KSE LEKF
Sbjct: 396  FTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALEKF 455

Query: 122  KEFKTEVENLL------------------------------------------------- 181
            KE+KTEVENLL                                                 
Sbjct: 456  KEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRN 515

Query: 182  ------------------------VEIAVYILNTVPSKSVCETPFELWHGRKISLRHFRI 241
                                    VE AV+ILN VPSKSV ETPFELW GRK SL HFRI
Sbjct: 516  RTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRI 575

Query: 242  WGCLTHVLVSNPKKLESRLKLCLFVGYPKETRGDMFFDPKDNRVLVSTNATFLEENHIRD 301
            WGC  HVLV+NPKKLE R +LC FVGYPKETRG +FFDP++NRV VSTNATFLEE+H+R+
Sbjct: 576  WGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRN 635

Query: 302  HLPRSKIVLNEMDSSSARVADGASTSTSVVDPSSSSKVR-SQELRMPRRSERVVRQLERY 361
            H PRSK+VL+E    S RV D    S+ V + ++S +   SQ LRMPRRS RVV Q  RY
Sbjct: 636  HKPRSKLVLSEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRY 695

Query: 362  MGLAKTLVVTPDDDCEDPLTYDQAMAKVDKNEWIKAMDQEMESMYFNSIWELVDQPDGVK 421
            +GL +T VV PDD  EDPL+Y QAM  VDK++W+KAMD EMESMYFNS+WELVD P+GVK
Sbjct: 696  LGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVK 755

Query: 422  PIGCKWIYKRKRDVDEKVQTFKARLVAKSFTQVEEVDYKETFSPVAMVKSIRILLAIAVY 481
            PIGCKWIYKRKRD   KVQTFKARLVAK +TQ E VDY+ETFSPVAM+KSIRILL+IA +
Sbjct: 756  PIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATF 815

Query: 482  YDYEVWKMDVKTAFMNDNLDETIYMDQPKGFIAKGQEQKVCRLQRSIYGLKQASRSWNIR 541
            YDYE+W+MDVKTAF+N NL+E+I+M QP+GFI +GQEQKVC+L RSIYGLKQASRSWNIR
Sbjct: 816  YDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIR 875

Query: 542  FDEAIKSYGFNQNVDEPCVYKKIVNKTIAFLILYVDDILLIGNE---------------- 601
            FD AIKSYGF+QNVDEPCVYKKI    +AFL+LYVDDILLIGN+                
Sbjct: 876  FDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQ 935

Query: 602  -----------------------------------------------------------D 661
                                                                       +
Sbjct: 936  MKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKE 995

Query: 662  QCPKTPQKVEDMRRIPYASAVRSLMYAMLCTRPDICYAVGIVSRYQSNQGLDHWTIVKAI 712
            Q PKTPQ+VEDMRRIPYASAV SLMYAMLCTRPDICYAVGIVSRYQSN GLDHWT VK +
Sbjct: 996  QSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIV 1055

BLAST of Tan0009842 vs. NCBI nr
Match: KAA0025159.1 (gag/pol protein [Cucumis melo var. makuwa])

HSP 1 Score: 1030.0 bits (2662), Expect = 9.6e-297
Identity = 526/782 (67.26%), Postives = 599/782 (76.60%), Query Frame = 0

Query: 20  RLGHINLNKIERLVMSGLLSELEENSLPVCESCLEGKMTKRPFSGKGYRAKEPFELIHSD 79
           +LGHINL++I RLV +GLL++L+++SLP CESCLEGKMTKRPF+ KGYRAKEP ELIHSD
Sbjct: 217 KLGHINLDRIGRLVKNGLLNKLKDDSLPPCESCLEGKMTKRPFTEKGYRAKEPLELIHSD 276

Query: 80  LCGPMNVKARGGYEYFVSFIDDYLRYGYIYLMYRKSETLEKFKEFKTEVENLL------- 139
           LCG MNVKARGG+EYF+SFIDDY RYGY+YLM  KSE LEKFKE+KTEVENLL       
Sbjct: 277 LCGLMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKIL 336

Query: 140 -----------------------------VEIAVYILNTVPSKSVCETPFELWHGRKISL 199
                                        VE AV+ILN  PSKSV ETPFELW GRK SL
Sbjct: 337 RSDRGGEYMDLRFQDYMIEHGIQSQLSTPVETAVHILNNAPSKSVSETPFELWRGRKPSL 396

Query: 200 RHFRIWGCLTHVLVSNPKKLESRLKLCLFVGYPKETRGDMFFDPKDNRVLVSTNATFLEE 259
            HFRIWGC THVLV+NPKKL+SR +LC FVGYPKETRG +FFDP++NRV VSTNATFLEE
Sbjct: 397 SHFRIWGCPTHVLVTNPKKLKSRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEE 456

Query: 260 NHIRDHLPRSKIVLNEMDSSSARVADGASTSTSVVDPSSSSKVR-SQELRMPRRSERVVR 319
           +H+R+H PRSK+VL+E    S RV D    S+ V + ++S +   SQ LRMPRRS RVV 
Sbjct: 457 DHMRNHKPRSKLVLSEATDKSTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVS 516

Query: 320 QLERYMGLAKTLVVTPDDDCEDPLTYDQAMAKVDKNEWIKAMDQEMESMYFNSIWELVDQ 379
           Q  RY+GL +T  V PDD  EDPL+Y QAM  VDK++W+KAMD EMESMYFN +WELVD 
Sbjct: 517 QPNRYLGLTETQAVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNLVWELVDL 576

Query: 380 PDGVKPIGCKWIYKRKRDVDEKVQTFKARLVAKSFTQVEEVDYKETFSPVAMVKSIRILL 439
           P+GVKPIGCKWIYKRKRD   KVQTFKARLVAK +TQ E VDY+ETFSPVAM+KSIRILL
Sbjct: 577 PEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQRERVDYEETFSPVAMLKSIRILL 636

Query: 440 AIAVYYDYEVWKMDVKTAFMNDNLDETIYMDQPKGFIAKGQEQKVCRLQRSIYGLKQASR 499
           +IA +YDYE+W+MDVKTAF+N NL+E+I+M +P+GFI +GQEQKVC+L RSIYGLKQAS+
Sbjct: 637 SIATFYDYEIWQMDVKTAFLNGNLEESIFMSKPEGFITRGQEQKVCKLNRSIYGLKQASK 696

Query: 500 SWNIRFDEAIKSYGFNQNVDEPCVYKKIVNKTIAFLILYVDDILLIGNE----------- 559
           SWNIRFD AIKSYGF+QNVDEPC+YKKI    +AFL+LYVDDIL IGN+           
Sbjct: 697 SWNIRFDIAIKSYGFDQNVDEPCLYKKINKGKVAFLVLYVDDILFIGNDMGYLTDVKAWH 756

Query: 560 ------DQCPKTPQKVEDMRRIPYASAVRSLMYAMLCTRPDICYAVGIVSRYQSNQGLDH 619
                 +QCPKTPQ+VEDMRRIPYASAV SLMYAMLCTRPDICYAVGIVSRYQSN GLDH
Sbjct: 757 GVHLSKEQCPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDH 816

Query: 620 WTIVKAILKYLRRMRNYNLVYDRGDLILT------------------------------- 679
           WT VK ILKYLRR R+Y LVY   DLILT                               
Sbjct: 817 WTAVKIILKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTLGSVFTLNGGAVVW 876

Query: 680 -----GCIANSTMEAEYVVACEAEKEAVWLRKFMMDLEVVPNMNLSITLFCDNSGAVANS 712
                GCIA+STMEAEYV ACEA KEAVWLRKF+ DLEVVPNMNL ITL+ DNS AVANS
Sbjct: 877 RSIKQGCIADSTMEAEYVAACEAVKEAVWLRKFLHDLEVVPNMNLPITLYYDNSEAVANS 936

BLAST of Tan0009842 vs. ExPASy TrEMBL
Match: A0A5A7SMH8 (Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold219G002560 PE=4 SV=1)

HSP 1 Score: 1045.4 bits (2702), Expect = 1.1e-301
Identity = 552/902 (61.20%), Postives = 618/902 (68.51%), Query Frame = 0

Query: 2    RTKRIKVSLKESAHLWHLRLGHINLNKIERLVMSGLLSELEENSLPVCESCLEGKMTKRP 61
            + KR+K+S KE+AHLWHLRLGHINLN+IERLV +GLLSELEENSLPVCESCLEGKMTKRP
Sbjct: 439  QNKRLKISPKENAHLWHLRLGHINLNRIERLVKNGLLSELEENSLPVCESCLEGKMTKRP 498

Query: 62   FSGKGYRAKEPFELIHSDLCGPMNVKARGGYEYFVSFIDDYLRYGYIYLMYRKSETLEKF 121
            F+GKG+RAKEP EL+HSDLCGPMNVKARGG+EYF++F DDY RYGY+YLM  KSE LEKF
Sbjct: 499  FTGKGHRAKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKF 558

Query: 122  KEFKTEVENLL------------------------------------------------- 181
            KE+K EVEN L                                                 
Sbjct: 559  KEYKAEVENALSKTIKTFRSDRGGEYMDLKFQNYLMECGIVSQLSAPGTPQQNGVSERRN 618

Query: 182  ------------------------VEIAVYILNTVPSKSVCETPFELWHGRKISLRHFRI 241
                                    V+ AVYILN VPSKSV ETP +LW+GRK SLRHFRI
Sbjct: 619  RTLLDMVRSMMSYAHLPNSFWGYAVQTAVYILNCVPSKSVSETPLKLWNGRKGSLRHFRI 678

Query: 242  WGCLTHVLVSNPKKLESRLKLCLFVGYPKETRGDMFFDPKDNRVLVSTNATFLEENHIRD 301
            WGC  HVL +NPKKLE R KLCLFVGYPK TRG  F+DPKDN+V VSTNATFLEE+HIR+
Sbjct: 679  WGCPAHVLENNPKKLEPRSKLCLFVGYPKGTRGGYFYDPKDNKVFVSTNATFLEEDHIRE 738

Query: 302  HLPRSKIVLNEMDSS----SARVADGASTSTSVVDPSSSSKV-RSQELRMPRRSERVVRQ 361
            H PRSKIVLNE+       S RV +  S  T VV   SS++  + Q LR PRRS RV   
Sbjct: 739  HKPRSKIVLNELSKETTEPSTRVVEEPSALTRVVHVGSSTRTHQPQSLREPRRSGRVTNL 798

Query: 362  LERYMGLAKTLVVTPDDDCEDPLTYDQAMAKVDKNEWIKAMDQEMESMYFNSIWELVDQP 421
              RYM L +TL V  D D EDPLT+ +AM  VDK+EWIKAM+ E+ESMYFNS+W+LVDQP
Sbjct: 799  PIRYMSLTETLTVISDGDIEDPLTFKKAMEDVDKDEWIKAMNLELESMYFNSVWDLVDQP 858

Query: 422  DGVKPIGCKWIYKRKRDVDEKVQTFKARLVAKSFTQVEEVDYKETFSPVAMVKSIRILLA 481
            DGVKPIGCKWIYKRKR  D KVQTFKARLVAK +TQVE VDY+ETFSPVAM+KSIRILL+
Sbjct: 859  DGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEETFSPVAMLKSIRILLS 918

Query: 482  IAVYYDYEVWKMDVKTAFMNDNLDETIYMDQPKGFIAKGQEQKVCRLQRSIYGLKQASRS 541
            IA Y+DYE+W+MDVKTAF+N NL+ETIYM QP+GFI  GQEQK+C+L RSIYGLKQASRS
Sbjct: 919  IAAYFDYEIWQMDVKTAFLNGNLEETIYMQQPEGFIIPGQEQKICKLNRSIYGLKQASRS 978

Query: 542  WNIRFDEAIKSYGFNQNVDEPCVYKKIVNKTIAFLILYVDDILLIGNE------------ 601
            WNIRFD AIKSYGF+Q VDEPCVYK+I+NK++AFL+LYVDDILLIGN+            
Sbjct: 979  WNIRFDTAIKSYGFDQIVDEPCVYKRIINKSVAFLVLYVDDILLIGNDIGLLTDIKQWLA 1038

Query: 602  ------------------------------------------------------------ 661
                                                                        
Sbjct: 1039 TQFQMKDLGEAQFVLGIQIFRDRKNKMLALSQASYIDKIVVKYSMQNSKRGLLPFRHGVT 1098

Query: 662  ---DQCPKTPQKVEDMRRIPYASAVRSLMYAMLCTRPDICYAVGIVSRYQSNQGLDHWTI 715
               +QCPKTPQ VE+MR IPYASAV SLMYAMLCTRPDICYAVGIVSRYQSN GL HWT 
Sbjct: 1099 LSKEQCPKTPQDVEEMRHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLAHWTA 1158

BLAST of Tan0009842 vs. ExPASy TrEMBL
Match: A0A5D3CPJ6 (Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold119G00040 PE=4 SV=1)

HSP 1 Score: 1045.4 bits (2702), Expect = 1.1e-301
Identity = 552/902 (61.20%), Postives = 618/902 (68.51%), Query Frame = 0

Query: 2    RTKRIKVSLKESAHLWHLRLGHINLNKIERLVMSGLLSELEENSLPVCESCLEGKMTKRP 61
            + KR+K+S KE+AHLWHLRLGHINLN+IERLV +GLLSELEENSLPVCESCLEGKMTKRP
Sbjct: 439  QNKRLKISPKENAHLWHLRLGHINLNRIERLVKNGLLSELEENSLPVCESCLEGKMTKRP 498

Query: 62   FSGKGYRAKEPFELIHSDLCGPMNVKARGGYEYFVSFIDDYLRYGYIYLMYRKSETLEKF 121
            F+GKG+RAKEP EL+HSDLCGPMNVKARGG+EYF++F DDY RYGY+YLM  KSE LEKF
Sbjct: 499  FTGKGHRAKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKF 558

Query: 122  KEFKTEVENLL------------------------------------------------- 181
            KE+K EVEN L                                                 
Sbjct: 559  KEYKAEVENALSKTIKTFRSDRGGEYMDLKFQNYLMECGIVSQLSAPGTPQQNGVSERRN 618

Query: 182  ------------------------VEIAVYILNTVPSKSVCETPFELWHGRKISLRHFRI 241
                                    V+ AVYILN VPSKSV ETP +LW+GRK SLRHFRI
Sbjct: 619  RTLLDMVRSMMSYAHLPNSFWGYAVQTAVYILNCVPSKSVSETPLKLWNGRKGSLRHFRI 678

Query: 242  WGCLTHVLVSNPKKLESRLKLCLFVGYPKETRGDMFFDPKDNRVLVSTNATFLEENHIRD 301
            WGC  HVL +NPKKLE R KLCLFVGYPK TRG  F+DPKDN+V VSTNATFLEE+HIR+
Sbjct: 679  WGCPAHVLENNPKKLEPRSKLCLFVGYPKGTRGGYFYDPKDNKVFVSTNATFLEEDHIRE 738

Query: 302  HLPRSKIVLNEMDSS----SARVADGASTSTSVVDPSSSSKV-RSQELRMPRRSERVVRQ 361
            H PRSKIVLNE+       S RV +  S  T VV   SS++  + Q LR PRRS RV   
Sbjct: 739  HKPRSKIVLNELSKETTEPSTRVVEEPSALTRVVHVGSSTRTHQPQSLREPRRSGRVTNL 798

Query: 362  LERYMGLAKTLVVTPDDDCEDPLTYDQAMAKVDKNEWIKAMDQEMESMYFNSIWELVDQP 421
              RYM L +TL V  D D EDPLT+ +AM  VDK+EWIKAM+ E+ESMYFNS+W+LVDQP
Sbjct: 799  PIRYMSLTETLTVISDGDIEDPLTFKKAMEDVDKDEWIKAMNLELESMYFNSVWDLVDQP 858

Query: 422  DGVKPIGCKWIYKRKRDVDEKVQTFKARLVAKSFTQVEEVDYKETFSPVAMVKSIRILLA 481
            DGVKPIGCKWIYKRKR  D KVQTFKARLVAK +TQVE VDY+ETFSPVAM+KSIRILL+
Sbjct: 859  DGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEETFSPVAMLKSIRILLS 918

Query: 482  IAVYYDYEVWKMDVKTAFMNDNLDETIYMDQPKGFIAKGQEQKVCRLQRSIYGLKQASRS 541
            IA Y+DYE+W+MDVKTAF+N NL+ETIYM QP+GFI  GQEQK+C+L RSIYGLKQASRS
Sbjct: 919  IAAYFDYEIWQMDVKTAFLNGNLEETIYMQQPEGFIIPGQEQKICKLNRSIYGLKQASRS 978

Query: 542  WNIRFDEAIKSYGFNQNVDEPCVYKKIVNKTIAFLILYVDDILLIGNE------------ 601
            WNIRFD AIKSYGF+Q VDEPCVYK+I+NK++AFL+LYVDDILLIGN+            
Sbjct: 979  WNIRFDTAIKSYGFDQIVDEPCVYKRIINKSVAFLVLYVDDILLIGNDIGLLTDIKQWLA 1038

Query: 602  ------------------------------------------------------------ 661
                                                                        
Sbjct: 1039 TQFQMKDLGEAQFVLGIQIFRDRKNKMLALSQASYIDKIVVKYSMQNSKRGLLPFRHGVT 1098

Query: 662  ---DQCPKTPQKVEDMRRIPYASAVRSLMYAMLCTRPDICYAVGIVSRYQSNQGLDHWTI 715
               +QCPKTPQ VE+MR IPYASAV SLMYAMLCTRPDICYAVGIVSRYQSN GL HWT 
Sbjct: 1099 LSKEQCPKTPQDVEEMRHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLAHWTA 1158

BLAST of Tan0009842 vs. ExPASy TrEMBL
Match: A0A5A7TZD7 (Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold264G001300 PE=4 SV=1)

HSP 1 Score: 1045.4 bits (2702), Expect = 1.1e-301
Identity = 552/902 (61.20%), Postives = 618/902 (68.51%), Query Frame = 0

Query: 2    RTKRIKVSLKESAHLWHLRLGHINLNKIERLVMSGLLSELEENSLPVCESCLEGKMTKRP 61
            + KR+K+S KE+AHLWHLRLGHINLN+IERLV +GLLSELEENSLPVCESCLEGKMTKRP
Sbjct: 438  QNKRLKISPKENAHLWHLRLGHINLNRIERLVKNGLLSELEENSLPVCESCLEGKMTKRP 497

Query: 62   FSGKGYRAKEPFELIHSDLCGPMNVKARGGYEYFVSFIDDYLRYGYIYLMYRKSETLEKF 121
            F+GKG+RAKEP EL+HSDLCGPMNVKARGG+EYF++F DDY RYGY+YLM  KSE LEKF
Sbjct: 498  FTGKGHRAKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKF 557

Query: 122  KEFKTEVENLL------------------------------------------------- 181
            KE+K EVEN L                                                 
Sbjct: 558  KEYKAEVENALSKTIKTFRSDRGGEYMDLKFQNYLMECGIVSQLSAPGTPQQNGVSERRN 617

Query: 182  ------------------------VEIAVYILNTVPSKSVCETPFELWHGRKISLRHFRI 241
                                    V+ AVYILN VPSKSV ETP +LW+GRK SLRHFRI
Sbjct: 618  RTLLDMVRSMMSYAHLPNSFWGYAVQTAVYILNCVPSKSVSETPLKLWNGRKGSLRHFRI 677

Query: 242  WGCLTHVLVSNPKKLESRLKLCLFVGYPKETRGDMFFDPKDNRVLVSTNATFLEENHIRD 301
            WGC  HVL +NPKKLE R KLCLFVGYPK TRG  F+DPKDN+V VSTNATFLEE+HIR+
Sbjct: 678  WGCPAHVLENNPKKLEPRSKLCLFVGYPKGTRGGYFYDPKDNKVFVSTNATFLEEDHIRE 737

Query: 302  HLPRSKIVLNEMDSS----SARVADGASTSTSVVDPSSSSKV-RSQELRMPRRSERVVRQ 361
            H PRSKIVLNE+       S RV +  S  T VV   SS++  + Q LR PRRS RV   
Sbjct: 738  HKPRSKIVLNELSKETTEPSTRVVEEPSALTRVVHVGSSTRTHQPQSLREPRRSGRVTNL 797

Query: 362  LERYMGLAKTLVVTPDDDCEDPLTYDQAMAKVDKNEWIKAMDQEMESMYFNSIWELVDQP 421
              RYM L +TL V  D D EDPLT+ +AM  VDK+EWIKAM+ E+ESMYFNS+W+LVDQP
Sbjct: 798  PIRYMSLTETLTVISDGDIEDPLTFKKAMEDVDKDEWIKAMNLELESMYFNSVWDLVDQP 857

Query: 422  DGVKPIGCKWIYKRKRDVDEKVQTFKARLVAKSFTQVEEVDYKETFSPVAMVKSIRILLA 481
            DGVKPIGCKWIYKRKR  D KVQTFKARLVAK +TQVE VDY+ETFSPVAM+KSIRILL+
Sbjct: 858  DGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEETFSPVAMLKSIRILLS 917

Query: 482  IAVYYDYEVWKMDVKTAFMNDNLDETIYMDQPKGFIAKGQEQKVCRLQRSIYGLKQASRS 541
            IA Y+DYE+W+MDVKTAF+N NL+ETIYM QP+GFI  GQEQK+C+L RSIYGLKQASRS
Sbjct: 918  IAAYFDYEIWQMDVKTAFLNGNLEETIYMQQPEGFIIPGQEQKICKLNRSIYGLKQASRS 977

Query: 542  WNIRFDEAIKSYGFNQNVDEPCVYKKIVNKTIAFLILYVDDILLIGNE------------ 601
            WNIRFD AIKSYGF+Q VDEPCVYK+I+NK++AFL+LYVDDILLIGN+            
Sbjct: 978  WNIRFDTAIKSYGFDQIVDEPCVYKRIINKSVAFLVLYVDDILLIGNDIGLLTDIKQWLA 1037

Query: 602  ------------------------------------------------------------ 661
                                                                        
Sbjct: 1038 TQFQMKDLGEAQFVLGIQIFRDRKNKMLALSQASYIDKIVVKYSMQNSKRGLLPFRHGVT 1097

Query: 662  ---DQCPKTPQKVEDMRRIPYASAVRSLMYAMLCTRPDICYAVGIVSRYQSNQGLDHWTI 715
               +QCPKTPQ VE+MR IPYASAV SLMYAMLCTRPDICYAVGIVSRYQSN GL HWT 
Sbjct: 1098 LSKEQCPKTPQDVEEMRHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLAHWTA 1157

BLAST of Tan0009842 vs. ExPASy TrEMBL
Match: A0A5A7TZD0 (Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold1163G00090 PE=4 SV=1)

HSP 1 Score: 1032.3 bits (2668), Expect = 9.4e-298
Identity = 544/895 (60.78%), Postives = 616/895 (68.83%), Query Frame = 0

Query: 2    RTKRIKVSLKESAHLWHLRLGHINLNKIERLVMSGLLSELEENSLPVCESCLEGKMTKRP 61
            + KR ++S   + +LWHLRLGHINL++I RLV +GLL++L++ SLP CESCLEGKMTKRP
Sbjct: 336  QNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRP 395

Query: 62   FSGKGYRAKEPFELIHSDLCGPMNVKARGGYEYFVSFIDDYLRYGYIYLMYRKSETLEKF 121
            F+GKGYRAKEP ELIHSDLCGPMNVKARGG+EYF+SFIDDY RYGY+YLM  KSE LEKF
Sbjct: 396  FTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALEKF 455

Query: 122  KEFKTEVENLL------------------------------------------------- 181
            KE+KTEVENLL                                                 
Sbjct: 456  KEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRN 515

Query: 182  ------------------------VEIAVYILNTVPSKSVCETPFELWHGRKISLRHFRI 241
                                    VE AV+ILN VPSKSV ETPFELW GRK SL HFRI
Sbjct: 516  RTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRI 575

Query: 242  WGCLTHVLVSNPKKLESRLKLCLFVGYPKETRGDMFFDPKDNRVLVSTNATFLEENHIRD 301
            WGC  HVLV+NPKKLE R +LC FVGYPKETRG +FFDP++NRV VSTNATFLEE+H+R+
Sbjct: 576  WGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRN 635

Query: 302  HLPRSKIVLNEMDSSSARVADGASTSTSVVDPSSSSKVR-SQELRMPRRSERVVRQLERY 361
            H PRSK+VL+E    S RV D    S+ V + ++S +   SQ LRMPRRS RVV Q  RY
Sbjct: 636  HKPRSKLVLSEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRY 695

Query: 362  MGLAKTLVVTPDDDCEDPLTYDQAMAKVDKNEWIKAMDQEMESMYFNSIWELVDQPDGVK 421
            +GL +T VV PDD  EDPL+Y QAM  VDK++W+KAMD EMESMYFNS+WELVD P+GVK
Sbjct: 696  LGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVK 755

Query: 422  PIGCKWIYKRKRDVDEKVQTFKARLVAKSFTQVEEVDYKETFSPVAMVKSIRILLAIAVY 481
            PIGCKWIYKRKRD   KVQTFKARLVAK +TQ E VDY+ETFSPVAM+KSIRILL+IA +
Sbjct: 756  PIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATF 815

Query: 482  YDYEVWKMDVKTAFMNDNLDETIYMDQPKGFIAKGQEQKVCRLQRSIYGLKQASRSWNIR 541
            YDYE+W+MDVKTAF+N NL+E+I+M QP+GFI +GQEQKVC+L RSIYGLKQASRSWNIR
Sbjct: 816  YDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIR 875

Query: 542  FDEAIKSYGFNQNVDEPCVYKKIVNKTIAFLILYVDDILLIGNE---------------- 601
            FD AIKSYGF+QNVDEPCVYKKI    +AFL+LYVDDILLIGN+                
Sbjct: 876  FDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQ 935

Query: 602  -----------------------------------------------------------D 661
                                                                       +
Sbjct: 936  MKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKE 995

Query: 662  QCPKTPQKVEDMRRIPYASAVRSLMYAMLCTRPDICYAVGIVSRYQSNQGLDHWTIVKAI 712
            Q PKTPQ+VEDMRRIPYASAV SLMYAMLCTRPDICYAVGIVSRYQSN GLDHWT VK +
Sbjct: 996  QSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIV 1055

BLAST of Tan0009842 vs. ExPASy TrEMBL
Match: A0A5A7SIN2 (Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold2405G00060 PE=4 SV=1)

HSP 1 Score: 1030.0 bits (2662), Expect = 4.6e-297
Identity = 526/782 (67.26%), Postives = 599/782 (76.60%), Query Frame = 0

Query: 20  RLGHINLNKIERLVMSGLLSELEENSLPVCESCLEGKMTKRPFSGKGYRAKEPFELIHSD 79
           +LGHINL++I RLV +GLL++L+++SLP CESCLEGKMTKRPF+ KGYRAKEP ELIHSD
Sbjct: 217 KLGHINLDRIGRLVKNGLLNKLKDDSLPPCESCLEGKMTKRPFTEKGYRAKEPLELIHSD 276

Query: 80  LCGPMNVKARGGYEYFVSFIDDYLRYGYIYLMYRKSETLEKFKEFKTEVENLL------- 139
           LCG MNVKARGG+EYF+SFIDDY RYGY+YLM  KSE LEKFKE+KTEVENLL       
Sbjct: 277 LCGLMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKIL 336

Query: 140 -----------------------------VEIAVYILNTVPSKSVCETPFELWHGRKISL 199
                                        VE AV+ILN  PSKSV ETPFELW GRK SL
Sbjct: 337 RSDRGGEYMDLRFQDYMIEHGIQSQLSTPVETAVHILNNAPSKSVSETPFELWRGRKPSL 396

Query: 200 RHFRIWGCLTHVLVSNPKKLESRLKLCLFVGYPKETRGDMFFDPKDNRVLVSTNATFLEE 259
            HFRIWGC THVLV+NPKKL+SR +LC FVGYPKETRG +FFDP++NRV VSTNATFLEE
Sbjct: 397 SHFRIWGCPTHVLVTNPKKLKSRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEE 456

Query: 260 NHIRDHLPRSKIVLNEMDSSSARVADGASTSTSVVDPSSSSKVR-SQELRMPRRSERVVR 319
           +H+R+H PRSK+VL+E    S RV D    S+ V + ++S +   SQ LRMPRRS RVV 
Sbjct: 457 DHMRNHKPRSKLVLSEATDKSTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVS 516

Query: 320 QLERYMGLAKTLVVTPDDDCEDPLTYDQAMAKVDKNEWIKAMDQEMESMYFNSIWELVDQ 379
           Q  RY+GL +T  V PDD  EDPL+Y QAM  VDK++W+KAMD EMESMYFN +WELVD 
Sbjct: 517 QPNRYLGLTETQAVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNLVWELVDL 576

Query: 380 PDGVKPIGCKWIYKRKRDVDEKVQTFKARLVAKSFTQVEEVDYKETFSPVAMVKSIRILL 439
           P+GVKPIGCKWIYKRKRD   KVQTFKARLVAK +TQ E VDY+ETFSPVAM+KSIRILL
Sbjct: 577 PEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQRERVDYEETFSPVAMLKSIRILL 636

Query: 440 AIAVYYDYEVWKMDVKTAFMNDNLDETIYMDQPKGFIAKGQEQKVCRLQRSIYGLKQASR 499
           +IA +YDYE+W+MDVKTAF+N NL+E+I+M +P+GFI +GQEQKVC+L RSIYGLKQAS+
Sbjct: 637 SIATFYDYEIWQMDVKTAFLNGNLEESIFMSKPEGFITRGQEQKVCKLNRSIYGLKQASK 696

Query: 500 SWNIRFDEAIKSYGFNQNVDEPCVYKKIVNKTIAFLILYVDDILLIGNE----------- 559
           SWNIRFD AIKSYGF+QNVDEPC+YKKI    +AFL+LYVDDIL IGN+           
Sbjct: 697 SWNIRFDIAIKSYGFDQNVDEPCLYKKINKGKVAFLVLYVDDILFIGNDMGYLTDVKAWH 756

Query: 560 ------DQCPKTPQKVEDMRRIPYASAVRSLMYAMLCTRPDICYAVGIVSRYQSNQGLDH 619
                 +QCPKTPQ+VEDMRRIPYASAV SLMYAMLCTRPDICYAVGIVSRYQSN GLDH
Sbjct: 757 GVHLSKEQCPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDH 816

Query: 620 WTIVKAILKYLRRMRNYNLVYDRGDLILT------------------------------- 679
           WT VK ILKYLRR R+Y LVY   DLILT                               
Sbjct: 817 WTAVKIILKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTLGSVFTLNGGAVVW 876

Query: 680 -----GCIANSTMEAEYVVACEAEKEAVWLRKFMMDLEVVPNMNLSITLFCDNSGAVANS 712
                GCIA+STMEAEYV ACEA KEAVWLRKF+ DLEVVPNMNL ITL+ DNS AVANS
Sbjct: 877 RSIKQGCIADSTMEAEYVAACEAVKEAVWLRKFLHDLEVVPNMNLPITLYYDNSEAVANS 936

BLAST of Tan0009842 vs. TAIR 10
Match: AT4G23160.1 (cysteine-rich RLK (RECEPTOR-like protein kinase) 8 )

HSP 1 Score: 196.8 bits (499), Expect = 5.8e-50
Identity = 137/475 (28.84%), Postives = 212/475 (44.63%), Query Frame = 0

Query: 303 EDPLTYDQAMAKVDKNEWIKAMDQEMESMYFNSIWELVDQPDGVKPIGCKWIYKRKRDVD 362
           ++P TY++A   +    W  AMD E+ +M     WE+   P   KPIGCKW+YK K + D
Sbjct: 84  KEPSTYNEAKEFL---VWCGAMDDEIGAMETTHTWEICTLPPNKKPIGCKWVYKIKYNSD 143

Query: 363 EKVQTFKARLVAKSFTQVEEVDYKETFSPVAMVKSIRILLAIAVYYDYEVWKMDVKTAFM 422
             ++ +KARLVAK +TQ E +D+ ETFSPV  + S++++LAI+  Y++ + ++D+  AF+
Sbjct: 144 GTIERYKARLVAKGYTQQEGIDFIETFSPVCKLTSVKLILAISAIYNFTLHQLDISNAFL 203

Query: 423 NDNLDETIYMDQPKGFIAKGQE----QKVCRLQRSIYGLKQASRSWNIRFDEAIKSYGFN 482
           N +LDE IYM  P G+ A+  +      VC L++SIYGLKQASR W ++F   +  +GF 
Sbjct: 204 NGDLDEEIYMKLPPGYAARQGDSLPPNAVCYLKKSIYGLKQASRQWFLKFSVTLIGFGFV 263

Query: 483 QNVDEPCVYKKIVNKTIAFLILYVDDILLIGNEDQCP-------KTPQKVEDMRRIPY-- 542
           Q+  +   + KI       +++YVDDI++  N D          K+  K+ D+  + Y  
Sbjct: 264 QSHSDHTYFLKITATLFLCVLVYVDDIIICSNNDAAVDELKSQLKSCFKLRDLGPLKYFL 323

Query: 543 ------------------------------------------------------ASAVRS 602
                                                                 A A R 
Sbjct: 324 GLEIARSAAGINICQRKYALDLLDETGLLGCKPSSVPMDPSVTFSAHSGGDFVDAKAYRR 383

Query: 603 LMYAML---CTRPDICYAVGIVSRYQSNQGLDHWTIVKAILKYLRRMRNYNLVYDR---- 662
           L+  ++    TR DI +AV  +S++     L H   V  IL Y++      L Y      
Sbjct: 384 LIGRLMYLQITRLDISFAVNKLSQFSEAPRLAHQQAVMKILHYIKGTVGQGLFYSSQAEM 443

Query: 663 -----GDLILTGC----------------------------IANSTMEAEYVVACEAEKE 671
                 D     C                            ++ S+ EAEY     A  E
Sbjct: 444 QLQVFSDASFQSCKDTRRSTNGYCMFLGTSLISWKSKKQQVVSKSSAEAEYRALSFATDE 503

BLAST of Tan0009842 vs. TAIR 10
Match: ATMG00820.1 (Reverse transcriptase (RNA-dependent DNA polymerase) )

HSP 1 Score: 73.9 bits (180), Expect = 5.7e-13
Identity = 35/86 (40.70%), Postives = 51/86 (59.30%), Query Frame = 0

Query: 320 WIKAMDQEMESMYFNSIWELVDQPDGVKPIGCKWIYKRKRDVDEKVQTFKARLVAKSFTQ 379
           W +AM +E++++  N  W LV  P     +GCKW++K K   D  +   KARLVAK F Q
Sbjct: 40  WCQAMQEELDALSRNKTWILVPPPVNQNILGCKWVFKTKLHSDGTLDRLKARLVAKGFHQ 99

Query: 380 VEEVDYKETFSPVAMVKSIRILLAIA 406
            E + + ET+SPV    +IR +L +A
Sbjct: 100 EEGIYFVETYSPVVRTATIRTILNVA 125

BLAST of Tan0009842 vs. TAIR 10
Match: ATMG00300.1 (Gag-Pol-related retrotransposon family protein )

HSP 1 Score: 61.6 bits (148), Expect = 2.9e-09
Identity = 29/76 (38.16%), Postives = 41/76 (53.95%), Query Frame = 0

Query: 11  KESAHLWHLRLGHINLNKIERLVMSGLLSELEENSLPVCESCLEGKMTKRPFSGKGYRAK 70
           K+   LWH RL H++   +E LV  G L   + +SL  CE C+ GK  +  FS   +  K
Sbjct: 66  KDETRLWHSRLAHMSQRGMELLVKKGFLDSSKVSSLKFCEDCIYGKTHRVNFSTGQHTTK 125

Query: 71  EPFELIHSDLCGPMNV 87
            P + +HSDL G  +V
Sbjct: 126 NPLDYVHSDLWGAPSV 141

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
P109787.1e-10928.59Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum... [more]
P041462.1e-7324.43Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3[more]
Q9ZT941.5e-5026.04Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana O... [more]
Q94HW23.7e-4926.72Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana O... [more]
P256007.3e-1324.28Putative transposon Ty5-1 protein YCL074W OS=Saccharomyces cerevisiae (strain AT... [more]
Match NameE-valueIdentityDescription
KAA0048404.12.2e-30161.20gag/pol protein [Cucumis melo var. makuwa][more]
KAA0035879.12.2e-30161.20gag/pol protein [Cucumis melo var. makuwa] >KAA0044276.1 gag/pol protein [Cucumi... [more]
KAA0031826.12.2e-30161.20gag/pol protein [Cucumis melo var. makuwa] >KAA0032384.1 gag/pol protein [Cucumi... [more]
KAA0025945.11.9e-29760.78gag/pol protein [Cucumis melo var. makuwa] >KAA0026303.1 gag/pol protein [Cucumi... [more]
KAA0025159.19.6e-29767.26gag/pol protein [Cucumis melo var. makuwa][more]
Match NameE-valueIdentityDescription
A0A5A7SMH81.1e-30161.20Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold219G0025... [more]
A0A5D3CPJ61.1e-30161.20Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold119G0004... [more]
A0A5A7TZD71.1e-30161.20Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold264G0013... [more]
A0A5A7TZD09.4e-29860.78Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold1163G000... [more]
A0A5A7SIN24.6e-29767.26Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold2405G000... [more]
Match NameE-valueIdentityDescription
AT4G23160.15.8e-5028.84cysteine-rich RLK (RECEPTOR-like protein kinase) 8 [more]
ATMG00820.15.7e-1340.70Reverse transcriptase (RNA-dependent DNA polymerase) [more]
ATMG00300.12.9e-0938.16Gag-Pol-related retrotransposon family protein [more]
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR013103Reverse transcriptase, RNA-dependent DNA polymerasePFAMPF07727RVT_2coord: 334..511
e-value: 1.8E-56
score: 191.4
IPR025724GAG-pre-integrase domainPFAMPF13976gag_pre-integrscoord: 11..57
e-value: 1.7E-12
score: 47.0
IPR036397Ribonuclease H superfamilyGENE3D3.30.420.10coord: 65..132
e-value: 5.8E-8
score: 34.2
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 246..274
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 246..268
NoneNo IPR availablePANTHERPTHR45895FAMILY NOT NAMEDcoord: 18..100
coord: 133..513
NoneNo IPR availableCDDcd09272RNase_HI_RT_Ty1coord: 596..699
e-value: 5.71593E-36
score: 130.281
IPR012337Ribonuclease H-like superfamilySUPERFAMILY53098Ribonuclease H-likecoord: 71..168
IPR043502DNA/RNA polymerase superfamilySUPERFAMILY56672DNA/RNA polymerasescoord: 334..666

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Tan0009842.1Tan0009842.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0043170 macromolecule metabolic process
biological_process GO:0006807 nitrogen compound metabolic process
biological_process GO:0044238 primary metabolic process
molecular_function GO:0003676 nucleic acid binding