Cmc02g0052001 (gene) Melon (Charmono) v1.1

Overview
Name: Cmc02g0052001
Type: gene
Organism: Cucumis melo L. var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Location: CMiso1.1chr02: 19069272 .. 19070147 (-)
RNA-Seq Expression: Cmc02g0052001
Synteny: Cmc02g0052001
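
As a quick consistency check on the location given in the overview, the span can be compared with the length of the sequences listed in this record. The sketch below assumes the coordinates are 1-based and inclusive (the usual convention for this kind of record, but not stated on the page); `span_length` is a hypothetical helper name, not part of any database API.

```python
def span_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1

# Coordinates copied from the Location field of this record.
length_nt = span_length(19069272, 19070147)

# A single-exon coding feature should span a whole number of codons.
codons, remainder = divmod(length_nt, 3)

print(length_nt, codons, remainder)
```

Under that assumption the span is 876 nt, that is 292 codons: the 291 residues of the protein sequence plus the terminal stop codon.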
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTTCAAAGTCTTTGTAACTGAAATAGAGAACCAGTTTAACAAAAGAATTAAGAGACTTCGTAGTGATAGAGGAACTGAATACGATTCAGTTGCTTTCAATGAGTTTTATAGCTCAAAAGGAATAATACATGAAACTACTGCGCCTTATTCTCCTGAAATGAATGGAAAAGCAGAAAGAAAGAATAGAACTCTAACTGAGTTAGTAGTTGCTATCTTACTTGAGTCAGGAGCCGCACCATCTTGGTGGGAAGAAATAATTAAGAGTGTTAATTATGTTCTTAATAGGATTCCTAAATCAAACAGTAAAACTTCACTATACGAAGTCCTTAAACATAAAACACCAAACCTATCTTATCTTCGAACTTGGGGTTGTCTAGCTTATGTTAGAATACCTGATCCAAAAAGAAGAAAATTAGCAAGTAGAGCCTATGAATGTGTTTTCATAGGATATGCTGAAAATAGTAAAACCTATAGATTCTATGACTTAGAAAACAAAGTAATTATAGAATCGAATGACGTAGATTTTTTCGAGGACATATTTCCTTTTAAATCTAGAAATAGTGGGGGCCTATGTAGTCAAACTAGTGGGGGCTCAAGTTCCAATAGTCTACCTTCAATTAGGATCCAAACCCAAGACAAGGAAGTAGATCCTGAACCTAGAAGAAACAAGAGAGCTAAAACAGTAAAAGACTTTGGAGAAGACTTCGAAATGTACAACGTAGAAGATCCAAAAGATCTAACAGAAGCATTGTCATCAGTAGATGCTAATTTATGGCAAGAAGCTATCAATGATGAAATGAACTCTCTTGAATCCAATAGAACTTGGCACCTAGTTGACTTACCCCCTGGATGTAAAGCTATAGGCTGCAAATGA

mRNA sequence

ATGTTCAAAGTCTTTGTAACTGAAATAGAGAACCAGTTTAACAAAAGAATTAAGAGACTTCGTAGTGATAGAGGAACTGAATACGATTCAGTTGCTTTCAATGAGTTTTATAGCTCAAAAGGAATAATACATGAAACTACTGCGCCTTATTCTCCTGAAATGAATGGAAAAGCAGAAAGAAAGAATAGAACTCTAACTGAGTTAGTAGTTGCTATCTTACTTGAGTCAGGAGCCGCACCATCTTGGTGGGAAGAAATAATTAAGAGTGTTAATTATGTTCTTAATAGGATTCCTAAATCAAACAGTAAAACTTCACTATACGAAGTCCTTAAACATAAAACACCAAACCTATCTTATCTTCGAACTTGGGGTTGTCTAGCTTATGTTAGAATACCTGATCCAAAAAGAAGAAAATTAGCAAGTAGAGCCTATGAATGTGTTTTCATAGGATATGCTGAAAATAGTAAAACCTATAGATTCTATGACTTAGAAAACAAAGTAATTATAGAATCGAATGACGTAGATTTTTTCGAGGACATATTTCCTTTTAAATCTAGAAATAGTGGGGGCCTATGTAGTCAAACTAGTGGGGGCTCAAGTTCCAATAGTCTACCTTCAATTAGGATCCAAACCCAAGACAAGGAAGTAGATCCTGAACCTAGAAGAAACAAGAGAGCTAAAACAGTAAAAGACTTTGGAGAAGACTTCGAAATGTACAACGTAGAAGATCCAAAAGATCTAACAGAAGCATTGTCATCAGTAGATGCTAATTTATGGCAAGAAGCTATCAATGATGAAATGAACTCTCTTGAATCCAATAGAACTTGGCACCTAGTTGACTTACCCCCTGGATGTAAAGCTATAGGCTGCAAATGA

Coding sequence (CDS)

ATGTTCAAAGTCTTTGTAACTGAAATAGAGAACCAGTTTAACAAAAGAATTAAGAGACTTCGTAGTGATAGAGGAACTGAATACGATTCAGTTGCTTTCAATGAGTTTTATAGCTCAAAAGGAATAATACATGAAACTACTGCGCCTTATTCTCCTGAAATGAATGGAAAAGCAGAAAGAAAGAATAGAACTCTAACTGAGTTAGTAGTTGCTATCTTACTTGAGTCAGGAGCCGCACCATCTTGGTGGGAAGAAATAATTAAGAGTGTTAATTATGTTCTTAATAGGATTCCTAAATCAAACAGTAAAACTTCACTATACGAAGTCCTTAAACATAAAACACCAAACCTATCTTATCTTCGAACTTGGGGTTGTCTAGCTTATGTTAGAATACCTGATCCAAAAAGAAGAAAATTAGCAAGTAGAGCCTATGAATGTGTTTTCATAGGATATGCTGAAAATAGTAAAACCTATAGATTCTATGACTTAGAAAACAAAGTAATTATAGAATCGAATGACGTAGATTTTTTCGAGGACATATTTCCTTTTAAATCTAGAAATAGTGGGGGCCTATGTAGTCAAACTAGTGGGGGCTCAAGTTCCAATAGTCTACCTTCAATTAGGATCCAAACCCAAGACAAGGAAGTAGATCCTGAACCTAGAAGAAACAAGAGAGCTAAAACAGTAAAAGACTTTGGAGAAGACTTCGAAATGTACAACGTAGAAGATCCAAAAGATCTAACAGAAGCATTGTCATCAGTAGATGCTAATTTATGGCAAGAAGCTATCAATGATGAAATGAACTCTCTTGAATCCAATAGAACTTGGCACCTAGTTGACTTACCCCCTGGATGTAAAGCTATAGGCTGCAAATGA

Protein sequence

MFKVFVTEIENQFNKRIKRLRSDRGTEYDSVAFNEFYSSKGIIHETTAPYSPEMNGKAERKNRTLTELVVAILLESGAAPSWWEEIIKSVNYVLNRIPKSNSKTSLYEVLKHKTPNLSYLRTWGCLAYVRIPDPKRRKLASRAYECVFIGYAENSKTYRFYDLENKVIIESNDVDFFEDIFPFKSRNSGGLCSQTSGGSSSNSLPSIRIQTQDKEVDPEPRRNKRAKTVKDFGEDFEMYNVEDPKDLTEALSSVDANLWQEAINDEMNSLESNRTWHLVDLPPGCKAIGCK
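
The relationship between the coding sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the standard genetic code (NCBI translation table 1) and reading frame 0; the `translate` function and the table construction are illustrative only and are not the pipeline that produced this annotation. Only the first and last few codons of the CDS are used, to keep the example short.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1),
# with codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon: end of the polypeptide
            break
        protein.append(aa)
    return "".join(protein)

# First 10 codons of the CDS listed above.
print(translate("ATGTTCAAAGTCTTTGTAACTGAAATAGAG"))
# Last 4 codons of the CDS, ending in the TGA stop codon.
print(translate("GGCTGCAAATGA"))
```

The two fragments translate to `MFKVFVTEIE` and `GCK`, which match the start and end of the protein sequence in this record.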
Homology
BLAST of Cmc02g0052001 vs. NCBI nr
Match: KAA0034938.1 (putative Polyprotein [Cucumis melo var. makuwa] >TYK21293.1 putative Polyprotein [Cucumis melo var. makuwa])

HSP 1 Score: 525.4 bits (1352), Expect = 3.1e-145
Identity = 260/285 (91.23%), Positives = 271/285 (95.09%), Query Frame = 0

Query: 1   MFKVFVTEIENQFNKRIKRLRSDRGTEYDSVAFNEFYSSKGIIHETTAPYSPEMNGKAER 60
           MFKVFVTEIENQFNKRIKRLRSDRGTEYDSVAFNEFY+SKGIIHETT PYSPEMNGK ER
Sbjct: 403 MFKVFVTEIENQFNKRIKRLRSDRGTEYDSVAFNEFYNSKGIIHETTTPYSPEMNGKEER 462

Query: 61  KNRTLTELVVAILLESGAAPSWWEEIIKSVNYVLNRIPKSNSKTSLYEVLKHKTPNLSYL 120
           KNRTLTEL VAILLES AAPSWW EIIK+VNYVLNRIPKSNSKTS YEVLKHK PNLSYL
Sbjct: 463 KNRTLTELAVAILLESEAAPSWWGEIIKTVNYVLNRIPKSNSKTSPYEVLKHKIPNLSYL 522

Query: 121 RTWGCLAYVRIPDPKRRKLASRAYECVFIGYAENSKTYRFYDLENKVIIESNDVDFFEDI 180
           RTWGCLAYVRIP+P+RRKLAS+AYECVFIGYAENSK YRFYDLENKVIIESNDVDFFED 
Sbjct: 523 RTWGCLAYVRIPNPERRKLASKAYECVFIGYAENSKAYRFYDLENKVIIESNDVDFFEDK 582

Query: 181 FPFKSRNSGGLCSQTSGGSSSNSLPSIRIQTQDKEVDPEPRRNKRAKTVKDFGEDFEMYN 240
           FPFKSRNSGGL SQTSGGSS +SLPSIRIQTQDKEVDPEPRR+KRA+TVKDF EDFEMYN
Sbjct: 583 FPFKSRNSGGLYSQTSGGSSFSSLPSIRIQTQDKEVDPEPRRSKRARTVKDFREDFEMYN 642

Query: 241 VEDPKDLTEALSSVDANLWQEAINDEMNSLESNRTWHLVDLPPGC 286
           VEDPKDLT+ALSSVDANLWQEAIND ++SLESNRTWHLVDLPP C
Sbjct: 643 VEDPKDLTKALSSVDANLWQEAINDGIDSLESNRTWHLVDLPPRC 687
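
The percentages in each HSP header are the match counts divided by the alignment length. A minimal sketch of that arithmetic, using the counts from the hit above; `percent` is a hypothetical helper, and rounding to two decimals is an assumption based on how the values are displayed here.

```python
def percent(matches: int, aligned_length: int) -> float:
    """Share of aligned columns that match, as a percentage rounded to 2 decimals."""
    return round(100 * matches / aligned_length, 2)

# Counts taken from the HSP header of the KAA0034938.1 hit.
print(percent(260, 285))  # identical residues
print(percent(271, 285))  # identical or conservatively substituted residues
```

These evaluate to 91.23 and 95.09, agreeing with the header of this HSP.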

BLAST of Cmc02g0052001 vs. NCBI nr
Match: RZC09450.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Glycine soja])

HSP 1 Score: 414.5 bits (1064), Expect = 7.8e-112
Identity = 210/293 (71.67%), Positives = 237/293 (80.89%), Query Frame = 0

Query: 1   MFKVFVTEIENQFNKRIKRLRSDRGTEYDSVAFNEFYSSKGIIHETTAPYSPEMNGKAER 60
           MFK+FVTEIENQFNK+IK+LRSDRGT+YDS  FNEFY+  GIIHETTAPYSPEMNGKAER
Sbjct: 10  MFKLFVTEIENQFNKKIKKLRSDRGTKYDSSLFNEFYNLHGIIHETTAPYSPEMNGKAER 69

Query: 61  KNRTLTELVVAILLESGAAPSWWEEIIKSVNYVLNRIPKSNSKTSLYEVLKHKTPNLSYL 120
           KNRT TELVVA +L S A   WW EI+ +V YVLNRIPKS SKTS YE+LK + PNLSYL
Sbjct: 70  KNRTFTELVVATMLSSSATSFWWGEILLTVCYVLNRIPKSKSKTSPYEILKKRQPNLSYL 129

Query: 121 RTWGCLAYVRIPDPKRRKLASRAYECVFIGYAENSKTYRFYDLENKVIIESNDVDFFEDI 180
           RTWGCLAYVRIPDPKR KLASRAYECVFIGYA NSK YRFYDL  KVIIESND DF+E+ 
Sbjct: 130 RTWGCLAYVRIPDPKRVKLASRAYECVFIGYAINSKAYRFYDLNAKVIIESNDADFYENK 189

Query: 181 FPFKSRNSGGLCSQTSGGSSSNSLPSIRIQT-QDKEVDPEPRRNKRAKTVKDFGEDFEMY 240
           FPFK R+        SGG+SSN LP+I  +     + D EPRR KRA+  KD+G D+  Y
Sbjct: 190 FPFKLRD--------SGGTSSNYLPAISSENLAQPKPDIEPRRGKRARIAKDYGPDYMAY 249

Query: 241 NV-EDPKDLTEALSSVDANLWQEAINDEMNSLESNRTWHLVDLPPGCKAIGCK 292
            + EDP +L EALS +DA+LWQEAINDEM+SLES++TWHLVDLPPGCK IGCK
Sbjct: 250 TLEEDPSNLQEALSFLDADLWQEAINDEMDSLESDKTWHLVDLPPGCKPIGCK 294

BLAST of Cmc02g0052001 vs. NCBI nr
Match: AAU90333.1 (Putative gag and pol polyprotein, identical [Solanum demissum])

HSP 1 Score: 310.8 bits (795), Expect = 1.2e-80
Identity = 163/293 (55.63%), Positives = 205/293 (69.97%), Query Frame = 0

Query: 2   FKVFVTEIENQFNKRIKRLRSDRGTEYDSVAFNEFYSSKGIIHETTAPYSPEMNGKAERK 61
           FK ++ E+ENQF ++IKR+RSDRG EY+S  FN F  S GIIHETT PYSP  NG AERK
Sbjct: 344 FKTYLHEVENQFGRKIKRIRSDRGREYESNEFNSFVRSLGIIHETTPPYSPSSNGAAERK 403

Query: 62  NRTLTELVVAILLESGAAPSWWEEIIKSVNYVLNRIPKSNSKTSLYEVLKHKTPNLSYLR 121
           NRTL EL  A+L+ES A  ++W E I +  YVLNR+P   SK + +E+ K   P+L YLR
Sbjct: 404 NRTLVELTNAMLIESHAPLNFWGETILTACYVLNRVPHKKSKLTHFELWKGYKPSLGYLR 463

Query: 122 TWGCLAYVRIPDPKRRKLASRAYECVFIGYAENSKTYRFYDLENKVIIESNDVDFFEDIF 181
            WGCLA+VR+ DPK  KL  +   C F+GYA NS  YRF++LE+ ++IES D  F E+ F
Sbjct: 464 VWGCLAFVRLMDPKITKLGKKVTTCAFLGYASNSTAYRFFNLEDNIVIESGDAIFHENKF 523

Query: 182 PFKSRNSGGLCSQTSGGSSSNSLPSIRIQT-QDKEV-DPEPRRNKRAKTVKDFGEDFEMY 241
           PF S+NSGG   +     +  SLPS    T ++KEV D E RR+KRA+  KDFG +F ++
Sbjct: 524 PFDSKNSGGQRIE----QNILSLPSSSTSTLKNKEVNDFELRRSKRARIEKDFGPNFYVF 583

Query: 242 NV-EDPKDLTEALSSVDANLWQEAINDEMNSLESNRTWHLVDLPPGCKAIGCK 292
           NV +DP  L EALSS D+  W+EA+NDEM SL SN+TW LVDLPPGCK IGCK
Sbjct: 584 NVGDDPLTLKEALSSHDSIFWKEAVNDEMESLISNKTWKLVDLPPGCKTIGCK 632

BLAST of Cmc02g0052001 vs. NCBI nr
Match: ABI34306.1 (Polyprotein, putative [Solanum demissum])

HSP 1 Score: 308.5 bits (789), Expect = 6.0e-80
Identity = 162/293 (55.29%), Positives = 204/293 (69.62%), Query Frame = 0

Query: 2   FKVFVTEIENQFNKRIKRLRSDRGTEYDSVAFNEFYSSKGIIHETTAPYSPEMNGKAERK 61
           FK ++ E+ENQF ++IKR+RSDRG EY+S  FN F  S GIIHETT PYSP  NG AERK
Sbjct: 543 FKTYLHEVENQFGRKIKRIRSDRGREYESNEFNSFVRSLGIIHETTPPYSPSSNGVAERK 602

Query: 62  NRTLTELVVAILLESGAAPSWWEEIIKSVNYVLNRIPKSNSKTSLYEVLKHKTPNLSYLR 121
           NRTL EL  A+L+ES A  ++W E I +  YVLNR+P   SK + +E+ K   P+L YLR
Sbjct: 603 NRTLVELTNAMLIESHAPLNFWGEAILTACYVLNRVPHKKSKLTPFELWKGYKPSLGYLR 662

Query: 122 TWGCLAYVRIPDPKRRKLASRAYECVFIGYAENSKTYRFYDLENKVIIESNDVDFFEDIF 181
            WGCLA+VR+ DPK  KL  +   C F+GYA NS  YRF++LE+ ++IES D  F E+ F
Sbjct: 663 VWGCLAFVRLMDPKITKLGKKVTTCAFLGYASNSTAYRFFNLEDNIVIESGDAIFHENKF 722

Query: 182 PFKSRNSGGLCSQTSGGSSSNSLPSIRIQT-QDKEV-DPEPRRNKRAKTVKDFGEDFEMY 241
           PF S+NSGG   +     +  +LPS    T ++KEV D E RR+KRA+  KDFG DF ++
Sbjct: 723 PFDSKNSGGQRIE----QNILTLPSSSTSTLKNKEVNDFELRRSKRARVEKDFGPDFYVF 782

Query: 242 NVEDPK-DLTEALSSVDANLWQEAINDEMNSLESNRTWHLVDLPPGCKAIGCK 292
           NV D +  L EALSS D+  W+EA+NDEM SL SN+TW LVDLPPGCK IGCK
Sbjct: 783 NVGDDRLTLKEALSSHDSIFWKEAVNDEMESLISNKTWKLVDLPPGCKTIGCK 831

BLAST of Cmc02g0052001 vs. NCBI nr
Match: XP_023158131.2 (uncharacterized protein LOC103653943 isoform X1 [Zea mays] >XP_035823266.1 uncharacterized protein LOC103653943 isoform X1 [Zea mays])

HSP 1 Score: 287.3 bits (734), Expect = 1.4e-73
Identity = 150/298 (50.34%), Positives = 196/298 (65.77%), Query Frame = 0

Query: 2   FKVFVTEIENQFNKRIKRLRSDRGTEYDSVAFNEFYSSKGIIHETTAPYSPEMNGKAERK 61
           FK++ TE+ENQ +K+IKRLRSDRG EY S  F+E+    GIIHETTAPYSP+ NG AERK
Sbjct: 623 FKIYKTEVENQLDKKIKRLRSDRGGEYLSNLFDEYCKECGIIHETTAPYSPQSNGVAERK 682

Query: 62  NRTLTELVVAILLESGAAPSWWEEIIKSVNYVLNRIPKSNSKTSLYEVLKHKTPNLSYLR 121
           NRT+ +L  A+L  SG    WW E + +V YVLNR+P  N + + YE  K + P+LS+LR
Sbjct: 683 NRTVCDLANALLQSSGMPDIWWGEAVLTVCYVLNRVPPRNREATPYEGFKGRKPDLSHLR 742

Query: 122 TWGCLAYVRIPDPKRRKLASRAYECVFIGYAENSKTYRFYDLENK-------VIIESNDV 181
           TWGCLA V +P PK+RKL  +  +CVF+GYA NS  YRF  + ++       VI+ES DV
Sbjct: 743 TWGCLAKVNVPLPKKRKLGPKTVDCVFLGYAHNSAAYRFLVVHSETSEVAINVIMESRDV 802

Query: 182 DFFEDIFPFKSRNSGGLCSQTSGGSSSNSLPSIRIQTQDKEVDPEPRRNKRAKTVKDFGE 241
            FFE IFP + +          G S + SLPS      D+  D E RR+KR +T K  G+
Sbjct: 803 TFFESIFPMRDKE----VVAPDGPSRTYSLPS---SVNDQTPDLELRRSKRQRTEKSLGD 862

Query: 242 DFEMYNV-EDPKDLTEALSSVDANLWQEAINDEMNSLESNRTWHLVDLPPGCKAIGCK 292
           D+ +Y V E+P+ LTEA +S DA  W+EA+  EM+S+ SN TW + DLP GCK +GCK
Sbjct: 863 DYIIYLVDEEPRSLTEAYTSPDAEYWREAVRSEMDSIISNGTWEITDLPAGCKPVGCK 913

BLAST of Cmc02g0052001 vs. ExPASy Swiss-Prot
Match: P10978 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum OX=4097 PE=2 SV=1)

HSP 1 Score: 152.5 bits (384), Expect = 7.2e-36
Identity = 105/331 (31.72%), Positives = 156/331 (47.13%), Query Frame = 0

Query: 1   MFKVFVTEIENQFNKRIKRLRSDRGTEYDSVAFNEFYSSKGIIHETTAPYSPEMNGKAER 60
           +F+ F   +E +  +++KRLRSD G EY S  F E+ SS GI HE T P +P+ NG AER
Sbjct: 528 VFQKFHALVERETGRKLKRLRSDNGGEYTSREFEEYCSSHGIRHEKTVPGTPQHNGVAER 587

Query: 61  KNRTLTELVVAILLESGAAPSWWEEIIKSVNYVLNRIPKSNSKTSLYE-VLKHKTPNLSY 120
            NRT+ E V ++L  +    S+W E +++  Y++NR P       + E V  +K  + S+
Sbjct: 588 MNRTIVEKVRSMLRMAKLPKSFWGEAVQTACYLINRSPSVPLAFEIPERVWTNKEVSYSH 647

Query: 121 LRTWGCLAYVRIPDPKRRKLASRAYECVFIGYAENSKTYRFYDLENKVIIESNDVDFFED 180
           L+ +GC A+  +P  +R KL  ++  C+FIGY +    YR +D   K +I S DV F E 
Sbjct: 648 LKVFGCRAFAHVPKEQRTKLDDKSIPCIFIGYGDEEFGYRLWDPVKKKVIRSRDVVFRES 707

Query: 181 ---------------------IFPFKSRNSGGLCSQTSGGSSSNSLPSIRIQ-------- 240
                                  P  S N     S T   S     P   I+        
Sbjct: 708 EVRTAADMSEKVKNGIIPNFVTIPSTSNNPTSAESTTDEVSEQGEQPGEVIEQGEQLDEG 767

Query: 241 -------TQDKEVDPEPRRNKRAKTVKDFGEDFEMYNVED---PKDLTEALSSVDANLWQ 292
                  TQ +E     RR++R +         E   + D   P+ L E LS  + N   
Sbjct: 768 VEEVEHPTQGEEQHQPLRRSERPRVESRRYPSTEYVLISDDREPESLKEVLSHPEKNQLM 827

BLAST of Cmc02g0052001 vs. ExPASy Swiss-Prot
Match: Q9ZT94 (Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana OX=3702 GN=RE2 PE=4 SV=1)

HSP 1 Score: 109.8 bits (273), Expect = 5.4e-23
Identity = 64/206 (31.07%), Positives = 103/206 (50.00%), Query Frame = 0

Query: 2   FKVFVTEIENQFNKRIKRLRSDRGTEYDSVAFNEFYSSKGIIHETTAPYSPEMNGKAERK 61
           F +F + +EN+F  RI  L SD G E+  V   ++ S  GI H T+ P++PE NG +ERK
Sbjct: 550 FIIFKSLVENRFQTRIGTLYSDNGGEF--VVLRDYLSQHGISHFTSPPHTPEHNGLSERK 609

Query: 62  NRTLTELVVAILLESGAAPSWWEEIIKSVNYVLNRIPKSNSK-TSLYEVLKHKTPNLSYL 121
           +R + E+ + +L  +    ++W        Y++NR+P    +  S ++ L  + PN   L
Sbjct: 610 HRHIVEMGLTLLSHASVPKTYWPYAFSVAVYLINRLPTPLLQLQSPFQKLFGQPPNYEKL 669

Query: 122 RTWGCLAYVRIPDPKRRKLASRAYECVFIGYAENSKTYRFYDLENKVIIESNDVDFFEDI 181
           + +GC  Y  +    R KL  ++ +C F+GY+     Y    +    +  S  V F E  
Sbjct: 670 KVFGCACYPWLRPYNRHKLEDKSKQCAFMGYSLTQSAYLCLHIPTGRLYTSRHVQFDERC 729

Query: 182 FPFKSRNSGGLCSQTSGGSSSNSLPS 207
           FPF + N G   SQ     S+ + PS
Sbjct: 730 FPFSTTNFGVSTSQEQRSDSAPNWPS 753

BLAST of Cmc02g0052001 vs. ExPASy Swiss-Prot
Match: Q94HW2 (Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana OX=3702 GN=RE1 PE=2 SV=1)

HSP 1 Score: 103.6 bits (257), Expect = 3.8e-21
Identity = 60/185 (32.43%), Positives = 95/185 (51.35%), Query Frame = 0

Query: 2   FKVFVTEIENQFNKRIKRLRSDRGTEYDSVAFNEFYSSKGIIHETTAPYSPEMNGKAERK 61
           F  F   +EN+F  RI    SD G E+  VA  E++S  GI H T+ P++PE NG +ERK
Sbjct: 571 FITFKNLLENRFQTRIGTFYSDNGGEF--VALWEYFSQHGISHLTSPPHTPEHNGLSERK 630

Query: 62  NRTLTELVVAILLESGAAPSWWEEIIKSVNYVLNRIPKSNSK-TSLYEVLKHKTPNLSYL 121
           +R + E  + +L  +    ++W        Y++NR+P    +  S ++ L   +PN   L
Sbjct: 631 HRHIVETGLTLLSHASIPKTYWPYAFAVAVYLINRLPTPLLQLESPFQKLFGTSPNYDKL 690

Query: 122 RTWGCLAYVRIPDPKRRKLASRAYECVFIGYAENSKTYRFYDLENKVIIESNDVDFFEDI 181
           R +GC  Y  +    + KL  ++ +CVF+GY+     Y    L+   +  S  V F E+ 
Sbjct: 691 RVFGCACYPWLRPYNQHKLDDKSRQCVFLGYSLTQSAYLCLHLQTSRLYISRHVRFDENC 750

Query: 182 FPFKS 186
           FPF +
Sbjct: 751 FPFSN 753

BLAST of Cmc02g0052001 vs. ExPASy Swiss-Prot
Match: P04146 (Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3)

HSP 1 Score: 101.7 bits (252), Expect = 1.5e-20
Identity = 78/281 (27.76%), Positives = 138/281 (49.11%), Query Frame = 0

Query: 1   MFKVFVTEIENQFNKRIKRLRSDRGTEYDSVAFNEFYSSKGIIHETTAPYSPEMNGKAER 60
           MF+ FV + E  FN ++  L  D G EY S    +F   KGI +  T P++P++NG +ER
Sbjct: 528 MFQDFVAKSEAHFNLKVVYLYIDNGREYLSNEMRQFCVKKGISYHLTVPHTPQLNGVSER 587

Query: 61  KNRTLTELVVAILLESGAAPSWWEEIIKSVNYVLNRIPKS---NSKTSLYEVLKHKTPNL 120
             RT+TE    ++  +    S+W E + +  Y++NRIP     +S  + YE+  +K P L
Sbjct: 588 MIRTITEKARTMVSGAKLDKSFWGEAVLTATYLINRIPSRALVDSSKTPYEMWHNKKPYL 647

Query: 121 SYLRTWGCLAYVRIPDPKRRKLASRAYECVFIGYAENSKTYRFYDLENKVIIESNDVDFF 180
            +LR +G   YV I + K+ K   ++++ +F+GY  N   ++ +D  N+  I + DV   
Sbjct: 648 KHLRVFGATVYVHIKN-KQGKFDDKSFKSIFVGYEPNG--FKLWDAVNEKFIVARDVVVD 707

Query: 181 E-DIFPFKSRNSGGLCSQTSGGSSSNSLPSIRIQTQDKEVDPE-PRRNKRAKTVKDFGED 240
           E ++   ++     +  + S  S + + P+       K +  E P  +K    ++   + 
Sbjct: 708 ETNMVNSRAVKFETVFLKDSKESENKNFPN----DSRKIIQTEFPNESKECDNIQFLKDS 767

Query: 241 FEMYNVEDPKDLTEALSSVDANLWQEAINDEM--NSLESNR 275
            E  N   P D  + + +   N  +E  N +   +S ESN+
Sbjct: 768 KESENKNFPNDSRKIIQTEFPNESKECDNIQFLKDSKESNK 801

BLAST of Cmc02g0052001 vs. ExPASy Swiss-Prot
Match: P0C2J7 (Transposon Ty4-H Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=TY4B-H PE=3 SV=1)

HSP 1 Score: 67.4 bits (163), Expect = 3.1e-10
Identity = 46/167 (27.54%), Positives = 78/167 (46.71%), Query Frame = 0

Query: 9   IENQFNKRIKRLRSDRGTEYDSVAFNEFYSSKGIIHETTAPYSPEMNGKAERKNRTLTEL 68
           +E QF+++++ + SDRGTE+ +    E++ SKGI H  T+      NG+AER  RT+   
Sbjct: 681 VETQFDRKVREINSDRGTEFTNDQIEEYFISKGIHHILTSTQDHAANGRAERYIRTIVTD 740

Query: 69  VVAILLESGAAPSWWEEIIKSVNYVLNRIP-KSNSKTSLYEVLKHK-TPNLSYLRTWGCL 128
              +L +S     +WE  + S   + N +  KS  K  L  + +   T  L     +G  
Sbjct: 741 ATTLLRQSNLRVKFWEYAVTSATNIRNCLEHKSTGKLPLKAISRQPVTVRLMSFLPFGEK 800

Query: 129 AYVRIPDPKRRKLASRAYECVFIGYAENSKTYRFY-DLENKVIIESN 173
               I +   +KL       + +    NS  Y+F+   +NK++   N
Sbjct: 801 GI--IWNHNHKKLKPSGLPSIILCKDPNSYGYKFFIPSKNKIVTSDN 845

BLAST of Cmc02g0052001 vs. ExPASy TrEMBL
Match: A0A5D3DCJ1 (Putative Polyprotein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold1199G00010 PE=4 SV=1)

HSP 1 Score: 525.4 bits (1352), Expect = 1.5e-145
Identity = 260/285 (91.23%), Positives = 271/285 (95.09%), Query Frame = 0

Query: 1   MFKVFVTEIENQFNKRIKRLRSDRGTEYDSVAFNEFYSSKGIIHETTAPYSPEMNGKAER 60
           MFKVFVTEIENQFNKRIKRLRSDRGTEYDSVAFNEFY+SKGIIHETT PYSPEMNGK ER
Sbjct: 403 MFKVFVTEIENQFNKRIKRLRSDRGTEYDSVAFNEFYNSKGIIHETTTPYSPEMNGKEER 462

Query: 61  KNRTLTELVVAILLESGAAPSWWEEIIKSVNYVLNRIPKSNSKTSLYEVLKHKTPNLSYL 120
           KNRTLTEL VAILLES AAPSWW EIIK+VNYVLNRIPKSNSKTS YEVLKHK PNLSYL
Sbjct: 463 KNRTLTELAVAILLESEAAPSWWGEIIKTVNYVLNRIPKSNSKTSPYEVLKHKIPNLSYL 522

Query: 121 RTWGCLAYVRIPDPKRRKLASRAYECVFIGYAENSKTYRFYDLENKVIIESNDVDFFEDI 180
           RTWGCLAYVRIP+P+RRKLAS+AYECVFIGYAENSK YRFYDLENKVIIESNDVDFFED 
Sbjct: 523 RTWGCLAYVRIPNPERRKLASKAYECVFIGYAENSKAYRFYDLENKVIIESNDVDFFEDK 582

Query: 181 FPFKSRNSGGLCSQTSGGSSSNSLPSIRIQTQDKEVDPEPRRNKRAKTVKDFGEDFEMYN 240
           FPFKSRNSGGL SQTSGGSS +SLPSIRIQTQDKEVDPEPRR+KRA+TVKDF EDFEMYN
Sbjct: 583 FPFKSRNSGGLYSQTSGGSSFSSLPSIRIQTQDKEVDPEPRRSKRARTVKDFREDFEMYN 642

Query: 241 VEDPKDLTEALSSVDANLWQEAINDEMNSLESNRTWHLVDLPPGC 286
           VEDPKDLT+ALSSVDANLWQEAIND ++SLESNRTWHLVDLPP C
Sbjct: 643 VEDPKDLTKALSSVDANLWQEAINDGIDSLESNRTWHLVDLPPRC 687

BLAST of Cmc02g0052001 vs. ExPASy TrEMBL
Match: A0A445KFK2 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Glycine soja OX=3848 GN=D0Y65_015970 PE=4 SV=1)

HSP 1 Score: 414.5 bits (1064), Expect = 3.8e-112
Identity = 210/293 (71.67%), Positives = 237/293 (80.89%), Query Frame = 0

Query: 1   MFKVFVTEIENQFNKRIKRLRSDRGTEYDSVAFNEFYSSKGIIHETTAPYSPEMNGKAER 60
           MFK+FVTEIENQFNK+IK+LRSDRGT+YDS  FNEFY+  GIIHETTAPYSPEMNGKAER
Sbjct: 10  MFKLFVTEIENQFNKKIKKLRSDRGTKYDSSLFNEFYNLHGIIHETTAPYSPEMNGKAER 69

Query: 61  KNRTLTELVVAILLESGAAPSWWEEIIKSVNYVLNRIPKSNSKTSLYEVLKHKTPNLSYL 120
           KNRT TELVVA +L S A   WW EI+ +V YVLNRIPKS SKTS YE+LK + PNLSYL
Sbjct: 70  KNRTFTELVVATMLSSSATSFWWGEILLTVCYVLNRIPKSKSKTSPYEILKKRQPNLSYL 129

Query: 121 RTWGCLAYVRIPDPKRRKLASRAYECVFIGYAENSKTYRFYDLENKVIIESNDVDFFEDI 180
           RTWGCLAYVRIPDPKR KLASRAYECVFIGYA NSK YRFYDL  KVIIESND DF+E+ 
Sbjct: 130 RTWGCLAYVRIPDPKRVKLASRAYECVFIGYAINSKAYRFYDLNAKVIIESNDADFYENK 189

Query: 181 FPFKSRNSGGLCSQTSGGSSSNSLPSIRIQT-QDKEVDPEPRRNKRAKTVKDFGEDFEMY 240
           FPFK R+        SGG+SSN LP+I  +     + D EPRR KRA+  KD+G D+  Y
Sbjct: 190 FPFKLRD--------SGGTSSNYLPAISSENLAQPKPDIEPRRGKRARIAKDYGPDYMAY 249

Query: 241 NV-EDPKDLTEALSSVDANLWQEAINDEMNSLESNRTWHLVDLPPGCKAIGCK 292
            + EDP +L EALS +DA+LWQEAINDEM+SLES++TWHLVDLPPGCK IGCK
Sbjct: 250 TLEEDPSNLQEALSFLDADLWQEAINDEMDSLESDKTWHLVDLPPGCKPIGCK 294

BLAST of Cmc02g0052001 vs. ExPASy TrEMBL
Match: A0A7N2L531 (Uncharacterized protein OS=Quercus lobata OX=97700 PE=4 SV=1)

HSP 1 Score: 329.3 bits (843), Expect = 1.6e-86
Identity = 171/291 (58.76%), Positives = 207/291 (71.13%), Query Frame = 0

Query: 2   FKVFVTEIENQFNKRIKRLRSDRGTEYDSVAFNEFYSSKGIIHETTAPYSPEMNGKAERK 61
           F+ F+ E+ENQF ++IKR+RSDRG EY+S AFN F  S GIIHETTAPYSP  NG AERK
Sbjct: 567 FQDFLQEVENQFGRKIKRIRSDRGREYESSAFNSFAQSLGIIHETTAPYSPASNGVAERK 626

Query: 62  NRTLTELVVAILLESGAAPSWWEEIIKSVNYVLNRIPKSNSKTSLYEVLKHKTPNLSYLR 121
           NRTL EL  A+L+ESGA   +W E I +  +VLNR+P   S T+ +E+ K   PNL YLR
Sbjct: 627 NRTLIELTNAMLIESGAPLHFWGEAILTACHVLNRVPHKKSHTTPFEMWKGHKPNLGYLR 686

Query: 122 TWGCLAYVRIPDPKRRKLASRAYECVFIGYAENSKTYRFYDLENKVIIESNDVDFFEDIF 181
            W CLAYVR+ DPK  KL  RA  C F+GYA NS  YRF+DLENK+I ES D  F E+ F
Sbjct: 687 AWDCLAYVRLTDPKMPKLGIRATTCAFLGYAINSAAYRFFDLENKIIFESGDAIFHEEKF 746

Query: 182 PFKSRNSGGLCSQTSGGSSSNSLPSIRIQTQDKEVDPEPRRNKRAKTVKDFGEDFEMYNV 241
           PFK +NSGG  +  S  SSS S      Q Q+   + EPRR+KRA+  KDFG D+ ++N+
Sbjct: 747 PFKLKNSGGEENILSQPSSSTS----HFQNQE-NFEMEPRRSKRARVEKDFGPDYYVFNI 806

Query: 242 ED-PKDLTEALSSVDANLWQEAINDEMNSLESNRTWHLVDLPPGCKAIGCK 292
           E+ PK+L EAL+S DA  W+EA+NDEM SL SNRTW LVDLPPGCK IGCK
Sbjct: 807 EENPKNLKEALTSPDAIFWKEAVNDEMESLISNRTWKLVDLPPGCKTIGCK 852

BLAST of Cmc02g0052001 vs. ExPASy TrEMBL
Match: A0A7N2R9F3 (Uncharacterized protein OS=Quercus lobata OX=97700 PE=4 SV=1)

HSP 1 Score: 323.2 bits (827), Expect = 1.1e-84
Identity = 168/291 (57.73%), Positives = 206/291 (70.79%), Query Frame = 0

Query: 2   FKVFVTEIENQFNKRIKRLRSDRGTEYDSVAFNEFYSSKGIIHETTAPYSPEMNGKAERK 61
           F+ F+ E+ENQF ++IKR+RSDRG EY+S AFN F  S GIIHETTAPYSP  NG  ERK
Sbjct: 567 FQDFLKEVENQFGRKIKRIRSDRGREYESSAFNSFVQSLGIIHETTAPYSPASNGVVERK 626

Query: 62  NRTLTELVVAILLESGAAPSWWEEIIKSVNYVLNRIPKSNSKTSLYEVLKHKTPNLSYLR 121
           NRTL EL  A+L+ESGA   +W E I +  +VLNR+P   S T+ +E+ K   PNL YLR
Sbjct: 627 NRTLIELTNAMLIESGAPLHFWGEAILTACHVLNRVPHKKSHTTPFEMWKGHKPNLGYLR 686

Query: 122 TWGCLAYVRIPDPKRRKLASRAYECVFIGYAENSKTYRFYDLENKVIIESNDVDFFEDIF 181
            WGCLAYVR+ DPK  KL  RA  C F+GYA NS  YRF+DLENK+I ES D  F E+ F
Sbjct: 687 VWGCLAYVRLTDPKIPKLGIRATTCAFLGYAINSAAYRFFDLENKIIFESGDAIFHEEKF 746

Query: 182 PFKSRNSGGLCSQTSGGSSSNSLPSIRIQTQDKEVDPEPRRNKRAKTVKDFGEDFEMYNV 241
           PFK +NSGG  +     SSS S     +Q Q+   + E RR+KRA+  KDFG D+ ++N+
Sbjct: 747 PFKLKNSGGEENILLQPSSSTS----HLQNQE-NFEMELRRSKRARVEKDFGPDYYVFNI 806

Query: 242 ED-PKDLTEALSSVDANLWQEAINDEMNSLESNRTWHLVDLPPGCKAIGCK 292
           E+ P++L EAL+S DA  W+EA+NDEM SL SNRTW LVDLPPGCK IGCK
Sbjct: 807 EENPQNLKEALTSSDAIFWKEAVNDEMESLISNRTWKLVDLPPGCKTIGCK 852

BLAST of Cmc02g0052001 vs. ExPASy TrEMBL
Match: A0A7N2N1S1 (Integrase catalytic domain-containing protein OS=Quercus lobata OX=97700 PE=4 SV=1)

HSP 1 Score: 315.8 bits (808), Expect = 1.8e-82
Identity = 165/277 (59.57%), Positives = 197/277 (71.12%), Query Frame = 0

Query: 16  RIKRLRSDRGTEYDSVAFNEFYSSKGIIHETTAPYSPEMNGKAERKNRTLTELVVAILLE 75
           +IKR+RSDRG EY+S AFN F  S GIIHETTAPYSP  NG AERKNRTL EL  A+L+E
Sbjct: 430 KIKRIRSDRGHEYESSAFNSFAQSLGIIHETTAPYSPASNGVAERKNRTLIELTNAMLIE 489

Query: 76  SGAAPSWWEEIIKSVNYVLNRIPKSNSKTSLYEVLKHKTPNLSYLRTWGCLAYVRIPDPK 135
           SGA   +W E I +  +VLNR+P   S T+ +E+ K   PNL YLR WGCLAYVR+ DPK
Sbjct: 490 SGAPLHFWGEAILTACHVLNRVPHKKSHTTPFEMWKGHKPNLGYLRVWGCLAYVRLTDPK 549

Query: 136 RRKLASRAYECVFIGYAENSKTYRFYDLENKVIIESNDVDFFEDIFPFKSRNSGGLCSQT 195
             KL  RA  C F+GYA NS  YRF+DLENK+I ES D  F E+ FPFK +NSGG  +  
Sbjct: 550 MPKLGIRATTCAFLGYAINSAAYRFFDLENKIIFESGDAIFHEEKFPFKLKNSGGEENIL 609

Query: 196 SGGSSSNSLPSIRIQTQDKEVDPEPRRNKRAKTVKDFGEDFEMYNVED-PKDLTEALSSV 255
           S  SSS S      Q Q+   + EPRR+KRA+  KDFG D+ ++N+E+ PK+L EAL+S 
Sbjct: 610 SQPSSSTS----HFQNQE-NFEMEPRRSKRARVEKDFGPDYYVFNIEENPKNLKEALTSP 669

Query: 256 DANLWQEAINDEMNSLESNRTWHLVDLPPGCKAIGCK 292
           DA  W+EA+NDEM SL SNRTW LVDLPPGCK IGCK
Sbjct: 670 DAIFWKEAVNDEMESLISNRTWKLVDLPPGCKTIGCK 701

BLAST of Cmc02g0052001 vs. TAIR 10
Match: ATMG00710.1 (Polynucleotidyl transferase, ribonuclease H-like superfamily protein )

HSP 1 Score: 43.9 bits (102), Expect = 2.6e-04
Identity = 24/85 (28.24%), Positives = 43/85 (50.59%), Query Frame = 0

Query: 62  NRTLTELVVAILLESGAAPSWWEEIIKSVNYVLNRIPKSNSKTSL-YEVLKHKTPNLSYL 121
           NRT+ E V ++L E G   ++  +   +  +++N+ P +     +  EV     P  SYL
Sbjct: 2   NRTIIEKVRSMLCECGLPKTFRADAANTAVHIINKYPSTAINFHVPDEVWFQSVPTYSYL 61

Query: 122 RTWGCLAYVRIPDPKRRKLASRAYE 146
           R +GC+AY+   + K +  A +  E
Sbjct: 62  RRFGCVAYIHCDEGKLKPRAKKGEE 86

The following BLAST results are available for this feature:

vs. NCBI nr
Match Name      E-value   Identity (%)  Description
KAA0034938.1    3.1e-145  91.23         putative Polyprotein [Cucumis melo var. makuwa] >TYK21293.1 putative Polyprotein...
RZC09450.1      7.8e-112  71.67         Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Glycine soja]
AAU90333.1      1.2e-80   55.63         Putative gag and pol polyprotein, identical [Solanum demissum]
ABI34306.1      6.0e-80   55.29         Polyprotein, putative [Solanum demissum]
XP_023158131.2  1.4e-73   50.34         uncharacterized protein LOC103653943 isoform X1 [Zea mays] >XP_035823266.1 uncha...

vs. ExPASy Swiss-Prot
Match Name      E-value   Identity (%)  Description
P10978          7.2e-36   31.72         Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum...
Q9ZT94          5.4e-23   31.07         Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana O...
Q94HW2          3.8e-21   32.43         Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana O...
P04146          1.5e-20   27.76         Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3
P0C2J7          3.1e-10   27.54         Transposon Ty4-H Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 20...

vs. ExPASy TrEMBL
Match Name      E-value   Identity (%)  Description
A0A5D3DCJ1      1.5e-145  91.23         Putative Polyprotein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold119...
A0A445KFK2      3.8e-112  71.67         Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Glycine soja OX=3...
A0A7N2L531      1.6e-86   58.76         Uncharacterized protein OS=Quercus lobata OX=97700 PE=4 SV=1
A0A7N2R9F3      1.1e-84   57.73         Uncharacterized protein OS=Quercus lobata OX=97700 PE=4 SV=1
A0A7N2N1S1      1.8e-82   59.57         Integrase catalytic domain-containing protein OS=Quercus lobata OX=97700 PE=4 SV...

vs. TAIR 10
Match Name      E-value   Identity (%)  Description
ATMG00710.1     2.6e-04   28.24         Polynucleotidyl transferase, ribonuclease H-like superfamily protein

InterPro
Analysis Name: InterPro Annotations of Melon (Charmono) v1.1
Date Performed: 2022-10-13
IPR Term   IPR Description                  Source       Source Term  Source Description   Alignment
IPR036397  Ribonuclease H superfamily       GENE3D       3.30.420.10  -                    coord: 1..123; e-value: 2.8E-25; score: 90.8
None       No IPR available                 MOBIDB_LITE  mobidb-lite  disorder_prediction  coord: 189..225
None       No IPR available                 MOBIDB_LITE  mobidb-lite  disorder_prediction  coord: 189..210
None       No IPR available                 PANTHER      PTHR45895    FAMILY NOT NAMED     coord: 3..197
IPR001584  Integrase, catalytic core        PROSITE      PS50994      INTEGRASE            coord: 1..123; score: 16.014511
IPR012337  Ribonuclease H-like superfamily  SUPERFAMILY  53098        Ribonuclease H-like  coord: 3..122

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name     Unique Name      Type
Cmc02g0052001.1  Cmc02g0052001.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0071897      DNA biosynthetic process
biological_process  GO:0015074      DNA integration
molecular_function  GO:0003887      DNA-directed DNA polymerase activity
molecular_function  GO:0003676      nucleic acid binding
molecular_function  GO:0008270      zinc ion binding