Homology
BLAST of Cmc02g0052001 vs. NCBI nr
Match:
KAA0034938.1 (putative Polyprotein [Cucumis melo var. makuwa] >TYK21293.1 putative Polyprotein [Cucumis melo var. makuwa])
HSP 1 Score: 525.4 bits (1352), Expect = 3.1e-145
Identity = 260/285 (91.23%), Positives = 271/285 (95.09%), Query Frame = 0
Query: 1 MFKVFVTEIENQFNKRIKRLRSDRGTEYDSVAFNEFYSSKGIIHETTAPYSPEMNGKAER 60
MFKVFVTEIENQFNKRIKRLRSDRGTEYDSVAFNEFY+SKGIIHETT PYSPEMNGK ER
Sbjct: 403 MFKVFVTEIENQFNKRIKRLRSDRGTEYDSVAFNEFYNSKGIIHETTTPYSPEMNGKEER 462
Query: 61 KNRTLTELVVAILLESGAAPSWWEEIIKSVNYVLNRIPKSNSKTSLYEVLKHKTPNLSYL 120
KNRTLTEL VAILLES AAPSWW EIIK+VNYVLNRIPKSNSKTS YEVLKHK PNLSYL
Sbjct: 463 KNRTLTELAVAILLESEAAPSWWGEIIKTVNYVLNRIPKSNSKTSPYEVLKHKIPNLSYL 522
Query: 121 RTWGCLAYVRIPDPKRRKLASRAYECVFIGYAENSKTYRFYDLENKVIIESNDVDFFEDI 180
RTWGCLAYVRIP+P+RRKLAS+AYECVFIGYAENSK YRFYDLENKVIIESNDVDFFED
Sbjct: 523 RTWGCLAYVRIPNPERRKLASKAYECVFIGYAENSKAYRFYDLENKVIIESNDVDFFEDK 582
Query: 181 FPFKSRNSGGLCSQTSGGSSSNSLPSIRIQTQDKEVDPEPRRNKRAKTVKDFGEDFEMYN 240
FPFKSRNSGGL SQTSGGSS +SLPSIRIQTQDKEVDPEPRR+KRA+TVKDF EDFEMYN
Sbjct: 583 FPFKSRNSGGLYSQTSGGSSFSSLPSIRIQTQDKEVDPEPRRSKRARTVKDFREDFEMYN 642
Query: 241 VEDPKDLTEALSSVDANLWQEAINDEMNSLESNRTWHLVDLPPGC 286
VEDPKDLT+ALSSVDANLWQEAIND ++SLESNRTWHLVDLPP C
Sbjct: 643 VEDPKDLTKALSSVDANLWQEAINDGIDSLESNRTWHLVDLPPRC 687
BLAST of Cmc02g0052001 vs. NCBI nr
Match:
RZC09450.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Glycine soja])
HSP 1 Score: 414.5 bits (1064), Expect = 7.8e-112
Identity = 210/293 (71.67%), Positives = 237/293 (80.89%), Query Frame = 0
Query: 1 MFKVFVTEIENQFNKRIKRLRSDRGTEYDSVAFNEFYSSKGIIHETTAPYSPEMNGKAER 60
MFK+FVTEIENQFNK+IK+LRSDRGT+YDS FNEFY+ GIIHETTAPYSPEMNGKAER
Sbjct: 10 MFKLFVTEIENQFNKKIKKLRSDRGTKYDSSLFNEFYNLHGIIHETTAPYSPEMNGKAER 69
Query: 61 KNRTLTELVVAILLESGAAPSWWEEIIKSVNYVLNRIPKSNSKTSLYEVLKHKTPNLSYL 120
KNRT TELVVA +L S A WW EI+ +V YVLNRIPKS SKTS YE+LK + PNLSYL
Sbjct: 70 KNRTFTELVVATMLSSSATSFWWGEILLTVCYVLNRIPKSKSKTSPYEILKKRQPNLSYL 129
Query: 121 RTWGCLAYVRIPDPKRRKLASRAYECVFIGYAENSKTYRFYDLENKVIIESNDVDFFEDI 180
RTWGCLAYVRIPDPKR KLASRAYECVFIGYA NSK YRFYDL KVIIESND DF+E+
Sbjct: 130 RTWGCLAYVRIPDPKRVKLASRAYECVFIGYAINSKAYRFYDLNAKVIIESNDADFYENK 189
Query: 181 FPFKSRNSGGLCSQTSGGSSSNSLPSIRIQT-QDKEVDPEPRRNKRAKTVKDFGEDFEMY 240
FPFK R+ SGG+SSN LP+I + + D EPRR KRA+ KD+G D+ Y
Sbjct: 190 FPFKLRD--------SGGTSSNYLPAISSENLAQPKPDIEPRRGKRARIAKDYGPDYMAY 249
Query: 241 NV-EDPKDLTEALSSVDANLWQEAINDEMNSLESNRTWHLVDLPPGCKAIGCK 292
+ EDP +L EALS +DA+LWQEAINDEM+SLES++TWHLVDLPPGCK IGCK
Sbjct: 250 TLEEDPSNLQEALSFLDADLWQEAINDEMDSLESDKTWHLVDLPPGCKPIGCK 294
BLAST of Cmc02g0052001 vs. NCBI nr
Match:
AAU90333.1 (Putative gag and pol polyprotein, identical [Solanum demissum])
HSP 1 Score: 310.8 bits (795), Expect = 1.2e-80
Identity = 163/293 (55.63%), Positives = 205/293 (69.97%), Query Frame = 0
Query: 2 FKVFVTEIENQFNKRIKRLRSDRGTEYDSVAFNEFYSSKGIIHETTAPYSPEMNGKAERK 61
FK ++ E+ENQF ++IKR+RSDRG EY+S FN F S GIIHETT PYSP NG AERK
Sbjct: 344 FKTYLHEVENQFGRKIKRIRSDRGREYESNEFNSFVRSLGIIHETTPPYSPSSNGAAERK 403
Query: 62 NRTLTELVVAILLESGAAPSWWEEIIKSVNYVLNRIPKSNSKTSLYEVLKHKTPNLSYLR 121
NRTL EL A+L+ES A ++W E I + YVLNR+P SK + +E+ K P+L YLR
Sbjct: 404 NRTLVELTNAMLIESHAPLNFWGETILTACYVLNRVPHKKSKLTHFELWKGYKPSLGYLR 463
Query: 122 TWGCLAYVRIPDPKRRKLASRAYECVFIGYAENSKTYRFYDLENKVIIESNDVDFFEDIF 181
WGCLA+VR+ DPK KL + C F+GYA NS YRF++LE+ ++IES D F E+ F
Sbjct: 464 VWGCLAFVRLMDPKITKLGKKVTTCAFLGYASNSTAYRFFNLEDNIVIESGDAIFHENKF 523
Query: 182 PFKSRNSGGLCSQTSGGSSSNSLPSIRIQT-QDKEV-DPEPRRNKRAKTVKDFGEDFEMY 241
PF S+NSGG + + SLPS T ++KEV D E RR+KRA+ KDFG +F ++
Sbjct: 524 PFDSKNSGGQRIE----QNILSLPSSSTSTLKNKEVNDFELRRSKRARIEKDFGPNFYVF 583
Query: 242 NV-EDPKDLTEALSSVDANLWQEAINDEMNSLESNRTWHLVDLPPGCKAIGCK 292
NV +DP L EALSS D+ W+EA+NDEM SL SN+TW LVDLPPGCK IGCK
Sbjct: 584 NVGDDPLTLKEALSSHDSIFWKEAVNDEMESLISNKTWKLVDLPPGCKTIGCK 632
BLAST of Cmc02g0052001 vs. NCBI nr
Match:
ABI34306.1 (Polyprotein, putative [Solanum demissum])
HSP 1 Score: 308.5 bits (789), Expect = 6.0e-80
Identity = 162/293 (55.29%), Positives = 204/293 (69.62%), Query Frame = 0
Query: 2 FKVFVTEIENQFNKRIKRLRSDRGTEYDSVAFNEFYSSKGIIHETTAPYSPEMNGKAERK 61
FK ++ E+ENQF ++IKR+RSDRG EY+S FN F S GIIHETT PYSP NG AERK
Sbjct: 543 FKTYLHEVENQFGRKIKRIRSDRGREYESNEFNSFVRSLGIIHETTPPYSPSSNGVAERK 602
Query: 62 NRTLTELVVAILLESGAAPSWWEEIIKSVNYVLNRIPKSNSKTSLYEVLKHKTPNLSYLR 121
NRTL EL A+L+ES A ++W E I + YVLNR+P SK + +E+ K P+L YLR
Sbjct: 603 NRTLVELTNAMLIESHAPLNFWGEAILTACYVLNRVPHKKSKLTPFELWKGYKPSLGYLR 662
Query: 122 TWGCLAYVRIPDPKRRKLASRAYECVFIGYAENSKTYRFYDLENKVIIESNDVDFFEDIF 181
WGCLA+VR+ DPK KL + C F+GYA NS YRF++LE+ ++IES D F E+ F
Sbjct: 663 VWGCLAFVRLMDPKITKLGKKVTTCAFLGYASNSTAYRFFNLEDNIVIESGDAIFHENKF 722
Query: 182 PFKSRNSGGLCSQTSGGSSSNSLPSIRIQT-QDKEV-DPEPRRNKRAKTVKDFGEDFEMY 241
PF S+NSGG + + +LPS T ++KEV D E RR+KRA+ KDFG DF ++
Sbjct: 723 PFDSKNSGGQRIE----QNILTLPSSSTSTLKNKEVNDFELRRSKRARVEKDFGPDFYVF 782
Query: 242 NVEDPK-DLTEALSSVDANLWQEAINDEMNSLESNRTWHLVDLPPGCKAIGCK 292
NV D + L EALSS D+ W+EA+NDEM SL SN+TW LVDLPPGCK IGCK
Sbjct: 783 NVGDDRLTLKEALSSHDSIFWKEAVNDEMESLISNKTWKLVDLPPGCKTIGCK 831
BLAST of Cmc02g0052001 vs. NCBI nr
Match:
XP_023158131.2 (uncharacterized protein LOC103653943 isoform X1 [Zea mays] >XP_035823266.1 uncharacterized protein LOC103653943 isoform X1 [Zea mays])
HSP 1 Score: 287.3 bits (734), Expect = 1.4e-73
Identity = 150/298 (50.34%), Positives = 196/298 (65.77%), Query Frame = 0
Query: 2 FKVFVTEIENQFNKRIKRLRSDRGTEYDSVAFNEFYSSKGIIHETTAPYSPEMNGKAERK 61
FK++ TE+ENQ +K+IKRLRSDRG EY S F+E+ GIIHETTAPYSP+ NG AERK
Sbjct: 623 FKIYKTEVENQLDKKIKRLRSDRGGEYLSNLFDEYCKECGIIHETTAPYSPQSNGVAERK 682
Query: 62 NRTLTELVVAILLESGAAPSWWEEIIKSVNYVLNRIPKSNSKTSLYEVLKHKTPNLSYLR 121
NRT+ +L A+L SG WW E + +V YVLNR+P N + + YE K + P+LS+LR
Sbjct: 683 NRTVCDLANALLQSSGMPDIWWGEAVLTVCYVLNRVPPRNREATPYEGFKGRKPDLSHLR 742
Query: 122 TWGCLAYVRIPDPKRRKLASRAYECVFIGYAENSKTYRFYDLENK-------VIIESNDV 181
TWGCLA V +P PK+RKL + +CVF+GYA NS YRF + ++ VI+ES DV
Sbjct: 743 TWGCLAKVNVPLPKKRKLGPKTVDCVFLGYAHNSAAYRFLVVHSETSEVAINVIMESRDV 802
Query: 182 DFFEDIFPFKSRNSGGLCSQTSGGSSSNSLPSIRIQTQDKEVDPEPRRNKRAKTVKDFGE 241
FFE IFP + + G S + SLPS D+ D E RR+KR +T K G+
Sbjct: 803 TFFESIFPMRDKE----VVAPDGPSRTYSLPS---SVNDQTPDLELRRSKRQRTEKSLGD 862
Query: 242 DFEMYNV-EDPKDLTEALSSVDANLWQEAINDEMNSLESNRTWHLVDLPPGCKAIGCK 292
D+ +Y V E+P+ LTEA +S DA W+EA+ EM+S+ SN TW + DLP GCK +GCK
Sbjct: 863 DYIIYLVDEEPRSLTEAYTSPDAEYWREAVRSEMDSIISNGTWEITDLPAGCKPVGCK 913
BLAST of Cmc02g0052001 vs. ExPASy Swiss-Prot
Match:
P10978 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum OX=4097 PE=2 SV=1)
HSP 1 Score: 152.5 bits (384), Expect = 7.2e-36
Identity = 105/331 (31.72%), Positives = 156/331 (47.13%), Query Frame = 0
Query: 1 MFKVFVTEIENQFNKRIKRLRSDRGTEYDSVAFNEFYSSKGIIHETTAPYSPEMNGKAER 60
+F+ F +E + +++KRLRSD G EY S F E+ SS GI HE T P +P+ NG AER
Sbjct: 528 VFQKFHALVERETGRKLKRLRSDNGGEYTSREFEEYCSSHGIRHEKTVPGTPQHNGVAER 587
Query: 61 KNRTLTELVVAILLESGAAPSWWEEIIKSVNYVLNRIPKSNSKTSLYE-VLKHKTPNLSY 120
NRT+ E V ++L + S+W E +++ Y++NR P + E V +K + S+
Sbjct: 588 MNRTIVEKVRSMLRMAKLPKSFWGEAVQTACYLINRSPSVPLAFEIPERVWTNKEVSYSH 647
Query: 121 LRTWGCLAYVRIPDPKRRKLASRAYECVFIGYAENSKTYRFYDLENKVIIESNDVDFFED 180
L+ +GC A+ +P +R KL ++ C+FIGY + YR +D K +I S DV F E
Sbjct: 648 LKVFGCRAFAHVPKEQRTKLDDKSIPCIFIGYGDEEFGYRLWDPVKKKVIRSRDVVFRES 707
Query: 181 ---------------------IFPFKSRNSGGLCSQTSGGSSSNSLPSIRIQ-------- 240
P S N S T S P I+
Sbjct: 708 EVRTAADMSEKVKNGIIPNFVTIPSTSNNPTSAESTTDEVSEQGEQPGEVIEQGEQLDEG 767
Query: 241 -------TQDKEVDPEPRRNKRAKTVKDFGEDFEMYNVED---PKDLTEALSSVDANLWQ 292
TQ +E RR++R + E + D P+ L E LS + N
Sbjct: 768 VEEVEHPTQGEEQHQPLRRSERPRVESRRYPSTEYVLISDDREPESLKEVLSHPEKNQLM 827
BLAST of Cmc02g0052001 vs. ExPASy Swiss-Prot
Match:
Q9ZT94 (Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana OX=3702 GN=RE2 PE=4 SV=1)
HSP 1 Score: 109.8 bits (273), Expect = 5.4e-23
Identity = 64/206 (31.07%), Positives = 103/206 (50.00%), Query Frame = 0
Query: 2 FKVFVTEIENQFNKRIKRLRSDRGTEYDSVAFNEFYSSKGIIHETTAPYSPEMNGKAERK 61
F +F + +EN+F RI L SD G E+ V ++ S GI H T+ P++PE NG +ERK
Sbjct: 550 FIIFKSLVENRFQTRIGTLYSDNGGEF--VVLRDYLSQHGISHFTSPPHTPEHNGLSERK 609
Query: 62 NRTLTELVVAILLESGAAPSWWEEIIKSVNYVLNRIPKSNSK-TSLYEVLKHKTPNLSYL 121
+R + E+ + +L + ++W Y++NR+P + S ++ L + PN L
Sbjct: 610 HRHIVEMGLTLLSHASVPKTYWPYAFSVAVYLINRLPTPLLQLQSPFQKLFGQPPNYEKL 669
Query: 122 RTWGCLAYVRIPDPKRRKLASRAYECVFIGYAENSKTYRFYDLENKVIIESNDVDFFEDI 181
+ +GC Y + R KL ++ +C F+GY+ Y + + S V F E
Sbjct: 670 KVFGCACYPWLRPYNRHKLEDKSKQCAFMGYSLTQSAYLCLHIPTGRLYTSRHVQFDERC 729
Query: 182 FPFKSRNSGGLCSQTSGGSSSNSLPS 207
FPF + N G SQ S+ + PS
Sbjct: 730 FPFSTTNFGVSTSQEQRSDSAPNWPS 753
BLAST of Cmc02g0052001 vs. ExPASy Swiss-Prot
Match:
Q94HW2 (Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana OX=3702 GN=RE1 PE=2 SV=1)
HSP 1 Score: 103.6 bits (257), Expect = 3.8e-21
Identity = 60/185 (32.43%), Positives = 95/185 (51.35%), Query Frame = 0
Query: 2 FKVFVTEIENQFNKRIKRLRSDRGTEYDSVAFNEFYSSKGIIHETTAPYSPEMNGKAERK 61
F F +EN+F RI SD G E+ VA E++S GI H T+ P++PE NG +ERK
Sbjct: 571 FITFKNLLENRFQTRIGTFYSDNGGEF--VALWEYFSQHGISHLTSPPHTPEHNGLSERK 630
Query: 62 NRTLTELVVAILLESGAAPSWWEEIIKSVNYVLNRIPKSNSK-TSLYEVLKHKTPNLSYL 121
+R + E + +L + ++W Y++NR+P + S ++ L +PN L
Sbjct: 631 HRHIVETGLTLLSHASIPKTYWPYAFAVAVYLINRLPTPLLQLESPFQKLFGTSPNYDKL 690
Query: 122 RTWGCLAYVRIPDPKRRKLASRAYECVFIGYAENSKTYRFYDLENKVIIESNDVDFFEDI 181
R +GC Y + + KL ++ +CVF+GY+ Y L+ + S V F E+
Sbjct: 691 RVFGCACYPWLRPYNQHKLDDKSRQCVFLGYSLTQSAYLCLHLQTSRLYISRHVRFDENC 750
Query: 182 FPFKS 186
FPF +
Sbjct: 751 FPFSN 753
BLAST of Cmc02g0052001 vs. ExPASy Swiss-Prot
Match:
P04146 (Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3)
HSP 1 Score: 101.7 bits (252), Expect = 1.5e-20
Identity = 78/281 (27.76%), Positives = 138/281 (49.11%), Query Frame = 0
Query: 1 MFKVFVTEIENQFNKRIKRLRSDRGTEYDSVAFNEFYSSKGIIHETTAPYSPEMNGKAER 60
MF+ FV + E FN ++ L D G EY S +F KGI + T P++P++NG +ER
Sbjct: 528 MFQDFVAKSEAHFNLKVVYLYIDNGREYLSNEMRQFCVKKGISYHLTVPHTPQLNGVSER 587
Query: 61 KNRTLTELVVAILLESGAAPSWWEEIIKSVNYVLNRIPKS---NSKTSLYEVLKHKTPNL 120
RT+TE ++ + S+W E + + Y++NRIP +S + YE+ +K P L
Sbjct: 588 MIRTITEKARTMVSGAKLDKSFWGEAVLTATYLINRIPSRALVDSSKTPYEMWHNKKPYL 647
Query: 121 SYLRTWGCLAYVRIPDPKRRKLASRAYECVFIGYAENSKTYRFYDLENKVIIESNDVDFF 180
+LR +G YV I + K+ K ++++ +F+GY N ++ +D N+ I + DV
Sbjct: 648 KHLRVFGATVYVHIKN-KQGKFDDKSFKSIFVGYEPNG--FKLWDAVNEKFIVARDVVVD 707
Query: 181 E-DIFPFKSRNSGGLCSQTSGGSSSNSLPSIRIQTQDKEVDPE-PRRNKRAKTVKDFGED 240
E ++ ++ + + S S + + P+ K + E P +K ++ +
Sbjct: 708 ETNMVNSRAVKFETVFLKDSKESENKNFPN----DSRKIIQTEFPNESKECDNIQFLKDS 767
Query: 241 FEMYNVEDPKDLTEALSSVDANLWQEAINDEM--NSLESNR 275
E N P D + + + N +E N + +S ESN+
Sbjct: 768 KESENKNFPNDSRKIIQTEFPNESKECDNIQFLKDSKESNK 801
BLAST of Cmc02g0052001 vs. ExPASy Swiss-Prot
Match:
P0C2J7 (Transposon Ty4-H Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=TY4B-H PE=3 SV=1)
HSP 1 Score: 67.4 bits (163), Expect = 3.1e-10
Identity = 46/167 (27.54%), Positives = 78/167 (46.71%), Query Frame = 0
Query: 9 IENQFNKRIKRLRSDRGTEYDSVAFNEFYSSKGIIHETTAPYSPEMNGKAERKNRTLTEL 68
+E QF+++++ + SDRGTE+ + E++ SKGI H T+ NG+AER RT+
Sbjct: 681 VETQFDRKVREINSDRGTEFTNDQIEEYFISKGIHHILTSTQDHAANGRAERYIRTIVTD 740
Query: 69 VVAILLESGAAPSWWEEIIKSVNYVLNRIP-KSNSKTSLYEVLKHK-TPNLSYLRTWGCL 128
+L +S +WE + S + N + KS K L + + T L +G
Sbjct: 741 ATTLLRQSNLRVKFWEYAVTSATNIRNCLEHKSTGKLPLKAISRQPVTVRLMSFLPFGEK 800
Query: 129 AYVRIPDPKRRKLASRAYECVFIGYAENSKTYRFY-DLENKVIIESN 173
I + +KL + + NS Y+F+ +NK++ N
Sbjct: 801 GI--IWNHNHKKLKPSGLPSIILCKDPNSYGYKFFIPSKNKIVTSDN 845
BLAST of Cmc02g0052001 vs. ExPASy TrEMBL
Match:
A0A5D3DCJ1 (Putative Polyprotein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold1199G00010 PE=4 SV=1)
HSP 1 Score: 525.4 bits (1352), Expect = 1.5e-145
Identity = 260/285 (91.23%), Positives = 271/285 (95.09%), Query Frame = 0
Query: 1 MFKVFVTEIENQFNKRIKRLRSDRGTEYDSVAFNEFYSSKGIIHETTAPYSPEMNGKAER 60
MFKVFVTEIENQFNKRIKRLRSDRGTEYDSVAFNEFY+SKGIIHETT PYSPEMNGK ER
Sbjct: 403 MFKVFVTEIENQFNKRIKRLRSDRGTEYDSVAFNEFYNSKGIIHETTTPYSPEMNGKEER 462
Query: 61 KNRTLTELVVAILLESGAAPSWWEEIIKSVNYVLNRIPKSNSKTSLYEVLKHKTPNLSYL 120
KNRTLTEL VAILLES AAPSWW EIIK+VNYVLNRIPKSNSKTS YEVLKHK PNLSYL
Sbjct: 463 KNRTLTELAVAILLESEAAPSWWGEIIKTVNYVLNRIPKSNSKTSPYEVLKHKIPNLSYL 522
Query: 121 RTWGCLAYVRIPDPKRRKLASRAYECVFIGYAENSKTYRFYDLENKVIIESNDVDFFEDI 180
RTWGCLAYVRIP+P+RRKLAS+AYECVFIGYAENSK YRFYDLENKVIIESNDVDFFED
Sbjct: 523 RTWGCLAYVRIPNPERRKLASKAYECVFIGYAENSKAYRFYDLENKVIIESNDVDFFEDK 582
Query: 181 FPFKSRNSGGLCSQTSGGSSSNSLPSIRIQTQDKEVDPEPRRNKRAKTVKDFGEDFEMYN 240
FPFKSRNSGGL SQTSGGSS +SLPSIRIQTQDKEVDPEPRR+KRA+TVKDF EDFEMYN
Sbjct: 583 FPFKSRNSGGLYSQTSGGSSFSSLPSIRIQTQDKEVDPEPRRSKRARTVKDFREDFEMYN 642
Query: 241 VEDPKDLTEALSSVDANLWQEAINDEMNSLESNRTWHLVDLPPGC 286
VEDPKDLT+ALSSVDANLWQEAIND ++SLESNRTWHLVDLPP C
Sbjct: 643 VEDPKDLTKALSSVDANLWQEAINDGIDSLESNRTWHLVDLPPRC 687
BLAST of Cmc02g0052001 vs. ExPASy TrEMBL
Match:
A0A445KFK2 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Glycine soja OX=3848 GN=D0Y65_015970 PE=4 SV=1)
HSP 1 Score: 414.5 bits (1064), Expect = 3.8e-112
Identity = 210/293 (71.67%), Positives = 237/293 (80.89%), Query Frame = 0
Query: 1 MFKVFVTEIENQFNKRIKRLRSDRGTEYDSVAFNEFYSSKGIIHETTAPYSPEMNGKAER 60
MFK+FVTEIENQFNK+IK+LRSDRGT+YDS FNEFY+ GIIHETTAPYSPEMNGKAER
Sbjct: 10 MFKLFVTEIENQFNKKIKKLRSDRGTKYDSSLFNEFYNLHGIIHETTAPYSPEMNGKAER 69
Query: 61 KNRTLTELVVAILLESGAAPSWWEEIIKSVNYVLNRIPKSNSKTSLYEVLKHKTPNLSYL 120
KNRT TELVVA +L S A WW EI+ +V YVLNRIPKS SKTS YE+LK + PNLSYL
Sbjct: 70 KNRTFTELVVATMLSSSATSFWWGEILLTVCYVLNRIPKSKSKTSPYEILKKRQPNLSYL 129
Query: 121 RTWGCLAYVRIPDPKRRKLASRAYECVFIGYAENSKTYRFYDLENKVIIESNDVDFFEDI 180
RTWGCLAYVRIPDPKR KLASRAYECVFIGYA NSK YRFYDL KVIIESND DF+E+
Sbjct: 130 RTWGCLAYVRIPDPKRVKLASRAYECVFIGYAINSKAYRFYDLNAKVIIESNDADFYENK 189
Query: 181 FPFKSRNSGGLCSQTSGGSSSNSLPSIRIQT-QDKEVDPEPRRNKRAKTVKDFGEDFEMY 240
FPFK R+ SGG+SSN LP+I + + D EPRR KRA+ KD+G D+ Y
Sbjct: 190 FPFKLRD--------SGGTSSNYLPAISSENLAQPKPDIEPRRGKRARIAKDYGPDYMAY 249
Query: 241 NV-EDPKDLTEALSSVDANLWQEAINDEMNSLESNRTWHLVDLPPGCKAIGCK 292
+ EDP +L EALS +DA+LWQEAINDEM+SLES++TWHLVDLPPGCK IGCK
Sbjct: 250 TLEEDPSNLQEALSFLDADLWQEAINDEMDSLESDKTWHLVDLPPGCKPIGCK 294
BLAST of Cmc02g0052001 vs. ExPASy TrEMBL
Match:
A0A7N2L531 (Uncharacterized protein OS=Quercus lobata OX=97700 PE=4 SV=1)
HSP 1 Score: 329.3 bits (843), Expect = 1.6e-86
Identity = 171/291 (58.76%), Positives = 207/291 (71.13%), Query Frame = 0
Query: 2 FKVFVTEIENQFNKRIKRLRSDRGTEYDSVAFNEFYSSKGIIHETTAPYSPEMNGKAERK 61
F+ F+ E+ENQF ++IKR+RSDRG EY+S AFN F S GIIHETTAPYSP NG AERK
Sbjct: 567 FQDFLQEVENQFGRKIKRIRSDRGREYESSAFNSFAQSLGIIHETTAPYSPASNGVAERK 626
Query: 62 NRTLTELVVAILLESGAAPSWWEEIIKSVNYVLNRIPKSNSKTSLYEVLKHKTPNLSYLR 121
NRTL EL A+L+ESGA +W E I + +VLNR+P S T+ +E+ K PNL YLR
Sbjct: 627 NRTLIELTNAMLIESGAPLHFWGEAILTACHVLNRVPHKKSHTTPFEMWKGHKPNLGYLR 686
Query: 122 TWGCLAYVRIPDPKRRKLASRAYECVFIGYAENSKTYRFYDLENKVIIESNDVDFFEDIF 181
W CLAYVR+ DPK KL RA C F+GYA NS YRF+DLENK+I ES D F E+ F
Sbjct: 687 AWDCLAYVRLTDPKMPKLGIRATTCAFLGYAINSAAYRFFDLENKIIFESGDAIFHEEKF 746
Query: 182 PFKSRNSGGLCSQTSGGSSSNSLPSIRIQTQDKEVDPEPRRNKRAKTVKDFGEDFEMYNV 241
PFK +NSGG + S SSS S Q Q+ + EPRR+KRA+ KDFG D+ ++N+
Sbjct: 747 PFKLKNSGGEENILSQPSSSTS----HFQNQE-NFEMEPRRSKRARVEKDFGPDYYVFNI 806
Query: 242 ED-PKDLTEALSSVDANLWQEAINDEMNSLESNRTWHLVDLPPGCKAIGCK 292
E+ PK+L EAL+S DA W+EA+NDEM SL SNRTW LVDLPPGCK IGCK
Sbjct: 807 EENPKNLKEALTSPDAIFWKEAVNDEMESLISNRTWKLVDLPPGCKTIGCK 852
BLAST of Cmc02g0052001 vs. ExPASy TrEMBL
Match:
A0A7N2R9F3 (Uncharacterized protein OS=Quercus lobata OX=97700 PE=4 SV=1)
HSP 1 Score: 323.2 bits (827), Expect = 1.1e-84
Identity = 168/291 (57.73%), Positives = 206/291 (70.79%), Query Frame = 0
Query: 2 FKVFVTEIENQFNKRIKRLRSDRGTEYDSVAFNEFYSSKGIIHETTAPYSPEMNGKAERK 61
F+ F+ E+ENQF ++IKR+RSDRG EY+S AFN F S GIIHETTAPYSP NG ERK
Sbjct: 567 FQDFLKEVENQFGRKIKRIRSDRGREYESSAFNSFVQSLGIIHETTAPYSPASNGVVERK 626
Query: 62 NRTLTELVVAILLESGAAPSWWEEIIKSVNYVLNRIPKSNSKTSLYEVLKHKTPNLSYLR 121
NRTL EL A+L+ESGA +W E I + +VLNR+P S T+ +E+ K PNL YLR
Sbjct: 627 NRTLIELTNAMLIESGAPLHFWGEAILTACHVLNRVPHKKSHTTPFEMWKGHKPNLGYLR 686
Query: 122 TWGCLAYVRIPDPKRRKLASRAYECVFIGYAENSKTYRFYDLENKVIIESNDVDFFEDIF 181
WGCLAYVR+ DPK KL RA C F+GYA NS YRF+DLENK+I ES D F E+ F
Sbjct: 687 VWGCLAYVRLTDPKIPKLGIRATTCAFLGYAINSAAYRFFDLENKIIFESGDAIFHEEKF 746
Query: 182 PFKSRNSGGLCSQTSGGSSSNSLPSIRIQTQDKEVDPEPRRNKRAKTVKDFGEDFEMYNV 241
PFK +NSGG + SSS S +Q Q+ + E RR+KRA+ KDFG D+ ++N+
Sbjct: 747 PFKLKNSGGEENILLQPSSSTS----HLQNQE-NFEMELRRSKRARVEKDFGPDYYVFNI 806
Query: 242 ED-PKDLTEALSSVDANLWQEAINDEMNSLESNRTWHLVDLPPGCKAIGCK 292
E+ P++L EAL+S DA W+EA+NDEM SL SNRTW LVDLPPGCK IGCK
Sbjct: 807 EENPQNLKEALTSSDAIFWKEAVNDEMESLISNRTWKLVDLPPGCKTIGCK 852
BLAST of Cmc02g0052001 vs. ExPASy TrEMBL
Match:
A0A7N2N1S1 (Integrase catalytic domain-containing protein OS=Quercus lobata OX=97700 PE=4 SV=1)
HSP 1 Score: 315.8 bits (808), Expect = 1.8e-82
Identity = 165/277 (59.57%), Positives = 197/277 (71.12%), Query Frame = 0
Query: 16 RIKRLRSDRGTEYDSVAFNEFYSSKGIIHETTAPYSPEMNGKAERKNRTLTELVVAILLE 75
+IKR+RSDRG EY+S AFN F S GIIHETTAPYSP NG AERKNRTL EL A+L+E
Sbjct: 430 KIKRIRSDRGHEYESSAFNSFAQSLGIIHETTAPYSPASNGVAERKNRTLIELTNAMLIE 489
Query: 76 SGAAPSWWEEIIKSVNYVLNRIPKSNSKTSLYEVLKHKTPNLSYLRTWGCLAYVRIPDPK 135
SGA +W E I + +VLNR+P S T+ +E+ K PNL YLR WGCLAYVR+ DPK
Sbjct: 490 SGAPLHFWGEAILTACHVLNRVPHKKSHTTPFEMWKGHKPNLGYLRVWGCLAYVRLTDPK 549
Query: 136 RRKLASRAYECVFIGYAENSKTYRFYDLENKVIIESNDVDFFEDIFPFKSRNSGGLCSQT 195
KL RA C F+GYA NS YRF+DLENK+I ES D F E+ FPFK +NSGG +
Sbjct: 550 MPKLGIRATTCAFLGYAINSAAYRFFDLENKIIFESGDAIFHEEKFPFKLKNSGGEENIL 609
Query: 196 SGGSSSNSLPSIRIQTQDKEVDPEPRRNKRAKTVKDFGEDFEMYNVED-PKDLTEALSSV 255
S SSS S Q Q+ + EPRR+KRA+ KDFG D+ ++N+E+ PK+L EAL+S
Sbjct: 610 SQPSSSTS----HFQNQE-NFEMEPRRSKRARVEKDFGPDYYVFNIEENPKNLKEALTSP 669
Query: 256 DANLWQEAINDEMNSLESNRTWHLVDLPPGCKAIGCK 292
DA W+EA+NDEM SL SNRTW LVDLPPGCK IGCK
Sbjct: 670 DAIFWKEAVNDEMESLISNRTWKLVDLPPGCKTIGCK 701
BLAST of Cmc02g0052001 vs. TAIR 10
Match:
ATMG00710.1 (Polynucleotidyl transferase, ribonuclease H-like superfamily protein)
HSP 1 Score: 43.9 bits (102), Expect = 2.6e-04
Identity = 24/85 (28.24%), Positives = 43/85 (50.59%), Query Frame = 0
Query: 62 NRTLTELVVAILLESGAAPSWWEEIIKSVNYVLNRIPKSNSKTSL-YEVLKHKTPNLSYL 121
NRT+ E V ++L E G ++ + + +++N+ P + + EV P SYL
Sbjct: 2 NRTIIEKVRSMLCECGLPKTFRADAANTAVHIINKYPSTAINFHVPDEVWFQSVPTYSYL 61
Query: 122 RTWGCLAYVRIPDPKRRKLASRAYE 146
R +GC+AY+ + K + A + E
Sbjct: 62 RRFGCVAYIHCDEGKLKPRAKKGEE 86
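The percentages on each HSP statistics line follow from the reported fractions, for example 260/285 = 91.23% identity for the top hit. A minimal Python sketch for recomputing them is given here; `hsp_percentages` is a hypothetical helper name, and the regular expression assumes the "Identity = a/b (p%), Positives = c/d (q%)" layout used in this report (it also tolerates the "Postives" misspelling that some report exports emit).

```python
import re

# Assumed line layout: "Identity = a/b (p%), Positives = c/d (q%), ..."
# "Posi?tives" also accepts the "Postives" misspelling found in some exports.
IDENTITY_RE = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), "
    r"Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)


def hsp_percentages(line):
    """Recompute (identity %, positives %) from the fractions on an HSP line."""
    match = IDENTITY_RE.search(line)
    if match is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    identical, ident_len, positive, pos_len = map(int, match.groups())
    return (
        round(100 * identical / ident_len, 2),
        round(100 * positive / pos_len, 2),
    )


line = "Identity = 260/285 (91.23%), Positives = 271/285 (95.09%), Query Frame = 0"
identity_pct, positives_pct = hsp_percentages(line)
print(identity_pct, positives_pct)
```

Recomputing from the fractions rather than trusting the printed percentage is a simple consistency check when results are copied between tools.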
The following BLAST results are available for this feature:
BLAST of Cmc02g0052001 vs. NCBI nr
Match Name | E-value | Identity (%) | Description
KAA0034938.1 | 3.1e-145 | 91.23 | putative Polyprotein [Cucumis melo var. makuwa] >TYK21293.1 putative Polyprotein...
RZC09450.1 | 7.8e-112 | 71.67 | Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Glycine soja]
AAU90333.1 | 1.2e-80 | 55.63 | Putative gag and pol polyprotein, identical [Solanum demissum]
ABI34306.1 | 6.0e-80 | 55.29 | Polyprotein, putative [Solanum demissum]
XP_023158131.2 | 1.4e-73 | 50.34 | uncharacterized protein LOC103653943 isoform X1 [Zea mays] >XP_035823266.1 uncha...
BLAST of Cmc02g0052001 vs. ExPASy Swiss-Prot
Match Name | E-value | Identity (%) | Description
P10978 | 7.2e-36 | 31.72 | Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum...
Q9ZT94 | 5.4e-23 | 31.07 | Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana O...
Q94HW2 | 3.8e-21 | 32.43 | Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana O...
P04146 | 1.5e-20 | 27.76 | Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3
P0C2J7 | 3.1e-10 | 27.54 | Transposon Ty4-H Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 20...
BLAST of Cmc02g0052001 vs. ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A5D3DCJ1 | 1.5e-145 | 91.23 | Putative Polyprotein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold119...
A0A445KFK2 | 3.8e-112 | 71.67 | Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Glycine soja OX=3...
A0A7N2L531 | 1.6e-86 | 58.76 | Uncharacterized protein OS=Quercus lobata OX=97700 PE=4 SV=1
A0A7N2R9F3 | 1.1e-84 | 57.73 | Uncharacterized protein OS=Quercus lobata OX=97700 PE=4 SV=1
A0A7N2N1S1 | 1.8e-82 | 59.57 | Integrase catalytic domain-containing protein OS=Quercus lobata OX=97700 PE=4 SV...
BLAST of Cmc02g0052001 vs. TAIR 10
Match Name | E-value | Identity (%) | Description
ATMG00710.1 | 2.6e-04 | 28.24 | Polynucleotidyl transferase, ribonuclease H-like superfamily protein
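The summary rows above are pipe-delimited, so they can be loaded into records and filtered by E-value. What follows is a minimal Python sketch, assuming rows of the form `name | E-value | identity | description` (any trailing fields are ignored). `parse_summary_row` and `filter_hits` are hypothetical helper names, and the 1e-50 cutoff is an arbitrary illustration, not a threshold taken from this report.

```python
def parse_summary_row(row):
    """Split one pipe-delimited summary row into a typed record."""
    fields = [field.strip() for field in row.split("|")]
    if len(fields) < 4:
        raise ValueError(f"expected at least 4 fields: {row!r}")
    name, evalue, identity, description = fields[:4]
    return {
        "name": name,
        "evalue": float(evalue),      # e.g. "3.1e-145" parses as a float
        "identity": float(identity),  # percent identity
        "description": description,
    }


def filter_hits(rows, max_evalue):
    """Keep hits whose E-value is at or below max_evalue, best first."""
    hits = [parse_summary_row(row) for row in rows]
    kept = [hit for hit in hits if hit["evalue"] <= max_evalue]
    return sorted(kept, key=lambda hit: hit["evalue"])


rows = [
    "P10978 | 7.2e-36 | 31.72 | Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum...",
    "KAA0034938.1 | 3.1e-145 | 91.23 | putative Polyprotein [Cucumis melo var. makuwa] >TYK21293.1 putative Polyprotein...",
    "ATMG00710.1 | 2.6e-04 | 28.24 | Polynucleotidyl transferase, ribonuclease H-like superfamily protein",
]

# Illustrative cutoff only; choose a threshold appropriate to the analysis.
strong_hits = filter_hits(rows, max_evalue=1e-50)
for hit in strong_hits:
    print(hit["name"], hit["evalue"], hit["identity"])
```

Sorting by E-value keeps the most significant hit first, which matches the ordering used within each table of this report.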