Homology
BLAST of Moc02g02020 vs. NCBI nr
Match:
XP_022153017.1 (pentatricopeptide repeat-containing protein At4g14170-like [Momordica charantia])
HSP 1 Score: 1705.3 bits (4415), Expect = 0.0e+00
Identity = 832/832 (100.00%), Postives = 832/832 (100.00%), Query Frame = 0
Query: 1 MLHSCARSTGLHFASPFQFSFQTTRFQYRSFLHPVLGSASSSPLPLPSTEPVSSVHVDLL 60
MLHSCARSTGLHFASPFQFSFQTTRFQYRSFLHPVLGSASSSPLPLPSTEPVSSVHVDLL
Sbjct: 1 MLHSCARSTGLHFASPFQFSFQTTRFQYRSFLHPVLGSASSSPLPLPSTEPVSSVHVDLL 60
Query: 61 TLCSNAQSLRQTKQLHALCLLNGLLPHSVSLCASLILDYATFRDPESFRILFHQTVQNCR 120
TLCSNAQSLRQTKQLHALCLLNGLLPHSVSLCASLILDYATFRDPESFRILFHQTVQNCR
Sbjct: 61 TLCSNAQSLRQTKQLHALCLLNGLLPHSVSLCASLILDYATFRDPESFRILFHQTVQNCR 120
Query: 121 TAFLWNTLIRAHSIAGNGMIDGLQTYNRMVRSGVQLDDHTFPFVLKLCSDSLDICKGMEV 180
TAFLWNTLIRAHSIAGNGMIDGLQTYNRMVRSGVQLDDHTFPFVLKLCSDSLDICKGMEV
Sbjct: 121 TAFLWNTLIRAHSIAGNGMIDGLQTYNRMVRSGVQLDDHTFPFVLKLCSDSLDICKGMEV 180
Query: 181 HGVVSKLGFDSDVYVGNTLLMLYGNCGVVSDARRVFDEMPERDVVSWNTILGLLSVNGDY 240
HGVVSKLGFDSDVYVGNTLLMLYGNCGVVSDARRVFDEMPERDVVSWNTILGLLSVNGDY
Sbjct: 181 HGVVSKLGFDSDVYVGNTLLMLYGNCGVVSDARRVFDEMPERDVVSWNTILGLLSVNGDY 240
Query: 241 REARNYYFWMTLRSGIQPNVVSVVILLPISAALEDEEMTRRIHCYTMKAGLDSQVTTCNA 300
REARNYYFWMTLRSGIQPNVVSVVILLPISAALEDEEMTRRIHCYTMKAGLDSQVTTCNA
Sbjct: 241 REARNYYFWMTLRSGIQPNVVSVVILLPISAALEDEEMTRRIHCYTMKAGLDSQVTTCNA 300
Query: 301 LVDAYGKCGNVKASWQVFDEMVERNEVSWNAIINGLACKGHFCDALDVFRMMIDAEVKPN 360
LVDAYGKCGNVKASWQVFDEMVERNEVSWNAIINGLACKGHFCDALDVFRMMIDAEVKPN
Sbjct: 301 LVDAYGKCGNVKASWQVFDEMVERNEVSWNAIINGLACKGHFCDALDVFRMMIDAEVKPN 360
Query: 361 SVTISSILPVLVELEYFKAGKEIHGFSMRMGTETDIFIANSLIDMYAKSGHSTEASCIFH 420
SVTISSILPVLVELEYFKAGKEIHGFSMRMGTETDIFIANSLIDMYAKSGHSTEASCIFH
Sbjct: 361 SVTISSILPVLVELEYFKAGKEIHGFSMRMGTETDIFIANSLIDMYAKSGHSTEASCIFH 420
Query: 421 NMDRRNVVSWNAMIANYALNGLALEAIRQVILMQETGENPNAVTFTNVLPACARLGFLGP 480
NMDRRNVVSWNAMIANYALNGLALEAIRQVILMQETGENPNAVTFTNVLPACARLGFLGP
Sbjct: 421 NMDRRNVVSWNAMIANYALNGLALEAIRQVILMQETGENPNAVTFTNVLPACARLGFLGP 480
Query: 481 GKEIHAIGIRLGLTADMFVSNALTDMYAKCGCLHSARNVFNTSHKDEVSYNILILGYSET 540
GKEIHAIGIRLGLTADMFVSNALTDMYAKCGCLHSARNVFNTSHKDEVSYNILILGYSET
Sbjct: 481 GKEIHAIGIRLGLTADMFVSNALTDMYAKCGCLHSARNVFNTSHKDEVSYNILILGYSET 540
Query: 541 SDCLESLNLFSEMRLLGRKPDVVSFVGVISACANLAAVKQGKEIHGVALRNHFYSHLFVS 600
SDCLESLNLFSEMRLLGRKPDVVSFVGVISACANLAAVKQGKEIHGVALRNHFYSHLFVS
Sbjct: 541 SDCLESLNLFSEMRLLGRKPDVVSFVGVISACANLAAVKQGKEIHGVALRNHFYSHLFVS 600
Query: 601 NSLLDFYTKCARIDLACKVFNQILFKDVASWNTMILGYGMIGELETAITMFEEMRGDTVQ 660
NSLLDFYTKCARIDLACKVFNQILFKDVASWNTMILGYGMIGELETAITMFEEMRGDTVQ
Sbjct: 601 NSLLDFYTKCARIDLACKVFNQILFKDVASWNTMILGYGMIGELETAITMFEEMRGDTVQ 660
Query: 661 YDLVSYIAVLSACSHGGLVEQGWKYFSEMLAQNLEPTQMHYACIVDLLGRAGFVEEAAEL 720
YDLVSYIAVLSACSHGGLVEQGWKYFSEMLAQNLEPTQMHYACIVDLLGRAGFVEEAAEL
Sbjct: 661 YDLVSYIAVLSACSHGGLVEQGWKYFSEMLAQNLEPTQMHYACIVDLLGRAGFVEEAAEL 720
Query: 721 IRCLPIAPDANIWGALLGACRIYRNVELGCWAAEHLFEIKPQHCGYYILLSNMYAEAGRW 780
IRCLPIAPDANIWGALLGACRIYRNVELGCWAAEHLFEIKPQHCGYYILLSNMYAEAGRW
Sbjct: 721 IRCLPIAPDANIWGALLGACRIYRNVELGCWAAEHLFEIKPQHCGYYILLSNMYAEAGRW 780
Query: 781 DEVNRVRDLMKSRGAKKNPGCSWVQIHDQVHTFMAEERAEGFESGGWLAESF 833
DEVNRVRDLMKSRGAKKNPGCSWVQIHDQVHTFMAEERAEGFESGGWLAESF
Sbjct: 781 DEVNRVRDLMKSRGAKKNPGCSWVQIHDQVHTFMAEERAEGFESGGWLAESF 832
BLAST of Moc02g02020 vs. NCBI nr
Match:
XP_038901996.1 (pentatricopeptide repeat-containing protein At4g14170-like [Benincasa hispida])
HSP 1 Score: 1461.0 bits (3781), Expect = 0.0e+00
Identity = 712/832 (85.58%), Postives = 761/832 (91.47%), Query Frame = 0
Query: 1 MLHSCARSTGLHFASPFQFSFQTTRFQYRSFLHPVLGSASSSPLPLP-STEPVSSVHVDL 60
ML C RS+ SPF F Q TRFQYR F P+ SA SP PLP STEP SS+H++L
Sbjct: 1 MLQFCIRSS---IVSPFPFISQITRFQYRHFHQPIFVSALQSPFPLPRSTEPNSSIHINL 60
Query: 61 LTLCSNAQSLRQTKQLHALCLLNGLLPHSVSLCASLILDYATFRDPESFRILFHQTVQNC 120
LTLCSNAQSL QTKQLHALCLLNGLLP SVSLC+SLIL+YA F+ PESF LFHQTVQNC
Sbjct: 61 LTLCSNAQSLPQTKQLHALCLLNGLLPRSVSLCSSLILNYAKFQHPESFCSLFHQTVQNC 120
Query: 121 RTAFLWNTLIRAHSIAGNGMIDGLQTYNRMVRSGVQLDDHTFPFVLKLCSDSLDICKGME 180
RTAFLWNTLIRAHSIAGNG DG +TYNRMVR GVQLDDHTFPF+LKLCSDS DI KGME
Sbjct: 121 RTAFLWNTLIRAHSIAGNGTRDGFETYNRMVRVGVQLDDHTFPFLLKLCSDSFDIWKGME 180
Query: 181 VHGVVSKLGFDSDVYVGNTLLMLYGNCGVVSDARRVFDEMPERDVVSWNTILGLLSVNGD 240
VHGVV KLGFD+DVYVGNTLLMLYGNC ++DARRVFDEMPERDVVSWNTI+GL SVNGD
Sbjct: 181 VHGVVFKLGFDTDVYVGNTLLMLYGNCRFLNDARRVFDEMPERDVVSWNTIIGLFSVNGD 240
Query: 241 YREARNYYFWMTLRSGIQPNVVSVVILLPISAALEDEEMTRRIHCYTMKAGLDSQVTTCN 300
YREARNYYFWM LRSGI+PN+VSV+ LLPISA LEDEEMTRRIHCYT+K GLDS VT CN
Sbjct: 241 YREARNYYFWMNLRSGIKPNLVSVITLLPISAGLEDEEMTRRIHCYTVKVGLDSHVTICN 300
Query: 301 ALVDAYGKCGNVKASWQVFDEMVERNEVSWNAIINGLACKGHFCDALDVFRMMIDAEVKP 360
ALVDAYGKCGNVKA WQVFDE+ ERNEVSWNA+INGLACKG DAL+VF+MMIDA KP
Sbjct: 301 ALVDAYGKCGNVKALWQVFDEIFERNEVSWNAMINGLACKGRCWDALNVFKMMIDAGAKP 360
Query: 361 NSVTISSILPVLVELEYFKAGKEIHGFSMRMGTETDIFIANSLIDMYAKSGHSTEASCIF 420
NS+T+SSILPVLVELE FKAGKEIHGFSMRMGTETDIFIANSLIDMYAKSG STEAS IF
Sbjct: 361 NSITVSSILPVLVELECFKAGKEIHGFSMRMGTETDIFIANSLIDMYAKSGRSTEASSIF 420
Query: 421 HNMDRRNVVSWNAMIANYALNGLALEAIRQVILMQETGENPNAVTFTNVLPACARLGFLG 480
HNM RRN+VSWNAMIANYALNGLALEAIR +ILMQETGE PNAVTFTNVLPACARLG LG
Sbjct: 421 HNMGRRNIVSWNAMIANYALNGLALEAIRFIILMQETGERPNAVTFTNVLPACARLGLLG 480
Query: 481 PGKEIHAIGIRLGLTADMFVSNALTDMYAKCGCLHSARNVFNTSHKDEVSYNILILGYSE 540
PGKEIHA+ +RLG T+D+FVSNALTDMYAKCGCLHSARNVFNTSHKDEVSYNILI+GYSE
Sbjct: 481 PGKEIHAMAVRLGPTSDLFVSNALTDMYAKCGCLHSARNVFNTSHKDEVSYNILIIGYSE 540
Query: 541 TSDCLESLNLFSEMRLLGRKPDVVSFVGVISACANLAAVKQGKEIHGVALRNHFYSHLFV 600
++DCLESLNLFSEMRLLG+KPDVVSFVGVISACANLAAVKQGKEIHGVALRN YSHLFV
Sbjct: 541 SNDCLESLNLFSEMRLLGKKPDVVSFVGVISACANLAAVKQGKEIHGVALRNLLYSHLFV 600
Query: 601 SNSLLDFYTKCARIDLACKVFNQILFKDVASWNTMILGYGMIGELETAITMFEEMRGDTV 660
SNSLLDFYTKC RIDLACKVFNQILFKDVASWNTMILGYGMIGELETAI+MFE MR DTV
Sbjct: 601 SNSLLDFYTKCGRIDLACKVFNQILFKDVASWNTMILGYGMIGELETAISMFEAMRNDTV 660
Query: 661 QYDLVSYIAVLSACSHGGLVEQGWKYFSEMLAQNLEPTQMHYACIVDLLGRAGFVEEAAE 720
QYDLVSYIAVLSACSHGGLVE+GW+YFSEMLAQ+LEPT+MHY C+VDLLGRAGFVEEAAE
Sbjct: 661 QYDLVSYIAVLSACSHGGLVERGWQYFSEMLAQHLEPTEMHYTCLVDLLGRAGFVEEAAE 720
Query: 721 LIRCLPIAPDANIWGALLGACRIYRNVELGCWAAEHLFEIKPQHCGYYILLSNMYAEAGR 780
LI+ LPIAPDANIWGALLGACRIY NV+LGC AAEHLFE+KPQHCGYYILLSN+YAE GR
Sbjct: 721 LIQQLPIAPDANIWGALLGACRIYGNVKLGCRAAEHLFELKPQHCGYYILLSNIYAETGR 780
Query: 781 WDEVNRVRDLMKSRGAKKNPGCSWVQIHDQVHTFMAEERAEGFESGGWLAES 832
WDEVNR+R+LMKSRGAKKNPGCSWVQI+DQ+H F+AEERAEGFESGGWLAES
Sbjct: 781 WDEVNRIRELMKSRGAKKNPGCSWVQIYDQLHAFVAEERAEGFESGGWLAES 829
BLAST of Moc02g02020 vs. NCBI nr
Match:
KAG6570435.1 (Pentatricopeptide repeat-containing protein DOT4, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 1425.6 bits (3689), Expect = 0.0e+00
Identity = 689/830 (83.01%), Postives = 750/830 (90.36%), Query Frame = 0
Query: 1 MLHSCARSTGLHFASPFQFSFQTTRFQYRSFLHPVLGSASSSPLPLPSTEPVSSVHVDLL 60
ML C RS HFA Q RF +R+++ STE +SSVH++LL
Sbjct: 1 MLQFCIRSIRFHFA-------QIARFHFRNYVR--------------STEQISSVHINLL 60
Query: 61 TLCSNAQSLRQTKQLHALCLLNGLLPHSVSLCASLILDYATFRDPESFRILFHQTVQNCR 120
TLC NAQSLRQTK++HA+CLLNGLLPHSVSLCASLIL+YA F+ PESF LFHQTVQNCR
Sbjct: 61 TLCFNAQSLRQTKEVHAICLLNGLLPHSVSLCASLILNYAKFQHPESFCTLFHQTVQNCR 120
Query: 121 TAFLWNTLIRAHSIAGNGMIDGLQTYNRMVRSGVQLDDHTFPFVLKLCSDSLDICKGMEV 180
TAFLWNTLIRAHSIAGNG +DGL+TYNRMVR GVQLDDHTFPFVLK+CSDSLDICKGMEV
Sbjct: 121 TAFLWNTLIRAHSIAGNGTLDGLETYNRMVRLGVQLDDHTFPFVLKICSDSLDICKGMEV 180
Query: 181 HGVVSKLGFDSDVYVGNTLLMLYGNCGVVSDARRVFDEMPERDVVSWNTILGLLSVNGDY 240
HGVV KLGFDS VYVGNTLLMLYGNCG ++DA++VFDEM ERDVVSWNT++GLLSVNGDY
Sbjct: 181 HGVVFKLGFDSHVYVGNTLLMLYGNCGFLNDAKKVFDEMSERDVVSWNTVIGLLSVNGDY 240
Query: 241 REARNYYFWMTLRSGIQPNVVSVVILLPISAALEDEEMTRRIHCYTMKAGLDSQVTTCNA 300
REARNYYFWMTLRSGIQPN+VSV+ LLPISA LEDEEMTRRIHCY +K GLDS VT+CNA
Sbjct: 241 REARNYYFWMTLRSGIQPNLVSVISLLPISAGLEDEEMTRRIHCYIVKVGLDSLVTSCNA 300
Query: 301 LVDAYGKCGNVKASWQVFDEMVERNEVSWNAIINGLACKGHFCDALDVFRMMIDAEVKPN 360
LVDAYGKCG+VK SWQVFDE++E+NEVSWN+IINGLA KGHF DALDVFRMMIDA KPN
Sbjct: 301 LVDAYGKCGSVKTSWQVFDEIIEKNEVSWNSIINGLAFKGHFRDALDVFRMMIDAGTKPN 360
Query: 361 SVTISSILPVLVELEYFKAGKEIHGFSMRMGTETDIFIANSLIDMYAKSGHSTEASCIFH 420
SVTISSILPV VELE FKAGKEIHGFSMRMGTETD+FIANSLIDMYAKSGHSTEAS IFH
Sbjct: 361 SVTISSILPVFVELECFKAGKEIHGFSMRMGTETDLFIANSLIDMYAKSGHSTEASSIFH 420
Query: 421 NMDRRNVVSWNAMIANYALNGLALEAIRQVILMQETGENPNAVTFTNVLPACARLGFLGP 480
NMD RN+VSWNAMIANY LNG++LEAIR VIL+QE+GE PNAVTFTNVLPACARLG LGP
Sbjct: 421 NMDGRNIVSWNAMIANYVLNGVSLEAIRFVILLQESGERPNAVTFTNVLPACARLGHLGP 480
Query: 481 GKEIHAIGIRLGLTADMFVSNALTDMYAKCGCLHSARNVFNTSHKDEVSYNILILGYSET 540
GKEIHA+G+RLGLT+D+FV+NALTDMYAKCGC SARNVFNTSHKDEVSYNILI GYSET
Sbjct: 481 GKEIHAMGVRLGLTSDLFVTNALTDMYAKCGCFRSARNVFNTSHKDEVSYNILITGYSET 540
Query: 541 SDCLESLNLFSEMRLLGRKPDVVSFVGVISACANLAAVKQGKEIHGVALRNHFYSHLFVS 600
+DCLESLNLFSEMRLLG+KPDVVSF+GVISACANLAAVKQGKEIHGVALRNH SHLFVS
Sbjct: 541 NDCLESLNLFSEMRLLGKKPDVVSFMGVISACANLAAVKQGKEIHGVALRNHLNSHLFVS 600
Query: 601 NSLLDFYTKCARIDLACKVFNQILFKDVASWNTMILGYGMIGELETAITMFEEMRGDTVQ 660
NSLLDFYTKC RIDLACK+FNQILFKDVASWNTMILGYGMIGELETAI MFE MR D VQ
Sbjct: 601 NSLLDFYTKCGRIDLACKIFNQILFKDVASWNTMILGYGMIGELETAINMFEAMRDDKVQ 660
Query: 661 YDLVSYIAVLSACSHGGLVEQGWKYFSEMLAQNLEPTQMHYACIVDLLGRAGFVEEAAEL 720
YDLVSYIAVLSACSHGGLVE+GW+YFSEMLAQ+LEPT+MHY C+VDLLGRAGFVEEAAEL
Sbjct: 661 YDLVSYIAVLSACSHGGLVERGWQYFSEMLAQHLEPTEMHYTCLVDLLGRAGFVEEAAEL 720
Query: 721 IRCLPIAPDANIWGALLGACRIYRNVELGCWAAEHLFEIKPQHCGYYILLSNMYAEAGRW 780
IR LPIAPD+NIWGALLGACRIY NVELGC AAEHLFE+KPQHCGYYILL+N++AE GRW
Sbjct: 721 IRRLPIAPDSNIWGALLGACRIYGNVELGCKAAEHLFELKPQHCGYYILLANIHAETGRW 780
Query: 781 DEVNRVRDLMKSRGAKKNPGCSWVQIHDQVHTFMAEERAEGFESGGWLAE 831
DEVNR+R+LMKSRGAKK+PGCSWVQIHDQ H F+ ++RAEGFESGG LAE
Sbjct: 781 DEVNRIRELMKSRGAKKSPGCSWVQIHDQPHAFVVDDRAEGFESGGLLAE 809
BLAST of Moc02g02020 vs. NCBI nr
Match:
XP_022985648.1 (pentatricopeptide repeat-containing protein At4g14170-like isoform X1 [Cucurbita maxima])
HSP 1 Score: 1423.7 bits (3684), Expect = 0.0e+00
Identity = 687/830 (82.77%), Postives = 752/830 (90.60%), Query Frame = 0
Query: 1 MLHSCARSTGLHFASPFQFSFQTTRFQYRSFLHPVLGSASSSPLPLPSTEPVSSVHVDLL 60
ML C RS HFA Q RFQ+R+++ STEP SSVH++LL
Sbjct: 1 MLQFCIRSIRFHFA-------QIARFQFRNYVR--------------STEPNSSVHINLL 60
Query: 61 TLCSNAQSLRQTKQLHALCLLNGLLPHSVSLCASLILDYATFRDPESFRILFHQTVQNCR 120
TLC N+QSLRQTK++HA+CLLNGLLPHSVSLCASLIL+YA F+ PESF LFHQTVQNCR
Sbjct: 61 TLCFNSQSLRQTKEVHAICLLNGLLPHSVSLCASLILNYAKFQHPESFCTLFHQTVQNCR 120
Query: 121 TAFLWNTLIRAHSIAGNGMIDGLQTYNRMVRSGVQLDDHTFPFVLKLCSDSLDICKGMEV 180
T FLWNTLIRAHSIAGNG +DGL+TYNRMVR GVQLDDHTFPFVLK+CSDSLDICKGMEV
Sbjct: 121 TTFLWNTLIRAHSIAGNGTLDGLETYNRMVRFGVQLDDHTFPFVLKICSDSLDICKGMEV 180
Query: 181 HGVVSKLGFDSDVYVGNTLLMLYGNCGVVSDARRVFDEMPERDVVSWNTILGLLSVNGDY 240
HGVV KLGFDSDVYVGNTLLMLYGNCG ++ A++VFDEM ERDVVSWNT++GLLSVNGDY
Sbjct: 181 HGVVFKLGFDSDVYVGNTLLMLYGNCGFLNGAKKVFDEMSERDVVSWNTVIGLLSVNGDY 240
Query: 241 REARNYYFWMTLRSGIQPNVVSVVILLPISAALEDEEMTRRIHCYTMKAGLDSQVTTCNA 300
REARNYYFWMTLRSGIQPN+VSV+ LLPISA LEDEEMTRRIHCY +K GLDS VT+CNA
Sbjct: 241 REARNYYFWMTLRSGIQPNLVSVISLLPISAGLEDEEMTRRIHCYIVKVGLDSLVTSCNA 300
Query: 301 LVDAYGKCGNVKASWQVFDEMVERNEVSWNAIINGLACKGHFCDALDVFRMMIDAEVKPN 360
LVDAYGKCG+VKASWQVFDE++E+NEVSWN+IINGLA KGHF DAL+VFRMMIDA KPN
Sbjct: 301 LVDAYGKCGSVKASWQVFDEIIEKNEVSWNSIINGLAFKGHFWDALEVFRMMIDAGTKPN 360
Query: 361 SVTISSILPVLVELEYFKAGKEIHGFSMRMGTETDIFIANSLIDMYAKSGHSTEASCIFH 420
SVTISSILPV VELE FKAGKEIHGFSMRMGTETD+FIANSLIDMYAKSGH TEAS IFH
Sbjct: 361 SVTISSILPVFVELECFKAGKEIHGFSMRMGTETDLFIANSLIDMYAKSGHLTEASSIFH 420
Query: 421 NMDRRNVVSWNAMIANYALNGLALEAIRQVILMQETGENPNAVTFTNVLPACARLGFLGP 480
NMD RN+VSWNAMIANYALNG+ALEAIR VIL+QE+GE PNAVTFTNVLPACARLG LGP
Sbjct: 421 NMDGRNIVSWNAMIANYALNGVALEAIRFVILLQESGERPNAVTFTNVLPACARLGHLGP 480
Query: 481 GKEIHAIGIRLGLTADMFVSNALTDMYAKCGCLHSARNVFNTSHKDEVSYNILILGYSET 540
GKEIHA+G+RLGLT+D+FV+NALTDMYAKCGC SARNVFNTSHKDEVSYNILI GYSET
Sbjct: 481 GKEIHAMGVRLGLTSDLFVTNALTDMYAKCGCFRSARNVFNTSHKDEVSYNILITGYSET 540
Query: 541 SDCLESLNLFSEMRLLGRKPDVVSFVGVISACANLAAVKQGKEIHGVALRNHFYSHLFVS 600
+DCLESLNLFSEMRLLG+KPDVVSF+GV+SACANLAAVKQGKEIHGVALRNH SHLFVS
Sbjct: 541 NDCLESLNLFSEMRLLGKKPDVVSFMGVLSACANLAAVKQGKEIHGVALRNHLNSHLFVS 600
Query: 601 NSLLDFYTKCARIDLACKVFNQILFKDVASWNTMILGYGMIGELETAITMFEEMRGDTVQ 660
NSLLDFYTKC RIDLACK+FNQILFKDVASWNTMILGYGMIGELETAI MFE MR D VQ
Sbjct: 601 NSLLDFYTKCGRIDLACKIFNQILFKDVASWNTMILGYGMIGELETAINMFEAMRDDKVQ 660
Query: 661 YDLVSYIAVLSACSHGGLVEQGWKYFSEMLAQNLEPTQMHYACIVDLLGRAGFVEEAAEL 720
YD+VSYIAVLSACSHGGLVE+G +YFSEMLAQ+LEPT+MHY C+VDLLGRAGFVEEAAEL
Sbjct: 661 YDVVSYIAVLSACSHGGLVERGCQYFSEMLAQHLEPTEMHYTCLVDLLGRAGFVEEAAEL 720
Query: 721 IRCLPIAPDANIWGALLGACRIYRNVELGCWAAEHLFEIKPQHCGYYILLSNMYAEAGRW 780
IR LPIAPD+NIWGALLGACRIY N++LGC AAEHLFE+KPQHCGYYILLSNMYAE GRW
Sbjct: 721 IRRLPIAPDSNIWGALLGACRIYGNIDLGCKAAEHLFELKPQHCGYYILLSNMYAETGRW 780
Query: 781 DEVNRVRDLMKSRGAKKNPGCSWVQIHDQVHTFMAEERAEGFESGGWLAE 831
D+VNR+R+LMKSRGAKK+PGCSWVQIHDQ+H F+ ++RAEGFESGG+LAE
Sbjct: 781 DDVNRIRELMKSRGAKKSPGCSWVQIHDQLHAFVVDDRAEGFESGGFLAE 809
BLAST of Moc02g02020 vs. NCBI nr
Match:
XP_022944177.1 (pentatricopeptide repeat-containing protein At4g14170-like isoform X1 [Cucurbita moschata])
HSP 1 Score: 1421.4 bits (3678), Expect = 0.0e+00
Identity = 690/830 (83.13%), Postives = 748/830 (90.12%), Query Frame = 0
Query: 1 MLHSCARSTGLHFASPFQFSFQTTRFQYRSFLHPVLGSASSSPLPLPSTEPVSSVHVDLL 60
ML C RS HFA Q RFQ+R+F+ TEP SSVH++LL
Sbjct: 1 MLQFCIRSFRFHFA-------QIARFQFRNFVR--------------RTEPNSSVHINLL 60
Query: 61 TLCSNAQSLRQTKQLHALCLLNGLLPHSVSLCASLILDYATFRDPESFRILFHQTVQNCR 120
TLC NAQSLRQTK++HA+CLLNGLLPHSVSLCASLIL+YA F+ PESF LFHQTVQNCR
Sbjct: 61 TLCFNAQSLRQTKEVHAICLLNGLLPHSVSLCASLILNYAKFQHPESFCTLFHQTVQNCR 120
Query: 121 TAFLWNTLIRAHSIAGNGMIDGLQTYNRMVRSGVQLDDHTFPFVLKLCSDSLDICKGMEV 180
TAFLWNTLIRAHSIAGNG +DGL+TYNRMVR GVQLDDHTFPFVLK+CSDSLDICKGMEV
Sbjct: 121 TAFLWNTLIRAHSIAGNGTLDGLETYNRMVRFGVQLDDHTFPFVLKICSDSLDICKGMEV 180
Query: 181 HGVVSKLGFDSDVYVGNTLLMLYGNCGVVSDARRVFDEMPERDVVSWNTILGLLSVNGDY 240
HGVV KLGFDSDVYVGNTLLMLYGNCG ++DA++VFDEM ERDVVSWNT++GLLSVNGDY
Sbjct: 181 HGVVFKLGFDSDVYVGNTLLMLYGNCGFLNDAKKVFDEMSERDVVSWNTVIGLLSVNGDY 240
Query: 241 REARNYYFWMTLRSGIQPNVVSVVILLPISAALEDEEMTRRIHCYTMKAGLDSQVTTCNA 300
REARNYYFWMTLRSGIQPN+VSV+ LLPISA LEDEEMTRRIHCY +K GLDS VT+CNA
Sbjct: 241 REARNYYFWMTLRSGIQPNLVSVISLLPISAGLEDEEMTRRIHCYIVKVGLDSLVTSCNA 300
Query: 301 LVDAYGKCGNVKASWQVFDEMVERNEVSWNAIINGLACKGHFCDALDVFRMMIDAEVKPN 360
LVDAY KCG+VKASWQVFDE++E+NEVSWN+IINGLA KGHF DALDVFRMMIDA KPN
Sbjct: 301 LVDAYWKCGSVKASWQVFDEIIEKNEVSWNSIINGLAFKGHFRDALDVFRMMIDAGTKPN 360
Query: 361 SVTISSILPVLVELEYFKAGKEIHGFSMRMGTETDIFIANSLIDMYAKSGHSTEASCIFH 420
SVTISSILPV VELE FKAGKEIHGFSMRMGTETD+FIANSLIDMYAKSGHSTEAS IFH
Sbjct: 361 SVTISSILPVFVELECFKAGKEIHGFSMRMGTETDLFIANSLIDMYAKSGHSTEASSIFH 420
Query: 421 NMDRRNVVSWNAMIANYALNGLALEAIRQVILMQETGENPNAVTFTNVLPACARLGFLGP 480
NMD RN+VSWNAMIANY LNG+ALEAIR VIL+QE+GE PNAVTFTNVLPACAR G LGP
Sbjct: 421 NMDGRNIVSWNAMIANYVLNGVALEAIRFVILLQESGERPNAVTFTNVLPACARSGHLGP 480
Query: 481 GKEIHAIGIRLGLTADMFVSNALTDMYAKCGCLHSARNVFNTSHKDEVSYNILILGYSET 540
GKEIHA+G+RLGLT+D+FV+NALTDMYAKCGC SARNVFNTSHKDEVSYNILI GYSET
Sbjct: 481 GKEIHAMGVRLGLTSDLFVTNALTDMYAKCGCFRSARNVFNTSHKDEVSYNILITGYSET 540
Query: 541 SDCLESLNLFSEMRLLGRKPDVVSFVGVISACANLAAVKQGKEIHGVALRNHFYSHLFVS 600
+DCLESLNLFSEMRLL +KPDVVSF+GVISACANLAAVKQGKEIHGVALRNH SHLFVS
Sbjct: 541 NDCLESLNLFSEMRLLRKKPDVVSFMGVISACANLAAVKQGKEIHGVALRNHLNSHLFVS 600
Query: 601 NSLLDFYTKCARIDLACKVFNQILFKDVASWNTMILGYGMIGELETAITMFEEMRGDTVQ 660
NSLLDFYTKC RIDLACK+FNQILFKDVASWNTMILGYGMIGELETAI MFE MR D VQ
Sbjct: 601 NSLLDFYTKCGRIDLACKIFNQILFKDVASWNTMILGYGMIGELETAINMFEAMRDDKVQ 660
Query: 661 YDLVSYIAVLSACSHGGLVEQGWKYFSEMLAQNLEPTQMHYACIVDLLGRAGFVEEAAEL 720
YDLVSYIAVLSACSHGGLVE+GW+Y SEMLAQ+LEPT+MHY C+VDLLGRAGFVEEAAEL
Sbjct: 661 YDLVSYIAVLSACSHGGLVERGWQYLSEMLAQHLEPTEMHYTCLVDLLGRAGFVEEAAEL 720
Query: 721 IRCLPIAPDANIWGALLGACRIYRNVELGCWAAEHLFEIKPQHCGYYILLSNMYAEAGRW 780
IR LPIAPD+NIWGALLGACRIY NVELGC AAE LFE+KPQHCGYYILL+NM+AE GRW
Sbjct: 721 IRRLPIAPDSNIWGALLGACRIYGNVELGCKAAEQLFELKPQHCGYYILLANMHAETGRW 780
Query: 781 DEVNRVRDLMKSRGAKKNPGCSWVQIHDQVHTFMAEERAEGFESGGWLAE 831
DEVNR+R+LMKSRGAKK+PGCSWVQIHDQ+H F+ ++RAEGFESGG LAE
Sbjct: 781 DEVNRIRELMKSRGAKKSPGCSWVQIHDQLHAFVVDDRAEGFESGGLLAE 809
BLAST of Moc02g02020 vs. ExPASy Swiss-Prot
Match:
Q9SN39 (Pentatricopeptide repeat-containing protein DOT4, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=DOT4 PE=2 SV=1)
HSP 1 Score: 490.0 bits (1260), Expect = 5.5e-137
Identity = 258/663 (38.91%), Postives = 395/663 (59.58%), Query Frame = 0
Query: 156 LDDHTFPFVLKLCSDSLDICKGMEVHGVVSKLGFDSDVYVGNTLLMLYGNCGVVSDARRV 215
+D T VL+LC+DS + G EV + GF D +G+ L ++Y NCG + +A RV
Sbjct: 92 IDPRTLCSVLQLCADSKSLKDGKEVDNFIRGNGFVIDSNLGSKLSLMYTNCGDLKEASRV 151
Query: 216 FDEMPERDVVSWNTILGLLSVNGDYREARNYYFWMTLRSGIQPNVVSVVILLPISAALED 275
FDE+ + WN ++ L+ +GD+ + + M + SG++ + + + ++L
Sbjct: 152 FDEVKIEKALFWNILMNELAKSGDFSGSIGLFKKM-MSSGVEMDSYTFSCVSKSFSSLRS 211
Query: 276 EEMTRRIHCYTMKAGLDSQVTTCNALVDAYGKCGNVKASWQVFDEMVERNEVSWNAIING 335
++H + +K+G + + N+LV Y K V ++ +VFDEM ER+ +SWN+IING
Sbjct: 212 VHGGEQLHGFILKSGFGERNSVGNSLVAFYLKNQRVDSARKVFDEMTERDVISWNSIING 271
Query: 336 LACKGHFCDALDVFRMMIDAEVKPNSVTISSILPVLVELEYFKAGKEIHGFSMRMGTETD 395
G L VF M+ + ++ + TI S+ + G+ +H ++ +
Sbjct: 272 YVSNGLAEKGLSVFVQMLVSGIEIDLATIVSVFAGCADSRLISLGRAVHSIGVKACFSRE 331
Query: 396 IFIANSLIDMYAKSGHSTEASCIFHNMDRRNVVSWNAMIANYALNGLALEAIRQVILMQE 455
N+L+DMY+K G A +F M R+VVS+ +MIA YA GLA EA++ M+E
Sbjct: 332 DRFCNTLLDMYSKCGDLDSAKAVFREMSDRSVVSYTSMIAGYAREGLAGEAVKLFEEMEE 391
Query: 456 TGENPNAVTFTNVLPACARLGFLGPGKEIHAIGIRLGLTADMFVSNALTDMYAKCGCLHS 515
G +P+ T T VL CAR L GK +H L D+FVSNAL DMYAKCG +
Sbjct: 392 EGISPDVYTVTAVLNCCARYRLLDEGKRVHEWIKENDLGFDIFVSNALMDMYAKCGSMQE 451
Query: 516 ARNVFNTSH-KDEVSYNILILGYSETSDCLESLNLFSEMRLLGR-KPDVVSFVGVISACA 575
A VF+ KD +S+N +I GYS+ E+L+LF+ + R PD + V+ ACA
Sbjct: 452 AELVFSEMRVKDIISWNTIIGGYSKNCYANEALSLFNLLLEEKRFSPDERTVACVLPACA 511
Query: 576 NLAAVKQGKEIHGVALRNHFYSHLFVSNSLLDFYTKCARIDLACKVFNQILFKDVASWNT 635
+L+A +G+EIHG +RN ++S V+NSL+D Y KC + LA +F+ I KD+ SW
Sbjct: 512 SLSAFDKGREIHGYIMRNGYFSDRHVANSLVDMYAKCGALLLAHMLFDDIASKDLVSWTV 571
Query: 636 MILGYGMIGELETAITMFEEMRGDTVQYDLVSYIAVLSACSHGGLVEQGWKYFSEMLAQ- 695
MI GYGM G + AI +F +MR ++ D +S++++L ACSH GLV++GW++F+ M +
Sbjct: 572 MIAGYGMHGFGKEAIALFNQMRQAGIEADEISFVSLLYACSHSGLVDEGWRFFNIMRHEC 631
Query: 696 NLEPTQMHYACIVDLLGRAGFVEEAAELIRCLPIAPDANIWGALLGACRIYRNVELGCWA 755
+EPT HYACIVD+L R G + +A I +PI PDA IWGALL CRI+ +V+L
Sbjct: 632 KIEPTVEHYACIVDMLARTGDLIKAYRFIENMPIPPDATIWGALLCGCRIHHDVKLAEKV 691
Query: 756 AEHLFEIKPQHCGYYILLSNMYAEAGRWDEVNRVRDLMKSRGAKKNPGCSWVQIHDQVHT 815
AE +FE++P++ GYY+L++N+YAEA +W++V R+R + RG +KNPGCSW++I +V+
Sbjct: 692 AEKVFELEPENTGYYVLMANIYAEAEKWEQVKRLRKRIGQRGLRKNPGCSWIEIKGRVNI 751
BLAST of Moc02g02020 vs. ExPASy Swiss-Prot
Match:
Q9SS60 (Pentatricopeptide repeat-containing protein At3g03580 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H23 PE=2 SV=1)
HSP 1 Score: 473.4 bits (1217), Expect = 5.3e-132
Identity = 265/753 (35.19%), Postives = 429/753 (56.97%), Query Frame = 0
Query: 64 SNAQSLRQTKQLHALCLLNGLLPHSVSLCASLILDYATFRDPESFRILFHQTVQNCRTAF 123
S++ +L + +++HAL + G L S LI Y+ FR+P S +F + V + +
Sbjct: 15 SSSSNLNELRRIHALVISLG-LDSSDFFSGKLIDKYSHFREPASSLSVFRR-VSPAKNVY 74
Query: 124 LWNTLIRAHSIAGNGMI-DGLQTYNRMVRSGVQLDDHTFPFVLKLCSDSLDICKGMEVHG 183
LWN++IRA S NG+ + L+ Y ++ S V D +TFP V+K C+ D G V+
Sbjct: 75 LWNSIIRAFS--KNGLFPEALEFYGKLRESKVSPDKYTFPSVIKACAGLFDAEMGDLVYE 134
Query: 184 VVSKLGFDSDVYVGNTLLMLYGNCGVVSDARRVFDEMPERDVVSWNTILGLLSVNGDYRE 243
+ +GF+SD++VGN L+ +Y G+++ AR+VFDEMP RD+VSWN+++ S +G Y E
Sbjct: 135 QILDMGFESDLFVGNALVDMYSRMGLLTRARQVFDEMPVRDLVSWNSLISGYSSHGYYEE 194
Query: 244 ARNYYFWMTLRSGIQPNVVSVVILLPISAALEDEEMTRRIHCYTMKAGLDSQVTTCNALV 303
A Y + S I P+ +V +LP L + + +H + +K+G++S V N LV
Sbjct: 195 ALEIYHELK-NSWIVPDSFTVSSVLPAFGNLLVVKQGQGLHGFALKSGVNSVVVVNNGLV 254
Query: 304 DAYGKCGNVKASWQVFDEMVERNEVSWNAIINGLACKGHFCDALDVFRMMIDAEVKPNSV 363
Y K + +VFDEM R+ VS+N +I G +++ +F +D + KP+ +
Sbjct: 255 AMYLKFRRPTDARRVFDEMDVRDSVSYNTMICGYLKLEMVEESVRMFLENLD-QFKPDLL 314
Query: 364 TISSILPVLVELEYFKAGKEIHGFSMRMGTETDIFIANSLIDMYAKSGHSTEASCIFHNM 423
T+SS+L L K I+ + ++ G + + N LID+YAK G A +F++M
Sbjct: 315 TVSSVLRACGHLRDLSLAKYIYNYMLKAGFVLESTVRNILIDVYAKCGDMITARDVFNSM 374
Query: 424 DRRNVVSWNAMIANYALNGLALEAIRQVILMQETGENPNAVTFTNVLPACARLGFLGPGK 483
+ ++ VSWN++I+ Y +G +EA++ +M E + +T+ ++ RL L GK
Sbjct: 375 ECKDTVSWNSIISGYIQSGDLMEAMKLFKMMMIMEEQADHITYLMLISVSTRLADLKFGK 434
Query: 484 EIHAIGIRLGLTADMFVSNALTDMYAKCGCLHSARNVFNT-SHKDEVSYNILILGYSETS 543
+H+ GI+ G+ D+ VSNAL DMYAKCG + + +F++ D V++N +I
Sbjct: 435 GLHSNGIKSGICIDLSVSNALIDMYAKCGEVGDSLKIFSSMGTGDTVTWNTVISACVRFG 494
Query: 544 DCLESLNLFSEMRLLGRKPDVVSFVGVISACANLAAVKQGKEIHGVALRNHFYSHLFVSN 603
D L + ++MR PD+ +F+ + CA+LAA + GKEIH LR + S L + N
Sbjct: 495 DFATGLQVTTQMRKSEVVPDMATFLVTLPMCASLAAKRLGKEIHCCLLRFGYESELQIGN 554
Query: 604 SLLDFYTKCARIDLACKVFNQILFKDVASWNTMILGYGMIGELETAITMFEEMRGDTVQY 663
+L++ Y+KC ++ + +VF ++ +DV +W MI YGM GE E A+ F +M +
Sbjct: 555 ALIEMYSKCGCLENSSRVFERMSRRDVVTWTGMIYAYGMYGEGEKALETFADMEKSGIVP 614
Query: 664 DLVSYIAVLSACSHGGLVEQGWKYFSEMLAQ-NLEPTQMHYACIVDLLGRAGFVEEAAEL 723
D V +IA++ ACSH GLV++G F +M ++P HYAC+VDLL R+ + +A E
Sbjct: 615 DSVVFIAIIYACSHSGLVDEGLACFEKMKTHYKIDPMIEHYACVVDLLSRSQKISKAEEF 674
Query: 724 IRCLPIAPDANIWGALLGACRIYRNVELGCWAAEHLFEIKPQHCGYYILLSNMYAEAGRW 783
I+ +PI PDA+IW ++L ACR ++E + + E+ P GY IL SN YA +W
Sbjct: 675 IQAMPIKPDASIWASVLRACRTSGDMETAERVSRRIIELNPDDPGYSILASNAYAALRKW 734
Query: 784 DEVNRVRDLMKSRGAKKNPGCSWVQIHDQVHTF 814
D+V+ +R +K + KNPG SW+++ VH F
Sbjct: 735 DKVSLIRKSLKDKHITKNPGYSWIEVGKNVHVF 761
BLAST of Moc02g02020 vs. ExPASy Swiss-Prot
Match:
Q0WN60 (Pentatricopeptide repeat-containing protein At1g18485 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H8 PE=2 SV=2)
HSP 1 Score: 471.9 bits (1213), Expect = 1.5e-131
Identity = 263/776 (33.89%), Postives = 437/776 (56.31%), Query Frame = 0
Query: 59 LLTLCSNAQSLRQTKQLHALCLLNGLLPHSVSLCASLILDYATFRDPESFRILFHQTVQN 118
LL + + +++H L + L + LC +I YA P+ R +F
Sbjct: 90 LLQASGKRKDIEMGRKIHQLVSGSTRLRNDDVLCTRIITMYAMCGSPDDSRFVF--DALR 149
Query: 119 CRTAFLWNTLIRAHSIAGNGMIDG-LQTYNRMVRSGVQLDDH-TFPFVLKLCSDSLDICK 178
+ F WN +I ++S N + D L+T+ M+ + L DH T+P V+K C+ D+
Sbjct: 150 SKNLFQWNAVISSYS--RNELYDEVLETFIEMISTTDLLPDHFTYPCVIKACAGMSDVGI 209
Query: 179 GMEVHGVVSKLGFDSDVYVGNTLLMLYGNCGVVSDARRVFDEMPERDVVSWNTILGLLSV 238
G+ VHG+V K G DV+VGN L+ YG G V+DA ++FD MPER++VSWN+++ + S
Sbjct: 210 GLAVHGLVVKTGLVEDVFVGNALVSFYGTHGFVTDALQLFDIMPERNLVSWNSMIRVFSD 269
Query: 239 NGDYREARNYYFWMTLRSG---IQPNVVSVVILLPISAALEDEEMTRRIHCYTMKAGLDS 298
NG E+ M +G P+V ++V +LP+ A + + + +H + +K LD
Sbjct: 270 NGFSEESFLLLGEMMEENGDGAFMPDVATLVTVLPVCAREREIGLGKGVHGWAVKLRLDK 329
Query: 299 QVTTCNALVDAYGKCGNVKASWQVFDEMVERNEVSWNAIINGLACKGHFCDALDVFRMMI 358
++ NAL+D Y KCG + + +F +N VSWN ++ G + +G DV R M+
Sbjct: 330 ELVLNNALMDMYSKCGCITNAQMIFKMNNNKNVVSWNTMVGGFSAEGDTHGTFDVLRQML 389
Query: 359 --DAEVKPNSVTISSILPVLVELEYFKAGKEIHGFSMRMGTETDIFIANSLIDMYAKSGH 418
+VK + VTI + +PV + + KE+H +S++ + +AN+ + YAK G
Sbjct: 390 AGGEDVKADEVTILNAVPVCFHESFLPSLKELHCYSLKQEFVYNELVANAFVASYAKCGS 449
Query: 419 STEASCIFHNMDRRNVVSWNAMIANYALNGLALEAIRQVILMQETGENPNAVTFTNVLPA 478
+ A +FH + + V SWNA+I +A + ++ + M+ +G P++ T ++L A
Sbjct: 450 LSYAQRVFHGIRSKTVNSWNALIGGHAQSNDPRLSLDAHLQMKISGLLPDSFTVCSLLSA 509
Query: 479 CARLGFLGPGKEIHAIGIRLGLTADMFVSNALTDMYAKCGCLHSARNVFNT-SHKDEVSY 538
C++L L GKE+H IR L D+FV ++ +Y CG L + + +F+ K VS+
Sbjct: 510 CSKLKSLRLGKEVHGFIIRNWLERDLFVYLSVLSLYIHCGELCTVQALFDAMEDKSLVSW 569
Query: 539 NILILGYSETSDCLESLNLFSEMRLLGRKPDVVSFVGVISACANLAAVKQGKEIHGVALR 598
N +I GY + +L +F +M L G + +S + V AC+ L +++ G+E H AL+
Sbjct: 570 NTVITGYLQNGFPDRALGVFRQMVLYGIQLCGISMMPVFGACSLLPSLRLGREAHAYALK 629
Query: 599 NHFYSHLFVSNSLLDFYTKCARIDLACKVFNQILFKDVASWNTMILGYGMIGELETAITM 658
+ F++ SL+D Y K I + KVFN + K ASWN MI+GYG+ G + AI +
Sbjct: 630 HLLEDDAFIACSLIDMYAKNGSITQSSKVFNGLKEKSTASWNAMIMGYGIHGLAKEAIKL 689
Query: 659 FEEMRGDTVQYDLVSYIAVLSACSHGGLVEQGWKYFSEMLAQ-NLEPTQMHYACIVDLLG 718
FEEM+ D ++++ VL+AC+H GL+ +G +Y +M + L+P HYAC++D+LG
Sbjct: 690 FEEMQRTGHNPDDLTFLGVLTACNHSGLIHEGLRYLDQMKSSFGLKPNLKHYACVIDMLG 749
Query: 719 RAGFVEEAAELI-RCLPIAPDANIWGALLGACRIYRNVELGCWAAEHLFEIKPQHCGYYI 778
RAG +++A ++ + D IW +LL +CRI++N+E+G A LFE++P+ Y+
Sbjct: 750 RAGQLDKALRVVAEEMSEEADVGIWKSLLSSCRIHQNLEMGEKVAAKLFELEPEKPENYV 809
Query: 779 LLSNMYAEAGRWDEVNRVRDLMKSRGAKKNPGCSWVQIHDQVHTFMAEER-AEGFE 824
LLSN+YA G+W++V +VR M +K+ GCSW++++ +V +F+ ER +GFE
Sbjct: 810 LLSNLYAGLGKWEDVRKVRQRMNEMSLRKDAGCSWIELNRKVFSFVVGERFLDGFE 861
BLAST of Moc02g02020 vs. ExPASy Swiss-Prot
Match:
Q9M9E2 (Pentatricopeptide repeat-containing protein At1g15510, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H73 PE=1 SV=1)
HSP 1 Score: 468.0 bits (1203), Expect = 2.2e-130
Identity = 237/689 (34.40%), Postives = 389/689 (56.46%), Query Frame = 0
Query: 132 HSIAGNGMI-DGLQTYNRMVRSGVQLDDHTFPFVLKLCSDSLDICKGMEVHGVVSKLGFD 191
H + NG + + ++ N M V +D+ F +++LC +G +V+ +
Sbjct: 67 HGLCANGKLEEAMKLLNSMQELRVAVDEDVFVALVRLCEWKRAQEEGSKVYSIALSSMSS 126
Query: 192 SDVYVGNTLLMLYGNCGVVSDARRVFDEMPERDVVSWNTILGLLSVNGDYREARNYYFWM 251
V +GN L ++ G + DA VF +M ER++ SWN ++G + G + EA Y M
Sbjct: 127 LGVELGNAFLAMFVRFGNLVDAWYVFGKMSERNLFSWNVLVGGYAKQGYFDEAMCLYHRM 186
Query: 252 TLRSGIQPNVVSVVILLPISAALEDEEMTRRIHCYTMKAGLDSQVTTCNALVDAYGKCGN 311
G++P+V + +L + D + +H + ++ G + + NAL+ Y KCG+
Sbjct: 187 LWVGGVKPDVYTFPCVLRTCGGIPDLARGKEVHVHVVRYGYELDIDVVNALITMYVKCGD 246
Query: 312 VKASWQVFDEMVERNEVSWNAIINGLACKGHFCDALDVFRMMIDAEVKPNSVTISSILPV 371
VK++ +FD M R+ +SWNA+I+G G + L++F M V P+ +T++S++
Sbjct: 247 VKSARLLFDRMPRRDIISWNAMISGYFENGMCHEGLELFFAMRGLSVDPDLMTLTSVISA 306
Query: 372 LVELEYFKAGKEIHGFSMRMGTETDIFIANSLIDMYAKSGHSTEASCIFHNMDRRNVVSW 431
L + G++IH + + G DI + NSL MY +G EA +F M+R+++VSW
Sbjct: 307 CELLGDRRLGRDIHAYVITTGFAVDISVCNSLTQMYLNAGSWREAEKLFSRMERKDIVSW 366
Query: 432 NAMIANYALNGLALEAIRQVILMQETGENPNAVTFTNVLPACARLGFLGPGKEIHAIGIR 491
MI+ Y N L +AI +M + P+ +T VL ACA LG L G E+H + I+
Sbjct: 367 TTMISGYEYNFLPDKAIDTYRMMDQDSVKPDEITVAAVLSACATLGDLDTGVELHKLAIK 426
Query: 492 LGLTADMFVSNALTDMYAKCGCLHSARNVF-NTSHKDEVSYNILILGYSETSDCLESLNL 551
L + + V+N L +MY+KC C+ A ++F N K+ +S+ +I G + C E+L
Sbjct: 427 ARLISYVIVANNLINMYSKCKCIDKALDIFHNIPRKNVISWTSIIAGLRLNNRCFEALIF 486
Query: 552 FSEMRLLGRKPDVVSFVGVISACANLAAVKQGKEIHGVALRNHFYSHLFVSNSLLDFYTK 611
+M++ +P+ ++ ++ACA + A+ GKEIH LR F+ N+LLD Y +
Sbjct: 487 LRQMKMT-LQPNAITLTAALAACARIGALMCGKEIHAHVLRTGVGLDDFLPNALLDMYVR 546
Query: 612 CARIDLACKVFNQILFKDVASWNTMILGYGMIGELETAITMFEEMRGDTVQYDLVSYIAV 671
C R++ A FN KDV SWN ++ GY G+ + +F+ M V+ D +++I++
Sbjct: 547 CGRMNTAWSQFNS-QKKDVTSWNILLTGYSERGQGSMVVELFDRMVKSRVRPDEITFISL 606
Query: 672 LSACSHGGLVEQGWKYFSEMLAQNLEPTQMHYACIVDLLGRAGFVEEAAELIRCLPIAPD 731
L CS +V QG YFS+M + P HYAC+VDLLGRAG ++EA + I+ +P+ PD
Sbjct: 607 LCGCSKSQMVRQGLMYFSKMEDYGVTPNLKHYACVVDLLGRAGELQEAHKFIQKMPVTPD 666
Query: 732 ANIWGALLGACRIYRNVELGCWAAEHLFEIKPQHCGYYILLSNMYAEAGRWDEVNRVRDL 791
+WGALL ACRI+ ++LG +A+H+FE+ + GYYILL N+YA+ G+W EV +VR +
Sbjct: 667 PAVWGALLNACRIHHKIDLGELSAQHIFELDKKSVGYYILLCNLYADCGKWREVAKVRRM 726
Query: 792 MKSRGAKKNPGCSWVQIHDQVHTFMAEER 819
MK G + GCSWV++ +VH F+++++
Sbjct: 727 MKENGLTVDAGCSWVEVKGKVHAFLSDDK 753
BLAST of Moc02g02020 vs. ExPASy Swiss-Prot
Match:
Q9C507 (Putative pentatricopeptide repeat-containing protein At1g69350, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-E66 PE=3 SV=1)
HSP 1 Score: 453.4 bits (1165), Expect = 5.7e-126
Identity = 252/762 (33.07%), Postives = 429/762 (56.30%), Query Frame = 0
Query: 60 LTLCSNAQSLRQTKQLHALCLLNGLLPHSVSLCASLILDYATFRDPESFRILFHQTVQNC 119
+ L + SLR QLHA L+ G L LI YA P+S R++F
Sbjct: 5 MPLFRSCSSLRLVSQLHAHLLVTGRLRRDPLPVTKLIESYAFMGSPDSSRLVFEAFPY-- 64
Query: 120 RTAFLWNTLIRAHSIAGNGMIDGLQTYNRMVRSGVQLDDHTFPFVLKLCSDSLD-ICKGM 179
+F++ LI+ ++ + + + Y+R+V Q+ FP VL+ C+ S + + G
Sbjct: 65 PDSFMYGVLIKC-NVWCHLLDAAIDLYHRLVSETTQISKFVFPSVLRACAGSREHLSVGG 124
Query: 180 EVHGVVSKLGFDSDVYVGNTLLMLYGNCGVVSDARRVFDEMPERDVVSWNTILGLLSVNG 239
+VHG + K G D D + +LL +YG G +SDA +VFD MP RD+V+W+T++ NG
Sbjct: 125 KVHGRIIKGGVDDDAVIETSLLCMYGQTGNLSDAEKVFDGMPVRDLVAWSTLVSSCLENG 184
Query: 240 DYREARNYYFWMTLRSGIQPNVVSVVILLPISAALEDEEMTRRIHCYTMKAGLDSQVTTC 299
+ +A + M + G++P+ V+++ ++ A L + R +H + D T C
Sbjct: 185 EVVKALRMFKCM-VDDGVEPDAVTMISVVEGCAELGCLRIARSVHGQITRKMFDLDETLC 244
Query: 300 NALVDAYGKCGNVKASWQVFDEMVERNEVSWNAIINGLACKGHFCD-ALDVFRMMIDAEV 359
N+L+ Y KCG++ +S ++F+++ ++N VSW A+I+ +G F + AL F MI + +
Sbjct: 245 NSLLTMYSKCGDLLSSERIFEKIAKKNAVSWTAMISSYN-RGEFSEKALRSFSEMIKSGI 304
Query: 360 KPNSVTISSILPVLVELEYFKAGKEIHGFSMRMGTETDI-FIANSLIDMYAKSGHSTEAS 419
+PN VT+ S+L + + GK +HGF++R + + ++ +L+++YA+ G ++
Sbjct: 305 EPNLVTLYSVLSSCGLIGLIREGKSVHGFAVRRELDPNYESLSLALVELYAECGKLSDCE 364
Query: 420 CIFHNMDRRNVVSWNAMIANYALNGLALEAIRQVILMQETGENPNAVTFTNVLPACARLG 479
+ + RN+V+WN++I+ YA G+ ++A+ M P+A T + + AC G
Sbjct: 365 TVLRVVSDRNIVAWNSLISLYAHRGMVIQALGLFRQMVTQRIKPDAFTLASSISACENAG 424
Query: 480 FLGPGKEIHAIGIRLGLTADMFVSNALTDMYAKCGCLHSARNVFN-TSHKDEVSYNILIL 539
+ GK+IH IR + +D FV N+L DMY+K G + SA VFN H+ V++N ++
Sbjct: 425 LVPLGKQIHGHVIRTDV-SDEFVQNSLIDMYSKSGSVDSASTVFNQIKHRSVVTWNSMLC 484
Query: 540 GYSETSDCLESLNLFSEMRLLGRKPDVVSFVGVISACANLAAVKQGKEIHGVALRNHFYS 599
G+S+ + +E+++LF M + + V+F+ VI AC+++ ++++GK +H + +
Sbjct: 485 GFSQNGNSVEAISLFDYMYHSYLEMNEVTFLAVIQACSSIGSLEKGKWVHHKLIISGL-K 544
Query: 600 HLFVSNSLLDFYTKCARIDLACKVFNQILFKDVASWNTMILGYGMIGELETAITMFEEMR 659
LF +L+D Y KC ++ A VF + + + SW++MI YGM G + +AI+ F +M
Sbjct: 545 DLFTDTALIDMYAKCGDLNAAETVFRAMSSRSIVSWSSMINAYGMHGRIGSAISTFNQMV 604
Query: 660 GDTVQYDLVSYIAVLSACSHGGLVEQGWKYFSEMLAQNLEPTQMHYACIVDLLGRAGFVE 719
+ + V ++ VLSAC H G VE+G YF+ M + + P H+AC +DLL R+G ++
Sbjct: 605 ESGTKPNEVVFMNVLSACGHSGSVEEGKYYFNLMKSFGVSPNSEHFACFIDLLSRSGDLK 664
Query: 720 EAAELIRCLPIAPDANIWGALLGACRIYRNVELGCWAAEHLFEIKPQHCGYYILLSNMYA 779
EA I+ +P DA++WG+L+ CRI++ +++ L +I GYY LLSN+YA
Sbjct: 665 EAYRTIKEMPFLADASVWGSLVNGCRIHQKMDIIKAIKNDLSDIVTDDTGYYTLLSNIYA 724
Query: 780 EAGRWDEVNRVRDLMKSRGAKKNPGCSWVQIHDQVHTFMAEE 818
E G W+E R+R MKS KK PG S ++I +V F A E
Sbjct: 725 EEGEWEEFRRLRSAMKSSNLKKVPGYSAIEIDQKVFRFGAGE 759
BLAST of Moc02g02020 vs. ExPASy TrEMBL
Match:
A0A6J1DHT8 (pentatricopeptide repeat-containing protein At4g14170-like OS=Momordica charantia OX=3673 GN=LOC111020621 PE=4 SV=1)
HSP 1 Score: 1705.3 bits (4415), Expect = 0.0e+00
Identity = 832/832 (100.00%), Postives = 832/832 (100.00%), Query Frame = 0
Query: 1 MLHSCARSTGLHFASPFQFSFQTTRFQYRSFLHPVLGSASSSPLPLPSTEPVSSVHVDLL 60
MLHSCARSTGLHFASPFQFSFQTTRFQYRSFLHPVLGSASSSPLPLPSTEPVSSVHVDLL
Sbjct: 1 MLHSCARSTGLHFASPFQFSFQTTRFQYRSFLHPVLGSASSSPLPLPSTEPVSSVHVDLL 60
Query: 61 TLCSNAQSLRQTKQLHALCLLNGLLPHSVSLCASLILDYATFRDPESFRILFHQTVQNCR 120
TLCSNAQSLRQTKQLHALCLLNGLLPHSVSLCASLILDYATFRDPESFRILFHQTVQNCR
Sbjct: 61 TLCSNAQSLRQTKQLHALCLLNGLLPHSVSLCASLILDYATFRDPESFRILFHQTVQNCR 120
Query: 121 TAFLWNTLIRAHSIAGNGMIDGLQTYNRMVRSGVQLDDHTFPFVLKLCSDSLDICKGMEV 180
TAFLWNTLIRAHSIAGNGMIDGLQTYNRMVRSGVQLDDHTFPFVLKLCSDSLDICKGMEV
Sbjct: 121 TAFLWNTLIRAHSIAGNGMIDGLQTYNRMVRSGVQLDDHTFPFVLKLCSDSLDICKGMEV 180
Query: 181 HGVVSKLGFDSDVYVGNTLLMLYGNCGVVSDARRVFDEMPERDVVSWNTILGLLSVNGDY 240
HGVVSKLGFDSDVYVGNTLLMLYGNCGVVSDARRVFDEMPERDVVSWNTILGLLSVNGDY
Sbjct: 181 HGVVSKLGFDSDVYVGNTLLMLYGNCGVVSDARRVFDEMPERDVVSWNTILGLLSVNGDY 240
Query: 241 REARNYYFWMTLRSGIQPNVVSVVILLPISAALEDEEMTRRIHCYTMKAGLDSQVTTCNA 300
REARNYYFWMTLRSGIQPNVVSVVILLPISAALEDEEMTRRIHCYTMKAGLDSQVTTCNA
Sbjct: 241 REARNYYFWMTLRSGIQPNVVSVVILLPISAALEDEEMTRRIHCYTMKAGLDSQVTTCNA 300
Query: 301 LVDAYGKCGNVKASWQVFDEMVERNEVSWNAIINGLACKGHFCDALDVFRMMIDAEVKPN 360
LVDAYGKCGNVKASWQVFDEMVERNEVSWNAIINGLACKGHFCDALDVFRMMIDAEVKPN
Sbjct: 301 LVDAYGKCGNVKASWQVFDEMVERNEVSWNAIINGLACKGHFCDALDVFRMMIDAEVKPN 360
Query: 361 SVTISSILPVLVELEYFKAGKEIHGFSMRMGTETDIFIANSLIDMYAKSGHSTEASCIFH 420
SVTISSILPVLVELEYFKAGKEIHGFSMRMGTETDIFIANSLIDMYAKSGHSTEASCIFH
Sbjct: 361 SVTISSILPVLVELEYFKAGKEIHGFSMRMGTETDIFIANSLIDMYAKSGHSTEASCIFH 420
Query: 421 NMDRRNVVSWNAMIANYALNGLALEAIRQVILMQETGENPNAVTFTNVLPACARLGFLGP 480
NMDRRNVVSWNAMIANYALNGLALEAIRQVILMQETGENPNAVTFTNVLPACARLGFLGP
Sbjct: 421 NMDRRNVVSWNAMIANYALNGLALEAIRQVILMQETGENPNAVTFTNVLPACARLGFLGP 480
Query: 481 GKEIHAIGIRLGLTADMFVSNALTDMYAKCGCLHSARNVFNTSHKDEVSYNILILGYSET 540
GKEIHAIGIRLGLTADMFVSNALTDMYAKCGCLHSARNVFNTSHKDEVSYNILILGYSET
Sbjct: 481 GKEIHAIGIRLGLTADMFVSNALTDMYAKCGCLHSARNVFNTSHKDEVSYNILILGYSET 540
Query: 541 SDCLESLNLFSEMRLLGRKPDVVSFVGVISACANLAAVKQGKEIHGVALRNHFYSHLFVS 600
SDCLESLNLFSEMRLLGRKPDVVSFVGVISACANLAAVKQGKEIHGVALRNHFYSHLFVS
Sbjct: 541 SDCLESLNLFSEMRLLGRKPDVVSFVGVISACANLAAVKQGKEIHGVALRNHFYSHLFVS 600
Query: 601 NSLLDFYTKCARIDLACKVFNQILFKDVASWNTMILGYGMIGELETAITMFEEMRGDTVQ 660
NSLLDFYTKCARIDLACKVFNQILFKDVASWNTMILGYGMIGELETAITMFEEMRGDTVQ
Sbjct: 601 NSLLDFYTKCARIDLACKVFNQILFKDVASWNTMILGYGMIGELETAITMFEEMRGDTVQ 660
Query: 661 YDLVSYIAVLSACSHGGLVEQGWKYFSEMLAQNLEPTQMHYACIVDLLGRAGFVEEAAEL 720
YDLVSYIAVLSACSHGGLVEQGWKYFSEMLAQNLEPTQMHYACIVDLLGRAGFVEEAAEL
Sbjct: 661 YDLVSYIAVLSACSHGGLVEQGWKYFSEMLAQNLEPTQMHYACIVDLLGRAGFVEEAAEL 720
Query: 721 IRCLPIAPDANIWGALLGACRIYRNVELGCWAAEHLFEIKPQHCGYYILLSNMYAEAGRW 780
IRCLPIAPDANIWGALLGACRIYRNVELGCWAAEHLFEIKPQHCGYYILLSNMYAEAGRW
Sbjct: 721 IRCLPIAPDANIWGALLGACRIYRNVELGCWAAEHLFEIKPQHCGYYILLSNMYAEAGRW 780
Query: 781 DEVNRVRDLMKSRGAKKNPGCSWVQIHDQVHTFMAEERAEGFESGGWLAESF 833
DEVNRVRDLMKSRGAKKNPGCSWVQIHDQVHTFMAEERAEGFESGGWLAESF
Sbjct: 781 DEVNRVRDLMKSRGAKKNPGCSWVQIHDQVHTFMAEERAEGFESGGWLAESF 832
BLAST of Moc02g02020 vs. ExPASy TrEMBL
Match:
A0A6J1JE86 (pentatricopeptide repeat-containing protein At4g14170-like isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111483644 PE=4 SV=1)
HSP 1 Score: 1423.7 bits (3684), Expect = 0.0e+00
Identity = 687/830 (82.77%), Postives = 752/830 (90.60%), Query Frame = 0
Query: 1 MLHSCARSTGLHFASPFQFSFQTTRFQYRSFLHPVLGSASSSPLPLPSTEPVSSVHVDLL 60
ML C RS HFA Q RFQ+R+++ STEP SSVH++LL
Sbjct: 1 MLQFCIRSIRFHFA-------QIARFQFRNYVR--------------STEPNSSVHINLL 60
Query: 61 TLCSNAQSLRQTKQLHALCLLNGLLPHSVSLCASLILDYATFRDPESFRILFHQTVQNCR 120
TLC N+QSLRQTK++HA+CLLNGLLPHSVSLCASLIL+YA F+ PESF LFHQTVQNCR
Sbjct: 61 TLCFNSQSLRQTKEVHAICLLNGLLPHSVSLCASLILNYAKFQHPESFCTLFHQTVQNCR 120
Query: 121 TAFLWNTLIRAHSIAGNGMIDGLQTYNRMVRSGVQLDDHTFPFVLKLCSDSLDICKGMEV 180
T FLWNTLIRAHSIAGNG +DGL+TYNRMVR GVQLDDHTFPFVLK+CSDSLDICKGMEV
Sbjct: 121 TTFLWNTLIRAHSIAGNGTLDGLETYNRMVRFGVQLDDHTFPFVLKICSDSLDICKGMEV 180
Query: 181 HGVVSKLGFDSDVYVGNTLLMLYGNCGVVSDARRVFDEMPERDVVSWNTILGLLSVNGDY 240
HGVV KLGFDSDVYVGNTLLMLYGNCG ++ A++VFDEM ERDVVSWNT++GLLSVNGDY
Sbjct: 181 HGVVFKLGFDSDVYVGNTLLMLYGNCGFLNGAKKVFDEMSERDVVSWNTVIGLLSVNGDY 240
Query: 241 REARNYYFWMTLRSGIQPNVVSVVILLPISAALEDEEMTRRIHCYTMKAGLDSQVTTCNA 300
REARNYYFWMTLRSGIQPN+VSV+ LLPISA LEDEEMTRRIHCY +K GLDS VT+CNA
Sbjct: 241 REARNYYFWMTLRSGIQPNLVSVISLLPISAGLEDEEMTRRIHCYIVKVGLDSLVTSCNA 300
Query: 301 LVDAYGKCGNVKASWQVFDEMVERNEVSWNAIINGLACKGHFCDALDVFRMMIDAEVKPN 360
LVDAYGKCG+VKASWQVFDE++E+NEVSWN+IINGLA KGHF DAL+VFRMMIDA KPN
Sbjct: 301 LVDAYGKCGSVKASWQVFDEIIEKNEVSWNSIINGLAFKGHFWDALEVFRMMIDAGTKPN 360
Query: 361 SVTISSILPVLVELEYFKAGKEIHGFSMRMGTETDIFIANSLIDMYAKSGHSTEASCIFH 420
SVTISSILPV VELE FKAGKEIHGFSMRMGTETD+FIANSLIDMYAKSGH TEAS IFH
Sbjct: 361 SVTISSILPVFVELECFKAGKEIHGFSMRMGTETDLFIANSLIDMYAKSGHLTEASSIFH 420
Query: 421 NMDRRNVVSWNAMIANYALNGLALEAIRQVILMQETGENPNAVTFTNVLPACARLGFLGP 480
NMD RN+VSWNAMIANYALNG+ALEAIR VIL+QE+GE PNAVTFTNVLPACARLG LGP
Sbjct: 421 NMDGRNIVSWNAMIANYALNGVALEAIRFVILLQESGERPNAVTFTNVLPACARLGHLGP 480
Query: 481 GKEIHAIGIRLGLTADMFVSNALTDMYAKCGCLHSARNVFNTSHKDEVSYNILILGYSET 540
GKEIHA+G+RLGLT+D+FV+NALTDMYAKCGC SARNVFNTSHKDEVSYNILI GYSET
Sbjct: 481 GKEIHAMGVRLGLTSDLFVTNALTDMYAKCGCFRSARNVFNTSHKDEVSYNILITGYSET 540
Query: 541 SDCLESLNLFSEMRLLGRKPDVVSFVGVISACANLAAVKQGKEIHGVALRNHFYSHLFVS 600
+DCLESLNLFSEMRLLG+KPDVVSF+GV+SACANLAAVKQGKEIHGVALRNH SHLFVS
Sbjct: 541 NDCLESLNLFSEMRLLGKKPDVVSFMGVLSACANLAAVKQGKEIHGVALRNHLNSHLFVS 600
Query: 601 NSLLDFYTKCARIDLACKVFNQILFKDVASWNTMILGYGMIGELETAITMFEEMRGDTVQ 660
NSLLDFYTKC RIDLACK+FNQILFKDVASWNTMILGYGMIGELETAI MFE MR D VQ
Sbjct: 601 NSLLDFYTKCGRIDLACKIFNQILFKDVASWNTMILGYGMIGELETAINMFEAMRDDKVQ 660
Query: 661 YDLVSYIAVLSACSHGGLVEQGWKYFSEMLAQNLEPTQMHYACIVDLLGRAGFVEEAAEL 720
YD+VSYIAVLSACSHGGLVE+G +YFSEMLAQ+LEPT+MHY C+VDLLGRAGFVEEAAEL
Sbjct: 661 YDVVSYIAVLSACSHGGLVERGCQYFSEMLAQHLEPTEMHYTCLVDLLGRAGFVEEAAEL 720
Query: 721 IRCLPIAPDANIWGALLGACRIYRNVELGCWAAEHLFEIKPQHCGYYILLSNMYAEAGRW 780
IR LPIAPD+NIWGALLGACRIY N++LGC AAEHLFE+KPQHCGYYILLSNMYAE GRW
Sbjct: 721 IRRLPIAPDSNIWGALLGACRIYGNIDLGCKAAEHLFELKPQHCGYYILLSNMYAETGRW 780
Query: 781 DEVNRVRDLMKSRGAKKNPGCSWVQIHDQVHTFMAEERAEGFESGGWLAE 831
D+VNR+R+LMKSRGAKK+PGCSWVQIHDQ+H F+ ++RAEGFESGG+LAE
Sbjct: 781 DDVNRIRELMKSRGAKKSPGCSWVQIHDQLHAFVVDDRAEGFESGGFLAE 809
BLAST of Moc02g02020 vs. ExPASy TrEMBL
Match:
A0A6J1FV31 (pentatricopeptide repeat-containing protein At4g14170-like isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111448704 PE=4 SV=1)
HSP 1 Score: 1421.4 bits (3678), Expect = 0.0e+00
Identity = 690/830 (83.13%), Postives = 748/830 (90.12%), Query Frame = 0
Query: 1 MLHSCARSTGLHFASPFQFSFQTTRFQYRSFLHPVLGSASSSPLPLPSTEPVSSVHVDLL 60
ML C RS HFA Q RFQ+R+F+ TEP SSVH++LL
Sbjct: 1 MLQFCIRSFRFHFA-------QIARFQFRNFVR--------------RTEPNSSVHINLL 60
Query: 61 TLCSNAQSLRQTKQLHALCLLNGLLPHSVSLCASLILDYATFRDPESFRILFHQTVQNCR 120
TLC NAQSLRQTK++HA+CLLNGLLPHSVSLCASLIL+YA F+ PESF LFHQTVQNCR
Sbjct: 61 TLCFNAQSLRQTKEVHAICLLNGLLPHSVSLCASLILNYAKFQHPESFCTLFHQTVQNCR 120
Query: 121 TAFLWNTLIRAHSIAGNGMIDGLQTYNRMVRSGVQLDDHTFPFVLKLCSDSLDICKGMEV 180
TAFLWNTLIRAHSIAGNG +DGL+TYNRMVR GVQLDDHTFPFVLK+CSDSLDICKGMEV
Sbjct: 121 TAFLWNTLIRAHSIAGNGTLDGLETYNRMVRFGVQLDDHTFPFVLKICSDSLDICKGMEV 180
Query: 181 HGVVSKLGFDSDVYVGNTLLMLYGNCGVVSDARRVFDEMPERDVVSWNTILGLLSVNGDY 240
HGVV KLGFDSDVYVGNTLLMLYGNCG ++DA++VFDEM ERDVVSWNT++GLLSVNGDY
Sbjct: 181 HGVVFKLGFDSDVYVGNTLLMLYGNCGFLNDAKKVFDEMSERDVVSWNTVIGLLSVNGDY 240
Query: 241 REARNYYFWMTLRSGIQPNVVSVVILLPISAALEDEEMTRRIHCYTMKAGLDSQVTTCNA 300
REARNYYFWMTLRSGIQPN+VSV+ LLPISA LEDEEMTRRIHCY +K GLDS VT+CNA
Sbjct: 241 REARNYYFWMTLRSGIQPNLVSVISLLPISAGLEDEEMTRRIHCYIVKVGLDSLVTSCNA 300
Query: 301 LVDAYGKCGNVKASWQVFDEMVERNEVSWNAIINGLACKGHFCDALDVFRMMIDAEVKPN 360
LVDAY KCG+VKASWQVFDE++E+NEVSWN+IINGLA KGHF DALDVFRMMIDA KPN
Sbjct: 301 LVDAYWKCGSVKASWQVFDEIIEKNEVSWNSIINGLAFKGHFRDALDVFRMMIDAGTKPN 360
Query: 361 SVTISSILPVLVELEYFKAGKEIHGFSMRMGTETDIFIANSLIDMYAKSGHSTEASCIFH 420
SVTISSILPV VELE FKAGKEIHGFSMRMGTETD+FIANSLIDMYAKSGHSTEAS IFH
Sbjct: 361 SVTISSILPVFVELECFKAGKEIHGFSMRMGTETDLFIANSLIDMYAKSGHSTEASSIFH 420
Query: 421 NMDRRNVVSWNAMIANYALNGLALEAIRQVILMQETGENPNAVTFTNVLPACARLGFLGP 480
NMD RN+VSWNAMIANY LNG+ALEAIR VIL+QE+GE PNAVTFTNVLPACAR G LGP
Sbjct: 421 NMDGRNIVSWNAMIANYVLNGVALEAIRFVILLQESGERPNAVTFTNVLPACARSGHLGP 480
Query: 481 GKEIHAIGIRLGLTADMFVSNALTDMYAKCGCLHSARNVFNTSHKDEVSYNILILGYSET 540
GKEIHA+G+RLGLT+D+FV+NALTDMYAKCGC SARNVFNTSHKDEVSYNILI GYSET
Sbjct: 481 GKEIHAMGVRLGLTSDLFVTNALTDMYAKCGCFRSARNVFNTSHKDEVSYNILITGYSET 540
Query: 541 SDCLESLNLFSEMRLLGRKPDVVSFVGVISACANLAAVKQGKEIHGVALRNHFYSHLFVS 600
+DCLESLNLFSEMRLL +KPDVVSF+GVISACANLAAVKQGKEIHGVALRNH SHLFVS
Sbjct: 541 NDCLESLNLFSEMRLLRKKPDVVSFMGVISACANLAAVKQGKEIHGVALRNHLNSHLFVS 600
Query: 601 NSLLDFYTKCARIDLACKVFNQILFKDVASWNTMILGYGMIGELETAITMFEEMRGDTVQ 660
NSLLDFYTKC RIDLACK+FNQILFKDVASWNTMILGYGMIGELETAI MFE MR D VQ
Sbjct: 601 NSLLDFYTKCGRIDLACKIFNQILFKDVASWNTMILGYGMIGELETAINMFEAMRDDKVQ 660
Query: 661 YDLVSYIAVLSACSHGGLVEQGWKYFSEMLAQNLEPTQMHYACIVDLLGRAGFVEEAAEL 720
YDLVSYIAVLSACSHGGLVE+GW+Y SEMLAQ+LEPT+MHY C+VDLLGRAGFVEEAAEL
Sbjct: 661 YDLVSYIAVLSACSHGGLVERGWQYLSEMLAQHLEPTEMHYTCLVDLLGRAGFVEEAAEL 720
Query: 721 IRCLPIAPDANIWGALLGACRIYRNVELGCWAAEHLFEIKPQHCGYYILLSNMYAEAGRW 780
IR LPIAPD+NIWGALLGACRIY NVELGC AAE LFE+KPQHCGYYILL+NM+AE GRW
Sbjct: 721 IRRLPIAPDSNIWGALLGACRIYGNVELGCKAAEQLFELKPQHCGYYILLANMHAETGRW 780
Query: 781 DEVNRVRDLMKSRGAKKNPGCSWVQIHDQVHTFMAEERAEGFESGGWLAE 831
DEVNR+R+LMKSRGAKK+PGCSWVQIHDQ+H F+ ++RAEGFESGG LAE
Sbjct: 781 DEVNRIRELMKSRGAKKSPGCSWVQIHDQLHAFVVDDRAEGFESGGLLAE 809
BLAST of Moc02g02020 vs. ExPASy TrEMBL
Match:
A0A5A7V0P3 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold212G001270 PE=4 SV=1)
HSP 1 Score: 1417.5 bits (3668), Expect = 0.0e+00
Identity = 693/833 (83.19%), Postives = 754/833 (90.52%), Query Frame = 0
Query: 1 MLHSCARSTGLHFASPFQFSFQTTRFQYRSFLHPVLGSASSSPL--PLPSTEPVSSVHVD 60
ML C RS+ + PF F Q TRF+YR+F P+L SA SPL P ST+ S +H++
Sbjct: 1 MLELCIRSS-IVSPLPFPFPSQITRFRYRNFHQPILVSALQSPLSPPSSSTDSNSFIHIN 60
Query: 61 LLTLCSNAQSLRQTKQLHALCLLNGLLPHSVSLCASLILDYATFRDPESFRILFHQTVQN 120
LLTLCS QSL QTKQ+HAL +LNG LP SVSLCASLIL+YA F+ PESF LF+QT QN
Sbjct: 61 LLTLCSKVQSLLQTKQVHALGILNGFLPRSVSLCASLILNYAKFQHPESFCSLFNQTFQN 120
Query: 121 CRTAFLWNTLIRAHSIAGNGMIDGLQTYNRMVRSGVQLDDHTFPFVLKLCSDSLDICKGM 180
CRTAFLWNTLIRAHSIA NG +DG +TYNRMVR GVQLDDHTFPFVLKLCSDS DICKGM
Sbjct: 121 CRTAFLWNTLIRAHSIAWNGTLDGFETYNRMVRLGVQLDDHTFPFVLKLCSDSFDICKGM 180
Query: 181 EVHGVVSKLGFDSDVYVGNTLLMLYGNCGVVSDARRVFDEMPERDVVSWNTILGLLSVNG 240
EVHGVV KLGFD+DVYVGNTLLMLYGNCG ++DARRVFDEMPERDVVSWNT++GLLSVNG
Sbjct: 181 EVHGVVFKLGFDTDVYVGNTLLMLYGNCGFLNDARRVFDEMPERDVVSWNTVIGLLSVNG 240
Query: 241 DYREARNYYFWMTLRSGIQPNVVSVVILLPISAALEDEEMTRRIHCYTMKAGLDSQVTTC 300
DY+EARNYYFWM LRSGI+PN+VSV+ LLPISAALEDEEMTRRIHC+++K GLDSQVTTC
Sbjct: 241 DYKEARNYYFWMILRSGIKPNLVSVISLLPISAALEDEEMTRRIHCFSVKVGLDSQVTTC 300
Query: 301 NALVDAYGKCGNVKASWQVFDEMVERNEVSWNAIINGLACKGHFCDALDVFRMMIDAEVK 360
NALVDAYGKCG+VKA WQVF+EMVERNEVSWN+IINGLACKG DAL FRMMIDA K
Sbjct: 301 NALVDAYGKCGSVKALWQVFNEMVERNEVSWNSIINGLACKGRCWDALKAFRMMIDAGAK 360
Query: 361 PNSVTISSILPVLVELEYFKAGKEIHGFSMRMGTETDIFIANSLIDMYAKSGHSTEASCI 420
PNSVTISSILPVLVELE FKAGKEIHGFSMR+GTETDIFIANSLIDMYAKSG STEAS I
Sbjct: 361 PNSVTISSILPVLVELECFKAGKEIHGFSMRIGTETDIFIANSLIDMYAKSGRSTEASTI 420
Query: 421 FHNMDRRNVVSWNAMIANYALNGLALEAIRQVILMQETGENPNAVTFTNVLPACARLGFL 480
FHN+DRRNVV+WNAMIANYALN L LEAIR VI MQETGE PNAVTFTNVLPACARLGFL
Sbjct: 421 FHNLDRRNVVTWNAMIANYALNRLPLEAIRFVIQMQETGECPNAVTFTNVLPACARLGFL 480
Query: 481 GPGKEIHAIGIRLGLTADMFVSNALTDMYAKCGCLHSARNVFNTSHKDEVSYNILILGYS 540
GPGKEIHA+ +R+GLT+D+FVSN+L DMYAKCG L SARN+FNTSHKDEVSYNILI+GYS
Sbjct: 481 GPGKEIHAMVVRIGLTSDLFVSNSLIDMYAKCGSLCSARNLFNTSHKDEVSYNILIIGYS 540
Query: 541 ETSDCLESLNLFSEMRLLGRKPDVVSFVGVISACANLAAVKQGKEIHGVALRNHFYSHLF 600
ET+DC +SLNLFSEMRLLG+KPDVVSFVGVISACANLAA+KQGKEIHGVALRNH YSHLF
Sbjct: 541 ETNDCFQSLNLFSEMRLLGKKPDVVSFVGVISACANLAALKQGKEIHGVALRNHLYSHLF 600
Query: 601 VSNSLLDFYTKCARIDLACKVFNQILFKDVASWNTMILGYGMIGELETAITMFEEMRGDT 660
VSNSLLDFYTKC RID+AC+VFNQILFKDVASWNTMILGYGMIGELETAI+MFE MR DT
Sbjct: 601 VSNSLLDFYTKCGRIDIACRVFNQILFKDVASWNTMILGYGMIGELETAISMFEAMRDDT 660
Query: 661 VQYDLVSYIAVLSACSHGGLVEQGWKYFSEMLAQNLEPTQMHYACIVDLLGRAGFVEEAA 720
VQYDLVSYIAVLSACSHGGLVE+GW+YFSEMLAQ+LEPT+MHY C+VDLLGRAGFVEEAA
Sbjct: 661 VQYDLVSYIAVLSACSHGGLVERGWQYFSEMLAQHLEPTEMHYTCMVDLLGRAGFVEEAA 720
Query: 721 ELIRCLPIAPDANIWGALLGACRIYRNVELGCWAAEHLFEIKPQHCGYYILLSNMYAEAG 780
ELI+ LPIAPDANIWGALLGACRIY NVELGC AAEHLFE+KPQHCGYYILLSN+YAE G
Sbjct: 721 ELIQRLPIAPDANIWGALLGACRIYGNVELGCRAAEHLFELKPQHCGYYILLSNIYAETG 780
Query: 781 RWDEVNRVRDLMKSRGAKKNPGCSWVQIHDQVHTFMAEERAEGFESGGWLAES 832
RWDE NR+R+LMKSRGAKKNPGCSWVQI DQVH+F+AEER EGFESG WLAES
Sbjct: 781 RWDEANRIRELMKSRGAKKNPGCSWVQICDQVHSFVAEERVEGFESGDWLAES 832
BLAST of Moc02g02020 vs. ExPASy TrEMBL
Match:
A0A0A0KEH6 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G428560 PE=4 SV=1)
HSP 1 Score: 1417.5 bits (3668), Expect = 0.0e+00
Identity = 691/833 (82.95%), Postives = 751/833 (90.16%), Query Frame = 0
Query: 1 MLHSCARSTGLHFASPFQFSFQTTRFQYRSFLHPVLGSASSSPL--PLPSTEPVSSVHVD 60
ML C RS+ + PF F Q RFQYR+FL P+L SA SPL P ST+P S +H++
Sbjct: 1 MLEFCIRSS-IVSPLPFPFPSQIIRFQYRNFLQPILVSALQSPLSPPSSSTDPNSYIHIN 60
Query: 61 LLTLCSNAQSLRQTKQLHALCLLNGLLPHSVSLCASLILDYATFRDPESFRILFHQTVQN 120
LLTLCS QSL QTKQ+HAL +LNG LP SVSLCASLIL+YA F+ P SF LF+QT QN
Sbjct: 61 LLTLCSKVQSLLQTKQVHALGILNGFLPRSVSLCASLILNYAKFQHPGSFCSLFNQTFQN 120
Query: 121 CRTAFLWNTLIRAHSIAGNGMIDGLQTYNRMVRSGVQLDDHTFPFVLKLCSDSLDICKGM 180
CRTAFLWNTLIRAHSIA NG DG +TYNRMVR GVQLDDHTFPFVLKLCSDS DICKGM
Sbjct: 121 CRTAFLWNTLIRAHSIAWNGTFDGFETYNRMVRRGVQLDDHTFPFVLKLCSDSFDICKGM 180
Query: 181 EVHGVVSKLGFDSDVYVGNTLLMLYGNCGVVSDARRVFDEMPERDVVSWNTILGLLSVNG 240
EVHGVV KLGFD+DVYVGNTLLMLYGNCG ++DARR+FDEMPERDVVSWNTI+GLLSVNG
Sbjct: 181 EVHGVVFKLGFDTDVYVGNTLLMLYGNCGFLNDARRLFDEMPERDVVSWNTIIGLLSVNG 240
Query: 241 DYREARNYYFWMTLRSGIQPNVVSVVILLPISAALEDEEMTRRIHCYTMKAGLDSQVTTC 300
DY EARNYYFWM LRS I+PN+VSV+ LLPISAALEDEEMTRRIHCY++K GLDSQVTTC
Sbjct: 241 DYTEARNYYFWMILRSVIKPNLVSVISLLPISAALEDEEMTRRIHCYSVKVGLDSQVTTC 300
Query: 301 NALVDAYGKCGNVKASWQVFDEMVERNEVSWNAIINGLACKGHFCDALDVFRMMIDAEVK 360
NALVDAYGKCG+VKA WQVF+E VE+NEVSWN+IINGLACKG DAL+ FRMMIDA +
Sbjct: 301 NALVDAYGKCGSVKALWQVFNETVEKNEVSWNSIINGLACKGRCWDALNAFRMMIDAGAQ 360
Query: 361 PNSVTISSILPVLVELEYFKAGKEIHGFSMRMGTETDIFIANSLIDMYAKSGHSTEASCI 420
PNSVTISSILPVLVELE FKAGKEIHGFSMRMGTETDIFIANSLIDMYAKSGHSTEAS I
Sbjct: 361 PNSVTISSILPVLVELECFKAGKEIHGFSMRMGTETDIFIANSLIDMYAKSGHSTEASTI 420
Query: 421 FHNMDRRNVVSWNAMIANYALNGLALEAIRQVILMQETGENPNAVTFTNVLPACARLGFL 480
FHN+DRRN+VSWNAMIANYALN L LEAIR VI MQETGE PNAVTFTNVLPACARLGFL
Sbjct: 421 FHNLDRRNIVSWNAMIANYALNRLPLEAIRFVIQMQETGECPNAVTFTNVLPACARLGFL 480
Query: 481 GPGKEIHAIGIRLGLTADMFVSNALTDMYAKCGCLHSARNVFNTSHKDEVSYNILILGYS 540
GPGKEIHA+G+R+GLT+D+FVSN+L DMYAKCGCLHSARNVFNTS KDEVSYNILI+GYS
Sbjct: 481 GPGKEIHAMGVRIGLTSDLFVSNSLIDMYAKCGCLHSARNVFNTSRKDEVSYNILIIGYS 540
Query: 541 ETSDCLESLNLFSEMRLLGRKPDVVSFVGVISACANLAAVKQGKEIHGVALRNHFYSHLF 600
ET DCL+SLNLFSEMRLLG+KPDVVSFVGVISACANLAA+KQGKE+HGVALRNH YSHLF
Sbjct: 541 ETDDCLQSLNLFSEMRLLGKKPDVVSFVGVISACANLAALKQGKEVHGVALRNHLYSHLF 600
Query: 601 VSNSLLDFYTKCARIDLACKVFNQILFKDVASWNTMILGYGMIGELETAITMFEEMRGDT 660
VSNSLLDFYTKC RID+AC++FNQILFKDVASWNTMILGYGMIGELETAI+MFE MR DT
Sbjct: 601 VSNSLLDFYTKCGRIDIACRLFNQILFKDVASWNTMILGYGMIGELETAISMFEAMRDDT 660
Query: 661 VQYDLVSYIAVLSACSHGGLVEQGWKYFSEMLAQNLEPTQMHYACIVDLLGRAGFVEEAA 720
VQYDLVSYIAVLSACSHGGLVE+GW+YFSEMLAQ LEPT+MHY C+VDLLGRAGFVEEAA
Sbjct: 661 VQYDLVSYIAVLSACSHGGLVERGWQYFSEMLAQRLEPTEMHYTCMVDLLGRAGFVEEAA 720
Query: 721 ELIRCLPIAPDANIWGALLGACRIYRNVELGCWAAEHLFEIKPQHCGYYILLSNMYAEAG 780
+LI+ LPIAPDANIWGALLGACRIY NVELG AAEHLFE+KPQHCGYYILLSN+YAE G
Sbjct: 721 KLIQQLPIAPDANIWGALLGACRIYGNVELGRRAAEHLFELKPQHCGYYILLSNIYAETG 780
Query: 781 RWDEVNRVRDLMKSRGAKKNPGCSWVQIHDQVHTFMAEERAEGFESGGWLAES 832
RWDE N++R+LMKSRGAKKNPGCSWVQI+DQVH F+AEER EGFE G WLAES
Sbjct: 781 RWDEANKIRELMKSRGAKKNPGCSWVQIYDQVHAFVAEERVEGFELGDWLAES 832
BLAST of Moc02g02020 vs. TAIR 10
Match:
AT4G18750.1 (Pentatricopeptide repeat (PPR) superfamily protein )
HSP 1 Score: 490.0 bits (1260), Expect = 3.9e-138
Identity = 258/663 (38.91%), Postives = 395/663 (59.58%), Query Frame = 0
Query: 156 LDDHTFPFVLKLCSDSLDICKGMEVHGVVSKLGFDSDVYVGNTLLMLYGNCGVVSDARRV 215
+D T VL+LC+DS + G EV + GF D +G+ L ++Y NCG + +A RV
Sbjct: 92 IDPRTLCSVLQLCADSKSLKDGKEVDNFIRGNGFVIDSNLGSKLSLMYTNCGDLKEASRV 151
Query: 216 FDEMPERDVVSWNTILGLLSVNGDYREARNYYFWMTLRSGIQPNVVSVVILLPISAALED 275
FDE+ + WN ++ L+ +GD+ + + M + SG++ + + + ++L
Sbjct: 152 FDEVKIEKALFWNILMNELAKSGDFSGSIGLFKKM-MSSGVEMDSYTFSCVSKSFSSLRS 211
Query: 276 EEMTRRIHCYTMKAGLDSQVTTCNALVDAYGKCGNVKASWQVFDEMVERNEVSWNAIING 335
++H + +K+G + + N+LV Y K V ++ +VFDEM ER+ +SWN+IING
Sbjct: 212 VHGGEQLHGFILKSGFGERNSVGNSLVAFYLKNQRVDSARKVFDEMTERDVISWNSIING 271
Query: 336 LACKGHFCDALDVFRMMIDAEVKPNSVTISSILPVLVELEYFKAGKEIHGFSMRMGTETD 395
G L VF M+ + ++ + TI S+ + G+ +H ++ +
Sbjct: 272 YVSNGLAEKGLSVFVQMLVSGIEIDLATIVSVFAGCADSRLISLGRAVHSIGVKACFSRE 331
Query: 396 IFIANSLIDMYAKSGHSTEASCIFHNMDRRNVVSWNAMIANYALNGLALEAIRQVILMQE 455
N+L+DMY+K G A +F M R+VVS+ +MIA YA GLA EA++ M+E
Sbjct: 332 DRFCNTLLDMYSKCGDLDSAKAVFREMSDRSVVSYTSMIAGYAREGLAGEAVKLFEEMEE 391
Query: 456 TGENPNAVTFTNVLPACARLGFLGPGKEIHAIGIRLGLTADMFVSNALTDMYAKCGCLHS 515
G +P+ T T VL CAR L GK +H L D+FVSNAL DMYAKCG +
Sbjct: 392 EGISPDVYTVTAVLNCCARYRLLDEGKRVHEWIKENDLGFDIFVSNALMDMYAKCGSMQE 451
Query: 516 ARNVFNTSH-KDEVSYNILILGYSETSDCLESLNLFSEMRLLGR-KPDVVSFVGVISACA 575
A VF+ KD +S+N +I GYS+ E+L+LF+ + R PD + V+ ACA
Sbjct: 452 AELVFSEMRVKDIISWNTIIGGYSKNCYANEALSLFNLLLEEKRFSPDERTVACVLPACA 511
Query: 576 NLAAVKQGKEIHGVALRNHFYSHLFVSNSLLDFYTKCARIDLACKVFNQILFKDVASWNT 635
+L+A +G+EIHG +RN ++S V+NSL+D Y KC + LA +F+ I KD+ SW
Sbjct: 512 SLSAFDKGREIHGYIMRNGYFSDRHVANSLVDMYAKCGALLLAHMLFDDIASKDLVSWTV 571
Query: 636 MILGYGMIGELETAITMFEEMRGDTVQYDLVSYIAVLSACSHGGLVEQGWKYFSEMLAQ- 695
MI GYGM G + AI +F +MR ++ D +S++++L ACSH GLV++GW++F+ M +
Sbjct: 572 MIAGYGMHGFGKEAIALFNQMRQAGIEADEISFVSLLYACSHSGLVDEGWRFFNIMRHEC 631
Query: 696 NLEPTQMHYACIVDLLGRAGFVEEAAELIRCLPIAPDANIWGALLGACRIYRNVELGCWA 755
+EPT HYACIVD+L R G + +A I +PI PDA IWGALL CRI+ +V+L
Sbjct: 632 KIEPTVEHYACIVDMLARTGDLIKAYRFIENMPIPPDATIWGALLCGCRIHHDVKLAEKV 691
Query: 756 AEHLFEIKPQHCGYYILLSNMYAEAGRWDEVNRVRDLMKSRGAKKNPGCSWVQIHDQVHT 815
AE +FE++P++ GYY+L++N+YAEA +W++V R+R + RG +KNPGCSW++I +V+
Sbjct: 692 AEKVFELEPENTGYYVLMANIYAEAEKWEQVKRLRKRIGQRGLRKNPGCSWIEIKGRVNI 751
BLAST of Moc02g02020 vs. TAIR 10
Match:
AT3G03580.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )
HSP 1 Score: 473.4 bits (1217), Expect = 3.8e-133
Identity = 265/753 (35.19%), Postives = 429/753 (56.97%), Query Frame = 0
Query: 64 SNAQSLRQTKQLHALCLLNGLLPHSVSLCASLILDYATFRDPESFRILFHQTVQNCRTAF 123
S++ +L + +++HAL + G L S LI Y+ FR+P S +F + V + +
Sbjct: 15 SSSSNLNELRRIHALVISLG-LDSSDFFSGKLIDKYSHFREPASSLSVFRR-VSPAKNVY 74
Query: 124 LWNTLIRAHSIAGNGMI-DGLQTYNRMVRSGVQLDDHTFPFVLKLCSDSLDICKGMEVHG 183
LWN++IRA S NG+ + L+ Y ++ S V D +TFP V+K C+ D G V+
Sbjct: 75 LWNSIIRAFS--KNGLFPEALEFYGKLRESKVSPDKYTFPSVIKACAGLFDAEMGDLVYE 134
Query: 184 VVSKLGFDSDVYVGNTLLMLYGNCGVVSDARRVFDEMPERDVVSWNTILGLLSVNGDYRE 243
+ +GF+SD++VGN L+ +Y G+++ AR+VFDEMP RD+VSWN+++ S +G Y E
Sbjct: 135 QILDMGFESDLFVGNALVDMYSRMGLLTRARQVFDEMPVRDLVSWNSLISGYSSHGYYEE 194
Query: 244 ARNYYFWMTLRSGIQPNVVSVVILLPISAALEDEEMTRRIHCYTMKAGLDSQVTTCNALV 303
A Y + S I P+ +V +LP L + + +H + +K+G++S V N LV
Sbjct: 195 ALEIYHELK-NSWIVPDSFTVSSVLPAFGNLLVVKQGQGLHGFALKSGVNSVVVVNNGLV 254
Query: 304 DAYGKCGNVKASWQVFDEMVERNEVSWNAIINGLACKGHFCDALDVFRMMIDAEVKPNSV 363
Y K + +VFDEM R+ VS+N +I G +++ +F +D + KP+ +
Sbjct: 255 AMYLKFRRPTDARRVFDEMDVRDSVSYNTMICGYLKLEMVEESVRMFLENLD-QFKPDLL 314
Query: 364 TISSILPVLVELEYFKAGKEIHGFSMRMGTETDIFIANSLIDMYAKSGHSTEASCIFHNM 423
T+SS+L L K I+ + ++ G + + N LID+YAK G A +F++M
Sbjct: 315 TVSSVLRACGHLRDLSLAKYIYNYMLKAGFVLESTVRNILIDVYAKCGDMITARDVFNSM 374
Query: 424 DRRNVVSWNAMIANYALNGLALEAIRQVILMQETGENPNAVTFTNVLPACARLGFLGPGK 483
+ ++ VSWN++I+ Y +G +EA++ +M E + +T+ ++ RL L GK
Sbjct: 375 ECKDTVSWNSIISGYIQSGDLMEAMKLFKMMMIMEEQADHITYLMLISVSTRLADLKFGK 434
Query: 484 EIHAIGIRLGLTADMFVSNALTDMYAKCGCLHSARNVFNT-SHKDEVSYNILILGYSETS 543
+H+ GI+ G+ D+ VSNAL DMYAKCG + + +F++ D V++N +I
Sbjct: 435 GLHSNGIKSGICIDLSVSNALIDMYAKCGEVGDSLKIFSSMGTGDTVTWNTVISACVRFG 494
Query: 544 DCLESLNLFSEMRLLGRKPDVVSFVGVISACANLAAVKQGKEIHGVALRNHFYSHLFVSN 603
D L + ++MR PD+ +F+ + CA+LAA + GKEIH LR + S L + N
Sbjct: 495 DFATGLQVTTQMRKSEVVPDMATFLVTLPMCASLAAKRLGKEIHCCLLRFGYESELQIGN 554
Query: 604 SLLDFYTKCARIDLACKVFNQILFKDVASWNTMILGYGMIGELETAITMFEEMRGDTVQY 663
+L++ Y+KC ++ + +VF ++ +DV +W MI YGM GE E A+ F +M +
Sbjct: 555 ALIEMYSKCGCLENSSRVFERMSRRDVVTWTGMIYAYGMYGEGEKALETFADMEKSGIVP 614
Query: 664 DLVSYIAVLSACSHGGLVEQGWKYFSEMLAQ-NLEPTQMHYACIVDLLGRAGFVEEAAEL 723
D V +IA++ ACSH GLV++G F +M ++P HYAC+VDLL R+ + +A E
Sbjct: 615 DSVVFIAIIYACSHSGLVDEGLACFEKMKTHYKIDPMIEHYACVVDLLSRSQKISKAEEF 674
Query: 724 IRCLPIAPDANIWGALLGACRIYRNVELGCWAAEHLFEIKPQHCGYYILLSNMYAEAGRW 783
I+ +PI PDA+IW ++L ACR ++E + + E+ P GY IL SN YA +W
Sbjct: 675 IQAMPIKPDASIWASVLRACRTSGDMETAERVSRRIIELNPDDPGYSILASNAYAALRKW 734
Query: 784 DEVNRVRDLMKSRGAKKNPGCSWVQIHDQVHTF 814
D+V+ +R +K + KNPG SW+++ VH F
Sbjct: 735 DKVSLIRKSLKDKHITKNPGYSWIEVGKNVHVF 761
BLAST of Moc02g02020 vs. TAIR 10
Match:
AT1G18485.1 (Pentatricopeptide repeat (PPR) superfamily protein )
HSP 1 Score: 471.9 bits (1213), Expect = 1.1e-132
Identity = 263/776 (33.89%), Postives = 437/776 (56.31%), Query Frame = 0
Query: 59 LLTLCSNAQSLRQTKQLHALCLLNGLLPHSVSLCASLILDYATFRDPESFRILFHQTVQN 118
LL + + +++H L + L + LC +I YA P+ R +F
Sbjct: 90 LLQASGKRKDIEMGRKIHQLVSGSTRLRNDDVLCTRIITMYAMCGSPDDSRFVF--DALR 149
Query: 119 CRTAFLWNTLIRAHSIAGNGMIDG-LQTYNRMVRSGVQLDDH-TFPFVLKLCSDSLDICK 178
+ F WN +I ++S N + D L+T+ M+ + L DH T+P V+K C+ D+
Sbjct: 150 SKNLFQWNAVISSYS--RNELYDEVLETFIEMISTTDLLPDHFTYPCVIKACAGMSDVGI 209
Query: 179 GMEVHGVVSKLGFDSDVYVGNTLLMLYGNCGVVSDARRVFDEMPERDVVSWNTILGLLSV 238
G+ VHG+V K G DV+VGN L+ YG G V+DA ++FD MPER++VSWN+++ + S
Sbjct: 210 GLAVHGLVVKTGLVEDVFVGNALVSFYGTHGFVTDALQLFDIMPERNLVSWNSMIRVFSD 269
Query: 239 NGDYREARNYYFWMTLRSG---IQPNVVSVVILLPISAALEDEEMTRRIHCYTMKAGLDS 298
NG E+ M +G P+V ++V +LP+ A + + + +H + +K LD
Sbjct: 270 NGFSEESFLLLGEMMEENGDGAFMPDVATLVTVLPVCAREREIGLGKGVHGWAVKLRLDK 329
Query: 299 QVTTCNALVDAYGKCGNVKASWQVFDEMVERNEVSWNAIINGLACKGHFCDALDVFRMMI 358
++ NAL+D Y KCG + + +F +N VSWN ++ G + +G DV R M+
Sbjct: 330 ELVLNNALMDMYSKCGCITNAQMIFKMNNNKNVVSWNTMVGGFSAEGDTHGTFDVLRQML 389
Query: 359 --DAEVKPNSVTISSILPVLVELEYFKAGKEIHGFSMRMGTETDIFIANSLIDMYAKSGH 418
+VK + VTI + +PV + + KE+H +S++ + +AN+ + YAK G
Sbjct: 390 AGGEDVKADEVTILNAVPVCFHESFLPSLKELHCYSLKQEFVYNELVANAFVASYAKCGS 449
Query: 419 STEASCIFHNMDRRNVVSWNAMIANYALNGLALEAIRQVILMQETGENPNAVTFTNVLPA 478
+ A +FH + + V SWNA+I +A + ++ + M+ +G P++ T ++L A
Sbjct: 450 LSYAQRVFHGIRSKTVNSWNALIGGHAQSNDPRLSLDAHLQMKISGLLPDSFTVCSLLSA 509
Query: 479 CARLGFLGPGKEIHAIGIRLGLTADMFVSNALTDMYAKCGCLHSARNVFNT-SHKDEVSY 538
C++L L GKE+H IR L D+FV ++ +Y CG L + + +F+ K VS+
Sbjct: 510 CSKLKSLRLGKEVHGFIIRNWLERDLFVYLSVLSLYIHCGELCTVQALFDAMEDKSLVSW 569
Query: 539 NILILGYSETSDCLESLNLFSEMRLLGRKPDVVSFVGVISACANLAAVKQGKEIHGVALR 598
N +I GY + +L +F +M L G + +S + V AC+ L +++ G+E H AL+
Sbjct: 570 NTVITGYLQNGFPDRALGVFRQMVLYGIQLCGISMMPVFGACSLLPSLRLGREAHAYALK 629
Query: 599 NHFYSHLFVSNSLLDFYTKCARIDLACKVFNQILFKDVASWNTMILGYGMIGELETAITM 658
+ F++ SL+D Y K I + KVFN + K ASWN MI+GYG+ G + AI +
Sbjct: 630 HLLEDDAFIACSLIDMYAKNGSITQSSKVFNGLKEKSTASWNAMIMGYGIHGLAKEAIKL 689
Query: 659 FEEMRGDTVQYDLVSYIAVLSACSHGGLVEQGWKYFSEMLAQ-NLEPTQMHYACIVDLLG 718
FEEM+ D ++++ VL+AC+H GL+ +G +Y +M + L+P HYAC++D+LG
Sbjct: 690 FEEMQRTGHNPDDLTFLGVLTACNHSGLIHEGLRYLDQMKSSFGLKPNLKHYACVIDMLG 749
Query: 719 RAGFVEEAAELI-RCLPIAPDANIWGALLGACRIYRNVELGCWAAEHLFEIKPQHCGYYI 778
RAG +++A ++ + D IW +LL +CRI++N+E+G A LFE++P+ Y+
Sbjct: 750 RAGQLDKALRVVAEEMSEEADVGIWKSLLSSCRIHQNLEMGEKVAAKLFELEPEKPENYV 809
Query: 779 LLSNMYAEAGRWDEVNRVRDLMKSRGAKKNPGCSWVQIHDQVHTFMAEER-AEGFE 824
LLSN+YA G+W++V +VR M +K+ GCSW++++ +V +F+ ER +GFE
Sbjct: 810 LLSNLYAGLGKWEDVRKVRQRMNEMSLRKDAGCSWIELNRKVFSFVVGERFLDGFE 861
BLAST of Moc02g02020 vs. TAIR 10
Match:
AT1G15510.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )
HSP 1 Score: 468.0 bits (1203), Expect = 1.6e-131
Identity = 237/689 (34.40%), Postives = 389/689 (56.46%), Query Frame = 0
Query: 132 HSIAGNGMI-DGLQTYNRMVRSGVQLDDHTFPFVLKLCSDSLDICKGMEVHGVVSKLGFD 191
H + NG + + ++ N M V +D+ F +++LC +G +V+ +
Sbjct: 67 HGLCANGKLEEAMKLLNSMQELRVAVDEDVFVALVRLCEWKRAQEEGSKVYSIALSSMSS 126
Query: 192 SDVYVGNTLLMLYGNCGVVSDARRVFDEMPERDVVSWNTILGLLSVNGDYREARNYYFWM 251
V +GN L ++ G + DA VF +M ER++ SWN ++G + G + EA Y M
Sbjct: 127 LGVELGNAFLAMFVRFGNLVDAWYVFGKMSERNLFSWNVLVGGYAKQGYFDEAMCLYHRM 186
Query: 252 TLRSGIQPNVVSVVILLPISAALEDEEMTRRIHCYTMKAGLDSQVTTCNALVDAYGKCGN 311
G++P+V + +L + D + +H + ++ G + + NAL+ Y KCG+
Sbjct: 187 LWVGGVKPDVYTFPCVLRTCGGIPDLARGKEVHVHVVRYGYELDIDVVNALITMYVKCGD 246
Query: 312 VKASWQVFDEMVERNEVSWNAIINGLACKGHFCDALDVFRMMIDAEVKPNSVTISSILPV 371
VK++ +FD M R+ +SWNA+I+G G + L++F M V P+ +T++S++
Sbjct: 247 VKSARLLFDRMPRRDIISWNAMISGYFENGMCHEGLELFFAMRGLSVDPDLMTLTSVISA 306
Query: 372 LVELEYFKAGKEIHGFSMRMGTETDIFIANSLIDMYAKSGHSTEASCIFHNMDRRNVVSW 431
L + G++IH + + G DI + NSL MY +G EA +F M+R+++VSW
Sbjct: 307 CELLGDRRLGRDIHAYVITTGFAVDISVCNSLTQMYLNAGSWREAEKLFSRMERKDIVSW 366
Query: 432 NAMIANYALNGLALEAIRQVILMQETGENPNAVTFTNVLPACARLGFLGPGKEIHAIGIR 491
MI+ Y N L +AI +M + P+ +T VL ACA LG L G E+H + I+
Sbjct: 367 TTMISGYEYNFLPDKAIDTYRMMDQDSVKPDEITVAAVLSACATLGDLDTGVELHKLAIK 426
Query: 492 LGLTADMFVSNALTDMYAKCGCLHSARNVF-NTSHKDEVSYNILILGYSETSDCLESLNL 551
L + + V+N L +MY+KC C+ A ++F N K+ +S+ +I G + C E+L
Sbjct: 427 ARLISYVIVANNLINMYSKCKCIDKALDIFHNIPRKNVISWTSIIAGLRLNNRCFEALIF 486
Query: 552 FSEMRLLGRKPDVVSFVGVISACANLAAVKQGKEIHGVALRNHFYSHLFVSNSLLDFYTK 611
+M++ +P+ ++ ++ACA + A+ GKEIH LR F+ N+LLD Y +
Sbjct: 487 LRQMKMT-LQPNAITLTAALAACARIGALMCGKEIHAHVLRTGVGLDDFLPNALLDMYVR 546
Query: 612 CARIDLACKVFNQILFKDVASWNTMILGYGMIGELETAITMFEEMRGDTVQYDLVSYIAV 671
C R++ A FN KDV SWN ++ GY G+ + +F+ M V+ D +++I++
Sbjct: 547 CGRMNTAWSQFNS-QKKDVTSWNILLTGYSERGQGSMVVELFDRMVKSRVRPDEITFISL 606
Query: 672 LSACSHGGLVEQGWKYFSEMLAQNLEPTQMHYACIVDLLGRAGFVEEAAELIRCLPIAPD 731
L CS +V QG YFS+M + P HYAC+VDLLGRAG ++EA + I+ +P+ PD
Sbjct: 607 LCGCSKSQMVRQGLMYFSKMEDYGVTPNLKHYACVVDLLGRAGELQEAHKFIQKMPVTPD 666
Query: 732 ANIWGALLGACRIYRNVELGCWAAEHLFEIKPQHCGYYILLSNMYAEAGRWDEVNRVRDL 791
+WGALL ACRI+ ++LG +A+H+FE+ + GYYILL N+YA+ G+W EV +VR +
Sbjct: 667 PAVWGALLNACRIHHKIDLGELSAQHIFELDKKSVGYYILLCNLYADCGKWREVAKVRRM 726
Query: 792 MKSRGAKKNPGCSWVQIHDQVHTFMAEER 819
MK G + GCSWV++ +VH F+++++
Sbjct: 727 MKENGLTVDAGCSWVEVKGKVHAFLSDDK 753
BLAST of Moc02g02020 vs. TAIR 10
Match:
AT1G69350.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )
HSP 1 Score: 453.4 bits (1165), Expect = 4.0e-127
Identity = 252/762 (33.07%), Postives = 429/762 (56.30%), Query Frame = 0
Query: 60 LTLCSNAQSLRQTKQLHALCLLNGLLPHSVSLCASLILDYATFRDPESFRILFHQTVQNC 119
+ L + SLR QLHA L+ G L LI YA P+S R++F
Sbjct: 5 MPLFRSCSSLRLVSQLHAHLLVTGRLRRDPLPVTKLIESYAFMGSPDSSRLVFEAFPY-- 64
Query: 120 RTAFLWNTLIRAHSIAGNGMIDGLQTYNRMVRSGVQLDDHTFPFVLKLCSDSLD-ICKGM 179
+F++ LI+ ++ + + + Y+R+V Q+ FP VL+ C+ S + + G
Sbjct: 65 PDSFMYGVLIKC-NVWCHLLDAAIDLYHRLVSETTQISKFVFPSVLRACAGSREHLSVGG 124
Query: 180 EVHGVVSKLGFDSDVYVGNTLLMLYGNCGVVSDARRVFDEMPERDVVSWNTILGLLSVNG 239
+VHG + K G D D + +LL +YG G +SDA +VFD MP RD+V+W+T++ NG
Sbjct: 125 KVHGRIIKGGVDDDAVIETSLLCMYGQTGNLSDAEKVFDGMPVRDLVAWSTLVSSCLENG 184
Query: 240 DYREARNYYFWMTLRSGIQPNVVSVVILLPISAALEDEEMTRRIHCYTMKAGLDSQVTTC 299
+ +A + M + G++P+ V+++ ++ A L + R +H + D T C
Sbjct: 185 EVVKALRMFKCM-VDDGVEPDAVTMISVVEGCAELGCLRIARSVHGQITRKMFDLDETLC 244
Query: 300 NALVDAYGKCGNVKASWQVFDEMVERNEVSWNAIINGLACKGHFCD-ALDVFRMMIDAEV 359
N+L+ Y KCG++ +S ++F+++ ++N VSW A+I+ +G F + AL F MI + +
Sbjct: 245 NSLLTMYSKCGDLLSSERIFEKIAKKNAVSWTAMISSYN-RGEFSEKALRSFSEMIKSGI 304
Query: 360 KPNSVTISSILPVLVELEYFKAGKEIHGFSMRMGTETDI-FIANSLIDMYAKSGHSTEAS 419
+PN VT+ S+L + + GK +HGF++R + + ++ +L+++YA+ G ++
Sbjct: 305 EPNLVTLYSVLSSCGLIGLIREGKSVHGFAVRRELDPNYESLSLALVELYAECGKLSDCE 364
Query: 420 CIFHNMDRRNVVSWNAMIANYALNGLALEAIRQVILMQETGENPNAVTFTNVLPACARLG 479
+ + RN+V+WN++I+ YA G+ ++A+ M P+A T + + AC G
Sbjct: 365 TVLRVVSDRNIVAWNSLISLYAHRGMVIQALGLFRQMVTQRIKPDAFTLASSISACENAG 424
Query: 480 FLGPGKEIHAIGIRLGLTADMFVSNALTDMYAKCGCLHSARNVFN-TSHKDEVSYNILIL 539
+ GK+IH IR + +D FV N+L DMY+K G + SA VFN H+ V++N ++
Sbjct: 425 LVPLGKQIHGHVIRTDV-SDEFVQNSLIDMYSKSGSVDSASTVFNQIKHRSVVTWNSMLC 484
Query: 540 GYSETSDCLESLNLFSEMRLLGRKPDVVSFVGVISACANLAAVKQGKEIHGVALRNHFYS 599
G+S+ + +E+++LF M + + V+F+ VI AC+++ ++++GK +H + +
Sbjct: 485 GFSQNGNSVEAISLFDYMYHSYLEMNEVTFLAVIQACSSIGSLEKGKWVHHKLIISGL-K 544
Query: 600 HLFVSNSLLDFYTKCARIDLACKVFNQILFKDVASWNTMILGYGMIGELETAITMFEEMR 659
LF +L+D Y KC ++ A VF + + + SW++MI YGM G + +AI+ F +M
Sbjct: 545 DLFTDTALIDMYAKCGDLNAAETVFRAMSSRSIVSWSSMINAYGMHGRIGSAISTFNQMV 604
Query: 660 GDTVQYDLVSYIAVLSACSHGGLVEQGWKYFSEMLAQNLEPTQMHYACIVDLLGRAGFVE 719
+ + V ++ VLSAC H G VE+G YF+ M + + P H+AC +DLL R+G ++
Sbjct: 605 ESGTKPNEVVFMNVLSACGHSGSVEEGKYYFNLMKSFGVSPNSEHFACFIDLLSRSGDLK 664
Query: 720 EAAELIRCLPIAPDANIWGALLGACRIYRNVELGCWAAEHLFEIKPQHCGYYILLSNMYA 779
EA I+ +P DA++WG+L+ CRI++ +++ L +I GYY LLSN+YA
Sbjct: 665 EAYRTIKEMPFLADASVWGSLVNGCRIHQKMDIIKAIKNDLSDIVTDDTGYYTLLSNIYA 724
Query: 780 EAGRWDEVNRVRDLMKSRGAKKNPGCSWVQIHDQVHTFMAEE 818
E G W+E R+R MKS KK PG S ++I +V F A E
Sbjct: 725 EEGEWEEFRRLRSAMKSSNLKKVPGYSAIEIDQKVFRFGAGE 759
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
XP_022153017.1 | 0.0e+00 | 100.00 | pentatricopeptide repeat-containing protein At4g14170-like [Momordica charantia] | [more] |
XP_038901996.1 | 0.0e+00 | 85.58 | pentatricopeptide repeat-containing protein At4g14170-like [Benincasa hispida] | [more] |
KAG6570435.1 | 0.0e+00 | 83.01 | Pentatricopeptide repeat-containing protein DOT4, chloroplastic, partial [Cucurb... | [more] |
XP_022985648.1 | 0.0e+00 | 82.77 | pentatricopeptide repeat-containing protein At4g14170-like isoform X1 [Cucurbita... | [more] |
XP_022944177.1 | 0.0e+00 | 83.13 | pentatricopeptide repeat-containing protein At4g14170-like isoform X1 [Cucurbita... | [more] |
Match Name | E-value | Identity | Description | |
Q9SN39 | 5.5e-137 | 38.91 | Pentatricopeptide repeat-containing protein DOT4, chloroplastic OS=Arabidopsis t... | [more] |
Q9SS60 | 5.3e-132 | 35.19 | Pentatricopeptide repeat-containing protein At3g03580 OS=Arabidopsis thaliana OX... | [more] |
Q0WN60 | 1.5e-131 | 33.89 | Pentatricopeptide repeat-containing protein At1g18485 OS=Arabidopsis thaliana OX... | [more] |
Q9M9E2 | 2.2e-130 | 34.40 | Pentatricopeptide repeat-containing protein At1g15510, chloroplastic OS=Arabidop... | [more] |
Q9C507 | 5.7e-126 | 33.07 | Putative pentatricopeptide repeat-containing protein At1g69350, mitochondrial OS... | [more] |
Match Name | E-value | Identity | Description | |
A0A6J1DHT8 | 0.0e+00 | 100.00 | pentatricopeptide repeat-containing protein At4g14170-like OS=Momordica charanti... | [more] |
A0A6J1JE86 | 0.0e+00 | 82.77 | pentatricopeptide repeat-containing protein At4g14170-like isoform X1 OS=Cucurbi... | [more] |
A0A6J1FV31 | 0.0e+00 | 83.13 | pentatricopeptide repeat-containing protein At4g14170-like isoform X1 OS=Cucurbi... | [more] |
A0A5A7V0P3 | 0.0e+00 | 83.19 | Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946... | [more] |
A0A0A0KEH6 | 0.0e+00 | 82.95 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G428560 PE=4 SV=1 | [more] |
Match Name | E-value | Identity | Description | |
AT4G18750.1 | 3.9e-138 | 38.91 | Pentatricopeptide repeat (PPR) superfamily protein | [more] |
AT3G03580.1 | 3.8e-133 | 35.19 | Tetratricopeptide repeat (TPR)-like superfamily protein | [more] |
AT1G18485.1 | 1.1e-132 | 33.89 | Pentatricopeptide repeat (PPR) superfamily protein | [more] |
AT1G15510.1 | 1.6e-131 | 34.40 | Tetratricopeptide repeat (TPR)-like superfamily protein | [more] |
AT1G69350.1 | 4.0e-127 | 33.07 | Tetratricopeptide repeat (TPR)-like superfamily protein | [more] |