Homology
BLAST of HG10020467 vs. NCBI nr
Match:
XP_038904608.1 (pentatricopeptide repeat-containing protein At1g09900-like [Benincasa hispida])
HSP 1 Score: 806.2 bits (2081), Expect = 1.6e-229
Identity = 413/490 (84.29%), Positives = 429/490 (87.55%), Query Frame = 0
Query: 28 LRLPLRPSFLTTLPLSSTYTPFGANLFEENSRLSTNKQSHSIHTALRSFT---------- 87
+ L L PSF TT PLS TYTPF AN E SRLS NKQSHSI T SFT
Sbjct: 1 MHLALPPSFFTTRPLSYTYTPFAANFCGEYSRLSANKQSHSIDTGQHSFTHNAHASPFKL 60
Query: 88 ----PTRITRIKSLPIPSEEGTEIFIMSQKHTEIRNISEFNDLFMDFVSENELHLALKLL 147
PTR+TRIKSLPIPSEEGTEIFIMSQK TEIRN+SEFND MDFVSENEL LALKLL
Sbjct: 61 EIEIPTRVTRIKSLPIPSEEGTEIFIMSQKRTEIRNVSEFNDRLMDFVSENELDLALKLL 120
Query: 148 SNISSYGLVPNSRTFSIMIRCYCKKGELDNAGRVLEQMLGRGHYPNDATVAFLVNAFCKR 207
SNISSYGLVPNSRTFSIMIR YCKKGEL+ AG+VLEQM+GRGH PNDATV LVNAFCKR
Sbjct: 121 SNISSYGLVPNSRTFSIMIRGYCKKGELETAGKVLEQMVGRGHNPNDATVTVLVNAFCKR 180
Query: 208 GKTQKALEMVEVVGRIGRKPTVKTYNCLLKGLCYVGRVEEACEMVTEMKKDSLIPDIYTY 267
GKTQKALEMVE+VGRIGRKPTV+TYNCLLKGLCYVGRVEEACEMVTEMKKD LIPDIYTY
Sbjct: 181 GKTQKALEMVEMVGRIGRKPTVQTYNCLLKGLCYVGRVEEACEMVTEMKKDRLIPDIYTY 240
Query: 268 TALMDGLCKVGRSDEAMELLNEAEENGLKPSVVTFNTLFNGYCKEGRPVDGIYVLNKMKQ 327
TALMDGLCKVGRSDEAMELL EAE NGLKPSVVTFNTLFNGYCKEGRPVDGIYVLNKMKQ
Sbjct: 241 TALMDGLCKVGRSDEAMELLIEAEGNGLKPSVVTFNTLFNGYCKEGRPVDGIYVLNKMKQ 300
Query: 328 MNCMPDRITYTTLLHGLIKWGKIRIALRTYKEMVSSGHTIEAKMMNTFMRALSKRSWKEK 387
+NC PDRITYTTLLHGLIKWGKIRIAL TYKEMVSSGHTIEAKMMNTFMRAL +RSW EK
Sbjct: 301 INCTPDRITYTTLLHGLIKWGKIRIALSTYKEMVSSGHTIEAKMMNTFMRALCRRSWNEK 360
Query: 388 DLLEDAHQVFEKMKDDFQVIDRSTYGLLIQALCSGNMISEALANLHHMIGKGYSPRAITI 447
DLLEDAHQVFEKMKDD+QVID+STYGLLIQALCSGNMISEALANLHHMI KGYSPRAI I
Sbjct: 361 DLLEDAHQVFEKMKDDYQVIDQSTYGLLIQALCSGNMISEALANLHHMIEKGYSPRAIII 420
Query: 448 DVVVQSLCHTHSHTGSANEALCLLGHGIPFSTTSFDLIINELNKQGMRLSASNVYGLALK 504
DVVVQ+LC H GS +EAL +LGHGIPF SFDLII+ELNKQ MR SA NVYGLALK
Sbjct: 421 DVVVQTLC----HRGSVDEALLVLGHGIPFRRFSFDLIIDELNKQEMRFSACNVYGLALK 480
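Each hit's summary line reports identities and positives as counts over the alignment length, followed by a rounded percentage. The sketch below is a minimal, hypothetical parser for that line (the names `HSP_STATS` and `parse_hsp_stats` are illustrative and not part of any BLAST tool); it recomputes the percentages from the counts so the reported figures can be checked.

```python
import re

# Matches the statistics line of one HSP. "Posi?tives" also accepts the
# "Postives" spelling that some report generators emit.
HSP_STATS = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\),\s*"
    r"Posi?tives\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)"
)

def parse_hsp_stats(line):
    """Return the counts and the percentages recomputed from them."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP statistics line")
    ident, ident_len, _, pos, pos_len, _ = m.groups()
    ident, ident_len, pos, pos_len = map(int, (ident, ident_len, pos, pos_len))
    return {
        "identities": ident,
        "positives": pos,
        "alignment_length": ident_len,
        "identity_pct": round(100 * ident / ident_len, 2),
        "positive_pct": round(100 * pos / pos_len, 2),
    }

# Statistics line of the XP_038904608.1 hit above.
stats = parse_hsp_stats(
    "Identity = 413/490 (84.29%), Positives = 429/490 (87.55%), Query Frame = 0"
)
print(stats)
```

The denominator is the alignment length including gap columns, which is why it can differ from the length of the query span covered.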
BLAST of HG10020467 vs. NCBI nr
Match:
XP_022982589.1 (pentatricopeptide repeat-containing protein At5g64320, mitochondrial-like [Cucurbita maxima])
HSP 1 Score: 768.8 bits (1984), Expect = 2.8e-218
Identity = 394/496 (79.44%), Positives = 426/496 (85.89%), Query Frame = 0
Query: 28 LRLPLRPSFLTTLPLSSTYTPFGANLFEENSRLSTNKQSHSIHTALRSFT---------- 87
+RL LRPSFLTTLPLSST TPFGAN E N R S NK+ HS+ SF
Sbjct: 1 MRLALRPSFLTTLPLSSTDTPFGANFIEANDRRSANKRPHSVCNGGFSFNHHVHASPFKL 60
Query: 88 ----PTRIT------RIKSLPIPSEEGTEIFIMSQKHTEIRNISEFNDLFMDFVSENELH 147
PTR+T RIKSLPIPSEEGTEIFIMSQK EI+NI EFNDLFM+FVSE+EL
Sbjct: 61 GIQIPTRMTIQNVADRIKSLPIPSEEGTEIFIMSQKSCEIQNICEFNDLFMEFVSEDELD 120
Query: 148 LALKLLSNISSYGLVPNSRTFSIMIRCYCKKGELDNAGRVLEQMLGRGHYPNDATVAFLV 207
LALKLLSN++SYGLVPNSRTFSIMIRCYCKKG+LDNA RVL QMLGRG PNDAT+ LV
Sbjct: 121 LALKLLSNLTSYGLVPNSRTFSIMIRCYCKKGDLDNAARVLGQMLGRGCNPNDATITVLV 180
Query: 208 NAFCKRGKTQKALEMVEVVGRIGRKPTVKTYNCLLKGLCYVGRVEEACEMVTEMKKDSLI 267
NAFCKRGK QKALEMVE+VGR GRKPTV+ YNCLLKGLCYVGRVEEACEMVTEMKKDSLI
Sbjct: 181 NAFCKRGKMQKALEMVELVGRNGRKPTVQAYNCLLKGLCYVGRVEEACEMVTEMKKDSLI 240
Query: 268 PDIYTYTALMDGLCKVGRSDEAMELLNEAEENGLKPSVVTFNTLFNGYCKEGRPVDGIYV 327
PDIYTYTALMDGLCKVGRSDEAMELL+EAE NG+KPSVVTFNTLFNGYCKEGRP+DGI V
Sbjct: 241 PDIYTYTALMDGLCKVGRSDEAMELLSEAEGNGVKPSVVTFNTLFNGYCKEGRPLDGIRV 300
Query: 328 LNKMKQMNCMPDRITYTTLLHGLIKWGKIRIALRTYKEMVSSGHTIEAKMMNTFMRALSK 387
L KMKQMNC PDRI+Y+TLLHGLIKWGKIR ALRTYKEMVSSGH+IE KMMNTFMRAL +
Sbjct: 301 LKKMKQMNCTPDRISYSTLLHGLIKWGKIRTALRTYKEMVSSGHSIEEKMMNTFMRALCR 360
Query: 388 RSWKEKDLLEDAHQVFEKMKDDFQVIDRSTYGLLIQALCSGNMISEALANLHHMIGKGYS 447
R+WKEKDLLEDAHQVFEKMK++FQVIDRSTYGLLIQALCSGN SEALANLHHMIGKGYS
Sbjct: 361 RTWKEKDLLEDAHQVFEKMKNEFQVIDRSTYGLLIQALCSGNRTSEALANLHHMIGKGYS 420
Query: 448 PRAITIDVVVQSLCHTHSHTGSANEALCLLGHGIPFSTTSFDLIINELNKQGMRLSASNV 504
P AITIDV+VQ+LC H+GSA+EALC+LGHGI FS SFDLII ELN++GM LSA +V
Sbjct: 421 PWAITIDVMVQALC----HSGSASEALCVLGHGIRFSRISFDLIIEELNEEGMWLSACSV 480
BLAST of HG10020467 vs. NCBI nr
Match:
KAG6580429.1 (Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 766.5 bits (1978), Expect = 1.4e-217
Identity = 391/496 (78.83%), Positives = 424/496 (85.48%), Query Frame = 0
Query: 28 LRLPLRPSFLTTLPLSSTYTPFGANLFEENSRLSTNKQSHSIHTALRSFT---------- 87
+R+ LRPSFLTTLPLSST TPFGAN EEN RLS NK+ HS+ SF
Sbjct: 1 MRIALRPSFLTTLPLSSTDTPFGANFIEENGRLSANKRPHSVCNGGLSFNHHVHASPFKL 60
Query: 88 ----PTRIT------RIKSLPIPSEEGTEIFIMSQKHTEIRNISEFNDLFMDFVSENELH 147
PTR+T IKSLPIPSEEGTEIFIMSQK EI+NI EFNDLFM+FVSE+EL
Sbjct: 61 GIQIPTRMTIQNVADTIKSLPIPSEEGTEIFIMSQKSCEIQNICEFNDLFMEFVSEDELD 120
Query: 148 LALKLLSNISSYGLVPNSRTFSIMIRCYCKKGELDNAGRVLEQMLGRGHYPNDATVAFLV 207
LALKLLSN+SSYGLVPN RTFSIMIRCYCKKG+L+NA RVL QMLGRG PNDAT+ LV
Sbjct: 121 LALKLLSNLSSYGLVPNCRTFSIMIRCYCKKGDLENAARVLGQMLGRGCNPNDATITVLV 180
Query: 208 NAFCKRGKTQKALEMVEVVGRIGRKPTVKTYNCLLKGLCYVGRVEEACEMVTEMKKDSLI 267
NAFCKRGK QKALEMVE+VGR GRKPTV+ YNCLLKGLCYVGRVEEACEMVTEMKKDSLI
Sbjct: 181 NAFCKRGKMQKALEMVELVGRNGRKPTVQAYNCLLKGLCYVGRVEEACEMVTEMKKDSLI 240
Query: 268 PDIYTYTALMDGLCKVGRSDEAMELLNEAEENGLKPSVVTFNTLFNGYCKEGRPVDGIYV 327
PDIYTYTALMDGLCKVGRSDEAMELL+EAE+NG+KPSVVTFNTLFNGYCKEGRP+DGI V
Sbjct: 241 PDIYTYTALMDGLCKVGRSDEAMELLSEAEQNGVKPSVVTFNTLFNGYCKEGRPLDGIRV 300
Query: 328 LNKMKQMNCMPDRITYTTLLHGLIKWGKIRIALRTYKEMVSSGHTIEAKMMNTFMRALSK 387
L KMKQMNC PDRI+Y+TLL GLIKWGKIR ALRTYKEMVSSGH+IE KMMNTFMRAL +
Sbjct: 301 LKKMKQMNCTPDRISYSTLLQGLIKWGKIRTALRTYKEMVSSGHSIEEKMMNTFMRALCR 360
Query: 388 RSWKEKDLLEDAHQVFEKMKDDFQVIDRSTYGLLIQALCSGNMISEALANLHHMIGKGYS 447
R+WKEKDLLEDAHQVFEKMK++ QVIDRSTYGLLIQALCSGN SEALANLHHMIGKGYS
Sbjct: 361 RTWKEKDLLEDAHQVFEKMKNELQVIDRSTYGLLIQALCSGNRTSEALANLHHMIGKGYS 420
Query: 448 PRAITIDVVVQSLCHTHSHTGSANEALCLLGHGIPFSTTSFDLIINELNKQGMRLSASNV 504
PRAITIDV+VQ+LC H+G A+EALC+LGHGI FS SFDLII ELN++GM SA NV
Sbjct: 421 PRAITIDVMVQALC----HSGGASEALCVLGHGIRFSRISFDLIIEELNEEGMWFSACNV 480
BLAST of HG10020467 vs. NCBI nr
Match:
XP_022935128.1 (pentatricopeptide repeat-containing protein At1g09900-like [Cucurbita moschata])
HSP 1 Score: 766.1 bits (1977), Expect = 1.8e-217
Identity = 390/496 (78.63%), Positives = 424/496 (85.48%), Query Frame = 0
Query: 28 LRLPLRPSFLTTLPLSSTYTPFGANLFEENSRLSTNKQSHSIHTALRSFT---------- 87
+R+ LRPSFLTTLPLSST TPFGAN EEN RLS NK+ HS+ SF
Sbjct: 1 MRIALRPSFLTTLPLSSTDTPFGANFIEENGRLSANKRPHSVCNGGLSFNHHVHGSPFKL 60
Query: 88 ----PTRIT------RIKSLPIPSEEGTEIFIMSQKHTEIRNISEFNDLFMDFVSENELH 147
PTR+T IKSLPIPSEEGTEIFIMSQK EI+NI EFNDLFM+FVSE+EL
Sbjct: 61 GIQIPTRMTIQNVADTIKSLPIPSEEGTEIFIMSQKSCEIQNICEFNDLFMEFVSEDELD 120
Query: 148 LALKLLSNISSYGLVPNSRTFSIMIRCYCKKGELDNAGRVLEQMLGRGHYPNDATVAFLV 207
LALKLLSN+SSYGLVPN RTFSIMIRCYCKKG+L+NA RVL QMLGRG PNDAT+ LV
Sbjct: 121 LALKLLSNLSSYGLVPNCRTFSIMIRCYCKKGDLENAARVLGQMLGRGCNPNDATITVLV 180
Query: 208 NAFCKRGKTQKALEMVEVVGRIGRKPTVKTYNCLLKGLCYVGRVEEACEMVTEMKKDSLI 267
NAFCKRGK QKALEMVE+VGR GRKPTV+ YNCLLKGLCYVGRVEEACEMVTEMKKDSLI
Sbjct: 181 NAFCKRGKMQKALEMVELVGRNGRKPTVQAYNCLLKGLCYVGRVEEACEMVTEMKKDSLI 240
Query: 268 PDIYTYTALMDGLCKVGRSDEAMELLNEAEENGLKPSVVTFNTLFNGYCKEGRPVDGIYV 327
PDIYTYTALMDGLCKVGRSDEAMELL+EAE+NG+KPSVVTFNTLFNGYCKEGRP+DGI V
Sbjct: 241 PDIYTYTALMDGLCKVGRSDEAMELLSEAEQNGVKPSVVTFNTLFNGYCKEGRPLDGIRV 300
Query: 328 LNKMKQMNCMPDRITYTTLLHGLIKWGKIRIALRTYKEMVSSGHTIEAKMMNTFMRALSK 387
L KMKQMNC PDRI+Y+TLL GLIKWGKIR ALRTYKEMVSSGH+IE KMMNTFMRAL +
Sbjct: 301 LKKMKQMNCTPDRISYSTLLQGLIKWGKIRTALRTYKEMVSSGHSIEEKMMNTFMRALCR 360
Query: 388 RSWKEKDLLEDAHQVFEKMKDDFQVIDRSTYGLLIQALCSGNMISEALANLHHMIGKGYS 447
R+WKEKDLLEDAHQVFEKMK++ QVIDRSTYGLLIQALCSGN SEALANLHHMIGKGYS
Sbjct: 361 RTWKEKDLLEDAHQVFEKMKNELQVIDRSTYGLLIQALCSGNRTSEALANLHHMIGKGYS 420
Query: 448 PRAITIDVVVQSLCHTHSHTGSANEALCLLGHGIPFSTTSFDLIINELNKQGMRLSASNV 504
PRAITIDV+VQ+LC H+G A+EALC+LGHGI FS SFDL+I ELN++GM SA NV
Sbjct: 421 PRAITIDVMVQALC----HSGGASEALCVLGHGIRFSRISFDLVIEELNEEGMWFSACNV 480
BLAST of HG10020467 vs. NCBI nr
Match:
KAG7017186.1 (Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 764.6 bits (1973), Expect = 5.3e-217
Identity = 391/496 (78.83%), Positives = 421/496 (84.88%), Query Frame = 0
Query: 28 LRLPLRPSFLTTLPLSSTYTPFGANLFEENSRLSTNKQSHSIHTALRSFT---------- 87
+RL LRPSFLTTLPLSST TPFGAN EEN RLS NK+ HS+ SF
Sbjct: 1 MRLALRPSFLTTLPLSSTDTPFGANFIEENGRLSANKRPHSVCNGGLSFNHHVHASPFKL 60
Query: 88 ----PTRIT------RIKSLPIPSEEGTEIFIMSQKHTEIRNISEFNDLFMDFVSENELH 147
PTR+T IKSLPIPSEEGTEIF MSQK EI+NI EFNDLFM+FVSE+EL
Sbjct: 61 GIQIPTRMTIQNVADTIKSLPIPSEEGTEIFFMSQKSCEIQNICEFNDLFMEFVSEDELD 120
Query: 148 LALKLLSNISSYGLVPNSRTFSIMIRCYCKKGELDNAGRVLEQMLGRGHYPNDATVAFLV 207
LALKLLSN+SSYGLVPN RTFSIMIRCYCKKG+LDNA RVL QMLG G PNDAT+ LV
Sbjct: 121 LALKLLSNLSSYGLVPNCRTFSIMIRCYCKKGDLDNAARVLGQMLGSGCNPNDATITVLV 180
Query: 208 NAFCKRGKTQKALEMVEVVGRIGRKPTVKTYNCLLKGLCYVGRVEEACEMVTEMKKDSLI 267
NAFCKRGK QKA EMVE+VGR GRKPTV+ YNCLLKGLCYVGRVEEACEMVTEMKKDSLI
Sbjct: 181 NAFCKRGKMQKAFEMVELVGRNGRKPTVQAYNCLLKGLCYVGRVEEACEMVTEMKKDSLI 240
Query: 268 PDIYTYTALMDGLCKVGRSDEAMELLNEAEENGLKPSVVTFNTLFNGYCKEGRPVDGIYV 327
PDIYTYTALMDGLCKVGRSDEAMELL+EAE+NG+KPSVVTFNTLFNGYCKEGRP+DGI V
Sbjct: 241 PDIYTYTALMDGLCKVGRSDEAMELLSEAEQNGVKPSVVTFNTLFNGYCKEGRPLDGIRV 300
Query: 328 LNKMKQMNCMPDRITYTTLLHGLIKWGKIRIALRTYKEMVSSGHTIEAKMMNTFMRALSK 387
L KMKQMNC PDRI+Y+TLL GLIKWGKIR ALRTYKEMVSSGH+IE KMMNTFMRAL +
Sbjct: 301 LKKMKQMNCTPDRISYSTLLQGLIKWGKIRTALRTYKEMVSSGHSIEEKMMNTFMRALCR 360
Query: 388 RSWKEKDLLEDAHQVFEKMKDDFQVIDRSTYGLLIQALCSGNMISEALANLHHMIGKGYS 447
RSWKEKDLLEDAHQVFEKMK++ QVIDRSTYGLLIQALCSGN SEALANLHHMIGKGYS
Sbjct: 361 RSWKEKDLLEDAHQVFEKMKNELQVIDRSTYGLLIQALCSGNRTSEALANLHHMIGKGYS 420
Query: 448 PRAITIDVVVQSLCHTHSHTGSANEALCLLGHGIPFSTTSFDLIINELNKQGMRLSASNV 504
PRAITIDV+VQ+LC H+G A+EALC+LGHGI FS SFDLII ELN++GM SA NV
Sbjct: 421 PRAITIDVMVQALC----HSGGASEALCVLGHGIRFSRISFDLIIEELNEEGMWFSACNV 480
BLAST of HG10020467 vs. ExPASy Swiss-Prot
Match:
Q9SR00 (Pentatricopeptide repeat-containing protein At3g04760, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=At3g04760 PE=2 SV=1)
HSP 1 Score: 189.1 bits (479), Expect = 1.2e-46
Identity = 112/372 (30.11%), Positives = 183/372 (49.19%), Query Frame = 0
Query: 113 FNDLFMDFVSENELHLALKLLSNISSYGLVPNSRTFSIMIRCYCKKGELDNAGRVLEQML 172
+N + S +L LALK+L+ + S P T++I+I +G +D A +++++ML
Sbjct: 196 YNIMIGSLCSRGKLDLALKVLNQLLSDNCQPTVITYTILIEATMLEGGVDEALKLMDEML 255
Query: 173 GRGHYPNDATVAFLVNAFCKRGKTQKALEMVEVVGRIGRKPTVKTYNCLLKGLCYVGRVE 232
RG P+ T ++ CK G +A EMV + G +P V +YN LL+ L G+ E
Sbjct: 256 SRGLKPDMFTYNTIIRGMCKEGMVDRAFEMVRNLELKGCEPDVISYNILLRALLNQGKWE 315
Query: 233 EACEMVTEMKKDSLIPDIYTYTALMDGLCKVGRSDEAMELLNEAEENGLKPSVVTFNTLF 292
E +++T+M + P++ TY+ L+ LC+ G+ +EAM LL +E GL P +++ L
Sbjct: 316 EGEKLMTKMFSEKCDPNVVTYSILITTLCRDGKIEEAMNLLKLMKEKGLTPDAYSYDPLI 375
Query: 293 NGYCKEGRPVDGIYVLNKMKQMNCMPDRITYTTLLHGLIKWGKIRIALRTYKEMVSSGHT 352
+C+EGR I L M C+PD + Y T+L L K GK AL + ++ G +
Sbjct: 376 AAFCREGRLDVAIEFLETMISDGCLPDIVNYNTVLATLCKNGKADQALEIFGKLGEVGCS 435
Query: 353 IEAKMMNTFMRALSKRSWKEKDLLEDAHQVFEKMKDDFQVIDRSTYGLLIQALCSGNMIS 412
+ NT AL W D + H + E M + D TY +I LC M+
Sbjct: 436 PNSSSYNTMFSAL----WSSGDKIRALHMILEMMSNGIDP-DEITYNSMISCLCREGMVD 495
Query: 413 EALANLHHMIGKGYSPRAITIDVVVQSLCHTHSHTGSANEALCLLGHGIPFSTTSFDLII 472
EA L M + P +T ++V+ C H + N ++G+G + T++ ++I
Sbjct: 496 EAFELLVDMRSCEFHPSVVTYNIVLLGFCKAHRIEDAINVLESMVGNGCRPNETTYTVLI 555
Query: 473 NELNKQGMRLSA 485
+ G R A
Sbjct: 556 EGIGFAGYRAEA 562
BLAST of HG10020467 vs. ExPASy Swiss-Prot
Match:
Q9FMF6 (Pentatricopeptide repeat-containing protein At5g64320, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At5g64320 PE=2 SV=1)
HSP 1 Score: 186.8 bits (473), Expect = 6.0e-46
Identity = 115/428 (26.87%), Positives = 201/428 (46.96%), Query Frame = 0
Query: 103 KHTEIRNISEFNDLFMDFVSENELHLALKLLSNISSYGLVPNSRTFSIMIRCYCKKGELD 162
KH + N + L N ++ AL+LL + G VP++ TF+ +I CK ++
Sbjct: 245 KHGCVPNSVIYQTLIHSLSKCNRVNEALQLLEEMFLMGCVPDAETFNDVILGLCKFDRIN 304
Query: 163 NAGRVLEQMLGRGHYPNDATVAFLVNAFCKRGKTQKALEMV------------------- 222
A +++ +ML RG P+D T +L+N CK G+ A ++
Sbjct: 305 EAAKMVNRMLIRGFAPDDITYGYLMNGLCKIGRVDAAKDLFYRIPKPEIVIFNTLIHGFV 364
Query: 223 -------------EVVGRIGRKPTVKTYNCLLKGLCYVGRVEEACEMVTEMKKDSLIPDI 282
++V G P V TYN L+ G G V A E++ +M+ P++
Sbjct: 365 THGRLDDAKAVLSDMVTSYGIVPDVCTYNSLIYGYWKEGLVGLALEVLHDMRNKGCKPNV 424
Query: 283 YTYTALMDGLCKVGRSDEAMELLNEAEENGLKPSVVTFNTLFNGYCKEGRPVDGIYVLNK 342
Y+YT L+DG CK+G+ DEA +LNE +GLKP+ V FN L + +CKE R + + + +
Sbjct: 425 YSYTILVDGFCKLGKIDEAYNVLNEMSADGLKPNTVGFNCLISAFCKEHRIPEAVEIFRE 484
Query: 343 MKQMNCMPDRITYTTLLHGLIKWGKIRIALRTYKEMVSSGHTIEAKMMNTFMRALSKRSW 402
M + C PD T+ +L+ GL + +I+ AL ++M+S G NT + A +R
Sbjct: 485 MPRKGCKPDVYTFNSLISGLCEVDEIKHALWLLRDMISEGVVANTVTYNTLINAFLRRG- 544
Query: 403 KEKDLLEDAHQVFEKMKDDFQVIDRSTYGLLIQALCSGNMISEALANLHHMIGKGYSPRA 462
+++A ++ +M +D TY LI+ LC + +A + M+ G++P
Sbjct: 545 ----EIKEARKLVNEMVFQGSPLDEITYNSLIKGLCRAGEVDKARSLFEKMLRDGHAPSN 604
Query: 463 ITIDVVVQSLCHTHSHTGSANEALCLLGHGIPFSTTSFDLIINELNKQGMRLSASNVYGL 499
I+ ++++ LC + + ++ G +F+ +IN L + G ++
Sbjct: 605 ISCNILINGLCRSGMVEEAVEFQKEMVLRGSTPDIVTFNSLINGLCRAGRIEDGLTMFRK 664
BLAST of HG10020467 vs. ExPASy Swiss-Prot
Match:
Q9FIX3 (Pentatricopeptide repeat-containing protein At5g39710 OS=Arabidopsis thaliana OX=3702 GN=EMB2745 PE=2 SV=1)
HSP 1 Score: 182.6 bits (462), Expect = 1.1e-44
Identity = 113/367 (30.79%), Positives = 182/367 (49.59%), Query Frame = 0
Query: 109 NISEFNDLFMDFVSENELHLALKLLSNISSYGLVPNSRTFSIMIRCYCKKGELDNAGRVL 168
N+ +N L + ++ KLL +++ GL PN +++++I C++G + VL
Sbjct: 239 NVVTYNTLIDGYCKLRKIDDGFKLLRSMALKGLEPNLISYNVVINGLCREGRMKEVSFVL 298
Query: 169 EQMLGRGHYPNDATVAFLVNAFCKRGKTQKALEMVEVVGRIGRKPTVKTYNCLLKGLCYV 228
+M RG+ ++ T L+ +CK G +AL M + R G P+V TY L+ +C
Sbjct: 299 TEMNRRGYSLDEVTYNTLIKGYCKEGNFHQALVMHAEMLRHGLTPSVITYTSLIHSMCKA 358
Query: 229 GRVEEACEMVTEMKKDSLIPDIYTYTALMDGLCKVGRSDEAMELLNEAEENGLKPSVVTF 288
G + A E + +M+ L P+ TYT L+DG + G +EA +L E +NG PSVVT+
Sbjct: 359 GNMNRAMEFLDQMRVRGLCPNERTYTTLVDGFSQKGYMNEAYRVLREMNDNGFSPSVVTY 418
Query: 289 NTLFNGYCKEGRPVDGIYVLNKMKQMNCMPDRITYTTLLHGLIKWGKIRIALRTYKEMVS 348
N L NG+C G+ D I VL MK+ PD ++Y+T+L G + + ALR +EMV
Sbjct: 419 NALINGHCVTGKMEDAIAVLEDMKEKGLSPDVVSYSTVLSGFCRSYDVDEALRVKREMVE 478
Query: 349 SGHTIEAKMMNTFMRALSKRSWKEKDLLEDAHQVFEKMKDDFQVIDRSTYGLLIQALCSG 408
G + ++ ++ E+ ++A ++E+M D TY LI A C
Sbjct: 479 KGIKPDTITYSSLIQGFC-----EQRRTKEACDLYEEMLRVGLPPDEFTYTALINAYCME 538
Query: 409 NMISEALANLHHMIGKGYSPRAITIDVVVQSLCHTHSHTGSANEALCLLGH--GIPFSTT 468
+ +AL + M+ KG P +T V++ L + S T A L L + +P T
Sbjct: 539 GDLEKALQLHNEMVEKGVLPDVVTYSVLINGL-NKQSRTREAKRLLLKLFYEESVPSDVT 598
Query: 469 SFDLIIN 474
LI N
Sbjct: 599 YHTLIEN 599
BLAST of HG10020467 vs. ExPASy Swiss-Prot
Match:
Q9LFF1 (Pentatricopeptide repeat-containing protein At3g53700, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=MEE40 PE=2 SV=1)
HSP 1 Score: 181.4 bits (459), Expect = 2.5e-44
Identity = 108/391 (27.62%), Positives = 195/391 (49.87%), Query Frame = 0
Query: 109 NISEFNDLFMDFVSENELHLALKLLSNISSYGLVPNSRTFSIMIRCYCKKGELDNAGRVL 168
++S FN L ++L A+ +L ++ SYGLVP+ +TF+ +++ Y ++G+LD A R+
Sbjct: 188 DVSTFNVLIKALCRAHQLRPAILMLEDMPSYGLVPDEKTFTTVMQGYIEEGDLDGALRIR 247
Query: 169 EQMLGRGHYPNDATVAFLVNAFCKRGKTQKALEMV-EVVGRIGRKPTVKTYNCLLKGLCY 228
EQM+ G ++ +V +V+ FCK G+ + AL + E+ + G P T+N L+ GLC
Sbjct: 248 EQMVEFGCSWSNVSVNVIVHGFCKEGRVEDALNFIQEMSNQDGFFPDQYTFNTLVNGLCK 307
Query: 229 VGRVEEACEMVTEMKKDSLIPDIYTYTALMDGLCKVGRSDEAMELLNEAEENGLKPSVVT 288
G V+ A E++ M ++ PD+YTY +++ GLCK+G EA+E+L++ P+ VT
Sbjct: 308 AGHVKHAIEIMDVMLQEGYDPDVYTYNSVISGLCKLGEVKEAVEVLDQMITRDCSPNTVT 367
Query: 289 FNTLFNGYCKEGRPVDGIYVLNKMKQMNCMPDRITYTTLLHGLIKWGKIRIALRTYKEMV 348
+NTL + CKE + + + + +PD T+ +L+ GL R+A+ ++EM
Sbjct: 368 YNTLISTLCKENQVEEATELARVLTSKGILPDVCTFNSLIQGLCLTRNHRVAMELFEEMR 427
Query: 349 SSGHTIEAKMMNTFMRALSKRSWKEKDLLEDAHQVFEKMKDDFQVIDRSTYGLLIQALCS 408
S G + N + +L K L++A + ++M+ TY LI C
Sbjct: 428 SKGCEPDEFTYNMLIDSLC-----SKGKLDEALNMLKQMELSGCARSVITYNTLIDGFCK 487
Query: 409 GNMISEALANLHHMIGKGYSPRAITIDVVVQSLCHTHSHTGSANEALCLLGHGIPFSTTS 468
N EA M G S ++T + ++ LC + +A ++ G +
Sbjct: 488 ANKTREAEEIFDEMEVHGVSRNSVTYNTLIDGLCKSRRVEDAAQLMDQMIMEGQKPDKYT 547
Query: 469 FDLIINELNKQGMRLSASNVYGLALKRGVNP 499
++ ++ + G A+++ G P
Sbjct: 548 YNSLLTHFCRGGDIKKAADIVQAMTSNGCEP 573
BLAST of HG10020467 vs. ExPASy Swiss-Prot
Match:
Q3EDF8 (Pentatricopeptide repeat-containing protein At1g09900 OS=Arabidopsis thaliana OX=3702 GN=At1g09900 PE=2 SV=1)
HSP 1 Score: 176.4 bits (446), Expect = 8.1e-43
Identity = 106/368 (28.80%), Positives = 174/368 (47.28%), Query Frame = 0
Query: 112 EFNDLFMDFVSENELHLALKLLSNISSYGLVPNSRTFSIMIRCYCKKGELDNAGRVLEQM 171
E N+ V EL K L N+ +G VP+ + +IR +C+ G+ A ++LE +
Sbjct: 104 ESNNHLRQMVRTGELEEGFKFLENMVYHGNVPDIIPCTTLIRGFCRLGKTRKAAKILEIL 163
Query: 172 LGRGHYPNDATVAFLVNAFCKRGKTQKALEMVEVVGRIGRKPTVKTYNCLLKGLCYVGRV 231
G G P+ T +++ +CK G+ AL V+ R+ P V TYN +L+ LC G++
Sbjct: 164 EGSGAVPDVITYNVMISGYCKAGEINNAL---SVLDRMSVSPDVVTYNTILRSLCDSGKL 223
Query: 232 EEACEMVTEMKKDSLIPDIYTYTALMDGLCKVGRSDEAMELLNEAEENGLKPSVVTFNTL 291
++A E++ M + PD+ TYT L++ C+ AM+LL+E + G P VVT+N L
Sbjct: 224 KQAMEVLDRMLQRDCYPDVITYTILIEATCRDSGVGHAMKLLDEMRDRGCTPDVVTYNVL 283
Query: 292 FNGYCKEGRPVDGIYVLNKMKQMNCMPDRITYTTLLHGLIKWGKIRIALRTYKEMVSSGH 351
NG CKEGR + I LN M C P+ IT+ +L + G+ A + +M+ G
Sbjct: 284 VNGICKEGRLDEAIKFLNDMPSSGCQPNVITHNIILRSMCSTGRWMDAEKLLADMLRKGF 343
Query: 352 TIEAKMMNTFMRALSKRSWKEKDLLEDAHQVFEKMKDDFQVIDRSTYGLLIQALCSGNMI 411
+ N + L + K LL A + EKM + +Y L+ C +
Sbjct: 344 SPSVVTFNILINFLCR-----KGLLGRAIDILEKMPQHGCQPNSLSYNPLLHGFCKEKKM 403
Query: 412 SEALANLHHMIGKGYSPRAITIDVVVQSLCHTHSHTGSANEALCLLGHGIPFSTTSFDLI 471
A+ L M+ +G P +T + ++ +LC + L G +++ +
Sbjct: 404 DRAIEYLERMVSRGCYPDIVTYNTMLTALCKDGKVEDAVEILNQLSSKGCSPVLITYNTV 463
Query: 472 INELNKQG 480
I+ L K G
Sbjct: 464 IDGLAKAG 463
BLAST of HG10020467 vs. ExPASy TrEMBL
Match:
A0A6J1J506 (pentatricopeptide repeat-containing protein At5g64320, mitochondrial-like OS=Cucurbita maxima OX=3661 GN=LOC111481420 PE=4 SV=1)
HSP 1 Score: 768.8 bits (1984), Expect = 1.4e-218
Identity = 394/496 (79.44%), Positives = 426/496 (85.89%), Query Frame = 0
Query: 28 LRLPLRPSFLTTLPLSSTYTPFGANLFEENSRLSTNKQSHSIHTALRSFT---------- 87
+RL LRPSFLTTLPLSST TPFGAN E N R S NK+ HS+ SF
Sbjct: 1 MRLALRPSFLTTLPLSSTDTPFGANFIEANDRRSANKRPHSVCNGGFSFNHHVHASPFKL 60
Query: 88 ----PTRIT------RIKSLPIPSEEGTEIFIMSQKHTEIRNISEFNDLFMDFVSENELH 147
PTR+T RIKSLPIPSEEGTEIFIMSQK EI+NI EFNDLFM+FVSE+EL
Sbjct: 61 GIQIPTRMTIQNVADRIKSLPIPSEEGTEIFIMSQKSCEIQNICEFNDLFMEFVSEDELD 120
Query: 148 LALKLLSNISSYGLVPNSRTFSIMIRCYCKKGELDNAGRVLEQMLGRGHYPNDATVAFLV 207
LALKLLSN++SYGLVPNSRTFSIMIRCYCKKG+LDNA RVL QMLGRG PNDAT+ LV
Sbjct: 121 LALKLLSNLTSYGLVPNSRTFSIMIRCYCKKGDLDNAARVLGQMLGRGCNPNDATITVLV 180
Query: 208 NAFCKRGKTQKALEMVEVVGRIGRKPTVKTYNCLLKGLCYVGRVEEACEMVTEMKKDSLI 267
NAFCKRGK QKALEMVE+VGR GRKPTV+ YNCLLKGLCYVGRVEEACEMVTEMKKDSLI
Sbjct: 181 NAFCKRGKMQKALEMVELVGRNGRKPTVQAYNCLLKGLCYVGRVEEACEMVTEMKKDSLI 240
Query: 268 PDIYTYTALMDGLCKVGRSDEAMELLNEAEENGLKPSVVTFNTLFNGYCKEGRPVDGIYV 327
PDIYTYTALMDGLCKVGRSDEAMELL+EAE NG+KPSVVTFNTLFNGYCKEGRP+DGI V
Sbjct: 241 PDIYTYTALMDGLCKVGRSDEAMELLSEAEGNGVKPSVVTFNTLFNGYCKEGRPLDGIRV 300
Query: 328 LNKMKQMNCMPDRITYTTLLHGLIKWGKIRIALRTYKEMVSSGHTIEAKMMNTFMRALSK 387
L KMKQMNC PDRI+Y+TLLHGLIKWGKIR ALRTYKEMVSSGH+IE KMMNTFMRAL +
Sbjct: 301 LKKMKQMNCTPDRISYSTLLHGLIKWGKIRTALRTYKEMVSSGHSIEEKMMNTFMRALCR 360
Query: 388 RSWKEKDLLEDAHQVFEKMKDDFQVIDRSTYGLLIQALCSGNMISEALANLHHMIGKGYS 447
R+WKEKDLLEDAHQVFEKMK++FQVIDRSTYGLLIQALCSGN SEALANLHHMIGKGYS
Sbjct: 361 RTWKEKDLLEDAHQVFEKMKNEFQVIDRSTYGLLIQALCSGNRTSEALANLHHMIGKGYS 420
Query: 448 PRAITIDVVVQSLCHTHSHTGSANEALCLLGHGIPFSTTSFDLIINELNKQGMRLSASNV 504
P AITIDV+VQ+LC H+GSA+EALC+LGHGI FS SFDLII ELN++GM LSA +V
Sbjct: 421 PWAITIDVMVQALC----HSGSASEALCVLGHGIRFSRISFDLIIEELNEEGMWLSACSV 480
BLAST of HG10020467 vs. ExPASy TrEMBL
Match:
A0A6J1F9P1 (pentatricopeptide repeat-containing protein At1g09900-like OS=Cucurbita moschata OX=3662 GN=LOC111442091 PE=4 SV=1)
HSP 1 Score: 766.1 bits (1977), Expect = 8.8e-218
Identity = 390/496 (78.63%), Positives = 424/496 (85.48%), Query Frame = 0
Query: 28 LRLPLRPSFLTTLPLSSTYTPFGANLFEENSRLSTNKQSHSIHTALRSFT---------- 87
+R+ LRPSFLTTLPLSST TPFGAN EEN RLS NK+ HS+ SF
Sbjct: 1 MRIALRPSFLTTLPLSSTDTPFGANFIEENGRLSANKRPHSVCNGGLSFNHHVHGSPFKL 60
Query: 88 ----PTRIT------RIKSLPIPSEEGTEIFIMSQKHTEIRNISEFNDLFMDFVSENELH 147
PTR+T IKSLPIPSEEGTEIFIMSQK EI+NI EFNDLFM+FVSE+EL
Sbjct: 61 GIQIPTRMTIQNVADTIKSLPIPSEEGTEIFIMSQKSCEIQNICEFNDLFMEFVSEDELD 120
Query: 148 LALKLLSNISSYGLVPNSRTFSIMIRCYCKKGELDNAGRVLEQMLGRGHYPNDATVAFLV 207
LALKLLSN+SSYGLVPN RTFSIMIRCYCKKG+L+NA RVL QMLGRG PNDAT+ LV
Sbjct: 121 LALKLLSNLSSYGLVPNCRTFSIMIRCYCKKGDLENAARVLGQMLGRGCNPNDATITVLV 180
Query: 208 NAFCKRGKTQKALEMVEVVGRIGRKPTVKTYNCLLKGLCYVGRVEEACEMVTEMKKDSLI 267
NAFCKRGK QKALEMVE+VGR GRKPTV+ YNCLLKGLCYVGRVEEACEMVTEMKKDSLI
Sbjct: 181 NAFCKRGKMQKALEMVELVGRNGRKPTVQAYNCLLKGLCYVGRVEEACEMVTEMKKDSLI 240
Query: 268 PDIYTYTALMDGLCKVGRSDEAMELLNEAEENGLKPSVVTFNTLFNGYCKEGRPVDGIYV 327
PDIYTYTALMDGLCKVGRSDEAMELL+EAE+NG+KPSVVTFNTLFNGYCKEGRP+DGI V
Sbjct: 241 PDIYTYTALMDGLCKVGRSDEAMELLSEAEQNGVKPSVVTFNTLFNGYCKEGRPLDGIRV 300
Query: 328 LNKMKQMNCMPDRITYTTLLHGLIKWGKIRIALRTYKEMVSSGHTIEAKMMNTFMRALSK 387
L KMKQMNC PDRI+Y+TLL GLIKWGKIR ALRTYKEMVSSGH+IE KMMNTFMRAL +
Sbjct: 301 LKKMKQMNCTPDRISYSTLLQGLIKWGKIRTALRTYKEMVSSGHSIEEKMMNTFMRALCR 360
Query: 388 RSWKEKDLLEDAHQVFEKMKDDFQVIDRSTYGLLIQALCSGNMISEALANLHHMIGKGYS 447
R+WKEKDLLEDAHQVFEKMK++ QVIDRSTYGLLIQALCSGN SEALANLHHMIGKGYS
Sbjct: 361 RTWKEKDLLEDAHQVFEKMKNELQVIDRSTYGLLIQALCSGNRTSEALANLHHMIGKGYS 420
Query: 448 PRAITIDVVVQSLCHTHSHTGSANEALCLLGHGIPFSTTSFDLIINELNKQGMRLSASNV 504
PRAITIDV+VQ+LC H+G A+EALC+LGHGI FS SFDL+I ELN++GM SA NV
Sbjct: 421 PRAITIDVMVQALC----HSGGASEALCVLGHGIRFSRISFDLVIEELNEEGMWFSACNV 480
BLAST of HG10020467 vs. ExPASy TrEMBL
Match:
A0A6J1DW66 (pentatricopeptide repeat-containing protein At1g09900-like OS=Momordica charantia OX=3673 GN=LOC111024057 PE=4 SV=1)
HSP 1 Score: 693.0 bits (1787), Expect = 9.5e-196
Identity = 363/494 (73.48%), Positives = 397/494 (80.36%), Query Frame = 0
Query: 40 LPLSSTYTPFGANLFEENSRLSTNKQSHS---------------IHTALRSFTPTRIT-- 99
LPLSST N EEN RLS NKQSHS + L P RIT
Sbjct: 9 LPLSST----SVNFIEENGRLSPNKQSHSNRNRGPGFGGDNVYALPFKLEIENPRRITVK 68
Query: 100 ----RIKSLPIPSEEGTEIFIMSQKHTEIRNISEFNDLFMDFVSENELHLALKLLSNISS 159
RI SLP PS+EGTE+FI SQK EI+NISEFNDLF DFVS EL LAL+LLSNISS
Sbjct: 69 NEAGRIGSLPTPSKEGTEMFITSQKDCEIQNISEFNDLFADFVSAEELDLALRLLSNISS 128
Query: 160 YGLVPNSRTFSIMIRCYCKKGELDNAGRVLEQMLGRGHYPNDATVAFLVNAFCKRGKTQK 219
YGLVPNSRTFSI IRCYCKKG+LDNA RV +QMLG G PNDATV LVNA C+RGK ++
Sbjct: 129 YGLVPNSRTFSIAIRCYCKKGDLDNAKRVFDQMLGSGCNPNDATVTVLVNALCRRGKIKR 188
Query: 220 ALEMVEVVGRIGRKPTVKTYNCLLKGLCYVGRVEEACEMVTEMKKDSLIPDIYTYTALMD 279
ALEMVE+VGRIGRK TV+TYNCLLKGLCYVGRVEEACEMV +MKKD L+PDIYTYTALMD
Sbjct: 189 ALEMVELVGRIGRKQTVRTYNCLLKGLCYVGRVEEACEMVAKMKKDGLVPDIYTYTALMD 248
Query: 280 GLCKVGRSDEAMELLNEAEENGLKPSVVTFNTLFNGYCKEGRPVDGIYVLNKMKQMNCMP 339
GLCKVGRSDEAMELLNEAEENGL+PSVVTFNTLFNGYCKEGRP+DGI+VL KMKQMNCMP
Sbjct: 249 GLCKVGRSDEAMELLNEAEENGLEPSVVTFNTLFNGYCKEGRPLDGIHVLKKMKQMNCMP 308
Query: 340 DRITYTTLLHGLIKWGKIRIALRTYKEMVSSGHTIEAKMMNTFMRALSKRSWKEKDLLED 399
DRI+YTTLLHGLIKWGKIR ALRTYKEMVSSGH++E KMMNTFMRAL +RSWKEKDLLED
Sbjct: 309 DRISYTTLLHGLIKWGKIRTALRTYKEMVSSGHSVEEKMMNTFMRALCRRSWKEKDLLED 368
Query: 400 AHQVFEKMKDDFQVIDRSTYGLLIQALCSGNMISEALANLHHMIGKGYSPRAITIDVVVQ 459
AHQVFEKMK++FQVI RSTYG++I ALCSGN ISEA+ANLHHMI KGYSPRAITI+VVV+
Sbjct: 369 AHQVFEKMKNEFQVIHRSTYGVVIPALCSGNKISEAVANLHHMIRKGYSPRAITINVVVE 428
Query: 460 SLCHTHSHTGSANEALCLLG---------HGIPFSTTSFDLIINELNKQGMRLSASNVYG 504
+LC GS NEAL ++G H IPFS S+DLII+ELNKQGM A VYG
Sbjct: 429 ALC----RRGSTNEALGVVGLGLVGDGHHHVIPFSRVSYDLIIDELNKQGMWFDACKVYG 488
BLAST of HG10020467 vs. ExPASy TrEMBL
Match:
A0A2I4EIR4 (pentatricopeptide repeat-containing protein At1g13040, mitochondrial-like OS=Juglans regia OX=51240 GN=LOC108989956 PE=4 SV=1)
HSP 1 Score: 488.0 bits (1255), Expect = 4.6e-134
Identity = 259/455 (56.92%), Positives = 328/455 (72.09%), Query Frame = 0
Query: 56 ENSRLSTNKQSHSIHTALRSF-TPTRITRIKSLPIPSEEGTEIFIMSQKHTEIRNISEFN 115
++S L ++H + SF P + I SL P+EE TE K TE R IS FN
Sbjct: 78 KHSHLIYVNRNHLVDPRTSSFQVPDFVGTIGSL--PTEERTEFLTAYIKDTEFRTISNFN 137
Query: 116 DLFMDFVSENELHLALKLLSNISSYGLV-PNSRTFSIMIRCYCKKGELDNAGRVLEQML- 175
DL M + E LALKL S++SSY + P+S TFSI+ RCYCKK LD A RVL+ M+
Sbjct: 138 DLLMALLIAEEPDLALKLFSDLSSYASIEPDSWTFSIVSRCYCKKNHLDEAQRVLDHMVE 197
Query: 176 GRGHYPNDATVAFLVNAFCKRGKTQKALEMVEVVGRIGRKPTVKTYNCLLKGLCYVGRVE 235
RG +P+ AT+ L+N FCKRGK Q+A E++ +GRIG +PTV+TYNCLLKG+CYVGRVE
Sbjct: 198 ERGFHPDVATITMLINGFCKRGKLQRAFELLHFMGRIGCEPTVRTYNCLLKGMCYVGRVE 257
Query: 236 EACEMVTEMKKDSLIPDIYTYTALMDGLCKVGRSDEAMELLNEAEENGLKPSVVTFNTLF 295
EA EM+ ++ K+S+ PDIYTYTA+MDG CKVGRSDEAMELL+EAEE GL P+VV++NTLF
Sbjct: 258 EAFEMLIKI-KESMKPDIYTYTAVMDGFCKVGRSDEAMELLDEAEEMGLTPNVVSYNTLF 317
Query: 296 NGYCKEGRPVDGIYVLNKMKQMNCMPDRITYTTLLHGLIKWGKIRIALRTYKEMVSSGHT 355
GYCKEGRP++GI VL +MKQ NCMPD I Y+TLLHGL+KWGKIR ALRTYKEMV G
Sbjct: 318 QGYCKEGRPLEGIGVLKQMKQRNCMPDYICYSTLLHGLLKWGKIRSALRTYKEMVGDGFE 377
Query: 356 IEAKMMNTFMRALSKRSWKEKDLLEDAHQVFEKMKDDFQVIDRSTYGLLIQALCSGNMIS 415
++ +MMNTF+R L +RSWKEKD+LEDA+QVFEKMK F VID STY L+IQALC+G I
Sbjct: 378 VDERMMNTFLRRLCRRSWKEKDMLEDAYQVFEKMKKRFYVIDLSTYSLMIQALCTGKKIE 437
Query: 416 EALANLHHMIGKGYSPRAITIDVVVQSLCHTHSHTGSANEALCLL-----GHGIPFSTTS 475
+AL NLH MI G+SP IT + ++++LC G +EAL +L G +P + S
Sbjct: 438 QALDNLHDMIRMGHSPPIITFNSIIRALC----AGGRVDEALLVLALMDEGRRMP-NRIS 497
Query: 476 FDLIINELNKQGMRLSASNVYGLALKRGVNPTKTP 503
++L+I ELN++ L A N+YG+ALKRGV P + P
Sbjct: 498 YNLLIEELNRRKWLLGACNIYGMALKRGVIPNRKP 524
BLAST of HG10020467 vs. ExPASy TrEMBL
Match:
A0A5N6RKM0 (Uncharacterized protein OS=Carpinus fangiana OX=176857 GN=FH972_017034 PE=4 SV=1)
HSP 1 Score: 480.3 bits (1235), Expect = 9.7e-132
Identity = 259/478 (54.18%), Positives = 330/478 (69.04%), Query Frame = 0
Query: 32 LRPSFLTTLPLSSTYTPFGANLFEENSRLSTNKQSHSIHTALRSFTPTRITRIKSLPIPS 91
L P +LPL+ + ++ +S +T + H ++ F T I+ L P+
Sbjct: 2 LHPPHSFSLPLNPLH-----SIAISSSATATEAATIREHDMVKYFIDT----IRGL--PT 61
Query: 92 EEGTEIFIMSQKHTEIRNISEFNDLFMDFVSENELHLALKLLSNISSYGLVPNSRTFSIM 151
+ TE K E R +S+FNDL M E LAL+L S++SSYG P+S T SI+
Sbjct: 62 KGRTEFLSTFFKDREFRTVSDFNDLVMALFIAEEPDLALELFSDMSSYGFEPDSWTLSIV 121
Query: 152 IRCYCKKGELDNAGRVLEQML-GRGHYPNDATVAFLVNAFCKRGKTQKALEMVEVVGRIG 211
RCYCKK LD A RVL+QM+ +G +P+ ATV L+N FCKRG+ Q+A E+ + +GRIG
Sbjct: 122 SRCYCKKNHLDEAKRVLDQMVEEKGFHPDVATVTILINGFCKRGRLQRAFEVFDFMGRIG 181
Query: 212 RKPTVKTYNCLLKGLCYVGRVEEACEMVTEMKKDSLIPDIYTYTALMDGLCKVGRSDEAM 271
KPTV+TYNCLLKGLC+VGRVEEA EM+ ++KKDS+ PDIYTYTA+MDG CKVGRSDEA+
Sbjct: 182 CKPTVQTYNCLLKGLCFVGRVEEAFEMLIKIKKDSVKPDIYTYTAVMDGFCKVGRSDEAV 241
Query: 272 ELLNEAEENGLKPSVVTFNTLFNGYCKEGRPVDGIYVLNKMKQMNCMPDRITYTTLLHGL 331
ELL+EA E GL P+VVT+NTLFNGYCKEGRP+ GI VL +MKQ NCMPD ITY+TLLHGL
Sbjct: 242 ELLDEAVEMGLTPNVVTYNTLFNGYCKEGRPLKGIGVLKQMKQRNCMPDYITYSTLLHGL 301
Query: 332 IKWGKIRIALRTYKEMVSSGHTIEAKMMNTFMRALSKRSWKEKDLLEDAHQVFEKMKDDF 391
+KWGKIR ALRTYKEMV G + +MMNT +R L KRSWKEKDLLE+A+Q+FEKMK+ F
Sbjct: 302 LKWGKIRSALRTYKEMVGDGFEADERMMNTLLRGLCKRSWKEKDLLENAYQLFEKMKNGF 361
Query: 392 QVIDRSTYGLLIQALCSGNMISEALANLHHMIGKGYSPRAITIDVVVQSLCHTHSHTGSA 451
VID STYGL+++ALC G EAL NLH G+SP IT + V+++LC G
Sbjct: 362 YVIDLSTYGLMVKALCVGEKTEEALDNLHDFTRMGHSPHIITFNSVIRALC----AEGRV 421
Query: 452 NEALCLL-----GHGIPFSTTSFDLIINELNKQGMRLSASNVYGLALKRGVNPTKTPR 504
+EAL +L G IP + S++L+I E N++ L A N+YG ALKRGV P + PR
Sbjct: 422 DEALLVLILMDEGRRIP-NRISYNLLIEECNRRKWLLGACNIYGAALKRGVIPDRKPR 463
BLAST of HG10020467 vs. TAIR 10
Match:
AT3G04760.1 (Pentatricopeptide repeat (PPR-like) superfamily protein)
HSP 1 Score: 189.1 bits (479), Expect = 8.6e-48
Identity = 112/372 (30.11%), Positives = 183/372 (49.19%), Query Frame = 0
Query: 113 FNDLFMDFVSENELHLALKLLSNISSYGLVPNSRTFSIMIRCYCKKGELDNAGRVLEQML 172
+N + S +L LALK+L+ + S P T++I+I +G +D A +++++ML
Sbjct: 196 YNIMIGSLCSRGKLDLALKVLNQLLSDNCQPTVITYTILIEATMLEGGVDEALKLMDEML 255
Query: 173 GRGHYPNDATVAFLVNAFCKRGKTQKALEMVEVVGRIGRKPTVKTYNCLLKGLCYVGRVE 232
RG P+ T ++ CK G +A EMV + G +P V +YN LL+ L G+ E
Sbjct: 256 SRGLKPDMFTYNTIIRGMCKEGMVDRAFEMVRNLELKGCEPDVISYNILLRALLNQGKWE 315
Query: 233 EACEMVTEMKKDSLIPDIYTYTALMDGLCKVGRSDEAMELLNEAEENGLKPSVVTFNTLF 292
E +++T+M + P++ TY+ L+ LC+ G+ +EAM LL +E GL P +++ L
Sbjct: 316 EGEKLMTKMFSEKCDPNVVTYSILITTLCRDGKIEEAMNLLKLMKEKGLTPDAYSYDPLI 375
Query: 293 NGYCKEGRPVDGIYVLNKMKQMNCMPDRITYTTLLHGLIKWGKIRIALRTYKEMVSSGHT 352
+C+EGR I L M C+PD + Y T+L L K GK AL + ++ G +
Sbjct: 376 AAFCREGRLDVAIEFLETMISDGCLPDIVNYNTVLATLCKNGKADQALEIFGKLGEVGCS 435
Query: 353 IEAKMMNTFMRALSKRSWKEKDLLEDAHQVFEKMKDDFQVIDRSTYGLLIQALCSGNMIS 412
+ NT AL W D + H + E M + D TY +I LC M+
Sbjct: 436 PNSSSYNTMFSAL----WSSGDKIRALHMILEMMSNGIDP-DEITYNSMISCLCREGMVD 495
Query: 413 EALANLHHMIGKGYSPRAITIDVVVQSLCHTHSHTGSANEALCLLGHGIPFSTTSFDLII 472
EA L M + P +T ++V+ C H + N ++G+G + T++ ++I
Sbjct: 496 EAFELLVDMRSCEFHPSVVTYNIVLLGFCKAHRIEDAINVLESMVGNGCRPNETTYTVLI 555
Query: 473 NELNKQGMRLSA 485
+ G R A
Sbjct: 556 EGIGFAGYRAEA 562
BLAST of HG10020467 vs. TAIR 10
Match:
AT5G64320.1 (Pentatricopeptide repeat (PPR) superfamily protein)
HSP 1 Score: 186.8 bits (473), Expect = 4.2e-47
Identity = 115/428 (26.87%), Positives = 201/428 (46.96%), Query Frame = 0
Query: 103 KHTEIRNISEFNDLFMDFVSENELHLALKLLSNISSYGLVPNSRTFSIMIRCYCKKGELD 162
KH + N + L N ++ AL+LL + G VP++ TF+ +I CK ++
Sbjct: 245 KHGCVPNSVIYQTLIHSLSKCNRVNEALQLLEEMFLMGCVPDAETFNDVILGLCKFDRIN 304
Query: 163 NAGRVLEQMLGRGHYPNDATVAFLVNAFCKRGKTQKALEMV------------------- 222
A +++ +ML RG P+D T +L+N CK G+ A ++
Sbjct: 305 EAAKMVNRMLIRGFAPDDITYGYLMNGLCKIGRVDAAKDLFYRIPKPEIVIFNTLIHGFV 364
Query: 223 -------------EVVGRIGRKPTVKTYNCLLKGLCYVGRVEEACEMVTEMKKDSLIPDI 282
++V G P V TYN L+ G G V A E++ +M+ P++
Sbjct: 365 THGRLDDAKAVLSDMVTSYGIVPDVCTYNSLIYGYWKEGLVGLALEVLHDMRNKGCKPNV 424
Query: 283 YTYTALMDGLCKVGRSDEAMELLNEAEENGLKPSVVTFNTLFNGYCKEGRPVDGIYVLNK 342
Y+YT L+DG CK+G+ DEA +LNE +GLKP+ V FN L + +CKE R + + + +
Sbjct: 425 YSYTILVDGFCKLGKIDEAYNVLNEMSADGLKPNTVGFNCLISAFCKEHRIPEAVEIFRE 484
Query: 343 MKQMNCMPDRITYTTLLHGLIKWGKIRIALRTYKEMVSSGHTIEAKMMNTFMRALSKRSW 402
M + C PD T+ +L+ GL + +I+ AL ++M+S G NT + A +R
Sbjct: 485 MPRKGCKPDVYTFNSLISGLCEVDEIKHALWLLRDMISEGVVANTVTYNTLINAFLRRG- 544
Query: 403 KEKDLLEDAHQVFEKMKDDFQVIDRSTYGLLIQALCSGNMISEALANLHHMIGKGYSPRA 462
+++A ++ +M +D TY LI+ LC + +A + M+ G++P
Sbjct: 545 ----EIKEARKLVNEMVFQGSPLDEITYNSLIKGLCRAGEVDKARSLFEKMLRDGHAPSN 604
Query: 463 ITIDVVVQSLCHTHSHTGSANEALCLLGHGIPFSTTSFDLIINELNKQGMRLSASNVYGL 499
I+ ++++ LC + + ++ G +F+ +IN L + G ++
Sbjct: 605 ISCNILINGLCRSGMVEEAVEFQKEMVLRGSTPDIVTFNSLINGLCRAGRIEDGLTMFRK 664
BLAST of HG10020467 vs. TAIR 10
Match:
AT5G39710.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)
HSP 1 Score: 182.6 bits (462), Expect = 8.0e-46
Identity = 113/367 (30.79%), Positives = 182/367 (49.59%), Query Frame = 0
Query: 109 NISEFNDLFMDFVSENELHLALKLLSNISSYGLVPNSRTFSIMIRCYCKKGELDNAGRVL 168
N+ +N L + ++ KLL +++ GL PN +++++I C++G + VL
Sbjct: 239 NVVTYNTLIDGYCKLRKIDDGFKLLRSMALKGLEPNLISYNVVINGLCREGRMKEVSFVL 298
Query: 169 EQMLGRGHYPNDATVAFLVNAFCKRGKTQKALEMVEVVGRIGRKPTVKTYNCLLKGLCYV 228
+M RG+ ++ T L+ +CK G +AL M + R G P+V TY L+ +C
Sbjct: 299 TEMNRRGYSLDEVTYNTLIKGYCKEGNFHQALVMHAEMLRHGLTPSVITYTSLIHSMCKA 358
Query: 229 GRVEEACEMVTEMKKDSLIPDIYTYTALMDGLCKVGRSDEAMELLNEAEENGLKPSVVTF 288
G + A E + +M+ L P+ TYT L+DG + G +EA +L E +NG PSVVT+
Sbjct: 359 GNMNRAMEFLDQMRVRGLCPNERTYTTLVDGFSQKGYMNEAYRVLREMNDNGFSPSVVTY 418
Query: 289 NTLFNGYCKEGRPVDGIYVLNKMKQMNCMPDRITYTTLLHGLIKWGKIRIALRTYKEMVS 348
N L NG+C G+ D I VL MK+ PD ++Y+T+L G + + ALR +EMV
Sbjct: 419 NALINGHCVTGKMEDAIAVLEDMKEKGLSPDVVSYSTVLSGFCRSYDVDEALRVKREMVE 478
Query: 349 SGHTIEAKMMNTFMRALSKRSWKEKDLLEDAHQVFEKMKDDFQVIDRSTYGLLIQALCSG 408
G + ++ ++ E+ ++A ++E+M D TY LI A C
Sbjct: 479 KGIKPDTITYSSLIQGFC-----EQRRTKEACDLYEEMLRVGLPPDEFTYTALINAYCME 538
Query: 409 NMISEALANLHHMIGKGYSPRAITIDVVVQSLCHTHSHTGSANEALCLLGH--GIPFSTT 468
+ +AL + M+ KG P +T V++ L + S T A L L + +P T
Sbjct: 539 GDLEKALQLHNEMVEKGVLPDVVTYSVLINGL-NKQSRTREAKRLLLKLFYEESVPSDVT 598
Query: 469 SFDLIIN 474
LI N
Sbjct: 599 YHTLIEN 599
BLAST of HG10020467 vs. TAIR 10
Match:
AT3G53700.1 (Pentatricopeptide repeat (PPR) superfamily protein)
HSP 1 Score: 181.4 bits (459), Expect = 1.8e-45
Identity = 108/391 (27.62%), Positives = 195/391 (49.87%), Query Frame = 0
Query: 109 NISEFNDLFMDFVSENELHLALKLLSNISSYGLVPNSRTFSIMIRCYCKKGELDNAGRVL 168
++S FN L ++L A+ +L ++ SYGLVP+ +TF+ +++ Y ++G+LD A R+
Sbjct: 188 DVSTFNVLIKALCRAHQLRPAILMLEDMPSYGLVPDEKTFTTVMQGYIEEGDLDGALRIR 247
Query: 169 EQMLGRGHYPNDATVAFLVNAFCKRGKTQKALEMV-EVVGRIGRKPTVKTYNCLLKGLCY 228
EQM+ G ++ +V +V+ FCK G+ + AL + E+ + G P T+N L+ GLC
Sbjct: 248 EQMVEFGCSWSNVSVNVIVHGFCKEGRVEDALNFIQEMSNQDGFFPDQYTFNTLVNGLCK 307
Query: 229 VGRVEEACEMVTEMKKDSLIPDIYTYTALMDGLCKVGRSDEAMELLNEAEENGLKPSVVT 288
G V+ A E++ M ++ PD+YTY +++ GLCK+G EA+E+L++ P+ VT
Sbjct: 308 AGHVKHAIEIMDVMLQEGYDPDVYTYNSVISGLCKLGEVKEAVEVLDQMITRDCSPNTVT 367
Query: 289 FNTLFNGYCKEGRPVDGIYVLNKMKQMNCMPDRITYTTLLHGLIKWGKIRIALRTYKEMV 348
+NTL + CKE + + + + +PD T+ +L+ GL R+A+ ++EM
Sbjct: 368 YNTLISTLCKENQVEEATELARVLTSKGILPDVCTFNSLIQGLCLTRNHRVAMELFEEMR 427
Query: 349 SSGHTIEAKMMNTFMRALSKRSWKEKDLLEDAHQVFEKMKDDFQVIDRSTYGLLIQALCS 408
S G + N + +L K L++A + ++M+ TY LI C
Sbjct: 428 SKGCEPDEFTYNMLIDSLC-----SKGKLDEALNMLKQMELSGCARSVITYNTLIDGFCK 487
Query: 409 GNMISEALANLHHMIGKGYSPRAITIDVVVQSLCHTHSHTGSANEALCLLGHGIPFSTTS 468
N EA M G S ++T + ++ LC + +A ++ G +
Sbjct: 488 ANKTREAEEIFDEMEVHGVSRNSVTYNTLIDGLCKSRRVEDAAQLMDQMIMEGQKPDKYT 547
Query: 469 FDLIINELNKQGMRLSASNVYGLALKRGVNP 499
++ ++ + G A+++ G P
Sbjct: 548 YNSLLTHFCRGGDIKKAADIVQAMTSNGCEP 573
BLAST of HG10020467 vs. TAIR 10
Match:
AT1G09900.1 (Pentatricopeptide repeat (PPR-like) superfamily protein)
HSP 1 Score: 176.4 bits (446), Expect = 5.7e-44
Identity = 106/368 (28.80%), Positives = 174/368 (47.28%), Query Frame = 0
Query: 112 EFNDLFMDFVSENELHLALKLLSNISSYGLVPNSRTFSIMIRCYCKKGELDNAGRVLEQM 171
E N+ V EL K L N+ +G VP+ + +IR +C+ G+ A ++LE +
Sbjct: 104 ESNNHLRQMVRTGELEEGFKFLENMVYHGNVPDIIPCTTLIRGFCRLGKTRKAAKILEIL 163
Query: 172 LGRGHYPNDATVAFLVNAFCKRGKTQKALEMVEVVGRIGRKPTVKTYNCLLKGLCYVGRV 231
G G P+ T +++ +CK G+ AL V+ R+ P V TYN +L+ LC G++
Sbjct: 164 EGSGAVPDVITYNVMISGYCKAGEINNAL---SVLDRMSVSPDVVTYNTILRSLCDSGKL 223
Query: 232 EEACEMVTEMKKDSLIPDIYTYTALMDGLCKVGRSDEAMELLNEAEENGLKPSVVTFNTL 291
++A E++ M + PD+ TYT L++ C+ AM+LL+E + G P VVT+N L
Sbjct: 224 KQAMEVLDRMLQRDCYPDVITYTILIEATCRDSGVGHAMKLLDEMRDRGCTPDVVTYNVL 283
Query: 292 FNGYCKEGRPVDGIYVLNKMKQMNCMPDRITYTTLLHGLIKWGKIRIALRTYKEMVSSGH 351
NG CKEGR + I LN M C P+ IT+ +L + G+ A + +M+ G
Sbjct: 284 VNGICKEGRLDEAIKFLNDMPSSGCQPNVITHNIILRSMCSTGRWMDAEKLLADMLRKGF 343
Query: 352 TIEAKMMNTFMRALSKRSWKEKDLLEDAHQVFEKMKDDFQVIDRSTYGLLIQALCSGNMI 411
+ N + L + K LL A + EKM + +Y L+ C +
Sbjct: 344 SPSVVTFNILINFLCR-----KGLLGRAIDILEKMPQHGCQPNSLSYNPLLHGFCKEKKM 403
Query: 412 SEALANLHHMIGKGYSPRAITIDVVVQSLCHTHSHTGSANEALCLLGHGIPFSTTSFDLI 471
A+ L M+ +G P +T + ++ +LC + L G +++ +
Sbjct: 404 DRAIEYLERMVSRGCYPDIVTYNTMLTALCKDGKVEDAVEILNQLSSKGCSPVLITYNTV 463
Query: 472 INELNKQG 480
I+ L K G
Sbjct: 464 IDGLAKAG 463
The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
XP_038904608.1 | 1.6e-229 | 84.29 | pentatricopeptide repeat-containing protein At1g09900-like [Benincasa hispida]
XP_022982589.1 | 2.8e-218 | 79.44 | pentatricopeptide repeat-containing protein At5g64320, mitochondrial-like [Cucur...
KAG6580429.1 | 1.4e-217 | 78.83 | Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita a...
XP_022935128.1 | 1.8e-217 | 78.63 | pentatricopeptide repeat-containing protein At1g09900-like [Cucurbita moschata]
KAG7017186.1 | 5.3e-217 | 78.83 | Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita a...
Match Name | E-value | Identity (%) | Description
Q9SR00 | 1.2e-46 | 30.11 | Pentatricopeptide repeat-containing protein At3g04760, chloroplastic OS=Arabidop...
Q9FMF6 | 6.0e-46 | 26.87 | Pentatricopeptide repeat-containing protein At5g64320, mitochondrial OS=Arabidop...
Q9FIX3 | 1.1e-44 | 30.79 | Pentatricopeptide repeat-containing protein At5g39710 OS=Arabidopsis thaliana OX...
Q9LFF1 | 2.5e-44 | 27.62 | Pentatricopeptide repeat-containing protein At3g53700, chloroplastic OS=Arabidop...
Q3EDF8 | 8.1e-43 | 28.80 | Pentatricopeptide repeat-containing protein At1g09900 OS=Arabidopsis thaliana OX...
Match Name | E-value | Identity (%) | Description
A0A6J1J506 | 1.4e-218 | 79.44 | pentatricopeptide repeat-containing protein At5g64320, mitochondrial-like OS=Cuc...
A0A6J1F9P1 | 8.8e-218 | 78.63 | pentatricopeptide repeat-containing protein At1g09900-like OS=Cucurbita moschata...
A0A6J1DW66 | 9.5e-196 | 73.48 | pentatricopeptide repeat-containing protein At1g09900-like OS=Momordica charanti...
A0A2I4EIR4 | 4.6e-134 | 56.92 | pentatricopeptide repeat-containing protein At1g13040, mitochondrial-like OS=Jug...
A0A5N6RKM0 | 9.7e-132 | 54.18 | Uncharacterized protein OS=Carpinus fangiana OX=176857 GN=FH972_017034 PE=4 SV=1
Match Name | E-value | Identity (%) | Description
AT3G04760.1 | 8.6e-48 | 30.11 | Pentatricopeptide repeat (PPR-like) superfamily protein
AT5G64320.1 | 4.2e-47 | 26.87 | Pentatricopeptide repeat (PPR) superfamily protein
AT5G39710.1 | 8.0e-46 | 30.79 | Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G53700.1 | 1.8e-45 | 27.62 | Pentatricopeptide repeat (PPR) superfamily protein
AT1G09900.1 | 5.7e-44 | 28.80 | Pentatricopeptide repeat (PPR-like) superfamily protein