Homology
BLAST of HG10003044 vs. NCBI nr
Match:
XP_038906217.1 (pentatricopeptide repeat-containing protein At5g46580, chloroplastic [Benincasa hispida])
HSP 1 Score: 1357.4 bits (3512), Expect = 0.0e+00
Identity = 676/711 (95.08%), Postives = 696/711 (97.89%), Query Frame = 0
Query: 225 MAPPLFSSLDLNLKPTPIFFTSPLPRKNFTKRLTVLCSSSSKSPRRASPISSESSDNKNP 284
MA PL SSLDLNLKPT FFTSPL RKNFTKRLTV+C +SSK PR+ASPISSES DNKNP
Sbjct: 1 MAAPLSSSLDLNLKPT--FFTSPLRRKNFTKRLTVIC-TSSKPPRKASPISSESIDNKNP 60
Query: 285 SLSEQLKNLSTTTLSNAPNDQTHLLSKPKSTWVNPTKPKHSVLSPHRQKRSSYSYNPKMR 344
SLSEQLKNLSTTTLSNAPND++HLLSKPKSTWVNPTKPK SVLS RQKRSSYSYNPKMR
Sbjct: 61 SLSEQLKNLSTTTLSNAPNDESHLLSKPKSTWVNPTKPKRSVLSLQRQKRSSYSYNPKMR 120
Query: 345 DLKSFAHKLNACDSSDEASFMAALEEIPHPPTKENALIILNSLRPWQKTHLFFNWIKTQN 404
DLKSFAHKLNACDSSDEA+FMAALEEIPHPPTKENAL++LNSLRPWQKTHLFFNWIKTQN
Sbjct: 121 DLKSFAHKLNACDSSDEAAFMAALEEIPHPPTKENALLVLNSLRPWQKTHLFFNWIKTQN 180
Query: 405 LFPMETIFYNVAMKSLRYGRQFQLIEDLANEMITTGIELDNITYSTIITCAKKCSRFDKA 464
LFPMETIFYNVAMKSLRYGRQFQL+EDLANEMI+TGIELDNITYSTIITCAKKCSRFDKA
Sbjct: 181 LFPMETIFYNVAMKSLRYGRQFQLVEDLANEMISTGIELDNITYSTIITCAKKCSRFDKA 240
Query: 465 MEWFERMYKTGLMPDEVTYSAILDVYANLGKVEEVLSLYERGRASGWKPDPFTFSVLGKM 524
+EWFERMYKTGLMPDEVTYSAILDVYANLGKVEEVLSLYERGRASGWKPDPFTFSVLGKM
Sbjct: 241 VEWFERMYKTGLMPDEVTYSAILDVYANLGKVEEVLSLYERGRASGWKPDPFTFSVLGKM 300
Query: 525 FGEARDYDGIMYVLQEMKSIEVQPNLVVYNTLLDAMGKAGKPGFARSLFDEMVESGIMPN 584
FGEA DYDGIMYVLQEMKSIEVQPNLVVYNTLLDAMGKAGKPGFARSLFDEMVESGI PN
Sbjct: 301 FGEAGDYDGIMYVLQEMKSIEVQPNLVVYNTLLDAMGKAGKPGFARSLFDEMVESGITPN 360
Query: 585 EKTLTALVKIYGKARWARDALDLWERMRSNGWPMDFILYNTLLNMCADLGLEEEAEKLFE 644
EKTLTALVKIYGKARWARDALDLWERMRSNGWPMDFILYNTLLNMCADLGLEEEAEKLFE
Sbjct: 361 EKTLTALVKIYGKARWARDALDLWERMRSNGWPMDFILYNTLLNMCADLGLEEEAEKLFE 420
Query: 645 EMKKSEHSRPDSWSYTAMLNIHGSGGNVKRSMELFEEMLELGVEINVMCCTCLIQCLGKS 704
EMKKSEHSRPDSWSYTAMLNIHGSGGNVKRSMELFEEML+LGVEINVMCCTCLIQCLGKS
Sbjct: 421 EMKKSEHSRPDSWSYTAMLNIHGSGGNVKRSMELFEEMLKLGVEINVMCCTCLIQCLGKS 480
Query: 705 GRIDDLIRVFDVSVQKGIKPDDRLCGCLLSVVSLCDNSEDIDKVFTCLQQANPKLVAFVN 764
GRID+L+RVF+VSVQKGIKPDDRLCGCLLSVVSLCDNSEDI+KVFTCLQQA+PKLVAFVN
Sbjct: 481 GRIDELVRVFNVSVQKGIKPDDRLCGCLLSVVSLCDNSEDINKVFTCLQQADPKLVAFVN 540
Query: 765 LLQQNDITFEVVKDEFRNILGETANEARRPFCNCLIDICRNQNLRERAHELLYLGSLYGL 824
LLQQNDITFEVVK+EFRNILGETA EARRPFCNCLIDICRNQNLRERAHELLYLGSLYGL
Sbjct: 541 LLQQNDITFEVVKNEFRNILGETATEARRPFCNCLIDICRNQNLRERAHELLYLGSLYGL 600
Query: 825 YPGLHNKTEAEWCLDVRSLSVGAAQTALEEWMITLSKIVHREEALPELLSAQTGAGTHRF 884
YPGLHNKTEAEWCLDVRSLSVGAAQTALEEWMITLSKIV REEALPELLSAQTGAGTH+F
Sbjct: 601 YPGLHNKTEAEWCLDVRSLSVGAAQTALEEWMITLSKIVQREEALPELLSAQTGAGTHKF 660
Query: 885 SQGLANSFASHVDKLAAPFQLREDRAGWFVATREDLVTWVHSRVPAVAATA 936
SQGLANSFASHVDKLAAPFQLREDRAGWFVATREDLVTWVHSRVP+VAATA
Sbjct: 661 SQGLANSFASHVDKLAAPFQLREDRAGWFVATREDLVTWVHSRVPSVAATA 708
BLAST of HG10003044 vs. NCBI nr
Match:
XP_004148730.1 (pentatricopeptide repeat-containing protein At5g46580, chloroplastic [Cucumis sativus] >KGN55781.1 hypothetical protein Csa_011464 [Cucumis sativus])
HSP 1 Score: 1340.1 bits (3467), Expect = 0.0e+00
Identity = 667/711 (93.81%), Postives = 689/711 (96.91%), Query Frame = 0
Query: 225 MAPPLFSSLDLNLKPTPIFFTSPLPRKNFTKRLTVLCSSSSKSPRRASPISSESSDNKNP 284
MA PL SSLDL LKPTPIFFTSPL RKN TKRLT+LC SSSKSPR+ S +SS+S DNKNP
Sbjct: 1 MAVPLSSSLDLKLKPTPIFFTSPLRRKNVTKRLTLLC-SSSKSPRKPSSVSSQSVDNKNP 60
Query: 285 SLSEQLKNLSTTTLSNAPNDQTHLLSKPKSTWVNPTKPKHSVLSPHRQKRSSYSYNPKMR 344
SLSEQLKNLSTTTLSNAPND+T LLSKPKSTWVNPTKPK SVLS RQKRSSYSYNPKMR
Sbjct: 61 SLSEQLKNLSTTTLSNAPNDETRLLSKPKSTWVNPTKPKRSVLSLQRQKRSSYSYNPKMR 120
Query: 345 DLKSFAHKLNACDSSDEASFMAALEEIPHPPTKENALIILNSLRPWQKTHLFFNWIKTQN 404
DLKSFAHKLNACDSSD+ASF+AALEEIPHPPTKENAL+ILNSLRPWQKTHLFFNWIK+QN
Sbjct: 121 DLKSFAHKLNACDSSDDASFIAALEEIPHPPTKENALLILNSLRPWQKTHLFFNWIKSQN 180
Query: 405 LFPMETIFYNVAMKSLRYGRQFQLIEDLANEMITTGIELDNITYSTIITCAKKCSRFDKA 464
LFPMETIFYNVAMKSLRYGRQFQLIEDLANEMI+ GIELDNITYSTIITCAKKCSRFDKA
Sbjct: 181 LFPMETIFYNVAMKSLRYGRQFQLIEDLANEMISAGIELDNITYSTIITCAKKCSRFDKA 240
Query: 465 MEWFERMYKTGLMPDEVTYSAILDVYANLGKVEEVLSLYERGRASGWKPDPFTFSVLGKM 524
MEWFERMYKTGLMPDEVTYSAILDVYANLGKVEEVLSLYERGRASGW PDP+TFSVLGKM
Sbjct: 241 MEWFERMYKTGLMPDEVTYSAILDVYANLGKVEEVLSLYERGRASGWTPDPYTFSVLGKM 300
Query: 525 FGEARDYDGIMYVLQEMKSIEVQPNLVVYNTLLDAMGKAGKPGFARSLFDEMVESGIMPN 584
FGEA DYDGIMYVLQEMKSIE+QPNLVVYNTLLDAMGKAGKPGFARSLFDEMVESGI PN
Sbjct: 301 FGEAGDYDGIMYVLQEMKSIEMQPNLVVYNTLLDAMGKAGKPGFARSLFDEMVESGITPN 360
Query: 585 EKTLTALVKIYGKARWARDALDLWERMRSNGWPMDFILYNTLLNMCADLGLEEEAEKLFE 644
EKTLTALVKIYGKARWARDALDLWERMRSNGWPMDFILYNTLLNMCADLGLEEEAE LFE
Sbjct: 361 EKTLTALVKIYGKARWARDALDLWERMRSNGWPMDFILYNTLLNMCADLGLEEEAETLFE 420
Query: 645 EMKKSEHSRPDSWSYTAMLNIHGSGGNVKRSMELFEEMLELGVEINVMCCTCLIQCLGKS 704
EMKKS+HSRPDSWSYTAMLNI+GSGGNVKRSMELFEEMLELGVEINVMCCTCLIQCLGKS
Sbjct: 421 EMKKSKHSRPDSWSYTAMLNIYGSGGNVKRSMELFEEMLELGVEINVMCCTCLIQCLGKS 480
Query: 705 GRIDDLIRVFDVSVQKGIKPDDRLCGCLLSVVSLCDNSEDIDKVFTCLQQANPKLVAFVN 764
GRIDDL+RVF+VSVQKGIKPDDRLCGCLLSV+SLC NSEDI+KVFTCLQQANPKLV+F+N
Sbjct: 481 GRIDDLVRVFNVSVQKGIKPDDRLCGCLLSVLSLCYNSEDINKVFTCLQQANPKLVSFIN 540
Query: 765 LLQQNDITFEVVKDEFRNILGETANEARRPFCNCLIDICRNQNLRERAHELLYLGSLYGL 824
LLQQNDITFEVVK+EFRNILGETA EARRPFCNCLIDICRNQNLRERAHELLYLGSLYGL
Sbjct: 541 LLQQNDITFEVVKNEFRNILGETAPEARRPFCNCLIDICRNQNLRERAHELLYLGSLYGL 600
Query: 825 YPGLHNKTEAEWCLDVRSLSVGAAQTALEEWMITLSKIVHREEALPELLSAQTGAGTHRF 884
YPGLHNKTE EWCLDVRSLSVGAAQTALEEWMITLSKIV REEALPELLSAQTGAGTHRF
Sbjct: 601 YPGLHNKTETEWCLDVRSLSVGAAQTALEEWMITLSKIVQREEALPELLSAQTGAGTHRF 660
Query: 885 SQGLANSFASHVDKLAAPFQLREDRAGWFVATREDLVTWVHSRVPAVAATA 936
SQGLANSFASHVDKLAAPFQLREDRAGWFVATREDLVTWVHSRVP+VAATA
Sbjct: 661 SQGLANSFASHVDKLAAPFQLREDRAGWFVATREDLVTWVHSRVPSVAATA 710
BLAST of HG10003044 vs. NCBI nr
Match:
XP_008448710.1 (PREDICTED: pentatricopeptide repeat-containing protein At5g46580, chloroplastic [Cucumis melo] >KAA0053056.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] >TYK11511.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])
HSP 1 Score: 1334.3 bits (3452), Expect = 0.0e+00
Identity = 664/711 (93.39%), Postives = 687/711 (96.62%), Query Frame = 0
Query: 225 MAPPLFSSLDLNLKPTPIFFTSPLPRKNFTKRLTVLCSSSSKSPRRASPISSESSDNKNP 284
MA PL SSLD LKPTPIFFTS L RK KRLT+LC SSSKSPR+ S ISSES DNKNP
Sbjct: 1 MAAPLSSSLDFKLKPTPIFFTSLLRRKYVNKRLTLLC-SSSKSPRKPSSISSESIDNKNP 60
Query: 285 SLSEQLKNLSTTTLSNAPNDQTHLLSKPKSTWVNPTKPKHSVLSPHRQKRSSYSYNPKMR 344
SLS+QLKNLSTTTLSNAPND+T LLSKPKSTWVNPTKPK SVLS RQKRSSYSYNPKMR
Sbjct: 61 SLSDQLKNLSTTTLSNAPNDETRLLSKPKSTWVNPTKPKRSVLSLQRQKRSSYSYNPKMR 120
Query: 345 DLKSFAHKLNACDSSDEASFMAALEEIPHPPTKENALIILNSLRPWQKTHLFFNWIKTQN 404
DLKSFAHKLNACDSSDEASF+AALEEIPHPPTKENAL+ILNSLRPWQKTHLFFNWIKTQN
Sbjct: 121 DLKSFAHKLNACDSSDEASFIAALEEIPHPPTKENALLILNSLRPWQKTHLFFNWIKTQN 180
Query: 405 LFPMETIFYNVAMKSLRYGRQFQLIEDLANEMITTGIELDNITYSTIITCAKKCSRFDKA 464
LFPMETIFYNVAMKSLRYGRQFQLIEDLAN+M++TGIELDNITYSTIITCAKKCSRFDKA
Sbjct: 181 LFPMETIFYNVAMKSLRYGRQFQLIEDLANDMVSTGIELDNITYSTIITCAKKCSRFDKA 240
Query: 465 MEWFERMYKTGLMPDEVTYSAILDVYANLGKVEEVLSLYERGRASGWKPDPFTFSVLGKM 524
MEWFERMYKTGLMPDEVTYSAILDVYANLGKVEEVLSLYERGRASGWKPDP+TFSVLGKM
Sbjct: 241 MEWFERMYKTGLMPDEVTYSAILDVYANLGKVEEVLSLYERGRASGWKPDPYTFSVLGKM 300
Query: 525 FGEARDYDGIMYVLQEMKSIEVQPNLVVYNTLLDAMGKAGKPGFARSLFDEMVESGIMPN 584
FGEA DYDGIMYVLQEMKSIE+QPNLVVYNTLLDAMGKAGKPGFARSLFDEMVESGI PN
Sbjct: 301 FGEAGDYDGIMYVLQEMKSIEMQPNLVVYNTLLDAMGKAGKPGFARSLFDEMVESGITPN 360
Query: 585 EKTLTALVKIYGKARWARDALDLWERMRSNGWPMDFILYNTLLNMCADLGLEEEAEKLFE 644
EKTLTALVKIYGKARWARDALDLWERMRSNGWPMDFILYNTLLNMCADLGLEEEAEKLFE
Sbjct: 361 EKTLTALVKIYGKARWARDALDLWERMRSNGWPMDFILYNTLLNMCADLGLEEEAEKLFE 420
Query: 645 EMKKSEHSRPDSWSYTAMLNIHGSGGNVKRSMELFEEMLELGVEINVMCCTCLIQCLGKS 704
EMKKS+HSRPDSWSYTAMLNI+GSGGNVKRSMELFEEML+LGVEINVMCCTCLIQCLGKS
Sbjct: 421 EMKKSKHSRPDSWSYTAMLNIYGSGGNVKRSMELFEEMLKLGVEINVMCCTCLIQCLGKS 480
Query: 705 GRIDDLIRVFDVSVQKGIKPDDRLCGCLLSVVSLCDNSEDIDKVFTCLQQANPKLVAFVN 764
GRIDDL+RVF+VSVQKGIKPDDRLCGCLLSVVSLCDNSEDI+KVFTCLQQANPKLV+FVN
Sbjct: 481 GRIDDLVRVFNVSVQKGIKPDDRLCGCLLSVVSLCDNSEDINKVFTCLQQANPKLVSFVN 540
Query: 765 LLQQNDITFEVVKDEFRNILGETANEARRPFCNCLIDICRNQNLRERAHELLYLGSLYGL 824
LLQQN ITFEV+K+EFRNIL ETA+EARRPFCNCLIDICRNQNLRERAHELLYLGSLYGL
Sbjct: 541 LLQQNSITFEVIKNEFRNILSETASEARRPFCNCLIDICRNQNLRERAHELLYLGSLYGL 600
Query: 825 YPGLHNKTEAEWCLDVRSLSVGAAQTALEEWMITLSKIVHREEALPELLSAQTGAGTHRF 884
YPGLHNKTE EWCLDVRSLSVGAAQTALEEWMITLSKIV R+EALPELLSAQTGAGTHRF
Sbjct: 601 YPGLHNKTETEWCLDVRSLSVGAAQTALEEWMITLSKIVQRKEALPELLSAQTGAGTHRF 660
Query: 885 SQGLANSFASHVDKLAAPFQLREDRAGWFVATREDLVTWVHSRVPAVAATA 936
SQGLANSFASHVDKLAAPFQLREDRAGWFVATREDLVTWVHSRVP+V ATA
Sbjct: 661 SQGLANSFASHVDKLAAPFQLREDRAGWFVATREDLVTWVHSRVPSVPATA 710
BLAST of HG10003044 vs. NCBI nr
Match:
KAG7015666.1 (Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 1280.0 bits (3311), Expect = 0.0e+00
Identity = 638/714 (89.36%), Postives = 671/714 (93.98%), Query Frame = 0
Query: 225 MAPPLFSSLD--LNLKPT-PIFFTSPLPRKNFTKRLTVLCSSSSKSPRRASPISSESSDN 284
MA PL SSLD L LKPT P+FFTSPL R NFTKR TVLC+SSSKSPR S+D
Sbjct: 1 MAAPLSSSLDIKLKLKPTPPLFFTSPLRRNNFTKRFTVLCTSSSKSPR--------STDK 60
Query: 285 KNPSLSEQLKNLSTTTLSNAPNDQTHLLSKPKSTWVNPTKPKHSVLSPHRQKRSSYSYNP 344
KNPSLSEQLK+LST+TLSNA ND++HLLS PKS WVNPTKPK SVLS RQKRSSYSYNP
Sbjct: 61 KNPSLSEQLKDLSTSTLSNASNDESHLLSNPKSIWVNPTKPKRSVLSLQRQKRSSYSYNP 120
Query: 345 KMRDLKSFAHKLNACDSSDEASFMAALEEIPHPPTKENALIILNSLRPWQKTHLFFNWIK 404
KMR+LK+FAHKLNA DSS EA+FMA L+EIPHPPTKENAL+ILNSL+PWQKTHLFFNWIK
Sbjct: 121 KMRELKTFAHKLNASDSS-EAAFMAVLKEIPHPPTKENALLILNSLKPWQKTHLFFNWIK 180
Query: 405 TQNLFPMETIFYNVAMKSLRYGRQFQLIEDLANEMITTGIELDNITYSTIITCAKKCSRF 464
TQNLFPMETIFYNVAMKSLRYGRQFQLIE+LANEMI TGIELDNITYSTIITCAKKCSRF
Sbjct: 181 TQNLFPMETIFYNVAMKSLRYGRQFQLIEELANEMINTGIELDNITYSTIITCAKKCSRF 240
Query: 465 DKAMEWFERMYKTGLMPDEVTYSAILDVYANLGKVEEVLSLYERGRASGWKPDPFTFSVL 524
DKAMEWFERMY+TGLMPDEVTYSAILDVYANLGKVEE LSLYERGRASGWKPDP+TFSVL
Sbjct: 241 DKAMEWFERMYRTGLMPDEVTYSAILDVYANLGKVEEALSLYERGRASGWKPDPYTFSVL 300
Query: 525 GKMFGEARDYDGIMYVLQEMKSIEVQPNLVVYNTLLDAMGKAGKPGFARSLFDEMVESGI 584
GKMFGEA DYDGIMYVLQEMKSIEVQPNLVVYNTLLDAMGKAG+PGFARSLF+EM+ESGI
Sbjct: 301 GKMFGEAGDYDGIMYVLQEMKSIEVQPNLVVYNTLLDAMGKAGRPGFARSLFEEMIESGI 360
Query: 585 MPNEKTLTALVKIYGKARWARDALDLWERMRSNGWPMDFILYNTLLNMCADLGLEEEAEK 644
PNEKTLTALVKIYGKARWARDAL+LWERMRS GWPMDFILYNTLLNMCADLGLEEEAEK
Sbjct: 361 TPNEKTLTALVKIYGKARWARDALELWERMRSKGWPMDFILYNTLLNMCADLGLEEEAEK 420
Query: 645 LFEEMKKSEHSRPDSWSYTAMLNIHGSGGNVKRSMELFEEMLELGVEINVMCCTCLIQCL 704
LFEEMKKSEHSRPDSWSYTAMLNIHGSGGNVKRSMELFEEMLELGV INVMCCTCLIQCL
Sbjct: 421 LFEEMKKSEHSRPDSWSYTAMLNIHGSGGNVKRSMELFEEMLELGVGINVMCCTCLIQCL 480
Query: 705 GKSGRIDDLIRVFDVSVQKGIKPDDRLCGCLLSVVSLCDNSEDIDKVFTCLQQANPKLVA 764
GK+ RIDDL+RVFDVS++KG+KPDDRLCGCLLSVVSLCDN+EDI KVFTCLQQANPKLVA
Sbjct: 481 GKARRIDDLVRVFDVSIRKGVKPDDRLCGCLLSVVSLCDNNEDISKVFTCLQQANPKLVA 540
Query: 765 FVNLLQQNDITFEVVKDEFRNILGETANEARRPFCNCLIDICRNQNLRERAHELLYLGSL 824
FVNLLQQNDITF+V+KDEFR ILGETA EARRPFCNCLIDICRNQNL +RAHELLYLGSL
Sbjct: 541 FVNLLQQNDITFDVIKDEFRTILGETATEARRPFCNCLIDICRNQNLSKRAHELLYLGSL 600
Query: 825 YGLYPGLHNKTEAEWCLDVRSLSVGAAQTALEEWMITLSKIVHREEALPELLSAQTGAGT 884
YGLYPGLHNKTE EWCLDVRSLSVGAAQTALEEWMITLSKIV REEALPELLSAQTGAGT
Sbjct: 601 YGLYPGLHNKTEGEWCLDVRSLSVGAAQTALEEWMITLSKIVQREEALPELLSAQTGAGT 660
Query: 885 HRFSQGLANSFASHVDKLAAPFQLREDRAGWFVATREDLVTWVHSRVPAVAATA 936
HRFSQGLANSFASHV+KLAAPF+LREDRAGWFVATRED+V WVHSRVP+VA A
Sbjct: 661 HRFSQGLANSFASHVEKLAAPFRLREDRAGWFVATREDVVAWVHSRVPSVATRA 705
BLAST of HG10003044 vs. NCBI nr
Match:
XP_023552442.1 (pentatricopeptide repeat-containing protein At5g46580, chloroplastic-like [Cucurbita pepo subsp. pepo])
HSP 1 Score: 1279.2 bits (3309), Expect = 0.0e+00
Identity = 638/714 (89.36%), Postives = 671/714 (93.98%), Query Frame = 0
Query: 225 MAPPLFSSLD--LNLKPT-PIFFTSPLPRKNFTKRLTVLCSSSSKSPRRASPISSESSDN 284
MA PL SSLD L LKPT P+FFTSPL R NFTKRLTVLC+SSSKSPR S+D
Sbjct: 1 MAAPLSSSLDIKLKLKPTPPLFFTSPLRRNNFTKRLTVLCTSSSKSPR--------STDK 60
Query: 285 KNPSLSEQLKNLSTTTLSNAPNDQTHLLSKPKSTWVNPTKPKHSVLSPHRQKRSSYSYNP 344
KNPSLSEQLK+LST+TLSNA ND++HLLS KS WVNPTKPK SVLS RQKRSSYSYNP
Sbjct: 61 KNPSLSEQLKDLSTSTLSNASNDESHLLSNSKSIWVNPTKPKRSVLSLQRQKRSSYSYNP 120
Query: 345 KMRDLKSFAHKLNACDSSDEASFMAALEEIPHPPTKENALIILNSLRPWQKTHLFFNWIK 404
KMR+LK+FAHKLNA DSS EA+FMA L+EIPHPPTKENAL+ILNSL+PWQKTHLFFNWIK
Sbjct: 121 KMRELKTFAHKLNASDSS-EAAFMAVLKEIPHPPTKENALLILNSLKPWQKTHLFFNWIK 180
Query: 405 TQNLFPMETIFYNVAMKSLRYGRQFQLIEDLANEMITTGIELDNITYSTIITCAKKCSRF 464
TQNLFPMETIFYNVAMKSLRYGRQFQLIE+LANEMI TGIELDNITYSTIITCAKKCSRF
Sbjct: 181 TQNLFPMETIFYNVAMKSLRYGRQFQLIEELANEMINTGIELDNITYSTIITCAKKCSRF 240
Query: 465 DKAMEWFERMYKTGLMPDEVTYSAILDVYANLGKVEEVLSLYERGRASGWKPDPFTFSVL 524
DKAMEWFERMY+TGLMPDEVTYSAILDVYANLGKVEE LSLYERGRASGWKPDP+TFSVL
Sbjct: 241 DKAMEWFERMYRTGLMPDEVTYSAILDVYANLGKVEEALSLYERGRASGWKPDPYTFSVL 300
Query: 525 GKMFGEARDYDGIMYVLQEMKSIEVQPNLVVYNTLLDAMGKAGKPGFARSLFDEMVESGI 584
GKMFGEA DYDGIMYVLQEMKSIEVQPNLVVYNTLLDAMGKAG+PGFARSLF+EM+ESGI
Sbjct: 301 GKMFGEAGDYDGIMYVLQEMKSIEVQPNLVVYNTLLDAMGKAGRPGFARSLFEEMIESGI 360
Query: 585 MPNEKTLTALVKIYGKARWARDALDLWERMRSNGWPMDFILYNTLLNMCADLGLEEEAEK 644
PNEKTLTALVKIYGKARWARDAL+LWERMRS GWPMDFILYNTLLNMCADLGLEEEAEK
Sbjct: 361 TPNEKTLTALVKIYGKARWARDALELWERMRSKGWPMDFILYNTLLNMCADLGLEEEAEK 420
Query: 645 LFEEMKKSEHSRPDSWSYTAMLNIHGSGGNVKRSMELFEEMLELGVEINVMCCTCLIQCL 704
LFEEMKKSEHSRPDSWSYTAMLNIHGSGGNVKRSMELFEEMLELGV INVMCCTCLIQCL
Sbjct: 421 LFEEMKKSEHSRPDSWSYTAMLNIHGSGGNVKRSMELFEEMLELGVGINVMCCTCLIQCL 480
Query: 705 GKSGRIDDLIRVFDVSVQKGIKPDDRLCGCLLSVVSLCDNSEDIDKVFTCLQQANPKLVA 764
GK+ RIDDL+RVFDVS++KG+KPDDR CGCLLSVVSLCDN+EDI KVFTCLQQANPKLVA
Sbjct: 481 GKARRIDDLVRVFDVSIRKGVKPDDRFCGCLLSVVSLCDNNEDISKVFTCLQQANPKLVA 540
Query: 765 FVNLLQQNDITFEVVKDEFRNILGETANEARRPFCNCLIDICRNQNLRERAHELLYLGSL 824
FVNLLQQNDITF+V+KDEFR ILGETA+EARRPFCNCLIDICRNQNL +RAHELLYLGSL
Sbjct: 541 FVNLLQQNDITFDVIKDEFRTILGETASEARRPFCNCLIDICRNQNLSKRAHELLYLGSL 600
Query: 825 YGLYPGLHNKTEAEWCLDVRSLSVGAAQTALEEWMITLSKIVHREEALPELLSAQTGAGT 884
YGLYPGLHNKTE EWCLDVRSLSVGAAQTALEEWMITLSKIV REEALPELLSAQTGAGT
Sbjct: 601 YGLYPGLHNKTEGEWCLDVRSLSVGAAQTALEEWMITLSKIVQREEALPELLSAQTGAGT 660
Query: 885 HRFSQGLANSFASHVDKLAAPFQLREDRAGWFVATREDLVTWVHSRVPAVAATA 936
HRFSQGLANSFASHV+KLAAPF+LREDRAGWFVATRED V WVHSRVP+VA TA
Sbjct: 661 HRFSQGLANSFASHVEKLAAPFRLREDRAGWFVATREDFVAWVHSRVPSVATTA 705
BLAST of HG10003044 vs. ExPASy Swiss-Prot
Match:
Q9LS25 (Pentatricopeptide repeat-containing protein At5g46580, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=At5g46580 PE=2 SV=1)
HSP 1 Score: 969.1 bits (2504), Expect = 3.4e-281
Identity = 470/705 (66.67%), Postives = 583/705 (82.70%), Query Frame = 0
Query: 230 FSSLDLNLKPTPIFFTSPLPRKNFTKRLTVLCSSSSKSPRRASPISSESSDNKNPSLSEQ 289
F+ + + K +F L R++ +++L + C SS K P+ + E K PSLSEQ
Sbjct: 13 FNPQNSDTKKHSLFLKPSLFRQSRSRKLNISC-SSLKQPK---TLEEEPITTKTPSLSEQ 72
Query: 290 LKNLSTTTLSNAPNDQTHLLSKPKSTWVNPTKPKHSVLSPHRQKRSSYSYNPKMRDLKSF 349
LK LS TTL +QT +LSKPKS WVNPT+PK SVLS RQKRS+YSYNP+++DL++F
Sbjct: 73 LKPLSATTLR---QEQTQILSKPKSVWVNPTRPKRSVLSLQRQKRSAYSYNPQIKDLRAF 132
Query: 350 AHKLNACDSSDEASFMAALEEIPHPPTKENALIILNSLRPWQKTHLFFNWIKTQNLFPME 409
A KLN+ ++++ F++ L+EIPHPP ++NAL++LNSLR WQKTH FFNW+K+++LFPME
Sbjct: 133 ALKLNSSIFTEKSEFLSLLDEIPHPPNRDNALLVLNSLREWQKTHTFFNWVKSKSLFPME 192
Query: 410 TIFYNVAMKSLRYGRQFQLIEDLANEMITTGIELDNITYSTIITCAKKCSRFDKAMEWFE 469
TIFYNV MKSLR+GRQFQLIE++A EM+ G+ELDNITYSTIITCAK+C+ ++KA+EWFE
Sbjct: 193 TIFYNVTMKSLRFGRQFQLIEEMALEMVKDGVELDNITYSTIITCAKRCNLYNKAIEWFE 252
Query: 470 RMYKTGLMPDEVTYSAILDVYANLGKVEEVLSLYERGRASGWKPDPFTFSVLGKMFGEAR 529
RMYKTGLMPDEVTYSAILDVY+ GKVEEVLSLYER A+GWKPD FSVLGKMFGEA
Sbjct: 253 RMYKTGLMPDEVTYSAILDVYSKSGKVEEVLSLYERAVATGWKPDAIAFSVLGKMFGEAG 312
Query: 530 DYDGIMYVLQEMKSIEVQPNLVVYNTLLDAMGKAGKPGFARSLFDEMVESGIMPNEKTLT 589
DYDGI YVLQEMKS++V+PN+VVYNTLL+AMG+AGKPG ARSLF+EM+E+G+ PNEKTLT
Sbjct: 313 DYDGIRYVLQEMKSMDVKPNVVVYNTLLEAMGRAGKPGLARSLFNEMLEAGLTPNEKTLT 372
Query: 590 ALVKIYGKARWARDALDLWERMRSNGWPMDFILYNTLLNMCADLGLEEEAEKLFEEMKKS 649
ALVKIYGKARWARDAL LWE M++ WPMDFILYNTLLNMCAD+GLEEEAE+LF +MK+S
Sbjct: 373 ALVKIYGKARWARDALQLWEEMKAKKWPMDFILYNTLLNMCADIGLEEEAERLFNDMKES 432
Query: 650 EHSRPDSWSYTAMLNIHGSGGNVKRSMELFEEMLELGVEINVMCCTCLIQCLGKSGRIDD 709
RPD++SYTAMLNI+GSGG +++MELFEEML+ GV++NVM CTCL+QCLGK+ RIDD
Sbjct: 433 VQCRPDNFSYTAMLNIYGSGGKAEKAMELFEEMLKAGVQVNVMGCTCLVQCLGKAKRIDD 492
Query: 710 LIRVFDVSVQKGIKPDDRLCGCLLSVVSLCDNSEDIDKVFTCLQQANPKLVAFVNLLQQN 769
++ VFD+S+++G+KPDDRLCGCLLSV++LC++SED +KV CL++AN KLV FVNL+
Sbjct: 493 VVYVFDLSIKRGVKPDDRLCGCLLSVMALCESSEDAEKVMACLERANKKLVTFVNLIVDE 552
Query: 770 DITFEVVKDEFRNILGETANEARRPFCNCLIDICRNQNLRERAHELLYLGSLYGLYPGLH 829
+E VK+EF+ ++ T EARRPFCNCLIDICR N ERAHELLYLG+L+GLYPGLH
Sbjct: 553 KTEYETVKEEFKLVINATQVEARRPFCNCLIDICRGNNRHERAHELLYLGTLFGLYPGLH 612
Query: 830 NKTEAEWCLDVRSLSVGAAQTALEEWMITLSKIVHREEALPELLSAQTGAGTHRFSQGLA 889
NKT EW LDVRSLSVGAA+TALEEWM TL+ I+ R+E LPEL AQTG GTHRFSQGLA
Sbjct: 613 NKTIKEWSLDVRSLSVGAAETALEEWMRTLANIIKRQEELPELFLAQTGTGTHRFSQGLA 672
Query: 890 NSFASHVDKLAAPFQLREDRAGWFVATREDLVTWVHSRVPAVAAT 935
NSFA H+ +L+APF+ + DR G FVAT+EDLV+W+ S+ P + +
Sbjct: 673 NSFALHLQQLSAPFR-QSDRPGIFVATKEDLVSWLESKFPPLVTS 709
BLAST of HG10003044 vs. ExPASy Swiss-Prot
Match:
B4F8Z1 (Pentatricopeptide repeat-containing protein ATP4, chloroplastic OS=Zea mays OX=4577 GN=ATP4 PE=2 SV=1)
HSP 1 Score: 443.4 bits (1139), Expect = 6.6e-123
Identity = 250/679 (36.82%), Postives = 389/679 (57.29%), Query Frame = 0
Query: 263 SSSKSPRR-ASPISSESSDNKNPSLSEQLKNLSTTTLSNAPNDQTHLLSKPKSTWVNPTK 322
S+S +P+ +SP+++ S + P Q ++ S + SN + S + WVNP
Sbjct: 23 SASFNPKNPSSPVAAHVSVQETP---PQPQDPSPPSDSNPNGTRPSSSSNTRFLWVNPNS 82
Query: 323 PKHSVLSPHRQKRSSYSYNPKMRDLKSFAHKLNACDSSDEASFMAALEEIPHPPTKENAL 382
P+ + ++ R + + + L S A L AC++++ A A P PP++++A+
Sbjct: 83 PRAADVARAR------AGSGRRARLASAAAALGACETTESAVEAALQAAFPEPPSEQDAV 142
Query: 383 IILNSLRPW--QKTHLFFNWIKTQNLFPMETIFYNVAMKSLRYGRQFQLIEDLANEMITT 442
I+LN+ + L W + I YNV +K LR R + E L EM+
Sbjct: 143 IVLNTAAATRAETAVLALRWFLGNAKVRKKVILYNVVLKLLRKKRLWSETEALWAEMLRD 202
Query: 443 GIELDNITYSTIITCAKKCSRFDKAMEWFERMYKTGLMPDEVTYSAILDVYANLGKVEEV 502
G++ DN T+ST+I+CA+ C KA+EWF++M + G PD +TYSA++D Y + G E
Sbjct: 203 GVQPDNATFSTVISCARACGLHSKAVEWFDKMPEFGCSPDMLTYSAVIDAYGHAGNSEAA 262
Query: 503 LSLYERGRASGWKPDPFTFSVLGKMFGEARDYDGIMYVLQEMKSIEVQPNLVVYNTLLDA 562
L LY+R RA W+ DP S + K+ + ++DG + V +EMK+I V+PNLVVYNT+LDA
Sbjct: 263 LRLYDRARAEKWQLDPVICSTVIKVHSTSGNFDGALNVFEEMKAIGVRPNLVVYNTMLDA 322
Query: 563 MGKAGKPGFARSLFDEMVESGIMPNEKTLTALVKIYGKARWARDALDLWERMRSNGWPMD 622
MG+A +P +++ EMV+ + P+ T L+ Y +AR+ DA+ ++ M+ +D
Sbjct: 323 MGRALRPWVVKTIHREMVDQQVQPSRATYCCLLHAYTRARYGEDAMAVYRLMKDEAMGID 382
Query: 623 FILYNTLLNMCADLGLEEEAEKLFEEMKKS--EHSRPDSWSYTAMLNIHGSGGNVKRSME 682
+LYN LL+MCAD+G +EAE++F +MK S HS+PDSWSY++M+ ++ S NV +
Sbjct: 383 VMLYNMLLSMCADIGYVDEAEEIFRDMKASMGAHSKPDSWSYSSMVTLYSSTANVLSAEG 442
Query: 683 LFEEMLELGVEINVMCCTCLIQCLGKSGRIDDLIRVFDVSVQKGIKPDDRLCGCLLSVVS 742
+ EM+E G + N+ T LI+C GK GR DD++R F + GI PDDR CGCLLSV +
Sbjct: 443 ILNEMVEAGFKPNIFVLTSLIRCYGKVGRTDDVVRSFGMLQDLGIIPDDRFCGCLLSVAA 502
Query: 743 LCDNSEDIDKVFTCLQQANPKLVAFVNLLQQNDITFEVVKDEFRNILGETANEARRPFCN 802
+E++ KV +C++++N +L A V LL + E ++ R +L + + P+CN
Sbjct: 503 NTP-AEELGKVISCIERSNVQLGAVVKLLVDRS-SSESFREAARELLRSSRGVVKMPYCN 562
Query: 803 CLIDICRNQNLRERAHELLYLGSLYGLYPGLHNKTEAEWCLDVRSLSVGAAQTALEEWMI 862
CL+D+C N N E+A LL G+Y + +T+ +W L +R LSVGAA T L WM
Sbjct: 563 CLMDLCVNLNQMEKACALLDAAQQLGIYANIQTRTQTQWSLHLRGLSVGAALTTLHVWMN 622
Query: 863 TL-SKIVHREEALPELLSAQTGAGTHRFS-QGLANSFASHVDKLAAPFQLREDRAGWFVA 922
L + + E LP LL TG G + +S +GLA F +H+ +L APF D+AGWF+
Sbjct: 623 DLYTSLQTGNEGLPPLLGIHTGQGKNTYSDRGLAAMFEAHLKELDAPFHEAPDKAGWFLT 682
Query: 923 TREDLVTWVHSRVPAVAAT 935
T W+ S+ + T
Sbjct: 683 TNVAAKQWLESKAASELVT 690
BLAST of HG10003044 vs. ExPASy Swiss-Prot
Match:
Q10PZ4 (Pentatricopeptide repeat-containing protein ATP4 homolog, chloroplastic OS=Oryza sativa subsp. japonica OX=39947 GN=Os03g0215900 PE=3 SV=1)
HSP 1 Score: 436.4 bits (1121), Expect = 8.0e-121
Identity = 255/679 (37.56%), Postives = 383/679 (56.41%), Query Frame = 0
Query: 262 SSSSKSPRRASPISSESSDNKNPSLSEQLKNLSTTTLSNAPNDQTHLLSKPKST----WV 321
SS P RA +S + KNPS S +S P+D + +T WV
Sbjct: 5 SSLLSWPHRAISLSFQP---KNPSPSPATARVSVQDPPPPPSDANPSPGRSSNTSRYVWV 64
Query: 322 NPTKPKHSVLSPHRQKRSSYSYNPKMRDLKSFAHKLNACDSSDEASFMAALE-EIPHPPT 381
NP P+ + L+ R + + + L + A L AC++ EA AALE P PP+
Sbjct: 65 NPNSPRAAGLARAR------AGSGRRARLAAAAAALAACEAG-EAPVAAALEAAFPEPPS 124
Query: 382 KENALIILN--SLRPWQKTHLFFNWIKTQNLFPMETIFYNVAMKSLRYGRQFQLIEDLAN 441
+++A+I+LN S RP L W E I YNVA+K+LR R++ E L
Sbjct: 125 EQDAVIVLNTTSARP-AAVVLALWWFLRNAEVRKEVILYNVALKALRKRRRWSDAEALWE 184
Query: 442 EMITTGIELDNITYSTIITCAKKCSRFDKAMEWFERMYKTGLMPDEVTYSAILDVYANLG 501
EM+ G++ DN T+ST+I+CA+ C KA+EWFE+M G PD +TYSA++D Y G
Sbjct: 185 EMLREGVQPDNATFSTVISCARACGMPGKAVEWFEKMPDFGCSPDMLTYSAVIDAYGRAG 244
Query: 502 KVEEVLSLYERGRASGWKPDPFTFSVLGKMFGEARDYDGIMYVLQEMKSIEVQPNLVVYN 561
E L LY+R RA W+ DP + + ++ + ++DG + V +EMK+ V+PNLVVYN
Sbjct: 245 DAETALRLYDRARAEKWQLDPVICATVIRVHSSSGNFDGALNVFEEMKAAGVKPNLVVYN 304
Query: 562 TLLDAMGKAGKPGFARSLFDEMVESGIMPNEKTLTALVKIYGKARWARDALDLWERMRSN 621
T+LDAMG+A +P +++ E+V +PN+ T L+ Y +AR+ DA+ ++ M+
Sbjct: 305 TVLDAMGRAMRPWVVKTIHRELVSQEAVPNKATYCCLLHAYTRARYGEDAMAVYRVMKDE 364
Query: 622 GWPMDFILYNTLLNMCADLGLEEEAEKLFEEMKKS--EHSRPDSWSYTAMLNIHGSGGNV 681
+D +LYN LL+MCAD+G EEAE++F +MK S S+PDSWSY++M+ ++ GNV
Sbjct: 365 VMDIDVVLYNMLLSMCADIGYVEEAEEIFRDMKASMDSRSKPDSWSYSSMVTLYSCTGNV 424
Query: 682 KRSMELFEEMLELGVEINVMCCTCLIQCLGKSGRIDDLIRVFDVSVQKGIKPDDRLCGCL 741
+ + EM+E G + N+ T LI+C GK+GR DD++R F + GI PDDR CGCL
Sbjct: 425 AGAEGILNEMVEAGFKPNIFILTSLIRCYGKAGRTDDVVRSFAMLEDLGITPDDRFCGCL 484
Query: 742 LSVVSLCDNSEDIDKVFTCLQQANPKLVAFVNLLQQNDITFEVVKDEFRNILGETANEAR 801
L+V + ++++ KV C+ +++ +L A V LL E +++ +LG R
Sbjct: 485 LTVAA-GTPADELGKVIGCIDRSSAQLGAVVRLLVDAAAPSEPLREAAGELLGGARGVVR 544
Query: 802 RPFCNCLIDICRNQNLRERAHELLYLGSLYGLYPGLHNKTEAEWCLDVRSLSVGAAQTAL 861
P+CNCL+D+ N + E+A LL + G+Y + +T+ +W L +R LSVGAA T L
Sbjct: 545 MPYCNCLMDLAVNLSQMEKACALLDVALRLGIYSNVQTRTQTQWSLHLRGLSVGAALTTL 604
Query: 862 EEWMITLSKIVHREEALPELLSAQTGAGTHRFS-QGLANSFASHVDKLAAPFQLREDRAG 921
WM L + + LP LL TG G + +S +GLA F SH+ +L APF D+AG
Sbjct: 605 HVWMSDLYAALQAGDELPPLLGIHTGQGKNTYSYKGLATVFESHLKELDAPFHEAPDKAG 664
Query: 922 WFVATREDLVTWVHSRVPA 931
WF+ T W+ ++ A
Sbjct: 665 WFLTTSVAARHWLETKKSA 671
BLAST of HG10003044 vs. ExPASy Swiss-Prot
Match:
Q8GWE0 (Pentatricopeptide repeat-containing protein At4g16390, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=P67 PE=1 SV=3)
HSP 1 Score: 434.5 bits (1116), Expect = 3.1e-120
Identity = 254/693 (36.65%), Postives = 390/693 (56.28%), Query Frame = 0
Query: 260 LCSSSSKSPRRASPISSESSDNKNPSLSEQLKNLSTTTLS---NAPNDQTHLL------- 319
LC+ S P +++P S SS N N S L T +S P + L
Sbjct: 20 LCNLLSVYP-KSTPRSFLSSYNPNSSHFHSRNLLQATHVSVQEAIPQSEKSKLVDVDLPI 79
Query: 320 ----SKPKSTWVNPTKPKHSVLSPHRQKRSSYSYNPKMRDLKSFAHKLNACDSSDEASFM 379
+ WVNP P+ S L R+K SY+ + L A L+AC +EA
Sbjct: 80 PEPTASKSYVWVNPKSPRASQL---RRK----SYDSRYSSLIKLAESLDAC-KPNEADVC 139
Query: 380 AALEEIPHPPTKENALIILNSLRPWQKTHLFFNWIKTQNLFPMETIFYNVAMKSLRYGRQ 439
+ +++A++ LN++ + L N + E I YNV MK R +
Sbjct: 140 DVITGFGGKLFEQDAVVTLNNMTNPETAPLVLNNLLETMKPSREVILYNVTMKVFRKSKD 199
Query: 440 FQLIEDLANEMITTGIELDNITYSTIITCAKKCSRFDKAMEWFERMYKTGLMPDEVTYSA 499
+ E L +EM+ GI+ DN T++TII+CA++ +A+EWFE+M G PD VT +A
Sbjct: 200 LEKSEKLFDEMLERGIKPDNATFTTIISCARQNGVPKRAVEWFEKMSSFGCEPDNVTMAA 259
Query: 500 ILDVYANLGKVEEVLSLYERGRASGWKPDPFTFSVLGKMFGEARDYDGIMYVLQEMKSIE 559
++D Y G V+ LSLY+R R W+ D TFS L +++G + +YDG + + +EMK++
Sbjct: 260 MIDAYGRAGNVDMALSLYDRARTEKWRIDAVTFSTLIRIYGVSGNYDGCLNIYEEMKALG 319
Query: 560 VQPNLVVYNTLLDAMGKAGKPGFARSLFDEMVESGIMPNEKTLTALVKIYGKARWARDAL 619
V+PNLV+YN L+D+MG+A +P A+ ++ +++ +G PN T ALV+ YG+AR+ DAL
Sbjct: 320 VKPNLVIYNRLIDSMGRAKRPWQAKIIYKDLITNGFTPNWSTYAALVRAYGRARYGDDAL 379
Query: 620 DLWERMRSNGWPMDFILYNTLLNMCADLGLEEEAEKLFEEMKKSEHSRPDSWSYTAMLNI 679
++ M+ G + ILYNTLL+MCAD +EA ++F++MK E PDSW++++++ +
Sbjct: 380 AIYREMKEKGLSLTVILYNTLLSMCADNRYVDEAFEIFQDMKNCETCDPDSWTFSSLITV 439
Query: 680 HGSGGNVKRSMELFEEMLELGVEINVMCCTCLIQCLGKSGRIDDLIRVFDVSVQKGIKPD 739
+ G V + +M E G E + T +IQC GK+ ++DD++R FD ++ GI PD
Sbjct: 440 YACSGRVSEAEAALLQMREAGFEPTLFVLTSVIQCYGKAKQVDDVVRTFDQVLELGITPD 499
Query: 740 DRLCGCLLSVVSLCDNSEDIDKVFTCLQQANPKLVAFVNLL-QQNDITFEVVKDEFRNIL 799
DR CGCLL+V++ SE+I K+ C+++A PKL V +L ++ + V K E ++
Sbjct: 500 DRFCGCLLNVMTQTP-SEEIGKLIGCVEKAKPKLGQVVKMLVEEQNCEEGVFKKEASELI 559
Query: 800 GETANEARRPFCNCLIDICRNQNLRERAHELLYLGSLYGLYPGLHNKTEAEWCLDVRSLS 859
++ ++ + NCLID+C N N ERA E+L LG Y +Y GL +K+ +W L ++SLS
Sbjct: 560 DSIGSDVKKAYLNCLIDLCVNLNKLERACEILQLGLEYDIYTGLQSKSATQWSLHLKSLS 619
Query: 860 VGAAQTALEEWMITLSK-IVHREEALPELLSAQTGAGTHRFS-QGLANSFASHVDKLAAP 919
+GAA TAL WM LS+ + E P LL TG G H++S +GLA F SH+ +L AP
Sbjct: 620 LGAALTALHVWMNDLSEAALESGEEFPPLLGINTGHGKHKYSDKGLAAVFESHLKELNAP 679
Query: 920 FQLREDRAGWFVATREDLVTWVHSRVPAVAATA 936
F D+ GWF+ T W+ SR A +A
Sbjct: 680 FHEAPDKVGWFLTTSVAAKAWLESRRSAGGVSA 702
BLAST of HG10003044 vs. ExPASy Swiss-Prot
Match:
Q8GYP6 (Pentatricopeptide repeat-containing protein At1g18900 OS=Arabidopsis thaliana OX=3702 GN=At1g18900 PE=2 SV=1)
HSP 1 Score: 187.6 bits (475), Expect = 6.5e-46
Identity = 131/551 (23.77%), Postives = 246/551 (44.65%), Query Frame = 0
Query: 383 ILNSLRPWQKTHLFFNWIKTQNLFPMETIFYNVAMKSLRYGRQFQLIEDLANEMITTGIE 442
+L + + FF W+K Q F + Y + +L +QF I L +EM+ G +
Sbjct: 337 VLKQMNDYGNALGFFYWLKRQPGFKHDGHTYTTMVGNLGRAKQFGAINKLLDEMVRDGCQ 396
Query: 443 LDNITYSTIITCAKKCSRFDKAMEWFERMYKTGLMPDEVTYSAILDVYANLGKVEEVLSL 502
+ +TY+ +I + + ++AM F +M + G PD VTY ++D++A G ++ + +
Sbjct: 397 PNTVTYNRLIHSYGRANYLNEAMNVFNQMQEAGCKPDRVTYCTLIDIHAKAGFLDIAMDM 456
Query: 503 YERGRASGWKPDPFTFSVLGKMFGEARDYDGIMYVLQEMKSIEVQPNLVVYNTLLDAMGK 562
Y+R +A G PD FT+SV +++ +GK
Sbjct: 457 YQRMQAGGLSPDTFTYSV-----------------------------------IINCLGK 516
Query: 563 AGKPGFARSLFDEMVESGIMPNEKTLTALVKIYGKARWARDALDLWERMRSNGWPMDFIL 622
AG A LF EMV+ G PN T ++ ++ KAR ++AL L+ M++ G+ D +
Sbjct: 517 AGHLPAAHKLFCEMVDQGCTPNLVTYNIMMDLHAKARNYQNALKLYRDMQNAGFEPDKVT 576
Query: 623 YNTLLNMCADLGLEEEAEKLFEEMKKSEHSRPDSWSYTAMLNIHGSGGNVKRSMELFEEM 682
Y+ ++ + G EEAE +F EM++ ++ PD Y ++++ G GNV+++ + ++ M
Sbjct: 577 YSIVMEVLGHCGYLEEAEAVFTEMQQ-KNWIPDEPVYGLLVDLWGKAGNVEKAWQWYQAM 636
Query: 683 LELGVEINVMCCTCLIQCLGKSGRIDDLIRVFDVSVQKGIKPDDRLCGCLLSVVSLCDNS 742
L G+ NV C L+ + +I + + + G++P + LLS + D
Sbjct: 637 LHAGLRPNVPTCNSLLSTFLRVNKIAEAYELLQNMLALGLRPSLQTYTLLLSCCT--DGR 696
Query: 743 EDIDKVFTCLQQANPKLVAFVNLLQQ-----NDITFEVVKDEFRNILGETANEARRPFCN 802
+D F A+ A + LL+ + + F +++ E++R +
Sbjct: 697 SKLDMGFCGQLMASTGHPAHMFLLKMPAAGPDGENVRNHANNFLDLMHSEDRESKRGLVD 756
Query: 803 CLIDICRNQNLRERAHELLYLGSLYGLYP-GLHNKTEAEWCLDVRSLSVGAAQTALEEWM 862
++D +E A + + + ++P L K+ + W +++ +S G A TAL +
Sbjct: 757 AVVDFLHKSGQKEEAGSVWEVAAQKNVFPDALREKSCSYWLINLHVMSEGTAVTALSRTL 816
Query: 863 ITLSKIVHREEALPELLSAQTGAGTHRFSQG---LANSFASHVDKLAAPFQLREDRAGWF 922
K + P + TG G G + + ++ +PF +G F
Sbjct: 817 AWFRKQMLASGTCPSRIDIVTGWGRRSRVTGTSMVRQAVEELLNIFGSPFFTESGNSGCF 849
Query: 923 VATREDLVTWV 925
V + E L W+
Sbjct: 877 VGSGEPLNRWL 849
BLAST of HG10003044 vs. ExPASy TrEMBL
Match:
A0A0A0L6K8 (Smr domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_3G011820 PE=3 SV=1)
HSP 1 Score: 1340.1 bits (3467), Expect = 0.0e+00
Identity = 667/711 (93.81%), Postives = 689/711 (96.91%), Query Frame = 0
Query: 225 MAPPLFSSLDLNLKPTPIFFTSPLPRKNFTKRLTVLCSSSSKSPRRASPISSESSDNKNP 284
MA PL SSLDL LKPTPIFFTSPL RKN TKRLT+LC SSSKSPR+ S +SS+S DNKNP
Sbjct: 1 MAVPLSSSLDLKLKPTPIFFTSPLRRKNVTKRLTLLC-SSSKSPRKPSSVSSQSVDNKNP 60
Query: 285 SLSEQLKNLSTTTLSNAPNDQTHLLSKPKSTWVNPTKPKHSVLSPHRQKRSSYSYNPKMR 344
SLSEQLKNLSTTTLSNAPND+T LLSKPKSTWVNPTKPK SVLS RQKRSSYSYNPKMR
Sbjct: 61 SLSEQLKNLSTTTLSNAPNDETRLLSKPKSTWVNPTKPKRSVLSLQRQKRSSYSYNPKMR 120
Query: 345 DLKSFAHKLNACDSSDEASFMAALEEIPHPPTKENALIILNSLRPWQKTHLFFNWIKTQN 404
DLKSFAHKLNACDSSD+ASF+AALEEIPHPPTKENAL+ILNSLRPWQKTHLFFNWIK+QN
Sbjct: 121 DLKSFAHKLNACDSSDDASFIAALEEIPHPPTKENALLILNSLRPWQKTHLFFNWIKSQN 180
Query: 405 LFPMETIFYNVAMKSLRYGRQFQLIEDLANEMITTGIELDNITYSTIITCAKKCSRFDKA 464
LFPMETIFYNVAMKSLRYGRQFQLIEDLANEMI+ GIELDNITYSTIITCAKKCSRFDKA
Sbjct: 181 LFPMETIFYNVAMKSLRYGRQFQLIEDLANEMISAGIELDNITYSTIITCAKKCSRFDKA 240
Query: 465 MEWFERMYKTGLMPDEVTYSAILDVYANLGKVEEVLSLYERGRASGWKPDPFTFSVLGKM 524
MEWFERMYKTGLMPDEVTYSAILDVYANLGKVEEVLSLYERGRASGW PDP+TFSVLGKM
Sbjct: 241 MEWFERMYKTGLMPDEVTYSAILDVYANLGKVEEVLSLYERGRASGWTPDPYTFSVLGKM 300
Query: 525 FGEARDYDGIMYVLQEMKSIEVQPNLVVYNTLLDAMGKAGKPGFARSLFDEMVESGIMPN 584
FGEA DYDGIMYVLQEMKSIE+QPNLVVYNTLLDAMGKAGKPGFARSLFDEMVESGI PN
Sbjct: 301 FGEAGDYDGIMYVLQEMKSIEMQPNLVVYNTLLDAMGKAGKPGFARSLFDEMVESGITPN 360
Query: 585 EKTLTALVKIYGKARWARDALDLWERMRSNGWPMDFILYNTLLNMCADLGLEEEAEKLFE 644
EKTLTALVKIYGKARWARDALDLWERMRSNGWPMDFILYNTLLNMCADLGLEEEAE LFE
Sbjct: 361 EKTLTALVKIYGKARWARDALDLWERMRSNGWPMDFILYNTLLNMCADLGLEEEAETLFE 420
Query: 645 EMKKSEHSRPDSWSYTAMLNIHGSGGNVKRSMELFEEMLELGVEINVMCCTCLIQCLGKS 704
EMKKS+HSRPDSWSYTAMLNI+GSGGNVKRSMELFEEMLELGVEINVMCCTCLIQCLGKS
Sbjct: 421 EMKKSKHSRPDSWSYTAMLNIYGSGGNVKRSMELFEEMLELGVEINVMCCTCLIQCLGKS 480
Query: 705 GRIDDLIRVFDVSVQKGIKPDDRLCGCLLSVVSLCDNSEDIDKVFTCLQQANPKLVAFVN 764
GRIDDL+RVF+VSVQKGIKPDDRLCGCLLSV+SLC NSEDI+KVFTCLQQANPKLV+F+N
Sbjct: 481 GRIDDLVRVFNVSVQKGIKPDDRLCGCLLSVLSLCYNSEDINKVFTCLQQANPKLVSFIN 540
Query: 765 LLQQNDITFEVVKDEFRNILGETANEARRPFCNCLIDICRNQNLRERAHELLYLGSLYGL 824
LLQQNDITFEVVK+EFRNILGETA EARRPFCNCLIDICRNQNLRERAHELLYLGSLYGL
Sbjct: 541 LLQQNDITFEVVKNEFRNILGETAPEARRPFCNCLIDICRNQNLRERAHELLYLGSLYGL 600
Query: 825 YPGLHNKTEAEWCLDVRSLSVGAAQTALEEWMITLSKIVHREEALPELLSAQTGAGTHRF 884
YPGLHNKTE EWCLDVRSLSVGAAQTALEEWMITLSKIV REEALPELLSAQTGAGTHRF
Sbjct: 601 YPGLHNKTETEWCLDVRSLSVGAAQTALEEWMITLSKIVQREEALPELLSAQTGAGTHRF 660
Query: 885 SQGLANSFASHVDKLAAPFQLREDRAGWFVATREDLVTWVHSRVPAVAATA 936
SQGLANSFASHVDKLAAPFQLREDRAGWFVATREDLVTWVHSRVP+VAATA
Sbjct: 661 SQGLANSFASHVDKLAAPFQLREDRAGWFVATREDLVTWVHSRVPSVAATA 710
BLAST of HG10003044 vs. ExPASy TrEMBL
Match:
A0A5A7UFR2 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold139G001750 PE=3 SV=1)
HSP 1 Score: 1334.3 bits (3452), Expect = 0.0e+00
Identity = 664/711 (93.39%), Postives = 687/711 (96.62%), Query Frame = 0
Query: 225 MAPPLFSSLDLNLKPTPIFFTSPLPRKNFTKRLTVLCSSSSKSPRRASPISSESSDNKNP 284
MA PL SSLD LKPTPIFFTS L RK KRLT+LC SSSKSPR+ S ISSES DNKNP
Sbjct: 1 MAAPLSSSLDFKLKPTPIFFTSLLRRKYVNKRLTLLC-SSSKSPRKPSSISSESIDNKNP 60
Query: 285 SLSEQLKNLSTTTLSNAPNDQTHLLSKPKSTWVNPTKPKHSVLSPHRQKRSSYSYNPKMR 344
SLS+QLKNLSTTTLSNAPND+T LLSKPKSTWVNPTKPK SVLS RQKRSSYSYNPKMR
Sbjct: 61 SLSDQLKNLSTTTLSNAPNDETRLLSKPKSTWVNPTKPKRSVLSLQRQKRSSYSYNPKMR 120
Query: 345 DLKSFAHKLNACDSSDEASFMAALEEIPHPPTKENALIILNSLRPWQKTHLFFNWIKTQN 404
DLKSFAHKLNACDSSDEASF+AALEEIPHPPTKENAL+ILNSLRPWQKTHLFFNWIKTQN
Sbjct: 121 DLKSFAHKLNACDSSDEASFIAALEEIPHPPTKENALLILNSLRPWQKTHLFFNWIKTQN 180
Query: 405 LFPMETIFYNVAMKSLRYGRQFQLIEDLANEMITTGIELDNITYSTIITCAKKCSRFDKA 464
LFPMETIFYNVAMKSLRYGRQFQLIEDLAN+M++TGIELDNITYSTIITCAKKCSRFDKA
Sbjct: 181 LFPMETIFYNVAMKSLRYGRQFQLIEDLANDMVSTGIELDNITYSTIITCAKKCSRFDKA 240
Query: 465 MEWFERMYKTGLMPDEVTYSAILDVYANLGKVEEVLSLYERGRASGWKPDPFTFSVLGKM 524
MEWFERMYKTGLMPDEVTYSAILDVYANLGKVEEVLSLYERGRASGWKPDP+TFSVLGKM
Sbjct: 241 MEWFERMYKTGLMPDEVTYSAILDVYANLGKVEEVLSLYERGRASGWKPDPYTFSVLGKM 300
Query: 525 FGEARDYDGIMYVLQEMKSIEVQPNLVVYNTLLDAMGKAGKPGFARSLFDEMVESGIMPN 584
FGEA DYDGIMYVLQEMKSIE+QPNLVVYNTLLDAMGKAGKPGFARSLFDEMVESGI PN
Sbjct: 301 FGEAGDYDGIMYVLQEMKSIEMQPNLVVYNTLLDAMGKAGKPGFARSLFDEMVESGITPN 360
Query: 585 EKTLTALVKIYGKARWARDALDLWERMRSNGWPMDFILYNTLLNMCADLGLEEEAEKLFE 644
EKTLTALVKIYGKARWARDALDLWERMRSNGWPMDFILYNTLLNMCADLGLEEEAEKLFE
Sbjct: 361 EKTLTALVKIYGKARWARDALDLWERMRSNGWPMDFILYNTLLNMCADLGLEEEAEKLFE 420
Query: 645 EMKKSEHSRPDSWSYTAMLNIHGSGGNVKRSMELFEEMLELGVEINVMCCTCLIQCLGKS 704
EMKKS+HSRPDSWSYTAMLNI+GSGGNVKRSMELFEEML+LGVEINVMCCTCLIQCLGKS
Sbjct: 421 EMKKSKHSRPDSWSYTAMLNIYGSGGNVKRSMELFEEMLKLGVEINVMCCTCLIQCLGKS 480
Query: 705 GRIDDLIRVFDVSVQKGIKPDDRLCGCLLSVVSLCDNSEDIDKVFTCLQQANPKLVAFVN 764
GRIDDL+RVF+VSVQKGIKPDDRLCGCLLSVVSLCDNSEDI+KVFTCLQQANPKLV+FVN
Sbjct: 481 GRIDDLVRVFNVSVQKGIKPDDRLCGCLLSVVSLCDNSEDINKVFTCLQQANPKLVSFVN 540
Query: 765 LLQQNDITFEVVKDEFRNILGETANEARRPFCNCLIDICRNQNLRERAHELLYLGSLYGL 824
LLQQN ITFEV+K+EFRNIL ETA+EARRPFCNCLIDICRNQNLRERAHELLYLGSLYGL
Sbjct: 541 LLQQNSITFEVIKNEFRNILSETASEARRPFCNCLIDICRNQNLRERAHELLYLGSLYGL 600
Query: 825 YPGLHNKTEAEWCLDVRSLSVGAAQTALEEWMITLSKIVHREEALPELLSAQTGAGTHRF 884
YPGLHNKTE EWCLDVRSLSVGAAQTALEEWMITLSKIV R+EALPELLSAQTGAGTHRF
Sbjct: 601 YPGLHNKTETEWCLDVRSLSVGAAQTALEEWMITLSKIVQRKEALPELLSAQTGAGTHRF 660
Query: 885 SQGLANSFASHVDKLAAPFQLREDRAGWFVATREDLVTWVHSRVPAVAATA 936
SQGLANSFASHVDKLAAPFQLREDRAGWFVATREDLVTWVHSRVP+V ATA
Sbjct: 661 SQGLANSFASHVDKLAAPFQLREDRAGWFVATREDLVTWVHSRVPSVPATA 710
BLAST of HG10003044 vs. ExPASy TrEMBL
Match:
A0A1S3BKZ1 (pentatricopeptide repeat-containing protein At5g46580, chloroplastic OS=Cucumis melo OX=3656 GN=LOC103490799 PE=3 SV=1)
HSP 1 Score: 1334.3 bits (3452), Expect = 0.0e+00
Identity = 664/711 (93.39%), Postives = 687/711 (96.62%), Query Frame = 0
Query: 225 MAPPLFSSLDLNLKPTPIFFTSPLPRKNFTKRLTVLCSSSSKSPRRASPISSESSDNKNP 284
MA PL SSLD LKPTPIFFTS L RK KRLT+LC SSSKSPR+ S ISSES DNKNP
Sbjct: 1 MAAPLSSSLDFKLKPTPIFFTSLLRRKYVNKRLTLLC-SSSKSPRKPSSISSESIDNKNP 60
Query: 285 SLSEQLKNLSTTTLSNAPNDQTHLLSKPKSTWVNPTKPKHSVLSPHRQKRSSYSYNPKMR 344
SLS+QLKNLSTTTLSNAPND+T LLSKPKSTWVNPTKPK SVLS RQKRSSYSYNPKMR
Sbjct: 61 SLSDQLKNLSTTTLSNAPNDETRLLSKPKSTWVNPTKPKRSVLSLQRQKRSSYSYNPKMR 120
Query: 345 DLKSFAHKLNACDSSDEASFMAALEEIPHPPTKENALIILNSLRPWQKTHLFFNWIKTQN 404
DLKSFAHKLNACDSSDEASF+AALEEIPHPPTKENAL+ILNSLRPWQKTHLFFNWIKTQN
Sbjct: 121 DLKSFAHKLNACDSSDEASFIAALEEIPHPPTKENALLILNSLRPWQKTHLFFNWIKTQN 180
Query: 405 LFPMETIFYNVAMKSLRYGRQFQLIEDLANEMITTGIELDNITYSTIITCAKKCSRFDKA 464
LFPMETIFYNVAMKSLRYGRQFQLIEDLAN+M++TGIELDNITYSTIITCAKKCSRFDKA
Sbjct: 181 LFPMETIFYNVAMKSLRYGRQFQLIEDLANDMVSTGIELDNITYSTIITCAKKCSRFDKA 240
Query: 465 MEWFERMYKTGLMPDEVTYSAILDVYANLGKVEEVLSLYERGRASGWKPDPFTFSVLGKM 524
MEWFERMYKTGLMPDEVTYSAILDVYANLGKVEEVLSLYERGRASGWKPDP+TFSVLGKM
Sbjct: 241 MEWFERMYKTGLMPDEVTYSAILDVYANLGKVEEVLSLYERGRASGWKPDPYTFSVLGKM 300
Query: 525 FGEARDYDGIMYVLQEMKSIEVQPNLVVYNTLLDAMGKAGKPGFARSLFDEMVESGIMPN 584
FGEA DYDGIMYVLQEMKSIE+QPNLVVYNTLLDAMGKAGKPGFARSLFDEMVESGI PN
Sbjct: 301 FGEAGDYDGIMYVLQEMKSIEMQPNLVVYNTLLDAMGKAGKPGFARSLFDEMVESGITPN 360
Query: 585 EKTLTALVKIYGKARWARDALDLWERMRSNGWPMDFILYNTLLNMCADLGLEEEAEKLFE 644
EKTLTALVKIYGKARWARDALDLWERMRSNGWPMDFILYNTLLNMCADLGLEEEAEKLFE
Sbjct: 361 EKTLTALVKIYGKARWARDALDLWERMRSNGWPMDFILYNTLLNMCADLGLEEEAEKLFE 420
Query: 645 EMKKSEHSRPDSWSYTAMLNIHGSGGNVKRSMELFEEMLELGVEINVMCCTCLIQCLGKS 704
EMKKS+HSRPDSWSYTAMLNI+GSGGNVKRSMELFEEML+LGVEINVMCCTCLIQCLGKS
Sbjct: 421 EMKKSKHSRPDSWSYTAMLNIYGSGGNVKRSMELFEEMLKLGVEINVMCCTCLIQCLGKS 480
Query: 705 GRIDDLIRVFDVSVQKGIKPDDRLCGCLLSVVSLCDNSEDIDKVFTCLQQANPKLVAFVN 764
GRIDDL+RVF+VSVQKGIKPDDRLCGCLLSVVSLCDNSEDI+KVFTCLQQANPKLV+FVN
Sbjct: 481 GRIDDLVRVFNVSVQKGIKPDDRLCGCLLSVVSLCDNSEDINKVFTCLQQANPKLVSFVN 540
Query: 765 LLQQNDITFEVVKDEFRNILGETANEARRPFCNCLIDICRNQNLRERAHELLYLGSLYGL 824
LLQQN ITFEV+K+EFRNIL ETA+EARRPFCNCLIDICRNQNLRERAHELLYLGSLYGL
Sbjct: 541 LLQQNSITFEVIKNEFRNILSETASEARRPFCNCLIDICRNQNLRERAHELLYLGSLYGL 600
Query: 825 YPGLHNKTEAEWCLDVRSLSVGAAQTALEEWMITLSKIVHREEALPELLSAQTGAGTHRF 884
YPGLHNKTE EWCLDVRSLSVGAAQTALEEWMITLSKIV R+EALPELLSAQTGAGTHRF
Sbjct: 601 YPGLHNKTETEWCLDVRSLSVGAAQTALEEWMITLSKIVQRKEALPELLSAQTGAGTHRF 660
Query: 885 SQGLANSFASHVDKLAAPFQLREDRAGWFVATREDLVTWVHSRVPAVAATA 936
SQGLANSFASHVDKLAAPFQLREDRAGWFVATREDLVTWVHSRVP+V ATA
Sbjct: 661 SQGLANSFASHVDKLAAPFQLREDRAGWFVATREDLVTWVHSRVPSVPATA 710
BLAST of HG10003044 vs. ExPASy TrEMBL
Match:
A0A6J1E5X7 (pentatricopeptide repeat-containing protein At5g46580, chloroplastic-like OS=Cucurbita moschata OX=3662 GN=LOC111431017 PE=3 SV=1)
HSP 1 Score: 1278.8 bits (3308), Expect = 0.0e+00
Identity = 638/714 (89.36%), Postives = 671/714 (93.98%), Query Frame = 0
Query: 225 MAPPLFSSLD--LNLKPT-PIFFTSPLPRKNFTKRLTVLCSSSSKSPRRASPISSESSDN 284
MA PL SSLD L LKPT P+FFTSPL R NFTKR TVLC+SSSKSPR S+D
Sbjct: 1 MAAPLSSSLDIKLKLKPTPPLFFTSPLRRNNFTKRFTVLCTSSSKSPR--------STDK 60
Query: 285 KNPSLSEQLKNLSTTTLSNAPNDQTHLLSKPKSTWVNPTKPKHSVLSPHRQKRSSYSYNP 344
KNPSLSEQLK+LST+TLSNA ND++HLLS PKS WVNPTKPK SVLS RQKRSSYSYNP
Sbjct: 61 KNPSLSEQLKDLSTSTLSNASNDESHLLSNPKSIWVNPTKPKRSVLSLQRQKRSSYSYNP 120
Query: 345 KMRDLKSFAHKLNACDSSDEASFMAALEEIPHPPTKENALIILNSLRPWQKTHLFFNWIK 404
KMR+LK+FAHKLNA DSS EA+FMA L+EIPHPPTKENAL+ILNSL+PWQKTHLFFNWIK
Sbjct: 121 KMRELKTFAHKLNASDSS-EAAFMAVLKEIPHPPTKENALLILNSLKPWQKTHLFFNWIK 180
Query: 405 TQNLFPMETIFYNVAMKSLRYGRQFQLIEDLANEMITTGIELDNITYSTIITCAKKCSRF 464
TQNLFPMETIFYNVAMKSLRYGRQFQLIE+LANEMI TGIELDNITYSTIITCAKKCSRF
Sbjct: 181 TQNLFPMETIFYNVAMKSLRYGRQFQLIEELANEMINTGIELDNITYSTIITCAKKCSRF 240
Query: 465 DKAMEWFERMYKTGLMPDEVTYSAILDVYANLGKVEEVLSLYERGRASGWKPDPFTFSVL 524
DKAMEWFERMY+TGLMPDEVTYSAILDVYANLGKVEE LSLYERGRASGWKPDP+TFSVL
Sbjct: 241 DKAMEWFERMYRTGLMPDEVTYSAILDVYANLGKVEEALSLYERGRASGWKPDPYTFSVL 300
Query: 525 GKMFGEARDYDGIMYVLQEMKSIEVQPNLVVYNTLLDAMGKAGKPGFARSLFDEMVESGI 584
GKMFGEA DYDGIMYVLQEMKSIEVQPNLVVYNTLLDAMGKAG+PGFARSLF+EM+ESGI
Sbjct: 301 GKMFGEAGDYDGIMYVLQEMKSIEVQPNLVVYNTLLDAMGKAGRPGFARSLFEEMIESGI 360
Query: 585 MPNEKTLTALVKIYGKARWARDALDLWERMRSNGWPMDFILYNTLLNMCADLGLEEEAEK 644
PNEKTLTALVKIYGKARWARDAL+LWERMRS GWPMDFILYNTLLNMCADLGLEEEAEK
Sbjct: 361 TPNEKTLTALVKIYGKARWARDALELWERMRSKGWPMDFILYNTLLNMCADLGLEEEAEK 420
Query: 645 LFEEMKKSEHSRPDSWSYTAMLNIHGSGGNVKRSMELFEEMLELGVEINVMCCTCLIQCL 704
LFEEMKKSEHSRPDSWSYTAMLNIHGSGGNVKRSMELFEEMLELGV INVMCCTCLIQCL
Sbjct: 421 LFEEMKKSEHSRPDSWSYTAMLNIHGSGGNVKRSMELFEEMLELGVGINVMCCTCLIQCL 480
Query: 705 GKSGRIDDLIRVFDVSVQKGIKPDDRLCGCLLSVVSLCDNSEDIDKVFTCLQQANPKLVA 764
GK+ RIDDL+RVFDVSV+KG++PDDRLCGCLLSVVSLCDN+EDI KVFTCLQQANPKLVA
Sbjct: 481 GKARRIDDLVRVFDVSVRKGVEPDDRLCGCLLSVVSLCDNNEDISKVFTCLQQANPKLVA 540
Query: 765 FVNLLQQNDITFEVVKDEFRNILGETANEARRPFCNCLIDICRNQNLRERAHELLYLGSL 824
FVNLLQQNDITF+V+KDEFR ILGETA EARRPFCNCLIDICRNQNL +RAHELLYLGSL
Sbjct: 541 FVNLLQQNDITFDVIKDEFRTILGETATEARRPFCNCLIDICRNQNLSKRAHELLYLGSL 600
Query: 825 YGLYPGLHNKTEAEWCLDVRSLSVGAAQTALEEWMITLSKIVHREEALPELLSAQTGAGT 884
YGLYPGLHNKTE EWCLDVRSLSVGAAQTALEEWMITLSKIV REEALPELLSAQTGAGT
Sbjct: 601 YGLYPGLHNKTEGEWCLDVRSLSVGAAQTALEEWMITLSKIVQREEALPELLSAQTGAGT 660
Query: 885 HRFSQGLANSFASHVDKLAAPFQLREDRAGWFVATREDLVTWVHSRVPAVAATA 936
HRFSQGLANSFASHV+KLAAPF+LREDRAGWFVATRED+V WVHSRVP+VA A
Sbjct: 661 HRFSQGLANSFASHVEKLAAPFRLREDRAGWFVATREDVVAWVHSRVPSVATRA 705
BLAST of HG10003044 vs. ExPASy TrEMBL
Match:
A0A6J1CW10 (pentatricopeptide repeat-containing protein At5g46580, chloroplastic OS=Momordica charantia OX=3673 GN=LOC111014808 PE=3 SV=1)
HSP 1 Score: 1273.8 bits (3295), Expect = 0.0e+00
Identity = 637/698 (91.26%), Postives = 672/698 (96.28%), Query Frame = 0
Query: 238 KPTPIFFTSPLPRKNFTKRLTVLCSSSSKSPRRASPISSESSDNKNPSLSEQLKNLSTTT 297
KP+P+FFTSPL RKNFTKRLTVLC SSSKSPR+ SP +S S++KNPSLS+QLKNLSTTT
Sbjct: 21 KPSPMFFTSPLRRKNFTKRLTVLC-SSSKSPRK-SPQTSSQSNHKNPSLSDQLKNLSTTT 80
Query: 298 LSNAP-NDQTHLLSKPKSTWVNPTKPKHSVLSPHRQKRSSYSYNPKMRDLKSFAHKLNAC 357
LS +P +D++HLLSKPKSTWVNPTKPK SVLS RQKRSSYSYNPKMR+LKSFA KLNAC
Sbjct: 81 LSTSPKDDESHLLSKPKSTWVNPTKPKRSVLSLQRQKRSSYSYNPKMRELKSFAQKLNAC 140
Query: 358 DSSDEASFMAALEEIPHPPTKENALIILNSLRPWQKTHLFFNWIKTQNLFPMETIFYNVA 417
DSS E++F+AALEEIPHPPTKENAL+ILNSL+PWQKT +FFNWIK+QNLFPMETIFYNVA
Sbjct: 141 DSS-ESAFVAALEEIPHPPTKENALLILNSLKPWQKTQMFFNWIKSQNLFPMETIFYNVA 200
Query: 418 MKSLRYGRQFQLIEDLANEMITTGIELDNITYSTIITCAKKCSRFDKAMEWFERMYKTGL 477
MKSLRYGRQFQ+IEDLANEMI++GIELDNITYSTIITCAKKCSRFDKAMEWFERMYKTGL
Sbjct: 201 MKSLRYGRQFQVIEDLANEMISSGIELDNITYSTIITCAKKCSRFDKAMEWFERMYKTGL 260
Query: 478 MPDEVTYSAILDVYANLGKVEEVLSLYERGRASGWKPDPFTFSVLGKMFGEARDYDGIMY 537
MPDEVTYSAILDVYANLGKVEEVLSLYERGRASGWKPDP TFSVLGKMFGEA DYDGIMY
Sbjct: 261 MPDEVTYSAILDVYANLGKVEEVLSLYERGRASGWKPDPVTFSVLGKMFGEAGDYDGIMY 320
Query: 538 VLQEMKSIEVQPNLVVYNTLLDAMGKAGKPGFARSLFDEMVESGIMPNEKTLTALVKIYG 597
VLQEMKSIEVQPNLVVYNTLLDAMGKAGKPGFARSLFDEM+ESGI PNEKTLTALVKIYG
Sbjct: 321 VLQEMKSIEVQPNLVVYNTLLDAMGKAGKPGFARSLFDEMIESGITPNEKTLTALVKIYG 380
Query: 598 KARWARDALDLWERMRSNGWPMDFILYNTLLNMCADLGLEEEAEKLFEEMKKSEHSRPDS 657
KARWARDAL LWERMRSNGWPMDFILYNTLLNMCADLGLEEEAEKLFEEMKKSE+SRPDS
Sbjct: 381 KARWARDALYLWERMRSNGWPMDFILYNTLLNMCADLGLEEEAEKLFEEMKKSENSRPDS 440
Query: 658 WSYTAMLNIHGSGGNVKRSMELFEEMLELGVEINVMCCTCLIQCLGKSGRIDDLIRVFDV 717
WSYTAMLNIHGSGGNVKRSMELFEEMLELGVEINVM CTCLIQCLGK+ RIDDL+RVFDV
Sbjct: 441 WSYTAMLNIHGSGGNVKRSMELFEEMLELGVEINVMGCTCLIQCLGKARRIDDLVRVFDV 500
Query: 718 SVQKGIKPDDRLCGCLLSVVSLCDNSEDIDKVFTCLQQANPKLVAFVNLLQQNDITFEVV 777
SVQKGIKPDDRLCGCLLSVVSLCDNSEDI+KVFTCLQQANP LVAF+NLLQQN ITFEVV
Sbjct: 501 SVQKGIKPDDRLCGCLLSVVSLCDNSEDINKVFTCLQQANPNLVAFINLLQQNVITFEVV 560
Query: 778 KDEFRNILGETANEARRPFCNCLIDICRNQNLRERAHELLYLGSLYGLYPGLHNKTEAEW 837
K+EFR ILGETA EARRPFCNCLIDICRNQNLRERAHELLYLGSLYGLYPGLHNKTE+EW
Sbjct: 561 KEEFRKILGETATEARRPFCNCLIDICRNQNLRERAHELLYLGSLYGLYPGLHNKTESEW 620
Query: 838 CLDVRSLSVGAAQTALEEWMITLSKIVHREEALPELLSAQTGAGTHRFSQGLANSFASHV 897
CLDVRSLSVGAAQTALEEWM TLSKIV REEALP+LLSAQTGAGTHRFSQGLANSFASHV
Sbjct: 621 CLDVRSLSVGAAQTALEEWMTTLSKIVQREEALPQLLSAQTGAGTHRFSQGLANSFASHV 680
Query: 898 DKLAAPFQLREDRAGWFVATREDLVTWVHSRVPAVAAT 935
+KLAAPF+LREDRAGWFVATREDLV+WVHSRVP+VAAT
Sbjct: 681 EKLAAPFRLREDRAGWFVATREDLVSWVHSRVPSVAAT 715
BLAST of HG10003044 vs. TAIR 10
Match:
AT5G46580.1 (pentatricopeptide (PPR) repeat-containing protein )
HSP 1 Score: 969.1 bits (2504), Expect = 2.4e-282
Identity = 470/705 (66.67%), Postives = 583/705 (82.70%), Query Frame = 0
Query: 230 FSSLDLNLKPTPIFFTSPLPRKNFTKRLTVLCSSSSKSPRRASPISSESSDNKNPSLSEQ 289
F+ + + K +F L R++ +++L + C SS K P+ + E K PSLSEQ
Sbjct: 13 FNPQNSDTKKHSLFLKPSLFRQSRSRKLNISC-SSLKQPK---TLEEEPITTKTPSLSEQ 72
Query: 290 LKNLSTTTLSNAPNDQTHLLSKPKSTWVNPTKPKHSVLSPHRQKRSSYSYNPKMRDLKSF 349
LK LS TTL +QT +LSKPKS WVNPT+PK SVLS RQKRS+YSYNP+++DL++F
Sbjct: 73 LKPLSATTLR---QEQTQILSKPKSVWVNPTRPKRSVLSLQRQKRSAYSYNPQIKDLRAF 132
Query: 350 AHKLNACDSSDEASFMAALEEIPHPPTKENALIILNSLRPWQKTHLFFNWIKTQNLFPME 409
A KLN+ ++++ F++ L+EIPHPP ++NAL++LNSLR WQKTH FFNW+K+++LFPME
Sbjct: 133 ALKLNSSIFTEKSEFLSLLDEIPHPPNRDNALLVLNSLREWQKTHTFFNWVKSKSLFPME 192
Query: 410 TIFYNVAMKSLRYGRQFQLIEDLANEMITTGIELDNITYSTIITCAKKCSRFDKAMEWFE 469
TIFYNV MKSLR+GRQFQLIE++A EM+ G+ELDNITYSTIITCAK+C+ ++KA+EWFE
Sbjct: 193 TIFYNVTMKSLRFGRQFQLIEEMALEMVKDGVELDNITYSTIITCAKRCNLYNKAIEWFE 252
Query: 470 RMYKTGLMPDEVTYSAILDVYANLGKVEEVLSLYERGRASGWKPDPFTFSVLGKMFGEAR 529
RMYKTGLMPDEVTYSAILDVY+ GKVEEVLSLYER A+GWKPD FSVLGKMFGEA
Sbjct: 253 RMYKTGLMPDEVTYSAILDVYSKSGKVEEVLSLYERAVATGWKPDAIAFSVLGKMFGEAG 312
Query: 530 DYDGIMYVLQEMKSIEVQPNLVVYNTLLDAMGKAGKPGFARSLFDEMVESGIMPNEKTLT 589
DYDGI YVLQEMKS++V+PN+VVYNTLL+AMG+AGKPG ARSLF+EM+E+G+ PNEKTLT
Sbjct: 313 DYDGIRYVLQEMKSMDVKPNVVVYNTLLEAMGRAGKPGLARSLFNEMLEAGLTPNEKTLT 372
Query: 590 ALVKIYGKARWARDALDLWERMRSNGWPMDFILYNTLLNMCADLGLEEEAEKLFEEMKKS 649
ALVKIYGKARWARDAL LWE M++ WPMDFILYNTLLNMCAD+GLEEEAE+LF +MK+S
Sbjct: 373 ALVKIYGKARWARDALQLWEEMKAKKWPMDFILYNTLLNMCADIGLEEEAERLFNDMKES 432
Query: 650 EHSRPDSWSYTAMLNIHGSGGNVKRSMELFEEMLELGVEINVMCCTCLIQCLGKSGRIDD 709
RPD++SYTAMLNI+GSGG +++MELFEEML+ GV++NVM CTCL+QCLGK+ RIDD
Sbjct: 433 VQCRPDNFSYTAMLNIYGSGGKAEKAMELFEEMLKAGVQVNVMGCTCLVQCLGKAKRIDD 492
Query: 710 LIRVFDVSVQKGIKPDDRLCGCLLSVVSLCDNSEDIDKVFTCLQQANPKLVAFVNLLQQN 769
++ VFD+S+++G+KPDDRLCGCLLSV++LC++SED +KV CL++AN KLV FVNL+
Sbjct: 493 VVYVFDLSIKRGVKPDDRLCGCLLSVMALCESSEDAEKVMACLERANKKLVTFVNLIVDE 552
Query: 770 DITFEVVKDEFRNILGETANEARRPFCNCLIDICRNQNLRERAHELLYLGSLYGLYPGLH 829
+E VK+EF+ ++ T EARRPFCNCLIDICR N ERAHELLYLG+L+GLYPGLH
Sbjct: 553 KTEYETVKEEFKLVINATQVEARRPFCNCLIDICRGNNRHERAHELLYLGTLFGLYPGLH 612
Query: 830 NKTEAEWCLDVRSLSVGAAQTALEEWMITLSKIVHREEALPELLSAQTGAGTHRFSQGLA 889
NKT EW LDVRSLSVGAA+TALEEWM TL+ I+ R+E LPEL AQTG GTHRFSQGLA
Sbjct: 613 NKTIKEWSLDVRSLSVGAAETALEEWMRTLANIIKRQEELPELFLAQTGTGTHRFSQGLA 672
Query: 890 NSFASHVDKLAAPFQLREDRAGWFVATREDLVTWVHSRVPAVAAT 935
NSFA H+ +L+APF+ + DR G FVAT+EDLV+W+ S+ P + +
Sbjct: 673 NSFALHLQQLSAPFR-QSDRPGIFVATKEDLVSWLESKFPPLVTS 709
BLAST of HG10003044 vs. TAIR 10
Match:
AT4G16390.1 (pentatricopeptide (PPR) repeat-containing protein )
HSP 1 Score: 434.5 bits (1116), Expect = 2.2e-121
Identity = 254/693 (36.65%), Postives = 390/693 (56.28%), Query Frame = 0
Query: 260 LCSSSSKSPRRASPISSESSDNKNPSLSEQLKNLSTTTLS---NAPNDQTHLL------- 319
LC+ S P +++P S SS N N S L T +S P + L
Sbjct: 20 LCNLLSVYP-KSTPRSFLSSYNPNSSHFHSRNLLQATHVSVQEAIPQSEKSKLVDVDLPI 79
Query: 320 ----SKPKSTWVNPTKPKHSVLSPHRQKRSSYSYNPKMRDLKSFAHKLNACDSSDEASFM 379
+ WVNP P+ S L R+K SY+ + L A L+AC +EA
Sbjct: 80 PEPTASKSYVWVNPKSPRASQL---RRK----SYDSRYSSLIKLAESLDAC-KPNEADVC 139
Query: 380 AALEEIPHPPTKENALIILNSLRPWQKTHLFFNWIKTQNLFPMETIFYNVAMKSLRYGRQ 439
+ +++A++ LN++ + L N + E I YNV MK R +
Sbjct: 140 DVITGFGGKLFEQDAVVTLNNMTNPETAPLVLNNLLETMKPSREVILYNVTMKVFRKSKD 199
Query: 440 FQLIEDLANEMITTGIELDNITYSTIITCAKKCSRFDKAMEWFERMYKTGLMPDEVTYSA 499
+ E L +EM+ GI+ DN T++TII+CA++ +A+EWFE+M G PD VT +A
Sbjct: 200 LEKSEKLFDEMLERGIKPDNATFTTIISCARQNGVPKRAVEWFEKMSSFGCEPDNVTMAA 259
Query: 500 ILDVYANLGKVEEVLSLYERGRASGWKPDPFTFSVLGKMFGEARDYDGIMYVLQEMKSIE 559
++D Y G V+ LSLY+R R W+ D TFS L +++G + +YDG + + +EMK++
Sbjct: 260 MIDAYGRAGNVDMALSLYDRARTEKWRIDAVTFSTLIRIYGVSGNYDGCLNIYEEMKALG 319
Query: 560 VQPNLVVYNTLLDAMGKAGKPGFARSLFDEMVESGIMPNEKTLTALVKIYGKARWARDAL 619
V+PNLV+YN L+D+MG+A +P A+ ++ +++ +G PN T ALV+ YG+AR+ DAL
Sbjct: 320 VKPNLVIYNRLIDSMGRAKRPWQAKIIYKDLITNGFTPNWSTYAALVRAYGRARYGDDAL 379
Query: 620 DLWERMRSNGWPMDFILYNTLLNMCADLGLEEEAEKLFEEMKKSEHSRPDSWSYTAMLNI 679
++ M+ G + ILYNTLL+MCAD +EA ++F++MK E PDSW++++++ +
Sbjct: 380 AIYREMKEKGLSLTVILYNTLLSMCADNRYVDEAFEIFQDMKNCETCDPDSWTFSSLITV 439
Query: 680 HGSGGNVKRSMELFEEMLELGVEINVMCCTCLIQCLGKSGRIDDLIRVFDVSVQKGIKPD 739
+ G V + +M E G E + T +IQC GK+ ++DD++R FD ++ GI PD
Sbjct: 440 YACSGRVSEAEAALLQMREAGFEPTLFVLTSVIQCYGKAKQVDDVVRTFDQVLELGITPD 499
Query: 740 DRLCGCLLSVVSLCDNSEDIDKVFTCLQQANPKLVAFVNLL-QQNDITFEVVKDEFRNIL 799
DR CGCLL+V++ SE+I K+ C+++A PKL V +L ++ + V K E ++
Sbjct: 500 DRFCGCLLNVMTQTP-SEEIGKLIGCVEKAKPKLGQVVKMLVEEQNCEEGVFKKEASELI 559
Query: 800 GETANEARRPFCNCLIDICRNQNLRERAHELLYLGSLYGLYPGLHNKTEAEWCLDVRSLS 859
++ ++ + NCLID+C N N ERA E+L LG Y +Y GL +K+ +W L ++SLS
Sbjct: 560 DSIGSDVKKAYLNCLIDLCVNLNKLERACEILQLGLEYDIYTGLQSKSATQWSLHLKSLS 619
Query: 860 VGAAQTALEEWMITLSK-IVHREEALPELLSAQTGAGTHRFS-QGLANSFASHVDKLAAP 919
+GAA TAL WM LS+ + E P LL TG G H++S +GLA F SH+ +L AP
Sbjct: 620 LGAALTALHVWMNDLSEAALESGEEFPPLLGINTGHGKHKYSDKGLAAVFESHLKELNAP 679
Query: 920 FQLREDRAGWFVATREDLVTWVHSRVPAVAATA 936
F D+ GWF+ T W+ SR A +A
Sbjct: 680 FHEAPDKVGWFLTTSVAAKAWLESRRSAGGVSA 702
BLAST of HG10003044 vs. TAIR 10
Match:
AT1G32520.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 15 growth stages; Has 143 Blast hits to 142 proteins in 34 species: Archae - 0; Bacteria - 0; Metazoa - 39; Fungi - 0; Plants - 56; Viruses - 0; Other Eukaryotes - 48 (source: NCBI BLink). )
HSP 1 Score: 316.2 bits (809), Expect = 8.6e-86
Identity = 159/228 (69.74%), Postives = 180/228 (78.95%), Query Frame = 0
Query: 1 MASCIFNNVFYRL--KTTPSCS-----FGWNWNFGNGNKKEDKPQIKYHDIVLPFPLSLV 60
MASC +N F+ + KT S FGWN N + + + D+ +PF LS+V
Sbjct: 1 MASCAISNSFHTVTFKTLKRISPYNSLFGWNSGKKIDNIRPPQQPAYHDDVEIPFSLSMV 60
Query: 61 DKTFLKRKELTCCYKATSDGFSATDFHTCCDFKGPCVIIGYT-DKSFKFGAFNPEGYRST 120
+KTFLK +EL CCYKA+ DGF AT FH CDFKGPCVII YT DKSFKFG F+PEGYRST
Sbjct: 61 NKTFLKGRELKCCYKASIDGFGATKFHERCDFKGPCVIIAYTKDKSFKFGGFSPEGYRST 120
Query: 121 DDYYDTFDAFLFYWEENEDADPIILPKVGGSGAALFDYARGGPQFGADGLLIGPPLAPVM 180
DDYYDTFDAFLFYW E+ D DPI+LPKVGGSGAALFDYARGGPQFGADGLLIGPPLAPVM
Sbjct: 121 DDYYDTFDAFLFYWLEDCD-DPIVLPKVGGSGAALFDYARGGPQFGADGLLIGPPLAPVM 180
Query: 181 GGFAGPDTNSGVGDLRQARSRLGLSYAKRKDGKDSIFGDENRAVVAEV 221
GGFAGPDTNSG+GDLR A+SRLGLSYAKRKDGK+SIFGDEN+ + +V
Sbjct: 181 GGFAGPDTNSGIGDLRVAKSRLGLSYAKRKDGKESIFGDENKVSLDDV 227
BLAST of HG10003044 vs. TAIR 10
Match:
AT1G18900.1 (Pentatricopeptide repeat (PPR) superfamily protein )
HSP 1 Score: 187.6 bits (475), Expect = 4.6e-47
Identity = 131/551 (23.77%), Postives = 246/551 (44.65%), Query Frame = 0
Query: 383 ILNSLRPWQKTHLFFNWIKTQNLFPMETIFYNVAMKSLRYGRQFQLIEDLANEMITTGIE 442
+L + + FF W+K Q F + Y + +L +QF I L +EM+ G +
Sbjct: 337 VLKQMNDYGNALGFFYWLKRQPGFKHDGHTYTTMVGNLGRAKQFGAINKLLDEMVRDGCQ 396
Query: 443 LDNITYSTIITCAKKCSRFDKAMEWFERMYKTGLMPDEVTYSAILDVYANLGKVEEVLSL 502
+ +TY+ +I + + ++AM F +M + G PD VTY ++D++A G ++ + +
Sbjct: 397 PNTVTYNRLIHSYGRANYLNEAMNVFNQMQEAGCKPDRVTYCTLIDIHAKAGFLDIAMDM 456
Query: 503 YERGRASGWKPDPFTFSVLGKMFGEARDYDGIMYVLQEMKSIEVQPNLVVYNTLLDAMGK 562
Y+R +A G PD FT+SV +++ +GK
Sbjct: 457 YQRMQAGGLSPDTFTYSV-----------------------------------IINCLGK 516
Query: 563 AGKPGFARSLFDEMVESGIMPNEKTLTALVKIYGKARWARDALDLWERMRSNGWPMDFIL 622
AG A LF EMV+ G PN T ++ ++ KAR ++AL L+ M++ G+ D +
Sbjct: 517 AGHLPAAHKLFCEMVDQGCTPNLVTYNIMMDLHAKARNYQNALKLYRDMQNAGFEPDKVT 576
Query: 623 YNTLLNMCADLGLEEEAEKLFEEMKKSEHSRPDSWSYTAMLNIHGSGGNVKRSMELFEEM 682
Y+ ++ + G EEAE +F EM++ ++ PD Y ++++ G GNV+++ + ++ M
Sbjct: 577 YSIVMEVLGHCGYLEEAEAVFTEMQQ-KNWIPDEPVYGLLVDLWGKAGNVEKAWQWYQAM 636
Query: 683 LELGVEINVMCCTCLIQCLGKSGRIDDLIRVFDVSVQKGIKPDDRLCGCLLSVVSLCDNS 742
L G+ NV C L+ + +I + + + G++P + LLS + D
Sbjct: 637 LHAGLRPNVPTCNSLLSTFLRVNKIAEAYELLQNMLALGLRPSLQTYTLLLSCCT--DGR 696
Query: 743 EDIDKVFTCLQQANPKLVAFVNLLQQ-----NDITFEVVKDEFRNILGETANEARRPFCN 802
+D F A+ A + LL+ + + F +++ E++R +
Sbjct: 697 SKLDMGFCGQLMASTGHPAHMFLLKMPAAGPDGENVRNHANNFLDLMHSEDRESKRGLVD 756
Query: 803 CLIDICRNQNLRERAHELLYLGSLYGLYP-GLHNKTEAEWCLDVRSLSVGAAQTALEEWM 862
++D +E A + + + ++P L K+ + W +++ +S G A TAL +
Sbjct: 757 AVVDFLHKSGQKEEAGSVWEVAAQKNVFPDALREKSCSYWLINLHVMSEGTAVTALSRTL 816
Query: 863 ITLSKIVHREEALPELLSAQTGAGTHRFSQG---LANSFASHVDKLAAPFQLREDRAGWF 922
K + P + TG G G + + ++ +PF +G F
Sbjct: 817 AWFRKQMLASGTCPSRIDIVTGWGRRSRVTGTSMVRQAVEELLNIFGSPFFTESGNSGCF 849
Query: 923 VATREDLVTWV 925
V + E L W+
Sbjct: 877 VGSGEPLNRWL 849
BLAST of HG10003044 vs. TAIR 10
Match:
AT1G18900.2 (Pentatricopeptide repeat (PPR) superfamily protein )
HSP 1 Score: 187.6 bits (475), Expect = 4.6e-47
Identity = 131/551 (23.77%), Postives = 246/551 (44.65%), Query Frame = 0
Query: 383 ILNSLRPWQKTHLFFNWIKTQNLFPMETIFYNVAMKSLRYGRQFQLIEDLANEMITTGIE 442
+L + + FF W+K Q F + Y + +L +QF I L +EM+ G +
Sbjct: 337 VLKQMNDYGNALGFFYWLKRQPGFKHDGHTYTTMVGNLGRAKQFGAINKLLDEMVRDGCQ 396
Query: 443 LDNITYSTIITCAKKCSRFDKAMEWFERMYKTGLMPDEVTYSAILDVYANLGKVEEVLSL 502
+ +TY+ +I + + ++AM F +M + G PD VTY ++D++A G ++ + +
Sbjct: 397 PNTVTYNRLIHSYGRANYLNEAMNVFNQMQEAGCKPDRVTYCTLIDIHAKAGFLDIAMDM 456
Query: 503 YERGRASGWKPDPFTFSVLGKMFGEARDYDGIMYVLQEMKSIEVQPNLVVYNTLLDAMGK 562
Y+R +A G PD FT+SV +++ +GK
Sbjct: 457 YQRMQAGGLSPDTFTYSV-----------------------------------IINCLGK 516
Query: 563 AGKPGFARSLFDEMVESGIMPNEKTLTALVKIYGKARWARDALDLWERMRSNGWPMDFIL 622
AG A LF EMV+ G PN T ++ ++ KAR ++AL L+ M++ G+ D +
Sbjct: 517 AGHLPAAHKLFCEMVDQGCTPNLVTYNIMMDLHAKARNYQNALKLYRDMQNAGFEPDKVT 576
Query: 623 YNTLLNMCADLGLEEEAEKLFEEMKKSEHSRPDSWSYTAMLNIHGSGGNVKRSMELFEEM 682
Y+ ++ + G EEAE +F EM++ ++ PD Y ++++ G GNV+++ + ++ M
Sbjct: 577 YSIVMEVLGHCGYLEEAEAVFTEMQQ-KNWIPDEPVYGLLVDLWGKAGNVEKAWQWYQAM 636
Query: 683 LELGVEINVMCCTCLIQCLGKSGRIDDLIRVFDVSVQKGIKPDDRLCGCLLSVVSLCDNS 742
L G+ NV C L+ + +I + + + G++P + LLS + D
Sbjct: 637 LHAGLRPNVPTCNSLLSTFLRVNKIAEAYELLQNMLALGLRPSLQTYTLLLSCCT--DGR 696
Query: 743 EDIDKVFTCLQQANPKLVAFVNLLQQ-----NDITFEVVKDEFRNILGETANEARRPFCN 802
+D F A+ A + LL+ + + F +++ E++R +
Sbjct: 697 SKLDMGFCGQLMASTGHPAHMFLLKMPAAGPDGENVRNHANNFLDLMHSEDRESKRGLVD 756
Query: 803 CLIDICRNQNLRERAHELLYLGSLYGLYP-GLHNKTEAEWCLDVRSLSVGAAQTALEEWM 862
++D +E A + + + ++P L K+ + W +++ +S G A TAL +
Sbjct: 757 AVVDFLHKSGQKEEAGSVWEVAAQKNVFPDALREKSCSYWLINLHVMSEGTAVTALSRTL 816
Query: 863 ITLSKIVHREEALPELLSAQTGAGTHRFSQG---LANSFASHVDKLAAPFQLREDRAGWF 922
K + P + TG G G + + ++ +PF +G F
Sbjct: 817 AWFRKQMLASGTCPSRIDIVTGWGRRSRVTGTSMVRQAVEELLNIFGSPFFTESGNSGCF 849
Query: 923 VATREDLVTWV 925
V + E L W+
Sbjct: 877 VGSGEPLNRWL 849
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
XP_038906217.1 | 0.0e+00 | 95.08 | pentatricopeptide repeat-containing protein At5g46580, chloroplastic [Benincasa ... | [more] |
XP_004148730.1 | 0.0e+00 | 93.81 | pentatricopeptide repeat-containing protein At5g46580, chloroplastic [Cucumis sa... | [more] |
XP_008448710.1 | 0.0e+00 | 93.39 | PREDICTED: pentatricopeptide repeat-containing protein At5g46580, chloroplastic ... | [more] |
KAG7015666.1 | 0.0e+00 | 89.36 | Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita a... | [more] |
XP_023552442.1 | 0.0e+00 | 89.36 | pentatricopeptide repeat-containing protein At5g46580, chloroplastic-like [Cucur... | [more] |
Match Name | E-value | Identity | Description | |
Q9LS25 | 3.4e-281 | 66.67 | Pentatricopeptide repeat-containing protein At5g46580, chloroplastic OS=Arabidop... | [more] |
B4F8Z1 | 6.6e-123 | 36.82 | Pentatricopeptide repeat-containing protein ATP4, chloroplastic OS=Zea mays OX=4... | [more] |
Q10PZ4 | 8.0e-121 | 37.56 | Pentatricopeptide repeat-containing protein ATP4 homolog, chloroplastic OS=Oryza... | [more] |
Q8GWE0 | 3.1e-120 | 36.65 | Pentatricopeptide repeat-containing protein At4g16390, chloroplastic OS=Arabidop... | [more] |
Q8GYP6 | 6.5e-46 | 23.77 | Pentatricopeptide repeat-containing protein At1g18900 OS=Arabidopsis thaliana OX... | [more] |
Match Name | E-value | Identity | Description | |
A0A0A0L6K8 | 0.0e+00 | 93.81 | Smr domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_3G011820 PE=3 SV... | [more] |
A0A5A7UFR2 | 0.0e+00 | 93.39 | Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946... | [more] |
A0A1S3BKZ1 | 0.0e+00 | 93.39 | pentatricopeptide repeat-containing protein At5g46580, chloroplastic OS=Cucumis ... | [more] |
A0A6J1E5X7 | 0.0e+00 | 89.36 | pentatricopeptide repeat-containing protein At5g46580, chloroplastic-like OS=Cuc... | [more] |
A0A6J1CW10 | 0.0e+00 | 91.26 | pentatricopeptide repeat-containing protein At5g46580, chloroplastic OS=Momordic... | [more] |
Match Name | E-value | Identity | Description | |
AT5G46580.1 | 2.4e-282 | 66.67 | pentatricopeptide (PPR) repeat-containing protein | [more] |
AT4G16390.1 | 2.2e-121 | 36.65 | pentatricopeptide (PPR) repeat-containing protein | [more] |
AT1G32520.1 | 8.6e-86 | 69.74 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... | [more] |
AT1G18900.1 | 4.6e-47 | 23.77 | Pentatricopeptide repeat (PPR) superfamily protein | [more] |
AT1G18900.2 | 4.6e-47 | 23.77 | Pentatricopeptide repeat (PPR) superfamily protein | [more] |