Homology
BLAST of HG10008123 vs. NCBI nr
Match:
XP_038876888.1 (pentatricopeptide repeat-containing protein At2g26790, mitochondrial-like isoform X1 [Benincasa hispida] >XP_038876889.1 pentatricopeptide repeat-containing protein At2g26790, mitochondrial-like isoform X1 [Benincasa hispida] >XP_038876890.1 pentatricopeptide repeat-containing protein At2g26790, mitochondrial-like isoform X1 [Benincasa hispida] >XP_038876891.1 pentatricopeptide repeat-containing protein At2g26790, mitochondrial-like isoform X1 [Benincasa hispida] >XP_038876892.1 pentatricopeptide repeat-containing protein At2g26790, mitochondrial-like isoform X1 [Benincasa hispida] >XP_038876893.1 pentatricopeptide repeat-containing protein At2g26790, mitochondrial-like isoform X1 [Benincasa hispida] >XP_038876894.1 pentatricopeptide repeat-containing protein At2g26790, mitochondrial-like isoform X1 [Benincasa hispida] >XP_038876895.1 pentatricopeptide repeat-containing protein At2g26790, mitochondrial-like isoform X1 [Benincasa hispida])
HSP 1 Score: 1401.3 bits (3626), Expect = 0.0e+00
Identity = 706/804 (87.81%), Postives = 743/804 (92.41%), Query Frame = 0
Query: 1 MNCLLLAINNPIKLNKFKLKLVKFASTAIAQPNPCSFSHSEDEQSTTSINTTLHVQCNPS 60
MNCLLLA+NNPIKLN KLKLVKF STAIAQ NPCSFSHS+DE+STTS NTTLH++C PS
Sbjct: 1 MNCLLLAVNNPIKLNILKLKLVKFTSTAIAQLNPCSFSHSDDERSTTSFNTTLHIRCKPS 60
Query: 61 KVAQILESLRREPNIAFSFFRELEEWGFQHNISTYAALIRILCSWGLERKLESLFSNLIA 120
KV QILESLRREPNIAFSFFRELEEWGFQHNISTYAALIRILCSWGL RKLESLF NLI
Sbjct: 61 KVVQILESLRREPNIAFSFFRELEEWGFQHNISTYAALIRILCSWGLGRKLESLFLNLIG 120
Query: 121 SKRREFDVLDLLESLNQGYVVDGSFIRAYDALIKAYVSVSLFDSVVDLLFRLERKGFVPH 180
SK+ EFDVLDLL+SLNQ VVDGSFIRAYDALIKAYVSV+LFDSV+DLL R ERKGFVPH
Sbjct: 121 SKKMEFDVLDLLDSLNQECVVDGSFIRAYDALIKAYVSVNLFDSVMDLLLRSERKGFVPH 180
Query: 181 IFTCNFLLNRLIEHGKMNMALVVYKQLKRFGCQPNDYTYATVIKALCKIGHMEEAIDIFE 240
IFT N+LLN LIEHGK NMALVVYKQLKRFGCQPNDYTYATVIKALCKIG+MEEAIDIFE
Sbjct: 181 IFTYNYLLNSLIEHGKTNMALVVYKQLKRFGCQPNDYTYATVIKALCKIGNMEEAIDIFE 240
Query: 241 EMSESGVVPNAFACAAYIEGLCTHDCSTSGYQLLQAWRAEQVPIDAYAYSVVIRGFCEEM 300
EMSESGVVPNAFACAAYIEGLCT+DCSTSGYQLLQAWRAEQ PIDAYAYSVVIRGFCEEM
Sbjct: 241 EMSESGVVPNAFACAAYIEGLCTYDCSTSGYQLLQAWRAEQFPIDAYAYSVVIRGFCEEM 300
Query: 301 KIDEAENVFLDMEKYGVVPDALTYGVLINGYCKKLMLQKALSLHSLMLSKGVKSNCVIIS 360
K+DEAE VFLDMEKYGVVPDA TYGVLINGYCKKL LQKALSLHSLMLSKG+KSNCVIIS
Sbjct: 301 KLDEAEYVFLDMEKYGVVPDAQTYGVLINGYCKKLNLQKALSLHSLMLSKGLKSNCVIIS 360
Query: 361 SILQCLFRMQMYSEVVNQFKIFQGEGLFLDKVVYNIVVHALCELGKLEEAIELLEDMTSR 420
SILQCL RMQMYSEVVNQFK FQ +G+FLDKVVYNIVVHALCELGKLEEAIELLEDMTSR
Sbjct: 361 SILQCLLRMQMYSEVVNQFKAFQEKGVFLDKVVYNIVVHALCELGKLEEAIELLEDMTSR 420
Query: 421 QIQMDVVHYTTMIKGFFVQGKIHEAMIMFENLKKNGIEPDTITYNVLATGLSRNGLVSKV 480
QIQMDVVHYTTMIKGFF+QGKIHEAM+MFENLKKNG+EPDTITYNVLA GL RNGLVS V
Sbjct: 421 QIQMDVVHYTTMIKGFFLQGKIHEAMMMFENLKKNGVEPDTITYNVLAAGLCRNGLVSNV 480
Query: 481 HGLLDYMEEHGLKEDPKIRDLVIENLCIGGKVKEATEFFNSLEVKTVDNYSAMINGYCAA 540
GLLDYMEEH L++DPK+ DLVIENLCIGGKVKEAT FFNSLEVKTVDNYSAMINGYCAA
Sbjct: 481 QGLLDYMEEHDLRKDPKMPDLVIENLCIGGKVKEATVFFNSLEVKTVDNYSAMINGYCAA 540
Query: 541 NHTKDAYELFINLSKKGVFIKRSALVRLVSSLCMEDSSDRDIKVMKQLPIRNVEANEIVY 600
N TK AYELF+NLSKKGV IKRS+LVRLVS LCMEDSS R I+V+KQLPI NVEANEIVY
Sbjct: 541 NRTKSAYELFVNLSKKGVLIKRSSLVRLVSRLCMEDSSIRAIEVIKQLPIMNVEANEIVY 600
Query: 601 NKVIASLCRERNMKKAQCLFDFLVSSGLTPDLITYTMMINGYCNINFLRKAYELLCDMKN 660
NKVIASLCR RNMK AQ LFD LV +GLTPDLITYT+MINGYC INFLR+AYELLCDMKN
Sbjct: 601 NKVIASLCRARNMKMAQSLFDCLVCAGLTPDLITYTVMINGYCKINFLREAYELLCDMKN 660
Query: 661 RGREPDIFIYTILLDGQFKTRVRELCSSVEIAFTSTIFDEMKDMKFTPDVVYYTVMIDGY 720
RGREPDIFIYT+LLDG+FKTRVRELCSSVEI +EMKDMK TPDVVYYTV+IDGY
Sbjct: 661 RGREPDIFIYTVLLDGRFKTRVRELCSSVEIV------NEMKDMKITPDVVYYTVLIDGY 720
Query: 721 CKMNKLNDAVVLFEEMIDQGIEPDTVTYTALLSGCCRSGNTEKAQTLYYDMVSKGILPPK 780
CKMN LNDA VLFEEM+DQGIE DTVTYTALLSGCCRSG+ EKAQTL YDM+SKGILPPK
Sbjct: 721 CKMNNLNDAFVLFEEMVDQGIEADTVTYTALLSGCCRSGDMEKAQTLCYDMMSKGILPPK 780
Query: 781 HFSFLLHHDTLKTKKVQLYKHLTE 805
FS LLHHDTLKTKK+QLYKH TE
Sbjct: 781 IFSHLLHHDTLKTKKMQLYKHFTE 798
BLAST of HG10008123 vs. NCBI nr
Match:
XP_038894511.1 (pentatricopeptide repeat-containing protein At2g26790, mitochondrial-like isoform X1 [Benincasa hispida] >XP_038894512.1 pentatricopeptide repeat-containing protein At2g26790, mitochondrial-like isoform X1 [Benincasa hispida] >XP_038894513.1 pentatricopeptide repeat-containing protein At2g26790, mitochondrial-like isoform X1 [Benincasa hispida])
HSP 1 Score: 1385.5 bits (3585), Expect = 0.0e+00
Identity = 695/804 (86.44%), Postives = 737/804 (91.67%), Query Frame = 0
Query: 1 MNCLLLAINNPIKLNKFKLKLVKFASTAIAQPNPCSFSHSEDEQSTTSINTTLHVQCNPS 60
MNCLL A+NNP LNK KLKLVKFASTAIAQ NPC FSHS+DEQS TS NTTL VQC PS
Sbjct: 1 MNCLLHAVNNPTNLNKLKLKLVKFASTAIAQLNPCFFSHSDDEQSVTSFNTTLDVQCKPS 60
Query: 61 KVAQILESLRREPNIAFSFFRELEEWGFQHNISTYAALIRILCSWGLERKLESLFSNLIA 120
KV +ILESLRREPNIAFSFF ELEE GFQHNISTYAALIR+LCSWGL +KLESLF NLI
Sbjct: 61 KVVRILESLRREPNIAFSFFHELEERGFQHNISTYAALIRVLCSWGLGKKLESLFLNLIG 120
Query: 121 SKRREFDVLDLLESLNQGYVVDGSFIRAYDALIKAYVSVSLFDSVVDLLFRLERKGFVPH 180
SK+ EFDVLDLLESLNQG VVD SFIRAYDALIKAY SV+LFDSVVDLLFRLERKGFVPH
Sbjct: 121 SKKMEFDVLDLLESLNQGCVVDDSFIRAYDALIKAYASVNLFDSVVDLLFRLERKGFVPH 180
Query: 181 IFTCNFLLNRLIEHGKMNMALVVYKQLKRFGCQPNDYTYATVIKALCKIGHMEEAIDIFE 240
IFTCN+LLNRLIEHGKMNMALVVYKQLKRFGCQPNDYTYA VIKALCKIG+MEEAIDIFE
Sbjct: 181 IFTCNYLLNRLIEHGKMNMALVVYKQLKRFGCQPNDYTYAIVIKALCKIGNMEEAIDIFE 240
Query: 241 EMSESGVVPNAFACAAYIEGLCTHDCSTSGYQLLQAWRAEQVPIDAYAYSVVIRGFCEEM 300
EM+ESGVVPNAFACAAYIEGLCTHDCSTSGYQLLQAWRAE+ PIDAYAY VVIRGFCEEM
Sbjct: 241 EMTESGVVPNAFACAAYIEGLCTHDCSTSGYQLLQAWRAERFPIDAYAYYVVIRGFCEEM 300
Query: 301 KIDEAENVFLDMEKYGVVPDALTYGVLINGYCKKLMLQKALSLHSLMLSKGVKSNCVIIS 360
KIDEAE VFLDME YGVVPDA TYGVLINGYCK L LQKALSLHSLMLSKG+KSNCVI+S
Sbjct: 301 KIDEAEYVFLDMENYGVVPDAQTYGVLINGYCKTLNLQKALSLHSLMLSKGIKSNCVIVS 360
Query: 361 SILQCLFRMQMYSEVVNQFKIFQGEGLFLDKVVYNIVVHALCELGKLEEAIELLEDMTSR 420
ILQCL MQMYSEVVNQFK+FQ +G+FLDKVV+NIV+HALCELGKLEEAIELLEDMTSR
Sbjct: 361 FILQCLLTMQMYSEVVNQFKVFQDKGVFLDKVVFNIVIHALCELGKLEEAIELLEDMTSR 420
Query: 421 QIQMDVVHYTTMIKGFFVQGKIHEAMIMFENLKKNGIEPDTITYNVLATGLSRNGLVSKV 480
QIQMDV HYTTMIKGFFVQGK HEAMIMF+NLKKNG+EPDTITYNV+A GLSRNGLVSKV
Sbjct: 421 QIQMDVKHYTTMIKGFFVQGKTHEAMIMFDNLKKNGVEPDTITYNVVAAGLSRNGLVSKV 480
Query: 481 HGLLDYMEEHGLKEDPKIRDLVIENLCIGGKVKEATEFFNSLEVKTVDNYSAMINGYCAA 540
GLLDYMEEHGLK+D KI DLVIENLCIGGKVKEATEFFNSLEVKTVDNYSAMINGYCAA
Sbjct: 481 QGLLDYMEEHGLKKDSKIPDLVIENLCIGGKVKEATEFFNSLEVKTVDNYSAMINGYCAA 540
Query: 541 NHTKDAYELFINLSKKGVFIKRSALVRLVSSLCMEDSSDRDIKVMKQLPIRNVEANEIVY 600
NH K AYELF+NLSKKG FIKRS+LVRLVSSLCMEDS R I+V+K LPI +VEANEIVY
Sbjct: 541 NHKKAAYELFVNLSKKGDFIKRSSLVRLVSSLCMEDSCVRAIEVIKHLPIIDVEANEIVY 600
Query: 601 NKVIASLCRERNMKKAQCLFDFLVSSGLTPDLITYTMMINGYCNINFLRKAYELLCDMKN 660
NKVIASLCR RNMK AQ LFD LV +GLTPDLITYTMMI+GYC INFLR+A+ELLCDMKN
Sbjct: 601 NKVIASLCRARNMKMAQSLFDCLVCAGLTPDLITYTMMIDGYCKINFLREAHELLCDMKN 660
Query: 661 RGREPDIFIYTILLDGQFKTRVRELCSSVEIAFTSTIFDEMKDMKFTPDVVYYTVMIDGY 720
RGREPDIFIYT+LLD QFKTRV+ELC SVE FTSTIFDEMK MKFTPDVVYYTV+IDG+
Sbjct: 661 RGREPDIFIYTVLLDSQFKTRVQELCPSVETPFTSTIFDEMKGMKFTPDVVYYTVLIDGF 720
Query: 721 CKMNKLNDAVVLFEEMIDQGIEPDTVTYTALLSGCCRSGNTEKAQTLYYDMVSKGILPPK 780
CKMN LNDA V FEEM+DQGIE DTVT+TALLSGCCRSG+ EKAQTL YDM+SKGILPPK
Sbjct: 721 CKMNNLNDAFVFFEEMVDQGIEADTVTFTALLSGCCRSGDMEKAQTLCYDMISKGILPPK 780
Query: 781 HFSFLLHHDTLKTKKVQLYKHLTE 805
FS+LLH DTLKTKK+ YKHLT+
Sbjct: 781 IFSYLLHQDTLKTKKMPHYKHLTK 804
BLAST of HG10008123 vs. NCBI nr
Match:
XP_008443906.1 (PREDICTED: pentatricopeptide repeat-containing protein At2g26790, mitochondrial-like isoform X1 [Cucumis melo] >XP_008443907.1 PREDICTED: pentatricopeptide repeat-containing protein At2g26790, mitochondrial-like isoform X1 [Cucumis melo] >XP_016899822.1 PREDICTED: pentatricopeptide repeat-containing protein At2g26790, mitochondrial-like isoform X1 [Cucumis melo])
HSP 1 Score: 1350.9 bits (3495), Expect = 0.0e+00
Identity = 672/796 (84.42%), Postives = 733/796 (92.09%), Query Frame = 0
Query: 1 MNCLLLAINNPIKLNKFKLKLVKFASTAIAQPNPCSFSHSEDEQSTTSINTTLHVQCNPS 60
MNCLLLA+NNP KLNK KLKLVKFASTAIAQ N C FSHS+DEQ+TTS NTTL+VQC PS
Sbjct: 1 MNCLLLAVNNPTKLNKLKLKLVKFASTAIAQLNSCFFSHSDDEQTTTSFNTTLNVQCKPS 60
Query: 61 KVAQILESLRREPNIAFSFFRELEEWGFQHNISTYAALIRILCSWGLERKLESLFSNLIA 120
KV Q+LESLRREPNIAFSFFRELEE GFQHNISTYAALIRILCSW L RKLE+LF NLI
Sbjct: 61 KVVQVLESLRREPNIAFSFFRELEERGFQHNISTYAALIRILCSWRLRRKLETLFLNLIG 120
Query: 121 SKRREFDVLDLLESLNQGYVVDGSFIRAYDALIKAYVSVSLFDSVVDLLFRLERKGFVPH 180
SK+ +FDVLDLLESLNQG V+D SFIRAYDALIKAYVSV+LFDSVVDLLFRL RKGFVPH
Sbjct: 121 SKKMDFDVLDLLESLNQGCVLDASFIRAYDALIKAYVSVNLFDSVVDLLFRLGRKGFVPH 180
Query: 181 IFTCNFLLNRLIEHGKMNMALVVYKQLKRFGCQPNDYTYATVIKALCKIGHMEEAIDIFE 240
IFTCN+LLNRLIEHGKMNMALVVYKQLKRFGCQPNDYTYATVIKALCKIG+MEEAIDIFE
Sbjct: 181 IFTCNYLLNRLIEHGKMNMALVVYKQLKRFGCQPNDYTYATVIKALCKIGNMEEAIDIFE 240
Query: 241 EMSESGVVPNAFACAAYIEGLCTHDCSTSGYQLLQAWRAEQVPIDAYAYSVVIRGFCEEM 300
EMS G+VPNAFACAAYIE LCTHDCSTSGYQLLQAWRAE+ PIDAYAYSVVIRGFC+EM
Sbjct: 241 EMSGYGMVPNAFACAAYIEALCTHDCSTSGYQLLQAWRAERFPIDAYAYSVVIRGFCDEM 300
Query: 301 KIDEAENVFLDMEKYGVVPDALTYGVLINGYCKKLMLQKALSLHSLMLSKGVKSNCVIIS 360
KIDEAE+VFLDME YGVVPDA TY VLI+GYCKKL LQKALSLHSLMLSKG+KSNCVI+S
Sbjct: 301 KIDEAESVFLDMENYGVVPDAQTYSVLIDGYCKKLNLQKALSLHSLMLSKGIKSNCVIVS 360
Query: 361 SILQCLFRMQMYSEVVNQFKIFQGEGLFLDKVVYNIVVHALCELGKLEEAIELLEDMTSR 420
ILQC RMQMYSEVVNQFK+FQ +GLFLDKVVYNIVVHALCELGKLEEAIELLE+MTSR
Sbjct: 361 FILQCFLRMQMYSEVVNQFKVFQEKGLFLDKVVYNIVVHALCELGKLEEAIELLEEMTSR 420
Query: 421 QIQMDVVHYTTMIKGFFVQGKIHEAMIMFENLKKNGIEPDTITYNVLATGLSRNGLVSKV 480
QIQMDV+HYTTMIKG F QGKIHEAM+MFENLKKNG+EPD+ITYNVLA GLSRNGLVSKV
Sbjct: 421 QIQMDVIHYTTMIKGLFAQGKIHEAMMMFENLKKNGVEPDSITYNVLAAGLSRNGLVSKV 480
Query: 481 HGLLDYMEEHGLKEDPKIRDLVIENLCIGGKVKEATEFFNSLEVKTVDNYSAMINGYCAA 540
LL+YMEEHGL+EDPK+ +LVIENLCIGGKVKEATEFFNSLEVKTVDNY+AMINGYCAA
Sbjct: 481 QELLNYMEEHGLREDPKMPNLVIENLCIGGKVKEATEFFNSLEVKTVDNYAAMINGYCAA 540
Query: 541 NHTKDAYELFINLSKKGVFIKRSALVRLVSSLCMEDSSDRDIKVMKQLPIRNVEANEIVY 600
N TK AY+LF+NLSK+GVFIKRS+LVRLVS LCME+SS R I+VMKQLP+ N+EA E VY
Sbjct: 541 NSTKAAYKLFVNLSKEGVFIKRSSLVRLVSRLCMENSSFRAIEVMKQLPVMNLEAKEFVY 600
Query: 601 NKVIASLCRERNMKKAQCLFDFLVSSGLTPDLITYTMMINGYCNINFLRKAYELLCDMKN 660
NKVIASLC+ RNMK AQCLFD LV +GL PDLITYTMMINGYC IN+LR+AYELLCDM+N
Sbjct: 601 NKVIASLCQVRNMKMAQCLFDCLVRAGLIPDLITYTMMINGYCKINYLREAYELLCDMRN 660
Query: 661 RGREPDIFIYTILLDGQFKTRVRELCSSVEIAFTSTIFDEMKDMKFTPDVVYYTVMIDGY 720
RGREPDIF+YT+LLDG FKTRV++ CSSVE+A TS+IF+E KDMK TPDVVYYTV+IDGY
Sbjct: 661 RGREPDIFVYTVLLDGGFKTRVQK-CSSVELALTSSIFNETKDMKITPDVVYYTVLIDGY 720
Query: 721 CKMNKLNDAVVLFEEMIDQGIEPDTVTYTALLSGCCRSGNTEKAQTLYYDMVSKGILPPK 780
CKMN LN A VLFEEM+DQGIE D VTYTALLSGCCR+G+ EKAQTL Y+M SKGILPP+
Sbjct: 721 CKMNNLNAAFVLFEEMVDQGIEADAVTYTALLSGCCRNGDKEKAQTLCYEMTSKGILPPE 780
Query: 781 HFSFLLHHDTLKTKKV 797
+FS+LL HDTL+TKK+
Sbjct: 781 NFSYLLQHDTLETKKI 795
BLAST of HG10008123 vs. NCBI nr
Match:
KAA0050042.1 (pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])
HSP 1 Score: 1349.3 bits (3491), Expect = 0.0e+00
Identity = 670/796 (84.17%), Postives = 733/796 (92.09%), Query Frame = 0
Query: 1 MNCLLLAINNPIKLNKFKLKLVKFASTAIAQPNPCSFSHSEDEQSTTSINTTLHVQCNPS 60
MNCLLLA+NNP KLNK KLKLVKFASTAIAQ N C FSHS+DEQ+TTS NTTL+VQC PS
Sbjct: 1 MNCLLLAVNNPTKLNKLKLKLVKFASTAIAQLNSCFFSHSDDEQTTTSFNTTLNVQCKPS 60
Query: 61 KVAQILESLRREPNIAFSFFRELEEWGFQHNISTYAALIRILCSWGLERKLESLFSNLIA 120
KV Q+LESLRREPNIAFSFFRELEE GFQHN+STYAALIRILCSW L RKLE+LF NLI
Sbjct: 61 KVVQVLESLRREPNIAFSFFRELEERGFQHNVSTYAALIRILCSWRLRRKLETLFLNLIG 120
Query: 121 SKRREFDVLDLLESLNQGYVVDGSFIRAYDALIKAYVSVSLFDSVVDLLFRLERKGFVPH 180
SK+ +FDVLDLLESLNQG V+D SFIRAYDALIKAYV+V+LFDSVVDLLFRL RKGFVPH
Sbjct: 121 SKKMDFDVLDLLESLNQGCVLDASFIRAYDALIKAYVNVNLFDSVVDLLFRLGRKGFVPH 180
Query: 181 IFTCNFLLNRLIEHGKMNMALVVYKQLKRFGCQPNDYTYATVIKALCKIGHMEEAIDIFE 240
IFTCN+LLNRLIEHGKMNMALVVYKQLKRFGCQPNDYTYATVIKALCKIG+MEEAIDIFE
Sbjct: 181 IFTCNYLLNRLIEHGKMNMALVVYKQLKRFGCQPNDYTYATVIKALCKIGNMEEAIDIFE 240
Query: 241 EMSESGVVPNAFACAAYIEGLCTHDCSTSGYQLLQAWRAEQVPIDAYAYSVVIRGFCEEM 300
EMS G+VPNAFACAAYIE LCTHDCSTSGYQLLQAWRAE+ PIDAYAYSVVIRGFC+EM
Sbjct: 241 EMSGYGMVPNAFACAAYIEALCTHDCSTSGYQLLQAWRAERFPIDAYAYSVVIRGFCDEM 300
Query: 301 KIDEAENVFLDMEKYGVVPDALTYGVLINGYCKKLMLQKALSLHSLMLSKGVKSNCVIIS 360
KIDEAE+VFLDME YGVVPDA TY VLI+GYCKKL LQKALSLHSLMLSKG+KSNCVI+S
Sbjct: 301 KIDEAESVFLDMENYGVVPDAQTYSVLIDGYCKKLNLQKALSLHSLMLSKGIKSNCVIVS 360
Query: 361 SILQCLFRMQMYSEVVNQFKIFQGEGLFLDKVVYNIVVHALCELGKLEEAIELLEDMTSR 420
ILQC RMQMYSEVVNQFK+FQ +GLFLDKVVYNIVVHALCELGKLEEAIELLE+MTSR
Sbjct: 361 FILQCFLRMQMYSEVVNQFKVFQEKGLFLDKVVYNIVVHALCELGKLEEAIELLEEMTSR 420
Query: 421 QIQMDVVHYTTMIKGFFVQGKIHEAMIMFENLKKNGIEPDTITYNVLATGLSRNGLVSKV 480
QIQMDV+HYTTMIKG F QGKIHEAM+MFENLKKNG+EPD+ITYNVLA GLSRNGLVSKV
Sbjct: 421 QIQMDVIHYTTMIKGLFAQGKIHEAMMMFENLKKNGVEPDSITYNVLAAGLSRNGLVSKV 480
Query: 481 HGLLDYMEEHGLKEDPKIRDLVIENLCIGGKVKEATEFFNSLEVKTVDNYSAMINGYCAA 540
LL+YMEEHGL+EDPK+ +LVIENLCIGGKVKEATEFFNSLEVKTVDNY+AMINGYCAA
Sbjct: 481 QELLNYMEEHGLREDPKMPNLVIENLCIGGKVKEATEFFNSLEVKTVDNYAAMINGYCAA 540
Query: 541 NHTKDAYELFINLSKKGVFIKRSALVRLVSSLCMEDSSDRDIKVMKQLPIRNVEANEIVY 600
N TK AY+LF+NLSK+GVFIKRS+LVRLVS LCME+SS R I+VMKQLP+ N+EA E VY
Sbjct: 541 NSTKAAYKLFVNLSKEGVFIKRSSLVRLVSRLCMENSSFRAIEVMKQLPVMNLEAKEFVY 600
Query: 601 NKVIASLCRERNMKKAQCLFDFLVSSGLTPDLITYTMMINGYCNINFLRKAYELLCDMKN 660
NKVIASLC+ RNMK AQCLFD LV +GL PDLITYTMMINGYC IN+LR+AYELLCDM+N
Sbjct: 601 NKVIASLCQVRNMKMAQCLFDCLVRAGLIPDLITYTMMINGYCKINYLREAYELLCDMRN 660
Query: 661 RGREPDIFIYTILLDGQFKTRVRELCSSVEIAFTSTIFDEMKDMKFTPDVVYYTVMIDGY 720
RGREPDIF+YT+LLDG FKTRV++ CSSVE+A TS+IF+E KDMK TPDVVYYTV+IDGY
Sbjct: 661 RGREPDIFVYTVLLDGGFKTRVQK-CSSVELALTSSIFNETKDMKITPDVVYYTVLIDGY 720
Query: 721 CKMNKLNDAVVLFEEMIDQGIEPDTVTYTALLSGCCRSGNTEKAQTLYYDMVSKGILPPK 780
CKMN LN A VLFEEM+DQGIE D VTYTALLSGCCR+G+ EKAQTL Y+M SKGILPP+
Sbjct: 721 CKMNNLNAAFVLFEEMVDQGIEADAVTYTALLSGCCRNGDKEKAQTLCYEMTSKGILPPE 780
Query: 781 HFSFLLHHDTLKTKKV 797
+FS+LL HDTL+TKK+
Sbjct: 781 NFSYLLQHDTLETKKI 795
BLAST of HG10008123 vs. NCBI nr
Match:
XP_022987398.1 (pentatricopeptide repeat-containing protein At2g26790, mitochondrial [Cucurbita maxima] >XP_022987399.1 pentatricopeptide repeat-containing protein At2g26790, mitochondrial [Cucurbita maxima] >XP_022987400.1 pentatricopeptide repeat-containing protein At2g26790, mitochondrial [Cucurbita maxima])
HSP 1 Score: 1341.6 bits (3471), Expect = 0.0e+00
Identity = 677/805 (84.10%), Postives = 727/805 (90.31%), Query Frame = 0
Query: 1 MNCLLLAINNPIKLNKFKLKLVKFASTAIAQPNPCSFSHSEDEQSTTSINTTLHVQCNPS 60
MNCLLLA+NNPIKL++ KLKLVKFASTAIAQ NPCSFSHS+DEQSTT +TTL VQ NPS
Sbjct: 1 MNCLLLAVNNPIKLSRLKLKLVKFASTAIAQLNPCSFSHSDDEQSTTFFHTTLPVQFNPS 60
Query: 61 KVAQILESLRREPNIAFSFFRELEEWGFQHNISTYAALIRILCSWGLERKLESLFSNLIA 120
KV QIL SLRREP IAFSFFRELEE GFQHNISTYAALIRILCSWGLERKLESLF NLI
Sbjct: 61 KVVQILGSLRREPKIAFSFFRELEERGFQHNISTYAALIRILCSWGLERKLESLFLNLIE 120
Query: 121 SKRREFDVLDLLESLNQGYVVDGSFIRAYDALIKAYVSVSLFDSVVDLLFRLERKGFVPH 180
SK+ EFDVLDLLESLNQGY V GSF RAYDALIKAYVSVSLFDS VDLLFR ERKGFVPH
Sbjct: 121 SKKMEFDVLDLLESLNQGYAVGGSFTRAYDALIKAYVSVSLFDSAVDLLFRSERKGFVPH 180
Query: 181 IFTCNFLLNRLIEHGKMNMALVVYKQLKRFGCQPNDYTYATVIKALCKIGHMEEAIDIFE 240
IFTCNFLLNRL EHGKMNMALVVYKQLKRFG PNDYTYA VIKALCK+G+MEEAI IFE
Sbjct: 181 IFTCNFLLNRLTEHGKMNMALVVYKQLKRFGFHPNDYTYAIVIKALCKMGNMEEAIYIFE 240
Query: 241 EMSESGVVPNAFACAAYIEGLCTHDCSTSGYQLLQAWRAEQVPIDAYAYSVVIRGFCEEM 300
EMSE+GVV +AFA AYIEGLCTH CS S YQLLQAWR Q PID YAY VVIRGFCEEM
Sbjct: 241 EMSEAGVVASAFAYTAYIEGLCTHHCSASAYQLLQAWREAQAPIDVYAYFVVIRGFCEEM 300
Query: 301 KIDEAENVFLDMEKYGVVPDALTYGVLINGYCKKLMLQKALSLHSLMLSKGVKSNCVIIS 360
KIDEAENVFL+ME+YGVVPDA TYGVLINGYCKKL LQKALSLHSLMLSKG+K+NCVI+S
Sbjct: 301 KIDEAENVFLEMEQYGVVPDAQTYGVLINGYCKKLKLQKALSLHSLMLSKGIKTNCVIVS 360
Query: 361 SILQCLFRMQMYSEVVNQFKIFQGEGLFLDKVVYNIVVHALCELGKLEEAIELLEDMTSR 420
SILQCL RMQM SEVVNQFK+FQG+G+F DKV YNIVVHALCE GKLEEA+ELLEDMTSR
Sbjct: 361 SILQCLLRMQMCSEVVNQFKVFQGKGVFFDKVAYNIVVHALCEQGKLEEAMELLEDMTSR 420
Query: 421 QIQMDVVHYTTMIKGFFVQGKIHEAMIMFENLKKNGIEPDTITYNVLATGLSRNGLVSKV 480
QIQMDVVHYTTMIKGFFVQGKIHEAM+MFENLKKNGIEPDTITYNVLA GLSRNGLVSKV
Sbjct: 421 QIQMDVVHYTTMIKGFFVQGKIHEAMVMFENLKKNGIEPDTITYNVLAAGLSRNGLVSKV 480
Query: 481 HGLLDYMEEHGLKEDPKIRDLVIENLCIGGKVKEATEFFNSLEVKTVDNYSAMINGYCAA 540
GLLDYMEEHGL+EDPKI DLVIENLC+GGKVKEATEFFNSLEVKTV+NYSAMINGYCAA
Sbjct: 481 QGLLDYMEEHGLREDPKILDLVIENLCVGGKVKEATEFFNSLEVKTVENYSAMINGYCAA 540
Query: 541 NHTKDAYELFINLSKKGVFIKRSALVRLVSSLCMEDSSDRDIKVMKQLPIRNVEANEIVY 600
NHT DAY+LF+NLSKKGV IK+S L RLVSSLCMEDSSDR IKV+K+LPI +V+AN+IVY
Sbjct: 541 NHTNDAYDLFVNLSKKGVSIKKSTLSRLVSSLCMEDSSDRAIKVIKKLPIMDVKANKIVY 600
Query: 601 NKVIASLCRERNMKKAQCLFDFLVSSGLTPDLITYTMMINGYCNINFLRKAYELLCDMKN 660
N VI+SLCR NMK+AQ LFD LV +GLTPDLITYTMMINGYC INFLR+AYELLCDMKN
Sbjct: 601 NNVISSLCRVGNMKRAQSLFDCLVCAGLTPDLITYTMMINGYCKINFLREAYELLCDMKN 660
Query: 661 RGREPDIFIYTILLDGQFKTRVRELCSSV-------EIAFTSTIFDEMKDMKFTPDVVYY 720
RGR+PDI IYT+LLDG FKTR++ELCSSV EIA STIFDEMKDMK TPD++ Y
Sbjct: 661 RGRKPDIVIYTVLLDGLFKTRLQELCSSVHLRGEKQEIALASTIFDEMKDMKITPDIICY 720
Query: 721 TVMIDGYCKMNKLNDAVVLFEEMIDQGIEPDTVTYTALLSGCCRSGNTEKAQTLYYDMVS 780
TV+IDGYCKMN L+DAVVLFEEM+DQGIEPDTVTYTALLSGCCRSG+TEKA TL DM+S
Sbjct: 721 TVLIDGYCKMNNLSDAVVLFEEMVDQGIEPDTVTYTALLSGCCRSGDTEKAVTLMSDMLS 780
Query: 781 KGILPPKHFSFLLHHDTLKTKKVQL 799
KGILP + FS LLHH+T KTKK++L
Sbjct: 781 KGILPSEQFSTLLHHNTPKTKKMRL 805
BLAST of HG10008123 vs. ExPASy Swiss-Prot
Match:
O81028 (Pentatricopeptide repeat-containing protein At2g26790, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At2g26790 PE=3 SV=1)
HSP 1 Score: 537.7 bits (1384), Expect = 2.2e-151
Identity = 312/785 (39.75%), Postives = 458/785 (58.34%), Query Frame = 0
Query: 24 FASTAIAQPNPCSFSHSEDEQSTTSINTTLHVQCNPSKVAQILESLRREPNIAFSFFREL 83
+A +A+ PN S S ++Q +N + Q + ++L S R +PN+A SF R+L
Sbjct: 27 YAVSALNNPNNLSDS---EQQQVNHLNLSKLTQ---HGLQRLLNSTRDDPNLALSFLRQL 86
Query: 84 EEWGFQHNISTYAALIRILCSWGLERKLESLFSNLIASKRREFDVLDLLESL-NQGYVVD 143
+E G N++ YA L+RIL +WGL+ KL+S+ LI ++ R F V+DL+E + Q
Sbjct: 87 KEHGVSPNVNAYATLVRILTTWGLDIKLDSVLVELIKNEERGFTVMDLIEVIGEQAEEKK 146
Query: 144 GSF--IRAYDALIKAYVSVSLFDSVVDLLFRLERKGFVPHIFTCNFLLNRLIEHGKMNMA 203
SF IR AL+KAYVS+ +FD D+LF+ +R V I CNFL+NR+ E GK+ M
Sbjct: 147 RSFVLIRVSGALVKAYVSLGMFDEATDVLFQSKRLDCVVDIKACNFLMNRMTEFGKIGML 206
Query: 204 LVVYKQLKRFGCQPNDYTYATVIKALCKIGHMEEAIDIFEEMSESGVVPNAFACAAYIEG 263
+ ++KQLK+ G N+YTYA V+KALC+ G++EEA + E + F +I G
Sbjct: 207 MTLFKQLKQLGLCANEYTYAIVVKALCRKGNLEEAAMLLIENE------SVFGYKTFING 266
Query: 264 LC-THDCSTSGYQLLQAWRAEQVPID--AYAYSVVIRGFCEEMKIDEAENVFLDMEKYGV 323
LC T + + +L+ + + D +V+RGFC EMK+ AE+V ++ME+ G
Sbjct: 267 LCVTGETEKAVALILELIDRKYLAGDDLRAVLGMVVRGFCNEMKMKAAESVIIEMEEIGF 326
Query: 324 VPDALTYGVLINGYCKKLMLQKALSLHSLMLSKGVKSNCVIISSILQCLFRMQMYSEVVN 383
D +I+ YCK + L +AL ML KG+K NCVI+S ILQC +M M E +
Sbjct: 327 GLDVYACLAVIDRYCKNMNLPEALGFLDKMLGKGLKVNCVIVSLILQCYCKMDMCLEALE 386
Query: 384 QFKIFQGEGLFLDKVVYNIVVHALCELGKLEEAIELLEDMTSRQIQMDVVHYTTMIKGFF 443
+FK F+ +FLD+V YN+ AL +LG++EEA ELL++M R I DV++YTT+I G+
Sbjct: 387 KFKEFRDMNIFLDRVCYNVAFDALSKLGRVEEAFELLQEMKDRGIVPDVINYTTLIDGYC 446
Query: 444 VQGKIHEAMIMFENLKKNGIEPDTITYNVLATGLSRNGLVSKVHGLLDYMEEHGLKEDPK 503
+QGK+ +A+ + + + NG+ PD ITYNVL +GL+RNG +V + + M+ G K +
Sbjct: 447 LQGKVVDALDLIDEMIGNGMSPDLITYNVLVSGLARNGHEEEVLEIYERMKAEGPKPNAV 506
Query: 504 IRDLVIENLCIGGKVKEATEFFNSLEVKTVDNYSAMINGYCAANHTKDAYELFINLSKKG 563
++IE LC KVKEA +FF+SLE K +N ++ + GYC A +K AY+ F+ L
Sbjct: 507 TNSVIIEGLCFARKVKEAEDFFSSLEQKCPENKASFVKGYCEAGLSKKAYKAFVRLEYP- 566
Query: 564 VFIKRSALVRLVSSLCMEDSSDRDIKVMKQLPIRNVEANEIVYNKVIASLCRERNMKKAQ 623
+++S ++L SLC+E ++ V+K++ VE + K+I + C+ N+++AQ
Sbjct: 567 --LRKSVYIKLFFSLCIEGYLEKAHDVLKKMSAYRVEPGRSMCGKMIGAFCKLNNVREAQ 626
Query: 624 CLFDFLVSSGLTPDLITYTMMINGYCNINFLRKAYELLCDMKNRGREPDIFIYTILLDGQ 683
LFD +V GL PDL TYT+MI+ YC +N L+KA L DMK RG +PD+ YT+LLD
Sbjct: 627 VLFDTMVERGLIPDLFTYTIMIHTYCRLNELQKAESLFEDMKQRGIKPDVVTYTVLLDRY 686
Query: 684 FK--TRVRELCS---SVEIAFTSTIFDEMKDMKFTPDVVYYTVMIDGYCKMNKLNDAVVL 743
K E CS V S + E DVV YTV+ID CKMN L A L
Sbjct: 687 LKLDPEHHETCSVQGEVGKRKASEVLREFSAAGIGLDVVCYTVLIDRQCKMNNLEQAAEL 746
Query: 744 FEEMIDQGIEPDTVTYTALLSGCCRSGNTEKAQTLYYDMVSKGILPPKHFSFLLHHDTLK 798
F+ MID G+EPD V YT L+S R G + A TL ++ K +P + F + LK
Sbjct: 747 FDRMIDSGLEPDMVAYTTLISSYFRKGYIDMAVTLVTELSKKYNIPSESFEAAVKSAALK 796
BLAST of HG10008123 vs. ExPASy Swiss-Prot
Match:
Q9LMH5 (Putative pentatricopeptide repeat-containing protein At1g13800 OS=Arabidopsis thaliana OX=3702 GN=At1g13800 PE=3 SV=1)
HSP 1 Score: 459.5 bits (1181), Expect = 7.6e-128
Identity = 273/807 (33.83%), Postives = 433/807 (53.66%), Query Frame = 0
Query: 28 AIAQPNPCSFSHSEDEQSTTSINTTLHVQCNPSKVAQILESLRREPNIAFSFFRELEEWG 87
A+A+ N + SHSE + T L + N V ++L S++ +P +A SF + +E
Sbjct: 29 ALARTN-LTISHSEQVKEGTFDYKAL--ELNDIGVLRVLNSMKDDPYLALSFLKRIEGNV 88
Query: 88 FQHNISTYAALIRILCSWGLERKLESLFSNLI--ASKRREFDVLDLLESLNQGYVVDGSF 147
++ YA +IRI+C WGL++KL++ L+ + R F V+DLL+++ +
Sbjct: 89 TLPSVQAYATVIRIVCGWGLDKKLDTFLFELVRRGDEGRGFSVMDLLKAIGEMEQSLVLL 148
Query: 148 IRAYDALIKAYVSVSLFDSVVDLLFRLERK-GFVPHIFTCNFLLNRLIEHGKMNMALVVY 207
IR AL+KAY ++ +FD +D+ FR G P I NFL++R+I G+ +M + +
Sbjct: 149 IRVSTALVKAYANLDMFDEAIDIFFRAYYSLGRAPDIKALNFLISRMIASGRSDMVVGFF 208
Query: 208 KQLKRFGCQPNDYTYATVIKALCKIGHMEEAIDIFEEMSESGVVPNAFACAAYIEGLCTH 267
+++R G + +TY V++AL + EE + + S +IEGLC +
Sbjct: 209 WEIERLGLDADAHTYVLVVQALWRNDDKEELEKLLSRLLISETRNPCVFYLNFIEGLCLN 268
Query: 268 DCSTSGYQLLQAWRAEQVPID----AYAYSVVIRGFCEEMKIDEAENVFLDMEKYGVVPD 327
+ Y LLQ R + +D AY V+RG C EM+I++AE+V LDMEK+G+ PD
Sbjct: 269 QMTDIAYFLLQPLRDANILVDKSDLGIAYRKVVRGLCYEMRIEDAESVVLDMEKHGIDPD 328
Query: 328 ALTYGVLINGYCKKLMLQKALSLHSLMLSKGVKSNCVIISSILQCLFRMQMYSEVVNQFK 387
Y +I G+ K + + KA+ + + ML K + NCVI+SSILQC +M +SE + FK
Sbjct: 329 VYVYSAIIEGHRKNMNIPKAVDVFNKMLKKRKRINCVIVSSILQCYCQMGNFSEAYDLFK 388
Query: 388 IFQGEGLFLDKVVYNIVVHALCELGKLEEAIELLEDMTSRQIQMDVVHYTTMIKGFFVQG 447
F+ + LD+V YN+ AL +LGK+EEAIEL +MT + I DV++YTT+I G +QG
Sbjct: 389 EFRETNISLDRVCYNVAFDALGKLGKVEEAIELFREMTGKGIAPDVINYTTLIGGCCLQG 448
Query: 448 KIHEAMIMFENLKKNGIEPDTITYNVLATGLSRNGLVSKVHGLLDYMEEHGLKEDPKIRD 507
K +A + + G PD + YNVLA GL+ NGL + L ME G+K +
Sbjct: 449 KCSDAFDLMIEMDGTGKTPDIVIYNVLAGGLATNGLAQEAFETLKMMENRGVKPTYVTHN 508
Query: 508 LVIENLCIGGKVKEATEFFNSLEVKTVDNYSAMINGYCAANHTKDAYELFINLSKKGVFI 567
+VIE L G++ +A F+ SLE K+ +N ++M+ G+CAA A+E FI L +
Sbjct: 509 MVIEGLIDAGELDKAEAFYESLEHKSRENDASMVKGFCAAGCLDHAFERFIRLEFP---L 568
Query: 568 KRSALVRLVSSLCME-DSSDRDIKVMKQLPIRNVEANEIVYNKVIASLCRERNMKKAQCL 627
+S L +SLC E D + ++ ++ VE + +Y K+I + CR N++KA+
Sbjct: 569 PKSVYFTLFTSLCAEKDYISKAQDLLDRMWKLGVEPEKSMYGKLIGAWCRVNNVRKAREF 628
Query: 628 FDFLVSSGLTPDLITYTMMINGYCNINFLRKAYELLCDMKNRGREPDIFIYTILLDGQ-- 687
F+ LV+ + PDL TYT+MIN YC +N ++AY L DMK R +PD+ Y++LL+
Sbjct: 629 FEILVTKKIVPDLFTYTIMINTYCRLNEPKQAYALFEDMKRRDVKPDVVTYSVLLNSDPE 688
Query: 688 ------------------FKTRVRELCSSVEIAFTSTIFDEMKDMKFTPDVV-------- 747
+ + C ++ +F +MK + PDVV
Sbjct: 689 LDMKREMEAFDVIPDVVYYTIMINRYCHLNDLKKVYALFKDMKRREIVPDVVTYTVLLKN 748
Query: 748 --------------------YYTVMIDGYCKMNKLNDAVVLFEEMIDQGIEPDTVTYTAL 779
YYTV+ID CK+ L +A +F++MI+ G++PD YTAL
Sbjct: 749 KPERNLSREMKAFDVKPDVFYYTVLIDWQCKIGDLGEAKRIFDQMIESGVDPDAAPYTAL 808
BLAST of HG10008123 vs. ExPASy Swiss-Prot
Match:
Q76C99 (Protein Rf1, mitochondrial OS=Oryza sativa subsp. indica OX=39946 GN=Rf1 PE=2 SV=1)
HSP 1 Score: 262.7 bits (670), Expect = 1.4e-68
Identity = 160/619 (25.85%), Postives = 301/619 (48.63%), Query Frame = 0
Query: 171 RLERKG---FVPHIFTCNFLLNRLIEHGKMNMALVVYKQLKRFGCQPNDYTYATVIKALC 230
R+ R G P + T L+ G++++ + + G + + + ++K LC
Sbjct: 74 RMARAGADEVTPDLCTYGILIGCCCRAGRLDLGFAALGNVIKKGFRVDAIAFTPLLKGLC 133
Query: 231 KIGHMEEAIDI-FEEMSESGVVPNAFACAAYIEGLCTHDCSTSGYQLLQAW---RAEQVP 290
+A+DI M+E G +PN F+ ++GLC + S +LL R P
Sbjct: 134 ADKRTSDAMDIVLRRMTELGCIPNVFSYNILLKGLCDENRSQEALELLHMMADDRGGGSP 193
Query: 291 IDAYAYSVVIRGFCEEMKIDEAENVFLDMEKYGVVPDALTYGVLINGYCKKLMLQKALSL 350
D +Y+ VI GF +E D+A + + +M G++PD +TY +I CK + KA+ +
Sbjct: 194 PDVVSYTTVINGFFKEGDSDKAYSTYHEMLDRGILPDVVTYNSIIAALCKAQAMDKAMEV 253
Query: 351 HSLMLSKGVKSNCVIISSILQCLFRMQMYSEVVNQFKIFQGEGLFLDKVVYNIVVHALCE 410
+ M+ GV +C+ +SIL E + K + +G+ D V Y++++ LC+
Sbjct: 254 LNTMVKNGVMPDCMTYNSILHGYCSSGQPKEAIGFLKKMRSDGVEPDVVTYSLLMDYLCK 313
Query: 411 LGKLEEAIELLEDMTSRQIQMDVVHYTTMIKGFFVQGKIHEAMIMFENLKKNGIEPDTIT 470
G+ EA ++ + MT R ++ ++ Y T+++G+ +G + E + + + +NGI PD
Sbjct: 314 NGRCMEARKIFDSMTKRGLKPEITTYGTLLQGYATKGALVEMHGLLDLMVRNGIHPDHYV 373
Query: 471 YNVLATGLSRNGLVSKVHGLLDYMEEHGLKEDPKIRDLVIENLCIGGKVKEATEFFNSL- 530
+++L ++ G V + + M + GL + VI LC G+V++A +F +
Sbjct: 374 FSILICAYAKQGKVDQAMLVFSKMRQQGLNPNAVTYGAVIGILCKSGRVEDAMLYFEQMI 433
Query: 531 -EVKTVDN--YSAMINGYCAANHTKDAYELFINLSKKGVFIKRSALVRLVSSLCMEDSSD 590
E + N Y+++I+G C N + A EL + + +G+ + ++ S C E
Sbjct: 434 DEGLSPGNIVYNSLIHGLCTCNKWERAEELILEMLDRGICLNTIFFNSIIDSHCKEGRVI 493
Query: 591 RDIKVMKQLPIRNVEANEIVYNKVIASLCRERNMKKAQCLFDFLVSSGLTPDLITYTMMI 650
K+ + + V+ N I YN +I C M +A L +VS GL P+ +TY+ +I
Sbjct: 494 ESEKLFELMVRIGVKPNVITYNTLINGYCLAGKMDEAMKLLSGMVSVGLKPNTVTYSTLI 553
Query: 651 NGYCNINFLRKAYELLCDMKNRGREPDIFIYTILLDGQFKTRVRELCSSVEIAFTSTIFD 710
NGYC I+ + A L +M++ G PDI Y I+L G F+TR A ++
Sbjct: 554 NGYCKISRMEDALVLFKEMESSGVSPDIITYNIILQGLFQTR--------RTAAAKELYV 613
Query: 711 EMKDMKFTPDVVYYTVMIDGYCKMNKLNDAVVLFEEMIDQGIEPDTVTYTALLSGCCRSG 770
+ + ++ Y +++ G CK +DA+ +F+ + ++ + T+ ++ + G
Sbjct: 614 RITESGTQIELSTYNIILHGLCKNKLTDDALQMFQNLCLMDLKLEARTFNIMIDALLKVG 673
Query: 771 NTEKAQTLYYDMVSKGILP 779
++A+ L+ S G++P
Sbjct: 674 RNDEAKDLFVAFSSNGLVP 684
BLAST of HG10008123 vs. ExPASy Swiss-Prot
Match:
Q9LSL9 (Pentatricopeptide repeat-containing protein At5g65560 OS=Arabidopsis thaliana OX=3702 GN=At5g65560 PE=2 SV=1)
HSP 1 Score: 250.8 bits (639), Expect = 5.4e-65
Identity = 201/780 (25.77%), Postives = 351/780 (45.00%), Query Frame = 0
Query: 58 NPSKVAQILESLRREPNIAFSFFREL-EEWGFQHNISTYAALIRILCS---WGLERKLES 117
+PS V+ + SL +P A +F + + ++H++ +YA+L+ +L + G+ K+
Sbjct: 89 SPSHVSSLF-SLDLDPKTALNFSHWISQNPRYKHSVYSYASLLTLLINNGYVGVVFKIRL 148
Query: 118 LFSNLIASKRREFDVLDLLESLNQGYVVDGSFIRAYDALIKAYVSVSLFDSVVDLLFRLE 177
L S VLDL +N+ D F Y +I Y
Sbjct: 149 LMIKSCDSVGDALYVLDLCRKMNK----DERFELKYKLIIGCY----------------- 208
Query: 178 RKGFVPHIFTCNFLLNRLIEHGKMNMALVVYKQLKRFGCQPNDYTYATVIKALCKIGHME 237
N LLN L G ++ VY ++ PN YTY ++ CK+G++E
Sbjct: 209 -----------NTLLNSLARFGLVDEMKQVYMEMLEDKVCPNIYTYNKMVNGYCKLGNVE 268
Query: 238 EAIDIFEEMSESGVVPNAFACAAYIEGLCTHDCSTSGYQLLQAWRAEQVPIDAYAYSVVI 297
EA ++ E+G+ P+ F + I G C S +++ + + AY+ +I
Sbjct: 269 EANQYVSKIVEAGLDPDFFTYTSLIMGYCQRKDLDSAFKVFNEMPLKGCRRNEVAYTHLI 328
Query: 298 RGFCEEMKIDEAENVFLDMEKYGVVPDALTYGVLINGYCKKLMLQKALSLHSLMLSKGVK 357
G C +IDEA ++F+ M+ P TY VLI C +AL+L M G+K
Sbjct: 329 HGLCVARRIDEAMDLFVKMKDDECFPTVRTYTVLIKSLCGSERKSEALNLVKEMEETGIK 388
Query: 358 SN----CVIISSIL-QCLFRMQMYSEVVNQFKIFQGEGLFLDKVVYNIVVHALCELGKLE 417
N V+I S+ QC F + E++ Q +GL + + YN +++ C+ G +E
Sbjct: 389 PNIHTYTVLIDSLCSQCKF--EKARELLGQ---MLEKGLMPNVITYNALINGYCKRGMIE 448
Query: 418 EAIELLEDMTSRQIQMDVVHYTTMIKGFFVQGKIHEAMIMFENLKKNGIEPDTITYNVLA 477
+A++++E M SR++ + Y +IKG + + +H+AM + + + + PD +TYN L
Sbjct: 449 DAVDVVELMESRKLSPNTRTYNELIKG-YCKSNVHKAMGVLNKMLERKVLPDVVTYNSLI 508
Query: 478 TGLSRNGLVSKVHGLLDYMEEHGLKEDPKIRDLVIENLCIGGKVKEATEFFNSLEVKTVD 537
G R+G + LL M + GL D +I++LC +V+EA + F+SLE K V+
Sbjct: 509 DGQCRSGNFDSAYRLLSLMNDRGLVPDQWTYTSMIDSLCKSKRVEEACDLFDSLEQKGVN 568
Query: 538 ----NYSAMINGYCAANHTKDAYELFINLSKKGVFIKRSALVRLVSSLCMEDS------- 597
Y+A+I+GYC A +A+ + + K L+ LC +
Sbjct: 569 PNVVMYTALIDGYCKAGKVDEAHLMLEKMLSKNCLPNSLTFNALIHGLCADGKLKEATLL 628
Query: 598 SDRDIKV----------------------------MKQLPIRNVEANEIVYNKVIASLCR 657
++ +K+ +Q+ + + Y I + CR
Sbjct: 629 EEKMVKIGLQPTVSTDTILIHRLLKDGDFDHAYSRFQQMLSSGTKPDAHTYTTFIQTYCR 688
Query: 658 ERNMKKAQCLFDFLVSSGLTPDLITYTMMINGYCNINFLRKAYELLCDMKNRGREPDIFI 717
E + A+ + + +G++PDL TY+ +I GY ++ A+++L M++ G EP
Sbjct: 689 EGRLLDAEDMMAKMRENGVSPDLFTYSSLIKGYGDLGQTNFAFDVLKRMRDTGCEPSQHT 748
Query: 718 YTILLD-------GQFKTRVRELC---SSVEIAFTSTIFDEMKDMKFTPDVVYYTVMIDG 777
+ L+ G+ K ELC + +E + ++M + TP+ Y +I G
Sbjct: 749 FLSLIKHLLEMKYGKQKGSEPELCAMSNMMEFDTVVELLEKMVEHSVTPNAKSYEKLILG 808
Query: 778 YCKMNKLNDAVVLFEEM-IDQGIEPDTVTYTALLSGCCRSGNTEKAQTLYYDMVSKGILP 779
C++ L A +F+ M ++GI P + + ALLS CC+ +A + DM+ G LP
Sbjct: 809 ICEVGNLRVAEKVFDHMQRNEGISPSELVFNALLSCCCKLKKHNEAAKVVDDMICVGHLP 829
BLAST of HG10008123 vs. ExPASy Swiss-Prot
Match:
Q9FJE6 (Putative pentatricopeptide repeat-containing protein At5g59900 OS=Arabidopsis thaliana OX=3702 GN=At5g59900 PE=3 SV=1)
HSP 1 Score: 248.8 bits (634), Expect = 2.0e-64
Identity = 189/820 (23.05%), Postives = 348/820 (42.44%), Query Frame = 0
Query: 62 VAQILESLRREPNIAFSFFRELE-EWGFQHNISTYAALIRILCSWGLERKLESLFSNLIA 121
V +IL +P + FF L GF H+ +++ LI L L SL L+
Sbjct: 73 VEEILIGTIDDPKLGLRFFNFLGLHRGFDHSTASFCILIHALVKANLFWPASSLLQTLLL 132
Query: 122 SKRREFDVLDLLESLNQGYVVDGSFIRAYDALIKAYV-SVSLFDSVVDLLFRLERKGFVP 181
+ DV ++L S + + S ++D LI+ YV S + D V+ + + +P
Sbjct: 133 RALKPSDVFNVLFSCYEKCKLSSS--SSFDLLIQHYVRSRRVLDGVLVFKMMITKVSLLP 192
Query: 182 HIFTCNFLLNRLIEHGKMNMALVVYKQLKRFGCQPNDYTYATVIKALCK----------I 241
+ T + LL+ L++ +A+ ++ + G +P+ Y Y VI++LC+ I
Sbjct: 193 EVRTLSALLHGLVKFRHFGLAMELFNDMVSVGIRPDVYIYTGVIRSLCELKDLSRAKEMI 252
Query: 242 GHME-------------------------EAIDIFEEMSESGVVPNAFACAAYIEGLCTH 301
HME EA+ I ++++ + P+ + GLC
Sbjct: 253 AHMEATGCDVNIVPYNVLIDGLCKKQKVWEAVGIKKDLAGKDLKPDVVTYCTLVYGLCKV 312
Query: 302 DCSTSGYQLL-----------------------QAWRAEQ------------VPIDAYAY 361
G +++ + + E+ V + + Y
Sbjct: 313 QEFEIGLEMMDEMLCLRFSPSEAAVSSLVEGLRKRGKIEEALNLVKRVVDFGVSPNLFVY 372
Query: 362 SVVIRGFCEEMKIDEAENVFLDMEKYGVVPDALTYGVLINGYCKKLMLQKALSLHSLMLS 421
+ +I C+ K EAE +F M K G+ P+ +TY +LI+ +C++ L ALS M+
Sbjct: 373 NALIDSLCKGRKFHEAELLFDRMGKIGLRPNDVTYSILIDMFCRRGKLDTALSFLGEMVD 432
Query: 422 KGVKSNCVIISSILQCLFRMQMYSEVVNQFKIFQGEGLFLDKVVYNIVVHALCELGKLEE 481
G+K + +S++ + S + L V Y ++ C GK+ +
Sbjct: 433 TGLKLSVYPYNSLINGHCKFGDISAAEGFMAEMINKKLEPTVVTYTSLMGGYCSKGKINK 492
Query: 482 AIELLEDMTSRQIQMDVVHYTTMIKGFFVQGKIHEAMIMFENLKKNGIEPDTITYNVLAT 541
A+ L +MT + I + +TT++ G F G I +A+ +F + + ++P+ +TYNV+
Sbjct: 493 ALRLYHEMTGKGIAPSIYTFTTLLSGLFRAGLIRDAVKLFNEMAEWNVKPNRVTYNVMIE 552
Query: 542 GLSRNGLVSKVHGLLDYMEEHGLKEDPKIRDLVIENLCIGGKVKEATEFFNSLEVKTVD- 601
G G +SK L M E G+ D +I LC+ G+ EA F + L +
Sbjct: 553 GYCEEGDMSKAFEFLKEMTEKGIVPDTYSYRPLIHGLCLTGQASEAKVFVDGLHKGNCEL 612
Query: 602 ---NYSAMINGYCAANHTKDAYELFINLSKKGVFIKRSALVRLVSSLCMEDSSDRDIKVM 661
Y+ +++G+C ++A + + ++GV + L+ ++
Sbjct: 613 NEICYTGLLHGFCREGKLEEALSVCQEMVQRGVDLDLVCYGVLIDGSLKHKDRKLFFGLL 672
Query: 662 KQLPIRNVEANEIVYNKVIASLCRERNMKKAQCLFDFLVSSGLTPDLITYTMMINGYCNI 721
K++ R ++ ++++Y +I + + + K+A ++D +++ G P+ +TYT +ING C
Sbjct: 673 KEMHDRGLKPDDVIYTSMIDAKSKTGDFKEAFGIWDLMINEGCVPNEVTYTAVINGLCKA 732
Query: 722 NFLRKAYELLCDMKNRGREPDIFIYTILLD---------------------------GQF 779
F+ +A L M+ P+ Y LD +
Sbjct: 733 GFVNEAEVLCSKMQPVSSVPNQVTYGCFLDILTKGEVDMQKAVELHNAILKGLLANTATY 792
BLAST of HG10008123 vs. ExPASy TrEMBL
Match:
A0A1S4DV07 (pentatricopeptide repeat-containing protein At2g26790, mitochondrial-like isoform X1 OS=Cucumis melo OX=3656 GN=LOC103487387 PE=4 SV=1)
HSP 1 Score: 1350.9 bits (3495), Expect = 0.0e+00
Identity = 672/796 (84.42%), Postives = 733/796 (92.09%), Query Frame = 0
Query: 1 MNCLLLAINNPIKLNKFKLKLVKFASTAIAQPNPCSFSHSEDEQSTTSINTTLHVQCNPS 60
MNCLLLA+NNP KLNK KLKLVKFASTAIAQ N C FSHS+DEQ+TTS NTTL+VQC PS
Sbjct: 1 MNCLLLAVNNPTKLNKLKLKLVKFASTAIAQLNSCFFSHSDDEQTTTSFNTTLNVQCKPS 60
Query: 61 KVAQILESLRREPNIAFSFFRELEEWGFQHNISTYAALIRILCSWGLERKLESLFSNLIA 120
KV Q+LESLRREPNIAFSFFRELEE GFQHNISTYAALIRILCSW L RKLE+LF NLI
Sbjct: 61 KVVQVLESLRREPNIAFSFFRELEERGFQHNISTYAALIRILCSWRLRRKLETLFLNLIG 120
Query: 121 SKRREFDVLDLLESLNQGYVVDGSFIRAYDALIKAYVSVSLFDSVVDLLFRLERKGFVPH 180
SK+ +FDVLDLLESLNQG V+D SFIRAYDALIKAYVSV+LFDSVVDLLFRL RKGFVPH
Sbjct: 121 SKKMDFDVLDLLESLNQGCVLDASFIRAYDALIKAYVSVNLFDSVVDLLFRLGRKGFVPH 180
Query: 181 IFTCNFLLNRLIEHGKMNMALVVYKQLKRFGCQPNDYTYATVIKALCKIGHMEEAIDIFE 240
IFTCN+LLNRLIEHGKMNMALVVYKQLKRFGCQPNDYTYATVIKALCKIG+MEEAIDIFE
Sbjct: 181 IFTCNYLLNRLIEHGKMNMALVVYKQLKRFGCQPNDYTYATVIKALCKIGNMEEAIDIFE 240
Query: 241 EMSESGVVPNAFACAAYIEGLCTHDCSTSGYQLLQAWRAEQVPIDAYAYSVVIRGFCEEM 300
EMS G+VPNAFACAAYIE LCTHDCSTSGYQLLQAWRAE+ PIDAYAYSVVIRGFC+EM
Sbjct: 241 EMSGYGMVPNAFACAAYIEALCTHDCSTSGYQLLQAWRAERFPIDAYAYSVVIRGFCDEM 300
Query: 301 KIDEAENVFLDMEKYGVVPDALTYGVLINGYCKKLMLQKALSLHSLMLSKGVKSNCVIIS 360
KIDEAE+VFLDME YGVVPDA TY VLI+GYCKKL LQKALSLHSLMLSKG+KSNCVI+S
Sbjct: 301 KIDEAESVFLDMENYGVVPDAQTYSVLIDGYCKKLNLQKALSLHSLMLSKGIKSNCVIVS 360
Query: 361 SILQCLFRMQMYSEVVNQFKIFQGEGLFLDKVVYNIVVHALCELGKLEEAIELLEDMTSR 420
ILQC RMQMYSEVVNQFK+FQ +GLFLDKVVYNIVVHALCELGKLEEAIELLE+MTSR
Sbjct: 361 FILQCFLRMQMYSEVVNQFKVFQEKGLFLDKVVYNIVVHALCELGKLEEAIELLEEMTSR 420
Query: 421 QIQMDVVHYTTMIKGFFVQGKIHEAMIMFENLKKNGIEPDTITYNVLATGLSRNGLVSKV 480
QIQMDV+HYTTMIKG F QGKIHEAM+MFENLKKNG+EPD+ITYNVLA GLSRNGLVSKV
Sbjct: 421 QIQMDVIHYTTMIKGLFAQGKIHEAMMMFENLKKNGVEPDSITYNVLAAGLSRNGLVSKV 480
Query: 481 HGLLDYMEEHGLKEDPKIRDLVIENLCIGGKVKEATEFFNSLEVKTVDNYSAMINGYCAA 540
LL+YMEEHGL+EDPK+ +LVIENLCIGGKVKEATEFFNSLEVKTVDNY+AMINGYCAA
Sbjct: 481 QELLNYMEEHGLREDPKMPNLVIENLCIGGKVKEATEFFNSLEVKTVDNYAAMINGYCAA 540
Query: 541 NHTKDAYELFINLSKKGVFIKRSALVRLVSSLCMEDSSDRDIKVMKQLPIRNVEANEIVY 600
N TK AY+LF+NLSK+GVFIKRS+LVRLVS LCME+SS R I+VMKQLP+ N+EA E VY
Sbjct: 541 NSTKAAYKLFVNLSKEGVFIKRSSLVRLVSRLCMENSSFRAIEVMKQLPVMNLEAKEFVY 600
Query: 601 NKVIASLCRERNMKKAQCLFDFLVSSGLTPDLITYTMMINGYCNINFLRKAYELLCDMKN 660
NKVIASLC+ RNMK AQCLFD LV +GL PDLITYTMMINGYC IN+LR+AYELLCDM+N
Sbjct: 601 NKVIASLCQVRNMKMAQCLFDCLVRAGLIPDLITYTMMINGYCKINYLREAYELLCDMRN 660
Query: 661 RGREPDIFIYTILLDGQFKTRVRELCSSVEIAFTSTIFDEMKDMKFTPDVVYYTVMIDGY 720
RGREPDIF+YT+LLDG FKTRV++ CSSVE+A TS+IF+E KDMK TPDVVYYTV+IDGY
Sbjct: 661 RGREPDIFVYTVLLDGGFKTRVQK-CSSVELALTSSIFNETKDMKITPDVVYYTVLIDGY 720
Query: 721 CKMNKLNDAVVLFEEMIDQGIEPDTVTYTALLSGCCRSGNTEKAQTLYYDMVSKGILPPK 780
CKMN LN A VLFEEM+DQGIE D VTYTALLSGCCR+G+ EKAQTL Y+M SKGILPP+
Sbjct: 721 CKMNNLNAAFVLFEEMVDQGIEADAVTYTALLSGCCRNGDKEKAQTLCYEMTSKGILPPE 780
Query: 781 HFSFLLHHDTLKTKKV 797
+FS+LL HDTL+TKK+
Sbjct: 781 NFSYLLQHDTLETKKI 795
BLAST of HG10008123 vs. ExPASy TrEMBL
Match:
A0A5A7U2I6 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold675G00200 PE=4 SV=1)
HSP 1 Score: 1349.3 bits (3491), Expect = 0.0e+00
Identity = 670/796 (84.17%), Postives = 733/796 (92.09%), Query Frame = 0
Query: 1 MNCLLLAINNPIKLNKFKLKLVKFASTAIAQPNPCSFSHSEDEQSTTSINTTLHVQCNPS 60
MNCLLLA+NNP KLNK KLKLVKFASTAIAQ N C FSHS+DEQ+TTS NTTL+VQC PS
Sbjct: 1 MNCLLLAVNNPTKLNKLKLKLVKFASTAIAQLNSCFFSHSDDEQTTTSFNTTLNVQCKPS 60
Query: 61 KVAQILESLRREPNIAFSFFRELEEWGFQHNISTYAALIRILCSWGLERKLESLFSNLIA 120
KV Q+LESLRREPNIAFSFFRELEE GFQHN+STYAALIRILCSW L RKLE+LF NLI
Sbjct: 61 KVVQVLESLRREPNIAFSFFRELEERGFQHNVSTYAALIRILCSWRLRRKLETLFLNLIG 120
Query: 121 SKRREFDVLDLLESLNQGYVVDGSFIRAYDALIKAYVSVSLFDSVVDLLFRLERKGFVPH 180
SK+ +FDVLDLLESLNQG V+D SFIRAYDALIKAYV+V+LFDSVVDLLFRL RKGFVPH
Sbjct: 121 SKKMDFDVLDLLESLNQGCVLDASFIRAYDALIKAYVNVNLFDSVVDLLFRLGRKGFVPH 180
Query: 181 IFTCNFLLNRLIEHGKMNMALVVYKQLKRFGCQPNDYTYATVIKALCKIGHMEEAIDIFE 240
IFTCN+LLNRLIEHGKMNMALVVYKQLKRFGCQPNDYTYATVIKALCKIG+MEEAIDIFE
Sbjct: 181 IFTCNYLLNRLIEHGKMNMALVVYKQLKRFGCQPNDYTYATVIKALCKIGNMEEAIDIFE 240
Query: 241 EMSESGVVPNAFACAAYIEGLCTHDCSTSGYQLLQAWRAEQVPIDAYAYSVVIRGFCEEM 300
EMS G+VPNAFACAAYIE LCTHDCSTSGYQLLQAWRAE+ PIDAYAYSVVIRGFC+EM
Sbjct: 241 EMSGYGMVPNAFACAAYIEALCTHDCSTSGYQLLQAWRAERFPIDAYAYSVVIRGFCDEM 300
Query: 301 KIDEAENVFLDMEKYGVVPDALTYGVLINGYCKKLMLQKALSLHSLMLSKGVKSNCVIIS 360
KIDEAE+VFLDME YGVVPDA TY VLI+GYCKKL LQKALSLHSLMLSKG+KSNCVI+S
Sbjct: 301 KIDEAESVFLDMENYGVVPDAQTYSVLIDGYCKKLNLQKALSLHSLMLSKGIKSNCVIVS 360
Query: 361 SILQCLFRMQMYSEVVNQFKIFQGEGLFLDKVVYNIVVHALCELGKLEEAIELLEDMTSR 420
ILQC RMQMYSEVVNQFK+FQ +GLFLDKVVYNIVVHALCELGKLEEAIELLE+MTSR
Sbjct: 361 FILQCFLRMQMYSEVVNQFKVFQEKGLFLDKVVYNIVVHALCELGKLEEAIELLEEMTSR 420
Query: 421 QIQMDVVHYTTMIKGFFVQGKIHEAMIMFENLKKNGIEPDTITYNVLATGLSRNGLVSKV 480
QIQMDV+HYTTMIKG F QGKIHEAM+MFENLKKNG+EPD+ITYNVLA GLSRNGLVSKV
Sbjct: 421 QIQMDVIHYTTMIKGLFAQGKIHEAMMMFENLKKNGVEPDSITYNVLAAGLSRNGLVSKV 480
Query: 481 HGLLDYMEEHGLKEDPKIRDLVIENLCIGGKVKEATEFFNSLEVKTVDNYSAMINGYCAA 540
LL+YMEEHGL+EDPK+ +LVIENLCIGGKVKEATEFFNSLEVKTVDNY+AMINGYCAA
Sbjct: 481 QELLNYMEEHGLREDPKMPNLVIENLCIGGKVKEATEFFNSLEVKTVDNYAAMINGYCAA 540
Query: 541 NHTKDAYELFINLSKKGVFIKRSALVRLVSSLCMEDSSDRDIKVMKQLPIRNVEANEIVY 600
N TK AY+LF+NLSK+GVFIKRS+LVRLVS LCME+SS R I+VMKQLP+ N+EA E VY
Sbjct: 541 NSTKAAYKLFVNLSKEGVFIKRSSLVRLVSRLCMENSSFRAIEVMKQLPVMNLEAKEFVY 600
Query: 601 NKVIASLCRERNMKKAQCLFDFLVSSGLTPDLITYTMMINGYCNINFLRKAYELLCDMKN 660
NKVIASLC+ RNMK AQCLFD LV +GL PDLITYTMMINGYC IN+LR+AYELLCDM+N
Sbjct: 601 NKVIASLCQVRNMKMAQCLFDCLVRAGLIPDLITYTMMINGYCKINYLREAYELLCDMRN 660
Query: 661 RGREPDIFIYTILLDGQFKTRVRELCSSVEIAFTSTIFDEMKDMKFTPDVVYYTVMIDGY 720
RGREPDIF+YT+LLDG FKTRV++ CSSVE+A TS+IF+E KDMK TPDVVYYTV+IDGY
Sbjct: 661 RGREPDIFVYTVLLDGGFKTRVQK-CSSVELALTSSIFNETKDMKITPDVVYYTVLIDGY 720
Query: 721 CKMNKLNDAVVLFEEMIDQGIEPDTVTYTALLSGCCRSGNTEKAQTLYYDMVSKGILPPK 780
CKMN LN A VLFEEM+DQGIE D VTYTALLSGCCR+G+ EKAQTL Y+M SKGILPP+
Sbjct: 721 CKMNNLNAAFVLFEEMVDQGIEADAVTYTALLSGCCRNGDKEKAQTLCYEMTSKGILPPE 780
Query: 781 HFSFLLHHDTLKTKKV 797
+FS+LL HDTL+TKK+
Sbjct: 781 NFSYLLQHDTLETKKI 795
BLAST of HG10008123 vs. ExPASy TrEMBL
Match:
A0A6J1JGR8 (pentatricopeptide repeat-containing protein At2g26790, mitochondrial OS=Cucurbita maxima OX=3661 GN=LOC111484960 PE=4 SV=1)
HSP 1 Score: 1341.6 bits (3471), Expect = 0.0e+00
Identity = 677/805 (84.10%), Postives = 727/805 (90.31%), Query Frame = 0
Query: 1 MNCLLLAINNPIKLNKFKLKLVKFASTAIAQPNPCSFSHSEDEQSTTSINTTLHVQCNPS 60
MNCLLLA+NNPIKL++ KLKLVKFASTAIAQ NPCSFSHS+DEQSTT +TTL VQ NPS
Sbjct: 1 MNCLLLAVNNPIKLSRLKLKLVKFASTAIAQLNPCSFSHSDDEQSTTFFHTTLPVQFNPS 60
Query: 61 KVAQILESLRREPNIAFSFFRELEEWGFQHNISTYAALIRILCSWGLERKLESLFSNLIA 120
KV QIL SLRREP IAFSFFRELEE GFQHNISTYAALIRILCSWGLERKLESLF NLI
Sbjct: 61 KVVQILGSLRREPKIAFSFFRELEERGFQHNISTYAALIRILCSWGLERKLESLFLNLIE 120
Query: 121 SKRREFDVLDLLESLNQGYVVDGSFIRAYDALIKAYVSVSLFDSVVDLLFRLERKGFVPH 180
SK+ EFDVLDLLESLNQGY V GSF RAYDALIKAYVSVSLFDS VDLLFR ERKGFVPH
Sbjct: 121 SKKMEFDVLDLLESLNQGYAVGGSFTRAYDALIKAYVSVSLFDSAVDLLFRSERKGFVPH 180
Query: 181 IFTCNFLLNRLIEHGKMNMALVVYKQLKRFGCQPNDYTYATVIKALCKIGHMEEAIDIFE 240
IFTCNFLLNRL EHGKMNMALVVYKQLKRFG PNDYTYA VIKALCK+G+MEEAI IFE
Sbjct: 181 IFTCNFLLNRLTEHGKMNMALVVYKQLKRFGFHPNDYTYAIVIKALCKMGNMEEAIYIFE 240
Query: 241 EMSESGVVPNAFACAAYIEGLCTHDCSTSGYQLLQAWRAEQVPIDAYAYSVVIRGFCEEM 300
EMSE+GVV +AFA AYIEGLCTH CS S YQLLQAWR Q PID YAY VVIRGFCEEM
Sbjct: 241 EMSEAGVVASAFAYTAYIEGLCTHHCSASAYQLLQAWREAQAPIDVYAYFVVIRGFCEEM 300
Query: 301 KIDEAENVFLDMEKYGVVPDALTYGVLINGYCKKLMLQKALSLHSLMLSKGVKSNCVIIS 360
KIDEAENVFL+ME+YGVVPDA TYGVLINGYCKKL LQKALSLHSLMLSKG+K+NCVI+S
Sbjct: 301 KIDEAENVFLEMEQYGVVPDAQTYGVLINGYCKKLKLQKALSLHSLMLSKGIKTNCVIVS 360
Query: 361 SILQCLFRMQMYSEVVNQFKIFQGEGLFLDKVVYNIVVHALCELGKLEEAIELLEDMTSR 420
SILQCL RMQM SEVVNQFK+FQG+G+F DKV YNIVVHALCE GKLEEA+ELLEDMTSR
Sbjct: 361 SILQCLLRMQMCSEVVNQFKVFQGKGVFFDKVAYNIVVHALCEQGKLEEAMELLEDMTSR 420
Query: 421 QIQMDVVHYTTMIKGFFVQGKIHEAMIMFENLKKNGIEPDTITYNVLATGLSRNGLVSKV 480
QIQMDVVHYTTMIKGFFVQGKIHEAM+MFENLKKNGIEPDTITYNVLA GLSRNGLVSKV
Sbjct: 421 QIQMDVVHYTTMIKGFFVQGKIHEAMVMFENLKKNGIEPDTITYNVLAAGLSRNGLVSKV 480
Query: 481 HGLLDYMEEHGLKEDPKIRDLVIENLCIGGKVKEATEFFNSLEVKTVDNYSAMINGYCAA 540
GLLDYMEEHGL+EDPKI DLVIENLC+GGKVKEATEFFNSLEVKTV+NYSAMINGYCAA
Sbjct: 481 QGLLDYMEEHGLREDPKILDLVIENLCVGGKVKEATEFFNSLEVKTVENYSAMINGYCAA 540
Query: 541 NHTKDAYELFINLSKKGVFIKRSALVRLVSSLCMEDSSDRDIKVMKQLPIRNVEANEIVY 600
NHT DAY+LF+NLSKKGV IK+S L RLVSSLCMEDSSDR IKV+K+LPI +V+AN+IVY
Sbjct: 541 NHTNDAYDLFVNLSKKGVSIKKSTLSRLVSSLCMEDSSDRAIKVIKKLPIMDVKANKIVY 600
Query: 601 NKVIASLCRERNMKKAQCLFDFLVSSGLTPDLITYTMMINGYCNINFLRKAYELLCDMKN 660
N VI+SLCR NMK+AQ LFD LV +GLTPDLITYTMMINGYC INFLR+AYELLCDMKN
Sbjct: 601 NNVISSLCRVGNMKRAQSLFDCLVCAGLTPDLITYTMMINGYCKINFLREAYELLCDMKN 660
Query: 661 RGREPDIFIYTILLDGQFKTRVRELCSSV-------EIAFTSTIFDEMKDMKFTPDVVYY 720
RGR+PDI IYT+LLDG FKTR++ELCSSV EIA STIFDEMKDMK TPD++ Y
Sbjct: 661 RGRKPDIVIYTVLLDGLFKTRLQELCSSVHLRGEKQEIALASTIFDEMKDMKITPDIICY 720
Query: 721 TVMIDGYCKMNKLNDAVVLFEEMIDQGIEPDTVTYTALLSGCCRSGNTEKAQTLYYDMVS 780
TV+IDGYCKMN L+DAVVLFEEM+DQGIEPDTVTYTALLSGCCRSG+TEKA TL DM+S
Sbjct: 721 TVLIDGYCKMNNLSDAVVLFEEMVDQGIEPDTVTYTALLSGCCRSGDTEKAVTLMSDMLS 780
Query: 781 KGILPPKHFSFLLHHDTLKTKKVQL 799
KGILP + FS LLHH+T KTKK++L
Sbjct: 781 KGILPSEQFSTLLHHNTPKTKKMRL 805
BLAST of HG10008123 vs. ExPASy TrEMBL
Match:
A0A6J1E1Q9 (pentatricopeptide repeat-containing protein At2g26790, mitochondrial-like OS=Cucurbita moschata OX=3662 GN=LOC111430036 PE=4 SV=1)
HSP 1 Score: 1315.4 bits (3403), Expect = 0.0e+00
Identity = 666/805 (82.73%), Postives = 721/805 (89.57%), Query Frame = 0
Query: 1 MNCLLLAINNPIKLNKFKLKLVKFASTAIAQPNPCSFSHSEDEQSTTSINTTLHVQCNPS 60
MNCLLLA+NNPI KLKLVKFASTAIAQ NPCSFSHS+ EQSTT +TTL VQ NPS
Sbjct: 1 MNCLLLAVNNPI-----KLKLVKFASTAIAQLNPCSFSHSDGEQSTTFFHTTLPVQFNPS 60
Query: 61 KVAQILESLRREPNIAFSFFRELEEWGFQHNISTYAALIRILCSWGLERKLESLFSNLIA 120
KV QIL SLRREPNIAFSFFRELE+ GFQHNISTYAALIRILCSWGLERKLESLF NLI
Sbjct: 61 KVVQILGSLRREPNIAFSFFRELEDRGFQHNISTYAALIRILCSWGLERKLESLFLNLID 120
Query: 121 SKRREFDVLDLLESLNQGYVVDGSFIRAYDALIKAYVSVSLFDSVVDLLFRLERKGFVPH 180
SK+ EFDVLDLLESLNQGY V+GSF RAYDALIKAYVSVSLFDS VDLLFR RKGFVPH
Sbjct: 121 SKKMEFDVLDLLESLNQGYAVNGSFTRAYDALIKAYVSVSLFDSAVDLLFRSGRKGFVPH 180
Query: 181 IFTCNFLLNRLIEHGKMNMALVVYKQLKRFGCQPNDYTYATVIKALCKIGHMEEAIDIFE 240
IFTCN+LLNRLI+HGKMN+ALVVYKQLKRFG PNDYTYA VIKALCK+G+MEEAI IFE
Sbjct: 181 IFTCNYLLNRLIKHGKMNIALVVYKQLKRFGFHPNDYTYAIVIKALCKMGNMEEAIYIFE 240
Query: 241 EMSESGVVPNAFACAAYIEGLCTHDCSTSGYQLLQAWRAEQVPIDAYAYSVVIRGFCEEM 300
EMSE+GVV +AFA AYIEGLCTH CS S YQLLQAWRA Q PID YAYSVVI GFCEEM
Sbjct: 241 EMSEAGVVASAFAYTAYIEGLCTHHCSASAYQLLQAWRAAQAPIDVYAYSVVIHGFCEEM 300
Query: 301 KIDEAENVFLDMEKYGVVPDALTYGVLINGYCKKLMLQKALSLHSLMLSKGVKSNCVIIS 360
KIDEAENVFL+MEKYGVVP+A TYGVLINGYCKKL LQKALSLHSLMLSKG+K+NCVI+S
Sbjct: 301 KIDEAENVFLEMEKYGVVPNAQTYGVLINGYCKKLKLQKALSLHSLMLSKGIKTNCVIVS 360
Query: 361 SILQCLFRMQMYSEVVNQFKIFQGEGLFLDKVVYNIVVHALCELGKLEEAIELLEDMTSR 420
SILQCL RMQM SEVVNQFK+FQG+G++ DKV YNIV+HALCE GKLEEA+ELLEDMTSR
Sbjct: 361 SILQCLLRMQMCSEVVNQFKVFQGKGVYFDKVAYNIVLHALCEQGKLEEAMELLEDMTSR 420
Query: 421 QIQMDVVHYTTMIKGFFVQGKIHEAMIMFENLKKNGIEPDTITYNVLATGLSRNGLVSKV 480
QIQMDVVHYTTMIKGF VQGKIHEAM+MFENLKKNGIEPDTITY+VLA GLSRNGLVSKV
Sbjct: 421 QIQMDVVHYTTMIKGFLVQGKIHEAMVMFENLKKNGIEPDTITYDVLAAGLSRNGLVSKV 480
Query: 481 HGLLDYMEEHGLKEDPKIRDLVIENLCIGGKVKEATEFFNSLEVKTVDNYSAMINGYCAA 540
GLLDYMEEHGL+EDPKI +LVIENLC+GGKVKEATEFFNSLEVKTVDNYSAMINGYCAA
Sbjct: 481 QGLLDYMEEHGLREDPKILNLVIENLCVGGKVKEATEFFNSLEVKTVDNYSAMINGYCAA 540
Query: 541 NHTKDAYELFINLSKKGVFIKRSALVRLVSSLCMEDSSDRDIKVMKQLPIRNVEANEIVY 600
NHT DAY+LF+NLSKKGV IK+S L RLVSSLCMEDSS R IKV+K+LPI +VEANEIVY
Sbjct: 541 NHTNDAYDLFVNLSKKGVSIKKSTLSRLVSSLCMEDSSGRAIKVIKKLPIMDVEANEIVY 600
Query: 601 NKVIASLCRERNMKKAQCLFDFLVSSGLTPDLITYTMMINGYCNINFLRKAYELLCDMKN 660
N VI+SLCR NMK+AQ LFD LV +GLTPDLITYTMMI GYC INFLR+AY LLCDMKN
Sbjct: 601 NNVISSLCRVGNMKRAQSLFDCLVCAGLTPDLITYTMMIKGYCKINFLREAYLLLCDMKN 660
Query: 661 RGREPDIFIYTILLDGQFKTRVRELCSSV-------EIAFTSTIFDEMKDMKFTPDVVYY 720
RGR+PDI IYT+LLDG FKTR++E CSSV EIAF STIFDEMKDMK TPD++ Y
Sbjct: 661 RGRKPDIVIYTVLLDGLFKTRLQEFCSSVDLRGEKQEIAFASTIFDEMKDMKITPDIICY 720
Query: 721 TVMIDGYCKMNKLNDAVVLFEEMIDQGIEPDTVTYTALLSGCCRSGNTEKAQTLYYDMVS 780
TV+IDGYCKMN LNDAVVLFEEM+DQGIEPD VTYTALLSGCCRSG+T+KA TL DM+S
Sbjct: 721 TVLIDGYCKMNNLNDAVVLFEEMVDQGIEPDIVTYTALLSGCCRSGDTKKAATLMSDMLS 780
Query: 781 KGILPPKHFSFLLHHDTLKTKKVQL 799
KGILP + FS LLHH++LKTKK++L
Sbjct: 781 KGILPSEQFSSLLHHNSLKTKKMRL 800
BLAST of HG10008123 vs. ExPASy TrEMBL
Match:
A0A6J1EVM2 (pentatricopeptide repeat-containing protein At2g26790, mitochondrial-like isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111438390 PE=4 SV=1)
HSP 1 Score: 1297.0 bits (3355), Expect = 0.0e+00
Identity = 648/789 (82.13%), Postives = 705/789 (89.35%), Query Frame = 0
Query: 1 MNCLLLAINNPIKLNKFKLKLVKFASTAIAQPNPCSFSHSEDEQSTTSINTTLHVQCNPS 60
MNCLLL +NNPIKLNK KLKLVKFASTAIA+ N CSFS S+DEQ T S+NTT H+Q NPS
Sbjct: 1 MNCLLLTVNNPIKLNKVKLKLVKFASTAIARLNQCSFSQSDDEQITISLNTTSHIQFNPS 60
Query: 61 KVAQILESLRREPNIAFSFFRELEEWGFQHNISTYAALIRILCSWGLERKLESLFSNLIA 120
KV QILESLRREPN AFSFF +LEE GF HNISTYAALIRILCSWGLERKL+SLF NLI
Sbjct: 61 KVIQILESLRREPNSAFSFFHKLEERGFCHNISTYAALIRILCSWGLERKLDSLFLNLIG 120
Query: 121 SKRREFDVLDLLESLNQGYVVDGSFIRAYDALIKAYVSVSLFDSVVDLLFRLERKGFVPH 180
S++ EFDVLDLLESLNQGYV+DGSFIRAYD+LIKAYVSVSLFDS VDLLFR ERKGFVPH
Sbjct: 121 SEKTEFDVLDLLESLNQGYVMDGSFIRAYDSLIKAYVSVSLFDSAVDLLFRSERKGFVPH 180
Query: 181 IFTCNFLLNRLIEHGKMNMALVVYKQLKRFGCQPNDYTYATVIKALCKIGHMEEAIDIFE 240
IFTCNFLLNRLIEHGK+NMAL VYKQLKRFG PNDYTYA VIKALCK+G+MEEAIDIFE
Sbjct: 181 IFTCNFLLNRLIEHGKLNMALTVYKQLKRFGFHPNDYTYAIVIKALCKMGNMEEAIDIFE 240
Query: 241 EMSESGVVPNAFACAAYIEGLCTHDCSTSGYQLLQAWRAEQVPIDAYAYSVVIRGFCEEM 300
EMSE+GVVPNAFAC AYIEGLCTH CS GYQLLQ WRA Q PID YAY VVIRGFCEEM
Sbjct: 241 EMSEAGVVPNAFACTAYIEGLCTHRCSAYGYQLLQDWRAAQAPIDVYAYFVVIRGFCEEM 300
Query: 301 KIDEAENVFLDMEKYGVVPDALTYGVLINGYCKKLMLQKALSLHSLMLSKGVKSNCVIIS 360
++++AENVFLDMEKYGVVP+A TYGVLINGYCK L LQKALSLHS MLSKG+K+NCVI+S
Sbjct: 301 EVNKAENVFLDMEKYGVVPNAQTYGVLINGYCKMLKLQKALSLHSFMLSKGIKTNCVIVS 360
Query: 361 SILQCLFRMQMYSEVVNQFKIFQGEGLFLDKVVYNIVVHALCELGKLEEAIELLEDMTSR 420
SILQCL RMQMYSEVVNQFK+FQ EG+F DKV YNIVVHALCE GKLEEA+ELLEDMTSR
Sbjct: 361 SILQCLIRMQMYSEVVNQFKVFQDEGVFFDKVAYNIVVHALCEQGKLEEAMELLEDMTSR 420
Query: 421 QIQMDVVHYTTMIKGFFVQGKIHEAMIMFENLKKNGIEPDTITYNVLATGLSRNGLVSKV 480
QIQMDVVHYTTMIKGFFVQGKI+EAM MFENLKKNGIEPDTITYNVLA GLSRNGLVS+V
Sbjct: 421 QIQMDVVHYTTMIKGFFVQGKIYEAMTMFENLKKNGIEPDTITYNVLAAGLSRNGLVSEV 480
Query: 481 HGLLDYMEEHGLKEDPKIRDLVIENLCIGGKVKEATEFFNSLEVKTVDNYSAMINGYCAA 540
LLDYM+EHGL+EDPKI +L+IENLCIGGKVKEATE FNSLEVKTVDNYSAMINGYCAA
Sbjct: 481 KNLLDYMDEHGLREDPKISNLIIENLCIGGKVKEATEIFNSLEVKTVDNYSAMINGYCAA 540
Query: 541 NHTKDAYELFINLSKKGVFIKRSALVRLVSSLCMEDSSDRDIKVMKQLPIRNVEANEIVY 600
N TKDAYELF+NLSKKGV +K+S+L RLVSSLCMEDS+DR IKV+K+L I NVEANEIVY
Sbjct: 541 NRTKDAYELFVNLSKKGVLLKKSSLFRLVSSLCMEDSNDRAIKVIKRLSIINVEANEIVY 600
Query: 601 NKVIASLCRERNMKKAQCLFDFLVSSGLTPDLITYTMMINGYCNINFLRKAYELLCDMKN 660
NKVIASLCR NMK AQCLFD L+ +GLTPDL TYTMMINGYC INFLR+AYELLCDMKN
Sbjct: 601 NKVIASLCRAGNMKMAQCLFDCLICAGLTPDLKTYTMMINGYCKINFLREAYELLCDMKN 660
Query: 661 RGREPDIFIYTILLDGQFKTRVRELCSSV-------EIAFTSTIFDEMKDMKFTPDVVYY 720
RGR+PDIFIYT+LLDGQFKTR+R LCS+V E AFTS IFDEM+DMK TPDV+ Y
Sbjct: 661 RGRKPDIFIYTVLLDGQFKTRLRGLCSAVDLRGAKQETAFTSRIFDEMEDMKITPDVICY 720
Query: 721 TVMIDGYCKMNKLNDAVVLFEEMIDQGIEPDTVTYTALLSGCCRSGNTEKAQTLYYDMVS 780
TV+IDGYCKMN LNDA+VL E+M+DQGI PDTVTYT L+SG CRSG+ EKA L+ DM+S
Sbjct: 721 TVLIDGYCKMNNLNDAIVLLEKMVDQGIMPDTVTYTTLVSGFCRSGDVEKAVALFDDMLS 780
Query: 781 KGILPPKHF 783
KG+LP F
Sbjct: 781 KGVLPDAFF 789
BLAST of HG10008123 vs. TAIR 10
Match:
AT2G26790.1 (Pentatricopeptide repeat (PPR) superfamily protein )
HSP 1 Score: 537.7 bits (1384), Expect = 1.6e-152
Identity = 312/785 (39.75%), Postives = 458/785 (58.34%), Query Frame = 0
Query: 24 FASTAIAQPNPCSFSHSEDEQSTTSINTTLHVQCNPSKVAQILESLRREPNIAFSFFREL 83
+A +A+ PN S S ++Q +N + Q + ++L S R +PN+A SF R+L
Sbjct: 27 YAVSALNNPNNLSDS---EQQQVNHLNLSKLTQ---HGLQRLLNSTRDDPNLALSFLRQL 86
Query: 84 EEWGFQHNISTYAALIRILCSWGLERKLESLFSNLIASKRREFDVLDLLESL-NQGYVVD 143
+E G N++ YA L+RIL +WGL+ KL+S+ LI ++ R F V+DL+E + Q
Sbjct: 87 KEHGVSPNVNAYATLVRILTTWGLDIKLDSVLVELIKNEERGFTVMDLIEVIGEQAEEKK 146
Query: 144 GSF--IRAYDALIKAYVSVSLFDSVVDLLFRLERKGFVPHIFTCNFLLNRLIEHGKMNMA 203
SF IR AL+KAYVS+ +FD D+LF+ +R V I CNFL+NR+ E GK+ M
Sbjct: 147 RSFVLIRVSGALVKAYVSLGMFDEATDVLFQSKRLDCVVDIKACNFLMNRMTEFGKIGML 206
Query: 204 LVVYKQLKRFGCQPNDYTYATVIKALCKIGHMEEAIDIFEEMSESGVVPNAFACAAYIEG 263
+ ++KQLK+ G N+YTYA V+KALC+ G++EEA + E + F +I G
Sbjct: 207 MTLFKQLKQLGLCANEYTYAIVVKALCRKGNLEEAAMLLIENE------SVFGYKTFING 266
Query: 264 LC-THDCSTSGYQLLQAWRAEQVPID--AYAYSVVIRGFCEEMKIDEAENVFLDMEKYGV 323
LC T + + +L+ + + D +V+RGFC EMK+ AE+V ++ME+ G
Sbjct: 267 LCVTGETEKAVALILELIDRKYLAGDDLRAVLGMVVRGFCNEMKMKAAESVIIEMEEIGF 326
Query: 324 VPDALTYGVLINGYCKKLMLQKALSLHSLMLSKGVKSNCVIISSILQCLFRMQMYSEVVN 383
D +I+ YCK + L +AL ML KG+K NCVI+S ILQC +M M E +
Sbjct: 327 GLDVYACLAVIDRYCKNMNLPEALGFLDKMLGKGLKVNCVIVSLILQCYCKMDMCLEALE 386
Query: 384 QFKIFQGEGLFLDKVVYNIVVHALCELGKLEEAIELLEDMTSRQIQMDVVHYTTMIKGFF 443
+FK F+ +FLD+V YN+ AL +LG++EEA ELL++M R I DV++YTT+I G+
Sbjct: 387 KFKEFRDMNIFLDRVCYNVAFDALSKLGRVEEAFELLQEMKDRGIVPDVINYTTLIDGYC 446
Query: 444 VQGKIHEAMIMFENLKKNGIEPDTITYNVLATGLSRNGLVSKVHGLLDYMEEHGLKEDPK 503
+QGK+ +A+ + + + NG+ PD ITYNVL +GL+RNG +V + + M+ G K +
Sbjct: 447 LQGKVVDALDLIDEMIGNGMSPDLITYNVLVSGLARNGHEEEVLEIYERMKAEGPKPNAV 506
Query: 504 IRDLVIENLCIGGKVKEATEFFNSLEVKTVDNYSAMINGYCAANHTKDAYELFINLSKKG 563
++IE LC KVKEA +FF+SLE K +N ++ + GYC A +K AY+ F+ L
Sbjct: 507 TNSVIIEGLCFARKVKEAEDFFSSLEQKCPENKASFVKGYCEAGLSKKAYKAFVRLEYP- 566
Query: 564 VFIKRSALVRLVSSLCMEDSSDRDIKVMKQLPIRNVEANEIVYNKVIASLCRERNMKKAQ 623
+++S ++L SLC+E ++ V+K++ VE + K+I + C+ N+++AQ
Sbjct: 567 --LRKSVYIKLFFSLCIEGYLEKAHDVLKKMSAYRVEPGRSMCGKMIGAFCKLNNVREAQ 626
Query: 624 CLFDFLVSSGLTPDLITYTMMINGYCNINFLRKAYELLCDMKNRGREPDIFIYTILLDGQ 683
LFD +V GL PDL TYT+MI+ YC +N L+KA L DMK RG +PD+ YT+LLD
Sbjct: 627 VLFDTMVERGLIPDLFTYTIMIHTYCRLNELQKAESLFEDMKQRGIKPDVVTYTVLLDRY 686
Query: 684 FK--TRVRELCS---SVEIAFTSTIFDEMKDMKFTPDVVYYTVMIDGYCKMNKLNDAVVL 743
K E CS V S + E DVV YTV+ID CKMN L A L
Sbjct: 687 LKLDPEHHETCSVQGEVGKRKASEVLREFSAAGIGLDVVCYTVLIDRQCKMNNLEQAAEL 746
Query: 744 FEEMIDQGIEPDTVTYTALLSGCCRSGNTEKAQTLYYDMVSKGILPPKHFSFLLHHDTLK 798
F+ MID G+EPD V YT L+S R G + A TL ++ K +P + F + LK
Sbjct: 747 FDRMIDSGLEPDMVAYTTLISSYFRKGYIDMAVTLVTELSKKYNIPSESFEAAVKSAALK 796
BLAST of HG10008123 vs. TAIR 10
Match:
AT1G13800.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )
HSP 1 Score: 459.5 bits (1181), Expect = 5.4e-129
Identity = 273/807 (33.83%), Postives = 433/807 (53.66%), Query Frame = 0
Query: 28 AIAQPNPCSFSHSEDEQSTTSINTTLHVQCNPSKVAQILESLRREPNIAFSFFRELEEWG 87
A+A+ N + SHSE + T L + N V ++L S++ +P +A SF + +E
Sbjct: 29 ALARTN-LTISHSEQVKEGTFDYKAL--ELNDIGVLRVLNSMKDDPYLALSFLKRIEGNV 88
Query: 88 FQHNISTYAALIRILCSWGLERKLESLFSNLI--ASKRREFDVLDLLESLNQGYVVDGSF 147
++ YA +IRI+C WGL++KL++ L+ + R F V+DLL+++ +
Sbjct: 89 TLPSVQAYATVIRIVCGWGLDKKLDTFLFELVRRGDEGRGFSVMDLLKAIGEMEQSLVLL 148
Query: 148 IRAYDALIKAYVSVSLFDSVVDLLFRLERK-GFVPHIFTCNFLLNRLIEHGKMNMALVVY 207
IR AL+KAY ++ +FD +D+ FR G P I NFL++R+I G+ +M + +
Sbjct: 149 IRVSTALVKAYANLDMFDEAIDIFFRAYYSLGRAPDIKALNFLISRMIASGRSDMVVGFF 208
Query: 208 KQLKRFGCQPNDYTYATVIKALCKIGHMEEAIDIFEEMSESGVVPNAFACAAYIEGLCTH 267
+++R G + +TY V++AL + EE + + S +IEGLC +
Sbjct: 209 WEIERLGLDADAHTYVLVVQALWRNDDKEELEKLLSRLLISETRNPCVFYLNFIEGLCLN 268
Query: 268 DCSTSGYQLLQAWRAEQVPID----AYAYSVVIRGFCEEMKIDEAENVFLDMEKYGVVPD 327
+ Y LLQ R + +D AY V+RG C EM+I++AE+V LDMEK+G+ PD
Sbjct: 269 QMTDIAYFLLQPLRDANILVDKSDLGIAYRKVVRGLCYEMRIEDAESVVLDMEKHGIDPD 328
Query: 328 ALTYGVLINGYCKKLMLQKALSLHSLMLSKGVKSNCVIISSILQCLFRMQMYSEVVNQFK 387
Y +I G+ K + + KA+ + + ML K + NCVI+SSILQC +M +SE + FK
Sbjct: 329 VYVYSAIIEGHRKNMNIPKAVDVFNKMLKKRKRINCVIVSSILQCYCQMGNFSEAYDLFK 388
Query: 388 IFQGEGLFLDKVVYNIVVHALCELGKLEEAIELLEDMTSRQIQMDVVHYTTMIKGFFVQG 447
F+ + LD+V YN+ AL +LGK+EEAIEL +MT + I DV++YTT+I G +QG
Sbjct: 389 EFRETNISLDRVCYNVAFDALGKLGKVEEAIELFREMTGKGIAPDVINYTTLIGGCCLQG 448
Query: 448 KIHEAMIMFENLKKNGIEPDTITYNVLATGLSRNGLVSKVHGLLDYMEEHGLKEDPKIRD 507
K +A + + G PD + YNVLA GL+ NGL + L ME G+K +
Sbjct: 449 KCSDAFDLMIEMDGTGKTPDIVIYNVLAGGLATNGLAQEAFETLKMMENRGVKPTYVTHN 508
Query: 508 LVIENLCIGGKVKEATEFFNSLEVKTVDNYSAMINGYCAANHTKDAYELFINLSKKGVFI 567
+VIE L G++ +A F+ SLE K+ +N ++M+ G+CAA A+E FI L +
Sbjct: 509 MVIEGLIDAGELDKAEAFYESLEHKSRENDASMVKGFCAAGCLDHAFERFIRLEFP---L 568
Query: 568 KRSALVRLVSSLCME-DSSDRDIKVMKQLPIRNVEANEIVYNKVIASLCRERNMKKAQCL 627
+S L +SLC E D + ++ ++ VE + +Y K+I + CR N++KA+
Sbjct: 569 PKSVYFTLFTSLCAEKDYISKAQDLLDRMWKLGVEPEKSMYGKLIGAWCRVNNVRKAREF 628
Query: 628 FDFLVSSGLTPDLITYTMMINGYCNINFLRKAYELLCDMKNRGREPDIFIYTILLDGQ-- 687
F+ LV+ + PDL TYT+MIN YC +N ++AY L DMK R +PD+ Y++LL+
Sbjct: 629 FEILVTKKIVPDLFTYTIMINTYCRLNEPKQAYALFEDMKRRDVKPDVVTYSVLLNSDPE 688
Query: 688 ------------------FKTRVRELCSSVEIAFTSTIFDEMKDMKFTPDVV-------- 747
+ + C ++ +F +MK + PDVV
Sbjct: 689 LDMKREMEAFDVIPDVVYYTIMINRYCHLNDLKKVYALFKDMKRREIVPDVVTYTVLLKN 748
Query: 748 --------------------YYTVMIDGYCKMNKLNDAVVLFEEMIDQGIEPDTVTYTAL 779
YYTV+ID CK+ L +A +F++MI+ G++PD YTAL
Sbjct: 749 KPERNLSREMKAFDVKPDVFYYTVLIDWQCKIGDLGEAKRIFDQMIESGVDPDAAPYTAL 808
BLAST of HG10008123 vs. TAIR 10
Match:
AT5G65560.1 (Pentatricopeptide repeat (PPR) superfamily protein )
HSP 1 Score: 250.8 bits (639), Expect = 3.8e-66
Identity = 201/780 (25.77%), Postives = 351/780 (45.00%), Query Frame = 0
Query: 58 NPSKVAQILESLRREPNIAFSFFREL-EEWGFQHNISTYAALIRILCS---WGLERKLES 117
+PS V+ + SL +P A +F + + ++H++ +YA+L+ +L + G+ K+
Sbjct: 89 SPSHVSSLF-SLDLDPKTALNFSHWISQNPRYKHSVYSYASLLTLLINNGYVGVVFKIRL 148
Query: 118 LFSNLIASKRREFDVLDLLESLNQGYVVDGSFIRAYDALIKAYVSVSLFDSVVDLLFRLE 177
L S VLDL +N+ D F Y +I Y
Sbjct: 149 LMIKSCDSVGDALYVLDLCRKMNK----DERFELKYKLIIGCY----------------- 208
Query: 178 RKGFVPHIFTCNFLLNRLIEHGKMNMALVVYKQLKRFGCQPNDYTYATVIKALCKIGHME 237
N LLN L G ++ VY ++ PN YTY ++ CK+G++E
Sbjct: 209 -----------NTLLNSLARFGLVDEMKQVYMEMLEDKVCPNIYTYNKMVNGYCKLGNVE 268
Query: 238 EAIDIFEEMSESGVVPNAFACAAYIEGLCTHDCSTSGYQLLQAWRAEQVPIDAYAYSVVI 297
EA ++ E+G+ P+ F + I G C S +++ + + AY+ +I
Sbjct: 269 EANQYVSKIVEAGLDPDFFTYTSLIMGYCQRKDLDSAFKVFNEMPLKGCRRNEVAYTHLI 328
Query: 298 RGFCEEMKIDEAENVFLDMEKYGVVPDALTYGVLINGYCKKLMLQKALSLHSLMLSKGVK 357
G C +IDEA ++F+ M+ P TY VLI C +AL+L M G+K
Sbjct: 329 HGLCVARRIDEAMDLFVKMKDDECFPTVRTYTVLIKSLCGSERKSEALNLVKEMEETGIK 388
Query: 358 SN----CVIISSIL-QCLFRMQMYSEVVNQFKIFQGEGLFLDKVVYNIVVHALCELGKLE 417
N V+I S+ QC F + E++ Q +GL + + YN +++ C+ G +E
Sbjct: 389 PNIHTYTVLIDSLCSQCKF--EKARELLGQ---MLEKGLMPNVITYNALINGYCKRGMIE 448
Query: 418 EAIELLEDMTSRQIQMDVVHYTTMIKGFFVQGKIHEAMIMFENLKKNGIEPDTITYNVLA 477
+A++++E M SR++ + Y +IKG + + +H+AM + + + + PD +TYN L
Sbjct: 449 DAVDVVELMESRKLSPNTRTYNELIKG-YCKSNVHKAMGVLNKMLERKVLPDVVTYNSLI 508
Query: 478 TGLSRNGLVSKVHGLLDYMEEHGLKEDPKIRDLVIENLCIGGKVKEATEFFNSLEVKTVD 537
G R+G + LL M + GL D +I++LC +V+EA + F+SLE K V+
Sbjct: 509 DGQCRSGNFDSAYRLLSLMNDRGLVPDQWTYTSMIDSLCKSKRVEEACDLFDSLEQKGVN 568
Query: 538 ----NYSAMINGYCAANHTKDAYELFINLSKKGVFIKRSALVRLVSSLCMEDS------- 597
Y+A+I+GYC A +A+ + + K L+ LC +
Sbjct: 569 PNVVMYTALIDGYCKAGKVDEAHLMLEKMLSKNCLPNSLTFNALIHGLCADGKLKEATLL 628
Query: 598 SDRDIKV----------------------------MKQLPIRNVEANEIVYNKVIASLCR 657
++ +K+ +Q+ + + Y I + CR
Sbjct: 629 EEKMVKIGLQPTVSTDTILIHRLLKDGDFDHAYSRFQQMLSSGTKPDAHTYTTFIQTYCR 688
Query: 658 ERNMKKAQCLFDFLVSSGLTPDLITYTMMINGYCNINFLRKAYELLCDMKNRGREPDIFI 717
E + A+ + + +G++PDL TY+ +I GY ++ A+++L M++ G EP
Sbjct: 689 EGRLLDAEDMMAKMRENGVSPDLFTYSSLIKGYGDLGQTNFAFDVLKRMRDTGCEPSQHT 748
Query: 718 YTILLD-------GQFKTRVRELC---SSVEIAFTSTIFDEMKDMKFTPDVVYYTVMIDG 777
+ L+ G+ K ELC + +E + ++M + TP+ Y +I G
Sbjct: 749 FLSLIKHLLEMKYGKQKGSEPELCAMSNMMEFDTVVELLEKMVEHSVTPNAKSYEKLILG 808
Query: 778 YCKMNKLNDAVVLFEEM-IDQGIEPDTVTYTALLSGCCRSGNTEKAQTLYYDMVSKGILP 779
C++ L A +F+ M ++GI P + + ALLS CC+ +A + DM+ G LP
Sbjct: 809 ICEVGNLRVAEKVFDHMQRNEGISPSELVFNALLSCCCKLKKHNEAAKVVDDMICVGHLP 829
BLAST of HG10008123 vs. TAIR 10
Match:
AT5G59900.1 (Pentatricopeptide repeat (PPR) superfamily protein )
HSP 1 Score: 248.8 bits (634), Expect = 1.5e-65
Identity = 189/820 (23.05%), Postives = 348/820 (42.44%), Query Frame = 0
Query: 62 VAQILESLRREPNIAFSFFRELE-EWGFQHNISTYAALIRILCSWGLERKLESLFSNLIA 121
V +IL +P + FF L GF H+ +++ LI L L SL L+
Sbjct: 73 VEEILIGTIDDPKLGLRFFNFLGLHRGFDHSTASFCILIHALVKANLFWPASSLLQTLLL 132
Query: 122 SKRREFDVLDLLESLNQGYVVDGSFIRAYDALIKAYV-SVSLFDSVVDLLFRLERKGFVP 181
+ DV ++L S + + S ++D LI+ YV S + D V+ + + +P
Sbjct: 133 RALKPSDVFNVLFSCYEKCKLSSS--SSFDLLIQHYVRSRRVLDGVLVFKMMITKVSLLP 192
Query: 182 HIFTCNFLLNRLIEHGKMNMALVVYKQLKRFGCQPNDYTYATVIKALCK----------I 241
+ T + LL+ L++ +A+ ++ + G +P+ Y Y VI++LC+ I
Sbjct: 193 EVRTLSALLHGLVKFRHFGLAMELFNDMVSVGIRPDVYIYTGVIRSLCELKDLSRAKEMI 252
Query: 242 GHME-------------------------EAIDIFEEMSESGVVPNAFACAAYIEGLCTH 301
HME EA+ I ++++ + P+ + GLC
Sbjct: 253 AHMEATGCDVNIVPYNVLIDGLCKKQKVWEAVGIKKDLAGKDLKPDVVTYCTLVYGLCKV 312
Query: 302 DCSTSGYQLL-----------------------QAWRAEQ------------VPIDAYAY 361
G +++ + + E+ V + + Y
Sbjct: 313 QEFEIGLEMMDEMLCLRFSPSEAAVSSLVEGLRKRGKIEEALNLVKRVVDFGVSPNLFVY 372
Query: 362 SVVIRGFCEEMKIDEAENVFLDMEKYGVVPDALTYGVLINGYCKKLMLQKALSLHSLMLS 421
+ +I C+ K EAE +F M K G+ P+ +TY +LI+ +C++ L ALS M+
Sbjct: 373 NALIDSLCKGRKFHEAELLFDRMGKIGLRPNDVTYSILIDMFCRRGKLDTALSFLGEMVD 432
Query: 422 KGVKSNCVIISSILQCLFRMQMYSEVVNQFKIFQGEGLFLDKVVYNIVVHALCELGKLEE 481
G+K + +S++ + S + L V Y ++ C GK+ +
Sbjct: 433 TGLKLSVYPYNSLINGHCKFGDISAAEGFMAEMINKKLEPTVVTYTSLMGGYCSKGKINK 492
Query: 482 AIELLEDMTSRQIQMDVVHYTTMIKGFFVQGKIHEAMIMFENLKKNGIEPDTITYNVLAT 541
A+ L +MT + I + +TT++ G F G I +A+ +F + + ++P+ +TYNV+
Sbjct: 493 ALRLYHEMTGKGIAPSIYTFTTLLSGLFRAGLIRDAVKLFNEMAEWNVKPNRVTYNVMIE 552
Query: 542 GLSRNGLVSKVHGLLDYMEEHGLKEDPKIRDLVIENLCIGGKVKEATEFFNSLEVKTVD- 601
G G +SK L M E G+ D +I LC+ G+ EA F + L +
Sbjct: 553 GYCEEGDMSKAFEFLKEMTEKGIVPDTYSYRPLIHGLCLTGQASEAKVFVDGLHKGNCEL 612
Query: 602 ---NYSAMINGYCAANHTKDAYELFINLSKKGVFIKRSALVRLVSSLCMEDSSDRDIKVM 661
Y+ +++G+C ++A + + ++GV + L+ ++
Sbjct: 613 NEICYTGLLHGFCREGKLEEALSVCQEMVQRGVDLDLVCYGVLIDGSLKHKDRKLFFGLL 672
Query: 662 KQLPIRNVEANEIVYNKVIASLCRERNMKKAQCLFDFLVSSGLTPDLITYTMMINGYCNI 721
K++ R ++ ++++Y +I + + + K+A ++D +++ G P+ +TYT +ING C
Sbjct: 673 KEMHDRGLKPDDVIYTSMIDAKSKTGDFKEAFGIWDLMINEGCVPNEVTYTAVINGLCKA 732
Query: 722 NFLRKAYELLCDMKNRGREPDIFIYTILLD---------------------------GQF 779
F+ +A L M+ P+ Y LD +
Sbjct: 733 GFVNEAEVLCSKMQPVSSVPNQVTYGCFLDILTKGEVDMQKAVELHNAILKGLLANTATY 792
BLAST of HG10008123 vs. TAIR 10
Match:
AT1G31840.2 (Tetratricopeptide repeat (TPR)-like superfamily protein )
HSP 1 Score: 242.7 bits (618), Expect = 1.0e-63
Identity = 185/731 (25.31%), Postives = 334/731 (45.69%), Query Frame = 0
Query: 65 ILESLRREPNIAFSFFRELEEWGFQHNISTYAALIRILCSWGLERKLESLFSNLIASKRR 124
+L SL EPN A +FR E G + T A +L G+ + +F +I ++ +
Sbjct: 75 VLLSLESEPNSALKYFRWAEISGKDPSFYTIA---HVLIRNGMFDVADKVFDEMITNRGK 134
Query: 125 EFDVLDLLESLNQGYVVDGSFIRAYDA-----LIKAYVSVSLFDSVVDLLFRLERKGFVP 184
+F+VL G + D R+ DA L++ + D +++ + G V
Sbjct: 135 DFNVL--------GSIRD----RSLDADVCKFLMECCCRYGMVDKALEIFVYSTQLGVVI 194
Query: 185 HIFTCNFLLNRLIEHGKMNMALVVYKQLKRFGCQPNDYT-YATVIKALCKIGHMEEAIDI 244
+ +LN LI ++++ + +L R G +P+ + + V+ AL G + +A+D
Sbjct: 195 PQDSVYRMLNSLIGSDRVDLIADHFDKLCRGGIEPSGVSAHGFVLDALFCKGEVTKALDF 254
Query: 245 FEEMSESGVVPNAFACAAYIEGLCTHDCSTSGYQLLQAWRAEQVPIDAYAYSVVIRGFCE 304
+ E G +C ++GL + L P + + +I GFC+
Sbjct: 255 HRLVMERGFRVGIVSCNKVLKGLSVDQIEVASRLLSLVLDCGPAP-NVVTFCTLINGFCK 314
Query: 305 EMKIDEAENVFLDMEKYGVVPDALTYGVLINGYCKKLMLQKALSLHSLMLSKGVKSNCVI 364
++D A ++F ME+ G+ PD + Y LI+GY K ML L S L KGVK + V+
Sbjct: 315 RGEMDRAFDLFKVMEQRGIEPDLIAYSTLIDGYFKAGMLGMGHKLFSQALHKGVKLDVVV 374
Query: 365 ISSILQCLFRMQMYSEVVNQFKIFQGEGLFLDKVVYNIVVHALCELGKLEEAIELLEDMT 424
SS + + + +K +G+ + V Y I++ LC+ G++ EA + +
Sbjct: 375 FSSTIDVYVKSGDLATASVVYKRMLCQGISPNVVTYTILIKGLCQDGRIYEAFGMYGQIL 434
Query: 425 SRQIQMDVVHYTTMIKGFFVQGKIHEAMIMFENLKKNGIEPDTITYNVLATGLSRNGLVS 484
R ++ +V Y+++I GF G + ++E++ K G PD + Y VL GLS+ GL+
Sbjct: 435 KRGMEPSIVTYSSLIDGFCKCGNLRSGFALYEDMIKMGYPPDVVIYGVLVDGLSKQGLML 494
Query: 485 KVHGLLDYMEEHGLKEDPKIRDLVIENLCIGGKVKEATEFFNSLEV----------KTVD 544
M ++ + + + +I+ C + EA + F + + TV
Sbjct: 495 HAMRFSVKMLGQSIRLNVVVFNSLIDGWCRLNRFDEALKVFRLMGIYGIKPDVATFTTVM 554
Query: 545 NYSAMINGYCAANHTKDAYELFINLSKKGVFIKRSALVRLVSSLCMEDSSDRDI-KVMKQ 604
S M + +C +LF +L ++ A+ +V L + D K
Sbjct: 555 RVSIMEDAFCKHMKPTIGLQLF-DLMQRNKISADIAVCNVVIHLLFKCHRIEDASKFFNN 614
Query: 605 LPIRNVEANEIVYNKVIASLCRERNMKKAQCLFDFLVSSGLTPDLITYTMMINGYCNINF 664
L +E + + YN +I C R + +A+ +F+ L + P+ +T T++I+ C N
Sbjct: 615 LIEGKMEPDIVTYNTMICGYCSLRRLDEAERIFELLKVTPFGPNTVTLTILIHVLCKNND 674
Query: 665 LRKAYELLCDMKNRGREPDIFIYTILLDGQFKTRVRELCSSVEIAFTSTIFDEMKDMKFT 724
+ A + M +G +P+ Y L+D K SV+I + +F+EM++ +
Sbjct: 675 MDGAIRMFSIMAEKGSKPNAVTYGCLMDWFSK--------SVDIEGSFKLFEEMQEKGIS 734
Query: 725 PDVVYYTVMIDGYCKMNKLNDAVVLFEEMIDQGIEPDTVTYTALLSGCCRSGNTEKAQTL 779
P +V Y+++IDG CK ++++A +F + ID + PD V Y L+ G C+ G +A L
Sbjct: 735 PSIVSYSIIIDGLCKRGRVDEATNIFHQAIDAKLLPDVVAYAILIRGYCKVGRLVEAALL 780
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
XP_038876888.1 | 0.0e+00 | 87.81 | pentatricopeptide repeat-containing protein At2g26790, mitochondrial-like isofor... | [more] |
XP_038894511.1 | 0.0e+00 | 86.44 | pentatricopeptide repeat-containing protein At2g26790, mitochondrial-like isofor... | [more] |
XP_008443906.1 | 0.0e+00 | 84.42 | PREDICTED: pentatricopeptide repeat-containing protein At2g26790, mitochondrial-... | [more] |
KAA0050042.1 | 0.0e+00 | 84.17 | pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] | [more] |
XP_022987398.1 | 0.0e+00 | 84.10 | pentatricopeptide repeat-containing protein At2g26790, mitochondrial [Cucurbita ... | [more] |
Match Name | E-value | Identity | Description | |
O81028 | 2.2e-151 | 39.75 | Pentatricopeptide repeat-containing protein At2g26790, mitochondrial OS=Arabidop... | [more] |
Q9LMH5 | 7.6e-128 | 33.83 | Putative pentatricopeptide repeat-containing protein At1g13800 OS=Arabidopsis th... | [more] |
Q76C99 | 1.4e-68 | 25.85 | Protein Rf1, mitochondrial OS=Oryza sativa subsp. indica OX=39946 GN=Rf1 PE=2 SV... | [more] |
Q9LSL9 | 5.4e-65 | 25.77 | Pentatricopeptide repeat-containing protein At5g65560 OS=Arabidopsis thaliana OX... | [more] |
Q9FJE6 | 2.0e-64 | 23.05 | Putative pentatricopeptide repeat-containing protein At5g59900 OS=Arabidopsis th... | [more] |
Match Name | E-value | Identity | Description | |
A0A1S4DV07 | 0.0e+00 | 84.42 | pentatricopeptide repeat-containing protein At2g26790, mitochondrial-like isofor... | [more] |
A0A5A7U2I6 | 0.0e+00 | 84.17 | Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946... | [more] |
A0A6J1JGR8 | 0.0e+00 | 84.10 | pentatricopeptide repeat-containing protein At2g26790, mitochondrial OS=Cucurbit... | [more] |
A0A6J1E1Q9 | 0.0e+00 | 82.73 | pentatricopeptide repeat-containing protein At2g26790, mitochondrial-like OS=Cuc... | [more] |
A0A6J1EVM2 | 0.0e+00 | 82.13 | pentatricopeptide repeat-containing protein At2g26790, mitochondrial-like isofor... | [more] |
Match Name | E-value | Identity | Description | |
AT2G26790.1 | 1.6e-152 | 39.75 | Pentatricopeptide repeat (PPR) superfamily protein | [more] |
AT1G13800.1 | 5.4e-129 | 33.83 | Tetratricopeptide repeat (TPR)-like superfamily protein | [more] |
AT5G65560.1 | 3.8e-66 | 25.77 | Pentatricopeptide repeat (PPR) superfamily protein | [more] |
AT5G59900.1 | 1.5e-65 | 23.05 | Pentatricopeptide repeat (PPR) superfamily protein | [more] |
AT1G31840.2 | 1.0e-63 | 25.31 | Tetratricopeptide repeat (TPR)-like superfamily protein | [more] |