Homology
BLAST of HG10011116 vs. NCBI nr
Match:
XP_038882887.1 (pentatricopeptide repeat-containing protein At3g24000, mitochondrial isoform X7 [Benincasa hispida])
HSP 1 Score: 1732.2 bits (4485), Expect = 0.0e+00
Identity = 841/907 (92.72%), Postives = 865/907 (95.37%), Query Frame = 0
Query: 1 MYSKFGRINYARLVFDGMPERNEASWNNMMSAYVRVGSYLEAVLFFRDICGIGIKPSGFV 60
MYSKFGRINYARLVFDGMP+RNEASWNNMMS YV+VGSYLEAV FFRDICGIGIKPSGFV
Sbjct: 1 MYSKFGRINYARLVFDGMPKRNEASWNNMMSGYVQVGSYLEAVFFFRDICGIGIKPSGFV 60
Query: 61 ISSLVTACNKSSIMAKEGFQLHGFAIKCGLIYDVFVGTSFVHFYGSYGIVSNAQKMFNEM 120
I+SLVTACNKSSIMAKEGFQLHGFAIKCGLIYDVFVGTSFVHFYGSYGIVSNAQKMFNEM
Sbjct: 61 IASLVTACNKSSIMAKEGFQLHGFAIKCGLIYDVFVGTSFVHFYGSYGIVSNAQKMFNEM 120
Query: 121 PDRNVVSWTSLMVSYSDNGSKEEVINTYKRMRHEGICCNENNIALVISSCGFLVDILLGH 180
PDRNVVSWTSLMVSYSDNGSKEEVINTYKRMRHEGICCNENNIALVISSCGFLVDILLGH
Sbjct: 121 PDRNVVSWTSLMVSYSDNGSKEEVINTYKRMRHEGICCNENNIALVISSCGFLVDILLGH 180
Query: 181 QLLGHVLKFGLETKVSAANSLVSMFGGCGDINEACSIFNEMNERDTISWNSIISANAQNA 240
QLL HVLKFGL TK+SAANSL+SMFGGCGDI+EA SIF+EMNERDTISWNSIISANAQNA
Sbjct: 181 QLLAHVLKFGLLTKISAANSLISMFGGCGDIDEAFSIFSEMNERDTISWNSIISANAQNA 240
Query: 241 LHEESFRYFHWMRLVHEEINYTTLSILLSICGSVDYLKWGKGVHGLVVKYGLEPNICLCN 300
LHEESFRYFHWMRLVHEEINYTTLSILLSICGSVDYLKWGKGVHGLVVKYGLEPN+CLCN
Sbjct: 241 LHEESFRYFHWMRLVHEEINYTTLSILLSICGSVDYLKWGKGVHGLVVKYGLEPNVCLCN 300
Query: 301 TLLNMYSDAGRSEGAELIFRRMPDRDLISWNSMLACYVQDGRYLCALKFFAEMLWMKKEI 360
TLLNMYSDAGRSE AELIFRR+PDRDLISWNSMLACYVQDGR LCAL FFAEMLWMKK+I
Sbjct: 301 TLLNMYSDAGRSEDAELIFRRIPDRDLISWNSMLACYVQDGRRLCALNFFAEMLWMKKDI 360
Query: 361 NYVTFTSALAACLDPEFFTEGKILHGFVVVLGLQDDLIIGNTLITLYGKCHKMAEAKKLF 420
NYVTFTSALAACLDPEFF +GKILH FVV+LGL DDLIIGNTL+T YGKCHKMAEAKKLF
Sbjct: 361 NYVTFTSALAACLDPEFFGKGKILHAFVVILGLHDDLIIGNTLVTFYGKCHKMAEAKKLF 420
Query: 421 QRIPKLDKVSWNALIGGFADNAEPNEAVAAFKLMREGGTCGVDYITIVNILSSCLIHEDL 480
QR+PKLDKV+WNALIGGFADNAEPNEAVAAFKLMREGGTCG+DYITIVNIL SCL HEDL
Sbjct: 421 QRMPKLDKVTWNALIGGFADNAEPNEAVAAFKLMREGGTCGIDYITIVNILGSCLTHEDL 480
Query: 481 IKYGMPIHAHTVVTGFDLDQHVQSSLITMYTKCGDLHSSSYIFDNLVFKTSSVWNAIITA 540
IKYGM IHA TVVTGFDLDQHVQSSLITMY KCGDLHSSSYIFDNLVFK SSVWNAIITA
Sbjct: 481 IKYGMTIHAQTVVTGFDLDQHVQSSLITMYAKCGDLHSSSYIFDNLVFKASSVWNAIITA 540
Query: 541 NARYGFGEEALKLVGRMRTAGIEFDQFNFSTALSVAADLAMLEEGKQLHGSTIKLGFELD 600
NARYGFGEEALKLV +MR+ GIEFDQFNFSTALSVAADLAMLEEG+QLHGS IKLGFELD
Sbjct: 541 NARYGFGEEALKLVLKMRSGGIEFDQFNFSTALSVAADLAMLEEGQQLHGSAIKLGFELD 600
Query: 601 HFVINAAMDMYGKCGELDDALKILPQPTYRSRLSWNTMISIFARHGHFHKAKETFHEMLK 660
HFVINAAMDMYGKCGELDDALKILPQPT RSRLSWNTMISIFARHG+FHKAKETFHEMLK
Sbjct: 601 HFVINAAMDMYGKCGELDDALKILPQPTNRSRLSWNTMISIFARHGNFHKAKETFHEMLK 660
Query: 661 LGVKPDHVSFVCLLSACSHGGLVDEGRAYYASMTSEYGIQPGIEHCVCMIDLLGRSGRLV 720
LGVKPDHVSFVCLLSACSHGGLVDEG AYYASMTSEYGIQPGIEHCVCMIDLLGRSGRLV
Sbjct: 661 LGVKPDHVSFVCLLSACSHGGLVDEGLAYYASMTSEYGIQPGIEHCVCMIDLLGRSGRLV 720
Query: 721 EAEAFITDMSIPPNDLVWRSLLASCRIYRNLDLGRKAAEHLLELDPSDDSAYVLYSNVFA 780
EAEAFITDM IPPNDLVWRSLLASCRIYRNLDLGRKAAE LLELDPSDDSAYVLYSNVFA
Sbjct: 721 EAEAFITDMPIPPNDLVWRSLLASCRIYRNLDLGRKAAERLLELDPSDDSAYVLYSNVFA 780
Query: 781 TIGRWEDVEDVRGQMGAHKIQKKPAHSWVKWKGNIGIFGMGDQTHPQMEQINGKLLGLMK 840
TIGRWEDVEDVRGQMGAHKIQKKPAHSWVKWKGNI IFGMGDQTHPQ+EQINGKLLGLMK
Sbjct: 781 TIGRWEDVEDVRGQMGAHKIQKKPAHSWVKWKGNISIFGMGDQTHPQVEQINGKLLGLMK 840
Query: 841 MVREAGYVPDTSYSLQDTDEEQKEHNMWNHSERIAHSERIALAFGLINIPEDDFSPLFSF 900
MVREAGYVPDTSYSLQDTDEEQKEHNMWN HSERIALAFGLINIPED +F
Sbjct: 841 MVREAGYVPDTSYSLQDTDEEQKEHNMWN------HSERIALAFGLINIPEDTTVRIFKN 900
Query: 901 IFLTDRC 908
+ + C
Sbjct: 901 LRVCGDC 901
BLAST of HG10011116 vs. NCBI nr
Match:
XP_038882805.1 (pentatricopeptide repeat-containing protein At3g24000, mitochondrial isoform X1 [Benincasa hispida] >XP_038882813.1 pentatricopeptide repeat-containing protein At3g24000, mitochondrial isoform X1 [Benincasa hispida] >XP_038882820.1 pentatricopeptide repeat-containing protein At3g24000, mitochondrial isoform X1 [Benincasa hispida] >XP_038882828.1 pentatricopeptide repeat-containing protein At3g24000, mitochondrial isoform X1 [Benincasa hispida])
HSP 1 Score: 1732.2 bits (4485), Expect = 0.0e+00
Identity = 841/907 (92.72%), Postives = 865/907 (95.37%), Query Frame = 0
Query: 1 MYSKFGRINYARLVFDGMPERNEASWNNMMSAYVRVGSYLEAVLFFRDICGIGIKPSGFV 60
MYSKFGRINYARLVFDGMP+RNEASWNNMMS YV+VGSYLEAV FFRDICGIGIKPSGFV
Sbjct: 170 MYSKFGRINYARLVFDGMPKRNEASWNNMMSGYVQVGSYLEAVFFFRDICGIGIKPSGFV 229
Query: 61 ISSLVTACNKSSIMAKEGFQLHGFAIKCGLIYDVFVGTSFVHFYGSYGIVSNAQKMFNEM 120
I+SLVTACNKSSIMAKEGFQLHGFAIKCGLIYDVFVGTSFVHFYGSYGIVSNAQKMFNEM
Sbjct: 230 IASLVTACNKSSIMAKEGFQLHGFAIKCGLIYDVFVGTSFVHFYGSYGIVSNAQKMFNEM 289
Query: 121 PDRNVVSWTSLMVSYSDNGSKEEVINTYKRMRHEGICCNENNIALVISSCGFLVDILLGH 180
PDRNVVSWTSLMVSYSDNGSKEEVINTYKRMRHEGICCNENNIALVISSCGFLVDILLGH
Sbjct: 290 PDRNVVSWTSLMVSYSDNGSKEEVINTYKRMRHEGICCNENNIALVISSCGFLVDILLGH 349
Query: 181 QLLGHVLKFGLETKVSAANSLVSMFGGCGDINEACSIFNEMNERDTISWNSIISANAQNA 240
QLL HVLKFGL TK+SAANSL+SMFGGCGDI+EA SIF+EMNERDTISWNSIISANAQNA
Sbjct: 350 QLLAHVLKFGLLTKISAANSLISMFGGCGDIDEAFSIFSEMNERDTISWNSIISANAQNA 409
Query: 241 LHEESFRYFHWMRLVHEEINYTTLSILLSICGSVDYLKWGKGVHGLVVKYGLEPNICLCN 300
LHEESFRYFHWMRLVHEEINYTTLSILLSICGSVDYLKWGKGVHGLVVKYGLEPN+CLCN
Sbjct: 410 LHEESFRYFHWMRLVHEEINYTTLSILLSICGSVDYLKWGKGVHGLVVKYGLEPNVCLCN 469
Query: 301 TLLNMYSDAGRSEGAELIFRRMPDRDLISWNSMLACYVQDGRYLCALKFFAEMLWMKKEI 360
TLLNMYSDAGRSE AELIFRR+PDRDLISWNSMLACYVQDGR LCAL FFAEMLWMKK+I
Sbjct: 470 TLLNMYSDAGRSEDAELIFRRIPDRDLISWNSMLACYVQDGRRLCALNFFAEMLWMKKDI 529
Query: 361 NYVTFTSALAACLDPEFFTEGKILHGFVVVLGLQDDLIIGNTLITLYGKCHKMAEAKKLF 420
NYVTFTSALAACLDPEFF +GKILH FVV+LGL DDLIIGNTL+T YGKCHKMAEAKKLF
Sbjct: 530 NYVTFTSALAACLDPEFFGKGKILHAFVVILGLHDDLIIGNTLVTFYGKCHKMAEAKKLF 589
Query: 421 QRIPKLDKVSWNALIGGFADNAEPNEAVAAFKLMREGGTCGVDYITIVNILSSCLIHEDL 480
QR+PKLDKV+WNALIGGFADNAEPNEAVAAFKLMREGGTCG+DYITIVNIL SCL HEDL
Sbjct: 590 QRMPKLDKVTWNALIGGFADNAEPNEAVAAFKLMREGGTCGIDYITIVNILGSCLTHEDL 649
Query: 481 IKYGMPIHAHTVVTGFDLDQHVQSSLITMYTKCGDLHSSSYIFDNLVFKTSSVWNAIITA 540
IKYGM IHA TVVTGFDLDQHVQSSLITMY KCGDLHSSSYIFDNLVFK SSVWNAIITA
Sbjct: 650 IKYGMTIHAQTVVTGFDLDQHVQSSLITMYAKCGDLHSSSYIFDNLVFKASSVWNAIITA 709
Query: 541 NARYGFGEEALKLVGRMRTAGIEFDQFNFSTALSVAADLAMLEEGKQLHGSTIKLGFELD 600
NARYGFGEEALKLV +MR+ GIEFDQFNFSTALSVAADLAMLEEG+QLHGS IKLGFELD
Sbjct: 710 NARYGFGEEALKLVLKMRSGGIEFDQFNFSTALSVAADLAMLEEGQQLHGSAIKLGFELD 769
Query: 601 HFVINAAMDMYGKCGELDDALKILPQPTYRSRLSWNTMISIFARHGHFHKAKETFHEMLK 660
HFVINAAMDMYGKCGELDDALKILPQPT RSRLSWNTMISIFARHG+FHKAKETFHEMLK
Sbjct: 770 HFVINAAMDMYGKCGELDDALKILPQPTNRSRLSWNTMISIFARHGNFHKAKETFHEMLK 829
Query: 661 LGVKPDHVSFVCLLSACSHGGLVDEGRAYYASMTSEYGIQPGIEHCVCMIDLLGRSGRLV 720
LGVKPDHVSFVCLLSACSHGGLVDEG AYYASMTSEYGIQPGIEHCVCMIDLLGRSGRLV
Sbjct: 830 LGVKPDHVSFVCLLSACSHGGLVDEGLAYYASMTSEYGIQPGIEHCVCMIDLLGRSGRLV 889
Query: 721 EAEAFITDMSIPPNDLVWRSLLASCRIYRNLDLGRKAAEHLLELDPSDDSAYVLYSNVFA 780
EAEAFITDM IPPNDLVWRSLLASCRIYRNLDLGRKAAE LLELDPSDDSAYVLYSNVFA
Sbjct: 890 EAEAFITDMPIPPNDLVWRSLLASCRIYRNLDLGRKAAERLLELDPSDDSAYVLYSNVFA 949
Query: 781 TIGRWEDVEDVRGQMGAHKIQKKPAHSWVKWKGNIGIFGMGDQTHPQMEQINGKLLGLMK 840
TIGRWEDVEDVRGQMGAHKIQKKPAHSWVKWKGNI IFGMGDQTHPQ+EQINGKLLGLMK
Sbjct: 950 TIGRWEDVEDVRGQMGAHKIQKKPAHSWVKWKGNISIFGMGDQTHPQVEQINGKLLGLMK 1009
Query: 841 MVREAGYVPDTSYSLQDTDEEQKEHNMWNHSERIAHSERIALAFGLINIPEDDFSPLFSF 900
MVREAGYVPDTSYSLQDTDEEQKEHNMWN HSERIALAFGLINIPED +F
Sbjct: 1010 MVREAGYVPDTSYSLQDTDEEQKEHNMWN------HSERIALAFGLINIPEDTTVRIFKN 1069
Query: 901 IFLTDRC 908
+ + C
Sbjct: 1070 LRVCGDC 1070
BLAST of HG10011116 vs. NCBI nr
Match:
XP_038882845.1 (pentatricopeptide repeat-containing protein At3g24000, mitochondrial isoform X3 [Benincasa hispida])
HSP 1 Score: 1732.2 bits (4485), Expect = 0.0e+00
Identity = 841/907 (92.72%), Postives = 865/907 (95.37%), Query Frame = 0
Query: 1 MYSKFGRINYARLVFDGMPERNEASWNNMMSAYVRVGSYLEAVLFFRDICGIGIKPSGFV 60
MYSKFGRINYARLVFDGMP+RNEASWNNMMS YV+VGSYLEAV FFRDICGIGIKPSGFV
Sbjct: 120 MYSKFGRINYARLVFDGMPKRNEASWNNMMSGYVQVGSYLEAVFFFRDICGIGIKPSGFV 179
Query: 61 ISSLVTACNKSSIMAKEGFQLHGFAIKCGLIYDVFVGTSFVHFYGSYGIVSNAQKMFNEM 120
I+SLVTACNKSSIMAKEGFQLHGFAIKCGLIYDVFVGTSFVHFYGSYGIVSNAQKMFNEM
Sbjct: 180 IASLVTACNKSSIMAKEGFQLHGFAIKCGLIYDVFVGTSFVHFYGSYGIVSNAQKMFNEM 239
Query: 121 PDRNVVSWTSLMVSYSDNGSKEEVINTYKRMRHEGICCNENNIALVISSCGFLVDILLGH 180
PDRNVVSWTSLMVSYSDNGSKEEVINTYKRMRHEGICCNENNIALVISSCGFLVDILLGH
Sbjct: 240 PDRNVVSWTSLMVSYSDNGSKEEVINTYKRMRHEGICCNENNIALVISSCGFLVDILLGH 299
Query: 181 QLLGHVLKFGLETKVSAANSLVSMFGGCGDINEACSIFNEMNERDTISWNSIISANAQNA 240
QLL HVLKFGL TK+SAANSL+SMFGGCGDI+EA SIF+EMNERDTISWNSIISANAQNA
Sbjct: 300 QLLAHVLKFGLLTKISAANSLISMFGGCGDIDEAFSIFSEMNERDTISWNSIISANAQNA 359
Query: 241 LHEESFRYFHWMRLVHEEINYTTLSILLSICGSVDYLKWGKGVHGLVVKYGLEPNICLCN 300
LHEESFRYFHWMRLVHEEINYTTLSILLSICGSVDYLKWGKGVHGLVVKYGLEPN+CLCN
Sbjct: 360 LHEESFRYFHWMRLVHEEINYTTLSILLSICGSVDYLKWGKGVHGLVVKYGLEPNVCLCN 419
Query: 301 TLLNMYSDAGRSEGAELIFRRMPDRDLISWNSMLACYVQDGRYLCALKFFAEMLWMKKEI 360
TLLNMYSDAGRSE AELIFRR+PDRDLISWNSMLACYVQDGR LCAL FFAEMLWMKK+I
Sbjct: 420 TLLNMYSDAGRSEDAELIFRRIPDRDLISWNSMLACYVQDGRRLCALNFFAEMLWMKKDI 479
Query: 361 NYVTFTSALAACLDPEFFTEGKILHGFVVVLGLQDDLIIGNTLITLYGKCHKMAEAKKLF 420
NYVTFTSALAACLDPEFF +GKILH FVV+LGL DDLIIGNTL+T YGKCHKMAEAKKLF
Sbjct: 480 NYVTFTSALAACLDPEFFGKGKILHAFVVILGLHDDLIIGNTLVTFYGKCHKMAEAKKLF 539
Query: 421 QRIPKLDKVSWNALIGGFADNAEPNEAVAAFKLMREGGTCGVDYITIVNILSSCLIHEDL 480
QR+PKLDKV+WNALIGGFADNAEPNEAVAAFKLMREGGTCG+DYITIVNIL SCL HEDL
Sbjct: 540 QRMPKLDKVTWNALIGGFADNAEPNEAVAAFKLMREGGTCGIDYITIVNILGSCLTHEDL 599
Query: 481 IKYGMPIHAHTVVTGFDLDQHVQSSLITMYTKCGDLHSSSYIFDNLVFKTSSVWNAIITA 540
IKYGM IHA TVVTGFDLDQHVQSSLITMY KCGDLHSSSYIFDNLVFK SSVWNAIITA
Sbjct: 600 IKYGMTIHAQTVVTGFDLDQHVQSSLITMYAKCGDLHSSSYIFDNLVFKASSVWNAIITA 659
Query: 541 NARYGFGEEALKLVGRMRTAGIEFDQFNFSTALSVAADLAMLEEGKQLHGSTIKLGFELD 600
NARYGFGEEALKLV +MR+ GIEFDQFNFSTALSVAADLAMLEEG+QLHGS IKLGFELD
Sbjct: 660 NARYGFGEEALKLVLKMRSGGIEFDQFNFSTALSVAADLAMLEEGQQLHGSAIKLGFELD 719
Query: 601 HFVINAAMDMYGKCGELDDALKILPQPTYRSRLSWNTMISIFARHGHFHKAKETFHEMLK 660
HFVINAAMDMYGKCGELDDALKILPQPT RSRLSWNTMISIFARHG+FHKAKETFHEMLK
Sbjct: 720 HFVINAAMDMYGKCGELDDALKILPQPTNRSRLSWNTMISIFARHGNFHKAKETFHEMLK 779
Query: 661 LGVKPDHVSFVCLLSACSHGGLVDEGRAYYASMTSEYGIQPGIEHCVCMIDLLGRSGRLV 720
LGVKPDHVSFVCLLSACSHGGLVDEG AYYASMTSEYGIQPGIEHCVCMIDLLGRSGRLV
Sbjct: 780 LGVKPDHVSFVCLLSACSHGGLVDEGLAYYASMTSEYGIQPGIEHCVCMIDLLGRSGRLV 839
Query: 721 EAEAFITDMSIPPNDLVWRSLLASCRIYRNLDLGRKAAEHLLELDPSDDSAYVLYSNVFA 780
EAEAFITDM IPPNDLVWRSLLASCRIYRNLDLGRKAAE LLELDPSDDSAYVLYSNVFA
Sbjct: 840 EAEAFITDMPIPPNDLVWRSLLASCRIYRNLDLGRKAAERLLELDPSDDSAYVLYSNVFA 899
Query: 781 TIGRWEDVEDVRGQMGAHKIQKKPAHSWVKWKGNIGIFGMGDQTHPQMEQINGKLLGLMK 840
TIGRWEDVEDVRGQMGAHKIQKKPAHSWVKWKGNI IFGMGDQTHPQ+EQINGKLLGLMK
Sbjct: 900 TIGRWEDVEDVRGQMGAHKIQKKPAHSWVKWKGNISIFGMGDQTHPQVEQINGKLLGLMK 959
Query: 841 MVREAGYVPDTSYSLQDTDEEQKEHNMWNHSERIAHSERIALAFGLINIPEDDFSPLFSF 900
MVREAGYVPDTSYSLQDTDEEQKEHNMWN HSERIALAFGLINIPED +F
Sbjct: 960 MVREAGYVPDTSYSLQDTDEEQKEHNMWN------HSERIALAFGLINIPEDTTVRIFKN 1019
Query: 901 IFLTDRC 908
+ + C
Sbjct: 1020 LRVCGDC 1020
BLAST of HG10011116 vs. NCBI nr
Match:
XP_038882854.1 (pentatricopeptide repeat-containing protein At3g24000, mitochondrial isoform X4 [Benincasa hispida] >XP_038882863.1 pentatricopeptide repeat-containing protein At3g24000, mitochondrial isoform X4 [Benincasa hispida])
HSP 1 Score: 1732.2 bits (4485), Expect = 0.0e+00
Identity = 841/907 (92.72%), Postives = 865/907 (95.37%), Query Frame = 0
Query: 1 MYSKFGRINYARLVFDGMPERNEASWNNMMSAYVRVGSYLEAVLFFRDICGIGIKPSGFV 60
MYSKFGRINYARLVFDGMP+RNEASWNNMMS YV+VGSYLEAV FFRDICGIGIKPSGFV
Sbjct: 111 MYSKFGRINYARLVFDGMPKRNEASWNNMMSGYVQVGSYLEAVFFFRDICGIGIKPSGFV 170
Query: 61 ISSLVTACNKSSIMAKEGFQLHGFAIKCGLIYDVFVGTSFVHFYGSYGIVSNAQKMFNEM 120
I+SLVTACNKSSIMAKEGFQLHGFAIKCGLIYDVFVGTSFVHFYGSYGIVSNAQKMFNEM
Sbjct: 171 IASLVTACNKSSIMAKEGFQLHGFAIKCGLIYDVFVGTSFVHFYGSYGIVSNAQKMFNEM 230
Query: 121 PDRNVVSWTSLMVSYSDNGSKEEVINTYKRMRHEGICCNENNIALVISSCGFLVDILLGH 180
PDRNVVSWTSLMVSYSDNGSKEEVINTYKRMRHEGICCNENNIALVISSCGFLVDILLGH
Sbjct: 231 PDRNVVSWTSLMVSYSDNGSKEEVINTYKRMRHEGICCNENNIALVISSCGFLVDILLGH 290
Query: 181 QLLGHVLKFGLETKVSAANSLVSMFGGCGDINEACSIFNEMNERDTISWNSIISANAQNA 240
QLL HVLKFGL TK+SAANSL+SMFGGCGDI+EA SIF+EMNERDTISWNSIISANAQNA
Sbjct: 291 QLLAHVLKFGLLTKISAANSLISMFGGCGDIDEAFSIFSEMNERDTISWNSIISANAQNA 350
Query: 241 LHEESFRYFHWMRLVHEEINYTTLSILLSICGSVDYLKWGKGVHGLVVKYGLEPNICLCN 300
LHEESFRYFHWMRLVHEEINYTTLSILLSICGSVDYLKWGKGVHGLVVKYGLEPN+CLCN
Sbjct: 351 LHEESFRYFHWMRLVHEEINYTTLSILLSICGSVDYLKWGKGVHGLVVKYGLEPNVCLCN 410
Query: 301 TLLNMYSDAGRSEGAELIFRRMPDRDLISWNSMLACYVQDGRYLCALKFFAEMLWMKKEI 360
TLLNMYSDAGRSE AELIFRR+PDRDLISWNSMLACYVQDGR LCAL FFAEMLWMKK+I
Sbjct: 411 TLLNMYSDAGRSEDAELIFRRIPDRDLISWNSMLACYVQDGRRLCALNFFAEMLWMKKDI 470
Query: 361 NYVTFTSALAACLDPEFFTEGKILHGFVVVLGLQDDLIIGNTLITLYGKCHKMAEAKKLF 420
NYVTFTSALAACLDPEFF +GKILH FVV+LGL DDLIIGNTL+T YGKCHKMAEAKKLF
Sbjct: 471 NYVTFTSALAACLDPEFFGKGKILHAFVVILGLHDDLIIGNTLVTFYGKCHKMAEAKKLF 530
Query: 421 QRIPKLDKVSWNALIGGFADNAEPNEAVAAFKLMREGGTCGVDYITIVNILSSCLIHEDL 480
QR+PKLDKV+WNALIGGFADNAEPNEAVAAFKLMREGGTCG+DYITIVNIL SCL HEDL
Sbjct: 531 QRMPKLDKVTWNALIGGFADNAEPNEAVAAFKLMREGGTCGIDYITIVNILGSCLTHEDL 590
Query: 481 IKYGMPIHAHTVVTGFDLDQHVQSSLITMYTKCGDLHSSSYIFDNLVFKTSSVWNAIITA 540
IKYGM IHA TVVTGFDLDQHVQSSLITMY KCGDLHSSSYIFDNLVFK SSVWNAIITA
Sbjct: 591 IKYGMTIHAQTVVTGFDLDQHVQSSLITMYAKCGDLHSSSYIFDNLVFKASSVWNAIITA 650
Query: 541 NARYGFGEEALKLVGRMRTAGIEFDQFNFSTALSVAADLAMLEEGKQLHGSTIKLGFELD 600
NARYGFGEEALKLV +MR+ GIEFDQFNFSTALSVAADLAMLEEG+QLHGS IKLGFELD
Sbjct: 651 NARYGFGEEALKLVLKMRSGGIEFDQFNFSTALSVAADLAMLEEGQQLHGSAIKLGFELD 710
Query: 601 HFVINAAMDMYGKCGELDDALKILPQPTYRSRLSWNTMISIFARHGHFHKAKETFHEMLK 660
HFVINAAMDMYGKCGELDDALKILPQPT RSRLSWNTMISIFARHG+FHKAKETFHEMLK
Sbjct: 711 HFVINAAMDMYGKCGELDDALKILPQPTNRSRLSWNTMISIFARHGNFHKAKETFHEMLK 770
Query: 661 LGVKPDHVSFVCLLSACSHGGLVDEGRAYYASMTSEYGIQPGIEHCVCMIDLLGRSGRLV 720
LGVKPDHVSFVCLLSACSHGGLVDEG AYYASMTSEYGIQPGIEHCVCMIDLLGRSGRLV
Sbjct: 771 LGVKPDHVSFVCLLSACSHGGLVDEGLAYYASMTSEYGIQPGIEHCVCMIDLLGRSGRLV 830
Query: 721 EAEAFITDMSIPPNDLVWRSLLASCRIYRNLDLGRKAAEHLLELDPSDDSAYVLYSNVFA 780
EAEAFITDM IPPNDLVWRSLLASCRIYRNLDLGRKAAE LLELDPSDDSAYVLYSNVFA
Sbjct: 831 EAEAFITDMPIPPNDLVWRSLLASCRIYRNLDLGRKAAERLLELDPSDDSAYVLYSNVFA 890
Query: 781 TIGRWEDVEDVRGQMGAHKIQKKPAHSWVKWKGNIGIFGMGDQTHPQMEQINGKLLGLMK 840
TIGRWEDVEDVRGQMGAHKIQKKPAHSWVKWKGNI IFGMGDQTHPQ+EQINGKLLGLMK
Sbjct: 891 TIGRWEDVEDVRGQMGAHKIQKKPAHSWVKWKGNISIFGMGDQTHPQVEQINGKLLGLMK 950
Query: 841 MVREAGYVPDTSYSLQDTDEEQKEHNMWNHSERIAHSERIALAFGLINIPEDDFSPLFSF 900
MVREAGYVPDTSYSLQDTDEEQKEHNMWN HSERIALAFGLINIPED +F
Sbjct: 951 MVREAGYVPDTSYSLQDTDEEQKEHNMWN------HSERIALAFGLINIPEDTTVRIFKN 1010
Query: 901 IFLTDRC 908
+ + C
Sbjct: 1011 LRVCGDC 1011
BLAST of HG10011116 vs. NCBI nr
Match:
XP_038882837.1 (pentatricopeptide repeat-containing protein At3g24000, mitochondrial isoform X2 [Benincasa hispida])
HSP 1 Score: 1732.2 bits (4485), Expect = 0.0e+00
Identity = 841/907 (92.72%), Postives = 865/907 (95.37%), Query Frame = 0
Query: 1 MYSKFGRINYARLVFDGMPERNEASWNNMMSAYVRVGSYLEAVLFFRDICGIGIKPSGFV 60
MYSKFGRINYARLVFDGMP+RNEASWNNMMS YV+VGSYLEAV FFRDICGIGIKPSGFV
Sbjct: 140 MYSKFGRINYARLVFDGMPKRNEASWNNMMSGYVQVGSYLEAVFFFRDICGIGIKPSGFV 199
Query: 61 ISSLVTACNKSSIMAKEGFQLHGFAIKCGLIYDVFVGTSFVHFYGSYGIVSNAQKMFNEM 120
I+SLVTACNKSSIMAKEGFQLHGFAIKCGLIYDVFVGTSFVHFYGSYGIVSNAQKMFNEM
Sbjct: 200 IASLVTACNKSSIMAKEGFQLHGFAIKCGLIYDVFVGTSFVHFYGSYGIVSNAQKMFNEM 259
Query: 121 PDRNVVSWTSLMVSYSDNGSKEEVINTYKRMRHEGICCNENNIALVISSCGFLVDILLGH 180
PDRNVVSWTSLMVSYSDNGSKEEVINTYKRMRHEGICCNENNIALVISSCGFLVDILLGH
Sbjct: 260 PDRNVVSWTSLMVSYSDNGSKEEVINTYKRMRHEGICCNENNIALVISSCGFLVDILLGH 319
Query: 181 QLLGHVLKFGLETKVSAANSLVSMFGGCGDINEACSIFNEMNERDTISWNSIISANAQNA 240
QLL HVLKFGL TK+SAANSL+SMFGGCGDI+EA SIF+EMNERDTISWNSIISANAQNA
Sbjct: 320 QLLAHVLKFGLLTKISAANSLISMFGGCGDIDEAFSIFSEMNERDTISWNSIISANAQNA 379
Query: 241 LHEESFRYFHWMRLVHEEINYTTLSILLSICGSVDYLKWGKGVHGLVVKYGLEPNICLCN 300
LHEESFRYFHWMRLVHEEINYTTLSILLSICGSVDYLKWGKGVHGLVVKYGLEPN+CLCN
Sbjct: 380 LHEESFRYFHWMRLVHEEINYTTLSILLSICGSVDYLKWGKGVHGLVVKYGLEPNVCLCN 439
Query: 301 TLLNMYSDAGRSEGAELIFRRMPDRDLISWNSMLACYVQDGRYLCALKFFAEMLWMKKEI 360
TLLNMYSDAGRSE AELIFRR+PDRDLISWNSMLACYVQDGR LCAL FFAEMLWMKK+I
Sbjct: 440 TLLNMYSDAGRSEDAELIFRRIPDRDLISWNSMLACYVQDGRRLCALNFFAEMLWMKKDI 499
Query: 361 NYVTFTSALAACLDPEFFTEGKILHGFVVVLGLQDDLIIGNTLITLYGKCHKMAEAKKLF 420
NYVTFTSALAACLDPEFF +GKILH FVV+LGL DDLIIGNTL+T YGKCHKMAEAKKLF
Sbjct: 500 NYVTFTSALAACLDPEFFGKGKILHAFVVILGLHDDLIIGNTLVTFYGKCHKMAEAKKLF 559
Query: 421 QRIPKLDKVSWNALIGGFADNAEPNEAVAAFKLMREGGTCGVDYITIVNILSSCLIHEDL 480
QR+PKLDKV+WNALIGGFADNAEPNEAVAAFKLMREGGTCG+DYITIVNIL SCL HEDL
Sbjct: 560 QRMPKLDKVTWNALIGGFADNAEPNEAVAAFKLMREGGTCGIDYITIVNILGSCLTHEDL 619
Query: 481 IKYGMPIHAHTVVTGFDLDQHVQSSLITMYTKCGDLHSSSYIFDNLVFKTSSVWNAIITA 540
IKYGM IHA TVVTGFDLDQHVQSSLITMY KCGDLHSSSYIFDNLVFK SSVWNAIITA
Sbjct: 620 IKYGMTIHAQTVVTGFDLDQHVQSSLITMYAKCGDLHSSSYIFDNLVFKASSVWNAIITA 679
Query: 541 NARYGFGEEALKLVGRMRTAGIEFDQFNFSTALSVAADLAMLEEGKQLHGSTIKLGFELD 600
NARYGFGEEALKLV +MR+ GIEFDQFNFSTALSVAADLAMLEEG+QLHGS IKLGFELD
Sbjct: 680 NARYGFGEEALKLVLKMRSGGIEFDQFNFSTALSVAADLAMLEEGQQLHGSAIKLGFELD 739
Query: 601 HFVINAAMDMYGKCGELDDALKILPQPTYRSRLSWNTMISIFARHGHFHKAKETFHEMLK 660
HFVINAAMDMYGKCGELDDALKILPQPT RSRLSWNTMISIFARHG+FHKAKETFHEMLK
Sbjct: 740 HFVINAAMDMYGKCGELDDALKILPQPTNRSRLSWNTMISIFARHGNFHKAKETFHEMLK 799
Query: 661 LGVKPDHVSFVCLLSACSHGGLVDEGRAYYASMTSEYGIQPGIEHCVCMIDLLGRSGRLV 720
LGVKPDHVSFVCLLSACSHGGLVDEG AYYASMTSEYGIQPGIEHCVCMIDLLGRSGRLV
Sbjct: 800 LGVKPDHVSFVCLLSACSHGGLVDEGLAYYASMTSEYGIQPGIEHCVCMIDLLGRSGRLV 859
Query: 721 EAEAFITDMSIPPNDLVWRSLLASCRIYRNLDLGRKAAEHLLELDPSDDSAYVLYSNVFA 780
EAEAFITDM IPPNDLVWRSLLASCRIYRNLDLGRKAAE LLELDPSDDSAYVLYSNVFA
Sbjct: 860 EAEAFITDMPIPPNDLVWRSLLASCRIYRNLDLGRKAAERLLELDPSDDSAYVLYSNVFA 919
Query: 781 TIGRWEDVEDVRGQMGAHKIQKKPAHSWVKWKGNIGIFGMGDQTHPQMEQINGKLLGLMK 840
TIGRWEDVEDVRGQMGAHKIQKKPAHSWVKWKGNI IFGMGDQTHPQ+EQINGKLLGLMK
Sbjct: 920 TIGRWEDVEDVRGQMGAHKIQKKPAHSWVKWKGNISIFGMGDQTHPQVEQINGKLLGLMK 979
Query: 841 MVREAGYVPDTSYSLQDTDEEQKEHNMWNHSERIAHSERIALAFGLINIPEDDFSPLFSF 900
MVREAGYVPDTSYSLQDTDEEQKEHNMWN HSERIALAFGLINIPED +F
Sbjct: 980 MVREAGYVPDTSYSLQDTDEEQKEHNMWN------HSERIALAFGLINIPEDTTVRIFKN 1039
Query: 901 IFLTDRC 908
+ + C
Sbjct: 1040 LRVCGDC 1040
BLAST of HG10011116 vs. ExPASy Swiss-Prot
Match:
Q9SMZ2 (Pentatricopeptide repeat-containing protein At4g33170 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H53 PE=3 SV=1)
HSP 1 Score: 508.4 bits (1308), Expect = 1.7e-142
Identity = 297/903 (32.89%), Postives = 481/903 (53.27%), Query Frame = 0
Query: 1 MYSKFGRINYARLVFDGMPERNEASWNNMMSAYVR-----VGSYLEAVLFFRDICGIGIK 60
MYSK G + YAR VFD MP+R+ SWN++++AY + V + +A L FR + +
Sbjct: 83 MYSKCGSLTYARRVFDKMPDRDLVSWNSILAAYAQSSECVVENIQQAFLLFRILRQDVVY 142
Query: 61 PSGFVISSLVTACNKSS-IMAKEGFQLHGFAIKCGLIYDVFVGTSFVHFYGSYGIVSNAQ 120
S +S ++ C S + A E F HG+A K GL D FV + V+ Y +G V +
Sbjct: 143 TSRMTLSPMLKLCLHSGYVWASESF--HGYACKIGLDGDEFVAGALVNIYLKFGKVKEGK 202
Query: 121 KMFNEMPDRNVVSWTSLMVSYSDNGSKEEVINTYKRMRHEGICCNENNIALVISSCGFLV 180
+F EMP R+VV W ++ +Y + G KEE I+ G+ N N I L
Sbjct: 203 VLFEEMPYRDVVLWNLMLKAYLEMGFKEEAIDLSSAFHSSGL--NPNEITL--------- 262
Query: 181 DILLGHQLLGHVLKFGLETKVSAANSLVSMFGGCGDINEACSIFNEMNERDTISWNSIIS 240
+LL + GD ++A + + N D S + II
Sbjct: 263 ------RLLARI---------------------SGDDSDAGQVKSFANGNDASSVSEIIF 322
Query: 241 ANAQNALHEESFRYFHWMRLVHE------EINYTTLSILLSICGSVDYLKWGKGVHGLVV 300
N + + S +Y ++ + E + T ++L+ VD L G+ VH + +
Sbjct: 323 RNKGLSEYLHSGQYSALLKCFADMVESDVECDQVTFILMLATAVKVDSLALGQQVHCMAL 382
Query: 301 KYGLEPNICLCNTLLNMYSDAGRSEGAELIFRRMPDRDLISWNSMLACYVQDGRYLCALK 360
K GL+ + + N+L+NMY + A +F M +RDLISWNS++A Q+G + A+
Sbjct: 383 KLGLDLMLTVSNSLINMYCKLRKFGFARTVFDNMSERDLISWNSVIAGIAQNGLEVEAVC 442
Query: 361 FFAEMLWMKKEINYVTFTSAL-AACLDPEFFTEGKILHGFVVVLGLQDDLIIGNTLITLY 420
F ++L + + T TS L AA PE + K +H + + D + LI Y
Sbjct: 443 LFMQLLRCGLKPDQYTMTSVLKAASSLPEGLSLSKQVHVHAIKINNVSDSFVSTALIDAY 502
Query: 421 GKCHKMAEAKKLFQRIPKLDKVSWNALIGGFADNAEPNEAVAAFKLMREGGTCGVDYITI 480
+ M EA+ LF+R D V+WNA++ G+ + + ++ + F LM + G D+ T+
Sbjct: 503 SRNRCMKEAEILFER-HNFDLVAWNAMMAGYTQSHDGHKTLKLFALMHKQGERSDDF-TL 562
Query: 481 VNILSSCLIHEDLIKYGMPIHAHTVVTGFDLDQHVQSSLITMYTKCGDLHSSSYIFDNLV 540
+ +C I G +HA+ + +G+DLD V S ++ MY KCGD+ ++ + FD++
Sbjct: 563 ATVFKTCGF-LFAINQGKQVHAYAIKSGYDLDLWVSSGILDMYVKCGDMSAAQFAFDSIP 622
Query: 541 FKTSSVWNAIITANARYGFGEEALKLVGRMRTAGIEFDQFNFSTALSVAADLAMLEEGKQ 600
W +I+ G E A + +MR G+ D+F +T ++ L LE+G+Q
Sbjct: 623 VPDDVAWTTMISGCIENGEEERAFHVFSQMRLMGVLPDEFTIATLAKASSCLTALEQGRQ 682
Query: 601 LHGSTIKLGFELDHFVINAAMDMYGKCGELDDALKILPQPTYRSRLSWNTMISIFARHGH 660
+H + +KL D FV + +DMY KCG +DDA + + + +WN M+ A+HG
Sbjct: 683 IHANALKLNCTNDPFVGTSLVDMYAKCGSIDDAYCLFKRIEMMNITAWNAMLVGLAQHGE 742
Query: 661 FHKAKETFHEMLKLGVKPDHVSFVCLLSACSHGGLVDEGRAYYASMTSEYGIQPGIEHCV 720
+ + F +M LG+KPD V+F+ +LSACSH GLV E + SM +YGI+P IEH
Sbjct: 743 GKETLQLFKQMKSLGIKPDKVTFIGVLSACSHSGLVSEAYKHMRSMHGDYGIKPEIEHYS 802
Query: 721 CMIDLLGRSGRLVEAEAFITDMSIPPNDLVWRSLLASCRIYRNLDLGRKAAEHLLELDPS 780
C+ D LGR+G + +AE I MS+ + ++R+LLA+CR+ + + G++ A LLEL+P
Sbjct: 803 CLADALGRAGLVKQAENLIESMSMEASASMYRTLLAACRVQGDTETGKRVATKLLELEPL 862
Query: 781 DDSAYVLYSNVFATIGRWEDVEDVRGQMGAHKIQKKPAHSWVKWKGNIGIFGMGDQTHPQ 840
D SAYVL SN++A +W++++ R M HK++K P SW++ K I IF + D+++ Q
Sbjct: 863 DSSAYVLLSNMYAAASKWDEMKLARTMMKGHKVKKDPGFSWIEVKNKIHIFVVDDRSNRQ 922
Query: 841 MEQINGKLLGLMKMVREAGYVPDTSYSLQDTDEEQKEHNMWNHSERIAHSERIALAFGLI 891
E I K+ +++ +++ GYVP+T ++L D +EE+KE ++ HSE++A+AFGL+
Sbjct: 923 TELIYRKVKDMIRDIKQEGYVPETDFTLVDVEEEEKERALY------YHSEKLAVAFGLL 936
BLAST of HG10011116 vs. ExPASy Swiss-Prot
Match:
Q9SVP7 (Pentatricopeptide repeat-containing protein At4g13650 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H42 PE=2 SV=2)
HSP 1 Score: 496.5 bits (1277), Expect = 6.6e-139
Identity = 279/877 (31.81%), Postives = 464/877 (52.91%), Query Frame = 0
Query: 14 VFDGMPERNEASWNNMMSAYVRVGSYLEAVLFFRDICGIGIKPSGFVISSLVTACNKSSI 73
VFD MPER +WN M+ E F + + P+ S ++ AC S+
Sbjct: 142 VFDEMPERTIFTWNKMIKELASRNLIGEVFGLFVRMVSENVTPNEGTFSGVLEACRGGSV 201
Query: 74 MAKEGFQLHGFAIKCGLIYDVFVGTSFVHFYGSYGIVSNAQKMFNEMPDRNVVSWTSLMV 133
Q+H + GL V + Y G V A+++F+ + ++ SW +++
Sbjct: 202 AFDVVEQIHARILYQGLRDSTVVCNPLIDLYSRNGFVDLARRVFDGLRLKDHSSWVAMIS 261
Query: 134 SYSDNGSKEEVINTYKRMRHEGICCNENNIALVISSCGFLVDILLGHQLLGHVLKFGLET 193
S N + E I + M GI + V+S+C + + +G QL G VLK G +
Sbjct: 262 GLSKNECEAEAIRLFCDMYVLGIMPTPYAFSSVLSACKKIESLEIGEQLHGLVLKLGFSS 321
Query: 194 KVSAANSLVSMFGGCGDINEACSIFNEMNERDTISWNSIISANAQNALHEESFRYFHWMR 253
N+LVS++ G++ A IF+ M++RD +++N++I+ +Q E++ F M
Sbjct: 322 DTYVCNALVSLYFHLGNLISAEHIFSNMSQRDAVTYNTLINGLSQCGYGEKAMELFKRMH 381
Query: 254 LVHEEINYTTLSILLSICGSVDYLKWGKGVHGLVVKYGLEPNICLCNTLLNMYSDAGRSE 313
L E + TL+ L+ C + L G+ +H K G N + LLN+Y+ E
Sbjct: 382 LDGLEPDSNTLASLVVACSADGTLFRGQQLHAYTTKLGFASNNKIEGALLNLYAKCADIE 441
Query: 314 GAELIFRRMPDRDLISWNSMLACYVQDGRYLCALKFFAEMLWMKKEINYVTFTSALAACL 373
A F +++ WN ML Y + + F +M + N T+ S L C+
Sbjct: 442 TALDYFLETEVENVVLWNVMLVAYGLLDDLRNSFRIFRQMQIEEIVPNQYTYPSILKTCI 501
Query: 374 DPEFFTEGKILHGFVVVLGLQDDLIIGNTLITLYGKCHKMAEAKKLFQRIPKLDKVSWNA 433
G+ +H ++ Q + + + LI +Y K K+ A + R D VSW
Sbjct: 502 RLGDLELGEQIHSQIIKTNFQLNAYVCSVLIDMYAKLGKLDTAWDILIRFAGKDVVSWTT 561
Query: 434 LIGGFADNAEPNEAVAAFKLMREGGTCGVDYITIVNILSSCLIHEDLIKYGMPIHAHTVV 493
+I G+ ++A+ F+ M + G D + + N +S+C + L K G IHA V
Sbjct: 562 MIAGYTQYNFDDKALTTFRQMLDRGIRS-DEVGLTNAVSACAGLQAL-KEGQQIHAQACV 621
Query: 494 TGFDLDQHVQSSLITMYTKCGDLHSSSYIFDNLVFKTSSVWNAIITANARYGFGEEALKL 553
+GF D Q++L+T+Y++CG + S F+ + WNA+++ + G EEAL++
Sbjct: 622 SGFSSDLPFQNALVTLYSRCGKIEESYLAFEQTEAGDNIAWNALVSGFQQSGNNEEALRV 681
Query: 554 VGRMRTAGIEFDQFNFSTALSVAADLAMLEEGKQLHGSTIKLGFELDHFVINAAMDMYGK 613
RM GI+ + F F +A+ A++ A +++GKQ+H K G++ + V NA + MY K
Sbjct: 682 FVRMNREGIDNNNFTFGSAVKAASETANMKQGKQVHAVITKTGYDSETEVCNALISMYAK 741
Query: 614 CGELDDALKILPQPTYRSRLSWNTMISIFARHGHFHKAKETFHEMLKLGVKPDHVSFVCL 673
CG + DA K + + ++ +SWN +I+ +++HG +A ++F +M+ V+P+HV+ V +
Sbjct: 742 CGSISDAEKQFLEVSTKNEVSWNAIINAYSKHGFGSEALDSFDQMIHSNVRPNHVTLVGV 801
Query: 674 LSACSHGGLVDEGRAYYASMTSEYGIQPGIEHCVCMIDLLGRSGRLVEAEAFITDMSIPP 733
LSACSH GLVD+G AY+ SM SEYG+ P EH VC++D+L R+G L A+ FI +M I P
Sbjct: 802 LSACSHIGLVDKGIAYFESMNSEYGLSPKPEHYVCVVDMLTRAGLLSRAKEFIQEMPIKP 861
Query: 734 NDLVWRSLLASCRIYRNLDLGRKAAEHLLELDPSDDSAYVLYSNVFATIGRWEDVEDVRG 793
+ LVWR+LL++C +++N+++G AA HLLEL+P D + YVL SN++A +W+ + R
Sbjct: 862 DALVWRTLLSACVVHKNMEIGEFAAHHLLELEPEDSATYVLLSNLYAVSKKWDARDLTRQ 921
Query: 794 QMGAHKIQKKPAHSWVKWKGNIGIFGMGDQTHPQMEQINGKLLGLMKMVREAGYVPDTSY 853
+M ++K+P SW++ K +I F +GDQ HP ++I+ L K E GYV D
Sbjct: 922 KMKEKGVKKEPGQSWIEVKNSIHSFYVGDQNHPLADEIHEYFQDLTKRASEIGYVQDCFS 981
Query: 854 SLQDTDEEQKEHNMWNHSERIAHSERIALAFGLINIP 891
L + EQK+ ++ HSE++A++FGL+++P
Sbjct: 982 LLNELQHEQKDPIIF------IHSEKLAISFGLLSLP 1010
BLAST of HG10011116 vs. ExPASy Swiss-Prot
Match:
Q9M1V3 (Pentatricopeptide repeat-containing protein At3g63370, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H83 PE=2 SV=2)
HSP 1 Score: 487.3 bits (1253), Expect = 4.0e-136
Identity = 278/828 (33.57%), Postives = 451/828 (54.47%), Query Frame = 0
Query: 68 CNKSSIMAKEGFQLHGFAIKCGLIYDV-FVGTSFVHFYGSYGIVSNAQKMFNEMPDRNVV 127
C K ++ +G QLH K +++ F+ V YG G + +A+K+F+EMPDR
Sbjct: 90 CGKRRAVS-QGRQLHSRIFKTFPSFELDFLAGKLVFMYGKCGSLDDAEKVFDEMPDRTAF 149
Query: 128 SWTSLMVSYSDNGSKEEVINTYKRMRHEGICCNENNIALVISSCGFLVDILLGHQLLGHV 187
+W +++ +Y NG + Y MR EG+ ++ ++ +C L DI G +L +
Sbjct: 150 AWNTMIGAYVSNGEPASALALYWNMRVEGVPLGLSSFPALLKACAKLRDIRSGSELHSLL 209
Query: 188 LKFGLETKVSAANSLVSMFGGCGDINEACSIFNEMNER-DTISWNSIISANAQNALHEES 247
+K G + N+LVSM+ D++ A +F+ E+ D + WNSI+S+ + + E+
Sbjct: 210 VKLGYHSTGFIVNALVSMYAKNDDLSAARRLFDGFQEKGDAVLWNSILSSYSTSGKSLET 269
Query: 248 FRYFHWMRLVHEEINYTTLSILLSICGSVDYLKWGKGVHGLVVKYGLEPN-ICLCNTLLN 307
F M + N T+ L+ C Y K GK +H V+K + + +CN L+
Sbjct: 270 LELFREMHMTGPAPNSYTIVSALTACDGFSYAKLGKEIHASVLKSSTHSSELYVCNALIA 329
Query: 308 MYSDAGRSEGAELIFRRMPDRDLISWNSMLACYVQDGRYLCALKFFAEMLWMKKEINYVT 367
MY+ G+ AE I R+M + D+++WNS++ YVQ+ Y AL+FF++M+ + + V+
Sbjct: 330 MYTRCGKMPQAERILRQMNNADVVTWNSLIKGYVQNLMYKEALEFFSDMIAAGHKSDEVS 389
Query: 368 FTSALAACLDPEFFTEGKILHGFVVVLGLQDDLIIGNTLITLYGKCHKMAEAKKLFQRIP 427
TS +AA G LH +V+ G +L +GNTLI +Y KC+ + F R+
Sbjct: 390 MTSIIAASGRLSNLLAGMELHAYVIKHGWDSNLQVGNTLIDMYSKCNLTCYMGRAFLRMH 449
Query: 428 KLDKVSWNALIGGFADNAEPNEAVAAFKLMREGGTCGVDYITIVNILSSCLIHEDLIKYG 487
D +SW +I G+A N EA+ F+ + + +D + + +IL + + + ++
Sbjct: 450 DKDLISWTTVIAGYAQNDCHVEALELFRDVAK-KRMEIDEMILGSILRASSVLKSML-IV 509
Query: 488 MPIHAHTVVTGFDLDQHVQSSLITMYTKCGDLHSSSYIFDNLVFKTSSVWNAIITANARY 547
IH H + G LD +Q+ L+ +Y KC ++ ++ +F+++ K W ++I+++A
Sbjct: 510 KEIHCHILRKGL-LDTVIQNELVDVYGKCRNMGYATRVFESIKGKDVVSWTSMISSSALN 569
Query: 548 GFGEEALKLVGRMRTAGIEFDQFNFSTALSVAADLAMLEEGKQLHGSTIKLGFELDHFVI 607
G EA++L RM G+ D LS AA L+ L +G+++H ++ GF L+ +
Sbjct: 570 GNESEAVELFRRMVETGLSADSVALLCILSAAASLSALNKGREIHCYLLRKGFCLEGSIA 629
Query: 608 NAAMDMYGKCGELDDALKILPQPTYRSRLSWNTMISIFARHGHFHKAKETFHEMLKLGVK 667
A +DMY CG+L A + + + L + +MI+ + HG A E F +M V
Sbjct: 630 VAVVDMYACCGDLQSAKAVFDRIERKGLLQYTSMINAYGMHGCGKAAVELFDKMRHENVS 689
Query: 668 PDHVSFVCLLSACSHGGLVDEGRAYYASMTSEYGIQPGIEHCVCMIDLLGRSGRLVEAEA 727
PDH+SF+ LL ACSH GL+DEGR + M EY ++P EH VC++D+LGR+ +VEA
Sbjct: 690 PDHISFLALLYACSHAGLLDEGRGFLKIMEHEYELEPWPEHYVCLVDMLGRANCVVEAFE 749
Query: 728 FITDMSIPPNDLVWRSLLASCRIYRNLDLGRKAAEHLLELDPSDDSAYVLYSNVFATIGR 787
F+ M P VW +LLA+CR + ++G AA+ LLEL+P + VL SNVFA GR
Sbjct: 750 FVKMMKTEPTAEVWCALLAACRSHSEKEIGEIAAQRLLELEPKNPGNLVLVSNVFAEQGR 809
Query: 788 WEDVEDVRGQMGAHKIQKKPAHSWVKWKGNIGIFGMGDQTHPQMEQINGKLLGL-MKMVR 847
W DVE VR +M A ++K P SW++ G + F D++HP+ ++I KL + K+ R
Sbjct: 810 WNDVEKVRAKMKASGMEKHPGCSWIEMDGKVHKFTARDKSHPESKEIYEKLSEVTRKLER 869
Query: 848 EAGYVPDTSYSLQDTDEEQKEHNMWNHSERIAHSERIALAFGLINIPE 892
E GYV DT + L + DE +K + HSERIA+A+GL+ P+
Sbjct: 870 EVGYVADTKFVLHNVDEGEKVQMLH------GHSERIAIAYGLLRTPD 907
BLAST of HG10011116 vs. ExPASy Swiss-Prot
Match:
Q9FIB2 (Putative pentatricopeptide repeat-containing protein At5g09950 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H35 PE=3 SV=1)
HSP 1 Score: 486.9 bits (1252), Expect = 5.2e-136
Identity = 297/893 (33.26%), Postives = 472/893 (52.86%), Query Frame = 0
Query: 2 YSKFGRINYARLVFDGMPERNEASWNNMMSAYVRVGSYLEAVLFFRDICGIGIKPSGFVI 61
Y + G AR VFD MP RN SW ++S Y R G + EA++F RD+ GI + +
Sbjct: 46 YLETGDSVSARKVFDEMPLRNCVSWACIVSGYSRNGEHKEALVFLRDMVKEGIFSNQYAF 105
Query: 62 SSLVTACNK-SSIMAKEGFQLHGFAIKCGLIYDVFVGTSFVHFYGS-YGIVSNAQKMFNE 121
S++ AC + S+ G Q+HG K D V + Y G V A F +
Sbjct: 106 VSVLRACQEIGSVGILFGRQIHGLMFKLSYAVDAVVSNVLISMYWKCIGSVGYALCAFGD 165
Query: 122 MPDRNVVSWTSLMVSYSDNGSKEEVINTYKRMRHEGICCNENNI-ALVISSCGFL-VDIL 181
+ +N VSW S++ YS G + + M+++G E +LV ++C D+
Sbjct: 166 IEVKNSVSWNSIISVYSQAGDQRSAFRIFSSMQYDGSRPTEYTFGSLVTTACSLTEPDVR 225
Query: 182 LGHQLLGHVLKFGLETKVSAANSLVSMFGGCGDINEACSIFNEMNERDTISWNSIISANA 241
L Q++ + K GL T + + LVS F G ++ A +FN+M R+ ++ N ++
Sbjct: 226 LLEQIMCTIQKSGLLTDLFVGSGLVSAFAKSGSLSYARKVFNQMETRNAVTLNGLMVGLV 285
Query: 242 QNALHEESFRYFHWMRLVHEEINYTTLSILLSICGSVDY-------LKWGKGVHGLVVKY 301
+ EE+ + F M + I+ + S ++ + +Y LK G+ VHG V+
Sbjct: 286 RQKWGEEATKLFMDM---NSMIDVSPESYVILLSSFPEYSLAEEVGLKKGREVHGHVITT 345
Query: 302 GL-EPNICLCNTLLNMYSDAGRSEGAELIFRRMPDRDLISWNSMLACYVQDGRYLCALKF 361
GL + + + N L+NMY+ G A +F M D+D +SWNSM+ Q+G ++ A++
Sbjct: 346 GLVDFMVGIGNGLVNMYAKCGSIADARRVFYFMTDKDSVSWNSMITGLDQNGCFIEAVER 405
Query: 362 FAEMLWMKKEINYVTFTSALAACLDPEFFTEGKILHGFVVVLGLQDDLIIGNTLITLYGK 421
+ M T S+L++C ++ G+ +HG + LG+ ++ + N L+TLY +
Sbjct: 406 YKSMRRHDILPGSFTLISSLSSCASLKWAKLGQQIHGESLKLGIDLNVSVSNALMTLYAE 465
Query: 422 CHKMAEAKKLFQRIPKLDKVSWNALIGGFA--DNAEPNEAVAAFKLMREGGTCG-VDYIT 481
+ E +K+F +P+ D+VSWN++IG A + + P V R G + + +
Sbjct: 466 TGYLNECRKIFSSMPEHDQVSWNSIIGALARSERSLPEAVVCFLNAQRAGQKLNRITFSS 525
Query: 482 IVNILSSCLIHEDLIKYGMPIHAHTVVTGFDLDQHVQSSLITMYTKCGDLHSSSYIFDNL 541
+++ +SS E G IH + + +++LI Y KCG++ IF +
Sbjct: 526 VLSAVSSLSFGE----LGKQIHGLALKNNIADEATTENALIACYGKCGEMDGCEKIFSRM 585
Query: 542 VFKTSSV-WNAIITANARYGFGEEALKLVGRMRTAGIEFDQFNFSTALSVAADLAMLEEG 601
+ +V WN++I+ +AL LV M G D F ++T LS A +A LE G
Sbjct: 586 AERRDNVTWNSMISGYIHNELLAKALDLVWFMLQTGQRLDSFMYATVLSAFASVATLERG 645
Query: 602 KQLHGSTIKLGFELDHFVINAAMDMYGKCGELDDALKILPQPTYRSRLSWNTMISIFARH 661
++H +++ E D V +A +DMY KCG LD AL+ R+ SWN+MIS +ARH
Sbjct: 646 MEVHACSVRACLESDVVVGSALVDMYSKCGRLDYALRFFNTMPVRNSYSWNSMISGYARH 705
Query: 662 GHFHKAKETFHEMLKLG-VKPDHVSFVCLLSACSHGGLVDEGRAYYASMTSEYGIQPGIE 721
G +A + F M G PDHV+FV +LSACSH GL++EG ++ SM+ YG+ P IE
Sbjct: 706 GQGEEALKLFETMKLDGQTPPDHVTFVGVLSACSHAGLLEEGFKHFESMSDSYGLAPRIE 765
Query: 722 HCVCMIDLLGRSGRLVEAEAFITDMSIPPNDLVWRSLLASC--RIYRNLDLGRKAAEHLL 781
H CM D+LGR+G L + E FI M + PN L+WR++L +C R +LG+KAAE L
Sbjct: 766 HFSCMADVLGRAGELDKLEDFIEKMPMKPNVLIWRTVLGACCRANGRKAELGKKAAEMLF 825
Query: 782 ELDPSDDSAYVLYSNVFATIGRWEDVEDVRGQMGAHKIQKKPAHSWVKWKGNIGIFGMGD 841
+L+P + YVL N++A GRWED+ R +M ++K+ +SWV K + +F GD
Sbjct: 826 QLEPENAVNYVLLGNMYAAGGRWEDLVKARKKMKDADVKKEAGYSWVTMKDGVHMFVAGD 885
Query: 842 QTHPQMEQINGKLLGLMKMVREAGYVPDTSYSLQDTDEEQKEHNMWNHSERIA 876
++HP + I KL L + +R+AGYVP T ++L D ++E KE + HSE++A
Sbjct: 886 KSHPDADVIYKKLKELNRKMRDAGYVPQTGFALYDLEQENKEEILSYHSEKLA 931
BLAST of HG10011116 vs. ExPASy Swiss-Prot
Match:
Q9SS60 (Pentatricopeptide repeat-containing protein At3g03580 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H23 PE=2 SV=1)
HSP 1 Score: 456.8 bits (1174), Expect = 5.8e-127
Identity = 255/830 (30.72%), Postives = 439/830 (52.89%), Query Frame = 0
Query: 60 VISSLVTACNKSSIMAKEGFQLHGFAIKCGLIYDVFVGTSFVHFYGSYGIVSNAQKMFNE 119
V S ++ SS E ++H I GL F + Y + +++ +F
Sbjct: 5 VSSPFISRALSSSSNLNELRRIHALVISLGLDSSDFFSGKLIDKYSHFREPASSLSVFRR 64
Query: 120 M-PDRNVVSWTSLMVSYSDNGSKEEVINTYKRMRHEGICCNENNIALVISSCGFLVDILL 179
+ P +NV W S++ ++S NG E + Y ++R + ++ VI +C L D +
Sbjct: 65 VSPAKNVYLWNSIIRAFSKNGLFPEALEFYGKLRESKVSPDKYTFPSVIKACAGLFDAEM 124
Query: 180 GHQLLGHVLKFGLETKVSAANSLVSMFGGCGDINEACSIFNEMNERDTISWNSIISANAQ 239
G + +L G E+ + N+LV M+ G + A +F+EM RD +SWNS+IS +
Sbjct: 125 GDLVYEQILDMGFESDLFVGNALVDMYSRMGLLTRARQVFDEMPVRDLVSWNSLISGYSS 184
Query: 240 NALHEESFRYFHWMRLVHEEINYTTLSILLSICGSVDYLKWGKGVHGLVVKYGLEPNICL 299
+ +EE+ +H ++ + T+S +L G++ +K G+G+HG +K G+ + +
Sbjct: 185 HGYYEEALEIYHELKNSWIVPDSFTVSSVLPAFGNLLVVKQGQGLHGFALKSGVNSVVVV 244
Query: 300 CNTLLNMYSDAGRSEGAELIFRRMPDRDLISWNSMLACYVQDGRYLCALKFFAEMLWMKK 359
N L+ MY R A +F M RD +S+N+M+ Y++ +++ F E L K
Sbjct: 245 NNGLVAMYLKFRRPTDARRVFDEMDVRDSVSYNTMICGYLKLEMVEESVRMFLENLDQFK 304
Query: 360 EINYVTFTSALAACLDPEFFTEGKILHGFVVVLGLQDDLIIGNTLITLYGKCHKMAEAKK 419
+ +T +S L AC + K ++ +++ G + + N LI +Y KC M A+
Sbjct: 305 P-DLLTVSSVLRACGHLRDLSLAKYIYNYMLKAGFVLESTVRNILIDVYAKCGDMITARD 364
Query: 420 LFQRIPKLDKVSWNALIGGFADNAEPNEAVAAFKLMREGGTCGVDYITIVNILSSCLIHE 479
+F + D VSWN++I G+ + + EA+ FK+M D+IT + ++S
Sbjct: 365 VFNSMECKDTVSWNSIISGYIQSGDLMEAMKLFKMMMIMEE-QADHITYLMLISVSTRLA 424
Query: 480 DLIKYGMPIHAHTVVTGFDLDQHVQSSLITMYTKCGDLHSSSYIFDNLVFKTSSVWNAII 539
DL K+G +H++ + +G +D V ++LI MY KCG++ S IF ++ + WN +I
Sbjct: 425 DL-KFGKGLHSNGIKSGICIDLSVSNALIDMYAKCGEVGDSLKIFSSMGTGDTVTWNTVI 484
Query: 540 TANARYGFGEEALKLVGRMRTAGIEFDQFNFSTALSVAADLAMLEEGKQLHGSTIKLGFE 599
+A R+G L++ +MR + + D F L + A LA GK++H ++ G+E
Sbjct: 485 SACVRFGDFATGLQVTTQMRKSEVVPDMATFLVTLPMCASLAAKRLGKEIHCCLLRFGYE 544
Query: 600 LDHFVINAAMDMYGKCGELDDALKILPQPTYRSRLSWNTMISIFARHGHFHKAKETFHEM 659
+ + NA ++MY KCG L+++ ++ + + R ++W MI + +G KA ETF +M
Sbjct: 545 SELQIGNALIEMYSKCGCLENSSRVFERMSRRDVVTWTGMIYAYGMYGEGEKALETFADM 604
Query: 660 LKLGVKPDHVSFVCLLSACSHGGLVDEGRAYYASMTSEYGIQPGIEHCVCMIDLLGRSGR 719
K G+ PD V F+ ++ ACSH GLVDEG A + M + Y I P IEH C++DLL RS +
Sbjct: 605 EKSGIVPDSVVFIAIIYACSHSGLVDEGLACFEKMKTHYKIDPMIEHYACVVDLLSRSQK 664
Query: 720 LVEAEAFITDMSIPPNDLVWRSLLASCRIYRNLDLGRKAAEHLLELDPSDDSAYVLYSNV 779
+ +AE FI M I P+ +W S+L +CR +++ + + ++EL+P D +L SN
Sbjct: 665 ISKAEEFIQAMPIKPDASIWASVLRACRTSGDMETAERVSRRIIELNPDDPGYSILASNA 724
Query: 780 FATIGRWEDVEDVRGQMGAHKIQKKPAHSWVKWKGNIGIFGMGDQTHPQMEQINGKLLGL 839
+A + +W+ V +R + I K P +SW++ N+ +F GD + PQ E I L L
Sbjct: 725 YAALRKWDKVSLIRKSLKDKHITKNPGYSWIEVGKNVHVFSSGDDSAPQSEAIYKSLEIL 784
Query: 840 MKMVREAGYVPDTSYSLQDTDEEQKEHNMWNHSERIAHSERIALAFGLIN 889
++ + GY+PD Q+ +EE+++ + HSER+A+AFGL+N
Sbjct: 785 YSLMAKEGYIPDPREVSQNLEEEEEKRRL-----ICGHSERLAIAFGLLN 826
BLAST of HG10011116 vs. ExPASy TrEMBL
Match:
A0A0A0LAC1 (DYW_deaminase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_3G215600 PE=3 SV=1)
HSP 1 Score: 1698.7 bits (4398), Expect = 0.0e+00
Identity = 825/907 (90.96%), Postives = 858/907 (94.60%), Query Frame = 0
Query: 1 MYSKFGRINYARLVFDGMPERNEASWNNMMSAYVRVGSYLEAVLFFRDICGIGIKPSGFV 60
MYSKFGRINYA+LVFD M ERNEASWN+MMS YVRVGSY+EAVLFFRDICGIGIKPSGF+
Sbjct: 143 MYSKFGRINYAQLVFDRMSERNEASWNHMMSGYVRVGSYVEAVLFFRDICGIGIKPSGFM 202
Query: 61 ISSLVTACNKSSIMAKEGFQLHGFAIKCGLIYDVFVGTSFVHFYGSYGIVSNAQKMFNEM 120
I+SLVTACNKSSIMAKEGFQ HGFAIKCGLIYDVFVGTSFVHFY SYGIVSNAQKMFNEM
Sbjct: 203 IASLVTACNKSSIMAKEGFQFHGFAIKCGLIYDVFVGTSFVHFYASYGIVSNAQKMFNEM 262
Query: 121 PDRNVVSWTSLMVSYSDNGSKEEVINTYKRMRHEGICCNENNIALVISSCGFLVDILLGH 180
PDRNVVSWTSLMVSYSDNGSK+EVINTYKRMRHEGICCNENNIALVISSCGFL+DI+LGH
Sbjct: 263 PDRNVVSWTSLMVSYSDNGSKKEVINTYKRMRHEGICCNENNIALVISSCGFLMDIILGH 322
Query: 181 QLLGHVLKFGLETKVSAANSLVSMFGGCGDINEACSIFNEMNERDTISWNSIISANAQNA 240
QLLGH LKFGLETKVSAANSL+ MFGGCGDINEACSIFNEMNERDTISWNSIISANAQN
Sbjct: 323 QLLGHALKFGLETKVSAANSLIFMFGGCGDINEACSIFNEMNERDTISWNSIISANAQNT 382
Query: 241 LHEESFRYFHWMRLVHEEINYTTLSILLSICGSVDYLKWGKGVHGLVVKYGLEPNICLCN 300
LHEESFRYFHWMRLVHEEINYTTLSILLSICGSVDYLKWGKGVHGL VKYGLE NICLCN
Sbjct: 383 LHEESFRYFHWMRLVHEEINYTTLSILLSICGSVDYLKWGKGVHGLAVKYGLESNICLCN 442
Query: 301 TLLNMYSDAGRSEGAELIFRRMPDRDLISWNSMLACYVQDGRYLCALKFFAEMLWMKKEI 360
TLL++YSDAGRS+ AELIFRRMP+RDLISWNSMLACYVQDGR LCALK FAEMLWMKKEI
Sbjct: 443 TLLSVYSDAGRSKDAELIFRRMPERDLISWNSMLACYVQDGRCLCALKVFAEMLWMKKEI 502
Query: 361 NYVTFTSALAACLDPEFFTEGKILHGFVVVLGLQDDLIIGNTLITLYGKCHKMAEAKKLF 420
NYVTFTSALAACLDPEFFT GKILHGFVVVLGLQD+LIIGNTLIT YGKCHKMAEAKK+F
Sbjct: 503 NYVTFTSALAACLDPEFFTNGKILHGFVVVLGLQDELIIGNTLITFYGKCHKMAEAKKVF 562
Query: 421 QRIPKLDKVSWNALIGGFADNAEPNEAVAAFKLMREGGTCGVDYITIVNILSSCLIHEDL 480
QR+PKLDKV+WNALIGGFA+NAE NEAVAAFKLMREG T GVDYITIVNIL SCL HEDL
Sbjct: 563 QRMPKLDKVTWNALIGGFANNAELNEAVAAFKLMREGSTSGVDYITIVNILGSCLTHEDL 622
Query: 481 IKYGMPIHAHTVVTGFDLDQHVQSSLITMYTKCGDLHSSSYIFDNLVFKTSSVWNAIITA 540
IKYG+PIHAHTVVTGFDLDQHVQSSLITMY KCGDLHSSSYIFD LVFKTSSVWNAII A
Sbjct: 623 IKYGIPIHAHTVVTGFDLDQHVQSSLITMYAKCGDLHSSSYIFDQLVFKTSSVWNAIIAA 682
Query: 541 NARYGFGEEALKLVGRMRTAGIEFDQFNFSTALSVAADLAMLEEGKQLHGSTIKLGFELD 600
NARYGFGEEALKLV RMR+AGIEFDQFNFSTALSVAADLAMLEEG+QLHGSTIKLGFELD
Sbjct: 683 NARYGFGEEALKLVVRMRSAGIEFDQFNFSTALSVAADLAMLEEGQQLHGSTIKLGFELD 742
Query: 601 HFVINAAMDMYGKCGELDDALKILPQPTYRSRLSWNTMISIFARHGHFHKAKETFHEMLK 660
HF+INAAMDMYGKCGELDDAL+ILPQPT RSRLSWNT+ISI ARHG FHKAKETFH+MLK
Sbjct: 743 HFIINAAMDMYGKCGELDDALRILPQPTDRSRLSWNTLISISARHGQFHKAKETFHDMLK 802
Query: 661 LGVKPDHVSFVCLLSACSHGGLVDEGRAYYASMTSEYGIQPGIEHCVCMIDLLGRSGRLV 720
LGVKP+HVSFVCLLSACSHGGLVDEG AYYASMTS YGIQPGIEHCVCMIDLLGRSGRLV
Sbjct: 803 LGVKPNHVSFVCLLSACSHGGLVDEGLAYYASMTSVYGIQPGIEHCVCMIDLLGRSGRLV 862
Query: 721 EAEAFITDMSIPPNDLVWRSLLASCRIYRNLDLGRKAAEHLLELDPSDDSAYVLYSNVFA 780
EAEAFIT+M IPPNDLVWRSLLASCRIYRNLDLGRKAA+HLLELDPSDDSAYVLYSNVFA
Sbjct: 863 EAEAFITEMPIPPNDLVWRSLLASCRIYRNLDLGRKAAKHLLELDPSDDSAYVLYSNVFA 922
Query: 781 TIGRWEDVEDVRGQMGAHKIQKKPAHSWVKWKGNIGIFGMGDQTHPQMEQINGKLLGLMK 840
TIGRWEDVEDVRGQMGAHKIQKKPAHSWVKWKGNI IFGMGDQTHPQMEQINGKLLGLMK
Sbjct: 923 TIGRWEDVEDVRGQMGAHKIQKKPAHSWVKWKGNISIFGMGDQTHPQMEQINGKLLGLMK 982
Query: 841 MVREAGYVPDTSYSLQDTDEEQKEHNMWNHSERIAHSERIALAFGLINIPEDDFSPLFSF 900
+V EAGYVPDTSYSLQDTDEEQKEHNMW +HSERIALAFGLINIPE +F
Sbjct: 983 IVGEAGYVPDTSYSLQDTDEEQKEHNMW------SHSERIALAFGLINIPEGSTVRIFKN 1042
Query: 901 IFLTDRC 908
+ + C
Sbjct: 1043 LRVCGDC 1043
BLAST of HG10011116 vs. ExPASy TrEMBL
Match:
A0A1S4E120 (pentatricopeptide repeat-containing protein At3g24000, mitochondrial isoform X1 OS=Cucumis melo OX=3656 GN=LOC103496132 PE=3 SV=1)
HSP 1 Score: 1691.0 bits (4378), Expect = 0.0e+00
Identity = 822/907 (90.63%), Postives = 857/907 (94.49%), Query Frame = 0
Query: 1 MYSKFGRINYARLVFDGMPERNEASWNNMMSAYVRVGSYLEAVLFFRDICGIGIKPSGFV 60
MYSKFGRINYA+LVFD M ERNEASWN+MMS YVRVGSY+EAVLFFRDICGIGIKPSGF+
Sbjct: 1 MYSKFGRINYAQLVFDRMSERNEASWNHMMSGYVRVGSYVEAVLFFRDICGIGIKPSGFM 60
Query: 61 ISSLVTACNKSSIMAKEGFQLHGFAIKCGLIYDVFVGTSFVHFYGSYGIVSNAQKMFNEM 120
I+SLVTACNKSSIMAKEGFQ HGFAIKCGLIYDVFVGTSFVHFY SYGIVSNAQKMFNEM
Sbjct: 61 IASLVTACNKSSIMAKEGFQFHGFAIKCGLIYDVFVGTSFVHFYASYGIVSNAQKMFNEM 120
Query: 121 PDRNVVSWTSLMVSYSDNGSKEEVINTYKRMRHEGICCNENNIALVISSCGFLVDILLGH 180
PDRNVVSWTSLMVSYSDNGSK+EVINTYKRMR EGICCNENNIALVISSCGFLVDI+LG
Sbjct: 121 PDRNVVSWTSLMVSYSDNGSKKEVINTYKRMRLEGICCNENNIALVISSCGFLVDIILGR 180
Query: 181 QLLGHVLKFGLETKVSAANSLVSMFGGCGDINEACSIFNEMNERDTISWNSIISANAQNA 240
QLLGH LKFGLETKVSAANSLV MFGGCGD++EACSIFNEMNERDTISWNSIISANAQNA
Sbjct: 181 QLLGHALKFGLETKVSAANSLVFMFGGCGDVDEACSIFNEMNERDTISWNSIISANAQNA 240
Query: 241 LHEESFRYFHWMRLVHEEINYTTLSILLSICGSVDYLKWGKGVHGLVVKYGLEPNICLCN 300
LHEESFRYFHWMRLVHEE+NYTTLSILLSICGSVDYLKWGKGVHGL VKYGLE NICLCN
Sbjct: 241 LHEESFRYFHWMRLVHEEMNYTTLSILLSICGSVDYLKWGKGVHGLAVKYGLESNICLCN 300
Query: 301 TLLNMYSDAGRSEGAELIFRRMPDRDLISWNSMLACYVQDGRYLCALKFFAEMLWMKKEI 360
TLL+MYSDAGRS+ AELIFRRMP+RDL+SWNSMLACYVQDGR LCALK FAEMLWMKKEI
Sbjct: 301 TLLSMYSDAGRSKDAELIFRRMPERDLVSWNSMLACYVQDGRCLCALKVFAEMLWMKKEI 360
Query: 361 NYVTFTSALAACLDPEFFTEGKILHGFVVVLGLQDDLIIGNTLITLYGKCHKMAEAKKLF 420
NYVTFTSALAACLDPEFFTEGKILHGFVVVLGLQD+LIIGNTLIT YGKC KM+EAKKLF
Sbjct: 361 NYVTFTSALAACLDPEFFTEGKILHGFVVVLGLQDELIIGNTLITFYGKCQKMSEAKKLF 420
Query: 421 QRIPKLDKVSWNALIGGFADNAEPNEAVAAFKLMREGGTCGVDYITIVNILSSCLIHEDL 480
QR+PKLDKV+WNALIGGFA+NAE NEAVAAFKLMREGGTCGVDYITIVNIL SCL EDL
Sbjct: 421 QRMPKLDKVTWNALIGGFANNAELNEAVAAFKLMREGGTCGVDYITIVNILGSCLTREDL 480
Query: 481 IKYGMPIHAHTVVTGFDLDQHVQSSLITMYTKCGDLHSSSYIFDNLVFKTSSVWNAIITA 540
IKYG+PIHAHTVVTGFDLDQHVQSSLITMY KCGDL SSSYIFD LVFKTSSVWNAII A
Sbjct: 481 IKYGIPIHAHTVVTGFDLDQHVQSSLITMYAKCGDLQSSSYIFDQLVFKTSSVWNAIIAA 540
Query: 541 NARYGFGEEALKLVGRMRTAGIEFDQFNFSTALSVAADLAMLEEGKQLHGSTIKLGFELD 600
NARYGFGEEALKLV RMR+AGIEFDQFNFST+LSVAADLAMLEEG+QLHGSTIKLGFELD
Sbjct: 541 NARYGFGEEALKLVVRMRSAGIEFDQFNFSTSLSVAADLAMLEEGQQLHGSTIKLGFELD 600
Query: 601 HFVINAAMDMYGKCGELDDALKILPQPTYRSRLSWNTMISIFARHGHFHKAKETFHEMLK 660
HF+ NAAMDMYGKCGELDDAL+ILPQPT RSRLSWNTMISIFARHGHF KAKETFHEMLK
Sbjct: 601 HFITNAAMDMYGKCGELDDALRILPQPTDRSRLSWNTMISIFARHGHFRKAKETFHEMLK 660
Query: 661 LGVKPDHVSFVCLLSACSHGGLVDEGRAYYASMTSEYGIQPGIEHCVCMIDLLGRSGRLV 720
LGVKP+HVSFVCLLSAC+HGGLV+EG AYYASMTSEYGIQPGIEHCVCMIDLLGRSGRLV
Sbjct: 661 LGVKPNHVSFVCLLSACNHGGLVEEGLAYYASMTSEYGIQPGIEHCVCMIDLLGRSGRLV 720
Query: 721 EAEAFITDMSIPPNDLVWRSLLASCRIYRNLDLGRKAAEHLLELDPSDDSAYVLYSNVFA 780
EAEAFITDM IPPNDLVWRSLLASCRIYRNLDLGRKAA+HLLELDPSDDSAYVLYSNVFA
Sbjct: 721 EAEAFITDMPIPPNDLVWRSLLASCRIYRNLDLGRKAAKHLLELDPSDDSAYVLYSNVFA 780
Query: 781 TIGRWEDVEDVRGQMGAHKIQKKPAHSWVKWKGNIGIFGMGDQTHPQMEQINGKLLGLMK 840
TIGRW DVEDVRGQMGAH+IQKKPAHSWVKWKGNI IFGMGDQTHPQMEQINGKLLGLMK
Sbjct: 781 TIGRWADVEDVRGQMGAHRIQKKPAHSWVKWKGNISIFGMGDQTHPQMEQINGKLLGLMK 840
Query: 841 MVREAGYVPDTSYSLQDTDEEQKEHNMWNHSERIAHSERIALAFGLINIPEDDFSPLFSF 900
+V EAGYVPDTSYSLQDTDEEQKEHNMW +HSERIALAFGLINIPE +F
Sbjct: 841 IVGEAGYVPDTSYSLQDTDEEQKEHNMW------SHSERIALAFGLINIPEGTTVRIFKN 900
Query: 901 IFLTDRC 908
+ + C
Sbjct: 901 LRVCGDC 901
BLAST of HG10011116 vs. ExPASy TrEMBL
Match:
A0A1S3C3P4 (pentatricopeptide repeat-containing protein At3g24000, mitochondrial isoform X2 OS=Cucumis melo OX=3656 GN=LOC103496132 PE=3 SV=1)
HSP 1 Score: 1691.0 bits (4378), Expect = 0.0e+00
Identity = 822/907 (90.63%), Postives = 857/907 (94.49%), Query Frame = 0
Query: 1 MYSKFGRINYARLVFDGMPERNEASWNNMMSAYVRVGSYLEAVLFFRDICGIGIKPSGFV 60
MYSKFGRINYA+LVFD M ERNEASWN+MMS YVRVGSY+EAVLFFRDICGIGIKPSGF+
Sbjct: 150 MYSKFGRINYAQLVFDRMSERNEASWNHMMSGYVRVGSYVEAVLFFRDICGIGIKPSGFM 209
Query: 61 ISSLVTACNKSSIMAKEGFQLHGFAIKCGLIYDVFVGTSFVHFYGSYGIVSNAQKMFNEM 120
I+SLVTACNKSSIMAKEGFQ HGFAIKCGLIYDVFVGTSFVHFY SYGIVSNAQKMFNEM
Sbjct: 210 IASLVTACNKSSIMAKEGFQFHGFAIKCGLIYDVFVGTSFVHFYASYGIVSNAQKMFNEM 269
Query: 121 PDRNVVSWTSLMVSYSDNGSKEEVINTYKRMRHEGICCNENNIALVISSCGFLVDILLGH 180
PDRNVVSWTSLMVSYSDNGSK+EVINTYKRMR EGICCNENNIALVISSCGFLVDI+LG
Sbjct: 270 PDRNVVSWTSLMVSYSDNGSKKEVINTYKRMRLEGICCNENNIALVISSCGFLVDIILGR 329
Query: 181 QLLGHVLKFGLETKVSAANSLVSMFGGCGDINEACSIFNEMNERDTISWNSIISANAQNA 240
QLLGH LKFGLETKVSAANSLV MFGGCGD++EACSIFNEMNERDTISWNSIISANAQNA
Sbjct: 330 QLLGHALKFGLETKVSAANSLVFMFGGCGDVDEACSIFNEMNERDTISWNSIISANAQNA 389
Query: 241 LHEESFRYFHWMRLVHEEINYTTLSILLSICGSVDYLKWGKGVHGLVVKYGLEPNICLCN 300
LHEESFRYFHWMRLVHEE+NYTTLSILLSICGSVDYLKWGKGVHGL VKYGLE NICLCN
Sbjct: 390 LHEESFRYFHWMRLVHEEMNYTTLSILLSICGSVDYLKWGKGVHGLAVKYGLESNICLCN 449
Query: 301 TLLNMYSDAGRSEGAELIFRRMPDRDLISWNSMLACYVQDGRYLCALKFFAEMLWMKKEI 360
TLL+MYSDAGRS+ AELIFRRMP+RDL+SWNSMLACYVQDGR LCALK FAEMLWMKKEI
Sbjct: 450 TLLSMYSDAGRSKDAELIFRRMPERDLVSWNSMLACYVQDGRCLCALKVFAEMLWMKKEI 509
Query: 361 NYVTFTSALAACLDPEFFTEGKILHGFVVVLGLQDDLIIGNTLITLYGKCHKMAEAKKLF 420
NYVTFTSALAACLDPEFFTEGKILHGFVVVLGLQD+LIIGNTLIT YGKC KM+EAKKLF
Sbjct: 510 NYVTFTSALAACLDPEFFTEGKILHGFVVVLGLQDELIIGNTLITFYGKCQKMSEAKKLF 569
Query: 421 QRIPKLDKVSWNALIGGFADNAEPNEAVAAFKLMREGGTCGVDYITIVNILSSCLIHEDL 480
QR+PKLDKV+WNALIGGFA+NAE NEAVAAFKLMREGGTCGVDYITIVNIL SCL EDL
Sbjct: 570 QRMPKLDKVTWNALIGGFANNAELNEAVAAFKLMREGGTCGVDYITIVNILGSCLTREDL 629
Query: 481 IKYGMPIHAHTVVTGFDLDQHVQSSLITMYTKCGDLHSSSYIFDNLVFKTSSVWNAIITA 540
IKYG+PIHAHTVVTGFDLDQHVQSSLITMY KCGDL SSSYIFD LVFKTSSVWNAII A
Sbjct: 630 IKYGIPIHAHTVVTGFDLDQHVQSSLITMYAKCGDLQSSSYIFDQLVFKTSSVWNAIIAA 689
Query: 541 NARYGFGEEALKLVGRMRTAGIEFDQFNFSTALSVAADLAMLEEGKQLHGSTIKLGFELD 600
NARYGFGEEALKLV RMR+AGIEFDQFNFST+LSVAADLAMLEEG+QLHGSTIKLGFELD
Sbjct: 690 NARYGFGEEALKLVVRMRSAGIEFDQFNFSTSLSVAADLAMLEEGQQLHGSTIKLGFELD 749
Query: 601 HFVINAAMDMYGKCGELDDALKILPQPTYRSRLSWNTMISIFARHGHFHKAKETFHEMLK 660
HF+ NAAMDMYGKCGELDDAL+ILPQPT RSRLSWNTMISIFARHGHF KAKETFHEMLK
Sbjct: 750 HFITNAAMDMYGKCGELDDALRILPQPTDRSRLSWNTMISIFARHGHFRKAKETFHEMLK 809
Query: 661 LGVKPDHVSFVCLLSACSHGGLVDEGRAYYASMTSEYGIQPGIEHCVCMIDLLGRSGRLV 720
LGVKP+HVSFVCLLSAC+HGGLV+EG AYYASMTSEYGIQPGIEHCVCMIDLLGRSGRLV
Sbjct: 810 LGVKPNHVSFVCLLSACNHGGLVEEGLAYYASMTSEYGIQPGIEHCVCMIDLLGRSGRLV 869
Query: 721 EAEAFITDMSIPPNDLVWRSLLASCRIYRNLDLGRKAAEHLLELDPSDDSAYVLYSNVFA 780
EAEAFITDM IPPNDLVWRSLLASCRIYRNLDLGRKAA+HLLELDPSDDSAYVLYSNVFA
Sbjct: 870 EAEAFITDMPIPPNDLVWRSLLASCRIYRNLDLGRKAAKHLLELDPSDDSAYVLYSNVFA 929
Query: 781 TIGRWEDVEDVRGQMGAHKIQKKPAHSWVKWKGNIGIFGMGDQTHPQMEQINGKLLGLMK 840
TIGRW DVEDVRGQMGAH+IQKKPAHSWVKWKGNI IFGMGDQTHPQMEQINGKLLGLMK
Sbjct: 930 TIGRWADVEDVRGQMGAHRIQKKPAHSWVKWKGNISIFGMGDQTHPQMEQINGKLLGLMK 989
Query: 841 MVREAGYVPDTSYSLQDTDEEQKEHNMWNHSERIAHSERIALAFGLINIPEDDFSPLFSF 900
+V EAGYVPDTSYSLQDTDEEQKEHNMW +HSERIALAFGLINIPE +F
Sbjct: 990 IVGEAGYVPDTSYSLQDTDEEQKEHNMW------SHSERIALAFGLINIPEGTTVRIFKN 1049
Query: 901 IFLTDRC 908
+ + C
Sbjct: 1050 LRVCGDC 1050
BLAST of HG10011116 vs. ExPASy TrEMBL
Match:
A0A1S3C2F0 (pentatricopeptide repeat-containing protein At3g24000, mitochondrial isoform X4 OS=Cucumis melo OX=3656 GN=LOC103496132 PE=3 SV=1)
HSP 1 Score: 1691.0 bits (4378), Expect = 0.0e+00
Identity = 822/907 (90.63%), Postives = 857/907 (94.49%), Query Frame = 0
Query: 1 MYSKFGRINYARLVFDGMPERNEASWNNMMSAYVRVGSYLEAVLFFRDICGIGIKPSGFV 60
MYSKFGRINYA+LVFD M ERNEASWN+MMS YVRVGSY+EAVLFFRDICGIGIKPSGF+
Sbjct: 128 MYSKFGRINYAQLVFDRMSERNEASWNHMMSGYVRVGSYVEAVLFFRDICGIGIKPSGFM 187
Query: 61 ISSLVTACNKSSIMAKEGFQLHGFAIKCGLIYDVFVGTSFVHFYGSYGIVSNAQKMFNEM 120
I+SLVTACNKSSIMAKEGFQ HGFAIKCGLIYDVFVGTSFVHFY SYGIVSNAQKMFNEM
Sbjct: 188 IASLVTACNKSSIMAKEGFQFHGFAIKCGLIYDVFVGTSFVHFYASYGIVSNAQKMFNEM 247
Query: 121 PDRNVVSWTSLMVSYSDNGSKEEVINTYKRMRHEGICCNENNIALVISSCGFLVDILLGH 180
PDRNVVSWTSLMVSYSDNGSK+EVINTYKRMR EGICCNENNIALVISSCGFLVDI+LG
Sbjct: 248 PDRNVVSWTSLMVSYSDNGSKKEVINTYKRMRLEGICCNENNIALVISSCGFLVDIILGR 307
Query: 181 QLLGHVLKFGLETKVSAANSLVSMFGGCGDINEACSIFNEMNERDTISWNSIISANAQNA 240
QLLGH LKFGLETKVSAANSLV MFGGCGD++EACSIFNEMNERDTISWNSIISANAQNA
Sbjct: 308 QLLGHALKFGLETKVSAANSLVFMFGGCGDVDEACSIFNEMNERDTISWNSIISANAQNA 367
Query: 241 LHEESFRYFHWMRLVHEEINYTTLSILLSICGSVDYLKWGKGVHGLVVKYGLEPNICLCN 300
LHEESFRYFHWMRLVHEE+NYTTLSILLSICGSVDYLKWGKGVHGL VKYGLE NICLCN
Sbjct: 368 LHEESFRYFHWMRLVHEEMNYTTLSILLSICGSVDYLKWGKGVHGLAVKYGLESNICLCN 427
Query: 301 TLLNMYSDAGRSEGAELIFRRMPDRDLISWNSMLACYVQDGRYLCALKFFAEMLWMKKEI 360
TLL+MYSDAGRS+ AELIFRRMP+RDL+SWNSMLACYVQDGR LCALK FAEMLWMKKEI
Sbjct: 428 TLLSMYSDAGRSKDAELIFRRMPERDLVSWNSMLACYVQDGRCLCALKVFAEMLWMKKEI 487
Query: 361 NYVTFTSALAACLDPEFFTEGKILHGFVVVLGLQDDLIIGNTLITLYGKCHKMAEAKKLF 420
NYVTFTSALAACLDPEFFTEGKILHGFVVVLGLQD+LIIGNTLIT YGKC KM+EAKKLF
Sbjct: 488 NYVTFTSALAACLDPEFFTEGKILHGFVVVLGLQDELIIGNTLITFYGKCQKMSEAKKLF 547
Query: 421 QRIPKLDKVSWNALIGGFADNAEPNEAVAAFKLMREGGTCGVDYITIVNILSSCLIHEDL 480
QR+PKLDKV+WNALIGGFA+NAE NEAVAAFKLMREGGTCGVDYITIVNIL SCL EDL
Sbjct: 548 QRMPKLDKVTWNALIGGFANNAELNEAVAAFKLMREGGTCGVDYITIVNILGSCLTREDL 607
Query: 481 IKYGMPIHAHTVVTGFDLDQHVQSSLITMYTKCGDLHSSSYIFDNLVFKTSSVWNAIITA 540
IKYG+PIHAHTVVTGFDLDQHVQSSLITMY KCGDL SSSYIFD LVFKTSSVWNAII A
Sbjct: 608 IKYGIPIHAHTVVTGFDLDQHVQSSLITMYAKCGDLQSSSYIFDQLVFKTSSVWNAIIAA 667
Query: 541 NARYGFGEEALKLVGRMRTAGIEFDQFNFSTALSVAADLAMLEEGKQLHGSTIKLGFELD 600
NARYGFGEEALKLV RMR+AGIEFDQFNFST+LSVAADLAMLEEG+QLHGSTIKLGFELD
Sbjct: 668 NARYGFGEEALKLVVRMRSAGIEFDQFNFSTSLSVAADLAMLEEGQQLHGSTIKLGFELD 727
Query: 601 HFVINAAMDMYGKCGELDDALKILPQPTYRSRLSWNTMISIFARHGHFHKAKETFHEMLK 660
HF+ NAAMDMYGKCGELDDAL+ILPQPT RSRLSWNTMISIFARHGHF KAKETFHEMLK
Sbjct: 728 HFITNAAMDMYGKCGELDDALRILPQPTDRSRLSWNTMISIFARHGHFRKAKETFHEMLK 787
Query: 661 LGVKPDHVSFVCLLSACSHGGLVDEGRAYYASMTSEYGIQPGIEHCVCMIDLLGRSGRLV 720
LGVKP+HVSFVCLLSAC+HGGLV+EG AYYASMTSEYGIQPGIEHCVCMIDLLGRSGRLV
Sbjct: 788 LGVKPNHVSFVCLLSACNHGGLVEEGLAYYASMTSEYGIQPGIEHCVCMIDLLGRSGRLV 847
Query: 721 EAEAFITDMSIPPNDLVWRSLLASCRIYRNLDLGRKAAEHLLELDPSDDSAYVLYSNVFA 780
EAEAFITDM IPPNDLVWRSLLASCRIYRNLDLGRKAA+HLLELDPSDDSAYVLYSNVFA
Sbjct: 848 EAEAFITDMPIPPNDLVWRSLLASCRIYRNLDLGRKAAKHLLELDPSDDSAYVLYSNVFA 907
Query: 781 TIGRWEDVEDVRGQMGAHKIQKKPAHSWVKWKGNIGIFGMGDQTHPQMEQINGKLLGLMK 840
TIGRW DVEDVRGQMGAH+IQKKPAHSWVKWKGNI IFGMGDQTHPQMEQINGKLLGLMK
Sbjct: 908 TIGRWADVEDVRGQMGAHRIQKKPAHSWVKWKGNISIFGMGDQTHPQMEQINGKLLGLMK 967
Query: 841 MVREAGYVPDTSYSLQDTDEEQKEHNMWNHSERIAHSERIALAFGLINIPEDDFSPLFSF 900
+V EAGYVPDTSYSLQDTDEEQKEHNMW +HSERIALAFGLINIPE +F
Sbjct: 968 IVGEAGYVPDTSYSLQDTDEEQKEHNMW------SHSERIALAFGLINIPEGTTVRIFKN 1027
Query: 901 IFLTDRC 908
+ + C
Sbjct: 1028 LRVCGDC 1028
BLAST of HG10011116 vs. ExPASy TrEMBL
Match:
A0A1S3C2I9 (pentatricopeptide repeat-containing protein At3g24000, mitochondrial isoform X3 OS=Cucumis melo OX=3656 GN=LOC103496132 PE=3 SV=1)
HSP 1 Score: 1691.0 bits (4378), Expect = 0.0e+00
Identity = 822/907 (90.63%), Postives = 857/907 (94.49%), Query Frame = 0
Query: 1 MYSKFGRINYARLVFDGMPERNEASWNNMMSAYVRVGSYLEAVLFFRDICGIGIKPSGFV 60
MYSKFGRINYA+LVFD M ERNEASWN+MMS YVRVGSY+EAVLFFRDICGIGIKPSGF+
Sbjct: 143 MYSKFGRINYAQLVFDRMSERNEASWNHMMSGYVRVGSYVEAVLFFRDICGIGIKPSGFM 202
Query: 61 ISSLVTACNKSSIMAKEGFQLHGFAIKCGLIYDVFVGTSFVHFYGSYGIVSNAQKMFNEM 120
I+SLVTACNKSSIMAKEGFQ HGFAIKCGLIYDVFVGTSFVHFY SYGIVSNAQKMFNEM
Sbjct: 203 IASLVTACNKSSIMAKEGFQFHGFAIKCGLIYDVFVGTSFVHFYASYGIVSNAQKMFNEM 262
Query: 121 PDRNVVSWTSLMVSYSDNGSKEEVINTYKRMRHEGICCNENNIALVISSCGFLVDILLGH 180
PDRNVVSWTSLMVSYSDNGSK+EVINTYKRMR EGICCNENNIALVISSCGFLVDI+LG
Sbjct: 263 PDRNVVSWTSLMVSYSDNGSKKEVINTYKRMRLEGICCNENNIALVISSCGFLVDIILGR 322
Query: 181 QLLGHVLKFGLETKVSAANSLVSMFGGCGDINEACSIFNEMNERDTISWNSIISANAQNA 240
QLLGH LKFGLETKVSAANSLV MFGGCGD++EACSIFNEMNERDTISWNSIISANAQNA
Sbjct: 323 QLLGHALKFGLETKVSAANSLVFMFGGCGDVDEACSIFNEMNERDTISWNSIISANAQNA 382
Query: 241 LHEESFRYFHWMRLVHEEINYTTLSILLSICGSVDYLKWGKGVHGLVVKYGLEPNICLCN 300
LHEESFRYFHWMRLVHEE+NYTTLSILLSICGSVDYLKWGKGVHGL VKYGLE NICLCN
Sbjct: 383 LHEESFRYFHWMRLVHEEMNYTTLSILLSICGSVDYLKWGKGVHGLAVKYGLESNICLCN 442
Query: 301 TLLNMYSDAGRSEGAELIFRRMPDRDLISWNSMLACYVQDGRYLCALKFFAEMLWMKKEI 360
TLL+MYSDAGRS+ AELIFRRMP+RDL+SWNSMLACYVQDGR LCALK FAEMLWMKKEI
Sbjct: 443 TLLSMYSDAGRSKDAELIFRRMPERDLVSWNSMLACYVQDGRCLCALKVFAEMLWMKKEI 502
Query: 361 NYVTFTSALAACLDPEFFTEGKILHGFVVVLGLQDDLIIGNTLITLYGKCHKMAEAKKLF 420
NYVTFTSALAACLDPEFFTEGKILHGFVVVLGLQD+LIIGNTLIT YGKC KM+EAKKLF
Sbjct: 503 NYVTFTSALAACLDPEFFTEGKILHGFVVVLGLQDELIIGNTLITFYGKCQKMSEAKKLF 562
Query: 421 QRIPKLDKVSWNALIGGFADNAEPNEAVAAFKLMREGGTCGVDYITIVNILSSCLIHEDL 480
QR+PKLDKV+WNALIGGFA+NAE NEAVAAFKLMREGGTCGVDYITIVNIL SCL EDL
Sbjct: 563 QRMPKLDKVTWNALIGGFANNAELNEAVAAFKLMREGGTCGVDYITIVNILGSCLTREDL 622
Query: 481 IKYGMPIHAHTVVTGFDLDQHVQSSLITMYTKCGDLHSSSYIFDNLVFKTSSVWNAIITA 540
IKYG+PIHAHTVVTGFDLDQHVQSSLITMY KCGDL SSSYIFD LVFKTSSVWNAII A
Sbjct: 623 IKYGIPIHAHTVVTGFDLDQHVQSSLITMYAKCGDLQSSSYIFDQLVFKTSSVWNAIIAA 682
Query: 541 NARYGFGEEALKLVGRMRTAGIEFDQFNFSTALSVAADLAMLEEGKQLHGSTIKLGFELD 600
NARYGFGEEALKLV RMR+AGIEFDQFNFST+LSVAADLAMLEEG+QLHGSTIKLGFELD
Sbjct: 683 NARYGFGEEALKLVVRMRSAGIEFDQFNFSTSLSVAADLAMLEEGQQLHGSTIKLGFELD 742
Query: 601 HFVINAAMDMYGKCGELDDALKILPQPTYRSRLSWNTMISIFARHGHFHKAKETFHEMLK 660
HF+ NAAMDMYGKCGELDDAL+ILPQPT RSRLSWNTMISIFARHGHF KAKETFHEMLK
Sbjct: 743 HFITNAAMDMYGKCGELDDALRILPQPTDRSRLSWNTMISIFARHGHFRKAKETFHEMLK 802
Query: 661 LGVKPDHVSFVCLLSACSHGGLVDEGRAYYASMTSEYGIQPGIEHCVCMIDLLGRSGRLV 720
LGVKP+HVSFVCLLSAC+HGGLV+EG AYYASMTSEYGIQPGIEHCVCMIDLLGRSGRLV
Sbjct: 803 LGVKPNHVSFVCLLSACNHGGLVEEGLAYYASMTSEYGIQPGIEHCVCMIDLLGRSGRLV 862
Query: 721 EAEAFITDMSIPPNDLVWRSLLASCRIYRNLDLGRKAAEHLLELDPSDDSAYVLYSNVFA 780
EAEAFITDM IPPNDLVWRSLLASCRIYRNLDLGRKAA+HLLELDPSDDSAYVLYSNVFA
Sbjct: 863 EAEAFITDMPIPPNDLVWRSLLASCRIYRNLDLGRKAAKHLLELDPSDDSAYVLYSNVFA 922
Query: 781 TIGRWEDVEDVRGQMGAHKIQKKPAHSWVKWKGNIGIFGMGDQTHPQMEQINGKLLGLMK 840
TIGRW DVEDVRGQMGAH+IQKKPAHSWVKWKGNI IFGMGDQTHPQMEQINGKLLGLMK
Sbjct: 923 TIGRWADVEDVRGQMGAHRIQKKPAHSWVKWKGNISIFGMGDQTHPQMEQINGKLLGLMK 982
Query: 841 MVREAGYVPDTSYSLQDTDEEQKEHNMWNHSERIAHSERIALAFGLINIPEDDFSPLFSF 900
+V EAGYVPDTSYSLQDTDEEQKEHNMW +HSERIALAFGLINIPE +F
Sbjct: 983 IVGEAGYVPDTSYSLQDTDEEQKEHNMW------SHSERIALAFGLINIPEGTTVRIFKN 1042
Query: 901 IFLTDRC 908
+ + C
Sbjct: 1043 LRVCGDC 1043
BLAST of HG10011116 vs. TAIR 10
Match:
AT1G16480.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )
HSP 1 Score: 1090.5 bits (2819), Expect = 0.0e+00
Identity = 522/907 (57.55%), Postives = 680/907 (74.97%), Query Frame = 0
Query: 1 MYSKFGRINYARLVFDGMPERNEASWNNMMSAYVRVGSYLEAVLFFRDICGIGIKPSGFV 60
MY+KFGR+ AR +FD MP RNE SWN MMS VRVG YLE + FFR +C +GIKPS FV
Sbjct: 1 MYTKFGRVKPARHLFDIMPVRNEVSWNTMMSGIVRVGLYLEGMEFFRKMCDLGIKPSSFV 60
Query: 61 ISSLVTACNKSSIMAKEGFQLHGFAIKCGLIYDVFVGTSFVHFYGSYGIVSNAQKMFNEM 120
I+SLVTAC +S M +EG Q+HGF K GL+ DV+V T+ +H YG YG+VS ++K+F EM
Sbjct: 61 IASLVTACGRSGSMFREGVQVHGFVAKSGLLSDVYVSTAILHLYGVYGLVSCSRKVFEEM 120
Query: 121 PDRNVVSWTSLMVSYSDNGSKEEVINTYKRMRHEGICCNENNIALVISSCGFLVDILLGH 180
PDRNVVSWTSLMV YSD G EEVI+ YK MR EG+ CNEN+++LVISSCG L D LG
Sbjct: 121 PDRNVVSWTSLMVGYSDKGEPEEVIDIYKGMRGEGVGCNENSMSLVISSCGLLKDESLGR 180
Query: 181 QLLGHVLKFGLETKVSAANSLVSMFGGCGDINEACSIFNEMNERDTISWNSIISANAQNA 240
Q++G V+K GLE+K++ NSL+SM G G+++ A IF++M+ERDTISWNSI +A AQN
Sbjct: 181 QIIGQVVKSGLESKLAVENSLISMLGSMGNVDYANYIFDQMSERDTISWNSIAAAYAQNG 240
Query: 241 LHEESFRYFHWMRLVHEEINYTTLSILLSICGSVDYLKWGKGVHGLVVKYGLEPNICLCN 300
EESFR F MR H+E+N TT+S LLS+ G VD+ KWG+G+HGLVVK G + +C+CN
Sbjct: 241 HIEESFRIFSLMRRFHDEVNSTTVSTLLSVLGHVDHQKWGRGIHGLVVKMGFDSVVCVCN 300
Query: 301 TLLNMYSDAGRSEGAELIFRRMPDRDLISWNSMLACYVQDGRYLCALKFFAEMLWMKKEI 360
TLL MY+ AGRS A L+F++MP +DLISWNS++A +V DGR L AL M+ K +
Sbjct: 301 TLLRMYAGAGRSVEANLVFKQMPTKDLISWNSLMASFVNDGRSLDALGLLCSMISSGKSV 360
Query: 361 NYVTFTSALAACLDPEFFTEGKILHGFVVVLGLQDDLIIGNTLITLYGKCHKMAEAKKLF 420
NYVTFTSALAAC P+FF +G+ILHG VVV GL + IIGN L+++YGK +M+E++++
Sbjct: 361 NYVTFTSALAACFTPDFFEKGRILHGLVVVSGLFYNQIIGNALVSMYGKIGEMSESRRVL 420
Query: 421 QRIPKLDKVSWNALIGGFADNAEPNEAVAAFKLMREGGTCGVDYITIVNILSSCLIHEDL 480
++P+ D V+WNALIGG+A++ +P++A+AAF+ MR G +YIT+V++LS+CL+ DL
Sbjct: 421 LQMPRRDVVAWNALIGGYAEDEDPDKALAAFQTMRVEGVSS-NYITVVSVLSACLLPGDL 480
Query: 481 IKYGMPIHAHTVVTGFDLDQHVQSSLITMYTKCGDLHSSSYIFDNLVFKTSSVWNAIITA 540
++ G P+HA+ V GF+ D+HV++SLITMY KCGDL SS +F+ L + WNA++ A
Sbjct: 481 LERGKPLHAYIVSAGFESDEHVKNSLITMYAKCGDLSSSQDLFNGLDNRNIITWNAMLAA 540
Query: 541 NARYGFGEEALKLVGRMRTAGIEFDQFNFSTALSVAADLAMLEEGKQLHGSTIKLGFELD 600
NA +G GEE LKLV +MR+ G+ DQF+FS LS AA LA+LEEG+QLHG +KLGFE D
Sbjct: 541 NAHHGHGEEVLKLVSKMRSFGVSLDQFSFSEGLSAAAKLAVLEEGQQLHGLAVKLGFEHD 600
Query: 601 HFVINAAMDMYGKCGELDDALKILPQPTYRSRLSWNTMISIFARHGHFHKAKETFHEMLK 660
F+ NAA DMY KCGE+ + +K+LP RS SWN +IS RHG+F + TFHEML+
Sbjct: 601 SFIFNAAADMYSKCGEIGEVVKMLPPSVNRSLPSWNILISALGRHGYFEEVCATFHEMLE 660
Query: 661 LGVKPDHVSFVCLLSACSHGGLVDEGRAYYASMTSEYGIQPGIEHCVCMIDLLGRSGRLV 720
+G+KP HV+FV LL+ACSHGGLVD+G AYY + ++G++P IEHC+C+IDLLGRSGRL
Sbjct: 661 MGIKPGHVTFVSLLTACSHGGLVDKGLAYYDMIARDFGLEPAIEHCICVIDLLGRSGRLA 720
Query: 721 EAEAFITDMSIPPNDLVWRSLLASCRIYRNLDLGRKAAEHLLELDPSDDSAYVLYSNVFA 780
EAE FI+ M + PNDLVWRSLLASC+I+ NLD GRKAAE+L +L+P DDS YVL SN+FA
Sbjct: 721 EAETFISKMPMKPNDLVWRSLLASCKIHGNLDRGRKAAENLSKLEPEDDSVYVLSSNMFA 780
Query: 781 TIGRWEDVEDVRGQMGAHKIQKKPAHSWVKWKGNIGIFGMGDQTHPQMEQINGKLLGLMK 840
T GRWEDVE+VR QMG I+KK A SWVK K + FG+GD+THPQ +I KL + K
Sbjct: 781 TTGRWEDVENVRKQMGFKNIKKKQACSWVKLKDKVSSFGIGDRTHPQTMEIYAKLEDIKK 840
Query: 841 MVREAGYVPDTSYSLQDTDEEQKEHNMWNHSERIAHSERIALAFGLINIPEDDFSPLFSF 900
+++E+GYV DTS +LQDTDEEQKEHN+WN HSER+ALA+ L++ PE +F
Sbjct: 841 LIKESGYVADTSQALQDTDEEQKEHNLWN------HSERLALAYALMSTPEGSTVRIFKN 900
Query: 901 IFLTDRC 908
+ + C
Sbjct: 901 LRICSDC 900
BLAST of HG10011116 vs. TAIR 10
Match:
AT1G16480.2 (Tetratricopeptide repeat (TPR)-like superfamily protein )
HSP 1 Score: 1070.8 bits (2768), Expect = 6.1e-313
Identity = 512/890 (57.53%), Postives = 667/890 (74.94%), Query Frame = 0
Query: 18 MPERNEASWNNMMSAYVRVGSYLEAVLFFRDICGIGIKPSGFVISSLVTACNKSSIMAKE 77
MP RNE SWN MMS VRVG YLE + FFR +C +GIKPS FVI+SLVTAC +S M +E
Sbjct: 1 MPVRNEVSWNTMMSGIVRVGLYLEGMEFFRKMCDLGIKPSSFVIASLVTACGRSGSMFRE 60
Query: 78 GFQLHGFAIKCGLIYDVFVGTSFVHFYGSYGIVSNAQKMFNEMPDRNVVSWTSLMVSYSD 137
G Q+HGF K GL+ DV+V T+ +H YG YG+VS ++K+F EMPDRNVVSWTSLMV YSD
Sbjct: 61 GVQVHGFVAKSGLLSDVYVSTAILHLYGVYGLVSCSRKVFEEMPDRNVVSWTSLMVGYSD 120
Query: 138 NGSKEEVINTYKRMRHEGICCNENNIALVISSCGFLVDILLGHQLLGHVLKFGLETKVSA 197
G EEVI+ YK MR EG+ CNEN+++LVISSCG L D LG Q++G V+K GLE+K++
Sbjct: 121 KGEPEEVIDIYKGMRGEGVGCNENSMSLVISSCGLLKDESLGRQIIGQVVKSGLESKLAV 180
Query: 198 ANSLVSMFGGCGDINEACSIFNEMNERDTISWNSIISANAQNALHEESFRYFHWMRLVHE 257
NSL+SM G G+++ A IF++M+ERDTISWNSI +A AQN EESFR F MR H+
Sbjct: 181 ENSLISMLGSMGNVDYANYIFDQMSERDTISWNSIAAAYAQNGHIEESFRIFSLMRRFHD 240
Query: 258 EINYTTLSILLSICGSVDYLKWGKGVHGLVVKYGLEPNICLCNTLLNMYSDAGRSEGAEL 317
E+N TT+S LLS+ G VD+ KWG+G+HGLVVK G + +C+CNTLL MY+ AGRS A L
Sbjct: 241 EVNSTTVSTLLSVLGHVDHQKWGRGIHGLVVKMGFDSVVCVCNTLLRMYAGAGRSVEANL 300
Query: 318 IFRRMPDRDLISWNSMLACYVQDGRYLCALKFFAEMLWMKKEINYVTFTSALAACLDPEF 377
+F++MP +DLISWNS++A +V DGR L AL M+ K +NYVTFTSALAAC P+F
Sbjct: 301 VFKQMPTKDLISWNSLMASFVNDGRSLDALGLLCSMISSGKSVNYVTFTSALAACFTPDF 360
Query: 378 FTEGKILHGFVVVLGLQDDLIIGNTLITLYGKCHKMAEAKKLFQRIPKLDKVSWNALIGG 437
F +G+ILHG VVV GL + IIGN L+++YGK +M+E++++ ++P+ D V+WNALIGG
Sbjct: 361 FEKGRILHGLVVVSGLFYNQIIGNALVSMYGKIGEMSESRRVLLQMPRRDVVAWNALIGG 420
Query: 438 FADNAEPNEAVAAFKLMREGGTCGVDYITIVNILSSCLIHEDLIKYGMPIHAHTVVTGFD 497
+A++ +P++A+AAF+ MR G +YIT+V++LS+CL+ DL++ G P+HA+ V GF+
Sbjct: 421 YAEDEDPDKALAAFQTMRVEGVSS-NYITVVSVLSACLLPGDLLERGKPLHAYIVSAGFE 480
Query: 498 LDQHVQSSLITMYTKCGDLHSSSYIFDNLVFKTSSVWNAIITANARYGFGEEALKLVGRM 557
D+HV++SLITMY KCGDL SS +F+ L + WNA++ ANA +G GEE LKLV +M
Sbjct: 481 SDEHVKNSLITMYAKCGDLSSSQDLFNGLDNRNIITWNAMLAANAHHGHGEEVLKLVSKM 540
Query: 558 RTAGIEFDQFNFSTALSVAADLAMLEEGKQLHGSTIKLGFELDHFVINAAMDMYGKCGEL 617
R+ G+ DQF+FS LS AA LA+LEEG+QLHG +KLGFE D F+ NAA DMY KCGE+
Sbjct: 541 RSFGVSLDQFSFSEGLSAAAKLAVLEEGQQLHGLAVKLGFEHDSFIFNAAADMYSKCGEI 600
Query: 618 DDALKILPQPTYRSRLSWNTMISIFARHGHFHKAKETFHEMLKLGVKPDHVSFVCLLSAC 677
+ +K+LP RS SWN +IS RHG+F + TFHEML++G+KP HV+FV LL+AC
Sbjct: 601 GEVVKMLPPSVNRSLPSWNILISALGRHGYFEEVCATFHEMLEMGIKPGHVTFVSLLTAC 660
Query: 678 SHGGLVDEGRAYYASMTSEYGIQPGIEHCVCMIDLLGRSGRLVEAEAFITDMSIPPNDLV 737
SHGGLVD+G AYY + ++G++P IEHC+C+IDLLGRSGRL EAE FI+ M + PNDLV
Sbjct: 661 SHGGLVDKGLAYYDMIARDFGLEPAIEHCICVIDLLGRSGRLAEAETFISKMPMKPNDLV 720
Query: 738 WRSLLASCRIYRNLDLGRKAAEHLLELDPSDDSAYVLYSNVFATIGRWEDVEDVRGQMGA 797
WRSLLASC+I+ NLD GRKAAE+L +L+P DDS YVL SN+FAT GRWEDVE+VR QMG
Sbjct: 721 WRSLLASCKIHGNLDRGRKAAENLSKLEPEDDSVYVLSSNMFATTGRWEDVENVRKQMGF 780
Query: 798 HKIQKKPAHSWVKWKGNIGIFGMGDQTHPQMEQINGKLLGLMKMVREAGYVPDTSYSLQD 857
I+KK A SWVK K + FG+GD+THPQ +I KL + K+++E+GYV DTS +LQD
Sbjct: 781 KNIKKKQACSWVKLKDKVSSFGIGDRTHPQTMEIYAKLEDIKKLIKESGYVADTSQALQD 840
Query: 858 TDEEQKEHNMWNHSERIAHSERIALAFGLINIPEDDFSPLFSFIFLTDRC 908
TDEEQKEHN+WN HSER+ALA+ L++ PE +F + + C
Sbjct: 841 TDEEQKEHNLWN------HSERLALAYALMSTPEGSTVRIFKNLRICSDC 883
BLAST of HG10011116 vs. TAIR 10
Match:
AT4G33170.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )
HSP 1 Score: 508.4 bits (1308), Expect = 1.2e-143
Identity = 297/903 (32.89%), Postives = 481/903 (53.27%), Query Frame = 0
Query: 1 MYSKFGRINYARLVFDGMPERNEASWNNMMSAYVR-----VGSYLEAVLFFRDICGIGIK 60
MYSK G + YAR VFD MP+R+ SWN++++AY + V + +A L FR + +
Sbjct: 83 MYSKCGSLTYARRVFDKMPDRDLVSWNSILAAYAQSSECVVENIQQAFLLFRILRQDVVY 142
Query: 61 PSGFVISSLVTACNKSS-IMAKEGFQLHGFAIKCGLIYDVFVGTSFVHFYGSYGIVSNAQ 120
S +S ++ C S + A E F HG+A K GL D FV + V+ Y +G V +
Sbjct: 143 TSRMTLSPMLKLCLHSGYVWASESF--HGYACKIGLDGDEFVAGALVNIYLKFGKVKEGK 202
Query: 121 KMFNEMPDRNVVSWTSLMVSYSDNGSKEEVINTYKRMRHEGICCNENNIALVISSCGFLV 180
+F EMP R+VV W ++ +Y + G KEE I+ G+ N N I L
Sbjct: 203 VLFEEMPYRDVVLWNLMLKAYLEMGFKEEAIDLSSAFHSSGL--NPNEITL--------- 262
Query: 181 DILLGHQLLGHVLKFGLETKVSAANSLVSMFGGCGDINEACSIFNEMNERDTISWNSIIS 240
+LL + GD ++A + + N D S + II
Sbjct: 263 ------RLLARI---------------------SGDDSDAGQVKSFANGNDASSVSEIIF 322
Query: 241 ANAQNALHEESFRYFHWMRLVHE------EINYTTLSILLSICGSVDYLKWGKGVHGLVV 300
N + + S +Y ++ + E + T ++L+ VD L G+ VH + +
Sbjct: 323 RNKGLSEYLHSGQYSALLKCFADMVESDVECDQVTFILMLATAVKVDSLALGQQVHCMAL 382
Query: 301 KYGLEPNICLCNTLLNMYSDAGRSEGAELIFRRMPDRDLISWNSMLACYVQDGRYLCALK 360
K GL+ + + N+L+NMY + A +F M +RDLISWNS++A Q+G + A+
Sbjct: 383 KLGLDLMLTVSNSLINMYCKLRKFGFARTVFDNMSERDLISWNSVIAGIAQNGLEVEAVC 442
Query: 361 FFAEMLWMKKEINYVTFTSAL-AACLDPEFFTEGKILHGFVVVLGLQDDLIIGNTLITLY 420
F ++L + + T TS L AA PE + K +H + + D + LI Y
Sbjct: 443 LFMQLLRCGLKPDQYTMTSVLKAASSLPEGLSLSKQVHVHAIKINNVSDSFVSTALIDAY 502
Query: 421 GKCHKMAEAKKLFQRIPKLDKVSWNALIGGFADNAEPNEAVAAFKLMREGGTCGVDYITI 480
+ M EA+ LF+R D V+WNA++ G+ + + ++ + F LM + G D+ T+
Sbjct: 503 SRNRCMKEAEILFER-HNFDLVAWNAMMAGYTQSHDGHKTLKLFALMHKQGERSDDF-TL 562
Query: 481 VNILSSCLIHEDLIKYGMPIHAHTVVTGFDLDQHVQSSLITMYTKCGDLHSSSYIFDNLV 540
+ +C I G +HA+ + +G+DLD V S ++ MY KCGD+ ++ + FD++
Sbjct: 563 ATVFKTCGF-LFAINQGKQVHAYAIKSGYDLDLWVSSGILDMYVKCGDMSAAQFAFDSIP 622
Query: 541 FKTSSVWNAIITANARYGFGEEALKLVGRMRTAGIEFDQFNFSTALSVAADLAMLEEGKQ 600
W +I+ G E A + +MR G+ D+F +T ++ L LE+G+Q
Sbjct: 623 VPDDVAWTTMISGCIENGEEERAFHVFSQMRLMGVLPDEFTIATLAKASSCLTALEQGRQ 682
Query: 601 LHGSTIKLGFELDHFVINAAMDMYGKCGELDDALKILPQPTYRSRLSWNTMISIFARHGH 660
+H + +KL D FV + +DMY KCG +DDA + + + +WN M+ A+HG
Sbjct: 683 IHANALKLNCTNDPFVGTSLVDMYAKCGSIDDAYCLFKRIEMMNITAWNAMLVGLAQHGE 742
Query: 661 FHKAKETFHEMLKLGVKPDHVSFVCLLSACSHGGLVDEGRAYYASMTSEYGIQPGIEHCV 720
+ + F +M LG+KPD V+F+ +LSACSH GLV E + SM +YGI+P IEH
Sbjct: 743 GKETLQLFKQMKSLGIKPDKVTFIGVLSACSHSGLVSEAYKHMRSMHGDYGIKPEIEHYS 802
Query: 721 CMIDLLGRSGRLVEAEAFITDMSIPPNDLVWRSLLASCRIYRNLDLGRKAAEHLLELDPS 780
C+ D LGR+G + +AE I MS+ + ++R+LLA+CR+ + + G++ A LLEL+P
Sbjct: 803 CLADALGRAGLVKQAENLIESMSMEASASMYRTLLAACRVQGDTETGKRVATKLLELEPL 862
Query: 781 DDSAYVLYSNVFATIGRWEDVEDVRGQMGAHKIQKKPAHSWVKWKGNIGIFGMGDQTHPQ 840
D SAYVL SN++A +W++++ R M HK++K P SW++ K I IF + D+++ Q
Sbjct: 863 DSSAYVLLSNMYAAASKWDEMKLARTMMKGHKVKKDPGFSWIEVKNKIHIFVVDDRSNRQ 922
Query: 841 MEQINGKLLGLMKMVREAGYVPDTSYSLQDTDEEQKEHNMWNHSERIAHSERIALAFGLI 891
E I K+ +++ +++ GYVP+T ++L D +EE+KE ++ HSE++A+AFGL+
Sbjct: 923 TELIYRKVKDMIRDIKQEGYVPETDFTLVDVEEEEKERALY------YHSEKLAVAFGLL 936
BLAST of HG10011116 vs. TAIR 10
Match:
AT4G13650.1 (Pentatricopeptide repeat (PPR) superfamily protein )
HSP 1 Score: 496.5 bits (1277), Expect = 4.7e-140
Identity = 279/877 (31.81%), Postives = 464/877 (52.91%), Query Frame = 0
Query: 14 VFDGMPERNEASWNNMMSAYVRVGSYLEAVLFFRDICGIGIKPSGFVISSLVTACNKSSI 73
VFD MPER +WN M+ E F + + P+ S ++ AC S+
Sbjct: 142 VFDEMPERTIFTWNKMIKELASRNLIGEVFGLFVRMVSENVTPNEGTFSGVLEACRGGSV 201
Query: 74 MAKEGFQLHGFAIKCGLIYDVFVGTSFVHFYGSYGIVSNAQKMFNEMPDRNVVSWTSLMV 133
Q+H + GL V + Y G V A+++F+ + ++ SW +++
Sbjct: 202 AFDVVEQIHARILYQGLRDSTVVCNPLIDLYSRNGFVDLARRVFDGLRLKDHSSWVAMIS 261
Query: 134 SYSDNGSKEEVINTYKRMRHEGICCNENNIALVISSCGFLVDILLGHQLLGHVLKFGLET 193
S N + E I + M GI + V+S+C + + +G QL G VLK G +
Sbjct: 262 GLSKNECEAEAIRLFCDMYVLGIMPTPYAFSSVLSACKKIESLEIGEQLHGLVLKLGFSS 321
Query: 194 KVSAANSLVSMFGGCGDINEACSIFNEMNERDTISWNSIISANAQNALHEESFRYFHWMR 253
N+LVS++ G++ A IF+ M++RD +++N++I+ +Q E++ F M
Sbjct: 322 DTYVCNALVSLYFHLGNLISAEHIFSNMSQRDAVTYNTLINGLSQCGYGEKAMELFKRMH 381
Query: 254 LVHEEINYTTLSILLSICGSVDYLKWGKGVHGLVVKYGLEPNICLCNTLLNMYSDAGRSE 313
L E + TL+ L+ C + L G+ +H K G N + LLN+Y+ E
Sbjct: 382 LDGLEPDSNTLASLVVACSADGTLFRGQQLHAYTTKLGFASNNKIEGALLNLYAKCADIE 441
Query: 314 GAELIFRRMPDRDLISWNSMLACYVQDGRYLCALKFFAEMLWMKKEINYVTFTSALAACL 373
A F +++ WN ML Y + + F +M + N T+ S L C+
Sbjct: 442 TALDYFLETEVENVVLWNVMLVAYGLLDDLRNSFRIFRQMQIEEIVPNQYTYPSILKTCI 501
Query: 374 DPEFFTEGKILHGFVVVLGLQDDLIIGNTLITLYGKCHKMAEAKKLFQRIPKLDKVSWNA 433
G+ +H ++ Q + + + LI +Y K K+ A + R D VSW
Sbjct: 502 RLGDLELGEQIHSQIIKTNFQLNAYVCSVLIDMYAKLGKLDTAWDILIRFAGKDVVSWTT 561
Query: 434 LIGGFADNAEPNEAVAAFKLMREGGTCGVDYITIVNILSSCLIHEDLIKYGMPIHAHTVV 493
+I G+ ++A+ F+ M + G D + + N +S+C + L K G IHA V
Sbjct: 562 MIAGYTQYNFDDKALTTFRQMLDRGIRS-DEVGLTNAVSACAGLQAL-KEGQQIHAQACV 621
Query: 494 TGFDLDQHVQSSLITMYTKCGDLHSSSYIFDNLVFKTSSVWNAIITANARYGFGEEALKL 553
+GF D Q++L+T+Y++CG + S F+ + WNA+++ + G EEAL++
Sbjct: 622 SGFSSDLPFQNALVTLYSRCGKIEESYLAFEQTEAGDNIAWNALVSGFQQSGNNEEALRV 681
Query: 554 VGRMRTAGIEFDQFNFSTALSVAADLAMLEEGKQLHGSTIKLGFELDHFVINAAMDMYGK 613
RM GI+ + F F +A+ A++ A +++GKQ+H K G++ + V NA + MY K
Sbjct: 682 FVRMNREGIDNNNFTFGSAVKAASETANMKQGKQVHAVITKTGYDSETEVCNALISMYAK 741
Query: 614 CGELDDALKILPQPTYRSRLSWNTMISIFARHGHFHKAKETFHEMLKLGVKPDHVSFVCL 673
CG + DA K + + ++ +SWN +I+ +++HG +A ++F +M+ V+P+HV+ V +
Sbjct: 742 CGSISDAEKQFLEVSTKNEVSWNAIINAYSKHGFGSEALDSFDQMIHSNVRPNHVTLVGV 801
Query: 674 LSACSHGGLVDEGRAYYASMTSEYGIQPGIEHCVCMIDLLGRSGRLVEAEAFITDMSIPP 733
LSACSH GLVD+G AY+ SM SEYG+ P EH VC++D+L R+G L A+ FI +M I P
Sbjct: 802 LSACSHIGLVDKGIAYFESMNSEYGLSPKPEHYVCVVDMLTRAGLLSRAKEFIQEMPIKP 861
Query: 734 NDLVWRSLLASCRIYRNLDLGRKAAEHLLELDPSDDSAYVLYSNVFATIGRWEDVEDVRG 793
+ LVWR+LL++C +++N+++G AA HLLEL+P D + YVL SN++A +W+ + R
Sbjct: 862 DALVWRTLLSACVVHKNMEIGEFAAHHLLELEPEDSATYVLLSNLYAVSKKWDARDLTRQ 921
Query: 794 QMGAHKIQKKPAHSWVKWKGNIGIFGMGDQTHPQMEQINGKLLGLMKMVREAGYVPDTSY 853
+M ++K+P SW++ K +I F +GDQ HP ++I+ L K E GYV D
Sbjct: 922 KMKEKGVKKEPGQSWIEVKNSIHSFYVGDQNHPLADEIHEYFQDLTKRASEIGYVQDCFS 981
Query: 854 SLQDTDEEQKEHNMWNHSERIAHSERIALAFGLINIP 891
L + EQK+ ++ HSE++A++FGL+++P
Sbjct: 982 LLNELQHEQKDPIIF------IHSEKLAISFGLLSLP 1010
BLAST of HG10011116 vs. TAIR 10
Match:
AT5G09950.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )
HSP 1 Score: 486.9 bits (1252), Expect = 3.7e-137
Identity = 297/893 (33.26%), Postives = 472/893 (52.86%), Query Frame = 0
Query: 2 YSKFGRINYARLVFDGMPERNEASWNNMMSAYVRVGSYLEAVLFFRDICGIGIKPSGFVI 61
Y + G AR VFD MP RN SW ++S Y R G + EA++F RD+ GI + +
Sbjct: 46 YLETGDSVSARKVFDEMPLRNCVSWACIVSGYSRNGEHKEALVFLRDMVKEGIFSNQYAF 105
Query: 62 SSLVTACNK-SSIMAKEGFQLHGFAIKCGLIYDVFVGTSFVHFYGS-YGIVSNAQKMFNE 121
S++ AC + S+ G Q+HG K D V + Y G V A F +
Sbjct: 106 VSVLRACQEIGSVGILFGRQIHGLMFKLSYAVDAVVSNVLISMYWKCIGSVGYALCAFGD 165
Query: 122 MPDRNVVSWTSLMVSYSDNGSKEEVINTYKRMRHEGICCNENNI-ALVISSCGFL-VDIL 181
+ +N VSW S++ YS G + + M+++G E +LV ++C D+
Sbjct: 166 IEVKNSVSWNSIISVYSQAGDQRSAFRIFSSMQYDGSRPTEYTFGSLVTTACSLTEPDVR 225
Query: 182 LGHQLLGHVLKFGLETKVSAANSLVSMFGGCGDINEACSIFNEMNERDTISWNSIISANA 241
L Q++ + K GL T + + LVS F G ++ A +FN+M R+ ++ N ++
Sbjct: 226 LLEQIMCTIQKSGLLTDLFVGSGLVSAFAKSGSLSYARKVFNQMETRNAVTLNGLMVGLV 285
Query: 242 QNALHEESFRYFHWMRLVHEEINYTTLSILLSICGSVDY-------LKWGKGVHGLVVKY 301
+ EE+ + F M + I+ + S ++ + +Y LK G+ VHG V+
Sbjct: 286 RQKWGEEATKLFMDM---NSMIDVSPESYVILLSSFPEYSLAEEVGLKKGREVHGHVITT 345
Query: 302 GL-EPNICLCNTLLNMYSDAGRSEGAELIFRRMPDRDLISWNSMLACYVQDGRYLCALKF 361
GL + + + N L+NMY+ G A +F M D+D +SWNSM+ Q+G ++ A++
Sbjct: 346 GLVDFMVGIGNGLVNMYAKCGSIADARRVFYFMTDKDSVSWNSMITGLDQNGCFIEAVER 405
Query: 362 FAEMLWMKKEINYVTFTSALAACLDPEFFTEGKILHGFVVVLGLQDDLIIGNTLITLYGK 421
+ M T S+L++C ++ G+ +HG + LG+ ++ + N L+TLY +
Sbjct: 406 YKSMRRHDILPGSFTLISSLSSCASLKWAKLGQQIHGESLKLGIDLNVSVSNALMTLYAE 465
Query: 422 CHKMAEAKKLFQRIPKLDKVSWNALIGGFA--DNAEPNEAVAAFKLMREGGTCG-VDYIT 481
+ E +K+F +P+ D+VSWN++IG A + + P V R G + + +
Sbjct: 466 TGYLNECRKIFSSMPEHDQVSWNSIIGALARSERSLPEAVVCFLNAQRAGQKLNRITFSS 525
Query: 482 IVNILSSCLIHEDLIKYGMPIHAHTVVTGFDLDQHVQSSLITMYTKCGDLHSSSYIFDNL 541
+++ +SS E G IH + + +++LI Y KCG++ IF +
Sbjct: 526 VLSAVSSLSFGE----LGKQIHGLALKNNIADEATTENALIACYGKCGEMDGCEKIFSRM 585
Query: 542 VFKTSSV-WNAIITANARYGFGEEALKLVGRMRTAGIEFDQFNFSTALSVAADLAMLEEG 601
+ +V WN++I+ +AL LV M G D F ++T LS A +A LE G
Sbjct: 586 AERRDNVTWNSMISGYIHNELLAKALDLVWFMLQTGQRLDSFMYATVLSAFASVATLERG 645
Query: 602 KQLHGSTIKLGFELDHFVINAAMDMYGKCGELDDALKILPQPTYRSRLSWNTMISIFARH 661
++H +++ E D V +A +DMY KCG LD AL+ R+ SWN+MIS +ARH
Sbjct: 646 MEVHACSVRACLESDVVVGSALVDMYSKCGRLDYALRFFNTMPVRNSYSWNSMISGYARH 705
Query: 662 GHFHKAKETFHEMLKLG-VKPDHVSFVCLLSACSHGGLVDEGRAYYASMTSEYGIQPGIE 721
G +A + F M G PDHV+FV +LSACSH GL++EG ++ SM+ YG+ P IE
Sbjct: 706 GQGEEALKLFETMKLDGQTPPDHVTFVGVLSACSHAGLLEEGFKHFESMSDSYGLAPRIE 765
Query: 722 HCVCMIDLLGRSGRLVEAEAFITDMSIPPNDLVWRSLLASC--RIYRNLDLGRKAAEHLL 781
H CM D+LGR+G L + E FI M + PN L+WR++L +C R +LG+KAAE L
Sbjct: 766 HFSCMADVLGRAGELDKLEDFIEKMPMKPNVLIWRTVLGACCRANGRKAELGKKAAEMLF 825
Query: 782 ELDPSDDSAYVLYSNVFATIGRWEDVEDVRGQMGAHKIQKKPAHSWVKWKGNIGIFGMGD 841
+L+P + YVL N++A GRWED+ R +M ++K+ +SWV K + +F GD
Sbjct: 826 QLEPENAVNYVLLGNMYAAGGRWEDLVKARKKMKDADVKKEAGYSWVTMKDGVHMFVAGD 885
Query: 842 QTHPQMEQINGKLLGLMKMVREAGYVPDTSYSLQDTDEEQKEHNMWNHSERIA 876
++HP + I KL L + +R+AGYVP T ++L D ++E KE + HSE++A
Sbjct: 886 KSHPDADVIYKKLKELNRKMRDAGYVPQTGFALYDLEQENKEEILSYHSEKLA 931
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
XP_038882887.1 | 0.0e+00 | 92.72 | pentatricopeptide repeat-containing protein At3g24000, mitochondrial isoform X7 ... | [more] |
XP_038882805.1 | 0.0e+00 | 92.72 | pentatricopeptide repeat-containing protein At3g24000, mitochondrial isoform X1 ... | [more] |
XP_038882845.1 | 0.0e+00 | 92.72 | pentatricopeptide repeat-containing protein At3g24000, mitochondrial isoform X3 ... | [more] |
XP_038882854.1 | 0.0e+00 | 92.72 | pentatricopeptide repeat-containing protein At3g24000, mitochondrial isoform X4 ... | [more] |
XP_038882837.1 | 0.0e+00 | 92.72 | pentatricopeptide repeat-containing protein At3g24000, mitochondrial isoform X2 ... | [more] |
Match Name | E-value | Identity | Description | |
Q9SMZ2 | 1.7e-142 | 32.89 | Pentatricopeptide repeat-containing protein At4g33170 OS=Arabidopsis thaliana OX... | [more] |
Q9SVP7 | 6.6e-139 | 31.81 | Pentatricopeptide repeat-containing protein At4g13650 OS=Arabidopsis thaliana OX... | [more] |
Q9M1V3 | 4.0e-136 | 33.57 | Pentatricopeptide repeat-containing protein At3g63370, chloroplastic OS=Arabidop... | [more] |
Q9FIB2 | 5.2e-136 | 33.26 | Putative pentatricopeptide repeat-containing protein At5g09950 OS=Arabidopsis th... | [more] |
Q9SS60 | 5.8e-127 | 30.72 | Pentatricopeptide repeat-containing protein At3g03580 OS=Arabidopsis thaliana OX... | [more] |
Match Name | E-value | Identity | Description | |
A0A0A0LAC1 | 0.0e+00 | 90.96 | DYW_deaminase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_3G2156... | [more] |
A0A1S4E120 | 0.0e+00 | 90.63 | pentatricopeptide repeat-containing protein At3g24000, mitochondrial isoform X1 ... | [more] |
A0A1S3C3P4 | 0.0e+00 | 90.63 | pentatricopeptide repeat-containing protein At3g24000, mitochondrial isoform X2 ... | [more] |
A0A1S3C2F0 | 0.0e+00 | 90.63 | pentatricopeptide repeat-containing protein At3g24000, mitochondrial isoform X4 ... | [more] |
A0A1S3C2I9 | 0.0e+00 | 90.63 | pentatricopeptide repeat-containing protein At3g24000, mitochondrial isoform X3 ... | [more] |
Match Name | E-value | Identity | Description | |
AT1G16480.1 | 0.0e+00 | 57.55 | Tetratricopeptide repeat (TPR)-like superfamily protein | [more] |
AT1G16480.2 | 6.1e-313 | 57.53 | Tetratricopeptide repeat (TPR)-like superfamily protein | [more] |
AT4G33170.1 | 1.2e-143 | 32.89 | Tetratricopeptide repeat (TPR)-like superfamily protein | [more] |
AT4G13650.1 | 4.7e-140 | 31.81 | Pentatricopeptide repeat (PPR) superfamily protein | [more] |
AT5G09950.1 | 3.7e-137 | 33.26 | Tetratricopeptide repeat (TPR)-like superfamily protein | [more] |