Homology
BLAST of HG10003471 vs. NCBI nr
Match:
XP_038890628.1 (pentatricopeptide repeat-containing protein At1g71490 [Benincasa hispida] >XP_038890630.1 pentatricopeptide repeat-containing protein At1g71490 [Benincasa hispida])
HSP 1 Score: 1275.8 bits (3300), Expect = 0.0e+00
Identity = 623/658 (94.68%), Postives = 641/658 (97.42%), Query Frame = 0
Query: 1 MIDSIFSSLKNFASHGQLSKTFEAFSLIQLRTSYNDSFDLILQSISILLVSCTNCSSLPS 60
MIDSIFSSLKNFAS+GQLSKTFEAFSLI+LR SYNDSFDLILQSISILLVSCTNCSSLP
Sbjct: 38 MIDSIFSSLKNFASNGQLSKTFEAFSLIRLRASYNDSFDLILQSISILLVSCTNCSSLPP 97
Query: 61 GKQLHGHIISSGLEEDSILVPKLVTFYSSFKLLPEAHTLVENSNLFHPCPWNLLITSYVR 120
GKQLHGHII+SGLEEDS LVPKLVTFYSSFKLLPEAHTLVE SNLFHPC WNLLI SYVR
Sbjct: 98 GKQLHGHIITSGLEEDSFLVPKLVTFYSSFKLLPEAHTLVETSNLFHPCAWNLLIISYVR 157
Query: 121 NELHDAAILAYKQMLSKGVRPDNFTFPSILKACGETQNLEFGLEVHKSINAWSTKWSLFV 180
NELH+AAILAYKQMLSKGVRPDNFTFPSILKACGET+NLEFGLEVHKSINAWSTKWSLFV
Sbjct: 158 NELHEAAILAYKQMLSKGVRPDNFTFPSILKACGETRNLEFGLEVHKSINAWSTKWSLFV 217
Query: 181 QNALISMYGRCGEVDTARNLFDNMLERDAVSWNSMISCYASKAMWKEAFELFDSMQSKCV 240
QNAL+SMYGRCGEVDTARNLFDNMLE DAVSWNSMISCYASK MWKEAFELFD MQSKCV
Sbjct: 218 QNALVSMYGRCGEVDTARNLFDNMLEWDAVSWNSMISCYASKGMWKEAFELFDGMQSKCV 277
Query: 241 EINVVTWNIIAGGCSRVGNFTRALKLLSQMRNFGIHLDSVAMIIGLGACSHIGAIRLGKE 300
INVVTWNIIAGGC RVGNFTRALKLLSQMRN GI+LD+VAM+IGLGACSHIGAIRLGKE
Sbjct: 278 GINVVTWNIIAGGCLRVGNFTRALKLLSQMRNLGIYLDNVAMVIGLGACSHIGAIRLGKE 337
Query: 301 IHGFTIRHYYHKLSTVQNALVTMYARCKDIMHAYKLFRLNDDKSIITWNSMLSGLTHLDR 360
IHGFTIRHYYHKLST+QNALVTMYARCKDIMHAY LFRLNDDKSIITWNSMLSGLTHLDR
Sbjct: 338 IHGFTIRHYYHKLSTIQNALVTMYARCKDIMHAYMLFRLNDDKSIITWNSMLSGLTHLDR 397
Query: 361 VKDALHLFRELLQFGVEPNYVTFASILPLCARVADLQHGREFHCYITKRQDFRDYLLLWN 420
V+DALHLFRELL FGVEPNYVTFASILPLCARVADLQHGREFHCYITKRQDFRDYLLLWN
Sbjct: 398 VEDALHLFRELLLFGVEPNYVTFASILPLCARVADLQHGREFHCYITKRQDFRDYLLLWN 457
Query: 421 ALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEGGKALRLFKEMKRFQIKP 480
ALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLI+GYGMQGEG KALRLF+EMKRF+IKP
Sbjct: 458 ALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLISGYGMQGEGAKALRLFEEMKRFEIKP 517
Query: 481 DHITMVAVLSACSHSGLVKQGELLFAEMQSLHGLRPHLEHYACMADLFGRVGLLNKAKEI 540
DHITMVAVLSACSHSGL++QGELLFAEMQS+HGLRPHLEHYACMADLFGRVGLLNKAKEI
Sbjct: 518 DHITMVAVLSACSHSGLLEQGELLFAEMQSVHGLRPHLEHYACMADLFGRVGLLNKAKEI 577
Query: 541 ITRMPYRPTSAMWATLIGACCIHGNTDIGEWAAEKLLEMRPEHSGYYVLIANMYAAAGSW 600
ITRMPYRPTSAMWATLIGACCIHGNTDIGEWAAEKLLEM PEHSGYYVLIANMYAAAGSW
Sbjct: 578 ITRMPYRPTSAMWATLIGACCIHGNTDIGEWAAEKLLEMWPEHSGYYVLIANMYAAAGSW 637
Query: 601 SKLAKIRTLMRDSGVAKVPGCSWVDVGSEFVSFLVGDTSNPQALESKLLLDSLNDVMK 659
SKLAKIRTLMRDSGVAKVPGCSWVDVGS FVSFLVGDTSNPQALESKL+LDSLNDVMK
Sbjct: 638 SKLAKIRTLMRDSGVAKVPGCSWVDVGSGFVSFLVGDTSNPQALESKLVLDSLNDVMK 695
BLAST of HG10003471 vs. NCBI nr
Match:
XP_022973516.1 (pentatricopeptide repeat-containing protein At1g71490 isoform X1 [Cucurbita maxima] >XP_022973518.1 pentatricopeptide repeat-containing protein At1g71490 isoform X1 [Cucurbita maxima] >XP_022973519.1 pentatricopeptide repeat-containing protein At1g71490 isoform X1 [Cucurbita maxima] >XP_022973520.1 pentatricopeptide repeat-containing protein At1g71490 isoform X1 [Cucurbita maxima])
HSP 1 Score: 1252.3 bits (3239), Expect = 0.0e+00
Identity = 612/676 (90.53%), Postives = 642/676 (94.97%), Query Frame = 0
Query: 1 MIDSIFSSLKNFASHGQLSKTFEAFSLIQLRTSYNDSFDLILQSISILLVSCTNCSSLPS 60
MI+SIF SLK+FASHGQLSK FEAFSL+QLR SYNDSFDLI+QSISILLVSCT CSSLPS
Sbjct: 38 MINSIFYSLKSFASHGQLSKAFEAFSLVQLRRSYNDSFDLIVQSISILLVSCTTCSSLPS 97
Query: 61 GKQLHGHIISSGLEEDSILVPKLVTFYSSFKLLPEAHTLVENSNLFHPCPWNLLITSYVR 120
GKQLHG IISSGLEEDSILVPKLVTFYSSFKLL EAHTLVENSNLFHPCPWNLLITSYVR
Sbjct: 98 GKQLHGRIISSGLEEDSILVPKLVTFYSSFKLLAEAHTLVENSNLFHPCPWNLLITSYVR 157
Query: 121 NELHDAAILAYKQMLSKGVRPDNFTFPSILKACGETQNLEFGLEVHKSINAWSTKWSLFV 180
NELH++AILAYKQMLSKGVRPDNFTFPSILKACGETQNL FGLEVHK IN+WS +WSLFV
Sbjct: 158 NELHESAILAYKQMLSKGVRPDNFTFPSILKACGETQNLGFGLEVHKCINSWSNQWSLFV 217
Query: 181 QNALISMYGRCGEVDTARNLFDNMLERDAVSWNSMISCYASKAMWKEAFELFDSMQSKCV 240
QNALISMYGRCGE+DTARNLFDNML+RDAVSWNSMISCYASK MWKEAFELFD MQSKC+
Sbjct: 218 QNALISMYGRCGELDTARNLFDNMLDRDAVSWNSMISCYASKGMWKEAFELFDIMQSKCL 277
Query: 241 EINVVTWNIIAGGCSRVGNFTRALKLLSQMRNFGIHLDSVAMIIGLGACSHIGAIRLGKE 300
EIN+VTWNIIAGGC R+G FTRALKLLSQMRNFGIHLD VAMIIGLGACSHIGAIRLGKE
Sbjct: 278 EINIVTWNIIAGGCLRLGKFTRALKLLSQMRNFGIHLDDVAMIIGLGACSHIGAIRLGKE 337
Query: 301 IHGFTIRHYYHKLSTVQNALVTMYARCKDIMHAYKLFRLNDDKSIITWNSMLSGLTHLDR 360
IHGFTIRH YHK STVQNAL+TMYARCKDIM AY LFRLNDDKSIITWNSMLSGL+H+DR
Sbjct: 338 IHGFTIRHCYHKSSTVQNALLTMYARCKDIMRAYILFRLNDDKSIITWNSMLSGLSHVDR 397
Query: 361 VKDALHLFRELLQFGVEPNYVTFASILPLCARVADLQHGREFHCYITKRQDFRDYLLLWN 420
V+DAL LFRE L FGVEPNYVTFASILPLCARVADLQHGREFHCYITKRQDF+DYLLLWN
Sbjct: 398 VEDALRLFREFLLFGVEPNYVTFASILPLCARVADLQHGREFHCYITKRQDFQDYLLLWN 457
Query: 421 ALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEGGKALRLFKEMKRFQIKP 480
ALVDMYARSGKV+EAKRVFDSLSKKDEVTYTSLIAGYGMQGEG KALRLF+EMK IKP
Sbjct: 458 ALVDMYARSGKVIEAKRVFDSLSKKDEVTYTSLIAGYGMQGEGRKALRLFEEMKSVDIKP 517
Query: 481 DHITMVAVLSACSHSGLVKQGELLFAEMQSLHGLRPHLEHYACMADLFGRVGLLNKAKEI 540
DHITMVAVLSACSHSGLVKQGE+LFAEMQS+HGL PHLEHYACMADLFGRVGLL++AKEI
Sbjct: 518 DHITMVAVLSACSHSGLVKQGEVLFAEMQSVHGLSPHLEHYACMADLFGRVGLLDRAKEI 577
Query: 541 ITRMPYRPTSAMWATLIGACCIHGNTDIGEWAAEKLLEMRPEHSGYYVLIANMYAAAGSW 600
ITRMPYRPTSAMWATLIGACCIH NTDIGEWAAEKLLEM+PEHSGYYVLIANMYAAAGSW
Sbjct: 578 ITRMPYRPTSAMWATLIGACCIHRNTDIGEWAAEKLLEMKPEHSGYYVLIANMYAAAGSW 637
Query: 601 SKLAKIRTLMRDSGVAKVPGCSWVDVGSEFVSFLVGDTSNPQALESKLLLDSLNDVMKHG 660
SKLAKIRTLMRD GVAK PGCSWV+VGSEFVSFLVGDTSNPQALESK LLD LNDVMKHG
Sbjct: 638 SKLAKIRTLMRDYGVAKAPGCSWVEVGSEFVSFLVGDTSNPQALESKHLLDDLNDVMKHG 697
Query: 661 SLMTTDSYDDIGDDIF 677
+L+ TD Y DIG+D+F
Sbjct: 698 TLVMTDDY-DIGNDVF 712
BLAST of HG10003471 vs. NCBI nr
Match:
KAG7025166.1 (Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 1249.2 bits (3231), Expect = 0.0e+00
Identity = 612/676 (90.53%), Postives = 640/676 (94.67%), Query Frame = 0
Query: 1 MIDSIFSSLKNFASHGQLSKTFEAFSLIQLRTSYNDSFDLILQSISILLVSCTNCSSLPS 60
MI SIF SLK+FASHGQLSK FEAFSL+QLR+SYNDSFDLI+QSISILLVSCT CSSLPS
Sbjct: 38 MITSIFDSLKSFASHGQLSKAFEAFSLVQLRSSYNDSFDLIVQSISILLVSCTTCSSLPS 97
Query: 61 GKQLHGHIISSGLEEDSILVPKLVTFYSSFKLLPEAHTLVENSNLFHPCPWNLLITSYVR 120
GKQLHG II SGLEEDSILVPKLVTFYSSFKLL EAHTLVENSNLFHPCPWNLLITSYVR
Sbjct: 98 GKQLHGRIILSGLEEDSILVPKLVTFYSSFKLLAEAHTLVENSNLFHPCPWNLLITSYVR 157
Query: 121 NELHDAAILAYKQMLSKGVRPDNFTFPSILKACGETQNLEFGLEVHKSINAWSTKWSLFV 180
NELH++AILAYKQMLSKGVRPDNFTFPSILKACGETQNL FGLEVHK IN+WS +WSLFV
Sbjct: 158 NELHESAILAYKQMLSKGVRPDNFTFPSILKACGETQNLGFGLEVHKCINSWSNQWSLFV 217
Query: 181 QNALISMYGRCGEVDTARNLFDNMLERDAVSWNSMISCYASKAMWKEAFELFDSMQSKCV 240
QNALISMYGRCGE+DTARNLFDNML+RDAVSWNSMISCYASK MWKEAFELFD MQSKC+
Sbjct: 218 QNALISMYGRCGELDTARNLFDNMLDRDAVSWNSMISCYASKGMWKEAFELFDIMQSKCL 277
Query: 241 EINVVTWNIIAGGCSRVGNFTRALKLLSQMRNFGIHLDSVAMIIGLGACSHIGAIRLGKE 300
EIN+VTWNIIAGGC R+G FTRALKLLSQMRNFGIHLD VAMIIGLGACSHIGAIRLGKE
Sbjct: 278 EINIVTWNIIAGGCLRLGKFTRALKLLSQMRNFGIHLDDVAMIIGLGACSHIGAIRLGKE 337
Query: 301 IHGFTIRHYYHKLSTVQNALVTMYARCKDIMHAYKLFRLNDDKSIITWNSMLSGLTHLDR 360
IHGFTIRH YHK STVQNAL+TMYARCKDI AY LFR+NDDKSIITWNSMLSGL+H+DR
Sbjct: 338 IHGFTIRHCYHKSSTVQNALLTMYARCKDITRAYILFRINDDKSIITWNSMLSGLSHVDR 397
Query: 361 VKDALHLFRELLQFGVEPNYVTFASILPLCARVADLQHGREFHCYITKRQDFRDYLLLWN 420
V+DAL LFRELL FGVEPNYVTFASILPLCARVADLQHGREFHCYITKRQDF+DYLLLWN
Sbjct: 398 VEDALRLFRELLLFGVEPNYVTFASILPLCARVADLQHGREFHCYITKRQDFQDYLLLWN 457
Query: 421 ALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEGGKALRLFKEMKRFQIKP 480
ALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEG KALRLF+EMK IKP
Sbjct: 458 ALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEGRKALRLFEEMKSVDIKP 517
Query: 481 DHITMVAVLSACSHSGLVKQGELLFAEMQSLHGLRPHLEHYACMADLFGRVGLLNKAKEI 540
DHITMVAVLSACSHSGLVKQGE+LFAEMQS+HGL P LEHYACMADLFGRVGLL++AKEI
Sbjct: 518 DHITMVAVLSACSHSGLVKQGEVLFAEMQSVHGLSPRLEHYACMADLFGRVGLLDRAKEI 577
Query: 541 ITRMPYRPTSAMWATLIGACCIHGNTDIGEWAAEKLLEMRPEHSGYYVLIANMYAAAGSW 600
ITRMPYRPTSAMWATLIGACCIH NTDIGEWAAEKLLEM+PEHSGYYVLIANMYAAAGSW
Sbjct: 578 ITRMPYRPTSAMWATLIGACCIHRNTDIGEWAAEKLLEMKPEHSGYYVLIANMYAAAGSW 637
Query: 601 SKLAKIRTLMRDSGVAKVPGCSWVDVGSEFVSFLVGDTSNPQALESKLLLDSLNDVMKHG 660
SKLAKIRTLMRD GVAK PGCSWV+VGSEFVSFLVGDTSNPQALESK LLD LNDVMKHG
Sbjct: 638 SKLAKIRTLMRDYGVAKAPGCSWVEVGSEFVSFLVGDTSNPQALESKHLLDGLNDVMKHG 697
Query: 661 SLMTTDSYDDIGDDIF 677
+L+ TD Y DIGDDIF
Sbjct: 698 TLVMTDDY-DIGDDIF 712
BLAST of HG10003471 vs. NCBI nr
Match:
XP_022925519.1 (pentatricopeptide repeat-containing protein At1g71490 isoform X1 [Cucurbita moschata] >XP_022925520.1 pentatricopeptide repeat-containing protein At1g71490 isoform X1 [Cucurbita moschata] >XP_022925521.1 pentatricopeptide repeat-containing protein At1g71490 isoform X1 [Cucurbita moschata] >XP_022925522.1 pentatricopeptide repeat-containing protein At1g71490 isoform X1 [Cucurbita moschata])
HSP 1 Score: 1246.5 bits (3224), Expect = 0.0e+00
Identity = 611/676 (90.38%), Postives = 639/676 (94.53%), Query Frame = 0
Query: 1 MIDSIFSSLKNFASHGQLSKTFEAFSLIQLRTSYNDSFDLILQSISILLVSCTNCSSLPS 60
MI SIF SLK+FASHGQLSK FEAFSL+QLR SYNDSFDLI+QSISILLVSCT CSSLPS
Sbjct: 38 MITSIFDSLKSFASHGQLSKAFEAFSLVQLRGSYNDSFDLIVQSISILLVSCTTCSSLPS 97
Query: 61 GKQLHGHIISSGLEEDSILVPKLVTFYSSFKLLPEAHTLVENSNLFHPCPWNLLITSYVR 120
GKQLHG IISSGLEEDSILVPKLVTFYSSFKLL EAHTLVENSNLFHPCPWNLLITSYVR
Sbjct: 98 GKQLHGRIISSGLEEDSILVPKLVTFYSSFKLLAEAHTLVENSNLFHPCPWNLLITSYVR 157
Query: 121 NELHDAAILAYKQMLSKGVRPDNFTFPSILKACGETQNLEFGLEVHKSINAWSTKWSLFV 180
NELH++AILAYKQMLSKGVRPDNFTFPSILKACGETQNL FGLEVHK IN+WS +WSLFV
Sbjct: 158 NELHESAILAYKQMLSKGVRPDNFTFPSILKACGETQNLGFGLEVHKCINSWSNQWSLFV 217
Query: 181 QNALISMYGRCGEVDTARNLFDNMLERDAVSWNSMISCYASKAMWKEAFELFDSMQSKCV 240
QNALISMYGRCGE+DTARNLFDNML+RDAVSWNSMISCYAS MWKEAFELFD MQSKC+
Sbjct: 218 QNALISMYGRCGELDTARNLFDNMLDRDAVSWNSMISCYASNGMWKEAFELFDIMQSKCL 277
Query: 241 EINVVTWNIIAGGCSRVGNFTRALKLLSQMRNFGIHLDSVAMIIGLGACSHIGAIRLGKE 300
EIN+VTWNIIAGGC R+G FT+ALKLLSQMRNFGIHLD VAMIIGLGACSHIGAIRLGKE
Sbjct: 278 EINIVTWNIIAGGCLRLGKFTQALKLLSQMRNFGIHLDDVAMIIGLGACSHIGAIRLGKE 337
Query: 301 IHGFTIRHYYHKLSTVQNALVTMYARCKDIMHAYKLFRLNDDKSIITWNSMLSGLTHLDR 360
IHGFTIRH YHK STVQNAL+TMYARCKDIM AY LFRLNDDKSIITWNSMLSGL+H+DR
Sbjct: 338 IHGFTIRHCYHKSSTVQNALLTMYARCKDIMRAYILFRLNDDKSIITWNSMLSGLSHVDR 397
Query: 361 VKDALHLFRELLQFGVEPNYVTFASILPLCARVADLQHGREFHCYITKRQDFRDYLLLWN 420
V+DAL LFRELL +GVEPNYVTFASILPLCARVADLQHGREFHCYITKRQDF+DYLLLWN
Sbjct: 398 VEDALRLFRELLLYGVEPNYVTFASILPLCARVADLQHGREFHCYITKRQDFQDYLLLWN 457
Query: 421 ALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEGGKALRLFKEMKRFQIKP 480
ALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEG KALRLF+EMK IKP
Sbjct: 458 ALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEGRKALRLFEEMKSVDIKP 517
Query: 481 DHITMVAVLSACSHSGLVKQGELLFAEMQSLHGLRPHLEHYACMADLFGRVGLLNKAKEI 540
DHITMVAVLSACSHSGLVKQGE+LFAEMQS+HGL P LEHYACMADLFGRVGLL++AKEI
Sbjct: 518 DHITMVAVLSACSHSGLVKQGEVLFAEMQSVHGLSPRLEHYACMADLFGRVGLLDRAKEI 577
Query: 541 ITRMPYRPTSAMWATLIGACCIHGNTDIGEWAAEKLLEMRPEHSGYYVLIANMYAAAGSW 600
ITRMPYRPTSAMWATLIGACCIH NTDIGEWAAEKLLEM+PEHSGYYVLIANMYAAAGSW
Sbjct: 578 ITRMPYRPTSAMWATLIGACCIHRNTDIGEWAAEKLLEMKPEHSGYYVLIANMYAAAGSW 637
Query: 601 SKLAKIRTLMRDSGVAKVPGCSWVDVGSEFVSFLVGDTSNPQALESKLLLDSLNDVMKHG 660
SKLAKIRTLMRD GVAK PGCSWV+VGSEFVSFLVGDTSNPQALESK LLD LNDVMKHG
Sbjct: 638 SKLAKIRTLMRDYGVAKAPGCSWVEVGSEFVSFLVGDTSNPQALESKHLLDGLNDVMKHG 697
Query: 661 SLMTTDSYDDIGDDIF 677
+L+ D Y DIGDDIF
Sbjct: 698 TLVMIDDY-DIGDDIF 712
BLAST of HG10003471 vs. NCBI nr
Match:
XP_023535485.1 (pentatricopeptide repeat-containing protein At1g71490 isoform X1 [Cucurbita pepo subsp. pepo] >XP_023535486.1 pentatricopeptide repeat-containing protein At1g71490 isoform X1 [Cucurbita pepo subsp. pepo] >XP_023535487.1 pentatricopeptide repeat-containing protein At1g71490 isoform X1 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 1245.3 bits (3221), Expect = 0.0e+00
Identity = 612/676 (90.53%), Postives = 640/676 (94.67%), Query Frame = 0
Query: 1 MIDSIFSSLKNFASHGQLSKTFEAFSLIQLRTSYNDSFDLILQSISILLVSCTNCSSLPS 60
MI+SIF SLK+FASHGQLSK FEAFSL+QLR SYNDSFDLI+QSISILLVSCT CSSLPS
Sbjct: 38 MINSIFDSLKSFASHGQLSKAFEAFSLVQLRRSYNDSFDLIVQSISILLVSCTTCSSLPS 97
Query: 61 GKQLHGHIISSGLEEDSILVPKLVTFYSSFKLLPEAHTLVENSNLFHPCPWNLLITSYVR 120
GKQLHG IISSGLEEDSILVPKLVTFYSSFKLL EAHTLVENSNLFHPCPWNLLITSYVR
Sbjct: 98 GKQLHGCIISSGLEEDSILVPKLVTFYSSFKLLAEAHTLVENSNLFHPCPWNLLITSYVR 157
Query: 121 NELHDAAILAYKQMLSKGVRPDNFTFPSILKACGETQNLEFGLEVHKSINAWSTKWSLFV 180
NELH++AILAYKQMLSKGVRPDNFTFPSILKACGETQNL FGLEVHK IN+WS +WSLFV
Sbjct: 158 NELHESAILAYKQMLSKGVRPDNFTFPSILKACGETQNLGFGLEVHKCINSWSNQWSLFV 217
Query: 181 QNALISMYGRCGEVDTARNLFDNMLERDAVSWNSMISCYASKAMWKEAFELFDSMQSKCV 240
QNALISMYGRCGE+DTARNLFDNML+RDAVSWNSMISCYASK MWKEAFELFD MQSKC+
Sbjct: 218 QNALISMYGRCGELDTARNLFDNMLDRDAVSWNSMISCYASKGMWKEAFELFDIMQSKCL 277
Query: 241 EINVVTWNIIAGGCSRVGNFTRALKLLSQMRNFGIHLDSVAMIIGLGACSHIGAIRLGKE 300
EIN+VTWNIIAGGC R+G FT+ALKLLSQMRNFGIHLD VAMIIGLGACSHIGAIRLGKE
Sbjct: 278 EINIVTWNIIAGGCLRLGKFTQALKLLSQMRNFGIHLDDVAMIIGLGACSHIGAIRLGKE 337
Query: 301 IHGFTIRHYYHKLSTVQNALVTMYARCKDIMHAYKLFRLNDDKSIITWNSMLSGLTHLDR 360
IHGFTIRH YHK STVQNAL+TMYARCKDI AY LFRLNDDKSIITWNSMLSGL+H+DR
Sbjct: 338 IHGFTIRHCYHKSSTVQNALLTMYARCKDITRAYILFRLNDDKSIITWNSMLSGLSHVDR 397
Query: 361 VKDALHLFRELLQFGVEPNYVTFASILPLCARVADLQHGREFHCYITKRQDFRDYLLLWN 420
V+DAL LFRELL FGVEPNYVT ASILPLCARVADLQHGREFHCYITKRQDFRDYLLLWN
Sbjct: 398 VEDALRLFRELLLFGVEPNYVTCASILPLCARVADLQHGREFHCYITKRQDFRDYLLLWN 457
Query: 421 ALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEGGKALRLFKEMKRFQIKP 480
ALVDMYARSGKV+EAKRVFDSLSKKDEVTYTSLIAGYGMQGEG KALRLF+EMK IKP
Sbjct: 458 ALVDMYARSGKVIEAKRVFDSLSKKDEVTYTSLIAGYGMQGEGRKALRLFEEMKSVDIKP 517
Query: 481 DHITMVAVLSACSHSGLVKQGELLFAEMQSLHGLRPHLEHYACMADLFGRVGLLNKAKEI 540
DHITMVAVLSACSHSGLVKQGE+LFAEMQS+HGL P LEHYACMADLFGRVGLL++AKEI
Sbjct: 518 DHITMVAVLSACSHSGLVKQGEVLFAEMQSVHGLSPRLEHYACMADLFGRVGLLDRAKEI 577
Query: 541 ITRMPYRPTSAMWATLIGACCIHGNTDIGEWAAEKLLEMRPEHSGYYVLIANMYAAAGSW 600
ITRMP RPTSAMWATLIGACCIH NTDIGEWAAEKLLEM+PEHSGYYVLIANMYAAAGSW
Sbjct: 578 ITRMPCRPTSAMWATLIGACCIHRNTDIGEWAAEKLLEMKPEHSGYYVLIANMYAAAGSW 637
Query: 601 SKLAKIRTLMRDSGVAKVPGCSWVDVGSEFVSFLVGDTSNPQALESKLLLDSLNDVMKHG 660
SKLAKIRTLMRD GVAK PGCSWV+VGSEFVSFLVGDTSNPQALESK LLDSLNDVMKHG
Sbjct: 638 SKLAKIRTLMRDYGVAKAPGCSWVEVGSEFVSFLVGDTSNPQALESKHLLDSLNDVMKHG 697
Query: 661 SLMTTDSYDDIGDDIF 677
+L+ TD Y DIGDDIF
Sbjct: 698 TLVMTDDY-DIGDDIF 712
BLAST of HG10003471 vs. ExPASy Swiss-Prot
Match:
Q9C9I6 (Pentatricopeptide repeat-containing protein At1g71490 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E67 PE=2 SV=1)
HSP 1 Score: 828.9 bits (2140), Expect = 4.0e-239
Identity = 397/656 (60.52%), Postives = 504/656 (76.83%), Query Frame = 0
Query: 3 DSIFSSLKNFASHGQLSKTFEAFSLIQLRTSYNDSFDLILQSISILLVSCTNCSSLPSGK 62
+S+F SL + ASHG L F+ FSL++L++S S DL+L S + LL +C + + +G
Sbjct: 4 ESLFKSLGHLASHGHLHDAFKTFSLLRLQSSSAVSDDLVLHSAASLLSACVDVRAFLAGV 63
Query: 63 QLHGHIISSGLEEDSILVPKLVTFYSSFKLLPEAHTLVENSNLFHPCPWNLLITSYVRNE 122
Q+H H ISSG+E S+LVPKLVTFYS+F L EA +++ENS++ HP PWN+LI SY +NE
Sbjct: 64 QVHAHCISSGVEYHSVLVPKLVTFYSAFNLHNEAQSIIENSDILHPLPWNVLIASYAKNE 123
Query: 123 LHDAAILAYKQMLSKGVRPDNFTFPSILKACGETQNLEFGLEVHKSINAWSTKWSLFVQN 182
L + I AYK+M+SKG+RPD FT+PS+LKACGET ++ FG VH SI S K SL+V N
Sbjct: 124 LFEEVIAAYKRMVSKGIRPDAFTYPSVLKACGETLDVAFGRVVHGSIEVSSYKSSLYVCN 183
Query: 183 ALISMYGRCGEVDTARNLFDNMLERDAVSWNSMISCYASKAMWKEAFELFDSMQSKCVEI 242
ALISMY R + AR LFD M ERDAVSWN++I+CYAS+ MW EAFELFD M VE+
Sbjct: 184 ALISMYKRFRNMGIARRLFDRMFERDAVSWNAVINCYASEGMWSEAFELFDKMWFSGVEV 243
Query: 243 NVVTWNIIAGGCSRVGNFTRALKLLSQMRNFGIHLDSVAMIIGLGACSHIGAIRLGKEIH 302
+V+TWNII+GGC + GN+ AL L+S+MRNF LD VAMIIGL ACS IGAIRLGKEIH
Sbjct: 244 SVITWNIISGGCLQTGNYVGALGLISRMRNFPTSLDPVAMIIGLKACSLIGAIRLGKEIH 303
Query: 303 GFTIRHYYHKLSTVQNALVTMYARCKDIMHAYKLFRLNDDKSIITWNSMLSGLTHLDRVK 362
G I Y + V+N L+TMY++CKD+ HA +FR ++ S+ TWNS++SG L++ +
Sbjct: 304 GLAIHSSYDGIDNVRNTLITMYSKCKDLRHALIVFRQTEENSLCTWNSIISGYAQLNKSE 363
Query: 363 DALHLFRELLQFGVEPNYVTFASILPLCARVADLQHGREFHCYITKRQDFRDYLLLWNAL 422
+A HL RE+L G +PN +T ASILPLCAR+A+LQHG+EFHCYI +R+ F+DY +LWN+L
Sbjct: 364 EASHLLREMLVAGFQPNSITLASILPLCARIANLQHGKEFHCYILRRKCFKDYTMLWNSL 423
Query: 423 VDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEGGKALRLFKEMKRFQIKPDH 482
VD+YA+SGK++ AK+V D +SK+DEVTYTSLI GYG QGEGG AL LFKEM R IKPDH
Sbjct: 424 VDVYAKSGKIVAAKQVSDLMSKRDEVTYTSLIDGYGNQGEGGVALALFKEMTRSGIKPDH 483
Query: 483 ITMVAVLSACSHSGLVKQGELLFAEMQSLHGLRPHLEHYACMADLFGRVGLLNKAKEIIT 542
+T+VAVLSACSHS LV +GE LF +MQ +G+RP L+H++CM DL+GR G L KAK+II
Sbjct: 484 VTVVAVLSACSHSKLVHEGERLFMKMQCEYGIRPCLQHFSCMVDLYGRAGFLAKAKDIIH 543
Query: 543 RMPYRPTSAMWATLIGACCIHGNTDIGEWAAEKLLEMRPEHSGYYVLIANMYAAAGSWSK 602
MPY+P+ A WATL+ AC IHGNT IG+WAAEKLLEM+PE+ GYYVLIANMYAAAGSWSK
Sbjct: 544 NMPYKPSGATWATLLNACHIHGNTQIGKWAAEKLLEMKPENPGYYVLIANMYAAAGSWSK 603
Query: 603 LAKIRTLMRDSGVAKVPGCSWVDVGSEFVSFLVGDTSNPQALESKLLLDSLNDVMK 659
LA++RT+MRD GV K PGC+W+D S F F VGDTS+P+A + LLD LN +MK
Sbjct: 604 LAEVRTIMRDLGVKKDPGCAWIDTDSGFSLFSVGDTSSPEACNTYPLLDGLNQLMK 659
BLAST of HG10003471 vs. ExPASy Swiss-Prot
Match:
Q4V389 (Pentatricopeptide repeat-containing protein At1g22830 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E24 PE=2 SV=1)
HSP 1 Score: 688.7 bits (1776), Expect = 6.5e-197
Identity = 338/654 (51.68%), Postives = 459/654 (70.18%), Query Frame = 0
Query: 5 IFSSLKNFASHGQLSKTFEAFSLIQLRTSYNDSFDLILQSISILLVSCTNCSSLPSGKQL 64
+F+S ++ SHGQL + F FSL++ ++ S + +L S + LL +C + G+QL
Sbjct: 49 LFNSFRHCISHGQLYEAFRTFSLLRYQSG---SHEFVLYSSASLLSTCVGFNEFVPGQQL 108
Query: 65 HGHIISSGLEEDSILVPKLVTFYSSFKLLPEAHTLVENSNLFHPCPWNLLITSYVRNELH 124
H H ISSGLE DS+LVPKLVTFYS+F LL EA T+ ENS + HP PWN+LI SY+RN+
Sbjct: 109 HAHCISSGLEFDSVLVPKLVTFYSAFNLLDEAQTITENSEILHPLPWNVLIGSYIRNKRF 168
Query: 125 DAAILAYKQMLSKGVRPDNFTFPSILKACGETQNLEFGLEVHKSINAWSTKWSLFVQNAL 184
++ YK+M+SKG+R D FT+PS++KAC + +G VH SI S + +L+V NAL
Sbjct: 169 QESVSVYKRMMSKGIRADEFTYPSVIKACAALLDFAYGRVVHGSIEVSSHRCNLYVCNAL 228
Query: 185 ISMYGRCGEVDTARNLFDNMLERDAVSWNSMISCYASKAMWKEAFELFDSMQSKCVEINV 244
ISMY R G+VD AR LFD M ERDAVSWN++I+CY S+ EAF+L D M VE ++
Sbjct: 229 ISMYKRFGKVDVARRLFDRMSERDAVSWNAIINCYTSEEKLGEAFKLLDRMYLSGVEASI 288
Query: 245 VTWNIIAGGCSRVGNFTRALKLLSQMRNFGIHLDSVAMIIGLGACSHIGAIRLGKEIHGF 304
VTWN IAGGC GN+ AL + MRN + + SVAMI GL ACSHIGA++ GK H
Sbjct: 289 VTWNTIAGGCLEAGNYIGALNCVVGMRNCNVRIGSVAMINGLKACSHIGALKWGKVFHCL 348
Query: 305 TIR--HYYHKLSTVQNALVTMYARCKDIMHAYKLFRLNDDKSIITWNSMLSGLTHLDRVK 364
IR + H + V+N+L+TMY+RC D+ HA+ +F+ + S+ TWNS++SG + +R +
Sbjct: 349 VIRSCSFSHDIDNVRNSLITMYSRCSDLRHAFIVFQQVEANSLSTWNSIISGFAYNERSE 408
Query: 365 DALHLFRELLQFGVEPNYVTFASILPLCARVADLQHGREFHCYITKRQDFRDYLLLWNAL 424
+ L +E+L G PN++T ASILPL ARV +LQHG+EFHCYI +RQ ++D L+LWN+L
Sbjct: 409 ETSFLLKEMLLSGFHPNHITLASILPLFARVGNLQHGKEFHCYILRRQSYKDCLILWNSL 468
Query: 425 VDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEGGKALRLFKEMKRFQIKPDH 484
VDMYA+SG+++ AKRVFDS+ K+D+VTYTSLI GYG G+G AL FK+M R IKPDH
Sbjct: 469 VDMYAKSGEIIAAKRVFDSMRKRDKVTYTSLIDGYGRLGKGEVALAWFKDMDRSGIKPDH 528
Query: 485 ITMVAVLSACSHSGLVKQGELLFAEMQSLHGLRPHLEHYACMADLFGRVGLLNKAKEIIT 544
+TMVAVLSACSHS LV++G LF +M+ + G+R LEHY+CM DL+ R G L+KA++I
Sbjct: 529 VTMVAVLSACSHSNLVREGHWLFTKMEHVFGIRLRLEHYSCMVDLYCRAGYLDKARDIFH 588
Query: 545 RMPYRPTSAMWATLIGACCIHGNTDIGEWAAEK-LLEMRPEHSGYYVLIANMYAAAGSWS 604
+PY P+SAM ATL+ AC IHGNT+IGEWAA+K LLE +PEH G+Y+L+A+MYA GSWS
Sbjct: 589 TIPYEPSSAMCATLLKACLIHGNTNIGEWAADKLLLETKPEHLGHYMLLADMYAVTGSWS 648
Query: 605 KLAKIRTLMRDSGVAKVPGCSWVDVGSEFVSFLVGDTSNPQALESKLLLDSLND 656
KL ++TL+ D GV K + ++ SE L G+ + P +S + + +D
Sbjct: 649 KLVTVKTLLSDLGVQKAHEFALMETDSE----LDGENNKPMNDDSVINQEQSSD 695
BLAST of HG10003471 vs. ExPASy Swiss-Prot
Match:
Q9LFL5 (Pentatricopeptide repeat-containing protein At5g16860 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H92 PE=2 SV=1)
HSP 1 Score: 422.9 bits (1086), Expect = 6.7e-117
Identity = 229/682 (33.58%), Postives = 370/682 (54.25%), Query Frame = 0
Query: 7 SSLKNFASHGQLSKTFEAFSLIQLRTSYNDSFDLILQSISILLVSCTNCSSLPSGKQLHG 66
S ++++ +G +K F L+ + D++ + + +C SS+ G+ H
Sbjct: 97 SLIRSYGDNGCANKCLYLFGLMHSLSWTPDNY-----TFPFVFKACGEISSVRCGESAHA 156
Query: 67 HIISSGLEEDSILVPKLVTFYSSFKLLPEAHTLVENSNLFHPCPWNLLITSYVRNELHDA 126
+ +G + + LV YS + L +A + + +++ WN +I SY +
Sbjct: 157 LSLVTGFISNVFVGNALVAMYSRCRSLSDARKVFDEMSVWDVVSWNSIIESYAKLGKPKV 216
Query: 127 AILAYKQMLSK-GVRPDNFTFPSILKACGETQNLEFGLEVHKSINAWSTKWSLFVQNALI 186
A+ + +M ++ G RPDN T ++L C G ++H ++FV N L+
Sbjct: 217 ALEMFSRMTNEFGCRPDNITLVNVLPPCASLGTHSLGKQLHCFAVTSEMIQNMFVGNCLV 276
Query: 187 SMYGRCGEVDTARNLFDNMLERDAVSWNSMISCYASKAMWKEAFELFDSMQSKCVEINVV 246
MY +CG +D A +F NM +D VSWN+M++ Y+ +++A LF+ MQ + ++++VV
Sbjct: 277 DMYAKCGMMDEANTVFSNMSVKDVVSWNAMVAGYSQIGRFEDAVRLFEKMQEEKIKMDVV 336
Query: 247 TWNIIAGGCSRVGNFTRALKLLSQMRNFGIHLDSVAMIIGLGACSHIGAIRLGKEIHGFT 306
TW+ G ++ G AL + QM + GI + V +I L C+ +GA+ GKEIH +
Sbjct: 337 TWSAAISGYAQRGLGYEALGVCRQMLSSGIKPNEVTLISVLSGCASVGALMHGKEIHCYA 396
Query: 307 IRH-------YYHKLSTVQNALVTMYARCKDIMHAYKLF--RLNDDKSIITWNSMLSGLT 366
I++ + + V N L+ MYA+CK + A +F ++ ++TW M+ G +
Sbjct: 397 IKYPIDLRKNGHGDENMVINQLIDMYAKCKKVDTARAMFDSLSPKERDVVTWTVMIGGYS 456
Query: 367 HLDRVKDALHLFRELLQFGVE--PNYVTFASILPLCARVADLQHGREFHCYITKRQDFRD 426
AL L E+ + + PN T + L CA +A L+ G++ H Y + Q
Sbjct: 457 QHGDANKALELLSEMFEEDCQTRPNAFTISCALVACASLAALRIGKQIHAYALRNQQNAV 516
Query: 427 YLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEGGKALRLFKEMK 486
L + N L+DMYA+ G + +A+ VFD++ K+EVT+TSL+ GYGM G G +AL +F EM+
Sbjct: 517 PLFVSNCLIDMYAKCGSISDARLVFDNMMAKNEVTWTSLMTGYGMHGYGEEALGIFDEMR 576
Query: 487 RFQIKPDHITMVAVLSACSHSGLVKQGELLFAEMQSLHGLRPHLEHYACMADLFGRVGLL 546
R K D +T++ VL ACSHSG++ QG F M+++ G+ P EHYAC+ DL GR G L
Sbjct: 577 RIGFKLDGVTLLVVLYACSHSGMIDQGMEYFNRMKTVFGVSPGPEHYACLVDLLGRAGRL 636
Query: 547 NKAKEIITRMPYRPTSAMWATLIGACCIHGNTDIGEWAAEKLLEMRPEHSGYYVLIANMY 606
N A +I MP P +W + C IHG ++GE+AAEK+ E+ H G Y L++N+Y
Sbjct: 637 NAALRLIEEMPMEPPPVVWVAFLSCCRIHGKVELGEYAAEKITELASNHDGSYTLLSNLY 696
Query: 607 AAAGSWSKLAKIRTLMRDSGVAKVPGCSWVDVGSEFVSFLVGDTSNPQALE-SKLLLDSL 666
A AG W + +IR+LMR GV K PGCSWV+ +F VGD ++P A E ++LLD +
Sbjct: 697 ANAGRWKDVTRIRSLMRHKGVKKRPGCSWVEGIKGTTTFFVGDKTHPHAKEIYQVLLDHM 756
Query: 667 NDVMKHGSLMTTD-SYDDIGDD 675
+ G + T + D+ D+
Sbjct: 757 QRIKDIGYVPETGFALHDVDDE 773
BLAST of HG10003471 vs. ExPASy Swiss-Prot
Match:
Q9LN01 (Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H12 PE=2 SV=1)
HSP 1 Score: 418.7 bits (1075), Expect = 1.3e-115
Identity = 219/642 (34.11%), Postives = 364/642 (56.70%), Query Frame = 0
Query: 30 LRTSYNDSFDLILQSISILLVSCTNCSSLPSGKQLHGHIISSGLEEDSILVPKLVTF--- 89
L +S + +D I S+ L+ NC +L S + +H +I GL + + KL+ F
Sbjct: 20 LPSSSDPPYDSIRNHPSLSLLH--NCKTLQSLRIIHAQMIKIGLHNTNYALSKLIEFCIL 79
Query: 90 YSSFKLLPEAHTLVENSNLFHPCPWNLLITSYVRNELHDAAILAYKQMLSKGVRPDNFTF 149
F+ LP A ++ + + WN + + + +A+ Y M+S G+ P+++TF
Sbjct: 80 SPHFEGLPYAISVFKTIQEPNLLIWNTMFRGHALSSDPVSALKLYVCMISLGLLPNSYTF 139
Query: 150 PSILKACGETQNLEFGLEVHKSINAWSTKWSLFVQNALISMYGRCGEVDTARNLFDNMLE 209
P +LK+C +++ + G ++H + L+V +LISMY + G ++ A +FD
Sbjct: 140 PFVLKSCAKSKAFKEGQQIHGHVLKLGCDLDLYVHTSLISMYVQNGRLEDAHKVFDKSPH 199
Query: 210 RDAVSWNSMISCYASKAMWKEAFELFDSMQSKCVEINVVTWNIIAGGCSRVGNFTRALKL 269
RD VS+ ++I YAS+ + A +LFD + K +VV+WN + G + GN+ AL+L
Sbjct: 200 RDVVSYTALIKGYASRGYIENAQKLFDEIPVK----DVVSWNAMISGYAETGNYKEALEL 259
Query: 270 LSQMRNFGIHLDSVAMIIGLGACSHIGAIRLGKEIHGFTIRHYYHKLSTVQNALVTMYAR 329
M + D M+ + AC+ G+I LG+++H + H + + NAL+ +Y++
Sbjct: 260 FKDMMKTNVRPDESTMVTVVSACAQSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSK 319
Query: 330 CKDIMHAYKLFRLNDDKSIITWNSMLSGLTHLDRVKDALHLFRELLQFGVEPNYVTFASI 389
C ++ A LF K +I+WN+++ G TH++ K+AL LF+E+L+ G PN VT SI
Sbjct: 320 CGELETACGLFERLPYKDVISWNTLIGGYTHMNLYKEALLLFQEMLRSGETPNDVTMLSI 379
Query: 390 LPLCARVADLQHGREFHCYITKR-QDFRDYLLLWNALVDMYARSGKVLEAKRVFDSLSKK 449
LP CA + + GR H YI KR + + L +L+DMYA+ G + A +VF+S+ K
Sbjct: 380 LPACAHLGAIDIGRWIHVYIDKRLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSILHK 439
Query: 450 DEVTYTSLIAGYGMQGEGGKALRLFKEMKRFQIKPDHITMVAVLSACSHSGLVKQGELLF 509
++ ++I G+ M G + LF M++ I+PD IT V +LSACSHSG++ G +F
Sbjct: 440 SLSSWNAMIFGFAMHGRADASFDLFSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRHIF 499
Query: 510 AEMQSLHGLRPHLEHYACMADLFGRVGLLNKAKEIITRMPYRPTSAMWATLIGACCIHGN 569
M + + P LEHY CM DL G GL +A+E+I M P +W +L+ AC +HGN
Sbjct: 500 RTMTQDYKMTPKLEHYGCMIDLLGHSGLFKEAEEMINMMEMEPDGVIWCSLLKACKMHGN 559
Query: 570 TDIGEWAAEKLLEMRPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAKVPGCSWVD 629
++GE AE L+++ PE+ G YVL++N+YA+AG W+++AK R L+ D G+ KVPGCS ++
Sbjct: 560 VELGESFAENLIKIEPENPGSYVLLSNIYASAGRWNEVAKTRALLNDKGMKKVPGCSSIE 619
Query: 630 VGSEFVSFLVGDTSNPQALESKLLLDSLNDVMKHGSLMTTDS 668
+ S F++GD +P+ E +L+ + +++ + S
Sbjct: 620 IDSVVHEFIIGDKFHPRNREIYGMLEEMEVLLEKAGFVPDTS 655
BLAST of HG10003471 vs. ExPASy Swiss-Prot
Match:
Q9LNU6 (Pentatricopeptide repeat-containing protein At1g20230 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H21 PE=2 SV=2)
HSP 1 Score: 377.5 bits (968), Expect = 3.2e-103
Identity = 195/638 (30.56%), Postives = 335/638 (52.51%), Query Frame = 0
Query: 56 SSLPSGKQLHGHIISSGLEEDSILVPKLVTFYSSFKLLPEAHTLVENSNLFHPCPWNLLI 115
SSL Q H I+ SG + D + KL+ YS++ +A ++++ ++ LI
Sbjct: 29 SSLSKTTQAHARILKSGAQNDGYISAKLIASYSNYNCFNDADLVLQSIPDPTIYSFSSLI 88
Query: 116 TSYVRNELHDAAILAYKQMLSKGVRPDNFTFPSILKACGETQNLEFGLEVHKSINAWSTK 175
+ + +L +I + +M S G+ PD+ P++ K C E + G ++H
Sbjct: 89 YALTKAKLFTQSIGVFSRMFSHGLIPDSHVLPNLFKVCAELSAFKVGKQIHCVSCVSGLD 148
Query: 176 WSLFVQNALISMYGRCGEVDTARNLFDNMLERDAVSWNSMISCYASKAMWKEAFELFDSM 235
FVQ ++ MY RCG + AR +FD M ++D V+ ++++ YA K +E + M
Sbjct: 149 MDAFVQGSMFHMYMRCGRMGDARKVFDRMSDKDVVTCSALLCAYARKGCLEEVVRILSEM 208
Query: 236 QSKCVEINVVTWNIIAGGCSRVGNFTRALKLLSQMRNFGIHLDSVAMIIGLGACSHIGAI 295
+S +E N+V+WN I G +R G A+ + ++ + G D V + L + +
Sbjct: 209 ESSGIEANIVSWNGILSGFNRSGYHKEAVVMFQKIHHLGFCPDQVTVSSVLPSVGDSEML 268
Query: 296 RLGKEIHGFTIRHYYHKLSTVQNALVTMYARCKDI------------------------- 355
+G+ IHG+ I+ K V +A++ MY + +
Sbjct: 269 NMGRLIHGYVIKQGLLKDKCVISAMIDMYGKSGHVYGIISLFNQFEMMEAGVCNAYITGL 328
Query: 356 ---------MHAYKLFRLND-DKSIITWNSMLSGLTHLDRVKDALHLFRELLQFGVEPNY 415
+ ++LF+ + ++++W S+++G + +AL LFRE+ GV+PN+
Sbjct: 329 SRNGLVDKALEMFELFKEQTMELNVVSWTSIIAGCAQNGKDIEALELFREMQVAGVKPNH 388
Query: 416 VTFASILPLCARVADLQHGREFHCYITKRQDFRDYLLLWNALVDMYARSGKVLEAKRVFD 475
VT S+LP C +A L HGR H + R D + + +AL+DMYA+ G++ ++ VF+
Sbjct: 389 VTIPSMLPACGNIAALGHGRSTHGFAV-RVHLLDNVHVGSALIDMYAKCGRINLSQIVFN 448
Query: 476 SLSKKDEVTYTSLIAGYGMQGEGGKALRLFKEMKRFQIKPDHITMVAVLSACSHSGLVKQ 535
+ K+ V + SL+ G+ M G+ + + +F+ + R ++KPD I+ ++LSAC GL +
Sbjct: 449 MMPTKNLVCWNSLMNGFSMHGKAKEVMSIFESLMRTRLKPDFISFTSLLSACGQVGLTDE 508
Query: 536 GELLFAEMQSLHGLRPHLEHYACMADLFGRVGLLNKAKEIITRMPYRPTSAMWATLIGAC 595
G F M +G++P LEHY+CM +L GR G L +A ++I MP+ P S +W L+ +C
Sbjct: 509 GWKYFKMMSEEYGIKPRLEHYSCMVNLLGRAGKLQEAYDLIKEMPFEPDSCVWGALLNSC 568
Query: 596 CIHGNTDIGEWAAEKLLEMRPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAKVPG 655
+ N D+ E AAEKL + PE+ G YVL++N+YAA G W+++ IR M G+ K PG
Sbjct: 569 RLQNNVDLAEIAAEKLFHLEPENPGTYVLLSNIYAAKGMWTEVDSIRNKMESLGLKKNPG 628
Query: 656 CSWVDVGSEFVSFLVGDTSNPQALESKLLLDSLNDVMK 659
CSW+ V + + L GD S+PQ + +D ++ M+
Sbjct: 629 CSWIQVKNRVYTLLAGDKSHPQIDQITEKMDEISKEMR 665
BLAST of HG10003471 vs. ExPASy TrEMBL
Match:
A0A6J1I8V4 (pentatricopeptide repeat-containing protein At1g71490 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111472055 PE=4 SV=1)
HSP 1 Score: 1252.3 bits (3239), Expect = 0.0e+00
Identity = 612/676 (90.53%), Postives = 642/676 (94.97%), Query Frame = 0
Query: 1 MIDSIFSSLKNFASHGQLSKTFEAFSLIQLRTSYNDSFDLILQSISILLVSCTNCSSLPS 60
MI+SIF SLK+FASHGQLSK FEAFSL+QLR SYNDSFDLI+QSISILLVSCT CSSLPS
Sbjct: 38 MINSIFYSLKSFASHGQLSKAFEAFSLVQLRRSYNDSFDLIVQSISILLVSCTTCSSLPS 97
Query: 61 GKQLHGHIISSGLEEDSILVPKLVTFYSSFKLLPEAHTLVENSNLFHPCPWNLLITSYVR 120
GKQLHG IISSGLEEDSILVPKLVTFYSSFKLL EAHTLVENSNLFHPCPWNLLITSYVR
Sbjct: 98 GKQLHGRIISSGLEEDSILVPKLVTFYSSFKLLAEAHTLVENSNLFHPCPWNLLITSYVR 157
Query: 121 NELHDAAILAYKQMLSKGVRPDNFTFPSILKACGETQNLEFGLEVHKSINAWSTKWSLFV 180
NELH++AILAYKQMLSKGVRPDNFTFPSILKACGETQNL FGLEVHK IN+WS +WSLFV
Sbjct: 158 NELHESAILAYKQMLSKGVRPDNFTFPSILKACGETQNLGFGLEVHKCINSWSNQWSLFV 217
Query: 181 QNALISMYGRCGEVDTARNLFDNMLERDAVSWNSMISCYASKAMWKEAFELFDSMQSKCV 240
QNALISMYGRCGE+DTARNLFDNML+RDAVSWNSMISCYASK MWKEAFELFD MQSKC+
Sbjct: 218 QNALISMYGRCGELDTARNLFDNMLDRDAVSWNSMISCYASKGMWKEAFELFDIMQSKCL 277
Query: 241 EINVVTWNIIAGGCSRVGNFTRALKLLSQMRNFGIHLDSVAMIIGLGACSHIGAIRLGKE 300
EIN+VTWNIIAGGC R+G FTRALKLLSQMRNFGIHLD VAMIIGLGACSHIGAIRLGKE
Sbjct: 278 EINIVTWNIIAGGCLRLGKFTRALKLLSQMRNFGIHLDDVAMIIGLGACSHIGAIRLGKE 337
Query: 301 IHGFTIRHYYHKLSTVQNALVTMYARCKDIMHAYKLFRLNDDKSIITWNSMLSGLTHLDR 360
IHGFTIRH YHK STVQNAL+TMYARCKDIM AY LFRLNDDKSIITWNSMLSGL+H+DR
Sbjct: 338 IHGFTIRHCYHKSSTVQNALLTMYARCKDIMRAYILFRLNDDKSIITWNSMLSGLSHVDR 397
Query: 361 VKDALHLFRELLQFGVEPNYVTFASILPLCARVADLQHGREFHCYITKRQDFRDYLLLWN 420
V+DAL LFRE L FGVEPNYVTFASILPLCARVADLQHGREFHCYITKRQDF+DYLLLWN
Sbjct: 398 VEDALRLFREFLLFGVEPNYVTFASILPLCARVADLQHGREFHCYITKRQDFQDYLLLWN 457
Query: 421 ALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEGGKALRLFKEMKRFQIKP 480
ALVDMYARSGKV+EAKRVFDSLSKKDEVTYTSLIAGYGMQGEG KALRLF+EMK IKP
Sbjct: 458 ALVDMYARSGKVIEAKRVFDSLSKKDEVTYTSLIAGYGMQGEGRKALRLFEEMKSVDIKP 517
Query: 481 DHITMVAVLSACSHSGLVKQGELLFAEMQSLHGLRPHLEHYACMADLFGRVGLLNKAKEI 540
DHITMVAVLSACSHSGLVKQGE+LFAEMQS+HGL PHLEHYACMADLFGRVGLL++AKEI
Sbjct: 518 DHITMVAVLSACSHSGLVKQGEVLFAEMQSVHGLSPHLEHYACMADLFGRVGLLDRAKEI 577
Query: 541 ITRMPYRPTSAMWATLIGACCIHGNTDIGEWAAEKLLEMRPEHSGYYVLIANMYAAAGSW 600
ITRMPYRPTSAMWATLIGACCIH NTDIGEWAAEKLLEM+PEHSGYYVLIANMYAAAGSW
Sbjct: 578 ITRMPYRPTSAMWATLIGACCIHRNTDIGEWAAEKLLEMKPEHSGYYVLIANMYAAAGSW 637
Query: 601 SKLAKIRTLMRDSGVAKVPGCSWVDVGSEFVSFLVGDTSNPQALESKLLLDSLNDVMKHG 660
SKLAKIRTLMRD GVAK PGCSWV+VGSEFVSFLVGDTSNPQALESK LLD LNDVMKHG
Sbjct: 638 SKLAKIRTLMRDYGVAKAPGCSWVEVGSEFVSFLVGDTSNPQALESKHLLDDLNDVMKHG 697
Query: 661 SLMTTDSYDDIGDDIF 677
+L+ TD Y DIG+D+F
Sbjct: 698 TLVMTDDY-DIGNDVF 712
BLAST of HG10003471 vs. ExPASy TrEMBL
Match:
A0A6J1EI84 (pentatricopeptide repeat-containing protein At1g71490 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111432795 PE=4 SV=1)
HSP 1 Score: 1246.5 bits (3224), Expect = 0.0e+00
Identity = 611/676 (90.38%), Postives = 639/676 (94.53%), Query Frame = 0
Query: 1 MIDSIFSSLKNFASHGQLSKTFEAFSLIQLRTSYNDSFDLILQSISILLVSCTNCSSLPS 60
MI SIF SLK+FASHGQLSK FEAFSL+QLR SYNDSFDLI+QSISILLVSCT CSSLPS
Sbjct: 38 MITSIFDSLKSFASHGQLSKAFEAFSLVQLRGSYNDSFDLIVQSISILLVSCTTCSSLPS 97
Query: 61 GKQLHGHIISSGLEEDSILVPKLVTFYSSFKLLPEAHTLVENSNLFHPCPWNLLITSYVR 120
GKQLHG IISSGLEEDSILVPKLVTFYSSFKLL EAHTLVENSNLFHPCPWNLLITSYVR
Sbjct: 98 GKQLHGRIISSGLEEDSILVPKLVTFYSSFKLLAEAHTLVENSNLFHPCPWNLLITSYVR 157
Query: 121 NELHDAAILAYKQMLSKGVRPDNFTFPSILKACGETQNLEFGLEVHKSINAWSTKWSLFV 180
NELH++AILAYKQMLSKGVRPDNFTFPSILKACGETQNL FGLEVHK IN+WS +WSLFV
Sbjct: 158 NELHESAILAYKQMLSKGVRPDNFTFPSILKACGETQNLGFGLEVHKCINSWSNQWSLFV 217
Query: 181 QNALISMYGRCGEVDTARNLFDNMLERDAVSWNSMISCYASKAMWKEAFELFDSMQSKCV 240
QNALISMYGRCGE+DTARNLFDNML+RDAVSWNSMISCYAS MWKEAFELFD MQSKC+
Sbjct: 218 QNALISMYGRCGELDTARNLFDNMLDRDAVSWNSMISCYASNGMWKEAFELFDIMQSKCL 277
Query: 241 EINVVTWNIIAGGCSRVGNFTRALKLLSQMRNFGIHLDSVAMIIGLGACSHIGAIRLGKE 300
EIN+VTWNIIAGGC R+G FT+ALKLLSQMRNFGIHLD VAMIIGLGACSHIGAIRLGKE
Sbjct: 278 EINIVTWNIIAGGCLRLGKFTQALKLLSQMRNFGIHLDDVAMIIGLGACSHIGAIRLGKE 337
Query: 301 IHGFTIRHYYHKLSTVQNALVTMYARCKDIMHAYKLFRLNDDKSIITWNSMLSGLTHLDR 360
IHGFTIRH YHK STVQNAL+TMYARCKDIM AY LFRLNDDKSIITWNSMLSGL+H+DR
Sbjct: 338 IHGFTIRHCYHKSSTVQNALLTMYARCKDIMRAYILFRLNDDKSIITWNSMLSGLSHVDR 397
Query: 361 VKDALHLFRELLQFGVEPNYVTFASILPLCARVADLQHGREFHCYITKRQDFRDYLLLWN 420
V+DAL LFRELL +GVEPNYVTFASILPLCARVADLQHGREFHCYITKRQDF+DYLLLWN
Sbjct: 398 VEDALRLFRELLLYGVEPNYVTFASILPLCARVADLQHGREFHCYITKRQDFQDYLLLWN 457
Query: 421 ALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEGGKALRLFKEMKRFQIKP 480
ALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEG KALRLF+EMK IKP
Sbjct: 458 ALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEGRKALRLFEEMKSVDIKP 517
Query: 481 DHITMVAVLSACSHSGLVKQGELLFAEMQSLHGLRPHLEHYACMADLFGRVGLLNKAKEI 540
DHITMVAVLSACSHSGLVKQGE+LFAEMQS+HGL P LEHYACMADLFGRVGLL++AKEI
Sbjct: 518 DHITMVAVLSACSHSGLVKQGEVLFAEMQSVHGLSPRLEHYACMADLFGRVGLLDRAKEI 577
Query: 541 ITRMPYRPTSAMWATLIGACCIHGNTDIGEWAAEKLLEMRPEHSGYYVLIANMYAAAGSW 600
ITRMPYRPTSAMWATLIGACCIH NTDIGEWAAEKLLEM+PEHSGYYVLIANMYAAAGSW
Sbjct: 578 ITRMPYRPTSAMWATLIGACCIHRNTDIGEWAAEKLLEMKPEHSGYYVLIANMYAAAGSW 637
Query: 601 SKLAKIRTLMRDSGVAKVPGCSWVDVGSEFVSFLVGDTSNPQALESKLLLDSLNDVMKHG 660
SKLAKIRTLMRD GVAK PGCSWV+VGSEFVSFLVGDTSNPQALESK LLD LNDVMKHG
Sbjct: 638 SKLAKIRTLMRDYGVAKAPGCSWVEVGSEFVSFLVGDTSNPQALESKHLLDGLNDVMKHG 697
Query: 661 SLMTTDSYDDIGDDIF 677
+L+ D Y DIGDDIF
Sbjct: 698 TLVMIDDY-DIGDDIF 712
BLAST of HG10003471 vs. ExPASy TrEMBL
Match:
A0A6J1CJU8 (pentatricopeptide repeat-containing protein At1g71490-like OS=Momordica charantia OX=3673 GN=LOC111012088 PE=4 SV=1)
HSP 1 Score: 1232.6 bits (3188), Expect = 0.0e+00
Identity = 605/676 (89.50%), Postives = 634/676 (93.79%), Query Frame = 0
Query: 1 MIDSIFSSLKNFASHGQLSKTFEAFSLIQLRTSYNDSFDLILQSISILLVSCTNCSSLPS 60
MI +FSSLK+FA HGQLSK FEAFSLIQLRT YNDSFDLILQS SILLVSCTN SSLP
Sbjct: 38 MIGPVFSSLKSFACHGQLSKAFEAFSLIQLRTGYNDSFDLILQSFSILLVSCTNASSLPP 97
Query: 61 GKQLHGHIISSGLEEDSILVPKLVTFYSSFKLLPEAHTLVENSNLFHPCPWNLLITSYVR 120
G+QLHG II SGLE+DSILVPKLVTFYSSFKLL EAHTLVENSN+FHPCPWNLLITSYVR
Sbjct: 98 GRQLHGRIILSGLEQDSILVPKLVTFYSSFKLLAEAHTLVENSNIFHPCPWNLLITSYVR 157
Query: 121 NELHDAAILAYKQMLSKGVRPDNFTFPSILKACGETQNLEFGLEVHKSINAWSTKWSLFV 180
N LH+AAIL YKQMLS+G+RPDNFTFPSILKACGETQNL FGLEVHK INAWST+WSLFV
Sbjct: 158 NGLHEAAILVYKQMLSRGIRPDNFTFPSILKACGETQNLGFGLEVHKYINAWSTEWSLFV 217
Query: 181 QNALISMYGRCGEVDTARNLFDNMLERDAVSWNSMISCYASKAMWKEAFELFDSMQSKCV 240
QNALISMYGRCGEVDTARNLFDNML+RDAVSWNSMISCYASK MWKEAFELFD+MQSKC+
Sbjct: 218 QNALISMYGRCGEVDTARNLFDNMLDRDAVSWNSMISCYASKGMWKEAFELFDNMQSKCI 277
Query: 241 EINVVTWNIIAGGCSRVGNFTRALKLLSQMRNFGIHLDSVAMIIGLGACSHIGAIRLGKE 300
EIN+VTWNIIAGGC RVGNF ALKLLSQMRNFG HLD VAMIIGLGACSHIGAIRLGKE
Sbjct: 278 EINIVTWNIIAGGCLRVGNFVGALKLLSQMRNFGAHLDYVAMIIGLGACSHIGAIRLGKE 337
Query: 301 IHGFTIRHYYHKLSTVQNALVTMYARCKDIMHAYKLFRLNDDKSIITWNSMLSGLTHLDR 360
IHGFTIRH YH+LS VQNALVTMYARCKDIM+AY LFRLN DKSIITWNSMLSG THLDR
Sbjct: 338 IHGFTIRHCYHRLSIVQNALVTMYARCKDIMNAYILFRLNSDKSIITWNSMLSGYTHLDR 397
Query: 361 VKDALHLFRELLQFGVEPNYVTFASILPLCARVADLQHGREFHCYITKRQDFRDYLLLWN 420
V++AL LFRELL GVEPNYVT ASILPLCARVADLQHGREFHCYITKR+D DYLLLWN
Sbjct: 398 VEEALFLFRELLLSGVEPNYVTIASILPLCARVADLQHGREFHCYITKREDLGDYLLLWN 457
Query: 421 ALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEGGKALRLFKEMKRFQIKP 480
ALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEG KALRLF+EMKRFQIKP
Sbjct: 458 ALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEGSKALRLFQEMKRFQIKP 517
Query: 481 DHITMVAVLSACSHSGLVKQGELLFAEMQSLHGLRPHLEHYACMADLFGRVGLLNKAKEI 540
DHITMVAVLSACSHSGL+KQGELLFAEMQS+HGL PHLEHYACMADLFGRVGLLNKAK I
Sbjct: 518 DHITMVAVLSACSHSGLLKQGELLFAEMQSVHGLSPHLEHYACMADLFGRVGLLNKAKGI 577
Query: 541 ITRMPYRPTSAMWATLIGACCIHGNTDIGEWAAEKLLEMRPEHSGYYVLIANMYAAAGSW 600
ITRMPYRPTSAMWATLIGACCIHGNT+IGEWAAEKLLEM+PEHSGYYVLIANMYAAAGSW
Sbjct: 578 ITRMPYRPTSAMWATLIGACCIHGNTNIGEWAAEKLLEMQPEHSGYYVLIANMYAAAGSW 637
Query: 601 SKLAKIRTLMRDSGVAKVPGCSWVDVGSEFVSFLVGDTSNPQALESKLLLDSLNDVMKHG 660
SKLAKIRTLMRDSGVAK PGCSWVDVGS FVSFLVGDTSNPQALE+ LLLD+LN+VMKHG
Sbjct: 638 SKLAKIRTLMRDSGVAKAPGCSWVDVGSGFVSFLVGDTSNPQALEANLLLDNLNNVMKHG 697
Query: 661 SLMTTDSYDDIGDDIF 677
SL+T DS+ DI +D F
Sbjct: 698 SLVTKDSH-DIDNDSF 712
BLAST of HG10003471 vs. ExPASy TrEMBL
Match:
A0A1S3CB12 (pentatricopeptide repeat-containing protein At1g71490 OS=Cucumis melo OX=3656 GN=LOC103498667 PE=4 SV=1)
HSP 1 Score: 1231.5 bits (3185), Expect = 0.0e+00
Identity = 609/676 (90.09%), Postives = 637/676 (94.23%), Query Frame = 0
Query: 1 MIDSIFSSLKNFASHGQLSKTFEAFSLIQLRTSYNDSFDLILQSISILLVSCTNCSSLPS 60
MI SIFSSLK+FASHGQLSKTFEAFSLIQLRTSYNDSFDLILQSISILLVSCT CSSLP
Sbjct: 38 MIGSIFSSLKDFASHGQLSKTFEAFSLIQLRTSYNDSFDLILQSISILLVSCTKCSSLPP 97
Query: 61 GKQLHGHIISSGLEEDSILVPKLVTFYSSFKLLPEAHTLVENSNLFHPCPWNLLITSYVR 120
GKQLHGHIISSGL EDS LV KLV FYSS + LPEAHTLVE SNLF PC WN+L+TSYVR
Sbjct: 98 GKQLHGHIISSGLVEDSFLVSKLVMFYSSLECLPEAHTLVETSNLFRPCSWNILMTSYVR 157
Query: 121 NELHDAAILAYKQMLSKGVRPDNFTFPSILKACGETQNLEFGLEVHKSINAWSTKWSLFV 180
N+L++AAILAYKQMLSKGVRPDNFTFPSILKACGETQNLEFGLEVHKSINA ST WSLFV
Sbjct: 158 NKLYEAAILAYKQMLSKGVRPDNFTFPSILKACGETQNLEFGLEVHKSINACSTNWSLFV 217
Query: 181 QNALISMYGRCGEVDTARNLFDNMLERDAVSWNSMISCYASKAMWKEAFELFDSMQSKCV 240
NALISMYGRCGEVDTAR LFD MLERD VSWNSMISCY+S+ MW+EAFELF+SMQSK +
Sbjct: 218 HNALISMYGRCGEVDTARYLFDIMLERDGVSWNSMISCYSSRGMWREAFELFESMQSKSL 277
Query: 241 EINVVTWNIIAGGCSRVGNFTRALKLLSQMRNFGIHLDSVAMIIGLGACSHIGAIRLGKE 300
EINVVTWNIIAGGC RVGNFTRAL LLSQMRNFGIHLD VAMIIGLGACSHIGAIRLGKE
Sbjct: 278 EINVVTWNIIAGGCLRVGNFTRALNLLSQMRNFGIHLDDVAMIIGLGACSHIGAIRLGKE 337
Query: 301 IHGFTIRHYYHKLSTVQNALVTMYARCKDIMHAYKLFRLNDDKSIITWNSMLSGLTHLDR 360
IHGFTIRHY+H LSTVQNALVTMYARCKDI HAY LFRLNDDKSIITWNSMLSGLTHLDR
Sbjct: 338 IHGFTIRHYHHMLSTVQNALVTMYARCKDIRHAYMLFRLNDDKSIITWNSMLSGLTHLDR 397
Query: 361 VKDALHLFRELLQFGVEPNYVTFASILPLCARVADLQHGREFHCYITKRQDFRDYLLLWN 420
V+DAL LFRELL FGVEPNYVTFASILPLCARVA+LQHGREFHCYITKR DFRD+LLLWN
Sbjct: 398 VEDALCLFRELLLFGVEPNYVTFASILPLCARVANLQHGREFHCYITKRHDFRDHLLLWN 457
Query: 421 ALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEGGKALRLFKEMKRFQIKP 480
ALVDMYARSGKV EAKRVFDSLSKKDEVTYTSLIAGYGMQGEGGKALRLF+EMKRFQIKP
Sbjct: 458 ALVDMYARSGKVSEAKRVFDSLSKKDEVTYTSLIAGYGMQGEGGKALRLFEEMKRFQIKP 517
Query: 481 DHITMVAVLSACSHSGLVKQGELLFAEMQSLHGLRPHLEHYACMADLFGRVGLLNKAKEI 540
DHITMVAVLSACSHSGL+ QGELLFAEMQS+HGL P LEHY+CMADLFGRVGLLNKAKEI
Sbjct: 518 DHITMVAVLSACSHSGLLNQGELLFAEMQSVHGLSPRLEHYSCMADLFGRVGLLNKAKEI 577
Query: 541 ITRMPYRPTSAMWATLIGACCIHGNTDIGEWAAEKLLEMRPEHSGYYVLIANMYAAAGSW 600
ITRMPYRPTSA+WATLIGACCIHGNTDIGEWAAEKLLEM+PEHSGYYVLIANMYAAAGSW
Sbjct: 578 ITRMPYRPTSAIWATLIGACCIHGNTDIGEWAAEKLLEMQPEHSGYYVLIANMYAAAGSW 637
Query: 601 SKLAKIRTLMRDSGVAKVPGCSWVDVGSEFVSFLVGDTSNPQALESKLLLDSLNDVMKHG 660
SKLA+IRT MRDSGVAKVPGCSWVDVGSEFVSF VGDTS+PQALESKLLLDSL DV+KH
Sbjct: 638 SKLAEIRTRMRDSGVAKVPGCSWVDVGSEFVSFSVGDTSSPQALESKLLLDSLYDVIKHD 697
Query: 661 SLMTTDSYDDIGDDIF 677
SL+TTD+Y D GD+IF
Sbjct: 698 SLITTDNY-DTGDNIF 712
BLAST of HG10003471 vs. ExPASy TrEMBL
Match:
A0A5D3BN10 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold169G00830 PE=4 SV=1)
HSP 1 Score: 1231.5 bits (3185), Expect = 0.0e+00
Identity = 609/676 (90.09%), Postives = 637/676 (94.23%), Query Frame = 0
Query: 1 MIDSIFSSLKNFASHGQLSKTFEAFSLIQLRTSYNDSFDLILQSISILLVSCTNCSSLPS 60
MI SIFSSLK+FASHGQLSKTFEAFSLIQLRTSYNDSFDLILQSISILLVSCT CSSLP
Sbjct: 38 MIGSIFSSLKDFASHGQLSKTFEAFSLIQLRTSYNDSFDLILQSISILLVSCTKCSSLPP 97
Query: 61 GKQLHGHIISSGLEEDSILVPKLVTFYSSFKLLPEAHTLVENSNLFHPCPWNLLITSYVR 120
GKQLHGHIISSGL EDS LV KLV FYSS + LPEAHTLVE SNLF PC WN+L+TSYVR
Sbjct: 98 GKQLHGHIISSGLVEDSFLVSKLVMFYSSLECLPEAHTLVETSNLFRPCSWNILMTSYVR 157
Query: 121 NELHDAAILAYKQMLSKGVRPDNFTFPSILKACGETQNLEFGLEVHKSINAWSTKWSLFV 180
N+L++AAILAYKQMLSKGVRPDNFTFPSILKACGETQNLEFGLEVHKSINA ST WSLFV
Sbjct: 158 NKLYEAAILAYKQMLSKGVRPDNFTFPSILKACGETQNLEFGLEVHKSINACSTNWSLFV 217
Query: 181 QNALISMYGRCGEVDTARNLFDNMLERDAVSWNSMISCYASKAMWKEAFELFDSMQSKCV 240
NALISMYGRCGEVDTAR LFD MLERD VSWNSMISCY+S+ MW+EAFELF+SMQSK +
Sbjct: 218 HNALISMYGRCGEVDTARYLFDIMLERDGVSWNSMISCYSSRGMWREAFELFESMQSKSL 277
Query: 241 EINVVTWNIIAGGCSRVGNFTRALKLLSQMRNFGIHLDSVAMIIGLGACSHIGAIRLGKE 300
EINVVTWNIIAGGC RVGNFTRAL LLSQMRNFGIHLD VAMIIGLGACSHIGAIRLGKE
Sbjct: 278 EINVVTWNIIAGGCLRVGNFTRALNLLSQMRNFGIHLDDVAMIIGLGACSHIGAIRLGKE 337
Query: 301 IHGFTIRHYYHKLSTVQNALVTMYARCKDIMHAYKLFRLNDDKSIITWNSMLSGLTHLDR 360
IHGFTIRHY+H LSTVQNALVTMYARCKDI HAY LFRLNDDKSIITWNSMLSGLTHLDR
Sbjct: 338 IHGFTIRHYHHMLSTVQNALVTMYARCKDIRHAYMLFRLNDDKSIITWNSMLSGLTHLDR 397
Query: 361 VKDALHLFRELLQFGVEPNYVTFASILPLCARVADLQHGREFHCYITKRQDFRDYLLLWN 420
V+DAL LFRELL FGVEPNYVTFASILPLCARVA+LQHGREFHCYITKR DFRD+LLLWN
Sbjct: 398 VEDALCLFRELLLFGVEPNYVTFASILPLCARVANLQHGREFHCYITKRHDFRDHLLLWN 457
Query: 421 ALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEGGKALRLFKEMKRFQIKP 480
ALVDMYARSGKV EAKRVFDSLSKKDEVTYTSLIAGYGMQGEGGKALRLF+EMKRFQIKP
Sbjct: 458 ALVDMYARSGKVSEAKRVFDSLSKKDEVTYTSLIAGYGMQGEGGKALRLFEEMKRFQIKP 517
Query: 481 DHITMVAVLSACSHSGLVKQGELLFAEMQSLHGLRPHLEHYACMADLFGRVGLLNKAKEI 540
DHITMVAVLSACSHSGL+ QGELLFAEMQS+HGL P LEHY+CMADLFGRVGLLNKAKEI
Sbjct: 518 DHITMVAVLSACSHSGLLNQGELLFAEMQSVHGLSPRLEHYSCMADLFGRVGLLNKAKEI 577
Query: 541 ITRMPYRPTSAMWATLIGACCIHGNTDIGEWAAEKLLEMRPEHSGYYVLIANMYAAAGSW 600
ITRMPYRPTSA+WATLIGACCIHGNTDIGEWAAEKLLEM+PEHSGYYVLIANMYAAAGSW
Sbjct: 578 ITRMPYRPTSAIWATLIGACCIHGNTDIGEWAAEKLLEMQPEHSGYYVLIANMYAAAGSW 637
Query: 601 SKLAKIRTLMRDSGVAKVPGCSWVDVGSEFVSFLVGDTSNPQALESKLLLDSLNDVMKHG 660
SKLA+IRT MRDSGVAKVPGCSWVDVGSEFVSF VGDTS+PQALESKLLLDSL DV+KH
Sbjct: 638 SKLAEIRTRMRDSGVAKVPGCSWVDVGSEFVSFSVGDTSSPQALESKLLLDSLYDVIKHD 697
Query: 661 SLMTTDSYDDIGDDIF 677
SL+TTD+Y D GD+IF
Sbjct: 698 SLITTDNY-DTGDNIF 712
BLAST of HG10003471 vs. TAIR 10
Match:
AT1G71490.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )
HSP 1 Score: 828.9 bits (2140), Expect = 2.9e-240
Identity = 397/656 (60.52%), Postives = 504/656 (76.83%), Query Frame = 0
Query: 3 DSIFSSLKNFASHGQLSKTFEAFSLIQLRTSYNDSFDLILQSISILLVSCTNCSSLPSGK 62
+S+F SL + ASHG L F+ FSL++L++S S DL+L S + LL +C + + +G
Sbjct: 4 ESLFKSLGHLASHGHLHDAFKTFSLLRLQSSSAVSDDLVLHSAASLLSACVDVRAFLAGV 63
Query: 63 QLHGHIISSGLEEDSILVPKLVTFYSSFKLLPEAHTLVENSNLFHPCPWNLLITSYVRNE 122
Q+H H ISSG+E S+LVPKLVTFYS+F L EA +++ENS++ HP PWN+LI SY +NE
Sbjct: 64 QVHAHCISSGVEYHSVLVPKLVTFYSAFNLHNEAQSIIENSDILHPLPWNVLIASYAKNE 123
Query: 123 LHDAAILAYKQMLSKGVRPDNFTFPSILKACGETQNLEFGLEVHKSINAWSTKWSLFVQN 182
L + I AYK+M+SKG+RPD FT+PS+LKACGET ++ FG VH SI S K SL+V N
Sbjct: 124 LFEEVIAAYKRMVSKGIRPDAFTYPSVLKACGETLDVAFGRVVHGSIEVSSYKSSLYVCN 183
Query: 183 ALISMYGRCGEVDTARNLFDNMLERDAVSWNSMISCYASKAMWKEAFELFDSMQSKCVEI 242
ALISMY R + AR LFD M ERDAVSWN++I+CYAS+ MW EAFELFD M VE+
Sbjct: 184 ALISMYKRFRNMGIARRLFDRMFERDAVSWNAVINCYASEGMWSEAFELFDKMWFSGVEV 243
Query: 243 NVVTWNIIAGGCSRVGNFTRALKLLSQMRNFGIHLDSVAMIIGLGACSHIGAIRLGKEIH 302
+V+TWNII+GGC + GN+ AL L+S+MRNF LD VAMIIGL ACS IGAIRLGKEIH
Sbjct: 244 SVITWNIISGGCLQTGNYVGALGLISRMRNFPTSLDPVAMIIGLKACSLIGAIRLGKEIH 303
Query: 303 GFTIRHYYHKLSTVQNALVTMYARCKDIMHAYKLFRLNDDKSIITWNSMLSGLTHLDRVK 362
G I Y + V+N L+TMY++CKD+ HA +FR ++ S+ TWNS++SG L++ +
Sbjct: 304 GLAIHSSYDGIDNVRNTLITMYSKCKDLRHALIVFRQTEENSLCTWNSIISGYAQLNKSE 363
Query: 363 DALHLFRELLQFGVEPNYVTFASILPLCARVADLQHGREFHCYITKRQDFRDYLLLWNAL 422
+A HL RE+L G +PN +T ASILPLCAR+A+LQHG+EFHCYI +R+ F+DY +LWN+L
Sbjct: 364 EASHLLREMLVAGFQPNSITLASILPLCARIANLQHGKEFHCYILRRKCFKDYTMLWNSL 423
Query: 423 VDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEGGKALRLFKEMKRFQIKPDH 482
VD+YA+SGK++ AK+V D +SK+DEVTYTSLI GYG QGEGG AL LFKEM R IKPDH
Sbjct: 424 VDVYAKSGKIVAAKQVSDLMSKRDEVTYTSLIDGYGNQGEGGVALALFKEMTRSGIKPDH 483
Query: 483 ITMVAVLSACSHSGLVKQGELLFAEMQSLHGLRPHLEHYACMADLFGRVGLLNKAKEIIT 542
+T+VAVLSACSHS LV +GE LF +MQ +G+RP L+H++CM DL+GR G L KAK+II
Sbjct: 484 VTVVAVLSACSHSKLVHEGERLFMKMQCEYGIRPCLQHFSCMVDLYGRAGFLAKAKDIIH 543
Query: 543 RMPYRPTSAMWATLIGACCIHGNTDIGEWAAEKLLEMRPEHSGYYVLIANMYAAAGSWSK 602
MPY+P+ A WATL+ AC IHGNT IG+WAAEKLLEM+PE+ GYYVLIANMYAAAGSWSK
Sbjct: 544 NMPYKPSGATWATLLNACHIHGNTQIGKWAAEKLLEMKPENPGYYVLIANMYAAAGSWSK 603
Query: 603 LAKIRTLMRDSGVAKVPGCSWVDVGSEFVSFLVGDTSNPQALESKLLLDSLNDVMK 659
LA++RT+MRD GV K PGC+W+D S F F VGDTS+P+A + LLD LN +MK
Sbjct: 604 LAEVRTIMRDLGVKKDPGCAWIDTDSGFSLFSVGDTSSPEACNTYPLLDGLNQLMK 659
BLAST of HG10003471 vs. TAIR 10
Match:
AT1G22830.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )
HSP 1 Score: 688.7 bits (1776), Expect = 4.6e-198
Identity = 338/654 (51.68%), Postives = 459/654 (70.18%), Query Frame = 0
Query: 5 IFSSLKNFASHGQLSKTFEAFSLIQLRTSYNDSFDLILQSISILLVSCTNCSSLPSGKQL 64
+F+S ++ SHGQL + F FSL++ ++ S + +L S + LL +C + G+QL
Sbjct: 49 LFNSFRHCISHGQLYEAFRTFSLLRYQSG---SHEFVLYSSASLLSTCVGFNEFVPGQQL 108
Query: 65 HGHIISSGLEEDSILVPKLVTFYSSFKLLPEAHTLVENSNLFHPCPWNLLITSYVRNELH 124
H H ISSGLE DS+LVPKLVTFYS+F LL EA T+ ENS + HP PWN+LI SY+RN+
Sbjct: 109 HAHCISSGLEFDSVLVPKLVTFYSAFNLLDEAQTITENSEILHPLPWNVLIGSYIRNKRF 168
Query: 125 DAAILAYKQMLSKGVRPDNFTFPSILKACGETQNLEFGLEVHKSINAWSTKWSLFVQNAL 184
++ YK+M+SKG+R D FT+PS++KAC + +G VH SI S + +L+V NAL
Sbjct: 169 QESVSVYKRMMSKGIRADEFTYPSVIKACAALLDFAYGRVVHGSIEVSSHRCNLYVCNAL 228
Query: 185 ISMYGRCGEVDTARNLFDNMLERDAVSWNSMISCYASKAMWKEAFELFDSMQSKCVEINV 244
ISMY R G+VD AR LFD M ERDAVSWN++I+CY S+ EAF+L D M VE ++
Sbjct: 229 ISMYKRFGKVDVARRLFDRMSERDAVSWNAIINCYTSEEKLGEAFKLLDRMYLSGVEASI 288
Query: 245 VTWNIIAGGCSRVGNFTRALKLLSQMRNFGIHLDSVAMIIGLGACSHIGAIRLGKEIHGF 304
VTWN IAGGC GN+ AL + MRN + + SVAMI GL ACSHIGA++ GK H
Sbjct: 289 VTWNTIAGGCLEAGNYIGALNCVVGMRNCNVRIGSVAMINGLKACSHIGALKWGKVFHCL 348
Query: 305 TIR--HYYHKLSTVQNALVTMYARCKDIMHAYKLFRLNDDKSIITWNSMLSGLTHLDRVK 364
IR + H + V+N+L+TMY+RC D+ HA+ +F+ + S+ TWNS++SG + +R +
Sbjct: 349 VIRSCSFSHDIDNVRNSLITMYSRCSDLRHAFIVFQQVEANSLSTWNSIISGFAYNERSE 408
Query: 365 DALHLFRELLQFGVEPNYVTFASILPLCARVADLQHGREFHCYITKRQDFRDYLLLWNAL 424
+ L +E+L G PN++T ASILPL ARV +LQHG+EFHCYI +RQ ++D L+LWN+L
Sbjct: 409 ETSFLLKEMLLSGFHPNHITLASILPLFARVGNLQHGKEFHCYILRRQSYKDCLILWNSL 468
Query: 425 VDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEGGKALRLFKEMKRFQIKPDH 484
VDMYA+SG+++ AKRVFDS+ K+D+VTYTSLI GYG G+G AL FK+M R IKPDH
Sbjct: 469 VDMYAKSGEIIAAKRVFDSMRKRDKVTYTSLIDGYGRLGKGEVALAWFKDMDRSGIKPDH 528
Query: 485 ITMVAVLSACSHSGLVKQGELLFAEMQSLHGLRPHLEHYACMADLFGRVGLLNKAKEIIT 544
+TMVAVLSACSHS LV++G LF +M+ + G+R LEHY+CM DL+ R G L+KA++I
Sbjct: 529 VTMVAVLSACSHSNLVREGHWLFTKMEHVFGIRLRLEHYSCMVDLYCRAGYLDKARDIFH 588
Query: 545 RMPYRPTSAMWATLIGACCIHGNTDIGEWAAEK-LLEMRPEHSGYYVLIANMYAAAGSWS 604
+PY P+SAM ATL+ AC IHGNT+IGEWAA+K LLE +PEH G+Y+L+A+MYA GSWS
Sbjct: 589 TIPYEPSSAMCATLLKACLIHGNTNIGEWAADKLLLETKPEHLGHYMLLADMYAVTGSWS 648
Query: 605 KLAKIRTLMRDSGVAKVPGCSWVDVGSEFVSFLVGDTSNPQALESKLLLDSLND 656
KL ++TL+ D GV K + ++ SE L G+ + P +S + + +D
Sbjct: 649 KLVTVKTLLSDLGVQKAHEFALMETDSE----LDGENNKPMNDDSVINQEQSSD 695
BLAST of HG10003471 vs. TAIR 10
Match:
AT1G22830.2 (Tetratricopeptide repeat (TPR)-like superfamily protein )
HSP 1 Score: 688.7 bits (1776), Expect = 4.6e-198
Identity = 338/654 (51.68%), Postives = 459/654 (70.18%), Query Frame = 0
Query: 5 IFSSLKNFASHGQLSKTFEAFSLIQLRTSYNDSFDLILQSISILLVSCTNCSSLPSGKQL 64
+F+S ++ SHGQL + F FSL++ ++ S + +L S + LL +C + G+QL
Sbjct: 49 LFNSFRHCISHGQLYEAFRTFSLLRYQSG---SHEFVLYSSASLLSTCVGFNEFVPGQQL 108
Query: 65 HGHIISSGLEEDSILVPKLVTFYSSFKLLPEAHTLVENSNLFHPCPWNLLITSYVRNELH 124
H H ISSGLE DS+LVPKLVTFYS+F LL EA T+ ENS + HP PWN+LI SY+RN+
Sbjct: 109 HAHCISSGLEFDSVLVPKLVTFYSAFNLLDEAQTITENSEILHPLPWNVLIGSYIRNKRF 168
Query: 125 DAAILAYKQMLSKGVRPDNFTFPSILKACGETQNLEFGLEVHKSINAWSTKWSLFVQNAL 184
++ YK+M+SKG+R D FT+PS++KAC + +G VH SI S + +L+V NAL
Sbjct: 169 QESVSVYKRMMSKGIRADEFTYPSVIKACAALLDFAYGRVVHGSIEVSSHRCNLYVCNAL 228
Query: 185 ISMYGRCGEVDTARNLFDNMLERDAVSWNSMISCYASKAMWKEAFELFDSMQSKCVEINV 244
ISMY R G+VD AR LFD M ERDAVSWN++I+CY S+ EAF+L D M VE ++
Sbjct: 229 ISMYKRFGKVDVARRLFDRMSERDAVSWNAIINCYTSEEKLGEAFKLLDRMYLSGVEASI 288
Query: 245 VTWNIIAGGCSRVGNFTRALKLLSQMRNFGIHLDSVAMIIGLGACSHIGAIRLGKEIHGF 304
VTWN IAGGC GN+ AL + MRN + + SVAMI GL ACSHIGA++ GK H
Sbjct: 289 VTWNTIAGGCLEAGNYIGALNCVVGMRNCNVRIGSVAMINGLKACSHIGALKWGKVFHCL 348
Query: 305 TIR--HYYHKLSTVQNALVTMYARCKDIMHAYKLFRLNDDKSIITWNSMLSGLTHLDRVK 364
IR + H + V+N+L+TMY+RC D+ HA+ +F+ + S+ TWNS++SG + +R +
Sbjct: 349 VIRSCSFSHDIDNVRNSLITMYSRCSDLRHAFIVFQQVEANSLSTWNSIISGFAYNERSE 408
Query: 365 DALHLFRELLQFGVEPNYVTFASILPLCARVADLQHGREFHCYITKRQDFRDYLLLWNAL 424
+ L +E+L G PN++T ASILPL ARV +LQHG+EFHCYI +RQ ++D L+LWN+L
Sbjct: 409 ETSFLLKEMLLSGFHPNHITLASILPLFARVGNLQHGKEFHCYILRRQSYKDCLILWNSL 468
Query: 425 VDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEGGKALRLFKEMKRFQIKPDH 484
VDMYA+SG+++ AKRVFDS+ K+D+VTYTSLI GYG G+G AL FK+M R IKPDH
Sbjct: 469 VDMYAKSGEIIAAKRVFDSMRKRDKVTYTSLIDGYGRLGKGEVALAWFKDMDRSGIKPDH 528
Query: 485 ITMVAVLSACSHSGLVKQGELLFAEMQSLHGLRPHLEHYACMADLFGRVGLLNKAKEIIT 544
+TMVAVLSACSHS LV++G LF +M+ + G+R LEHY+CM DL+ R G L+KA++I
Sbjct: 529 VTMVAVLSACSHSNLVREGHWLFTKMEHVFGIRLRLEHYSCMVDLYCRAGYLDKARDIFH 588
Query: 545 RMPYRPTSAMWATLIGACCIHGNTDIGEWAAEK-LLEMRPEHSGYYVLIANMYAAAGSWS 604
+PY P+SAM ATL+ AC IHGNT+IGEWAA+K LLE +PEH G+Y+L+A+MYA GSWS
Sbjct: 589 TIPYEPSSAMCATLLKACLIHGNTNIGEWAADKLLLETKPEHLGHYMLLADMYAVTGSWS 648
Query: 605 KLAKIRTLMRDSGVAKVPGCSWVDVGSEFVSFLVGDTSNPQALESKLLLDSLND 656
KL ++TL+ D GV K + ++ SE L G+ + P +S + + +D
Sbjct: 649 KLVTVKTLLSDLGVQKAHEFALMETDSE----LDGENNKPMNDDSVINQEQSSD 695
BLAST of HG10003471 vs. TAIR 10
Match:
AT5G16860.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )
HSP 1 Score: 422.9 bits (1086), Expect = 4.7e-118
Identity = 229/682 (33.58%), Postives = 370/682 (54.25%), Query Frame = 0
Query: 7 SSLKNFASHGQLSKTFEAFSLIQLRTSYNDSFDLILQSISILLVSCTNCSSLPSGKQLHG 66
S ++++ +G +K F L+ + D++ + + +C SS+ G+ H
Sbjct: 97 SLIRSYGDNGCANKCLYLFGLMHSLSWTPDNY-----TFPFVFKACGEISSVRCGESAHA 156
Query: 67 HIISSGLEEDSILVPKLVTFYSSFKLLPEAHTLVENSNLFHPCPWNLLITSYVRNELHDA 126
+ +G + + LV YS + L +A + + +++ WN +I SY +
Sbjct: 157 LSLVTGFISNVFVGNALVAMYSRCRSLSDARKVFDEMSVWDVVSWNSIIESYAKLGKPKV 216
Query: 127 AILAYKQMLSK-GVRPDNFTFPSILKACGETQNLEFGLEVHKSINAWSTKWSLFVQNALI 186
A+ + +M ++ G RPDN T ++L C G ++H ++FV N L+
Sbjct: 217 ALEMFSRMTNEFGCRPDNITLVNVLPPCASLGTHSLGKQLHCFAVTSEMIQNMFVGNCLV 276
Query: 187 SMYGRCGEVDTARNLFDNMLERDAVSWNSMISCYASKAMWKEAFELFDSMQSKCVEINVV 246
MY +CG +D A +F NM +D VSWN+M++ Y+ +++A LF+ MQ + ++++VV
Sbjct: 277 DMYAKCGMMDEANTVFSNMSVKDVVSWNAMVAGYSQIGRFEDAVRLFEKMQEEKIKMDVV 336
Query: 247 TWNIIAGGCSRVGNFTRALKLLSQMRNFGIHLDSVAMIIGLGACSHIGAIRLGKEIHGFT 306
TW+ G ++ G AL + QM + GI + V +I L C+ +GA+ GKEIH +
Sbjct: 337 TWSAAISGYAQRGLGYEALGVCRQMLSSGIKPNEVTLISVLSGCASVGALMHGKEIHCYA 396
Query: 307 IRH-------YYHKLSTVQNALVTMYARCKDIMHAYKLF--RLNDDKSIITWNSMLSGLT 366
I++ + + V N L+ MYA+CK + A +F ++ ++TW M+ G +
Sbjct: 397 IKYPIDLRKNGHGDENMVINQLIDMYAKCKKVDTARAMFDSLSPKERDVVTWTVMIGGYS 456
Query: 367 HLDRVKDALHLFRELLQFGVE--PNYVTFASILPLCARVADLQHGREFHCYITKRQDFRD 426
AL L E+ + + PN T + L CA +A L+ G++ H Y + Q
Sbjct: 457 QHGDANKALELLSEMFEEDCQTRPNAFTISCALVACASLAALRIGKQIHAYALRNQQNAV 516
Query: 427 YLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEGGKALRLFKEMK 486
L + N L+DMYA+ G + +A+ VFD++ K+EVT+TSL+ GYGM G G +AL +F EM+
Sbjct: 517 PLFVSNCLIDMYAKCGSISDARLVFDNMMAKNEVTWTSLMTGYGMHGYGEEALGIFDEMR 576
Query: 487 RFQIKPDHITMVAVLSACSHSGLVKQGELLFAEMQSLHGLRPHLEHYACMADLFGRVGLL 546
R K D +T++ VL ACSHSG++ QG F M+++ G+ P EHYAC+ DL GR G L
Sbjct: 577 RIGFKLDGVTLLVVLYACSHSGMIDQGMEYFNRMKTVFGVSPGPEHYACLVDLLGRAGRL 636
Query: 547 NKAKEIITRMPYRPTSAMWATLIGACCIHGNTDIGEWAAEKLLEMRPEHSGYYVLIANMY 606
N A +I MP P +W + C IHG ++GE+AAEK+ E+ H G Y L++N+Y
Sbjct: 637 NAALRLIEEMPMEPPPVVWVAFLSCCRIHGKVELGEYAAEKITELASNHDGSYTLLSNLY 696
Query: 607 AAAGSWSKLAKIRTLMRDSGVAKVPGCSWVDVGSEFVSFLVGDTSNPQALE-SKLLLDSL 666
A AG W + +IR+LMR GV K PGCSWV+ +F VGD ++P A E ++LLD +
Sbjct: 697 ANAGRWKDVTRIRSLMRHKGVKKRPGCSWVEGIKGTTTFFVGDKTHPHAKEIYQVLLDHM 756
Query: 667 NDVMKHGSLMTTD-SYDDIGDD 675
+ G + T + D+ D+
Sbjct: 757 QRIKDIGYVPETGFALHDVDDE 773
BLAST of HG10003471 vs. TAIR 10
Match:
AT1G08070.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )
HSP 1 Score: 418.7 bits (1075), Expect = 8.9e-117
Identity = 219/642 (34.11%), Postives = 364/642 (56.70%), Query Frame = 0
Query: 30 LRTSYNDSFDLILQSISILLVSCTNCSSLPSGKQLHGHIISSGLEEDSILVPKLVTF--- 89
L +S + +D I S+ L+ NC +L S + +H +I GL + + KL+ F
Sbjct: 20 LPSSSDPPYDSIRNHPSLSLLH--NCKTLQSLRIIHAQMIKIGLHNTNYALSKLIEFCIL 79
Query: 90 YSSFKLLPEAHTLVENSNLFHPCPWNLLITSYVRNELHDAAILAYKQMLSKGVRPDNFTF 149
F+ LP A ++ + + WN + + + +A+ Y M+S G+ P+++TF
Sbjct: 80 SPHFEGLPYAISVFKTIQEPNLLIWNTMFRGHALSSDPVSALKLYVCMISLGLLPNSYTF 139
Query: 150 PSILKACGETQNLEFGLEVHKSINAWSTKWSLFVQNALISMYGRCGEVDTARNLFDNMLE 209
P +LK+C +++ + G ++H + L+V +LISMY + G ++ A +FD
Sbjct: 140 PFVLKSCAKSKAFKEGQQIHGHVLKLGCDLDLYVHTSLISMYVQNGRLEDAHKVFDKSPH 199
Query: 210 RDAVSWNSMISCYASKAMWKEAFELFDSMQSKCVEINVVTWNIIAGGCSRVGNFTRALKL 269
RD VS+ ++I YAS+ + A +LFD + K +VV+WN + G + GN+ AL+L
Sbjct: 200 RDVVSYTALIKGYASRGYIENAQKLFDEIPVK----DVVSWNAMISGYAETGNYKEALEL 259
Query: 270 LSQMRNFGIHLDSVAMIIGLGACSHIGAIRLGKEIHGFTIRHYYHKLSTVQNALVTMYAR 329
M + D M+ + AC+ G+I LG+++H + H + + NAL+ +Y++
Sbjct: 260 FKDMMKTNVRPDESTMVTVVSACAQSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSK 319
Query: 330 CKDIMHAYKLFRLNDDKSIITWNSMLSGLTHLDRVKDALHLFRELLQFGVEPNYVTFASI 389
C ++ A LF K +I+WN+++ G TH++ K+AL LF+E+L+ G PN VT SI
Sbjct: 320 CGELETACGLFERLPYKDVISWNTLIGGYTHMNLYKEALLLFQEMLRSGETPNDVTMLSI 379
Query: 390 LPLCARVADLQHGREFHCYITKR-QDFRDYLLLWNALVDMYARSGKVLEAKRVFDSLSKK 449
LP CA + + GR H YI KR + + L +L+DMYA+ G + A +VF+S+ K
Sbjct: 380 LPACAHLGAIDIGRWIHVYIDKRLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSILHK 439
Query: 450 DEVTYTSLIAGYGMQGEGGKALRLFKEMKRFQIKPDHITMVAVLSACSHSGLVKQGELLF 509
++ ++I G+ M G + LF M++ I+PD IT V +LSACSHSG++ G +F
Sbjct: 440 SLSSWNAMIFGFAMHGRADASFDLFSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRHIF 499
Query: 510 AEMQSLHGLRPHLEHYACMADLFGRVGLLNKAKEIITRMPYRPTSAMWATLIGACCIHGN 569
M + + P LEHY CM DL G GL +A+E+I M P +W +L+ AC +HGN
Sbjct: 500 RTMTQDYKMTPKLEHYGCMIDLLGHSGLFKEAEEMINMMEMEPDGVIWCSLLKACKMHGN 559
Query: 570 TDIGEWAAEKLLEMRPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAKVPGCSWVD 629
++GE AE L+++ PE+ G YVL++N+YA+AG W+++AK R L+ D G+ KVPGCS ++
Sbjct: 560 VELGESFAENLIKIEPENPGSYVLLSNIYASAGRWNEVAKTRALLNDKGMKKVPGCSSIE 619
Query: 630 VGSEFVSFLVGDTSNPQALESKLLLDSLNDVMKHGSLMTTDS 668
+ S F++GD +P+ E +L+ + +++ + S
Sbjct: 620 IDSVVHEFIIGDKFHPRNREIYGMLEEMEVLLEKAGFVPDTS 655
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
XP_038890628.1 | 0.0e+00 | 94.68 | pentatricopeptide repeat-containing protein At1g71490 [Benincasa hispida] >XP_03... | [more] |
XP_022973516.1 | 0.0e+00 | 90.53 | pentatricopeptide repeat-containing protein At1g71490 isoform X1 [Cucurbita maxi... | [more] |
KAG7025166.1 | 0.0e+00 | 90.53 | Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma sub... | [more] |
XP_022925519.1 | 0.0e+00 | 90.38 | pentatricopeptide repeat-containing protein At1g71490 isoform X1 [Cucurbita mosc... | [more] |
XP_023535485.1 | 0.0e+00 | 90.53 | pentatricopeptide repeat-containing protein At1g71490 isoform X1 [Cucurbita pepo... | [more] |
Match Name | E-value | Identity | Description | |
Q9C9I6 | 4.0e-239 | 60.52 | Pentatricopeptide repeat-containing protein At1g71490 OS=Arabidopsis thaliana OX... | [more] |
Q4V389 | 6.5e-197 | 51.68 | Pentatricopeptide repeat-containing protein At1g22830 OS=Arabidopsis thaliana OX... | [more] |
Q9LFL5 | 6.7e-117 | 33.58 | Pentatricopeptide repeat-containing protein At5g16860 OS=Arabidopsis thaliana OX... | [more] |
Q9LN01 | 1.3e-115 | 34.11 | Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidop... | [more] |
Q9LNU6 | 3.2e-103 | 30.56 | Pentatricopeptide repeat-containing protein At1g20230 OS=Arabidopsis thaliana OX... | [more] |
Match Name | E-value | Identity | Description | |
A0A6J1I8V4 | 0.0e+00 | 90.53 | pentatricopeptide repeat-containing protein At1g71490 isoform X1 OS=Cucurbita ma... | [more] |
A0A6J1EI84 | 0.0e+00 | 90.38 | pentatricopeptide repeat-containing protein At1g71490 isoform X1 OS=Cucurbita mo... | [more] |
A0A6J1CJU8 | 0.0e+00 | 89.50 | pentatricopeptide repeat-containing protein At1g71490-like OS=Momordica charanti... | [more] |
A0A1S3CB12 | 0.0e+00 | 90.09 | pentatricopeptide repeat-containing protein At1g71490 OS=Cucumis melo OX=3656 GN... | [more] |
A0A5D3BN10 | 0.0e+00 | 90.09 | Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946... | [more] |
Match Name | E-value | Identity | Description | |
AT1G71490.1 | 2.9e-240 | 60.52 | Tetratricopeptide repeat (TPR)-like superfamily protein | [more] |
AT1G22830.1 | 4.6e-198 | 51.68 | Tetratricopeptide repeat (TPR)-like superfamily protein | [more] |
AT1G22830.2 | 4.6e-198 | 51.68 | Tetratricopeptide repeat (TPR)-like superfamily protein | [more] |
AT5G16860.1 | 4.7e-118 | 33.58 | Tetratricopeptide repeat (TPR)-like superfamily protein | [more] |
AT1G08070.1 | 8.9e-117 | 34.11 | Tetratricopeptide repeat (TPR)-like superfamily protein | [more] |