HG10012837 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
NameHG10012837
Typegene
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionPentatricopeptide repeat-containing protein
LocationChr01: 24633068 .. 24634819 (+)
RNA-Seq ExpressionHG10012837
SyntenyHG10012837
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGAACGACTTCTTCGACTCGGACCCAACTCCACCGCCAGCGCCACCCATGGCGACGGTAACGACGGCGGCGTCGAAGTTGGATCCTCGCATTGTGCATGCCCAAGCAATCAAATCCCCGGCCACTGATTGTTCTGTTTACAACAATCTCATCACTCGCTACTCCAAATCAAATTTGCTCCACTACTCTGTTCGTCTCTTCGATCAAATCCCATTTCCCAATGTCGTTTCTTGGACTGCTTTCATCTCTGCTCATTCCAACACATTCCTTTCTCTTCGCCATTTTGTTTCTATGCTCAGATACCCAGCTTTTCCCAATCAACGCACATTAGCTTCTCTTTTCAAAACCTGCGCTTCTCTCCCTTCCATCTCTTTCGGCCTTGCTCTTCACTCCCTTGCCCACAAACTCTCACTTTGCTACGAACCCTACTCTGGCTCTGCACTTGTGAATTTCTACTTCAAATGTCGGTTGTTCGATGACGCTCGTAAGGTGTTCGATGAAATTTCTGACAGAGATGAGGTTTGCTATTCTGCCCTCGTTGTTGGTCTAGCGCAGAATGCGCAGTCCATTGACGCATTGTCGATGTTTAGAGAGATGAAAGCTTCTGATGTTGGGTCGACGCTGTACAGTGTTTCGGGGGCTCTTCGTGCCACGGCTGATCTTGCTGCGTTGGAACAATGCAGGGTTTTACATGCGCACGCTGTGGTTACTGGATTGGATACTGATGTGATTGTTCAAACTGCTTTGATTGATGGGTATGGGAAATCGGGGCTGATAATCGATGCACGTCAGGTGTTCGATGAAAATTTGGGATGTATGAATGTTGTGGGGTGGAATGCAATGTTGGCAAGTTATGCACAGCAGGGAGATAAGAACTCTACGCTTGAGCTATTCAATTCAATGGAAGTTTTCGGGATGTCTCCTGATGAATACAGCTTCTTAGCCATTCTGACATCATTTTGCAATTCACGTTTAGTTAGTGAGATTGAACTGTGGCTGAGAAGGATGAGAGTGGAGTATGGAGTGGAACCTACTCTTGAACATTTTACTTGTCTCATAGATGCAATGGGGAGAGCTGGGAAATTAAAAGAAGCTGAGAGGGTAGCCATGACAATGCCTTTCGTGCCAGATGCCGCGGTTTGGCGTGCGTTGCTATCGAGCTCTGCAAGCCATGGTGCAGGCGATATGGCATGGACGATGGCAAAACGGTTACTGGAGCTCGACCAGCATGACGACTCGGCTTATGTGATTGTTGCGAATGCTCTATCTGCTACAGGAAGATGGGAGGAAGTGGCAGTAGTGAGGAAGCTGATGAAAGAGAGACAAGTGAAGAAGAAAAGTGGGAGAAGTTGGATTGAAGTGAGAGGGGAAGTTCATGTGTTTTTAGCAGGGGACAGAAACCATGAAAGGATTGTGGAGATATATGCAAAGCTAAAAGAGTTGATTTGGGAGATAGAGAAGCTTGGGTATGTCCCAATTTGGGATGAGATGCTTCACGAAGTAGGGGAAAAGGAAAAGAAAGAAGCACTTTGGTATCACAGTGAGAAATTGGCGCTGGCTTATGGGATGCTGACCGGAGTTGCACCACCTGGAAAAGCTCTGCGGATTGTAAAGAATCTAAAAATATGCAGAGATTGTCATCAAGTTTTCAAATATGCAAGTAGGGTGCTGAAGAGGGAGATTATAGTTAGAGATGTAAATAGATACCATAGATTTTCAAATGGTAGTTGCAGCTGTGAAGACATCTGGTAA

mRNA sequence

ATGAACGACTTCTTCGACTCGGACCCAACTCCACCGCCAGCGCCACCCATGGCGACGGTAACGACGGCGGCGTCGAAGTTGGATCCTCGCATTGTGCATGCCCAAGCAATCAAATCCCCGGCCACTGATTGTTCTGTTTACAACAATCTCATCACTCGCTACTCCAAATCAAATTTGCTCCACTACTCTGTTCGTCTCTTCGATCAAATCCCATTTCCCAATGTCGTTTCTTGGACTGCTTTCATCTCTGCTCATTCCAACACATTCCTTTCTCTTCGCCATTTTGTTTCTATGCTCAGATACCCAGCTTTTCCCAATCAACGCACATTAGCTTCTCTTTTCAAAACCTGCGCTTCTCTCCCTTCCATCTCTTTCGGCCTTGCTCTTCACTCCCTTGCCCACAAACTCTCACTTTGCTACGAACCCTACTCTGGCTCTGCACTTGTGAATTTCTACTTCAAATGTCGGTTGTTCGATGACGCTCGTAAGGTGTTCGATGAAATTTCTGACAGAGATGAGGTTTGCTATTCTGCCCTCGTTGTTGGTCTAGCGCAGAATGCGCAGTCCATTGACGCATTGTCGATGTTTAGAGAGATGAAAGCTTCTGATGTTGGGTCGACGCTGTACAGTGTTTCGGGGGCTCTTCGTGCCACGGCTGATCTTGCTGCGTTGGAACAATGCAGGGTTTTACATGCGCACGCTGTGGTTACTGGATTGGATACTGATGTGATTGTTCAAACTGCTTTGATTGATGGGTATGGGAAATCGGGGCTGATAATCGATGCACGTCAGGTGTTCGATGAAAATTTGGGATGTATGAATGTTGTGGGGTGGAATGCAATGTTGGCAAGTTATGCACAGCAGGGAGATAAGAACTCTACGCTTGAGCTATTCAATTCAATGGAAGTTTTCGGGATGTCTCCTGATGAATACAGCTTCTTAGCCATTCTGACATCATTTTGCAATTCACGTTTAGTTAGTGAGATTGAACTGTGGCTGAGAAGGATGAGAGTGGAGTATGGAGTGGAACCTACTCTTGAACATTTTACTTGTCTCATAGATGCAATGGGGAGAGCTGGGAAATTAAAAGAAGCTGAGAGGGTAGCCATGACAATGCCTTTCGTGCCAGATGCCGCGGTTTGGCGTGCGTTGCTATCGAGCTCTGCAAGCCATGGTGCAGGCGATATGGCATGGACGATGGCAAAACGGTTACTGGAGCTCGACCAGCATGACGACTCGGCTTATGTGATTGTTGCGAATGCTCTATCTGCTACAGGAAGATGGGAGGAAGTGGCAGTAGTGAGGAAGCTGATGAAAGAGAGACAAGTGAAGAAGAAAAGTGGGAGAAGTTGGATTGAAGTGAGAGGGGAAGTTCATGTGTTTTTAGCAGGGGACAGAAACCATGAAAGGATTGTGGAGATATATGCAAAGCTAAAAGAGTTGATTTGGGAGATAGAGAAGCTTGGGTATGTCCCAATTTGGGATGAGATGCTTCACGAAGTAGGGGAAAAGGAAAAGAAAGAAGCACTTTGGTATCACAGTGAGAAATTGGCGCTGGCTTATGGGATGCTGACCGGAGTTGCACCACCTGGAAAAGCTCTGCGGATTGTAAAGAATCTAAAAATATGCAGAGATTGTCATCAAGTTTTCAAATATGCAAGTAGGGTGCTGAAGAGGGAGATTATAGTTAGAGATGTAAATAGATACCATAGATTTTCAAATGGTAGTTGCAGCTGTGAAGACATCTGGTAA

Coding sequence (CDS)

ATGAACGACTTCTTCGACTCGGACCCAACTCCACCGCCAGCGCCACCCATGGCGACGGTAACGACGGCGGCGTCGAAGTTGGATCCTCGCATTGTGCATGCCCAAGCAATCAAATCCCCGGCCACTGATTGTTCTGTTTACAACAATCTCATCACTCGCTACTCCAAATCAAATTTGCTCCACTACTCTGTTCGTCTCTTCGATCAAATCCCATTTCCCAATGTCGTTTCTTGGACTGCTTTCATCTCTGCTCATTCCAACACATTCCTTTCTCTTCGCCATTTTGTTTCTATGCTCAGATACCCAGCTTTTCCCAATCAACGCACATTAGCTTCTCTTTTCAAAACCTGCGCTTCTCTCCCTTCCATCTCTTTCGGCCTTGCTCTTCACTCCCTTGCCCACAAACTCTCACTTTGCTACGAACCCTACTCTGGCTCTGCACTTGTGAATTTCTACTTCAAATGTCGGTTGTTCGATGACGCTCGTAAGGTGTTCGATGAAATTTCTGACAGAGATGAGGTTTGCTATTCTGCCCTCGTTGTTGGTCTAGCGCAGAATGCGCAGTCCATTGACGCATTGTCGATGTTTAGAGAGATGAAAGCTTCTGATGTTGGGTCGACGCTGTACAGTGTTTCGGGGGCTCTTCGTGCCACGGCTGATCTTGCTGCGTTGGAACAATGCAGGGTTTTACATGCGCACGCTGTGGTTACTGGATTGGATACTGATGTGATTGTTCAAACTGCTTTGATTGATGGGTATGGGAAATCGGGGCTGATAATCGATGCACGTCAGGTGTTCGATGAAAATTTGGGATGTATGAATGTTGTGGGGTGGAATGCAATGTTGGCAAGTTATGCACAGCAGGGAGATAAGAACTCTACGCTTGAGCTATTCAATTCAATGGAAGTTTTCGGGATGTCTCCTGATGAATACAGCTTCTTAGCCATTCTGACATCATTTTGCAATTCACGTTTAGTTAGTGAGATTGAACTGTGGCTGAGAAGGATGAGAGTGGAGTATGGAGTGGAACCTACTCTTGAACATTTTACTTGTCTCATAGATGCAATGGGGAGAGCTGGGAAATTAAAAGAAGCTGAGAGGGTAGCCATGACAATGCCTTTCGTGCCAGATGCCGCGGTTTGGCGTGCGTTGCTATCGAGCTCTGCAAGCCATGGTGCAGGCGATATGGCATGGACGATGGCAAAACGGTTACTGGAGCTCGACCAGCATGACGACTCGGCTTATGTGATTGTTGCGAATGCTCTATCTGCTACAGGAAGATGGGAGGAAGTGGCAGTAGTGAGGAAGCTGATGAAAGAGAGACAAGTGAAGAAGAAAAGTGGGAGAAGTTGGATTGAAGTGAGAGGGGAAGTTCATGTGTTTTTAGCAGGGGACAGAAACCATGAAAGGATTGTGGAGATATATGCAAAGCTAAAAGAGTTGATTTGGGAGATAGAGAAGCTTGGGTATGTCCCAATTTGGGATGAGATGCTTCACGAAGTAGGGGAAAAGGAAAAGAAAGAAGCACTTTGGTATCACAGTGAGAAATTGGCGCTGGCTTATGGGATGCTGACCGGAGTTGCACCACCTGGAAAAGCTCTGCGGATTGTAAAGAATCTAAAAATATGCAGAGATTGTCATCAAGTTTTCAAATATGCAAGTAGGGTGCTGAAGAGGGAGATTATAGTTAGAGATGTAAATAGATACCATAGATTTTCAAATGGTAGTTGCAGCTGTGAAGACATCTGGTAA

Protein sequence

MNDFFDSDPTPPPAPPMATVTTAASKLDPRIVHAQAIKSPATDCSVYNNLITRYSKSNLLHYSVRLFDQIPFPNVVSWTAFISAHSNTFLSLRHFVSMLRYPAFPNQRTLASLFKTCASLPSISFGLALHSLAHKLSLCYEPYSGSALVNFYFKCRLFDDARKVFDEISDRDEVCYSALVVGLAQNAQSIDALSMFREMKASDVGSTLYSVSGALRATADLAALEQCRVLHAHAVVTGLDTDVIVQTALIDGYGKSGLIIDARQVFDENLGCMNVVGWNAMLASYAQQGDKNSTLELFNSMEVFGMSPDEYSFLAILTSFCNSRLVSEIELWLRRMRVEYGVEPTLEHFTCLIDAMGRAGKLKEAERVAMTMPFVPDAAVWRALLSSSASHGAGDMAWTMAKRLLELDQHDDSAYVIVANALSATGRWEEVAVVRKLMKERQVKKKSGRSWIEVRGEVHVFLAGDRNHERIVEIYAKLKELIWEIEKLGYVPIWDEMLHEVGEKEKKEALWYHSEKLALAYGMLTGVAPPGKALRIVKNLKICRDCHQVFKYASRVLKREIIVRDVNRYHRFSNGSCSCEDIW
Homology
BLAST of HG10012837 vs. NCBI nr
Match: XP_038880033.1 (pentatricopeptide repeat-containing protein At4g33170-like [Benincasa hispida])

HSP 1 Score: 1060.8 bits (2742), Expect = 4.1e-306
Identity = 532/583 (91.25%), Postives = 551/583 (94.51%), Query Frame = 0

Query: 1   MNDFFDSDPTPPPAPPMATVTTAASKLDPRIVHAQAIKSPATDCSVYNNLITRYSKSNLL 60
           MNDFFDSDPTPPPA  MA  TTA +K DPRI HAQAIKSPATD SVYN+LIT YSKSNLL
Sbjct: 1   MNDFFDSDPTPPPALSMAATTTAVAKSDPRIRHAQAIKSPATDRSVYNSLITLYSKSNLL 60

Query: 61  HYSVRLFDQIPFPNVVSWTAFISAHSNTFLSLRHFVSMLRYPAFPNQRTLASLFKTCASL 120
           HYSVRLFDQIPFP+VVSWTAFISAHSNTFLSLRHFVSMLRYP FPNQRTLASLFKTC SL
Sbjct: 61  HYSVRLFDQIPFPSVVSWTAFISAHSNTFLSLRHFVSMLRYPVFPNQRTLASLFKTCVSL 120

Query: 121 PSISFGLALHSLAHKLSLCYEPYSGSALVNFYFKCRLFDDARKVFDEISDRDEVCYSALV 180
             +SFGL+LHSLAHKLSLC EPYSGSALVNFY KCRL DDARKVFDEISDRDEVCYSALV
Sbjct: 121 SCVSFGLSLHSLAHKLSLCSEPYSGSALVNFYSKCRLLDDARKVFDEISDRDEVCYSALV 180

Query: 181 VGLAQNAQSIDALSMFREMKASDVGSTLYSVSGALRATADLAALEQCRVLHAHAVVTGLD 240
           VGLAQNAQSI ALSMFREMKASDV ST+ SVSGALRATADLA LEQCRVLHAHAVVTGLD
Sbjct: 181 VGLAQNAQSIAALSMFREMKASDVASTMQSVSGALRATADLATLEQCRVLHAHAVVTGLD 240

Query: 241 TDVIVQTALIDGYGKSGLIIDARQVFDENLGCMNVVGWNAMLASYAQQGDKNSTLELFNS 300
           TDVIVQT+LIDGYGKSGLIIDARQVFDENLGCMNVVGWNAMLASYAQQGDKNSTL LFNS
Sbjct: 241 TDVIVQTSLIDGYGKSGLIIDARQVFDENLGCMNVVGWNAMLASYAQQGDKNSTLLLFNS 300

Query: 301 MEVFGMSPDEYSFLAILTSFCNSRLVSEIELWLRRMRVEYGVEPTLEHFTCLIDAMGRAG 360
           M+ FGMSPDEYSFLAILTS C+S LVSEIELWLRRMRVEYGVEPTLEHFTCLIDAMGR G
Sbjct: 301 MQAFGMSPDEYSFLAILTSLCSSSLVSEIELWLRRMRVEYGVEPTLEHFTCLIDAMGRVG 360

Query: 361 KLKEAERVAMTMPFVPDAAVWRALLSSSASHGAGDMAWTMAKRLLELDQHDDSAYVIVAN 420
           KLKEAERVAMTMPFVPDAAVWRALLSSSASHGAGDMAWTMAKRLLE++Q DDSAYVIVAN
Sbjct: 361 KLKEAERVAMTMPFVPDAAVWRALLSSSASHGAGDMAWTMAKRLLEINQRDDSAYVIVAN 420

Query: 421 ALSATGRWEEVAVVRKLMKERQVKKKSGRSWIEVRGEVHVFLAGDRNHERIVEIYAKLKE 480
           ALSAT RWEEVA VRKLMKER+VKKKSGRSWIEVRGEVHVFLAGDRNHER  EIY KL+E
Sbjct: 421 ALSATERWEEVAEVRKLMKERKVKKKSGRSWIEVRGEVHVFLAGDRNHERNEEIYGKLRE 480

Query: 481 LIWEIEKLGYVPIWDEMLHEVGEKEKKEALWYHSEKLALAYGMLTGVAPPGKALRIVKNL 540
           L+WE+EKLGYVPIWDEML EVGEKEKKEALWYHSEKLALAYGMLTGVAPPGKALRIVKNL
Sbjct: 481 LMWEVEKLGYVPIWDEMLQEVGEKEKKEALWYHSEKLALAYGMLTGVAPPGKALRIVKNL 540

Query: 541 KICRDCHQVFKYASRVLKREIIVRDVNRYHRFSNGSCSCEDIW 584
           +ICRDCHQVFKYASRVLKREIIVRD+NRYHRFSNGSC+C+DIW
Sbjct: 541 RICRDCHQVFKYASRVLKREIIVRDINRYHRFSNGSCTCKDIW 583

BLAST of HG10012837 vs. NCBI nr
Match: XP_022974509.1 (pentatricopeptide repeat-containing protein At4g33170-like [Cucurbita maxima])

HSP 1 Score: 1038.1 bits (2683), Expect = 2.9e-299
Identity = 514/583 (88.16%), Postives = 542/583 (92.97%), Query Frame = 0

Query: 1   MNDFFDSDPTPPPAPPMATVTTAASKLDPRIVHAQAIKSPATDCSVYNNLITRYSKSNLL 60
           MND FD DP PP APP+A  T AA K DPRI+HAQAIKSPATD SVYNNLIT Y K+NLL
Sbjct: 1   MNDLFDPDPIPPRAPPLAAATAAAGKSDPRIIHAQAIKSPATDRSVYNNLITLYGKANLL 60

Query: 61  HYSVRLFDQIPFPNVVSWTAFISAHSNTFLSLRHFVSMLRYPAFPNQRTLASLFKTCASL 120
           HYSVRLFDQIP PNVVSWTA ISAHSNTFL+LRHFVSMLRYP+FPNQRTLASLFKTCASL
Sbjct: 61  HYSVRLFDQIPLPNVVSWTALISAHSNTFLALRHFVSMLRYPSFPNQRTLASLFKTCASL 120

Query: 121 PSISFGLALHSLAHKLSLCYEPYSGSALVNFYFKCRLFDDARKVFDEISDRDEVCYSALV 180
           P +SFGL+LHSLAHKLSLC EP+ GSALVNFY K  LFDDARKVFDEISD DEVCYSALV
Sbjct: 121 PCVSFGLSLHSLAHKLSLCSEPFCGSALVNFYSKSHLFDDARKVFDEISDTDEVCYSALV 180

Query: 181 VGLAQNAQSIDALSMFREMKASDVGSTLYSVSGALRATADLAALEQCRVLHAHAVVTGLD 240
           VGLAQNAQSIDALSMFREMKAS+V ST+YSVSGALRA ADLAALEQCRVLH HAV TGLD
Sbjct: 181 VGLAQNAQSIDALSMFREMKASEVASTMYSVSGALRAAADLAALEQCRVLHGHAVSTGLD 240

Query: 241 TDVIVQTALIDGYGKSGLIIDARQVFDENLGCMNVVGWNAMLASYAQQGDKNSTLELFNS 300
           TDVIVQTALIDGYGKSGLIIDARQVF+E+LGCMNVVGWNAMLASYAQQGDKNSTLELFNS
Sbjct: 241 TDVIVQTALIDGYGKSGLIIDARQVFEESLGCMNVVGWNAMLASYAQQGDKNSTLELFNS 300

Query: 301 MEVFGMSPDEYSFLAILTSFCNSRLVSEIELWLRRMRVEYGVEPTLEHFTCLIDAMGRAG 360
           M  FG+ PDEY+FLAILTS+CNSRLV EIELWLRRMRVEYGV+PTLEHFTCLIDAMGR G
Sbjct: 301 MGSFGLCPDEYTFLAILTSYCNSRLVGEIELWLRRMRVEYGVQPTLEHFTCLIDAMGRDG 360

Query: 361 KLKEAERVAMTMPFVPDAAVWRALLSSSASHGAGDMAWTMAKRLLELDQHDDSAYVIVAN 420
           KL+EAER+AMTMPFVPDAAVWRALLSSSASHGAGDMAW MAKRLLEL+ HDDSAYVIVAN
Sbjct: 361 KLEEAERIAMTMPFVPDAAVWRALLSSSASHGAGDMAWAMAKRLLELNPHDDSAYVIVAN 420

Query: 421 ALSATGRWEEVAVVRKLMKERQVKKKSGRSWIEVRGEVHVFLAGDRNHERIVEIYAKLKE 480
           ALS+T RWEEVA+VRK+MKERQVKKKSGRSWIEVRGEVHVFLAGDRNHER  EIYAKL+ 
Sbjct: 421 ALSSTARWEEVAIVRKMMKERQVKKKSGRSWIEVRGEVHVFLAGDRNHERTEEIYAKLRN 480

Query: 481 LIWEIEKLGYVPIWDEMLHEVGEKEKKEALWYHSEKLALAYGMLTGVAPPGKALRIVKNL 540
           LIWEIEKLGYVPIW EMLHEVGEKEKKEALWYHSEKLALAYG+L G APPGKALRIVKNL
Sbjct: 481 LIWEIEKLGYVPIWAEMLHEVGEKEKKEALWYHSEKLALAYGVLVGAAPPGKALRIVKNL 540

Query: 541 KICRDCHQVFKYASRVLKREIIVRDVNRYHRFSNGSCSCEDIW 584
           +ICRDCHQVFKYASRVL+REI+VRDVNRYHRF NGSC+C DIW
Sbjct: 541 RICRDCHQVFKYASRVLRREIVVRDVNRYHRFLNGSCTCGDIW 583

BLAST of HG10012837 vs. NCBI nr
Match: XP_022947552.1 (pentatricopeptide repeat-containing protein At4g33170-like [Cucurbita moschata])

HSP 1 Score: 1034.2 bits (2673), Expect = 4.1e-298
Identity = 513/583 (87.99%), Postives = 541/583 (92.80%), Query Frame = 0

Query: 1   MNDFFDSDPTPPPAPPMATVTTAASKLDPRIVHAQAIKSPATDCSVYNNLITRYSKSNLL 60
           MND FDSDP PP APP+A  T AA K DPRI+HAQAIKSPATD SVYNNLIT Y K+NLL
Sbjct: 1   MNDLFDSDPIPPRAPPLAAATAAAGKSDPRIIHAQAIKSPATDRSVYNNLITLYGKANLL 60

Query: 61  HYSVRLFDQIPFPNVVSWTAFISAHSNTFLSLRHFVSMLRYPAFPNQRTLASLFKTCASL 120
           HYSVRLFDQIP PNVVSWTA ISAHSNTFL+LRHFVSMLRYP+FPNQRTLASL KTCASL
Sbjct: 61  HYSVRLFDQIPLPNVVSWTALISAHSNTFLALRHFVSMLRYPSFPNQRTLASLLKTCASL 120

Query: 121 PSISFGLALHSLAHKLSLCYEPYSGSALVNFYFKCRLFDDARKVFDEISDRDEVCYSALV 180
           P +SFGL+LHSLAHKLSLC EP+ GSALVNFY K  LFDDARKVFDEISD DEVCYSALV
Sbjct: 121 PCVSFGLSLHSLAHKLSLCSEPFCGSALVNFYSKSHLFDDARKVFDEISDTDEVCYSALV 180

Query: 181 VGLAQNAQSIDALSMFREMKASDVGSTLYSVSGALRATADLAALEQCRVLHAHAVVTGLD 240
           VGLAQNAQSIDALS+FREMKASDV ST+YSVSGALRA ADLAALEQCRVLH HAVV GLD
Sbjct: 181 VGLAQNAQSIDALSVFREMKASDVASTMYSVSGALRAAADLAALEQCRVLHGHAVVVGLD 240

Query: 241 TDVIVQTALIDGYGKSGLIIDARQVFDENLGCMNVVGWNAMLASYAQQGDKNSTLELFNS 300
           TDVIVQT+LIDGYGKSGLIIDARQVF+E+LGCMNVVGWNAMLASYAQQGDKNSTLELFNS
Sbjct: 241 TDVIVQTSLIDGYGKSGLIIDARQVFEESLGCMNVVGWNAMLASYAQQGDKNSTLELFNS 300

Query: 301 MEVFGMSPDEYSFLAILTSFCNSRLVSEIELWLRRMRVEYGVEPTLEHFTCLIDAMGRAG 360
           M  FG+ PDEY+FLAILTS+CNSRLV EIELWLRRM VEYGV+PTLEH+TCLIDAMGRAG
Sbjct: 301 MVSFGLCPDEYTFLAILTSYCNSRLVGEIELWLRRMGVEYGVQPTLEHYTCLIDAMGRAG 360

Query: 361 KLKEAERVAMTMPFVPDAAVWRALLSSSASHGAGDMAWTMAKRLLELDQHDDSAYVIVAN 420
           KL+EAER+AMTMPFVPDAAVWRALLSSSASHGAGDMAW MAKRLLEL+ HDDSAYVIVAN
Sbjct: 361 KLEEAERIAMTMPFVPDAAVWRALLSSSASHGAGDMAWAMAKRLLELNPHDDSAYVIVAN 420

Query: 421 ALSATGRWEEVAVVRKLMKERQVKKKSGRSWIEVRGEVHVFLAGDRNHERIVEIYAKLKE 480
           ALS+T RWEEVAVVRK+MKERQVKKKSGRSWIEVRGEVHVFLAGDRNHER  EIYAKL+ 
Sbjct: 421 ALSSTARWEEVAVVRKMMKERQVKKKSGRSWIEVRGEVHVFLAGDRNHERTEEIYAKLRN 480

Query: 481 LIWEIEKLGYVPIWDEMLHEVGEKEKKEALWYHSEKLALAYGMLTGVAPPGKALRIVKNL 540
           LIWEIEKLGYVPIW EMLHEVGEKEKKEALWYHSEKLALAYG+L G APPGKALRIVKNL
Sbjct: 481 LIWEIEKLGYVPIWAEMLHEVGEKEKKEALWYHSEKLALAYGVLVGAAPPGKALRIVKNL 540

Query: 541 KICRDCHQVFKYASRVLKREIIVRDVNRYHRFSNGSCSCEDIW 584
           +ICRDCHQVFKYASRVL REI+VRDVNRYHRF NGSC+C DIW
Sbjct: 541 RICRDCHQVFKYASRVLGREIVVRDVNRYHRFLNGSCTCGDIW 583

BLAST of HG10012837 vs. NCBI nr
Match: KAG6597338.1 (Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1034.2 bits (2673), Expect = 4.1e-298
Identity = 515/584 (88.18%), Postives = 543/584 (92.98%), Query Frame = 0

Query: 1   MNDFFDSDPTPPPAPPMATVTTAAS-KLDPRIVHAQAIKSPATDCSVYNNLITRYSKSNL 60
           MND FDSDP PP APP+A  T AA+ K DPRI+HAQAIKSPATD SVYNNLIT Y K+NL
Sbjct: 1   MNDLFDSDPIPPRAPPLAAATVAAAGKSDPRIIHAQAIKSPATDRSVYNNLITLYGKANL 60

Query: 61  LHYSVRLFDQIPFPNVVSWTAFISAHSNTFLSLRHFVSMLRYPAFPNQRTLASLFKTCAS 120
           LHYSVRLFDQIP PNVVSWTA ISAHSNTFL+LRHFVSMLRYP+FPNQRTLASLFKTCAS
Sbjct: 61  LHYSVRLFDQIPLPNVVSWTALISAHSNTFLALRHFVSMLRYPSFPNQRTLASLFKTCAS 120

Query: 121 LPSISFGLALHSLAHKLSLCYEPYSGSALVNFYFKCRLFDDARKVFDEISDRDEVCYSAL 180
           LP +SFGL+LHSLAHKLSLC EP+ GSALVNFY K  LFDDARKVFDEISD DEVCYSAL
Sbjct: 121 LPCVSFGLSLHSLAHKLSLCSEPFCGSALVNFYSKSHLFDDARKVFDEISDTDEVCYSAL 180

Query: 181 VVGLAQNAQSIDALSMFREMKASDVGSTLYSVSGALRATADLAALEQCRVLHAHAVVTGL 240
           VVGLAQNAQSIDALS+FREMKAS+V ST+YSVSGALRA ADLAALEQCRVLH HAVV GL
Sbjct: 181 VVGLAQNAQSIDALSVFREMKASEVASTMYSVSGALRAAADLAALEQCRVLHGHAVVVGL 240

Query: 241 DTDVIVQTALIDGYGKSGLIIDARQVFDENLGCMNVVGWNAMLASYAQQGDKNSTLELFN 300
           DTDVIVQTALIDGYGKSGLIIDARQVF+E+LGCMNVVGWNAMLASYAQQGDKNSTLELFN
Sbjct: 241 DTDVIVQTALIDGYGKSGLIIDARQVFEESLGCMNVVGWNAMLASYAQQGDKNSTLELFN 300

Query: 301 SMEVFGMSPDEYSFLAILTSFCNSRLVSEIELWLRRMRVEYGVEPTLEHFTCLIDAMGRA 360
           SM  FG+ PDEY+FLAILTS+CNSRLV EIELWLRRMRVEYGV+PTLEHFTCLIDAMGRA
Sbjct: 301 SMVSFGLCPDEYTFLAILTSYCNSRLVGEIELWLRRMRVEYGVQPTLEHFTCLIDAMGRA 360

Query: 361 GKLKEAERVAMTMPFVPDAAVWRALLSSSASHGAGDMAWTMAKRLLELDQHDDSAYVIVA 420
           GKL+EAER+AMTMPFVPDAAVWRALLSSSASHGAGDMAW MAKRLLEL+ HDDSAYVIVA
Sbjct: 361 GKLEEAERIAMTMPFVPDAAVWRALLSSSASHGAGDMAWAMAKRLLELNPHDDSAYVIVA 420

Query: 421 NALSATGRWEEVAVVRKLMKERQVKKKSGRSWIEVRGEVHVFLAGDRNHERIVEIYAKLK 480
           NALS+T RWEEVAVVRK+MKERQVKKKSGRSWIEVRGEVHVFLAGDRNHER  EIYAKL+
Sbjct: 421 NALSSTARWEEVAVVRKMMKERQVKKKSGRSWIEVRGEVHVFLAGDRNHERTEEIYAKLR 480

Query: 481 ELIWEIEKLGYVPIWDEMLHEVGEKEKKEALWYHSEKLALAYGMLTGVAPPGKALRIVKN 540
            LIWEIEKLGYVPIW EMLHEVGEKEKKEALWYHSEKLALAYG+L G APPGKALRIVKN
Sbjct: 481 NLIWEIEKLGYVPIWAEMLHEVGEKEKKEALWYHSEKLALAYGVLVGAAPPGKALRIVKN 540

Query: 541 LKICRDCHQVFKYASRVLKREIIVRDVNRYHRFSNGSCSCEDIW 584
           L+ICRDCHQVFKYASRVL REI+ RDVNRYHRF NGSC+C DIW
Sbjct: 541 LRICRDCHQVFKYASRVLGREIVARDVNRYHRFLNGSCTCGDIW 584

BLAST of HG10012837 vs. NCBI nr
Match: XP_023540846.1 (pentatricopeptide repeat-containing protein At4g33170-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1030.8 bits (2664), Expect = 4.6e-297
Identity = 511/583 (87.65%), Postives = 540/583 (92.62%), Query Frame = 0

Query: 1   MNDFFDSDPTPPPAPPMATVTTAASKLDPRIVHAQAIKSPATDCSVYNNLITRYSKSNLL 60
           MND FD DP PP APP+A  T AA K DPRI+HAQAIKSPATD SVYNNLIT Y ++NLL
Sbjct: 1   MNDLFDPDPIPPRAPPLAAATAAAGKSDPRIIHAQAIKSPATDRSVYNNLITLYGRANLL 60

Query: 61  HYSVRLFDQIPFPNVVSWTAFISAHSNTFLSLRHFVSMLRYPAFPNQRTLASLFKTCASL 120
           HYSVRLFDQIP PNVVSWTA ISAHSNTFL+LRHFVSMLR P+FPNQRTLASLFKTCASL
Sbjct: 61  HYSVRLFDQIPLPNVVSWTALISAHSNTFLALRHFVSMLRCPSFPNQRTLASLFKTCASL 120

Query: 121 PSISFGLALHSLAHKLSLCYEPYSGSALVNFYFKCRLFDDARKVFDEISDRDEVCYSALV 180
           P +SFGL+LHSLAHKLSLC EP+ GSALVNFY K  LFDDARKVFDEISD DEVCYSALV
Sbjct: 121 PCVSFGLSLHSLAHKLSLCSEPFCGSALVNFYSKSHLFDDARKVFDEISDTDEVCYSALV 180

Query: 181 VGLAQNAQSIDALSMFREMKASDVGSTLYSVSGALRATADLAALEQCRVLHAHAVVTGLD 240
           VGLAQNAQSIDALS+FREMKAS+V ST+YSVSGALRA ADLAALEQCRVLH HAVV GLD
Sbjct: 181 VGLAQNAQSIDALSVFREMKASEVASTMYSVSGALRAAADLAALEQCRVLHGHAVVVGLD 240

Query: 241 TDVIVQTALIDGYGKSGLIIDARQVFDENLGCMNVVGWNAMLASYAQQGDKNSTLELFNS 300
           TDVIVQT+LIDGYGKSGLIIDARQVF+E+LGCMNVVGWNAMLASYAQQGDKNSTLELFNS
Sbjct: 241 TDVIVQTSLIDGYGKSGLIIDARQVFEESLGCMNVVGWNAMLASYAQQGDKNSTLELFNS 300

Query: 301 MEVFGMSPDEYSFLAILTSFCNSRLVSEIELWLRRMRVEYGVEPTLEHFTCLIDAMGRAG 360
           M  FG+ PDEY+FLAILTS+CNSRLV EIELWLRRMRVEYGV+PTLEHFTCLIDAMGRAG
Sbjct: 301 MVSFGLCPDEYTFLAILTSYCNSRLVGEIELWLRRMRVEYGVQPTLEHFTCLIDAMGRAG 360

Query: 361 KLKEAERVAMTMPFVPDAAVWRALLSSSASHGAGDMAWTMAKRLLELDQHDDSAYVIVAN 420
           KL+EAER+AMTMPFVPDAAVWRALLSSSASHGAGDMAW MAKRLLEL+ HDDSAYVIVAN
Sbjct: 361 KLEEAERIAMTMPFVPDAAVWRALLSSSASHGAGDMAWAMAKRLLELNPHDDSAYVIVAN 420

Query: 421 ALSATGRWEEVAVVRKLMKERQVKKKSGRSWIEVRGEVHVFLAGDRNHERIVEIYAKLKE 480
           ALS+T RWEEVAVVRK+MKERQVKKKSGRSWIEVRGEVH FLAGDRNHER  EIYAKL+ 
Sbjct: 421 ALSSTARWEEVAVVRKMMKERQVKKKSGRSWIEVRGEVHAFLAGDRNHERTEEIYAKLRN 480

Query: 481 LIWEIEKLGYVPIWDEMLHEVGEKEKKEALWYHSEKLALAYGMLTGVAPPGKALRIVKNL 540
           LIWEIEKLGYVPIW EMLHEVGEKEKKEALWYHSEKLALAYG+L G APPGKALRIVKNL
Sbjct: 481 LIWEIEKLGYVPIWAEMLHEVGEKEKKEALWYHSEKLALAYGVLVGAAPPGKALRIVKNL 540

Query: 541 KICRDCHQVFKYASRVLKREIIVRDVNRYHRFSNGSCSCEDIW 584
           +ICRDCHQVFKYASRVL REI+VRDVNRYHRF NGSC+C DIW
Sbjct: 541 RICRDCHQVFKYASRVLGREIVVRDVNRYHRFLNGSCTCGDIW 583

BLAST of HG10012837 vs. ExPASy Swiss-Prot
Match: Q9SMZ2 (Pentatricopeptide repeat-containing protein At4g33170 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H53 PE=3 SV=1)

HSP 1 Score: 386.7 bits (992), Expect = 4.6e-106
Identity = 214/581 (36.83%), Postives = 349/581 (60.07%), Query Frame = 0

Query: 12  PPAPPMATVTTAASKLDPRI-----VHAQAIK-SPATDCSVYNNLITRYSKSNLLHYSVR 71
           P    M +V  AAS L   +     VH  AIK +  +D  V   LI  YS++  +  +  
Sbjct: 414 PDQYTMTSVLKAASSLPEGLSLSKQVHVHAIKINNVSDSFVSTALIDAYSRNRCMKEAEI 473

Query: 72  LFDQIPFPNVVSWTAFISAHSNT---FLSLRHFVSMLRYPAFPNQRTLASLFKTCASLPS 131
           LF++  F ++V+W A ++ ++ +     +L+ F  M +     +  TLA++FKTC  L +
Sbjct: 474 LFERHNF-DLVAWNAMMAGYTQSHDGHKTLKLFALMHKQGERSDDFTLATVFKTCGFLFA 533

Query: 132 ISFGLALHSLAHKLSLCYEPYSGSALVNFYFKCRLFDDARKVFDEISDRDEVCYSALVVG 191
           I+ G  +H+ A K     + +  S +++ Y KC     A+  FD I   D+V ++ ++ G
Sbjct: 534 INQGKQVHAYAIKSGYDLDLWVSSGILDMYVKCGDMSAAQFAFDSIPVPDDVAWTTMISG 593

Query: 192 LAQNAQSIDALSMFREMKASDVGSTLYSVSGALRATADLAALEQCRVLHAHAVVTGLDTD 251
             +N +   A  +F +M+   V    ++++   +A++ L ALEQ R +HA+A+      D
Sbjct: 594 CIENGEEERAFHVFSQMRLMGVLPDEFTIATLAKASSCLTALEQGRQIHANALKLNCTND 653

Query: 252 VIVQTALIDGYGKSGLIIDARQVFDENLGCMNVVGWNAMLASYAQQGDKNSTLELFNSME 311
             V T+L+D Y K G I DA  +F + +  MN+  WNAML   AQ G+   TL+LF  M+
Sbjct: 654 PFVGTSLVDMYAKCGSIDDAYCLF-KRIEMMNITAWNAMLVGLAQHGEGKETLQLFKQMK 713

Query: 312 VFGMSPDEYSFLAILTSFCNSRLVSEIELWLRRMRVEYGVEPTLEHFTCLIDAMGRAGKL 371
             G+ PD+ +F+ +L++  +S LVSE    +R M  +YG++P +EH++CL DA+GRAG +
Sbjct: 714 SLGIKPDKVTFIGVLSACSHSGLVSEAYKHMRSMHGDYGIKPEIEHYSCLADALGRAGLV 773

Query: 372 KEAERVAMTMPFVPDAAVWRALLSSSASHGAGDMAWTMAKRLLELDQHDDSAYVIVANAL 431
           K+AE +  +M     A+++R LL++    G  +    +A +LLEL+  D SAYV+++N  
Sbjct: 774 KQAENLIESMSMEASASMYRTLLAACRVQGDTETGKRVATKLLELEPLDSSAYVLLSNMY 833

Query: 432 SATGRWEEVAVVRKLMKERQVKKKSGRSWIEVRGEVHVFLAGDRNHERIVEIYAKLKELI 491
           +A  +W+E+ + R +MK  +VKK  G SWIEV+ ++H+F+  DR++ +   IY K+K++I
Sbjct: 834 AAASKWDEMKLARTMMKGHKVKKDPGFSWIEVKNKIHIFVVDDRSNRQTELIYRKVKDMI 893

Query: 492 WEIEKLGYVPIWDEMLHEVGEKEKKEALWYHSEKLALAYGMLTGVAPPGKALRIVKNLKI 551
            +I++ GYVP  D  L +V E+EK+ AL+YHSEKLA+A+G+L+   PP   +R++KNL++
Sbjct: 894 RDIKQEGYVPETDFTLVDVEEEEKERALYYHSEKLAVAFGLLS--TPPSTPIRVIKNLRV 953

Query: 552 CRDCHQVFKYASRVLKREIIVRDVNRYHRFSNGSCSCEDIW 584
           C DCH   KY ++V  REI++RD NR+HRF +G CSC D W
Sbjct: 954 CGDCHNAMKYIAKVYNREIVLRDANRFHRFKDGICSCGDYW 990

BLAST of HG10012837 vs. ExPASy Swiss-Prot
Match: Q9LTF4 (Putative pentatricopeptide repeat-containing protein At5g52630 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H52 PE=3 SV=1)

HSP 1 Score: 362.5 bits (929), Expect = 9.2e-99
Identity = 205/541 (37.89%), Postives = 312/541 (57.67%), Query Frame = 0

Query: 46  VYNNLITRYSKSNLLHYSVRLFDQIPFPNVVSWTAFISAHSNT---FLSLRHFVSMLRYP 105
           V NNLI  YSKS L   S R F+  P  +  +W++ IS  +     ++SL     M+   
Sbjct: 52  VANNLINFYSKSQLPFDSRRAFEDSPQKSSTTWSSIISCFAQNELPWMSLEFLKKMMAGN 111

Query: 106 AFPNQRTLASLFKTCASLPSISFGLALHSLAHKLSLCYEPYSGSALVNFYFKCRLFDDAR 165
             P+   L S  K+CA L     G ++H L+ K     + + GS+LV+ Y KC     AR
Sbjct: 112 LRPDDHVLPSATKSCAILSRCDIGRSVHCLSMKTGYDADVFVGSSLVDMYAKCGEIVYAR 171

Query: 166 KVFDEISDRDEVCYSALVVGLAQNAQSIDALSMFREMKASDVGSTLYSVSGALRATADLA 225
           K+FDE+  R+ V +S ++ G AQ  ++ +AL +F+E    ++    YS S  +   A+  
Sbjct: 172 KMFDEMPQRNVVTWSGMMYGYAQMGENEEALWLFKEALFENLAVNDYSFSSVISVCANST 231

Query: 226 ALEQCRVLHAHAVVTGLDTDVIVQTALIDGYGKSGLIIDARQVFDENLGCMNVVGWNAML 285
            LE  R +H  ++ +  D+   V ++L+  Y K G+   A QVF+E +   N+  WNAML
Sbjct: 232 LLELGRQIHGLSIKSSFDSSSFVGSSLVSLYSKCGVPEGAYQVFNE-VPVKNLGIWNAML 291

Query: 286 ASYAQQGDKNSTLELFNSMEVFGMSPDEYSFLAILTSFCNSRLVSEIELWLRRMRVEYGV 345
            +YAQ       +ELF  M++ GM P+  +FL +L +  ++ LV E   +  +M+ E  +
Sbjct: 292 KAYAQHSHTQKVIELFKRMKLSGMKPNFITFLNVLNACSHAGLVDEGRYYFDQMK-ESRI 351

Query: 346 EPTLEHFTCLIDAMGRAGKLKEAERVAMTMPFVPDAAVWRALLSSSASHGAGDMAWTMAK 405
           EPT +H+  L+D +GRAG+L+EA  V   MP  P  +VW ALL+S   H   ++A   A 
Sbjct: 352 EPTDKHYASLVDMLGRAGRLQEALEVITNMPIDPTESVWGALLTSCTVHKNTELAAFAAD 411

Query: 406 RLLELDQHDDSAYVIVANALSATGRWEEVAVVRKLMKERQVKKKSGRSWIEVRGEVHVFL 465
           ++ EL       ++ ++NA +A GR+E+ A  RKL+++R  KK++G SW+E R +VH F 
Sbjct: 412 KVFELGPVSSGMHISLSNAYAADGRFEDAAKARKLLRDRGEKKETGLSWVEERNKVHTFA 471

Query: 466 AGDRNHERIVEIYAKLKELIWEIEKLGYVPIWDEMLHEVGEKEKKEALWYHSEKLALAYG 525
           AG+R HE+  EIY KL EL  E+EK GY+     +L EV   EK + + YHSE+LA+A+G
Sbjct: 472 AGERRHEKSKEIYEKLAELGEEMEKAGYIADTSYVLREVDGDEKNQTIRYHSERLAIAFG 531

Query: 526 MLTGVAPPGKALRIVKNLKICRDCHQVFKYASRVLKREIIVRDVNRYHRFSNGSCSCEDI 584
           ++T   P  + +R++KNL++C DCH   K+ S   +R IIVRD NR+HRF +G CSC D 
Sbjct: 532 LIT--FPADRPIRVMKNLRVCGDCHNAIKFMSVCTRRVIIVRDNNRFHRFEDGKCSCNDY 588

BLAST of HG10012837 vs. ExPASy Swiss-Prot
Match: Q9LIQ7 (Pentatricopeptide repeat-containing protein At3g24000, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-H87 PE=3 SV=1)

HSP 1 Score: 360.1 bits (923), Expect = 4.6e-98
Identity = 205/558 (36.74%), Postives = 317/558 (56.81%), Query Frame = 0

Query: 30  RIVHAQAIKSP-ATDCSVYNNLITRYSKSNLLHYSVRLFDQIPFPNVVSWTAFISA---H 89
           RIVHA  ++S    D  + N L+  Y+K   L  + ++F+++P  + V+WT  IS    H
Sbjct: 80  RIVHAHILQSIFRHDIVMGNTLLNMYAKCGSLEEARKVFEKMPQRDFVTWTTLISGYSQH 139

Query: 90  SNTFLSLRHFVSMLRYPAFPNQRTLASLFKTCASLPSISFGLALHSLAHKLSLCYEPYSG 149
                +L  F  MLR+   PN+ TL+S+ K  A+      G  LH    K       + G
Sbjct: 140 DRPCDALLFFNQMLRFGYSPNEFTLSSVIKAAAAERRGCCGHQLHGFCVKCGFDSNVHVG 199

Query: 150 SALVNFYFKCRLFDDARKVFDEISDRDEVCYSALVVGLAQNAQSIDALSMFREMKASDVG 209
           SAL++ Y +  L DDA+ VFD +  R++V ++AL+ G A+ + +  AL +F+ M      
Sbjct: 200 SALLDLYTRYGLMDDAQLVFDALESRNDVSWNALIAGHARRSGTEKALELFQGMLRDGFR 259

Query: 210 STLYSVSGALRATADLAALEQCRVLHAHAVVTGLDTDVIVQTALIDGYGKSGLIIDARQV 269
            + +S +    A +    LEQ + +HA+ + +G          L+D Y KSG I DAR++
Sbjct: 260 PSHFSYASLFGACSSTGFLEQGKWVHAYMIKSGEKLVAFAGNTLLDMYAKSGSIHDARKI 319

Query: 270 FDENLGCMNVVGWNAMLASYAQQGDKNSTLELFNSMEVFGMSPDEYSFLAILTSFCNSRL 329
           FD  L   +VV WN++L +YAQ G     +  F  M   G+ P+E SFL++LT+  +S L
Sbjct: 320 FD-RLAKRDVVSWNSLLTAYAQHGFGKEAVWWFEEMRRVGIRPNEISFLSVLTACSHSGL 379

Query: 330 VSEIELWLRRMRVEYGVEPTLEHFTCLIDAMGRAGKLKEAERVAMTMPFVPDAAVWRALL 389
           + E   +   M+ + G+ P   H+  ++D +GRAG L  A R    MP  P AA+W+ALL
Sbjct: 380 LDEGWHYYELMKKD-GIVPEAWHYVTVVDLLGRAGDLNRALRFIEEMPIEPTAAIWKALL 439

Query: 390 SSSASHGAGDMAWTMAKRLLELDQHDDSAYVIVANALSATGRWEEVAVVRKLMKERQVKK 449
           ++   H   ++    A+ + ELD  D   +VI+ N  ++ GRW + A VRK MKE  VKK
Sbjct: 440 NACRMHKNTELGAYAAEHVFELDPDDPGPHVILYNIYASGGRWNDAARVRKKMKESGVKK 499

Query: 450 KSGRSWIEVRGEVHVFLAGDRNHERIVEIYAKLKELIWEIEKLGYVPIWDEMLHEVGEKE 509
           +   SW+E+   +H+F+A D  H +  EI  K +E++ +I++LGYVP    ++  V ++E
Sbjct: 500 EPACSWVEIENAIHMFVANDERHPQREEIARKWEEVLAKIKELGYVPDTSHVIVHVDQQE 559

Query: 510 KKEALWYHSEKLALAYGMLTGVAPPGKALRIVKNLKICRDCHQVFKYASRVLKREIIVRD 569
           ++  L YHSEK+ALA+ +L    PPG  + I KN+++C DCH   K AS+V+ REIIVRD
Sbjct: 560 REVNLQYHSEKIALAFALLN--TPPGSTIHIKKNIRVCGDCHTAIKLASKVVGREIIVRD 619

Query: 570 VNRYHRFSNGSCSCEDIW 584
            NR+H F +G+CSC+D W
Sbjct: 620 TNRFHHFKDGNCSCKDYW 633

BLAST of HG10012837 vs. ExPASy Swiss-Prot
Match: Q9SI53 (Pentatricopeptide repeat-containing protein At2g03880, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-H44 PE=2 SV=1)

HSP 1 Score: 344.0 bits (881), Expect = 3.4e-93
Identity = 195/539 (36.18%), Postives = 313/539 (58.07%), Query Frame = 0

Query: 48  NNLITRYSKSNLLHYSVRLFDQIPFPNVVSWTAFISAHSNTFL---SLRHFVSMLRYPAF 107
           N LI  Y K NLL+ + +LFDQ+P  NV+SWT  ISA+S   +   +L   V MLR    
Sbjct: 100 NVLINMYVKFNLLNDAHQLFDQMPQRNVISWTTMISAYSKCKIHQKALELLVLMLRDNVR 159

Query: 108 PNQRTLASLFKTCASLPSISFGLALHSLAHKLSLCYEPYSGSALVNFYFKCRLFDDARKV 167
           PN  T +S+ ++C  +  +     LH    K  L  + +  SAL++ + K    +DA  V
Sbjct: 160 PNVYTYSSVLRSCNGMSDVRM---LHCGIIKEGLESDVFVRSALIDVFAKLGEPEDALSV 219

Query: 168 FDEISDRDEVCYSALVVGLAQNAQSIDALSMFREMKASDVGSTLYSVSGALRATADLAAL 227
           FDE+   D + +++++ G AQN++S  AL +F+ MK +   +   +++  LRA   LA L
Sbjct: 220 FDEMVTGDAIVWNSIIGGFAQNSRSDVALELFKRMKRAGFIAEQATLTSVLRACTGLALL 279

Query: 228 EQCRVLHAHAVVTGLDTDVIVQTALIDGYGKSGLIIDARQVFDENLGCMNVVGWNAMLAS 287
           E     H H  +   D D+I+  AL+D Y K G + DA +VF++ +   +V+ W+ M++ 
Sbjct: 280 ELGMQAHVH--IVKYDQDLILNNALVDMYCKCGSLEDALRVFNQ-MKERDVITWSTMISG 339

Query: 288 YAQQGDKNSTLELFNSMEVFGMSPDEYSFLAILTSFCNSRLVSEIELWLRRMRVEYGVEP 347
            AQ G     L+LF  M+  G  P+  + + +L +  ++ L+ +   + R M+  YG++P
Sbjct: 340 LAQNGYSQEALKLFERMKSSGTKPNYITIVGVLFACSHAGLLEDGWYYFRSMKKLYGIDP 399

Query: 348 TLEHFTCLIDAMGRAGKLKEAERVAMTMPFVPDAAVWRALLSSSASHGAGDMAWTMAKRL 407
             EH+ C+ID +G+AGKL +A ++   M   PDA  WR LL +        +A   AK++
Sbjct: 400 VREHYGCMIDLLGKAGKLDDAVKLLNEMECEPDAVTWRTLLGACRVQRNMVLAEYAAKKV 459

Query: 408 LELDQHDDSAYVIVANALSATGRWEEVAVVRKLMKERQVKKKSGRSWIEVRGEVHVFLAG 467
           + LD  D   Y +++N  + + +W+ V  +R  M++R +KK+ G SWIEV  ++H F+ G
Sbjct: 460 IALDPEDAGTYTLLSNIYANSQKWDSVEEIRTRMRDRGIKKEPGCSWIEVNKQIHAFIIG 519

Query: 468 DRNHERIVEIYAKLKELIWEIEKLGYVPIWDEMLHEVGEKEKKEALWYHSEKLALAYGML 527
           D +H +IVE+  KL +LI  +  +GYVP  + +L ++  ++ +++L +HSEKLALA+G++
Sbjct: 520 DNSHPQIVEVSKKLNQLIHRLTGIGYVPETNFVLQDLEGEQMEDSLRHHSEKLALAFGLM 579

Query: 528 TGVAPPGKALRIVKNLKICRDCHQVFKYASRVLKREIIVRDVNRYHRFSNGSCSCEDIW 584
           T   P  K +RI KNL+IC DCH   K AS++  R I++RD  RYH F +G CSC D W
Sbjct: 580 T--LPIEKVIRIRKNLRICGDCHVFCKLASKLEIRSIVIRDPIRYHHFQDGKCSCGDYW 630

BLAST of HG10012837 vs. ExPASy Swiss-Prot
Match: Q9FHF9 (Pentatricopeptide repeat-containing protein At5g46460, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-H49 PE=2 SV=1)

HSP 1 Score: 340.9 bits (873), Expect = 2.9e-92
Identity = 187/548 (34.12%), Postives = 302/548 (55.11%), Query Frame = 0

Query: 40  PATDCSVYNNLITRYSKSNLLHYSVRLFDQIPFPNVVSWTAFI---SAHSNTFLSLRHFV 99
           P  D + +N+++  Y +   +  +++LF Q+P  NV+SWT  I     +  +  +L  F 
Sbjct: 155 PVKDTAAWNSMVHGYLQFGKVDDALKLFKQMPGKNVISWTTMICGLDQNERSGEALDLFK 214

Query: 100 SMLRYPAFPNQRTLASLFKTCASLPSISFGLALHSLAHKLSLCYEPYSGSALVNFYFKCR 159
           +MLR       R    +   CA+ P+   G+ +H L  KL   YE Y  ++L+ FY  C+
Sbjct: 215 NMLRCCIKSTSRPFTCVITACANAPAFHMGIQVHGLIIKLGFLYEEYVSASLITFYANCK 274

Query: 160 LFDDARKVFDEISDRDEVCYSALVVGLAQNAQSIDALSMFREMKASDVGSTLYSVSGALR 219
              D+RKVFDE        ++AL+ G + N +  DALS+F  M  + +     + +  L 
Sbjct: 275 RIGDSRKVFDEKVHEQVAVWTALLSGYSLNKKHEDALSIFSGMLRNSILPNQSTFASGLN 334

Query: 220 ATADLAALEQCRVLHAHAVVTGLDTDVIVQTALIDGYGKSGLIIDARQVFDENLGCMNVV 279
           + + L  L+  + +H  AV  GL+TD  V  +L+  Y  SG + DA  VF + +   ++V
Sbjct: 335 SCSALGTLDWGKEMHGVAVKLGLETDAFVGNSLVVMYSDSGNVNDAVSVFIK-IFKKSIV 394

Query: 280 GWNAMLASYAQQGDKNSTLELFNSMEVFGMSPDEYSFLAILTSFCNSRLVSE-IELWLRR 339
            WN+++   AQ G       +F  M      PDE +F  +L++  +   + +  +L+   
Sbjct: 395 SWNSIIVGCAQHGRGKWAFVIFGQMIRLNKEPDEITFTGLLSACSHCGFLEKGRKLFYYM 454

Query: 340 MRVEYGVEPTLEHFTCLIDAMGRAGKLKEAERVAMTMPFVPDAAVWRALLSSSASHGAGD 399
                 ++  ++H+TC++D +GR GKLKEAE +   M   P+  VW ALLS+   H   D
Sbjct: 455 SSGINHIDRKIQHYTCMVDILGRCGKLKEAEELIERMVVKPNEMVWLALLSACRMHSDVD 514

Query: 400 MAWTMAKRLLELDQHDDSAYVIVANALSATGRWEEVAVVRKLMKERQVKKKSGRSWIEVR 459
                A  +  LD    +AYV+++N  ++ GRW  V+ +R  MK+  + KK G SW+ +R
Sbjct: 515 RGEKAAAAIFNLDSKSSAAYVLLSNIYASAGRWSNVSKLRVKMKKNGIMKKPGSSWVVIR 574

Query: 460 GEVHVFLAGDRNHERIVEIYAKLKELIWEIEKLGYVPIWDEMLHEVGEKEKKEALWYHSE 519
           G+ H F +GD+ H     IY KL+ L  ++++LGY P +   LH+V +++K+E LWYHSE
Sbjct: 575 GKKHEFFSGDQPH--CSRIYEKLEFLREKLKELGYAPDYRSALHDVEDEQKEEMLWYHSE 634

Query: 520 KLALAYGMLTGVAPPGKALRIVKNLKICRDCHQVFKYASRVLKREIIVRDVNRYHRFSNG 579
           +LA+A+G++  V   G A+ ++KNL++C DCH V K  S V+ REI++RD  R+H F NG
Sbjct: 635 RLAIAFGLINTV--EGSAVTVMKNLRVCEDCHTVIKLISGVVGREIVLRDPIRFHHFKNG 694

Query: 580 SCSCEDIW 584
           +CSC D W
Sbjct: 695 TCSCGDYW 697

BLAST of HG10012837 vs. ExPASy TrEMBL
Match: A0A6J1IHT2 (pentatricopeptide repeat-containing protein At4g33170-like OS=Cucurbita maxima OX=3661 GN=LOC111473187 PE=3 SV=1)

HSP 1 Score: 1038.1 bits (2683), Expect = 1.4e-299
Identity = 514/583 (88.16%), Postives = 542/583 (92.97%), Query Frame = 0

Query: 1   MNDFFDSDPTPPPAPPMATVTTAASKLDPRIVHAQAIKSPATDCSVYNNLITRYSKSNLL 60
           MND FD DP PP APP+A  T AA K DPRI+HAQAIKSPATD SVYNNLIT Y K+NLL
Sbjct: 1   MNDLFDPDPIPPRAPPLAAATAAAGKSDPRIIHAQAIKSPATDRSVYNNLITLYGKANLL 60

Query: 61  HYSVRLFDQIPFPNVVSWTAFISAHSNTFLSLRHFVSMLRYPAFPNQRTLASLFKTCASL 120
           HYSVRLFDQIP PNVVSWTA ISAHSNTFL+LRHFVSMLRYP+FPNQRTLASLFKTCASL
Sbjct: 61  HYSVRLFDQIPLPNVVSWTALISAHSNTFLALRHFVSMLRYPSFPNQRTLASLFKTCASL 120

Query: 121 PSISFGLALHSLAHKLSLCYEPYSGSALVNFYFKCRLFDDARKVFDEISDRDEVCYSALV 180
           P +SFGL+LHSLAHKLSLC EP+ GSALVNFY K  LFDDARKVFDEISD DEVCYSALV
Sbjct: 121 PCVSFGLSLHSLAHKLSLCSEPFCGSALVNFYSKSHLFDDARKVFDEISDTDEVCYSALV 180

Query: 181 VGLAQNAQSIDALSMFREMKASDVGSTLYSVSGALRATADLAALEQCRVLHAHAVVTGLD 240
           VGLAQNAQSIDALSMFREMKAS+V ST+YSVSGALRA ADLAALEQCRVLH HAV TGLD
Sbjct: 181 VGLAQNAQSIDALSMFREMKASEVASTMYSVSGALRAAADLAALEQCRVLHGHAVSTGLD 240

Query: 241 TDVIVQTALIDGYGKSGLIIDARQVFDENLGCMNVVGWNAMLASYAQQGDKNSTLELFNS 300
           TDVIVQTALIDGYGKSGLIIDARQVF+E+LGCMNVVGWNAMLASYAQQGDKNSTLELFNS
Sbjct: 241 TDVIVQTALIDGYGKSGLIIDARQVFEESLGCMNVVGWNAMLASYAQQGDKNSTLELFNS 300

Query: 301 MEVFGMSPDEYSFLAILTSFCNSRLVSEIELWLRRMRVEYGVEPTLEHFTCLIDAMGRAG 360
           M  FG+ PDEY+FLAILTS+CNSRLV EIELWLRRMRVEYGV+PTLEHFTCLIDAMGR G
Sbjct: 301 MGSFGLCPDEYTFLAILTSYCNSRLVGEIELWLRRMRVEYGVQPTLEHFTCLIDAMGRDG 360

Query: 361 KLKEAERVAMTMPFVPDAAVWRALLSSSASHGAGDMAWTMAKRLLELDQHDDSAYVIVAN 420
           KL+EAER+AMTMPFVPDAAVWRALLSSSASHGAGDMAW MAKRLLEL+ HDDSAYVIVAN
Sbjct: 361 KLEEAERIAMTMPFVPDAAVWRALLSSSASHGAGDMAWAMAKRLLELNPHDDSAYVIVAN 420

Query: 421 ALSATGRWEEVAVVRKLMKERQVKKKSGRSWIEVRGEVHVFLAGDRNHERIVEIYAKLKE 480
           ALS+T RWEEVA+VRK+MKERQVKKKSGRSWIEVRGEVHVFLAGDRNHER  EIYAKL+ 
Sbjct: 421 ALSSTARWEEVAIVRKMMKERQVKKKSGRSWIEVRGEVHVFLAGDRNHERTEEIYAKLRN 480

Query: 481 LIWEIEKLGYVPIWDEMLHEVGEKEKKEALWYHSEKLALAYGMLTGVAPPGKALRIVKNL 540
           LIWEIEKLGYVPIW EMLHEVGEKEKKEALWYHSEKLALAYG+L G APPGKALRIVKNL
Sbjct: 481 LIWEIEKLGYVPIWAEMLHEVGEKEKKEALWYHSEKLALAYGVLVGAAPPGKALRIVKNL 540

Query: 541 KICRDCHQVFKYASRVLKREIIVRDVNRYHRFSNGSCSCEDIW 584
           +ICRDCHQVFKYASRVL+REI+VRDVNRYHRF NGSC+C DIW
Sbjct: 541 RICRDCHQVFKYASRVLRREIVVRDVNRYHRFLNGSCTCGDIW 583

BLAST of HG10012837 vs. ExPASy TrEMBL
Match: A0A6J1G6S2 (pentatricopeptide repeat-containing protein At4g33170-like OS=Cucurbita moschata OX=3662 GN=LOC111451382 PE=3 SV=1)

HSP 1 Score: 1034.2 bits (2673), Expect = 2.0e-298
Identity = 513/583 (87.99%), Postives = 541/583 (92.80%), Query Frame = 0

Query: 1   MNDFFDSDPTPPPAPPMATVTTAASKLDPRIVHAQAIKSPATDCSVYNNLITRYSKSNLL 60
           MND FDSDP PP APP+A  T AA K DPRI+HAQAIKSPATD SVYNNLIT Y K+NLL
Sbjct: 1   MNDLFDSDPIPPRAPPLAAATAAAGKSDPRIIHAQAIKSPATDRSVYNNLITLYGKANLL 60

Query: 61  HYSVRLFDQIPFPNVVSWTAFISAHSNTFLSLRHFVSMLRYPAFPNQRTLASLFKTCASL 120
           HYSVRLFDQIP PNVVSWTA ISAHSNTFL+LRHFVSMLRYP+FPNQRTLASL KTCASL
Sbjct: 61  HYSVRLFDQIPLPNVVSWTALISAHSNTFLALRHFVSMLRYPSFPNQRTLASLLKTCASL 120

Query: 121 PSISFGLALHSLAHKLSLCYEPYSGSALVNFYFKCRLFDDARKVFDEISDRDEVCYSALV 180
           P +SFGL+LHSLAHKLSLC EP+ GSALVNFY K  LFDDARKVFDEISD DEVCYSALV
Sbjct: 121 PCVSFGLSLHSLAHKLSLCSEPFCGSALVNFYSKSHLFDDARKVFDEISDTDEVCYSALV 180

Query: 181 VGLAQNAQSIDALSMFREMKASDVGSTLYSVSGALRATADLAALEQCRVLHAHAVVTGLD 240
           VGLAQNAQSIDALS+FREMKASDV ST+YSVSGALRA ADLAALEQCRVLH HAVV GLD
Sbjct: 181 VGLAQNAQSIDALSVFREMKASDVASTMYSVSGALRAAADLAALEQCRVLHGHAVVVGLD 240

Query: 241 TDVIVQTALIDGYGKSGLIIDARQVFDENLGCMNVVGWNAMLASYAQQGDKNSTLELFNS 300
           TDVIVQT+LIDGYGKSGLIIDARQVF+E+LGCMNVVGWNAMLASYAQQGDKNSTLELFNS
Sbjct: 241 TDVIVQTSLIDGYGKSGLIIDARQVFEESLGCMNVVGWNAMLASYAQQGDKNSTLELFNS 300

Query: 301 MEVFGMSPDEYSFLAILTSFCNSRLVSEIELWLRRMRVEYGVEPTLEHFTCLIDAMGRAG 360
           M  FG+ PDEY+FLAILTS+CNSRLV EIELWLRRM VEYGV+PTLEH+TCLIDAMGRAG
Sbjct: 301 MVSFGLCPDEYTFLAILTSYCNSRLVGEIELWLRRMGVEYGVQPTLEHYTCLIDAMGRAG 360

Query: 361 KLKEAERVAMTMPFVPDAAVWRALLSSSASHGAGDMAWTMAKRLLELDQHDDSAYVIVAN 420
           KL+EAER+AMTMPFVPDAAVWRALLSSSASHGAGDMAW MAKRLLEL+ HDDSAYVIVAN
Sbjct: 361 KLEEAERIAMTMPFVPDAAVWRALLSSSASHGAGDMAWAMAKRLLELNPHDDSAYVIVAN 420

Query: 421 ALSATGRWEEVAVVRKLMKERQVKKKSGRSWIEVRGEVHVFLAGDRNHERIVEIYAKLKE 480
           ALS+T RWEEVAVVRK+MKERQVKKKSGRSWIEVRGEVHVFLAGDRNHER  EIYAKL+ 
Sbjct: 421 ALSSTARWEEVAVVRKMMKERQVKKKSGRSWIEVRGEVHVFLAGDRNHERTEEIYAKLRN 480

Query: 481 LIWEIEKLGYVPIWDEMLHEVGEKEKKEALWYHSEKLALAYGMLTGVAPPGKALRIVKNL 540
           LIWEIEKLGYVPIW EMLHEVGEKEKKEALWYHSEKLALAYG+L G APPGKALRIVKNL
Sbjct: 481 LIWEIEKLGYVPIWAEMLHEVGEKEKKEALWYHSEKLALAYGVLVGAAPPGKALRIVKNL 540

Query: 541 KICRDCHQVFKYASRVLKREIIVRDVNRYHRFSNGSCSCEDIW 584
           +ICRDCHQVFKYASRVL REI+VRDVNRYHRF NGSC+C DIW
Sbjct: 541 RICRDCHQVFKYASRVLGREIVVRDVNRYHRFLNGSCTCGDIW 583

BLAST of HG10012837 vs. ExPASy TrEMBL
Match: A0A6J1DUT1 (pentatricopeptide repeat-containing protein At4g33170-like OS=Momordica charantia OX=3673 GN=LOC111024565 PE=3 SV=1)

HSP 1 Score: 1014.6 bits (2622), Expect = 1.6e-292
Identity = 511/585 (87.35%), Postives = 540/585 (92.31%), Query Frame = 0

Query: 1   MNDFFDSDPT--PPPAPPMATVTTAASKLDPRIVHAQAIKSPATDCSVYNNLITRYSKSN 60
           MNDFFD DPT  PPPAPP+A    AA K DPRIVHAQAIKSPAT  SVYNNLIT YSKS+
Sbjct: 1   MNDFFDPDPTPAPPPAPPLA----AAGKSDPRIVHAQAIKSPATHRSVYNNLITLYSKSS 60

Query: 61  LLHYSVRLFDQIPFPNVVSWTAFISAHSNTFLSLRHFVSMLRYPAFPNQRTLASLFKTCA 120
            LHYSVRLF QIPFPNVVSWTA ISAHSNT LSLRHF+ MLRYPAFPNQRTLASLFKTCA
Sbjct: 61  FLHYSVRLFHQIPFPNVVSWTALISAHSNTSLSLRHFIYMLRYPAFPNQRTLASLFKTCA 120

Query: 121 SLPSISFGLALHSLAHKLSLCYEPYSGSALVNFYFKCRLFDDARKVFDEISDRDEVCYSA 180
           SLPS+ FG ALHSLA+KLSLC EP+SGSALVNFY KC+LF DARKVFDEISDRDEVCYSA
Sbjct: 121 SLPSVPFGFALHSLAYKLSLCSEPFSGSALVNFYSKCQLFGDARKVFDEISDRDEVCYSA 180

Query: 181 LVVGLAQNAQSIDALSMFREMKASDVGSTLYSVSGALRATADLAALEQCRVLHAHAVVTG 240
           LVVGLAQNAQSIDALSMFREMK  DV ST+YSVSGALRA ADLAALEQCRVLH HAVVTG
Sbjct: 181 LVVGLAQNAQSIDALSMFREMKVCDVASTMYSVSGALRAAADLAALEQCRVLHGHAVVTG 240

Query: 241 LDTDVIVQTALIDGYGKSGLIIDARQVFDENLGCMNVVGWNAMLASYAQQGDKNSTLELF 300
           LDTDVIVQTALIDGYGKSGLI+DAR+VFDE+L  MNVVGWNAMLASYAQQGDKNS LELF
Sbjct: 241 LDTDVIVQTALIDGYGKSGLILDAREVFDESLASMNVVGWNAMLASYAQQGDKNSALELF 300

Query: 301 NSMEVFGMSPDEYSFLAILTSFCNSRLVSEIELWLRRMRVEYGVEPTLEHFTCLIDAMGR 360
           NSM   G+SPDEYSFLAILTS+CNSR V +I+LWLRRMRV YGVEPTLEHFTCLIDAMGR
Sbjct: 301 NSMAARGLSPDEYSFLAILTSYCNSRSVGDIQLWLRRMRVGYGVEPTLEHFTCLIDAMGR 360

Query: 361 AGKLKEAERVAMTMPFVPDAAVWRALLSSSASHGAGDMAWTMAKRLLELDQHDDSAYVIV 420
            GKLKEAERVAMTMPF+PDAAVWRALLSSSASHGAG+MAW MAKRLLEL+ HDDSAYVIV
Sbjct: 361 VGKLKEAERVAMTMPFMPDAAVWRALLSSSASHGAGNMAWAMAKRLLELNPHDDSAYVIV 420

Query: 421 ANALSATGRWEEVAVVRKLMKERQVKKKSGRSWIEVRGEVHVFLAGDRNHERIVEIYAKL 480
           ANALSAT RWEEVAVVRKLMKERQVKKKSGRSWIEVRG+VHVFLAGDRNHER  E+YAKL
Sbjct: 421 ANALSATARWEEVAVVRKLMKERQVKKKSGRSWIEVRGKVHVFLAGDRNHERAEEVYAKL 480

Query: 481 KELIWEIEKLGYVPIWDEMLHEVGEKEKKEALWYHSEKLALAYGMLTGVAPPGKALRIVK 540
           +ELI EIEKLGYVPIWDEMLHEVGEKEKKEALWYHSEKLALA+G++ GVAPPGKALRIVK
Sbjct: 481 RELIREIEKLGYVPIWDEMLHEVGEKEKKEALWYHSEKLALAFGVVAGVAPPGKALRIVK 540

Query: 541 NLKICRDCHQVFKYASRVLKREIIVRDVNRYHRFSNGSCSCEDIW 584
           NL+IC+DCHQVFKYASRVL+REIIVRDVNRYHRF +GSC+CEDIW
Sbjct: 541 NLRICKDCHQVFKYASRVLEREIIVRDVNRYHRFLHGSCTCEDIW 581

BLAST of HG10012837 vs. ExPASy TrEMBL
Match: A0A0A0L4A2 (DYW_deaminase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_3G134640 PE=3 SV=1)

HSP 1 Score: 983.4 bits (2541), Expect = 4.1e-283
Identity = 489/583 (83.88%), Postives = 533/583 (91.42%), Query Frame = 0

Query: 1   MNDFFDSDPTPPPAPPMATVTTAASKLDPRIVHAQAIKSPATDCSVYNNLITRYSKSNLL 60
           MN FF SDPTP PAPP+A  T   +K DPRI HAQAIKSPAT+C VYNNLITRYSK+NLL
Sbjct: 1   MNAFFPSDPTPLPAPPLAAATKVVAKSDPRIDHAQAIKSPATECVVYNNLITRYSKANLL 60

Query: 61  HYSVRLFDQIPFPNVVSWTAFISAHSNTFLSLRHFVSMLRYPAFPNQRTLASLFKTCASL 120
           HYS RLF+QIPFP++VSWTA ISAHS+TFLSLRHFVSMLRYP FPN+RTLA LFKTCASL
Sbjct: 61  HYSARLFNQIPFPDIVSWTALISAHSSTFLSLRHFVSMLRYPTFPNERTLAPLFKTCASL 120

Query: 121 PSISFGLALHSLAHKLSLCYEPYSGSALVNFYFKCRLFDDARKVFDEISDRDEVCYSALV 180
           P +SFG ALHSLA+KLSLC  PYSGSALVNFY KCRLF+DA KVFDEIS RDE CYSALV
Sbjct: 121 PCVSFGFALHSLAYKLSLCNGPYSGSALVNFYSKCRLFNDACKVFDEISYRDEFCYSALV 180

Query: 181 VGLAQNAQSIDALSMFREMKASDVGSTLYSVSGALRATADLAALEQCRVLHAHAVVTGLD 240
           VGLAQNAQSI ALSMFR+MKAS+V ST+YSVSGALRA ADLAALE+CRV+H+HAVVTGLD
Sbjct: 181 VGLAQNAQSIRALSMFRQMKASEVASTIYSVSGALRAAADLAALERCRVIHSHAVVTGLD 240

Query: 241 TDVIVQTALIDGYGKSGLIIDARQVFDENLGCMNVVGWNAMLASYAQQGDKNSTLELFNS 300
           T+VIVQTALIDGYGKSGLIIDARQVFDENLGCMN+VGWNAML+SYAQQGD+NSTLE+FNS
Sbjct: 241 TNVIVQTALIDGYGKSGLIIDARQVFDENLGCMNIVGWNAMLSSYAQQGDQNSTLEVFNS 300

Query: 301 MEVFGMSPDEYSFLAILTSFCNSRLVSEIELWLRRMRVEYGVEPTLEHFTCLIDAMGRAG 360
           M+ FGMSPDEYSFLAIL+SFCNS LVSEI+ WLRRM VEYGV+PTLEHFTCLIDA+GR G
Sbjct: 301 MKPFGMSPDEYSFLAILSSFCNSGLVSEIKPWLRRMIVEYGVKPTLEHFTCLIDALGRTG 360

Query: 361 KLKEAERVAMTMPFVPDAAVWRALLSSSASHGAGDMAWTMAKRLLELDQHDDSAYVIVAN 420
           KL+EAERVAMTMPFVPD AVWRALLSSSASHGAGDMAWTMAKRLLEL+QHDDSAYVIV+N
Sbjct: 361 KLEEAERVAMTMPFVPDEAVWRALLSSSASHGAGDMAWTMAKRLLELNQHDDSAYVIVSN 420

Query: 421 ALSATGRWEEVAVVRKLMKERQVKKKSGRSWIEVRGEVHVFLAGDRNHERIVEIYAKLKE 480
           ALS   RWEEVA+VRKLMKER VKK SG+SWIEVRGE HVFLAGDRNHER  E+ AKLKE
Sbjct: 421 ALSVAARWEEVALVRKLMKERHVKKISGKSWIEVRGEAHVFLAGDRNHERAEEMNAKLKE 480

Query: 481 LIWEIEKLGYVPIWDEMLHEVGEKEKKEALWYHSEKLALAYGMLTGVAPPGKALRIVKNL 540
           L+ EIEKLGYVP+  E LH+VGEKE+KEAL YHSEKLALAYG+LTGVAPPGKALRI+KNL
Sbjct: 481 LVGEIEKLGYVPVCSETLHKVGEKERKEALLYHSEKLALAYGILTGVAPPGKALRIIKNL 540

Query: 541 KICRDCHQVFKYASRVLKREIIVRDVNRYHRFSNGSCSCEDIW 584
           +ICRDCH  FKYASRVLK+EIIVRD+NRYHRFS GSC+C DIW
Sbjct: 541 RICRDCHLFFKYASRVLKKEIIVRDINRYHRFSYGSCTCADIW 583

BLAST of HG10012837 vs. ExPASy TrEMBL
Match: A0A5A7U1J3 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold434G002030 PE=3 SV=1)

HSP 1 Score: 978.4 bits (2528), Expect = 1.3e-281
Identity = 493/583 (84.56%), Postives = 532/583 (91.25%), Query Frame = 0

Query: 1   MNDFFDSDPTPPPAPPMATVTTAASKLDPRIVHAQAIKSPATDCSVYNNLITRYSKSNLL 60
           MN FF SDPTPPP PP+A  T   +K DPRI HAQAIKSPATD +VYNNLITRYS++NLL
Sbjct: 1   MNAFFASDPTPPPEPPLAAATKVVAKSDPRIDHAQAIKSPATDRAVYNNLITRYSRTNLL 60

Query: 61  HYSVRLFDQIPFPNVVSWTAFISAHSNTFLSLRHFVSMLRYPAFPNQRTLASLFKTCASL 120
           HYSVRLF QIPFP++VSWTAFISAHSNTFLSLRHFVSMLRYP FPN+RTLASLFKTCASL
Sbjct: 61  HYSVRLFYQIPFPDIVSWTAFISAHSNTFLSLRHFVSMLRYPTFPNERTLASLFKTCASL 120

Query: 121 PSISFGLALHSLAHKLSLCYEPYSGSALVNFYFKCRLFDDARKVFDEISDRDEVCYSALV 180
           P +SFGLALHSLA+KLSLC + YSGSALVNFY KCRLF+ A KVFDEISDRDE CYSALV
Sbjct: 121 PCVSFGLALHSLAYKLSLCNDRYSGSALVNFYSKCRLFNHACKVFDEISDRDEFCYSALV 180

Query: 181 VGLAQNAQSIDALSMFREMKASDVGSTLYSVSGALRATADLAALEQCRVLHAHAVVTGLD 240
           VGLAQNAQSI ALSMFR+MKAS+V ST+YSVSGAL A ADLAALE+CRVLH+HAVVTGLD
Sbjct: 181 VGLAQNAQSIPALSMFRQMKASEVASTIYSVSGALCAAADLAALERCRVLHSHAVVTGLD 240

Query: 241 TDVIVQTALIDGYGKSGLIIDARQVFDENLGCMNVVGWNAMLASYAQQGDKNSTLELFNS 300
           T+VIVQTALID YGKSGLII+ARQVFDENLGCMN+VGWNA+L+SYAQQGD+NST ELFNS
Sbjct: 241 TNVIVQTALIDAYGKSGLIIEARQVFDENLGCMNIVGWNAILSSYAQQGDQNSTRELFNS 300

Query: 301 MEVFGMSPDEYSFLAILTSFCNSRLVSEIELWLRRMRVEYGVEPTLEHFTCLIDAMGRAG 360
           M+ F MSPDEYSFLAIL+SFCN+  V EI+ WLRRM VEYGV+PTLEHFTCLIDA+GRAG
Sbjct: 301 MKAFRMSPDEYSFLAILSSFCNAGSVGEIKPWLRRMIVEYGVKPTLEHFTCLIDALGRAG 360

Query: 361 KLKEAERVAMTMPFVPDAAVWRALLSSSASHGAGDMAWTMAKRLLELDQHDDSAYVIVAN 420
           KL+EAERVAMTMPFVPD AVWRALL+SSASHGAGDMAW MAKRLLEL+ HDDSAYVIVAN
Sbjct: 361 KLEEAERVAMTMPFVPDEAVWRALLTSSASHGAGDMAWKMAKRLLELNLHDDSAYVIVAN 420

Query: 421 ALSATGRWEEVAVVRKLMKERQVKKKSGRSWIEVRGEVHVFLAGDRNHERIVEIYAKLKE 480
           ALSAT RWEEVAVVRKLMKER VKKKSGRSWIEVRGEVHVFLAGDRNHER  E+ AKLKE
Sbjct: 421 ALSATARWEEVAVVRKLMKERDVKKKSGRSWIEVRGEVHVFLAGDRNHERAEELNAKLKE 480

Query: 481 LIWEIEKLGYVPIWDEMLHEVGEKEKKEALWYHSEKLALAYGMLTGVAPPGKALRIVKNL 540
           L+ EIEKLGYVPI  E+L EVGEKEKKEAL YHSEKLALAYG+LTGVA PGKALRIVKNL
Sbjct: 481 LVGEIEKLGYVPICGEVLQEVGEKEKKEALLYHSEKLALAYGILTGVALPGKALRIVKNL 540

Query: 541 KICRDCHQVFKYASRVLKREIIVRDVNRYHRFSNGSCSCEDIW 584
           +ICRDCHQ FKYASRVLK+EIIVRDVNRYHRFS G C+C DIW
Sbjct: 541 RICRDCHQFFKYASRVLKKEIIVRDVNRYHRFSYGGCTCADIW 583

BLAST of HG10012837 vs. TAIR 10
Match: AT4G33170.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 386.7 bits (992), Expect = 3.2e-107
Identity = 214/581 (36.83%), Postives = 349/581 (60.07%), Query Frame = 0

Query: 12  PPAPPMATVTTAASKLDPRI-----VHAQAIK-SPATDCSVYNNLITRYSKSNLLHYSVR 71
           P    M +V  AAS L   +     VH  AIK +  +D  V   LI  YS++  +  +  
Sbjct: 414 PDQYTMTSVLKAASSLPEGLSLSKQVHVHAIKINNVSDSFVSTALIDAYSRNRCMKEAEI 473

Query: 72  LFDQIPFPNVVSWTAFISAHSNT---FLSLRHFVSMLRYPAFPNQRTLASLFKTCASLPS 131
           LF++  F ++V+W A ++ ++ +     +L+ F  M +     +  TLA++FKTC  L +
Sbjct: 474 LFERHNF-DLVAWNAMMAGYTQSHDGHKTLKLFALMHKQGERSDDFTLATVFKTCGFLFA 533

Query: 132 ISFGLALHSLAHKLSLCYEPYSGSALVNFYFKCRLFDDARKVFDEISDRDEVCYSALVVG 191
           I+ G  +H+ A K     + +  S +++ Y KC     A+  FD I   D+V ++ ++ G
Sbjct: 534 INQGKQVHAYAIKSGYDLDLWVSSGILDMYVKCGDMSAAQFAFDSIPVPDDVAWTTMISG 593

Query: 192 LAQNAQSIDALSMFREMKASDVGSTLYSVSGALRATADLAALEQCRVLHAHAVVTGLDTD 251
             +N +   A  +F +M+   V    ++++   +A++ L ALEQ R +HA+A+      D
Sbjct: 594 CIENGEEERAFHVFSQMRLMGVLPDEFTIATLAKASSCLTALEQGRQIHANALKLNCTND 653

Query: 252 VIVQTALIDGYGKSGLIIDARQVFDENLGCMNVVGWNAMLASYAQQGDKNSTLELFNSME 311
             V T+L+D Y K G I DA  +F + +  MN+  WNAML   AQ G+   TL+LF  M+
Sbjct: 654 PFVGTSLVDMYAKCGSIDDAYCLF-KRIEMMNITAWNAMLVGLAQHGEGKETLQLFKQMK 713

Query: 312 VFGMSPDEYSFLAILTSFCNSRLVSEIELWLRRMRVEYGVEPTLEHFTCLIDAMGRAGKL 371
             G+ PD+ +F+ +L++  +S LVSE    +R M  +YG++P +EH++CL DA+GRAG +
Sbjct: 714 SLGIKPDKVTFIGVLSACSHSGLVSEAYKHMRSMHGDYGIKPEIEHYSCLADALGRAGLV 773

Query: 372 KEAERVAMTMPFVPDAAVWRALLSSSASHGAGDMAWTMAKRLLELDQHDDSAYVIVANAL 431
           K+AE +  +M     A+++R LL++    G  +    +A +LLEL+  D SAYV+++N  
Sbjct: 774 KQAENLIESMSMEASASMYRTLLAACRVQGDTETGKRVATKLLELEPLDSSAYVLLSNMY 833

Query: 432 SATGRWEEVAVVRKLMKERQVKKKSGRSWIEVRGEVHVFLAGDRNHERIVEIYAKLKELI 491
           +A  +W+E+ + R +MK  +VKK  G SWIEV+ ++H+F+  DR++ +   IY K+K++I
Sbjct: 834 AAASKWDEMKLARTMMKGHKVKKDPGFSWIEVKNKIHIFVVDDRSNRQTELIYRKVKDMI 893

Query: 492 WEIEKLGYVPIWDEMLHEVGEKEKKEALWYHSEKLALAYGMLTGVAPPGKALRIVKNLKI 551
            +I++ GYVP  D  L +V E+EK+ AL+YHSEKLA+A+G+L+   PP   +R++KNL++
Sbjct: 894 RDIKQEGYVPETDFTLVDVEEEEKERALYYHSEKLAVAFGLLS--TPPSTPIRVIKNLRV 953

Query: 552 CRDCHQVFKYASRVLKREIIVRDVNRYHRFSNGSCSCEDIW 584
           C DCH   KY ++V  REI++RD NR+HRF +G CSC D W
Sbjct: 954 CGDCHNAMKYIAKVYNREIVLRDANRFHRFKDGICSCGDYW 990

BLAST of HG10012837 vs. TAIR 10
Match: AT5G52630.1 (mitochondrial RNAediting factor 1 )

HSP 1 Score: 362.5 bits (929), Expect = 6.5e-100
Identity = 205/541 (37.89%), Postives = 312/541 (57.67%), Query Frame = 0

Query: 46  VYNNLITRYSKSNLLHYSVRLFDQIPFPNVVSWTAFISAHSNT---FLSLRHFVSMLRYP 105
           V NNLI  YSKS L   S R F+  P  +  +W++ IS  +     ++SL     M+   
Sbjct: 52  VANNLINFYSKSQLPFDSRRAFEDSPQKSSTTWSSIISCFAQNELPWMSLEFLKKMMAGN 111

Query: 106 AFPNQRTLASLFKTCASLPSISFGLALHSLAHKLSLCYEPYSGSALVNFYFKCRLFDDAR 165
             P+   L S  K+CA L     G ++H L+ K     + + GS+LV+ Y KC     AR
Sbjct: 112 LRPDDHVLPSATKSCAILSRCDIGRSVHCLSMKTGYDADVFVGSSLVDMYAKCGEIVYAR 171

Query: 166 KVFDEISDRDEVCYSALVVGLAQNAQSIDALSMFREMKASDVGSTLYSVSGALRATADLA 225
           K+FDE+  R+ V +S ++ G AQ  ++ +AL +F+E    ++    YS S  +   A+  
Sbjct: 172 KMFDEMPQRNVVTWSGMMYGYAQMGENEEALWLFKEALFENLAVNDYSFSSVISVCANST 231

Query: 226 ALEQCRVLHAHAVVTGLDTDVIVQTALIDGYGKSGLIIDARQVFDENLGCMNVVGWNAML 285
            LE  R +H  ++ +  D+   V ++L+  Y K G+   A QVF+E +   N+  WNAML
Sbjct: 232 LLELGRQIHGLSIKSSFDSSSFVGSSLVSLYSKCGVPEGAYQVFNE-VPVKNLGIWNAML 291

Query: 286 ASYAQQGDKNSTLELFNSMEVFGMSPDEYSFLAILTSFCNSRLVSEIELWLRRMRVEYGV 345
            +YAQ       +ELF  M++ GM P+  +FL +L +  ++ LV E   +  +M+ E  +
Sbjct: 292 KAYAQHSHTQKVIELFKRMKLSGMKPNFITFLNVLNACSHAGLVDEGRYYFDQMK-ESRI 351

Query: 346 EPTLEHFTCLIDAMGRAGKLKEAERVAMTMPFVPDAAVWRALLSSSASHGAGDMAWTMAK 405
           EPT +H+  L+D +GRAG+L+EA  V   MP  P  +VW ALL+S   H   ++A   A 
Sbjct: 352 EPTDKHYASLVDMLGRAGRLQEALEVITNMPIDPTESVWGALLTSCTVHKNTELAAFAAD 411

Query: 406 RLLELDQHDDSAYVIVANALSATGRWEEVAVVRKLMKERQVKKKSGRSWIEVRGEVHVFL 465
           ++ EL       ++ ++NA +A GR+E+ A  RKL+++R  KK++G SW+E R +VH F 
Sbjct: 412 KVFELGPVSSGMHISLSNAYAADGRFEDAAKARKLLRDRGEKKETGLSWVEERNKVHTFA 471

Query: 466 AGDRNHERIVEIYAKLKELIWEIEKLGYVPIWDEMLHEVGEKEKKEALWYHSEKLALAYG 525
           AG+R HE+  EIY KL EL  E+EK GY+     +L EV   EK + + YHSE+LA+A+G
Sbjct: 472 AGERRHEKSKEIYEKLAELGEEMEKAGYIADTSYVLREVDGDEKNQTIRYHSERLAIAFG 531

Query: 526 MLTGVAPPGKALRIVKNLKICRDCHQVFKYASRVLKREIIVRDVNRYHRFSNGSCSCEDI 584
           ++T   P  + +R++KNL++C DCH   K+ S   +R IIVRD NR+HRF +G CSC D 
Sbjct: 532 LIT--FPADRPIRVMKNLRVCGDCHNAIKFMSVCTRRVIIVRDNNRFHRFEDGKCSCNDY 588

BLAST of HG10012837 vs. TAIR 10
Match: AT2G03880.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 344.0 bits (881), Expect = 2.4e-94
Identity = 195/539 (36.18%), Postives = 313/539 (58.07%), Query Frame = 0

Query: 48  NNLITRYSKSNLLHYSVRLFDQIPFPNVVSWTAFISAHSNTFL---SLRHFVSMLRYPAF 107
           N LI  Y K NLL+ + +LFDQ+P  NV+SWT  ISA+S   +   +L   V MLR    
Sbjct: 100 NVLINMYVKFNLLNDAHQLFDQMPQRNVISWTTMISAYSKCKIHQKALELLVLMLRDNVR 159

Query: 108 PNQRTLASLFKTCASLPSISFGLALHSLAHKLSLCYEPYSGSALVNFYFKCRLFDDARKV 167
           PN  T +S+ ++C  +  +     LH    K  L  + +  SAL++ + K    +DA  V
Sbjct: 160 PNVYTYSSVLRSCNGMSDVRM---LHCGIIKEGLESDVFVRSALIDVFAKLGEPEDALSV 219

Query: 168 FDEISDRDEVCYSALVVGLAQNAQSIDALSMFREMKASDVGSTLYSVSGALRATADLAAL 227
           FDE+   D + +++++ G AQN++S  AL +F+ MK +   +   +++  LRA   LA L
Sbjct: 220 FDEMVTGDAIVWNSIIGGFAQNSRSDVALELFKRMKRAGFIAEQATLTSVLRACTGLALL 279

Query: 228 EQCRVLHAHAVVTGLDTDVIVQTALIDGYGKSGLIIDARQVFDENLGCMNVVGWNAMLAS 287
           E     H H  +   D D+I+  AL+D Y K G + DA +VF++ +   +V+ W+ M++ 
Sbjct: 280 ELGMQAHVH--IVKYDQDLILNNALVDMYCKCGSLEDALRVFNQ-MKERDVITWSTMISG 339

Query: 288 YAQQGDKNSTLELFNSMEVFGMSPDEYSFLAILTSFCNSRLVSEIELWLRRMRVEYGVEP 347
            AQ G     L+LF  M+  G  P+  + + +L +  ++ L+ +   + R M+  YG++P
Sbjct: 340 LAQNGYSQEALKLFERMKSSGTKPNYITIVGVLFACSHAGLLEDGWYYFRSMKKLYGIDP 399

Query: 348 TLEHFTCLIDAMGRAGKLKEAERVAMTMPFVPDAAVWRALLSSSASHGAGDMAWTMAKRL 407
             EH+ C+ID +G+AGKL +A ++   M   PDA  WR LL +        +A   AK++
Sbjct: 400 VREHYGCMIDLLGKAGKLDDAVKLLNEMECEPDAVTWRTLLGACRVQRNMVLAEYAAKKV 459

Query: 408 LELDQHDDSAYVIVANALSATGRWEEVAVVRKLMKERQVKKKSGRSWIEVRGEVHVFLAG 467
           + LD  D   Y +++N  + + +W+ V  +R  M++R +KK+ G SWIEV  ++H F+ G
Sbjct: 460 IALDPEDAGTYTLLSNIYANSQKWDSVEEIRTRMRDRGIKKEPGCSWIEVNKQIHAFIIG 519

Query: 468 DRNHERIVEIYAKLKELIWEIEKLGYVPIWDEMLHEVGEKEKKEALWYHSEKLALAYGML 527
           D +H +IVE+  KL +LI  +  +GYVP  + +L ++  ++ +++L +HSEKLALA+G++
Sbjct: 520 DNSHPQIVEVSKKLNQLIHRLTGIGYVPETNFVLQDLEGEQMEDSLRHHSEKLALAFGLM 579

Query: 528 TGVAPPGKALRIVKNLKICRDCHQVFKYASRVLKREIIVRDVNRYHRFSNGSCSCEDIW 584
           T   P  K +RI KNL+IC DCH   K AS++  R I++RD  RYH F +G CSC D W
Sbjct: 580 T--LPIEKVIRIRKNLRICGDCHVFCKLASKLEIRSIVIRDPIRYHHFQDGKCSCGDYW 630

BLAST of HG10012837 vs. TAIR 10
Match: AT3G24000.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 343.6 bits (880), Expect = 3.1e-94
Identity = 200/551 (36.30%), Postives = 310/551 (56.26%), Query Frame = 0

Query: 30  RIVHAQAIKSP-ATDCSVYNNLITRYSKSNLLHYSVRLFDQIPFPNVVSWTAFISA---H 89
           RIVHA  ++S    D  + N L+  Y+K   L  + ++F+++P  + V+WT  IS    H
Sbjct: 80  RIVHAHILQSIFRHDIVMGNTLLNMYAKCGSLEEARKVFEKMPQRDFVTWTTLISGYSQH 139

Query: 90  SNTFLSLRHFVSMLRYPAFPNQRTLASLFKTCASLPSISFGLALHSLAHKLSLCYEPYSG 149
                +L  F  MLR+   PN+ TL+S+ K  A+      G  LH    K       + G
Sbjct: 140 DRPCDALLFFNQMLRFGYSPNEFTLSSVIKAAAAERRGCCGHQLHGFCVKCGFDSNVHVG 199

Query: 150 SALVNFYFKCRLFDDARKVFDEISDRDEVCYSALVVGLAQNAQSIDALSMFREMKASDVG 209
           SAL++ Y +  L DDA+ VFD +  R++V ++AL+ G A+ + +  AL +F+ M      
Sbjct: 200 SALLDLYTRYGLMDDAQLVFDALESRNDVSWNALIAGHARRSGTEKALELFQGMLRDGFR 259

Query: 210 STLYSVSGALRATADLAALEQCRVLHAHAVVTGLDTDVIVQTALIDGYGKSGLIIDARQV 269
            + +S +    A +    LEQ + +HA+ + +G          L+D Y KSG I DAR++
Sbjct: 260 PSHFSYASLFGACSSTGFLEQGKWVHAYMIKSGEKLVAFAGNTLLDMYAKSGSIHDARKI 319

Query: 270 FDENLGCMNVVGWNAMLASYAQQGDKNSTLELFNSMEVFGMSPDEYSFLAILTSFCNSRL 329
           FD  L   +VV WN++L +YAQ G     +  F  M   G+ P+E SFL++LT+  +S L
Sbjct: 320 FD-RLAKRDVVSWNSLLTAYAQHGFGKEAVWWFEEMRRVGIRPNEISFLSVLTACSHSGL 379

Query: 330 VSEIELWLRRMRVEYGVEPTLEHFTCLIDAMGRAGKLKEAERVAMTMPFVPDAAVWRALL 389
           + E   +   M+ + G+ P   H+  ++D +GRAG L  A R    MP  P AA+W+ALL
Sbjct: 380 LDEGWHYYELMKKD-GIVPEAWHYVTVVDLLGRAGDLNRALRFIEEMPIEPTAAIWKALL 439

Query: 390 SSSASHGAGDMAWTMAKRLLELDQHDDSAYVIVANALSATGRWEEVAVVRKLMKERQVKK 449
           ++   H   ++    A+ + ELD  D   +VI+ N  ++ GRW + A VRK MKE  VKK
Sbjct: 440 NACRMHKNTELGAYAAEHVFELDPDDPGPHVILYNIYASGGRWNDAARVRKKMKESGVKK 499

Query: 450 KSGRSWIEVRGEVHVFLAGDRNHERIVEIYAKLKELIWEIEKLGYVPIWDEMLHEVGEKE 509
           +   SW+E+   +H+F+A D  H +  EI  K +E++ +I++LGYVP    ++  V ++E
Sbjct: 500 EPACSWVEIENAIHMFVANDERHPQREEIARKWEEVLAKIKELGYVPDTSHVIVHVDQQE 559

Query: 510 KKEALWYHSEKLALAYGMLTGVAPPGKALRIVKNLKICRDCHQVFKYASRVLKREIIVRD 569
           ++  L YHSEK+ALA+ +L    PPG  + I KN+++C DCH   K AS+V+ REIIVRD
Sbjct: 560 REVNLQYHSEKIALAFALLN--TPPGSTIHIKKNIRVCGDCHTAIKLASKVVGREIIVRD 619

Query: 570 VNRYHRFSNGS 577
            NR+H F + S
Sbjct: 620 TNRFHHFKDAS 626

BLAST of HG10012837 vs. TAIR 10
Match: AT5G46460.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 340.9 bits (873), Expect = 2.0e-93
Identity = 187/548 (34.12%), Postives = 302/548 (55.11%), Query Frame = 0

Query: 40  PATDCSVYNNLITRYSKSNLLHYSVRLFDQIPFPNVVSWTAFI---SAHSNTFLSLRHFV 99
           P  D + +N+++  Y +   +  +++LF Q+P  NV+SWT  I     +  +  +L  F 
Sbjct: 155 PVKDTAAWNSMVHGYLQFGKVDDALKLFKQMPGKNVISWTTMICGLDQNERSGEALDLFK 214

Query: 100 SMLRYPAFPNQRTLASLFKTCASLPSISFGLALHSLAHKLSLCYEPYSGSALVNFYFKCR 159
           +MLR       R    +   CA+ P+   G+ +H L  KL   YE Y  ++L+ FY  C+
Sbjct: 215 NMLRCCIKSTSRPFTCVITACANAPAFHMGIQVHGLIIKLGFLYEEYVSASLITFYANCK 274

Query: 160 LFDDARKVFDEISDRDEVCYSALVVGLAQNAQSIDALSMFREMKASDVGSTLYSVSGALR 219
              D+RKVFDE        ++AL+ G + N +  DALS+F  M  + +     + +  L 
Sbjct: 275 RIGDSRKVFDEKVHEQVAVWTALLSGYSLNKKHEDALSIFSGMLRNSILPNQSTFASGLN 334

Query: 220 ATADLAALEQCRVLHAHAVVTGLDTDVIVQTALIDGYGKSGLIIDARQVFDENLGCMNVV 279
           + + L  L+  + +H  AV  GL+TD  V  +L+  Y  SG + DA  VF + +   ++V
Sbjct: 335 SCSALGTLDWGKEMHGVAVKLGLETDAFVGNSLVVMYSDSGNVNDAVSVFIK-IFKKSIV 394

Query: 280 GWNAMLASYAQQGDKNSTLELFNSMEVFGMSPDEYSFLAILTSFCNSRLVSE-IELWLRR 339
            WN+++   AQ G       +F  M      PDE +F  +L++  +   + +  +L+   
Sbjct: 395 SWNSIIVGCAQHGRGKWAFVIFGQMIRLNKEPDEITFTGLLSACSHCGFLEKGRKLFYYM 454

Query: 340 MRVEYGVEPTLEHFTCLIDAMGRAGKLKEAERVAMTMPFVPDAAVWRALLSSSASHGAGD 399
                 ++  ++H+TC++D +GR GKLKEAE +   M   P+  VW ALLS+   H   D
Sbjct: 455 SSGINHIDRKIQHYTCMVDILGRCGKLKEAEELIERMVVKPNEMVWLALLSACRMHSDVD 514

Query: 400 MAWTMAKRLLELDQHDDSAYVIVANALSATGRWEEVAVVRKLMKERQVKKKSGRSWIEVR 459
                A  +  LD    +AYV+++N  ++ GRW  V+ +R  MK+  + KK G SW+ +R
Sbjct: 515 RGEKAAAAIFNLDSKSSAAYVLLSNIYASAGRWSNVSKLRVKMKKNGIMKKPGSSWVVIR 574

Query: 460 GEVHVFLAGDRNHERIVEIYAKLKELIWEIEKLGYVPIWDEMLHEVGEKEKKEALWYHSE 519
           G+ H F +GD+ H     IY KL+ L  ++++LGY P +   LH+V +++K+E LWYHSE
Sbjct: 575 GKKHEFFSGDQPH--CSRIYEKLEFLREKLKELGYAPDYRSALHDVEDEQKEEMLWYHSE 634

Query: 520 KLALAYGMLTGVAPPGKALRIVKNLKICRDCHQVFKYASRVLKREIIVRDVNRYHRFSNG 579
           +LA+A+G++  V   G A+ ++KNL++C DCH V K  S V+ REI++RD  R+H F NG
Sbjct: 635 RLAIAFGLINTV--EGSAVTVMKNLRVCEDCHTVIKLISGVVGREIVLRDPIRFHHFKNG 694

Query: 580 SCSCEDIW 584
           +CSC D W
Sbjct: 695 TCSCGDYW 697

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038880033.14.1e-30691.25pentatricopeptide repeat-containing protein At4g33170-like [Benincasa hispida][more]
XP_022974509.12.9e-29988.16pentatricopeptide repeat-containing protein At4g33170-like [Cucurbita maxima][more]
XP_022947552.14.1e-29887.99pentatricopeptide repeat-containing protein At4g33170-like [Cucurbita moschata][more]
KAG6597338.14.1e-29888.18Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma sub... [more]
XP_023540846.14.6e-29787.65pentatricopeptide repeat-containing protein At4g33170-like [Cucurbita pepo subsp... [more]
Match NameE-valueIdentityDescription
Q9SMZ24.6e-10636.83Pentatricopeptide repeat-containing protein At4g33170 OS=Arabidopsis thaliana OX... [more]
Q9LTF49.2e-9937.89Putative pentatricopeptide repeat-containing protein At5g52630 OS=Arabidopsis th... [more]
Q9LIQ74.6e-9836.74Pentatricopeptide repeat-containing protein At3g24000, mitochondrial OS=Arabidop... [more]
Q9SI533.4e-9336.18Pentatricopeptide repeat-containing protein At2g03880, mitochondrial OS=Arabidop... [more]
Q9FHF92.9e-9234.12Pentatricopeptide repeat-containing protein At5g46460, mitochondrial OS=Arabidop... [more]
Match NameE-valueIdentityDescription
A0A6J1IHT21.4e-29988.16pentatricopeptide repeat-containing protein At4g33170-like OS=Cucurbita maxima O... [more]
A0A6J1G6S22.0e-29887.99pentatricopeptide repeat-containing protein At4g33170-like OS=Cucurbita moschata... [more]
A0A6J1DUT11.6e-29287.35pentatricopeptide repeat-containing protein At4g33170-like OS=Momordica charanti... [more]
A0A0A0L4A24.1e-28383.88DYW_deaminase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_3G1346... [more]
A0A5A7U1J31.3e-28184.56Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946... [more]
Match NameE-valueIdentityDescription
AT4G33170.13.2e-10736.83Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT5G52630.16.5e-10037.89mitochondrial RNAediting factor 1 [more]
AT2G03880.12.4e-9436.18Pentatricopeptide repeat (PPR) superfamily protein [more]
AT3G24000.13.1e-9436.30Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT5G46460.12.0e-9334.12Pentatricopeptide repeat (PPR) superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 174..203
e-value: 0.004
score: 17.3
coord: 147..170
e-value: 0.43
score: 10.9
coord: 247..268
e-value: 0.0071
score: 16.5
coord: 348..368
e-value: 0.069
score: 13.4
coord: 46..69
e-value: 0.8
score: 10.1
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 274..322
e-value: 6.9E-10
score: 39.0
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 278..310
e-value: 5.5E-5
score: 21.1
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 274..308
score: 10.588674
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 172..206
score: 9.054091
IPR032867DYW domainPFAMPF14432DYW_deaminasecoord: 448..572
e-value: 2.1E-37
score: 127.8
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 127..226
e-value: 6.4E-12
score: 47.2
coord: 32..124
e-value: 1.9E-7
score: 32.5
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 272..473
e-value: 3.6E-32
score: 113.9
NoneNo IPR availablePANTHERPTHR47928REPEAT-CONTAINING PROTEIN, PUTATIVE-RELATEDcoord: 25..569
NoneNo IPR availablePANTHERPTHR47928:SF49PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEINcoord: 25..569

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
HG10012837.1HG10012837.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0005515 protein binding
molecular_function GO:0008270 zinc ion binding