HG10019709 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
NameHG10019709
Typegene
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionPentatricopeptide repeat-containing protein
LocationChr04: 24761874 .. 24763424 (-)
RNA-Seq ExpressionHG10019709
SyntenyHG10019709
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGTTCGGCTCCGGATCTCTGCCGTCCATCAAATTTTTCCCCCCAACGCCCACAATTACAATTCAAATCCCAATTTCCTCTCCACAAAGCATCAATTCCTCTCCCTTCTCAACCTCTGTTCCTCAACGAATCACCTATCTCAAATCCACGCGCAAATCCTCGTCTCTGGCCTCCAAAATGACCCATTTCTCACCACTGAACTCCTCCGCATCGCTGCTCTATCGCCTTCCAGAAATCTCAGCTATGGCCGCTCTCTCCTCTTCCATTCCCACTTTCATTCCGCCCCTTTGCCATGGAATCTCATCATCAGGGGATATGCCTCGAGTGATTCTCCACAAGAGGCCATTTCGGTATTTGGGGAAATGCGAAGACGAGGAATCAGACCCAATAACCTCACTTTCCCCTTCCTTCTCAAGGCCTGTGCGACGCTCGCGACTCTCCAACAAGGTAAGCAGTTTCATGCTGTTGCCATAAAGTGTGGTTTAGATTTAGATGTTTATGTTCGGAACACTCTGATTAATTTCTATGGGTCCTGCAAAAGAATGTCTGGTGCACGAAAGATGTTCGACGAAATGTCTGAAAGAACTTTAGTTTCGTGGAATGCGATTATTACAGCGTGTGTTGAGAATTATTGCTTTGATGAAGCGATTGACTACTTTTTGAAAATGGGAAATCATGGTTTTGAGCCGGATGAAACTACAATGGTGGTTATATTATCAGCTTGTGCAGAGCTTGGTAACTTGAGCTTAGGGAGATGGGTTCATTCTCAAGTGGTGGGAAGAGGGATGGTTTTGAATGTACAATTGGGCACTGCCTTCGTTGATATGTATGCAAAATCTGGCGATGTGGGATGTGCTAGGCTTGTATTCAATTGTTTGAAACAAAGAAGTGTGTGGACATGGAGTGCAATGATTTTGGGGCTTGCCCAACATGGATTTGCCAATGAAGCCATCAAACTTTTCACAAATATGATGAGCTCCTCTGTAGTCCCTAACTATGTCACTTTCGTTGGTGTCCTATGTGCCTGCAGCCATGCCGGATTGGTGGATAAAAGCTACCACTACTTCCACATTATGGAGAGAGTTTACGGGATTAAGCCGATGATGATACATTACGGGTCGCTCGTGGATGTTTTAGGACGTGCAGGTCGAGTCAAGGAAGCTTATGAGCTCATTATGAGCATGCCTGTGGAGCCTGATCCAATTGTGTGGAGGACATTGCTGAGTGCGTGCAGTGCTCGTGACGTGGATGGCGGGGCTCAGATTGCAGAGGAGGCAAGGAAGAGGCTGCTTGAGCTTGAACCGAAGAGGGGCGGGAATGTGGTGATTGTTGCAAATAAGTTTGCTGAAGTTGGGATGTGGAAGCAGGCAGCAGATTGTCGTAGGGCGATGAAAGATAGAGGAATAAAAAAGATGGCTGGGGAGAGTTGCATTGAATTGGGTGGCTCTTTGCGTAAGTTCTTTTCAGGTTTTGATGCTCGTGCTGCTACTGATGGCATTTACGATTTGCTTGATGGATTGAACCTGCATATGCAAATGATAAACTTCTGA

mRNA sequence

ATGGTTCGGCTCCGGATCTCTGCCGTCCATCAAATTTTTCCCCCCAACGCCCACAATTACAATTCAAATCCCAATTTCCTCTCCACAAAGCATCAATTCCTCTCCCTTCTCAACCTCTGTTCCTCAACGAATCACCTATCTCAAATCCACGCGCAAATCCTCGTCTCTGGCCTCCAAAATGACCCATTTCTCACCACTGAACTCCTCCGCATCGCTGCTCTATCGCCTTCCAGAAATCTCAGCTATGGCCGCTCTCTCCTCTTCCATTCCCACTTTCATTCCGCCCCTTTGCCATGGAATCTCATCATCAGGGGATATGCCTCGAGTGATTCTCCACAAGAGGCCATTTCGGTATTTGGGGAAATGCGAAGACGAGGAATCAGACCCAATAACCTCACTTTCCCCTTCCTTCTCAAGGCCTGTGCGACGCTCGCGACTCTCCAACAAGCGTGTGTTGAGAATTATTGCTTTGATGAAGCGATTGACTACTTTTTGAAAATGGGAAATCATGGTTTTGAGCCGGATGAAACTACAATGGTGGTTATATTATCAGCTTGTGCAGAGCTTGGTAACTTGAGCTTAGGGAGATGGGTTCATTCTCAAGTGGTGGGAAGAGGGATGGTTTTGAATGTACAATTGGGCACTGCCTTCGTTGATATGTATGCAAAATCTGGCGATGTGGGATGTGCTAGGCTTGTATTCAATTGTTTGAAACAAAGAAGTGTGTGGACATGGAGTGCAATGATTTTGGGGCTTGCCCAACATGGATTTGCCAATGAAGCCATCAAACTTTTCACAAATATGATGAGCTCCTCTGTAGTCCCTAACTATGTCACTTTCGTTGGTGTCCTATGTGCCTGCAGCCATGCCGGATTGGTGGATAAAAGCTACCACTACTTCCACATTATGGAGAGAGTTTACGGGATTAAGCCGATGATGATACATTACGGGTCGCTCGTGGATGTTTTAGGACGTGCAGGTCGAGTCAAGGAAGCTTATGAGCTCATTATGAGCATGCCTGTGGAGCCTGATCCAATTGTGTGGAGGACATTGCTGAGTGCGTGCAGTGCTCGTGACGTGGATGGCGGGGCTCAGATTGCAGAGGAGGCAAGGAAGAGGCTGCTTGAGCTTGAACCGAAGAGGGGCGGGAATGTGGTGATTGTTGCAAATAAGTTTGCTGAAGTTGGGATGTGGAAGCAGGCAGCAGATTGTCGTAGGGCGATGAAAGATAGAGGAATAAAAAAGATGGCTGGGGAGAGTTGCATTGAATTGGGTGGCTCTTTGCGTAAGTTCTTTTCAGGTTTTGATGCTCGTGCTGCTACTGATGGCATTTACGATTTGCTTGATGGATTGAACCTGCATATGCAAATGATAAACTTCTGA

Coding sequence (CDS)

ATGGTTCGGCTCCGGATCTCTGCCGTCCATCAAATTTTTCCCCCCAACGCCCACAATTACAATTCAAATCCCAATTTCCTCTCCACAAAGCATCAATTCCTCTCCCTTCTCAACCTCTGTTCCTCAACGAATCACCTATCTCAAATCCACGCGCAAATCCTCGTCTCTGGCCTCCAAAATGACCCATTTCTCACCACTGAACTCCTCCGCATCGCTGCTCTATCGCCTTCCAGAAATCTCAGCTATGGCCGCTCTCTCCTCTTCCATTCCCACTTTCATTCCGCCCCTTTGCCATGGAATCTCATCATCAGGGGATATGCCTCGAGTGATTCTCCACAAGAGGCCATTTCGGTATTTGGGGAAATGCGAAGACGAGGAATCAGACCCAATAACCTCACTTTCCCCTTCCTTCTCAAGGCCTGTGCGACGCTCGCGACTCTCCAACAAGCGTGTGTTGAGAATTATTGCTTTGATGAAGCGATTGACTACTTTTTGAAAATGGGAAATCATGGTTTTGAGCCGGATGAAACTACAATGGTGGTTATATTATCAGCTTGTGCAGAGCTTGGTAACTTGAGCTTAGGGAGATGGGTTCATTCTCAAGTGGTGGGAAGAGGGATGGTTTTGAATGTACAATTGGGCACTGCCTTCGTTGATATGTATGCAAAATCTGGCGATGTGGGATGTGCTAGGCTTGTATTCAATTGTTTGAAACAAAGAAGTGTGTGGACATGGAGTGCAATGATTTTGGGGCTTGCCCAACATGGATTTGCCAATGAAGCCATCAAACTTTTCACAAATATGATGAGCTCCTCTGTAGTCCCTAACTATGTCACTTTCGTTGGTGTCCTATGTGCCTGCAGCCATGCCGGATTGGTGGATAAAAGCTACCACTACTTCCACATTATGGAGAGAGTTTACGGGATTAAGCCGATGATGATACATTACGGGTCGCTCGTGGATGTTTTAGGACGTGCAGGTCGAGTCAAGGAAGCTTATGAGCTCATTATGAGCATGCCTGTGGAGCCTGATCCAATTGTGTGGAGGACATTGCTGAGTGCGTGCAGTGCTCGTGACGTGGATGGCGGGGCTCAGATTGCAGAGGAGGCAAGGAAGAGGCTGCTTGAGCTTGAACCGAAGAGGGGCGGGAATGTGGTGATTGTTGCAAATAAGTTTGCTGAAGTTGGGATGTGGAAGCAGGCAGCAGATTGTCGTAGGGCGATGAAAGATAGAGGAATAAAAAAGATGGCTGGGGAGAGTTGCATTGAATTGGGTGGCTCTTTGCGTAAGTTCTTTTCAGGTTTTGATGCTCGTGCTGCTACTGATGGCATTTACGATTTGCTTGATGGATTGAACCTGCATATGCAAATGATAAACTTCTGA

Protein sequence

MVRLRISAVHQIFPPNAHNYNSNPNFLSTKHQFLSLLNLCSSTNHLSQIHAQILVSGLQNDPFLTTELLRIAALSPSRNLSYGRSLLFHSHFHSAPLPWNLIIRGYASSDSPQEAISVFGEMRRRGIRPNNLTFPFLLKACATLATLQQACVENYCFDEAIDYFLKMGNHGFEPDETTMVVILSACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAFVDMYAKSGDVGCARLVFNCLKQRSVWTWSAMILGLAQHGFANEAIKLFTNMMSSSVVPNYVTFVGVLCACSHAGLVDKSYHYFHIMERVYGIKPMMIHYGSLVDVLGRAGRVKEAYELIMSMPVEPDPIVWRTLLSACSARDVDGGAQIAEEARKRLLELEPKRGGNVVIVANKFAEVGMWKQAADCRRAMKDRGIKKMAGESCIELGGSLRKFFSGFDARAATDGIYDLLDGLNLHMQMINF
Homology
BLAST of HG10019709 vs. NCBI nr
Match: XP_038903331.1 (pentatricopeptide repeat-containing protein At2g36730 isoform X4 [Benincasa hispida])

HSP 1 Score: 855.1 bits (2208), Expect = 2.7e-244
Identity = 428/515 (83.11%), Postives = 450/515 (87.38%), Query Frame = 0

Query: 1   MVRLRISAVHQIFPPNAHNYNSNPNFLSTKHQFLSLLNLCSSTNHLSQIHAQILVSGLQN 60
           MVRLRISA+HQIFPP+AHNYNSN NFLS KHQFLSLLNLCSSTNHL +IHAQILVSGLQN
Sbjct: 1   MVRLRISAIHQIFPPSAHNYNSNLNFLSRKHQFLSLLNLCSSTNHLFEIHAQILVSGLQN 60

Query: 61  DPFLTTELLRIAALSPSRNLSYGRSLLFHSHFHSAPLPWNLIIRGYASSDSPQEAISVFG 120
           DPFL+TELLRIAALSPSRNLSYGRSLLFH HFHSAPLPWNLIIRGYASSDSPQEAI VFG
Sbjct: 61  DPFLSTELLRIAALSPSRNLSYGRSLLFHCHFHSAPLPWNLIIRGYASSDSPQEAIWVFG 120

Query: 121 EMRRRGIRPNNLTFPFLLKACATLATLQQ------------------------------- 180
           EMRRRGIRPNNLTFPFLLKACATLATLQ+                               
Sbjct: 121 EMRRRGIRPNNLTFPFLLKACATLATLQEGKQFHAVAIKCGLDLDVYVRNTLINFYGSCK 180

Query: 181 -------------------------ACVENYCFDEAIDYFLKMGNHGFEPDETTMVVILS 240
                                    ACVEN+CFDEAID+FLKMG HGFEPDETTMVVILS
Sbjct: 181 RMSGARKVFDEMSERTLVSWNAVITACVENFCFDEAIDFFLKMGKHGFEPDETTMVVILS 240

Query: 241 ACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAFVDMYAKSGDVGCARLVFNCLKQRSVWT 300
           ACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAF+DMYAKSGDVGCARLVFNCLKQRSVWT
Sbjct: 241 ACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAFIDMYAKSGDVGCARLVFNCLKQRSVWT 300

Query: 301 WSAMILGLAQHGFANEAIKLFTNMMSSSVVPNYVTFVGVLCACSHAGLVDKSYHYFHIME 360
           WSAMILGLAQHG+ANEAI+LFT+MMSSSVVPNYVTF+GVLCACSHA LVDKSYHYF+IME
Sbjct: 301 WSAMILGLAQHGYANEAIELFTHMMSSSVVPNYVTFIGVLCACSHARLVDKSYHYFNIME 360

Query: 361 RVYGIKPMMIHYGSLVDVLGRAGRVKEAYELIMSMPVEPDPIVWRTLLSACSARDVDGGA 420
           RVYGIKPMMIHYGS+VDVLGRAG+VKEAYELIMSMP+EPDPIVWRTLLSAC+ RDVDGGA
Sbjct: 361 RVYGIKPMMIHYGSMVDVLGRAGQVKEAYELIMSMPMEPDPIVWRTLLSACTGRDVDGGA 420

Query: 421 QIAEEARKRLLELEPKRGGNVVIVANKFAEVGMWKQAADCRRAMKDRGIKKMAGESCIEL 460
           Q+AEEARKRLLELEPKRGGNVV+VANKFAEVGMWKQAADCRRAMKDRGIKKMAGESCIEL
Sbjct: 421 QVAEEARKRLLELEPKRGGNVVMVANKFAEVGMWKQAADCRRAMKDRGIKKMAGESCIEL 480

BLAST of HG10019709 vs. NCBI nr
Match: XP_038903330.1 (pentatricopeptide repeat-containing protein At2g36730 isoform X3 [Benincasa hispida])

HSP 1 Score: 853.6 bits (2204), Expect = 7.9e-244
Identity = 428/513 (83.43%), Postives = 448/513 (87.33%), Query Frame = 0

Query: 1   MVRLRISAVHQIFPPNAHNYNSNPNFLSTKHQFLSLLNLCSSTNHLSQIHAQILVSGLQN 60
           MVRLRISA+HQIFPP+AHNYNSN NFLS KHQFLSLLNLCSSTNHL +IHAQILVSGLQN
Sbjct: 1   MVRLRISAIHQIFPPSAHNYNSNLNFLSRKHQFLSLLNLCSSTNHLFEIHAQILVSGLQN 60

Query: 61  DPFLTTELLRIAALSPSRNLSYGRSLLFHSHFHSAPLPWNLIIRGYASSDSPQEAISVFG 120
           DPFL+TELLRIAALSPSRNLSYGRSLLFH HFHSAPLPWNLIIRGYASSDSPQEAI VFG
Sbjct: 61  DPFLSTELLRIAALSPSRNLSYGRSLLFHCHFHSAPLPWNLIIRGYASSDSPQEAIWVFG 120

Query: 121 EMRRRGIRPNNLTFPFLLKACATLATLQQ------------------------------- 180
           EMRRRGIRPNNLTFPFLLKACATLATLQ+                               
Sbjct: 121 EMRRRGIRPNNLTFPFLLKACATLATLQEGKQFHAVAIKCGLDLDVYVRNTLINFYGSCK 180

Query: 181 -------------------------ACVENYCFDEAIDYFLKMGNHGFEPDETTMVVILS 240
                                    ACVEN+CFDEAID+FLKMG HGFEPDETTMVVILS
Sbjct: 181 RMSGARKVFDEMSERTLVSWNAVITACVENFCFDEAIDFFLKMGKHGFEPDETTMVVILS 240

Query: 241 ACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAFVDMYAKSGDVGCARLVFNCLKQRSVWT 300
           ACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAF+DMYAKSGDVGCARLVFNCLKQRSVWT
Sbjct: 241 ACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAFIDMYAKSGDVGCARLVFNCLKQRSVWT 300

Query: 301 WSAMILGLAQHGFANEAIKLFTNMMSSSVVPNYVTFVGVLCACSHAGLVDKSYHYFHIME 360
           WSAMILGLAQHG+ANEAI+LFT+MMSSSVVPNYVTF+GVLCACSHA LVDKSYHYF+IME
Sbjct: 301 WSAMILGLAQHGYANEAIELFTHMMSSSVVPNYVTFIGVLCACSHARLVDKSYHYFNIME 360

Query: 361 RVYGIKPMMIHYGSLVDVLGRAGRVKEAYELIMSMPVEPDPIVWRTLLSACSARDVDGGA 420
           RVYGIKPMMIHYGS+VDVLGRAG+VKEAYELIMSMP+EPDPIVWRTLLSAC+ RDVDGGA
Sbjct: 361 RVYGIKPMMIHYGSMVDVLGRAGQVKEAYELIMSMPMEPDPIVWRTLLSACTGRDVDGGA 420

Query: 421 QIAEEARKRLLELEPKRGGNVVIVANKFAEVGMWKQAADCRRAMKDRGIKKMAGESCIEL 458
           Q+AEEARKRLLELEPKRGGNVV+VANKFAEVGMWKQAADCRRAMKDRGIKKMAGESCIEL
Sbjct: 421 QVAEEARKRLLELEPKRGGNVVMVANKFAEVGMWKQAADCRRAMKDRGIKKMAGESCIEL 480

BLAST of HG10019709 vs. NCBI nr
Match: XP_038903328.1 (pentatricopeptide repeat-containing protein At2g36730 isoform X1 [Benincasa hispida])

HSP 1 Score: 853.6 bits (2204), Expect = 7.9e-244
Identity = 428/513 (83.43%), Postives = 448/513 (87.33%), Query Frame = 0

Query: 1   MVRLRISAVHQIFPPNAHNYNSNPNFLSTKHQFLSLLNLCSSTNHLSQIHAQILVSGLQN 60
           MVRLRISA+HQIFPP+AHNYNSN NFLS KHQFLSLLNLCSSTNHL +IHAQILVSGLQN
Sbjct: 1   MVRLRISAIHQIFPPSAHNYNSNLNFLSRKHQFLSLLNLCSSTNHLFEIHAQILVSGLQN 60

Query: 61  DPFLTTELLRIAALSPSRNLSYGRSLLFHSHFHSAPLPWNLIIRGYASSDSPQEAISVFG 120
           DPFL+TELLRIAALSPSRNLSYGRSLLFH HFHSAPLPWNLIIRGYASSDSPQEAI VFG
Sbjct: 61  DPFLSTELLRIAALSPSRNLSYGRSLLFHCHFHSAPLPWNLIIRGYASSDSPQEAIWVFG 120

Query: 121 EMRRRGIRPNNLTFPFLLKACATLATLQQ------------------------------- 180
           EMRRRGIRPNNLTFPFLLKACATLATLQ+                               
Sbjct: 121 EMRRRGIRPNNLTFPFLLKACATLATLQEGKQFHAVAIKCGLDLDVYVRNTLINFYGSCK 180

Query: 181 -------------------------ACVENYCFDEAIDYFLKMGNHGFEPDETTMVVILS 240
                                    ACVEN+CFDEAID+FLKMG HGFEPDETTMVVILS
Sbjct: 181 RMSGARKVFDEMSERTLVSWNAVITACVENFCFDEAIDFFLKMGKHGFEPDETTMVVILS 240

Query: 241 ACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAFVDMYAKSGDVGCARLVFNCLKQRSVWT 300
           ACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAF+DMYAKSGDVGCARLVFNCLKQRSVWT
Sbjct: 241 ACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAFIDMYAKSGDVGCARLVFNCLKQRSVWT 300

Query: 301 WSAMILGLAQHGFANEAIKLFTNMMSSSVVPNYVTFVGVLCACSHAGLVDKSYHYFHIME 360
           WSAMILGLAQHG+ANEAI+LFT+MMSSSVVPNYVTF+GVLCACSHA LVDKSYHYF+IME
Sbjct: 301 WSAMILGLAQHGYANEAIELFTHMMSSSVVPNYVTFIGVLCACSHARLVDKSYHYFNIME 360

Query: 361 RVYGIKPMMIHYGSLVDVLGRAGRVKEAYELIMSMPVEPDPIVWRTLLSACSARDVDGGA 420
           RVYGIKPMMIHYGS+VDVLGRAG+VKEAYELIMSMP+EPDPIVWRTLLSAC+ RDVDGGA
Sbjct: 361 RVYGIKPMMIHYGSMVDVLGRAGQVKEAYELIMSMPMEPDPIVWRTLLSACTGRDVDGGA 420

Query: 421 QIAEEARKRLLELEPKRGGNVVIVANKFAEVGMWKQAADCRRAMKDRGIKKMAGESCIEL 458
           Q+AEEARKRLLELEPKRGGNVV+VANKFAEVGMWKQAADCRRAMKDRGIKKMAGESCIEL
Sbjct: 421 QVAEEARKRLLELEPKRGGNVVMVANKFAEVGMWKQAADCRRAMKDRGIKKMAGESCIEL 480

BLAST of HG10019709 vs. NCBI nr
Match: XP_038903329.1 (pentatricopeptide repeat-containing protein At2g36730 isoform X2 [Benincasa hispida])

HSP 1 Score: 853.6 bits (2204), Expect = 7.9e-244
Identity = 428/513 (83.43%), Postives = 448/513 (87.33%), Query Frame = 0

Query: 1   MVRLRISAVHQIFPPNAHNYNSNPNFLSTKHQFLSLLNLCSSTNHLSQIHAQILVSGLQN 60
           MVRLRISA+HQIFPP+AHNYNSN NFLS KHQFLSLLNLCSSTNHL +IHAQILVSGLQN
Sbjct: 1   MVRLRISAIHQIFPPSAHNYNSNLNFLSRKHQFLSLLNLCSSTNHLFEIHAQILVSGLQN 60

Query: 61  DPFLTTELLRIAALSPSRNLSYGRSLLFHSHFHSAPLPWNLIIRGYASSDSPQEAISVFG 120
           DPFL+TELLRIAALSPSRNLSYGRSLLFH HFHSAPLPWNLIIRGYASSDSPQEAI VFG
Sbjct: 61  DPFLSTELLRIAALSPSRNLSYGRSLLFHCHFHSAPLPWNLIIRGYASSDSPQEAIWVFG 120

Query: 121 EMRRRGIRPNNLTFPFLLKACATLATLQQ------------------------------- 180
           EMRRRGIRPNNLTFPFLLKACATLATLQ+                               
Sbjct: 121 EMRRRGIRPNNLTFPFLLKACATLATLQEGKQFHAVAIKCGLDLDVYVRNTLINFYGSCK 180

Query: 181 -------------------------ACVENYCFDEAIDYFLKMGNHGFEPDETTMVVILS 240
                                    ACVEN+CFDEAID+FLKMG HGFEPDETTMVVILS
Sbjct: 181 RMSGARKVFDEMSERTLVSWNAVITACVENFCFDEAIDFFLKMGKHGFEPDETTMVVILS 240

Query: 241 ACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAFVDMYAKSGDVGCARLVFNCLKQRSVWT 300
           ACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAF+DMYAKSGDVGCARLVFNCLKQRSVWT
Sbjct: 241 ACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAFIDMYAKSGDVGCARLVFNCLKQRSVWT 300

Query: 301 WSAMILGLAQHGFANEAIKLFTNMMSSSVVPNYVTFVGVLCACSHAGLVDKSYHYFHIME 360
           WSAMILGLAQHG+ANEAI+LFT+MMSSSVVPNYVTF+GVLCACSHA LVDKSYHYF+IME
Sbjct: 301 WSAMILGLAQHGYANEAIELFTHMMSSSVVPNYVTFIGVLCACSHARLVDKSYHYFNIME 360

Query: 361 RVYGIKPMMIHYGSLVDVLGRAGRVKEAYELIMSMPVEPDPIVWRTLLSACSARDVDGGA 420
           RVYGIKPMMIHYGS+VDVLGRAG+VKEAYELIMSMP+EPDPIVWRTLLSAC+ RDVDGGA
Sbjct: 361 RVYGIKPMMIHYGSMVDVLGRAGQVKEAYELIMSMPMEPDPIVWRTLLSACTGRDVDGGA 420

Query: 421 QIAEEARKRLLELEPKRGGNVVIVANKFAEVGMWKQAADCRRAMKDRGIKKMAGESCIEL 458
           Q+AEEARKRLLELEPKRGGNVV+VANKFAEVGMWKQAADCRRAMKDRGIKKMAGESCIEL
Sbjct: 421 QVAEEARKRLLELEPKRGGNVVMVANKFAEVGMWKQAADCRRAMKDRGIKKMAGESCIEL 480

BLAST of HG10019709 vs. NCBI nr
Match: XP_038903332.1 (pentatricopeptide repeat-containing protein At2g36730 isoform X5 [Benincasa hispida])

HSP 1 Score: 853.6 bits (2204), Expect = 7.9e-244
Identity = 428/513 (83.43%), Postives = 448/513 (87.33%), Query Frame = 0

Query: 1   MVRLRISAVHQIFPPNAHNYNSNPNFLSTKHQFLSLLNLCSSTNHLSQIHAQILVSGLQN 60
           MVRLRISA+HQIFPP+AHNYNSN NFLS KHQFLSLLNLCSSTNHL +IHAQILVSGLQN
Sbjct: 1   MVRLRISAIHQIFPPSAHNYNSNLNFLSRKHQFLSLLNLCSSTNHLFEIHAQILVSGLQN 60

Query: 61  DPFLTTELLRIAALSPSRNLSYGRSLLFHSHFHSAPLPWNLIIRGYASSDSPQEAISVFG 120
           DPFL+TELLRIAALSPSRNLSYGRSLLFH HFHSAPLPWNLIIRGYASSDSPQEAI VFG
Sbjct: 61  DPFLSTELLRIAALSPSRNLSYGRSLLFHCHFHSAPLPWNLIIRGYASSDSPQEAIWVFG 120

Query: 121 EMRRRGIRPNNLTFPFLLKACATLATLQQ------------------------------- 180
           EMRRRGIRPNNLTFPFLLKACATLATLQ+                               
Sbjct: 121 EMRRRGIRPNNLTFPFLLKACATLATLQEGKQFHAVAIKCGLDLDVYVRNTLINFYGSCK 180

Query: 181 -------------------------ACVENYCFDEAIDYFLKMGNHGFEPDETTMVVILS 240
                                    ACVEN+CFDEAID+FLKMG HGFEPDETTMVVILS
Sbjct: 181 RMSGARKVFDEMSERTLVSWNAVITACVENFCFDEAIDFFLKMGKHGFEPDETTMVVILS 240

Query: 241 ACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAFVDMYAKSGDVGCARLVFNCLKQRSVWT 300
           ACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAF+DMYAKSGDVGCARLVFNCLKQRSVWT
Sbjct: 241 ACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAFIDMYAKSGDVGCARLVFNCLKQRSVWT 300

Query: 301 WSAMILGLAQHGFANEAIKLFTNMMSSSVVPNYVTFVGVLCACSHAGLVDKSYHYFHIME 360
           WSAMILGLAQHG+ANEAI+LFT+MMSSSVVPNYVTF+GVLCACSHA LVDKSYHYF+IME
Sbjct: 301 WSAMILGLAQHGYANEAIELFTHMMSSSVVPNYVTFIGVLCACSHARLVDKSYHYFNIME 360

Query: 361 RVYGIKPMMIHYGSLVDVLGRAGRVKEAYELIMSMPVEPDPIVWRTLLSACSARDVDGGA 420
           RVYGIKPMMIHYGS+VDVLGRAG+VKEAYELIMSMP+EPDPIVWRTLLSAC+ RDVDGGA
Sbjct: 361 RVYGIKPMMIHYGSMVDVLGRAGQVKEAYELIMSMPMEPDPIVWRTLLSACTGRDVDGGA 420

Query: 421 QIAEEARKRLLELEPKRGGNVVIVANKFAEVGMWKQAADCRRAMKDRGIKKMAGESCIEL 458
           Q+AEEARKRLLELEPKRGGNVV+VANKFAEVGMWKQAADCRRAMKDRGIKKMAGESCIEL
Sbjct: 421 QVAEEARKRLLELEPKRGGNVVMVANKFAEVGMWKQAADCRRAMKDRGIKKMAGESCIEL 480

BLAST of HG10019709 vs. ExPASy Swiss-Prot
Match: Q9ZQA1 (Pentatricopeptide repeat-containing protein At2g36730 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E44 PE=3 SV=1)

HSP 1 Score: 482.6 bits (1241), Expect = 4.8e-135
Identity = 256/487 (52.57%), Postives = 325/487 (66.74%), Query Frame = 0

Query: 20  YNSNPNFLSTKHQFLSLLNLCSSTNHLSQIHAQILVSGLQNDPFLTTELLRIAALSPSRN 79
           ++S+  F S KHQ L  L LCSS  HL QIH QI +S LQND F+ +EL+R+++LS +++
Sbjct: 3   WSSDSCFKSRKHQCLIFLKLCSSIKHLLQIHGQIHLSSLQNDSFIISELVRVSSLSLAKD 62

Query: 80  LSYGRSLLFHSHFHSAPLPWNLIIRGYASSDSPQEAISVFGEMRRRGIRPNNLTFPFLLK 139
           L++ R+LL HS   S P  WN++ RGY+SSDSP E+I V+ EM+RRGI+PN LTFPFLLK
Sbjct: 63  LAFARTLLLHSS-DSTPSTWNMLSRGYSSSDSPVESIWVYSEMKRRGIKPNKLTFPFLLK 122

Query: 140 ACATLA------------------------------------------------------ 199
           ACA+                                                        
Sbjct: 123 ACASFLGLTAGRQIQVEVLKHGFDFDVYVGNNLIHLYGTCKKTSDARKVFDEMTERNVVS 182

Query: 200 --TLQQACVENYCFDEAIDYFLKMGNHGFEPDETTMVVILSACAELGNLSLGRWVHSQVV 259
             ++  A VEN   +   + F +M    F PDETTMVV+LSAC   GNLSLG+ VHSQV+
Sbjct: 183 WNSIMTALVENGKLNLVFECFCEMIGKRFCPDETTMVVLLSACG--GNLSLGKLVHSQVM 242

Query: 260 GRGMVLNVQLGTAFVDMYAKSGDVGCARLVFNCLKQRSVWTWSAMILGLAQHGFANEAIK 319
            R + LN +LGTA VDMYAKSG +  ARLVF  +  ++VWTWSAMI+GLAQ+GFA EA++
Sbjct: 243 VRELELNCRLGTALVDMYAKSGGLEYARLVFERMVDKNVWTWSAMIVGLAQYGFAEEALQ 302

Query: 320 LFTNMM-SSSVVPNYVTFVGVLCACSHAGLVDKSYHYFHIMERVYGIKPMMIHYGSLVDV 379
           LF+ MM  SSV PNYVTF+GVLCACSH GLVD  Y YFH ME+++ IKPMMIHYG++VD+
Sbjct: 303 LFSKMMKESSVRPNYVTFLGVLCACSHTGLVDDGYKYFHEMEKIHKIKPMMIHYGAMVDI 362

Query: 380 LGRAGRVKEAYELIMSMPVEPDPIVWRTLLSACSARDVDGGAQIAEEARKRLLELEPKRG 439
           LGRAGR+ EAY+ I  MP EPD +VWRTLLSACS    +    I E+ +KRL+ELEPKR 
Sbjct: 363 LGRAGRLNEAYDFIKKMPFEPDAVVWRTLLSACSIHHDEDDEGIGEKVKKRLIELEPKRS 422

Query: 440 GNVVIVANKFAEVGMWKQAADCRRAMKDRGIKKMAGESCIELGGSLRKFFSGFDARAATD 450
           GN+VIVAN+FAE  MW +AA+ RR MK+  +KK+AGESC+ELGGS  +FFSG+D R+   
Sbjct: 423 GNLVIVANRFAEARMWAEAAEVRRVMKETKMKKIAGESCLELGGSFHRFFSGYDPRSEYV 482

BLAST of HG10019709 vs. ExPASy Swiss-Prot
Match: Q8LK93 (Pentatricopeptide repeat-containing protein At2g02980, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H26 PE=2 SV=2)

HSP 1 Score: 262.3 bits (669), Expect = 1.0e-68
Identity = 158/497 (31.79%), Postives = 261/497 (52.52%), Query Frame = 0

Query: 21  NSNPNFLSTKHQFLSLLNLCSSTNHLSQIHAQILVSGLQNDPFLTTELLRIAALSPSR-N 80
           +S  + ++T++  L L++ C+S   L QI A  + S +++  F+  +L+     SP+  +
Sbjct: 21  HSKIDTVNTQNPIL-LISKCNSLRELMQIQAYAIKSHIEDVSFV-AKLINFCTESPTESS 80

Query: 81  LSYGRSLLFHSHFHSAPLPWNLIIRGYASSDSPQEAISVFGEMRRRGIRPNNLTFPFLLK 140
           +SY R  LF +      + +N + RGY+   +P E  S+F E+   GI P+N TFP LLK
Sbjct: 81  MSYARH-LFEAMSEPDIVIFNSMARGYSRFTNPLEVFSLFVEILEDGILPDNYTFPSLLK 140

Query: 141 ACATLATLQQA------------------------------------CVEN-------YC 200
           ACA    L++                                     CV +        C
Sbjct: 141 ACAVAKALEEGRQLHCLSMKLGLDDNVYVCPTLINMYTECEDVDSARCVFDRIVEPCVVC 200

Query: 201 F-------------DEAIDYFLKMGNHGFEPDETTMVVILSACAELGNLSLGRWVHSQVV 260
           +             +EA+  F +M     +P+E T++ +LS+CA LG+L LG+W+H    
Sbjct: 201 YNAMITGYARRNRPNEALSLFREMQGKYLKPNEITLLSVLSSCALLGSLDLGKWIHKYAK 260

Query: 261 GRGMVLNVQLGTAFVDMYAKSGDVGCARLVFNCLKQRSVWTWSAMILGLAQHGFANEAIK 320
                  V++ TA +DM+AK G +  A  +F  ++ +    WSAMI+  A HG A +++ 
Sbjct: 261 KHSFCKYVKVNTALIDMFAKCGSLDDAVSIFEKMRYKDTQAWSAMIVAYANHGKAEKSML 320

Query: 321 LFTNMMSSSVVPNYVTFVGVLCACSHAGLVDKSYHYFHIMERVYGIKPMMIHYGSLVDVL 380
           +F  M S +V P+ +TF+G+L ACSH G V++   YF  M   +GI P + HYGS+VD+L
Sbjct: 321 MFERMRSENVQPDEITFLGLLNACSHTGRVEEGRKYFSQMVSKFGIVPSIKHYGSMVDLL 380

Query: 381 GRAGRVKEAYELIMSMPVEPDPIVWRTLLSACSARDVDGGAQIAEEARKRLLELEPKRGG 440
            RAG +++AYE I  +P+ P P++WR LL+ACS+ +      +AE+  +R+ EL+   GG
Sbjct: 381 SRAGNLEDAYEFIDKLPISPTPMLWRILLAACSSHN---NLDLAEKVSERIFELDDSHGG 440

Query: 441 NVVIVANKFAEVGMWKQAADCRRAMKDRGIKKMAGESCIELGGSLRKFFSGFDARAATDG 461
           + VI++N +A    W+     R+ MKDR   K+ G S IE+   + +FFSG   ++AT  
Sbjct: 441 DYVILSNLYARNKKWEYVDSLRKVMKDRKAVKVPGCSSIEVNNVVHEFFSGDGVKSATTK 500

BLAST of HG10019709 vs. ExPASy Swiss-Prot
Match: Q9FMA1 (Pentatricopeptide repeat-containing protein At5g56310 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E13 PE=2 SV=1)

HSP 1 Score: 257.7 bits (657), Expect = 2.5e-67
Identity = 150/413 (36.32%), Postives = 227/413 (54.96%), Query Frame = 0

Query: 48  QIHAQILVSGLQNDPFLTTELLRI----AALSPSRNLSYGRSLLFHSHFHSAPLPWNLII 107
           QIH Q++V G  +   + T L+++      L  +R        +F          WN ++
Sbjct: 137 QIHGQVVVFGFDSSVHVVTGLIQMYFSCGGLGDARK-------MFDEMLVKDVNVWNALL 196

Query: 108 RGYASSDSPQEAISVFGEMRRRGIRPNNLTFPFLLKACATLATLQQACVENYCFDEAIDY 167
            GY       EA S+  EM    +R N +++  ++   A                EAI+ 
Sbjct: 197 AGYGKVGEMDEARSLL-EMMPCWVR-NEVSWTCVISGYAKSGRA----------SEAIEV 256

Query: 168 FLKMGNHGFEPDETTMVVILSACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAFVDMYAK 227
           F +M     EPDE T++ +LSACA+LG+L LG  + S V  RGM   V L  A +DMYAK
Sbjct: 257 FQRMLMENVEPDEVTLLAVLSACADLGSLELGERICSYVDHRGMNRAVSLNNAVIDMYAK 316

Query: 228 SGDVGCARLVFNCLKQRSVWTWSAMILGLAQHGFANEAIKLFTNMMSSSVVPNYVTFVGV 287
           SG++  A  VF C+ +R+V TW+ +I GLA HG   EA+ +F  M+ + V PN VTF+ +
Sbjct: 317 SGNITKALDVFECVNERNVVTWTTIIAGLATHGHGAEALAMFNRMVKAGVRPNDVTFIAI 376

Query: 288 LCACSHAGLVDKSYHYFHIMERVYGIKPMMIHYGSLVDVLGRAGRVKEAYELIMSMPVEP 347
           L ACSH G VD     F+ M   YGI P + HYG ++D+LGRAG+++EA E+I SMP + 
Sbjct: 377 LSACSHVGWVDLGKRLFNSMRSKYGIHPNIEHYGCMIDLLGRAGKLREADEVIKSMPFKA 436

Query: 348 DPIVWRTLLSACSARDVDGGAQIAEEARKRLLELEPKRGGNVVIVANKFAEVGMWKQAAD 407
           +  +W +LL   +A +V    ++ E A   L++LEP   GN +++AN ++ +G W ++  
Sbjct: 437 NAAIWGSLL---AASNVHHDLELGERALSELIKLEPNNSGNYMLLANLYSNLGRWDESRM 496

Query: 408 CRRAMKDRGIKKMAGESCIELGGSLRKFFSGFDARAATDGIYDLLDGLNLHMQ 457
            R  MK  G+KKMAGES IE+   + KF SG       + I+++L  ++L +Q
Sbjct: 497 MRNMMKGIGVKKMAGESSIEVENRVYKFISGDLTHPQVERIHEILQEMDLQIQ 527

BLAST of HG10019709 vs. ExPASy Swiss-Prot
Match: Q9CA54 (Pentatricopeptide repeat-containing protein At1g74630 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H71 PE=2 SV=1)

HSP 1 Score: 257.3 bits (656), Expect = 3.3e-67
Identity = 159/524 (30.34%), Postives = 252/524 (48.09%), Query Frame = 0

Query: 31  HQFLSLLNLCSSTNHLSQIHAQILVSGLQNDPFLTTELLRIAALSPSRNLSYGRSLLFHS 90
           H  LSLLN C +   L+QIH   +  G+  D + T +L+   A+S S  L Y R LL   
Sbjct: 6   HHCLSLLNSCKNLRALTQIHGLFIKYGVDTDSYFTGKLILHCAISISDALPYARRLLLCF 65

Query: 91  HFHSAPLPWNLIIRGYASSDSPQEAISVFGEMRRRG-IRPNNLTFPFLLKACATLATLQQ 150
               A   +N ++RGY+ SD P  +++VF EM R+G + P++ +F F++KA     +L+ 
Sbjct: 66  PEPDA-FMFNTLVRGYSESDEPHNSVAVFVEMMRKGFVFPDSFSFAFVIKAVENFRSLRT 125

Query: 151 ------------------------------ACVE-------------------------- 210
                                          CVE                          
Sbjct: 126 GFQMHCQALKHGLESHLFVGTTLIGMYGGCGCVEFARKVFDEMHQPNLVAWNAVITACFR 185

Query: 211 ------------------------------------------------------------ 270
                                                                       
Sbjct: 186 GNDVAGAREIFDKMLVRNHTSWNVMLAGYIKAGELESAKRIFSEMPHRDDVSWSTMIVGI 245

Query: 271 --NYCFDEAIDYFLKMGNHGFEPDETTMVVILSACAELGNLSLGRWVHSQVVGRGMVLNV 330
             N  F+E+  YF ++   G  P+E ++  +LSAC++ G+   G+ +H  V   G    V
Sbjct: 246 AHNGSFNESFLYFRELQRAGMSPNEVSLTGVLSACSQSGSFEFGKILHGFVEKAGYSWIV 305

Query: 331 QLGTAFVDMYAKSGDVGCARLVFNCLKQ-RSVWTWSAMILGLAQHGFANEAIKLFTNMMS 390
            +  A +DMY++ G+V  ARLVF  +++ R + +W++MI GLA HG   EA++LF  M +
Sbjct: 306 SVNNALIDMYSRCGNVPMARLVFEGMQEKRCIVSWTSMIAGLAMHGQGEEAVRLFNEMTA 365

Query: 391 SSVVPNYVTFVGVLCACSHAGLVDKSYHYFHIMERVYGIKPMMIHYGSLVDVLGRAGRVK 435
             V P+ ++F+ +L ACSHAGL+++   YF  M+RVY I+P + HYG +VD+ GR+G+++
Sbjct: 366 YGVTPDGISFISLLHACSHAGLIEEGEDYFSEMKRVYHIEPEIEHYGCMVDLYGRSGKLQ 425

BLAST of HG10019709 vs. ExPASy Swiss-Prot
Match: A8MQA3 (Pentatricopeptide repeat-containing protein At4g21065 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H28 PE=2 SV=2)

HSP 1 Score: 253.8 bits (647), Expect = 3.6e-66
Identity = 152/465 (32.69%), Postives = 243/465 (52.26%), Query Frame = 0

Query: 41  SSTNHLSQIHAQILVSGLQ-NDPFLTTELL-RIAALSPSRNLSYGRSLLFHSHFHSAPLP 100
           SS   L QIHA  +  G+  +D  L   L+  + +L     +SY   +            
Sbjct: 28  SSITKLRQIHAFSIRHGVSISDAELGKHLIFYLVSLPSPPPMSYAHKVFSKIEKPINVFI 87

Query: 101 WNLIIRGYASSDSPQEAISVFGEMRRRG-IRPNNLTFPFLLKACATLATLQ--------- 160
           WN +IRGYA   +   A S++ EMR  G + P+  T+PFL+KA  T+A ++         
Sbjct: 88  WNTLIRGYAEIGNSISAFSLYREMRVSGLVEPDTHTYPFLIKAVTTMADVRLGETIHSVV 147

Query: 161 -----------------------------------------------QACVENYCFDEAI 220
                                                              EN   +EA+
Sbjct: 148 IRSGFGSLIYVQNSLLHLYANCGDVASAYKVFDKMPEKDLVAWNSVINGFAENGKPEEAL 207

Query: 221 DYFLKMGNHGFEPDETTMVVILSACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAFVDMY 280
             + +M + G +PD  T+V +LSACA++G L+LG+ VH  ++  G+  N+      +D+Y
Sbjct: 208 ALYTEMNSKGIKPDGFTIVSLLSACAKIGALTLGKRVHVYMIKVGLTRNLHSSNVLLDLY 267

Query: 281 AKSGDVGCARLVFNCLKQRSVWTWSAMILGLAQHGFANEAIKLFTNMMSS-SVVPNYVTF 340
           A+ G V  A+ +F+ +  ++  +W+++I+GLA +GF  EAI+LF  M S+  ++P  +TF
Sbjct: 268 ARCGRVEEAKTLFDEMVDKNSVSWTSLIVGLAVNGFGKEAIELFKYMESTEGLLPCEITF 327

Query: 341 VGVLCACSHAGLVDKSYHYFHIMERVYGIKPMMIHYGSLVDVLGRAGRVKEAYELIMSMP 400
           VG+L ACSH G+V + + YF  M   Y I+P + H+G +VD+L RAG+VK+AYE I SMP
Sbjct: 328 VGILYACSHCGMVKEGFEYFRRMREEYKIEPRIEHFGCMVDLLARAGQVKKAYEYIKSMP 387

Query: 401 VEPDPIVWRTLLSACSARDVDGGAQIAEEARKRLLELEPKRGGNVVIVANKFAEVGMWKQ 446
           ++P+ ++WRTLL AC+   V G + +AE AR ++L+LEP   G+ V+++N +A    W  
Sbjct: 388 MQPNVVIWRTLLGACT---VHGDSDLAEFARIQILQLEPNHSGDYVLLSNMYASEQRWSD 447

BLAST of HG10019709 vs. ExPASy TrEMBL
Match: A0A0A0K153 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G007910 PE=4 SV=1)

HSP 1 Score: 826.6 bits (2134), Expect = 5.0e-236
Identity = 413/516 (80.04%), Postives = 438/516 (84.88%), Query Frame = 0

Query: 1   MVRLRISAVHQIFPPNAHNYNSNPNFLSTKHQFLSLLNLCSSTNHLSQIHAQILVSGLQN 60
           MVRL ISAVHQ FP N HNY+S P FLSTKHQ LSLLN CSSTNHL +IHAQILVSGLQN
Sbjct: 1   MVRLWISAVHQFFPINVHNYSSKPKFLSTKHQLLSLLNHCSSTNHLFEIHAQILVSGLQN 60

Query: 61  DPFLTTELLRIAALSPSRNLSYGRSLLFHSHFHSAPLPWNLIIRGYASSDSPQEAISVFG 120
           D F TTELLR+AALSPSRNLSYG SLLFH HFHSA +PWN IIRGY+SSDSPQEAIS+FG
Sbjct: 61  DSFFTTELLRVAALSPSRNLSYGCSLLFHCHFHSATMPWNFIIRGYSSSDSPQEAISLFG 120

Query: 121 EMRRRGIRPNNLTFPFLLKACATLATLQQ------------------------------- 180
           EMRRRG+RPNNLTFPFLLKACATLATLQ+                               
Sbjct: 121 EMRRRGVRPNNLTFPFLLKACATLATLQEGKQFHAIAIKCGLDLDVYVRNTLIYFYGSCK 180

Query: 181 -------------------------ACVENYCFDEAIDYFLKMGNHGFEPDETTMVVILS 240
                                    ACVEN+CFDEAIDYFLKMGNHGFEPDETTMVVILS
Sbjct: 181 RMSGARKVFDEMTERTLVSWNAVITACVENFCFDEAIDYFLKMGNHGFEPDETTMVVILS 240

Query: 241 ACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAFVDMYAKSGDVGCARLVFNCLKQRSVWT 300
           ACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAFVDMYAKSGDVGCAR VFNCLKQ+SVWT
Sbjct: 241 ACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAFVDMYAKSGDVGCARHVFNCLKQKSVWT 300

Query: 301 WSAMILGLAQHGFANEAIKLFTNMMSSSVVPNYVTFVGVLCACSHAGLVDKSYHYFHIME 360
           WSAMILGLAQHGFANEAI+LFTNMMSS +VPN+VTF+GVLCACSHAGLVDKSYHYF++ME
Sbjct: 301 WSAMILGLAQHGFANEAIELFTNMMSSPIVPNHVTFIGVLCACSHAGLVDKSYHYFNLME 360

Query: 361 RVYGIKPMMIHYGSLVDVLGRAGRVKEAYELIMSMPVEPDPIVWRTLLSACSARDVDGGA 420
           RVYGIKPMMIHYGS+VDVLGRAG+VKEAYELIMSMPVEPDPIVWRTLLSACS RDV+GGA
Sbjct: 361 RVYGIKPMMIHYGSMVDVLGRAGQVKEAYELIMSMPVEPDPIVWRTLLSACSGRDVNGGA 420

Query: 421 QIAEEARKRLLELEPKRGGNVVIVANKFAEVGMWKQAADCRRAMKDRGIKKMAGESCIEL 461
           ++AEEARKRLLELEPKRGGNVV+VANKFAE+GMWKQAAD RR MKDRGIKKMAGESCIEL
Sbjct: 421 EVAEEARKRLLELEPKRGGNVVMVANKFAELGMWKQAADYRRTMKDRGIKKMAGESCIEL 480

BLAST of HG10019709 vs. ExPASy TrEMBL
Match: A0A5D3CL98 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold106G001370 PE=4 SV=1)

HSP 1 Score: 815.5 bits (2105), Expect = 1.2e-232
Identity = 411/516 (79.65%), Postives = 436/516 (84.50%), Query Frame = 0

Query: 1   MVRLRISAVHQIFPPNAHNYNSNPNFLSTKHQFLSLLNLCSSTNHLSQIHAQILVSGLQN 60
           MVRL ISAVHQ FP NAH+Y S P FLSTKHQFLSLL  CSSTNHL +IHAQILVSG QN
Sbjct: 1   MVRLWISAVHQFFPINAHSYISKPKFLSTKHQFLSLLKHCSSTNHLFEIHAQILVSGRQN 60

Query: 61  DPFLTTELLRIAALSPSRNLSYGRSLLFHSHFHSAPLPWNLIIRGYASSDSPQEAISVFG 120
           D FLTTELLR+AALSPSRNLSYG SLLFH HFHSA LPWNLIIRGY+SSDSP+EAIS+FG
Sbjct: 61  DSFLTTELLRVAALSPSRNLSYGCSLLFHCHFHSATLPWNLIIRGYSSSDSPREAISLFG 120

Query: 121 EMRRRGIRPNNLTFPFLLKACATLATLQQ------------------------------- 180
           EMRRRG+ PNNLTFPFLLKACATLATLQ+                               
Sbjct: 121 EMRRRGVIPNNLTFPFLLKACATLATLQEGKQFHAIVIKCGLDLDVYVRNTLIHFYGSCK 180

Query: 181 -------------------------ACVENYCFDEAIDYFLKMGNHGFEPDETTMVVILS 240
                                    ACVEN+ FDEAIDYFLKMGNHGFEPDETTMVVILS
Sbjct: 181 RMSGARKVFDEMTERTLVSWNAIITACVENFFFDEAIDYFLKMGNHGFEPDETTMVVILS 240

Query: 241 ACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAFVDMYAKSGDVGCARLVFNCLKQRSVWT 300
           ACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAFVDMYAKSGDVGCAR VFNCLKQ+SVWT
Sbjct: 241 ACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAFVDMYAKSGDVGCARRVFNCLKQKSVWT 300

Query: 301 WSAMILGLAQHGFANEAIKLFTNMMSSSVVPNYVTFVGVLCACSHAGLVDKSYHYFHIME 360
           WSAMILGLAQHGFANEAI+LFTNM SS +VPNYVTFVGVLCACSHAGLVDKSYHYF++ME
Sbjct: 301 WSAMILGLAQHGFANEAIELFTNMKSSPIVPNYVTFVGVLCACSHAGLVDKSYHYFNVME 360

Query: 361 RVYGIKPMMIHYGSLVDVLGRAGRVKEAYELIMSMPVEPDPIVWRTLLSACSARDVDGGA 420
           RVYGIKPMMIHYG +VDVLGRAG+VKEAYELIMSMPVEPDP+VWRTLLSACS RDV+GGA
Sbjct: 361 RVYGIKPMMIHYGLMVDVLGRAGQVKEAYELIMSMPVEPDPVVWRTLLSACSGRDVNGGA 420

Query: 421 QIAEEARKRLLELEPKRGGNVVIVANKFAEVGMWKQAADCRRAMKDRGIKKMAGESCIEL 461
           ++AEEARKRLLELEPKRGGNVV+VANKFAEVGMWKQAAD RR MKDRGIKKMAGESCIEL
Sbjct: 421 EVAEEARKRLLELEPKRGGNVVMVANKFAEVGMWKQAADYRRTMKDRGIKKMAGESCIEL 480

BLAST of HG10019709 vs. ExPASy TrEMBL
Match: A0A1S3BI68 (pentatricopeptide repeat-containing protein At2g36730 OS=Cucumis melo OX=3656 GN=LOC103490331 PE=4 SV=1)

HSP 1 Score: 815.1 bits (2104), Expect = 1.5e-232
Identity = 410/516 (79.46%), Postives = 436/516 (84.50%), Query Frame = 0

Query: 1   MVRLRISAVHQIFPPNAHNYNSNPNFLSTKHQFLSLLNLCSSTNHLSQIHAQILVSGLQN 60
           MVRL ISAVHQ FP NAH+Y S P FLSTKHQFLSLL  CSSTNHL +IHAQILVSG QN
Sbjct: 1   MVRLWISAVHQFFPINAHSYISKPKFLSTKHQFLSLLKHCSSTNHLFEIHAQILVSGRQN 60

Query: 61  DPFLTTELLRIAALSPSRNLSYGRSLLFHSHFHSAPLPWNLIIRGYASSDSPQEAISVFG 120
           D FLTTELLR+AALSPSRNLSYG SLLFH HFHSA LPWNLIIRGY+SSDSP+EAIS+FG
Sbjct: 61  DSFLTTELLRVAALSPSRNLSYGCSLLFHCHFHSATLPWNLIIRGYSSSDSPREAISLFG 120

Query: 121 EMRRRGIRPNNLTFPFLLKACATLATLQQ------------------------------- 180
           EMRRRG+ PNNLTFPFLLKACATLATLQ+                               
Sbjct: 121 EMRRRGVIPNNLTFPFLLKACATLATLQEGKQFHAIVIKCGLDLDVYVRNTLIHFYGSCK 180

Query: 181 -------------------------ACVENYCFDEAIDYFLKMGNHGFEPDETTMVVILS 240
                                    ACVEN+ FDEAIDYFLKMGNHGFEPDETTMVVILS
Sbjct: 181 RMSGARKVFDEMTERTLVSWNAVITACVENFFFDEAIDYFLKMGNHGFEPDETTMVVILS 240

Query: 241 ACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAFVDMYAKSGDVGCARLVFNCLKQRSVWT 300
           ACAELGNLSLGRWVHSQVVGRGMVLN+QLGTAFVDMYAKSGDVGCAR VFNCLKQ+SVWT
Sbjct: 241 ACAELGNLSLGRWVHSQVVGRGMVLNIQLGTAFVDMYAKSGDVGCARRVFNCLKQKSVWT 300

Query: 301 WSAMILGLAQHGFANEAIKLFTNMMSSSVVPNYVTFVGVLCACSHAGLVDKSYHYFHIME 360
           WSAMILGLAQHGFANEAI+LFTNM SS +VPNYVTFVGVLCACSHAGLVDKSYHYF++ME
Sbjct: 301 WSAMILGLAQHGFANEAIELFTNMKSSPIVPNYVTFVGVLCACSHAGLVDKSYHYFNVME 360

Query: 361 RVYGIKPMMIHYGSLVDVLGRAGRVKEAYELIMSMPVEPDPIVWRTLLSACSARDVDGGA 420
           RVYGIKPMMIHYG +VDVLGRAG+VKEAYELIMSMPVEPDP+VWRTLLSACS RDV+GGA
Sbjct: 361 RVYGIKPMMIHYGLMVDVLGRAGQVKEAYELIMSMPVEPDPVVWRTLLSACSGRDVNGGA 420

Query: 421 QIAEEARKRLLELEPKRGGNVVIVANKFAEVGMWKQAADCRRAMKDRGIKKMAGESCIEL 461
           ++AEEARKRLLELEPKRGGNVV+VANKFAEVGMWKQAAD RR MKDRGIKKMAGESCIEL
Sbjct: 421 EVAEEARKRLLELEPKRGGNVVMVANKFAEVGMWKQAADYRRTMKDRGIKKMAGESCIEL 480

BLAST of HG10019709 vs. ExPASy TrEMBL
Match: A0A6J1KVA8 (pentatricopeptide repeat-containing protein At2g36730-like isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111499014 PE=4 SV=1)

HSP 1 Score: 800.4 bits (2066), Expect = 3.9e-228
Identity = 405/516 (78.49%), Postives = 428/516 (82.95%), Query Frame = 0

Query: 1   MVRLRISAVHQIFPPNAHNYNSNPNFLSTKHQFLSLLNLCSSTNHLSQIHAQILVSGLQN 60
           MVRLRISAVHQIFPPNAH  NSN NFLS KHQFLSL+ LCSS NHL QIH+QI+V GLQN
Sbjct: 1   MVRLRISAVHQIFPPNAH--NSNSNFLSRKHQFLSLIKLCSSPNHLFQIHSQIIVFGLQN 60

Query: 61  DPFLTTELLRIAALSPSRNLSYGRSLLFHSHFHSAPLPWNLIIRGYASSDSPQEAISVFG 120
           D FLTTELLR AALSPSRNLSY RSLLFH + H +PLPWN IIRGYASSDSP+EAI VF 
Sbjct: 61  DSFLTTELLRFAALSPSRNLSYARSLLFHYNLHFSPLPWNCIIRGYASSDSPREAIWVFE 120

Query: 121 EMRRRGIRPNNLTFPFLLKACATLATLQQ------------------------------- 180
           EMRRRGIRPN+LTFPFL+KACATL TLQ+                               
Sbjct: 121 EMRRRGIRPNHLTFPFLIKACATLTTLQEGKKFHADAIKCGLDLDVYVRNTLINFYGSCK 180

Query: 181 -------------------------ACVENYCFDEAIDYFLKMGNHGFEPDETTMVVILS 240
                                    ACVEN+CFDEAI+YFL+MGNHGFE DETTMVVILS
Sbjct: 181 RMSGARKVFDEMSVRTLVSWNAVITACVENFCFDEAIEYFLRMGNHGFESDETTMVVILS 240

Query: 241 ACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAFVDMYAKSGDVGCARLVFNCLKQRSVWT 300
           ACAELGNLSLGRWVHSQVV RGMVLNVQLGTA VDMYAKSGDVGCARLVFNCLKQRSVWT
Sbjct: 241 ACAELGNLSLGRWVHSQVVERGMVLNVQLGTALVDMYAKSGDVGCARLVFNCLKQRSVWT 300

Query: 301 WSAMILGLAQHGFANEAIKLFTNMMSSSVVPNYVTFVGVLCACSHAGLVDKSYHYFHIME 360
           WSAMILGLAQHGFANEAI+LFTNMMSSSV PNYVTF+GVLCACSHAGLVDK YHYF+IME
Sbjct: 301 WSAMILGLAQHGFANEAIELFTNMMSSSVTPNYVTFIGVLCACSHAGLVDKGYHYFNIME 360

Query: 361 RVYGIKPMMIHYGSLVDVLGRAGRVKEAYELIMSMPVEPDPIVWRTLLSACSARDVDGGA 420
           RVY IKPMMIHYGS+VDVL RAGRVKEAYE IM MPVEPDPIVWRTLLSACSARDVDGGA
Sbjct: 361 RVYRIKPMMIHYGSMVDVLCRAGRVKEAYEFIMRMPVEPDPIVWRTLLSACSARDVDGGA 420

Query: 421 QIAEEARKRLLELEPKRGGNVVIVANKFAEVGMWKQAADCRRAMKDRGIKKMAGESCIEL 461
           Q+ EEA+KRLLELEPKRGGNVV+VAN FAEVGMWKQAADCRRAMKD GIKKMAGESC+E+
Sbjct: 421 QVVEEAKKRLLELEPKRGGNVVMVANMFAEVGMWKQAADCRRAMKDGGIKKMAGESCVEV 480

BLAST of HG10019709 vs. ExPASy TrEMBL
Match: A0A6J1F5Z2 (pentatricopeptide repeat-containing protein At2g36730 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111441141 PE=4 SV=1)

HSP 1 Score: 800.0 bits (2065), Expect = 5.0e-228
Identity = 404/516 (78.29%), Postives = 427/516 (82.75%), Query Frame = 0

Query: 1   MVRLRISAVHQIFPPNAHNYNSNPNFLSTKHQFLSLLNLCSSTNHLSQIHAQILVSGLQN 60
           MVRLRI AVHQIFPPNAH  NSN NFLS KHQFLS++ LCSS NHL QIH+QI+VSGLQN
Sbjct: 1   MVRLRILAVHQIFPPNAH--NSNSNFLSRKHQFLSIIKLCSSPNHLFQIHSQIIVSGLQN 60

Query: 61  DPFLTTELLRIAALSPSRNLSYGRSLLFHSHFHSAPLPWNLIIRGYASSDSPQEAISVFG 120
           D FLTTELLR AALSPSRNLSY RSLLFH + H +PLPWN IIRGYASSDSP+EAI VF 
Sbjct: 61  DSFLTTELLRFAALSPSRNLSYARSLLFHYNLHFSPLPWNCIIRGYASSDSPREAIWVFE 120

Query: 121 EMRRRGIRPNNLTFPFLLKACATLATLQQ------------------------------- 180
           EMRRRGIRPNNLTFPFL+KACATL TLQ+                               
Sbjct: 121 EMRRRGIRPNNLTFPFLIKACATLTTLQEGKKFHADAIKCGLDLDVYVRNTLINFYGSCK 180

Query: 181 -------------------------ACVENYCFDEAIDYFLKMGNHGFEPDETTMVVILS 240
                                    ACVEN+CFD+AI+YFLKMGNHGFEPDETTMVVILS
Sbjct: 181 RMSGARKVFDEMSVRTLVSWNAVITACVENFCFDKAIEYFLKMGNHGFEPDETTMVVILS 240

Query: 241 ACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAFVDMYAKSGDVGCARLVFNCLKQRSVWT 300
           ACAELGNLSLGRWVHSQVV RGMVLNVQLGTA VDMYAKSGDVGCARLVFNCLKQRSVWT
Sbjct: 241 ACAELGNLSLGRWVHSQVVERGMVLNVQLGTALVDMYAKSGDVGCARLVFNCLKQRSVWT 300

Query: 301 WSAMILGLAQHGFANEAIKLFTNMMSSSVVPNYVTFVGVLCACSHAGLVDKSYHYFHIME 360
           WSAMILGLAQHGFA+EAI+LFTNMMSSSV PNYVTF+GVLCACSHAGLVDK YHYF+IME
Sbjct: 301 WSAMILGLAQHGFASEAIELFTNMMSSSVTPNYVTFIGVLCACSHAGLVDKGYHYFNIME 360

Query: 361 RVYGIKPMMIHYGSLVDVLGRAGRVKEAYELIMSMPVEPDPIVWRTLLSACSARDVDGGA 420
           RVYGIKPMMIHYGS+VDVL RAGRVKEAYE IM MPVEPDPIVWRTLLSACS RDVDGGA
Sbjct: 361 RVYGIKPMMIHYGSMVDVLCRAGRVKEAYEFIMRMPVEPDPIVWRTLLSACSGRDVDGGA 420

Query: 421 QIAEEARKRLLELEPKRGGNVVIVANKFAEVGMWKQAADCRRAMKDRGIKKMAGESCIEL 461
           Q+ EEARKRLLELEPKRGGNVV+VAN FAEVGMWKQAADCRRAMKD GIKKMAGESC+E+
Sbjct: 421 QVVEEARKRLLELEPKRGGNVVMVANMFAEVGMWKQAADCRRAMKDGGIKKMAGESCVEV 480

BLAST of HG10019709 vs. TAIR 10
Match: AT2G36730.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 482.6 bits (1241), Expect = 3.4e-136
Identity = 256/487 (52.57%), Postives = 325/487 (66.74%), Query Frame = 0

Query: 20  YNSNPNFLSTKHQFLSLLNLCSSTNHLSQIHAQILVSGLQNDPFLTTELLRIAALSPSRN 79
           ++S+  F S KHQ L  L LCSS  HL QIH QI +S LQND F+ +EL+R+++LS +++
Sbjct: 3   WSSDSCFKSRKHQCLIFLKLCSSIKHLLQIHGQIHLSSLQNDSFIISELVRVSSLSLAKD 62

Query: 80  LSYGRSLLFHSHFHSAPLPWNLIIRGYASSDSPQEAISVFGEMRRRGIRPNNLTFPFLLK 139
           L++ R+LL HS   S P  WN++ RGY+SSDSP E+I V+ EM+RRGI+PN LTFPFLLK
Sbjct: 63  LAFARTLLLHSS-DSTPSTWNMLSRGYSSSDSPVESIWVYSEMKRRGIKPNKLTFPFLLK 122

Query: 140 ACATLA------------------------------------------------------ 199
           ACA+                                                        
Sbjct: 123 ACASFLGLTAGRQIQVEVLKHGFDFDVYVGNNLIHLYGTCKKTSDARKVFDEMTERNVVS 182

Query: 200 --TLQQACVENYCFDEAIDYFLKMGNHGFEPDETTMVVILSACAELGNLSLGRWVHSQVV 259
             ++  A VEN   +   + F +M    F PDETTMVV+LSAC   GNLSLG+ VHSQV+
Sbjct: 183 WNSIMTALVENGKLNLVFECFCEMIGKRFCPDETTMVVLLSACG--GNLSLGKLVHSQVM 242

Query: 260 GRGMVLNVQLGTAFVDMYAKSGDVGCARLVFNCLKQRSVWTWSAMILGLAQHGFANEAIK 319
            R + LN +LGTA VDMYAKSG +  ARLVF  +  ++VWTWSAMI+GLAQ+GFA EA++
Sbjct: 243 VRELELNCRLGTALVDMYAKSGGLEYARLVFERMVDKNVWTWSAMIVGLAQYGFAEEALQ 302

Query: 320 LFTNMM-SSSVVPNYVTFVGVLCACSHAGLVDKSYHYFHIMERVYGIKPMMIHYGSLVDV 379
           LF+ MM  SSV PNYVTF+GVLCACSH GLVD  Y YFH ME+++ IKPMMIHYG++VD+
Sbjct: 303 LFSKMMKESSVRPNYVTFLGVLCACSHTGLVDDGYKYFHEMEKIHKIKPMMIHYGAMVDI 362

Query: 380 LGRAGRVKEAYELIMSMPVEPDPIVWRTLLSACSARDVDGGAQIAEEARKRLLELEPKRG 439
           LGRAGR+ EAY+ I  MP EPD +VWRTLLSACS    +    I E+ +KRL+ELEPKR 
Sbjct: 363 LGRAGRLNEAYDFIKKMPFEPDAVVWRTLLSACSIHHDEDDEGIGEKVKKRLIELEPKRS 422

Query: 440 GNVVIVANKFAEVGMWKQAADCRRAMKDRGIKKMAGESCIELGGSLRKFFSGFDARAATD 450
           GN+VIVAN+FAE  MW +AA+ RR MK+  +KK+AGESC+ELGGS  +FFSG+D R+   
Sbjct: 423 GNLVIVANRFAEARMWAEAAEVRRVMKETKMKKIAGESCLELGGSFHRFFSGYDPRSEYV 482

BLAST of HG10019709 vs. TAIR 10
Match: AT2G02980.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 262.3 bits (669), Expect = 7.3e-70
Identity = 158/497 (31.79%), Postives = 261/497 (52.52%), Query Frame = 0

Query: 21  NSNPNFLSTKHQFLSLLNLCSSTNHLSQIHAQILVSGLQNDPFLTTELLRIAALSPSR-N 80
           +S  + ++T++  L L++ C+S   L QI A  + S +++  F+  +L+     SP+  +
Sbjct: 21  HSKIDTVNTQNPIL-LISKCNSLRELMQIQAYAIKSHIEDVSFV-AKLINFCTESPTESS 80

Query: 81  LSYGRSLLFHSHFHSAPLPWNLIIRGYASSDSPQEAISVFGEMRRRGIRPNNLTFPFLLK 140
           +SY R  LF +      + +N + RGY+   +P E  S+F E+   GI P+N TFP LLK
Sbjct: 81  MSYARH-LFEAMSEPDIVIFNSMARGYSRFTNPLEVFSLFVEILEDGILPDNYTFPSLLK 140

Query: 141 ACATLATLQQA------------------------------------CVEN-------YC 200
           ACA    L++                                     CV +        C
Sbjct: 141 ACAVAKALEEGRQLHCLSMKLGLDDNVYVCPTLINMYTECEDVDSARCVFDRIVEPCVVC 200

Query: 201 F-------------DEAIDYFLKMGNHGFEPDETTMVVILSACAELGNLSLGRWVHSQVV 260
           +             +EA+  F +M     +P+E T++ +LS+CA LG+L LG+W+H    
Sbjct: 201 YNAMITGYARRNRPNEALSLFREMQGKYLKPNEITLLSVLSSCALLGSLDLGKWIHKYAK 260

Query: 261 GRGMVLNVQLGTAFVDMYAKSGDVGCARLVFNCLKQRSVWTWSAMILGLAQHGFANEAIK 320
                  V++ TA +DM+AK G +  A  +F  ++ +    WSAMI+  A HG A +++ 
Sbjct: 261 KHSFCKYVKVNTALIDMFAKCGSLDDAVSIFEKMRYKDTQAWSAMIVAYANHGKAEKSML 320

Query: 321 LFTNMMSSSVVPNYVTFVGVLCACSHAGLVDKSYHYFHIMERVYGIKPMMIHYGSLVDVL 380
           +F  M S +V P+ +TF+G+L ACSH G V++   YF  M   +GI P + HYGS+VD+L
Sbjct: 321 MFERMRSENVQPDEITFLGLLNACSHTGRVEEGRKYFSQMVSKFGIVPSIKHYGSMVDLL 380

Query: 381 GRAGRVKEAYELIMSMPVEPDPIVWRTLLSACSARDVDGGAQIAEEARKRLLELEPKRGG 440
            RAG +++AYE I  +P+ P P++WR LL+ACS+ +      +AE+  +R+ EL+   GG
Sbjct: 381 SRAGNLEDAYEFIDKLPISPTPMLWRILLAACSSHN---NLDLAEKVSERIFELDDSHGG 440

Query: 441 NVVIVANKFAEVGMWKQAADCRRAMKDRGIKKMAGESCIELGGSLRKFFSGFDARAATDG 461
           + VI++N +A    W+     R+ MKDR   K+ G S IE+   + +FFSG   ++AT  
Sbjct: 441 DYVILSNLYARNKKWEYVDSLRKVMKDRKAVKVPGCSSIEVNNVVHEFFSGDGVKSATTK 500

BLAST of HG10019709 vs. TAIR 10
Match: AT5G56310.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 257.7 bits (657), Expect = 1.8e-68
Identity = 150/413 (36.32%), Postives = 227/413 (54.96%), Query Frame = 0

Query: 48  QIHAQILVSGLQNDPFLTTELLRI----AALSPSRNLSYGRSLLFHSHFHSAPLPWNLII 107
           QIH Q++V G  +   + T L+++      L  +R        +F          WN ++
Sbjct: 137 QIHGQVVVFGFDSSVHVVTGLIQMYFSCGGLGDARK-------MFDEMLVKDVNVWNALL 196

Query: 108 RGYASSDSPQEAISVFGEMRRRGIRPNNLTFPFLLKACATLATLQQACVENYCFDEAIDY 167
            GY       EA S+  EM    +R N +++  ++   A                EAI+ 
Sbjct: 197 AGYGKVGEMDEARSLL-EMMPCWVR-NEVSWTCVISGYAKSGRA----------SEAIEV 256

Query: 168 FLKMGNHGFEPDETTMVVILSACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAFVDMYAK 227
           F +M     EPDE T++ +LSACA+LG+L LG  + S V  RGM   V L  A +DMYAK
Sbjct: 257 FQRMLMENVEPDEVTLLAVLSACADLGSLELGERICSYVDHRGMNRAVSLNNAVIDMYAK 316

Query: 228 SGDVGCARLVFNCLKQRSVWTWSAMILGLAQHGFANEAIKLFTNMMSSSVVPNYVTFVGV 287
           SG++  A  VF C+ +R+V TW+ +I GLA HG   EA+ +F  M+ + V PN VTF+ +
Sbjct: 317 SGNITKALDVFECVNERNVVTWTTIIAGLATHGHGAEALAMFNRMVKAGVRPNDVTFIAI 376

Query: 288 LCACSHAGLVDKSYHYFHIMERVYGIKPMMIHYGSLVDVLGRAGRVKEAYELIMSMPVEP 347
           L ACSH G VD     F+ M   YGI P + HYG ++D+LGRAG+++EA E+I SMP + 
Sbjct: 377 LSACSHVGWVDLGKRLFNSMRSKYGIHPNIEHYGCMIDLLGRAGKLREADEVIKSMPFKA 436

Query: 348 DPIVWRTLLSACSARDVDGGAQIAEEARKRLLELEPKRGGNVVIVANKFAEVGMWKQAAD 407
           +  +W +LL   +A +V    ++ E A   L++LEP   GN +++AN ++ +G W ++  
Sbjct: 437 NAAIWGSLL---AASNVHHDLELGERALSELIKLEPNNSGNYMLLANLYSNLGRWDESRM 496

Query: 408 CRRAMKDRGIKKMAGESCIELGGSLRKFFSGFDARAATDGIYDLLDGLNLHMQ 457
            R  MK  G+KKMAGES IE+   + KF SG       + I+++L  ++L +Q
Sbjct: 497 MRNMMKGIGVKKMAGESSIEVENRVYKFISGDLTHPQVERIHEILQEMDLQIQ 527

BLAST of HG10019709 vs. TAIR 10
Match: AT1G74630.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 257.3 bits (656), Expect = 2.3e-68
Identity = 159/524 (30.34%), Postives = 252/524 (48.09%), Query Frame = 0

Query: 31  HQFLSLLNLCSSTNHLSQIHAQILVSGLQNDPFLTTELLRIAALSPSRNLSYGRSLLFHS 90
           H  LSLLN C +   L+QIH   +  G+  D + T +L+   A+S S  L Y R LL   
Sbjct: 6   HHCLSLLNSCKNLRALTQIHGLFIKYGVDTDSYFTGKLILHCAISISDALPYARRLLLCF 65

Query: 91  HFHSAPLPWNLIIRGYASSDSPQEAISVFGEMRRRG-IRPNNLTFPFLLKACATLATLQQ 150
               A   +N ++RGY+ SD P  +++VF EM R+G + P++ +F F++KA     +L+ 
Sbjct: 66  PEPDA-FMFNTLVRGYSESDEPHNSVAVFVEMMRKGFVFPDSFSFAFVIKAVENFRSLRT 125

Query: 151 ------------------------------ACVE-------------------------- 210
                                          CVE                          
Sbjct: 126 GFQMHCQALKHGLESHLFVGTTLIGMYGGCGCVEFARKVFDEMHQPNLVAWNAVITACFR 185

Query: 211 ------------------------------------------------------------ 270
                                                                       
Sbjct: 186 GNDVAGAREIFDKMLVRNHTSWNVMLAGYIKAGELESAKRIFSEMPHRDDVSWSTMIVGI 245

Query: 271 --NYCFDEAIDYFLKMGNHGFEPDETTMVVILSACAELGNLSLGRWVHSQVVGRGMVLNV 330
             N  F+E+  YF ++   G  P+E ++  +LSAC++ G+   G+ +H  V   G    V
Sbjct: 246 AHNGSFNESFLYFRELQRAGMSPNEVSLTGVLSACSQSGSFEFGKILHGFVEKAGYSWIV 305

Query: 331 QLGTAFVDMYAKSGDVGCARLVFNCLKQ-RSVWTWSAMILGLAQHGFANEAIKLFTNMMS 390
            +  A +DMY++ G+V  ARLVF  +++ R + +W++MI GLA HG   EA++LF  M +
Sbjct: 306 SVNNALIDMYSRCGNVPMARLVFEGMQEKRCIVSWTSMIAGLAMHGQGEEAVRLFNEMTA 365

Query: 391 SSVVPNYVTFVGVLCACSHAGLVDKSYHYFHIMERVYGIKPMMIHYGSLVDVLGRAGRVK 435
             V P+ ++F+ +L ACSHAGL+++   YF  M+RVY I+P + HYG +VD+ GR+G+++
Sbjct: 366 YGVTPDGISFISLLHACSHAGLIEEGEDYFSEMKRVYHIEPEIEHYGCMVDLYGRSGKLQ 425

BLAST of HG10019709 vs. TAIR 10
Match: AT4G21065.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 253.8 bits (647), Expect = 2.6e-67
Identity = 152/465 (32.69%), Postives = 243/465 (52.26%), Query Frame = 0

Query: 41  SSTNHLSQIHAQILVSGLQ-NDPFLTTELL-RIAALSPSRNLSYGRSLLFHSHFHSAPLP 100
           SS   L QIHA  +  G+  +D  L   L+  + +L     +SY   +            
Sbjct: 28  SSITKLRQIHAFSIRHGVSISDAELGKHLIFYLVSLPSPPPMSYAHKVFSKIEKPINVFI 87

Query: 101 WNLIIRGYASSDSPQEAISVFGEMRRRG-IRPNNLTFPFLLKACATLATLQ--------- 160
           WN +IRGYA   +   A S++ EMR  G + P+  T+PFL+KA  T+A ++         
Sbjct: 88  WNTLIRGYAEIGNSISAFSLYREMRVSGLVEPDTHTYPFLIKAVTTMADVRLGETIHSVV 147

Query: 161 -----------------------------------------------QACVENYCFDEAI 220
                                                              EN   +EA+
Sbjct: 148 IRSGFGSLIYVQNSLLHLYANCGDVASAYKVFDKMPEKDLVAWNSVINGFAENGKPEEAL 207

Query: 221 DYFLKMGNHGFEPDETTMVVILSACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAFVDMY 280
             + +M + G +PD  T+V +LSACA++G L+LG+ VH  ++  G+  N+      +D+Y
Sbjct: 208 ALYTEMNSKGIKPDGFTIVSLLSACAKIGALTLGKRVHVYMIKVGLTRNLHSSNVLLDLY 267

Query: 281 AKSGDVGCARLVFNCLKQRSVWTWSAMILGLAQHGFANEAIKLFTNMMSS-SVVPNYVTF 340
           A+ G V  A+ +F+ +  ++  +W+++I+GLA +GF  EAI+LF  M S+  ++P  +TF
Sbjct: 268 ARCGRVEEAKTLFDEMVDKNSVSWTSLIVGLAVNGFGKEAIELFKYMESTEGLLPCEITF 327

Query: 341 VGVLCACSHAGLVDKSYHYFHIMERVYGIKPMMIHYGSLVDVLGRAGRVKEAYELIMSMP 400
           VG+L ACSH G+V + + YF  M   Y I+P + H+G +VD+L RAG+VK+AYE I SMP
Sbjct: 328 VGILYACSHCGMVKEGFEYFRRMREEYKIEPRIEHFGCMVDLLARAGQVKKAYEYIKSMP 387

Query: 401 VEPDPIVWRTLLSACSARDVDGGAQIAEEARKRLLELEPKRGGNVVIVANKFAEVGMWKQ 446
           ++P+ ++WRTLL AC+   V G + +AE AR ++L+LEP   G+ V+++N +A    W  
Sbjct: 388 MQPNVVIWRTLLGACT---VHGDSDLAEFARIQILQLEPNHSGDYVLLSNMYASEQRWSD 447

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038903331.12.7e-24483.11pentatricopeptide repeat-containing protein At2g36730 isoform X4 [Benincasa hisp... [more]
XP_038903330.17.9e-24483.43pentatricopeptide repeat-containing protein At2g36730 isoform X3 [Benincasa hisp... [more]
XP_038903328.17.9e-24483.43pentatricopeptide repeat-containing protein At2g36730 isoform X1 [Benincasa hisp... [more]
XP_038903329.17.9e-24483.43pentatricopeptide repeat-containing protein At2g36730 isoform X2 [Benincasa hisp... [more]
XP_038903332.17.9e-24483.43pentatricopeptide repeat-containing protein At2g36730 isoform X5 [Benincasa hisp... [more]
Match NameE-valueIdentityDescription
Q9ZQA14.8e-13552.57Pentatricopeptide repeat-containing protein At2g36730 OS=Arabidopsis thaliana OX... [more]
Q8LK931.0e-6831.79Pentatricopeptide repeat-containing protein At2g02980, chloroplastic OS=Arabidop... [more]
Q9FMA12.5e-6736.32Pentatricopeptide repeat-containing protein At5g56310 OS=Arabidopsis thaliana OX... [more]
Q9CA543.3e-6730.34Pentatricopeptide repeat-containing protein At1g74630 OS=Arabidopsis thaliana OX... [more]
A8MQA33.6e-6632.69Pentatricopeptide repeat-containing protein At4g21065 OS=Arabidopsis thaliana OX... [more]
Match NameE-valueIdentityDescription
A0A0A0K1535.0e-23680.04Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G007910 PE=4 SV=1[more]
A0A5D3CL981.2e-23279.65Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946... [more]
A0A1S3BI681.5e-23279.46pentatricopeptide repeat-containing protein At2g36730 OS=Cucumis melo OX=3656 GN... [more]
A0A6J1KVA83.9e-22878.49pentatricopeptide repeat-containing protein At2g36730-like isoform X1 OS=Cucurbi... [more]
A0A6J1F5Z25.0e-22878.29pentatricopeptide repeat-containing protein At2g36730 isoform X1 OS=Cucurbita mo... [more]
Match NameE-valueIdentityDescription
AT2G36730.13.4e-13652.57Pentatricopeptide repeat (PPR) superfamily protein [more]
AT2G02980.17.3e-7031.79Pentatricopeptide repeat (PPR) superfamily protein [more]
AT5G56310.11.8e-6836.32Pentatricopeptide repeat (PPR) superfamily protein [more]
AT1G74630.12.3e-6830.34Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT4G21065.12.6e-6732.69Tetratricopeptide repeat (TPR)-like superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 13..145
e-value: 1.7E-10
score: 42.5
coord: 146..192
e-value: 2.5E-6
score: 28.9
coord: 193..290
e-value: 2.1E-18
score: 68.3
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 291..420
e-value: 4.1E-13
score: 51.5
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 99..130
e-value: 6.6E-8
score: 30.3
coord: 244..276
e-value: 1.9E-5
score: 22.5
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 315..339
e-value: 0.0042
score: 17.2
coord: 244..271
e-value: 9.8E-5
score: 22.4
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 99..142
e-value: 2.7E-10
score: 40.3
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 241..275
score: 10.720209
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 95..129
score: 11.586152
NoneNo IPR availablePANTHERPTHR47926PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEINcoord: 149..459
NoneNo IPR availablePANTHERPTHR47926:SF217OS09G0412900 PROTEINcoord: 25..148
NoneNo IPR availablePANTHERPTHR47926:SF217OS09G0412900 PROTEINcoord: 149..459
NoneNo IPR availablePANTHERPTHR47926PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEINcoord: 25..148

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
HG10019709.1HG10019709.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0005515 protein binding