HG10021066 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
NameHG10021066
Typegene
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionPentatricopeptide repeat-containing protein
LocationChr05: 5050011 .. 5051972 (-)
RNA-Seq ExpressionHG10021066
SyntenyHG10021066
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGATTTTTAGCTTCATTTTTCCGAGGAGAAGTTGTGTTTTTAGCCGAGGTTTCCATACGGGTAAGAAGCTTTTAAGTCCAAGCACTGAGGATATCATTTGTAAAGCAATTTGTGTTAATTTAAAACAGAGAAGATGGAAGTTCTTAGAGCAAGTATCACCAAGTCTCACGAATTCATTGGTCTGTCGTGTTGTTCGTGAGTTTCGCAACTCTCCACAGTTAGCTTTAGAATTCTACAACTGGGTTGAGGCAAGAGACAACTTTTCGCATTCATTAGAATCATGTTGTACTTTAGTTCACGTGTTAGTCAATGCCAGAAATTTTAATGATGCCTTGTCTATCATGGAGAGTTTAATGCTTAAAAATGGTAAGTCGCCATTGGAGGTTCTGGGAGGGTTGATTGATAGCTACGACATTTGTAATTCAAATCCTGCAGTCTTTGATGCTCTGGTGAGGACTTGCACTCAGTTAGGGAGTCTGGAAGGTGCATATGACGTAATAAAAAAGTCGAGACTAGTAGGTTTTTGGGTTACGATTCATGCTTGGAATAACTTCTTAAATCAACTTTTAAAGTTGGATGAGACTGATAAGTTTTGGAACATGTATAAGGAAATGGTTGCCAGTGGTTACAGTGAAAATGTGAATACTTTTAACTTGATTATTTATGCTTTATGCAAGGACTGTAAATTACTAGAAGCAATTTATGTAGTTTATTTGATGCTGAAGATTGAAATTTGGCCTAATGTCGTCACTTTTAATATGATCATTGATAAAGCAAGTAAAATGGGTGACATGGGTCTTGCTCTAAAACTGGCAAGGAACACGGGGGTAATTTCAGGAGGCAGTGTATCGCCTAATATAGTTACATATAATTGTATCATTAATGGGTTCTGCAAGATAAGGAGGTTAGAGTCTGCAAAAAATGTTCTTGGTGAAATGATTAAGCTGGGAATAGATTTCAATGTGAGAACATATGCTACTTTGATTGACGGCTATGCTAGAAAAGGGAGTTTGGATGTGGCATTCAGGTTGTGTGATGAAATGGTTGAAATGGGCTTGATTCCAGACACTGTTTTATATAACTCCATCATCTACTGGCTTTACATGGAAGGAGAATTAGAAGAAGCTTCTTTTTTATTATCTGACATGATTAATAGGCATATCCTCCCGGATGAGGTTACCTTCTCAATCCTTACAAAGGGTCTCTGTGTAAATGGACATCTCAATAAAGCTTTAAGAGTTCATGACCACATTCTCAAAAGAAACCTTGTAAAAGATGCTTTTACTCATAATATTCTCATCAACTATATATTCCAGACCCAAAACGCAGCAGGTGCCAAGCAATTACTGAGCAGTATGATTGTTCGTGGTATCAAACCTGACATGGTTACTTATGGCACTCTGGTTGATGGGTGTTGTAAGGAAGGAAAAATCGAAGCTGCAGTTCAGATTTATGACAAAGCAGTAAAGGCAGATGGGAAATCCAACTTGGTGGTATATAATTCTATTTTAAATGGTTTGTGTAAGCAAGGTTCAATTGATGCTGCTAAACTCTTGGTAGACAAATTACAGCAAAATGGTTTCCTCGATACAGTTACCTATAACACGTTGCTACATGGGTACTGCGTCAATGGGAAGGTCGAGAAGGCTTTTGCACTGTTTCTAGAGATGATTAATGTGGGGAGTTTGGTGAACATAGTTACTTACAATATAATGATTAACTTTCTGTGCAAGATGGGATTGATTTTTCAAACCATGGAACTGATGAGAGCAATGACTAGTCAGGGGATAGTTCCCGACCATATAACATACACGACACTCATCACCAATTTTGTTAAGTGTTGTGGCTCCGAGGATGTAATTGAGTTACATGATTATATGGTACTTAAAGGAGCAGTTCCTGATAGGCAAACATACCAGTCTCTTGTTAGCCCCTCCCTTCAAGAAAACGTTGAGGGGTAG

mRNA sequence

ATGATTTTTAGCTTCATTTTTCCGAGGAGAAGTTGTGTTTTTAGCCGAGGTTTCCATACGGGTAAGAAGCTTTTAAGTCCAAGCACTGAGGATATCATTTGTAAAGCAATTTGTGTTAATTTAAAACAGAGAAGATGGAAGTTCTTAGAGCAAGTATCACCAAGTCTCACGAATTCATTGGTCTGTCGTGTTGTTCGTGAGTTTCGCAACTCTCCACAGTTAGCTTTAGAATTCTACAACTGGGTTGAGGCAAGAGACAACTTTTCGCATTCATTAGAATCATGTTGTACTTTAGTTCACGTGTTAGTCAATGCCAGAAATTTTAATGATGCCTTGTCTATCATGGAGAGTTTAATGCTTAAAAATGGTAAGTCGCCATTGGAGGTTCTGGGAGGGTTGATTGATAGCTACGACATTTGTAATTCAAATCCTGCAGTCTTTGATGCTCTGGTGAGGACTTGCACTCAGTTAGGGAGTCTGGAAGGTGCATATGACGTAATAAAAAAGTCGAGACTAGTAGGTTTTTGGGTTACGATTCATGCTTGGAATAACTTCTTAAATCAACTTTTAAAGTTGGATGAGACTGATAAGTTTTGGAACATGTATAAGGAAATGGTTGCCAGTGGTTACAGTGAAAATGTGAATACTTTTAACTTGATTATTTATGCTTTATGCAAGGACTGTAAATTACTAGAAGCAATTTATGTAGTTTATTTGATGCTGAAGATTGAAATTTGGCCTAATGTCGTCACTTTTAATATGATCATTGATAAAGCAAGTAAAATGGGTGACATGGGTCTTGCTCTAAAACTGGCAAGGAACACGGGGGTAATTTCAGGAGGCAGTGTATCGCCTAATATAGTTACATATAATTGTATCATTAATGGGTTCTGCAAGATAAGGAGGTTAGAGTCTGCAAAAAATGTTCTTGGTGAAATGATTAAGCTGGGAATAGATTTCAATGTGAGAACATATGCTACTTTGATTGACGGCTATGCTAGAAAAGGGAGTTTGGATGTGGCATTCAGGTTGTGTGATGAAATGGTTGAAATGGGCTTGATTCCAGACACTGTTTTATATAACTCCATCATCTACTGGCTTTACATGGAAGGAGAATTAGAAGAAGCTTCTTTTTTATTATCTGACATGATTAATAGGCATATCCTCCCGGATGAGGTTACCTTCTCAATCCTTACAAAGGGTCTCTGTGTAAATGGACATCTCAATAAAGCTTTAAGAGTTCATGACCACATTCTCAAAAGAAACCTTGTAAAAGATGCTTTTACTCATAATATTCTCATCAACTATATATTCCAGACCCAAAACGCAGCAGGTGCCAAGCAATTACTGAGCAGTATGATTGTTCGTGGTATCAAACCTGACATGGTTACTTATGGCACTCTGGTTGATGGGTGTTGTAAGGAAGGAAAAATCGAAGCTGCAGTTCAGATTTATGACAAAGCAGTAAAGGCAGATGGGAAATCCAACTTGGTGGTATATAATTCTATTTTAAATGGTTTGTGTAAGCAAGGTTCAATTGATGCTGCTAAACTCTTGGTAGACAAATTACAGCAAAATGGTTTCCTCGATACAGTTACCTATAACACGTTGCTACATGGGTACTGCGTCAATGGGAAGGTCGAGAAGGCTTTTGCACTGTTTCTAGAGATGATTAATGTGGGGAGTTTGGTGAACATAGTTACTTACAATATAATGATTAACTTTCTGTGCAAGATGGGATTGATTTTTCAAACCATGGAACTGATGAGAGCAATGACTAGTCAGGGGATAGTTCCCGACCATATAACATACACGACACTCATCACCAATTTTGTTAAGTGTTGTGGCTCCGAGGATGTAATTGAGTTACATGATTATATGGTACTTAAAGGAGCAGTTCCTGATAGGCAAACATACCAGTCTCTTGTTAGCCCCTCCCTTCAAGAAAACGTTGAGGGGTAG

Coding sequence (CDS)

ATGATTTTTAGCTTCATTTTTCCGAGGAGAAGTTGTGTTTTTAGCCGAGGTTTCCATACGGGTAAGAAGCTTTTAAGTCCAAGCACTGAGGATATCATTTGTAAAGCAATTTGTGTTAATTTAAAACAGAGAAGATGGAAGTTCTTAGAGCAAGTATCACCAAGTCTCACGAATTCATTGGTCTGTCGTGTTGTTCGTGAGTTTCGCAACTCTCCACAGTTAGCTTTAGAATTCTACAACTGGGTTGAGGCAAGAGACAACTTTTCGCATTCATTAGAATCATGTTGTACTTTAGTTCACGTGTTAGTCAATGCCAGAAATTTTAATGATGCCTTGTCTATCATGGAGAGTTTAATGCTTAAAAATGGTAAGTCGCCATTGGAGGTTCTGGGAGGGTTGATTGATAGCTACGACATTTGTAATTCAAATCCTGCAGTCTTTGATGCTCTGGTGAGGACTTGCACTCAGTTAGGGAGTCTGGAAGGTGCATATGACGTAATAAAAAAGTCGAGACTAGTAGGTTTTTGGGTTACGATTCATGCTTGGAATAACTTCTTAAATCAACTTTTAAAGTTGGATGAGACTGATAAGTTTTGGAACATGTATAAGGAAATGGTTGCCAGTGGTTACAGTGAAAATGTGAATACTTTTAACTTGATTATTTATGCTTTATGCAAGGACTGTAAATTACTAGAAGCAATTTATGTAGTTTATTTGATGCTGAAGATTGAAATTTGGCCTAATGTCGTCACTTTTAATATGATCATTGATAAAGCAAGTAAAATGGGTGACATGGGTCTTGCTCTAAAACTGGCAAGGAACACGGGGGTAATTTCAGGAGGCAGTGTATCGCCTAATATAGTTACATATAATTGTATCATTAATGGGTTCTGCAAGATAAGGAGGTTAGAGTCTGCAAAAAATGTTCTTGGTGAAATGATTAAGCTGGGAATAGATTTCAATGTGAGAACATATGCTACTTTGATTGACGGCTATGCTAGAAAAGGGAGTTTGGATGTGGCATTCAGGTTGTGTGATGAAATGGTTGAAATGGGCTTGATTCCAGACACTGTTTTATATAACTCCATCATCTACTGGCTTTACATGGAAGGAGAATTAGAAGAAGCTTCTTTTTTATTATCTGACATGATTAATAGGCATATCCTCCCGGATGAGGTTACCTTCTCAATCCTTACAAAGGGTCTCTGTGTAAATGGACATCTCAATAAAGCTTTAAGAGTTCATGACCACATTCTCAAAAGAAACCTTGTAAAAGATGCTTTTACTCATAATATTCTCATCAACTATATATTCCAGACCCAAAACGCAGCAGGTGCCAAGCAATTACTGAGCAGTATGATTGTTCGTGGTATCAAACCTGACATGGTTACTTATGGCACTCTGGTTGATGGGTGTTGTAAGGAAGGAAAAATCGAAGCTGCAGTTCAGATTTATGACAAAGCAGTAAAGGCAGATGGGAAATCCAACTTGGTGGTATATAATTCTATTTTAAATGGTTTGTGTAAGCAAGGTTCAATTGATGCTGCTAAACTCTTGGTAGACAAATTACAGCAAAATGGTTTCCTCGATACAGTTACCTATAACACGTTGCTACATGGGTACTGCGTCAATGGGAAGGTCGAGAAGGCTTTTGCACTGTTTCTAGAGATGATTAATGTGGGGAGTTTGGTGAACATAGTTACTTACAATATAATGATTAACTTTCTGTGCAAGATGGGATTGATTTTTCAAACCATGGAACTGATGAGAGCAATGACTAGTCAGGGGATAGTTCCCGACCATATAACATACACGACACTCATCACCAATTTTGTTAAGTGTTGTGGCTCCGAGGATGTAATTGAGTTACATGATTATATGGTACTTAAAGGAGCAGTTCCTGATAGGCAAACATACCAGTCTCTTGTTAGCCCCTCCCTTCAAGAAAACGTTGAGGGGTAG

Protein sequence

MIFSFIFPRRSCVFSRGFHTGKKLLSPSTEDIICKAICVNLKQRRWKFLEQVSPSLTNSLVCRVVREFRNSPQLALEFYNWVEARDNFSHSLESCCTLVHVLVNARNFNDALSIMESLMLKNGKSPLEVLGGLIDSYDICNSNPAVFDALVRTCTQLGSLEGAYDVIKKSRLVGFWVTIHAWNNFLNQLLKLDETDKFWNMYKEMVASGYSENVNTFNLIIYALCKDCKLLEAIYVVYLMLKIEIWPNVVTFNMIIDKASKMGDMGLALKLARNTGVISGGSVSPNIVTYNCIINGFCKIRRLESAKNVLGEMIKLGIDFNVRTYATLIDGYARKGSLDVAFRLCDEMVEMGLIPDTVLYNSIIYWLYMEGELEEASFLLSDMINRHILPDEVTFSILTKGLCVNGHLNKALRVHDHILKRNLVKDAFTHNILINYIFQTQNAAGAKQLLSSMIVRGIKPDMVTYGTLVDGCCKEGKIEAAVQIYDKAVKADGKSNLVVYNSILNGLCKQGSIDAAKLLVDKLQQNGFLDTVTYNTLLHGYCVNGKVEKAFALFLEMINVGSLVNIVTYNIMINFLCKMGLIFQTMELMRAMTSQGIVPDHITYTTLITNFVKCCGSEDVIELHDYMVLKGAVPDRQTYQSLVSPSLQENVEG
Homology
BLAST of HG10021066 vs. NCBI nr
Match: XP_038894115.1 (pentatricopeptide repeat-containing protein At1g11710, mitochondrial [Benincasa hispida])

HSP 1 Score: 1215.7 bits (3144), Expect = 0.0e+00
Identity = 603/653 (92.34%), Postives = 625/653 (95.71%), Query Frame = 0

Query: 1   MIFSFIFPRRSCVFSRGFHTGKKLLSPSTEDIICKAICVNLKQRRWKFLEQVSPSLTNSL 60
           M+FSFIF R SCVF RGFH GKKLLS STEDIICKAICVNLKQRRWKFLEQVSPSLT+SL
Sbjct: 1   MVFSFIFLRGSCVFRRGFHKGKKLLSQSTEDIICKAICVNLKQRRWKFLEQVSPSLTSSL 60

Query: 61  VCRVVREFRNSPQLALEFYNWVEARDNFSHSLESCCTLVHVLVNARNFNDALSIMESLML 120
           +CRVVREFR+SPQLALEFYNWVEARDNFSHSLESCCTLVHVLVN+RNFNDALS+MESLML
Sbjct: 61  ICRVVREFRSSPQLALEFYNWVEARDNFSHSLESCCTLVHVLVNSRNFNDALSVMESLML 120

Query: 121 KNGKSPLEVLGGLIDSYDICNSNPAVFDALVRTCTQLGSLEGAYDVIKKSRLVGFWVTIH 180
           KNGK PLEVLGGLID Y ICNSNPAVFDALVRTCTQLG LEGAYDVIKK R  GFWV+IH
Sbjct: 121 KNGKPPLEVLGGLIDCYKICNSNPAVFDALVRTCTQLGRLEGAYDVIKKLRNEGFWVSIH 180

Query: 181 AWNNFLNQLLKLDETDKFWNMYKEMVASGYSENVNTFNLIIYALCKDCKLLEAIYVVYLM 240
           AWNNFLNQLLKLDETDKFWNMYKEMVASGYSENVNTFNLIIYALCK+CKLLEAI+VVYLM
Sbjct: 181 AWNNFLNQLLKLDETDKFWNMYKEMVASGYSENVNTFNLIIYALCKECKLLEAIFVVYLM 240

Query: 241 LKIEIWPNVVTFNMIIDKASKMGDMGLALKLARNTGVISGGSVSPNIVTYNCIINGFCKI 300
           L+IEIWPNVVTFNMIIDKASKMGDMGLALKLARN GVISG SVSPNIVTYNCIINGFCKI
Sbjct: 241 LQIEIWPNVVTFNMIIDKASKMGDMGLALKLARNIGVISGRSVSPNIVTYNCIINGFCKI 300

Query: 301 RRLESAKNVLGEMIKLGIDFNVRTYATLIDGYARKGSLDVAFRLCDEMVEMGLIPDTVLY 360
           RRLESAKNVLGEMIKLGIDFNVRTYA LIDGYARKGSLDVAFRLCDEMV+MGLIPDTV+Y
Sbjct: 301 RRLESAKNVLGEMIKLGIDFNVRTYAILIDGYARKGSLDVAFRLCDEMVQMGLIPDTVVY 360

Query: 361 NSIIYWLYMEGELEEASFLLSDMINRHILPDEVTFSILTKGLCVNGHLNKALRVHDHILK 420
           NSIIYWLYMEGELEEASFLLSDMINRH+LPDEVTFSILTKGLC+NGHLNKALRVHDHIL 
Sbjct: 361 NSIIYWLYMEGELEEASFLLSDMINRHVLPDEVTFSILTKGLCLNGHLNKALRVHDHILG 420

Query: 421 RNLVKDAFTHNILINYIFQTQNAAGAKQLLSSMIVRGIKPDMVTYGTLVDGCCKEGKIEA 480
           RNLVKDAFTHNILINY+ Q+Q  AGAKQLLSSMIVRGIKPDMVTYGTLVDG CKEGKIEA
Sbjct: 421 RNLVKDAFTHNILINYMLQSQKVAGAKQLLSSMIVRGIKPDMVTYGTLVDGHCKEGKIEA 480

Query: 481 AVQIYDKAVKADGKSNLVVYNSILNGLCKQGSIDAAKLLVDKLQQNGFLDTVTYNTLLHG 540
           AV+IYDK +KADGK +LV+YNSIL+GLCKQGSIDAAKLLVDKLQQNGFLDTVTYNTLLHG
Sbjct: 481 AVKIYDKTIKADGKCDLVIYNSILDGLCKQGSIDAAKLLVDKLQQNGFLDTVTYNTLLHG 540

Query: 541 YCVNGKVEKAFALFLEMINVGSLVNIVTYNIMINFLCKMGLIFQTMELMRAMTSQGIVPD 600
           YC+NGKVEKAFALFLEMINVGSLVNIVTYNIMINFLCKMGLI Q MELMRAM SQGIVPD
Sbjct: 541 YCINGKVEKAFALFLEMINVGSLVNIVTYNIMINFLCKMGLIHQAMELMRAMISQGIVPD 600

Query: 601 HITYTTLITNFVKCCGSEDVIELHDYMVLKGAVPDRQTYQSLVSPSLQENVEG 654
            ITYTTLITNFVK CGSEDVIELHDYMVLKGAVPDRQTYQSLVSP LQENVEG
Sbjct: 601 LITYTTLITNFVKSCGSEDVIELHDYMVLKGAVPDRQTYQSLVSPCLQENVEG 653

BLAST of HG10021066 vs. NCBI nr
Match: XP_022923680.1 (pentatricopeptide repeat-containing protein At1g11710, mitochondrial [Cucurbita moschata] >KAG6584089.1 Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1162.9 bits (3007), Expect = 0.0e+00
Identity = 573/649 (88.29%), Postives = 612/649 (94.30%), Query Frame = 0

Query: 1   MIFSFIFPRRSCVFSRGFHTGKKLLSPSTEDIICKAICVNLKQRRWKFLEQVSPSLTNSL 60
           MIFSFIF RRSCVFSRGFHTGKKL SPSTEDIICKAICVNLK RRWKFLEQVSPSLTNSL
Sbjct: 1   MIFSFIFLRRSCVFSRGFHTGKKLFSPSTEDIICKAICVNLKHRRWKFLEQVSPSLTNSL 60

Query: 61  VCRVVREFRNSPQLALEFYNWVEARDNFSHSLESCCTLVHVLVNARNFNDALSIMESLML 120
           VCRVVREFR SPQLALEFYNWVEARDN SHSLESCCTLVHVLVN+RNF+DALSIM+SLML
Sbjct: 61  VCRVVREFRTSPQLALEFYNWVEARDNISHSLESCCTLVHVLVNSRNFDDALSIMKSLML 120

Query: 121 KNGKSPLEVLGGLIDSYDICNSNPAVFDALVRTCTQLGSLEGAYDVIKKSRLVGFWVTIH 180
           K+GKSPLEVLGGLIDSY+IC+SN AVFDALVRTCTQLG+ EGAY+VIKK R  GFWVTIH
Sbjct: 121 KDGKSPLEVLGGLIDSYEICSSNSAVFDALVRTCTQLGTAEGAYNVIKKLRDEGFWVTIH 180

Query: 181 AWNNFLNQLLKLDETDKFWNMYKEMVASGYSENVNTFNLIIYALCKDCKLLEAIYVVYLM 240
           AWNNFLNQLLKLDE DKFWNMYKEMVASGY ENVNTFNLIIYA CK+CKLLEAI VVYLM
Sbjct: 181 AWNNFLNQLLKLDEIDKFWNMYKEMVASGYIENVNTFNLIIYAFCKECKLLEAISVVYLM 240

Query: 241 LKIEIWPNVVTFNMIIDKASKMGDMGLALKLARNTGVISGGSVSPNIVTYNCIINGFCKI 300
           LKIEIWPNVV FNMIIDKASKMG M LALKLARN GVISGGSV  NIV+YNC+INGFCKI
Sbjct: 241 LKIEIWPNVVAFNMIIDKASKMGHMDLALKLARNMGVISGGSVLLNIVSYNCMINGFCKI 300

Query: 301 RRLESAKNVLGEMIKLGIDFNVRTYATLIDGYARKGSLDVAFRLCDEMVEMGLIPDTVLY 360
            RLESA+NVL EMIKLG+DFNV TYATLIDGYARKGSLDVAFRLC+EMV+MGLIPDTV+Y
Sbjct: 301 GRLESAENVLSEMIKLGVDFNVTTYATLIDGYARKGSLDVAFRLCEEMVKMGLIPDTVVY 360

Query: 361 NSIIYWLYMEGELEEASFLLSDMINRHILPDEVTFSILTKGLCVNGHLNKALRVHDHILK 420
           NSI+YWLY EGELEEASFL+SDMINRHI PDEVTFSILTKGLC+NGHL+KALR+HDHIL+
Sbjct: 361 NSIVYWLYKEGELEEASFLISDMINRHIFPDEVTFSILTKGLCINGHLDKALRIHDHILE 420

Query: 421 RNLVKDAFTHNILINYIFQTQNAAGAKQLLSSMIVRGIKPDMVTYGTLVDGCCKEGKIEA 480
           RNLVKDAFTHNILINY+FQ+Q  AGAKQLLSSMIVRGI+PD+VTY TL+DGCCKEGK+EA
Sbjct: 421 RNLVKDAFTHNILINYMFQSQKTAGAKQLLSSMIVRGIEPDIVTYATLIDGCCKEGKMEA 480

Query: 481 AVQIYDKAVKADGKSNLVVYNSILNGLCKQGSIDAAKLLVDKLQQNGFLDTVTYNTLLHG 540
           A+QIYDK VKA GKS+LVVYNSIL+GLCKQGSIDAAKLLVDKLQ+NGFLD+VTYNTLLHG
Sbjct: 481 AIQIYDKTVKASGKSDLVVYNSILDGLCKQGSIDAAKLLVDKLQKNGFLDSVTYNTLLHG 540

Query: 541 YCVNGKVEKAFALFLEMINVGSLVNIVTYNIMINFLCKMGLIFQTMELMRAMTSQGIVPD 600
           YCVNG++EKAFALFLEMINVGSL NIVT+NIMINFLC+MGLI Q  ELMRAM++QGIVPD
Sbjct: 541 YCVNGRIEKAFALFLEMINVGSLANIVTFNIMINFLCEMGLIHQAKELMRAMSTQGIVPD 600

Query: 601 HITYTTLITNFVKCCGSEDVIELHDYMVLKGAVPDRQTYQSLVSPSLQE 650
            ITYTTLITNFVK CGSEDVIELHDYMVLKGAVPDRQTYQSLVSP LQE
Sbjct: 601 LITYTTLITNFVKNCGSEDVIELHDYMVLKGAVPDRQTYQSLVSPRLQE 649

BLAST of HG10021066 vs. NCBI nr
Match: XP_023519211.1 (pentatricopeptide repeat-containing protein At1g11710, mitochondrial [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1161.0 bits (3002), Expect = 0.0e+00
Identity = 571/649 (87.98%), Postives = 611/649 (94.14%), Query Frame = 0

Query: 1   MIFSFIFPRRSCVFSRGFHTGKKLLSPSTEDIICKAICVNLKQRRWKFLEQVSPSLTNSL 60
           MIFSFIF RRSCVFSRGFHTGKKL SPSTEDIICKAICVNLK RRWKFLEQVSPSLTNSL
Sbjct: 1   MIFSFIFLRRSCVFSRGFHTGKKLFSPSTEDIICKAICVNLKHRRWKFLEQVSPSLTNSL 60

Query: 61  VCRVVREFRNSPQLALEFYNWVEARDNFSHSLESCCTLVHVLVNARNFNDALSIMESLML 120
           VCRVVREFR SPQLALEFYNWVEARDN SHSLESCCTLVHVLVN+RNF+DALSIM+SLML
Sbjct: 61  VCRVVREFRTSPQLALEFYNWVEARDNISHSLESCCTLVHVLVNSRNFDDALSIMKSLML 120

Query: 121 KNGKSPLEVLGGLIDSYDICNSNPAVFDALVRTCTQLGSLEGAYDVIKKSRLVGFWVTIH 180
           K+GKSPLEVLGGLIDSY+IC+SNPAVFDALVRTCTQLG+ EGAY+VIKK R  GFWVTIH
Sbjct: 121 KDGKSPLEVLGGLIDSYEICSSNPAVFDALVRTCTQLGTAEGAYNVIKKLRDEGFWVTIH 180

Query: 181 AWNNFLNQLLKLDETDKFWNMYKEMVASGYSENVNTFNLIIYALCKDCKLLEAIYVVYLM 240
           AWNNFLNQLLKLDE DKFWNMYKEMVASGY ENVNTFNLIIYA CK+CKLLEAI VVYLM
Sbjct: 181 AWNNFLNQLLKLDEIDKFWNMYKEMVASGYIENVNTFNLIIYAFCKECKLLEAISVVYLM 240

Query: 241 LKIEIWPNVVTFNMIIDKASKMGDMGLALKLARNTGVISGGSVSPNIVTYNCIINGFCKI 300
           LKIEIWPNVV FNMIIDKASKMG M LALKLARN GVISGGSV  NIV+YNC+INGFCK 
Sbjct: 241 LKIEIWPNVVAFNMIIDKASKMGHMDLALKLARNMGVISGGSVLLNIVSYNCMINGFCKT 300

Query: 301 RRLESAKNVLGEMIKLGIDFNVRTYATLIDGYARKGSLDVAFRLCDEMVEMGLIPDTVLY 360
            RLESA+NVL EMIKLG+DFNV TYATLIDGYARKGSLDVAFRLC+EMV+MGLIPDTV+Y
Sbjct: 301 GRLESAENVLSEMIKLGVDFNVTTYATLIDGYARKGSLDVAFRLCEEMVKMGLIPDTVVY 360

Query: 361 NSIIYWLYMEGELEEASFLLSDMINRHILPDEVTFSILTKGLCVNGHLNKALRVHDHILK 420
           NSI+YWLY EGELEEASFL+SDMINRHI PDEVTFSILTKGLC+NGHL+KALR+HDH+L+
Sbjct: 361 NSIVYWLYKEGELEEASFLISDMINRHIFPDEVTFSILTKGLCINGHLDKALRIHDHVLE 420

Query: 421 RNLVKDAFTHNILINYIFQTQNAAGAKQLLSSMIVRGIKPDMVTYGTLVDGCCKEGKIEA 480
            NLVKDAFTHNILINY+FQ+Q  AGAKQLLSSMIVRGI+PD+VTY TL+DGCCKEGK+EA
Sbjct: 421 GNLVKDAFTHNILINYMFQSQKTAGAKQLLSSMIVRGIEPDIVTYATLIDGCCKEGKMEA 480

Query: 481 AVQIYDKAVKADGKSNLVVYNSILNGLCKQGSIDAAKLLVDKLQQNGFLDTVTYNTLLHG 540
           A+QIYDK VKA GKS+LVVYNSIL+GLCKQGSIDAAKLLVDKLQ+NGFLD+VTYNTLLHG
Sbjct: 481 AIQIYDKTVKASGKSDLVVYNSILDGLCKQGSIDAAKLLVDKLQKNGFLDSVTYNTLLHG 540

Query: 541 YCVNGKVEKAFALFLEMINVGSLVNIVTYNIMINFLCKMGLIFQTMELMRAMTSQGIVPD 600
           YCVNG++EKAFALFLEMINVGSL NIVT+NIMINFLC+MGLI Q  ELMRAM++QGIVPD
Sbjct: 541 YCVNGRIEKAFALFLEMINVGSLANIVTFNIMINFLCEMGLIHQAKELMRAMSTQGIVPD 600

Query: 601 HITYTTLITNFVKCCGSEDVIELHDYMVLKGAVPDRQTYQSLVSPSLQE 650
            ITYTTLITNFVK CGSEDVIELHDYMVLKGAVPDRQTYQSLVSP LQE
Sbjct: 601 LITYTTLITNFVKNCGSEDVIELHDYMVLKGAVPDRQTYQSLVSPRLQE 649

BLAST of HG10021066 vs. NCBI nr
Match: KAA0056855.1 (pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] >TYJ99358.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 1160.6 bits (3001), Expect = 0.0e+00
Identity = 583/653 (89.28%), Postives = 615/653 (94.18%), Query Frame = 0

Query: 1   MIFSFIFPRRSCVFSRGFHTGKKLLSPSTEDIICKAICVNLKQRRWKFLEQVSPSLTNSL 60
           M+FSF F R S VF RGF TGKKLLSPSTEDII KAICVNLKQRRWKFLEQVSPSLTNSL
Sbjct: 53  MVFSFTFLRGSFVFRRGFRTGKKLLSPSTEDIIYKAICVNLKQRRWKFLEQVSPSLTNSL 112

Query: 61  VCRVVREFRNSPQLALEFYNWVEARDNFSHSLESCCTLVHVLVNARNFNDALSIMESLML 120
           VCRVVREFRNSPQLALEFYNWVEARDNFSHSLESCCTLVHVLVN+RNFNDALSIMESLML
Sbjct: 113 VCRVVREFRNSPQLALEFYNWVEARDNFSHSLESCCTLVHVLVNSRNFNDALSIMESLML 172

Query: 121 KNGKSPLEVLGGLIDSYDICNSNPAVFDALVRTCTQLGSLEGAYDVIKKSRLVGFWVTIH 180
           KNGKSPLEVLGGL++SY+ICNSNPAVFDALVRTCTQL S+EGAYDVI+K RL GFWVTIH
Sbjct: 173 KNGKSPLEVLGGLMNSYEICNSNPAVFDALVRTCTQLKSVEGAYDVIRKLRLEGFWVTIH 232

Query: 181 AWNNFLNQLLKLDETDKFWNMYKEMVASGYSENVNTFNLIIYALCKDCKLLEAIYVVYLM 240
           AWNNFLN LLKL ETDKFWNMY EMVASGYSENVNTFNLIIYALCK+CKLLEAI VVYLM
Sbjct: 233 AWNNFLNLLLKLGETDKFWNMYMEMVASGYSENVNTFNLIIYALCKECKLLEAISVVYLM 292

Query: 241 LKIEIWPNVVTFNMIIDKASKMGDMGLALKLARNTGVISGGSVSPNIVTYNCIINGFCKI 300
           LKIEIWPNVV+FNMIIDKASKMG+M LALKL RNT VISGGSVSPNIVTYNCIINGFCKI
Sbjct: 293 LKIEIWPNVVSFNMIIDKASKMGEMDLALKLTRNTEVISGGSVSPNIVTYNCIINGFCKI 352

Query: 301 RRLESAKNVLGEMIKLGIDFNVRTYATLIDGYARKGSLDVAFRLCDEMVEMGLIPDTVLY 360
           RRLESAKNVL EMIKLGID N RTYA LIDGYARKGSLDVAFRLCDEMVE  LIPDTV+Y
Sbjct: 353 RRLESAKNVLAEMIKLGIDSNERTYAPLIDGYARKGSLDVAFRLCDEMVETRLIPDTVVY 412

Query: 361 NSIIYWLYMEGELEEASFLLSDMINRHILPDEVTFSILTKGLCVNGHLNKALRVHDHILK 420
           NS+IYWLY+EGELEEASFLLSDMINR ILPDE T+SILTKGLC++GHLNKALRVH +I++
Sbjct: 413 NSLIYWLYIEGELEEASFLLSDMINRRILPDEFTYSILTKGLCLSGHLNKALRVHYYIVE 472

Query: 421 RNLVKDAFTHNILINYIFQTQNAAGAKQLLSSMIVRGIKPDMVTYGTLVDGCCKEGKIEA 480
           RNLVKDA+THNILINY+FQ++N AGAKQLLSSMIVRGIKPDMVTYGTLV G CKEGKIEA
Sbjct: 473 RNLVKDAYTHNILINYMFQSRNIAGAKQLLSSMIVRGIKPDMVTYGTLVAGHCKEGKIEA 532

Query: 481 AVQIYDKAVKADGKSNLVVYNSILNGLCKQGSIDAAKLLVDKLQQNGFLDTVTYNTLLHG 540
           AVQIYDK VKADGKSNLVVYNSIL+GLCKQGSIDAA+LLVDKLQQNGFLD+VTYNTLLHG
Sbjct: 533 AVQIYDKTVKADGKSNLVVYNSILDGLCKQGSIDAARLLVDKLQQNGFLDSVTYNTLLHG 592

Query: 541 YCVNGKVEKAFALFLEMINVGSLVNIVTYNIMINFLCKMGLIFQTMELMRAMTSQGIVPD 600
           +CVNG+VEKAFALFLEMINVGSLVNIV+YNIMINFLCKMGLI Q MELMRAM SQGIVPD
Sbjct: 593 FCVNGEVEKAFALFLEMINVGSLVNIVSYNIMINFLCKMGLIQQAMELMRAMASQGIVPD 652

Query: 601 HITYTTLITNFVKCCGSEDVIELHDYMVLKGAVPDRQTYQSLVSPSLQENVEG 654
            ITYTTLITNFVK  GS++VIELHDYMVLKGAVPDRQTYQSLVSP LQE+ EG
Sbjct: 653 LITYTTLITNFVKSYGSDNVIELHDYMVLKGAVPDRQTYQSLVSPCLQEHTEG 705

BLAST of HG10021066 vs. NCBI nr
Match: ADN34051.1 (pentatricopeptide repeat-containing protein [Cucumis melo subsp. melo])

HSP 1 Score: 1160.6 bits (3001), Expect = 0.0e+00
Identity = 583/653 (89.28%), Postives = 615/653 (94.18%), Query Frame = 0

Query: 1   MIFSFIFPRRSCVFSRGFHTGKKLLSPSTEDIICKAICVNLKQRRWKFLEQVSPSLTNSL 60
           M+FSF F R S VF RGF TGKKLLSPSTEDII KAICVNLKQRRWKFLEQVSPSLTNSL
Sbjct: 1   MVFSFTFLRGSFVFRRGFRTGKKLLSPSTEDIIYKAICVNLKQRRWKFLEQVSPSLTNSL 60

Query: 61  VCRVVREFRNSPQLALEFYNWVEARDNFSHSLESCCTLVHVLVNARNFNDALSIMESLML 120
           VCRVVREFRNSPQLALEFYNWVEARDNFSHSLESCCTLVHVLVN+RNFNDALSIMESLML
Sbjct: 61  VCRVVREFRNSPQLALEFYNWVEARDNFSHSLESCCTLVHVLVNSRNFNDALSIMESLML 120

Query: 121 KNGKSPLEVLGGLIDSYDICNSNPAVFDALVRTCTQLGSLEGAYDVIKKSRLVGFWVTIH 180
           KNGKSPLEVLGGL++SY+ICNSNPAVFDALVRTCTQL S+EGAYDVI+K RL GFWVTIH
Sbjct: 121 KNGKSPLEVLGGLMNSYEICNSNPAVFDALVRTCTQLKSVEGAYDVIRKLRLEGFWVTIH 180

Query: 181 AWNNFLNQLLKLDETDKFWNMYKEMVASGYSENVNTFNLIIYALCKDCKLLEAIYVVYLM 240
           AWNNFLN LLKL ETDKFWNMY EMVASGYSENVNTFNLIIYALCK+CKLLEAI VVYLM
Sbjct: 181 AWNNFLNLLLKLGETDKFWNMYMEMVASGYSENVNTFNLIIYALCKECKLLEAISVVYLM 240

Query: 241 LKIEIWPNVVTFNMIIDKASKMGDMGLALKLARNTGVISGGSVSPNIVTYNCIINGFCKI 300
           LKIEIWPNVV+FNMIIDKASKMG+M LALKL RNT VISGGSVSPNIVTYNCIINGFCKI
Sbjct: 241 LKIEIWPNVVSFNMIIDKASKMGEMDLALKLTRNTEVISGGSVSPNIVTYNCIINGFCKI 300

Query: 301 RRLESAKNVLGEMIKLGIDFNVRTYATLIDGYARKGSLDVAFRLCDEMVEMGLIPDTVLY 360
           RRLESAKNVL EMIKLGID N RTYA LIDGYARKGSLDVAFRLCDEMVE  LIPDTV+Y
Sbjct: 301 RRLESAKNVLAEMIKLGIDSNERTYAPLIDGYARKGSLDVAFRLCDEMVETRLIPDTVVY 360

Query: 361 NSIIYWLYMEGELEEASFLLSDMINRHILPDEVTFSILTKGLCVNGHLNKALRVHDHILK 420
           NS+IYWLY+EGELEEASFLLSDMINR ILPDE T+SILTKGLC++GHLNKALRVH +I++
Sbjct: 361 NSLIYWLYIEGELEEASFLLSDMINRRILPDEFTYSILTKGLCLSGHLNKALRVHYYIVE 420

Query: 421 RNLVKDAFTHNILINYIFQTQNAAGAKQLLSSMIVRGIKPDMVTYGTLVDGCCKEGKIEA 480
           RNLVKDA+THNILINY+FQ++N AGAKQLLSSMIVRGIKPDMVTYGTLV G CKEGKIEA
Sbjct: 421 RNLVKDAYTHNILINYMFQSRNIAGAKQLLSSMIVRGIKPDMVTYGTLVAGHCKEGKIEA 480

Query: 481 AVQIYDKAVKADGKSNLVVYNSILNGLCKQGSIDAAKLLVDKLQQNGFLDTVTYNTLLHG 540
           AVQIYDK VKADGKSNLVVYNSIL+GLCKQGSIDAA+LLVDKLQQNGFLD+VTYNTLLHG
Sbjct: 481 AVQIYDKTVKADGKSNLVVYNSILDGLCKQGSIDAARLLVDKLQQNGFLDSVTYNTLLHG 540

Query: 541 YCVNGKVEKAFALFLEMINVGSLVNIVTYNIMINFLCKMGLIFQTMELMRAMTSQGIVPD 600
           +CVNG+VEKAFALFLEMINVGSLVNIV+YNIMINFLCKMGLI Q MELMRAM SQGIVPD
Sbjct: 541 FCVNGEVEKAFALFLEMINVGSLVNIVSYNIMINFLCKMGLIQQAMELMRAMASQGIVPD 600

Query: 601 HITYTTLITNFVKCCGSEDVIELHDYMVLKGAVPDRQTYQSLVSPSLQENVEG 654
            ITYTTLITNFVK  GS++VIELHDYMVLKGAVPDRQTYQSLVSP LQE+ EG
Sbjct: 601 LITYTTLITNFVKSYGSDNVIELHDYMVLKGAVPDRQTYQSLVSPCLQEHTEG 653

BLAST of HG10021066 vs. ExPASy Swiss-Prot
Match: Q9SAA6 (Pentatricopeptide repeat-containing protein At1g11710, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At1g11710 PE=2 SV=1)

HSP 1 Score: 642.9 bits (1657), Expect = 4.0e-183
Identity = 324/653 (49.62%), Postives = 446/653 (68.30%), Query Frame = 0

Query: 2   IFSFIFPRRSCVFSRGFHTGKKLLSPSTEDIICKAICVNLKQRRWKFLEQVSPSLTNSLV 61
           +F  +F RR+    R FH  KK  +P  EDI+  A+C+NL+QRRW  L Q S SLTN L+
Sbjct: 1   MFGHVFSRRTSFLVRCFHVAKKFSNPEPEDILFSALCLNLRQRRWNTLHQFSSSLTNPLI 60

Query: 62  CRVVREFRNSPQLALEFYNWVEARDNFSHS---LESCCTLVHVLVNARNFNDALSIMESL 121
            RV+REFR+SP+LALEFYNWV   +  + S    E+ C ++H+LV +R F+DALSIM +L
Sbjct: 61  SRVLREFRSSPKLALEFYNWVLRSNTVAKSENRFEASCVMIHLLVGSRRFDDALSIMANL 120

Query: 122 MLKNGK--SPLEVLGGLIDSYDICNSNPAVFDALVRTCTQLGSLEGAYDVIKKSRLVGFW 181
           M   G+  SPL VL GLI SY  C S+P VFD+LVR CTQ G  +GAY+VI+++R  GF 
Sbjct: 121 MSVEGEKLSPLHVLSGLIRSYQACGSSPDVFDSLVRACTQNGDAQGAYEVIEQTRAEGFC 180

Query: 182 VTIHAWNNFLNQLLKLDETDKFWNMYKEMVASGYSENVNTFNLIIYALCKDCKLLEAIYV 241
           V++HA NNF+  LL ++E D+FW +YKEM + GY ENVNTFNL+IY+ CK+ KL EA+ V
Sbjct: 181 VSVHALNNFMGCLLNVNEIDRFWKVYKEMDSLGYVENVNTFNLVIYSFCKESKLFEALSV 240

Query: 242 VYLMLKIEIWPNVVTFNMIIDKASKMGDMGLALKLARNTGVISGGSVSPNIVTYNCIING 301
            Y MLK  +WPNVV+FNM+ID A K GDM  AL+L    G++SG  VSPN VTYN +ING
Sbjct: 241 FYRMLKCGVWPNVVSFNMMIDGACKTGDMRFALQLLGKMGMMSGNFVSPNAVTYNSVING 300

Query: 302 FCKIRRLESAKNVLGEMIKLGIDFNVRTYATLIDGYARKGSLDVAFRLCDEMVEMGLIPD 361
           FCK  RL+ A+ + G+M+K G+D N RTY  L+D Y R GS D A RLCDEM   GL+ +
Sbjct: 301 FCKAGRLDLAERIRGDMVKSGVDCNERTYGALVDAYGRAGSSDEALRLCDEMTSKGLVVN 360

Query: 362 TVLYNSIIYWLYMEGELEEASFLLSDMINRHILPDEVTFSILTKGLCVNGHLNKALRVHD 421
           TV+YNSI+YWL+MEG++E A  +L DM ++++  D  T +I+ +GLC NG++ +A+    
Sbjct: 361 TVIYNSIVYWLFMEGDIEGAMSVLRDMNSKNMQIDRFTQAIVVRGLCRNGYVKEAVEFQR 420

Query: 422 HILKRNLVKDAFTHNILINYIFQTQNAAGAKQLLSSMIVRGIKPDMVTYGTLVDGCCKEG 481
            I ++ LV+D   HN L+++  + +  A A Q+L SM+V+G+  D +++GTL+DG  KEG
Sbjct: 421 QISEKKLVEDIVCHNTLMHHFVRDKKLACADQILGSMLVQGLSLDAISFGTLIDGYLKEG 480

Query: 482 KIEAAVQIYDKAVKADGKSNLVVYNSILNGLCKQGSIDAAKLLVDKLQQNGFLDTVTYNT 541
           K+E A++IYD  +K +  SNLV+YNSI+NGL K+G   AA+ +V+ ++     D VTYNT
Sbjct: 481 KLERALEIYDGMIKMNKTSNLVIYNSIVNGLSKRGMAGAAEAVVNAME---IKDIVTYNT 540

Query: 542 LLHGYCVNGKVEKAFALFLEMINVG--SLVNIVTYNIMINFLCKMGLIFQTMELMRAMTS 601
           LL+     G VE+A  +  +M        V++VT+NIMIN LCK G   +  E+++ M  
Sbjct: 541 LLNESLKTGNVEEADDILSKMQKQDGEKSVSLVTFNIMINHLCKFGSYEKAKEVLKFMVE 600

Query: 602 QGIVPDHITYTTLITNFVKCCGSEDVIELHDYMVLKGAVPDRQTYQSLVSPSL 648
           +G+VPD ITY TLIT+F K    E V+ELHDY++L+G  P    Y S+V P L
Sbjct: 601 RGVVPDSITYGTLITSFSKHRSQEKVVELHDYLILQGVTPHEHIYLSIVRPLL 650

BLAST of HG10021066 vs. ExPASy Swiss-Prot
Match: Q9LFC5 (Pentatricopeptide repeat-containing protein At5g01110 OS=Arabidopsis thaliana OX=3702 GN=At5g01110 PE=2 SV=1)

HSP 1 Score: 292.0 bits (746), Expect = 1.7e-77
Identity = 181/626 (28.91%), Postives = 319/626 (50.96%), Query Frame = 0

Query: 29  TEDIICKAICVNLKQRRWKFLEQVSPSLTNSLVCRVVREFRNSPQLALEFYNWVEAR-DN 88
           ++  + + IC +LKQ        +   L    V  V+   RN   L   F + +     N
Sbjct: 50  SDSFLVEKICFSLKQGNNNVRNHLI-RLNPLAVVEVLYRCRNDLTLGQRFVDQLGFHFPN 109

Query: 89  FSHSLESCCTLVHVLVNARNFNDALSIMESLMLKNGKSPLEVLGGLIDSYDICNSNPAVF 148
           F H+  S   ++H+LV +   +DA S +  ++ ++G S LE++  L  ++  C SN +VF
Sbjct: 110 FKHTSLSLSAMIHILVRSGRLSDAQSCLLRMIRRSGVSRLEIVNSLDSTFSNCGSNDSVF 169

Query: 149 DALVRTCTQLGSLEGAYDVIKKSRLVGFWVTIHAWNNFLNQLLKLDETDKFWNMYKEMVA 208
           D L+RT  Q   L  A++     R  GF V+I A N  +  L+++   +  W +Y+E+  
Sbjct: 170 DLLIRTYVQARKLREAHEAFTLLRSKGFTVSIDACNALIGSLVRIGWVELAWGVYQEISR 229

Query: 209 SGYSENVNTFNLIIYALCKDCKLLEAIYVVYLMLKIEIWPNVVTFNMIIDKASKMGDMGL 268
           SG   NV T N+++ ALCKD K+ +    +  + +  ++P++VT+N +I   S  G M  
Sbjct: 230 SGVGINVYTLNIMVNALCKDGKMEKVGTFLSQVQEKGVYPDIVTYNTLISAYSSKGLMEE 289

Query: 269 ALKLARNTGVISGGSVSPNIVTYNCIINGFCKIRRLESAKNVLGEMIKLGIDFNVRTYAT 328
           A +L      + G   SP + TYN +ING CK  + E AK V  EM++ G+  +  TY +
Sbjct: 290 AFEL---MNAMPGKGFSPGVYTYNTVINGLCKHGKYERAKEVFAEMLRSGLSPDSTTYRS 349

Query: 329 LIDGYARKGSLDVAFRLCDEMVEMGLIPDTVLYNSIIYWLYMEGELEEASFLLSDMINRH 388
           L+    +KG +    ++  +M    ++PD V ++S++      G L++A    + +    
Sbjct: 350 LLMEACKKGDVVETEKVFSDMRSRDVVPDLVCFSSMMSLFTRSGNLDKALMYFNSVKEAG 409

Query: 389 ILPDEVTFSILTKGLCVNGHLNKALRVHDHILKRNLVKDAFTHNILINYIFQTQNAAGAK 448
           ++PD V ++IL +G C  G ++ A+ + + +L++    D  T+N +++ + + +    A 
Sbjct: 410 LIPDNVIYTILIQGYCRKGMISVAMNLRNEMLQQGCAMDVVTYNTILHGLCKRKMLGEAD 469

Query: 449 QLLSSMIVRGIKPDMVTYGTLVDGCCKEGKIEAAVQIYDKAVKADGKSNLVVYNSILNGL 508
           +L + M  R + PD  T   L+DG CK G ++ A++++ K  +   + ++V YN++L+G 
Sbjct: 470 KLFNEMTERALFPDSYTLTILIDGHCKLGNLQNAMELFQKMKEKRIRLDVVTYNTLLDGF 529

Query: 509 CKQGSIDAAKLLVDKLQQNGFLDT-VTYNTLLHGYCVNGKVEKAFALFLEMINVGSLVNI 568
            K G ID AK +   +     L T ++Y+ L++  C  G + +AF ++ EMI+      +
Sbjct: 530 GKVGDIDTAKEIWADMVSKEILPTPISYSILVNALCSKGHLAEAFRVWDEMISKNIKPTV 589

Query: 569 VTYNIMINFLCKMGLIFQTMELMRAMTSQGIVPDHITYTTLITNFVKCCGSEDVIELHDY 628
           +  N MI   C+ G        +  M S+G VPD I+Y TLI  FV+         L   
Sbjct: 590 MICNSMIKGYCRSGNASDGESFLEKMISEGFVPDCISYNTLIYGFVREENMSKAFGLVKK 649

Query: 629 MVLK--GAVPDRQTYQSLVSPSLQEN 651
           M  +  G VPD  TY S++    ++N
Sbjct: 650 MEEEQGGLVPDVFTYNSILHGFCRQN 671

BLAST of HG10021066 vs. ExPASy Swiss-Prot
Match: Q9LVQ5 (Pentatricopeptide repeat-containing protein At5g55840 OS=Arabidopsis thaliana OX=3702 GN=At5g55840 PE=3 SV=2)

HSP 1 Score: 275.0 bits (702), Expect = 2.2e-72
Identity = 171/610 (28.03%), Postives = 306/610 (50.16%), Query Frame = 0

Query: 73  QLALEFYNWVEARDNF--SHSLESCCTLVHVLVNARNFNDALSIMESLMLKNGKSPLEVL 132
           +LAL+F  WV  +      H ++  C   H+LV AR ++ A  I++ L L +GKS   V 
Sbjct: 51  KLALKFLKWVVKQPGLETDHIVQLVCITTHILVRARMYDPARHILKELSLMSGKSSF-VF 110

Query: 133 GGLIDSYDICNSNPAVFDALVRTCTQLGSLEGAYDVIKKSRLVGFWVTIHAWNNFLNQLL 192
           G L+ +Y +CNSNP+V+D L+R   + G ++ + ++ +   L GF  +++  N  L  ++
Sbjct: 111 GALMTTYRLCNSNPSVYDILIRVYLREGMIQDSLEIFRLMGLYGFNPSVYTCNAILGSVV 170

Query: 193 KLDETDKFWNMYKEMVASGYSENVNTFNLIIYALCKDCKLLEAIYVVYLMLKIEIWPNVV 252
           K  E    W+  KEM+      +V TFN++I  LC +    ++ Y++  M K    P +V
Sbjct: 171 KSGEDVSVWSFLKEMLKRKICPDVATFNILINVLCAEGSFEKSSYLMQKMEKSGYAPTIV 230

Query: 253 TFNMIIDKASKMGDMGLALKLARNTGVISGGSVSPNIVTYNCIINGFCKIRRLESAKNVL 312
           T+N ++    K G    A++L  +   +    V  ++ TYN +I+  C+  R+     +L
Sbjct: 231 TYNTVLHWYCKKGRFKAAIELLDH---MKSKGVDADVCTYNMLIHDLCRSNRIAKGYLLL 290

Query: 313 GEMIKLGIDFNVRTYATLIDGYARKGSLDVAFRLCDEMVEMGLIPDTVLYNSIIYWLYME 372
            +M K  I  N  TY TLI+G++ +G + +A +L +EM+  GL P+ V +N++I     E
Sbjct: 291 RDMRKRMIHPNEVTYNTLINGFSNEGKVLIASQLLNEMLSFGLSPNHVTFNALIDGHISE 350

Query: 373 GELEEASFLLSDMINRHILPDEVTFSIL-------------------------------- 432
           G  +EA  +   M  + + P EV++ +L                                
Sbjct: 351 GNFKEALKMFYMMEAKGLTPSEVSYGVLLDGLCKNAEFDLARGFYMRMKRNGVCVGRITY 410

Query: 433 ---TKGLCVNGHLNKALRVHDHILKRNLVKDAFTHNILINYIFQTQNAAGAKQLLSSMIV 492
                GLC NG L++A+ + + + K  +  D  T++ LIN   +      AK+++  +  
Sbjct: 411 TGMIDGLCKNGFLDEAVVLLNEMSKDGIDPDIVTYSALINGFCKVGRFKTAKEIVCRIYR 470

Query: 493 RGIKPDMVTYGTLVDGCCKEGKIEAAVQIYDKAVKADGKSNLVVYNSILNGLCKQGSIDA 552
            G+ P+ + Y TL+  CC+ G ++ A++IY+  +      +   +N ++  LCK G +  
Sbjct: 471 VGLSPNGIIYSTLIYNCCRMGCLKEAIRIYEAMILEGHTRDHFTFNVLVTSLCKAGKVAE 530

Query: 553 AKLLVDKLQQNGFL-DTVTYNTLLHGYCVNGKVEKAFALFLEMINVGSLVNIVTYNIMIN 612
           A+  +  +  +G L +TV+++ L++GY  +G+  KAF++F EM  VG      TY  ++ 
Sbjct: 531 AEEFMRCMTSDGILPNTVSFDCLINGYGNSGEGLKAFSVFDEMTKVGHHPTFFTYGSLLK 590

Query: 613 FLCKMGLIFQTMELMRAMTSQGIVPDHITYTTLITNFVKCCGSEDVIELHDYMVLKGAVP 645
            LCK G + +  + ++++ +     D + Y TL+T   K       + L   MV +  +P
Sbjct: 591 GLCKGGHLREAEKFLKSLHAVPAAVDTVMYNTLLTAMCKSGNLAKAVSLFGEMVQRSILP 650

BLAST of HG10021066 vs. ExPASy Swiss-Prot
Match: Q0WVK7 (Pentatricopeptide repeat-containing protein At1g05670, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At1g05670 PE=2 SV=1)

HSP 1 Score: 270.8 bits (691), Expect = 4.1e-71
Identity = 172/586 (29.35%), Postives = 296/586 (50.51%), Query Frame = 0

Query: 64  VVREFRNSPQLALEFYNWVEARDNFSHSLESCCTLVHVLVNARNFNDALSIMESL----M 123
           V+ + +   +L L+F++W  AR     +LES C ++H+ V +++   A S++ S      
Sbjct: 93  VLMKIKCDYRLVLDFFDW--ARSRRDSNLESLCIVIHLAVASKDLKVAQSLISSFWERPK 152

Query: 124 LKNGKSPLEVLGGLIDSYDICNSNPAVFDALVRTCTQLGSLEGAYDVIKKSRLVGFWVTI 183
           L    S ++    L+ +Y    S+P VFD   +     G L  A  V +K    G  +++
Sbjct: 153 LNVTDSFVQFFDLLVYTYKDWGSDPRVFDVFFQVLVDFGLLREARRVFEKMLNYGLVLSV 212

Query: 184 HAWNNFLNQLLK-LDETDKFWNMYKEMVASGYSENVNTFNLIIYALCKDCKLLEAIYVVY 243
            + N +L +L K   +T     +++E    G   NV ++N++I+ +C+  ++ EA +++ 
Sbjct: 213 DSCNVYLTRLSKDCYKTATAIIVFREFPEVGVCWNVASYNIVIHFVCQLGRIKEAHHLLL 272

Query: 244 LMLKIEIWPNVVTFNMIIDKASKMGDMGLALKLARNTGVISGGSVSPNIVTYNCIINGFC 303
           LM      P+V++++ +++   + G++    KL     V+    + PN   Y  II   C
Sbjct: 273 LMELKGYTPDVISYSTVVNGYCRFGELDKVWKLIE---VMKRKGLKPNSYIYGSIIGLLC 332

Query: 304 KIRRLESAKNVLGEMIKLGIDFNVRTYATLIDGYARKGSLDVAFRLCDEMVEMGLIPDTV 363
           +I +L  A+    EMI+ GI  +   Y TLIDG+ ++G +  A +   EM    + PD +
Sbjct: 333 RICKLAEAEEAFSEMIRQGILPDTVVYTTLIDGFCKRGDIRAASKFFYEMHSRDITPDVL 392

Query: 364 LYNSIIYWLYMEGELEEASFLLSDMINRHILPDEVTFSILTKGLCVNGHLNKALRVHDHI 423
            Y +II      G++ EA  L  +M  + + PD VTF+ L  G C  GH+  A RVH+H 
Sbjct: 393 TYTAIISGFCQIGDMVEAGKLFHEMFCKGLEPDSVTFTELINGYCKAGHMKDAFRVHNH- 452

Query: 424 LKRNLVKDAFTHNILINYIFQTQNAAGAKQLLSSMIVRGIKPDMVTYGTLVDGCCKEGKI 483
                                             MI  G  P++VTY TL+DG CKEG +
Sbjct: 453 ----------------------------------MIQAGCSPNVVTYTTLIDGLCKEGDL 512

Query: 484 EAAVQIYDKAVKADGKSNLVVYNSILNGLCKQGSIDAAKLLVDKLQQNGF-LDTVTYNTL 543
           ++A ++  +  K   + N+  YNSI+NGLCK G+I+ A  LV + +  G   DTVTY TL
Sbjct: 513 DSANELLHEMWKIGLQPNIFTYNSIVNGLCKSGNIEEAVKLVGEFEAAGLNADTVTYTTL 572

Query: 544 LHGYCVNGKVEKAFALFLEMINVGSLVNIVTYNIMINFLCKMGLIFQTMELMRAMTSQGI 603
           +  YC +G+++KA  +  EM+  G    IVT+N+++N  C  G++    +L+  M ++GI
Sbjct: 573 MDAYCKSGEMDKAQEILKEMLGKGLQPTIVTFNVLMNGFCLHGMLEDGEKLLNWMLAKGI 632

Query: 604 VPDHITYTTLITNFVKCCGSEDVIELHDYMVLKGAVPDRQTYQSLV 644
            P+  T+ +L+  +      +    ++  M  +G  PD +TY++LV
Sbjct: 633 APNATTFNSLVKQYCIRNNLKAATAIYKDMCSRGVGPDGKTYENLV 638

BLAST of HG10021066 vs. ExPASy Swiss-Prot
Match: Q9ZQF1 (Pentatricopeptide repeat-containing protein At2g15630, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At2g15630 PE=3 SV=1)

HSP 1 Score: 269.2 bits (687), Expect = 1.2e-70
Identity = 170/592 (28.72%), Postives = 301/592 (50.84%), Query Frame = 0

Query: 20  TGKKLLSPSTEDIICKAICVNLKQRRWKFLEQVSPSLTNSLVCRVVREFRNSPQLALEFY 79
           T + +L P T +I+ ++I    +  +W  +E V+  LT SLV   +     +P LA  F 
Sbjct: 37  TPESVLPPITSEILLESI----RSSQWHIVEHVADKLTPSLVSTTLLSLVKTPNLAFNFV 96

Query: 80  NWVEARDNFSHSLESCCTLVHVLVNARNFNDALSIMESLMLKNGKSPLEVLGGLIDSYDI 139
           N +   D +    ++ C  + V+    +      +++ ++     S   +   L+ ++D 
Sbjct: 97  NHI---DLYRLDFQTQCLAIAVISKLSSPKPVTQLLKEVVTSRKNSIRNLFDELVLAHDR 156

Query: 140 CNSNPAV-FDALVRTCTQLGSLEGAYDVIKKSRLVGFWVTIHAWNNFLNQLLKLDETDKF 199
             +   + FD LVR C QL  ++ A +     +  GF+      N+ L  L +L+  +  
Sbjct: 157 LETKSTILFDLLVRCCCQLRMVDEAIECFYLMKEKGFYPKTETCNHILTLLSRLNRIENA 216

Query: 200 WNMYKEMVASGYSENVNTFNLIIYALCKDCKLLEAIYVVYLMLKIEIWPNVVTFNMIIDK 259
           W  Y +M       NV TFN++I  LCK+ KL +A   + +M    I P +VT+N ++  
Sbjct: 217 WVFYADMYRMEIKSNVYTFNIMINVLCKEGKLKKAKGFLGIMEVFGIKPTIVTYNTLVQG 276

Query: 260 ASKMGDM-GLALKLARNTGVISGGSVSPNIVTYNCIINGFCKIRRLESAKNVLGEMIKLG 319
            S  G + G  L ++     +      P++ TYN I++  C   R   A  VL EM ++G
Sbjct: 277 FSLRGRIEGARLIISE----MKSKGFQPDMQTYNPILSWMCNEGR---ASEVLREMKEIG 336

Query: 320 IDFNVRTYATLIDGYARKGSLDVAFRLCDEMVEMGLIPDTVLYNSIIYWLYMEGELEEAS 379
           +  +  +Y  LI G +  G L++AF   DEMV+ G++P    YN++I+ L+ME ++E A 
Sbjct: 337 LVPDSVSYNILIRGCSNNGDLEMAFAYRDEMVKQGMVPTFYTYNTLIHGLFMENKIEAAE 396

Query: 380 FLLSDMINRHILPDEVTFSILTKGLCVNGHLNKALRVHDHILKRNLVKDAFTHNILINYI 439
            L+ ++  + I+ D VT++IL  G C +G   KA  +HD ++   +    FT+  LI  +
Sbjct: 397 ILIREIREKGIVLDSVTYNILINGYCQHGDAKKAFALHDEMMTDGIQPTQFTYTSLIYVL 456

Query: 440 FQTQNAAGAKQLLSSMIVRGIKPDMVTYGTLVDGCCKEGKIEAAVQIYDKAVKADGKSNL 499
            +      A +L   ++ +G+KPD+V   TL+DG C  G ++ A  +  +        + 
Sbjct: 457 CRKNKTREADELFEKVVGKGMKPDLVMMNTLMDGHCAIGNMDRAFSLLKEMDMMSINPDD 516

Query: 500 VVYNSILNGLCKQGSIDAAKLLVDKLQQNGFL-DTVTYNTLLHGYCVNGKVEKAFALFLE 559
           V YN ++ GLC +G  + A+ L+ ++++ G   D ++YNTL+ GY   G  + AF +  E
Sbjct: 517 VTYNCLMRGLCGEGKFEEARELMGEMKRRGIKPDHISYNTLISGYSKKGDTKHAFMVRDE 576

Query: 560 MINVGSLVNIVTYNIMINFLCKMGLIFQTMELMRAMTSQGIVPDHITYTTLI 609
           M+++G    ++TYN ++  L K        EL+R M S+GIVP+  ++ ++I
Sbjct: 577 MLSLGFNPTLLTYNALLKGLSKNQEGELAEELLREMKSEGIVPNDSSFCSVI 614

BLAST of HG10021066 vs. ExPASy TrEMBL
Match: A0A6J1ECJ9 (pentatricopeptide repeat-containing protein At1g11710, mitochondrial OS=Cucurbita moschata OX=3662 GN=LOC111431321 PE=4 SV=1)

HSP 1 Score: 1162.9 bits (3007), Expect = 0.0e+00
Identity = 573/649 (88.29%), Postives = 612/649 (94.30%), Query Frame = 0

Query: 1   MIFSFIFPRRSCVFSRGFHTGKKLLSPSTEDIICKAICVNLKQRRWKFLEQVSPSLTNSL 60
           MIFSFIF RRSCVFSRGFHTGKKL SPSTEDIICKAICVNLK RRWKFLEQVSPSLTNSL
Sbjct: 1   MIFSFIFLRRSCVFSRGFHTGKKLFSPSTEDIICKAICVNLKHRRWKFLEQVSPSLTNSL 60

Query: 61  VCRVVREFRNSPQLALEFYNWVEARDNFSHSLESCCTLVHVLVNARNFNDALSIMESLML 120
           VCRVVREFR SPQLALEFYNWVEARDN SHSLESCCTLVHVLVN+RNF+DALSIM+SLML
Sbjct: 61  VCRVVREFRTSPQLALEFYNWVEARDNISHSLESCCTLVHVLVNSRNFDDALSIMKSLML 120

Query: 121 KNGKSPLEVLGGLIDSYDICNSNPAVFDALVRTCTQLGSLEGAYDVIKKSRLVGFWVTIH 180
           K+GKSPLEVLGGLIDSY+IC+SN AVFDALVRTCTQLG+ EGAY+VIKK R  GFWVTIH
Sbjct: 121 KDGKSPLEVLGGLIDSYEICSSNSAVFDALVRTCTQLGTAEGAYNVIKKLRDEGFWVTIH 180

Query: 181 AWNNFLNQLLKLDETDKFWNMYKEMVASGYSENVNTFNLIIYALCKDCKLLEAIYVVYLM 240
           AWNNFLNQLLKLDE DKFWNMYKEMVASGY ENVNTFNLIIYA CK+CKLLEAI VVYLM
Sbjct: 181 AWNNFLNQLLKLDEIDKFWNMYKEMVASGYIENVNTFNLIIYAFCKECKLLEAISVVYLM 240

Query: 241 LKIEIWPNVVTFNMIIDKASKMGDMGLALKLARNTGVISGGSVSPNIVTYNCIINGFCKI 300
           LKIEIWPNVV FNMIIDKASKMG M LALKLARN GVISGGSV  NIV+YNC+INGFCKI
Sbjct: 241 LKIEIWPNVVAFNMIIDKASKMGHMDLALKLARNMGVISGGSVLLNIVSYNCMINGFCKI 300

Query: 301 RRLESAKNVLGEMIKLGIDFNVRTYATLIDGYARKGSLDVAFRLCDEMVEMGLIPDTVLY 360
            RLESA+NVL EMIKLG+DFNV TYATLIDGYARKGSLDVAFRLC+EMV+MGLIPDTV+Y
Sbjct: 301 GRLESAENVLSEMIKLGVDFNVTTYATLIDGYARKGSLDVAFRLCEEMVKMGLIPDTVVY 360

Query: 361 NSIIYWLYMEGELEEASFLLSDMINRHILPDEVTFSILTKGLCVNGHLNKALRVHDHILK 420
           NSI+YWLY EGELEEASFL+SDMINRHI PDEVTFSILTKGLC+NGHL+KALR+HDHIL+
Sbjct: 361 NSIVYWLYKEGELEEASFLISDMINRHIFPDEVTFSILTKGLCINGHLDKALRIHDHILE 420

Query: 421 RNLVKDAFTHNILINYIFQTQNAAGAKQLLSSMIVRGIKPDMVTYGTLVDGCCKEGKIEA 480
           RNLVKDAFTHNILINY+FQ+Q  AGAKQLLSSMIVRGI+PD+VTY TL+DGCCKEGK+EA
Sbjct: 421 RNLVKDAFTHNILINYMFQSQKTAGAKQLLSSMIVRGIEPDIVTYATLIDGCCKEGKMEA 480

Query: 481 AVQIYDKAVKADGKSNLVVYNSILNGLCKQGSIDAAKLLVDKLQQNGFLDTVTYNTLLHG 540
           A+QIYDK VKA GKS+LVVYNSIL+GLCKQGSIDAAKLLVDKLQ+NGFLD+VTYNTLLHG
Sbjct: 481 AIQIYDKTVKASGKSDLVVYNSILDGLCKQGSIDAAKLLVDKLQKNGFLDSVTYNTLLHG 540

Query: 541 YCVNGKVEKAFALFLEMINVGSLVNIVTYNIMINFLCKMGLIFQTMELMRAMTSQGIVPD 600
           YCVNG++EKAFALFLEMINVGSL NIVT+NIMINFLC+MGLI Q  ELMRAM++QGIVPD
Sbjct: 541 YCVNGRIEKAFALFLEMINVGSLANIVTFNIMINFLCEMGLIHQAKELMRAMSTQGIVPD 600

Query: 601 HITYTTLITNFVKCCGSEDVIELHDYMVLKGAVPDRQTYQSLVSPSLQE 650
            ITYTTLITNFVK CGSEDVIELHDYMVLKGAVPDRQTYQSLVSP LQE
Sbjct: 601 LITYTTLITNFVKNCGSEDVIELHDYMVLKGAVPDRQTYQSLVSPRLQE 649

BLAST of HG10021066 vs. ExPASy TrEMBL
Match: A0A5D3BMH2 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold248G005760 PE=4 SV=1)

HSP 1 Score: 1160.6 bits (3001), Expect = 0.0e+00
Identity = 583/653 (89.28%), Postives = 615/653 (94.18%), Query Frame = 0

Query: 1   MIFSFIFPRRSCVFSRGFHTGKKLLSPSTEDIICKAICVNLKQRRWKFLEQVSPSLTNSL 60
           M+FSF F R S VF RGF TGKKLLSPSTEDII KAICVNLKQRRWKFLEQVSPSLTNSL
Sbjct: 53  MVFSFTFLRGSFVFRRGFRTGKKLLSPSTEDIIYKAICVNLKQRRWKFLEQVSPSLTNSL 112

Query: 61  VCRVVREFRNSPQLALEFYNWVEARDNFSHSLESCCTLVHVLVNARNFNDALSIMESLML 120
           VCRVVREFRNSPQLALEFYNWVEARDNFSHSLESCCTLVHVLVN+RNFNDALSIMESLML
Sbjct: 113 VCRVVREFRNSPQLALEFYNWVEARDNFSHSLESCCTLVHVLVNSRNFNDALSIMESLML 172

Query: 121 KNGKSPLEVLGGLIDSYDICNSNPAVFDALVRTCTQLGSLEGAYDVIKKSRLVGFWVTIH 180
           KNGKSPLEVLGGL++SY+ICNSNPAVFDALVRTCTQL S+EGAYDVI+K RL GFWVTIH
Sbjct: 173 KNGKSPLEVLGGLMNSYEICNSNPAVFDALVRTCTQLKSVEGAYDVIRKLRLEGFWVTIH 232

Query: 181 AWNNFLNQLLKLDETDKFWNMYKEMVASGYSENVNTFNLIIYALCKDCKLLEAIYVVYLM 240
           AWNNFLN LLKL ETDKFWNMY EMVASGYSENVNTFNLIIYALCK+CKLLEAI VVYLM
Sbjct: 233 AWNNFLNLLLKLGETDKFWNMYMEMVASGYSENVNTFNLIIYALCKECKLLEAISVVYLM 292

Query: 241 LKIEIWPNVVTFNMIIDKASKMGDMGLALKLARNTGVISGGSVSPNIVTYNCIINGFCKI 300
           LKIEIWPNVV+FNMIIDKASKMG+M LALKL RNT VISGGSVSPNIVTYNCIINGFCKI
Sbjct: 293 LKIEIWPNVVSFNMIIDKASKMGEMDLALKLTRNTEVISGGSVSPNIVTYNCIINGFCKI 352

Query: 301 RRLESAKNVLGEMIKLGIDFNVRTYATLIDGYARKGSLDVAFRLCDEMVEMGLIPDTVLY 360
           RRLESAKNVL EMIKLGID N RTYA LIDGYARKGSLDVAFRLCDEMVE  LIPDTV+Y
Sbjct: 353 RRLESAKNVLAEMIKLGIDSNERTYAPLIDGYARKGSLDVAFRLCDEMVETRLIPDTVVY 412

Query: 361 NSIIYWLYMEGELEEASFLLSDMINRHILPDEVTFSILTKGLCVNGHLNKALRVHDHILK 420
           NS+IYWLY+EGELEEASFLLSDMINR ILPDE T+SILTKGLC++GHLNKALRVH +I++
Sbjct: 413 NSLIYWLYIEGELEEASFLLSDMINRRILPDEFTYSILTKGLCLSGHLNKALRVHYYIVE 472

Query: 421 RNLVKDAFTHNILINYIFQTQNAAGAKQLLSSMIVRGIKPDMVTYGTLVDGCCKEGKIEA 480
           RNLVKDA+THNILINY+FQ++N AGAKQLLSSMIVRGIKPDMVTYGTLV G CKEGKIEA
Sbjct: 473 RNLVKDAYTHNILINYMFQSRNIAGAKQLLSSMIVRGIKPDMVTYGTLVAGHCKEGKIEA 532

Query: 481 AVQIYDKAVKADGKSNLVVYNSILNGLCKQGSIDAAKLLVDKLQQNGFLDTVTYNTLLHG 540
           AVQIYDK VKADGKSNLVVYNSIL+GLCKQGSIDAA+LLVDKLQQNGFLD+VTYNTLLHG
Sbjct: 533 AVQIYDKTVKADGKSNLVVYNSILDGLCKQGSIDAARLLVDKLQQNGFLDSVTYNTLLHG 592

Query: 541 YCVNGKVEKAFALFLEMINVGSLVNIVTYNIMINFLCKMGLIFQTMELMRAMTSQGIVPD 600
           +CVNG+VEKAFALFLEMINVGSLVNIV+YNIMINFLCKMGLI Q MELMRAM SQGIVPD
Sbjct: 593 FCVNGEVEKAFALFLEMINVGSLVNIVSYNIMINFLCKMGLIQQAMELMRAMASQGIVPD 652

Query: 601 HITYTTLITNFVKCCGSEDVIELHDYMVLKGAVPDRQTYQSLVSPSLQENVEG 654
            ITYTTLITNFVK  GS++VIELHDYMVLKGAVPDRQTYQSLVSP LQE+ EG
Sbjct: 653 LITYTTLITNFVKSYGSDNVIELHDYMVLKGAVPDRQTYQSLVSPCLQEHTEG 705

BLAST of HG10021066 vs. ExPASy TrEMBL
Match: E5GC52 (Pentatricopeptide repeat-containing protein OS=Cucumis melo subsp. melo OX=412675 PE=4 SV=1)

HSP 1 Score: 1160.6 bits (3001), Expect = 0.0e+00
Identity = 583/653 (89.28%), Postives = 615/653 (94.18%), Query Frame = 0

Query: 1   MIFSFIFPRRSCVFSRGFHTGKKLLSPSTEDIICKAICVNLKQRRWKFLEQVSPSLTNSL 60
           M+FSF F R S VF RGF TGKKLLSPSTEDII KAICVNLKQRRWKFLEQVSPSLTNSL
Sbjct: 1   MVFSFTFLRGSFVFRRGFRTGKKLLSPSTEDIIYKAICVNLKQRRWKFLEQVSPSLTNSL 60

Query: 61  VCRVVREFRNSPQLALEFYNWVEARDNFSHSLESCCTLVHVLVNARNFNDALSIMESLML 120
           VCRVVREFRNSPQLALEFYNWVEARDNFSHSLESCCTLVHVLVN+RNFNDALSIMESLML
Sbjct: 61  VCRVVREFRNSPQLALEFYNWVEARDNFSHSLESCCTLVHVLVNSRNFNDALSIMESLML 120

Query: 121 KNGKSPLEVLGGLIDSYDICNSNPAVFDALVRTCTQLGSLEGAYDVIKKSRLVGFWVTIH 180
           KNGKSPLEVLGGL++SY+ICNSNPAVFDALVRTCTQL S+EGAYDVI+K RL GFWVTIH
Sbjct: 121 KNGKSPLEVLGGLMNSYEICNSNPAVFDALVRTCTQLKSVEGAYDVIRKLRLEGFWVTIH 180

Query: 181 AWNNFLNQLLKLDETDKFWNMYKEMVASGYSENVNTFNLIIYALCKDCKLLEAIYVVYLM 240
           AWNNFLN LLKL ETDKFWNMY EMVASGYSENVNTFNLIIYALCK+CKLLEAI VVYLM
Sbjct: 181 AWNNFLNLLLKLGETDKFWNMYMEMVASGYSENVNTFNLIIYALCKECKLLEAISVVYLM 240

Query: 241 LKIEIWPNVVTFNMIIDKASKMGDMGLALKLARNTGVISGGSVSPNIVTYNCIINGFCKI 300
           LKIEIWPNVV+FNMIIDKASKMG+M LALKL RNT VISGGSVSPNIVTYNCIINGFCKI
Sbjct: 241 LKIEIWPNVVSFNMIIDKASKMGEMDLALKLTRNTEVISGGSVSPNIVTYNCIINGFCKI 300

Query: 301 RRLESAKNVLGEMIKLGIDFNVRTYATLIDGYARKGSLDVAFRLCDEMVEMGLIPDTVLY 360
           RRLESAKNVL EMIKLGID N RTYA LIDGYARKGSLDVAFRLCDEMVE  LIPDTV+Y
Sbjct: 301 RRLESAKNVLAEMIKLGIDSNERTYAPLIDGYARKGSLDVAFRLCDEMVETRLIPDTVVY 360

Query: 361 NSIIYWLYMEGELEEASFLLSDMINRHILPDEVTFSILTKGLCVNGHLNKALRVHDHILK 420
           NS+IYWLY+EGELEEASFLLSDMINR ILPDE T+SILTKGLC++GHLNKALRVH +I++
Sbjct: 361 NSLIYWLYIEGELEEASFLLSDMINRRILPDEFTYSILTKGLCLSGHLNKALRVHYYIVE 420

Query: 421 RNLVKDAFTHNILINYIFQTQNAAGAKQLLSSMIVRGIKPDMVTYGTLVDGCCKEGKIEA 480
           RNLVKDA+THNILINY+FQ++N AGAKQLLSSMIVRGIKPDMVTYGTLV G CKEGKIEA
Sbjct: 421 RNLVKDAYTHNILINYMFQSRNIAGAKQLLSSMIVRGIKPDMVTYGTLVAGHCKEGKIEA 480

Query: 481 AVQIYDKAVKADGKSNLVVYNSILNGLCKQGSIDAAKLLVDKLQQNGFLDTVTYNTLLHG 540
           AVQIYDK VKADGKSNLVVYNSIL+GLCKQGSIDAA+LLVDKLQQNGFLD+VTYNTLLHG
Sbjct: 481 AVQIYDKTVKADGKSNLVVYNSILDGLCKQGSIDAARLLVDKLQQNGFLDSVTYNTLLHG 540

Query: 541 YCVNGKVEKAFALFLEMINVGSLVNIVTYNIMINFLCKMGLIFQTMELMRAMTSQGIVPD 600
           +CVNG+VEKAFALFLEMINVGSLVNIV+YNIMINFLCKMGLI Q MELMRAM SQGIVPD
Sbjct: 541 FCVNGEVEKAFALFLEMINVGSLVNIVSYNIMINFLCKMGLIQQAMELMRAMASQGIVPD 600

Query: 601 HITYTTLITNFVKCCGSEDVIELHDYMVLKGAVPDRQTYQSLVSPSLQENVEG 654
            ITYTTLITNFVK  GS++VIELHDYMVLKGAVPDRQTYQSLVSP LQE+ EG
Sbjct: 601 LITYTTLITNFVKSYGSDNVIELHDYMVLKGAVPDRQTYQSLVSPCLQEHTEG 653

BLAST of HG10021066 vs. ExPASy TrEMBL
Match: A0A1S3B385 (LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At1g11710, mitochondrial OS=Cucumis melo OX=3656 GN=LOC103485289 PE=4 SV=1)

HSP 1 Score: 1158.7 bits (2996), Expect = 0.0e+00
Identity = 582/653 (89.13%), Postives = 614/653 (94.03%), Query Frame = 0

Query: 1   MIFSFIFPRRSCVFSRGFHTGKKLLSPSTEDIICKAICVNLKQRRWKFLEQVSPSLTNSL 60
           M+FSF F R S VF RGF TGKKLLSPSTEDII KAICVNLKQRRWKFLEQVSPSLTNSL
Sbjct: 1   MVFSFTFLRGSFVFRRGFRTGKKLLSPSTEDIIYKAICVNLKQRRWKFLEQVSPSLTNSL 60

Query: 61  VCRVVREFRNSPQLALEFYNWVEARDNFSHSLESCCTLVHVLVNARNFNDALSIMESLML 120
           VCRVVREFRNSPQLALEFYNWVEARDNFSHSLESCCTLVHVLVN+RNFNDALSIMESLML
Sbjct: 61  VCRVVREFRNSPQLALEFYNWVEARDNFSHSLESCCTLVHVLVNSRNFNDALSIMESLML 120

Query: 121 KNGKSPLEVLGGLIDSYDICNSNPAVFDALVRTCTQLGSLEGAYDVIKKSRLVGFWVTIH 180
           KNGKSPLEVLGGL++SY+ICNSNPAVFDALVRTCTQL S+EGAYDVI+K RL GFWVTIH
Sbjct: 121 KNGKSPLEVLGGLMNSYEICNSNPAVFDALVRTCTQLKSVEGAYDVIRKLRLEGFWVTIH 180

Query: 181 AWNNFLNQLLKLDETDKFWNMYKEMVASGYSENVNTFNLIIYALCKDCKLLEAIYVVYLM 240
           AWNNFLN LLKL ETDKFWNMY EMVASGYSENVNTFNLIIYALCK+CKLLEAI VVYLM
Sbjct: 181 AWNNFLNLLLKLGETDKFWNMYMEMVASGYSENVNTFNLIIYALCKECKLLEAISVVYLM 240

Query: 241 LKIEIWPNVVTFNMIIDKASKMGDMGLALKLARNTGVISGGSVSPNIVTYNCIINGFCKI 300
           LKIEIWPNVV+FNMIIDKASKMG+M LALKL RNT VISGGSVSPNIVTYNCIINGFCKI
Sbjct: 241 LKIEIWPNVVSFNMIIDKASKMGEMDLALKLTRNTEVISGGSVSPNIVTYNCIINGFCKI 300

Query: 301 RRLESAKNVLGEMIKLGIDFNVRTYATLIDGYARKGSLDVAFRLCDEMVEMGLIPDTVLY 360
           RRLESAKNVL EMIKLGID N RTYA LIDGYARKGSLDVAFRLCDEMVE  LIPDTV+Y
Sbjct: 301 RRLESAKNVLAEMIKLGIDSNERTYAPLIDGYARKGSLDVAFRLCDEMVETRLIPDTVVY 360

Query: 361 NSIIYWLYMEGELEEASFLLSDMINRHILPDEVTFSILTKGLCVNGHLNKALRVHDHILK 420
           NS+IYWLY+EGELEEASFLLSDMINR ILPDE T+SILTKGLC++GHLNKALRVH +I++
Sbjct: 361 NSLIYWLYIEGELEEASFLLSDMINRRILPDEFTYSILTKGLCLSGHLNKALRVHYYIVE 420

Query: 421 RNLVKDAFTHNILINYIFQTQNAAGAKQLLSSMIVRGIKPDMVTYGTLVDGCCKEGKIEA 480
           RNLVKDA+THNILINY+FQ++N AGAKQLLSSMIVRGIKPDMVTYGTLV G CKEGK EA
Sbjct: 421 RNLVKDAYTHNILINYMFQSRNIAGAKQLLSSMIVRGIKPDMVTYGTLVAGHCKEGKXEA 480

Query: 481 AVQIYDKAVKADGKSNLVVYNSILNGLCKQGSIDAAKLLVDKLQQNGFLDTVTYNTLLHG 540
           AVQIYDK VKADGKSNLVVYNSIL+GLCKQGSIDAA+LLVDKLQQNGFLD+VTYNTLLHG
Sbjct: 481 AVQIYDKTVKADGKSNLVVYNSILDGLCKQGSIDAARLLVDKLQQNGFLDSVTYNTLLHG 540

Query: 541 YCVNGKVEKAFALFLEMINVGSLVNIVTYNIMINFLCKMGLIFQTMELMRAMTSQGIVPD 600
           +CVNG+VEKAFALFLEMINVGSLVNIV+YNIMINFLCKMGLI Q MELMRAM SQGIVPD
Sbjct: 541 FCVNGEVEKAFALFLEMINVGSLVNIVSYNIMINFLCKMGLIQQAMELMRAMASQGIVPD 600

Query: 601 HITYTTLITNFVKCCGSEDVIELHDYMVLKGAVPDRQTYQSLVSPSLQENVEG 654
            ITYTTLITNFVK  GS++VIELHDYMVLKGAVPDRQTYQSLVSP LQE+ EG
Sbjct: 601 LITYTTLITNFVKSYGSDNVIELHDYMVLKGAVPDRQTYQSLVSPCLQEHTEG 653

BLAST of HG10021066 vs. ExPASy TrEMBL
Match: A0A6J1KKM8 (pentatricopeptide repeat-containing protein At1g11710, mitochondrial OS=Cucurbita maxima OX=3661 GN=LOC111495429 PE=4 SV=1)

HSP 1 Score: 1151.3 bits (2977), Expect = 0.0e+00
Identity = 567/649 (87.37%), Postives = 607/649 (93.53%), Query Frame = 0

Query: 1   MIFSFIFPRRSCVFSRGFHTGKKLLSPSTEDIICKAICVNLKQRRWKFLEQVSPSLTNSL 60
           MIFSFIF RRSC FSRGFHTGKK  SPST DIICKAICVNLK RRWKFLEQVSPSLTNSL
Sbjct: 1   MIFSFIFLRRSCFFSRGFHTGKKFFSPSTVDIICKAICVNLKHRRWKFLEQVSPSLTNSL 60

Query: 61  VCRVVREFRNSPQLALEFYNWVEARDNFSHSLESCCTLVHVLVNARNFNDALSIMESLML 120
           VCRVVREFR SPQLALEFYNWVEARDN SHSLESCCTLVHVL+N+RNF+DALSIM+SLML
Sbjct: 61  VCRVVREFRTSPQLALEFYNWVEARDNISHSLESCCTLVHVLINSRNFDDALSIMKSLML 120

Query: 121 KNGKSPLEVLGGLIDSYDICNSNPAVFDALVRTCTQLGSLEGAYDVIKKSRLVGFWVTIH 180
           K+GKSPLEVLGGLIDSY+IC+SNPAVFDALVRTCTQLG+ EGAY+VIKK ++ GFWVTIH
Sbjct: 121 KDGKSPLEVLGGLIDSYEICSSNPAVFDALVRTCTQLGTAEGAYNVIKKLKVEGFWVTIH 180

Query: 181 AWNNFLNQLLKLDETDKFWNMYKEMVASGYSENVNTFNLIIYALCKDCKLLEAIYVVYLM 240
           AWNNFLNQLLKLDE DKFWNMYKEMVASGY ENVNTFNLIIYA CK+CKLLEAI VVYLM
Sbjct: 181 AWNNFLNQLLKLDEIDKFWNMYKEMVASGYIENVNTFNLIIYAFCKECKLLEAISVVYLM 240

Query: 241 LKIEIWPNVVTFNMIIDKASKMGDMGLALKLARNTGVISGGSVSPNIVTYNCIINGFCKI 300
           LKIEIWPNVV FNMIIDKASKMG M LALKLARN  VISGGSV  NIVTYNC+INGFCKI
Sbjct: 241 LKIEIWPNVVAFNMIIDKASKMGHMDLALKLARNARVISGGSVLLNIVTYNCMINGFCKI 300

Query: 301 RRLESAKNVLGEMIKLGIDFNVRTYATLIDGYARKGSLDVAFRLCDEMVEMGLIPDTVLY 360
            RLESA+NVL EMIKLG+DFNV TYATLIDGYARKGSLDVAFRLC+EMV+MGLIPDTV+Y
Sbjct: 301 GRLESAENVLSEMIKLGVDFNVTTYATLIDGYARKGSLDVAFRLCEEMVKMGLIPDTVVY 360

Query: 361 NSIIYWLYMEGELEEASFLLSDMINRHILPDEVTFSILTKGLCVNGHLNKALRVHDHILK 420
           NSI+YWLY EGELEEASFL+SDMINRHI PDEVTFSILTKGLC+NGHL+KALR+HD IL+
Sbjct: 361 NSIVYWLYKEGELEEASFLISDMINRHIFPDEVTFSILTKGLCINGHLDKALRIHDLILE 420

Query: 421 RNLVKDAFTHNILINYIFQTQNAAGAKQLLSSMIVRGIKPDMVTYGTLVDGCCKEGKIEA 480
           RNLVKDAFTHNILINY+FQ+Q  AGA QLLSSMIVRGI+PD+VTY TL+DGCCKEGK+EA
Sbjct: 421 RNLVKDAFTHNILINYMFQSQKTAGANQLLSSMIVRGIEPDIVTYATLIDGCCKEGKMEA 480

Query: 481 AVQIYDKAVKADGKSNLVVYNSILNGLCKQGSIDAAKLLVDKLQQNGFLDTVTYNTLLHG 540
           A+QIYDK VKA G S+LVVYNSIL+GLCKQGSIDAAKLLVDKLQ+NGFLDTVTYNTLLHG
Sbjct: 481 AIQIYDKTVKASGISDLVVYNSILDGLCKQGSIDAAKLLVDKLQKNGFLDTVTYNTLLHG 540

Query: 541 YCVNGKVEKAFALFLEMINVGSLVNIVTYNIMINFLCKMGLIFQTMELMRAMTSQGIVPD 600
           YCVNG++EKAFALFLEMINVGSL NIVT+NIMINFLC+MGLI Q  ELMRAM++QGIVPD
Sbjct: 541 YCVNGRIEKAFALFLEMINVGSLANIVTFNIMINFLCEMGLIHQAKELMRAMSTQGIVPD 600

Query: 601 HITYTTLITNFVKCCGSEDVIELHDYMVLKGAVPDRQTYQSLVSPSLQE 650
            ITYTTLITNFVK CGSEDVIELHDYMVLKGAVPDRQTYQSLVSP LQE
Sbjct: 601 LITYTTLITNFVKNCGSEDVIELHDYMVLKGAVPDRQTYQSLVSPRLQE 649

BLAST of HG10021066 vs. TAIR 10
Match: AT1G11710.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 642.9 bits (1657), Expect = 2.8e-184
Identity = 324/653 (49.62%), Postives = 446/653 (68.30%), Query Frame = 0

Query: 2   IFSFIFPRRSCVFSRGFHTGKKLLSPSTEDIICKAICVNLKQRRWKFLEQVSPSLTNSLV 61
           +F  +F RR+    R FH  KK  +P  EDI+  A+C+NL+QRRW  L Q S SLTN L+
Sbjct: 1   MFGHVFSRRTSFLVRCFHVAKKFSNPEPEDILFSALCLNLRQRRWNTLHQFSSSLTNPLI 60

Query: 62  CRVVREFRNSPQLALEFYNWVEARDNFSHS---LESCCTLVHVLVNARNFNDALSIMESL 121
            RV+REFR+SP+LALEFYNWV   +  + S    E+ C ++H+LV +R F+DALSIM +L
Sbjct: 61  SRVLREFRSSPKLALEFYNWVLRSNTVAKSENRFEASCVMIHLLVGSRRFDDALSIMANL 120

Query: 122 MLKNGK--SPLEVLGGLIDSYDICNSNPAVFDALVRTCTQLGSLEGAYDVIKKSRLVGFW 181
           M   G+  SPL VL GLI SY  C S+P VFD+LVR CTQ G  +GAY+VI+++R  GF 
Sbjct: 121 MSVEGEKLSPLHVLSGLIRSYQACGSSPDVFDSLVRACTQNGDAQGAYEVIEQTRAEGFC 180

Query: 182 VTIHAWNNFLNQLLKLDETDKFWNMYKEMVASGYSENVNTFNLIIYALCKDCKLLEAIYV 241
           V++HA NNF+  LL ++E D+FW +YKEM + GY ENVNTFNL+IY+ CK+ KL EA+ V
Sbjct: 181 VSVHALNNFMGCLLNVNEIDRFWKVYKEMDSLGYVENVNTFNLVIYSFCKESKLFEALSV 240

Query: 242 VYLMLKIEIWPNVVTFNMIIDKASKMGDMGLALKLARNTGVISGGSVSPNIVTYNCIING 301
            Y MLK  +WPNVV+FNM+ID A K GDM  AL+L    G++SG  VSPN VTYN +ING
Sbjct: 241 FYRMLKCGVWPNVVSFNMMIDGACKTGDMRFALQLLGKMGMMSGNFVSPNAVTYNSVING 300

Query: 302 FCKIRRLESAKNVLGEMIKLGIDFNVRTYATLIDGYARKGSLDVAFRLCDEMVEMGLIPD 361
           FCK  RL+ A+ + G+M+K G+D N RTY  L+D Y R GS D A RLCDEM   GL+ +
Sbjct: 301 FCKAGRLDLAERIRGDMVKSGVDCNERTYGALVDAYGRAGSSDEALRLCDEMTSKGLVVN 360

Query: 362 TVLYNSIIYWLYMEGELEEASFLLSDMINRHILPDEVTFSILTKGLCVNGHLNKALRVHD 421
           TV+YNSI+YWL+MEG++E A  +L DM ++++  D  T +I+ +GLC NG++ +A+    
Sbjct: 361 TVIYNSIVYWLFMEGDIEGAMSVLRDMNSKNMQIDRFTQAIVVRGLCRNGYVKEAVEFQR 420

Query: 422 HILKRNLVKDAFTHNILINYIFQTQNAAGAKQLLSSMIVRGIKPDMVTYGTLVDGCCKEG 481
            I ++ LV+D   HN L+++  + +  A A Q+L SM+V+G+  D +++GTL+DG  KEG
Sbjct: 421 QISEKKLVEDIVCHNTLMHHFVRDKKLACADQILGSMLVQGLSLDAISFGTLIDGYLKEG 480

Query: 482 KIEAAVQIYDKAVKADGKSNLVVYNSILNGLCKQGSIDAAKLLVDKLQQNGFLDTVTYNT 541
           K+E A++IYD  +K +  SNLV+YNSI+NGL K+G   AA+ +V+ ++     D VTYNT
Sbjct: 481 KLERALEIYDGMIKMNKTSNLVIYNSIVNGLSKRGMAGAAEAVVNAME---IKDIVTYNT 540

Query: 542 LLHGYCVNGKVEKAFALFLEMINVG--SLVNIVTYNIMINFLCKMGLIFQTMELMRAMTS 601
           LL+     G VE+A  +  +M        V++VT+NIMIN LCK G   +  E+++ M  
Sbjct: 541 LLNESLKTGNVEEADDILSKMQKQDGEKSVSLVTFNIMINHLCKFGSYEKAKEVLKFMVE 600

Query: 602 QGIVPDHITYTTLITNFVKCCGSEDVIELHDYMVLKGAVPDRQTYQSLVSPSL 648
           +G+VPD ITY TLIT+F K    E V+ELHDY++L+G  P    Y S+V P L
Sbjct: 601 RGVVPDSITYGTLITSFSKHRSQEKVVELHDYLILQGVTPHEHIYLSIVRPLL 650

BLAST of HG10021066 vs. TAIR 10
Match: AT5G01110.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 292.0 bits (746), Expect = 1.2e-78
Identity = 181/626 (28.91%), Postives = 319/626 (50.96%), Query Frame = 0

Query: 29  TEDIICKAICVNLKQRRWKFLEQVSPSLTNSLVCRVVREFRNSPQLALEFYNWVEAR-DN 88
           ++  + + IC +LKQ        +   L    V  V+   RN   L   F + +     N
Sbjct: 50  SDSFLVEKICFSLKQGNNNVRNHLI-RLNPLAVVEVLYRCRNDLTLGQRFVDQLGFHFPN 109

Query: 89  FSHSLESCCTLVHVLVNARNFNDALSIMESLMLKNGKSPLEVLGGLIDSYDICNSNPAVF 148
           F H+  S   ++H+LV +   +DA S +  ++ ++G S LE++  L  ++  C SN +VF
Sbjct: 110 FKHTSLSLSAMIHILVRSGRLSDAQSCLLRMIRRSGVSRLEIVNSLDSTFSNCGSNDSVF 169

Query: 149 DALVRTCTQLGSLEGAYDVIKKSRLVGFWVTIHAWNNFLNQLLKLDETDKFWNMYKEMVA 208
           D L+RT  Q   L  A++     R  GF V+I A N  +  L+++   +  W +Y+E+  
Sbjct: 170 DLLIRTYVQARKLREAHEAFTLLRSKGFTVSIDACNALIGSLVRIGWVELAWGVYQEISR 229

Query: 209 SGYSENVNTFNLIIYALCKDCKLLEAIYVVYLMLKIEIWPNVVTFNMIIDKASKMGDMGL 268
           SG   NV T N+++ ALCKD K+ +    +  + +  ++P++VT+N +I   S  G M  
Sbjct: 230 SGVGINVYTLNIMVNALCKDGKMEKVGTFLSQVQEKGVYPDIVTYNTLISAYSSKGLMEE 289

Query: 269 ALKLARNTGVISGGSVSPNIVTYNCIINGFCKIRRLESAKNVLGEMIKLGIDFNVRTYAT 328
           A +L      + G   SP + TYN +ING CK  + E AK V  EM++ G+  +  TY +
Sbjct: 290 AFEL---MNAMPGKGFSPGVYTYNTVINGLCKHGKYERAKEVFAEMLRSGLSPDSTTYRS 349

Query: 329 LIDGYARKGSLDVAFRLCDEMVEMGLIPDTVLYNSIIYWLYMEGELEEASFLLSDMINRH 388
           L+    +KG +    ++  +M    ++PD V ++S++      G L++A    + +    
Sbjct: 350 LLMEACKKGDVVETEKVFSDMRSRDVVPDLVCFSSMMSLFTRSGNLDKALMYFNSVKEAG 409

Query: 389 ILPDEVTFSILTKGLCVNGHLNKALRVHDHILKRNLVKDAFTHNILINYIFQTQNAAGAK 448
           ++PD V ++IL +G C  G ++ A+ + + +L++    D  T+N +++ + + +    A 
Sbjct: 410 LIPDNVIYTILIQGYCRKGMISVAMNLRNEMLQQGCAMDVVTYNTILHGLCKRKMLGEAD 469

Query: 449 QLLSSMIVRGIKPDMVTYGTLVDGCCKEGKIEAAVQIYDKAVKADGKSNLVVYNSILNGL 508
           +L + M  R + PD  T   L+DG CK G ++ A++++ K  +   + ++V YN++L+G 
Sbjct: 470 KLFNEMTERALFPDSYTLTILIDGHCKLGNLQNAMELFQKMKEKRIRLDVVTYNTLLDGF 529

Query: 509 CKQGSIDAAKLLVDKLQQNGFLDT-VTYNTLLHGYCVNGKVEKAFALFLEMINVGSLVNI 568
            K G ID AK +   +     L T ++Y+ L++  C  G + +AF ++ EMI+      +
Sbjct: 530 GKVGDIDTAKEIWADMVSKEILPTPISYSILVNALCSKGHLAEAFRVWDEMISKNIKPTV 589

Query: 569 VTYNIMINFLCKMGLIFQTMELMRAMTSQGIVPDHITYTTLITNFVKCCGSEDVIELHDY 628
           +  N MI   C+ G        +  M S+G VPD I+Y TLI  FV+         L   
Sbjct: 590 MICNSMIKGYCRSGNASDGESFLEKMISEGFVPDCISYNTLIYGFVREENMSKAFGLVKK 649

Query: 629 MVLK--GAVPDRQTYQSLVSPSLQEN 651
           M  +  G VPD  TY S++    ++N
Sbjct: 650 MEEEQGGLVPDVFTYNSILHGFCRQN 671

BLAST of HG10021066 vs. TAIR 10
Match: AT5G55840.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 275.0 bits (702), Expect = 1.5e-73
Identity = 171/610 (28.03%), Postives = 306/610 (50.16%), Query Frame = 0

Query: 73  QLALEFYNWVEARDNF--SHSLESCCTLVHVLVNARNFNDALSIMESLMLKNGKSPLEVL 132
           +LAL+F  WV  +      H ++  C   H+LV AR ++ A  I++ L L +GKS   V 
Sbjct: 91  KLALKFLKWVVKQPGLETDHIVQLVCITTHILVRARMYDPARHILKELSLMSGKSSF-VF 150

Query: 133 GGLIDSYDICNSNPAVFDALVRTCTQLGSLEGAYDVIKKSRLVGFWVTIHAWNNFLNQLL 192
           G L+ +Y +CNSNP+V+D L+R   + G ++ + ++ +   L GF  +++  N  L  ++
Sbjct: 151 GALMTTYRLCNSNPSVYDILIRVYLREGMIQDSLEIFRLMGLYGFNPSVYTCNAILGSVV 210

Query: 193 KLDETDKFWNMYKEMVASGYSENVNTFNLIIYALCKDCKLLEAIYVVYLMLKIEIWPNVV 252
           K  E    W+  KEM+      +V TFN++I  LC +    ++ Y++  M K    P +V
Sbjct: 211 KSGEDVSVWSFLKEMLKRKICPDVATFNILINVLCAEGSFEKSSYLMQKMEKSGYAPTIV 270

Query: 253 TFNMIIDKASKMGDMGLALKLARNTGVISGGSVSPNIVTYNCIINGFCKIRRLESAKNVL 312
           T+N ++    K G    A++L  +   +    V  ++ TYN +I+  C+  R+     +L
Sbjct: 271 TYNTVLHWYCKKGRFKAAIELLDH---MKSKGVDADVCTYNMLIHDLCRSNRIAKGYLLL 330

Query: 313 GEMIKLGIDFNVRTYATLIDGYARKGSLDVAFRLCDEMVEMGLIPDTVLYNSIIYWLYME 372
            +M K  I  N  TY TLI+G++ +G + +A +L +EM+  GL P+ V +N++I     E
Sbjct: 331 RDMRKRMIHPNEVTYNTLINGFSNEGKVLIASQLLNEMLSFGLSPNHVTFNALIDGHISE 390

Query: 373 GELEEASFLLSDMINRHILPDEVTFSIL-------------------------------- 432
           G  +EA  +   M  + + P EV++ +L                                
Sbjct: 391 GNFKEALKMFYMMEAKGLTPSEVSYGVLLDGLCKNAEFDLARGFYMRMKRNGVCVGRITY 450

Query: 433 ---TKGLCVNGHLNKALRVHDHILKRNLVKDAFTHNILINYIFQTQNAAGAKQLLSSMIV 492
                GLC NG L++A+ + + + K  +  D  T++ LIN   +      AK+++  +  
Sbjct: 451 TGMIDGLCKNGFLDEAVVLLNEMSKDGIDPDIVTYSALINGFCKVGRFKTAKEIVCRIYR 510

Query: 493 RGIKPDMVTYGTLVDGCCKEGKIEAAVQIYDKAVKADGKSNLVVYNSILNGLCKQGSIDA 552
            G+ P+ + Y TL+  CC+ G ++ A++IY+  +      +   +N ++  LCK G +  
Sbjct: 511 VGLSPNGIIYSTLIYNCCRMGCLKEAIRIYEAMILEGHTRDHFTFNVLVTSLCKAGKVAE 570

Query: 553 AKLLVDKLQQNGFL-DTVTYNTLLHGYCVNGKVEKAFALFLEMINVGSLVNIVTYNIMIN 612
           A+  +  +  +G L +TV+++ L++GY  +G+  KAF++F EM  VG      TY  ++ 
Sbjct: 571 AEEFMRCMTSDGILPNTVSFDCLINGYGNSGEGLKAFSVFDEMTKVGHHPTFFTYGSLLK 630

Query: 613 FLCKMGLIFQTMELMRAMTSQGIVPDHITYTTLITNFVKCCGSEDVIELHDYMVLKGAVP 645
            LCK G + +  + ++++ +     D + Y TL+T   K       + L   MV +  +P
Sbjct: 631 GLCKGGHLREAEKFLKSLHAVPAAVDTVMYNTLLTAMCKSGNLAKAVSLFGEMVQRSILP 690

BLAST of HG10021066 vs. TAIR 10
Match: AT1G05670.1 (Pentatricopeptide repeat (PPR-like) superfamily protein )

HSP 1 Score: 270.8 bits (691), Expect = 2.9e-72
Identity = 172/586 (29.35%), Postives = 296/586 (50.51%), Query Frame = 0

Query: 64  VVREFRNSPQLALEFYNWVEARDNFSHSLESCCTLVHVLVNARNFNDALSIMESL----M 123
           V+ + +   +L L+F++W  AR     +LES C ++H+ V +++   A S++ S      
Sbjct: 93  VLMKIKCDYRLVLDFFDW--ARSRRDSNLESLCIVIHLAVASKDLKVAQSLISSFWERPK 152

Query: 124 LKNGKSPLEVLGGLIDSYDICNSNPAVFDALVRTCTQLGSLEGAYDVIKKSRLVGFWVTI 183
           L    S ++    L+ +Y    S+P VFD   +     G L  A  V +K    G  +++
Sbjct: 153 LNVTDSFVQFFDLLVYTYKDWGSDPRVFDVFFQVLVDFGLLREARRVFEKMLNYGLVLSV 212

Query: 184 HAWNNFLNQLLK-LDETDKFWNMYKEMVASGYSENVNTFNLIIYALCKDCKLLEAIYVVY 243
            + N +L +L K   +T     +++E    G   NV ++N++I+ +C+  ++ EA +++ 
Sbjct: 213 DSCNVYLTRLSKDCYKTATAIIVFREFPEVGVCWNVASYNIVIHFVCQLGRIKEAHHLLL 272

Query: 244 LMLKIEIWPNVVTFNMIIDKASKMGDMGLALKLARNTGVISGGSVSPNIVTYNCIINGFC 303
           LM      P+V++++ +++   + G++    KL     V+    + PN   Y  II   C
Sbjct: 273 LMELKGYTPDVISYSTVVNGYCRFGELDKVWKLIE---VMKRKGLKPNSYIYGSIIGLLC 332

Query: 304 KIRRLESAKNVLGEMIKLGIDFNVRTYATLIDGYARKGSLDVAFRLCDEMVEMGLIPDTV 363
           +I +L  A+    EMI+ GI  +   Y TLIDG+ ++G +  A +   EM    + PD +
Sbjct: 333 RICKLAEAEEAFSEMIRQGILPDTVVYTTLIDGFCKRGDIRAASKFFYEMHSRDITPDVL 392

Query: 364 LYNSIIYWLYMEGELEEASFLLSDMINRHILPDEVTFSILTKGLCVNGHLNKALRVHDHI 423
            Y +II      G++ EA  L  +M  + + PD VTF+ L  G C  GH+  A RVH+H 
Sbjct: 393 TYTAIISGFCQIGDMVEAGKLFHEMFCKGLEPDSVTFTELINGYCKAGHMKDAFRVHNH- 452

Query: 424 LKRNLVKDAFTHNILINYIFQTQNAAGAKQLLSSMIVRGIKPDMVTYGTLVDGCCKEGKI 483
                                             MI  G  P++VTY TL+DG CKEG +
Sbjct: 453 ----------------------------------MIQAGCSPNVVTYTTLIDGLCKEGDL 512

Query: 484 EAAVQIYDKAVKADGKSNLVVYNSILNGLCKQGSIDAAKLLVDKLQQNGF-LDTVTYNTL 543
           ++A ++  +  K   + N+  YNSI+NGLCK G+I+ A  LV + +  G   DTVTY TL
Sbjct: 513 DSANELLHEMWKIGLQPNIFTYNSIVNGLCKSGNIEEAVKLVGEFEAAGLNADTVTYTTL 572

Query: 544 LHGYCVNGKVEKAFALFLEMINVGSLVNIVTYNIMINFLCKMGLIFQTMELMRAMTSQGI 603
           +  YC +G+++KA  +  EM+  G    IVT+N+++N  C  G++    +L+  M ++GI
Sbjct: 573 MDAYCKSGEMDKAQEILKEMLGKGLQPTIVTFNVLMNGFCLHGMLEDGEKLLNWMLAKGI 632

Query: 604 VPDHITYTTLITNFVKCCGSEDVIELHDYMVLKGAVPDRQTYQSLV 644
            P+  T+ +L+  +      +    ++  M  +G  PD +TY++LV
Sbjct: 633 APNATTFNSLVKQYCIRNNLKAATAIYKDMCSRGVGPDGKTYENLV 638

BLAST of HG10021066 vs. TAIR 10
Match: AT1G05670.2 (Pentatricopeptide repeat (PPR-like) superfamily protein )

HSP 1 Score: 270.8 bits (691), Expect = 2.9e-72
Identity = 172/586 (29.35%), Postives = 296/586 (50.51%), Query Frame = 0

Query: 64  VVREFRNSPQLALEFYNWVEARDNFSHSLESCCTLVHVLVNARNFNDALSIMESL----M 123
           V+ + +   +L L+F++W  AR     +LES C ++H+ V +++   A S++ S      
Sbjct: 93  VLMKIKCDYRLVLDFFDW--ARSRRDSNLESLCIVIHLAVASKDLKVAQSLISSFWERPK 152

Query: 124 LKNGKSPLEVLGGLIDSYDICNSNPAVFDALVRTCTQLGSLEGAYDVIKKSRLVGFWVTI 183
           L    S ++    L+ +Y    S+P VFD   +     G L  A  V +K    G  +++
Sbjct: 153 LNVTDSFVQFFDLLVYTYKDWGSDPRVFDVFFQVLVDFGLLREARRVFEKMLNYGLVLSV 212

Query: 184 HAWNNFLNQLLK-LDETDKFWNMYKEMVASGYSENVNTFNLIIYALCKDCKLLEAIYVVY 243
            + N +L +L K   +T     +++E    G   NV ++N++I+ +C+  ++ EA +++ 
Sbjct: 213 DSCNVYLTRLSKDCYKTATAIIVFREFPEVGVCWNVASYNIVIHFVCQLGRIKEAHHLLL 272

Query: 244 LMLKIEIWPNVVTFNMIIDKASKMGDMGLALKLARNTGVISGGSVSPNIVTYNCIINGFC 303
           LM      P+V++++ +++   + G++    KL     V+    + PN   Y  II   C
Sbjct: 273 LMELKGYTPDVISYSTVVNGYCRFGELDKVWKLIE---VMKRKGLKPNSYIYGSIIGLLC 332

Query: 304 KIRRLESAKNVLGEMIKLGIDFNVRTYATLIDGYARKGSLDVAFRLCDEMVEMGLIPDTV 363
           +I +L  A+    EMI+ GI  +   Y TLIDG+ ++G +  A +   EM    + PD +
Sbjct: 333 RICKLAEAEEAFSEMIRQGILPDTVVYTTLIDGFCKRGDIRAASKFFYEMHSRDITPDVL 392

Query: 364 LYNSIIYWLYMEGELEEASFLLSDMINRHILPDEVTFSILTKGLCVNGHLNKALRVHDHI 423
            Y +II      G++ EA  L  +M  + + PD VTF+ L  G C  GH+  A RVH+H 
Sbjct: 393 TYTAIISGFCQIGDMVEAGKLFHEMFCKGLEPDSVTFTELINGYCKAGHMKDAFRVHNH- 452

Query: 424 LKRNLVKDAFTHNILINYIFQTQNAAGAKQLLSSMIVRGIKPDMVTYGTLVDGCCKEGKI 483
                                             MI  G  P++VTY TL+DG CKEG +
Sbjct: 453 ----------------------------------MIQAGCSPNVVTYTTLIDGLCKEGDL 512

Query: 484 EAAVQIYDKAVKADGKSNLVVYNSILNGLCKQGSIDAAKLLVDKLQQNGF-LDTVTYNTL 543
           ++A ++  +  K   + N+  YNSI+NGLCK G+I+ A  LV + +  G   DTVTY TL
Sbjct: 513 DSANELLHEMWKIGLQPNIFTYNSIVNGLCKSGNIEEAVKLVGEFEAAGLNADTVTYTTL 572

Query: 544 LHGYCVNGKVEKAFALFLEMINVGSLVNIVTYNIMINFLCKMGLIFQTMELMRAMTSQGI 603
           +  YC +G+++KA  +  EM+  G    IVT+N+++N  C  G++    +L+  M ++GI
Sbjct: 573 MDAYCKSGEMDKAQEILKEMLGKGLQPTIVTFNVLMNGFCLHGMLEDGEKLLNWMLAKGI 632

Query: 604 VPDHITYTTLITNFVKCCGSEDVIELHDYMVLKGAVPDRQTYQSLV 644
            P+  T+ +L+  +      +    ++  M  +G  PD +TY++LV
Sbjct: 633 APNATTFNSLVKQYCIRNNLKAATAIYKDMCSRGVGPDGKTYENLV 638

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038894115.10.0e+0092.34pentatricopeptide repeat-containing protein At1g11710, mitochondrial [Benincasa ... [more]
XP_022923680.10.0e+0088.29pentatricopeptide repeat-containing protein At1g11710, mitochondrial [Cucurbita ... [more]
XP_023519211.10.0e+0087.98pentatricopeptide repeat-containing protein At1g11710, mitochondrial [Cucurbita ... [more]
KAA0056855.10.0e+0089.28pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] >TYJ99358... [more]
ADN34051.10.0e+0089.28pentatricopeptide repeat-containing protein [Cucumis melo subsp. melo][more]
Match NameE-valueIdentityDescription
Q9SAA64.0e-18349.62Pentatricopeptide repeat-containing protein At1g11710, mitochondrial OS=Arabidop... [more]
Q9LFC51.7e-7728.91Pentatricopeptide repeat-containing protein At5g01110 OS=Arabidopsis thaliana OX... [more]
Q9LVQ52.2e-7228.03Pentatricopeptide repeat-containing protein At5g55840 OS=Arabidopsis thaliana OX... [more]
Q0WVK74.1e-7129.35Pentatricopeptide repeat-containing protein At1g05670, mitochondrial OS=Arabidop... [more]
Q9ZQF11.2e-7028.72Pentatricopeptide repeat-containing protein At2g15630, mitochondrial OS=Arabidop... [more]
Match NameE-valueIdentityDescription
A0A6J1ECJ90.0e+0088.29pentatricopeptide repeat-containing protein At1g11710, mitochondrial OS=Cucurbit... [more]
A0A5D3BMH20.0e+0089.28Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946... [more]
E5GC520.0e+0089.28Pentatricopeptide repeat-containing protein OS=Cucumis melo subsp. melo OX=41267... [more]
A0A1S3B3850.0e+0089.13LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At1g11710, mito... [more]
A0A6J1KKM80.0e+0087.37pentatricopeptide repeat-containing protein At1g11710, mitochondrial OS=Cucurbit... [more]
Match NameE-valueIdentityDescription
AT1G11710.12.8e-18449.62Pentatricopeptide repeat (PPR) superfamily protein [more]
AT5G01110.11.2e-7828.91Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT5G55840.11.5e-7328.03Pentatricopeptide repeat (PPR) superfamily protein [more]
AT1G05670.12.9e-7229.35Pentatricopeptide repeat (PPR-like) superfamily protein [more]
AT1G05670.22.9e-7229.35Pentatricopeptide repeat (PPR-like) superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 442..557
e-value: 1.6E-31
score: 111.9
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 353..441
e-value: 5.1E-19
score: 70.3
coord: 561..651
e-value: 1.2E-18
score: 69.1
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 277..352
e-value: 4.0E-19
score: 70.9
coord: 207..276
e-value: 1.6E-9
score: 39.5
coord: 39..206
e-value: 1.2E-11
score: 46.4
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 530..578
e-value: 3.9E-16
score: 59.0
coord: 355..403
e-value: 2.2E-12
score: 47.0
coord: 285..334
e-value: 2.5E-14
score: 53.2
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 358..392
e-value: 7.0E-4
score: 17.6
coord: 393..426
e-value: 0.0018
score: 16.3
coord: 532..564
e-value: 5.6E-9
score: 33.6
coord: 463..491
e-value: 2.3E-6
score: 25.4
coord: 428..461
e-value: 0.0014
score: 16.7
coord: 288..320
e-value: 6.5E-8
score: 30.3
coord: 567..600
e-value: 3.8E-7
score: 27.9
coord: 216..242
e-value: 0.0026
score: 15.8
coord: 498..528
e-value: 4.5E-6
score: 24.5
coord: 324..357
e-value: 9.7E-9
score: 32.9
IPR002885Pentatricopeptide repeatPFAMPF12854PPR_1coord: 456..487
e-value: 1.5E-10
score: 40.6
coord: 495..522
e-value: 4.9E-6
score: 26.2
IPR002885Pentatricopeptide repeatPFAMPF13812PPR_3coord: 588..643
e-value: 7.0E-4
score: 19.6
coord: 203..257
e-value: 1.9E-6
score: 27.8
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 600..634
score: 9.119859
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 565..599
score: 11.465577
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 530..564
score: 12.24383
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 356..390
score: 9.437737
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 391..425
score: 9.722731
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 286..320
score: 11.73961
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 496..526
score: 9.328124
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 321..355
score: 12.638436
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 426..460
score: 9.076014
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 461..495
score: 10.621557
NoneNo IPR availablePANTHERPTHR47942TETRATRICOPEPTIDE REPEAT (TPR)-LIKE SUPERFAMILY PROTEIN-RELATEDcoord: 7..650
NoneNo IPR availablePANTHERPTHR47942:SF4REPEAT-CONTAINING PROTEIN, PUTATIVE-RELATEDcoord: 7..650
NoneNo IPR availableSUPERFAMILY81901HCP-likecoord: 332..561

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
HG10021066.1HG10021066.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0015986 ATP synthesis coupled proton transport
cellular_component GO:0045261 proton-transporting ATP synthase complex, catalytic core F(1)
molecular_function GO:0005515 protein binding
molecular_function GO:0046933 proton-transporting ATP synthase activity, rotational mechanism