HG10012690 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10012690
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Pentatricopeptide repeat-containing protein
Location: Chr01: 23424910 .. 23427000 (-)
RNA-Seq Expression: HG10012690
Synteny: HG10012690
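The Location field above can be read programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual GFF-style convention, which this page does not state explicitly). The names `Location`, `parse_location` and `span_length` are hypothetical helpers introduced only for illustration.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    seqid: str   # chromosome or scaffold name, e.g. "Chr01"
    start: int   # leftmost coordinate (assumed 1-based, inclusive)
    end: int     # rightmost coordinate (assumed 1-based, inclusive)
    strand: str  # "+" or "-"


# Matches strings of the form "Chr01: 23424910 .. 23427000 (-)"
LOCATION_RE = re.compile(r"(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)")


def parse_location(text: str) -> Location:
    """Split a 'seqid: start .. end (strand)' string into its parts."""
    match = LOCATION_RE.fullmatch(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return Location(seqid, int(start), int(end), strand)


def span_length(loc: Location) -> int:
    """Length of the genomic span under the 1-based inclusive assumption."""
    return loc.end - loc.start + 1


loc = parse_location("Chr01: 23424910 .. 23427000 (-)")
length = span_length(loc)
print(loc, length)
```

Under that assumption the feature spans 2091 bases on the minus strand of Chr01.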
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCCGTTTCTCTCGCCAAACTCGCTCGCTTCACTGGTCGAATTGGCCGTATCGGTTCGTTCTTCTCTTCTTGGCCGAGCCGCCCATGCCCAAATTCTCAAAACCCTAAAAACCCCTCTTCCAGCCTTCCTCTACAACCACCTCGTGAACATGTACGCCAAACTCGACCATCTTAACTCGGCCAAACTCATCCTCGAACTCGCCCCTTGCCGCTCCGTTGTCACTTGGACCGCCCTCATCGCCGGTTCCGTCCAAAACGGTTGTTTTGCTTCCGCTCTGCTTCACTTCTCCGACATGCTAAGTGACTGTGTTCGACCCAATGACTTCACTTTCCCTTGCGTTTTCAAAGCCTCCACTGGTCTTCGCATGGCCATGACAGGCAAACAGCTACACGCACTTGCGGTTAAGGAGGGATTAATAAGTGATGTCTTCGTCGGGTGCAGTGTCTTCGACATGTACAGCAAATTGGGCTTTCTTAATGACGCATACAAGGTGTTTGATGAAATGCCTTATCGAAATCTCGAGATGTGGAATGCGTATATATCCAATTCCGTGCTCCATGGGCGACCTGAAGACTCTGCCATTGCATTTATTGAGCTACTTCGAGTTGGTGGGAAGCCAGATTCCATAACATTTTGTGCTTTTTTCAATGCGTGTTCGGACAAACTAGGCTTGGGGCCTGGGTGTCAGCTTCATGGGTTCATTATTAGATGCGGGTATGGGCAGAATGTCTCTGTTTCAAATGGGTTGATTGATTTTTATGGGAAATGTGGGGAGGTTGAATGTTCTGAGATGGTTTTTGATAGAATGGGGGAGCGGAACAGCGTATCTTGGTCCTCTTTGATAGCTGCTTACGTTCAAAACAATGAGGAAGAGAAGGCTTCCTGCTTATTCTTGCGAGCGAGGAAAGAAGATATCAAACCAACTGATTTTATGGTATCAAGTGTGCTTTGTGCCTGTGCTGGTCTTTCAGAAATCGAGTTTGGAAGGTCAGTTCAAGCACTAGCCGTTAAGGCTTGTATAGAGGATAATATCTTTGTTGGAAGTGCACTGGTTGACATGTATGGAAAATGTGGAAGTATTGATGAAGCAGAGCGAGCCTTCAACGAGATGCCAGAAAGAAACTTGGTATCTTGGAATGCTTTGTTGGGAGGATACGCACACCAAGGACACGCAGACAAGGCTGTGGCATTGCTCGAGGAGATGGTATCGATGGCAGGCATGGTGCCGAGCTATGTAAGTTTGGTCTGTGCATTATCAGCTTGCAGTAGAGCAGGAGATTTGAAGACGGGGATGCAGATTTTTGAGTCCATGAAAGCAAGGTACGGTATAGAACCAGGGCCAGAGCATTATGCTTGCTTGGTAGATTTGCTTGGACGTGCTGGAATGGTGGAATGTGCGTATGATTTTATAAAGAGCATGTCATTTCCTCCTACAATCTCAATCTGGGGTGCTCTGTTGGGGGCTTGCCGAATGCATGGGAAGCCAGAGTTGGGAAAGTTGGCCGCTGAGAAGCTGTTTGAACTTGATCCAAAAGACTCTGGAAATCACGTTGTGCTGTCCAATATGTTTGCTGCAACTGGAAGGTAACCCAATTTCAAATTTCGCAACACTCTTATTCACATATATTTTATTCGTATTCTACTATTATTCACTCGTGAGTAAAATCCAATAGGTAACTACTATATTATTAATTTTCTAAGATAAGACAAAAACTTCTTACTTCAATACTAAGTTAAATCACCATTAAATCTAAAAATTTGAACCAATGAATTGTGATATACTTCATCTTTAACATTCTTCAAAACTCTGATTTCTTAAATTCTATGGTCAATTGATTGGGGGGTTGAATTTAGGTGGGAAGAAGTGACTGTCGTACGAAATGAGATGAAAGAAGTAGGAATCAAAAAGGGAGCTGGGTTCAGTTGGATAACAGTAAACAGTAGAATCCATATATTCCAAGCGAAAGACAAAAGCCATGAGAAGGACTCTGAAATTCA
GGACATGTTGGGAAAGCTGAGGAAGGAGATGCAGGAAGCTGCTGGTTGCATTGCAGACACCAATTATGCTCTTTTTGAAGTGTCGAATTAA

mRNA sequence

ATGCCGTTTCTCTCGCCAAACTCGCTCGCTTCACTGGTCGAATTGGCCGTATCGGTTCGTTCTTCTCTTCTTGGCCGAGCCGCCCATGCCCAAATTCTCAAAACCCTAAAAACCCCTCTTCCAGCCTTCCTCTACAACCACCTCGTGAACATGTACGCCAAACTCGACCATCTTAACTCGGCCAAACTCATCCTCGAACTCGCCCCTTGCCGCTCCGTTGTCACTTGGACCGCCCTCATCGCCGGTTCCGTCCAAAACGGTTGTTTTGCTTCCGCTCTGCTTCACTTCTCCGACATGCTAAGTGACTGTGTTCGACCCAATGACTTCACTTTCCCTTGCGTTTTCAAAGCCTCCACTGGTCTTCGCATGGCCATGACAGGCAAACAGCTACACGCACTTGCGGTTAAGGAGGGATTAATAAGTGATGTCTTCGTCGGGTGCAGTGTCTTCGACATGTACAGCAAATTGGGCTTTCTTAATGACGCATACAAGGTGTTTGATGAAATGCCTTATCGAAATCTCGAGATGTGGAATGCGTATATATCCAATTCCGTGCTCCATGGGCGACCTGAAGACTCTGCCATTGCATTTATTGAGCTACTTCGAGTTGGTGGGAAGCCAGATTCCATAACATTTTGTGCTTTTTTCAATGCGTGTTCGGACAAACTAGGCTTGGGGCCTGGGTGTCAGCTTCATGGGTTCATTATTAGATGCGGGTATGGGCAGAATGTCTCTGTTTCAAATGGGTTGATTGATTTTTATGGGAAATGTGGGGAGGTTGAATGTTCTGAGATGGTTTTTGATAGAATGGGGGAGCGGAACAGCGTATCTTGGTCCTCTTTGATAGCTGCTTACGTTCAAAACAATGAGGAAGAGAAGGCTTCCTGCTTATTCTTGCGAGCGAGGAAAGAAGATATCAAACCAACTGATTTTATGGTATCAAGTGTGCTTTGTGCCTGTGCTGGTCTTTCAGAAATCGAGTTTGGAAGGTCAGTTCAAGCACTAGCCGTTAAGGCTTGTATAGAGGATAATATCTTTGTTGGAAGTGCACTGGTTGACATGTATGGAAAATGTGGAAGTATTGATGAAGCAGAGCGAGCCTTCAACGAGATGCCAGAAAGAAACTTGGTATCTTGGAATGCTTTGTTGGGAGGATACGCACACCAAGGACACGCAGACAAGGCTGTGGCATTGCTCGAGGAGATGGTATCGATGGCAGGCATGGTGCCGAGCTATGTAAGTTTGGTCTGTGCATTATCAGCTTGCAGTAGAGCAGGAGATTTGAAGACGGGGATGCAGATTTTTGAGTCCATGAAAGCAAGGTACGGTATAGAACCAGGGCCAGAGCATTATGCTTGCTTGGTAGATTTGCTTGGACGTGCTGGAATGGTGGAATGTGCGTATGATTTTATAAAGAGCATGTCATTTCCTCCTACAATCTCAATCTGGGGTGCTCTGTTGGGGGCTTGCCGAATGCATGGGAAGCCAGAGTTGGGAAAGTTGGCCGCTGAGAAGCTGTTTGAACTTGATCCAAAAGACTCTGGAAATCACGTTGTGCTGTCCAATATGTTTGCTGCAACTGGAAGGTGGGAAGAAGTGACTGTCGTACGAAATGAGATGAAAGAAGTAGGAATCAAAAAGGGAGCTGGGTTCAGTTGGATAACAGTAAACAGTAGAATCCATATATTCCAAGCGAAAGACAAAAGCCATGAGAAGGACTCTGAAATTCAGGACATGTTGGGAAAGCTGAGGAAGGAGATGCAGGAAGCTGCTGGTTGCATTGCAGACACCAATTATGCTCTTTTTGAAGTGTCGAATTAA
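The mRNA sequence appears to differ from the gene sequence only by the removal of one contiguous intronic block. The sketch below shows one way to locate such a block by comparing a genomic and a spliced sequence. It uses short made-up strings rather than the sequences on this page, and `find_single_intron` is a hypothetical helper. It assumes exactly one contiguous block was removed.

```python
def find_single_intron(genomic: str, spliced: str):
    """Return (start, end), 0-based half-open, of the one block removed
    from `genomic` to obtain `spliced`."""
    gap = len(genomic) - len(spliced)
    if gap < 0:
        raise ValueError("spliced sequence is longer than genomic sequence")

    # Walk along the longest common prefix of the two sequences.
    i = 0
    while i < len(spliced) and genomic[i] == spliced[i]:
        i += 1

    # After skipping `gap` bases, the rest of the genomic sequence must
    # equal the rest of the spliced sequence.
    if genomic[i + gap:] != spliced[i:]:
        raise ValueError("sequences differ by more than one contiguous block")
    return i, i + gap


# Toy data (not taken from this page): two exons around a GT...AG intron.
toy_genomic = "ATGGCAAG" + "GTAAGTTTCAG" + "GTGGTAA"
toy_spliced = "ATGGCAAG" + "GTGGTAA"

start, end = find_single_intron(toy_genomic, toy_spliced)
rebuilt = toy_genomic[:start] + toy_genomic[end:]
print(start, end, rebuilt == toy_spliced)
```

When bases repeat across the junction, as with the `GT` in the toy data, the reported boundaries can be shifted by a few bases relative to the annotated splice site. The removed length and the rebuilt sequence are the same either way.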

Coding sequence (CDS)

ATGCCGTTTCTCTCGCCAAACTCGCTCGCTTCACTGGTCGAATTGGCCGTATCGGTTCGTTCTTCTCTTCTTGGCCGAGCCGCCCATGCCCAAATTCTCAAAACCCTAAAAACCCCTCTTCCAGCCTTCCTCTACAACCACCTCGTGAACATGTACGCCAAACTCGACCATCTTAACTCGGCCAAACTCATCCTCGAACTCGCCCCTTGCCGCTCCGTTGTCACTTGGACCGCCCTCATCGCCGGTTCCGTCCAAAACGGTTGTTTTGCTTCCGCTCTGCTTCACTTCTCCGACATGCTAAGTGACTGTGTTCGACCCAATGACTTCACTTTCCCTTGCGTTTTCAAAGCCTCCACTGGTCTTCGCATGGCCATGACAGGCAAACAGCTACACGCACTTGCGGTTAAGGAGGGATTAATAAGTGATGTCTTCGTCGGGTGCAGTGTCTTCGACATGTACAGCAAATTGGGCTTTCTTAATGACGCATACAAGGTGTTTGATGAAATGCCTTATCGAAATCTCGAGATGTGGAATGCGTATATATCCAATTCCGTGCTCCATGGGCGACCTGAAGACTCTGCCATTGCATTTATTGAGCTACTTCGAGTTGGTGGGAAGCCAGATTCCATAACATTTTGTGCTTTTTTCAATGCGTGTTCGGACAAACTAGGCTTGGGGCCTGGGTGTCAGCTTCATGGGTTCATTATTAGATGCGGGTATGGGCAGAATGTCTCTGTTTCAAATGGGTTGATTGATTTTTATGGGAAATGTGGGGAGGTTGAATGTTCTGAGATGGTTTTTGATAGAATGGGGGAGCGGAACAGCGTATCTTGGTCCTCTTTGATAGCTGCTTACGTTCAAAACAATGAGGAAGAGAAGGCTTCCTGCTTATTCTTGCGAGCGAGGAAAGAAGATATCAAACCAACTGATTTTATGGTATCAAGTGTGCTTTGTGCCTGTGCTGGTCTTTCAGAAATCGAGTTTGGAAGGTCAGTTCAAGCACTAGCCGTTAAGGCTTGTATAGAGGATAATATCTTTGTTGGAAGTGCACTGGTTGACATGTATGGAAAATGTGGAAGTATTGATGAAGCAGAGCGAGCCTTCAACGAGATGCCAGAAAGAAACTTGGTATCTTGGAATGCTTTGTTGGGAGGATACGCACACCAAGGACACGCAGACAAGGCTGTGGCATTGCTCGAGGAGATGGTATCGATGGCAGGCATGGTGCCGAGCTATGTAAGTTTGGTCTGTGCATTATCAGCTTGCAGTAGAGCAGGAGATTTGAAGACGGGGATGCAGATTTTTGAGTCCATGAAAGCAAGGTACGGTATAGAACCAGGGCCAGAGCATTATGCTTGCTTGGTAGATTTGCTTGGACGTGCTGGAATGGTGGAATGTGCGTATGATTTTATAAAGAGCATGTCATTTCCTCCTACAATCTCAATCTGGGGTGCTCTGTTGGGGGCTTGCCGAATGCATGGGAAGCCAGAGTTGGGAAAGTTGGCCGCTGAGAAGCTGTTTGAACTTGATCCAAAAGACTCTGGAAATCACGTTGTGCTGTCCAATATGTTTGCTGCAACTGGAAGGTGGGAAGAAGTGACTGTCGTACGAAATGAGATGAAAGAAGTAGGAATCAAAAAGGGAGCTGGGTTCAGTTGGATAACAGTAAACAGTAGAATCCATATATTCCAAGCGAAAGACAAAAGCCATGAGAAGGACTCTGAAATTCAGGACATGTTGGGAAAGCTGAGGAAGGAGATGCAGGAAGCTGCTGGTTGCATTGCAGACACCAATTATGCTCTTTTTGAAGTGTCGAATTAA

Protein sequence

MPFLSPNSLASLVELAVSVRSSLLGRAAHAQILKTLKTPLPAFLYNHLVNMYAKLDHLNSAKLILELAPCRSVVTWTALIAGSVQNGCFASALLHFSDMLSDCVRPNDFTFPCVFKASTGLRMAMTGKQLHALAVKEGLISDVFVGCSVFDMYSKLGFLNDAYKVFDEMPYRNLEMWNAYISNSVLHGRPEDSAIAFIELLRVGGKPDSITFCAFFNACSDKLGLGPGCQLHGFIIRCGYGQNVSVSNGLIDFYGKCGEVECSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKASCLFLRARKEDIKPTDFMVSSVLCACAGLSEIEFGRSVQALAVKACIEDNIFVGSALVDMYGKCGSIDEAERAFNEMPERNLVSWNALLGGYAHQGHADKAVALLEEMVSMAGMVPSYVSLVCALSACSRAGDLKTGMQIFESMKARYGIEPGPEHYACLVDLLGRAGMVECAYDFIKSMSFPPTISIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFAATGRWEEVTVVRNEMKEVGIKKGAGFSWITVNSRIHIFQAKDKSHEKDSEIQDMLGKLRKEMQEAAGCIADTNYALFEVSN
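The relationship between the coding sequence and the protein sequence can be checked by translating codons. This is a minimal sketch, assuming the standard genetic code (NCBI translation table 1). The page does not state which table was used, and `translate` is a hypothetical helper. It is shown on the first and last few codons of the CDS above rather than on the full sequence.

```python
from itertools import product

# Standard genetic code (NCBI table 1) with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # "X" for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_codons = "ATGCCGTTTCTCTCGCCAAAC"  # first 21 bases of the CDS
last_codons = "GAAGTGTCGAATTAA"         # last 15 bases of the CDS, including the stop
print(translate(first_codons), translate(last_codons))
```

The two fragments translate to `MPFLSPN` and `EVSN`, matching the start and end of the protein sequence listed above.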
Homology
BLAST of HG10012690 vs. NCBI nr
Match: XP_038881355.1 (pentatricopeptide repeat-containing protein At4g14850 [Benincasa hispida] >XP_038881359.1 pentatricopeptide repeat-containing protein At4g14850 [Benincasa hispida] >XP_038881365.1 pentatricopeptide repeat-containing protein At4g14850 [Benincasa hispida])

HSP 1 Score: 1176.4 bits (3042), Expect = 0.0e+00
Identity = 583/606 (96.20%), Positives = 594/606 (98.02%), Query Frame = 0

Query: 1   MPFLSPNSLASLVELAVSVRSSLLGRAAHAQILKTLKTPLPAFLYNHLVNMYAKLDHLNS 60
           MPFLSPNSLASLVELAVSVRSSLLGRAAHAQILKTLKTPLPAFLYNHLVNMYAK DHLNS
Sbjct: 1   MPFLSPNSLASLVELAVSVRSSLLGRAAHAQILKTLKTPLPAFLYNHLVNMYAKFDHLNS 60

Query: 61  AKLILELAPCRSVVTWTALIAGSVQNGCFASALLHFSDMLSDCVRPNDFTFPCVFKASTG 120
           AKLILELAPCRSVVTWTALIAGSVQNGCFASALLHFSDMLSDCVRPNDFTFPCV KASTG
Sbjct: 61  AKLILELAPCRSVVTWTALIAGSVQNGCFASALLHFSDMLSDCVRPNDFTFPCVLKASTG 120

Query: 121 LRMAMTGKQLHALAVKEGLISDVFVGCSVFDMYSKLGFLNDAYKVFDEMPYRNLEMWNAY 180
           LRMAMTGKQLHALAVKEGLI+DVFVGCSVFDMYSKLG L+DAYK+FDEMP+RNLE  NAY
Sbjct: 121 LRMAMTGKQLHALAVKEGLINDVFVGCSVFDMYSKLGLLSDAYKMFDEMPHRNLETLNAY 180

Query: 181 ISNSVLHGRPEDSAIAFIELLRVGGKPDSITFCAFFNACSDKLGLGPGCQLHGFIIRCGY 240
           ISNSVLHGRPEDSAIAFIELLRVG KPDSITFCAFFNACSDKLGLGPGCQLHGFIIR GY
Sbjct: 181 ISNSVLHGRPEDSAIAFIELLRVGEKPDSITFCAFFNACSDKLGLGPGCQLHGFIIRSGY 240

Query: 241 GQNVSVSNGLIDFYGKCGEVECSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKASCLFLR 300
           GQNVSVSNGLIDFYGKCGEVECSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKASCLFLR
Sbjct: 241 GQNVSVSNGLIDFYGKCGEVECSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKASCLFLR 300

Query: 301 ARKEDIKPTDFMVSSVLCACAGLSEIEFGRSVQALAVKACIEDNIFVGSALVDMYGKCGS 360
           ARKEDIKPTDFMVSSVLCACAGLSEIEFGRSVQALAVKAC+E+NIFVGSALVDMYGKCGS
Sbjct: 301 ARKEDIKPTDFMVSSVLCACAGLSEIEFGRSVQALAVKACVEENIFVGSALVDMYGKCGS 360

Query: 361 IDEAERAFNEMPERNLVSWNALLGGYAHQGHADKAVALLEEMVSMAGMVPSYVSLVCALS 420
           I EAE+AFNEMPERNLVSWNALLGGYAHQGHADKAVALLEEMVS+AG+ PSYVSLVCALS
Sbjct: 361 IYEAEQAFNEMPERNLVSWNALLGGYAHQGHADKAVALLEEMVSVAGLSPSYVSLVCALS 420

Query: 421 ACSRAGDLKTGMQIFESMKARYGIEPGPEHYACLVDLLGRAGMVECAYDFIKSMSFPPTI 480
           ACSRAGDLKTGMQIFESMKARYGIEPGPEHYACLVDLLGRAGMVECAYDFIK+M FPPTI
Sbjct: 421 ACSRAGDLKTGMQIFESMKARYGIEPGPEHYACLVDLLGRAGMVECAYDFIKTMPFPPTI 480

Query: 481 SIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFAATGRWEEVTVVRNEM 540
           SIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFAATGRWEEVTVVRNEM
Sbjct: 481 SIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFAATGRWEEVTVVRNEM 540

Query: 541 KEVGIKKGAGFSWITVNSRIHIFQAKDKSHEKDSEIQDMLGKLRKEMQEAAGCIADTNYA 600
           KEVGIKKGAGFSWITVNSRIHIFQAKDKSHEKDSEIQDMLGKLRKEMQEAAG IADTNYA
Sbjct: 541 KEVGIKKGAGFSWITVNSRIHIFQAKDKSHEKDSEIQDMLGKLRKEMQEAAGFIADTNYA 600

Query: 601 LFEVSN 607
            FE+SN
Sbjct: 601 PFEMSN 606
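The percentages in an HSP summary line follow directly from the match counts. This sketch parses the summary of the hit above and recomputes both values. The regular expression and the name `parse_hsp_summary` are illustrative assumptions, not part of any BLAST tool. The pattern accepts both "Positives" and the "Postives" spelling that appears in some page exports.

```python
import re

# Matches e.g. "Identity = 583/606 (96.20%), Positives = 594/606 (98.02%)".
HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_summary(line: str) -> dict:
    """Extract match counts and recompute the identity/positive percentages."""
    match = HSP_RE.search(line)
    if match is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identity_reported": float(ident_pct),
        "identity_computed": round(100 * int(ident) / int(ident_len), 2),
        "positives_reported": float(pos_pct),
        "positives_computed": round(100 * int(pos) / int(pos_len), 2),
    }


summary = parse_hsp_summary(
    "Identity = 583/606 (96.20%), Positives = 594/606 (98.02%), Query Frame = 0"
)
print(summary)
```

The recomputed values agree with the reported ones to two decimal places.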

BLAST of HG10012690 vs. NCBI nr
Match: XP_031738596.1 (pentatricopeptide repeat-containing protein At4g14850 [Cucumis sativus] >KGN56980.1 hypothetical protein Csa_011264 [Cucumis sativus])

HSP 1 Score: 1161.0 bits (3002), Expect = 0.0e+00
Identity = 570/606 (94.06%), Positives = 585/606 (96.53%), Query Frame = 0

Query: 1   MPFLSPNSLASLVELAVSVRSSLLGRAAHAQILKTLKTPLPAFLYNHLVNMYAKLDHLNS 60
           MPFLS NSLAS+VELAVSVRSSLLGRAAHAQILKTLKTP PAFLYNHLVNMYAKLDHLNS
Sbjct: 1   MPFLSQNSLASVVELAVSVRSSLLGRAAHAQILKTLKTPFPAFLYNHLVNMYAKLDHLNS 60

Query: 61  AKLILELAPCRSVVTWTALIAGSVQNGCFASALLHFSDMLSDCVRPNDFTFPCVFKASTG 120
           AKLILELAPCRSVVTWTALIAGSVQNGCF SALLHFSDMLSDCVRPNDFTFPCV KASTG
Sbjct: 61  AKLILELAPCRSVVTWTALIAGSVQNGCFVSALLHFSDMLSDCVRPNDFTFPCVLKASTG 120

Query: 121 LRMAMTGKQLHALAVKEGLISDVFVGCSVFDMYSKLGFLNDAYKVFDEMPYRNLEMWNAY 180
           LRM  TGKQLHALAVKEGLI+DVFVGCSVFDMYSKLGFLNDAYKVFDEMP+RNLE WNAY
Sbjct: 121 LRMDTTGKQLHALAVKEGLINDVFVGCSVFDMYSKLGFLNDAYKVFDEMPHRNLETWNAY 180

Query: 181 ISNSVLHGRPEDSAIAFIELLRVGGKPDSITFCAFFNACSDKLGLGPGCQLHGFIIRCGY 240
           ISNSVLHGRPEDS IAFIELLRVGGKPDSITFCAF NACSDKLGLGPGCQLHGFIIR GY
Sbjct: 181 ISNSVLHGRPEDSVIAFIELLRVGGKPDSITFCAFLNACSDKLGLGPGCQLHGFIIRSGY 240

Query: 241 GQNVSVSNGLIDFYGKCGEVECSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKASCLFLR 300
           GQNVSVSNGLIDFYGKCGEVECSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKASCLFLR
Sbjct: 241 GQNVSVSNGLIDFYGKCGEVECSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKASCLFLR 300

Query: 301 ARKEDIKPTDFMVSSVLCACAGLSEIEFGRSVQALAVKACIEDNIFVGSALVDMYGKCGS 360
           ARKEDI+PTDFMVSSVLCACAGLSEIEFGRSVQALAVKAC+E NIFV SALVDMYGKCGS
Sbjct: 301 ARKEDIEPTDFMVSSVLCACAGLSEIEFGRSVQALAVKACVEQNIFVASALVDMYGKCGS 360

Query: 361 IDEAERAFNEMPERNLVSWNALLGGYAHQGHADKAVALLEEMVSMAGMVPSYVSLVCALS 420
           ID AE+AFN MPERNLVSWNALLGGYAHQGHA+KAVALLEEM S AG+VPSYVSL+CALS
Sbjct: 361 IDNAEQAFNAMPERNLVSWNALLGGYAHQGHANKAVALLEEMTSAAGIVPSYVSLICALS 420

Query: 421 ACSRAGDLKTGMQIFESMKARYGIEPGPEHYACLVDLLGRAGMVECAYDFIKSMSFPPTI 480
           ACSRAGDLKTGM+IFESMK RYG+EPGPEHYACLVDLLGRAGMVECAYDFIK M FPPTI
Sbjct: 421 ACSRAGDLKTGMKIFESMKERYGVEPGPEHYACLVDLLGRAGMVECAYDFIKRMPFPPTI 480

Query: 481 SIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFAATGRWEEVTVVRNEM 540
           SIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFAATGRWEEVTVVRNEM
Sbjct: 481 SIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFAATGRWEEVTVVRNEM 540

Query: 541 KEVGIKKGAGFSWITVNSRIHIFQAKDKSHEKDSEIQDMLGKLRKEMQEAAGCIADTNYA 600
           KEVGIKKGAGFSWITV+SRIH+FQAKDKSHEKD EIQD+LGKLRKEMQ+AAGCIAD NYA
Sbjct: 541 KEVGIKKGAGFSWITVDSRIHMFQAKDKSHEKDPEIQDILGKLRKEMQDAAGCIADPNYA 600

Query: 601 LFEVSN 607
           LFEVSN
Sbjct: 601 LFEVSN 606

BLAST of HG10012690 vs. NCBI nr
Match: XP_008438671.1 (PREDICTED: pentatricopeptide repeat-containing protein At4g14850 [Cucumis melo] >KAA0049360.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] >TYK17198.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 1154.4 bits (2985), Expect = 0.0e+00
Identity = 567/606 (93.56%), Positives = 583/606 (96.20%), Query Frame = 0

Query: 1   MPFLSPNSLASLVELAVSVRSSLLGRAAHAQILKTLKTPLPAFLYNHLVNMYAKLDHLNS 60
           MPFLS NSLAS+VELAVSVRSSLLGRAAHAQILKTLKTP PAFLYNHLVNMYAKLDHLNS
Sbjct: 1   MPFLSKNSLASVVELAVSVRSSLLGRAAHAQILKTLKTPFPAFLYNHLVNMYAKLDHLNS 60

Query: 61  AKLILELAPCRSVVTWTALIAGSVQNGCFASALLHFSDMLSDCVRPNDFTFPCVFKASTG 120
           AKLILELAPCRSVVTWTALIAGSVQNGCF SALLHFSDMLSDCVRPNDFTFPCV KASTG
Sbjct: 61  AKLILELAPCRSVVTWTALIAGSVQNGCFVSALLHFSDMLSDCVRPNDFTFPCVLKASTG 120

Query: 121 LRMAMTGKQLHALAVKEGLISDVFVGCSVFDMYSKLGFLNDAYKVFDEMPYRNLEMWNAY 180
           LRM MTGKQLHALAVKEGLI+DVFVGCSVFDMYSKLGFLNDAYK+FDEMP RNLE WNAY
Sbjct: 121 LRMDMTGKQLHALAVKEGLINDVFVGCSVFDMYSKLGFLNDAYKLFDEMPQRNLETWNAY 180

Query: 181 ISNSVLHGRPEDSAIAFIELLRVGGKPDSITFCAFFNACSDKLGLGPGCQLHGFIIRCGY 240
           I+NSVLHGRPEDSAIAFIELLRVG KPDSITFCAF NACSDKLGLGPGCQLHGF+IR GY
Sbjct: 181 ITNSVLHGRPEDSAIAFIELLRVGEKPDSITFCAFLNACSDKLGLGPGCQLHGFVIRSGY 240

Query: 241 GQNVSVSNGLIDFYGKCGEVECSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKASCLFLR 300
           GQNVSVSNGLIDFYGKCGEVECSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKASCLFLR
Sbjct: 241 GQNVSVSNGLIDFYGKCGEVECSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKASCLFLR 300

Query: 301 ARKEDIKPTDFMVSSVLCACAGLSEIEFGRSVQALAVKACIEDNIFVGSALVDMYGKCGS 360
           ARKEDI+PTDFMVSSVLCACAGLSEIEFGRSVQALAVKAC+E NIFV SALVDMYGKCGS
Sbjct: 301 ARKEDIEPTDFMVSSVLCACAGLSEIEFGRSVQALAVKACVEQNIFVASALVDMYGKCGS 360

Query: 361 IDEAERAFNEMPERNLVSWNALLGGYAHQGHADKAVALLEEMVSMAGMVPSYVSLVCALS 420
           ID A +AFN MPERNLVSWNALLGGYAHQGHA+KAVALLEEM S AG+VPSYVSL+CALS
Sbjct: 361 IDNAVQAFNAMPERNLVSWNALLGGYAHQGHANKAVALLEEMTSAAGIVPSYVSLICALS 420

Query: 421 ACSRAGDLKTGMQIFESMKARYGIEPGPEHYACLVDLLGRAGMVECAYDFIKSMSFPPTI 480
           ACSRAGDLKTGM+IFESMK RYG+EPGPEHYACLVDLLGRAGMVECAYDFIK M FPPTI
Sbjct: 421 ACSRAGDLKTGMKIFESMKERYGVEPGPEHYACLVDLLGRAGMVECAYDFIKRMPFPPTI 480

Query: 481 SIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFAATGRWEEVTVVRNEM 540
           SIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFAATGRWEE TVVRNEM
Sbjct: 481 SIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFAATGRWEEATVVRNEM 540

Query: 541 KEVGIKKGAGFSWITVNSRIHIFQAKDKSHEKDSEIQDMLGKLRKEMQEAAGCIADTNYA 600
           KEVGIKKGAGFSWITV+SRIHIFQAKDKSHEKD EIQ+MLGKLRKEMQ+AAGCIAD NYA
Sbjct: 541 KEVGIKKGAGFSWITVDSRIHIFQAKDKSHEKDPEIQNMLGKLRKEMQDAAGCIADPNYA 600

Query: 601 LFEVSN 607
           LFEVSN
Sbjct: 601 LFEVSN 606

BLAST of HG10012690 vs. NCBI nr
Match: KAG6582300.1 (Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1122.1 bits (2901), Expect = 0.0e+00
Identity = 550/606 (90.76%), Positives = 577/606 (95.21%), Query Frame = 0

Query: 1   MPFLSPNSLASLVELAVSVRSSLLGRAAHAQILKTLKTPLPAFLYNHLVNMYAKLDHLNS 60
           MPFLSPNSLASLVE A+S+RSSLLGR AHAQILKTLKTP PAFLYNHLVNMYAKLD LNS
Sbjct: 1   MPFLSPNSLASLVEFALSIRSSLLGRVAHAQILKTLKTPFPAFLYNHLVNMYAKLDQLNS 60

Query: 61  AKLILELAPCRSVVTWTALIAGSVQNGCFASALLHFSDMLSDCVRPNDFTFPCVFKASTG 120
           A+LILELAPCRSVVTWT+LIAGSVQNG FASALLHFSDMLSDCVRPNDFTFPCVFKASTG
Sbjct: 61  AELILELAPCRSVVTWTSLIAGSVQNGRFASALLHFSDMLSDCVRPNDFTFPCVFKASTG 120

Query: 121 LRMAMTGKQLHALAVKEGLISDVFVGCSVFDMYSKLGFLNDAYKVFDEMPYRNLEMWNAY 180
           LRMAMTGKQ+HALAVKEGLI+DVFVGCS FDMYSKLG L+DAYK+F EMP+RNLE WNAY
Sbjct: 121 LRMAMTGKQVHALAVKEGLINDVFVGCSAFDMYSKLGLLDDAYKLFVEMPHRNLETWNAY 180

Query: 181 ISNSVLHGRPEDSAIAFIELLRVGGKPDSITFCAFFNACSDKLGLGPGCQLHGFIIRCGY 240
           ISNSVLHGRPEDSAIAFIELLR GGKPDSITFCAF NACSDKLGL PGCQLHGFIIR G 
Sbjct: 181 ISNSVLHGRPEDSAIAFIELLRAGGKPDSITFCAFLNACSDKLGLEPGCQLHGFIIRSGC 240

Query: 241 GQNVSVSNGLIDFYGKCGEVECSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKASCLFLR 300
           GQNVS+SNGLIDFYGKCGEV CSE++FDRMGERNSVSWSSLIAAYVQNNEEEKA CLFLR
Sbjct: 241 GQNVSISNGLIDFYGKCGEVVCSEVIFDRMGERNSVSWSSLIAAYVQNNEEEKACCLFLR 300

Query: 301 ARKEDIKPTDFMVSSVLCACAGLSEIEFGRSVQALAVKACIEDNIFVGSALVDMYGKCGS 360
           ARKEDIKPTDFMVSSVLCA AGLSEIE GRSVQALAVKAC+++NIFVGSALVDMYGKCGS
Sbjct: 301 ARKEDIKPTDFMVSSVLCASAGLSEIELGRSVQALAVKACVDENIFVGSALVDMYGKCGS 360

Query: 361 IDEAERAFNEMPERNLVSWNALLGGYAHQGHADKAVALLEEMVSMAGMVPSYVSLVCALS 420
           ID+AE+AFNEMPERNLVSWNALLGGYAHQG+ADKAVALL++M S+ G+ PSYVSLVCALS
Sbjct: 361 IDKAEQAFNEMPERNLVSWNALLGGYAHQGYADKAVALLKDMASVEGIAPSYVSLVCALS 420

Query: 421 ACSRAGDLKTGMQIFESMKARYGIEPGPEHYACLVDLLGRAGMVECAYDFIKSMSFPPTI 480
           ACSRAGDLKTGMQIFESMKARYGIEPGPEHYACLVDL GRAGMVECAYDFI+ M FPPTI
Sbjct: 421 ACSRAGDLKTGMQIFESMKARYGIEPGPEHYACLVDLFGRAGMVECAYDFIRRMPFPPTI 480

Query: 481 SIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFAATGRWEEVTVVRNEM 540
           SIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNM AAT RWEEVTV+RNEM
Sbjct: 481 SIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMLAATSRWEEVTVIRNEM 540

Query: 541 KEVGIKKGAGFSWITVNSRIHIFQAKDKSHEKDSEIQDMLGKLRKEMQEAAGCIADTNYA 600
           KEVGIKKGAGFSWITVNSRIHIFQAKDKS+EKDSE+QDMLGKLRKEMQEAAG IAD NYA
Sbjct: 541 KEVGIKKGAGFSWITVNSRIHIFQAKDKSYEKDSELQDMLGKLRKEMQEAAGSIADANYA 600

Query: 601 LFEVSN 607
           LFE S+
Sbjct: 601 LFEASS 606

BLAST of HG10012690 vs. NCBI nr
Match: XP_022956070.1 (pentatricopeptide repeat-containing protein At4g14850 [Cucurbita moschata])

HSP 1 Score: 1120.1 bits (2896), Expect = 0.0e+00
Identity = 549/603 (91.04%), Positives = 575/603 (95.36%), Query Frame = 0

Query: 1   MPFLSPNSLASLVELAVSVRSSLLGRAAHAQILKTLKTPLPAFLYNHLVNMYAKLDHLNS 60
           MPFLSPNSLASLVE A+S+RSSLLGR AHAQILKTLKTP PAFLYNHLVNMYAKLD LNS
Sbjct: 1   MPFLSPNSLASLVEFALSIRSSLLGRVAHAQILKTLKTPFPAFLYNHLVNMYAKLDQLNS 60

Query: 61  AKLILELAPCRSVVTWTALIAGSVQNGCFASALLHFSDMLSDCVRPNDFTFPCVFKASTG 120
           A+LILELAPCRSVVTWT+LIAGSVQNG FASALLHFSDMLSDCVRPNDFTFPCVFKASTG
Sbjct: 61  AELILELAPCRSVVTWTSLIAGSVQNGRFASALLHFSDMLSDCVRPNDFTFPCVFKASTG 120

Query: 121 LRMAMTGKQLHALAVKEGLISDVFVGCSVFDMYSKLGFLNDAYKVFDEMPYRNLEMWNAY 180
           LRMAMTGKQ+HALAVKEGLI+DVFVGCS FDMYSKLG L+DAYK+F EMP+RNLE WNAY
Sbjct: 121 LRMAMTGKQVHALAVKEGLINDVFVGCSAFDMYSKLGLLDDAYKLFVEMPHRNLETWNAY 180

Query: 181 ISNSVLHGRPEDSAIAFIELLRVGGKPDSITFCAFFNACSDKLGLGPGCQLHGFIIRCGY 240
           ISNSVLHGRPEDSAIAFIELLR GGKPDSITFCAF NACSDKLGL PGCQLHGFIIR G 
Sbjct: 181 ISNSVLHGRPEDSAIAFIELLRAGGKPDSITFCAFLNACSDKLGLEPGCQLHGFIIRSGC 240

Query: 241 GQNVSVSNGLIDFYGKCGEVECSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKASCLFLR 300
           GQNVS+SNGLIDFYGKCGEV CSE++FDRMGERNSVSWSSLIAAYVQNNEEEKA CLFLR
Sbjct: 241 GQNVSISNGLIDFYGKCGEVVCSEVIFDRMGERNSVSWSSLIAAYVQNNEEEKACCLFLR 300

Query: 301 ARKEDIKPTDFMVSSVLCACAGLSEIEFGRSVQALAVKACIEDNIFVGSALVDMYGKCGS 360
           ARKEDIKPTDFMVSSVLCA AGLSEIE GRSVQALAVKAC+++NIFVGSALVDMYGKCGS
Sbjct: 301 ARKEDIKPTDFMVSSVLCASAGLSEIELGRSVQALAVKACVDENIFVGSALVDMYGKCGS 360

Query: 361 IDEAERAFNEMPERNLVSWNALLGGYAHQGHADKAVALLEEMVSMAGMVPSYVSLVCALS 420
           ID+AE+AFNEMPERNLVSWNALLGGYAHQG+ADKAVALL++M S+ G+ PSYVSLVCALS
Sbjct: 361 IDKAEQAFNEMPERNLVSWNALLGGYAHQGYADKAVALLKDMASVEGIAPSYVSLVCALS 420

Query: 421 ACSRAGDLKTGMQIFESMKARYGIEPGPEHYACLVDLLGRAGMVECAYDFIKSMSFPPTI 480
           ACSRAGDLKTGMQIFESMKARYGIEPGPEHYACLVDL GRAGMVECAYDFI+ M FPPTI
Sbjct: 421 ACSRAGDLKTGMQIFESMKARYGIEPGPEHYACLVDLFGRAGMVECAYDFIRRMPFPPTI 480

Query: 481 SIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFAATGRWEEVTVVRNEM 540
           SIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNM AAT RWEEVTV+RNEM
Sbjct: 481 SIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMLAATSRWEEVTVIRNEM 540

Query: 541 KEVGIKKGAGFSWITVNSRIHIFQAKDKSHEKDSEIQDMLGKLRKEMQEAAGCIADTNYA 600
           KEVGIKKGAGFSWITVNSRIHIFQAKDKS+EKDSE+QDMLGKLRKEMQEAAG IAD NYA
Sbjct: 541 KEVGIKKGAGFSWITVNSRIHIFQAKDKSYEKDSELQDMLGKLRKEMQEAAGSIADANYA 600

Query: 601 LFE 604
           LFE
Sbjct: 601 LFE 603

BLAST of HG10012690 vs. ExPASy Swiss-Prot
Match: Q0WSH6 (Pentatricopeptide repeat-containing protein At4g14850 OS=Arabidopsis thaliana OX=3702 GN=LOI1 PE=1 SV=1)

HSP 1 Score: 763.1 bits (1969), Expect = 2.4e-219
Identity = 372/605 (61.49%), Positives = 468/605 (77.36%), Query Frame = 0

Query: 1   MPFLSPNSLASLVELAVSVRSSLLGRAAHAQILKTLKTPLPAFLYNHLVNMYAKLDHLNS 60
           M  LS ++L  L++ A+S  S  LGR  HA+I+KTL +P P FL N+L+NMY+KLDH  S
Sbjct: 1   MSLLSADALGLLLKNAISASSMRLGRVVHARIVKTLDSPPPPFLANYLINMYSKLDHPES 60

Query: 61  AKLILELAPCRSVVTWTALIAGSVQNGCFASALLHFSDMLSDCVRPNDFTFPCVFKASTG 120
           A+L+L L P R+VV+WT+LI+G  QNG F++AL+ F +M  + V PNDFTFPC FKA   
Sbjct: 61  ARLVLRLTPARNVVSWTSLISGLAQNGHFSTALVEFFEMRREGVVPNDFTFPCAFKAVAS 120

Query: 121 LRMAMTGKQLHALAVKEGLISDVFVGCSVFDMYSKLGFLNDAYKVFDEMPYRNLEMWNAY 180
           LR+ +TGKQ+HALAVK G I DVFVGCS FDMY K    +DA K+FDE+P RNLE WNA+
Sbjct: 121 LRLPVTGKQIHALAVKCGRILDVFVGCSAFDMYCKTRLRDDARKLFDEIPERNLETWNAF 180

Query: 181 ISNSVLHGRPEDSAIAFIELLRVGGKPDSITFCAFFNACSDKLGLGPGCQLHGFIIRCGY 240
           ISNSV  GRP ++  AFIE  R+ G P+SITFCAF NACSD L L  G QLHG ++R G+
Sbjct: 181 ISNSVTDGRPREAIEAFIEFRRIDGHPNSITFCAFLNACSDWLHLNLGMQLHGLVLRSGF 240

Query: 241 GQNVSVSNGLIDFYGKCGEVECSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKASCLFLR 300
             +VSV NGLIDFYGKC ++  SE++F  MG +N+VSW SL+AAYVQN+E+EKAS L+LR
Sbjct: 241 DTDVSVCNGLIDFYGKCKQIRSSEIIFTEMGTKNAVSWCSLVAAYVQNHEDEKASVLYLR 300

Query: 301 ARKEDIKPTDFMVSSVLCACAGLSEIEFGRSVQALAVKACIEDNIFVGSALVDMYGKCGS 360
           +RK+ ++ +DFM+SSVL ACAG++ +E GRS+ A AVKAC+E  IFVGSALVDMYGKCG 
Sbjct: 301 SRKDIVETSDFMISSVLSACAGMAGLELGRSIHAHAVKACVERTIFVGSALVDMYGKCGC 360

Query: 361 IDEAERAFNEMPERNLVSWNALLGGYAHQGHADKAVALLEEMVSM-AGMVPSYVSLVCAL 420
           I+++E+AF+EMPE+NLV+ N+L+GGYAHQG  D A+AL EEM     G  P+Y++ V  L
Sbjct: 361 IEDSEQAFDEMPEKNLVTRNSLIGGYAHQGQVDMALALFEEMAPRGCGPTPNYMTFVSLL 420

Query: 421 SACSRAGDLKTGMQIFESMKARYGIEPGPEHYACLVDLLGRAGMVECAYDFIKSMSFPPT 480
           SACSRAG ++ GM+IF+SM++ YGIEPG EHY+C+VD+LGRAGMVE AY+FIK M   PT
Sbjct: 421 SACSRAGAVENGMKIFDSMRSTYGIEPGAEHYSCIVDMLGRAGMVERAYEFIKKMPIQPT 480

Query: 481 ISIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFAATGRWEEVTVVRNE 540
           IS+WGAL  ACRMHGKP+LG LAAE LF+LDPKDSGNHV+LSN FAA GRW E   VR E
Sbjct: 481 ISVWGALQNACRMHGKPQLGLLAAENLFKLDPKDSGNHVLLSNTFAAAGRWAEANTVREE 540

Query: 541 MKEVGIKKGAGFSWITVNSRIHIFQAKDKSHEKDSEIQDMLGKLRKEMQEAAGCIADTNY 600
           +K VGIKKGAG+SWITV +++H FQAKD+SH  + EIQ  L KLR EM EAAG   D   
Sbjct: 541 LKGVGIKKGAGYSWITVKNQVHAFQAKDRSHILNKEIQTTLAKLRNEM-EAAGYKPDLKL 600

Query: 601 ALFEV 605
           +L+++
Sbjct: 601 SLYDL 604

BLAST of HG10012690 vs. ExPASy Swiss-Prot
Match: Q9FIB2 (Putative pentatricopeptide repeat-containing protein At5g09950 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H35 PE=3 SV=1)

HSP 1 Score: 413.7 bits (1062), Expect = 3.6e-114
Identity = 230/611 (37.64%), Positives = 349/611 (57.12%), Query Frame = 0

Query: 4   LSPNS----LASLVELAVSVRSSL-LGRAAHAQILKTLKTPLPAFLYNHLVNMYAKLDHL 63
           +SP S    L+S  E +++    L  GR  H  ++ T        + N LVNMYAK   +
Sbjct: 306 VSPESYVILLSSFPEYSLAEEVGLKKGREVHGHVITTGLVDFMVGIGNGLVNMYAKCGSI 365

Query: 64  NSAKLILELAPCRSVVTWTALIAGSVQNGCFASALLHFSDMLSDCVRPNDFTFPCVFKAS 123
             A+ +      +  V+W ++I G  QNGCF  A+  +  M    + P  FT      + 
Sbjct: 366 ADARRVFYFMTDKDSVSWNSMITGLDQNGCFIEAVERYKSMRRHDILPGSFTLISSLSSC 425

Query: 124 TGLRMAMTGKQLHALAVKEGLISDVFVGCSVFDMYSKLGFLNDAYKVFDEMPYRNLEMWN 183
             L+ A  G+Q+H  ++K G+  +V V  ++  +Y++ G+LN+  K+F  MP  +   WN
Sbjct: 426 ASLKWAKLGQQIHGESLKLGIDLNVSVSNALMTLYAETGYLNECRKIFSSMPEHDQVSWN 485

Query: 184 AYISNSVLHGRP-EDSAIAFIELLRVGGKPDSITFCAFFNACSDKLGLGP-GCQLHGFII 243
           + I       R   ++ + F+   R G K + ITF +  +A S  L  G  G Q+HG  +
Sbjct: 486 SIIGALARSERSLPEAVVCFLNAQRAGQKLNRITFSSVLSAVS-SLSFGELGKQIHGLAL 545

Query: 244 RCGYGQNVSVSNGLIDFYGKCGEVECSEMVFDRMGE-RNSVSWSSLIAAYVQNNEEEKAS 303
           +       +  N LI  YGKCGE++  E +F RM E R++V+W+S+I+ Y+ N    KA 
Sbjct: 546 KNNIADEATTENALIACYGKCGEMDGCEKIFSRMAERRDNVTWNSMISGYIHNELLAKAL 605

Query: 304 CLFLRARKEDIKPTDFMVSSVLCACAGLSEIEFGRSVQALAVKACIEDNIFVGSALVDMY 363
            L     +   +   FM ++VL A A ++ +E G  V A +V+AC+E ++ VGSALVDMY
Sbjct: 606 DLVWFMLQTGQRLDSFMYATVLSAFASVATLERGMEVHACSVRACLESDVVVGSALVDMY 665

Query: 364 GKCGSIDEAERAFNEMPERNLVSWNALLGGYAHQGHADKAVALLEEMVSMAGMVPSYVSL 423
            KCG +D A R FN MP RN  SWN+++ GYA  G  ++A+ L E M       P +V+ 
Sbjct: 666 SKCGRLDYALRFFNTMPVRNSYSWNSMISGYARHGQGEEALKLFETMKLDGQTPPDHVTF 725

Query: 424 VCALSACSRAGDLKTGMQIFESMKARYGIEPGPEHYACLVDLLGRAGMVECAYDFIKSMS 483
           V  LSACS AG L+ G + FESM   YG+ P  EH++C+ D+LGRAG ++   DFI+ M 
Sbjct: 726 VGVLSACSHAGLLEEGFKHFESMSDSYGLAPRIEHFSCMADVLGRAGELDKLEDFIEKMP 785

Query: 484 FPPTISIWGALLGA-CRMHG-KPELGKLAAEKLFELDPKDSGNHVVLSNMFAATGRWEEV 543
             P + IW  +LGA CR +G K ELGK AAE LF+L+P+++ N+V+L NM+AA GRWE++
Sbjct: 786 MKPNVLIWRTVLGACCRANGRKAELGKKAAEMLFQLEPENAVNYVLLGNMYAAGGRWEDL 845

Query: 544 TVVRNEMKEVGIKKGAGFSWITVNSRIHIFQAKDKSHEKDSEIQDMLGKLRKEMQEAAGC 603
              R +MK+  +KK AG+SW+T+   +H+F A DKSH     I   L +L ++M++ AG 
Sbjct: 846 VKARKKMKDADVKKEAGYSWVTMKDGVHMFVAGDKSHPDADVIYKKLKELNRKMRD-AGY 905

Query: 604 IADTNYALFEV 605
           +  T +AL+++
Sbjct: 906 VPQTGFALYDL 914

BLAST of HG10012690 vs. ExPASy Swiss-Prot
Match: Q9SIT7 (Pentatricopeptide repeat-containing protein At2g13600 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E76 PE=3 SV=1)

HSP 1 Score: 398.3 bits (1022), Expect = 1.6e-109
Identity = 209/584 (35.79%), Positives = 326/584 (55.82%), Query Frame = 0

Query: 43  FLYNHLVNMYAKLDHLNSAKLILELAPCRSVVTWTALIAGSVQNGCFASALLHFSDMLSD 102
           + +N +V    KL  L+ A  +    P R   TW ++++G  Q+     AL +F+ M  +
Sbjct: 87  YTWNSVVTGLTKLGFLDEADSLFRSMPERDQCTWNSMVSGFAQHDRCEEALCYFAMMHKE 146

Query: 103 CVRPNDFTFPCVFKASTGLRMAMTGKQLHALAVKEGLISDVFVGCSVFDMYSKLGFLNDA 162
               N+++F  V  A +GL     G Q+H+L  K   +SDV++G ++ DMYSK G +NDA
Sbjct: 147 GFVLNEYSFASVLSACSGLNDMNKGVQVHSLIAKSPFLSDVYIGSALVDMYSKCGNVNDA 206

Query: 163 YKVFDEMPYRNLEMWNAYISNSVLHGRPEDSAIAFIELLRVGGKPDSITFCAFFNACSDK 222
            +VFDEM  RN+  WN+ I+    +G   ++   F  +L    +PD +T  +  +AC+  
Sbjct: 207 QRVFDEMGDRNVVSWNSLITCFEQNGPAVEALDVFQMMLESRVEPDEVTLASVISACASL 266

Query: 223 LGLGPGCQLHGFIIRCGYGQN-VSVSNGLIDFYGKCGEVECSEMVFD------------- 282
             +  G ++HG +++    +N + +SN  +D Y KC  ++ +  +FD             
Sbjct: 267 SAIKVGQEVHGRVVKNDKLRNDIILSNAFVDMYAKCSRIKEARFIFDSMPIRNVIAETSM 326

Query: 283 ------------------RMGERNSVSWSSLIAAYVQNNEEEKASCLFLRARKEDIKPTD 342
                             +M ERN VSW++LIA Y QN E E+A  LF   ++E + PT 
Sbjct: 327 ISGYAMAASTKAARLMFTKMAERNVVSWNALIAGYTQNGENEEALSLFCLLKRESVCPTH 386

Query: 343 FMVSSVLCACAGLSEIEFGRSVQALAVK------ACIEDNIFVGSALVDMYGKCGSIDEA 402
           +  +++L ACA L+E+  G       +K      +  ED+IFVG++L+DMY KCG ++E 
Sbjct: 387 YSFANILKACADLAELHLGMQAHVHVLKHGFKFQSGEEDDIFVGNSLIDMYVKCGCVEEG 446

Query: 403 ERAFNEMPERNLVSWNALLGGYAHQGHADKAVALLEEMVSMAGMVPSYVSLVCALSACSR 462
              F +M ER+ VSWNA++ G+A  G+ ++A+ L  EM+  +G  P +++++  LSAC  
Sbjct: 447 YLVFRKMMERDCVSWNAMIIGFAQNGYGNEALELFREMLE-SGEKPDHITMIGVLSACGH 506

Query: 463 AGDLKTGMQIFESMKARYGIEPGPEHYACLVDLLGRAGMVECAYDFIKSMSFPPTISIWG 522
           AG ++ G   F SM   +G+ P  +HY C+VDLLGRAG +E A   I+ M   P   IWG
Sbjct: 507 AGFVEEGRHYFSSMTRDFGVAPLRDHYTCMVDLLGRAGFLEEAKSMIEEMPMQPDSVIWG 566

Query: 523 ALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFAATGRWEEVTVVRNEMKEVG 582
           +LL AC++H    LGK  AEKL E++P +SG +V+LSNM+A  G+WE+V  VR  M++ G
Sbjct: 567 SLLAACKVHRNITLGKYVAEKLLEVEPSNSGPYVLLSNMYAELGKWEDVMNVRKSMRKEG 626

Query: 583 IKKGAGFSWITVNSRIHIFQAKDKSHEKDSEIQDMLGKLRKEMQ 589
           + K  G SWI +    H+F  KDKSH +  +I  +L  L  EM+
Sbjct: 627 VTKQPGCSWIKIQGHDHVFMVKDKSHPRKKQIHSLLDILIAEMR 669

BLAST of HG10012690 vs. ExPASy Swiss-Prot
Match: Q3E6Q1 (Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H40 PE=2 SV=1)

HSP 1 Score: 382.9 bits (982), Expect = 6.8e-105
Identity = 205/578 (35.47%), Positives = 321/578 (55.54%), Query Frame = 0

Query: 24  LGRAAHAQILKTLKTPLPAFLYNHLVNMYAKLDHLNSAKLILELAPCRSVVTWTALIAGS 83
           +G+  H  ++K+    L  F    L NMYAK   +N A+ + +  P R +V+W  ++AG 
Sbjct: 153 VGKEIHGLLVKS-GFSLDLFAMTGLENMYAKCRQVNEARKVFDRMPERDLVSWNTIVAGY 212

Query: 84  VQNGCFASALLHFSDMLSDCVRPNDFTFPCVFKASTGLRMAMTGKQLHALAVKEGLISDV 143
            QNG    AL     M  + ++P+  T   V  A + LR+   GK++H  A++ G  S V
Sbjct: 213 SQNGMARMALEMVKSMCEENLKPSFITIVSVLPAVSALRLISVGKEIHGYAMRSGFDSLV 272

Query: 144 FVGCSVFDMYSKLGFLNDAYKVFDEMPYRNLEMWNAYISNSVLHGRPEDSAIAFIELLRV 203
            +  ++ DMY+K G L  A ++FD M  RN+  WN+ I   V +  P+++ + F ++L  
Sbjct: 273 NISTALVDMYAKCGSLETARQLFDGMLERNVVSWNSMIDAYVQNENPKEAMLIFQKMLDE 332

Query: 204 GGKPDSITFCAFFNACSDKLGLGPGCQLHGFIIRCGYGQNVSVSNGLIDFYGKCGEVECS 263
           G KP  ++     +AC+D   L  G  +H   +  G  +NVSV N LI  Y KC EV+ +
Sbjct: 333 GVKPTDVSVMGALHACADLGDLERGRFIHKLSVELGLDRNVSVVNSLISMYCKCKEVDTA 392

Query: 264 EMVFDRMGERNSVSWSSLIAAYVQNNEEEKASCLFLRARKEDIKPTDFMVSSVLCACAGL 323
             +F ++  R  VSW+++I  + QN     A   F + R   +KP  F   SV+ A A L
Sbjct: 393 ASMFGKLQSRTLVSWNAMILGFAQNGRPIDALNYFSQMRSRTVKPDTFTYVSVITAIAEL 452

Query: 324 SEIEFGRSVQALAVKACIEDNIFVGSALVDMYGKCGSIDEAERAFNEMPERNLVSWNALL 383
           S     + +  + +++C++ N+FV +ALVDMY KCG+I  A   F+ M ER++ +WNA++
Sbjct: 453 SITHHAKWIHGVVMRSCLDKNVFVTTALVDMYAKCGAIMIARLIFDMMSERHVTTWNAMI 512

Query: 384 GGYAHQGHADKAVALLEEMVSMAGMVPSYVSLVCALSACSRAGDLKTGMQIFESMKARYG 443
            GY   G    A+ L EEM     + P+ V+ +  +SACS +G ++ G++ F  MK  Y 
Sbjct: 513 DGYGTHGFGKAALELFEEM-QKGTIKPNGVTFLSVISACSHSGLVEAGLKCFYMMKENYS 572

Query: 444 IEPGPEHYACLVDLLGRAGMVECAYDFIKSMSFPPTISIWGALLGACRMHGKPELGKLAA 503
           IE   +HY  +VDLLGRAG +  A+DFI  M   P ++++GA+LGAC++H      + AA
Sbjct: 573 IELSMDHYGAMVDLLGRAGRLNEAWDFIMQMPVKPAVNVYGAMLGACQIHKNVNFAEKAA 632

Query: 504 EKLFELDPKDSGNHVVLSNMFAATGRWEEVTVVRNEMKEVGIKKGAGFSWITVNSRIHIF 563
           E+LFEL+P D G HV+L+N++ A   WE+V  VR  M   G++K  G S + + + +H F
Sbjct: 633 ERLFELNPDDGGYHVLLANIYRAASMWEKVGQVRVSMLRQGLRKTPGCSMVEIKNEVHSF 692

Query: 564 QAKDKSHEKDSEIQDMLGKLRKEMQEAAGCIADTNYAL 602
            +   +H    +I   L KL   ++E AG + DTN  L
Sbjct: 693 FSGSTAHPDSKKIYAFLEKLICHIKE-AGYVPDTNLVL 727

BLAST of HG10012690 vs. ExPASy Swiss-Prot
Match: Q9SMZ2 (Pentatricopeptide repeat-containing protein At4g33170 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H53 PE=3 SV=1)

HSP 1 Score: 376.7 bits (966), Expect = 4.9e-103
Identity = 206/598 (34.45%), Positives = 339/598 (56.69%), Query Frame = 0

Query: 12  LVELAVSVRSSLLGRAAHAQILKTLKTPLPAFLYNHLVNMYAKLDHLNSAKLILELAPCR 71
           ++  AV V S  LG+  H   LK L   L   + N L+NMY KL     A+ + +    R
Sbjct: 321 MLATAVKVDSLALGQQVHCMALK-LGLDLMLTVSNSLINMYCKLRKFGFARTVFDNMSER 380

Query: 72  SVVTWTALIAGSVQNGCFASALLHFSDMLSDCVRPNDFTFPCVFKASTGLRMAMT-GKQL 131
            +++W ++IAG  QNG    A+  F  +L   ++P+ +T   V KA++ L   ++  KQ+
Sbjct: 381 DLISWNSVIAGIAQNGLEVEAVCLFMQLLRCGLKPDQYTMTSVLKAASSLPEGLSLSKQV 440

Query: 132 HALAVKEGLISDVFVGCSVFDMYSKLGFLNDAYKVFDEMPYRNLEMWNAYISNSVLHGRP 191
           H  A+K   +SD FV  ++ D YS+   + +A  +F+   + +L  WNA ++        
Sbjct: 441 HVHAIKINNVSDSFVSTALIDAYSRNRCMKEAEILFERHNF-DLVAWNAMMAGYTQSHDG 500

Query: 192 EDSAIAFIELLRVGGKPDSITFCAFFNACSDKLGLGPGCQLHGFIIRCGYGQNVSVSNGL 251
             +   F  + + G + D  T    F  C     +  G Q+H + I+ GY  ++ VS+G+
Sbjct: 501 HKTLKLFALMHKQGERSDDFTLATVFKTCGFLFAINQGKQVHAYAIKSGYDLDLWVSSGI 560

Query: 252 IDFYGKCGEVECSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKASCLFLRARKEDIKPTD 311
           +D Y KCG++  ++  FD +   + V+W+++I+  ++N EEE+A  +F + R   + P +
Sbjct: 561 LDMYVKCGDMSAAQFAFDSIPVPDDVAWTTMISGCIENGEEERAFHVFSQMRLMGVLPDE 620

Query: 312 FMVSSVLCACAGLSEIEFGRSVQALAVKACIEDNIFVGSALVDMYGKCGSIDEAERAFNE 371
           F ++++  A + L+ +E GR + A A+K    ++ FVG++LVDMY KCGSID+A   F  
Sbjct: 621 FTIATLAKASSCLTALEQGRQIHANALKLNCTNDPFVGTSLVDMYAKCGSIDDAYCLFKR 680

Query: 372 MPERNLVSWNALLGGYAHQGHADKAVALLEEMVSMAGMVPSYVSLVCALSACSRAGDLKT 431
           +   N+ +WNA+L G A  G   + + L ++M S+ G+ P  V+ +  LSACS +G +  
Sbjct: 681 IEMMNITAWNAMLVGLAQHGEGKETLQLFKQMKSL-GIKPDKVTFIGVLSACSHSGLVSE 740

Query: 432 GMQIFESMKARYGIEPGPEHYACLVDLLGRAGMVECAYDFIKSMSFPPTISIWGALLGAC 491
             +   SM   YGI+P  EHY+CL D LGRAG+V+ A + I+SMS   + S++  LL AC
Sbjct: 741 AYKHMRSMHGDYGIKPEIEHYSCLADALGRAGLVKQAENLIESMSMEASASMYRTLLAAC 800

Query: 492 RMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFAATGRWEEVTVVRNEMKEVGIKKGAG 551
           R+ G  E GK  A KL EL+P DS  +V+LSNM+AA  +W+E+ + R  MK   +KK  G
Sbjct: 801 RVQGDTETGKRVATKLLELEPLDSSAYVLLSNMYAAASKWDEMKLARTMMKGHKVKKDPG 860

Query: 552 FSWITVNSRIHIFQAKDKSHEKDS----EIQDMLGKLRKEMQEAAGCIADTNYALFEV 605
           FSWI V ++IHIF   D+S+ +      +++DM+  +++E     G + +T++ L +V
Sbjct: 861 FSWIEVKNKIHIFVVDDRSNRQTELIYRKVKDMIRDIKQE-----GYVPETDFTLVDV 910

BLAST of HG10012690 vs. ExPASy TrEMBL
Match: A0A0A0L4T8 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G146650 PE=4 SV=1)

HSP 1 Score: 1161.0 bits (3002), Expect = 0.0e+00
Identity = 570/606 (94.06%), Positives = 585/606 (96.53%), Query Frame = 0

Query: 1   MPFLSPNSLASLVELAVSVRSSLLGRAAHAQILKTLKTPLPAFLYNHLVNMYAKLDHLNS 60
           MPFLS NSLAS+VELAVSVRSSLLGRAAHAQILKTLKTP PAFLYNHLVNMYAKLDHLNS
Sbjct: 1   MPFLSQNSLASVVELAVSVRSSLLGRAAHAQILKTLKTPFPAFLYNHLVNMYAKLDHLNS 60

Query: 61  AKLILELAPCRSVVTWTALIAGSVQNGCFASALLHFSDMLSDCVRPNDFTFPCVFKASTG 120
           AKLILELAPCRSVVTWTALIAGSVQNGCF SALLHFSDMLSDCVRPNDFTFPCV KASTG
Sbjct: 61  AKLILELAPCRSVVTWTALIAGSVQNGCFVSALLHFSDMLSDCVRPNDFTFPCVLKASTG 120

Query: 121 LRMAMTGKQLHALAVKEGLISDVFVGCSVFDMYSKLGFLNDAYKVFDEMPYRNLEMWNAY 180
           LRM  TGKQLHALAVKEGLI+DVFVGCSVFDMYSKLGFLNDAYKVFDEMP+RNLE WNAY
Sbjct: 121 LRMDTTGKQLHALAVKEGLINDVFVGCSVFDMYSKLGFLNDAYKVFDEMPHRNLETWNAY 180

Query: 181 ISNSVLHGRPEDSAIAFIELLRVGGKPDSITFCAFFNACSDKLGLGPGCQLHGFIIRCGY 240
           ISNSVLHGRPEDS IAFIELLRVGGKPDSITFCAF NACSDKLGLGPGCQLHGFIIR GY
Sbjct: 181 ISNSVLHGRPEDSVIAFIELLRVGGKPDSITFCAFLNACSDKLGLGPGCQLHGFIIRSGY 240

Query: 241 GQNVSVSNGLIDFYGKCGEVECSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKASCLFLR 300
           GQNVSVSNGLIDFYGKCGEVECSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKASCLFLR
Sbjct: 241 GQNVSVSNGLIDFYGKCGEVECSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKASCLFLR 300

Query: 301 ARKEDIKPTDFMVSSVLCACAGLSEIEFGRSVQALAVKACIEDNIFVGSALVDMYGKCGS 360
           ARKEDI+PTDFMVSSVLCACAGLSEIEFGRSVQALAVKAC+E NIFV SALVDMYGKCGS
Sbjct: 301 ARKEDIEPTDFMVSSVLCACAGLSEIEFGRSVQALAVKACVEQNIFVASALVDMYGKCGS 360

Query: 361 IDEAERAFNEMPERNLVSWNALLGGYAHQGHADKAVALLEEMVSMAGMVPSYVSLVCALS 420
           ID AE+AFN MPERNLVSWNALLGGYAHQGHA+KAVALLEEM S AG+VPSYVSL+CALS
Sbjct: 361 IDNAEQAFNAMPERNLVSWNALLGGYAHQGHANKAVALLEEMTSAAGIVPSYVSLICALS 420

Query: 421 ACSRAGDLKTGMQIFESMKARYGIEPGPEHYACLVDLLGRAGMVECAYDFIKSMSFPPTI 480
           ACSRAGDLKTGM+IFESMK RYG+EPGPEHYACLVDLLGRAGMVECAYDFIK M FPPTI
Sbjct: 421 ACSRAGDLKTGMKIFESMKERYGVEPGPEHYACLVDLLGRAGMVECAYDFIKRMPFPPTI 480

Query: 481 SIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFAATGRWEEVTVVRNEM 540
           SIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFAATGRWEEVTVVRNEM
Sbjct: 481 SIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFAATGRWEEVTVVRNEM 540

Query: 541 KEVGIKKGAGFSWITVNSRIHIFQAKDKSHEKDSEIQDMLGKLRKEMQEAAGCIADTNYA 600
           KEVGIKKGAGFSWITV+SRIH+FQAKDKSHEKD EIQD+LGKLRKEMQ+AAGCIAD NYA
Sbjct: 541 KEVGIKKGAGFSWITVDSRIHMFQAKDKSHEKDPEIQDILGKLRKEMQDAAGCIADPNYA 600

Query: 601 LFEVSN 607
           LFEVSN
Sbjct: 601 LFEVSN 606

BLAST of HG10012690 vs. ExPASy TrEMBL
Match: A0A5A7U206 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold434G00440 PE=4 SV=1)

HSP 1 Score: 1154.4 bits (2985), Expect = 0.0e+00
Identity = 567/606 (93.56%), Positives = 583/606 (96.20%), Query Frame = 0

Query: 1   MPFLSPNSLASLVELAVSVRSSLLGRAAHAQILKTLKTPLPAFLYNHLVNMYAKLDHLNS 60
           MPFLS NSLAS+VELAVSVRSSLLGRAAHAQILKTLKTP PAFLYNHLVNMYAKLDHLNS
Sbjct: 1   MPFLSKNSLASVVELAVSVRSSLLGRAAHAQILKTLKTPFPAFLYNHLVNMYAKLDHLNS 60

Query: 61  AKLILELAPCRSVVTWTALIAGSVQNGCFASALLHFSDMLSDCVRPNDFTFPCVFKASTG 120
           AKLILELAPCRSVVTWTALIAGSVQNGCF SALLHFSDMLSDCVRPNDFTFPCV KASTG
Sbjct: 61  AKLILELAPCRSVVTWTALIAGSVQNGCFVSALLHFSDMLSDCVRPNDFTFPCVLKASTG 120

Query: 121 LRMAMTGKQLHALAVKEGLISDVFVGCSVFDMYSKLGFLNDAYKVFDEMPYRNLEMWNAY 180
           LRM MTGKQLHALAVKEGLI+DVFVGCSVFDMYSKLGFLNDAYK+FDEMP RNLE WNAY
Sbjct: 121 LRMDMTGKQLHALAVKEGLINDVFVGCSVFDMYSKLGFLNDAYKLFDEMPQRNLETWNAY 180

Query: 181 ISNSVLHGRPEDSAIAFIELLRVGGKPDSITFCAFFNACSDKLGLGPGCQLHGFIIRCGY 240
           I+NSVLHGRPEDSAIAFIELLRVG KPDSITFCAF NACSDKLGLGPGCQLHGF+IR GY
Sbjct: 181 ITNSVLHGRPEDSAIAFIELLRVGEKPDSITFCAFLNACSDKLGLGPGCQLHGFVIRSGY 240

Query: 241 GQNVSVSNGLIDFYGKCGEVECSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKASCLFLR 300
           GQNVSVSNGLIDFYGKCGEVECSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKASCLFLR
Sbjct: 241 GQNVSVSNGLIDFYGKCGEVECSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKASCLFLR 300

Query: 301 ARKEDIKPTDFMVSSVLCACAGLSEIEFGRSVQALAVKACIEDNIFVGSALVDMYGKCGS 360
           ARKEDI+PTDFMVSSVLCACAGLSEIEFGRSVQALAVKAC+E NIFV SALVDMYGKCGS
Sbjct: 301 ARKEDIEPTDFMVSSVLCACAGLSEIEFGRSVQALAVKACVEQNIFVASALVDMYGKCGS 360

Query: 361 IDEAERAFNEMPERNLVSWNALLGGYAHQGHADKAVALLEEMVSMAGMVPSYVSLVCALS 420
           ID A +AFN MPERNLVSWNALLGGYAHQGHA+KAVALLEEM S AG+VPSYVSL+CALS
Sbjct: 361 IDNAVQAFNAMPERNLVSWNALLGGYAHQGHANKAVALLEEMTSAAGIVPSYVSLICALS 420

Query: 421 ACSRAGDLKTGMQIFESMKARYGIEPGPEHYACLVDLLGRAGMVECAYDFIKSMSFPPTI 480
           ACSRAGDLKTGM+IFESMK RYG+EPGPEHYACLVDLLGRAGMVECAYDFIK M FPPTI
Sbjct: 421 ACSRAGDLKTGMKIFESMKERYGVEPGPEHYACLVDLLGRAGMVECAYDFIKRMPFPPTI 480

Query: 481 SIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFAATGRWEEVTVVRNEM 540
           SIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFAATGRWEE TVVRNEM
Sbjct: 481 SIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFAATGRWEEATVVRNEM 540

Query: 541 KEVGIKKGAGFSWITVNSRIHIFQAKDKSHEKDSEIQDMLGKLRKEMQEAAGCIADTNYA 600
           KEVGIKKGAGFSWITV+SRIHIFQAKDKSHEKD EIQ+MLGKLRKEMQ+AAGCIAD NYA
Sbjct: 541 KEVGIKKGAGFSWITVDSRIHIFQAKDKSHEKDPEIQNMLGKLRKEMQDAAGCIADPNYA 600

Query: 601 LFEVSN 607
           LFEVSN
Sbjct: 601 LFEVSN 606

BLAST of HG10012690 vs. ExPASy TrEMBL
Match: A0A1S3AXN0 (pentatricopeptide repeat-containing protein At4g14850 OS=Cucumis melo OX=3656 GN=LOC103483708 PE=4 SV=1)

HSP 1 Score: 1154.4 bits (2985), Expect = 0.0e+00
Identity = 567/606 (93.56%), Positives = 583/606 (96.20%), Query Frame = 0

Query: 1   MPFLSPNSLASLVELAVSVRSSLLGRAAHAQILKTLKTPLPAFLYNHLVNMYAKLDHLNS 60
           MPFLS NSLAS+VELAVSVRSSLLGRAAHAQILKTLKTP PAFLYNHLVNMYAKLDHLNS
Sbjct: 1   MPFLSKNSLASVVELAVSVRSSLLGRAAHAQILKTLKTPFPAFLYNHLVNMYAKLDHLNS 60

Query: 61  AKLILELAPCRSVVTWTALIAGSVQNGCFASALLHFSDMLSDCVRPNDFTFPCVFKASTG 120
           AKLILELAPCRSVVTWTALIAGSVQNGCF SALLHFSDMLSDCVRPNDFTFPCV KASTG
Sbjct: 61  AKLILELAPCRSVVTWTALIAGSVQNGCFVSALLHFSDMLSDCVRPNDFTFPCVLKASTG 120

Query: 121 LRMAMTGKQLHALAVKEGLISDVFVGCSVFDMYSKLGFLNDAYKVFDEMPYRNLEMWNAY 180
           LRM MTGKQLHALAVKEGLI+DVFVGCSVFDMYSKLGFLNDAYK+FDEMP RNLE WNAY
Sbjct: 121 LRMDMTGKQLHALAVKEGLINDVFVGCSVFDMYSKLGFLNDAYKLFDEMPQRNLETWNAY 180

Query: 181 ISNSVLHGRPEDSAIAFIELLRVGGKPDSITFCAFFNACSDKLGLGPGCQLHGFIIRCGY 240
           I+NSVLHGRPEDSAIAFIELLRVG KPDSITFCAF NACSDKLGLGPGCQLHGF+IR GY
Sbjct: 181 ITNSVLHGRPEDSAIAFIELLRVGEKPDSITFCAFLNACSDKLGLGPGCQLHGFVIRSGY 240

Query: 241 GQNVSVSNGLIDFYGKCGEVECSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKASCLFLR 300
           GQNVSVSNGLIDFYGKCGEVECSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKASCLFLR
Sbjct: 241 GQNVSVSNGLIDFYGKCGEVECSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKASCLFLR 300

Query: 301 ARKEDIKPTDFMVSSVLCACAGLSEIEFGRSVQALAVKACIEDNIFVGSALVDMYGKCGS 360
           ARKEDI+PTDFMVSSVLCACAGLSEIEFGRSVQALAVKAC+E NIFV SALVDMYGKCGS
Sbjct: 301 ARKEDIEPTDFMVSSVLCACAGLSEIEFGRSVQALAVKACVEQNIFVASALVDMYGKCGS 360

Query: 361 IDEAERAFNEMPERNLVSWNALLGGYAHQGHADKAVALLEEMVSMAGMVPSYVSLVCALS 420
           ID A +AFN MPERNLVSWNALLGGYAHQGHA+KAVALLEEM S AG+VPSYVSL+CALS
Sbjct: 361 IDNAVQAFNAMPERNLVSWNALLGGYAHQGHANKAVALLEEMTSAAGIVPSYVSLICALS 420

Query: 421 ACSRAGDLKTGMQIFESMKARYGIEPGPEHYACLVDLLGRAGMVECAYDFIKSMSFPPTI 480
           ACSRAGDLKTGM+IFESMK RYG+EPGPEHYACLVDLLGRAGMVECAYDFIK M FPPTI
Sbjct: 421 ACSRAGDLKTGMKIFESMKERYGVEPGPEHYACLVDLLGRAGMVECAYDFIKRMPFPPTI 480

Query: 481 SIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFAATGRWEEVTVVRNEM 540
           SIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFAATGRWEE TVVRNEM
Sbjct: 481 SIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFAATGRWEEATVVRNEM 540

Query: 541 KEVGIKKGAGFSWITVNSRIHIFQAKDKSHEKDSEIQDMLGKLRKEMQEAAGCIADTNYA 600
           KEVGIKKGAGFSWITV+SRIHIFQAKDKSHEKD EIQ+MLGKLRKEMQ+AAGCIAD NYA
Sbjct: 541 KEVGIKKGAGFSWITVDSRIHIFQAKDKSHEKDPEIQNMLGKLRKEMQDAAGCIADPNYA 600

Query: 601 LFEVSN 607
           LFEVSN
Sbjct: 601 LFEVSN 606

BLAST of HG10012690 vs. ExPASy TrEMBL
Match: A0A6J1GWT9 (pentatricopeptide repeat-containing protein At4g14850 OS=Cucurbita moschata OX=3662 GN=LOC111457875 PE=4 SV=1)

HSP 1 Score: 1120.1 bits (2896), Expect = 0.0e+00
Identity = 549/603 (91.04%), Positives = 575/603 (95.36%), Query Frame = 0

Query: 1   MPFLSPNSLASLVELAVSVRSSLLGRAAHAQILKTLKTPLPAFLYNHLVNMYAKLDHLNS 60
           MPFLSPNSLASLVE A+S+RSSLLGR AHAQILKTLKTP PAFLYNHLVNMYAKLD LNS
Sbjct: 1   MPFLSPNSLASLVEFALSIRSSLLGRVAHAQILKTLKTPFPAFLYNHLVNMYAKLDQLNS 60

Query: 61  AKLILELAPCRSVVTWTALIAGSVQNGCFASALLHFSDMLSDCVRPNDFTFPCVFKASTG 120
           A+LILELAPCRSVVTWT+LIAGSVQNG FASALLHFSDMLSDCVRPNDFTFPCVFKASTG
Sbjct: 61  AELILELAPCRSVVTWTSLIAGSVQNGRFASALLHFSDMLSDCVRPNDFTFPCVFKASTG 120

Query: 121 LRMAMTGKQLHALAVKEGLISDVFVGCSVFDMYSKLGFLNDAYKVFDEMPYRNLEMWNAY 180
           LRMAMTGKQ+HALAVKEGLI+DVFVGCS FDMYSKLG L+DAYK+F EMP+RNLE WNAY
Sbjct: 121 LRMAMTGKQVHALAVKEGLINDVFVGCSAFDMYSKLGLLDDAYKLFVEMPHRNLETWNAY 180

Query: 181 ISNSVLHGRPEDSAIAFIELLRVGGKPDSITFCAFFNACSDKLGLGPGCQLHGFIIRCGY 240
           ISNSVLHGRPEDSAIAFIELLR GGKPDSITFCAF NACSDKLGL PGCQLHGFIIR G 
Sbjct: 181 ISNSVLHGRPEDSAIAFIELLRAGGKPDSITFCAFLNACSDKLGLEPGCQLHGFIIRSGC 240

Query: 241 GQNVSVSNGLIDFYGKCGEVECSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKASCLFLR 300
           GQNVS+SNGLIDFYGKCGEV CSE++FDRMGERNSVSWSSLIAAYVQNNEEEKA CLFLR
Sbjct: 241 GQNVSISNGLIDFYGKCGEVVCSEVIFDRMGERNSVSWSSLIAAYVQNNEEEKACCLFLR 300

Query: 301 ARKEDIKPTDFMVSSVLCACAGLSEIEFGRSVQALAVKACIEDNIFVGSALVDMYGKCGS 360
           ARKEDIKPTDFMVSSVLCA AGLSEIE GRSVQALAVKAC+++NIFVGSALVDMYGKCGS
Sbjct: 301 ARKEDIKPTDFMVSSVLCASAGLSEIELGRSVQALAVKACVDENIFVGSALVDMYGKCGS 360

Query: 361 IDEAERAFNEMPERNLVSWNALLGGYAHQGHADKAVALLEEMVSMAGMVPSYVSLVCALS 420
           ID+AE+AFNEMPERNLVSWNALLGGYAHQG+ADKAVALL++M S+ G+ PSYVSLVCALS
Sbjct: 361 IDKAEQAFNEMPERNLVSWNALLGGYAHQGYADKAVALLKDMASVEGIAPSYVSLVCALS 420

Query: 421 ACSRAGDLKTGMQIFESMKARYGIEPGPEHYACLVDLLGRAGMVECAYDFIKSMSFPPTI 480
           ACSRAGDLKTGMQIFESMKARYGIEPGPEHYACLVDL GRAGMVECAYDFI+ M FPPTI
Sbjct: 421 ACSRAGDLKTGMQIFESMKARYGIEPGPEHYACLVDLFGRAGMVECAYDFIRRMPFPPTI 480

Query: 481 SIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFAATGRWEEVTVVRNEM 540
           SIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNM AAT RWEEVTV+RNEM
Sbjct: 481 SIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMLAATSRWEEVTVIRNEM 540

Query: 541 KEVGIKKGAGFSWITVNSRIHIFQAKDKSHEKDSEIQDMLGKLRKEMQEAAGCIADTNYA 600
           KEVGIKKGAGFSWITVNSRIHIFQAKDKS+EKDSE+QDMLGKLRKEMQEAAG IAD NYA
Sbjct: 541 KEVGIKKGAGFSWITVNSRIHIFQAKDKSYEKDSELQDMLGKLRKEMQEAAGSIADANYA 600

Query: 601 LFE 604
           LFE
Sbjct: 601 LFE 603

BLAST of HG10012690 vs. ExPASy TrEMBL
Match: A0A6J1IQR6 (pentatricopeptide repeat-containing protein At4g14850 OS=Cucurbita maxima OX=3661 GN=LOC111479151 PE=4 SV=1)

HSP 1 Score: 1113.6 bits (2879), Expect = 0.0e+00
Identity = 549/603 (91.04%), Positives = 570/603 (94.53%), Query Frame = 0

Query: 1   MPFLSPNSLASLVELAVSVRSSLLGRAAHAQILKTLKTPLPAFLYNHLVNMYAKLDHLNS 60
           MPFLSPNSLASLVELA+S+RSSLLGR AHAQILKTLKTP PAFLYNHLVNMYAKLD LNS
Sbjct: 1   MPFLSPNSLASLVELALSIRSSLLGRVAHAQILKTLKTPFPAFLYNHLVNMYAKLDQLNS 60

Query: 61  AKLILELAPCRSVVTWTALIAGSVQNGCFASALLHFSDMLSDCVRPNDFTFPCVFKASTG 120
           A+LILELAPCRSVVTWT+LIAGSVQNG F+SALLHFSDMLSDCVRPNDFTFPCV KASTG
Sbjct: 61  AELILELAPCRSVVTWTSLIAGSVQNGRFSSALLHFSDMLSDCVRPNDFTFPCVLKASTG 120

Query: 121 LRMAMTGKQLHALAVKEGLISDVFVGCSVFDMYSKLGFLNDAYKVFDEMPYRNLEMWNAY 180
           LRMAMTGKQLHALAVKEGLI+DVFVGCS FDMYSKLG L+DAYK+F EMP+RNLE WNAY
Sbjct: 121 LRMAMTGKQLHALAVKEGLINDVFVGCSAFDMYSKLGLLDDAYKLFVEMPHRNLETWNAY 180

Query: 181 ISNSVLHGRPEDSAIAFIELLRVGGKPDSITFCAFFNACSDKLGLGPGCQLHGFIIRCGY 240
           ISNSVLHGRPEDSAIAFIELLR GGKPDSITFCAF NACSDKLGL PGCQLHGFIIR G 
Sbjct: 181 ISNSVLHGRPEDSAIAFIELLRAGGKPDSITFCAFLNACSDKLGLEPGCQLHGFIIRSGC 240

Query: 241 GQNVSVSNGLIDFYGKCGEVECSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKASCLFLR 300
           GQNVS+SNGLIDFYGKCGEV CSE++FDRMGERNSVSWSSLIAAYVQNNEEEKA CLFLR
Sbjct: 241 GQNVSISNGLIDFYGKCGEVICSEVIFDRMGERNSVSWSSLIAAYVQNNEEEKACCLFLR 300

Query: 301 ARKEDIKPTDFMVSSVLCACAGLSEIEFGRSVQALAVKACIEDNIFVGSALVDMYGKCGS 360
           ARKE IKPTDFMVSSVLCA AGLSEIE GRSVQALAVKAC+E+NIFVGSALVDMYGKCGS
Sbjct: 301 ARKEGIKPTDFMVSSVLCASAGLSEIELGRSVQALAVKACVEENIFVGSALVDMYGKCGS 360

Query: 361 IDEAERAFNEMPERNLVSWNALLGGYAHQGHADKAVALLEEMVSMAGMVPSYVSLVCALS 420
           IDEAERAFNEMPERNLVSWN+LLGGYAHQG ADKAVALLEEM S  G+ PSYVSLVCALS
Sbjct: 361 IDEAERAFNEMPERNLVSWNSLLGGYAHQGCADKAVALLEEMASADGIAPSYVSLVCALS 420

Query: 421 ACSRAGDLKTGMQIFESMKARYGIEPGPEHYACLVDLLGRAGMVECAYDFIKSMSFPPTI 480
           ACSRAGDLKTGMQIFESMKARYGIEPGPEHYACLVDL GRAGMVECAYDFI+ M FPPTI
Sbjct: 421 ACSRAGDLKTGMQIFESMKARYGIEPGPEHYACLVDLFGRAGMVECAYDFIRRMPFPPTI 480

Query: 481 SIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFAATGRWEEVTVVRNEM 540
           SIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNM AAT RWEEVTV+RNEM
Sbjct: 481 SIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMLAATSRWEEVTVIRNEM 540

Query: 541 KEVGIKKGAGFSWITVNSRIHIFQAKDKSHEKDSEIQDMLGKLRKEMQEAAGCIADTNYA 600
           KEVGIKKGAGFSWITVN RIHIFQAKDKS+EKDSE+QDMLG LRKEMQEAAG IA+ NYA
Sbjct: 541 KEVGIKKGAGFSWITVNRRIHIFQAKDKSYEKDSELQDMLGNLRKEMQEAAGSIAEANYA 600

Query: 601 LFE 604
           LFE
Sbjct: 601 LFE 603

BLAST of HG10012690 vs. TAIR 10
Match: AT4G14850.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 763.1 bits (1969), Expect = 1.7e-220
Identity = 372/605 (61.49%), Positives = 468/605 (77.36%), Query Frame = 0

Query: 1   MPFLSPNSLASLVELAVSVRSSLLGRAAHAQILKTLKTPLPAFLYNHLVNMYAKLDHLNS 60
           M  LS ++L  L++ A+S  S  LGR  HA+I+KTL +P P FL N+L+NMY+KLDH  S
Sbjct: 1   MSLLSADALGLLLKNAISASSMRLGRVVHARIVKTLDSPPPPFLANYLINMYSKLDHPES 60

Query: 61  AKLILELAPCRSVVTWTALIAGSVQNGCFASALLHFSDMLSDCVRPNDFTFPCVFKASTG 120
           A+L+L L P R+VV+WT+LI+G  QNG F++AL+ F +M  + V PNDFTFPC FKA   
Sbjct: 61  ARLVLRLTPARNVVSWTSLISGLAQNGHFSTALVEFFEMRREGVVPNDFTFPCAFKAVAS 120

Query: 121 LRMAMTGKQLHALAVKEGLISDVFVGCSVFDMYSKLGFLNDAYKVFDEMPYRNLEMWNAY 180
           LR+ +TGKQ+HALAVK G I DVFVGCS FDMY K    +DA K+FDE+P RNLE WNA+
Sbjct: 121 LRLPVTGKQIHALAVKCGRILDVFVGCSAFDMYCKTRLRDDARKLFDEIPERNLETWNAF 180

Query: 181 ISNSVLHGRPEDSAIAFIELLRVGGKPDSITFCAFFNACSDKLGLGPGCQLHGFIIRCGY 240
           ISNSV  GRP ++  AFIE  R+ G P+SITFCAF NACSD L L  G QLHG ++R G+
Sbjct: 181 ISNSVTDGRPREAIEAFIEFRRIDGHPNSITFCAFLNACSDWLHLNLGMQLHGLVLRSGF 240

Query: 241 GQNVSVSNGLIDFYGKCGEVECSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKASCLFLR 300
             +VSV NGLIDFYGKC ++  SE++F  MG +N+VSW SL+AAYVQN+E+EKAS L+LR
Sbjct: 241 DTDVSVCNGLIDFYGKCKQIRSSEIIFTEMGTKNAVSWCSLVAAYVQNHEDEKASVLYLR 300

Query: 301 ARKEDIKPTDFMVSSVLCACAGLSEIEFGRSVQALAVKACIEDNIFVGSALVDMYGKCGS 360
           +RK+ ++ +DFM+SSVL ACAG++ +E GRS+ A AVKAC+E  IFVGSALVDMYGKCG 
Sbjct: 301 SRKDIVETSDFMISSVLSACAGMAGLELGRSIHAHAVKACVERTIFVGSALVDMYGKCGC 360

Query: 361 IDEAERAFNEMPERNLVSWNALLGGYAHQGHADKAVALLEEMVSM-AGMVPSYVSLVCAL 420
           I+++E+AF+EMPE+NLV+ N+L+GGYAHQG  D A+AL EEM     G  P+Y++ V  L
Sbjct: 361 IEDSEQAFDEMPEKNLVTRNSLIGGYAHQGQVDMALALFEEMAPRGCGPTPNYMTFVSLL 420

Query: 421 SACSRAGDLKTGMQIFESMKARYGIEPGPEHYACLVDLLGRAGMVECAYDFIKSMSFPPT 480
           SACSRAG ++ GM+IF+SM++ YGIEPG EHY+C+VD+LGRAGMVE AY+FIK M   PT
Sbjct: 421 SACSRAGAVENGMKIFDSMRSTYGIEPGAEHYSCIVDMLGRAGMVERAYEFIKKMPIQPT 480

Query: 481 ISIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFAATGRWEEVTVVRNE 540
           IS+WGAL  ACRMHGKP+LG LAAE LF+LDPKDSGNHV+LSN FAA GRW E   VR E
Sbjct: 481 ISVWGALQNACRMHGKPQLGLLAAENLFKLDPKDSGNHVLLSNTFAAAGRWAEANTVREE 540

Query: 541 MKEVGIKKGAGFSWITVNSRIHIFQAKDKSHEKDSEIQDMLGKLRKEMQEAAGCIADTNY 600
           +K VGIKKGAG+SWITV +++H FQAKD+SH  + EIQ  L KLR EM EAAG   D   
Sbjct: 541 LKGVGIKKGAGYSWITVKNQVHAFQAKDRSHILNKEIQTTLAKLRNEM-EAAGYKPDLKL 600

Query: 601 ALFEV 605
           +L+++
Sbjct: 601 SLYDL 604

BLAST of HG10012690 vs. TAIR 10
Match: AT5G09950.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 413.7 bits (1062), Expect = 2.6e-115
Identity = 230/611 (37.64%), Positives = 349/611 (57.12%), Query Frame = 0

Query: 4   LSPNS----LASLVELAVSVRSSL-LGRAAHAQILKTLKTPLPAFLYNHLVNMYAKLDHL 63
           +SP S    L+S  E +++    L  GR  H  ++ T        + N LVNMYAK   +
Sbjct: 306 VSPESYVILLSSFPEYSLAEEVGLKKGREVHGHVITTGLVDFMVGIGNGLVNMYAKCGSI 365

Query: 64  NSAKLILELAPCRSVVTWTALIAGSVQNGCFASALLHFSDMLSDCVRPNDFTFPCVFKAS 123
             A+ +      +  V+W ++I G  QNGCF  A+  +  M    + P  FT      + 
Sbjct: 366 ADARRVFYFMTDKDSVSWNSMITGLDQNGCFIEAVERYKSMRRHDILPGSFTLISSLSSC 425

Query: 124 TGLRMAMTGKQLHALAVKEGLISDVFVGCSVFDMYSKLGFLNDAYKVFDEMPYRNLEMWN 183
             L+ A  G+Q+H  ++K G+  +V V  ++  +Y++ G+LN+  K+F  MP  +   WN
Sbjct: 426 ASLKWAKLGQQIHGESLKLGIDLNVSVSNALMTLYAETGYLNECRKIFSSMPEHDQVSWN 485

Query: 184 AYISNSVLHGRP-EDSAIAFIELLRVGGKPDSITFCAFFNACSDKLGLGP-GCQLHGFII 243
           + I       R   ++ + F+   R G K + ITF +  +A S  L  G  G Q+HG  +
Sbjct: 486 SIIGALARSERSLPEAVVCFLNAQRAGQKLNRITFSSVLSAVS-SLSFGELGKQIHGLAL 545

Query: 244 RCGYGQNVSVSNGLIDFYGKCGEVECSEMVFDRMGE-RNSVSWSSLIAAYVQNNEEEKAS 303
           +       +  N LI  YGKCGE++  E +F RM E R++V+W+S+I+ Y+ N    KA 
Sbjct: 546 KNNIADEATTENALIACYGKCGEMDGCEKIFSRMAERRDNVTWNSMISGYIHNELLAKAL 605

Query: 304 CLFLRARKEDIKPTDFMVSSVLCACAGLSEIEFGRSVQALAVKACIEDNIFVGSALVDMY 363
            L     +   +   FM ++VL A A ++ +E G  V A +V+AC+E ++ VGSALVDMY
Sbjct: 606 DLVWFMLQTGQRLDSFMYATVLSAFASVATLERGMEVHACSVRACLESDVVVGSALVDMY 665

Query: 364 GKCGSIDEAERAFNEMPERNLVSWNALLGGYAHQGHADKAVALLEEMVSMAGMVPSYVSL 423
            KCG +D A R FN MP RN  SWN+++ GYA  G  ++A+ L E M       P +V+ 
Sbjct: 666 SKCGRLDYALRFFNTMPVRNSYSWNSMISGYARHGQGEEALKLFETMKLDGQTPPDHVTF 725

Query: 424 VCALSACSRAGDLKTGMQIFESMKARYGIEPGPEHYACLVDLLGRAGMVECAYDFIKSMS 483
           V  LSACS AG L+ G + FESM   YG+ P  EH++C+ D+LGRAG ++   DFI+ M 
Sbjct: 726 VGVLSACSHAGLLEEGFKHFESMSDSYGLAPRIEHFSCMADVLGRAGELDKLEDFIEKMP 785

Query: 484 FPPTISIWGALLGA-CRMHG-KPELGKLAAEKLFELDPKDSGNHVVLSNMFAATGRWEEV 543
             P + IW  +LGA CR +G K ELGK AAE LF+L+P+++ N+V+L NM+AA GRWE++
Sbjct: 786 MKPNVLIWRTVLGACCRANGRKAELGKKAAEMLFQLEPENAVNYVLLGNMYAAGGRWEDL 845

Query: 544 TVVRNEMKEVGIKKGAGFSWITVNSRIHIFQAKDKSHEKDSEIQDMLGKLRKEMQEAAGC 603
              R +MK+  +KK AG+SW+T+   +H+F A DKSH     I   L +L ++M++ AG 
Sbjct: 846 VKARKKMKDADVKKEAGYSWVTMKDGVHMFVAGDKSHPDADVIYKKLKELNRKMRD-AGY 905

Query: 604 IADTNYALFEV 605
           +  T +AL+++
Sbjct: 906 VPQTGFALYDL 914

BLAST of HG10012690 vs. TAIR 10
Match: AT2G13600.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 398.3 bits (1022), Expect = 1.1e-110
Identity = 209/584 (35.79%), Positives = 326/584 (55.82%), Query Frame = 0

Query: 43  FLYNHLVNMYAKLDHLNSAKLILELAPCRSVVTWTALIAGSVQNGCFASALLHFSDMLSD 102
           + +N +V    KL  L+ A  +    P R   TW ++++G  Q+     AL +F+ M  +
Sbjct: 87  YTWNSVVTGLTKLGFLDEADSLFRSMPERDQCTWNSMVSGFAQHDRCEEALCYFAMMHKE 146

Query: 103 CVRPNDFTFPCVFKASTGLRMAMTGKQLHALAVKEGLISDVFVGCSVFDMYSKLGFLNDA 162
               N+++F  V  A +GL     G Q+H+L  K   +SDV++G ++ DMYSK G +NDA
Sbjct: 147 GFVLNEYSFASVLSACSGLNDMNKGVQVHSLIAKSPFLSDVYIGSALVDMYSKCGNVNDA 206

Query: 163 YKVFDEMPYRNLEMWNAYISNSVLHGRPEDSAIAFIELLRVGGKPDSITFCAFFNACSDK 222
            +VFDEM  RN+  WN+ I+    +G   ++   F  +L    +PD +T  +  +AC+  
Sbjct: 207 QRVFDEMGDRNVVSWNSLITCFEQNGPAVEALDVFQMMLESRVEPDEVTLASVISACASL 266

Query: 223 LGLGPGCQLHGFIIRCGYGQN-VSVSNGLIDFYGKCGEVECSEMVFD------------- 282
             +  G ++HG +++    +N + +SN  +D Y KC  ++ +  +FD             
Sbjct: 267 SAIKVGQEVHGRVVKNDKLRNDIILSNAFVDMYAKCSRIKEARFIFDSMPIRNVIAETSM 326

Query: 283 ------------------RMGERNSVSWSSLIAAYVQNNEEEKASCLFLRARKEDIKPTD 342
                             +M ERN VSW++LIA Y QN E E+A  LF   ++E + PT 
Sbjct: 327 ISGYAMAASTKAARLMFTKMAERNVVSWNALIAGYTQNGENEEALSLFCLLKRESVCPTH 386

Query: 343 FMVSSVLCACAGLSEIEFGRSVQALAVK------ACIEDNIFVGSALVDMYGKCGSIDEA 402
           +  +++L ACA L+E+  G       +K      +  ED+IFVG++L+DMY KCG ++E 
Sbjct: 387 YSFANILKACADLAELHLGMQAHVHVLKHGFKFQSGEEDDIFVGNSLIDMYVKCGCVEEG 446

Query: 403 ERAFNEMPERNLVSWNALLGGYAHQGHADKAVALLEEMVSMAGMVPSYVSLVCALSACSR 462
              F +M ER+ VSWNA++ G+A  G+ ++A+ L  EM+  +G  P +++++  LSAC  
Sbjct: 447 YLVFRKMMERDCVSWNAMIIGFAQNGYGNEALELFREMLE-SGEKPDHITMIGVLSACGH 506

Query: 463 AGDLKTGMQIFESMKARYGIEPGPEHYACLVDLLGRAGMVECAYDFIKSMSFPPTISIWG 522
           AG ++ G   F SM   +G+ P  +HY C+VDLLGRAG +E A   I+ M   P   IWG
Sbjct: 507 AGFVEEGRHYFSSMTRDFGVAPLRDHYTCMVDLLGRAGFLEEAKSMIEEMPMQPDSVIWG 566

Query: 523 ALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFAATGRWEEVTVVRNEMKEVG 582
           +LL AC++H    LGK  AEKL E++P +SG +V+LSNM+A  G+WE+V  VR  M++ G
Sbjct: 567 SLLAACKVHRNITLGKYVAEKLLEVEPSNSGPYVLLSNMYAELGKWEDVMNVRKSMRKEG 626

Query: 583 IKKGAGFSWITVNSRIHIFQAKDKSHEKDSEIQDMLGKLRKEMQ 589
           + K  G SWI +    H+F  KDKSH +  +I  +L  L  EM+
Sbjct: 627 VTKQPGCSWIKIQGHDHVFMVKDKSHPRKKQIHSLLDILIAEMR 669

BLAST of HG10012690 vs. TAIR 10
Match: AT1G11290.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 382.9 bits (982), Expect = 4.9e-106
Identity = 205/578 (35.47%), Positives = 321/578 (55.54%), Query Frame = 0

Query: 24  LGRAAHAQILKTLKTPLPAFLYNHLVNMYAKLDHLNSAKLILELAPCRSVVTWTALIAGS 83
           +G+  H  ++K+    L  F    L NMYAK   +N A+ + +  P R +V+W  ++AG 
Sbjct: 153 VGKEIHGLLVKS-GFSLDLFAMTGLENMYAKCRQVNEARKVFDRMPERDLVSWNTIVAGY 212

Query: 84  VQNGCFASALLHFSDMLSDCVRPNDFTFPCVFKASTGLRMAMTGKQLHALAVKEGLISDV 143
            QNG    AL     M  + ++P+  T   V  A + LR+   GK++H  A++ G  S V
Sbjct: 213 SQNGMARMALEMVKSMCEENLKPSFITIVSVLPAVSALRLISVGKEIHGYAMRSGFDSLV 272

Query: 144 FVGCSVFDMYSKLGFLNDAYKVFDEMPYRNLEMWNAYISNSVLHGRPEDSAIAFIELLRV 203
            +  ++ DMY+K G L  A ++FD M  RN+  WN+ I   V +  P+++ + F ++L  
Sbjct: 273 NISTALVDMYAKCGSLETARQLFDGMLERNVVSWNSMIDAYVQNENPKEAMLIFQKMLDE 332

Query: 204 GGKPDSITFCAFFNACSDKLGLGPGCQLHGFIIRCGYGQNVSVSNGLIDFYGKCGEVECS 263
           G KP  ++     +AC+D   L  G  +H   +  G  +NVSV N LI  Y KC EV+ +
Sbjct: 333 GVKPTDVSVMGALHACADLGDLERGRFIHKLSVELGLDRNVSVVNSLISMYCKCKEVDTA 392

Query: 264 EMVFDRMGERNSVSWSSLIAAYVQNNEEEKASCLFLRARKEDIKPTDFMVSSVLCACAGL 323
             +F ++  R  VSW+++I  + QN     A   F + R   +KP  F   SV+ A A L
Sbjct: 393 ASMFGKLQSRTLVSWNAMILGFAQNGRPIDALNYFSQMRSRTVKPDTFTYVSVITAIAEL 452

Query: 324 SEIEFGRSVQALAVKACIEDNIFVGSALVDMYGKCGSIDEAERAFNEMPERNLVSWNALL 383
           S     + +  + +++C++ N+FV +ALVDMY KCG+I  A   F+ M ER++ +WNA++
Sbjct: 453 SITHHAKWIHGVVMRSCLDKNVFVTTALVDMYAKCGAIMIARLIFDMMSERHVTTWNAMI 512

Query: 384 GGYAHQGHADKAVALLEEMVSMAGMVPSYVSLVCALSACSRAGDLKTGMQIFESMKARYG 443
            GY   G    A+ L EEM     + P+ V+ +  +SACS +G ++ G++ F  MK  Y 
Sbjct: 513 DGYGTHGFGKAALELFEEM-QKGTIKPNGVTFLSVISACSHSGLVEAGLKCFYMMKENYS 572

Query: 444 IEPGPEHYACLVDLLGRAGMVECAYDFIKSMSFPPTISIWGALLGACRMHGKPELGKLAA 503
           IE   +HY  +VDLLGRAG +  A+DFI  M   P ++++GA+LGAC++H      + AA
Sbjct: 573 IELSMDHYGAMVDLLGRAGRLNEAWDFIMQMPVKPAVNVYGAMLGACQIHKNVNFAEKAA 632

Query: 504 EKLFELDPKDSGNHVVLSNMFAATGRWEEVTVVRNEMKEVGIKKGAGFSWITVNSRIHIF 563
           E+LFEL+P D G HV+L+N++ A   WE+V  VR  M   G++K  G S + + + +H F
Sbjct: 633 ERLFELNPDDGGYHVLLANIYRAASMWEKVGQVRVSMLRQGLRKTPGCSMVEIKNEVHSF 692

Query: 564 QAKDKSHEKDSEIQDMLGKLRKEMQEAAGCIADTNYAL 602
            +   +H    +I   L KL   ++E AG + DTN  L
Sbjct: 693 FSGSTAHPDSKKIYAFLEKLICHIKE-AGYVPDTNLVL 727

BLAST of HG10012690 vs. TAIR 10
Match: AT4G33170.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 376.7 bits (966), Expect = 3.5e-104
Identity = 206/598 (34.45%), Positives = 339/598 (56.69%), Query Frame = 0

Query: 12  LVELAVSVRSSLLGRAAHAQILKTLKTPLPAFLYNHLVNMYAKLDHLNSAKLILELAPCR 71
           ++  AV V S  LG+  H   LK L   L   + N L+NMY KL     A+ + +    R
Sbjct: 321 MLATAVKVDSLALGQQVHCMALK-LGLDLMLTVSNSLINMYCKLRKFGFARTVFDNMSER 380

Query: 72  SVVTWTALIAGSVQNGCFASALLHFSDMLSDCVRPNDFTFPCVFKASTGLRMAMT-GKQL 131
            +++W ++IAG  QNG    A+  F  +L   ++P+ +T   V KA++ L   ++  KQ+
Sbjct: 381 DLISWNSVIAGIAQNGLEVEAVCLFMQLLRCGLKPDQYTMTSVLKAASSLPEGLSLSKQV 440

Query: 132 HALAVKEGLISDVFVGCSVFDMYSKLGFLNDAYKVFDEMPYRNLEMWNAYISNSVLHGRP 191
           H  A+K   +SD FV  ++ D YS+   + +A  +F+   + +L  WNA ++        
Sbjct: 441 HVHAIKINNVSDSFVSTALIDAYSRNRCMKEAEILFERHNF-DLVAWNAMMAGYTQSHDG 500

Query: 192 EDSAIAFIELLRVGGKPDSITFCAFFNACSDKLGLGPGCQLHGFIIRCGYGQNVSVSNGL 251
             +   F  + + G + D  T    F  C     +  G Q+H + I+ GY  ++ VS+G+
Sbjct: 501 HKTLKLFALMHKQGERSDDFTLATVFKTCGFLFAINQGKQVHAYAIKSGYDLDLWVSSGI 560

Query: 252 IDFYGKCGEVECSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKASCLFLRARKEDIKPTD 311
           +D Y KCG++  ++  FD +   + V+W+++I+  ++N EEE+A  +F + R   + P +
Sbjct: 561 LDMYVKCGDMSAAQFAFDSIPVPDDVAWTTMISGCIENGEEERAFHVFSQMRLMGVLPDE 620

Query: 312 FMVSSVLCACAGLSEIEFGRSVQALAVKACIEDNIFVGSALVDMYGKCGSIDEAERAFNE 371
           F ++++  A + L+ +E GR + A A+K    ++ FVG++LVDMY KCGSID+A   F  
Sbjct: 621 FTIATLAKASSCLTALEQGRQIHANALKLNCTNDPFVGTSLVDMYAKCGSIDDAYCLFKR 680

Query: 372 MPERNLVSWNALLGGYAHQGHADKAVALLEEMVSMAGMVPSYVSLVCALSACSRAGDLKT 431
           +   N+ +WNA+L G A  G   + + L ++M S+ G+ P  V+ +  LSACS +G +  
Sbjct: 681 IEMMNITAWNAMLVGLAQHGEGKETLQLFKQMKSL-GIKPDKVTFIGVLSACSHSGLVSE 740

Query: 432 GMQIFESMKARYGIEPGPEHYACLVDLLGRAGMVECAYDFIKSMSFPPTISIWGALLGAC 491
             +   SM   YGI+P  EHY+CL D LGRAG+V+ A + I+SMS   + S++  LL AC
Sbjct: 741 AYKHMRSMHGDYGIKPEIEHYSCLADALGRAGLVKQAENLIESMSMEASASMYRTLLAAC 800

Query: 492 RMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFAATGRWEEVTVVRNEMKEVGIKKGAG 551
           R+ G  E GK  A KL EL+P DS  +V+LSNM+AA  +W+E+ + R  MK   +KK  G
Sbjct: 801 RVQGDTETGKRVATKLLELEPLDSSAYVLLSNMYAAASKWDEMKLARTMMKGHKVKKDPG 860

Query: 552 FSWITVNSRIHIFQAKDKSHEKDS----EIQDMLGKLRKEMQEAAGCIADTNYALFEV 605
           FSWI V ++IHIF   D+S+ +      +++DM+  +++E     G + +T++ L +V
Sbjct: 861 FSWIEVKNKIHIFVVDDRSNRQTELIYRKVKDMIRDIKQE-----GYVPETDFTLVDV 910
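
Each HSP header in the alignments above reports identity and positive counts as a fraction of the alignment length, followed by a percentage. The following is a minimal sketch, not part of the database or of any BLAST tooling, of how such a line could be parsed and its percentages recomputed. The names `HSP_STATS` and `parse_hsp_stats` are hypothetical, and the pattern assumes the exact line layout shown on this page (it accepts both the "Postives" and "Positives" spellings).

```python
import re

# Pattern for an HSP statistics line in the layout used on this page.
# "Posi?tives" matches both the "Postives" and "Positives" spellings.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return counts plus reported and recomputed percentages for one line."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    return {
        "identical": int(ident),
        "positives": int(pos),
        "alignment_length": int(ident_len),
        "reported_identity_pct": float(ident_pct),
        "reported_positive_pct": float(pos_pct),
        # Recompute from the raw counts so the reported values can be checked.
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positive_pct": round(100 * int(pos) / int(pos_len), 2),
    }


# Statistics line of the AT4G33170.1 hit, copied from the alignment above.
stats = parse_hsp_stats(
    "Identity = 206/598 (34.45%), Postives = 339/598 (56.69%), Query Frame = 0"
)
print(stats["identity_pct"], stats["positive_pct"])
```

Comparing `identity_pct` with `reported_identity_pct` is a quick consistency check when these lines are copied between reports.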

The following BLAST results are available for this feature:
Match Name      E-value   Identity  Description
XP_038881355.1  0.0e+00   96.20     pentatricopeptide repeat-containing protein At4g14850 [Benincasa hispida] >XP_03...
XP_031738596.1  0.0e+00   94.06     pentatricopeptide repeat-containing protein At4g14850 [Cucumis sativus] >KGN5698...
XP_008438671.1  0.0e+00   93.56     PREDICTED: pentatricopeptide repeat-containing protein At4g14850 [Cucumis melo] ...
KAG6582300.1    0.0e+00   90.76     Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma sub...
XP_022956070.1  0.0e+00   91.04     pentatricopeptide repeat-containing protein At4g14850 [Cucurbita moschata]

Match Name      E-value   Identity  Description
Q0WSH6          2.4e-219  61.49     Pentatricopeptide repeat-containing protein At4g14850 OS=Arabidopsis thaliana OX...
Q9FIB2          3.6e-114  37.64     Putative pentatricopeptide repeat-containing protein At5g09950 OS=Arabidopsis th...
Q9SIT7          1.6e-109  35.79     Pentatricopeptide repeat-containing protein At2g13600 OS=Arabidopsis thaliana OX...
Q3E6Q1          6.8e-105  35.47     Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidop...
Q9SMZ2          4.9e-103  34.45     Pentatricopeptide repeat-containing protein At4g33170 OS=Arabidopsis thaliana OX...

Match Name      E-value   Identity  Description
A0A0A0L4T8      0.0e+00   94.06     Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G146650 PE=4 SV=1
A0A5A7U206      0.0e+00   93.56     Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946...
A0A1S3AXN0      0.0e+00   93.56     pentatricopeptide repeat-containing protein At4g14850 OS=Cucumis melo OX=3656 GN...
A0A6J1GWT9      0.0e+00   91.04     pentatricopeptide repeat-containing protein At4g14850 OS=Cucurbita moschata OX=3...
A0A6J1IQR6      0.0e+00   91.04     pentatricopeptide repeat-containing protein At4g14850 OS=Cucurbita maxima OX=366...

Match Name      E-value   Identity  Description
AT4G14850.1     1.7e-220  61.49     Pentatricopeptide repeat (PPR) superfamily protein
AT5G09950.1     2.6e-115  37.64     Tetratricopeptide repeat (TPR)-like superfamily protein
AT2G13600.1     1.1e-110  35.79     Pentatricopeptide repeat (PPR) superfamily protein
AT1G11290.1     4.9e-106  35.47     Pentatricopeptide repeat (PPR) superfamily protein
AT4G33170.1     3.5e-104  34.45     Tetratricopeptide repeat (TPR)-like superfamily protein
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term   IPR Description                                    Source       Source Term     Source Description              Alignment
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D       1.25.40.10      Tetratricopeptide repeat domain
    coord: 126..224, e-value: 4.2E-12, score: 47.8
    coord: 9..125, e-value: 2.0E-13, score: 52.1
    coord: 230..327, e-value: 7.7E-16, score: 59.9
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D       1.25.40.10      Tetratricopeptide repeat domain
    coord: 330..405, e-value: 5.5E-9, score: 37.8
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D       1.25.40.10      Tetratricopeptide repeat domain
    coord: 406..600, e-value: 6.6E-22, score: 80.3
IPR011990  Tetratricopeptide-like helical domain superfamily  SUPERFAMILY  48452           TPR-like
    coord: 162..443
IPR002885  Pentatricopeptide repeat                           TIGRFAM      TIGR00756       TIGR00756
    coord: 350..377, e-value: 1.6E-4, score: 19.6
    coord: 417..446, e-value: 9.4E-4, score: 17.2
    coord: 377..411, e-value: 1.6E-6, score: 25.9
    coord: 276..309, e-value: 1.9E-4, score: 19.4
IPR002885  Pentatricopeptide repeat                           PFAM         PF01535         PPR
    coord: 74..102, e-value: 0.018, score: 15.3
    coord: 349..375, e-value: 2.9E-4, score: 20.9
    coord: 276..305, e-value: 0.0057, score: 16.8
    coord: 248..274, e-value: 0.026, score: 14.7
    coord: 377..404, e-value: 3.9E-6, score: 26.8
    coord: 148..173, e-value: 0.0099, score: 16.1
    coord: 419..440, e-value: 0.16, score: 12.3
IPR002885  Pentatricopeptide repeat                           PROSITE      PS51375         PPR
    coord: 375..405, score: 10.13926
IPR002885  Pentatricopeptide repeat                           PROSITE      PS51375         PPR
    coord: 344..374, score: 8.845827
IPR002885  Pentatricopeptide repeat                           PROSITE      PS51375         PPR
    coord: 72..106, score: 8.758137
IPR002885  Pentatricopeptide repeat                           PROSITE      PS51375         PPR
    coord: 274..308, score: 9.821383
None       No IPR available                                   PANTHER      PTHR47925:SF52  SUBFAMILY NOT NAMED
    coord: 7..603
None       No IPR available                                   PANTHER      PTHR47925       OS01G0913400 PROTEIN-RELATED
    coord: 7..603
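
The IPR002885 (pentatricopeptide repeat) hits listed above come from three member databases (TIGRFAM, PFAM and PROSITE) and partly overlap. The sketch below merges them into non-redundant regions, assuming that two hits belong to the same region whenever they share at least one residue. The coordinates are copied from the table; `merge_intervals` and the variable names are hypothetical helpers, not part of InterProScan or of this database.

```python
def merge_intervals(intervals):
    """Merge closed (start, end) intervals that share at least one position."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1]:
            # Overlaps the previous region: extend it if this hit reaches further.
            merged[-1][1] = max(merged[-1][1], end)
        else:
            merged.append([start, end])
    return [tuple(region) for region in merged]


# IPR002885 alignment coordinates, grouped by source database as listed above.
ppr_hits = {
    "TIGRFAM": [(350, 377), (417, 446), (377, 411), (276, 309)],
    "PFAM": [(74, 102), (349, 375), (276, 305), (248, 274),
             (377, 404), (148, 173), (419, 440)],
    "PROSITE": [(375, 405), (344, 374), (72, 106), (274, 308)],
}

all_hits = [hit for hits in ppr_hits.values() for hit in hits]
ppr_regions = merge_intervals(all_hits)
# Coordinates are inclusive, so a region spans end - start + 1 residues.
covered_residues = sum(end - start + 1 for start, end in ppr_regions)

print(ppr_regions)
print(covered_residues)
```

Treating directly adjacent hits (for example 274 and 275) as separate regions is a design choice; changing the test to `start <= merged[-1][1] + 1` would join them instead.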

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
HG10012690.1  HG10012690.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0019288      isopentenyl diphosphate biosynthetic process, methylerythritol 4-phosphate pathway
biological_process  GO:0019287      isopentenyl diphosphate biosynthetic process, mevalonate pathway
biological_process  GO:0050790      regulation of catalytic activity
biological_process  GO:0048364      root development
biological_process  GO:0016125      sterol metabolic process
cellular_component  GO:0005739      mitochondrion
molecular_function  GO:0003729      mRNA binding
molecular_function  GO:0034046      poly(G) binding
molecular_function  GO:0005515      protein binding
molecular_function  GO:0008270      zinc ion binding