Sed0010756 (gene) Chayote v1

Overview
Name: Sed0010756
Type: gene
Organism: Sechium edule (Chayote v1)
Description: Procollagen-proline 4-dioxygenase
Location: LG06: 5276451 .. 5286522 (-)
RNA-Seq Expression: Sed0010756
Synteny: Sed0010756
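The Location value can be read programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention for genome browsers, not stated on this page); the function name `parse_location` and the returned field names are illustrative only.

```python
import re

# Pattern for values such as "LG06: 5276451 .. 5286522 (-)".
LOCATION = re.compile(
    r"(?P<seqid>\S+):\s*(?P<start>\d+)\s*\.\.\s*(?P<end>\d+)\s*\((?P<strand>[+-])\)"
)

def parse_location(text):
    """Split a location string into sequence ID, start, end, strand, and span length."""
    m = LOCATION.search(text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(m["start"]), int(m["end"])
    return {
        "seqid": m["seqid"],
        "start": start,
        "end": end,
        "strand": m["strand"],
        # Assumption: 1-based inclusive coordinates, hence the +1.
        "length": end - start + 1,
    }

loc = parse_location("LG06: 5276451 .. 5286522 (-)")
print(loc)
```

Under that assumption the feature spans 10072 bp on the minus strand of LG06.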
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGATTCTCGATTTTCTCTCGCATTTTCCCTCTGTTTCCTCTGCTCATTTCCTCTCCTGTTTCGCGCCACCAATCGCTTGCCCAAATTGGTCATAACCAGCACCATCACGTAATACCATTTCCATTTTTTGATTTCTCATCTTCGTCTCTTTGTTTCAGAAAATTTGGCCTAATTCATGACAATTTTGCAGGAAAAAATCTGCCGTTGGCTTCGTCTCCGGTAAAATCGATCCCACTCGTGTAATTCAGCTTTCATCGCAACCCAGGTTCTTAACATTCTTCATTCTTCTCTGAATTCTTACAATTATAGCCATATAATTCATAGATTTACACCCCTTTGATAACCCTTTTTTGTTTTTGAAATTTAAGTCTACGACAATTACCATATCATATGTTTTCTTCTACATGGATATCTACGTCTTACCTATATTTGTAGAAACCAAACTAATTTTTGGAAACAAAAAAAAGTCGTTTCCAAAAGTTTGTTTTTGTTTTTGAAATTTGGTTAAAATTCACATACTTTCTTAACCAAAGTTAGATATATTGCAGGAAAGATGTAGGAAATACGAGTAAAAATGAGCTTAAATTTCAAAAACCGAAAACAAAAAATCAAATGATTATCGAACAGGGTCTTAATATTTGAATATGGTTAGATATTCTCTGGGGTTTGTTTAATTTAACAGAAAACCAGGAGGTTTAGGCATTTGTCAAAATTTATCTAACTAAATTAGAGATGGTTATTTTGCTTAATTAGTATAGGCCCTGTCATTATGGGCTGTTGTATGATTAGTGAAATTCGTATGATTCGTGTAGTGCAATAACGTGTGTGGCGTTCATTGCAATCCACGTCTTTATCATTCTTAATTCTTCTTGAGCTATGCTTCCTCGGACAGCTAAATGCTGGAAAGTGCTTAAACTAGAAAACCTTTGAAGTATCTCAGGTCCGGACCTTGGGATGTGCTATCAGATCTCCAGGAATATGGGAGTGAAGCTCCGGAACATTCTTAATTCTTCTCGAAATTTTCGTGATTAGGTTGAGAACATGCACATTTTTGTACAATCCGCTTCTGTTTTTTTCACTCAATAGGATGAAAATTGACAGGGCTTTCTTGTATAAGGGATTTTTGTCTGCAGCGCAGTGCGATCATATTATCAATTTGGTATGGTGATCTCTTGTTCTTCATTTTGGATTTCCTGTTTGGAGGCCAAGAAAGTTTATCCCCGGAGTTTGTTGTGTATAATTGTTTTATATATTTTTAGGCTAGGGATCATATGGAGAAATCAATGGTGGCTGATGACGAAACGGGTGCGAGTGTTGCGAGTGAAGATCGGACGAGTAGCGGCATGTTTCTTGATATAGCTCAGGTATTTTACTTCAGTATCGACAGAACTTGATGTTTCTATAATGTTACCTACTATTGTTATGCCTTTCCTAAATATTTGTCAGTTTCTGTAGTTTTGATTCCTTTTTCCAATTTGATTATCAAATGGGCTTGGTAAAATTCATTTAAGGTAGAATTTCAAAGAGACAGTTGAGCAAAATGGTAACTTAAGGTGCTGGAAGAATCAACCTTCTTGTTCTTATCTATCTTTGGCAGTTTATGAGTCTGATCTTGCATGTTTTTTTTTGTTATCATTGGTTGTGTGAAGGCCATCTTTCAATTTTCTTTTCATGACCTTGTTGAGGCTATTAGATTGAAGATCAACCCAGGAGGTATATACTCATATGTTTGAGTTCTAACCCTTGTTTAAATTGAATGATAGAACTATTGGTAATCAGTCAATGTAATTTCACATAAGTGTTGTGTTAGATTTTGTCTCTTGTTTCTGTAGTTTTATATTCTAATTCTGATAGCTCCTACTTTAATGAATGTTGTTGAGTGCAATGATAGTTTGTTCTGTTGTAAATTTACATTCGTATTTTTAAATTCAACAAAGGGTGGGGGATTCGAACATGTAACCTCTTGGTTACTGGTTCATACATGAGACCATTATTGCT
CTACTTCTCTTGGTTGTGTTCATACGTGATGTCATTGGAGTTCTGTTCCTTTAGGTTGTGTTCATTTAGATTGGCATATGTTGATGTGGTCAATGGCATTTTAATGACTATCAGTAAGTAGGGATTGTTGAAATTTATAAATTAAAAAAGTACAAGGAAGAAATTCAAGCAAACATAAAATGACAATGCTAGAGGCAAAGGTTAACTAAAGAAAATTAAAGAGAAAACAGAAGAGCGATCTTAGAGTTTTTCCTTATAAGTTTAGGTTGTTTAGCAGTTCTTAATATTAGAACTGTTCTCAAGGGGTGGCACAGTGGTTGAAAATTTGAGCTTTGAGGGCATGCTCCTCTCAAGGTCCTGAGTTCGAAACTCACTTGTGGCATTCTGATACGGGCCCGGTACTTAAGCATATATACTCTAGGCGTAAGAAGCAAGTGGCGGCTTAGGCGTTTAGCTTGTAGTCGCACGTGTGGGGAGTTTGTGGTAACGGAATTCGAATATAAGTAGTTAGGGAGGGATTGCGTAGAGGGAAAGAAATCATTTTGTATAGTTTGTCTCTTTGAGATAGGAGAGTGGGAGCTCTAGAACTCCTGGAAATTGTGATTTGATAATAAAATTGGCAGATTCTACCAATTTTGGTATCAGAGCACAGTGACGATCAAAGATGACCGGAAAGTTGGATCAACGATTGAAATCGGTGGAGGAAATTGCAGAGGGAATGGCGACGAAGCAGCAGGAACTTGAATCGCAAATCAAACTGCAGTTCGCCGATATGGAAGAGAAGCAGAGTGCGGCGGAGGAGAAGCAGAGAGCGGCAGAGAAGCGATTGGAGATGAAATTCGACGCCGTGAGGGAAGAAATGAGAACTCTGTTCTCGCGACGGGAAACAGAGGTTGCCGATGGAGATCTGTCGTCGTTCGGCAAAGGAAAAACGGTAGTGACGATCGATTCGGGGCAAAAGAGATTCGAAATTGGAGAAATTCCGTTGACGATAGGCGCGAATGTGAGAGGTCCGAAAACAGGGGCTCCAAATCTGGGAGAGTCCTCGGTTGTTCGTGATTTTGGAGTGAGCGGAATCCACGCTCCGACGAACAAAGAAGTCCCTTTCTTCAATATGCGATTACGTAAACTTGAGGTGCCAGTGTTTAAGGGAGAGGAGAACGAAGATCCGATCGGCTGGTTACATCGAGTGGAACGCTATTTCTTAGTGAATCGCTTAACCGAAAACGACAAATTGGATGCAGCAATCATGTGTTTGGAGGGAGAAGCATTAGATTGGCATCAATATGAGGAAGATCGTTCGACCATTCGTTCGTGGAGTGACTTTCGGGTGTTGTTGTTAGAACGATTTCAACCTACGACACAAGGGAATAGGTATGCGAGGTTAATGAAACTGCAACAAGATGGAACGGTACGTGAGTATCGACGTTTGTTCGAAAAGTACTCAATTGGATTGAAGGACTTAAGCGATAGCGTGCTGGAAGGGAAATTCGAGAGTGGTCTTAAGGAAGAGGTGCAGAGTGAGTTGAGGAAATTGCAGCCGATTGGTTTGAAGGCAAAGATGTTGATGGCTCAACTCATTGAAGATGACGAAGTGATCCGAGCGAAAAAGAAGTCCAGTTCGGTAACAACGGCGGTGGGAAGGAATAGCACGACACCAGCCAGTCCAAATACAGTTGGTACAAATAACTCAGGTGGATCAAACTTACGTTCGTTTACGTTTTCCCCTCAACGCTCGACTAGTAGTAATTCTACTACCATTAGTACTAATGCATCAAGTATAAAAGGGCCATTTAAAAGGTTATCCGTTAGTGAGATCCGTGCAAAAAAAGATAAGGGTCTCTGTTTTTGTTGTGACGAAAAGTTTTTTCCGGGACACAAGTGTAAGAAAACGGAGTTGCAAGCATTGCAAGTTTTGATTGTGCAAGATGGGGTGGAGCTGCGAGAGCCGGACGAGATAGTATTGAGTGCAGGGGAGTCTGCGGGGGGTGAACAAACCG
CGGAGGTTGAAGTTGATCACGAAATGGCAGCATTGTCTCTAAATTCGTTAGCTGGGCTAAGTTCGCCTAAAACGTTGAAAGTGCGCTGAAGTATTTTTGGACTAGAAGTGGCTATTCTGGTGGATACTGGGGCGATGCACAATTTTATTTCAGAGGAGGTGGTATCGAAATTGGGTTTAATGGTGTCCCCATCTGATGAGTATGGTATTGTTTTGGGGACTGGGGGTTCCGTGAGAGCGACAGGGGTGTGTAGGGATGTTGTGCTCCAGTTGTCTGAATTACGGATTGTGCATGATTTCTTACCTTTGTCGTTGGGGAGTGCGGATGTCATTTTAGGTGTGTCTTGGTTGGAAATTTTGGGTAGTGTGGAGTTTGACTATCGAGCCCTTCAGATGAGGTTCGCGGTGGGTTCCTAGAGGGTGCGACTTCAGGGGGATCCTAGTCTGGTAAAGGCTCAAGTTTCTTTGAAATCTATGATGAAATCGCTTCGGGCCGAAGATCAGGGGTTGTTGGTTGAGCTTAACCTGTTGGAAGGTCACGACGATCTAGGGGTCGAGGCCGCGGTGCCAAAGGTATGGGTGGGGGTACCTTCTGAACTACACCTGCTGCTGTCGGATTATTCAAGGATCTTCGAGTCATTACAAACTCTACCACCGCATCGAAATTGCGACCATGCTATTGAATTGTATGAGGGTAAGGGTTCAGTCAATGTGCGCCCTTATCGATATCCACAATTTCAAAAGAATGAAATCGAAAAGTTAGTTCGTGAAATGCTTTTGGCGGGAATCATTCGCCCGAATACTAGCTCTTTCTCAAGTCCGGTGTTATTGGTCAAGAAAAAGGATGGCAGTTGGCGTTTTTGTGTGGATTATAGAGCATTGAACTTAGCCACCATCCCGGACAAATTTCTCATTCCGCTTGTTGACGAATTGTTGGATGAATTACATGGAGCGGTGATTTTCTCTAAGATTGACTTAAAAGCAGGGTATCATCAAATTCGGGTAAAGCCGGCAGACGTTCCAAAGACTGCTTTTCGAACGCATGAGGGACACTACGAGTTCCTTGTGATGTCGTTTGGGTTGAAGAATGCACCAGCAACTTTTCAATCGGTGATGAATGAGATTTTACGGCCATATTTACGGAAATTTGTATTGGTGTTTTTCGATGATATTCTGATCTATAGCTTGTCGATGGAGGAGCATATTGAACATTTGACTAAGGTCTTTGAGGTACTTAAGACTCACTTCTTTGTGGCCAATGCTAAGAAGTGCCAATTTGGGGTGACTTGTATTGAATACCTAGGGCATTTTATTTCGGCAGATGGTGTTTCAGCTGATCCGGCCAAGATTGAAGCGATGGTGAAATGGCCTAATCCGCTTTCCATTAAGGAACTACGAGGATTTTTGGGGTTGACGGGATATTACAGGCGTTTCGTAGCAAATTACGGTATGATAGCATTTCCGTTGACTCAGTTGCTGAAGAAAGGGAAGTTTGTTTGGTCCGCGGAGGCGGAAGATGCTTTTCAACGATTGAAACATGTTATGATCAGCATTCCCGTGCTTCGTTTACCGGATTTTATGCAGTTGTTTGTTGTAGAGACCGATGCATCGGGGATCGGAGTGGGTGCCGTGTTGATGTAGGAAGGAAGACCACTTGCGTACTTCAGTCGGGCGTTATCGATTACGCACTGTTGCAAGCCAGTGTATGAACGATAATTGATGGCTATCGTGTTTTCCGTCCAACGTTGGAGGGCATATTTGCTCGGTCAGCACTTTGTAGTCCGAATGGATCAGAAAAGTTTGAAGTTTTTGCTGGAACAACGCGTTGTGGTGGGAGAGTATCAACGCTGGATTACTAAACTGTTGGGTTACGACTTTCATATTGAATACAAGAGGGGGTTGGAGAACTCGGCGGCTGATGCACTGTCCCGTATACCGTTATCTTGTGAATTGGGAATGTTGAGCTGTGTTTCCGGGATAAATACTGACGTATTTTTAGAG
CAAGTTAAGGCTGACCCACATTTTATGTCGATTTATACGGCTTTGTTGGAGGGTAAAACTGCACCCAAGGGATTTTCGTTGCACCGTGGGTTGTTATGTTTCCAGGGCCGACTGGTTTTGCCTCCTGATTCTCCAACTATTCCTTTACTTTTACAGGAATTTCACGCCGGGCCGATTGGAGGTCATCATGGTGCCCTTAAGACTTACCAACGACTGGCCAAGGAGGTTTACTGGGCAGGAATGAGGGCTCGCGTGCGAATGTTTGTAGCCGAGTGTGCTATTTGTGTTCAGGCCAAACATTTGTCTTTAGCACCGGCCGGGCTATTACAGCCTTTACCAATACCGGAAAGGGTATGGGAAGATGTGTCTATGGACTTTGTTGAGGGCTTGCCTCGCTCAGAGGGTTTTGATACTATTTTGGTGGTGGTGGATCGGTTGTCCAAGTATGCACATTTTATTTTGGTGAAGCATCCGTATACTTCGTTGACCATAGCTTTAGTCTTTATTCGAGAGGTGGTACGTTTGCATGGGGTGCCGCGAAGTATTGTCTCGGATCGGGACAAGGTGTTTACTAGTTCTTTTTGGGAAGAGTTGTTTCGTGCCACAGGCACTAAGCTATGACGTAGCACGACCTATCACCCACAAACGGATGGACAGACTGAAGTTGTTAATCGTTGTTTGGAGTCGTATCTACGATGCTTTGTTATGACTCAACCGAAACAGTGGGCGACATGGATCCCATGGGCTGAATTCAGTTATAATACGTCCTACCATTCGTCTGCGCGATTGACTCCATTTGAAGTGGTTTATGGTCGAGCACCTCCGCCCATTTTGGGATATGAGAAAGGGCAGAGTCCACTCTTTGCGGTGGATTTGTTGTTGGCTGACCGTGACAAGATGTTGGCGACTCTTAAGGCTTCGCTCTTACGAGCTCAACAACTGATGATTAAACATGCTGATGGTAAGCGACGTGATGTTCAGTTTATGGTAGGTGATTTGGTGTATCTAAAATTACGGCCTTATCGACAGCTGTCGGTAGCTCGGTTCAAGTATCCCAAGCATGCTCCACGTTTCGTCGGTCCTTACCGTGTGCTTGCTAGGGTGGGAAGTGTTGCTTATCGTTTGGAGCTACCGGAGACGGTGCGTATTCATCCCGTTGTTCATGTGTCGGTTTCGCAAGGCGCTTGGTACGCTGCGTTACGGTATTCCCACTCGCCTTCGGGGTTACAAGATGATTTGGTATTTGTTTTCCGGCCTTTGGCAGTACTTGGAATGCGCGAGTCTGCTACGATGCTAGGTGATCGGGAAGTGTTGATTCAATGGGAGAGGGGCTTATCAGAAGATGCAACTTGGGAGTCGGCCAGTTTTATCCAGACTCAATTTCCTGATTTCCACCTTGAGGACAAGGTGGTTCTTTGGGGGGCGGGTATTGATACGGGCCCGGTACTTAAGCATATATACTCTAGGCGTAAGAAGCAAGTGGCGGCTTAGGCGTTTAGCTTGTAGTCGCACGTGTGGGGAGTTTGTGGTAACGGAATTCGAATATAAGTAGTTAGGGAAGGATTGCGTAGAGGGAAAGAAATCATTTTGTATAGTTTGTCTCTTTGAGATAGGAGAGTGGGAGCTCTAGAACTCCTGGAAATTGTGATTTGATAATAAAATTGGCAGATTCTACCACATTCCTTCGATGTCTCCAGTGCCTGGCCTAGAGACGGGCGTGATTACCTTGTTTCAAAAAAAAAAAATTAGAACTTGGAACCTCTTCGATAACAAGAATTAGAATTTTCATACTTGGGATGCCAAATCCCAACTAGTTTTTGATTCATTAGCATGTCTGAGAACTTTATAGTTGTTTCCAAGTGTGACAAACTTGTATTGTATTTTTAGTTTTGCATCTTCGTTAAATATAAAACAAATGCAGCTTGATTCTGTGGCCCCCTTCCCATCCCCAGGATCGCATAGTTGCTGGCATTGAGTTCAAGATCGCTGCGTGGACTTTC
CTTCCCATCAGTTAACTACTTTTAGATCTTTAGATGTTTTTTTATCATTTATTTATTGATGGAATGAGGCTCGTAAAATATCGAAATCGTTGTTCTGTTCAGCTCATGGGGAGCCCATGCAAGTACTTAGGTATGAGAACGGTCAGAAATACGAGCCACATTATGATTATTTTCTCGACCCAGTTAATATGGCTATGGGTGGTCATCGGGTCGCCACAGTCTTGATGTATTTATCCGATGTCGAAAAGGGTGGAGAAACAGTCTTTCCCAGTTCTCCGGTATTACTTTCTCCTAACACTTACTGCTCTTATTCAGGTCTGTTTATAATTCGTTTTGAGTTGTTGAAACTTGGAACTTGTTTATGAATTGTTGCAATTTCAATTCTCTTTCTGACAGGTTAAACTATCCGAGGAGCAGAAGAATGACTTGTCCGATTGTGCTAAGATCGGCTACGGAGGTATTGAGTTTTTTCTTTGAAAATATATCGTTGAATTATGTTAAACCACTAATCAACTCAAAAGCTTAACCTAATCGGTTGGGGTTAATTTAATTATATCAACCAATACTCTCCCTCACTTACGGACTTTCCCCCTCACTTTTGGACTTTGAAATTTGAGAAAGACCATACAAGTGGAATTCAATTTTAATTGGGGAGGAAACGACTTGATAGGGATTCGAACTCAAGACCTTCTACTCTGATGTTGATTTTGGAGAATAGAAACAAGTTCTATTATTACTTGATATATTCAATGTTACATTTGCCTTTCTTATATAGGAAAACCTAGGCTAGAATAATAAAAAATTTACACAATAATGTGAATTTACAAATAAGGAAATAATACATAAGGAATGAACCGCATAAAGAAAGAATATAAATCAACATATATCAATGTAGGCTGAGAATCGAGTAATTTTGATGATTTTCAATTTGGTCAAATAACATGATAGGGTCCTTATTGGTCAAATCAACATATTAAGAATATTAAGTGGTTTCACTTAGGGCTCAAAAGAGGTTTCTTTAGAGAAGGGAACAGTGTTTTAAAAGTCGCATCCTCGCTTGTCCCAGGCAAGATGCGTGCCTTTTGGCACCTTACAAGAAGCCCCGCGCGACCCCTTATTAGGTCATTCGAAGCGTGTAATGGTTGATCGTTGAATATTCTGTATATTTTATACATTATTCTTAGTTAAAACTTTATTTATGTCCATGAAAAGTGTTTTTATTGTGTATGCATACACACACATATATATAAGTATTTCGTGTTTTTTTTTATATAATGCGTCTTAAGTAAAATAGTTCTTGCTTTTTGGTGTGCCTTGCGCCTCAGGCTCCAGAGGGTCATTGCACCTTAGTGCGCCTTGGTATTTTTAAAACACTAGAAGGGAGGTTCTAAGTCCTATTCAAACTTACCTTTTCTTTTATTTTTTTTACCTTTTGATATCAATATGGATCTTGATCTTCCTAAGGATGGGCCATTGGCCCATGGGTGCCCCGAGTATAGTGGAGTAAACAAAAAAAAAAGATCTTGGATCTTATCGTAACAATAACCACAGAAACGTTATGAAATGATCATGACCTTCAGTATATCAAGTATAAAGTGGTCAAATAATCAACTCTGGAATTTGTGAAGATGTGATGAGAATGTTTCATTGCCCTAACCTTCTCTACCTCTTCATCCATTTTCTCATTGGATCACATTGTTTCTCTAGCCATCTGTTACTGTAGCATACAAATTCTAATCAAAACAGCATGTTCGTTTGTGGGTGCAGTAAAACCAAAGAAGGGTGATGCTTTACTGTTCTTCAGTCTCCATGCCAATTTGACACCAGACTCGACCAGCTTTCACGGAAGCTGTCCGGTGATAGAGGGCGAGAAGTGGTCTGCAACCAAATGGATTCACATGCTTCCATACACTGAGGTTTGGAGGAATCCAGATTGTGTGGATGAGAGTGAGCACTGTAGTGTGTGGGCAAATGCAGGTGAGTGTGAAAAGAATCCAAGC
TATATGGTGGGCTCAAAGGGTGAGCTTGGATATTGTAGAGAGAGTTGCAAAGTGTGTTCTTCCCCCTCATAG

mRNA sequence

ATGGATTCTCGATTTTCTCTCGCATTTTCCCTCTGTTTCCTCTGCTCATTTCCTCTCCTGTTTCGCGCCACCAATCGCTTGCCCAAATTGGTCATAACCAGCACCATCACGAAAAAATCTGCCGTTGGCTTCGTCTCCGGTAAAATCGATCCCACTCGTGTAATTCAGCTTTCATCGCAACCCAGGGCTTTCTTGTATAAGGGATTTTTGTCTGCAGCGCAGTGCGATCATATTATCAATTTGGCTAGGGATCATATGGAGAAATCAATGGTGGCTGATGACGAAACGGGTGCGAGTGTTGCGAGTGAAGATCGGACGAGTAGCGGCATGTTTCTTGATATAGCTCAGGTTAAACTATCCGAGGAGCAGAAGAATGACTTGTCCGATTGTGCTAAGATCGGCTACGGAGTAAAACCAAAGAAGGGTGATGCTTTACTGTTCTTCAGTCTCCATGCCAATTTGACACCAGACTCGACCAGCTTTCACGGAAGCTGTCCGGTGATAGAGGGCGAGAAGTGGTCTGCAACCAAATGGATTCACATGCTTCCATACACTGAGGTTTGGAGGAATCCAGATTGTGTGGATGAGAGTGAGCACTGTAGTGTGTGGGCAAATGCAGGTGAGTGTGAAAAGAATCCAAGCTATATGGTGGGCTCAAAGGGTGAGCTTGGATATTGTAGAGAGAGTTGCAAAGTGTGTTCTTCCCCCTCATAG

Coding sequence (CDS)

ATGGATTCTCGATTTTCTCTCGCATTTTCCCTCTGTTTCCTCTGCTCATTTCCTCTCCTGTTTCGCGCCACCAATCGCTTGCCCAAATTGGTCATAACCAGCACCATCACGAAAAAATCTGCCGTTGGCTTCGTCTCCGGTAAAATCGATCCCACTCGTGTAATTCAGCTTTCATCGCAACCCAGGGCTTTCTTGTATAAGGGATTTTTGTCTGCAGCGCAGTGCGATCATATTATCAATTTGGCTAGGGATCATATGGAGAAATCAATGGTGGCTGATGACGAAACGGGTGCGAGTGTTGCGAGTGAAGATCGGACGAGTAGCGGCATGTTTCTTGATATAGCTCAGGTTAAACTATCCGAGGAGCAGAAGAATGACTTGTCCGATTGTGCTAAGATCGGCTACGGAGTAAAACCAAAGAAGGGTGATGCTTTACTGTTCTTCAGTCTCCATGCCAATTTGACACCAGACTCGACCAGCTTTCACGGAAGCTGTCCGGTGATAGAGGGCGAGAAGTGGTCTGCAACCAAATGGATTCACATGCTTCCATACACTGAGGTTTGGAGGAATCCAGATTGTGTGGATGAGAGTGAGCACTGTAGTGTGTGGGCAAATGCAGGTGAGTGTGAAAAGAATCCAAGCTATATGGTGGGCTCAAAGGGTGAGCTTGGATATTGTAGAGAGAGTTGCAAAGTGTGTTCTTCCCCCTCATAG

Protein sequence

MDSRFSLAFSLCFLCSFPLLFRATNRLPKLVITSTITKKSAVGFVSGKIDPTRVIQLSSQPRAFLYKGFLSAAQCDHIINLARDHMEKSMVADDETGASVASEDRTSSGMFLDIAQVKLSEEQKNDLSDCAKIGYGVKPKKGDALLFFSLHANLTPDSTSFHGSCPVIEGEKWSATKWIHMLPYTEVWRNPDCVDESEHCSVWANAGECEKNPSYMVGSKGELGYCRESCKVCSSPS
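The relationship between the coding sequence and the protein sequence above can be checked with a short translation routine. This is a minimal sketch using the standard genetic code; the helper name `translate` is illustrative, and only the first 30 and last 18 nucleotides of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)

head = "ATGGATTCTCGATTTTCTCTCGCATTTTCC"  # first 30 nt of the CDS
tail = "TGTTCTTCCCCCTCATAG"              # last 18 nt of the CDS, ending in TAG

print(translate(head))  # MDSRFSLAFS
print(translate(tail))  # CSSPS
```

The two outputs match the start (MDSRFSLAFS...) and end (...CSSPS) of the protein sequence listed above.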
Homology
BLAST of Sed0010756 vs. NCBI nr
Match: XP_023530715.1 (probable prolyl 4-hydroxylase 7 [Cucurbita pepo subsp. pepo] >XP_023530716.1 probable prolyl 4-hydroxylase 7 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 325.9 bits (834), Expect = 3.0e-85
Identity = 174/323 (53.87%), Positives = 198/323 (61.30%), Query Frame = 0

Query: 1   MDSRFSLAFSLCFLCSFPLLFRATNRLPKLVITSTITKKSAVGF----VSGKIDPTRVIQ 60
           MDSRF LAFSLCFLCSFPL  R+ NRLPKL++  T T+ S +       S KIDPTRV+Q
Sbjct: 1   MDSRFFLAFSLCFLCSFPLFARSANRLPKLLLDDTKTEDSVIRMKMDGSSIKIDPTRVVQ 60

Query: 61  LSSQPRAFLYKGFLSAAQCDHIINLARDHMEKSMVADDETGASVASEDRTSSGMFLDIAQ 120
           LSSQPRAFLYKGFLSA +C H+I+LA+D++E+S+V DD TGAS +S DRTS+GMFL  AQ
Sbjct: 61  LSSQPRAFLYKGFLSAEECQHLIDLAKDYLEQSLVVDDITGASRSSTDRTSTGMFLYKAQ 120

Query: 121 ------------------------------------------------------------ 180
                                                                       
Sbjct: 121 DDIVAGIEAKIAAWTFLPVDNGEPIQILRYENGQQYVPHFDFFQDPVNVAAGGHRIATVL 180

Query: 181 ------------------VKLSEEQKNDLSDCAKIGYGVKPKKGDALLFFSLHANLTPDS 238
                              K+ EE+  DLSDC+  GYGVKPKKGDALLFFSLH N+T D 
Sbjct: 181 MYLSNVERGGETVFPDSPAKVFEEENKDLSDCSTTGYGVKPKKGDALLFFSLHPNVTTDP 240

BLAST of Sed0010756 vs. NCBI nr
Match: XP_022931100.1 (probable prolyl 4-hydroxylase 7 [Cucurbita moschata])

HSP 1 Score: 324.7 bits (831), Expect = 6.6e-85
Identity = 174/323 (53.87%), Positives = 197/323 (60.99%), Query Frame = 0

Query: 1   MDSRFSLAFSLCFLCSFPLLFRATNRLPKLVITSTITKKSAVGF----VSGKIDPTRVIQ 60
           MDSRF LAFSLCFLCSFPL  R+ NRLPKL++  T T+ S +       S KIDPTRV+Q
Sbjct: 1   MDSRFFLAFSLCFLCSFPLFARSANRLPKLLLDDTKTEDSVIRMKMDGSSIKIDPTRVVQ 60

Query: 61  LSSQPRAFLYKGFLSAAQCDHIINLARDHMEKSMVADDETGASVASEDRTSSGMFLDIAQ 120
           LSSQPRAFLYKGFLSA +C H+I+LA+D++E+S+V DD TGAS +S DRTS+GMFL  AQ
Sbjct: 61  LSSQPRAFLYKGFLSAEECQHLIDLAKDYLEQSLVVDDITGASRSSTDRTSTGMFLYKAQ 120

Query: 121 ------------------------------------------------------------ 180
                                                                       
Sbjct: 121 DDIVAGIEAKIAAWTFLPVDNGEPIQILRYENGQQYVPHFDFFQDPVNVAAGGHRIATVL 180

Query: 181 ------------------VKLSEEQKNDLSDCAKIGYGVKPKKGDALLFFSLHANLTPDS 238
                              K+ EE+  DL DC+  GYGVKPKKGDALLFFSLH N+T D 
Sbjct: 181 MYLSNVERGGETVFPDSPAKVFEEENKDLFDCSTTGYGVKPKKGDALLFFSLHPNVTTDP 240

BLAST of Sed0010756 vs. NCBI nr
Match: XP_022971148.1 (probable prolyl 4-hydroxylase 7 [Cucurbita maxima] >XP_022971154.1 probable prolyl 4-hydroxylase 7 [Cucurbita maxima])

HSP 1 Score: 323.6 bits (828), Expect = 1.5e-84
Identity = 176/323 (54.49%), Positives = 198/323 (61.30%), Query Frame = 0

Query: 1   MDSRFSLAFSLCFLCSFPLLFRATNRLPKLVITSTITKKSAVGF----VSGKIDPTRVIQ 60
           MDSRF LAFSLCFLCSFPL  R+ NRLPKL++  T T+ S +       S KIDPTRV+Q
Sbjct: 1   MDSRFFLAFSLCFLCSFPLFARSANRLPKLLLDDTKTEDSVIRMKMDGSSIKIDPTRVVQ 60

Query: 61  LSSQPRAFLYKGFLSAAQCDHIINLARDHMEKSMVADDETGASVASEDRTSSGMFLDIAQ 120
           LSSQPRAFLYKGFLSA +C H+I+LA+D++E+S+V DD TGAS +S DRTS+GMFL  AQ
Sbjct: 61  LSSQPRAFLYKGFLSAEECQHLIDLAKDYLEQSLVVDDITGASRSSTDRTSTGMFLYKAQ 120

Query: 121 ------------------------------------------------------------ 180
                                                                       
Sbjct: 121 DDIVAGIEAKIAAWTFLPVDNGEPLQILRYENGQQYVPHFDFFQDPVNVAAGGHRIATVL 180

Query: 181 ------------------VKLSEEQKNDLSDCAKIGYGVKPKKGDALLFFSLHANLTPDS 238
                              K+ EE K DLSDC+  GYGVKPKKGDALLFFSLH N+T D 
Sbjct: 181 LYLSNVERGGETVFPDSPAKVFEENK-DLSDCSTTGYGVKPKKGDALLFFSLHPNVTTDP 240

BLAST of Sed0010756 vs. NCBI nr
Match: KAG6588394.1 (putative prolyl 4-hydroxylase 7, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 307.4 bits (786), Expect = 1.1e-79
Identity = 172/323 (53.25%), Positives = 192/323 (59.44%), Query Frame = 0

Query: 1   MDSRFSLAFSLCFLCSFPLLFRATNRLPKLVITSTITKKSAVGF----VSGKIDPTRVIQ 60
           MDSRF LAFSLCFLCSFPL  R  NRLPKL++  T T+ S +       S KIDPTRV+Q
Sbjct: 1   MDSRFFLAFSLCFLCSFPLFARPANRLPKLLLDDTKTEDSVIRMKMDGSSIKIDPTRVVQ 60

Query: 61  LSSQPRAFLYKGFLSAAQCDHIINLARDHMEKSMVADDETGASVASEDRTSSGMFLDIAQ 120
           LSSQPRAFLYKGFLSA +C HI+++     E+S+V DD TGAS +S DRTS+GMFL  AQ
Sbjct: 61  LSSQPRAFLYKGFLSAEEC-HILSI---WYEQSLVVDDITGASRSSTDRTSTGMFLYKAQ 120

Query: 121 ------------------------------------------------------------ 180
                                                                       
Sbjct: 121 DDIVAGIEAKIAAWTFLPVDNGEPIQILRYENGQQYVPHFDFFQDPVNVAAGGHRIATVL 180

Query: 181 ------------------VKLSEEQKNDLSDCAKIGYGVKPKKGDALLFFSLHANLTPDS 238
                              K+ EE+  DL DC+  GYGVKPKKGDALLFFSLH N+T D 
Sbjct: 181 MYLSNVERGGETVFPDSPAKVFEEENKDLFDCSTTGYGVKPKKGDALLFFSLHPNVTTDP 240

BLAST of Sed0010756 vs. NCBI nr
Match: XP_038905408.1 (probable prolyl 4-hydroxylase 7 isoform X1 [Benincasa hispida] >XP_038905409.1 probable prolyl 4-hydroxylase 7 isoform X1 [Benincasa hispida])

HSP 1 Score: 303.1 bits (775), Expect = 2.1e-78
Identity = 162/319 (50.78%), Positives = 194/319 (60.82%), Query Frame = 0

Query: 1   MDSRFSLAFSLCFLCSFPLLFRATNRLPKLVITSTITKKSAVGFVS----GKIDPTRVIQ 60
           M SRF LAFSLCFLC FP   R+ NRLPKL++ +    +S +   +      IDPTRVI+
Sbjct: 1   MASRFFLAFSLCFLCFFPFFSRSANRLPKLLLHNNNMDQSVIRMKTVGSPVTIDPTRVIK 60

Query: 61  LSSQPRAFLYKGFLSAAQCDHIINLARDHMEKSMVADDETGASVASEDRTSSGMFLDIAQ 120
           LSS+PRAFLYKGFLS  +C H+INLA+  +++S+VA  ETG SV S++RTS+GMFL  AQ
Sbjct: 61  LSSKPRAFLYKGFLSEDECQHLINLAKGKLQQSLVA-AETGESVTSQERTSTGMFLTRAQ 120

Query: 121 ------------------------------------------------------------ 180
                                                                       
Sbjct: 121 DEIVARIESRIAAWTFLPIDNGEPIQILRYENGQKYEPHFDFFQDPVNIAIGGHRIATIL 180

Query: 181 ------------------VKLSEEQKNDLSDCAKIGYGVKPKKGDALLFFSLHANLTPDS 238
                             +KLSE+++ DLSDCAK+GYGVKPK GDALLFFSL+ N+TPD+
Sbjct: 181 MYLSDVEKGGETVFPNSPIKLSEQERADLSDCAKVGYGVKPKMGDALLFFSLNPNVTPDA 240
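The HSP summary lines in these reports follow a fixed pattern, so the percentages can be recomputed from the raw counts. The sketch below is one way to parse them; the regular expressions and the `parse_hsp` helper are assumptions made for illustration, not part of any BLAST tool. The label in front of the second count is matched loosely so that spelling variants of it are accepted.

```python
import re

HSP_SCORE = re.compile(r"HSP (\d+) Score: ([\d.]+) bits \((\d+)\), Expect = (\S+)")
# The second label is matched with \w+ so any spelling of it is accepted.
HSP_IDENT = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), \w+ = (\d+)/(\d+) \(([\d.]+)%\)"
)

def parse_hsp(score_line, ident_line):
    """Extract bit score, E-value, and recomputed percentages from an HSP header."""
    s = HSP_SCORE.search(score_line)
    i = HSP_IDENT.search(ident_line)
    if s is None or i is None:
        raise ValueError("lines do not look like an HSP summary")
    identical, length = int(i.group(1)), int(i.group(2))
    positive = int(i.group(4))
    return {
        "bits": float(s.group(2)),
        "raw_score": int(s.group(3)),
        "evalue": float(s.group(4)),
        "identity_pct": round(100 * identical / length, 2),
        "positive_pct": round(100 * positive / length, 2),
    }

hsp = parse_hsp(
    "HSP 1 Score: 325.9 bits (834), Expect = 3.0e-85",
    "Identity = 174/323 (53.87%), Positives = 198/323 (61.30%), Query Frame = 0",
)
print(hsp)
```

For the first NCBI nr hit (XP_023530715.1) this reproduces the reported 53.87% identity over a 323-column alignment.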

BLAST of Sed0010756 vs. ExPASy Swiss-Prot
Match: Q8L970 (Probable prolyl 4-hydroxylase 7 OS=Arabidopsis thaliana OX=3702 GN=P4H7 PE=2 SV=1)

HSP 1 Score: 229.6 bits (584), Expect = 3.8e-59
Identity = 136/319 (42.63%), Positives = 169/319 (52.98%), Query Frame = 0

Query: 1   MDSRFSLAFSLCFLCSFPLLFRATNRLPKLVITSTITKKSAV-----GFVSGKIDPTRVI 60
           MDSR  LAFSLCFL + PL+  A NR    +  S+ T+  +V        S   DPTRV 
Sbjct: 1   MDSRIFLAFSLCFLFTLPLISSAPNR---FLTRSSNTRDGSVIKMKTSASSFGFDPTRVT 60

Query: 61  QLSSQPRAFLYKGFLSAAQCDHIINLARDHMEKSMVADDETGASVASEDRTSSGMFLD-- 120
           QLS  PR FLY+GFLS  +CDH I LA+  +EKSMVAD+++G SV SE RTSSGMFL   
Sbjct: 61  QLSWTPRVFLYEGFLSDEECDHFIKLAKGKLEKSMVADNDSGESVESEVRTSSGMFLSKR 120

Query: 121 --------------------------------------------------------IAQV 180
                                                                   IA V
Sbjct: 121 QDDIVSNVEAKLAAWTFLPEENGESMQILHYENGQKYEPHFDYFHDQANLELGGHRIATV 180

Query: 181 --------------------KLSEEQKNDLSDCAKIGYGVKPKKGDALLFFSLHANLTPD 236
                               K ++ + +  ++CAK GY VKP+KGDALLFF+LH N T D
Sbjct: 181 LMYLSNVEKGGETVFPMWKGKATQLKDDSWTECAKQGYAVKPRKGDALLFFNLHPNATTD 240

BLAST of Sed0010756 vs. ExPASy Swiss-Prot
Match: F4J0A8 (Probable prolyl 4-hydroxylase 6 OS=Arabidopsis thaliana OX=3702 GN=P4H6 PE=2 SV=1)

HSP 1 Score: 205.3 bits (521), Expect = 7.6e-52
Identity = 132/312 (42.31%), Positives = 157/312 (50.32%), Query Frame = 0

Query: 1   MDSRFSLAFSLCFLCSFPLLFRATNRLPKLVITSTITKKSAVGFVSGKIDPTRVIQLSSQ 60
           MDS++ LAFSL  L  F                      S +   S  +DPTR+ QLS  
Sbjct: 1   MDSQYFLAFSLSLLLIF----------------------SQISSFSFSVDPTRITQLSWT 60

Query: 61  PRAFLYKGFLSAAQCDHIINLARDHMEKSM-VADDETGASVASEDRTSSGMFLD------ 120
           PRAFLYKGFLS  +CDH+I LA+  +EKSM VAD ++G S  SE RTSSGMFL       
Sbjct: 61  PRAFLYKGFLSDEECDHLIKLAKGKLEKSMVVADVDSGESEDSEVRTSSGMFLTKRQDDI 120

Query: 121 IAQVK--------LSEE-------------QKND-------------------------- 180
           +A V+        L EE             QK D                          
Sbjct: 121 VANVEAKLAAWTFLPEENGEALQILHYENGQKYDPHFDYFYDKKALELGGHRIATVLMYL 180

Query: 181 -------------------------LSDCAKIGYGVKPKKGDALLFFSLHANLTPDSTSF 234
                                     S CAK GY VKP+KGDALLFF+LH N T D  S 
Sbjct: 181 SNVTKGGETVFPNWKGKTPQLKDDSWSKCAKQGYAVKPRKGDALLFFNLHLNGTTDPNSL 240

BLAST of Sed0010756 vs. ExPASy Swiss-Prot
Match: Q8LAN3 (Probable prolyl 4-hydroxylase 4 OS=Arabidopsis thaliana OX=3702 GN=P4H4 PE=2 SV=1)

HSP 1 Score: 185.7 bits (470), Expect = 6.3e-46
Identity = 102/267 (38.20%), Positives = 135/267 (50.56%), Query Frame = 0

Query: 49  IDPTRVIQLSSQPRAFLYKGFLSAAQCDHIINLARDHMEKSMVADDETGASVASEDRTSS 108
           ++P++V Q+SS+PRAF+Y+GFL+  +CDH+++LA+  +++S VAD+++G S  SE RTSS
Sbjct: 32  VNPSKVKQVSSKPRAFVYEGFLTELECDHMVSLAKASLKRSAVADNDSGESKFSEVRTSS 91

Query: 109 GMFL-------------------------------------------------------- 168
           G F+                                                        
Sbjct: 92  GTFISKGKDPIVSGIEDKISTWTFLPKENGEDIQVLRYEHGQKYDAHFDYFHDKVNIVRG 151

Query: 169 -------------------------DIAQVKLSEEQKNDLSDCAKIGYGVKPKKGDALLF 228
                                    +I   ++  E K DLSDCAK G  VKP+KGDALLF
Sbjct: 152 GHRMATILMYLSNVTKGGETVFPDAEIPSRRVLSENKEDLSDCAKRGIAVKPRKGDALLF 211

Query: 229 FSLHANLTPDSTSFHGSCPVIEGEKWSATKWIHMLPYTE-VWRNPDCVDESEHCSVWANA 234
           F+LH +  PD  S HG CPVIEGEKWSATKWIH+  +   V  + +C D +E C  WA  
Sbjct: 212 FNLHPDAIPDPLSLHGGCPVIEGEKWSATKWIHVDSFDRIVTPSGNCTDMNESCERWAVL 271

BLAST of Sed0010756 vs. ExPASy Swiss-Prot
Match: F4JAU3 (Prolyl 4-hydroxylase 2 OS=Arabidopsis thaliana OX=3702 GN=P4H2 PE=1 SV=1)

HSP 1 Score: 180.6 bits (457), Expect = 2.0e-44
Identity = 104/270 (38.52%), Positives = 137/270 (50.74%), Query Frame = 0

Query: 46  SGKIDPTRVIQLSSQPRAFLYKGFLSAAQCDHIINLARDHMEKSMVADDETGASVASEDR 105
           S  I+P++V Q+SS+PRAF+Y+GFL+  +CDH+I+LA++++++S VAD++ G S  S+ R
Sbjct: 30  SSIINPSKVKQVSSKPRAFVYEGFLTDLECDHLISLAKENLQRSAVADNDNGESQVSDVR 89

Query: 106 TSSGMFLD---------------------------------------------------- 165
           TSSG F+                                                     
Sbjct: 90  TSSGTFISKGKDPIVSGIEDKLSTWTFLPKENGEDLQVLRYEHGQKYDAHFDYFHDKVNI 149

Query: 166 ------IAQVKL-----------------------SEEQKNDLSDCAKIGYGVKPKKGDA 225
                 IA V L                         E K+DLSDCAK G  VKPKKG+A
Sbjct: 150 ARGGHRIATVLLYLSNVTKGGETVFPDAQEFSRRSLSENKDDLSDCAKKGIAVKPKKGNA 209

Query: 226 LLFFSLHANLTPDSTSFHGSCPVIEGEKWSATKWIHMLPYTEV-WRNPDCVDESEHCSVW 234
           LLFF+L  +  PD  S HG CPVIEGEKWSATKWIH+  + ++   + +C D +E C  W
Sbjct: 210 LLFFNLQQDAIPDPFSLHGGCPVIEGEKWSATKWIHVDSFDKILTHDGNCTDVNESCERW 269

BLAST of Sed0010756 vs. ExPASy Swiss-Prot
Match: Q8GXT7 (Probable prolyl 4-hydroxylase 12 OS=Arabidopsis thaliana OX=3702 GN=P4H12 PE=2 SV=1)

HSP 1 Score: 131.0 bits (328), Expect = 1.8e-29
Identity = 85/256 (33.20%), Positives = 116/256 (45.31%), Query Frame = 0

Query: 36  ITKKS---AVGFVSGK--IDPTRVIQLSSQPRAFLYKGFLSAAQCDHIINLARDHMEKSM 95
           IT KS      +V G   +DPTRV+QLS  PR FLY+GFLS  +CDH+I+L ++  E   
Sbjct: 36  ITSKSDDTQASYVLGSKFVDPTRVLQLSWLPRVFLYRGFLSEEECDHLISLRKETTEVYS 95

Query: 96  VADD------------------------ETGASVASEDRTS--SGMFLDI---------- 155
           V  D                        E G S+     TS  SG  LD           
Sbjct: 96  VDADGKTQLDPVVAGIEEKVSAWTFLPGENGGSIKVRSYTSEKSGKKLDYFGEEPSSVLH 155

Query: 156 -----------------AQVKLSEEQKNDLSDCAKIGYGVKPKKGDALLFFSLHANLTPD 215
                             ++     +    + C + G  ++P KG+A+LFF+   N + D
Sbjct: 156 ESLLATVVLYLSNTTQGGELLFPNSEMKPKNSCLEGGNILRPVKGNAILFFTRLLNASLD 215

Query: 216 STSFHGSCPVIEGEKWSATKWIHMLPYTEVWRNPDCVDESEHCSVWANAGECEKNPSYMV 234
             S H  CPV++GE   ATK I+      +  + +C DE E+C  WA  GEC+KNP YM+
Sbjct: 216 GKSTHLRCPVVKGELLVATKLIYAKKQARIEESGECSDEDENCGRWAKLGECKKNPVYMI 275

BLAST of Sed0010756 vs. ExPASy TrEMBL
Match: A0A6J1EYJ1 (Procollagen-proline 4-dioxygenase OS=Cucurbita moschata OX=3662 GN=LOC111437385 PE=3 SV=1)

HSP 1 Score: 324.7 bits (831), Expect = 3.2e-85
Identity = 174/323 (53.87%), Positives = 197/323 (60.99%), Query Frame = 0

Query: 1   MDSRFSLAFSLCFLCSFPLLFRATNRLPKLVITSTITKKSAVGF----VSGKIDPTRVIQ 60
           MDSRF LAFSLCFLCSFPL  R+ NRLPKL++  T T+ S +       S KIDPTRV+Q
Sbjct: 1   MDSRFFLAFSLCFLCSFPLFARSANRLPKLLLDDTKTEDSVIRMKMDGSSIKIDPTRVVQ 60

Query: 61  LSSQPRAFLYKGFLSAAQCDHIINLARDHMEKSMVADDETGASVASEDRTSSGMFLDIAQ 120
           LSSQPRAFLYKGFLSA +C H+I+LA+D++E+S+V DD TGAS +S DRTS+GMFL  AQ
Sbjct: 61  LSSQPRAFLYKGFLSAEECQHLIDLAKDYLEQSLVVDDITGASRSSTDRTSTGMFLYKAQ 120

Query: 121 ------------------------------------------------------------ 180
                                                                       
Sbjct: 121 DDIVAGIEAKIAAWTFLPVDNGEPIQILRYENGQQYVPHFDFFQDPVNVAAGGHRIATVL 180

Query: 181 ------------------VKLSEEQKNDLSDCAKIGYGVKPKKGDALLFFSLHANLTPDS 238
                              K+ EE+  DL DC+  GYGVKPKKGDALLFFSLH N+T D 
Sbjct: 181 MYLSNVERGGETVFPDSPAKVFEEENKDLFDCSTTGYGVKPKKGDALLFFSLHPNVTTDP 240

BLAST of Sed0010756 vs. ExPASy TrEMBL
Match: A0A6J1I5Z9 (Procollagen-proline 4-dioxygenase OS=Cucurbita maxima OX=3661 GN=LOC111469906 PE=3 SV=1)

HSP 1 Score: 323.6 bits (828), Expect = 7.1e-85
Identity = 176/323 (54.49%), Positives = 198/323 (61.30%), Query Frame = 0

Query: 1   MDSRFSLAFSLCFLCSFPLLFRATNRLPKLVITSTITKKSAVGF----VSGKIDPTRVIQ 60
           MDSRF LAFSLCFLCSFPL  R+ NRLPKL++  T T+ S +       S KIDPTRV+Q
Sbjct: 1   MDSRFFLAFSLCFLCSFPLFARSANRLPKLLLDDTKTEDSVIRMKMDGSSIKIDPTRVVQ 60

Query: 61  LSSQPRAFLYKGFLSAAQCDHIINLARDHMEKSMVADDETGASVASEDRTSSGMFLDIAQ 120
           LSSQPRAFLYKGFLSA +C H+I+LA+D++E+S+V DD TGAS +S DRTS+GMFL  AQ
Sbjct: 61  LSSQPRAFLYKGFLSAEECQHLIDLAKDYLEQSLVVDDITGASRSSTDRTSTGMFLYKAQ 120

Query: 121 ------------------------------------------------------------ 180
                                                                       
Sbjct: 121 DDIVAGIEAKIAAWTFLPVDNGEPLQILRYENGQQYVPHFDFFQDPVNVAAGGHRIATVL 180

Query: 181 ------------------VKLSEEQKNDLSDCAKIGYGVKPKKGDALLFFSLHANLTPDS 238
                              K+ EE K DLSDC+  GYGVKPKKGDALLFFSLH N+T D 
Sbjct: 181 LYLSNVERGGETVFPDSPAKVFEENK-DLSDCSTTGYGVKPKKGDALLFFSLHPNVTTDP 240

BLAST of Sed0010756 vs. ExPASy TrEMBL
Match: A0A6J1DX45 (Procollagen-proline 4-dioxygenase OS=Momordica charantia OX=3673 GN=LOC111024321 PE=3 SV=1)

HSP 1 Score: 292.7 bits (748), Expect = 1.3e-75
Identity = 158/318 (49.69%), Positives = 191/318 (60.06%), Query Frame = 0

Query: 1   MDSRFSLAFSLCFLCSFPLLFRATNRLPKLVITSTITKKSAV-----GFVSGKIDPTRVI 60
           MDSR  LAFSLCFLC FPL  R+TN +P+L++      + ++     G  S  IDP+RV 
Sbjct: 1   MDSRRFLAFSLCFLCLFPLFCRSTNPMPRLLMDRNNMGRGSLIRMKTGGSSISIDPSRVT 60

Query: 61  QLSSQPRAFLYKGFLSAAQCDHIINLARDHMEKSMVADDETGASVASEDRTSSGMFL--- 120
           QLSSQPRAF+YKGFLSA +C+H+INLA+D +E+S+VADD TG SV S +RTS+GMFL   
Sbjct: 61  QLSSQPRAFVYKGFLSAEECEHLINLAKDKLEESLVADDVTGESVTSRERTSTGMFLVKG 120

Query: 121 ------------------------------------------------DIAQ-------- 180
                                                           ++AQ        
Sbjct: 121 QDKIVAGIESRIAAWTFLPVDNGEPMQVLRYENGQKYDPHFDFFQDPVNMAQGGHRIATV 180

Query: 181 -------------------VKLSEEQKNDLSDCAKIGYGVKPKKGDALLFFSLHANLTPD 236
                              VKLS  +K +LSDCAK+GY VKPK GDALLFFSLHAN T D
Sbjct: 181 LMYLSNVEEGGETVFPNSHVKLSAREKKELSDCAKLGYAVKPKMGDALLFFSLHANGTTD 240

BLAST of Sed0010756 vs. ExPASy TrEMBL
Match: A0A6J1DTY4 (Procollagen-proline 4-dioxygenase OS=Momordica charantia OX=3673 GN=LOC111024321 PE=3 SV=1)

HSP 1 Score: 292.7 bits (748), Expect = 1.3e-75
Identity = 158/318 (49.69%), Positives = 191/318 (60.06%), Query Frame = 0

Query: 1   MDSRFSLAFSLCFLCSFPLLFRATNRLPKLVITSTITKKSAV-----GFVSGKIDPTRVI 60
           MDSR  LAFSLCFLC FPL  R+TN +P+L++      + ++     G  S  IDP+RV 
Sbjct: 86  MDSRRFLAFSLCFLCLFPLFCRSTNPMPRLLMDRNNMGRGSLIRMKTGGSSISIDPSRVT 145

Query: 61  QLSSQPRAFLYKGFLSAAQCDHIINLARDHMEKSMVADDETGASVASEDRTSSGMFL--- 120
           QLSSQPRAF+YKGFLSA +C+H+INLA+D +E+S+VADD TG SV S +RTS+GMFL   
Sbjct: 146 QLSSQPRAFVYKGFLSAEECEHLINLAKDKLEESLVADDVTGESVTSRERTSTGMFLVKG 205

Query: 121 ------------------------------------------------DIAQ-------- 180
                                                           ++AQ        
Sbjct: 206 QDKIVAGIESRIAAWTFLPVDNGEPMQVLRYENGQKYDPHFDFFQDPVNMAQGGHRIATV 265

Query: 181 -------------------VKLSEEQKNDLSDCAKIGYGVKPKKGDALLFFSLHANLTPD 236
                              VKLS  +K +LSDCAK+GY VKPK GDALLFFSLHAN T D
Sbjct: 266 LMYLSNVEEGGETVFPNSHVKLSAREKKELSDCAKLGYAVKPKMGDALLFFSLHANGTTD 325

BLAST of Sed0010756 vs. ExPASy TrEMBL
Match: A0A1S3B814 (Procollagen-proline 4-dioxygenase OS=Cucumis melo OX=3656 GN=LOC103487037 PE=3 SV=1)

HSP 1 Score: 283.5 bits (724), Expect = 8.2e-73
Identity = 155/319 (48.59%), Positives = 189/319 (59.25%), Query Frame = 0

Query: 1   MDSRFSLAFSLCFLCSFPLLFRATNRLPKLVITSTITKKSAVGFVSG----KIDPTRVIQ 60
           M S F LAFS+ FL   PL   + NR PK+++ +    +S +   +G     IDPTRVIQ
Sbjct: 1   MASPFLLAFSIFFLWLLPLSSLSANRFPKMLLHNNDMYESVIRMKTGGSAITIDPTRVIQ 60

Query: 61  LSSQPRAFLYKGFLSAAQCDHIINLARDHMEKSMVADDETGASVASEDRTSSGMFLDIAQ 120
           LSS+PRAFLYKGFLS  +C H+I+LA+  + +S+VA   TG SV S++RTS+GMFL  AQ
Sbjct: 61  LSSKPRAFLYKGFLSYEECQHLIHLAKGKLRQSLVAAG-TGESVTSKERTSTGMFLRKAQ 120

Query: 121 ------------------------------------------------------------ 180
                                                                       
Sbjct: 121 DKIVARIESRIAAWTFLPLDNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATIL 180

Query: 181 ------------------VKLSEEQKNDLSDCAKIGYGVKPKKGDALLFFSLHANLTPDS 238
                             VKLSEE+K DLS+CAK+GYGV+PK GDALLFFS++ N+TPD+
Sbjct: 181 MYLSDVEKGGETVFPNSPVKLSEEEKGDLSECAKVGYGVRPKLGDALLFFSMNPNVTPDA 240

BLAST of Sed0010756 vs. TAIR 10
Match: AT3G28480.1 (Oxoglutarate/iron-dependent oxygenase)

HSP 1 Score: 229.6 bits (584), Expect = 2.7e-60
Identity = 136/319 (42.63%), Positives = 169/319 (52.98%), Query Frame = 0

Query: 1   MDSRFSLAFSLCFLCSFPLLFRATNRLPKLVITSTITKKSAV-----GFVSGKIDPTRVI 60
           MDSR  LAFSLCFL + PL+  A NR    +  S+ T+  +V        S   DPTRV 
Sbjct: 1   MDSRIFLAFSLCFLFTLPLISSAPNR---FLTRSSNTRDGSVIKMKTSASSFGFDPTRVT 60

Query: 61  QLSSQPRAFLYKGFLSAAQCDHIINLARDHMEKSMVADDETGASVASEDRTSSGMFLD-- 120
           QLS  PR FLY+GFLS  +CDH I LA+  +EKSMVAD+++G SV SE RTSSGMFL   
Sbjct: 61  QLSWTPRVFLYEGFLSDEECDHFIKLAKGKLEKSMVADNDSGESVESEVRTSSGMFLSKR 120

Query: 121 --------------------------------------------------------IAQV 180
                                                                   IA V
Sbjct: 121 QDDIVSNVEAKLAAWTFLPEENGESMQILHYENGQKYEPHFDYFHDQANLELGGHRIATV 180

Query: 181 --------------------KLSEEQKNDLSDCAKIGYGVKPKKGDALLFFSLHANLTPD 236
                               K ++ + +  ++CAK GY VKP+KGDALLFF+LH N T D
Sbjct: 181 LMYLSNVEKGGETVFPMWKGKATQLKDDSWTECAKQGYAVKPRKGDALLFFNLHPNATTD 240

BLAST of Sed0010756 vs. TAIR 10
Match: AT3G28480.2 (Oxoglutarate/iron-dependent oxygenase)

HSP 1 Score: 215.7 bits (548), Expect = 4.0e-56
Identity = 132/327 (40.37%), Positives = 167/327 (51.07%), Query Frame = 0

Query: 1   MDSRFSLAFSLCFLCSFPLLFRATNRLPKLVITSTITKKSAV-----GFVSGKIDPTRVI 60
           MDSR  LAFSLCFL + PL+  A NR    +  S+ T+  +V        S   DPTRV 
Sbjct: 1   MDSRIFLAFSLCFLFTLPLISSAPNR---FLTRSSNTRDGSVIKMKTSASSFGFDPTRVT 60

Query: 61  QLSSQPRAFLYKGFLSAAQCDHIINLARDHMEKSMVADDETGASVASED-----RTSSGM 120
           QLS  PR FLY+GFLS  +CDH I LA+  +EKSMVAD+++G SV SED     R SS  
Sbjct: 61  QLSWTPRVFLYEGFLSDEECDHFIKLAKGKLEKSMVADNDSGESVESEDSVSVVRQSSSF 120

Query: 121 ---------------------------------------------------FLDIAQVKL 180
                                                              F D A ++L
Sbjct: 121 IANMDSLEIDDIVSNVEAKLAAWTFLPEENGESMQILHYENGQKYEPHFDYFHDQANLEL 180

Query: 181 ------------------------------SEEQKNDLSDCAKIGYGVKPKKGDALLFFS 236
                                         ++ + +  ++CAK GY VKP+KGDALLFF+
Sbjct: 181 GGHRIATVLMYLSNVEKGGETVFPMWKGKATQLKDDSWTECAKQGYAVKPRKGDALLFFN 240

BLAST of Sed0010756 vs. TAIR 10
Match: AT3G28490.1 (Oxoglutarate/iron-dependent oxygenase )

HSP 1 Score: 205.3 bits (521), Expect = 5.4e-53
Identity = 132/312 (42.31%), Positives = 157/312 (50.32%), Query Frame = 0

Query: 1   MDSRFSLAFSLCFLCSFPLLFRATNRLPKLVITSTITKKSAVGFVSGKIDPTRVIQLSSQ 60
           MDS++ LAFSL  L  F                      S +   S  +DPTR+ QLS  
Sbjct: 1   MDSQYFLAFSLSLLLIF----------------------SQISSFSFSVDPTRITQLSWT 60

Query: 61  PRAFLYKGFLSAAQCDHIINLARDHMEKSM-VADDETGASVASEDRTSSGMFLD------ 120
           PRAFLYKGFLS  +CDH+I LA+  +EKSM VAD ++G S  SE RTSSGMFL       
Sbjct: 61  PRAFLYKGFLSDEECDHLIKLAKGKLEKSMVVADVDSGESEDSEVRTSSGMFLTKRQDDI 120

Query: 121 IAQVK--------LSEE-------------QKND-------------------------- 180
           +A V+        L EE             QK D                          
Sbjct: 121 VANVEAKLAAWTFLPEENGEALQILHYENGQKYDPHFDYFYDKKALELGGHRIATVLMYL 180

Query: 181 -------------------------LSDCAKIGYGVKPKKGDALLFFSLHANLTPDSTSF 234
                                     S CAK GY VKP+KGDALLFF+LH N T D  S 
Sbjct: 181 SNVTKGGETVFPNWKGKTPQLKDDSWSKCAKQGYAVKPRKGDALLFFNLHLNGTTDPNSL 240

BLAST of Sed0010756 vs. TAIR 10
Match: AT5G18900.1 (2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein )

HSP 1 Score: 185.7 bits (470), Expect = 4.5e-47
Identity = 102/267 (38.20%), Positives = 135/267 (50.56%), Query Frame = 0

Query: 49  IDPTRVIQLSSQPRAFLYKGFLSAAQCDHIINLARDHMEKSMVADDETGASVASEDRTSS 108
           ++P++V Q+SS+PRAF+Y+GFL+  +CDH+++LA+  +++S VAD+++G S  SE RTSS
Sbjct: 32  VNPSKVKQVSSKPRAFVYEGFLTELECDHMVSLAKASLKRSAVADNDSGESKFSEVRTSS 91

Query: 109 GMFL-------------------------------------------------------- 168
           G F+                                                        
Sbjct: 92  GTFISKGKDPIVSGIEDKISTWTFLPKENGEDIQVLRYEHGQKYDAHFDYFHDKVNIVRG 151

Query: 169 -------------------------DIAQVKLSEEQKNDLSDCAKIGYGVKPKKGDALLF 228
                                    +I   ++  E K DLSDCAK G  VKP+KGDALLF
Sbjct: 152 GHRMATILMYLSNVTKGGETVFPDAEIPSRRVLSENKEDLSDCAKRGIAVKPRKGDALLF 211

Query: 229 FSLHANLTPDSTSFHGSCPVIEGEKWSATKWIHMLPYTE-VWRNPDCVDESEHCSVWANA 234
           F+LH +  PD  S HG CPVIEGEKWSATKWIH+  +   V  + +C D +E C  WA  
Sbjct: 212 FNLHPDAIPDPLSLHGGCPVIEGEKWSATKWIHVDSFDRIVTPSGNCTDMNESCERWAVL 271

BLAST of Sed0010756 vs. TAIR 10
Match: AT3G06300.1 (P4H isoform 2 )

HSP 1 Score: 180.6 bits (457), Expect = 1.4e-45
Identity = 104/270 (38.52%), Positives = 137/270 (50.74%), Query Frame = 0

Query: 46  SGKIDPTRVIQLSSQPRAFLYKGFLSAAQCDHIINLARDHMEKSMVADDETGASVASEDR 105
           S  I+P++V Q+SS+PRAF+Y+GFL+  +CDH+I+LA++++++S VAD++ G S  S+ R
Sbjct: 30  SSIINPSKVKQVSSKPRAFVYEGFLTDLECDHLISLAKENLQRSAVADNDNGESQVSDVR 89

Query: 106 TSSGMFLD---------------------------------------------------- 165
           TSSG F+                                                     
Sbjct: 90  TSSGTFISKGKDPIVSGIEDKLSTWTFLPKENGEDLQVLRYEHGQKYDAHFDYFHDKVNI 149

Query: 166 ------IAQVKL-----------------------SEEQKNDLSDCAKIGYGVKPKKGDA 225
                 IA V L                         E K+DLSDCAK G  VKPKKG+A
Sbjct: 150 ARGGHRIATVLLYLSNVTKGGETVFPDAQEFSRRSLSENKDDLSDCAKKGIAVKPKKGNA 209

Query: 226 LLFFSLHANLTPDSTSFHGSCPVIEGEKWSATKWIHMLPYTEV-WRNPDCVDESEHCSVW 234
           LLFF+L  +  PD  S HG CPVIEGEKWSATKWIH+  + ++   + +C D +E C  W
Sbjct: 210 LLFFNLQQDAIPDPFSLHGGCPVIEGEKWSATKWIHVDSFDKILTHDGNCTDVNESCERW 269
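
The percentages in the HSP summary lines above follow from the counts printed next to them: matching columns divided by the aligned length, which includes gap columns. The sketch below is only illustrative; the function names `parse_hsp_stats` and `recomputed_percent` are hypothetical and not part of any BLAST tool. It parses one summary line and recomputes both values.

```python
import re

# Matches fields of the form "Label = matches/length (percent%)".
FIELD = re.compile(r"(\w+) = (\d+)/(\d+) \(([\d.]+)%\)")

def parse_hsp_stats(line):
    """Return {label: (matches, aligned_length, reported_percent)}."""
    return {
        label: (int(n), int(total), float(pct))
        for label, n, total, pct in FIELD.findall(line)
    }

def recomputed_percent(matches, aligned_length):
    """Percentage of aligned columns (gap columns included) that match."""
    return round(100.0 * matches / aligned_length, 2)

# Summary line of the AT3G28480.2 HSP reported above.
line = "Identity = 132/327 (40.37%), Positives = 167/327 (51.07%), Query Frame = 0"
stats = parse_hsp_stats(line)
for label, (n, total, reported) in stats.items():
    print(label, recomputed_percent(n, total), reported)
```

Because gap columns count toward the denominator, the aligned length (327 here) can exceed the length of the query protein.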

The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
XP_023530715.1 | 3.0e-85 | 53.87 | probable prolyl 4-hydroxylase 7 [Cucurbita pepo subsp. pepo] >XP_023530716.1 pro...
XP_022931100.1 | 6.6e-85 | 53.87 | probable prolyl 4-hydroxylase 7 [Cucurbita moschata]
XP_022971148.1 | 1.5e-84 | 54.49 | probable prolyl 4-hydroxylase 7 [Cucurbita maxima] >XP_022971154.1 probable prol...
KAG6588394.1 | 1.1e-79 | 53.25 | putative prolyl 4-hydroxylase 7, partial [Cucurbita argyrosperma subsp. sororia]
XP_038905408.1 | 2.1e-78 | 50.78 | probable prolyl 4-hydroxylase 7 isoform X1 [Benincasa hispida] >XP_038905409.1 p...

Match Name | E-value | Identity (%) | Description
Q8L970 | 3.8e-59 | 42.63 | Probable prolyl 4-hydroxylase 7 OS=Arabidopsis thaliana OX=3702 GN=P4H7 PE=2 SV=...
F4J0A8 | 7.6e-52 | 42.31 | Probable prolyl 4-hydroxylase 6 OS=Arabidopsis thaliana OX=3702 GN=P4H6 PE=2 SV=...
Q8LAN3 | 6.3e-46 | 38.20 | Probable prolyl 4-hydroxylase 4 OS=Arabidopsis thaliana OX=3702 GN=P4H4 PE=2 SV=...
F4JAU3 | 2.0e-44 | 38.52 | Prolyl 4-hydroxylase 2 OS=Arabidopsis thaliana OX=3702 GN=P4H2 PE=1 SV=1
Q8GXT7 | 1.8e-29 | 33.20 | Probable prolyl 4-hydroxylase 12 OS=Arabidopsis thaliana OX=3702 GN=P4H12 PE=2 S...

Match Name | E-value | Identity (%) | Description
A0A6J1EYJ1 | 3.2e-85 | 53.87 | Procollagen-proline 4-dioxygenase OS=Cucurbita moschata OX=3662 GN=LOC111437385 ...
A0A6J1I5Z9 | 7.1e-85 | 54.49 | Procollagen-proline 4-dioxygenase OS=Cucurbita maxima OX=3661 GN=LOC111469906 PE...
A0A6J1DX45 | 1.3e-75 | 49.69 | Procollagen-proline 4-dioxygenase OS=Momordica charantia OX=3673 GN=LOC111024321...
A0A6J1DTY4 | 1.3e-75 | 49.69 | Procollagen-proline 4-dioxygenase OS=Momordica charantia OX=3673 GN=LOC111024321...
A0A1S3B814 | 8.2e-73 | 48.59 | Procollagen-proline 4-dioxygenase OS=Cucumis melo OX=3656 GN=LOC103487037 PE=3 S...

Match Name | E-value | Identity (%) | Description
AT3G28480.1 | 2.7e-60 | 42.63 | Oxoglutarate/iron-dependent oxygenase
AT3G28480.2 | 4.0e-56 | 40.37 | Oxoglutarate/iron-dependent oxygenase
AT3G28490.1 | 5.4e-53 | 42.31 | Oxoglutarate/iron-dependent oxygenase
AT5G18900.1 | 4.5e-47 | 38.20 | 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein
AT3G06300.1 | 1.4e-45 | 38.52 | P4H isoform 2
InterPro
Analysis Name: InterPro Annotations of Chayote (edule) v1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR003582 | ShKT domain | SMART | SM00254 | ShkT_1 | coord: 192..234, e-value: 2.1E-7, score: 40.6
IPR006620 | Prolyl 4-hydroxylase, alpha subunit | SMART | SM00702 | p4hc | coord: 61..180, e-value: 7.0E-5, score: 26.2
None | No IPR available | GENE3D | 2.60.120.620 | q2cbj1_9rhob like domain | coord: 53..118, e-value: 2.3E-13, score: 52.4; coord: 119..181, e-value: 5.5E-18, score: 67.4
None | No IPR available | PANTHER | PTHR10869:SF140 | OS03G0803500 PROTEIN | coord: 32..115; coord: 124..234
IPR044862 | Prolyl 4-hydroxylase alpha subunit, Fe(2+) 2OG dioxygenase domain | PFAM | PF13640 | 2OG-FeII_Oxy_3 | coord: 124..180, e-value: 1.4E-7, score: 32.3
IPR045054 | Prolyl 4-hydroxylase | PANTHER | PTHR10869 | PROLYL 4-HYDROXYLASE ALPHA SUBUNIT | coord: 32..115; coord: 124..234

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Sed0010756.1 | Sed0010756.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
cellular_component | GO:0110165 | cellular anatomical entity
molecular_function | GO:0005506 | iron ion binding
molecular_function | GO:0031418 | L-ascorbic acid binding
molecular_function | GO:0016705 | oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen