HG10001835 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
NameHG10001835
Typegene
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Descriptionprolyl 4-hydroxylase 1
LocationChr11: 887376 .. 905641 (+)
RNA-Seq ExpressionHG10001835
SyntenyHG10001835
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCCTCCGCTCCGATGAGGATTGTCTTCGGTCTTCTAACCTTCGTCACCGTCGGCATGATCATCGGTACGCATTTTTGTTTCTCTTTGTCGTTTTTTTGTTGACTCTATACATCGATGTTTTGTGCGGCTTGTTATTTACTCATTCGATGTCAAAATGGAGATTAACGATGTTGTTGTGTTTGGATGAACTGGAGTGATTAGTGGCTTTACGCTTTGTTGAGGTATCCGTCAATGTTAGTTAAATCTGAGTTATGTTGTAAGCCTGTTTTAGTAGGAGCTCTTGCGTGCGATTTTTCTTTGAGCTGGCTTCTCTAGAGGAGCTAGGAATTGCTGGTAACATTAGCGAATTACGTTTTTTCTTTCTTCCTCAACTTTTTAATTGTGCAGTTGCCGGCTCTATCAGAGATTGGCAGCTACACAGTGTTGACGACGACTTGTAAATGTTCCTTTGCTTCTCATTTTCTGCAATTTAGCTGTTCCGTGGACTTGATTTAATGGAAAGGAGAGCACTCCTTTCATTTTTCCAATAATTAAACGTTGCAGGTTTTATGTTTACTATGAATTGAATCATTTTAATGATGAATCTTCGTACCATTTGGCTGGATTCTTTGTTCGCATTTAGTAAAATTTAGTGGCTTGATTGTGTTCCTGTACTTCTAGAGAACTATGCAAAGTTTTTACTGATAAGGTGCTTTCTGGTAATTGCAGGTGCTTTGTTGCAACTAGCATTTATAAGAAGGCTGGAGGACTCTATTGGTACTCTTCTTGGTTTATGGAACAAATATATATTACTCTTCTTGGTTTATCTGTCATCTAGGATGTAAAAACTTGAAATTATATTTATAATGAAAGTATCTTATATTTGGAAAGAGGACGAGCACTTTTGGGGTTGGATTTGGATCATTTTGAATCTTGCATTTCAATTTTATTTTCATTCATGTTTCTGTTATCATGTCTACTCAAGTCATAAATAAGCAAGTATGTATATTATCAGAGACAACTTGCGTTCATTAATAAAAAGATCTATAAAATGTCGAGGAAATAGCATAAGGTTACAGTCAATATGCAGTATGAGTTAACTTATTTTGGTTCAAACTAATAGAAAAAGAAAGTTGTTCATGAACAGATTTGAAATCCCTAAGAAACTTAATTCTATAACATGCGTCATAGGCATACAGATCTTAGAAGGCCAAGTTCCGTGAAAAAGAGGAAAAGGATCAGGTGTGAAGACTTGCCCGCTAGTTTTGTAACATTTTTTTTAGTTCAACTTGACTAAAAAATTTCTACGGTAAGTATCTAGAATGGACCAAACACTTGGCCTTTGGAGCACATAGAAGATGTTTATCTAAGTGACTGCACTCATGATAGTTCAATTTGAAATTCGATGAAGAAAAAAGTTCTTACATAGATATTTTTATATTATCTAAAGTAGATGTATGGATCTCAATAGGATTTGAAATATTGTTTGCATACCATAAAACTTATGACTTCCAGGTATTTATTTGGAAGAATGACTGAACGATGCAGACACTTCACTTAGTCAAGAGCTCTTGACGTATCCATGCAAAATAATATCATCTATCTTTGCTGACAGCACGGCATTTGCTGGCTGCTGGATGTAACCTATATGTTTATGAGACATGGTTTTTTTTGGTATACTAAAATGATTGATTACGACCTTACATCTCTTATTTCACAATGAAATAGCATTATGTAACCTCACGGCATATATTTCCCTTTTCGTAATTTCACGTTATCAATGAAATTATGATTGAATGTTTCTCATTAAAAAAAAAAAGATTGACTTTTGTATGATCCATGTCTTTGCAGGCACGGAGTTTCTATCCGCTGGAAGCTTACATAAAACACAGTATGATAGCGAACGTCAATTACCCCGAGGTTGGAAAGATATTATAGGGTCAATTTTTACACAAAAATTTACCCCTATTAAATAACAATATAACATCAGAATTGGTTTGAAGGGGATGTATCTTTCTTCGATCAAATTGATGTGTTTGATCTTCTATGATTTGGAATGGCTGCATGTTGTTCATCTATGTTTTTGCTTCAAAAGTGGTTTTCAACGTCAGTTTATTTCATATGATGAAGAAAAGATTTGATGGAAATTTGACTTATATTACAACTTTTATTTTCTTCAGTCATTTTTGTTCTGTTCCTAGATTTTTAACCTAGCAGTAACCCATCCACAATTATTACTGAAAGTTTCCATCTGCATGTTTATAAAAAATTATGATTTCATCATTTTTTTTTGTTTGAAAGCAAGTTATGTTTTCATCAAAGGACTCTCTATACATGCGTTTTTTGTAATAACCATCATGCGTTTGCCTAGTGGTGATAAAGGGCCATGGTTTAATAAAGAACTTGGAACGAGTTCAATCCATGGTAGTCACATACCTATGATTTAATATTCTAGGAGTTTCCTTGGCACCCAAATGTTGTAGGGTCAGACAGGTTGTCCCGTGAGATTAGTCGAAGTGTGCGAAGGTTGGCCTACAGTCGACACTCACGGTTATTAAAAAAAATATGTTTGTAATTGCCTTATTTCTAGGGCTCTTTTCAAATATAGAAAAATGAGCCAAACTATTTACAAATATAGAAAATTTTACAGTCTATCACTGATAGATTGCGATAGCATTCTATCGCTCAGGCGATAGAATTTATTTGCTTGACCGATAGAAAGCGATAGAATTCTATCGTGGTCTATCACTGATAGACAGTGAAATTTTTCTATATTTGTAAATAGTTTGATATTTTTTCTGTTTATAGTAATTTCCCTTATTTTTATCCAGACTGGACCAATTGGAGAACCTTATTTATTATTATTATTTTAAAATGAACTCCTTTGGCCGGTTTTATTCCTTATCGTATCTTTGAATTTCTTTTTTTTTTTTTTTTTTTTGAAAAGATTAATGTTGGTATTAAGGTTCTAATCGAAGTAAGTTCAAATAGAGTTTTATAAAAAATAAACCTAGAAAAATTAGACTGAGATTTAGCAAACATAATGAGAATGTCCAATAATCAGCATGCTTTTCGCTCTGATTTTATTTTATTTTGATCTTGTCTTGTTCATTCTCTTTTTGCTAATTTTCAGGCCTTCCTAATTGGATTAATGACAAAGAAGCAGAAATTCTTCGTCTTGGCTACGTATGCTTATCCCTACTTTTCTATTTATGATTATCAGCGTTATTTTTTTGGTTTTGGGATCTATGACCTTGTTATTAAAAAAGATAGGATATTGAATATGAAAACAGTTTCTTTATCCTTTGTAAACAAAGTAAAAGTTAAGAACCAAGTAGATTGGGGGAAAACATATAATATGATTGTGTTCAAGAAGAAATGAGTAAGTAGGCTAAAGAATACCATAATTCAACAATGTCCTGGACACACCCACCCACCCACACAACACAACACACATACACACACACAAAAGAGGAAAGGGATATCCTCCAATCTAGATGAGGAAAGAGATTACTAGACTCTCTGTACTTGGCTTGAATCAAGACAAGATAGTAATTACCAATAGCCTTGGAAGAATAATTCCATCCAGAAGCACGGGAGAAGGAAAATACTCACATATTATATGTATACGGAAGTGGTCAAGTAATTCAAATATGAATGTAAAATAATTGAAACATTGTTTCATATCTCATGACATGTAAATATCTATGTTAAAAAATATGAGAAATATTATCTGAAAGAAGAAAAGATGGTCAATGAATGATTTGCAAGTAATTAAGAAATGTTGGGAGTACTTCTATCATGTCATTTCTATTAACAGGTCTGCTCTTTTACATTGATGTGTAATCTTTTCTAACATTTAACTGAAACTTTTTGACCTTACACGTGTGCTTTTTGGGGAATTATTTGTAGGCAAGTTTTCCATGTTATCAATTGAAGTTTCCTGTCCTAAAAGAAAGTTCCCATTTTATTTTGTGGAATAGGTTAAACCAGAAGTAGTTAGCTGGTCACCACGAATCATTGTATTGCATAATTTTTTGAGCACAGAGGTAAGCATAGCCCCATCAATATGGTAATTCTTTTGCTGGATGGTATTGTTTCACTGCAGATTCTTCCCTTACAGAATTTTTGTAGATAGATAGACTTTCAATTGTCATCAGTGTTTTAAGAAACCTTCTAGTGTGCGTGCCTAGAATTAATCTCAAAATTCTACACTTTGACACATAATCTTAACATGCATTAGTGGAATGTGTGTTGTCAACATTTGGTGAACTTAGACATGCAAAGACATGCAAAAGTTGCATGCTTCAAAATAAGGCGCGTGCCTTTTGTGAAACCCCAACGCTCAAAGCGCTGGGCCTTGAAGCTTTTCATTATTTTTAAAAAATAATAATAATAATTAGCGTTTTTCTTTCTTTATTAACTAAAAAAAATAAAATCTACTAAGCCTAAATGCAAAATTTCTTGTGTTTAGGGTTTTCCCCTCATATTTCCACTTCAACTATGCTTTCTCTTTTGTATGTAGTTCTTTTATATATAGGTGTGCGTTTGCACCTTGCGTTTAAGCTCCAGAAGACTATTTTGCTTTATTGCACCTTGAGCTTTAAAAAATATTGAGTGTCATAAATATATATATATATATATATATATATATATATATATATATATATATATATATAAAAGAAACAATTCTATTGATGAATGAACTTTACCAAAAAGGATGGAAAATCTACAAGATTACATAAAACTTTCCCAATTAGCTAAATGGGAAGCATACCAAAGAGGTTAGACAGTTTATCCCAAGAAATACCTTATAATATAACAAAATCGTAAAGCATGTTGAACGGGTGCTCCTTACTAAAAAAGACACTGATTTCGTTCCATCCAAGTTGAAACATGGGTTATTATCAGCCCATCATTCACGGCCTAGCAACACTCAGATTTATTTATTATATTTGTTTTAAGTATTTGATGGGCTTATGTATGTACTAAGAAGCACAGACACTGACACGTGACACGAATACGACACCAACACGACGACACACCAATTTCCAAAAAAGTAGGACATGACACGTTATTATTATTATTTTTTTAACAAAAATAAATATATATTGTGCATATAAACATATCGTGAAAATGCAACTAGTGTTGGACAAAAAAAAGGAAAGATTTAACAAAAGGAAAAAAAAACAGAAATGGAGAAAAAAGATGATAAAAACGGAAGAAAAGCAAAACATAAATGGAAAAAAAGGACAACAGAAGGTATGAGAAACAAAAATGGGGGAAAAAAAAGGGAAAAAAACAGAAATGGGAGGGAAAGAAACCAAAGTTGTCGGGTGGAGGGTTTTCAAGTCGCCGGCGTGGAGGGTCTTCAAATCGCTGGTGTTGGGTGGTCTTCAAGTCGTTCGCGTCTGATGAAGGAGGAAGAATCGTGGGTGGGCGGGCGTGGGAGACTCTTCAGCAAGTGCGTAAGTCAGTAAGTGTGTGACAAAAAGTTTAAAGATTTAAAAAATTTGGATTGGGATTGGGCTTAAAATGGCCATAATGGGTCTTAATTGTTTTTTTTTTAAAATTGCAGCTCCCGTGTCCTAGGGGTGTCTTGCCGTGTCCATAAGGTGTCCTGCCGTGTTGGAAATACAAAAAAATAAAATAATGGACACGTGTCCGACACTGGCACTTTGCCATTTTAGAAGTGTCGGTGCTTTATAGTGTATGTATATAGTCAAACCCATTATACAACACTACTAGGGTTTCAAAAACTTATATGTATACTCCATCATACATTCCGATTCAGTAAAATGAGACCACACAAATTTTCATAAAATATAATTTTAAACTTAGTATCAAAGCTATCCAAATTTTCCTGGACTCATAGGAAACAACATTGTTAACACTTCATCATCTAGAAGTGGGCTAGACTCAAGACTATGTAAAGTTGAATGAAAACACTATACAATTTGGAAGTCAATGGCTCTAGTTGTTCTACGAGGACATAGGTTGGACAGTTTCATATTGGGAACCAAACTCCATCAGAGCTCATAGCTGTTACCAACAAAGCAACTCCTCAACCAAAACCATTTCAAACCAAGAGCACAAAGAATGGATAATAGTGGATCAAGCCCTCTTAGCATGGCTATGTGGCTCAATGGGACAATCCATTGCCCCAAATGTTATAAACAATACAACCTCAAGGACTAGTGGAAGGCATTAGTGGAAGATAATTCATAATTATTGTCTCCTTCAGTTCTAAGGGTACGAGACTTGGTAACTCTTTGACTTATTTATTTTGTTTCTTGGACGCCTTTGTCTTCTTGTTTGTAGAAGGGATAGACATAATACACATGGTACTTTTGATGCCATGGAGCAATGAAACACATCTCGTTGAAGGAGAAGGGGGCTCTTGGGTTTGTAAGGCTTCAACCTCATCTTCCTCAGCAAAGTTCTCTAGGAGGAAGGTGCCAATCTCTGTGCCATACCAAATCAATTTGCTTTGGTGAAGGTTGGTGTGGGGTTTTGTCACCACACAGGGTGGGGTAGCAGAAAGAGGGAAAGGGTTGGATACATTCAAAGTCAAGGTGGGGAGGAGAAGTGAAAAATTTTGTGTAAGGGATGTGAAGGTGTCTTGTTGATTTATGGTGACAACTAGTTTGTCAGGAGAGGATGTAAGTAAAGTCAGGTCAGGTAGAATGGACAGGGGCGGGTTTGATTGTATGTCCAAGGGTATAAAAAAGTCAATGGGCTTTGGGCTGGAAGATCAGGCAGAGAGAGTTTGGTTGGTGGTTGTGGGCAAGGTGGGCCTTGATCCATTGTTTGAGATTTTTGGAGGAGTGGATGGGCGGGAAGATGGGTTTTGTAGGTTCTTGGACTTTGGAGGGATGTTGAGGTTAAAGTACGAGGTATGTGACTGATGTTTGGCGACAAAGGGAAGTAAATGTTATGTGTTATTGTAATGTCAACTTAAGGACAAGGGGTGTTAACAAAAAAAGCACCTCTTAGTTGTGGGTTGAGAGTCTTGAGCGTACGGATCTGAATTGTCATTAGAAGGTTTGTCAATTGCGTTGGGTTTATCATTTTTTTTCATTGAGAGACCTCTGATGATGGCTCCTTCACATGATGATTTAGGCCATATAACCAACGTGGTATTTAACCTTGAAAAATGGATTGACTTGTACGTTTAGAGTGGCCTTTGAAGATGAAGGTATTTCTACTCCTGGCAGGACAAAGCCAAAGCTGTTGCACTTCACCTTGATGCATGCACTTCTATGATGTCCATTTTGGACAAAGTCCTTTTCGGTGTATCAACATAACCCCCACAAGCCTCTGAGATCGTATCAGGGGTCCCAATGGTCAAGGGGCAGAATTTTTATTTTGATCCAACCTCTGTGAATAGAATCTGAGTTTCTTCTATAGGAATTTCATGTTCAAGCTTAAGGAGGAAGGATCCAACTTCATACCAACTTGCTTTGGTGCATAGGTCTTTTGCTTGTTTTCCATTTTCATATTGGAGGAAGGCTTGGTCCCTTCGGAGACATCTAAATCTAATAAAATTGGAAGATAGAGAGAAGATTAGCATGACCCTTGCTCAAGGATGACACTTACACATCGGAGGAGGGCTTCATTGGGTCGAAAAGAATTCAAGAAGCAAAAACCACTGGCTCATGGTTTTGTTCCAATCATCATGAAAGTGCTTTCTAGTGACCACAATGGAGGAGTCCCACTGAGTTGAGGCATGCTGACAAGAGGCACTGGGGTCCAAAACATTATTGTTGCTCATGGTTAAGACCACTTTGATGGTAATGACAATAATGTTCCTAACTCGTCCATAAGATAATTGACCATTTTCAACTGAAGCCCTTATAATTCGAAATGATCGTGAGATATTTTCTCTTCTTTACCTTTTGCATGAGATATTTGTGAGACTATTTATTTGGTTCCTTGCTACTTTCTGGCTGGAAAGAGATTAAGAAATGTAGGATTTTAGAAAGTAGATTAAGTTTAGGATAAAATAAAACATGAGTTGGGAGTGTGTAATTTTGATGCATATAAATAAGACCCAGTGATCTTAATGGTGGTGGAGGCTTTGACTCTTCATGGAGTCAATTATGTAAATCACGTTTTGTAAGCAAAGGTCTGAGAATTAGAAAGAGGCTATTGCTTTTAAACCTTTCATTGAATGCAATAGGGTGTTCTAGACTCTAGAGGTCCTAGTTATTTTTGGATTATTTGGCTTGAAAGGAACGTAGGGACATTTGAGACCAAAGATTGTATCAAGCTTCTGATGTCTGAAGTCTTGGTTTGGGCTTTAATCACCAAGTATATTTTCTTTTGATATTTCTCTTTCTTGATTGTTCATATTTCGGAGCTAGGGGGCTTATTTATTATTATCACTGTTATTATGTGCGTGTGTTTTTCTTAATGACAGAAAACTTCCCTCAGAGAAAGTAAGTACAGATCGAGAATATAAGGAATCATCCTAGCCTGAAGTTGAAGGAAACCGCAAAAGGGACACTCCAAGTGACATTACTTTTTTTAAGGAATCAGCACCTAAAAGTGGATTACATGAAAGTTCACCAATTAGCAATTAAAGTAGAGAGGCTATAGTCCTACCAATTCCTACCAATACTGAATGCTGAATCACAAATTTGTTCCTACCCATACTCAATGCTTTTGCTTTAACCCTTTAAGTTCCCCAAGTTGTTCAGTTTATTTTTCCAAAGAGCTGCCTTAATTGCATTTTTTTCTGGGTTAAAATTGAGGCCTTTTTTCTTGAGTGAGTGCCGAGACAAAAGCTGGCACGGGAGGTTTTTGAAGGTTCTAGAAAGGCAACCTCAATTACCAAATTTATTGAACAACCAGCTGGTGGAGGCAAAAGAACTACCAAAACAAAAAAGAAAAAGGACCGTGAAGGTTTTCAGCAATTTGATTGCGAAGACAACACCCGTTAGGCAGGATGCTTAGATTGGAAGTCTCCTAAGAACTTGGTCATTCTAATTGATGCCAGCCAAAGCCACACACCATGAAAAGATTTTAATTTGTTGAAGATATTCTATATTTGATTCACATTTGATCATTTCTTTTTGGAGAAGTTGGTAACCTCCATACATTTGTCTAAAGAGAGGCTTGCTTGTAAACCTTCCAAACATATCATCCAGCCACCACAGCTGGGTATCATGCTGAGGGATAGGCTTTTACATTCAAACTTTTTTAAGAAAGAGATATGTTTTCAGAAATTTCCCCTAGAACAAGATTGTATCATTGGCTTTTTTTGTTTTTTTGCTATGCTTTATAAATCTCAATTCTAATGGAATTGTATCAATTCTACTTTTGGTTCAAGAAGCCTATATTTGGATAGAGGGACATGATAGCTTCTGTTTTGACTAATTGATTACTTGTACAGGAGTGCGACTACCTTAAGGCAATAGCACTTCCTCGCCTTGAAATTTCCACGGTCGTGGATACAAAAACTGGGAAGGTATGCTTATTATTATTATTATTATTTGCTAACCATTTTAATTTTTGGTTTTTCTGCTTTTAGTTTTTGATTTTGGATTTTTTGTTTTCTGTTTTTGAAAAACAAAAACAATCATGTTTGGTGATTTTTTTTGCCTTTTAGTTTTCTAACCAGGCATGTTTGGTAACAATTTTCAGTTTAATTGTTTGGTAGTTTGATACTTGTTTTTAGTTATGTTAAATCGCCGATTGACCCAAAAGTTTAAGCTGGTGGGTAAAAACAAATTTAATTATATATCACTAACACTCCCCCTCACTCGTGGGCTTGAAATGTGAAGAAGACTCAACAAGCGATAATTGATATTAAATGGGGAGGAAACAACATTGCAGGGGCTTGAACACAGGACATCCTGGACCACCCGCTCTGATACCATGTTAAAAAAGTATAATTTCTCTCTTATTTCTTAGGGAAGAAAAGAGTTTCTTAAATAAAAGAAACTCCAATTACAAACATGGAAATAATAATTACATAGGGAAAGAAAAAATATAGCAAATATAATCACAATAAATATAATAAGATTTTGATAAAAAAGGAAATAATCAACACTCCCCCTCAAGTTGGTTTAAAGATATCATTCATGGCCAGCTTGTCGATTAGGTTGTTGAATTGCCACTTTGGAAGTCCTTTAGTCAGTACATCTGACATCTGCAATTTGTTCTGTTGCGGAAGATAGGGTATGCATATTATTCCCGCATCAATCTTCTCCTTTATGAAATGTTTATCAACTTCTATATGTTTTGTCCTATCATGAAGAACCGGATTGTGGGCAATGAAAATTGCTGCCTTATTATCACAATAAATGCACATGGGAATCGGCTGATCGAATCTCAGCTCGTCAAGTAGTCTTCTTATCCATATGCCCTCACAAATACCATGGGCCAACGCCCTAAATTCTGCTTCAGCACTGCGTCTCGTGACCACACTCTGTTTTTTACTTCGCCAAGTTACAAAATTTCCTCCAACAAAGGAACAATAACCAGAAGTGGATCTTCTATCAGTAGTACTACCTGCCCAATCTGCATTAGTGTAAACTTCGACATTCAGGTGATCATGCTTTCTAAACAATATACCTTTTCCAGGAGTGCCTTTCATGTATCCTAGGATTCTATAAATGACATCAAAGTGTGTTGGTCCAGGAGCATGCATGAACTGACTCACCATACTGACTGCAAAAGCAATGTCTAACCCTAAAATTTATTGGGAGATGTTGAAGTTAGGCAATGAAGAAAGGATATGTTGTATTCATTGTTGGAGTTTTAGCTATTTCGGGGGGTTTTCTTGTAAATCTCTCTTTCATTACTTACCGGATCCTTCACCCTTTAGAGGGTCGATTTCTTCTGTTCTTGGAAGGTGACAATCCCAAAGAAGGTGAAGTTCTTTATTTGCCAGGTTATTCATGAAAGGATTAATACCTTGGATAGGTTTTTGAGGAAGATGTAGTACTTGGTAGGGTCTTTTTGCTGTATTCTTTATCGGAGGGCAAAGGAAGATTTGGATCACTTATTTTGGAATCACACCCCTTGTTCAAAATTGCTACCAAACCATCTTGAGAAACTGGAGAATATTGAGAGGGGCTTCCAAATCTGCTGCTATAGATTTGTCTTTGCACTTCTCAATAATGGCAGAGTAAGCATCCTTTAGCTCTTTGGGGGGAGAGAAACAATAGAGTGTTTTGCGGTGTGGAGAGAGATCCTAGTGAGGTTTAGTTTTTTGTATGCCCTTGTATTCTTTCATTTTTTCTCAGTGAAGGTATCTGTAATTAAAAAAAAGAAAAGAAGTAAGCATCATTTAGCTCTTGTTTGTATTGGAGGAGATTCTTTAACTTCTTTTGAATATAATCTTACAAAGAACCATTTCTTGACAAATCAAATCCAGCAACCCACCAAGTTGTGCAATCCACGCTGAAAGTTTATCCATAAAAACAGAATTTGACTCCATAAAAGAAGCTTCACAAGGAACGAAGGAATACACCAGTAGACCAAATGCCTATTTCTAATGTTCAGATACCATTTTAGTGCACGTACTTTTAAAACATCCATTTCTAGTCCTCATGCTTTGAATAAATCTTTAGTTGGTTTGTTGTTAACGTTTACCTGACATGACAAAAAGCTAGCAAGGAACGAAGGAAGGAATACACCAGTAGACCAAATGCCTGTTTCTAATGTTCAGATACCATTTTAGTGCGTGTACTTTTAAAACATCCATTTCTAGTCCTCATACTTTGAATAAATCTTTAGTTGGTTTGTTGTTAATGTTTACATGACGTGACAAAAAGCTAGCAGTAAATTAACCGAAAACTGATAGCGATTGATTTTAAAAATTATTCAAAACATAGCAACTAAGTTGAATGTTTAGAAAGTCATAAATCAAAATAGACTAAGCAATAAAGTACAAGGAGTAAAATGTTTTTTAAACCAATTTTTAATTATATTCAAATGAACTTATTTAAAAGAACTAAATTAAAAAAGTATTTAATATATCGACAAAAAAGCAGTTTAAAAAGTAAAAGTGAAAAAAAAGAAAGGAAAAGGAAAAAAGAAAGTGGAAAAAACAATGCAGTGTCTGGTGTCTGAAAAAAAAAGAAAATTTTTCCATTTCTGAAATTTCTTTACCAATTAAAAAATGTTCTTAAAATAAAATACCAAAAACTGAATTTGTTAGCTAATGAACTTGTTTATTAAAAACCAAAATCCAAAAACCAGTTTGTTATCAAACTGCCCCTTAGGGATTTATTGAGGATGTTGGGGAAAAAAAAAACAGATTGAAGAAGGAATAGATAAAACAATCTCAAACTTACATCCTTCGGGCAAGTTTTCGATTAAAACCTTGTTTCACAATCAATCAGTAGGAAAGGATTGGATTGTATAACCATGAATTTCATATAATCAATGAAATTTTTTCTTATATAAAAAAACCTTGGGAAAGGATCTCGTGGAGGAGGTATGAAAAGGAAATTGTCCCAAAGAGGTCAAAAGATTTAGACTATCACCAAGGAAGAGATTTCTTTAAAAAAGGATGATAAAGTCAGCATAGTTTCAAACCTGAAAGAGGGAAGGATCCATCTAGGGAAATCACGGCATAGATAAATATTTATTATTTTCTAAATAAATAATTTTTATTTAAAATTTTTAATGGCTTTGCTTGGCTAAAGTGTGTATTTCTAGAATGTTGCAAATTTGTGCCGATAATTATGATAAGATAAAATATCGTTGTATTGTCAGAAAAAAAAAAAAGATAAAATATCGTTGTTTATGAGTAGATGATGTATTTGTTTTGAAGTGCTAATACTTCTATTTTTAGGGTTACTTTATTTTTCCTTGCGTGGATTTATTTTTTTCTTGATAGATTCTATAGTTCAAGATTGTGATGGATAGTTTTGCTTAATTCTTTTGTAAAAGATGTTACTTTTGTATTGGAGGACACACTTAGTAATGCTCTTTTGGTTGAATGATATATATATTTTTCCTATTAAAAAGGGTTTTTCTTTCAGCTGACAATGATATGTTAAGTATATATGGTCTCCATTGTGGATTTATTTTCAGGGAGTTAAGAGTGATTTCAGAACGAGCTCTGGAATGTTTTTAAGTCATCAAGAGAAAAATTATCCAATGGTCCAGGTATTAATTACAAAGAAAGCGTTGGTATATTTTGATGTTTCAAGTAATGGATATGGTCATATGGATGAGCTCAAAAGTCAGCTTTATGGAAAAACTTTCAATTTAGTTAAAGGGGATAAATTTACAAATCTGCTCCACTTGTTTTAGACAAGATTTTTATTTAAAGAAAACATGAATGATCATGTACGACTCATTTACATTATAGTCTATAGGCATGCTAAGCAAATCCAGTATTAGTTTTATAAACTATAGACTATTCCACAAACTTCAGATTTTCATTAACTTCAAATAGCCAAACTCAGTTCTTTAAGAGTGACAAAAAACAGCAGCAGACTTTAAACCAGTAGTCACTTAACAGTTAATAAGTTTTATTTCTATTAAAAAAAGAGAGTAAATAAGTGGCTGACCAAAAAGGCAAAAGCTAAAAATTACTGCATCAGTTACCCAACATTTACCCTGCAGCCAAAAAGAAGAAAGTAAAGATTCTTTAAAAGATCTTGCAAGAAGTTTTAAAGAAACAGATCGAGAACTTCTAAAGAAACAAATCACATCATGTCCGTCTCTCCAAAGAAAAAGCTCGTCCTCAAAGAAAAGGTTTCAGGAGGATGATATTTAAAAAATTGGCAACATTGAAGGTGGGGTTGTTCTTGAGATTTTCAGGTAGATACACTTCATATGCACTGTCACCAAATCTCTCAAGAATTTGAAAAGGTCCAATCTTCTTAGGATGAAACTTAGAATAGAAGTAGAACCAGAAGAAAATCTTGTTATGTTATTCACAAGTAATCAAACTTCCAAACTTAAACTATTGCTTGTTCTCGAGTAACGCTGCAAAATTGTTTCTATGCTTAACCTACAGTGTGCTAAACTACACTGCATTTGCCTGATGCATTTTTTTTGCTGTCTCAACAATGCGTTGTAAAAATGTCCAGCTATTTGAGCCTAAACTTAGCTCTGCCTTAATTCCCTCAAGGCACCTCTTTCTGCCTTATCCCAATTTCACCCGGCTCTCTTCCTAGTTCCTACAGTACTTCCTTAAAGTTAACTTACTTTTATTTTCTAGCTTGGATAATGCACTAACAATTCTCAATAGATTTAAACAAAAAATGACGAATACCGATCTGTCTGCCCATTTTTGTAGTTCATAGTTTAGGTCACTTTTTTTGGACTTCTTTTTTATGTCATATTCTTTCACTTTTCTTAATGGGAGTTCAATTTCTCATTAAAAAAATTGAAAATTTTTATAGGAACTCAAAAAGAAAATTAATCATGGTACATAAATATTAGATCATAAAGCTTGAAGCGACATGATTATGTACCGCAAAGGTATTAATACATTACTGAAGAGCGATAGTCAATCCTTAACATTCCGTCATTATTCCTTAACGTTGAAGTGCATTTGAAATTGTAGATCTTAAAAGTTCAGTAAATTTGTTGAAAATCCATGAATGTGAGGTGTGTTGTTCAAATTGGGTAAGGTATCAATCTCTTCCTTCATCAGCTCCTCGGTCCTAACCAGCCTCCCAGAAGGAACAAGTTTATAGAGAATCTGGAAATTAGAAATTTTGTTTTGAAATTACCTTTTCCACAAAAGGAAATATTATAAAGTATGCTAAATGTTGACAACATCTTTCATTATTACAGCCCTTTTGGGGTTCTCTCTCTCAGTTGGTGATAGACTTCCTGTGTTTTGGAATCTGTGAAAGCAGGCAATTGAAAAAAGAATTTCCGTTTATTCTCAAATACCAGTAGAAAATGGAGAGCTCATTCAAGTGTTAAGGTATGAATATAATTTTGTTTCTGTGAAGATGCATGTGAACACGAAAAGTGGTTTGTACTTCAGTTTTAAGTCTAACATTTAAAGCAGTACAGGTACGAGAAGAGTCAATTTTACAAGCCTCATCATGACTACTTTTCTGATACTGTAAGAGTTTTGTTATTTCTTCAGCCATGAGCCCTAATATTCTAAAATCAGCATTTATTTATATTTTTATTTTAAGTATTCTGAACTTGTGCTCTTGCTGATGGCATGCGTCGTTTCTTCCATGTCTTCTTTTTACAGTTTAACTTGAAGCGTGGTGGTCAGCGAATAGCAACTATGCTTATGTACCTAAGTGAAAACGTTGAAGGAGGAGAAACCTACTTTCCGAAGGTATTTACCATTTTCCTTGCTGAATTAAATTTCATTGCCATAATGTATTAGTTTTCTTACTCATTTTATTAAAACCAACCAATTGATATTAAAGAACAAGACAATTACTAAGCACGAAGAGAATAGAGACATCCTCTTAGACAGAAGCCCAGTTCGATGTATTAGTTAATGTAAAAAGAAAAGTTCACTTTTGCTGTCCATTTCGTAGATAATCAACTTGTTTTATACCATTTTAGTCATGGAACTGGGAGTTGATACTGAGCCCCATAAAAGGACAGGACAGTATTCTTATTACCTGTTGGAGTTTCTGATGTATGATAGTGGAAGCAGTACAAGAGATGAATAGGGTGTGAACTACGACGAGCCTAGAAGCATGCATGGATGGACACAACATTGGAATTTCAATGCAACATTTTCTTAAAAATTAGGAAGGGAGAAATCCAACAAAATATGTTTTTGAAAAGTACTTTTGTATACTTTCCTTATCAATATCATATAAAGAATTAAGATATAAAAGTAACTTCTCAACCATTGAATTTCAAGCAAATTGAAACATTGTAAAGTCAATTATATTTCTTTGTCGACAAATATTTTCCACATTGAAGTCTAGTGTTCCATCAACGACACGGCTTTAAAGGGGGTAAAAAACCGTTGTTTTTGCATCTTCTCATACCAGTACAATAAAGTTCAGTAGGAGTGAAATCTACTTTTTTTTTTTACTTCTTACTTCTTAGTTTATTTTCCTTGGGGTTTCATACCTTCATTCATTAATATCCTAGTTCCTTGACGAATATTATGAATATTGATTAGAAAGAGATCTTTATGTTTACTATTCTTTTTCTTTATCTCCTCTGCAAACCTTGTCTTTCAATTCCATGCGTCCTGAGCCCGTATTAATTTGTTTCACTAGTTGGACACTTCAAATCATGTATCTGTAAGACCCTTAGTTGGGAAAATATTAGGATTATTAGTATGATTAGTAGTAAGGGGTGTTCTAGTAATTAGACATTAAGTTTGTTAGGTCTTTTGAGTTATAAATAGTAGGTGAGAGTGAGGCTTGAGATGTGGAGAATCTGGTGAGCATTTGGGCTCTTGGGAGAGGATACTCAACCCTCTAGAATGTGCTGAGGAGTTTTGGTTCATATGCCCTTGCTTGTAATGGTTCTTAGCTTGACATTACTATTAGTTGGCCTTTCCTTGAAGTAAATATATAACACAAGAGTGTTTGCTTTGATTTCTTGGATATTTGTGTTCTTGTTGGGATTTTCTGTTTGCAGGAAGAAGAATCCTAACATTATTCTACTCCTTTTCTCCTAGCATGTTCAAATATCCTCATTGGGAACAGGCATGTATCAACAGTAGTACTCTATCTCACAATGGAGCATCCATACTAAATAGAGTTAGAAAGGAGGAAAAAAGAAAAAGAAAAGAAGAGAGGAGTCCGAGGACAATAGAGTTATTCTAGCATACCCATTATCAAAGGCAACTTATATTATGCTGATTGAAATCAAATAAATGTGGTAAACGAGGAGAGTTTCAGATTTGGGCTCCTGCTTGAGCACTGTGGCGTTCTATATTTGGAAAATAACTGTATTATTTTGACAATTAGTCTGCTTTCCAATCTTAGTTATCTCAGGGATAGAAAGCGGCATGGTGCTATTGCAATGGCATCTTGCAGCTGCAGGTTATGTGATCAAATTTGTCTACTTATTTTTACTCAAATTTAGAAACCAAAATGTTTGATATTTTTAGAATTTTGGCGAAGATTTCATTGTATTAAATAAAATAAATAACAAAACTAGTAGTAAACAAGTTTAGCCTTTTAAAAAACTGAAGTAAAATAAGATTGTTTTCAAATTTGACCGTAGAATAGCATCTGATTGTGTTTAAGTTGGTAATTATCGTTGAAAAAACTAATGGACGTTTGGATTAATCAAATGGCCAGTAGTCTATCAATGATAAGGAAGGCCACCCTAAGCTATTGAAGTTGCTTAGGTTGGAGGAAAATGAAAGGAAGTAGGCCAGAATTTAGAATGAAGAGAAAGTACGAAGAGAAACCTTTTTGATGGAGTGCTATGACACACTGCGGAGTTTAGCATGAATTTGATAGACAGAGCAACGCCGGCTTTTCCCCTTTTAAATCTTGGTTTTTCTCGTCTGATCCTACTTTCTTGCAAGTGTCGTGTCCTGAATTTTCCTTTTGTATATTTTCTTCAGGCTGGTTCTGGTGAGTGTAGCTGTGGTGGGAAGACCGTTCCAGGACTGTCAGTTAAACCATCCAAAGGAGATGCAGTGCTTTTCTGGAGCATGGTATGCATCTTTTGCCTCTTGAAATGAAATGAATGATGTGAAAACAAAATGATTATGATTTTGACTACTAAATCAAGAAATCTGACACCTTAAGATTTGTTTCAGAGTTGCCCTAAAAGTCGTGGCCTGGGACCCCAGCCTCAACTTTAAAACTTAATAACTAAAGAGTTAGTTTGCTAACGTTTCTGTTTCTTGTTTTTCATTATGAGTTTCTTGTTCACGTCTCTCGTTTTTCTCTAAAACATAAATTACAACATTATTTTTATCATTCTTGACTTATAAAAGGGATAATCAATATTTAAAAATTATTAACACAATTTTGTTTTTCTTTTATTAAAAAAGTAAAAGCGAGGAACATGAACCTGTAAGTGAGAAACAAAAAACAGGAACTTCACCAAATGGACGAGTCCTTAAATTCTTTATCAATTCTTGGAGTTGGTCTTAAAGTGGATTAAGAGATAAAAGTAAGAAAATATATTTAAAAAAGAAAGAAAAACCCTTTCCTTAGAGCCTGTTTGGAATAAACTCCCAATAGATGGCTGTTGTTTTAACCATTTATTTCATATGTGAATGCTTTCAGGGCTTAGATGGACAATCAGATCCAAGTAGCATTCATGGAGGGTGTGAAGTACTGGCAGGGGAAAAATGGTCTGCCACAAAATGGATGAGACAAAAGAGTACTCTGGTACCATAA

mRNA sequence

ATGGCCTCCGCTCCGATGAGGATTGTCTTCGGTCTTCTAACCTTCGTCACCGTCGGCATGATCATCGGAGAGGATGTAAGTAAAGTCAGGTCAGGTAGAATGGACAGGGGCGGGTTTGATTGTATGTCCAAGGGTATAAAAAAGTCAATGGGCTTTGGGCTGGAAGATCAGGCAGAGAGAGTTTGGTTGGTGGTTGTGGGCAAGGTGGGCCTTGATCCATTGTTTGAGATTTTTGGAGGAGTGGATGGGCGGGAAGATGGGTTTTGTAGGTTCTTGGACTTTGGAGGGATGTTGAGGTTAAAGTACGAGGAGTGCGACTACCTTAAGGCAATAGCACTTCCTCGCCTTGAAATTTCCACGGTCGTGGATACAAAAACTGGGAAGGGAGTTAAGAGTGATTTCAGAACGAGCTCTGGAATGTTTTTAAGTCATCAAGAGAAAAATTATCCAATGGTCCAGGCAATTGAAAAAAGAATTTCCGTTTATTCTCAAATACCAGTAGAAAATGGAGAGCTCATTCAAGTGTTAAGGTACGAGAAGAGTCAATTTTACAAGCCTCATCATGACTACTTTTCTGATACTTTTAACTTGAAGCGTGGTGGTCAGCGAATAGCAACTATGCTTATGTACCTAAGTGAAAACGTTGAAGGAGGAGAAACCTACTTTCCGAAGGCTGGTTCTGGTGAGTGTAGCTGTGGTGGGAAGACCGTTCCAGGACTGTCAGTTAAACCATCCAAAGGAGATGCAGTGCTTTTCTGGAGCATGGGCTTAGATGGACAATCAGATCCAAGTAGCATTCATGGAGGGTGTGAAGTACTGGCAGGGGAAAAATGGTCTGCCACAAAATGGATGAGACAAAAGAGTACTCTGGTACCATAA

Coding sequence (CDS)

ATGGCCTCCGCTCCGATGAGGATTGTCTTCGGTCTTCTAACCTTCGTCACCGTCGGCATGATCATCGGAGAGGATGTAAGTAAAGTCAGGTCAGGTAGAATGGACAGGGGCGGGTTTGATTGTATGTCCAAGGGTATAAAAAAGTCAATGGGCTTTGGGCTGGAAGATCAGGCAGAGAGAGTTTGGTTGGTGGTTGTGGGCAAGGTGGGCCTTGATCCATTGTTTGAGATTTTTGGAGGAGTGGATGGGCGGGAAGATGGGTTTTGTAGGTTCTTGGACTTTGGAGGGATGTTGAGGTTAAAGTACGAGGAGTGCGACTACCTTAAGGCAATAGCACTTCCTCGCCTTGAAATTTCCACGGTCGTGGATACAAAAACTGGGAAGGGAGTTAAGAGTGATTTCAGAACGAGCTCTGGAATGTTTTTAAGTCATCAAGAGAAAAATTATCCAATGGTCCAGGCAATTGAAAAAAGAATTTCCGTTTATTCTCAAATACCAGTAGAAAATGGAGAGCTCATTCAAGTGTTAAGGTACGAGAAGAGTCAATTTTACAAGCCTCATCATGACTACTTTTCTGATACTTTTAACTTGAAGCGTGGTGGTCAGCGAATAGCAACTATGCTTATGTACCTAAGTGAAAACGTTGAAGGAGGAGAAACCTACTTTCCGAAGGCTGGTTCTGGTGAGTGTAGCTGTGGTGGGAAGACCGTTCCAGGACTGTCAGTTAAACCATCCAAAGGAGATGCAGTGCTTTTCTGGAGCATGGGCTTAGATGGACAATCAGATCCAAGTAGCATTCATGGAGGGTGTGAAGTACTGGCAGGGGAAAAATGGTCTGCCACAAAATGGATGAGACAAAAGAGTACTCTGGTACCATAA

Protein sequence

MASAPMRIVFGLLTFVTVGMIIGEDVSKVRSGRMDRGGFDCMSKGIKKSMGFGLEDQAERVWLVVVGKVGLDPLFEIFGGVDGREDGFCRFLDFGGMLRLKYEECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSHQEKNYPMVQAIEKRISVYSQIPVENGELIQVLRYEKSQFYKPHHDYFSDTFNLKRGGQRIATMLMYLSENVEGGETYFPKAGSGECSCGGKTVPGLSVKPSKGDAVLFWSMGLDGQSDPSSIHGGCEVLAGEKWSATKWMRQKSTLVP
Homology
BLAST of HG10001835 vs. NCBI nr
Match: XP_038904320.1 (prolyl 4-hydroxylase 1 isoform X1 [Benincasa hispida])

HSP 1 Score: 411.4 bits (1056), Expect = 6.6e-111
Identity = 220/308 (71.43%), Postives = 237/308 (76.95%), Query Frame = 0

Query: 1   MASAPMRIVFGLLTFVTVGMIIG-----------ED---VSKVRSGRMDRGGFDCMSKGI 60
           MASAPMRIVFGLLTFVTVGMIIG           ED      + +GR+ +  +D   +  
Sbjct: 1   MASAPMRIVFGLLTFVTVGMIIGALLQLAFIRRLEDSIGTEFLSAGRLHKTQYDSQRQLS 60

Query: 61  KKSMGFGLEDQAE--RVWLVVVGKVGLDPLFEIFGGVDGREDGFCRFLDFGGMLRLKYEE 120
           +    +  + +AE  R+  V    V   P   +                      L  EE
Sbjct: 61  RGLPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNF------------------LSTEE 120

Query: 121 CDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSHQEKNYPMVQAIEKRISVYSQ 180
           CDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSHQEKNYPMVQAIEKRISVYSQ
Sbjct: 121 CDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSHQEKNYPMVQAIEKRISVYSQ 180

Query: 181 IPVENGELIQVLRYEKSQFYKPHHDYFSDTFNLKRGGQRIATMLMYLSENVEGGETYFPK 240
           IPVENGELIQVLRYEK+QFYKPHHDYFSDTFNLKRGGQRIATMLMYLSEN+EGGETYFPK
Sbjct: 181 IPVENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFPK 240

Query: 241 AGSGECSCGGKTVPGLSVKPSKGDAVLFWSMGLDGQSDPSSIHGGCEVLAGEKWSATKWM 293
           AGSGECSCGGKTVPGLSVKP+KGDAVLFWSMGLDGQSDP+SIHGGCEVL+GEKWSATKWM
Sbjct: 241 AGSGECSCGGKTVPGLSVKPAKGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWSATKWM 290

BLAST of HG10001835 vs. NCBI nr
Match: XP_008453926.1 (PREDICTED: prolyl 4-hydroxylase 1 isoform X6 [Cucumis melo])

HSP 1 Score: 407.9 bits (1047), Expect = 7.3e-110
Identity = 213/294 (72.45%), Postives = 232/294 (78.91%), Query Frame = 0

Query: 1   MASAPMRIVFGLLTFVTVGMIIGEDVSKVRSGRMDRGGFDCMSKGIKKSMGFGLEDQAE- 60
           M SA MRIVFGLLTFVTVGMIIG +   + +GR+ +  +D   +  +    +  + +AE 
Sbjct: 1   MVSAQMRIVFGLLTFVTVGMIIGTEF--LPAGRLHKTQYDSQRQLPRGLPNWINDKEAEI 60

Query: 61  -RVWLVVVGKVGLDPLFEIFGGVDGREDGFCRFLDFGGMLRLKYEECDYLKAIALPRLEI 120
            R+  V    V   P   +                      L  EECDYLK IALPRLEI
Sbjct: 61  LRLGYVKPEVVSWSPRIIVLHNF------------------LSTEECDYLKGIALPRLEI 120

Query: 121 STVVDTKTGKGVKSDFRTSSGMFLSHQEKNYPMVQAIEKRISVYSQIPVENGELIQVLRY 180
           STVVDTKTGKGVKSDFRTSSGMFLSH EKNYPMVQAIEKRISVYSQIPVENGELIQVLRY
Sbjct: 121 STVVDTKTGKGVKSDFRTSSGMFLSHHEKNYPMVQAIEKRISVYSQIPVENGELIQVLRY 180

Query: 181 EKSQFYKPHHDYFSDTFNLKRGGQRIATMLMYLSENVEGGETYFPKAGSGECSCGGKTVP 240
           EK+QFYKPHHDYFSDTFNLKRGGQRIATMLMYLSEN+EGGETYFPKAGSGECSCGGKTVP
Sbjct: 181 EKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFPKAGSGECSCGGKTVP 240

Query: 241 GLSVKPSKGDAVLFWSMGLDGQSDPSSIHGGCEVLAGEKWSATKWMRQKSTLVP 293
           GLSVKP+KGDA+LFWSMGLDGQSDP+SIHGGCEVL+GEKWSATKWMRQKSTLVP
Sbjct: 241 GLSVKPAKGDAILFWSMGLDGQSDPNSIHGGCEVLSGEKWSATKWMRQKSTLVP 274

BLAST of HG10001835 vs. NCBI nr
Match: XP_016901569.1 (PREDICTED: prolyl 4-hydroxylase 1 isoform X4 [Cucumis melo])

HSP 1 Score: 405.6 bits (1041), Expect = 3.6e-109
Identity = 216/295 (73.22%), Postives = 235/295 (79.66%), Query Frame = 0

Query: 1   MASAPMRIVFGLLTFVTVGMIIGEDVSKVRSGRM-DRGGFDCMSKGIKKSMGFGLEDQAE 60
           M SA MRIVFGLLTFVTVGMIIG  +      R+ D  G + +  G      +  + Q  
Sbjct: 1   MVSAQMRIVFGLLTFVTVGMIIGALLQLAFLRRLEDSIGTEFLPAGRLHKTQYDSQRQLP 60

Query: 61  RVWLVVVGKVGLDPLFEIFGGVDGREDGFCRFLD--FGGMLRLKYEECDYLKAIALPRLE 120
           RV +        +  F    G +    G   +++     +LRL Y ECDYLK IALPRLE
Sbjct: 61  RVTV-------KNREFSKELGGNQFNPGLPNWINDKEAEILRLGY-ECDYLKGIALPRLE 120

Query: 121 ISTVVDTKTGKGVKSDFRTSSGMFLSHQEKNYPMVQAIEKRISVYSQIPVENGELIQVLR 180
           ISTVVDTKTGKGVKSDFRTSSGMFLSH EKNYPMVQAIEKRISVYSQIPVENGELIQVLR
Sbjct: 121 ISTVVDTKTGKGVKSDFRTSSGMFLSHHEKNYPMVQAIEKRISVYSQIPVENGELIQVLR 180

Query: 181 YEKSQFYKPHHDYFSDTFNLKRGGQRIATMLMYLSENVEGGETYFPKAGSGECSCGGKTV 240
           YEK+QFYKPHHDYFSDTFNLKRGGQRIATMLMYLSEN+EGGETYFPKAGSGECSCGGKTV
Sbjct: 181 YEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFPKAGSGECSCGGKTV 240

Query: 241 PGLSVKPSKGDAVLFWSMGLDGQSDPSSIHGGCEVLAGEKWSATKWMRQKSTLVP 293
           PGLSVKP+KGDA+LFWSMGLDGQSDP+SIHGGCEVL+GEKWSATKWMRQKSTLVP
Sbjct: 241 PGLSVKPAKGDAILFWSMGLDGQSDPNSIHGGCEVLSGEKWSATKWMRQKSTLVP 287

BLAST of HG10001835 vs. NCBI nr
Match: XP_008453928.1 (PREDICTED: prolyl 4-hydroxylase 1 isoform X7 [Cucumis melo])

HSP 1 Score: 404.1 bits (1037), Expect = 1.1e-108
Identity = 215/293 (73.38%), Postives = 231/293 (78.84%), Query Frame = 0

Query: 1   MASAPMRIVFGLLTFVTVGMIIGEDVSKVRSGRM-DRGGFDCMSKGIKKSMGFGLEDQAE 60
           M SA MRIVFGLLTFVTVGMIIG  +      R+ D  G + +  G      +  + Q  
Sbjct: 1   MVSAQMRIVFGLLTFVTVGMIIGALLQLAFLRRLEDSIGTEFLPAGRLHKTQYDSQRQLP 60

Query: 61  RVWLVVVGKVGLDPLFEIFGGVDGREDGFCRFLDFGGMLRLKYEECDYLKAIALPRLEIS 120
           R         GL         ++ +E           +LRL Y ECDYLK IALPRLEIS
Sbjct: 61  R---------GLP------NWINDKE---------AEILRLGY-ECDYLKGIALPRLEIS 120

Query: 121 TVVDTKTGKGVKSDFRTSSGMFLSHQEKNYPMVQAIEKRISVYSQIPVENGELIQVLRYE 180
           TVVDTKTGKGVKSDFRTSSGMFLSH EKNYPMVQAIEKRISVYSQIPVENGELIQVLRYE
Sbjct: 121 TVVDTKTGKGVKSDFRTSSGMFLSHHEKNYPMVQAIEKRISVYSQIPVENGELIQVLRYE 180

Query: 181 KSQFYKPHHDYFSDTFNLKRGGQRIATMLMYLSENVEGGETYFPKAGSGECSCGGKTVPG 240
           K+QFYKPHHDYFSDTFNLKRGGQRIATMLMYLSEN+EGGETYFPKAGSGECSCGGKTVPG
Sbjct: 181 KNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFPKAGSGECSCGGKTVPG 240

Query: 241 LSVKPSKGDAVLFWSMGLDGQSDPSSIHGGCEVLAGEKWSATKWMRQKSTLVP 293
           LSVKP+KGDA+LFWSMGLDGQSDP+SIHGGCEVL+GEKWSATKWMRQKSTLVP
Sbjct: 241 LSVKPAKGDAILFWSMGLDGQSDPNSIHGGCEVLSGEKWSATKWMRQKSTLVP 268

BLAST of HG10001835 vs. NCBI nr
Match: XP_022137963.1 (prolyl 4-hydroxylase 1 [Momordica charantia])

HSP 1 Score: 403.3 bits (1035), Expect = 1.8e-108
Identity = 213/294 (72.45%), Postives = 234/294 (79.59%), Query Frame = 0

Query: 1   MASAPMRIVFGLLTFVTVGMIIGEDVSKVRSGRM-DRGGFDCMSKGIKKSMGFGLEDQAE 60
           MASAPMRIVFGLLTFVT+GMIIG         R+ D  G + +S G      +  + Q  
Sbjct: 1   MASAPMRIVFGLLTFVTLGMIIGALFQLAFIRRLEDSYGTEFLSAGRLHKTQYDGDRQLP 60

Query: 61  RVWLVVVGKVGLDPLFEIFGGVDGREDGFC-RFLDFGGMLRLKYEECDYLKAIALPRLEI 120
           R +   +     + L    G V      +  R +       L  EECDYL+A+ALPRLE+
Sbjct: 61  RGFPNWINDREAEIL--RLGYVKPEVVSWSPRIIVLHNF--LSTEECDYLRAVALPRLEV 120

Query: 121 STVVDTKTGKGVKSDFRTSSGMFLSHQEKNYPMVQAIEKRISVYSQIPVENGELIQVLRY 180
           STVVDTKTGKGVKSDFRTSSGMFLSHQEKNYPM+QAIEKRISVYSQIP+ENGELIQVLRY
Sbjct: 121 STVVDTKTGKGVKSDFRTSSGMFLSHQEKNYPMIQAIEKRISVYSQIPIENGELIQVLRY 180

Query: 181 EKSQFYKPHHDYFSDTFNLKRGGQRIATMLMYLSENVEGGETYFPKAGSGECSCGGKTVP 240
           EK+QFYKPHHDYFSDTFNLKRGGQR+ATMLMYLS+NVEGGETYFPKAGSGECSCGGKTVP
Sbjct: 181 EKNQFYKPHHDYFSDTFNLKRGGQRVATMLMYLSDNVEGGETYFPKAGSGECSCGGKTVP 240

Query: 241 GLSVKPSKGDAVLFWSMGLDGQSDPSSIHGGCEVLAGEKWSATKWMRQKSTLVP 293
           GLSVKP KGDAVLFWSMGLDGQSDP+SIHGGCEVL+GEKWSATKWMRQKSTLVP
Sbjct: 241 GLSVKPVKGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWSATKWMRQKSTLVP 290

BLAST of HG10001835 vs. ExPASy Swiss-Prot
Match: Q9ZW86 (Prolyl 4-hydroxylase 1 OS=Arabidopsis thaliana OX=3702 GN=P4H1 PE=1 SV=1)

HSP 1 Score: 349.4 bits (895), Expect = 4.0e-95
Identity = 184/290 (63.45%), Postives = 217/290 (74.83%), Query Frame = 0

Query: 6   MRIVFGLLTFVTVGMIIGEDVSKVRSGRMDRGGFDCMSKGIKKSMGFGLEDQAERVWLVV 65
           M+IVFGLLTFVTVGM+IG  +      R++    D    G       GL  Q  R +L  
Sbjct: 5   MKIVFGLLTFVTVGMVIGSLLQLAFINRLE----DSYGTGFPSLR--GLRGQNTR-YLRD 64

Query: 66  VGKVGLDPLFEI--FGGVDGREDGFCRFL----DFGGMLRLKYEECDYLKAIALPRLEIS 125
           V +   D   E+   G V      +   +    DF     L  EEC+YLKAIA PRL++S
Sbjct: 65  VSRWANDKDAELLRIGNVKPEVVSWSPRIIVLHDF-----LSPEECEYLKAIARPRLQVS 124

Query: 126 TVVDTKTGKGVKSDFRTSSGMFLSHQEKNYPMVQAIEKRISVYSQIPVENGELIQVLRYE 185
           TVVD KTGKGVKSD RTSSGMFL+H E++YP++QAIEKRI+V+SQ+P ENGELIQVLRYE
Sbjct: 125 TVVDVKTGKGVKSDVRTSSGMFLTHVERSYPIIQAIEKRIAVFSQVPAENGELIQVLRYE 184

Query: 186 KSQFYKPHHDYFSDTFNLKRGGQRIATMLMYLSENVEGGETYFPKAGSGECSCGGKTVPG 245
             QFYKPHHDYF+DTFNLKRGGQR+ATMLMYL+++VEGGETYFP AG G+C+CGGK + G
Sbjct: 185 PQQFYKPHHDYFADTFNLKRGGQRVATMLMYLTDDVEGGETYFPLAGDGDCTCGGKIMKG 244

Query: 246 LSVKPSKGDAVLFWSMGLDGQSDPSSIHGGCEVLAGEKWSATKWMRQKST 290
           +SVKP+KGDAVLFWSMGLDGQSDP SIHGGCEVL+GEKWSATKWMRQK+T
Sbjct: 245 ISVKPTKGDAVLFWSMGLDGQSDPRSIHGGCEVLSGEKWSATKWMRQKAT 282

BLAST of HG10001835 vs. ExPASy Swiss-Prot
Match: F4JZ24 (Probable prolyl 4-hydroxylase 10 OS=Arabidopsis thaliana OX=3702 GN=P4H10 PE=3 SV=1)

HSP 1 Score: 199.1 bits (505), Expect = 6.8e-50
Identity = 107/197 (54.31%), Postives = 127/197 (64.47%), Query Frame = 0

Query: 100 LKYEECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSHQEKNYPMVQAIEKRI 159
           L  EEC YL  +A P +E STVVD KTGK   S  RTSSG FL+        ++ IEKRI
Sbjct: 94  LTKEECKYLIELAKPHMEKSTVVDEKTGKSTDSRVRTSSGTFLARGRDK--TIREIEKRI 153

Query: 160 SVYSQIPVENGELIQVLRYEKSQFYKPHHDYFSDTFNLKRGGQRIATMLMYLSENVEGGE 219
           S ++ IPVE+GE +QVL YE  Q Y+PH+DYF D +N + GGQRIAT+LMYLS+  EGGE
Sbjct: 154 SDFTFIPVEHGEGLQVLHYEIGQKYEPHYDYFMDEYNTRNGGQRIATVLMYLSDVEEGGE 213

Query: 220 TYFPKAGS-----------GECSCGGKTVPGLSVKPSKGDAVLFWSMGLDGQSDPSSIHG 279
           T FP A              EC  G     GLSVKP  GDA+LFWSM  D   DPSS+HG
Sbjct: 214 TVFPAAKGNYSAVPWWNELSECGKG-----GLSVKPKMGDALLFWSMTPDATLDPSSLHG 273

Query: 280 GCEVLAGEKWSATKWMR 286
           GC V+ G KWS+TKW+R
Sbjct: 274 GCAVIKGNKWSSTKWLR 283

BLAST of HG10001835 vs. ExPASy Swiss-Prot
Match: Q9LN20 (Probable prolyl 4-hydroxylase 3 OS=Arabidopsis thaliana OX=3702 GN=P4H3 PE=2 SV=1)

HSP 1 Score: 198.0 bits (502), Expect = 1.5e-49
Identity = 105/194 (54.12%), Postives = 128/194 (65.98%), Query Frame = 0

Query: 100 LKYEECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSHQEKNYPMVQAIEKRI 159
           L  EEC+YL ++A P +  STVVD++TGK   S  RTSSG FL        +++ IEKRI
Sbjct: 92  LSKEECEYLISLAKPHMVKSTVVDSETGKSKDSRVRTSSGTFLRRGRDK--IIKTIEKRI 151

Query: 160 SVYSQIPVENGELIQVLRYEKSQFYKPHHDYFSDTFNLKRGGQRIATMLMYLSENVEGGE 219
           + Y+ IP ++GE +QVL YE  Q Y+PH+DYF D FN K GGQR+ATMLMYLS+  EGGE
Sbjct: 152 ADYTFIPADHGEGLQVLHYEAGQKYEPHYDYFVDEFNTKNGGQRMATMLMYLSDVEEGGE 211

Query: 220 TYFPKAGSGECS---------CGGKTVPGLSVKPSKGDAVLFWSMGLDGQSDPSSIHGGC 279
           T FP A     S         CG K   GLSVKP  GDA+LFWSM  D   DP+S+HGGC
Sbjct: 212 TVFPAANMNFSSVPWYNELSECGKK---GLSVKPRMGDALLFWSMRPDATLDPTSLHGGC 271

Query: 280 EVLAGEKWSATKWM 285
            V+ G KWS+TKWM
Sbjct: 272 PVIRGNKWSSTKWM 280

BLAST of HG10001835 vs. ExPASy Swiss-Prot
Match: F4JNU8 (Probable prolyl 4-hydroxylase 8 OS=Arabidopsis thaliana OX=3702 GN=P4H8 PE=3 SV=1)

HSP 1 Score: 193.7 bits (491), Expect = 2.8e-48
Identity = 107/194 (55.15%), Postives = 129/194 (66.49%), Query Frame = 0

Query: 100 LKYEECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLS--HQEKNYPMVQAIEK 159
           L  EEC++L ++A P +  S VVD KTGK + S  RTSSG FL+  H E    +V+ IE 
Sbjct: 96  LTNEECEHLISLAKPSMMKSKVVDVKTGKSIDSRVRTSSGTFLNRGHDE----IVEEIEN 155

Query: 160 RISVYSQIPVENGELIQVLRYEKSQFYKPHHDYFSDTFNLKRGGQRIATMLMYLSENVEG 219
           RIS ++ IP ENGE +QVL YE  Q Y+PHHDYF D FN+++GGQRIAT+LMYLS+  EG
Sbjct: 156 RISDFTFIPPENGEGLQVLHYEVGQRYEPHHDYFFDEFNVRKGGQRIATVLMYLSDVDEG 215

Query: 220 GETYFPKAGSG--------ECSCGGKTVPGLSVKPSKGDAVLFWSMGLDGQSDPSSIHGG 279
           GET FP A           E S  GK   GLSV P K DA+LFWSM  D   DPSS+HGG
Sbjct: 216 GETVFPAAKGNVSDVPWWDELSQCGK--EGLSVLPKKRDALLFWSMKPDASLDPSSLHGG 275

Query: 280 CEVLAGEKWSATKW 284
           C V+ G KWS+TKW
Sbjct: 276 CPVIKGNKWSSTKW 283

BLAST of HG10001835 vs. ExPASy Swiss-Prot
Match: Q24JN5 (Prolyl 4-hydroxylase 5 OS=Arabidopsis thaliana OX=3702 GN=P4H5 PE=2 SV=1)

HSP 1 Score: 188.7 bits (478), Expect = 9.1e-47
Identity = 106/194 (54.64%), Postives = 127/194 (65.46%), Query Frame = 0

Query: 100 LKYEECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFL--SHQEKNYPMVQAIEK 159
           L  EEC++L ++A P +  STVVD KTG    S  RTSSG FL   H E    +V+ IEK
Sbjct: 96  LTNEECEHLISLAKPSMVKSTVVDEKTGGSKDSRVRTSSGTFLRRGHDE----VVEVIEK 155

Query: 160 RISVYSQIPVENGELIQVLRYEKSQFYKPHHDYFSDTFNLKRGGQRIATMLMYLSENVEG 219
           RIS ++ IPVENGE +QVL Y+  Q Y+PH+DYF D FN K GGQRIAT+LMYLS+  +G
Sbjct: 156 RISDFTFIPVENGEGLQVLHYQVGQKYEPHYDYFLDEFNTKNGGQRIATVLMYLSDVDDG 215

Query: 220 GETYFPKAGS--------GECSCGGKTVPGLSVKPSKGDAVLFWSMGLDGQSDPSSIHGG 279
           GET FP A           E S  GK   GLSV P K DA+LFW+M  D   DPSS+HGG
Sbjct: 216 GETVFPAARGNISAVPWWNELSKCGK--EGLSVLPKKRDALLFWNMRPDASLDPSSLHGG 275

Query: 280 CEVLAGEKWSATKW 284
           C V+ G KWS+TKW
Sbjct: 276 CPVVKGNKWSSTKW 283

BLAST of HG10001835 vs. ExPASy TrEMBL
Match: A0A1S3BY76 (prolyl 4-hydroxylase 1 isoform X6 OS=Cucumis melo OX=3656 GN=LOC103494504 PE=4 SV=1)

HSP 1 Score: 407.9 bits (1047), Expect = 3.5e-110
Identity = 213/294 (72.45%), Postives = 232/294 (78.91%), Query Frame = 0

Query: 1   MASAPMRIVFGLLTFVTVGMIIGEDVSKVRSGRMDRGGFDCMSKGIKKSMGFGLEDQAE- 60
           M SA MRIVFGLLTFVTVGMIIG +   + +GR+ +  +D   +  +    +  + +AE 
Sbjct: 1   MVSAQMRIVFGLLTFVTVGMIIGTEF--LPAGRLHKTQYDSQRQLPRGLPNWINDKEAEI 60

Query: 61  -RVWLVVVGKVGLDPLFEIFGGVDGREDGFCRFLDFGGMLRLKYEECDYLKAIALPRLEI 120
            R+  V    V   P   +                      L  EECDYLK IALPRLEI
Sbjct: 61  LRLGYVKPEVVSWSPRIIVLHNF------------------LSTEECDYLKGIALPRLEI 120

Query: 121 STVVDTKTGKGVKSDFRTSSGMFLSHQEKNYPMVQAIEKRISVYSQIPVENGELIQVLRY 180
           STVVDTKTGKGVKSDFRTSSGMFLSH EKNYPMVQAIEKRISVYSQIPVENGELIQVLRY
Sbjct: 121 STVVDTKTGKGVKSDFRTSSGMFLSHHEKNYPMVQAIEKRISVYSQIPVENGELIQVLRY 180

Query: 181 EKSQFYKPHHDYFSDTFNLKRGGQRIATMLMYLSENVEGGETYFPKAGSGECSCGGKTVP 240
           EK+QFYKPHHDYFSDTFNLKRGGQRIATMLMYLSEN+EGGETYFPKAGSGECSCGGKTVP
Sbjct: 181 EKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFPKAGSGECSCGGKTVP 240

Query: 241 GLSVKPSKGDAVLFWSMGLDGQSDPSSIHGGCEVLAGEKWSATKWMRQKSTLVP 293
           GLSVKP+KGDA+LFWSMGLDGQSDP+SIHGGCEVL+GEKWSATKWMRQKSTLVP
Sbjct: 241 GLSVKPAKGDAILFWSMGLDGQSDPNSIHGGCEVLSGEKWSATKWMRQKSTLVP 274

BLAST of HG10001835 vs. ExPASy TrEMBL
Match: A0A1S4E033 (prolyl 4-hydroxylase 1 isoform X4 OS=Cucumis melo OX=3656 GN=LOC103494504 PE=4 SV=1)

HSP 1 Score: 405.6 bits (1041), Expect = 1.8e-109
Identity = 216/295 (73.22%), Postives = 235/295 (79.66%), Query Frame = 0

Query: 1   MASAPMRIVFGLLTFVTVGMIIGEDVSKVRSGRM-DRGGFDCMSKGIKKSMGFGLEDQAE 60
           M SA MRIVFGLLTFVTVGMIIG  +      R+ D  G + +  G      +  + Q  
Sbjct: 1   MVSAQMRIVFGLLTFVTVGMIIGALLQLAFLRRLEDSIGTEFLPAGRLHKTQYDSQRQLP 60

Query: 61  RVWLVVVGKVGLDPLFEIFGGVDGREDGFCRFLD--FGGMLRLKYEECDYLKAIALPRLE 120
           RV +        +  F    G +    G   +++     +LRL Y ECDYLK IALPRLE
Sbjct: 61  RVTV-------KNREFSKELGGNQFNPGLPNWINDKEAEILRLGY-ECDYLKGIALPRLE 120

Query: 121 ISTVVDTKTGKGVKSDFRTSSGMFLSHQEKNYPMVQAIEKRISVYSQIPVENGELIQVLR 180
           ISTVVDTKTGKGVKSDFRTSSGMFLSH EKNYPMVQAIEKRISVYSQIPVENGELIQVLR
Sbjct: 121 ISTVVDTKTGKGVKSDFRTSSGMFLSHHEKNYPMVQAIEKRISVYSQIPVENGELIQVLR 180

Query: 181 YEKSQFYKPHHDYFSDTFNLKRGGQRIATMLMYLSENVEGGETYFPKAGSGECSCGGKTV 240
           YEK+QFYKPHHDYFSDTFNLKRGGQRIATMLMYLSEN+EGGETYFPKAGSGECSCGGKTV
Sbjct: 181 YEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFPKAGSGECSCGGKTV 240

Query: 241 PGLSVKPSKGDAVLFWSMGLDGQSDPSSIHGGCEVLAGEKWSATKWMRQKSTLVP 293
           PGLSVKP+KGDA+LFWSMGLDGQSDP+SIHGGCEVL+GEKWSATKWMRQKSTLVP
Sbjct: 241 PGLSVKPAKGDAILFWSMGLDGQSDPNSIHGGCEVLSGEKWSATKWMRQKSTLVP 287

BLAST of HG10001835 vs. ExPASy TrEMBL
Match: A0A1S3BYM5 (prolyl 4-hydroxylase 1 isoform X7 OS=Cucumis melo OX=3656 GN=LOC103494504 PE=4 SV=1)

HSP 1 Score: 404.1 bits (1037), Expect = 5.1e-109
Identity = 215/293 (73.38%), Postives = 231/293 (78.84%), Query Frame = 0

Query: 1   MASAPMRIVFGLLTFVTVGMIIGEDVSKVRSGRM-DRGGFDCMSKGIKKSMGFGLEDQAE 60
           M SA MRIVFGLLTFVTVGMIIG  +      R+ D  G + +  G      +  + Q  
Sbjct: 1   MVSAQMRIVFGLLTFVTVGMIIGALLQLAFLRRLEDSIGTEFLPAGRLHKTQYDSQRQLP 60

Query: 61  RVWLVVVGKVGLDPLFEIFGGVDGREDGFCRFLDFGGMLRLKYEECDYLKAIALPRLEIS 120
           R         GL         ++ +E           +LRL Y ECDYLK IALPRLEIS
Sbjct: 61  R---------GLP------NWINDKE---------AEILRLGY-ECDYLKGIALPRLEIS 120

Query: 121 TVVDTKTGKGVKSDFRTSSGMFLSHQEKNYPMVQAIEKRISVYSQIPVENGELIQVLRYE 180
           TVVDTKTGKGVKSDFRTSSGMFLSH EKNYPMVQAIEKRISVYSQIPVENGELIQVLRYE
Sbjct: 121 TVVDTKTGKGVKSDFRTSSGMFLSHHEKNYPMVQAIEKRISVYSQIPVENGELIQVLRYE 180

Query: 181 KSQFYKPHHDYFSDTFNLKRGGQRIATMLMYLSENVEGGETYFPKAGSGECSCGGKTVPG 240
           K+QFYKPHHDYFSDTFNLKRGGQRIATMLMYLSEN+EGGETYFPKAGSGECSCGGKTVPG
Sbjct: 181 KNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFPKAGSGECSCGGKTVPG 240

Query: 241 LSVKPSKGDAVLFWSMGLDGQSDPSSIHGGCEVLAGEKWSATKWMRQKSTLVP 293
           LSVKP+KGDA+LFWSMGLDGQSDP+SIHGGCEVL+GEKWSATKWMRQKSTLVP
Sbjct: 241 LSVKPAKGDAILFWSMGLDGQSDPNSIHGGCEVLSGEKWSATKWMRQKSTLVP 268

BLAST of HG10001835 vs. ExPASy TrEMBL
Match: A0A6J1CBS4 (prolyl 4-hydroxylase 1 OS=Momordica charantia OX=3673 GN=LOC111009248 PE=4 SV=1)

HSP 1 Score: 403.3 bits (1035), Expect = 8.7e-109
Identity = 213/294 (72.45%), Postives = 234/294 (79.59%), Query Frame = 0

Query: 1   MASAPMRIVFGLLTFVTVGMIIGEDVSKVRSGRM-DRGGFDCMSKGIKKSMGFGLEDQAE 60
           MASAPMRIVFGLLTFVT+GMIIG         R+ D  G + +S G      +  + Q  
Sbjct: 1   MASAPMRIVFGLLTFVTLGMIIGALFQLAFIRRLEDSYGTEFLSAGRLHKTQYDGDRQLP 60

Query: 61  RVWLVVVGKVGLDPLFEIFGGVDGREDGFC-RFLDFGGMLRLKYEECDYLKAIALPRLEI 120
           R +   +     + L    G V      +  R +       L  EECDYL+A+ALPRLE+
Sbjct: 61  RGFPNWINDREAEIL--RLGYVKPEVVSWSPRIIVLHNF--LSTEECDYLRAVALPRLEV 120

Query: 121 STVVDTKTGKGVKSDFRTSSGMFLSHQEKNYPMVQAIEKRISVYSQIPVENGELIQVLRY 180
           STVVDTKTGKGVKSDFRTSSGMFLSHQEKNYPM+QAIEKRISVYSQIP+ENGELIQVLRY
Sbjct: 121 STVVDTKTGKGVKSDFRTSSGMFLSHQEKNYPMIQAIEKRISVYSQIPIENGELIQVLRY 180

Query: 181 EKSQFYKPHHDYFSDTFNLKRGGQRIATMLMYLSENVEGGETYFPKAGSGECSCGGKTVP 240
           EK+QFYKPHHDYFSDTFNLKRGGQR+ATMLMYLS+NVEGGETYFPKAGSGECSCGGKTVP
Sbjct: 181 EKNQFYKPHHDYFSDTFNLKRGGQRVATMLMYLSDNVEGGETYFPKAGSGECSCGGKTVP 240

Query: 241 GLSVKPSKGDAVLFWSMGLDGQSDPSSIHGGCEVLAGEKWSATKWMRQKSTLVP 293
           GLSVKP KGDAVLFWSMGLDGQSDP+SIHGGCEVL+GEKWSATKWMRQKSTLVP
Sbjct: 241 GLSVKPVKGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWSATKWMRQKSTLVP 290

BLAST of HG10001835 vs. ExPASy TrEMBL
Match: A0A1S4E011 (prolyl 4-hydroxylase 1 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103494504 PE=4 SV=1)

HSP 1 Score: 402.9 bits (1034), Expect = 1.1e-108
Identity = 216/304 (71.05%), Postives = 235/304 (77.30%), Query Frame = 0

Query: 1   MASAPMRIVFGLLTFVTVGMIIGEDVSKVRSGRMDRGGFDCM---------SKGIKKSMG 60
           M SA MRIVFGLLTFVTVGMIIG +   + +GR+ +  +D           ++   K +G
Sbjct: 1   MVSAQMRIVFGLLTFVTVGMIIGTEF--LPAGRLHKTQYDSQRQLPRVTVKNREFSKELG 60

Query: 61  FGLEDQAERVWLVVVGKVGLDPLFEI--FGGVDGREDGFC-RFLDFGGMLRLKYEECDYL 120
               +     W+        D   EI   G V      +  R +       L  EECDYL
Sbjct: 61  GNQFNPGLPNWI-------NDKEAEILRLGYVKPEVVSWSPRIIVLHNF--LSTEECDYL 120

Query: 121 KAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSHQEKNYPMVQAIEKRISVYSQIPVE 180
           K IALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSH EKNYPMVQAIEKRISVYSQIPVE
Sbjct: 121 KGIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSHHEKNYPMVQAIEKRISVYSQIPVE 180

Query: 181 NGELIQVLRYEKSQFYKPHHDYFSDTFNLKRGGQRIATMLMYLSENVEGGETYFPKAGSG 240
           NGELIQVLRYEK+QFYKPHHDYFSDTFNLKRGGQRIATMLMYLSEN+EGGETYFPKAGSG
Sbjct: 181 NGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFPKAGSG 240

Query: 241 ECSCGGKTVPGLSVKPSKGDAVLFWSMGLDGQSDPSSIHGGCEVLAGEKWSATKWMRQKS 293
           ECSCGGKTVPGLSVKP+KGDA+LFWSMGLDGQSDP+SIHGGCEVL+GEKWSATKWMRQKS
Sbjct: 241 ECSCGGKTVPGLSVKPAKGDAILFWSMGLDGQSDPNSIHGGCEVLSGEKWSATKWMRQKS 293

BLAST of HG10001835 vs. TAIR 10
Match: AT2G43080.1 (P4H isoform 1 )

HSP 1 Score: 349.4 bits (895), Expect = 2.9e-96
Identity = 184/290 (63.45%), Postives = 217/290 (74.83%), Query Frame = 0

Query: 6   MRIVFGLLTFVTVGMIIGEDVSKVRSGRMDRGGFDCMSKGIKKSMGFGLEDQAERVWLVV 65
           M+IVFGLLTFVTVGM+IG  +      R++    D    G       GL  Q  R +L  
Sbjct: 5   MKIVFGLLTFVTVGMVIGSLLQLAFINRLE----DSYGTGFPSLR--GLRGQNTR-YLRD 64

Query: 66  VGKVGLDPLFEI--FGGVDGREDGFCRFL----DFGGMLRLKYEECDYLKAIALPRLEIS 125
           V +   D   E+   G V      +   +    DF     L  EEC+YLKAIA PRL++S
Sbjct: 65  VSRWANDKDAELLRIGNVKPEVVSWSPRIIVLHDF-----LSPEECEYLKAIARPRLQVS 124

Query: 126 TVVDTKTGKGVKSDFRTSSGMFLSHQEKNYPMVQAIEKRISVYSQIPVENGELIQVLRYE 185
           TVVD KTGKGVKSD RTSSGMFL+H E++YP++QAIEKRI+V+SQ+P ENGELIQVLRYE
Sbjct: 125 TVVDVKTGKGVKSDVRTSSGMFLTHVERSYPIIQAIEKRIAVFSQVPAENGELIQVLRYE 184

Query: 186 KSQFYKPHHDYFSDTFNLKRGGQRIATMLMYLSENVEGGETYFPKAGSGECSCGGKTVPG 245
             QFYKPHHDYF+DTFNLKRGGQR+ATMLMYL+++VEGGETYFP AG G+C+CGGK + G
Sbjct: 185 PQQFYKPHHDYFADTFNLKRGGQRVATMLMYLTDDVEGGETYFPLAGDGDCTCGGKIMKG 244

Query: 246 LSVKPSKGDAVLFWSMGLDGQSDPSSIHGGCEVLAGEKWSATKWMRQKST 290
           +SVKP+KGDAVLFWSMGLDGQSDP SIHGGCEVL+GEKWSATKWMRQK+T
Sbjct: 245 ISVKPTKGDAVLFWSMGLDGQSDPRSIHGGCEVLSGEKWSATKWMRQKAT 282

BLAST of HG10001835 vs. TAIR 10
Match: AT5G66060.1 (2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein )

HSP 1 Score: 199.1 bits (505), Expect = 4.8e-51
Identity = 107/197 (54.31%), Postives = 127/197 (64.47%), Query Frame = 0

Query: 100 LKYEECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSHQEKNYPMVQAIEKRI 159
           L  EEC YL  +A P +E STVVD KTGK   S  RTSSG FL+        ++ IEKRI
Sbjct: 94  LTKEECKYLIELAKPHMEKSTVVDEKTGKSTDSRVRTSSGTFLARGRDK--TIREIEKRI 153

Query: 160 SVYSQIPVENGELIQVLRYEKSQFYKPHHDYFSDTFNLKRGGQRIATMLMYLSENVEGGE 219
           S ++ IPVE+GE +QVL YE  Q Y+PH+DYF D +N + GGQRIAT+LMYLS+  EGGE
Sbjct: 154 SDFTFIPVEHGEGLQVLHYEIGQKYEPHYDYFMDEYNTRNGGQRIATVLMYLSDVEEGGE 213

Query: 220 TYFPKAGS-----------GECSCGGKTVPGLSVKPSKGDAVLFWSMGLDGQSDPSSIHG 279
           T FP A              EC  G     GLSVKP  GDA+LFWSM  D   DPSS+HG
Sbjct: 214 TVFPAAKGNYSAVPWWNELSECGKG-----GLSVKPKMGDALLFWSMTPDATLDPSSLHG 273

Query: 280 GCEVLAGEKWSATKWMR 286
           GC V+ G KWS+TKW+R
Sbjct: 274 GCAVIKGNKWSSTKWLR 283

BLAST of HG10001835 vs. TAIR 10
Match: AT1G20270.1 (2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein )

HSP 1 Score: 198.0 bits (502), Expect = 1.1e-50
Identity = 105/194 (54.12%), Postives = 128/194 (65.98%), Query Frame = 0

Query: 100 LKYEECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSHQEKNYPMVQAIEKRI 159
           L  EEC+YL ++A P +  STVVD++TGK   S  RTSSG FL        +++ IEKRI
Sbjct: 92  LSKEECEYLISLAKPHMVKSTVVDSETGKSKDSRVRTSSGTFLRRGRDK--IIKTIEKRI 151

Query: 160 SVYSQIPVENGELIQVLRYEKSQFYKPHHDYFSDTFNLKRGGQRIATMLMYLSENVEGGE 219
           + Y+ IP ++GE +QVL YE  Q Y+PH+DYF D FN K GGQR+ATMLMYLS+  EGGE
Sbjct: 152 ADYTFIPADHGEGLQVLHYEAGQKYEPHYDYFVDEFNTKNGGQRMATMLMYLSDVEEGGE 211

Query: 220 TYFPKAGSGECS---------CGGKTVPGLSVKPSKGDAVLFWSMGLDGQSDPSSIHGGC 279
           T FP A     S         CG K   GLSVKP  GDA+LFWSM  D   DP+S+HGGC
Sbjct: 212 TVFPAANMNFSSVPWYNELSECGKK---GLSVKPRMGDALLFWSMRPDATLDPTSLHGGC 271

Query: 280 EVLAGEKWSATKWM 285
            V+ G KWS+TKWM
Sbjct: 272 PVIRGNKWSSTKWM 280

BLAST of HG10001835 vs. TAIR 10
Match: AT4G35810.1 (2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein )

HSP 1 Score: 193.7 bits (491), Expect = 2.0e-49
Identity = 107/194 (55.15%), Postives = 129/194 (66.49%), Query Frame = 0

Query: 100 LKYEECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLS--HQEKNYPMVQAIEK 159
           L  EEC++L ++A P +  S VVD KTGK + S  RTSSG FL+  H E    +V+ IE 
Sbjct: 96  LTNEECEHLISLAKPSMMKSKVVDVKTGKSIDSRVRTSSGTFLNRGHDE----IVEEIEN 155

Query: 160 RISVYSQIPVENGELIQVLRYEKSQFYKPHHDYFSDTFNLKRGGQRIATMLMYLSENVEG 219
           RIS ++ IP ENGE +QVL YE  Q Y+PHHDYF D FN+++GGQRIAT+LMYLS+  EG
Sbjct: 156 RISDFTFIPPENGEGLQVLHYEVGQRYEPHHDYFFDEFNVRKGGQRIATVLMYLSDVDEG 215

Query: 220 GETYFPKAGSG--------ECSCGGKTVPGLSVKPSKGDAVLFWSMGLDGQSDPSSIHGG 279
           GET FP A           E S  GK   GLSV P K DA+LFWSM  D   DPSS+HGG
Sbjct: 216 GETVFPAAKGNVSDVPWWDELSQCGK--EGLSVLPKKRDALLFWSMKPDASLDPSSLHGG 275

Query: 280 CEVLAGEKWSATKW 284
           C V+ G KWS+TKW
Sbjct: 276 CPVIKGNKWSSTKW 283

BLAST of HG10001835 vs. TAIR 10
Match: AT2G17720.1 (2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein )

HSP 1 Score: 188.7 bits (478), Expect = 6.5e-48
Identity = 106/194 (54.64%), Postives = 127/194 (65.46%), Query Frame = 0

Query: 100 LKYEECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFL--SHQEKNYPMVQAIEK 159
           L  EEC++L ++A P +  STVVD KTG    S  RTSSG FL   H E    +V+ IEK
Sbjct: 96  LTNEECEHLISLAKPSMVKSTVVDEKTGGSKDSRVRTSSGTFLRRGHDE----VVEVIEK 155

Query: 160 RISVYSQIPVENGELIQVLRYEKSQFYKPHHDYFSDTFNLKRGGQRIATMLMYLSENVEG 219
           RIS ++ IPVENGE +QVL Y+  Q Y+PH+DYF D FN K GGQRIAT+LMYLS+  +G
Sbjct: 156 RISDFTFIPVENGEGLQVLHYQVGQKYEPHYDYFLDEFNTKNGGQRIATVLMYLSDVDDG 215

Query: 220 GETYFPKAGS--------GECSCGGKTVPGLSVKPSKGDAVLFWSMGLDGQSDPSSIHGG 279
           GET FP A           E S  GK   GLSV P K DA+LFW+M  D   DPSS+HGG
Sbjct: 216 GETVFPAARGNISAVPWWNELSKCGK--EGLSVLPKKRDALLFWNMRPDASLDPSSLHGG 275

Query: 280 CEVLAGEKWSATKW 284
           C V+ G KWS+TKW
Sbjct: 276 CPVVKGNKWSSTKW 283

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038904320.16.6e-11171.43prolyl 4-hydroxylase 1 isoform X1 [Benincasa hispida][more]
XP_008453926.17.3e-11072.45PREDICTED: prolyl 4-hydroxylase 1 isoform X6 [Cucumis melo][more]
XP_016901569.13.6e-10973.22PREDICTED: prolyl 4-hydroxylase 1 isoform X4 [Cucumis melo][more]
XP_008453928.11.1e-10873.38PREDICTED: prolyl 4-hydroxylase 1 isoform X7 [Cucumis melo][more]
XP_022137963.11.8e-10872.45prolyl 4-hydroxylase 1 [Momordica charantia][more]
Match NameE-valueIdentityDescription
Q9ZW864.0e-9563.45Prolyl 4-hydroxylase 1 OS=Arabidopsis thaliana OX=3702 GN=P4H1 PE=1 SV=1[more]
F4JZ246.8e-5054.31Probable prolyl 4-hydroxylase 10 OS=Arabidopsis thaliana OX=3702 GN=P4H10 PE=3 S... [more]
Q9LN201.5e-4954.12Probable prolyl 4-hydroxylase 3 OS=Arabidopsis thaliana OX=3702 GN=P4H3 PE=2 SV=... [more]
F4JNU82.8e-4855.15Probable prolyl 4-hydroxylase 8 OS=Arabidopsis thaliana OX=3702 GN=P4H8 PE=3 SV=... [more]
Q24JN59.1e-4754.64Prolyl 4-hydroxylase 5 OS=Arabidopsis thaliana OX=3702 GN=P4H5 PE=2 SV=1[more]
Match NameE-valueIdentityDescription
A0A1S3BY763.5e-11072.45prolyl 4-hydroxylase 1 isoform X6 OS=Cucumis melo OX=3656 GN=LOC103494504 PE=4 S... [more]
A0A1S4E0331.8e-10973.22prolyl 4-hydroxylase 1 isoform X4 OS=Cucumis melo OX=3656 GN=LOC103494504 PE=4 S... [more]
A0A1S3BYM55.1e-10973.38prolyl 4-hydroxylase 1 isoform X7 OS=Cucumis melo OX=3656 GN=LOC103494504 PE=4 S... [more]
A0A6J1CBS48.7e-10972.45prolyl 4-hydroxylase 1 OS=Momordica charantia OX=3673 GN=LOC111009248 PE=4 SV=1[more]
A0A1S4E0111.1e-10871.05prolyl 4-hydroxylase 1 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103494504 PE=4 S... [more]
Match NameE-valueIdentityDescription
AT2G43080.12.9e-9663.45P4H isoform 1 [more]
AT5G66060.14.8e-5154.312-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein [more]
AT1G20270.11.1e-5054.122-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein [more]
AT4G35810.12.0e-4955.152-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein [more]
AT2G17720.16.5e-4854.642-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR006620Prolyl 4-hydroxylase, alpha subunitSMARTSM00702p4hccoord: 89..285
e-value: 7.3E-49
score: 178.3
NoneNo IPR availableGENE3D2.60.120.620q2cbj1_9rhob like domaincoord: 99..285
e-value: 5.9E-60
score: 204.5
NoneNo IPR availablePANTHERPTHR10869:SF179BNAA04G24820D PROTEINcoord: 1..26
coord: 100..289
IPR044862Prolyl 4-hydroxylase alpha subunit, Fe(2+) 2OG dioxygenase domainPFAMPF136402OG-FeII_Oxy_3coord: 173..285
e-value: 1.2E-20
score: 74.2
IPR045054Prolyl 4-hydroxylasePANTHERPTHR10869PROLYL 4-HYDROXYLASE ALPHA SUBUNITcoord: 1..26
coord: 100..289
IPR005123Oxoglutarate/iron-dependent dioxygenasePROSITEPS51471FE2OG_OXYcoord: 169..286
score: 11.273666

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
HG10001835.1HG10001835.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0018401 peptidyl-proline hydroxylation to 4-hydroxy-L-proline
cellular_component GO:0005783 endoplasmic reticulum
cellular_component GO:0000137 Golgi cis cisterna
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0005506 iron ion binding
molecular_function GO:0031418 L-ascorbic acid binding
molecular_function GO:0004656 procollagen-proline 4-dioxygenase activity
molecular_function GO:0016705 oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen