PI0020559 (gene) Melon (PI 482460) v1

Overview
NamePI0020559
Typegene
OrganismCucumis metuliferus (Melon (PI 482460) v1)
Descriptionprolyl 4-hydroxylase 1
Locationchr07: 3009154 .. 3020069 (+)
RNA-Seq ExpressionPI0020559
SyntenyPI0020559
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRexonCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
AAAAGCATGAATTCCCTAATAAACACAAATATTTGAACAAAGAAATCAAAAGGATCTCAAATCACAATCACTTTTTCTTCACAGTTACGGAAGAAATCATAGTGCACGAGAAAGATGATGAACAACAACAAGCACTAGAAGCAAATCCTCTGCACTGGGTCATCCACACTTCAAACACTAACGGCTTCTCCTCAAGAATTTTGAGTCAAATTTTTGTTTAGGCAGCTATGGTTTCCGCTCAGATGAGGATTGTCTTCGGTCTCTTGACCTTTGTCACCGTCGGCATGATCATCGGTACGCATTTTCGTTTCTCTTTGTCGGTTTTTGTTGATTTTATGTATATGTGTTTTGTGTGGCGTGTTTTTCACTCATTCGATGCCGAAATGGAGATTAACAAGAACTTGAGTGCTTAGTGGCTTTATGATTTGTTGAGGTATTCGTCAATGTAAGTTAAATCTGAGTTGTGATGTGAGCATGTTTCAGGAGGAGTTTTCTTGTTTGCGACTTTCTTTGAGCTGATTTCTTTAGAGGAGCGAGGAATTGCTGGTACCATTAGCGAACTGCGTTTTTTTTTTCTTCTTCTCATACTCTGTAATTGTGCACTTGCCGGCTCGGTCAGAGATTGGCAGCTACACAGTGTTGAGTTGACCACAAATGTTCATTTGCTTTTCATTTTCTGGAATTTAGCTCTTCCTTGGATTTGATTTAATGGAAAGGAAAACACTCTTGGCTCTCCTTCTTTGAGTTTTTCTACGATTAAACGTTGCAGTTTTTTATGTTTGCTATGAGTTGAACTATTGAATGATGAATCTTTGTACCATTTGGCTGGATTCTTTGTTCGCATATGGTAACGTTTTGGTGACTTGATTGTGTTCATGTACTTCTAGAGAACTATGCAAAGTTGTGATTGATAAGGTACTTCCTTGAATGGTAATTGCAGGTGCTTTGTTACAACTAGCATTTTTAAGAAGGCTGGAGGACTCTATTGGTACACTTCTTGGTTTATTTGTCATCTAGGATGTAAAAACTTGAAATTATATAATGACAGTATCTTATATTTGGAAAGACGATGAGCACTATTGGGGTTGGATTTGGATATCTTTGAATTTTTCATTTAGATTTTATTTTTATTCATGTCTTTGGAGAAGTGTTGTCATCATGTCTACTCATAACTAAATAGATATGTATATTACTAGAAACAACATACGTTCATTAATAAAAAGATCTGTAAAATGTCGAAGAAATAGCACATGGTAACGGTCAATATGCAGTATGAGTTAACTTATTTTGCTTCCAACTAATAGAAAAAGAAGTTGTTCATGAACATATTTGAAATCCATTAAAAACTCAATTCCATAGCATGCACCATAGGCATACAGATATTGCAAGGCCAAGTTCCATGAAAAAGACAGAAAAATGATCAGGTGTAAAGACTTTCTCGCTAGTATTGTAACTTTCTATTTTGTTCAATTGGACTAATAAATTTCTACGCTAAGTATCTGGATTGGACCAAACACTTGGACTTTGGAGCACATAAAAGATAGTTTGATTTATCTAAGTGACTGCATCATGACAGTTCAATTTGAAATTCAATGAAGAAAAAGTTCTTACATAGATATTTTAAAATTATCTGAGGTAGTTGTATGGATCTCATGAGGATTTGAAATATTGTTTGCATTCCATAAAACTTATGACTTCCTGGTAATTGTTTGGAAGAATGACCAAACGATGTAGACACTTCACTTAGCCCAGAGCTCTTGACGTATCCATGCTTAAATAATATCATCTATCTTTGCTGACAGCATGGCATTTACTGACTGCTGGATGTACCCTATATGTTTATGGGGCATGGTTTGTTTTGGTATCACTAAAATGATTGACTTTTGTATGATCCATGTCTTTGCAGGCACGGAGTTTCTATCTGCTGGAAGGTTACATAAAACTCAGTATGATGGCCAACGTCAATTACCCCGAGGTTGGAAATATATTATAGGGTCAACTTTTGCACAAAAATTTGCCCCTATTAAATTACATTAGAATTGTTTTAAAGGGGACATATCTTTCTTCGACGATCATTTATTTGATTTAGCATGGCTGCAAGTTGTTCATCTATGTTTTTGCTTCAAAAGTGGTTTTACGTCAGTTTATTTCATATGATGAAGGAAAGACTTGATGGAAATTTGACTTAATTTCAACTTCCAATTCCTTTAGTCATTTTTGTTCTGTTCTAGATTTTTAACTTAGGAATAATCCATCTACAATTTTTTACTGAAAGTTGTTTCCATCTTCATGTTTATAAAAAATATTGATTTCATCATTTTTTTTATTTGAAAGCATATTATATTTTCATTAAAGGACTATACATAACTTTTGTTTGTGTTAACCATCATGAGTTGGCCAAGTGGCTGTAAAGGACCATGAATTTAGTAAAGAACTTAGAGGGAATCGATTCAATCCATGGTAGGCACATACCTATATATACTACAAGTTTCTTCGGCATCCAATGTTGTAAAATCGTGATTGTCTCGTGAGATTTGTCGAGGTGGCCGTAAGTTGGCCACACAAATATGAAATTGTTATTGCCTTACAAATATGAAATTGTTATTGCCTTACAAATATGAAATTGTTATTGCCTTACAAATATGAAATTGTTATTGCCTTACAAATATGAAATTGTTATTGCCTTACTTTTACCCTGGACCGATTGGAAAACTTTTTTTATTTTTATTTTTATTTTTTATTATTTAAAATTGACTTTGGTTGGTTGTATTCCTTTTCCTATCTTTGTAACTTTTGTTTCCTAATTTGAAACAATTATTTCTTATCTAGAAAGCATTAATTCTTGTTGGCCTGTTAGACAATAAGTCTGATTACTTATTTATTGTTATTATTTTTTTAGAAGATTAATATTGGTATTAAGGTTCTCATCGAAGTAAGTTCATATTAGAATGAGATTTAGCAAACATAATGAAATTGTCCAATTATCAGCATGTTTTTCACACTGATTTATTTATTTATATTTTTTCTTGATCATTCTCCTTTTGCTAATTTTCAGGCCTTCCTAATTGGATTAATGACAAAGAAGCAGAAATTCTTCGTCTTGGCTATGTATGCTTATCACTACTTTTCTATTTATGATTGTCAACGTTTCTTTTCTTTTTTTCTTTTGGTTTTGGGATCTTTGGATGCCTTGTTATTAAAAAAATAGGACAATTTTATACAAAAAAAGTTTGTTTATCCTTTGTAAACAAAGTAAAAGTTAAGAACCAAGCAGATTGGGGGAAAACATACAGTACAGTTGTGTTGAAGAAGAAATGAGTAAGTAGGCCAAAGAATACCATAATTCGGCAGTGTCCTGGACCCACCCACCCACCCACACAATGGCCTTGGAATCAAGACAAGGGATTAATTACCAATATCCTTGAAGAATAATTCCATCTAGAAGCACGAGAGAAGAAAAATACTCACATATTACATGTATACGCAAGTGCTCAATTCATTCAAATATGAATGTAAAATAATTGAAATATTGTTTCATATCTCATGACATGTAAATATCCATGTGCAAAACCAAATGAGAAATATTCTCTTGAAGCGAAAAGATGGTCAACGAATGATTTGAAAGCAATTAAGAAATTTTGGAGTATTTCTATTGTGTCTTTACTACTAACACGTCTGCTCTTTTACATTGGTGGACAGTCTTTTCTAACATTTAAGTGAAACTTTTTGATCCTACACGTGTTTTTGGGAATAATTTGTAGGCAAGTTTTCCATGTTATCAATTGAAGTTTCCTGTCCTAAAAGAAAGTTTCCCTTTTATTTTGTGGAATAGGTTAAACCAGAAGTAGTAAGCTGGTCACCACGAATCGTAGTATTTCATAATTTTTTGAGCACAGAGGTGAGCATAATCCTATCGGAATGGTAATTCTTATTCTGGATGTTATCGATTCTCTGCAGATTCTTTCCTTTACATAACTTTGAAAATAGAAAGACTTTCAACTAAGTGACTTATATCTATATATATGTATTATCTATATATATTAAGAAACAGTTTTATTGATGAATGAACTTTATGTAGATCAGGATGACATAAAGCTTTCCCAATTAGCTAAAAGGGAAGCGTACCTGTAGGCAGTAAAGAGGGTTGACAATTTACCTCAGAAATACCATGTAATATAACAAAATCGTAAAGCATGTCGAAATGGTGCTCCTTATTTAAAATGACTGATTTTGTTCCATCCATGTTGAAACATGGGTTATTATTAGCCATCACTCAAGGCCTAGCAACACTCAGATTTATTTATTATCTTTATTTCTGAGTATTTGATGGGCTTGTGTATGTATATAGTTAAACCCGTTATACAGCACTACTAGGGTTTCAAAAATTTAAATGTATACTACTCCTTTATACATTCTGATTCAATATAATGAGACAACACAAACTCACTTTGGTGCATAGGTGTCTTGCTTGTTTTTCATTTTTATATTGGAGGAGGGTTTGGTCCCTTCAGTGGCATCTAATAAAATTGGAAGAAAGAGAGAAGATAGGAATGACCCTTCCAAAGGATGACAGTTATGCATCAGAGGAAGGCTTCATTGGATCGTAAAGGATTCAAGAAGTAGAACCCAACGGCACATGATTTTGTTCCAATAATCATGAAAGTGCTTTATAGTGACCACAATGGAGGAGTCCCACTGGGTTGAGGCATGCTGACTGGAAGGATTGGGATCCTCCATAGACATTGCTCATGGTTAAAGACCAATTGGATGACGACCATACATGAACAGGAGTTGTTCTATAGGCTATACAATTGTGTTCTTTCCACTATGATGGTAATGACAACAATGTTCCTAACACGTCTGTAAGATAAATGGCCATTTTCAATCGAAGTCCATATAATTGGAAATGATCATGGGATATTTGCCCTTCTTCCCCTTTTATGCGAGATATTTGGAAGACTACTTGGTTCCTTGCTACTTTTTGGCTGGAAAGGAATTAAGAAATGTAGGATTTTAGAAAGTGGATTAAGATTAGGAATCTAGGATAGAATAAGACATGAGTTGGGACTTGGGAGTTTGTAGTTTTGATGCATAGAAATAAGACCACCAGTTATTGAACAATCACCAGGTGGTGGAGTTGAACAAACTTCCAAAAGAAAAAATGACGGTGAAGGTTTTCAGTAATTTGGTTGCGAAGACAACACCAGTTAGGCCGGATGCTTAGATTGGAATTCTCTTAAGAACTTGTTCATTCACATTGATGCCAGCCGAAGCCAAACACCATGAAAAGAATTTAATTTGTTGAAGATATTCTATATTTAGTTTGTATTTGATCATTTCTTTCTTGAGAAGTTGGTAACCTCTAGACATTTGTCTAAAGAGAGACTTGCTTGTAAAACTTCCCAACTTATCATCCAGCCACCACAGCTGGGTATCATGCTGAGGGATAGGCTTTTACCCTCAAACTTTGTAAGAAAGAGATTTCCTATAGAACAAGATTTTATCATTGTGGTTTTTTTTTTCCGCTTTATAAATCTCAATTCTAATGGAATTGCATCAGGTCTTTTAGTTCAAGAAGCATATATTTTGATAGAGGGACATGCTAGCTTCTGTTTTTACTAATTGATTAATTGTATAGGAGTGCGACTACCTTAAGGGAATAGCACTTCCTCGCCTTGAACTTTCCACTGTCGTGGATACGAAAACTGGGAAGGTATGCACTTTTTCCCCCTTCCTTTGTATGATCACCTGTCTATTAGTTTAAGGCATGATAGATAATTACAACAGATAGTGGTTTTCTCTCGCTTATTTCTTGTCTATACACTATTTTTGCTCTCAACATTTCTATGAATGTTATCTGTTGAAATAGACCCTACATACTGAAATGAAAATGAATTAACTATGGTCATATATTGATATTTGGTTCTACTCTGATTCTTTACACACTTTGAAGTTTGAAGTATCTGTGTTTCATAGATAGACAAAGTAAAAAGAAAAAAAAAAAAAAAAACGAAAAAGAAGACAACGAAGAGAAAACATTAATGCAGTGTCTGGTGTTCGGTTTAGAAATGGAATTGGGTTTTTGAAAACCAAAATTTTCCATTTCTGTAATTTCTTTATATTTTTAAAGATGTTTCTAAAATAAAAAAACCAATAACCGAATTTGTTAGCTAATAAACTTCTTTATTAACAACAAAAAAAAAAAAAAACTAATTTGTTATTTATTATTACTATTTTTTGTTAAGAAAAAACTAATTTGCTATCAAACTACCAGTTCGTGATTTTTTTAGGATTTGGGAAAAAAACAGATGGAATAGGTAAAACAATCTCAAACTTACATCTTTCTGGCAATTTTTGGATTAAAACATTGTTTCAACATCAAACACTAGGAAATTATTAGATTGTATATCCCTTTTGTGAATTTCATACATCAATGAAATTGTTTCTTATAAAGAAAATCTTAGGAAATGATCTTGTGAAGGAGGCATGAAAAGGAAATTGCCCCAAAGAGGTCAAAAAACTTAGACTATCTCCAAGGAAGAGATTTCTTTAAAAAAGGATGATCAAGTGAGCATAGTTTCCCTGGATGATCAATTTCTCAACTTTTTTTGGTTGAATATATATACATTCTTGTTTCCTATAAAAAAAAGTCAGCATAGTTTCAAACCTGAGAGAGAGGAGGACCCATCCAGGGAAATCAAAGCTCAGTGTGTCTTTTTAGAATGTTGCAAATTTGTTCCGACAGTTATGATAAGATAAAATATCACTGTATTTTCAGCAAAGAAAGAAGATAAAATATCATTGTCTATGAGTAGCTATATATTTATATTGAAATGCTAATACTTCTACTTTTAGGGTAACTTTATCTCGCGTGGATTTATTTTCTCTTGATAGATTCTATAGTTCAAGATTGTGATTGATAGTTTTGCTTTGTTCTTTTGTAAATGGTATATATATTTTTCCTATTAGAAAGATATGTTAAATACATATGTTCTCCCTTGTACATTTATTTTCAGGGCGTTAAGAGTGATTTTAGAACGAGCTCTGGAATGTTTTTAAGTCATCATGAGAAAAACTATCCAATGGTCCAGGTATTAATTTCAAAGAAAGTGTTGGTATATTTTGATGTTTGAAGTAATGGATATGGATGTGCTCAATAGTCAACTTTATGGAAAAAACTTTCAATTTAGTTAAAGGTATTCTTCCGGATAAATTTACAAATTTGTTCCAATTGTTTTAGACAAGATCTTTATTTAAAGAAAACATAAATGATCATGTACGACTCATTTACAGTATAGCCTATAGGCATGCTAAGCAAATGTAGTATATAGTTTATTTTTAAACTATAGACTATTCCACAAACTTCAAATTTTCATTAACTTCAAATAGCGAAGCTCAGTCCTTAGAGTGACAAAAAACAGTAGCAGACTTTAAACTAGTAGTCACTTAACAGTAAATAAGTGTTATTTCTATCCAAAAAAAGAAAAAGAAAAAAAGGAAAAGGAAAAGAAAAAGGAGAGTAAATAAGTGGCCGACCAAAAACTTAGAATTACAGCATCAGCTACCCAATATTTACCCCACGGCCAAACACAAGAATCTAAAGATTTTTTAAAAGATCTTGGAAGAACTTTTAAAGAAACAGATCAAAAACTTCGAAAGCAACAGATCACCTTATTTCCTTCCCTCCAAAGGAAAGCTTGTCCTTAAGTTTTAGGCCACCAAATAAAAGGTTTCAGGAGATTATTTAAAATTCAGCAACATTGAAGGTGGGGTTGATCTTGAGATTTTGAGTTAGTTACTTTACATGCACTGCCACCAAATCTCTTAAGATTTTGAAAAGGCCCAATCTTTCTTAGGATGAATCTTAGAGTAGAACCAGAAGAAAATCTTGTTATCTTGTCATTCACAAGTTATCAAATTTCCAAACTTGAACTATTGTTTGTCCTCCATGTTGGAAATTTACGAGATATAAATAAATGTAGAATAAACTAAATTATATCTCAAACTCCTACTTTATAGGTTAGAGCCAAAATAAGTAATCGGTCATACAATAGCAAATAATAAAAAGTAGGGAAAAGAGGAACGACGCTGAAAAATAAACTGGTTCGTCCCAAACTCAGAACTACGTTTAGTCTTCTGCAATTGCATGTTCTACTATTTTGGAGATTGCAAAGAACCTTCAAACATGCTTACAAATCTCTTCTAAATAGCTTAGAGAAACTCACAACCCAACAAGAAATAATACTCTTATGTATTTTTCTTTTCCTCGATGCCACACAAAAACGACACTGAAATTTGAATAACTTCAAACTAAAGTCTTCAAATTGCCGTCTGGTGCATTTTCTCCACTTCCGGCGAGGACCTCACGGCAGACCTTCTGCAGTTTTTCTTCTTCAACACAAACATATGTTTATACGTGTGTTGCTGTCAAATGGGCAAAGCTCAATAATTATTTGGTGTCTCTCTACCGCCCCAAAATTTTCTTTTAAAAAGAAGTCACCTTTATAATTATAATAGGTAATTAAACAATTTTGCTCGCACATTACCCAATTAAAAAGAAGCTTATGGGTTAAGTAAATGAGTTGGATTTGGAGTTTTGCAATATCCTAAAACTCAAGTAATGCAGCAAAAATTATTCTAACCTCAGTGCGCTAACCACACTGCATTTCCCTGATGCATTTTTACTGCTGTGTCTCACAATGCGTTGTGAAAATGTCCAACTATTTGAGATTAAACATTTCCTCAAGGCACCTATTTTTGCCTTAGCCAATTTCACCCGGCTCTCTTATTACAGTTCTTCCTGAAAGTTAACTTACTTCCTTCATCCTTCATACTACATGTTCTTATTTTTGGGGTTCTCTCTCTCGGTTGGTGATAGTCTGAGACTCTGATAGATTTCCTGTGTTTTGGAATCTGTGAAAGCAGGCAATTGAAAAAAGAATTTCTGTCTATTCTCAAATACCAGTAGAAAATGGAGAGCTAATTCAAGTGTTAAGGTGTGGATATAATTTTGTTTCTGTAAGCACGTGAACATGAAAAGTGGTTTGTACTTCAGTTTTACGTCTAACATTTTAAGCAGTACAGGTACGAGAAGAATCAATTTTACAAGCCTCATCATGACTACTTTTCTGATACTGTAAGATTTTTGTTATTTCTTCAGCCATGAGCCCCAGTATTCAAAAATCAGTATTTATTTATATTTTTCTTTAAGTAATGTGAACTTGGGCTCTTGCTGATGGCATGTGCCATTTCTTCCATGTCTGCCTTTTACAGTTTAACTTGAAGCGTGGTGGTCAGCGGATTGCAACTATGCTTATGTATCTAAGTGAAAACATTGAAGGAGGAGAAACCTACTTTCCGAAGGTATTTACCATTATCCTTGCTGAATTAAATTCAGTATTTAGATGGAGTGCTATGACACACGGCAGAATTTAGCATAAATTTGAAAGACAGAAAAATGGCAGCTTTTCCCATTTTAAATCTTGTTTTCTTGTCGGATCCAAGTTTCTTGGAATTGTCATGTCCTGAATTTTCCTTTTGCATACTTTCTTCAGGCTGGTTCTGGTGAGTGTAGTTGTGGTGGGAAGACCGCTCCAGGACTGTCAGTTAAACCAGCCAAAGGAGATGCAGTACTTTTCTGGAGCATGGTATGGATCTCTTGCCTCTTAAAATGAAATGAGTATGACGTGAAAACAAAATGATTATGATTTTGACTACTAAATCAAGAAATCGACACCTAAGGGATGCCTTATGATAGATAAGGAGGGCTTAAGATTTGTTTCAGAGTTACCCTAGAAGTCGTGGCCTAGGACCCCAGCCTCAATTCAAATTTCAAAAGCTTAATAACTAGGAGTTAGTTTGCCCGATGTTTCTGTTTCTTGATTCTCATTTTTCGTTTTTCTTTTCAAAATTGTGTAGAAATTTTGGAAGTAGCAAAATTGAGTTTCTTTTTTCTTGTTTATGTCTCTGGTTTTTCTCTAAAACTAAAACTTAAATTACAGCATTATTTTATCATTCTTGACTTATAAAAGGGATAATAATCAATATTTAAAAACAATTAACATAAATTTTTTTTGGATATTTAACTTTAAAAATCAATTGTCCTGAATATTTATTACTTTTTTAATCTAAAATGTTTTAGAACAAGAAACCATAAATGGTTACGAAAACAAGAAACAATGATCAAACACAAATTTGTTTTTCTTTATTAAAAAAGTAAAAATGACAACAAGAACCTGTAAGTGAGAAATGAAAAACAGGAACTTCACCAAATGAACAAGCCCTTAAATTCTTTCTGGATTTCTGGAGTTGATCTTAAACTCAAAGTGGATCAAGAGATCAAAGTAGGAGTGACCCTTGAGTATACCAATAATAATGAAAAGGGGGTACCCTGTTCAAGAACATAAAACCAACCATTGCTTAAAGCCTGTTTGGAATGAACTTCTCAATAGAAGGCTGTTGTTTTAAACCAATTCCTTCGATATGTAATTGCTTCAGGGGTTAGATGGACAATCAGATCCAAATAGCATTCATGGAGGGTGTGAAGTACTGTCAGGGGAAAAATGGTCTGCCACAAAATGGATGAGGCAAAAGAGTACTCTGGTACCATAATTCATACTTTCTAGTTTCATTTGTATTGTATATCACCATTGAATATTTTGTTACATATATCAACTAATAAATTTATATATAGAGAGAGAAAGAAAAATGGAGAGGCTAATTTAGATAGCATCTTAACATAATTAAGCACACTTAGAATGAATCAATTTATTTAATGAATAATACAGTACAAGCGGCATCTGATGTTTCTTTTCTCTTTTGTCAACGTGTGGAAATTCTCTATATTTTTCTTGCATGACTATTATAAGGTTAAGTCTCAATAATCAACAAAAGTTTCTCCAAGAACTATTGAGGGA

mRNA sequence

AAAAGCATGAATTCCCTAATAAACACAAATATTTGAACAAAGAAATCAAAAGGATCTCAAATCACAATCACTTTTTCTTCACAGTTACGGAAGAAATCATAGTGCACGAGAAAGATGATGAACAACAACAAGCACTAGAAGCAAATCCTCTGCACTGGGTCATCCACACTTCAAACACTAACGGCTTCTCCTCAAGAATTTTGAGTCAAATTTTTGTTTAGGCAGCTATGGTTTCCGCTCAGATGAGGATTGTCTTCGGTCTCTTGACCTTTGTCACCGTCGGCATGATCATCGGAGGAGTTTTCTTGTTTGCGACTTTCTTTGAGCTGATTTCTTTAGAGGAGCGAGGAATTGCTGGTGCTTTGTTACAACTAGCATTTTTAAGAAGGCTGGAGGACTCTATTGGCACGGAGTTTCTATCTGCTGGAAGGTTACATAAAACTCAGTATGATGGCCAACGTCAATTACCCCGAGGCCTTCCTAATTGGATTAATGACAAAGAAGCAGAAATTCTTCGTCTTGGCTATGTTAAACCAGAAGTAGTAAGCTGGTCACCACGAATCGTAGTATTTCATAATTTTTTGAGCACAGAGGAGTGCGACTACCTTAAGGGAATAGCACTTCCTCGCCTTGAACTTTCCACTGTCGTGGATACGAAAACTGGGAAGGGCGTTAAGAGTGATTTTAGAACGAGCTCTGGAATGTTTTTAAGTCATCATGAGAAAAACTATCCAATGGTCCAGGCAATTGAAAAAAGAATTTCTGTCTATTCTCAAATACCAGTAGAAAATGGAGAGCTAATTCAAGTGTTAAGGTACGAGAAGAATCAATTTTACAAGCCTCATCATGACTACTTTTCTGATACTTTTAACTTGAAGCGTGGTGGTCAGCGGATTGCAACTATGCTTATGTATCTAAGTGAAAACATTGAAGGAGGAGAAACCTACTTTCCGAAGGCTGGTTCTGGTGAGTGTAGTTGTGGTGGGAAGACCGCTCCAGGACTGTCAGTTAAACCAGCCAAAGGAGATGCAGTACTTTTCTGGAGCATGGGGTTAGATGGACAATCAGATCCAAATAGCATTCATGGAGGGTGTGAAGTACTGTCAGGGGAAAAATGGTCTGCCACAAAATGGATGAGGCAAAAGAGTACTCTGGTACCATAATTCATACTTTCTAGTTTCATTTGTATTGTATATCACCATTGAATATTTTGTTACATATATCAACTAATAAATTTATATATAGAGAGAGAAAGAAAAATGGAGAGGCTAATTTAGATAGCATCTTAACATAATTAAGCACACTTAGAATGAATCAATTTATTTAATGAATAATACAGTACAAGCGGCATCTGATGTTTCTTTTCTCTTTTGTCAACGTGTGGAAATTCTCTATATTTTTCTTGCATGACTATTATAAGGTTAAGTCTCAATAATCAACAAAAGTTTCTCCAAGAACTATTGAGGGA

Coding sequence (CDS)

ATGGTTTCCGCTCAGATGAGGATTGTCTTCGGTCTCTTGACCTTTGTCACCGTCGGCATGATCATCGGAGGAGTTTTCTTGTTTGCGACTTTCTTTGAGCTGATTTCTTTAGAGGAGCGAGGAATTGCTGGTGCTTTGTTACAACTAGCATTTTTAAGAAGGCTGGAGGACTCTATTGGCACGGAGTTTCTATCTGCTGGAAGGTTACATAAAACTCAGTATGATGGCCAACGTCAATTACCCCGAGGCCTTCCTAATTGGATTAATGACAAAGAAGCAGAAATTCTTCGTCTTGGCTATGTTAAACCAGAAGTAGTAAGCTGGTCACCACGAATCGTAGTATTTCATAATTTTTTGAGCACAGAGGAGTGCGACTACCTTAAGGGAATAGCACTTCCTCGCCTTGAACTTTCCACTGTCGTGGATACGAAAACTGGGAAGGGCGTTAAGAGTGATTTTAGAACGAGCTCTGGAATGTTTTTAAGTCATCATGAGAAAAACTATCCAATGGTCCAGGCAATTGAAAAAAGAATTTCTGTCTATTCTCAAATACCAGTAGAAAATGGAGAGCTAATTCAAGTGTTAAGGTACGAGAAGAATCAATTTTACAAGCCTCATCATGACTACTTTTCTGATACTTTTAACTTGAAGCGTGGTGGTCAGCGGATTGCAACTATGCTTATGTATCTAAGTGAAAACATTGAAGGAGGAGAAACCTACTTTCCGAAGGCTGGTTCTGGTGAGTGTAGTTGTGGTGGGAAGACCGCTCCAGGACTGTCAGTTAAACCAGCCAAAGGAGATGCAGTACTTTTCTGGAGCATGGGGTTAGATGGACAATCAGATCCAAATAGCATTCATGGAGGGTGTGAAGTACTGTCAGGGGAAAAATGGTCTGCCACAAAATGGATGAGGCAAAAGAGTACTCTGGTACCATAA

Protein sequence

MVSAQMRIVFGLLTFVTVGMIIGGVFLFATFFELISLEERGIAGALLQLAFLRRLEDSIGTEFLSAGRLHKTQYDGQRQLPRGLPNWINDKEAEILRLGYVKPEVVSWSPRIVVFHNFLSTEECDYLKGIALPRLELSTVVDTKTGKGVKSDFRTSSGMFLSHHEKNYPMVQAIEKRISVYSQIPVENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFPKAGSGECSCGGKTAPGLSVKPAKGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWSATKWMRQKSTLVP
Homology
BLAST of PI0020559 vs. ExPASy Swiss-Prot
Match: Q9ZW86 (Prolyl 4-hydroxylase 1 OS=Arabidopsis thaliana OX=3702 GN=P4H1 PE=1 SV=1)

HSP 1 Score: 427.6 bits (1098), Expect = 1.2e-118
Identity = 212/303 (69.97%), Postives = 244/303 (80.53%), Query Frame = 0

Query: 6   MRIVFGLLTFVTVGMIIGGVFLFATFFELISLEERGIAGALLQLAFLRRLEDSIGTEFLS 65
           M+IVFGLLTFVTVGM+I                     G+LLQLAF+ RLEDS GT F S
Sbjct: 5   MKIVFGLLTFVTVGMVI---------------------GSLLQLAFINRLEDSYGTGFPS 64

Query: 66  AGRLHKTQYDGQRQLPRGLPNWINDKEAEILRLGYVKPEVVSWSPRIVVFHNFLSTEECD 125
              L   +    R L R +  W NDK+AE+LR+G VKPEVVSWSPRI+V H+FLS EEC+
Sbjct: 65  ---LRGLRGQNTRYL-RDVSRWANDKDAELLRIGNVKPEVVSWSPRIIVLHDFLSPEECE 124

Query: 126 YLKGIALPRLELSTVVDTKTGKGVKSDFRTSSGMFLSHHEKNYPMVQAIEKRISVYSQIP 185
           YLK IA PRL++STVVD KTGKGVKSD RTSSGMFL+H E++YP++QAIEKRI+V+SQ+P
Sbjct: 125 YLKAIARPRLQVSTVVDVKTGKGVKSDVRTSSGMFLTHVERSYPIIQAIEKRIAVFSQVP 184

Query: 186 VENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFPKAG 245
            ENGELIQVLRYE  QFYKPHHDYF+DTFNLKRGGQR+ATMLMYL++++EGGETYFP AG
Sbjct: 185 AENGELIQVLRYEPQQFYKPHHDYFADTFNLKRGGQRVATMLMYLTDDVEGGETYFPLAG 244

Query: 246 SGECSCGGKTAPGLSVKPAKGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWSATKWMRQ 305
            G+C+CGGK   G+SVKP KGDAVLFWSMGLDGQSDP SIHGGCEVLSGEKWSATKWMRQ
Sbjct: 245 DGDCTCGGKIMKGISVKPTKGDAVLFWSMGLDGQSDPRSIHGGCEVLSGEKWSATKWMRQ 282

Query: 306 KST 309
           K+T
Sbjct: 305 KAT 282

BLAST of PI0020559 vs. ExPASy Swiss-Prot
Match: Q9LN20 (Probable prolyl 4-hydroxylase 3 OS=Arabidopsis thaliana OX=3702 GN=P4H3 PE=2 SV=1)

HSP 1 Score: 224.2 bits (570), Expect = 2.1e-57
Identity = 116/209 (55.50%), Postives = 139/209 (66.51%), Query Frame = 0

Query: 104 EVVSWSPRIVVFHNFLSTEECDYLKGIALPRLELSTVVDTKTGKGVKSDFRTSSGMFLSH 163
           EV+SW PR  V+HNFLS EEC+YL  +A P +  STVVD++TGK   S  RTSSG FL  
Sbjct: 77  EVLSWEPRAFVYHNFLSKEECEYLISLAKPHMVKSTVVDSETGKSKDSRVRTSSGTFLRR 136

Query: 164 HEKNYPMVQAIEKRISVYSQIPVENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRI 223
                 +++ IEKRI+ Y+ IP ++GE +QVL YE  Q Y+PH+DYF D FN K GGQR+
Sbjct: 137 GRDK--IIKTIEKRIADYTFIPADHGEGLQVLHYEAGQKYEPHYDYFVDEFNTKNGGQRM 196

Query: 224 ATMLMYLSENIEGGETYFPKAGSGECS---------CGGKTAPGLSVKPAKGDAVLFWSM 283
           ATMLMYLS+  EGGET FP A     S         CG K   GLSVKP  GDA+LFWSM
Sbjct: 197 ATMLMYLSDVEEGGETVFPAANMNFSSVPWYNELSECGKK---GLSVKPRMGDALLFWSM 256

Query: 284 GLDGQSDPNSIHGGCEVLSGEKWSATKWM 304
             D   DP S+HGGC V+ G KWS+TKWM
Sbjct: 257 RPDATLDPTSLHGGCPVIRGNKWSSTKWM 280

BLAST of PI0020559 vs. ExPASy Swiss-Prot
Match: F4JZ24 (Probable prolyl 4-hydroxylase 10 OS=Arabidopsis thaliana OX=3702 GN=P4H10 PE=3 SV=1)

HSP 1 Score: 223.8 bits (569), Expect = 2.7e-57
Identity = 115/212 (54.25%), Postives = 140/212 (66.04%), Query Frame = 0

Query: 104 EVVSWSPRIVVFHNFLSTEECDYLKGIALPRLELSTVVDTKTGKGVKSDFRTSSGMFLSH 163
           E++SW PR  V+HNFL+ EEC YL  +A P +E STVVD KTGK   S  RTSSG FL+ 
Sbjct: 79  EIISWEPRASVYHNFLTKEECKYLIELAKPHMEKSTVVDEKTGKSTDSRVRTSSGTFLAR 138

Query: 164 HEKNYPMVQAIEKRISVYSQIPVENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRI 223
                  ++ IEKRIS ++ IPVE+GE +QVL YE  Q Y+PH+DYF D +N + GGQRI
Sbjct: 139 GRDK--TIREIEKRISDFTFIPVEHGEGLQVLHYEIGQKYEPHYDYFMDEYNTRNGGQRI 198

Query: 224 ATMLMYLSENIEGGETYFPKAGS-----------GECSCGGKTAPGLSVKPAKGDAVLFW 283
           AT+LMYLS+  EGGET FP A              EC  G     GLSVKP  GDA+LFW
Sbjct: 199 ATVLMYLSDVEEGGETVFPAAKGNYSAVPWWNELSECGKG-----GLSVKPKMGDALLFW 258

Query: 284 SMGLDGQSDPNSIHGGCEVLSGEKWSATKWMR 305
           SM  D   DP+S+HGGC V+ G KWS+TKW+R
Sbjct: 259 SMTPDATLDPSSLHGGCAVIKGNKWSSTKWLR 283

BLAST of PI0020559 vs. ExPASy Swiss-Prot
Match: F4JNU8 (Probable prolyl 4-hydroxylase 8 OS=Arabidopsis thaliana OX=3702 GN=P4H8 PE=3 SV=1)

HSP 1 Score: 219.9 bits (559), Expect = 3.9e-56
Identity = 115/208 (55.29%), Postives = 142/208 (68.27%), Query Frame = 0

Query: 104 EVVSWSPRIVVFHNFLSTEECDYLKGIALPRLELSTVVDTKTGKGVKSDFRTSSGMFLSH 163
           EV+SW PR  V+HNFL+ EEC++L  +A P +  S VVD KTGK + S  RTSSG FL+ 
Sbjct: 81  EVISWEPRAFVYHNFLTNEECEHLISLAKPSMMKSKVVDVKTGKSIDSRVRTSSGTFLNR 140

Query: 164 -HEKNYPMVQAIEKRISVYSQIPVENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQR 223
            H++   +V+ IE RIS ++ IP ENGE +QVL YE  Q Y+PHHDYF D FN+++GGQR
Sbjct: 141 GHDE---IVEEIENRISDFTFIPPENGEGLQVLHYEVGQRYEPHHDYFFDEFNVRKGGQR 200

Query: 224 IATMLMYLSENIEGGETYFPKAGSG--------ECSCGGKTAPGLSVKPAKGDAVLFWSM 283
           IAT+LMYLS+  EGGET FP A           E S  GK   GLSV P K DA+LFWSM
Sbjct: 201 IATVLMYLSDVDEGGETVFPAAKGNVSDVPWWDELSQCGK--EGLSVLPKKRDALLFWSM 260

Query: 284 GLDGQSDPNSIHGGCEVLSGEKWSATKW 303
             D   DP+S+HGGC V+ G KWS+TKW
Sbjct: 261 KPDASLDPSSLHGGCPVIKGNKWSSTKW 283

BLAST of PI0020559 vs. ExPASy Swiss-Prot
Match: Q24JN5 (Prolyl 4-hydroxylase 5 OS=Arabidopsis thaliana OX=3702 GN=P4H5 PE=2 SV=1)

HSP 1 Score: 216.5 bits (550), Expect = 4.4e-55
Identity = 115/208 (55.29%), Postives = 141/208 (67.79%), Query Frame = 0

Query: 104 EVVSWSPRIVVFHNFLSTEECDYLKGIALPRLELSTVVDTKTGKGVKSDFRTSSGMFLSH 163
           EV+SW PR VV+HNFL+ EEC++L  +A P +  STVVD KTG    S  RTSSG FL  
Sbjct: 81  EVISWEPRAVVYHNFLTNEECEHLISLAKPSMVKSTVVDEKTGGSKDSRVRTSSGTFLRR 140

Query: 164 -HEKNYPMVQAIEKRISVYSQIPVENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQR 223
            H++   +V+ IEKRIS ++ IPVENGE +QVL Y+  Q Y+PH+DYF D FN K GGQR
Sbjct: 141 GHDE---VVEVIEKRISDFTFIPVENGEGLQVLHYQVGQKYEPHYDYFLDEFNTKNGGQR 200

Query: 224 IATMLMYLSENIEGGETYFPKAGS--------GECSCGGKTAPGLSVKPAKGDAVLFWSM 283
           IAT+LMYLS+  +GGET FP A           E S  GK   GLSV P K DA+LFW+M
Sbjct: 201 IATVLMYLSDVDDGGETVFPAARGNISAVPWWNELSKCGK--EGLSVLPKKRDALLFWNM 260

Query: 284 GLDGQSDPNSIHGGCEVLSGEKWSATKW 303
             D   DP+S+HGGC V+ G KWS+TKW
Sbjct: 261 RPDASLDPSSLHGGCPVVKGNKWSSTKW 283

BLAST of PI0020559 vs. ExPASy TrEMBL
Match: A0A1S3BXE6 (prolyl 4-hydroxylase 1 isoform X3 OS=Cucumis melo OX=3656 GN=LOC103494504 PE=4 SV=1)

HSP 1 Score: 569.3 bits (1466), Expect = 9.8e-159
Identity = 283/311 (91.00%), Postives = 286/311 (91.96%), Query Frame = 0

Query: 1   MVSAQMRIVFGLLTFVTVGMIIGGVFLFATFFELISLEERGIAGALLQLAFLRRLEDSIG 60
           MVSAQMRIVFGLLTFVTVGMII                     GALLQLAFLRRLEDSIG
Sbjct: 1   MVSAQMRIVFGLLTFVTVGMII---------------------GALLQLAFLRRLEDSIG 60

Query: 61  TEFLSAGRLHKTQYDGQRQLPRGLPNWINDKEAEILRLGYVKPEVVSWSPRIVVFHNFLS 120
           TEFL AGRLHKTQYD QRQLPRGLPNWINDKEAEILRLGYVKPEVVSWSPRI+V HNFLS
Sbjct: 61  TEFLPAGRLHKTQYDSQRQLPRGLPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLS 120

Query: 121 TEECDYLKGIALPRLELSTVVDTKTGKGVKSDFRTSSGMFLSHHEKNYPMVQAIEKRISV 180
           TEECDYLKGIALPRLE+STVVDTKTGKGVKSDFRTSSGMFLSHHEKNYPMVQAIEKRISV
Sbjct: 121 TEECDYLKGIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSHHEKNYPMVQAIEKRISV 180

Query: 181 YSQIPVENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETY 240
           YSQIPVENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETY
Sbjct: 181 YSQIPVENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETY 240

Query: 241 FPKAGSGECSCGGKTAPGLSVKPAKGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWSAT 300
           FPKAGSGECSCGGKT PGLSVKPAKGDA+LFWSMGLDGQSDPNSIHGGCEVLSGEKWSAT
Sbjct: 241 FPKAGSGECSCGGKTVPGLSVKPAKGDAILFWSMGLDGQSDPNSIHGGCEVLSGEKWSAT 290

Query: 301 KWMRQKSTLVP 312
           KWMRQKSTLVP
Sbjct: 301 KWMRQKSTLVP 290

BLAST of PI0020559 vs. ExPASy TrEMBL
Match: A0A1S4DZZ9 (prolyl 4-hydroxylase 1 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103494504 PE=4 SV=1)

HSP 1 Score: 557.8 bits (1436), Expect = 2.9e-155
Identity = 283/330 (85.76%), Postives = 286/330 (86.67%), Query Frame = 0

Query: 1   MVSAQMRIVFGLLTFVTVGMIIGGVFLFATFFELISLEERGIAGALLQLAFLRRLEDSIG 60
           MVSAQMRIVFGLLTFVTVGMII                     GALLQLAFLRRLEDSIG
Sbjct: 1   MVSAQMRIVFGLLTFVTVGMII---------------------GALLQLAFLRRLEDSIG 60

Query: 61  TEFLSAGRLHKTQYDGQRQLPR-------------------GLPNWINDKEAEILRLGYV 120
           TEFL AGRLHKTQYD QRQLPR                   GLPNWINDKEAEILRLGYV
Sbjct: 61  TEFLPAGRLHKTQYDSQRQLPRVTVKNREFSKELGGNQFNPGLPNWINDKEAEILRLGYV 120

Query: 121 KPEVVSWSPRIVVFHNFLSTEECDYLKGIALPRLELSTVVDTKTGKGVKSDFRTSSGMFL 180
           KPEVVSWSPRI+V HNFLSTEECDYLKGIALPRLE+STVVDTKTGKGVKSDFRTSSGMFL
Sbjct: 121 KPEVVSWSPRIIVLHNFLSTEECDYLKGIALPRLEISTVVDTKTGKGVKSDFRTSSGMFL 180

Query: 181 SHHEKNYPMVQAIEKRISVYSQIPVENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQ 240
           SHHEKNYPMVQAIEKRISVYSQIPVENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQ
Sbjct: 181 SHHEKNYPMVQAIEKRISVYSQIPVENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQ 240

Query: 241 RIATMLMYLSENIEGGETYFPKAGSGECSCGGKTAPGLSVKPAKGDAVLFWSMGLDGQSD 300
           RIATMLMYLSENIEGGETYFPKAGSGECSCGGKT PGLSVKPAKGDA+LFWSMGLDGQSD
Sbjct: 241 RIATMLMYLSENIEGGETYFPKAGSGECSCGGKTVPGLSVKPAKGDAILFWSMGLDGQSD 300

Query: 301 PNSIHGGCEVLSGEKWSATKWMRQKSTLVP 312
           PNSIHGGCEVLSGEKWSATKWMRQKSTLVP
Sbjct: 301 PNSIHGGCEVLSGEKWSATKWMRQKSTLVP 309

BLAST of PI0020559 vs. ExPASy TrEMBL
Match: A0A0A0KU17 (Fe2OG dioxygenase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_4G017130 PE=4 SV=1)

HSP 1 Score: 554.3 bits (1427), Expect = 3.3e-154
Identity = 275/311 (88.42%), Postives = 281/311 (90.35%), Query Frame = 0

Query: 1   MVSAQMRIVFGLLTFVTVGMIIGGVFLFATFFELISLEERGIAGALLQLAFLRRLEDSIG 60
           MVS+QMRIVFGLLTFVTVGMII                     GALLQLAFLRRLEDSIG
Sbjct: 1   MVSSQMRIVFGLLTFVTVGMII---------------------GALLQLAFLRRLEDSIG 60

Query: 61  TEFLSAGRLHKTQYDGQRQLPRGLPNWINDKEAEILRLGYVKPEVVSWSPRIVVFHNFLS 120
           TEFL AGRLHK QYD Q QLPRG PNWINDKEAEILRLGYVKPEVVSWSPRI+V HNFLS
Sbjct: 61  TEFLPAGRLHKAQYDSQHQLPRGFPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLS 120

Query: 121 TEECDYLKGIALPRLELSTVVDTKTGKGVKSDFRTSSGMFLSHHEKNYPMVQAIEKRISV 180
           T+ECDYLKGIAL RLE+STVVDTKTGKGVKSDFRTSSGMFLSHHEKN+PMVQAIEKRISV
Sbjct: 121 TKECDYLKGIALARLEISTVVDTKTGKGVKSDFRTSSGMFLSHHEKNFPMVQAIEKRISV 180

Query: 181 YSQIPVENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETY 240
           YSQ+PVENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETY
Sbjct: 181 YSQVPVENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETY 240

Query: 241 FPKAGSGECSCGGKTAPGLSVKPAKGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWSAT 300
           FPKAGSGECSCGGKT PGLSVKPAKGDAVLFWSMGLDGQSDP SIHGGCEVLSGEKWSAT
Sbjct: 241 FPKAGSGECSCGGKTVPGLSVKPAKGDAVLFWSMGLDGQSDPKSIHGGCEVLSGEKWSAT 290

Query: 301 KWMRQKSTLVP 312
           KWMRQKSTLVP
Sbjct: 301 KWMRQKSTLVP 290

BLAST of PI0020559 vs. ExPASy TrEMBL
Match: A0A6J1CBS4 (prolyl 4-hydroxylase 1 OS=Momordica charantia OX=3673 GN=LOC111009248 PE=4 SV=1)

HSP 1 Score: 548.5 bits (1412), Expect = 1.8e-152
Identity = 267/311 (85.85%), Postives = 279/311 (89.71%), Query Frame = 0

Query: 1   MVSAQMRIVFGLLTFVTVGMIIGGVFLFATFFELISLEERGIAGALLQLAFLRRLEDSIG 60
           M SA MRIVFGLLTFVT+GMII                     GAL QLAF+RRLEDS G
Sbjct: 1   MASAPMRIVFGLLTFVTLGMII---------------------GALFQLAFIRRLEDSYG 60

Query: 61  TEFLSAGRLHKTQYDGQRQLPRGLPNWINDKEAEILRLGYVKPEVVSWSPRIVVFHNFLS 120
           TEFLSAGRLHKTQYDG RQLPRG PNWIND+EAEILRLGYVKPEVVSWSPRI+V HNFLS
Sbjct: 61  TEFLSAGRLHKTQYDGDRQLPRGFPNWINDREAEILRLGYVKPEVVSWSPRIIVLHNFLS 120

Query: 121 TEECDYLKGIALPRLELSTVVDTKTGKGVKSDFRTSSGMFLSHHEKNYPMVQAIEKRISV 180
           TEECDYL+ +ALPRLE+STVVDTKTGKGVKSDFRTSSGMFLSH EKNYPM+QAIEKRISV
Sbjct: 121 TEECDYLRAVALPRLEVSTVVDTKTGKGVKSDFRTSSGMFLSHQEKNYPMIQAIEKRISV 180

Query: 181 YSQIPVENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETY 240
           YSQIP+ENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQR+ATMLMYLS+N+EGGETY
Sbjct: 181 YSQIPIENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRVATMLMYLSDNVEGGETY 240

Query: 241 FPKAGSGECSCGGKTAPGLSVKPAKGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWSAT 300
           FPKAGSGECSCGGKT PGLSVKP KGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWSAT
Sbjct: 241 FPKAGSGECSCGGKTVPGLSVKPVKGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWSAT 290

Query: 301 KWMRQKSTLVP 312
           KWMRQKSTLVP
Sbjct: 301 KWMRQKSTLVP 290

BLAST of PI0020559 vs. ExPASy TrEMBL
Match: A0A6J1F2K0 (prolyl 4-hydroxylase 1-like isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111439093 PE=4 SV=1)

HSP 1 Score: 538.5 bits (1386), Expect = 1.8e-149
Identity = 264/310 (85.16%), Postives = 276/310 (89.03%), Query Frame = 0

Query: 1   MVSAQMRIVFGLLTFVTVGMIIGGVFLFATFFELISLEERGIAGALLQLAFLRRLEDSIG 60
           M S  MRIVFGLLTFVTVGMII                     GAL QLAF+RRLEDSIG
Sbjct: 1   MASGLMRIVFGLLTFVTVGMII---------------------GALFQLAFIRRLEDSIG 60

Query: 61  TEFLSAGRLHKTQYDGQRQLPRGLPNWINDKEAEILRLGYVKPEVVSWSPRIVVFHNFLS 120
           TEFLSAGRLHKTQYDGQRQ  +GLPNWINDKEAEILRLGYVKPEVVSWSPRI+V HNFLS
Sbjct: 61  TEFLSAGRLHKTQYDGQRQFLQGLPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLS 120

Query: 121 TEECDYLKGIALPRLELSTVVDTKTGKGVKSDFRTSSGMFLSHHEKNYPMVQAIEKRISV 180
           +EECDYLK IALPRLE+STVVDTKTGKG+KSDFRTSSGMFLSH E+NYPMVQAIEKRISV
Sbjct: 121 SEECDYLKAIALPRLEISTVVDTKTGKGIKSDFRTSSGMFLSHQERNYPMVQAIEKRISV 180

Query: 181 YSQIPVENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETY 240
           YSQIP+ENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYL++N+EGGETY
Sbjct: 181 YSQIPIENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLTDNVEGGETY 240

Query: 241 FPKAGSGECSCGGKTAPGLSVKPAKGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWSAT 300
           FPKAGSG CSCGGKT PGLSVKP KGDAVLFWSMGLDGQSDPNSIHGGCEVL GEKWSAT
Sbjct: 241 FPKAGSGMCSCGGKTVPGLSVKPVKGDAVLFWSMGLDGQSDPNSIHGGCEVLGGEKWSAT 289

Query: 301 KWMRQKSTLV 311
           KWMRQKSTL+
Sbjct: 301 KWMRQKSTLI 289

BLAST of PI0020559 vs. NCBI nr
Match: XP_008453925.1 (PREDICTED: prolyl 4-hydroxylase 1 isoform X3 [Cucumis melo])

HSP 1 Score: 569.3 bits (1466), Expect = 2.0e-158
Identity = 283/311 (91.00%), Postives = 286/311 (91.96%), Query Frame = 0

Query: 1   MVSAQMRIVFGLLTFVTVGMIIGGVFLFATFFELISLEERGIAGALLQLAFLRRLEDSIG 60
           MVSAQMRIVFGLLTFVTVGMII                     GALLQLAFLRRLEDSIG
Sbjct: 1   MVSAQMRIVFGLLTFVTVGMII---------------------GALLQLAFLRRLEDSIG 60

Query: 61  TEFLSAGRLHKTQYDGQRQLPRGLPNWINDKEAEILRLGYVKPEVVSWSPRIVVFHNFLS 120
           TEFL AGRLHKTQYD QRQLPRGLPNWINDKEAEILRLGYVKPEVVSWSPRI+V HNFLS
Sbjct: 61  TEFLPAGRLHKTQYDSQRQLPRGLPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLS 120

Query: 121 TEECDYLKGIALPRLELSTVVDTKTGKGVKSDFRTSSGMFLSHHEKNYPMVQAIEKRISV 180
           TEECDYLKGIALPRLE+STVVDTKTGKGVKSDFRTSSGMFLSHHEKNYPMVQAIEKRISV
Sbjct: 121 TEECDYLKGIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSHHEKNYPMVQAIEKRISV 180

Query: 181 YSQIPVENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETY 240
           YSQIPVENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETY
Sbjct: 181 YSQIPVENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETY 240

Query: 241 FPKAGSGECSCGGKTAPGLSVKPAKGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWSAT 300
           FPKAGSGECSCGGKT PGLSVKPAKGDA+LFWSMGLDGQSDPNSIHGGCEVLSGEKWSAT
Sbjct: 241 FPKAGSGECSCGGKTVPGLSVKPAKGDAILFWSMGLDGQSDPNSIHGGCEVLSGEKWSAT 290

Query: 301 KWMRQKSTLVP 312
           KWMRQKSTLVP
Sbjct: 301 KWMRQKSTLVP 290

BLAST of PI0020559 vs. NCBI nr
Match: XP_038904320.1 (prolyl 4-hydroxylase 1 isoform X1 [Benincasa hispida])

HSP 1 Score: 558.5 bits (1438), Expect = 3.6e-155
Identity = 279/311 (89.71%), Postives = 282/311 (90.68%), Query Frame = 0

Query: 1   MVSAQMRIVFGLLTFVTVGMIIGGVFLFATFFELISLEERGIAGALLQLAFLRRLEDSIG 60
           M SA MRIVFGLLTFVTVGMII                     GALLQLAF+RRLEDSIG
Sbjct: 1   MASAPMRIVFGLLTFVTVGMII---------------------GALLQLAFIRRLEDSIG 60

Query: 61  TEFLSAGRLHKTQYDGQRQLPRGLPNWINDKEAEILRLGYVKPEVVSWSPRIVVFHNFLS 120
           TEFLSAGRLHKTQYD QRQL RGLPNWINDKEAEILRLGYVKPEVVSWSPRI+V HNFLS
Sbjct: 61  TEFLSAGRLHKTQYDSQRQLSRGLPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLS 120

Query: 121 TEECDYLKGIALPRLELSTVVDTKTGKGVKSDFRTSSGMFLSHHEKNYPMVQAIEKRISV 180
           TEECDYLK IALPRLE+STVVDTKTGKGVKSDFRTSSGMFLSH EKNYPMVQAIEKRISV
Sbjct: 121 TEECDYLKAIALPRLEISTVVDTKTGKGVKSDFRTSSGMFLSHQEKNYPMVQAIEKRISV 180

Query: 181 YSQIPVENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETY 240
           YSQIPVENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETY
Sbjct: 181 YSQIPVENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETY 240

Query: 241 FPKAGSGECSCGGKTAPGLSVKPAKGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWSAT 300
           FPKAGSGECSCGGKT PGLSVKPAKGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWSAT
Sbjct: 241 FPKAGSGECSCGGKTVPGLSVKPAKGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWSAT 290

Query: 301 KWMRQKSTLVP 312
           KWMRQKSTLVP
Sbjct: 301 KWMRQKSTLVP 290

BLAST of PI0020559 vs. NCBI nr
Match: XP_016901567.1 (PREDICTED: prolyl 4-hydroxylase 1 isoform X1 [Cucumis melo])

HSP 1 Score: 557.8 bits (1436), Expect = 6.1e-155
Identity = 283/330 (85.76%), Postives = 286/330 (86.67%), Query Frame = 0

Query: 1   MVSAQMRIVFGLLTFVTVGMIIGGVFLFATFFELISLEERGIAGALLQLAFLRRLEDSIG 60
           MVSAQMRIVFGLLTFVTVGMII                     GALLQLAFLRRLEDSIG
Sbjct: 1   MVSAQMRIVFGLLTFVTVGMII---------------------GALLQLAFLRRLEDSIG 60

Query: 61  TEFLSAGRLHKTQYDGQRQLPR-------------------GLPNWINDKEAEILRLGYV 120
           TEFL AGRLHKTQYD QRQLPR                   GLPNWINDKEAEILRLGYV
Sbjct: 61  TEFLPAGRLHKTQYDSQRQLPRVTVKNREFSKELGGNQFNPGLPNWINDKEAEILRLGYV 120

Query: 121 KPEVVSWSPRIVVFHNFLSTEECDYLKGIALPRLELSTVVDTKTGKGVKSDFRTSSGMFL 180
           KPEVVSWSPRI+V HNFLSTEECDYLKGIALPRLE+STVVDTKTGKGVKSDFRTSSGMFL
Sbjct: 121 KPEVVSWSPRIIVLHNFLSTEECDYLKGIALPRLEISTVVDTKTGKGVKSDFRTSSGMFL 180

Query: 181 SHHEKNYPMVQAIEKRISVYSQIPVENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQ 240
           SHHEKNYPMVQAIEKRISVYSQIPVENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQ
Sbjct: 181 SHHEKNYPMVQAIEKRISVYSQIPVENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQ 240

Query: 241 RIATMLMYLSENIEGGETYFPKAGSGECSCGGKTAPGLSVKPAKGDAVLFWSMGLDGQSD 300
           RIATMLMYLSENIEGGETYFPKAGSGECSCGGKT PGLSVKPAKGDA+LFWSMGLDGQSD
Sbjct: 241 RIATMLMYLSENIEGGETYFPKAGSGECSCGGKTVPGLSVKPAKGDAILFWSMGLDGQSD 300

Query: 301 PNSIHGGCEVLSGEKWSATKWMRQKSTLVP 312
           PNSIHGGCEVLSGEKWSATKWMRQKSTLVP
Sbjct: 301 PNSIHGGCEVLSGEKWSATKWMRQKSTLVP 309

BLAST of PI0020559 vs. NCBI nr
Match: XP_004152082.1 (prolyl 4-hydroxylase 1 [Cucumis sativus] >KGN53125.1 hypothetical protein Csa_014405 [Cucumis sativus])

HSP 1 Score: 554.3 bits (1427), Expect = 6.7e-154
Identity = 275/311 (88.42%), Postives = 281/311 (90.35%), Query Frame = 0

Query: 1   MVSAQMRIVFGLLTFVTVGMIIGGVFLFATFFELISLEERGIAGALLQLAFLRRLEDSIG 60
           MVS+QMRIVFGLLTFVTVGMII                     GALLQLAFLRRLEDSIG
Sbjct: 1   MVSSQMRIVFGLLTFVTVGMII---------------------GALLQLAFLRRLEDSIG 60

Query: 61  TEFLSAGRLHKTQYDGQRQLPRGLPNWINDKEAEILRLGYVKPEVVSWSPRIVVFHNFLS 120
           TEFL AGRLHK QYD Q QLPRG PNWINDKEAEILRLGYVKPEVVSWSPRI+V HNFLS
Sbjct: 61  TEFLPAGRLHKAQYDSQHQLPRGFPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLS 120

Query: 121 TEECDYLKGIALPRLELSTVVDTKTGKGVKSDFRTSSGMFLSHHEKNYPMVQAIEKRISV 180
           T+ECDYLKGIAL RLE+STVVDTKTGKGVKSDFRTSSGMFLSHHEKN+PMVQAIEKRISV
Sbjct: 121 TKECDYLKGIALARLEISTVVDTKTGKGVKSDFRTSSGMFLSHHEKNFPMVQAIEKRISV 180

Query: 181 YSQIPVENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETY 240
           YSQ+PVENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETY
Sbjct: 181 YSQVPVENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETY 240

Query: 241 FPKAGSGECSCGGKTAPGLSVKPAKGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWSAT 300
           FPKAGSGECSCGGKT PGLSVKPAKGDAVLFWSMGLDGQSDP SIHGGCEVLSGEKWSAT
Sbjct: 241 FPKAGSGECSCGGKTVPGLSVKPAKGDAVLFWSMGLDGQSDPKSIHGGCEVLSGEKWSAT 290

Query: 301 KWMRQKSTLVP 312
           KWMRQKSTLVP
Sbjct: 301 KWMRQKSTLVP 290

BLAST of PI0020559 vs. NCBI nr
Match: XP_022137963.1 (prolyl 4-hydroxylase 1 [Momordica charantia])

HSP 1 Score: 548.5 bits (1412), Expect = 3.7e-152
Identity = 267/311 (85.85%), Postives = 279/311 (89.71%), Query Frame = 0

Query: 1   MVSAQMRIVFGLLTFVTVGMIIGGVFLFATFFELISLEERGIAGALLQLAFLRRLEDSIG 60
           M SA MRIVFGLLTFVT+GMII                     GAL QLAF+RRLEDS G
Sbjct: 1   MASAPMRIVFGLLTFVTLGMII---------------------GALFQLAFIRRLEDSYG 60

Query: 61  TEFLSAGRLHKTQYDGQRQLPRGLPNWINDKEAEILRLGYVKPEVVSWSPRIVVFHNFLS 120
           TEFLSAGRLHKTQYDG RQLPRG PNWIND+EAEILRLGYVKPEVVSWSPRI+V HNFLS
Sbjct: 61  TEFLSAGRLHKTQYDGDRQLPRGFPNWINDREAEILRLGYVKPEVVSWSPRIIVLHNFLS 120

Query: 121 TEECDYLKGIALPRLELSTVVDTKTGKGVKSDFRTSSGMFLSHHEKNYPMVQAIEKRISV 180
           TEECDYL+ +ALPRLE+STVVDTKTGKGVKSDFRTSSGMFLSH EKNYPM+QAIEKRISV
Sbjct: 121 TEECDYLRAVALPRLEVSTVVDTKTGKGVKSDFRTSSGMFLSHQEKNYPMIQAIEKRISV 180

Query: 181 YSQIPVENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETY 240
           YSQIP+ENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQR+ATMLMYLS+N+EGGETY
Sbjct: 181 YSQIPIENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRVATMLMYLSDNVEGGETY 240

Query: 241 FPKAGSGECSCGGKTAPGLSVKPAKGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWSAT 300
           FPKAGSGECSCGGKT PGLSVKP KGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWSAT
Sbjct: 241 FPKAGSGECSCGGKTVPGLSVKPVKGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWSAT 290

Query: 301 KWMRQKSTLVP 312
           KWMRQKSTLVP
Sbjct: 301 KWMRQKSTLVP 290

BLAST of PI0020559 vs. TAIR 10
Match: AT2G43080.1 (P4H isoform 1 )

HSP 1 Score: 427.6 bits (1098), Expect = 8.8e-120
Identity = 212/303 (69.97%), Postives = 244/303 (80.53%), Query Frame = 0

Query: 6   MRIVFGLLTFVTVGMIIGGVFLFATFFELISLEERGIAGALLQLAFLRRLEDSIGTEFLS 65
           M+IVFGLLTFVTVGM+I                     G+LLQLAF+ RLEDS GT F S
Sbjct: 5   MKIVFGLLTFVTVGMVI---------------------GSLLQLAFINRLEDSYGTGFPS 64

Query: 66  AGRLHKTQYDGQRQLPRGLPNWINDKEAEILRLGYVKPEVVSWSPRIVVFHNFLSTEECD 125
              L   +    R L R +  W NDK+AE+LR+G VKPEVVSWSPRI+V H+FLS EEC+
Sbjct: 65  ---LRGLRGQNTRYL-RDVSRWANDKDAELLRIGNVKPEVVSWSPRIIVLHDFLSPEECE 124

Query: 126 YLKGIALPRLELSTVVDTKTGKGVKSDFRTSSGMFLSHHEKNYPMVQAIEKRISVYSQIP 185
           YLK IA PRL++STVVD KTGKGVKSD RTSSGMFL+H E++YP++QAIEKRI+V+SQ+P
Sbjct: 125 YLKAIARPRLQVSTVVDVKTGKGVKSDVRTSSGMFLTHVERSYPIIQAIEKRIAVFSQVP 184

Query: 186 VENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFPKAG 245
            ENGELIQVLRYE  QFYKPHHDYF+DTFNLKRGGQR+ATMLMYL++++EGGETYFP AG
Sbjct: 185 AENGELIQVLRYEPQQFYKPHHDYFADTFNLKRGGQRVATMLMYLTDDVEGGETYFPLAG 244

Query: 246 SGECSCGGKTAPGLSVKPAKGDAVLFWSMGLDGQSDPNSIHGGCEVLSGEKWSATKWMRQ 305
            G+C+CGGK   G+SVKP KGDAVLFWSMGLDGQSDP SIHGGCEVLSGEKWSATKWMRQ
Sbjct: 245 DGDCTCGGKIMKGISVKPTKGDAVLFWSMGLDGQSDPRSIHGGCEVLSGEKWSATKWMRQ 282

Query: 306 KST 309
           K+T
Sbjct: 305 KAT 282

BLAST of PI0020559 vs. TAIR 10
Match: AT1G20270.1 (2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein )

HSP 1 Score: 224.2 bits (570), Expect = 1.5e-58
Identity = 116/209 (55.50%), Postives = 139/209 (66.51%), Query Frame = 0

Query: 104 EVVSWSPRIVVFHNFLSTEECDYLKGIALPRLELSTVVDTKTGKGVKSDFRTSSGMFLSH 163
           EV+SW PR  V+HNFLS EEC+YL  +A P +  STVVD++TGK   S  RTSSG FL  
Sbjct: 77  EVLSWEPRAFVYHNFLSKEECEYLISLAKPHMVKSTVVDSETGKSKDSRVRTSSGTFLRR 136

Query: 164 HEKNYPMVQAIEKRISVYSQIPVENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRI 223
                 +++ IEKRI+ Y+ IP ++GE +QVL YE  Q Y+PH+DYF D FN K GGQR+
Sbjct: 137 GRDK--IIKTIEKRIADYTFIPADHGEGLQVLHYEAGQKYEPHYDYFVDEFNTKNGGQRM 196

Query: 224 ATMLMYLSENIEGGETYFPKAGSGECS---------CGGKTAPGLSVKPAKGDAVLFWSM 283
           ATMLMYLS+  EGGET FP A     S         CG K   GLSVKP  GDA+LFWSM
Sbjct: 197 ATMLMYLSDVEEGGETVFPAANMNFSSVPWYNELSECGKK---GLSVKPRMGDALLFWSM 256

Query: 284 GLDGQSDPNSIHGGCEVLSGEKWSATKWM 304
             D   DP S+HGGC V+ G KWS+TKWM
Sbjct: 257 RPDATLDPTSLHGGCPVIRGNKWSSTKWM 280

BLAST of PI0020559 vs. TAIR 10
Match: AT5G66060.1 (2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein )

HSP 1 Score: 223.8 bits (569), Expect = 1.9e-58
Identity = 115/212 (54.25%), Postives = 140/212 (66.04%), Query Frame = 0

Query: 104 EVVSWSPRIVVFHNFLSTEECDYLKGIALPRLELSTVVDTKTGKGVKSDFRTSSGMFLSH 163
           E++SW PR  V+HNFL+ EEC YL  +A P +E STVVD KTGK   S  RTSSG FL+ 
Sbjct: 79  EIISWEPRASVYHNFLTKEECKYLIELAKPHMEKSTVVDEKTGKSTDSRVRTSSGTFLAR 138

Query: 164 HEKNYPMVQAIEKRISVYSQIPVENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRI 223
                  ++ IEKRIS ++ IPVE+GE +QVL YE  Q Y+PH+DYF D +N + GGQRI
Sbjct: 139 GRDK--TIREIEKRISDFTFIPVEHGEGLQVLHYEIGQKYEPHYDYFMDEYNTRNGGQRI 198

Query: 224 ATMLMYLSENIEGGETYFPKAGS-----------GECSCGGKTAPGLSVKPAKGDAVLFW 283
           AT+LMYLS+  EGGET FP A              EC  G     GLSVKP  GDA+LFW
Sbjct: 199 ATVLMYLSDVEEGGETVFPAAKGNYSAVPWWNELSECGKG-----GLSVKPKMGDALLFW 258

Query: 284 SMGLDGQSDPNSIHGGCEVLSGEKWSATKWMR 305
           SM  D   DP+S+HGGC V+ G KWS+TKW+R
Sbjct: 259 SMTPDATLDPSSLHGGCAVIKGNKWSSTKWLR 283

BLAST of PI0020559 vs. TAIR 10
Match: AT4G35810.1 (2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein )

HSP 1 Score: 219.9 bits (559), Expect = 2.8e-57
Identity = 115/208 (55.29%), Postives = 142/208 (68.27%), Query Frame = 0

Query: 104 EVVSWSPRIVVFHNFLSTEECDYLKGIALPRLELSTVVDTKTGKGVKSDFRTSSGMFLSH 163
           EV+SW PR  V+HNFL+ EEC++L  +A P +  S VVD KTGK + S  RTSSG FL+ 
Sbjct: 81  EVISWEPRAFVYHNFLTNEECEHLISLAKPSMMKSKVVDVKTGKSIDSRVRTSSGTFLNR 140

Query: 164 -HEKNYPMVQAIEKRISVYSQIPVENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQR 223
            H++   +V+ IE RIS ++ IP ENGE +QVL YE  Q Y+PHHDYF D FN+++GGQR
Sbjct: 141 GHDE---IVEEIENRISDFTFIPPENGEGLQVLHYEVGQRYEPHHDYFFDEFNVRKGGQR 200

Query: 224 IATMLMYLSENIEGGETYFPKAGSG--------ECSCGGKTAPGLSVKPAKGDAVLFWSM 283
           IAT+LMYLS+  EGGET FP A           E S  GK   GLSV P K DA+LFWSM
Sbjct: 201 IATVLMYLSDVDEGGETVFPAAKGNVSDVPWWDELSQCGK--EGLSVLPKKRDALLFWSM 260

Query: 284 GLDGQSDPNSIHGGCEVLSGEKWSATKW 303
             D   DP+S+HGGC V+ G KWS+TKW
Sbjct: 261 KPDASLDPSSLHGGCPVIKGNKWSSTKW 283

BLAST of PI0020559 vs. TAIR 10
Match: AT2G17720.1 (2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein )

HSP 1 Score: 216.5 bits (550), Expect = 3.1e-56
Identity = 115/208 (55.29%), Postives = 141/208 (67.79%), Query Frame = 0

Query: 104 EVVSWSPRIVVFHNFLSTEECDYLKGIALPRLELSTVVDTKTGKGVKSDFRTSSGMFLSH 163
           EV+SW PR VV+HNFL+ EEC++L  +A P +  STVVD KTG    S  RTSSG FL  
Sbjct: 81  EVISWEPRAVVYHNFLTNEECEHLISLAKPSMVKSTVVDEKTGGSKDSRVRTSSGTFLRR 140

Query: 164 -HEKNYPMVQAIEKRISVYSQIPVENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQR 223
            H++   +V+ IEKRIS ++ IPVENGE +QVL Y+  Q Y+PH+DYF D FN K GGQR
Sbjct: 141 GHDE---VVEVIEKRISDFTFIPVENGEGLQVLHYQVGQKYEPHYDYFLDEFNTKNGGQR 200

Query: 224 IATMLMYLSENIEGGETYFPKAGS--------GECSCGGKTAPGLSVKPAKGDAVLFWSM 283
           IAT+LMYLS+  +GGET FP A           E S  GK   GLSV P K DA+LFW+M
Sbjct: 201 IATVLMYLSDVDDGGETVFPAARGNISAVPWWNELSKCGK--EGLSVLPKKRDALLFWNM 260

Query: 284 GLDGQSDPNSIHGGCEVLSGEKWSATKW 303
             D   DP+S+HGGC V+ G KWS+TKW
Sbjct: 261 RPDASLDPSSLHGGCPVVKGNKWSSTKW 283

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q9ZW861.2e-11869.97Prolyl 4-hydroxylase 1 OS=Arabidopsis thaliana OX=3702 GN=P4H1 PE=1 SV=1[more]
Q9LN202.1e-5755.50Probable prolyl 4-hydroxylase 3 OS=Arabidopsis thaliana OX=3702 GN=P4H3 PE=2 SV=... [more]
F4JZ242.7e-5754.25Probable prolyl 4-hydroxylase 10 OS=Arabidopsis thaliana OX=3702 GN=P4H10 PE=3 S... [more]
F4JNU83.9e-5655.29Probable prolyl 4-hydroxylase 8 OS=Arabidopsis thaliana OX=3702 GN=P4H8 PE=3 SV=... [more]
Q24JN54.4e-5555.29Prolyl 4-hydroxylase 5 OS=Arabidopsis thaliana OX=3702 GN=P4H5 PE=2 SV=1[more]
Match NameE-valueIdentityDescription
A0A1S3BXE69.8e-15991.00prolyl 4-hydroxylase 1 isoform X3 OS=Cucumis melo OX=3656 GN=LOC103494504 PE=4 S... [more]
A0A1S4DZZ92.9e-15585.76prolyl 4-hydroxylase 1 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103494504 PE=4 S... [more]
A0A0A0KU173.3e-15488.42Fe2OG dioxygenase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_4G... [more]
A0A6J1CBS41.8e-15285.85prolyl 4-hydroxylase 1 OS=Momordica charantia OX=3673 GN=LOC111009248 PE=4 SV=1[more]
A0A6J1F2K01.8e-14985.16prolyl 4-hydroxylase 1-like isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC11143... [more]
Match NameE-valueIdentityDescription
XP_008453925.12.0e-15891.00PREDICTED: prolyl 4-hydroxylase 1 isoform X3 [Cucumis melo][more]
XP_038904320.13.6e-15589.71prolyl 4-hydroxylase 1 isoform X1 [Benincasa hispida][more]
XP_016901567.16.1e-15585.76PREDICTED: prolyl 4-hydroxylase 1 isoform X1 [Cucumis melo][more]
XP_004152082.16.7e-15488.42prolyl 4-hydroxylase 1 [Cucumis sativus] >KGN53125.1 hypothetical protein Csa_01... [more]
XP_022137963.13.7e-15285.85prolyl 4-hydroxylase 1 [Momordica charantia][more]
Match NameE-valueIdentityDescription
AT2G43080.18.8e-12069.97P4H isoform 1 [more]
AT1G20270.11.5e-5855.502-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein [more]
AT5G66060.11.9e-5854.252-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein [more]
AT4G35810.12.8e-5755.292-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein [more]
AT2G17720.13.1e-5655.292-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Melon (PI 482460) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR006620Prolyl 4-hydroxylase, alpha subunitSMARTSM00702p4hccoord: 110..304
e-value: 1.3E-60
score: 217.3
NoneNo IPR availableGENE3D2.60.120.620q2cbj1_9rhob like domaincoord: 103..304
e-value: 4.7E-68
score: 231.0
NoneNo IPR availablePANTHERPTHR10869:SF179BNAA04G24820D PROTEINcoord: 42..308
coord: 1..29
IPR044862Prolyl 4-hydroxylase alpha subunit, Fe(2+) 2OG dioxygenase domainPFAMPF136402OG-FeII_Oxy_3coord: 192..304
e-value: 1.1E-20
score: 74.3
IPR045054Prolyl 4-hydroxylasePANTHERPTHR10869PROLYL 4-HYDROXYLASE ALPHA SUBUNITcoord: 1..29
IPR045054Prolyl 4-hydroxylasePANTHERPTHR10869PROLYL 4-HYDROXYLASE ALPHA SUBUNITcoord: 42..308
IPR005123Oxoglutarate/iron-dependent dioxygenasePROSITEPS51471FE2OG_OXYcoord: 188..305
score: 11.104969

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
PI0020559.2PI0020559.2mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0018401 peptidyl-proline hydroxylation to 4-hydroxy-L-proline
cellular_component GO:0005783 endoplasmic reticulum
cellular_component GO:0000137 Golgi cis cisterna
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0005506 iron ion binding
molecular_function GO:0031418 L-ascorbic acid binding
molecular_function GO:0004656 procollagen-proline 4-dioxygenase activity
molecular_function GO:0016705 oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen