Cp4.1LG13g08580 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
Name: Cp4.1LG13g08580
Type: gene
Organism: Cucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
Description: Procollagen-proline 4-dioxygenase
Location: Cp4.1LG13: 7894491 .. 7897982 (-)
RNA-Seq Expression: Cp4.1LG13g08580
Synteny: Cp4.1LG13g08580
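
The Location field above can be read programmatically. The following is a minimal sketch, not part of the original record: the helper name `parse_location` is hypothetical, and it assumes the coordinates are 1-based and inclusive, which this page does not state explicitly.

```python
import re

def parse_location(text):
    """Parse a location string such as 'Cp4.1LG13: 7894491 .. 7897982 (-)'.

    Assumption: coordinates are 1-based and inclusive, so the span
    length is end - start + 1.
    """
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "seqid": seqid,
        "start": start,
        "end": end,
        "strand": strand,
        "length": end - start + 1,
    }

loc = parse_location("Cp4.1LG13: 7894491 .. 7897982 (-)")
print(loc["seqid"], loc["strand"], loc["length"])  # Cp4.1LG13 - 3492
```

Under that assumption the gene spans 3492 bp on the minus strand of Cp4.1LG13.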
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

AGCCCACTGGTTCTCATCTCTCATTGGTTGTCTTCACAGTTGAATTTTCGTAATAAATTCAAATCCAATTAATTTATTTTATTTTTCTTTCCCTTCTCGTACACGGAAGAACCCTCGAGTTGAATCTTTTCTTCTTCCTTTCTCCCATTTGATTCCGGAGAAACGATTATGGATTCCCGACGGTTCCTCGCATTTTCTCTCTTCTTTCTGTCCGTCTCTACTGGCTTCGCTCGCTTGCCGGAAACGCACAAGAAATTGTACGATCCTTTTCGTGTTCTTCTCGCTCTTCATTTTTTATTTCACGAATCTTCTGACGTATGGTTTTGGATTCAATGTTTTCGAATTTCAGAAGTGGATCTGTGCTTGAATTGAAGAGGGATTCGCCACGGCTCATTTTCGATCCAACACGAGTCACTCAGCTCTCCTGGCAACCCAGGTATTCTGTCTTCCTCTCTTATGCGATCGAATGCGATGTTCATTTACTTTCTCAGAAGGAAAGCAAATCCTTCTTGACACTTTCAAACCTCAATTTCCTTCGTTTAGTTTTTAATATTATACTCAATTGGCGATGTGTTTGTTTATATTCTTATGAAAAGTGCAAATCTACAGGGCATTTTTGTATAAGGGATTTTTAACTGATCAGGAATGTGATCATCTAATCGATCTGGTAAGTGACTTATGGAACGCTATGTTTGTTTTAATTTAGATTTTTCAATAATGTGGTAATTCTATGGTTTTGTTTGATTTTCTTATTTTCGGGAATAGGCCAAGGATAAATTAGAGAAGTCAATGGTAGCAGATAATGAGTCTGGTAAGAGTGTAAGTAGTGAAGTTCGAACGAGTTCTGGCATGTTCCTCCGGAAGGCCCAGGTGCGTCAGTTTATTTGAATCATCCTTTTTGTGTTGGACTTTTTAGATTTATAGTATTCTTTGCAAATTACAGCGCCTGGATCCTTATAATGTCTGACTTTTTGTACTTATTTTTATGTTATCTTATGTAAAAGATCGAGGTATCACGGCAGATCCTTGTAGAATTAGGTGAGAAGGATAACTGTAGATGTCAATATATGAAATTACGAATGTATTATCCAGTATCTTGGTTTGGATGAACATTCTAAGAAAAAAAAAAATTCTTAGGAAAACTGTGAACTTGTATAAATAATTTGACTGTGAGTAATATCTGTTTAAGACAGAGATATATGTCTGAATATAAATGACCCAATTTGTTTGCCCAAATGTAGTTTGACCTATGCCCTGTGTTGGGGGTATTGTTTTGAGGAGAATTATCTGCATCCCTGCTCATTAACGCGTTGTCAGCAATATCCTTTGTTTATATATCATGAGGAAGGATCCATCTGCTCTCTCTCCTATCCATACTGTTGGTGTTCGATGTTGAAGTTAATGACATTTTTCTTTCTTTGAAATGGGTGCTTAATCTTGATAATTATTTCCCCCCACTTTTCCTCAAATATGATTTAGTGGGGCGCGATCATGGACAGAACCTGCAAATCTCAATTTAGAAGAGTAGTAGGGTGACAATCTAGTGTTCGAGGACCTGGTTTCTGTTTGTCTTTATAGAACATGGTTTCCTTAGTGAATAACATAAGTAGAAGATATATTGAGGTTTCAAGTCTATACACCTAGTGAAATGACCAGAAGAGCTATACCAAATTGTTATATTCTTAAATATAGAATGGTTGTTTTAATGCCTATGCACCGCTAGTTGGTAAGTTACGACCCATTACTAATTCTACCACGAAGTTTAAAGAAACATGGATGACAAGAAGTAAAGATTAAGTTGATTCTATTAGTGATAATATTGATCAATGATATTGAAAATGATGTTTCCAACAGTTATTGAAGAAAGTTACTATTGATTGATAGACTTGTTTTACATATCAATACTTTCATACTTTAATCATGGATCATGCTTGCAAATCGCCTTGAAGCTTAATTATAATGTATATACTTTATTTTGTGCTTTGAAGGATGAAATTGTT
GCTGGCATTGAGGCCAGGATTTCTGCATGGACATTCCTTCCAGTAGGTATATTTGTTATATGCTCCTTGTCATTGATGTACTTTTTTGTCATTCAGATATAAAGCTATTACAATCACTTACCTTTCTTTCTTGAAACAACGTTGTTAAAATCAGAAAATGGAGAGTCCATTCAAATTCTTCACTACGAAAATGGCCAAAAGTATGAACCACATTTTGATTTTTTTCACGACAAGGTGAATCAGGAGCTAGGTGGCCACCGAATAGCCACAGTCTTGATGTATCTATCCAATGTTGAAAAGGGTGGAGAAACCATCTTTCCAAATTCAGCGGTATGGCAGTGGCTCTGCCACTTCAATATGCCTTTGTTTTTCGAGAAAAAACAGAATTATATTTTCTTTTTGTAGCCAACTGTGACAACGTTGTCTTCTGCAGTTTGAATCTCGAGAAAAGGATGACAGCTGGTCTGATTGTGCTCGAAAGGGTTATGCAGGTAGATATTTGTTATCGACTCATTTTTATGAGTATATATCACGAGTAATTAGTGTATTATGTTACATTTTCAATTTAGAAAAGGCCATGGATGGTCTATCTATCATTGCTAGACCTTTCCTGAAAGTATCTTTCTTGAACATGATCTTCCTTTCTGGTCTTTGAGAAAGGGTACTCGTGGCCGGCTAAGTATCTTATGGGGAGCCAACTTGATTTGACCCTTTGTTATGACTGCATCATTGATTCGTCATTATAGCCTGGTTTTTAAAAAAATTGCTCGTCCGATTAGTTTGAAATGTTAGCCGAAATTGTTGTTTCTTGACTTTTTCGTCGCTTCTATTTTTGTTGTCCTTGTATCTTAGCTTCTGGGATTGATATAGTACGCCACTTATGGTTCTACAACATGCCTGTTCAATTGTAGTTAAAGCGCAGAAGGGTGATGCATTGCTGTTCTTCAGCCTCCATCTCGATGCAACAACAGATAAAAGAAGCTTGCACGGTAGTTGCCCTGTGATCCAGGGCGAGAAATGGTCTGCAACCAAGTGGATTCATGTGAGATCCTTCGATAAGGCAACTCGGATAAGTAGTCAGGACTGTGTGGATGAGAACAAAAATTGCCCTTCATGGGCAAAAAGGGGTGAGTGCCAAAAGAACCCTACTTATATGGTGGGTTCAGAAGGTGCAGTAGGATACTGTAGGAAGAGTTGCAAAGCGTGTTAAACTTAACCTATAGGAATATGTCCACGTCTCTCTCTCTCTCTCTTTCACCCGTTTTGCAGAGCTGAGTGTTGATTCTCTGATGGTTATGTATATAACATCGGGCAGTAACTGGGTATACGATACAAGTGGATATTACATATCTTTGATTAAACCTTGTAGTAGCAATTAGCCAAGTGTTTCATTTGGTAATCCAGACTCTGATGAGAAAATTTTCTCTTGATGCTATTGGAACTTTACAAATGATATATTTTTCGCTTCTTAAATGAAATAACCTTTTAGGAT

mRNA sequence

AGCCCACTGGTTCTCATCTCTCATTGGTTGTCTTCACAGTTGAATTTTCGTAATAAATTCAAATCCAATTAATTTATTTTATTTTTCTTTCCCTTCTCGTACACGGAAGAACCCTCGAGTTGAATCTTTTCTTCTTCCTTTCTCCCATTTGATTCCGGAGAAACGATTATGGATTCCCGACGGTTCCTCGCATTTTCTCTCTTCTTTCTGTCCGTCTCTACTGGCTTCGCTCGCTTGCCGGAAACGCACAAGAAATTAAGTGGATCTGTGCTTGAATTGAAGAGGGATTCGCCACGGCTCATTTTCGATCCAACACGAGTCACTCAGCTCTCCTGGCAACCCAGGGCATTTTTGTATAAGGGATTTTTAACTGATCAGGAATGTGATCATCTAATCGATCTGGCCAAGGATAAATTAGAGAAGTCAATGGTAGCAGATAATGAGTCTGGTAAGAGTGTAAGTAGTGAAGTTCGAACGAGTTCTGGCATGTTCCTCCGGAAGGCCCAGGATGAAATTGTTGCTGGCATTGAGGCCAGGATTTCTGCATGGACATTCCTTCCAGTAGAAAATGGAGAGTCCATTCAAATTCTTCACTACGAAAATGGCCAAAAGTATGAACCACATTTTGATTTTTTTCACGACAAGGTGAATCAGGAGCTAGGTGGCCACCGAATAGCCACAGTCTTGATGTATCTATCCAATGTTGAAAAGGGTGGAGAAACCATCTTTCCAAATTCAGCGTTTGAATCTCGAGAAAAGGATGACAGCTGGTCTGATTGTGCTCGAAAGGGTTATGCAGTTAAAGCGCAGAAGGGTGATGCATTGCTGTTCTTCAGCCTCCATCTCGATGCAACAACAGATAAAAGAAGCTTGCACGGTAGTTGCCCTGTGATCCAGGGCGAGAAATGGTCTGCAACCAAGTGGATTCATGTGAGATCCTTCGATAAGGCAACTCGGATAAGTAGTCAGGACTGTGTGGATGAGAACAAAAATTGCCCTTCATGGGCAAAAAGGGGTGAGTGCCAAAAGAACCCTACTTATATGGTGGGTTCAGAAGGTGCAGTAGGATACTGTAGGAAGAGTTGCAAAGCGTGTTAAACTTAACCTATAGGAATATGTCCACGTCTCTCTCTCTCTCTCTTTCACCCGTTTTGCAGAGCTGAGTGTTGATTCTCTGATGGTTATGTATATAACATCGGGCAGTAACTGGGTATACGATACAAGTGGATATTACATATCTTTGATTAAACCTTGTAGTAGCAATTAGCCAAGTGTTTCATTTGGTAATCCAGACTCTGATGAGAAAATTTTCTCTTGATGCTATTGGAACTTTACAAATGATATATTTTTCGCTTCTTAAATGAAATAACCTTTTAGGAT

Coding sequence (CDS)

ATGGATTCCCGACGGTTCCTCGCATTTTCTCTCTTCTTTCTGTCCGTCTCTACTGGCTTCGCTCGCTTGCCGGAAACGCACAAGAAATTAAGTGGATCTGTGCTTGAATTGAAGAGGGATTCGCCACGGCTCATTTTCGATCCAACACGAGTCACTCAGCTCTCCTGGCAACCCAGGGCATTTTTGTATAAGGGATTTTTAACTGATCAGGAATGTGATCATCTAATCGATCTGGCCAAGGATAAATTAGAGAAGTCAATGGTAGCAGATAATGAGTCTGGTAAGAGTGTAAGTAGTGAAGTTCGAACGAGTTCTGGCATGTTCCTCCGGAAGGCCCAGGATGAAATTGTTGCTGGCATTGAGGCCAGGATTTCTGCATGGACATTCCTTCCAGTAGAAAATGGAGAGTCCATTCAAATTCTTCACTACGAAAATGGCCAAAAGTATGAACCACATTTTGATTTTTTTCACGACAAGGTGAATCAGGAGCTAGGTGGCCACCGAATAGCCACAGTCTTGATGTATCTATCCAATGTTGAAAAGGGTGGAGAAACCATCTTTCCAAATTCAGCGTTTGAATCTCGAGAAAAGGATGACAGCTGGTCTGATTGTGCTCGAAAGGGTTATGCAGTTAAAGCGCAGAAGGGTGATGCATTGCTGTTCTTCAGCCTCCATCTCGATGCAACAACAGATAAAAGAAGCTTGCACGGTAGTTGCCCTGTGATCCAGGGCGAGAAATGGTCTGCAACCAAGTGGATTCATGTGAGATCCTTCGATAAGGCAACTCGGATAAGTAGTCAGGACTGTGTGGATGAGAACAAAAATTGCCCTTCATGGGCAAAAAGGGGTGAGTGCCAAAAGAACCCTACTTATATGGTGGGTTCAGAAGGTGCAGTAGGATACTGTAGGAAGAGTTGCAAAGCGTGTTAA

Protein sequence

MDSRRFLAFSLFFLSVSTGFARLPETHKKLSGSVLELKRDSPRLIFDPTRVTQLSWQPRAFLYKGFLTDQECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQDEIVAGIEARISAWTFLPVENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSAFESREKDDSWSDCARKGYAVKAQKGDALLFFSLHLDATTDKRSLHGSCPVIQGEKWSATKWIHVRSFDKATRISSQDCVDENKNCPSWAKRGECQKNPTYMVGSEGAVGYCRKSCKAC
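
As a quick consistency check between the CDS and the protein sequence, the CDS can be translated codon by codon. The sketch below is illustrative only: the function name `translate` is hypothetical, and it assumes the standard genetic code (NCBI translation table 1), which this page does not state. Only the first 30 and last 12 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code (assumed), laid out in the conventional TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon.

    Raises KeyError if a codon contains characters other than A, C, G, T.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGGATTCCCGACGGTTCCTCGCATTTTCT"  # first 30 nt of the CDS
cds_end = "AAAGCGTGTTAA"                       # last 12 nt of the CDS

print(translate(cds_start))  # MDSRRFLAFS
print(translate(cds_end))    # KAC
```

Both fragments agree with the listed protein, which begins MDSRRFLAFS and ends KAC, with TAA as the stop codon.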
Homology
BLAST of Cp4.1LG13g08580 vs. ExPASy Swiss-Prot
Match: Q8L970 (Probable prolyl 4-hydroxylase 7 OS=Arabidopsis thaliana OX=3702 GN=P4H7 PE=2 SV=1)

HSP 1 Score: 441.4 bits (1134), Expect = 8.3e-123
Identity = 215/315 (68.25%), Positives = 256/315 (81.27%), Query Frame = 0

Query: 1   MDSRRFLAFSLFFL----SVSTGFAR-LPETHKKLSGSVLELKRDSPRLIFDPTRVTQLS 60
           MDSR FLAFSL FL     +S+   R L  +     GSV+++K  +    FDPTRVTQLS
Sbjct: 1   MDSRIFLAFSLCFLFTLPLISSAPNRFLTRSSNTRDGSVIKMKTSASSFGFDPTRVTQLS 60

Query: 61  WQPRAFLYKGFLTDQECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQDE 120
           W PR FLY+GFL+D+ECDH I LAK KLEKSMVADN+SG+SV SEVRTSSGMFL K QD+
Sbjct: 61  WTPRVFLYEGFLSDEECDHFIKLAKGKLEKSMVADNDSGESVESEVRTSSGMFLSKRQDD 120

Query: 121 IVAGIEARISAWTFLPVENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMY 180
           IV+ +EA+++AWTFLP ENGES+QILHYENGQKYEPHFD+FHD+ N ELGGHRIATVLMY
Sbjct: 121 IVSNVEAKLAAWTFLPEENGESMQILHYENGQKYEPHFDYFHDQANLELGGHRIATVLMY 180

Query: 181 LSNVEKGGETIFPNSAFESRE-KDDSWSDCARKGYAVKAQKGDALLFFSLHLDATTDKRS 240
           LSNVEKGGET+FP    ++ + KDDSW++CA++GYAVK +KGDALLFF+LH +ATTD  S
Sbjct: 181 LSNVEKGGETVFPMWKGKATQLKDDSWTECAKQGYAVKPRKGDALLFFNLHPNATTDSNS 240

Query: 241 LHGSCPVIQGEKWSATKWIHVRSFDKATRISSQDCVDENKNCPSWAKRGECQKNPTYMVG 300
           LHGSCPV++GEKWSAT+WIHV+SF++A    S  C+DEN +C  WAK GECQKNPTYMVG
Sbjct: 241 LHGSCPVVEGEKWSATRWIHVKSFERAFNKQS-GCMDENVSCEKWAKAGECQKNPTYMVG 300

Query: 301 SEGAVGYCRKSCKAC 310
           S+   GYCRKSCKAC
Sbjct: 301 SDKDHGYCRKSCKAC 314

BLAST of Cp4.1LG13g08580 vs. ExPASy Swiss-Prot
Match: F4J0A8 (Probable prolyl 4-hydroxylase 6 OS=Arabidopsis thaliana OX=3702 GN=P4H6 PE=2 SV=1)

HSP 1 Score: 408.7 bits (1049), Expect = 5.9e-113
Identity = 205/311 (65.92%), Positives = 245/311 (78.78%), Query Frame = 0

Query: 1   MDSRRFLAFSLFFLSVSTGFARLPETHKKLSGSVLELKRDSPRLIFDPTRVTQLSWQPRA 60
           MDS+ FLAFSL  L +   F+++       S SV            DPTR+TQLSW PRA
Sbjct: 1   MDSQYFLAFSLSLLLI---FSQI----SSFSFSV------------DPTRITQLSWTPRA 60

Query: 61  FLYKGFLTDQECDHLIDLAKDKLEKSM-VADNESGKSVSSEVRTSSGMFLRKAQDEIVAG 120
           FLYKGFL+D+ECDHLI LAK KLEKSM VAD +SG+S  SEVRTSSGMFL K QD+IVA 
Sbjct: 61  FLYKGFLSDEECDHLIKLAKGKLEKSMVVADVDSGESEDSEVRTSSGMFLTKRQDDIVAN 120

Query: 121 IEARISAWTFLPVENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNV 180
           +EA+++AWTFLP ENGE++QILHYENGQKY+PHFD+F+DK   ELGGHRIATVLMYLSNV
Sbjct: 121 VEAKLAAWTFLPEENGEALQILHYENGQKYDPHFDYFYDKKALELGGHRIATVLMYLSNV 180

Query: 181 EKGGETIFPNSAFESRE-KDDSWSDCARKGYAVKAQKGDALLFFSLHLDATTDKRSLHGS 240
            KGGET+FPN   ++ + KDDSWS CA++GYAVK +KGDALLFF+LHL+ TTD  SLHGS
Sbjct: 181 TKGGETVFPNWKGKTPQLKDDSWSKCAKQGYAVKPRKGDALLFFNLHLNGTTDPNSLHGS 240

Query: 241 CPVIQGEKWSATKWIHVRSFDKATRISSQDCVDENKNCPSWAKRGECQKNPTYMVGSEGA 300
           CPVI+GEKWSAT+WIHVRSF K   +    CVD++++C  WA  GEC+KNP YMVGSE +
Sbjct: 241 CPVIEGEKWSATRWIHVRSFGKKKLV----CVDDHESCQEWADAGECEKNPMYMVGSETS 288

Query: 301 VGYCRKSCKAC 310
           +G+CRKSCKAC
Sbjct: 301 LGFCRKSCKAC 288

BLAST of Cp4.1LG13g08580 vs. ExPASy Swiss-Prot
Match: Q8LAN3 (Probable prolyl 4-hydroxylase 4 OS=Arabidopsis thaliana OX=3702 GN=P4H4 PE=2 SV=1)

HSP 1 Score: 352.8 bits (904), Expect = 3.9e-96
Identity = 168/273 (61.54%), Positives = 208/273 (76.19%), Query Frame = 0

Query: 41  SPRLIFDPTRVTQLSWQPRAFLYKGFLTDQECDHLIDLAKDKLEKSMVADNESGKSVSSE 100
           S  +  +P++V Q+S +PRAF+Y+GFLT+ ECDH++ LAK  L++S VADN+SG+S  SE
Sbjct: 27  SSSVFVNPSKVKQVSSKPRAFVYEGFLTELECDHMVSLAKASLKRSAVADNDSGESKFSE 86

Query: 101 VRTSSGMFLRKAQDEIVAGIEARISAWTFLPVENGESIQILHYENGQKYEPHFDFFHDKV 160
           VRTSSG F+ K +D IV+GIE +IS WTFLP ENGE IQ+L YE+GQKY+ HFD+FHDKV
Sbjct: 87  VRTSSGTFISKGKDPIVSGIEDKISTWTFLPKENGEDIQVLRYEHGQKYDAHFDYFHDKV 146

Query: 161 NQELGGHRIATVLMYLSNVEKGGETIFPNSAFESR----EKDDSWSDCARKGYAVKAQKG 220
           N   GGHR+AT+LMYLSNV KGGET+FP++   SR    E  +  SDCA++G AVK +KG
Sbjct: 147 NIVRGGHRMATILMYLSNVTKGGETVFPDAEIPSRRVLSENKEDLSDCAKRGIAVKPRKG 206

Query: 221 DALLFFSLHLDATTDKRSLHGSCPVIQGEKWSATKWIHVRSFDKATRISSQDCVDENKNC 280
           DALLFF+LH DA  D  SLHG CPVI+GEKWSATKWIHV SFD+     S +C D N++C
Sbjct: 207 DALLFFNLHPDAIPDPLSLHGGCPVIEGEKWSATKWIHVDSFDRIV-TPSGNCTDMNESC 266

Query: 281 PSWAKRGECQKNPTYMVGSEGAVGYCRKSCKAC 310
             WA  GEC KNP YMVG+    GYCR+SCKAC
Sbjct: 267 ERWAVLGECTKNPEYMVGTTELPGYCRRSCKAC 298

BLAST of Cp4.1LG13g08580 vs. ExPASy Swiss-Prot
Match: F4JAU3 (Prolyl 4-hydroxylase 2 OS=Arabidopsis thaliana OX=3702 GN=P4H2 PE=1 SV=1)

HSP 1 Score: 349.4 bits (895), Expect = 4.3e-95
Identity = 169/273 (61.90%), Positives = 207/273 (75.82%), Query Frame = 0

Query: 41  SPRLIFDPTRVTQLSWQPRAFLYKGFLTDQECDHLIDLAKDKLEKSMVADNESGKSVSSE 100
           SP  I +P++V Q+S +PRAF+Y+GFLTD ECDHLI LAK+ L++S VADN++G+S  S+
Sbjct: 28  SPSSIINPSKVKQVSSKPRAFVYEGFLTDLECDHLISLAKENLQRSAVADNDNGESQVSD 87

Query: 101 VRTSSGMFLRKAQDEIVAGIEARISAWTFLPVENGESIQILHYENGQKYEPHFDFFHDKV 160
           VRTSSG F+ K +D IV+GIE ++S WTFLP ENGE +Q+L YE+GQKY+ HFD+FHDKV
Sbjct: 88  VRTSSGTFISKGKDPIVSGIEDKLSTWTFLPKENGEDLQVLRYEHGQKYDAHFDYFHDKV 147

Query: 161 NQELGGHRIATVLMYLSNVEKGGETIFPNSAFESR----EKDDSWSDCARKGYAVKAQKG 220
           N   GGHRIATVL+YLSNV KGGET+FP++   SR    E  D  SDCA+KG AVK +KG
Sbjct: 148 NIARGGHRIATVLLYLSNVTKGGETVFPDAQEFSRRSLSENKDDLSDCAKKGIAVKPKKG 207

Query: 221 DALLFFSLHLDATTDKRSLHGSCPVIQGEKWSATKWIHVRSFDKATRISSQDCVDENKNC 280
           +ALLFF+L  DA  D  SLHG CPVI+GEKWSATKWIHV SFDK       +C D N++C
Sbjct: 208 NALLFFNLQQDAIPDPFSLHGGCPVIEGEKWSATKWIHVDSFDKIL-THDGNCTDVNESC 267

Query: 281 PSWAKRGECQKNPTYMVGSEGAVGYCRKSCKAC 310
             WA  GEC KNP YMVG+    G CR+SCKAC
Sbjct: 268 ERWAVLGECGKNPEYMVGTPEIPGNCRRSCKAC 299

BLAST of Cp4.1LG13g08580 vs. ExPASy Swiss-Prot
Match: Q9LN20 (Probable prolyl 4-hydroxylase 3 OS=Arabidopsis thaliana OX=3702 GN=P4H3 PE=2 SV=1)

HSP 1 Score: 250.4 bits (638), Expect = 2.7e-65
Identity = 117/207 (56.52%), Positives = 157/207 (75.85%), Query Frame = 0

Query: 54  LSWQPRAFLYKGFLTDQECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQ 113
           LSW+PRAF+Y  FL+ +EC++LI LAK  + KS V D+E+GKS  S VRTSSG FLR+ +
Sbjct: 79  LSWEPRAFVYHNFLSKEECEYLISLAKPHMVKSTVVDSETGKSKDSRVRTSSGTFLRRGR 138

Query: 114 DEIVAGIEARISAWTFLPVENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVL 173
           D+I+  IE RI+ +TF+P ++GE +Q+LHYE GQKYEPH+D+F D+ N + GG R+AT+L
Sbjct: 139 DKIIKTIEKRIADYTFIPADHGEGLQVLHYEAGQKYEPHYDYFVDEFNTKNGGQRMATML 198

Query: 174 MYLSNVEKGGETIFP--NSAFESREKDDSWSDCARKGYAVKAQKGDALLFFSLHLDATTD 233
           MYLS+VE+GGET+FP  N  F S    +  S+C +KG +VK + GDALLF+S+  DAT D
Sbjct: 199 MYLSDVEEGGETVFPAANMNFSSVPWYNELSECGKKGLSVKPRMGDALLFWSMRPDATLD 258

Query: 234 KRSLHGSCPVIQGEKWSATKWIHVRSF 259
             SLHG CPVI+G KWS+TKW+HV  +
Sbjct: 259 PTSLHGGCPVIRGNKWSSTKWMHVGEY 285

BLAST of Cp4.1LG13g08580 vs. NCBI nr
Match: XP_023549944.1 (probable prolyl 4-hydroxylase 7 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 635 bits (1637), Expect = 8.29e-230
Identity = 309/309 (100.00%), Positives = 309/309 (100.00%), Query Frame = 0

Query: 1   MDSRRFLAFSLFFLSVSTGFARLPETHKKLSGSVLELKRDSPRLIFDPTRVTQLSWQPRA 60
           MDSRRFLAFSLFFLSVSTGFARLPETHKKLSGSVLELKRDSPRLIFDPTRVTQLSWQPRA
Sbjct: 1   MDSRRFLAFSLFFLSVSTGFARLPETHKKLSGSVLELKRDSPRLIFDPTRVTQLSWQPRA 60

Query: 61  FLYKGFLTDQECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQDEIVAGI 120
           FLYKGFLTDQECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQDEIVAGI
Sbjct: 61  FLYKGFLTDQECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQDEIVAGI 120

Query: 121 EARISAWTFLPVENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVE 180
           EARISAWTFLPVENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVE
Sbjct: 121 EARISAWTFLPVENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVE 180

Query: 181 KGGETIFPNSAFESREKDDSWSDCARKGYAVKAQKGDALLFFSLHLDATTDKRSLHGSCP 240
           KGGETIFPNSAFESREKDDSWSDCARKGYAVKAQKGDALLFFSLHLDATTDKRSLHGSCP
Sbjct: 181 KGGETIFPNSAFESREKDDSWSDCARKGYAVKAQKGDALLFFSLHLDATTDKRSLHGSCP 240

Query: 241 VIQGEKWSATKWIHVRSFDKATRISSQDCVDENKNCPSWAKRGECQKNPTYMVGSEGAVG 300
           VIQGEKWSATKWIHVRSFDKATRISSQDCVDENKNCPSWAKRGECQKNPTYMVGSEGAVG
Sbjct: 241 VIQGEKWSATKWIHVRSFDKATRISSQDCVDENKNCPSWAKRGECQKNPTYMVGSEGAVG 300

Query: 301 YCRKSCKAC 309
           YCRKSCKAC
Sbjct: 301 YCRKSCKAC 309

BLAST of Cp4.1LG13g08580 vs. NCBI nr
Match: XP_022938573.1 (probable prolyl 4-hydroxylase 7 [Cucurbita moschata])

HSP 1 Score: 632 bits (1630), Expect = 9.68e-229
Identity = 307/309 (99.35%), Positives = 309/309 (100.00%), Query Frame = 0

Query: 1   MDSRRFLAFSLFFLSVSTGFARLPETHKKLSGSVLELKRDSPRLIFDPTRVTQLSWQPRA 60
           MDSRRFLAFSLFFLSVSTGFARLPETHKKLSGSVLELKRDSPRLIFDPTRVTQLSWQPRA
Sbjct: 1   MDSRRFLAFSLFFLSVSTGFARLPETHKKLSGSVLELKRDSPRLIFDPTRVTQLSWQPRA 60

Query: 61  FLYKGFLTDQECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQDEIVAGI 120
           FLYKGFLTDQECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQDEIVAGI
Sbjct: 61  FLYKGFLTDQECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQDEIVAGI 120

Query: 121 EARISAWTFLPVENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVE 180
           EARISAWTFLPVENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVE
Sbjct: 121 EARISAWTFLPVENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVE 180

Query: 181 KGGETIFPNSAFESREKDDSWSDCARKGYAVKAQKGDALLFFSLHLDATTDKRSLHGSCP 240
           KGGETIFPNSAFES+EKDDSWSDCARKGYAVKAQKGDALLFFSLHLDATTDKRSLHGSCP
Sbjct: 181 KGGETIFPNSAFESQEKDDSWSDCARKGYAVKAQKGDALLFFSLHLDATTDKRSLHGSCP 240

Query: 241 VIQGEKWSATKWIHVRSFDKATRISSQDCVDENKNCPSWAKRGECQKNPTYMVGSEGAVG 300
           VI+GEKWSATKWIHVRSFDKATRISSQDCVDENKNCPSWAKRGECQKNPTYMVGSEGAVG
Sbjct: 241 VIEGEKWSATKWIHVRSFDKATRISSQDCVDENKNCPSWAKRGECQKNPTYMVGSEGAVG 300

Query: 301 YCRKSCKAC 309
           YCRKSCKAC
Sbjct: 301 YCRKSCKAC 309

BLAST of Cp4.1LG13g08580 vs. NCBI nr
Match: KAG6578605.1 (putative prolyl 4-hydroxylase 7, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 630 bits (1626), Expect = 3.94e-228
Identity = 306/309 (99.03%), Positives = 309/309 (100.00%), Query Frame = 0

Query: 1   MDSRRFLAFSLFFLSVSTGFARLPETHKKLSGSVLELKRDSPRLIFDPTRVTQLSWQPRA 60
           MDSRRFLAFSLFFLSVSTGFARLPETHKKLSGSVLELKRDSPRLIFDPTRVTQLSWQPRA
Sbjct: 1   MDSRRFLAFSLFFLSVSTGFARLPETHKKLSGSVLELKRDSPRLIFDPTRVTQLSWQPRA 60

Query: 61  FLYKGFLTDQECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQDEIVAGI 120
           FLYKGFLTDQECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQDEIVAGI
Sbjct: 61  FLYKGFLTDQECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQDEIVAGI 120

Query: 121 EARISAWTFLPVENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVE 180
           EARISAWTFLPVENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVE
Sbjct: 121 EARISAWTFLPVENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVE 180

Query: 181 KGGETIFPNSAFESREKDDSWSDCARKGYAVKAQKGDALLFFSLHLDATTDKRSLHGSCP 240
           KGGETIFPNSAFES+EKDDSWSDCARKGYAVKAQKGDALLFFSLHLDATTDKRSLHGSCP
Sbjct: 181 KGGETIFPNSAFESQEKDDSWSDCARKGYAVKAQKGDALLFFSLHLDATTDKRSLHGSCP 240

Query: 241 VIQGEKWSATKWIHVRSFDKATRISSQDCVDENKNCPSWAKRGECQKNPTYMVGSEGAVG 300
           VI+GEKWSATKWIHVRSFDKATRISSQDCVDENKNCPSWAKRGECQKNPTYMVGS+GAVG
Sbjct: 241 VIEGEKWSATKWIHVRSFDKATRISSQDCVDENKNCPSWAKRGECQKNPTYMVGSKGAVG 300

Query: 301 YCRKSCKAC 309
           YCRKSCKAC
Sbjct: 301 YCRKSCKAC 309

BLAST of Cp4.1LG13g08580 vs. NCBI nr
Match: XP_022993651.1 (probable prolyl 4-hydroxylase 7 [Cucurbita maxima])

HSP 1 Score: 629 bits (1621), Expect = 2.28e-227
Identity = 305/309 (98.71%), Positives = 307/309 (99.35%), Query Frame = 0

Query: 1   MDSRRFLAFSLFFLSVSTGFARLPETHKKLSGSVLELKRDSPRLIFDPTRVTQLSWQPRA 60
           MDSRRFL FSLFFLSVSTGFARLPETHKKLSGSVLELKRDSPRLIFDPTRVTQLSWQPRA
Sbjct: 1   MDSRRFLGFSLFFLSVSTGFARLPETHKKLSGSVLELKRDSPRLIFDPTRVTQLSWQPRA 60

Query: 61  FLYKGFLTDQECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQDEIVAGI 120
           FLYKGFLTDQECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQDEIVAGI
Sbjct: 61  FLYKGFLTDQECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQDEIVAGI 120

Query: 121 EARISAWTFLPVENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVE 180
           EARISAWTFLPVENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVE
Sbjct: 121 EARISAWTFLPVENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVE 180

Query: 181 KGGETIFPNSAFESREKDDSWSDCARKGYAVKAQKGDALLFFSLHLDATTDKRSLHGSCP 240
           KGGETIFPNSAFES+EKDDSWSDCARKGYAVKAQKGDALLFFSLHLDATTDKRSLHGSCP
Sbjct: 181 KGGETIFPNSAFESQEKDDSWSDCARKGYAVKAQKGDALLFFSLHLDATTDKRSLHGSCP 240

Query: 241 VIQGEKWSATKWIHVRSFDKATRISSQDCVDENKNCPSWAKRGECQKNPTYMVGSEGAVG 300
           VI+GEKWSATKWIHVRSFDKATR SSQDCVDENKNCPSWAKRGECQKNPTYMVGSEGAVG
Sbjct: 241 VIEGEKWSATKWIHVRSFDKATRTSSQDCVDENKNCPSWAKRGECQKNPTYMVGSEGAVG 300

Query: 301 YCRKSCKAC 309
           YCRKSCKAC
Sbjct: 301 YCRKSCKAC 309

BLAST of Cp4.1LG13g08580 vs. NCBI nr
Match: KAG7016155.1 (putative prolyl 4-hydroxylase 7, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 575 bits (1483), Expect = 1.23e-206
Identity = 277/279 (99.28%), Positives = 279/279 (100.00%), Query Frame = 0

Query: 31  SGSVLELKRDSPRLIFDPTRVTQLSWQPRAFLYKGFLTDQECDHLIDLAKDKLEKSMVAD 90
           SGSVLELKRDSPRLIFDPTRVTQLSWQPRAFLYKGFLTDQECDHLIDLAKDKLEKSMVAD
Sbjct: 12  SGSVLELKRDSPRLIFDPTRVTQLSWQPRAFLYKGFLTDQECDHLIDLAKDKLEKSMVAD 71

Query: 91  NESGKSVSSEVRTSSGMFLRKAQDEIVAGIEARISAWTFLPVENGESIQILHYENGQKYE 150
           NESGKSVSSEVRTSSGMFLRKAQDEIVAGIEARISAWTFLPVENGESIQILHYENGQKYE
Sbjct: 72  NESGKSVSSEVRTSSGMFLRKAQDEIVAGIEARISAWTFLPVENGESIQILHYENGQKYE 131

Query: 151 PHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSAFESREKDDSWSDCARKGYA 210
           PHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSAFES+EKDDSWSDCARKGYA
Sbjct: 132 PHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSAFESQEKDDSWSDCARKGYA 191

Query: 211 VKAQKGDALLFFSLHLDATTDKRSLHGSCPVIQGEKWSATKWIHVRSFDKATRISSQDCV 270
           VKAQKGDALLFFSLHLDATTDKRSLHGSCPVI+GEKWSATKWIHVRSFDKATRISSQDCV
Sbjct: 192 VKAQKGDALLFFSLHLDATTDKRSLHGSCPVIEGEKWSATKWIHVRSFDKATRISSQDCV 251

Query: 271 DENKNCPSWAKRGECQKNPTYMVGSEGAVGYCRKSCKAC 309
           DENKNCPSWAKRGECQKNPTYMVGSEGAVGYCRKSCKAC
Sbjct: 252 DENKNCPSWAKRGECQKNPTYMVGSEGAVGYCRKSCKAC 290

BLAST of Cp4.1LG13g08580 vs. ExPASy TrEMBL
Match: A0A6J1FJ93 (Procollagen-proline 4-dioxygenase OS=Cucurbita moschata OX=3662 GN=LOC111444767 PE=3 SV=1)

HSP 1 Score: 632 bits (1630), Expect = 4.68e-229
Identity = 307/309 (99.35%), Positives = 309/309 (100.00%), Query Frame = 0

Query: 1   MDSRRFLAFSLFFLSVSTGFARLPETHKKLSGSVLELKRDSPRLIFDPTRVTQLSWQPRA 60
           MDSRRFLAFSLFFLSVSTGFARLPETHKKLSGSVLELKRDSPRLIFDPTRVTQLSWQPRA
Sbjct: 1   MDSRRFLAFSLFFLSVSTGFARLPETHKKLSGSVLELKRDSPRLIFDPTRVTQLSWQPRA 60

Query: 61  FLYKGFLTDQECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQDEIVAGI 120
           FLYKGFLTDQECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQDEIVAGI
Sbjct: 61  FLYKGFLTDQECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQDEIVAGI 120

Query: 121 EARISAWTFLPVENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVE 180
           EARISAWTFLPVENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVE
Sbjct: 121 EARISAWTFLPVENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVE 180

Query: 181 KGGETIFPNSAFESREKDDSWSDCARKGYAVKAQKGDALLFFSLHLDATTDKRSLHGSCP 240
           KGGETIFPNSAFES+EKDDSWSDCARKGYAVKAQKGDALLFFSLHLDATTDKRSLHGSCP
Sbjct: 181 KGGETIFPNSAFESQEKDDSWSDCARKGYAVKAQKGDALLFFSLHLDATTDKRSLHGSCP 240

Query: 241 VIQGEKWSATKWIHVRSFDKATRISSQDCVDENKNCPSWAKRGECQKNPTYMVGSEGAVG 300
           VI+GEKWSATKWIHVRSFDKATRISSQDCVDENKNCPSWAKRGECQKNPTYMVGSEGAVG
Sbjct: 241 VIEGEKWSATKWIHVRSFDKATRISSQDCVDENKNCPSWAKRGECQKNPTYMVGSEGAVG 300

Query: 301 YCRKSCKAC 309
           YCRKSCKAC
Sbjct: 301 YCRKSCKAC 309

BLAST of Cp4.1LG13g08580 vs. ExPASy TrEMBL
Match: A0A6J1JWX0 (Procollagen-proline 4-dioxygenase OS=Cucurbita maxima OX=3661 GN=LOC111489579 PE=3 SV=1)

HSP 1 Score: 629 bits (1621), Expect = 1.10e-227
Identity = 305/309 (98.71%), Positives = 307/309 (99.35%), Query Frame = 0

Query: 1   MDSRRFLAFSLFFLSVSTGFARLPETHKKLSGSVLELKRDSPRLIFDPTRVTQLSWQPRA 60
           MDSRRFL FSLFFLSVSTGFARLPETHKKLSGSVLELKRDSPRLIFDPTRVTQLSWQPRA
Sbjct: 1   MDSRRFLGFSLFFLSVSTGFARLPETHKKLSGSVLELKRDSPRLIFDPTRVTQLSWQPRA 60

Query: 61  FLYKGFLTDQECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQDEIVAGI 120
           FLYKGFLTDQECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQDEIVAGI
Sbjct: 61  FLYKGFLTDQECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQDEIVAGI 120

Query: 121 EARISAWTFLPVENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVE 180
           EARISAWTFLPVENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVE
Sbjct: 121 EARISAWTFLPVENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVE 180

Query: 181 KGGETIFPNSAFESREKDDSWSDCARKGYAVKAQKGDALLFFSLHLDATTDKRSLHGSCP 240
           KGGETIFPNSAFES+EKDDSWSDCARKGYAVKAQKGDALLFFSLHLDATTDKRSLHGSCP
Sbjct: 181 KGGETIFPNSAFESQEKDDSWSDCARKGYAVKAQKGDALLFFSLHLDATTDKRSLHGSCP 240

Query: 241 VIQGEKWSATKWIHVRSFDKATRISSQDCVDENKNCPSWAKRGECQKNPTYMVGSEGAVG 300
           VI+GEKWSATKWIHVRSFDKATR SSQDCVDENKNCPSWAKRGECQKNPTYMVGSEGAVG
Sbjct: 241 VIEGEKWSATKWIHVRSFDKATRTSSQDCVDENKNCPSWAKRGECQKNPTYMVGSEGAVG 300

Query: 301 YCRKSCKAC 309
           YCRKSCKAC
Sbjct: 301 YCRKSCKAC 309

BLAST of Cp4.1LG13g08580 vs. ExPASy TrEMBL
Match: A0A1S3C8G4 (Procollagen-proline 4-dioxygenase OS=Cucumis melo OX=3656 GN=LOC103498028 PE=3 SV=1)

HSP 1 Score: 565 bits (1457), Expect = 1.43e-202
Identity = 276/316 (87.34%), Positives = 294/316 (93.04%), Query Frame = 0

Query: 1   MDSRRFLAFSLFFLSVSTGFARLPET------HKKLSGSVLELKRDSPRLIFDPTRVTQL 60
           MDSR FLAFSL FLSV T FARLPET      +K+ +GSVL LK DS  LIFDPTRVTQL
Sbjct: 1   MDSRPFLAFSLCFLSVFTAFARLPETRMLKHSYKQSTGSVLRLKTDSSPLIFDPTRVTQL 60

Query: 61  SWQPRAFLYKGFLTDQECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQD 120
           SWQPRAFLYKGFL+D+ECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQD
Sbjct: 61  SWQPRAFLYKGFLSDEECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQD 120

Query: 121 EIVAGIEARISAWTFLPVENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLM 180
           +IVAG+EARI+AWT LP ENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLM
Sbjct: 121 KIVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLM 180

Query: 181 YLSNVEKGGETIFPNSAF-ESREKDDSWSDCARKGYAVKAQKGDALLFFSLHLDATTDKR 240
           YLSNVEKGGETIFPNS F ES+EKDDSWSDC+RKGYAVKAQKGDALLFFSLHLDATTD+R
Sbjct: 181 YLSNVEKGGETIFPNSEFKESQEKDDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDER 240

Query: 241 SLHGSCPVIQGEKWSATKWIHVRSFDKATRISSQDCVDENKNCPSWAKRGECQKNPTYMV 300
           SLHGSCPVI+GEKWSATKWIHVRSF+K  R+S QDCVDEN+NCP+WAKRGEC+KNPTYMV
Sbjct: 241 SLHGSCPVIEGEKWSATKWIHVRSFEKLPRVSRQDCVDENENCPAWAKRGECKKNPTYMV 300

Query: 301 GSEGAVGYCRKSCKAC 309
           GSEGA+GYCRKSCKAC
Sbjct: 301 GSEGALGYCRKSCKAC 316

BLAST of Cp4.1LG13g08580 vs. ExPASy TrEMBL
Match: A0A0A0KS38 (Procollagen-proline 4-dioxygenase OS=Cucumis sativus OX=3659 GN=Csa_5G633280 PE=3 SV=1)

HSP 1 Score: 549 bits (1414), Expect = 4.55e-196
Identity = 271/313 (86.58%), Positives = 289/313 (92.33%), Query Frame = 0

Query: 1   MDSRRFLAFSLFFLSVSTGFARLPET--HKKLSGSVLELKRDSPRLIFDPTRVTQLSWQP 60
           MDSR FLAFSL FLSV T FARLPET  HK+ SGSVL LK DS  LIFDPTRVTQLSWQP
Sbjct: 1   MDSRPFLAFSLCFLSVFTAFARLPETRTHKQSSGSVLRLKTDSSPLIFDPTRVTQLSWQP 60

Query: 61  RAFLYKGFLTDQECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQDEIVA 120
           RAFLYKGFL+D ECDHLIDLAKDKLEKSMVADN+SGKSVSSEVRTSSGMFLRKAQDE+VA
Sbjct: 61  RAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSGKSVSSEVRTSSGMFLRKAQDEVVA 120

Query: 121 GIEARISAWTFLPVENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSN 180
           G+EARI+AWT LP ENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSN
Sbjct: 121 GVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSN 180

Query: 181 VEKGGETIFPNSAF-ESREKDDSWSDCARKGYAVKAQKGDALLFFSLHLDATTDKRSLHG 240
           VEKGGETIFPNS F ES+ KD+SWSDC+RKGYAVKAQKGDALLFFSL+LDATTD+RSLHG
Sbjct: 181 VEKGGETIFPNSEFKESQAKDESWSDCSRKGYAVKAQKGDALLFFSLNLDATTDERSLHG 240

Query: 241 SCPVIQGEKWSATKWIHVRSFDKAT-RISSQDCVDENKNCPSWAKRGECQKNPTYMVGSE 300
           SCPVI GEKWSATKWIHVRSF+K T R+S Q CVDEN+NC +WAK+GEC+KNPTYMVGS 
Sbjct: 241 SCPVIAGEKWSATKWIHVRSFEKITSRVSRQGCVDENENCLAWAKKGECKKNPTYMVGSG 300

Query: 301 GAVGYCRKSCKAC 309
           GA+GYCRKSCKAC
Sbjct: 301 GALGYCRKSCKAC 313

BLAST of Cp4.1LG13g08580 vs. ExPASy TrEMBL
Match: A0A6J1BXN9 (Procollagen-proline 4-dioxygenase OS=Momordica charantia OX=3673 GN=LOC111006412 PE=3 SV=1)

HSP 1 Score: 542 bits (1396), Expect = 2.52e-193
Identity = 266/313 (84.98%), Positives = 286/313 (91.37%), Query Frame = 0

Query: 1   MDSRRFLAFSLFFLSVSTGFARLPE--THKKLSGSVLELKRDSPRLIFDPTRVTQLSWQP 60
           MDS RFL+FSL FL V T  ARLP+   HKK+SGSVL LK +   LIFDPTRVTQLSWQP
Sbjct: 1   MDSPRFLSFSLCFLFVFTALARLPDMRAHKKISGSVLRLKGEPSPLIFDPTRVTQLSWQP 60

Query: 61  RAFLYKGFLTDQECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQDEIVA 120
           RAFLYKGFL+D+ECDHLIDLAKDKLEKSMVADN SGKSVSSEVRTSSGMFL KAQDEIVA
Sbjct: 61  RAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNNSGKSVSSEVRTSSGMFLHKAQDEIVA 120

Query: 121 GIEARISAWTFLPVENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSN 180
            +EARI+AWTFLP ENGESIQILHYENGQKYEPHFD+FHDKVNQELGGHR+ATVLMYLSN
Sbjct: 121 AVEARIAAWTFLPAENGESIQILHYENGQKYEPHFDYFHDKVNQELGGHRVATVLMYLSN 180

Query: 181 VEKGGETIFPNSAF-ESREKDDSWSDCARKGYAVKAQKGDALLFFSLHLDATTDKRSLHG 240
           VEKGGETIFPNS F ES+EKDDSWSDCARKGYAVKA+KGDALLFFSLHLDATTD +SLHG
Sbjct: 181 VEKGGETIFPNSEFKESQEKDDSWSDCARKGYAVKAKKGDALLFFSLHLDATTDVKSLHG 240

Query: 241 SCPVIQGEKWSATKWIHVRSFDKATRISSQ-DCVDENKNCPSWAKRGECQKNPTYMVGSE 300
           SCPVI+GEKWSATKWIHVRSF+K TR S + DCVDEN+NC SWAKRGEC+KNPTYMVGSE
Sbjct: 241 SCPVIEGEKWSATKWIHVRSFEKPTRPSRRLDCVDENENCASWAKRGECKKNPTYMVGSE 300

Query: 301 GAVGYCRKSCKAC 309
            A+GYCRKSC+AC
Sbjct: 301 SALGYCRKSCQAC 313

BLAST of Cp4.1LG13g08580 vs. TAIR 10
Match: AT3G28480.1 (Oxoglutarate/iron-dependent oxygenase)

HSP 1 Score: 441.4 bits (1134), Expect = 5.9e-124
Identity = 215/315 (68.25%), Positives = 256/315 (81.27%), Query Frame = 0

Query: 1   MDSRRFLAFSLFFL----SVSTGFAR-LPETHKKLSGSVLELKRDSPRLIFDPTRVTQLS 60
           MDSR FLAFSL FL     +S+   R L  +     GSV+++K  +    FDPTRVTQLS
Sbjct: 1   MDSRIFLAFSLCFLFTLPLISSAPNRFLTRSSNTRDGSVIKMKTSASSFGFDPTRVTQLS 60

Query: 61  WQPRAFLYKGFLTDQECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQDE 120
           W PR FLY+GFL+D+ECDH I LAK KLEKSMVADN+SG+SV SEVRTSSGMFL K QD+
Sbjct: 61  WTPRVFLYEGFLSDEECDHFIKLAKGKLEKSMVADNDSGESVESEVRTSSGMFLSKRQDD 120

Query: 121 IVAGIEARISAWTFLPVENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMY 180
           IV+ +EA+++AWTFLP ENGES+QILHYENGQKYEPHFD+FHD+ N ELGGHRIATVLMY
Sbjct: 121 IVSNVEAKLAAWTFLPEENGESMQILHYENGQKYEPHFDYFHDQANLELGGHRIATVLMY 180

Query: 181 LSNVEKGGETIFPNSAFESRE-KDDSWSDCARKGYAVKAQKGDALLFFSLHLDATTDKRS 240
           LSNVEKGGET+FP    ++ + KDDSW++CA++GYAVK +KGDALLFF+LH +ATTD  S
Sbjct: 181 LSNVEKGGETVFPMWKGKATQLKDDSWTECAKQGYAVKPRKGDALLFFNLHPNATTDSNS 240

Query: 241 LHGSCPVIQGEKWSATKWIHVRSFDKATRISSQDCVDENKNCPSWAKRGECQKNPTYMVG 300
           LHGSCPV++GEKWSAT+WIHV+SF++A    S  C+DEN +C  WAK GECQKNPTYMVG
Sbjct: 241 LHGSCPVVEGEKWSATRWIHVKSFERAFNKQS-GCMDENVSCEKWAKAGECQKNPTYMVG 300

Query: 301 SEGAVGYCRKSCKAC 310
           S+   GYCRKSCKAC
Sbjct: 301 SDKDHGYCRKSCKAC 314

BLAST of Cp4.1LG13g08580 vs. TAIR 10
Match: AT3G28480.2 (Oxoglutarate/iron-dependent oxygenase)

HSP 1 Score: 414.8 bits (1065), Expect = 5.9e-116
Identity = 208/323 (64.40%), Positives = 249/323 (77.09%), Query Frame = 0

Query: 1   MDSRRFLAFSLFFL----SVSTGFAR-LPETHKKLSGSVLELKRDSPRLIFDPTRVTQLS 60
           MDSR FLAFSL FL     +S+   R L  +     GSV+++K  +    FDPTRVTQLS
Sbjct: 1   MDSRIFLAFSLCFLFTLPLISSAPNRFLTRSSNTRDGSVIKMKTSASSFGFDPTRVTQLS 60

Query: 61  WQPRAFLYKGFLTDQECDHLIDLAKDKLEKSMVADNESGKSVSSE-----VRTSSGMFLR 120
           W PR FLY+GFL+D+ECDH I LAK KLEKSMVADN+SG+SV SE     VR SS     
Sbjct: 61  WTPRVFLYEGFLSDEECDHFIKLAKGKLEKSMVADNDSGESVESEDSVSVVRQSSSFIAN 120

Query: 121 KAQ---DEIVAGIEARISAWTFLPVENGESIQILHYENGQKYEPHFDFFHDKVNQELGGH 180
                 D+IV+ +EA+++AWTFLP ENGES+QILHYENGQKYEPHFD+FHD+ N ELGGH
Sbjct: 121 MDSLEIDDIVSNVEAKLAAWTFLPEENGESMQILHYENGQKYEPHFDYFHDQANLELGGH 180

Query: 181 RIATVLMYLSNVEKGGETIFPNSAFESRE-KDDSWSDCARKGYAVKAQKGDALLFFSLHL 240
           RIATVLMYLSNVEKGGET+FP    ++ + KDDSW++CA++GYAVK +KGDALLFF+LH 
Sbjct: 181 RIATVLMYLSNVEKGGETVFPMWKGKATQLKDDSWTECAKQGYAVKPRKGDALLFFNLHP 240

Query: 241 DATTDKRSLHGSCPVIQGEKWSATKWIHVRSFDKATRISSQDCVDENKNCPSWAKRGECQ 300
           +ATTD  SLHGSCPV++GEKWSAT+WIHV+SF++A    S  C+DEN +C  WAK GECQ
Sbjct: 241 NATTDSNSLHGSCPVVEGEKWSATRWIHVKSFERAFNKQS-GCMDENVSCEKWAKAGECQ 300

Query: 301 KNPTYMVGSEGAVGYCRKSCKAC 310
           KNPTYMVGS+   GYCRKSCKAC
Sbjct: 301 KNPTYMVGSDKDHGYCRKSCKAC 322

BLAST of Cp4.1LG13g08580 vs. TAIR 10
Match: AT3G28490.1 (Oxoglutarate/iron-dependent oxygenase)

HSP 1 Score: 408.7 bits (1049), Expect = 4.2e-114
Identity = 205/311 (65.92%), Positives = 245/311 (78.78%), Query Frame = 0

Query: 1   MDSRRFLAFSLFFLSVSTGFARLPETHKKLSGSVLELKRDSPRLIFDPTRVTQLSWQPRA 60
           MDS+ FLAFSL  L +   F+++       S SV            DPTR+TQLSW PRA
Sbjct: 1   MDSQYFLAFSLSLLLI---FSQI----SSFSFSV------------DPTRITQLSWTPRA 60

Query: 61  FLYKGFLTDQECDHLIDLAKDKLEKSM-VADNESGKSVSSEVRTSSGMFLRKAQDEIVAG 120
           FLYKGFL+D+ECDHLI LAK KLEKSM VAD +SG+S  SEVRTSSGMFL K QD+IVA 
Sbjct: 61  FLYKGFLSDEECDHLIKLAKGKLEKSMVVADVDSGESEDSEVRTSSGMFLTKRQDDIVAN 120

Query: 121 IEARISAWTFLPVENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNV 180
           +EA+++AWTFLP ENGE++QILHYENGQKY+PHFD+F+DK   ELGGHRIATVLMYLSNV
Sbjct: 121 VEAKLAAWTFLPEENGEALQILHYENGQKYDPHFDYFYDKKALELGGHRIATVLMYLSNV 180

Query: 181 EKGGETIFPNSAFESRE-KDDSWSDCARKGYAVKAQKGDALLFFSLHLDATTDKRSLHGS 240
            KGGET+FPN   ++ + KDDSWS CA++GYAVK +KGDALLFF+LHL+ TTD  SLHGS
Sbjct: 181 TKGGETVFPNWKGKTPQLKDDSWSKCAKQGYAVKPRKGDALLFFNLHLNGTTDPNSLHGS 240

Query: 241 CPVIQGEKWSATKWIHVRSFDKATRISSQDCVDENKNCPSWAKRGECQKNPTYMVGSEGA 300
           CPVI+GEKWSAT+WIHVRSF K   +    CVD++++C  WA  GEC+KNP YMVGSE +
Sbjct: 241 CPVIEGEKWSATRWIHVRSFGKKKLV----CVDDHESCQEWADAGECEKNPMYMVGSETS 288

Query: 301 VGYCRKSCKAC 310
           +G+CRKSCKAC
Sbjct: 301 LGFCRKSCKAC 288

BLAST of Cp4.1LG13g08580 vs. TAIR 10
Match: AT5G18900.1 (2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein)

HSP 1 Score: 352.8 bits (904), Expect = 2.7e-97
Identity = 168/273 (61.54%), Positives = 208/273 (76.19%), Query Frame = 0

Query: 41  SPRLIFDPTRVTQLSWQPRAFLYKGFLTDQECDHLIDLAKDKLEKSMVADNESGKSVSSE 100
           S  +  +P++V Q+S +PRAF+Y+GFLT+ ECDH++ LAK  L++S VADN+SG+S  SE
Sbjct: 27  SSSVFVNPSKVKQVSSKPRAFVYEGFLTELECDHMVSLAKASLKRSAVADNDSGESKFSE 86

Query: 101 VRTSSGMFLRKAQDEIVAGIEARISAWTFLPVENGESIQILHYENGQKYEPHFDFFHDKV 160
           VRTSSG F+ K +D IV+GIE +IS WTFLP ENGE IQ+L YE+GQKY+ HFD+FHDKV
Sbjct: 87  VRTSSGTFISKGKDPIVSGIEDKISTWTFLPKENGEDIQVLRYEHGQKYDAHFDYFHDKV 146

Query: 161 NQELGGHRIATVLMYLSNVEKGGETIFPNSAFESR----EKDDSWSDCARKGYAVKAQKG 220
           N   GGHR+AT+LMYLSNV KGGET+FP++   SR    E  +  SDCA++G AVK +KG
Sbjct: 147 NIVRGGHRMATILMYLSNVTKGGETVFPDAEIPSRRVLSENKEDLSDCAKRGIAVKPRKG 206

Query: 221 DALLFFSLHLDATTDKRSLHGSCPVIQGEKWSATKWIHVRSFDKATRISSQDCVDENKNC 280
           DALLFF+LH DA  D  SLHG CPVI+GEKWSATKWIHV SFD+     S +C D N++C
Sbjct: 207 DALLFFNLHPDAIPDPLSLHGGCPVIEGEKWSATKWIHVDSFDRIV-TPSGNCTDMNESC 266

Query: 281 PSWAKRGECQKNPTYMVGSEGAVGYCRKSCKAC 310
             WA  GEC KNP YMVG+    GYCR+SCKAC
Sbjct: 267 ERWAVLGECTKNPEYMVGTTELPGYCRRSCKAC 298

BLAST of Cp4.1LG13g08580 vs. TAIR 10
Match: AT3G06300.1 (P4H isoform 2)

HSP 1 Score: 349.4 bits (895), Expect = 3.0e-96
Identity = 169/273 (61.90%), Positives = 207/273 (75.82%), Query Frame = 0

Query: 41  SPRLIFDPTRVTQLSWQPRAFLYKGFLTDQECDHLIDLAKDKLEKSMVADNESGKSVSSE 100
           SP  I +P++V Q+S +PRAF+Y+GFLTD ECDHLI LAK+ L++S VADN++G+S  S+
Sbjct: 28  SPSSIINPSKVKQVSSKPRAFVYEGFLTDLECDHLISLAKENLQRSAVADNDNGESQVSD 87

Query: 101 VRTSSGMFLRKAQDEIVAGIEARISAWTFLPVENGESIQILHYENGQKYEPHFDFFHDKV 160
           VRTSSG F+ K +D IV+GIE ++S WTFLP ENGE +Q+L YE+GQKY+ HFD+FHDKV
Sbjct: 88  VRTSSGTFISKGKDPIVSGIEDKLSTWTFLPKENGEDLQVLRYEHGQKYDAHFDYFHDKV 147

Query: 161 NQELGGHRIATVLMYLSNVEKGGETIFPNSAFESR----EKDDSWSDCARKGYAVKAQKG 220
           N   GGHRIATVL+YLSNV KGGET+FP++   SR    E  D  SDCA+KG AVK +KG
Sbjct: 148 NIARGGHRIATVLLYLSNVTKGGETVFPDAQEFSRRSLSENKDDLSDCAKKGIAVKPKKG 207

Query: 221 DALLFFSLHLDATTDKRSLHGSCPVIQGEKWSATKWIHVRSFDKATRISSQDCVDENKNC 280
           +ALLFF+L  DA  D  SLHG CPVI+GEKWSATKWIHV SFDK       +C D N++C
Sbjct: 208 NALLFFNLQQDAIPDPFSLHGGCPVIEGEKWSATKWIHVDSFDKIL-THDGNCTDVNESC 267

Query: 281 PSWAKRGECQKNPTYMVGSEGAVGYCRKSCKAC 310
             WA  GEC KNP YMVG+    G CR+SCKAC
Sbjct: 268 ERWAVLGECGKNPEYMVGTPEIPGNCRRSCKAC 299
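
The percentages on each HSP statistics line follow directly from the raw counts (for example, 205/311 gives 65.92%). A minimal Python sketch of that check is given here; the helper name `check_hsp_stats` and the regular expression are illustrative assumptions, not part of the database's own tooling, and the pattern accepts both the "Postives" spelling that some BLAST reports emit and "Positives".

```python
import re

# Hypothetical helper: parse one HSP statistics line and recompute the
# identity and positives percentages from the raw residue counts.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)

def check_hsp_stats(line):
    """Return recomputed percentages and whether they match the reported ones."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP statistics line")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    identity = round(100 * int(ident) / int(ident_len), 2)
    positives = round(100 * int(pos) / int(pos_len), 2)
    consistent = (
        abs(identity - float(ident_pct)) < 0.005
        and abs(positives - float(pos_pct)) < 0.005
    )
    return {"identity": identity, "positives": positives, "consistent": consistent}

if __name__ == "__main__":
    line = "Identity = 205/311 (65.92%), Postives = 245/311 (78.78%), Query Frame = 0"
    print(check_hsp_stats(line))
```

Applied to the three TAIR 10 HSPs above, the recomputed values agree with the reported ones to two decimal places.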

The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
Q8L970 | 8.3e-123 | 68.25 | Probable prolyl 4-hydroxylase 7 OS=Arabidopsis thaliana OX=3702 GN=P4H7 PE=2 SV=...
F4J0A8 | 5.9e-113 | 65.92 | Probable prolyl 4-hydroxylase 6 OS=Arabidopsis thaliana OX=3702 GN=P4H6 PE=2 SV=...
Q8LAN3 | 3.9e-96 | 61.54 | Probable prolyl 4-hydroxylase 4 OS=Arabidopsis thaliana OX=3702 GN=P4H4 PE=2 SV=...
F4JAU3 | 4.3e-95 | 61.90 | Prolyl 4-hydroxylase 2 OS=Arabidopsis thaliana OX=3702 GN=P4H2 PE=1 SV=1
Q9LN20 | 2.7e-65 | 56.52 | Probable prolyl 4-hydroxylase 3 OS=Arabidopsis thaliana OX=3702 GN=P4H3 PE=2 SV=...

Match Name | E-value | Identity (%) | Description
XP_023549944.1 | 8.29e-230 | 100.00 | probable prolyl 4-hydroxylase 7 [Cucurbita pepo subsp. pepo]
XP_022938573.1 | 9.68e-229 | 99.35 | probable prolyl 4-hydroxylase 7 [Cucurbita moschata]
KAG6578605.1 | 3.94e-228 | 99.03 | putative prolyl 4-hydroxylase 7, partial [Cucurbita argyrosperma subsp. sororia]
XP_022993651.1 | 2.28e-227 | 98.71 | probable prolyl 4-hydroxylase 7 [Cucurbita maxima]
KAG7016155.1 | 1.23e-206 | 99.28 | putative prolyl 4-hydroxylase 7, partial [Cucurbita argyrosperma subsp. argyrosp...

Match Name | E-value | Identity (%) | Description
A0A6J1FJ93 | 4.68e-229 | 99.35 | Procollagen-proline 4-dioxygenase OS=Cucurbita moschata OX=3662 GN=LOC111444767 ...
A0A6J1JWX0 | 1.10e-227 | 98.71 | Procollagen-proline 4-dioxygenase OS=Cucurbita maxima OX=3661 GN=LOC111489579 PE...
A0A1S3C8G4 | 1.43e-202 | 87.34 | Procollagen-proline 4-dioxygenase OS=Cucumis melo OX=3656 GN=LOC103498028 PE=3 S...
A0A0A0KS38 | 4.55e-196 | 86.58 | Procollagen-proline 4-dioxygenase OS=Cucumis sativus OX=3659 GN=Csa_5G633280 PE=...
A0A6J1BXN9 | 2.52e-193 | 84.98 | Procollagen-proline 4-dioxygenase OS=Momordica charantia OX=3673 GN=LOC111006412...

Match Name | E-value | Identity (%) | Description
AT3G28480.1 | 5.9e-124 | 68.25 | Oxoglutarate/iron-dependent oxygenase
AT3G28480.2 | 5.9e-116 | 64.40 | Oxoglutarate/iron-dependent oxygenase
AT3G28490.1 | 4.2e-114 | 65.92 | Oxoglutarate/iron-dependent oxygenase
AT5G18900.1 | 2.7e-97 | 61.54 | 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein
AT3G06300.1 | 3.0e-96 | 61.90 | P4H isoform 2

InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR003582 | ShKT domain | SMART | SM00254 | ShkT_1 | coord: 268..309; e-value: 9.8E-6; score: 35.1
IPR003582 | ShKT domain | PROSITE | PS51670 | SHKT | coord: 269..309; score: 10.172873
IPR006620 | Prolyl 4-hydroxylase, alpha subunit | SMART | SM00702 | p4hc | coord: 58..254; e-value: 4.0E-56; score: 202.5
None | No IPR available | GENE3D | 2.60.120.620 | q2cbj1_9rhob like domain | coord: 50..255; e-value: 1.1E-77; score: 262.5
None | No IPR available | PANTHER | PTHR10869:SF140 | OS03G0803500 PROTEIN | coord: 40..309
IPR044862 | Prolyl 4-hydroxylase alpha subunit, Fe(2+) 2OG dioxygenase domain | PFAM | PF13640 | 2OG-FeII_Oxy_3 | coord: 139..254; e-value: 5.0E-20; score: 72.2
IPR045054 | Prolyl 4-hydroxylase | PANTHER | PTHR10869 | PROLYL 4-HYDROXYLASE ALPHA SUBUNIT | coord: 40..309
IPR005123 | Oxoglutarate/iron-dependent dioxygenase | PROSITE | PS51471 | FE2OG_OXY | coord: 134..255; score: 12.40233
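
The alignment coordinates in the InterPro table above can be compared directly to see how the predicted domains sit relative to one another on the protein. The following Python sketch copies a subset of the coordinates from the table; the function names `contains` and `overlap` are illustrative assumptions, and coordinates are treated as inclusive residue positions.

```python
# Domain coordinates copied from the InterPro table above as
# (source term, start, end), in inclusive protein residue numbering.
DOMAINS = {
    "SM00254": ("SM00254", 268, 309),  # SMART ShkT_1
    "PS51670": ("PS51670", 269, 309),  # PROSITE SHKT
    "SM00702": ("SM00702", 58, 254),   # SMART p4hc
    "PF13640": ("PF13640", 139, 254),  # Pfam 2OG-FeII_Oxy_3
    "PS51471": ("PS51471", 134, 255),  # PROSITE FE2OG_OXY
}

def contains(outer, inner):
    """True if the inner interval lies entirely within the outer interval."""
    return outer[1] <= inner[1] and inner[2] <= outer[2]

def overlap(a, b):
    """Number of residues shared by two inclusive intervals (0 if disjoint)."""
    return max(0, min(a[2], b[2]) - max(a[1], b[1]) + 1)

if __name__ == "__main__":
    p4hc = DOMAINS["SM00702"]
    pfam = DOMAINS["PF13640"]
    shkt = DOMAINS["SM00254"]
    print("Pfam 2OG domain inside SMART p4hc:", contains(p4hc, pfam))
    print("Residues shared by p4hc and ShKT:", overlap(p4hc, shkt))
```

With these coordinates, the Pfam 2OG-FeII_Oxy_3 hit falls inside the SMART p4hc hit, and the C-terminal ShKT hit does not overlap the p4hc hit.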

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Cp4.1LG13g08580.1 | Cp4.1LG13g08580.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0018401 | peptidyl-proline hydroxylation to 4-hydroxy-L-proline
cellular_component | GO:0005789 | endoplasmic reticulum membrane
molecular_function | GO:0005506 | iron ion binding
molecular_function | GO:0031418 | L-ascorbic acid binding
molecular_function | GO:0004656 | procollagen-proline 4-dioxygenase activity
molecular_function | GO:0016705 | oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen