CsaV3_6G047520 (gene) Cucumber (Chinese Long) v3

Overview
NameCsaV3_6G047520
Typegene
OrganismCucumis sativus L. var. sativus cv. Chinese Long (Cucumber (Chinese Long) v3)
Description2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein
Locationchr6: 28030486 .. 28034086 (-)
RNA-Seq ExpressionCsaV3_6G047520
SyntenyCsaV3_6G047520
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonpolypeptideCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
AAAATAATGTGTACATAAACTAAAATATAGTATTTTAACCGAAGAAAAAGGATTGAATTTAATTTAATTATTGAAAAATAGAAAAGAAAAAAAGCCGCCGGGAAGCGAGAGAGAAAAATAAAAGAAAAAGAAGTTTGTGAGATTGGTGGAAAGGAGAGGCAGAACCCAAAATCAATCTTTAATTACGGTGTCAAAGTTCTTCGAGTTTACCATTACACAGAAGTTGAGAGAGAGAGAATTTGAGACTTTGAGAATTAGAGAATTAGAGGCTTTGGACTTTGGAGACGAAGACCAAAATTTTCAATCCACTACTCTCATCCAAATCCCCATCTTCAATTCCAGATCTTCCCCCCTAAAGGGTTTCTGTTTCTGTTTCCATGGCGAAACACAGACAATCTCGATTCCCTACTCGCAAGTCCTCTTCTTCCTCTACTCTCGTCTTTACCTTGCTCATTATGTTCACTTTCGTCATTCTCATTCTTCTCGCCCTTGGAATCCTCTCGATCCCTGGGAATTCCGGCGGTTCAACCAAGGTTCACGATCTCAGCTCGATTGTGCGCAAAACTTCTGATGAGTATGTGATCTTCTGGGTCTTGCTTTCAATTTCTTGTTCGGTTTAGGGTGTTTTTTTTTTTTTATTGATTATAAGGGTTTTGTTTGTTTGTTTGTTTTTTTACTTTTCATTTCAGCGTCGACGAGGAGAAAGGCGAGCAGTGGGTTGAAGTGATCTCATGGGAACCTAGAGCTTTCGTTTATCACAACTTTCTGGTATTCTCTTCTTCCCGATCTGTAATGTGAGAGATTTGTGTTTTATAAATATATATGCAACTTTTGCGTGGAATTTTTTAAACTTGGTTTGTTTATGCTCTTACGTGTTGAATGTGTCTTTGGGAGCGCATTTTTTAAATGAAGTCTTTAGGGGCTGCTCATCACTTAGGTTGATCATGTTTATGTTCCTCTAAGAAGTCAGTGTGCTTCTTTGTTATTATGATGAAAGGGGTTAAGGATAATTCCCATTTTTATATGTCTTTAGATCGTGTTGGAGTTTTGAGTTGACTTAGGCTGAGGCTCGTCGATTAATCTTGGAGTAGGAAAATACTGAACTGGTTTACTTTTTAATTGCTTAGACGAAGGAGGAATGTGAATACCTAATCAGCCTTGCCAAGCCTCATATGCAAAAGTCTACAGTTGTTGACAGTGAAACTGGACAGAGTAAAGATAGCAGGTGTGCTATTCATTTCTTCATAATTCCTTTTTGTTAGTATTGCTGGATCTACAATGTAGAAGCCTCCCTGACCTATTTTTCACAGAGTTCGCACTAGCTCTGGAACGTTTTTGCCGAGAGGGCGTGATAAGACTGTTAGAACTATAGAGAAAAGGCTTTCTGATTTCTCCTTCATACCCGTAGGTACATTTTTTAGCCTCTTTCCTTCTTCCCGGTTTCTTCCCACTACATCAAGCATTATAATTTGATTATGCATGGTTAAAATTGAACTTTGTAATCTTTAATTTAAATCCAACTGCACTCATCATTGTGTTATAAATTTCCCCTCTTACCTAGCATTGCCAATCATGACTTTTATATTGTTAGAAAAAAGATTATTACAATCATGCTCGAGATCATTCAAATAGGCAGAATTATTGTGCTAAGTTTCACGTGCCAAGGAAGATCTTATAACTATGGCTGAAATGATCAATAATGAATATCAACTCCAATTGTTATCCTATGAGCTTATTACGCACCCACTGTCGCTTGTAAGAATGAAAGATAATTGGATTCACCTGATAAGCTGCTTCAATATTTGTTATTATATGTAGTTTTTGTTATCTTATCTTTGAAATTTTGCATTAGATCAAACTTATTAGACACCTTCTTTTTCATATTTTGTTACTCAAGTGCACACATAATATGCAGAGCACGGAGAAGGACTTCAGGTTCTTCATTACGAAGTAGGACAAAAATATGAACCTCATTTTGATTACTTCCTTGATGAGTACAATACAAAGAATGGAGGTCAACGTATAGCAACGGTGCTTATGTACCTGTAAGTAGTAATCAATATGAACTCAGAACAAGCGCTACTACATTTTGCATGGTTATCTCATCATTTTATATATCTTCTATCATAATTCAGCTCAGATGTCGAAGAAGGAGGTGAGACAGTGTTCCCTGCTGCCAAAGGAAACTTTAGTTCTGTACCTTGGTGGAATGAGCTTTCAGATTGTGGGAAGAAAGGACTTTCAGTTAAACCAAAGAGGGGCGATGCATTGCTTTTCTGGAGCATGAAGCCTGATGCTTCCCTAGATCCATCAAGTTTGCATGGTTAGATTGGATACTCTCAACTTATAAACTTACCATTCATTTTACTGAGCATGATATATAGGTTTCAGTGATGCACATTCATATCTGTCGATGCAATTTCTTTTGCATCTCCTATATGCAGAAAGTGATTGATTTTCTTGATAAAGAGATGACATTTTTTCTGGTGTTCATTTATAGGTGGCTGTCCTGTTATCAAGGGGAACAAATGGTCTGCTACTAAATGGATGCGAGTAGAAGAATATAAAGCTTGAGTTGATGTGACTAAGGTAATTTCCATGCTCTAAACCATGAACTAAAATTGTAATCAAATGAACACCTCAATCACTGGACAGTCATTCTTTAATTTTTCCTAGATCGTTATTCACCTGAATTGTTGAGGGTCTGTTTAATAATAGTTTTTGAACATTAAGCTTGTTTCCTCTTAATTTTTTACAATGTTCATTTTTCTTTAGTAAAAAAGAGCTGAGTTCTTTTCCAAGTTCCAAAAATAAAAACAAGTTTTTAAAAGCTATTATTTTTTAGTTCTCAAAAGTTGATTGGGTCTTTAAAACATTAGTAAAATGTAGATAATAATTCAAAAAGTTTAGAGTTGGAAGCGATGTTTGTAGGCTTTTAACTTTTAAAAACTGAAAAAATAAATAACTAAATGGTTAACAAACGGGACTAAATCACTTTCAAGAAGCGAACAAAGGAATTTTTTGTTCTGACTTTTCTCATGTTACTGGTAAAAGAATTATTCCTAAAGGTTTGATGCTTCATTATCTTGCAGGTTGGTTGACATTTCTTGTTATTTGTCAAAGTCACAGATGCAATTTGTGTAGGGAATCATCATTTTCTATAATTTAACCTTTGTCCACAACTACTAGATCCTTAAATCTATGAATTTTGATATGGTTAAATTTTAGTTACGAGCTTACATTGTTATTGATTTCTTGTTTATGGCCACGAGAGAATAAGAGTTCATATTTTTGCCAGAAAATTCCACCTTCAAACCTCAGTTACAGATTCATATGGTATGTTATGTAAAAAAAAAAAAACAAAAACAAAGATTTTACTCTTGATGTACAACACAACTCCATTTTTTTCTCTTCATTTTATAGTTTGTGTTCTTGTTGAAGATATATATGTCTTGGTTTTGTTTAGGATTCTGTTGTTTCTGTGAAAGGAAGTGATAGGGAATGAAAAGTCGTGACTCCAAGTCTCATTATTACAAAAAGAAAGTAAACCCATTTTTGTTATTGTATATTTTTGGTTAAATTTATGGGTCTTTAATTGCAA

mRNA sequence

ATGGCGAAACACAGACAATCTCGATTCCCTACTCGCAAGTCCTCTTCTTCCTCTACTCTCGTCTTTACCTTGCTCATTATGTTCACTTTCGTCATTCTCATTCTTCTCGCCCTTGGAATCCTCTCGATCCCTGGGAATTCCGGCGGTTCAACCAAGGTTCACGATCTCAGCTCGATTGTGCGCAAAACTTCTGATGACGTCGACGAGGAGAAAGGCGAGCAGTGGGTTGAAGTGATCTCATGGGAACCTAGAGCTTTCGTTTATCACAACTTTCTGACGAAGGAGGAATGTGAATACCTAATCAGCCTTGCCAAGCCTCATATGCAAAAGTCTACAGTTGTTGACAGTGAAACTGGACAGAGTAAAGATAGCAGAGTTCGCACTAGCTCTGGAACGTTTTTGCCGAGAGGGCGTGATAAGACTGTTAGAACTATAGAGAAAAGGCTTTCTGATTTCTCCTTCATACCCGTAGAGCACGGAGAAGGACTTCAGGTTCTTCATTACGAAGTAGGACAAAAATATGAACCTCATTTTGATTACTTCCTTGATGAGTACAATACAAAGAATGGAGGTCAACGTATAGCAACGGTGCTTATGTACCTCTCAGATGTCGAAGAAGGAGGTGAGACAGTGTTCCCTGCTGCCAAAGGAAACTTTAGTTCTGTACCTTGGTGGAATGAGCTTTCAGATTGTGGGAAGAAAGGACTTTCAGTTAAACCAAAGAGGGGCGATGCATTGCTTTTCTGGAGCATGAAGCCTGATGCTTCCCTAGATCCATCAAGTTTGCATGGTGGCTGTCCTGTTATCAAGGGGAACAAATGGTCTGCTACTAAATGGATGCGAGTAGAAGAATATAAAGCTTGA

Coding sequence (CDS)

ATGGCGAAACACAGACAATCTCGATTCCCTACTCGCAAGTCCTCTTCTTCCTCTACTCTCGTCTTTACCTTGCTCATTATGTTCACTTTCGTCATTCTCATTCTTCTCGCCCTTGGAATCCTCTCGATCCCTGGGAATTCCGGCGGTTCAACCAAGGTTCACGATCTCAGCTCGATTGTGCGCAAAACTTCTGATGACGTCGACGAGGAGAAAGGCGAGCAGTGGGTTGAAGTGATCTCATGGGAACCTAGAGCTTTCGTTTATCACAACTTTCTGACGAAGGAGGAATGTGAATACCTAATCAGCCTTGCCAAGCCTCATATGCAAAAGTCTACAGTTGTTGACAGTGAAACTGGACAGAGTAAAGATAGCAGAGTTCGCACTAGCTCTGGAACGTTTTTGCCGAGAGGGCGTGATAAGACTGTTAGAACTATAGAGAAAAGGCTTTCTGATTTCTCCTTCATACCCGTAGAGCACGGAGAAGGACTTCAGGTTCTTCATTACGAAGTAGGACAAAAATATGAACCTCATTTTGATTACTTCCTTGATGAGTACAATACAAAGAATGGAGGTCAACGTATAGCAACGGTGCTTATGTACCTCTCAGATGTCGAAGAAGGAGGTGAGACAGTGTTCCCTGCTGCCAAAGGAAACTTTAGTTCTGTACCTTGGTGGAATGAGCTTTCAGATTGTGGGAAGAAAGGACTTTCAGTTAAACCAAAGAGGGGCGATGCATTGCTTTTCTGGAGCATGAAGCCTGATGCTTCCCTAGATCCATCAAGTTTGCATGGTGGCTGTCCTGTTATCAAGGGGAACAAATGGTCTGCTACTAAATGGATGCGAGTAGAAGAATATAAAGCTTGA

Protein sequence

MAKHRQSRFPTRKSSSSSTLVFTLLIMFTFVILILLALGILSIPGNSGGSTKVHDLSSIVRKTSDDVDEEKGEQWVEVISWEPRAFVYHNFLTKEECEYLISLAKPHMQKSTVVDSETGQSKDSRVRTSSGTFLPRGRDKTVRTIEKRLSDFSFIPVEHGEGLQVLHYEVGQKYEPHFDYFLDEYNTKNGGQRIATVLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGLSVKPKRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVEEYKA*
Homology
BLAST of CsaV3_6G047520 vs. NCBI nr
Match: XP_004134841.1 (probable prolyl 4-hydroxylase 10 [Cucumis sativus] >XP_031742781.1 probable prolyl 4-hydroxylase 10 [Cucumis sativus] >KGN48959.1 hypothetical protein Csa_003630 [Cucumis sativus])

HSP 1 Score: 584.3 bits (1505), Expect = 5.6e-163
Identity = 287/287 (100.00%), Postives = 287/287 (100.00%), Query Frame = 0

Query: 1   MAKHRQSRFPTRKSSSSSTLVFTLLIMFTFVILILLALGILSIPGNSGGSTKVHDLSSIV 60
           MAKHRQSRFPTRKSSSSSTLVFTLLIMFTFVILILLALGILSIPGNSGGSTKVHDLSSIV
Sbjct: 1   MAKHRQSRFPTRKSSSSSTLVFTLLIMFTFVILILLALGILSIPGNSGGSTKVHDLSSIV 60

Query: 61  RKTSDDVDEEKGEQWVEVISWEPRAFVYHNFLTKEECEYLISLAKPHMQKSTVVDSETGQ 120
           RKTSDDVDEEKGEQWVEVISWEPRAFVYHNFLTKEECEYLISLAKPHMQKSTVVDSETGQ
Sbjct: 61  RKTSDDVDEEKGEQWVEVISWEPRAFVYHNFLTKEECEYLISLAKPHMQKSTVVDSETGQ 120

Query: 121 SKDSRVRTSSGTFLPRGRDKTVRTIEKRLSDFSFIPVEHGEGLQVLHYEVGQKYEPHFDY 180
           SKDSRVRTSSGTFLPRGRDKTVRTIEKRLSDFSFIPVEHGEGLQVLHYEVGQKYEPHFDY
Sbjct: 121 SKDSRVRTSSGTFLPRGRDKTVRTIEKRLSDFSFIPVEHGEGLQVLHYEVGQKYEPHFDY 180

Query: 181 FLDEYNTKNGGQRIATVLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGLSVKP 240
           FLDEYNTKNGGQRIATVLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGLSVKP
Sbjct: 181 FLDEYNTKNGGQRIATVLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGLSVKP 240

Query: 241 KRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVEEYKA 288
           KRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVEEYKA
Sbjct: 241 KRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVEEYKA 287

BLAST of CsaV3_6G047520 vs. NCBI nr
Match: XP_008440878.1 (PREDICTED: probable prolyl 4-hydroxylase 10 [Cucumis melo])

HSP 1 Score: 582.8 bits (1501), Expect = 1.6e-162
Identity = 284/287 (98.95%), Postives = 287/287 (100.00%), Query Frame = 0

Query: 1   MAKHRQSRFPTRKSSSSSTLVFTLLIMFTFVILILLALGILSIPGNSGGSTKVHDLSSIV 60
           MAKHRQSRFPTRKSSSSSTLVFTLLIMFTFVILILLALGILSIPGNSGGSTKVHDLSSIV
Sbjct: 1   MAKHRQSRFPTRKSSSSSTLVFTLLIMFTFVILILLALGILSIPGNSGGSTKVHDLSSIV 60

Query: 61  RKTSDDVDEEKGEQWVEVISWEPRAFVYHNFLTKEECEYLISLAKPHMQKSTVVDSETGQ 120
           RKTSDDVDEEKGEQWVEVISWEPRAF+YHNFLTKEECEYLISLAKPHMQKSTVVDSETGQ
Sbjct: 61  RKTSDDVDEEKGEQWVEVISWEPRAFIYHNFLTKEECEYLISLAKPHMQKSTVVDSETGQ 120

Query: 121 SKDSRVRTSSGTFLPRGRDKTVRTIEKRLSDFSFIPVEHGEGLQVLHYEVGQKYEPHFDY 180
           SKDSRVRTSSGTFLPRGRDKT+RTIEKR+SDFSFIPVEHGEGLQVLHYEVGQKYEPHFDY
Sbjct: 121 SKDSRVRTSSGTFLPRGRDKTIRTIEKRISDFSFIPVEHGEGLQVLHYEVGQKYEPHFDY 180

Query: 181 FLDEYNTKNGGQRIATVLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGLSVKP 240
           FLDEYNTKNGGQRIATVLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGLSVKP
Sbjct: 181 FLDEYNTKNGGQRIATVLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGLSVKP 240

Query: 241 KRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVEEYKA 288
           KRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVEEYKA
Sbjct: 241 KRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVEEYKA 287

BLAST of CsaV3_6G047520 vs. NCBI nr
Match: XP_038881910.1 (probable prolyl 4-hydroxylase 10 isoform X2 [Benincasa hispida])

HSP 1 Score: 569.7 bits (1467), Expect = 1.4e-158
Identity = 277/286 (96.85%), Postives = 284/286 (99.30%), Query Frame = 0

Query: 1   MAKHRQSRFPTRKSSSSSTLVFTLLIMFTFVILILLALGILSIPGNSGGSTKVHDLSSIV 60
           MAKHRQSRFPTRKSSSSSTLVFTLLIMFTFVILILLALGILSIPGNSGGS KVHDLSSIV
Sbjct: 1   MAKHRQSRFPTRKSSSSSTLVFTLLIMFTFVILILLALGILSIPGNSGGSPKVHDLSSIV 60

Query: 61  RKTSDDVDEEKGEQWVEVISWEPRAFVYHNFLTKEECEYLISLAKPHMQKSTVVDSETGQ 120
           RKTSDDVDEE+GEQWVEVISWEPRAFVYHNFLTKEECEYLISLA PHMQKSTVVDSETG+
Sbjct: 61  RKTSDDVDEEQGEQWVEVISWEPRAFVYHNFLTKEECEYLISLAMPHMQKSTVVDSETGK 120

Query: 121 SKDSRVRTSSGTFLPRGRDKTVRTIEKRLSDFSFIPVEHGEGLQVLHYEVGQKYEPHFDY 180
           SKDSRVRTSSGTFLPRGRDKT+RTIEKR++DFSFIPVEHGEGLQVLHYEVGQKYEPHFDY
Sbjct: 121 SKDSRVRTSSGTFLPRGRDKTIRTIEKRIADFSFIPVEHGEGLQVLHYEVGQKYEPHFDY 180

Query: 181 FLDEYNTKNGGQRIATVLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGLSVKP 240
           FLDEYNTKNGGQRIATVLMYLS+VEEGGETVFPAAKGNFSSVPWWNELS+CGKKGLSVKP
Sbjct: 181 FLDEYNTKNGGQRIATVLMYLSEVEEGGETVFPAAKGNFSSVPWWNELSECGKKGLSVKP 240

Query: 241 KRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVEEYK 287
           KRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVEEYK
Sbjct: 241 KRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVEEYK 286

BLAST of CsaV3_6G047520 vs. NCBI nr
Match: XP_038881909.1 (probable prolyl 4-hydroxylase 10 isoform X1 [Benincasa hispida])

HSP 1 Score: 562.4 bits (1448), Expect = 2.3e-156
Identity = 276/290 (95.17%), Postives = 284/290 (97.93%), Query Frame = 0

Query: 1   MAKHRQSRFPTRKSSSSSTLVFTLLIMFTFVILILLALGILSIPGNSGGSTKVHDLSSIV 60
           MAKHRQSRFPTRKSSSSSTLVFTLLIMFTFVILILLALGILSIPGNSGGS KVHDLSSIV
Sbjct: 1   MAKHRQSRFPTRKSSSSSTLVFTLLIMFTFVILILLALGILSIPGNSGGSPKVHDLSSIV 60

Query: 61  RKTSDD----VDEEKGEQWVEVISWEPRAFVYHNFLTKEECEYLISLAKPHMQKSTVVDS 120
           RKTSD+    VDEE+GEQWVEVISWEPRAFVYHNFLTKEECEYLISLA PHMQKSTVVDS
Sbjct: 61  RKTSDEYVISVDEEQGEQWVEVISWEPRAFVYHNFLTKEECEYLISLAMPHMQKSTVVDS 120

Query: 121 ETGQSKDSRVRTSSGTFLPRGRDKTVRTIEKRLSDFSFIPVEHGEGLQVLHYEVGQKYEP 180
           ETG+SKDSRVRTSSGTFLPRGRDKT+RTIEKR++DFSFIPVEHGEGLQVLHYEVGQKYEP
Sbjct: 121 ETGKSKDSRVRTSSGTFLPRGRDKTIRTIEKRIADFSFIPVEHGEGLQVLHYEVGQKYEP 180

Query: 181 HFDYFLDEYNTKNGGQRIATVLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGL 240
           HFDYFLDEYNTKNGGQRIATVLMYLS+VEEGGETVFPAAKGNFSSVPWWNELS+CGKKGL
Sbjct: 181 HFDYFLDEYNTKNGGQRIATVLMYLSEVEEGGETVFPAAKGNFSSVPWWNELSECGKKGL 240

Query: 241 SVKPKRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVEEYK 287
           SVKPKRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVEEYK
Sbjct: 241 SVKPKRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVEEYK 290

BLAST of CsaV3_6G047520 vs. NCBI nr
Match: XP_022978860.1 (probable prolyl 4-hydroxylase 10 [Cucurbita maxima])

HSP 1 Score: 558.5 bits (1438), Expect = 3.3e-155
Identity = 269/287 (93.73%), Postives = 281/287 (97.91%), Query Frame = 0

Query: 1   MAKHRQSRFPTRKSSSSSTLVFTLLIMFTFVILILLALGILSIPGNSGGSTKVHDLSSIV 60
           MAKHRQSRFPTRKSSSSST++FTLLIMFTFVILILLALGILSIPGNSGGS KVHDLSSIV
Sbjct: 1   MAKHRQSRFPTRKSSSSSTIIFTLLIMFTFVILILLALGILSIPGNSGGSPKVHDLSSIV 60

Query: 61  RKTSDDVDEEKGEQWVEVISWEPRAFVYHNFLTKEECEYLISLAKPHMQKSTVVDSETGQ 120
           RKTS+DVDEEKGE+W EVISWEPRAFVYHNFLTKEECEYLIS AKPHMQKSTVVDSETG+
Sbjct: 61  RKTSEDVDEEKGERWAEVISWEPRAFVYHNFLTKEECEYLISHAKPHMQKSTVVDSETGK 120

Query: 121 SKDSRVRTSSGTFLPRGRDKTVRTIEKRLSDFSFIPVEHGEGLQVLHYEVGQKYEPHFDY 180
           SKDSRVRTSSGTFLPRG DK + TIEKR++DF+FIP+EHGEGLQVLHYEVGQKYEPHFDY
Sbjct: 121 SKDSRVRTSSGTFLPRGGDKIISTIEKRIADFTFIPIEHGEGLQVLHYEVGQKYEPHFDY 180

Query: 181 FLDEYNTKNGGQRIATVLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGLSVKP 240
           FLD+YNTKNGGQRIATVLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGLSVKP
Sbjct: 181 FLDDYNTKNGGQRIATVLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGLSVKP 240

Query: 241 KRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVEEYKA 288
           KRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRV+EYKA
Sbjct: 241 KRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVDEYKA 287

BLAST of CsaV3_6G047520 vs. ExPASy Swiss-Prot
Match: F4JZ24 (Probable prolyl 4-hydroxylase 10 OS=Arabidopsis thaliana OX=3702 GN=P4H10 PE=3 SV=1)

HSP 1 Score: 471.1 bits (1211), Expect = 9.1e-132
Identity = 227/288 (78.82%), Postives = 256/288 (88.89%), Query Frame = 0

Query: 1   MAKHRQSRFPTRKSSSSSTLVFTLLIMFTFVILILLALGILSIPGNSGGSTKVHDLSSIV 60
           MA+ R  R P+ + SS STLVF +LIM TFVILILLA GILS+P N+ GS+K +DL+SIV
Sbjct: 2   MARPRNHR-PSARKSSHSTLVFAVLIMSTFVILILLAFGILSVPSNNAGSSKANDLTSIV 61

Query: 61  RKT--SDDVDEEKGEQWVEVISWEPRAFVYHNFLTKEECEYLISLAKPHMQKSTVVDSET 120
           RKT      D+ K E+WVE+ISWEPRA VYHNFLTKEEC+YLI LAKPHM+KSTVVD +T
Sbjct: 62  RKTLQRSGEDDSKNERWVEIISWEPRASVYHNFLTKEECKYLIELAKPHMEKSTVVDEKT 121

Query: 121 GQSKDSRVRTSSGTFLPRGRDKTVRTIEKRLSDFSFIPVEHGEGLQVLHYEVGQKYEPHF 180
           G+S DSRVRTSSGTFL RGRDKT+R IEKR+SDF+FIPVEHGEGLQVLHYE+GQKYEPH+
Sbjct: 122 GKSTDSRVRTSSGTFLARGRDKTIREIEKRISDFTFIPVEHGEGLQVLHYEIGQKYEPHY 181

Query: 181 DYFLDEYNTKNGGQRIATVLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGLSV 240
           DYF+DEYNT+NGGQRIATVLMYLSDVEEGGETVFPAAKGN+S+VPWWNELS+CGK GLSV
Sbjct: 182 DYFMDEYNTRNGGQRIATVLMYLSDVEEGGETVFPAAKGNYSAVPWWNELSECGKGGLSV 241

Query: 241 KPKRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVEEYK 287
           KPK GDALLFWSM PDA+LDPSSLHGGC VIKGNKWS+TKW+RV EYK
Sbjct: 242 KPKMGDALLFWSMTPDATLDPSSLHGGCAVIKGNKWSSTKWLRVHEYK 288

BLAST of CsaV3_6G047520 vs. ExPASy Swiss-Prot
Match: Q24JN5 (Prolyl 4-hydroxylase 5 OS=Arabidopsis thaliana OX=3702 GN=P4H5 PE=2 SV=1)

HSP 1 Score: 433.3 bits (1113), Expect = 2.1e-120
Identity = 210/281 (74.73%), Postives = 241/281 (85.77%), Query Frame = 0

Query: 8   RFPTRKSSSSSTLVFTLLIMFTFVILILLALGILSIPGNSGGSTKVHDLSSIVRK--TSD 67
           R+  RKS S ST  FT+LI+   VILILL LGILS+P  +  S+K +DL++IVRK  TS 
Sbjct: 10  RYQPRKSVSRSTQAFTVLILLLVVILILLGLGILSLPNANRNSSKTNDLTNIVRKSETSS 69

Query: 68  DVDEEKGEQWVEVISWEPRAFVYHNFLTKEECEYLISLAKPHMQKSTVVDSETGQSKDSR 127
             +E  GE+WVEVISWEPRA VYHNFLT EECE+LISLAKP M KSTVVD +TG SKDSR
Sbjct: 70  GDEEGNGERWVEVISWEPRAVVYHNFLTNEECEHLISLAKPSMVKSTVVDEKTGGSKDSR 129

Query: 128 VRTSSGTFLPRGRDKTVRTIEKRLSDFSFIPVEHGEGLQVLHYEVGQKYEPHFDYFLDEY 187
           VRTSSGTFL RG D+ V  IEKR+SDF+FIPVE+GEGLQVLHY+VGQKYEPH+DYFLDE+
Sbjct: 130 VRTSSGTFLRRGHDEVVEVIEKRISDFTFIPVENGEGLQVLHYQVGQKYEPHYDYFLDEF 189

Query: 188 NTKNGGQRIATVLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGLSVKPKRGDA 247
           NTKNGGQRIATVLMYLSDV++GGETVFPAA+GN S+VPWWNELS CGK+GLSV PK+ DA
Sbjct: 190 NTKNGGQRIATVLMYLSDVDDGGETVFPAARGNISAVPWWNELSKCGKEGLSVLPKKRDA 249

Query: 248 LLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVEEYK 287
           LLFW+M+PDASLDPSSLHGGCPV+KGNKWS+TKW  V E+K
Sbjct: 250 LLFWNMRPDASLDPSSLHGGCPVVKGNKWSSTKWFHVHEFK 290

BLAST of CsaV3_6G047520 vs. ExPASy Swiss-Prot
Match: Q9LN20 (Probable prolyl 4-hydroxylase 3 OS=Arabidopsis thaliana OX=3702 GN=P4H3 PE=2 SV=1)

HSP 1 Score: 430.3 bits (1105), Expect = 1.8e-119
Identity = 208/288 (72.22%), Postives = 248/288 (86.11%), Query Frame = 0

Query: 1   MAKHRQSRFPTRKSSSSSTLVFTLLIMFTFVILILLALGILSIPGNSGGSTKVHDLSSIV 60
           MAK R SRF  RK S+   ++F +L M T V+L+LLA G+ S+P N+  S+ + DLS   
Sbjct: 1   MAKLRHSRFQARKWSTLMLVLF-MLFMLTIVLLMLLAFGVFSLPINNDESSPI-DLSYFR 60

Query: 61  RKTSDDVD--EEKGEQWVEVISWEPRAFVYHNFLTKEECEYLISLAKPHMQKSTVVDSET 120
           R  ++  +   ++G+QW EV+SWEPRAFVYHNFL+KEECEYLISLAKPHM KSTVVDSET
Sbjct: 61  RAATERSEGLGKRGDQWTEVLSWEPRAFVYHNFLSKEECEYLISLAKPHMVKSTVVDSET 120

Query: 121 GQSKDSRVRTSSGTFLPRGRDKTVRTIEKRLSDFSFIPVEHGEGLQVLHYEVGQKYEPHF 180
           G+SKDSRVRTSSGTFL RGRDK ++TIEKR++D++FIP +HGEGLQVLHYE GQKYEPH+
Sbjct: 121 GKSKDSRVRTSSGTFLRRGRDKIIKTIEKRIADYTFIPADHGEGLQVLHYEAGQKYEPHY 180

Query: 181 DYFLDEYNTKNGGQRIATVLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGLSV 240
           DYF+DE+NTKNGGQR+AT+LMYLSDVEEGGETVFPAA  NFSSVPW+NELS+CGKKGLSV
Sbjct: 181 DYFVDEFNTKNGGQRMATMLMYLSDVEEGGETVFPAANMNFSSVPWYNELSECGKKGLSV 240

Query: 241 KPKRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVEEYK 287
           KP+ GDALLFWSM+PDA+LDP+SLHGGCPVI+GNKWS+TKWM V EYK
Sbjct: 241 KPRMGDALLFWSMRPDATLDPTSLHGGCPVIRGNKWSSTKWMHVGEYK 286

BLAST of CsaV3_6G047520 vs. ExPASy Swiss-Prot
Match: F4JNU8 (Probable prolyl 4-hydroxylase 8 OS=Arabidopsis thaliana OX=3702 GN=P4H8 PE=3 SV=1)

HSP 1 Score: 403.3 bits (1035), Expect = 2.3e-111
Identity = 199/288 (69.10%), Postives = 233/288 (80.90%), Query Frame = 0

Query: 3   KHRQSRFPTRKSSSSSTLVFTLLIMFTFVILILLALGILSIPGNSGGSTKVHDLSSIV-- 62
           K +Q R   RKS S+ T  FT++++  FVILIL+ LGI S+P  +  S+   DL++IV  
Sbjct: 4   KPKQLRNKPRKSFSTQT--FTVVVLVLFVILILVGLGIFSLPSTNKTSSMPMDLTTIVQT 63

Query: 63  ---RKTSDDVDEEKGEQWVEVISWEPRAFVYHNFLTKEECEYLISLAKPHMQKSTVVDSE 122
              R++  D ++  G++W+EVISWEPRAFVYHNFLT EECE+LISLAKP M KS VVD +
Sbjct: 64  IQERESFGDEEDGNGDRWLEVISWEPRAFVYHNFLTNEECEHLISLAKPSMMKSKVVDVK 123

Query: 123 TGQSKDSRVRTSSGTFLPRGRDKTVRTIEKRLSDFSFIPVEHGEGLQVLHYEVGQKYEPH 182
           TG+S DSRVRTSSGTFL RG D+ V  IE R+SDF+FIP E+GEGLQVLHYEVGQ+YEPH
Sbjct: 124 TGKSIDSRVRTSSGTFLNRGHDEIVEEIENRISDFTFIPPENGEGLQVLHYEVGQRYEPH 183

Query: 183 FDYFLDEYNTKNGGQRIATVLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGLS 242
            DYF DE+N + GGQRIATVLMYLSDV+EGGETVFPAAKGN S VPWW+ELS CGK+GLS
Sbjct: 184 HDYFFDEFNVRKGGQRIATVLMYLSDVDEGGETVFPAAKGNVSDVPWWDELSQCGKEGLS 243

Query: 243 VKPKRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVEEY 286
           V PK+ DALLFWSMKPDASLDPSSLHGGCPVIKGNKWS+TKW  V EY
Sbjct: 244 VLPKKRDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWFHVHEY 289

BLAST of CsaV3_6G047520 vs. ExPASy Swiss-Prot
Match: Q8LAN3 (Probable prolyl 4-hydroxylase 4 OS=Arabidopsis thaliana OX=3702 GN=P4H4 PE=2 SV=1)

HSP 1 Score: 252.3 bits (643), Expect = 6.6e-66
Identity = 123/238 (51.68%), Postives = 174/238 (73.11%), Query Frame = 0

Query: 50  STKVHDLSSIVRKTSDDVDEEKGEQWVEVISWEPRAFVYHNFLTKEECEYLISLAKPHMQ 109
           S  +   +S++  +S  V+  K    V+ +S +PRAFVY  FLT+ EC++++SLAK  ++
Sbjct: 15  SVLLQSSTSLISSSSVFVNPSK----VKQVSSKPRAFVYEGFLTELECDHMVSLAKASLK 74

Query: 110 KSTVVDSETGQSKDSRVRTSSGTFLPRGRDKTVRTIEKRLSDFSFIPVEHGEGLQVLHYE 169
           +S V D+++G+SK S VRTSSGTF+ +G+D  V  IE ++S ++F+P E+GE +QVL YE
Sbjct: 75  RSAVADNDSGESKFSEVRTSSGTFISKGKDPIVSGIEDKISTWTFLPKENGEDIQVLRYE 134

Query: 170 VGQKYEPHFDYFLDEYNTKNGGQRIATVLMYLSDVEEGGETVFPAAKGNFSSVPWWN--E 229
            GQKY+ HFDYF D+ N   GG R+AT+LMYLS+V +GGETVFP A+     V   N  +
Sbjct: 135 HGQKYDAHFDYFHDKVNIVRGGHRMATILMYLSNVTKGGETVFPDAEIPSRRVLSENKED 194

Query: 230 LSDCGKKGLSVKPKRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVEEY 286
           LSDC K+G++VKP++GDALLF+++ PDA  DP SLHGGCPVI+G KWSATKW+ V+ +
Sbjct: 195 LSDCAKRGIAVKPRKGDALLFFNLHPDAIPDPLSLHGGCPVIEGEKWSATKWIHVDSF 248

BLAST of CsaV3_6G047520 vs. ExPASy TrEMBL
Match: A0A0A0KJH9 (Fe2OG dioxygenase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G507320 PE=4 SV=1)

HSP 1 Score: 584.3 bits (1505), Expect = 2.7e-163
Identity = 287/287 (100.00%), Postives = 287/287 (100.00%), Query Frame = 0

Query: 1   MAKHRQSRFPTRKSSSSSTLVFTLLIMFTFVILILLALGILSIPGNSGGSTKVHDLSSIV 60
           MAKHRQSRFPTRKSSSSSTLVFTLLIMFTFVILILLALGILSIPGNSGGSTKVHDLSSIV
Sbjct: 1   MAKHRQSRFPTRKSSSSSTLVFTLLIMFTFVILILLALGILSIPGNSGGSTKVHDLSSIV 60

Query: 61  RKTSDDVDEEKGEQWVEVISWEPRAFVYHNFLTKEECEYLISLAKPHMQKSTVVDSETGQ 120
           RKTSDDVDEEKGEQWVEVISWEPRAFVYHNFLTKEECEYLISLAKPHMQKSTVVDSETGQ
Sbjct: 61  RKTSDDVDEEKGEQWVEVISWEPRAFVYHNFLTKEECEYLISLAKPHMQKSTVVDSETGQ 120

Query: 121 SKDSRVRTSSGTFLPRGRDKTVRTIEKRLSDFSFIPVEHGEGLQVLHYEVGQKYEPHFDY 180
           SKDSRVRTSSGTFLPRGRDKTVRTIEKRLSDFSFIPVEHGEGLQVLHYEVGQKYEPHFDY
Sbjct: 121 SKDSRVRTSSGTFLPRGRDKTVRTIEKRLSDFSFIPVEHGEGLQVLHYEVGQKYEPHFDY 180

Query: 181 FLDEYNTKNGGQRIATVLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGLSVKP 240
           FLDEYNTKNGGQRIATVLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGLSVKP
Sbjct: 181 FLDEYNTKNGGQRIATVLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGLSVKP 240

Query: 241 KRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVEEYKA 288
           KRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVEEYKA
Sbjct: 241 KRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVEEYKA 287

BLAST of CsaV3_6G047520 vs. ExPASy TrEMBL
Match: A0A1S3B1P6 (probable prolyl 4-hydroxylase 10 OS=Cucumis melo OX=3656 GN=LOC103485167 PE=4 SV=1)

HSP 1 Score: 582.8 bits (1501), Expect = 7.9e-163
Identity = 284/287 (98.95%), Postives = 287/287 (100.00%), Query Frame = 0

Query: 1   MAKHRQSRFPTRKSSSSSTLVFTLLIMFTFVILILLALGILSIPGNSGGSTKVHDLSSIV 60
           MAKHRQSRFPTRKSSSSSTLVFTLLIMFTFVILILLALGILSIPGNSGGSTKVHDLSSIV
Sbjct: 1   MAKHRQSRFPTRKSSSSSTLVFTLLIMFTFVILILLALGILSIPGNSGGSTKVHDLSSIV 60

Query: 61  RKTSDDVDEEKGEQWVEVISWEPRAFVYHNFLTKEECEYLISLAKPHMQKSTVVDSETGQ 120
           RKTSDDVDEEKGEQWVEVISWEPRAF+YHNFLTKEECEYLISLAKPHMQKSTVVDSETGQ
Sbjct: 61  RKTSDDVDEEKGEQWVEVISWEPRAFIYHNFLTKEECEYLISLAKPHMQKSTVVDSETGQ 120

Query: 121 SKDSRVRTSSGTFLPRGRDKTVRTIEKRLSDFSFIPVEHGEGLQVLHYEVGQKYEPHFDY 180
           SKDSRVRTSSGTFLPRGRDKT+RTIEKR+SDFSFIPVEHGEGLQVLHYEVGQKYEPHFDY
Sbjct: 121 SKDSRVRTSSGTFLPRGRDKTIRTIEKRISDFSFIPVEHGEGLQVLHYEVGQKYEPHFDY 180

Query: 181 FLDEYNTKNGGQRIATVLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGLSVKP 240
           FLDEYNTKNGGQRIATVLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGLSVKP
Sbjct: 181 FLDEYNTKNGGQRIATVLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGLSVKP 240

Query: 241 KRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVEEYKA 288
           KRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVEEYKA
Sbjct: 241 KRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVEEYKA 287

BLAST of CsaV3_6G047520 vs. ExPASy TrEMBL
Match: A0A6J1IRF1 (probable prolyl 4-hydroxylase 10 OS=Cucurbita maxima OX=3661 GN=LOC111478689 PE=4 SV=1)

HSP 1 Score: 558.5 bits (1438), Expect = 1.6e-155
Identity = 269/287 (93.73%), Postives = 281/287 (97.91%), Query Frame = 0

Query: 1   MAKHRQSRFPTRKSSSSSTLVFTLLIMFTFVILILLALGILSIPGNSGGSTKVHDLSSIV 60
           MAKHRQSRFPTRKSSSSST++FTLLIMFTFVILILLALGILSIPGNSGGS KVHDLSSIV
Sbjct: 1   MAKHRQSRFPTRKSSSSSTIIFTLLIMFTFVILILLALGILSIPGNSGGSPKVHDLSSIV 60

Query: 61  RKTSDDVDEEKGEQWVEVISWEPRAFVYHNFLTKEECEYLISLAKPHMQKSTVVDSETGQ 120
           RKTS+DVDEEKGE+W EVISWEPRAFVYHNFLTKEECEYLIS AKPHMQKSTVVDSETG+
Sbjct: 61  RKTSEDVDEEKGERWAEVISWEPRAFVYHNFLTKEECEYLISHAKPHMQKSTVVDSETGK 120

Query: 121 SKDSRVRTSSGTFLPRGRDKTVRTIEKRLSDFSFIPVEHGEGLQVLHYEVGQKYEPHFDY 180
           SKDSRVRTSSGTFLPRG DK + TIEKR++DF+FIP+EHGEGLQVLHYEVGQKYEPHFDY
Sbjct: 121 SKDSRVRTSSGTFLPRGGDKIISTIEKRIADFTFIPIEHGEGLQVLHYEVGQKYEPHFDY 180

Query: 181 FLDEYNTKNGGQRIATVLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGLSVKP 240
           FLD+YNTKNGGQRIATVLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGLSVKP
Sbjct: 181 FLDDYNTKNGGQRIATVLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGLSVKP 240

Query: 241 KRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVEEYKA 288
           KRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRV+EYKA
Sbjct: 241 KRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVDEYKA 287

BLAST of CsaV3_6G047520 vs. ExPASy TrEMBL
Match: A0A6J1GDV6 (probable prolyl 4-hydroxylase 10 OS=Cucurbita moschata OX=3662 GN=LOC111453285 PE=4 SV=1)

HSP 1 Score: 557.0 bits (1434), Expect = 4.6e-155
Identity = 269/287 (93.73%), Postives = 280/287 (97.56%), Query Frame = 0

Query: 1   MAKHRQSRFPTRKSSSSSTLVFTLLIMFTFVILILLALGILSIPGNSGGSTKVHDLSSIV 60
           MAKHRQSR PTRKSSSSSTL+FTLLIMFTFVILILLALGILSIPGNSGGS KVHDLSSIV
Sbjct: 1   MAKHRQSRLPTRKSSSSSTLIFTLLIMFTFVILILLALGILSIPGNSGGSPKVHDLSSIV 60

Query: 61  RKTSDDVDEEKGEQWVEVISWEPRAFVYHNFLTKEECEYLISLAKPHMQKSTVVDSETGQ 120
           RKTS+DVDEEKGE+W EVISWEPRAFVYHNFLTKEECEYLIS AKPHMQKSTVVDSETG+
Sbjct: 61  RKTSEDVDEEKGERWAEVISWEPRAFVYHNFLTKEECEYLISHAKPHMQKSTVVDSETGK 120

Query: 121 SKDSRVRTSSGTFLPRGRDKTVRTIEKRLSDFSFIPVEHGEGLQVLHYEVGQKYEPHFDY 180
           SKDSRVRTSSGTFLPRG DK + TIEKR++DF+FIP+EHGEGLQVLHYEVGQKYEPHFDY
Sbjct: 121 SKDSRVRTSSGTFLPRGGDKIISTIEKRIADFTFIPIEHGEGLQVLHYEVGQKYEPHFDY 180

Query: 181 FLDEYNTKNGGQRIATVLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGLSVKP 240
           FLD+YNTKNGGQRIATVLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGLSVKP
Sbjct: 181 FLDDYNTKNGGQRIATVLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGLSVKP 240

Query: 241 KRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVEEYKA 288
           KRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRV+EYKA
Sbjct: 241 KRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVDEYKA 287

BLAST of CsaV3_6G047520 vs. ExPASy TrEMBL
Match: A0A6J1EFG6 (probable prolyl 4-hydroxylase 10 OS=Cucurbita moschata OX=3662 GN=LOC111433654 PE=4 SV=1)

HSP 1 Score: 552.4 bits (1422), Expect = 1.1e-153
Identity = 268/287 (93.38%), Postives = 280/287 (97.56%), Query Frame = 0

Query: 1   MAKHRQSRFPTRKSSSSSTLVFTLLIMFTFVILILLALGILSIPGNSGGSTKVHDLSSIV 60
           MAKHRQ RFP+RKSSSSSTL+FTLLIMFTFVILILLALGILSIPGNS G+ K HDLSSIV
Sbjct: 1   MAKHRQPRFPSRKSSSSSTLIFTLLIMFTFVILILLALGILSIPGNS-GAPKAHDLSSIV 60

Query: 61  RKTSDDVDEEKGEQWVEVISWEPRAFVYHNFLTKEECEYLISLAKPHMQKSTVVDSETGQ 120
           RKTSD+VDEEKGEQW EVISWEPRAFVYHNFLTKEECEYLISLAKPHMQKS+VVDSETG+
Sbjct: 61  RKTSDEVDEEKGEQWAEVISWEPRAFVYHNFLTKEECEYLISLAKPHMQKSSVVDSETGK 120

Query: 121 SKDSRVRTSSGTFLPRGRDKTVRTIEKRLSDFSFIPVEHGEGLQVLHYEVGQKYEPHFDY 180
           SKDSRVRTSSGTFLPRGRDK +R IEKR++DFSF+PVEHGEGLQVLHYEVGQKYEPHFDY
Sbjct: 121 SKDSRVRTSSGTFLPRGRDKIIRNIEKRIADFSFVPVEHGEGLQVLHYEVGQKYEPHFDY 180

Query: 181 FLDEYNTKNGGQRIATVLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGLSVKP 240
           FLDEYNTKNGGQRIAT+LMYLSDVEEGGETVFPAAKGNFSSVPWW+ELSDCGKKGLSVKP
Sbjct: 181 FLDEYNTKNGGQRIATMLMYLSDVEEGGETVFPAAKGNFSSVPWWDELSDCGKKGLSVKP 240

Query: 241 KRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVEEYKA 288
           KRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVEEYKA
Sbjct: 241 KRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVEEYKA 286

BLAST of CsaV3_6G047520 vs. TAIR 10
Match: AT5G66060.1 (2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein )

HSP 1 Score: 471.1 bits (1211), Expect = 6.4e-133
Identity = 227/288 (78.82%), Postives = 256/288 (88.89%), Query Frame = 0

Query: 1   MAKHRQSRFPTRKSSSSSTLVFTLLIMFTFVILILLALGILSIPGNSGGSTKVHDLSSIV 60
           MA+ R  R P+ + SS STLVF +LIM TFVILILLA GILS+P N+ GS+K +DL+SIV
Sbjct: 2   MARPRNHR-PSARKSSHSTLVFAVLIMSTFVILILLAFGILSVPSNNAGSSKANDLTSIV 61

Query: 61  RKT--SDDVDEEKGEQWVEVISWEPRAFVYHNFLTKEECEYLISLAKPHMQKSTVVDSET 120
           RKT      D+ K E+WVE+ISWEPRA VYHNFLTKEEC+YLI LAKPHM+KSTVVD +T
Sbjct: 62  RKTLQRSGEDDSKNERWVEIISWEPRASVYHNFLTKEECKYLIELAKPHMEKSTVVDEKT 121

Query: 121 GQSKDSRVRTSSGTFLPRGRDKTVRTIEKRLSDFSFIPVEHGEGLQVLHYEVGQKYEPHF 180
           G+S DSRVRTSSGTFL RGRDKT+R IEKR+SDF+FIPVEHGEGLQVLHYE+GQKYEPH+
Sbjct: 122 GKSTDSRVRTSSGTFLARGRDKTIREIEKRISDFTFIPVEHGEGLQVLHYEIGQKYEPHY 181

Query: 181 DYFLDEYNTKNGGQRIATVLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGLSV 240
           DYF+DEYNT+NGGQRIATVLMYLSDVEEGGETVFPAAKGN+S+VPWWNELS+CGK GLSV
Sbjct: 182 DYFMDEYNTRNGGQRIATVLMYLSDVEEGGETVFPAAKGNYSAVPWWNELSECGKGGLSV 241

Query: 241 KPKRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVEEYK 287
           KPK GDALLFWSM PDA+LDPSSLHGGC VIKGNKWS+TKW+RV EYK
Sbjct: 242 KPKMGDALLFWSMTPDATLDPSSLHGGCAVIKGNKWSSTKWLRVHEYK 288

BLAST of CsaV3_6G047520 vs. TAIR 10
Match: AT2G17720.1 (2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein )

HSP 1 Score: 433.3 bits (1113), Expect = 1.5e-121
Identity = 210/281 (74.73%), Postives = 241/281 (85.77%), Query Frame = 0

Query: 8   RFPTRKSSSSSTLVFTLLIMFTFVILILLALGILSIPGNSGGSTKVHDLSSIVRK--TSD 67
           R+  RKS S ST  FT+LI+   VILILL LGILS+P  +  S+K +DL++IVRK  TS 
Sbjct: 10  RYQPRKSVSRSTQAFTVLILLLVVILILLGLGILSLPNANRNSSKTNDLTNIVRKSETSS 69

Query: 68  DVDEEKGEQWVEVISWEPRAFVYHNFLTKEECEYLISLAKPHMQKSTVVDSETGQSKDSR 127
             +E  GE+WVEVISWEPRA VYHNFLT EECE+LISLAKP M KSTVVD +TG SKDSR
Sbjct: 70  GDEEGNGERWVEVISWEPRAVVYHNFLTNEECEHLISLAKPSMVKSTVVDEKTGGSKDSR 129

Query: 128 VRTSSGTFLPRGRDKTVRTIEKRLSDFSFIPVEHGEGLQVLHYEVGQKYEPHFDYFLDEY 187
           VRTSSGTFL RG D+ V  IEKR+SDF+FIPVE+GEGLQVLHY+VGQKYEPH+DYFLDE+
Sbjct: 130 VRTSSGTFLRRGHDEVVEVIEKRISDFTFIPVENGEGLQVLHYQVGQKYEPHYDYFLDEF 189

Query: 188 NTKNGGQRIATVLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGLSVKPKRGDA 247
           NTKNGGQRIATVLMYLSDV++GGETVFPAA+GN S+VPWWNELS CGK+GLSV PK+ DA
Sbjct: 190 NTKNGGQRIATVLMYLSDVDDGGETVFPAARGNISAVPWWNELSKCGKEGLSVLPKKRDA 249

Query: 248 LLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVEEYK 287
           LLFW+M+PDASLDPSSLHGGCPV+KGNKWS+TKW  V E+K
Sbjct: 250 LLFWNMRPDASLDPSSLHGGCPVVKGNKWSSTKWFHVHEFK 290

BLAST of CsaV3_6G047520 vs. TAIR 10
Match: AT1G20270.1 (2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein )

HSP 1 Score: 430.3 bits (1105), Expect = 1.3e-120
Identity = 208/288 (72.22%), Postives = 248/288 (86.11%), Query Frame = 0

Query: 1   MAKHRQSRFPTRKSSSSSTLVFTLLIMFTFVILILLALGILSIPGNSGGSTKVHDLSSIV 60
           MAK R SRF  RK S+   ++F +L M T V+L+LLA G+ S+P N+  S+ + DLS   
Sbjct: 1   MAKLRHSRFQARKWSTLMLVLF-MLFMLTIVLLMLLAFGVFSLPINNDESSPI-DLSYFR 60

Query: 61  RKTSDDVD--EEKGEQWVEVISWEPRAFVYHNFLTKEECEYLISLAKPHMQKSTVVDSET 120
           R  ++  +   ++G+QW EV+SWEPRAFVYHNFL+KEECEYLISLAKPHM KSTVVDSET
Sbjct: 61  RAATERSEGLGKRGDQWTEVLSWEPRAFVYHNFLSKEECEYLISLAKPHMVKSTVVDSET 120

Query: 121 GQSKDSRVRTSSGTFLPRGRDKTVRTIEKRLSDFSFIPVEHGEGLQVLHYEVGQKYEPHF 180
           G+SKDSRVRTSSGTFL RGRDK ++TIEKR++D++FIP +HGEGLQVLHYE GQKYEPH+
Sbjct: 121 GKSKDSRVRTSSGTFLRRGRDKIIKTIEKRIADYTFIPADHGEGLQVLHYEAGQKYEPHY 180

Query: 181 DYFLDEYNTKNGGQRIATVLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGLSV 240
           DYF+DE+NTKNGGQR+AT+LMYLSDVEEGGETVFPAA  NFSSVPW+NELS+CGKKGLSV
Sbjct: 181 DYFVDEFNTKNGGQRMATMLMYLSDVEEGGETVFPAANMNFSSVPWYNELSECGKKGLSV 240

Query: 241 KPKRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVEEYK 287
           KP+ GDALLFWSM+PDA+LDP+SLHGGCPVI+GNKWS+TKWM V EYK
Sbjct: 241 KPRMGDALLFWSMRPDATLDPTSLHGGCPVIRGNKWSSTKWMHVGEYK 286

BLAST of CsaV3_6G047520 vs. TAIR 10
Match: AT4G35810.1 (2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein )

HSP 1 Score: 403.3 bits (1035), Expect = 1.7e-112
Identity = 199/288 (69.10%), Postives = 233/288 (80.90%), Query Frame = 0

Query: 3   KHRQSRFPTRKSSSSSTLVFTLLIMFTFVILILLALGILSIPGNSGGSTKVHDLSSIV-- 62
           K +Q R   RKS S+ T  FT++++  FVILIL+ LGI S+P  +  S+   DL++IV  
Sbjct: 4   KPKQLRNKPRKSFSTQT--FTVVVLVLFVILILVGLGIFSLPSTNKTSSMPMDLTTIVQT 63

Query: 63  ---RKTSDDVDEEKGEQWVEVISWEPRAFVYHNFLTKEECEYLISLAKPHMQKSTVVDSE 122
              R++  D ++  G++W+EVISWEPRAFVYHNFLT EECE+LISLAKP M KS VVD +
Sbjct: 64  IQERESFGDEEDGNGDRWLEVISWEPRAFVYHNFLTNEECEHLISLAKPSMMKSKVVDVK 123

Query: 123 TGQSKDSRVRTSSGTFLPRGRDKTVRTIEKRLSDFSFIPVEHGEGLQVLHYEVGQKYEPH 182
           TG+S DSRVRTSSGTFL RG D+ V  IE R+SDF+FIP E+GEGLQVLHYEVGQ+YEPH
Sbjct: 124 TGKSIDSRVRTSSGTFLNRGHDEIVEEIENRISDFTFIPPENGEGLQVLHYEVGQRYEPH 183

Query: 183 FDYFLDEYNTKNGGQRIATVLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGLS 242
            DYF DE+N + GGQRIATVLMYLSDV+EGGETVFPAAKGN S VPWW+ELS CGK+GLS
Sbjct: 184 HDYFFDEFNVRKGGQRIATVLMYLSDVDEGGETVFPAAKGNVSDVPWWDELSQCGKEGLS 243

Query: 243 VKPKRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVEEY 286
           V PK+ DALLFWSMKPDASLDPSSLHGGCPVIKGNKWS+TKW  V EY
Sbjct: 244 VLPKKRDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWFHVHEY 289

BLAST of CsaV3_6G047520 vs. TAIR 10
Match: AT5G66060.2 (2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein )

HSP 1 Score: 363.6 bits (932), Expect = 1.5e-100
Identity = 181/237 (76.37%), Postives = 207/237 (87.34%), Query Frame = 0

Query: 1   MAKHRQSRFPTRKSSSSSTLVFTLLIMFTFVILILLALGILSIPGNSGGSTKVHDLSSIV 60
           MA+ R  R P+ + SS STLVF +LIM TFVILILLA GILS+P N+ GS+K +DL+SIV
Sbjct: 2   MARPRNHR-PSARKSSHSTLVFAVLIMSTFVILILLAFGILSVPSNNAGSSKANDLTSIV 61

Query: 61  RKT--SDDVDEEKGEQWVEVISWEPRAFVYHNFLTKEECEYLISLAKPHMQKSTVVDSET 120
           RKT      D+ K E+WVE+ISWEPRA VYHNFL  EEC+YLI LAKPHM+KSTVVD +T
Sbjct: 62  RKTLQRSGEDDSKNERWVEIISWEPRASVYHNFL--EECKYLIELAKPHMEKSTVVDEKT 121

Query: 121 GQSKDSRVRTSSGTFLPRGRDKTVRTIEKRLSDFSFIPVEHGEGLQVLHYEVGQKYEPHF 180
           G+S DSRVRTSSGTFL RGRDKT+R IEKR+SDF+FIPVEHGEGLQVLHYE+GQKYEPH+
Sbjct: 122 GKSTDSRVRTSSGTFLARGRDKTIREIEKRISDFTFIPVEHGEGLQVLHYEIGQKYEPHY 181

Query: 181 DYFLDEYNTKNGGQRIATVLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKG 236
           DYF+DEYNT+NGGQRIATVLMYLSDVEEGGETVFPAAKGN+S+VPWWNELS+CGK G
Sbjct: 182 DYFMDEYNTRNGGQRIATVLMYLSDVEEGGETVFPAAKGNYSAVPWWNELSECGKGG 235

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_004134841.15.6e-163100.00probable prolyl 4-hydroxylase 10 [Cucumis sativus] >XP_031742781.1 probable prol... [more]
XP_008440878.11.6e-16298.95PREDICTED: probable prolyl 4-hydroxylase 10 [Cucumis melo][more]
XP_038881910.11.4e-15896.85probable prolyl 4-hydroxylase 10 isoform X2 [Benincasa hispida][more]
XP_038881909.12.3e-15695.17probable prolyl 4-hydroxylase 10 isoform X1 [Benincasa hispida][more]
XP_022978860.13.3e-15593.73probable prolyl 4-hydroxylase 10 [Cucurbita maxima][more]
Match NameE-valueIdentityDescription
F4JZ249.1e-13278.82Probable prolyl 4-hydroxylase 10 OS=Arabidopsis thaliana OX=3702 GN=P4H10 PE=3 S... [more]
Q24JN52.1e-12074.73Prolyl 4-hydroxylase 5 OS=Arabidopsis thaliana OX=3702 GN=P4H5 PE=2 SV=1[more]
Q9LN201.8e-11972.22Probable prolyl 4-hydroxylase 3 OS=Arabidopsis thaliana OX=3702 GN=P4H3 PE=2 SV=... [more]
F4JNU82.3e-11169.10Probable prolyl 4-hydroxylase 8 OS=Arabidopsis thaliana OX=3702 GN=P4H8 PE=3 SV=... [more]
Q8LAN36.6e-6651.68Probable prolyl 4-hydroxylase 4 OS=Arabidopsis thaliana OX=3702 GN=P4H4 PE=2 SV=... [more]
Match NameE-valueIdentityDescription
A0A0A0KJH92.7e-163100.00Fe2OG dioxygenase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G... [more]
A0A1S3B1P67.9e-16398.95probable prolyl 4-hydroxylase 10 OS=Cucumis melo OX=3656 GN=LOC103485167 PE=4 SV... [more]
A0A6J1IRF11.6e-15593.73probable prolyl 4-hydroxylase 10 OS=Cucurbita maxima OX=3661 GN=LOC111478689 PE=... [more]
A0A6J1GDV64.6e-15593.73probable prolyl 4-hydroxylase 10 OS=Cucurbita moschata OX=3662 GN=LOC111453285 P... [more]
A0A6J1EFG61.1e-15393.38probable prolyl 4-hydroxylase 10 OS=Cucurbita moschata OX=3662 GN=LOC111433654 P... [more]
Match NameE-valueIdentityDescription
AT5G66060.16.4e-13378.822-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein [more]
AT2G17720.11.5e-12174.732-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein [more]
AT1G20270.11.3e-12072.222-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein [more]
AT4G35810.11.7e-11269.102-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein [more]
AT5G66060.21.5e-10076.372-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Cucumber (Chinese Long) v3
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableCOILSCoilCoilcoord: 280..287
NoneNo IPR availableGENE3D2.60.120.620q2cbj1_9rhob like domaincoord: 75..282
e-value: 1.2E-81
score: 275.3
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 112..133
NoneNo IPR availablePANTHERPTHR10869:SF157PROLYL 4-HYDROXYLASE 10-RELATEDcoord: 1..286
IPR006620Prolyl 4-hydroxylase, alpha subunitSMARTSM00702p4hccoord: 83..281
e-value: 2.3E-65
score: 233.1
IPR044862Prolyl 4-hydroxylase alpha subunit, Fe(2+) 2OG dioxygenase domainPFAMPF136402OG-FeII_Oxy_3coord: 163..281
e-value: 5.8E-22
score: 78.4
IPR045054Prolyl 4-hydroxylasePANTHERPTHR10869PROLYL 4-HYDROXYLASE ALPHA SUBUNITcoord: 1..286
IPR005123Oxoglutarate/iron-dependent dioxygenasePROSITEPS51471FE2OG_OXYcoord: 159..282
score: 12.68751

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CsaV3_6G047520.1CsaV3_6G047520.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0005506 iron ion binding
molecular_function GO:0031418 L-ascorbic acid binding
molecular_function GO:0016705 oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen