Cla97C09G172620 (gene) Watermelon (97103) v2.5

Overview
Name: Cla97C09G172620
Type: gene
Organism: Citrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
Description: ureidoglycolate hydrolases
Location: Cla97Chr09: 9101008 .. 9103151 (-)
RNA-Seq Expression: Cla97C09G172620
Synteny: Cla97C09G172620
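
The Location field above packs the chromosome, coordinates, and strand into one string. The following is a minimal sketch of splitting it into parts and computing the span; the names `Location` and `parse_location` are hypothetical, and the span formula assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Hypothetical container for a parsed Location field."""
    seqid: str
    start: int
    end: int
    strand: str

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


# Matches strings shaped like "Cla97Chr09: 9101008 .. 9103151 (-)".
_LOCATION_RE = re.compile(r"^\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*$")


def parse_location(text: str) -> Location:
    """Split a location string into sequence ID, start, end, and strand."""
    match = _LOCATION_RE.match(text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return Location(seqid, int(start), int(end), strand)


loc = parse_location("Cla97Chr09: 9101008 .. 9103151 (-)")
print(loc.seqid, loc.strand, loc.span)
```

Under the inclusive-coordinate assumption, the span for this gene is 2144 bases on the minus strand.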
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGAAAAGACAATAATGAAGCTGAAAGCCATAGAAGCGACGGCAGAGAACTTCGCAGAGTACGGGCAATTGATCGGAGCTACAGCGGACGGCGTTGAATTCGGAGATGAAGACGCCCAATTAGACCTTAACAATGGAATCCCTAGGTAAATTTTCACAAAACAAAATCTTCAATTATTTTAGTATATATTTGGTTCTGAATCGGATAGATCTGAGGGAAGAACAGGTTGTACATCCTCCACATTGAGAATCGAGCATTGGAATTCTCGAAGATAACGCATCACGCGCGAGTAACGCAGTGCTTGGGATCGGTGGATCGAGAGGCTTGGTATCTCGGAGTTGCGAAGGCGTCGATTGTTGCGGAGGGGGAGAATGGCGGAGGAGGGAGTTTGTGCAAGTCTGAGAGCGGCGGGCATTTGTATGTGCCGCCGAGTGTGGATGAAATTCGGGCGTTTAGGATATCGGGAGCGAAATTTGTGAAGCTGAATAAAGGGACATGGCATGCGGGTCCTCTGTTTAGAGCAAGTGCTAGAGATTTCTACAACCTGGAATTGACTGATACTAATGTAAGTTCCTCATCTCTCTCTTTCTTTTTTACCTTTTTACTTGAATCTTCAAATTTGCAGTGTTAAATTGGCCCAACTAAGATACAGATACACCGTCTAGTCTAATGTATCTCTTTCACGTATGTAGAAGGAGGATAAAAAAGAAAATAAGATACTTCAAATCATATATATATATATATATGATATATATATGATATTCTGCACAATTAAAATCTTCAACAAAACCCCCCTAAGGTTTTTAGAATCAATACCAACCAATCAATCTATCAATTTAGGCACTCTCAACTTGTTTGAGATAAGGTGAAATTTTAGAGATTTTTGTGGTTGTGAAGCACAAAATAATTATTTATTTTTCACTTGTTGAATTTTTGTTCAATTGTTTTTTTTAGGAAAATGAATTTATAATTAATTTATTAGATATAAAATTGAATTTTTATTTCTAATAGAATCTTGAATTTTTAATTTTTTGTTGGATAGATACATAAATCTAATTAAGTCTAGAATATTCATTTCACCTCTAATTTTTAAATTTGTGATCTACTAGTTTTATTATTATTTTCAAAAATCAAACCAAGTTTTGAAAATTAAAAAAAAAATTATTAAAAAAGAAAATTATTAAAAAAAAAATCACAGTTATAATAATTTTTTTTTTAATAATTATTTCTTTAGGAAATTATCTTAAATGGTATAACTGCTGATATTTCGAGATCTTTCGCGGTTTATCGCAGATAAAGAATGAAATTTTGTTACATTTGTAAATATTTTGGTTTTTTTTGCTATATTTGAAAATAACTTATTTTTTATTATTTTTTTAATTAAGGCCAAATATCAAATGACTTTGAAGTAATATATCATTTTAATTAATTAATGATTTTTTAACCTGCTTTAGAAAAAAAAAATTGTGAATAATTAGTGCAAGCTAATTAAATAATTTAAGTTTAATTTATAAAATAGCTATGAGAATATATTTTAATTACTTCTACATTTTAACTAATTTCAATCAATGAAGTTATCTATCTTATCGAATTTTAGTAAATTTATTATAAATTTTTTGGACGGAGTCAAGAATGATTTTGTTGGTTGCTATAATTTATTTTAAAATAATTAATCTTAAACTTTTAAAAGCACTTTTACTTTTTATACTATTTACAATATTAATTGTGAATAATTAAAAATATTTTGGGGTGATTTTAAAAATGGCCAAAATCATTTTAACACTCCCAAGCATGCTTGGTTTTAATTTCAAGTTAGGTATCCCATTAACTCCTTTTTAACTCCTTTGTCCTTTGTTAAATTGCCCGTATGACCATTTGACAATTTCCTTTCAACCCATAAATCTCTTTTCTTCAAAGAACACTTAGCTTTCAGTTTGTTTTCTTCAGAAGAATTTCTTTTCTTTTAGAAGAAATTTGAAGAAATGAAAGATAGAATTAC
AAATAATATGATTTTTGAATTAAAATATATTATTATAATTTAATATCACTTCAATGACTTCTGTATCTTCTCTTGTAGGTTGTTGATCATACAAGTTACAACTTTGGAGAAGAAAATGGGGTATTATTTCACATTGAGGACTAG

mRNA sequence

ATGGAAAAGACAATAATGAAGCTGAAAGCCATAGAAGCGACGGCAGAGAACTTCGCAGAGTACGGGCAATTGATCGGAGCTACAGCGGACGGCGTTGAATTCGGAGATGAAGACGCCCAATTAGACCTTAACAATGGAATCCCTAGGTTGTACATCCTCCACATTGAGAATCGAGCATTGGAATTCTCGAAGATAACGCATCACGCGCGAGTAACGCAGTGCTTGGGATCGGTGGATCGAGAGGCTTGGTATCTCGGAGTTGCGAAGGCGTCGATTGTTGCGGAGGGGGAGAATGGCGGAGGAGGGAGTTTGTGCAAGTCTGAGAGCGGCGGGCATTTGTATGTGCCGCCGAGTGTGGATGAAATTCGGGCGTTTAGGATATCGGGAGCGAAATTTGTGAAGCTGAATAAAGGGACATGGCATGCGGGTCCTCTGTTTAGAGCAAGTGCTAGAGATTTCTACAACCTGGAATTGACTGATACTAATGTTGTTGATCATACAAGTTACAACTTTGGAGAAGAAAATGGGGTATTATTTCACATTGAGGACTAG

Coding sequence (CDS)

ATGGAAAAGACAATAATGAAGCTGAAAGCCATAGAAGCGACGGCAGAGAACTTCGCAGAGTACGGGCAATTGATCGGAGCTACAGCGGACGGCGTTGAATTCGGAGATGAAGACGCCCAATTAGACCTTAACAATGGAATCCCTAGGTTGTACATCCTCCACATTGAGAATCGAGCATTGGAATTCTCGAAGATAACGCATCACGCGCGAGTAACGCAGTGCTTGGGATCGGTGGATCGAGAGGCTTGGTATCTCGGAGTTGCGAAGGCGTCGATTGTTGCGGAGGGGGAGAATGGCGGAGGAGGGAGTTTGTGCAAGTCTGAGAGCGGCGGGCATTTGTATGTGCCGCCGAGTGTGGATGAAATTCGGGCGTTTAGGATATCGGGAGCGAAATTTGTGAAGCTGAATAAAGGGACATGGCATGCGGGTCCTCTGTTTAGAGCAAGTGCTAGAGATTTCTACAACCTGGAATTGACTGATACTAATGTTGTTGATCATACAAGTTACAACTTTGGAGAAGAAAATGGGGTATTATTTCACATTGAGGACTAG

Protein sequence

MEKTIMKLKAIEATAENFAEYGQLIGATADGVEFGDEDAQLDLNNGIPRLYILHIENRALEFSKITHHARVTQCLGSVDREAWYLGVAKASIVAEGENGGGGSLCKSESGGHLYVPPSVDEIRAFRISGAKFVKLNKGTWHAGPLFRASARDFYNLELTDTNVVDHTSYNFGEENGVLFHIED
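
The protein sequence above can be reproduced from the coding sequence by codon-by-codon translation. The sketch below assumes the standard genetic code and uses a hypothetical helper named `translate`; only the first 30 bases of the CDS are used in the example to keep it short.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order (assumed here).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the polypeptide
            break
        residues.append(aa)
    return "".join(residues)


# First ten codons of the CDS listed above.
print(translate("ATGGAAAAGACAATAATGAAGCTGAAAGCC"))  # MEKTIMKLKA
```

Passing the full CDS string to the same function should give the complete protein, since the CDS ends in the stop codon TAG.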
Homology
BLAST of Cla97C09G172620 vs. NCBI nr
Match: XP_038896769.1 (uncharacterized protein LOC120085023 [Benincasa hispida])

HSP 1 Score: 312.0 bits (798), Expect = 3.4e-81
Identity = 152/183 (83.06%), Positives = 163/183 (89.07%), Query Frame = 0

Query: 1   MEKTIMKLKAIEATAENFAEYGQLIGATADGVEFGDEDAQLDLNNGIPRLYILHIENRAL 60
           ME+TI+KLKAIEATAE+FAEYGQ+I AT DG EFG EDAQLDL+NGIPRLYILHIENR  
Sbjct: 1   MERTILKLKAIEATAESFAEYGQVIEATDDGAEFGGEDAQLDLSNGIPRLYILHIENRPF 60

Query: 61  EFSKITHHARVTQCLGSVDREAWYLGVAKASIVAEGENGGGGSLCKSESGGHLYVPPSVD 120
           EFSKITHHARVTQCLGSVDREAWYLGVAKASIV E E   GG       GGHLYV P+V+
Sbjct: 61  EFSKITHHARVTQCLGSVDREAWYLGVAKASIVEEEEEMNGGGRSFRSGGGHLYVAPNVE 120

Query: 121 EIRAFRISGAKFVKLNKGTWHAGPLFRASARDFYNLELTDTNVVDHTSYNFGEENGVLFH 180
           EIRAFRISGAKFVKLNKGTWHAGPLF+ASARDFYNLELTDTN+VDHT Y+FGEE+GVLFH
Sbjct: 121 EIRAFRISGAKFVKLNKGTWHAGPLFKASARDFYNLELTDTNIVDHTCYSFGEEDGVLFH 180

Query: 181 IED 184
           IED
Sbjct: 181 IED 183
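
The Identity figure in each HSP header is the number of identical aligned positions divided by the alignment length. The sketch below illustrates that calculation with a hypothetical helper, `percent_identity`, applied to the first ten aligned residues of the hit above. It does not compute the Positives value, which also depends on a substitution matrix.

```python
from typing import Tuple


def percent_identity(query: str, subject: str) -> Tuple[int, int, float]:
    """Return (matches, alignment length, percent identity) for two aligned rows.

    Both rows must have the same length; gap characters ('-') never count
    as matches.
    """
    if len(query) != len(subject):
        raise ValueError("aligned rows must have equal length")
    if not query:
        raise ValueError("alignment is empty")
    matches = sum(1 for q, s in zip(query, subject) if q == s and q != "-")
    length = len(query)
    return matches, length, round(100.0 * matches / length, 2)


# First ten aligned residues of the query and the Benincasa hispida subject.
print(percent_identity("MEKTIMKLKA", "MERTILKLKA"))  # (8, 10, 80.0)

# The header value for the whole HSP follows the same arithmetic.
print(round(100.0 * 152 / 183, 2))  # 83.06
```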

BLAST of Cla97C09G172620 vs. NCBI nr
Match: XP_008461494.1 (PREDICTED: uncharacterized protein LOC103500077 [Cucumis melo] >TYK04349.1 Ureidoglycolate hydrolase [Cucumis melo var. makuwa])

HSP 1 Score: 304.7 bits (779), Expect = 5.5e-79
Identity = 150/184 (81.52%), Positives = 158/184 (85.87%), Query Frame = 0

Query: 4   TIMKLKAIEATAENFAEYGQLIGATADGVEFGDEDAQLDLNNGIPRLYILHIENRALEFS 63
           TIMKLKAIEAT E+FAEYGQ+I AT DG EFG +DAQLDL NGIPR YILHIENR  EFS
Sbjct: 12  TIMKLKAIEATPESFAEYGQVIEATGDGAEFGSQDAQLDLTNGIPRFYILHIENRPFEFS 71

Query: 64  KITHHARVTQCLGSVDREAWYLGVAKASIV----AEGENGGGGSLCKSESGGHLYVPPSV 123
           KITHHARVTQCLGSVDREAWYLGVAKASIV      G  GGGG   +SE GGHLYV P+V
Sbjct: 72  KITHHARVTQCLGSVDREAWYLGVAKASIVEGEEINGGGGGGGRNLRSERGGHLYVAPNV 131

Query: 124 DEIRAFRISGAKFVKLNKGTWHAGPLFRASARDFYNLELTDTNVVDHTSYNFGEENGVLF 183
           DEIRAFRISGAKFVKLNKGTWHAGPLFR +ARDFYNLELTDTN+VDHT YN GEEN V+F
Sbjct: 132 DEIRAFRISGAKFVKLNKGTWHAGPLFRENARDFYNLELTDTNIVDHTCYNIGEENRVVF 191

BLAST of Cla97C09G172620 vs. NCBI nr
Match: XP_004139672.1 (uncharacterized protein LOC101212947 [Cucumis sativus] >KGN44503.1 hypothetical protein Csa_015731 [Cucumis sativus])

HSP 1 Score: 301.6 bits (771), Expect = 4.6e-78
Identity = 149/182 (81.87%), Positives = 159/182 (87.36%), Query Frame = 0

Query: 4   TIMKLKAIEATAENFAEYGQLIGATADGVEFGDEDAQLDLNNGIPRLYILHIENRALEFS 63
           TIM LKAIEATAE+FAEYGQ+I AT D  EFG+EDAQLDL NGIPR YILHIENR  EFS
Sbjct: 9   TIMNLKAIEATAESFAEYGQVIQATDDRAEFGNEDAQLDLTNGIPRFYILHIENRPFEFS 68

Query: 64  KITHHARVTQCLGSVDREAWYLGVAKASIVA--EGENGGGGSLCKSESGGHLYVPPSVDE 123
           KITHHARVTQCLGSVDREAWYLGVAKASIV   E   GGGG   +SESGGHLYV P+VDE
Sbjct: 69  KITHHARVTQCLGSVDREAWYLGVAKASIVEGDEVNGGGGGRKLRSESGGHLYVAPNVDE 128

Query: 124 IRAFRISGAKFVKLNKGTWHAGPLFRASARDFYNLELTDTNVVDHTSYNFGEENGVLFHI 183
           IRAF+ISGAKFVKLNKGTWHAGPLFR +ARDFYNLELT+TN+VDHT YN GEEN V+FHI
Sbjct: 129 IRAFKISGAKFVKLNKGTWHAGPLFRENARDFYNLELTNTNIVDHTCYNIGEENRVVFHI 188

BLAST of Cla97C09G172620 vs. NCBI nr
Match: XP_022926175.1 (uncharacterized protein LOC111433372 [Cucurbita moschata])

HSP 1 Score: 261.2 bits (666), Expect = 6.9e-66
Identity = 131/183 (71.58%), Positives = 146/183 (79.78%), Query Frame = 0

Query: 6   MKLKAIEATAENFAEYGQLIGATADGVEFGDEDAQLDLNNGIPRLYILHIENRALEFSKI 65
           MKLKAIEAT E+FAEYGQ+I  T DG+ FG +DAQLDL NG PR YILHIENR   FS I
Sbjct: 1   MKLKAIEATPESFAEYGQVIEPTDDGLGFGPDDAQLDLTNGTPRFYILHIENRPFNFSMI 60

Query: 66  THHARVTQCLGSVDREAWYLGVAKASIVAEGENGGGGSLCKSES-----GGHLYVPPSVD 125
           THHARVTQCLGSVDR+ WYL VAK SIV +    G   + KSES      GHL+VPP VD
Sbjct: 61  THHARVTQCLGSVDRQPWYLAVAKPSIVDDEHKTG---IDKSESVLRSKSGHLFVPPCVD 120

Query: 126 EIRAFRISGAKFVKLNKGTWHAGPLFRASARDFYNLELTDTNVVDHTSYNFGEENGVLFH 184
           +I+ F+ISGAKFVKLNKGTWHAGPLFR SARDFYNLELT+TNVVDHT+Y+ G+ENGV F 
Sbjct: 121 DIKVFKISGAKFVKLNKGTWHAGPLFRESARDFYNLELTNTNVVDHTTYDLGKENGVPFE 180

BLAST of Cla97C09G172620 vs. NCBI nr
Match: XP_022981624.1 (uncharacterized protein LOC111480690 [Cucurbita maxima])

HSP 1 Score: 260.8 bits (665), Expect = 9.0e-66
Identity = 130/183 (71.04%), Positives = 147/183 (80.33%), Query Frame = 0

Query: 6   MKLKAIEATAENFAEYGQLIGATADGVEFGDEDAQLDLNNGIPRLYILHIENRALEFSKI 65
           MKLKA+EAT E+FAEYGQ+I  T DG+ FG +DAQLDL+NG PR YILHIENR   FS I
Sbjct: 1   MKLKAMEATPESFAEYGQVIEPTDDGLGFGPDDAQLDLSNGTPRFYILHIENRPFNFSMI 60

Query: 66  THHARVTQCLGSVDREAWYLGVAKASIVAEGENGGGGSLCKSES-----GGHLYVPPSVD 125
           THHARVTQCLGSVDR+ WYL VAK SIV +    G   + KSES      GHL++PP VD
Sbjct: 61  THHARVTQCLGSVDRQPWYLAVAKPSIVDDEHKTG---IDKSESVLRSKSGHLFLPPCVD 120

Query: 126 EIRAFRISGAKFVKLNKGTWHAGPLFRASARDFYNLELTDTNVVDHTSYNFGEENGVLFH 184
           EI+ F+ISGAKFVKLNKGTWHAGPLFR SARDFYNLELT+TNVVDHT+Y+ G+ENGV F 
Sbjct: 121 EIKVFKISGAKFVKLNKGTWHAGPLFRESARDFYNLELTNTNVVDHTTYDLGKENGVSFE 180

BLAST of Cla97C09G172620 vs. ExPASy TrEMBL
Match: A0A5D3C1U6 (Ureidoglycolate hydrolase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold675G00120 PE=4 SV=1)

HSP 1 Score: 304.7 bits (779), Expect = 2.6e-79
Identity = 150/184 (81.52%), Positives = 158/184 (85.87%), Query Frame = 0

Query: 4   TIMKLKAIEATAENFAEYGQLIGATADGVEFGDEDAQLDLNNGIPRLYILHIENRALEFS 63
           TIMKLKAIEAT E+FAEYGQ+I AT DG EFG +DAQLDL NGIPR YILHIENR  EFS
Sbjct: 12  TIMKLKAIEATPESFAEYGQVIEATGDGAEFGSQDAQLDLTNGIPRFYILHIENRPFEFS 71

Query: 64  KITHHARVTQCLGSVDREAWYLGVAKASIV----AEGENGGGGSLCKSESGGHLYVPPSV 123
           KITHHARVTQCLGSVDREAWYLGVAKASIV      G  GGGG   +SE GGHLYV P+V
Sbjct: 72  KITHHARVTQCLGSVDREAWYLGVAKASIVEGEEINGGGGGGGRNLRSERGGHLYVAPNV 131

Query: 124 DEIRAFRISGAKFVKLNKGTWHAGPLFRASARDFYNLELTDTNVVDHTSYNFGEENGVLF 183
           DEIRAFRISGAKFVKLNKGTWHAGPLFR +ARDFYNLELTDTN+VDHT YN GEEN V+F
Sbjct: 132 DEIRAFRISGAKFVKLNKGTWHAGPLFRENARDFYNLELTDTNIVDHTCYNIGEENRVVF 191

BLAST of Cla97C09G172620 vs. ExPASy TrEMBL
Match: A0A1S3CEV7 (uncharacterized protein LOC103500077 OS=Cucumis melo OX=3656 GN=LOC103500077 PE=4 SV=1)

HSP 1 Score: 304.7 bits (779), Expect = 2.6e-79
Identity = 150/184 (81.52%), Positives = 158/184 (85.87%), Query Frame = 0

Query: 4   TIMKLKAIEATAENFAEYGQLIGATADGVEFGDEDAQLDLNNGIPRLYILHIENRALEFS 63
           TIMKLKAIEAT E+FAEYGQ+I AT DG EFG +DAQLDL NGIPR YILHIENR  EFS
Sbjct: 12  TIMKLKAIEATPESFAEYGQVIEATGDGAEFGSQDAQLDLTNGIPRFYILHIENRPFEFS 71

Query: 64  KITHHARVTQCLGSVDREAWYLGVAKASIV----AEGENGGGGSLCKSESGGHLYVPPSV 123
           KITHHARVTQCLGSVDREAWYLGVAKASIV      G  GGGG   +SE GGHLYV P+V
Sbjct: 72  KITHHARVTQCLGSVDREAWYLGVAKASIVEGEEINGGGGGGGRNLRSERGGHLYVAPNV 131

Query: 124 DEIRAFRISGAKFVKLNKGTWHAGPLFRASARDFYNLELTDTNVVDHTSYNFGEENGVLF 183
           DEIRAFRISGAKFVKLNKGTWHAGPLFR +ARDFYNLELTDTN+VDHT YN GEEN V+F
Sbjct: 132 DEIRAFRISGAKFVKLNKGTWHAGPLFRENARDFYNLELTDTNIVDHTCYNIGEENRVVF 191

BLAST of Cla97C09G172620 vs. ExPASy TrEMBL
Match: A0A0A0K4Z5 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G320020 PE=4 SV=1)

HSP 1 Score: 301.6 bits (771), Expect = 2.2e-78
Identity = 149/182 (81.87%), Positives = 159/182 (87.36%), Query Frame = 0

Query: 4   TIMKLKAIEATAENFAEYGQLIGATADGVEFGDEDAQLDLNNGIPRLYILHIENRALEFS 63
           TIM LKAIEATAE+FAEYGQ+I AT D  EFG+EDAQLDL NGIPR YILHIENR  EFS
Sbjct: 9   TIMNLKAIEATAESFAEYGQVIQATDDRAEFGNEDAQLDLTNGIPRFYILHIENRPFEFS 68

Query: 64  KITHHARVTQCLGSVDREAWYLGVAKASIVA--EGENGGGGSLCKSESGGHLYVPPSVDE 123
           KITHHARVTQCLGSVDREAWYLGVAKASIV   E   GGGG   +SESGGHLYV P+VDE
Sbjct: 69  KITHHARVTQCLGSVDREAWYLGVAKASIVEGDEVNGGGGGRKLRSESGGHLYVAPNVDE 128

Query: 124 IRAFRISGAKFVKLNKGTWHAGPLFRASARDFYNLELTDTNVVDHTSYNFGEENGVLFHI 183
           IRAF+ISGAKFVKLNKGTWHAGPLFR +ARDFYNLELT+TN+VDHT YN GEEN V+FHI
Sbjct: 129 IRAFKISGAKFVKLNKGTWHAGPLFRENARDFYNLELTNTNIVDHTCYNIGEENRVVFHI 188

BLAST of Cla97C09G172620 vs. ExPASy TrEMBL
Match: A0A6J1EK98 (uncharacterized protein LOC111433372 OS=Cucurbita moschata OX=3662 GN=LOC111433372 PE=4 SV=1)

HSP 1 Score: 261.2 bits (666), Expect = 3.3e-66
Identity = 131/183 (71.58%), Positives = 146/183 (79.78%), Query Frame = 0

Query: 6   MKLKAIEATAENFAEYGQLIGATADGVEFGDEDAQLDLNNGIPRLYILHIENRALEFSKI 65
           MKLKAIEAT E+FAEYGQ+I  T DG+ FG +DAQLDL NG PR YILHIENR   FS I
Sbjct: 1   MKLKAIEATPESFAEYGQVIEPTDDGLGFGPDDAQLDLTNGTPRFYILHIENRPFNFSMI 60

Query: 66  THHARVTQCLGSVDREAWYLGVAKASIVAEGENGGGGSLCKSES-----GGHLYVPPSVD 125
           THHARVTQCLGSVDR+ WYL VAK SIV +    G   + KSES      GHL+VPP VD
Sbjct: 61  THHARVTQCLGSVDRQPWYLAVAKPSIVDDEHKTG---IDKSESVLRSKSGHLFVPPCVD 120

Query: 126 EIRAFRISGAKFVKLNKGTWHAGPLFRASARDFYNLELTDTNVVDHTSYNFGEENGVLFH 184
           +I+ F+ISGAKFVKLNKGTWHAGPLFR SARDFYNLELT+TNVVDHT+Y+ G+ENGV F 
Sbjct: 121 DIKVFKISGAKFVKLNKGTWHAGPLFRESARDFYNLELTNTNVVDHTTYDLGKENGVPFE 180

BLAST of Cla97C09G172620 vs. ExPASy TrEMBL
Match: A0A6J1J2D4 (uncharacterized protein LOC111480690 OS=Cucurbita maxima OX=3661 GN=LOC111480690 PE=4 SV=1)

HSP 1 Score: 260.8 bits (665), Expect = 4.4e-66
Identity = 130/183 (71.04%), Positives = 147/183 (80.33%), Query Frame = 0

Query: 6   MKLKAIEATAENFAEYGQLIGATADGVEFGDEDAQLDLNNGIPRLYILHIENRALEFSKI 65
           MKLKA+EAT E+FAEYGQ+I  T DG+ FG +DAQLDL+NG PR YILHIENR   FS I
Sbjct: 1   MKLKAMEATPESFAEYGQVIEPTDDGLGFGPDDAQLDLSNGTPRFYILHIENRPFNFSMI 60

Query: 66  THHARVTQCLGSVDREAWYLGVAKASIVAEGENGGGGSLCKSES-----GGHLYVPPSVD 125
           THHARVTQCLGSVDR+ WYL VAK SIV +    G   + KSES      GHL++PP VD
Sbjct: 61  THHARVTQCLGSVDRQPWYLAVAKPSIVDDEHKTG---IDKSESVLRSKSGHLFLPPCVD 120

Query: 126 EIRAFRISGAKFVKLNKGTWHAGPLFRASARDFYNLELTDTNVVDHTSYNFGEENGVLFH 184
           EI+ F+ISGAKFVKLNKGTWHAGPLFR SARDFYNLELT+TNVVDHT+Y+ G+ENGV F 
Sbjct: 121 EIKVFKISGAKFVKLNKGTWHAGPLFRESARDFYNLELTNTNVVDHTTYDLGKENGVSFE 180

BLAST of Cla97C09G172620 vs. TAIR 10
Match: AT2G35810.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G35830.2); Has 153 Blast hits to 153 proteins in 52 species: Archae - 0; Bacteria - 62; Metazoa - 0; Fungi - 0; Plants - 82; Viruses - 0; Other Eukaryotes - 9 (source: NCBI BLink). )

HSP 1 Score: 217.6 bits (553), Expect = 8.2e-57
Identity = 105/178 (58.99%), Positives = 134/178 (75.28%), Query Frame = 0

Query: 6   MKLKAIEATAENFAEYGQLIGATADGVEFGDEDAQLDLNNGIPRLYILHIENRALEFSKI 65
           + L  IEAT E FAEYGQ+I A+ DG  +G  DAQLDL+ GIPRLYIL ++   L F KI
Sbjct: 22  VNLIPIEATPETFAEYGQVIEASRDGAGYGPNDAQLDLSKGIPRLYILRLKETPLGFFKI 81

Query: 66  THHARVTQCLGSVDREAWYLGVAKASIVAEGENGGGGSLCKSESGGHLYVPPSVDEIRAF 125
           THHA+VTQCLGS+  + WY+GVAK S++ + ++G      K++S GHLY+PP V+EIR F
Sbjct: 82  THHAKVTQCLGSIGGDIWYMGVAKPSLIEDDDDGRRVDTVKAKS-GHLYIPPEVEEIRVF 141

Query: 126 RISGAKFVKLNKGTWHAGPLFRASA-RDFYNLELTDTNVVDHTSYNFGEENGVLFHIE 183
           R SG KFVKL++GTWHAGPLF  S+  DFYNLEL++TNVVDHTS++F + NGV F  +
Sbjct: 142 RFSGPKFVKLHRGTWHAGPLFSGSSIMDFYNLELSNTNVVDHTSHDFTKNNGVSFRFD 198

BLAST of Cla97C09G172620 vs. TAIR 10
Match: AT2G35830.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G35810.1). )

HSP 1 Score: 217.6 bits (553), Expect = 8.2e-57
Identity = 104/175 (59.43%), Positives = 129/175 (73.71%), Query Frame = 0

Query: 6   MKLKAIEATAENFAEYGQLIGATADGVEFGDEDAQLDLNNGIPRLYILHIENRALEFSKI 65
           + L  IEAT ENFAEYGQ+I A+ DG  FG  DAQLDL+ G PRLYIL ++   L F KI
Sbjct: 8   VNLIPIEATPENFAEYGQVIEASRDGAGFGPHDAQLDLSRGTPRLYILRLKETPLGFFKI 67

Query: 66  THHARVTQCLGSVDREAWYLGVAKASIVAEGENGGGGSLCKSESGGHLYVPPSVDEIRAF 125
           THHA+VTQCLGS+  + WY+GVAK S++ + ++ G          GHLY+PP V+EIR F
Sbjct: 68  THHAKVTQCLGSIGGDVWYMGVAKPSLIEDDDDDGRSVDTVKSKSGHLYIPPEVEEIRVF 127

Query: 126 RISGAKFVKLNKGTWHAGPLFRASA-RDFYNLELTDTNVVDHTSYNFGEENGVLF 180
           R SG KFVKL++GTWHAGPLF  S+  DFYNLEL++TNVVDHTS++F + NGV F
Sbjct: 128 RFSGPKFVKLHRGTWHAGPLFSGSSFMDFYNLELSNTNVVDHTSHDFTKNNGVSF 182

BLAST of Cla97C09G172620 vs. TAIR 10
Match: AT2G35820.1 (ureidoglycolate hydrolases )

HSP 1 Score: 214.2 bits (544), Expect = 9.0e-56
Identity = 101/177 (57.06%), Positives = 128/177 (72.32%), Query Frame = 0

Query: 6   MKLKAIEATAENFAEYGQLIGATADGVEFGDEDAQLDLNNGIPRLYILHIENRALEFSKI 65
           +KL  IEAT ENFA+YGQ+I A+ DG  FG  DAQLDL+ GIPR YI+ I +   +FS +
Sbjct: 8   VKLIPIEATPENFADYGQVIEASRDGAGFGPNDAQLDLSRGIPRFYIMRIRDTPFDFSVL 67

Query: 66  THHARVTQCLGSVDREAWYLGVAKASIVAEGENGGGGSLCKSESGGHLYVPPSVDEIRAF 125
           THHA VTQCLGS+    WYLGVAK +++ +G++G      KS S GHLY PP+V+EIR F
Sbjct: 68  THHASVTQCLGSIGGHVWYLGVAKPTLIEDGDDGKMVDKLKSRS-GHLYAPPAVEEIRVF 127

Query: 126 RISGAKFVKLNKGTWHAGPLFRASARDFYNLELTDTNVVDHTSYNFGEENGVLFHIE 183
           R+SG KF+KLN GTWH GPLF  S  DFYNLEL++TN VD T+Y+F +  GV   ++
Sbjct: 128 RVSGPKFIKLNHGTWHVGPLFSDSYMDFYNLELSNTNAVDRTTYDFIKNKGVTIRVD 183

BLAST of Cla97C09G172620 vs. TAIR 10
Match: AT2G35830.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G35810.1); Has 155 Blast hits to 155 proteins in 54 species: Archae - 0; Bacteria - 66; Metazoa - 0; Fungi - 0; Plants - 82; Viruses - 0; Other Eukaryotes - 7 (source: NCBI BLink). )

HSP 1 Score: 204.9 bits (520), Expect = 5.5e-53
Identity = 101/175 (57.71%), Positives = 125/175 (71.43%), Query Frame = 0

Query: 6   MKLKAIEATAENFAEYGQLIGATADGVEFGDEDAQLDLNNGIPRLYILHIENRALEFSKI 65
           + L  IEAT ENFAEYGQ+I A+ DG  FG  DAQLDL+ G PRL     +   L F KI
Sbjct: 8   VNLIPIEATPENFAEYGQVIEASRDGAGFGPHDAQLDLSRGTPRL-----KETPLGFFKI 67

Query: 66  THHARVTQCLGSVDREAWYLGVAKASIVAEGENGGGGSLCKSESGGHLYVPPSVDEIRAF 125
           THHA+VTQCLGS+  + WY+GVAK S++ + ++ G          GHLY+PP V+EIR F
Sbjct: 68  THHAKVTQCLGSIGGDVWYMGVAKPSLIEDDDDDGRSVDTVKSKSGHLYIPPEVEEIRVF 127

Query: 126 RISGAKFVKLNKGTWHAGPLFRASA-RDFYNLELTDTNVVDHTSYNFGEENGVLF 180
           R SG KFVKL++GTWHAGPLF  S+  DFYNLEL++TNVVDHTS++F + NGV F
Sbjct: 128 RFSGPKFVKLHRGTWHAGPLFSGSSFMDFYNLELSNTNVVDHTSHDFTKNNGVSF 177

The following BLAST results are available for this feature:
NCBI nr
Match Name      E-value  Identity (%)  Description
XP_038896769.1  3.4e-81  83.06         uncharacterized protein LOC120085023 [Benincasa hispida]
XP_008461494.1  5.5e-79  81.52         PREDICTED: uncharacterized protein LOC103500077 [Cucumis melo] >TYK04349.1 Ureid...
XP_004139672.1  4.6e-78  81.87         uncharacterized protein LOC101212947 [Cucumis sativus] >KGN44503.1 hypothetical ...
XP_022926175.1  6.9e-66  71.58         uncharacterized protein LOC111433372 [Cucurbita moschata]
XP_022981624.1  9.0e-66  71.04         uncharacterized protein LOC111480690 [Cucurbita maxima]

ExPASy TrEMBL
Match Name      E-value  Identity (%)  Description
A0A5D3C1U6      2.6e-79  81.52         Ureidoglycolate hydrolase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffo...
A0A1S3CEV7      2.6e-79  81.52         uncharacterized protein LOC103500077 OS=Cucumis melo OX=3656 GN=LOC103500077 PE=...
A0A0A0K4Z5      2.2e-78  81.87         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G320020 PE=4 SV=1
A0A6J1EK98      3.3e-66  71.58         uncharacterized protein LOC111433372 OS=Cucurbita moschata OX=3662 GN=LOC1114333...
A0A6J1J2D4      4.4e-66  71.04         uncharacterized protein LOC111480690 OS=Cucurbita maxima OX=3661 GN=LOC111480690...

TAIR 10
Match Name      E-value  Identity (%)  Description
AT2G35810.1     8.2e-57  58.99         unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT2G35830.2     8.2e-57  59.43         unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT2G35820.1     9.0e-56  57.06         ureidoglycolate hydrolases
AT2G35830.1     5.5e-53  57.71         unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
InterPro
Analysis Name: InterPro Annotations of Watermelon (97103) v2.5
Date Performed: 2022-01-31
IPR Term   IPR Description                          Source       Source Term   Source Description         Alignment
IPR024060  Ureidoglycolate lyase domain superfamily  GENE3D       2.60.120.480  Ureidoglycolate hydrolase  coord: 6..182; e-value: 5.6E-24; score: 86.7
None       No IPR available                          PANTHER      PTHR35721     UREIDOGLYCOLATE HYDROLASE  coord: 3..183
IPR011051  RmlC-like cupin domain superfamily        SUPERFAMILY  51182         RmlC-like cupins           coord: 5..181

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cla97C09G172620.1  Cla97C09G172620.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
molecular_function  GO:0004848      ureidoglycolate hydrolase activity