Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exon, five_prime_UTR, CDS, polypeptide, three_prime_UTR
CTATCGTTGCCACGTGGCTTCGTCCTCTTTATATATAGTACCAGATCCCATCCGACACTTAAATCAAATCCCGAATTAGGGTTTCTCATTCTTCATTATCGTCATGGATGCAGCAGAACATCACAATCCCCAACCAAGTTTCTTCCATCAAATCCTCCCTCCCCGTCTCGAAGACGCCGGCCTCGAGGATTCTGCCCTTCCTCCCGATTCCATTCGCGAAGCCTTCTTCAAGGCCGCCTCCGCCGTCAAATCCAGGGCCACCGCCCGTCTTTCACACTCCGACGACGAGGATGATGATGTTCCATGCTCCCCTACTTCCGCACTACCAACCGACGAGGACGCTCCGGCGATTTGCGCGACGAAGAAGGGATTGGAATTGCCCGAATTCGGTAAAGACGAGGTGGTTATTGGGGGAATGGAGGAAAGGAGAGGGAAGGGTTGTGTGGTAGATGGATTGGAAGGGTTGGAGATTGGCGATGATGCCGAGAAGGAGAGCGGGAAAGAGGAGAAGAAGAAACCTGTCTTAGGCGAAGGTTTCGCTTGAAATTGTATAGATGAGTTTAGGTCATAGTTAGAATGGGGGAGGGCTGTTGTTCTTGAAAAGTTATTGTTTTCACGTTGTTTGTGAATAGAGATGAGTTCTTGATTTTATTTTTCACCAAAAGTATTTTTATTTATTGATCGTAGAGCATATATAGGTTTTAATTTAAACTCGAAACTCCAATGGTTCTTTCAACATTCCTAATCACCT
mRNA sequence
CTATCGTTGCCACGTGGCTTCGTCCTCTTTATATATAGTACCAGATCCCATCCGACACTTAAATCAAATCCCGAATTAGGGTTTCTCATTCTTCATTATCGTCATGGATGCAGCAGAACATCACAATCCCCAACCAAGTTTCTTCCATCAAATCCTCCCTCCCCGTCTCGAAGACGCCGGCCTCGAGGATTCTGCCCTTCCTCCCGATTCCATTCGCGAAGCCTTCTTCAAGGCCGCCTCCGCCGTCAAATCCAGGGCCACCGCCCGTCTTTCACACTCCGACGACGAGGATGATGATGTTCCATGCTCCCCTACTTCCGCACTACCAACCGACGAGGACGCTCCGGCGATTTGCGCGACGAAGAAGGGATTGGAATTGCCCGAATTCGGTAAAGACGAGGTGGTTATTGGGGGAATGGAGGAAAGGAGAGGGAAGGGTTGTGTGGTAGATGGATTGGAAGGGTTGGAGATTGGCGATGATGCCGAGAAGGAGAGCGGGAAAGAGGAGAAGAAGAAACCTGTCTTAGGCGAAGGTTTCGCTTGAAATTGTATAGATGAGTTTAGGTCATAGTTAGAATGGGGGAGGGCTGTTGTTCTTGAAAAGTTATTGTTTTCACGTTGTTTGTGAATAGAGATGAGTTCTTGATTTTATTTTTCACCAAAAGTATTTTTATTTATTGATCGTAGAGCATATATAGGTTTTAATTTAAACTCGAAACTCCAATGGTTCTTTCAACATTCCTAATCACCT
Coding sequence (CDS)
ATGGATGCAGCAGAACATCACAATCCCCAACCAAGTTTCTTCCATCAAATCCTCCCTCCCCGTCTCGAAGACGCCGGCCTCGAGGATTCTGCCCTTCCTCCCGATTCCATTCGCGAAGCCTTCTTCAAGGCCGCCTCCGCCGTCAAATCCAGGGCCACCGCCCGTCTTTCACACTCCGACGACGAGGATGATGATGTTCCATGCTCCCCTACTTCCGCACTACCAACCGACGAGGACGCTCCGGCGATTTGCGCGACGAAGAAGGGATTGGAATTGCCCGAATTCGGTAAAGACGAGGTGGTTATTGGGGGAATGGAGGAAAGGAGAGGGAAGGGTTGTGTGGTAGATGGATTGGAAGGGTTGGAGATTGGCGATGATGCCGAGAAGGAGAGCGGGAAAGAGGAGAAGAAGAAACCTGTCTTAGGCGAAGGTTTCGCTTGA
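Because the mRNA is already spliced, the CDS should appear in it as one contiguous substring, with the 5' UTR before it and the 3' UTR after it. The following is a minimal Python sketch of that split, not part of the database itself; the function name `split_utrs` is hypothetical, and the demo input is a toy string rather than the sequences listed here.

```python
def split_utrs(mrna: str, cds: str) -> tuple[str, str, str]:
    """Return (five_prime_UTR, CDS, three_prime_UTR) for a spliced mRNA."""
    mrna, cds = mrna.upper(), cds.upper()
    start = mrna.find(cds)  # index of the first occurrence of the full CDS
    if start == -1:
        raise ValueError("CDS is not a contiguous substring of the mRNA")
    end = start + len(cds)
    return mrna[:start], mrna[start:end], mrna[end:]


# Toy input (an assumption for illustration): 2-base UTRs around a 9-base CDS.
five, coding, three = split_utrs("CCATGAAATGAGG", "ATGAAATGA")
print(len(five), len(coding), len(three))
```

Passing the mRNA and CDS strings from this page to the same function would give the UTR lengths for this feature.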
Protein sequence
MDAAEHHNPQPSFFHQILPPRLEDAGLEDSALPPDSIREAFFKAASAVKSRATARLSHSDDEDDDVPCSPTSALPTDEDAPAICATKKGLELPEFGKDEVVIGGMEERRGKGCVVDGLEGLEIGDDAEKESGKEEKKKPVLGEGFA*
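The protein sequence corresponds to the CDS read three bases at a time, with `*` marking the stop codon. As a rough sketch of that relationship, assuming the standard genetic code, the translation can be reproduced in Python; `translate` and the table-building code are illustrative names, not something the database provides.

```python
from itertools import product

# Standard genetic code, with codons ordered by base T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence; unknown codons become 'X'."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, len(cds), 3)
    )


# First eight codons and last four codons of the CDS listed above.
print(translate("ATGGATGCAGCAGAACATCACAAT"))  # MDAAEHHN
print(translate("GGTTTCGCTTGA"))              # GFA*
```

Translating the full CDS the same way should yield a 146-residue protein followed by `*`.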
Homology
BLAST of CsGy3G023020 vs. NCBI nr
Match:
XP_011651498.1 (uncharacterized protein LOC105434906 [Cucumis sativus])
HSP 1 Score: 290 bits (742), Expect = 9.19e-99
Identity = 146/146 (100.00%), Positives = 146/146 (100.00%), Query Frame = 0
Query: 1 MDAAEHHNPQPSFFHQILPPRLEDAGLEDSALPPDSIREAFFKAASAVKSRATARLSHSD 60
MDAAEHHNPQPSFFHQILPPRLEDAGLEDSALPPDSIREAFFKAASAVKSRATARLSHSD
Sbjct: 1 MDAAEHHNPQPSFFHQILPPRLEDAGLEDSALPPDSIREAFFKAASAVKSRATARLSHSD 60
Query: 61 DEDDDVPCSPTSALPTDEDAPAICATKKGLELPEFGKDEVVIGGMEERRGKGCVVDGLEG 120
DEDDDVPCSPTSALPTDEDAPAICATKKGLELPEFGKDEVVIGGMEERRGKGCVVDGLEG
Sbjct: 61 DEDDDVPCSPTSALPTDEDAPAICATKKGLELPEFGKDEVVIGGMEERRGKGCVVDGLEG 120
Query: 121 LEIGDDAEKESGKEEKKKPVLGEGFA 146
LEIGDDAEKESGKEEKKKPVLGEGFA
Sbjct: 121 LEIGDDAEKESGKEEKKKPVLGEGFA 146
BLAST of CsGy3G023020 vs. NCBI nr
Match:
KAE8650731.1 (hypothetical protein Csa_023393 [Cucumis sativus])
HSP 1 Score: 284 bits (726), Expect = 2.37e-96
Identity = 143/143 (100.00%), Positives = 143/143 (100.00%), Query Frame = 0
Query: 1 MDAAEHHNPQPSFFHQILPPRLEDAGLEDSALPPDSIREAFFKAASAVKSRATARLSHSD 60
MDAAEHHNPQPSFFHQILPPRLEDAGLEDSALPPDSIREAFFKAASAVKSRATARLSHSD
Sbjct: 1 MDAAEHHNPQPSFFHQILPPRLEDAGLEDSALPPDSIREAFFKAASAVKSRATARLSHSD 60
Query: 61 DEDDDVPCSPTSALPTDEDAPAICATKKGLELPEFGKDEVVIGGMEERRGKGCVVDGLEG 120
DEDDDVPCSPTSALPTDEDAPAICATKKGLELPEFGKDEVVIGGMEERRGKGCVVDGLEG
Sbjct: 61 DEDDDVPCSPTSALPTDEDAPAICATKKGLELPEFGKDEVVIGGMEERRGKGCVVDGLEG 120
Query: 121 LEIGDDAEKESGKEEKKKPVLGE 143
LEIGDDAEKESGKEEKKKPVLGE
Sbjct: 121 LEIGDDAEKESGKEEKKKPVLGE 143
BLAST of CsGy3G023020 vs. NCBI nr
Match:
XP_008447517.1 (PREDICTED: uncharacterized protein LOC103489947 [Cucumis melo] >KAA0050786.1 uncharacterized protein E6C27_scaffold404G00370 [Cucumis melo var. makuwa] >TYK08560.1 uncharacterized protein E5676_scaffold323G001030 [Cucumis melo var. makuwa])
HSP 1 Score: 233 bits (593), Expect = 6.26e-76
Identity = 128/156 (82.05%), Positives = 135/156 (86.54%), Query Frame = 0
Query: 1 MDAAEHHNPQPSFFHQILPPRLEDAGLEDSALPPDSIREAFFKAASAVKSRATARLSHSD 60
MDAAEHHNPQP FF QILPPRLEDAGLED ALPPDSIREAFFKAASAVKSRATA LS D
Sbjct: 1 MDAAEHHNPQPPFFDQILPPRLEDAGLEDPALPPDSIREAFFKAASAVKSRATALLSPDD 60
Query: 61 DEDDDVPCSPTSALPTD--------EDAPAICATKKGLELPEFGKDEVVIGGMEERRGKG 120
D++DD P SPTS LPTD EDAPAICAT+KGL+LPEFGKDEVVIGGMEERRGK
Sbjct: 61 DDEDD-PWSPTSTLPTDIVTGILPDEDAPAICATRKGLKLPEFGKDEVVIGGMEERRGKA 120
Query: 121 CVVDGLEGLEIGDDAEKE--SGKEEKKKPVLGEGFA 146
CVVDGLEGLEIGD+AEKE SGKEE+K P+LGEGFA
Sbjct: 121 CVVDGLEGLEIGDEAEKEKKSGKEEEK-PILGEGFA 154
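The Identity figure in each HSP header is the number of identical columns divided by the alignment length, with gap columns counted in the length. The sketch below recomputes that ratio for a pair of aligned strings; `alignment_stats` is a hypothetical helper name, and the Positives figure is not modeled because it depends on the scoring matrix used by BLAST.

```python
def alignment_stats(query: str, sbjct: str) -> dict:
    """Count identical and gapped columns in two gapped, equal-length strings."""
    if len(query) != len(sbjct):
        raise ValueError("aligned strings must have the same length")
    length = len(query)
    gaps = sum(1 for q, s in zip(query, sbjct) if q == "-" or s == "-")
    identities = sum(1 for q, s in zip(query, sbjct) if q == s and q != "-")
    return {
        "length": length,
        "identities": identities,
        "gaps": gaps,
        "percent_identity": round(100 * identities / length, 2),
    }


# Final 36 columns of the Cucumis melo alignment shown above.
stats = alignment_stats(
    "CVVDGLEGLEIGDDAEKE--SGKEEKKKPVLGEGFA",
    "CVVDGLEGLEIGDEAEKEKKSGKEEEK-PILGEGFA",
)
print(stats)

# The header value for the whole HSP follows the same arithmetic.
print(round(100 * 128 / 156, 2))  # 82.05
```

Summing the identical columns over all three blocks of the alignment gives the 128 of 156 reported in the header.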
BLAST of CsGy3G023020 vs. NCBI nr
Match:
XP_038900451.1 (uncharacterized protein LOC120087668 [Benincasa hispida])
HSP 1 Score: 210 bits (534), Expect = 5.33e-67
Identity = 118/155 (76.13%), Positives = 125/155 (80.65%), Query Frame = 0
Query: 1 MDAAEHHNPQPSFFHQILPPRLEDAGLEDSALPPDSIREAFFKAASAVKSRATARLSHSD 60
MDAAE HNP P F QILPPRLEDAGLED ALPPDSIREAFFKAASAVKS ATA LS SD
Sbjct: 1 MDAAEQHNPNPGIFDQILPPRLEDAGLEDCALPPDSIREAFFKAASAVKSTATALLSPSD 60
Query: 61 DEDDDVPCSPTSALPTD--------EDAPAICATKKGLELPEFGKDEVVIGGMEERRGKG 120
D+D P SPTS LPTD D PA+CAT+KGL+LPE G DEVVIGGMEERRGK
Sbjct: 61 DDD---PWSPTSTLPTDVVTGILPDRDDPAVCATEKGLKLPEIGGDEVVIGGMEERRGKA 120
Query: 121 CVVDGLEGLEIGDDA-EKESGKEEKKKPVLGEGFA 146
CVVDGLEGLEIGD+A EK+SGKEEK P+LGEGFA
Sbjct: 121 CVVDGLEGLEIGDEAKEKKSGKEEK--PILGEGFA 150
BLAST of CsGy3G023020 vs. NCBI nr
Match:
KAG6606765.1 (hypothetical protein SDJN03_00107, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 184 bits (467), Expect = 1.36e-56
Identity = 104/160 (65.00%), Positives = 118/160 (73.75%), Query Frame = 0
Query: 1 MDAAEHHNPQPSFFHQILPPRLEDAGLEDSALPPDSIREAFFKAASAVKSRATARLSHSD 60
MDAAE H +P F QILPPRLEDAGLED ALPPDSIREAFFKAASA+KS ATA LS D
Sbjct: 1 MDAAEEHQSKPGLFDQILPPRLEDAGLEDCALPPDSIREAFFKAASAIKSTATAFLSSDD 60
Query: 61 DEDDDV-----PCSPTSALPTDE--------DAPAICATKKGLELPEFGKDEVVIGGMEE 120
ED D SPT++LPTD D PA CAT KGL+LPEF D VV+GGMEE
Sbjct: 61 GEDSDGYGVEDNWSPTASLPTDVVTGILPELDPPAACATDKGLKLPEFNVDGVVVGGMEE 120
Query: 121 RRGKGCVVDGLEGLEIGDDAEKESGK-EEKKKPVLGEGFA 146
RRGKGCVVD LEGLE+GD+A+K+ EE+++P+L EGFA
Sbjct: 121 RRGKGCVVDVLEGLEVGDEAKKKKNSGEEEEQPILAEGFA 160
BLAST of CsGy3G023020 vs. ExPASy TrEMBL
Match:
A0A0A0LDD6 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G483800 PE=4 SV=1)
HSP 1 Score: 290 bits (742), Expect = 4.45e-99
Identity = 146/146 (100.00%), Positives = 146/146 (100.00%), Query Frame = 0
Query: 1 MDAAEHHNPQPSFFHQILPPRLEDAGLEDSALPPDSIREAFFKAASAVKSRATARLSHSD 60
MDAAEHHNPQPSFFHQILPPRLEDAGLEDSALPPDSIREAFFKAASAVKSRATARLSHSD
Sbjct: 1 MDAAEHHNPQPSFFHQILPPRLEDAGLEDSALPPDSIREAFFKAASAVKSRATARLSHSD 60
Query: 61 DEDDDVPCSPTSALPTDEDAPAICATKKGLELPEFGKDEVVIGGMEERRGKGCVVDGLEG 120
DEDDDVPCSPTSALPTDEDAPAICATKKGLELPEFGKDEVVIGGMEERRGKGCVVDGLEG
Sbjct: 61 DEDDDVPCSPTSALPTDEDAPAICATKKGLELPEFGKDEVVIGGMEERRGKGCVVDGLEG 120
Query: 121 LEIGDDAEKESGKEEKKKPVLGEGFA 146
LEIGDDAEKESGKEEKKKPVLGEGFA
Sbjct: 121 LEIGDDAEKESGKEEKKKPVLGEGFA 146
BLAST of CsGy3G023020 vs. ExPASy TrEMBL
Match:
A0A5A7U9B5 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold323G001030 PE=4 SV=1)
HSP 1 Score: 233 bits (593), Expect = 3.03e-76
Identity = 128/156 (82.05%), Positives = 135/156 (86.54%), Query Frame = 0
Query: 1 MDAAEHHNPQPSFFHQILPPRLEDAGLEDSALPPDSIREAFFKAASAVKSRATARLSHSD 60
MDAAEHHNPQP FF QILPPRLEDAGLED ALPPDSIREAFFKAASAVKSRATA LS D
Sbjct: 1 MDAAEHHNPQPPFFDQILPPRLEDAGLEDPALPPDSIREAFFKAASAVKSRATALLSPDD 60
Query: 61 DEDDDVPCSPTSALPTD--------EDAPAICATKKGLELPEFGKDEVVIGGMEERRGKG 120
D++DD P SPTS LPTD EDAPAICAT+KGL+LPEFGKDEVVIGGMEERRGK
Sbjct: 61 DDEDD-PWSPTSTLPTDIVTGILPDEDAPAICATRKGLKLPEFGKDEVVIGGMEERRGKA 120
Query: 121 CVVDGLEGLEIGDDAEKE--SGKEEKKKPVLGEGFA 146
CVVDGLEGLEIGD+AEKE SGKEE+K P+LGEGFA
Sbjct: 121 CVVDGLEGLEIGDEAEKEKKSGKEEEK-PILGEGFA 154
BLAST of CsGy3G023020 vs. ExPASy TrEMBL
Match:
A0A1S3BI85 (uncharacterized protein LOC103489947 OS=Cucumis melo OX=3656 GN=LOC103489947 PE=4 SV=1)
HSP 1 Score: 233 bits (593), Expect = 3.03e-76
Identity = 128/156 (82.05%), Positives = 135/156 (86.54%), Query Frame = 0
Query: 1 MDAAEHHNPQPSFFHQILPPRLEDAGLEDSALPPDSIREAFFKAASAVKSRATARLSHSD 60
MDAAEHHNPQP FF QILPPRLEDAGLED ALPPDSIREAFFKAASAVKSRATA LS D
Sbjct: 1 MDAAEHHNPQPPFFDQILPPRLEDAGLEDPALPPDSIREAFFKAASAVKSRATALLSPDD 60
Query: 61 DEDDDVPCSPTSALPTD--------EDAPAICATKKGLELPEFGKDEVVIGGMEERRGKG 120
D++DD P SPTS LPTD EDAPAICAT+KGL+LPEFGKDEVVIGGMEERRGK
Sbjct: 61 DDEDD-PWSPTSTLPTDIVTGILPDEDAPAICATRKGLKLPEFGKDEVVIGGMEERRGKA 120
Query: 121 CVVDGLEGLEIGDDAEKE--SGKEEKKKPVLGEGFA 146
CVVDGLEGLEIGD+AEKE SGKEE+K P+LGEGFA
Sbjct: 121 CVVDGLEGLEIGDEAEKEKKSGKEEEK-PILGEGFA 154
BLAST of CsGy3G023020 vs. ExPASy TrEMBL
Match:
A0A6J1GBZ4 (uncharacterized protein LOC111452587 OS=Cucurbita moschata OX=3662 GN=LOC111452587 PE=4 SV=1)
HSP 1 Score: 183 bits (465), Expect = 1.33e-56
Identity = 104/160 (65.00%), Positives = 118/160 (73.75%), Query Frame = 0
Query: 1 MDAAEHHNPQPSFFHQILPPRLEDAGLEDSALPPDSIREAFFKAASAVKSRATARLSHSD 60
MDAAE H +P F QILPPRLEDAGLED ALPPDSIREAFFKAASA+KS ATA LS D
Sbjct: 1 MDAAEEHQSKPGLFDQILPPRLEDAGLEDCALPPDSIREAFFKAASAIKSTATAFLSSDD 60
Query: 61 DEDDDV-----PCSPTSALPTDE--------DAPAICATKKGLELPEFGKDEVVIGGMEE 120
ED D SPT++LPTD D PA CAT KGL+LPEF D VV+GGMEE
Sbjct: 61 GEDSDGYGVEDNWSPTASLPTDVVTGILPELDPPAACATDKGLKLPEFNVDGVVVGGMEE 120
Query: 121 RRGKGCVVDGLEGLEIGDDAEKESGK-EEKKKPVLGEGFA 146
RRGKGCVVD LEGLE+GD+A+K+ EE+++P+L EGFA
Sbjct: 121 RRGKGCVVDVLEGLEVGDEAKKKKKSGEEEEQPILAEGFA 160
BLAST of CsGy3G023020 vs. ExPASy TrEMBL
Match:
A0A6J1K989 (uncharacterized protein LOC111492831 OS=Cucurbita maxima OX=3661 GN=LOC111492831 PE=4 SV=1)
HSP 1 Score: 160 bits (405), Expect = 1.06e-47
Identity = 91/142 (64.08%), Positives = 101/142 (71.13%), Query Frame = 0
Query: 1 MDAAEHHNPQPSFFHQILPPRLEDAGLEDSALPPDSIREAFFKAASAVKSRATARLSHSD 60
MDAAE H +P F QILPPRLEDAGLED ALPPDSI EAFFKAASA+KS AT LS +
Sbjct: 1 MDAAEEHQSKPGLFDQILPPRLEDAGLEDCALPPDSICEAFFKAASAIKSTATGFLSSDE 60
Query: 61 DEDDDV-----PCSPTSALPTDE--------DAPAICATKKGLELPEFGKDEVVIGGMEE 120
+D D SPT+AL TD D PA CAT KGL+LPEF D VV+GGMEE
Sbjct: 61 GDDSDGYGVEDKWSPTAALSTDVVTGIWPELDPPAACATDKGLKLPEFDLDGVVVGGMEE 120
Query: 121 RRGKGCVVDGLEGLEIGDDAEK 129
RRGKGC VD LEGLE+GD+A+K
Sbjct: 121 RRGKGCAVDVLEGLEVGDEAKK 142
BLAST of CsGy3G023020 vs. TAIR 10
Match:
AT1G15230.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 15 growth stages; Has 26 Blast hits to 26 proteins in 12 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 26; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )
HSP 1 Score: 77.4 bits (189), Expect = 1.1e-14
Identity = 60/143 (41.96%), Positives = 82/143 (57.34%), Query Frame = 0
Query: 17 ILPPRLEDAGLEDSALPPDSIREAFFKAASAVKSRATARLSHSDDED---DDVPCSPTSA 76
ILPP L DAGLED ALPP+SI+EAF KAA+AVKSRA + H +++ D +P +
Sbjct: 14 ILPPALADAGLEDCALPPESIQEAFRKAANAVKSRAASIFDHEEEDGCLADPIPETADKI 73
Query: 77 L--PTDEDAPAICATKKGLELPEFGKD--EVVIGGMEERRGKGCVVDGL-----EGLEIG 136
+ +E C KG+E K ++V+ G E GK C VDGL EG+E
Sbjct: 74 IVGGDNERDTGPCLAGKGIEKLAESKQAGDLVVAG-EGEEGKSC-VDGLKDLDVEGIERS 133
Query: 137 DDAEKES--GKEEKKKPVLGEGF 146
+ + +S +EE+ KP+L EGF
Sbjct: 134 SEKKDQSDEDEEEEMKPILVEGF 154
The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
XP_011651498.1 | 9.19e-99 | 100.00 | uncharacterized protein LOC105434906 [Cucumis sativus]
KAE8650731.1 | 2.37e-96 | 100.00 | hypothetical protein Csa_023393 [Cucumis sativus]
XP_008447517.1 | 6.26e-76 | 82.05 | PREDICTED: uncharacterized protein LOC103489947 [Cucumis melo] >KAA0050786.1 unc...
XP_038900451.1 | 5.33e-67 | 76.13 | uncharacterized protein LOC120087668 [Benincasa hispida]
KAG6606765.1 | 1.36e-56 | 65.00 | hypothetical protein SDJN03_00107, partial [Cucurbita argyrosperma subsp. sorori...
Match Name | E-value | Identity (%) | Description
A0A0A0LDD6 | 4.45e-99 | 100.00 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G483800 PE=4 SV=1
A0A5A7U9B5 | 3.03e-76 | 82.05 | Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...
A0A1S3BI85 | 3.03e-76 | 82.05 | uncharacterized protein LOC103489947 OS=Cucumis melo OX=3656 GN=LOC103489947 PE=...
A0A6J1GBZ4 | 1.33e-56 | 65.00 | uncharacterized protein LOC111452587 OS=Cucurbita moschata OX=3662 GN=LOC1114525...
A0A6J1K989 | 1.06e-47 | 64.08 | uncharacterized protein LOC111492831 OS=Cucurbita maxima OX=3661 GN=LOC111492831...
Match Name | E-value | Identity (%) | Description
AT1G15230.1 | 1.1e-14 | 41.96 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
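The summary rows are pipe-delimited, so they can be loaded into records and ranked by E-value or identity. The following is a small sketch under that assumption; `BlastHit` and `parse_hits` are hypothetical names, header rows are skipped, and any cells after the description are ignored.

```python
from typing import NamedTuple


class BlastHit(NamedTuple):
    name: str
    evalue: float
    identity: float  # percent identity as printed in the table
    description: str


def parse_hits(text: str) -> list[BlastHit]:
    """Parse 'name | e-value | identity | description' rows into records."""
    hits = []
    for line in text.splitlines():
        cells = [cell.strip() for cell in line.split("|")]
        if len(cells) < 4 or cells[0] == "Match Name":
            continue  # skip blank lines, short lines, and header rows
        name, evalue, identity, description = cells[:4]
        hits.append(BlastHit(name, float(evalue), float(identity), description))
    return hits


# Two rows copied from the NCBI nr results above.
rows = """Match Name | E-value | Identity | Description
XP_011651498.1 | 9.19e-99 | 100.00 | uncharacterized protein LOC105434906 [Cucumis sativus]
XP_038900451.1 | 5.33e-67 | 76.13 | uncharacterized protein LOC120087668 [Benincasa hispida]"""

hits = parse_hits(rows)
best = min(hits, key=lambda hit: hit.evalue)  # smallest E-value ranks first
print(best.name, best.evalue)
```

A lower E-value indicates a more significant match, which is why the sketch ranks with `min` rather than `max`.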