Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
ATGTCCACTCCCCTCGACCAGCTCCAGCCCCCGCCGCCCCTCCACTCCGCCCACGCCTCCGTCGGCCCACTCATCGCCGTCCTCGCCGTCATTTCCATCCTCGGCGTCATTGCCGGCATGATTGGCCGCCTCTGCTCCGGCCGCCCTGTCTTTGGCTACGGTGCCCACTACGACGTCGAGGATTGGGTCGAGAAGAAATGTGCTTCCTGCCTCGACGGCTCCCTCGACCCTCCTCCTCCGCCGCCGCACCTCCGCCGCCCACCGCCGCTCGAATCTGTTCCTGTGGCTGAACCGCTTGATGGGCCACCACCGGAGATCAAGCAAGGTGCCGATCCCGATGTGGAGGCCGATGCTAAACGCGAGAATTTGCAGTCGGCGCCTCCTGGAACCGGCGGTGAGTCGTGA
mRNA sequence
ATGTCCACTCCCCTCGACCAGCTCCAGCCCCCGCCGCCCCTCCACTCCGCCCACGCCTCCGTCGGCCCACTCATCGCCGTCCTCGCCGTCATTTCCATCCTCGGCGTCATTGCCGGCATGATTGGCCGCCTCTGCTCCGGCCGCCCTGTCTTTGGCTACGGTGCCCACTACGACGTCGAGGATTGGGTCGAGAAGAAATGTGCTTCCTGCCTCGACGGCTCCCTCGACCCTCCTCCTCCGCCGCCGCACCTCCGCCGCCCACCGCCGCTCGAATCTGTTCCTGTGGCTGAACCGCTTGATGGGCCACCACCGGAGATCAAGCAAGGTGCCGATCCCGATGTGGAGGCCGATGCTAAACGCGAGAATTTGCAGTCGGCGCCTCCTGGAACCGGCGGTGAGTCGTGA
Coding sequence (CDS)
ATGTCCACTCCCCTCGACCAGCTCCAGCCCCCGCCGCCCCTCCACTCCGCCCACGCCTCCGTCGGCCCACTCATCGCCGTCCTCGCCGTCATTTCCATCCTCGGCGTCATTGCCGGCATGATTGGCCGCCTCTGCTCCGGCCGCCCTGTCTTTGGCTACGGTGCCCACTACGACGTCGAGGATTGGGTCGAGAAGAAATGTGCTTCCTGCCTCGACGGCTCCCTCGACCCTCCTCCTCCGCCGCCGCACCTCCGCCGCCCACCGCCGCTCGAATCTGTTCCTGTGGCTGAACCGCTTGATGGGCCACCACCGGAGATCAAGCAAGGTGCCGATCCCGATGTGGAGGCCGATGCTAAACGCGAGAATTTGCAGTCGGCGCCTCCTGGAACCGGCGGTGAGTCGTGA
Protein sequence
MSTPLDQLQPPPPLHSAHASVGPLIAVLAVISILGVIAGMIGRLCSGRPVFGYGAHYDVEDWVEKKCASCLDGSLDPPPPPPHLRRPPPLESVPVAEPLDGPPPEIKQGADPDVEADAKRENLQSAPPGTGGES
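The protein sequence above corresponds to the coding sequence read codon by codon under the standard genetic code, stopping at the terminal TGA. A minimal sketch of that translation follows; the function name `translate` and the table construction are illustrative assumptions, not part of the database's own tooling, and only the first ten codons of the CDS are used as input.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (third base varies fastest). "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the CDS listed above.
print(translate("ATGTCCACTCCCCTCGACCAGCTCCAGCCC"))  # MSTPLDQLQP
```

Passing the full CDS string instead of the 30-base prefix should reproduce the complete protein sequence shown above.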
Homology
BLAST of CmUC02G031830 vs. NCBI nr
Match:
XP_038887147.1 (uncharacterized protein LOC120077337 [Benincasa hispida])
HSP 1 Score: 241.9 bits (616), Expect = 3.2e-60
Identity = 125/137 (91.24%), Positives = 128/137 (93.43%), Query Frame = 0
Query: 1 MSTPLDQLQPPPPLHSAHASVGPLIAVLAVISILGVIAGMIGRLCSGRPVFGYGAHYDVE 60
MSTPLDQLQPPP LHSAHASVGPLIAVLAVISILGVIAGMIGRLCSGRPVFGYGAHYDVE
Sbjct: 1 MSTPLDQLQPPPSLHSAHASVGPLIAVLAVISILGVIAGMIGRLCSGRPVFGYGAHYDVE 60
Query: 61 DWVEKKCASCLDGSLD-PPPPPPHLRRPPPLESVPVAEPLDGPPPEIKQGADPD--VEAD 120
DW+EKKCASCLDGSLD PPPPPPHLRRPP L+SVPVAEPL GPPPEIKQGAD D V+ D
Sbjct: 61 DWIEKKCASCLDGSLDPPPPPPPHLRRPPLLDSVPVAEPLGGPPPEIKQGADADAVVDTD 120
Query: 121 AKRENLQSAPPGTGGES 135
KRENLQSAPPGTGGES
Sbjct: 121 VKRENLQSAPPGTGGES 137
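The percentages in the HSP header are the ratio of matching columns to the alignment length (here 125/137 and 128/137). A small sketch of that arithmetic, with hypothetical helper names `percent` and `count_identities`, assuming the aligned query and subject strings have equal length and use `-` for gaps:

```python
def percent(part: int, whole: int) -> float:
    """Return part/whole as a percentage rounded to two decimals."""
    return round(100.0 * part / whole, 2)


def count_identities(query: str, subject: str) -> int:
    """Count alignment columns where both rows carry the same residue."""
    if len(query) != len(subject):
        raise ValueError("aligned rows must have the same length")
    return sum(1 for q, s in zip(query, subject) if q == s and q != "-")


# Values taken from the HSP header above.
print(percent(125, 137))  # 91.24
print(percent(128, 137))  # 93.43

# Last alignment row of this HSP: one mismatch (A vs. V) in 17 columns.
print(count_identities("AKRENLQSAPPGTGGES", "VKRENLQSAPPGTGGES"))  # 16
```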
BLAST of CmUC02G031830 vs. NCBI nr
Match:
XP_008455516.1 (PREDICTED: type IV secretion system protein virB10-like [Cucumis melo] >KAA0031279.1 type IV secretion system protein virB10-like [Cucumis melo var. makuwa])
HSP 1 Score: 237.3 bits (604), Expect = 7.8e-59
Identity = 123/138 (89.13%), Positives = 126/138 (91.30%), Query Frame = 0
Query: 1 MSTPLDQLQPPPPLHSAHASVGPLIAVLAVISILGVIAGMIGRLCSGRPVFGYGAHYDVE 60
MSTP+DQLQPPPPLHS+HASVGPLIAVLAVISILGVIAGMIGRLCSGRPVFGYGAHYDVE
Sbjct: 1 MSTPIDQLQPPPPLHSSHASVGPLIAVLAVISILGVIAGMIGRLCSGRPVFGYGAHYDVE 60
Query: 61 DWVEKKCASCLDGSLD----PPPPPPHLRRPPPLESVPVAEPLDGPPPEIKQGADPDVEA 120
DWVEKKCASCLDGSLD PPPPPPHLR PPPL+SVPVAEPL GPPPEIKQGAD A
Sbjct: 61 DWVEKKCASCLDGSLDPPPPPPPPPPHLRHPPPLDSVPVAEPLGGPPPEIKQGAD----A 120
Query: 121 DAKRENLQSAPPGTGGES 135
DAK ENLQSA PGTGGES
Sbjct: 121 DAKGENLQSAAPGTGGES 134
BLAST of CmUC02G031830 vs. NCBI nr
Match:
KAE8645870.1 (hypothetical protein Csa_017149 [Cucumis sativus])
HSP 1 Score: 235.7 bits (600), Expect = 2.3e-58
Identity = 119/132 (90.15%), Positives = 123/132 (93.18%), Query Frame = 0
Query: 1 MSTPLDQLQPPPPLHSAHASVGPLIAVLAVISILGVIAGMIGRLCSGRPVFGYGAHYDVE 60
MSTP+DQLQPPPPLHS+HASVGPLIAVLAVISILGVIAGMIGRLCSGRPVFGYGAHYD+E
Sbjct: 1 MSTPIDQLQPPPPLHSSHASVGPLIAVLAVISILGVIAGMIGRLCSGRPVFGYGAHYDLE 60
Query: 61 DWVEKKCASCLDGSLDPPPPPPHLRRPPPLESVPVAEPLDGPPPEIKQGADPDVEADAKR 120
DWVEKKCASCLDGSLDPPPPPPHLR PPPL+SVPVAEPL GPPPEIKQ A D ADAK
Sbjct: 61 DWVEKKCASCLDGSLDPPPPPPHLRHPPPLDSVPVAEPLGGPPPEIKQSAHAD--ADAKG 120
Query: 121 ENLQSAPPGTGG 133
ENLQSA PGTGG
Sbjct: 121 ENLQSAAPGTGG 130
BLAST of CmUC02G031830 vs. NCBI nr
Match:
KAG7011671.1 (hypothetical protein SDJN02_26577, partial [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 212.6 bits (540), Expect = 2.1e-51
Identity = 112/137 (81.75%), Positives = 118/137 (86.13%), Query Frame = 0
Query: 1 MSTPLDQLQPPPPLHSAHASVGPLIAVLAVISILGVIAGMIGRLCSGRPVFGYGAHYDVE 60
MSTP+DQLQPPP HS H SVGP+IAVLAVISILGVIAG+IGRLCSGRPVFGYGAHYDVE
Sbjct: 1 MSTPVDQLQPPPTAHSGHGSVGPVIAVLAVISILGVIAGIIGRLCSGRPVFGYGAHYDVE 60
Query: 61 DWVEKKCASCLDGSLD---PPPPPPHLRRPPPLESVPVAEPLDGPPPEIKQGADPDVEAD 120
+WVEKKCASCLDGSLD PPPPPPHLR PPPL++VPV EPL G PPEIKQGAD
Sbjct: 61 EWVEKKCASCLDGSLDPPPPPPPPPHLRHPPPLDAVPVVEPLGG-PPEIKQGADD----- 120
Query: 121 AKRENLQSAPPGTGGES 135
KRENLQSA PGTGGES
Sbjct: 121 -KRENLQSAAPGTGGES 130
BLAST of CmUC02G031830 vs. NCBI nr
Match:
XP_022972364.1 (uncharacterized protein LOC111470940 [Cucurbita maxima])
HSP 1 Score: 211.8 bits (538), Expect = 3.5e-51
Identity = 109/132 (82.58%), Positives = 114/132 (86.36%), Query Frame = 0
Query: 1 MSTPLDQLQPPPPLHSAHASVGPLIAVLAVISILGVIAGMIGRLCSGRPVFGYGAHYDVE 60
MSTP DQLQPPP HS H SVGP+IAVLAVISILGVIAG+IGRLCSGRPVFGYGAHYDVE
Sbjct: 1 MSTPFDQLQPPPTAHSGHGSVGPVIAVLAVISILGVIAGIIGRLCSGRPVFGYGAHYDVE 60
Query: 61 DWVEKKCASCLDGSLDPPPPPPHLRRPPPLESVPVAEPLDGPPPEIKQGADPDVEADAKR 120
+WVEKKCASCLDGSLDPPPPP HLR PPPL++VPV EPL G PPEIKQG AD KR
Sbjct: 61 EWVEKKCASCLDGSLDPPPPPAHLRHPPPLDAVPVVEPLGG-PPEIKQG------ADEKR 120
Query: 121 ENLQSAPPGTGG 133
ENLQSA PGTGG
Sbjct: 121 ENLQSAAPGTGG 125
BLAST of CmUC02G031830 vs. ExPASy TrEMBL
Match:
A0A0A0K6N4 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G074850 PE=4 SV=1)
HSP 1 Score: 238.8 bits (608), Expect = 1.3e-59
Identity = 121/134 (90.30%), Positives = 125/134 (93.28%), Query Frame = 0
Query: 1 MSTPLDQLQPPPPLHSAHASVGPLIAVLAVISILGVIAGMIGRLCSGRPVFGYGAHYDVE 60
MSTP+DQLQPPPPLHS+HASVGPLIAVLAVISILGVIAGMIGRLCSGRPVFGYGAHYD+E
Sbjct: 1 MSTPIDQLQPPPPLHSSHASVGPLIAVLAVISILGVIAGMIGRLCSGRPVFGYGAHYDLE 60
Query: 61 DWVEKKCASCLDGSLDPPPPPPHLRRPPPLESVPVAEPLDGPPPEIKQGADPDVEADAKR 120
DWVEKKCASCLDGSLDPPPPPPHLR PPPL+SVPVAEPL GPPPEIKQ A D ADAK
Sbjct: 61 DWVEKKCASCLDGSLDPPPPPPHLRHPPPLDSVPVAEPLGGPPPEIKQSAHAD--ADAKG 120
Query: 121 ENLQSAPPGTGGES 135
ENLQSA PGTGGES
Sbjct: 121 ENLQSAAPGTGGES 132
BLAST of CmUC02G031830 vs. ExPASy TrEMBL
Match:
A0A5A7SPN0 (Type IV secretion system protein virB10-like OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold139G00320 PE=4 SV=1)
HSP 1 Score: 237.3 bits (604), Expect = 3.8e-59
Identity = 123/138 (89.13%), Positives = 126/138 (91.30%), Query Frame = 0
Query: 1 MSTPLDQLQPPPPLHSAHASVGPLIAVLAVISILGVIAGMIGRLCSGRPVFGYGAHYDVE 60
MSTP+DQLQPPPPLHS+HASVGPLIAVLAVISILGVIAGMIGRLCSGRPVFGYGAHYDVE
Sbjct: 1 MSTPIDQLQPPPPLHSSHASVGPLIAVLAVISILGVIAGMIGRLCSGRPVFGYGAHYDVE 60
Query: 61 DWVEKKCASCLDGSLD----PPPPPPHLRRPPPLESVPVAEPLDGPPPEIKQGADPDVEA 120
DWVEKKCASCLDGSLD PPPPPPHLR PPPL+SVPVAEPL GPPPEIKQGAD A
Sbjct: 61 DWVEKKCASCLDGSLDPPPPPPPPPPHLRHPPPLDSVPVAEPLGGPPPEIKQGAD----A 120
Query: 121 DAKRENLQSAPPGTGGES 135
DAK ENLQSA PGTGGES
Sbjct: 121 DAKGENLQSAAPGTGGES 134
BLAST of CmUC02G031830 vs. ExPASy TrEMBL
Match:
A0A1S3C1T9 (type IV secretion system protein virB10-like OS=Cucumis melo OX=3656 GN=LOC103495668 PE=4 SV=1)
HSP 1 Score: 237.3 bits (604), Expect = 3.8e-59
Identity = 123/138 (89.13%), Positives = 126/138 (91.30%), Query Frame = 0
Query: 1 MSTPLDQLQPPPPLHSAHASVGPLIAVLAVISILGVIAGMIGRLCSGRPVFGYGAHYDVE 60
MSTP+DQLQPPPPLHS+HASVGPLIAVLAVISILGVIAGMIGRLCSGRPVFGYGAHYDVE
Sbjct: 1 MSTPIDQLQPPPPLHSSHASVGPLIAVLAVISILGVIAGMIGRLCSGRPVFGYGAHYDVE 60
Query: 61 DWVEKKCASCLDGSLD----PPPPPPHLRRPPPLESVPVAEPLDGPPPEIKQGADPDVEA 120
DWVEKKCASCLDGSLD PPPPPPHLR PPPL+SVPVAEPL GPPPEIKQGAD A
Sbjct: 61 DWVEKKCASCLDGSLDPPPPPPPPPPHLRHPPPLDSVPVAEPLGGPPPEIKQGAD----A 120
Query: 121 DAKRENLQSAPPGTGGES 135
DAK ENLQSA PGTGGES
Sbjct: 121 DAKGENLQSAAPGTGGES 134
BLAST of CmUC02G031830 vs. ExPASy TrEMBL
Match:
A0A6J1IBA8 (uncharacterized protein LOC111470940 OS=Cucurbita maxima OX=3661 GN=LOC111470940 PE=4 SV=1)
HSP 1 Score: 211.8 bits (538), Expect = 1.7e-51
Identity = 109/132 (82.58%), Positives = 114/132 (86.36%), Query Frame = 0
Query: 1 MSTPLDQLQPPPPLHSAHASVGPLIAVLAVISILGVIAGMIGRLCSGRPVFGYGAHYDVE 60
MSTP DQLQPPP HS H SVGP+IAVLAVISILGVIAG+IGRLCSGRPVFGYGAHYDVE
Sbjct: 1 MSTPFDQLQPPPTAHSGHGSVGPVIAVLAVISILGVIAGIIGRLCSGRPVFGYGAHYDVE 60
Query: 61 DWVEKKCASCLDGSLDPPPPPPHLRRPPPLESVPVAEPLDGPPPEIKQGADPDVEADAKR 120
+WVEKKCASCLDGSLDPPPPP HLR PPPL++VPV EPL G PPEIKQG AD KR
Sbjct: 61 EWVEKKCASCLDGSLDPPPPPAHLRHPPPLDAVPVVEPLGG-PPEIKQG------ADEKR 120
Query: 121 ENLQSAPPGTGG 133
ENLQSA PGTGG
Sbjct: 121 ENLQSAAPGTGG 125
BLAST of CmUC02G031830 vs. ExPASy TrEMBL
Match:
A0A6J1GL66 (uncharacterized protein LOC111455383 OS=Cucurbita moschata OX=3662 GN=LOC111455383 PE=4 SV=1)
HSP 1 Score: 208.0 bits (528), Expect = 2.5e-50
Identity = 109/133 (81.95%), Positives = 116/133 (87.22%), Query Frame = 0
Query: 1 MSTPLDQLQPPPPLHSAHASVGPLIAVLAVISILGVIAGMIGRLCSGRPVFGYGAHYDVE 60
MSTP+DQLQPPP HS + SVGP+IAVLAVISILGVIAG+IGRLCSGRPVFGYGAHYDVE
Sbjct: 1 MSTPVDQLQPPPTAHSGYGSVGPVIAVLAVISILGVIAGIIGRLCSGRPVFGYGAHYDVE 60
Query: 61 DWVEKKCASCLDGSLD-PPPPPPHLRRPPPLESVPVAEPLDGPPPEIKQGADPDVEADAK 120
+WVEKKCASCLDGSLD PPPPPPHLR PPPL++VPV EPL G PPEIKQGAD K
Sbjct: 61 EWVEKKCASCLDGSLDPPPPPPPHLRHPPPLDAVPVVEPLGG-PPEIKQGADD------K 120
Query: 121 RENLQSAPPGTGG 133
RENLQSA PGTGG
Sbjct: 121 RENLQSAAPGTGG 126
BLAST of CmUC02G031830 vs. TAIR 10
Match:
AT2G26520.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G57500.1); Has 51 Blast hits to 51 proteins in 11 species: Archae - 0; Bacteria - 1; Metazoa - 0; Fungi - 0; Plants - 50; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )
HSP 1 Score: 73.2 bits (178), Expect = 1.8e-13
Identity = 51/132 (38.64%), Positives = 73/132 (55.30%), Query Frame = 0
Query: 9 QPPPPLH----------SAHASVGPLIAVLAVISILGVIAGMIGRLCSGRPVFGYGAHYD 68
QPPP + ++++GP IAV V+++L V+A +IGRLCSG+ + GYG YD
Sbjct: 9 QPPPATEVSQDSSSVSSAGNSTIGPFIAVFIVVTVLCVLASVIGRLCSGKTILGYG-DYD 68
Query: 69 VEDWVEKKCASCLDGSLDPPPPPPHLRRPPPLESVPVAEPLDGPPPEIK-QGADPDVEAD 128
+E W E +C SC+DG + P P P PPP + P+ G E + AD D E D
Sbjct: 69 MERWAESRCGSCIDGHIHPHRPSPS-PTPPPRQ--PLHHTSSGVSAESEGHVADLDHETD 128
BLAST of CmUC02G031830 vs. TAIR 10
Match:
AT3G57500.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G26520.1); Has 51 Blast hits to 51 proteins in 11 species: Archae - 0; Bacteria - 1; Metazoa - 0; Fungi - 0; Plants - 50; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )
HSP 1 Score: 69.7 bits (169), Expect = 2.0e-12
Identity = 42/95 (44.21%), Positives = 57/95 (60.00%), Query Frame = 0
Query: 9 QPPPPLHSAHASVGPLIAVLAVISILGVIAGMIGRLCSGRPVFGYGAHYDVEDWVEKKCA 68
Q P +S H S+ L+ VLAVI+IL V+AG+ RLC GR + +G +D+E WVE+KC
Sbjct: 26 QDQPSHNSDHRSIETLVVVLAVITILSVLAGVFARLCGGRHL-SHGGDHDIEGWVERKCR 85
Query: 69 SCLDG-----SLDPPPPPPHLRRPPPLESVPVAEP 99
SC+D S P PPPP PPP + ++P
Sbjct: 86 SCIDAGIPAVSAAPSPPPP----PPPATAEERSKP 115
The following BLAST results are available for this feature:
NCBI nr
Match Name | E-value | Identity (%) | Description
XP_038887147.1 | 3.2e-60 | 91.24 | uncharacterized protein LOC120077337 [Benincasa hispida]
XP_008455516.1 | 7.8e-59 | 89.13 | PREDICTED: type IV secretion system protein virB10-like [Cucumis melo] >KAA00312...
KAE8645870.1 | 2.3e-58 | 90.15 | hypothetical protein Csa_017149 [Cucumis sativus]
KAG7011671.1 | 2.1e-51 | 81.75 | hypothetical protein SDJN02_26577, partial [Cucurbita argyrosperma subsp. argyro...
XP_022972364.1 | 3.5e-51 | 82.58 | uncharacterized protein LOC111470940 [Cucurbita maxima]
ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A0A0K6N4 | 1.3e-59 | 90.30 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G074850 PE=4 SV=1
A0A5A7SPN0 | 3.8e-59 | 89.13 | Type IV secretion system protein virB10-like OS=Cucumis melo var. makuwa OX=1194...
A0A1S3C1T9 | 3.8e-59 | 89.13 | type IV secretion system protein virB10-like OS=Cucumis melo OX=3656 GN=LOC10349...
A0A6J1IBA8 | 1.7e-51 | 82.58 | uncharacterized protein LOC111470940 OS=Cucurbita maxima OX=3661 GN=LOC111470940...
A0A6J1GL66 | 2.5e-50 | 81.95 | uncharacterized protein LOC111455383 OS=Cucurbita moschata OX=3662 GN=LOC1114553...
TAIR 10
Match Name | E-value | Identity (%) | Description
AT2G26520.1 | 1.8e-13 | 38.64 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT3G57500.1 | 2.0e-12 | 44.21 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...