Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
ATGGCCTCCATCAGGGCAATTGCTTTTGTTTGTTTCGTAGTCTGTTTCTGTGCGATTGAATCACAAGCACGAGTGGCGAGGAAGGACCTGGGCCTTGACCTTGGAGGCTTGGGAATTGGGCTCGGGGCAGGAATCGGGTTGGGGATAGGCGGAGGCAGTGGCTCGGGTGCTGGAGCCGGGTCTGGATCCGGGTCCGGGTCGGGTTCGTATTCATCCTCGTCATCACACTCGTCAAGCTCTAGCTACGGTGGCTCGGGTGCAGGCTCGGAAGCTGGTTCCTACGCAGGGTCACGTGCAGGGTCGGGTTCGGGTTCGGGAAGAAATTACAGGAACGGTGGATCGGGATCTGGGTCGGGCTATGGCGAGGGTTCCGGTAGAGGAAACGGTGAAGGTTATGGTGAGGGTCATGGCTATGGCGAGGGACGTGGTTACGGCGGTGAAGGCGGTAACAAT
mRNA sequence
ATGGCCTCCATCAGGGCAATTGCTTTTGTTTGTTTCGTAGTCTGTTTCTGTGCGATTGAATCACAAGCACGAGTGGCGAGGAAGGACCTGGGCCTTGACCTTGGAGGCTTGGGAATTGGGCTCGGGGCAGGAATCGGGTTGGGGATAGGCGGAGGCAGTGGCTCGGGTGCTGGAGCCGGGTCTGGATCCGGGTCCGGGTCGGGTTCGTATTCATCCTCGTCATCACACTCGTCAAGCTCTAGCTACGGTGGCTCGGGTGCAGGCTCGGAAGCTGGTTCCTACGCAGGGTCACGTGCAGGGTCGGGTTCGGGTTCGGGAAGAAATTACAGGAACGGTGGATCGGGATCTGGGTCGGGCTATGGCGAGGGTTCCGGTAGAGGAAACGGTGAAGGTTATGGTGAGGGTCATGGCTATGGCGAGGGACGTGGTTACGGCGGTGAAGGCGGTAACAAT
Coding sequence (CDS)
ATGGCCTCCATCAGGGCAATTGCTTTTGTTTGTTTCGTAGTCTGTTTCTGTGCGATTGAATCACAAGCACGAGTGGCGAGGAAGGACCTGGGCCTTGACCTTGGAGGCTTGGGAATTGGGCTCGGGGCAGGAATCGGGTTGGGGATAGGCGGAGGCAGTGGCTCGGGTGCTGGAGCCGGGTCTGGATCCGGGTCCGGGTCGGGTTCGTATTCATCCTCGTCATCACACTCGTCAAGCTCTAGCTACGGTGGCTCGGGTGCAGGCTCGGAAGCTGGTTCCTACGCAGGGTCACGTGCAGGGTCGGGTTCGGGTTCGGGAAGAAATTACAGGAACGGTGGATCGGGATCTGGGTCGGGCTATGGCGAGGGTTCCGGTAGAGGAAACGGTGAAGGTTATGGTGAGGGTCATGGCTATGGCGAGGGACGTGGTTACGGCGGTGAAGGCGGTAACAAT
Protein sequence
MASIRAIAFVCFVVCFCAIESQARVARKDLGLDLGGLGIGLGAGIGLGIGGGSGSGAGAGSGSGSGSGSYSSSSSHSSSSSYGGSGAGSEAGSYAGSRAGSGSGSGRNYRNGGSGSGSGYGEGSGRGNGEGYGEGHGYGEGRGYGGEGGNN
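The protein sequence above corresponds to a codon-by-codon translation of the CDS. A minimal sketch of that translation, assuming the standard nuclear genetic code (the function name `translate` and the table layout are illustrative, not part of the database page):

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length must be a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First ten codons of the CDS listed above.
print(translate("ATGGCCTCCATCAGGGCAATTGCTTTTGTT"))  # MASIRAIAFV
```

The CDS as listed ends in the codon AAT (N) rather than a stop codon, so the translated string carries no trailing `*`.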
Homology
BLAST of MS002300 vs. NCBI nr
Match:
XP_022135257.1 (glycine-rich cell wall structural protein 2-like [Momordica charantia])
HSP 1 Score: 228.4 bits (581), Expect = 4.1e-56
Identity = 151/151 (100.00%), Positives = 151/151 (100.00%), Query Frame = 0
Query: 1 MASIRAIAFVCFVVCFCAIESQARVARKDLGLDLGGLGIGLGAGIGLGIGGGSGSGAGAG 60
MASIRAIAFVCFVVCFCAIESQARVARKDLGLDLGGLGIGLGAGIGLGIGGGSGSGAGAG
Sbjct: 1 MASIRAIAFVCFVVCFCAIESQARVARKDLGLDLGGLGIGLGAGIGLGIGGGSGSGAGAG 60
Query: 61 SGSGSGSGSYSSSSSHSSSSSYGGSGAGSEAGSYAGSRAGSGSGSGRNYRNGGSGSGSGY 120
SGSGSGSGSYSSSSSHSSSSSYGGSGAGSEAGSYAGSRAGSGSGSGRNYRNGGSGSGSGY
Sbjct: 61 SGSGSGSGSYSSSSSHSSSSSYGGSGAGSEAGSYAGSRAGSGSGSGRNYRNGGSGSGSGY 120
Query: 121 GEGSGRGNGEGYGEGHGYGEGRGYGGEGGNN 152
GEGSGRGNGEGYGEGHGYGEGRGYGGEGGNN
Sbjct: 121 GEGSGRGNGEGYGEGHGYGEGRGYGGEGGNN 151
BLAST of MS002300 vs. NCBI nr
Match:
XP_023515820.1 (putative glycine-rich cell wall structural protein 1 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 156.4 bits (394), Expect = 2.0e-34
Identity = 126/164 (76.83%), Positives = 133/164 (81.10%), Query Frame = 0
Query: 1 MASIRAIAFVCFVVCFCAIESQARVARKDLGLDLGGLGIGLGAGIGLGIGGGSGSGAGAG 60
M SIR+IA +CFVVC AIESQ RVARKDLGLDLGGLG+GLG G+GLG+GGGSGSG+G+G
Sbjct: 1 MTSIRSIAVLCFVVCMSAIESQGRVARKDLGLDLGGLGVGLGVGLGLGLGGGSGSGSGSG 60
Query: 61 SGSGSGSGSYSSSSSHSSSSSYGGSGAGSEAGSYAGSRAGS--GSGSGRNYRNGGS---- 120
SGSGSGSGSY SSS SSSSSYGGSGAGSEAGSYAGS AGS GS SGRNYRNGGS
Sbjct: 61 SGSGSGSGSY--SSSQSSSSSYGGSGAGSEAGSYAGSYAGSRGGSDSGRNYRNGGSGYGG 120
Query: 121 ----GSGSGYGEGSGRGNG---EGYGEGHGYGEGRGYGGEGGNN 152
G G GYGEG G G G EGYGEG GYGEGRGY GEGGNN
Sbjct: 121 GGGGGRGEGYGEGRGYGEGSGREGYGEGRGYGEGRGY-GEGGNN 161
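The Identity line of each HSP reports identical columns over the total alignment length, gap columns included (126 of 164 for the hit above). A minimal sketch of that calculation, assuming the two aligned strings have already been joined across the 60-column blocks; `percent_identity` is a hypothetical helper, not part of BLAST:

```python
def percent_identity(query_aln: str, sbjct_aln: str, gap: str = "-") -> tuple[int, int, float]:
    """Return (identities, alignment length, percent identity) for two aligned strings."""
    if len(query_aln) != len(sbjct_aln):
        raise ValueError("aligned strings must have the same length")
    if not query_aln:
        raise ValueError("alignment is empty")
    # A column counts as an identity only if both residues match and neither is a gap.
    identities = sum(1 for q, s in zip(query_aln, sbjct_aln) if q == s and q != gap)
    length = len(query_aln)  # gap columns are part of the denominator
    return identities, length, round(100.0 * identities / length, 2)


# The reported counts reproduce the percentage shown in the report.
print(f"{126 / 164:.2%}")  # 76.83%
```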
BLAST of MS002300 vs. NCBI nr
Match:
KAG6589746.1 (hypothetical protein SDJN03_15169, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 155.6 bits (392), Expect = 3.4e-34
Identity = 128/171 (74.85%), Positives = 134/171 (78.36%), Query Frame = 0
Query: 1 MASIRAIAFVCFVVCFCAIESQARVARKDLGLDLGGLGIGLGAGIGLGIGGGSGSGAGAG 60
M SIRAIA +CFVVC AIESQ RVARKDLGLDLGGLG+GLG G+GLG+GGGSGSG+G+G
Sbjct: 1 MTSIRAIAVLCFVVCMSAIESQGRVARKDLGLDLGGLGVGLGVGLGLGLGGGSGSGSGSG 60
Query: 61 SGSGSGSGSYSSSSSHSSSSSYGGSGAGSEAGSYAGSRAGS--GSGSGRNYRNGGS---- 120
SGSGSGSGSY SSS SSSSSYGGSGAGSEAGSYAGS AGS GS SGRNYRNGGS
Sbjct: 61 SGSGSGSGSY--SSSQSSSSSYGGSGAGSEAGSYAGSYAGSRGGSDSGRNYRNGGSGYGE 120
Query: 121 GSGSGYGEGSGRGNGEGYGEG---------HGYGEGRGYG-----GEGGNN 152
GSG GYG G G G GEGYGEG GYGEGRGYG GEGGNN
Sbjct: 121 GSGRGYGGGGGGGRGEGYGEGRGYGEGSGREGYGEGRGYGEGRGYGEGGNN 169
BLAST of MS002300 vs. NCBI nr
Match:
XP_038880102.1 (glycine-rich cell wall structural protein 2-like [Benincasa hispida])
HSP 1 Score: 144.4 bits (363), Expect = 7.8e-31
Identity = 114/152 (75.00%), Positives = 122/152 (80.26%), Query Frame = 0
Query: 1 MASIRAIAFVCFVVCFCAIESQA-RVARKDLGLDLGGLGIGLGAGIGLGIGGGSGSGAGA 60
MA+I+AIA +C V+C IES A RVARKDLGLDLGGLG+GLG G+GLG+GGGSGSGAGA
Sbjct: 1 MATIKAIAVLCLVLCMSVIESGAGRVARKDLGLDLGGLGVGLGLGLGLGLGGGSGSGAGA 60
Query: 61 GSGSGSGSGSYSSSSSHSSSSSYGGSGAGSEAGSYAGSRAGSGSGSGRNYRNGGSGSGSG 120
GSGSGSGSGSYSSSSSHSSSSSYGGSGAGSEAGSYAGS AGS +GS
Sbjct: 61 GSGSGSGSGSYSSSSSHSSSSSYGGSGAGSEAGSYAGSYAGSRAGS-------------- 120
Query: 121 YGEGSGRGNGEGYGEGHGYGEGRGYGGEGGNN 152
GSGRGNG G GEGHGYGEGRGY GEGGNN
Sbjct: 121 ---GSGRGNGRGGGEGHGYGEGRGY-GEGGNN 134
BLAST of MS002300 vs. NCBI nr
Match:
XP_022921885.1 (glycine-rich protein DOT1-like [Cucurbita moschata])
HSP 1 Score: 144.1 bits (362), Expect = 1.0e-30
Identity = 116/155 (74.84%), Positives = 122/155 (78.71%), Query Frame = 0
Query: 1 MASIRAIAFVCFVVCFCAIESQARVARKDLGLDLGGLGIGLGAGIGLGIGGGSGSGAGAG 60
M SIRAIA +CFVVC AIESQ RVARKDLGLDLGGLG+GLG G+GLG+GGGSGSG+G+G
Sbjct: 1 MTSIRAIAVLCFVVCMSAIESQGRVARKDLGLDLGGLGVGLGVGLGLGLGGGSGSGSGSG 60
Query: 61 SGSGSGSGSYSS--SSSHSSSSSYGGSGAGSEAGSYAGSRAGS--GSGSGRNYRNGGSGS 120
SGSGSGSGS S SSS SSSSSYGGSGAGSEAGSYAGS AGS GS SGRNYRNGGS
Sbjct: 61 SGSGSGSGSGSGSYSSSQSSSSSYGGSGAGSEAGSYAGSYAGSRGGSDSGRNYRNGGS-- 120
Query: 121 GSGYGEGSGRGNGEGYGEGHGYGEGRGYGGEGGNN 152
GYGEG GYGEGRGY GEGGNN
Sbjct: 121 --------------GYGEGRGYGEGRGY-GEGGNN 138
BLAST of MS002300 vs. ExPASy TrEMBL
Match:
A0A6J1C269 (glycine-rich cell wall structural protein 2-like OS=Momordica charantia OX=3673 GN=LOC111007265 PE=4 SV=1)
HSP 1 Score: 228.4 bits (581), Expect = 2.0e-56
Identity = 151/151 (100.00%), Positives = 151/151 (100.00%), Query Frame = 0
Query: 1 MASIRAIAFVCFVVCFCAIESQARVARKDLGLDLGGLGIGLGAGIGLGIGGGSGSGAGAG 60
MASIRAIAFVCFVVCFCAIESQARVARKDLGLDLGGLGIGLGAGIGLGIGGGSGSGAGAG
Sbjct: 1 MASIRAIAFVCFVVCFCAIESQARVARKDLGLDLGGLGIGLGAGIGLGIGGGSGSGAGAG 60
Query: 61 SGSGSGSGSYSSSSSHSSSSSYGGSGAGSEAGSYAGSRAGSGSGSGRNYRNGGSGSGSGY 120
SGSGSGSGSYSSSSSHSSSSSYGGSGAGSEAGSYAGSRAGSGSGSGRNYRNGGSGSGSGY
Sbjct: 61 SGSGSGSGSYSSSSSHSSSSSYGGSGAGSEAGSYAGSRAGSGSGSGRNYRNGGSGSGSGY 120
Query: 121 GEGSGRGNGEGYGEGHGYGEGRGYGGEGGNN 152
GEGSGRGNGEGYGEGHGYGEGRGYGGEGGNN
Sbjct: 121 GEGSGRGNGEGYGEGHGYGEGRGYGGEGGNN 151
BLAST of MS002300 vs. ExPASy TrEMBL
Match:
A0A6J1E720 (glycine-rich protein DOT1-like OS=Cucurbita moschata OX=3662 GN=LOC111430016 PE=4 SV=1)
HSP 1 Score: 144.1 bits (362), Expect = 4.9e-31
Identity = 116/155 (74.84%), Positives = 122/155 (78.71%), Query Frame = 0
Query: 1 MASIRAIAFVCFVVCFCAIESQARVARKDLGLDLGGLGIGLGAGIGLGIGGGSGSGAGAG 60
M SIRAIA +CFVVC AIESQ RVARKDLGLDLGGLG+GLG G+GLG+GGGSGSG+G+G
Sbjct: 1 MTSIRAIAVLCFVVCMSAIESQGRVARKDLGLDLGGLGVGLGVGLGLGLGGGSGSGSGSG 60
Query: 61 SGSGSGSGSYSS--SSSHSSSSSYGGSGAGSEAGSYAGSRAGS--GSGSGRNYRNGGSGS 120
SGSGSGSGS S SSS SSSSSYGGSGAGSEAGSYAGS AGS GS SGRNYRNGGS
Sbjct: 61 SGSGSGSGSGSGSYSSSQSSSSSYGGSGAGSEAGSYAGSYAGSRGGSDSGRNYRNGGS-- 120
Query: 121 GSGYGEGSGRGNGEGYGEGHGYGEGRGYGGEGGNN 152
GYGEG GYGEGRGY GEGGNN
Sbjct: 121 --------------GYGEGRGYGEGRGY-GEGGNN 138
BLAST of MS002300 vs. ExPASy TrEMBL
Match:
A0A0A0LW94 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G278490 PE=4 SV=1)
HSP 1 Score: 141.7 bits (356), Expect = 2.4e-30
Identity = 112/152 (73.68%), Positives = 120/152 (78.95%), Query Frame = 0
Query: 1 MASIRAIAFVCFVVCFCAIESQA-RVARKDLGLDLGGLGIGLGAGIGLGIGGGSGSGAGA 60
MASI+AIA +C V+C IES+A RVARKDLGLDLGGLG+GLG G+GLG+GGGSGSGAGA
Sbjct: 1 MASIKAIAVLCLVLCMSVIESEAGRVARKDLGLDLGGLGVGLGLGLGLGLGGGSGSGAGA 60
Query: 61 GSGSGSGSGSYSSSSSHSSSSSYGGSGAGSEAGSYAGSRAGSGSGSGRNYRNGGSGSGSG 120
GSGSGSGSGSYSSSSSHSSSSSYGGSGAGSEAGSYAGS AGS +GSG RNG SG
Sbjct: 61 GSGSGSGSGSYSSSSSHSSSSSYGGSGAGSEAGSYAGSYAGSRAGSGSGSRNGASG---- 120
Query: 121 YGEGSGRGNGEGYGEGHGYGEGRGYGGEGGNN 152
GEGHGYGEG GY GEGGNN
Sbjct: 121 -------------GEGHGYGEGHGY-GEGGNN 134
BLAST of MS002300 vs. ExPASy TrEMBL
Match:
A0A6J1E521 (putative glycine-rich cell wall structural protein 1 isoform X3 OS=Cucurbita moschata OX=3662 GN=LOC111430014 PE=4 SV=1)
HSP 1 Score: 140.6 bits (353), Expect = 5.4e-30
Identity = 116/151 (76.82%), Positives = 127/151 (84.11%), Query Frame = 0
Query: 1 MASIRAIAFVCFVVCFCAIESQARVARKDLGLDLGGLGIGLGAGIGLGIGG-GSGSGAGA 60
MASI ++ VCF++ F I SQARVARKDLGLDLGGLG+G+G GIGLG+GG GSGSG+G+
Sbjct: 1 MASINSLIVVCFLLSFSVILSQARVARKDLGLDLGGLGVGIGTGIGLGLGGSGSGSGSGS 60
Query: 61 GSGSGSGSGSYSSSSSHSSSSSYGGSGAGSEAGSYAGSRAGSGSGSGRNYRNGGSGSGSG 120
GSGSGSGS S SSSSS+SSSS GSGAGSEAGSYAGSRAGSGSG RNGGSGSGSG
Sbjct: 61 GSGSGSGSSSSSSSSSYSSSSG-SGSGAGSEAGSYAGSRAGSGSGPN---RNGGSGSGSG 120
Query: 121 YGEGSGRGN----GEGYGEGHGYGEGRGYGG 147
YG GSGRG+ GEGYGEGHGYGEGRGYGG
Sbjct: 121 YGGGSGRGSGNGGGEGYGEGHGYGEGRGYGG 147
BLAST of MS002300 vs. ExPASy TrEMBL
Match:
A0A6J1E2L1 (cell wall protein IFF6-like isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111430014 PE=4 SV=1)
HSP 1 Score: 138.3 bits (347), Expect = 2.7e-29
Identity = 118/168 (70.24%), Positives = 129/168 (76.79%), Query Frame = 0
Query: 1 MASIRAIAFVCFVVCFCAIESQARVARKDLGLDLGGLGIGLGAGIGLGIGG-GSGSGAGA 60
MASI ++ VCF++ F I SQARVARKDLGLDLGGLG+G+G GIGLG+GG GSGSG+G+
Sbjct: 1 MASINSLIVVCFLLSFSVILSQARVARKDLGLDLGGLGVGIGTGIGLGLGGSGSGSGSGS 60
Query: 61 GSGSGSGSGSYSSSSSHSSSSSYGGSGAGSEAGSYAGSRAGSGSGSGRN----------- 120
GSGSGSGS S SSSSS+SSSS GSGAGSEAGSYAGSRAGSGSGSG
Sbjct: 61 GSGSGSGSSSSSSSSSYSSSSG-SGSGAGSEAGSYAGSRAGSGSGSGAGSEAGSYAGSRA 120
Query: 121 ------YRNGGSGSGSGYGEGSGRGN----GEGYGEGHGYGEGRGYGG 147
RNGGSGSGSGYG GSGRG+ GEGYGEGHGYGEGRGYGG
Sbjct: 121 GSGSGPNRNGGSGSGSGYGGGSGRGSGNGGGEGYGEGHGYGEGRGYGG 167
BLAST of MS002300 vs. TAIR 10
Match:
AT4G30460.1 (glycine-rich protein)
HSP 1 Score: 91.3 bits (225), Expect = 7.3e-19
Identity = 95/151 (62.91%), Positives = 107/151 (70.86%), Query Frame = 0
Query: 6 AIAFVCFVVCFCAIESQARVARKDLGLDLGGLGIGLGAGIGLGIGGGSGSGAGAGSGSGS 65
++ + ++ + S++RVARKDLGLDLGG+G G+G GIG+G GGGSGSGAGAGSGSG
Sbjct: 9 SLILITLILATSVLVSESRVARKDLGLDLGGIGAGIGIGIGIG-GGGSGSGAGAGSGSGG 68
Query: 66 GSGSYSSSSSHSSSSSYGGSG--AGSEAGSYAGSRAGSG----SGSGRNYRNGGSGS-GS 125
G S SSSSS SSSSS GG G AGSEAGSYAGS AGSG SGSGR +GG G G
Sbjct: 69 GGSSSSSSSSSSSSSSSGGGGGDAGSEAGSYAGSHAGSGSGGRSGSGRGRGSGGGGGHGG 128
Query: 126 GYGEGSGRGNGEGYGEGHGYGEGRGYGGEGG 150
G G G GRG G G G G GYGEG GYGG G
Sbjct: 129 GGGGGGGRGGGGGSGNGEGYGEGGGYGGGYG 158
BLAST of MS002300 vs. TAIR 10
Match:
AT4G30450.1 (glycine-rich protein)
HSP 1 Score: 83.2 bits (204), Expect = 2.0e-16
Identity = 67/94 (71.28%), Positives = 80/94 (85.11%), Query Frame = 0
Query: 13 VVCFCAIESQARVARKDLGLDLGGLGIGLGAGIGLGIGGGSGSGAGAGSGSGSGSGSYSS 72
V+ + +++R+ARKDLG+DLGG+GIGLG G+G+G+GGGSGSGAGAGSGSGSGS S SS
Sbjct: 14 VISSLVMLTESRLARKDLGIDLGGIGIGLGVGLGIGLGGGSGSGAGAGSGSGSGSRSSSS 73
Query: 73 SSSHSSSSSYG-GSGAGSEAGSYAGSRAGSGSGS 106
SSS SSSSS G G AGS AGS+AGSRAGSGSG+
Sbjct: 74 SSSSSSSSSSGSGGSAGSSAGSFAGSRAGSGSGN 107
The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
XP_022135257.1 | 4.1e-56 | 100.00 | glycine-rich cell wall structural protein 2-like [Momordica charantia]
XP_023515820.1 | 2.0e-34 | 76.83 | putative glycine-rich cell wall structural protein 1 [Cucurbita pepo subsp. pepo...
KAG6589746.1 | 3.4e-34 | 74.85 | hypothetical protein SDJN03_15169, partial [Cucurbita argyrosperma subsp. sorori...
XP_038880102.1 | 7.8e-31 | 75.00 | glycine-rich cell wall structural protein 2-like [Benincasa hispida]
XP_022921885.1 | 1.0e-30 | 74.84 | glycine-rich protein DOT1-like [Cucurbita moschata]
Match Name | E-value | Identity (%) | Description
A0A6J1C269 | 2.0e-56 | 100.00 | glycine-rich cell wall structural protein 2-like OS=Momordica charantia OX=3673 ...
A0A6J1E720 | 4.9e-31 | 74.84 | glycine-rich protein DOT1-like OS=Cucurbita moschata OX=3662 GN=LOC111430016 PE=...
A0A0A0LW94 | 2.4e-30 | 73.68 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G278490 PE=4 SV=1
A0A6J1E521 | 5.4e-30 | 76.82 | putative glycine-rich cell wall structural protein 1 isoform X3 OS=Cucurbita mos...
A0A6J1E2L1 | 2.7e-29 | 70.24 | cell wall protein IFF6-like isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC11143...
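The hit tables above are pipe-delimited, so they can be loaded into records for filtering or sorting. A minimal sketch, assuming the first four fields of each row are match name, E-value, identity and description; `parse_hits` is a hypothetical helper, and any trailing fields are ignored:

```python
def parse_hits(text: str) -> list[dict]:
    """Parse pipe-delimited BLAST summary rows into dictionaries."""
    hits = []
    for line in text.splitlines():
        fields = [field.strip() for field in line.split("|")]
        # Skip blank lines and header rows.
        if len(fields) < 4 or fields[0] == "Match Name":
            continue
        name, evalue, identity, description = fields[:4]
        hits.append({
            "name": name,
            "evalue": float(evalue),      # e.g. "4.1e-56" parses directly
            "identity": float(identity),  # percent identity
            "description": description,
        })
    return hits


table = """Match Name | E-value | Identity | Description
XP_022135257.1 | 4.1e-56 | 100.00 | glycine-rich cell wall structural protein 2-like [Momordica charantia]
XP_022921885.1 | 1.0e-30 | 74.84 | glycine-rich protein DOT1-like [Cucurbita moschata]"""

# Keep only hits below an illustrative E-value cutoff of 1e-50.
strong = [hit["name"] for hit in parse_hits(table) if hit["evalue"] < 1e-50]
print(strong)  # ['XP_022135257.1']
```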