Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGTCCGCCGCCACCCACCTCTCCGGCAACCCAGATAAGATCTTCCACAAACCACCACATCGGCGGAACGCTTCCACCGAGCTCGACGTCTTCGAAGCCGCCCGCTACTTCTCCGGTAGCAACGAACTCATTTACACCAACACTTCATCATTCACTCACAATATCATCAGAGAGGATCACAGCAGCCGGAGAGGCGGAAGAATGAGCCTCGACTTGCCGTTAAGAACCAATGCAGCTCCACCGCCGCCGTACACGGCGGAGAAGCCACCAATCAAAGAGAAAAAACACCGGCAACAACCCAGCTCCCCGGCTGGTCGGCTCGCCAATTTCCTCAATTCCCTCTTCAACCAATCAGCTTCTCAAAAGAAAAAGAAACCCAAAAATGGCATGAAAAACGATGAATTTCATCACGAAATTGGAGCAAGAAAGAGAAGAAGCAGCTTAAGTTCGCTCATCGATTCAAAATCATCTTCAAACATCTCAGCTCCCAACAAAAATTACAAGCAAATCAGAAACCTTTTGGATCAGACACGACCAGCTTCCAAGAATCTCGGAAATGTTAAGCAAAATTCTGATCACCCGACCAGAAAAATCAAAGATGTCGACCAAGATGCTCGAAGTGATTCAAGTTCCGATCTTTTTGAGCTACGAATCTACGACTTCGATTGA
mRNA sequence
ATGTCCGCCGCCACCCACCTCTCCGGCAACCCAGATAAGATCTTCCACAAACCACCACATCGGCGGAACGCTTCCACCGAGCTCGACGTCTTCGAAGCCGCCCGCTACTTCTCCGGTAGCAACGAACTCATTTACACCAACACTTCATCATTCACTCACAATATCATCAGAGAGGATCACAGCAGCCGGAGAGGCGGAAGAATGAGCCTCGACTTGCCGTTAAGAACCAATGCAGCTCCACCGCCGCCGTACACGGCGGAGAAGCCACCAATCAAAGAGAAAAAACACCGGCAACAACCCAGCTCCCCGGCTGGTCGGCTCGCCAATTTCCTCAATTCCCTCTTCAACCAATCAGCTTCTCAAAAGAAAAAGAAACCCAAAAATGGCATGAAAAACGATGAATTTCATCACGAAATTGGAGCAAGAAAGAGAAGAAGCAGCTTAAGTTCGCTCATCGATTCAAAATCATCTTCAAACATCTCAGCTCCCAACAAAAATTACAAGCAAATCAGAAACCTTTTGGATCAGACACGACCAGCTTCCAAGAATCTCGGAAATGTTAAGCAAAATTCTGATCACCCGACCAGAAAAATCAAAGATGTCGACCAAGATGCTCGAAGTGATTCAAGTTCCGATCTTTTTGAGCTACGAATCTACGACTTCGATTGA
Coding sequence (CDS)
ATGTCCGCCGCCACCCACCTCTCCGGCAACCCAGATAAGATCTTCCACAAACCACCACATCGGCGGAACGCTTCCACCGAGCTCGACGTCTTCGAAGCCGCCCGCTACTTCTCCGGTAGCAACGAACTCATTTACACCAACACTTCATCATTCACTCACAATATCATCAGAGAGGATCACAGCAGCCGGAGAGGCGGAAGAATGAGCCTCGACTTGCCGTTAAGAACCAATGCAGCTCCACCGCCGCCGTACACGGCGGAGAAGCCACCAATCAAAGAGAAAAAACACCGGCAACAACCCAGCTCCCCGGCTGGTCGGCTCGCCAATTTCCTCAATTCCCTCTTCAACCAATCAGCTTCTCAAAAGAAAAAGAAACCCAAAAATGGCATGAAAAACGATGAATTTCATCACGAAATTGGAGCAAGAAAGAGAAGAAGCAGCTTAAGTTCGCTCATCGATTCAAAATCATCTTCAAACATCTCAGCTCCCAACAAAAATTACAAGCAAATCAGAAACCTTTTGGATCAGACACGACCAGCTTCCAAGAATCTCGGAAATGTTAAGCAAAATTCTGATCACCCGACCAGAAAAATCAAAGATGTCGACCAAGATGCTCGAAGTGATTCAAGTTCCGATCTTTTTGAGCTACGAATCTACGACTTCGATTGA
Protein sequence
MSAATHLSGNPDKIFHKPPHRRNASTELDVFEAARYFSGSNELIYTNTSSFTHNIIREDHSSRRGGRMSLDLPLRTNAAPPPPYTAEKPPIKEKKHRQQPSSPAGRLANFLNSLFNQSASQKKKKPKNGMKNDEFHHEIGARKRRSSLSSLIDSKSSSNISAPNKNYKQIRNLLDQTRPASKNLGNVKQNSDHPTRKIKDVDQDARSDSSSDLFELRIYDFD
Homology
BLAST of CmoCh18G011190 vs. ExPASy Swiss-Prot
Match:
Q93Z37 (Protein BIG GRAIN 1-like E OS=Arabidopsis thaliana OX=3702 GN=At1g69160 PE=2 SV=1)
HSP 1 Score: 77.4 bits (189), Expect = 2.3e-13
Identity = 92/281 (32.74%), Postives = 123/281 (43.77%), Query Frame = 0
Query: 20 HRRNASTELDVFEAARYFSGSNELI---YTNTSSFTHNIIREDHSSR-----RGGRMSLD 79
H+RN S ELDVFEAA YF G NE + +T + +N RE++ R G R+SLD
Sbjct: 22 HKRN-SEELDVFEAAVYF-GYNEASSGDHGHTQKYGYNAAREENPRRWGILGGGRRISLD 81
Query: 80 LPLRTNAA---------PPPPYTAEKPPIKEKKHRQQPSSPAGRLANFLNSLFNQSASQK 139
LP+R + T K + +H+ QPSSP G++A+FLNSLF+Q+ S+K
Sbjct: 82 LPIRCSEQVYHLQQDHHEKHEVTTIKERLGNVRHK-QPSSPGGKIASFLNSLFHQAGSKK 141
Query: 140 ---KKKPKNGMKNDEFHHEIGA-----RKRRSSLS----------------------SLI 199
K K K + E EI R+RRSS+S SLI
Sbjct: 142 NKSKSKSKTKPTDPEVEEEIPGGGWMRRRRRSSISHFFSSSRSTSTTTTTTASSSSKSLI 201
Query: 200 DSKSSS------NISAPNKNYKQIRNLLDQTRPASKNLGNVKQNSDHPTRKIK------- 221
S SS ++ P KNYKQ N T+ + + K+K
Sbjct: 202 SSSSSGFRTPPPYLNTPTKNYKQFLNYTSATKQVGEEETKTNKEYSWLDEKLKVMESLSE 261
BLAST of CmoCh18G011190 vs. ExPASy TrEMBL
Match:
A0A6J1G172 (protein BIG GRAIN 1-like E OS=Cucurbita moschata OX=3662 GN=LOC111449738 PE=3 SV=1)
HSP 1 Score: 432.6 bits (1111), Expect = 1.0e-117
Identity = 222/222 (100.00%), Postives = 222/222 (100.00%), Query Frame = 0
Query: 1 MSAATHLSGNPDKIFHKPPHRRNASTELDVFEAARYFSGSNELIYTNTSSFTHNIIREDH 60
MSAATHLSGNPDKIFHKPPHRRNASTELDVFEAARYFSGSNELIYTNTSSFTHNIIREDH
Sbjct: 1 MSAATHLSGNPDKIFHKPPHRRNASTELDVFEAARYFSGSNELIYTNTSSFTHNIIREDH 60
Query: 61 SSRRGGRMSLDLPLRTNAAPPPPYTAEKPPIKEKKHRQQPSSPAGRLANFLNSLFNQSAS 120
SSRRGGRMSLDLPLRTNAAPPPPYTAEKPPIKEKKHRQQPSSPAGRLANFLNSLFNQSAS
Sbjct: 61 SSRRGGRMSLDLPLRTNAAPPPPYTAEKPPIKEKKHRQQPSSPAGRLANFLNSLFNQSAS 120
Query: 121 QKKKKPKNGMKNDEFHHEIGARKRRSSLSSLIDSKSSSNISAPNKNYKQIRNLLDQTRPA 180
QKKKKPKNGMKNDEFHHEIGARKRRSSLSSLIDSKSSSNISAPNKNYKQIRNLLDQTRPA
Sbjct: 121 QKKKKPKNGMKNDEFHHEIGARKRRSSLSSLIDSKSSSNISAPNKNYKQIRNLLDQTRPA 180
Query: 181 SKNLGNVKQNSDHPTRKIKDVDQDARSDSSSDLFELRIYDFD 223
SKNLGNVKQNSDHPTRKIKDVDQDARSDSSSDLFELRIYDFD
Sbjct: 181 SKNLGNVKQNSDHPTRKIKDVDQDARSDSSSDLFELRIYDFD 222
BLAST of CmoCh18G011190 vs. ExPASy TrEMBL
Match:
A0A6J1I085 (protein BIG GRAIN 1-like E OS=Cucurbita maxima OX=3661 GN=LOC111467800 PE=3 SV=1)
HSP 1 Score: 414.5 bits (1064), Expect = 2.9e-112
Identity = 213/222 (95.95%), Postives = 215/222 (96.85%), Query Frame = 0
Query: 1 MSAATHLSGNPDKIFHKPPHRRNASTELDVFEAARYFSGSNELIYTNTSSFTHNIIREDH 60
MSAATHL GNPDKIFHKPPHRRN STELDVFEAARYFSGSNEL YTNTSSFTH II EDH
Sbjct: 1 MSAATHLPGNPDKIFHKPPHRRNDSTELDVFEAARYFSGSNELAYTNTSSFTHKIIIEDH 60
Query: 61 SSRRGGRMSLDLPLRTNAAPPPPYTAEKPPIKEKKHRQQPSSPAGRLANFLNSLFNQSAS 120
S RRGGRMSLDLPLRTNAAPPPPYTAEKPPIKEKKHRQQPSSPAGRLANFLNSLFNQSAS
Sbjct: 61 SRRRGGRMSLDLPLRTNAAPPPPYTAEKPPIKEKKHRQQPSSPAGRLANFLNSLFNQSAS 120
Query: 121 QKKKKPKNGMKNDEFHHEIGARKRRSSLSSLIDSKSSSNISAPNKNYKQIRNLLDQTRPA 180
QKKKKPKNG KNDEFHHEIGARKRRSSLSSLIDSKSSSNISAPNKNYK+IRNLLDQTRPA
Sbjct: 121 QKKKKPKNGTKNDEFHHEIGARKRRSSLSSLIDSKSSSNISAPNKNYKEIRNLLDQTRPA 180
Query: 181 SKNLGNVKQNSDHPTRKIKDVDQDARSDSSSDLFELRIYDFD 223
SKNLGNVKQNSDHPTRKIKDVD+DARSDSSSDLFELRIYDFD
Sbjct: 181 SKNLGNVKQNSDHPTRKIKDVDEDARSDSSSDLFELRIYDFD 222
BLAST of CmoCh18G011190 vs. ExPASy TrEMBL
Match:
A0A6J1D6V3 (protein BIG GRAIN 1-like E OS=Momordica charantia OX=3673 GN=LOC111017581 PE=3 SV=1)
HSP 1 Score: 248.1 bits (632), Expect = 3.6e-62
Identity = 156/255 (61.18%), Postives = 172/255 (67.45%), Query Frame = 0
Query: 1 MSAATHLSGNPDKIFHKPPHRRNASTELDVFEAARYFSGSNELIYTNTSSFTHNIIREDH 60
MS +HLS NPDKIF K HRRN S ELDVFEAARYFSG NE+ YT + I REDH
Sbjct: 1 MSITSHLSDNPDKIFKKSFHRRNNSGELDVFEAARYFSGCNEVSYTGGGGY-QKIFREDH 60
Query: 61 SSRRG-GRMSLDLPLRTNAA--PPPPYTAEKPPIKEKKHRQQPSSPAGRLANFLNSLFNQ 120
RRG GRMSLDLP+RTN PPPPYTAEK IKEKKHRQQPSSP GRLANFLNSLFNQ
Sbjct: 61 GGRRGAGRMSLDLPIRTNVVPLPPPPYTAEKQTIKEKKHRQQPSSPGGRLANFLNSLFNQ 120
Query: 121 SASQKKKKPKNGMKNDEFHHEIGARKRRSSLSSLIDSKSSSNIS---------------A 180
SAS+KKKKPKN +K+++ RKRRSSLS+LIDSKSSSN S
Sbjct: 121 SASKKKKKPKNPIKSEDQEIGSAGRKRRSSLSNLIDSKSSSNFSVSGFRTPPTPNCTLQT 180
Query: 181 PNKNYKQIRNLLDQTRPASKNLG-----------NVKQNSD----HPTRKIKDVDQDARS 223
PNKNYK+IR LL ++ NLG N KQNSD + RKI +D+ A S
Sbjct: 181 PNKNYKEIRTLLSS---SNNNLGNGSFNQVAYNNNYKQNSDRHQENEIRKINGLDEGADS 240
BLAST of CmoCh18G011190 vs. ExPASy TrEMBL
Match:
A0A0A0KSX8 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G606790 PE=3 SV=1)
HSP 1 Score: 241.1 bits (614), Expect = 4.3e-60
Identity = 145/245 (59.18%), Postives = 169/245 (68.98%), Query Frame = 0
Query: 1 MSAATHLSGNPDKIFHKPPHRRNASTELDVFEAARYFSGSNELIYTNTSSFTHNIIREDH 60
MS +HLSG+PDKIF K HRR S ELDVFEAARYF+GSNE YT +++ ++ H
Sbjct: 1 MSVTSHLSGHPDKIFRKSLHRRKDSGELDVFEAARYFAGSNEPSYTTAATYETSLF---H 60
Query: 61 SSRRGGRMSLDLPLRTNA--APPPPYTAEKPPIKE-KKHRQQPSSPAGRLANFLNSLFNQ 120
RRGGRMSLDLPLRTN P PPYTAEK +K+ KKHRQQPSSP GRLANFLNSLFNQ
Sbjct: 61 GGRRGGRMSLDLPLRTNVIPLPTPPYTAEKQSVKDPKKHRQQPSSPGGRLANFLNSLFNQ 120
Query: 121 SASQKKKKPKNGMKNDEFHHEIGARKRRSSLSSLIDSKSSSN------------------ 180
SAS+KKKKPKN +K ++ HHE+GARKRRSSLS+LID+KSSS+
Sbjct: 121 SASKKKKKPKNSIKTEDLHHEMGARKRRSSLSTLIDTKSSSHSSSKISGFRTPPAPNCCT 180
Query: 181 ISAPNKNYKQIRNLLDQTRPA----SKNLGNVKQNSDHPTRKIKDVDQDARSDSSSDLFE 221
+ PNKNY +IR+ LDQ R SK G Q S+ RK+ D D+ SDSSSDLFE
Sbjct: 181 VQTPNKNYMEIRSFLDQKREGINYYSKKNGINNQKSE--MRKMNDQDEGDESDSSSDLFE 240
BLAST of CmoCh18G011190 vs. ExPASy TrEMBL
Match:
A0A5D3CB02 (Protein BIG GRAIN 1-like E OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold475G00190 PE=3 SV=1)
HSP 1 Score: 233.4 bits (594), Expect = 9.1e-58
Identity = 143/245 (58.37%), Postives = 163/245 (66.53%), Query Frame = 0
Query: 1 MSAATHLSGNPDKIFHKPPHRRNASTELDVFEAARYFSGSNELIYTNTSSFTHNIIREDH 60
MS +HLSG+P+KIF K HRR S ELDVFEAA YFSGSNE NT++ + H
Sbjct: 1 MSVTSHLSGHPNKIFRKSLHRRKDSDELDVFEAASYFSGSNEPCNYNTATTYETSL--FH 60
Query: 61 SSRRGGRMSLDLPLRTNA--APPPPYTAEKPPIKE-KKHRQQPSSPAGRLANFLNSLFNQ 120
RRGGRMSLDLPLRTN PPPPYTAEK IK+ KKHRQQPSSP GRLANFLNSLFNQ
Sbjct: 61 GGRRGGRMSLDLPLRTNVIPLPPPPYTAEKQSIKDPKKHRQQPSSPGGRLANFLNSLFNQ 120
Query: 121 SASQKKKKPKNGMKNDEFHHEIGARKRRSSLSSLIDSKSSSN------------------ 180
S S+KKKKPKN +K ++ HHEIGARKRRSSLS+LID+KSSSN
Sbjct: 121 SGSKKKKKPKNSIKTEDLHHEIGARKRRSSLSTLIDTKSSSNNSSKVSGFRTPPAPNCCS 180
Query: 181 ISAPNKNYKQIRNLLDQTRP----ASKNLGNVKQNSDHPTRKIKDVDQDARSDSSSDLFE 221
+ P+K Y +IR+ LDQ R + KN N RK+ D D+ SDSSSDLFE
Sbjct: 181 VQTPSKKYMEIRSFLDQKREGNYYSKKNNRGNGINEKSEMRKMNDEDEGDESDSSSDLFE 240
BLAST of CmoCh18G011190 vs. TAIR 10
Match:
AT1G69160.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G13980.1); Has 173 Blast hits to 172 proteins in 54 species: Archae - 0; Bacteria - 0; Metazoa - 25; Fungi - 33; Plants - 84; Viruses - 2; Other Eukaryotes - 29 (source: NCBI BLink). )
HSP 1 Score: 77.4 bits (189), Expect = 1.6e-14
Identity = 92/281 (32.74%), Postives = 123/281 (43.77%), Query Frame = 0
Query: 20 HRRNASTELDVFEAARYFSGSNELI---YTNTSSFTHNIIREDHSSR-----RGGRMSLD 79
H+RN S ELDVFEAA YF G NE + +T + +N RE++ R G R+SLD
Sbjct: 22 HKRN-SEELDVFEAAVYF-GYNEASSGDHGHTQKYGYNAAREENPRRWGILGGGRRISLD 81
Query: 80 LPLRTNAA---------PPPPYTAEKPPIKEKKHRQQPSSPAGRLANFLNSLFNQSASQK 139
LP+R + T K + +H+ QPSSP G++A+FLNSLF+Q+ S+K
Sbjct: 82 LPIRCSEQVYHLQQDHHEKHEVTTIKERLGNVRHK-QPSSPGGKIASFLNSLFHQAGSKK 141
Query: 140 ---KKKPKNGMKNDEFHHEIGA-----RKRRSSLS----------------------SLI 199
K K K + E EI R+RRSS+S SLI
Sbjct: 142 NKSKSKSKTKPTDPEVEEEIPGGGWMRRRRRSSISHFFSSSRSTSTTTTTTASSSSKSLI 201
Query: 200 DSKSSS------NISAPNKNYKQIRNLLDQTRPASKNLGNVKQNSDHPTRKIK------- 221
S SS ++ P KNYKQ N T+ + + K+K
Sbjct: 202 SSSSSGFRTPPPYLNTPTKNYKQFLNYTSATKQVGEEETKTNKEYSWLDEKLKVMESLSE 261
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Q93Z37 | 2.3e-13 | 32.74 | Protein BIG GRAIN 1-like E OS=Arabidopsis thaliana OX=3702 GN=At1g69160 PE=2 SV=... | [more] |
Match Name | E-value | Identity | Description | |
A0A6J1G172 | 1.0e-117 | 100.00 | protein BIG GRAIN 1-like E OS=Cucurbita moschata OX=3662 GN=LOC111449738 PE=3 SV... | [more] |
A0A6J1I085 | 2.9e-112 | 95.95 | protein BIG GRAIN 1-like E OS=Cucurbita maxima OX=3661 GN=LOC111467800 PE=3 SV=1 | [more] |
A0A6J1D6V3 | 3.6e-62 | 61.18 | protein BIG GRAIN 1-like E OS=Momordica charantia OX=3673 GN=LOC111017581 PE=3 S... | [more] |
A0A0A0KSX8 | 4.3e-60 | 59.18 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G606790 PE=3 SV=1 | [more] |
A0A5D3CB02 | 9.1e-58 | 58.37 | Protein BIG GRAIN 1-like E OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaff... | [more] |
Match Name | E-value | Identity | Description | |
AT1G69160.1 | 1.6e-14 | 32.74 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... | [more] |