Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: polypeptideexonCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGTTTCTGGGCTTGGATCTTCAGGCGTTTTCATGTTGCCGCTCAATCGTTTAGCGCTAATCGCTTGGCTTGTAAAGCATTTTCCAGCTGTTGCGATCCATATCAGAGGCGTGCGCGGCAAGATCATCCGGTTAGCCTCTCATTGGTTGCTCTGATATTCGTCCGCGATTGGTTGATTCTTCGCGAGAGTCACTCCTGGAAATTATGCACTACTCGCATTTTTGTTTGGTAGGTTCAATTGCAGGATTTTTCTTCATTTCGCATCTGTCTTGATGGGTATTTTGGTTGTTCTGGATTTTGCGTTTCCCCCTTTAACTGGACTTGAGTGGTCCTTTACTTGATGCGTAGCTCGAGATGGAACCTCCTTTTTTTATAGTTTGAATGATTTTCTTTGCTTCTTCGCTGTGTTCTCTAGTGACAGCTCCGATCGGCGAAAATCTGTTTTGTGGTACTCAAGATTTTGGATTTGAGGGGTATCTATTTTGACATGGACTTGATTTCTTTGAGTTTGTTTGAATTGACTGAGACTATAATCAATCTGTTGAAACTTTATAATCCATGCATTTGGGGGTAAGTGGGGGTGCTTGACAATAGATTATTGGCTGTAGACGTGTGTTTTTGATCGAAGTTGACCTTTTGCCACTCTCTCTTTATTCCAGTCTATTTTTCATGATTACTAGATTGGCTCAAAGAGAACGAAGAATAATAATTTATGTTATGCAGCTCTTTGGTTGTAATTACATGTTTAAAGTATTGCCCTCAATTGATTGTATACTTGGAACAGCGAGATTCTTCAAATCTTCCATATTTGCCTTAGATAATAGTGATTCTTTTGGGAGATTGCCAGATTGGAGTATGATACTTGCAAACCGAAGTAACGAGAAACTAAACTACAAAAATGAGTGGAGGGGTCTCAAGGTGAAGGTGAAGGTAGAGGTGGGATTGATTTTGTGTTAATATATTCATACACGATGTTTTATTATCTTTAGTTTATGAAGTACTCGAGTCATAAAATAGAGGTATAATGTTCTAAGGATTTGAATTCTTTTGTGCTGTTGTATGTTTCTAGGTTCCACCCTATGCTTTTTAATTGGGTTCTTTGTGTAGGTCTACTCGTACTTGGATGCTTCATCCATTAAGGGAGTTCATTGGAGAGCTGTTCAGACGGCGATCTTCATTAATGGTAGGGGATCTTCTTTAATGGCAGAGACAATTTTTCTTGGAATTTCGATTTACATTTTAATTGAAAACTAAGTTTTTATCGTGTTTGGATTATTGATTAATGGTCTAATTTTTTTTTTTTTTGAGTTCAACAATGTGTCGGGTGAGCTATTTGATAGAGGGTACATCTTTTATGCCAATTGTGCTATACTTATGTTGGTTGATTAGTAGCATCTTTCAGGAGTCTACGTTATTTGATAGAGGGTACATCTTTTATGTTGGGTGAGCTATTTGTCGGGTGAGCTATTTGAGTTCAACAATTGTGCTATACAATGTGTCGGGTGAGCTATTTCTCTTGCAGGAGTCTACGTTAAAAGACTGCTAAACGACGACTAAATAAAGCTTTAAATTTTGAGCGTGATTAAGCAGCTGGCTAGGAGCTGTTTTTTCTATGACAGTTTCCTATTTCCAAGAACAAAAACTAATGAACACGTACTTGCTTCTAGTAGTTTTGGATTTGTTATCTTGTTCTCTTAATCTGTTCTGACATGATGTTTTCATTTTAGAAGAATTGAGATTGGCAAGTGCAAGTAGTTCTTTTCTTGATATCCAAGAGCGTCTGGGATGCTTGTGCGCACCTCAACTATTCTCTGGGACAAACTCCTTACCCTATTATATTTGGTTATCAAGGTAACTTGCTGACTTGAACTCATTCTCTCTAAAAGCTCATTATTATTTACATTGTCTTCTTGACTAGTTGGCCAACCCATCATAGTTCATAGCGATAGGTTAAAAACCAAAAGTTCAAAATGATTTGAAAACAATCTCTCTTAGATAGGTACAAAATTTTCTCCTCCAAAACCTGATATTGTTGCTGCTCATTTGAGTTTCCATTTTCGTTACATGCACGACGATTCCATCTCACTGTCACACCTGACTGGACTTACTTGATATGTTGCATATGTAGGCATTTCTGTGAAGGGGCAAGTGTCAGTTTCAATTTAAGGGCGAGGTAAGTGTATGATTGACCGATTAGTAGTTGTTGGACGTTGGAAACCTTGGAAAAGGAATGTCCGGGTTATTCTTTACCCCATTGACTAATGCAACATATCAAGAAGCGTTTCCTAGAGTTGGTTATTGTTTGATTTACCTCACATAACTTTTCCAAGGCAACTCCATTTCTAAAGCTGTTGAAAATAAGTTGGGCTCATTTCAATACAGATGGAAAACAAAAGGACAACTCTCATTCTTTGTCTAACCCCAATTCATTTACCAACACTTGAGAGTTATTTTTCTGATCCATTCTACTTTCAAGTTTGCATCGGATTGCAGTTGTGATGAATCGATGATAAAACTTACACAGAACACAAATATTGATGCCGCAGCATGAGAGAAGAGTGAAGAATTCAAATTTTGATGAAGCCGGAAATGAAGCCATAGTTCAATTAGAAGCTACAGAGGGCTAAATTCCAACACTTGGG
mRNA sequence
ATGGTTTCTGGGCTTGGATCTTCAGGCGTTTTCATGTTGCCGCTCAATCGTTTAGCGCTAATCGCTTGGCTTGTAAAGCATTTTCCAGCTGTTGCGATCCATATCAGAGGCGTGCGCGGCAAGATCATCCGGTTCAATTGCAGGATTTTTCTTCATTTCGCATCTGTCTTGATGGCGAGATTCTTCAAATCTTCCATATTTGCCTTAGATAATAGTGATTCTTTTGGGAGATTGCCAGATTGGAGTATGATACTTGCAAACCGAAGTAACGAGAAACTAAACTACAAAAATGAGTGGAGGGGTCTCAAGGTGAAGGTGAAGGTAGAGGTCTACTCGTACTTGGATGCTTCATCCATTAAGGGAGTTCATTGGAGAGCTGTTCAGACGGCGATCTTCATTAATGAATTGAGATTGGCAAGTGCAAGTAGTTCTTTTCTTGATATCCAAGAGCGTCTGGGATGCTTGTGCGCACCTCAACTATTCTCTGGGACAAACTCCTTACCCTATTATATTTGGTTATCAAGGCATTTCTGTGAAGGGGCAAGTGTCAGTTTCAATTTAAGGGCGAGGTAAGTGTATGATTGACCGATTAGTAGTTGTTGGACGTTGGAAACCTTGGAAAAGGAATGTCCGGGTTATTCTTTACCCCATTGACTAATGCAACATATCAAGAAGCGTTTCCTAGAGTTGGTTATTGTTTGATTTACCTCACATAACTTTTCCAAGGCAACTCCATTTCTAAAGCTGTTGAAAATAAGTTGGGCTCATTTCAATACAGATGGAAAACAAAAGGACAACTCTCATTCTTTGTCTAACCCCAATTCATTTACCAACACTTGAGAGTTATTTTTCTGATCCATTCTACTTTCAAGTTTGCATCGGATTGCAGTTGTGATGAATCGATGATAAAACTTACACAGAACACAAATATTGATGCCGCAGCATGAGAGAAGAGTGAAGAATTCAAATTTTGATGAAGCCGGAAATGAAGCCATAGTTCAATTAGAAGCTACAGAGGGCTAAATTCCAACACTTGGG
Coding sequence (CDS)
ATGGTTTCTGGGCTTGGATCTTCAGGCGTTTTCATGTTGCCGCTCAATCGTTTAGCGCTAATCGCTTGGCTTGTAAAGCATTTTCCAGCTGTTGCGATCCATATCAGAGGCGTGCGCGGCAAGATCATCCGGTTCAATTGCAGGATTTTTCTTCATTTCGCATCTGTCTTGATGGCGAGATTCTTCAAATCTTCCATATTTGCCTTAGATAATAGTGATTCTTTTGGGAGATTGCCAGATTGGAGTATGATACTTGCAAACCGAAGTAACGAGAAACTAAACTACAAAAATGAGTGGAGGGGTCTCAAGGTGAAGGTGAAGGTAGAGGTCTACTCGTACTTGGATGCTTCATCCATTAAGGGAGTTCATTGGAGAGCTGTTCAGACGGCGATCTTCATTAATGAATTGAGATTGGCAAGTGCAAGTAGTTCTTTTCTTGATATCCAAGAGCGTCTGGGATGCTTGTGCGCACCTCAACTATTCTCTGGGACAAACTCCTTACCCTATTATATTTGGTTATCAAGGCATTTCTGTGAAGGGGCAAGTGTCAGTTTCAATTTAAGGGCGAGGTAA
Protein sequence
MVSGLGSSGVFMLPLNRLALIAWLVKHFPAVAIHIRGVRGKIIRFNCRIFLHFASVLMARFFKSSIFALDNSDSFGRLPDWSMILANRSNEKLNYKNEWRGLKVKVKVEVYSYLDASSIKGVHWRAVQTAIFINELRLASASSSFLDIQERLGCLCAPQLFSGTNSLPYYIWLSRHFCEGASVSFNLRAR
Homology
BLAST of CmaCh06G000920 vs. ExPASy TrEMBL
Match:
A0A6J1I578 (uncharacterized protein LOC111470024 isoform X3 OS=Cucurbita maxima OX=3661 GN=LOC111470024 PE=4 SV=1)
HSP 1 Score: 266.2 bits (679), Expect = 1.1e-67
Identity = 132/133 (99.25%), Postives = 132/133 (99.25%), Query Frame = 0
Query: 59 ARFFKSSIFALDNSDSFGRLPDWSMILANRSNEKLNYKNEWRGLKVKVKVEVYSYLDASS 118
ARFFKSSIFALDNSDSFGRLPDWSMILANRSNEKLNYKNEWRGLKVKVKVEVYSYLDASS
Sbjct: 39 ARFFKSSIFALDNSDSFGRLPDWSMILANRSNEKLNYKNEWRGLKVKVKVEVYSYLDASS 98
Query: 119 IKGVHWRAVQTAIFIN-ELRLASASSSFLDIQERLGCLCAPQLFSGTNSLPYYIWLSRHF 178
IKGVHWRAVQTAIFIN ELRLASASSSFLDIQERLGCLCAPQLFSGTNSLPYYIWLSRHF
Sbjct: 99 IKGVHWRAVQTAIFINEELRLASASSSFLDIQERLGCLCAPQLFSGTNSLPYYIWLSRHF 158
Query: 179 CEGASVSFNLRAR 191
CEGASVSFNLRAR
Sbjct: 159 CEGASVSFNLRAR 171
BLAST of CmaCh06G000920 vs. ExPASy TrEMBL
Match:
A0A6J1I6A1 (uncharacterized protein LOC111470024 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111470024 PE=4 SV=1)
HSP 1 Score: 237.3 bits (604), Expect = 5.4e-59
Identity = 116/116 (100.00%), Postives = 116/116 (100.00%), Query Frame = 0
Query: 59 ARFFKSSIFALDNSDSFGRLPDWSMILANRSNEKLNYKNEWRGLKVKVKVEVYSYLDASS 118
ARFFKSSIFALDNSDSFGRLPDWSMILANRSNEKLNYKNEWRGLKVKVKVEVYSYLDASS
Sbjct: 39 ARFFKSSIFALDNSDSFGRLPDWSMILANRSNEKLNYKNEWRGLKVKVKVEVYSYLDASS 98
Query: 119 IKGVHWRAVQTAIFINELRLASASSSFLDIQERLGCLCAPQLFSGTNSLPYYIWLS 175
IKGVHWRAVQTAIFINELRLASASSSFLDIQERLGCLCAPQLFSGTNSLPYYIWLS
Sbjct: 99 IKGVHWRAVQTAIFINELRLASASSSFLDIQERLGCLCAPQLFSGTNSLPYYIWLS 154
BLAST of CmaCh06G000920 vs. ExPASy TrEMBL
Match:
A0A6J1I2S8 (uncharacterized protein LOC111470024 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111470024 PE=4 SV=1)
HSP 1 Score: 232.6 bits (592), Expect = 1.3e-57
Identity = 116/117 (99.15%), Postives = 116/117 (99.15%), Query Frame = 0
Query: 59 ARFFKSSIFALDNSDSFGRLPDWSMILANRSNEKLNYKNEWRGLKVKVKVEVYSYLDASS 118
ARFFKSSIFALDNSDSFGRLPDWSMILANRSNEKLNYKNEWRGLKVKVKVEVYSYLDASS
Sbjct: 39 ARFFKSSIFALDNSDSFGRLPDWSMILANRSNEKLNYKNEWRGLKVKVKVEVYSYLDASS 98
Query: 119 IKGVHWRAVQTAIFIN-ELRLASASSSFLDIQERLGCLCAPQLFSGTNSLPYYIWLS 175
IKGVHWRAVQTAIFIN ELRLASASSSFLDIQERLGCLCAPQLFSGTNSLPYYIWLS
Sbjct: 99 IKGVHWRAVQTAIFINEELRLASASSSFLDIQERLGCLCAPQLFSGTNSLPYYIWLS 155
BLAST of CmaCh06G000920 vs. ExPASy TrEMBL
Match:
A0A6J1I2T3 (uncharacterized protein LOC111470024 isoform X4 OS=Cucurbita maxima OX=3661 GN=LOC111470024 PE=4 SV=1)
HSP 1 Score: 184.1 bits (466), Expect = 5.4e-43
Identity = 92/93 (98.92%), Postives = 92/93 (98.92%), Query Frame = 0
Query: 83 MILANRSNEKLNYKNEWRGLKVKVKVEVYSYLDASSIKGVHWRAVQTAIFIN-ELRLASA 142
MILANRSNEKLNYKNEWRGLKVKVKVEVYSYLDASSIKGVHWRAVQTAIFIN ELRLASA
Sbjct: 1 MILANRSNEKLNYKNEWRGLKVKVKVEVYSYLDASSIKGVHWRAVQTAIFINEELRLASA 60
Query: 143 SSSFLDIQERLGCLCAPQLFSGTNSLPYYIWLS 175
SSSFLDIQERLGCLCAPQLFSGTNSLPYYIWLS
Sbjct: 61 SSSFLDIQERLGCLCAPQLFSGTNSLPYYIWLS 93
BLAST of CmaCh06G000920 vs. ExPASy TrEMBL
Match:
A0A6J1GUC1 (uncharacterized protein LOC111457164 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111457164 PE=4 SV=1)
HSP 1 Score: 140.6 bits (353), Expect = 6.8e-30
Identity = 71/76 (93.42%), Postives = 72/76 (94.74%), Query Frame = 0
Query: 59 ARFFKSSIFALDNSDSFGRLPDWSMILANRSNEKLNYKNEWRGLKVKVKVEVYSYLDASS 118
ARFFKSSIFALDNS SFGRLPDWS+ILA RSNEKLNYKNEWRGL KVKVEVYSYLDASS
Sbjct: 39 ARFFKSSIFALDNSGSFGRLPDWSVILAYRSNEKLNYKNEWRGL--KVKVEVYSYLDASS 98
Query: 119 IKGVHWRAVQTAIFIN 135
IKGVHWRAVQTAIFIN
Sbjct: 99 IKGVHWRAVQTAIFIN 112
BLAST of CmaCh06G000920 vs. NCBI nr
Match:
XP_022971245.1 (uncharacterized protein LOC111470024 isoform X3 [Cucurbita maxima])
HSP 1 Score: 266.2 bits (679), Expect = 2.2e-67
Identity = 132/133 (99.25%), Postives = 132/133 (99.25%), Query Frame = 0
Query: 59 ARFFKSSIFALDNSDSFGRLPDWSMILANRSNEKLNYKNEWRGLKVKVKVEVYSYLDASS 118
ARFFKSSIFALDNSDSFGRLPDWSMILANRSNEKLNYKNEWRGLKVKVKVEVYSYLDASS
Sbjct: 39 ARFFKSSIFALDNSDSFGRLPDWSMILANRSNEKLNYKNEWRGLKVKVKVEVYSYLDASS 98
Query: 119 IKGVHWRAVQTAIFIN-ELRLASASSSFLDIQERLGCLCAPQLFSGTNSLPYYIWLSRHF 178
IKGVHWRAVQTAIFIN ELRLASASSSFLDIQERLGCLCAPQLFSGTNSLPYYIWLSRHF
Sbjct: 99 IKGVHWRAVQTAIFINEELRLASASSSFLDIQERLGCLCAPQLFSGTNSLPYYIWLSRHF 158
Query: 179 CEGASVSFNLRAR 191
CEGASVSFNLRAR
Sbjct: 159 CEGASVSFNLRAR 171
BLAST of CmaCh06G000920 vs. NCBI nr
Match:
KAG6596233.1 (hypothetical protein SDJN03_09413, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 241.5 bits (615), Expect = 5.9e-60
Identity = 125/139 (89.93%), Postives = 128/139 (92.09%), Query Frame = 0
Query: 1 MVSGLGSSGVFMLPLNRLALIAWLVKHFPAVAIHIRGVRGKIIRFNCRIFLHFASVLMAR 60
MVSGL SSGVF+LPLNRLALIAWLVKHFPAVAIHIRGVR KIIRFNCRIFLHFASVLMAR
Sbjct: 70 MVSGLRSSGVFVLPLNRLALIAWLVKHFPAVAIHIRGVRVKIIRFNCRIFLHFASVLMAR 129
Query: 61 FFKSSIFALDNSDSFGRLPDWSMILANRSNEKLNYKNEWRGLKVKVKVEVYSYLDASSIK 120
FFKSSIFALDNS SFGRLPDWS+ILA RSNEKLNYKNEWRGLK VYSYLDASSIK
Sbjct: 130 FFKSSIFALDNSGSFGRLPDWSVILAYRSNEKLNYKNEWRGLK------VYSYLDASSIK 189
Query: 121 GVHWRAVQTAIFINELRLA 140
GVHWRAVQTAIF NELRL+
Sbjct: 190 GVHWRAVQTAIFTNELRLS 202
BLAST of CmaCh06G000920 vs. NCBI nr
Match:
XP_022971243.1 (uncharacterized protein LOC111470024 isoform X2 [Cucurbita maxima])
HSP 1 Score: 237.3 bits (604), Expect = 1.1e-58
Identity = 116/116 (100.00%), Postives = 116/116 (100.00%), Query Frame = 0
Query: 59 ARFFKSSIFALDNSDSFGRLPDWSMILANRSNEKLNYKNEWRGLKVKVKVEVYSYLDASS 118
ARFFKSSIFALDNSDSFGRLPDWSMILANRSNEKLNYKNEWRGLKVKVKVEVYSYLDASS
Sbjct: 39 ARFFKSSIFALDNSDSFGRLPDWSMILANRSNEKLNYKNEWRGLKVKVKVEVYSYLDASS 98
Query: 119 IKGVHWRAVQTAIFINELRLASASSSFLDIQERLGCLCAPQLFSGTNSLPYYIWLS 175
IKGVHWRAVQTAIFINELRLASASSSFLDIQERLGCLCAPQLFSGTNSLPYYIWLS
Sbjct: 99 IKGVHWRAVQTAIFINELRLASASSSFLDIQERLGCLCAPQLFSGTNSLPYYIWLS 154
BLAST of CmaCh06G000920 vs. NCBI nr
Match:
XP_022971241.1 (uncharacterized protein LOC111470024 isoform X1 [Cucurbita maxima] >XP_022971242.1 uncharacterized protein LOC111470024 isoform X1 [Cucurbita maxima])
HSP 1 Score: 232.6 bits (592), Expect = 2.7e-57
Identity = 116/117 (99.15%), Postives = 116/117 (99.15%), Query Frame = 0
Query: 59 ARFFKSSIFALDNSDSFGRLPDWSMILANRSNEKLNYKNEWRGLKVKVKVEVYSYLDASS 118
ARFFKSSIFALDNSDSFGRLPDWSMILANRSNEKLNYKNEWRGLKVKVKVEVYSYLDASS
Sbjct: 39 ARFFKSSIFALDNSDSFGRLPDWSMILANRSNEKLNYKNEWRGLKVKVKVEVYSYLDASS 98
Query: 119 IKGVHWRAVQTAIFIN-ELRLASASSSFLDIQERLGCLCAPQLFSGTNSLPYYIWLS 175
IKGVHWRAVQTAIFIN ELRLASASSSFLDIQERLGCLCAPQLFSGTNSLPYYIWLS
Sbjct: 99 IKGVHWRAVQTAIFINEELRLASASSSFLDIQERLGCLCAPQLFSGTNSLPYYIWLS 155
BLAST of CmaCh06G000920 vs. NCBI nr
Match:
XP_022971246.1 (uncharacterized protein LOC111470024 isoform X4 [Cucurbita maxima])
HSP 1 Score: 184.1 bits (466), Expect = 1.1e-42
Identity = 92/93 (98.92%), Postives = 92/93 (98.92%), Query Frame = 0
Query: 83 MILANRSNEKLNYKNEWRGLKVKVKVEVYSYLDASSIKGVHWRAVQTAIFIN-ELRLASA 142
MILANRSNEKLNYKNEWRGLKVKVKVEVYSYLDASSIKGVHWRAVQTAIFIN ELRLASA
Sbjct: 1 MILANRSNEKLNYKNEWRGLKVKVKVEVYSYLDASSIKGVHWRAVQTAIFINEELRLASA 60
Query: 143 SSSFLDIQERLGCLCAPQLFSGTNSLPYYIWLS 175
SSSFLDIQERLGCLCAPQLFSGTNSLPYYIWLS
Sbjct: 61 SSSFLDIQERLGCLCAPQLFSGTNSLPYYIWLS 93
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A6J1I578 | 1.1e-67 | 99.25 | uncharacterized protein LOC111470024 isoform X3 OS=Cucurbita maxima OX=3661 GN=L... | [more] |
A0A6J1I6A1 | 5.4e-59 | 100.00 | uncharacterized protein LOC111470024 isoform X2 OS=Cucurbita maxima OX=3661 GN=L... | [more] |
A0A6J1I2S8 | 1.3e-57 | 99.15 | uncharacterized protein LOC111470024 isoform X1 OS=Cucurbita maxima OX=3661 GN=L... | [more] |
A0A6J1I2T3 | 5.4e-43 | 98.92 | uncharacterized protein LOC111470024 isoform X4 OS=Cucurbita maxima OX=3661 GN=L... | [more] |
A0A6J1GUC1 | 6.8e-30 | 93.42 | uncharacterized protein LOC111457164 isoform X1 OS=Cucurbita moschata OX=3662 GN... | [more] |
Match Name | E-value | Identity | Description | |
XP_022971245.1 | 2.2e-67 | 99.25 | uncharacterized protein LOC111470024 isoform X3 [Cucurbita maxima] | [more] |
KAG6596233.1 | 5.9e-60 | 89.93 | hypothetical protein SDJN03_09413, partial [Cucurbita argyrosperma subsp. sorori... | [more] |
XP_022971243.1 | 1.1e-58 | 100.00 | uncharacterized protein LOC111470024 isoform X2 [Cucurbita maxima] | [more] |
XP_022971241.1 | 2.7e-57 | 99.15 | uncharacterized protein LOC111470024 isoform X1 [Cucurbita maxima] >XP_022971242... | [more] |
XP_022971246.1 | 1.1e-42 | 98.92 | uncharacterized protein LOC111470024 isoform X4 [Cucurbita maxima] | [more] |
Match Name | E-value | Identity | Description | |