Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGATTACTATCAAATCAATGGCAGCGATGGCCGTGTATTTGCTGTTTATCGTCGCAATCGCGGCGGCGCTGGAGATTCGGCCGTCGGAGCATGGTCTGGAGTTTCAGAGCCCTCCGGCAGCGGGAGACAAATCGTCGCCGGAGATGCGGTCGTTCTTCGGAGGAACTTCGTCGCCGACGCCGGAAGTGGCATTGCCGTTGCCGAAGGCGATGAATTCCAGCGAGGCGCCGGGATGGTGGACCCACCGTGACGGTGGAGATAAACGACTTAGAAATGCACTATTGGTGGCGACGGCGGCTTGTGGAATAACAGGTGTCACTTTATTAGTCGGTTCTACGTTATACTACATTTTTAAGGTCAAAAATCAAAGATCATTGCCGCTCTCTTCTAACAACAGTAATCACAAATAA
mRNA sequence
ATGATTACTATCAAATCAATGGCAGCGATGGCCGTGTATTTGCTGTTTATCGTCGCAATCGCGGCGGCGCTGGAGATTCGGCCGTCGGAGCATGGTCTGGAGTTTCAGAGCCCTCCGGCAGCGGGAGACAAATCGTCGCCGGAGATGCGGTCGTTCTTCGGAGGAACTTCGTCGCCGACGCCGGAAGTGGCATTGCCGTTGCCGAAGGCGATGAATTCCAGCGAGGCGCCGGGATGGTGGACCCACCGTGACGGTGGAGATAAACGACTTAGAAATGCACTATTGGTGGCGACGGCGGCTTGTGGAATAACAGGTGTCACTTTATTAGTCGGTTCTACGTTATACTACATTTTTAAGGTCAAAAATCAAAGATCATTGCCGCTCTCTTCTAACAACAGTAATCACAAATAA
Coding sequence (CDS)
ATGATTACTATCAAATCAATGGCAGCGATGGCCGTGTATTTGCTGTTTATCGTCGCAATCGCGGCGGCGCTGGAGATTCGGCCGTCGGAGCATGGTCTGGAGTTTCAGAGCCCTCCGGCAGCGGGAGACAAATCGTCGCCGGAGATGCGGTCGTTCTTCGGAGGAACTTCGTCGCCGACGCCGGAAGTGGCATTGCCGTTGCCGAAGGCGATGAATTCCAGCGAGGCGCCGGGATGGTGGACCCACCGTGACGGTGGAGATAAACGACTTAGAAATGCACTATTGGTGGCGACGGCGGCTTGTGGAATAACAGGTGTCACTTTATTAGTCGGTTCTACGTTATACTACATTTTTAAGGTCAAAAATCAAAGATCATTGCCGCTCTCTTCTAACAACAGTAATCACAAATAA
Protein sequence
MITIKSMAAMAVYLLFIVAIAAALEIRPSEHGLEFQSPPAAGDKSSPEMRSFFGGTSSPTPEVALPLPKAMNSSEAPGWWTHRDGGDKRLRNALLVATAACGITGVTLLVGSTLYYIFKVKNQRSLPLSSNNSNHK
Homology
BLAST of Cp4.1LG19g05600 vs. NCBI nr
Match:
XP_023518243.1 (uncharacterized protein LOC111781779 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 260 bits (665), Expect = 2.31e-87
Identity = 136/136 (100.00%), Postives = 136/136 (100.00%), Query Frame = 0
Query: 1 MITIKSMAAMAVYLLFIVAIAAALEIRPSEHGLEFQSPPAAGDKSSPEMRSFFGGTSSPT 60
MITIKSMAAMAVYLLFIVAIAAALEIRPSEHGLEFQSPPAAGDKSSPEMRSFFGGTSSPT
Sbjct: 1 MITIKSMAAMAVYLLFIVAIAAALEIRPSEHGLEFQSPPAAGDKSSPEMRSFFGGTSSPT 60
Query: 61 PEVALPLPKAMNSSEAPGWWTHRDGGDKRLRNALLVATAACGITGVTLLVGSTLYYIFKV 120
PEVALPLPKAMNSSEAPGWWTHRDGGDKRLRNALLVATAACGITGVTLLVGSTLYYIFKV
Sbjct: 61 PEVALPLPKAMNSSEAPGWWTHRDGGDKRLRNALLVATAACGITGVTLLVGSTLYYIFKV 120
Query: 121 KNQRSLPLSSNNSNHK 136
KNQRSLPLSSNNSNHK
Sbjct: 121 KNQRSLPLSSNNSNHK 136
BLAST of Cp4.1LG19g05600 vs. NCBI nr
Match:
XP_022926736.1 (uncharacterized protein LOC111433768 [Cucurbita moschata])
HSP 1 Score: 256 bits (655), Expect = 7.75e-86
Identity = 135/136 (99.26%), Postives = 135/136 (99.26%), Query Frame = 0
Query: 1 MITIKSMAAMAVYLLFIVAIAAALEIRPSEHGLEFQSPPAAGDKSSPEMRSFFGGTSSPT 60
MITIKSMAAMAVYLLFIVAIAAALEIRPSEHGLEFQS PAAGDKSSPEMRSFFGGTSSPT
Sbjct: 1 MITIKSMAAMAVYLLFIVAIAAALEIRPSEHGLEFQSLPAAGDKSSPEMRSFFGGTSSPT 60
Query: 61 PEVALPLPKAMNSSEAPGWWTHRDGGDKRLRNALLVATAACGITGVTLLVGSTLYYIFKV 120
PEVALPLPKAMNSSEAPGWWTHRDGGDKRLRNALLVATAACGITGVTLLVGSTLYYIFKV
Sbjct: 61 PEVALPLPKAMNSSEAPGWWTHRDGGDKRLRNALLVATAACGITGVTLLVGSTLYYIFKV 120
Query: 121 KNQRSLPLSSNNSNHK 136
KNQRSLPLSSNNSNHK
Sbjct: 121 KNQRSLPLSSNNSNHK 136
BLAST of Cp4.1LG19g05600 vs. NCBI nr
Match:
XP_023003567.1 (uncharacterized protein LOC111497130 [Cucurbita maxima])
HSP 1 Score: 256 bits (654), Expect = 1.10e-85
Identity = 133/136 (97.79%), Postives = 135/136 (99.26%), Query Frame = 0
Query: 1 MITIKSMAAMAVYLLFIVAIAAALEIRPSEHGLEFQSPPAAGDKSSPEMRSFFGGTSSPT 60
MITIKSMAAMAVYLLFIVAI AALEIRPSEHGLEFQSPPAAGDKSSPEMRSFFGGTSSPT
Sbjct: 1 MITIKSMAAMAVYLLFIVAIEAALEIRPSEHGLEFQSPPAAGDKSSPEMRSFFGGTSSPT 60
Query: 61 PEVALPLPKAMNSSEAPGWWTHRDGGDKRLRNALLVATAACGITGVTLLVGSTLYYIFKV 120
PEVALPLPKAMNSSEAPGWWTHRDGGDKRLRNALLVATAACG+TGVTLLVGSTLYYIFKV
Sbjct: 61 PEVALPLPKAMNSSEAPGWWTHRDGGDKRLRNALLVATAACGMTGVTLLVGSTLYYIFKV 120
Query: 121 KNQRSLPLSSNNSNHK 136
KNQRSLPLSSNN+NHK
Sbjct: 121 KNQRSLPLSSNNNNHK 136
BLAST of Cp4.1LG19g05600 vs. NCBI nr
Match:
KAG6594423.1 (hypothetical protein SDJN03_10976, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 248 bits (632), Expect = 2.04e-82
Identity = 129/130 (99.23%), Postives = 129/130 (99.23%), Query Frame = 0
Query: 7 MAAMAVYLLFIVAIAAALEIRPSEHGLEFQSPPAAGDKSSPEMRSFFGGTSSPTPEVALP 66
MAAMAVYLLFIVAIAAALEIRPSEHGLEFQS PAAGDKSSPEMRSFFGGTSSPTPEVALP
Sbjct: 1 MAAMAVYLLFIVAIAAALEIRPSEHGLEFQSLPAAGDKSSPEMRSFFGGTSSPTPEVALP 60
Query: 67 LPKAMNSSEAPGWWTHRDGGDKRLRNALLVATAACGITGVTLLVGSTLYYIFKVKNQRSL 126
LPKAMNSSEAPGWWTHRDGGDKRLRNALLVATAACGITGVTLLVGSTLYYIFKVKNQRSL
Sbjct: 61 LPKAMNSSEAPGWWTHRDGGDKRLRNALLVATAACGITGVTLLVGSTLYYIFKVKNQRSL 120
Query: 127 PLSSNNSNHK 136
PLSSNNSNHK
Sbjct: 121 PLSSNNSNHK 130
BLAST of Cp4.1LG19g05600 vs. NCBI nr
Match:
KAG7026421.1 (hypothetical protein SDJN02_10421, partial [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 216 bits (550), Expect = 4.68e-70
Identity = 112/114 (98.25%), Postives = 113/114 (99.12%), Query Frame = 0
Query: 7 MAAMAVYLLFIVAIAAALEIRPSEHGLEFQSPPAAGDKSSPEMRSFFGGTSSPTPEVALP 66
MAAMAVYLLFIVAIAAALEIRPSEHGLEFQS PAAGDKSSPEMRSFFGGTSSPTPEVALP
Sbjct: 1 MAAMAVYLLFIVAIAAALEIRPSEHGLEFQSLPAAGDKSSPEMRSFFGGTSSPTPEVALP 60
Query: 67 LPKAMNSSEAPGWWTHRDGGDKRLRNALLVATAACGITGVTLLVGSTLYYIFKV 120
LPKAMNSSEAPGWWTHRDGGDKRLRNALLVATAACGITGVTLLVGSTLYYIFK+
Sbjct: 61 LPKAMNSSEAPGWWTHRDGGDKRLRNALLVATAACGITGVTLLVGSTLYYIFKM 114
BLAST of Cp4.1LG19g05600 vs. ExPASy TrEMBL
Match:
A0A6J1EFQ9 (uncharacterized protein LOC111433768 OS=Cucurbita moschata OX=3662 GN=LOC111433768 PE=4 SV=1)
HSP 1 Score: 256 bits (655), Expect = 3.75e-86
Identity = 135/136 (99.26%), Postives = 135/136 (99.26%), Query Frame = 0
Query: 1 MITIKSMAAMAVYLLFIVAIAAALEIRPSEHGLEFQSPPAAGDKSSPEMRSFFGGTSSPT 60
MITIKSMAAMAVYLLFIVAIAAALEIRPSEHGLEFQS PAAGDKSSPEMRSFFGGTSSPT
Sbjct: 1 MITIKSMAAMAVYLLFIVAIAAALEIRPSEHGLEFQSLPAAGDKSSPEMRSFFGGTSSPT 60
Query: 61 PEVALPLPKAMNSSEAPGWWTHRDGGDKRLRNALLVATAACGITGVTLLVGSTLYYIFKV 120
PEVALPLPKAMNSSEAPGWWTHRDGGDKRLRNALLVATAACGITGVTLLVGSTLYYIFKV
Sbjct: 61 PEVALPLPKAMNSSEAPGWWTHRDGGDKRLRNALLVATAACGITGVTLLVGSTLYYIFKV 120
Query: 121 KNQRSLPLSSNNSNHK 136
KNQRSLPLSSNNSNHK
Sbjct: 121 KNQRSLPLSSNNSNHK 136
BLAST of Cp4.1LG19g05600 vs. ExPASy TrEMBL
Match:
A0A6J1KMY3 (uncharacterized protein LOC111497130 OS=Cucurbita maxima OX=3661 GN=LOC111497130 PE=4 SV=1)
HSP 1 Score: 256 bits (654), Expect = 5.33e-86
Identity = 133/136 (97.79%), Postives = 135/136 (99.26%), Query Frame = 0
Query: 1 MITIKSMAAMAVYLLFIVAIAAALEIRPSEHGLEFQSPPAAGDKSSPEMRSFFGGTSSPT 60
MITIKSMAAMAVYLLFIVAI AALEIRPSEHGLEFQSPPAAGDKSSPEMRSFFGGTSSPT
Sbjct: 1 MITIKSMAAMAVYLLFIVAIEAALEIRPSEHGLEFQSPPAAGDKSSPEMRSFFGGTSSPT 60
Query: 61 PEVALPLPKAMNSSEAPGWWTHRDGGDKRLRNALLVATAACGITGVTLLVGSTLYYIFKV 120
PEVALPLPKAMNSSEAPGWWTHRDGGDKRLRNALLVATAACG+TGVTLLVGSTLYYIFKV
Sbjct: 61 PEVALPLPKAMNSSEAPGWWTHRDGGDKRLRNALLVATAACGMTGVTLLVGSTLYYIFKV 120
Query: 121 KNQRSLPLSSNNSNHK 136
KNQRSLPLSSNN+NHK
Sbjct: 121 KNQRSLPLSSNNNNHK 136
BLAST of Cp4.1LG19g05600 vs. ExPASy TrEMBL
Match:
A0A6J1EFW4 (uncharacterized protein LOC111433017 OS=Cucurbita moschata OX=3662 GN=LOC111433017 PE=4 SV=1)
HSP 1 Score: 214 bits (544), Expect = 1.98e-68
Identity = 112/135 (82.96%), Postives = 122/135 (90.37%), Query Frame = 0
Query: 2 ITIKSMAAMAVYLLFIVAIAAALEIRPSEHGLEFQSPPAAGDKSSPEMRSFFGGTSSPTP 61
I IK +AAM V+L IVA+A+ALEIRPSEHGLEFQSPP AG+KSSPEMRSFF GTSSP P
Sbjct: 59 IPIKPVAAMVVFLPLIVAVASALEIRPSEHGLEFQSPPPAGEKSSPEMRSFFVGTSSPIP 118
Query: 62 EVALPLPKAMNSSEAPGWWTHRDGGDKRLRNALLVATAACGITGVTLLVGSTLYYIFKVK 121
+ LPLPKAMNSSEAPGWWTHRDGG+KR+RNALLVATAACGITGVTLLVGSTL+YI+KVK
Sbjct: 119 DTTLPLPKAMNSSEAPGWWTHRDGGNKRVRNALLVATAACGITGVTLLVGSTLFYIYKVK 178
Query: 122 NQRSLPLSSNNSNHK 136
NQ LPLSSNN NHK
Sbjct: 179 NQTPLPLSSNN-NHK 192
BLAST of Cp4.1LG19g05600 vs. ExPASy TrEMBL
Match:
A0A6J1CZN3 (uncharacterized protein LOC111015729 OS=Momordica charantia OX=3673 GN=LOC111015729 PE=4 SV=1)
HSP 1 Score: 206 bits (525), Expect = 2.73e-66
Identity = 115/140 (82.14%), Postives = 120/140 (85.71%), Query Frame = 0
Query: 1 MITIKSMAAMAVYLLFIVAIAA----ALEIRPSEHGLEFQSPPAAGDKSSPEMRSFFGGT 60
MI IK AAMAV L IVA+ A ALEIRPSEHGLEFQSPP AGDKSSPEM SFFGG
Sbjct: 1 MIPIKIAAAMAVCLPLIVAVLAVKTTALEIRPSEHGLEFQSPPPAGDKSSPEMLSFFGGR 60
Query: 61 SSPTPEVALPLPKAMNSSEAPGWWTHRDGGDKRLRNALLVATAACGITGVTLLVGSTLYY 120
SSPTP+ ALPLPKAMNSSEAPGWWT RDGGD RLRNALLVATAA GITGVTLLVGS L+Y
Sbjct: 61 SSPTPDAALPLPKAMNSSEAPGWWTRRDGGDTRLRNALLVATAAFGITGVTLLVGSVLFY 120
Query: 121 IFKVKNQRSLPLSSNNSNHK 136
++KVKNQRSLPLSSNN NHK
Sbjct: 121 VYKVKNQRSLPLSSNN-NHK 139
BLAST of Cp4.1LG19g05600 vs. ExPASy TrEMBL
Match:
A0A0A0KM65 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G525480 PE=4 SV=1)
HSP 1 Score: 206 bits (524), Expect = 3.40e-66
Identity = 106/136 (77.94%), Postives = 117/136 (86.03%), Query Frame = 0
Query: 1 MITIKSMAAMAVYLLFIVAIAAALEIRPSEHGLEFQSPPAAGDKSSPEMRSFFGGTSSPT 60
MI IKS AA+ + I +IAA EIRPSEHGLEFQSPP GDKSSPEMRSFFGG +SPT
Sbjct: 1 MIPIKSSAAIVAFFSLIASIAAVSEIRPSEHGLEFQSPPPVGDKSSPEMRSFFGGIASPT 60
Query: 61 PEVALPLPKAMNSSEAPGWWTHRDGGDKRLRNALLVATAACGITGVTLLVGSTLYYIFKV 120
PEVALP+PK +NSSE+PGWW H DGG+KRLRNALLVATAACGITGVTLLVGSTL+YIFK
Sbjct: 61 PEVALPIPKTLNSSESPGWWNHHDGGNKRLRNALLVATAACGITGVTLLVGSTLFYIFKA 120
Query: 121 KNQRSLPLSSNNSNHK 136
KN+RS+PLS NN NHK
Sbjct: 121 KNKRSMPLSPNN-NHK 135
BLAST of Cp4.1LG19g05600 vs. TAIR 10
Match:
AT4G21740.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G30515.1); Has 20 Blast hits to 20 proteins in 4 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 20; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )
HSP 1 Score: 68.6 bits (166), Expect = 4.6e-12
Identity = 50/132 (37.88%), Postives = 71/132 (53.79%), Query Frame = 0
Query: 13 YLLFIVAIAAALEIRPSEHGLEFQ--SPPAAGDKSSPEMRSFFGG--TSSPTPEVALPLP 72
+L+ + A E+RPS+HGL++Q SPP +M+SFFG +SSP P LP
Sbjct: 22 FLVIFTGNSLAGELRPSDHGLQYQFSSPPTESHSPPGKMKSFFGDSHSSSPPPSHPQLLP 81
Query: 73 K--AMNSSEAPGWWTHRDGG----DKRLRNALLVATAACGITGVTLLVGSTLYYIFKV-K 132
K A + + WW RDG D +R+ L A+ CG++GV LLV TL Y F+ K
Sbjct: 82 KATAADGGDDDSWW--RDGAGIRRDHVMRHVFLAASIICGVSGVALLVVFTLIYFFRYRK 141
Query: 133 NQRSLPLSSNNS 134
+ S + N+S
Sbjct: 142 HNHSNSPTGNDS 151
BLAST of Cp4.1LG19g05600 vs. TAIR 10
Match:
AT1G30515.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: endomembrane system; EXPRESSED IN: 16 plant structures; EXPRESSED DURING: 6 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G21740.1); Has 20 Blast hits to 20 proteins in 4 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 20; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )
HSP 1 Score: 52.0 bits (123), Expect = 4.4e-07
Identity = 43/133 (32.33%), Postives = 70/133 (52.63%), Query Frame = 0
Query: 10 MAVYLLFIVAIAA----ALEIRPSEHGLEFQSPPAAGDKSSPEMRSFFGGTSS-PTPEVA 69
+++ ++FI+ + + A E+RPS+HGLE+ P S EM SFFG SS ++
Sbjct: 13 ISMLIMFIIVLESTIINARELRPSDHGLEYYYEPG----ESSEMTSFFGPPSSNDLTSIS 72
Query: 70 LPLPKAMNSS-EAPGWWTHRDGGDKRLRN-ALLVATAACGITGVTLLVGSTLYYIFKVKN 129
P + S+ ++P +D D R+ N L+V + CG++GV L+V S L Y
Sbjct: 73 SPSSSILPSAVKSPMKTLSKDQDDDRVMNHVLVVGSLVCGLSGVALMVASALIYFLGYPK 132
Query: 130 QRSLPLSSNNSNH 136
++ SS N +H
Sbjct: 133 TQN---SSVNCDH 138
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
XP_023518243.1 | 2.31e-87 | 100.00 | uncharacterized protein LOC111781779 [Cucurbita pepo subsp. pepo] | [more] |
XP_022926736.1 | 7.75e-86 | 99.26 | uncharacterized protein LOC111433768 [Cucurbita moschata] | [more] |
XP_023003567.1 | 1.10e-85 | 97.79 | uncharacterized protein LOC111497130 [Cucurbita maxima] | [more] |
KAG6594423.1 | 2.04e-82 | 99.23 | hypothetical protein SDJN03_10976, partial [Cucurbita argyrosperma subsp. sorori... | [more] |
KAG7026421.1 | 4.68e-70 | 98.25 | hypothetical protein SDJN02_10421, partial [Cucurbita argyrosperma subsp. argyro... | [more] |
Match Name | E-value | Identity | Description | |
A0A6J1EFQ9 | 3.75e-86 | 99.26 | uncharacterized protein LOC111433768 OS=Cucurbita moschata OX=3662 GN=LOC1114337... | [more] |
A0A6J1KMY3 | 5.33e-86 | 97.79 | uncharacterized protein LOC111497130 OS=Cucurbita maxima OX=3661 GN=LOC111497130... | [more] |
A0A6J1EFW4 | 1.98e-68 | 82.96 | uncharacterized protein LOC111433017 OS=Cucurbita moschata OX=3662 GN=LOC1114330... | [more] |
A0A6J1CZN3 | 2.73e-66 | 82.14 | uncharacterized protein LOC111015729 OS=Momordica charantia OX=3673 GN=LOC111015... | [more] |
A0A0A0KM65 | 3.40e-66 | 77.94 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G525480 PE=4 SV=1 | [more] |
Match Name | E-value | Identity | Description | |
AT4G21740.1 | 4.6e-12 | 37.88 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... | [more] |
AT1G30515.1 | 4.4e-07 | 32.33 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... | [more] |