Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
ATGATAATTCCGATCAAACCAGTAGCGGCGATGGCCGTGTTCTTGCCACTGATCGTGGCGGTCGCATCCGCGCTGGAGATACGGCCGTCGGAGCACGGGCTGGAGTTTCAGAGCCCTCCGCCTGCGGGAGAAAAATCGTCGCCACAGATGCGGTCGTTCTTCGTAGGAACATCGTCGCCAATTCCGGATACAACATTGCCGTTGCCGAAGGCGATGAATTCGAGCGAGGCGCCGGGATGGTGGACCCACCGTGACGGTGGAAATAAACGAGTAAGAAATGCATTATTGGTGGCGACGGCGGCTTGTGGAATTACAGGTGTCACTTTATTAGTGGGTTCTACGCTATTCTACATTTATAAGGTAAAAAATCAAACACCATTGCCATTATCTTCAAATAATAATCACAAATAAACATATCAACAACAATCATGATCCCTTACTTTGTTCATCTCTTGAAGTAGCGATCAATTTCCCAGTTCCAATTTGTTCTTACCGTATAA
mRNA sequence
ATGATAATTCCGATCAAACCAGTAGCGGCGATGGCCGTGTTCTTGCCACTGATCGTGGCGGTCGCATCCGCGCTGGAGATACGGCCGTCGGAGCACGGGCTGGAGTTTCAGAGCCCTCCGCCTGCGGGAGAAAAATCGTCGCCACAGATGCGGTCGTTCTTCGTAGGAACATCGTCGCCAATTCCGGATACAACATTGCCGTTGCCGAAGGCGATGAATTCGAGCGAGGCGCCGGGATGGTGGACCCACCGTGACGGTGGAAATAAACGAGTAAGAAATGCATTATTGGTGGCGACGGCGGCTTGTGGAATTACAGGTGTCACTTTATTAGTGGGTTCTACGCTATTCTACATTTATAAGCGATCAATTTCCCAGTTCCAATTTGTTCTTACCGTATAA
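The mRNA sequence above appears to be the gene sequence with one internal stretch (the intron) removed. A minimal sketch of how that stretch could be located is shown here, assuming a single intron; `find_single_intron` is a hypothetical helper and the short sequences are made-up toy data, not this feature's.

```python
from __future__ import annotations


def find_single_intron(gene: str, mrna: str) -> tuple[int, int]:
    """Return (start, end) of the intron in `gene` as a 0-based, end-exclusive slice.

    Assumes exactly one intron, so that gene == exon1 + intron + exon2
    and mrna == exon1 + exon2.
    """
    if len(gene) <= len(mrna):
        raise ValueError("gene must be longer than mrna")
    # Extend the shared prefix (exon 1) for as long as the two sequences agree.
    start = 0
    while start < len(mrna) and gene[start] == mrna[start]:
        start += 1
    # Whatever is left of the mRNA must be exon 2, sitting at the gene's 3' end.
    exon2 = mrna[start:]
    end = len(gene) - len(exon2)
    if gene[end:] != exon2:
        raise ValueError("sequences do not fit a single-intron model")
    return start, end


# Toy data (hypothetical): exon1 + intron + exon2
toy_gene = "ATGGCCAAG" + "GTAAATTTCAG" + "CGATTAA"
toy_mrna = "ATGGCCAAG" + "CGATTAA"
start, end = find_single_intron(toy_gene, toy_mrna)
print(start, end, toy_gene[start:end])
```

Because the prefix is extended greedily, the reported boundary can shift by a base or two when the intron begins with the same bases as the second exon; checking that the slice starts with GT and ends with AG is a reasonable sanity test for canonical introns.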
Coding sequence (CDS)
ATGATAATTCCGATCAAACCAGTAGCGGCGATGGCCGTGTTCTTGCCACTGATCGTGGCGGTCGCATCCGCGCTGGAGATACGGCCGTCGGAGCACGGGCTGGAGTTTCAGAGCCCTCCGCCTGCGGGAGAAAAATCGTCGCCACAGATGCGGTCGTTCTTCGTAGGAACATCGTCGCCAATTCCGGATACAACATTGCCGTTGCCGAAGGCGATGAATTCGAGCGAGGCGCCGGGATGGTGGACCCACCGTGACGGTGGAAATAAACGAGTAAGAAATGCATTATTGGTGGCGACGGCGGCTTGTGGAATTACAGGTGTCACTTTATTAGTGGGTTCTACGCTATTCTACATTTATAAGCGATCAATTTCCCAGTTCCAATTTGTTCTTACCGTATAA
Protein sequence
MIIPIKPVAAMAVFLPLIVAVASALEIRPSEHGLEFQSPPPAGEKSSPQMRSFFVGTSSPIPDTTLPLPKAMNSSEAPGWWTHRDGGNKRVRNALLVATAACGITGVTLLVGSTLFYIYKRSISQFQFVLTV
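The protein sequence is the translation of the CDS, read codon by codon until the stop codon. A minimal sketch follows, assuming the standard genetic code; `translate` and `CODON_TABLE` are names chosen for illustration, and the example input is only the first 30 nucleotides of the CDS above.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Step through complete codons only; a trailing partial codon is ignored.
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]  # raises KeyError on non-ACGT symbols
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGATAATTCCGATCAAACCAGTAGCGGCG"))  # MIIPIKPVAA
```

The output matches the first ten residues of the protein sequence listed above.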
Homology
BLAST of Cp4.1LG10g06470 vs. NCBI nr
Match:
KAG6581475.1 (hypothetical protein SDJN03_21477, partial [Cucurbita argyrosperma subsp. sororia] >KAG7034767.1 hypothetical protein SDJN02_04498, partial [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 231 bits (589), Expect = 5.67e-76
Identity = 119/120 (99.17%), Positives = 120/120 (100.00%), Query Frame = 0
Query: 1 MIIPIKPVAAMAVFLPLIVAVASALEIRPSEHGLEFQSPPPAGEKSSPQMRSFFVGTSSP 60
MIIPIKPVAAMAVFLPLIVAVASALEIRPSEHGLEFQSPPPAGEKSSP+MRSFFVGTSSP
Sbjct: 1 MIIPIKPVAAMAVFLPLIVAVASALEIRPSEHGLEFQSPPPAGEKSSPEMRSFFVGTSSP 60
Query: 61 IPDTTLPLPKAMNSSEAPGWWTHRDGGNKRVRNALLVATAACGITGVTLLVGSTLFYIYK 120
IPDTTLPLPKAMNSSEAPGWWTHRDGGNKRVRNALLVATAACGITGVTLLVGSTLFYIYK
Sbjct: 61 IPDTTLPLPKAMNSSEAPGWWTHRDGGNKRVRNALLVATAACGITGVTLLVGSTLFYIYK 120
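The identity figure reported for an HSP is the number of aligned columns in which query and subject carry the same residue, divided by the alignment length. A minimal sketch of that calculation is given here, using the Query and Sbjct rows of the HSP above; `percent_identity` is a hypothetical helper. The second figure in each HSP also counts conservative substitutions, which depends on the substitution matrix and is not reproduced here.

```python
def percent_identity(query: str, subject: str) -> float:
    """Percent of alignment columns with identical residues, to two decimals."""
    if len(query) != len(subject) or not query:
        raise ValueError("aligned rows must be non-empty and of equal length")
    # Gap characters ('-') never count as a match.
    matches = sum(q == s and q != "-" for q, s in zip(query, subject))
    return round(100.0 * matches / len(query), 2)


# Query and Sbjct rows of the HSP above, concatenated across both blocks.
query = (
    "MIIPIKPVAAMAVFLPLIVAVASALEIRPSEHGLEFQSPPPAGEKSSPQMRSFFVGTSSP"
    "IPDTTLPLPKAMNSSEAPGWWTHRDGGNKRVRNALLVATAACGITGVTLLVGSTLFYIYK"
)
subject = (
    "MIIPIKPVAAMAVFLPLIVAVASALEIRPSEHGLEFQSPPPAGEKSSPEMRSFFVGTSSP"
    "IPDTTLPLPKAMNSSEAPGWWTHRDGGNKRVRNALLVATAACGITGVTLLVGSTLFYIYK"
)
print(percent_identity(query, subject))
```

With these rows the single Q/E difference should give 119 matches out of 120 columns, in line with the identity reported for this hit.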
BLAST of Cp4.1LG10g06470 vs. NCBI nr
Match:
XP_022925668.1 (uncharacterized protein LOC111433017 [Cucurbita moschata])
HSP 1 Score: 229 bits (585), Expect = 2.00e-74
Identity = 118/120 (98.33%), Positives = 119/120 (99.17%), Query Frame = 0
Query: 1 MIIPIKPVAAMAVFLPLIVAVASALEIRPSEHGLEFQSPPPAGEKSSPQMRSFFVGTSSP 60
MIIPIKPVAAM VFLPLIVAVASALEIRPSEHGLEFQSPPPAGEKSSP+MRSFFVGTSSP
Sbjct: 57 MIIPIKPVAAMVVFLPLIVAVASALEIRPSEHGLEFQSPPPAGEKSSPEMRSFFVGTSSP 116
Query: 61 IPDTTLPLPKAMNSSEAPGWWTHRDGGNKRVRNALLVATAACGITGVTLLVGSTLFYIYK 120
IPDTTLPLPKAMNSSEAPGWWTHRDGGNKRVRNALLVATAACGITGVTLLVGSTLFYIYK
Sbjct: 117 IPDTTLPLPKAMNSSEAPGWWTHRDGGNKRVRNALLVATAACGITGVTLLVGSTLFYIYK 176
BLAST of Cp4.1LG10g06470 vs. NCBI nr
Match:
XP_023543933.1 (uncharacterized protein LOC111803654 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 217 bits (553), Expect = 1.75e-70
Identity = 110/110 (100.00%), Positives = 110/110 (100.00%), Query Frame = 0
Query: 11 MAVFLPLIVAVASALEIRPSEHGLEFQSPPPAGEKSSPQMRSFFVGTSSPIPDTTLPLPK 70
MAVFLPLIVAVASALEIRPSEHGLEFQSPPPAGEKSSPQMRSFFVGTSSPIPDTTLPLPK
Sbjct: 1 MAVFLPLIVAVASALEIRPSEHGLEFQSPPPAGEKSSPQMRSFFVGTSSPIPDTTLPLPK 60
Query: 71 AMNSSEAPGWWTHRDGGNKRVRNALLVATAACGITGVTLLVGSTLFYIYK 120
AMNSSEAPGWWTHRDGGNKRVRNALLVATAACGITGVTLLVGSTLFYIYK
Sbjct: 61 AMNSSEAPGWWTHRDGGNKRVRNALLVATAACGITGVTLLVGSTLFYIYK 110
BLAST of Cp4.1LG10g06470 vs. NCBI nr
Match:
XP_023518243.1 (uncharacterized protein LOC111781779 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 193 bits (491), Expect = 6.59e-61
Identity = 98/118 (83.05%), Positives = 109/118 (92.37%), Query Frame = 0
Query: 3 IPIKPVAAMAVFLPLIVAVASALEIRPSEHGLEFQSPPPAGEKSSPQMRSFFVGTSSPIP 62
I IK +AAMAV+L IVA+A+ALEIRPSEHGLEFQSPP AG+KSSP+MRSFF GTSSP P
Sbjct: 2 ITIKSMAAMAVYLLFIVAIAAALEIRPSEHGLEFQSPPAAGDKSSPEMRSFFGGTSSPTP 61
Query: 63 DTTLPLPKAMNSSEAPGWWTHRDGGNKRVRNALLVATAACGITGVTLLVGSTLFYIYK 120
+ LPLPKAMNSSEAPGWWTHRDGG+KR+RNALLVATAACGITGVTLLVGSTL+YI+K
Sbjct: 62 EVALPLPKAMNSSEAPGWWTHRDGGDKRLRNALLVATAACGITGVTLLVGSTLYYIFK 119
BLAST of Cp4.1LG10g06470 vs. NCBI nr
Match:
XP_023003567.1 (uncharacterized protein LOC111497130 [Cucurbita maxima])
HSP 1 Score: 190 bits (483), Expect = 1.09e-59
Identity = 96/118 (81.36%), Positives = 108/118 (91.53%), Query Frame = 0
Query: 3 IPIKPVAAMAVFLPLIVAVASALEIRPSEHGLEFQSPPPAGEKSSPQMRSFFVGTSSPIP 62
I IK +AAMAV+L IVA+ +ALEIRPSEHGLEFQSPP AG+KSSP+MRSFF GTSSP P
Sbjct: 2 ITIKSMAAMAVYLLFIVAIEAALEIRPSEHGLEFQSPPAAGDKSSPEMRSFFGGTSSPTP 61
Query: 63 DTTLPLPKAMNSSEAPGWWTHRDGGNKRVRNALLVATAACGITGVTLLVGSTLFYIYK 120
+ LPLPKAMNSSEAPGWWTHRDGG+KR+RNALLVATAACG+TGVTLLVGSTL+YI+K
Sbjct: 62 EVALPLPKAMNSSEAPGWWTHRDGGDKRLRNALLVATAACGMTGVTLLVGSTLYYIFK 119
BLAST of Cp4.1LG10g06470 vs. ExPASy TrEMBL
Match:
A0A6J1EFW4 (uncharacterized protein LOC111433017 OS=Cucurbita moschata OX=3662 GN=LOC111433017 PE=4 SV=1)
HSP 1 Score: 229 bits (585), Expect = 9.68e-75
Identity = 118/120 (98.33%), Positives = 119/120 (99.17%), Query Frame = 0
Query: 1 MIIPIKPVAAMAVFLPLIVAVASALEIRPSEHGLEFQSPPPAGEKSSPQMRSFFVGTSSP 60
MIIPIKPVAAM VFLPLIVAVASALEIRPSEHGLEFQSPPPAGEKSSP+MRSFFVGTSSP
Sbjct: 57 MIIPIKPVAAMVVFLPLIVAVASALEIRPSEHGLEFQSPPPAGEKSSPEMRSFFVGTSSP 116
Query: 61 IPDTTLPLPKAMNSSEAPGWWTHRDGGNKRVRNALLVATAACGITGVTLLVGSTLFYIYK 120
IPDTTLPLPKAMNSSEAPGWWTHRDGGNKRVRNALLVATAACGITGVTLLVGSTLFYIYK
Sbjct: 117 IPDTTLPLPKAMNSSEAPGWWTHRDGGNKRVRNALLVATAACGITGVTLLVGSTLFYIYK 176
BLAST of Cp4.1LG10g06470 vs. ExPASy TrEMBL
Match:
A0A6J1KMY3 (uncharacterized protein LOC111497130 OS=Cucurbita maxima OX=3661 GN=LOC111497130 PE=4 SV=1)
HSP 1 Score: 190 bits (483), Expect = 5.28e-60
Identity = 96/118 (81.36%), Positives = 108/118 (91.53%), Query Frame = 0
Query: 3 IPIKPVAAMAVFLPLIVAVASALEIRPSEHGLEFQSPPPAGEKSSPQMRSFFVGTSSPIP 62
I IK +AAMAV+L IVA+ +ALEIRPSEHGLEFQSPP AG+KSSP+MRSFF GTSSP P
Sbjct: 2 ITIKSMAAMAVYLLFIVAIEAALEIRPSEHGLEFQSPPAAGDKSSPEMRSFFGGTSSPTP 61
Query: 63 DTTLPLPKAMNSSEAPGWWTHRDGGNKRVRNALLVATAACGITGVTLLVGSTLFYIYK 120
+ LPLPKAMNSSEAPGWWTHRDGG+KR+RNALLVATAACG+TGVTLLVGSTL+YI+K
Sbjct: 62 EVALPLPKAMNSSEAPGWWTHRDGGDKRLRNALLVATAACGMTGVTLLVGSTLYYIFK 119
BLAST of Cp4.1LG10g06470 vs. ExPASy TrEMBL
Match:
A0A6J1EFQ9 (uncharacterized protein LOC111433768 OS=Cucurbita moschata OX=3662 GN=LOC111433768 PE=4 SV=1)
HSP 1 Score: 189 bits (481), Expect = 1.06e-59
Identity = 97/118 (82.20%), Positives = 108/118 (91.53%), Query Frame = 0
Query: 3 IPIKPVAAMAVFLPLIVAVASALEIRPSEHGLEFQSPPPAGEKSSPQMRSFFVGTSSPIP 62
I IK +AAMAV+L IVA+A+ALEIRPSEHGLEFQS P AG+KSSP+MRSFF GTSSP P
Sbjct: 2 ITIKSMAAMAVYLLFIVAIAAALEIRPSEHGLEFQSLPAAGDKSSPEMRSFFGGTSSPTP 61
Query: 63 DTTLPLPKAMNSSEAPGWWTHRDGGNKRVRNALLVATAACGITGVTLLVGSTLFYIYK 120
+ LPLPKAMNSSEAPGWWTHRDGG+KR+RNALLVATAACGITGVTLLVGSTL+YI+K
Sbjct: 62 EVALPLPKAMNSSEAPGWWTHRDGGDKRLRNALLVATAACGITGVTLLVGSTLYYIFK 119
BLAST of Cp4.1LG10g06470 vs. ExPASy TrEMBL
Match:
A0A6J1CZN3 (uncharacterized protein LOC111015729 OS=Momordica charantia OX=3673 GN=LOC111015729 PE=4 SV=1)
HSP 1 Score: 182 bits (462), Expect = 9.17e-57
Identity = 98/122 (80.33%), Positives = 104/122 (85.25%), Query Frame = 0
Query: 3 IPIKPVAAMAVFLPLIVAV----ASALEIRPSEHGLEFQSPPPAGEKSSPQMRSFFVGTS 62
IPIK AAMAV LPLIVAV +ALEIRPSEHGLEFQSPPPAG+KSSP+M SFF G S
Sbjct: 2 IPIKIAAAMAVCLPLIVAVLAVKTTALEIRPSEHGLEFQSPPPAGDKSSPEMLSFFGGRS 61
Query: 63 SPIPDTTLPLPKAMNSSEAPGWWTHRDGGNKRVRNALLVATAACGITGVTLLVGSTLFYI 120
SP PD LPLPKAMNSSEAPGWWT RDGG+ R+RNALLVATAA GITGVTLLVGS LFY+
Sbjct: 62 SPTPDAALPLPKAMNSSEAPGWWTRRDGGDTRLRNALLVATAAFGITGVTLLVGSVLFYV 121
BLAST of Cp4.1LG10g06470 vs. ExPASy TrEMBL
Match:
A0A0A0KM65 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G525480 PE=4 SV=1)
HSP 1 Score: 179 bits (453), Expect = 1.89e-55
Identity = 87/118 (73.73%), Positives = 100/118 (84.75%), Query Frame = 0
Query: 3 IPIKPVAAMAVFLPLIVAVASALEIRPSEHGLEFQSPPPAGEKSSPQMRSFFVGTSSPIP 62
IPIK AA+ F LI ++A+ EIRPSEHGLEFQSPPP G+KSSP+MRSFF G +SP P
Sbjct: 2 IPIKSSAAIVAFFSLIASIAAVSEIRPSEHGLEFQSPPPVGDKSSPEMRSFFGGIASPTP 61
Query: 63 DTTLPLPKAMNSSEAPGWWTHRDGGNKRVRNALLVATAACGITGVTLLVGSTLFYIYK 120
+ LP+PK +NSSE+PGWW H DGGNKR+RNALLVATAACGITGVTLLVGSTLFYI+K
Sbjct: 62 EVALPIPKTLNSSESPGWWNHHDGGNKRLRNALLVATAACGITGVTLLVGSTLFYIFK 119
BLAST of Cp4.1LG10g06470 vs. TAIR 10
Match:
AT4G21740.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G30515.1); Has 20 Blast hits to 20 proteins in 4 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 20; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )
HSP 1 Score: 65.1 bits (157), Expect = 4.9e-11
Identity = 46/117 (39.32%), Positives = 64/117 (54.70%), Query Frame = 0
Query: 14 FLPLIVAVASALEIRPSEHGLEFQSPPPAGEKSSP--QMRSFF--VGTSSPIPDTTLPLP 73
FL + + A E+RPS+HGL++Q P E SP +M+SFF +SSP P LP
Sbjct: 22 FLVIFTGNSLAGELRPSDHGLQYQFSSPPTESHSPPGKMKSFFGDSHSSSPPPSHPQLLP 81
Query: 74 K--AMNSSEAPGWWTHRDGGNKR----VRNALLVATAACGITGVTLLVGSTLFYIYK 121
K A + + WW RDG R +R+ L A+ CG++GV LLV TL Y ++
Sbjct: 82 KATAADGGDDDSWW--RDGAGIRRDHVMRHVFLAASIICGVSGVALLVVFTLIYFFR 136
BLAST of Cp4.1LG10g06470 vs. TAIR 10
Match:
AT1G30515.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: endomembrane system; EXPRESSED IN: 16 plant structures; EXPRESSED DURING: 6 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G21740.1); Has 20 Blast hits to 20 proteins in 4 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 20; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )
HSP 1 Score: 48.1 bits (113), Expect = 6.2e-06
Identity = 39/117 (33.33%), Positives = 65/117 (55.56%), Query Frame = 0
Query: 7 PVAAMAVFLPLIV--AVASALEIRPSEHGLEFQSPPPAGEKSSPQMRSFFVGTSSPIPDT 66
P+ +M + +++ + +A E+RPS+HGLE+ P GE S +M SFF G S T
Sbjct: 11 PLISMLIMFIIVLESTIINARELRPSDHGLEYYYEP--GESS--EMTSFF-GPPSSNDLT 70
Query: 67 TLPLPKAM---NSSEAPGWWTHRDGGNKRVRN-ALLVATAACGITGVTLLVGSTLFY 118
++ P + ++ ++P +D + RV N L+V + CG++GV L+V S L Y
Sbjct: 71 SISSPSSSILPSAVKSPMKTLSKDQDDDRVMNHVLVVGSLVCGLSGVALMVASALIY 122
The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
KAG6581475.1 | 5.67e-76 | 99.17 | hypothetical protein SDJN03_21477, partial [Cucurbita argyrosperma subsp. sorori...
XP_022925668.1 | 2.00e-74 | 98.33 | uncharacterized protein LOC111433017 [Cucurbita moschata]
XP_023543933.1 | 1.75e-70 | 100.00 | uncharacterized protein LOC111803654 [Cucurbita pepo subsp. pepo]
XP_023518243.1 | 6.59e-61 | 83.05 | uncharacterized protein LOC111781779 [Cucurbita pepo subsp. pepo]
XP_023003567.1 | 1.09e-59 | 81.36 | uncharacterized protein LOC111497130 [Cucurbita maxima]
Match Name | E-value | Identity (%) | Description
A0A6J1EFW4 | 9.68e-75 | 98.33 | uncharacterized protein LOC111433017 OS=Cucurbita moschata OX=3662 GN=LOC1114330...
A0A6J1KMY3 | 5.28e-60 | 81.36 | uncharacterized protein LOC111497130 OS=Cucurbita maxima OX=3661 GN=LOC111497130...
A0A6J1EFQ9 | 1.06e-59 | 82.20 | uncharacterized protein LOC111433768 OS=Cucurbita moschata OX=3662 GN=LOC1114337...
A0A6J1CZN3 | 9.17e-57 | 80.33 | uncharacterized protein LOC111015729 OS=Momordica charantia OX=3673 GN=LOC111015...
A0A0A0KM65 | 1.89e-55 | 73.73 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G525480 PE=4 SV=1
Match Name | E-value | Identity (%) | Description
AT4G21740.1 | 4.9e-11 | 39.32 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT1G30515.1 | 6.2e-06 | 33.33 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...