Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
ATGTCTTCGATCCCCTACCTCTCCTTCTTCCCCACAGATCCGCCGCTCATCATGTTAGCCGCCGCCTCTTCCTGGCTTCATTCCAGCCGGAGCCGGAGCCGTAGCCGCTTTCTCTTTCTCTTGGTTTGTTCCCCTCTCTTCATTCCCATTTTCTGCGCTACTTTCCCCTTCATCTTTGCCATAGATCTCTGCATCCGCCTTGCTCGTCACAGGAGAAGGATATATCTTCACGATTCACCAGAAATCGAACGCTTGCAGGAATGTGAGGAAGGCGGCTGCAGACCACCGCTTCCGGAGCAGATTGGCGGTGACAGTGGGGAGGAGATCGGCTTATTACAGAGGTACTTGGATGATCAGCTACTGCTCGTTCGTTCTGTTTATGAATGTGCTGATTGTAGCGACCACTTAAATGGGGATCCTCCATTTTGTAACATTAAAAATAGTAATTTGACTCCACTATTAGGTTGA
mRNA sequence
ATGTCTTCGATCCCCTACCTCTCCTTCTTCCCCACAGATCCGCCGCTCATCATGTTAGCCGCCGCCTCTTCCTGGCTTCATTCCAGCCGGAGCCGGAGCCGTAGCCGCTTTCTCTTTCTCTTGGTTTGTTCCCCTCTCTTCATTCCCATTTTCTGCGCTACTTTCCCCTTCATCTTTGCCATAGATCTCTGCATCCGCCTTGCTCGTCACAGGAGAAGGATATATCTTCACGATTCACCAGAAATCGAACGCTTGCAGGAATGTGAGGAAGGCGGCTGCAGACCACCGCTTCCGGAGCAGATTGGCGGTGACAGTGGGGAGGAGATCGGCTTATTACAGAGGTACTTGGATGATCAGCTACTGCTCGTTCGTTCTGTTTATGAATGTGCTGATTGTAGCGACCACTTAAATGGGGATCCTCCATTTTGTAACATTAAAAATAGTAATTTGACTCCACTATTAGGTTGA
Coding sequence (CDS)
ATGTCTTCGATCCCCTACCTCTCCTTCTTCCCCACAGATCCGCCGCTCATCATGTTAGCCGCCGCCTCTTCCTGGCTTCATTCCAGCCGGAGCCGGAGCCGTAGCCGCTTTCTCTTTCTCTTGGTTTGTTCCCCTCTCTTCATTCCCATTTTCTGCGCTACTTTCCCCTTCATCTTTGCCATAGATCTCTGCATCCGCCTTGCTCGTCACAGGAGAAGGATATATCTTCACGATTCACCAGAAATCGAACGCTTGCAGGAATGTGAGGAAGGCGGCTGCAGACCACCGCTTCCGGAGCAGATTGGCGGTGACAGTGGGGAGGAGATCGGCTTATTACAGAGGTACTTGGATGATCAGCTACTGCTCGTTCGTTCTGTTTATGAATGTGCTGATTGTAGCGACCACTTAAATGGGGATCCTCCATTTTGTAACATTAAAAATAGTAATTTGACTCCACTATTAGGTTGA
Protein sequence
MSSIPYLSFFPTDPPLIMLAAASSWLHSSRSRSRSRFLFLLVCSPLFIPIFCATFPFIFAIDLCIRLARHRRRIYLHDSPEIERLQECEEGGCRPPLPEQIGGDSGEEIGLLQRYLDDQLLLVRSVYECADCSDHLNGDPPFCNIKNSNLTPLLG
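The protein sequence above appears to be the conceptual translation of the CDS. As a minimal sketch, assuming the standard genetic code (NCBI translation table 1) and reading frame 0, the relationship can be checked on the first nine and last five codons of the CDS. The names `CODON_TABLE` and `translate` are illustrative and not part of the database.

```python
# Standard genetic code (NCBI translation table 1), with bases ordered T, C, A, G.
# Assumption: this feature is translated with the standard table.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence in frame 0, stopping at the first stop codon."""
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[pos:pos + 3].upper()]
        if residue == "*":  # stop codon
            break
        protein.append(residue)
    return "".join(protein)


# First nine codons of the CDS listed above.
print(translate("ATGTCTTCGATCCCCTACCTCTCCTTC"))  # MSSIPYLSF
# Last five codons of the CDS, ending in the TGA stop codon.
print(translate("CCACTATTAGGTTGA"))  # PLLG
```

Both fragments agree with the start (MSSIPYLSF) and end (PLLG) of the listed protein sequence.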
Homology
BLAST of Cla97C11G215030 vs. NCBI nr
Match:
XP_022962190.1 (uncharacterized protein LOC111462719 [Cucurbita moschata] >KAG7033430.1 hypothetical protein SDJN02_07486, partial [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 209.1 bits (531), Expect = 2.6e-50
Identity = 109/159 (68.55%), Positives = 122/159 (76.73%), Query Frame = 0
Query: 1 MSSIPYLSFFPTDPPLIML----AAASSWLHSSRSRSRSRFLFLLVCSPLFIPIFCATFP 60
MSSIP L + P+IML AAAS WL S SRSRF LL+CSPL +PIFCATFP
Sbjct: 1 MSSIPNLPVLLANQPVIMLATAAAAASFWLRS----SRSRFFILLLCSPLLVPIFCATFP 60
Query: 61 FIFAIDLCIRLARHRRRIYLHDSPEIERLQECEEGGCRPPLPEQIGGDSGEEIGLLQRYL 120
I AI+LCIRLARHR RI L DSPE ERL+ CEEGGCR LPE++G D E+IGLLQRYL
Sbjct: 61 IICAIELCIRLARHRIRICLRDSPESERLRRCEEGGCRSALPEKVGDDGEEDIGLLQRYL 120
Query: 121 DDQLLLVRSVYECADCSDHLNGDPPFCNIKNSNLTPLLG 156
DDQLLLVRSVYEC DC+D L+GD FC+I+NSNL PLLG
Sbjct: 121 DDQLLLVRSVYECGDCADDLSGDARFCDIENSNLIPLLG 155
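In BLAST output, the identity and positives percentages are the number of matching (or positively scoring) columns divided by the alignment length, gaps included. A minimal sketch of that arithmetic for the HSP above, where `percent` is an illustrative helper name:

```python
def percent(count: int, alignment_length: int) -> float:
    """Return count as a percentage of the alignment length, rounded to 2 decimals."""
    return round(100.0 * count / alignment_length, 2)


# Values taken from the HSP above: 109 identical and 122 positive columns out of 159.
print(percent(109, 159))  # 68.55
print(percent(122, 159))  # 76.73
```

The alignment length (159) exceeds the query length (155 residues) because of the four gap columns introduced into the query.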
BLAST of Cla97C11G215030 vs. NCBI nr
Match:
XP_023550754.1 (uncharacterized protein LOC111808798 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 205.7 bits (522), Expect = 2.9e-49
Identity = 108/159 (67.92%), Positives = 121/159 (76.10%), Query Frame = 0
Query: 1 MSSIPYLSFFPTDPPLIML----AAASSWLHSSRSRSRSRFLFLLVCSPLFIPIFCATFP 60
MSSIP L + +IML AAAS WL S SRSRF LL+CSPL +PIFCATFP
Sbjct: 1 MSSIPNLPVLLANQHVIMLATAAAAASFWLRS----SRSRFFILLLCSPLLVPIFCATFP 60
Query: 61 FIFAIDLCIRLARHRRRIYLHDSPEIERLQECEEGGCRPPLPEQIGGDSGEEIGLLQRYL 120
I AI+LCIRLARHR RI L DSPE ERL+ CEEGGCR LPE++G D E+IGLLQRYL
Sbjct: 61 IICAIELCIRLARHRIRICLRDSPESERLRRCEEGGCRSALPEKVGDDGEEDIGLLQRYL 120
Query: 121 DDQLLLVRSVYECADCSDHLNGDPPFCNIKNSNLTPLLG 156
DDQLLLVRSVYEC DC+D L+GD FC+I+NSNL PLLG
Sbjct: 121 DDQLLLVRSVYECGDCADDLSGDARFCDIENSNLIPLLG 155
BLAST of Cla97C11G215030 vs. NCBI nr
Match:
XP_022990172.1 (uncharacterized protein LOC111487144 [Cucurbita maxima])
HSP 1 Score: 202.2 bits (513), Expect = 3.2e-48
Identity = 106/159 (66.67%), Positives = 119/159 (74.84%), Query Frame = 0
Query: 1 MSSIPYLSFFPTDPPLIML----AAASSWLHSSRSRSRSRFLFLLVCSPLFIPIFCATFP 60
MSSIP L + P+IML AAAS WL SRSRF LL+CSPL +PIFCATFP
Sbjct: 1 MSSIPKLPVLLANQPVIMLATAAAAASFWLRP----SRSRFFILLLCSPLLVPIFCATFP 60
Query: 61 FIFAIDLCIRLARHRRRIYLHDSPEIERLQECEEGGCRPPLPEQIGGDSGEEIGLLQRYL 120
I AI+LCIRLARHR RI L DSPE ERL+ CEEGGCR LPE++G D E+IGLL RYL
Sbjct: 61 IICAIELCIRLARHRIRICLRDSPESERLRRCEEGGCRSALPEKVGDDGEEDIGLLPRYL 120
Query: 121 DDQLLLVRSVYECADCSDHLNGDPPFCNIKNSNLTPLLG 156
DDQLLLVRSVY C DC+D L+GD FC+I+NSNL PLLG
Sbjct: 121 DDQLLLVRSVYVCGDCADDLSGDARFCDIENSNLIPLLG 155
BLAST of Cla97C11G215030 vs. NCBI nr
Match:
XP_022152501.1 (uncharacterized protein LOC111020213 [Momordica charantia])
HSP 1 Score: 199.5 bits (506), Expect = 2.1e-47
Identity = 108/159 (67.92%), Positives = 121/159 (76.10%), Query Frame = 0
Query: 1 MSSIPYLSFFPTDPPLIML----AAASSWLHSSRSRSRSRFLFLLVCSPLFIPIFCATFP 60
M SI LS F +PP+IML AAASSWL S SR+RF+F+L+CSPL +PIFCATFP
Sbjct: 1 MYSILNLSVFHKNPPVIMLATAAAAASSWLCS----SRTRFVFVLLCSPLLVPIFCATFP 60
Query: 61 FIFAIDLCIRLARHRRRIYLHDSPEIERLQECEEGGCRPPLPEQIGGDSGEEIGLLQRYL 120
FI AI+LCIRLARHRR I L DSPEIERL+ CEEGGC LPE D EEIGLLQRYL
Sbjct: 61 FICAIELCIRLARHRRSISLRDSPEIERLRRCEEGGCAAALPE----DGEEEIGLLQRYL 120
Query: 121 DDQLLLVRSVYECADCSDHLNGDPPFCNIKNSNLTPLLG 156
DDQLLLVRSVYEC D + +NGD FC+ KNSN+ PLLG
Sbjct: 121 DDQLLLVRSVYECGDGDNDINGDARFCDFKNSNVIPLLG 151
BLAST of Cla97C11G215030 vs. NCBI nr
Match:
KAG6602743.1 (hypothetical protein SDJN03_07976, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 199.1 bits (505), Expect = 2.7e-47
Identity = 98/136 (72.06%), Positives = 110/136 (80.88%), Query Frame = 0
Query: 20 AAASSWLHSSRSRSRSRFLFLLVCSPLFIPIFCATFPFIFAIDLCIRLARHRRRIYLHDS 79
AAAS WL S SRSRF LL+CSPL +PIFCATFP I AI+LCIRLARHR RI L DS
Sbjct: 7 AAASFWLRS----SRSRFFILLLCSPLLVPIFCATFPIICAIELCIRLARHRIRICLRDS 66
Query: 80 PEIERLQECEEGGCRPPLPEQIGGDSGEEIGLLQRYLDDQLLLVRSVYECADCSDHLNGD 139
PE ERL+ CEEGGCR LPE++G D E+IGLLQRYLDDQLLLVRSVYEC DC+D L+GD
Sbjct: 67 PESERLRRCEEGGCRSALPEKVGDDGEEDIGLLQRYLDDQLLLVRSVYECGDCADDLSGD 126
Query: 140 PPFCNIKNSNLTPLLG 156
FC+++NSNL PLLG
Sbjct: 127 ARFCDLENSNLIPLLG 138
BLAST of Cla97C11G215030 vs. ExPASy TrEMBL
Match:
A0A6J1HEC7 (uncharacterized protein LOC111462719 OS=Cucurbita moschata OX=3662 GN=LOC111462719 PE=4 SV=1)
HSP 1 Score: 209.1 bits (531), Expect = 1.3e-50
Identity = 109/159 (68.55%), Positives = 122/159 (76.73%), Query Frame = 0
Query: 1 MSSIPYLSFFPTDPPLIML----AAASSWLHSSRSRSRSRFLFLLVCSPLFIPIFCATFP 60
MSSIP L + P+IML AAAS WL S SRSRF LL+CSPL +PIFCATFP
Sbjct: 1 MSSIPNLPVLLANQPVIMLATAAAAASFWLRS----SRSRFFILLLCSPLLVPIFCATFP 60
Query: 61 FIFAIDLCIRLARHRRRIYLHDSPEIERLQECEEGGCRPPLPEQIGGDSGEEIGLLQRYL 120
I AI+LCIRLARHR RI L DSPE ERL+ CEEGGCR LPE++G D E+IGLLQRYL
Sbjct: 61 IICAIELCIRLARHRIRICLRDSPESERLRRCEEGGCRSALPEKVGDDGEEDIGLLQRYL 120
Query: 121 DDQLLLVRSVYECADCSDHLNGDPPFCNIKNSNLTPLLG 156
DDQLLLVRSVYEC DC+D L+GD FC+I+NSNL PLLG
Sbjct: 121 DDQLLLVRSVYECGDCADDLSGDARFCDIENSNLIPLLG 155
BLAST of Cla97C11G215030 vs. ExPASy TrEMBL
Match:
A0A6J1JHX7 (uncharacterized protein LOC111487144 OS=Cucurbita maxima OX=3661 GN=LOC111487144 PE=4 SV=1)
HSP 1 Score: 202.2 bits (513), Expect = 1.6e-48
Identity = 106/159 (66.67%), Positives = 119/159 (74.84%), Query Frame = 0
Query: 1 MSSIPYLSFFPTDPPLIML----AAASSWLHSSRSRSRSRFLFLLVCSPLFIPIFCATFP 60
MSSIP L + P+IML AAAS WL SRSRF LL+CSPL +PIFCATFP
Sbjct: 1 MSSIPKLPVLLANQPVIMLATAAAAASFWLRP----SRSRFFILLLCSPLLVPIFCATFP 60
Query: 61 FIFAIDLCIRLARHRRRIYLHDSPEIERLQECEEGGCRPPLPEQIGGDSGEEIGLLQRYL 120
I AI+LCIRLARHR RI L DSPE ERL+ CEEGGCR LPE++G D E+IGLL RYL
Sbjct: 61 IICAIELCIRLARHRIRICLRDSPESERLRRCEEGGCRSALPEKVGDDGEEDIGLLPRYL 120
Query: 121 DDQLLLVRSVYECADCSDHLNGDPPFCNIKNSNLTPLLG 156
DDQLLLVRSVY C DC+D L+GD FC+I+NSNL PLLG
Sbjct: 121 DDQLLLVRSVYVCGDCADDLSGDARFCDIENSNLIPLLG 155
BLAST of Cla97C11G215030 vs. ExPASy TrEMBL
Match:
A0A6J1DHX6 (uncharacterized protein LOC111020213 OS=Momordica charantia OX=3673 GN=LOC111020213 PE=4 SV=1)
HSP 1 Score: 199.5 bits (506), Expect = 1.0e-47
Identity = 108/159 (67.92%), Positives = 121/159 (76.10%), Query Frame = 0
Query: 1 MSSIPYLSFFPTDPPLIML----AAASSWLHSSRSRSRSRFLFLLVCSPLFIPIFCATFP 60
M SI LS F +PP+IML AAASSWL S SR+RF+F+L+CSPL +PIFCATFP
Sbjct: 1 MYSILNLSVFHKNPPVIMLATAAAAASSWLCS----SRTRFVFVLLCSPLLVPIFCATFP 60
Query: 61 FIFAIDLCIRLARHRRRIYLHDSPEIERLQECEEGGCRPPLPEQIGGDSGEEIGLLQRYL 120
FI AI+LCIRLARHRR I L DSPEIERL+ CEEGGC LPE D EEIGLLQRYL
Sbjct: 61 FICAIELCIRLARHRRSISLRDSPEIERLRRCEEGGCAAALPE----DGEEEIGLLQRYL 120
Query: 121 DDQLLLVRSVYECADCSDHLNGDPPFCNIKNSNLTPLLG 156
DDQLLLVRSVYEC D + +NGD FC+ KNSN+ PLLG
Sbjct: 121 DDQLLLVRSVYECGDGDNDINGDARFCDFKNSNVIPLLG 151
BLAST of Cla97C11G215030 vs. ExPASy TrEMBL
Match:
M5XT02 (Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_1G555000 PE=4 SV=1)
HSP 1 Score: 116.3 bits (290), Expect = 1.1e-22
Identity = 68/119 (57.14%), Positives = 80/119 (67.23%), Query Frame = 0
Query: 15 PLIMLAAASSWLHSSRSRSRSRFLFLLVCSPLFIPIFCATFPFIFAIDLCIRLARHRRRI 74
PLIMLAAASS R+R RSR+LFLL+CSP+ IP CATFPF+ A +LC+RL R RR
Sbjct: 62 PLIMLAAASSSTTWLRAR-RSRYLFLLICSPILIPFLCATFPFLCAAELCLRLCRRRRIK 121
Query: 75 YLHDSPE--IERLQECEEGGCRPPLPEQIGGDSGEEIGLLQRYLDDQLLLVRSVYECAD 132
H + E ERL+ CEEG G EE+GLLQRYL+DQLLLV SVY+C D
Sbjct: 122 NAHGADEEVEERLRRCEEG----------RGGEREEMGLLQRYLEDQLLLVGSVYDCGD 169
BLAST of Cla97C11G215030 vs. ExPASy TrEMBL
Match:
A0A6J5WCG8 (Uncharacterized protein OS=Prunus armeniaca OX=36596 GN=CURHAP_LOCUS10400 PE=4 SV=1)
HSP 1 Score: 116.3 bits (290), Expect = 1.1e-22
Identity = 68/119 (57.14%), Positives = 80/119 (67.23%), Query Frame = 0
Query: 15 PLIMLAAASSWLHSSRSRSRSRFLFLLVCSPLFIPIFCATFPFIFAIDLCIRLARHRRRI 74
PLIMLAAASS R+R RSR+LFLL+CSP+ IP CATFPF+ A +LC+RL R RR
Sbjct: 62 PLIMLAAASSSTTWLRAR-RSRYLFLLICSPILIPFLCATFPFLCAAELCLRLCRRRRIK 121
Query: 75 YLHDSPE--IERLQECEEGGCRPPLPEQIGGDSGEEIGLLQRYLDDQLLLVRSVYECAD 132
H + E ERL+ CEEG G EE+GLLQRYL+DQLLLV SVY+C D
Sbjct: 122 TAHGADEEVEERLRRCEEG----------RGGEREEMGLLQRYLEDQLLLVGSVYDCGD 169
BLAST of Cla97C11G215030 vs. TAIR 10
Match:
AT1G35430.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: mitochondrion; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G09170.1); Has 23 Blast hits to 23 proteins in 5 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 23; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )
HSP 1 Score: 87.0 bits (214), Expect = 1.4e-17
Identity = 51/101 (50.50%), Positives = 66/101 (65.35%), Query Frame = 0
Query: 34 RSRFLFLLVCSPLFIPIFCATFPFIFAIDLCIRLARHRRRIYLHDSPEIE--RLQECEEG 93
RSRF+F L+CSPL IPI CA+ P + A+++ RL R R + + E + RL+ CEEG
Sbjct: 12 RSRFVFFLLCSPLLIPILCASIPILCAVEIFSRL-RSRHPWFAKSTAEEDDLRLRRCEEG 71
Query: 94 -GCRPPLPEQIGGDSGEEIGLLQRYLDDQLLLVRSVYECAD 132
GC G D EE GLLQRYL+DQL+LVRSVY+C +
Sbjct: 72 CGCG-------GFDEPEEAGLLQRYLEDQLVLVRSVYDCGE 104
BLAST of Cla97C11G215030 vs. TAIR 10
Match:
AT4G09170.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G35430.1); Has 23 Blast hits to 23 proteins in 5 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 23; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )
HSP 1 Score: 55.1 bits (131), Expect = 5.9e-08
Identity = 45/134 (33.58%), Positives = 67/134 (50.00%), Query Frame = 0
Query: 17 IMLAAASSWLHSSRSRSRSRFLFLLVCSPLFIPIFCATFPFIFAIDLCIRLARHRRR--- 76
+ML AS + S SR +FL++CSPL C + P + A+++ RL +
Sbjct: 13 VMLTCASCF-----STRWSRIIFLILCSPL----LCLSIPLLCAVEIFSRLLSRIVKPPP 72
Query: 77 -----IYLHDSPEIERLQECEEGGCRPPLPEQIGGDSGEEIGLLQRYLDDQLLLVRSVYE 136
L D + RL++CEEG + E+ D EE GLL RYLD+QL L R++++
Sbjct: 73 SSAVSKVLVDDEDNLRLRQCEEGF---GMKEE---DENEESGLLHRYLDNQLSLARTIFD 131
Query: 137 CADCSDHLNGDPPF 143
DH + PF
Sbjct: 133 DDGDRDHDSIRVPF 131
The following BLAST results are available for this feature:
NCBI nr
Match Name | E-value | Identity (%) | Description
XP_022962190.1 | 2.6e-50 | 68.55 | uncharacterized protein LOC111462719 [Cucurbita moschata] >KAG7033430.1 hypothet...
XP_023550754.1 | 2.9e-49 | 67.92 | uncharacterized protein LOC111808798 [Cucurbita pepo subsp. pepo]
XP_022990172.1 | 3.2e-48 | 66.67 | uncharacterized protein LOC111487144 [Cucurbita maxima]
XP_022152501.1 | 2.1e-47 | 67.92 | uncharacterized protein LOC111020213 [Momordica charantia]
KAG6602743.1 | 2.7e-47 | 72.06 | hypothetical protein SDJN03_07976, partial [Cucurbita argyrosperma subsp. sorori...
ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A6J1HEC7 | 1.3e-50 | 68.55 | uncharacterized protein LOC111462719 OS=Cucurbita moschata OX=3662 GN=LOC1114627...
A0A6J1JHX7 | 1.6e-48 | 66.67 | uncharacterized protein LOC111487144 OS=Cucurbita maxima OX=3661 GN=LOC111487144...
A0A6J1DHX6 | 1.0e-47 | 67.92 | uncharacterized protein LOC111020213 OS=Momordica charantia OX=3673 GN=LOC111020...
M5XT02 | 1.1e-22 | 57.14 | Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_1G555000 PE=4 SV=1
A0A6J5WCG8 | 1.1e-22 | 57.14 | Uncharacterized protein OS=Prunus armeniaca OX=36596 GN=CURHAP_LOCUS10400 PE=4 S...
TAIR 10
Match Name | E-value | Identity (%) | Description
AT1G35430.1 | 1.4e-17 | 50.50 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT4G09170.1 | 5.9e-08 | 33.58 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...