Cla97C11G215030 (gene) Watermelon (97103) v2.5

Overview
NameCla97C11G215030
Typegene
OrganismCitrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
DescriptionJHL25P11.1 protein
LocationCla97Chr11: 10830616 .. 10831083 (-)
RNA-Seq ExpressionCla97C11G215030
SyntenyCla97C11G215030
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGTCTTCGATCCCCTACCTCTCCTTCTTCCCCACAGATCCGCCGCTCATCATGTTAGCCGCCGCCTCTTCCTGGCTTCATTCCAGCCGGAGCCGGAGCCGTAGCCGCTTTCTCTTTCTCTTGGTTTGTTCCCCTCTCTTCATTCCCATTTTCTGCGCTACTTTCCCCTTCATCTTTGCCATAGATCTCTGCATCCGCCTTGCTCGTCACAGGAGAAGGATATATCTTCACGATTCACCAGAAATCGAACGCTTGCAGGAATGTGAGGAAGGCGGCTGCAGACCACCGCTTCCGGAGCAGATTGGCGGTGACAGTGGGGAGGAGATCGGCTTATTACAGAGGTACTTGGATGATCAGCTACTGCTCGTTCGTTCTGTTTATGAATGTGCTGATTGTAGCGACCACTTAAATGGGGATCCTCCATTTTGTAACATTAAAAATAGTAATTTGACTCCACTATTAGGTTGA

mRNA sequence

ATGTCTTCGATCCCCTACCTCTCCTTCTTCCCCACAGATCCGCCGCTCATCATGTTAGCCGCCGCCTCTTCCTGGCTTCATTCCAGCCGGAGCCGGAGCCGTAGCCGCTTTCTCTTTCTCTTGGTTTGTTCCCCTCTCTTCATTCCCATTTTCTGCGCTACTTTCCCCTTCATCTTTGCCATAGATCTCTGCATCCGCCTTGCTCGTCACAGGAGAAGGATATATCTTCACGATTCACCAGAAATCGAACGCTTGCAGGAATGTGAGGAAGGCGGCTGCAGACCACCGCTTCCGGAGCAGATTGGCGGTGACAGTGGGGAGGAGATCGGCTTATTACAGAGGTACTTGGATGATCAGCTACTGCTCGTTCGTTCTGTTTATGAATGTGCTGATTGTAGCGACCACTTAAATGGGGATCCTCCATTTTGTAACATTAAAAATAGTAATTTGACTCCACTATTAGGTTGA

Coding sequence (CDS)

ATGTCTTCGATCCCCTACCTCTCCTTCTTCCCCACAGATCCGCCGCTCATCATGTTAGCCGCCGCCTCTTCCTGGCTTCATTCCAGCCGGAGCCGGAGCCGTAGCCGCTTTCTCTTTCTCTTGGTTTGTTCCCCTCTCTTCATTCCCATTTTCTGCGCTACTTTCCCCTTCATCTTTGCCATAGATCTCTGCATCCGCCTTGCTCGTCACAGGAGAAGGATATATCTTCACGATTCACCAGAAATCGAACGCTTGCAGGAATGTGAGGAAGGCGGCTGCAGACCACCGCTTCCGGAGCAGATTGGCGGTGACAGTGGGGAGGAGATCGGCTTATTACAGAGGTACTTGGATGATCAGCTACTGCTCGTTCGTTCTGTTTATGAATGTGCTGATTGTAGCGACCACTTAAATGGGGATCCTCCATTTTGTAACATTAAAAATAGTAATTTGACTCCACTATTAGGTTGA

Protein sequence

MSSIPYLSFFPTDPPLIMLAAASSWLHSSRSRSRSRFLFLLVCSPLFIPIFCATFPFIFAIDLCIRLARHRRRIYLHDSPEIERLQECEEGGCRPPLPEQIGGDSGEEIGLLQRYLDDQLLLVRSVYECADCSDHLNGDPPFCNIKNSNLTPLLG
Homology
BLAST of Cla97C11G215030 vs. NCBI nr
Match: XP_022962190.1 (uncharacterized protein LOC111462719 [Cucurbita moschata] >KAG7033430.1 hypothetical protein SDJN02_07486, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 209.1 bits (531), Expect = 2.6e-50
Identity = 109/159 (68.55%), Postives = 122/159 (76.73%), Query Frame = 0

Query: 1   MSSIPYLSFFPTDPPLIML----AAASSWLHSSRSRSRSRFLFLLVCSPLFIPIFCATFP 60
           MSSIP L     + P+IML    AAAS WL S    SRSRF  LL+CSPL +PIFCATFP
Sbjct: 1   MSSIPNLPVLLANQPVIMLATAAAAASFWLRS----SRSRFFILLLCSPLLVPIFCATFP 60

Query: 61  FIFAIDLCIRLARHRRRIYLHDSPEIERLQECEEGGCRPPLPEQIGGDSGEEIGLLQRYL 120
            I AI+LCIRLARHR RI L DSPE ERL+ CEEGGCR  LPE++G D  E+IGLLQRYL
Sbjct: 61  IICAIELCIRLARHRIRICLRDSPESERLRRCEEGGCRSALPEKVGDDGEEDIGLLQRYL 120

Query: 121 DDQLLLVRSVYECADCSDHLNGDPPFCNIKNSNLTPLLG 156
           DDQLLLVRSVYEC DC+D L+GD  FC+I+NSNL PLLG
Sbjct: 121 DDQLLLVRSVYECGDCADDLSGDARFCDIENSNLIPLLG 155

BLAST of Cla97C11G215030 vs. NCBI nr
Match: XP_023550754.1 (uncharacterized protein LOC111808798 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 205.7 bits (522), Expect = 2.9e-49
Identity = 108/159 (67.92%), Postives = 121/159 (76.10%), Query Frame = 0

Query: 1   MSSIPYLSFFPTDPPLIML----AAASSWLHSSRSRSRSRFLFLLVCSPLFIPIFCATFP 60
           MSSIP L     +  +IML    AAAS WL S    SRSRF  LL+CSPL +PIFCATFP
Sbjct: 1   MSSIPNLPVLLANQHVIMLATAAAAASFWLRS----SRSRFFILLLCSPLLVPIFCATFP 60

Query: 61  FIFAIDLCIRLARHRRRIYLHDSPEIERLQECEEGGCRPPLPEQIGGDSGEEIGLLQRYL 120
            I AI+LCIRLARHR RI L DSPE ERL+ CEEGGCR  LPE++G D  E+IGLLQRYL
Sbjct: 61  IICAIELCIRLARHRIRICLRDSPESERLRRCEEGGCRSALPEKVGDDGEEDIGLLQRYL 120

Query: 121 DDQLLLVRSVYECADCSDHLNGDPPFCNIKNSNLTPLLG 156
           DDQLLLVRSVYEC DC+D L+GD  FC+I+NSNL PLLG
Sbjct: 121 DDQLLLVRSVYECGDCADDLSGDARFCDIENSNLIPLLG 155

BLAST of Cla97C11G215030 vs. NCBI nr
Match: XP_022990172.1 (uncharacterized protein LOC111487144 [Cucurbita maxima])

HSP 1 Score: 202.2 bits (513), Expect = 3.2e-48
Identity = 106/159 (66.67%), Postives = 119/159 (74.84%), Query Frame = 0

Query: 1   MSSIPYLSFFPTDPPLIML----AAASSWLHSSRSRSRSRFLFLLVCSPLFIPIFCATFP 60
           MSSIP L     + P+IML    AAAS WL      SRSRF  LL+CSPL +PIFCATFP
Sbjct: 1   MSSIPKLPVLLANQPVIMLATAAAAASFWLRP----SRSRFFILLLCSPLLVPIFCATFP 60

Query: 61  FIFAIDLCIRLARHRRRIYLHDSPEIERLQECEEGGCRPPLPEQIGGDSGEEIGLLQRYL 120
            I AI+LCIRLARHR RI L DSPE ERL+ CEEGGCR  LPE++G D  E+IGLL RYL
Sbjct: 61  IICAIELCIRLARHRIRICLRDSPESERLRRCEEGGCRSALPEKVGDDGEEDIGLLPRYL 120

Query: 121 DDQLLLVRSVYECADCSDHLNGDPPFCNIKNSNLTPLLG 156
           DDQLLLVRSVY C DC+D L+GD  FC+I+NSNL PLLG
Sbjct: 121 DDQLLLVRSVYVCGDCADDLSGDARFCDIENSNLIPLLG 155

BLAST of Cla97C11G215030 vs. NCBI nr
Match: XP_022152501.1 (uncharacterized protein LOC111020213 [Momordica charantia])

HSP 1 Score: 199.5 bits (506), Expect = 2.1e-47
Identity = 108/159 (67.92%), Postives = 121/159 (76.10%), Query Frame = 0

Query: 1   MSSIPYLSFFPTDPPLIML----AAASSWLHSSRSRSRSRFLFLLVCSPLFIPIFCATFP 60
           M SI  LS F  +PP+IML    AAASSWL S    SR+RF+F+L+CSPL +PIFCATFP
Sbjct: 1   MYSILNLSVFHKNPPVIMLATAAAAASSWLCS----SRTRFVFVLLCSPLLVPIFCATFP 60

Query: 61  FIFAIDLCIRLARHRRRIYLHDSPEIERLQECEEGGCRPPLPEQIGGDSGEEIGLLQRYL 120
           FI AI+LCIRLARHRR I L DSPEIERL+ CEEGGC   LPE    D  EEIGLLQRYL
Sbjct: 61  FICAIELCIRLARHRRSISLRDSPEIERLRRCEEGGCAAALPE----DGEEEIGLLQRYL 120

Query: 121 DDQLLLVRSVYECADCSDHLNGDPPFCNIKNSNLTPLLG 156
           DDQLLLVRSVYEC D  + +NGD  FC+ KNSN+ PLLG
Sbjct: 121 DDQLLLVRSVYECGDGDNDINGDARFCDFKNSNVIPLLG 151

BLAST of Cla97C11G215030 vs. NCBI nr
Match: KAG6602743.1 (hypothetical protein SDJN03_07976, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 199.1 bits (505), Expect = 2.7e-47
Identity = 98/136 (72.06%), Postives = 110/136 (80.88%), Query Frame = 0

Query: 20  AAASSWLHSSRSRSRSRFLFLLVCSPLFIPIFCATFPFIFAIDLCIRLARHRRRIYLHDS 79
           AAAS WL S    SRSRF  LL+CSPL +PIFCATFP I AI+LCIRLARHR RI L DS
Sbjct: 7   AAASFWLRS----SRSRFFILLLCSPLLVPIFCATFPIICAIELCIRLARHRIRICLRDS 66

Query: 80  PEIERLQECEEGGCRPPLPEQIGGDSGEEIGLLQRYLDDQLLLVRSVYECADCSDHLNGD 139
           PE ERL+ CEEGGCR  LPE++G D  E+IGLLQRYLDDQLLLVRSVYEC DC+D L+GD
Sbjct: 67  PESERLRRCEEGGCRSALPEKVGDDGEEDIGLLQRYLDDQLLLVRSVYECGDCADDLSGD 126

Query: 140 PPFCNIKNSNLTPLLG 156
             FC+++NSNL PLLG
Sbjct: 127 ARFCDLENSNLIPLLG 138

BLAST of Cla97C11G215030 vs. ExPASy TrEMBL
Match: A0A6J1HEC7 (uncharacterized protein LOC111462719 OS=Cucurbita moschata OX=3662 GN=LOC111462719 PE=4 SV=1)

HSP 1 Score: 209.1 bits (531), Expect = 1.3e-50
Identity = 109/159 (68.55%), Postives = 122/159 (76.73%), Query Frame = 0

Query: 1   MSSIPYLSFFPTDPPLIML----AAASSWLHSSRSRSRSRFLFLLVCSPLFIPIFCATFP 60
           MSSIP L     + P+IML    AAAS WL S    SRSRF  LL+CSPL +PIFCATFP
Sbjct: 1   MSSIPNLPVLLANQPVIMLATAAAAASFWLRS----SRSRFFILLLCSPLLVPIFCATFP 60

Query: 61  FIFAIDLCIRLARHRRRIYLHDSPEIERLQECEEGGCRPPLPEQIGGDSGEEIGLLQRYL 120
            I AI+LCIRLARHR RI L DSPE ERL+ CEEGGCR  LPE++G D  E+IGLLQRYL
Sbjct: 61  IICAIELCIRLARHRIRICLRDSPESERLRRCEEGGCRSALPEKVGDDGEEDIGLLQRYL 120

Query: 121 DDQLLLVRSVYECADCSDHLNGDPPFCNIKNSNLTPLLG 156
           DDQLLLVRSVYEC DC+D L+GD  FC+I+NSNL PLLG
Sbjct: 121 DDQLLLVRSVYECGDCADDLSGDARFCDIENSNLIPLLG 155

BLAST of Cla97C11G215030 vs. ExPASy TrEMBL
Match: A0A6J1JHX7 (uncharacterized protein LOC111487144 OS=Cucurbita maxima OX=3661 GN=LOC111487144 PE=4 SV=1)

HSP 1 Score: 202.2 bits (513), Expect = 1.6e-48
Identity = 106/159 (66.67%), Postives = 119/159 (74.84%), Query Frame = 0

Query: 1   MSSIPYLSFFPTDPPLIML----AAASSWLHSSRSRSRSRFLFLLVCSPLFIPIFCATFP 60
           MSSIP L     + P+IML    AAAS WL      SRSRF  LL+CSPL +PIFCATFP
Sbjct: 1   MSSIPKLPVLLANQPVIMLATAAAAASFWLRP----SRSRFFILLLCSPLLVPIFCATFP 60

Query: 61  FIFAIDLCIRLARHRRRIYLHDSPEIERLQECEEGGCRPPLPEQIGGDSGEEIGLLQRYL 120
            I AI+LCIRLARHR RI L DSPE ERL+ CEEGGCR  LPE++G D  E+IGLL RYL
Sbjct: 61  IICAIELCIRLARHRIRICLRDSPESERLRRCEEGGCRSALPEKVGDDGEEDIGLLPRYL 120

Query: 121 DDQLLLVRSVYECADCSDHLNGDPPFCNIKNSNLTPLLG 156
           DDQLLLVRSVY C DC+D L+GD  FC+I+NSNL PLLG
Sbjct: 121 DDQLLLVRSVYVCGDCADDLSGDARFCDIENSNLIPLLG 155

BLAST of Cla97C11G215030 vs. ExPASy TrEMBL
Match: A0A6J1DHX6 (uncharacterized protein LOC111020213 OS=Momordica charantia OX=3673 GN=LOC111020213 PE=4 SV=1)

HSP 1 Score: 199.5 bits (506), Expect = 1.0e-47
Identity = 108/159 (67.92%), Postives = 121/159 (76.10%), Query Frame = 0

Query: 1   MSSIPYLSFFPTDPPLIML----AAASSWLHSSRSRSRSRFLFLLVCSPLFIPIFCATFP 60
           M SI  LS F  +PP+IML    AAASSWL S    SR+RF+F+L+CSPL +PIFCATFP
Sbjct: 1   MYSILNLSVFHKNPPVIMLATAAAAASSWLCS----SRTRFVFVLLCSPLLVPIFCATFP 60

Query: 61  FIFAIDLCIRLARHRRRIYLHDSPEIERLQECEEGGCRPPLPEQIGGDSGEEIGLLQRYL 120
           FI AI+LCIRLARHRR I L DSPEIERL+ CEEGGC   LPE    D  EEIGLLQRYL
Sbjct: 61  FICAIELCIRLARHRRSISLRDSPEIERLRRCEEGGCAAALPE----DGEEEIGLLQRYL 120

Query: 121 DDQLLLVRSVYECADCSDHLNGDPPFCNIKNSNLTPLLG 156
           DDQLLLVRSVYEC D  + +NGD  FC+ KNSN+ PLLG
Sbjct: 121 DDQLLLVRSVYECGDGDNDINGDARFCDFKNSNVIPLLG 151

BLAST of Cla97C11G215030 vs. ExPASy TrEMBL
Match: M5XT02 (Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_1G555000 PE=4 SV=1)

HSP 1 Score: 116.3 bits (290), Expect = 1.1e-22
Identity = 68/119 (57.14%), Postives = 80/119 (67.23%), Query Frame = 0

Query: 15  PLIMLAAASSWLHSSRSRSRSRFLFLLVCSPLFIPIFCATFPFIFAIDLCIRLARHRRRI 74
           PLIMLAAASS     R+R RSR+LFLL+CSP+ IP  CATFPF+ A +LC+RL R RR  
Sbjct: 62  PLIMLAAASSSTTWLRAR-RSRYLFLLICSPILIPFLCATFPFLCAAELCLRLCRRRRIK 121

Query: 75  YLHDSPE--IERLQECEEGGCRPPLPEQIGGDSGEEIGLLQRYLDDQLLLVRSVYECAD 132
             H + E   ERL+ CEEG           G   EE+GLLQRYL+DQLLLV SVY+C D
Sbjct: 122 NAHGADEEVEERLRRCEEG----------RGGEREEMGLLQRYLEDQLLLVGSVYDCGD 169

BLAST of Cla97C11G215030 vs. ExPASy TrEMBL
Match: A0A6J5WCG8 (Uncharacterized protein OS=Prunus armeniaca OX=36596 GN=CURHAP_LOCUS10400 PE=4 SV=1)

HSP 1 Score: 116.3 bits (290), Expect = 1.1e-22
Identity = 68/119 (57.14%), Postives = 80/119 (67.23%), Query Frame = 0

Query: 15  PLIMLAAASSWLHSSRSRSRSRFLFLLVCSPLFIPIFCATFPFIFAIDLCIRLARHRRRI 74
           PLIMLAAASS     R+R RSR+LFLL+CSP+ IP  CATFPF+ A +LC+RL R RR  
Sbjct: 62  PLIMLAAASSSTTWLRAR-RSRYLFLLICSPILIPFLCATFPFLCAAELCLRLCRRRRIK 121

Query: 75  YLHDSPE--IERLQECEEGGCRPPLPEQIGGDSGEEIGLLQRYLDDQLLLVRSVYECAD 132
             H + E   ERL+ CEEG           G   EE+GLLQRYL+DQLLLV SVY+C D
Sbjct: 122 TAHGADEEVEERLRRCEEG----------RGGEREEMGLLQRYLEDQLLLVGSVYDCGD 169

BLAST of Cla97C11G215030 vs. TAIR 10
Match: AT1G35430.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: mitochondrion; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G09170.1); Has 23 Blast hits to 23 proteins in 5 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 23; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 87.0 bits (214), Expect = 1.4e-17
Identity = 51/101 (50.50%), Postives = 66/101 (65.35%), Query Frame = 0

Query: 34  RSRFLFLLVCSPLFIPIFCATFPFIFAIDLCIRLARHRRRIYLHDSPEIE--RLQECEEG 93
           RSRF+F L+CSPL IPI CA+ P + A+++  RL R R   +   + E +  RL+ CEEG
Sbjct: 12  RSRFVFFLLCSPLLIPILCASIPILCAVEIFSRL-RSRHPWFAKSTAEEDDLRLRRCEEG 71

Query: 94  -GCRPPLPEQIGGDSGEEIGLLQRYLDDQLLLVRSVYECAD 132
            GC        G D  EE GLLQRYL+DQL+LVRSVY+C +
Sbjct: 72  CGCG-------GFDEPEEAGLLQRYLEDQLVLVRSVYDCGE 104

BLAST of Cla97C11G215030 vs. TAIR 10
Match: AT4G09170.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G35430.1); Has 23 Blast hits to 23 proteins in 5 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 23; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 55.1 bits (131), Expect = 5.9e-08
Identity = 45/134 (33.58%), Postives = 67/134 (50.00%), Query Frame = 0

Query: 17  IMLAAASSWLHSSRSRSRSRFLFLLVCSPLFIPIFCATFPFIFAIDLCIRLARHRRR--- 76
           +ML  AS +     S   SR +FL++CSPL     C + P + A+++  RL     +   
Sbjct: 13  VMLTCASCF-----STRWSRIIFLILCSPL----LCLSIPLLCAVEIFSRLLSRIVKPPP 72

Query: 77  -----IYLHDSPEIERLQECEEGGCRPPLPEQIGGDSGEEIGLLQRYLDDQLLLVRSVYE 136
                  L D  +  RL++CEEG     + E+   D  EE GLL RYLD+QL L R++++
Sbjct: 73  SSAVSKVLVDDEDNLRLRQCEEGF---GMKEE---DENEESGLLHRYLDNQLSLARTIFD 131

Query: 137 CADCSDHLNGDPPF 143
                DH +   PF
Sbjct: 133 DDGDRDHDSIRVPF 131

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_022962190.12.6e-5068.55uncharacterized protein LOC111462719 [Cucurbita moschata] >KAG7033430.1 hypothet... [more]
XP_023550754.12.9e-4967.92uncharacterized protein LOC111808798 [Cucurbita pepo subsp. pepo][more]
XP_022990172.13.2e-4866.67uncharacterized protein LOC111487144 [Cucurbita maxima][more]
XP_022152501.12.1e-4767.92uncharacterized protein LOC111020213 [Momordica charantia][more]
KAG6602743.12.7e-4772.06hypothetical protein SDJN03_07976, partial [Cucurbita argyrosperma subsp. sorori... [more]
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A6J1HEC71.3e-5068.55uncharacterized protein LOC111462719 OS=Cucurbita moschata OX=3662 GN=LOC1114627... [more]
A0A6J1JHX71.6e-4866.67uncharacterized protein LOC111487144 OS=Cucurbita maxima OX=3661 GN=LOC111487144... [more]
A0A6J1DHX61.0e-4767.92uncharacterized protein LOC111020213 OS=Momordica charantia OX=3673 GN=LOC111020... [more]
M5XT021.1e-2257.14Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_1G555000 PE=4 SV=1[more]
A0A6J5WCG81.1e-2257.14Uncharacterized protein OS=Prunus armeniaca OX=36596 GN=CURHAP_LOCUS10400 PE=4 S... [more]
Match NameE-valueIdentityDescription
AT1G35430.11.4e-1750.50unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... [more]
AT4G09170.15.9e-0833.58unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
InterPro
Analysis Name: InterPro Annotations of Watermelon (97103) v2.5
Date Performed: 2022-01-31
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availablePANTHERPTHR36322TRANSMEMBRANE PROTEINcoord: 14..139
NoneNo IPR availablePANTHERPTHR36322:SF3TRANSMEMBRANE PROTEINcoord: 14..139

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cla97C11G215030.1Cla97C11G215030.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
cellular_component GO:0016020 membrane