CmaCh01G008600 (gene) Cucurbita maxima (Rimu) v1.1

Overview
Name: CmaCh01G008600
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu) v1.1)
Description: Transmembrane protein
Location: Cma_Chr01: 4850603 .. 4853395 (+)
RNA-Seq Expression: CmaCh01G008600
Synteny: CmaCh01G008600
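
The Location entry above can be read programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (a common genome-browser convention that this page does not state explicitly); `parse_location` is a hypothetical helper name, not part of any database API.

```python
import re

def parse_location(text):
    """Parse a location string such as 'Cma_Chr01: 4850603 .. 4853395 (+)'.

    Returns (chromosome, start, end, strand). Hypothetical helper for illustration.
    """
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

chrom, start, end, strand = parse_location("Cma_Chr01: 4850603 .. 4853395 (+)")

# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(chrom, strand, span)
```

Under that assumption the gene spans 2793 bp on the plus strand of Cma_Chr01.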
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exon, five_prime_UTR, CDS, polypeptide, three_prime_UTR
TTTGATGTGCCAAAACACATTTATTGAATAACCGATTCTTGAGGATTGGAGTGTTCTGGTTTACTTCAGACGCCGACAGTGAGTTCATCCGTATTCAATCCCATCTTCCATCGTATTTTTCTCCATACTTCGATTCTCCTCAAACCCCCTGTTCTCACCTCTCCCCATTGCCCACATAAATCTCCTCTACGCCTCTGGTTTTTGATACTCTGTTGTTGAATTCCAAGATTCAACGAATTTGGGTCAGGTTTTGATTCGGGGGGGTCGTTGGTTTTCTTTCTTTCTCACTGGGTTGGATTCCTTTTTCTACTACTTTTGTTTGATTTCTTGCCCTTTTTGTGTTTTGGCTGCTTGTTTCTGTATTCAACTTGTTTATTGTAGTTTTCCCCTTTTTCATTTTCAGTCCGTAGGTTGAGGAAATTTCTATTATTGGCCACTATGGAAGGAGCTGTGGGTGCTGAGTTTGAGGATTGGGAGGTGCTGCTCCATGATTCGAACGCCGAAACTCCCCTGACTGCCGCTGAGTTTTCCGGGGAGAAACCGACCCGTTTCGGCGGAGCTGAGGATGTGTCCGATTCCGAAAGCATGATCAAATCCGATTATTTCTCTCTTGATAATCAGGGACGGCGAGCGAAAACTGTTCCTGAGCGTGATCTTAGCGAAGAGGAGGGTTCGGTTAAATCCGATAATCCTAGTTGGATTGATCCGAGTTCGGAGAATCGGCATAGTCGGGTAATTTCGGGAGAATTGTGGTCCGATTCTGGTAGTGATAGGTCTGATGATCGTAAATTTAGCGAATTCAATTTGAAAACTGAGTCTGGAATTGCAGAATTTCTGCTAGGCGATGAGGAAATGAGCGGTAGGAATCGAAAATTGGAGAGTTTAGAATCACATGTTGGATTAGCTTTTGAAGAATCTGAAGAAATCCAACCCCAAAGCAAGGATTTGAACAATTTCTTGTCTGATTCTGGTGGGAATATAGATCAAAGTGGCTTGAAAGTTGGGAAGTTGGAAGAAGAAGAAGGCAAAGAACATTTGGAGGAGAACAAGAATCTCCAAATTGAAGAAACAAAAGTGAATGCAGAATCTGGCAGTGAGGTTGGAGATACAAGGAAAGTAGTTTGGTGGAAAGTCTCATTTGATGTCTTGAAGTATTGCATGTTTAAGGCAAGCCCTGTCTGGTCATTCTCATTAGCAGCTGCTGTAATGGGATTCATTATTCTTGGGAGGAAGCTCTACAAAATGAAGAGGAAGAGCCAGAGCTTGCACTTGAAGGTTATCCTGAATGAGAAGGTAAATTCACTAAGAATTTATACAACAAACAAATTAGATGATGAACTGTTGATACTGAAATTTTTTCTTAGGTTAACTAATTGCTTCACTGTGAAGAAATTCTGGAAATTGTTGTTTGTTGGGTTTATTGTGCCTGCATATTTGTTGTCATTTTACTTGGTATATTAACAATTTGAGTGAAAATTCAGTATTTTTCTTTATAGTGAAACACAAATCAAGTTTAATAGGCATGGCTCGAGTTTGTTGTTAGCTTATGTGAGTGGTGATTTGGTTGTTTAGGTCAGTAGGCTCGAGCTTGAGTTTGTTGTTAGCTTTTTGTGAGTAGTGATTTAGCTGTTTAGGTCAGCAGGCTTCGAGCTTGAGTTTGTTGTTAGCTTTTTGTGAGTAGTGATTTAGCTGTTTAGGTCAGCAGGCTTGAGCTTGAGTTTGTTGTTAGCTTATGTNAGTGGTGATTCGGCTGTTTAGGTCAGCAGGCTTGAGCTTGAGTTTGTTGTTAGCTTATGTGAGTGGTGATTCGGCTGTTTAGGTCAGCAGGCTCGAGCTTGAGTTTGTTGTTAGCTTTTTGTGAGTAGTGATTTAGCTGTTTAGGTCAGCAGGCTCGAGCTTGAGTTTGTTGTTAGCTTATGTGAGAGCTGATTCGGTTGTTTAGGCCAGTAGGCTCGATAGGCTTGGTGCTGGGGCAGTTCAGCTCCTTCTCATTGATGTGC
CAAAATGTGAACCAAGTTTAATACTGCTGCTGTTAGTTGGCTTGCTGTGAACACTTTCTTGTTGGACATTTATAGTTGGGGCATTTATTGCATGGTTTTCCTTTTGGGAACTTCCCCATTTATTCAAGTTTAGCATTCAATTAGATGAAAACGACACCTTGAGAAATTTGGGTAGGAGCTAGACTGCTTTATGTGGCTGATATTCTGCAACATAGGCAGCAATTCCCCCTCCCCCTTTTCAAATACTCTATCGGGATGTCGATTTTCGGCATCGTTTCAAAGTTGGGATTATATTTTTTGACCCTATGAACTAATTATTACCTGCAGAAGGGATCTCCACTCAAGAGCCGAGCTGCTCGTCTTAACGAAGCCTTTTCGATTGTGAGGCGTGTCCCAGTTGTTCGACCTGCTCTCCCTGGTGCCGGGATAAATCCATGGCCTGCAATGAGCATGAGTTGAGGATTCTTCCCACACGAGATGCGGATAAAGTAAAAGTGATGGTGTTTCCCTGTGTCTCTCTCTCTATCTAAAAGATGTAATATTCCACAGGTTTCTATCTATGTTATGCACCAATACAAGATGCTTTCAGCATCAGCTGTCTGTAAAGTGAATTCAGATAATGGCTTCTCTTTTCTTTTTTTCTTTCCCTTTGAGATATTCCATTTTGTGTTTATATTGATGAATGCATAGCTCAGGCAAGTAGTCCCATTATCGGGATCAAGCTTCACGTATAATATGCAACTGGGGGCCTAATCATAGTTGTACTTTGGGCTTTATTTTCATGTAGGGTTGG

mRNA sequence

TTTGATGTGCCAAAACACATTTATTGAATAACCGATTCTTGAGGATTGGAGTGTTCTGGTTTACTTCAGACGCCGACAGTGAGTTCATCCGTATTCAATCCCATCTTCCATCGTATTTTTCTCCATACTTCGATTCTCCTCAAACCCCCTGTTCTCACCTCTCCCCATTGCCCACATAAATCTCCTCTACGCCTCTGGTTTTTGATACTCTGTTGTTGAATTCCAAGATTCAACGAATTTGGGTCAGGTTTTGATTCGGGGGGGTCGTTGGTTTTCTTTCTTTCTCACTGGTCCGTAGGTTGAGGAAATTTCTATTATTGGCCACTATGGAAGGAGCTGTGGGTGCTGAGTTTGAGGATTGGGAGGTGCTGCTCCATGATTCGAACGCCGAAACTCCCCTGACTGCCGCTGAGTTTTCCGGGGAGAAACCGACCCGTTTCGGCGGAGCTGAGGATGTGTCCGATTCCGAAAGCATGATCAAATCCGATTATTTCTCTCTTGATAATCAGGGACGGCGAGCGAAAACTGTTCCTGAGCGTGATCTTAGCGAAGAGGAGGGTTCGGTTAAATCCGATAATCCTAGTTGGATTGATCCGAGTTCGGAGAATCGGCATAGTCGGGTAATTTCGGGAGAATTGTGGTCCGATTCTGGTAGTGATAGGTCTGATGATCGTAAATTTAGCGAATTCAATTTGAAAACTGAGTCTGGAATTGCAGAATTTCTGCTAGGCGATGAGGAAATGAGCGGTAGGAATCGAAAATTGGAGAGTTTAGAATCACATGTTGGATTAGCTTTTGAAGAATCTGAAGAAATCCAACCCCAAAGCAAGGATTTGAACAATTTCTTGTCTGATTCTGGTGGGAATATAGATCAAAGTGGCTTGAAAGTTGGGAAGTTGGAAGAAGAAGAAGGCAAAGAACATTTGGAGGAGAACAAGAATCTCCAAATTGAAGAAACAAAAGTGAATGCAGAATCTGGCAGTGAGGTTGGAGATACAAGGAAAGTAGTTTGGTGGAAAGTCTCATTTGATGTCTTGAAGTATTGCATGTTTAAGGCAAGCCCTGTCTGGTCATTCTCATTAGCAGCTGCTGTAATGGGATTCATTATTCTTGGGAGGAAGCTCTACAAAATGAAGAGGAAGAGCCAGAGCTTGCACTTGAAGGTTATCCTGAATGAGAAGAAGGGATCTCCACTCAAGAGCCGAGCTGCTCGTCTTAACGAAGCCTTTTCGATTGTGAGGCGTGTCCCAGTTGTTCGACCTGCTCTCCCTGGTGCCGGGATAAATCCATGGCCTGCAATGAGCATGAGTTGAGGATTCTTCCCACACGAGATGCGGATAAAGTAAAAGTGATGGTGTTTCCCTGTGTCTCTCTCTCTATCTAAAAGATGTAATATTCCACAGGTTTCTATCTATGTTATGCACCAATACAAGATGCTTTCAGCATCAGCTGTCTGTAAAGTGAATTCAGATAATGGCTTCTCTTTTCTTTTTTTCTTTCCCTTTGAGATATTCCATTTTGTGTTTATATTGATGAATGCATAGCTCAGGCAAGTAGTCCCATTATCGGGATCAAGCTTCACGTATAATATGCAACTGGGGGCCTAATCATAGTTGTACTTTGGGCTTTATTTTCATGTAGGGTTGG

Coding sequence (CDS)

ATGGAAGGAGCTGTGGGTGCTGAGTTTGAGGATTGGGAGGTGCTGCTCCATGATTCGAACGCCGAAACTCCCCTGACTGCCGCTGAGTTTTCCGGGGAGAAACCGACCCGTTTCGGCGGAGCTGAGGATGTGTCCGATTCCGAAAGCATGATCAAATCCGATTATTTCTCTCTTGATAATCAGGGACGGCGAGCGAAAACTGTTCCTGAGCGTGATCTTAGCGAAGAGGAGGGTTCGGTTAAATCCGATAATCCTAGTTGGATTGATCCGAGTTCGGAGAATCGGCATAGTCGGGTAATTTCGGGAGAATTGTGGTCCGATTCTGGTAGTGATAGGTCTGATGATCGTAAATTTAGCGAATTCAATTTGAAAACTGAGTCTGGAATTGCAGAATTTCTGCTAGGCGATGAGGAAATGAGCGGTAGGAATCGAAAATTGGAGAGTTTAGAATCACATGTTGGATTAGCTTTTGAAGAATCTGAAGAAATCCAACCCCAAAGCAAGGATTTGAACAATTTCTTGTCTGATTCTGGTGGGAATATAGATCAAAGTGGCTTGAAAGTTGGGAAGTTGGAAGAAGAAGAAGGCAAAGAACATTTGGAGGAGAACAAGAATCTCCAAATTGAAGAAACAAAAGTGAATGCAGAATCTGGCAGTGAGGTTGGAGATACAAGGAAAGTAGTTTGGTGGAAAGTCTCATTTGATGTCTTGAAGTATTGCATGTTTAAGGCAAGCCCTGTCTGGTCATTCTCATTAGCAGCTGCTGTAATGGGATTCATTATTCTTGGGAGGAAGCTCTACAAAATGAAGAGGAAGAGCCAGAGCTTGCACTTGAAGGTTATCCTGAATGAGAAGAAGGGATCTCCACTCAAGAGCCGAGCTGCTCGTCTTAACGAAGCCTTTTCGATTGTGAGGCGTGTCCCAGTTGTTCGACCTGCTCTCCCTGGTGCCGGGATAAATCCATGGCCTGCAATGAGCATGAGTTGA

Protein sequence

MEGAVGAEFEDWEVLLHDSNAETPLTAAEFSGEKPTRFGGAEDVSDSESMIKSDYFSLDNQGRRAKTVPERDLSEEEGSVKSDNPSWIDPSSENRHSRVISGELWSDSGSDRSDDRKFSEFNLKTESGIAEFLLGDEEMSGRNRKLESLESHVGLAFEESEEIQPQSKDLNNFLSDSGGNIDQSGLKVGKLEEEEGKEHLEENKNLQIEETKVNAESGSEVGDTRKVVWWKVSFDVLKYCMFKASPVWSFSLAAAVMGFIILGRKLYKMKRKSQSLHLKVILNEKKGSPLKSRAARLNEAFSIVRRVPVVRPALPGAGINPWPAMSMS
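
The protein sequence corresponds to a translation of the CDS. The sketch below illustrates this relationship, assuming the standard nuclear genetic code and reading frame 0; the `translate` function and the codon-table construction are illustrative and are not the pipeline used to build this annotation.

```python
# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a coding sequence in frame 0, stopping at the first stop codon.

    Codons containing ambiguous bases (for example N) are rendered as 'X'.
    """
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[pos:pos + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First eight codons and last six codons of the CDS shown above.
print(translate("ATGGAAGGAGCTGTGGGTGCTGAG"))  # start of the protein
print(translate("GCAATGAGCATGAGTTGA"))        # end of the protein, up to the TGA stop
```

Passing the full CDS string to `translate` should give a sequence that can be compared against the protein sequence listed above.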
Homology
BLAST of CmaCh01G008600 vs. TAIR 10
Match: AT4G13530.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G10080.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 166.4 bits (420), Expect = 3.9e-41
Identity = 116/329 (35.26%), Positives = 182/329 (55.32%), Query Frame = 0

Query: 1   MEGAVGAEFEDWEVLLHDSNAETPLTAAEFSGEKPTRFGGAEDVSD-SESMIKSDYFSLD 60
           MEG    E +DWE+L      ++  +  E    +       E++ D ++ MI+ D+FSL+
Sbjct: 1   MEG----EIQDWEIL------QSSRSTTEDDNSR-----SLEEIDDGTQGMIRFDHFSLE 60

Query: 61  NQGRRAKTVPERDLSEEEGSVKSDNPSWIDPSSENRHSRVISGELWSDSGSDRSDDRKFS 120
           NQ   ++     + ++E+GSV+S +P WI+PSS+  +      ELWSDS SDR DD++  
Sbjct: 61  NQSGLSRL----EANDEDGSVQSGSPGWIEPSSDVPYGPKHFSELWSDSSSDRLDDQRLV 120

Query: 121 EFNLKTESGIAEFLLGDEEMSGRNRKLESLESHVGLAFEESEEIQPQSKDLNNFLSDSGG 180
           + ++  E GI                     + VG+  E SE I   ++D++   SD   
Sbjct: 121 DDDVNNEMGIE-------------------RNEVGIV-EYSESI---AQDMDLISSDE-- 180

Query: 181 NIDQSGLKVGKLEEEEGKEHLEENKNLQIEETKVNAESGSEVGDTRKVVWWKVSFDVLKY 240
                        +EE   H  E +   +       +SG   G+ +  VWWK+  +VLKY
Sbjct: 181 ------------RKEESLLHPVEGEGNSV-SIDPGVKSGGGGGEEKGFVWWKIPIEVLKY 240

Query: 241 CMFKASPVWSFSLAAAVMGFIILGRKLYKMKRKSQSLHLKVILNEKKGSPLKSRAARLNE 300
           C+ K +P+WS S+AAA +GF++LGR+LY MK+K++SL LKV+L++KK   + + AAR NE
Sbjct: 241 CVLKINPIWSLSMAAAFVGFVMLGRRLYNMKKKTRSLQLKVLLDDKK---VANHAARWNE 269

Query: 301 AFSIVRRVPVVRPALPGA-GINPWPAMSM 328
           A S+V+RVP++RPALP + G+N W  MS+
Sbjct: 301 AISVVKRVPIIRPALPSSVGMNQWSMMSL 269
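
The Identity and Positives percentages in the HSP header are the reported counts divided by the alignment length. A minimal sketch that reproduces the figures for this HSP follows; `percent` is a hypothetical helper name.

```python
def percent(count, alignment_length):
    """Return count / alignment_length as a percentage string with two decimals."""
    return f"{100.0 * count / alignment_length:.2f}"

# Counts taken from the HSP header above (alignment length 329).
identity = percent(116, 329)
positives = percent(182, 329)
print(f"Identity = {identity}%, Positives = {positives}%")
```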

BLAST of CmaCh01G008600 vs. TAIR 10
Match: AT4G13530.2 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G10080.1); Has 70 Blast hits to 69 proteins in 14 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 70; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 164.5 bits (415), Expect = 1.5e-40
Identity = 115/329 (34.95%), Positives = 181/329 (55.02%), Query Frame = 0

Query: 1   MEGAVGAEFEDWEVLLHDSNAETPLTAAEFSGEKPTRFGGAEDVSD-SESMIKSDYFSLD 60
           MEG    E +DWE+L      ++  +  E    +       E++ D ++ MI+ D+FSL+
Sbjct: 1   MEG----EIQDWEIL------QSSRSTTEDDNSR-----SLEEIDDGTQGMIRFDHFSLE 60

Query: 61  NQGRRAKTVPERDLSEEEGSVKSDNPSWIDPSSENRHSRVISGELWSDSGSDRSDDRKFS 120
           NQ   ++     + ++E+GSV+S +P WI+PSS+  +      ELWSDS SDR DD++  
Sbjct: 61  NQSGLSRL----EANDEDGSVQSGSPGWIEPSSDVPYGPKHFSELWSDSSSDRLDDQRLV 120

Query: 121 EFNLKTESGIAEFLLGDEEMSGRNRKLESLESHVGLAFEESEEIQPQSKDLNNFLSDSGG 180
           + ++  E GI                     + VG+  E SE I   ++D++   SD   
Sbjct: 121 DDDVNNEMGIE-------------------RNEVGIV-EYSESI---AQDMDLISSDE-- 180

Query: 181 NIDQSGLKVGKLEEEEGKEHLEENKNLQIEETKVNAESGSEVGDTRKVVWWKVSFDVLKY 240
                        +EE   H  E +   +       +SG   G+ +  VWWK+  +VLKY
Sbjct: 181 ------------RKEESLLHPVEGEGNSV-SIDPGVKSGGGGGEEKGFVWWKIPIEVLKY 240

Query: 241 CMFKASPVWSFSLAAAVMGFIILGRKLYKMKRKSQSLHLKVILNEKKGSPLKSRAARLNE 300
           C+ K +P+WS S+AAA +GF++LGR+LY MK+K++SL LKV+L++K    + + AAR NE
Sbjct: 241 CVLKINPIWSLSMAAAFVGFVMLGRRLYNMKKKTRSLQLKVLLDDK----VANHAARWNE 268

Query: 301 AFSIVRRVPVVRPALPGA-GINPWPAMSM 328
           A S+V+RVP++RPALP + G+N W  MS+
Sbjct: 301 AISVVKRVPIIRPALPSSVGMNQWSMMSL 268

BLAST of CmaCh01G008600 vs. TAIR 10
Match: AT4G10080.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G13530.1); Has 120 Blast hits to 114 proteins in 21 species: Archae - 2; Bacteria - 4; Metazoa - 0; Fungi - 12; Plants - 100; Viruses - 0; Other Eukaryotes - 2 (source: NCBI BLink). )

HSP 1 Score: 127.9 bits (320), Expect = 1.5e-29
Identity = 113/334 (33.83%), Positives = 169/334 (50.60%), Query Frame = 0

Query: 10  EDWEVLLHDSNAE-----TPLTAAEFSGEKPTRFGGAEDVSDSESMIKSDYFSLDNQGRR 69
           +DWE+L H S+ E     T  T  E S         ++  S ++ +I+S YF        
Sbjct: 2   DDWELLHHGSDTESTDSITSETKLESSSVIDDGMILSDHFSATDRVIESGYFDSFRVDYG 61

Query: 70  AKTVPERDLSEEEGSVKSDNPSWIDPSSENRHSRVISGELWSDSGSDRSDDRKFSEFNLK 129
           ++ +   ++S + G  +       D    N       G   S++G     + + S+F   
Sbjct: 62  SECLNPGEVSVDSGLDQFSVSQSGDDCVRNEF-----GVYDSETGILGDGEVRLSDFEAA 121

Query: 130 TESGIAEFLLGDEEMSG--RNRKLESLESHV-GLAFEESEEIQPQSKDLNNFLSDSGGN- 189
            E  + E     E   G   + + E+LE  V G   E    ++   +D +   SD GGN 
Sbjct: 122 NEKYV-ESEAATELTGGTVSHYETENLEEFVDGRHGENESGVEEPIEDSSKLCSDLGGNE 181

Query: 190 --IDQSGLKVGKLEEEE-----GKEHLEENKNLQIEETKVNAESGSEVGDTRKVVWWKVS 249
                SG+  G+ E          E +E +    +E   V+  SG E G +R+ VWWK+ 
Sbjct: 182 LVSRDSGVVNGEKEVVSDSVVASSEVIEGSGGDTVEVGGVS--SGGE-GKSRETVWWKMP 241

Query: 250 FDVLKYCMFKASPVWSFSLAAAVMGFIILGRKLYKMKRKSQSLHLKVILNEKKGSPLKSR 309
           F +LKY +F+  PVWS S+AAAVMG ++LGR+LY MK+K+Q  HLKV +++KK S + S+
Sbjct: 242 FVLLKYSVFRIGPVWSVSMAAAVMGLVLLGRRLYNMKKKAQRFHLKVTIDDKKASRVMSQ 301

Query: 310 AARLNEAFSIVRRVPVVRPALPGAGINPWPAMSM 328
           AARLNE F+ VRRVPV+RPALP  G   WP +S+
Sbjct: 302 AARLNEVFTEVRRVPVIRPALPSPG--AWPVLSL 324

The following BLAST results are available for this feature:
Match Name    E-value   Identity  Description
AT4G13530.1   3.9e-41   35.26     unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT4G13530.2   1.5e-40   34.95     unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT4G10080.1   1.5e-29   33.83     unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
InterPro
Analysis Name: InterPro Annotations of Cucurbita maxima (Rimu) v1.1
Date Performed: 2021-10-25
IPR Term  IPR Description   Source       Source Term    Source Description    Alignment
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction   coord: 25..118
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction   coord: 80..107
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction   coord: 64..79
None      No IPR available  PANTHER      PTHR33646:SF6  TRANSMEMBRANE PROTEIN coord: 1..327
None      No IPR available  PANTHER      PTHR33646      GB|AAF00631.1         coord: 1..327

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmaCh01G008600.1  CmaCh01G008600.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
cellular_component  GO:0016021      integral component of membrane