CmaCh05G012530 (gene) Cucurbita maxima (Rimu) v1.1

Overview
NameCmaCh05G012530
Typegene
OrganismCucurbita maxima (Cucurbita maxima (Rimu) v1.1)
DescriptionProtein of unknown function (DUF1068)
LocationCma_Chr05: 9613553 .. 9618596 (-)
RNA-Seq ExpressionCmaCh05G012530
SyntenyCmaCh05G012530
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
ATTCAATACACCGACACCTGCAGAAGAAATCTTAGCAGAAGTTCTATTTTGTTCATTCATTCATTTCTTTGCTTTCGAACTTCCACTGCTGCAATTAGATCTCAGTTCATCGCTTTCTCAGTAACAACGCGGAAACAGAGAGGAAAAGCACGAAAGAATCAAAGCGGGGAAGATCAGGGCGAGAAATGTCACGCCGATCTGGGTCTTGCCTGAGGTGTTGTCTCGTGATTTTCGCTGTAGTTTCTGCTTTGGCTGTTTGTGGGCCGGCCTTGTATTGGCGATTCAAGAAGGCTTTGCTATTGGGAGATTCCAAAACCTCCTGTGCTCCCTGCATCTGCGATTGTCCGCCCCCCTTGTCCCTTTTGAAGATTGCTCCTGGTATTGTTTCTAATCTTCGTGTCTCAGTTCCAGCAAACGCAAACGATTTCTCTTGTGGGTTGCTTTGGATTTCTGTTTTTGAACACTTAGTTGTGAAATTTAGTGAGAATCATAGATACCCATTTGGCCATTTGAAGGATTTTGGTTGTTTGATAGATATGTTTCCATTGATGGTTTGGATTCTGAGCTGTTTCTTTCTTTGATTGTGTGTTTTTGTATATGGCTGTGGAAAACTTGACCCATCTTAGATTTGAATTTGTACTCCCTTACCGTTGGGCTTGAGGATTGTTTAGAGAGGAGTCCCATATCAGTTAATTAAGAGGTTGATCACAGGTTTGTAAGTAAGAAATATATCTCTATTGGTATGAGGCCTTTTGGGGAAACCAAAAGCAAAGCCACGAGAGCTTATATCAAAGTGGACAATATCGTGATTCCTAATATGGTATCAGAGTCATGCCTTTAACTTAACTATGTCAATAGAATTTTCAAATATCGAACAAAGTAGTTGTGAGTCTTGAAGGTGTAGTCAAAAGTGACTCAAGTGTTGAACAAAAGATGTACTTTGTTCGAGGGCTCAAGAGAAGGAGTTGAGCCTTGATTAAGGGGAGGCTGTTCGATGGCTCCATAGGCCTCAGAAGAGGCTCGGTGGTGTACTTTGTTCGAGGGGAGGGTGGTTGAGGATTGTTGGGAGAGGAGTCAGGGGTTGATCACGGGTTTATAAGTAAGAAATACATCTCCATTGGTATGAGACCTTTTGGGAAACCAAAAGCAAAGCCAGGAGAGCTTATGCTCGAAGTGGACAATATCGTGATTCCTAATATGGTATCAGAGACATGCCCTTTAACTTAGCTATGTTAGTAGAATTCTCAAATATCGAACAAAGAAGTTGTGAGCCTCGAAGTTGTAGTCAAAAGTGACTCAAGTGTCGAACAAAGGGTGTACTTTGTTTGAGGGCTCCAGAGAAGGAGTCGAGTCTCGATTAAGGGGAGACTGTTTGATGGCTCCATAGGACTCAGGGGAGGCTCTATGGTGTACTTTGTTCGAGGGGAGGATTGTTGAGGATTGTTGGGAGAGGAGTCCCACATCGGCTAATTAGGGGGTTGATCACGAGTTTATAAGTAAGGAGTACATCTCCATTGGTATGAGGCCTTTTGGGGAAACCAAAATCAAAGTCATGAGAGCTGATGCTCGAAGTGGACAATATCATACCATTATGGAGGTTTGTGATTCCTAACACTTAATGGAGCTAGATTATGCCCGATCTCAACTTAGAATCTTGGTTTTGTTGGAAAACTTTCAAGAATTTGTGCTTTTCTTCACCGGTGCCTTTCTGAGTAACACTACTAACATTCGGACTTCCTGTATTTGTTCTTTGTGGTTTTTAACTCGAGTATCTTAAGTGGCAGTCGATCTGAAATTATAAACTTGAGGATTGTTGGGAGAGAAGTCCCACATTAGCTAATTAAGGGGTTGATCATGAGTTTATAAGTAAGAAACACGATCTCTATTAGTATGAGGTCTTTTGGGGAAAGCAAAAGCAAAGCCATGAGAGTTTATGCTCAAAGTGGACAATATCATACCGTTGTGGAGATTCATGATTCCAAACATCTGTTTCCTTGGCATTTGATCCACTTGTTATTTGCTTCAGTTGTAGATAGGCTAAAACTTGTATCATTTAATGAAATGGTAATGTCAAGCAAGTTGATGATCTTTGCATCAATACCCTTGCCCTGGGCTATGGCCAGCATCAGTCAATACCACTGAAAAAATTCTCATTCACACCCACAGACTCAACCAACCTGTTGGGCAAGCAGTCCTATTGAAACCAAGAAATATAGGTCGGCCAGTCATCCACACTAGAATAAATGAAGTTTTCCAAAAAAAACTTGCTAGTAGATTCCCAATACATCTATTCAATTAAGCGAATCTTGTTTGGTATGGATAAGCAAGCATGGACAGGCTCCTTGAGAGGCCCAATTCAGGAAGACAGAAATCGTCAGTTGGGTTGGTCCAACCAAAAAAGGCATGGGAATGGGCATTGCTTAGGCCGAGTTAAGTAAAACCGGCCTATCTAGTGATTTACAACGACCCCTCACCGCACACTTTCTATATACCTAGTATATAGACGGCATATAAATTGCTCAAACATTATTTACCAAATGCAGAGTCTTTTTTTATTAAATTTATGTCTTCCTCTCTTAGATGGACTACATTTGATTTCATGTATAGTAGACTTGATTTGTGTTCTTAACAGATTCATTGTTACTTTCTCACTAACAGTCTTTTTTGTTCTTGATAATTCTTTTGTAGGTCTGTCCAATCTCTCTGTCACAGGTTAGAATTCATCCCTCCAAATCAACTTGGCAATGAACAAAATCATGATTCTTTACTTGTTTTACTACCATTTGAATGCCTGTTAAAACTTATTTGTGCTCTCTATTTTTCTGGAGAAATATATGCAACTTTCATCACATTTCTCTTACATCATCCATTTGTTTGTTTCGGGTCCGATGTTTTCTTGTTCTGCTTTGTTCTACGATGGATCGGAACTCTTAAAAAATTCTGCATTCTACACTTCATTAAAGGGATTCTGCCAACTATTTTCTTCTAGTCCTTGATGTTCTTTGCCATAATATGTGCACACAAGATAGGAATCATCATATGTGAGATCCCACGTTGGTTGGAGAGGGAATGAAGTATTTCTTATAAGGGTGTCAGCGGGTGGTGTACCAGCGAGGATGCTGGGCTCCCAAGGGGGTGGATTGTGAAATCCCACCTCGGTTGAAGAGGAGAACAAACATACCTTATAAGAGTGTCAAAGCTAGTTATCGAGTGGCGTGCCAGCGAGGATGCTGGACTCCCAAGGGAGTGGATTGTGAGATCCCATATCAATTGGAGAGGGGAACAAAACATTCCTTATAAGGGTGTCAGAGCCAATCACTGGGCAGTGTACCAATGAGGACGTTGGGCCCCCGAGGGAGGTGGATTGTAAGATCTCACATTGGTTGGAGAAGGAAACGAAACATTCCTTATAAGGGTTTCAAAACCAGTCACCGTGTAGTGTGCCAGCGAGGGTGCTGGGTCCCCAAGGGGGTGGATTGTGAAATCCCTCTTCGGTTGGAGAAGGAAACGAAACATTCCTCTGGGAGGTGTGCTATTGAGGACGCTGGGGCGCCAAGGGGGTGGATTGTGAGATCCCACATCAGTTGGAGAGGGAAACAAAGCATTCCTTATAAGGGTATCAAAGCCAGTCACCAGGCGGTATGCCAGCGAGGACGCTGGGCCCCAAAGAGAGTGGATTGTGAGATTCCACATCAGTTGGAAAGGGGAACGAAACATTCCTTATAAGGGTGTCAGAGTCAATCACCGGGCGGTGTGCCAATGAGGACGTTGACCCCCAATGGGGGTGGATTGTGAGATCCCACATCAGTTGCAGAGAGGAACAAAACATTCCTTGTAAGGGTGTCAGAGCCAGTCATTAGGCGGTGTGCCAGCGAGGATGCTGGGCCCTCAAAGGGGGTGGATTATGAGATCCCACATCGGTTGGAGAAAGGAACAAAACATTCTTTATAAGGGTATAGAAACCTCTCCCTAGTAGGCACGTTTTAATACTATGCGGTTAACGACGATACTTAACGAGCCAAAGCAGACAATATTTGCTAACGGTGGGCTTGAGCTATTACATCATATGTCTTAATATCTCTCTTCCAATATTTTGTTGAGTCCAGGGACGACTCAACTCTCGTTCCCCCCGTTTCTACCTTGATTATATGGGCCGTGTTTTCTTCACTTGTTGATTGGAACTTTTAGCTACATATTTCTGGAATTTGAAAGTTTGAAACTTATCTTGTTTAAGTTTGAAACTTCTATATGAAAGTATTGCGAGATTTCAGCTAGTTATTTATACTCTTTCACATCATCTGCTTTATATTTTCTTTGACATTCTTCAATGCACATGCTCACTTCTTTATACATTATAGACTGTGGGAGTAACGACCCAGATCTCAAGCAGGAGATGGAAAAACAATTTGTGGATCTTTTAACAGAGGAATTGAAGCTTCAAGAAGCAGTTTCTGGCGAACATACTCGCCATATGAACATAACGTTATTCGAGGCGAAAAGGGTAGCTTCTCAGTACCAGAGGGAGGCTGAAAAGTGCATTGCTGCAACCGAAACTTGTGAAGAGGCCCGAGAACGCGCTGAGGCTTTGACGATCAAGGAAAGGAAGCTTACGTTGTTGTGGGAGCGACGAGCCCACCAAATGGGTTGGGTCGAGAAATAAGTCTATAATTTCAAAGGCCACCCTCTTAGTTCAACTAGTTGTCGAGTCCAGTCGAGTAGAAGCTGGCAAGCGAATTGCTTGGTTCATCAATCTTTCAAGGCATGATGTTCTCTCTTTTCTTCTGTTGCACTTTTCCTGGAAACATGTCGATTCTAGATCATAATTCATACATGACATGTTACAGACCCCGTTCTTCTGCATTTGCTATATATATGTGTGATCGTCTCCTCTCGAATAGAAAAAAACTATGTTGTTGTTCTAAATTTTGGCACCAGTTTGGTTCCCTGTAAGTGATGTTAGTAAAATCTGGTGGTAGCCATATTGGCCATTTTCAGCTTGCTGTCTGCTTTAAATAACAGTGGATATACGCCTAAACGAACCATT

mRNA sequence

ATTCAATACACCGACACCTGCAGAAGAAATCTTAGCAGAAGTTCTATTTTGTTCATTCATTCATTTCTTTGCTTTCGAACTTCCACTGCTGCAATTAGATCTCAGTTCATCGCTTTCTCAGTAACAACGCGGAAACAGAGAGGAAAAGCACGAAAGAATCAAAGCGGGGAAGATCAGGGCGAGAAATGTCACGCCGATCTGGGTCTTGCCTGAGGTGTTGTCTCGTGATTTTCGCTGTAGTTTCTGCTTTGGCTGTTTGTGGGCCGGCCTTGTATTGGCGATTCAAGAAGGCTTTGCTATTGGGAGATTCCAAAACCTCCTGTGCTCCCTGCATCTGCGATTGTCCGCCCCCCTTGTCCCTTTTGAAGATTGCTCCTGGTCTGTCCAATCTCTCTGTCACAGACTGTGGGAGTAACGACCCAGATCTCAAGCAGGAGATGGAAAAACAATTTGTGGATCTTTTAACAGAGGAATTGAAGCTTCAAGAAGCAGTTTCTGGCGAACATACTCGCCATATGAACATAACGTTATTCGAGGCGAAAAGGGTAGCTTCTCAGTACCAGAGGGAGGCTGAAAAGTGCATTGCTGCAACCGAAACTTGTGAAGAGGCCCGAGAACGCGCTGAGGCTTTGACGATCAAGGAAAGGAAGCTTACGTTGTTGTGGGAGCGACGAGCCCACCAAATGGGTTGGGTCGAGAAATAAGTCTATAATTTCAAAGGCCACCCTCTTAGTTCAACTAGTTGTCGAGTCCAGTCGAGTAGAAGCTGGCAAGCGAATTGCTTGGTTCATCAATCTTTCAAGGCATGATGTTCTCTCTTTTCTTCTGTTGCACTTTTCCTGGAAACATGTCGATTCTAGATCATAATTCATACATGACATGTTACAGACCCCGTTCTTCTGCATTTGCTATATATATGTGTGATCGTCTCCTCTCGAATAGAAAAAAACTATGTTGTTGTTCTAAATTTTGGCACCAGTTTGGTTCCCTGTAAGTGATGTTAGTAAAATCTGGTGGTAGCCATATTGGCCATTTTCAGCTTGCTGTCTGCTTTAAATAACAGTGGATATACGCCTAAACGAACCATT

Coding sequence (CDS)

ATGTCACGCCGATCTGGGTCTTGCCTGAGGTGTTGTCTCGTGATTTTCGCTGTAGTTTCTGCTTTGGCTGTTTGTGGGCCGGCCTTGTATTGGCGATTCAAGAAGGCTTTGCTATTGGGAGATTCCAAAACCTCCTGTGCTCCCTGCATCTGCGATTGTCCGCCCCCCTTGTCCCTTTTGAAGATTGCTCCTGGTCTGTCCAATCTCTCTGTCACAGACTGTGGGAGTAACGACCCAGATCTCAAGCAGGAGATGGAAAAACAATTTGTGGATCTTTTAACAGAGGAATTGAAGCTTCAAGAAGCAGTTTCTGGCGAACATACTCGCCATATGAACATAACGTTATTCGAGGCGAAAAGGGTAGCTTCTCAGTACCAGAGGGAGGCTGAAAAGTGCATTGCTGCAACCGAAACTTGTGAAGAGGCCCGAGAACGCGCTGAGGCTTTGACGATCAAGGAAAGGAAGCTTACGTTGTTGTGGGAGCGACGAGCCCACCAAATGGGTTGGGTCGAGAAATAA

Protein sequence

MSRRSGSCLRCCLVIFAVVSALAVCGPALYWRFKKALLLGDSKTSCAPCICDCPPPLSLLKIAPGLSNLSVTDCGSNDPDLKQEMEKQFVDLLTEELKLQEAVSGEHTRHMNITLFEAKRVASQYQREAEKCIAATETCEEARERAEALTIKERKLTLLWERRAHQMGWVEK
Homology
BLAST of CmaCh05G012530 vs. TAIR 10
Match: AT4G30996.1 (Protein of unknown function (DUF1068) )

HSP 1 Score: 261.2 bits (666), Expect = 6.1e-70
Identity = 131/170 (77.06%), Postives = 146/170 (85.88%), Query Frame = 0

Query: 1   MSRRSGSCLRCCLVIFAVVSALAVCGPALYWRFKKALLLGDSKTS-CAPCICDCPPPLSL 60
           M RRSG C+R CLVIFAVVSAL VCGPALYW+F K  +      S C PC+CDCPPPLSL
Sbjct: 1   MPRRSGDCMR-CLVIFAVVSALVVCGPALYWKFNKGFVGSTRANSLCPPCVCDCPPPLSL 60

Query: 61  LKIAPGLSNLSVTDCGSNDPDLKQEMEKQFVDLLTEELKLQEAVSGEHTRHMNITLFEAK 120
           L+IAPGL+NLS+TDCGS+DP+LKQEMEKQFVDLLTEELKLQEAV+ EH+RHMN+TL EAK
Sbjct: 61  LQIAPGLANLSITDCGSDDPELKQEMEKQFVDLLTEELKLQEAVADEHSRHMNVTLAEAK 120

Query: 121 RVASQYQREAEKCIAATETCEEARERAEALTIKERKLTLLWERRAHQMGW 170
           RVASQYQ+EAEKC AATE CE ARERAEAL IKERK+T LWE+RA Q GW
Sbjct: 121 RVASQYQKEAEKCNAATEICESARERAEALLIKERKITSLWEKRARQSGW 169

BLAST of CmaCh05G012530 vs. TAIR 10
Match: AT2G24290.1 (Protein of unknown function (DUF1068) )

HSP 1 Score: 249.6 bits (636), Expect = 1.8e-66
Identity = 124/171 (72.51%), Postives = 146/171 (85.38%), Query Frame = 0

Query: 1   MSRRSGSCLRCCLVIFAVVSALAVCGPALYWRFKKALLLGDSKTS--CAPCICDCPPPLS 60
           M+RRSG+C+R CLVIF+VVSAL VCGPALYW+  K  +     T+  C PC+CD PPPLS
Sbjct: 1   MARRSGNCMR-CLVIFSVVSALLVCGPALYWKLNKGFVGSARSTNSICPPCVCDFPPPLS 60

Query: 61  LLKIAPGLSNLSVTDCGSNDPDLKQEMEKQFVDLLTEELKLQEAVSGEHTRHMNITLFEA 120
           LL+IAPGL+NLS+T CGS+DP+LK+EMEK FVDLLTEELKLQEAV+ EH+RHMN+TL EA
Sbjct: 61  LLQIAPGLANLSITGCGSDDPELKEEMEKPFVDLLTEELKLQEAVADEHSRHMNVTLAEA 120

Query: 121 KRVASQYQREAEKCIAATETCEEARERAEALTIKERKLTLLWERRAHQMGW 170
           KRVASQYQ+EAEKC AATE CE ARERA+AL +KERK+T LWERRA Q+GW
Sbjct: 121 KRVASQYQKEAEKCNAATEICESARERAQALLLKERKITFLWERRARQLGW 170

BLAST of CmaCh05G012530 vs. TAIR 10
Match: AT2G32580.1 (Protein of unknown function (DUF1068) )

HSP 1 Score: 141.7 bits (356), Expect = 5.4e-34
Identity = 75/163 (46.01%), Postives = 107/163 (65.64%), Query Frame = 0

Query: 7   SCLRCCLVIFAVVSALAVCGPALYWRFKKALLLGDSKTSCAPCICDCPPPLSLLKIAPGL 66
           + L+  L + A+     + GP LYW   +AL +  S TSC+ C+CDC   L LL I  GL
Sbjct: 6   AALKVGLALLALSMIGYILGPPLYWHLTEALAV--SATSCSACVCDC-SSLPLLTIPTGL 65

Query: 67  SNLSVTDCGSNDPDLKQEMEKQFVDLLTEELKLQEAVSGEHTRHMNITLFEAKRVASQYQ 126
           SN S TDC   DP++ ++ EK + +LLTEELK +EA S E  + ++  L EAK++ S YQ
Sbjct: 66  SNGSFTDCAKRDPEVNEDTEKNYAELLTEELKQREAASMEKHKRVDTGLLEAKKITSSYQ 125

Query: 127 REAEKCIAATETCEEARERAEALTIKERKLTLLWERRAHQMGW 170
           +EA+KC +  ETCEEARE+AE   ++++KLT +WE+RA Q G+
Sbjct: 126 KEADKCNSGMETCEEAREKAEKALVEQKKLTSMWEQRARQKGY 165

BLAST of CmaCh05G012530 vs. TAIR 10
Match: AT1G05070.1 (Protein of unknown function (DUF1068) )

HSP 1 Score: 132.5 bits (332), Expect = 3.2e-31
Identity = 74/168 (44.05%), Postives = 102/168 (60.71%), Query Frame = 0

Query: 4   RSGSCLRCCLVIFAVVSALAVCGPALYWRFKKALLLGDSKTSCAPCICDCPPPLSLLKIA 63
           R  + L+  L +  +  A  + GP LYW   +A L   S +SC  C C+C    S + I 
Sbjct: 3   RHTAALKIGLALLGLSMAGYILGPPLYWHLTEA-LAAVSASSCPSCPCEC-STYSAVTIP 62

Query: 64  PGLSNLSVTDCGSNDPDLKQEMEKQFVDLLTEELKLQEAVSGEHTRHMNITLFEAKRVAS 123
             LSN S  DC  +DP++ ++ EK + +LLTEELKL+EA S E  +  ++ L EAK+V S
Sbjct: 63  KELSNASFADCAKHDPEVNEDTEKNYAELLTEELKLREAESLEKHKRADMGLLEAKKVTS 122

Query: 124 QYQREAEKCIAATETCEEARERAEALTIKERKLTLLWERRAHQMGWVE 172
            YQ+EA+KC +  ETCEEARE+AE    +++KLT  WE RA Q GW E
Sbjct: 123 SYQKEADKCNSGMETCEEAREKAELALAEQKKLTSRWEERARQKGWRE 168

BLAST of CmaCh05G012530 vs. TAIR 10
Match: AT4G04360.1 (Protein of unknown function (DUF1068) )

HSP 1 Score: 122.1 bits (305), Expect = 4.4e-28
Identity = 71/172 (41.28%), Postives = 101/172 (58.72%), Query Frame = 0

Query: 1   MSRRSGSCLRCCLVIFAVVSALAVCGPALYWRFKKALLLGDS-KTSCAPCICDCPPPLSL 60
           M+RR     +   V+  +     + GP+LYW   +   + DS  +SC PC+CDC     L
Sbjct: 1   MTRRQKKTAKVVTVVMGLCIVAYIAGPSLYWHLNET--IADSLHSSCPPCVCDCSSQ-PL 60

Query: 61  LKIAPGLSNLSVTDCGSNDPDLKQEMEKQFVDLLTEELKLQEAVSGEHTRHMNITLFEAK 120
           L I  GLSN S  DC  ++ +  +E E  F +++ EELKL+EA + E     +  L +AK
Sbjct: 61  LSIPDGLSNHSFLDCMRHE-EGSEESESSFTEMVAEELKLREAQAQEDEWRADRLLLDAK 120

Query: 121 RVASQYQREAEKCIAATETCEEARERAEALTIKERKLTLLWERRAHQMGWVE 172
           + ASQYQ+EA+KC    ETCE ARE+AEA   ++R+L+ +WE RA Q GW E
Sbjct: 121 KAASQYQKEADKCSMGMETCELAREKAEAALDEQRRLSYMWELRARQGGWKE 168

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
AT4G30996.16.1e-7077.06Protein of unknown function (DUF1068) [more]
AT2G24290.11.8e-6672.51Protein of unknown function (DUF1068) [more]
AT2G32580.15.4e-3446.01Protein of unknown function (DUF1068) [more]
AT1G05070.13.2e-3144.05Protein of unknown function (DUF1068) [more]
AT4G04360.14.4e-2841.28Protein of unknown function (DUF1068) [more]
InterPro
Analysis Name: InterPro Annotations of Cucurbita maxima (Rimu) v1.1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableCOILSCoilCoilcoord: 132..152
NoneNo IPR availablePANTHERPTHR32254:SF3EXPRESSED PROTEIN-RELATEDcoord: 1..171
NoneNo IPR availablePANTHERPTHR32254EXPRESSED PROTEINcoord: 1..171
IPR010471Protein of unknown function DUF1068PFAMPF06364DUF1068coord: 7..171
e-value: 1.6E-71
score: 239.6

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CmaCh05G012530.1CmaCh05G012530.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
cellular_component GO:0016021 integral component of membrane