Sgr017671 (gene) Monk fruit (Qingpiguo) v1

Overview
Name: Sgr017671
Type: gene
Organism: Siraitia grosvenorii (Monk fruit (Qingpiguo) v1)
Description: Unknown protein
Location: tig00153054: 647788 .. 653593 (+)
RNA-Seq Expression: Sgr017671
Synteny: Sgr017671
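The Location field above can be turned into a contig, coordinates, and strand with a few lines of Python. This is a minimal sketch, not part of the database record: the function name `parse_location` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which the page does not state.

```python
import re

def parse_location(text):
    """Parse a location string such as 'tig00153054: 647788 .. 653593 (+)'.

    Returns (contig, start, end, strand). The field layout is inferred
    from the single example on this page (assumption).
    """
    pattern = r"(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)"
    match = re.fullmatch(pattern, text.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    contig, start, end, strand = match.groups()
    return contig, int(start), int(end), strand

contig, start, end, strand = parse_location("tig00153054: 647788 .. 653593 (+)")

# Assuming 1-based inclusive coordinates, the gene span in base pairs is:
span = end - start + 1
print(contig, strand, span)
```

Under that assumption the gene spans 5806 bp on the plus strand of contig tig00153054.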
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTCTTCTCCTGTACTCAACAAAAGCAGAGAGAAACAGGGCTGCAATGTTGAGTGTCAAGCCCACGCTCATCTTTTGGAGAAGAGTTATGCCGCTTTCTTTTCCAGTGAGTTTTCGGAGAGCCGGTACGAGGATTCTGTCGTAGATCGGTACCCAAATGGTGACGCCGATCATCGTAAAGATGTAGAAGGAGGCCGTTGGAATCTTGAAATTGGAGTGGCCAAGACGGCGGTCGGATTGAAGAGCTTGAAACACTCCATAAGTTTGATGGACTATGGGAACGTAGTAAATGATACTGGTAGACCATATGGGAATGATCCTTAATATGCATTTCAGTTCTTCAACTTGCTGAATGCTGCAAATCCGCCATGGATGAGCAGCTGATCCATCAGGATTGATTTGGTCTCCCGGAGTTATAATTGCAGCCTTGTCGAGGAACCTACCAATCAAGAACACTAATGAAAAGGGATGTTGCTTATGAAATCAAAAATGGGTTGGTTAGTTCTTATTTTCAAATGGTTTTTGATTACCTAAACTGCTTGCTGTGATAAAGCTTGGAATTAATGGAGTTGGTAGGAACACAGTTGAAAAGAGAAGCCATTGGTGGTTGTTCAGTCATCTTCAACTTTCGCTTCTTAAATGAAACTACAACAACTTTCACAACACTCGTCAAAGGACTGCCAAATAGTTTCACTCTGACGTAAATTCTCGTCCCGAGGAAGAAGAACATACAGGAAAAGAACATACAAAACGCAGGAATGGCCAAACCCCAACCCCAGCTGGTATCAGATTGGACGTAGACTATGGCCGTAAGAGAAATCATCACTGCGAAAGTGTAGGTGAAATAGTACCAATTGAAGAAGCTGCTGAGTCCTCTTTTGCCGGAATCTGTGTTGGGGTTGAATTGGTCGGCTCCAAGGGCAAGGTTGCAGGGCCTGATGCCACCGGCTCCGATGACCAGAAGTCCAAGTCCGCCGATCAGAAAAGCCATCTGCCATGGCGTTGGGCCGTGGCAGTCGCTGGCGTCGGCCGCCGATTGTCTGCACTGTGGAGGATGCAGTTTTGGAATTGCAGCAGTGAGTGTTATTGTTGTCATACCCTGCGACGAGTCAAATCGTATCATATCAAGGCCATGTTGGAAGAATTTGGCATGGCAAGAAATTACTTTTTCCTTGTCAAAATCACTTGATAAAGATGTTTGCCAAATGTTGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGGTGCGAGAAGGGAAGCGAGTGAGCAAATCCGAGAGTGTAGTAACGGCCAAAGTAAGTGTCGGAAAGGAAAGCTCCGATGAAGGTGGCGAAGTTGCAGGTTCCATTAAGAGGTTCGCAACATTGGTTGCAGTTATGGCATTCATGTGAAACACAGCAGTCAAGTAAACCAGCAGATTTGACGCTGTCCCCGTCGTTCCCAGTTTCTCAAATATCTCATTCCTGCACACCATTATTAATATAATTAAAAATATAATTTCCTCCTTCACCCACCAACATAAACACACACAGTCGTCCACAGAGACACAAATTAAATCAGATCAGGAAACAAACCTATAATGAAGGGCATGGCTCTGATCCCTCGATACTTTATCATTGCCTGCTGATGATCCTCATCAGAAACAGCAGCACCATTCTCAGCGTTGTCCATATGGCTGCTGATCTTCTTCTTCTTCTTCTTCTTTTCTGGGTTTTCTGTTCGTCAATGATATCAGCAAAACTAGCAAAGTAGTGTGAGATCCGTGGATATGCATATGAGACGACTACGTCTCATTATAAATATTGTGAAGAAGACACGGGGGCTGGCTGTGATCTTGCTTGTAATTGGTTGCTTTGGCATGTTGGCTCGATCTAAGTGGGGGACATTTTTTGTTGTGTTAAGTTCAATGAAAGCCATGGATGAGATAGAAAACAAAAAATATATAAAGCTAAAAATCATCAAATAAGCTTTCACTAATGGTTGTTTGGAATGACATC
ACCACACTGAAACCCCATTCATTTGCATAATAGAAAGCGATAACTGTGAAGTGTGAACCATTTTAATTTGATGCCAATGTGATATTTTAATTGTTGAAGGTTCACTTTGTACTGGTCAAGTTATATATCAACTTAATTAAGGTGAAAGTTTAGCTTCTAAAGATAGTTAATTAATTAGAAAGAAAAAATACACGATTTTTAAAGATGACAACTAGATTACCTATAAACTTCAACATTCATTTGTTTGTTGGGAAGCTACTCGAGCTAACACACTTCATGTCAACATTGCCGACCCTATCAAAAAAAGCTGCACACTGGGTGTGATGCTTGTCAAACATCTTCTTATATTTAAGTTAGGTCAAAGCTTGAGTGAAAGTAGAGATGAGGAAGGAGAAACTTTTGCATACTTTAATAGTGAGTAAGACTTTCTAGCGGAGCAATTTTTTACTCTACCATATCTCATCGTATGTACAGGCTTAGTCCATCGTTAGTGCACTCTCTCATCACTATAGGTACCTACTTTGTCACAAAAGTGACATTTCTTTAAGATGCATGCCACCTGCTCATTGGCTGACTAAAGGCAAGTGATCTTCCAAAATAAGTCTAATTACCCAGCGACATAAATGTGTTTATCCTACTTTTATTTTATCGAGCTAGGCATGGTCGACCTACCTCACTCTACAAATTTTGTACCTACTACGTGGGATGCAATGAGATGTTCGAGAATTTCATCCATAACAATAACAATAACATTCACTATTCCAATTTTAATTCTTATTTCACCCGACGATTAAAATCTATGCCATAAAAGTAAGGAGTTTTAATGGACCAAACTCTATTTGGCTTCAAAGTTACGTTCAATATAGAATGATAATAAAGTTTTTATGTGCGTCAACTGTGGTATATGCTTATACACGTCTCTGCAAACGTTAGCCACAGAATTAAATCACTAATTAATCAACACAAAACTTAATTAATTAGCCACCGTGTAATCTAAGTGTAATACTTGTGATGATTGTTCATTAGGATTAAAGTGAGGTTTGGAGTAACATCAATACAATTATTCAAATGCACTAGCTGGAAGAGACCATAATTCTTTTCAAAGAAAAAAGAAAAAAAAAAAAGAGACCATAATTATTCAAATGCACTAATTTTGAATTCGAGCATATAGCCCAATAGAGAAAATTATGTTCGAATAGATAAAATACCTATTACCTTTATTGAGTTGAAGGTTCAATCTCCCACCCTCATATTTGTTGAACTAAAAAAAAAAATTATAAGCCTACAACATTTTACATATTTGTATTTTTTTAGTATTAATAATCTATTTTATTTAGAGAGAGTTTGTATATAGAGAATAAGTCGAAATATACTAATTGTTATAATTGTGATTATTTCATAGTTTATTTTATTATTAACAGCCCACTATATTGTGACAGAGAGATTTTTTACAAAAGGGTTAGTTCAATTATTAATAATTTATTGAGGGAGAGAGAGGTGGAAGGGTTTTGTAATATTAATAGTTATTTTTAAAAAATTAATAATCTAATTTATTGAGAGAGAGTTGAAATTATAGATTTTATAATATTAATAGTTTTTTTTTTGTTCAATAATGTGGGGGTGAGAATTCAAATATACCTTTAAAACGATGTATAATGCTTTAACTAGTTGAGTTATGCTCGCATAATTAATAATATTAATGGTTATTAATGTAAAATTAATAACTCAATTTATTGATTGATAGGGTTGAATAGCTGTTTTAAAACTAATTTTCCAATTTATTGAGAGAGTTAGAAGGGTTTTATGATATTAATTGTAACTTTTTAATATTAACAACTCAATTTATTGTGAGAGAGCCACATATCATAATGAATGATAAATCAAATATTAAGTAAGAATTATGAAAGATAATGTCTTGACTAGGGATCTTTTTTTTTTTTTTTTTTTTTTGAGTAGAGTCTTGACTAGTTTGTTTATTTTTTTTGAGTTCAATAAGTATT
AGGGTGAAGAATTGAACCTTCAACTTTGAGAAAAATAATTGGTGTCTTATCTACTGAGCTATGTTTAGATTGACGACTAATTTGTTTAGATAAATGAACTTTTATATATATTATAGATTTTAAAAATGTGTTATGATATATATTTCATATTTTATAAAATAAAATTGCAAGGTATTTTAAGCTATTATACCACTACATATTTTATTGAAAAATTGTAGTGGATGATAAAATTAGTTGTCATATTTGCAAATCTGTCATAGAGAAAAAGAATTTGCAAATATGACAATTTTTTTAAATTTCGCGCGCGTCTATTTAAACAATTTTACGTAATTCTCTTTACTCTCATATATCACTGATATAACGTTTTTATAGTATATCACTGATATTCCGCTTTTATAGTATGCCACTGATATACCGCTTTTATAGTATGCCACTGATATACCGCTTTTATAGTATGTCATTGATATACCGCTTTATAGTATGTCACTGATATACCGCTTTTATAGTATATCAATGATTTACTATTTGGTTATCAGTGATTCACTTTTATTTATTATTGATACATGGCTATCGTTGATTACTCTTTTTCTAATTGATATATGATTATCAACTTATAATTGAATCTTGTTTTTCAAATTTTATTTTATATTTTGGCCAATCAAACACACATTTCACAAGGTATATTAGCAAAAGACAATGTAAAGGAAAATGTACATCTTTCACAATGCGTATATCAACAAGAAATTCGAAGATGAATCTATTAGCAATGTAGCGTAAAAAAAATCAAAGAAAATAGAAGGTAAAAGGAGACAATAGAGTTATGAAATGAGAAATTGAAAAAATGGAAAGAAATTAGGGTAAATGACAATAGATAAATCACAAGAATACGCGAGAAATTAGGGTAAAAATCGAGAGAGAAATAACAATAAATTGGGTAAATGTGTTTAAGGAGAAGGGAACGTTCTGTGAGACGTTAAAATCAATGATATACTATATCACTGATACCTATATATAATATATCATTGATATACCATTTTTGAATCATCGATAAAATATATCACTGATATATCTAATATAAAATATATTACTGATAATATTTAAGATTATTTTAGATTTTTTACATATTCTGTTTGCACTATTTTTGTAATACGTGTGTTAAATTTTCGATTTATTATTATTATTTTTTGTCAGATCTGTAAGAGTCCCTATTTAATTTCTCATTTTAATCGGCGCATTGTACAAATATAATATTATTATAGTGTAGAACCACATATTAATTTTCAATAATTTTGTTACATTATAAAGATTTTTCATTAGGGTTGAGTTTTTCAATCAGATTTCCAATGTTTGTTTTATTTTGTTTTTACCGATTATGAAAAAGTAAAAGAAAAAAAAAATCAATTTTTACCTCTAAATTTGACTATAAATATAACATATACTAAAGTTAGCCTTAATTTTATTGTTTGAGATATAAATTAATTGTAAAAAACTGCTGGGGAGTATTTTTCTTAATTTTATTCCCTCTGGAGAGTAAAAAAAAATTGACTTCTTTTTTTTAAAAAGTTCTTACAATTTATTGCTTGAGATATAAATTAATTGTAAAAAACTGCTCAGTCTCTTGCTCTTGAAAAACAAAACCCTAATTTCACTGTACAAAGCGCTAAGCTATATATTGCCATGGCAGTCACCGTTTCTTCTCAAGCATTCAACATTCGGCATCCTTCGTCTCCTTCTTCTCAGGCAGATCAATCCGATGGTCGTTATCCGTCGCGTTCTCCGACGGAAGTTCTTGTCTCAGTTTATTAG

mRNA sequence

ATGTCTTCTCCTGTACTCAACAAAAGCAGAGAGAAACAGGGCTGCAATGTTGAGTGTCAAGCCCACGCTCATCTTTTGGAGAAGAGTTATGCCGCTTTCTTTTCCAGTGAGTTTTCGGAGAGCCGGTACGAGGATTCTGTCGTAGATCGGTACCCAAATGGTGACGCCGATCATCGTAAAGATGTAGAAGGAGGCCGTTGGAATCTTGAAATTGGAGTGGCCAAGACGGCGGTCGGATTGAAGAGCTTGAAACACTCCATAAGTTTGATGGACTATGGGAACGAAGAAGAACATACAGGAAAAGAACATACAAAACGCAGGAATGGCCAAACCCCAACCCCAGCTGGTATCAGATTGGACGTAGACTATGGCCGTAAGAGAAATCATCACTGCGAAAGTGTAGGTGAAATAGTACCAATTGAAGAAGCTGCTGAGTCCTCTTTTGCCGGAATCTGTGTTGGGGTTGAATTGGTCGGCTCCAAGGGCAAGGTTGCAGGGCCTGATGCCACCGGCTCCGATGACCAGAAGTCCAAGTCCGCCGATCAGAAAAGCCATCTGCCATGGCGTTGGGCCGTGGCAGTCGCTGGCGTCGGCCGCCGATTGTCTGCACTGTGGAGGATGCAGTTTTGGAATTGCAGCAATGTTTGCCAAATGTTGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGGTGCGAGAAGGGAAGCGAGTGAGCAAATCCGAGAGTGTAGTAACGGCCAAAGTAAGTGTCGGAAAGGAAAGCTCCGATGAAGGTGGCGAAGTTGCAGGTTCCATTAAGAGAAACAGCAGCACCATTCTCAGCGTTGTCCATATGGCTGCTGATCTTCTTCTTCTTCTTCTTCTTTTCTGGGTTTTCTGTTCGTCAATGATATCAGCAAAACTAGCAAACGCTAAGCTATATATTGCCATGGCAGTCACCGTTTCTTCTCAAGCATTCAACATTCGGCATCCTTCGTCTCCTTCTTCTCAGGCAGATCAATCCGATGGTCGTTATCCGTCGCGTTCTCCGACGGAAGTTCTTGTCTCAGTTTATTAG

Coding sequence (CDS)

ATGTCTTCTCCTGTACTCAACAAAAGCAGAGAGAAACAGGGCTGCAATGTTGAGTGTCAAGCCCACGCTCATCTTTTGGAGAAGAGTTATGCCGCTTTCTTTTCCAGTGAGTTTTCGGAGAGCCGGTACGAGGATTCTGTCGTAGATCGGTACCCAAATGGTGACGCCGATCATCGTAAAGATGTAGAAGGAGGCCGTTGGAATCTTGAAATTGGAGTGGCCAAGACGGCGGTCGGATTGAAGAGCTTGAAACACTCCATAAGTTTGATGGACTATGGGAACGAAGAAGAACATACAGGAAAAGAACATACAAAACGCAGGAATGGCCAAACCCCAACCCCAGCTGGTATCAGATTGGACGTAGACTATGGCCGTAAGAGAAATCATCACTGCGAAAGTGTAGGTGAAATAGTACCAATTGAAGAAGCTGCTGAGTCCTCTTTTGCCGGAATCTGTGTTGGGGTTGAATTGGTCGGCTCCAAGGGCAAGGTTGCAGGGCCTGATGCCACCGGCTCCGATGACCAGAAGTCCAAGTCCGCCGATCAGAAAAGCCATCTGCCATGGCGTTGGGCCGTGGCAGTCGCTGGCGTCGGCCGCCGATTGTCTGCACTGTGGAGGATGCAGTTTTGGAATTGCAGCAATGTTTGCCAAATGTTGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGGTGCGAGAAGGGAAGCGAGTGAGCAAATCCGAGAGTGTAGTAACGGCCAAAGTAAGTGTCGGAAAGGAAAGCTCCGATGAAGGTGGCGAAGTTGCAGGTTCCATTAAGAGAAACAGCAGCACCATTCTCAGCGTTGTCCATATGGCTGCTGATCTTCTTCTTCTTCTTCTTCTTTTCTGGGTTTTCTGTTCGTCAATGATATCAGCAAAACTAGCAAACGCTAAGCTATATATTGCCATGGCAGTCACCGTTTCTTCTCAAGCATTCAACATTCGGCATCCTTCGTCTCCTTCTTCTCAGGCAGATCAATCCGATGGTCGTTATCCGTCGCGTTCTCCGACGGAAGTTCTTGTCTCAGTTTATTAG

Protein sequence

MSSPVLNKSREKQGCNVECQAHAHLLEKSYAAFFSSEFSESRYEDSVVDRYPNGDADHRKDVEGGRWNLEIGVAKTAVGLKSLKHSISLMDYGNEEEHTGKEHTKRRNGQTPTPAGIRLDVDYGRKRNHHCESVGEIVPIEEAAESSFAGICVGVELVGSKGKVAGPDATGSDDQKSKSADQKSHLPWRWAVAVAGVGRRLSALWRMQFWNCSNVCQMLREREREREREREVREGKRVSKSESVVTAKVSVGKESSDEGGEVAGSIKRNSSTILSVVHMAADLLLLLLLFWVFCSSMISAKLANAKLYIAMAVTVSSQAFNIRHPSSPSSQADQSDGRYPSRSPTEVLVSVY
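The relationship between the coding sequence and the protein sequence above can be checked by translating the CDS with the standard genetic code. The sketch below is illustrative only: `translate` is a hypothetical helper, and only the first and last 24 bases of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGTCTTCTCCTGTACTCAACAAA"   # first 24 bases of the CDS
cds_end = "GAAGTTCTTGTCTCAGTTTATTAG"     # last 24 bases of the CDS

n_terminus = translate(cds_start)
c_terminus = translate(cds_end)
print(n_terminus, c_terminus)
```

The two fragments translate to `MSSPVLNK` and `EVLVSVY`, which agree with the start and end of the protein sequence listed above. Passing the full CDS to the same function would be expected to reproduce the whole polypeptide.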
Homology
BLAST of Sgr017671 vs. NCBI nr
Match: KAB8093065.1 (hypothetical protein EE612_019750, partial [Oryza sativa] >BAS85815.1 Os03g0687100, partial [Oryza sativa Japonica Group])

HSP 1 Score: 67.4 bits (163), Expect = 2.8e-07
Identity = 37/96 (38.54%), Positives = 55/96 (57.29%), Query Frame = 0

Query: 94  NEEEHTGKEHTKRRNGQTPTPAGIRLDVDYGRKRNHHCESVGEIVPIEEAAESSFAGICV 153
           +EEE  G+EH +  + +   PA + LDVD     + H    GE+VP+EEA +++ AG+ V
Sbjct: 341 HEEEDAGEEHERGGDAEADGPADVALDVDDDGGGDEHGGGEGEVVPVEEAVDAALAGLRV 400

Query: 154 GVELVGSKGKVAGPDATGSDDQKSKSADQKSHLPWR 190
           GVELVG++   A PD      Q+ +  +Q   LP R
Sbjct: 401 GVELVGAERHAARPDPRRPQHQERERHEQHRELPRR 436

BLAST of Sgr017671 vs. ExPASy TrEMBL
Match: A0A0P0W2A2 (Os03g0687100 protein (Fragment) OS=Oryza sativa subsp. japonica OX=39947 GN=Os03g0687100 PE=4 SV=1)

HSP 1 Score: 67.4 bits (163), Expect = 1.4e-07
Identity = 37/96 (38.54%), Positives = 55/96 (57.29%), Query Frame = 0

Query: 94  NEEEHTGKEHTKRRNGQTPTPAGIRLDVDYGRKRNHHCESVGEIVPIEEAAESSFAGICV 153
           +EEE  G+EH +  + +   PA + LDVD     + H    GE+VP+EEA +++ AG+ V
Sbjct: 341 HEEEDAGEEHERGGDAEADGPADVALDVDDDGGGDEHGGGEGEVVPVEEAVDAALAGLRV 400

Query: 154 GVELVGSKGKVAGPDATGSDDQKSKSADQKSHLPWR 190
           GVELVG++   A PD      Q+ +  +Q   LP R
Sbjct: 401 GVELVGAERHAARPDPRRPQHQERERHEQHRELPRR 436
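Both HSPs above report the same match counts over a 96-column alignment. As a small sketch of how the percentages follow from those counts (the helper name `percent` is hypothetical and not part of any BLAST tool):

```python
def percent(matches, alignment_length):
    """Return matches as a percentage of the alignment length, to 2 decimals."""
    return round(100 * matches / alignment_length, 2)

alignment_length = 96   # denominator reported in the HSP summary line
identical = 37          # exact residue matches
positive = 55           # identical plus conservative substitutions

identity_pct = percent(identical, alignment_length)
positives_pct = percent(positive, alignment_length)
print(identity_pct, positives_pct)
```

This gives 38.54 and 57.29, matching the values reported for each HSP.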

The following BLAST results are available for this feature:
Match Name    E-value  Identity (%)  Description
KAB8093065.1  2.8e-07  38.54         hypothetical protein EE612_019750, partial [Oryza sativa] >BAS85815.1 Os03g06871...
A0A0P0W2A2    1.4e-07  38.54         Os03g0687100 protein (Fragment) OS=Oryza sativa subsp. japonica OX=39947 GN=Os03...
InterPro
Analysis Name: InterPro Annotations of Monk fruit (Qingpiguo) v1
Date Performed: 2022-08-01
IPR Term  IPR Description   Source       Source Term  Source Description   Alignment
None      No IPR available  MOBIDB_LITE  mobidb-lite  disorder_prediction  coord: 94..108
None      No IPR available  MOBIDB_LITE  mobidb-lite  disorder_prediction  coord: 323..352
None      No IPR available  MOBIDB_LITE  mobidb-lite  disorder_prediction  coord: 323..341
None      No IPR available  MOBIDB_LITE  mobidb-lite  disorder_prediction  coord: 94..118
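The four disorder predictions above overlap in pairs. One way to summarise them is to merge overlapping coordinate ranges into distinct regions. The following is a minimal sketch, assuming the coordinates are 1-based, inclusive residue positions; `merge_intervals` is a hypothetical helper, not part of InterProScan or MobiDB-lite.

```python
def merge_intervals(intervals):
    """Merge overlapping or directly adjacent inclusive (start, end) ranges."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1] + 1:
            # Overlaps or touches the previous region: extend it.
            merged[-1][1] = max(merged[-1][1], end)
        else:
            merged.append([start, end])
    return [tuple(region) for region in merged]

# Coordinates copied from the Alignment column of the table above.
coords = [(94, 108), (323, 352), (323, 341), (94, 118)]

regions = merge_intervals(coords)
disordered_residues = sum(end - start + 1 for start, end in regions)
print(regions, disordered_residues)
```

Under these assumptions the predictions collapse to two regions, 94..118 and 323..352, covering 55 residues in total.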

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name  Type
Sgr017671.1   Sgr017671.1  mRNA