Sgr026940 (gene) Monk fruit (Qingpiguo) v1

Overview
Name: Sgr026940
Type: gene
Organism: Siraitia grosvenorii (Monk fruit (Qingpiguo) v1)
Description: Unknown protein
Location: tig00153047: 2430290 .. 2432154 (-)
RNA-Seq Expression: Sgr026940
Synteny: Sgr026940
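
The Location field packs a contig name, a coordinate range, and a strand into one string. The sketch below shows one way to split it apart and derive the span length. The names `Location` and `parse_location` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Parsed form of a 'contig: start .. end (strand)' string."""
    contig: str
    start: int
    end: int
    strand: str

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


# Matches e.g. "tig00153047: 2430290 .. 2432154 (-)"
LOCATION_RE = re.compile(r"^\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*$")


def parse_location(text: str) -> Location:
    """Split a location string into contig, start, end, and strand."""
    match = LOCATION_RE.match(text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    contig, start, end, strand = match.groups()
    return Location(contig, int(start), int(end), strand)


loc = parse_location("tig00153047: 2430290 .. 2432154 (-)")
print(loc.contig, loc.strand, loc.length)
```

Under the inclusive-coordinate assumption, this gene spans 1865 bases on the minus strand of tig00153047.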
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTACATGATCTCACCATCTGATCCACCTTCTCCTTCCCCACCACCGCCCCCGCTGCCCTCAGCGCCTTCATTCACTGGTAAAACACCGGCGCCCATGGAAATTGGCTCCACAGCCTTCAAGATCTTCCACTCCTTAGGCCCGCATTCTCTCCTTGAAGCAAACCCACATTTCTTCCCATTGCAAAATGTCCTCCAAACAGGCTCTTCAAGCAGCCTCCCTCCTAATTTCTTGTTCTCCTCTTTCTCTTTATCGCATTCCAGCGCAATTCGAACCAGTCCAGACGCCATTTCCTTCACCAAACCACTGATTGGCGTGGCGAGCTCAATCAAGAAAGCTGGTTGTGAATTGGGATCTCTCTGAAATGCAAAATGTACGTGCCCACGGCGAGAACCAAACAGAGTTCCAACCACTCGAGAACGTAGGCCGAGCTGCGGGTTCCGATGGCGGCTGAGGGCGGTGAGAACAGAACGGAACCGAGAAATGGCAAGCTGTTGCAGTTTCCTCCTTGTGAATTGTGCTCGAACCTGCTGCCGCTGGTCCTGTAGCTCCTGTGAAATGGGTTTTGCGTTCTCCTCCTTTACTTCTCCAGCAGATTCTGAGGATTTGAAGGTGGAATCGTCTTCATCTTCATTACTAACCTTCCTTGTCCAATGGAAGTGCCTTCTGGAGGAGGATTCAAGAGCCATTGATGTCTGCAGCTGAGAAACCAAGGGGAAGGAAGGAAGGAAGGTAGGTTCTTTAGAAGGTTGGTTGGTAGGAGAGAGAATGAGAGGGGTGTATAATAGAGAGAATGAGGTTTCTCCCCTTTTATTTGCTGTGTGGGGCTCTCAACATTTAATTCTTAGTTTCAAGGTTCATTGATGTGCTGCAAGAGAATCCTGACAAAAGAAACATAGACTGACCCACCAACTGATATGGGGAGACATGAGAGAGAGAGAGAGACTTGAAATCAAAAAAGAGAGAGAGAGAGAGAAAGGAATCCAGAGAGTCTGTGTTTGTGTTACTTTCTTGTGAGATTGGTATTTGGTTGACTTGAATCCCTCCAACAGTGAAACGATGGCAACAAAAGCCACACCAAGTATAAGTCAGAACTGTTCATAATAGACAATCTAACAACAATCAAATCCAAATCCATGTTTGCGAGTGATACACAAAGCTTCTTAGGGTCACTGAGAAACCTCTTGAACCCATGTTGGGCAGAGAGTATACATTATGTCTTTACAACAAAAGATCTAATAATCATTCTCACGAGACCCATCATCTGATTCATCCAAAGACAAGAAAAATGTTTCATTTAGGTATTGAAGGTTTGAGAAAGCCCTGCCTCGACAGTAAATCTGAAAGGACCAAGGCAAAGCCGATTTCTGAAGGTTGTAATGTTAACAGTTAACACGCACATGCTGTAGGGAGACAAATATGTCCATTCAGATAGTATATTGGGGAGAGGGAGACAAATATATCAATGAACTAAACAGTCGCTGCTTGTATTAACTAGACATCGATCCGCAGATAATGGATATATATTACAATGCATCTTCACAGAGACCAGTAGCTAGGTTAATCATTTCTTTTTTGTACATATGCACAGGGCATATTTGAAGCACGAATGGAAGGGTCAAGGAATCTTGCCTATGGTTGAAGAAAATAAAGATCCTAAAATTAGCTCTCTAAACTTTGCTCGGCCTCCGGTTGCTTCGGAGCAGTGTTGCTGCCATTCATTCTTGCACGTGCCTCTGCAAACATTCTTTGTTGCTCAGCCAAAGCTTCTTCCTCGGTCATCTCAGCTCCATTACTCCACTTTCCACCTTTTAAGGAATCTTGCTGGAGAAATTTCTCAACAAAGAAGAAAAGAAAAATGTTAG

mRNA sequence

ATGTACATGATCTCACCATCTGATCCACCTTCTCCTTCCCCACCACCGCCCCCGCTGCCCTCAGCGCCTTCATTCACTGGTAAAACACCGGCGCCCATGGAAATTGGCTCCACAGCCTTCAAGATCTTCCACTCCTTAGGCCCGCATTCTCTCCTTGAAGCAAACCCACATTTCTTCCCATTGCAAAATGTCCTCCAAACAGGCTCTTCAAGCAGCCTCCCTCCTAATTTCTTGTTCTCCTCTTTCTCTTTATCGCATTCCAGCGCAATTCGAACCAGTCCAGACGCCATTTCCTTCACCAAACCACTGATTGGCGTGGCGAGCTCAATCAAGAAAGCTGAACGTAGGCCGAGCTGCGGGTTCCGATGGCGGCTGAGGGCGGTGAGAACAGAACGGAACCGAGAAATGGCAAGCTGTTGCAGTTTCCTCCTTGTGAATTGTGCTCGAACCTGCTGCCGCTGGTCCTGTAGCTCCTGTGAAATGGGTTTTGCGTTCTCCTCCTTTACTTCTCCAGCAGATTCTGAGGATTTGAAGGTGGAATCTGCCTTCTGGAGGAGGATTCAAGAGCCATTGATGTCTGCAGCTGAGAAACCAAGGGGAAGGAAGGAAGGAAGGGCATATTTGAAGCACGAATGGAAGGGTCAAGGAATCTTGCCTATGGTTGAAGAAAATAAAGATCCTAAAATTAGCTCTCTAAACTTTGCTCGGCCTCCGGTTGCTTCGGAGCAGTGTTGCTGCCATTCATTCTTGCACGTGCCTCTGCAAACATTCTTTGTTGCTCAGCCAAAGCTTCTTCCTCGGTCATCTCAGCTCCATTACTCCACTTTCCACCTTTTAAGGAATCTTGCTGGAGAAATTTCTCAACAAAGAAGAAAAGAAAAATGTTAG

Coding sequence (CDS)

ATGTACATGATCTCACCATCTGATCCACCTTCTCCTTCCCCACCACCGCCCCCGCTGCCCTCAGCGCCTTCATTCACTGGTAAAACACCGGCGCCCATGGAAATTGGCTCCACAGCCTTCAAGATCTTCCACTCCTTAGGCCCGCATTCTCTCCTTGAAGCAAACCCACATTTCTTCCCATTGCAAAATGTCCTCCAAACAGGCTCTTCAAGCAGCCTCCCTCCTAATTTCTTGTTCTCCTCTTTCTCTTTATCGCATTCCAGCGCAATTCGAACCAGTCCAGACGCCATTTCCTTCACCAAACCACTGATTGGCGTGGCGAGCTCAATCAAGAAAGCTGAACGTAGGCCGAGCTGCGGGTTCCGATGGCGGCTGAGGGCGGTGAGAACAGAACGGAACCGAGAAATGGCAAGCTGTTGCAGTTTCCTCCTTGTGAATTGTGCTCGAACCTGCTGCCGCTGGTCCTGTAGCTCCTGTGAAATGGGTTTTGCGTTCTCCTCCTTTACTTCTCCAGCAGATTCTGAGGATTTGAAGGTGGAATCTGCCTTCTGGAGGAGGATTCAAGAGCCATTGATGTCTGCAGCTGAGAAACCAAGGGGAAGGAAGGAAGGAAGGGCATATTTGAAGCACGAATGGAAGGGTCAAGGAATCTTGCCTATGGTTGAAGAAAATAAAGATCCTAAAATTAGCTCTCTAAACTTTGCTCGGCCTCCGGTTGCTTCGGAGCAGTGTTGCTGCCATTCATTCTTGCACGTGCCTCTGCAAACATTCTTTGTTGCTCAGCCAAAGCTTCTTCCTCGGTCATCTCAGCTCCATTACTCCACTTTCCACCTTTTAAGGAATCTTGCTGGAGAAATTTCTCAACAAAGAAGAAAAGAAAAATGTTAG

Protein sequence

MYMISPSDPPSPSPPPPPLPSAPSFTGKTPAPMEIGSTAFKIFHSLGPHSLLEANPHFFPLQNVLQTGSSSSLPPNFLFSSFSLSHSSAIRTSPDAISFTKPLIGVASSIKKAERRPSCGFRWRLRAVRTERNREMASCCSFLLVNCARTCCRWSCSSCEMGFAFSSFTSPADSEDLKVESAFWRRIQEPLMSAAEKPRGRKEGRAYLKHEWKGQGILPMVEENKDPKISSLNFARPPVASEQCCCHSFLHVPLQTFFVAQPKLLPRSSQLHYSTFHLLRNLAGEISQQRRKEKC
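
The protein sequence can be checked against the coding sequence by translating the CDS codon by codon. The sketch below assumes the standard genetic code (NCBI translation table 1); the page does not say which table the annotation pipeline used, and `translate` is a hypothetical helper, not part of any tool named here.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, stop_at_stop: bool = True) -> str:
    """Translate a coding sequence; unknown codons become 'X'."""
    cds = cds.upper().replace("U", "T")
    protein = []
    # Ignore any trailing bases that do not form a full codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*" and stop_at_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First nine codons of the CDS listed above.
print(translate("ATGTACATGATCTCACCATCTGATCCA"))  # MYMISPSDP
```

Passing the full CDS string to `translate` and comparing the result with the listed protein sequence is a quick consistency check on the record.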
Homology
BLAST of Sgr026940 vs. NCBI nr
Match: KAF8675905.1 (hypothetical protein HU200_047402 [Digitaria exilis])

HSP 1 Score: 72.0 bits (175), Expect = 9.6e-09
Identity = 57/149 (38.26%), Positives = 69/149 (46.31%), Query Frame = 0

Query: 20  PSAPSFTGKTPAPMEIGSTAFKIFHSLGPHSLLEANPHFFPLQNVLQTGSSSSL-----P 79
           PS P   G TPAPM+  S+A     S   HS   A PH  PLQ    T SS+S      P
Sbjct: 6   PSPPHAAGMTPAPMDTASSARSTRQSAAAHSRRAAYPHPLPLQYARHTFSSTSFLPPPPP 65

Query: 80  PNFLFSSFSLSHSSAIRTSPDAISFTKPLIGVASSIKKA--------------ERRPSCG 139
           P    ++   SHSSA+RT P+AIS T   +G ASS  +A               R P   
Sbjct: 66  PPLATAALGRSHSSAMRTRPEAISRTSDAVGAASSSSRAGHARGSTWNASVPVSRAPRPT 125

Query: 140 FRWR--LRAVRTERNREMASCCSFLLVNC 148
           +RWR  + A R  R+RE A    FL   C
Sbjct: 126 WRWRPAMAAARAVRSRETAVMRPFLPARC 154

BLAST of Sgr026940 vs. NCBI nr
Match: KAF8697035.1 (hypothetical protein HU200_036686 [Digitaria exilis])

HSP 1 Score: 71.2 bits (173), Expect = 1.6e-08
Identity = 56/149 (37.58%), Positives = 69/149 (46.31%), Query Frame = 0

Query: 20  PSAPSFTGKTPAPMEIGSTAFKIFHSLGPHSLLEANPHFFPLQNVLQTGSSSSL-----P 79
           PS P   G TPAP++  S+A     S  PHS   A PH  PLQ    T SS+S      P
Sbjct: 6   PSPPHAAGMTPAPIDTASSARSTRQSAAPHSRRAAYPHPLPLQYARHTFSSTSFLPPPPP 65

Query: 80  PNFLFSSFSLSHSSAIRTSPDAISFTKPLIGVASSIKKA--------------ERRPSCG 139
           P    ++   SHSSA+RT P+AIS T   +G ASS  +A               R P   
Sbjct: 66  PPLAAAALGRSHSSAMRTRPEAISRTSEAVGAASSSSRAGHARGSTWNASVPVSRAPRPT 125

Query: 140 FRWR--LRAVRTERNREMASCCSFLLVNC 148
           +RWR  + A    R+RE A    FL   C
Sbjct: 126 WRWRPAMAAASAVRSRETAVMRPFLPARC 154

BLAST of Sgr026940 vs. NCBI nr
Match: KAG6542517.1 (hypothetical protein Mapa_015987 [Marchantia paleacea])

HSP 1 Score: 62.8 bits (151), Expect = 5.8e-06
Identity = 42/92 (45.65%), Positives = 53/92 (57.61%), Query Frame = 0

Query: 27  GKTPAPMEIGSTAFKIFHSLGPHSLLEANPHFFPLQNVLQTGS---SSSLPPNF--LFSS 86
           G TPAPM       +IF S+   SLL ANP F PLQN LQTGS   S  L P+   L   
Sbjct: 17  GMTPAPMLTLCMRLRIFRSVSELSLLMANPTFLPLQNKLQTGSKRRSGDLLPSLQSLEGG 76

Query: 87  FSLSHSSAIRTSPDAISFTKPLIGVASSIKKA 114
              SHSSAI + P+ IS T+  +G+A+S + +
Sbjct: 77  GDFSHSSAILSMPEDISLTREYVGIANSSRSS 108

BLAST of Sgr026940 vs. NCBI nr
Match: KAG0538653.1 (hypothetical protein BDA96_03G255000 [Sorghum bicolor] >OQU87223.1 hypothetical protein SORBI_3003G235601 [Sorghum bicolor])

HSP 1 Score: 61.6 bits (148), Expect = 1.3e-05
Identity = 56/162 (34.57%), Positives = 70/162 (43.21%), Query Frame = 0

Query: 1   MYMISPSDPPSPSPPPPPLPSAPSFTGKTPAPMEIGSTAFKIFHSLGPHSLLEANPHFFP 60
           MYM SPS PP            P   G TPAP++ GS+A     S  PHS   A P   P
Sbjct: 1   MYMTSPSPPP------------PHAAGMTPAPIDTGSSARSTRQSAAPHSRRTAYPQPLP 60

Query: 61  LQNVLQTGSSSSL----PPNFLFSS-----FSLSHSSAIRTSPDAISFTKPLIGVASSIK 120
           LQ      SS+S     PP  + ++      + SHSSA+RT P+AIS T   +G ASS  
Sbjct: 61  LQYARHAFSSTSFLPPPPPPLVAAAAQAAPLARSHSSAMRTRPEAISRTSDAVGAASSSS 120

Query: 121 KAER----------------RPSCGFRWRLRAVRTERNREMA 138
            A                   P+C  R  + A    R+RE A
Sbjct: 121 SAGHARGSTWNASVPVSRAPSPTCRCRPAMAAASAVRSRETA 150

BLAST of Sgr026940 vs. NCBI nr
Match: KAF8726779.1 (hypothetical protein HU200_019254 [Digitaria exilis])

HSP 1 Score: 61.6 bits (148), Expect = 1.3e-05
Identity = 65/181 (35.91%), Positives = 77/181 (42.54%), Query Frame = 0

Query: 1   MYMISPSDPPSPSPPPPPLPSAPSFTGKTPAPMEIGSTAFKIFHSLGPHSLLEANPHFFP 60
           MYM SPS       PPP L +     G TPAPM+ GS A     S  PHS   A P   P
Sbjct: 1   MYMTSPS-------PPPQLAA-----GITPAPMDTGSRARSTRQSAAPHSRRTAYPQLLP 60

Query: 61  LQNVLQTGSSSS-LPPNFLFSSFSL------------SHSSAIRTSPDAISFTKPLIGVA 120
           LQ    T SS+S LPP  L  + ++            SHSSA+RT P+AIS T   +G A
Sbjct: 61  LQYARHTVSSTSFLPPQLLGPAVAVEVGVGSAEPLARSHSSAMRTRPEAISRTSAAVGAA 120

Query: 121 SSIKKAER----------------RPSCGFRWRLRAVRTERNREMASCCSFLLVNCARTC 153
           SS   A                  RP+   R  +     ER R+ A     LL  C   C
Sbjct: 121 SSSSSAGHARGSTWNASVPVSRAPRPTWRCRPAMALASAERRRDTAVRTFLLLGGCC--C 167

BLAST of Sgr026940 vs. ExPASy TrEMBL
Match: A0A7C8YZ86 (Uncharacterized protein OS=Opuntia streptacantha OX=393608 PE=4 SV=1)

HSP 1 Score: 78.2 bits (191), Expect = 6.5e-11
Identity = 47/86 (54.65%), Positives = 55/86 (63.95%), Query Frame = 0

Query: 33  MEIGSTAFKIFHSLGPHSLLEANPHFFPLQNVLQTGSSSSLPPNFLFSSFS-----LSHS 92
           M++G T  +  HS GP S L A PHF PLQ  LQ GSS S    FLF  FS     LSHS
Sbjct: 1   MDMGPTELRTRHSWGPQSFLMAYPHFLPLQYALQRGSSRSFFFFFLFFLFSQQLDVLSHS 60

Query: 93  SAIRTSPDAISFTKPLIGVASSIKKA 114
           +A  TSPDAIS T+ L+GV SS++KA
Sbjct: 61  NATFTSPDAISLTRTLVGVESSMRKA 86

BLAST of Sgr026940 vs. ExPASy TrEMBL
Match: A0A1W0VYM4 (Uncharacterized protein OS=Sorghum bicolor OX=4558 GN=SORBI_3003G235601 PE=4 SV=1)

HSP 1 Score: 61.6 bits (148), Expect = 6.3e-06
Identity = 56/162 (34.57%), Positives = 70/162 (43.21%), Query Frame = 0

Query: 1   MYMISPSDPPSPSPPPPPLPSAPSFTGKTPAPMEIGSTAFKIFHSLGPHSLLEANPHFFP 60
           MYM SPS PP            P   G TPAP++ GS+A     S  PHS   A P   P
Sbjct: 1   MYMTSPSPPP------------PHAAGMTPAPIDTGSSARSTRQSAAPHSRRTAYPQPLP 60

Query: 61  LQNVLQTGSSSSL----PPNFLFSS-----FSLSHSSAIRTSPDAISFTKPLIGVASSIK 120
           LQ      SS+S     PP  + ++      + SHSSA+RT P+AIS T   +G ASS  
Sbjct: 61  LQYARHAFSSTSFLPPPPPPLVAAAAQAAPLARSHSSAMRTRPEAISRTSDAVGAASSSS 120

Query: 121 KAER----------------RPSCGFRWRLRAVRTERNREMA 138
            A                   P+C  R  + A    R+RE A
Sbjct: 121 SAGHARGSTWNASVPVSRAPSPTCRCRPAMAAASAVRSRETA 150

BLAST of Sgr026940 vs. ExPASy TrEMBL
Match: A0A0A9K4X2 (Uncharacterized protein OS=Arundo donax OX=35708 PE=4 SV=1)

HSP 1 Score: 55.5 bits (132), Expect = 4.5e-04
Identity = 46/118 (38.98%), Positives = 52/118 (44.07%), Query Frame = 0

Query: 29  TPAPMEIGSTAFKIFHSLGPHSLLEANPHFFPLQNVLQTGSSSS-------LPPNF---- 88
           TPAPM+ GS+A     S  PHS   A P   PLQ    T SSSS       LPP      
Sbjct: 2   TPAPMDTGSSARSTRQSAAPHSRRTAYPQLLPLQYARHTVSSSSFLPLPLPLPPVLGAAV 61

Query: 89  ---LFSSFSLSHSSAIRTSPDAISFTKPLIGVASSIKKAERRPSCGFRWRLRAVRTER 133
                   + SHSSA+RT P+AIS T   +G ASS   A      G  W  R     R
Sbjct: 62  GVGSAEPLARSHSSAMRTRPEAISRTSAAVGAASSSSSAGHER--GSTWNARCTWPRR 117
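
Each HSP summary line above reports identity and positives as a count over the alignment length, with the percentage in parentheses. The sketch below shows one way to pull those numbers out and recompute the percentages; `parse_hsp_stats` is a hypothetical helper written for this page's line layout, not a function from BLAST or any library.

```python
import re

# Matches "Identity = 57/149 (38.26%)" and "Positives = 69/149 (46.31%)".
# "Posi?tives" also accepts the "Postives" spelling seen in some exports.
HSP_STAT_RE = re.compile(
    r"(Identity|Posi?tives)\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)"
)


def parse_hsp_stats(line: str) -> dict:
    """Extract identity/positives counts and recompute their percentages."""
    stats = {}
    for name, matched, length, reported in HSP_STAT_RE.findall(line):
        key = "identity" if name == "Identity" else "positives"
        stats[key] = {
            "matched": int(matched),
            "length": int(length),
            "reported_pct": float(reported),
            "computed_pct": round(100 * int(matched) / int(length), 2),
        }
    return stats


line = "Identity = 57/149 (38.26%), Positives = 69/149 (46.31%), Query Frame = 0"
for key, values in parse_hsp_stats(line).items():
    print(key, values["matched"], values["length"], values["computed_pct"])
```

Comparing `reported_pct` with `computed_pct` is a cheap way to catch transcription errors when these lines are copied into another table.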

The following BLAST results are available for this feature:
Match Name      E-value   Identity   Description
KAF8675905.1    9.6e-09   38.26      hypothetical protein HU200_047402 [Digitaria exilis]
KAF8697035.1    1.6e-08   37.58      hypothetical protein HU200_036686 [Digitaria exilis]
KAG6542517.1    5.8e-06   45.65      hypothetical protein Mapa_015987 [Marchantia paleacea]
KAG0538653.1    1.3e-05   34.57      hypothetical protein BDA96_03G255000 [Sorghum bicolor] >OQU87223.1 hypothetical ...
KAF8726779.1    1.3e-05   35.91      hypothetical protein HU200_019254 [Digitaria exilis]

Match Name      E-value   Identity   Description
A0A7C8YZ86      6.5e-11   54.65      Uncharacterized protein OS=Opuntia streptacantha OX=393608 PE=4 SV=1
A0A1W0VYM4      6.3e-06   34.57      Uncharacterized protein OS=Sorghum bicolor OX=4558 GN=SORBI_3003G235601 PE=4 SV=...
A0A0A9K4X2      4.5e-04   38.98      Uncharacterized protein OS=Arundo donax OX=35708 PE=4 SV=1
InterPro
Analysis Name: InterPro Annotations of Monk fruit (Qingpiguo) v1
Date Performed: 2022-08-01
IPR Term   IPR Description    Source        Source Term   Source Description    Alignment
None       No IPR available   MOBIDB_LITE   mobidb-lite   disorder_prediction   coord: 7..25
None       No IPR available   MOBIDB_LITE   mobidb-lite   disorder_prediction   coord: 1..30

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name   Unique Name   Type
Sgr026940.1    Sgr026940.1   mRNA