Sgr017749 (gene) Monk fruit (Qingpiguo) v1

Overview
Name: Sgr017749
Type: gene
Organism: Siraitia grosvenorii (Monk fruit (Qingpiguo) v1)
Description: Large proline-rich protein BAG6 isoform X2
Location: tig00153055: 517572 .. 519425 (+)
RNA-Seq Expression: Sgr017749
Synteny: Sgr017749
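
The Location field packs a contig name, a start, an end, and a strand into one string. The following minimal sketch splits it into parts and computes the span length. It assumes the coordinates are 1-based and inclusive, which this page does not state; `Location`, `parse_location`, and `span_length` are hypothetical helper names, not part of any database API.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    contig: str
    start: int
    end: int
    strand: str


def parse_location(text: str) -> Location:
    """Parse a string such as 'tig00153055: 517572 .. 519425 (+)'."""
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    return Location(m.group(1), int(m.group(2)), int(m.group(3)), m.group(4))


def span_length(loc: Location) -> int:
    # Assumption: start and end are 1-based and both inclusive.
    return loc.end - loc.start + 1


loc = parse_location("tig00153055: 517572 .. 519425 (+)")
print(loc.contig, loc.strand, span_length(loc))
```

Under that assumption the gene spans 1854 bp on the plus strand of the contig.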
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTCGACTTACGGTGGCACGGACGACGATATCAAGGAGCGCAAAACAAATAATACCATAGAGAATATCCTTCTTATAGATCAGGGTAATTGATTCTTTTTTTTTTCTTGATCAATTCTAACAGAACTTAACCATTGGGGAAATTTTTTCCTCTCAAAGTACCTATTTTCTCTACACTATGAATTTTTTTTCATAAATAATTTTTTAATGTTGGATTCCACGTCATTTTTAAATAATACGGACGTGGCTAATAATTAATAGGAGTGGTATTCATACTATTGTTGCATATGAGAGGAAAATTTTGGTGTTCACATTTTAGTTTTATCATCATTTTCTAACAGGGCATCTGATGAGGAATTTTGCTCTGAACTTGGTTTTTTATTAGATAGAAAGAGCCATAAGAGGAGCTTTCATGCATTTTGTTTTTTATGACTGTTCAATCCCTTGGCATAAATTAAGGGAAAATTTTTCTGTTGACAACCTAACTTTTGCTTCCTTGCATTAGAAGCAGGCTGCATGGGTAATTATCAATAGGTCCTTTCTAGATACAATTTTATAGAATAAAAATTATCTAGTCTATCAACAAACTGTTTGCATTTTTTTTTTTTATAAAAAAAGAACTGTTTGCCTTTCATCTTGATTTTCAAGACGTGGTTATTCTCTTTTATCTTGGAAGTTGGAACCAAGGTGGGTTCATCATGATACCATGATGAAAGATAAAGGGCAATATATGTTATATAAATTTGTTTTTGGGACCTTTTTCCCAAAAAAAAAAAAAAAGAAACACAAATGTAAATGAATTGTTCGCCAAAACAATTAACAAAAATTAATAAAATTATAAAAAAAAATTATCAAATACCAATCTCGATATAATATGCCAATTTGGGTATTATTTACATCATCTATCTCGGCAACTATCTTGATGTGCAAATGAGTTTCAATTACTTGGTCGTTTTGTTAATGAACATATAACTTCTTTTATAGGCAAGAGAGTTGTATACTAAGAGGATATTTGTAGACATGTCTTATAAAAACTAATGGCCGAAGCTACAAGAATTCGTTAAAGAAAGTCTAAGGGATTATTTGGAGTCGATTTTTAAATCTATTTCCTATTTTTTAGAATAATTGTTCGTTTAAAAACATGTTTAGTTTCTTTTCTTTCAAGGTTTTATTTTTTGAAAAAAAAAAATTTTTTATGAATAGACTATTTTTTAATTAATTCGTTAAAAAAAGCATAAATTGTCGAATAGTTACGTAATAATCTTAACTTCCCAGTTTTAAAAAATTATTTTACAGAAGAAGCTAACAAACAAATAGGTCTAATGTTAAAAACTCGTTCCAGGTACATCTGAGACGTGGCGTAATACAATTGGCTGTGAGGTTAATGAAAGCAATGTTTGGGTGTTCCCTACCGTAGAAGATTTAAATTCTCGCTTATTTGCTTCTCCCTCCATTTTCGACCGTGAGAAATCCGCCCCCTCGTCTGCCCCTTCTGTTGGGAGTTTCTGCTGTTACAGTTCTCCCATGACTCCTCCCTCAGCACCTTCAAGCTCAGACCCTGGAGTCCAAATTCCCAAAAAGACACTTGGTTTGTTCGCAAACGCACTAAAACGCAAGGATAGCTTCATTCAGTTCTTCGCGATGACTGGTATTTTACTCTTAAGCGTTCGATCTTTGGGTCAGAAGTACCGTATCCACGACCTGCAAGAGGACACTTCGGCTCTCAAAGAAGAACATAAAACCCTAATTGACCGTATGAAGAACATCAAGCGCAGCCTCCTTCATGAGGCATCGATCGAGCCCACTGGCCTCTTCGCATCCAGGCTTCGCCTTCTCTTCAGCGAAGAAGATTGA

mRNA sequence

ATGTCGACTTACGGTGGCACGGACGACGATATCAAGGAGCGCAAAACAAATAATACCATAGAGAATATCCTTCTTATAGATCAGGGTACATCTGAGACGTGGCGTAATACAATTGGCTGTGAGGTTAATGAAAGCAATGTTTGGGTGTTCCCTACCGTAGAAGATTTAAATTCTCGCTTATTTGCTTCTCCCTCCATTTTCGACCGTGAGAAATCCGCCCCCTCGTCTGCCCCTTCTGTTGGGAGTTTCTGCTGTTACAGTTCTCCCATGACTCCTCCCTCAGCACCTTCAAGCTCAGACCCTGGAGTCCAAATTCCCAAAAAGACACTTGGTTTGTTCGCAAACGCACTAAAACGCAAGGATAGCTTCATTCAGTTCTTCGCGATGACTGGTATTTTACTCTTAAGCGTTCGATCTTTGGGTCAGAAGTACCGTATCCACGACCTGCAAGAGGACACTTCGGCTCTCAAAGAAGAACATAAAACCCTAATTGACCGTATGAAGAACATCAAGCGCAGCCTCCTTCATGAGGCATCGATCGAGCCCACTGGCCTCTTCGCATCCAGGCTTCGCCTTCTCTTCAGCGAAGAAGATTGA

Coding sequence (CDS)

ATGTCGACTTACGGTGGCACGGACGACGATATCAAGGAGCGCAAAACAAATAATACCATAGAGAATATCCTTCTTATAGATCAGGGTACATCTGAGACGTGGCGTAATACAATTGGCTGTGAGGTTAATGAAAGCAATGTTTGGGTGTTCCCTACCGTAGAAGATTTAAATTCTCGCTTATTTGCTTCTCCCTCCATTTTCGACCGTGAGAAATCCGCCCCCTCGTCTGCCCCTTCTGTTGGGAGTTTCTGCTGTTACAGTTCTCCCATGACTCCTCCCTCAGCACCTTCAAGCTCAGACCCTGGAGTCCAAATTCCCAAAAAGACACTTGGTTTGTTCGCAAACGCACTAAAACGCAAGGATAGCTTCATTCAGTTCTTCGCGATGACTGGTATTTTACTCTTAAGCGTTCGATCTTTGGGTCAGAAGTACCGTATCCACGACCTGCAAGAGGACACTTCGGCTCTCAAAGAAGAACATAAAACCCTAATTGACCGTATGAAGAACATCAAGCGCAGCCTCCTTCATGAGGCATCGATCGAGCCCACTGGCCTCTTCGCATCCAGGCTTCGCCTTCTCTTCAGCGAAGAAGATTGA

Protein sequence

MSTYGGTDDDIKERKTNNTIENILLIDQGTSETWRNTIGCEVNESNVWVFPTVEDLNSRLFASPSIFDREKSAPSSAPSVGSFCCYSSPMTPPSAPSSSDPGVQIPKKTLGLFANALKRKDSFIQFFAMTGILLLSVRSLGQKYRIHDLQEDTSALKEEHKTLIDRMKNIKRSLLHEASIEPTGLFASRLRLLFSEED
Homology
BLAST of Sgr017749 vs. NCBI nr
Match: XP_022142735.1 (uncharacterized protein LOC111012778 [Momordica charantia] >XP_022142744.1 uncharacterized protein LOC111012778 [Momordica charantia])

HSP 1 Score: 185.7 bits (470), Expect = 4.0e-43
Identity = 94/106 (88.68%), Positives = 101/106 (95.28%), Query Frame = 0

Query: 93  PSAPSSSDPGVQIPKKTLGLFANALKRKDSFIQFFAMTGILLLSVRSLGQKYRIHDLQED 152
           PSAPSSS P  Q+PKK LGLFANALKRKDSFIQFFAMTGI+LLSVRSLGQKYRIHDLQED
Sbjct: 3   PSAPSSSAPEFQLPKKPLGLFANALKRKDSFIQFFAMTGIMLLSVRSLGQKYRIHDLQED 62

Query: 153 TSALKEEHKTLIDRMKNIKRSLLHEASIEPTGLFASRLRLLFSEED 199
           TSALK+EH+TLIDRMKNIKRSLLHEAS++PTG FASRLRLLFS+ED
Sbjct: 63  TSALKQEHETLIDRMKNIKRSLLHEASLDPTGFFASRLRLLFSDED 108
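
The Identity figure in an HSP is the number of aligned positions where query and subject carry the same residue, divided by the alignment length. The sketch below recomputes it for the Momordica charantia hit from the two aligned rows shown above. `percent_identity` is a hypothetical helper; it handles only the identity count, since the Positives count additionally depends on a substitution matrix that this page does not name.

```python
def percent_identity(query: str, subject: str) -> tuple[int, int, float]:
    """Return (identical positions, alignment length, percent identity)."""
    if len(query) != len(subject):
        raise ValueError("aligned rows must have the same length")
    # Gap characters never count as identities.
    matches = sum(q == s and q != "-" for q, s in zip(query, subject))
    length = len(query)
    return matches, length, round(100 * matches / length, 2)


# Aligned rows of HSP 1 against XP_022142735.1, both blocks joined.
query = (
    "PSAPSSSDPGVQIPKKTLGLFANALKRKDSFIQFFAMTGILLLSVRSLGQKYRIHDLQED"
    "TSALKEEHKTLIDRMKNIKRSLLHEASIEPTGLFASRLRLLFSEED"
)
subject = (
    "PSAPSSSAPEFQLPKKPLGLFANALKRKDSFIQFFAMTGIMLLSVRSLGQKYRIHDLQED"
    "TSALKQEHETLIDRMKNIKRSLLHEASLDPTGFFASRLRLLFSDED"
)

print(percent_identity(query, subject))
```

The result should match the 94/106 (88.68%) reported for this HSP.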

BLAST of Sgr017749 vs. NCBI nr
Match: XP_038897763.1 (uncharacterized protein LOC120085692 [Benincasa hispida])

HSP 1 Score: 177.6 bits (449), Expect = 1.1e-40
Identity = 93/106 (87.74%), Positives = 97/106 (91.51%), Query Frame = 0

Query: 93  PSAPSSSDPGVQIPKKTLGLFANALKRKDSFIQFFAMTGILLLSVRSLGQKYRIHDLQED 152
           PSAPSSS P  Q PKK L LFANALKRKDSFIQFFAMTGILLLS RSLGQKYRIHDLQED
Sbjct: 3   PSAPSSSAPEFQTPKKPLSLFANALKRKDSFIQFFAMTGILLLSFRSLGQKYRIHDLQED 62

Query: 153 TSALKEEHKTLIDRMKNIKRSLLHEASIEPTGLFASRLRLLFSEED 199
           T+ALK+E +TLIDRMKNIKRSLLHEAS+E TGLFASRLRLLFSEED
Sbjct: 63  TTALKQEQETLIDRMKNIKRSLLHEASLESTGLFASRLRLLFSEED 108

BLAST of Sgr017749 vs. NCBI nr
Match: XP_023535555.1 (uncharacterized protein LOC111796960 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 176.8 bits (447), Expect = 1.9e-40
Identity = 91/106 (85.85%), Positives = 99/106 (93.40%), Query Frame = 0

Query: 93  PSAPSSSDPGVQIPKKTLGLFANALKRKDSFIQFFAMTGILLLSVRSLGQKYRIHDLQED 152
           PS+PSSS P  Q+PKK L LFANALKRKDSFIQFFAMTGILLLSVRSLGQKYRIHDLQED
Sbjct: 3   PSSPSSSAPEFQMPKKPLSLFANALKRKDSFIQFFAMTGILLLSVRSLGQKYRIHDLQED 62

Query: 153 TSALKEEHKTLIDRMKNIKRSLLHEASIEPTGLFASRLRLLFSEED 199
           T+ALK+E +TLIDRMKNIKRSLLHEAS++ TGLFASRLRLLFS+ED
Sbjct: 63  TTALKQEQETLIDRMKNIKRSLLHEASLDSTGLFASRLRLLFSQED 108

BLAST of Sgr017749 vs. NCBI nr
Match: KAG6591237.1 (hypothetical protein SDJN03_13583, partial [Cucurbita argyrosperma subsp. sororia] >KAG7024121.1 hypothetical protein SDJN02_12934, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 175.6 bits (444), Expect = 4.1e-40
Identity = 90/106 (84.91%), Positives = 99/106 (93.40%), Query Frame = 0

Query: 93  PSAPSSSDPGVQIPKKTLGLFANALKRKDSFIQFFAMTGILLLSVRSLGQKYRIHDLQED 152
           PS+PSSS P  Q+PKK L LFANALKRKDSFIQFFAMTGILLLSVRSLGQKYRIHDLQED
Sbjct: 3   PSSPSSSAPEFQMPKKPLSLFANALKRKDSFIQFFAMTGILLLSVRSLGQKYRIHDLQED 62

Query: 153 TSALKEEHKTLIDRMKNIKRSLLHEASIEPTGLFASRLRLLFSEED 199
           T+ALK+E +TLIDRMKNIKRSLLHEAS++ TGLFASRLRLLF++ED
Sbjct: 63  TTALKQEQETLIDRMKNIKRSLLHEASLDSTGLFASRLRLLFTQED 108

BLAST of Sgr017749 vs. NCBI nr
Match: XP_022936518.1 (uncharacterized protein LOC111443106 [Cucurbita moschata])

HSP 1 Score: 175.6 bits (444), Expect = 4.1e-40
Identity = 90/106 (84.91%), Positives = 99/106 (93.40%), Query Frame = 0

Query: 93  PSAPSSSDPGVQIPKKTLGLFANALKRKDSFIQFFAMTGILLLSVRSLGQKYRIHDLQED 152
           PS+PSSS P  Q+PKK L LFANALKRKDSFIQFFAMTGILLLSVRSLGQKYRIHDLQED
Sbjct: 3   PSSPSSSAPEFQMPKKPLSLFANALKRKDSFIQFFAMTGILLLSVRSLGQKYRIHDLQED 62

Query: 153 TSALKEEHKTLIDRMKNIKRSLLHEASIEPTGLFASRLRLLFSEED 199
           T+ALK+E +TL+DRMKNIKRSLLHEAS++ TGLFASRLRLLFS+ED
Sbjct: 63  TTALKQEQETLMDRMKNIKRSLLHEASLDSTGLFASRLRLLFSQED 108

BLAST of Sgr017749 vs. ExPASy TrEMBL
Match: A0A6J1CMC7 (uncharacterized protein LOC111012778 OS=Momordica charantia OX=3673 GN=LOC111012778 PE=4 SV=1)

HSP 1 Score: 185.7 bits (470), Expect = 1.9e-43
Identity = 94/106 (88.68%), Positives = 101/106 (95.28%), Query Frame = 0

Query: 93  PSAPSSSDPGVQIPKKTLGLFANALKRKDSFIQFFAMTGILLLSVRSLGQKYRIHDLQED 152
           PSAPSSS P  Q+PKK LGLFANALKRKDSFIQFFAMTGI+LLSVRSLGQKYRIHDLQED
Sbjct: 3   PSAPSSSAPEFQLPKKPLGLFANALKRKDSFIQFFAMTGIMLLSVRSLGQKYRIHDLQED 62

Query: 153 TSALKEEHKTLIDRMKNIKRSLLHEASIEPTGLFASRLRLLFSEED 199
           TSALK+EH+TLIDRMKNIKRSLLHEAS++PTG FASRLRLLFS+ED
Sbjct: 63  TSALKQEHETLIDRMKNIKRSLLHEASLDPTGFFASRLRLLFSDED 108

BLAST of Sgr017749 vs. ExPASy TrEMBL
Match: A0A6J1FDG4 (uncharacterized protein LOC111443106 OS=Cucurbita moschata OX=3662 GN=LOC111443106 PE=4 SV=1)

HSP 1 Score: 175.6 bits (444), Expect = 2.0e-40
Identity = 90/106 (84.91%), Positives = 99/106 (93.40%), Query Frame = 0

Query: 93  PSAPSSSDPGVQIPKKTLGLFANALKRKDSFIQFFAMTGILLLSVRSLGQKYRIHDLQED 152
           PS+PSSS P  Q+PKK L LFANALKRKDSFIQFFAMTGILLLSVRSLGQKYRIHDLQED
Sbjct: 3   PSSPSSSAPEFQMPKKPLSLFANALKRKDSFIQFFAMTGILLLSVRSLGQKYRIHDLQED 62

Query: 153 TSALKEEHKTLIDRMKNIKRSLLHEASIEPTGLFASRLRLLFSEED 199
           T+ALK+E +TL+DRMKNIKRSLLHEAS++ TGLFASRLRLLFS+ED
Sbjct: 63  TTALKQEQETLMDRMKNIKRSLLHEASLDSTGLFASRLRLLFSQED 108

BLAST of Sgr017749 vs. ExPASy TrEMBL
Match: A0A6J1IKL0 (uncharacterized protein LOC111476471 OS=Cucurbita maxima OX=3661 GN=LOC111476471 PE=4 SV=1)

HSP 1 Score: 174.1 bits (440), Expect = 5.8e-40
Identity = 89/106 (83.96%), Positives = 99/106 (93.40%), Query Frame = 0

Query: 93  PSAPSSSDPGVQIPKKTLGLFANALKRKDSFIQFFAMTGILLLSVRSLGQKYRIHDLQED 152
           PS+PSSS P  Q+PKK L LFANALKRKDSFIQFFAM+GILLLSVRSLGQKYRIHDLQED
Sbjct: 3   PSSPSSSAPEFQMPKKPLSLFANALKRKDSFIQFFAMSGILLLSVRSLGQKYRIHDLQED 62

Query: 153 TSALKEEHKTLIDRMKNIKRSLLHEASIEPTGLFASRLRLLFSEED 199
           T+ALK+E +TLIDRMKNIKRSLLHEAS++ TGLFASRLRLLFS++D
Sbjct: 63  TTALKQEQETLIDRMKNIKRSLLHEASLDSTGLFASRLRLLFSQDD 108

BLAST of Sgr017749 vs. ExPASy TrEMBL
Match: A0A0A0L5S3 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G653420 PE=4 SV=1)

HSP 1 Score: 168.3 bits (425), Expect = 3.2e-38
Identity = 87/106 (82.08%), Positives = 96/106 (90.57%), Query Frame = 0

Query: 93  PSAPSSSDPGVQIPKKTLGLFANALKRKDSFIQFFAMTGILLLSVRSLGQKYRIHDLQED 152
           PSAPSSS P  Q+ KK LGL+ANALKRKDSFIQ  AMTGILLLS RSLGQKYRI+DLQED
Sbjct: 3   PSAPSSSTPEFQMSKKPLGLYANALKRKDSFIQLLAMTGILLLSFRSLGQKYRINDLQED 62

Query: 153 TSALKEEHKTLIDRMKNIKRSLLHEASIEPTGLFASRLRLLFSEED 199
           T+ALK+EH+TL+DRMKNIKRSLLHEAS+E TG FASRLRLLFS+ED
Sbjct: 63  TTALKQEHETLVDRMKNIKRSLLHEASLESTGHFASRLRLLFSDED 108

BLAST of Sgr017749 vs. ExPASy TrEMBL
Match: A0A2I4G353 (uncharacterized protein LOC109004280 OS=Juglans regia OX=51240 GN=LOC109004280 PE=4 SV=2)

HSP 1 Score: 163.3 bits (412), Expect = 1.0e-36
Identity = 82/105 (78.10%), Positives = 93/105 (88.57%), Query Frame = 0

Query: 94  SAPSSSDPGVQIPKKTLGLFANALKRKDSFIQFFAMTGILLLSVRSLGQKYRIHDLQEDT 153
           SAP SS PGVQ P++++GL ANA+KRKDSFIQFFAMTGILLLS+RSLGQKYRIHDLQEDT
Sbjct: 4   SAPPSSGPGVQNPRRSMGLLANAMKRKDSFIQFFAMTGILLLSLRSLGQKYRIHDLQEDT 63

Query: 154 SALKEEHKTLIDRMKNIKRSLLHEASIEPTGLFASRLRLLFSEED 199
           SAL EE  TL +RMKNIKR LLHEAS++PTG  +SRLRLL+ EED
Sbjct: 64  SALIEERDTLTERMKNIKRDLLHEASLDPTGFLSSRLRLLYGEED 108

BLAST of Sgr017749 vs. TAIR 10
Match: AT1G20430.1 (unknown protein; Has 29 Blast hits to 29 proteins in 10 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 29; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 114.8 bits (286), Expect = 8.1e-26
Identity = 61/111 (54.95%), Positives = 74/111 (66.67%), Query Frame = 0

Query: 87  SSPMTPPSAPSSSDPGVQIPKKTLGLFANALKRKDSFIQFFAMTGILLLSVRSLGQKYRI 146
           S+P       +  DP   +     G F N  K K SF QF AMTGILLLS RS+ QKYRI
Sbjct: 4   SAPQGSVDPLTGKDPAKALTAVASGFFENVKKNKQSFFQFAAMTGILLLSFRSVSQKYRI 63

Query: 147 HDLQEDTSALKEEHKTLIDRMKNIKRSLLHEASIEPTGLFASRLRLLFSEE 198
           HDL+EDT+ LK+E  +L DRM  IK  LLH+ASI+ +G+FASRLRLLF E+
Sbjct: 64  HDLEEDTAVLKKEQDSLTDRMSKIKSDLLHQASIDSSGVFASRLRLLFGED 114

The following BLAST results are available for this feature:

NCBI nr
Match Name      E-value  Identity (%)  Description
XP_022142735.1  4.0e-43  88.68         uncharacterized protein LOC111012778 [Momordica charantia] >XP_022142744.1 uncha...
XP_038897763.1  1.1e-40  87.74         uncharacterized protein LOC120085692 [Benincasa hispida]
XP_023535555.1  1.9e-40  85.85         uncharacterized protein LOC111796960 [Cucurbita pepo subsp. pepo]
KAG6591237.1    4.1e-40  84.91         hypothetical protein SDJN03_13583, partial [Cucurbita argyrosperma subsp. sorori...
XP_022936518.1  4.1e-40  84.91         uncharacterized protein LOC111443106 [Cucurbita moschata]

ExPASy TrEMBL
Match Name      E-value  Identity (%)  Description
A0A6J1CMC7      1.9e-43  88.68         uncharacterized protein LOC111012778 OS=Momordica charantia OX=3673 GN=LOC111012...
A0A6J1FDG4      2.0e-40  84.91         uncharacterized protein LOC111443106 OS=Cucurbita moschata OX=3662 GN=LOC1114431...
A0A6J1IKL0      5.8e-40  83.96         uncharacterized protein LOC111476471 OS=Cucurbita maxima OX=3661 GN=LOC111476471...
A0A0A0L5S3      3.2e-38  82.08         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G653420 PE=4 SV=1
A0A2I4G353      1.0e-36  78.10         uncharacterized protein LOC109004280 OS=Juglans regia OX=51240 GN=LOC109004280 P...

TAIR 10
Match Name      E-value  Identity (%)  Description
AT1G20430.1     8.1e-26  54.95         unknown protein; Has 29 Blast hits to 29 proteins in 10 species: Archae - 0; Bac...

InterPro
Analysis Name: InterPro Annotations of Monk fruit (Qingpiguo) v1
Date Performed: 2022-08-01
IPR Term  IPR Description   Source   Source Term  Source Description    Alignment
None      No IPR available  COILS    Coil         Coil                  coord: 146..166
None      No IPR available  PANTHER  PTHR36316    OS06G0213900 PROTEIN  coord: 91..197
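
The `coord` values above index into the protein sequence. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not say), the annotated regions can be sliced out of the protein listed under "Protein sequence". `region` is a hypothetical helper name.

```python
# Protein sequence of Sgr017749 as listed on this page (198 residues).
PROTEIN = (
    "MSTYGGTDDDIKERKTNNTIENILLIDQGTSETWRNTIGCEVNESNVWVF"
    "PTVEDLNSRLFASPSIFDREKSAPSSAPSVGSFCCYSSPMTPPSAPSSSD"
    "PGVQIPKKTLGLFANALKRKDSFIQFFAMTGILLLSVRSLGQKYRIHDLQ"
    "EDTSALKEEHKTLIDRMKNIKRSLLHEASIEPTGLFASRLRLLFSEED"
)


def region(seq: str, start: int, end: int) -> str:
    """Return seq[start..end], treating both bounds as 1-based inclusive."""
    if not 1 <= start <= end <= len(seq):
        raise ValueError("coordinates fall outside the sequence")
    return seq[start - 1:end]


coil = region(PROTEIN, 146, 166)    # COILS: Coil
panther = region(PROTEIN, 91, 197)  # PANTHER: PTHR36316
print(len(coil), coil)
print(len(panther), panther[:10] + "...")
```

Under this reading the predicted coil covers 21 residues, and the PANTHER match covers most of the C-terminal half of the protein, the same stretch that aligns in the BLAST hits.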

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name  Type
Sgr017749.1   Sgr017749.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
cellular_component  GO:0016021      integral component of membrane