Sgr023669 (gene) Monk fruit (Qingpiguo) v1

Overview
Name: Sgr023669
Type: gene
Organism: Siraitia grosvenorii (Monk fruit (Qingpiguo) v1)
Description: Unknown protein
Location: tig00000892: 5437051 .. 5437521 (+)
RNA-Seq Expression: Sgr023669
Synteny: Sgr023669
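
The Location field gives a contig, a start and end coordinate, and a strand. As a rough sketch, assuming the coordinates are 1-based and inclusive (a common convention for such records, but not stated on this page), the feature length can be derived as follows. `parse_location` is a hypothetical helper written for this illustration, not part of any database API.

```python
import re

def parse_location(text):
    """Split a 'contig: start .. end (strand)' string into its parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    contig, start, end, strand = m.groups()
    return contig, int(start), int(end), strand

contig, start, end, strand = parse_location("tig00000892: 5437051 .. 5437521 (+)")
# Assumption: 1-based, inclusive coordinates, so both endpoints are counted.
length = end - start + 1
print(contig, strand, length)
```

Under that assumption the span is 471 bp, or 157 codons, which is what a 156-residue protein plus a stop codon would require.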
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGGGAACTGCTTGAAGAGCAACAAAGTGATGGCCCAAGATGAGCCTTCGCCTTCGCCTTTGCCTCCGACAGAAACTGATAAAGTAGATAAACCAGCGGGCGGATCGGCGCTGGCGCGGCAGAAGACGGAGGAGGCGAGAAGCGCTGCGCGAGGTAAGAAGGTGGTGAGGTTTAAGCTACAAGAAAATGAAAATTCCGGCGACGGAAAAGTGATCGTCGGCAGAAGTGGCGACGGCTCCGGAGCCAGAGGCGGAGTATTGAGGATTAAAGTGGTGGTGTCTCAGAAAGAGTTGAAGCAGATATTGAAGGATAGAGACAACAATTCCTGCACCTTGGAGGAAATGTTAGCTGAATTGAAGATGAAAGGCAGGGCGACAATTTCAGATGCTAAAACCGATGAAGATGAAGACGAGGATGAAAATGGAAGCTGGAGGCCGTCTTTGGAAAGTATTCCTGAGGATCTCCATTAG

mRNA sequence

ATGGGGAACTGCTTGAAGAGCAACAAAGTGATGGCCCAAGATGAGCCTTCGCCTTCGCCTTTGCCTCCGACAGAAACTGATAAAGTAGATAAACCAGCGGGCGGATCGGCGCTGGCGCGGCAGAAGACGGAGGAGGCGAGAAGCGCTGCGCGAGGTAAGAAGGTGGTGAGGTTTAAGCTACAAGAAAATGAAAATTCCGGCGACGGAAAAGTGATCGTCGGCAGAAGTGGCGACGGCTCCGGAGCCAGAGGCGGAGTATTGAGGATTAAAGTGGTGGTGTCTCAGAAAGAGTTGAAGCAGATATTGAAGGATAGAGACAACAATTCCTGCACCTTGGAGGAAATGTTAGCTGAATTGAAGATGAAAGGCAGGGCGACAATTTCAGATGCTAAAACCGATGAAGATGAAGACGAGGATGAAAATGGAAGCTGGAGGCCGTCTTTGGAAAGTATTCCTGAGGATCTCCATTAG

Coding sequence (CDS)

ATGGGGAACTGCTTGAAGAGCAACAAAGTGATGGCCCAAGATGAGCCTTCGCCTTCGCCTTTGCCTCCGACAGAAACTGATAAAGTAGATAAACCAGCGGGCGGATCGGCGCTGGCGCGGCAGAAGACGGAGGAGGCGAGAAGCGCTGCGCGAGGTAAGAAGGTGGTGAGGTTTAAGCTACAAGAAAATGAAAATTCCGGCGACGGAAAAGTGATCGTCGGCAGAAGTGGCGACGGCTCCGGAGCCAGAGGCGGAGTATTGAGGATTAAAGTGGTGGTGTCTCAGAAAGAGTTGAAGCAGATATTGAAGGATAGAGACAACAATTCCTGCACCTTGGAGGAAATGTTAGCTGAATTGAAGATGAAAGGCAGGGCGACAATTTCAGATGCTAAAACCGATGAAGATGAAGACGAGGATGAAAATGGAAGCTGGAGGCCGTCTTTGGAAAGTATTCCTGAGGATCTCCATTAG

Protein sequence

MGNCLKSNKVMAQDEPSPSPLPPTETDKVDKPAGGSALARQKTEEARSAARGKKVVRFKLQENENSGDGKVIVGRSGDGSGARGGVLRIKVVVSQKELKQILKDRDNNSCTLEEMLAELKMKGRATISDAKTDEDEDEDENGSWRPSLESIPEDLH
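
The protein sequence above corresponds to a translation of the CDS. The sketch below illustrates that relationship, assuming the standard nuclear genetic code (the page does not say which table was used). To keep it short it translates only the first 30 and last 15 nucleotides of the CDS; `translate` is a hypothetical helper, not the annotation pipeline's own code.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a coding sequence codon by codon, stopping at the first stop."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the polypeptide
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGGGGAACTGCTTGAAGAGCAACAAAGTG"  # first 30 nt of the CDS
cds_end = "GAGGATCTCCATTAG"                   # last 15 nt of the CDS
print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to the first ten residues and the last four residues of the protein sequence listed above.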
Homology
BLAST of Sgr023669 vs. NCBI nr
Match: XP_022140639.1 (uncharacterized protein LOC111011249 [Momordica charantia])

HSP 1 Score: 172.6 bits (436), Expect = 2.8e-39
Identity = 109/160 (68.12%), Positives = 121/160 (75.62%), Query Frame = 0

Query: 1   MGNCLKSNKVMAQDE---PSP-SPLPPTETDKVDKPAGGSALARQKTEEARSAARGKKVV 60
           MGNCL++N+VMAQDE   PSP S L  T     DKPA GSALAR KTEEAR AAR KKVV
Sbjct: 43  MGNCLRNNRVMAQDEACSPSPNSSLTETNYKVEDKPAAGSALARPKTEEARIAARRKKVV 102

Query: 61  RFKLQENENSGDGKVIVGRSGDGSGARGGVLRIKVVVSQKELKQILKDRDNNSCTLEEML 120
           RF+ +E+E SG G              GGVLRIKVVVSQKELKQILKDR++NS TLEE+L
Sbjct: 103 RFQQREDEISGGG--------------GGVLRIKVVVSQKELKQILKDRESNSSTLEELL 162

Query: 121 AELKMKGRATISDAKTDEDEDEDENGSWRPSLESIPEDLH 157
           AELKMKGR TISDA+   D +EDENGSWRP+LESIPEDLH
Sbjct: 163 AELKMKGR-TISDARA--DNEEDENGSWRPALESIPEDLH 185
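
Each HSP header reports identity as matches over alignment length. As a small illustration of how the percentage follows from that ratio, the sketch below parses the field and recomputes it; `parse_hsp_stats` is a hypothetical helper and the regular expression is an assumption based only on the line format shown on this page.

```python
import re

def parse_hsp_stats(line):
    """Extract matches, alignment length, and percent from an 'Identity = a/b' field."""
    m = re.search(r"Identity\s*=\s*(\d+)/(\d+)", line)
    if m is None:
        raise ValueError("no Identity field found")
    matches, aligned = int(m.group(1)), int(m.group(2))
    return matches, aligned, 100.0 * matches / aligned

matches, aligned, pct = parse_hsp_stats("Identity = 109/160 (68.12%)")
print(matches, aligned, pct)
```

For this hit the exact ratio is 68.125%, which may explain why it appears as 68.12 in the alignment header and as 68.13 in the summary table: the two would differ only in how the final digit is rounded.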

BLAST of Sgr023669 vs. NCBI nr
Match: XP_038902397.1 (uncharacterized protein LOC120089037 [Benincasa hispida])

HSP 1 Score: 156.8 bits (395), Expect = 1.6e-34
Identity = 96/154 (62.34%), Positives = 108/154 (70.13%), Query Frame = 0

Query: 3   NCLKSNKVMAQDEPSPSPLPPTETDKV-DKPAGGSALARQKTEEARSAARGKKVVRFKLQ 62
           NC KSNKVMAQDEP    LPP E  KV +KP  GSA+A+ KT EAR+    KKVVRFKLQ
Sbjct: 4   NCFKSNKVMAQDEPE-DLLPPIEAKKVEEKPRPGSAMAKPKTAEARTGGASKKVVRFKLQ 63

Query: 63  ENE--NSGDGKVIVGRSGDGSGARGGVLRIKVVVSQKELKQILKDRDNNSCTLEEMLAEL 122
           E E  NSGD               GGVLRIKVV+SQKELKQ+LKDR+NNSCTLEE++ EL
Sbjct: 64  EEEEKNSGD---------------GGVLRIKVVMSQKELKQMLKDRENNSCTLEELITEL 123

Query: 123 KMKGRATISDAKTDEDEDEDENGSWRPSLESIPE 154
           K+KGR TISD +   D  EDENG W+P LE IPE
Sbjct: 124 KVKGRTTISDGRI--DAVEDENGRWKPDLEGIPE 139

BLAST of Sgr023669 vs. NCBI nr
Match: KAG6570883.1 (hypothetical protein SDJN03_29798, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 151.4 bits (381), Expect = 6.6e-33
Identity = 103/162 (63.58%), Positives = 119/162 (73.46%), Query Frame = 0

Query: 1   MGN-CLKSNKVMAQDEP--SPSPLPPTETDKV-DKPAGGSALARQKTEEARS-AARGKKV 60
           MGN C KSNKVMAQDE   + S  PP E  KV +KP  GSA+A+ KT E RS AA GKKV
Sbjct: 1   MGNSCFKSNKVMAQDESCLALSNSPPVEAKKVEEKPVAGSAMAKPKTAEERSGAAAGKKV 60

Query: 61  VRFKLQ-ENENSGDGKVIVGRSGDGSGARGGVLRIKVVVSQKELKQILKDRDNNSCTLEE 120
           VRFKLQ E+ENSG      G  GDG   R GVLRIKVV+SQ+ELKQILK+ +N+S +LEE
Sbjct: 61  VRFKLQEEDENSG------GSGGDGD--RAGVLRIKVVMSQRELKQILKENENSSRSLEE 120

Query: 121 MLAELKMKGRATISDAKTDEDEDEDENGSWRPSLESIPEDLH 157
           ++AE K+KGR T+SDA T  DE EDENGS RP+LE IPE LH
Sbjct: 121 LIAEFKVKGRTTVSDAIT--DEVEDENGSRRPALECIPEGLH 152

BLAST of Sgr023669 vs. NCBI nr
Match: KGN63254.1 (hypothetical protein Csa_022493 [Cucumis sativus])

HSP 1 Score: 124.0 bits (310), Expect = 1.1e-24
Identity = 87/158 (55.06%), Positives = 101/158 (63.92%), Query Frame = 0

Query: 1   MGN-CLKSNKVMAQDEPSPSPLPP---TETDKV-DKPAGGSALARQKTEEARSAARGKKV 60
           MGN C KSNKVMAQD+ S    PP    E  KV  +P  GSA+A+ K       A GKKV
Sbjct: 1   MGNICFKSNKVMAQDD-SYDDFPPHHLIEPKKVQQQPLPGSAMAKPK--NGTGGAAGKKV 60

Query: 61  VRFKLQENENSGDGKVIVGRSGDGSGARGGVLRIKVVVSQKELKQILKDRDNNSCTLEEM 120
           VRF LQE E   +G+     SGD      GVLRIKVV+SQKELKQILK R+NNSC+LEE+
Sbjct: 61  VRFNLQEEEKDEEGR----NSGDSG---PGVLRIKVVISQKELKQILKSRENNSCSLEEL 120

Query: 121 LAELKMKGRATISDAKTDEDEDEDENGSWRPSLESIPE 154
           + ELK+KGRAT   A        DE GSW+P+LE IPE
Sbjct: 121 IEELKVKGRATTVSA--------DETGSWKPALECIPE 140

BLAST of Sgr023669 vs. NCBI nr
Match: TYK24218.1 (hypothetical protein E5676_scaffold27G00200 [Cucumis melo var. makuwa])

HSP 1 Score: 122.5 bits (306), Expect = 3.3e-24
Identity = 85/163 (52.15%), Positives = 104/163 (63.80%), Query Frame = 0

Query: 1   MGN-CLKSNKVMAQDEPSPSPLPP---TETDKVDKPAGGSALARQKTEEARSAARGKKVV 60
           MGN C ++NKVMAQD+ S   LPP    E +KV++       A  K +     A GKKVV
Sbjct: 1   MGNICFRTNKVMAQDD-SYDNLPPDQFIEAEKVEEQPLRPGSAMAKPKNGTGGAAGKKVV 60

Query: 61  RFKLQENENSGDGKVIVGRSGDGSGARGGVLRIKVVVSQKELKQILKDRDNNSCTLEEML 120
           RF LQE E   + +     SGD SGA  GVLRIKVV+SQKELK+ILK+R+NNSC+LEE++
Sbjct: 61  RFNLQEEEEDQEDR----NSGDDSGA--GVLRIKVVISQKELKEILKNRENNSCSLEELI 120

Query: 121 AELKMKGRATISDAKTDEDEDEDENGSWRPSLESIPE---DLH 157
            ELK+KGRAT            DE GSW+P+LE IPE   DLH
Sbjct: 121 EELKVKGRATTV---------SDEIGSWKPALECIPEGEGDLH 147

BLAST of Sgr023669 vs. ExPASy TrEMBL
Match: A0A6J1CG85 (uncharacterized protein LOC111011249 OS=Momordica charantia OX=3673 GN=LOC111011249 PE=4 SV=1)

HSP 1 Score: 172.6 bits (436), Expect = 1.3e-39
Identity = 109/160 (68.12%), Positives = 121/160 (75.62%), Query Frame = 0

Query: 1   MGNCLKSNKVMAQDE---PSP-SPLPPTETDKVDKPAGGSALARQKTEEARSAARGKKVV 60
           MGNCL++N+VMAQDE   PSP S L  T     DKPA GSALAR KTEEAR AAR KKVV
Sbjct: 43  MGNCLRNNRVMAQDEACSPSPNSSLTETNYKVEDKPAAGSALARPKTEEARIAARRKKVV 102

Query: 61  RFKLQENENSGDGKVIVGRSGDGSGARGGVLRIKVVVSQKELKQILKDRDNNSCTLEEML 120
           RF+ +E+E SG G              GGVLRIKVVVSQKELKQILKDR++NS TLEE+L
Sbjct: 103 RFQQREDEISGGG--------------GGVLRIKVVVSQKELKQILKDRESNSSTLEELL 162

Query: 121 AELKMKGRATISDAKTDEDEDEDENGSWRPSLESIPEDLH 157
           AELKMKGR TISDA+   D +EDENGSWRP+LESIPEDLH
Sbjct: 163 AELKMKGR-TISDARA--DNEEDENGSWRPALESIPEDLH 185

BLAST of Sgr023669 vs. ExPASy TrEMBL
Match: A0A0A0LQE9 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G418890 PE=4 SV=1)

HSP 1 Score: 124.0 bits (310), Expect = 5.4e-25
Identity = 87/158 (55.06%), Positives = 101/158 (63.92%), Query Frame = 0

Query: 1   MGN-CLKSNKVMAQDEPSPSPLPP---TETDKV-DKPAGGSALARQKTEEARSAARGKKV 60
           MGN C KSNKVMAQD+ S    PP    E  KV  +P  GSA+A+ K       A GKKV
Sbjct: 1   MGNICFKSNKVMAQDD-SYDDFPPHHLIEPKKVQQQPLPGSAMAKPK--NGTGGAAGKKV 60

Query: 61  VRFKLQENENSGDGKVIVGRSGDGSGARGGVLRIKVVVSQKELKQILKDRDNNSCTLEEM 120
           VRF LQE E   +G+     SGD      GVLRIKVV+SQKELKQILK R+NNSC+LEE+
Sbjct: 61  VRFNLQEEEKDEEGR----NSGDSG---PGVLRIKVVISQKELKQILKSRENNSCSLEEL 120

Query: 121 LAELKMKGRATISDAKTDEDEDEDENGSWRPSLESIPE 154
           + ELK+KGRAT   A        DE GSW+P+LE IPE
Sbjct: 121 IEELKVKGRATTVSA--------DETGSWKPALECIPE 140

BLAST of Sgr023669 vs. ExPASy TrEMBL
Match: A0A5D3DKZ8 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold27G00200 PE=4 SV=1)

HSP 1 Score: 122.5 bits (306), Expect = 1.6e-24
Identity = 85/163 (52.15%), Positives = 104/163 (63.80%), Query Frame = 0

Query: 1   MGN-CLKSNKVMAQDEPSPSPLPP---TETDKVDKPAGGSALARQKTEEARSAARGKKVV 60
           MGN C ++NKVMAQD+ S   LPP    E +KV++       A  K +     A GKKVV
Sbjct: 1   MGNICFRTNKVMAQDD-SYDNLPPDQFIEAEKVEEQPLRPGSAMAKPKNGTGGAAGKKVV 60

Query: 61  RFKLQENENSGDGKVIVGRSGDGSGARGGVLRIKVVVSQKELKQILKDRDNNSCTLEEML 120
           RF LQE E   + +     SGD SGA  GVLRIKVV+SQKELK+ILK+R+NNSC+LEE++
Sbjct: 61  RFNLQEEEEDQEDR----NSGDDSGA--GVLRIKVVISQKELKEILKNRENNSCSLEELI 120

Query: 121 AELKMKGRATISDAKTDEDEDEDENGSWRPSLESIPE---DLH 157
            ELK+KGRAT            DE GSW+P+LE IPE   DLH
Sbjct: 121 EELKVKGRATTV---------SDEIGSWKPALECIPEGEGDLH 147

BLAST of Sgr023669 vs. ExPASy TrEMBL
Match: A0A7N2MA05 (Uncharacterized protein OS=Quercus lobata OX=97700 PE=4 SV=1)

HSP 1 Score: 87.8 bits (216), Expect = 4.3e-14
Identity = 69/155 (44.52%), Positives = 97/155 (62.58%), Query Frame = 0

Query: 1   MGNCLKSNKVMAQDEPSPSPLPPTETDKVDKPAGGSALARQKTEEARSAARGKKVVRFKL 60
           MGNCL SNK +AQ+E  P      E  +  KP+  S L   K  +     + KKVVRFKL
Sbjct: 1   MGNCL-SNKSLAQEEEVPK---EAEVVEQTKPSTASKLEPVKLVDG-GHKKKKKVVRFKL 60

Query: 61  QENENSGDGKVIVGRSGDGSGARGGVLRIKVVVSQKELKQILKDRDN-NSCTLEEMLAEL 120
           +E++ +      VG S +G  +R GV+RI+VVV+QKELKQIL  ++     ++E+++  L
Sbjct: 61  EEDDTN------VGTSSEGD-SRSGVVRIRVVVTQKELKQILDCKEGLKYSSVEQLVNAL 120

Query: 121 KMKGRATISDAKTDEDEDEDENGSWRPSLESIPED 155
            ++GR  IS+ +T  DEDE  N +WRP+LESIPED
Sbjct: 121 NLRGR-KISEVRT-SDEDEGINSNWRPALESIPED 141

BLAST of Sgr023669 vs. ExPASy TrEMBL
Match: A0A6J1B1M6 (uncharacterized protein LOC110423293 OS=Herrania umbratica OX=108875 GN=LOC110423293 PE=4 SV=1)

HSP 1 Score: 82.8 bits (203), Expect = 1.4e-12
Identity = 61/156 (39.10%), Positives = 90/156 (57.69%), Query Frame = 0

Query: 1   MGNCLKSNKVMAQ-DEPSPSPLPPTETDKVDKPAGGSALARQKTEEARSAARGKKVVRFK 60
           MGNCL SNK++AQ D+P P        ++  K         +   +     + KK+VRFK
Sbjct: 1   MGNCLTSNKIVAQNDQPEPQGCRAEVIEETGKVTASKLERAEVAADEGEKVKKKKMVRFK 60

Query: 61  LQENENSGDGKVIVGRSGDGSGARGGVLRIKVVVSQKELKQILKDR-DNNSCTLEEMLAE 120
           L E EN  DG    GR G+   ++ GV+RI++VV+QKELKQIL  R D    +LE ++  
Sbjct: 61  LNE-ENDVDG----GRQGE---SKDGVVRIRLVVTQKELKQILSSREDLKHTSLEGLIRV 120

Query: 121 LKMKGRATISDAKTDEDEDEDENGSWRPSLESIPED 155
           +K++G       +T  ++D+  +G WRP+LESIPE+
Sbjct: 121 MKLRGVRISEGGRT--NDDDGFHGGWRPALESIPEE 146

BLAST of Sgr023669 vs. TAIR 10
Match: AT3G21680.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: N-terminal protein myristoylation; LOCATED IN: cellular_component unknown; EXPRESSED IN: root, flower, stamen; EXPRESSED DURING: 4 anthesis, petal differentiation and expansion stage; Has 34 Blast hits to 34 proteins in 7 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 34; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 52.0 bits (123), Expect = 5.1e-07
Identity = 51/156 (32.69%), Positives = 80/156 (51.28%), Query Frame = 0

Query: 1   MGNCLKSNKVMA---QDEPSPSPLPPTETDKVDKPAGGSALARQKTEEARSAARGKKVVR 60
           MGNCL+ +  +A   +D+  P PL                   +  EE +++ RG+    
Sbjct: 1   MGNCLRHDNGVARKEKDDLDPEPLV------------------KLLEEGKTSFRGE---- 60

Query: 61  FKLQENENSGDGKVIVGRSGDGSGARGGVLRIKVVVSQKELKQILKDRDNNSCTLEEMLA 120
              +E+E S +                 V+RIKVVV++KEL+QIL  + N   ++++++ 
Sbjct: 61  ---EESERSTE-------------EESKVVRIKVVVTKKELRQILGHK-NGINSIQQLVH 116

Query: 121 ELKMKGRATISDAKTDEDEDEDENGSWRPSLESIPE 154
            LK  GR  IS A  +EDE E+ + +WRP+LESIPE
Sbjct: 121 VLKDSGR-NISMASYEEDEKEEGDENWRPTLESIPE 116

The following BLAST results are available for this feature:

BLAST of Sgr023669 vs. NCBI nr
Match Name      E-value  Identity  Description
XP_022140639.1  2.8e-39  68.13     uncharacterized protein LOC111011249 [Momordica charantia]
XP_038902397.1  1.6e-34  62.34     uncharacterized protein LOC120089037 [Benincasa hispida]
KAG6570883.1    6.6e-33  63.58     hypothetical protein SDJN03_29798, partial [Cucurbita argyrosperma subsp. sorori...
KGN63254.1      1.1e-24  55.06     hypothetical protein Csa_022493 [Cucumis sativus]
TYK24218.1      3.3e-24  52.15     hypothetical protein E5676_scaffold27G00200 [Cucumis melo var. makuwa]

BLAST of Sgr023669 vs. ExPASy TrEMBL
Match Name      E-value  Identity  Description
A0A6J1CG85      1.3e-39  68.13     uncharacterized protein LOC111011249 OS=Momordica charantia OX=3673 GN=LOC111011...
A0A0A0LQE9      5.4e-25  55.06     Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G418890 PE=4 SV=1
A0A5D3DKZ8      1.6e-24  52.15     Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...
A0A7N2MA05      4.3e-14  44.52     Uncharacterized protein OS=Quercus lobata OX=97700 PE=4 SV=1
A0A6J1B1M6      1.4e-12  39.10     uncharacterized protein LOC110423293 OS=Herrania umbratica OX=108875 GN=LOC11042...

BLAST of Sgr023669 vs. TAIR 10
Match Name      E-value  Identity  Description
AT3G21680.1     5.1e-07  32.69     unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: N-termin...
InterPro
Analysis Name: InterPro Annotations of Monk fruit (Qingpiguo) v1
Date Performed: 2022-08-01
IPR Term  IPR Description   Source       Source Term     Source Description                         Alignment
None      No IPR available  MOBIDB_LITE  mobidb-lite     disorder_prediction                        coord: 1..54
None      No IPR available  MOBIDB_LITE  mobidb-lite     disorder_prediction                        coord: 61..80
None      No IPR available  MOBIDB_LITE  mobidb-lite     disorder_prediction                        coord: 124..156
None      No IPR available  PANTHER      PTHR33148       PLASTID MOVEMENT IMPAIRED PROTEIN-RELATED  coord: 1..153
None      No IPR available  PANTHER      PTHR33148:SF55  OS01G0219300 PROTEIN                       coord: 1..153
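
The three mobidb-lite rows above give coordinate ranges for predicted disorder. As an illustrative sketch, assuming those coordinates are 1-based and inclusive and taking the protein length as 156 residues (counted from the protein sequence on this page), the fraction of the protein they cover can be estimated as follows. `covered_residues` is a hypothetical helper.

```python
def covered_residues(intervals):
    """Count positions covered by 1-based inclusive intervals, merging overlaps."""
    total, last_end = 0, 0
    for start, end in sorted(intervals):
        # Skip any part of this interval already counted by an earlier one.
        start = max(start, last_end + 1)
        if end >= start:
            total += end - start + 1
            last_end = end
    return total

disorder = [(1, 54), (61, 80), (124, 156)]  # mobidb-lite coordinates from the table
protein_length = 156                        # assumption: length of the listed protein
n = covered_residues(disorder)
print(n, round(100 * n / protein_length, 1))
```

On these inputs the predicted disordered regions cover 107 of the 156 residues, a bit over two thirds of the protein.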

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name  Type
Sgr023669.1   Sgr023669.1  mRNA