Sgr015638 (gene) Monk fruit (Qingpiguo) v1

Overview
NameSgr015638
Typegene
OrganismSiraitia grosvenorii (Monk fruit (Qingpiguo) v1)
Descriptionthionin-like protein 2
Locationtig00004836: 497751 .. 508881 (+)
RNA-Seq ExpressionSgr015638
SyntenySgr015638
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGAAGTCAGTTGTTGTAGTTTGCTTGGCTCTGATTCTGAGCTTATTTGCAGGCAGGTCCACAGCTGGCTCCTTCGGGGATTGCTTTGACCGTTGCTACGTCTCCTGCTTTAATGGAGCCGCACCCTGGGAATTAACCCTTTGCCCTGCAAAGTGCGTGGCAGAATGCGCCCTGCGTTATGTGGGATCCTCCCCAATGGACTTCAACCACAAGAAGGACACTCGTTACTTCTGCAACTTTGGCTGTGCCACTTCTCTCTGCACCAAATTCAGCACCAAGAAAGACCCAGGTCCGTCTTTTCGCGTTAAATTATAAGTGTGGTTTAATTCTATCTATTTAAGTCTATAAATTTTAAATAAGTTTTGAAACTTTTAAAGTTGTATTTATGTAGTCCCTGTATCTATTCAAATTCAATCTACCATGCAATCCTTAAACACCAATAAAGAAACTTTTTAAGAACAATATTTGTCTGAGCAATAGAAAATTCGTATTGGTCACCACTAAGTTTTTAACATTTGGGTGAGAAAAGACAGAGAAGAAATGAGAGAAGAGTGCCTTTTACTATTTAAAAAAAAAAAACATTCGGATGTCAAAAACTAAGTGGCGGACAACAATCACTTTTTAAAACAGAGAGACTAGATATACACAACTTTAAAAGTATTTTAGAGATTAAACTTGTAATTTAAACTGATATTTTTTATACTTTTAGACCTCTCTCTTTCTTTTTCTCTCTAGTTTTGATAGATGTTTTGGTTTTGTTTTTGATGAACGTGGCATGCAGCTGAGAAGAAAGTGAAAAGCTGTGTGGACTCGTGCTCTCGAACATGCGCAAACGCTTAAGCAGCAAAAATGAGGGGCCCGAAGAAGGAGCTGGGCTTACGTTTGGCTTAAAGAAACTATGAATATAAATAAAATGTCAGTGGTGAAACAAATCCATTTGGCTATAAGCCTACCTTATCCCAGATTTAAGTTTGATGCATGTATTAAGATATATATGTACTATATACAGGATCCTCTCTCTCTCTCTCTTCTTTTTTTTTTAATAAAAGATCATAAAGAAACACAATAGTCTAAAAAACGGGCATGAAAACTCGAGAAATACGCGAGAAGATGACGATCTTCTCTCGCACATCTTCCATCACCCATGGAGGGCATTCCTCAATCCAAGCGTTCGACTACCTGTGAGCTCTTGCGTGAGTCACCAAGATATGAGTAGCGTTTGTGATTGTTATGACTGTTAAAATAATTGTTGTCAATTATGCTTTAAAAAAAATCAGCTTTAATATGAAGAAAATTTGGTAACTTTCAGTTTTAAATTTGAAAAAACAAGTTATTGTGTTTGGTGAATTAATACTAAATTGCTGTGATTTTGTTATAATGACCAAAATAGACATAATATTTCAATTATGCATAATGTTGTGTTGATATTTAATTTCAAATTTTTTTTACAACATAATTATTTAAAATTTTATATAGTTTAAATTACAAGTTATATTTTGTTTATTCTAAATTATTTGATAAAAGATTAAGAAAAAAATTGTCAATCAAAATGTTATTTCAATTTAAATACAAAAAAGGTCAGAAAAATAAATTATATTGAATTTCATAAGCAAACAACAAATTAAATTAAAAAAATTGATGAGATGAGATAGATTGTGAATCACGATGCTTGAAAACCTTCGTTTAGATATAGGAGGCTAAATTGGAAATTTTTAAAAGTTTGTTAGCAAATGTGTAGATTTTTAAGGGGTCGTTTGATAACCATTTTGTTTTCTGTTTTTGGTTTTTGTTTTTTATTTTTAAAAAACATGGCTGTTTGATTATCAATTCTGTTTTCTGTTGTTTGTTTTTAAAAAACGTTTTTGAAATTATAAGTAAATTTCAAAAACAAAAAAAAGTAGTTTTTAAAAACAGATTTCTATTTTTCAAAATCTCATATTTTTTTCTAACACTATTTTTTTTTCTCTTTGACATATCTTTCTCTAGTAATTTTTTTCACTCTCTAAAATTTCTCTCCAAATTTTTTTCTTTCTCTCTCTAAAATTTTTATTTCTATATTACTTTATTTATTTCTTTTCTAACATATCTCTCTCACTTTTCCCTCTCTAAAATTTTAATATTTTTCGAAAACTCTCTCTCTAATTTTTTTTCCCTGTATAACTTTTTTCTCTCTCTAATTTATTTTTCTCTCTCACTCTTCCCTCTCTCTAAAATTTTAATATTTTTTCTAAACTTTCTCTCTAAAAATTCTCCCTCTATAACCTTTCTTTCTTTCTCTCATTTAATTTTCTCTCTAACTTATATATCTCTAACTTTTCTCTCTCCAAATTTTTTTCCTCTCTCTAAATTTCTCTCTTTCTAAAATTTTCATTTCTCTATTATTTTTATTTCTCGTTCTCTTCTAATATATATCTCTCTAAAATTTTAATATTTTTCTCAAACTTTCTCCCTCTAAAATTTCTCCATCTATAACTTTTCTCTCTCTCTAAAATTTTAATAATTTTCCCAAACTTTCTCTCTAAAATTTCTCTCTATAACTTTTCTTTCTCTCTTTCTTATTTATTTTTCTCTCTAACATGTATATCACTAACTTTTCTCTCTCTAAATTTCTCTCTCTAAAATTTTCATTTCTCTCTCTCTTCTAATATATCTTTCTCTAAAATTTTCATTTATCTCTTTCTCTTCTAACATATCTTTCTCTAAAATTTTAATATTTTTTCGAAACTTTTTTCTCTCGAAAATTTCTCTCTCTAATATATCTCTCTCTATCTTAAAAAAAAATCATCTCTCTCTAAAATTTCTCCTTCTATAACTTTTCTTTCTAATTTATCTTTTTTTTCTCTAACATGTCTTTTACTCTCTCTCTAACTTTTCTCTCTATAAAATTTCTCGTTTTCTCTCTCTAAAATTTATTTCTATCTCTAAACATTCTTTCTCTTTCTCTAACCAATATTCTCTCTCTAACATATCTTTCGCTCTTTAACTTTTCTCTCTCTATTATATCATTCTCTTTCTCTAACTACTATTCTCTCTCTAACATATCTCTCGCTCTTTAACTTTTCTCTCTCTATTATATCCTTCTCTTTCTCTAACTACTATTCTCTCTCTAAATCTCTCTCTTTAGTTACTCTCTAAAATTTCTCTCTCTAACTTTTCTCTCTAAACTTTTATTTTCCTGCCTCCATATATTTTTCTCTCTCTAGAACTTAAAAAACAAAAAACCAAAAACAGTTATCAAACATGTTTGGCTCTTTGTTTTTAAAAAATAGAAAACAAAAAACAGTTATCAAACATGTATGGTTTTTGTTTTTTAAAAATAGAAAATTAAAAACAAAAAACAAAAAACAGAAAACAGAAATGGTTATCAAATGGGGCCTAGGGAAGCCAATTTCAAAAGCTGTTAATTGGATTTTGCAAATCAAAGTGATTGAAAAAGTGATTCCATTTAAATTAATATATAATCAGTTTTACCAAATAGTAACGGTTAAAATTTTTCTATAATCAGGCCTTTGAAATCAATTTCAAACAGTGCCTTAGTGGGCAAAGGTATGACACTGTAGACATACGGATTCCAAGTCATAGACCATATAAGCGAGGAGGGTTAGCGATATATTCCACTAAAACCAAAACAATAATTTTAATCAACTATTTATAATTCAAGTTGGAGGAGCAACTTTGGTGTAAGATTATAGAATTAAATTGTAAATGGCTGACAATTTAAAAGATAATAATATTTTAATTGTAATGCTCTAGGTTTAGGATTTTGAATCCGAATTTAGCATCTAATGGTCCCAATATTCTTATGCATCCCCCTCACCACGAGTTATTTTAGCAAATTTTGTTTGCATTCACACACATAATTTTTACTAAAATTTCTTAAAATGTTACCCAACATGAAATTGCTTCAGATCAAGTACACTTCACTTTAGAGGTTGTAAGGAGAAATTAGGAGTGTTTAAAAAACCCGATCAAGGTTGGGTTGGGTTTAAATAAACAAAAATTTCATGAGTTAGGTTGGTTCATGGGTTCACCTAAAATAATTGGTTTGGATTGAACCAAACCAACTCGAACTTGTAATTAATTTTTAAAAATATATTTGTCTCTTAAAATAAAATTATATATGCATATATTCATTCTACTTTTCTTTCATTCAATAATTCTTTTTAGATTATTTAATCTCCAAAAACTTTTAAAAATAATTTTTTAAAGTTTTCAGAGTGTAATTTTTCGCAATATAAATTTAAATTGAGTTACAATTGTAATTCAATGTTTGAAAGTAACCAAATAGATCTCTAAGAGATTAATATTTTACATTTTTTTTCAAACAAAATTTTGAATAAATGATCCAAATAATCCGAACCAATCCAACCTAAAACTTCATTCTGAATTGGGTTATTTGGGTTGAAGAAATTTGCAACTTGAACAATTGAGTTAGATCTAAATAGTACTTTCAACTTAACCAAACTCAACCCATAGACGCCTCTAGTAAAAATGTGTCACGGCGAAGATGCTCAACTAAAGTGGTTGTTTACGAGTCAAGGGTTTAATAATTTCCTACCACTGAGTTATAATGTTCCACTCACCAAGTGGTTATTTAAGATCCTCTTTCCCAAGAACTCAAACCACACAAATTAAAACTGAGTTCAAAAGTTTGAAAAAATTCAACACGATGAAGTCAGTTGTTGTAGTTTGCTTGGTTATGACTCTGAGCTTATTTGCAGGCATGTCCACAGCCGGCCCCTTCGGGAATTGCTTTGCCCGTTGCTTCGGTCCCTGCTGTATAACACCCATGGATTCAATCTTCTGCACTGGAATGTGCATGGGACAATGCGCCCTGCCTTATGTGGGATCCTCCCCAATGGACTTGAACCACAAGAAGGACACTCGTTACTTCTGCAACCTTGGCTGTGCCACTTCTCTCTGCACCAAATTTAGCACCAAGAAAGACCCAGGTCCCATCTTTTCGCGTTAAATTATAAGTGTGGTTTAATTTTATCTATTTAAGTCTATATATTTTAAAAAATTTTAAATAAGTTTTGAAATTTTTAAAATTGTATTTATGTAGTCCCTATATCTATTCGAATTCAACCTGCCATACAATCCTTAGACACCAACAAAGAAACTTTTTAAGAACAATATTTGTTTGAGCAATAGAAATTTCATGTTGGTCACCGCTAATTTTTAACAGCTGGGTGAGAAAAGACAAGGAGAGAGTGAGAGAAAAGGGACTTTTGCTATTTTTAAAAAAAAAATATTCGGATGTCAAAAACTAAGTGGCGGACAACAATCACTTTTTAAAACAGAGAGACTAGATATACACAACTTTAAAAGTATTTCAGAGATTAAACTTGTAATTTAAACTTATATTTTTTATACTTTTAGCCCTCTCTCTTTCTTTTTCTCTCTAGCTTTGATGGATGTTTTGGTTTTGTTTTTGATGAACGTGGCATGCAGCTGAGAAGAAAGTGGAAAGCTGTGTGGACTCGTGCTCTCGAACATGCGCAAACGCTTAAGAAGCAGCAAAAATGAGGGGCCCGAAGAAGGAGCTGGGGTTACGTTTGGCTTAAAGAAACTATGAATATAAATAAAATGTCAGTGGTGAAACAAATCCATTTGGGTATAAGCCTAGCTTATCCCAGATTTAAGTTTGATGCATGTATTAAGATACATATATGTATTATATAGTTTCAAAGTTTCTGCATGCACACATCAATATGTGAAAAAAACGGTTGCTTTATCCTATATTATGCGTGTGAGAGAATTGGTAAAAGAGGTTTCGTCTATGTATTATTTTGCCTATGTTGGCTTGTCACAAATTGCCCACAATCTAACTATATATAGAGAGGATTCTCTCTCTCTCTTTTTGAATAAAAGATCATAAAGAAACACATCAGTCTAAAAACGGGTATGAAAACTTGAGGAATACGCAAGAAGATGTCGATCTTCTATCACACATCTTCCATTGCCATGGAGGGCATTTTTCAATCCAAACGTTCGACCACCTATGAGCTCTTGCGTGAGTCTCCAAGATATATGAGTAGCTTCGTTGCCAACGATACGCGAAGTACGCAAAATTTACCACAATCAAAATTCATTTATCGCTTAAAAGATGATAGTAAGAAATTTAAAATAATTTCATAATCTTATTACTCATAATATAAGAAATATTAATATGAATGTATGAATAAAAAAATTTGAACACATAAATAATTTGATATTATTGAAATGTCGATAGAAATATTAATATAGATGCAAGTCTCAAGAAAAATCATAAAAATTATAGAAATTAATTTGAATAAATTGAAAGAAATTATTCGAATAAATAATTAAACTTCAATATTATCATTTTTAATATTCATATCTATGCATAGAACTAAAATATAAAATTTCATATGATTATAAATAGGTTTAAATACTATTTTAGTCCCTATACTTTGAGCTTTTGTTCATTTTGGTCCCTATACTTTTAAAATATTCATTTTTGGTCCCTATACTTTCAGTTTTTGTCCATTTTAGTCCTTATACTTTCAAAACATCCATTTTAGTCCCTGTACTTTTAAAAAATAATCGTTTTGGTCCATATTTGATAAATTCTAATACAAATTCTATACTCAACAAAAACCTTTTAATACAAAAATATGATAATATTTCAAGAAATTTACTGAGTGTATTATTGTGCTGAAAATGTGATTGAAAAAAATGAGATAAAAAGGAAAATCAAAGGACCAAAATGGTAAATTTCTTGAAAGTATAGGGACTAAAGTGGACAAAAACTGATAGTACGAGACCAAAAATGAACATTTTGAAAGTACAAGGACCAAAATGAATAAAAGCTCAAAGTACAGGGACCAAAAATGAACATTTTGAAAGTACAAGGACTAAAAGGAACAAAAGTTCAAAGTATAGGGACTAACATAATATTTAAACCTTATAAATAATATAAAATAAAGATGTAAAAATAAAAAGTATTTTTTTTATTAAACAAATGTGGAAGTGGGGGATCGAACTTTCTACCTTAAGGAAGGTAATAGAATGCCTTATCCACTAAACTATATTCGGATGGACTAAAAAAAAAAGTATTTAATAAATACTAAACTCTTTGATGTTATGTAATAAAAAAAAGAGAAATAAATACAAAATTATAAAATAAGAAGTGATAATGTCTATACTGTATGTGAGAGCAATGGAGAGAAATAATATTATAAAATGATCTAAAGATATTAACTTGAATAATAAATTTTTAAAAAATTATTATTTTAATTAAAAGATATTAATAAAGTAGAATAAAGAATTTGAAATAAAGAATAATAGAGTGACAAAGAATAATAGATAGAGAAATAATAATATAAATAGGTAAAAAATTTCTAATAAGTTCCTAAACTTTTAGGGTTATGTCTATTTAGTTCTTATACTTTAAAAAGTGACATTTAGACTTTAAGAGTTGTATCTATTTAGTCCCTATACTTTAGAAAGATTCTAATAGGCTCTTGAACTTTCAATTTTGTGTCTAATAAGTTTTGTCATTAATTTCGTTAGTTTAATACTTACACCCCAAACGCCGCAATCTATTAGAAATCAACATACAATGTAGTTCTTAAATAACATAGCCATAAGTGCTAAAGTGATGATGAAGTTAACGACATAGATATATTAAACACAACCAAAAGTTCAGAGATATAGTAGAAACTTTTTAAAGTAGAAAGACCATATAGAATATGCCTTGAAAGTTTAAAAATTTATTAGAAAAAATTTTAAAATATAAAAGTTAAATAGATAGTAACCAAAAAGTTAAAAACGAATATTGTAACTTAATCCTAATCAGAAATGAGAGAAATAGGTGACGCGAGTGAGGGAGGTAGAGAGGAGAGAGAATATGTGTGAGGAATTAGCGAGAGAGTGGCATAGTCCCAATCTCAATCCCACATCGAAAATTTTCGTGATAGGAGGAGTCATTTGCCACTATAAACATCTCCCTTCCCGTATTGGGCTTCTATTTGAAATGTTTTGTGAGCAGATTGCAAAAATTGTGGGCTCAAGTCGACATGTCGGGAATATCATTGACATTTCGTAGATGTTGACAGTATTTTTTTTTGACATGTTGCAATCTTCATGGAAGATGTAGGTGACATTTTTTCTCTTGTTCGATGTGTTGAGTTCGCTTGATATTTTAATCATTTAAGGCATAAGAATGCAAACTTTTATTTGAGTTTAAAATTTAATTTGATAAAATTAAAAATTTTAAGATTATTTATTGATGACGGAAATTTTATTGATACAATTAGATATGATTGTCATTGAAACAGATTAATATAATGTTAATGAATAACTAGAGAATGCATGGTCAAGAAGACTATTCATTAAGTTTCTCCGTATTTCACTAGTTTGAATTTTGGTTTAAAGTAGAACCATTGACCTAACTCATTCTTCATTAAATTACCTCAATTGATTCAAAATTCTAACCATTTTCTCGATAAAGAAATAAGCTTTCTATTGCTTGGATTTCAGTCTTACATACTGTTTTTTTTATCTACTCAAAACCCTTAAATAACAAATAATGATGTCCAAAACTATAGTTTGGTTTAATGGACAAGGACATGTGAAACTCAATCTTGACCGTCTATAGAACTTCCACCAACATAGCTAGCTTTGTCTTTGAACAATTAATATTTTTCTTTGATTTTGGTGACATTCAAAGTATGTTTGAAAATGCTTCTAGAATAAGTACTTTTTTAAAAAAATACTTGAATTATAAGCACTTTAATAGAAAGTCATTTTAATGTTTGGTTCTACATTCTTAAAAGTACTTTTGAAGTACTTTCAATTAAGCACTTGGATCAAAAGCACTGTACAAGTACCTTAATTTTGACATAGTACTTTAAGTGCTTTTGACTTTTAATTCAAAGTATTTTTACATCACTCACAAACATGATAGATTTTGAAGGAGAAATGCTTTTGATCATGGTAAAAGTACTTTTGGCACGACAAAATTTAACTGATATTTGGCAAATATGTGCCTGCAAGAATTTTCTCAATGTGGTTCAATTTATTTTAATTTTATAACATAAATCAAATCAAATCAAATCAAATCAAGAATTTTTCGTTCCATTCCACTCTTAGAATGTAATACAAAATGTAGCAAGAGAATGGGAAGTTGAAAAGAAAAGAATAGACATGACTAGTCTTAGTGGGCAAAGGTATGACATGCGGATTCCTAGTCATGGACCATTCCAGCGAGGAGGGTTAGTGATATATTCTCACTAAAACCAAAACAATAATCTATTTATAATTCAAGTTGGAGGAGCAACTTTGGTGTAAAATTTTAGAATTAAATAATAAATGGCTGACAATTTAAAGATAATAATATTTTAATTGTAATGCTCTAGGCTCATAATTTTGAATATGAATTTAGCATCTAACCATCCCAATCCAATATTCATATGCATTCTCTCAATACAATATGTTTTGTTTGCATTCACACACATGATTCTTACTAACATTTCTTAAAATTTTACCCAACATAAAAATGCTCTAGACCAAGTACACTCAATTTTAGAGGTTCTTAAGAGAAATTAGGAGTGTTCAAAAAATTCGATCAATCCAATCCAACCCATACAGTATGGGTTGGGTTGGGTTGGGTTCAAATAAATAAAAAATTCATGGGTTCCACTTAAAATAAGTTTGGATTGAACCAAACCAACTCAAACTTATTATTAATTTTAAAAAATATATTTGTTTTACTTACAATAAAATTTTATATATATGCATGTATTTATTTTAATTTTCTTTCATTCAATAATTTTTTTTAGATTATTTAATCTCCAACAACTTTTAAAAATAATTTTTTAAAGTTTTTCGAGTGTAATTTTGCGCAATATAAATTTAAATTGAGTTACAATTGTAATTCAATGTTCGAAAGTAACTAAATAGATCTCTAAAGAGATTACCATTTTACTTTTTTTTTTTAAAACAAAATTTTGAATAAGTGATCCAAATAATCCGAACCAACCCAATCTCAAATTTTATTTTGGGTTGAAGGAATTTGCAACTCAAACAATTGAGTTGGATCTAAAAAGTAGTTCAACCAAACTAAACCCATCCCTGGACACCCCTAACAAAAATGTGCCACCACGACTTGAAGATGCTCAACTAAGGTGGTTGTTTATGAGTCAAGGGTTTAATAATTTCCTACCACTTATTTATAATGTTCCACTCACCAAGTGGTTATTTAAGAACCCTCTTCCTCAAGAAATCAAACCACACAAATTAAAACTGAGTTCCTGAGTTCGAAAAGAGTAGTAAAATTAAACAACATGAAGTCAGTTGTTGTAGTTTGCTTGGTTTTGATTCTGAGCTTATTTGCAGGCAGGTCCACAGCAGGCTCCTTCGGGCCTTGCTTTGGCCGTTGCTTCGCTTTCTGCTATAATACAGCCATGCAAGCAATCGACTGCTCTGCAAAGTGCGTGTCAGAATGCAGCCCTTATGTGGGATCCTCCCCAATGGACTTGAACCACGAGAAGGACACTCGTTACTTCTGCAACCTTGGCTGTGCCACTTCTCTCTGCACCAAATTCAGCACCAAGAAAGACCCAGGTCCGTCTTTTCGCGTTAAATTATAAGTGTGGTTTAATTCTATCTATTTAAGTCTATAAATTTTAAATAAGTTTTGAAACTTTTAAAGTTGTATTTATGTAGTCCCTGTATCTATTCGAATTCAATCTACCATGCAATCCTTAAATACCAATAAAGAAACTTTTTAAGAACAATATTTGTCTGAGCAATAGAAAATTCGTATTGGTCGCCACTAAGTTTTTAACATTTGGGTGAGAAAAGACAGAGAATGAATGAGAGAAGAGTGTCTTTTGCTATTTTTTTAGCTGTCAAAAACTATGTGGTGGACAACAATCACTTTTTTAAAGTACATAAATTAAATATACGCAACTTTAAAAGTTGAGAGAATAAACTTGTAATTTAGTCCGATATTTTTTTTTATACTTTTATAGTCCTCTCTCTTTCTTTTTCTCTCTAGTTTTGATAGATGTTTTGGTTTTGTTTTTGATGAACGTGGCATGCAGCTGAGAAGAAAGTGGAAAGCTGTGTGGACTCGTGCTCTCGAACATGCGCAAACGCTTAA

mRNA sequence

ATGAAGTCAGTTGTTGTAGTTTGCTTGGCTCTGATTCTGAGCTTATTTGCAGGCAGGTCCACAGCTGGCTCCTTCGGGGATTGCTTTGACCGTTGCTACGTCTCCTGCTTTAATGGAGCCGCACCCTGGGAATTAACCCTTTGCCCTGCAAAGTGCGTGGCAGAATGCGCCCTGCGTTATGTGGGATCCTCCCCAATGGACTTCAACCACAAGAAGGACACTCGTTACTTCTGCAACTTTGGCTGTGCCACTTCTCTCTGCACCAAATTCAGCACCAAGAAAGACCCAGGCATGTCCACAGCCGGCCCCTTCGGGAATTGCTTTGCCCGTTGCTTCGGTCCCTGCTGTATAACACCCATGGATTCAATCTTCTGCACTGGAATGTGCATGGGACAATGCGCCCTGCCTTATGTGGGATCCTCCCCAATGGACTTGAACCACAAGAAGGACACTCGTTACTTCTGCAACCTTGGCTGTGCCACTTCTCTCTGCACCAAATTTAGCACCAAGAAAGACCCAGCTGAGAAGAAAGTGGAAAGCTGTGTGGACTCGTGCTCTCGAACATGCGCAAACGCTTAA

Coding sequence (CDS)

ATGAAGTCAGTTGTTGTAGTTTGCTTGGCTCTGATTCTGAGCTTATTTGCAGGCAGGTCCACAGCTGGCTCCTTCGGGGATTGCTTTGACCGTTGCTACGTCTCCTGCTTTAATGGAGCCGCACCCTGGGAATTAACCCTTTGCCCTGCAAAGTGCGTGGCAGAATGCGCCCTGCGTTATGTGGGATCCTCCCCAATGGACTTCAACCACAAGAAGGACACTCGTTACTTCTGCAACTTTGGCTGTGCCACTTCTCTCTGCACCAAATTCAGCACCAAGAAAGACCCAGGCATGTCCACAGCCGGCCCCTTCGGGAATTGCTTTGCCCGTTGCTTCGGTCCCTGCTGTATAACACCCATGGATTCAATCTTCTGCACTGGAATGTGCATGGGACAATGCGCCCTGCCTTATGTGGGATCCTCCCCAATGGACTTGAACCACAAGAAGGACACTCGTTACTTCTGCAACCTTGGCTGTGCCACTTCTCTCTGCACCAAATTTAGCACCAAGAAAGACCCAGCTGAGAAGAAAGTGGAAAGCTGTGTGGACTCGTGCTCTCGAACATGCGCAAACGCTTAA

Protein sequence

MKSVVVVCLALILSLFAGRSTAGSFGDCFDRCYVSCFNGAAPWELTLCPAKCVAECALRYVGSSPMDFNHKKDTRYFCNFGCATSLCTKFSTKKDPGMSTAGPFGNCFARCFGPCCITPMDSIFCTGMCMGQCALPYVGSSPMDLNHKKDTRYFCNLGCATSLCTKFSTKKDPAEKKVESCVDSCSRTCANA
Homology
BLAST of Sgr015638 vs. NCBI nr
Match: EOY26692.1 (To encode a PR protein, Belongs to the plant thionin family with the following members:, putative [Theobroma cacao])

HSP 1 Score: 156.0 bits (393), Expect = 3.3e-34
Identity = 87/192 (45.31%), Postives = 110/192 (57.29%), Query Frame = 0

Query: 1   MKSVVVVCLALILSLFAGRSTAGSFGDCFDRCYVSCFNGAAPWELTLCPAKCVAECALRY 60
           + +V++VC  L+L    G+STA     C+  C++ C   +       C AKC+ +C L  
Sbjct: 6   VSAVLMVC--LVLGTLVGQSTAQGTILCYAACFIPCMADSTTTTF-YCAAKCLKDCIL-- 65

Query: 61  VGSSPMDFNHKKDTRYFCNFGCATSLCTKFSTKKDPGMSTA-GPFGNCFARCFGPCCITP 120
                      KDT+YFC  GCAT+LCT  STK+DPG STA G    C+A CF PC   P
Sbjct: 66  ---PKSTVGGIKDTQYFCKLGCATALCTNISTKEDPGQSTAQGTNVLCYAACFIPCMADP 125

Query: 121 MDSIF-CTGMCMGQCALPYVGSSPMDLNHKKDTRYFCNLGCATSLCTKFSTKKDPAEKKV 180
             + F CT  C+  C LP        +   KDT+YFC LGCAT+LCT  STK+DP EKKV
Sbjct: 126 NTTTFYCTIKCLKNCILP-----KSTVGGIKDTQYFCKLGCATALCTNISTKEDPGEKKV 184

Query: 181 ESCVDSCSRTCA 191
            SCVD+CS TCA
Sbjct: 186 GSCVDACSATCA 184

BLAST of Sgr015638 vs. NCBI nr
Match: KAG6575552.1 (Thionin-like protein 2, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 155.2 bits (391), Expect = 5.6e-34
Identity = 82/170 (48.24%), Postives = 102/170 (60.00%), Query Frame = 0

Query: 24  SFGDCFDRCYVSCFNGAAPWELTLCPAKCVAECALRYVGSSPMDFNHKKDTRYFCNFGCA 83
           S G C+  C ++C +G  P E  LC   C+          SPMD NH  +  YFC  GCA
Sbjct: 19  SNGYCYALCGLACLSG--PIECALCIGSCMISAQ-----DSPMDINHLNNA-YFCKLGCA 78

Query: 84  TSLCTKFSTKKDPGMSTAGPFGNCFARCFGPCCITPMDSI-FCTGMCMGQCALPYVGSSP 143
           TS CT+  +    G STA  F  C+A+CF  C ITP  ++  C   C+ +C   ++ S+P
Sbjct: 79  TSRCTRLLSNARTGRSTAS-FRKCYAKCFIACAITPGITLGTCGATCLAKCL--FIASAP 138

Query: 144 MDLNHKKDTRYFCNLGCATSLCTKFSTKKDPAEKKVESCVDSCSRTCANA 193
           MD NH  DT YFC LGCATS+CTKFSTK DP+EKKVE CVDSC+ TC  A
Sbjct: 139 MDFNH-MDTHYFCKLGCATSMCTKFSTKNDPSEKKVERCVDSCAGTCIKA 176

BLAST of Sgr015638 vs. NCBI nr
Match: KAG6575557.1 (Thionin-like protein 2, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 124.8 bits (312), Expect = 8.1e-25
Identity = 69/163 (42.33%), Postives = 85/163 (52.15%), Query Frame = 0

Query: 24  SFGDCFDRCYVSCFNGAAPWELTLCPAKCVAECALRYVGSSPMDFNHKKDTRYFCNFGCA 83
           S G C+  C ++C +G  P E  LC   C+          SPMD NH  +  YFC  GCA
Sbjct: 19  SNGYCYALCGLACLSG--PIECALCIGSCMISAQ-----DSPMDINHLNNA-YFCKLGCA 78

Query: 84  TSLCTKFSTKKDPGMSTAGPFGNCFARCFGPCCITPMDSIFCTGMCMGQCALPYVGSSPM 143
           TS CT+  +    G++     G C A C   C                     ++ S+PM
Sbjct: 79  TSRCTRLLSNARTGIT----LGTCGATCLAKCL--------------------FIASAPM 138

Query: 144 DLNHKKDTRYFCNLGCATSLCTKFSTKKDPAEKKVESCVDSCS 187
           D NH  DT YFC LGCATS+CTKFSTK DP+EKKVE CVDSC+
Sbjct: 139 DFNH-MDTHYFCKLGCATSMCTKFSTKNDPSEKKVERCVDSCA 148

BLAST of Sgr015638 vs. NCBI nr
Match: KAG6575558.1 (hypothetical protein SDJN03_26197, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 123.2 bits (308), Expect = 2.4e-24
Identity = 73/192 (38.02%), Postives = 105/192 (54.69%), Query Frame = 0

Query: 11  LILSLFAGRSTAGSFGDCFDRCYVSC--FNGAAPWELTLCPAKCVAECALR----YVGSS 70
           ++ SL    ST  SF +C+  C+V C    G A    + CP +C+  C +     +  ++
Sbjct: 15  VLSSLTTANST--SFQECYATCFVICAITPGVA---FSDCPLRCLQACIIPSFPIHNAAA 74

Query: 71  PMDFNHKKDTRYFCNFGCATSLCTKFSTKKDPGMS-----TAGPFGNCFARCFGPCCITP 130
             DF+ ++  ++FC  GCA S CTKFST ++PG+S       GP  NC+  C   C    
Sbjct: 75  DDDFHRQQKNQFFCELGCAASSCTKFSTHQNPGISEPTEYVGGP--NCYFGCINSC---- 134

Query: 131 MDSIFCTGMCMGQCALPYVGSSPMDLNHKKDTRYFCNLGCATSLCTKFSTKKDPAEKKVE 190
            +S F   +C+G+C +  V S+PM  NH  D RYFC LGC+TS C K      P EK+++
Sbjct: 135 FESAFQCAICIGKCMIS-VESTPMQANH-MDNRYFCKLGCSTSRCAKL-LLNHPNEKQMK 192

Query: 191 SCVDSCSRTCAN 192
            CV+SCS TC N
Sbjct: 195 GCVNSCSHTCTN 192

BLAST of Sgr015638 vs. NCBI nr
Match: XP_038899787.1 (thionin-like protein 2 [Benincasa hispida])

HSP 1 Score: 119.0 bits (297), Expect = 4.5e-23
Identity = 61/97 (62.89%), Postives = 69/97 (71.13%), Query Frame = 0

Query: 97  GMSTAGPFGNCFARCFGPCCITPMDSI-FCTGMCMGQCALPYVGSSPMDLNHKKDTRYFC 156
           G STA  FG C+A+CF  C ITP   I  C   C+G C   ++ SSP+D NH  DT YFC
Sbjct: 16  GRSTAS-FGKCYAKCFVVCAITPGIPIGTCGAKCLGDCL--FIASSPLDFNH-LDTHYFC 75

Query: 157 NLGCATSLCTKFSTKKDPAEKKVESCVDSCSRTCANA 193
            LGCATSLCTKFSTKKDPAEKKVESCV+SC +TC  A
Sbjct: 76  KLGCATSLCTKFSTKKDPAEKKVESCVNSCGQTCIKA 108

BLAST of Sgr015638 vs. ExPASy Swiss-Prot
Match: A8MRP4 (Thionin-like protein 2 OS=Arabidopsis thaliana OX=3702 GN=At1g12663 PE=3 SV=1)

HSP 1 Score: 56.6 bits (135), Expect = 3.6e-07
Identity = 32/93 (34.41%), Postives = 46/93 (49.46%), Query Frame = 0

Query: 103 PFGNCFARCFGPCCITPMDSIF------CTGMCMGQCALPYVGSSPMDLNHKKDTRYFCN 162
           PF  C+  C   C        +      CT  C+ Q + P V S+ +D     ++ YFC 
Sbjct: 24  PFKECYPACLVECKAGSKFPKYLKCPFTCTKECLQQPSPPSVSSNNID-----ESDYFCK 83

Query: 163 LGCATSLCTKFSTKKDPAEKKVESCVDSCSRTC 190
           LGCAT  C   S+ ++P  ++V +CVDSCS  C
Sbjct: 84  LGCATYHCVSLSSIQNPNVERVSACVDSCSNKC 111

BLAST of Sgr015638 vs. ExPASy TrEMBL
Match: A0A061GC54 (To encode a PR protein, Belongs to the plant thionin family with the following members:, putative OS=Theobroma cacao OX=3641 GN=TCM_028663 PE=4 SV=1)

HSP 1 Score: 156.0 bits (393), Expect = 1.6e-34
Identity = 87/192 (45.31%), Postives = 110/192 (57.29%), Query Frame = 0

Query: 1   MKSVVVVCLALILSLFAGRSTAGSFGDCFDRCYVSCFNGAAPWELTLCPAKCVAECALRY 60
           + +V++VC  L+L    G+STA     C+  C++ C   +       C AKC+ +C L  
Sbjct: 6   VSAVLMVC--LVLGTLVGQSTAQGTILCYAACFIPCMADSTTTTF-YCAAKCLKDCIL-- 65

Query: 61  VGSSPMDFNHKKDTRYFCNFGCATSLCTKFSTKKDPGMSTA-GPFGNCFARCFGPCCITP 120
                      KDT+YFC  GCAT+LCT  STK+DPG STA G    C+A CF PC   P
Sbjct: 66  ---PKSTVGGIKDTQYFCKLGCATALCTNISTKEDPGQSTAQGTNVLCYAACFIPCMADP 125

Query: 121 MDSIF-CTGMCMGQCALPYVGSSPMDLNHKKDTRYFCNLGCATSLCTKFSTKKDPAEKKV 180
             + F CT  C+  C LP        +   KDT+YFC LGCAT+LCT  STK+DP EKKV
Sbjct: 126 NTTTFYCTIKCLKNCILP-----KSTVGGIKDTQYFCKLGCATALCTNISTKEDPGEKKV 184

Query: 181 ESCVDSCSRTCA 191
            SCVD+CS TCA
Sbjct: 186 GSCVDACSATCA 184

BLAST of Sgr015638 vs. ExPASy TrEMBL
Match: A0A6J1KDK0 (thionin-like protein 2 OS=Cucurbita maxima OX=3661 GN=LOC111494667 PE=4 SV=1)

HSP 1 Score: 117.9 bits (294), Expect = 4.8e-23
Identity = 62/97 (63.92%), Postives = 69/97 (71.13%), Query Frame = 0

Query: 97  GMSTAGPFGNCFARCFGPCCITPMDSI-FCTGMCMGQCALPYVGSSPMDLNHKKDTRYFC 156
           G STA  FG C+A+CF  C ITP   I  C G C+  C   ++ S+P DLNH  DT YFC
Sbjct: 51  GRSTAS-FGKCYAKCFIVCAITPGVPIGTCGGKCLADCL--FLASAPRDLNH-LDTHYFC 110

Query: 157 NLGCATSLCTKFSTKKDPAEKKVESCVDSCSRTCANA 193
            LGCATSLCTKFSTK DPAEKKVESCV+SCSRTC  A
Sbjct: 111 KLGCATSLCTKFSTKTDPAEKKVESCVNSCSRTCLKA 143

BLAST of Sgr015638 vs. ExPASy TrEMBL
Match: A0A6J1DDL8 (thionin-like protein 2 OS=Momordica charantia OX=3673 GN=LOC111019402 PE=4 SV=1)

HSP 1 Score: 112.5 bits (280), Expect = 2.0e-21
Identity = 59/97 (60.82%), Postives = 66/97 (68.04%), Query Frame = 0

Query: 97  GMSTAGPFGNCFARCFGPCCITPMDSI-FCTGMCMGQCALPYVGSSPMDLNHKKDTRYFC 156
           G STA  FG C+A+CF  C ITP   +  C   C+  C L    S+  D N + DTRYFC
Sbjct: 18  GKSTAS-FGKCYAKCFVVCAITPGIPVGTCAAKCLTDC-LFRAASTTADFNDQIDTRYFC 77

Query: 157 NLGCATSLCTKFSTKKDPAEKKVESCVDSCSRTCANA 193
            LGCATSLCTKFSTKKDPAEKKV SCVDSCS+ C NA
Sbjct: 78  KLGCATSLCTKFSTKKDPAEKKVGSCVDSCSQKCINA 112

BLAST of Sgr015638 vs. ExPASy TrEMBL
Match: A0A0A0KC18 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G446640 PE=4 SV=1)

HSP 1 Score: 102.1 bits (253), Expect = 2.7e-18
Identity = 56/98 (57.14%), Postives = 68/98 (69.39%), Query Frame = 0

Query: 1  MKSVVVVCLALILSLFAGRSTAGSFGDCFDRCYVSC-FNGAAPWELTLCPAKCVAECALR 60
          MKSVV++C   ILSL AGRSTA SFG C+ +C++ C      P  +  C AKC+A+C   
Sbjct: 1  MKSVVLIC--FILSLVAGRSTA-SFGKCYAKCFIVCAITPGIP--VGTCGAKCLADCL-- 60

Query: 61 YVGSSPMDFNHKKDTRYFCNFGCATSLCTKFSTKKDPG 98
          ++ SSPMD N+  DT YFC  GCATS CTKFSTKKDPG
Sbjct: 61 FIASSPMDLNY-MDTHYFCKLGCATSRCTKFSTKKDPG 90

BLAST of Sgr015638 vs. ExPASy TrEMBL
Match: A0A2U1N652 (Uncharacterized protein OS=Artemisia annua OX=35608 GN=CTI12_AA300800 PE=4 SV=1)

HSP 1 Score: 95.9 bits (237), Expect = 2.0e-16
Identity = 47/122 (38.52%), Postives = 63/122 (51.64%), Query Frame = 0

Query: 77  FCNFGCATSLCTKFSTKKDPGMSTAG--------PFGNCFARCFGPCCITPMDSIFCTGM 136
           FC  GCA SLC    T+++PG    G        PF +C+ RCF  C I P ++  CT  
Sbjct: 12  FCKLGCANSLCANIGTRENPGQGANGSSAAPTPVPFTDCYGRCFFFCIIVPTNACSCTST 71

Query: 137 CMGQCALPYVGSSPMDLNHKKDTRYFCNLGCATSLCTKFSTKKDPAEKKVESCVDSCSRT 191
           C+ +C L     + M L+       FC LGCA SLC    T+++P E  +  CVDSCS  
Sbjct: 72  CLKKC-LDTPPMTTMALDDHSQNLGFCKLGCANSLCANIGTRENPDEHGMGRCVDSCSNK 131

BLAST of Sgr015638 vs. TAIR 10
Match: AT1G12663.1 (Predicted to encode a PR (pathogenesis-related) protein. Belongs to the plant thionin (PR-13) family with the following members: At1g66100, At5g36910, At1g72260, At2g15010, At1g12663, At1g12660. )

HSP 1 Score: 56.6 bits (135), Expect = 2.5e-08
Identity = 32/93 (34.41%), Postives = 46/93 (49.46%), Query Frame = 0

Query: 103 PFGNCFARCFGPCCITPMDSIF------CTGMCMGQCALPYVGSSPMDLNHKKDTRYFCN 162
           PF  C+  C   C        +      CT  C+ Q + P V S+ +D     ++ YFC 
Sbjct: 24  PFKECYPACLVECKAGSKFPKYLKCPFTCTKECLQQPSPPSVSSNNID-----ESDYFCK 83

Query: 163 LGCATSLCTKFSTKKDPAEKKVESCVDSCSRTC 190
           LGCAT  C   S+ ++P  ++V +CVDSCS  C
Sbjct: 84  LGCATYHCVSLSSIQNPNVERVSACVDSCSNKC 111

BLAST of Sgr015638 vs. TAIR 10
Match: AT1G12672.2 (unknown protein; LOCATED IN: endomembrane system; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G12663.1). )

HSP 1 Score: 54.3 bits (129), Expect = 1.3e-07
Identity = 31/74 (41.89%), Postives = 38/74 (51.35%), Query Frame = 0

Query: 119 PMDSIFCTGMCMGQCALPYVGSSPMD-LNHKKDTRYFCNLGCATSLCTKFSTKKDPAEKK 178
           P  S F T  C+  C  P   SS MD  N      Y+C LGC+T  C   S+ ++P   K
Sbjct: 31  PQPSPFKTFSCIKTCLEP--PSSQMDSTNEINHNDYYCKLGCSTHHCASLSSIQNPNVDK 90

Query: 179 VESCVDSCSRTCAN 192
           V  CVDSCS  C+N
Sbjct: 91  VVDCVDSCSDKCSN 102

BLAST of Sgr015638 vs. TAIR 10
Match: AT1G12672.1 (unknown protein; LOCATED IN: endomembrane system; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G12663.1); Has 4 Blast hits to 4 proteins in 1 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 4; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 48.1 bits (113), Expect = 9.0e-06
Identity = 22/51 (43.14%), Postives = 30/51 (58.82%), Query Frame = 0

Query: 141 SPMDLNHKKDTRYFCNLGCATSLCTKFSTKKDPAEKKVESCVDSCSRTCAN 192
           S  ++NH     Y+C LGC+T  C   S+ ++P   KV  CVDSCS  C+N
Sbjct: 17  STNEINHND---YYCKLGCSTHHCASLSSIQNPNVDKVVDCVDSCSDKCSN 64

BLAST of Sgr015638 vs. TAIR 10
Match: AT1G12660.1 (Predicted to encode a PR (pathogenesis-related) protein. Belongs to the plant thionin (PR-13) family with the following members: At1g66100, At5g36910, At1g72260, At2g15010, At1g12663, At1g12660. )

HSP 1 Score: 42.7 bits (99), Expect = 3.8e-04
Identity = 22/71 (30.99%), Postives = 32/71 (45.07%), Query Frame = 0

Query: 104 FGNCFARCFGPCCI--TPMDSIFCTGMCMGQCALPYVGSSPMDLNHKKDTRYFCNLGCAT 163
           F  C+  C   C +   P+  +FC  +C+  C    + S   +LN    T  +C LGCAT
Sbjct: 29  FKLCYGGCLVACALIAPPIKKLFCPFLCIKDCKRRPMLSFEANLNEIDQTGSYCELGCAT 88

Query: 164 SLCTKFSTKKD 173
             C   S+  D
Sbjct: 89  DRCVSSSSIDD 99

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
EOY26692.13.3e-3445.31To encode a PR protein, Belongs to the plant thionin family with the following m... [more]
KAG6575552.15.6e-3448.24Thionin-like protein 2, partial [Cucurbita argyrosperma subsp. sororia][more]
KAG6575557.18.1e-2542.33Thionin-like protein 2, partial [Cucurbita argyrosperma subsp. sororia][more]
KAG6575558.12.4e-2438.02hypothetical protein SDJN03_26197, partial [Cucurbita argyrosperma subsp. sorori... [more]
XP_038899787.14.5e-2362.89thionin-like protein 2 [Benincasa hispida][more]
Match NameE-valueIdentityDescription
A8MRP43.6e-0734.41Thionin-like protein 2 OS=Arabidopsis thaliana OX=3702 GN=At1g12663 PE=3 SV=1[more]
Match NameE-valueIdentityDescription
A0A061GC541.6e-3445.31To encode a PR protein, Belongs to the plant thionin family with the following m... [more]
A0A6J1KDK04.8e-2363.92thionin-like protein 2 OS=Cucurbita maxima OX=3661 GN=LOC111494667 PE=4 SV=1[more]
A0A6J1DDL82.0e-2160.82thionin-like protein 2 OS=Momordica charantia OX=3673 GN=LOC111019402 PE=4 SV=1[more]
A0A0A0KC182.7e-1857.14Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G446640 PE=4 SV=1[more]
A0A2U1N6522.0e-1638.52Uncharacterized protein OS=Artemisia annua OX=35608 GN=CTI12_AA300800 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT1G12663.12.5e-0834.41Predicted to encode a PR (pathogenesis-related) protein. Belongs to the plant t... [more]
AT1G12672.21.3e-0741.89unknown protein; LOCATED IN: endomembrane system; BEST Arabidopsis thaliana prot... [more]
AT1G12672.19.0e-0643.14unknown protein; LOCATED IN: endomembrane system; BEST Arabidopsis thaliana prot... [more]
AT1G12660.13.8e-0430.99Predicted to encode a PR (pathogenesis-related) protein. Belongs to the plant t... [more]
InterPro
Analysis Name: InterPro Annotations of Monk fruit (Qingpiguo) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availablePANTHERPTHR36312:SF1THIONIN-LIKE PROTEIN 1coord: 99..191
NoneNo IPR availablePANTHERPTHR36312:SF1THIONIN-LIKE PROTEIN 1coord: 2..98
IPR038975Thionin-like proteinPANTHERPTHR36312THIONIN-LIKE PROTEIN 1coord: 99..191
IPR038975Thionin-like proteinPANTHERPTHR36312THIONIN-LIKE PROTEIN 1coord: 2..98

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Sgr015638.1Sgr015638.1mRNA