Sgr020012 (gene) Monk fruit (Qingpiguo) v1

Overview
Name: Sgr020012
Type: gene
Organism: Siraitia grosvenorii (Monk fruit (Qingpiguo) v1)
Description: SM-ATX domain-containing protein
Location: tig00153446: 895059 .. 908061 (-)
RNA-Seq Expression: Sgr020012
Synteny: Sgr020012
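The Location field packs the contig name, start, end, and strand into one string. A minimal sketch of how it could be parsed is given here, assuming the coordinates are 1-based and inclusive (a common convention for this kind of display, but not stated on this page). The names `Location`, `parse_location`, and `span_length` are hypothetical helpers introduced only for illustration.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    contig: str
    start: int
    end: int
    strand: str


def parse_location(text: str) -> Location:
    """Parse a string such as 'tig00153446: 895059 .. 908061 (-)'."""
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    contig, start, end, strand = match.groups()
    return Location(contig, int(start), int(end), strand)


def span_length(loc: Location) -> int:
    """Length of the span, assuming 1-based inclusive coordinates."""
    return loc.end - loc.start + 1


loc = parse_location("tig00153446: 895059 .. 908061 (-)")
print(loc.contig, loc.strand, span_length(loc))
```

Under that assumption the feature spans 13003 positions on the minus strand of contig tig00153446.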
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGATCCCAACTTGCAGCTTCATGCTTCTGACAATGATTATTTGCAGGATCCTTCTGTTTACAGGAGATTAATTGGCAGATTATTGTATTTGACCATTTCTCGTCCTGATATAACTTTTTACAAGTTGAGCCAATTTGTGTCCAAGCCGTGCAAGTCCCACCTATCTGCTGCCCACCATTTATTGCGATATTTAAAGGCTTCGCCAGGACAAGGTGTTTTCCTCCCAGCTTCTTCTTCCTTCCAGGTTCGGGCTTTTTTTGATTCAGATTGGGCATCATGTTTAGATACACGGAAGTTGACCACTGACTTTTGTATTTTCTTGGGTGACTCAATGGTTTCTTGGAAATCCAAGAAACAGTCCACTGTATCTCGGTCTTCTACAGAGGCTGAATATCGTGCATTGGCTATGACTACTTGTGAAATTGTTTGGCTTTCTCATCTCTTATGTGATTTGCAGGTTCTCTTTACTCCTCCGGCTCTTTTGTTTTGTGACAACCACGCGGCGGTTCATATTGCTTCTAATCCAACCTTTCATGAGCGCACAAAACATATAGAGTTGGATTGTCATTTTATTTGGTACAAGTTTGTGACTGGTTTTATCAAACTTCTTCATATTCGCTCCCATATGCAGTTGGCTAATATCTTCACCAAAGCGGTTGCTGCTCCCGTTTTGTTTCCACTGCTTTCCAAGATGGGTATTCTTGATATATTTAGCCCATCTTGAGGGGGAGTTTTAGGAATATTGTTAACCCAACTAATTGGTTAGATTATTAGGAATATTGTTATCCTAACAAACTGGTTAGATTATTAGGAATATTGTTATCCTTCTATTACACGTGTATATATAATTTATTCATTCTTGGAACCAGTTTGTTAGAATAACAGTATTCCTAATAATCTAACCAATTAGTTGGGTTAACAATATTCCTAATAACTGACAAAGGGAACCTTGAAGATAAACAACTCATGGCTAAGCGTGCTTCACTTTCTATTAAGCATTTCTTAATCATGTTCTCTCATATTTGTATAATCTTAGGATCTCTTATTTCGATGTAGTGTTGATTCATTAATGTTTATTTTTGCTCGGGTGTCACAAATACCATGTTGACGATGATAGTTCAAACCTTTAGACTTTTTTTCCCTGTTTTGCAAGCCATAAGATTGATCAAAATAGATTTTGGCCGTACACAATACTAAGGAACTCTTTGGGTGTATTTAAAAATGTTGCTTTCAGAAACTATTTTGTAAACTGTGTTTTTTTTAAGAAAAATTATTTTATACTGTTTTTAAAATTGAAAACTATTTGTGAATTTTTATTTATTTCGGGAAATATTAATTTAAAAATATAACATATTTAAATAAGATCATACTTAAAAATAATTGATAATATATATGGACAGTTAACGTAGAAGGATAAATTTTGGTAAATTTAAAATTGACAAGTATTTTTAAATTAATGGTTAAATAACAAATTTGTTCTCTATGATTTGGGCATAATTTTCATATGGTCCGGTTTTAAAAGTTTTAATTTAGTCTTTATGAATTTAAAAGTTTCAATTTGGTCTTTATGATTGGGCCAAACCTCATAAATCATCATTGGCGTTATCATACTAATACATTTTTTTATGTGACAATAAACTAATGTTTTCTATTTGATTAACTGTGTTAGTTGATTATATCAGGGTTTAGGGTTGAGTAAAAAATAGAAGGAAATAAGTGAATTTTTTGGGTGGGTAAAAGATAGTGTAGAGATGATTTGTGAGGTTTAACCAAACTATAGGGGCTAAATTGAAACTTTTAAAATCATAAGAACTAAATTAAAACTAAACTAAAACAATAGGGACTAAATCAAAATTTTTAAAATCATAGGGACCAAATTGAAACTATGTCCAAATCAAAGGGACCAACTTTGTAATTTAACCTAATTTAATTAAATTAAAGGTTAAATTGTAAATTTAGTCAATGCTTGTGTCAAAATAGGTTCATGAACTTTAAAAA
ATGTATAATAGGTCCCTAAATTTTAAAAATATCTAATAGGTGTCTGATCTTTCAATTTTGTGTCTATTGACTTATTAGACTTTTTTAAAATTAACGGACCTATTATATACAAAAGTCAATTTTTTGTCTAATAAGTCCTTAGCTTTTCAATATTGTGTCTAAAAAGTTTGTAAATTTTAAAAAATGTCTAATAATTAGTGATTTATTAGATACAAAATTGAAAGTTTAGGAACTTATTAGATATAAAATTAAAAGTATATGGACTTATTTAACACAATTCTGAAAGTTCAGAGACTAAATTTGTAATTTAAAATAAATTAAAAAGATATAAAAAAAATGGCTAATTAAATAATGAATTTCGCGGGAGAGGGGTAAAAATGAATTGCCGAACATTACGGTTGTGGGAGAATGTGAATAGGGGAAGAAATGAGGGGTAGGCGTGCAAAAAAAAGAAAATATAAATTAGGTTCTAGAAATCGAAGACAGTGAAGCACGGTTACAGAGCATTACGCATAGCACGTAACGCAACACCAAAAACGTTCTTATTTCCCTTTCGTTTTCCATTTCGTCTCGTCTTCCCCATTAGATCTCGTTTCCCTTCTGCAACACCAACTTTTCGACACTCAGAGAAACACCACCTCCACCGGCGGATCCATCAGCTAACACCCACACAGAAATTGGCATATGGGTTGCAGAAACAGGGAGTTTTCCGAAGATGACACTTCCTCTTCTACGCTTAGCGAGGCTTTGCTCTTTGCCACCATGTGCCTCATTGGCCTCCCAGTTGAGGTTCACGTTAAAGATGGCTCTGTCTATTGCGGCATCTTTCACACTGCCTGTGTGGAGAATGAATATGGTGCGCTTTCCATAACCTTCTTCACCTTTGCTTCCTCCAATCATTTCCGGAGATTCTATTGATTGCTAGAATTCTACTTTTTGTATTCATTCTTATGTGAATTTGTGCCTTGGTTTCGCCCGTGAGCGAGTGGTTTTGATTTTATCGTTTACGTATGGCGGTCTGGTGTGTAGATGATTTGGATCCGAAATTTGATCTGGGCATGGATGAAAAGATTGGCATCTAATTTTTTTTTTCATCTCATTCTCTTGGAAATGAGACGTTCCTATCTCTCTTAGTAAATGTTTACAAATTATCGTTAAGTTGAATTTGTAGATTTTGACGTTTCTTTTGATGTCTTACGATTTTATCTTTTAATGTTTTAATGGAAACCGCCATGATCCCCCAACCCCTTCTTTTTCTTATACATCGAGAAACGGTGATAGTCATGAAATCAATCTGTGTATCACTAGGTGTTGTTCTGAAGAAAGCAAGGATGACAAAAAAGGGTAAAAGGAATGTGAATGTGGACGATGGAGTTGTAATCGATACTCTTATTGTTCTTTCCGGTGATCTTGTCCAAGTTGTTGCGACGGTATGTAATTAAGGTCACTCATTTTGCTTCTCCTAGTTCTCATGTTATTTAATTTAATTTAATTTGAATCTAGGTCATATTAAGTTCTGGGTTATTTCAAATGTACAGGAAGTTCTACTTCCGGCTGGTAGTTTTTCCAAAAGTTTGGCTGGTTATGATAATGAAGCCATGGCCAATGTTCCTATTTCATTGCTTCCAGCTTCAGAGACTAAGACATGTATGGAGTCATTCAAGGAGGGGAGTCAGATGAATCAAACAAGGTATACTCCCATTTTGACATTGTATATTTATATGGATAGAGCAATCTGGTCCAAGTTTTGGAGTCTGATATTCTTTTGTCAGTAGCATTTGGACCGATGTTATTAGTATTATCTGTTTCTCACCAAGCATCCTTGTCATTGGAAGATAGTTTCCAAGATCTCTATTTGTACAATATAACCTTATATATTACGTGACAGAAAATTGAGAATAAGAAAGTAACTTACCAAAAAAAAATTGTAATTGATGATAAGGAATAGAGTCAATTCAGACTCTCATTTAAAATTAGAACTGGCTTTGTCCTAAAAGAAG
TGGCTGTCTGTGGTTTAGAAGGTTATGTTGTCTTCAATATGACCTTTTTTCTTCTTGATTTTACTTGGAAAGAATATTCTTCTCTCTCCTAATTGGTTTTACACATATACTAGATAGATCTTTGCAATACTAAATCAGTTGCATGAAAGTAGCTTCAGGAAATTATTATTTATTGGCATGTTAAACGAGTACTGTGCATCATGTAGAAAGCCATAATTTCTTGCTTCAAATCCATATTTCAAGATGGGAGAAGGTTTCAATTATATTATATTGTAACTTATGTGCTTAGTTTGGTATTGGTATAATGGATGTAACTACCAACACACATTTCAATTGTAAAAGCAGCGACTTGGTCCAAGATCAGAATGGGTTTGCTCATGGTTCAGTGCCTACGATAACTGGGAAGCATGGTGATGTTAGACAGCTTTTGCGAGATAATGCTGAGAACAACCAGGGAGATGCACAGCAGAAAAGGGAAAGGATCAATTGCAAAAAGGTGAACTTTTTATCCCTTGGTTTCATAATAACACACTCACACACAGAATATGCTCCTGAAAATATAAGTATTGAAGGGAGCGATTACCTATTAGAAGGATAGTTAAGAATGGCTCATGTATTGGTTAACATTGCCTCCTCAGAGACATCTACAGTTAGTACTTGATTAGTTGCATCCACTTAACAAGCCTACACTGCTAACTGTTTGTGTGAATTATTTCGTAGGCATGTTATTTAAATTAAGCATGTAAGTCTTTGTTTTGAAGTTTATCATTTTCCTATTCATAATTTCTTTGCCTTCTCCCCCACCTCCTTCTTGCAGCCTGAAGGTGTCACTGATGCTGCAATCAATTGGAGGTAGGAAATAACTTCTCTTTTTACTAAATATATTTCTTGTTTATGGTTGTCCAGCAAACAATTGGGGCTCATTCCATGATCATGCCACCAAAAGTAGATGTGGTTCATAAACTATCTTCAGTTCTTTTATTTTTATTATTATTATTTTTGGTACGAAACAATTGCATACCAAAAAGGGAACATGCTGTGTGGCAAATGCCTCTCTTACTTTATTTGAGATTGATATATCTGTTGCTTTGATTTTATTTTTCAACTAATTACTCTTGTCTCAGTCAAGTGCTCCTTATCATTGGGTTTATGTTAAATTATTTGACAGACAGGACCCAGATAACCAATTAAAAAGGGAGCAGGATGATCATGGTCAGGAATTTGACATTCATAAAAGAGTCAATGTATGTCAAGTAGGCTTATTCAGGACCCTGGTTTTCTTGAGAATTAGATGTTATTCATTTGGGTGGACTAAATTTAAGACTAACTACTTTTATTTCTATAGGTTGATCGAGTTCAATCCTCTATATCGAGTGGTGAGTTCGAAGTGCTGTTAATGATATCTGGCTCATCTATGATTTGGTTATAAAGTCTTATACAAGTGGGTTTTCTTAATATAGAAACTAAATATACTTGCAAATGGTTTTTTATTGGAAATGTTGATTTTGCAATTGACATTACATCATTGTAGTTTTTAGTTGCAACTTACCTATGATATTAAATAGTCAATGAACTTTTTGACATCAAGTGTTGAAGGATTAGGTAGTTGTCTCTTGAGATTAGTTGGTTTAGACACCTATGATTTTTCTATATAGTTTTCAAATACCCTCCATGAAAGTGGACACGTCTTTGTAATGTTTATTATTCTTTGCCTACTTGAATTTTAAAATTCTTAATATGCAGTGGAATTTCAGCTTAATCAAGTAAATTCTGACAAGAAAAAGAATGTAATAGTGGCATAAAAAAAAATTGCTGGGAGTATCTAATGGTCCTGCTCCTGCTCTTGTTAAAATAGAGAAACCATGCATTGAAAGACCCACCTCAGCCAACACTTCTCTGAATGCATATTCAGTTTGTGTCTCAACTTGTTCACTTTCGTCCGTAGATAGTAGCATGGACTCGTGTCATAGTTCTATAACATCAACGACCGACTTGGCCCC
TTCTCATGTTTCAGAATCCAATAAAAGTGCCAAGGTACATATCATTATCAATATATCACTATCACTACTATATTTGAGAGATAGGATGTTAATGAAACATTTTTGGATACTTTATTTGGATGATGACAAAATATACCGTCATCCATTTTACTTTGTCAATTTTATGTTACAGATTATATGCCAGTTGAAGCTAAAGTACTTTTTTTCTTTGCGCCTGTTTTTTGTTTTTTTATTTTATAAGAAACAAAATCTTATGGAATAGATGAAAAGCAAGCCAAAAGCCTACAAAAGTTAAAAGGAGTTACAAGAGCACTCTCCAATTGGCACAAATCAAAGAAAATCTGTAGTTGCAAAAGTCGTTATAGAAAATACACCTAGTAGAAGAAACAAAAACAGCATAAATATCCATCCACTTTTCTTTACCTAGGAAGACTCTCATATTCTTTTCTAAAGAAACCTTCCGATGGAATGGATCTTATGACATTTAGCCACAAAAGTTTGGTTTTTTTTTTAAAGTAATGGAAACACAAAAAAGAGTACTGCAAGTTCCTCCTGATCCCTGTAGGCAATGGTAAGACCAATTGAAAAGTTTGGAGAATTTCCTCCACAATTTCATGCTTAAAGGTACCAAAGGAAGAGGTGGTGTGGTCTGCCAATTCATTTCTCCTTTAACACACCAACTTGTAGAAAGACATAAGATTTTTTTATCGCATCTTTATTTGGGTATTCAGACCTCCCAGAAGATGAGACCCACAAGCATCTTGATCTTACACGGACTTGAGAAAAAAAGAAGTCGGCACTAAAATTGACTTCGAAGGGCTTTCATTGTAGACAAGGAACACCAGCTTGAGCACCAACCCTTAGCCATACTTACTGACTATGATTCATGCTACGTAGCACGCCCCCAAAGAAAATCCCCAATTGGCGAGAAGGGCAAAATTCCTATTTTTTTCCATTACCTATTCCAGCCACCTTTTGATAGTGGAAAGCCAAATACCCCAAGAATATCAAGTTCTTCTGCTGGGAATTAAGCCACAGCTCCATTAATACACATGAGATAATCCAGAAGAAATTTCCAAGATTGGCCATCTCTCCAACCGCTGTATCATGTGTATGAAAAGTTCAGAATCTCAAGGGCACCTTTCATTTCCTGTGAGTTTGCTAGAGAATTCTGGAATCAGCTTCTTGAAGCTTTTGGATGGCAGACTCCTCTCCACACGAATTTAAGCTCTTTTCTCCATTCTTTCATGGTTGGCTACCCGTTCAAAAGAGGGAAAGAAATTCTGTGGTTGCAGCTTATCAGAGCCTACTTTTGGTGTCTTTGGATGGAGAGAGACAACAGAATCTTCAGTAACAAATCTCTCCTTTTTGATAGATTTCTTGATAATATTATGGTTAGGCTCTTGGTTGGTGTAAATGCTTTTTCTTCCCTTTCACCGAGTATAGTTTAAATGATCTTTTGAAGAACTGGAAGGCCTTTTTGTAATCCAGTTGTTCGTGCCCTTTCCCTGTTGTATATTTCATCTTATCAATGAAATATCTCTGTTTCTTAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGATAGTGGAAGTGAAACTGTTTTCCAATTAATCAATTGAGGAGCCACCTTCCAATTTTGGCTCTTCCCATAAAAAAATCCCCAAAGTTTTCTCTATCTCCACTGTGACACTTCTGGGAGCGTGGAAGACAGAGAACTGGAGAAGTAGTTGGACAGATTATAAATAATGAAGTTGATGAAGGTTTATTCACCCTTTGAGATGAAAACATGCTTCCACTTGTCCAACTTGTGAGAGATCCTTTCCAAATCTGGTGCCCAAAAAACCTTAGAACTTGGGTGGAGACCCTCGTCTTCAATAAGGAAGAGGCTACTCATTGACTTTACACCCAAAACCTTCAGCCACTTCTTTTCACCTTTGAGGTAGTACAATTAATTCCAACTCCGATAATCACTTTCTTGACGATGAGAATAATTTTGAAG
AGATTGCTAGTCATTGTAGGCCAAGAGATAGGGAGGATAATGAGGTATGGACCTTGGAAGAATATGGCCATTTCTCAATCAGATCACCTGTTAGAAAATTAACGAAAAAGGAATATATTCTAGAGGAAATATTGATTAAGCTGGTGTGGAAAGGTAAGGTACCAAAGGTGAAATGATTTGTGGACTTGTTATTTGAATCATTAAATGCCTCAGATAAGTAACAGAATAAGTACCCAAATTGGCTCTTTCTTTAAACTGTTGTGTATCTTGTGAAGCCAAGTAAGAAAGTCAACATGTCTTGTTCCAGTGTGAGTTTAATACAAAATTCTGGTATTCAATTCTGAATCTCCTAATTGATGGATGGGTTTTTCTCAAATCCATAAAGGAATTTGCAGCAAATGGCAGGGGTTTTGGGGAAGGAGAACGAGTTGAAAATTGCTTCGGAAAACCTATTAAAAGCAATGCTATGGAAATATGTTTGGAAAGGAATTACATAATTTTTGTAACCGAGGAAAGAAAATTTTTGGATAATAAACAATTTCATTTGATAAAATGAAATTACAAAAAAAGGGAATAATAACCCCATCTAAAGGAATTTACAAAAAGCTCCTCCAATTTGCTAAAAGAGTGCTTAAACTATAATCAATGAAAGGGGGAACAGCTTTACACCAAGATAAGGCTAGTAAAATAACAGAATCAAAAAGCAAGTCAGCATCCCTCTCCTAATCTTGGAAGATCCTATGGTTTCCTTCATAGGATTCGAGGAAAGAAAATGGCAGAACGTAGTTGAAATGGCTTGATTTAAAGCTTCTCTTTGATGCTCTTACATAAATTATTTAGAATTTCTGTAATTAACTCTAATTATGGAGCGTTTTTTGTTTTCATGGGGTTCTACACTTTCGTAAACTTTTCTTGTTTCTGTTTTTTTTTAAAAAAAGAATATCTACAATGAGAAATCCTGCAAATTTTTAGGTTTGCACCAGTGAAGACCAAAACACTTGAAGAACTCAATCCCAAAACTTCCCTCAAAAACTGCATGTCAAAAATAAGGCCTCCGTTTTCAGTACCCTTCGTACAAAGAATGCACTATTGGGGAGAAGGCCAAGTGAGGGGGCTGTATGAGAATTCTATTATGTCTATTAATTCAATGGAGAGAAACAAACCAAATGAAGAATCTGATCTCCTTAGAAACTTTTAGTCTTCAAATACCTTTGGCCAGATGTGAAGGAAAAATCCTTTACTAATAATGCAGAAAGAAATTCCTGCAGAGTCAACTCCTCTTTTGTCTGAATCCACACTCTGTCATAATCACTTTGGGTAGGAGGAAAACCATGAAGAAGATTGACAAGGGTCAACTGTTCCTCAACCCCCCCCCCCCCCCCCCATTATTTAAATTCCTGCTAAATTTTTAGTCCCTCCTGTGGTTCTCAGCATCACAGAGTTGCTGATAGTACTGCTACTGCTGTTTGCAAGACTGAACAGCTTGGAAAATTGGTACTCCATGAGTCCTAAGATTAACCTTGCTTCCTTTATCAACCTTCGTGAAATGGGCTGAGTGGATGAAGGCATTGCACTAAGGAATGTTGCACTGAGGACTTCTCTTACTACTACCCTACCAAACGTTGGTGTAGTTGCTGGTTGAAATGCCACAAGCTACTGGTTTCAAAGGATATCTCCACAGCCTCTAGGATAATAAAGCTAGGTATTTCCTCCTGATATTACCTACTTTCAGCCGACCAGACTGCTTAGGGAGGGCGACATTTTCCCATCTAAAAAGATGATTGTCTCCTGTTTTCTCCATAAGAAATCTATTATTTATTTATTTTTAACATTCTTCCATTCTTGTTAAAAAAAATCTTACAAAATAATAAAAGTTCAATCATGGATCCTGGTTTTGAAGGAGGGATCTTGAAGAGGAAAGGTAGTAAATTGGATGATTAGCAAGGACAAATTTCCATCTTTTTTAATGGAAAATACATCTTAATGAAACCTTTTCCACG
TGAGGTAGGCTGTAGGCATTTTATGCAAGGTTTATTACCGTGGCGATGACCTTTCCCACGTGTATGTTCAACTGATGTTTCTTTTCAATGTAGCCTTAATCCTCACTTTCAAATCCCATCCAAAGTTTCAACTTTGAATCTAGCACATAATATCCAACTCTTGTGTATTCATTTGCTAAGAAATTGGATCGATACAAAGGATGGTTTGGTTTTCAACCGAAAGTGTTTTTTTGTTGGATAACTGTATCTTGTTCTAAAAAAAATTATAATGAAACCTTTTGATTGAATCTTCCTTTCATTTTATGAAATGGACACTAAGGGGCTGCTTGGTAAAAATTGCTCCTAAAATTGTCTAACCTTTTAAACTGTTTTTTTTTTTAATTATTGATTTTAAAATTTAAAAACTGTTTGTGAAGATATTTTTGTTCTCGTGTTTTCACTTGTTTCTTGTAAGGTTAGGAATAACTTATTATTCTATGATAATAAAATTCTACAGTTTCCTAGACAAAATCACAGTGTGCACTCTTAAAATTTTGCTGCAGGTTTTTTCTGATAGAATAATAATAATTATTATTTTTTTCTGTTAATCTCTGGGAACAACTTACTTATCTTTTACAAGATAGTCCTACTTATGGTTGGTAAGTTGACTAATAGTGATGTTCATAGATTGATGGCATTTATAAGGTCATGGTGCAGGCTTATCCTTGATATTCTTCTCTGATAACTAATTAAACATTTCTTCTATTTATACTTCTGCAGGAATTTAAGCTGAATCCAAGAGCCAAACTCTTCTCTCCATCTGTTACCAATAGCATGTCAGCAACTCCTGCAGCTTCAATGGTTGCAAGCGTGGCTTACATTTCAAACAACTCACCAGTAGTACCTGTGGCTGTTGCTCAGCCAGAGGTGGAGTTCAGTCCTTTTGTACCTCGTTCATCTGTGCCTGCCAAGTTTGTCCCTTATGGCAACTCAATAGCTGGATTTGGTGGCAATGTTGCTCAATTTTCCCAACCTGTACGTATTGTTTGGGCAGATTTTTCTTGATCAGTTTTCCTTTTGTTTTTCTTATCAAACTATCCATTTAGCCTTTTGGGGGGCCTAGACCAGCTCTTATAATTGTTGTTTCTCCATCTCACAACCAAAATAAATGAGAGCTAAAACTCAAAGTCTGAATATGTACTTCCAGAGGTCCAGTGGTACATCAGGTCTCTTCTTAGTTCAAGAAGGAAATTCTAAGTTTTATTTATAAGATCAGTTCTTTAGTTAAATGTACTATATCTTGGGCATCATAGATCTAATGATAGATGAGTTTGAATTCTGAAACTTGTACTTAGCTAATCCCTCCAGGATTTTGTTAAGGTGTTTATTGGTTTCAAGATGATTACATCAAATTGGTTTAATTGTTCATATAAAAATAAAAATATTGGTGTAAGTGTATCCCATAAAGCAAAAAGAGTTGTGCAGAGGGTTTTTCCATGAATGTTTCAATTGGCATTTCTACCGATAATTTAGTGATTAAGATTATGTGATGAGGATGCATGATGGTTGATGATGTAAGCAATTTTATACAACCGTTGTACTGAAACTTCATACTACTCTCTATTATAACGAGCCACATGTAACTGTTACATGTAATGATTTAGTTTATTTATTATTTTATGCTTTCAATAGCATCCACTTTACCGGTTATAGCATGATGGAAAATTTCAGTGCAAGGGTAGTTTTTTCTGCATGTTCACTGACTCGAGTTTTCTCCCTTTATCTTGACTACAGATGGTGGGACATGTAGGAACCAGGACGCAGCCAGTTAGATATGTTGGTCAGTATCCTCTCCAGGCTGGTCCAACCTTTGGGCCCCCAAACTCACAAGCAGTAAGTGGAAACAGGATTATATCTACCCTTTCAACTTAGTTTCTTCCATATGGTTGCAATTTCAGCGAGCAGATTAGAATATATTTTAATTTATTCACGACCAGCTGGTCATATGTGTTGCTAATCTC
TGCCCCTTCCCTTCGATTGTTACAGGTTATGGTCGGACGTTTTGGGCAACTTGTTTATGTTCACCCAGTCTCGCATGTAAGCACTTAGTCTCCCATTTGTTTGTTTTTTTTTTCCCCCATTTGTTAAAAGTGTAAACCATCCTTCCATAAATTCTATTTTGCTTCTTTGAATTATTTGACCTTGTAGAGTTAATCTTTTTCTTCACTTTACGTTTCATGACGCCATGAGCAGTTGAATATTTACTTAGTGTTTTGACTTGTGATTAATGTATGTTTTGTTTTAATATTTTAAATCTTTATTTTTAAGTTCTTTTTCCTCAAGGTCAATTAATTGTATAATCTATTTTTATGATGTTGCATCTTCCTTTGCCTTAACTTTCTACGACTTCTTTTTGGTTTCTGCATCCCTTTTTTTCCCTTTTTTTATTTCCTCTCTCCGAATTGCTTATTTATCTTGTCTCATATTGAAGGACTTGGCTCAAGGTACAACAGTCGTCTCACCGGTATCACCTTGCCCTTTGTTGACAACACAGCCAGCTCAATATCCAAAACATCAAGGTGAAATTCTTTTCCTGGCTGATGACTGAGTGTTTCGAATGTGTTGCTCAACTTTTATCATATCATAAACTATGGTATTTTTTTAGATGATGTTGCCTTCTTATCGGAAGAGAATGAAATTACAGTACTTATTTTACCCAATTAAGCTACCCCTCGTGACCAATCAACAGTTTCAATAGTTTAAGATTATGCTACCAAATATATCAATTTAAACAACTTGTTATTTGCCTTCTTCATCTAAGCGTCCTCCACAAATGCAGGAACTGCAGCAGCAGCAGCAGTACAAGCATTGCAGTTTTGCGTTCCTCCACCATTTATGGCCAATGGACACCAGCCGCTCTCCGCAGTGCCAAACCACATTCCAATTTTGCAGCCCTCCTTCCCCCTCAATCGCCCAATGCAAGTCCCAGGATCTAATGCATTCTTCAACACCAAGTTCACCTGA

mRNA sequence

ATGGATCCCAACTTGCAGCTTCATGCTTCTGACAATGATTATTTGCAGGATCCTTCTGTTTACAGGAGATTAATTGGCAGATTATTGTATTTGACCATTTCTCGTCCTGATATAACTTTTTACAAGTTGAGCCAATTTGTGTCCAAGCCGTGCAAGTCCCACCTATCTGCTGCCCACCATTTATTGCGATATTTAAAGGCTTCGCCAGGACAAGGTGTTTTCCTCCCAGCTTCTTCTTCCTTCCAGATCTCGTTTCCCTTCTGCAACACCAACTTTTCGACACTCAGAGAAACACCACCTCCACCGGCGGATCCATCAGCTAACACCCACACAGAAATTGGCATATGGGTTGCAGAAACAGGGAGTTTTCCGAAGATGACACTTCCTCTTCTACGCTTAGCGAGGCTTTGCTCTTTGCCACCATGTGCCTCATTGGCCTCCCAGTTGAGGTTCACGTTAAAGATGGCTCTGTCTATTGCGGCATCTTTCACACTGCCTGTGTGGAGAATGAATATGAATTCTACTTTTTGTATTCATTCTTATGTGAATTTGTGCCTTGGTTTCGCCCGTGAGCGAGTGGTTTTGATTTTATCGTTTACGTATGGCGGTGTTGTTCTGAAGAAAGCAAGGATGACAAAAAAGGGTAAAAGGAATGTGAATGTGGACGATGGAGTTGTAATCGATACTCTTATTGTTCTTTCCGGTGATCTTGTCCAAGTTGTTGCGACGGAAGTTCTACTTCCGGCTGGTAGTTTTTCCAAAAGTTTGGCTGGTTATGATAATGAAGCCATGGCCAATGTTCCTATTTCATTGCTTCCAGCTTCAGAGACTAAGACATGTATGGAGTCATTCAAGGAGGGGAGTCAGATGAATCAAACAAGCGACTTGGTCCAAGATCAGAATGGGTTTGCTCATGGTTCAGTGCCTACGATAACTGGGAAGCATGGTGATGTTAGACAGCTTTTGCGAGATAATGCTGAGAACAACCAGGGAGATGCACAGCAGAAAAGGGAAAGGATCAATTGCAAAAAGCCTGAAGGTGTCACTGATGCTGCAATCAATTGGAGACAGGACCCAGATAACCAATTAAAAAGGGAGCAGGATGATCATGGTCAGGAATTTGACATTCATAAAAGAGTCAATGTTGATCGAGTTCAATCCTCTATATCGAGTGAGAAACCATGCATTGAAAGACCCACCTCAGCCAACACTTCTCTGAATGCATATTCAGTTTGTGTCTCAACTTGTTCACTTTCGTCCGTAGATAGTAGCATGGACTCGTGTCATAGTTCTATAACATCAACGACCGACTTGGCCCCTTCTCATGTTTCAGAATCCAATAAAAGTGCCAAGGAATTTAAGCTGAATCCAAGAGCCAAACTCTTCTCTCCATCTGTTACCAATAGCATGTCAGCAACTCCTGCAGCTTCAATGGTTGCAAGCGTGGCTTACATTTCAAACAACTCACCAGTAGTACCTGTGGCTGTTGCTCAGCCAGAGGTGGAGTTCAGTCCTTTTGTACCTCGTTCATCTGTGCCTGCCAAGTTTGTCCCTTATGGCAACTCAATAGCTGGATTTGGTGGCAATGTTGCTCAATTTTCCCAACCTATGGTGGGACATGTAGGAACCAGGACGCAGCCAGTTAGATATGTTGGTCAGTATCCTCTCCAGGCTGGTCCAACCTTTGGGCCCCCAAACTCACAAGCAGTTATGGTCGGACGTTTTGGGCAACTTGTTTATGTTCACCCAGTCTCGCATGACTTGGCTCAAGGTACAACAGTCGTCTCACCGGTATCACCTTGCCCTTTGTTGACAACACAGCCAGCTCAATATCCAAAACATCAAGCGTCCTCCACAAATGCAGGAACTGCAGCAGCAGCAGCAGTACAAGCATTGCAGTTTTGCGTTCCTCCACCATTTATGGCCAATGGACACCAGCCGCTCTCCGCAGTGCCAAACCACATTCCAATTTTGCAGCCCTCCTTCCCCCTCAATCGCCC
AATGCAAGTCCCAGGATCTAATGCATTCTTCAACACCAAGTTCACCTGA

Coding sequence (CDS)

ATGGATCCCAACTTGCAGCTTCATGCTTCTGACAATGATTATTTGCAGGATCCTTCTGTTTACAGGAGATTAATTGGCAGATTATTGTATTTGACCATTTCTCGTCCTGATATAACTTTTTACAAGTTGAGCCAATTTGTGTCCAAGCCGTGCAAGTCCCACCTATCTGCTGCCCACCATTTATTGCGATATTTAAAGGCTTCGCCAGGACAAGGTGTTTTCCTCCCAGCTTCTTCTTCCTTCCAGATCTCGTTTCCCTTCTGCAACACCAACTTTTCGACACTCAGAGAAACACCACCTCCACCGGCGGATCCATCAGCTAACACCCACACAGAAATTGGCATATGGGTTGCAGAAACAGGGAGTTTTCCGAAGATGACACTTCCTCTTCTACGCTTAGCGAGGCTTTGCTCTTTGCCACCATGTGCCTCATTGGCCTCCCAGTTGAGGTTCACGTTAAAGATGGCTCTGTCTATTGCGGCATCTTTCACACTGCCTGTGTGGAGAATGAATATGAATTCTACTTTTTGTATTCATTCTTATGTGAATTTGTGCCTTGGTTTCGCCCGTGAGCGAGTGGTTTTGATTTTATCGTTTACGTATGGCGGTGTTGTTCTGAAGAAAGCAAGGATGACAAAAAAGGGTAAAAGGAATGTGAATGTGGACGATGGAGTTGTAATCGATACTCTTATTGTTCTTTCCGGTGATCTTGTCCAAGTTGTTGCGACGGAAGTTCTACTTCCGGCTGGTAGTTTTTCCAAAAGTTTGGCTGGTTATGATAATGAAGCCATGGCCAATGTTCCTATTTCATTGCTTCCAGCTTCAGAGACTAAGACATGTATGGAGTCATTCAAGGAGGGGAGTCAGATGAATCAAACAAGCGACTTGGTCCAAGATCAGAATGGGTTTGCTCATGGTTCAGTGCCTACGATAACTGGGAAGCATGGTGATGTTAGACAGCTTTTGCGAGATAATGCTGAGAACAACCAGGGAGATGCACAGCAGAAAAGGGAAAGGATCAATTGCAAAAAGCCTGAAGGTGTCACTGATGCTGCAATCAATTGGAGACAGGACCCAGATAACCAATTAAAAAGGGAGCAGGATGATCATGGTCAGGAATTTGACATTCATAAAAGAGTCAATGTTGATCGAGTTCAATCCTCTATATCGAGTGAGAAACCATGCATTGAAAGACCCACCTCAGCCAACACTTCTCTGAATGCATATTCAGTTTGTGTCTCAACTTGTTCACTTTCGTCCGTAGATAGTAGCATGGACTCGTGTCATAGTTCTATAACATCAACGACCGACTTGGCCCCTTCTCATGTTTCAGAATCCAATAAAAGTGCCAAGGAATTTAAGCTGAATCCAAGAGCCAAACTCTTCTCTCCATCTGTTACCAATAGCATGTCAGCAACTCCTGCAGCTTCAATGGTTGCAAGCGTGGCTTACATTTCAAACAACTCACCAGTAGTACCTGTGGCTGTTGCTCAGCCAGAGGTGGAGTTCAGTCCTTTTGTACCTCGTTCATCTGTGCCTGCCAAGTTTGTCCCTTATGGCAACTCAATAGCTGGATTTGGTGGCAATGTTGCTCAATTTTCCCAACCTATGGTGGGACATGTAGGAACCAGGACGCAGCCAGTTAGATATGTTGGTCAGTATCCTCTCCAGGCTGGTCCAACCTTTGGGCCCCCAAACTCACAAGCAGTTATGGTCGGACGTTTTGGGCAACTTGTTTATGTTCACCCAGTCTCGCATGACTTGGCTCAAGGTACAACAGTCGTCTCACCGGTATCACCTTGCCCTTTGTTGACAACACAGCCAGCTCAATATCCAAAACATCAAGCGTCCTCCACAAATGCAGGAACTGCAGCAGCAGCAGCAGTACAAGCATTGCAGTTTTGCGTTCCTCCACCATTTATGGCCAATGGACACCAGCCGCTCTCCGCAGTGCCAAACCACATTCCAATTTTGCAGCCCTCCTTCCCCCTCAATCGCCC
AATGCAAGTCCCAGGATCTAATGCATTCTTCAACACCAAGTTCACCTGA

Protein sequence

MDPNLQLHASDNDYLQDPSVYRRLIGRLLYLTISRPDITFYKLSQFVSKPCKSHLSAAHHLLRYLKASPGQGVFLPASSSFQISFPFCNTNFSTLRETPPPPADPSANTHTEIGIWVAETGSFPKMTLPLLRLARLCSLPPCASLASQLRFTLKMALSIAASFTLPVWRMNMNSTFCIHSYVNLCLGFARERVVLILSFTYGGVVLKKARMTKKGKRNVNVDDGVVIDTLIVLSGDLVQVVATEVLLPAGSFSKSLAGYDNEAMANVPISLLPASETKTCMESFKEGSQMNQTSDLVQDQNGFAHGSVPTITGKHGDVRQLLRDNAENNQGDAQQKRERINCKKPEGVTDAAINWRQDPDNQLKREQDDHGQEFDIHKRVNVDRVQSSISSEKPCIERPTSANTSLNAYSVCVSTCSLSSVDSSMDSCHSSITSTTDLAPSHVSESNKSAKEFKLNPRAKLFSPSVTNSMSATPAASMVASVAYISNNSPVVPVAVAQPEVEFSPFVPRSSVPAKFVPYGNSIAGFGGNVAQFSQPMVGHVGTRTQPVRYVGQYPLQAGPTFGPPNSQAVMVGRFGQLVYVHPVSHDLAQGTTVVSPVSPCPLLTTQPAQYPKHQASSTNAGTAAAAAVQALQFCVPPPFMANGHQPLSAVPNHIPILQPSFPLNRPMQVPGSNAFFNTKFT
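The protein sequence should follow from the coding sequence by codon-wise translation. The sketch below checks this on the first and last few codons of the CDS, assuming the standard nuclear genetic code and a reading frame that starts at the first base. `translate` is a hypothetical helper, not a function provided by the database.

```python
# Standard genetic code, built in TCAG order (assumed to apply to this gene).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First nine codons and last seven codons of the CDS shown above.
cds_start = "ATGGATCCCAACTTGCAGCTTCATGCT"
cds_end = "TTCAACACCAAGTTCACCTGA"
print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to `MDPNLQLHA` and `FNTKFT`, which match the start and end of the listed protein. Passing the full CDS string to the same function would be the natural way to check the whole sequence.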
Homology
BLAST of Sgr020012 vs. NCBI nr
Match: XP_022151014.1 (uncharacterized protein LOC111019035 isoform X2 [Momordica charantia])

HSP 1 Score: 818.1 bits (2112), Expect = 5.5e-233
Identity = 425/481 (88.36%), Positives = 448/481 (93.14%), Query Frame = 0

Query: 203 GVVLKKARMTKKGKRNVNVDDGVVIDTLIVLSGDLVQVVATEVLLPAGSFSKSLAGYDNE 262
           GVVLKKARMTKKGKRNVNVDDG VIDTLIVLSGDLVQVVATEVLLPAGSFSKSL GYDNE
Sbjct: 58  GVVLKKARMTKKGKRNVNVDDGAVIDTLIVLSGDLVQVVATEVLLPAGSFSKSLVGYDNE 117

Query: 263 AMANVPISLLPASETKTCMESFKEGSQMNQTSDLVQDQNGFAHGSVPTITGKHGDVRQLL 322
           AMANVP+SLLP  E KTCMESFKEGSQ+NQ S+LVQDQNGFAHGSVPTITGKH DVRQLL
Sbjct: 118 AMANVPVSLLPTVEAKTCMESFKEGSQINQISNLVQDQNGFAHGSVPTITGKHSDVRQLL 177

Query: 323 RDNAENNQGDAQQKRERINCKKPEGVTDAAINWRQDPDNQLKREQDDHGQEFDIHKRVNV 382
           RDN E+NQGDAQQKRERINCKKPE  TDAAINWRQDPDNQLKRE+DDH QEFD+HK VNV
Sbjct: 178 RDNIESNQGDAQQKRERINCKKPEDATDAAINWRQDPDNQLKREKDDHSQEFDLHKGVNV 237

Query: 383 DRVQSSISSEKPCIERPTSANTSLNAYSVCVSTCSLSSVDSSMDSCHSSITSTTDLAPSH 442
           DRVQSSISSEKPCIERPTSANT+ NA+SV VST SLSS+DSSMDSCHSSITST D+A  H
Sbjct: 238 DRVQSSISSEKPCIERPTSANTTPNAFSVGVSTSSLSSIDSSMDSCHSSITSTIDVASPH 297

Query: 443 VSESNKSAKEFKLNPRAKLFSPSVTNSMSATPAASMVASVAYISNNSPVVPVAVAQPEVE 502
            SESNKS+KEFKLNPRAKLFSPSV N+M+A+PA  +VA+VAYISN+SPVVPVAVAQPEVE
Sbjct: 298 GSESNKSSKEFKLNPRAKLFSPSVANNMTASPATPVVANVAYISNSSPVVPVAVAQPEVE 357

Query: 503 FSPFVPRSSV-PAKFVPYGNSIAGFGGNVAQFSQPMVGHVGTRTQPVRYVGQYPLQAGPT 562
           FSPFVPRSSV PAKFVPYGNSIAGFGGNVAQFSQPMVGHVGTRTQP+RYVGQYPLQAGPT
Sbjct: 358 FSPFVPRSSVPPAKFVPYGNSIAGFGGNVAQFSQPMVGHVGTRTQPLRYVGQYPLQAGPT 417

Query: 563 FGPPNSQAVMVGRFGQLVYVHPVSHDLAQGTTVVSPVSPCPLLTTQPAQYPKHQASSTNA 622
           FGPPNSQAVMVGRFGQLVYVHPVSHDLAQGTTVVSPVSPCPLLTTQPAQYPKHQ +    
Sbjct: 418 FGPPNSQAVMVGRFGQLVYVHPVSHDLAQGTTVVSPVSPCPLLTTQPAQYPKHQGT---- 477

Query: 623 GTAAAAAVQALQFCVPPPFMANGHQPLSAVPNHIPILQPSFPLNRPMQVPGSNAFFNTKF 682
               AAA QALQFCVPPPFMA+GHQPL+AVPNHIPILQPSFPLNRPMQ+PGSNAFF+TKF
Sbjct: 478 ----AAAQQALQFCVPPPFMASGHQPLAAVPNHIPILQPSFPLNRPMQMPGSNAFFSTKF 530
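The percentages in the HSP summary line are the match counts divided by the alignment length. The following sketch parses such a line and recomputes them. `parse_hsp_stats` is a hypothetical helper, and the regular expression is an assumption that tolerates both the "Positives" and "Postives" spellings.

```python
import re


def parse_hsp_stats(line: str) -> dict:
    """Extract identity and positive counts from a BLAST HSP summary line."""
    pattern = r"Identity\s*=\s*(\d+)/(\d+).*?Posi?tives\s*=\s*(\d+)/(\d+)"
    match = re.search(pattern, line)
    if match is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    identical, length, positives, length_pos = map(int, match.groups())
    if length != length_pos:
        raise ValueError("identity and positives use different alignment lengths")
    return {
        "identical": identical,
        "positives": positives,
        "length": length,
        "identity_pct": round(100 * identical / length, 2),
        "positives_pct": round(100 * positives / length, 2),
    }


stats = parse_hsp_stats(
    "Identity = 425/481 (88.36%), Positives = 448/481 (93.14%), Query Frame = 0"
)
print(stats["identity_pct"], stats["positives_pct"])
```

For the hit above, the recomputed values are 88.36 and 93.14, in agreement with the figures reported by BLAST.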

BLAST of Sgr020012 vs. NCBI nr
Match: XP_022151013.1 (uncharacterized protein LOC111019035 isoform X1 [Momordica charantia])

HSP 1 Score: 814.3 bits (2102), Expect = 7.9e-232
Identity = 425/482 (88.17%), Positives = 449/482 (93.15%), Query Frame = 0

Query: 203 GVVLKKARMTKKGKRNVNVDDGVVIDTLIVLSGDLVQVVATEVLLPAGSFSKSLAGYDNE 262
           GVVLKKARMTKKGKRNVNVDDG VIDTLIVLSGDLVQVVATEVLLPAGSFSKSL GYDNE
Sbjct: 58  GVVLKKARMTKKGKRNVNVDDGAVIDTLIVLSGDLVQVVATEVLLPAGSFSKSLVGYDNE 117

Query: 263 AMANVPISLLPASETKTCMESFKEGSQMNQ-TSDLVQDQNGFAHGSVPTITGKHGDVRQL 322
           AMANVP+SLLP  E KTCMESFKEGSQ+NQ +S+LVQDQNGFAHGSVPTITGKH DVRQL
Sbjct: 118 AMANVPVSLLPTVEAKTCMESFKEGSQINQISSNLVQDQNGFAHGSVPTITGKHSDVRQL 177

Query: 323 LRDNAENNQGDAQQKRERINCKKPEGVTDAAINWRQDPDNQLKREQDDHGQEFDIHKRVN 382
           LRDN E+NQGDAQQKRERINCKKPE  TDAAINWRQDPDNQLKRE+DDH QEFD+HK VN
Sbjct: 178 LRDNIESNQGDAQQKRERINCKKPEDATDAAINWRQDPDNQLKREKDDHSQEFDLHKGVN 237

Query: 383 VDRVQSSISSEKPCIERPTSANTSLNAYSVCVSTCSLSSVDSSMDSCHSSITSTTDLAPS 442
           VDRVQSSISSEKPCIERPTSANT+ NA+SV VST SLSS+DSSMDSCHSSITST D+A  
Sbjct: 238 VDRVQSSISSEKPCIERPTSANTTPNAFSVGVSTSSLSSIDSSMDSCHSSITSTIDVASP 297

Query: 443 HVSESNKSAKEFKLNPRAKLFSPSVTNSMSATPAASMVASVAYISNNSPVVPVAVAQPEV 502
           H SESNKS+KEFKLNPRAKLFSPSV N+M+A+PA  +VA+VAYISN+SPVVPVAVAQPEV
Sbjct: 298 HGSESNKSSKEFKLNPRAKLFSPSVANNMTASPATPVVANVAYISNSSPVVPVAVAQPEV 357

Query: 503 EFSPFVPRSSV-PAKFVPYGNSIAGFGGNVAQFSQPMVGHVGTRTQPVRYVGQYPLQAGP 562
           EFSPFVPRSSV PAKFVPYGNSIAGFGGNVAQFSQPMVGHVGTRTQP+RYVGQYPLQAGP
Sbjct: 358 EFSPFVPRSSVPPAKFVPYGNSIAGFGGNVAQFSQPMVGHVGTRTQPLRYVGQYPLQAGP 417

Query: 563 TFGPPNSQAVMVGRFGQLVYVHPVSHDLAQGTTVVSPVSPCPLLTTQPAQYPKHQASSTN 622
           TFGPPNSQAVMVGRFGQLVYVHPVSHDLAQGTTVVSPVSPCPLLTTQPAQYPKHQ +   
Sbjct: 418 TFGPPNSQAVMVGRFGQLVYVHPVSHDLAQGTTVVSPVSPCPLLTTQPAQYPKHQGT--- 477

Query: 623 AGTAAAAAVQALQFCVPPPFMANGHQPLSAVPNHIPILQPSFPLNRPMQVPGSNAFFNTK 682
                AAA QALQFCVPPPFMA+GHQPL+AVPNHIPILQPSFPLNRPMQ+PGSNAFF+TK
Sbjct: 478 -----AAAQQALQFCVPPPFMASGHQPLAAVPNHIPILQPSFPLNRPMQMPGSNAFFSTK 531
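Each Query/Sbjct pair can also be compared column by column, which is how the identity count is built up over the whole alignment. The sketch below does this for the first 60-column block of the alignment above (query residues 203 to 262). `count_identities` is a hypothetical helper, and it deliberately ignores the "positives" count, which would need a substitution matrix.

```python
def count_identities(query: str, subject: str) -> tuple[int, int]:
    """Return (identical columns, total columns) for two aligned rows.

    Gap characters ('-') never count as identical.
    """
    if len(query) != len(subject):
        raise ValueError("aligned rows must have the same length")
    identical = sum(q == s and q != "-" for q, s in zip(query, subject))
    return identical, len(query)


# First alignment block of the hit above (query 203-262, subject 58-117).
query_row = "GVVLKKARMTKKGKRNVNVDDGVVIDTLIVLSGDLVQVVATEVLLPAGSFSKSLAGYDNE"
sbjct_row = "GVVLKKARMTKKGKRNVNVDDGAVIDTLIVLSGDLVQVVATEVLLPAGSFSKSLVGYDNE"

identical, total = count_identities(query_row, sbjct_row)
print(identical, total)
```

In this block 58 of 60 columns are identical. The two mismatches correspond to the blanks in the BLAST midline. Summing the counts over all blocks of an HSP should reproduce the numerator of its Identity field.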

BLAST of Sgr020012 vs. NCBI nr
Match: XP_022151016.1 (uncharacterized protein LOC111019035 isoform X4 [Momordica charantia])

HSP 1 Score: 800.4 bits (2066), Expect = 1.2e-227
Identity = 417/474 (87.97%), Positives = 441/474 (93.04%), Query Frame = 0

Query: 211 MTKKGKRNVNVDDGVVIDTLIVLSGDLVQVVATEVLLPAGSFSKSLAGYDNEAMANVPIS 270
           MTKKGKRNVNVDDG VIDTLIVLSGDLVQVVATEVLLPAGSFSKSL GYDNEAMANVP+S
Sbjct: 1   MTKKGKRNVNVDDGAVIDTLIVLSGDLVQVVATEVLLPAGSFSKSLVGYDNEAMANVPVS 60

Query: 271 LLPASETKTCMESFKEGSQMNQ-TSDLVQDQNGFAHGSVPTITGKHGDVRQLLRDNAENN 330
           LLP  E KTCMESFKEGSQ+NQ +S+LVQDQNGFAHGSVPTITGKH DVRQLLRDN E+N
Sbjct: 61  LLPTVEAKTCMESFKEGSQINQISSNLVQDQNGFAHGSVPTITGKHSDVRQLLRDNIESN 120

Query: 331 QGDAQQKRERINCKKPEGVTDAAINWRQDPDNQLKREQDDHGQEFDIHKRVNVDRVQSSI 390
           QGDAQQKRERINCKKPE  TDAAINWRQDPDNQLKRE+DDH QEFD+HK VNVDRVQSSI
Sbjct: 121 QGDAQQKRERINCKKPEDATDAAINWRQDPDNQLKREKDDHSQEFDLHKGVNVDRVQSSI 180

Query: 391 SSEKPCIERPTSANTSLNAYSVCVSTCSLSSVDSSMDSCHSSITSTTDLAPSHVSESNKS 450
           SSEKPCIERPTSANT+ NA+SV VST SLSS+DSSMDSCHSSITST D+A  H SESNKS
Sbjct: 181 SSEKPCIERPTSANTTPNAFSVGVSTSSLSSIDSSMDSCHSSITSTIDVASPHGSESNKS 240

Query: 451 AKEFKLNPRAKLFSPSVTNSMSATPAASMVASVAYISNNSPVVPVAVAQPEVEFSPFVPR 510
           +KEFKLNPRAKLFSPSV N+M+A+PA  +VA+VAYISN+SPVVPVAVAQPEVEFSPFVPR
Sbjct: 241 SKEFKLNPRAKLFSPSVANNMTASPATPVVANVAYISNSSPVVPVAVAQPEVEFSPFVPR 300

Query: 511 SSV-PAKFVPYGNSIAGFGGNVAQFSQPMVGHVGTRTQPVRYVGQYPLQAGPTFGPPNSQ 570
           SSV PAKFVPYGNSIAGFGGNVAQFSQPMVGHVGTRTQP+RYVGQYPLQAGPTFGPPNSQ
Sbjct: 301 SSVPPAKFVPYGNSIAGFGGNVAQFSQPMVGHVGTRTQPLRYVGQYPLQAGPTFGPPNSQ 360

Query: 571 AVMVGRFGQLVYVHPVSHDLAQGTTVVSPVSPCPLLTTQPAQYPKHQASSTNAGTAAAAA 630
           AVMVGRFGQLVYVHPVSHDLAQGTTVVSPVSPCPLLTTQPAQYPKHQ +        AAA
Sbjct: 361 AVMVGRFGQLVYVHPVSHDLAQGTTVVSPVSPCPLLTTQPAQYPKHQGT--------AAA 420

Query: 631 VQALQFCVPPPFMANGHQPLSAVPNHIPILQPSFPLNRPMQVPGSNAFFNTKFT 683
            QALQFCVPPPFMA+GHQPL+AVPNHIPILQPSFPLNRPMQ+PGSNAFF+TKFT
Sbjct: 421 QQALQFCVPPPFMASGHQPLAAVPNHIPILQPSFPLNRPMQMPGSNAFFSTKFT 466

BLAST of Sgr020012 vs. NCBI nr
Match: XP_022943019.1 (polyadenylate-binding protein-interacting protein 4-like [Cucurbita moschata])

HSP 1 Score: 748.8 bits (1932), Expect = 4.1e-212
Identity = 401/482 (83.20%), Positives = 426/482 (88.38%), Query Frame = 0

Query: 203 GVVLKKARMTKKGKRNVNVDDGVVIDTLIVLSGDLVQVVATEVLLPAGSFSKSLAGYDNE 262
           GVVLKKARMTKKGKRNVNVDDGVVIDTLIVLSGDLVQVVATEVLLPAGSFSKSLA  DNE
Sbjct: 58  GVVLKKARMTKKGKRNVNVDDGVVIDTLIVLSGDLVQVVATEVLLPAGSFSKSLAVCDNE 117

Query: 263 AMANVPISLLPASETKTCMESFKEGSQMNQTSDLVQDQNGFAHGSVPTITGKHGDVRQLL 322
           +M N P SLLPA+ETKTCMESFKEGSQ+NQTSDLVQDQNGFAHGS+PT+TGK  DVRQLL
Sbjct: 118 SMVNDPTSLLPATETKTCMESFKEGSQINQTSDLVQDQNGFAHGSLPTVTGKQSDVRQLL 177

Query: 323 RDNAENNQGDAQQKRERINCKKPEGVTDAAINWRQDPDNQLKREQDDHGQEFDIHKRVNV 382
            DN ENN+GDAQQK+ER NCKKP+GVTDAAINWR+DPDNQLK+EQDDHGQEFD+HK  N 
Sbjct: 178 GDNVENNKGDAQQKKERSNCKKPQGVTDAAINWRRDPDNQLKKEQDDHGQEFDLHKGANG 237

Query: 383 DRVQSSISSEKPCIERPTSANTSLNAYSVCVSTCSLSSVDSSMDSCHSSITSTTDLAPSH 442
           DRVQSSI SEKPC ER  SANT+ NAYSV VST S SSVDSSMDSC SSITST D+APSH
Sbjct: 238 DRVQSSILSEKPCTERSISANTT-NAYSVGVSTSSRSSVDSSMDSCRSSITSTIDMAPSH 297

Query: 443 VSESNKSAKEFKLNPRAKLFSPSVTNSMSATPAASMVASVAYISNNS-PVVPVAVAQPEV 502
            SESNKS KEFKLNPRAKLFSPS  N M A  A  + A++AYISNNS PVVP AV QPE+
Sbjct: 298 GSESNKSGKEFKLNPRAKLFSPSNANIMPANHATPVAANLAYISNNSPPVVPSAVVQPEL 357

Query: 503 EFSPFVPRSSVP-AKFVPYGNSIAGFGGNVAQFSQPMVGHVGTRTQPVRYVGQYPLQAGP 562
           +FSPFVPRSSVP AKFVPYGNS+AGF GNVAQFSQPMVGHVGTRTQP+RYVGQYPLQAGP
Sbjct: 358 DFSPFVPRSSVPAAKFVPYGNSLAGFDGNVAQFSQPMVGHVGTRTQPLRYVGQYPLQAGP 417

Query: 563 TFGPPNSQAVMVGRFGQLVYVHPVSHDLAQGTTVVSPVSPCPLLTTQPAQYPKHQASSTN 622
           TFGPPNS AVMV RFGQLVY+ PVSHDLAQG TVVSPV PCPLLTTQPAQYPKHQ     
Sbjct: 418 TFGPPNSSAVMVSRFGQLVYMQPVSHDLAQGATVVSPVPPCPLLTTQPAQYPKHQ----- 477

Query: 623 AGTAAAAAVQALQFCVPPPFMANGHQPLSAVPNHIPILQP-SFPLNRPMQVPGSNAFFNT 682
            GTAAA   QALQFCVPPPFMA+GHQPL+AVPNHIPILQP SFPLNRPM +PG+NAFF T
Sbjct: 478 -GTAAA---QALQFCVPPPFMASGHQPLAAVPNHIPILQPSSFPLNRPMPMPGTNAFFTT 529

BLAST of Sgr020012 vs. NCBI nr
Match: KAG6600769.1 (Polyadenylate-binding protein-interacting protein 4, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 748.8 bits (1932), Expect = 4.1e-212
Identity = 398/481 (82.74%), Positives = 425/481 (88.36%), Query Frame = 0

Query: 203 GVVLKKARMTKKGKRNVNVDDGVVIDTLIVLSGDLVQVVATEVLLPAGSFSKSLAGYDNE 262
           GVVLKKARMTKKGKRNVNVDDGVVIDTLIVLSGDLVQVVATEVLLPAGSFSKS+A  DNE
Sbjct: 58  GVVLKKARMTKKGKRNVNVDDGVVIDTLIVLSGDLVQVVATEVLLPAGSFSKSMAVCDNE 117

Query: 263 AMANVPISLLPASETKTCMESFKEGSQMNQTSDLVQDQNGFAHGSVPTITGKHGDVRQLL 322
           +M N P SLLPA+ETKTCMESFKEGSQ+NQTSDL+QDQNGFAHGS+PT+TGK  DVRQLL
Sbjct: 118 SMVNDPTSLLPATETKTCMESFKEGSQINQTSDLIQDQNGFAHGSLPTVTGKQSDVRQLL 177

Query: 323 RDNAENNQGDAQQKRERINCKKPEGVTDAAINWRQDPDNQLKREQDDHGQEFDIHKRVNV 382
            DN ENN+ DAQQK+ER NCKKPEGVTDAAINWR+DPDNQLK+EQDDHGQEFD+HK  N 
Sbjct: 178 GDNVENNKEDAQQKKERSNCKKPEGVTDAAINWRRDPDNQLKKEQDDHGQEFDLHKGANG 237

Query: 383 DRVQSSISSEKPCIERPTSANTSLNAYSVCVSTCSLSSVDSSMDSCHSSITSTTDLAPSH 442
           DRVQSSI SEKPC ER  SANT+ NAYSV VST S SSVDSSMDSC SSITST D+APSH
Sbjct: 238 DRVQSSILSEKPCTERSISANTT-NAYSVGVSTSSRSSVDSSMDSCRSSITSTIDMAPSH 297

Query: 443 VSESNKSAKEFKLNPRAKLFSPSVTNSMSATPAASMVASVAYISNNS-PVVPVAVAQPEV 502
            SESNKS KEFKLNPRAKLFSPS  N + A  A  + A++AYISNNS PVVP AV QPE+
Sbjct: 298 GSESNKSGKEFKLNPRAKLFSPSNANIIPANHATPVAANLAYISNNSPPVVPSAVVQPEL 357

Query: 503 EFSPFVPRSSVP-AKFVPYGNSIAGFGGNVAQFSQPMVGHVGTRTQPVRYVGQYPLQAGP 562
           +FSPFVPRSSVP AKFVPYGNS+AGF GNVAQFSQPMVGHVGTRTQP+RYVGQYPLQAGP
Sbjct: 358 DFSPFVPRSSVPAAKFVPYGNSLAGFDGNVAQFSQPMVGHVGTRTQPLRYVGQYPLQAGP 417

Query: 563 TFGPPNSQAVMVGRFGQLVYVHPVSHDLAQGTTVVSPVSPCPLLTTQPAQYPKHQASSTN 622
           TFGPPNS AVMV RFGQLVY+ PVSHDLAQG TVVSPV PCPLLTTQPAQYPKHQ     
Sbjct: 418 TFGPPNSSAVMVSRFGQLVYMQPVSHDLAQGATVVSPVPPCPLLTTQPAQYPKHQ----- 477

Query: 623 AGTAAAAAVQALQFCVPPPFMANGHQPLSAVPNHIPILQPSFPLNRPMQVPGSNAFFNTK 682
            GTAAA   QALQFCVPPPFMA+GHQPL+AVPNHIPILQPSFPLNRPM +PG+NAFF TK
Sbjct: 478 -GTAAA---QALQFCVPPPFMASGHQPLAAVPNHIPILQPSFPLNRPMPMPGTNAFFTTK 528

BLAST of Sgr020012 vs. ExPASy Swiss-Prot
Match: Q94HW2 (Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana OX=3702 GN=RE1 PE=2 SV=1)

HSP 1 Score: 65.5 bits (158), Expect = 2.7e-09
Identity = 32/85 (37.65%), Positives = 49/85 (57.65%), Query Frame = 0

Query: 1    MDPNLQLHASDNDYLQDPSVYRRLIGRLLYLTISRPDITF--YKLSQFVSKPCKSHLSAA 60
            M P+ +L       L DP+ YR ++G L YL  +RPDI++   +LSQF+  P + HL A 
Sbjct: 1225 MAPSPKLSLYSGTKLTDPTEYRGIVGSLQYLAFTRPDISYAVNRLSQFMHMPTEEHLQAL 1284

Query: 61   HHLLRYLKASPGQGVFLPASSSFQI 84
              +LRYL  +P  G+FL   ++  +
Sbjct: 1285 KRILRYLAGTPNHGIFLKKGNTLSL 1309

BLAST of Sgr020012 vs. ExPASy Swiss-Prot
Match: Q9ZT94 (Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana OX=3702 GN=RE2 PE=4 SV=1)

HSP 1 Score: 61.2 bits (147), Expect = 5.1e-08
Identity = 30/83 (36.14%), Positives = 48/83 (57.83%), Query Frame = 0

Query: 3    PNLQLHASDNDYLQDPSVYRRLIGRLLYLTISRPDITF--YKLSQFVSKPCKSHLSAAHH 62
            P L LH+     L DP+ YR ++G L YL  +RPD+++   +LSQ++  P   H +A   
Sbjct: 1212 PKLTLHSGTK--LPDPTEYRGIVGSLQYLAFTRPDLSYAVNRLSQYMHMPTDDHWNALKR 1271

Query: 63   LLRYLKASPGQGVFLPASSSFQI 84
            +LRYL  +P  G+FL   ++  +
Sbjct: 1272 VLRYLAGTPDHGIFLKKGNTLSL 1292

BLAST of Sgr020012 vs. ExPASy Swiss-Prot
Match: P93290 (Uncharacterized mitochondrial protein AtMg00240 OS=Arabidopsis thaliana OX=3702 GN=AtMg00240 PE=4 SV=1)

HSP 1 Score: 51.2 bits (121), Expect = 5.3e-05
Identity = 27/72 (37.50%), Positives = 47/72 (65.28%), Query Frame = 0

Query: 29 LYLTISRPDITF--YKLSQFVSKPCKSHLSAAHHLLRYLKASPGQGVFLPASSSFQISFP 88
          +YLTI+RPD+TF   +LSQF S    + + A + +L Y+K + GQG+F  A+S  Q+   
Sbjct: 1  MYLTITRPDLTFAVNRLSQFSSASRTAQMQAVYKVLHYVKGTVGQGLFYSATSDLQLK-A 60

Query: 89 FCNTNFSTLRET 99
          F ++++++  +T
Sbjct: 61 FADSDWASCPDT 71

BLAST of Sgr020012 vs. ExPASy TrEMBL
Match: A0A6J1DBR8 (uncharacterized protein LOC111019035 isoform X2 OS=Momordica charantia OX=3673 GN=LOC111019035 PE=4 SV=1)

HSP 1 Score: 818.1 bits (2112), Expect = 2.6e-233
Identity = 425/481 (88.36%), Positives = 448/481 (93.14%), Query Frame = 0

Query: 203 GVVLKKARMTKKGKRNVNVDDGVVIDTLIVLSGDLVQVVATEVLLPAGSFSKSLAGYDNE 262
           GVVLKKARMTKKGKRNVNVDDG VIDTLIVLSGDLVQVVATEVLLPAGSFSKSL GYDNE
Sbjct: 58  GVVLKKARMTKKGKRNVNVDDGAVIDTLIVLSGDLVQVVATEVLLPAGSFSKSLVGYDNE 117

Query: 263 AMANVPISLLPASETKTCMESFKEGSQMNQTSDLVQDQNGFAHGSVPTITGKHGDVRQLL 322
           AMANVP+SLLP  E KTCMESFKEGSQ+NQ S+LVQDQNGFAHGSVPTITGKH DVRQLL
Sbjct: 118 AMANVPVSLLPTVEAKTCMESFKEGSQINQISNLVQDQNGFAHGSVPTITGKHSDVRQLL 177

Query: 323 RDNAENNQGDAQQKRERINCKKPEGVTDAAINWRQDPDNQLKREQDDHGQEFDIHKRVNV 382
           RDN E+NQGDAQQKRERINCKKPE  TDAAINWRQDPDNQLKRE+DDH QEFD+HK VNV
Sbjct: 178 RDNIESNQGDAQQKRERINCKKPEDATDAAINWRQDPDNQLKREKDDHSQEFDLHKGVNV 237

Query: 383 DRVQSSISSEKPCIERPTSANTSLNAYSVCVSTCSLSSVDSSMDSCHSSITSTTDLAPSH 442
           DRVQSSISSEKPCIERPTSANT+ NA+SV VST SLSS+DSSMDSCHSSITST D+A  H
Sbjct: 238 DRVQSSISSEKPCIERPTSANTTPNAFSVGVSTSSLSSIDSSMDSCHSSITSTIDVASPH 297

Query: 443 VSESNKSAKEFKLNPRAKLFSPSVTNSMSATPAASMVASVAYISNNSPVVPVAVAQPEVE 502
            SESNKS+KEFKLNPRAKLFSPSV N+M+A+PA  +VA+VAYISN+SPVVPVAVAQPEVE
Sbjct: 298 GSESNKSSKEFKLNPRAKLFSPSVANNMTASPATPVVANVAYISNSSPVVPVAVAQPEVE 357

Query: 503 FSPFVPRSSV-PAKFVPYGNSIAGFGGNVAQFSQPMVGHVGTRTQPVRYVGQYPLQAGPT 562
           FSPFVPRSSV PAKFVPYGNSIAGFGGNVAQFSQPMVGHVGTRTQP+RYVGQYPLQAGPT
Sbjct: 358 FSPFVPRSSVPPAKFVPYGNSIAGFGGNVAQFSQPMVGHVGTRTQPLRYVGQYPLQAGPT 417

Query: 563 FGPPNSQAVMVGRFGQLVYVHPVSHDLAQGTTVVSPVSPCPLLTTQPAQYPKHQASSTNA 622
           FGPPNSQAVMVGRFGQLVYVHPVSHDLAQGTTVVSPVSPCPLLTTQPAQYPKHQ +    
Sbjct: 418 FGPPNSQAVMVGRFGQLVYVHPVSHDLAQGTTVVSPVSPCPLLTTQPAQYPKHQGT---- 477

Query: 623 GTAAAAAVQALQFCVPPPFMANGHQPLSAVPNHIPILQPSFPLNRPMQVPGSNAFFNTKF 682
               AAA QALQFCVPPPFMA+GHQPL+AVPNHIPILQPSFPLNRPMQ+PGSNAFF+TKF
Sbjct: 478 ----AAAQQALQFCVPPPFMASGHQPLAAVPNHIPILQPSFPLNRPMQMPGSNAFFSTKF 530

BLAST of Sgr020012 vs. ExPASy TrEMBL
Match: A0A6J1DB05 (uncharacterized protein LOC111019035 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111019035 PE=4 SV=1)

HSP 1 Score: 814.3 bits (2102), Expect = 3.8e-232
Identity = 425/482 (88.17%), Positives = 449/482 (93.15%), Query Frame = 0

Query: 203 GVVLKKARMTKKGKRNVNVDDGVVIDTLIVLSGDLVQVVATEVLLPAGSFSKSLAGYDNE 262
           GVVLKKARMTKKGKRNVNVDDG VIDTLIVLSGDLVQVVATEVLLPAGSFSKSL GYDNE
Sbjct: 58  GVVLKKARMTKKGKRNVNVDDGAVIDTLIVLSGDLVQVVATEVLLPAGSFSKSLVGYDNE 117

Query: 263 AMANVPISLLPASETKTCMESFKEGSQMNQ-TSDLVQDQNGFAHGSVPTITGKHGDVRQL 322
           AMANVP+SLLP  E KTCMESFKEGSQ+NQ +S+LVQDQNGFAHGSVPTITGKH DVRQL
Sbjct: 118 AMANVPVSLLPTVEAKTCMESFKEGSQINQISSNLVQDQNGFAHGSVPTITGKHSDVRQL 177

Query: 323 LRDNAENNQGDAQQKRERINCKKPEGVTDAAINWRQDPDNQLKREQDDHGQEFDIHKRVN 382
           LRDN E+NQGDAQQKRERINCKKPE  TDAAINWRQDPDNQLKRE+DDH QEFD+HK VN
Sbjct: 178 LRDNIESNQGDAQQKRERINCKKPEDATDAAINWRQDPDNQLKREKDDHSQEFDLHKGVN 237

Query: 383 VDRVQSSISSEKPCIERPTSANTSLNAYSVCVSTCSLSSVDSSMDSCHSSITSTTDLAPS 442
           VDRVQSSISSEKPCIERPTSANT+ NA+SV VST SLSS+DSSMDSCHSSITST D+A  
Sbjct: 238 VDRVQSSISSEKPCIERPTSANTTPNAFSVGVSTSSLSSIDSSMDSCHSSITSTIDVASP 297

Query: 443 HVSESNKSAKEFKLNPRAKLFSPSVTNSMSATPAASMVASVAYISNNSPVVPVAVAQPEV 502
           H SESNKS+KEFKLNPRAKLFSPSV N+M+A+PA  +VA+VAYISN+SPVVPVAVAQPEV
Sbjct: 298 HGSESNKSSKEFKLNPRAKLFSPSVANNMTASPATPVVANVAYISNSSPVVPVAVAQPEV 357

Query: 503 EFSPFVPRSSV-PAKFVPYGNSIAGFGGNVAQFSQPMVGHVGTRTQPVRYVGQYPLQAGP 562
           EFSPFVPRSSV PAKFVPYGNSIAGFGGNVAQFSQPMVGHVGTRTQP+RYVGQYPLQAGP
Sbjct: 358 EFSPFVPRSSVPPAKFVPYGNSIAGFGGNVAQFSQPMVGHVGTRTQPLRYVGQYPLQAGP 417

Query: 563 TFGPPNSQAVMVGRFGQLVYVHPVSHDLAQGTTVVSPVSPCPLLTTQPAQYPKHQASSTN 622
           TFGPPNSQAVMVGRFGQLVYVHPVSHDLAQGTTVVSPVSPCPLLTTQPAQYPKHQ +   
Sbjct: 418 TFGPPNSQAVMVGRFGQLVYVHPVSHDLAQGTTVVSPVSPCPLLTTQPAQYPKHQGT--- 477

Query: 623 AGTAAAAAVQALQFCVPPPFMANGHQPLSAVPNHIPILQPSFPLNRPMQVPGSNAFFNTK 682
                AAA QALQFCVPPPFMA+GHQPL+AVPNHIPILQPSFPLNRPMQ+PGSNAFF+TK
Sbjct: 478 -----AAAQQALQFCVPPPFMASGHQPLAAVPNHIPILQPSFPLNRPMQMPGSNAFFSTK 531

BLAST of Sgr020012 vs. ExPASy TrEMBL
Match: A0A6J1DD98 (uncharacterized protein LOC111019035 isoform X4 OS=Momordica charantia OX=3673 GN=LOC111019035 PE=4 SV=1)

HSP 1 Score: 800.4 bits (2066), Expect = 5.7e-228
Identity = 417/474 (87.97%), Positives = 441/474 (93.04%), Query Frame = 0

Query: 211 MTKKGKRNVNVDDGVVIDTLIVLSGDLVQVVATEVLLPAGSFSKSLAGYDNEAMANVPIS 270
           MTKKGKRNVNVDDG VIDTLIVLSGDLVQVVATEVLLPAGSFSKSL GYDNEAMANVP+S
Sbjct: 1   MTKKGKRNVNVDDGAVIDTLIVLSGDLVQVVATEVLLPAGSFSKSLVGYDNEAMANVPVS 60

Query: 271 LLPASETKTCMESFKEGSQMNQ-TSDLVQDQNGFAHGSVPTITGKHGDVRQLLRDNAENN 330
           LLP  E KTCMESFKEGSQ+NQ +S+LVQDQNGFAHGSVPTITGKH DVRQLLRDN E+N
Sbjct: 61  LLPTVEAKTCMESFKEGSQINQISSNLVQDQNGFAHGSVPTITGKHSDVRQLLRDNIESN 120

Query: 331 QGDAQQKRERINCKKPEGVTDAAINWRQDPDNQLKREQDDHGQEFDIHKRVNVDRVQSSI 390
           QGDAQQKRERINCKKPE  TDAAINWRQDPDNQLKRE+DDH QEFD+HK VNVDRVQSSI
Sbjct: 121 QGDAQQKRERINCKKPEDATDAAINWRQDPDNQLKREKDDHSQEFDLHKGVNVDRVQSSI 180

Query: 391 SSEKPCIERPTSANTSLNAYSVCVSTCSLSSVDSSMDSCHSSITSTTDLAPSHVSESNKS 450
           SSEKPCIERPTSANT+ NA+SV VST SLSS+DSSMDSCHSSITST D+A  H SESNKS
Sbjct: 181 SSEKPCIERPTSANTTPNAFSVGVSTSSLSSIDSSMDSCHSSITSTIDVASPHGSESNKS 240

Query: 451 AKEFKLNPRAKLFSPSVTNSMSATPAASMVASVAYISNNSPVVPVAVAQPEVEFSPFVPR 510
           +KEFKLNPRAKLFSPSV N+M+A+PA  +VA+VAYISN+SPVVPVAVAQPEVEFSPFVPR
Sbjct: 241 SKEFKLNPRAKLFSPSVANNMTASPATPVVANVAYISNSSPVVPVAVAQPEVEFSPFVPR 300

Query: 511 SSV-PAKFVPYGNSIAGFGGNVAQFSQPMVGHVGTRTQPVRYVGQYPLQAGPTFGPPNSQ 570
           SSV PAKFVPYGNSIAGFGGNVAQFSQPMVGHVGTRTQP+RYVGQYPLQAGPTFGPPNSQ
Sbjct: 301 SSVPPAKFVPYGNSIAGFGGNVAQFSQPMVGHVGTRTQPLRYVGQYPLQAGPTFGPPNSQ 360

Query: 571 AVMVGRFGQLVYVHPVSHDLAQGTTVVSPVSPCPLLTTQPAQYPKHQASSTNAGTAAAAA 630
           AVMVGRFGQLVYVHPVSHDLAQGTTVVSPVSPCPLLTTQPAQYPKHQ +        AAA
Sbjct: 361 AVMVGRFGQLVYVHPVSHDLAQGTTVVSPVSPCPLLTTQPAQYPKHQGT--------AAA 420

Query: 631 VQALQFCVPPPFMANGHQPLSAVPNHIPILQPSFPLNRPMQVPGSNAFFNTKFT 683
            QALQFCVPPPFMA+GHQPL+AVPNHIPILQPSFPLNRPMQ+PGSNAFF+TKFT
Sbjct: 421 QQALQFCVPPPFMASGHQPLAAVPNHIPILQPSFPLNRPMQMPGSNAFFSTKFT 466

BLAST of Sgr020012 vs. ExPASy TrEMBL
Match: A0A6J1FT17 (polyadenylate-binding protein-interacting protein 4-like OS=Cucurbita moschata OX=3662 GN=LOC111447883 PE=4 SV=1)

HSP 1 Score: 748.8 bits (1932), Expect = 2.0e-212
Identity = 401/482 (83.20%), Positives = 426/482 (88.38%), Query Frame = 0

Query: 203 GVVLKKARMTKKGKRNVNVDDGVVIDTLIVLSGDLVQVVATEVLLPAGSFSKSLAGYDNE 262
           GVVLKKARMTKKGKRNVNVDDGVVIDTLIVLSGDLVQVVATEVLLPAGSFSKSLA  DNE
Sbjct: 58  GVVLKKARMTKKGKRNVNVDDGVVIDTLIVLSGDLVQVVATEVLLPAGSFSKSLAVCDNE 117

Query: 263 AMANVPISLLPASETKTCMESFKEGSQMNQTSDLVQDQNGFAHGSVPTITGKHGDVRQLL 322
           +M N P SLLPA+ETKTCMESFKEGSQ+NQTSDLVQDQNGFAHGS+PT+TGK  DVRQLL
Sbjct: 118 SMVNDPTSLLPATETKTCMESFKEGSQINQTSDLVQDQNGFAHGSLPTVTGKQSDVRQLL 177

Query: 323 RDNAENNQGDAQQKRERINCKKPEGVTDAAINWRQDPDNQLKREQDDHGQEFDIHKRVNV 382
            DN ENN+GDAQQK+ER NCKKP+GVTDAAINWR+DPDNQLK+EQDDHGQEFD+HK  N 
Sbjct: 178 GDNVENNKGDAQQKKERSNCKKPQGVTDAAINWRRDPDNQLKKEQDDHGQEFDLHKGANG 237

Query: 383 DRVQSSISSEKPCIERPTSANTSLNAYSVCVSTCSLSSVDSSMDSCHSSITSTTDLAPSH 442
           DRVQSSI SEKPC ER  SANT+ NAYSV VST S SSVDSSMDSC SSITST D+APSH
Sbjct: 238 DRVQSSILSEKPCTERSISANTT-NAYSVGVSTSSRSSVDSSMDSCRSSITSTIDMAPSH 297

Query: 443 VSESNKSAKEFKLNPRAKLFSPSVTNSMSATPAASMVASVAYISNNS-PVVPVAVAQPEV 502
            SESNKS KEFKLNPRAKLFSPS  N M A  A  + A++AYISNNS PVVP AV QPE+
Sbjct: 298 GSESNKSGKEFKLNPRAKLFSPSNANIMPANHATPVAANLAYISNNSPPVVPSAVVQPEL 357

Query: 503 EFSPFVPRSSVP-AKFVPYGNSIAGFGGNVAQFSQPMVGHVGTRTQPVRYVGQYPLQAGP 562
           +FSPFVPRSSVP AKFVPYGNS+AGF GNVAQFSQPMVGHVGTRTQP+RYVGQYPLQAGP
Sbjct: 358 DFSPFVPRSSVPAAKFVPYGNSLAGFDGNVAQFSQPMVGHVGTRTQPLRYVGQYPLQAGP 417

Query: 563 TFGPPNSQAVMVGRFGQLVYVHPVSHDLAQGTTVVSPVSPCPLLTTQPAQYPKHQASSTN 622
           TFGPPNS AVMV RFGQLVY+ PVSHDLAQG TVVSPV PCPLLTTQPAQYPKHQ     
Sbjct: 418 TFGPPNSSAVMVSRFGQLVYMQPVSHDLAQGATVVSPVPPCPLLTTQPAQYPKHQ----- 477

Query: 623 AGTAAAAAVQALQFCVPPPFMANGHQPLSAVPNHIPILQP-SFPLNRPMQVPGSNAFFNT 682
            GTAAA   QALQFCVPPPFMA+GHQPL+AVPNHIPILQP SFPLNRPM +PG+NAFF T
Sbjct: 478 -GTAAA---QALQFCVPPPFMASGHQPLAAVPNHIPILQPSSFPLNRPMPMPGTNAFFTT 529

BLAST of Sgr020012 vs. ExPASy TrEMBL
Match: A0A6J1IG89 (polyadenylate-binding protein-interacting protein 4-like OS=Cucurbita maxima OX=3661 GN=LOC111476647 PE=4 SV=1)

HSP 1 Score: 743.8 bits (1919), Expect = 6.3e-211
Identity = 400/482 (82.99%), Positives = 424/482 (87.97%), Query Frame = 0

Query: 203 GVVLKKARMTKKGKRNVNVDDGVVIDTLIVLSGDLVQVVATEVLLPAGSFSKSLAGYDNE 262
           GVVLKKARMTKKGKRNVNVDDGVVIDTLIVLSGDLVQVVATEVLLPAGSFSKSLA  DNE
Sbjct: 58  GVVLKKARMTKKGKRNVNVDDGVVIDTLIVLSGDLVQVVATEVLLPAGSFSKSLAVCDNE 117

Query: 263 AMANVPISLLPASETKTCMESFKEGSQMNQTSDLVQDQNGFAHGSVPTITGKHGDVRQLL 322
           +M N P SLLPA+ETKTCMESFKEGSQ+NQTSDLVQDQNGFAHGS+PT+TGK  DVRQLL
Sbjct: 118 SMVNDPTSLLPATETKTCMESFKEGSQINQTSDLVQDQNGFAHGSLPTVTGKQSDVRQLL 177

Query: 323 RDNAENNQGDAQQKRERINCKKPEGVTDAAINWRQDPDNQLKREQDDHGQEFDIHKRVNV 382
            DN ENN+GDAQQK+E  NCKKPEGVTDAAINWR+DPDNQLK+EQDDHGQEFD+HK  N 
Sbjct: 178 GDNVENNKGDAQQKKEGSNCKKPEGVTDAAINWRRDPDNQLKKEQDDHGQEFDLHKGANG 237

Query: 383 DRVQSSISSEKPCIERPTSANTSLNAYSVCVSTCSLSSVDSSMDSCHSSITSTTDLAPSH 442
           DRVQSSI SEKPC ER  SANT+ NAYSV VST S SSVDSSMDSC SSITST D+APSH
Sbjct: 238 DRVQSSILSEKPCTERSISANTT-NAYSVGVSTSSRSSVDSSMDSCRSSITSTIDMAPSH 297

Query: 443 VSESNKSAKEFKLNPRAKLFSPSVTNSMSATPAASMVASVAYISNNS-PVVPVAVAQPEV 502
            SESNKS KEFKLNPRAKLFSPS  N M A  A  + A++AYISNNS PVVP AV QPE+
Sbjct: 298 GSESNKSGKEFKLNPRAKLFSPSNANIMPANHATPVAANLAYISNNSPPVVPSAVIQPEL 357

Query: 503 EFSPFVPRSSVP-AKFVPYGNSIAGFGGNVAQFSQPMVGHVGTRTQPVRYVGQYPLQAGP 562
           +FSPFVPRSSVP AKFVPYGNS+AGF GNVAQFSQPMVGHVGTRTQP+RYVGQYPLQAGP
Sbjct: 358 DFSPFVPRSSVPAAKFVPYGNSLAGFDGNVAQFSQPMVGHVGTRTQPLRYVGQYPLQAGP 417

Query: 563 TFGPPNSQAVMVGRFGQLVYVHPVSHDLAQGTTVVSPVSPCPLLTTQPAQYPKHQASSTN 622
           TFGP NS AVMV RFGQLVY+ PVSHDLAQG TVVSPV PCPLLTTQPAQYPKHQ     
Sbjct: 418 TFGPQNSSAVMVSRFGQLVYMQPVSHDLAQGATVVSPVPPCPLLTTQPAQYPKHQ----- 477

Query: 623 AGTAAAAAVQALQFCVPPPFMANGHQPLSAVPNHIPILQP-SFPLNRPMQVPGSNAFFNT 682
            GTAAA   QALQFCVPPPFMA+GHQPL+AVPNHIPILQP SFPLNRPM +PG+NAFF T
Sbjct: 478 -GTAAA---QALQFCVPPPFMASGHQPLAAVPNHIPILQPSSFPLNRPMPMPGTNAFFTT 529

BLAST of Sgr020012 vs. TAIR 10
Match: AT5G54920.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G26990.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 207.6 bits (527), Expect = 3.1e-53
Identity = 169/499 (33.87%), Positives = 251/499 (50.30%), Query Frame = 0

Query: 204 VVLKKARMTKKGKRNVNVDDGVVIDTLIVLSGDLVQVVATEVLLPAGSFSKSLAGYDNEA 263
           +VLK A++TKKG+   NV+ G +++TL++LS ++VQ+VA  V     S S ++AG   E 
Sbjct: 64  IVLKNAKLTKKGRSKSNVESGKIVETLVILSSNIVQIVAEGV-----SLSSNVAG---EI 123

Query: 264 MANVPISLLPASETKTCMESFKEGSQMNQTSDLVQDQNGFAHGSVPTITGKHGDVRQLLR 323
                +S +  S       S K     N+  +  + +N        T+T   G+    ++
Sbjct: 124 EGENVVSAVAVSS----FNSGKNRRGTNRRRNSAKREN-CLESKARTLTS--GETAGAMK 183

Query: 324 DNAENNQGDAQQKR---ERINCKKPEGVTDAAINWRQDPDNQLKREQDDHGQEFDIHKRV 383
           +    ++    Q +     +N ++  GV                R   +  +  D+H+  
Sbjct: 184 EPGRRDEVGILQNKYHPSSLNHQRQAGV----------------RILKNSKKITDVHQED 243

Query: 384 NVDRVQSSISSE------KPCIERPTSANTSLNAY--------SVCVSTCSLSSVDSSMD 443
           NV+   SS S +      KP IE+      S N +        S   S+   ++VD + +
Sbjct: 244 NVEARSSSCSLDNMSERVKP-IEQEKMPEPSSNGFHDATERPSSTENSSSQSTTVDENSE 303

Query: 444 SCHSSITSTTDLAPSHVSESNKSAKEFKLNPRAKLFSPSVTNSMSATPA--ASMVASVAY 503
                + ST  L P+  ++ +K AKEFKLNP AK FSPS+   +++  A    +VA++ Y
Sbjct: 304 VSLVLVVSTNSLPPTQATDPDKKAKEFKLNPGAKTFSPSLAKRLTSAHAGMTPVVANMGY 363

Query: 504 ISNNSPVVPVAVA-QPEVEFSPFVPRSSVPAKFVPYGNSIAGFGGNVAQFSQPMVGHVGT 563
           + +N+P++PV  A QPE+  SPF+  +S P+KFVPY N   G  G  + F Q MVG    
Sbjct: 364 VPSNTPMLPVPEAVQPEIGISPFLSHASSPSKFVPYTNLATGNAGGGSHFPQHMVGPTIN 423

Query: 564 RTQPVRYVGQY-PLQAGPTFGPPNSQAVMVGRFGQLVYVHPVSHDLAQGTTVVSPVSPCP 623
           R QP R+  QY  +Q  P    PN Q VMVGR GQL+Y+ P+S DL QG    S + P P
Sbjct: 424 RGQPHRFTTQYHSVQPTPMLVNPNPQ-VMVGRSGQLMYMQPISQDLVQGAPHNSHLPPRP 483

Query: 624 LLTTQPAQYPKHQASSTNAGTAAAAAVQALQFCVPPPFMANGHQPLSAVPNHIPILQPSF 682
           L T Q  QYPKHQ        +  A  Q +    P PF ANGHQP + +P  IP++Q  F
Sbjct: 484 LFTPQQFQYPKHQ--------SLIATGQPMHLYAPQPFAANGHQPYTVMPTDIPVMQSPF 521

BLAST of Sgr020012 vs. TAIR 10
Match: AT5G54920.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G26990.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 206.8 bits (525), Expect = 5.4e-53
Identity = 168/496 (33.87%), Positives = 250/496 (50.40%), Query Frame = 0

Query: 204 VVLKKARMTKKGKRNVNVDDGVVIDTLIVLSGDLVQVVATEVLLPAGSFSKSLAGYDNEA 263
           +VLK A++TKKG+   NV+ G +++TL++LS ++VQ+VA  V     S S ++AG   E 
Sbjct: 64  IVLKNAKLTKKGRSKSNVESGKIVETLVILSSNIVQIVAEGV-----SLSSNVAG---EI 123

Query: 264 MANVPISLLPASETKTCMESFKEGSQMNQTSDLVQDQNGFAHGSVPTITGKHGDVRQLLR 323
                +S +  S       S K     N+  +  + +N        T+T   G+    ++
Sbjct: 124 EGENVVSAVAVSS----FNSGKNRRGTNRRRNSAKREN-CLESKARTLTS--GETAGAMK 183

Query: 324 DNAENNQGDAQQKRERINCKKPEGVTDAAINWRQDPDNQLKREQDDHGQEFDIHKRVNVD 383
           +    ++   +     +N ++  GV                R   +  +  D+H+  NV+
Sbjct: 184 EPGRRDEN--KYHPSSLNHQRQAGV----------------RILKNSKKITDVHQEDNVE 243

Query: 384 RVQSSISSE------KPCIERPTSANTSLNAY--------SVCVSTCSLSSVDSSMDSCH 443
              SS S +      KP IE+      S N +        S   S+   ++VD + +   
Sbjct: 244 ARSSSCSLDNMSERVKP-IEQEKMPEPSSNGFHDATERPSSTENSSSQSTTVDENSEVSL 303

Query: 444 SSITSTTDLAPSHVSESNKSAKEFKLNPRAKLFSPSVTNSMSATPA--ASMVASVAYISN 503
             + ST  L P+  ++ +K AKEFKLNP AK FSPS+   +++  A    +VA++ Y+ +
Sbjct: 304 VLVVSTNSLPPTQATDPDKKAKEFKLNPGAKTFSPSLAKRLTSAHAGMTPVVANMGYVPS 363

Query: 504 NSPVVPVAVA-QPEVEFSPFVPRSSVPAKFVPYGNSIAGFGGNVAQFSQPMVGHVGTRTQ 563
           N+P++PV  A QPE+  SPF+  +S P+KFVPY N   G  G  + F Q MVG    R Q
Sbjct: 364 NTPMLPVPEAVQPEIGISPFLSHASSPSKFVPYTNLATGNAGGGSHFPQHMVGPTINRGQ 423

Query: 564 PVRYVGQY-PLQAGPTFGPPNSQAVMVGRFGQLVYVHPVSHDLAQGTTVVSPVSPCPLLT 623
           P R+  QY  +Q  P    PN Q VMVGR GQL+Y+ P+S DL QG    S + P PL T
Sbjct: 424 PHRFTTQYHSVQPTPMLVNPNPQ-VMVGRSGQLMYMQPISQDLVQGAPHNSHLPPRPLFT 483

Query: 624 TQPAQYPKHQASSTNAGTAAAAAVQALQFCVPPPFMANGHQPLSAVPNHIPILQPSFPLN 682
            Q  QYPKHQ        +  A  Q +    P PF ANGHQP + +P  IP++Q  FP+N
Sbjct: 484 PQQFQYPKHQ--------SLIATGQPMHLYAPQPFAANGHQPYTVMPTDIPVMQSPFPIN 516

BLAST of Sgr020012 vs. TAIR 10
Match: AT4G26990.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G54920.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 186.0 bits (471), Expect = 9.8e-47
Identity = 174/498 (34.94%), Positives = 236/498 (47.39%), Query Frame = 0

Query: 203 GVVLKKARMTKKGKRNVNVDDGVVIDTLIVLSGDLVQVVATEVLLPAGSFSKSLAGYDNE 262
           G+VLK AR+TKKG    NV  G V+DTL++LS  +VQ++A  V LP+     ++   +NE
Sbjct: 59  GIVLKDARITKKGTSISNVASGSVVDTLVILSSTIVQIIAEGVSLPS-----NVTTANNE 118

Query: 263 AMANVPISLLPASETKTCMESFKEGSQMNQTSDLVQDQNGFAHGSVPTITGKHGDVRQLL 322
                    LP SE + C          N+++++     GF H                 
Sbjct: 119 --VGSATETLP-SEPRLC--------AANKSTNVSTQGRGFNH----------------- 178

Query: 323 RDNAENNQGDAQQKRERINCKKPEGVTDAAINWRQDPDNQLKREQDDHGQEFDIHKRVNV 382
                  Q  AQ  +  +                               Q  +++++ N+
Sbjct: 179 -----KRQAGAQILKRSV-------------------------------QIPEVYQQDNI 238

Query: 383 DRVQSSISSEKPCIER-----------PTSANTSLNAYSVCVSTCSLSS----VDSSMDS 442
           D +QSS SS     ER              +N   NA +   ST +L S    VD +++ 
Sbjct: 239 D-IQSSSSSLDSMSERVKPIEEDNLMPEPLSNGFHNAAAKPSSTDNLLSESTPVDDTLEL 298

Query: 443 CHSSITSTTDLAPSHVSESNKSAKEFKLNPRAKLFSPSVTNSMSATPAA-SMVASVAYIS 502
           C   + +++    S   ++ K  KEFKLNP AK+FSPS T  +S +P     V ++AYI 
Sbjct: 299 CRGRVAASS--TASVPIQAVKKPKEFKLNPEAKIFSPSYTKRLSPSPVGMPHVGNIAYIP 358

Query: 503 NNSPVVPVAVA-QPEVEFSPFVPRSSVPAKFVPYGNSIAGFGGNVAQFSQPMVGHVGTRT 562
           +N+P++PV  A  PEV  +P+VP++  P+KFVPYGN  AG      QF Q M+G    R 
Sbjct: 359 SNTPMLPVPEAIYPEVVNNPYVPQAPPPSKFVPYGNVTAGHAVGGFQFPQHMIGPTVNRA 418

Query: 563 QPVRYVGQY-PLQAGPTFGPPNSQAVMVGRFGQLVYVHPVSHDLAQGTTVVSPVSPCPLL 622
           QP RY  QY  +QA P    P+ Q VMV R GQLVYV  VS DL QGT  +SP+  CPL 
Sbjct: 419 QPQRYTAQYHSVQAAPMLVNPSPQ-VMVARSGQLVYVQSVSQDLVQGTPPLSPMLSCPLP 473

Query: 623 TTQPAQYPKHQASSTNAGTAAAAAVQALQFCVPPPFMANGHQPLSAVPNHIPIL-QPSFP 682
           T Q  QY KHQ           AA Q L  CV  PF   G QP   +P   P + QP FP
Sbjct: 479 TAQHVQYLKHQ--------GVVAAGQPLPLCVSLPFTTGGPQPY-GIPTQFPAMQQPPFP 473

BLAST of Sgr020012 vs. TAIR 10
Match: AT4G23160.1 (cysteine-rich RLK (RECEPTOR-like protein kinase) 8 )

HSP 1 Score: 70.9 bits (172), Expect = 4.6e-12
Identity = 39/100 (39.00%), Positives = 57/100 (57.00%), Query Frame = 0

Query: 1   MDPNLQLHASDNDYLQDPSVYRRLIGRLLYLTISRPDITF--YKLSQFVSKPCKSHLSAA 60
           MDP++   A       D   YRRLIGRL+YL I+R DI+F   KLSQF   P  +H  A 
Sbjct: 358 MDPSVTFSAHSGGDFVDAKAYRRLIGRLMYLQITRLDISFAVNKLSQFSEAPRLAHQQAV 417

Query: 61  HHLLRYLKASPGQGVFLPASSSFQISFPFCNTNFSTLRET 99
             +L Y+K + GQG+F  + +  Q+   F + +F + ++T
Sbjct: 418 MKILHYIKGTVGQGLFYSSQAEMQLQV-FSDASFQSCKDT 456

BLAST of Sgr020012 vs. TAIR 10
Match: ATMG00240.1 (Gag-Pol-related retrotransposon family protein )

HSP 1 Score: 51.2 bits (121), Expect = 3.8e-06
Identity = 27/72 (37.50%), Positives = 47/72 (65.28%), Query Frame = 0

Query: 29 LYLTISRPDITF--YKLSQFVSKPCKSHLSAAHHLLRYLKASPGQGVFLPASSSFQISFP 88
          +YLTI+RPD+TF   +LSQF S    + + A + +L Y+K + GQG+F  A+S  Q+   
Sbjct: 1  MYLTITRPDLTFAVNRLSQFSSASRTAQMQAVYKVLHYVKGTVGQGLFYSATSDLQLK-A 60

Query: 89 FCNTNFSTLRET 99
          F ++++++  +T
Sbjct: 61 FADSDWASCPDT 71
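
The two summary lines that open each HSP above (score/E-value, then identity/positives) follow a fixed pattern, so they can be read programmatically and the percentages rechecked. The following is a minimal Python sketch under that assumption; the function name `parse_hsp` and the returned dictionary keys are hypothetical and not part of this database or of any BLAST library. The pattern accepts both the "Postives" and "Positives" spellings of the label.

```python
import re

# Patterns for the two HSP summary lines as they appear on this page.
# (Hypothetical helper; not an official parser for any BLAST output format.)
HSP_SCORE = re.compile(r"HSP (\d+) Score: ([\d.]+) bits \((\d+)\), Expect = (\S+)")
HSP_IDENT = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp(score_line, identity_line):
    """Return the numeric fields of one HSP summary as a dictionary."""
    s = HSP_SCORE.search(score_line)
    i = HSP_IDENT.search(identity_line)
    if s is None or i is None:
        raise ValueError("unrecognized HSP summary lines")
    identical, aligned = int(i.group(1)), int(i.group(2))
    positive, aligned_pos = int(i.group(4)), int(i.group(5))
    return {
        "hsp": int(s.group(1)),
        "bits": float(s.group(2)),
        "raw_score": int(s.group(3)),
        "evalue": float(s.group(4)),
        # Percentages recomputed from the counts, rounded like the page.
        "identity_pct": round(100 * identical / aligned, 2),
        "positive_pct": round(100 * positive / aligned_pos, 2),
        # Percentages exactly as printed, for comparison.
        "reported_identity_pct": float(i.group(3)),
        "reported_positive_pct": float(i.group(6)),
    }


hsp = parse_hsp(
    "HSP 1 Score: 818.1 bits (2112), Expect = 2.6e-233",
    "Identity = 425/481 (88.36%), Postives = 448/481 (93.14%), Query Frame = 0",
)
```

Comparing `identity_pct` with `reported_identity_pct` is a quick consistency check on a copied or re-typed alignment header.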

The following BLAST results are available for this feature:
Match Name	E-value	Identity (%)	Description
XP_022151014.1	5.5e-233	88.36	uncharacterized protein LOC111019035 isoform X2 [Momordica charantia]
XP_022151013.1	7.9e-232	88.17	uncharacterized protein LOC111019035 isoform X1 [Momordica charantia]
XP_022151016.1	1.2e-227	87.97	uncharacterized protein LOC111019035 isoform X4 [Momordica charantia]
XP_022943019.1	4.1e-212	83.20	polyadenylate-binding protein-interacting protein 4-like [Cucurbita moschata]
KAG6600769.1	4.1e-212	82.74	Polyadenylate-binding protein-interacting protein 4, partial [Cucurbita argyrosp...
Match Name	E-value	Identity (%)	Description
Q94HW2	2.7e-09	37.65	Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana O...
Q9ZT94	5.1e-08	36.14	Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana O...
P93290	5.3e-05	37.50	Uncharacterized mitochondrial protein AtMg00240 OS=Arabidopsis thaliana OX=3702 ...
Match Name	E-value	Identity (%)	Description
A0A6J1DBR8	2.6e-233	88.36	uncharacterized protein LOC111019035 isoform X2 OS=Momordica charantia OX=3673 G...
A0A6J1DB05	3.8e-232	88.17	uncharacterized protein LOC111019035 isoform X1 OS=Momordica charantia OX=3673 G...
A0A6J1DD98	5.7e-228	87.97	uncharacterized protein LOC111019035 isoform X4 OS=Momordica charantia OX=3673 G...
A0A6J1FT17	2.0e-212	83.20	polyadenylate-binding protein-interacting protein 4-like OS=Cucurbita moschata O...
A0A6J1IG89	6.3e-211	82.99	polyadenylate-binding protein-interacting protein 4-like OS=Cucurbita maxima OX=...
Match Name	E-value	Identity (%)	Description
AT5G54920.2	3.1e-53	33.87	unknown protein; FUNCTIONS IN: molecular_function unknown; BEST Arabidopsis thal...
AT5G54920.1	5.4e-53	33.87	unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT4G26990.1	9.8e-47	34.94	unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT4G23160.1	4.6e-12	39.00	cysteine-rich RLK (RECEPTOR-like protein kinase) 8
ATMG00240.1	3.8e-06	37.50	Gag-Pol-related retrotransposon family protein
InterPro
Analysis Name: InterPro Annotations of Monk fruit (Qingpiguo) v1
Date Performed: 2022-08-01
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
None	No IPR available	COILS	Coil	Coil	coord: 323..343
None	No IPR available	PANTHER	PTHR12854:SF12	POLYADENYLATE-BINDING PROTEIN INTERACTING PROTEIN	coord: 203..679
IPR045117	Ataxin2-like	PANTHER	PTHR12854	ATAXIN 2-RELATED	coord: 203..679
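
The `coord: start..end` values in the Alignment column can be turned into intervals to compare domain hits on the protein. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive (an assumption, not stated on this page); the helper names `parse_coord`, `span_length`, and `contains` are hypothetical.

```python
import re

# Matches the "coord: 203..679" notation used in the Alignment column.
COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def parse_coord(text):
    """Extract (start, end) from a string containing 'coord: start..end'."""
    m = COORD.search(text)
    if m is None:
        raise ValueError("no 'coord: start..end' field found")
    return int(m.group(1)), int(m.group(2))


def span_length(start, end):
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def contains(outer, inner):
    """True if interval `inner` lies entirely within interval `outer`."""
    return outer[0] <= inner[0] and inner[1] <= outer[1]


panther = parse_coord("coord: 203..679")  # PANTHER PTHR12854 hit
coil = parse_coord("coord: 323..343")     # COILS hit
panther_len = span_length(*panther)
coil_inside_panther = contains(panther, coil)
```

Under the inclusive-coordinate assumption, the predicted coil falls inside the region covered by the PANTHER hit.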

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
Sgr020012.1	Sgr020012.1	mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
biological_process	GO:0034063	stress granule assembly
cellular_component	GO:0010494	cytoplasmic stress granule
molecular_function	GO:0003723	RNA binding