Sgr014298 (gene) Monk fruit (Qingpiguo) v1

Overview
NameSgr014298
Typegene
OrganismSiraitia grosvenorii (Monk fruit (Qingpiguo) v1)
DescriptionProteasome subunit beta
Locationtig00000289: 212096 .. 234387 (-)
RNA-Seq ExpressionSgr014298
SyntenySgr014298
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGTTGGAGAATCAAACAGGGAGCAGAAAAAGTTGCTTGTTTGAAGCTGCAAAACGTGCACGCAAGGTTCCCCTCCGTTGTCCCTCACACACAAACTGTCTTCTAAAACCGATTATTAATGTTGCGTCTTCAACTATCTCAGTGCTCCACTTAGAAAAGTGAGCCTTTAAAGTTAATAAAATACACCAACAATTTTTAAATAGTTAAAATAAAAAAACAAATATTTTTTTAATATCTAATAGTTATGAATTGTGAATTATGGTGACTATTCATAGTTTTTGAGATGCGAATCTCTCACTCTGTATGCATTACGAAGGAACCTAAAATTTTCGAGGCAATGAATTCTCATTAATGTCTGTATTATTCAAGTTGACATTCTCGCTTTCATTGCGTCTTCACAATTTTGACAAAATCACTACTAGGCAAAACTATTGGTGGTCTCATAACCCCTACCTCTTTTATCACATATTTTATTTTTTATAGCCAAAATCTATTTTCATTTTTAGTGGGGCGAAGATTCAAACATTTAACCTTTAAGATAGGATAGCAATGCCTCGACCGATTGAACTATATTGCATCACATGTAATATACTTGTTTCCAAATTGAATACAAGGATCGGATATATGTAATTTATAACATAACTCTGCCATGACATTACAAATTTTTGTAAAATGTGCTAAAGGAATTTAATCACATGGTCATCAAGGATATTTTTATACAGAAATTAAGGATGAAAATATAGTATTAGAAATATAAAAATTAAATCAAAACATGAAATATAAAAAATTAAAATATAATTCAGATCATGGATCTAACAATAACCGAACGTTAGTGATTTGAGTATGAAACTTCAATCCAGCGAGAAGCAGTTGACTTCACCCTGCAGAATCGAAAATAAAATTGAATATTAGTTAAGTCAATGGTTCTGTCTTTGAATTAATTGTTTATTTATTGGCATAATCAATGCTAAGTACAGACACCAGTTAAAATAAAACAAGTTGAATTAAGATAAAATATTATTTTCGTTCCCTTTTATTTCTCTACATTTTTTTAGAATACAATTTTTTTTTCAGTACGAATACAATTTTAATGCTGTATTTGGAATAAACTGCCATTTTGATCTCTAATATTAATTTTAAGATATGATTTTTTAATTAATTAAAAGTAATTTTTACTCTTCAAAAAGTCATTTCAAGTTCACTTTTATTCTTTAATATTTTATGTAAAATGGCATAATTTATTATTTATAAAATGTAGAGATTAAAAAAGGGACTAAAACCACAATTAAATCTTAAAAGGAAAAATATTATATTTCATGTGATATTTTTTTGAATTTGATTGATTCATAGAATGGGCCAAGAAGGCCCAGGGCTATTGGGCAAATTCATAGAATTAATCTAAGTTCATGGATTTAAAAACATGATTTAAAATTGATTAAAAACTTATTAAAATTATTAAAACGGATTTATAACTTTTTTAAAAATCGATTTTACGGATAAACTAGTAAAAGTTGTCGATTAGTTATTTATAACAATTCAATTAGTCAATTTAGATTAAAAAAAGAGCATATTTTTGGTATTCTAATCAGCCTCTGATTGCATATAAAATTATATAAAAAAAATCAATCTTTAAAACTATACCTTTTTAACAAAAATTAGAAAATTGCCTAGCTGAAATTTACGGGTTTTAAGGTTATTGTTCAAGGGTCTTTTGAGCTCTTCAAAACATAATTTATAGTTCTAAAAATCATGTTATATCTATAAAATATAACTTTCAAATAATTTGCGAAAATTCAATACAAAAATCTCAGAAATTAGATTTTCACTCTAGGAAACTAGTTGGAGTGATTTTTGAAAATTTTTTAGGAAAAAATTTTGTACCTTTCACAGAGCAACTAATCTCAATTTGAGGGTTGAATTTTCAGAGATCTCCACTTCAATATTTACCTCCCTCTCTAACAAGGTTCCAACCTTGACTTGTTTAGAATTCTTAAATACAAACCGAAGGGTGCAACTCCATGTCACACAAAATAGCATGCGGTATAAAAATTATCGAAGCAGAGGAAAAAAATATTGATATGAAGGGAGGGTAATAGCCAATCCGAGCATAGTTTAATAGATAAAACGTTTATTACCATCCCAAAGATCAATGGTTCGATCTCTCACCCTGCAATTGTTGAACTAAAAAAAAATATGAAGAGGGTAATAAATGAGGATTGTTAATTTGGTGGTAGTGGTTGAATCTTGAAATGTTTGTTCTTAATTTTTTTAAAAATAAAAAATGAAAAACATGAAATAAAAAATTTTCAATTAGGTCATTTCATAAAAAAAAAAAATGATAAAATTTATTAATAGAAAGTTTACGTGTCATTAAATTGTTGTTTTATTAACATATTTGGACACGTTAGCATGCTACTTAGAAGCAATAGATTTTGTTGATGGGCGTAGTTGTATTTTGCTAATCGGCGAGTTAGGATTTCATGTGTCAAGTAGCATCCAAGTGACATGTCTAAATATATTAATAAAGCAATAATTAAATAATACGTCAACCTTTTGTTAACAAAATTATAACAATTTTCTTACAAAATGACCTAAATGAGAATTTTTAATATATTAATGATAAAATTGAGAATTTTTAAAGATTAAGAACGGAATTGAGAATGACCTAAAATATTACAGACCATTGTGTATATTTAACATAAACATTAAACATAGGAAAATATATGAAAAACAATAAAACACTATTTATATTTTATGTTATTATTATTATTTTTTGAAATAGATTTTAAAAACTATAGTACCAAACAAACTTTTTGCATAAAAAATAGGTTTTTATTCTTAAAAAACAAAAGATATTTTTTAAACACTACTCCCAAATAACTTGCTTCTACAAGTACATTGAGCTTGGGCCTAGGGTTAGGCGTGATACATTGCAAATTTACATTGTTTTTTTTTTTACAGAAATTTTAGCTTTGAGTGTTTTTTTTTATTTATTTAAAATAACTCTGTTACAAATGTAAAATAGAAAAAAAATGTTCACTTGTAATCTATTAAAAATGAGAGTATTTTTATATATAATTTTGTAGTAATCATTTGACTTATGTGAACCTATGTCATAAATATCTATGCATTGAAAACACGAAGTCAGTATCAAATTTAATAATATATATATATTTTTGGTTTTTAAAAAAGAATACTCGAAATAACGATGTGAATGTTTTATTTTATAATAAATTAATATTGAAAATACAATTGTGTCTATATATATATATATATTGAAAATACAATACAATTAATTTCAAAGAATGGAACTAAAAAAAAAAATCAAAGAATTTAAAGAAAATTACAGTGCAATTCGACCGGAAATTCCGGGGGTATTGATTATTGACTGTGGTCAGAAATATATTCTCAATCTCTCTCGGATCTCTCCCAAACGACCCGAACATTGCGAACGCGCGGGCAACGATAACAGTAATCTCAGCGACAGCGACTGCTTCGACTCCCATTTACTTTGGCGTGCCGCATTCCATTCGCGAATCGTCTTCTTCCTCTCGCATTCTGGTTCTCTCCCCTCTCCCTCATCGAATCGTCTTCTTCCTCTCGCATTCCGGTTCTCTCCCTCCTCCCTCGCCGGAATCGTTTTCTTACTCTCAGTCTCGCCGACGCGCTCACTCTTCTTTTCCTCTCGCTCGCATTCTGGTTCTCTCCCTCCTCCCTGGCCGGAATCGTTTTCTTACTCTCACTCTCACCGACGGGCTCACTCTTCTTTTCCTCTCGCTCGCATTCTGGGTCTCTCCCTCCTCTCTCGCCGGAATCGTTTTCTTAGTCTCGCCGACGCGCTCACTCTTCTTTTCCTCTCGCGCACATTCTGGTTCGCTCCCTCCTCCCGCGCCGGAATCGTTTTCTTACTCTCAGTCTCGCCGACTCGCTTACTCTTCTTCCTCTCGCTCGCATTCTGGTTCTTTACCCCTCACCAGAATCAGAATCAGAATCAGAATCGTTTTCTTACTCTCAGTCTCGCCGACTCGCTTACTCTTCTTCCTCTCGCTCGCATTCTGGTTCTTTACCCCTCACCAGAATCAGAATCAGAATCAGAATCAGAATCGTGTACTTCTTGATTTCGTTCGTATTGCTCCATCTCTCTGTAAAAACCGAACTACAGTGCAGAGCTCAACCTCAATCGCAACCGGCGGAGATAACAACGATTTCTGTGGCGAAAAGAATAGCAATGGTTCGATCCTCGGAATCCTCAGTCTCACCGACTCACTCTCTCGGAATCCTCTCTTTCTTTCTGTGTCTCTCGCTTTCTGGTAATTTTTTCTCCCCGTTTTCTTCACGCTTTCTTTGGTATATTATTTGTTTGATTAATGGATCCAACTTTAAATGTGTTGTTTAATTTGTGTAGCCTTGAAACTTGTTTAATATTATTCTAAACTTTTGTTAGATCCTCAAAGTTTTGAATACTCGAGGATCCAGTGCATTCCAGCCCTCGAGAGCATATCTTATGTTTCCTTCTTTCAGGTTCTCGTCTTATTCTCTCTCTCTCTCTCTCTCTGTTGTGCTGTTTTGCGTTTTGGACATTAACTCTTATTTCTGGGCTAAAATTCGGAGTTGCAGTGCCTCCATGTTTGAGATATGGGAAAGAAATTCTGTGGGCTGTACAGAGGGTGCCCTGCGTGAAGCACGATGCCTGTGTCCAAGCCAAGAACGGTGAGTCCTCTTCCCTTGAATTAATTGTTCGTTTTCTTGAGAAAATTTCTGTTTGTTTCGTTGGGAAATTTCTCTGGTTCTGCTAATGAATTTGCGACCCTCAGAGATCTTAGCTTAAAAAGGAAAAGACGCAATTGCAAATCCGTTCAATTTTGATTCGGCCATTTTCAATTTCTCTCTTACAACTTCGTTCCAAAGAACTCTCAATCAGCTACGAAAATTTAGTTTCTAGTTTAAGGGGAGTCGTTATCCCCATGAAGAATCCAAGTTTTGTCTTCCGTAGGTTATTGAATACTCGAATTTACTGGATTTCCCGATGCTGCTTCCAAACTTAATTTACTTGCGCCAGTTTTGTTCTGTAATACTTGGCCTGGATTCTCCAAAATTCACTTTAATATGCCTGTGTCAATAAAAGCATATTTTGGGGCTTTTGTTTGATCCCTGTTCCTTTGTCACTTAGGTTTATTTGTTGGCTGGAGTTTCTTCTTGAATGAATTGTTTTGTTTCTTATCAAAAAAAAAATTCATGCTTTGGAACAAAACTAAGATTGCCTATTGTCCATGTTCTCTCTCTGGGAAGGACATCTATGTTCTTATACCCTAAACCTTAAGTCCACATAAGAAATAGTATTGGCTTGGCAACCATCAAATTCTTTTTATGCAATTAGTTACATATTCAACTGTTCAAGTTACTATATTTAGTGATTTTGCTAGTTTTCTTGCAGAAAATATATGTTCCTCCACTGTGCCCATTTCCAAGACTAAAAAGAAAATTGTACAAGATTTCCACAATGCATGGGAGCTGATAATGAGGTAAAAGATGATGTTTTGAGCTTTCTCTTATGGGTTACAGGTTGATGTTCTGAACTTTCTCTTCTGGGATACAGGTTCTTCCACTGCTTGGTTAGCTTCCAGACAGAGTATGGTTCAAAACCTCTACTACTTACAATATCAACAAGCATAATTTCCTTTTGGTATGTTCCCATTTCCATGTTTTTCATGTCCTAATACTCATATGAGCGGTTGTATTTTATTTAAAAAGTATTTACAATTCTAACAAAATGTGTTTGGCTCTTTGGGTGGGATCACAGGAGGTTGTGCTATCATAATCAGAATTTTGGAGATATTGGGATGAAGAATCAACGAGTTGGTGAATGATTTCTAGCAAATCGTCGGAAGTAATTAACCTGGAAAGACAGATCCTTTAGGTATGTTCCCATTTCCATGTTCTTCATGTCTTAATTATATCCATTTGAACAGATGATTTGTTGCCCCTGCCTTTTAAGTAATACTTTTAAAATTTTAGCTAAATTTTAAATAATGTTTTTGGAAATTGTTTTTCTTTTAGTTTTTAAAATTTGGCTAAGATTATAGAAATGAGTCCAGAGAAGTAGATTAGAAAATAAAGAAATCAATAGTAATTTTTGTTTCTCTCTCCCTTTCTTCCTTTTTAACTTTGTTTTTTTCTCTTTAAAATTTTTTCTCATTTTTAAACTTAATTTTCTCTTCCTCTCTAAATTTTTTTTTCTCATTTTCTAAACTTTTTTCTCCCTTTAAAATGTTTTCTCAATTTTTAACTTTTTCCTATCTAAATTTTTTTTTTCATTTTTTAACTTTTTCTTATCTAAAAATTCTCATTTTTAAACTTATTTTTCTTTCCCTCCCTAATTTTTTTTCTATCTCTAGAACTTTTTCTCACATTTTATAAACTCTTTTCTCTCTAAAATGTTTTCTCACATTTTAACTTTGTTTTTTTTTCTCTCTAAATGTTTTCCCTCTAAAATATTTTCTCCAATTTTAACTTTTTTCTATTGAAAGTTTTTCTAATTTTTAAACTTGTTTATCTCTCCCTCCCTAAAATTTTTTCTATCTCGAAAACTTTTTTCTCACCTATTATAAACTTTTTTCATTCTCCAAAACGTTTTCTCACTTTTTAACTTTGTTTTTCTGTCTAAAATTTTTTCTCATTTTTTAACTTATTTTTCTCTTCCTCCCTAATTTTTTCTCTCATTTCATAAACTTTTTTCTTTTTCTAAAATAAAATTTTCAATTTTTAACTTTGTTTTTTTTTTCTGTAAATGTTTTTACTCTAAAAAGTTCTCAATTTTTTTTACTTTTTCCTATCAAAAAATTTTCACATTTTAAAACTTATTTTAATCTCACTCCTTAATTTTTTTTTATCTTGAAAACTTTTTTCTCATATTTTATAAACTTTTCTCTCTCAAAAGTTTTCGCACTTTTTAACTTTGTTTTTTTTATTGAAACTTTTTTCTCATTTTTTAACTTATTTTTCTCTCCCTCCCTCAACTTTTTTTCTATCTCTAAAACTTTTTTCTCTCATTTTATAAACCTTTTTTCTCTCTCAAAAGTTTTTTCACTTTTAACGTTAATTTTTTTCTCATTTTAAACTTATTTTTCTCTTCCTCCCTAAATTTTCTCCATTTTATAACATTTTTTCTTCTCCAAAACATTTTCTCACTTTTTAACTTGTTTTTTTCTCTGTAGATGTTTTCCCTTTAAAATATTTTCTCAAATTTTAACTTTTTACTATCATAAAATTTTCTCATTTTTAAACTAATTTTTCTCTCCCTCCCTAAATTTTTGTCTATCTCTAAATTTTTTTTCTCTCATTTTGGAAAAAAAATTCTCTCTCTAAAATGTTTTCCCTCACTTTTAAATTGTTATCCCTCTCTAATTTTTTCTCACTTAACTTTTTTTTCTCTGCCAAACTTTTTTCTTTCTTAAAATTTTTCTCATTTTTCAACTTTTTTCCTCTCTCCCTCTAAATTTTTGAGTAGTAAAGGAGGTGGTATTAAGAAAATTGATTGACTGGGAATCAAAATTTCTATTTCACCTCTTTTCTACCCTAAATTTGGGGATTTTTTTATAAAAAAATCTCTACATTTTTTTTGCACCTTTTAACTATTCTCTCTCCCTTCAACTTTTTTCTATCTCTAAAATTTTTACTTATTTTTTTCTCTCTCTAAAATTTTTTCTCTCAATTTATAAGCTTTAGTCTCTAAAATGTTTTTTCCACTTTTTACTTTGTTTTTACTCTTCTGAATGTTTCTCTCTAAAATATTTCTCACTCTAAAACTTTTTTCTAATGAAAACATTTCTAATTTTAAACTTATTTTTCTTTCTGTCCCAAGCTGTTTTCTATCTCCAAACTTTTTTCTCACATTTTATAAACGTTTTCTTCTCTCTCAGAAGTTTTCTCAATTTTTAAACTTTCTTTTTCTCTCTAAATTTTTTTCTCATTTTTAAACTTATTTATCTTTTCCTCCATAAAATTTTTCTCTCGTTTTATTAACTTTTTTCTCTCTCTAGAATGTTTTCTCAATTTTTAACTTTATTTTTTCCTCTCTAAATATTTTCACTGTAAAATGTTTTCTCAAATTTTAACTTTTTACTATCAAAAATTTTCTCATCTTTAAATTTATTTTTCTCTCTCTCCCTAAACCTTTTCTATGTCCAAACTTTTTCCCACCTTTATAAATTTTTTCTCTCTCAAAATTTCTTACTTTTTAACCTTTTTCCCTCTAAATTTTTTCTCATTTTAAAACTTATTTTCTCTTATTTTATAAACTTTTTTCTCTCTCAAACTTTTCTCACTTTAAACTTTATTTTTCTCTCTAATTTTTTCTCATTTTCTCTTCTTCCCTAATTTTTTCTCCCATTTATAAACTTTTTTCTCTTTCTAAATGTTTTTTTAACTTTTAACTTTGTTTTTTCACTCTAAATGTTTTACTCTAAAACTTTTATCAATTTTTTACAATTTTCTGTTGAAATTTTTCTCAATTTTAAACTTATTTCCTCTCCTCCCTAAACTTTTTCTATCTCCAAAACTTTCCCCTCATTTTATAACTTTTCTCTTTCGAAGTTCTCTCACTTTAATTTGTTTTTTCCTTAAATTTTTTTCATTTTAAACCTATTGTTCTCTTCCTCCCTAATATTTTTATCTCATTTATAAACTTTTTCTCTCTCTAAATATTTTTGACTTTTTAACTTTGTTTTTTCACTTTAAATGTTTACTCTAAAACTTTTTCTCAATTTTTAACAATTTTCTGTTGAAAAATTTTCTCATTTTTAAACTGATTTTTCCTCCTCTCTAAACTTTTTTCTATCTCCATAACTTTTTTTCCTTTAAATTTTTTCATTTTTAAACTTATTTTTCTCTTCCTCCCCAAATATTTTTATCTCATTTTATAAACTTTTTTCTCTCTAAAATGTTTTTTCACTTCTTAATTTTGTTTTTTTATCCCAAAATATTTTCTCAATTTTTTACTTTTTTCTATCGAAATTTTTTCTCATTTTTAAACTTATTTATCTCTCCCTCTCAAAACTTTTTCTATCTCCAAAACTTTTTTCTCTCATTTTATGAACTTTTTCCCTCTCTCAAAAGATTTCTCACTTTTTAATTTGTTTTAAATTTTTTTTCTTATTTTTAAACTTAATTTTCTCTTCCTCCCTAAATTTTTCTCTTATTTTATTAACTTTTTTCTCTCCCCAAAACTTTTTTTCACTTTTTAACTTTTTTTTTTCTCTAAATGTTTTCTCAATTATTAACGTTTTCAATAAAAAAAAATTTATCATTTTTTAAACTTATTTTTTTCTCCCTCCCTATACTTTTTATTATCTCATAAAATTTTTTCTCATTTTGAAAAAAAAAATCTCTCTAAAATGTTTTCCCTCACTTTTTAATTGTTTTCTCTTTCTAAGTTTTTTTTCACTTTTGAACCATTTTTTCTCTCCCAAGCTTTTTTGTTTCATAAAATTTTTTCTCACTTTTTAAATTTATTTTTTGTTTTTTTTCTCTCCAAATGTTTTCTCTCTAAAACGTTTTCTCAAATTTTAACTTTTTCCTATTGAAAATTTTTCACATTTTAAAACTTATTTTTCTCTCCCCCCTAATCTTTTTTCTATCTTGAAAACTTTTTTCTCTCTCTCAAAAATTTTCACAATCTTTAATATTGTTTTTTGTATTTAAAATTTTTTCTCATTTTAAACTTATTTTTCTCTCCCTCCCTGAACTTTTTTTCTATTTCTAAAACTTTTCTCTCATTTTATAAACTTTTCTCTCTCAAAAGGTTTTTCACTTTTAAACTTTATTTTTCCTCTCTAATTTATTTTTCTCATTTTAAACTTATTTTTCTCTTCCTCCCTAAATTGATTTCTCTCATTTTATAAACTTTTTTCTTTCTCCAAAATGTTTTTCACTTTTTCACTTTATTTTTTTCTCTCTAAATATTTTCTCAAATTTTATATTTTTACTATCAGAAAGTTTTCTCATTTTTAAACTAATTTTTCTCTCCCTCCTTAGACCTTTTTCTATCTCTAAAACTTTTTTCTCATTTTGAAAAAAAAATTCTCTCTCTCAAATGTTTTTCCCACTTTTTAATTGTATTCTCTCTCTAAATTTTTTTCACTTTTTCACTTTTTTTTCTTTGCAAAACTTTTTTCTTTTGTAAATTTTTTTTCTCATTTTTCAACCTTTTATTTTCTCTCTCTAAGTTTTTGAGTAGTAAACGGAGGTGATATTAAGAAAATTGATTGATTGGGAATCAAATTTTCTATTTCACGTCTTTTCTACCCTAAATTTAGGGACAGTTTTTAAAATTTAGTTCACCACTTTTTCTATCTGAAAAGTTTTTACTTATTTTTTTTTCTATCCCTAAAATTTTTTCTCTCAATTTATAAACTTTTTGTCTCTCTAGAATATTTTTTCACTTTTTGATTTTATTTTCTTCTCTCTAAATATTTTAACTCTAAAATGTCTATAAAATGTCTATATATCGAAATATTTTCTCATTTTTAAACTTGTTTTTCTCTCCCTACCTAAACTTTTTTCTATGTTCAAAACTTTTTTCTCAGCTTTAATAAACTTTTTTCTCTCTCCGGAAAGTTTTCTTACTATTTATCTTAGTTTTCTTTTCTCTAAATGTTTCACTCTAAATGTTTTCTCAAATATTAACTTTTTTCTATCTAAAAATTTCTCATTTTTAAACTTATTTCTCTCCTTCCCATAAATTTTTTCTCACCTTTAATTAACTTTTTCTTCTCTCTCAAAAGTTTTCTTACTTTTTAACTTTGTTTTTTCTCTCTAAATGTTTTTCTCATTTTTAAACTTAATTTTTTCTTCGTCCCGAAAATTTCCCCTCATTTTATAATCTTTTTTCTCTCTCTAAAATGTTTTTTTCAATTTTTAATTTTTTTTCTTTCTAAATGTTTTCTCTGAAATGTTTTTCAATTTTTAATTTTTTTTTCTTTCTAAATGTTCTCTCTAAAATGTTTTTCAATTTTTAACCTTTTTTCTATAAAAAAATTTTCTCATTTTTAAACTTATTTTTCTCTTCCTCTTATTTTATAAACTTTTTTCTCTCTCTCAAAAGTTCTCACTTTTTAACTTTGTTTTTCTTTCTAAAATTTTTTTTATCATTTTTAACCTTATTTTTCTCTTCTGCCCTAAATTTTTTTTCCCTCTCAAATGTTTTCTTACTATTTAACTTTTTCCTATTGAAAAATTTACTCATTTTTAAACTATTTTTTCTTTCCCTCCTTCAAGTTTTTTTTTTCTTCTAAAACTTGTGCCTCACTGTTTAAAATTTTTTCAATCTCGAAAACTTTTTTCTTTCATTTAAACTTTTTTCTCCATCTCAAAAGTTCTCACTTTTAAATTTGTTATTTTTCTCCAAAATATTTCTCATTTTTAAACTTAATTTTTTCTTCCTCCCTAAATTTTTAGTCTCATTTTATAAACTTTTTCTCTCTCTAAATGTTTTCTCAATTTTTAAACTTTTTTATCGAAAAAATTTATCATTTTTAAACTTATTTTTCTCTCCCTCGCTAAACCTTTTTCTATATCCTTTTTTCTCTCATTTTATAAACTTATTTTTCTCTTTCAACAGTTTTCTCACTTTTTAACTTTGTTTTTTCTGTTTAAATTTTTCTCAAACTTATTTTTCTGTTCCTCCCTAAATTTTTTTTATTTCATTTTATAAACTTTTATTTCTCTAAAATGTTTTCTCACTTTTTAATTTTGTTATTTTCACTCTAAATGTTTTCTCTCTAAGAACAATTTTTAACGTTTTTCTATCTAAAAATTTTCTTATGTTTAAACTTATTTTTCTTTCCCTCCCTAAACATTTTTTTATCTCTAAACCTTTTTTCTCACATTTTATAAACTTTTTGTTGTGGCATTAGTGCCCCATTCCCACATCTGTAATTTACTTGGAGATGGAGACGCTCATACCTTAACGTGCCTGATCCCCCTTGCCCGTACAGGTGTGAGAAAAATAGTATATTATAACTATTTTTAGATACCTTTTTATTAGTTAATTTCTAATCATATTTAGAAATTAATTTTGGTGAGTTTAAATATTTTGGATGTATGGGGTATTTGGTGAGTGCTTGAGCGTGCTCTTTTGTAATCTCATTTTGAATATTAGTATAAATTATTCTCTTCGCCCGTGGATGTAGGCTTAGTCCGAACCACGTAAGTCTTCTGTGTTCATATTTCTCTCTCTTCTCATTATTATATTTTAATTGCTATAATTAGATCTAGTTTATTATTCTGCTGCATCGTAAATACAACATTTTTTTCTCTCTCAAAAGTTTTTTTACTTTTTTTACTTTGTTTTTTCTCTCTAAATTTTTTTCTCATTTTTTAAACTTATTTTTTTCTTCCTCCCTAAAAGTTTCTCTCATTTTATAAAATTTTTTTTTCTAAAATATTTTCTCAATTTTTAACTTTTTTCTATCGAAAACATTTTTCATTTTTTAACTTATTTTTCTCTCCATCCCTAAAACTTTTTTTTTCAAATTTTATAAACTTTTTCCTCTCTCTAAGTTTTCTTACTTTCTAACTTTATTTTTTCTCATTTTAAATATTTTTCTCCCTAAATTTTGTTTATTAAATTTATTTCTCACTTTCAAACTTTTTTTCTCTCTAACTTTGTTCTAACTTTTTTTCTCTCTATAATTTTTGAACTATCTCTAAATTTTTGTCTCTTTCTCTAACTTTTTTCTCTTTCTAACTTTTTTTCTCCCAAAATTTTTATTTCTAACTCTCTAAACTCTTTTCTCTCTTAAAATTTTGTCATTGTCTGTCTCTAAAATTGTTGTCTCTAACTCTTTCTCAAATTTTTTTTTTAAAATTTACTTATGATTTAAAAGACATTTTTGAAAATTTAAAGTGATTTTTCTCTAAAATTTTTGTTTTTAAAGTTTCTTTCTCACTTTCAAATTTTTTTCTCTCTAACCTTTTTCTCAGGTTTTGTTTTTTCAATTTACTTACAATTTCAAAAATATTTTTGAAAATTTAAACGGATTTTGGGAAACACTTAAAATGTGATTTTAGAAAAATGGATTGACCGGAATCAATTGTTTCCTATTTCACATCATTTCTTCCATTACTCTGGGAAAGTTAAAATTTTTCCTAAATTATTTTACTCCCTTTTTCGTTAACTTTATTTATCTCTATTTTTCGCCCTTTCCAAATTTTTTTCTCAACTTTTTTCTCCCTCTAAAATTTTTGTACCACTCCCTTTCTAACTTTTTTCCTCCCTCTATAATTTTTTACCACTCTCTTTCTACCTTTGTTTCTCCATAAAATTTTCTCTTTTAACTTTTTTCTCTCTAAAGTTTTAAACCCTTCCTCTCTTTCTAACTTTTTTTCTCCAAAATTTTTGTCTACTATTCTCTAACTTTTTTTCTCTCCAAAATTTGTACCTTTCTAATTTTTCTCTCTAAATTTTTACCCTTAATGGATTACAAATTCTTAATTAAGTCCTTCCTAAAAAAATTGTTAAAATTTGTTTACAAAAAGTTGATGTATATCAATATTTTTTTTAGTTCAACAATTAGGGGGTGTGAGATCGAACAATTGACTTTTTGGATGGTAATAGGTAACTTATTCACTAAGCTATACTTGGGTTGGCAAAAAGTTGATGTATGTTATATTATTGTTTTATGTTAATATTTGAACATGTGGAAATTTCTCTCAGAATGTTACTTTACATAAAACTGTACGCCCATCAATAAAATTCATTTCTTATAAGTGACATGTTCATGTGGCATTCCCACATGCTCAAATATATTAATAAAGTAATAATGCAATAACACATCTGCTTTCTATTAACAAATTTTATTAGTTTTTTTTTTATGAAATGACTTAATTAAGAATTTTGATACATTAAGGGTAAAATTGAGAATTTTAGAAGACTACTCTTTTAACACGTGATATTTTTATTTTATTTTATCAAATATTAAAAATTATTTTTTAAAAAAATGTTGATTATGTATCAAATACTACGAGCGTTTATATTTTTTTAAAAAACTAAGTAATAAGTGATTTTTCAACCCAATTTGATAATTATGGTTCTTAAAGCTCAAACCATAGTTATTAAATTAAATAGCTTACCATTTTATTGAAACAATTAATTTGTATACTTTAATTATAACTATGAATCTCTTATAGAGATATAACAATTAGAATAGAAGAAACTGAGATGAATTTTGAATATAGAACGGATAAAATACTAAAATGGATTGTATTGACTAACCTCTATAACTATCTATATATATAATGAATTTATTGAATGAAAAAAGTGGGTAAATGAAAAGTTATAAATGTTCATGCATGTGTATGTATGTGGAATGTGAGAATGTGTAGAAGTTTAAAGCCATTATTACTCAAATAAATATGTAAGTGTAAGTGCAAGTGGGTTGTAGGAATACTAAAAGATTGAGTTATTATTACTCAATTTAACATGTATGTACGTGAGATGTGGGAAATGAATTTTGAAAATAAAGTTAAATAAAGTTGGAAAGGATAAATTAGGTTTTATATAAAATATTAAAATTAATATCGTCCAAAAAAATTTTCTTTCATATCCAGGCTTTTATATACAATAATAATTATAGATTGTAGGTTAAGGACTAAGTTGAGAAGTACTTAAAGGATTAGGGACTATTTGATATGTCTATCATTGCTAATGCGTATATTGAAGTTATTATTAAAAAAGAATAATTACATTCCTTCTAAAAATCAGATCATCAATAAAAATATCAAACAAAAAAACTTTATAAAAATCAGTCATTCAAACATATTTTCACCGAATTTTTATTCTAAAAAACTAAATATTGTTCTGAAAACACCCTTAATTATTTTTTTTTTAACTCCCTTCCACTTGATGTTATACCTCCACTTTTGTTTACAATAATTCTAATCAATTATGTTTAAATTCCTTGATTTTCACCCAATATAGATATATATTTTTTTCTATAACAAACTTTGGATTTTCCTTCTTGATGTCACTTAATGTATATTGAAGATACTAGCCCCTAACCTTTGTACCCAGACACATCGATATCTGTTTTTTATCATGCAAGTTCTACTTCCTTGGCCCTTATGCTTTCTCGTACTTCTCTTGCCATAATCAACTTAATCATACTTCTCTTATCTACTATATAGTAAACCATTAATTCAATCCTTTTTAAGCATACTTAGACTCAACTCCACATGAATGAGATATGACTATGAAATGACATGCTTGCCCTTTCATCTAACACATAAAATTGTCATGAACAAATTATATGCCCAAAAATCACATAGAATACACACGACAAATATATAACCAAACAAGTCAAGTATTAGGCAAAATAAACATATATGCGGAGGTGCCTCTGCGGCCATTCTATGAAACAATCAAGTTCAAAAAATTATCATATAAATGGTTCTAAGGAAACCTTAATGAGAACTAATGAGCTTCAGACTGACATTGGGCCTTCGTGGCCCATTTAACTCCCTACGTTCTCCAAATGGTATCTACACATGTTGGGCCGGCCCACAGTTGACGACAGCCATCCTTTTAGGGACTAGTAAAAAAATTTACAAGAATGTCAAAATGATTATAAATATTTACAATATTAACAATCCATTCTCCACATTTAAAAAATGAATTTTTCATTTATATCCTCTTACTTTAGACACCAAAATCTAAAACCAATTTCATGAAAAACTCGATTCATATATTTGTTATTTCTTTCAAATTTTAAATTGGTAAAAATCCATCTCTCATATTTTTAACAGAAAAAAGATTTTTACTGAATTCAAAAAGCTTCACCATAAATGTAAGGCAAGAAGGCAGACTAAGAGGAGAATTCTGCTTGCATGTTGCCACTTTCTAAGAAGAAAGATAACTCTCCGATATTTTCTACCCAAAAAAAATGATAAATATATGTTTTAGTCTTTAATTTTTTGGTATTTTTTCAATTTAGTCCCTATTGTTTAAAAAATTTCAATTTAGTCCCTTGCGTGTTAACATTTTTCAATTATCTCATTTTGTAGGTGAATGTTAAAATTGGTTGATAATAAACTTATTTGGCATGATGATAATTAGTGAGTTAGGAGAAATTTAAAAGGGGAATCAGGCTACCTAAGAGAAAAATCAATTTTTGCCTATGTTTTAGAAATTTGGAGAAATAATTAGAAAATTTTCTCTTTCATTCAGTTTAATTTCTTTTTTAAATTTCTGTCAACTTACTAAGTATCATGTCATATTTATTATCAGTTAATTAAAACATTTACAAACCTAAAGGGTATAATTAGAAAAATGTCAAAATGTAGAAAACTAAATTGAAAATTTTTAAACAATAAGGACTAAATTGAAAAAGTACTCAAATATTAAAAATTAAAATATATATTTAGTCTAAAAAAAATAGGGATATTAATAGACAAAATAAAAAAAGCAAACCAAAACTGAACCATTTTGATCTGGTTCGATAATTCAAAAAAAGAAATTAGATTTCTCGGTTTACAAAATTGAAAACCCAAAAATATTTAGCTTGGTTCAATTTTTTTTATTTTTTGTTAAAAAAAATCGAATCGAAGCAGTCTATCTATAATTATATAAAATTCTAAAAATATCCCTAGGCTTCTATTTATCCTTTCAGTAGCCCAATCCAGCCCTTATAAAGGAGGGGGAAAAAAAGCCCAATCCAGTTCGGTTGTCCATTTTCACAAACCGAGACCATTGGTTCGGTTCGGTTCGGTTTAGGCAAAAACCGACGATTTCTGATAATTTTAAGAATTATTAATTAGAAGTGAAGAAAGTAAGAAAGGCCGAAAGGGAGAGAGAAGGGGTTGGAGTGAAGCCCAAGCCATCTATAGCTATACTAAAAACCACTGTGTCGCAAGTTCCTGCAGTCCTCTTTGCGATTCAACTGTCTCTTCTCTCAGGTTTGGATTTCCTCCTTCCAATCTCCAAAATTTCAGTTCATCTTGTATCTGTGATTGTTTTCTATTAGGGTTTAGGAACTGGATTGCATTTCTTTGATTTTGATTTCGATTTCGGTCGGAAGAAATCCAAACATTTTCGATCGCTCTGAGCTGACAATTCTTTCCCTTCGTTTTCTCATGTCACAAAGATTCTCTTACTTTGTTCCGGGAAAGCTTGAGTTTCGGGCCTGATGTCGGTGCGTTTCTCCTCTTAGCATTATTGCCCTGCTACGGGAAGTTTTTGCTAGTATGTAGTTAATACATTGTGGTATTGGATTGTCATTGCAGATCTTTGAGTATAATGGAAGTGCCGTAGTTGCGATGGTGGGGAAGAATTGCTTTGCCATCGCTAGCGATCGTAGGCTTGGAGTTCAGCTGCAGACTGTTGCCACAGATTTCCAGAAAATTTACAGGATTCATGATAAGCTGTTTCTCGGCCTTTCGGGCCTCGGTACCGATGCGCAGACACTGTATGTATCTGTGGTAGATTCGTTTGGTTTAATTTTCCATTGCTTAACTTAACTCCTTCTGCCCTTGAAATGAAGGTATCAACGGCTTGTCTATCAGCACAAATTGTACCAGCTCCGAGAAGAGAGGGACATGAAGCCTGAGACATTTGCTAGCCTCGTCTCAGCTCGTCTTTATGAGAAAAGGTCGGACAACTACTGAGATTTTATTTTTCTCTTGGTCTTAATCACGTATACTCTCTACTTTTCCGCCTTGATTAAAAAACATGGTTGAGATTTCTTTAGGATTTGCACTGAGTTTCTTTTTGTATGTGTTGATGTTTCTGCAAATGTGATCCTTGTGTGTTGGGGTAAGACTGTAAGCTGCCTTGATAGAGATATTGCATTGTACTAAGAGTCACTCGAGCGTTTATGATGTGTCAACCTTCAATGAGTTATAGTGCAGATATGTAGTCAAGGTGTGTATATGGAATTGGTATATGTTAAACTGACTGGCAAGATATTTTTATACTTTGAAATAGGATTACAGGTTGAATAGTTGAAATGTGGATAAACCTATGAACTGTATTGAATCAAATGTCTCAATCACAAGATAGTTCTGTGTCCCTGTCCTAGCTTGAAGGCAATAAAAACTGTTGTGTCATAATCACAATGAAAATGAAATTGAAATTGGTACAATTAGTCCTTCCTCCTATGGAATCATTAGTTTGCAAGTTTGCTTTTAGTGCATGCCATGAATATACAAAAAGCTTAAGTACTATTTCCAAGATTCAAATGCTTGAAATATATAGTCTTCTGTCCATGAAGTTGTGTTATGATTTGGAAAGTAGTATCTCCCTGAAAATTGTTTGTTCTATAGTTCTAGTTGATCATCTTTGGATCTTGAAGAGGTGGTTCTTTTGGAAATGGTCTCATACCCCACCCCCCCCCCCCCCCCAAAAAAAAAAGGATTGGAACTGTTTGTTTTTTTTCCCCCTCTGCATATGCAGATCAGATAAAAAAAACTCGTATAATTTTTCTTATCTGTAGATTTGGTCCATACTTCTGCCAGCCTGTAATTGCTGGATTGAGCGAAGACAAACCCTTTATTTGCACGATGGACTCCATTGGGGCCAAGTAAGTAGTTTTTCCTTTCTTCTTTCTCTGTCTATTCTCAACTTTTTGCTTTTGTTTGATTTGAATTTGCACCTAAATTAGTTATATTGGTTTCTTCATCTCGTTCGATTTCAAAATGTGCATCAGACTTGAAGTTGCAGTCTCTGCAACACATTGTAATGTTAATTCTCATGGTTTAATGAAATCTGATTTTTGGGATAAATTGTTTTTAGGTTTAGGATTATTGAATGTAAAACTTATACATTGTCAAATTGATTTTTCCCAGAGAGCTTGCTAAGGATTTTGTTGTTTCTGGCACTGCATCTGAGTCCCTTTATGGTGCCTGTGAGGCAATGTTTAAACCTGACATGGTATGTTGCTATAATCCATTTACATATGCTTTTTATTCATCATTGGGATTCTAAATTGCTTTGCATTGAGGAAGATATATGCCTAAGTTGCTTGTTTGCTTAATCTTTGTGATTTAGTCATTTACCATTTCCATAGGGGTGTTAGCGGTTTGGGAGGGGTCAATTTTCCCAACCAAATGGTATCAATAGTTGAAAATAAAATATCCAAAATAACTGAGCAGCAACTTAGATGTGCTAAAACCAAGCTTACTCACCAAAAGTCAGTTGGTTAGTCAGTTCAGTTAATCGATTTCAGTGCTGTATGCCTGTGCTGCAGGTAGATCTATCTTCCATTTCATTACAGGCATTACCCCCACCCCACCAACCAAACCAACACCACCCCCCCCCCCCCCCCAAAAAAAAAAAAAAAAAAACCCTTCAAAACAGAGTACAAAACACCAAATGCTTTGGCATGAGAATGAAGAATAAACAAAAACTTATACCTAGAATCATAAAAGAAAAAAGAAGAAGCATATGCCCATGAAAGGAAGAGGCAGAGAAATTAGTGCGAGCATGTGTGTGAAAGAGAGATAGTAGAAAGAAATTAATAATAAGGATTCAGTTACACATACACCATTACCTTAAAGACTGAAAGAACAAATTAGTGAGGGCAAAAATATGTCTTCAAGTTGGAGGAGGCAAACCAGTGATAATTTTCGAACAAGGATGATCAACAAAGTTCATGGATTGAATCATGGAAGAGAAGATGAGAAACAAAGATAATTTTCTGGCTATGGTTAGTGACTTAATTACGTTCACAAATTGAATGTAATGAACAGATGAAGCGATGCAGAAATCTGTGACTCACAAGAAGAAATCAAGGAGAAAGATAAGGGAGAAGACAATGAGAGGGAGATATTCAGGGGTGGGGATCAACGTGTCTGAGTAAGAGTAACTGAAGGGTAAAACATGTATATAGAAGTGGGACTGGACAGTTTTATTATCAGTTGGTTGGATTGACAGTTTTGGTTTCAGTCGGTTTCCTTAGTTTGTTGGTTCTGTCTTGCACCCTGAATTATCCAGGTGTGATGCATTGGTTTGAGCTTTTTGAAAGACAACCACTATAATTTTGTGGCTTCAGATATTTATTTGAATATCTCGTCAGAATGGAAAGGCACTGTAGTTTCTTGTATAGGATTGTGCGAAAAGCTGCAAATGATGTTCAAATAGGACCGCTTGATATTCTTATTAGGGAAAAAAAATTCCCGTCTTAGATCTTTTCTGAAATGTTCTTACCATTTAGTTAACTCTGTCAATTAAATTTTGTATTGTCCAGGAACCAGAGGAATTATTTGAAACCATCTCTCAAGCGCTGCTGTCTTCTGTGGATAGGGACTGTTTGAGTGGATGGGGAGGGCATGTCTATGTTGTGTAAGCCACATATTATTATTCCTCGATGTTTAATGCTTGAGATACCTTTCTGCATGGTTTATTGGTATTTTTTTCCCTCTTTGGAAAGTAGTTTGTTGTGATTTTGCTGTCGGATGTTGTTTTTACCGAACACTTAGCACATTACCTTGGATGAAATAATAATAATTGTAATAGAAACTATCATGAGGTCATTTTGTTTAAAGTTGTGCAATTTATTACCTGGTTTAAAGGTATGCATTATTTAGTCTCCTATTTACATTTCCAATCTTCATGATGTTGCATGTAATTTTCAAATGCACCCCTACCTGTTTATGTTGGATCGTTTTTTATTTGCTCTAGTTAATAATTTAGACCATTATGGATGCCCCTGGACAGTTGGATCACCATTTCAAAAGTATGCATTATTTACCCAATGGATTTTTCTTACTTTTCTGCCTTGCTCTTACCATTTTTTCCTTTCACCTTTCTTGTCTTCCATCGTATATTTGTGATTGAAATAATCCTATCATGTGCCTCTCGTTGCAGTACACCGACTGAAGTAAAAGAGAGAATCTTGAAGGGACGAATGGATTGATCTTTCTTCTGATGCATGAAAGATGACTTGTTCAATGGTTACTTCCTATTAGGCTGCAGCACAGGCATGGGAGTTTGTGAGTGAAATGCAAAAAAAATTTCATGTTGGGCTGAAACTTGTAATTTGCTCTATATATTTGACATGGAATTCGCTTTATCTAGAATTGTCTACATGAAACGTGGTTTATGAAACCAAATGTGCCTTTTAATTGCGAAATTGATTAATTTAGATTCCAAATTTACCAATTACTCTATGATGGTAACTTTGCATGTGCACATCTTTAAGTTTTCATTGAATTACATTTTGCATCAAGTTGTAGCCCCATTATATTGTCTTGTCAAAACATCCATTCATTGAATCAGTTCTCATACGGCCTCCAGTATGAACAAAGGTTGTGGGCCAAAGAGAGGCCGTGATGGACTTGCTGAGAGGCGAGGTAAGAATTGGCAAGTTTTTGCTATCTGAAATGCAGGATTGAGGCATAATGTTACTTGACACAGACGACTTTTCATTTCTTTGCACAAAACTTGCCTTCTCTGCAAAGAAAACTTAGAACCCCTCCTCTAAAGATGGATGAGAGGCTTGGTAAGTTGAAGAACTTGAACAAGGTGGGAATCTGGAATATTGATATGTCCATTAGCGTTGGTTTTTAGTTGCCTTCTATGAAAGTTGTCTCTCAATCAAGAAACCAGCGAAAAAAAACATCCTCATATATCCTAATCCATGGCTCTTGTGTTGCATTGTTCTCGAAGATTTTGATCATTTGTAATGGAAGAGCAAGAGGATAAACAACCACCCCAAAGAATTTCCCTTGCTTATGTGGAACAGGTGAACTCAGATTTTATCATGGCCTTGGCAATGCAGCAGCAGGAGCACGAGAGGGCCTATACGATGCTCGAGACCATCGAAAGTGACAGCGAAGAAGACGAAAGTTTAGATTCCAACAGCAACAATTGTCTCGACACCAATGATTCCTTGCAAAGCCAAGAGCTTCAATCTCGGTGGGATTTTCTTGCAGAGGACGAGGAAACTTCCGATGAAGATATGGACGATGATGATGAAGAGGATGATTTTGATTTGGATGAGCTAACTTACGAAGAGTTGGTTGCGTTAGGAGAGTTCATTGGAGAAGAAAAGAGGGGGCTGCCTATGAATGAAATACCTTCGTGCTTGCACTCAGGAAAGTTTCGAACAGTTGAAAGCAGAAGTGGGATGGATCGGTGTGTGATTTGCCAAGTTGAATATGAAGATGGTGAAGAATTGGCTGCTCTTCCTTGTGAGCATCCTTACCATTCAGAGTGTATAAGCAAATGGCTTCAAATCAAAAGGGTTTGTCCAATATGTGGCTCAGAAGTTTCATCCCCCAAGGCTTCTTCAAATGCATAA

mRNA sequence

ATGTTGGAGAATCAAACAGGGAGCAGAAAAAGTTGCTTGTTTGAAGCTGCAAAACGTGCACGCAAGGTTCCCCTCCGTTGTCCCTCACACACAAACTGTCTTCTAAAACCGATTATTAATGTTGCGTCTTCAACTATCTCAGTGCTCCACTTAGAAAAAAATATATTCTCAATCTCTCTCGGATCTCTCCCAAACGACCCGAACATTGCGAACGCGCGGGCAACGATAACAGTAATCTCAGCGACAGCGACTGCTTCGACTCCCATTTACTTTGGCGTGCCGCATTCCATTCGCGAATCGTCTTCTTCCTCTCGCATTCTGGTTCTCTCCCCTCTCCCTCATCGAATCGTCTTCTTCCTCTCGCATTCCGGTTCTCTCCCTCCTCCCTCGCCGGAATCGTTTTCTTACTCTCAGTCTCGCCGACGCGCTCACTCTTCTTTTCCTCTCGCTCGCATTCTGGTTCTCTCCCTCCTCCCTGGCCGGAATCGTTTTCTTACTCTCACTCTCACCGACGGGCTCACTCTTCTTTTCCTCTCGCTCGCATTCTGGGTCTCTCCCTCCTCTCTCGCCGGAATCGTTTTCTTAGTCTCGCCGACGCGCTCACTCTTCTTTTCCTCTCGCGCACATTCTGGTTCGCTCCCTCCTCCCGCGCCGGAATCGTTTTCTTACTCTCAGTCTCGCCGACTCGCTTACTCTTCTTCCTCTCGCTCGCATTCTGGTTCTTTACCCCTCACCAGAATCAGAATCAGAATCAGAATCGTTTTCTTACTCTCAGTCTCGCCGACTCGCTTACTCTTCTTCCTCTCGCTCGCATTCTGGTTCTTTACCCCTCACCAGAATCAGAATCAGAATCAGAATCAGAATCGTGTACTTCTTGATTTCGTTCGTATTGCTCCATCTCTCTGTAAAAACCGAACTACAGTGCAGAGCTCAACCTCAATCGCAACCGGCGGAGATAACAACGATTTCTGTGGCGAAAAGAATAGCAATGGTTCGATCCTCGGAATCCTCAGTTCTCGTCTTATTCTCTCTCTCTCTCTCTCTCTGTTGTGCTGTTTTGCGTTTTGGACATTAACTCTTATTTCTGGGCTAAAATTCGGAGTTGCAGTGCCTCCATGTTTGAGATATGGGAAAGAAATTCTGTGGGCTGTACAGAGGGTGCCCTGCGTGAAGCACGATGCCTGTGTCCAAGCCAAGAACGGTGAGTCCTCTTCCCTTGAATTAATTGTTGATGTTCTGAACTTTCTCTTCTGGGATACAGGTTCTTCCACTGCTTGGTTAGCTTCCAGACAGAAAGTGAAGAAAGTAAGAAAGGCCGAAAGGGAGAGAGAAGGGGTTGGAGTGAAGCCCAAGCCATCTATAGCTATACTAAAAACCACTGTGTCGCAAGTTCCTGCAGTCCTCTTTGCGATTCAACTGTCTCTTCTCTCAGATTCTCTTACTTTGTTCCGGGAAAGCTTGAGTTTCGGGCCTGATGTCGGTGCGTTTCTCCTCTTAGCATTATTGCCCTGCTACGGGAAGTTTTTGCTAATCTTTGAGTATAATGGAAGTGCCGTAGTTGCGATGGTGGGGAAGAATTGCTTTGCCATCGCTAGCGATCGTAGGCTTGGAGTTCAGCTGCAGACTGTTGCCACAGATTTCCAGAAAATTTACAGGATTCATGATAAGCTGTTTCTCGGCCTTTCGGGCCTCGGTACCGATGCGCAGACACTGTATCAACGGCTTGTCTATCAGCACAAATTGTACCAGCTCCGAGAAGAGAGGGACATGAAGCCTGAGACATTTGCTAGCCTCGTCTCAGCTCGTCTTTATGAGAAAAGATTTGGTCCATACTTCTGCCAGCCTGTAATTGCTGGATTGAGCGAAGACAAACCCTTTATTTGCACGATGGACTCCATTGGGGCCAAAGAGCTTGCTAAGGATTTTGTTGTTTCTGGCACTGCATCTGAGTCCCTTTATGGTGCCTGTGAGGCAATGTTTAAACCTGACATGGAACCAGAGGAATTATTTGAAACCATCTCTCAAGCGCTGCTGTCTTCTGTGGATAGGGACTGTTTGAGTGGATGGGGAGGGCATGTCTATGTTGTTTCTCATACGGCCTCCAGTATGAACAAAGGTTGTGGGCCAAAGAGAGGCCGTGATGGACTTGCTGAGAGGCGAGAGCAAGAGGATAAACAACCACCCCAAAGAATTTCCCTTGCTTATGTGGAACAGGTGAACTCAGATTTTATCATGGCCTTGGCAATGCAGCAGCAGGAGCACGAGAGGGCCTATACGATGCTCGAGACCATCGAAAGTGACAGCGAAGAAGACGAAAGTTTAGATTCCAACAGCAACAATTGTCTCGACACCAATGATTCCTTGCAAAGCCAAGAGCTTCAATCTCGGTGGGATTTTCTTGCAGAGGACGAGGAAACTTCCGATGAAGATATGGACGATGATGATGAAGAGGATGATTTTGATTTGGATGAGCTAACTTACGAAGAGTTGGTTGCGTTAGGAGAGTTCATTGGAGAAGAAAAGAGGGGGCTGCCTATGAATGAAATACCTTCGTGCTTGCACTCAGGAAAGTTTCGAACAGTTGAAAGCAGAAGTGGGATGGATCGGTGTGTGATTTGCCAAGTTGAATATGAAGATGGTGAAGAATTGGCTGCTCTTCCTTGTGAGCATCCTTACCATTCAGAGTGTATAAGCAAATGGCTTCAAATCAAAAGGGTTTGTCCAATATGTGGCTCAGAAGTTTCATCCCCCAAGGCTTCTTCAAATGCATAA

Coding sequence (CDS)

ATGTTGGAGAATCAAACAGGGAGCAGAAAAAGTTGCTTGTTTGAAGCTGCAAAACGTGCACGCAAGGTTCCCCTCCGTTGTCCCTCACACACAAACTGTCTTCTAAAACCGATTATTAATGTTGCGTCTTCAACTATCTCAGTGCTCCACTTAGAAAAAAATATATTCTCAATCTCTCTCGGATCTCTCCCAAACGACCCGAACATTGCGAACGCGCGGGCAACGATAACAGTAATCTCAGCGACAGCGACTGCTTCGACTCCCATTTACTTTGGCGTGCCGCATTCCATTCGCGAATCGTCTTCTTCCTCTCGCATTCTGGTTCTCTCCCCTCTCCCTCATCGAATCGTCTTCTTCCTCTCGCATTCCGGTTCTCTCCCTCCTCCCTCGCCGGAATCGTTTTCTTACTCTCAGTCTCGCCGACGCGCTCACTCTTCTTTTCCTCTCGCTCGCATTCTGGTTCTCTCCCTCCTCCCTGGCCGGAATCGTTTTCTTACTCTCACTCTCACCGACGGGCTCACTCTTCTTTTCCTCTCGCTCGCATTCTGGGTCTCTCCCTCCTCTCTCGCCGGAATCGTTTTCTTAGTCTCGCCGACGCGCTCACTCTTCTTTTCCTCTCGCGCACATTCTGGTTCGCTCCCTCCTCCCGCGCCGGAATCGTTTTCTTACTCTCAGTCTCGCCGACTCGCTTACTCTTCTTCCTCTCGCTCGCATTCTGGTTCTTTACCCCTCACCAGAATCAGAATCAGAATCAGAATCGTTTTCTTACTCTCAGTCTCGCCGACTCGCTTACTCTTCTTCCTCTCGCTCGCATTCTGGTTCTTTACCCCTCACCAGAATCAGAATCAGAATCAGAATCAGAATCGTGTACTTCTTGATTTCGTTCGTATTGCTCCATCTCTCTGTAAAAACCGAACTACAGTGCAGAGCTCAACCTCAATCGCAACCGGCGGAGATAACAACGATTTCTGTGGCGAAAAGAATAGCAATGGTTCGATCCTCGGAATCCTCAGTTCTCGTCTTATTCTCTCTCTCTCTCTCTCTCTGTTGTGCTGTTTTGCGTTTTGGACATTAACTCTTATTTCTGGGCTAAAATTCGGAGTTGCAGTGCCTCCATGTTTGAGATATGGGAAAGAAATTCTGTGGGCTGTACAGAGGGTGCCCTGCGTGAAGCACGATGCCTGTGTCCAAGCCAAGAACGGTGAGTCCTCTTCCCTTGAATTAATTGTTGATGTTCTGAACTTTCTCTTCTGGGATACAGGTTCTTCCACTGCTTGGTTAGCTTCCAGACAGAAAGTGAAGAAAGTAAGAAAGGCCGAAAGGGAGAGAGAAGGGGTTGGAGTGAAGCCCAAGCCATCTATAGCTATACTAAAAACCACTGTGTCGCAAGTTCCTGCAGTCCTCTTTGCGATTCAACTGTCTCTTCTCTCAGATTCTCTTACTTTGTTCCGGGAAAGCTTGAGTTTCGGGCCTGATGTCGGTGCGTTTCTCCTCTTAGCATTATTGCCCTGCTACGGGAAGTTTTTGCTAATCTTTGAGTATAATGGAAGTGCCGTAGTTGCGATGGTGGGGAAGAATTGCTTTGCCATCGCTAGCGATCGTAGGCTTGGAGTTCAGCTGCAGACTGTTGCCACAGATTTCCAGAAAATTTACAGGATTCATGATAAGCTGTTTCTCGGCCTTTCGGGCCTCGGTACCGATGCGCAGACACTGTATCAACGGCTTGTCTATCAGCACAAATTGTACCAGCTCCGAGAAGAGAGGGACATGAAGCCTGAGACATTTGCTAGCCTCGTCTCAGCTCGTCTTTATGAGAAAAGATTTGGTCCATACTTCTGCCAGCCTGTAATTGCTGGATTGAGCGAAGACAAACCCTTTATTTGCACGATGGACTCCATTGGGGCCAAAGAGCTTGCTAAGGATTTTGTTGTTTCTGGCACTGCATCTGAGTCCCTTTATGGTGCCTGTGAGGCAATGTTTAAACCTGACATGGAACCAGAGGAATTATTTGAAACCATCTCTCAAGCGCTGCTGTCTTCTGTGGATAGGGACTGTTTGAGTGGATGGGGAGGGCATGTCTATGTTGTTTCTCATACGGCCTCCAGTATGAACAAAGGTTGTGGGCCAAAGAGAGGCCGTGATGGACTTGCTGAGAGGCGAGAGCAAGAGGATAAACAACCACCCCAAAGAATTTCCCTTGCTTATGTGGAACAGGTGAACTCAGATTTTATCATGGCCTTGGCAATGCAGCAGCAGGAGCACGAGAGGGCCTATACGATGCTCGAGACCATCGAAAGTGACAGCGAAGAAGACGAAAGTTTAGATTCCAACAGCAACAATTGTCTCGACACCAATGATTCCTTGCAAAGCCAAGAGCTTCAATCTCGGTGGGATTTTCTTGCAGAGGACGAGGAAACTTCCGATGAAGATATGGACGATGATGATGAAGAGGATGATTTTGATTTGGATGAGCTAACTTACGAAGAGTTGGTTGCGTTAGGAGAGTTCATTGGAGAAGAAAAGAGGGGGCTGCCTATGAATGAAATACCTTCGTGCTTGCACTCAGGAAAGTTTCGAACAGTTGAAAGCAGAAGTGGGATGGATCGGTGTGTGATTTGCCAAGTTGAATATGAAGATGGTGAAGAATTGGCTGCTCTTCCTTGTGAGCATCCTTACCATTCAGAGTGTATAAGCAAATGGCTTCAAATCAAAAGGGTTTGTCCAATATGTGGCTCAGAAGTTTCATCCCCCAAGGCTTCTTCAAATGCATAA

Protein sequence

MLENQTGSRKSCLFEAAKRARKVPLRCPSHTNCLLKPIINVASSTISVLHLEKNIFSISLGSLPNDPNIANARATITVISATATASTPIYFGVPHSIRESSSSSRILVLSPLPHRIVFFLSHSGSLPPPSPESFSYSQSRRRAHSSFPLARILVLSLLPGRNRFLTLTLTDGLTLLFLSLAFWVSPSSLAGIVFLVSPTRSLFFSSRAHSGSLPPPAPESFSYSQSRRLAYSSSSRSHSGSLPLTRIRIRIRIVFLLSVSPTRLLFFLSLAFWFFTPHQNQNQNQNQNRVLLDFVRIAPSLCKNRTTVQSSTSIATGGDNNDFCGEKNSNGSILGILSSRLILSLSLSLLCCFAFWTLTLISGLKFGVAVPPCLRYGKEILWAVQRVPCVKHDACVQAKNGESSSLELIVDVLNFLFWDTGSSTAWLASRQKVKKVRKAEREREGVGVKPKPSIAILKTTVSQVPAVLFAIQLSLLSDSLTLFRESLSFGPDVGAFLLLALLPCYGKFLLIFEYNGSAVVAMVGKNCFAIASDRRLGVQLQTVATDFQKIYRIHDKLFLGLSGLGTDAQTLYQRLVYQHKLYQLREERDMKPETFASLVSARLYEKRFGPYFCQPVIAGLSEDKPFICTMDSIGAKELAKDFVVSGTASESLYGACEAMFKPDMEPEELFETISQALLSSVDRDCLSGWGGHVYVVSHTASSMNKGCGPKRGRDGLAERREQEDKQPPQRISLAYVEQVNSDFIMALAMQQQEHERAYTMLETIESDSEEDESLDSNSNNCLDTNDSLQSQELQSRWDFLAEDEETSDEDMDDDDEEDDFDLDELTYEELVALGEFIGEEKRGLPMNEIPSCLHSGKFRTVESRSGMDRCVICQVEYEDGEELAALPCEHPYHSECISKWLQIKRVCPICGSEVSSPKASSNA
Homology
BLAST of Sgr014298 vs. NCBI nr
Match: KAG6602111.1 (Proteasome subunit beta type-3-A, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 648.7 bits (1672), Expect = 7.8e-182
Identity = 348/446 (78.03%), Postives = 370/446 (82.96%), Query Frame = 0

Query: 511 IFEYNGSAVVAMVGKNCFAIASDRRLGVQLQTVATDFQKIYRIHDKLFLGLSGLGTDAQT 570
           IFEYNGSAVVAMVGKNCFAIASDRR GVQLQT+ATDFQKIYRIHDKLFLGLSGLGTDAQT
Sbjct: 3   IFEYNGSAVVAMVGKNCFAIASDRRFGVQLQTIATDFQKIYRIHDKLFLGLSGLGTDAQT 62

Query: 571 LYQRLVYQHKLYQLREERDMKPETFASLVSARLYEKRFGPYFCQPVIAGLS-EDKPFICT 630
           LYQRLVYQHKLYQLREERDMKPETFASLVSARLYEKRFGPYFCQPVIAGLS EDKPFICT
Sbjct: 63  LYQRLVYQHKLYQLREERDMKPETFASLVSARLYEKRFGPYFCQPVIAGLSDEDKPFICT 122

Query: 631 MDSIGAKELAKDFVVSGTASESLYGACEAMFKPDMEPEELFETISQALLSSVDRDCLSGW 690
           MD IGAKELAKDFVVSGTASESLYGACEAMFKPDMEPEELFETISQALLSSVDRDCLSGW
Sbjct: 123 MDCIGAKELAKDFVVSGTASESLYGACEAMFKPDMEPEELFETISQALLSSVDRDCLSGW 182

Query: 691 GGHVYVV--------------------------SHTASSMNKGCGPKRGRDGLAERREQE 750
           GGHVYVV                          S  AS M+KGC            +++E
Sbjct: 183 GGHVYVVTPTENGPRETSHSIFFLVKTSIHWISSRAASGMDKGC------------QQRE 242

Query: 751 DKQPPQRISLAYVEQVNSDFIMALAMQQQEHERAYTMLETIESDSEEDESLDSNSNNCLD 810
               P R       +VN+DFIMALAMQQQEHER YTMLETIESDSEEDE  DSNS+N LD
Sbjct: 243 AVVEPWR------GKVNTDFIMALAMQQQEHERNYTMLETIESDSEEDERSDSNSSNGLD 302

Query: 811 -TNDSLQSQELQSRWDFLAEDEETSDED---MDDDDEED--DFDLDELTYEELVALGEFI 870
            TNDS QSQEL S W FLA+DEE + +D   MD+D+EED  +FDLDEL+YEEL+ALGEFI
Sbjct: 303 HTNDSFQSQELHSPWGFLADDEEETTDDNEYMDEDEEEDIEEFDLDELSYEELIALGEFI 362

Query: 871 GEEKRGLPMNEIPSCLHSGKFRTVESRSGMDRCVICQVEYEDGEELAALPCEHPYHSECI 924
           GEEKRGLPMNEIPSCLHS KF+T+E++SG+DRCVICQVEYED EELAALPCEHPYHSECI
Sbjct: 363 GEEKRGLPMNEIPSCLHSSKFQTIENKSGIDRCVICQVEYEDSEELAALPCEHPYHSECI 422

BLAST of Sgr014298 vs. NCBI nr
Match: KAA0055464.1 (E3 ubiquitin ligase BIG BROTHER-related-like [Cucumis melo var. makuwa] >TYK08946.1 E3 ubiquitin ligase BIG BROTHER-related-like [Cucumis melo var. makuwa])

HSP 1 Score: 641.0 bits (1652), Expect = 1.6e-179
Identity = 347/465 (74.62%), Postives = 376/465 (80.86%), Query Frame = 0

Query: 511 IFEYNGSAVVAMVGKNCFAIASDRRLGVQLQTVATDFQKIYRIHDKLFLGLSGLGTDAQT 570
           IFEYNGSAVVAMVGKNCFAIASDRRLGVQLQT+ATDFQKIYRIHDKLFLGLSGLGTDAQT
Sbjct: 3   IFEYNGSAVVAMVGKNCFAIASDRRLGVQLQTIATDFQKIYRIHDKLFLGLSGLGTDAQT 62

Query: 571 LYQRLVYQHKLYQLREERDMKPETFASLVSARLYEKRFGPYFCQPVIAGLS-EDKPFICT 630
           LYQRLVYQHKLYQLREERDMKPETFASLVSARLYEKRFGPYFCQPVIAGLS EDKPFICT
Sbjct: 63  LYQRLVYQHKLYQLREERDMKPETFASLVSARLYEKRFGPYFCQPVIAGLSDEDKPFICT 122

Query: 631 MDSIGAKELAKDFVVSGTASESLYGACEAMFKPDMEPEELFETISQALLSSVDRDCLSGW 690
           MDSIGAKELAKDFVVSGTASESLYGACEAMFKPDMEPEELFETISQALLSSVDRDCLSGW
Sbjct: 123 MDSIGAKELAKDFVVSGTASESLYGACEAMFKPDMEPEELFETISQALLSSVDRDCLSGW 182

Query: 691 GGHVYVV-----------------------------SHTASSMNKGCGPKRGRDGLAERR 750
           GGHVYVV                             S+ +    + C  +   + +A RR
Sbjct: 183 GGHVYVVLQSRAMEFTSEMQDFHAGLKLLGHICYILSYPSIHSIRSCAVR--PEAVAGRR 242

Query: 751 EQED------KQPPQRISLAYVEQVNSDFIMALAMQQQEHERAYTMLETIESDSEEDESL 810
           EQED      +Q PQRISLAYVEQVN+D IMALAMQQQEHE AYT LETI S+SEEDES 
Sbjct: 243 EQEDNNKQQQQQLPQRISLAYVEQVNTDLIMALAMQQQEHEMAYTTLETIASESEEDESS 302

Query: 811 DSNSNNCLDTNDSLQSQELQSRWDFLAEDE----------ETSD-----EDMDDDDEED- 870
           DSNSNN L+TN + Q +EL  RW F  E+E          E SD     EDM++D++ D 
Sbjct: 303 DSNSNNGLNTNAASQREELICRWAFPDENEYMEGNEDGDYEGSDMDELNEDMEEDEDGDF 362

Query: 871 -DFDLDELTYEELVALGEFIGEEKRGLPMNEIPSCLHSGKFRTVESRSGMDRCVICQVEY 923
             FDLDELTYEEL+ALGEFIGEEKRGLP+NEIPSCLHS KF+T+E++SG+DRCVICQVEY
Sbjct: 363 EGFDLDELTYEELIALGEFIGEEKRGLPINEIPSCLHSSKFQTIENKSGIDRCVICQVEY 422

BLAST of Sgr014298 vs. NCBI nr
Match: RXH90981.1 (hypothetical protein DVH24_006926 [Malus domestica])

HSP 1 Score: 516.2 bits (1328), Expect = 6.0e-142
Identity = 284/440 (64.55%), Postives = 329/440 (74.77%), Query Frame = 0

Query: 511 IFEYNGSAVVAMVGKNCFAIASDRRLGVQLQTVATDFQKIYRIHDKLFLGLSGLGTDAQT 570
           I EYNGSA+VAMVGKNCFAIASDRRLGVQLQTVATDF++I ++HD+LF+GLSGL TDAQT
Sbjct: 3   ITEYNGSALVAMVGKNCFAIASDRRLGVQLQTVATDFKRISQVHDRLFIGLSGLATDAQT 62

Query: 571 LYQRLVYQHKLYQLREERDMKPETFASLVSARLYEKRFGPYFCQPVIAGLS-EDKPFICT 630
           LYQRL+++HKLYQLREERDMKPETFASLVSA LYEKRFGPYF QPVIAGLS ED+PFICT
Sbjct: 63  LYQRLMFRHKLYQLREERDMKPETFASLVSAILYEKRFGPYFTQPVIAGLSDEDRPFICT 122

Query: 631 MDSIGAKELAKDFVVSGTASESLYGACEAMFKPDMEPEELFETISQALLSSVDRDCLSGW 690
            DSIGAKELAKDFVV+GTASESLYGACEAMFKPDMEPEELFET+SQALLSSVDRDCLSGW
Sbjct: 123 TDSIGAKELAKDFVVAGTASESLYGACEAMFKPDMEPEELFETVSQALLSSVDRDCLSGW 182

Query: 691 GGHVYVV--------------------SHTASSMNKGCGPKRGRDGLAERREQEDKQPPQ 750
           GGHV+VV                     H    +      K   DG     E + KQ   
Sbjct: 183 GGHVFVVCGVMVKDMLLTKFLRFDKELEHPTDVL------KLSMDG----EEDQAKQSSG 242

Query: 751 RISLAYVEQVNSDFIMALAMQQQEHERAYTMLETIESDSEEDE------SLDSNSNNCLD 810
           RI    + QV++DFI+ALAMQ+Q  E+AYTMLETIESDSEED+      S  S+ N   D
Sbjct: 243 RIPFTQLIQVSADFILALAMQEQ--EQAYTMLETIESDSEEDDTEIDYASSSSSENYDPD 302

Query: 811 TNDSLQSQELQSRWDFLAEDEETSDEDMDDDDEEDDFDLDELTYEELVALGEFIGEEKRG 870
               L+S+E  +      EDEE        D E D  D+DELTYEE +ALGEFIGEEKRG
Sbjct: 303 VAAFLESREFDADDFRFLEDEEIGSSSSSSDQETDGLDVDELTYEEFLALGEFIGEEKRG 362

Query: 871 LPMNEIPSCLH--SGKFRTVESRSGMDRCVICQVEYEDGEELAALPCEHPYHSECISKWL 922
           LP +EI +CLH  + +    +S++ +DRCV+CQ+EY+DGE LAAL CEHPYH ECIS+WL
Sbjct: 363 LPRSEISACLHPYTCELAVGQSKTSIDRCVVCQLEYDDGESLAALSCEHPYHWECISQWL 422

BLAST of Sgr014298 vs. NCBI nr
Match: KAF2291370.1 (hypothetical protein GH714_023327 [Hevea brasiliensis])

HSP 1 Score: 466.8 bits (1200), Expect = 4.2e-127
Identity = 291/599 (48.58%), Postives = 343/599 (57.26%), Query Frame = 0

Query: 511 IFEYNGSAVVAMVGKNCFAIASDRRLGVQLQTVATDFQKIYRIHDKLFLGLSGLGTDAQT 570
           IFEYNGSA++AMVGKNCFAIASDRRLGVQLQT+ATDFQ+IY+ HD+LF+GLSGL TDAQT
Sbjct: 3   IFEYNGSALIAMVGKNCFAIASDRRLGVQLQTIATDFQRIYKFHDRLFMGLSGLATDAQT 62

Query: 571 LYQRLVYQHKLYQLREERDMKPETFASLVSARLYEKRFGPYFCQPVIAGLS-EDKPFICT 630
           LYQRLV++HKLYQLREERDMKPETFASLVSA LYEKRFGPYFCQPVIAGLS E+KPFICT
Sbjct: 63  LYQRLVFRHKLYQLREERDMKPETFASLVSALLYEKRFGPYFCQPVIAGLSDENKPFICT 122

Query: 631 MDSIGAKELAKDFVVSGTASESLYGACEAMFKPDMEPEELFETISQALLSSVDRDCLSGW 690
           MDSIGAKELAKDFVV+GTASESLYGACEA+FKPDMEPEELFET+SQALLSSVDRDCLSGW
Sbjct: 123 MDSIGAKELAKDFVVAGTASESLYGACEAVFKPDMEPEELFETVSQALLSSVDRDCLSGW 182

Query: 691 GGHVYVVSHTASSMNKGC------------------------------------------ 750
           GGH+YVVS   S+ +  C                                          
Sbjct: 183 GGHIYVVSVCGSTWDFICYIENFCIASLLKLAMVLALFYIVLLFFYLLYKLGICECVARS 242

Query: 751 -------------------------------------------------------GPKR- 810
                                                                   P R 
Sbjct: 243 LWKMLWACMVSWFSLWEYCCFCLCDTLTMLKRVSHHHTKQFSSDEFDTSEDDYHYAPNRT 302

Query: 811 --------------------------------------------GRDGLAER-------- 870
                                                       GR+ L  +        
Sbjct: 303 LEMRRSLSGEMRDYRRVHHLRKSLRPRSHRIRVGFSKESEFSTYGRNYLLNKHRNHGSTV 362

Query: 871 ------------------------REQEDKQPP-----------QRISLAYVEQVNSDFI 917
                                   R  +D++ P           +R+  A ++QV+SDF 
Sbjct: 363 HNIRVIHSSKFARKGTNLRPKIYSRVSQDRKRPMDDEERKQAAARRVPFADLDQVHSDFA 422

BLAST of Sgr014298 vs. NCBI nr
Match: KAG6578952.1 (Proteasome subunit beta type-3-A, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 407.9 bits (1047), Expect = 2.3e-109
Identity = 234/374 (62.57%), Postives = 240/374 (64.17%), Query Frame = 0

Query: 522 MVGKNCFAIASDRRLGVQLQTVATDFQKIYRIHDKLFLGLSGLGTDAQTLYQRLVYQHKL 581
           MVGKNCFAIASDRRLGVQLQTVATDFQKIYRIHDKLFLGLSGLGTDAQT+YQRLVYQHKL
Sbjct: 1   MVGKNCFAIASDRRLGVQLQTVATDFQKIYRIHDKLFLGLSGLGTDAQTMYQRLVYQHKL 60

Query: 582 YQLREERDMKPETFASLVSARLYEKRFGPYFCQPVIAGLS-EDKPFICTMDSIGAKELAK 641
           YQLREERDMKPETFASLVSARLYEKRFGPYFCQPVIAGLS EDKPFICTMD IGAKELAK
Sbjct: 61  YQLREERDMKPETFASLVSARLYEKRFGPYFCQPVIAGLSDEDKPFICTMDCIGAKELAK 120

Query: 642 DFVVSGTASESLYGACEAMFKPDMEPEELFETISQALLSSVDRDCLSGWGGHVYVVSHTA 701
           DFVVSGTASESLYGACEAMFKPDMEPEELFETISQALLSSVDRDCLSGWGGHVYVV+ T 
Sbjct: 121 DFVVSGTASESLYGACEAMFKPDMEPEELFETISQALLSSVDRDCLSGWGGHVYVVTPTE 180

Query: 702 SSMNKGCGPKRGRDGLAERREQEDKQPPQRISLAYVEQVNSDFIMALAMQQQEHERAYTM 761
           ++                               A   +VNSD IMALAMQQQEHE  YT 
Sbjct: 181 AA-------------------------------AQGNKVNSDLIMALAMQQQEHEMPYTT 240

Query: 762 LETIESDSEEDESLDSNSNNCLDTNDSLQSQELQSRWDFLAEDEETSDEDMDDDDEEDDF 821
           L TIESDS+EDE  DSNSNN LDTN S                                 
Sbjct: 241 LGTIESDSDEDERSDSNSNNGLDTNASF-------------------------------- 258

Query: 822 DLDELTYEELVALGEFIGEEKRGLPMNEIPSCLHSGKFRTVESRSGMDRCVICQVEYEDG 881
                                                                QVEYED 
Sbjct: 301 -----------------------------------------------------QVEYEDS 258

Query: 882 EELAALPCEHPYHS 895
           EELAALPCEHPYHS
Sbjct: 361 EELAALPCEHPYHS 258

BLAST of Sgr014298 vs. ExPASy Swiss-Prot
Match: Q9XI05 (Proteasome subunit beta type-3-A OS=Arabidopsis thaliana OX=3702 GN=PBC1 PE=1 SV=2)

HSP 1 Score: 347.8 bits (891), Expect = 3.7e-94
Identity = 169/190 (88.95%), Postives = 181/190 (95.26%), Query Frame = 0

Query: 511 IFEYNGSAVVAMVGKNCFAIASDRRLGVQLQTVATDFQKIYRIHDKLFLGLSGLGTDAQT 570
           IFEYNGSAVVAMVGKNCFAIASDRRLGVQLQT+ATDFQ+I +IHD++F+GLSGL TD QT
Sbjct: 3   IFEYNGSAVVAMVGKNCFAIASDRRLGVQLQTIATDFQRISKIHDRVFIGLSGLATDVQT 62

Query: 571 LYQRLVYQHKLYQLREERDMKPETFASLVSARLYEKRFGPYFCQPVIAGL-SEDKPFICT 630
           LYQRLV++HKLYQLREERDMKPETFASLVSA LYEKRFGPY CQPVIAGL  +DKPFICT
Sbjct: 63  LYQRLVFRHKLYQLREERDMKPETFASLVSAILYEKRFGPYLCQPVIAGLGDDDKPFICT 122

Query: 631 MDSIGAKELAKDFVVSGTASESLYGACEAMFKPDMEPEELFETISQALLSSVDRDCLSGW 690
           MDSIGAKELAKDFVVSGTASESLYGACEAM+KPDME EELFETISQALLSSVDRDCLSGW
Sbjct: 123 MDSIGAKELAKDFVVSGTASESLYGACEAMYKPDMEAEELFETISQALLSSVDRDCLSGW 182

Query: 691 GGHVYVVSHT 700
           GGHVY+V+ T
Sbjct: 183 GGHVYIVTPT 192

BLAST of Sgr014298 vs. ExPASy Swiss-Prot
Match: O81153 (Proteasome subunit beta type-3-B OS=Arabidopsis thaliana OX=3702 GN=PBC2 PE=1 SV=1)

HSP 1 Score: 346.3 bits (887), Expect = 1.1e-93
Identity = 170/188 (90.43%), Postives = 179/188 (95.21%), Query Frame = 0

Query: 511 IFEYNGSAVVAMVGKNCFAIASDRRLGVQLQTVATDFQKIYRIHDKLFLGLSGLGTDAQT 570
           IFEYNGSAVVAMVGKNCFAIASDRRLGVQLQT+ATDFQ+I +IHD LF+GLSGL TD QT
Sbjct: 3   IFEYNGSAVVAMVGKNCFAIASDRRLGVQLQTIATDFQRISKIHDHLFIGLSGLATDVQT 62

Query: 571 LYQRLVYQHKLYQLREERDMKPETFASLVSARLYEKRFGPYFCQPVIAGLSED-KPFICT 630
           LYQRLV++HKLYQLREERDMKPETFASLVSA LYEKRFGP+ CQPVIAGL +D KPFICT
Sbjct: 63  LYQRLVFRHKLYQLREERDMKPETFASLVSAILYEKRFGPFLCQPVIAGLGDDNKPFICT 122

Query: 631 MDSIGAKELAKDFVVSGTASESLYGACEAMFKPDMEPEELFETISQALLSSVDRDCLSGW 690
           MDSIGAKELAKDFVVSGTASESLYGACEAMFKPDME EELFETISQALLSSVDRDCLSGW
Sbjct: 123 MDSIGAKELAKDFVVSGTASESLYGACEAMFKPDMEAEELFETISQALLSSVDRDCLSGW 182

Query: 691 GGHVYVVS 698
           GGHVYVV+
Sbjct: 183 GGHVYVVT 190

BLAST of Sgr014298 vs. ExPASy Swiss-Prot
Match: O65084 (Proteasome subunit beta type-3 OS=Picea mariana OX=3335 GN=PBC1 PE=2 SV=1)

HSP 1 Score: 346.3 bits (887), Expect = 1.1e-93
Identity = 165/188 (87.77%), Postives = 181/188 (96.28%), Query Frame = 0

Query: 511 IFEYNGSAVVAMVGKNCFAIASDRRLGVQLQTVATDFQKIYRIHDKLFLGLSGLGTDAQT 570
           IFEYNGSA+VAMVGKNCFAIASDRRLGVQLQT+ATDFQ+I++IHDKL++GLSGL TD QT
Sbjct: 3   IFEYNGSALVAMVGKNCFAIASDRRLGVQLQTIATDFQRIFKIHDKLYVGLSGLATDVQT 62

Query: 571 LYQRLVYQHKLYQLREERDMKPETFASLVSARLYEKRFGPYFCQPVIAGLSE-DKPFICT 630
           LYQR  ++HKLYQLREER+M+PETFASLVSA LYEKRFGPYFCQPVIAGL E DKPFICT
Sbjct: 63  LYQRFAFRHKLYQLREERNMRPETFASLVSALLYEKRFGPYFCQPVIAGLGEDDKPFICT 122

Query: 631 MDSIGAKELAKDFVVSGTASESLYGACEAMFKPDMEPEELFETISQALLSSVDRDCLSGW 690
           MDSIGAKELAKDFVV+GTA+ESLYGACE+M+KPDMEPEELFETISQALLSS+DRDCLSGW
Sbjct: 123 MDSIGAKELAKDFVVAGTAAESLYGACESMYKPDMEPEELFETISQALLSSIDRDCLSGW 182

Query: 691 GGHVYVVS 698
           GGHVYVVS
Sbjct: 183 GGHVYVVS 190

BLAST of Sgr014298 vs. ExPASy Swiss-Prot
Match: Q9LST7 (Proteasome subunit beta type-3 OS=Oryza sativa subsp. japonica OX=39947 GN=PBC1 PE=2 SV=1)

HSP 1 Score: 342.0 bits (876), Expect = 2.0e-92
Identity = 166/190 (87.37%), Postives = 181/190 (95.26%), Query Frame = 0

Query: 511 IFEYNGSAVVAMVGKNCFAIASDRRLGVQLQTVATDFQKIYRIHDKLFLGLSGLGTDAQT 570
           IFEYNGSAVVAMVGKNCFAIASDRRLGVQLQTVATDFQ++++IHDKL++GLSGL TDAQT
Sbjct: 3   IFEYNGSAVVAMVGKNCFAIASDRRLGVQLQTVATDFQRVFKIHDKLYIGLSGLATDAQT 62

Query: 571 LYQRLVYQHKLYQLREERDMKPETFASLVSARLYEKRFGPYFCQPVIAGLSED-KPFICT 630
           LYQRLV++HKLYQLREERDMKP+TFASLVSA LYEKRFGPYFCQPVIAGL ED +PFICT
Sbjct: 63  LYQRLVFRHKLYQLREERDMKPQTFASLVSALLYEKRFGPYFCQPVIAGLGEDNEPFICT 122

Query: 631 MDSIGAKELAKDFVVSGTASESLYGACEAMFKPDMEPEELFETISQALLSSVDRDCLSGW 690
           MD IGAKELAKDFVVSGTASESLYGACE+M+KP+MEPEELFETISQAL SSVDRDCLSGW
Sbjct: 123 MDCIGAKELAKDFVVSGTASESLYGACESMYKPNMEPEELFETISQALQSSVDRDCLSGW 182

Query: 691 GGHVYVVSHT 700
           GG V +V+ T
Sbjct: 183 GGFVLLVTPT 192

BLAST of Sgr014298 vs. ExPASy Swiss-Prot
Match: Q9Y7T8 (Probable proteasome subunit beta type-3 OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=pup3 PE=3 SV=1)

HSP 1 Score: 242.7 bits (618), Expect = 1.7e-62
Identity = 113/188 (60.11%), Postives = 149/188 (79.26%), Query Frame = 0

Query: 511 IFEYNGSAVVAMVGKNCFAIASDRRLGVQLQTVATDFQKIYRIHDKLFLGLSGLGTDAQT 570
           I EYNG + VAM GKNC AIASD RLGVQ  ++  +F K++ + DK +LGL+GL TD QT
Sbjct: 3   IMEYNGGSCVAMAGKNCVAIASDLRLGVQSISLTNNFPKVFAMGDKTYLGLTGLATDVQT 62

Query: 571 LYQRLVYQHKLYQLREERDMKPETFASLVSARLYEKRFGPYFCQPVIAGLSEDK-PFICT 630
           LY+   Y+  LY+ REER ++P+TFA+LVS+ LYEKRFGPYF  PV+AG+S D  PFIC 
Sbjct: 63  LYELFRYKVNLYKFREERQIQPKTFANLVSSTLYEKRFGPYFSFPVVAGVSNDNTPFICG 122

Query: 631 MDSIGAKELAKDFVVSGTASESLYGACEAMFKPDMEPEELFETISQALLSSVDRDCLSGW 690
            DSIG  + A+DF+VSGTA+E LYG CE++++P++EP++LFETISQALL++ DRDC+SGW
Sbjct: 123 FDSIGCIDFAEDFIVSGTATEQLYGMCESVYEPNLEPDDLFETISQALLNAQDRDCISGW 182

Query: 691 GGHVYVVS 698
           G  VYV++
Sbjct: 183 GCVVYVIT 190

BLAST of Sgr014298 vs. ExPASy TrEMBL
Match: A0A5D3CF09 (E3 ubiquitin ligase BIG BROTHER-related-like OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold314G00370 PE=4 SV=1)

HSP 1 Score: 641.0 bits (1652), Expect = 7.8e-180
Identity = 347/465 (74.62%), Postives = 376/465 (80.86%), Query Frame = 0

Query: 511 IFEYNGSAVVAMVGKNCFAIASDRRLGVQLQTVATDFQKIYRIHDKLFLGLSGLGTDAQT 570
           IFEYNGSAVVAMVGKNCFAIASDRRLGVQLQT+ATDFQKIYRIHDKLFLGLSGLGTDAQT
Sbjct: 3   IFEYNGSAVVAMVGKNCFAIASDRRLGVQLQTIATDFQKIYRIHDKLFLGLSGLGTDAQT 62

Query: 571 LYQRLVYQHKLYQLREERDMKPETFASLVSARLYEKRFGPYFCQPVIAGLS-EDKPFICT 630
           LYQRLVYQHKLYQLREERDMKPETFASLVSARLYEKRFGPYFCQPVIAGLS EDKPFICT
Sbjct: 63  LYQRLVYQHKLYQLREERDMKPETFASLVSARLYEKRFGPYFCQPVIAGLSDEDKPFICT 122

Query: 631 MDSIGAKELAKDFVVSGTASESLYGACEAMFKPDMEPEELFETISQALLSSVDRDCLSGW 690
           MDSIGAKELAKDFVVSGTASESLYGACEAMFKPDMEPEELFETISQALLSSVDRDCLSGW
Sbjct: 123 MDSIGAKELAKDFVVSGTASESLYGACEAMFKPDMEPEELFETISQALLSSVDRDCLSGW 182

Query: 691 GGHVYVV-----------------------------SHTASSMNKGCGPKRGRDGLAERR 750
           GGHVYVV                             S+ +    + C  +   + +A RR
Sbjct: 183 GGHVYVVLQSRAMEFTSEMQDFHAGLKLLGHICYILSYPSIHSIRSCAVR--PEAVAGRR 242

Query: 751 EQED------KQPPQRISLAYVEQVNSDFIMALAMQQQEHERAYTMLETIESDSEEDESL 810
           EQED      +Q PQRISLAYVEQVN+D IMALAMQQQEHE AYT LETI S+SEEDES 
Sbjct: 243 EQEDNNKQQQQQLPQRISLAYVEQVNTDLIMALAMQQQEHEMAYTTLETIASESEEDESS 302

Query: 811 DSNSNNCLDTNDSLQSQELQSRWDFLAEDE----------ETSD-----EDMDDDDEED- 870
           DSNSNN L+TN + Q +EL  RW F  E+E          E SD     EDM++D++ D 
Sbjct: 303 DSNSNNGLNTNAASQREELICRWAFPDENEYMEGNEDGDYEGSDMDELNEDMEEDEDGDF 362

Query: 871 -DFDLDELTYEELVALGEFIGEEKRGLPMNEIPSCLHSGKFRTVESRSGMDRCVICQVEY 923
             FDLDELTYEEL+ALGEFIGEEKRGLP+NEIPSCLHS KF+T+E++SG+DRCVICQVEY
Sbjct: 363 EGFDLDELTYEELIALGEFIGEEKRGLPINEIPSCLHSSKFQTIENKSGIDRCVICQVEY 422

BLAST of Sgr014298 vs. ExPASy TrEMBL
Match: A0A498JC90 (RING-type domain-containing protein OS=Malus domestica OX=3750 GN=DVH24_006926 PE=4 SV=1)

HSP 1 Score: 516.2 bits (1328), Expect = 2.9e-142
Identity = 284/440 (64.55%), Postives = 329/440 (74.77%), Query Frame = 0

Query: 511 IFEYNGSAVVAMVGKNCFAIASDRRLGVQLQTVATDFQKIYRIHDKLFLGLSGLGTDAQT 570
           I EYNGSA+VAMVGKNCFAIASDRRLGVQLQTVATDF++I ++HD+LF+GLSGL TDAQT
Sbjct: 3   ITEYNGSALVAMVGKNCFAIASDRRLGVQLQTVATDFKRISQVHDRLFIGLSGLATDAQT 62

Query: 571 LYQRLVYQHKLYQLREERDMKPETFASLVSARLYEKRFGPYFCQPVIAGLS-EDKPFICT 630
           LYQRL+++HKLYQLREERDMKPETFASLVSA LYEKRFGPYF QPVIAGLS ED+PFICT
Sbjct: 63  LYQRLMFRHKLYQLREERDMKPETFASLVSAILYEKRFGPYFTQPVIAGLSDEDRPFICT 122

Query: 631 MDSIGAKELAKDFVVSGTASESLYGACEAMFKPDMEPEELFETISQALLSSVDRDCLSGW 690
            DSIGAKELAKDFVV+GTASESLYGACEAMFKPDMEPEELFET+SQALLSSVDRDCLSGW
Sbjct: 123 TDSIGAKELAKDFVVAGTASESLYGACEAMFKPDMEPEELFETVSQALLSSVDRDCLSGW 182

Query: 691 GGHVYVV--------------------SHTASSMNKGCGPKRGRDGLAERREQEDKQPPQ 750
           GGHV+VV                     H    +      K   DG     E + KQ   
Sbjct: 183 GGHVFVVCGVMVKDMLLTKFLRFDKELEHPTDVL------KLSMDG----EEDQAKQSSG 242

Query: 751 RISLAYVEQVNSDFIMALAMQQQEHERAYTMLETIESDSEEDE------SLDSNSNNCLD 810
           RI    + QV++DFI+ALAMQ+Q  E+AYTMLETIESDSEED+      S  S+ N   D
Sbjct: 243 RIPFTQLIQVSADFILALAMQEQ--EQAYTMLETIESDSEEDDTEIDYASSSSSENYDPD 302

Query: 811 TNDSLQSQELQSRWDFLAEDEETSDEDMDDDDEEDDFDLDELTYEELVALGEFIGEEKRG 870
               L+S+E  +      EDEE        D E D  D+DELTYEE +ALGEFIGEEKRG
Sbjct: 303 VAAFLESREFDADDFRFLEDEEIGSSSSSSDQETDGLDVDELTYEEFLALGEFIGEEKRG 362

Query: 871 LPMNEIPSCLH--SGKFRTVESRSGMDRCVICQVEYEDGEELAALPCEHPYHSECISKWL 922
           LP +EI +CLH  + +    +S++ +DRCV+CQ+EY+DGE LAAL CEHPYH ECIS+WL
Sbjct: 363 LPRSEISACLHPYTCELAVGQSKTSIDRCVVCQLEYDDGESLAALSCEHPYHWECISQWL 422

BLAST of Sgr014298 vs. ExPASy TrEMBL
Match: A0A6A6KQT5 (RING-type domain-containing protein OS=Hevea brasiliensis OX=3981 GN=GH714_023327 PE=4 SV=1)

HSP 1 Score: 466.8 bits (1200), Expect = 2.0e-127
Identity = 291/599 (48.58%), Postives = 343/599 (57.26%), Query Frame = 0

Query: 511 IFEYNGSAVVAMVGKNCFAIASDRRLGVQLQTVATDFQKIYRIHDKLFLGLSGLGTDAQT 570
           IFEYNGSA++AMVGKNCFAIASDRRLGVQLQT+ATDFQ+IY+ HD+LF+GLSGL TDAQT
Sbjct: 3   IFEYNGSALIAMVGKNCFAIASDRRLGVQLQTIATDFQRIYKFHDRLFMGLSGLATDAQT 62

Query: 571 LYQRLVYQHKLYQLREERDMKPETFASLVSARLYEKRFGPYFCQPVIAGLS-EDKPFICT 630
           LYQRLV++HKLYQLREERDMKPETFASLVSA LYEKRFGPYFCQPVIAGLS E+KPFICT
Sbjct: 63  LYQRLVFRHKLYQLREERDMKPETFASLVSALLYEKRFGPYFCQPVIAGLSDENKPFICT 122

Query: 631 MDSIGAKELAKDFVVSGTASESLYGACEAMFKPDMEPEELFETISQALLSSVDRDCLSGW 690
           MDSIGAKELAKDFVV+GTASESLYGACEA+FKPDMEPEELFET+SQALLSSVDRDCLSGW
Sbjct: 123 MDSIGAKELAKDFVVAGTASESLYGACEAVFKPDMEPEELFETVSQALLSSVDRDCLSGW 182

Query: 691 GGHVYVVSHTASSMNKGC------------------------------------------ 750
           GGH+YVVS   S+ +  C                                          
Sbjct: 183 GGHIYVVSVCGSTWDFICYIENFCIASLLKLAMVLALFYIVLLFFYLLYKLGICECVARS 242

Query: 751 -------------------------------------------------------GPKR- 810
                                                                   P R 
Sbjct: 243 LWKMLWACMVSWFSLWEYCCFCLCDTLTMLKRVSHHHTKQFSSDEFDTSEDDYHYAPNRT 302

Query: 811 --------------------------------------------GRDGLAER-------- 870
                                                       GR+ L  +        
Sbjct: 303 LEMRRSLSGEMRDYRRVHHLRKSLRPRSHRIRVGFSKESEFSTYGRNYLLNKHRNHGSTV 362

Query: 871 ------------------------REQEDKQPP-----------QRISLAYVEQVNSDFI 917
                                   R  +D++ P           +R+  A ++QV+SDF 
Sbjct: 363 HNIRVIHSSKFARKGTNLRPKIYSRVSQDRKRPMDDEERKQAAARRVPFADLDQVHSDFA 422

BLAST of Sgr014298 vs. ExPASy TrEMBL
Match: W9QYY7 (Proteasome subunit beta type-3-A OS=Morus notabilis OX=981085 GN=L484_013374 PE=4 SV=1)

HSP 1 Score: 401.4 bits (1030), Expect = 1.0e-107
Identity = 280/697 (40.17%), Postives = 333/697 (47.78%), Query Frame = 0

Query: 511 IFEYNGSAVVAMVGKNCFAIASDRRLGVQLQTVATDFQKIYRIHDKLFLGLSGLGTDAQT 570
           IFEYNGSA+VAMVGKNCFAIASDRRLGVQLQT+ATDFQKIY+IHD+L+LGLSGL TDAQT
Sbjct: 3   IFEYNGSALVAMVGKNCFAIASDRRLGVQLQTIATDFQKIYKIHDRLYLGLSGLATDAQT 62

Query: 571 LYQRLVYQHKLYQLREERDMKPETFASLVSARLYEKRFGPYFCQPVIAGLS-EDKPFICT 630
           LYQRLV++HKLYQLREERDMKPETFASLVSA LYEKRFGPYFCQPVIAGLS EDKPFICT
Sbjct: 63  LYQRLVFRHKLYQLREERDMKPETFASLVSAVLYEKRFGPYFCQPVIAGLSDEDKPFICT 122

Query: 631 MDSIGAKELAKDFVVSGTASESLYGACEAMFKPDMEPEELFETISQALLSSVDRDCLSGW 690
           MDSIGAKELAKDFVV+GTASESLYGACEAMFKPD+EPEELFE +SQALLSSV+RDCLSGW
Sbjct: 123 MDSIGAKELAKDFVVAGTASESLYGACEAMFKPDLEPEELFEVVSQALLSSVNRDCLSGW 182

Query: 691 GGH--------------------------------------------------------- 750
           GGH                                                         
Sbjct: 183 GGHNTHRSDREDLEGKNGLSFAEDLDGDCHTTEIRDRDYRDIGLPLLREGYSVCGSTWDF 242

Query: 751 -----VYVVSH------------------------------------------------- 810
                 + VSH                                                 
Sbjct: 243 ICYIENFCVSHLLKMAMVLVLLYIVLLFLYLLHKLGICECVSWSLCRMLWACFASCLSVC 302

Query: 811 ------------------------------------------------------------ 870
                                                                       
Sbjct: 303 DCCCTFLCLKLRNLRGRTNRRRRRRRKLDIEVVDTSSTEAGDEHHNGETSFASCDYDMYK 362

Query: 871 --TASSMNK------------------------------GCGPKRG-------------- 921
             + SS+++                              G G +R               
Sbjct: 363 EKSDSSLSRRKRDYRSDQLRKSLRPRSHRIGAGISKDHLGHGNRRSYNIKLGDHHNHNHD 422

BLAST of Sgr014298 vs. ExPASy TrEMBL
Match: A0A6A6KS48 (RING-type domain-containing protein OS=Hevea brasiliensis OX=3981 GN=GH714_023176 PE=4 SV=1)

HSP 1 Score: 378.3 bits (970), Expect = 9.5e-101
Identity = 258/601 (42.93%), Postives = 308/601 (51.25%), Query Frame = 0

Query: 509 LLIFEYNGSAVVAMVGKNCFAIASDRRLGVQLQTVATDFQKIYRIHDKLFLGLSGLGTDA 568
           L IFEYNGSA++AMVGKNCFAIASDRRLGVQLQT+ATDFQ+IY+ HD+LF+GLSGL TDA
Sbjct: 4   LQIFEYNGSALIAMVGKNCFAIASDRRLGVQLQTIATDFQRIYKFHDRLFMGLSGLATDA 63

Query: 569 QTLYQRLVYQHKLYQLREERDMKPETFASLVSARLYEKRFGPYFCQPVIAGLS-EDKPFI 628
           QTL                                    FGPYFCQPVIAGLS E+KPFI
Sbjct: 64  QTL------------------------------------FGPYFCQPVIAGLSDENKPFI 123

Query: 629 CTMDSIGAKELAKDFVVSGTASESLYGACEAMFKPDMEPEELFETISQALLSSVDRDCLS 688
           CTMDSIGAKELAKDF V+GTASESLYGACEA+FKPDMEPEELFET+SQALLSSVDRDCLS
Sbjct: 124 CTMDSIGAKELAKDF-VAGTASESLYGACEAVFKPDMEPEELFETVSQALLSSVDRDCLS 183

Query: 689 GWGGHVYVVSHTASSMNKGC---------------------------------------- 748
           GWGGH+YVVS   S+ +  C                                        
Sbjct: 184 GWGGHIYVVSVCGSTWDFICYIENFCIASLLKLAMVLALFYIVLLFFYLLYKLGICECVA 243

Query: 749 ---------------------------------------------------------GPK 808
                                                                     P 
Sbjct: 244 RSLWKMLWACMVSWFSLWEYCCFCLCDTLTMLKRVSHHHTKQFSSDEFDTSEDDYHYAPN 303

Query: 809 R---------------------------------------------GRDGLAER------ 868
           R                                             GR+ L  +      
Sbjct: 304 RTLEMRRSLSGEMRDYRRVHHLRKSLRPRSHRIRVGFSKESEFSTYGRNYLLNKHRNHGS 363

Query: 869 --------------------------REQEDKQPP-----------QRISLAYVEQVNSD 917
                                     R  +D++ P           +R+  A ++QV+SD
Sbjct: 364 TVHNIRVIHSSKFARKGTNLRPKIYSRVSQDRKRPMDDEERKQAAARRVPFADLDQVHSD 423

BLAST of Sgr014298 vs. TAIR 10
Match: AT1G21720.1 (proteasome beta subunit C1 )

HSP 1 Score: 347.8 bits (891), Expect = 2.6e-95
Identity = 169/190 (88.95%), Postives = 181/190 (95.26%), Query Frame = 0

Query: 511 IFEYNGSAVVAMVGKNCFAIASDRRLGVQLQTVATDFQKIYRIHDKLFLGLSGLGTDAQT 570
           IFEYNGSAVVAMVGKNCFAIASDRRLGVQLQT+ATDFQ+I +IHD++F+GLSGL TD QT
Sbjct: 3   IFEYNGSAVVAMVGKNCFAIASDRRLGVQLQTIATDFQRISKIHDRVFIGLSGLATDVQT 62

Query: 571 LYQRLVYQHKLYQLREERDMKPETFASLVSARLYEKRFGPYFCQPVIAGL-SEDKPFICT 630
           LYQRLV++HKLYQLREERDMKPETFASLVSA LYEKRFGPY CQPVIAGL  +DKPFICT
Sbjct: 63  LYQRLVFRHKLYQLREERDMKPETFASLVSAILYEKRFGPYLCQPVIAGLGDDDKPFICT 122

Query: 631 MDSIGAKELAKDFVVSGTASESLYGACEAMFKPDMEPEELFETISQALLSSVDRDCLSGW 690
           MDSIGAKELAKDFVVSGTASESLYGACEAM+KPDME EELFETISQALLSSVDRDCLSGW
Sbjct: 123 MDSIGAKELAKDFVVSGTASESLYGACEAMYKPDMEAEELFETISQALLSSVDRDCLSGW 182

Query: 691 GGHVYVVSHT 700
           GGHVY+V+ T
Sbjct: 183 GGHVYIVTPT 192

BLAST of Sgr014298 vs. TAIR 10
Match: AT1G77440.1 (20S proteasome beta subunit C2 )

HSP 1 Score: 346.3 bits (887), Expect = 7.7e-95
Identity = 170/188 (90.43%), Postives = 179/188 (95.21%), Query Frame = 0

Query: 511 IFEYNGSAVVAMVGKNCFAIASDRRLGVQLQTVATDFQKIYRIHDKLFLGLSGLGTDAQT 570
           IFEYNGSAVVAMVGKNCFAIASDRRLGVQLQT+ATDFQ+I +IHD LF+GLSGL TD QT
Sbjct: 3   IFEYNGSAVVAMVGKNCFAIASDRRLGVQLQTIATDFQRISKIHDHLFIGLSGLATDVQT 62

Query: 571 LYQRLVYQHKLYQLREERDMKPETFASLVSARLYEKRFGPYFCQPVIAGLSED-KPFICT 630
           LYQRLV++HKLYQLREERDMKPETFASLVSA LYEKRFGP+ CQPVIAGL +D KPFICT
Sbjct: 63  LYQRLVFRHKLYQLREERDMKPETFASLVSAILYEKRFGPFLCQPVIAGLGDDNKPFICT 122

Query: 631 MDSIGAKELAKDFVVSGTASESLYGACEAMFKPDMEPEELFETISQALLSSVDRDCLSGW 690
           MDSIGAKELAKDFVVSGTASESLYGACEAMFKPDME EELFETISQALLSSVDRDCLSGW
Sbjct: 123 MDSIGAKELAKDFVVSGTASESLYGACEAMFKPDMEAEELFETISQALLSSVDRDCLSGW 182

Query: 691 GGHVYVVS 698
           GGHVYVV+
Sbjct: 183 GGHVYVVT 190

BLAST of Sgr014298 vs. TAIR 10
Match: AT1G77440.2 (20S proteasome beta subunit C2 )

HSP 1 Score: 346.3 bits (887), Expect = 7.7e-95
Identity = 170/188 (90.43%), Postives = 179/188 (95.21%), Query Frame = 0

Query: 511 IFEYNGSAVVAMVGKNCFAIASDRRLGVQLQTVATDFQKIYRIHDKLFLGLSGLGTDAQT 570
           IFEYNGSAVVAMVGKNCFAIASDRRLGVQLQT+ATDFQ+I +IHD LF+GLSGL TD QT
Sbjct: 3   IFEYNGSAVVAMVGKNCFAIASDRRLGVQLQTIATDFQRISKIHDHLFIGLSGLATDVQT 62

Query: 571 LYQRLVYQHKLYQLREERDMKPETFASLVSARLYEKRFGPYFCQPVIAGLSED-KPFICT 630
           LYQRLV++HKLYQLREERDMKPETFASLVSA LYEKRFGP+ CQPVIAGL +D KPFICT
Sbjct: 63  LYQRLVFRHKLYQLREERDMKPETFASLVSAILYEKRFGPFLCQPVIAGLGDDNKPFICT 122

Query: 631 MDSIGAKELAKDFVVSGTASESLYGACEAMFKPDMEPEELFETISQALLSSVDRDCLSGW 690
           MDSIGAKELAKDFVVSGTASESLYGACEAMFKPDME EELFETISQALLSSVDRDCLSGW
Sbjct: 123 MDSIGAKELAKDFVVSGTASESLYGACEAMFKPDMEAEELFETISQALLSSVDRDCLSGW 182

Query: 691 GGHVYVVS 698
           GGHVYVV+
Sbjct: 183 GGHVYVVT 190

BLAST of Sgr014298 vs. TAIR 10
Match: AT3G47180.1 (RING/U-box superfamily protein )

HSP 1 Score: 147.1 bits (370), Expect = 6.8e-35
Identity = 88/210 (41.90%), Postives = 135/210 (64.29%), Query Frame = 0

Query: 718 ERREQEDKQPPQRI-SLAYVEQVNSDFIMALAMQQQE-----HERAYTMLETIESD--SE 777
           + +E+E KQPP ++  LA  EQ NS+  +A +          H+ + +M+  IESD  SE
Sbjct: 2   DNQEEEPKQPPNKLPDLALFEQANSEVALAASQANSHFAHAMHDSSPSMISMIESDEESE 61

Query: 778 EDESLDSNSNNCLDTND-SLQSQELQSRWDFLAEDEETSDEDMDDD--DEEDDFDLDELT 837
           ++E ++ N     D+N   +   E+    +FL + E  S+ + +DD  +EED+ D D+L+
Sbjct: 62  DEEEINENYYEYFDSNGFGVDEDEIN---EFLEDQESNSNLEEEDDFLEEEDEIDPDQLS 121

Query: 838 YEELVALGEFIGEEKRGLPMNEIPSCLHSGKFRTVESRSGMDRCVICQVEYEDGEELAAL 897
           YEEL+ALG+FIG E RGL   EI +CL++  +    +++ +DRCV+CQ+E+E+ E L  L
Sbjct: 122 YEELIALGDFIGVENRGLTPIEISTCLNASTYVFSHNKNEIDRCVVCQMEFEERESLVVL 181

Query: 898 -PCEHPYHSECISKWLQIKRVCPICGSEVS 916
            PC+HPYHSECI+KWL+ K++CPIC SE S
Sbjct: 182 RPCDHPYHSECITKWLETKKICPICCSEPS 208

BLAST of Sgr014298 vs. TAIR 10
Match: AT3G19910.1 (RING/U-box superfamily protein )

HSP 1 Score: 112.1 bits (279), Expect = 2.4e-24
Identity = 55/114 (48.25%), Postives = 81/114 (71.05%), Query Frame = 0

Query: 804 EETSDEDMDDDDEEDDFDLDELTYEELVALGEFIGEEKRGLPMNEIPSCLHSGKFRTVES 863
           E+  DE     D  D+ D DEL+YEEL+ALG+ +G E RGL  + I S L S +++  ++
Sbjct: 222 EDLEDESHTSQDAWDEMDPDELSYEELLALGDIVGTESRGLSADTIAS-LPSKRYKEGDN 281

Query: 864 RSGM-DRCVICQVEYEDGEELAALPCEHPYHSECISKWLQIKRVCPICGSEVSS 917
           ++G  + CVIC+++YED E+L  LPC+H YHSECI+ WL+I +VCP+C +EVS+
Sbjct: 282 QNGTNESCVICRLDYEDDEDLILLPCKHSYHSECINNWLKINKVCPVCSAEVST 334

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
KAG6602111.17.8e-18278.03Proteasome subunit beta type-3-A, partial [Cucurbita argyrosperma subsp. sororia... [more]
KAA0055464.11.6e-17974.62E3 ubiquitin ligase BIG BROTHER-related-like [Cucumis melo var. makuwa] >TYK0894... [more]
RXH90981.16.0e-14264.55hypothetical protein DVH24_006926 [Malus domestica][more]
KAF2291370.14.2e-12748.58hypothetical protein GH714_023327 [Hevea brasiliensis][more]
KAG6578952.12.3e-10962.57Proteasome subunit beta type-3-A, partial [Cucurbita argyrosperma subsp. sororia... [more]
Match NameE-valueIdentityDescription
Q9XI053.7e-9488.95Proteasome subunit beta type-3-A OS=Arabidopsis thaliana OX=3702 GN=PBC1 PE=1 SV... [more]
O811531.1e-9390.43Proteasome subunit beta type-3-B OS=Arabidopsis thaliana OX=3702 GN=PBC2 PE=1 SV... [more]
O650841.1e-9387.77Proteasome subunit beta type-3 OS=Picea mariana OX=3335 GN=PBC1 PE=2 SV=1[more]
Q9LST72.0e-9287.37Proteasome subunit beta type-3 OS=Oryza sativa subsp. japonica OX=39947 GN=PBC1 ... [more]
Q9Y7T81.7e-6260.11Probable proteasome subunit beta type-3 OS=Schizosaccharomyces pombe (strain 972... [more]
Match NameE-valueIdentityDescription
A0A5D3CF097.8e-18074.62E3 ubiquitin ligase BIG BROTHER-related-like OS=Cucumis melo var. makuwa OX=1194... [more]
A0A498JC902.9e-14264.55RING-type domain-containing protein OS=Malus domestica OX=3750 GN=DVH24_006926 P... [more]
A0A6A6KQT52.0e-12748.58RING-type domain-containing protein OS=Hevea brasiliensis OX=3981 GN=GH714_02332... [more]
W9QYY71.0e-10740.17Proteasome subunit beta type-3-A OS=Morus notabilis OX=981085 GN=L484_013374 PE=... [more]
A0A6A6KS489.5e-10142.93RING-type domain-containing protein OS=Hevea brasiliensis OX=3981 GN=GH714_02317... [more]
Match NameE-valueIdentityDescription
AT1G21720.12.6e-9588.95proteasome beta subunit C1 [more]
AT1G77440.17.7e-9590.4320S proteasome beta subunit C2 [more]
AT1G77440.27.7e-9590.4320S proteasome beta subunit C2 [more]
AT3G47180.16.8e-3541.90RING/U-box superfamily protein [more]
AT3G19910.12.4e-2448.25RING/U-box superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Monk fruit (Qingpiguo) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR001841Zinc finger, RING-typeSMARTSM00184ring_2coord: 870..910
e-value: 1.7E-5
score: 34.3
IPR001841Zinc finger, RING-typePFAMPF13639zf-RING_2coord: 869..910
e-value: 6.1E-11
score: 42.5
IPR001841Zinc finger, RING-typePROSITEPS50089ZF_RING_2coord: 870..910
score: 11.500827
IPR013083Zinc finger, RING/FYVE/PHD-typeGENE3D3.30.40.10Zinc/RING finger domain, C3HC4 (zinc finger)coord: 855..923
e-value: 6.0E-17
score: 62.6
IPR029055Nucleophile aminohydrolases, N-terminalGENE3D3.60.20.10Glutamine Phosphoribosylpyrophosphate, subunit 1, domain 1coord: 509..705
e-value: 5.8E-50
score: 171.6
IPR029055Nucleophile aminohydrolases, N-terminalSUPERFAMILY56235N-terminal nucleophile aminohydrolases (Ntn hydrolases)coord: 512..698
IPR001353Proteasome, subunit alpha/betaPFAMPF00227Proteasomecoord: 515..696
e-value: 2.1E-41
score: 141.5
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 711..727
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 765..787
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 701..730
NoneNo IPR availablePANTHERPTHR11599:SF155PROTEASOME SUBUNIT BETAcoord: 512..699
NoneNo IPR availablePANTHERPTHR11599PROTEASOME SUBUNIT ALPHA/BETAcoord: 512..699
NoneNo IPR availableSUPERFAMILY57850RING/U-boxcoord: 869..917
IPR016050Proteasome beta-type subunit, conserved sitePROSITEPS00854PROTEASOME_BETA_1coord: 520..567
IPR023333Proteasome B-type subunitPROSITEPS51476PROTEASOME_BETA_2coord: 516..696
score: 47.716663
IPR033811Proteasome beta 3 subunitCDDcd03759proteasome_beta_type_3coord: 514..699
e-value: 1.09405E-118
score: 357.708

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Sgr014298.1Sgr014298.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0043161 proteasome-mediated ubiquitin-dependent protein catabolic process
biological_process GO:0051603 proteolysis involved in cellular protein catabolic process
cellular_component GO:0005737 cytoplasm
cellular_component GO:0005634 nucleus
cellular_component GO:0019774 proteasome core complex, beta-subunit complex
cellular_component GO:0005839 proteasome core complex
molecular_function GO:0004298 threonine-type endopeptidase activity