Sgr019399 (gene) Monk fruit (Qingpiguo) v1

Overview
NameSgr019399
Typegene
OrganismSiraitia grosvenorii (Monk fruit (Qingpiguo) v1)
Description30-kDa cleavage and polyadenylation specificity factor 30
Locationtig00153347: 229754 .. 250301 (-)
RNA-Seq ExpressionSgr019399
SyntenySgr019399
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGAAGTTATACATCTGAAGGCAAAACTCTGTAACTGCAGGTACAAGTTTGGTTTCTGTCCAAATGGTCCTGATTGTCGGTATAGGCATGCAAAGCTGCCTGCACCGCCACCTCCAGTGGAAGAAATCCTTCAGAAAATACAGCACTTGAGTTCATACAATTATGGTCCCTCAAACAAATTCTTTTCACAACGTGGAGTTGGCTTATCCCAACAAAACGAAAAATCTCAATTTCCTCAGGGTCCGGCCACCGTGAATCAAGGAGTGATAGGAAAACCTTCTACAGCAGAATCTGGTAATGTCCAACAACAGCAAGCTCAACAGTCCGCACAACAACCCAGCCAGACACAGATACAAAGTCTTTCTAATGGCCAGCCCAATCAATTAAACAGAACTGCAACATCTTTGCCTCAAGGAATATCTAGGTGTGTCTCAAGAGACCTGAAGTCTTTTTACACGTTCCTTTAACTCAGTTTTTTGTTTGATATGGAAACATTGTTGCAGAGATATGTTATTCAACCTTCAGTGTTCTATTATATTTGTGTTGTTGTTCGGTTACTTTAAGCCCCAATAGTCAACAATGTTTTCCTTGATATCAATTTGTGGTCCTTGTAGCTGCCCCTTGGGCTTGGTGAATGTTATCAACAAGGGTATTAATTATTATACATATTATCTATCAAGCTACTAAATTCTATGCAGAAGTTTCATTATTATATGCTCTGTTCCAATTATGATCTCATGTCAATTGCCAAGTGTCAAGAAACAAAATTTCCCCCACGCGTCGATGGTTCTTTTTTTGATTGGCTAGAACATTTTGGCTCATAGCTCATACTCTGTTATACTGGGATCTTAACTCTGTGGACTGATACTTCTTGAAGACTTTTTTGATTTAGTTTGCTATAATACTTCAAGTTGATGTAATCTCTCCAAAAGCTTCTTTAATTATACTCCCTTTTTGATTTATGCCAATTGGAGTTGATTTCTGTAAAACTCAATCGGCAAGGGTTCCCCTTTCTGGTTCTGCAACTTTTTCCTTTATCTTCTGATAAAGGTGAGCGGTTTCCCAAACCTTCTGACATTTACATAGAGTTTTTAAATACTCTGCATATCCTAATTTATTTTTGAAAGATTAATAGGTACTTTATTGTTAAAAGTTGCAACCGCGAGAATTTGGAATTATCTGTACAACAGGGGGTATGGGCAACTCAAAGAAGCAATGAAGCTAAACTTAATGAAGCTTTTGATTCTGCTGATAATGTTATTTTGATTTTCTCGGTCAATCGGACTCGACATTTCCAGGTGCCAATAATTCACTGTTGCATTTGTGTATGACTTGGTCATCGAGGACCAAAGTATTTGATTGATATAATTATACTGATTTAGTATTGGCTGCTTAGGGCTGTGCAAAGATGATGTCCAGGATTGGTGGATCTGTCAGTGGGGGCAATTGGAAATATGCCCATGGAACTGCACATTATGGGCAAAACTTTTCACTCAAATGGCTGAAGGTTGATATTTCATTTCTTAGATCTATATTTAGTTTTATGCAAAAATTTATCACATCGGATCTCACCATGCCGTTGCATTCTTAGTAAGTTAGAATGATGCTAAGGGTTGTGAATTTTTTTTTTTCCTGAAAATTTTTGCATTAACTTTATCAAAGATCATCATCGCAGTGCTCTTTAATAATGAAACTGAATGATTTTTTTATAAAAAAAATTATTTGTGATAACGAACACTTTTTTCTTGCAATCATCAGAGGATGTATGAGAACGAGGAATCAAAGGTTCACCTGTATCCATAAATAGAGGAAGGAAACTTGATCAAGTTCTCAATCTAAACCGTAAACCCTAAACTAAGTTTTGAAAACTTGATGAAGTTATTTCCAATATTGATTGGTATCTGTATTGTTCATTAAATTATTTGATATTCTTATTTGGGATAAACAACAGCCTGGACTAGAGGTACATCAAACGGAAAAAAATTGATGCCTCCCATTAGTTCTAGTGCATCATACTTCATGACTCAGAGTTTGAATCTGAAATTGCGTGGTTTATTGTATTATCTTCCTTGTATCTGTATACATAGAGAATTTGATTACTCTTTTGCTTTTTTTTTGAAGGATTTGAACATACATATTTTTCCCGATATCATTGAGAGATACATATTTTTGGTATTAGTATATTAGGAAGCTAGCTGAAATCTGATTTTTTACACAGATACATGATAAAGCTCAAACTAAATTTTGTTGTGAAGGATGATTTTGTTTCTGTCAATTTGATCCTGTATGATGCGATGAATAATTTGACCTTGTTACTTGCAGTTATGTGAACTATCCTTTCAGAAAACTCGCCATTTGAGGAATCCTTATAATGAAAACTTACCAGTAAAGGTATAGTATCTCAAGCTTGATAAAAGGTTTCTGTTTGACATCAAAATAGTATATTTCCATCTGCAGGTGTTTTTTTCATGCTTCTTTGTTGTGAACTCTTGAAAATATTTCCAGCTTTTGGCTTTTATTTTTATGATTATTATCTTGTTCTGCACTTTCTGGTGATTATACATATTACTGGTTTGTTAAAGTCAAATTATCTGAAAGCATACTTTGAAGACCTTGCAACCTTGTGAATGAAAGTTTATCCCCCCTTTTTATACCTGGGAGCTTTCTGTAGGATTTAGTTTCATGGAAGTTACTAGAAAACACTCTGCCCCATGTGGGCTGTGCGATCTTTTTTTTATGGGAAAAGATCATTTTTACCTCTATAATTAATCCGGTGAGTCAATTTTAACCTTAAACATTTAATTTCATCAAATTAAATCTTAAACTTGAATAAGTGTTACAATTAATACCTTTTCATAATTAAATTTAACAATTTCCCATGAGCCAACCAGTTAATCATATAAAATTCAACCAAAAACGTACCCAAAATTATGTCATGAAATGCCTAAACCCAATCATGAAAATACTCAAATCTAACTACAAAGTCAAATAGATAAAAATGCTCAAATTTAGCAACAAAATGTCTAAATTCAATTAATTAAAAGCATCTAAAACAATTAGAAGAACCAATTCCAGTAAAAATAGTCTTGTCCTACCATAAGAGCACCACAACTGGTCACGAATGCTTTAAAAAAATGGAAACCTAACTAATTGAGGGTAATTTTTTATTTATTATTATTATTTTTAATGGAAAATCGAGCTTTTATCAAGGGAAATGAAAGAATACAATGTGGCGTACAAAAAGACTGCCTAAGAAAAGAAGCCAACAAAACAAAGCTAACTATACAGAAAATAGCTCCTATCCAAGATCAAAAGATCTAACTGATAATTATAAAAATTCTTGGAGACCGATGCTAAAAGAGAAGCATTAAACCTAGCAAGAGAACAAACTCTTCCTAAGATCTCTCAATCCCTCTAAAAATCTTATTATTTCTCTCAATCCAACGCCCCACAAAATATCGAAAAAGCCAGCACGCCACTTGATTCGACCCTTGTCCCAAAACAGTGAATGCAGAAGGATCTCCTCCACCATAGAACAACACTCTCTGTTTCGAGCCCAGACAAATACCAAACAACTCAACAATTGCAGATCGAGTGGGCGAACGGACAATTCCAAAAGATGTGATCAAGGTTCTGAGACTAATTGAGGTATTAATTGTACTATTATTTAAGTGTAGGGTTTAAATAAATGGAATTGAAAGTTCTGAACATACAATCACAAGGATGACAAATCCTAATCGACCTTACTCTTTAATATAAGAGCATATAAGTTTTTTCTAGAACTTGAGAAAAATACAGTAGGTTTTCCTATAACAGCGATTAACAAAATATTTGACTATCAGAACCATTGCAAGTAATGTGGGCTTCATTTTTCTGCTAAAATAACCACCATTTGTACATTCTTGCCTTGACTGCAATTAAAAATGTTGGCCCAAACATGATAGAGTTAAAAAAAGGCTTGAATCCAAGAAATACATGAAAGACTTCTAATGGAGTCTTGAACATGTATCCTGATCCATCCGAAGTGCAGAACAGCAGCCATATTTCCAAGCTGAGACATGTGGACTATAATTCCAGATTTCTTATGGGAGAAGGTGGCTGAGTAAAGTTTTGAGATGATATTTGAATAATTGACTCTCATCTCTCCCTTTATTTAGCACGGAGCAAGTGCTAGTTGTTGGTAGTACAATGCTGGGATGATTCTTCAAGTGGTTGGATGTTTGAACTTTGCAAAAATCTTAATGATGATGAATTAGAAGAATGGCAAGAGCTTATTGTGTTCTTCATTTTGGATGGTTTTACCTTTTCAAATGCATAGGACAGGAAATTATGGATAGAGATTCTCTTTCACTATGGAATTGTGTAAAGCCAGGCTTTTTAGATATTTAAATACCCTACTGAGGCTAAGGTATCTTCATGGTCAGTCTTCCTTAGTTTCAGTAGCACAACCGATAAAATTGGCTTGTGCTCTATAGGAATTCTTCCAGAGATGACAACCACTTGTTTTTGAAGCCCAGGTGGCCTTTCCTAGTTATGTGCATGACCTTTAGTTTCTCCTATGCAGGTTTAAAAGGAAGGCCATTACTTAGATGCCAATGTTTTCAAAGTGCAACATGCACTCTACTCTAGGGTAGTAGGGATGGAAAAAATCCCTGCCTTCGCAATCGGGTTGGGGTTGAAGACAAGGGAAATAATTTCTCCTTTTTTTGGGGGGGATGAAGAATGAATTTCCCACCCTACTTTGACCTCATCCCCGAACCCAAAATTTTCAAATATTATTTTATAAACTTCTCGAGGCCTTTGATATACTTTGGGATCCTTTATCACAGAGCCAGTTTTCTCAAAGCCATCCTCTTTTCGTTGATGAAAGTTCTTGTCAGAAATACCAAGTTTGTTTGTGAGCCCCTGCAAAGATGGGACAATGCATTTCACTCAAAGACTTGAGGGGAGATAGTGCACCATGTTGTTCGCCTTTTATTAATCAAAAAATTGAAAATTTTAACTAATAAATAATAAAATATTAATTTTTATCTTTTTGGCTATAATAATATTTTAAACATGTTTATTATTATTTGATCAAAATTTTATTTAAAAACAGAGAAAAAATCAGGAAGTCTTTTCCCACCACAAACCTGATTCTTGAATGGGGATTGTCCAGACTAGCTCCTAATTTCCAAGTTGGGGAATAGGGTGGGGACGAAAATAAGAAATCCCTATGGAATTGAAAATAAGGAGCACCTCCAACCTAACTTGTCTTGATGCTATCCCTACTCTTAAGGCGATATGGTTCTTGCTTGCTTGGGGGTGAGAGGTGCATAAAATTCATCACCTGAGTAAAGTGAGTTACATAATTTGAAACAGAAAAAAGTATATAATTTATATTGTGAATAAAAAATAGGAATTCATGATGACATAAAATGAGTTTGAATCTAGAAAAATTACTTTAATGCCATGTATTTTCAAGAGATAATAAATGAAAAAAGGAATATATATAGTCAAAAGTATTAAAAAGTGCAGAAGATTATTTTTTCCCCTAAGGTGCGAAAGGCAACAAGGCAATCAAAAAGCGGCAAAAAGGTGAGCAAGGCAATCACCTAAGTGAGTTTCGTAGACAGTAAGTGCACCTTGGCTCGCCAATGCTATAGGCAATTTGTGGGGCTTACCTTGAAGACGCTATTAAATACTAGGGCTTTCTTTTGTAAAGATCCAGTGCAGGTTAATCATTTGGTACATTTTGGGTACTGACTGAGAAATTTGTGATATTTAGGAGGATTAAGAACACATGGCAAAAGGCCATGAACTTGTTGGATAGTGGCATTCCATAGTAGTTTGATTGAGAAGAAACAAGATTATTTTCTGGAACAATTTGATGATGCTGATGGCGTATCAAATGGGATTGCTTTGGCTCACTTTGCTTCAACTTTGAAGGAGTTGACAGCCATCCCGTTTATTTTTTAATCTGTTCAGTTGGCACCCTGTTTCTGCTTGTTTGTTTTGAAACTGCTCACGCTGTCCTGGTTTATTTTCGGTCCATCGTCCACATATCCATTGAGGATCATTGTTGGAAGAAATGATATTTTTTCCTTCATAACTTGTTCTGCAATTAGTGTAGCATTGGATGCTAATTGAGCCATTATTTTTCTTTTTGTACTCTTATTAAGTGCAGATCAGTAGAGATTGCCAAGAACTAGAGCCCTCTATTGGTGAGCAGCTGGCTTCTTTGCTTTATCTTGAGCCAGATGGTGAACTCATGGTATGCTTGCTTTGTTCATTTTGGTTGAAGAATCAGAAAACTTAGAAATGCATTCCTATTACAAAATAAATATTTTATACGTAAGCAGGGGAGGAAGAAAATGAAGTTTGAGGAAGTGGATTAAGGCAGCTTCAGGATTTATTGTTGTGCAATTGTGTGGGATACCTTTAGCTGGATTTCCTGAGTGTGTTTTTCTTATTTTTTGTACAGGCTGTCTCACTAGCAGCAGAATCGAAACGAGAAGAGGAGAAGGCAAAGGGAGTTAATCCTGATATTGGAAGTGAGAACCCAGATATTGTCCCTTTTGAGGACAACGAAGAAGAGGAAGAAGAAGAAAGTGAAGAGGAGGAAGAGGAGAGCTTTGTCCAGTCCGTTGGTCTACCAGCTCAGGGCAGAGGAAGGGGCAGGGGAATCATGTGGCCTCCACACATGCCGATGGGACGTGGTGCCAGACCCTTCCATGGAATGCAGGGTTTTCCACCTGGGATGATGGGTCCGGATGGGTTGTCTTATGGACCTGTTACACCTGATGGATTTCCGATGCCTGACATTTTTGGTATGGCTCCCCGTGGTTTCAGTCCATATGGTCCTAGGTTTTCTGGTGATTTTATGGGCCCCCCATCTGCTATGATGTTTCGCGGACGACCTTCTCAACCTGGGGCCATGTTTCCCCCTGCTGGGTTTGGCATGATGATGGGTCAAGGACGTGGTCCCTATATGGGTGGGATGGGTGTTACTGGTACTAACCCAGCTCGAGCTGGTCGGCCTGTGGGTGTGTCTCCATTGTATCCACCTCCTGCAGTACCCTCATCTCAGAACATCAACCGAGTTGTGAAGAGGGATCAAAGAGGACCAGCTAATGATCGCAATGATAGATATATTACGGGTCCGGACCAAAGCAAGGGCCAGGAGATGCCAGGGAGTGGACATGATGACGAGATGCAATACAAGCAGGGATCAAAAGCTTATCCTGATGAGCAATATGGCATGGGAACCACGTTTAGGAATGAAGAAAGTGAAAGTGAGGATGAGGCACCTCGGCGGTCAAGGCATGGAGAGGGGAAGAAGAAGCGGAGAGGCTCAGAAGGAGATGCCACTGCAATCTCTGATCAGTGATTTCATGTAGTTAGTAATATCTGGCCGAATAGCTCCGTTTTGCCACTGACCAGGGGTGTCTTCCTGCTTATGCAAGGCAGGCATCATTGAACTTAGTGTCGTTGTAAGTTCTAAAATTGGAGTATTTGAGCTTCATCTCATTCTCTTTTTTTTCTTTATTTTATACACTCCTTTTTAGGTGACTGTCGTGACGTAAACTGATACATTTCTCAGGACAGTCATAATTCATACATTGTTTTACAAATTTGTAAAAGAATTTTCAGAGGATGTTTTGAAGAGAAGAATCTCATAAAGTCGATATTCGATGATGTGAAGCTGTTATGTTTTTCCCCTTGCCCCTCCATTGTTAATCCCAATCTGCTCTCACCCCAACTTCTTCATCCTATAATAGAAAGGAAAGAAAAGGGGGAGTCTGCACTTGTTGACAGGAATGGAAGTTTGTACAAAGCATGGCTATTACAATCTTTTTAATGCATCATCCAAGATGTGTTATGTCTATGATATGCCACGTCCTTTTCATTTATGGTGAATATTTTACTTTAGATCTGATTTACCTATTCATAGTTAGACGGTTGAAAACCAAATTATTCTTTCTCATTGTAGAGTCAATTGTTTACTCTTATTTGGGATGTAATCTTTATCAGCAGTATGGTATTTTCTAAGCTTGTTTGATAGCATATAATCCGATTGGGTCGCTAATTCAATACATTGATGGGAAAAATGACAAGATAGTTTAAAATATTAGATCTATCTTGAAATAGAGAGTTAACTTGAGTATAATTTAGTTGAGACATGAATGACTCTTCTTATAAGTCTTTCATTTTTACATGTGTAATATCATAATTTTAGAAAAAGAAAATCTTGGAATAGAGTCTTCTACCAAGGTTTTTAGAGAGGGACCAAAGATTAAATAGACAATTAATAAATATGATAAAAAACCTTATTATTTAATTTTCAAAATAATACAAATATTGGAAAAGAAGGTGAGAAAGAAGAATGTAAAAATATATAAAAAAAATAAAATTTGTTTAATGAATCTATGTTATTTTAAATAAATAATTAAAAAAATATAATTTTTTAAAAATAAATAGATAAGGAAAGAAAAATAAATAAATCAGAATGGAAAATTTAAAATCAAATTAGGAGAAATTATGGAAAAAGTAAATATTTTCTTTCCTCACATATTTTGGACAGCCCATTTAGGCAATGGAAGGCCCAAACACCGATCTTATGCTCTTACCTGCTCCCGATTTTCCTGCACTGCTACGTTTGTCATCCTCGTCTGCTCTTCAAAACCCTTCTCAGTTTCATAAACCCTAGCATGGTATCTATCTTTCTCTGTCACTTCCAATAATAATAAACCCTTCTGCCAATTTTTCATATAAAAAATAAAAAATTAAATAAAAAAATCACTTGTGTTTTGCTTTTTGCATCAAGTCTCGCGGTACTCTCTCTCAGAACTCCTCCTCCAGACGCCGCCCCTCCCCTAACCTCACCATCTCTCTGCATTCTGTCCCCCATCTCTTTCTCGGCAACCCCCTTCGTCTCCCTTTCAGCCACGCCTTCACTCCAGCCACCAGGTGACGGCGGTTTTCTCTTTCGGTAAGTTTACTTGCTCTCTCAAGTTGCCGCCCCCATTTTCATGAGCGTTAGTGTTTTTCTCTGTCAGTGCCAATGATAAATCATAATCAGTAAAAGGAATCTGTTTAAGAAAGTTTACATTTTTATAAGCTCTTTGTATCTTCTCTGTTTTTCTGTAGCTCGACTATGAAGAAGATCACGAAGAAGAAAAGCTCGGCTCCTAAAGGGGCAACGAAGGGGAAGAATTTTCCACTCGATAAGGACCCTTTTTTCAGTTCTGAATCGAGGAAACGGAGGAAGACTGTCGATGAGAATGATGAAATTGAGAGCGGTGAATCAGACGAGGATACTGGGTTCATGGGCTCTGCTGCGGAGAGGGGGAAATATGAAGAAGCTGAGGATGAGCAGTTTGAGGAAGAAACTGCCGACGAGAAGAGGAATAGAGTGGCAAACGAGTACGTGGACAAAATTTGGGAAATTGCGAGGAGAGAAAAGGAGAGGAAGGATGAAGAAAAAGATTCACTTGTTGCCCAGATTCTTCAGCAGGAGCAACTCGAAGATAGCGGAAGGGTTAGGAGAGAAATTGCATCAAGGTTTTGGTTTCTCCTTGTTTAGTTTTCATTGGAATCTCTTATTTGTATTTCTGCAGATTGCATTTTGTTTACTCTCTTTGTTTTTCCAAATTCAGAGCACTTCATCTCTCTCTCTCGCTCTCACACACACGCACAGACACACTCACGCTCTTGTAGTTATAGGTTTTTATTCCTTCGATATCCGAATGCTAAGCTAAAAGCAATCTGCTTATCTAACGTGGAAAAAGGCTTGAGATGAATTTCTTCACTTTTTCTTATACGGACTCCATCTTGAATTCTCTTAAATTCAGCAATAGGTCTTTATTTTTATACGTTCATATAATGTTTCAAGTAACTTTCTTTAAGAGTCAGGCAATATGTTTAAAATAACGGTGGGATGTGAGGCACATTCTTATGCAATATCTTCATGAAAAAGGGCGAAGGAATTTACTTGTTAACCTTATATTATAATATGTGGCTAATATTATCTTGGATGTTCTCTTGCTTCACTGTGATGTTGTTTATAAAGAAGCTTGGAGAGCATGTTCCTATGATACACTTGATTCTTACACTCCGTGCACGAGGAAAAAAAGAGAGGGAAAGGGAGAGAATGTGAAAAAACAACTTTTAAATATTTTATGACGGTGCCTGGCTTGTTAATGATTATACACCTTTCAGGGTTCAGAAGCCAGAAGCTAGAGATGAATTTCAAGTCTTGATTAAGCACAGACAAACTGTTACAGCTGTGGCTCTATCTGATGATGATTCGAAGGGGTTTTCAACGTCTAAGGATGGTACCATCTTACATTGGGATGTAGATAGTGGGAAAGGGGAAAAATACCAATGGCCCAGTGATGAAGTGTTGAGATTGCATGGTGTTAAGGACCCGCAAGGTCGGGCTACAATGCACAGTAAAGTCATTTTGTCATTGGCAGTCAGTTCTGATGGTCGATACTTGGCAAGTGGAGGCTTAGACCGCCATGTGCATATATGGGACACTCGTATAAGAGAACATATTCAGGTAATTCTTATTTTATAGTTGTAGCTTATTGTTACATGGCATGGTTGAGTTTAAATCTAAACCAAAAATAATTTGGGTTGGGTTTGTGGCGTAAAGATGAACCTTTTGTTATTATTCCTCTTTTGTTATCCTATTCAATTGGGGAGTTTTGTAATTTCATTCATTCAATGAAATTGTCTCTTACAAAAAAAAAATCAATTGGAGAGTTTTTCTCCCCCCCCCAAGGGAAAAAATGACAGAAAATAAAGGATGATAGCCATCGAAAGCTTTTCTCAAACATTTCACTGTGGCTTTTTACAGGCATTTCCTGGTCATAGAGGACCTGTTTCATGTTTGACTTTCAGGCAAGGGACCTCAGAACTTTTTTCCGGTTCATATGATCGAACTGTCAAGATATGGAATGTAGAGGATAGAGCTTATATAAATACACTATTTGGTCACCAAAGTGAAGTATTGACTATTGATTGCCTACGGAAAGAAAGATTGCTTACTGTTGGACGTGACCGGAGCATGCAATTATGGAAGGTAGACCCTCTTTTTATTGCTTGCATTTTTTAACAATTCCCTCAGTGGACTCGCCGTTTGTTGCTTATTTAAATCTGTGCCGCATGCTTGTGTTTTTGTATACCTGCAGATATTCCTATTTCCCTTTCCTTATTTTGAATTATTATTCCTTTTGATGCTTGAAACAGGTTCCAGAGGAGTCCCGTTTAGTATTTCGTGCACCTGCATCATCCTTGGAATGTTGCTGTTTTATAAGCAATGATGAGTTCTTATCCGGATCAGATGATGGCAGTATTGAGCTTTGGAGTTTATTGAAAAAGAAGCCTGTTTCTATTGTACGAAATGCTCACCCTCTTTCATTCTCTTGCACGAATTTGGAGCTAAAGGAAAATGGAGCCATCCCCATTGGATGTATGGGTATGATATTCGTCAAGCATTCCATATATTTTTCTCTTCTATCATTCTGCAAATCATCACTGCACAGGAGAGAGAGAGTTGGTGGCTTTTTTCAAGTTTTAATGTTGCCATTAATGCACAAATCAATTCATAATGGTCACTTTGGTCTTTGTATTTTCTTACTAGCTTACATGTGCAGGAATGAAAAATTGGTTGGTTTTATCTCTTCTCCTTTTATCTATTCTTTCTTACTGCTCACTTTTCCATTAAAAGTAATATAAAATGAAAAGGTTGTTCAGTTCCAATAAAAAAGAAGGCAACTAATAGCCTTTCTTTGTGCCAATTATTATTCAATTGTGAATTTTGTTTACTTTGCAGGAAATGGGGATGTCAATTCTAATACTTCTCACAGTTTGTCAGCATACTCTTGGGTAAGTTCAGTCTCAGTATGTAGAAACAGTGACCTTGCTGCATCGGGTGCCGGCAACGGTTCTGTTCGTTTATGGGCTCTCACAAGTGATAAGAAAGATATTCGACCATTATATGACTTTCCTCTGGTATGCAAGATTTACATGTTTATTTTATATTTATATCTGTTTATTGTGCCTCGTCTAATTCAACTGACTTGAATGTGGCTTATATTGTGACAACTGCATTCTTTGTATATTATTACTTGAACTTGATGCCCTCGAAGGTCCCCAAGAAGAGCACTTGGACAGCTACTGAGTTTAGCTTCCTCTTGATATAACCCTTGCTTCTACTTTCAAGATCTCTTGCTTTTGGAAGTCCTTGAACAGAAAATTTAAGGGCTTCTTCTATATCTAGTCTGAAGATTTTTTTTTTTGTTATTTTAGAAATAATATACACATGAAATATGCGTTTGATGACATCGGATCTATAAGTTGTGAAACTATTTTTCAGATTCTCTCTCCAAAAAATCTAAAATTCTTTAGGACAATATTGAAAAATTTGCTTCCGGCTGGACAAGCAATAGTACCTCCACCTGCCTGTATTTCTGTACATATTAAAATTTTTTTTTTTAAGTACTTGTTTTCTTTTCTCTCTTTTTATCAGTAGAACTTTATATATTCTTTTTGATAGGAATGAATTCCATTTCTTGCTTTTCTTATCCAAAAAGAATTATTGAGTGTCTATTGTACAAAAGCTATGATGCTTACCTAATGGGTTGTTTGACAGAGAATTAAGATCTGTGTTCCATCTTTTGAAGAAAAAATAAATATAATTGCAATTATTTCTGGAGTTGCTTCTACATTAATTGTGCTTAAGATGTTGCTTACATTAGTGACTGCTGAGGAGTCTAGTGTATTGTTTACATATGGCTAAATTTTTTGATTTCAGGTTGGATTTGTGAACTCCTTGACTTTTGCCAAATCTGGAAGGTTTGTGGTAGCTGGAGTTGGGCAGGTAATCATGACACAAATTATGCTTTAAACCTGTTAATTCTTTTTTGGTAGAAGTTTGAAGCAGAAGATTACAACTATTCATTGCTATAGGTTTATAAGCCCTAGCACACATATATAGAACAATCTAGAAGCTTCTATCTTTTATTTTTTTTAAAGAACAATTTTTCATTAATACTTAAAAAGTTGCAATGATATCAACGTAAAAGGACTCCCTTTAAGCTTTAGGAGTGCACGTAATAAAGAACTACAAGAAAGTCTCCCAACTATGATTAATTATAGTAGAAGAATAATTACAAATGGGCTAGATAAAGCACACCAAGAAACAACTTTATATTTAGCAATATCCACCCTTTCATTCCACTCTAAAGACTTTCCTTCCAAGATTCTTAGATTGCTTTGATGGCATTCACCCATGTCAACTTGACTTTAATTTGGAGCTTTACTCTGGATAAAATCTGAAAATAGTCTTTGGAACATTTATCAAACACCCATGTAACTTCAAAATGATCTGGATCTTCACCATCCTTGAAACGAAGTACACATTCTGATGGCTGTAAAACAGCCATTGGCCATTTCCTCTCTAATTTTTCTGCTCTGTTTAAAAGCACCCTTATGCGGAATCCATTAAAAAATATTCATCATCTTTGGCCAAATCTAGAAGCTTTTGTCTGTATCTTGTAAAGGCTTACATTAACAACAATAGAGAAGCAGCACCGTAAGGAAGGAAAATCATGGATGGAAAATAACCCTTTGTTAATCACTTTCTTGTGCTAAAATTAGGTTAATAAATACTTCAAAATATTTTATGCAAGTCTTTGGGAATTTGGCGGTTCAAAGCTCTGCCTTGTTCAAAAAGTTCCTCCAATTGGCATCCTCTTGCTAAGAGCTAAAGAATATCACAAAAAGTTCTTTCAATTGGAATTAAAGAGGGTGGTATAATAACGAACGCTTCAAAGAGAATGCACTAAGAAGAAGCAAAGAACTTACTATGTCTAAATTAGTGAAATGACATCCAATATTTATTCATTTAGATATAGTATTGGCAAGCATGATGTCTAAATGCAAAAAATAGTATAAAATAAGGTCTCCATGTAGATATGGTGTTGGGAAGCACTATATCCAAATTAGCAACCAAGAGGACTTCCAAAATTCGTTTTCTAACTAACCATAGAAGCTTCTTAGATATGCTGTTTGGAAGCACCTCAGGAGAAATACAAATGAAGGGAGAACAAGAAATCCTTCCTAAAGCCAAAGAGATTAGGAAAAAACCACTTTGATTTGCATAAAGAAAAATGTAGAATAATTACAAAAAACATTAGCGAGCTCCATAAAATAGTGAAGAATTTGATAATGTCCCATATGTCACTAGGAACTCCATATCTTCCCACAAAAGTTCTACTATTCATTTCCATCTAAACCTCCCAAGTTGAGGCTTTAACCACATCGATCAAAAGGTTTCTGGCTTTTCTTCCATACGGTGACCCACATGGAAGTTGTTCCTCCATCTCTTGCATGATGTATGGAAAATCTCATGAATGTTAAAGGAATCAAAGATGTGACGTCACCATTTCTAGAATAAGGAAAATGCAGAACGATGTGGCTAAAATCCTCGCTGGTACCTTTGCAGAGCACACACCAACTAGATGATAGATACTAGATACATGTCGGGGAATTTCTTTTGGATTCTAGCATGGGTGCTAATACTGGCCAAAAAACAGATCATAGGAAGAAATTGATTTTCTTGGGAGCTTTCTTCTTCAAATTGTTTTTTAGCCAAGTTAGAAGGATAATTTAGGATTTCTCATTAAGCTCGGTAATGAGGAACTTGGTAGTGAAGTTTCCTGGAGCTCAAACTTCCAAACCCCACGATCTTCATAGTGAGACAGAAACCATCTAACATTTCTTTGAGGTAGTTCGCTCCTCTGTTTCTGCATTGCTATGGTCTCTTCTAAGATAAAAATCGAGCTACTGGTTTCTTTATTCCAACAAGGCTCAGAGTGAAGCACTAGCAGTGCCAGTATAGATAGAATTGTCCATTGCAGTTTAGGAAACAGCATTGAAAGGGTGATCGTTCCCTTGATCCTCAAGTGGTCCCAAAAATTCACACCCTCACCATTCCTACCTTTAATCTGGAAATAGAGGAAATCAGATGTCTTTGACTAGAAATAAAACTCCAGCCAGTTCTTGCACATTAGCATCCTGACAACTTAAGTGAAGCAGCCAGCAGGATTTTCCCCATGCTTGCTGGAAATAGTTTTGTCATAAATGCATTAATTTCCAGAGGAAACTTCTACAGCCATTTGGAAATGAAAGCAAGATGCCTTTTCCGTAGATTGCCCAATCTCCATCCTCCTCTTTCCAACTGCGAACAAGCACTTTTTCCCTTCACCAAATGACCCCCTCCATCTCTAGCACCTCTGTCCTTCGGAAATTTCCTTGAGGCTCTTTCTAGGATTTCACTGATCCTAGCAGGAATCTTGAAGATAGCCAAATAATTAGTAAGCAGGTTTCTTTCTAGAATTGATTGAAGTAGGAGCCGAATTGATTGAAGCAAGCCTACTTCCTTTGGTGATGAAGGAAGCACTTTTGTATCTCTCAAAGAAAAATTTTAAGATTATTTTTCTTAATTTTTTTTTAAAATTTCTCTGCCTTCAGTTGTAAATTCTTGTTAAGATCATCAAGCGCGACCTGGTATGATGATGATTTGGATTCACATCTAGACTACTGGCTTGCTTTATTGACTTCTCATATTGTATATATTTTTCTCTTCTATATTCTCGGTTTTGTTTCTTTGGCAAATAGTTGTCAGTCAGTTGTCTCTTCTCCCTCCATAAGTTTTGTATACTTGCTATGCTGAGGACATAATGCTTGCTCCATGCTGACACCATAAGTGAACTTCATGACAGGAACCTCGTCTAGGAAGGTGGGGGCGAATCCCGGCTGCTCGGAATGGAGTTGCAGTTCATCCCCTTAAGCTCTCATGAGACTGAACTGAAGAGTTCTATATATACCCTATCCAATTTTGACTTCTCAGAGTTTTAAAGTAAAATTGAAAAAGTTTACACCCCCCCAAATCTGTATTTATTTATTTGTGTTATTTCACCTACCTAACTTTCACCACCTTTTGGCCGTGGAAACAATAGTCAAGTTAGAGCATTCATGTTTCTTACTGAAAAGAGGTGAGTTTGCAAGCCACTGGGGGCATAAACTACTCAACAGAATTTTCCAACTGCATAGGGAGAGATGAGATGAATTGCAAAGATAGATTTTGTTTATAATCACATGGGCCAGTTGGGTTTTTCCAGAATTTCCCTGTTATGATGATGTACCTTTTCGGCAATTCATTTTCAACTGTTTCCAAGTAAGTCGGTGTGCTAAGTTTCACAGTGTTCGCATTCATCACTTGTACATCACAAACTGGAGACAATCTTCAATGGACTTCCTTTATTTCTATCTTGATTTGTGATTTTCCATGAAAAATAAATGAATTAATTCATGGAATATTGGTATCATGATGGTGTGATAAAAGACTACGGAAAACTGTATTGATTTATTTGGTTTATGGTTACTGTAATAATGATATTTGGCCTTCGAAAAAGCAAATTATTTGTCGAATCACAATTGTTTATAAGGTAATAATTTTAGAGATTTATTGTGGTCAACTTGAAGATGACTGATGTTCCCAACTTTGACATGATGTCGTTGAAAGTGAATTGCCTATTCCTATTCATGCGCATCAGAAAATGATAAAATAAATGGAAAGTATAGTTTGTGTAATATTTTTTGTTAAAAAAAGCAGTTCTTTTATCCCCTAAAAAAGGTTGAAATAACAATTTTAAGAATAATGAAAAGAGCTTTTAACAAAATAATTATAATAAGAAAGTTGAATTATAATTTTCGATTTGAAGTATCTTTGATAAACAAAGATTGATACATACTTGGTATGGAATGATTACGATTTTGTTACTATGATGAATAGAGTGTTTTGCCCCGATCTTCAATGCTCCTATTTCGGAGGAAAATACCCTCTCTTTGCAACTTTATAAAAATGCTAGGGTCGAATGCTAAAGTTGCTAGAATATGCTTGATGGTGTGGAGCTAATATAAATGCTCAAACTAAGACATGGCGTTAAAACACTTGCTGTAATGAATTGTATTAGTTTTGTATTTGAATCTATATTTCCTTTTTTGGATTGCTACACATTGAGAAAAGTTGCCTTGTTGAGTATAATACTCCTTGAATGGGAGATTAGAAGATAATTGAAAGAGCAGCTCGAAAGCTTCTTACTTTTTAGTATTTTTATCAGACTCATCAAGGAAAAAAGGTCAACCGTAATGACCATATAAGCAAACCACAAATAATTTGCCTACTTTAGTGCTAACAAGTTATGTTATAAATAATTATATACATGTCTGAATTTGAATTATAAGAGGTCTATCATTTTTTTGGCTATGTTGCTTTATTAGACATTTGATTGGAAGTTTAAAGACTTATTACACATTTGTTAAAGTTTAAAAATCTATTAGACAAAAAATCGAAAGTTTAAGAGATCTATTAAATATTTTTTTTTAAAGTTCAAGAACTTATTAAATATAATTCTGAAAGTTGGAAACCAAACTTGTAATTTAACCGTCGTTTATGATGATTTTTTTGGAAAAAAATCTTATCATTGTTACATTTTTTTTAAAGTATAATAACAAAATCACAAAAATATTTTAAGACTTGAGTTTAAAAATATGTTATAACTGGCAAGAAGAATTGACTGCACTATGTGTGTCCAGCCAAATCTTTTCCCAATAATAATATTACAGCTCAGACAATTTTAGGATACCAAAAAAAAAAAAAACCTAATAATTCCATTCTTTTACCAAACAAAGTTTTGGCATTTGTTAATACAAAAGCAAATGTCTTCCTTTTTCCTGGGTTCTTTTCTTTTAAATTTTAGTTAACCTAATTCATGTTTAATCTTGCTAAATGCTAACCTATTTCCCAAAATAGGCTCCCGGAAGGCTATAACAAGCGCGAAATCTAGAGGGCAAAATGGGCAAAAAGAACTACGAGCGAGGCCGAGGAGATCTCTCCGGCGTACTCTGATTTGAGGCCCTCGCTTCCTGCAAGTGTTCGTACGTCGCGTCCTTGGAGCCGCCGTCGCCGCAAACTCCGATGGCGGCTGAGGCAGAGAGACAGCGACGGACATCTTGACCTCCGTACACGTAGAGCGCCAGAGGAATGTCACTGCTAATGTTAACGTCGCCGGTCGGATTCTGAGACGAGGAGCAAGAAGAAGCAGTGCCACCGGCGTTGCGGCCGGAGACGTAGAACGAAGTTGCCGGCTGAATGGCGGTGTCGTTGATTACCATGGCGTAGTCTCTCACAGAAACAGGTAAGCGGCAAGGCGAGGCGGGAGTGGAGCTCTCTCCGCCGCCACCAACAACCACGCTACTCGTCGGCGTCGGAGTGGTACTCCTCACAGTCCCACAAACTGAACATCTATTCAGAATCTAAGAAGAAGAATAGGAATGAAATTCAGAAAAAGATGAACCGGAAAAAAAAAAAAAACTATGGAGTGGAGGGGGAAAGAAATGGGGAATGAGGAAGGTACGGATGCGGCGGAGCGATAGGTGGTGCCGTCGGGCTCGACGACCCAGCCGGCCTCTTTGGCGAGCTCACGGAGGACTTCATTGATATCAGCTCTGGGGGAGAGGCGGTAGCCGCCGTGCTTTCTCAGGCCATGGAAGATATTGGTGGTGATGGCTCGTCGTTGCCGTTCCCTCATCTTTGTCTTCTCTTTCTCAGTCTCGCTTCTCACTCCTCCTCCGGCCAAACCCGACCCGACTCTTTTTCCGGCCGACCCACCTGCAGGAACTTCAGTTCCTTCCTTCATCTCTCTCTCTCTCACCATAACCTCTGCGTTTTCCTTCTCTCTCTGTGTGCAGAAGATGAGATTTTGGAGACTTGTTTTGATTTTGTACTTGAAAGGAGAGGGGGGGGGGGGGCATCATGGTGGTGATGGGGATCAGACGGTTGAGAAGAAGGGTAAGAGAAATGTACGGTGGTGATTTGAGAAAAAGTAGAAGGAAGTGTACGATGGAGATGGAGATGGAAGCAGGTGATAATATTTTCCTATCTTTCGTAGTTGTTTCTTTTTCTGCATGACAGAGAAGGTGGTGAGAATCGTAGTTCGCTCACAATGAATTTGCATGTAGAAGAGGGAGTGGGGTCACGCGCCAGGAGGAAGAGGAAGGGACTGACAAAAAGATTTTCGGTTTCTTGGCGGCTGGTCAAAGAAAATGACTCTTTAAGCTCAGTACCAACACAAAGTATGAATTTCCTTACTTAGTTCCATCCCTATTCATCCTCATCGATAATGTCATTAATAAAATGTTGATATAACATTTATGATAAATTTTGATTAAATTGAGGAGATGTATCAGATAACTAAACAGACATGTCATATTAATATTCTTATTTATGATATTAATAGTAATGACTAAATTGAAACATAATCTAAAATTCAAAAACAAAATTAAAATTTTTTAAAGTTTAATTCGATAGCCCCCTCCTTTCTAGTAACAGTCTCTTGCAACTCCCATTTTTGGCAGAAGACTGAACTCAGTAAGTAACAAACATTCCTATTACTAGGGCTTTTCTTGAGATCATTGACTCAGTTCTTATTCTCTCTCCATAATTCTTCCCCTATAAAGTTACCCATCTTTCAGTGTATTGATCATTTGAGTATTTAGCAAAATACCACATCTTGTTAGTTGAACACTTTGATGATCATTTTTTGTTTTGGCTTTGTAGGTTTTAAAAAATTGGAGCCTCTACACTCACCCTTTCTAATTTCAAGTAATTCTTTTTTACTCACTACACACACCTATTTCTCTCCTTTCTGAAAACCTCTTTGACTTACACAAACGTCACCATGACCTTTTTCTATCTTGAAAGTGACTATCTGAACTTGTTTTGGATTGACCCGAGCTTGACGCTAAACAGATACCTCTCATCTAATTGGGTGTCATGTGAAGATGGAGGATATTTGGCGGTAAGAAAAGTGATTATCAACCCCCTATTCACGTTGATAATGTCCATTTAAATTCCCAATCGTCATATGTATCGAAATGTCAACTATAAACATTAAAAGAATGGCTAGATACCAAAAAGGTTGTTAGAAAACTAACTTTTATAGCAACCACAAGTAAATAAGCTATCTTGGCCGATAGGTTTGGAAAAAAGAAAACACTGAAATTTCTGATTTCACAAAGAGAATATGTATGCCGTTGAAACAAAGGGGCAATAGAAGCTAGAATGAATTAAGCTATCAAATGAGAAACCCAGCAGAGGGCAAATATGCTTGTATCCATATTTCATTAGGATTTCACCTCATAATGATATTACTCTGTGTACCTCAATTTCCTTGAAACGCCAGTATTGAGGTAAAAATTTGAAGGTATTAACACGATATAAACAAATTTGTCTGAATTGAACCTCATACCTAGTGAATAATGATCACATGATGAAAAGAGTTCTGGAAGATGATGTATCTACATTTTCTGATTCCCATTGTGGTATGATTACTTGCAGTCATACTTTCTGTCATTACTGAGGCATTTCAAGGTCTGGTCTCCTGTCAAGAGCTTCACTGGTTGACTCATCTGGAAATGCATCAGGATAAGTAG

mRNA sequence

ATGGAAGTTATACATCTGAAGGCAAAACTCTGTAACTGCAGGTACAAGTTTGGTTTCTGTCCAAATGGTCCTGATTGTCGGTATAGGCATGCAAAGCTGCCTGCACCGCCACCTCCAGTGGAAGAAATCCTTCAGAAAATACAGCACTTGAGTTCATACAATTATGGTCCCTCAAACAAATTCTTTTCACAACGTGGAGTTGGCTTATCCCAACAAAACGAAAAATCTCAATTTCCTCAGGGTCCGGCCACCGTGAATCAAGGAGTGATAGGAAAACCTTCTACAGCAGAATCTGGTAATGTCCAACAACAGCAAGCTCAACAGTCCGCACAACAACCCAGCCAGACACAGATACAAAGTCTTTCTAATGGCCAGCCCAATCAATTAAACAGAACTGCAACATCTTTGCCTCAAGGAATATCTAGGTGTGTCTCAAGAGACCTGAAGTACTTTATTGTTAAAAGTTGCAACCGCGAGAATTTGGAATTATCTGTACAACAGGGGGTATGGGCAACTCAAAGAAGCAATGAAGCTAAACTTAATGAAGCTTTTGATTCTGCTGATAATGTTATTTTGATTTTCTCGGTCAATCGGACTCGACATTTCCAGGGCTGTGCAAAGATGATGTCCAGGATTGGTGGATCTGTCAGTGGGGGCAATTGGAAATATGCCCATGGAACTGCACATTATGGGCAAAACTTTTCACTCAAATGGCTGAAGTTATGTGAACTATCCTTTCAGAAAACTCGCCATTTGAGGAATCCTTATAATGAAAACTTACCAGTAAAGATCAGTAGAGATTGCCAAGAACTAGAGCCCTCTATTGGTGAGCAGCTGGCTTCTTTGCTTTATCTTGAGCCAGATGGTGAACTCATGGCTGTCTCACTAGCAGCAGAATCGAAACGAGAAGAGGAGAAGGCAAAGGGAGTTAATCCTGATATTGGAAGTGAGAACCCAGATATTGTCCCTTTTGAGGACAACGAAGAAGAGGAAGAAGAAGAAAGTGAAGAGGAGGAAGAGGAGAGCTTTGTCCAGTCCGTTGGTCTACCAGCTCAGGGCAGAGGAAGGGGCAGGGGAATCATGTGGCCTCCACACATGCCGATGGGACGTGGTGCCAGACCCTTCCATGGAATGCAGGGTTTTCCACCTGGGATGATGGGTCCGGATGGGTTGTCTTATGGACCTGTTACACCTGATGGATTTCCGATGCCTGACATTTTTGGTATGGCTCCCCGTGGTTTCAGTCCATATGGTCCTAGGTTTTCTGGTGATTTTATGGGCCCCCCATCTGCTATGATGTTTCGCGGACGACCTTCTCAACCTGGGGCCATGTTTCCCCCTGCTGGGTTTGGCATGATGATGGGTCAAGGACGTGGTCCCTATATGGGTGGGATGGGTGTTACTGGTACTAACCCAGCTCGAGCTGGTCGGCCTGTGGGTGTGTCTCCATTGTATCCACCTCCTGCAGTACCCTCATCTCAGAACATCAACCGAGTTGTGAAGAGGGATCAAAGAGGACCAGCTAATGATCGCAATGATAGATATATTACGGGTCCGGACCAAAGCAAGGGCCAGGAGATGCCAGGGAGTGGACATGATGACGAGATGCAATACAAGCAGGGATCAAAAGCTTATCCTGATGAGCAATATGGCATGGGAACCACGTTTAGGAATGAAGAAAGTGAAAGTGAGGATGAGGCACCTCGGCGGTCAAGGCATGGAGAGGGGAAGAAGAAGCGGAGAGGCTCAGAAGGAGATGCCACTGCAATCTCTGATCAAACTCCTCCTCCAGACGCCGCCCCTCCCCTAACCTCACCATCTCTCTGCATTCTGTCCCCCATCTCTTTCTCGGCAACCCCCTTCGTCTCCCTTTCAGCCACGCCTTCACTCCAGCCACCAGGTGACGGCGGTTTTCTCTTTCGCTCGACTATGAAGAAGATCACGAAGAAGAAAAGCTCGGCTCCTAAAGGGGCAACGAAGGGGAAGAATTTTCCACTCGATAAGGACCCTTTTTTCAGTTCTGAATCGAGGAAACGGAGGAAGACTGTCGATGAGAATGATGAAATTGAGAGCGGTGAATCAGACGAGGATACTGGGTTCATGGGCTCTGCTGCGGAGAGGGGGAAATATGAAGAAGCTGAGGATGAGCAGTTTGAGGAAGAAACTGCCGACGAGAAGAGGAATAGAGTGGCAAACGAGTACGTGGACAAAATTTGGGAAATTGCGAGGAGAGAAAAGGAGAGGAAGGATGAAGAAAAAGATTCACTTGTTGCCCAGATTCTTCAGCAGGAGCAACTCGAAGATAGCGGAAGGGTTAGGAGAGAAATTGCATCAAGGGTTCAGAAGCCAGAAGCTAGAGATGAATTTCAAGTCTTGATTAAGCACAGACAAACTGTTACAGCTGTGGCTCTATCTGATGATGATTCGAAGGGGTTTTCAACGTCTAAGGATGGTACCATCTTACATTGGGATGTAGATAGTGGGAAAGGGGAAAAATACCAATGGCCCAGTGATGAAGTGTTGAGATTGCATGGTGTTAAGGACCCGCAAGGTCGGGCTACAATGCACAGTAAAGTCATTTTGTCATTGGCAGTCAGTTCTGATGGTCGATACTTGGCAAGTGGAGGCTTAGACCGCCATGTGCATATATGGGACACTCGTATAAGAGAACATATTCAGGCATTTCCTGGTCATAGAGGACCTGTTTCATGTTTGACTTTCAGGCAAGGGACCTCAGAACTTTTTTCCGGTTCATATGATCGAACTGTCAAGATATGGAATGTAGAGGATAGAGCTTATATAAATACACTATTTGGTCACCAAAGTGAAGTATTGACTATTGATTGCCTACGGAAAGAAAGATTGCTTACTGTTGGACGTGACCGGAGCATGCAATTATGGAAGGTTCCAGAGGAGTCCCGTTTAGTATTTCGTGCACCTGCATCATCCTTGGAATGTTGCTGTTTTATAAGCAATGATGAGTTCTTATCCGGATCAGATGATGGCAGTATTGAGCTTTGGAGTTTATTGAAAAAGAAGCCTGTTTCTATTGTACGAAATGCTCACCCTCTTTCATTCTCTTGCACGAATTTGGAGCTAAAGGAAAATGGAGCCATCCCCATTGGATGTATGGGAAATGGGGATGTCAATTCTAATACTTCTCACAGTTTGTCAGCATACTCTTGGGTAAGTTCAGTCTCAGTATGTAGAAACAGTGACCTTGCTGCATCGGGTGCCGGCAACGGTTCTGTTCGTTTATGGGCTCTCACAAGTGATAAGAAAGATATTCGACCATTATATGACTTTCCTCTGGTTGGATTTGTGAACTCCTTGACTTTTGCCAAATCTGGAAGGTTTGTGGTAGCTGGAGTTGGGCAGAGGGCAAAATGGGCAAAAAGAACTACGAGCGAGGCCGAGGAGATCTCTCCGGCGTACTCTGATTTGAGGCCCTCGCTTCCTGCAAGTGTTCGTACGTCGCGTCCTTGGAGCCGCCGTCGCCGCAAACTCCGATGGCGGCTGAGGCAGAGAGACAGCGACGGACATCTTGACCTCCGTACACGAGCAAGAAGAAGCAGTGCCACCGGCGTTGCGGCCGGAGACGTAGAACGAAGTTGCCGGCTGAATGGCGGTGTCGTTGATTACCATGGCGTAGTCTCTCACAGAAACAGGTACGGATGCGGCGGAGCGATAGGTGGTGCCGTCGGGCTCGACGACCCAGCCGGCCTCTTTGGCGAGCTCACGGAGGACTTCATTGATATCAGCTCTGGGGGAGAGGCGGTAGCCGCCGTGCTTTCTCAGGCCATGGAAGATATTGGTGTCTCGCTTCTCACTCCTCCTCCGGCCAAACCCGACCCGACTCTTTTTCCGGCCGACCCACCTGCAGGAACTTCAGTTCCTTCCTTCATCTCTCTCTCTCTCACCATAACCTCTGCGTTTTCCTTCTCTCTCTGTGTGCAGAAGATGAGATTTTGGAGACTTGTTTTGATTTTGTACTTGAAAGGAGAGGGGGGGGGGGGGCATCATGGTGGTGATGGGGATCAGACGGTTGAGAAGAAGGGTAAGAGAAATTTGTTTCTTTTTCTGCATGACAGAGAAGGTGGTGAGAATCGTAGTTCGCTCACAATGAATTTGCATGTAGAAGAGGGAGTGGGGTCACGCGCCAGGAGGAAGAGGAAGGGACTGACAAAAAGATTTTCGGTTTCTTGGCGGCTGGTCAAAGAAAATGACTCTTTAAGCTCAGTACCAACACAAATCATACTTTCTGTCATTACTGAGGCATTTCAAGGTCTGGTCTCCTGTCAAGAGCTTCACTGGTTGACTCATCTGGAAATGCATCAGGATAAGTAG

Coding sequence (CDS)

ATGGAAGTTATACATCTGAAGGCAAAACTCTGTAACTGCAGGTACAAGTTTGGTTTCTGTCCAAATGGTCCTGATTGTCGGTATAGGCATGCAAAGCTGCCTGCACCGCCACCTCCAGTGGAAGAAATCCTTCAGAAAATACAGCACTTGAGTTCATACAATTATGGTCCCTCAAACAAATTCTTTTCACAACGTGGAGTTGGCTTATCCCAACAAAACGAAAAATCTCAATTTCCTCAGGGTCCGGCCACCGTGAATCAAGGAGTGATAGGAAAACCTTCTACAGCAGAATCTGGTAATGTCCAACAACAGCAAGCTCAACAGTCCGCACAACAACCCAGCCAGACACAGATACAAAGTCTTTCTAATGGCCAGCCCAATCAATTAAACAGAACTGCAACATCTTTGCCTCAAGGAATATCTAGGTGTGTCTCAAGAGACCTGAAGTACTTTATTGTTAAAAGTTGCAACCGCGAGAATTTGGAATTATCTGTACAACAGGGGGTATGGGCAACTCAAAGAAGCAATGAAGCTAAACTTAATGAAGCTTTTGATTCTGCTGATAATGTTATTTTGATTTTCTCGGTCAATCGGACTCGACATTTCCAGGGCTGTGCAAAGATGATGTCCAGGATTGGTGGATCTGTCAGTGGGGGCAATTGGAAATATGCCCATGGAACTGCACATTATGGGCAAAACTTTTCACTCAAATGGCTGAAGTTATGTGAACTATCCTTTCAGAAAACTCGCCATTTGAGGAATCCTTATAATGAAAACTTACCAGTAAAGATCAGTAGAGATTGCCAAGAACTAGAGCCCTCTATTGGTGAGCAGCTGGCTTCTTTGCTTTATCTTGAGCCAGATGGTGAACTCATGGCTGTCTCACTAGCAGCAGAATCGAAACGAGAAGAGGAGAAGGCAAAGGGAGTTAATCCTGATATTGGAAGTGAGAACCCAGATATTGTCCCTTTTGAGGACAACGAAGAAGAGGAAGAAGAAGAAAGTGAAGAGGAGGAAGAGGAGAGCTTTGTCCAGTCCGTTGGTCTACCAGCTCAGGGCAGAGGAAGGGGCAGGGGAATCATGTGGCCTCCACACATGCCGATGGGACGTGGTGCCAGACCCTTCCATGGAATGCAGGGTTTTCCACCTGGGATGATGGGTCCGGATGGGTTGTCTTATGGACCTGTTACACCTGATGGATTTCCGATGCCTGACATTTTTGGTATGGCTCCCCGTGGTTTCAGTCCATATGGTCCTAGGTTTTCTGGTGATTTTATGGGCCCCCCATCTGCTATGATGTTTCGCGGACGACCTTCTCAACCTGGGGCCATGTTTCCCCCTGCTGGGTTTGGCATGATGATGGGTCAAGGACGTGGTCCCTATATGGGTGGGATGGGTGTTACTGGTACTAACCCAGCTCGAGCTGGTCGGCCTGTGGGTGTGTCTCCATTGTATCCACCTCCTGCAGTACCCTCATCTCAGAACATCAACCGAGTTGTGAAGAGGGATCAAAGAGGACCAGCTAATGATCGCAATGATAGATATATTACGGGTCCGGACCAAAGCAAGGGCCAGGAGATGCCAGGGAGTGGACATGATGACGAGATGCAATACAAGCAGGGATCAAAAGCTTATCCTGATGAGCAATATGGCATGGGAACCACGTTTAGGAATGAAGAAAGTGAAAGTGAGGATGAGGCACCTCGGCGGTCAAGGCATGGAGAGGGGAAGAAGAAGCGGAGAGGCTCAGAAGGAGATGCCACTGCAATCTCTGATCAAACTCCTCCTCCAGACGCCGCCCCTCCCCTAACCTCACCATCTCTCTGCATTCTGTCCCCCATCTCTTTCTCGGCAACCCCCTTCGTCTCCCTTTCAGCCACGCCTTCACTCCAGCCACCAGGTGACGGCGGTTTTCTCTTTCGCTCGACTATGAAGAAGATCACGAAGAAGAAAAGCTCGGCTCCTAAAGGGGCAACGAAGGGGAAGAATTTTCCACTCGATAAGGACCCTTTTTTCAGTTCTGAATCGAGGAAACGGAGGAAGACTGTCGATGAGAATGATGAAATTGAGAGCGGTGAATCAGACGAGGATACTGGGTTCATGGGCTCTGCTGCGGAGAGGGGGAAATATGAAGAAGCTGAGGATGAGCAGTTTGAGGAAGAAACTGCCGACGAGAAGAGGAATAGAGTGGCAAACGAGTACGTGGACAAAATTTGGGAAATTGCGAGGAGAGAAAAGGAGAGGAAGGATGAAGAAAAAGATTCACTTGTTGCCCAGATTCTTCAGCAGGAGCAACTCGAAGATAGCGGAAGGGTTAGGAGAGAAATTGCATCAAGGGTTCAGAAGCCAGAAGCTAGAGATGAATTTCAAGTCTTGATTAAGCACAGACAAACTGTTACAGCTGTGGCTCTATCTGATGATGATTCGAAGGGGTTTTCAACGTCTAAGGATGGTACCATCTTACATTGGGATGTAGATAGTGGGAAAGGGGAAAAATACCAATGGCCCAGTGATGAAGTGTTGAGATTGCATGGTGTTAAGGACCCGCAAGGTCGGGCTACAATGCACAGTAAAGTCATTTTGTCATTGGCAGTCAGTTCTGATGGTCGATACTTGGCAAGTGGAGGCTTAGACCGCCATGTGCATATATGGGACACTCGTATAAGAGAACATATTCAGGCATTTCCTGGTCATAGAGGACCTGTTTCATGTTTGACTTTCAGGCAAGGGACCTCAGAACTTTTTTCCGGTTCATATGATCGAACTGTCAAGATATGGAATGTAGAGGATAGAGCTTATATAAATACACTATTTGGTCACCAAAGTGAAGTATTGACTATTGATTGCCTACGGAAAGAAAGATTGCTTACTGTTGGACGTGACCGGAGCATGCAATTATGGAAGGTTCCAGAGGAGTCCCGTTTAGTATTTCGTGCACCTGCATCATCCTTGGAATGTTGCTGTTTTATAAGCAATGATGAGTTCTTATCCGGATCAGATGATGGCAGTATTGAGCTTTGGAGTTTATTGAAAAAGAAGCCTGTTTCTATTGTACGAAATGCTCACCCTCTTTCATTCTCTTGCACGAATTTGGAGCTAAAGGAAAATGGAGCCATCCCCATTGGATGTATGGGAAATGGGGATGTCAATTCTAATACTTCTCACAGTTTGTCAGCATACTCTTGGGTAAGTTCAGTCTCAGTATGTAGAAACAGTGACCTTGCTGCATCGGGTGCCGGCAACGGTTCTGTTCGTTTATGGGCTCTCACAAGTGATAAGAAAGATATTCGACCATTATATGACTTTCCTCTGGTTGGATTTGTGAACTCCTTGACTTTTGCCAAATCTGGAAGGTTTGTGGTAGCTGGAGTTGGGCAGAGGGCAAAATGGGCAAAAAGAACTACGAGCGAGGCCGAGGAGATCTCTCCGGCGTACTCTGATTTGAGGCCCTCGCTTCCTGCAAGTGTTCGTACGTCGCGTCCTTGGAGCCGCCGTCGCCGCAAACTCCGATGGCGGCTGAGGCAGAGAGACAGCGACGGACATCTTGACCTCCGTACACGAGCAAGAAGAAGCAGTGCCACCGGCGTTGCGGCCGGAGACGTAGAACGAAGTTGCCGGCTGAATGGCGGTGTCGTTGATTACCATGGCGTAGTCTCTCACAGAAACAGGTACGGATGCGGCGGAGCGATAGGTGGTGCCGTCGGGCTCGACGACCCAGCCGGCCTCTTTGGCGAGCTCACGGAGGACTTCATTGATATCAGCTCTGGGGGAGAGGCGGTAGCCGCCGTGCTTTCTCAGGCCATGGAAGATATTGGTGTCTCGCTTCTCACTCCTCCTCCGGCCAAACCCGACCCGACTCTTTTTCCGGCCGACCCACCTGCAGGAACTTCAGTTCCTTCCTTCATCTCTCTCTCTCTCACCATAACCTCTGCGTTTTCCTTCTCTCTCTGTGTGCAGAAGATGAGATTTTGGAGACTTGTTTTGATTTTGTACTTGAAAGGAGAGGGGGGGGGGGGGCATCATGGTGGTGATGGGGATCAGACGGTTGAGAAGAAGGGTAAGAGAAATTTGTTTCTTTTTCTGCATGACAGAGAAGGTGGTGAGAATCGTAGTTCGCTCACAATGAATTTGCATGTAGAAGAGGGAGTGGGGTCACGCGCCAGGAGGAAGAGGAAGGGACTGACAAAAAGATTTTCGGTTTCTTGGCGGCTGGTCAAAGAAAATGACTCTTTAAGCTCAGTACCAACACAAATCATACTTTCTGTCATTACTGAGGCATTTCAAGGTCTGGTCTCCTGTCAAGAGCTTCACTGGTTGACTCATCTGGAAATGCATCAGGATAAGTAG

Protein sequence

MEVIHLKAKLCNCRYKFGFCPNGPDCRYRHAKLPAPPPPVEEILQKIQHLSSYNYGPSNKFFSQRGVGLSQQNEKSQFPQGPATVNQGVIGKPSTAESGNVQQQQAQQSAQQPSQTQIQSLSNGQPNQLNRTATSLPQGISRCVSRDLKYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSADNVILIFSVNRTRHFQGCAKMMSRIGGSVSGGNWKYAHGTAHYGQNFSLKWLKLCELSFQKTRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDGELMAVSLAAESKREEEKAKGVNPDIGSENPDIVPFEDNEEEEEEESEEEEEESFVQSVGLPAQGRGRGRGIMWPPHMPMGRGARPFHGMQGFPPGMMGPDGLSYGPVTPDGFPMPDIFGMAPRGFSPYGPRFSGDFMGPPSAMMFRGRPSQPGAMFPPAGFGMMMGQGRGPYMGGMGVTGTNPARAGRPVGVSPLYPPPAVPSSQNINRVVKRDQRGPANDRNDRYITGPDQSKGQEMPGSGHDDEMQYKQGSKAYPDEQYGMGTTFRNEESESEDEAPRRSRHGEGKKKRRGSEGDATAISDQTPPPDAAPPLTSPSLCILSPISFSATPFVSLSATPSLQPPGDGGFLFRSTMKKITKKKSSAPKGATKGKNFPLDKDPFFSSESRKRRKTVDENDEIESGESDEDTGFMGSAAERGKYEEAEDEQFEEETADEKRNRVANEYVDKIWEIARREKERKDEEKDSLVAQILQQEQLEDSGRVRREIASRVQKPEARDEFQVLIKHRQTVTAVALSDDDSKGFSTSKDGTILHWDVDSGKGEKYQWPSDEVLRLHGVKDPQGRATMHSKVILSLAVSSDGRYLASGGLDRHVHIWDTRIREHIQAFPGHRGPVSCLTFRQGTSELFSGSYDRTVKIWNVEDRAYINTLFGHQSEVLTIDCLRKERLLTVGRDRSMQLWKVPEESRLVFRAPASSLECCCFISNDEFLSGSDDGSIELWSLLKKKPVSIVRNAHPLSFSCTNLELKENGAIPIGCMGNGDVNSNTSHSLSAYSWVSSVSVCRNSDLAASGAGNGSVRLWALTSDKKDIRPLYDFPLVGFVNSLTFAKSGRFVVAGVGQRAKWAKRTTSEAEEISPAYSDLRPSLPASVRTSRPWSRRRRKLRWRLRQRDSDGHLDLRTRARRSSATGVAAGDVERSCRLNGGVVDYHGVVSHRNRYGCGGAIGGAVGLDDPAGLFGELTEDFIDISSGGEAVAAVLSQAMEDIGVSLLTPPPAKPDPTLFPADPPAGTSVPSFISLSLTITSAFSFSLCVQKMRFWRLVLILYLKGEGGGGHHGGDGDQTVEKKGKRNLFLFLHDREGGENRSSLTMNLHVEEGVGSRARRKRKGLTKRFSVSWRLVKENDSLSSVPTQIILSVITEAFQGLVSCQELHWLTHLEMHQDK
Homology
BLAST of Sgr019399 vs. NCBI nr
Match: XP_022140120.1 (30-kDa cleavage and polyadenylation specificity factor 30 [Momordica charantia])

HSP 1 Score: 1084.7 bits (2804), Expect = 0.0e+00
Identity = 551/585 (94.19%), Postives = 556/585 (95.04%), Query Frame = 0

Query: 9   KLCNCRYKFGFCPNGPDCRYRHAKLPAPPPPVEEILQKIQHLSSYNYGPSNKFFSQRGVG 68
           K CN  YKFGFCPNGPDCRYRHAKLP PPPPVEEILQKIQHLSSYNYGPSNKFFSQRG G
Sbjct: 131 KECN-MYKFGFCPNGPDCRYRHAKLPGPPPPVEEILQKIQHLSSYNYGPSNKFFSQRGAG 190

Query: 69  LSQQNEKSQFPQGPATVNQGVIGKPSTAESGNVQQQQAQQSAQQPSQTQIQSLSNGQPNQ 128
           LS Q EK QFPQGPATVNQGV+GKPSTAES NVQQ QAQQSAQQ SQTQIQSLSNGQPNQ
Sbjct: 191 LSHQIEKPQFPQGPATVNQGVVGKPSTAESANVQQPQAQQSAQQTSQTQIQSLSNGQPNQ 250

Query: 129 LNRTATSLPQGISRCVSRDLKYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSAD 188
           LNRTATSLPQGISR       YFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSAD
Sbjct: 251 LNRTATSLPQGISR-------YFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSAD 310

Query: 189 NVILIFSVNRTRHFQGCAKMMSRIGGSVSGGNWKYAHGTAHYGQNFSLKWLKLCELSFQK 248
           NVILIFSVNRTRHFQGCAKMMSRIGGSVSGGNWKYAHGTAHYGQNFSLKWLKLCELSFQK
Sbjct: 311 NVILIFSVNRTRHFQGCAKMMSRIGGSVSGGNWKYAHGTAHYGQNFSLKWLKLCELSFQK 370

Query: 249 TRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDGELMAVSLAAESKREEEKAK 308
           TRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDGELMAVSLAAESKREEEKAK
Sbjct: 371 TRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDGELMAVSLAAESKREEEKAK 430

Query: 309 GVNPDIGSENPDIVPFEDNEEEEEEESEEEEEESFVQSVGLPAQGRGRGRGIMWPPHMPM 368
           GVNPDIGSENPDIVPFEDNEEEEEEESEEEEEESF QSVGLPAQGRGRGRGIMWPPHMPM
Sbjct: 431 GVNPDIGSENPDIVPFEDNEEEEEEESEEEEEESFGQSVGLPAQGRGRGRGIMWPPHMPM 490

Query: 369 GRGARPFHGMQGFPPGMMGPDGLSYGPVTPDGFPMPDIFGMAPRGFSPYGPRFSGDFMGP 428
           GRGARPFHGMQGFPPGMMGPDGLSYGPVTPDGFPMPDIFGMAPRGF PYGPRFSGDFMGP
Sbjct: 491 GRGARPFHGMQGFPPGMMGPDGLSYGPVTPDGFPMPDIFGMAPRGFGPYGPRFSGDFMGP 550

Query: 429 PSAMMFRGRPSQPGAMFPPAGFGMMMGQGRGPYMGGMGVTGTNPARAGRPVGVSPLYPPP 488
           PSAMMFRGRPSQPGAMFPP GFGMMMGQGRGP+MGGMGVTG NPAR GRPV VS LYPPP
Sbjct: 551 PSAMMFRGRPSQPGAMFPPGGFGMMMGQGRGPFMGGMGVTGANPARPGRPVSVSQLYPPP 610

Query: 489 AVPSSQNINRVVKRDQRGPANDRNDRYITGPDQSKGQEMPGSGHDDEMQYKQGSKAYPDE 548
           AVP SQN+NRVVKRDQRGP NDRNDRYI GPDQSKGQEMPGSGHDDEMQYKQGSKAYPDE
Sbjct: 611 AVPPSQNMNRVVKRDQRGPGNDRNDRYIAGPDQSKGQEMPGSGHDDEMQYKQGSKAYPDE 670

Query: 549 QYGMGTTFRNEESESEDEAPRRSRHGEGKKKRRGSEGDATAISDQ 594
           QYG+GTTFRNEESESEDEAPRRSRHGEGKKKRRGSEGDAT +SDQ
Sbjct: 671 QYGIGTTFRNEESESEDEAPRRSRHGEGKKKRRGSEGDATVVSDQ 707

BLAST of Sgr019399 vs. NCBI nr
Match: XP_008445183.1 (PREDICTED: 30-kDa cleavage and polyadenylation specificity factor 30 [Cucumis melo] >KAA0036926.1 30-kDa cleavage and polyadenylation specificity factor 30 [Cucumis melo var. makuwa] >TYK05548.1 30-kDa cleavage and polyadenylation specificity factor 30 [Cucumis melo var. makuwa])

HSP 1 Score: 1065.4 bits (2754), Expect = 4.2e-307
Identity = 542/585 (92.65%), Postives = 552/585 (94.36%), Query Frame = 0

Query: 9   KLCNCRYKFGFCPNGPDCRYRHAKLPAPPPPVEEILQKIQHLSSYNYGPSNKFFSQRGVG 68
           K CN  YKFGFCPNGPDCRYRHAKLP PPP VEEILQKIQHL SYNYG SNKFFSQRGVG
Sbjct: 134 KECN-MYKFGFCPNGPDCRYRHAKLPGPPPSVEEILQKIQHLGSYNYGSSNKFFSQRGVG 193

Query: 69  LSQQNEKSQFPQGPATVNQGVIGKPSTAESGNVQQQQAQQSAQQPSQTQIQSLSNGQPNQ 128
           L QQNEKSQFPQGPA V QGVIGKPSTAES NVQQQQ QQ AQQ SQTQIQS+SNGQPNQ
Sbjct: 194 LPQQNEKSQFPQGPAPVTQGVIGKPSTAESANVQQQQVQQPAQQTSQTQIQSVSNGQPNQ 253

Query: 129 LNRTATSLPQGISRCVSRDLKYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSAD 188
           LNRTATSLPQGISR       YFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSAD
Sbjct: 254 LNRTATSLPQGISR-------YFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSAD 313

Query: 189 NVILIFSVNRTRHFQGCAKMMSRIGGSVSGGNWKYAHGTAHYGQNFSLKWLKLCELSFQK 248
           NVILIFSVNRTRHFQGCAKMMSRIGGSVSGGNWKYAHGTAHYGQNFSLKWLKLCELSFQK
Sbjct: 314 NVILIFSVNRTRHFQGCAKMMSRIGGSVSGGNWKYAHGTAHYGQNFSLKWLKLCELSFQK 373

Query: 249 TRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDGELMAVSLAAESKREEEKAK 308
           TRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDGELMAVS+AAESKREEEKAK
Sbjct: 374 TRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDGELMAVSIAAESKREEEKAK 433

Query: 309 GVNPDIGSENPDIVPFEDNEEEEEEESEEEEEESFVQSVGLPAQGRGRGRGIMWPPHMPM 368
           GVNPDIG+ENPDIVPFEDNEEEEEEESEEEEEESF QSVGLPAQGRGRGRGIMWPPHMPM
Sbjct: 434 GVNPDIGNENPDIVPFEDNEEEEEEESEEEEEESFGQSVGLPAQGRGRGRGIMWPPHMPM 493

Query: 369 GRGARPFHGMQGFPPGMMGPDGLSYGPVTPDGFPMPDIFGMAPRGFSPYGPRFSGDFMGP 428
           GRGARPFHGMQ FPPGMMGPDGLSYGPVTPDGFPMPDIFGMAPRGF PYGPRFSGDFMGP
Sbjct: 494 GRGARPFHGMQSFPPGMMGPDGLSYGPVTPDGFPMPDIFGMAPRGFGPYGPRFSGDFMGP 553

Query: 429 PSAMMFRGRPSQPGAMFPPAGFGMMMGQGRGPYMGGMGVTGTNPARAGRPVGVSPLYPPP 488
           PSAMMFRGRPSQPGAMF P GFGMMMGQGRGP+MGGMGVTGT+PAR GRPVGVSPLYPPP
Sbjct: 554 PSAMMFRGRPSQPGAMFTPGGFGMMMGQGRGPFMGGMGVTGTSPARPGRPVGVSPLYPPP 613

Query: 489 AVPSSQNINRVVKRDQRGPANDRNDRYITGPDQSKGQEMPGSGHDDEMQYKQGSKAYPDE 548
           AVPS+QNINR +KRDQRGP +DRNDRYI GPDQ+KGQEM  SGHD+ MQYKQGSKAYPDE
Sbjct: 614 AVPSAQNINRAIKRDQRGPTSDRNDRYIVGPDQNKGQEMLSSGHDEGMQYKQGSKAYPDE 673

Query: 549 QYGMGTTFRNEESESEDEAPRRSRHGEGKKKRRGSEGDATAISDQ 594
           QYGMGTTFRNEESESEDEAPRRSRHGEGKKKRRGSEGDATAISDQ
Sbjct: 674 QYGMGTTFRNEESESEDEAPRRSRHGEGKKKRRGSEGDATAISDQ 710

BLAST of Sgr019399 vs. NCBI nr
Match: XP_038894441.1 (30-kDa cleavage and polyadenylation specificity factor 30-like [Benincasa hispida])

HSP 1 Score: 1057.7 bits (2734), Expect = 8.7e-305
Identity = 538/585 (91.97%), Postives = 551/585 (94.19%), Query Frame = 0

Query: 9   KLCNCRYKFGFCPNGPDCRYRHAKLPAPPPPVEEILQKIQHLSSYNYGPSNKFFSQRGVG 68
           K CN  YKFGFCPNGPDCRYRHAKLP PPP VEEILQKIQH+ SYN+GPSNK F QRGVG
Sbjct: 132 KECN-MYKFGFCPNGPDCRYRHAKLPGPPPSVEEILQKIQHIGSYNHGPSNKLFLQRGVG 191

Query: 69  LSQQNEKSQFPQGPATVNQGVIGKPSTAESGNVQQQQAQQSAQQPSQTQIQSLSNGQPNQ 128
           LSQQNEKSQFPQGPA V QGVIGKPSTAES NVQQ Q QQS QQ SQTQIQSLSNGQPNQ
Sbjct: 192 LSQQNEKSQFPQGPAPVTQGVIGKPSTAESANVQQPQVQQSTQQTSQTQIQSLSNGQPNQ 251

Query: 129 LNRTATSLPQGISRCVSRDLKYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSAD 188
           LNRTATSLPQGISR       YFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSAD
Sbjct: 252 LNRTATSLPQGISR-------YFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSAD 311

Query: 189 NVILIFSVNRTRHFQGCAKMMSRIGGSVSGGNWKYAHGTAHYGQNFSLKWLKLCELSFQK 248
           NVILIFSVNRTRHFQGCAKMMSRIGGSVSGGNWKYAHGTAHYGQNFSLKWLKLCELSFQK
Sbjct: 312 NVILIFSVNRTRHFQGCAKMMSRIGGSVSGGNWKYAHGTAHYGQNFSLKWLKLCELSFQK 371

Query: 249 TRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDGELMAVSLAAESKREEEKAK 308
           TRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDGELMAVS+AAE+KREEEKAK
Sbjct: 372 TRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDGELMAVSIAAEAKREEEKAK 431

Query: 309 GVNPDIGSENPDIVPFEDNEEEEEEESEEEEEESFVQSVGLPAQGRGRGRGIMWPPHMPM 368
           GVNPDIG+ENPDIVPFEDNEEEEEEESEEEEEESF QSVGLPAQGRGRGRGIMWPPHMPM
Sbjct: 432 GVNPDIGNENPDIVPFEDNEEEEEEESEEEEEESFGQSVGLPAQGRGRGRGIMWPPHMPM 491

Query: 369 GRGARPFHGMQGFPPGMMGPDGLSYGPVTPDGFPMPDIFGMAPRGFSPYGPRFSGDFMGP 428
           GRGARPFHGMQGFPPGM+GPDGLSYGPVTP+GFPMPDIFGMAPRGF PYGPRFSGDFMGP
Sbjct: 492 GRGARPFHGMQGFPPGMIGPDGLSYGPVTPEGFPMPDIFGMAPRGFGPYGPRFSGDFMGP 551

Query: 429 PSAMMFRGRPSQPGAMFPPAGFGMMMGQGRGPYMGGMGVTGTNPARAGRPVGVSPLYPPP 488
           PSAMMFRGRPSQPGAMF P+GFGMMMGQGRGP+MGGMGVTGTNPAR GRPVGVSPLYPPP
Sbjct: 552 PSAMMFRGRPSQPGAMFSPSGFGMMMGQGRGPFMGGMGVTGTNPARPGRPVGVSPLYPPP 611

Query: 489 AVPSSQNINRVVKRDQRGPANDRNDRYITGPDQSKGQEMPGSGHDDEMQYKQGSKAYPDE 548
           AVPS QNINR VKRDQRGP +DRNDRYI G DQ+KGQEMPGSG+DD MQYKQGSK Y DE
Sbjct: 612 AVPSPQNINRAVKRDQRGPTSDRNDRYIVGLDQNKGQEMPGSGYDDGMQYKQGSKGYSDE 671

Query: 549 QYGMGTTFRNEESESEDEAPRRSRHGEGKKKRRGSEGDATAISDQ 594
           QYGMGTTFRNEESESEDEAPRRSRHGEGK+KRRGSEGDATAISDQ
Sbjct: 672 QYGMGTTFRNEESESEDEAPRRSRHGEGKEKRRGSEGDATAISDQ 708

BLAST of Sgr019399 vs. NCBI nr
Match: XP_038889902.1 (30-kDa cleavage and polyadenylation specificity factor 30-like [Benincasa hispida])

HSP 1 Score: 1045.0 bits (2701), Expect = 5.8e-301
Identity = 534/586 (91.13%), Postives = 548/586 (93.52%), Query Frame = 0

Query: 9   KLCNCRYKFGFCPNGPDCRYRHAKLPAPPPPVEEILQKIQHLSSYNYGPSNKFFSQRGVG 68
           K CN  YKFGFCPNGPDCRYRHAKLP PPPPVEEILQKIQHL SYNYG SNKFF+QRGVG
Sbjct: 132 KECN-MYKFGFCPNGPDCRYRHAKLPGPPPPVEEILQKIQHLGSYNYGSSNKFFTQRGVG 191

Query: 69  LSQQNEKSQFPQGPATVNQGVI-GKPSTAESGNVQQQQAQQSAQQPSQTQIQSLSNGQPN 128
           LSQQNEKSQFPQG   V QGV+ GKPS AES NV QQQ QQSAQQ SQT IQSLSNGQPN
Sbjct: 192 LSQQNEKSQFPQGQPLVTQGVVTGKPSAAESANVPQQQGQQSAQQTSQTPIQSLSNGQPN 251

Query: 129 QLNRTATSLPQGISRCVSRDLKYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSA 188
           QLNRTATSLPQGISR       YFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSA
Sbjct: 252 QLNRTATSLPQGISR-------YFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSA 311

Query: 189 DNVILIFSVNRTRHFQGCAKMMSRIGGSVSGGNWKYAHGTAHYGQNFSLKWLKLCELSFQ 248
           DNVILIFSVNRTRHFQGCAKMMSRIGGSVSGGNWKYAHGTAHYGQNFSLKWLKLCELSFQ
Sbjct: 312 DNVILIFSVNRTRHFQGCAKMMSRIGGSVSGGNWKYAHGTAHYGQNFSLKWLKLCELSFQ 371

Query: 249 KTRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDGELMAVSLAAESKREEEKA 308
           KTRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDGELMAVSLAAESKREEEKA
Sbjct: 372 KTRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDGELMAVSLAAESKREEEKA 431

Query: 309 KGVNPDIGSENPDIVPFEDNEEEEEEESEEEEEESFVQSVGLPAQGRGRGRGIMWPPHMP 368
           KGVNPDIGSENPDIVPFEDNEEEEEEESEEEEEESF QSVGLP QGRGRGRGI+WPPHMP
Sbjct: 432 KGVNPDIGSENPDIVPFEDNEEEEEEESEEEEEESFGQSVGLPPQGRGRGRGIVWPPHMP 491

Query: 369 MGRGARPFHGMQGFPPGMMGPDGLSYGPVTPDGFPMPDIFGMAPRGFSPYGPRFSGDFMG 428
           MGRGARPF GMQGFPPGMMGPDGLSYGPVTPDGFPMPDIFGMAPRGF PYGPRFSGDFMG
Sbjct: 492 MGRGARPFPGMQGFPPGMMGPDGLSYGPVTPDGFPMPDIFGMAPRGFGPYGPRFSGDFMG 551

Query: 429 PPSAMMFRGRPSQPGAMFPPAGFGMMMGQGRGPYMGGMGVTGTNPARAGRPVGVSPLYPP 488
           PP+AMMFRGRPSQPGAMFPP GFGMMMGQGRGP+MGGMG+ GTNP+R GRPVGVSPLYPP
Sbjct: 552 PPTAMMFRGRPSQPGAMFPPGGFGMMMGQGRGPFMGGMGIAGTNPSRPGRPVGVSPLYPP 611

Query: 489 PAVPSSQNINRVVKRDQRGPANDRNDRYITGPDQSKGQEMPGSGHDDEMQYKQGSKAYPD 548
           PAVPSSQN+NR +KRDQRGPA++RNDRYI G DQ+KG EMP SG DDEMQYKQGSKAY D
Sbjct: 612 PAVPSSQNMNRAIKRDQRGPASERNDRYIVGLDQNKGLEMPSSGRDDEMQYKQGSKAYSD 671

Query: 549 EQYGMGTTFRNEESESEDEAPRRSRHGEGKKKRRGSEGDATAISDQ 594
           EQYG+GTTFRNEESESEDEAPRRSRHGEGKKKRRGSEGDATAIS+Q
Sbjct: 672 EQYGVGTTFRNEESESEDEAPRRSRHGEGKKKRRGSEGDATAISNQ 709

BLAST of Sgr019399 vs. NCBI nr
Match: KAG6573397.1 (30-kDa cleavage and polyadenylation specificity factor 30, partial [Cucurbita argyrosperma subsp. sororia] >KAG7012560.1 30-kDa cleavage and polyadenylation specificity factor 30, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1038.1 bits (2683), Expect = 7.1e-299
Identity = 529/585 (90.43%), Postives = 546/585 (93.33%), Query Frame = 0

Query: 9   KLCNCRYKFGFCPNGPDCRYRHAKLPAPPPPVEEILQKIQHLSSYNYGPSNKFFSQRGVG 68
           K CN  YKFGFCPNGPDCRYRHAKLP PPPPVEEILQKIQHL  YNYG SNKFFSQRGVG
Sbjct: 131 KECN-MYKFGFCPNGPDCRYRHAKLPGPPPPVEEILQKIQHLGPYNYGTSNKFFSQRGVG 190

Query: 69  LSQQNEKSQFPQGPATVNQGVIGKPSTAESGNVQQQQAQQSAQQPSQTQIQSLSNGQPNQ 128
           LSQQNEKSQFP GPATV QGVIGKPS AES NVQQQQAQQSAQQ SQ  IQ+ SNGQ NQ
Sbjct: 191 LSQQNEKSQFPPGPATVTQGVIGKPSVAESANVQQQQAQQSAQQTSQAPIQTASNGQVNQ 250

Query: 129 LNRTATSLPQGISRCVSRDLKYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSAD 188
           LNR +TSLP GISR       YFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSAD
Sbjct: 251 LNRNSTSLPPGISR-------YFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSAD 310

Query: 189 NVILIFSVNRTRHFQGCAKMMSRIGGSVSGGNWKYAHGTAHYGQNFSLKWLKLCELSFQK 248
           NVILIFSVNRTRHFQGCAKMM+RIGGSVSGGNWKYAHGTAHYGQNFSLKWLKLCELSFQK
Sbjct: 311 NVILIFSVNRTRHFQGCAKMMTRIGGSVSGGNWKYAHGTAHYGQNFSLKWLKLCELSFQK 370

Query: 249 TRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDGELMAVSLAAESKREEEKAK 308
           TRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDG+LMAVS+AAESKREEEKAK
Sbjct: 371 TRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDGDLMAVSIAAESKREEEKAK 430

Query: 309 GVNPDIGSENPDIVPFEDNEEEEEEESEEEEEESFVQSVGLPAQGRGRGRGIMWPPHMPM 368
           GVNPDI +ENPDIVPFEDNEEEEEEESEEEEEESF QSVG PAQGRGRGRG+MWPPHMPM
Sbjct: 431 GVNPDIRNENPDIVPFEDNEEEEEEESEEEEEESFGQSVGPPAQGRGRGRGMMWPPHMPM 490

Query: 369 GRGARPFHGMQGFPPGMMGPDGLSYGPVTPDGFPMPDIFGMAPRGFSPYGPRFSGDFMGP 428
           GRGARPFHGMQGFPPGMMGPDG+SYGPVTPDGFPMPDIFGMAPRGF+PYG RFSG+FM P
Sbjct: 491 GRGARPFHGMQGFPPGMMGPDGMSYGPVTPDGFPMPDIFGMAPRGFNPYGARFSGEFMNP 550

Query: 429 PSAMMFRGRPSQPGAMFPPAGFGMMMGQGRGPYMGGMGVTGTNPARAGRPVGVSPLYPPP 488
            SAMMFRGRPSQPGAMFPP GFGMMMGQGRGP+MGGMGVTGTNPARAGRPVG  PLYPPP
Sbjct: 551 QSAMMFRGRPSQPGAMFPPGGFGMMMGQGRGPFMGGMGVTGTNPARAGRPVG-PPLYPPP 610

Query: 489 AVPSSQNINRVVKRDQRGPANDRNDRYITGPDQSKGQEMPGSGHDDEMQYKQGSKAYPDE 548
           AVPSSQN+NR +KRDQRGP +DRNDRYI G DQ++GQE+PGSGHDDEMQYKQGSKAYPDE
Sbjct: 611 AVPSSQNMNRAMKRDQRGPGSDRNDRYIAGSDQNRGQEIPGSGHDDEMQYKQGSKAYPDE 670

Query: 549 QYGMGTTFRNEESESEDEAPRRSRHGEGKKKRRGSEGDATAISDQ 594
           QYGMGTT RNEESESEDEAPRRSRHGEGKKKRRGSEGDATAISDQ
Sbjct: 671 QYGMGTTIRNEESESEDEAPRRSRHGEGKKKRRGSEGDATAISDQ 706

BLAST of Sgr019399 vs. ExPASy Swiss-Prot
Match: A9LNK9 (30-kDa cleavage and polyadenylation specificity factor 30 OS=Arabidopsis thaliana OX=3702 GN=CPSF30 PE=1 SV=1)

HSP 1 Score: 604.4 bits (1557), Expect = 3.5e-171
Identity = 356/575 (61.91%), Postives = 407/575 (70.78%), Query Frame = 0

Query: 9   KLCNCRYKFGFCPNGPDCRYRHAKLPAPPPPVEEILQKIQHLSSYNYGPSNKFFSQRGVG 68
           K CN  YK GFCPNGPDCRYRHAKLP PPPPVEE+LQKIQ L++YNYG +N+ +  R V 
Sbjct: 118 KECN-MYKLGFCPNGPDCRYRHAKLPGPPPPVEEVLQKIQQLTTYNYG-TNRLYQARNVA 177

Query: 69  LSQQNEKSQFPQGPATVNQGVIGKPSTAESGNVQQQQAQQSAQQPSQTQIQSLSNGQPNQ 128
              Q+     PQG   +     G+P   ESGN+QQQQ QQ  Q   Q   Q+L     +Q
Sbjct: 178 PQLQDR----PQGQVPMQ----GQPQ--ESGNLQQQQQQQPQQSQHQVS-QTLIPNPADQ 237

Query: 129 LNRTATSLPQGISRCVSRDLKYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSAD 188
            NRT+  LPQG++R       YF+VKS NREN ELSVQQGVWATQRSNEAKLNEAFDS +
Sbjct: 238 TNRTSHPLPQGVNR-------YFVVKSNNRENFELSVQQGVWATQRSNEAKLNEAFDSVE 297

Query: 189 NVILIFSVNRTRHFQGCAKMMSRIGGSVSGGNWKYAHGTAHYGQNFSLKWLKLCELSFQK 248
           NVILIFSVNRTRHFQGCAKM SRIGG + GGNWK+ HGTA YG+NFS+KWLKLCELSF K
Sbjct: 298 NVILIFSVNRTRHFQGCAKMTSRIGGYIGGGNWKHEHGTAQYGRNFSVKWLKLCELSFHK 357

Query: 249 TRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDGELMAVSLAAESKREEEKAK 308
           TR+LRNPYNENLPVKISRDCQELEPS+GEQLASLLYLEPD ELMA+S+AAE+KREEEKAK
Sbjct: 358 TRNLRNPYNENLPVKISRDCQELEPSVGEQLASLLYLEPDSELMAISIAAEAKREEEKAK 417

Query: 309 GVNPDIGSENPDIVPFEDNEEEEEEESEEEEEESFVQSVGLPAQGRGRGRGIMWPPHMPM 368
           GVNP+  +ENPDIVPFEDNEEEEEEE E EEEE   +S+    QGRGRGRGIMWPP MP+
Sbjct: 418 GVNPESRAENPDIVPFEDNEEEEEEEDESEEEE---ESMAGGPQGRGRGRGIMWPPQMPL 477

Query: 369 GRGARPFHGMQGFPPGMMGP-DGLSYGPVTPDGFPMPDIFGMAPRGFSPYGPRFSGDFMG 428
           GRG RP  GM GFP G+MGP D   YGP   +G  MPD FGM PR F PYGPRF GDF G
Sbjct: 478 GRGIRPMPGMGGFPLGVMGPGDAFPYGPGGYNG--MPDPFGMGPRPFGPYGPRFGGDFRG 537

Query: 429 PPSAMMFRGRPSQPGAMFPPAGFGMMMGQGRGPYMGGMGVTGTNPARAGRPVGVSPLYPP 488
           P   MMF GRP Q    FP  G+G MMG GRGP+MGGMG    N  R GR     P+Y P
Sbjct: 538 PVPGMMFPGRPPQ---QFPHGGYG-MMGGGRGPHMGGMG----NAPRGGR-----PMYYP 597

Query: 489 PAVPSSQNINRVVKRDQRGPANDRNDRYITGPDQSKGQEMPGSGHDDEMQYKQGSKAYPD 548
           PA  S+          + GP+N +       P++S  + + G   + +  +         
Sbjct: 598 PATSSA----------RPGPSNRKT------PERSDERGVSGDQQNQDASHDM------- 631

Query: 549 EQYGMGTTFRNEESES--EDEAPRRSRHGEGKKKR 581
           EQ+ +G + RNEESES  EDEAPRRSRHGEGKK+R
Sbjct: 658 EQFEVGNSLRNEESESEDEDEAPRRSRHGEGKKRR 631

BLAST of Sgr019399 vs. ExPASy Swiss-Prot
Match: Q0DA50 (Zinc finger CCCH domain-containing protein 45 OS=Oryza sativa subsp. japonica OX=39947 GN=Os06g0677700 PE=4 SV=1)

HSP 1 Score: 587.8 bits (1514), Expect = 3.3e-166
Identity = 344/586 (58.70%), Postives = 406/586 (69.28%), Query Frame = 0

Query: 9   KLCNCRYKFGFCPNGPDCRYRHAKLPAPPPPVEEILQKIQHLSSYNYGPSNKFFSQRGVG 68
           K CN  YK GFCPNGP+CRY+H KLP PPPPVEE+LQKI  + S+     NKF   R   
Sbjct: 116 KECN-MYKMGFCPNGPNCRYKHVKLPGPPPPVEEVLQKILQIRSF-----NKFNQHRHNN 175

Query: 69  LSQQNEKSQFPQGPATVNQGVIGKPSTAESGNVQQQQAQQSAQQPSQ------------- 128
            +QQ E+ Q PQG    NQ  I   +T  +     QQAQ + QQP Q             
Sbjct: 176 YNQQGERPQHPQGSGLPNQNSIDNTTTTTAQPAVGQQAQTTNQQPPQQQQQQQQQQQQQQ 235

Query: 129 -----TQIQSLSNGQPNQLNRTATSLPQGISRCVSRDLKYFIVKSCNRENLELSVQQGVW 188
                 Q+QS+ NG  NQ  R AT LPQG SR       YFIVKSCNRENLE+SVQQG+W
Sbjct: 236 KPNTNDQVQSVPNGSSNQATRIATPLPQGPSR-------YFIVKSCNRENLEISVQQGIW 295

Query: 189 ATQRSNEAKLNEAFDSADNVILIFSVNRTRHFQGCAKMMSRIGGSVSGGNWKYAHGTAHY 248
           ATQRSNEAKLNEAF+S +NVILIFS+NRTR+FQGCAKM SRIGG + GGNWK AHGTAHY
Sbjct: 296 ATQRSNEAKLNEAFESIENVILIFSINRTRNFQGCAKMTSRIGGYIGGGNWKSAHGTAHY 355

Query: 249 GQNFSLKWLKLCELSFQKTRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDGE 308
           G+NFS++WLKLCELSFQKT HLRNPYN+NLPVKISRDCQELEP IGEQLASLLYLEPD E
Sbjct: 356 GRNFSIQWLKLCELSFQKTHHLRNPYNDNLPVKISRDCQELEPFIGEQLASLLYLEPDSE 415

Query: 309 LMAVSLAAESKREEEKAKGVNPDIGSENPDIVPFEDNEEEEEEESEEEEEESFVQSVGLP 368
           L A+ +AAE+K+EEEKAKGV+ D  ++N DIV F+DNEEEEEEESEEEEE +     G  
Sbjct: 416 LTAILIAAEAKKEEEKAKGVSADEAADNQDIVLFDDNEEEEEEESEEEEEGN-----GQE 475

Query: 369 AQGRGRGRGIMWPPHMPMGRGARPFHGMQGFPPGMMGPDGLSYGPVTPDGFPMPDIFGMA 428
           +QGRGRGRG+MWPP MPM RG  P  G +GFPP M+G DG  +G     GF MPD FG+ 
Sbjct: 476 SQGRGRGRGMMWPPQMPMLRGVGPMMGGRGFPPNMIG-DGFGFG----GGFGMPDPFGV- 535

Query: 429 PRGFSPYGPRFSGDFM--GPPSAMMFRGRPSQPGAMFPPAGFGMMMGQGRGPYMGGMGVT 488
           PRGF P+GPRF GDF   GP   M+F GRP QPG MF P G  MMMG GRGP MGG+G+ 
Sbjct: 536 PRGFPPFGPRFPGDFARGGPMPGMVFPGRPPQPGGMF-PMGLEMMMGPGRGPLMGGLGMG 595

Query: 489 GTNPARAGRPVGVSPLYPPPAVPSSQNINRVVKRDQRGPANDRNDRYITGPDQ-SKGQEM 548
           G  P R  RPVG++P  PPP  P+    NR  KR+QR P  +R DRY T  DQ S+G + 
Sbjct: 596 G--PGRPNRPVGMAPFMPPPPPPN----NRGTKREQRRPGGERGDRYETTSDQGSRGHDA 655

Query: 549 PGSGHDDEMQYKQGSKAYPDEQYGMGTTFRNEESESEDE-APRRSR 573
            G+         +G+++   ++YG  +  R+++SES++E APRRSR
Sbjct: 656 TGNSG------AEGARSQSGDRYGR-SALRDDDSESDEEAAPRRSR 663

BLAST of Sgr019399 vs. ExPASy Swiss-Prot
Match: Q9M0V4 (U3 snoRNP-associated protein-like YAO OS=Arabidopsis thaliana OX=3702 GN=YAO PE=2 SV=1)

HSP 1 Score: 518.8 bits (1335), Expect = 1.9e-145
Identity = 277/493 (56.19%), Postives = 360/493 (73.02%), Query Frame = 0

Query: 649  KKKSSAPKGATKGKNFPLDKDPFFSSESRKRRKTVDENDEIESGESD-EDTGFMGSAAER 708
            KK  S  +G  KG N   ++DPFF  E +KRRK   ++D+IES +SD E+ GF G   + 
Sbjct: 8    KKGGSFKRGGKKGSN---ERDPFFEEEPKKRRKVSYDDDDIESVDSDAEENGFTGGDEDG 67

Query: 709  GKYE-EAEDE-QFEEETADEKRNRVANEYVDKIWEIARREKER------KDEEKDSLVAQ 768
             + + E EDE +F +ETA EKR R+A E +++  E  RRE+E        DE+ D  + +
Sbjct: 68   RRVDGEVEDEDEFADETAGEKRKRLAEEMLNRRREAMRREREEADNDDDDDEDDDETIKK 127

Query: 769  ILQQEQLEDSGRVRREIASRVQKPEARDEFQVLIKHRQTVTAVALSDDDSKGFSTSKDGT 828
             L Q+Q EDSGR+RR IASRVQ+P + D F V++KHR++V +VALSDDDS+GFS SKDGT
Sbjct: 128  SLMQKQQEDSGRIRRLIASRVQEPLSTDGFSVIVKHRRSVVSVALSDDDSRGFSASKDGT 187

Query: 829  ILHWDVDSGKGEKYQWPSDEVLRLHGVKDPQGRATMHSKVILSLAVSSDGRYLASGGLDR 888
            I+HWDV SGK +KY WPSDE+L+ HG+K  + R   HS+  L+LAVSSDGRYLA+GG+DR
Sbjct: 188  IMHWDVSSGKTDKYIWPSDEILKSHGMKLREPRNKNHSRESLALAVSSDGRYLATGGVDR 247

Query: 889  HVHIWDTRIREHIQAFPGHRGPVSCLTFRQGTSELFSGSYDRTVKIWNVEDRAYINTLFG 948
            HVHIWD R REH+QAFPGHR  VSCL FR GTSEL+SGS+DRTVK+WNVED+A+I    G
Sbjct: 248  HVHIWDVRTREHVQAFPGHRNTVSCLCFRYGTSELYSGSFDRTVKVWNVEDKAFITENHG 307

Query: 949  HQSEVLTIDCLRKERLLTVGRDRSMQLWKVPEESRLVFRAPASSLECCCFISNDEFLSGS 1008
            HQ E+L ID LRKER LTVGRDR+M   KVPE +R+++RAPASSLE CCFIS++E+LSGS
Sbjct: 308  HQGEILAIDALRKERALTVGRDRTMLYHKVPESTRMIYRAPASSLESCCFISDNEYLSGS 367

Query: 1009 DDGSIELWSLLKKKPVSIVRNAHPLSFSCTNLELKENGAIPIGCMGNGDVNSNTSHSLSA 1068
            D+G++ LW +LKKKPV + +NAH         +   +G    G + NGD +   +++ SA
Sbjct: 368  DNGTVALWGMLKKKPVFVFKNAH---------QDIPDGITTNGILENGD-HEPVNNNCSA 427

Query: 1069 YSWVSSVSVCRNSDLAASGAGNGSVRLWALTSDKKDIRPLYDFPLVGFVNSLTFAKSGRF 1128
             SWV++V+  R SDLAASGAGNG VRLWA+ ++   IRPLY+ PL GFVNSL FAKSG+F
Sbjct: 428  NSWVNAVATSRGSDLAASGAGNGFVRLWAVETNA--IRPLYELPLTGFVNSLAFAKSGKF 485

Query: 1129 VVAGVGQRAKWAK 1133
            ++AGVGQ  ++ +
Sbjct: 488  LIAGVGQETRFGR 485

BLAST of Sgr019399 vs. ExPASy Swiss-Prot
Match: Q3MKM6 (U3 snoRNP-associated protein-like EMB2271 OS=Arabidopsis thaliana OX=3702 GN=EMB2271 PE=2 SV=1)

HSP 1 Score: 506.1 bits (1302), Expect = 1.3e-141
Identity = 277/511 (54.21%), Postives = 354/511 (69.28%), Query Frame = 0

Query: 646  KITKKKSSAPKGATKGKNFPLDKDPFFSSESRKRRK-TVDENDEIESGESDEDTGFMGSA 705
            K+ KKK    K   +GK   +D DPF   E+ KRRK   D++D+IES ES+E+       
Sbjct: 2    KLEKKKGIGAK--RRGKKSSIDHDPFLEEETEKRRKFNYDDDDDIESVESEEE------- 61

Query: 706  AERGKYEEAEDEQFEEETADEKRNRVANEYVDKIWEIARREKERKDEE----KDSLVAQI 765
               GK  E  +++F  ET  EKR R+A + +++I E  +RE E  +EE    +DSLVA+ 
Sbjct: 62   ---GKVGEEVEDEFAHETVGEKRKRLAEDTLNRIEEAKQREHEEDNEEDDDFRDSLVAKT 121

Query: 766  LQQEQLEDSGRVRREIASRVQKPEARDEFQVLIKHRQTVTAVALSDDDSKGFSTSKDGTI 825
            L QEQLE SGRVRR  A RVQ  ++ D+F+V++KH+ +VT VALSDDDS+GFS SKDGTI
Sbjct: 122  LMQEQLEKSGRVRRANALRVQDLQSSDKFRVIVKHQHSVTGVALSDDDSRGFSVSKDGTI 181

Query: 826  LHWDVDSGKGEKYQWPSDEVLRLHGVKDPQGRATMHSKVILSLAVSSDGRYLASGGLDRH 885
            LHWDV SGK ++Y+WPSDEVL+ HG+K  +   T H+K  L+LAVSSDGRYLA+GG+D H
Sbjct: 182  LHWDVSSGKSDEYKWPSDEVLKSHGLKFQESWYTRHNKQSLALAVSSDGRYLATGGVDCH 241

Query: 886  VHIWDTRIREHIQAFPGHRGPVSCLTFRQGTSELFSGSYDRTVKIWNVEDRAYINTLFGH 945
            VH+WD R REH+QAF GH G VS L FR+GT+ELFSGSYD T+ IWN E R YI + FGH
Sbjct: 242  VHLWDIRTREHVQAFTGHCGIVSSLCFREGTAELFSGSYDGTLSIWNAEHRTYIESCFGH 301

Query: 946  QSEVLTIDCLRKERLLTVGRDRSMQLWKVPEESRLVFRAPASSLECCCFISNDEFLSGSD 1005
            QSE+L+ID L +ER+L+VGRDR+MQL+KVPE +RL++RA  S+ ECCCF+++DEFLSGSD
Sbjct: 302  QSELLSIDALGRERVLSVGRDRTMQLYKVPESTRLIYRASESNFECCCFVNSDEFLSGSD 361

Query: 1006 DGSIELWSLLKKKPVSIVRNAHPLSFSCTNLELKENGAIPIGCMGNGDVNSNTSHSLSAY 1065
            +GSI LWS+LKKKPV IV NAH +                       D +S   +   A 
Sbjct: 362  NGSIALWSILKKKPVFIVNNAHHVI---------------------ADHDSVNHNCTPAC 421

Query: 1066 SWVSSVSVCRNSDLAASGAGNGSVRLWALTSDKKDIRPLYDFPLVGFVNSLTFAKSGRFV 1125
            SWVSSV+VCR S+LAASGAGNG VRLW + S    I+PLY+ PL GFVNSL FAKSGRF+
Sbjct: 422  SWVSSVAVCRGSELAASGAGNGCVRLWGVESGSSAIQPLYELPLPGFVNSLAFAKSGRFL 479

Query: 1126 VAGVGQRAKWAKRTTSEAEEISPAYSDLRPS 1152
            +AGVGQ  +  +    ++ +   A   LR S
Sbjct: 482  IAGVGQEPRLGRWGCLKSAQNGVAIHPLRLS 479

BLAST of Sgr019399 vs. ExPASy Swiss-Prot
Match: Q75LV5 (U3 snoRNP-associated protein-like YAOH OS=Oryza sativa subsp. japonica OX=39947 GN=Os03g0625900 PE=2 SV=1)

HSP 1 Score: 485.7 bits (1249), Expect = 1.8e-135
Identity = 265/493 (53.75%), Postives = 342/493 (69.37%), Query Frame = 0

Query: 645  KKITKKKSSAPKGATKGKNFPLDKDPFFSSESRKRRKTVDENDEIESGESD-EDTGFMGS 704
            K++++ K   P+  ++G+    D+DPFF SE  KRR+    +++IES +SD E      +
Sbjct: 7    KRVSRPK---PRATSRGRGGG-DEDPFFESEP-KRRRGGGRDEDIESEDSDLEGVAAAAA 66

Query: 705  AAERGKYEEAEDEQFEEETADEKRNRVANEYVDKIWEIARREKERKDEEKDS------LV 764
                   EE E+E+ E+ETA EK+ R+A E + K+ + ARR +E  ++E +        V
Sbjct: 67   GGVGDDGEEEEEEEEEQETAGEKKMRIAKELLKKVTDAARRRREDDEDEDEGEEAGRRRV 126

Query: 765  AQILQQEQLEDSGRVRREIASRVQKPEARDEFQVLIKHRQTVTAVALSDDDSKGFSTSKD 824
            A IL + Q E+SGR R E+A R+ +P+  D F++L+KHRQ VTAV LS D  KGFS SKD
Sbjct: 127  ADILLKRQFEESGRKRMELADRILQPDPEDGFKMLVKHRQPVTAVVLSKDSDKGFSASKD 186

Query: 825  GTILHWDVDSGKGEKYQWPSDEVLRLHGVKDPQGRATMHSKVILSLAVSSDGRYLASGGL 884
            G I+HWDV++GK EKY WPS+ VL  H  K P   +   SK +L+LAVS+DGRYLASGGL
Sbjct: 187  GVIVHWDVETGKSEKYLWPSENVLVSHHAKPP--LSAKRSKQVLALAVSADGRYLASGGL 246

Query: 885  DRHVHIWDTRIREHIQAFPGHRGPVSCLTFRQGTSELFSGSYDRTVKIWNVEDRAYINTL 944
            DRH+H+WD R REHIQAF GHRG +SCL+F   +SELFSGS+DR +  WN EDR Y+N L
Sbjct: 247  DRHIHLWDVRSREHIQAFSGHRGAISCLSFGPDSSELFSGSFDRKIMQWNAEDRTYMNCL 306

Query: 945  FGHQSEVLTIDCLRKERLLTVGRDRSMQLWKVPEESRLVFRAPA-SSLECCCFISNDEFL 1004
            FGHQ+EVLT+D L K+RLLTV RDR+M LWK+PEES+L+FRAPA +SLECCCFI + EFL
Sbjct: 307  FGHQNEVLTMDALSKDRLLTVARDRTMHLWKIPEESQLLFRAPATASLECCCFIDDKEFL 366

Query: 1005 SGSDDGSIELWSLLKKKPVSIVRNAHPLSFSCTNLELKENGAIPIGCMGNGDVNSNTSHS 1064
            +GSDDGS+ELWS+++KKP  I+RNAHP+  +  NL   EN     G      V+      
Sbjct: 367  TGSDDGSVELWSIMRKKPTHIIRNAHPVFRN--NLNSLENNVEENGIHKPESVS------ 426

Query: 1065 LSAYSWVSSVSVCRNSDLAASGAGNGSVRLWALTSDKKDIRPLYDFPLVGFVNSLTFAKS 1124
             SA SWVS+++  R SDLAASGA NGSVRLWA+  D K IRPL+   L GFVNSL   KS
Sbjct: 427  -SAQSWVSAIAARRGSDLAASGAANGSVRLWAIEPDSKGIRPLFSLRLDGFVNSLAIPKS 483

Query: 1125 GRFVVAGVGQRAK 1130
            GRF+VAGVGQ  +
Sbjct: 487  GRFIVAGVGQEPR 483

BLAST of Sgr019399 vs. ExPASy TrEMBL
Match: A0A6J1CE78 (30-kDa cleavage and polyadenylation specificity factor 30 OS=Momordica charantia OX=3673 GN=LOC111010855 PE=4 SV=1)

HSP 1 Score: 1084.7 bits (2804), Expect = 0.0e+00
Identity = 551/585 (94.19%), Postives = 556/585 (95.04%), Query Frame = 0

Query: 9   KLCNCRYKFGFCPNGPDCRYRHAKLPAPPPPVEEILQKIQHLSSYNYGPSNKFFSQRGVG 68
           K CN  YKFGFCPNGPDCRYRHAKLP PPPPVEEILQKIQHLSSYNYGPSNKFFSQRG G
Sbjct: 131 KECN-MYKFGFCPNGPDCRYRHAKLPGPPPPVEEILQKIQHLSSYNYGPSNKFFSQRGAG 190

Query: 69  LSQQNEKSQFPQGPATVNQGVIGKPSTAESGNVQQQQAQQSAQQPSQTQIQSLSNGQPNQ 128
           LS Q EK QFPQGPATVNQGV+GKPSTAES NVQQ QAQQSAQQ SQTQIQSLSNGQPNQ
Sbjct: 191 LSHQIEKPQFPQGPATVNQGVVGKPSTAESANVQQPQAQQSAQQTSQTQIQSLSNGQPNQ 250

Query: 129 LNRTATSLPQGISRCVSRDLKYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSAD 188
           LNRTATSLPQGISR       YFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSAD
Sbjct: 251 LNRTATSLPQGISR-------YFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSAD 310

Query: 189 NVILIFSVNRTRHFQGCAKMMSRIGGSVSGGNWKYAHGTAHYGQNFSLKWLKLCELSFQK 248
           NVILIFSVNRTRHFQGCAKMMSRIGGSVSGGNWKYAHGTAHYGQNFSLKWLKLCELSFQK
Sbjct: 311 NVILIFSVNRTRHFQGCAKMMSRIGGSVSGGNWKYAHGTAHYGQNFSLKWLKLCELSFQK 370

Query: 249 TRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDGELMAVSLAAESKREEEKAK 308
           TRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDGELMAVSLAAESKREEEKAK
Sbjct: 371 TRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDGELMAVSLAAESKREEEKAK 430

Query: 309 GVNPDIGSENPDIVPFEDNEEEEEEESEEEEEESFVQSVGLPAQGRGRGRGIMWPPHMPM 368
           GVNPDIGSENPDIVPFEDNEEEEEEESEEEEEESF QSVGLPAQGRGRGRGIMWPPHMPM
Sbjct: 431 GVNPDIGSENPDIVPFEDNEEEEEEESEEEEEESFGQSVGLPAQGRGRGRGIMWPPHMPM 490

Query: 369 GRGARPFHGMQGFPPGMMGPDGLSYGPVTPDGFPMPDIFGMAPRGFSPYGPRFSGDFMGP 428
           GRGARPFHGMQGFPPGMMGPDGLSYGPVTPDGFPMPDIFGMAPRGF PYGPRFSGDFMGP
Sbjct: 491 GRGARPFHGMQGFPPGMMGPDGLSYGPVTPDGFPMPDIFGMAPRGFGPYGPRFSGDFMGP 550

Query: 429 PSAMMFRGRPSQPGAMFPPAGFGMMMGQGRGPYMGGMGVTGTNPARAGRPVGVSPLYPPP 488
           PSAMMFRGRPSQPGAMFPP GFGMMMGQGRGP+MGGMGVTG NPAR GRPV VS LYPPP
Sbjct: 551 PSAMMFRGRPSQPGAMFPPGGFGMMMGQGRGPFMGGMGVTGANPARPGRPVSVSQLYPPP 610

Query: 489 AVPSSQNINRVVKRDQRGPANDRNDRYITGPDQSKGQEMPGSGHDDEMQYKQGSKAYPDE 548
           AVP SQN+NRVVKRDQRGP NDRNDRYI GPDQSKGQEMPGSGHDDEMQYKQGSKAYPDE
Sbjct: 611 AVPPSQNMNRVVKRDQRGPGNDRNDRYIAGPDQSKGQEMPGSGHDDEMQYKQGSKAYPDE 670

Query: 549 QYGMGTTFRNEESESEDEAPRRSRHGEGKKKRRGSEGDATAISDQ 594
           QYG+GTTFRNEESESEDEAPRRSRHGEGKKKRRGSEGDAT +SDQ
Sbjct: 671 QYGIGTTFRNEESESEDEAPRRSRHGEGKKKRRGSEGDATVVSDQ 707

BLAST of Sgr019399 vs. ExPASy TrEMBL
Match: A0A5D3C2W1 (30-kDa cleavage and polyadenylation specificity factor 30 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold178G00560 PE=4 SV=1)

HSP 1 Score: 1065.4 bits (2754), Expect = 2.0e-307
Identity = 542/585 (92.65%), Postives = 552/585 (94.36%), Query Frame = 0

Query: 9   KLCNCRYKFGFCPNGPDCRYRHAKLPAPPPPVEEILQKIQHLSSYNYGPSNKFFSQRGVG 68
           K CN  YKFGFCPNGPDCRYRHAKLP PPP VEEILQKIQHL SYNYG SNKFFSQRGVG
Sbjct: 134 KECN-MYKFGFCPNGPDCRYRHAKLPGPPPSVEEILQKIQHLGSYNYGSSNKFFSQRGVG 193

Query: 69  LSQQNEKSQFPQGPATVNQGVIGKPSTAESGNVQQQQAQQSAQQPSQTQIQSLSNGQPNQ 128
           L QQNEKSQFPQGPA V QGVIGKPSTAES NVQQQQ QQ AQQ SQTQIQS+SNGQPNQ
Sbjct: 194 LPQQNEKSQFPQGPAPVTQGVIGKPSTAESANVQQQQVQQPAQQTSQTQIQSVSNGQPNQ 253

Query: 129 LNRTATSLPQGISRCVSRDLKYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSAD 188
           LNRTATSLPQGISR       YFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSAD
Sbjct: 254 LNRTATSLPQGISR-------YFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSAD 313

Query: 189 NVILIFSVNRTRHFQGCAKMMSRIGGSVSGGNWKYAHGTAHYGQNFSLKWLKLCELSFQK 248
           NVILIFSVNRTRHFQGCAKMMSRIGGSVSGGNWKYAHGTAHYGQNFSLKWLKLCELSFQK
Sbjct: 314 NVILIFSVNRTRHFQGCAKMMSRIGGSVSGGNWKYAHGTAHYGQNFSLKWLKLCELSFQK 373

Query: 249 TRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDGELMAVSLAAESKREEEKAK 308
           TRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDGELMAVS+AAESKREEEKAK
Sbjct: 374 TRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDGELMAVSIAAESKREEEKAK 433

Query: 309 GVNPDIGSENPDIVPFEDNEEEEEEESEEEEEESFVQSVGLPAQGRGRGRGIMWPPHMPM 368
           GVNPDIG+ENPDIVPFEDNEEEEEEESEEEEEESF QSVGLPAQGRGRGRGIMWPPHMPM
Sbjct: 434 GVNPDIGNENPDIVPFEDNEEEEEEESEEEEEESFGQSVGLPAQGRGRGRGIMWPPHMPM 493

Query: 369 GRGARPFHGMQGFPPGMMGPDGLSYGPVTPDGFPMPDIFGMAPRGFSPYGPRFSGDFMGP 428
           GRGARPFHGMQ FPPGMMGPDGLSYGPVTPDGFPMPDIFGMAPRGF PYGPRFSGDFMGP
Sbjct: 494 GRGARPFHGMQSFPPGMMGPDGLSYGPVTPDGFPMPDIFGMAPRGFGPYGPRFSGDFMGP 553

Query: 429 PSAMMFRGRPSQPGAMFPPAGFGMMMGQGRGPYMGGMGVTGTNPARAGRPVGVSPLYPPP 488
           PSAMMFRGRPSQPGAMF P GFGMMMGQGRGP+MGGMGVTGT+PAR GRPVGVSPLYPPP
Sbjct: 554 PSAMMFRGRPSQPGAMFTPGGFGMMMGQGRGPFMGGMGVTGTSPARPGRPVGVSPLYPPP 613

Query: 489 AVPSSQNINRVVKRDQRGPANDRNDRYITGPDQSKGQEMPGSGHDDEMQYKQGSKAYPDE 548
           AVPS+QNINR +KRDQRGP +DRNDRYI GPDQ+KGQEM  SGHD+ MQYKQGSKAYPDE
Sbjct: 614 AVPSAQNINRAIKRDQRGPTSDRNDRYIVGPDQNKGQEMLSSGHDEGMQYKQGSKAYPDE 673

Query: 549 QYGMGTTFRNEESESEDEAPRRSRHGEGKKKRRGSEGDATAISDQ 594
           QYGMGTTFRNEESESEDEAPRRSRHGEGKKKRRGSEGDATAISDQ
Sbjct: 674 QYGMGTTFRNEESESEDEAPRRSRHGEGKKKRRGSEGDATAISDQ 710

BLAST of Sgr019399 vs. ExPASy TrEMBL
Match: A0A1S3BC28 (30-kDa cleavage and polyadenylation specificity factor 30 OS=Cucumis melo OX=3656 GN=LOC103488286 PE=4 SV=1)

HSP 1 Score: 1065.4 bits (2754), Expect = 2.0e-307
Identity = 542/585 (92.65%), Postives = 552/585 (94.36%), Query Frame = 0

Query: 9   KLCNCRYKFGFCPNGPDCRYRHAKLPAPPPPVEEILQKIQHLSSYNYGPSNKFFSQRGVG 68
           K CN  YKFGFCPNGPDCRYRHAKLP PPP VEEILQKIQHL SYNYG SNKFFSQRGVG
Sbjct: 134 KECN-MYKFGFCPNGPDCRYRHAKLPGPPPSVEEILQKIQHLGSYNYGSSNKFFSQRGVG 193

Query: 69  LSQQNEKSQFPQGPATVNQGVIGKPSTAESGNVQQQQAQQSAQQPSQTQIQSLSNGQPNQ 128
           L QQNEKSQFPQGPA V QGVIGKPSTAES NVQQQQ QQ AQQ SQTQIQS+SNGQPNQ
Sbjct: 194 LPQQNEKSQFPQGPAPVTQGVIGKPSTAESANVQQQQVQQPAQQTSQTQIQSVSNGQPNQ 253

Query: 129 LNRTATSLPQGISRCVSRDLKYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSAD 188
           LNRTATSLPQGISR       YFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSAD
Sbjct: 254 LNRTATSLPQGISR-------YFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSAD 313

Query: 189 NVILIFSVNRTRHFQGCAKMMSRIGGSVSGGNWKYAHGTAHYGQNFSLKWLKLCELSFQK 248
           NVILIFSVNRTRHFQGCAKMMSRIGGSVSGGNWKYAHGTAHYGQNFSLKWLKLCELSFQK
Sbjct: 314 NVILIFSVNRTRHFQGCAKMMSRIGGSVSGGNWKYAHGTAHYGQNFSLKWLKLCELSFQK 373

Query: 249 TRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDGELMAVSLAAESKREEEKAK 308
           TRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDGELMAVS+AAESKREEEKAK
Sbjct: 374 TRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDGELMAVSIAAESKREEEKAK 433

Query: 309 GVNPDIGSENPDIVPFEDNEEEEEEESEEEEEESFVQSVGLPAQGRGRGRGIMWPPHMPM 368
           GVNPDIG+ENPDIVPFEDNEEEEEEESEEEEEESF QSVGLPAQGRGRGRGIMWPPHMPM
Sbjct: 434 GVNPDIGNENPDIVPFEDNEEEEEEESEEEEEESFGQSVGLPAQGRGRGRGIMWPPHMPM 493

Query: 369 GRGARPFHGMQGFPPGMMGPDGLSYGPVTPDGFPMPDIFGMAPRGFSPYGPRFSGDFMGP 428
           GRGARPFHGMQ FPPGMMGPDGLSYGPVTPDGFPMPDIFGMAPRGF PYGPRFSGDFMGP
Sbjct: 494 GRGARPFHGMQSFPPGMMGPDGLSYGPVTPDGFPMPDIFGMAPRGFGPYGPRFSGDFMGP 553

Query: 429 PSAMMFRGRPSQPGAMFPPAGFGMMMGQGRGPYMGGMGVTGTNPARAGRPVGVSPLYPPP 488
           PSAMMFRGRPSQPGAMF P GFGMMMGQGRGP+MGGMGVTGT+PAR GRPVGVSPLYPPP
Sbjct: 554 PSAMMFRGRPSQPGAMFTPGGFGMMMGQGRGPFMGGMGVTGTSPARPGRPVGVSPLYPPP 613

Query: 489 AVPSSQNINRVVKRDQRGPANDRNDRYITGPDQSKGQEMPGSGHDDEMQYKQGSKAYPDE 548
           AVPS+QNINR +KRDQRGP +DRNDRYI GPDQ+KGQEM  SGHD+ MQYKQGSKAYPDE
Sbjct: 614 AVPSAQNINRAIKRDQRGPTSDRNDRYIVGPDQNKGQEMLSSGHDEGMQYKQGSKAYPDE 673

Query: 549 QYGMGTTFRNEESESEDEAPRRSRHGEGKKKRRGSEGDATAISDQ 594
           QYGMGTTFRNEESESEDEAPRRSRHGEGKKKRRGSEGDATAISDQ
Sbjct: 674 QYGMGTTFRNEESESEDEAPRRSRHGEGKKKRRGSEGDATAISDQ 710

BLAST of Sgr019399 vs. ExPASy TrEMBL
Match: A0A6J1JWQ9 (30-kDa cleavage and polyadenylation specificity factor 30-like OS=Cucurbita maxima OX=3661 GN=LOC111490386 PE=4 SV=1)

HSP 1 Score: 1036.6 bits (2679), Expect = 1.0e-298
Identity = 528/585 (90.26%), Postives = 544/585 (92.99%), Query Frame = 0

Query: 9   KLCNCRYKFGFCPNGPDCRYRHAKLPAPPPPVEEILQKIQHLSSYNYGPSNKFFSQRGVG 68
           K CN  YKFGFCPNGPDCRYRHAKLP PPPPVEEILQKIQHL  YNYG SNKFFSQRGVG
Sbjct: 131 KECN-MYKFGFCPNGPDCRYRHAKLPGPPPPVEEILQKIQHLGPYNYGTSNKFFSQRGVG 190

Query: 69  LSQQNEKSQFPQGPATVNQGVIGKPSTAESGNVQQQQAQQSAQQPSQTQIQSLSNGQPNQ 128
           LSQQNEKSQFP GP T+ QGVIGKPS AES NVQQQQAQQSAQQ SQ  IQS SNGQ NQ
Sbjct: 191 LSQQNEKSQFPPGPVTITQGVIGKPSVAESANVQQQQAQQSAQQTSQAPIQSASNGQVNQ 250

Query: 129 LNRTATSLPQGISRCVSRDLKYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSAD 188
           LNR +TSLP GISR       YFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSAD
Sbjct: 251 LNRNSTSLPPGISR-------YFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSAD 310

Query: 189 NVILIFSVNRTRHFQGCAKMMSRIGGSVSGGNWKYAHGTAHYGQNFSLKWLKLCELSFQK 248
           NVILIFSVNRTRHFQGCAKMM+RIGGSVSGGNWKYAHGTAHYGQNFSLKWLKLCELSFQK
Sbjct: 311 NVILIFSVNRTRHFQGCAKMMTRIGGSVSGGNWKYAHGTAHYGQNFSLKWLKLCELSFQK 370

Query: 249 TRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDGELMAVSLAAESKREEEKAK 308
           TRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDG+LMAVS+AAESKREEEKAK
Sbjct: 371 TRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDGDLMAVSIAAESKREEEKAK 430

Query: 309 GVNPDIGSENPDIVPFEDNEEEEEEESEEEEEESFVQSVGLPAQGRGRGRGIMWPPHMPM 368
           GVNPDI +ENPDIVPFEDNEEEEEEESEEEEEESF QSVG PAQGRGRGRGIMWPPHMPM
Sbjct: 431 GVNPDIRNENPDIVPFEDNEEEEEEESEEEEEESFGQSVGPPAQGRGRGRGIMWPPHMPM 490

Query: 369 GRGARPFHGMQGFPPGMMGPDGLSYGPVTPDGFPMPDIFGMAPRGFSPYGPRFSGDFMGP 428
           GRGARPFHGMQGFPPGMMGPDG+SYGPVTPDGFPMPDIFGMAPRGF+PYG RFSG+FM P
Sbjct: 491 GRGARPFHGMQGFPPGMMGPDGMSYGPVTPDGFPMPDIFGMAPRGFNPYGARFSGEFMNP 550

Query: 429 PSAMMFRGRPSQPGAMFPPAGFGMMMGQGRGPYMGGMGVTGTNPARAGRPVGVSPLYPPP 488
            SAMMFRGRPSQPGAMFPP GFGMMMGQGRGP+MGGMGVTGTNPARAGRPVG  PLYPPP
Sbjct: 551 QSAMMFRGRPSQPGAMFPPGGFGMMMGQGRGPFMGGMGVTGTNPARAGRPVG-PPLYPPP 610

Query: 489 AVPSSQNINRVVKRDQRGPANDRNDRYITGPDQSKGQEMPGSGHDDEMQYKQGSKAYPDE 548
           AVPSSQN+NR +KRDQRGP +DRNDRYI G DQ++GQE+PGSGHDDEMQYKQGSKAYPDE
Sbjct: 611 AVPSSQNMNRAMKRDQRGPGSDRNDRYIAGSDQNRGQEIPGSGHDDEMQYKQGSKAYPDE 670

Query: 549 QYGMGTTFRNEESESEDEAPRRSRHGEGKKKRRGSEGDATAISDQ 594
           QYGMGT  RNEESESEDEAPRRSRHGEGKKKRRGSEGDATAISDQ
Sbjct: 671 QYGMGTAIRNEESESEDEAPRRSRHGEGKKKRRGSEGDATAISDQ 706

BLAST of Sgr019399 vs. ExPASy TrEMBL
Match: A0A6J1GSL5 (30-kDa cleavage and polyadenylation specificity factor 30-like OS=Cucurbita moschata OX=3662 GN=LOC111457116 PE=4 SV=1)

HSP 1 Score: 1036.2 bits (2678), Expect = 1.3e-298
Identity = 528/585 (90.26%), Postives = 545/585 (93.16%), Query Frame = 0

Query: 9   KLCNCRYKFGFCPNGPDCRYRHAKLPAPPPPVEEILQKIQHLSSYNYGPSNKFFSQRGVG 68
           K CN  YKFGFCPNGPDCRYRHAKLP PPPPVEEILQKIQHL  YNYG SNKFFSQRGVG
Sbjct: 131 KECN-MYKFGFCPNGPDCRYRHAKLPGPPPPVEEILQKIQHLGPYNYGTSNKFFSQRGVG 190

Query: 69  LSQQNEKSQFPQGPATVNQGVIGKPSTAESGNVQQQQAQQSAQQPSQTQIQSLSNGQPNQ 128
           LSQQNEKSQFP GPATV QGVIGKPS AES NVQQQQAQQSAQQ SQ  IQ+ SNGQ NQ
Sbjct: 191 LSQQNEKSQFPPGPATVTQGVIGKPSVAESANVQQQQAQQSAQQTSQAPIQTASNGQVNQ 250

Query: 129 LNRTATSLPQGISRCVSRDLKYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSAD 188
           LNR +TSLP GISR       YFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSAD
Sbjct: 251 LNRNSTSLPPGISR-------YFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSAD 310

Query: 189 NVILIFSVNRTRHFQGCAKMMSRIGGSVSGGNWKYAHGTAHYGQNFSLKWLKLCELSFQK 248
           NVILIFSVNRTRHFQGCAKMM+RIGGSVSGGNWKYAHGTAHYGQNFSLKWLKLCELSFQK
Sbjct: 311 NVILIFSVNRTRHFQGCAKMMTRIGGSVSGGNWKYAHGTAHYGQNFSLKWLKLCELSFQK 370

Query: 249 TRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDGELMAVSLAAESKREEEKAK 308
           TRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDG+LMAVS+AAESKREEEKAK
Sbjct: 371 TRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDGDLMAVSIAAESKREEEKAK 430

Query: 309 GVNPDIGSENPDIVPFEDNEEEEEEESEEEEEESFVQSVGLPAQGRGRGRGIMWPPHMPM 368
           GVNPDI +ENPDIVPFEDNEEEEEEESEEEEEESF QSVG PAQGRGRGRG+MWPPHMPM
Sbjct: 431 GVNPDIRNENPDIVPFEDNEEEEEEESEEEEEESFGQSVGPPAQGRGRGRGMMWPPHMPM 490

Query: 369 GRGARPFHGMQGFPPGMMGPDGLSYGPVTPDGFPMPDIFGMAPRGFSPYGPRFSGDFMGP 428
           GRGARPFHGMQGFPPGMMGPDG+SYGPVTPDGFPMPDIFGMAPRGF+PYG RFSG+FM P
Sbjct: 491 GRGARPFHGMQGFPPGMMGPDGMSYGPVTPDGFPMPDIFGMAPRGFNPYGARFSGEFMNP 550

Query: 429 PSAMMFRGRPSQPGAMFPPAGFGMMMGQGRGPYMGGMGVTGTNPARAGRPVGVSPLYPPP 488
            SAMMFRGRPSQPGAMFPP GFGMMMGQGRGP+MGGMGVTGTNPARAGRPVG  PLYPPP
Sbjct: 551 QSAMMFRGRPSQPGAMFPPGGFGMMMGQGRGPFMGGMGVTGTNPARAGRPVG-PPLYPPP 610

Query: 489 AVPSSQNINRVVKRDQRGPANDRNDRYITGPDQSKGQEMPGSGHDDEMQYKQGSKAYPDE 548
           AVPSSQN+NR +KRDQRGP +DRNDRYI G DQ++GQE+PGSGHDDEMQYKQGSKAYPDE
Sbjct: 611 AVPSSQNMNRAMKRDQRGPGSDRNDRYIAGSDQNRGQEIPGSGHDDEMQYKQGSKAYPDE 670

Query: 549 QYGMGTTFRNEESESEDEAPRRSRHGEGKKKRRGSEGDATAISDQ 594
           QYGMGT  RNEESESEDEAPRRSRHGEGKKKRRGSEGDATAISDQ
Sbjct: 671 QYGMGTAIRNEESESEDEAPRRSRHGEGKKKRRGSEGDATAISDQ 706

BLAST of Sgr019399 vs. TAIR 10
Match: AT1G30460.1 (cleavage and polyadenylation specificity factor 30 )

HSP 1 Score: 604.4 bits (1557), Expect = 2.5e-172
Identity = 356/575 (61.91%), Postives = 407/575 (70.78%), Query Frame = 0

Query: 9   KLCNCRYKFGFCPNGPDCRYRHAKLPAPPPPVEEILQKIQHLSSYNYGPSNKFFSQRGVG 68
           K CN  YK GFCPNGPDCRYRHAKLP PPPPVEE+LQKIQ L++YNYG +N+ +  R V 
Sbjct: 118 KECN-MYKLGFCPNGPDCRYRHAKLPGPPPPVEEVLQKIQQLTTYNYG-TNRLYQARNVA 177

Query: 69  LSQQNEKSQFPQGPATVNQGVIGKPSTAESGNVQQQQAQQSAQQPSQTQIQSLSNGQPNQ 128
              Q+     PQG   +     G+P   ESGN+QQQQ QQ  Q   Q   Q+L     +Q
Sbjct: 178 PQLQDR----PQGQVPMQ----GQPQ--ESGNLQQQQQQQPQQSQHQVS-QTLIPNPADQ 237

Query: 129 LNRTATSLPQGISRCVSRDLKYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSAD 188
            NRT+  LPQG++R       YF+VKS NREN ELSVQQGVWATQRSNEAKLNEAFDS +
Sbjct: 238 TNRTSHPLPQGVNR-------YFVVKSNNRENFELSVQQGVWATQRSNEAKLNEAFDSVE 297

Query: 189 NVILIFSVNRTRHFQGCAKMMSRIGGSVSGGNWKYAHGTAHYGQNFSLKWLKLCELSFQK 248
           NVILIFSVNRTRHFQGCAKM SRIGG + GGNWK+ HGTA YG+NFS+KWLKLCELSF K
Sbjct: 298 NVILIFSVNRTRHFQGCAKMTSRIGGYIGGGNWKHEHGTAQYGRNFSVKWLKLCELSFHK 357

Query: 249 TRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDGELMAVSLAAESKREEEKAK 308
           TR+LRNPYNENLPVKISRDCQELEPS+GEQLASLLYLEPD ELMA+S+AAE+KREEEKAK
Sbjct: 358 TRNLRNPYNENLPVKISRDCQELEPSVGEQLASLLYLEPDSELMAISIAAEAKREEEKAK 417

Query: 309 GVNPDIGSENPDIVPFEDNEEEEEEESEEEEEESFVQSVGLPAQGRGRGRGIMWPPHMPM 368
           GVNP+  +ENPDIVPFEDNEEEEEEE E EEEE   +S+    QGRGRGRGIMWPP MP+
Sbjct: 418 GVNPESRAENPDIVPFEDNEEEEEEEDESEEEE---ESMAGGPQGRGRGRGIMWPPQMPL 477

Query: 369 GRGARPFHGMQGFPPGMMGP-DGLSYGPVTPDGFPMPDIFGMAPRGFSPYGPRFSGDFMG 428
           GRG RP  GM GFP G+MGP D   YGP   +G  MPD FGM PR F PYGPRF GDF G
Sbjct: 478 GRGIRPMPGMGGFPLGVMGPGDAFPYGPGGYNG--MPDPFGMGPRPFGPYGPRFGGDFRG 537

Query: 429 PPSAMMFRGRPSQPGAMFPPAGFGMMMGQGRGPYMGGMGVTGTNPARAGRPVGVSPLYPP 488
           P   MMF GRP Q    FP  G+G MMG GRGP+MGGMG    N  R GR     P+Y P
Sbjct: 538 PVPGMMFPGRPPQ---QFPHGGYG-MMGGGRGPHMGGMG----NAPRGGR-----PMYYP 597

Query: 489 PAVPSSQNINRVVKRDQRGPANDRNDRYITGPDQSKGQEMPGSGHDDEMQYKQGSKAYPD 548
           PA  S+          + GP+N +       P++S  + + G   + +  +         
Sbjct: 598 PATSSA----------RPGPSNRKT------PERSDERGVSGDQQNQDASHDM------- 631

Query: 549 EQYGMGTTFRNEESES--EDEAPRRSRHGEGKKKR 581
           EQ+ +G + RNEESES  EDEAPRRSRHGEGKK+R
Sbjct: 658 EQFEVGNSLRNEESESEDEDEAPRRSRHGEGKKRR 631

BLAST of Sgr019399 vs. TAIR 10
Match: AT4G05410.1 (Transducin/WD40 repeat-like superfamily protein )

HSP 1 Score: 518.8 bits (1335), Expect = 1.4e-146
Identity = 277/493 (56.19%), Postives = 360/493 (73.02%), Query Frame = 0

Query: 649  KKKSSAPKGATKGKNFPLDKDPFFSSESRKRRKTVDENDEIESGESD-EDTGFMGSAAER 708
            KK  S  +G  KG N   ++DPFF  E +KRRK   ++D+IES +SD E+ GF G   + 
Sbjct: 8    KKGGSFKRGGKKGSN---ERDPFFEEEPKKRRKVSYDDDDIESVDSDAEENGFTGGDEDG 67

Query: 709  GKYE-EAEDE-QFEEETADEKRNRVANEYVDKIWEIARREKER------KDEEKDSLVAQ 768
             + + E EDE +F +ETA EKR R+A E +++  E  RRE+E        DE+ D  + +
Sbjct: 68   RRVDGEVEDEDEFADETAGEKRKRLAEEMLNRRREAMRREREEADNDDDDDEDDDETIKK 127

Query: 769  ILQQEQLEDSGRVRREIASRVQKPEARDEFQVLIKHRQTVTAVALSDDDSKGFSTSKDGT 828
             L Q+Q EDSGR+RR IASRVQ+P + D F V++KHR++V +VALSDDDS+GFS SKDGT
Sbjct: 128  SLMQKQQEDSGRIRRLIASRVQEPLSTDGFSVIVKHRRSVVSVALSDDDSRGFSASKDGT 187

Query: 829  ILHWDVDSGKGEKYQWPSDEVLRLHGVKDPQGRATMHSKVILSLAVSSDGRYLASGGLDR 888
            I+HWDV SGK +KY WPSDE+L+ HG+K  + R   HS+  L+LAVSSDGRYLA+GG+DR
Sbjct: 188  IMHWDVSSGKTDKYIWPSDEILKSHGMKLREPRNKNHSRESLALAVSSDGRYLATGGVDR 247

Query: 889  HVHIWDTRIREHIQAFPGHRGPVSCLTFRQGTSELFSGSYDRTVKIWNVEDRAYINTLFG 948
            HVHIWD R REH+QAFPGHR  VSCL FR GTSEL+SGS+DRTVK+WNVED+A+I    G
Sbjct: 248  HVHIWDVRTREHVQAFPGHRNTVSCLCFRYGTSELYSGSFDRTVKVWNVEDKAFITENHG 307

Query: 949  HQSEVLTIDCLRKERLLTVGRDRSMQLWKVPEESRLVFRAPASSLECCCFISNDEFLSGS 1008
            HQ E+L ID LRKER LTVGRDR+M   KVPE +R+++RAPASSLE CCFIS++E+LSGS
Sbjct: 308  HQGEILAIDALRKERALTVGRDRTMLYHKVPESTRMIYRAPASSLESCCFISDNEYLSGS 367

Query: 1009 DDGSIELWSLLKKKPVSIVRNAHPLSFSCTNLELKENGAIPIGCMGNGDVNSNTSHSLSA 1068
            D+G++ LW +LKKKPV + +NAH         +   +G    G + NGD +   +++ SA
Sbjct: 368  DNGTVALWGMLKKKPVFVFKNAH---------QDIPDGITTNGILENGD-HEPVNNNCSA 427

Query: 1069 YSWVSSVSVCRNSDLAASGAGNGSVRLWALTSDKKDIRPLYDFPLVGFVNSLTFAKSGRF 1128
             SWV++V+  R SDLAASGAGNG VRLWA+ ++   IRPLY+ PL GFVNSL FAKSG+F
Sbjct: 428  NSWVNAVATSRGSDLAASGAGNGFVRLWAVETNA--IRPLYELPLTGFVNSLAFAKSGKF 485

Query: 1129 VVAGVGQRAKWAK 1133
            ++AGVGQ  ++ +
Sbjct: 488  LIAGVGQETRFGR 485

BLAST of Sgr019399 vs. TAIR 10
Match: AT4G21130.1 (Transducin/WD40 repeat-like superfamily protein )

HSP 1 Score: 506.1 bits (1302), Expect = 9.1e-143
Identity = 277/511 (54.21%), Postives = 354/511 (69.28%), Query Frame = 0

Query: 646  KITKKKSSAPKGATKGKNFPLDKDPFFSSESRKRRK-TVDENDEIESGESDEDTGFMGSA 705
            K+ KKK    K   +GK   +D DPF   E+ KRRK   D++D+IES ES+E+       
Sbjct: 2    KLEKKKGIGAK--RRGKKSSIDHDPFLEEETEKRRKFNYDDDDDIESVESEEE------- 61

Query: 706  AERGKYEEAEDEQFEEETADEKRNRVANEYVDKIWEIARREKERKDEE----KDSLVAQI 765
               GK  E  +++F  ET  EKR R+A + +++I E  +RE E  +EE    +DSLVA+ 
Sbjct: 62   ---GKVGEEVEDEFAHETVGEKRKRLAEDTLNRIEEAKQREHEEDNEEDDDFRDSLVAKT 121

Query: 766  LQQEQLEDSGRVRREIASRVQKPEARDEFQVLIKHRQTVTAVALSDDDSKGFSTSKDGTI 825
            L QEQLE SGRVRR  A RVQ  ++ D+F+V++KH+ +VT VALSDDDS+GFS SKDGTI
Sbjct: 122  LMQEQLEKSGRVRRANALRVQDLQSSDKFRVIVKHQHSVTGVALSDDDSRGFSVSKDGTI 181

Query: 826  LHWDVDSGKGEKYQWPSDEVLRLHGVKDPQGRATMHSKVILSLAVSSDGRYLASGGLDRH 885
            LHWDV SGK ++Y+WPSDEVL+ HG+K  +   T H+K  L+LAVSSDGRYLA+GG+D H
Sbjct: 182  LHWDVSSGKSDEYKWPSDEVLKSHGLKFQESWYTRHNKQSLALAVSSDGRYLATGGVDCH 241

Query: 886  VHIWDTRIREHIQAFPGHRGPVSCLTFRQGTSELFSGSYDRTVKIWNVEDRAYINTLFGH 945
            VH+WD R REH+QAF GH G VS L FR+GT+ELFSGSYD T+ IWN E R YI + FGH
Sbjct: 242  VHLWDIRTREHVQAFTGHCGIVSSLCFREGTAELFSGSYDGTLSIWNAEHRTYIESCFGH 301

Query: 946  QSEVLTIDCLRKERLLTVGRDRSMQLWKVPEESRLVFRAPASSLECCCFISNDEFLSGSD 1005
            QSE+L+ID L +ER+L+VGRDR+MQL+KVPE +RL++RA  S+ ECCCF+++DEFLSGSD
Sbjct: 302  QSELLSIDALGRERVLSVGRDRTMQLYKVPESTRLIYRASESNFECCCFVNSDEFLSGSD 361

Query: 1006 DGSIELWSLLKKKPVSIVRNAHPLSFSCTNLELKENGAIPIGCMGNGDVNSNTSHSLSAY 1065
            +GSI LWS+LKKKPV IV NAH +                       D +S   +   A 
Sbjct: 362  NGSIALWSILKKKPVFIVNNAHHVI---------------------ADHDSVNHNCTPAC 421

Query: 1066 SWVSSVSVCRNSDLAASGAGNGSVRLWALTSDKKDIRPLYDFPLVGFVNSLTFAKSGRFV 1125
            SWVSSV+VCR S+LAASGAGNG VRLW + S    I+PLY+ PL GFVNSL FAKSGRF+
Sbjct: 422  SWVSSVAVCRGSELAASGAGNGCVRLWGVESGSSAIQPLYELPLPGFVNSLAFAKSGRFL 479

Query: 1126 VAGVGQRAKWAKRTTSEAEEISPAYSDLRPS 1152
            +AGVGQ  +  +    ++ +   A   LR S
Sbjct: 482  IAGVGQEPRLGRWGCLKSAQNGVAIHPLRLS 479

BLAST of Sgr019399 vs. TAIR 10
Match: AT4G11970.1 (YTH family protein )

HSP 1 Score: 122.5 bits (306), Expect = 2.8e-27
Identity = 84/235 (35.74%), Postives = 124/235 (52.77%), Query Frame = 0

Query: 94  STAESGNVQQQQAQQSAQQPSQTQIQSLSNGQPNQLNRTATSLPQGISRCVSRD------ 153
           S  +S     +Q   ++  P  T  +S  + + ++++    + P  +    +        
Sbjct: 11  SVVDSSLTDWKQDLGNSDDPESTSYRSKEDHKLSKVDVDRRNFPDQLESAKANKNSKPGY 70

Query: 154 -LKYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSADNVILIFSVNRTRHFQGCA 213
             +YFI+KS N +N+++SV++G+WATQ  NE  L  AF  +  VILIFSVN +  FQG A
Sbjct: 71  RTRYFIIKSLNYDNIQVSVEKGIWATQVMNEPILEGAFHKSGRVILIFSVNMSGFFQGYA 130

Query: 214 KMMSRIGGSVSGGNWKYAHGTAH-YGQNFSLKWLKLCELSFQKTRHLRNPYNENLPVKIS 273
           +M+S +G       W    G  + +G++F +KWL+L EL FQKT HL+NP N+  PVKIS
Sbjct: 131 EMLSPVGWR-RDQIWSQGGGKNNPWGRSFKVKWLRLSELPFQKTLHLKNPLNDYKPVKIS 190

Query: 274 RDCQELEPSIGEQLASLLYLEPDGELMAVSLAAESKREEEKAKGVNPDIGSENPD 321
           RDCQEL   IGE L  LL    D       L   S R++   K    +  S + D
Sbjct: 191 RDCQELPEDIGEALCELL----DANSCDDGLLNSSSRDDYSTKRSRAEPPSSSGD 240

BLAST of Sgr019399 vs. TAIR 10
Match: AT4G11970.2 (YTH family protein )

HSP 1 Score: 122.5 bits (306), Expect = 2.8e-27
Identity = 84/235 (35.74%), Postives = 124/235 (52.77%), Query Frame = 0

Query: 94  STAESGNVQQQQAQQSAQQPSQTQIQSLSNGQPNQLNRTATSLPQGISRCVSRD------ 153
           S  +S     +Q   ++  P  T  +S  + + ++++    + P  +    +        
Sbjct: 11  SVVDSSLTDWKQDLGNSDDPESTSYRSKEDHKLSKVDVDRRNFPDQLESAKANKNSKPGY 70

Query: 154 -LKYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSADNVILIFSVNRTRHFQGCA 213
             +YFI+KS N +N+++SV++G+WATQ  NE  L  AF  +  VILIFSVN +  FQG A
Sbjct: 71  RTRYFIIKSLNYDNIQVSVEKGIWATQVMNEPILEGAFHKSGRVILIFSVNMSGFFQGYA 130

Query: 214 KMMSRIGGSVSGGNWKYAHGTAH-YGQNFSLKWLKLCELSFQKTRHLRNPYNENLPVKIS 273
           +M+S +G       W    G  + +G++F +KWL+L EL FQKT HL+NP N+  PVKIS
Sbjct: 131 EMLSPVGWR-RDQIWSQGGGKNNPWGRSFKVKWLRLSELPFQKTLHLKNPLNDYKPVKIS 190

Query: 274 RDCQELEPSIGEQLASLLYLEPDGELMAVSLAAESKREEEKAKGVNPDIGSENPD 321
           RDCQEL   IGE L  LL    D       L   S R++   K    +  S + D
Sbjct: 191 RDCQELPEDIGEALCELL----DANSCDDGLLNSSSRDDYSTKRSRAEPPSSSGD 240

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_022140120.10.0e+0094.1930-kDa cleavage and polyadenylation specificity factor 30 [Momordica charantia][more]
XP_008445183.14.2e-30792.65PREDICTED: 30-kDa cleavage and polyadenylation specificity factor 30 [Cucumis me... [more]
XP_038894441.18.7e-30591.9730-kDa cleavage and polyadenylation specificity factor 30-like [Benincasa hispid... [more]
XP_038889902.15.8e-30191.1330-kDa cleavage and polyadenylation specificity factor 30-like [Benincasa hispid... [more]
KAG6573397.17.1e-29990.4330-kDa cleavage and polyadenylation specificity factor 30, partial [Cucurbita ar... [more]
Match NameE-valueIdentityDescription
A9LNK93.5e-17161.9130-kDa cleavage and polyadenylation specificity factor 30 OS=Arabidopsis thalian... [more]
Q0DA503.3e-16658.70Zinc finger CCCH domain-containing protein 45 OS=Oryza sativa subsp. japonica OX... [more]
Q9M0V41.9e-14556.19U3 snoRNP-associated protein-like YAO OS=Arabidopsis thaliana OX=3702 GN=YAO PE=... [more]
Q3MKM61.3e-14154.21U3 snoRNP-associated protein-like EMB2271 OS=Arabidopsis thaliana OX=3702 GN=EMB... [more]
Q75LV51.8e-13553.75U3 snoRNP-associated protein-like YAOH OS=Oryza sativa subsp. japonica OX=39947 ... [more]
Match NameE-valueIdentityDescription
A0A6J1CE780.0e+0094.1930-kDa cleavage and polyadenylation specificity factor 30 OS=Momordica charantia... [more]
A0A5D3C2W12.0e-30792.6530-kDa cleavage and polyadenylation specificity factor 30 OS=Cucumis melo var. m... [more]
A0A1S3BC282.0e-30792.6530-kDa cleavage and polyadenylation specificity factor 30 OS=Cucumis melo OX=365... [more]
A0A6J1JWQ91.0e-29890.2630-kDa cleavage and polyadenylation specificity factor 30-like OS=Cucurbita maxi... [more]
A0A6J1GSL51.3e-29890.2630-kDa cleavage and polyadenylation specificity factor 30-like OS=Cucurbita mosc... [more]
Match NameE-valueIdentityDescription
AT1G30460.12.5e-17261.91cleavage and polyadenylation specificity factor 30 [more]
AT4G05410.11.4e-14656.19Transducin/WD40 repeat-like superfamily protein [more]
AT4G21130.19.1e-14354.21Transducin/WD40 repeat-like superfamily protein [more]
AT4G11970.12.8e-2735.74YTH family protein [more]
AT4G11970.22.8e-2735.74YTH family protein [more]
InterPro
Analysis Name: InterPro Annotations of Monk fruit (Qingpiguo) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableCOILSCoilCoilcoord: 321..341
NoneNo IPR availableGENE3D3.10.590.10ph1033 like domainscoord: 132..286
e-value: 4.0E-52
score: 177.9
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 473..610
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 671..687
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 322..343
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 558..586
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 300..356
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 527..541
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 646..727
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 64..134
NoneNo IPR availablePANTHERPTHR12357YTH YT521-B HOMOLOGY DOMAIN-CONTAININGcoord: 9..591
NoneNo IPR availablePANTHERPTHR12357:SF106ZINC FINGER CCCH DOMAIN-CONTAINING PROTEIN 45coord: 9..591
NoneNo IPR availablePROSITEPS50294WD_REPEATS_REGIONcoord: 895..929
score: 11.391954
NoneNo IPR availablePROSITEPS50294WD_REPEATS_REGIONcoord: 792..829
score: 8.992876
NoneNo IPR availablePROSITEPS50294WD_REPEATS_REGIONcoord: 853..887
score: 10.811958
NoneNo IPR availableCDDcd00200WD40coord: 856..1123
e-value: 2.04598E-42
score: 155.571
IPR020472G-protein beta WD-40 repeatPRINTSPR00320GPROTEINBRPTcoord: 914..928
score: 41.55
coord: 995..1009
score: 36.87
coord: 872..886
score: 35.06
IPR001680WD40 repeatSMARTSM00320WD40_4coord: 888..927
e-value: 2.4E-7
score: 40.4
coord: 1050..1088
e-value: 0.11
score: 21.7
coord: 846..885
e-value: 1.6E-6
score: 37.6
coord: 930..968
e-value: 0.059
score: 22.5
coord: 785..824
e-value: 0.0043
score: 26.3
coord: 970..1008
e-value: 0.0094
score: 25.1
IPR001680WD40 repeatPFAMPF00400WD40coord: 1053..1087
e-value: 0.065
score: 14.1
coord: 891..927
e-value: 2.2E-5
score: 25.1
coord: 977..1008
e-value: 0.018
score: 15.9
coord: 856..885
e-value: 2.7E-5
score: 24.8
IPR001680WD40 repeatPROSITEPS50082WD_REPEATS_2coord: 853..894
score: 13.549507
IPR001680WD40 repeatPROSITEPS50082WD_REPEATS_2coord: 895..936
score: 14.719144
IPR001680WD40 repeatPROSITEPS50082WD_REPEATS_2coord: 792..833
score: 11.043142
IPR007275YTH domainPFAMPF04146YTHcoord: 149..282
e-value: 6.0E-44
score: 150.4
IPR007275YTH domainPROSITEPS50882YTHcoord: 148..283
score: 58.309341
IPR015943WD40/YVTN repeat-like-containing domain superfamilyGENE3D2.130.10.10coord: 784..1144
e-value: 1.6E-99
score: 334.8
IPR019775WD40 repeat, conserved sitePROSITEPS00678WD_REPEATS_1coord: 872..886
IPR019775WD40 repeat, conserved sitePROSITEPS00678WD_REPEATS_1coord: 914..928
IPR000571Zinc finger, CCCH-typePROSITEPS50103ZF_C3H1coord: 6..33
score: 11.797969
IPR036322WD40-repeat-containing domain superfamilySUPERFAMILY50978WD40 repeat-likecoord: 789..1123

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Sgr019399.1Sgr019399.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0000381 regulation of alternative mRNA splicing, via spliceosome
biological_process GO:0006364 rRNA processing
cellular_component GO:0005654 nucleoplasm
molecular_function GO:0003677 DNA binding
molecular_function GO:0046872 metal ion binding
molecular_function GO:0003729 mRNA binding
molecular_function GO:1990247 N6-methyladenosine-containing RNA binding
molecular_function GO:0005515 protein binding
molecular_function GO:0034511 U3 snoRNA binding
molecular_function GO:0003723 RNA binding