MS006573 (gene) Bitter gourd (TR) v1

Overview
Name: MS006573
Type: gene
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: Thaumatin
Location: scaffold404: 1437327 .. 1458599 (+)
RNA-Seq Expression: MS006573
Synteny: MS006573
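
The genomic span implied by the Location field can be worked out directly from its coordinates. The sketch below is illustrative only: the helper names `parse_location` and `span_length` are hypothetical, and it assumes the coordinates are 1-based and inclusive (the usual GFF-style convention), which this record does not state explicitly.

```python
import re


def parse_location(text):
    """Split a 'seqid: start .. end (strand)' string into its parts.

    The pattern mirrors the Location field as printed in this record;
    other pages may format the field differently.
    """
    match = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand


def span_length(start, end):
    """Length of a closed interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end, strand = parse_location("scaffold404: 1437327 .. 1458599 (+)")
print(seqid, strand, span_length(start, end))  # scaffold404 + 21273
```

Under that assumption the gene spans 21,273 bp on the forward strand, far longer than the coding sequence listed below, so most of the locus would be intronic.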
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

TGTGTTTGTAAAACAGTAGTCCATGCAGCCACATTTGTTGTCAAAAACAATTGTCCACAGACCATATGGCCCGCAACACTAACCAGCGGCAGCGGACAATCCCAACTCTCCTCCACTGGATTTCAGTTGCTTTCTGGAGCGTCGAGGACCGTAGATGTGCCTGCGCCGTGGACGGGTAGGATCTGGGCTCGAACACGCTGCTTCGTCGATAGCTCTTCAAGATTTTCGTGCGAAACTGGAGACTGTGCCTCCGGCTCCATCTCTTGCAATGGCGCAGGAGGAATTCCACCGGCCACACTGGCCGAGTTCACATTAGCCCCCAATGGAGGCATGGATTTCTACGACGTCAGCCTCGTCGACGGCTTCAACCTGCCGGCTTCCATAGCCACAGTGGGTGGAACAGGCGAATGCCAATCCACAGCTTGTTCTGCAAATGTAAACGGAGTTTGCCCAACGGAGTTGCAAGTCAGGTCAGGAGATGGGAGCGTGATTGGGTGTAAGAGCGCATGCCTTGCGTTTAATGAACCACAGTATTGTTGCACCGCGGAATTCAACGACCCCAGCAAGTGCGCTCACAGCCAGTACTCGTTGATCTTCAAGAACCAATGCCCTCAGGCTTACAGCTACGCTTATGACGACAAAACCAGCACCTTCACATGCAGCGGTTCGCCCAATTATGTTGTTACCTTCTGCCCGTAACCATATAACATAAAATCTCTTTTGCTTTCCTTTCATAATTATTCTATATTTTACTGCAGTGAATAAAATTGTACTATATGTACTCATCCGTGGAATAAATTTCAAGTTATAATATATGAATAATACAAGTTGGATGTAATAAAATGTCATTATTGAAGTCAAATAATAATGTGTTCCATGAAATTGTGTGTCTAATTTTCTTATTCCTTATTGATTAATGTTTTTAAGTCATTTTGTGATTGGTTATTTCAAAGCAACATCAAATCCTTCCTAACAATAAATTTAGCAATTCAGGATAGAAAATAAACCAAAAAACCGATGCTAAATTCCATAACAAAATTATAAAAACTTCAAGGAAAAGAAAAATAAATAAATCCTAACTGTCGTTTGCCTCATCTATCTTGTCATTTATAAATATGAAATTACTTTAAAAAAGATTTTTCCTTGGGTGGAAACTCCTTATATATATAAAAAAAAAAGAGGCACGGTAGCACCATGACCTATAACTAAGAACCATGATGTTATTCTCATGAGGCAAACGATCAAAAGTTCACATTTCCATCTGCCACTATGTTAACAGCATTGAGATGCTATAGCGCCATGACCCTAGGGATGTAGCTCTGGCCTTTCTAACTTGTGGCTGCGTACTACAAGCGCCATGGTGCTTCACACATTTTGGTCTTTGGCATGTCGGGTTTTATTTTCATTCTATACAAGCATCGAGGTGCTACCCTCCACGAGTTATATGCATGAATTATCTTTTTGCTTAGGTCTAAACTGCTAATGAAGGCCAAAGTTAATTCCTTTAAACATCTGAGATTCTTTCAAATGCTTGATTCCTACGAAATGACCAATAATTCACATAAAACCTAATTAGATAACACAAACACTTAAAGTTAAGCTATATCAAAAAGCTTTGCTGTTAAAAAATCAATAGAAATGCAAAATATATACAATTTATTGGTATAATACTAGACTATTAAGTTATAGAATATATTCATTTTATTACAAAGTATATGCAACATATACTCAACATATACACCTAAAAATGTTGATGTGATCTTGTGCCGAATCCTTGGCCGCGAAGACGAGGGTTGTTGAAAACTTGCCGAAGCGGAAATTATTGAAGAGGGGCAGCAACCATCAATTTCGAAGGAGATAGCTCACAGTCTCTAGTCCAGCTCCATTTTAAATCTATAGAAGAAATACCTTCATATTGTTGCGAAATTAATGACCTTGAATAATTTCCTCTTCCATAGGACTATCCAAACACACCACAATATGCAAATTAGGTAAATCAC
AAGAATACTACACAACAAAATTATTAAACAATCAACCAGGATTGATCTAATTCGAGTCAACAACATCAACCAACTCAGCAGAATTTGCCAACTCAGTATTCTCATGGCCTCATATCACTACCTCACTACCAACAATAATTGAAGGCAATACGTTTGCAATCTTTGCAATAACAGCAGCTCCTCAATTATTGGCATTCCCTGTCAATCAAATCCCCAACATGACATCAATTAACTTCCTTAATTGTCTTAGTAGAAAGATGTTATCTTTGTTTTACCATATTTTTTATTACCTCATTTTATGCATTTATGTTTGTTGCCATTGAGTTCTTTTCTTCCATGTATAGACCTAACTTTAGGTCTTCTGTAATAGTCTTTCGTGCCCTTTAAAATCATATTTTAGCCAACTATAAAGCTTAAGAAAATCAATAACGATTCTTACCTCAGTTTGCGTTTGATCCTTCCTAACCTTTGAGCTAAATTTTTTATGGTATCAGAGTAGGCACGGAAAGACTAAAGATCCATCCTTGTTCTTCCTACCTCTCATCTTACCTCAATTTCCATCCTACATTCTTCAACAATCTATCATTGTCATGGCCCAATCAGAAAACTAACTTACTGTGCAATCATTCCACCAATGTGCCAGTCTTATCTCCCTCAAACTAAACTCCTCAAATTACTTGCTATGGAAATCTCAGGTATTACCTTTGATAAGAACTTTGGGATTGGAACACCACCTAAAGGAGGAAGCTCCAGCAACTAATGCATCCAAAGCCAAGAAGGGATATGCAAAAACTAGTGTACAAGCAAATGCATGGACCAACAATAATGGCATCTTGACTTCATGGTTACTAGGAATCATTGGAGAAGATGTGCTGACTTTACTTGAAGGAACAGAAACCGAAAACAGGTATGGCAATCCCTTGAAGAGTTGTTGCTCACCATGACAAAAGAAAATGAAATACACCTAAATGAGGCACTTTTCACGTTGAAAAAAGGTTCATTATCTATAGATGAGTACATTAGAAAGTTCAAAAATTTGTGTGATATATTAGCAGCTATGAAAAAGCCATTGGATGACTTAACTAAAGTATTCCATTTGCTATGAAAAAGTCATTGGATGACTTAACTAAAGTATTCCATTTGGCAAGAGGACTAGGAATCAGATATAAGGATTTCGAACTGCCATGTATCTAAAACTCCTTATCCTTCATACAATCAATTTGTTCTTGCTTTGAAAGCACACGACCAACTTATCACTGTTGAAGAAGAAGAAGAAAGGTCCAGCCAAGTAAGTCTAAATCAAGCATTCTTCTCAAATCGAGGAAGAAGCAGAGGAAGGGGAAGACAATTTTCATCGAGGGGAAAAAGCTTTTATCATAGTAACAAGCTATCAGCAAGGAACTATCGTACAAAAGGTGAACAATAAGCATCAACATTCAAATGGAAGAGTAGGTTTCAAATAGGAAAAACAAGTGACAGGTCAGATATGTGGTAAGAATAATCACACCGCACTAGAATGTTGGAATAGATTTGACCATGCTTACCAATCTGAAGAAATACCTGAAGCTCTTGTTGCCTTAACTTTGAATGAAGTTGCTGATGAAAATATCTGTACCGACTTTGGGCAACCTTTCATGTGGTAAACAATTTAGGTAAAATTCAATTTCTACACCCTTATAATGGTGTTGATAAGATCTATGTAGGGAATGGAAATGGCCTACATATTACTCACACAGGTCACACTACTTTCAAAACTCCTAAAGGTCACCTTGCTTTAAATAATGTTTTAGTTTTCCCTGCAATAAAGAAAAATTTATTGCCTGTCCGCAAATTAACAAACGATAATAATTGTTCCATAACTTTTAATGCTAACAAGTTTGTTGTTAAGGATCTACAGGAGTTAGCTTTAGCTGAAGGATATGAAAAACAAGGATTGTACAAGCTGGGGCAGCTACAACCATGTGCCAAAACTACTTCTACAACCTTTACTACGGCTGCCA
CCTTAAATTAAACTTCTTCTGTTGTGTGGCATAAGAGATTAGGCCACTTGAATGATCATTTCTTGAAAAATTTTGCTAATAAACAACTGATCAATGTTACTTGTTGGGCACAGGAACAATTAGTATGTGTTAGCTACCAGTTAGGAAAAAGTTGAAAGTTACCTTTTAATTCCAATAAATCTCATGCTTTGAAGCCTCTAGATAAGATTCATTGTGATTTATGGGGACCGACCCCAATCTCATCTTGCCAACATTTTAAATATTATGTTAGTTTTATTAATGATTACTTACGATCTGTTTGGATTTATTATTTGAAAAAAAAAATCAGAGTTCTTATCTTGTTTTATCAAATTCAAACTTCTCGTGGAAAATCAATTTGAACAAAGGATAAAAGTTTTTCAAAGTGTTGGTGGGGGTGAATTTCAATCCACTACCTTTAAAAATATTTTGGAACAAAGTGGCATTAATCATTAATTTTCATGTCCTTATACTCCTCGACATAATGGAGTAGTTGAAAGGAAAAATCGTCATATAGTTGAAACTGGCATCTCGCTTTTGTTTGAATCTCATATGCCTCTAAAATATTGGGTGGATGCTTTTCTTACTGTTGTCTATCTCATTAATCGGATGCCCTCATATCCTTGTGTTTTTATCGGCTATAGCATGGTTCATAAAGGATATCGATGCCTAGATCCAACAAACAACCGAATTTATATATCAAGAGGTTGTTTTTTATGAAAGTTTCTTTCCTTATGATAAACTATCTACTAACAGCTCCACCAAGAGCACATTTCCTTTAGTTGTTTCTGATTTTGCAGGCACAATGGACAAGGAGGACAATCTACCAAGTATGGAGCTACTCTATGATTATGACAATAAAATTAATGCAATTGAGAATTCAAGAATGGATATGGAAATAATCAGCCCATCCAACTCCCGGGCAATAGAAATACAACACTCTAAGCCTACTGACATGATTGAATCAAGGGAGAAAAATATATTGCAACATAAGTAACAACCTCAACTTGAATAAATGATATCCTTACCAACTCAACCTAAGCCAATGCAGTAGACCAAGCTCTCAAGTTTCCAAAACTTAACATTTCTTCCCAACACACAAATTCTACTTCAACTTTGCCTACTGATTATTCAAACTATCATGACATGTCTGATATAAGCAATAACCTCTATTATTCCTACAGGACAAATTTCATCATCAGGAGACATGATTTATAATGCAGCAAACCAAAGTCTTATACAATAACAAATAACTGATCAAGACCATGAATTGCCAACATTGAACACTACTCCTCAAAACACAGATTTGCCTACTTCTAAGTATAATACTCTTCCTAATATTAGCAATCATCTTTTCATTGATTTGCCTCTTATATCTACGGTCCAACCTTACCACCAGGAGAAAACGTTGACAAAGTTGCAGTCCAACCCATGTCTCGTGACAAAACAACATATCAGTCTCCAAGACATCATATGATCACCAGACCAAGCTAAACAAAGATACCACTCTTGATCCAGCTCTTGCACATGAGGTCCAATCTTTGAGGGAACATAAAAGTTATGCCCTTGTTACAGTCACAGAACAATCAGAATCGAAAAACTATAAGATGACACTGACTCTCCCTCAGTGGAAAACAGCTATGGAAGAGGAAATGCAAGCCCTCTTACAAAACGACATGTTGAAACTTGTGCCCAGACCATTGGGTCTAAATGGATATTTAAGACAAAGTACAAGGAAGATGAAACAATAGATCAATACAAGGCACGCTTGGTAGCTCAAGTATACACTCAAATGGAAGAGTTACATTACGAAGAAACATAGAGTCCAGTGGTCAAGCCAACAACTATCAGGTTAATTTTATCCTTAGCAAAAAGTGCATGGAATTCTAAAGGAGGATGTATACATGGAACAACCACCAGGTTTTGTCTATCCTAAAACCTCTTCTACTTATAAACCTGCCCTTGTGTGTAAACTCAAAAAA
TTCATATATGGGTTGAAACAAGCACCTAGAGCATGGTTTGATAGATTAGCTGATTTTCTTCTCCATATTGACTTCACTTGTTGTACTTCAAACCCCTCTTTATTTATCTACTAAAATGGATCTATACTAACACTTATGCTAGTGTATGTTGACGACATTATTCTCACAGGAAATGATAACTGCCAGATTCAACACCTTATACAAACGCTTGGTTCAGAATTCTCTTTGAAAGATCTAGGTTCCCTTCACCACTTCCTTGGCATTGAAGTCAAGTCCACCTCTAAGGGAATCATGCTTTTTCAAGAGAAATACGCTCGAGACCTTCTTGCTAAAACCAAAATGGCAGGAGCCTCTGCCATAAGCACACCTTTGGCAACTTCAACTCAAGAACGTCCAACTGACATTCAACCTATAGATACTAAACAGTATAGAAGCATTGTAGGAGTCTTACAATATCTCACTCTTACTCGTCCTGATATAGTACAAGCAGTAAATCGAGTGTGCCAACACCTTCAACAACCAGTCACTAAGAACTTCAAAGCTATCAAGAGAATGTTTCGCTATGTCCAAGGAATAATAGACTATGGTATTACTCTCTATAAACATAGTTCTCTTAACTTATACGGTTTTTGTGATGCAGATTGGGGAGGGTGTCACGTAACCAGCCGAAGTACCACAGAATTCTGCATATTCTTTGGATCCAACTGCATCTCCTGGTCATCAAAAAAGCAACCCACTATGGCAAAATCCAGCTCCGAAGCAGAATATAGGGCCATGGCGAGCTCAACGGCGGAACTGACATGGATTAGCTTCATTCTTCGAGACATTGGAGTTCCTTTACTTCAAAAGCCCCAACCATACTGTGATAACATGAGTGCTCTCCACATGTTCATAAACCCAGTCTTCCATGTAAGAACGAAACACATAGAGATAGACTACCACTTTGTTCGTGAAAAAGTACCTCTCGGATCTCTCATCACCAAATACGTTCCTTCTACTCATCAAATCGCCGACATCCTTACCAAGCCACTAACAAAGATTGTGTTCAAGGGTCTAAGAACCAAACTGGACGTTCGTTCTACCACTACCACCAGTTTGAGCAACGATATTAAACAACCAACCAGGATTGATATAATTCCCAATCAACAACATCAACCAACTCAGCAGAATCTGCCAACTCAGTATTCTCATGGCCTCATATCAATACCTCACTACCAACAATAATTGAAGACAATACGTTTGCAATCTTTGCAATAACAGCAACCCCTCAATTATTGGCATTACTGGTAAATCAAATCCCCAACAACATAGCAATTAACTTCTTTAATTGTCGTAGTAGAAAGATGCCATCTTTGCCTCTTTCACCATATTTGTTATTACCTCATTTTATGCATTTATGTTGTTACCGTTGAGTCCTTTTTTTCCATGTATAGAGCTAACCTTAGGTCTTCTATAATAGTCTTTCATGCCCTTTAAAATCATATTTTAGCCTACTGTAAAGCTTAAGAAAATCAGTAACGATTGTTACCTCAGTTTGTGTTTGATCCCTCCTAACCTTTGAGCTATGATTGTTAAAAATTATATTGGTTCACCAACTAAGGACTACGTCTAGTCTTCCACTCAGTTGTGGGTAATTTACTAATATTGTCAAGATTTGATTACAATAAACTCTCCCCAAAAATATCTCCTAGAGGTCCTAACAAAAGCCCCTTAGCTTCTAAGTAATCCAACACAACTACTTAGGTACATGAATGAAAAACTCACAAAATAATCGAGAACAAATTATCCCAAAAATATCGATCTCGATCTACCATTTCTGTATTCCCAACACCCGAACCGTCGCCACGAAACTACATAAAAAAACCCATGGCTACTTTTTTTTTTGTTCTCACGGTCACAGCACATATATATGTCAAAAAATAGAACAACATTAAATTGGGCTTTTGGCCCATTGAAAACCCCACCAAGGCTCAATTCTAACTGCTCTGTCAACGAACTGA
CCCATACTACTCGGCCTTCCCCAAGCTCTGTCTGATTCTACCCCGAGACTGGACAACTCTACATGGTCTTGTCCTACAGACTTAGGGACATAGCCACATACTCTCACTTGATCTAAGTGATTCCCGCTGCTTTTTTATTCGATCGAACCCACTGATCTCTGAAAGAATATTTTCTCGCTACATTAGGCTTGGCATTCCCACCCAAGCATGTCGCCACTAGAAACAGAACTCGTACCCTTCAATGCACTAATTCACTTATGTGACTTCACCAACTCGCCTGAACGTCTAACACCTAAGGTGTGAACTTGATGACCTATCAAATTCTCATTTTCCTTTTCTGTACTTCCCAACTGTTTAACTTGAGCTGATGGCAAATTCTTCTGATCTTTTCTAGTCATCAATGTTGTTGATGATCAATGTTGCTGGTCTTCTAAGGTCTCACCTTGAACCATCTACAGCTACTCTGTGCATTTTCTGTCTCAGTGAACTCTTATCAACCCCAAACTCTGACACATAAACAGAAGATTTCTTGCGTCCTATCGCCGCTACCCTGGATTCCCTCTTGAGCTTCCAGAAACCCCCAACAAACTCACTGTTGTAGCCATCATCATCTAGCTTCTCTACGGATAACAAATTCAATCTGAAGCTAGGAATAAACCTGACATCTCGCAACAACATGCTGCCGATAATTTCTAGAACTCATCAAGCTTTTCATCATCTCTTGATCACTTTACAAGCTTTTCTTAGCTTATTTCTCATCATAACTTAGGTTTACATTGTTCTTCATTTTGTATTAGTTCTTTATTGTTTTCTATGCTTTTTAGCGAACTTTCTTGTTTTTTTATTCGTTTCTTGTAACTTGCAAATTCAATTTATAAAATTTCAGCATTCATCTTTTTCTTATTCTCGTTCCTACACCAATGGAATCTTGTAATCGCACATTATTGTGGAATCAAACGAGTTCTTAGTCCCCTAAACTCAAGAGAGAAAGGGTAAAGTTCACTCAGATTACAACCTTCTCATTTATAGTTTTGTTTTTATTTCTTCAATCTCAAATCACATTTCCCTTTTACATAACCTTTATATTATTTTTTCCTTTAAAAATATGAGCTAAATTCTTTACTCCTGGCCTGGTTATGGAAAGTTAGCTAAAACTTTTTAAAAGAAAACAACGTAGAAGGTGTTTTCTCATTTCTTTTATCATTAACATTGTTTTATAAGGTTAAGGATATAATTTGGCCAATTAGTGATTTAATAAAATCTCAATACTGAAAGAGTTGAGTTATTTTTAAAAAAAATACAGAACCAGTTTGAAACAAATTGGAATTGTATTCACTGTCTGAAAGCATACAAAACAACCTAATATTAATACTACTTACACGAAAAACTAAAGTCAGTAAAAAAGGGAAGAGTGCAACGGCTAGCCTTGATTGGGGAATCTACATATTTCCAAAATAAGGTAAACTAAGGGTATTGAAAATAGAAACTAATTATGAGTAATTACATCAGAAAGTAATGGAGAAAATATACACAATTACAATTATGAAACAACTGTAAATGGTCAGTGAGTGAAATAATAAACCAGCTGTGTATTCTTTCTGCACCAGATTTAGGAAGTTTGATGGCACCTGCTTTGAGTGAAAAACTAGTTGGCTTTGAGTGAGAGATACTATCTTACAGCAAGAAGACTGTTGTATCCAGAGGCTTGGTGTGGGGTTGAGATGTTGAGATTTCAGTACAATCAAAGTTCACATCTTCCTCAAACTGGTGTGTATCTCATGCACTCCTAGTTTGCTCCTCAGATCCTTGAAATTTGTCTTTGGTAAAGCTTTCGTAAATATATTAGCTACTTGACTTGTTGAAAGCACAAATTTTGTGATTAGGTGTTCAAGTGCCACTTTTTCTCTTACAAAATGGTAATCAATTTCTATATGTTTTTTTTCTTGCATGAAACACTAGATTCTTGGCGATGTGCATGGCACTGACGTTGTCACATAGAA
GTTGAGGAGGCTTTATGATTTTTATTCCAACATCTCTTAGTAAGAACATAATCTTAAGTGGGGTCCGCTGTGATTGATGTCATTGCTCTATATTCTGCTTTTGTGCTTCATCTTGCCACCGTAGTCTGTTTCTTTGAGCTCCATGATATACAATTTGCTCCTAAAAAGATACATGATCCTACGGTACTCCTTCTTGTGTTAGGACATCCTGCCCAATCTACATCAGAGAAACCATAAACATTTAGAGAGCAATTTTTGTGGAAAATAACCCATACCTCAGTGTCCATTTGATGTATCTCAATATTCGCTTCACTACTTTTAGGTCTTTGTACTTTGGTTCTTGTAGATGCTGGCAAACTTTGTTTACGGCAAAAGTTATGTCTGGTTGTGTGAGAGTTAAGTAGTGTAGTGATCCTACAATACTTCTATACTCTTTTTTTATCAATGGGATCATTGTCATCTATGGTTTCCATGGTTGTGGCAGCAGTTGGAGTACTGAGATGAGCAGTTTCTAGCATTCCAGTCTTGTGAAGGTCTTGAACATATTTTAATTGCAATATGGAGATTTTTGTTGGGTGTTTAATACTTCTATTCCAAGGAAATAATGGAGATTTCCTAAATCCTTTATGGCGAACTCCATGTCTAGTTGGTGGATAAGATCTTCCATCATTTTTGTCTTGGTACCTGTCATGATTATGTCATCAACGTAAATAAGCATTAGAATGAGGGTTGTTTCTGTTTTTAGAATAAATAATGACAGATCATAGTAGGGATTAGAGAATCCTGTGTGTAGGAGGTACTGGGAGAGCCTTTCAAACCATGCTCTTGGAGCTTGCTTCAAGCCATATAGCGATCGATTGAGTTTGCACACGAGGTGAGGCTTCTTTAGATGTTGGAATATGGTGGTTGAGTCATGTACAAGTCTTCCTTCAAGTGGCCATGAAGAAAGGCATTTTTAACATCAAGTTATTTGAGACTCCAATGAAAGTGGACAGCCAAGGTAAGGATTAACCTAATTGTTTTAGGTTTGACCACTGGACCGAAGGTTTCATAATAGTCTACTCCTTCAATTTGAGAGTATCATTGAGCCACTAATCTAGCCTTATATCTTTCGATGCTGCCATCCTCCTTCAATTTTGTTTTTAAAATCCATTTAGATCCAACAATATTCGTGTGAGGAGGGGGAGGGACAAGATTCCAAGTGTGATTGTGTTCAAGAGCTCTCATTTCTTCTTCCATGGCTTGCCTTCTAATGTGGGATTTCTAGTGCTAACTTGTAGTTTTTAGGTCAGCTAATAAGGCACGGTTTTTTTGTTTATTTCTAATGTGTTGGGTTGCCTAATGTAGTATTAGATCTAAAGTCTTGTCTTGATGGAGTTTTGTTCTTGTGATCATAGGGTGATTATTTGTTTTCAATGTATTGACTGTGTTTAATTCAGGCTGCACTTGATTTTTCTACAAAGGGGTTGGGTTGGTTTGATGAGAACCTGCAAAATAATGAGTCAAGTCAACAACAAGATACTCACTAATGTCTGGCAAAATACTTACATGGTTGGAAGGTAGTTGATTGTTGGTGTTCTCCATATTTCTCAATTCTTGTGAAATATCATGGAGAATTGTTGATTGGCTGTTATTCTGAGTATGTGTAATTTGAGAGGCAATTGCAATGTCTGTTGGATCAATACCCAAGTGAGTAGTTTGAGAACTTTGGTAATTGATTGCTTCGGTGGTTGCTTTGGTTGATAGGCCAATATCAGTAGCTGTTAAGTTATTTATATTCTGAATATTTCTATCATTTTTTGTTGCTTCGATAGATAAGCCACTATCAACATCTTCCTCCATTGTACAACTTTGATGATTTGACGGAAGGTTTGTGTGTTCTTCTTCACTGCAGCTTTCATTGTTTCTTGTCGGTATGAAAATTTGCTCAATGAAAGGCTCCTCGGTACAAGGTAATGAGTGAAGTTTGCTTACTGGTTTAGCAGAGTCAGTACACCTAT
GTGCTGTTGTCTGTTGAGAGATGCCTTCTTGGTCGTTGGTCGATTTTTTTTTTTTGTTGGCGTTAAGTCAATCCTCATATCCTGTAAATTCACAAATAGCAATATCCCTTTCTCTTTCTACTATTGAGAAAAAAAATGCAGCCTACACTTAAAAGGATTAGATAAAATAATCTAAAACTTCTAAGTAACAAGAACGATAAGTAGATTAGGGAGAAGAAAATTAAAGGTCGGATTTATACTGGTTCACCCAATGTGGGCTACGTCCAGTTCTCTGCTACCGTTGCAGGTTCTACTAAAATGTCAGGAAGAATCTCCCAAGAGTTTACAAGCCCCTTGCAATGATTTCTCTCAGTTTTCCCTCACACAGAAAACGCTCACGAATCACTCTACAAGCTTTTCAAACTCTCGAAAGGATACACAAGTATTTAGCTCTCAAATTCTCTCTTCTTGGAGAATTTCTCATGACTATCTATTTGTGCACACCAATCAAATGGCCTATCTATCTCTATTTATAATAGATGGTATGGCAAAAACTAAAAGGGAATGGGAAAATAAAAAAGCTCAAACTAAATAGGACAAGGAAAAATTTTAATGGACCTTTTCCTTATAGGAAAAAAATTTTCTCCTTGGTTTGGCTCCATAATCCCAACAATGTCTCACTTGGATTCAAAGCAAGCAGTATGTCTCTTCTGCAACGATAGCGACTCCTCTCTAGTCGTGGAGAAGTCCAACTGAAGTTGCACACAACTTCAATTTATCAACAGGAACGACCTTGGTAAACATATCCGCTGGATTCTGACAACTTGGAATCTTTTCAAGTAACAACATTCCATCTTCCTAAGCTAATCTGATAAAATGGTATCTTATTTGTATGTGTTTCGTTCTTGAATGAAACACTGGATTCTTTGTTAAATGAATAGCGCTCTGACTGTCACAATGCAAAGTCCCTTTTTCCCAATCTCAAGTCAAGTTCTTCCAAGAAAAACTGAAGCCATATCATTTCTTTGCTTGCTTCAGTAACTGCAATGTACTCAGCTTCTGTAGTAGAAAGTGCTACTATCTTCTGCAGTTTTGAGACCCAGCTAATCGCAGTATTACCCAACGTAAAAACATATCCGGTAGTACTCTTTCTGCCATCTGTATCCCTTGCCATATCTGCATCTACATATCCCTACAGTTTAAAATCTAATCTTTTAAAGTGCAGAGCATAATCAGTCTTCTTCCTCAAGTATCTGAATATCCACTTTACAACCTCCCAATGTTGCTTTCCTAGGTTACTCATATACCTGCTGACAACTCCCACTGCATGTGCAATATCGGGTCTTGTACAAACCATGACATACATCAAGCTTCCTATAGCTGAAGCATATGACACTCTAGCCCTATGATCTCTTTCCTGCTCAGTAGTTGGTGAGTGCTCTTTAGATAATTTAAAATGACTAGCTAAAAGAGTACTCACAGGTTTTGCATTTCCCATGTTAAACTTGTTCAATACTTTATTAACATACTCTTTCTGTGACAGTGTCAAAATACCATTTTTTCTCTCTATCCTCATCCCTAGGATTTGTTTGGTAGCTCTCAAATCTTTCATGTCAAATTCGTTAGATAATTTAGTCTTCAATCTCTGTTGTTTTAGTCTTAACCTCAGGAGGACTTATATACTGAGTATTCTCTGTGCTTCCATCTAACTCTGTCTTTCCTGAATCAACTTTATTTATGTCTTTATATAAGACTATCTCATTGAAGATCACGTTCTTGCTTCTGATAATCTTCTTGTTTTGGTCATCCCAAAATTTGTAAGCTATCTCAGCTCCACCATACCCAATAAAGAAACATTTCTTAGATTTGGCATCTAATTTACTCCTGTCTACAACATCAATGTGTAGGTAAGATACACAACCAAAAACCTTTAGATGGGAAAGGTTTAACGGTTTACCAGTCCAGACTTCTTCAGGCAATTTGAAGTCTAGTGATGTGGTCAGGCTCCTATTGATTAGATA
GGTCACACAATTGATTGCCTCTGCCCAAAACATCTTAGGCAATTCTGCATGTATTCTCATGCTTCTAGCACGCTCATTCAGCATCCTGTTCATTCTCTCTGCCACACCGTTCTACTGTGGTGTCCCTAGAAGAGTCTTTACCATCACAATTCCACTATCTTCACAATATTTCTTAAATTCCCTACTAACATACCCACCACCATTATCTGACCTCAAGCTTTTTATTTTCAGGCCTGTCTCATTCTCTACCATAGCCTTCCATGTTTTGAAATAAGTCCATACCTTCCTACTTGAGTCGTCTATGAAAGTTACATAGTATCTGGAACCTCCAATGGAAGAAACTTCAGAAGGCCCCCACACATCGGGGTGAACCAACTCCAATTTTGTAGACTTTGGCGTTGAACCTGTCTTTGAGAAACTGACACGACTCTGTTTCCCAAATATACATCCTTCACATAACTTGTGTTTCACTGCCGTAAGTCCTTCTAGCTTCCCTGTAGAGTGAGGAATTTTCATACCTTTTTCACTCATATGTCCCAGCCAAATGTGCCATAATTGGGTCTGACTCGAATGATCTACAACAGCTATCATATCTTTGTCGTTGTTGTTGACATATAAAGTTCCTAACTTTCTTCCTCGAGCGATCATCATCGAACCCTTTGTAACTTTCCAGTTTCCTTGACCGAAGGATATTTCACATCCTTCATTATCCAGCTGCCCCACGGAAATTAGGTTCTTCATCATATTCTGAACGTGACGTACCTTGCGAATATTCTAGACTGAACCCTTCGCCATTTTTAAATTAACATCACCAATCCCAATGATGTCCAAAGGCTCTCCATCAGCAAGATACACATTTTCATGATTTCCTGCAACATAATTCTCAAGAATGTCACGTTGTCCTGTAGTGTGAAAAGACGCACCTGAATCCACCACTCATGTGTTATGAGCGCTCTCAACTGCAAGAACTAGAGCATAATGTATTTCTTCAGCAACAACATTTGCACCAGCTTCTTTCCCCTCAGCTTTCTTTGGAGCTTTGCAGTTCCTCTTCAGATGTCCTGTCTTACCACAATTCCAACATTCTAGTCTACTGTTTGTAGACTTGTTTTTGTTGTTTCTTGACTTTCCACGTTTTCCGTGGTCTCTGTTATTATTTCTTCCTCTGTCCACATTCAATGCTGTACCAGAAGTAGACGCAATACCAGAATCCTTTCTGCGAATCTCCTCTCCAAGAGCTACATCTCTGACATCTGCGAATTTCAATTTCTCTTTTCCACAAGAATTTGAAATAGCTGCCTTCATATGATCCCAACTGTCAGGTAAGATCAACAACAAGATAGCATTTAATTCATCCGTAAATGTTAAATCCACAGCAACTAGTTTGTTAATCAACGTATTAAATTCATTTATATGGGCAGCCACAGATGTACCTTCATCCATCTTCAAATTAAAAAATTTAGTTGCGAGATACACCTTATTATTTACCGAGGGCTTCTCATATATGTTGGACAGTGCACTCATCAGCCCCATTGCGGTAGTCTCCTTAGCTACGCTGCTCTACACATTTTTTGTTAATGTTAGGCGAATCGTACCCGACACCTTCCTGTCCAACTTTTTCCATTTGACTTCTTCCATGTCATCTGGCTTCTTGTCTAATGGCAGTTCCAATTCCTTGGAATGTAGATAATCTACTATCTGATCCTTCCAGTACAGTCATCGTTCTTCAACCTTGCTCTGATACCAATTGTTGAGAAAAAAAATGTAGCCTACACTTAAAAGACTTAGATAAAATAATCTAAAACTTCTAAGTAACAAGAACGATAAGTAGATTAGGGAGAAGAAAATTAAACGCCGGATTTATACTGGTTCACCCAATGTGGGCTACGTCCAGTTCTCTTCTACCGTTGCAGGTTCTACTAAAATGTCAGGAAGAATCTCCCAAGAGTTTTCAAGCCCCTTACAATGATTTCTCTCAGTTTTCCCTCACACAGAAAACG
CTCACGAATGACTCTACAAGCCTTTCAAACTCTCAAACAGATACACAAGTATTTAGCTCTCAAATTCTCTCTCCTTGGAGAATTTCTCACGACTCTCTATTTGTGCACACTAATCAAATGACCTATCTATCTCTATTTATAATAGATGGTATGACAAAAATTAAAAGGGAATGGGAAAATAAAAAAGACCAAACTAAATAGGACAAGGAAAAATTTAAATGGACATTTTCCTTATAGGAAAAAGATTTTCTCCTTGGTTTGGCTCCATAATCCCAACATCTACATTGTTAGTTTCACTATCAAAAGGAAAGACAATTTCATAAAAGACATGTCTAGATATGTAGATTCTATTTGTAGAAAAATCCAAACATCTATACCTTTGTGCATTGGACTATAACCCAAGAAAACACAAGGATAAGTTCTCAAAAGAACTTATCTTTGTTATAGTTTCTTACGTACGGACAACATTTACAACCAAATACCTTTAAAGCATTATAATCAGGATTATTACCAAAAAGTTTAAAGAAAGGAGTTTGCATATCAAGTTTCTTTACTAGGCATCCTATTAATGAGATAAACAGCAGTTAAGAAGGCATCCACCCAATATTTAGGAGGAACCATTGATTGAAATAAATGAGTTAATCTGGCTTCAACGATGTGCCTATGTTTTCTTTCCACTACTCCACTTTGTTGTGGGGTGTGTGGGCAAGAAAATTGGTGAAAGATGCCATGGTTTTCAAAGAGACGTTTGAGTGTTGTGGGGTTAAATTCCCCTCCTCCATCACTTTGAAAAACCTAGATCTTTCTTTCAAAGTGGTTCTCAACAATATTTTGAAACTTAATAAAGCAAGACAAAAATTCTTGTTTTTTATTTTACGGATATATCCAACAATACCTAGAGAAGTCATCTACAAAACTAACATAGTATACAAAACTAACATAACATAGTATACAAAACTAACATAGTATTTAAAGTTTTGACCATAGTAGATAGGGGAGGGTCCTCATGAATCACAATGAATCTTTTGTAAGGGAAAAGAAGCAATATCATTATTCAATTTAATGCAACTTTTTGCTCATTTGACAAGTAGCAGATACAGAGTCATTTGTTTTCCCTCTAGTTATATCTAAAAGATGAGAGTTATTTAGATATTTCAATGTGAGGTCATCTGGATGACTCATTCTTTTGTGCCATATAGTGTAAGGTGCCTTATTAGAGATAGAAGTGGCCGTAAAAGCCTTAAATGCCACATGGTTTTGTTCAATCAAAGCGTACAAGAAACCTTCCTTCTTAGTTCCTTGCCCATGATTGCTACTTTCTTGTTCTTGATCACAAAGCCATCAGTGTTAAAACATATAGAGCAATAATTTTTAGAAGTTAACTTACTAATGGAGAGTAAATTCTTTGTGATCTCAGGTACAACAAGAACATTATTGAGATTCTAGTTACTTTTAATGGTATTTAAAGTGCATTTTCCTATATGTGTGATATCAAAGCCCTTTCCATCTCCTATGTATAGTTTTTTCATGACCATTACAAGGTTTTACATGGGATAGCGTACCTGGGTTGCTAACTATGTGAGCTGAAGCAAAGAATCCGCAAATAATGTGTTATCCGCAATATTCTGTTCATTGAGTGTAATAGCAGCTAGAGCTTGAGGGATATCTCCGAATTGGTATGAATGATCGAATCTATTCCAACATTGAAGAGCTATGTGGTTGAATTTCTCACATATTTGACATTGAGTGTTTGAGTCCTTTGAGATGGAGGTATTTTTTGGTTGGAAGTCTCCTGCTACGTGTTGTTGCGGCATATTGCTAAAACCAGGAGATGGATTTTGATTTTTCTGTAGTTAGTTTGAGCTTCTCCATTGATTACTACTCTACTTTTGTTGTAAATAAATGACGTCCCCTCCCTCTAGAGTTATTTCGTGGACTAAAGAAAGCGTGGTGATAGAACAATTTGAGTAAGTTTTTCTTCCTCATTCATAGTAGTAAGT
TGTGTCTCATGGTTTCTTAAAGATTGGTCAAACTGATTAAAGTTTGGATATGGTGTTTTTGACAACATTGCGACTCTAAAATCCATGTATTTGTTTCACGATCCTCTTGCTATATGAAAAACTTTAGTAATTTCATCCAAAGGTTTCTTGATCGCAGCTAACATGTCACAAATAGATTTAAACTTCTTTAGATATTCTTCTAAATTGAGACTACCTTTCTTTAGCTAACCAAAGGCTTCGTTAAGATAAATCTCATTCTCCTTGATCATGGTGAGAAGTTGTTCCTCCAAGGAAATCAACATTTGACGTGTTGTATCAGTATTTTTGATCATGTTAAGGATTTCTTCAATGGTTTCCAAGAGTCAAGACACTATAAGCCCATCGTTATTCCTCCAAGTGGAGTATTCTGGATTGCATTGTGTCTTTCGCTTGCTGTCATCAATGTATTCTTCTGGTGGTGTGTCATCAGTAATATGTCCCTCGATGGAACTTTGGTGGAAAACTTGTATTATAAGAGTATTTTCTGTTTTGGCCATTGTGAGAATCTGAACAAATTGATTAGCACTTCTTTCGTGTAAGTATCAACTCTGATACCAAGAAAAAATACATAATCAGTTTGAAACAAATTGCAATTGTATTCACTGTCTGGAAGCATACAAAACAACCTAATATCCATACTACTTATACGAAAAACTAAAGTCAGTAAAAGAGGGAAGAGTGAAACGACTAGCTTTGATCGGGGAATTTACATATTTCCCAAATAAGCTAAACTAAGGGTGGGTATTGAAAATAGAAACTAATTATGAAATTGCTATAAATGGTCAGTGACTGAAATAATAAACCAGCTATGTGTTCCTTCTGCACCAGATTTGGGAAGTTTGATGACAGCTCCTTTGGGTGAAAAACCAACTAGCTTTGAGTGAGAGATACTATCTTACAGTAAGAAGACAGTTGTATACCAAGGCTTAGTGAGGGGTTGAGATGTTGAGATTTTAGGAAAATCAAATTTCACAATTTTTACAACTTATTTGATTTAACTTTGGAGAGAAAGACTTACACGACATAAAGGTTTCGTTCGAGTAGCCTAGAATAGGATCTCGACCCAAAGTTATGGTTGATTTTTATGAAATAAGTTCACTTTTCTAAATACATGTTAGGAATGGATTAATAACTTAAAAATCACTTAGACATAAAGATTGCGAAGTAGAACCGGTTTTTATTCAAGGGAACCACCTACACCTAACTGGTCTCATTTCGTATTTTCTTAGCTAATTTGAAGTAATTGGACTGGGTTTATTCCATAAACTTGAAAAACTCAAAACCAACATCATTCCCATTATATTGCACCTAAAATCCAATTGCAAAACCGTCGATCATCATCATTTTACAAAAGAAATAACAACTTCAATCTCTGTGGATCGATACTCGGAATACTCATTCCACTTATTATTTGTGACATCACTATGTTCCTTTATTTTAAATCGAAAATCTCAAACACCAATTTTCCTAAAGGTCGAGCCGATGCATAGAAGATCGTGCCTAACTTCAATCAATAACTAAGGGATCCATCCCACTTTTCATGTACGCACGAATCCAACTGACTGTTTAGTGAAAATATGAATGACTATTCAAGGAGAGACACGTGCTTAGTATGAGAATTGTTTTTTTTTGGATAAAATTAAAATCAAAATCAGGAAATTTGATGTTTATCGTTATCTTCTTTAATAGTCACTCGGTATTGTAGCAGAGAAAAATTACTATTAATATATTCCTATATATTTTTGTACTGGAATTAATTATTTTTGGATGCCTTGATATTCTTAAGGCTTTCAGGGTGATTATGAAATTATTTTCAACTTATAATGGAAAAAAATTTACTCCGATAAGATCTACTAATACAGAAAATATATACGACAAACGTGAGCATAACCCATGCTCATCCAATTTGAATCTTTTCATCCTTTGGTACTTAAAAACAAAAATGTAATCGATAATATTTT
GTGTACATGAATGGCCATGAAACTTGGTCAACTACCATTAAGTTATGAAAACAAAAACGTTATAGTTTGCCGAATATATATTACGTGCAGACATCAATAAATACATGTATAATTCAAAGTGGGAAATACATGGTCAACATGAAATGGTCCAAATAATTAATAAACTATTCTTTCTTAGGCTACGGGCCACCCACATGTATATTTGTTTGGTTATTCATAATAAAAACAAATCCAAGCAGCCCAAGATTGTACTAAAAAATAGAAAATAAATAAAAACAAATTAAAGCAGCCTTACAATCTGGCGACTTGGCCAAATTGAAACAATTTGTTTTTCAGTTTTTGCATCACAAAATAGAGACTATAAGTAGGGACGAGGATAACCCCATTTAGAACATAACTCACATTAATTAGCAATAAAGAAAGCAATTAGATTAATTGTAATTGATTGATCATGGCAATTGAAGCAATGATCCACACTCTCTTCACTTTTGCTTTGCTCTTCTCCTCTGGTATGTATACATTATTAACAACATTCATTGTTTTTATGTTTTAAGTTTTGGATGAATTAATAATGTCATGTGTTTGTAAAACAGTAGTCCATGCAGCCACATTTGTTGTCAAAAACAATTGTCCACAGACCATATGGCCCGCAACACTAACCAGCGGCAGCGGACAATCCCAACTCTCCTCCACTGGATTTCAGTTGCTTTCTGGAGCGTCGAGGACCGTAGATGTGCCTGCACCGTGGACGGGTAGGATCTGGGCTCGAACGCGCTGCTTCGTCGATAGCTCTTCAAGATTTTCGTGCGAAACTGGAGACTGTGCCTCCGGCTCCATCTCTTGCAATGGCGCAGGAGGAATTCCACCGGCCACACTGGCCGAGTTCACATTAGCCCCCAATGGAGGCATGGATTTCTACGACGTCAGCCTCGTCGACGGCTTCAACCTGCCGGCTTCCATAGCCACAGTGGGTGGAACAGGAGAATGCCAATCCACAGCTTGTTCTGCAAATGTAAACGGGGTTTGCCCAACGGAGTTGCAAGTCAGGTCAGGAGATGGGAGCGTGATTGGGTGTAAGAGCGCATGCCTTGCGTTTAATGAACCACAGTATTGTTGCACCGCGGAATTCAACGACCCCAGCAAGTGCGCTCACAGCCAGTACTCGTTGATCTTCAAGAACCAATGCCCTCAGGCTTACAGCTACGCTTATGACGACAAAACCAGCACCTTCACATGCAGCGGTTCGCCCAATTATGTTGTTACCTTCTGCCCG

mRNA sequence

TGTGTTTGTAAAACAGTAGTCCATGCAGCCACATTTGTTGTCAAAAACAATTGTCCACAGACCATATGGCCCGCAACACTAACCAGCGGCAGCGGACAATCCCAACTCTCCTCCACTGGATTTCAGTTGCTTTCTGGAGCGTCGAGGACCGTAGATGTGCCTGCGCCGTGGACGGGTAGGATCTGGGCTCGAACACGCTGCTTCGTCGATAGCTCTTCAAGATTTTCGTGCGAAACTGGAGACTGTGCCTCCGGCTCCATCTCTTGCAATGGCGCAGGAGGAATTCCACCGGCCACACTGGCCGAGTTCACATTAGCCCCCAATGGAGGCATGGATTTCTACGACGTCAGCCTCGTCGACGGCTTCAACCTGCCGGCTTCCATAGCCACAGTGGGTGGAACAGGCGAATGCCAATCCACAGCTTGTTCTGCAAATGTAAACGGAGTTTGCCCAACGGAGTTGCAAGTCAGGTCAGGAGATGGGAGCGTGATTGGGTGTAAGAGCGCATGCCTTGCGTTTAATGAACCACAGTATTGTTGCACCGCGGAATTCAACGACCCCAGCAAGTGCGCTCACAGCCAGTACTCGTTGATCTTCAAGAACCAATGCCCTCAGGCTTACAGCTACGCTTATGACGACAAAACCAGCACCTTCACATGCAGCGTAGTCCATGCAGCCACATTTGTTGTCAAAAACAATTGTCCACAGACCATATGGCCCGCAACACTAACCAGCGGCAGCGGACAATCCCAACTCTCCTCCACTGGATTTCAGTTGCTTTCTGGAGCGTCGAGGACCGTAGATGTGCCTGCACCGTGGACGGGTAGGATCTGGGCTCGAACGCGCTGCTTCGTCGATAGCTCTTCAAGATTTTCGTGCGAAACTGGAGACTGTGCCTCCGGCTCCATCTCTTGCAATGGCGCAGGAGGAATTCCACCGGCCACACTGGCCGAGTTCACATTAGCCCCCAATGGAGGCATGGATTTCTACGACGTCAGCCTCGTCGACGGCTTCAACCTGCCGGCTTCCATAGCCACAGTGGGTGGAACAGGAGAATGCCAATCCACAGCTTGTTCTGCAAATGTAAACGGGGTTTGCCCAACGGAGTTGCAAGTCAGGTCAGGAGATGGGAGCGTGATTGGGTGTAAGAGCGCATGCCTTGCGTTTAATGAACCACAGTATTGTTGCACCGCGGAATTCAACGACCCCAGCAAGTGCGCTCACAGCCAGTACTCGTTGATCTTCAAGAACCAATGCCCTCAGGCTTACAGCTACGCTTATGACGACAAAACCAGCACCTTCACATGCAGCGGTTCGCCCAATTATGTTGTTACCTTCTGCCCG

Coding sequence (CDS)

TGTGTTTGTAAAACAGTAGTCCATGCAGCCACATTTGTTGTCAAAAACAATTGTCCACAGACCATATGGCCCGCAACACTAACCAGCGGCAGCGGACAATCCCAACTCTCCTCCACTGGATTTCAGTTGCTTTCTGGAGCGTCGAGGACCGTAGATGTGCCTGCGCCGTGGACGGGTAGGATCTGGGCTCGAACACGCTGCTTCGTCGATAGCTCTTCAAGATTTTCGTGCGAAACTGGAGACTGTGCCTCCGGCTCCATCTCTTGCAATGGCGCAGGAGGAATTCCACCGGCCACACTGGCCGAGTTCACATTAGCCCCCAATGGAGGCATGGATTTCTACGACGTCAGCCTCGTCGACGGCTTCAACCTGCCGGCTTCCATAGCCACAGTGGGTGGAACAGGCGAATGCCAATCCACAGCTTGTTCTGCAAATGTAAACGGAGTTTGCCCAACGGAGTTGCAAGTCAGGTCAGGAGATGGGAGCGTGATTGGGTGTAAGAGCGCATGCCTTGCGTTTAATGAACCACAGTATTGTTGCACCGCGGAATTCAACGACCCCAGCAAGTGCGCTCACAGCCAGTACTCGTTGATCTTCAAGAACCAATGCCCTCAGGCTTACAGCTACGCTTATGACGACAAAACCAGCACCTTCACATGCAGCGTAGTCCATGCAGCCACATTTGTTGTCAAAAACAATTGTCCACAGACCATATGGCCCGCAACACTAACCAGCGGCAGCGGACAATCCCAACTCTCCTCCACTGGATTTCAGTTGCTTTCTGGAGCGTCGAGGACCGTAGATGTGCCTGCACCGTGGACGGGTAGGATCTGGGCTCGAACGCGCTGCTTCGTCGATAGCTCTTCAAGATTTTCGTGCGAAACTGGAGACTGTGCCTCCGGCTCCATCTCTTGCAATGGCGCAGGAGGAATTCCACCGGCCACACTGGCCGAGTTCACATTAGCCCCCAATGGAGGCATGGATTTCTACGACGTCAGCCTCGTCGACGGCTTCAACCTGCCGGCTTCCATAGCCACAGTGGGTGGAACAGGAGAATGCCAATCCACAGCTTGTTCTGCAAATGTAAACGGGGTTTGCCCAACGGAGTTGCAAGTCAGGTCAGGAGATGGGAGCGTGATTGGGTGTAAGAGCGCATGCCTTGCGTTTAATGAACCACAGTATTGTTGCACCGCGGAATTCAACGACCCCAGCAAGTGCGCTCACAGCCAGTACTCGTTGATCTTCAAGAACCAATGCCCTCAGGCTTACAGCTACGCTTATGACGACAAAACCAGCACCTTCACATGCAGCGGTTCGCCCAATTATGTTGTTACCTTCTGCCCG

Protein sequence

CVCKTVVHAATFVVKNNCPQTIWPATLTSGSGQSQLSSTGFQLLSGASRTVDVPAPWTGRIWARTRCFVDSSSRFSCETGDCASGSISCNGAGGIPPATLAEFTLAPNGGMDFYDVSLVDGFNLPASIATVGGTGECQSTACSANVNGVCPTELQVRSGDGSVIGCKSACLAFNEPQYCCTAEFNDPSKCAHSQYSLIFKNQCPQAYSYAYDDKTSTFTCSVVHAATFVVKNNCPQTIWPATLTSGSGQSQLSSTGFQLLSGASRTVDVPAPWTGRIWARTRCFVDSSSRFSCETGDCASGSISCNGAGGIPPATLAEFTLAPNGGMDFYDVSLVDGFNLPASIATVGGTGECQSTACSANVNGVCPTELQVRSGDGSVIGCKSACLAFNEPQYCCTAEFNDPSKCAHSQYSLIFKNQCPQAYSYAYDDKTSTFTCSGSPNYVVTFCP
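
The relationship between the coding sequence and the protein sequence above can be checked by translating the CDS codon by codon. The following is a minimal sketch, assuming the standard nuclear genetic code and reading frame 0; the function name `translate` and the use of `X` for unrecognised codons are choices made for this illustration, not part of the database record. Only the first 30 bases of the CDS are used, to keep the example short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence in frame 0.

    Trailing bases that do not form a full codon are ignored, and
    codons containing non-ACGT characters are reported as 'X'.
    """
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3)
    )


# First 30 bases of the CDS listed above.
cds_start = "TGTGTTTGTAAAACAGTAGTCCATGCAGCC"
print(translate(cds_start))  # CVCKTVVHAA
```

The output matches the first ten residues of the protein sequence. Note that the CDS as listed begins with TGT (Cys) rather than an ATG start codon and ends on CCG (Pro) with no stop codon, which may indicate a partial gene model.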
Homology
BLAST of MS006573 vs. NCBI nr
Match: RDY04111.1 (hypothetical protein CR513_12220, partial [Mucuna pruriens])

HSP 1 Score: 541.6 bits (1394), Expect = 6.5e-150
Identity = 253/443 (57.11%), Positives = 315/443 (71.11%), Query Frame = 0

Query: 16  NNCPQTIWPATLTSGSGQSQLSSTGFQLLSGASRTVDVPAPWTGRIWARTRCFVDSSSRF 75
           N C  T+WP TLT G  + QLS+TGF+L SGAS +VD+P+PW+GR WART C   S++ F
Sbjct: 10  NKCTYTVWPGTLT-GDQKPQLSTTGFELDSGASNSVDLPSPWSGRFWARTGC--SSNNGF 69

Query: 76  SCETGDCASGSISCNGAGGIPPATLAEFTLAPNGGMDFYDVSLVDGFNLPASIATVGGTG 135
           SC TGDCASG + CNGAGG PPATL E T+A NGG DFYDVS VDGFN+P SI   GG+G
Sbjct: 70  SCATGDCASGQVMCNGAGGNPPATLVEITVAANGGQDFYDVSNVDGFNVPVSITPQGGSG 129

Query: 136 ECQSTACSANVNGVCPTELQVRSGDGSVIGCKSACLAFNEPQYCCTAEFNDPSKCAHSQY 195
            C++++C + +N VCP +LQV+  DGSVI CKSACLAF   QYCCT + N    C  + Y
Sbjct: 130 GCKTSSCPSKINTVCPAQLQVKGSDGSVIACKSACLAFGGDQYCCTGDHNTAQTCPPTNY 189

Query: 196 SLIFKNQCPQAYSYAYDDKTSTFTCS----------VVHAATFVVKNNCPQTIWPATLTS 255
           S  F  QCP AYSYAYDDK  TFTCS          V   A     N CP T+WP TLT 
Sbjct: 190 SQFFSQQCPDAYSYAYDDKHGTFTCSTHDFGCILNAVSQGAKVTFTNKCPYTVWPGTLT- 249

Query: 256 GSGQSQLSSTGFQLLSGASRTVDVPAPWTGRIWARTRCFVDSSSRFSCETGDCASGSISC 315
           G  + QLS +GF+L +GAS +VD+P+PW+GR W RT C  ++  +FSC T DC SG ++C
Sbjct: 250 GDQKPQLSKSGFELATGASDSVDLPSPWSGRFWTRTGC-SNNGGKFSCATADCGSGQVAC 309

Query: 316 NGAGGIPPATLAEFTLAPNGGMDFYDVSLVDGFNLPASIATVGGTGECQSTACSANVNGV 375
           NGAG  PPATL E T+A NGG D+YDVS VDGFN+P S++  GG+G+C++++C  N+N  
Sbjct: 310 NGAGANPPATLVEITVAENGGQDYYDVSNVDGFNVPMSVSPQGGSGDCKTSSCPKNINEA 369

Query: 376 CPTELQVRSGDGSVIGCKSACLAFNEPQYCCTAEFNDPSKCAHSQYSLIFKNQCPQAYSY 435
           CP ELQ++  DG+VIGCKSACL FN+PQYCCT + + P  C  + +S  F+ QC +AYSY
Sbjct: 370 CPAELQLKGSDGNVIGCKSACLVFNQPQYCCTGDHDKPETCPPTNFSQFFEQQCSEAYSY 429

Query: 436 AYDDKTSTFTCSGSPNYVVTFCP 449
           AYDDK STFTCS  P+YV+TFCP
Sbjct: 430 AYDDKNSTFTCSNRPDYVITFCP 447
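
The percentages in each HSP header are plain ratios of matching columns to aligned columns. A short sketch of that arithmetic, using the counts reported for the RDY04111.1 and OMO73718.1 hits (the helper name `percent` is hypothetical):

```python
def percent(matches, aligned_length):
    """Share of aligned columns that match, as a percentage with two decimals."""
    return round(100.0 * matches / aligned_length, 2)


# Counts taken from the HSP headers in this record.
print(percent(253, 443))  # identity for RDY04111.1   -> 57.11
print(percent(315, 443))  # positives for RDY04111.1  -> 71.11
print(percent(250, 443))  # identity for OMO73718.1   -> 56.43
```

The denominator is the alignment length including gap columns, not the length of the query protein, so the percentages are not directly comparable across HSPs of different lengths.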

BLAST of MS006573 vs. NCBI nr
Match: OMO73718.1 (Thaumatin [Corchorus olitorius])

HSP 1 Score: 510.8 bits (1314), Expect = 1.2e-140
Identity = 250/443 (56.43%), Positives = 296/443 (66.82%), Query Frame = 0

Query: 7   VHAATFVVKNNCPQTIWPATLTSGSGQSQLSSTGFQLLSGASRTVDVPAPWTGRIWARTR 66
           V +ATF   NNCP ++WP  L SG G  Q SSTGF+L S AS T+D+ APW+GRIW RT+
Sbjct: 19  VESATFTFTNNCPYSVWPGIL-SGQG-PQPSSTGFELASKASYTLDISAPWSGRIWGRTQ 78

Query: 67  CFVDSSSRFSCETGDCASGSISCNGAGGIPPATLAEFTLAPNGGMDFYDVSLVDGFNLPA 126
           C  D+  +F C T DC SG ++CNGAG IPPA+L EFTLA +GG DFYDVSLVDGFNLP 
Sbjct: 79  C-ADTGGKFQCATADCGSGQVTCNGAGAIPPASLLEFTLAASGGQDFYDVSLVDGFNLPL 138

Query: 127 SIATVGGTGECQSTACSANVNGVCPTELQVRSGDGSVIGCKSACLAFNEPQYCCTAEFND 186
            I   GG+G+C++T+C ANVN VCP ELQV+ GDGSVI CKSACLAFN+PQYCCT  F  
Sbjct: 139 GITPQGGSGDCKATSCPANVNSVCPPELQVKGGDGSVIACKSACLAFNQPQYCCTGAFRT 198

Query: 187 PSKCAHSQYSLIFKNQCPQAYSYAYDDKTSTFTCSVVHAATFVVKNNCPQTIWPATLTSG 246
            S                                     ATF + NNC  TIWPA LT G
Sbjct: 199 ES-------------------------------------ATFTLTNNCASTIWPAALT-G 258

Query: 247 SGQSQLSSTGFQLLSGASRTVDVPAPWTGRIWARTRCFVDSSSRFSCETGDCASGSISCN 306
           +G  Q+  TG +L S AS   D+P PW+ R WART+C  D++ +F C TGDCASG I+CN
Sbjct: 259 AGAPQI--TGLELASKASVNFDIPPPWSDRFWARTQCTTDATGKFKCATGDCASGQIACN 318

Query: 307 GAGGIPPATLAEFTLAPNGGMDFYDVSLVDGFNLPASIATVGG-TGECQSTACSANVNGV 366
           GAGG+PP +LAEFTLA N G DFYD+SLVDGFN+P SI   GG    C +  C AN+N  
Sbjct: 319 GAGGVPPVSLAEFTLATNNGQDFYDISLVDGFNVPLSIVPQGGAVNNCSAVTCQANLNAG 378

Query: 367 CPTELQVRSGDGSVIGCKSACLAFNEPQYCCTAEFNDPSKCAHSQYSLIFKNQCPQAYSY 426
           CP ELQV++ DGSV+ C SAC AFN+PQYCCT  F  P  C  + YS  FK QCPQAYSY
Sbjct: 379 CPLELQVKAADGSVVACNSACSAFNQPQYCCTGAFGSPDTCPPTNYSNYFKGQCPQAYSY 418

Query: 427 AYDDKTSTFTCSGSPNYVVTFCP 449
           AYDDK+   +C G PNYV+TFCP
Sbjct: 439 AYDDKSGLASCIGGPNYVITFCP 418

BLAST of MS006573 vs. NCBI nr
Match: RYR71254.1 (hypothetical protein Ahy_A02g005535 isoform A [Arachis hypogaea])

HSP 1 Score: 492.3 bits (1266), Expect = 4.5e-135
Identity = 240/470 (51.06%), Positives = 300/470 (63.83%), Query Frame = 0

Query: 6   VVHAATFVVKNNCPQTIWPATLTSGSGQSQLSSTGFQLLSGASRTVDVPAPWTGRIWART 65
           V   A     NNCP T+WP T  SG+   QLSSTGF+L SGAS T+++P+ W+G+ WART
Sbjct: 19  VGQGAQITFTNNCPYTVWPGT-QSGATSPQLSSTGFELASGASNTLELPSGWSGKFWART 78

Query: 66  RCFVDSSSRFSCETGDCASGSISCNGAGGIPPATLAEFTLA-PNGGMDFYDVSLVDGFNL 125
            C  +++  FSC T DC +  + C GAG   PA+L E T    NG  DFYDVS VDGFN+
Sbjct: 79  GC-SNNNGVFSCTTADCGN-HVECGGAGEATPASLIEITTGNNNGAQDFYDVSNVDGFNV 138

Query: 126 PASIATVGGTGECQSTACSANVNGVCPTELQVRSGDGSVIGCKSACLAFNEPQYCCTAEF 185
           P+S++  GG+G C + +C  N+N  CP ELQ +  D SV+GCKSAC+ FN P+YCC  + 
Sbjct: 139 PSSMSPQGGSGACGTASCPVNINAACPAELQFKGSDKSVVGCKSACVIFNTPEYCCNGDH 198

Query: 186 NDPSKCAHSQYSLIFKNQCPQAYSYAYDDKTSTFTCS----------------------- 245
           N P  C  + YS  F  QCP AYSYAYDDK  TFTCS                       
Sbjct: 199 NTPQTCPPTNYSQFFSQQCPNAYSYAYDDKRGTFTCSGNPSYLTTIMAITRVVLSLSFAF 258

Query: 246 ---VVHAATFVVKNNCPQTIWPATLTSGSGQSQLSSTGFQLLSGASRTVDVPAPWTGRIW 305
              V H A   + N C  T+WP +  + +  +QLS+TGF+L SG S+TVDVPAPW+G+ W
Sbjct: 259 FLCVAHGAQITLTNKCSYTVWPGS-QANANSAQLSTTGFELPSGQSKTVDVPAPWSGKFW 318

Query: 306 ARTRCFVDSSSRFSCETGDCASGSISCNGAGGIPPATLAEFTLAPNGGMDFYDVSLVDGF 365
           ART C  +++  FSC T DC +  + C+GAG   PA+L EFT+A NGG DFYDVS VDGF
Sbjct: 319 ARTGC-SNNNGVFSCATADCGN-HLECSGAGEATPASLMEFTIASNGGQDFYDVSNVDGF 378

Query: 366 NLPASIATVGGTGECQSTACSANVNGVCPTELQVRSGDGSVIGCKSACLAFNEPQYCCTA 425
           N+P+SI   GG+G C   +C AN+N  CP  LQ +  DGSVIGCKSAC+ F  P+YCCT 
Sbjct: 379 NVPSSITPQGGSGTCNVASCPANINAACPAALQFKGSDGSVIGCKSACVEFGTPEYCCTG 438

Query: 426 EFNDPSKCAHSQYSLIFKNQCPQAYSYAYDDKTSTFTCSGSPNYVVTFCP 449
           + N P+ C  + YS  F NQCP AYSYAYDDK  TFTCSGSPNY + FCP
Sbjct: 439 DHNTPATCPATNYSEFFSNQCPNAYSYAYDDKRGTFTCSGSPNYAINFCP 482

BLAST of MS006573 vs. NCBI nr
Match: RZC77122.1 (hypothetical protein C5167_001310 [Papaver somniferum])

HSP 1 Score: 481.5 bits (1238), Expect = 8.0e-132
Identity = 243/400 (60.75%), Positives = 270/400 (67.50%), Query Frame = 0

Query: 15  KNNCPQTIWPATLTSGSGQSQLSSTGFQLLSGASRTVDVPAPWTGRIWARTRCFVDSSSR 74
           KNNCP T+WP TLT GSG SQLS TGF+L SGAS +VD PA W+GR WART C   SS R
Sbjct: 26  KNNCPYTVWPGTLT-GSGGSQLSKTGFELASGASSSVDAPAGWSGRFWARTGCSTGSSGR 85

Query: 75  FSCETGDCASGSISCNGAGGIPPATLAEFTLAPNGGMDFYDVSLVDGFNLPASIATVGGT 134
            +C T DCASG+  CNGAG IPPATL EFTL  +GG DFYDVS VDGFNLPASI   GG 
Sbjct: 86  LTCATADCASGAAECNGAGAIPPATLLEFTLNGDGGKDFYDVSNVDGFNLPASITPKGG- 145

Query: 135 GECQSTACSANVNGVCPTELQVRSGDGSVIGCKSACLAFNEPQYCCTAEFNDPSKCAHSQ 194
             C ST C +N+N VCP EL V+   GSV+ CKSACLA  +PQYCCT  FN    C  + 
Sbjct: 146 --CSSTECRSNINSVCPPELSVKDAGGSVVACKSACLALQQPQYCCTGSFNTAETCPPTN 205

Query: 195 YSLIFKNQCPQAYSYAYDDKTSTFTCSVVHAATFVVK--NNCPQTIWPATLTSGSGQSQL 254
           YS IFK+ CPQAYSYAYDD++STFTC+    +  ++      P          G   SQL
Sbjct: 206 YSKIFKDACPQAYSYAYDDRSSTFTCAAGGKSFHLLSTYKQLPIHSMARHFDWGWIHSQL 265

Query: 255 SSTGFQLLSGASRTVDVPAPWTGRIWARTRCFVDSSSRFSCETGDCASGSISCNGAGGIP 314
           S TGF L  GAS +VD PA W+GR WART C  DSS R  C T DCASG + CNGAG IP
Sbjct: 266 SRTGFVLAPGASSSVDAPAGWSGRFWARTGCTTDSSGRLKCATADCASGRVECNGAGAIP 325

Query: 315 PATLAEFTLAPNGGMDFYDVSLVDGFNLPASIATVGGTGECQSTACSANVNGVCPTELQV 374
           PATL EFTL  NGG DFYDVSLVDGFNLPASI      G C ST C  NVN VCP +L V
Sbjct: 326 PATLLEFTLNGNGGQDFYDVSLVDGFNLPASITP--QRGGCSSTTCQRNVNSVCPQDLSV 385

Query: 375 RSGDGSVIGCKSACLAFNEPQYCCTAEFNDPSKCAHSQYS 413
           R   GSVI CKSAC AF +PQYCCT  FN P+ C  + YS
Sbjct: 386 RDAGGSVIACKSACEAFRQPQYCCTGSFNTPATCPPTDYS 419

BLAST of MS006573 vs. NCBI nr
Match: XP_019420884.1 (PREDICTED: uncharacterized protein LOC109331061 isoform X2 [Lupinus angustifolius])

HSP 1 Score: 480.7 bits (1236), Expect = 1.4e-131
Identity = 241/454 (53.08%), Positives = 298/454 (65.64%), Query Frame = 0

Query: 8   HAATFVVKNNCPQTIWPATLTSGSGQSQLSSTGFQLLSGASRTVDVPAPWTGRIWARTRC 67
           ++ TF + N C  T+WP  LT G+G   LS+TGF L  G S T+ +PA W+GRIW RT C
Sbjct: 24  YSTTFNIVNKCSYTVWPGILT-GAGTPPLSTTGFTLAPGESNTIAIPAAWSGRIWGRTLC 83

Query: 68  FVDSSS-RFSCETGDCASGSISCNGAGGIPPATLAEFTLAPNGGMDFYDVSLVDGFNLPA 127
             D+++ +FSC TGDC S +  C G G  PPATLAEFTL   GG+DFYDVS VDG+NLP 
Sbjct: 84  SQDTATGKFSCITGDCDSSTEECAGGGAAPPATLAEFTLNGAGGLDFYDVSFVDGYNLPI 143

Query: 128 SIATVGGT--GECQSTACSANVNGVCPTELQ-VRSGDG-SVIGCKSACLAFNEPQYCCTA 187
            +   GGT  G C +T C  ++N  CPTEL+ V SG+G   + CKSAC AF +PQYCC+ 
Sbjct: 144 KVEPQGGTGAGNCTATGCVVDLNAGCPTELRVVNSGNGEESVACKSACEAFGDPQYCCSG 203

Query: 188 EFNDPSKCAHSQYSLIFKNQCPQAYSYAYDDKTSTFTCSVV--------HAATFVVKNNC 247
            +  P  C  S YS  FK+ CP AYSYAYDD TSTFTC+            ATF   N C
Sbjct: 204 AYATPETCKPSSYSQFFKSACPLAYSYAYDDGTSTFTCASADYIVTFCPEPATFTFMNKC 263

Query: 248 PQTIWPATLTSGSGQSQLSSTGFQLLSGASRTVDVPAPWTGRIWARTRCFVDSSSRFSCE 307
             T+WP  L    G+  L +TGF+L  G SR+   PA W+GR WART C  D S R +C 
Sbjct: 264 DYTVWPGIL----GKPDLGTTGFELTEGTSRSFQAPAGWSGRFWARTNCKFDDSGRGTCA 323

Query: 308 TGDCASGSISCNGAGGIPPATLAEFTLAPNGGMDFYDVSLVDGFNLPASIATVGGTGECQ 367
           T DC SG I+CNGAG  PPATLAEFTL   G MD+YDVSLVDG+NLP  +A  GG+G C 
Sbjct: 324 TADCGSGDINCNGAGASPPATLAEFTLGA-GSMDYYDVSLVDGYNLPIMVAASGGSGSCA 383

Query: 368 STACSANVNGVCPTELQVRSGDGSVIGCKSACLAFNEPQYCCTAEFNDPSKCAHSQYSLI 427
           +T C  ++N  CP+EL+V  GD     CKSAC AF + +YCC  EF++PS C  S YS +
Sbjct: 384 TTGCGVDLNQQCPSELRVEGGD----ACKSACEAFGKAEYCCNGEFSNPSTCKPSVYSQM 443

Query: 428 FKNQCPQAYSYAYDDKTSTFTCSGSPNYVVTFCP 449
           FK+ CP++YSYAYDD TSTFTC+G+ +Y +TFCP
Sbjct: 444 FKSACPKSYSYAYDDATSTFTCTGA-DYTITFCP 466

BLAST of MS006573 vs. ExPASy Swiss-Prot
Match: Q9FSG7 (Thaumatin-like protein 1a OS=Malus domestica OX=3750 GN=TL1 PE=1 SV=1)

HSP 1 Score: 322.0 bits (824), Expect = 1.1e-86
Identity = 146/228 (64.04%), Positives = 174/228 (76.32%), Query Frame = 0

Query: 221 SVVHAATFVVKNNCPQTIWPATLTSGSGQSQLSSTGFQLLSGASRTVDVPAPWTGRIWAR 280
           S  HAA     NNCP T+WP TLT G  + QLS TGF+L S ASR+VD P+PW+GR W R
Sbjct: 20  SGAHAAKITFTNNCPNTVWPGTLT-GDQKPQLSLTGFELASKASRSVDAPSPWSGRFWGR 79

Query: 281 TRCFVDSSSRFSCETGDCASGSISCNGAGGIPPATLAEFTLAPNGGMDFYDVSLVDGFNL 340
           TRC  D++ +F+CET DC SG ++CNGAG +PPATL E T+A NGG D+YDVSLVDGFNL
Sbjct: 80  TRCSTDAAGKFTCETADCGSGQVACNGAGAVPPATLVEITIAANGGQDYYDVSLVDGFNL 139

Query: 341 PASIATVGGTGECQSTACSANVNGVCPTELQVRSGDGSVIGCKSACLAFNEPQYCCTAEF 400
           P S+A  GGTGEC+ ++C ANVN VCP  LQV++ DGSVI CKSACLAF + +YCCT   
Sbjct: 140 PMSVAPQGGTGECKPSSCPANVNKVCPAPLQVKAADGSVISCKSACLAFGDSKYCCTPPN 199

Query: 401 NDPSKCAHSQYSLIFKNQCPQAYSYAYDDKTSTFTCSGSPNYVVTFCP 449
           N P  C  ++YS IF+ QCPQAYSYAYDDK STFTCSG P+YV+TFCP
Sbjct: 200 NTPETCPPTEYSEIFEKQCPQAYSYAYDDKNSTFTCSGGPDYVITFCP 246

BLAST of MS006573 vs. ExPASy Swiss-Prot
Match: O80327 (Thaumatin-like protein 1 OS=Pyrus pyrifolia OX=3767 GN=TL1 PE=1 SV=1)

HSP 1 Score: 320.9 bits (821), Expect = 2.4e-86
Identity = 145/226 (64.16%), Positives = 170/226 (75.22%), Query Frame = 0

Query: 223 VHAATFVVKNNCPQTIWPATLTSGSGQSQLSSTGFQLLSGASRTVDVPAPWTGRIWARTR 282
           V++A F   N CP T+WP TLT G G  QL STGF+L SGAS ++ V APW+GR W R+ 
Sbjct: 20  VYSAKFTFTNKCPNTVWPGTLTGGGG-PQLLSTGFELASGASTSLTVQAPWSGRFWGRSH 79

Query: 283 CFVDSSSRFSCETGDCASGSISCNGAGGIPPATLAEFTLAPNGGMDFYDVSLVDGFNLPA 342
           C +DSS +F C TGDC SG ISCNGAG  PPA+L E TLA NGG DFYDVSLVDGFNLP 
Sbjct: 80  CSIDSSGKFKCSTGDCGSGQISCNGAGASPPASLVELTLATNGGQDFYDVSLVDGFNLPI 139

Query: 343 SIATVGGTGECQSTACSANVNGVCPTELQVRSGDGSVIGCKSACLAFNEPQYCCTAEFND 402
            +A  GG+G+C ST+C+AN+N VCP EL  +  DGSVIGCKSACLA N+PQYCCT  +  
Sbjct: 140 KLAPRGGSGDCNSTSCAANINTVCPAELSDKGSDGSVIGCKSACLALNQPQYCCTGAYGT 199

Query: 403 PSKCAHSQYSLIFKNQCPQAYSYAYDDKTSTFTCSGSPNYVVTFCP 449
           P  C  + +S +FKNQCPQAYSYAYDDK+STFTC G PNY +TFCP
Sbjct: 200 PDTCPPTDFSKVFKNQCPQAYSYAYDDKSSTFTCFGGPNYEITFCP 244

BLAST of MS006573 vs. ExPASy Swiss-Prot
Match: Q9SMH2 (Thaumatin-like protein 1 OS=Castanea sativa OX=21020 GN=TL1 PE=2 SV=1)

HSP 1 Score: 310.5 bits (794), Expect = 3.2e-83
Identity = 145/231 (62.77%), Positives = 174/231 (75.32%), Query Frame = 0

Query: 218 FTCSVVHAATFVVKNNCPQTIWPATLTSGSGQSQLSSTGFQLLSGASRTVDVPAPWTGRI 277
           F  S  H+A     NNCP+TIWP TLTS   + QL +TGF L S AS T+ V APW GR 
Sbjct: 15  FFLSGAHSAKITFTNNCPRTIWPGTLTSDQ-KPQLPNTGFVLASKASLTLGVQAPWKGRF 74

Query: 278 WARTRCFVDSSSRFSCETGDCASGSISCNGAGGIPPATLAEFTLAPNGGMDFYDVSLVDG 337
           WARTRC   +S +F+CET DC++G ++CNG G IPPA+L E  +A N GMDFYDVSLVDG
Sbjct: 75  WARTRC-TTNSGKFTCETADCSTGQVACNGNGAIPPASLVEINIAANRGMDFYDVSLVDG 134

Query: 338 FNLPASIATVGGTGECQSTACSANVNGVCPTELQVRSGDGSVIGCKSACLAFNEPQYCCT 397
           +NLP S+AT GGTG+C++T+C ANVN VCP ELQV+  D SV+ CKSAC AFN+PQYCCT
Sbjct: 135 YNLPVSVATRGGTGDCKATSCRANVNAVCPAELQVKGSDASVLACKSACTAFNQPQYCCT 194

Query: 398 AEFNDPSKCAHSQYSLIFKNQCPQAYSYAYDDKTSTFTCSGSPNYVVTFCP 449
             F+    C  ++YS IFK QCPQAYSYAYDD TSTFTCSG+P+YV+TFCP
Sbjct: 195 GAFDTARTCPATKYSRIFKQQCPQAYSYAYDDSTSTFTCSGAPDYVITFCP 243

BLAST of MS006573 vs. ExPASy Swiss-Prot
Match: P83332 (Thaumatin-like protein 1 OS=Prunus persica OX=3760 PE=2 SV=1)

HSP 1 Score: 306.6 bits (784), Expect = 4.6e-82
Identity = 140/228 (61.40%), Positives = 168/228 (73.68%), Query Frame = 0

Query: 221 SVVHAATFVVKNNCPQTIWPATLTSGSGQSQLSSTGFQLLSGASRTVDVPAPWTGRIWAR 280
           S  HAA     N C  T+WP TLT G  + QLS TGF+L +G SR+VD P+PW+GR + R
Sbjct: 20  SGAHAAKITFTNKCSYTVWPGTLT-GDQKPQLSLTGFELATGISRSVDAPSPWSGRFFGR 79

Query: 281 TRCFVDSSSRFSCETGDCASGSISCNGAGGIPPATLAEFTLAPNGGMDFYDVSLVDGFNL 340
           TRC  D+S +F+C T DC SG +SCNG G  PPATL E T+A NGG DFYDVSLVDGFNL
Sbjct: 80  TRCSTDASGKFTCATADCGSGQVSCNGNGAAPPATLVEITIASNGGQDFYDVSLVDGFNL 139

Query: 341 PASIATVGGTGECQSTACSANVNGVCPTELQVRSGDGSVIGCKSACLAFNEPQYCCTAEF 400
           P S+A  GGTG+C+++ C A++N VCP  LQV+  DGSVI CKSACLAFN+P+YCCT   
Sbjct: 140 PMSVAPQGGTGKCKASTCPADINKVCPAPLQVKGSDGSVIACKSACLAFNQPKYCCTPPN 199

Query: 401 NDPSKCAHSQYSLIFKNQCPQAYSYAYDDKTSTFTCSGSPNYVVTFCP 449
           + P  C    YS +FK QCPQAYSYAYDDK+STFTCSG P Y++TFCP
Sbjct: 200 DKPETCPPPDYSKLFKTQCPQAYSYAYDDKSSTFTCSGRPAYLITFCP 246

BLAST of MS006573 vs. ExPASy Swiss-Prot
Match: P50694 (Glucan endo-1,3-beta-glucosidase OS=Prunus avium OX=42229 PE=1 SV=1)

HSP 1 Score: 301.6 bits (771), Expect = 1.5e-80
Identity = 138/225 (61.33%), Positives = 159/225 (70.67%), Query Frame = 0

Query: 224 HAATFVVKNNCPQTIWPATLTSGSGQSQLSSTGFQLLSGASRTVDVPAPWTGRIWARTRC 283
           HAAT   KNNCP  +WP TLTS   + QLS+TGF+L S AS  +D P PW GR WART C
Sbjct: 22  HAATISFKNNCPYMVWPGTLTSDQ-KPQLSTTGFELASQASFQLDTPVPWNGRFWARTGC 81

Query: 284 FVDSSSRFSCETGDCASGSISCNGAGGIPPATLAEFTLAPNGGMDFYDVSLVDGFNLPAS 343
             D+S +F C T DCASG + CNG G IPPATLAEF +   GG DFYDVSLVDGFNLP S
Sbjct: 82  STDASGKFVCATADCASGQVMCNGNGAIPPATLAEFNIPAGGGQDFYDVSLVDGFNLPMS 141

Query: 344 IATVGGTGECQSTACSANVNGVCPTELQVRSGDGSVIGCKSACLAFNEPQYCCTAEFNDP 403
           +   GGTG+C++ +C ANVN VCP+ELQ +  DGSV+ C SAC+ F  PQYCCT   N P
Sbjct: 142 VTPQGGTGDCKTASCPANVNAVCPSELQKKGSDGSVVACLSACVKFGTPQYCCTPPQNTP 201

Query: 404 SKCAHSQYSLIFKNQCPQAYSYAYDDKTSTFTCSGSPNYVVTFCP 449
             C  + YS IF N CP AYSYAYDDK  TFTC+G PNY +TFCP
Sbjct: 202 ETCPPTNYSEIFHNACPDAYSYAYDDKRGTFTCNGGPNYAITFCP 245

BLAST of MS006573 vs. ExPASy TrEMBL
Match: A0A7N2M7N9 (Uncharacterized protein OS=Quercus lobata OX=97700 PE=3 SV=1)

HSP 1 Score: 548.1 bits (1411), Expect = 3.4e-152
Identity = 271/494 (54.86%), Positives = 330/494 (66.80%), Query Frame = 0

Query: 5   TVVHAATFVVKNNCPQTIWPATLTSGSGQSQLSSTGFQLLSGASRTVDVPAPWTGRIWAR 64
           T  H+A     NNCP+TIWP TLTS   + QLS+TGF+LLS AS T+DV APW GR WAR
Sbjct: 441 TGAHSAKITFTNNCPKTIWPGTLTSDQ-KPQLSNTGFELLSKASLTLDVQAPWKGRFWAR 500

Query: 65  TRCFVDSSSRFSCETGDCASGSISCNGAGGIPPATLAEFTLAPNGGMDFYDVSLVDGFNL 124
           T+C  +S++ F+CET DC +G ++CNG G IPPA+L E  +A N GMD+YDVSLVDGFNL
Sbjct: 501 TQCSTNSTN-FTCETADCGTGRVACNGTGAIPPASLVEINIADNRGMDYYDVSLVDGFNL 560

Query: 125 PASIATVGGTGECQSTACSANVNGVCPTELQVRSGDGSVIGCKSACLAFNEPQYCCTAEF 184
           P S+AT GGTG+C++++C A+VN VCP ELQV   DGSV+ CKSAC AFN+PQYCCT  F
Sbjct: 561 PVSVATRGGTGDCKNSSCPADVNAVCPGELQVTGSDGSVLACKSACTAFNQPQYCCTGAF 620

Query: 185 NDPSKCAHSQYSLIFKN-------------------------QCPQAYSYAYDD------ 244
           + P  C  ++YS+   +                         +C Q  S  Y        
Sbjct: 621 STPLTCPTTKYSVNLNHGHNVEVRIEPIGGTLVNGSGPCPIVECVQDISNVYQSNLLATN 680

Query: 245 ------KTSTFTCSVV-------------HAATFVVKNNCPQTIWPATLTSGSGQSQLSS 304
                 KT+  T   V             H+A     NNCP T+WP TLTS   + QLS+
Sbjct: 681 KSWVAGKTTAETMVAVDGEAADNRGSFGAHSARITFTNNCPYTVWPGTLTSDQ-KPQLST 740

Query: 305 TGFQLLSGASRTVDVPAPWTGRIWARTRCFVDSSSRFSCETGDCASGSISCNGAGGIPPA 364
           TGF+L S AS  +DV APW GR WART C  DSS +FSC T +C+SG +SCNG G +PPA
Sbjct: 741 TGFELASTASSAIDVQAPWKGRFWARTLCSTDSSGKFSCATAECSSGQVSCNGNGAVPPA 800

Query: 365 TLAEFTLAPNGGMDFYDVSLVDGFNLPASIATVGGTGECQSTACSANVNGVCPTELQVRS 424
           +L E  +A +GGMDFYDVSLVDGFNLP S+AT GGTGEC++++C ANVN  CP ELQV+ 
Sbjct: 801 SLVEINIAADGGMDFYDVSLVDGFNLPVSVATQGGTGECKASSCPANVNAACPAELQVKG 860

Query: 425 GDGSVIGCKSACLAFNEPQYCCTAEFNDPSKCAHSQYSLIFKNQCPQAYSYAYDDKTSTF 449
            DGSVI CKSAC AFN+PQYCCT   N P  C  + YS IF+NQCPQAYSYAYDD+ STF
Sbjct: 861 SDGSVIACKSACTAFNQPQYCCTGANNTPQTCPPTDYSRIFENQCPQAYSYAYDDQNSTF 920

BLAST of MS006573 vs. ExPASy TrEMBL
Match: A0A371HMT6 (Uncharacterized protein (Fragment) OS=Mucuna pruriens OX=157652 GN=CR513_12220 PE=3 SV=1)

HSP 1 Score: 541.6 bits (1394), Expect = 3.1e-150
Identity = 253/443 (57.11%), Positives = 315/443 (71.11%), Query Frame = 0

Query: 16  NNCPQTIWPATLTSGSGQSQLSSTGFQLLSGASRTVDVPAPWTGRIWARTRCFVDSSSRF 75
           N C  T+WP TLT G  + QLS+TGF+L SGAS +VD+P+PW+GR WART C   S++ F
Sbjct: 10  NKCTYTVWPGTLT-GDQKPQLSTTGFELDSGASNSVDLPSPWSGRFWARTGC--SSNNGF 69

Query: 76  SCETGDCASGSISCNGAGGIPPATLAEFTLAPNGGMDFYDVSLVDGFNLPASIATVGGTG 135
           SC TGDCASG + CNGAGG PPATL E T+A NGG DFYDVS VDGFN+P SI   GG+G
Sbjct: 70  SCATGDCASGQVMCNGAGGNPPATLVEITVAANGGQDFYDVSNVDGFNVPVSITPQGGSG 129

Query: 136 ECQSTACSANVNGVCPTELQVRSGDGSVIGCKSACLAFNEPQYCCTAEFNDPSKCAHSQY 195
            C++++C + +N VCP +LQV+  DGSVI CKSACLAF   QYCCT + N    C  + Y
Sbjct: 130 GCKTSSCPSKINTVCPAQLQVKGSDGSVIACKSACLAFGGDQYCCTGDHNTAQTCPPTNY 189

Query: 196 SLIFKNQCPQAYSYAYDDKTSTFTCS----------VVHAATFVVKNNCPQTIWPATLTS 255
           S  F  QCP AYSYAYDDK  TFTCS          V   A     N CP T+WP TLT 
Sbjct: 190 SQFFSQQCPDAYSYAYDDKHGTFTCSTHDFGCILNAVSQGAKVTFTNKCPYTVWPGTLT- 249

Query: 256 GSGQSQLSSTGFQLLSGASRTVDVPAPWTGRIWARTRCFVDSSSRFSCETGDCASGSISC 315
           G  + QLS +GF+L +GAS +VD+P+PW+GR W RT C  ++  +FSC T DC SG ++C
Sbjct: 250 GDQKPQLSKSGFELATGASDSVDLPSPWSGRFWTRTGC-SNNGGKFSCATADCGSGQVAC 309

Query: 316 NGAGGIPPATLAEFTLAPNGGMDFYDVSLVDGFNLPASIATVGGTGECQSTACSANVNGV 375
           NGAG  PPATL E T+A NGG D+YDVS VDGFN+P S++  GG+G+C++++C  N+N  
Sbjct: 310 NGAGANPPATLVEITVAENGGQDYYDVSNVDGFNVPMSVSPQGGSGDCKTSSCPKNINEA 369

Query: 376 CPTELQVRSGDGSVIGCKSACLAFNEPQYCCTAEFNDPSKCAHSQYSLIFKNQCPQAYSY 435
           CP ELQ++  DG+VIGCKSACL FN+PQYCCT + + P  C  + +S  F+ QC +AYSY
Sbjct: 370 CPAELQLKGSDGNVIGCKSACLVFNQPQYCCTGDHDKPETCPPTNFSQFFEQQCSEAYSY 429

Query: 436 AYDDKTSTFTCSGSPNYVVTFCP 449
           AYDDK STFTCS  P+YV+TFCP
Sbjct: 430 AYDDKNSTFTCSNRPDYVITFCP 447

BLAST of MS006573 vs. ExPASy TrEMBL
Match: A0A6N2MKB6 (Uncharacterized protein OS=Salix viminalis OX=40686 GN=SVIM_LOCUS383471 PE=3 SV=1)

HSP 1 Score: 538.9 bits (1387), Expect = 2.0e-149
Identity = 267/449 (59.47%), Positives = 320/449 (71.27%), Query Frame = 0

Query: 9   AATFVVKNNCPQTIWPATLTSGSGQSQLSSTGFQLLSGASRTVDVPAPWTGRIWARTRCF 68
           + TF   N CP T+WP TLT+ +G+ QLSSTGF L +GAS ++           ART+C 
Sbjct: 39  SVTFSFTNKCPYTVWPGTLTA-AGRPQLSSTGFTLATGASFSLSA----LQHGLARTQC- 98

Query: 69  VDSSSRFSCETGDCASGSISCNGAGGIPPATLAEFTLAPNGGMDFYDVSLVDGFNLPASI 128
             +S +F C T DCASG I CNGAG IPPA+LAEFTL  +GG DFYD+SLVDGFN+P SI
Sbjct: 99  -STSGKFVCATADCASGVIECNGAGAIPPASLAEFTLRGDGGKDFYDISLVDGFNIPISI 158

Query: 129 ATVGGTGECQSTACSANVNGVCPTELQVRSGDGSVIGCKSACLAFNEPQYCCTAEFNDPS 188
              GG+G CQST+C+ANVN VC   L VR  DG+VI CKSAC AFN+PQYCCT  ++ P 
Sbjct: 159 TPQGGSG-CQSTSCAANVNAVCDPSLAVRGADGNVIACKSACAAFNQPQYCCTGAYSTPE 218

Query: 189 KCAHSQYSLIFKNQCPQAYSYAYDDKTSTFTCSV---------VHAATFVVKNNCPQTIW 248
            C  +QYS+ FK +CPQAYSYAYDD++STFTC V           + TF   N CP T+W
Sbjct: 219 TCPPTQYSMTFKQKCPQAYSYAYDDRSSTFTCPVAIDPCTHGGAQSVTFSFTNKCPYTVW 278

Query: 249 PATLTSGSGQSQLSSTGFQLLSGASRTVDVPAPWTGRIWARTRCFVDSSSRFSCETGDCA 308
           P TLT+ +G+ QLSSTGF L +GAS ++  PA W+GR WART+C   +S +F C T DCA
Sbjct: 279 PGTLTA-AGRPQLSSTGFTLATGASFSLSAPATWSGRFWARTQC--STSGKFVCATADCA 338

Query: 309 SGSISCNGAGGIPPATLAEFTLAPNGGMDFYDVSLVDGFNLPASIATVGGTGECQSTACS 368
           SG I CNGAG IPPA+LAEFTL  +GG DFYD+SLVDGFN+P SI   GG   CQST+C+
Sbjct: 339 SGVIQCNGAGAIPPASLAEFTLRGDGGKDFYDISLVDGFNIPISITPQGGGSGCQSTSCA 398

Query: 369 ANVNGVCPTELQVRSGDGSVIGCKSACLAFNEPQYCCTAEFNDPSKCAHSQYSLIFKNQC 428
           ANVN VC   L VR  DG+VI CKSAC+AFN+PQYCCT   N P  C  +QYS+ FK QC
Sbjct: 399 ANVNAVCDPSLAVRGADGTVIACKSACVAFNQPQYCCTGPNNTPETCPPTQYSMTFKQQC 458

Query: 429 PQAYSYAYDDKTSTFTCSGSPNYVVTFCP 449
           PQAYSYAYDDK+STFTC    NY++TFCP
Sbjct: 459 PQAYSYAYDDKSSTFTCPSGGNYLITFCP 476

BLAST of MS006573 vs. ExPASy TrEMBL
Match: A0A1R3HTS8 (Thaumatin OS=Corchorus olitorius OX=93759 GN=COLO4_26903 PE=4 SV=1)

HSP 1 Score: 510.8 bits (1314), Expect = 5.9e-141
Identity = 250/443 (56.43%), Positives = 296/443 (66.82%), Query Frame = 0

Query: 7   VHAATFVVKNNCPQTIWPATLTSGSGQSQLSSTGFQLLSGASRTVDVPAPWTGRIWARTR 66
           V +ATF   NNCP ++WP  L SG G  Q SSTGF+L S AS T+D+ APW+GRIW RT+
Sbjct: 19  VESATFTFTNNCPYSVWPGIL-SGQG-PQPSSTGFELASKASYTLDISAPWSGRIWGRTQ 78

Query: 67  CFVDSSSRFSCETGDCASGSISCNGAGGIPPATLAEFTLAPNGGMDFYDVSLVDGFNLPA 126
           C  D+  +F C T DC SG ++CNGAG IPPA+L EFTLA +GG DFYDVSLVDGFNLP 
Sbjct: 79  C-ADTGGKFQCATADCGSGQVTCNGAGAIPPASLLEFTLAASGGQDFYDVSLVDGFNLPL 138

Query: 127 SIATVGGTGECQSTACSANVNGVCPTELQVRSGDGSVIGCKSACLAFNEPQYCCTAEFND 186
            I   GG+G+C++T+C ANVN VCP ELQV+ GDGSVI CKSACLAFN+PQYCCT  F  
Sbjct: 139 GITPQGGSGDCKATSCPANVNSVCPPELQVKGGDGSVIACKSACLAFNQPQYCCTGAFRT 198

Query: 187 PSKCAHSQYSLIFKNQCPQAYSYAYDDKTSTFTCSVVHAATFVVKNNCPQTIWPATLTSG 246
            S                                     ATF + NNC  TIWPA LT G
Sbjct: 199 ES-------------------------------------ATFTLTNNCASTIWPAALT-G 258

Query: 247 SGQSQLSSTGFQLLSGASRTVDVPAPWTGRIWARTRCFVDSSSRFSCETGDCASGSISCN 306
           +G  Q+  TG +L S AS   D+P PW+ R WART+C  D++ +F C TGDCASG I+CN
Sbjct: 259 AGAPQI--TGLELASKASVNFDIPPPWSDRFWARTQCTTDATGKFKCATGDCASGQIACN 318

Query: 307 GAGGIPPATLAEFTLAPNGGMDFYDVSLVDGFNLPASIATVGG-TGECQSTACSANVNGV 366
           GAGG+PP +LAEFTLA N G DFYD+SLVDGFN+P SI   GG    C +  C AN+N  
Sbjct: 319 GAGGVPPVSLAEFTLATNNGQDFYDISLVDGFNVPLSIVPQGGAVNNCSAVTCQANLNAG 378

Query: 367 CPTELQVRSGDGSVIGCKSACLAFNEPQYCCTAEFNDPSKCAHSQYSLIFKNQCPQAYSY 426
           CP ELQV++ DGSV+ C SAC AFN+PQYCCT  F  P  C  + YS  FK QCPQAYSY
Sbjct: 379 CPLELQVKAADGSVVACNSACSAFNQPQYCCTGAFGSPDTCPPTNYSNYFKGQCPQAYSY 418

Query: 427 AYDDKTSTFTCSGSPNYVVTFCP 449
           AYDDK+   +C G PNYV+TFCP
Sbjct: 439 AYDDKSGLASCIGGPNYVITFCP 418

BLAST of MS006573 vs. ExPASy TrEMBL
Match: A0A445E6Y1 (Uncharacterized protein OS=Arachis hypogaea OX=3818 GN=Ahy_A02g005535 PE=3 SV=1)

HSP 1 Score: 492.3 bits (1266), Expect = 2.2e-135
Identity = 240/470 (51.06%), Positives = 300/470 (63.83%), Query Frame = 0

Query: 6   VVHAATFVVKNNCPQTIWPATLTSGSGQSQLSSTGFQLLSGASRTVDVPAPWTGRIWART 65
           V   A     NNCP T+WP T  SG+   QLSSTGF+L SGAS T+++P+ W+G+ WART
Sbjct: 19  VGQGAQITFTNNCPYTVWPGT-QSGATSPQLSSTGFELASGASNTLELPSGWSGKFWART 78

Query: 66  RCFVDSSSRFSCETGDCASGSISCNGAGGIPPATLAEFTLA-PNGGMDFYDVSLVDGFNL 125
            C  +++  FSC T DC +  + C GAG   PA+L E T    NG  DFYDVS VDGFN+
Sbjct: 79  GC-SNNNGVFSCTTADCGN-HVECGGAGEATPASLIEITTGNNNGAQDFYDVSNVDGFNV 138

Query: 126 PASIATVGGTGECQSTACSANVNGVCPTELQVRSGDGSVIGCKSACLAFNEPQYCCTAEF 185
           P+S++  GG+G C + +C  N+N  CP ELQ +  D SV+GCKSAC+ FN P+YCC  + 
Sbjct: 139 PSSMSPQGGSGACGTASCPVNINAACPAELQFKGSDKSVVGCKSACVIFNTPEYCCNGDH 198

Query: 186 NDPSKCAHSQYSLIFKNQCPQAYSYAYDDKTSTFTCS----------------------- 245
           N P  C  + YS  F  QCP AYSYAYDDK  TFTCS                       
Sbjct: 199 NTPQTCPPTNYSQFFSQQCPNAYSYAYDDKRGTFTCSGNPSYLTTIMAITRVVLSLSFAF 258

Query: 246 ---VVHAATFVVKNNCPQTIWPATLTSGSGQSQLSSTGFQLLSGASRTVDVPAPWTGRIW 305
              V H A   + N C  T+WP +  + +  +QLS+TGF+L SG S+TVDVPAPW+G+ W
Sbjct: 259 FLCVAHGAQITLTNKCSYTVWPGS-QANANSAQLSTTGFELPSGQSKTVDVPAPWSGKFW 318

Query: 306 ARTRCFVDSSSRFSCETGDCASGSISCNGAGGIPPATLAEFTLAPNGGMDFYDVSLVDGF 365
           ART C  +++  FSC T DC +  + C+GAG   PA+L EFT+A NGG DFYDVS VDGF
Sbjct: 319 ARTGC-SNNNGVFSCATADCGN-HLECSGAGEATPASLMEFTIASNGGQDFYDVSNVDGF 378

Query: 366 NLPASIATVGGTGECQSTACSANVNGVCPTELQVRSGDGSVIGCKSACLAFNEPQYCCTA 425
           N+P+SI   GG+G C   +C AN+N  CP  LQ +  DGSVIGCKSAC+ F  P+YCCT 
Sbjct: 379 NVPSSITPQGGSGTCNVASCPANINAACPAALQFKGSDGSVIGCKSACVEFGTPEYCCTG 438

Query: 426 EFNDPSKCAHSQYSLIFKNQCPQAYSYAYDDKTSTFTCSGSPNYVVTFCP 449
           + N P+ C  + YS  F NQCP AYSYAYDDK  TFTCSGSPNY + FCP
Sbjct: 439 DHNTPATCPATNYSEFFSNQCPNAYSYAYDDKRGTFTCSGSPNYAINFCP 482

BLAST of MS006573 vs. TAIR 10
Match: AT4G18250.1 (receptor serine/threonine kinase, putative )

HSP 1 Score: 307.0 bits (785), Expect = 2.5e-83
Identity = 171/446 (38.34%), Positives = 238/446 (53.36%), Query Frame = 0

Query: 6   VVHAATFVVKNNCPQTIWPATLTSGSGQSQLSSTGFQLLSGASRTVDVPAPWTGRIWART 65
           V+  +   ++N C  T+WP  + S +  SQ+S TGF L  G +R +  P+ W G I ART
Sbjct: 6   VLSMSILTIENKCNHTVWP-VIFSWNVDSQVSPTGFALRRGEARALQAPSSWYGLISART 65

Query: 66  RCFVDSSSRFSCETGDCASGSISCNGAGGIPPATLAEFTLAPNGGMDFYDVSLVDGFNLP 125
            C +DS+  FSC TGDC SG+I C G  G  P T   F       M+ Y +S+  G+NLP
Sbjct: 66  LCSIDSTGTFSCATGDCESGTIECPGNYGWAPVTYVYFR------MNSYTISVEYGYNLP 125

Query: 126 ASIATVGGTGECQSTACSANVNGVCPTELQVRSGDGSVIGCKSACLAFNEPQYCCTAEFN 185
             +     +  C S  C   +   CP +L   S + +++ C S C+ F+ P+ CCT +F 
Sbjct: 126 LMVVPSQRSRTCISAGCVVELKKTCPKDLMKMSRE-NLVACSSTCMEFDTPEACCTRDFK 185

Query: 186 DPSKCAHSQYSLIFKNQCPQAYSYAYDDKTSTFTC--SVVHAATFV-VKNNCPQTIWPAT 245
               C  + Y+  F+  CP A+ YAYDD  ST TC  S  +  T + ++N C  TIWP  
Sbjct: 186 SKQNCKPTVYTQNFERACPLAHIYAYDDNNSTVTCLNSTDYVITIITIENKCNNTIWPVI 245

Query: 246 LTSGSGQSQLSSTGFQLLSGASRTVDVPAPWTGRIWARTRCFVDSSSRFSCETGDCASGS 305
               S +SQ+S+TGF L +G  R ++ P+ W G I ART C  DS+  FSC TGDC SG 
Sbjct: 246 F---SWRSQVSTTGFTLKTGEERAINAPSSWYGLISARTLCSNDSTGNFSCATGDCESGE 305

Query: 306 ISCNGAGGIPPATLAEFTLAPNGGMDFYDVSLVDGFNLPASIATVGGTGECQSTACSANV 365
           I C G     P T   F +  +G ++ Y +SL  G+NLP ++  V     C S+ C  ++
Sbjct: 306 IECPGTYKWSPVTYVIFRI-DDGQINSYIISLEFGYNLPLTV--VPSNPACISSGCMVDL 365

Query: 366 NGVCPTELQVRSGDGSVIGCKSACLAFNEPQYCCTAEFNDPSKCAHSQYSLIFKNQCPQA 425
           N  CP +L+  S  G ++ CKSAC      + CCT  F     C  + Y   F   CP A
Sbjct: 366 NKTCPNDLKEFSRRG-LVACKSACRQSASDENCCTNYFKYKQTCKPTPYVQNFDRACPSA 425

Query: 426 YSYAYDDKTSTFTCSGSPNYVVTFCP 449
           YSY +    STFTC+ S +YV+TFCP
Sbjct: 426 YSYPFSGNNSTFTCTNSTDYVITFCP 436

BLAST of MS006573 vs. TAIR 10
Match: AT1G20030.2 (Pathogenesis-related thaumatin superfamily protein )

HSP 1 Score: 273.9 bits (699), Expect = 2.4e-73
Identity = 130/232 (56.03%), Positives = 161/232 (69.40%), Query Frame = 0

Query: 221 SVVHAATFVVKNNCPQTIWPATLTSGSGQSQLSSTGFQLLSGASRTVDVPAPWTGRIWAR 280
           S V + +F   N C  T+WP  L S +G S L +TGF LL G +RT++ P+ W GR W R
Sbjct: 15  SGVMSRSFTFSNKCDYTVWPGIL-SNAGVSPLPTTGFVLLKGETRTINAPSSWGGRFWGR 74

Query: 281 TRCFVDSSSRFSCETGDCASGSISCNGAGGIPPATLAEFTLAPNGGMDFYDVSLVDGFNL 340
           T C  DS  +FSC TGDC SG I C+GAG  PPATLAEFTL  +GG+DFYDVSLVDG+N+
Sbjct: 75  TLCSTDSDGKFSCATGDCGSGKIECSGAGAAPPATLAEFTLDGSGGLDFYDVSLVDGYNV 134

Query: 341 PASIATVGGTGE-CQSTACSANVNGVCPTELQVRSGDGS---VIGCKSACLAFNEPQYCC 400
              +   GG+G+ C ST C  ++NG CP+EL+V S DG     + CKSAC AF +P+YCC
Sbjct: 135 QMLVVPQGGSGQNCSSTGCVVDLNGSCPSELRVNSVDGKEAVAMACKSACEAFRQPEYCC 194

Query: 401 TAEFNDPSKCAHSQYSLIFKNQCPQAYSYAYDDKTSTFTCSGSPNYVVTFCP 449
           +  F  P  C  S YS IFK+ CP+AYSYAYDDK+STFTC+ SPNYV+TFCP
Sbjct: 195 SGAFGSPDTCKPSTYSRIFKSACPRAYSYAYDDKSSTFTCAKSPNYVITFCP 245

BLAST of MS006573 vs. TAIR 10
Match: AT1G20030.1 (Pathogenesis-related thaumatin superfamily protein )

HSP 1 Score: 272.7 bits (696), Expect = 5.2e-73
Identity = 128/226 (56.64%), Positives = 158/226 (69.91%), Query Frame = 0

Query: 227 TFVVKNNCPQTIWPATLTSGSGQSQLSSTGFQLLSGASRTVDVPAPWTGRIWARTRCFVD 286
           +F   N C  T+WP  L S +G S L +TGF LL G +RT++ P+ W GR W RT C  D
Sbjct: 4   SFTFSNKCDYTVWPGIL-SNAGVSPLPTTGFVLLKGETRTINAPSSWGGRFWGRTLCSTD 63

Query: 287 SSSRFSCETGDCASGSISCNGAGGIPPATLAEFTLAPNGGMDFYDVSLVDGFNLPASIAT 346
           S  +FSC TGDC SG I C+GAG  PPATLAEFTL  +GG+DFYDVSLVDG+N+   +  
Sbjct: 64  SDGKFSCATGDCGSGKIECSGAGAAPPATLAEFTLDGSGGLDFYDVSLVDGYNVQMLVVP 123

Query: 347 VGGTGE-CQSTACSANVNGVCPTELQVRSGDGS---VIGCKSACLAFNEPQYCCTAEFND 406
            GG+G+ C ST C  ++NG CP+EL+V S DG     + CKSAC AF +P+YCC+  F  
Sbjct: 124 QGGSGQNCSSTGCVVDLNGSCPSELRVNSVDGKEAVAMACKSACEAFRQPEYCCSGAFGS 183

Query: 407 PSKCAHSQYSLIFKNQCPQAYSYAYDDKTSTFTCSGSPNYVVTFCP 449
           P  C  S YS IFK+ CP+AYSYAYDDK+STFTC+ SPNYV+TFCP
Sbjct: 184 PDTCKPSTYSRIFKSACPRAYSYAYDDKSSTFTCAKSPNYVITFCP 228

BLAST of MS006573 vs. TAIR 10
Match: AT1G75800.1 (Pathogenesis-related thaumatin superfamily protein )

HSP 1 Score: 265.0 bits (676), Expect = 1.1e-70
Identity = 124/232 (53.45%), Positives = 159/232 (68.53%), Query Frame = 0

Query: 221 SVVHAATFVVKNNCPQTIWPATLTSGSGQSQLSSTGFQLLSGASRTVDVPAPWTGRIWAR 280
           S V + +F++ N C  T+WP  L S +G   L +TGF L  G  RT+  P  W GR W R
Sbjct: 18  SGVRSTSFIMVNKCEYTVWPG-LLSNAGVPPLPTTGFVLQKGEERTISAPTSWGGRFWGR 77

Query: 281 TRCFVDSSSRFSCETGDCASGSISCNGAGGIPPATLAEFTLAPNGGMDFYDVSLVDGFNL 340
           T+C  D+  +F+C TGDC SG++ C+G+G  PPATLAEFTL  + G+DFYDVSLVDG+N+
Sbjct: 78  TQCSTDTDGKFTCLTGDCGSGTLECSGSGATPPATLAEFTLDGSNGLDFYDVSLVDGYNV 137

Query: 341 PASIATVGGTG-ECQSTACSANVNGVCPTELQVRSGDG---SVIGCKSACLAFNEPQYCC 400
           P  +A  GG+G  C ST C  ++NG CP+EL+V S DG     +GCKSAC AF  P+YCC
Sbjct: 138 PMLVAPQGGSGLNCSSTGCVVDLNGSCPSELKVTSLDGRGKQSMGCKSACEAFRTPEYCC 197

Query: 401 TAEFNDPSKCAHSQYSLIFKNQCPQAYSYAYDDKTSTFTCSGSPNYVVTFCP 449
           +     P  C  S YSL+FK  CP+AYSYAYDD++STFTC+ SPNYV+TFCP
Sbjct: 198 SGAHGTPDTCKPSSYSLMFKTACPRAYSYAYDDQSSTFTCAESPNYVITFCP 248

BLAST of MS006573 vs. TAIR 10
Match: AT4G36010.1 (Pathogenesis-related thaumatin superfamily protein )

HSP 1 Score: 252.7 bits (644), Expect = 5.6e-67
Identity = 121/233 (51.93%), Positives = 155/233 (66.52%), Query Frame = 0

Query: 223 VHAATFVVKNNCPQTIWPATLTSGSGQSQLSSTGFQLLSGASRTVDVPAPWTGRIWARTR 282
           V + TF + N C  T+WP  L SG+G S L +TGF L    +R + +PA W+GRIW RT 
Sbjct: 20  VSSTTFTIVNQCSYTVWPG-LLSGAGTSPLPTTGFSLNPTETRVIPIPAAWSGRIWGRTL 79

Query: 283 CFVDSSS-RFSCETGDCASGSISCNGAGGIPPATLAEFTLAPNGGMDFYDVSLVDGFNLP 342
           C  D+++ RF+C TGDC S ++ C+G+G  PPATLAEFTL    G+DFYDVSLVDG+N+P
Sbjct: 80  CTQDATTGRFTCITGDCGSSTVECSGSGAAPPATLAEFTLNGANGLDFYDVSLVDGYNIP 139

Query: 343 ASIATVGG------TGECQSTACSANVNGVCPTELQVRSGDGSVIGCKSACLAFNEPQYC 402
            +I   GG       G C +T C A +NG CP +L+V +     + CKSAC AF  P+YC
Sbjct: 140 MTIVPQGGGDAGGVAGNCTTTGCVAELNGPCPAQLKVATTGAEGVACKSACEAFGTPEYC 199

Query: 403 CTAEFNDPSKCAHSQYSLIFKNQCPQAYSYAYDDKTSTFTCSGSPNYVVTFCP 449
           C+  F  P  C  S+YS  FKN CP+AYSYAYDD TSTFTC G+ +YV+TFCP
Sbjct: 200 CSGAFGTPDTCKPSEYSQFFKNACPRAYSYAYDDGTSTFTCGGA-DYVITFCP 250

The following BLAST results are available for this feature:
NCBI nr
Match Name | E-value | Identity | Description
RDY04111.1 | 6.5e-150 | 57.11 | hypothetical protein CR513_12220, partial [Mucuna pruriens]
OMO73718.1 | 1.2e-140 | 56.43 | Thaumatin [Corchorus olitorius]
RYR71254.1 | 4.5e-135 | 51.06 | hypothetical protein Ahy_A02g005535 isoform A [Arachis hypogaea]
RZC77122.1 | 8.0e-132 | 60.75 | hypothetical protein C5167_001310 [Papaver somniferum]
XP_019420884.1 | 1.4e-131 | 53.08 | PREDICTED: uncharacterized protein LOC109331061 isoform X2 [Lupinus angustifoliu...

ExPASy Swiss-Prot
Match Name | E-value | Identity | Description
Q9FSG7 | 1.1e-86 | 64.04 | Thaumatin-like protein 1a OS=Malus domestica OX=3750 GN=TL1 PE=1 SV=1
O80327 | 2.4e-86 | 64.16 | Thaumatin-like protein 1 OS=Pyrus pyrifolia OX=3767 GN=TL1 PE=1 SV=1
Q9SMH2 | 3.2e-83 | 62.77 | Thaumatin-like protein 1 OS=Castanea sativa OX=21020 GN=TL1 PE=2 SV=1
P83332 | 4.6e-82 | 61.40 | Thaumatin-like protein 1 OS=Prunus persica OX=3760 PE=2 SV=1
P50694 | 1.5e-80 | 61.33 | Glucan endo-1,3-beta-glucosidase OS=Prunus avium OX=42229 PE=1 SV=1

ExPASy TrEMBL
Match Name | E-value | Identity | Description
A0A7N2M7N9 | 3.4e-152 | 54.86 | Uncharacterized protein OS=Quercus lobata OX=97700 PE=3 SV=1
A0A371HMT6 | 3.1e-150 | 57.11 | Uncharacterized protein (Fragment) OS=Mucuna pruriens OX=157652 GN=CR513_12220 P...
A0A6N2MKB6 | 2.0e-149 | 59.47 | Uncharacterized protein OS=Salix viminalis OX=40686 GN=SVIM_LOCUS383471 PE=3 SV=...
A0A1R3HTS8 | 5.9e-141 | 56.43 | Thaumatin OS=Corchorus olitorius OX=93759 GN=COLO4_26903 PE=4 SV=1
A0A445E6Y1 | 2.2e-135 | 51.06 | Uncharacterized protein OS=Arachis hypogaea OX=3818 GN=Ahy_A02g005535 PE=3 SV=1

TAIR 10
Match Name | E-value | Identity | Description
AT4G18250.1 | 2.5e-83 | 38.34 | receptor serine/threonine kinase, putative
AT1G20030.2 | 2.4e-73 | 56.03 | Pathogenesis-related thaumatin superfamily protein
AT1G20030.1 | 5.2e-73 | 56.64 | Pathogenesis-related thaumatin superfamily protein
AT1G75800.1 | 1.1e-70 | 53.45 | Pathogenesis-related thaumatin superfamily protein
AT4G36010.1 | 5.6e-67 | 51.93 | Pathogenesis-related thaumatin superfamily protein
InterPro
Analysis Name: InterPro Annotations of Bitter gourd (TR) v1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR001938 | Thaumatin family | PRINTS | PR00347 | THAUMATIN | coord: 276..287; score: 54.95
IPR001938 | Thaumatin family | PRINTS | PR00347 | THAUMATIN | coord: 438..447; score: 55.0
IPR001938 | Thaumatin family | PRINTS | PR00347 | THAUMATIN | coord: 227..239; score: 53.37
IPR001938 | Thaumatin family | PRINTS | PR00347 | THAUMATIN | coord: 328..344; score: 59.01
IPR001938 | Thaumatin family | SMART | SM00205 | tha2 | coord: 12..222; e-value: 4.4E-106; score: 368.4
IPR001938 | Thaumatin family | SMART | SM00205 | tha2 | coord: 228..448; e-value: 3.4E-119; score: 412.0
IPR001938 | Thaumatin family | PFAM | PF00314 | Thaumatin | coord: 232..448; e-value: 4.4E-79; score: 265.1
IPR001938 | Thaumatin family | PFAM | PF00314 | Thaumatin | coord: 16..229; e-value: 2.1E-75; score: 253.0
IPR001938 | Thaumatin family | PANTHER | PTHR31048 | OS03G0233200 PROTEIN | coord: 7..223
IPR001938 | Thaumatin family | PANTHER | PTHR31048 | OS03G0233200 PROTEIN | coord: 224..448
IPR001938 | Thaumatin family | PROSITE | PS51367 | THAUMATIN_2 | coord: 225..448; score: 49.416985
IPR001938 | Thaumatin family | PROSITE | PS51367 | THAUMATIN_2 | coord: 9..221; score: 45.751801
IPR037176 | Osmotin/thaumatin-like superfamily | GENE3D | 2.60.110.10 | Thaumatin | coord: 10..224; e-value: 2.4E-86; score: 291.3
IPR037176 | Osmotin/thaumatin-like superfamily | GENE3D | 2.60.110.10 | Thaumatin | coord: 226..448; e-value: 6.6E-92; score: 309.5
IPR037176 | Osmotin/thaumatin-like superfamily | SUPERFAMILY | 49870 | Osmotin, thaumatin-like protein | coord: 226..448
IPR037176 | Osmotin/thaumatin-like superfamily | SUPERFAMILY | 49870 | Osmotin, thaumatin-like protein | coord: 10..221
None | No IPR available | PANTHER | PTHR31048:SF132 | PRU PROTEIN, PUTATIVE-RELATED | coord: 224..448
None | No IPR available | PANTHER | PTHR31048:SF132 | PRU PROTEIN, PUTATIVE-RELATED | coord: 7..223
None | No IPR available | CDD | cd09218 | TLP-PA | coord: 227..447; e-value: 1.96757E-111; score: 325.35
None | No IPR available | CDD | cd09218 | TLP-PA | coord: 11..221; e-value: 1.06057E-104; score: 308.401

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
MS006573.1 | MS006573.1 | mRNA