MS010599 (gene) Bitter gourd (TR) v1

Overview
NameMS010599
Typegene
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionNon-specific serine/threonine protein kinase
Locationscaffold35: 876850 .. 911400 (+)
RNA-Seq ExpressionMS010599
SyntenyMS010599
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGAGTCCCATTCAGAATTTCGAGCTGCATTCCCGCCAACTCGTCGAGCCTGAACTCAGTACTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCAATTTCTCTGTGCGCGCGCAAACATAGCTGCTTTTTCTCCCGTTTAGTTTTGAGAAACTAGAGCACTTGAGTTTAAAGCTATGGAGTATGCAAAGTTCTTGAGTTTCATTTTCTTTTTGAAGTTTACAAGTTCTGTATTGCTGTCGCCTTTTCTTTTTGTTCCTCCGTGACCGAACTGAGCTACCTGTAAGAATCTTTTTTCTGTTCTGAAATATTTGTTTGCGAGCAGGTATTCAGACGAGGCTTCAGATGGCAACGGAAGTTCGAGATAGCCTGGAGATTGCTCATACTCCCGAGTACTTAAATTTTCTGAAATGCTACTTTCGGGCGTTCTCTATAATCCTAGTTCAGATTACAAAGCCTCAATATACTGACAATCATGAACACAAACTTCGGAACATCGTGGTGGAGATCCTTAATCGTCTTCCTCACAGTGAAGTTTTAAGACCTTTCGTGCAGGACCTGTTAAAGGTCGCCATGCAAGTGCTTACCACAGATAACGAGGAGAACGGCTTAATTTGTATCCGCATAATATTTGATCTCCTCAGAAATTTTAGACCAACTCTAGAAAATGAAGTGCAGCCATTCCTGGACTTTGTGTGCAAAATTTACCAGAACTTTAAGTTGACCGTAAGCCATTTCTTTGAAAATTCGGCTGCTGGTGGCGAAGATATAAAGCCCATGGATGTATCCACTTCAACGGACCAGACAATTACTACCGGCTACACGGGGACTGTGCAGCTTAATCCTAGTACCCGTTCATTTAAGATAGTAACGGAGAGTCCACTTGTTGTCATGTTTCTCTTCCAACTGTATAGTCGGCTTGTTCAAACAAATATCCCTGTCTTGTTGCCTCTGATGGTTTCTGCTATTTCTGTTCCGGGACCTGAAAAGGTTCCTCCCTTTTTGAAGACTCATTTCATTGAACTGAAGGGTGCGCAGGTTAAGGTATGCTGTATACCGATACTCTTTGTTTTTGTTTTTTCTTTTTGGATCCTTAAATTGGTTGCCATTGTACGTTAAAACTAAATTTCACGACTAGCCTCTTTACACGTTCATTGTGATTGATTTAGTGAGTTCATTTATCTTATCTTTCAATTTTCCATTAATTTATTTTCACAGTTGCGTGACATGTCCCTGCTATGCATGCATATTTATGTATGACATGTTAATATTATCCTCAGATACTCTTTATGGGCCATGGTGTACGTTCTCTTGATGTTGTTGAGCTTATATCATTAGAGTCTTTCGTCCTTAATGAAGTATGGCTACAATGGATATTATATGTTTTTGAAGATCATATATACCTGATATTATTAGTGATTTACTTTGTATTCATATTGCATCAAAGAATTATGGTCTAGAGTTGAATATCTGTCTTTTATCATAAGCTATGCATACCATTAGATTAGTCAATCAGGAAGTAAAGTTCCACTGACCACAGTAGTCAATCTGGCTGTACATTAACCACAGCAGTCAATCAGGAAGTATAGTAACAACGTACTTATCTCATGTCTCTGTGATTGCAGACAGTTTCTTTTTTAACATATTTGCTGAGGAGTTCTGCTGATTATATCAGGCCACACGAAGAAAGTATTTGTAAGAGTATTGTGAATTTGCTGGTTACATGTTCAGATTCCGTGTCAATTCGGAAAGTGAGAACTTTTGTCCTTTATTATCCTGGGTTTCTTTTCCAACTTAGTCAGTTGGTCGAAAAGAAAATCTGAGAACTAAGCTTAAGGTTGTTCAATGCAGGAATTGTTAGTAGCCCTGAAACATGTTCTTGGAACAGAGTATAAGAGGGGCTTATTTCCTTTGATTGATACACTGTTGGAAGAGAAGTAATATGCTTGACCTATACACATCCTAATGCTGAGCTTTGCTATTATTTAACTTATATTTTGTTATTTTATCTTTCCACCTTTTTAGGGTTCTAGTGGGAACTGGTCGGGCATGCTATGAGACATTAAGACCATTAGCCTATAGTTTACTGGCAGAAATTGTGCATCATGTCAGGGGGGATCTTACCCTATCTCAGGTATATCCATTGTGCATTTTTTTTGATAGGACTCCATTGTGCATTTTAGTGCTAATGCACAAATGTTCCAAGTTTACCTTGTTAATTTTAGTTTTCCTGCAATAGTTGGGTCGTGAAAGTTTAAATAAGGAGGACTATATCTATAGATATGAATTTTCCTGACTGTCTTGTCCTGAACCTACGTTTAAGAATGAAAAAGTGTATCTTTTTGTCATTTATATCCTTTCGCACTGTAAGCTTATGCTACTATACTTTAAACGATCGCTCTCCTGCATTTAAAATGACGTTTTCAGAAGATGGGGAGTGACATCTTGAAGAAGTCACTTGGAATTGGAGGCTTAGAGTTTTTAAGAAAGTCTGGATTTAGTTTGTTACCTATGTCAAGTCATAAGAAAATTGAAATATTCTAAAGGGAAAGAAAATTGAAAAGAAAGTAATGATTTGATAGCTTTGGTTGGAAGACGACCCTATGGTTTGAAAGGGTGTTTGTTGAGGTGAAAAAAAAGGAGAGAAGAGAGTAAAAAGATGTTTAAAGGTCCAAATGGAATAATGGTGTTGGGAGGTTGCGGCATTTTGTTGGAGATTTTTTTTCAGAATGGAGATTGGTCTTCTTATTGGAGTACGAGTGCCCAAGTGTTTTCAGGTTTTCCTAAGCCCCTCCCTCCTTGTATTTGAAGCTGGTGTTTTTTAGTTAAATCCTTGTCTATTTGCTGTCTATTATCGGGGAGAAAAGTGAAGTCTCTATTATCGGGAAGAAAACTGAAGTGTGCAGTCAGCAAAGTGTGCTTCTGGTTAAAAGGGCTTGGGAGTACGATTGGGCTTTTGGTTTCGTAGTCTTTCTGAAGATTTCCTTGCAGTTTTCTGTGCCTATTCGTTTACAGGTCGCAACGATAGCTCTTCCAATGGAAATTTTGGAGTTGCTGAGTATTTAGCATTTTTTCTCCCTATCAAAGCAAAGAATATATGGGTCTCATTTGTTAAAGCTTCCTTAGAGGATTGGTCAGAAAGAGTTCCTTTGTTTTTTTGAAGTCAGATGGCCTTTTTTGGAGGGAATTTTGTTCTTTGCCTGTTTTTGGAGGCTTCATGCTTGTCAAGCATTTCAACTTATGTTTGTTTCAATCTATCTTGTTTCCTTTTTGGGTTCTTAGGGGCGTCTAATACTTTTCTAAAGGGATGTTATATGTTTCTAGTGTAACATTTCACATTTTCAATGATTTTTTTTTTTGAAAAAAAAAAAATTTCAATGAAAAGTTGGCCATTTGTTTTCCTCTTTTTAAAATTACATCGTGCCTTTAAGACTTCCCTAGATTGTTTATGAAGTTTGGTTTGGAAGGAACCAAAGGGTGTTCAAAGGAAAAAGGTGGTCTTCCCTAGATTATTTCACCACTGCCAAGTTTAAGGCTTTCCAATGGTGTGCTATTTCCAATTTATTTGCTAATTATTCTCCCAACATGATTTGTATGAATTGAGAGGCTTTTATATGCCCTTGGTCTTAATGCTTTATGTTTTGAACAAATTTTGTTCCTATTCATCAATCAAAAGTTCGTATCTCATTCAGGAAAAAAAAAAAAACATCATGCCTTGTTAGCTTGGTTTCTGCAGCAATCTGCTGCCTTTCATCATCTGATACATGCTATGTAACGTCTGCAGCTATCACGGATTATCTACTTGTTCTCAAGTAATATGCATGATGCCTCACTATCACTTAGCATTCATACTACTTGTGCACGATTGATGCTGAACTTGGTGCGTCTAACTCCTTTAATTCCAATGATTTAAACTTCTCTTTCCCATTTTACATATACTGGCATGCTTGAATAAAATGGATTTTGTGGATTTAGGTGGAGCCAATCTTTGAGAAGGGTGTTGACCAAACTTCTATGGATGAAGCACGAATTCTCTTGGTCTGAAATAATCATACGTTGGTTGATTGCCCTCTCTATTAGTAAAACACAAGCAGTGAATTTTATGTTTAATATGATAAGCTTATTTGGTGTAGGGGCGTATTTTGGATGCTTTTGTTGGGAAGTTTAGTACGTTCAAGCATACCATTCCTCAGGTACCACAATTGCCACATTGGTTACTTGTTGATCTGTAAAATGCTGTTTTTTTAATCCGTGATCTATATAAGTCTCCCAAAAGAAAGCCACTAAGTTATTAATAATTAAGTAAATCATTATATACATCACTACTCTTTCTTAATAACCTGAAAAGCCATGATTATATGTGGATCTACATTTTATGTTCCGAATGTTCTTATTTATTTATTTTTCCATAATGTTTTACAGTTATTGGAGGAAGGTGAGGAGGGAAAAGATCGTGCAAATATGAGGTCAAAGCTTGAGCTCCCTGTGCAGGTTGCTTTTATGACTTTACTGGACTACCTTTTTGTTATAAATTTTGTTTGATATGTCACTGTCGTGATTCTTTTGAGCATCCTCTGTTTTATTTCCATTTTATTACTGGAAAATGCTCATATTTGATCTTGACCACTACTTATGAACTACACTTTAAGCCTATCTACATGAATCCTTTCGTTTCCTTATGAAAACTTCAAATAATTGCTTGCCTAACCGGCTCAACCAAAAAGTGCCCTGAGTTTCTTTGTAATTAATCAAATCCTTAATAACAAATAAGCCCATGTTATAGCAAACCAGTCAAACTTAGAAGTAGTGATTTTTGTTTAAATATCAATTTGTTATGGGAGTTGTGGCTAGAACGATCAGGGAATGATGAAAGCGAATAAAAAATGGAAAAAGGGGAGGGGGAAAATAAGTGAATGGAATCGACATATAAGAGCTCAAATTCCATACAGATGACAAATAATGATGTTGGTGTTTAATGATGTTGGTTAGTTTATTGAGTTACCTTTAATGTCTGCAGACTTTTGGATATATTATCAAAATTTATGATTTAGACAATGCAACATGTTCTTTATTGCTATTTTGATACTTCCATATGTATTTTCCAGAGTTCTGTTGTATGTTATTCCCTCTAACTACCATATTCTTGCAGGCAGTTTTAAATTTGCAGGTCCCTGTGGAACATTCTAAGGAAGTCAATGACTGTAAGCATTTGATTAAGACGTTGATCTTGGGTAACAAATCCTCTTTAGTGATTATCATTTTGTTTGGATTAGTATTTTGAAATCAGCATGTTGAGAGACTTCTGATTTTTGTGTTCCAAAGATAGCATGTTTCTTCACCTTTCATCTTGTACTGCTGGATGCTATTTTATTAGACTAATTTGTTTTGGCTCTGTTTTCTGTTAGGAATGAAGACGATCATATGGAGCATCACTCATGCACATTTACCCCGACCTCAGGTGTTTCCTTAATTCTAAAAATGTGGTTATAGTTTGGCTGTTTGTTGGTTATGTAATCTGTTGATTTATGGTTAAGGCTTCGCCATCTCCAAATGGAACACATCCACAGATGCTTGTTTCACCATCATCAAATTTGGCAACGCCTCAAGCATTCAAGGGAATGAGAGAGGACGAGGTATCATATAATTGAGGACGGATATGTGTGTGTGTGTATTTTGTTGGCATGATTCAAATCTAACTATAATTTTGGTCTGGGTAGGTGTGTAAAGCCTCTGGTGTCCTGAAAAGTGGTGTTCATTGCTTAACACTTTTCAAGGAAAAGGATGAAGAAGTAGAAATGCTTCATCTTTTCTCCCAGATATTGACTGTAATGGAACCTCGGGATCTGATGGACATGTTTTCATTGTGTATGCCTGAACTTTTTGACTGCATGATCACCAACACACAGCTGGTCCATCTGTTTTCAACATTTTTGCAAACACCTAAAGTATATAGGCCATTTGCGGATGTTTTGGTTAATTTTCTTGTCAGCAGTAAACTTGATGTTTTGAAGCACCCAGATTCACCGGGGGCAAAATTGGTCTTGCATCTCTTTCGTTTTGTATTTGGTGCTGTTGCTAAAGCACCATCAGATTTTGAGCGTATTTTACAGCCTCATGTGACTGTCATAATGGAAGTTTGTGTAAGAAGTGCTACTGAAGTTGAAAGACCGCTTGGGTACATGCAACTTCTTCGCATCATGTTTCGGGCATTGGCAGGGTGTAAATTTGAACTTTTACTACGTGATCTGATTCCTTTGCTACAACCTTGCCTTAACATGTTACTGACCATGTTTGATGGTCCAACTGGGGAAGATATGAGGGATCTGTTGTTGGAATTATGTCTCACATTGCCTGCACGCTTAAGCTCATTATTACCTCACCTTCCACGTTTGATGAAGCCTCTTGTTTTGTGTCTTAAAGGAAGTGACGACTTAGTTAGTCTAGGTTTGCGAACCCTCGAGTTCTGGGTCGATAGTTTAAATCCTGACTTCCTAGAACCGAGTATGGCAAATGTGATGTCTGAAGTGATTTTAGCCTTATGGTCTCATTTGAGGCCAATACCCTATCCTTGGGGTGCAAAAGCTTTGCAAGTTCTCGGAAAGTTAGGTGGTCGCAATAGACGTTTTCTGAAAGAGCCACTTGCACTAGAATGCAAGGAGAATCCAGAACACGGGCTTCGTTTAATTCTTACCTTTGAGCCATCTACTCCCTTTTTGGTGCCATTGGATAGATGCATTAATCTTGCTGTATCAACTGTAATGAATAAAACTGGTGGTGTTGATTCTTTCTACAGAAAACAAGCTTTGAAATTTCTTCGGGTCTGTTTATCTTCTCAGCTTAATTTGCCTGGAAATGTGGCTGATGATGGCCATACACCCAGACAATTGTCAACTTTACTAGTTTCTCCTGTTGATTCCTCTTTGAGAAGGTCTGAGACTCCCGAGGGAAAGGTTAGTTTATTGGTTTGTGTCACTTTTATATTCTTCAACTTCATTTTCTTCTGCTTTTCTATGAAAAACATTTTTCTATCTTTGGACTTTGTCTCAGATTTATCAATACATTACCAATGTGATAAAGAGCCGGTATTATGTTTGCTCATTTTGTTCTGACATTATTGTCTGCATTTTGAATTTAATGTGGTTGCGTCACCAGCATGTTGAATATAGGACTAATTCCTGAGGTTCCTTCTTTAAAAGTAAATGGTGGTTCTAGAATTTGTTTCATAGAATTCAAAAAGATTCAATTCCAAATGCTGTGCAATATTGAAGAAATATTAATATAGTTGAACATTTACACTATTCTGTATATGGTCTTGGGGGCTACTGGAATTCCATAACTTCAGCTGTATTAACCTTCCCAAACAGAAGAATCCATGTAAGAACATTGACCTTCTTGGGGCTACTGGAATTCCATAAGTTCATAGCTTCGAACAATAATCCAATGGCAAAGCAGCAGAGGGACCTCTTGGTTTAACCAAGGAACATTGACATTCTGATTGTTAATATTAATCTTTCTTCTTTTTTATTTGTCGGAATAATTGTGCTTTGCTACCTAGATTGGGACGAACTGCCTTAAAGCCGATTACTGTAGTATCTGTCAGCTTCAATGAGAAATAATTTTTTGCAGGCTGATTTGGGTGTAAAGACAAAAACCCAACTTATGGCTGAGAAATCTGTTTTCAAAATTCTATTGATGACCATTATTGCTGCTGGTTCAGAGGAGGATCTCCACGAGCCAAAGGATGATTTTGTTCTCAATGTATGCCGCCATTTTGCTATACTATTCCATATTGATTCTTCTCTAAACAGTTCTCCAGTTGCATCTGCCTCACTTGGGAGTACTTTGCTTCCTCCAAACGTCAGTGCCAATTCCAGATTAAGAAGTAGTGCTTGTTGTAACCTCAAAGAGTTAGACCCTCTCATTTTTTTGGATGCCTTGGTTGAGGTGCTGGCGGATGAAAACAGGGTCCATGCAAAAGCTGCTCTGAATGCTCTAAATTTGTTCTCTGAAATTCTTCTTTTCCTTGCTCGTGCAAAACAAACTGATGTGATGATGACAAGAGGGCCCAGCACCCCAATGATTGTTTCCAGTCCATCAAAGAGCCCTGTATATTCACCACCTCCAAGTGTCCGTATTCCAGTTTTTGAGCAACTCTTGCCACGGCTTTTGCATTGTTGTTATGGCAGCACATGGCAAGCCCAGATGGGTGGTATTATGGGACTTGGTGCTTTGGTTGGAAAGGTTACTGTTGAGACTCTGTGTCTTTTCCAAGTAAGAATTGTGCGAGGCCTGGTATATGTTCTGAAAAGGCTGCCAATTTATGCTAGTAAGGAGCAAGAGGAGACTAGCCAAGTACTCAATCAGGTTCTTCGTGTTGTGAATAATGTTGATGAAGCAAATAGTGAACCGCGCAGACAAAGCTTTCATGGGGTAGTAGATATTCTTGCTTCTGAGTTGTTTAATCCCAATTCATCAACTATCGTGAGAAAGAATGTGCAGTCATGTTTAGCTCTTTTGGCCAGTAGGACTGGTAGTGAGGTGTCTGAGTTGCTTGAACCTCTGCATCAACCTTTGCTTCAGCCTCTCTTATTGCGACCACTTCGGCTGAAGACTATTGATCAGCAGGTATTGAATTGCTTTTTGTTATTTAAATAGTTCACTTTGTTGGTATGTTAACTTGCTTTGATGTACAGGTTGGAACTGTCACAGCCTTGAATTTCTGTTTGGCATTAAGGCCGCCTCTTCTAAAGTTGACTCAGGAGTTGGTCAACTTTCTGCAAGAAGCTTTGCAAATAGCTGAGGCAGATGAGACTGTATGGGTTGTAAAGTTCATGAACCCTAAAATAGCCACATCATTGAACAAGCTCCGAACAGCTTGCATTGAGTTACTGTGCACCACCATGGCATGGGCAGATTTTAAAACACCCAATCATTCTGAGTTGCGTGCAAAGATCATCTCAATGTTTTTCAAGTCATTAACATGTCGGACTCCAGAAGTAGTTGCTGTTGCAAAGGAGGGGTTAAGACAGGTTTCGTTTTTCACTTGTAATGTCAATCTAGTTTGTTTTCTTTTATCATCATTGCGTGTAGGTTAGATTCTGATAAATACAACTGAAGTCTAATCTCCATGTTGGCAAGAATTTCACTTGGTAATCTTTTTGGGAAACACTTTTTCCTTTTTAAAATTAATTTTTACAACTTAAAAGTTAATTTAGATTATTTATTTATGAGAAATTATAATGGGTTGCAAAATTGGTTGCCATATTTGCTAATCTAAAATAATAGTTTGCAAATATGACAATTTAAAATTTTGTGCGTAGCTATTTTTAATTTTTTATGTGATTCCCTTTTTACCCACTATAAACGAGGTTGAGATATGTTGATATGCCTTTTTGACCTTTCCAACGTTTTATTTGGGATGAGTCTAAAATTATGCAATATTTGCTGATCCTTTTATATTGTATTATATTTGCTACCATTTTGAGCCAAATTGCTATTATATGCCATTGCTCTTTTATTTATTTATCTATTTGCAGGATGTGATTGCCGAATGAGAATTTTGAATTACATATATCTAGGATTTTGTCTTAGAGATTTATTTATTCATTTATTTATTATTATTATTATTTTAACATATGCTGTTATCATGAGAATTTGATTGCACTATCTCTAGGATTTTGTCTTAGAGATTTTAGACCCATGTGGTAGCCCTTTGCCAAGAGGTTTCTTGGTCATACTGATTTAGAGGTCTCAAATTCGAACCTTCGGGTGAGCTTAATATAAAAAACCCTTGATATCATCTGGGTTCGGGCTTTTGGGCGGGCGCGGATGCCCCTGGATATAGGGGAGTAAAGCTCCGACTCCCAGTTATCAAGAAAAAAAAGTTCAAACTTTTGATAGTCACCAATAAAGAATGAATGGGTTAATGAAAATTAATTATACAACAGGGTAATATAGACCTTTACTTGCGGTGGTTATTTTTTATGGGCTACAGTTGTAATATACATATGTATATATACGTGCACATACATACTCTTGTTTTAGGATTTACCCCACTTTTGTACAGGTTATTAATCAGCAAAGGATGCCCAAAGATTTGCTGCAAGGTAGCCTTAGACCTATTCTGGTAAACTTGGCACACACCAAAAATCTTAGCATGCCACTTCTTCAAGGTCTGGCTCGCCTTCTTGAACTTTTGGCCAGTTGGTTTAATGTCACATTGGGAGGCAAGCTGTTAGAGCACCTCAAGAAATGGTTGGAGCCAGAAAAACTTGCTCAAAGTCAGAAAGCGTGGAAGGCGGGTGAGGAGCCAAAAATTGCCGCAGGTAATATTTTTACTTTAATATGAATTAGTGTAAAATTAATATCTTCTCAATACAAAAAAAATTATCCATGAACCGACTGATATATCACGCAAAATTCAACCAACAAAACACCTAGATCAATCTGAAAAATGTCCTAATCCAACTATGAACCTGAACTTAGACACAAAATCTAACTGCTGTTAAAAAGTGCCCAAACTCTATACCAAAATATTTGAGCTTAGCTAAAAGGCACCCAATGGAACTAGAAGACATTATACCTTTAAAAATTGTCTAATGTCTTGTCTAGCCACGAGAGTGCCAACTAGTGTAGAAAATGTGTACAAAGGAGCAAGAAGTCACATGAAAACTCTACAATTGGTAAATAAAATAGAAAAAGTGTACTTTTAATTTTTCAAAAGGAGTTAGCTTTAAGGGTCCAAATGGCCATCATGGCTAAAATGTTATCCCAAAGTTGATTGAGAGAGAACTCTTCAATTGAAAAAGTTTGCTTATTTTGTTCAAGTGAGAAGACCTAGCAAACTGTGGTGAGGCAAGGAGCCAAACATCTATTTTCTGTGCTGAGAAAACCTGAATTGCTAAGTTTTTTGATAAAGAAACAATTCATTGAAAGAAAAAGAAGCCAAAAACAGCCTAAGGACAGGGGACGAGAAGACCCCCCCCAAAAGAGAAACTAGAACAACAAGGTTCTCCAATCTTGTAAGATCGTAGAGAGGCTGTAGTTACAAAAGAATTTAGTATTTCTATGGCACCACCAAGAGGCTGTGATTTGTACGCTATTACAAAAAGAATCAAAAGACTTAGACGTGTCATCGAAAGAGCGAAGATTCCTTTCCAACCAAATGTTCTAAAGCAACGATCTGAAAGCTCACCTCCACAAAAAATAGACCTTTCCTTTAAGATGCCATCCATAAAAACATTCAAGCAGCCAATCTTCAATGCTTCTAGGTAAGCAACCCGAGAGCCCAAAAATATCAAACAAAACCATCTCAGCTTTCTTAGTAAAAGGACAGTGAAGAAACAAGTGGTCAGTGATTCTGCTTCTCTCAAACACAAACAGCAACCAGAAGGGGAAATCGACCAACCTTGGCATTTCTTTTGGAGCCTCTCATAAGTGTTCAAAGCTTGGTAAGCTAACGACCATAAGAAAACCTTCACTTTCTTCGGGATGGCATCTTTCCAAATAAAATTAATTAGAGGAGACTGAAGCGAATTGACATTAGTATCCACAAGATTCAAAAATGCTGATTTCGTAGAGAAAAGCCCATCCCCCTCGAGGTTCCACACAATCTTATCATCATCCTCTCCGAAAACAACAGAGCCTCTGCACCAGCCTCATCCATTGCTCAACTTCTCTATCAAACAAACCCCTCCTTATGCCAACACTAAAAAACAGACGCATTCTGCTTTAAAGAAAGGGCGAACATATCCGGAAAATCGTGTGCAAGAGGCAAGATGCCCGCCCATTTATCCAGCCAAAAGCTAAGTCTAGTTCCTCGACTCACTTTAAAGACCACCGTTTCTTTAAAAGAGTCCACATTTTTGGCAATATCAAACTAAGGTCTCCCCCTAGCCTTTCCTTTGGGAGATTTCGACATCCAACCATACTTATCGACACCATAAATGCTAGCTATGACTCTCCTCTATAGAGCATTATATTCCTGTGTAAACCGCCATAGCCACTTTGTTAACAGAGTTGTGTTCTTAAAGGCGAAAACCATCGATTCCCAACCCTCCCAACTTCACAGGAAGCAAGGATATATTCTTTTTTTTAGATAAGAAACTTTCTTTGGATATATCAAAAAAGGAAGACAGACTAAAGGCTGGGGTAGAGGAAACCCCACCTAAAAAACTAAACAACCGCTATCCAGTTGTGCATGAACATGGATAGGTTGTAATTACAAAAGAATTTACTATAATTGATGCACCACCAAGAGGTAGTGTGCTGTATATTCCTACAAAAGAGATCAAAAGTATTGGATTTATCTTCAAAAACCCGATAGTTTCTTTCGAGCCAAATATGCCAAAGGAGAGCTCTAGAGGCACAACACCAGATAAACCTTGTTTTCTTAGGGAGGGTTCAACCATGAAGAGCTTCAGCCATCCAATTTTCAATCTGTCTTGGGAAGCAAGTGGAGATCTCAAAAATATTTAGGAGGAAATGCCAAATCTGTGAAGCAAAAAGGCAGTGCAGAAACATGTGATCCATGTTCTCACTGTGCTTCATACAAAGAGTGCAGACCGAGGGGGACAGTGTCCAGTTCCTACACTTTCTCTTGAGTTTATCATGGGTATTCAAACTTCTATGGGCAAGGGACCAAAGGAACACTTTTACCTTCTTTGGAGCCTTAAATTTCCACACTGTCGAAATAAGAGGGGACCTTAAAGAGGGGCTGCAATCCACCAAATTCAAAAAGGCCAAACGAGTGGAGAACCGACCATTTTTGTCTATGGACCATACTACTTGATCCTCACCTTCCCCCAACTGCACCACATCAATTTCTTGAATAAGTCCCATCCAGCTCTCAACCTCCCATTCAAAAAGGCCTCTGCGCAGTCCAATATCCTGCAAGGATATATTCCATCCCCCCCGGATAAAATCCCTCACTATCTTATCCATTTCCTTCGTGATGCTCACAGGGGCCTTCAACAAAGGGAAGTAGAAAATTGGTAAACTATTTAAAACAGATTGGGCAAGTGTCAGTCTGCCCCCCCCTCGACAAAAGTAATTTCTTCCATTTGGACAATTTGATTTTGAACTTTTCCACCATTGGTCTCCAGAACTCTATTGCTCTATGATTTCCACCAAAAGGCAGCCCCAAATTGTGAAACGGCAAAGATTCAACTCGACACCCAATAGTGGTTGCTTTGAAGGCAACTTTATCCTCATCCAAATTAATACCACTCAGAGAGGATTTGGCCAAATTAATACGCAGCCCCGATCCCCTAATGAACATTGATATAAAAGCCCACCATCTGTTAATGTTACCTTTGTATGCCGGCAGAAAATTAAAGTATCATTTGCATATTGTAGATGAGACACCAGAACCCTATCTCGACCCACCTCAAAGCCTCAAAAGATCCTCTTCTCGCAGCAAAAGTTAACTATCCGGCTGGAAGAGTCTCCTATCAACGTGAAGAGAAAACGGGAGAGGGGATCCCCTTGCCTCAAACCCTGCGTAGCATGAATTTTCCCCCTGGCCCTGCCGTTGATGAAGACAGAAAAGTTTGCCGAGGAGACACAACCTTTAACCCATGTCCTCCATCTCTTTCCAAAGCCTTTCTGCAAGAGCACAGCGTCGAGGTAATCCCAATCAACCATATCATAGGCTTTTTCATAGTCAATCTTAAATAAATAGTCACTCTTCTTGGCGTTGCCATAGTCCCCGATAGCCTCCGCAGCCACAAGCATGGCGTCTAGGATCTGTCTACCCTCAACAAAGGCGACTTGAGCATCATCAATAATGTATCTCAGAACTTTCTTAAGCCTCTCAGCCAAAACCTTGCTATAATTTTGTAAAGCGAGGTAACTAGGCTAATAGGTCTGAAATCTTTGACTTTGTTTGCTTGTGAGCTCTTCGGGATCAAGCAAATGTAGGTTTCGTTTGTGCAAAGATTAACAATCCCATTCTTAAAAAACTCTTGGAACACCCTTCCTATATTAGCCTTCAAAATGTTCCAAGACTTTTTATAGAACTCCATGGTGAAACCGTCTGGCCTCGGAGATTTCAAATTACCAAAGCTTTGGATAGCTTTCCAAATCTCCTCTTCCCCAAAAGGAACTTCCAAGCTAGGAGCCAAGGAAGGTTGCAAAGAATTCCACTCACAGCCTTCAAAAATAAATCTATCACCTTCTGCCTTTGAATACAATTGATTAAAGTAGCCTAAAACCTCTGTTTCAATCTCCTTCTCTGATGTGAGGACCGAGCCGTTAATTTCACAAAGATGTTGAATGGAGGCTTTGCTCCTTGCCGAAACCCATCTATGGATCGGAAGAAATGAGAGTTCTCATCACCATCTTTAAGCGAACTCCATTTACATTTCTGAGCCCACATTTGCTGTTCCTTTAAGGGAAACCTCCATAAGGATGCTTTCAAAGATGTTCTGTGAGCTACTCTTTCTCCTGAGAGCTCCCCCTCCTCTCCTCTTCTCTGTCTATCTCTTCTATCTTCTCAATGGTCTCCCTTTTCACGCTCTCAACAACGCCAAATGAATCCCTATTCCAAGCCACAACCTTCTTTTTTAACTCCTTAAGCTTTGTCATAAAAACATAGCCCGGCCAACCAATGGGGGAGCAAGAGTTCCACCAAGAATCAACATTTTGTAAGAAAGAGGGATGATTCAACCACATATTCTCAAACCGGAAAGGAGAAGGGCCCCACTTTTGATTTCCTAGATTGAGCTGAAGAGGAAAGTGGTCTGAGGTTGTTCTAGTCTGTCTATACAACGAAACGCCTTTAAAAAACTGAAACTAGCTTTGAGAGAGGAAGAATCTATCAAAACGGGTGCAGACCGGGCTGCTACTTAAATTGGACCATGTAAACAAACCATTTTTCAAAGGAATATCCTCAACATTCATCGCCTCAATAACTTCATTGAACTCCCTCATACTTCTAGTGACCTTACCACCTCTAGATTTCTCTATTGGAGACTGGACCACATTAAAGTCCCCTCCAATACACCAATCATCAGAGCATAAAGCAGCCAAATCAAGCAATTCTTGCCAAAAACCATCCCTGTTTCTACTCGACGAAGGGCCATACACTCCTGTCACCCAACCAATGAAGCTTTTCGCATAGCTGATACGTAGTCACTGAAAAACACACCCAGAATGAAGTCTACAACCTCAACTGTGTCCTCGTTCCACAAATTAAAATCCCCCCTAGGACCCTTCTGCAGCCAACGTGATCCAACCTATCCTCCAGGAGCTCCAAACTGTCTTCACAATCTTCCTATCAATAAACTCTAATTTTGTCTCATGCAAAATAACCAGATCAGGATCAGTCTTGTAAAGAAAATCTTTAATCAACGCTCTTTTGGAGGGAGCACCCAAACCTCTCACATTCCAGGATATTATCTTCATAAATCACCCTCAATACCCTGATCTAGGCTACAAATTGCTAGCTCTTCTTCCTCCATCTCCTTTCCCCATGCCTTTAACTTCCCCACGATCTGACGGGCCTCCATTTTTTTCGCTGAGCTACCTTTAAAGCTAGTTTTGGAAGATATTGGTCTGATACACATGTTATGCTTCCTCATTGCTGCCATCATTTTTCGGGAAAAAGGGAAGCAATTCTCATCACCCTCCTCTGAAACTCCTACTACCTCTCTGTCCCCTACCTCCTCATTGACCTCTATCGGAGCTTATGGCCCCTGTTTCTCTAAGCTGTTCTTTTCAACTACGTCTTCTTCAAAACAAGCAAAGTAATCCGTCGGAGGATCCATACCAAAAAAGGGCCGCTGGTCACTTCTGTTTCCTATCCCATCTTCATCCCCATCCTCTTTGTTCGGATATTGACTCTCAATGCTAGAAAAGGACAACTCTGAGGACCCATCACCATCCTTTGATTTTTCTCTCTCTTTCCTTTTTGATGTCTCTGCCTCCCCCATCTTCGAAGTAGAGCATAAGTTACCTTTTTTGGGAAAGAATCGGGTATGATTGGTTTTGGAGAAGGAAACCAATTTCGTTTTTGCTTCTTTGTTTGGGGACATTTGGATACTACATGGAATCTCTACAGAAAGTGGGTAAGAAACACCCATTTGTGGGGTGGGACCCACTGGAACAATGTCTTTTATGTGAGAAGGCTGTAAAAAAGGACAAAAGCCCTCTGTGCTTACTGCAAAGTCAGCAGAAGTCAGCAGGGATGGGACAATGCTCTGTTTCGGAAATTGGCACTATTTTAACCTTGGGGTAAAAGTACCCATCTTCAATCCTCCATTTATCCACTGGATTTAAGACATTTTCGACTGAGCCTTTGTAAAAACTTTCCGCTGCCGTCAGAGAGAAACTTCCATGAACGTCCGGCGCCTTCCCCGTCAAGAACCGACTGTACGTTCCCACTGTTGCTACAAAAACCTGACCGTTATGTTCTAGCTTAACTTCCGTCAGGATAAAACCACAATAGTTACCTTTGACCTTAATAATGGCCTCCATGCTATCCACGAGGAAAGAGTTTGCATCGGCAAAGTCTAGAAAACCACCATAGCAGTTGCCGGTCGCTCTAAAGGTAGCTAGACTCCACATGTGAAGGGGTATCTTGCGGAAACGGATCCATCCTCCATAGGAGGGGATAGCGGTTGCTCTGCCGTGAAGAGTCGGATTCCACTCTTCTATCTTAATGGTCATGGACCAAAAGTGACCCAGCCCTTATTCGTCATTAAAGTTCTGGCTATCTCTTCTGTAGGGCACCTTAGCAACGCTTTATCTGGTTGGAACGGATTTATAATGAAAGACTCTTCTGTCTGCTGCTTGATAGCCTCAAGGATTCTCCCCCAATCATCGTGAAAGTCCCAACGCGTAATCACAATGGTTTCCTTCCAATTTAACGCCCTAACTTCTTGAGAGACCGCGGGCTCTTGCGTCTTTACCTTTGTTTCCCAACGTTTGGGACGATCCTTTAACGATCTCAGGAAAGGAGACACCTTTTTGTATTTTTTCAGTCGGGGGATGTCTGTTCCATTCTCTTACGTCATGTCCGTTGAACGCAGCAAGAATCAATTCTTTGAAGCTCTCCCAACCTTTCAAATTTTCACCTGCGGGGGCCACTATATTCTTCTTTCCTCCCCCATTCAATACCTTTGTAATCTACACATAACTGCCTCTTTTGTTCGTAGTTTTTTTGCATCCATAGGAAACCATCCTTGCAGTCAGTTTTTCGAAAGAACTTCTGAGCTATTAATTCCTCAATCGAAATACTGCAGACTCTCTTGATATGCCTTTCTGTAATTTTGAGAAACCGACCTGCCTTTTTTGAGTCGAAAGAACAGGAAAACTACTTGCTTTCTATCCTGACGGCCTTATGGCCCCCCATAGCTAAGGCTCAGCGGAGCTTTAACCTTGGAAGACATTGTCGGAGAAATGCACTAGAACGGGAAGGAGGGAAGGCGAGGGGCGAGAGAGCATTTCCATTTTCTAAAACTACTCTATATTTTGAAACTTCATAACTTTCATCCGCTTTTTTACCACCTGACCAGCATTAGTAAATAAGAAAAGGTAACTAAGAAATTTCAACTCAAAACCTGGATTGCTAAGAAAGCTTGACGAATTGATGTTAGACCCAAATCAAATTGGTTTCTGAGAGAATTCTCATCCAGAGGTTGCTAGTGGGTGGGTAGGCAAGAAAAAGATGGTCCCAATCCTCCTTGGCGTGTTTACAAAGGACGAACTAGCTTGGGAAGATGGCCATAGAGGGGATTCTCTTGTAAATATTGTCCAAAGTATTGATAGTATACTACCAAGGACCAGAGTCCATAAGATTTTATACTTTTTGGGAGCCTTTAAATTCCACAGACAGAAGGTGGGTGGCAGCAGCTGATCCTCCCGAGCGAAAGAAGCTTGGAGAGATTTCACAGAGATTGTCCCATCTTGTTCAAGGTTCCATAATCATCTGGGATAAGAGAAATGGCAGCTGGGCTTCTAATTTTTAGAAGACCCCCTTAGAGTCAGGGATTTGACTGACTGGAATTACTTCACCAACATTAGAAACAGTGAGCTTCTGAGTCCCGCACTGCGCAACTAGATGCAAATCTTGAAATAGATGAAAACTCAAATAATAGAGGGATTAATTTTACCTTATTAGTTCCGGATTTAATCTAATAAAATTAGAACTTTTTGGTTAAAATTGACCTACTATGCCTAATACAATGGTAAAGAATGATTTTCTACCTGTTTCTTACTTGTCATGTATGTCTTATGGAGTTAAATATTTATCCCCCCGGTATTGGTTGCTGATTAACTAATGTTTGTATTTTCTCCTTTCAGCTATCATTGAACTTTTTCATCTGCTTCCCATGGCTGCATCCAAGTTTCTTGATGAACTAGTAACATTGACTATTGATTTGGAAGGAGCTCTTCCCCCTGGCCAAGTGTATAGTGAAGTCAACAGTCCTTACCGTGTTCCACTAATTAAATTTTTGAACCGATATGCACCGCTTGCTGTTGATTACTTCCTTGCTCGACTTAGTGAACCAAAATACTTCAGACGGTAAGGTTTTGAATGTTGGTTTGTTTATGGAATAACTTTGGACTGACAATCAAAGAAAAATATACCAAAAATGCAATAATCTTGTTACCATTTGCTCATTCTGATTTTAGTTATGCTTGTACTATGTAATGCAGGTTTATGTATATTATCCGATCAGATGCAGGCCAGCCTCTGAGAGAAGAACTTGCAAAATCCCCACAAAAGATACTTGCTAGTGCCTTTCCTGAATTTGCACCTAAATCTGAAGCTGCGTTAACTCCAGGTTCTTCAACCTCACCTGCTCCTTTATCAGGTGATGAAGGCCTTGTAACTCCTGATGCTTCCGATCCTCCATCTGCACCCTCAAGTGTGGTTTCCGATGCTTATTTTCGTGGGCTTGCACTTATAAAAACTTTGGTGAAATTGATGCCTGGCTGGCTACAGAACAATCGTGTAGTTTTTGATACTTTGGTACTTGTCTGGAAGTCACCGGCTAGAATAGCTCGACTGCACAATGAGCAAGAGCTAAACCTAGTGCAAGTAAGATTATCACCTTCTAATTGATGTATAACTAATTTGAAGTTCTATAACTTTTAATCTTTGGATTTATATCTTGGTTGTTGGGCTAGAAGGATTCGTTCTTATTCATTTTCTTTCATTTACATCTTTCAGTTATCTCTAACTGTAATAGATCTTGGTTCTTGTAGCCTATTTGTTACTATTTGATTTAATAATATCTTGTAAATATTTTAACTTTTTTGGTCCGTTGAGCAAAATCAAAGTTGGTGCCGGGCTAGCCGAAAATCAAAGTTTATCAGAAGCTTGGTCTAAAATTCCTGAAAAATTAGCTTTTTATGATTACATCGGTAATAATCCCGCAAAAGGTGGATTATTCCGAGCGGGTTCCATGGACAACGGGGATGGAATAGCTGTTGGGTGGTTAGGACACATACTCATTAATCCACTTGAAAATGTTTTGTCACTAGTAATACTTCTCTTAGTGTTATAGAAGGACCTGCTTCAGAAATGTGTATTTTTCAATTTTCAATTTTATTTTTCTGTTCGAAACCTTTATGCTTGGTCAAGTTTCCCGTAGTCTCTCAAAGGCGGTTCTGTTCAGAGCGGCTTTCAAAGTCGGCCAAAAGGCAGTTCTCATTTCTTCCCTTTGGAGTGGGTCGGAGAAGCTGCTTCATTAATCATATTAAATAAATTTCTTCTTTTTATTCTTCCATTTCTTTCCTCTATTAGAATTTGTTCCAAAAAACTCTATATTTCTCTCTCTCTTGAGTGTGGGTCAATAGGAAGCTGCTTTATCAATCATATTAAATGAGATTTACAATTTTATTTGAAAATTTTATGTAACGGATTTTCTTAAATGTTGGGACATTATAATGTAGGTGAAAGAAAGCAAGTGGCTAGTCAAATGCTTCTTGAATTATCTGCGACACGAAAAAGCAGAAGTGAATGTGCTCTTTGACATACTTTCCATCTTTTTATTTCACACTCGAATTGACTATACGTTTCTGAAGGAGTTTTACATAATTGAGGTAATCTCATTCTCCAAGTTCTCATCGTGAATTTATTCTGTCCCCATCCTCCACATATTTGCAGTCATATAGAAATTGATATTTGATAAGATCACATTTTTTCATCTTTTTAATTGATACATGCAATCATTGATCTGTCTTTTAGGTTGCTGAAGGTTATCCACCCAACATGAAAAAAGCACTTCTCTTACATTTTCTAAACCTGTTTCAATCGAAACAACTTGGTCATGATCATTTGGTGATTGTAATGCAAATGCTAATTCTTCCTATGCTTGCTCATGCCTTCCAAAATGGGCAAAGTTGGGAGGTTGTAGATCAAGCTATAATTAAAACAATTGTCGACAAACTTCTTGATCCTCCAGAAGAGGTGATTTTTCAATTTATTGTTGTCTGCTTGAAACAGTTATTTTTTTTTCAATTTTCTTAATTATCTGAACTTCCTTTCTAGGTGTCTGCTGAGTACGATGAGCCCTTGAGAATAGAACTTTTGCAGCTTGCAACCTTACTTCTCAAGTATCTTCAGAGCGACCTGGTCCATCACAGGAAAGAACTAATCAAGTTTGGTTGGAACCATCTCAAAAGGGAAGATAGTGCCAGTAAGCAGTGGGCATTTGTGAATGTCTGCCATTTCTTGGAGGCTTATCAAGCACCTGAAAAAATTATACTTCAGGTATTGAATTATAGACTGTTGAGATGGTTTTTTTTCTTTTTCTTCGTACAATGTATAACACTAATATATTGGAAAAGGCTTGACAATACACGGTAAATGAGGGAATCAGAAAGTTTCCTAAAAATAGTACAAGGAAAATATAAAGAAATAAGGAAAGATTACAATTGTAACCAGTGGGAAATAATAATAACAATGACAACAATTATAATAAAGATTATATAATGCAGACCAGGACAAGAAACTAATTTTCCTTTGATGTTGAGAATGAAGAGAGAAAATCTCCCTATAATCAAGTATTTCTTCGACATTACCCTCAAGCTGTGTATATGTTGAGCAAGTCAAGCTTGAACTTCAAATCTTCAAACTTGGTTCTAGGTAGGGGCTTGGTTTTTTTTTTTTGAACCAGAGACAACTTTTCATCAACGAAAAGGAAAATTGTTTAGGATACAATCCGTTGAAGGATCGAAAGAAAAAGAAAAACAAAAGAATCAAAAGACAAGCTCAACGATGAGCAAAAATAAAGAACAATATAAACCAATAACAGCCTTGGTGAGAAGAGACTATTTGAAAGCTGGATAATATATAAATTAGAATGATGATCTCCTCCTCAACCTTGTCTTTAAGGAATTCTTGGTCTATATCAACTTGTTTTTTACAACTTCTATAACCCTTTGCAATATTTTGATAGTTTCCTGATTATTACATAATAGTTCTTTAGGTCATTCCATTGACATTTTAAATTCCTCTTGCATTTTGTCTAATCACAGGCCGTCGCATATTTGACGTGCCATTTCTCTAAATATATATTTTCTTTTTACCGGTAGAAACAAGAATTTTAAAGTACAAAAGGAAGGGAAAAAACCCAAGGGAGCGTGAAAAATGTATCCAAGTGGTATCTATCGCAAATAGAAGAAAGTGTTCCAAAGGAAGTGAGAAAATGAATAAGATGCCACATTTTAAAGGAATCTCTAAGCCCTATTTCAAGATTTTCTTGAGTTCCTCTCCAAGTTTCTCATAGAAGTGCACATGGAACCAAAGACTATGAACTTGTAGCCTTTGACTTCAGCAACACACCACAAACTAGCTTTCAACACATAATTCAAAATTCCCACAGAAGCAGCAAACCATGATAAACGCTTTGAATAAACCCTCCCTACCTTTTAGAATGCTGACAGAGATGGCACCACTACAGAGATAAAACTAAATACTGAGTTTCTTCTAAATACGGTCTGTCATGTGTAAGTCTCCTCTATAAAGGGTAGACACAAAAACCTTAACTTTTGTAGGTATCTTAGTTTTGCATGTAGCCCTAAATGGGTACTTTGACGAGAAGCCATTGGAATGTGCCTCAGGGATGGAAAGGATAAAGCAGATTTGATAGAGAGCCCCTTTATAAACTGGAACCATCTTTGCAAGTCTCCTAACCATTACGCTGAATAATAATGCAATAAGACCAGCTCATTCTTGAGTTTCTCTATCATATAAGGCCCTTTCTAAATCTGCTATTCCACCTCCTTTCCACTACCCTCCAACAATGAACTATATTTGTTCGACACGCACACATCCATATATATTTTTGAAAAAATTGTTTGGCATATAATTTAAAACCATAATCACTAATCAAATGGAAGAACTAAACAATAACATTGATGATATTTTTTTCATCATCAAATTAAGAGGATCAACCAAGTAGTTAAAAGTGTTTTTTATTTTTGGTTAAAAAAAATACCTTTTGAGCTTTTTATTTGAACCTTATTGGGCAAATGGATGATGAGACGGGGTATGTCTTGAACTCATCACAAATGATGTCCACATTGTTGGCTATGAATGATGGATGCCAGCATACATCCTTATCGACTATCAAATTATGGAATTGTTCTTGGTGAAAGTATCCCAACTATGATGTGCATGTGGATGTGCAGGTTTTTGTAGCACTTCTTAGAACTTGTCAACCCGAGAATAAAATGTTGGTCAAGCAGGCCCTTGATATACTAATGCCAGCCCTACCACGGAGATTGCCTCTTGGTGATTCTCGGATGCCAATTTGGATAAGATACACAAAAAAGATACTGGTCGAGGAGGGCCACTCAATTCCTAATTTGATTCACATTTTTCAACTCATTGTGCGGCATTCAGATCTCTTCTACAGCTGCAGAGCTCAATTCGTTCCACAGATGGTGAATTCTCTCAGTCGTCTAGGATTACCTTACAATACTACAGCAGAAAATAGGAGACTTGCAATTGATCTTGCTGGATTGGTGGTTGGTTGGGAACGTCAACGACAAAATGAAATGAAGCTTGTCACTGAAAGCGATGCCCCTAGCCACAGCAATGATGGATTAACATGTCCTCCTGGTGCTGATCCCAAGCGTATGGTCGATGGTTCTACATTCCCCGAAGATTCAACCAAGCGGGTCAAGGTTGAACCGGGTCTTCAATCCCTCTGTGTCATGTCTCCTGGTGGTGCATCATCAATGCCGAATATAGAGACCCCTGGGTCAACAACACAACCTGATGAAGAATTTAAACCAAATGCTGCAATGGAGGAAATGATTATTAATTTTCTGATAAGGGTAAGCACATATTATCAGTAGTTCGCCATTGTTGCTTCAATTATTGTCATCCTCATTTTCCAGTTCTCTTGCCTCATTGAGTAAATAGTTGTCCATCTTGTCCCTGGATATTTTGCATTACTACCATGATTGACCTTAATGTAGCCCTGGAGTATGTGTGTCATATAATCTTCTATTTTTCATGTTCAGTTTCTCTCGTTATTTCATTTTTTTTGGGAGTAACAGAAGACCCATTTTTCATCAACTGGTCTTCCTAGCTATTGGCTATTTTATTTGCTACAGCATTTTATAGAAGAAAAACATGTCAAACAACTGAAATTAGAACGGTTCTGGAAATGCCATCTTTGCGGTGCATCATGATCTTGTTTTGGATTGATATGTAGTCTTTATTAGGCTTTTTAACAGCTATACTTATTATGGATAAAGGTTGCGCTGGTTATAGAGCCTAAAGACAAGGAGGCAACTGCCATGTATAAACAAGCACTAGAGCTACTCTCACAGGCTTTAGAGGTGTGGCCAAATGCCAATGTCAAATTCAATTACCTGGAGAAGTTGCTCAGTAGCATCCAGCCATCTCAGTCCAAGGACCCTTCCACTGCTCTTGCACAGGGCTTAGATGTAATGAACAAAGTTTTAGAGAAGCAGCCGCATCTGTTTGTCAGAAATAATATCAATCAGATATCTCAAGTATGTTACTGAGCCATATTGTTACTGTAAAAACTTAGTATTTGCTTAAATTGAACGAAAATCATGTTTCTCATCTCACTGGACAGATTTTAGAACCCTGTTTTAAGCACAAGATGTTAGATGCTGGGAAATCATTATGCTCCTTGCTGAGGATGGTATTTGTGGCTTATCCATTGGAAGGGGTCACAACACCACCAGATGTGAAGTTGTTGTATCAGAAAGTTGACGAGCTCATAAAGAATCACATTAATAATTTAACAGCCCCTCAAACATCATCTGAAGATAACACTGCTTCTTCCATTAGCTTTGTCCTGCTTGTAATTAAAACTTTGACAGAAGTTCAGAAGAACCTAATTGATCCATATAACCTGGGTCGTATTCTTCAGCGCCTAGCACGAGACATGGGGTCGTCAGCAGGTTCTCATTTGAGACAAGTATGCAATCATCACTTATTGTTCTCTCTGAACAGACATTATTGTCTTCTATTGCATTTTTAGCTTTCCACACAGTGTTCCTAAACTAACATTGAGTGTGTTTTATTCATGTCTGAGGTTTTCTCCCATTTGGAGCATTAAATTGGACGTTTACTGGGTTGTTTTTGCTATTGTTAATTACAATTAATTAATGTTCCGGCACTAAATAGAAATTTCCCATGTTCTGGTAGTATTTTGGTCTTTTTACAACAGGCATTCCTCATCCGGAACATGATTCTTTTCCCTCTTTTCACATTTTTTAGGCTGTTCATTGGAAATTTTTAGGATATATGCTATTAATATTATAAATTAAAGTTTCCGAGAGAAATCAGATAACCTCTCCGTAATTTCTTCTGTGGAAAACCGTGTCCAGTCTTATATTTTTATAGTATTAGCCAATTTACAGATTACTGCATGGACGTCAACCTGAGGTTTATGTGGATAAAGAGGACCTATGTACTTTAGGCAACTAAAATAATGGTAAAAGTAACTAAAGGTTACACAAACAAATCTTTTCTTTTTTTAACAAGAAACAACTTTTATATCAATATAAAAGTTCAAGAATACAAACTCCTAGAGGAGTGAGAGAAAAAAAGTACACAAACAAATCTTTAACGTTAGAAGATAAATGAATCTCTTTTGTAGAAGTCGTAATTTCAATAAAAATTATTATTACATGCTGTACCCCTTTGAATTTCTTACCTTTTTAATGACTGATGATTTAGGTATTACTTGAATAAGCTGGGTCTCTCTTCTGTAACTATCTAGCTGTTTGTTTACTGTAAAAGTTGTGTACTTCTATCCTCTGGGATGATCTTGAATTTAGGTGTGGTACTTGTGAATAGCATTTGTTTATTTGTAATAATTTCTAGGTTTTAGTCATAGATGAAATGCAATATCAGTTTGTTGTATGGCCATTTCAGTAGTAGTAAATCTTAAAATCAACCGTGAATTGTGTCTGCTGTTGCTTATACAAATGAATGGTCAGCGTGGCTGTATTATCTTTTAGGTTTTTTCCTTTCTGTGTGCTTTTAGAGCTTCTGATATTGGGAGTTCCAATTTCCAAACCATCTTTATGTTATGATAAACTTATTATTTTCTAATTTATTGGTTGGAGAGCTTTTTTGTACCTTCTTTAGTATCTTGGTTGGTTTGGCCCTAGTCATCCCTCCCCCTTGTGTACTTCTTGTGAAATATGGATATTTCTTTCAAAAAAAGTTAAGAAATATATTTGAGCTTATGATTAATATGCAATTTCATTTTATACTTTAGATTATCTTTAGACCTATAATGTACGGTTTATTTCTATATTTGTTACTCTCTATTGACATCATACGTTACACTGACAAGTTTAACATCATCGTTAGGGTCAAAGGATGGATCCTGATTCTGCAGTAACTTCTTCTCGCCAAAGTGCTGATGTCGGAACAGTCATCTCTAATTTGAAATCAGTTCTGAAGCTCATTAATGAAAGAGTCATGCTTGTTCCTGAGTGCAAACGATCGGTAACTCAGATCATGAACTCCCTGTTGTCAGAAAAAGGCACCGACGCTAGTGTGTTACTTTGCATACTTGATGTGATAAAAGGGTGGGTTGAGGACGACTTCAGTAAGATGGGCACATCTGTCTCGTCTAGTTCTTTTCTTGCTCCCAAGGAAATTGTCTCTTTCCTTCAGAAGCTATCACAAGTGGATAAGCAAAACTTCTCTTCAAGTGCTGCTGAAGAGTGGGATGGAAAATATCTGCAGCTCCTTTATGAAATTTGTGCCGATTCAAATAAGTGAGTGCTATATCTTGAACTTTCCATACGCTTGGATAAAGTTATAAGATTATAAGTTCTTTTTTTTTCCCTCTAGATATCCATTGTCTCTGCGCCAAGAAGTATTTCAGAAGGTTGAACGACAGTTCATGCTGGGTCTGAGGGCTAGGGATCCTGAAACTAGAAAGAAATTCTTCACACTATATCATGAATCACTGGGGAAAACATTGTTTATAAGGCTGCAATACATCATCCAGATTCAGGACTGGGAAGCTTTAAGTGATGTATTCTGGCTCAAACAGGGCCTCGATCTCCTTTTGGCAGTCTTAGTTGAGGATAAACCTATAACCCTTGCACCAAACTCTGCGAGGTTGCCACCACTTCTGGTATCTGGTCATGTTGCAGATTCCTCTGCAGTGCAGCCCCAAGTTAATGACGCTCAAGAGGGTCTTGAGGATGCCCCTTTAACATTTGATTCCCTTGTTCATAAGCATGCACAATTTTTAAACCGGACGAGTAAACTTCAGGTAATTGGATGGCTTGTTCATTACAAATTAATTAACATTCTTTCGTTTATTCTTGCCTCTGGTTATTCATCAAAGACACTACCAGTTAGTGTAATTGTTTAAACGGACTACAGTTATTGTTTTGAAGTGGTTTTTGGCTGTTACTCAATTCTGTTGATTTGTTTTGGTAAATTAGTCTGTACTGATGCCAGAGGTAAAGTTTAACATGTTTTACTCTCTTTAGGTCGCTGATCTTATTATACCATTGAGAGAACTGGCCCACACAGATGCAAATGTTGCGTACCATCTATGGGTTCTGGTTTTTCCTATTGTCTGGGTAACATTGCATAAGGAAGAACAGGTGGCACTGGCCAAACCAATGATTAGTCTCCTGTCAAAGGATTATCATAAGAAACAGCAAGCAAGCCGACCAAATGTTGTGCAGGCACTTCTAGAAGGGCTTCAGCTGAGTCATCCTCAGCCTCGGATGCCGAGTGAGCTCATTAAATATATTGGCAGGACTTACAATGCATGGCATATAGCACTAGCTCTTCTGGAAAGTCATGTTATGTTGTTCATGAATGAGACGAAGTGCTCTGAGTCTCTGGCTGAGCTATATCGTTTACTAAATGAGGAAGATATGAGGTGTGGATTGTGGAAGAGAAAGGCAATCTCTGCAGAAACTAAAGCTGGGCTTTCACTTGTTCAGCATGGTTACTGGCAGCGTGCTCAAATCCTTTTTTATCAATCGATGGTTAAAGCAACTCAAGGTACATATAATAACAATGTACCAAAGGCTGAGATGTGTCTTTGGGAAGAACAGTGGCTTTACTGTGCTAGCCAACTTAGTCAATGGGAAGCTTTGGTGGACTTTGGGAAGAGCATTGAAAATTATGAAATACTGCTCGACAGTCTATGGAAAGTGCCTGATTGGGCATACATGAAAGAGCATGTTATTCCAAAAGCACAAGTAGAAGAAACCCCAAAACTTCGTCTAATTCAAGCATACTTTTCTCTTCATGATAGGAGTACAAATGGTGTTGCAGATGCGGAAAATATAGTTGGAAAAGGAGTTGACCTTGCTTTAGAACAATGGTGGCAGTTGCCTGAAATGTCTGTTCATGCCAGGATTCCACTTTTGCAACAATTCCAGCAGCTAGTTGAAGTGCAGGAATCATCTAGAATTCTTGTTGATATAGCCAATGGAAACAAACATTCTGGAAGTTCTGTTGTTAGTGTGCATACAAATCTCTATGCAGATCTAAAGGATATCCTTGAGACTTGGAGACTGCGAATTCCAAATGAATGGGATAGTATGACTGTTTGGTGTGATTTACTACAATGGAGGAATGAGATGTATAATGCTGTAATTGATGCGTTTAAGGATTTTGGCACCACAAATTCCCAACTTCATCACCTTGGTTTTCGTGACAAAGCATGGAATGTCAATAAGCTTGCTCATGTTGCCCGTAAACAAGGACTTCATGATGTTTGTGTAGGAATACTTGAAAAGATGTATGGTCATTCAACCATGGAAGTGCAGGTATTATGAAGTATTTCCAACTTCACGCGACCTTTAAGAAATATTGGTATTGGTTTACAATCTATGGTTTATATAAAAGATTGAGATGTTTATATCCTATATTTTGGAACTTTTAACTGTTGTTGAGTTATTTATTCCTCAATAGTGTAATCTTCAGATCACCCTGTTATTATTAGTCATCTTCTTAAAAATACCCTCATGGAAGTGAATGAGATCAATTTTTCTTAAGATTATATGGTGCAAGTTGCATCAGCTCCCTTTCTCCGTTCATGTTATTTTATTGGTTGTTTGGGGGTGTGGAACAGTAGGATGTGGAGACTCACCGGGTTCTCTGTTGCATGTTTTTAGGAGGCTTTTGTGAAGATAAGAGAACAGGCAAAGGCTTACTTGGAGATGAAGGGAGAGCTCACCAGTGGTCTGAATCTGATCAACAGCACTAATTTAGATTATTTTCCTGTGAAACACAAAGCAGAAATTTTTCGTCTCAAGGGAGATTTCCAGCTGAAGTTAAGTGATTCTGAAGGTGCTAATCATTCATACTCCAGCGCCATAAGTCTTTTCAAGAATTTGCCCAAGGGATGGATAAGCTGGGGGAATTATTGTGACATGGTATTCCTTTTATATGCCTTTTTTAATATATTCCAGTTTATGAGATGTGCTCGCAACTGAGATTGGGTATATTTCCTTTCAGGCTTACAAAGAATCCCATGAAGAGATTTGGTTGGAATATGCTGTTAGTTGCTTTCTTCAAGGCATTAAATTTGGCATTTCCAACTCAAGAAACCATCTAGCCCGTGTATTATATCTTCTCAGCTTTGATACCCCCAATGAGCCTGTGGGCCGAGCATTTGACAAGTATTTGGACCAAATACCCCATTGGGTATGGCTGTCCTGGATTCCTCAACTCTTACTTTCCTTGCAAAGGACAGAAGCACCTCACTGTAAACTTGTTCTTCTGAAAATTGCGAACGTTTATCCACAGGTGAATCTTGTGTCATTTCTTAAATACACTCATTACACCTATTTTCTATTATATGATAGTTACCTGTAAAATTTCAGGCACTGTATTACTGGCTTCGCACTTATTTGCTTGAAAGACGAGATGTTGCAAATAAGTCTGAGCTAGGTAGGATGGCAATGGCTCAACAAAGAATGCAGCAAAATACCAGTTCTGCTGGTTCTCTTGGCTTGACCGATGGAAGTTCTAGAGTGGCTCATGGTGGCAGTTCTACCTCTACTGATAACCAAGTCCACCAAGGCACTCAATCAGGTAGTGGAATTGGATCTCATGATGGTGGGAATTCTCATAGCCAGGAACCTGAAAGGTCCACTGGTGTGGAAAGCAGCACACATGCTGGAAATGATCAATCTCTTCCACAAACTTCTTCAAATGTCAATGAAGGTACTCAAAATGCATTAAGGCGCAGTGCTGCCTTGGGCTTAGTGGGTTCTGCTGCTAGTGCATTTGATGCTGCAAAGGATATCATGGAGGCTCTTAGAAGCAAGCACACTAACTTGGCTAGTGAACTTGAGGTAACGGTCATTTCTTTAGATTTACCTTTCCTTCCTCTATTTCTCTCTGCTGCTTTCTATCAGTCAGAGTGACAGTTACGGTGGTCAACTGTTATTTTTTTTATACTAAAATTTTCTTGCTTCTTTACTCCTCTTTGTCATGATGAATTTATGAATACCGTGGCAATTAATATTTATCTCATATTTAAAATCCTTTTATCTTTGTTTTCCTTTTCTTTGAACAAGTGGGAGGACATGTTCTGAATCTTATTTTACAGCTCAAGAAAGTTACTAGTGCCTTAGGGCGGGTGCGGGTTTGGTATAGAGAAAGGAGAGTTGAACGGATAAGGAATATTAGCTCTATTGATGTAGGGTTTTTTTTCTTTTAAATATTTAAGATTAATATTAATCTATATTAGCTCTCTTGATGTAGGGTTTTTTTTCTTTTAAATATTAAAGATTAATATTAATCTATAGTACGTCCAAGACGGATCACTTGACACACCTACCTGAGGTTTTTAGCACCTTAACAAAAAATGAACTATATATTAATCTTAAAAACTGCACTTTTCTATCATATAAAATTGGTTTCTTGGGCTTTGTGATTAGAAAGGAGGCTTCAGCTGGAGGATCCCCAATTAGAAAACTTTAACACACTCAAATGAAAGCTTAGTTACACACCTGTATTAGCCCTTCCAAATTTTTGTAAACCATATGAGGTTGTCGTAGATGCTTCAAGTATGGGAATAGAAGATGTGTTATCACAAGAGGGTTAGGGAAAAGTTAAGTGATTCAAGACAAAAATGAGTACATACGAACAAGAACTACACTCTTATTTGTTCTCTAAAACAATGGGAACACTACATCATTGGAAAGGAATTTATATTATCACGGATCCCTACTCACTCGAATTCTTACATCTCCAAAAGAGTATTAGCAGAATGCATGCTCGATGGATCACATTCAATCATAGATTTGACTTATCCGGCATCTCCAATAAAGCCACGGATGCTCCAAGTAGAAAAGAGGCTGAAAAGATGGCTGATCAGATTCAAAAACTACACAAAGAGGTCAAAGAACACCTGGACGAAGCAAATGGCAAGTAGAAGCACACGTGGATGCTCAAAAGTGTGCGGAATCCTTCAAATTAGGTGACTTGGTGGTGATTAATTTGTGTAAATCAAGATTACCAACAGGATCATGTAACAAATTAACCAATAAGAAGCTTGGTCCTTCCAAGGTCCTGGACAATATTGGTGAAATTGCCTACAGGATTGATCTACTCTCCTATCTCCACATCAATCCTACCTTCAATGTGGTGAATCCATATGATCATGTTCCAAATTCGTTTACATTGGCTACCTAAAACTTGAGAACGAGTTAATTTTAGGGGGAAGAATTTGATGTAGGGGTTTTAGGATTTAGTTTACACCCATCGATATAGACTAGCCGCTAGAGTTAGTTGTTTCTTTTATAGGCGTCGTTAGTGGCTTTAAAGTTGGTTAGTTAATTAATTAGTCTGGTGTTAGTTATGTTGGCTTATTATAAATAGACCCCATTTCTTGTAATTGAAGGCTATTGGAGATTAGTAACAAATTTCCAGCAACTTGAATGCATGCTCTCCCTTTATCCATTGGTATTACAAAAAGGTTGTCCTTGGATGGTAAGTTGGGTTTCTTTTTCTTTTCCTGTTTTTTATTAATCCCTCACATTGTAAACTCTACTTTTGTTTATGCTATTCAACTAGTGAACAGCTGTGATTGAGGAATTAATGGTAACACAAGGATTTGAATACAGGATGTCCTACTTTCATCTCATGTTAAATTACTAAGTCTGTATGCCTATATTTACTTTTCCTAGCCCTAGTAGATGATGATGCATAATGAATATTTCCTTTTGCAGATTCTACTCACAGAAATTGGTTCGAGGTTTGTTACTTTGCCAGAAGAGAGACTTCTTGCTGTGGTTAATGCATTGCTCCATCGTTGCTACAAGTATCCCACTGCTACGACAGCTGAGGTTCCTCAATCTCTGAAAAAGGAGCTCTCTGGAGTTTGTAAGGCTTGCTTCTCAGCTGACGCGGTTAACAAGCATGTTGATTTTGTGAGGGAGTACAAGCAGGATTTTGAGCGTGATCTTGATCCAGAGAGCACTTCCACTTTCCCAGCAACTCTTTCTGAATTGACTGAGCGATTGAAACACTGGAAAAATGTTCTCCAGGGAAATGTTGAGGACAGGTTTCCTGCAGTCTTGAGATTAGAAGACGAAAGCCGGGTATTGCGTGACTTCCACGTTGTTGATGTGGAGGTACCAGGACAATATTTTACTGACCAGGTATATTGAACAGGTTTTTTAAATGTTCTCATCAGTGTTTTTAAAGTGTGTGTTATACTAGAACTCAATAGCTCATTAGCTCTTTAGGGTTTAGTGAGAAACACAAATCTATTTTTAGAAGCCGTGATAGGAATTAAATATTCTACATATGAAAACTACATAGTAAGATAACAAAAAGTTTAAGAAATAAAGTCTTTAGTAAGAGATTGTTTAAAGATGGATGGATTTAGCAACATATTTTCTTATTTATTTTTTAATATTATTAAAAATAAAATTTAAAAAAATCCACAGGGACATGCTTTTTGCACCAAGGCACACACCTCAAATGAAGGTGCTTCGCCTCAAAATATAAGGTGCCCGATGGTGCTTGGTGCTTAGGTGCGCCAATGTATCCAACTATTATAATGAACTGGTTTTCCCCCCTGAAGGAGTTTTAGTAAGAGGTATCTCTGTTGTTGCAAGCCATACTTAGGGGTGTCTATTATAATGTATTTCTCCCATGAAGAAATTTTAGTTAAGATTTATTTGAGTTGTATGGTTGTGAGTTTTAAATTGTTAGGCCTTGGCTATTTATATTTAATAATCGATTGAGCTATTTTTATTAGAAGAAACAAATACCTCAAAGCTGGCATCTCTGTTCTGTTCTGTTGATGCATGATGACATGCTTCTGTTACCATTATCAGGAAATTGCACCTGACCATACAGTGAAGTTAGACAGAGTTGGAGCAGACATTCCAATTGTCCGAAGACATGGGAGTAGTTTTAGACGCTTGACTTTAATTGGTTCAGATGGTTCTCAGCGTCATTTTATAGTCCAAACTTCCTTGACTCCTAATGCTAGAAGTGATGAGCGCATTTTGCAACTTTTCCGAGTTATGAATCAAATGTTTGATAAGCACAAGGAATCAAGACGCCGCCACTTGTGTATTCACACTCCAATCATTATCCCTGTTTGGTCACAGGTTAGAATCCGTTTTCTATGGATTTGATTCACTTAAATGCTAAGAATCGTCTGACTGCATCTTCATAAAAATTATTACATCGAGTTTTTCCCCCTAATTATGGTCATTGCATATATAGGTTCGCATGGTGGAAGATGATTTGATGTATAGCACTTTTCTTGAGGTGTATGAAAATCATTGTGCAAGAAACGACCAAGAGGCAGATCTTCCAATTACATATTTCAAAGAGCAACTGAATCAGGCCATATCTGGCCAAATCGCGCCGGAAGCTGTTCTAGATCTCCGCCTCCAAGCTTATGGTGATATAACAAGAAATCTTGTAAACGAGGGTATATTTTCACAGTATATGTACAAGACATTACTCAGTGGAAATCACATGTGGGCTTTTAAAAAACAATTTGCCATCCAATTAGCCCTTTCGAGTTTTATGTCTTACATGCTACAGATCGGGGGAAGGTCACCCAACAAGATTTATTTTGCTAAGAACACTGGGAAAATTTTCCAAACAGATTTTCATCCCGCATATGATACAAATGGTATGATCGAGTTCAATGAACCAGTTCCTTTCAGGTTAACTAGGAATATGCAAGCCTTCTTCTCTCATTTCGGAGTGGAAGGTTTAATTGTTTCTGCCATGTGTTCTGCTGCCCAGGCAGTTGTTTCACCGAAGGTAGGTCTTGGAAGTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTGTTTTTTTTTTTTTTTTTTTTTTTCTTTTTTCTCCCATAGCTGTGTTATACAGTTGTTGGTAACATGTGATTCACAAATCTTGATTTTCAAAATTTCAGCAAAATCAGCACTTGTGGCATCAACTTGCTATGTTTTTCCGTGATGAGCTACTTTCATGGTCTTGGCGGAGACCCCTCGGTATGCCTTTGGCATCCATTGCAGGTGGTGGTATGAGCCCTGCTGACTTCAAGCAGAAGGTTACTATCAACGTTGATCATGTCATTGGCCGGATTAATGGAATAGCACCACAATATTTCTCTGAGGAAGTAAGCCCTCTAACCTCAATCCCACTTAACTCTCAAACGCCTATTTTCAAAACTCAAAAGCATGCATCACTTTGCTTTCTTTCCTTTCATTCTAAGTTTATGGGTTTGGCGTGTATATATAGTTCCTGTATATAGAACTGTCTGATCTGGGAATTTTAGCATTCGATTAGGAAGTCATCACTCATCAAGTTGGGAATTTTATCACATTGCTTTTCTGTCTTTTTACCGCCTGCAATTTCCCTTGACCTTACACGCTTACTTCAGGAGGAGAACGCCATGGACCCACCGCAGTCGGTACAAAGGGGTGTGTCGGACCTGGTCGATGCTGCCTTGATGCCAAGGCATCTGTGTATGATGGACCCAACATGGCATCCTTGGTTT

mRNA sequence

ATGAGTCCCATTCAGAATTTCGAGCTGCATTCCCGCCAACTCGTCGAGCCTGAACTCAGTATTCAGACGAGGCTTCAGATGGCAACGGAAGTTCGAGATAGCCTGGAGATTGCTCATACTCCCGAGTACTTAAATTTTCTGAAATGCTACTTTCGGGCGTTCTCTATAATCCTAGTTCAGATTACAAAGCCTCAATATACTGACAATCATGAACACAAACTTCGGAACATCGTGGTGGAGATCCTTAATCGTCTTCCTCACAGTGAAGTTTTAAGACCTTTCGTGCAGGACCTGTTAAAGGTCGCCATGCAAGTGCTTACCACAGATAACGAGGAGAACGGCTTAATTTGTATCCGCATAATATTTGATCTCCTCAGAAATTTTAGACCAACTCTAGAAAATGAAGTGCAGCCATTCCTGGACTTTGTGTGCAAAATTTACCAGAACTTTAAGTTGACCGTAAGCCATTTCTTTGAAAATTCGGCTGCTGGTGGCGAAGATATAAAGCCCATGGATGTATCCACTTCAACGGACCAGACAATTACTACCGGCTACACGGGGACTGTGCAGCTTAATCCTAGTACCCGTTCATTTAAGATAGTAACGGAGAGTCCACTTGTTGTCATGTTTCTCTTCCAACTGTATAGTCGGCTTGTTCAAACAAATATCCCTGTCTTGTTGCCTCTGATGGTTTCTGCTATTTCTGTTCCGGGACCTGAAAAGGTTCCTCCCTTTTTGAAGACTCATTTCATTGAACTGAAGGGTGCGCAGGTTAAGACAGTTTCTTTTTTAACATATTTGCTGAGGAGTTCTGCTGATTATATCAGGCCACACGAAGAAAGTATTTGTAAGAGTATTGTGAATTTGCTGGTTACATGTTCAGATTCCGTGTCAATTCGGAAAGAATTGTTAGTAGCCCTGAAACATGTTCTTGGAACAGAGTATAAGAGGGGCTTATTTCCTTTGATTGATACACTGTTGGAAGAGAAGGTTCTAGTGGGAACTGGTCGGGCATGCTATGAGACATTAAGACCATTAGCCTATAGTTTACTGGCAGAAATTGTGCATCATGTCAGGGGGGATCTTACCCTATCTCAGCTATCACGGATTATCTACTTGTTCTCAAGTAATATGCATGATGCCTCACTATCACTTAGCATTCATACTACTTGTGCACGATTGATGCTGAACTTGGTGGAGCCAATCTTTGAGAAGGGTGTTGACCAAACTTCTATGGATGAAGCACGAATTCTCTTGGGGCGTATTTTGGATGCTTTTGTTGGGAAGTTTAGTACGTTCAAGCATACCATTCCTCAGTTATTGGAGGAAGGTGAGGAGGGAAAAGATCGTGCAAATATGAGGTCAAAGCTTGAGCTCCCTGTGCAGGCAGTTTTAAATTTGCAGGTCCCTGTGGAACATTCTAAGGAAGTCAATGACTGTAAGCATTTGATTAAGACGTTGATCTTGGGAATGAAGACGATCATATGGAGCATCACTCATGCACATTTACCCCGACCTCAGGCTTCGCCATCTCCAAATGGAACACATCCACAGATGCTTGTTTCACCATCATCAAATTTGGCAACGCCTCAAGCATTCAAGGGAATGAGAGAGGACGAGGTGTGTAAAGCCTCTGGTGTCCTGAAAAGTGGTGTTCATTGCTTAACACTTTTCAAGGAAAAGGATGAAGAAGTAGAAATGCTTCATCTTTTCTCCCAGATATTGACTGTAATGGAACCTCGGGATCTGATGGACATGTTTTCATTGTGTATGCCTGAACTTTTTGACTGCATGATCACCAACACACAGCTGGTCCATCTGTTTTCAACATTTTTGCAAACACCTAAAGTATATAGGCCATTTGCGGATGTTTTGGTTAATTTTCTTGTCAGCAGTAAACTTGATGTTTTGAAGCACCCAGATTCACCGGGGGCAAAATTGGTCTTGCATCTCTTTCGTTTTGTATTTGGTGCTGTTGCTAAAGCACCATCAGATTTTGAGCGTATTTTACAGCCTCATGTGACTGTCATAATGGAAGTTTGTGTAAGAAGTGCTACTGAAGTTGAAAGACCGCTTGGGTACATGCAACTTCTTCGCATCATGTTTCGGGCATTGGCAGGGTGTAAATTTGAACTTTTACTACGTGATCTGATTCCTTTGCTACAACCTTGCCTTAACATGTTACTGACCATGTTTGATGGTCCAACTGGGGAAGATATGAGGGATCTGTTGTTGGAATTATGTCTCACATTGCCTGCACGCTTAAGCTCATTATTACCTCACCTTCCACGTTTGATGAAGCCTCTTGTTTTGTGTCTTAAAGGAAGTGACGACTTAGTTAGTCTAGGTTTGCGAACCCTCGAGTTCTGGGTCGATAGTTTAAATCCTGACTTCCTAGAACCGAGTATGGCAAATGTGATGTCTGAAGTGATTTTAGCCTTATGGTCTCATTTGAGGCCAATACCCTATCCTTGGGGTGCAAAAGCTTTGCAAGTTCTCGGAAAGTTAGGTGGTCGCAATAGACGTTTTCTGAAAGAGCCACTTGCACTAGAATGCAAGGAGAATCCAGAACACGGGCTTCGTTTAATTCTTACCTTTGAGCCATCTACTCCCTTTTTGGTGCCATTGGATAGATGCATTAATCTTGCTGTATCAACTGTAATGAATAAAACTGGTGGTGTTGATTCTTTCTACAGAAAACAAGCTTTGAAATTTCTTCGGGTCTGTTTATCTTCTCAGCTTAATTTGCCTGGAAATGTGGCTGATGATGGCCATACACCCAGACAATTGTCAACTTTACTAGTTTCTCCTGTTGATTCCTCTTTGAGAAGGTCTGAGACTCCCGAGGGAAAGGCTGATTTGGGTGTAAAGACAAAAACCCAACTTATGGCTGAGAAATCTGTTTTCAAAATTCTATTGATGACCATTATTGCTGCTGGTTCAGAGGAGGATCTCCACGAGCCAAAGGATGATTTTGTTCTCAATGTATGCCGCCATTTTGCTATACTATTCCATATTGATTCTTCTCTAAACAGTTCTCCAGTTGCATCTGCCTCACTTGGGAGTACTTTGCTTCCTCCAAACGTCAGTGCCAATTCCAGATTAAGAAGTAGTGCTTGTTGTAACCTCAAAGAGTTAGACCCTCTCATTTTTTTGGATGCCTTGGTTGAGGTGCTGGCGGATGAAAACAGGGTCCATGCAAAAGCTGCTCTGAATGCTCTAAATTTGTTCTCTGAAATTCTTCTTTTCCTTGCTCGTGCAAAACAAACTGATGTGATGATGACAAGAGGGCCCAGCACCCCAATGATTGTTTCCAGTCCATCAAAGAGCCCTGTATATTCACCACCTCCAAGTGTCCGTATTCCAGTTTTTGAGCAACTCTTGCCACGGCTTTTGCATTGTTGTTATGGCAGCACATGGCAAGCCCAGATGGGTGGTATTATGGGACTTGGTGCTTTGGTTGGAAAGGTTACTGTTGAGACTCTGTGTCTTTTCCAAGTAAGAATTGTGCGAGGCCTGGTATATGTTCTGAAAAGGCTGCCAATTTATGCTAGTAAGGAGCAAGAGGAGACTAGCCAAGTACTCAATCAGGTTCTTCGTGTTGTGAATAATGTTGATGAAGCAAATAGTGAACCGCGCAGACAAAGCTTTCATGGGGTAGTAGATATTCTTGCTTCTGAGTTGTTTAATCCCAATTCATCAACTATCGTGAGAAAGAATGTGCAGTCATGTTTAGCTCTTTTGGCCAGTAGGACTGGTAGTGAGGTGTCTGAGTTGCTTGAACCTCTGCATCAACCTTTGCTTCAGCCTCTCTTATTGCGACCACTTCGGCTGAAGACTATTGATCAGCAGGTTGGAACTGTCACAGCCTTGAATTTCTGTTTGGCATTAAGGCCGCCTCTTCTAAAGTTGACTCAGGAGTTGGTCAACTTTCTGCAAGAAGCTTTGCAAATAGCTGAGGCAGATGAGACTGTATGGGTTGTAAAGTTCATGAACCCTAAAATAGCCACATCATTGAACAAGCTCCGAACAGCTTGCATTGAGTTACTGTGCACCACCATGGCATGGGCAGATTTTAAAACACCCAATCATTCTGAGTTGCGTGCAAAGATCATCTCAATGTTTTTCAAGTCATTAACATGTCGGACTCCAGAAGTAGTTGCTGTTGCAAAGGAGGGGTTAAGACAGGTTATTAATCAGCAAAGGATGCCCAAAGATTTGCTGCAAGGTAGCCTTAGACCTATTCTGGTAAACTTGGCACACACCAAAAATCTTAGCATGCCACTTCTTCAAGGTCTGGCTCGCCTTCTTGAACTTTTGGCCAGTTGGTTTAATGTCACATTGGGAGGCAAGCTGTTAGAGCACCTCAAGAAATGGTTGGAGCCAGAAAAACTTGCTCAAAGTCAGAAAGCGTGGAAGGCGGGTGAGGAGCCAAAAATTGCCGCAGCTATCATTGAACTTTTTCATCTGCTTCCCATGGCTGCATCCAAGTTTCTTGATGAACTAGTAACATTGACTATTGATTTGGAAGGAGCTCTTCCCCCTGGCCAAGTGTATAGTGAAGTCAACAGTCCTTACCGTGTTCCACTAATTAAATTTTTGAACCGATATGCACCGCTTGCTGTTGATTACTTCCTTGCTCGACTTAGTGAACCAAAATACTTCAGACGGTTTATGTATATTATCCGATCAGATGCAGGCCAGCCTCTGAGAGAAGAACTTGCAAAATCCCCACAAAAGATACTTGCTAGTGCCTTTCCTGAATTTGCACCTAAATCTGAAGCTGCGTTAACTCCAGGTTCTTCAACCTCACCTGCTCCTTTATCAGGTGATGAAGGCCTTGTAACTCCTGATGCTTCCGATCCTCCATCTGCACCCTCAAGTGTGGTTTCCGATGCTTATTTTCGTGGGCTTGCACTTATAAAAACTTTGGTGAAATTGATGCCTGGCTGGCTACAGAACAATCGTGTAGTTTTTGATACTTTGGTACTTGTCTGGAAGTCACCGGCTAGAATAGCTCGACTGCACAATGAGCAAGAGCTAAACCTAGTGCAAGTGAAAGAAAGCAAGTGGCTAGTCAAATGCTTCTTGAATTATCTGCGACACGAAAAAGCAGAAGTGAATGTGCTCTTTGACATACTTTCCATCTTTTTATTTCACACTCGAATTGACTATACGTTTCTGAAGGAGTTTTACATAATTGAGGTTGCTGAAGGTTATCCACCCAACATGAAAAAAGCACTTCTCTTACATTTTCTAAACCTGTTTCAATCGAAACAACTTGGTCATGATCATTTGGTGATTGTAATGCAAATGCTAATTCTTCCTATGCTTGCTCATGCCTTCCAAAATGGGCAAAGTTGGGAGGTTGTAGATCAAGCTATAATTAAAACAATTGTCGACAAACTTCTTGATCCTCCAGAAGAGGTGTCTGCTGAGTACGATGAGCCCTTGAGAATAGAACTTTTGCAGCTTGCAACCTTACTTCTCAAGTATCTTCAGAGCGACCTGGTCCATCACAGGAAAGAACTAATCAAGTTTGGTTGGAACCATCTCAAAAGGGAAGATAGTGCCAGTAAGCAGTGGGCATTTGTGAATGTCTGCCATTTCTTGGAGGCTTATCAAGCACCTGAAAAAATTATACTTCAGGTTTTTGTAGCACTTCTTAGAACTTGTCAACCCGAGAATAAAATGTTGGTCAAGCAGGCCCTTGATATACTAATGCCAGCCCTACCACGGAGATTGCCTCTTGGTGATTCTCGGATGCCAATTTGGATAAGATACACAAAAAAGATACTGGTCGAGGAGGGCCACTCAATTCCTAATTTGATTCACATTTTTCAACTCATTGTGCGGCATTCAGATCTCTTCTACAGCTGCAGAGCTCAATTCGTTCCACAGATGGTGAATTCTCTCAGTCGTCTAGGATTACCTTACAATACTACAGCAGAAAATAGGAGACTTGCAATTGATCTTGCTGGATTGGTGGTTGGTTGGGAACGTCAACGACAAAATGAAATGAAGCTTGTCACTGAAAGCGATGCCCCTAGCCACAGCAATGATGGATTAACATGTCCTCCTGGTGCTGATCCCAAGCGTATGGTCGATGGTTCTACATTCCCCGAAGATTCAACCAAGCGGGTCAAGGTTGAACCGGGTCTTCAATCCCTCTGTGTCATGTCTCCTGGTGGTGCATCATCAATGCCGAATATAGAGACCCCTGGGTCAACAACACAACCTGATGAAGAATTTAAACCAAATGCTGCAATGGAGGAAATGATTATTAATTTTCTGATAAGGGTTGCGCTGGTTATAGAGCCTAAAGACAAGGAGGCAACTGCCATGTATAAACAAGCACTAGAGCTACTCTCACAGGCTTTAGAGGTGTGGCCAAATGCCAATGTCAAATTCAATTACCTGGAGAAGTTGCTCAGTAGCATCCAGCCATCTCAGTCCAAGGACCCTTCCACTGCTCTTGCACAGGGCTTAGATGTAATGAACAAAGTTTTAGAGAAGCAGCCGCATCTGTTTGTCAGAAATAATATCAATCAGATATCTCAAATTTTAGAACCCTGTTTTAAGCACAAGATGTTAGATGCTGGGAAATCATTATGCTCCTTGCTGAGGATGGTATTTGTGGCTTATCCATTGGAAGGGGTCACAACACCACCAGATGTGAAGTTGTTGTATCAGAAAGTTGACGAGCTCATAAAGAATCACATTAATAATTTAACAGCCCCTCAAACATCATCTGAAGATAACACTGCTTCTTCCATTAGCTTTGTCCTGCTTGTAATTAAAACTTTGACAGAAGTTCAGAAGAACCTAATTGATCCATATAACCTGGGTCGTATTCTTCAGCGCCTAGCACGAGACATGGGGTCGTCAGCAGGTTCTCATTTGAGACAAGGTCAAAGGATGGATCCTGATTCTGCAGTAACTTCTTCTCGCCAAAGTGCTGATGTCGGAACAGTCATCTCTAATTTGAAATCAGTTCTGAAGCTCATTAATGAAAGAGTCATGCTTGTTCCTGAGTGCAAACGATCGGTAACTCAGATCATGAACTCCCTGTTGTCAGAAAAAGGCACCGACGCTAGTGTGTTACTTTGCATACTTGATGTGATAAAAGGGTGGGTTGAGGACGACTTCAGTAAGATGGGCACATCTGTCTCGTCTAGTTCTTTTCTTGCTCCCAAGGAAATTGTCTCTTTCCTTCAGAAGCTATCACAAGTGGATAAGCAAAACTTCTCTTCAAGTGCTGCTGAAGAGTGGGATGGAAAATATCTGCAGCTCCTTTATGAAATTTGTGCCGATTCAAATAAATATCCATTGTCTCTGCGCCAAGAAGTATTTCAGAAGGTTGAACGACAGTTCATGCTGGGTCTGAGGGCTAGGGATCCTGAAACTAGAAAGAAATTCTTCACACTATATCATGAATCACTGGGGAAAACATTGTTTATAAGGCTGCAATACATCATCCAGATTCAGGACTGGGAAGCTTTAAGTGATGTATTCTGGCTCAAACAGGGCCTCGATCTCCTTTTGGCAGTCTTAGTTGAGGATAAACCTATAACCCTTGCACCAAACTCTGCGAGGTTGCCACCACTTCTGGTATCTGGTCATGTTGCAGATTCCTCTGCAGTGCAGCCCCAAGTTAATGACGCTCAAGAGGGTCTTGAGGATGCCCCTTTAACATTTGATTCCCTTGTTCATAAGCATGCACAATTTTTAAACCGGACGAGTAAACTTCAGGTCGCTGATCTTATTATACCATTGAGAGAACTGGCCCACACAGATGCAAATGTTGCGTACCATCTATGGGTTCTGGTTTTTCCTATTGTCTGGGTAACATTGCATAAGGAAGAACAGGTGGCACTGGCCAAACCAATGATTAGTCTCCTGTCAAAGGATTATCATAAGAAACAGCAAGCAAGCCGACCAAATGTTGTGCAGGCACTTCTAGAAGGGCTTCAGCTGAGTCATCCTCAGCCTCGGATGCCGAGTGAGCTCATTAAATATATTGGCAGGACTTACAATGCATGGCATATAGCACTAGCTCTTCTGGAAAGTCATGTTATGTTGTTCATGAATGAGACGAAGTGCTCTGAGTCTCTGGCTGAGCTATATCGTTTACTAAATGAGGAAGATATGAGGTGTGGATTGTGGAAGAGAAAGGCAATCTCTGCAGAAACTAAAGCTGGGCTTTCACTTGTTCAGCATGGTTACTGGCAGCGTGCTCAAATCCTTTTTTATCAATCGATGGTTAAAGCAACTCAAGGTACATATAATAACAATGTACCAAAGGCTGAGATGTGTCTTTGGGAAGAACAGTGGCTTTACTGTGCTAGCCAACTTAGTCAATGGGAAGCTTTGGTGGACTTTGGGAAGAGCATTGAAAATTATGAAATACTGCTCGACAGTCTATGGAAAGTGCCTGATTGGGCATACATGAAAGAGCATGTTATTCCAAAAGCACAAGTAGAAGAAACCCCAAAACTTCGTCTAATTCAAGCATACTTTTCTCTTCATGATAGGAGTACAAATGGTGTTGCAGATGCGGAAAATATAGTTGGAAAAGGAGTTGACCTTGCTTTAGAACAATGGTGGCAGTTGCCTGAAATGTCTGTTCATGCCAGGATTCCACTTTTGCAACAATTCCAGCAGCTAGTTGAAGTGCAGGAATCATCTAGAATTCTTGTTGATATAGCCAATGGAAACAAACATTCTGGAAGTTCTGTTGTTAGTGTGCATACAAATCTCTATGCAGATCTAAAGGATATCCTTGAGACTTGGAGACTGCGAATTCCAAATGAATGGGATAGTATGACTGTTTGGTGTGATTTACTACAATGGAGGAATGAGATGTATAATGCTGTAATTGATGCGTTTAAGGATTTTGGCACCACAAATTCCCAACTTCATCACCTTGGTTTTCGTGACAAAGCATGGAATGTCAATAAGCTTGCTCATGTTGCCCGTAAACAAGGACTTCATGATGTTTGTGTAGGAATACTTGAAAAGATGTATGGTCATTCAACCATGGAAGTGCAGGAGGCTTTTGTGAAGATAAGAGAACAGGCAAAGGCTTACTTGGAGATGAAGGGAGAGCTCACCAGTGGTCTGAATCTGATCAACAGCACTAATTTAGATTATTTTCCTGTGAAACACAAAGCAGAAATTTTTCGTCTCAAGGGAGATTTCCAGCTGAAGTTAAGTGATTCTGAAGGTGCTAATCATTCATACTCCAGCGCCATAAGTCTTTTCAAGAATTTGCCCAAGGGATGGATAAGCTGGGGGAATTATTGTGACATGGCTTACAAAGAATCCCATGAAGAGATTTGGTTGGAATATGCTGTTAGTTGCTTTCTTCAAGGCATTAAATTTGGCATTTCCAACTCAAGAAACCATCTAGCCCGTGTATTATATCTTCTCAGCTTTGATACCCCCAATGAGCCTGTGGGCCGAGCATTTGACAAGTATTTGGACCAAATACCCCATTGGGTATGGCTGTCCTGGATTCCTCAACTCTTACTTTCCTTGCAAAGGACAGAAGCACCTCACTGTAAACTTGTTCTTCTGAAAATTGCGAACGTTTATCCACAGGCACTGTATTACTGGCTTCGCACTTATTTGCTTGAAAGACGAGATGTTGCAAATAAGTCTGAGCTAGGTAGGATGGCAATGGCTCAACAAAGAATGCAGCAAAATACCAGTTCTGCTGGTTCTCTTGGCTTGACCGATGGAAGTTCTAGAGTGGCTCATGGTGGCAGTTCTACCTCTACTGATAACCAAGTCCACCAAGGCACTCAATCAGGTAGTGGAATTGGATCTCATGATGGTGGGAATTCTCATAGCCAGGAACCTGAAAGGTCCACTGGTGTGGAAAGCAGCACACATGCTGGAAATGATCAATCTCTTCCACAAACTTCTTCAAATGTCAATGAAGGTACTCAAAATGCATTAAGGCGCAGTGCTGCCTTGGGCTTAGTGGGTTCTGCTGCTAGTGCATTTGATGCTGCAAAGGATATCATGGAGGCTCTTAGAAGCAAGCACACTAACTTGGCTAGTGAACTTGAGATTCTACTCACAGAAATTGGTTCGAGGTTTGTTACTTTGCCAGAAGAGAGACTTCTTGCTGTGGTTAATGCATTGCTCCATCGTTGCTACAAGTATCCCACTGCTACGACAGCTGAGGTTCCTCAATCTCTGAAAAAGGAGCTCTCTGGAGTTTGTAAGGCTTGCTTCTCAGCTGACGCGGTTAACAAGCATGTTGATTTTGTGAGGGAGTACAAGCAGGATTTTGAGCGTGATCTTGATCCAGAGAGCACTTCCACTTTCCCAGCAACTCTTTCTGAATTGACTGAGCGATTGAAACACTGGAAAAATGTTCTCCAGGGAAATGTTGAGGACAGGTTTCCTGCAGTCTTGAGATTAGAAGACGAAAGCCGGGTATTGCGTGACTTCCACGTTGTTGATGTGGAGGTACCAGGACAATATTTTACTGACCAGGAAATTGCACCTGACCATACAGTGAAGTTAGACAGAGTTGGAGCAGACATTCCAATTGTCCGAAGACATGGGAGTAGTTTTAGACGCTTGACTTTAATTGGTTCAGATGGTTCTCAGCGTCATTTTATAGTCCAAACTTCCTTGACTCCTAATGCTAGAAGTGATGAGCGCATTTTGCAACTTTTCCGAGTTATGAATCAAATGTTTGATAAGCACAAGGAATCAAGACGCCGCCACTTGTGTATTCACACTCCAATCATTATCCCTGTTTGGTCACAGGTTCGCATGGTGGAAGATGATTTGATGTATAGCACTTTTCTTGAGGTGTATGAAAATCATTGTGCAAGAAACGACCAAGAGGCAGATCTTCCAATTACATATTTCAAAGAGCAACTGAATCAGGCCATATCTGGCCAAATCGCGCCGGAAGCTGTTCTAGATCTCCGCCTCCAAGCTTATGGTGATATAACAAGAAATCTTGTAAACGAGGGTATATTTTCACAGTATATGTACAAGACATTACTCAGTGGAAATCACATGTGGGCTTTTAAAAAACAATTTGCCATCCAATTAGCCCTTTCGAGTTTTATGTCTTACATGCTACAGATCGGGGGAAGGTCACCCAACAAGATTTATTTTGCTAAGAACACTGGGAAAATTTTCCAAACAGATTTTCATCCCGCATATGATACAAATGGTATGATCGAGTTCAATGAACCAGTTCCTTTCAGGTTAACTAGGAATATGCAAGCCTTCTTCTCTCATTTCGGAGTGGAAGGTTTAATTGTTTCTGCCATGTGTTCTGCTGCCCAGGCAGTTGTTTCACCGAAGCAAAATCAGCACTTGTGGCATCAACTTGCTATGTTTTTCCGTGATGAGCTACTTTCATGGTCTTGGCGGAGACCCCTCGGTATGCCTTTGGCATCCATTGCAGGTGGTGGTATGAGCCCTGCTGACTTCAAGCAGAAGGTTACTATCAACGTTGATCATGTCATTGGCCGGATTAATGGAATAGCACCACAATATTTCTCTGAGGAAGAGGAGAACGCCATGGACCCACCGCAGTCGGTACAAAGGGGTGTGTCGGACCTGGTCGATGCTGCCTTGATGCCAAGGCATCTGTGTATGATGGACCCAACATGGCATCCTTGGTTT

Coding sequence (CDS)

ATGAGTCCCATTCAGAATTTCGAGCTGCATTCCCGCCAACTCGTCGAGCCTGAACTCAGTATTCAGACGAGGCTTCAGATGGCAACGGAAGTTCGAGATAGCCTGGAGATTGCTCATACTCCCGAGTACTTAAATTTTCTGAAATGCTACTTTCGGGCGTTCTCTATAATCCTAGTTCAGATTACAAAGCCTCAATATACTGACAATCATGAACACAAACTTCGGAACATCGTGGTGGAGATCCTTAATCGTCTTCCTCACAGTGAAGTTTTAAGACCTTTCGTGCAGGACCTGTTAAAGGTCGCCATGCAAGTGCTTACCACAGATAACGAGGAGAACGGCTTAATTTGTATCCGCATAATATTTGATCTCCTCAGAAATTTTAGACCAACTCTAGAAAATGAAGTGCAGCCATTCCTGGACTTTGTGTGCAAAATTTACCAGAACTTTAAGTTGACCGTAAGCCATTTCTTTGAAAATTCGGCTGCTGGTGGCGAAGATATAAAGCCCATGGATGTATCCACTTCAACGGACCAGACAATTACTACCGGCTACACGGGGACTGTGCAGCTTAATCCTAGTACCCGTTCATTTAAGATAGTAACGGAGAGTCCACTTGTTGTCATGTTTCTCTTCCAACTGTATAGTCGGCTTGTTCAAACAAATATCCCTGTCTTGTTGCCTCTGATGGTTTCTGCTATTTCTGTTCCGGGACCTGAAAAGGTTCCTCCCTTTTTGAAGACTCATTTCATTGAACTGAAGGGTGCGCAGGTTAAGACAGTTTCTTTTTTAACATATTTGCTGAGGAGTTCTGCTGATTATATCAGGCCACACGAAGAAAGTATTTGTAAGAGTATTGTGAATTTGCTGGTTACATGTTCAGATTCCGTGTCAATTCGGAAAGAATTGTTAGTAGCCCTGAAACATGTTCTTGGAACAGAGTATAAGAGGGGCTTATTTCCTTTGATTGATACACTGTTGGAAGAGAAGGTTCTAGTGGGAACTGGTCGGGCATGCTATGAGACATTAAGACCATTAGCCTATAGTTTACTGGCAGAAATTGTGCATCATGTCAGGGGGGATCTTACCCTATCTCAGCTATCACGGATTATCTACTTGTTCTCAAGTAATATGCATGATGCCTCACTATCACTTAGCATTCATACTACTTGTGCACGATTGATGCTGAACTTGGTGGAGCCAATCTTTGAGAAGGGTGTTGACCAAACTTCTATGGATGAAGCACGAATTCTCTTGGGGCGTATTTTGGATGCTTTTGTTGGGAAGTTTAGTACGTTCAAGCATACCATTCCTCAGTTATTGGAGGAAGGTGAGGAGGGAAAAGATCGTGCAAATATGAGGTCAAAGCTTGAGCTCCCTGTGCAGGCAGTTTTAAATTTGCAGGTCCCTGTGGAACATTCTAAGGAAGTCAATGACTGTAAGCATTTGATTAAGACGTTGATCTTGGGAATGAAGACGATCATATGGAGCATCACTCATGCACATTTACCCCGACCTCAGGCTTCGCCATCTCCAAATGGAACACATCCACAGATGCTTGTTTCACCATCATCAAATTTGGCAACGCCTCAAGCATTCAAGGGAATGAGAGAGGACGAGGTGTGTAAAGCCTCTGGTGTCCTGAAAAGTGGTGTTCATTGCTTAACACTTTTCAAGGAAAAGGATGAAGAAGTAGAAATGCTTCATCTTTTCTCCCAGATATTGACTGTAATGGAACCTCGGGATCTGATGGACATGTTTTCATTGTGTATGCCTGAACTTTTTGACTGCATGATCACCAACACACAGCTGGTCCATCTGTTTTCAACATTTTTGCAAACACCTAAAGTATATAGGCCATTTGCGGATGTTTTGGTTAATTTTCTTGTCAGCAGTAAACTTGATGTTTTGAAGCACCCAGATTCACCGGGGGCAAAATTGGTCTTGCATCTCTTTCGTTTTGTATTTGGTGCTGTTGCTAAAGCACCATCAGATTTTGAGCGTATTTTACAGCCTCATGTGACTGTCATAATGGAAGTTTGTGTAAGAAGTGCTACTGAAGTTGAAAGACCGCTTGGGTACATGCAACTTCTTCGCATCATGTTTCGGGCATTGGCAGGGTGTAAATTTGAACTTTTACTACGTGATCTGATTCCTTTGCTACAACCTTGCCTTAACATGTTACTGACCATGTTTGATGGTCCAACTGGGGAAGATATGAGGGATCTGTTGTTGGAATTATGTCTCACATTGCCTGCACGCTTAAGCTCATTATTACCTCACCTTCCACGTTTGATGAAGCCTCTTGTTTTGTGTCTTAAAGGAAGTGACGACTTAGTTAGTCTAGGTTTGCGAACCCTCGAGTTCTGGGTCGATAGTTTAAATCCTGACTTCCTAGAACCGAGTATGGCAAATGTGATGTCTGAAGTGATTTTAGCCTTATGGTCTCATTTGAGGCCAATACCCTATCCTTGGGGTGCAAAAGCTTTGCAAGTTCTCGGAAAGTTAGGTGGTCGCAATAGACGTTTTCTGAAAGAGCCACTTGCACTAGAATGCAAGGAGAATCCAGAACACGGGCTTCGTTTAATTCTTACCTTTGAGCCATCTACTCCCTTTTTGGTGCCATTGGATAGATGCATTAATCTTGCTGTATCAACTGTAATGAATAAAACTGGTGGTGTTGATTCTTTCTACAGAAAACAAGCTTTGAAATTTCTTCGGGTCTGTTTATCTTCTCAGCTTAATTTGCCTGGAAATGTGGCTGATGATGGCCATACACCCAGACAATTGTCAACTTTACTAGTTTCTCCTGTTGATTCCTCTTTGAGAAGGTCTGAGACTCCCGAGGGAAAGGCTGATTTGGGTGTAAAGACAAAAACCCAACTTATGGCTGAGAAATCTGTTTTCAAAATTCTATTGATGACCATTATTGCTGCTGGTTCAGAGGAGGATCTCCACGAGCCAAAGGATGATTTTGTTCTCAATGTATGCCGCCATTTTGCTATACTATTCCATATTGATTCTTCTCTAAACAGTTCTCCAGTTGCATCTGCCTCACTTGGGAGTACTTTGCTTCCTCCAAACGTCAGTGCCAATTCCAGATTAAGAAGTAGTGCTTGTTGTAACCTCAAAGAGTTAGACCCTCTCATTTTTTTGGATGCCTTGGTTGAGGTGCTGGCGGATGAAAACAGGGTCCATGCAAAAGCTGCTCTGAATGCTCTAAATTTGTTCTCTGAAATTCTTCTTTTCCTTGCTCGTGCAAAACAAACTGATGTGATGATGACAAGAGGGCCCAGCACCCCAATGATTGTTTCCAGTCCATCAAAGAGCCCTGTATATTCACCACCTCCAAGTGTCCGTATTCCAGTTTTTGAGCAACTCTTGCCACGGCTTTTGCATTGTTGTTATGGCAGCACATGGCAAGCCCAGATGGGTGGTATTATGGGACTTGGTGCTTTGGTTGGAAAGGTTACTGTTGAGACTCTGTGTCTTTTCCAAGTAAGAATTGTGCGAGGCCTGGTATATGTTCTGAAAAGGCTGCCAATTTATGCTAGTAAGGAGCAAGAGGAGACTAGCCAAGTACTCAATCAGGTTCTTCGTGTTGTGAATAATGTTGATGAAGCAAATAGTGAACCGCGCAGACAAAGCTTTCATGGGGTAGTAGATATTCTTGCTTCTGAGTTGTTTAATCCCAATTCATCAACTATCGTGAGAAAGAATGTGCAGTCATGTTTAGCTCTTTTGGCCAGTAGGACTGGTAGTGAGGTGTCTGAGTTGCTTGAACCTCTGCATCAACCTTTGCTTCAGCCTCTCTTATTGCGACCACTTCGGCTGAAGACTATTGATCAGCAGGTTGGAACTGTCACAGCCTTGAATTTCTGTTTGGCATTAAGGCCGCCTCTTCTAAAGTTGACTCAGGAGTTGGTCAACTTTCTGCAAGAAGCTTTGCAAATAGCTGAGGCAGATGAGACTGTATGGGTTGTAAAGTTCATGAACCCTAAAATAGCCACATCATTGAACAAGCTCCGAACAGCTTGCATTGAGTTACTGTGCACCACCATGGCATGGGCAGATTTTAAAACACCCAATCATTCTGAGTTGCGTGCAAAGATCATCTCAATGTTTTTCAAGTCATTAACATGTCGGACTCCAGAAGTAGTTGCTGTTGCAAAGGAGGGGTTAAGACAGGTTATTAATCAGCAAAGGATGCCCAAAGATTTGCTGCAAGGTAGCCTTAGACCTATTCTGGTAAACTTGGCACACACCAAAAATCTTAGCATGCCACTTCTTCAAGGTCTGGCTCGCCTTCTTGAACTTTTGGCCAGTTGGTTTAATGTCACATTGGGAGGCAAGCTGTTAGAGCACCTCAAGAAATGGTTGGAGCCAGAAAAACTTGCTCAAAGTCAGAAAGCGTGGAAGGCGGGTGAGGAGCCAAAAATTGCCGCAGCTATCATTGAACTTTTTCATCTGCTTCCCATGGCTGCATCCAAGTTTCTTGATGAACTAGTAACATTGACTATTGATTTGGAAGGAGCTCTTCCCCCTGGCCAAGTGTATAGTGAAGTCAACAGTCCTTACCGTGTTCCACTAATTAAATTTTTGAACCGATATGCACCGCTTGCTGTTGATTACTTCCTTGCTCGACTTAGTGAACCAAAATACTTCAGACGGTTTATGTATATTATCCGATCAGATGCAGGCCAGCCTCTGAGAGAAGAACTTGCAAAATCCCCACAAAAGATACTTGCTAGTGCCTTTCCTGAATTTGCACCTAAATCTGAAGCTGCGTTAACTCCAGGTTCTTCAACCTCACCTGCTCCTTTATCAGGTGATGAAGGCCTTGTAACTCCTGATGCTTCCGATCCTCCATCTGCACCCTCAAGTGTGGTTTCCGATGCTTATTTTCGTGGGCTTGCACTTATAAAAACTTTGGTGAAATTGATGCCTGGCTGGCTACAGAACAATCGTGTAGTTTTTGATACTTTGGTACTTGTCTGGAAGTCACCGGCTAGAATAGCTCGACTGCACAATGAGCAAGAGCTAAACCTAGTGCAAGTGAAAGAAAGCAAGTGGCTAGTCAAATGCTTCTTGAATTATCTGCGACACGAAAAAGCAGAAGTGAATGTGCTCTTTGACATACTTTCCATCTTTTTATTTCACACTCGAATTGACTATACGTTTCTGAAGGAGTTTTACATAATTGAGGTTGCTGAAGGTTATCCACCCAACATGAAAAAAGCACTTCTCTTACATTTTCTAAACCTGTTTCAATCGAAACAACTTGGTCATGATCATTTGGTGATTGTAATGCAAATGCTAATTCTTCCTATGCTTGCTCATGCCTTCCAAAATGGGCAAAGTTGGGAGGTTGTAGATCAAGCTATAATTAAAACAATTGTCGACAAACTTCTTGATCCTCCAGAAGAGGTGTCTGCTGAGTACGATGAGCCCTTGAGAATAGAACTTTTGCAGCTTGCAACCTTACTTCTCAAGTATCTTCAGAGCGACCTGGTCCATCACAGGAAAGAACTAATCAAGTTTGGTTGGAACCATCTCAAAAGGGAAGATAGTGCCAGTAAGCAGTGGGCATTTGTGAATGTCTGCCATTTCTTGGAGGCTTATCAAGCACCTGAAAAAATTATACTTCAGGTTTTTGTAGCACTTCTTAGAACTTGTCAACCCGAGAATAAAATGTTGGTCAAGCAGGCCCTTGATATACTAATGCCAGCCCTACCACGGAGATTGCCTCTTGGTGATTCTCGGATGCCAATTTGGATAAGATACACAAAAAAGATACTGGTCGAGGAGGGCCACTCAATTCCTAATTTGATTCACATTTTTCAACTCATTGTGCGGCATTCAGATCTCTTCTACAGCTGCAGAGCTCAATTCGTTCCACAGATGGTGAATTCTCTCAGTCGTCTAGGATTACCTTACAATACTACAGCAGAAAATAGGAGACTTGCAATTGATCTTGCTGGATTGGTGGTTGGTTGGGAACGTCAACGACAAAATGAAATGAAGCTTGTCACTGAAAGCGATGCCCCTAGCCACAGCAATGATGGATTAACATGTCCTCCTGGTGCTGATCCCAAGCGTATGGTCGATGGTTCTACATTCCCCGAAGATTCAACCAAGCGGGTCAAGGTTGAACCGGGTCTTCAATCCCTCTGTGTCATGTCTCCTGGTGGTGCATCATCAATGCCGAATATAGAGACCCCTGGGTCAACAACACAACCTGATGAAGAATTTAAACCAAATGCTGCAATGGAGGAAATGATTATTAATTTTCTGATAAGGGTTGCGCTGGTTATAGAGCCTAAAGACAAGGAGGCAACTGCCATGTATAAACAAGCACTAGAGCTACTCTCACAGGCTTTAGAGGTGTGGCCAAATGCCAATGTCAAATTCAATTACCTGGAGAAGTTGCTCAGTAGCATCCAGCCATCTCAGTCCAAGGACCCTTCCACTGCTCTTGCACAGGGCTTAGATGTAATGAACAAAGTTTTAGAGAAGCAGCCGCATCTGTTTGTCAGAAATAATATCAATCAGATATCTCAAATTTTAGAACCCTGTTTTAAGCACAAGATGTTAGATGCTGGGAAATCATTATGCTCCTTGCTGAGGATGGTATTTGTGGCTTATCCATTGGAAGGGGTCACAACACCACCAGATGTGAAGTTGTTGTATCAGAAAGTTGACGAGCTCATAAAGAATCACATTAATAATTTAACAGCCCCTCAAACATCATCTGAAGATAACACTGCTTCTTCCATTAGCTTTGTCCTGCTTGTAATTAAAACTTTGACAGAAGTTCAGAAGAACCTAATTGATCCATATAACCTGGGTCGTATTCTTCAGCGCCTAGCACGAGACATGGGGTCGTCAGCAGGTTCTCATTTGAGACAAGGTCAAAGGATGGATCCTGATTCTGCAGTAACTTCTTCTCGCCAAAGTGCTGATGTCGGAACAGTCATCTCTAATTTGAAATCAGTTCTGAAGCTCATTAATGAAAGAGTCATGCTTGTTCCTGAGTGCAAACGATCGGTAACTCAGATCATGAACTCCCTGTTGTCAGAAAAAGGCACCGACGCTAGTGTGTTACTTTGCATACTTGATGTGATAAAAGGGTGGGTTGAGGACGACTTCAGTAAGATGGGCACATCTGTCTCGTCTAGTTCTTTTCTTGCTCCCAAGGAAATTGTCTCTTTCCTTCAGAAGCTATCACAAGTGGATAAGCAAAACTTCTCTTCAAGTGCTGCTGAAGAGTGGGATGGAAAATATCTGCAGCTCCTTTATGAAATTTGTGCCGATTCAAATAAATATCCATTGTCTCTGCGCCAAGAAGTATTTCAGAAGGTTGAACGACAGTTCATGCTGGGTCTGAGGGCTAGGGATCCTGAAACTAGAAAGAAATTCTTCACACTATATCATGAATCACTGGGGAAAACATTGTTTATAAGGCTGCAATACATCATCCAGATTCAGGACTGGGAAGCTTTAAGTGATGTATTCTGGCTCAAACAGGGCCTCGATCTCCTTTTGGCAGTCTTAGTTGAGGATAAACCTATAACCCTTGCACCAAACTCTGCGAGGTTGCCACCACTTCTGGTATCTGGTCATGTTGCAGATTCCTCTGCAGTGCAGCCCCAAGTTAATGACGCTCAAGAGGGTCTTGAGGATGCCCCTTTAACATTTGATTCCCTTGTTCATAAGCATGCACAATTTTTAAACCGGACGAGTAAACTTCAGGTCGCTGATCTTATTATACCATTGAGAGAACTGGCCCACACAGATGCAAATGTTGCGTACCATCTATGGGTTCTGGTTTTTCCTATTGTCTGGGTAACATTGCATAAGGAAGAACAGGTGGCACTGGCCAAACCAATGATTAGTCTCCTGTCAAAGGATTATCATAAGAAACAGCAAGCAAGCCGACCAAATGTTGTGCAGGCACTTCTAGAAGGGCTTCAGCTGAGTCATCCTCAGCCTCGGATGCCGAGTGAGCTCATTAAATATATTGGCAGGACTTACAATGCATGGCATATAGCACTAGCTCTTCTGGAAAGTCATGTTATGTTGTTCATGAATGAGACGAAGTGCTCTGAGTCTCTGGCTGAGCTATATCGTTTACTAAATGAGGAAGATATGAGGTGTGGATTGTGGAAGAGAAAGGCAATCTCTGCAGAAACTAAAGCTGGGCTTTCACTTGTTCAGCATGGTTACTGGCAGCGTGCTCAAATCCTTTTTTATCAATCGATGGTTAAAGCAACTCAAGGTACATATAATAACAATGTACCAAAGGCTGAGATGTGTCTTTGGGAAGAACAGTGGCTTTACTGTGCTAGCCAACTTAGTCAATGGGAAGCTTTGGTGGACTTTGGGAAGAGCATTGAAAATTATGAAATACTGCTCGACAGTCTATGGAAAGTGCCTGATTGGGCATACATGAAAGAGCATGTTATTCCAAAAGCACAAGTAGAAGAAACCCCAAAACTTCGTCTAATTCAAGCATACTTTTCTCTTCATGATAGGAGTACAAATGGTGTTGCAGATGCGGAAAATATAGTTGGAAAAGGAGTTGACCTTGCTTTAGAACAATGGTGGCAGTTGCCTGAAATGTCTGTTCATGCCAGGATTCCACTTTTGCAACAATTCCAGCAGCTAGTTGAAGTGCAGGAATCATCTAGAATTCTTGTTGATATAGCCAATGGAAACAAACATTCTGGAAGTTCTGTTGTTAGTGTGCATACAAATCTCTATGCAGATCTAAAGGATATCCTTGAGACTTGGAGACTGCGAATTCCAAATGAATGGGATAGTATGACTGTTTGGTGTGATTTACTACAATGGAGGAATGAGATGTATAATGCTGTAATTGATGCGTTTAAGGATTTTGGCACCACAAATTCCCAACTTCATCACCTTGGTTTTCGTGACAAAGCATGGAATGTCAATAAGCTTGCTCATGTTGCCCGTAAACAAGGACTTCATGATGTTTGTGTAGGAATACTTGAAAAGATGTATGGTCATTCAACCATGGAAGTGCAGGAGGCTTTTGTGAAGATAAGAGAACAGGCAAAGGCTTACTTGGAGATGAAGGGAGAGCTCACCAGTGGTCTGAATCTGATCAACAGCACTAATTTAGATTATTTTCCTGTGAAACACAAAGCAGAAATTTTTCGTCTCAAGGGAGATTTCCAGCTGAAGTTAAGTGATTCTGAAGGTGCTAATCATTCATACTCCAGCGCCATAAGTCTTTTCAAGAATTTGCCCAAGGGATGGATAAGCTGGGGGAATTATTGTGACATGGCTTACAAAGAATCCCATGAAGAGATTTGGTTGGAATATGCTGTTAGTTGCTTTCTTCAAGGCATTAAATTTGGCATTTCCAACTCAAGAAACCATCTAGCCCGTGTATTATATCTTCTCAGCTTTGATACCCCCAATGAGCCTGTGGGCCGAGCATTTGACAAGTATTTGGACCAAATACCCCATTGGGTATGGCTGTCCTGGATTCCTCAACTCTTACTTTCCTTGCAAAGGACAGAAGCACCTCACTGTAAACTTGTTCTTCTGAAAATTGCGAACGTTTATCCACAGGCACTGTATTACTGGCTTCGCACTTATTTGCTTGAAAGACGAGATGTTGCAAATAAGTCTGAGCTAGGTAGGATGGCAATGGCTCAACAAAGAATGCAGCAAAATACCAGTTCTGCTGGTTCTCTTGGCTTGACCGATGGAAGTTCTAGAGTGGCTCATGGTGGCAGTTCTACCTCTACTGATAACCAAGTCCACCAAGGCACTCAATCAGGTAGTGGAATTGGATCTCATGATGGTGGGAATTCTCATAGCCAGGAACCTGAAAGGTCCACTGGTGTGGAAAGCAGCACACATGCTGGAAATGATCAATCTCTTCCACAAACTTCTTCAAATGTCAATGAAGGTACTCAAAATGCATTAAGGCGCAGTGCTGCCTTGGGCTTAGTGGGTTCTGCTGCTAGTGCATTTGATGCTGCAAAGGATATCATGGAGGCTCTTAGAAGCAAGCACACTAACTTGGCTAGTGAACTTGAGATTCTACTCACAGAAATTGGTTCGAGGTTTGTTACTTTGCCAGAAGAGAGACTTCTTGCTGTGGTTAATGCATTGCTCCATCGTTGCTACAAGTATCCCACTGCTACGACAGCTGAGGTTCCTCAATCTCTGAAAAAGGAGCTCTCTGGAGTTTGTAAGGCTTGCTTCTCAGCTGACGCGGTTAACAAGCATGTTGATTTTGTGAGGGAGTACAAGCAGGATTTTGAGCGTGATCTTGATCCAGAGAGCACTTCCACTTTCCCAGCAACTCTTTCTGAATTGACTGAGCGATTGAAACACTGGAAAAATGTTCTCCAGGGAAATGTTGAGGACAGGTTTCCTGCAGTCTTGAGATTAGAAGACGAAAGCCGGGTATTGCGTGACTTCCACGTTGTTGATGTGGAGGTACCAGGACAATATTTTACTGACCAGGAAATTGCACCTGACCATACAGTGAAGTTAGACAGAGTTGGAGCAGACATTCCAATTGTCCGAAGACATGGGAGTAGTTTTAGACGCTTGACTTTAATTGGTTCAGATGGTTCTCAGCGTCATTTTATAGTCCAAACTTCCTTGACTCCTAATGCTAGAAGTGATGAGCGCATTTTGCAACTTTTCCGAGTTATGAATCAAATGTTTGATAAGCACAAGGAATCAAGACGCCGCCACTTGTGTATTCACACTCCAATCATTATCCCTGTTTGGTCACAGGTTCGCATGGTGGAAGATGATTTGATGTATAGCACTTTTCTTGAGGTGTATGAAAATCATTGTGCAAGAAACGACCAAGAGGCAGATCTTCCAATTACATATTTCAAAGAGCAACTGAATCAGGCCATATCTGGCCAAATCGCGCCGGAAGCTGTTCTAGATCTCCGCCTCCAAGCTTATGGTGATATAACAAGAAATCTTGTAAACGAGGGTATATTTTCACAGTATATGTACAAGACATTACTCAGTGGAAATCACATGTGGGCTTTTAAAAAACAATTTGCCATCCAATTAGCCCTTTCGAGTTTTATGTCTTACATGCTACAGATCGGGGGAAGGTCACCCAACAAGATTTATTTTGCTAAGAACACTGGGAAAATTTTCCAAACAGATTTTCATCCCGCATATGATACAAATGGTATGATCGAGTTCAATGAACCAGTTCCTTTCAGGTTAACTAGGAATATGCAAGCCTTCTTCTCTCATTTCGGAGTGGAAGGTTTAATTGTTTCTGCCATGTGTTCTGCTGCCCAGGCAGTTGTTTCACCGAAGCAAAATCAGCACTTGTGGCATCAACTTGCTATGTTTTTCCGTGATGAGCTACTTTCATGGTCTTGGCGGAGACCCCTCGGTATGCCTTTGGCATCCATTGCAGGTGGTGGTATGAGCCCTGCTGACTTCAAGCAGAAGGTTACTATCAACGTTGATCATGTCATTGGCCGGATTAATGGAATAGCACCACAATATTTCTCTGAGGAAGAGGAGAACGCCATGGACCCACCGCAGTCGGTACAAAGGGGTGTGTCGGACCTGGTCGATGCTGCCTTGATGCCAAGGCATCTGTGTATGATGGACCCAACATGGCATCCTTGGTTT

Protein sequence

MSPIQNFELHSRQLVEPELSIQTRLQMATEVRDSLEIAHTPEYLNFLKCYFRAFSIILVQITKPQYTDNHEHKLRNIVVEILNRLPHSEVLRPFVQDLLKVAMQVLTTDNEENGLICIRIIFDLLRNFRPTLENEVQPFLDFVCKIYQNFKLTVSHFFENSAAGGEDIKPMDVSTSTDQTITTGYTGTVQLNPSTRSFKIVTESPLVVMFLFQLYSRLVQTNIPVLLPLMVSAISVPGPEKVPPFLKTHFIELKGAQVKTVSFLTYLLRSSADYIRPHEESICKSIVNLLVTCSDSVSIRKELLVALKHVLGTEYKRGLFPLIDTLLEEKVLVGTGRACYETLRPLAYSLLAEIVHHVRGDLTLSQLSRIIYLFSSNMHDASLSLSIHTTCARLMLNLVEPIFEKGVDQTSMDEARILLGRILDAFVGKFSTFKHTIPQLLEEGEEGKDRANMRSKLELPVQAVLNLQVPVEHSKEVNDCKHLIKTLILGMKTIIWSITHAHLPRPQASPSPNGTHPQMLVSPSSNLATPQAFKGMREDEVCKASGVLKSGVHCLTLFKEKDEEVEMLHLFSQILTVMEPRDLMDMFSLCMPELFDCMITNTQLVHLFSTFLQTPKVYRPFADVLVNFLVSSKLDVLKHPDSPGAKLVLHLFRFVFGAVAKAPSDFERILQPHVTVIMEVCVRSATEVERPLGYMQLLRIMFRALAGCKFELLLRDLIPLLQPCLNMLLTMFDGPTGEDMRDLLLELCLTLPARLSSLLPHLPRLMKPLVLCLKGSDDLVSLGLRTLEFWVDSLNPDFLEPSMANVMSEVILALWSHLRPIPYPWGAKALQVLGKLGGRNRRFLKEPLALECKENPEHGLRLILTFEPSTPFLVPLDRCINLAVSTVMNKTGGVDSFYRKQALKFLRVCLSSQLNLPGNVADDGHTPRQLSTLLVSPVDSSLRRSETPEGKADLGVKTKTQLMAEKSVFKILLMTIIAAGSEEDLHEPKDDFVLNVCRHFAILFHIDSSLNSSPVASASLGSTLLPPNVSANSRLRSSACCNLKELDPLIFLDALVEVLADENRVHAKAALNALNLFSEILLFLARAKQTDVMMTRGPSTPMIVSSPSKSPVYSPPPSVRIPVFEQLLPRLLHCCYGSTWQAQMGGIMGLGALVGKVTVETLCLFQVRIVRGLVYVLKRLPIYASKEQEETSQVLNQVLRVVNNVDEANSEPRRQSFHGVVDILASELFNPNSSTIVRKNVQSCLALLASRTGSEVSELLEPLHQPLLQPLLLRPLRLKTIDQQVGTVTALNFCLALRPPLLKLTQELVNFLQEALQIAEADETVWVVKFMNPKIATSLNKLRTACIELLCTTMAWADFKTPNHSELRAKIISMFFKSLTCRTPEVVAVAKEGLRQVINQQRMPKDLLQGSLRPILVNLAHTKNLSMPLLQGLARLLELLASWFNVTLGGKLLEHLKKWLEPEKLAQSQKAWKAGEEPKIAAAIIELFHLLPMAASKFLDELVTLTIDLEGALPPGQVYSEVNSPYRVPLIKFLNRYAPLAVDYFLARLSEPKYFRRFMYIIRSDAGQPLREELAKSPQKILASAFPEFAPKSEAALTPGSSTSPAPLSGDEGLVTPDASDPPSAPSSVVSDAYFRGLALIKTLVKLMPGWLQNNRVVFDTLVLVWKSPARIARLHNEQELNLVQVKESKWLVKCFLNYLRHEKAEVNVLFDILSIFLFHTRIDYTFLKEFYIIEVAEGYPPNMKKALLLHFLNLFQSKQLGHDHLVIVMQMLILPMLAHAFQNGQSWEVVDQAIIKTIVDKLLDPPEEVSAEYDEPLRIELLQLATLLLKYLQSDLVHHRKELIKFGWNHLKREDSASKQWAFVNVCHFLEAYQAPEKIILQVFVALLRTCQPENKMLVKQALDILMPALPRRLPLGDSRMPIWIRYTKKILVEEGHSIPNLIHIFQLIVRHSDLFYSCRAQFVPQMVNSLSRLGLPYNTTAENRRLAIDLAGLVVGWERQRQNEMKLVTESDAPSHSNDGLTCPPGADPKRMVDGSTFPEDSTKRVKVEPGLQSLCVMSPGGASSMPNIETPGSTTQPDEEFKPNAAMEEMIINFLIRVALVIEPKDKEATAMYKQALELLSQALEVWPNANVKFNYLEKLLSSIQPSQSKDPSTALAQGLDVMNKVLEKQPHLFVRNNINQISQILEPCFKHKMLDAGKSLCSLLRMVFVAYPLEGVTTPPDVKLLYQKVDELIKNHINNLTAPQTSSEDNTASSISFVLLVIKTLTEVQKNLIDPYNLGRILQRLARDMGSSAGSHLRQGQRMDPDSAVTSSRQSADVGTVISNLKSVLKLINERVMLVPECKRSVTQIMNSLLSEKGTDASVLLCILDVIKGWVEDDFSKMGTSVSSSSFLAPKEIVSFLQKLSQVDKQNFSSSAAEEWDGKYLQLLYEICADSNKYPLSLRQEVFQKVERQFMLGLRARDPETRKKFFTLYHESLGKTLFIRLQYIIQIQDWEALSDVFWLKQGLDLLLAVLVEDKPITLAPNSARLPPLLVSGHVADSSAVQPQVNDAQEGLEDAPLTFDSLVHKHAQFLNRTSKLQVADLIIPLRELAHTDANVAYHLWVLVFPIVWVTLHKEEQVALAKPMISLLSKDYHKKQQASRPNVVQALLEGLQLSHPQPRMPSELIKYIGRTYNAWHIALALLESHVMLFMNETKCSESLAELYRLLNEEDMRCGLWKRKAISAETKAGLSLVQHGYWQRAQILFYQSMVKATQGTYNNNVPKAEMCLWEEQWLYCASQLSQWEALVDFGKSIENYEILLDSLWKVPDWAYMKEHVIPKAQVEETPKLRLIQAYFSLHDRSTNGVADAENIVGKGVDLALEQWWQLPEMSVHARIPLLQQFQQLVEVQESSRILVDIANGNKHSGSSVVSVHTNLYADLKDILETWRLRIPNEWDSMTVWCDLLQWRNEMYNAVIDAFKDFGTTNSQLHHLGFRDKAWNVNKLAHVARKQGLHDVCVGILEKMYGHSTMEVQEAFVKIREQAKAYLEMKGELTSGLNLINSTNLDYFPVKHKAEIFRLKGDFQLKLSDSEGANHSYSSAISLFKNLPKGWISWGNYCDMAYKESHEEIWLEYAVSCFLQGIKFGISNSRNHLARVLYLLSFDTPNEPVGRAFDKYLDQIPHWVWLSWIPQLLLSLQRTEAPHCKLVLLKIANVYPQALYYWLRTYLLERRDVANKSELGRMAMAQQRMQQNTSSAGSLGLTDGSSRVAHGGSSTSTDNQVHQGTQSGSGIGSHDGGNSHSQEPERSTGVESSTHAGNDQSLPQTSSNVNEGTQNALRRSAALGLVGSAASAFDAAKDIMEALRSKHTNLASELEILLTEIGSRFVTLPEERLLAVVNALLHRCYKYPTATTAEVPQSLKKELSGVCKACFSADAVNKHVDFVREYKQDFERDLDPESTSTFPATLSELTERLKHWKNVLQGNVEDRFPAVLRLEDESRVLRDFHVVDVEVPGQYFTDQEIAPDHTVKLDRVGADIPIVRRHGSSFRRLTLIGSDGSQRHFIVQTSLTPNARSDERILQLFRVMNQMFDKHKESRRRHLCIHTPIIIPVWSQVRMVEDDLMYSTFLEVYENHCARNDQEADLPITYFKEQLNQAISGQIAPEAVLDLRLQAYGDITRNLVNEGIFSQYMYKTLLSGNHMWAFKKQFAIQLALSSFMSYMLQIGGRSPNKIYFAKNTGKIFQTDFHPAYDTNGMIEFNEPVPFRLTRNMQAFFSHFGVEGLIVSAMCSAAQAVVSPKQNQHLWHQLAMFFRDELLSWSWRRPLGMPLASIAGGGMSPADFKQKVTINVDHVIGRINGIAPQYFSEEEENAMDPPQSVQRGVSDLVDAALMPRHLCMMDPTWHPWF
Homology
BLAST of MS010599 vs. NCBI nr
Match: XP_022133382.1 (transformation/transcription domain-associated protein-like [Momordica charantia])

HSP 1 Score: 7655.8 bits (19863), Expect = 0.0e+00
Identity = 3880/3888 (99.79%), Postives = 3884/3888 (99.90%), Query Frame = 0

Query: 1    MSPIQNFELHSRQLVEPELSIQTRLQMATEVRDSLEIAHTPEYLNFLKCYFRAFSIILVQ 60
            MSPIQNFELHSRQLVEPELSIQTRLQMATEVRDSLEIAHTPEYLNFLKCYFRAFSIILVQ
Sbjct: 1    MSPIQNFELHSRQLVEPELSIQTRLQMATEVRDSLEIAHTPEYLNFLKCYFRAFSIILVQ 60

Query: 61   ITKPQYTDNHEHKLRNIVVEILNRLPHSEVLRPFVQDLLKVAMQVLTTDNEENGLICIRI 120
            ITKPQYTDNHEHKLRNIVVEILNRLPHSEVLRPFVQDLLKVAMQVLTTDNEENGLICIRI
Sbjct: 61   ITKPQYTDNHEHKLRNIVVEILNRLPHSEVLRPFVQDLLKVAMQVLTTDNEENGLICIRI 120

Query: 121  IFDLLRNFRPTLENEVQPFLDFVCKIYQNFKLTVSHFFENSAAGGEDIKPMDVSTSTDQT 180
            IFDLLRNFRPTLENEVQPFLDFVCKIYQNFKLTVSHFFENSAAGGEDIKPMDVSTSTDQT
Sbjct: 121  IFDLLRNFRPTLENEVQPFLDFVCKIYQNFKLTVSHFFENSAAGGEDIKPMDVSTSTDQT 180

Query: 181  ITTGYTGTVQLNPSTRSFKIVTESPLVVMFLFQLYSRLVQTNIPVLLPLMVSAISVPGPE 240
            ITTGYTGTVQLNPSTRSFKIVTESPLVVMFLFQLYSRLVQTNIPVLLPLMVSAISVPGPE
Sbjct: 181  ITTGYTGTVQLNPSTRSFKIVTESPLVVMFLFQLYSRLVQTNIPVLLPLMVSAISVPGPE 240

Query: 241  KVPPFLKTHFIELKGAQVKTVSFLTYLLRSSADYIRPHEESICKSIVNLLVTCSDSVSIR 300
            KVPPFLKTHFIELKGAQVKTVSFLTYLLRSSADYIRPHEESICKSIVNLLVTCSDSVSIR
Sbjct: 241  KVPPFLKTHFIELKGAQVKTVSFLTYLLRSSADYIRPHEESICKSIVNLLVTCSDSVSIR 300

Query: 301  KELLVALKHVLGTEYKRGLFPLIDTLLEEKVLVGTGRACYETLRPLAYSLLAEIVHHVRG 360
            KELLVALKHVLGTEYKRGLFPLIDTLLEEKVLVGTGRACYETLRPLAYSLLAEIVHHVRG
Sbjct: 301  KELLVALKHVLGTEYKRGLFPLIDTLLEEKVLVGTGRACYETLRPLAYSLLAEIVHHVRG 360

Query: 361  DLTLSQLSRIIYLFSSNMHDASLSLSIHTTCARLMLNLVEPIFEKGVDQTSMDEARILLG 420
            DLTLSQLSRIIYLFSSNMHDASLSLSIHTTCARLMLNLVEPIFEKGVDQTSMDEARILLG
Sbjct: 361  DLTLSQLSRIIYLFSSNMHDASLSLSIHTTCARLMLNLVEPIFEKGVDQTSMDEARILLG 420

Query: 421  RILDAFVGKFSTFKHTIPQLLEEGEEGKDRANMRSKLELPVQAVLNLQVPVEHSKEVNDC 480
            RILDAFVGKFSTFKHTIPQLLEEGEEGKDRANMRSKLELPVQAVLNLQVPVEHSKEVNDC
Sbjct: 421  RILDAFVGKFSTFKHTIPQLLEEGEEGKDRANMRSKLELPVQAVLNLQVPVEHSKEVNDC 480

Query: 481  KHLIKTLILGMKTIIWSITHAHLPRPQASPSPNGTHPQMLVSPSSNLATPQAFKGMREDE 540
            KHLIKTLILGMKTIIWSITHAHLPRPQASPSPNGTHPQMLVSPSSNLATPQAFKGMREDE
Sbjct: 481  KHLIKTLILGMKTIIWSITHAHLPRPQASPSPNGTHPQMLVSPSSNLATPQAFKGMREDE 540

Query: 541  VCKASGVLKSGVHCLTLFKEKDEEVEMLHLFSQILTVMEPRDLMDMFSLCMPELFDCMIT 600
            VCKASGVLKSGVHCLTLFKEKDEEVEMLHLFSQILTVMEPRDLMDMFSLCMPELFDCMIT
Sbjct: 541  VCKASGVLKSGVHCLTLFKEKDEEVEMLHLFSQILTVMEPRDLMDMFSLCMPELFDCMIT 600

Query: 601  NTQLVHLFSTFLQTPKVYRPFADVLVNFLVSSKLDVLKHPDSPGAKLVLHLFRFVFGAVA 660
            NTQLVHLFSTFLQTPKVYRPFADVLVNFLVSSKLDVLKHPDS GAKLVLHLFRFVFGAVA
Sbjct: 601  NTQLVHLFSTFLQTPKVYRPFADVLVNFLVSSKLDVLKHPDSQGAKLVLHLFRFVFGAVA 660

Query: 661  KAPSDFERILQPHVTVIMEVCVRSATEVERPLGYMQLLRIMFRALAGCKFELLLRDLIPL 720
            KAPSDFERILQPHVTVIMEVCVRSATEVERPLGYMQLLRIMFRALAGCKFELLLRDLIPL
Sbjct: 661  KAPSDFERILQPHVTVIMEVCVRSATEVERPLGYMQLLRIMFRALAGCKFELLLRDLIPL 720

Query: 721  LQPCLNMLLTMFDGPTGEDMRDLLLELCLTLPARLSSLLPHLPRLMKPLVLCLKGSDDLV 780
            LQPCLNMLLTMFDGPTGEDMRDLLLELCLTLPARLSSLLPHLPRLMKPLVLCLKGSDDLV
Sbjct: 721  LQPCLNMLLTMFDGPTGEDMRDLLLELCLTLPARLSSLLPHLPRLMKPLVLCLKGSDDLV 780

Query: 781  SLGLRTLEFWVDSLNPDFLEPSMANVMSEVILALWSHLRPIPYPWGAKALQVLGKLGGRN 840
            SLGLRTLEFWVDSLNPDFLEPSMANVMSEVILALWSHLRPIPYPWGAKALQVLGKLGGRN
Sbjct: 781  SLGLRTLEFWVDSLNPDFLEPSMANVMSEVILALWSHLRPIPYPWGAKALQVLGKLGGRN 840

Query: 841  RRFLKEPLALECKENPEHGLRLILTFEPSTPFLVPLDRCINLAVSTVMNKTGGVDSFYRK 900
            RRFLKEPLALECKENPEHGLRLILTFEPSTPFLVPLDRCINLAVSTVMNKTGGVDSFYRK
Sbjct: 841  RRFLKEPLALECKENPEHGLRLILTFEPSTPFLVPLDRCINLAVSTVMNKTGGVDSFYRK 900

Query: 901  QALKFLRVCLSSQLNLPGNVADDGHTPRQLSTLLVSPVDSSLRRSETPEGKADLGVKTKT 960
            QALKFLRVCLSSQLNLPGNVADDGHTPRQLSTLLVSPVDSSLRRSETPEGKADLGVKTKT
Sbjct: 901  QALKFLRVCLSSQLNLPGNVADDGHTPRQLSTLLVSPVDSSLRRSETPEGKADLGVKTKT 960

Query: 961  QLMAEKSVFKILLMTIIAAGSEEDLHEPKDDFVLNVCRHFAILFHIDSSLNSSPVASASL 1020
            QLMAEKSVFKILLMTIIAAGSEEDLHEPKDDFVLNVCRHFAILFHIDSSLN+SPVASASL
Sbjct: 961  QLMAEKSVFKILLMTIIAAGSEEDLHEPKDDFVLNVCRHFAILFHIDSSLNNSPVASASL 1020

Query: 1021 GSTLLPPNVSANSRLRSSACCNLKELDPLIFLDALVEVLADENRVHAKAALNALNLFSEI 1080
            GSTLLPPNVSANSRLRSSACCNLKELDPLIFLDALVEVLADENRVHAKAALNALNLFSEI
Sbjct: 1021 GSTLLPPNVSANSRLRSSACCNLKELDPLIFLDALVEVLADENRVHAKAALNALNLFSEI 1080

Query: 1081 LLFLARAKQTDVMMTRGPSTPMIVSSPSKSPVYSPPPSVRIPVFEQLLPRLLHCCYGSTW 1140
            LLFLARAKQTDVMMTRGPSTPMIVSSPSKSPVYSPPPSVRIPVFEQLLPRLLHCCYGSTW
Sbjct: 1081 LLFLARAKQTDVMMTRGPSTPMIVSSPSKSPVYSPPPSVRIPVFEQLLPRLLHCCYGSTW 1140

Query: 1141 QAQMGGIMGLGALVGKVTVETLCLFQVRIVRGLVYVLKRLPIYASKEQEETSQVLNQVLR 1200
            QAQMGGIMGLGALVGKVTVETLCLFQVRIVRGLVYVLKRLPIYASKEQEETSQVLNQVLR
Sbjct: 1141 QAQMGGIMGLGALVGKVTVETLCLFQVRIVRGLVYVLKRLPIYASKEQEETSQVLNQVLR 1200

Query: 1201 VVNNVDEANSEPRRQSFHGVVDILASELFNPNSSTIVRKNVQSCLALLASRTGSEVSELL 1260
            VVNNVDEANSEPRRQSFHGVVDILASELFNPNSSTIVRKNVQSCLALLASRTGSEVSELL
Sbjct: 1201 VVNNVDEANSEPRRQSFHGVVDILASELFNPNSSTIVRKNVQSCLALLASRTGSEVSELL 1260

Query: 1261 EPLHQPLLQPLLLRPLRLKTIDQQVGTVTALNFCLALRPPLLKLTQELVNFLQEALQIAE 1320
            EPLHQPLLQPLLLRPLRLKTIDQQVGTVTALNFCLALRPPLLKLTQELVNFLQEALQIAE
Sbjct: 1261 EPLHQPLLQPLLLRPLRLKTIDQQVGTVTALNFCLALRPPLLKLTQELVNFLQEALQIAE 1320

Query: 1321 ADETVWVVKFMNPKIATSLNKLRTACIELLCTTMAWADFKTPNHSELRAKIISMFFKSLT 1380
            ADETVWVVKFMNPKIATSLNKLRTACIELLCTTMAWADFKTPNHSELRAKIISMFFKSLT
Sbjct: 1321 ADETVWVVKFMNPKIATSLNKLRTACIELLCTTMAWADFKTPNHSELRAKIISMFFKSLT 1380

Query: 1381 CRTPEVVAVAKEGLRQVINQQRMPKDLLQGSLRPILVNLAHTKNLSMPLLQGLARLLELL 1440
            CRTPEVVAVAKEGLRQVINQQRMPKDLLQGSLRPILVNLAHTKNLSMPLLQGLARLLELL
Sbjct: 1381 CRTPEVVAVAKEGLRQVINQQRMPKDLLQGSLRPILVNLAHTKNLSMPLLQGLARLLELL 1440

Query: 1441 ASWFNVTLGGKLLEHLKKWLEPEKLAQSQKAWKAGEEPKIAAAIIELFHLLPMAASKFLD 1500
            ASWFNVTLGGKLLEHLKKWLEPEKLAQSQKAWKAGEEPKIAAAIIELFHLLPMAASKFLD
Sbjct: 1441 ASWFNVTLGGKLLEHLKKWLEPEKLAQSQKAWKAGEEPKIAAAIIELFHLLPMAASKFLD 1500

Query: 1501 ELVTLTIDLEGALPPGQVYSEVNSPYRVPLIKFLNRYAPLAVDYFLARLSEPKYFRRFMY 1560
            ELVTLTIDLEGALPPGQVYSEVNSPYRVPLIKFLNRYAPLAVDYFLARLSEPKYFRRFMY
Sbjct: 1501 ELVTLTIDLEGALPPGQVYSEVNSPYRVPLIKFLNRYAPLAVDYFLARLSEPKYFRRFMY 1560

Query: 1561 IIRSDAGQPLREELAKSPQKILASAFPEFAPKSEAALTPGSSTSPAPLSGDEGLVTPDAS 1620
            IIRSDAGQPLREELAKSPQKILASAFPEFAPKSEAALTPGSSTSPAPLSGDEGLVTPDAS
Sbjct: 1561 IIRSDAGQPLREELAKSPQKILASAFPEFAPKSEAALTPGSSTSPAPLSGDEGLVTPDAS 1620

Query: 1621 DPPSAPSSVVSDAYFRGLALIKTLVKLMPGWLQNNRVVFDTLVLVWKSPARIARLHNEQE 1680
            DPPSAPSSVVSDAYFRGLALIKTLVKLMPGWLQNNRVVFDTLVLVWKSPARIARLHNEQE
Sbjct: 1621 DPPSAPSSVVSDAYFRGLALIKTLVKLMPGWLQNNRVVFDTLVLVWKSPARIARLHNEQE 1680

Query: 1681 LNLVQVKESKWLVKCFLNYLRHEKAEVNVLFDILSIFLFHTRIDYTFLKEFYIIEVAEGY 1740
            LNLVQVKESKWLVKCFLNYLRHEKAEVNVLFDILSIFLFHTRIDYTFLKEFYIIEVAEGY
Sbjct: 1681 LNLVQVKESKWLVKCFLNYLRHEKAEVNVLFDILSIFLFHTRIDYTFLKEFYIIEVAEGY 1740

Query: 1741 PPNMKKALLLHFLNLFQSKQLGHDHLVIVMQMLILPMLAHAFQNGQSWEVVDQAIIKTIV 1800
            PPNMKKALLLHFLNLFQSKQLGHDHLVIVMQMLILPMLAHAFQNGQSWEVVDQAIIKTIV
Sbjct: 1741 PPNMKKALLLHFLNLFQSKQLGHDHLVIVMQMLILPMLAHAFQNGQSWEVVDQAIIKTIV 1800

Query: 1801 DKLLDPPEEVSAEYDEPLRIELLQLATLLLKYLQSDLVHHRKELIKFGWNHLKREDSASK 1860
            DKLLDPPEEVSA+YDEPLRIELLQLATLLLKYLQSDLVHHRKELIKFGWNHLKREDSASK
Sbjct: 1801 DKLLDPPEEVSADYDEPLRIELLQLATLLLKYLQSDLVHHRKELIKFGWNHLKREDSASK 1860

Query: 1861 QWAFVNVCHFLEAYQAPEKIILQVFVALLRTCQPENKMLVKQALDILMPALPRRLPLGDS 1920
            QWAFVNVCHFLEAYQAPEKIILQVFVALLRTCQPENKMLVKQALDILMPALPRRLPLGDS
Sbjct: 1861 QWAFVNVCHFLEAYQAPEKIILQVFVALLRTCQPENKMLVKQALDILMPALPRRLPLGDS 1920

Query: 1921 RMPIWIRYTKKILVEEGHSIPNLIHIFQLIVRHSDLFYSCRAQFVPQMVNSLSRLGLPYN 1980
            RMPIWIRYTKKILVEEGHSIPNLIHIFQLIVRHSDLFYSCRAQFVPQMVNSLSRLGLPYN
Sbjct: 1921 RMPIWIRYTKKILVEEGHSIPNLIHIFQLIVRHSDLFYSCRAQFVPQMVNSLSRLGLPYN 1980

Query: 1981 TTAENRRLAIDLAGLVVGWERQRQNEMKLVTESDAPSHSNDGLTCPPGADPKRMVDGSTF 2040
            TTAENRRLAIDLAGLVVGWERQRQNEMKLVTESDAPSHSNDGLTCPPGADPKRMVDGSTF
Sbjct: 1981 TTAENRRLAIDLAGLVVGWERQRQNEMKLVTESDAPSHSNDGLTCPPGADPKRMVDGSTF 2040

Query: 2041 PEDSTKRVKVEPGLQSLCVMSPGGASSMPNIETPGSTTQPDEEFKPNAAMEEMIINFLIR 2100
            PEDSTKRVKVEPGLQSLCVMSPGGASSMPNIETPGSTTQPDEEFKPNAAMEEMIINFLIR
Sbjct: 2041 PEDSTKRVKVEPGLQSLCVMSPGGASSMPNIETPGSTTQPDEEFKPNAAMEEMIINFLIR 2100

Query: 2101 VALVIEPKDKEATAMYKQALELLSQALEVWPNANVKFNYLEKLLSSIQPSQSKDPSTALA 2160
            VALVIEPKDKEATAMYKQALELLSQALEVWPNANVKFNYLEKLLSSIQPSQSKDPSTALA
Sbjct: 2101 VALVIEPKDKEATAMYKQALELLSQALEVWPNANVKFNYLEKLLSSIQPSQSKDPSTALA 2160

Query: 2161 QGLDVMNKVLEKQPHLFVRNNINQISQILEPCFKHKMLDAGKSLCSLLRMVFVAYPLEGV 2220
            QGLDVMNKVLEKQPHLFVRNNINQISQILEPCFKHKMLD GKSLCSLLRMVFVAYPLEGV
Sbjct: 2161 QGLDVMNKVLEKQPHLFVRNNINQISQILEPCFKHKMLDTGKSLCSLLRMVFVAYPLEGV 2220

Query: 2221 TTPPDVKLLYQKVDELIKNHINNLTAPQTSSEDNTASSISFVLLVIKTLTEVQKNLIDPY 2280
            TTPPDVKLLYQKVDELIKNHINNLTAPQTSSEDNTASSISFVLLVIKTLTEVQKNLIDPY
Sbjct: 2221 TTPPDVKLLYQKVDELIKNHINNLTAPQTSSEDNTASSISFVLLVIKTLTEVQKNLIDPY 2280

Query: 2281 NLGRILQRLARDMGSSAGSHLRQGQRMDPDSAVTSSRQSADVGTVISNLKSVLKLINERV 2340
            NLGRILQRLARDMGSSAGSHLRQGQRMDPDSAVTSSRQSADVGTVISNLKSVLKLINERV
Sbjct: 2281 NLGRILQRLARDMGSSAGSHLRQGQRMDPDSAVTSSRQSADVGTVISNLKSVLKLINERV 2340

Query: 2341 MLVPECKRSVTQIMNSLLSEKGTDASVLLCILDVIKGWVEDDFSKMGTSVSSSSFLAPKE 2400
            MLVPECKRSVTQIMNSLLSEKGTDASVLLCILDVIKGWVEDDFSKMGTSVSSSSFLAPKE
Sbjct: 2341 MLVPECKRSVTQIMNSLLSEKGTDASVLLCILDVIKGWVEDDFSKMGTSVSSSSFLAPKE 2400

Query: 2401 IVSFLQKLSQVDKQNFSSSAAEEWDGKYLQLLYEICADSNKYPLSLRQEVFQKVERQFML 2460
            IVSFLQKLSQVDKQNFSSSAAEEWDGKYLQLLYEICADSNKYPLSLRQEVFQKVERQFML
Sbjct: 2401 IVSFLQKLSQVDKQNFSSSAAEEWDGKYLQLLYEICADSNKYPLSLRQEVFQKVERQFML 2460

Query: 2461 GLRARDPETRKKFFTLYHESLGKTLFIRLQYIIQIQDWEALSDVFWLKQGLDLLLAVLVE 2520
            GLRARDPETRKKFFTLYHESLGKTLFIRLQYIIQIQDWEALSDVFWLKQGLDLLLAVLVE
Sbjct: 2461 GLRARDPETRKKFFTLYHESLGKTLFIRLQYIIQIQDWEALSDVFWLKQGLDLLLAVLVE 2520

Query: 2521 DKPITLAPNSARLPPLLVSGHVADSSAVQPQVNDAQEGLEDAPLTFDSLVHKHAQFLNRT 2580
            DKPITLAPNSARLPPLLVSGHVADSSAVQPQVNDAQEGLEDAPLTFDSLVHKHAQFLNRT
Sbjct: 2521 DKPITLAPNSARLPPLLVSGHVADSSAVQPQVNDAQEGLEDAPLTFDSLVHKHAQFLNRT 2580

Query: 2581 SKLQVADLIIPLRELAHTDANVAYHLWVLVFPIVWVTLHKEEQVALAKPMISLLSKDYHK 2640
            SKLQVADLIIPLRELAHTDANVAYHLWVLVFPIVWVTLHKEEQVALAKPMISLLSKDYHK
Sbjct: 2581 SKLQVADLIIPLRELAHTDANVAYHLWVLVFPIVWVTLHKEEQVALAKPMISLLSKDYHK 2640

Query: 2641 KQQASRPNVVQALLEGLQLSHPQPRMPSELIKYIGRTYNAWHIALALLESHVMLFMNETK 2700
            KQQASRPNVVQALLEGLQLSHPQPRMPSELIKYIGRTYNAWHI+LALLESHVMLFMNETK
Sbjct: 2641 KQQASRPNVVQALLEGLQLSHPQPRMPSELIKYIGRTYNAWHISLALLESHVMLFMNETK 2700

Query: 2701 CSESLAELYRLLNEEDMRCGLWKRKAISAETKAGLSLVQHGYWQRAQILFYQSMVKATQG 2760
            CSESLAELYRLLNEEDMRCGLWKRKAISAETKAGLSLVQHGYWQRAQILFYQSMVKATQG
Sbjct: 2701 CSESLAELYRLLNEEDMRCGLWKRKAISAETKAGLSLVQHGYWQRAQILFYQSMVKATQG 2760

Query: 2761 TYNNNVPKAEMCLWEEQWLYCASQLSQWEALVDFGKSIENYEILLDSLWKVPDWAYMKEH 2820
            TYNNNVPKAEMCLWEEQWLYCASQLSQWEALVDFGKSIENYEILLDSLWKVPDWAYMKEH
Sbjct: 2761 TYNNNVPKAEMCLWEEQWLYCASQLSQWEALVDFGKSIENYEILLDSLWKVPDWAYMKEH 2820

Query: 2821 VIPKAQVEETPKLRLIQAYFSLHDRSTNGVADAENIVGKGVDLALEQWWQLPEMSVHARI 2880
            VIPKAQVEETPKLRLIQAYFSLHDRSTNGVADAENIVGKGVDLALEQWWQLPEMSVHARI
Sbjct: 2821 VIPKAQVEETPKLRLIQAYFSLHDRSTNGVADAENIVGKGVDLALEQWWQLPEMSVHARI 2880

Query: 2881 PLLQQFQQLVEVQESSRILVDIANGNKHSGSSVVSVHTNLYADLKDILETWRLRIPNEWD 2940
            PLLQQFQQLVEVQESSRILVDIANGNKHSGSSVVSVHTNLYADLKDILETWRLRIPNEWD
Sbjct: 2881 PLLQQFQQLVEVQESSRILVDIANGNKHSGSSVVSVHTNLYADLKDILETWRLRIPNEWD 2940

Query: 2941 SMTVWCDLLQWRNEMYNAVIDAFKDFGTTNSQLHHLGFRDKAWNVNKLAHVARKQGLHDV 3000
            SMTVWCDLLQWRNEMYNAVIDAFKDFGTTNSQLHHLGFRDKAWNVNKLAHVARKQGLHDV
Sbjct: 2941 SMTVWCDLLQWRNEMYNAVIDAFKDFGTTNSQLHHLGFRDKAWNVNKLAHVARKQGLHDV 3000

Query: 3001 CVGILEKMYGHSTMEVQEAFVKIREQAKAYLEMKGELTSGLNLINSTNLDYFPVKHKAEI 3060
            CVGILEKMYGHSTMEVQEAFVKIREQAKAYLEMKGELTSGLNLINSTNLDYFPVKHKAEI
Sbjct: 3001 CVGILEKMYGHSTMEVQEAFVKIREQAKAYLEMKGELTSGLNLINSTNLDYFPVKHKAEI 3060

Query: 3061 FRLKGDFQLKLSDSEGANHSYSSAISLFKNLPKGWISWGNYCDMAYKESHEEIWLEYAVS 3120
            FRLKGDFQLKLSDSEGANHSYSSAISLFKNLPKGWISWGNYCDMAYKESHEEIWLEYAVS
Sbjct: 3061 FRLKGDFQLKLSDSEGANHSYSSAISLFKNLPKGWISWGNYCDMAYKESHEEIWLEYAVS 3120

Query: 3121 CFLQGIKFGISNSRNHLARVLYLLSFDTPNEPVGRAFDKYLDQIPHWVWLSWIPQLLLSL 3180
            CFLQGIKFGISNSRNHLARVLYLLSFDTPNEPVGRAFDKYLDQIPHWVWLSWIPQLLLSL
Sbjct: 3121 CFLQGIKFGISNSRNHLARVLYLLSFDTPNEPVGRAFDKYLDQIPHWVWLSWIPQLLLSL 3180

Query: 3181 QRTEAPHCKLVLLKIANVYPQALYYWLRTYLLERRDVANKSELGRMAMAQQRMQQNTSSA 3240
            QRTEAPHCKLVLLKIANVYPQALYYWLRTYLLERRDVANKSELGRMAMAQQRMQQNT SA
Sbjct: 3181 QRTEAPHCKLVLLKIANVYPQALYYWLRTYLLERRDVANKSELGRMAMAQQRMQQNTGSA 3240

Query: 3241 GSLGLTDGSSRVAHGGSSTSTDNQVHQGTQSGSGIGSHDGGNSHSQEPERSTGVESSTHA 3300
            GSLGLTDGSSRV HGGSSTSTDNQVHQGTQSGSGIGSHDGGNSHSQEPERSTGVESSTHA
Sbjct: 3241 GSLGLTDGSSRVGHGGSSTSTDNQVHQGTQSGSGIGSHDGGNSHSQEPERSTGVESSTHA 3300

Query: 3301 GNDQSLPQTSSNVNEGTQNALRRSAALGLVGSAASAFDAAKDIMEALRSKHTNLASELEI 3360
            GNDQSLPQTSSNVNEGTQNALRRSAALGLVGSAASAFDAAKDIMEALRSKHTNLASELEI
Sbjct: 3301 GNDQSLPQTSSNVNEGTQNALRRSAALGLVGSAASAFDAAKDIMEALRSKHTNLASELEI 3360

Query: 3361 LLTEIGSRFVTLPEERLLAVVNALLHRCYKYPTATTAEVPQSLKKELSGVCKACFSADAV 3420
            LLTEIGSRFVTLPEERLLAVVNALLHRCYKYPTATTAEVPQSLKKELSGVCKACFSADAV
Sbjct: 3361 LLTEIGSRFVTLPEERLLAVVNALLHRCYKYPTATTAEVPQSLKKELSGVCKACFSADAV 3420

Query: 3421 NKHVDFVREYKQDFERDLDPESTSTFPATLSELTERLKHWKNVLQGNVEDRFPAVLRLED 3480
            NKHVDFVREYKQDFERDLDPESTSTFPATLSELTERLKHWKNVLQGNVEDRFPAVLRLED
Sbjct: 3421 NKHVDFVREYKQDFERDLDPESTSTFPATLSELTERLKHWKNVLQGNVEDRFPAVLRLED 3480

Query: 3481 ESRVLRDFHVVDVEVPGQYFTDQEIAPDHTVKLDRVGADIPIVRRHGSSFRRLTLIGSDG 3540
            ESRVLRDFHVVDVEVPGQYFTDQEIAPDHTVKLDRVGADIPIVRRHGSSFRRLTLIGSDG
Sbjct: 3481 ESRVLRDFHVVDVEVPGQYFTDQEIAPDHTVKLDRVGADIPIVRRHGSSFRRLTLIGSDG 3540

Query: 3541 SQRHFIVQTSLTPNARSDERILQLFRVMNQMFDKHKESRRRHLCIHTPIIIPVWSQVRMV 3600
            SQRHFIVQTSLTPNARSDERILQLFRVMNQMFDKHKESRRRHLCIHTPIIIPVWSQVRMV
Sbjct: 3541 SQRHFIVQTSLTPNARSDERILQLFRVMNQMFDKHKESRRRHLCIHTPIIIPVWSQVRMV 3600

Query: 3601 EDDLMYSTFLEVYENHCARNDQEADLPITYFKEQLNQAISGQIAPEAVLDLRLQAYGDIT 3660
            EDDLMYSTFLEVYENHCARNDQEADLPITYFKEQLNQAISGQIAPEAVLDLRLQAYGDIT
Sbjct: 3601 EDDLMYSTFLEVYENHCARNDQEADLPITYFKEQLNQAISGQIAPEAVLDLRLQAYGDIT 3660

Query: 3661 RNLVNEGIFSQYMYKTLLSGNHMWAFKKQFAIQLALSSFMSYMLQIGGRSPNKIYFAKNT 3720
            RNLVNEGIFSQYMYKTLLSGNHMWAFKKQFAIQLALSSFMSYMLQIGGRSPNKIYFAKNT
Sbjct: 3661 RNLVNEGIFSQYMYKTLLSGNHMWAFKKQFAIQLALSSFMSYMLQIGGRSPNKIYFAKNT 3720

Query: 3721 GKIFQTDFHPAYDTNGMIEFNEPVPFRLTRNMQAFFSHFGVEGLIVSAMCSAAQAVVSPK 3780
            GKIFQTDFHPAYDTNGMIEFNEPVPFRLTRNMQAFFSHFGVEGLIVSAMCSAAQAVVSPK
Sbjct: 3721 GKIFQTDFHPAYDTNGMIEFNEPVPFRLTRNMQAFFSHFGVEGLIVSAMCSAAQAVVSPK 3780

Query: 3781 QNQHLWHQLAMFFRDELLSWSWRRPLGMPLASIAGGGMSPADFKQKVTINVDHVIGRING 3840
            QNQHLWHQLAMFFRDELLSWSWRRPLGMPLASIAGGGMSPADFKQKVTINVDHVIGRING
Sbjct: 3781 QNQHLWHQLAMFFRDELLSWSWRRPLGMPLASIAGGGMSPADFKQKVTINVDHVIGRING 3840

Query: 3841 IAPQYFSEEEENAMDPPQSVQRGVSDLVDAALMPRHLCMMDPTWHPWF 3889
            IAPQYFSEEEENAMDPPQSVQRGVSDLV+AALMPRHLCMMDPTWHPWF
Sbjct: 3841 IAPQYFSEEEENAMDPPQSVQRGVSDLVNAALMPRHLCMMDPTWHPWF 3888

BLAST of MS010599 vs. NCBI nr
Match: XP_038882073.1 (transformation/transcription domain-associated protein-like [Benincasa hispida])

HSP 1 Score: 7387.7 bits (19167), Expect = 0.0e+00
Identity = 3737/3890 (96.07%), Postives = 3810/3890 (97.94%), Query Frame = 0

Query: 1    MSPIQNFELHSRQLVEPELSIQTRLQMATEVRDSLEIAHTPEYLNFLKCYFRAFSIILVQ 60
            MSPIQNFE HSR LVEPEL IQTRLQMATEVRDSLEIAHTPEYLNFLKCYFRAFS+IL++
Sbjct: 1    MSPIQNFEQHSRHLVEPELCIQTRLQMATEVRDSLEIAHTPEYLNFLKCYFRAFSVILLK 60

Query: 61   ITKPQYTDNHEHKLRNIVVEILNRLPHSEVLRPFVQDLLKVAMQVLTTDNEENGLICIRI 120
            ITKPQ+TD+HEHKLRNIVVEILNRLPHSEVLRPFVQDLLKVAMQVLTTDNEENGLICIRI
Sbjct: 61   ITKPQFTDSHEHKLRNIVVEILNRLPHSEVLRPFVQDLLKVAMQVLTTDNEENGLICIRI 120

Query: 121  IFDLLRNFRPTLENEVQPFLDFVCKIYQNFKLTVSHFFENSAAGGEDIKPMDVSTSTDQT 180
            IFDLLRNFRPTLENEVQPFLDFVCKIYQNFKLTVSHFFEN+AA  ED KPMDVSTS+DQ+
Sbjct: 121  IFDLLRNFRPTLENEVQPFLDFVCKIYQNFKLTVSHFFENTAAVVEDTKPMDVSTSSDQS 180

Query: 181  ITTGYTGTVQLNPSTRSFKIVTESPLVVMFLFQLYSRLVQTNIPVLLPLMVSAISVPGPE 240
            +T+G TGTVQLNPSTRSFKIVTESPLVVMFLFQLYSRLVQTNIP+LLPLMVSAISVPGPE
Sbjct: 181  LTSGCTGTVQLNPSTRSFKIVTESPLVVMFLFQLYSRLVQTNIPLLLPLMVSAISVPGPE 240

Query: 241  KVPPFLKTHFIELKGAQVKTVSFLTYLLRSSADYIRPHEESICKSIVNLLVTCSDSVSIR 300
            KVPP LKTHFIELKGAQVKTVSFLTYLLRSSADYIRPHEESICKSIVNLLVTCSDSVSIR
Sbjct: 241  KVPPSLKTHFIELKGAQVKTVSFLTYLLRSSADYIRPHEESICKSIVNLLVTCSDSVSIR 300

Query: 301  KELLVALKHVLGTEYKRGLFPLIDTLLEEKVLVGTGRACYETLRPLAYSLLAEIVHHVRG 360
            KELLVALKHVLGTEYKRGLFPLIDTLLEEKVLVGTGRACYETLRPLAYSLLAEIVHHVRG
Sbjct: 301  KELLVALKHVLGTEYKRGLFPLIDTLLEEKVLVGTGRACYETLRPLAYSLLAEIVHHVRG 360

Query: 361  DLTLSQLSRIIYLFSSNMHDASLSLSIHTTCARLMLNLVEPIFEKGVDQTSMDEARILLG 420
            DL+LSQLSRIIYLFSSNMHDASLSLSIHTTCARLMLNLVEPIFEKGVDQTSMDE+RILLG
Sbjct: 361  DLSLSQLSRIIYLFSSNMHDASLSLSIHTTCARLMLNLVEPIFEKGVDQTSMDESRILLG 420

Query: 421  RILDAFVGKFSTFKHTIPQLLEEGEEGKDRANMRSKLELPVQAVLNLQVPVEHSKEVNDC 480
            RILDAFVGKFSTFKHTIPQLLEEGEEGKDRAN+RSKLELPVQAVLNLQVPVEHSKEVNDC
Sbjct: 421  RILDAFVGKFSTFKHTIPQLLEEGEEGKDRANLRSKLELPVQAVLNLQVPVEHSKEVNDC 480

Query: 481  KHLIKTLILGMKTIIWSITHAHLPRPQASPSPNGTHPQMLVSPSSNLATPQAFKGMREDE 540
            KHLIKTLI+GMKTIIWSITHAHLPR Q SPS NGTHPQMLV+PSSNLATPQAFKGMREDE
Sbjct: 481  KHLIKTLIMGMKTIIWSITHAHLPRSQVSPSQNGTHPQMLVNPSSNLATPQAFKGMREDE 540

Query: 541  VCKASGVLKSGVHCLTLFKEKDEEVEMLHLFSQILTVMEPRDLMDMFSLCMPELFDCMIT 600
            VCKASGVLKSGVHCLTLFKEKDEEVEMLHLFSQILTVMEPRDLMDMFSLCMPELFDCMI+
Sbjct: 541  VCKASGVLKSGVHCLTLFKEKDEEVEMLHLFSQILTVMEPRDLMDMFSLCMPELFDCMIS 600

Query: 601  NTQLVHLFSTFLQTPKVYRPFADVLVNFLVSSKLDVLKHPDSPGAKLVLHLFRFVFGAVA 660
            N QLVHLFSTFLQTPKVYRPFADVLVNFLVSSKLDVLKHPDSPGAKLVLHLFRFVFGAV+
Sbjct: 601  NAQLVHLFSTFLQTPKVYRPFADVLVNFLVSSKLDVLKHPDSPGAKLVLHLFRFVFGAVS 660

Query: 661  KAPSDFERILQPHVTVIMEVCVRSATEVERPLGYMQLLRIMFRALAGCKFELLLRDLIPL 720
            KAPSDFERILQPHVTVIMEVCV+SATEVERPLGYMQLLR+MFRALAGCKFELLLRDLI L
Sbjct: 661  KAPSDFERILQPHVTVIMEVCVKSATEVERPLGYMQLLRVMFRALAGCKFELLLRDLISL 720

Query: 721  LQPCLNMLLTMFDGPTGEDMRDLLLELCLTLPARLSSLLPHLPRLMKPLVLCLKGSDDLV 780
            LQPCLNMLLTM DGPTGEDMRDLLLELCLTLPARLSSLLPHLPRLMKPLVLCLKGSD+LV
Sbjct: 721  LQPCLNMLLTMLDGPTGEDMRDLLLELCLTLPARLSSLLPHLPRLMKPLVLCLKGSDELV 780

Query: 781  SLGLRTLEFWVDSLNPDFLEPSMANVMSEVILALWSHLRPIPYPWGAKALQVLGKLGGRN 840
             LGLRTLEFWVDSLNPDFLEPSMANVMSEVILALWSHLRP PYPWGAKALQVLGKLGGRN
Sbjct: 781  GLGLRTLEFWVDSLNPDFLEPSMANVMSEVILALWSHLRPPPYPWGAKALQVLGKLGGRN 840

Query: 841  RRFLKEPLALECKENPEHGLRLILTFEPSTPFLVPLDRCINLAVSTVMNKTGGVDSFYRK 900
            RRFLKEPLALECKENPEHGLRLILTFEPSTPFLVPLDRCINLAVS VMNKTGGVDSFYRK
Sbjct: 841  RRFLKEPLALECKENPEHGLRLILTFEPSTPFLVPLDRCINLAVSAVMNKTGGVDSFYRK 900

Query: 901  QALKFLRVCLSSQLNLPGNVADDGHTPRQLSTLLVSPVDSSLRRSETPEGKADLGVKTKT 960
            QALKFLRVCLSSQLNLPG VADDG+TPRQLSTLL+S VDSS RRSETPE KADLGVKTKT
Sbjct: 901  QALKFLRVCLSSQLNLPGIVADDGYTPRQLSTLLISSVDSSWRRSETPEAKADLGVKTKT 960

Query: 961  QLMAEKSVFKILLMTIIAAGSEEDLHEPKDDFVLNVCRHFAILFHIDSSLNSSPVASASL 1020
            QLMAEKSVFK+LLMTIIAAGSEEDLHEPKDDFVLNVCRHFAILFHIDSSLN+ PVASASL
Sbjct: 961  QLMAEKSVFKLLLMTIIAAGSEEDLHEPKDDFVLNVCRHFAILFHIDSSLNNPPVASASL 1020

Query: 1021 GSTLLPPNVSANSRLRSSACCNLKELDPLIFLDALVEVLADENRVHAKAALNALNLFSEI 1080
            GSTLLP NV+ANSRL+SSACCNLKELDPLIFLDALVEVLADENR+HAKAALNALNLFSE+
Sbjct: 1021 GSTLLPSNVNANSRLKSSACCNLKELDPLIFLDALVEVLADENRLHAKAALNALNLFSEM 1080

Query: 1081 LLFLARAKQTDVMMTRGPSTPMIVSSPSKSPVYSPPPSVRIPVFEQLLPRLLHCCYGSTW 1140
            LLFL R KQTDVMMTRGP TPM VSSP  SPVYSPPPSVRIPVFEQ LPRLLHCCYG TW
Sbjct: 1081 LLFLCRGKQTDVMMTRGPGTPMSVSSP-MSPVYSPPPSVRIPVFEQFLPRLLHCCYGCTW 1140

Query: 1141 QAQMGGIMGLGALVGKVTVETLCLFQVRIVRGLVYVLKRLPIYASKEQEETSQVLNQVLR 1200
            QAQMGG+MGLGALVGKVTVETLC FQV+IVRGLVYVLKRLP+YASKEQEETSQVLN VLR
Sbjct: 1141 QAQMGGVMGLGALVGKVTVETLCHFQVKIVRGLVYVLKRLPVYASKEQEETSQVLNHVLR 1200

Query: 1201 VVNNVDEANSEPRRQSFHGVVDILASELFNPNSSTIVRKNVQSCLALLASRTGSEVSELL 1260
            VVNNVDEANSEPRRQSF GVVD+LASELFNPNSSTIVRKNVQSCLALLASRTGSEVSELL
Sbjct: 1201 VVNNVDEANSEPRRQSFQGVVDVLASELFNPNSSTIVRKNVQSCLALLASRTGSEVSELL 1260

Query: 1261 EPLHQPLLQPLLLRPLRLKTIDQQVGTVTALNFCLALRPPLLKLTQELVNFLQEALQIAE 1320
            EPL+QPLLQPLLLRPLRLKTIDQQVGTVTALNFCLALRPPLLKLTQELVNFLQEALQIAE
Sbjct: 1261 EPLYQPLLQPLLLRPLRLKTIDQQVGTVTALNFCLALRPPLLKLTQELVNFLQEALQIAE 1320

Query: 1321 ADETVWVVKFMNPKIATSLNKLRTACIELLCTTMAWADFKTPNHSELRAKIISMFFKSLT 1380
            ADETVWVVKFMNPKIATSLNKLRTACIELLCTTMAWADFKTPNHSELRAKIISMFFKSLT
Sbjct: 1321 ADETVWVVKFMNPKIATSLNKLRTACIELLCTTMAWADFKTPNHSELRAKIISMFFKSLT 1380

Query: 1381 CRTPEVVAVAKEGLRQVINQQRMPKDLLQGSLRPILVNLAHTKNLSMPLLQGLARLLELL 1440
            CRTPEVVAVAKEGLRQVINQQRMPKDLLQGSLRPILVNLAHTKNLSMPLLQGLARLLELL
Sbjct: 1381 CRTPEVVAVAKEGLRQVINQQRMPKDLLQGSLRPILVNLAHTKNLSMPLLQGLARLLELL 1440

Query: 1441 ASWFNVTLGGKLLEHLKKWLEPEKLAQSQKAWKAGEEPKIAAAIIELFHLLPMAASKFLD 1500
            ASWFNVTLGGKLLEHLKKWLEPEKLAQSQKAWKAGEEPKIAAAIIELFHLLPMAASKFLD
Sbjct: 1441 ASWFNVTLGGKLLEHLKKWLEPEKLAQSQKAWKAGEEPKIAAAIIELFHLLPMAASKFLD 1500

Query: 1501 ELVTLTIDLEGALPPGQVYSEVNSPYRVPLIKFLNRYAPLAVDYFLARLSEPKYFRRFMY 1560
            ELVTLTIDLEGALPPGQVYSEVNSPYRVPLIKFLNRYAPLAVDYFL RLSEPKYFRRFMY
Sbjct: 1501 ELVTLTIDLEGALPPGQVYSEVNSPYRVPLIKFLNRYAPLAVDYFLDRLSEPKYFRRFMY 1560

Query: 1561 IIRSDAGQPLREELAKSPQKILASAFPEFAPKSEAALTPGSSTSPAPLSGDEGLVTP-DA 1620
            IIRSDAGQPLREELAKSPQKILASAFPEF PKSEAALTPGSST PAPLSGDEGLVTP D 
Sbjct: 1561 IIRSDAGQPLREELAKSPQKILASAFPEFVPKSEAALTPGSSTPPAPLSGDEGLVTPSDV 1620

Query: 1621 SDPPSAPSSVVSDAYFRGLALIKTLVKLMPGWLQNNRVVFDTLVLVWKSPARIARLHNEQ 1680
            SDPPSAPSSVVSDAYFRGLAL+KTLVKLMPGWLQ+NRVVFDTLVLVWKSPARIARL NEQ
Sbjct: 1621 SDPPSAPSSVVSDAYFRGLALVKTLVKLMPGWLQSNRVVFDTLVLVWKSPARIARLRNEQ 1680

Query: 1681 ELNLVQVKESKWLVKCFLNYLRHEKAEVNVLFDILSIFLFHTRIDYTFLKEFYIIEVAEG 1740
            ELNLVQVKESKWLVKCFLNYLRHEKAEVNVLFDILSIFLFHTRIDYTFLKEFYIIEVAEG
Sbjct: 1681 ELNLVQVKESKWLVKCFLNYLRHEKAEVNVLFDILSIFLFHTRIDYTFLKEFYIIEVAEG 1740

Query: 1741 YPPNMKKALLLHFLNLFQSKQLGHDHLVIVMQMLILPMLAHAFQNGQSWEVVDQAIIKTI 1800
            YPPNMKKALLLHFLNLFQSKQLGHDHLV+VMQMLILPMLAHAFQNGQSWEVVDQAIIKTI
Sbjct: 1741 YPPNMKKALLLHFLNLFQSKQLGHDHLVVVMQMLILPMLAHAFQNGQSWEVVDQAIIKTI 1800

Query: 1801 VDKLLDPPEEVSAEYDEPLRIELLQLATLLLKYLQSDLVHHRKELIKFGWNHLKREDSAS 1860
            VDKLLDPPEEV+AEYDEPLRIELLQLATLLLKYLQSDLVHHRKELIKFGWNHLKREDSAS
Sbjct: 1801 VDKLLDPPEEVTAEYDEPLRIELLQLATLLLKYLQSDLVHHRKELIKFGWNHLKREDSAS 1860

Query: 1861 KQWAFVNVCHFLEAYQAPEKIILQVFVALLRTCQPENKMLVKQALDILMPALPRRLPLGD 1920
            KQWAFVNVCHFLEAYQAPEKIILQVFVALLRTCQPENKMLVKQALDILMPALPRRLPLGD
Sbjct: 1861 KQWAFVNVCHFLEAYQAPEKIILQVFVALLRTCQPENKMLVKQALDILMPALPRRLPLGD 1920

Query: 1921 SRMPIWIRYTKKILVEEGHSIPNLIHIFQLIVRHSDLFYSCRAQFVPQMVNSLSRLGLPY 1980
            SRMPIWIRYTKKILVEEGHSIPNLIHIFQLIVRHSDLFYSCRAQFVPQMVNSLSRLGLPY
Sbjct: 1921 SRMPIWIRYTKKILVEEGHSIPNLIHIFQLIVRHSDLFYSCRAQFVPQMVNSLSRLGLPY 1980

Query: 1981 NTTAENRRLAIDLAGLVVGWERQRQNEMKLVTESDAPSHSNDGLT-CPPGADPKRMVDGS 2040
            NTTAENRRLAIDLAGLVVGWERQRQNEMK VTESD PSHSNDGLT CPPGADPKR+VDGS
Sbjct: 1981 NTTAENRRLAIDLAGLVVGWERQRQNEMKHVTESDVPSHSNDGLTSCPPGADPKRLVDGS 2040

Query: 2041 TFPEDSTKRVKVEPGLQSLCVMSPGGASSMPNIETPGSTTQPDEEFKPNAAMEEMIINFL 2100
            TFPEDSTKRVKVEPGLQSLCVMSPGGASSMPNIETPGSTTQPDEEFKPNAAMEEMIINFL
Sbjct: 2041 TFPEDSTKRVKVEPGLQSLCVMSPGGASSMPNIETPGSTTQPDEEFKPNAAMEEMIINFL 2100

Query: 2101 IRVALVIEPKDKEATAMYKQALELLSQALEVWPNANVKFNYLEKLLSSIQPSQSKDPSTA 2160
            IRVALVIEPKDKEATAMYKQALELLSQALEVWPNANVKFNYLEKLLSSIQPSQSKDPSTA
Sbjct: 2101 IRVALVIEPKDKEATAMYKQALELLSQALEVWPNANVKFNYLEKLLSSIQPSQSKDPSTA 2160

Query: 2161 LAQGLDVMNKVLEKQPHLFVRNNINQISQILEPCFKHKMLDAGKSLCSLLRMVFVAYPLE 2220
            LAQGLDVMNKVLEKQPHLFVRNNINQISQILEPCFKHKMLDAGKSLCSLLRMVFVAYPLE
Sbjct: 2161 LAQGLDVMNKVLEKQPHLFVRNNINQISQILEPCFKHKMLDAGKSLCSLLRMVFVAYPLE 2220

Query: 2221 GVTTPPDVKLLYQKVDELIKNHINNLTAPQTSSEDNTASSISFVLLVIKTLTEVQKNLID 2280
            GVTTPPDVKLLYQKVDELIKNHINNLTAPQTSSEDNTASSISFVLLVIKTLTEVQKNLID
Sbjct: 2221 GVTTPPDVKLLYQKVDELIKNHINNLTAPQTSSEDNTASSISFVLLVIKTLTEVQKNLID 2280

Query: 2281 PYNLGRILQRLARDMGSSAGSHLRQGQRMDPDSAVTSSRQSADVGTVISNLKSVLKLINE 2340
            PYNLGRILQRLARDMGSSAGSHLRQGQRMDPDSAVTSSRQSADVGTVISNLKSVLKLINE
Sbjct: 2281 PYNLGRILQRLARDMGSSAGSHLRQGQRMDPDSAVTSSRQSADVGTVISNLKSVLKLINE 2340

Query: 2341 RVMLVPECKRSVTQIMNSLLSEKGTDASVLLCILDVIKGWVEDDFSKMGTSVSSSSFLAP 2400
            RVMLVPECKRSVTQIMNSLLSEKGTDASVLLCILDVIKGW+EDDFSKMG SVSSSSFLAP
Sbjct: 2341 RVMLVPECKRSVTQIMNSLLSEKGTDASVLLCILDVIKGWIEDDFSKMGPSVSSSSFLAP 2400

Query: 2401 KEIVSFLQKLSQVDKQNFSSSAAEEWDGKYLQLLYEICADSNKYPLSLRQEVFQKVERQF 2460
            KEIVSFLQKLSQVDKQNFSSSAAEEWDGKYLQLLYEICADSNKYP++LRQEVFQKVERQF
Sbjct: 2401 KEIVSFLQKLSQVDKQNFSSSAAEEWDGKYLQLLYEICADSNKYPMALRQEVFQKVERQF 2460

Query: 2461 MLGLRARDPETRKKFFTLYHESLGKTLFIRLQYIIQIQDWEALSDVFWLKQGLDLLLAVL 2520
            MLGLRARDPE RKKFFTLYHESLGKTLFIRLQYIIQ+QDWEALSDVFWLKQGLDLLLAVL
Sbjct: 2461 MLGLRARDPEIRKKFFTLYHESLGKTLFIRLQYIIQVQDWEALSDVFWLKQGLDLLLAVL 2520

Query: 2521 VEDKPITLAPNSARLPPLLVSGHVADSSAVQPQVNDAQEGLEDAPLTFDSLVHKHAQFLN 2580
            VEDKPITLAPNSARLPPLLVSGHVADSS VQ  V DAQEG+EDAPLTFDSLV KH+QFLN
Sbjct: 2521 VEDKPITLAPNSARLPPLLVSGHVADSSVVQHPVIDAQEGIEDAPLTFDSLVLKHSQFLN 2580

Query: 2581 RTSKLQVADLIIPLRELAHTDANVAYHLWVLVFPIVWVTLHKEEQVALAKPMISLLSKDY 2640
            R SKLQVADLIIPLRELAHTDANVAYHLWVLVFPIVWVTLHKEEQVALAKPMI LLSKDY
Sbjct: 2581 RMSKLQVADLIIPLRELAHTDANVAYHLWVLVFPIVWVTLHKEEQVALAKPMIGLLSKDY 2640

Query: 2641 HKKQQASRPNVVQALLEGLQLSHPQPRMPSELIKYIGRTYNAWHIALALLESHVMLFMNE 2700
            HK+QQASRPNVVQALLEGLQLSHPQPRMPSELIKYIG+TYNAWHIALALLESHVMLFMNE
Sbjct: 2641 HKRQQASRPNVVQALLEGLQLSHPQPRMPSELIKYIGKTYNAWHIALALLESHVMLFMNE 2700

Query: 2701 TKCSESLAELYRLLNEEDMRCGLWKRKAISAETKAGLSLVQHGYWQRAQILFYQSMVKAT 2760
            TKC+ESLAELYRLLNEEDMRCGLWKRKA +AETKAGLSLVQHGYWQRAQ LFYQSMVKAT
Sbjct: 2701 TKCAESLAELYRLLNEEDMRCGLWKRKANTAETKAGLSLVQHGYWQRAQSLFYQSMVKAT 2760

Query: 2761 QGTYNNNVPKAEMCLWEEQWLYCASQLSQWEALVDFGKSIENYEILLDSLWKVPDWAYMK 2820
            QGTYNN VPKAEMCLWEEQWL CASQLSQW+AL DFGKSIENYEILLDSLWKVPDW YMK
Sbjct: 2761 QGTYNNTVPKAEMCLWEEQWLCCASQLSQWDALADFGKSIENYEILLDSLWKVPDWTYMK 2820

Query: 2821 EHVIPKAQVEETPKLRLIQAYFSLHDRSTNGVADAENIVGKGVDLALEQWWQLPEMSVHA 2880
            EHVIPKAQVEETPKLRLIQAYFSLHDRSTNGVADAENIVGKGVDLALEQWWQLPEMSVHA
Sbjct: 2821 EHVIPKAQVEETPKLRLIQAYFSLHDRSTNGVADAENIVGKGVDLALEQWWQLPEMSVHA 2880

Query: 2881 RIPLLQQFQQLVEVQESSRILVDIANGNKHSGSSVVSVHTNLYADLKDILETWRLRIPNE 2940
            RIPLLQQFQQLVEVQESSRILVDIANGNKHSGSSVV VH+NLYADLKDILETWRLRIPNE
Sbjct: 2881 RIPLLQQFQQLVEVQESSRILVDIANGNKHSGSSVVGVHSNLYADLKDILETWRLRIPNE 2940

Query: 2941 WDSMTVWCDLLQWRNEMYNAVIDAFKDFGTTNSQLHHLGFRDKAWNVNKLAHVARKQGLH 3000
            WDSMTVWCDLLQWRNEMYNAVIDAFKDFG TNSQLHHLGFRDKAWNVNKLAHVARKQGL+
Sbjct: 2941 WDSMTVWCDLLQWRNEMYNAVIDAFKDFGNTNSQLHHLGFRDKAWNVNKLAHVARKQGLY 3000

Query: 3001 DVCVGILEKMYGHSTMEVQEAFVKIREQAKAYLEMKGELTSGLNLINSTNLDYFPVKHKA 3060
            DVCV IL+KMYGH TMEVQEAFVKIREQAKAYLEMKGELTSGLNLINSTNL+YFPVKHKA
Sbjct: 3001 DVCVAILDKMYGHLTMEVQEAFVKIREQAKAYLEMKGELTSGLNLINSTNLEYFPVKHKA 3060

Query: 3061 EIFRLKGDFQLKLSDSEGANHSYSSAISLFKNLPKGWISWGNYCDMAYKESHEEIWLEYA 3120
            EIFRLKGDFQLKLSDSEGAN SYS+AI+LFKNLPKGWISWGNYCDMAYKESH+E WLEYA
Sbjct: 3061 EIFRLKGDFQLKLSDSEGANQSYSNAITLFKNLPKGWISWGNYCDMAYKESHDETWLEYA 3120

Query: 3121 VSCFLQGIKFGISNSRNHLARVLYLLSFDTPNEPVGRAFDKYLDQIPHWVWLSWIPQLLL 3180
            VSCFLQGIKFGISNSRNHLARVLYLLSFDTPNEPVGRAFDK+LDQIPHWVWLSWIPQLLL
Sbjct: 3121 VSCFLQGIKFGISNSRNHLARVLYLLSFDTPNEPVGRAFDKFLDQIPHWVWLSWIPQLLL 3180

Query: 3181 SLQRTEAPHCKLVLLKIANVYPQALYYWLRTYLLERRDVANKSELGRMAMAQQRMQQNTS 3240
            SLQRTEAPHCKLVLLKIANVYPQALYYWLRTYLLERRDVANKSELGRMAMAQQRMQQN +
Sbjct: 3181 SLQRTEAPHCKLVLLKIANVYPQALYYWLRTYLLERRDVANKSELGRMAMAQQRMQQNAT 3240

Query: 3241 SAGSLGLTDGSSRVAHGGSSTSTDNQVHQGTQSGSGIGSHDGGNSHSQEPERSTGVESST 3300
            SAGSLGL DG SR  HGGS+T TDNQ HQG+QSGSGIGSHDGGN+HSQEPER+TG +SST
Sbjct: 3241 SAGSLGLADGGSRAGHGGSTTPTDNQGHQGSQSGSGIGSHDGGNAHSQEPERTTGADSST 3300

Query: 3301 HAGNDQSLPQTSSNVNEGTQNALRRSAALGLVGSAASAFDAAKDIMEALRSKHTNLASEL 3360
            HAGNDQSLPQ SSNVNEGTQNALRRSAALGLVGSAASAFDAAKDIMEALRSKHTNLASEL
Sbjct: 3301 HAGNDQSLPQPSSNVNEGTQNALRRSAALGLVGSAASAFDAAKDIMEALRSKHTNLASEL 3360

Query: 3361 EILLTEIGSRFVTLPEERLLAVVNALLHRCYKYPTATTAEVPQSLKKELSGVCKACFSAD 3420
            EILLTEIGSRFVTLPEERLLAVVNALLHRCYKYPTATTAEVPQSLKKELSGVCKACFS D
Sbjct: 3361 EILLTEIGSRFVTLPEERLLAVVNALLHRCYKYPTATTAEVPQSLKKELSGVCKACFSVD 3420

Query: 3421 AVNKHVDFVREYKQDFERDLDPESTSTFPATLSELTERLKHWKNVLQGNVEDRFPAVLRL 3480
            AVNKHVDFVREYKQDFERDLDPESTSTFPATLSELTERLKHWKNVLQGNVEDRFPAVL+L
Sbjct: 3421 AVNKHVDFVREYKQDFERDLDPESTSTFPATLSELTERLKHWKNVLQGNVEDRFPAVLKL 3480

Query: 3481 EDESRVLRDFHVVDVEVPGQYFTDQEIAPDHTVKLDRVGADIPIVRRHGSSFRRLTLIGS 3540
            E+ESRVLRDFHVVDVEVPGQYFTDQEIAPDHTVKLDRVGADIPIVRRHGSSFRRLTLIGS
Sbjct: 3481 EEESRVLRDFHVVDVEVPGQYFTDQEIAPDHTVKLDRVGADIPIVRRHGSSFRRLTLIGS 3540

Query: 3541 DGSQRHFIVQTSLTPNARSDERILQLFRVMNQMFDKHKESRRRHLCIHTPIIIPVWSQVR 3600
            DGSQRHFIVQTSLTPNARSDERILQLFRVMNQMFDKHKESRRRHLCIHTPIIIPVWSQVR
Sbjct: 3541 DGSQRHFIVQTSLTPNARSDERILQLFRVMNQMFDKHKESRRRHLCIHTPIIIPVWSQVR 3600

Query: 3601 MVEDDLMYSTFLEVYENHCARNDQEADLPITYFKEQLNQAISGQIAPEAVLDLRLQAYGD 3660
            MVEDDLMYSTFLEVYENHCARNDQEADLPITYFKEQLNQAISGQI PEAV+DLRLQA+GD
Sbjct: 3601 MVEDDLMYSTFLEVYENHCARNDQEADLPITYFKEQLNQAISGQILPEAVVDLRLQAFGD 3660

Query: 3661 ITRNLVNEGIFSQYMYKTLLSGNHMWAFKKQFAIQLALSSFMSYMLQIGGRSPNKIYFAK 3720
            ITRNLVNEGIFSQYMYKTLLSGNHMWAFKKQFAIQLALSSFMSYMLQIGGRSPNKIYFAK
Sbjct: 3661 ITRNLVNEGIFSQYMYKTLLSGNHMWAFKKQFAIQLALSSFMSYMLQIGGRSPNKIYFAK 3720

Query: 3721 NTGKIFQTDFHPAYDTNGMIEFNEPVPFRLTRNMQAFFSHFGVEGLIVSAMCSAAQAVVS 3780
            NTGKIFQTDFHPAYD NGMIEFNEPVPFRLTRNMQAFFS+FGVEGLIVSAMCSAAQAVVS
Sbjct: 3721 NTGKIFQTDFHPAYDANGMIEFNEPVPFRLTRNMQAFFSNFGVEGLIVSAMCSAAQAVVS 3780

Query: 3781 PKQNQHLWHQLAMFFRDELLSWSWRRPLGMPLASIAGGGMSPADFKQKVTINVDHVIGRI 3840
            PKQNQHLWHQLAMFFRDELLSWSWRRPLGMPLASIA GGM+PADFKQKVT NVD VIGRI
Sbjct: 3781 PKQNQHLWHQLAMFFRDELLSWSWRRPLGMPLASIAAGGMNPADFKQKVTTNVDLVIGRI 3840

Query: 3841 NGIAPQYFSEEEENAMDPPQSVQRGVSDLVDAALMPRHLCMMDPTWHPWF 3889
            NGIAPQYFSEEEENAMDPPQSVQRGVS+LVDAAL P++LCMMDPTWHPWF
Sbjct: 3841 NGIAPQYFSEEEENAMDPPQSVQRGVSELVDAALQPKNLCMMDPTWHPWF 3889

BLAST of MS010599 vs. NCBI nr
Match: XP_008440816.1 (PREDICTED: transformation/transcription domain-associated protein-like [Cucumis melo])

HSP 1 Score: 7353.8 bits (19079), Expect = 0.0e+00
Identity = 3720/3891 (95.61%), Postives = 3801/3891 (97.69%), Query Frame = 0

Query: 1    MSPIQNFELHSRQLVEPELSIQTRLQMATEVRDSLEIAHTPEYLNFLKCYFRAFSIILVQ 60
            MSPIQNFE HSR LVEPEL+IQTRLQMATEVRDSLEIAHTPEYLNFLKCYFRAFS+IL++
Sbjct: 1    MSPIQNFEQHSRHLVEPELNIQTRLQMATEVRDSLEIAHTPEYLNFLKCYFRAFSVILLK 60

Query: 61   ITKPQYTDNHEHKLRNIVVEILNRLPHSEVLRPFVQDLLKVAMQVLTTDNEENGLICIRI 120
            ITKPQ+TD+HEHKLRNIVVEILNRLPHSEVLRPFVQDLLKVAMQVLTTDNEENGLICIRI
Sbjct: 61   ITKPQFTDSHEHKLRNIVVEILNRLPHSEVLRPFVQDLLKVAMQVLTTDNEENGLICIRI 120

Query: 121  IFDLLRNFRPTLENEVQPFLDFVCKIYQNFKLTVSHFFENSAAGGEDIKPMDVSTSTDQT 180
            IFDLLRNFRPTLENEVQPFLDFVCKIYQNFKLTVSHFFEN AA  ED+KPM+VSTS+DQ+
Sbjct: 121  IFDLLRNFRPTLENEVQPFLDFVCKIYQNFKLTVSHFFENPAASVEDVKPMEVSTSSDQS 180

Query: 181  ITTGYTGTVQLNPSTRSFKIVTESPLVVMFLFQLYSRLVQTNIPVLLPLMVSAISVPGPE 240
            + +G TGTVQLNPSTRSFKIVTESPLVVMFLFQLYSRLVQTNIP+LLPLMVSAISVPGPE
Sbjct: 181  MNSGCTGTVQLNPSTRSFKIVTESPLVVMFLFQLYSRLVQTNIPLLLPLMVSAISVPGPE 240

Query: 241  KVPPFLKTHFIELKGAQVKTVSFLTYLLRSSADYIRPHEESICKSIVNLLVTCSDSVSIR 300
            KVPP LKTHFIELKGAQVKTVSFLTYLLRSSADYIRPHEESICKSIVNLLVTCSDSVSIR
Sbjct: 241  KVPPSLKTHFIELKGAQVKTVSFLTYLLRSSADYIRPHEESICKSIVNLLVTCSDSVSIR 300

Query: 301  KELLVALKHVLGTEYKRGLFPLIDTLLEEKVLVGTGRACYETLRPLAYSLLAEIVHHVRG 360
            KELLVALKHVLGTEYKRGLFPLIDTLLEEKV+VGTGRACYETLRPLAYSLLAEIVHHVRG
Sbjct: 301  KELLVALKHVLGTEYKRGLFPLIDTLLEEKVVVGTGRACYETLRPLAYSLLAEIVHHVRG 360

Query: 361  DLTLSQLSRIIYLFSSNMHDASLSLSIHTTCARLMLNLVEPIFEKGVDQTSMDEARILLG 420
            DL+LSQLSRIIYLFSSNMHDASLSLSIHTTCARLMLNLVEPIFEKGVDQ SMDE+RILLG
Sbjct: 361  DLSLSQLSRIIYLFSSNMHDASLSLSIHTTCARLMLNLVEPIFEKGVDQASMDESRILLG 420

Query: 421  RILDAFVGKFSTFKHTIPQLLEEGEEGKDRANMRSKLELPVQAVLNLQVPVEHSKEVNDC 480
            RILD+FVGKFSTFKHTIPQLLEEGEEGKDRAN+RSKLELPVQAVLNLQVPVEHSKEVNDC
Sbjct: 421  RILDSFVGKFSTFKHTIPQLLEEGEEGKDRANLRSKLELPVQAVLNLQVPVEHSKEVNDC 480

Query: 481  KHLIKTLILGMKTIIWSITHAHLPRPQASPSPNGTHPQMLVSPSSNLATPQAFKGMREDE 540
            KHLIKTLI+GMKTI+WSITHAHLPR Q SPSPNGTHPQMLV+ SSNLATPQAFKGMREDE
Sbjct: 481  KHLIKTLIMGMKTIVWSITHAHLPRSQVSPSPNGTHPQMLVNSSSNLATPQAFKGMREDE 540

Query: 541  VCKASGVLKSGVHCLTLFKEKDEEVEMLHLFSQILTVMEPRDLMDMFSLCMPELFDCMIT 600
            VCKASGVLKSGVHCLTLFKEKDEEVEMLHLFSQILT+MEPRDLMDMFSLCMPELFDCMI+
Sbjct: 541  VCKASGVLKSGVHCLTLFKEKDEEVEMLHLFSQILTIMEPRDLMDMFSLCMPELFDCMIS 600

Query: 601  NTQLVHLFSTFLQTPKVYRPFADVLVNFLVSSKLDVLKHPDSPGAKLVLHLFRFVFGAVA 660
            NTQLVHLFSTFLQTPKVYRPFA+VLVNFLVSSKLD+LKHPDSPGAKLVLHLFRFVFGAV+
Sbjct: 601  NTQLVHLFSTFLQTPKVYRPFAEVLVNFLVSSKLDLLKHPDSPGAKLVLHLFRFVFGAVS 660

Query: 661  KAPSDFERILQPHVTVIMEVCVRSATEVERPLGYMQLLRIMFRALAGCKFELLLRDLIPL 720
            KAPSDFERILQPHVTVIMEVCV+SATEVERPLGYMQLLRIMFRALAGCKFELLLRDLI L
Sbjct: 661  KAPSDFERILQPHVTVIMEVCVKSATEVERPLGYMQLLRIMFRALAGCKFELLLRDLISL 720

Query: 721  LQPCLNMLLTMFDGPTGEDMRDLLLELCLTLPARLSSLLPHLPRLMKPLVLCLKGSDDLV 780
            LQPCLNMLLTM DGPTGEDMRDLLLELCLTLPARLSSLLPHLPRLMKPLVLCLKGSD+LV
Sbjct: 721  LQPCLNMLLTMLDGPTGEDMRDLLLELCLTLPARLSSLLPHLPRLMKPLVLCLKGSDELV 780

Query: 781  SLGLRTLEFWVDSLNPDFLEPSMANVMSEVILALWSHLRPIPYPWGAKALQVLGKLGGRN 840
             LGLRTLEFWVDSLNPDFLEPSMA VMSEVILALWSHLRP+PY WGAKALQVLGKLGGRN
Sbjct: 781  GLGLRTLEFWVDSLNPDFLEPSMATVMSEVILALWSHLRPMPYSWGAKALQVLGKLGGRN 840

Query: 841  RRFLKEPLALECKENPEHGLRLILTFEPSTPFLVPLDRCINLAVSTVMNKTGGVDSFYRK 900
            RRFLKEPL LECKENPEHGLRLILTFEPSTPFLVPLDRCINLAVS VMNKTGGVDSFYRK
Sbjct: 841  RRFLKEPLGLECKENPEHGLRLILTFEPSTPFLVPLDRCINLAVSAVMNKTGGVDSFYRK 900

Query: 901  QALKFLRVCLSSQLNLPGNVADDGHTPRQLSTLLVSPVDSSLRRSETPEGKADLGVKTKT 960
            QALKFLRVCLSSQLNLPG VADDG+TPRQLSTLLVS VDSS RRSETPE KADLGVKTKT
Sbjct: 901  QALKFLRVCLSSQLNLPGIVADDGYTPRQLSTLLVSSVDSSWRRSETPEAKADLGVKTKT 960

Query: 961  QLMAEKSVFKILLMTIIAAGSEEDLHEPKDDFVLNVCRHFAILFHIDSSLNSSPVASASL 1020
            QLMAEKSVFK+LLMTIIAAGS+EDL+EPKDDFVLNVCRHFAILFHIDSSLN+ PVASASL
Sbjct: 961  QLMAEKSVFKVLLMTIIAAGSDEDLNEPKDDFVLNVCRHFAILFHIDSSLNNPPVASASL 1020

Query: 1021 GSTLLPPNVSANSRLRSSACCNLKELDPLIFLDALVEVLADENRVHAKAALNALNLFSEI 1080
            GSTLLP NV+ANSRL+SSACCNLKELDPLIFLDALV+VLADENR+HAKAALNALNLFSE+
Sbjct: 1021 GSTLLPSNVNANSRLKSSACCNLKELDPLIFLDALVDVLADENRLHAKAALNALNLFSEM 1080

Query: 1081 LLFLARAKQTDVMMTRGPSTPMIVSSPSKSPVYSPPPSVRIPVFEQLLPRLLHCCYGSTW 1140
            LLFL R KQTDVMMTRGP TPM VSSP  SPVYSPPPSVRIPVFEQLLPRLLHCCYG TW
Sbjct: 1081 LLFLGRGKQTDVMMTRGPGTPMSVSSP-MSPVYSPPPSVRIPVFEQLLPRLLHCCYGCTW 1140

Query: 1141 QAQMGGIMGLGALVGKVTVETLCLFQVRIVRGLVYVLKRLPIYASKEQEETSQVLNQVLR 1200
            QAQMGG+MGLGALVGKVT+ETLC FQV+IVRGLVYVLKRLPIYASKEQEETSQVLN VLR
Sbjct: 1141 QAQMGGVMGLGALVGKVTIETLCHFQVKIVRGLVYVLKRLPIYASKEQEETSQVLNHVLR 1200

Query: 1201 VVNNVDEANSEPRRQSFHGVVDILASELFNPNSSTIVRKNVQSCLALLASRTGSEVSELL 1260
            VVNNVDEANSEPRRQSF GVVD+LASELFNPNSSTIVRKNVQSCLALLASRTGSEVSELL
Sbjct: 1201 VVNNVDEANSEPRRQSFQGVVDVLASELFNPNSSTIVRKNVQSCLALLASRTGSEVSELL 1260

Query: 1261 EPLHQPLLQPLLLRPLRLKTIDQQVGTVTALNFCLALRPPLLKLTQELVNFLQEALQIAE 1320
            EPL+QPLLQPLLLRPLRLKTIDQQVGTVTALNFCLALRPPLLKLTQELVNFLQEALQIAE
Sbjct: 1261 EPLYQPLLQPLLLRPLRLKTIDQQVGTVTALNFCLALRPPLLKLTQELVNFLQEALQIAE 1320

Query: 1321 ADETVWVVKFMNPKIATSLNKLRTACIELLCTTMAWADFKTPNHSELRAKIISMFFKSLT 1380
            ADETVWVVKFMNPK+ATSLNKLRTACIELLCTTMAWADFKTPNHSELRAKIISMFFKSLT
Sbjct: 1321 ADETVWVVKFMNPKVATSLNKLRTACIELLCTTMAWADFKTPNHSELRAKIISMFFKSLT 1380

Query: 1381 CRTPEVVAVAKEGLRQVINQQRMPKDLLQGSLRPILVNLAHTKNLSMPLLQGLARLLELL 1440
            CRTPEVVAVAKEGLRQVINQQRMPKDLLQGSLRPILVNLAHTKNLSMPLLQGLARLLELL
Sbjct: 1381 CRTPEVVAVAKEGLRQVINQQRMPKDLLQGSLRPILVNLAHTKNLSMPLLQGLARLLELL 1440

Query: 1441 ASWFNVTLGGKLLEHLKKWLEPEKLAQSQKAWKAGEEPKIAAAIIELFHLLPMAASKFLD 1500
            ASWFNVTLGGKLLEHLKKWLEPEKLAQ QKAWKAGEEPKIAAAIIELFHLLPMAASKFLD
Sbjct: 1441 ASWFNVTLGGKLLEHLKKWLEPEKLAQIQKAWKAGEEPKIAAAIIELFHLLPMAASKFLD 1500

Query: 1501 ELVTLTIDLEGALPPGQVYSEVNSPYRVPLIKFLNRYAPLAVDYFLARLSEPKYFRRFMY 1560
            ELVTLTIDLEGALPPGQVYSEVNSPYRVPLIKFLNRYAPLAVDYFLARLSEPKYFRRFMY
Sbjct: 1501 ELVTLTIDLEGALPPGQVYSEVNSPYRVPLIKFLNRYAPLAVDYFLARLSEPKYFRRFMY 1560

Query: 1561 IIRSDAGQPLREELAKSPQKILASAFPEFAPKSEAALTPGSSTSPAPLSGDEGLVTP-DA 1620
            IIRSDAGQPLREELAKSPQKILASAFPEF PKSE ALTPGSST PAPLSGDEGLVTP D 
Sbjct: 1561 IIRSDAGQPLREELAKSPQKILASAFPEFVPKSEPALTPGSSTPPAPLSGDEGLVTPSDV 1620

Query: 1621 SDPPSAPSSVVSDAYFRGLALIKTLVKLMPGWLQNNRVVFDTLVLVWKSPARIARLHNEQ 1680
            SDPPSAPS VVSDAYF GL L+KTLVKLMPGWLQ+NRVVFDTLV VWKSPARIARLHNEQ
Sbjct: 1621 SDPPSAPSGVVSDAYFCGLQLVKTLVKLMPGWLQSNRVVFDTLVAVWKSPARIARLHNEQ 1680

Query: 1681 ELNLVQVKESKWLVKCFLNYLRHEKAEVNVLFDILSIFLFHTRIDYTFLKEFYIIEVAEG 1740
            ELNLVQVKESKWLVKCFLNYLRHEKAEVNVLFDILSIFLFHTRIDYTFLKEFYIIEVAEG
Sbjct: 1681 ELNLVQVKESKWLVKCFLNYLRHEKAEVNVLFDILSIFLFHTRIDYTFLKEFYIIEVAEG 1740

Query: 1741 YPPNMKKALLLHFLNLFQSKQLGHDHLVIVMQMLILPMLAHAFQNGQSWEVVDQAIIKTI 1800
            YPPNMKKALLLHFLNLFQSKQLGHDHLV+VMQMLILPMLAHAFQNGQSWEVVDQAIIKTI
Sbjct: 1741 YPPNMKKALLLHFLNLFQSKQLGHDHLVVVMQMLILPMLAHAFQNGQSWEVVDQAIIKTI 1800

Query: 1801 VDKLLDPPEEVSAEYDEPLRIELLQLATLLLKYLQSDLVHHRKELIKFGWNHLKREDSAS 1860
            VDKLLDPPEEV+AEYDEPLRIELLQLATLLLKYLQSDLVHHRKELIKFGWNHLKREDSAS
Sbjct: 1801 VDKLLDPPEEVTAEYDEPLRIELLQLATLLLKYLQSDLVHHRKELIKFGWNHLKREDSAS 1860

Query: 1861 KQWAFVNVCHFLEAYQAPEKIILQVFVALLRTCQPENKMLVKQALDILMPALPRRLPLGD 1920
            KQWAFVNVCHFLEAYQAPEKIILQVFVALLRTCQPENKMLVKQALDILMPALPRRLPLGD
Sbjct: 1861 KQWAFVNVCHFLEAYQAPEKIILQVFVALLRTCQPENKMLVKQALDILMPALPRRLPLGD 1920

Query: 1921 SRMPIWIRYTKKILVEEGHSIPNLIHIFQLIVRHSDLFYSCRAQFVPQMVNSLSRLGLPY 1980
            SRMPIWIRYTKKILVEEGHSIPNLIHIFQLIVRHSDLFYSCRAQFVPQMVNSLSRLGLPY
Sbjct: 1921 SRMPIWIRYTKKILVEEGHSIPNLIHIFQLIVRHSDLFYSCRAQFVPQMVNSLSRLGLPY 1980

Query: 1981 NTTAENRRLAIDLAGLVVGWERQRQNEMKLVTESDAPSHSNDGLT-CPPGADPKRMVDGS 2040
            NTTAENRRLAIDLAGLVVGWERQRQNEMK VTESDAPSH+NDGLT CPPGAD KR+VDGS
Sbjct: 1981 NTTAENRRLAIDLAGLVVGWERQRQNEMKPVTESDAPSHNNDGLTSCPPGADSKRLVDGS 2040

Query: 2041 TFPEDSTKRVKVEPGLQSLCVMSPGGASSMPNIETPGSTTQPDEEFKPNAAMEEMIINFL 2100
            TF EDSTKRVKVEPGLQSLCVMSPGGASSMPNIETPGSTTQPDEEFKPNAAMEEMIINFL
Sbjct: 2041 TFSEDSTKRVKVEPGLQSLCVMSPGGASSMPNIETPGSTTQPDEEFKPNAAMEEMIINFL 2100

Query: 2101 IRVALVIEPKDKEATAMYKQALELLSQALEVWPNANVKFNYLEKLLSSIQPSQSKDPSTA 2160
            IRVALVIEPKDKEATAMYKQALELLSQALEVWPNANVKFNYLEKLLSSIQPSQSKDPSTA
Sbjct: 2101 IRVALVIEPKDKEATAMYKQALELLSQALEVWPNANVKFNYLEKLLSSIQPSQSKDPSTA 2160

Query: 2161 LAQGLDVMNKVLEKQPHLFVRNNINQISQILEPCFKHKMLDAGKSLCSLLRMVFVAYPLE 2220
            LAQGLDVMNKVLEKQPHLFVRNNINQISQILEPCFKHKMLDAGKSLCSLLRMVFVAYPLE
Sbjct: 2161 LAQGLDVMNKVLEKQPHLFVRNNINQISQILEPCFKHKMLDAGKSLCSLLRMVFVAYPLE 2220

Query: 2221 GVTTPPDVKLLYQKVDELIKNHINNLTAPQTSSEDNTASSISFVLLVIKTLTEVQKNLID 2280
            GVTTPPDVKLLYQKVDELIKNHINNLTAPQTSSEDNTASSISFVLLVIKTLTEVQKNLID
Sbjct: 2221 GVTTPPDVKLLYQKVDELIKNHINNLTAPQTSSEDNTASSISFVLLVIKTLTEVQKNLID 2280

Query: 2281 PYNLGRILQRLARDMGSSAGSHLRQGQRMDPDSAVTSSRQSADVGTVISNLKSVLKLINE 2340
            PYNLGRILQRLARDMGSSAGSHLRQGQRMDPDSAVTSSRQSADVGTVI+NLKSVLKLINE
Sbjct: 2281 PYNLGRILQRLARDMGSSAGSHLRQGQRMDPDSAVTSSRQSADVGTVIANLKSVLKLINE 2340

Query: 2341 RVMLVPECKRSVTQIMNSLLSEKGTDASVLLCILDVIKGWVEDDFSKMGTSVSSSSFLAP 2400
            RVMLVPECKRSVTQIMNSLLSEKGTDASVLLCILDVIKGW+EDDFSKMGTSVSSSSFLAP
Sbjct: 2341 RVMLVPECKRSVTQIMNSLLSEKGTDASVLLCILDVIKGWIEDDFSKMGTSVSSSSFLAP 2400

Query: 2401 KEIVSFLQKLSQVDKQNFSSSAAEEWDGKYLQLLYEICADSNKYPLSLRQEVFQKVERQF 2460
            KEIVSFLQKLSQVDKQNF+ SAAEEWDGKYLQLLYEICADSNKYP+SLRQEVFQKVERQF
Sbjct: 2401 KEIVSFLQKLSQVDKQNFAPSAAEEWDGKYLQLLYEICADSNKYPVSLRQEVFQKVERQF 2460

Query: 2461 MLGLRARDPETRKKFFTLYHESLGKTLFIRLQYIIQIQDWEALSDVFWLKQGLDLLLAVL 2520
            MLGLRARDPE RKKFFTLYHESLGKTLFIRLQYIIQIQDWEALSDVFWLKQGLDLLLAVL
Sbjct: 2461 MLGLRARDPEIRKKFFTLYHESLGKTLFIRLQYIIQIQDWEALSDVFWLKQGLDLLLAVL 2520

Query: 2521 VEDKPITLAPNSARLPPLLVSGHVADSSAVQPQVNDAQEGLEDAPLTFDSLVHKHAQFLN 2580
            VEDKPITLAPNSARLPPLLVSGHVADSS V   V D QEG+EDAPLTFDSLV KHAQFLN
Sbjct: 2521 VEDKPITLAPNSARLPPLLVSGHVADSSVVPHPVIDGQEGIEDAPLTFDSLVLKHAQFLN 2580

Query: 2581 RTSKLQVADLIIPLRELAHTDANVAYHLWVLVFPIVWVTLHKEEQVALAKPMISLLSKDY 2640
            R SKLQVADLIIPLRELAH DANVAYHLWVLVFPIVWVTLHKEEQVALAKPMI LLSKDY
Sbjct: 2581 RMSKLQVADLIIPLRELAHNDANVAYHLWVLVFPIVWVTLHKEEQVALAKPMIGLLSKDY 2640

Query: 2641 HKKQQASRPNVVQALLEGLQLSHPQPRMPSELIKYIGRTYNAWHIALALLESHVMLFMNE 2700
            HKKQQA RPNVVQALLEGLQLSHPQPRMPSELIKYIG+TYNAWHIALALLESHVMLFMNE
Sbjct: 2641 HKKQQAQRPNVVQALLEGLQLSHPQPRMPSELIKYIGKTYNAWHIALALLESHVMLFMNE 2700

Query: 2701 TKCSESLAELYRLLNEEDMRCGLWKRKAISAETKAGLSLVQHGYWQRAQILFYQSMVKAT 2760
            TKC+ESLAELYRLLNEEDMRCGLWKRKA +AETKAGLSLVQHGYWQRAQ LFYQSMVKAT
Sbjct: 2701 TKCAESLAELYRLLNEEDMRCGLWKRKANTAETKAGLSLVQHGYWQRAQSLFYQSMVKAT 2760

Query: 2761 QGTYNNNVPKAEMCLWEEQWLYCASQLSQWEALVDFGKSIENYEILLDSLWKVPDWAYMK 2820
            QGTYNN VPKAEMCLWEEQWL CASQLSQWEAL DFGKSIENYEILLDSLWKVPDWAYMK
Sbjct: 2761 QGTYNNTVPKAEMCLWEEQWLSCASQLSQWEALADFGKSIENYEILLDSLWKVPDWAYMK 2820

Query: 2821 EHVIPKAQVEETPKLRLIQAYFSLHDRSTNGVADAENIVGKGVDLALEQWWQLPEMSVHA 2880
            EHVIPKAQVEETPKLRLIQAYFSLHDRS NGVADAENIVGKGVDLALEQWWQLPEMSVHA
Sbjct: 2821 EHVIPKAQVEETPKLRLIQAYFSLHDRSANGVADAENIVGKGVDLALEQWWQLPEMSVHA 2880

Query: 2881 RIPLLQQFQQLVEVQESSRILVDIANGNKHSGSSVVSVHTNLYADLKDILETWRLRIPNE 2940
            RIPLLQQFQQLVEVQESSRILVDIANGNKHSGSSVV VH+NLYADLKDILETWRLRIPNE
Sbjct: 2881 RIPLLQQFQQLVEVQESSRILVDIANGNKHSGSSVVGVHSNLYADLKDILETWRLRIPNE 2940

Query: 2941 WDSMTVWCDLLQWRNEMYNAVIDAFKDFGTTNSQLHHLGFRDKAWNVNKLAHVARKQGLH 3000
            WDSMTVWCDLLQWRNEMYNAVIDAFKDFG TNSQLHHLGFRDKAWNVNKLAHVARKQGL+
Sbjct: 2941 WDSMTVWCDLLQWRNEMYNAVIDAFKDFGNTNSQLHHLGFRDKAWNVNKLAHVARKQGLY 3000

Query: 3001 DVCVGILEKMYGHSTMEVQEAFVKIREQAKAYLEMKGELTSGLNLINSTNLDYFPVKHKA 3060
            DVCV IL+KMYGHSTMEVQEAFVKIREQAKAYLEMKGELTSGLNLINSTNL+YFPVKHKA
Sbjct: 3001 DVCVAILDKMYGHSTMEVQEAFVKIREQAKAYLEMKGELTSGLNLINSTNLEYFPVKHKA 3060

Query: 3061 EIFRLKGDFQLKLSDSEGANHSYSSAISLFKNLPKGWISWGNYCDMAYKESHEEIWLEYA 3120
            EI+RLKGDFQLKLSDSEGAN SYS+AI+LFKNLPKGWISWGNYCDMAYKESH+E WLEYA
Sbjct: 3061 EIYRLKGDFQLKLSDSEGANQSYSNAITLFKNLPKGWISWGNYCDMAYKESHDETWLEYA 3120

Query: 3121 VSCFLQGIKFGISNSRNHLARVLYLLSFDTPNEPVGRAFDKYLDQIPHWVWLSWIPQLLL 3180
            VSCFLQGIKFGISNSRNHLARVLYLLSFDTPNEPVGRAFDK+LDQIPHWVWLSWIPQLLL
Sbjct: 3121 VSCFLQGIKFGISNSRNHLARVLYLLSFDTPNEPVGRAFDKFLDQIPHWVWLSWIPQLLL 3180

Query: 3181 SLQRTEAPHCKLVLLKIANVYPQALYYWLRTYLLERRDVANKSELGRMAMAQQRMQQNTS 3240
            SLQRTEAPHCKLVLLKIANVYPQALYYWLRTYLLERRDVANKSELGRMAMAQQRMQQN +
Sbjct: 3181 SLQRTEAPHCKLVLLKIANVYPQALYYWLRTYLLERRDVANKSELGRMAMAQQRMQQNAA 3240

Query: 3241 SAGSLGLTDGSSRVAHGG-SSTSTDNQVHQGTQSGSGIGSHDGGNSHSQEPERSTGVESS 3300
            SAGSLGL DG SR  HGG SST  DNQVHQGTQSGS IGSHDGGN+HSQEPER+TG +SS
Sbjct: 3241 SAGSLGLADGGSRAGHGGSSSTPADNQVHQGTQSGSAIGSHDGGNAHSQEPERTTGADSS 3300

Query: 3301 THAGNDQSLPQTSSNVNEGTQNALRRSAALGLVGSAASAFDAAKDIMEALRSKHTNLASE 3360
            THAGNDQSLPQ SSNVNEGTQNALRRSAALGLVGSAASAFDAAKDIMEALRSKHTNLASE
Sbjct: 3301 THAGNDQSLPQPSSNVNEGTQNALRRSAALGLVGSAASAFDAAKDIMEALRSKHTNLASE 3360

Query: 3361 LEILLTEIGSRFVTLPEERLLAVVNALLHRCYKYPTATTAEVPQSLKKELSGVCKACFSA 3420
            LEILLTEIGSRFVTLPEERLLAVVNALLHRCYKYPTATTAEVPQSLKKELSGVCKACFSA
Sbjct: 3361 LEILLTEIGSRFVTLPEERLLAVVNALLHRCYKYPTATTAEVPQSLKKELSGVCKACFSA 3420

Query: 3421 DAVNKHVDFVREYKQDFERDLDPESTSTFPATLSELTERLKHWKNVLQGNVEDRFPAVLR 3480
            DAVNKHVDFVREYKQDFERDLDPESTSTFPATLSELTERLKHWKNVLQGNVEDRFPAVL+
Sbjct: 3421 DAVNKHVDFVREYKQDFERDLDPESTSTFPATLSELTERLKHWKNVLQGNVEDRFPAVLK 3480

Query: 3481 LEDESRVLRDFHVVDVEVPGQYFTDQEIAPDHTVKLDRVGADIPIVRRHGSSFRRLTLIG 3540
            LE+ESRVLRDFHVVDVEVPGQYFTDQEIAPDHTVKLDRVGADIPIVRRHGSSFRRLTLIG
Sbjct: 3481 LEEESRVLRDFHVVDVEVPGQYFTDQEIAPDHTVKLDRVGADIPIVRRHGSSFRRLTLIG 3540

Query: 3541 SDGSQRHFIVQTSLTPNARSDERILQLFRVMNQMFDKHKESRRRHLCIHTPIIIPVWSQV 3600
            SDGSQRHFIVQTSLTPNARSDERILQLFRVMNQMFDKHKESRRRHLCIHTPIIIPVWSQV
Sbjct: 3541 SDGSQRHFIVQTSLTPNARSDERILQLFRVMNQMFDKHKESRRRHLCIHTPIIIPVWSQV 3600

Query: 3601 RMVEDDLMYSTFLEVYENHCARNDQEADLPITYFKEQLNQAISGQIAPEAVLDLRLQAYG 3660
            RMVEDDLMYSTFLEVYENHCARNDQEADLPITYFKEQLNQAISGQI PEAV+DLRLQA+G
Sbjct: 3601 RMVEDDLMYSTFLEVYENHCARNDQEADLPITYFKEQLNQAISGQILPEAVVDLRLQAFG 3660

Query: 3661 DITRNLVNEGIFSQYMYKTLLSGNHMWAFKKQFAIQLALSSFMSYMLQIGGRSPNKIYFA 3720
            DITRNLVNEGIFSQYMYKTLLSGNHMWAFKKQFAIQLALSSFMSYMLQIGGRSPNKIYFA
Sbjct: 3661 DITRNLVNEGIFSQYMYKTLLSGNHMWAFKKQFAIQLALSSFMSYMLQIGGRSPNKIYFA 3720

Query: 3721 KNTGKIFQTDFHPAYDTNGMIEFNEPVPFRLTRNMQAFFSHFGVEGLIVSAMCSAAQAVV 3780
            KNTGKIFQTDFHPAYD NGMIEFNEPVPFRLTRNMQAFFS+FGVEGLIVSAMCSAAQAVV
Sbjct: 3721 KNTGKIFQTDFHPAYDANGMIEFNEPVPFRLTRNMQAFFSNFGVEGLIVSAMCSAAQAVV 3780

Query: 3781 SPKQNQHLWHQLAMFFRDELLSWSWRRPLGMPLASIAGGGMSPADFKQKVTINVDHVIGR 3840
            SPKQNQHLWHQLAMFFRDELLSWSWRRPLGMPLASIA GGM+PADFKQKVT NVD VIGR
Sbjct: 3781 SPKQNQHLWHQLAMFFRDELLSWSWRRPLGMPLASIAAGGMNPADFKQKVTTNVDLVIGR 3840

Query: 3841 INGIAPQYFSEEEENAMDPPQSVQRGVSDLVDAALMPRHLCMMDPTWHPWF 3889
            INGIAPQYFSEEEENAMDPPQSVQRGVS+LVDAAL P++LCMMDPTWHPWF
Sbjct: 3841 INGIAPQYFSEEEENAMDPPQSVQRGVSELVDAALQPKNLCMMDPTWHPWF 3890

BLAST of MS010599 vs. NCBI nr
Match: XP_004134864.1 (transformation/transcription domain-associated protein [Cucumis sativus] >KGN48912.1 hypothetical protein Csa_003515 [Cucumis sativus])

HSP 1 Score: 7342.3 bits (19049), Expect = 0.0e+00
Identity = 3715/3890 (95.50%), Postives = 3795/3890 (97.56%), Query Frame = 0

Query: 1    MSPIQNFELHSRQLVEPELSIQTRLQMATEVRDSLEIAHTPEYLNFLKCYFRAFSIILVQ 60
            MSPIQNFE HSR LVEPEL+IQTRLQMATEVRDSLEIAHTPEYLNFLKCYFRAFS+IL++
Sbjct: 1    MSPIQNFEQHSRHLVEPELNIQTRLQMATEVRDSLEIAHTPEYLNFLKCYFRAFSVILLK 60

Query: 61   ITKPQYTDNHEHKLRNIVVEILNRLPHSEVLRPFVQDLLKVAMQVLTTDNEENGLICIRI 120
            ITKPQ+TD+HEHKLRNIVVEILNRLPHSEVLRPFVQDLLKVAMQVLTTDNEENGLICIRI
Sbjct: 61   ITKPQFTDSHEHKLRNIVVEILNRLPHSEVLRPFVQDLLKVAMQVLTTDNEENGLICIRI 120

Query: 121  IFDLLRNFRPTLENEVQPFLDFVCKIYQNFKLTVSHFFENSAAGGEDIKPMDVSTSTDQT 180
            IFDLLRNFRPTLENEVQPFLDFVCKIYQNFKLTVSHFFEN +A  ED+KPM+VSTS+DQ+
Sbjct: 121  IFDLLRNFRPTLENEVQPFLDFVCKIYQNFKLTVSHFFENPSASVEDVKPMEVSTSSDQS 180

Query: 181  ITTGYTGTVQLNPSTRSFKIVTESPLVVMFLFQLYSRLVQTNIPVLLPLMVSAISVPGPE 240
            + +G TGTVQLNPSTRSFKIVTESPLVVMFLFQLYSRLV TNIP LLPLMVSAISVPGPE
Sbjct: 181  MNSGCTGTVQLNPSTRSFKIVTESPLVVMFLFQLYSRLVHTNIPHLLPLMVSAISVPGPE 240

Query: 241  KVPPFLKTHFIELKGAQVKTVSFLTYLLRSSADYIRPHEESICKSIVNLLVTCSDSVSIR 300
            KVPP LKTHFIELKGAQVKTVSFLTYLLRSSADYIRPHEESICKSIVNLLVTCSDSVSIR
Sbjct: 241  KVPPSLKTHFIELKGAQVKTVSFLTYLLRSSADYIRPHEESICKSIVNLLVTCSDSVSIR 300

Query: 301  KELLVALKHVLGTEYKRGLFPLIDTLLEEKVLVGTGRACYETLRPLAYSLLAEIVHHVRG 360
            KELLVALKHVLGTEYKRGLFPLIDTLLEEKV+VGTGRACYETLRPLAYSLLAEIVHHVR 
Sbjct: 301  KELLVALKHVLGTEYKRGLFPLIDTLLEEKVVVGTGRACYETLRPLAYSLLAEIVHHVRV 360

Query: 361  DLTLSQLSRIIYLFSSNMHDASLSLSIHTTCARLMLNLVEPIFEKGVDQTSMDEARILLG 420
            DL+L QLSRIIYLFSSNMHDASLSLSIHTTCARLMLNLVEPIFEKGVDQTSMDE+RILLG
Sbjct: 361  DLSLPQLSRIIYLFSSNMHDASLSLSIHTTCARLMLNLVEPIFEKGVDQTSMDESRILLG 420

Query: 421  RILDAFVGKFSTFKHTIPQLLEEGEEGKDRANMRSKLELPVQAVLNLQVPVEHSKEVNDC 480
            RILD+FVGKFSTFKHTIPQLLEEGEEGKDRAN+RSKLELPVQAVLNLQVPVEHSKEVNDC
Sbjct: 421  RILDSFVGKFSTFKHTIPQLLEEGEEGKDRANLRSKLELPVQAVLNLQVPVEHSKEVNDC 480

Query: 481  KHLIKTLILGMKTIIWSITHAHLPRPQASPSPNGTHPQMLVSPSSNLATPQAFKGMREDE 540
            KHLIKTLI+GMKTIIWSITHAHLPR Q SPSPNGTHPQMLV+PSSNLATPQA KGMREDE
Sbjct: 481  KHLIKTLIMGMKTIIWSITHAHLPRSQVSPSPNGTHPQMLVNPSSNLATPQALKGMREDE 540

Query: 541  VCKASGVLKSGVHCLTLFKEKDEEVEMLHLFSQILTVMEPRDLMDMFSLCMPELFDCMIT 600
            VCKASGVLKSGVHCLTLFKEKDEEVEMLHLFSQILT+MEPRDLMDMFSLCMPELFDCMI+
Sbjct: 541  VCKASGVLKSGVHCLTLFKEKDEEVEMLHLFSQILTIMEPRDLMDMFSLCMPELFDCMIS 600

Query: 601  NTQLVHLFSTFLQTPKVYRPFADVLVNFLVSSKLDVLKHPDSPGAKLVLHLFRFVFGAVA 660
            NTQLVHLFSTFLQTPKVYRPFA+VLVNFLVSSKLD+LKHPDSPGAKLVLHLFRFVFGAV+
Sbjct: 601  NTQLVHLFSTFLQTPKVYRPFAEVLVNFLVSSKLDLLKHPDSPGAKLVLHLFRFVFGAVS 660

Query: 661  KAPSDFERILQPHVTVIMEVCVRSATEVERPLGYMQLLRIMFRALAGCKFELLLRDLIPL 720
            KAPSDFERILQPHVTVIMEVCV+SATEVERPLGYMQLLRIMFRALAGCKFELLLRDLI L
Sbjct: 661  KAPSDFERILQPHVTVIMEVCVKSATEVERPLGYMQLLRIMFRALAGCKFELLLRDLISL 720

Query: 721  LQPCLNMLLTMFDGPTGEDMRDLLLELCLTLPARLSSLLPHLPRLMKPLVLCLKGSDDLV 780
            LQPCLNMLLTM DGPTGEDMRDLLLELCLTLPARLSSLLPHLPRLMKPLVLCLKGSD+LV
Sbjct: 721  LQPCLNMLLTMLDGPTGEDMRDLLLELCLTLPARLSSLLPHLPRLMKPLVLCLKGSDELV 780

Query: 781  SLGLRTLEFWVDSLNPDFLEPSMANVMSEVILALWSHLRPIPYPWGAKALQVLGKLGGRN 840
             LGLRTLEFWVDSLNPDFLEPSMA VMSEVILALWSHLRP+PY WGAKALQVLGKLGGRN
Sbjct: 781  GLGLRTLEFWVDSLNPDFLEPSMATVMSEVILALWSHLRPMPYSWGAKALQVLGKLGGRN 840

Query: 841  RRFLKEPLALECKENPEHGLRLILTFEPSTPFLVPLDRCINLAVSTVMNKTGGVDSFYRK 900
            RRFLKEPLALECKENPEHGLRLILTFEPSTPFLVPLDRCINLAVS VMNKTGGVDSFYRK
Sbjct: 841  RRFLKEPLALECKENPEHGLRLILTFEPSTPFLVPLDRCINLAVSAVMNKTGGVDSFYRK 900

Query: 901  QALKFLRVCLSSQLNLPGNVADDGHTPRQLSTLLVSPVDSSLRRSETPEGKADLGVKTKT 960
            QALKFLRVCLSSQLNLPG VADDG+TPRQLSTLLVS VDSS RRSETPE KADLGVKTKT
Sbjct: 901  QALKFLRVCLSSQLNLPGIVADDGYTPRQLSTLLVSSVDSSWRRSETPEAKADLGVKTKT 960

Query: 961  QLMAEKSVFKILLMTIIAAGSEEDLHEPKDDFVLNVCRHFAILFHIDSSLNSSPVASASL 1020
            QLMAEKSVFK+LLMTIIAAGSEEDL+EPKDDFVLNVCRHFAILFHIDSSLN+ PVASAS 
Sbjct: 961  QLMAEKSVFKLLLMTIIAAGSEEDLNEPKDDFVLNVCRHFAILFHIDSSLNNPPVASASH 1020

Query: 1021 GSTLLPPNVSANSRLRSSACCNLKELDPLIFLDALVEVLADENRVHAKAALNALNLFSEI 1080
            GSTLLP NV+ANSRL+SSACCNLKELDPLIFLDALVEVLADENR+HAKAALNALNLFSE+
Sbjct: 1021 GSTLLPSNVNANSRLKSSACCNLKELDPLIFLDALVEVLADENRIHAKAALNALNLFSEM 1080

Query: 1081 LLFLARAKQTDVMMTRGPSTPMIVSSPSKSPVYSPPPSVRIPVFEQLLPRLLHCCYGSTW 1140
            LLFL R KQTDVMMTRGP TPM VSSP  SPVYSPPPSVRIPVFEQLLPRLLHCCYG +W
Sbjct: 1081 LLFLGRGKQTDVMMTRGPGTPMSVSSP-MSPVYSPPPSVRIPVFEQLLPRLLHCCYGCSW 1140

Query: 1141 QAQMGGIMGLGALVGKVTVETLCLFQVRIVRGLVYVLKRLPIYASKEQEETSQVLNQVLR 1200
            QAQMGG++GLGALVGKVTVETLC FQV+IVRGLVYVLKRLPIYASKEQEETSQVLN VLR
Sbjct: 1141 QAQMGGVIGLGALVGKVTVETLCHFQVKIVRGLVYVLKRLPIYASKEQEETSQVLNHVLR 1200

Query: 1201 VVNNVDEANSEPRRQSFHGVVDILASELFNPNSSTIVRKNVQSCLALLASRTGSEVSELL 1260
            VVNNVDEANSEPRRQSF GVVD+LASELFNPNSSTIVRKNVQSCLALLASRTGSEVSELL
Sbjct: 1201 VVNNVDEANSEPRRQSFQGVVDVLASELFNPNSSTIVRKNVQSCLALLASRTGSEVSELL 1260

Query: 1261 EPLHQPLLQPLLLRPLRLKTIDQQVGTVTALNFCLALRPPLLKLTQELVNFLQEALQIAE 1320
            EPL+QPLLQPLLLRPLRLKTIDQQVGTVTALNFCLALRPPLLKLTQELVNFLQEALQIAE
Sbjct: 1261 EPLYQPLLQPLLLRPLRLKTIDQQVGTVTALNFCLALRPPLLKLTQELVNFLQEALQIAE 1320

Query: 1321 ADETVWVVKFMNPKIATSLNKLRTACIELLCTTMAWADFKTPNHSELRAKIISMFFKSLT 1380
            ADETVWVVKFMNPK+ATSLNKLRTACIELLCTTMAWADFKTPNHSELRAKIISMFFKSLT
Sbjct: 1321 ADETVWVVKFMNPKVATSLNKLRTACIELLCTTMAWADFKTPNHSELRAKIISMFFKSLT 1380

Query: 1381 CRTPEVVAVAKEGLRQVINQQRMPKDLLQGSLRPILVNLAHTKNLSMPLLQGLARLLELL 1440
            CRTPEVVAVAKEGLRQVINQQRMPKDLLQGSLRPILVNLAHTKNLSMPLLQGLARLLELL
Sbjct: 1381 CRTPEVVAVAKEGLRQVINQQRMPKDLLQGSLRPILVNLAHTKNLSMPLLQGLARLLELL 1440

Query: 1441 ASWFNVTLGGKLLEHLKKWLEPEKLAQSQKAWKAGEEPKIAAAIIELFHLLPMAASKFLD 1500
            ASWFNVTLGGKLLEHLKKWLEPEKLAQ QKAWKAGEEPKIAAAIIELFHLLPMAASKFLD
Sbjct: 1441 ASWFNVTLGGKLLEHLKKWLEPEKLAQIQKAWKAGEEPKIAAAIIELFHLLPMAASKFLD 1500

Query: 1501 ELVTLTIDLEGALPPGQVYSEVNSPYRVPLIKFLNRYAPLAVDYFLARLSEPKYFRRFMY 1560
            ELVTLTIDLEGALPPGQVYSEVNSPYRVPLIKFLNRYAPLAVDYFLARLSEPKYFRRFMY
Sbjct: 1501 ELVTLTIDLEGALPPGQVYSEVNSPYRVPLIKFLNRYAPLAVDYFLARLSEPKYFRRFMY 1560

Query: 1561 IIRSDAGQPLREELAKSPQKILASAFPEFAPKSEAALTPGSSTSPAPLSGDEGLVTP-DA 1620
            IIRSDAGQPLREELAKSPQKILASAFPEF PKSE ALTPGSST PAPLSGDEGLVTP D 
Sbjct: 1561 IIRSDAGQPLREELAKSPQKILASAFPEFVPKSEPALTPGSSTPPAPLSGDEGLVTPSDV 1620

Query: 1621 SDPPSAPSSVVSDAYFRGLALIKTLVKLMPGWLQNNRVVFDTLVLVWKSPARIARLHNEQ 1680
            SDPPSA SSVV DAYF GLAL+KTLVKLMPGWLQ+NRVVFDTLV VWKSPARIARLHNEQ
Sbjct: 1621 SDPPSASSSVVPDAYFCGLALVKTLVKLMPGWLQSNRVVFDTLVAVWKSPARIARLHNEQ 1680

Query: 1681 ELNLVQVKESKWLVKCFLNYLRHEKAEVNVLFDILSIFLFHTRIDYTFLKEFYIIEVAEG 1740
            ELNLVQVKESKWLVKCFLNYLRHEKAEVNVLFDILSIFLFHTRIDYTFLKEFYIIEVAEG
Sbjct: 1681 ELNLVQVKESKWLVKCFLNYLRHEKAEVNVLFDILSIFLFHTRIDYTFLKEFYIIEVAEG 1740

Query: 1741 YPPNMKKALLLHFLNLFQSKQLGHDHLVIVMQMLILPMLAHAFQNGQSWEVVDQAIIKTI 1800
            YPPNMKKALLLHFLNLFQSKQLGHDHLV+VMQMLILPMLAHAFQNGQSWEVVDQAIIKTI
Sbjct: 1741 YPPNMKKALLLHFLNLFQSKQLGHDHLVVVMQMLILPMLAHAFQNGQSWEVVDQAIIKTI 1800

Query: 1801 VDKLLDPPEEVSAEYDEPLRIELLQLATLLLKYLQSDLVHHRKELIKFGWNHLKREDSAS 1860
            VDKLLDPPEEV+AEYDEPLRIELLQLATLLLKYLQSDLVHHRKELIKFGWNHLKREDSAS
Sbjct: 1801 VDKLLDPPEEVTAEYDEPLRIELLQLATLLLKYLQSDLVHHRKELIKFGWNHLKREDSAS 1860

Query: 1861 KQWAFVNVCHFLEAYQAPEKIILQVFVALLRTCQPENKMLVKQALDILMPALPRRLPLGD 1920
            KQWAFVNVCHFLEAYQAPEKIILQVFVALLRTCQPENKMLVKQALDILMPALPRRLPLGD
Sbjct: 1861 KQWAFVNVCHFLEAYQAPEKIILQVFVALLRTCQPENKMLVKQALDILMPALPRRLPLGD 1920

Query: 1921 SRMPIWIRYTKKILVEEGHSIPNLIHIFQLIVRHSDLFYSCRAQFVPQMVNSLSRLGLPY 1980
            SRMPIWIRYTKKILVEEGHSIPNLIHIFQLIVRHSDLFYSCRAQFVPQMVNSLSRLGLPY
Sbjct: 1921 SRMPIWIRYTKKILVEEGHSIPNLIHIFQLIVRHSDLFYSCRAQFVPQMVNSLSRLGLPY 1980

Query: 1981 NTTAENRRLAIDLAGLVVGWERQRQNEMKLVTESDAPSHSNDGLT-CPPGADPKRMVDGS 2040
            NTTAENRRLAIDLAGLVVGWERQRQNEMK VTESDAPSH+NDGLT CPPGAD KR+VDGS
Sbjct: 1981 NTTAENRRLAIDLAGLVVGWERQRQNEMKPVTESDAPSHNNDGLTSCPPGADSKRLVDGS 2040

Query: 2041 TFPEDSTKRVKVEPGLQSLCVMSPGGASSMPNIETPGSTTQPDEEFKPNAAMEEMIINFL 2100
            TF EDSTKRVKVEPGLQSLCVMSPGGASSMPNIETPGSTTQPDEEFKPNAAMEEMIINFL
Sbjct: 2041 TFSEDSTKRVKVEPGLQSLCVMSPGGASSMPNIETPGSTTQPDEEFKPNAAMEEMIINFL 2100

Query: 2101 IRVALVIEPKDKEATAMYKQALELLSQALEVWPNANVKFNYLEKLLSSIQPSQSKDPSTA 2160
            IRVALVIEPKDKEATAMYKQALELLSQALEVWPNANVKFNYLEKLLSSIQPSQSKDPSTA
Sbjct: 2101 IRVALVIEPKDKEATAMYKQALELLSQALEVWPNANVKFNYLEKLLSSIQPSQSKDPSTA 2160

Query: 2161 LAQGLDVMNKVLEKQPHLFVRNNINQISQILEPCFKHKMLDAGKSLCSLLRMVFVAYPLE 2220
            LAQGLDVMNKVLEKQPHLFVRNNINQISQILEPCFKHKMLDAGKSLCSLLRMVFVAYPLE
Sbjct: 2161 LAQGLDVMNKVLEKQPHLFVRNNINQISQILEPCFKHKMLDAGKSLCSLLRMVFVAYPLE 2220

Query: 2221 GVTTPPDVKLLYQKVDELIKNHINNLTAPQTSSEDNTASSISFVLLVIKTLTEVQKNLID 2280
            GVTTPPDVKLLYQKVDELIKNHINNLTAPQTSSEDNTASSISFVLLVIKTLTEVQKNLID
Sbjct: 2221 GVTTPPDVKLLYQKVDELIKNHINNLTAPQTSSEDNTASSISFVLLVIKTLTEVQKNLID 2280

Query: 2281 PYNLGRILQRLARDMGSSAGSHLRQGQRMDPDSAVTSSRQSADVGTVISNLKSVLKLINE 2340
            PYNLGRILQRLARDMGSSAGSHLRQGQRMDPDSAVTSSRQSADVGTVISNLKSVLKLINE
Sbjct: 2281 PYNLGRILQRLARDMGSSAGSHLRQGQRMDPDSAVTSSRQSADVGTVISNLKSVLKLINE 2340

Query: 2341 RVMLVPECKRSVTQIMNSLLSEKGTDASVLLCILDVIKGWVEDDFSKMGTSVSSSSFLAP 2400
            RVMLVPECKRSVTQIMNSLLSEKGTDASVLLCILDVIKGW+EDDFSKMGTSVSSSSFLAP
Sbjct: 2341 RVMLVPECKRSVTQIMNSLLSEKGTDASVLLCILDVIKGWIEDDFSKMGTSVSSSSFLAP 2400

Query: 2401 KEIVSFLQKLSQVDKQNFSSSAAEEWDGKYLQLLYEICADSNKYPLSLRQEVFQKVERQF 2460
            KEIVSFLQKLSQVDKQNFSSSAAEEWD KYLQLLYEICADSNKYP+SLRQEVFQKVERQF
Sbjct: 2401 KEIVSFLQKLSQVDKQNFSSSAAEEWDEKYLQLLYEICADSNKYPVSLRQEVFQKVERQF 2460

Query: 2461 MLGLRARDPETRKKFFTLYHESLGKTLFIRLQYIIQIQDWEALSDVFWLKQGLDLLLAVL 2520
            MLGLRARDPE RKKFFTLYHESLGKTLFIRLQYIIQIQDWEALSDVFWLKQGLDLLLAVL
Sbjct: 2461 MLGLRARDPEVRKKFFTLYHESLGKTLFIRLQYIIQIQDWEALSDVFWLKQGLDLLLAVL 2520

Query: 2521 VEDKPITLAPNSARLPPLLVSGHVADSSAVQPQVNDAQEGLEDAPLTFDSLVHKHAQFLN 2580
            VEDKPITLAPNSARLPPLLVSGHV DSS V   V D QEG+EDAPLTFDSLV KHAQFLN
Sbjct: 2521 VEDKPITLAPNSARLPPLLVSGHVGDSSVVPHPVIDGQEGIEDAPLTFDSLVLKHAQFLN 2580

Query: 2581 RTSKLQVADLIIPLRELAHTDANVAYHLWVLVFPIVWVTLHKEEQVALAKPMISLLSKDY 2640
            R SKLQVADLIIPLRELAH DANVAYHLWVLVFPIVWVTLHKEEQVALAKPMI LLSKDY
Sbjct: 2581 RMSKLQVADLIIPLRELAHNDANVAYHLWVLVFPIVWVTLHKEEQVALAKPMIGLLSKDY 2640

Query: 2641 HKKQQASRPNVVQALLEGLQLSHPQPRMPSELIKYIGRTYNAWHIALALLESHVMLFMNE 2700
            HKKQQA RPNVVQALLEGLQLSHPQPRMPSELIKYIG+TYNAWHIALALLESHVMLFMNE
Sbjct: 2641 HKKQQAHRPNVVQALLEGLQLSHPQPRMPSELIKYIGKTYNAWHIALALLESHVMLFMNE 2700

Query: 2701 TKCSESLAELYRLLNEEDMRCGLWKRKAISAETKAGLSLVQHGYWQRAQILFYQSMVKAT 2760
            TKC+ESLAELYRLLNEEDMRCGLWKRKA +AETKAGLSLVQHGYWQRAQ LFYQSMVKAT
Sbjct: 2701 TKCAESLAELYRLLNEEDMRCGLWKRKANTAETKAGLSLVQHGYWQRAQSLFYQSMVKAT 2760

Query: 2761 QGTYNNNVPKAEMCLWEEQWLYCASQLSQWEALVDFGKSIENYEILLDSLWKVPDWAYMK 2820
            QGTYNN VPKAEMCLWEEQWL CASQLSQWEAL DFGKSIENYEILLDSLWKVPDWAYMK
Sbjct: 2761 QGTYNNTVPKAEMCLWEEQWLCCASQLSQWEALADFGKSIENYEILLDSLWKVPDWAYMK 2820

Query: 2821 EHVIPKAQVEETPKLRLIQAYFSLHDRSTNGVADAENIVGKGVDLALEQWWQLPEMSVHA 2880
            EHVIPKAQVEETPKLRLIQAYFSLHD+  NGVADAENIVGKGVDLALEQWWQLPEMSVHA
Sbjct: 2821 EHVIPKAQVEETPKLRLIQAYFSLHDKGANGVADAENIVGKGVDLALEQWWQLPEMSVHA 2880

Query: 2881 RIPLLQQFQQLVEVQESSRILVDIANGNKHSGSSVVSVHTNLYADLKDILETWRLRIPNE 2940
            RIPLLQQFQQLVEVQESSRILVDIANGNKHSGSSVV VH+NLYADLKDILETWRLRIPNE
Sbjct: 2881 RIPLLQQFQQLVEVQESSRILVDIANGNKHSGSSVVGVHSNLYADLKDILETWRLRIPNE 2940

Query: 2941 WDSMTVWCDLLQWRNEMYNAVIDAFKDFGTTNSQLHHLGFRDKAWNVNKLAHVARKQGLH 3000
            WD MTVWCDLLQWRNEMYNAVIDAFKDFG TNSQLHHLGFRDKAWNVNKLAHVARKQGL+
Sbjct: 2941 WDGMTVWCDLLQWRNEMYNAVIDAFKDFGNTNSQLHHLGFRDKAWNVNKLAHVARKQGLY 3000

Query: 3001 DVCVGILEKMYGHSTMEVQEAFVKIREQAKAYLEMKGELTSGLNLINSTNLDYFPVKHKA 3060
            DVCV IL+KMYGHSTMEVQEAFVKIREQAKAYLEMKGELTSGLNLINSTNL+YFPVKHKA
Sbjct: 3001 DVCVAILDKMYGHSTMEVQEAFVKIREQAKAYLEMKGELTSGLNLINSTNLEYFPVKHKA 3060

Query: 3061 EIFRLKGDFQLKLSDSEGANHSYSSAISLFKNLPKGWISWGNYCDMAYKESHEEIWLEYA 3120
            EI+RLKGDFQLKLSDSEGAN SYS+AI+LFKNLPKGWISWGNYCDMAYKESH+E WLEYA
Sbjct: 3061 EIYRLKGDFQLKLSDSEGANQSYSNAITLFKNLPKGWISWGNYCDMAYKESHDEAWLEYA 3120

Query: 3121 VSCFLQGIKFGISNSRNHLARVLYLLSFDTPNEPVGRAFDKYLDQIPHWVWLSWIPQLLL 3180
            VSCFLQGIKFGISNSRNHLARVLYLLSFD PNEPVGRAFDK+LDQIPHWVWLSWIPQLLL
Sbjct: 3121 VSCFLQGIKFGISNSRNHLARVLYLLSFDAPNEPVGRAFDKFLDQIPHWVWLSWIPQLLL 3180

Query: 3181 SLQRTEAPHCKLVLLKIANVYPQALYYWLRTYLLERRDVANKSELGRMAMAQQRMQQNTS 3240
            SLQRTEAPHCKLVLLKIANVYPQALYYWLRTYLLERRDVANKSELGRMAMAQQRMQQN +
Sbjct: 3181 SLQRTEAPHCKLVLLKIANVYPQALYYWLRTYLLERRDVANKSELGRMAMAQQRMQQNAA 3240

Query: 3241 SAGSLGLTDGSSRVAHGGSSTSTDNQVHQGTQSGSGIGSHDGGNSHSQEPERSTGVESST 3300
            SAGSLGL DG +R  HGGSST  DNQVHQGTQSGSGIGSHDGGN+HSQEPER+TG +SST
Sbjct: 3241 SAGSLGLADGGARAGHGGSSTPADNQVHQGTQSGSGIGSHDGGNAHSQEPERTTGADSST 3300

Query: 3301 HAGNDQSLPQTSSNVNEGTQNALRRSAALGLVGSAASAFDAAKDIMEALRSKHTNLASEL 3360
            HAGNDQSLPQ SSNVNEGTQNALRRSAALGLVGSAASAFDAAKDIMEALRSKHTNLASEL
Sbjct: 3301 HAGNDQSLPQPSSNVNEGTQNALRRSAALGLVGSAASAFDAAKDIMEALRSKHTNLASEL 3360

Query: 3361 EILLTEIGSRFVTLPEERLLAVVNALLHRCYKYPTATTAEVPQSLKKELSGVCKACFSAD 3420
            EILLTEIGSRFVTLPEERLLAVVNALLHRCYKYPTATTAEVPQSLKKELSGVCKACFSAD
Sbjct: 3361 EILLTEIGSRFVTLPEERLLAVVNALLHRCYKYPTATTAEVPQSLKKELSGVCKACFSAD 3420

Query: 3421 AVNKHVDFVREYKQDFERDLDPESTSTFPATLSELTERLKHWKNVLQGNVEDRFPAVLRL 3480
            AVNKHVDFVREYKQDFERDLDPESTSTFPATLSELTERLKHWKNVLQGNVEDRFPAVL+L
Sbjct: 3421 AVNKHVDFVREYKQDFERDLDPESTSTFPATLSELTERLKHWKNVLQGNVEDRFPAVLKL 3480

Query: 3481 EDESRVLRDFHVVDVEVPGQYFTDQEIAPDHTVKLDRVGADIPIVRRHGSSFRRLTLIGS 3540
            E+ESRVLRDFHVVDVEVPGQYFTDQEIAPDHTVKLDRVGADIPIVRRHGSSFRRLTLIGS
Sbjct: 3481 EEESRVLRDFHVVDVEVPGQYFTDQEIAPDHTVKLDRVGADIPIVRRHGSSFRRLTLIGS 3540

Query: 3541 DGSQRHFIVQTSLTPNARSDERILQLFRVMNQMFDKHKESRRRHLCIHTPIIIPVWSQVR 3600
            DGSQRHFIVQTSLTPNARSDERILQLFRVMNQMFDKHKESRRRHLCIHTPIIIPVWSQVR
Sbjct: 3541 DGSQRHFIVQTSLTPNARSDERILQLFRVMNQMFDKHKESRRRHLCIHTPIIIPVWSQVR 3600

Query: 3601 MVEDDLMYSTFLEVYENHCARNDQEADLPITYFKEQLNQAISGQIAPEAVLDLRLQAYGD 3660
            MVEDDLMYSTFLEVYENHCARNDQEADLPITYFKEQLNQAISGQI PEAV+DLRLQA+GD
Sbjct: 3601 MVEDDLMYSTFLEVYENHCARNDQEADLPITYFKEQLNQAISGQILPEAVVDLRLQAFGD 3660

Query: 3661 ITRNLVNEGIFSQYMYKTLLSGNHMWAFKKQFAIQLALSSFMSYMLQIGGRSPNKIYFAK 3720
            ITRNLVN+GIFSQYMYKTLLSGNHMWAFKKQFAIQLALSSFMSYMLQIGGRSPNKIYFAK
Sbjct: 3661 ITRNLVNDGIFSQYMYKTLLSGNHMWAFKKQFAIQLALSSFMSYMLQIGGRSPNKIYFAK 3720

Query: 3721 NTGKIFQTDFHPAYDTNGMIEFNEPVPFRLTRNMQAFFSHFGVEGLIVSAMCSAAQAVVS 3780
            NTGKIFQTDFHPAYD NGMIEFNEPVPFRLTRNMQAFFS+FGVEGLIVSAMCSAAQAVVS
Sbjct: 3721 NTGKIFQTDFHPAYDANGMIEFNEPVPFRLTRNMQAFFSNFGVEGLIVSAMCSAAQAVVS 3780

Query: 3781 PKQNQHLWHQLAMFFRDELLSWSWRRPLGMPLASIAGGGMSPADFKQKVTINVDHVIGRI 3840
            PKQNQHLWHQLAMFFRDELLSWSWRRPLGMPLASIA GGM+PADFKQKVT NVD VIGRI
Sbjct: 3781 PKQNQHLWHQLAMFFRDELLSWSWRRPLGMPLASIAAGGMNPADFKQKVTTNVDLVIGRI 3840

Query: 3841 NGIAPQYFSEEEENAMDPPQSVQRGVSDLVDAALMPRHLCMMDPTWHPWF 3889
            NGIAPQYFSEEEENAMDPPQSVQRGVS+LVDAAL P++LCMMDPTWHPWF
Sbjct: 3841 NGIAPQYFSEEEENAMDPPQSVQRGVSELVDAALQPKNLCMMDPTWHPWF 3889

BLAST of MS010599 vs. NCBI nr
Match: XP_022950590.1 (transformation/transcription domain-associated protein-like [Cucurbita moschata])

HSP 1 Score: 7328.0 bits (19012), Expect = 0.0e+00
Identity = 3703/3889 (95.22%), Postives = 3794/3889 (97.56%), Query Frame = 0

Query: 1    MSPIQNFELHSRQLVEPELSIQTRLQMATEVRDSLEIAHTPEYLNFLKCYFRAFSIILVQ 60
            MSPIQNFE HSR LVEPEL+IQTRLQMATEVRDSLEIAHTPEYLNFLKCYFRAFS+IL++
Sbjct: 1    MSPIQNFEQHSRHLVEPELNIQTRLQMATEVRDSLEIAHTPEYLNFLKCYFRAFSVILLK 60

Query: 61   ITKPQYTDNHEHKLRNIVVEILNRLPHSEVLRPFVQDLLKVAMQVLTTDNEENGLICIRI 120
            ITKPQ+TDNHEHKLRNIVVEILNRLPHSEVLRPFVQDLLKVAMQVLTTDNEENGLICIRI
Sbjct: 61   ITKPQFTDNHEHKLRNIVVEILNRLPHSEVLRPFVQDLLKVAMQVLTTDNEENGLICIRI 120

Query: 121  IFDLLRNFRPTLENEVQPFLDFVCKIYQNFKLTVSHFFENSAAGGEDIKPMDVSTSTDQT 180
            IFDLLRNFRPTLENEVQPFLDFVCKIYQNFKLTV+HFFEN+AA GED KPMDVS+STDQ 
Sbjct: 121  IFDLLRNFRPTLENEVQPFLDFVCKIYQNFKLTVNHFFENTAAVGEDTKPMDVSSSTDQA 180

Query: 181  ITTGYTGTVQLNPSTRSFKIVTESPLVVMFLFQLYSRLVQTNIPVLLPLMVSAISVPGPE 240
            +TTG TGT QLNPSTRSFKIVTESPLVVMFLFQLYSRLVQTNIP+LLPLMVSAISVPGPE
Sbjct: 181  LTTGCTGTAQLNPSTRSFKIVTESPLVVMFLFQLYSRLVQTNIPLLLPLMVSAISVPGPE 240

Query: 241  KVPPFLKTHFIELKGAQVKTVSFLTYLLRSSADYIRPHEESICKSIVNLLVTCSDSVSIR 300
            KVPP LKTHFIELKGAQVKTVSFLTYLLRSSADYIRPHEESICKSIVNLLVTCSDSVSIR
Sbjct: 241  KVPPSLKTHFIELKGAQVKTVSFLTYLLRSSADYIRPHEESICKSIVNLLVTCSDSVSIR 300

Query: 301  KELLVALKHVLGTEYKRGLFPLIDTLLEEKVLVGTGRACYETLRPLAYSLLAEIVHHVRG 360
            KELLVALKHVLGTEYKRGLFPLIDTLLEEKVLVG GRACYETLRPLAYSLLAEIVHHVRG
Sbjct: 301  KELLVALKHVLGTEYKRGLFPLIDTLLEEKVLVGAGRACYETLRPLAYSLLAEIVHHVRG 360

Query: 361  DLTLSQLSRIIYLFSSNMHDASLSLSIHTTCARLMLNLVEPIFEKGVDQTSMDEARILLG 420
            DL+LSQLSRIIYLFSSNMHDASLSLSIHTTCARLMLNLVEPIFEKGVDQTSMDE+R LLG
Sbjct: 361  DLSLSQLSRIIYLFSSNMHDASLSLSIHTTCARLMLNLVEPIFEKGVDQTSMDESRTLLG 420

Query: 421  RILDAFVGKFSTFKHTIPQLLEEGEEGKDRANMRSKLELPVQAVLNLQVPVEHSKEVNDC 480
            RILDAFVGKFS FKHTIPQLLEEGEEGKDRAN+RSKLELPVQAVLNLQVPVEHSKEVNDC
Sbjct: 421  RILDAFVGKFSAFKHTIPQLLEEGEEGKDRANLRSKLELPVQAVLNLQVPVEHSKEVNDC 480

Query: 481  KHLIKTLILGMKTIIWSITHAHLPRPQASPSPNGTHPQMLVSPSSNLATPQAFKGMREDE 540
            KHLIKTLI+GMKTIIWSITHAHLPR Q SPSPNGTHPQMLV+PSSNLATPQAFKGMREDE
Sbjct: 481  KHLIKTLIMGMKTIIWSITHAHLPRSQVSPSPNGTHPQMLVTPSSNLATPQAFKGMREDE 540

Query: 541  VCKASGVLKSGVHCLTLFKEKDEEVEMLHLFSQILTVMEPRDLMDMFSLCMPELFDCMIT 600
            VCKASGVLKSGVHCLTLFKEKDEEVEMLHLFSQILTVMEPRDLMDMFSLCMPELFDCMI+
Sbjct: 541  VCKASGVLKSGVHCLTLFKEKDEEVEMLHLFSQILTVMEPRDLMDMFSLCMPELFDCMIS 600

Query: 601  NTQLVHLFSTFLQTPKVYRPFADVLVNFLVSSKLDVLKHPDSPGAKLVLHLFRFVFGAVA 660
            NTQLVHLFSTFLQTPKVYRPFADVLVNFLVSSKLDVLKHPDSPGAKLVLHLFRFVFGAV+
Sbjct: 601  NTQLVHLFSTFLQTPKVYRPFADVLVNFLVSSKLDVLKHPDSPGAKLVLHLFRFVFGAVS 660

Query: 661  KAPSDFERILQPHVTVIMEVCVRSATEVERPLGYMQLLRIMFRALAGCKFELLLRDLIPL 720
            KAPSDFERILQPHVTVIMEVCV+SATEVERPLGYMQLLRIMFRALAGCKFELLLRDLIPL
Sbjct: 661  KAPSDFERILQPHVTVIMEVCVKSATEVERPLGYMQLLRIMFRALAGCKFELLLRDLIPL 720

Query: 721  LQPCLNMLLTMFDGPTGEDMRDLLLELCLTLPARLSSLLPHLPRLMKPLVLCLKGSDDLV 780
            LQPCLNMLL M DGPTGEDMRDLLLELCLTLPARLSSLLPHLPRLMKPLVLCLKGSDDLV
Sbjct: 721  LQPCLNMLLIMLDGPTGEDMRDLLLELCLTLPARLSSLLPHLPRLMKPLVLCLKGSDDLV 780

Query: 781  SLGLRTLEFWVDSLNPDFLEPSMANVMSEVILALWSHLRPIPYPWGAKALQVLGKLGGRN 840
             LGLRTLEFWVDSLNPDFLEPSMA+VMSEVILALWSHLRP+PY WGAKALQVLGKLGGRN
Sbjct: 781  GLGLRTLEFWVDSLNPDFLEPSMASVMSEVILALWSHLRPMPYSWGAKALQVLGKLGGRN 840

Query: 841  RRFLKEPLALECKENPEHGLRLILTFEPSTPFLVPLDRCINLAVSTVMNKTGGVDSFYRK 900
            RRFLKEPLALECKENPEHGLRLILTFEP+TPFLVPLDRCINLAVS VMNKTGGVDSFYRK
Sbjct: 841  RRFLKEPLALECKENPEHGLRLILTFEPATPFLVPLDRCINLAVSAVMNKTGGVDSFYRK 900

Query: 901  QALKFLRVCLSSQLNLPGNVADDGHTPRQLSTLLVSPVDSSLRRSETPEGKADLGVKTKT 960
            QALKFLRVCLSSQLNLPG VADDG+TPRQLSTLLVS VDSS R+SET E KADLGVKTKT
Sbjct: 901  QALKFLRVCLSSQLNLPGIVADDGYTPRQLSTLLVSSVDSSWRKSETSEAKADLGVKTKT 960

Query: 961  QLMAEKSVFKILLMTIIAAGSEEDLHEPKDDFVLNVCRHFAILFHIDSSLNSSPVASASL 1020
            QLMAEKSVFK+LLMTIIAAGSEEDL+EPKDDFVLNVCRHFAILFHIDSSLN+ PVASASL
Sbjct: 961  QLMAEKSVFKLLLMTIIAAGSEEDLNEPKDDFVLNVCRHFAILFHIDSSLNNPPVASASL 1020

Query: 1021 GSTLLPPNVSANSRLRSSACCNLKELDPLIFLDALVEVLADENRVHAKAALNALNLFSEI 1080
            GSTLLP NV+ANSRL+SSACCNLKELDPL FLDALVEVLADENR HAKAALNALNLFSE+
Sbjct: 1021 GSTLLPSNVNANSRLKSSACCNLKELDPLTFLDALVEVLADENRFHAKAALNALNLFSEM 1080

Query: 1081 LLFLARAKQTDVMMTRGPSTPMIVSSPSKSPVYSPPPSVRIPVFEQLLPRLLHCCYGSTW 1140
            LLFL R KQTDVMMTRG  +PM VSSP  SP YSPPPSVRIPVFEQLLPRLLHCCYG TW
Sbjct: 1081 LLFLGRGKQTDVMMTRGSGSPMSVSSP-MSPAYSPPPSVRIPVFEQLLPRLLHCCYGCTW 1140

Query: 1141 QAQMGGIMGLGALVGKVTVETLCLFQVRIVRGLVYVLKRLPIYASKEQEETSQVLNQVLR 1200
            QAQMGG+MGLGALVGKVTVETLC FQV+IVRGLVYVLKRLPIYA+KEQEETSQVLN VLR
Sbjct: 1141 QAQMGGVMGLGALVGKVTVETLCHFQVKIVRGLVYVLKRLPIYANKEQEETSQVLNHVLR 1200

Query: 1201 VVNNVDEANSEPRRQSFHGVVDILASELFNPNSSTIVRKNVQSCLALLASRTGSEVSELL 1260
            VVNNVDEANSEPRRQSF  VVD+LASELFNPNSSTIVRKNVQSCLALLASRTGSEVSELL
Sbjct: 1201 VVNNVDEANSEPRRQSFQAVVDVLASELFNPNSSTIVRKNVQSCLALLASRTGSEVSELL 1260

Query: 1261 EPLHQPLLQPLLLRPLRLKTIDQQVGTVTALNFCLALRPPLLKLTQELVNFLQEALQIAE 1320
            EPL+QPLLQPLLLRPLRLKTIDQQVGTVTALNFCLALRPPLLKLTQELVNFLQEALQIAE
Sbjct: 1261 EPLYQPLLQPLLLRPLRLKTIDQQVGTVTALNFCLALRPPLLKLTQELVNFLQEALQIAE 1320

Query: 1321 ADETVWVVKFMNPKIATSLNKLRTACIELLCTTMAWADFKTPNHSELRAKIISMFFKSLT 1380
            ADETVWVVKFMNPKIATSLNKLRTACIELLCTTMAWADFKTP+HSELRAKIISMFFKSLT
Sbjct: 1321 ADETVWVVKFMNPKIATSLNKLRTACIELLCTTMAWADFKTPSHSELRAKIISMFFKSLT 1380

Query: 1381 CRTPEVVAVAKEGLRQVINQQRMPKDLLQGSLRPILVNLAHTKNLSMPLLQGLARLLELL 1440
            CR+PEVVAVAKEGLRQVINQQRMP+DLLQGSLRPILVNLAHTKNLSMPLLQGL RLLELL
Sbjct: 1381 CRSPEVVAVAKEGLRQVINQQRMPRDLLQGSLRPILVNLAHTKNLSMPLLQGLGRLLELL 1440

Query: 1441 ASWFNVTLGGKLLEHLKKWLEPEKLAQSQKAWKAGEEPKIAAAIIELFHLLPMAASKFLD 1500
            ASWFNVTLGGKLL+HLKKWLEPEKLAQSQKAWKAGEEPKIAAAIIELFHLLPMAASKFLD
Sbjct: 1441 ASWFNVTLGGKLLDHLKKWLEPEKLAQSQKAWKAGEEPKIAAAIIELFHLLPMAASKFLD 1500

Query: 1501 ELVTLTIDLEGALPPGQVYSEVNSPYRVPLIKFLNRYAPLAVDYFLARLSEPKYFRRFMY 1560
            ELVTLTI LEGALPPGQVYSEVNSPYR+PLIKFLNRYAPLAVDYFLARLSEPKYFRRFMY
Sbjct: 1501 ELVTLTIGLEGALPPGQVYSEVNSPYRIPLIKFLNRYAPLAVDYFLARLSEPKYFRRFMY 1560

Query: 1561 IIRSDAGQPLREELAKSPQKILASAFPEFAPKSEAALTPGSSTSPAPLSGDEGLVTPDAS 1620
            IIRSDAGQPLREELAKSPQKILASAFPEF PKSEAALTPGSST PAPLSGDEGLVTPD S
Sbjct: 1561 IIRSDAGQPLREELAKSPQKILASAFPEFVPKSEAALTPGSSTPPAPLSGDEGLVTPDVS 1620

Query: 1621 DPPSAPSSVVSDAYFRGLALIKTLVKLMPGWLQNNRVVFDTLVLVWKSPARIARLHNEQE 1680
            D PSAPSSVVSDAYFRGLAL+KTLVKLMPGWLQ+NRVVFDTLV VWKSPARIARLHNEQE
Sbjct: 1621 DTPSAPSSVVSDAYFRGLALVKTLVKLMPGWLQSNRVVFDTLVAVWKSPARIARLHNEQE 1680

Query: 1681 LNLVQVKESKWLVKCFLNYLRHEKAEVNVLFDILSIFLFHTRIDYTFLKEFYIIEVAEGY 1740
            LNLVQVKESKWLVKCFLNYLRHEKAEVNVLFDILSIFLFHTRIDYTFLKEFYIIEVAEGY
Sbjct: 1681 LNLVQVKESKWLVKCFLNYLRHEKAEVNVLFDILSIFLFHTRIDYTFLKEFYIIEVAEGY 1740

Query: 1741 PPNMKKALLLHFLNLFQSKQLGHDHLVIVMQMLILPMLAHAFQNGQSWEVVDQAIIKTIV 1800
            PPNMKKALLLHFLNLFQSKQLGHDHLV+VMQMLILPMLAHAFQNGQSWEVVDQAIIKTIV
Sbjct: 1741 PPNMKKALLLHFLNLFQSKQLGHDHLVVVMQMLILPMLAHAFQNGQSWEVVDQAIIKTIV 1800

Query: 1801 DKLLDPPEEVSAEYDEPLRIELLQLATLLLKYLQSDLVHHRKELIKFGWNHLKREDSASK 1860
            DKLLDPPEEV+AEYDEPLR+ELLQLATLLLKYLQSDLVHHRKELIKFGWNHLKREDSASK
Sbjct: 1801 DKLLDPPEEVTAEYDEPLRVELLQLATLLLKYLQSDLVHHRKELIKFGWNHLKREDSASK 1860

Query: 1861 QWAFVNVCHFLEAYQAPEKIILQVFVALLRTCQPENKMLVKQALDILMPALPRRLPLGDS 1920
            QWAFVNVCHFLEAYQAPEKIILQVFVALLRTCQPENKMLVKQALDILMPALPRRLPLGDS
Sbjct: 1861 QWAFVNVCHFLEAYQAPEKIILQVFVALLRTCQPENKMLVKQALDILMPALPRRLPLGDS 1920

Query: 1921 RMPIWIRYTKKILVEEGHSIPNLIHIFQLIVRHSDLFYSCRAQFVPQMVNSLSRLGLPYN 1980
            RMPIWIRYTKKILVEEGHSIPNLIHIFQLIVRHSDLFYSCRAQFVPQMVNSLSRLGLPYN
Sbjct: 1921 RMPIWIRYTKKILVEEGHSIPNLIHIFQLIVRHSDLFYSCRAQFVPQMVNSLSRLGLPYN 1980

Query: 1981 TTAENRRLAIDLAGLVVGWERQRQNEMKLVTESDAPSHSNDGLT-CPPGADPKRMVDGST 2040
            TTAENRRLAIDLAGLVVGWERQRQNEMK VTESDA SHSNDGLT CPPG DPKR+VDGST
Sbjct: 1981 TTAENRRLAIDLAGLVVGWERQRQNEMKHVTESDALSHSNDGLTSCPPGTDPKRLVDGST 2040

Query: 2041 FPEDSTKRVKVEPGLQSLCVMSPGGASSMPNIETPGSTTQPDEEFKPNAAMEEMIINFLI 2100
            FPEDSTKRVKVEPGL SLCVMSPGGASSMPN+ETPGS TQPDEEFKPNAAMEEMIINFLI
Sbjct: 2041 FPEDSTKRVKVEPGLPSLCVMSPGGASSMPNVETPGSATQPDEEFKPNAAMEEMIINFLI 2100

Query: 2101 RVALVIEPKDKEATAMYKQALELLSQALEVWPNANVKFNYLEKLLSSIQPSQSKDPSTAL 2160
            RVALVIEPKDKEATAMYKQALELLSQALEVWPNANVKFNYLEKLLSSIQPSQSKDPSTAL
Sbjct: 2101 RVALVIEPKDKEATAMYKQALELLSQALEVWPNANVKFNYLEKLLSSIQPSQSKDPSTAL 2160

Query: 2161 AQGLDVMNKVLEKQPHLFVRNNINQISQILEPCFKHKMLDAGKSLCSLLRMVFVAYPLEG 2220
            AQGLDVMNKVLEKQPHLFVRNNINQISQILEPCFKH+MLDAGKSLCSLLRMVFVAYPLEG
Sbjct: 2161 AQGLDVMNKVLEKQPHLFVRNNINQISQILEPCFKHRMLDAGKSLCSLLRMVFVAYPLEG 2220

Query: 2221 VTTPPDVKLLYQKVDELIKNHINNLTAPQTSSEDNTASSISFVLLVIKTLTEVQKNLIDP 2280
            VTTPPDVKLLYQKVDELIKNHINNLTAPQTSSEDNTASSISFVLLVIKTLTEVQKNLIDP
Sbjct: 2221 VTTPPDVKLLYQKVDELIKNHINNLTAPQTSSEDNTASSISFVLLVIKTLTEVQKNLIDP 2280

Query: 2281 YNLGRILQRLARDMGSSAGSHLRQGQRMDPDSAVTSSRQSADVGTVISNLKSVLKLINER 2340
            YNLGRILQRLARDMG SAGSHLRQGQRMDPDSAVTSSRQSADVGTVISNLKSVLKLINER
Sbjct: 2281 YNLGRILQRLARDMGMSAGSHLRQGQRMDPDSAVTSSRQSADVGTVISNLKSVLKLINER 2340

Query: 2341 VMLVPECKRSVTQIMNSLLSEKGTDASVLLCILDVIKGWVEDDFSKMGTSVSSSSFLAPK 2400
            VMLVPECKRSVTQIMNSLLSEKGTDASVLLCILDVIKGW+EDDFSKMGTSVSSSSFLAPK
Sbjct: 2341 VMLVPECKRSVTQIMNSLLSEKGTDASVLLCILDVIKGWIEDDFSKMGTSVSSSSFLAPK 2400

Query: 2401 EIVSFLQKLSQVDKQNFSSSAAEEWDGKYLQLLYEICADSNKYPLSLRQEVFQKVERQFM 2460
            EIVSFLQKLSQVDKQNFSSSAAEEWD KYLQLL+EICADSNKYPLSLRQEVFQKVERQFM
Sbjct: 2401 EIVSFLQKLSQVDKQNFSSSAAEEWDRKYLQLLHEICADSNKYPLSLRQEVFQKVERQFM 2460

Query: 2461 LGLRARDPETRKKFFTLYHESLGKTLFIRLQYIIQIQDWEALSDVFWLKQGLDLLLAVLV 2520
            LGLRARDPE RKKFFTLYHESLGKTLF RLQYIIQ+QDWEALSDVFWLKQGLDLLLAVLV
Sbjct: 2461 LGLRARDPEIRKKFFTLYHESLGKTLFTRLQYIIQVQDWEALSDVFWLKQGLDLLLAVLV 2520

Query: 2521 EDKPITLAPNSARLPPLLVSGHVADSSAVQPQVNDAQEGLEDAPLTFDSLVHKHAQFLNR 2580
            EDKPITLAPNSA+LPPLLVSGHVADSSAVQ  V DAQEG+EDAPLTFDSLV KHAQFLN+
Sbjct: 2521 EDKPITLAPNSAKLPPLLVSGHVADSSAVQHLVMDAQEGIEDAPLTFDSLVLKHAQFLNQ 2580

Query: 2581 TSKLQVADLIIPLRELAHTDANVAYHLWVLVFPIVWVTLHKEEQVALAKPMISLLSKDYH 2640
             SKLQVADLIIPLRELAHTDANVAYHLWVLVFPIVWVTLHKEEQVALAKPMI LLSKDYH
Sbjct: 2581 MSKLQVADLIIPLRELAHTDANVAYHLWVLVFPIVWVTLHKEEQVALAKPMIGLLSKDYH 2640

Query: 2641 KKQQASRPNVVQALLEGLQLSHPQPRMPSELIKYIGRTYNAWHIALALLESHVMLFMNET 2700
            KKQQASRPNVVQALLEGLQLSHPQPRMPSELIKYIG+TYNAWHIALALLESHVMLFMNET
Sbjct: 2641 KKQQASRPNVVQALLEGLQLSHPQPRMPSELIKYIGKTYNAWHIALALLESHVMLFMNET 2700

Query: 2701 KCSESLAELYRLLNEEDMRCGLWKRKAISAETKAGLSLVQHGYWQRAQILFYQSMVKATQ 2760
            KC+ESLAELYRLLNEEDMRCGLWKRKA +AETKAGLSLVQHGYWQRAQ LFYQSMVKATQ
Sbjct: 2701 KCAESLAELYRLLNEEDMRCGLWKRKANTAETKAGLSLVQHGYWQRAQSLFYQSMVKATQ 2760

Query: 2761 GTYNNNVPKAEMCLWEEQWLYCASQLSQWEALVDFGKSIENYEILLDSLWKVPDWAYMKE 2820
            GTYNN VPKAEMCLWEEQWL CASQLSQWEALVDFGKSIENYEILLD+LWKVPDWAYMKE
Sbjct: 2761 GTYNNTVPKAEMCLWEEQWLSCASQLSQWEALVDFGKSIENYEILLDNLWKVPDWAYMKE 2820

Query: 2821 HVIPKAQVEETPKLRLIQAYFSLHDRSTNGVADAENIVGKGVDLALEQWWQLPEMSVHAR 2880
            HVIPKAQVEETPKLRLIQAYF+LHDR+TNGVADAENIVGKGVDLALEQWWQLPEMSVHAR
Sbjct: 2821 HVIPKAQVEETPKLRLIQAYFALHDRTTNGVADAENIVGKGVDLALEQWWQLPEMSVHAR 2880

Query: 2881 IPLLQQFQQLVEVQESSRILVDIANGNKHSGSSVVSVHTNLYADLKDILETWRLRIPNEW 2940
            IPLLQQFQQLVEVQESSR+LVDIANGNK SG+S+  VH+NLYADLKDILETWRLRIPNEW
Sbjct: 2881 IPLLQQFQQLVEVQESSRVLVDIANGNKLSGNSIGGVHSNLYADLKDILETWRLRIPNEW 2940

Query: 2941 DSMTVWCDLLQWRNEMYNAVIDAFKDFGTTNSQLHHLGFRDKAWNVNKLAHVARKQGLHD 3000
            DSMTVWCDLLQWRNEMYNAVIDAFKDFG TNSQLHHLGFRDKAWNVNKLAHVARKQGL+D
Sbjct: 2941 DSMTVWCDLLQWRNEMYNAVIDAFKDFGNTNSQLHHLGFRDKAWNVNKLAHVARKQGLYD 3000

Query: 3001 VCVGILEKMYGHSTMEVQEAFVKIREQAKAYLEMKGELTSGLNLINSTNLDYFPVKHKAE 3060
            VCV IL+ MYGHSTMEVQEAFVKIREQAKAYLEMKGELTSGLNLINSTNL+YFPVKHKAE
Sbjct: 3001 VCVSILDSMYGHSTMEVQEAFVKIREQAKAYLEMKGELTSGLNLINSTNLEYFPVKHKAE 3060

Query: 3061 IFRLKGDFQLKLSDSEGANHSYSSAISLFKNLPKGWISWGNYCDMAYKESHEEIWLEYAV 3120
            IFRLKGDFQLKLSDSEGAN SYS+AI+LFKNLPKGWISWGNYCDMAY+ESH+EIWLEYAV
Sbjct: 3061 IFRLKGDFQLKLSDSEGANQSYSNAITLFKNLPKGWISWGNYCDMAYEESHDEIWLEYAV 3120

Query: 3121 SCFLQGIKFGISNSRNHLARVLYLLSFDTPNEPVGRAFDKYLDQIPHWVWLSWIPQLLLS 3180
            SCFLQGIKFGISNSRNHLARVLYLLSFDTPNEPVGRAFDK+LDQIPHWVWLSWIPQLLLS
Sbjct: 3121 SCFLQGIKFGISNSRNHLARVLYLLSFDTPNEPVGRAFDKFLDQIPHWVWLSWIPQLLLS 3180

Query: 3181 LQRTEAPHCKLVLLKIANVYPQALYYWLRTYLLERRDVANKSELGRMAMAQQRMQQNTSS 3240
            LQRTEAPHCKLVLLKIANVYPQALYYWLRTYLLERRDVANKSELGRMAMAQQRMQQN +S
Sbjct: 3181 LQRTEAPHCKLVLLKIANVYPQALYYWLRTYLLERRDVANKSELGRMAMAQQRMQQNAAS 3240

Query: 3241 AGSLGLTDGSSRVAHGGSSTSTDNQVHQGTQSGSGIGSHDGGNSHSQEPERSTGVESSTH 3300
            AGSLGL DG SR  H GSST TD+QVHQGTQSG+GIGSHDGGN+HSQEPER+TG +S TH
Sbjct: 3241 AGSLGLADGGSRAGHSGSSTPTDSQVHQGTQSGTGIGSHDGGNAHSQEPERTTGADSGTH 3300

Query: 3301 AGNDQSLPQTSSNVNEGTQNALRRSAALGLVGSAASAFDAAKDIMEALRSKHTNLASELE 3360
            AGNDQSLPQ SSNVNEGTQNA RRSAALGLVGSAASAFDAAKDIMEALRSKHTNLASELE
Sbjct: 3301 AGNDQSLPQPSSNVNEGTQNAFRRSAALGLVGSAASAFDAAKDIMEALRSKHTNLASELE 3360

Query: 3361 ILLTEIGSRFVTLPEERLLAVVNALLHRCYKYPTATTAEVPQSLKKELSGVCKACFSADA 3420
            ILLTEIGSRFVTLPEERLLAVVNALLHRCYKYPTATTAEVPQSLKKELSGVCKACFSADA
Sbjct: 3361 ILLTEIGSRFVTLPEERLLAVVNALLHRCYKYPTATTAEVPQSLKKELSGVCKACFSADA 3420

Query: 3421 VNKHVDFVREYKQDFERDLDPESTSTFPATLSELTERLKHWKNVLQGNVEDRFPAVLRLE 3480
            VNKHVDFVREYKQDFERDLDPESTSTFPATLSELTERLKHWKNVLQGNVEDRFPAVL+LE
Sbjct: 3421 VNKHVDFVREYKQDFERDLDPESTSTFPATLSELTERLKHWKNVLQGNVEDRFPAVLKLE 3480

Query: 3481 DESRVLRDFHVVDVEVPGQYFTDQEIAPDHTVKLDRVGADIPIVRRHGSSFRRLTLIGSD 3540
            +ESRVLRDFHVVDVEVPGQYFTDQEIAPDHTVKLDRVGADIPIVRRHGSSFRRLTLIGSD
Sbjct: 3481 EESRVLRDFHVVDVEVPGQYFTDQEIAPDHTVKLDRVGADIPIVRRHGSSFRRLTLIGSD 3540

Query: 3541 GSQRHFIVQTSLTPNARSDERILQLFRVMNQMFDKHKESRRRHLCIHTPIIIPVWSQVRM 3600
            GSQRHFIVQTSLTPNARSDERILQLFRVMNQMFDKHKESRRRHLCIHTPIIIPVWSQVRM
Sbjct: 3541 GSQRHFIVQTSLTPNARSDERILQLFRVMNQMFDKHKESRRRHLCIHTPIIIPVWSQVRM 3600

Query: 3601 VEDDLMYSTFLEVYENHCARNDQEADLPITYFKEQLNQAISGQIAPEAVLDLRLQAYGDI 3660
            VEDDLMYSTFLEVYENHCARNDQEADLPITYFKEQLNQAISGQI PEAV+DLRLQA+GDI
Sbjct: 3601 VEDDLMYSTFLEVYENHCARNDQEADLPITYFKEQLNQAISGQIVPEAVVDLRLQAFGDI 3660

Query: 3661 TRNLVNEGIFSQYMYKTLLSGNHMWAFKKQFAIQLALSSFMSYMLQIGGRSPNKIYFAKN 3720
            TRNLVNEGIFSQYMYKTLLSGNHMWAFKKQFAIQLALSSFMSYMLQIGGRSPNKIYFAKN
Sbjct: 3661 TRNLVNEGIFSQYMYKTLLSGNHMWAFKKQFAIQLALSSFMSYMLQIGGRSPNKIYFAKN 3720

Query: 3721 TGKIFQTDFHPAYDTNGMIEFNEPVPFRLTRNMQAFFSHFGVEGLIVSAMCSAAQAVVSP 3780
            TGKIFQTDFHPAYD +G+IEFNEPVPFRLTRNMQAFFS+FGVEGLIVSAMCSAAQAVVSP
Sbjct: 3721 TGKIFQTDFHPAYDASGVIEFNEPVPFRLTRNMQAFFSNFGVEGLIVSAMCSAAQAVVSP 3780

Query: 3781 KQNQHLWHQLAMFFRDELLSWSWRRPLGMPLASIAGGGMSPADFKQKVTINVDHVIGRIN 3840
            KQN HL HQLAMFFRDELLSWSWRRPLGMPLAS+AGGGM+PADFK KVT NVD VIGRI 
Sbjct: 3781 KQNHHLRHQLAMFFRDELLSWSWRRPLGMPLASLAGGGMNPADFKHKVTTNVDLVIGRIT 3840

Query: 3841 GIAPQYFSEEEENAMDPPQSVQRGVSDLVDAALMPRHLCMMDPTWHPWF 3889
            GI+PQY SEEEENAMDPPQSVQRGVS+LVDAAL P++LCMMDPTWHPWF
Sbjct: 3841 GISPQYVSEEEENAMDPPQSVQRGVSELVDAALQPKNLCMMDPTWHPWF 3888

BLAST of MS010599 vs. ExPASy Swiss-Prot
Match: A0A0R4ITC5 (Transformation/transcription domain-associated protein OS=Danio rerio OX=7955 GN=trrap PE=3 SV=1)

HSP 1 Score: 1355.5 bits (3507), Expect = 0.0e+00
Identity = 1122/4076 (27.53%), Postives = 1915/4076 (46.98%), Query Frame = 0

Query: 22   QTRLQMATEVRDSLE-IAHTPEYLNFLKCYFRAFSIILVQITKPQYTDNHEHKLRNIVVE 81
            +T+L+M  EV ++ E +  +P+Y  FL+     F   L         +    +LR +V+E
Sbjct: 36   ETKLKMMQEVSENFENVTSSPQYSTFLEHIIPRFLTFLQDGEVQFLQEKPTQQLRKLVLE 95

Query: 82   ILNRLPHSEVLRPFVQDLLKVAMQVLTTDNEENGLICIRIIFDLLRNFRPTLENEVQPFL 141
            I++R+P +E LR   +++L V  + L  ++EEN LIC+RII +L + FRP +  E+  FL
Sbjct: 96   IIHRIPTNEHLRSHAKNILSVMFRFLEIESEENVLICLRIIIELHKQFRPPISQEIHHFL 155

Query: 142  DFVCKIYQNFKLTVSHFFENSAAGGEDIKP----------MDVSTSTDQTITTGYTGTVQ 201
            DFV +IY+     V+ +FEN     E+  P          + V T+ ++  +   T T+ 
Sbjct: 156  DFVKQIYKELPKVVARYFENPQVIAENTVPSPEMVGMITSVMVKTAPERDDSETRTHTI- 215

Query: 202  LNPSTRSFKIVTESPLVVMFLFQLYSRLVQTNIPVLLPLMVSAISV---PGPEKVPPFLK 261
            +   + S K++ E P++V+ ++QLY   +   +   +PL+++ I +   P   +   F K
Sbjct: 216  IPRGSLSLKVLAELPIIVVLMYQLYKLNIHNVVSEFVPLIMNTIMLQVSPQARQHKLFNK 275

Query: 262  THFIELKGAQVKTVSFLTYLLRSSADYIRPHEESICKSIVNLLVTC-SDSVSIRKELLVA 321
              + +   AQ+KT+SFL Y++R   D +  + + + K ++ LL  C  ++  +RKELL+A
Sbjct: 276  ELYADFIAAQIKTLSFLAYIIRIYQDLVGKYSQQMVKGMLQLLSNCPPETAHLRKELLIA 335

Query: 322  LKHVLGTEYKRGLFPLIDTLLEEKVLVGTGRACYETLRPLAYSLLAEIVHHVRGDLTLSQ 381
             KH+L T+ +    P +D L +E +L+G+G    ETLRPLAYS LA++VHHVR +L L+ 
Sbjct: 336  AKHILTTDLRSQFIPCMDKLFDESILIGSGYTARETLRPLAYSTLADLVHHVRQNLPLTD 395

Query: 382  LSRIIYLFSSNMHDASLSLSIHTTCARLMLNLVEPIFEKGVDQTSMDEARILLGRILDAF 441
            LS  + LF+ N+ D SL  SI T   +L+LNLV+ I  K   +      R +L R+L+ F
Sbjct: 396  LSLAVQLFAKNIDDESLPSSIQTMSCKLLLNLVDCIRSKSEQENG--NGRDILMRMLEVF 455

Query: 442  VGKFST---------FKHTIPQ----LLEEGE-EGKDRANMRSKLELPVQAVLNLQVPV- 501
            V KF T         FK   PQ    +++ G   G       +   LP  A      P  
Sbjct: 456  VLKFHTIARYQLVSIFKKCKPQSEMGVVDTGALPGVPATPTVTTPALPPPAPPTPVTPAP 515

Query: 502  --------------EHSKEVNDCKHLIKTLILGMKTIIWSITHA-------HLPRPQASP 561
                          + + +V+DC+ L+KTL+ G+KTI W IT          +P  Q  P
Sbjct: 516  PPATSFDRAGEKEDKQTFQVSDCRSLVKTLVCGVKTITWGITSCKAPGEAQFIPNKQLQP 575

Query: 562  SPNGTHPQMLVSPSSNLATPQAFKGMREDEVCKASGVLKSGVHCLTLFKEKDEEVEMLHL 621
                 + +++      L   Q        ++           +C T+     EE E+L  
Sbjct: 576  KETQIYIKLVKYAMQALDIYQV-------QIAGNGQTYIRVANCQTV--RMKEEKEVLEH 635

Query: 622  FSQILTVMEPRDLMDMFSLCMPELFDCMITNTQLVHLFSTFLQTPKVYRPFADVLVNFLV 681
            F+ + T+M P    ++F   +P + + +  N  L  + ++FL        FA +LV +L+
Sbjct: 636  FAGVFTMMNPLTFKEIFQTTVPYMVERISKNYALQIVANSFLANLTTSALFATILVEYLL 695

Query: 682  SSKLDVLKHPDSPGAKLVLHLFRFVFGAVAKAPSDFERILQPHVTVIMEVCVRSATEVER 741
                ++  + +   + L L LF+ VFG+V+   ++ E++L+PH+  I+   +  A   + 
Sbjct: 696  ERLPEMGSNVEL--SNLYLKLFKLVFGSVSLFAAENEQMLKPHLHKIVNSSMELAQSAKE 755

Query: 742  PLGYMQLLRIMFRALAGCKFELLLRDLIPLLQPCLNMLLTMFDGPTGEDMRDLLLELCLT 801
            P  Y  LLR +FR++ G   +LL ++ +PLL   L  L  +  G   + M+DL +ELCLT
Sbjct: 756  PYNYFLLLRALFRSIGGGSHDLLYQEFLPLLPNLLQGLNMLQSGLHKQHMKDLFVELCLT 815

Query: 802  LPARLSSLLPHLPRLMKPLVLCLKGSDDLVSLGLRTLEFWVDSLNPDFLEPSMANVMSEV 861
            +P RLSSLLP+LP LM PLV  L GS  LVS GLRTLE  VD+L PDFL   +  V +E+
Sbjct: 816  VPVRLSSLLPYLPMLMDPLVSALNGSQTLVSQGLRTLELCVDNLQPDFLYDHIQPVRAEL 875

Query: 862  ILALWSHLRPIPYPWGAKALQVLGKLGGRNRRFLKEPLALECKENPEHGLRLILTF-EPS 921
            + ALW  LR         A +VLGK GG NR+ LKE   L        G  +   F +  
Sbjct: 876  MQALWRTLRNPAETISHVAYRVLGKFGGSNRKMLKESQKLLYVVTEVQGPSIKAEFTDCK 935

Query: 922  TPFLVPLDRCINLAVSTVMNKTGGVDSFYRKQALKFLRVCLSSQLNLPGNVADDGHTPRQ 981
                +P+++ I  A+  +  K+   + +YR+QA + ++  L +  +L  N          
Sbjct: 936  ASIQLPMEKAIETALDCL--KSANTEPYYRRQAWEVIKCFLVAMTSLEDN-------KHS 995

Query: 982  LSTLLVSPVDSSLRRSETPEGKADLGVKTKTQLMAEKSVFKILLMTIIAAGSEEDLHEPK 1041
            L  LL  P       +E       +  + K Q    +  F+  L     +   +DL    
Sbjct: 996  LYQLLAHP-----NFTEKWIPNVIISHRYKAQDTPARRTFEQALTGAFMSAVIKDLRPSA 1055

Query: 1042 DDFVLNVCRHFAILFHIDSSLNSSPVASASLGSTLLPPNVSA---NSRLRSSACCNLKEL 1101
              FV ++ RH+ ++             +   G  LLP   S    ++ +  S     K +
Sbjct: 1056 LPFVASLIRHYTMV-----------AVAQQCGPFLLPCYQSGSQPSTGMFHSEENGSKGM 1115

Query: 1102 DPLIFLDALVEVLADENRVHAKAALNALNLFSEILLFLARAKQTDVMMTRGPSTPMIVSS 1161
            DPL+ +DA+   +A E +   K    AL +  ++   +  +K+                 
Sbjct: 1116 DPLVLIDAIAICMAYEEKELCKIGEVALAVIFDVASIILGSKER---------------- 1175

Query: 1162 PSKSPVYSPPPSVRIPVFEQLLPRLLHCCYGSTWQAQMGGIMGLGALVGKVTVETLCLFQ 1221
                       + ++P+F  ++ RL  CCY   W A++GG++ +  L+ ++ +  +   Q
Sbjct: 1176 -----------ACQLPLFSYIVERLCACCYEQAWYAKLGGVVSIKFLMERLPLIWVLQNQ 1235

Query: 1222 VRIVRGLVYVLKRLPIYASKEQEETSQ-VLNQVL-RVVNNV-DEANSE----PRRQSFHG 1281
            +  ++ L++V+  L    S      ++  L Q+L R    + DE  +E     + +SFH 
Sbjct: 1236 LTFLKALLFVMMDLTGEVSNGAVAMAKTTLEQLLIRCATPLKDEEKTEELLSAQDKSFHL 1295

Query: 1282 VVDILASELFNPNSSTIVRKNVQSCLALLASRTGSEVSELLEPLHQPLLQ---PLLLRPL 1341
            V   L  E+ +PNS+  VRK     L +LA  TG  V+ ++EP H+ +LQ   P     L
Sbjct: 1296 VTHDLVREVTSPNST--VRKQAMHSLQVLAQVTGKSVTIIMEP-HKEVLQDMVPPKKHLL 1355

Query: 1342 RLKTIDQQVGTVTALNFCLALRPPLLKLTQELVN---FLQEALQIAEADETVWVVKFMNP 1401
            R +  + Q+G +    FC  L+P L  +   ++    F  E L + EA++   ++K    
Sbjct: 1356 RHQPANAQIGLMEGNTFCTTLQPRLFTMDLNVMEHKVFYTELLNLCEAEDAA-LMKLPCY 1415

Query: 1402 KIATSLNKLRTACIELLCTTMAWADFKTPNH-SELRAKIISMFFKSLTCRTPEVVAVAKE 1461
            K   SL  LR A +  L            N+  + R KII+  FK+L     E+    + 
Sbjct: 1416 KSLPSLVPLRIAALNALAAC---------NYLPQSREKIIAALFKALNSTNSELQEAGEA 1475

Query: 1462 GLRQVINQQRMPKDLLQGSLRPILVNLAHTKNLSMPLLQGLARLLELLASWFNVTLGGKL 1521
             + + +    +  D +   +RP+L+ L   ++L++ ++  L  +  L  + FN     ++
Sbjct: 1476 CMGKFLEGATIEVDQIHTHMRPLLMMLGDYRSLTLNVVNRLTSVTRLFPNSFNDKFCDQM 1535

Query: 1522 LEHLKKWLEPEKL-------AQSQKAWKAGEEPKIAAAIIELFHLLPMAASKFLDELVTL 1581
            ++HL+KW+E   +            A +  EE +I +AII LFHL+P A    +  L+ +
Sbjct: 1536 MQHLRKWMEVVVITHKGGQRGDGSPAMEGVEEMRICSAIINLFHLIPAAPQTLVKPLLEV 1595

Query: 1582 TIDLEGALPPGQVYSEVNSPYRVPLIKFLNRYAPLAVDYFL--ARLSEPKYFRRFMYIIR 1641
             +  E A+       E  SP+R PLIKFL R+    V+ F+  A L++P++ R FM  ++
Sbjct: 1596 VMKTERAM-----LIEAGSPFREPLIKFLTRHPSQTVELFMMEATLNDPQWSRMFMSFLK 1655

Query: 1642 SDAGQPLREELAKSPQKILASAFPEFAPKSEAALTPGS-STSPAPLSGDEGLVTPDASDP 1701
                +PLR+ LA +P + +    P     S A + PGS STS A L              
Sbjct: 1656 HKDAKPLRDVLASNPNRFVPLLVP---AGSAATVRPGSPSTSTARL-------------- 1715

Query: 1702 PSAPSSVVSDAYFRGLALIKTLVKLMPGWLQNNRVVFDTLVLVWKSPARIARLHNEQELN 1761
                     D  F+ + +I  +VK   GWL     +   L  VW S A   R H +  + 
Sbjct: 1716 ---------DLQFQAIKIISIIVKNDEGWLAGQHSLVSQLRRVWVSEAFQER-HRKDNMA 1775

Query: 1762 LVQVKESKWLVKCFLNYLRHEKAEVNVLFDILSIFLFHTRIDYTFLKEFYIIEVAEGYPP 1821
                KE K L  C L+Y +   +E+ +LF +L  F      + TFLKE+   E+ + Y  
Sbjct: 1776 ATNWKEPKLLAFCLLSYCKRNYSEIELLFQLLRAFTGRFLCNMTFLKEYMEEEIPKNYGI 1835

Query: 1822 NMKKALLLHFLNLFQSKQLGHDHLVIVMQMLILPMLAHAFQNGQSWEVV---------DQ 1881
              K+AL   F+  F       +    V+Q ++ P   ++F+ G+  +++          +
Sbjct: 1836 THKRALFFRFVE-FNDPHFNDELKAKVLQHILNPAFLYSFEKGEGEQLLGPPNPEGDNPE 1895

Query: 1882 AIIKTIVDKLLDPPEEVSAEYDEPLRIELLQLATLLLKYLQSDLVHHRK-------ELIK 1941
            +I    + K+LDP  E  A+  + LRI LLQ +TLL+++    +  + K        L+ 
Sbjct: 1896 SITSVFITKVLDP--EKQADLADSLRIYLLQFSTLLVEHAPHHIHDNNKSRNSKLRRLMT 1955

Query: 1942 FGWNHLKRE---DSASKQWAFVNVCHFLEAYQAPEKIILQVFVALLRTCQPENKMLVKQA 2001
            F W  L  +   D A K    + + H +  +   +KI+LQVF +LL+    E + +V+QA
Sbjct: 1956 FAWPCLLPKTCVDPACKYSGHLLLAHIIAKFAIHKKIVLQVFHSLLKAHTMEARAIVRQA 2015

Query: 2002 LDILMPALPRRLPLGDSRMPIWIRYTKKILVEEGHSIPNLIHIFQLIVRHSDLFYSCRAQ 2061
            + IL PA+P R+  G   +  W   T+KI+VEEGH++P L+HI  LIV+H  ++Y  R  
Sbjct: 2016 MAILTPAVPARMEDGHQMLTHW---TRKIIVEEGHTVPQLVHILHLIVQHFRVYYPVRHH 2075

Query: 2062 FVPQMVNSLSRLGLPYNTTAENRRLAIDLAGLVVGWERQRQNEMKLVTESDAPSHSNDGL 2121
             V  M++++ RLG   + T E R+LA+DLA +V+ WE QR  + +  +E+D P    +G 
Sbjct: 2076 LVQHMISAMQRLGFTPSVTIEQRKLAVDLAEVVIKWELQRIKDQQPESEAD-PGSVGEGT 2135

Query: 2122 TCPPGADPKRMVDGSTFPEDSTKRVKVEPGLQSLCVMSPGGASSMPNIETPGSTTQPDEE 2181
            +    A  + M   S       KR +   G         G + S+P   T    T+P E+
Sbjct: 2136 SGASAAMKRGM---SVDSAQDVKRFRTAAGAVGTVF---GRSQSIPG--TEALLTKPVEK 2195

Query: 2182 FKPNAAMEEMIINFLIRVALVIEPKDKEATA----MYKQALELLSQAL--EVWPNANVKF 2241
                    + ++NFLIR+A  +      A +    + ++ + L+  AL  ++WP++ +K 
Sbjct: 2196 -----QHTDTVVNFLIRIACQVNDSTNVAGSPGELLSRRCVNLMKTALRPDMWPSSELKL 2255

Query: 2242 NYLEKLLSSI-QPSQSKDPSTALAQGLDVMNKVLEKQPHLFVRNNINQISQILEPCFKHK 2301
             + +KLL ++ QP+Q+    + +  GL+++  +L       + ++   + + +  C    
Sbjct: 2256 QWFDKLLMTVEQPNQAN--FSNICTGLEILCFLLSVLQPPAILSHFKPLQRGIAACMTCG 2315

Query: 2302 MLDAGKSLCSLLRMVFVAYPLEGVTTP-----PDVKLLYQKVDELIKNHINNLTAPQTSS 2361
                 +++ SLL  +   +P E  T+       +++ LY  V ++I   + N     +++
Sbjct: 2316 NTKVLRAVHSLLSRLMSTFPTEPSTSSVASKYEELECLYAAVGKVIYEGLTNYEKASSAN 2375

Query: 2362 EDNTASSISFVLLVIKTLTEVQKNLIDPYNLGRILQRLARDMGSSAGSHLRQGQRMDPDS 2421
                 + +   L+++K+      + ID     R++    R +      HL    + +P +
Sbjct: 2376 ----PTQLFGTLMILKSACSNNSSYID-----RLISVFMRSLQKMVREHL--SPQPNPGA 2435

Query: 2422 AVTSSRQSADVGTVISNLKSVLKLINERVMLVPECKRSVTQIMNSLLSEKGTDASVLLCI 2481
            A TS+  S  V   +  +K  L ++N       E +++  Q++ + L EK  D  +L  +
Sbjct: 2436 AETSTVTSELVMLSLDLVKMRLSVMN------MEMRKNFIQVILTSLIEKSPDPKILRAV 2495

Query: 2482 LDVIKGWVEDDFSKMGTSVSSSSFLAPKEIVSFLQKLSQVDKQNFSSSAAEEWDGKYLQL 2541
            + +++ WV++  + M T+   +    P+E    L K+    ++ F      E + ++L L
Sbjct: 2496 VKIVEEWVKNSGNPMATNQVPN----PREKSILLVKMMTYIEKRFPDDL--ELNAQFLDL 2555

Query: 2542 LYEICADSNKYPLSLRQEVFQKVERQFMLGLRARDPETRKKFFTLYHESLGKTLFIRLQY 2601
            +  +  D N   LS   ++  K+E  F+ GLR   P  R KFF ++  S+ + ++ RL Y
Sbjct: 2556 VNYVYRDDN---LS-GSDITSKLEPAFLSGLRCTQPLIRAKFFEVFDASMKRRVYERLLY 2615

Query: 2602 IIQIQDWEALSDVFWLKQGLDLLLAVLVEDKPITLAPNSARLPPLLVSGHVADSS----- 2661
            I   Q+WE++   FW+KQ  +LLLAV   +  I  +   + LP +    ++ADS      
Sbjct: 2616 ICCSQNWESMGSHFWIKQCTELLLAVCERNTTIGTSCQGSMLPSITNVINLADSHDRAAF 2675

Query: 2662 ------AVQPQVNDAQEGLE--------------------------DAPLTFDSLVHKHA 2721
                    +P+  +  E  E                          DA      L ++H 
Sbjct: 2676 AMATHIKQEPRERENSETKEEDVEIDIELAPGDQTSLPKTKEQAERDAGNQLHMLTNRHD 2735

Query: 2722 QFLNRTSKLQVADLIIPLRELAHTDANVAYHLWVLVFPIVWVTLHKEEQVALAKPMISLL 2781
            +FL+   +++   L+  L +L H    +A   WV +FP +W  L   +Q AL+  M   L
Sbjct: 2736 KFLDSLREVKTGALLNALVQLCHISTPLAEKTWVQLFPRLWKILSDRQQHALSGEMGPFL 2795

Query: 2782 SKDYHKKQQASRPNVVQALLEGLQLSHPQPRMPSELIKYIGRTYNAWHIALALLESHVM- 2841
                H+ Q+  +P+ +   +E +    P   +   ++KY+G+T+N W  +  +LE     
Sbjct: 2796 CSGSHQAQRDCQPSALNCFVEAMSQCVPPIPIRPCVLKYLGKTHNLWLRSTLMLEQQAFE 2855

Query: 2842 --------------------LFMNETKCSESLAELYRLLNEEDMRCGLWKRKAISAETKA 2901
                                +   + +  +SLAELY LL EEDM  GLW+++    ET  
Sbjct: 2856 KGLNLHIKPKQSTEFYEQESITPPQQEILDSLAELYSLLQEEDMWAGLWQKRCKFPETST 2915

Query: 2902 GLSLVQHGYWQRAQILFYQSMVKATQGTYNNNVPKA---EMCLWEEQWLYCASQLSQWEA 2961
             ++  QHG++++AQ  + ++M KA +    +NV  A   E  LWE+ W+ C+ +L+QWE 
Sbjct: 2916 AIAYEQHGFFEQAQETYEKAMEKARK---EHNVSPAIFPEYQLWEDHWIRCSKELNQWEP 2975

Query: 2962 LVDFG--KSIENYEILLDSLWKVPDWAYMKEHVIP---KAQVEETPKLRLIQAYFSLHDR 3021
            L ++G  K   N  ++L+  W+V +WA MKE ++        E   K+ + + Y ++   
Sbjct: 2976 LTEYGQSKGHNNPYLVLECAWRVSNWAAMKEALVQVELSCPKEMAWKVNMHRGYLAICHP 3035

Query: 3022 STNGVADAENIVGKGVDLALEQWWQLPEMSVHARIPLLQQFQQLVEVQESSRILVDIANG 3081
                +   E +V     LA+ +W +LP +  H   PLLQ  QQ++E+QE+++I   +   
Sbjct: 3036 EEQQLNFIERLVEMASSLAIREWRRLPHIVSHVHTPLLQAAQQIIELQEAAQINAGLQPA 3095

Query: 3082 NKHSGSSVVSVHTNLYADLKDILETWRLRIPNEWDSMTVWCDLLQWRNEMYNAVIDAFK- 3141
            N       +  +T+L+ D+K +++TWR R+P   D ++ W  +  WR   Y A++ A++ 
Sbjct: 3096 N-------LGRNTSLH-DMKTVVKTWRNRLPIVSDDLSHWSSIFMWRQHHYQAIVTAYEN 3155

Query: 3142 ----DFGTTNSQLHHLGFRDKAWNVNKLAHVARKQGLHDVCVGILEKMYGHSTMEVQEAF 3201
                D  T N+    LG    A  + +   +ARKQGL +V + IL +++   T+ + + F
Sbjct: 3156 NTQHDPNTNNAM---LGVHASASAIIQYGKIARKQGLVNVALDILSRIHTIPTVPIVDCF 3215

Query: 3202 VKIREQAKAYLEM-----KGELTSGLNLINSTNLDYFPVKHKAEIFRLKGDFQLKLSDSE 3261
             KIR+Q K YL++     K E   GL +I STNL YF  +  AE + LKG F  +++ SE
Sbjct: 3216 QKIRQQVKCYLQLAGVMGKNECMQGLEVIESTNLKYFTKEMTAEFYALKGMFLAQINKSE 3275

Query: 3262 GANHSYSSAISLFKNLPKGWISWGNYCDMAYKESHEEIWLEYAVSCFLQGIKF-GISNSR 3321
             AN ++S+A+ +   L K W  WG+Y +  + +  +      +++C+L   +    S SR
Sbjct: 3276 EANKAFSAAVQMHDVLVKAWAMWGDYLENIFVKDRQPHLGVSSITCYLHACRHQNESKSR 3335

Query: 3322 NHLARVLYLLSFDTPNEPVGRAFDKYLDQIPHWVWLSWIPQLLLSLQRTEAPHCKLVLLK 3381
             +LA+VL+LLSFD  N  +  A DKY   +P   WL+WIPQLL  L  +E      ++ +
Sbjct: 3336 KYLAKVLWLLSFDDKN-TLADAVDKYCIGVPPIQWLAWIPQLLTCLVGSEGKPLLNLISQ 3395

Query: 3382 IANVYPQALYYWLRT-YL---LERRDVANKSELGRMAMAQQRMQQNTSSAGSLGLTDGSS 3441
            +  VYPQA+Y+ +RT YL   +E+R+   KS+ G         QQ  SSA +        
Sbjct: 3396 VGRVYPQAVYFPIRTLYLTLKIEQRE-RYKSDSG---------QQQPSSAAA-------- 3455

Query: 3442 RVAHGGSSTSTDNQVHQGTQSGSGIGSHDGGNSHSQEP----ERSTGVESSTHAGNDQSL 3501
                         Q H  +         D G   +  P     R   ++   H     SL
Sbjct: 3456 -------------QTHSAS---------DPGPIRATAPMWRCSRIMHMQRELHPTLLSSL 3515

Query: 3502 PQTSSNV---NEGTQNALRRSAALGLVGSAASAFDAAKDIMEALRSKHTNLASELEILLT 3561
                  +    E     + R    GL    + AF+ +  + +A  + HT           
Sbjct: 3516 EGIVDQMVWFRENWHEEVLRQLQQGLAKCYSVAFEKSGAVSDAKITPHT----------- 3575

Query: 3562 EIGSRFVTLPEERLLAVVNALLHRCYKYPTATTAEVPQSLKKELSGVCKACFSADAVNKH 3621
                 FV    ++L++     L       T  ++   +SL +      +     D V   
Sbjct: 3576 ---LNFV----KKLVSTFGVGLENVSNVSTMFSSAASESLARRAQATAQ-----DPV--- 3635

Query: 3622 VDFVREYKQDFERDLDPESTSTFPATLSELTERLKHWKNVLQGNVEDRFPAVLRLEDESR 3681
                ++ K  F  D D     +    L  L  +LK W  +L+   + + P    +E++ R
Sbjct: 3636 ---FQKMKGQFTTDFDFSVPGSM--KLHNLISKLKKWIKILEAKTK-QLPKFFLIEEKCR 3695

Query: 3682 VLRDF--HVVDVEVPGQYFTDQEIAPDH-TVKLDRVGADIPIVRRHGSSFRRLTLIGSDG 3741
             L +F     +VE+PG++   +   P H  +K+ R    + IV++H ++ RRL + G +G
Sbjct: 3696 FLSNFSAQTAEVEIPGEFLMPK---PTHYYIKIARFMPRVEIVQKHNTAARRLYIRGHNG 3755

Query: 3742 S-QRHFIVQTSLTPNARSDERILQLFRVMNQMFDKHKESRRRHLCIHTPIIIPVWSQVRM 3801
                + ++  +    +R +ER+LQL R++N   +K KE+ +RHL    P ++ V  Q+R+
Sbjct: 3756 KIYPYLVMNDACLTESRREERVLQLLRLLNPCLEKRKETTKRHLFFTVPRVVAVSPQMRL 3815

Query: 3802 VEDDLMYSTFLEVYENHCARNDQEADLPITYFKEQLN--QAISGQIAPEAVLDLRLQAYG 3861
            VED+    + +E+Y+  CA+   E D PI+ + ++L   QA   Q + + + D+  +  G
Sbjct: 3816 VEDNPSSLSLVEIYKQRCAKKGIEHDNPISRYYDRLATVQARGTQASHQVLRDILKEVQG 3840

Query: 3862 DITRNLVNEGIFSQYMYKTLLSGNHMWAFKKQFAIQLALSSFMSYMLQIGGRSPNKIYFA 3888
                N+V   +  ++   T  +    W F+K F IQLAL     +ML +   +P  +  A
Sbjct: 3876 ----NMVPRSMLKEWALHTFPNATDYWTFRKMFTIQLALIGLAEFMLHLNRLNPEMLQIA 3840

BLAST of MS010599 vs. ExPASy Swiss-Prot
Match: Q9Y4A5 (Transformation/transcription domain-associated protein OS=Homo sapiens OX=9606 GN=TRRAP PE=1 SV=3)

HSP 1 Score: 1355.5 bits (3507), Expect = 0.0e+00
Identity = 1113/4098 (27.16%), Postives = 1906/4098 (46.51%), Query Frame = 0

Query: 22   QTRLQMATEVRDSLE-IAHTPEYLNFLKCYFRAFSIILVQITKPQYTDNHEHKLRNIVVE 81
            +T+L+M  EV ++ E +  +P+Y  FL+     F   L         +    +LR +V+E
Sbjct: 36   ETKLKMMQEVSENFENVTSSPQYSTFLEHIIPRFLTFLQDGEVQFLQEKPAQQLRKLVLE 95

Query: 82   ILNRLPHSEVLRPFVQDLLKVAMQVLTTDNEENGLICIRIIFDLLRNFRPTLENEVQPFL 141
            I++R+P +E LRP  +++L V  + L T+NEEN LIC+RII +L + FRP +  E+  FL
Sbjct: 96   IIHRIPTNEHLRPHTKNVLSVMFRFLETENEENVLICLRIIIELHKQFRPPITQEIHHFL 155

Query: 142  DFVCKIYQNFKLTVSHFFENSAAGGEDIKPMDVSTSTDQTITT--------GYTGTVQLN 201
            DFV +IY+     V+ +FEN     E+  P         TI            T T  + 
Sbjct: 156  DFVKQIYKELPKVVNRYFENPQVIPENTVPPPEMVGMITTIAVKVNPEREDSETRTHSII 215

Query: 202  P-STRSFKIVTESPLVVMFLFQLYSRLVQTNIPVLLPLMVSAISVPGPEKVPP---FLKT 261
            P  + S K++ E P++V+ ++QLY   +   +   +PL+++ I++    +      + K 
Sbjct: 216  PRGSLSLKVLAELPIIVVLMYQLYKLNIHNVVAEFVPLIMNTIAIQVSAQARQHKLYNKE 275

Query: 262  HFIELKGAQVKTVSFLTYLLRSSADYIRPHEESICKSIVNLLVTC-SDSVSIRKELLVAL 321
             + +   AQ+KT+SFL Y++R   + +  + + + K ++ LL  C +++  +RKELL+A 
Sbjct: 276  LYADFIAAQIKTLSFLAYIIRIYQELVTKYSQQMVKGMLQLLSNCPAETAHLRKELLIAA 335

Query: 322  KHVLGTEYKRGLFPLIDTLLEEKVLVGTGRACYETLRPLAYSLLAEIVHHVRGDLTLSQL 381
            KH+L TE +    P +D L +E +L+G+G    ETLRPLAYS LA++VHHVR  L LS L
Sbjct: 336  KHILTTELRNQFIPCMDKLFDESILIGSGYTARETLRPLAYSTLADLVHHVRQHLPLSDL 395

Query: 382  SRIIYLFSSNMHDASLSLSIHTTCARLMLNLVEPIFEKGVDQTSMDEARILLGRILDAFV 441
            S  + LF+ N+ D SL  SI T   +L+LNLV+ I  K   ++     R +L R+L+ FV
Sbjct: 396  SLAVQLFAKNIDDESLPSSIQTMSCKLLLNLVDCIRSKSEQESG--NGRDVLMRMLEVFV 455

Query: 442  GKFST---------FKHTIPQ----LLEEGEEGKDRANMR---SKLELPVQA-------- 501
             KF T         FK   PQ     +E    G   A      +    PV A        
Sbjct: 456  LKFHTIARYQLSAIFKKCKPQSELGAVEAALPGVPTAPAAPGPAPSPAPVPAPPPPPPPP 515

Query: 502  -----VLNLQVPV-----EHSKE------VNDCKHLIKTLILGMKTIIWSITHA------ 561
                 V    VP      E  KE      V DC+ L+KTL+ G+KTI W IT        
Sbjct: 516  PPATPVTPAPVPPFEKQGEKDKEDKQTFQVTDCRSLVKTLVCGVKTITWGITSCKAPGEA 575

Query: 562  -HLPRPQASPSPNGTHPQMLVSPSSNLATPQAFKGMREDEVCKASGVLKSGVHCLTLFKE 621
              +P  Q  P     + +++      L   Q        ++           +C T+   
Sbjct: 576  QFIPNKQLQPKETQIYIKLVKYAMQALDIYQV-------QIAGNGQTYIRVANCQTV--R 635

Query: 622  KDEEVEMLHLFSQILTVMEPRDLMDMFSLCMPELFDCMITNTQLVHLFSTFLQTPKVYRP 681
              EE E+L  F+ + T+M P    ++F   +P + + +  N  L  + ++FL  P     
Sbjct: 636  MKEEKEVLEHFAGVFTMMNPLTFKEIFQTTVPYMVERISKNYALQIVANSFLANPTTSAL 695

Query: 682  FADVLVNFLVSSKLDVLKHPDSPGAKLVLHLFRFVFGAVAKAPSDFERILQPHVTVIMEV 741
            FA +LV +L+    ++  + +   + L L LF+ VFG+V+   ++ E++L+PH+  I+  
Sbjct: 696  FATILVEYLLDRLPEMGSNVEL--SNLYLKLFKLVFGSVSLFAAENEQMLKPHLHKIVNS 755

Query: 742  CVRSATEVERPLGYMQLLRIMFRALAGCKFELLLRDLIPLLQPCLNMLLTMFDGPTGEDM 801
             +  A   + P  Y  LLR +FR++ G   +LL ++ +PLL   L  L  +  G   + M
Sbjct: 756  SMELAQTAKEPYNYFLLLRALFRSIGGGSHDLLYQEFLPLLPNLLQGLNMLQSGLHKQHM 815

Query: 802  RDLLLELCLTLPARLSSLLPHLPRLMKPLVLCLKGSDDLVSLGLRTLEFWVDSLNPDFLE 861
            +DL +ELCLT+P RLSSLLP+LP LM PLV  L GS  LVS GLRTLE  VD+L PDFL 
Sbjct: 816  KDLFVELCLTVPVRLSSLLPYLPMLMDPLVSALNGSQTLVSQGLRTLELCVDNLQPDFLY 875

Query: 862  PSMANVMSEVILALWSHLRPIPYPWGAKALQVLGKLGGRNRRFLKEPLALECKENPEHGL 921
              +  V +E++ ALW  LR         A +VLGK GG NR+ LKE   L        G 
Sbjct: 876  DHIQPVRAELMQALWRTLRNPADSISHVAYRVLGKFGGSNRKMLKESQKLHYVVTEVQGP 935

Query: 922  RLILTFEPSTPFL-VPLDRCINLAVSTVMNKTGGVDSFYRKQALKFLRVCLSSQLNLPGN 981
             + + F      L +P+++ I  A+  +  K+   + +YR+QA + ++  L + ++L  N
Sbjct: 936  SITVEFSDCKASLQLPMEKAIETALDCL--KSANTEPYYRRQAWEVIKCFLVAMMSLEDN 995

Query: 982  VADDGHTPRQLSTLLVSPVDSSLRRSETPEGKADLGVKTKTQLMAEKSVFKILLMTIIAA 1041
                      L  LL  P       +E       +  + K Q    +  F+  L     +
Sbjct: 996  -------KHALYQLLAHP-----NFTEKTIPNVIISHRYKAQDTPARKTFEQALTGAFMS 1055

Query: 1042 GSEEDLHEPKDDFVLNVCRHFAILFHIDSSLNSSPVASASLGSTLLP---PNVSANSRLR 1101
               +DL      FV ++ RH+ ++             +   G  LLP        ++ + 
Sbjct: 1056 AVIKDLRPSALPFVASLIRHYTMV-----------AVAQQCGPFLLPCYQVGSQPSTAMF 1115

Query: 1102 SSACCNLKELDPLIFLDALVEVLADENRVHAKAALNALNLFSEILLFLARAKQTDVMMTR 1161
             S     K +DPL+ +DA+   +A E +   K    AL +  ++   +  +K+       
Sbjct: 1116 HSEENGSKGMDPLVLIDAIAICMAYEEKELCKIGEVALAVIFDVASIILGSKER------ 1175

Query: 1162 GPSTPMIVSSPSKSPVYSPPPSVRIPVFEQLLPRLLHCCYGSTWQAQMGGIMGLGALVGK 1221
                                 + ++P+F  ++ RL  CCY   W A++GG++ +  L+ +
Sbjct: 1176 ---------------------ACQLPLFSYIVERLCACCYEQAWYAKLGGVVSIKFLMER 1235

Query: 1222 VTVETLCLFQVRIVRGLVYVLKRLPIYASK-----EQEETSQVLNQVLRVVNNVDEANS- 1281
            + +  +   Q   ++ L++V+  L    S       +    Q+L +    + + + A   
Sbjct: 1236 LPLTWVLQNQQTFLKALLFVMMDLTGEVSNGAVAMAKTTLEQLLMRCATPLKDEERAEEI 1295

Query: 1282 -EPRRQSFHGVVDILASELFNPNSSTIVRKNVQSCLALLASRTGSEVSELLEPLHQPLLQ 1341
               + +SFH V   L  E+ +PNS+  VRK     L +LA  TG  V+ ++EP H+ +LQ
Sbjct: 1296 VAAQEKSFHHVTHDLVREVTSPNST--VRKQAMHSLQVLAQVTGKSVTVIMEP-HKEVLQ 1355

Query: 1342 ---PLLLRPLRLKTIDQQVGTVTALNFCLALRPPLLKLTQELVN---FLQEALQIAEADE 1401
               P     LR +  + Q+G +    FC  L+P L  +   +V    F  E L + EA++
Sbjct: 1356 DMVPPKKHLLRHQPANAQIGLMEGNTFCTTLQPRLFTMDLNVVEHKVFYTELLNLCEAED 1415

Query: 1402 TVWVVKFMNPKIATSLNKLRTACIELLCTTMAWADFKTPNH-SELRAKIISMFFKSLTCR 1461
            +  + K    K   SL  LR A +  L            N+  + R KII+  FK+L   
Sbjct: 1416 SA-LTKLPCYKSLPSLVPLRIAALNALAAC---------NYLPQSREKIIAALFKALNST 1475

Query: 1462 TPEVVAVAKEGLRQVINQQRMPKDLLQGSLRPILVNLAHTKNLSMPLLQGLARLLELLAS 1521
              E+    +  +R+ +    +  D +   +RP+L+ L   ++L++ ++  L  +  L  +
Sbjct: 1476 NSELQEAGEACMRKFLEGATIEVDQIHTHMRPLLMMLGDYRSLTLNVVNRLTSVTRLFPN 1535

Query: 1522 WFNVTLGGKLLEHLKKWLEPEKLAQSQKAWKAG------------------EEPKIAAAI 1581
             FN     ++++HL+KW+E   +         G                  EE KI +AI
Sbjct: 1536 SFNDKFCDQMMQHLRKWMEVVVITHKGGQRSDGNESISECGRCPLSPFCQFEEMKICSAI 1595

Query: 1582 IELFHLLPMAASKFLDELVTLTIDLEGALPPGQVYSEVNSPYRVPLIKFLNRYAPLAVDY 1641
            I LFHL+P A    +  L+ + +  E A+       E  SP+R PLIKFL R+    V+ 
Sbjct: 1596 INLFHLIPAAPQTLVKPLLEVVMKTERAM-----LIEAGSPFREPLIKFLTRHPSQTVEL 1655

Query: 1642 FL--ARLSEPKYFRRFMYIIRSDAGQPLREELAKSPQKILASAFPEFAPKSEAALTPGSS 1701
            F+  A L++P++ R FM  ++    +PLR+ LA +P + +    P  A   + A+ PGS 
Sbjct: 1656 FMMEATLNDPQWSRMFMSFLKHKDARPLRDVLAANPNRFITLLLPGGA---QTAVRPGSP 1715

Query: 1702 TSPAPLSGDEGLVTPDASDPPSAPSSVVSDAYFRGLALIKTLVKLMPGWLQNNRVVFDTL 1761
            ++                      S++  D  F+ + +I  +VK    WL +   +   L
Sbjct: 1716 ST----------------------STMRLDLQFQAIKIISIIVKNDDSWLASQHSLVSQL 1775

Query: 1762 VLVWKSPARIARLHNEQELNLVQVKESKWLVKCFLNYLRHEKAEVNVLFDILSIFLFHTR 1821
              VW S       H ++ +     KE K L  C LNY +    ++ +LF +L  F     
Sbjct: 1776 RRVWVS-ENFQERHRKENMAATNWKEPKLLAYCLLNYCKRNYGDIELLFQLLRAFTGRFL 1835

Query: 1822 IDYTFLKEFYIIEVAEGYPPNMKKALLLHFLNLFQSKQLGHDHLVIVMQMLILPMLAHAF 1881
             + TFLKE+   E+ + Y    K+AL   F++ F     G +    V+Q ++ P   ++F
Sbjct: 1836 CNMTFLKEYMEEEIPKNYSIAQKRALFFRFVD-FNDPNFGDELKAKVLQHILNPAFLYSF 1895

Query: 1882 QNGQSWEVV---------DQAIIKTIVDKLLDPPEEVSAEYDEPLRIELLQLATLLLKYL 1941
            + G+  +++          ++I    + K+LDP  E  A+  + LRI LLQ ATLL+++ 
Sbjct: 1896 EKGEGEQLLGPPNPEGDNPESITSVFITKVLDP--EKQADMLDSLRIYLLQYATLLVEHA 1955

Query: 1942 QSDLVHHRK-------ELIKFGWNHLKRE---DSASKQWAFVNVCHFLEAYQAPEKIILQ 2001
               +  + K        L+ F W  L  +   D A K    + + H +  +   +KI+LQ
Sbjct: 1956 PHHIHDNNKNRNSKLRRLMTFAWPCLLSKACVDPACKYSGHLLLAHIIAKFAIHKKIVLQ 2015

Query: 2002 VFVALLRTCQPENKMLVKQALDILMPALPRRLPLGDSRMPIWIRYTKKILVEEGHSIPNL 2061
            VF +LL+    E + +V+QA+ IL PA+P R+  G   +  W   T+KI+VEEGH++P L
Sbjct: 2016 VFHSLLKAHAMEARAIVRQAMAILTPAVPARMEDGHQMLTHW---TRKIIVEEGHTVPQL 2075

Query: 2062 IHIFQLIVRHSDLFYSCRAQFVPQMVNSLSRLGLPYNTTAENRRLAIDLAGLVVGWERQR 2121
            +HI  LIV+H  ++Y  R   V  MV+++ RLG   + T E RRLA+DL+ +V+ WE QR
Sbjct: 2076 VHILHLIVQHFKVYYPVRHHLVQHMVSAMQRLGFTPSVTIEQRRLAVDLSEVVIKWELQR 2135

Query: 2122 QNEMKLVTESDAPSHSNDGLTCPPGADPKRMVDGSTFPEDSTKRVKVEPGLQSLCVMSPG 2181
              + +  ++ D P+ S +G+     +  + +   S       KR +   G  S      G
Sbjct: 2136 IKDQQPDSDMD-PNSSGEGVNSVSSSIKRGL---SVDSAQEVKRFRTATGAISAVF---G 2195

Query: 2182 GASSMPNIETPGSTTQPDEEFKPNAAMEEMIINFLIRVALVIEPKDKEA----TAMYKQA 2241
             + S+P  ++     +P ++        + ++NFLIRVA  +      A      + ++ 
Sbjct: 2196 RSQSLPGADS--LLAKPIDK-----QHTDTVVNFLIRVACQVNDNTNTAGSPGEVLSRRC 2255

Query: 2242 LELLSQAL--EVWPNANVKFNYLEKLLSSI-QPSQSKDPSTALAQGLDVMNKVLEKQPHL 2301
            + LL  AL  ++WP + +K  + +KLL ++ QP+Q    +  +  GL+V++ +L      
Sbjct: 2256 VNLLKTALRPDMWPKSELKLQWFDKLLMTVEQPNQVNYGN--ICTGLEVLSFLLTVLQSP 2315

Query: 2302 FVRNNINQISQILEPCF---KHKMLDAGKSLCSLLRMVFVAYPLEG--VTTPPDVKLLYQ 2361
             + ++   + + +  C      K+L A  SL S L  +F   P      +   +++ LY 
Sbjct: 2316 AILSSFKPLQRGIAACMTCGNTKVLRAVHSLLSRLMSIFPTEPSTSSVASKYEELECLYA 2375

Query: 2362 KVDELIKNHINNLTAPQTSSEDNTASSISFVLLVIKTLTEVQKNLIDPYNLGRILQRLAR 2421
             V ++I   + N       + +   S +   L+++K+      + ID     R++    R
Sbjct: 2376 AVGKVIYEGLTN----YEKATNANPSQLFGTLMILKSACSNNPSYID-----RLISVFMR 2435

Query: 2422 DMGSSAGSHLRQGQRMDPDSAVTSSRQSADVGTVISNLKSVLKLINERVMLVP-ECKRSV 2481
             +      HL      +P +A  S+  ++    ++      L+L+  R+ ++  E +++ 
Sbjct: 2436 SLQKMVREHL------NPQAASGSTEATSGTSELV---MLSLELVKTRLAVMSMEMRKNF 2495

Query: 2482 TQIMNSLLSEKGTDASVLLCILDVIKGWVEDDFSKMGTSVSSSSFLAPKEIVSFLQKLSQ 2541
             Q + + L EK  DA +L  ++ +++ WV+++ S M  + + +  L  K I+  L K+  
Sbjct: 2496 IQAILTSLIEKSPDAKILRAVVKIVEEWVKNN-SPM--AANQTPTLREKSIL--LVKMMT 2555

Query: 2542 VDKQNFSSSAAEEWDGKYLQLLYEICADSNKYPLSLRQEVFQKVERQFMLGLRARDPETR 2601
              ++ F      E + ++L L+  +  D     LS   E+  K+E  F+ GLR   P  R
Sbjct: 2556 YIEKRFPEDL--ELNAQFLDLVNYVYRDET---LS-GSELTAKLEPAFLSGLRCAQPLIR 2615

Query: 2602 KKFFTLYHESLGKTLFIRLQYIIQIQDWEALSDVFWLKQGLDLLLAVLVEDKPITLAPNS 2661
             KFF ++  S+ + ++ RL Y+   Q+WEA+ + FW+KQ ++LLLAV  +  PI  +   
Sbjct: 2616 AKFFEVFDNSMKRRVYERLLYVTCSQNWEAMGNHFWIKQCIELLLAVCEKSTPIGTSCQG 2675

Query: 2662 ARLPPLLVSGHVADSS-----------AVQPQVNDAQEGLED---------------APL 2721
            A LP +    ++ADS              +P+  +  E  E+                P 
Sbjct: 2676 AMLPSITNVINLADSHDRAAFAMVTHVKQEPRERENSESKEEDVEIDIELAPGDQTSTPK 2735

Query: 2722 T-----------FDSLVHKHAQFLNRTSKLQVADLIIPLRELAHTDANVAYHLWVLVFPI 2781
            T              L ++H +FL+   +++   L+    +L H    +A   WV +FP 
Sbjct: 2736 TKELSEKDIGNQLHMLTNRHDKFLDTLREVKTGALLSAFVQLCHISTTLAEKTWVQLFPR 2795

Query: 2782 VWVTLHKEEQVALAKPMISLLSKDYHKKQQASRPNVVQALLEGLQLSHPQPRMPSELIKY 2841
            +W  L   +Q ALA  +   L    H+ Q+  +P+ +   +E +    P   +   ++KY
Sbjct: 2796 LWKILSDRQQHALAGEISPFLCSGSHQVQRDCQPSALNCFVEAMSQCVPPIPIRPCVLKY 2855

Query: 2842 IGRTYNAWHIALALLESHVM---------------------LFMNETKCSESLAELYRLL 2901
            +G+T+N W  +  +LE                         +   + +  +SLAELY LL
Sbjct: 2856 LGKTHNLWFRSTLMLEHQAFEKGLSLQIKPKQTTEFYEQESITPPQQEILDSLAELYSLL 2915

Query: 2902 NEEDMRCGLWKRKAISAETKAGLSLVQHGYWQRAQILFYQSMVKATQGTYNNNVPKA--- 2961
             EEDM  GLW+++   +ET   ++  QHG++++AQ  + ++M KA +    +N   A   
Sbjct: 2916 QEEDMWAGLWQKRCKYSETATAIAYEQHGFFEQAQESYEKAMDKAKKEHERSNASPAIFP 2975

Query: 2962 EMCLWEEQWLYCASQLSQWEALVDFGKSIE--NYEILLDSLWKVPDWAYMKEHVIP---K 3021
            E  LWE+ W+ C+ +L+QWEAL ++G+S    N  ++L+  W+V +W  MKE ++     
Sbjct: 2976 EYQLWEDHWIRCSKELNQWEALTEYGQSKGHINPYLVLECAWRVSNWTAMKEALVQVEVS 3035

Query: 3022 AQVEETPKLRLIQAYFSLHDRSTNGVADAENIVGKGVDLALEQWWQLPEMSVHARIPLLQ 3081
               E   K+ + + Y ++       ++  E +V     LA+ +W +LP +  H   PLLQ
Sbjct: 3036 CPKEMAWKVNMYRGYLAICHPEEQQLSFIERLVEMASSLAIREWRRLPHVVSHVHTPLLQ 3095

Query: 3082 QFQQLVEVQESSRILVDIANGNKHSGSSVVSVHTNLYADLKDILETWRLRIPNEWDSMTV 3141
              QQ++E+QE+++I   +   N    +S+         D+K +++TWR R+P   D ++ 
Sbjct: 3096 AAQQIIELQEAAQINAGLQPTNLGRNNSL--------HDMKTVVKTWRNRLPIVSDDLSH 3155

Query: 3142 WCDLLQWRNEMY-----------NAVIDAFKDFGTTNSQLHH--LGFRDKAWNVNKLAHV 3201
            W  +  WR   Y           ++++ A+++    +   ++  LG    A  + +   +
Sbjct: 3156 WSSIFMWRQHHYQGKPTWSGMHSSSIVTAYENSSQHDPSSNNAMLGVHASASAIIQYGKI 3215

Query: 3202 ARKQGLHDVCVGILEKMYGHSTMEVQEAFVKIREQAKAYLEM-----KGELTSGLNLINS 3261
            ARKQGL +V + IL +++   T+ + + F KIR+Q K YL++     K E   GL +I S
Sbjct: 3216 ARKQGLVNVALDILSRIHTIPTVPIVDCFQKIRQQVKCYLQLAGVMGKNECMQGLEVIES 3275

Query: 3262 TNLDYFPVKHKAEIFRLKGDFQLKLSDSEGANHSYSSAISLFKNLPKGWISWGNYCDMAY 3321
            TNL YF  +  AE + LKG F  +++ SE AN ++S+A+ +   L K W  WG+Y +  +
Sbjct: 3276 TNLKYFTKEMTAEFYALKGMFLAQINKSEEANKAFSAAVQMHDVLVKAWAMWGDYLENIF 3335

Query: 3322 KESHEEIWLEYAVSCFLQGIKF-GISNSRNHLARVLYLLSFDTPNEPVGRAFDKYLDQIP 3381
             +  +      A++C+L   +    S SR +LA+VL+LLSFD     +  A DKY   +P
Sbjct: 3336 VKERQLHLGVSAITCYLHACRHQNESKSRKYLAKVLWLLSFDDDKNTLADAVDKYCIGVP 3395

Query: 3382 HWVWLSWIPQLLLSLQRTEAPHCKLVLLKIANVYPQALYYWLRTYLLERRDVANKSELGR 3441
               WL+WIPQLL  L  +E      ++ ++  VYPQA+Y+ +RT  L             
Sbjct: 3396 PIQWLAWIPQLLTCLVGSEGKLLLNLISQVGRVYPQAVYFPIRTLYL------------- 3455

Query: 3442 MAMAQQRMQQNTSSAGSLGLTD---GSSRVAHGGSSTSTDNQVHQGTQSGSGIGSHDGGN 3501
              +  ++ ++  S  G +  T      SR+ H                            
Sbjct: 3456 -TLKIEQRERYKSDPGPIRATAPMWRCSRIMH---------------------------- 3515

Query: 3502 SHSQEPERSTGVESSTHAGNDQSLPQTSSNVNEGTQNALRRSAALGLVGSAASAFDAAKD 3561
                + E    + SS     DQ +        E     + R    GL    + AF+ +  
Sbjct: 3516 ---MQRELHPTLLSSLEGIVDQMV-----WFRENWHEEVLRQLQQGLAKCYSVAFEKSGA 3575

Query: 3562 IMEALRSKHTNLASELEILLTEIGSRFVTLPEERLLAVVNALLHRCYKYPTATTAEVPQS 3621
            + +A  + HT                FV    ++L++     L       T  ++   +S
Sbjct: 3576 VSDAKITPHT--------------LNFV----KKLVSTFGVGLENVSNVSTMFSSAASES 3635

Query: 3622 LKKELSGVCKACFSADAVNKHVDFVREYKQDFERDLDPESTSTFPATLSELTERLKHWKN 3681
            L +      +     D V       ++ K  F  D D     +    L  L  +LK W  
Sbjct: 3636 LARRAQATAQ-----DPV------FQKLKGQFTTDFDFSVPGSM--KLHNLISKLKKWIK 3695

Query: 3682 VLQGNVEDRFPAVLRLEDESRVLRDF--HVVDVEVPGQYFTDQEIAPDH-TVKLDRVGAD 3741
            +L+   + + P    +E++ R L +F     +VE+PG++   +   P H  +K+ R    
Sbjct: 3696 ILEAKTK-QLPKFFLIEEKCRFLSNFSAQTAEVEIPGEFLMPK---PTHYYIKIARFMPR 3755

Query: 3742 IPIVRRHGSSFRRLTLIGSDGS-QRHFIVQTSLTPNARSDERILQLFRVMNQMFDKHKES 3801
            + IV++H ++ RRL + G +G    + ++  +    +R +ER+LQL R++N   +K KE+
Sbjct: 3756 VEIVQKHNTAARRLYIRGHNGKIYPYLVMNDACLTESRREERVLQLLRLLNPCLEKRKET 3815

Query: 3802 RRRHLCIHTPIIIPVWSQVRMVEDDLMYSTFLEVYENHCARNDQEADLPITYFKEQLN-- 3861
             +RHL    P ++ V  Q+R+VED+    + +E+Y+  CA+   E D PI+ + ++L   
Sbjct: 3816 TKRHLFFTVPRVVAVSPQMRLVEDNPSSLSLVEIYKQRCAKKGIEHDNPISRYYDRLATV 3858

Query: 3862 QAISGQIAPEAVLDLRLQAYGDITRNLVNEGIFSQYMYKTLLSGNHMWAFKKQFAIQLAL 3888
            QA   Q + + + D+      ++  N+V   +  ++   T  +    W F+K F IQLAL
Sbjct: 3876 QARGTQASHQVLRDI----LKEVQSNMVPRSMLKEWALHTFPNATDYWTFRKMFTIQLAL 3858

BLAST of MS010599 vs. ExPASy Swiss-Prot
Match: P38811 (Transcription-associated protein 1 OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=TRA1 PE=1 SV=1)

HSP 1 Score: 1254.6 bits (3245), Expect = 0.0e+00
Identity = 1072/4098 (26.16%), Postives = 1880/4098 (45.88%), Query Frame = 0

Query: 1    MSPIQNFELHSRQLVEPELSIQTRLQMATEVRDSLEIAHTPEYLNFLKCYFRAFSIILVQ 60
            MS  +  E  + +  + + ++Q+R    +E+ D +E+ ++PE  +F   + +A   +L+ 
Sbjct: 1    MSLTEQIEQFASRFRDDDATLQSRYSTLSELYDIMELLNSPEDYHF---FLQAVIPLLLN 60

Query: 61   ITK--PQYTDNH--EHKLRNIVVEILNRLPHSEVLRPFVQDLLKVAMQVLTTDNEENGLI 120
              K  P   D H  E KLRN +++I NR   ++  +P+  ++L+  + VL  +NEENG++
Sbjct: 61   QLKEVPISYDAHSPEQKLRNSMLDIFNRCLMNQTFQPYAMEVLEFLLSVLPKENEENGIL 120

Query: 121  CIRIIFDLLRNFRPTLENEVQPFLDFVCKIYQNFKLTVSH-FFENSAAGGEDI------- 180
            C++++  L ++F+  L++++  F+  + +IY+N    ++  F+E   A   D+       
Sbjct: 121  CMKVLTTLFKSFKSILQDKLDSFIRIIIQIYKNTPNLINQTFYEAGKAEQGDLDSPKEPQ 180

Query: 181  --------------KPMDVSTSTDQTITTGYTGTVQLNPSTRSFKIVTESPLVVMFLFQL 240
                          K      S+ +      T +  L  S  SFKI++E P+ ++ L+  
Sbjct: 181  ADELLDEFSKNDEEKDFPSKQSSTEPRFENSTSSNGLRSSMFSFKILSECPITMVTLYSS 240

Query: 241  YSRLVQTNIPVLLPLMVSAISVPGPEKVPPFLKT-----HFIELKG-------------A 300
            Y +L  T++P   PL+++ +++   ++     +      HF  +               A
Sbjct: 241  YKQLTSTSLPEFTPLIMNLLNIQIKQQQEAREQAESRGEHFTSISTEIINRPAYCDFILA 300

Query: 301  QVKTVSFLTYL-LRSSA-DYIRPHEESICKSIVNLLVTC-SDSVSIRKELLVALKHVLGT 360
            Q+K  SFL Y+ +R  A ++++ +   +   I+ LL  C S+  S RKELL A +H+L T
Sbjct: 301  QIKATSFLAYVFIRGYAPEFLQDYVNFVPDLIIRLLQDCPSELSSARKELLHATRHILST 360

Query: 361  EYKRGLFPLIDTLLEEKVLVGTGRACYETLRPLAYSLLAEIVHHVRGDLTLSQLSRIIYL 420
             YK+   P +D L +E++L+G G   +ETLRPLAYS +A+ +H++R +L LS++ + I +
Sbjct: 361  NYKKLFLPKLDYLFDERILIGNGFTMHETLRPLAYSTVADFIHNIRSELQLSEIEKTIKI 420

Query: 421  FSSNMHDASLSLSIHTTCARLMLNLVEPIFEKGVDQ-TSMDEARILLGRILDAFV----- 480
            ++  + D SL+L++    A+L+LNLVE I + G +       A+ LL  I+D+++     
Sbjct: 421  YTGYLLDESLALTVQIMSAKLLLNLVERILKLGKENPQEAPRAKKLLMIIIDSYMNRFKT 480

Query: 481  ------------GKFSTFKHTIPQLLEEGEEGKDRAN---MRSKLE---------LPVQA 540
                        G++ T K    + L+   +  D+ +   MR  LE          P + 
Sbjct: 481  LNRQYDTIMKYYGRYETHKKEKAEKLKNSIQDNDKESEEFMRKVLEPSDDDHLMPQPKKE 540

Query: 541  VLNLQVPVEHSK---------EVNDCKHLIKTLILGMKTIIWSITHAHLPRPQASPSPNG 600
             +N    VE ++         E+ D K+    L+L   T        +L R   S     
Sbjct: 541  DINDSPDVEMTESDKVVKNDVEMFDIKNYAPILLLPTPTNDPIKDAFYLYRTLMSFLKTI 600

Query: 601  THPQMLVSPSSN---LATPQAFKGMRE----DEVCKASGVLKSGVHCLTLFKEKDE---- 660
             H   + +P  N   +A P+ +  +      +EV     +    +  L  FK+ +E    
Sbjct: 601  IHDLKVFNPPPNEYTVANPKLWASVSRVFSYEEVIVFKDLFHECIIGLKFFKDHNEKLSP 660

Query: 661  -------EVEMLHLFSQILTVMEPRDLMDMFSLC----------------MPELFDCMIT 720
                   ++ M  L   +    + R+LMD  +                  +P +++ M+ 
Sbjct: 661  ETTKKHFDISMPSL--PVSATKDARELMDYLAFMFMQMDNATFNEIIEQELPFVYERMLE 720

Query: 721  NTQLVHLFSTFLQTPKVYRPFADVLVNFLVSSKLDVLKHPDSPGAKLVLHLFRFVFGAVA 780
            ++ L+H+  +FL +      FA +L+ FL   KL  L + D   + +++ LF+  F +V 
Sbjct: 721  DSGLLHVAQSFLTSEITSPNFAGILLRFL-KGKLKDLGNVDFNTSNVLIRLFKLSFMSVN 780

Query: 781  KAPSDFERILQPHVTVIMEVCVRSATEVERPLGYMQLLRIMFRALAGCKFELLLRDLIPL 840
              P+  E +L PH+  ++   ++ +T  E PL Y  L+R +FR++ G +FE L R + P+
Sbjct: 781  LFPNINEVVLLPHLNDLILNSLKYSTTAEEPLVYFYLIRTLFRSIGGGRFENLYRSIKPI 840

Query: 841  LQPCLNMLLTMFDGPTGEDMRDLLLELCLTLPARLSSLLPHLPRLMKPLVLCLKGSDDLV 900
            LQ  L  L  M         R+L +ELC+T+P RLS L P+LP LMKPLV  L+   DLV
Sbjct: 841  LQVLLQSLNQMILTARLPHERELYVELCITVPVRLSVLAPYLPFLMKPLVFALQQYPDLV 900

Query: 901  SLGLRTLEFWVDSLNPDFLEPSMANVMSEVILALWSHLRPIPYPWGA--KALQVLGKLGG 960
            S GLRTLE  +D+L  ++ +P +  V+ +V  AL++ L+P P+        +++LGKLGG
Sbjct: 901  SQGLRTLELCIDNLTAEYFDPIIEPVIDDVSKALFNLLQPQPFNHAISHNVVRILGKLGG 960

Query: 961  RNRRFLKEPLALECKENPEHGLRLILTFE-PSTPFLVPLDRCINLAVSTVMNKTGGVDSF 1020
            RNR+FLK P  L   E  E  +  I  F+    P  VPL     +  +  + ++   D  
Sbjct: 961  RNRQFLKPPTDL--TEKTELDIDAIADFKINGMPEDVPLSVTPGIQSALNILQSYKSDIH 1020

Query: 1021 YRKQALKFLRVCLSSQLNLPGNVADDGHTPRQLSTLLVSPVDSSLRRSETPEGKADL-GV 1080
            YRK A K+L   L                P   + LL + V+S        E   DL   
Sbjct: 1021 YRKSAYKYLTCVLLLM------TKSSAEFPTNYTELLKTAVNSIKLERIGIEKNFDLEPT 1080

Query: 1081 KTKTQLMAEKSVFKILLMTIIAAGSEEDLHEPKDDFVLNVCRHFAILFHIDSSLNSSPVA 1140
              K     ++++F  LL ++  A S ++L +   D + N+  HF +L  ++++L +    
Sbjct: 1081 VNKRDYSNQENLFLRLLESVFYATSIKELKDDAMDLLNNLLDHFCLL-QVNTTLLNKRNY 1140

Query: 1141 SASLGSTLLPPNVSANSRLRSSACCNLKELDPLIFLDALVEVLADENRVHAKAALNALNL 1200
            + +    L  PN                 LD  + LDA+   L+                
Sbjct: 1141 NGTFNIDLKNPNFM---------------LDSSLILDAIPFALS---------------- 1200

Query: 1201 FSEILLFLARAKQTDVMMTRGPSTPMIVSSPSKSPVYSPPPSVRIPVFEQLLPRLLHCCY 1260
                  ++   ++  V+  +       +       +Y    ++      +L  + +H CY
Sbjct: 1201 -----YYIPEVREVGVLAYKRIYEKSCL-------IYGEELALSHSFIPELAKQFIHLCY 1260

Query: 1261 GSTWQAQMGGIMGLGALVGKVTVETLCL--FQVRIVRGLVYVLKRLPIYASKEQEETSQV 1320
              T+  + GG++G+  L+  V   ++ L  +Q  +  GL++VLK     A     ++++ 
Sbjct: 1261 DETYYNKRGGVLGIKVLIDNVKSSSVFLKKYQYNLANGLLFVLKDTQSEAPSAITDSAEK 1320

Query: 1321 LNQVLRVVNNVDEANSEPRRQSFHGVVDILASELFNPNSSTIVRKNVQSCLALLASRTGS 1380
            L   L  +   D    +   +     +  +  EL N N    VR   Q  L  +++ TG 
Sbjct: 1321 LLIDLLSITFADVKEEDLGNKVLENTLTDIVCELSNANPK--VRNACQKSLHTISNLTGI 1380

Query: 1381 EVSELLEPLHQPLLQPLLLRPLRLKTIDQQVGTVTALNFCLALRPPLLKLTQELVNFLQE 1440
             + +L++   Q LL P+  +PLR      Q+G V A+ FCL+L    L   +EL   LQE
Sbjct: 1381 PIVKLMDHSKQFLLSPIFAKPLRALPFTMQIGNVDAITFCLSLPNTFLTFNEELFRLLQE 1440

Query: 1441 ALQIAEADE---TVWVVKFMNPKIATSLNKLRTACIELLCTTMAWADFKTPNHSELRAKI 1500
            ++ +A+A++   +  + K      +  L +LR ACI+LL   +   +F T     +R +I
Sbjct: 1441 SIVLADAEDESLSTNIQKTTEYSTSEQLVQLRIACIKLLAIALKNEEFATAQQGNIRIRI 1500

Query: 1501 ISMFFKSLTCRTPEVVAVAKEGLR-QVINQQRMPKDLLQGSLRPILVNLAHTKNLSMPLL 1560
            +++FFK++   +PE++    E L+  +    ++PK+LLQ  L+P+L+NL+  + L++P L
Sbjct: 1501 LAVFFKTMLKTSPEIINTTYEALKGSLAENSKLPKELLQNGLKPLLMNLSDHQKLTVPGL 1560

Query: 1561 QGLARLLELLASWFNVTLGGKLLEHLKKWLEPEKLAQSQKAWKAGEEP-KIAAAIIELFH 1620
              L++LLELL ++F V +G KLL+HL  W   E L        A + P KI  +II +FH
Sbjct: 1561 DALSKLLELLIAYFKVEIGRKLLDHLTAWCRVEVLDTLFGQDLAEQMPTKIIVSIINIFH 1620

Query: 1621 LLPMAASKFLDELVTLTIDLEGALPPGQVYSEVNSPYRVPLIKFLNRYAPLAVDYFLARL 1680
            LLP  A  FL++L+   + LE  L       +++SP+R PL ++LNR+     +YF   +
Sbjct: 1621 LLPPQADMFLNDLLLKVMLLERKL-----RLQLDSPFRTPLARYLNRFHNPVTEYFKKNM 1680

Query: 1681 SEPKYFRRFMYIIRSDAGQPLREELAKSPQKILASAFPEFAPKSEAALTPGSSTSPAPLS 1740
            +     R+ +  + +   +P  +ELA+  +K L + +  +                    
Sbjct: 1681 T----LRQLVLFMCNIVQRPEAKELAEDFEKELDNFYDFY-------------------- 1740

Query: 1741 GDEGLVTPDASDPPSAPSSVVSDAYFRGLALIKTLVKLMPG--WLQNNRVVFDTLVLVWK 1800
                      S+ P     VVS  +F  +  +   + +  G  WL+             K
Sbjct: 1741 ---------ISNIPKNQVRVVS--FFTNMVDLFNTMVITNGDEWLK-------------K 1800

Query: 1801 SPARIARLHNEQELNLVQVKESKW----------LVKCFLNYLR----HEKAEVNVLFDI 1860
                I +L +   L L  +KE+ +          + K    YLR     E+ +  +L D 
Sbjct: 1801 KGNMILKLKDMLNLTLKTIKENSFYIDHLQLNQSIAKFQALYLRFTELSERDQNPLLLDF 1860

Query: 1861 LSIFLFHTRIDYTFLKEFYIIEVAEGYPPNMKKALLLHFLNLFQSKQLGHDHLVIVMQML 1920
            +  F F   I  ++  + +I           K+   ++   LF       D  + V++ +
Sbjct: 1861 ID-FSFSNGIKASYSLKKFIFHNIIASSNKEKQNNFINDATLFVLSDKCLDARIFVLKNV 1920

Query: 1921 ILPMLAHAFQNG---QSWEVVDQ--AIIKTIVDKLLDPPEEVSA----EYDEPLRIELLQ 1980
            I   L +        +S+ V D+    ++ + +K+      + A    ++ +  R ELLQ
Sbjct: 1921 INSTLIYEVATSGSLKSYLVEDKKPKWLELLHNKIWKNSNAILAYDVLDHHDLFRFELLQ 1980

Query: 1981 LATLLLKYLQSDLVHHRKELIKFGWNHLKREDSASKQWAFVNVCHFLEAYQAPEKIILQV 2040
            L+ + +K     +   +K++IKF WN +K ED+  KQ A++   +F+  +  P K++ QV
Sbjct: 1981 LSAIFIKADPEIIAEIKKDIIKFCWNFIKLEDTLIKQSAYLVTSYFISKFDFPIKVVTQV 2040

Query: 2041 FVALLRTCQPENKMLVKQALDILMPALPRRLPLGDSRMPIWIRYTKKILVEEGHSIPNLI 2100
            FVALLR+   E + LVKQ+LD+L P L  R+    +    WI + K+++VE   S  N+ 
Sbjct: 2041 FVALLRSSHVEARYLVKQSLDVLTPVLHERMNAAGT-PDTWINWVKRVMVENSSSQNNI- 2100

Query: 2101 HIFQLIVRHSDLFYSCRAQFVPQMVNSLSRLGLPYNTTAENRRLAIDLAGLVVGWERQRQ 2160
             ++Q ++ H DLF++ R  F+  +++ ++++    N+ +++  LAIDLA L++ WE +  
Sbjct: 2101 -LYQFLISHPDLFFNSRDLFISNIIHHMNKITFMSNSNSDSHTLAIDLASLILYWENK-- 2160

Query: 2161 NEMKLVTESDAPSHSNDGLTCPPGADPKRMVDGSTFPEDSTKRVK-VEPGLQSLCVMSPG 2220
              +++   ++  + S                DG     DS   +  VE    ++ V +  
Sbjct: 2161 -TLEITNVNNTKTDS----------------DGDVVMSDSKSDINPVEADTTAIIVDA-- 2220

Query: 2221 GASSMPNIETPGSTTQPDEEFKPNAAMEEMIINFLIRVALVIEPKDKEATAMYKQALELL 2280
                  N  +P S             + E    FLIR       +  E T +  +A+ +L
Sbjct: 2221 ------NNNSPIS-----------LHLREACTAFLIRYVCASNHRAIE-TELGLRAINIL 2280

Query: 2281 SQAL--EVWPNANVKFNYLEKLLSSIQPSQSKDPSTALAQGLDVMNKVLEKQPHLFVRNN 2340
            S+ +  + W N NVK  Y EK L   Q   S++        LDV+    + +   ++  N
Sbjct: 2281 SELISDKHWTNVNVKLVYFEKFL-IFQDLDSENILYYCMNALDVLYVFFKNKTKEWIMEN 2340

Query: 2341 INQISQILEPCFKHKMLDAGKSLCSLLRMVFVAYPLEGVTTPPDVKLLYQKVDELIKNHI 2400
            +  I  +LE C K    D  ++L  +L+++  A   +GV+       +  + +   K  I
Sbjct: 2341 LPTIQNLLEKCIKSDHHDVQEALQKVLQVIMKAIKAQGVS-------VIIEEESPGKTFI 2400

Query: 2401 NNLTAPQTSSEDNTASSISFVLLVIKTLTEVQKNLIDPYNLGRILQRLARDMGSSAGSHL 2460
              LT+  T     T+S  + V L          N++       +L  L +        HL
Sbjct: 2401 QMLTSVITQDLQETSSVTAGVTLAWVLFMNFPDNIVP------LLTPLMKTFSKLCKDHL 2460

Query: 2461 RQGQRMDPDSAVTSSRQSADVGTVISNLKSVLKLINERVMLVPECKRSVTQIMNSLLSEK 2520
               Q  D       + + A + T +  L+ VL +++ +V L+ + +R     + +LL + 
Sbjct: 2461 SISQPKD-----AMALEEARITTKL--LEKVLYILSLKVSLLGDSRRPFLSTV-ALLIDH 2520

Query: 2521 GTDASVLLCILDVIKGWVEDDFSKMGTSVSSSSFLAPKEIVSFLQKLSQVDKQNFSSSAA 2580
              D + L  I+++ + W+           ++  F   KE  + L K+   + +   S + 
Sbjct: 2521 SMDQNFLRKIVNMSRSWI----------FNTEIFPTVKEKAAILTKMLAFEIRGEPSLS- 2580

Query: 2581 EEWDGKYLQLLYEICADSNKYPLSLRQEVFQKVERQFMLGLRARDPETRKKFFTLYHESL 2640
                    +L YEI             E+  ++E+ F++G R  D   RK+F T+   SL
Sbjct: 2581 --------KLFYEIVLKLFDQEHFNNTEITVRMEQPFLVGTRVEDIGIRKRFMTILDNSL 2640

Query: 2641 GKTLFIRLQYIIQIQDWEALSDVFWLKQGLDLLLAVLVEDKPITLAPNSARLPPLLVSGH 2700
             + +  RL Y+I+ Q+WE ++D  WL Q L LL      +K ++L       PP ++  +
Sbjct: 2641 ERDIKERLYYVIRDQNWEFIADYPWLNQALQLLYGSFNREKELSLKNIYCLSPPSILQEY 2700

Query: 2701 VADSSAVQPQVNDAQEGLEDAPLTFDSLVHKHAQFLNRTSKLQVADLIIPLRELAHTDAN 2760
            + +++ +  +VND         L   + V  H   +    ++  +D I  L E+ + D  
Sbjct: 2701 LPENAEMVTEVND---------LELSNFVKGHIASMQGLCRIISSDFIDSLIEIFYQDPK 2760

Query: 2761 VAYHLWVLVFPIVWVTLHKEEQVALAKPMISLLSKDYHKKQQASRPNVVQALLEGLQLSH 2820
              +  WV +FP V+ ++ K E+    + +I+LLSK YH +Q +SR NV+  LL+ +    
Sbjct: 2761 AIHRAWVTLFPQVYKSIPKNEKYGFVRSIITLLSKPYHTRQISSRTNVINMLLDSIS-KI 2820

Query: 2821 PQPRMPSELIKYIGRTYNAWHIALALLES-HVMLFMNETKCSE----SLAELYRLLNEED 2880
                +P  L+KY+  +YNAW+ ++ +LES      ++ TK  E    +L ELY  L EED
Sbjct: 2821 ESLELPPHLVKYLAISYNAWYQSINILESIQSNTSIDNTKIIEANEDALLELYVNLQEED 2880

Query: 2881 MRCGLWKRKAISAETKAGLSLVQHGYWQRAQILFYQSMVKATQGTYNNNVPKAEMCLWEE 2940
            M  GLW+R+A   ET  GLS  Q G W +AQ L+  + VKA  G    +  ++E  LWE+
Sbjct: 2881 MFYGLWRRRAKYTETNIGLSYEQIGLWDKAQQLYEVAQVKARSGALPYS--QSEYALWED 2940

Query: 2941 QWLYCASQLSQWEALVDFGKSIENYEILLDSLWKVPDWAYMK---EHVIPKAQVEETPKL 3000
             W+ CA +L  W+ L +  K     ++LL+  W+V DW   +   E  +       TP+ 
Sbjct: 2941 NWIQCAEKLQHWDVLTELAKHEGFTDLLLECGWRVADWNSDRDALEQSVKSVMDVPTPRR 3000

Query: 3001 RLIQAYFSLHD--RSTNGVADAENIVGKGVDLALEQWWQLPEMSVHARIPLLQQFQQLVE 3060
            ++ + + +L +   S  G  +   +  +G+ L+L +W  LP     A   LL  FQQ +E
Sbjct: 3001 QMFKTFLALQNFAESRKGDQEVRKLCDEGIQLSLIKWVSLPIRYTPAHKWLLHGFQQYME 3060

Query: 3061 VQESSRILVDIANGNKHSGSSVVSVHTNLYADLKDILETWRLRIPNEWDSMTVWCDLLQW 3120
              E+++I  ++        ++ V    +   ++K IL+ WR R+PN WD + +W DL+ W
Sbjct: 3061 FLEATQIYANL-------HTTTVQNLDSKAQEIKRILQAWRDRLPNTWDDVNMWNDLVTW 3120

Query: 3121 RNEMYNAVIDAFKDF------GTTNSQLH---HLGFRDKAWNVNKLAHVARKQGLHDVCV 3180
            R   +  + +A+           +NS ++   + G+ + AW +N+ AHVARK  + DVC+
Sbjct: 3121 RQHAFQVINNAYLPLIPALQQSNSNSNINTHAYRGYHEIAWVINRFAHVARKHNMPDVCI 3180

Query: 3181 GILEKMYGHSTMEVQEAFVKIREQAKAYLEMKGELTSGLNLINSTNLDYFPVKHKAEIFR 3240
              L ++Y    +E+QEAF+K+REQAK + +   ELT+GL++I++TNL YF    KAE F 
Sbjct: 3181 SQLARIYTLPNIEIQEAFLKLREQAKCHYQNMNELTTGLDVISNTNLVYFGTVQKAEFFT 3240

Query: 3241 LKGDFQLKLSDSEGANHSYSSAISLFKNLPKGWISWGNYCDMAYKESHEEI-WLEYAVSC 3300
            LKG F  KL   E AN ++++A+ +  NL K W  WG + D    E    I +   A+SC
Sbjct: 3241 LKGMFLSKLRAYEEANQAFATAVQIDLNLAKAWAQWGFFNDRRLSEEPNNISFASNAISC 3300

Query: 3301 FLQGI-KFGISNSRNHLARVLYLLSFDTPNEPVGRAFDKYLDQIPHWVWLSWIPQLLLSL 3360
            +LQ    +  S  R  L R+L+L+S D  +  +  AFD +  +IP W W+++IPQLL SL
Sbjct: 3301 YLQAAGLYKNSKIRELLCRILWLISIDDASGMLTNAFDSFRGEIPVWYWITFIPQLLTSL 3360

Query: 3361 QRTEAPHCKLVLLKIANVYPQALYYWLRTYLLERRDVANKSELGRMAMAQQRMQQNTSSA 3420
               EA   + +L++IA  YPQAL++ LRT    + D A    + R  MA           
Sbjct: 3361 SHKEANMVRHILIRIAKSYPQALHFQLRT---TKEDFA---VIQRQTMAVM--------- 3420

Query: 3421 GSLGLTDGSSRVAHGGSSTSTDNQVHQGTQSGSGIGSHDGGNSHSQEPERSTGVESSTHA 3480
                                                                G +  T+ 
Sbjct: 3421 ----------------------------------------------------GDKPDTND 3480

Query: 3481 GNDQSLPQTSSNVNEGTQNALRRSAALGLVGSAASAFDAAKDIMEALRSKHTNLASELEI 3540
             N +  P                             ++  +++   L++ +  LA  LE 
Sbjct: 3481 RNGRRQP-----------------------------WEYLQELNNILKTAYPLLALSLES 3540

Query: 3541 LLTEIGSRFVTLPEERLLAVVNALL------HRCYKYPTATTAEVPQSLKKELSGVCKAC 3600
            L+ +I  RF +  +E L  ++N LL      +    +P     ++P++ +K L       
Sbjct: 3541 LVAQINDRFKSTTDEDLFRLINVLLIDGTLNYNRLPFP-RKNPKLPENTEKNLVKFSTTL 3600

Query: 3601 FSADAVNK-HVDFVREYKQDFERDLDPESTSTFPATLSELTERLKHWKNVLQGNVEDRFP 3660
             +     K + DF+ + K D+E  +                +RL++W+  L+ N  DR  
Sbjct: 3601 LAPYIRPKFNADFI-DNKPDYETYI----------------KRLRYWRRRLE-NKLDRAS 3660

Query: 3661 AVLRLEDESRVLRDFH---VVDVEVPGQYFTDQEIAPDHTVKLDRVGADIPIVRRHGSSF 3720
                LE     L +FH     D+E+PGQY  +++    H +K+ R    +  VR   SS+
Sbjct: 3661 KKENLEVLCPHLSNFHHQKFEDIEIPGQYLLNKD-NNVHFIKIARFLPTVDFVRGTHSSY 3720

Query: 3721 RRLTLIGSDGSQRHFIVQTSLTPNARSDERILQLFRVMNQMFDKHKESRRRHLCIHTPII 3780
            RRL + G DGS   F VQ     ++R +ER+ QL+R+ N+   K+ E+RRR +  + PI 
Sbjct: 3721 RRLMIRGHDGSVHSFAVQYPAVRHSRREERMFQLYRLFNKSLSKNVETRRRSIQFNLPIA 3744

Query: 3781 IPVWSQVRMVEDDLMYSTFLEVYENHCARNDQEADLPITYFKEQLNQAISGQIAPEAVLD 3840
            IP+  QVR++ D + ++T  E++   C +   + D    +  ++LN A    +    +  
Sbjct: 3781 IPLSPQVRIMNDSVSFTTLHEIHNEFCKKKGFDPDDIQDFMADKLNAAHDDALPAPDMTI 3744

Query: 3841 LRLQAYGDITRNLVNEGIFSQYMYKTLLSGNHMWAFKKQFAIQLALSSFMSYMLQIGGRS 3889
            L+++ +  I    V   +   +           W F+KQFA Q +   FMSYM+ I  R+
Sbjct: 3841 LKVEIFNSIQTMFVPSNVLKDHFTSLFTQFEDFWLFRKQFASQYSSFVFMSYMMMINNRT 3744

BLAST of MS010599 vs. ExPASy Swiss-Prot
Match: Q8I8U7 (Transcription-associated protein 1 OS=Drosophila melanogaster OX=7227 GN=Nipped-A PE=1 SV=4)

HSP 1 Score: 1230.3 bits (3182), Expect = 0.0e+00
Identity = 1065/4072 (26.15%), Postives = 1886/4072 (46.32%), Query Frame = 0

Query: 3    PIQNFELHSRQLVEPELSIQTRLQMATEVRDSLE-IAHTPEYLNFLKCYFRAFSIILVQI 62
            P+  F  +   L +     + +L+   E+ +  E I  +P Y +FL    + F  IL Q 
Sbjct: 8    PVNTFRNYLNILNDSSSKDELKLKATQELSEHFEMIMQSPAYPSFLDNSLKIFMRIL-QD 67

Query: 63   TKPQY-TDNHEHKLRNIVVEILNRLPHSEVLRPFVQDLLKVAMQVLTTDNEENGLICIRI 122
             +PQ+  +N    +R +++E+++RLP +E LR  V+ ++ + +++L TDNEEN L+C+RI
Sbjct: 68   GEPQFIQENTMQHIRKLILEMIHRLPITESLRQHVKTIITMMLKILKTDNEENVLVCLRI 127

Query: 123  IFDLLRNFRPTLENEVQPFLDFVCKIYQNFKLTVSHFFENS-AAGGEDIKP--MDVSTST 182
            I +L ++FRP+  +E+Q FL FV +IY N    ++  FE S      D+K   ++V  S 
Sbjct: 128  IIELHKHFRPSFNSEIQLFLGFVKEIYTNLPNHLTSIFETSNDVWVTDLKDLNLEVLLSE 187

Query: 183  DQTITTGYTGTVQLNPSTR-----------SFKIVTESPLVVMFLFQLYSRLVQTNIPVL 242
              ++ T +      + S +           S K++ E P++V+ ++Q+Y   V   +   
Sbjct: 188  SYSVRTIHVEKALDSNSQQQIYNLLPRGILSLKVLQELPIIVVLMYQIYKNAVHQEVSEF 247

Query: 243  LPLMVSAISV-PGPEKVPPFLKTHFIELKGAQVKTVSFLTYLLRSSADYIRPHEESICKS 302
            +PL+++ I++ P   +     K  ++E  GAQ+KT+SFL Y++R   + +     S+   
Sbjct: 248  IPLILTTINLQPTVTRRNSPQKEIYVEFMGAQIKTLSFLAYIVRIFQEVVIASSLSVTSG 307

Query: 303  IVNLLVTC-SDSVSIRKELLVALKHVLGTEYKRGLFPLIDTLLEEKVLVGTGRACYETLR 362
            ++NL+  C  ++  +RKELL+A +H+  T+ ++   P I+ L +E +L+G G    +++R
Sbjct: 308  MLNLMKNCPKEAAHLRKELLIAARHIFATDLRQKFIPSIEQLFDEDLLIGKG-VTLDSIR 367

Query: 363  PLAYSLLAEIVHHVRGDLTLSQLSRIIYLFSSNMHDASLSLSIHTTCARLMLNLVEPIFE 422
            PLAYS LA++ HHVR  L +  L + + LFS N+HD SL++ I T   +L+LNLV+ +  
Sbjct: 368  PLAYSTLADLAHHVRQSLNIDVLIKAVNLFSKNVHDESLAVGIQTMSCKLLLNLVDCL-- 427

Query: 423  KGVDQTSMDEARILLGRILDAFVGKFSTF-KHTIPQLLEEGE---------EGKDRANMR 482
            +   +T    ++ LL ++L  FV KF T  K  +P ++++ +              A++ 
Sbjct: 428  RHHSETEPQRSKALLSKLLKVFVKKFETIAKIQLPLIIQKCKGHAFSGALVNSSGNASL- 487

Query: 483  SKLELP--VQAVLNLQVPVE-----HSKEVNDCKHLIKTLILGMKTIIWSITHAHLPRPQ 542
            S +  P     + N+QV        +S  V + + L+KTL+ G+KTI W   ++     Q
Sbjct: 488  SHINAPDLKDDISNIQVSASGSQWIYSVNVAEFRSLVKTLVGGVKTITWGFFNSKF---Q 547

Query: 543  ASPSPNGTHPQMLVSPSSNLATPQAFKGMREDEVCKASGVLKSGVHCLTLFKEKDEEVEM 602
             + +    H ++             +  M   ++   + V  +      L     EE E+
Sbjct: 548  LTDTKLANHEKIFGPEIVCSYIDLVYYAMEALDIYTIN-VNPNQQRTSGLISRSKEEKEV 607

Query: 603  LHLFSQILTVMEPRDLMDMFSLCMPELFDCMITNTQLVHLFSTFLQTPKVYRPFADVLVN 662
            L  FS I  +M  ++  ++FS  +  L + +  N  L  + ++FL  P     FA VLV 
Sbjct: 608  LEHFSGIFLMMHSQNFQEIFSTTINFLVERIYKNQSLQVIANSFLANPTTSPLFATVLVE 667

Query: 663  FLVSSKLDVLKHPDSPGAKLVLHLFRFVFGAVAKAPSDFERILQPHVTVIMEVCVRSATE 722
            +L++   ++  + +   + L L LF+ VFG+V+  P + E++L+PH+  I+   +  A  
Sbjct: 668  YLLNKMEEMGSNLER--SNLYLRLFKLVFGSVSLFPVENEQMLRPHLHKIVNRSMELALI 727

Query: 723  VERPLGYMQLLRIMFRALAGCKFELLLRDLIPLLQPCLNMLLTMFDGPTGEDMRDLLLEL 782
             E P  Y  LLR +FR++ G   +LL ++ +PLL   L  L  +  G   + MRDL +EL
Sbjct: 728  SEEPYNYFLLLRALFRSIGGGSHDLLYQEFLPLLPNLLEGLNRLQSGFHKQHMRDLFVEL 787

Query: 783  CLTLPARLSSLLPHLPRLMKPLVLCLKGSDDLVSLGLRTLEFWVDSLNPDFLEPSMANVM 842
            CLT+P RLSSLLP+LP LM PLV  L GS  L+S GLRTLE  VD+L PDFL   +  V 
Sbjct: 788  CLTVPVRLSSLLPYLPMLMDPLVSALNGSPTLISQGLRTLELCVDNLQPDFLYDHIQPVR 847

Query: 843  SEVILALWSHLRPIPYPWGAKALQVLGKLGGRNRRFLKEPLALECKENPEHGLRLILTF- 902
            + ++ ALW  LR         A +VLGK GG NR+ + EP AL    N +  + ++  F 
Sbjct: 848  AALMQALWKTLRNQDNA-ALVAFRVLGKFGGGNRKMMVEPQALSYIINDKPTISIVTYFQ 907

Query: 903  EPSTPFLVPLDRCINLAVSTVMNKTGGVDSFYRKQALKFLRVCLSSQLNLPGNVADDGHT 962
            E  TP   P+D  I  A   +   +   D FYR+Q+ + +R  L++ ++L     D+ H 
Sbjct: 908  EYETPIDFPVDEAIKSAFRAL--GSNSTDQFYRRQSWEVIRCFLAAFISLD----DEKHM 967

Query: 963  PRQLSTLLVSPVDSSLRRSETPEGKADLGVKTKTQLMAEKSVFKILLMTIIAAGSEEDLH 1022
              +L T  V  V++ +    T + KA      +T   A        L+ ++ A + +DL 
Sbjct: 968  LLKLFT-HVDFVENKIMNWSTFQHKAGNETVRETHQTA--------LIGMLVASATKDLR 1027

Query: 1023 EPKDDFVLNVCRHFAILFHIDSSLNSSPVASASLGSTLLPPNVSANSRLRSSACCNLKEL 1082
            +     +  V RH+ +   +  +  + P       +T                      +
Sbjct: 1028 DSVCPVMAAVVRHYTM---VAIAQQAGPFPQKGYQAT--------------------HGI 1087

Query: 1083 DPLIFLDALVEVLADENRVHAKAALNALNLFSEILLFLARAKQTDVMMTRGPSTPMIVSS 1142
            DP+I +DAL   +  E +   K  +  + +  +          T++M  +          
Sbjct: 1088 DPMILIDALASCMGHEEKELCKPGIACMGIILD--------TATNIMGNK---------- 1147

Query: 1143 PSKSPVYSPPPSVRIPVFEQLLPRLLHCCYGSTWQAQMGGIMGLGALVGKVTVETLCLFQ 1202
                       + ++P+ + L  +++  CY   W +++GG   +  L   +++  L    
Sbjct: 1148 ---------DRACKLPIIQYLAEKMVSLCYDRPWYSKVGGCQAIQFLCKHMSLRALFQNL 1207

Query: 1203 VRIVRGLVYVLKRLPIYASKEQ-EETSQVLNQVLRV--------VNNVDEANSEPRRQSF 1262
               ++  ++VL  L    S    E T   +  +L +          N+D  + + +  + 
Sbjct: 1208 FNFLKAFMFVLMDLEGDVSNGAIEITKSYMKSMLEICLTPINECYKNIDLKDLQAK--AT 1267

Query: 1263 HGVVDILASELFNPNSSTIVRKNVQSCLALLASRTGSEVSELLEPLHQPLLQPLL---LR 1322
            + V+  L   + +PN  TIVR+     L  + +     VSE+++P H+ +L  ++     
Sbjct: 1268 YEVIHELVRHITSPN--TIVREESMVLLKHIGTIQSKTVSEVMDP-HKDVLADIIPPKKH 1327

Query: 1323 PLRLKTIDQQVGTVTALNFCLALRPPLLKLTQELVN-----FLQEALQIAEADETVWVVK 1382
             LR +  + Q+G +    FC  L P L   T +L N     F  E L ++EA++   + K
Sbjct: 1328 LLRHQPANAQIGLMDGNTFCTTLEPRL--FTIDLTNTYHKLFFHELLTLSEAEDAT-LAK 1387

Query: 1383 FMNPKIATSLNKLRTACIELLCTTMAWADFKTPNHSELRAKIISMFFKSLTCRTPEVVAV 1442
                K   +L  LRT+ +  L      +D         + KII++ FK +     E+   
Sbjct: 1388 LDCYKNVPNLIPLRTSALRALAACHYISDI------GYKEKIINIIFKVMESDKSELQTT 1447

Query: 1443 AKEGLRQVINQQRMPKDLLQGSLRPILVNLAHTKNLSMPLLQGLARLLELLASWFNVTLG 1502
            A   ++  I    + K+ +Q ++RP+L+ L   +NLS+P ++ L+   ++    FN  L 
Sbjct: 1448 AFHCMKHFITGVTLEKEKVQSAMRPLLLKLGDHRNLSIPAIKRLSYFTQIFPQMFNEKLS 1507

Query: 1503 GKLLEHLKKWLE---PEKLAQSQK-----AWKAGEEPKIAAAIIELFHLLPMAASKFLDE 1562
             ++L+H  K +E    E  + S       + K GE  +    +IE+F  +  A+ K++++
Sbjct: 1508 EQILQHCSKIMEIFVSEYKSTSPNVNFFASSKGGEYEQKIVILIEMFFYI-SASVKYIEK 1567

Query: 1563 LVTLTIDLEGALPPGQVYSEVNSPYRVPLIKFLNRYAPLAVDYFLAR--LSEPKYFRRFM 1622
            L  L +  E  L       E +SPYR  LIKFL R+    VD FL    + +P++ R F+
Sbjct: 1568 LCQLVLKTEKNL-----MIEASSPYREALIKFLQRFPTETVDLFLTESLMIDPQWNRLFI 1627

Query: 1623 YIIRSDAGQPLREELAKSPQKIL---ASAFPEFAPKSEAALTPGSSTSPAPLSGDEGLVT 1682
            Y+++ + G   R  +  S    L    +   EF                           
Sbjct: 1628 YLLKHETGVSFRAVIKSSRYNNLIHYLNTHTEF--------------------------- 1687

Query: 1683 PDASDPPSAPSSVVSDAYFRGLALIKTLVKLMPGWLQNNRVVFDTLVLVWKSPARIARLH 1742
                     P ++  +   + + +I TL++    W+   + + D L   W++   ++ L 
Sbjct: 1688 ---------PEALKYEIQHQAVLIIFTLMESDDQWIPTRQDIVDALKNCWQN--YLSTLS 1747

Query: 1743 NEQELNLVQVKESKW--LVKCFLNYLRHEKAEVNVLFDILSIFLFHTRIDYTFLKEFYII 1802
            +E  L         W  + K  L+Y  +   ++ +LF +L    F    D  FL++F   
Sbjct: 1748 SEDVLC------DLWHLIGKILLHYFSNNTNDIELLFQLLRALCFRFIPDVYFLRDFLQH 1807

Query: 1803 EVAEGYPPNMKKALLLHFLNLFQSKQLGHDHLVIVMQMLILPMLAHAFQNGQSWEVV--- 1862
             VA+ +  N K+    +F+  F +  L  +    ++  +I+P  A +F  G+  +++   
Sbjct: 1808 TVAQSFTVNWKRNAFFYFVENFNNSFLSEELKAKIITAVIIPCFAVSFDKGEGNKLIGAP 1867

Query: 1863 -------DQAIIKTIVDKLLDPPEEVSAEYDEPLRIELLQLATLLL----KYLQSDLVHH 1922
                   ++ I+   ++K+ DP +    +YD+ +RI LLQLA LL+    +++     ++
Sbjct: 1868 PTPYQEDEKNIVSVFINKVFDPDK----QYDDAVRIALLQLACLLVERASQHIHDGDANN 1927

Query: 1923 RKE------LIKFGWNHLKRE---DSASKQWAFVNVCHFLEAYQAPEKIILQVFVALLRT 1982
            +++      L+ F W  L  +   D  ++    + + H +      +KI+LQVF +LL+ 
Sbjct: 1928 KRQGNKLRRLMTFAWPCLLSKSSVDPTARYHGHLLLSHIIARLAIHKKIVLQVFHSLLKG 1987

Query: 1983 CQPENKMLVKQALDILMPALPRRLPLGDSRMPIWIRYTKKILVEEGHSIPNLIHIFQLIV 2042
               E + +VKQALD+L PA+P R+  G++ +  W   TKKI+VEEGH++  L HI QLI+
Sbjct: 1988 HALEARSIVKQALDVLTPAMPLRMEDGNTMLTHW---TKKIIVEEGHAMQQLFHILQLII 2047

Query: 2043 RHSDLFYSCRAQFVPQMVNSLSRLGLPYNTTAENRRLAIDLAGLVVGWERQRQNEMKLVT 2102
            RH  +++  R Q V  ++N + RLG P   + E+++LA+DLA +++ WE  R        
Sbjct: 2048 RHYKVYFPVRHQLVQHLINYMQRLGFPPTASIEHKKLAVDLAEVIIKWELHR-------- 2107

Query: 2103 ESDAPSHSNDGLTCPPGADPKRMVDGSTFPEDSTKRV---KVEPGLQSLCVMSPGGASSM 2162
              D      DG             +     E S KR     VE   +S  ++        
Sbjct: 2108 IKDDRETKTDG------------TEEELIQESSVKRSGIDLVETRKKSFDIIRE------ 2167

Query: 2163 PNIETPGSTTQPDEEFKP-NAAMEEMIINFLIRVALVIE----PKDKEATAMYKQALELL 2222
              ++  GS T+PD+  +  + +  + ++NFLIR+A  +     P      ++ ++ + LL
Sbjct: 2168 TTVQGVGSHTKPDDILRSIDKSYCDTVLNFLIRLACQVNDPQAPILSPGESLSRRCVMLL 2227

Query: 2223 SQAL--EVWPNA-NVKFNYLEKLLSSIQ-PSQSKDPSTALAQGLDVMNKVLEKQPHLFVR 2282
              A+  E+WP   ++K N+L+K+L++++ P  + +        L  +  +L     + + 
Sbjct: 2228 KMAMRPEIWPQPFDIKLNWLDKVLATVETPHHNLNNICTGIDFLTFLTTILSPDQLVSI- 2287

Query: 2283 NNINQISQILEPCFKHKMLDAGKSLCSLLRMVFVAYPLEGVTTPPDVKLLYQKVDELIKN 2342
              I  + + L  C  H+     + +   L  +   +P +      D+ LLY  V ++I  
Sbjct: 2288 --IRPVQRGLSLCIIHQNTRIVRLMHMFLTRIMAIFPPDTQHKHEDLDLLYTAVSKMI-- 2347

Query: 2343 HINNLTAPQTSSEDNTASSISFVLLVIKTLTEVQKNLIDP--YNLGRILQRLARDMGSSA 2402
               NLT+ + S + N ASS+   L+++K  T    + ID       R+L  L RD  ++ 
Sbjct: 2348 -AENLTSYEKSPQPN-ASSLFGTLMILKACTTNNASYIDRILVQFIRVLNHLTRDHINTI 2407

Query: 2403 GSHLRQGQRMDPDSAVTSSRQSADVGTV-ISNLKSVLKLINER--VMLVPECKRSVTQIM 2462
            G +             T   QS D   + +  L   L+LI  R  VM V   K  +  I+
Sbjct: 2408 GGN-------------TVISQSPDSNALPLELLVLSLELIKNRIFVMSVEIRKLFIGTIL 2467

Query: 2463 NSLLSEKGTDASVLLCILDVIKGWVEDDFSKMGTSVSSSSFLAPKEIVSFLQKLSQVDKQ 2522
             SL+ EK T+  ++ CI+ ++  W++     + T V S      +E  + L KL Q  ++
Sbjct: 2468 VSLI-EKSTEVKIIKCIIKMLDEWIKTKEPNVMTQVPSI-----REKSALLVKLMQNVEK 2527

Query: 2523 NFSSSAAEEWDGKYLQLLYEICADSNKYPLSLRQEVFQKVERQFMLGLRARDPETRKKFF 2582
             F+     E + ++L+++  I  D     +  + E+  K+E  F+ GLR ++P  R KFF
Sbjct: 2528 KFTDEI--ELNIQFLEIINFIYRDE----ILKQTELTNKLEGAFLNGLRFQNPNVRSKFF 2587

Query: 2583 TLYHESLGKTLFIRLQYIIQIQDWEALSDVFWLKQGLDLLLAVLVEDKPITLAPNSARLP 2642
             +   S+ + L  RL YII  Q W+ +   +W+KQ ++LL+        I  +    ++P
Sbjct: 2588 EILDSSMRRRLHDRLLYIICSQAWDTIGSHYWIKQCIELLILTANTMMQIQCSNEQFKIP 2647

Query: 2643 PL------------------LVSGHVADSSAVQ---------------PQVNDAQEGLED 2702
             +                   +S H      +Q                +  D Q+ L +
Sbjct: 2648 SITSVIPVNSSETQENSFVSFLSSHSESFDIIQTVDDKDDVYDIDLNADRKEDCQQILPN 2707

Query: 2703 APLTFDSLVHKHAQFLNRTSKLQVADLIIPLRELAHTDANVAYHLWVLVFPIVWVTLHKE 2762
              +T   LV+K A+FL     ++   +++   +L H D  +A  +W+ +FP +W    ++
Sbjct: 2708 RRVTLVELVYKQAEFLEANRNIRTDQMLVATSQLCHIDTQLAQSVWLSMFPRIWSIFTED 2767

Query: 2763 EQVALAKPMISLLSKDYHKKQQASRPNVVQALLEGLQLSHPQPRMPSELIKYIGRTYNAW 2822
            ++  + K +I  LS   +  Q+   P+ +   +E L    P   +P  L+ Y+G+++N W
Sbjct: 2768 QRCNITKELIPFLSSGTNVNQKDCHPSTLNTFVESLTKCAPPIYIPPNLLAYLGKSHNLW 2827

Query: 2823 HIALALLESHVMLFMNETK-------------------CSESLAELYRLLNEEDMRCGLW 2882
            H A+ +LE   +    ++K                     +SL+++Y  ++EED+  GLW
Sbjct: 2828 HRAILVLEDMAVNQSMQSKDIDGGENQFSDLDVQQSNNIFDSLSKMYSSMHEEDLWAGLW 2887

Query: 2883 KRKAISAETKAGLSLVQHGYWQRAQILFYQSMVKATQGTYN---NNVPKAEMCLWEEQWL 2942
             + A   ET   +S  Q G+++ AQ  +  +M K  Q   N   N    +E+ LWE  W+
Sbjct: 2888 LKFAHYPETNIAVSYEQMGFFEEAQGAYDLAMTKFKQDLSNGVVNTYVNSELLLWENHWM 2947

Query: 2943 YCASQLSQWEALVDFGKS--IENYEILLDSLWKVPDWAYMKEHVIPKAQVEETP------ 3002
             CA +L+QW+ L+D+ ++   +N  ++L+S W+VPDW  MK   I  A+ E+        
Sbjct: 2948 RCAKELNQWDILLDYAQTNKDKNMFLILESSWRVPDWNLMK---IALAKTEQCYLKHYGF 3007

Query: 3003 KLRLIQAYFSLHDRSTNGVADAENIVGKGVDLALEQWWQLPEMSVHARIPLLQQFQQLVE 3062
            K+ L + Y S+  +      + E  V     L + +W +LP +  H  +P LQ  QQ++E
Sbjct: 3008 KINLYKGYLSILHQEERQTGNIERYVEIASSLCIREWRRLPNIVSHIHLPYLQASQQIME 3067

Query: 3063 VQESSRILVDIANGNKHSGSSVVSVHTNLYADLKDILETWRLRIPNEWDSMTVWCDLLQW 3122
            + E+S+I         H G  +     N   D+K I++TWR R+P   D ++ W D+  W
Sbjct: 3068 LHEASQI---------HQG--LAQSRNNSLHDMKAIVKTWRNRLPIISDDLSHWSDIFTW 3127

Query: 3123 RNEMYNAVIDAFKDFGTTNSQLHHLGFRDKAWNVNKLAHVARKQGLHDVCVGILEKMYGH 3182
            R   Y  +    +      S +  LG    A  +     +ARK  L  VC   L ++Y  
Sbjct: 3128 RQHHYQIITQHLEQQSDQGSTM--LGVHASAQAIISFGKIARKHNLTGVCQETLSRIYTI 3187

Query: 3183 STMEVQEAFVKIREQAKAYLEM-----KGELTSGLNLINSTNLDYFPVKHKAEIFRLKGD 3242
             ++ + + F KIR+Q K YL+M     K E+   L +I STNL YF  +  AE + LKG 
Sbjct: 3188 PSVPIVDCFQKIRQQVKCYLQMPSTSGKNEINEALEVIESTNLKYFTGEMNAEFYALKGL 3247

Query: 3243 FQLKLSDSEGANHSYSSAISLFKNLPKGWISWGNYCDMAYKESHEEIWLEYAVSCFLQGI 3302
               ++  SE A  S+S A  L   L K W  WG+Y +  + +  +      A+ C+LQ  
Sbjct: 3248 LLAQIGRSEEAGKSFSVAAQLHDGLTKAWAMWGDYMEQIFLKERKITLAVDALICYLQAS 3307

Query: 3303 KFGI-SNSRNHLARVLYLLSFDTPNEPVGRAFDKYLDQIPHWVWLSWIPQLLLSLQRTEA 3362
            +  I S +R ++A+VL+ LS+D   + +    +K++  IP   WL WIPQLL  L++ E 
Sbjct: 3308 RNQIESKTRKYIAKVLWFLSYDNNTKILISTLEKHVAGIPPSYWLPWIPQLLCCLEQFEG 3367

Query: 3363 PHCKLVLLKIANVYPQALYYWLRT-YLLERRDVANKSELGRMAMAQQRMQQNTSSAGSLG 3422
                 +L +I  +YPQA+Y+ +RT YL  + +   K +      A+Q ++ + S+     
Sbjct: 3368 DVILNLLSQIGRLYPQAVYFPIRTLYLTLKIEQREKHK-----TAEQAVKSSCSNIDGTT 3427

Query: 3423 LTDGSSRVAHGGSSTSTDNQVHQGTQSGSGIGSHDGGNSHSQEPERSTGVESSTHAGNDQ 3482
            L+ G    +HG   +    +        S +           + E    + SS     DQ
Sbjct: 3428 LSFGRG-ASHGNIPSINPIKATPPMWRCSKV--------MQLQREVHPTILSSLEGIVDQ 3487

Query: 3483 SLPQTSSNVNEGTQNALRRSAALGLVGSAASAFDAAKDIMEALRSKHTNLASELEILLTE 3542
             +    S   E     + R    GL+   A AF+    +      +H+ +       + +
Sbjct: 3488 MVWFRESWTEE-----VLRQLRQGLIKCYAIAFEKRDTV------QHSTITPHTLHFVKK 3547

Query: 3543 IGSRFVTLPEERLLAVVNALLHRCYKYPTATTAEVPQSLKKE-LSGVCKACFSADAVNKH 3602
            +GS F        + + N         P + T+ +  S   E L+   +  F      K 
Sbjct: 3548 LGSTFG-------IGIENV--------PGSVTSSISNSAASESLARRAQVTFQDPVFQK- 3607

Query: 3603 VDFVREYKQDFERDLDPESTSTFPATLSELTERLKHWKNVLQGNVEDRFPAVLRLEDESR 3662
                   K+ F  D D          L  L  +LK W  VL+  V+ + P    +ED+ R
Sbjct: 3608 ------MKEQFTNDFDFSKPGAM--KLHNLISKLKTWIKVLETKVK-KLPTSFLIEDKCR 3667

Query: 3663 VLRDF--HVVDVEVPGQYFTDQEIAPDHTVKLDRVGADIPIVRRHGSSFRRLTLIGSDGS 3722
             L +F     +VE+PG+      ++  + V++ R    + IV+++ ++ RRL + G++G 
Sbjct: 3668 FLSNFSQKTAEVELPGELLI--PLSSHYYVRIARFMPRVEIVQKNNTAARRLYIRGTNGK 3727

Query: 3723 -QRHFIVQTSLTPNARSDERILQLFRVMNQMFDKHKESRRRHLCIHTPIIIPVWSQVRMV 3782
               + +V  S   +AR +ER+LQL R++N   +K KE+ RR L I  P ++P+  Q+R+ 
Sbjct: 3728 IYPYLVVLDSGLGDARREERVLQLKRMLNYYLEKQKETSRRFLNITVPRVVPISPQMRLA 3787

Query: 3783 EDDLMYSTFLEVYENHCARNDQEADLPITYFKEQLNQAISGQIAPEAVLDLRLQAYGDIT 3842
            ED+    + L++++  C     + D+PI  + ++L++ +  +  P     LR + + +I 
Sbjct: 3788 EDNPNSISLLKIFKKCCQSMQVDYDMPIVKYYDRLSE-VQARGTPTTHTLLR-EIFSEIQ 3789

Query: 3843 RNLVNEGIFSQYMYKTLLSGNHMWAFKKQFAIQLALSSFMSYMLQIGGRSPNKIYFAKNT 3888
              +V + +   +  KT L+    W F+K   +QLAL+    + L +   + + +Y  +++
Sbjct: 3848 WTMVPKTLLKHWALKTFLAATDFWHFRKMLTLQLALAFLCEHALNLTRLNADMMYLHQDS 3789

BLAST of MS010599 vs. ExPASy Swiss-Prot
Match: Q54T85 (Probable transcription-associated protein 1 OS=Dictyostelium discoideum OX=44689 GN=tra1 PE=3 SV=2)

HSP 1 Score: 1228.8 bits (3178), Expect = 0.0e+00
Identity = 1132/4379 (25.85%), Postives = 1919/4379 (43.82%), Query Frame = 0

Query: 6    NFELHSRQLVE-PELSIQTR-LQMATEVRDSLEIAHTPEYLNFLKCYFRAFSIILVQITK 65
            NFE ++R+  E    + QT+ L + TE+RD++E+ HT EY  FL   F  F  IL Q   
Sbjct: 55   NFESYARRCFELNNNNEQTQLLALVTEIRDNIELVHTVEYPTFLNFLFPVFYNILRQ-GA 114

Query: 66   PQYTDNHEHKLRNIVVEILNRLPHSEVLRPFVQDLLKVAMQVLTTDNEENGLICIRIIFD 125
             Q+ D  E K+RN +++ILN+LP++E+LRP +  LL+++M +L  DNEEN L+C+RII +
Sbjct: 115  VQFNDGPEQKIRNTILDILNKLPNNELLRPHILVLLQLSMYLLEVDNEENALVCLRIIIE 174

Query: 126  LLRNFRPTLENEVQPFLDFVCKIYQNFKLTVSHFFENSAAGG-----EDIKPMDVSTSTD 185
            L +N+R  LE+E+QPFL+ V K+Y +   T+   F +S++         I P   +T+T 
Sbjct: 175  LHKNYRNALESEIQPFLNIVLKLYTDLPSTIEKTFSSSSSASLSTTTTAISPTTTTTTTP 234

Query: 186  QTITTGYTGTVQLN----------PST--------------------------------- 245
             T TT  T T   N          PST                                 
Sbjct: 235  ATATTPATTTATGNTITTPPPATPPSTTATAISPTSSTTTTTTATTAAAATIATTTATTT 294

Query: 246  -------------RSFKIVTESPLVVMFLFQLYSRLVQTNIPVLLPLMVSAISVPGPEKV 305
                          SFKI+TE P+VV+ LFQLY+  + +N+P  +PL++  +S+  P   
Sbjct: 295  ITPPLPPYMIKSIESFKILTECPIVVILLFQLYNSYMSSNVPKFIPLIIETLSLQAPANS 354

Query: 306  PPFLKTHFIELKGAQVKTVSFLTYLLRSSADYIRPHEESICKSIVNLLVTC-SDSVSIRK 365
                 + +++   AQVKT+  L Y+L+   + I+ + +   +S++ LL  C + S +IRK
Sbjct: 355  TVTHHSQYVDFIAAQVKTLYLLAYVLKWHIEQIKQYSDRFPRSVIQLLQNCPAHSSAIRK 414

Query: 366  ELLVALKHVLGTEYKRGLFPLIDTLLEEKVLVGTGRACYETLRPLAYSLLAEIVHHVRGD 425
            ELLV L+H+L +++K      +D LL+EK+++GT R  YE+LR +AY  LA+ +H++R +
Sbjct: 415  ELLVTLRHILSSDFKSKFIVYLDLLLDEKIILGTSRTSYESLRSMAYGSLADFIHNMRNE 474

Query: 426  LTLSQLSRIIYLFSSNMHDASLSLSIHTTCARLMLNLVEPIFEKGVDQTSMDEARILLGR 485
            L ++Q+S+++ ++S ++HD +  +SI     +L+++L++ I  K        ++R ++ +
Sbjct: 475  LNINQISKVVAIYSRHLHDQTNPVSIQIMSVKLIISLMDVIQRK--QDPPEYKSRSIIYK 534

Query: 486  ILDAFVGKFSTFKHTIPQLL-----EEGEEGKDRANMRSKLE--LPVQAVLNLQVPVEHS 545
            ++++F+ KFS+ K +IP+LL     E+ +E KD  +++ KL+         +    +   
Sbjct: 535  VIESFINKFSSLKRSIPKLLADQQKEKEKELKDPQSLKDKLDGLSSANTTTSSTGEIIIL 594

Query: 546  KEVNDCKHLIKTLILGMKTIIWSITHAHLPRP-------QASPSPNGTHPQMLVSPSSNL 605
              V D + LIKT+   ++ I WS++   + +P         + +   T+    + P   +
Sbjct: 595  DPVKDTRTLIKTMTSSLRNIFWSLSACPINKPGTGITTGAGATTTTTTNTNNTIIPPVRI 654

Query: 606  ATPQAFKGMREDEVCKASGVLKSGVHCLTLF----KEKDEEVEMLHLFSQILTVMEPRDL 665
            A P        +E      + KS V C  ++        EE EM+  F+    +++ R  
Sbjct: 655  ALPSI------EESLLFIKLFKSTVKCFPIYGGCNPSPQEEKEMIENFTASFMMLDQRTF 714

Query: 666  MDMFSLCMPELFDCMITNTQLVHLFSTFLQTP-------KVYRPFADVLVNFLVSSKLDV 725
             ++ +  +P L+   + N  L+ +   FL          ++ R F +VL  FL   K+  
Sbjct: 715  QEVSTFILPFLYQRSLNNPSLLLIPQGFLSVTQMNPTGVQINRVFLEVLTPFLY-EKIRN 774

Query: 726  LKHPDSPGAKLVLHLFRFVFGAVAKAPSD------------------------------- 785
            L+  D P    ++ L + +F A+    +                                
Sbjct: 775  LQPTDKPDI-CMIKLIKLIFNAIQPNNNSGVGGSGGSNSSGGGGGGGSNSSNNSTNSNTT 834

Query: 786  -------FERILQPHVTVIMEVCVRSATEVERPLGYMQLLRIMFRALAGCKFELLLRDLI 845
                    +++L   + +++++ +  + +++  + Y+ LL+ +F++         +  L 
Sbjct: 835  TNIDSTCVQQVLSSMILILLKL-ITESKQID-SIQYLLLLKTIFKSCTRPDQSKEITLLF 894

Query: 846  PLLQPCLN-MLLTMFDGPTGEDMRDLLLELCLTLPARLSSLLPHLPRLMKPLVLCL-KGS 905
            P++   LN +LL+         ++ LL+EL L++P ++++LLP L  L+KPL+L L   S
Sbjct: 895  PIILETLNDLLLSSSHSTMIPAVQQLLIELSLSIPVQIATLLPSLHLLVKPLMLALDSSS 954

Query: 906  DDLVSLGLRTLEFWVDSLNPDFLEPSMANVMSEVILALWSHLRPIPYPWGAKALQVLGKL 965
             +L+S   R LE  VD+   DFL  +  +  SE +  L  HLRP PY +G  A+++LGK+
Sbjct: 955  SELLSTTFRILELIVDNATGDFLLFTFRDNKSEFLQILSKHLRPAPYFYGPHAIRILGKM 1014

Query: 966  GGRNRRF--LKEPLALEC---------------------------KENPEHGLRLILTFE 1025
             G++R F  L   L+++                             EN     +L+L  E
Sbjct: 1015 AGKSRSFSVLSPILSIDSTSNSRSIPSSNKNNNNNNYYYNGSCSNSENYSKVFKLLLPCE 1074

Query: 1026 PSTPFL--VPLDRCINLAVSTVMNKTGGVDSFYRKQALKFLRVCLSSQLNLPGNVADDGH 1085
                    +PLD+ I    + ++ +    DS+ +  A   L+  +S  L+    + +   
Sbjct: 1075 TGDDKTKSIPLDKSIQSIKNILLYQLD--DSYLQSNAYSLLKYYISLYLSSQDFLINQQS 1134

Query: 1086 TPRQLSTLLVSPVDSSLRRSET-----------PEGKADLG------------------- 1145
               +L   L    +++   S T            E + D                     
Sbjct: 1135 LLNELLNNLKQSNNNNNNNSSTVNLNIIELDNENENENDNNNNNNNNNNNNNNNNNNNNN 1194

Query: 1146 ------------------VKTKTQLMAEKSVFKILLMTIIAAGSEEDLHEPKDD--FVLN 1205
                               KTK + + E   FK L+  +  + + + L E  D   F+ N
Sbjct: 1195 NNNNNNNNNNNNNNNIKTFKTKEEYLNEIKNFKDLVYCLFLSITNDHLKEKFDSLKFLNN 1254

Query: 1206 VCRHFAILFHIDSSLNSSPVASASLGSTLLPPNVSANSRLRSSACCNLKELDPLIFLDAL 1265
               HF +L+      N S +                          ++KELDP IFL+AL
Sbjct: 1255 FIYHF-VLYLSTFKFNYSII--------------------------SMKELDPKIFLEAL 1314

Query: 1266 VEVLADE---------------------NRVHAKAALNAL-NLFSEILLFLARAKQTDVM 1325
            V+V++                       N+ H  + L+ + N  ++I    + +K+ + +
Sbjct: 1315 VDVMSMSSHNIINQSNIEDLQTISTSKFNKAHITSLLDMIFNCSNQIFSENSNSKKNNEL 1374

Query: 1326 MTR------GPSTPMIVSSPSKSPVYSPPPSVRI--------------------PVFEQL 1385
            MT       G    M      K    S   +  I                    P+F+ L
Sbjct: 1375 MTSSTDVKDGEKVEMETEDSLKKDEMSAAATSEIKKETNVVVENEQDKDTVLISPIFKYL 1434

Query: 1386 LPRLLHCCYGSTWQAQMGGIMGLGALVGKVTVETLCLFQVRIVRGLVYVLKRLPI----- 1445
            +   + CCY   +  +  G++G+  ++  V +  +  FQ  I++ L++V + L       
Sbjct: 1435 VKLFIKCCYDKDFSVKGAGLIGIEYIIENVKLSWIQPFQHLILKSLLFVCEDLSYSGYQP 1494

Query: 1446 ---YASK----------------------EQEETSQVLNQVLRVVNNVDEA--------- 1505
               YAS+                      +Q  T+    +        + A         
Sbjct: 1495 TIDYASEIIINLIKLCVPNLNIVPDSMEIDQSTTTASTTETAATTTTTETATPMVTESTA 1554

Query: 1506 --------------NSEPRRQSFHGVVDI----LASELFNPNSSTIVRKNVQSCLAL--- 1565
                           S P   S      I     +S    P ++T    N+ S   +   
Sbjct: 1555 IVTEPTATTPTATPTSTPTSTSTPTPTPIPTATTSSTTTAPTTTTTTTTNLSSSSTINQK 1614

Query: 1566 ------------------------------------LASR---------TGSEVSELLEP 1625
                                                LA R         T   +S+L+E 
Sbjct: 1615 PHCKLNQLKLKDRELLKLILEILMERITSWSGHTRSLAQRMLTMISVEITKIPMSQLIED 1674

Query: 1626 LHQPLLQPLLLRPLRLKTIDQQVGTVTALNFCLALRP-PLLKLTQELVNFLQEALQIA-- 1685
            L   + + L   PL+  +I  Q G +  L FCL+ +P PL+++  + V  LQE L +A  
Sbjct: 1675 LKMTVQKLLPKTPLKSLSISLQTGVIDGLTFCLSQKPSPLIEIGADTVRVLQECLNVAGD 1734

Query: 1686 EADETVW-VVKFMNPKIATSLNKLRTACIELLCTTMAWADFKTPNHSELRAKIISMFFKS 1745
            E+  T    +K  + K  ++ N LR   +E++ T M   DF      E + +II MFFK 
Sbjct: 1735 ESSPTQQSQIKSSSAKSISATNNLRVCGVEMVATAMTCPDFLQFECLEFKNRIIRMFFKV 1794

Query: 1746 LTCRTPEVVAVAKEGLRQVINQQRMPKDLLQGSLRPILVNLAHTKNLSMPLLQGLARLLE 1805
            +T R  E+   AK GL   I QQR+ +DLLQ  LRP+L N+   K+LS+P LQGL+RLLE
Sbjct: 1795 VTARNKEMAMAAKRGLANSIQQQRLHRDLLQTCLRPVLSNITDPKSLSVPFLQGLSRLLE 1854

Query: 1806 LLASWFNVTLGGKLLEHLKKWLEPEKLAQSQKAWKAGEEPKIAAAIIELFHLLPMAASKF 1865
            LL++ FN  LG KL E+LKK+ E  KL+     ++  EE KI A+II++FHLLP AA K 
Sbjct: 1855 LLSNCFNAALGEKLFEYLKKFEEAGKLSYLANKYRDSEEVKICASIIDIFHLLPPAA-KL 1914

Query: 1866 LDELVTLTIDLEGALPPGQVYSEVNSPYRVPLIKFLNRYAPLAVDYFLARLSEPKYFRRF 1925
            LD  + LTI LE +L       EV SPYR PLI+FL +Y    ++ F+ +L  P++   F
Sbjct: 1915 LDSTIILTIRLEQSL-----CKEVTSPYREPLIRFLAKYPQRTIEIFMGQL--PQFNLIF 1974

Query: 1926 MYIIR-SDAGQPLREELAKSPQKILASAFPEFAPKSEAALTPGSSTSPAPLSGDEGLVTP 1985
              I++     +P+ EELA +      S + E   KS                        
Sbjct: 1975 RLILKHQPLSKPIVEELANT-----YSIWLEAHLKS------------------------ 2034

Query: 1986 DASDPPSAPSSVVSDAYFRGLALIKTLVKLMPGWLQNNRVVFDTLVLVWKSPAR-IARLH 2045
                 PSA      D  F  L+++  + K +P WL  NR V D L+  W+  +  I    
Sbjct: 2035 -----PSA------DIRFHTLSMVSIIRKQLPNWLPENRKVLDILIEYWRPLSHMIQSAS 2094

Query: 2046 NEQELNLVQVKESKWLVKCFLNYLRHEKAEVNVLFDILSIFLFHTRIDYTFLKEFYIIEV 2105
            N  +++   ++E+K +VKCFL Y +    E ++ F +LS+      +D+ FL+++Y  ++
Sbjct: 2095 NPLDISNQTLRETKIIVKCFLQYCKAHSEETDLYFYMLSVLTLRASMDFNFLRDYYQHDL 2154

Query: 2106 AEGYPPNMKKALLLHFLNLFQSKQLGHDHLVIVMQMLILPMLAHAF-----QNGQSWEVV 2165
            A       KK ++  FL  F+ + +  D+ V  +Q LI P+L + F      +     ++
Sbjct: 2155 APSSTIEQKKKIIQTFLIFFKDQTIPSDNKVQAIQNLITPILTNYFHQTDRNSSSGGGII 2214

Query: 2166 DQAIIKTIVDKLLDPPEEVSAEYDEPLRIELLQLATLLLKYLQSDLVHHRKELIKFGWNH 2225
            + ++   +  + L+   EV A YD+ L IELLQL TLL+K L S LV  RKELIKF WNH
Sbjct: 2215 EDSLFIQLTKQTLE--TEVKASYDDTLLIELLQLETLLVKNLSSVLVDCRKELIKFAWNH 2274

Query: 2226 LKREDSASKQWAFVNVCHFLEAYQAPEKIILQVFVALLRTCQPENKMLVKQALDILMPAL 2285
            LK ED   KQ A++  C F+EAY+ P KI+LQV+V LLR  QPE+K LVKQALDILMP  
Sbjct: 2275 LKNEDLTCKQSAYILACGFIEAYETPHKIVLQVYVPLLRAYQPESKHLVKQALDILMPCF 2334

Query: 2286 PRRLPLGDSRMPIWIRYTKKILVEEGHSIPNLIHIFQLIVRHSDLFYSCRAQFVPQMVNS 2345
              RLP GD +   W+++TKKI+VEEGH+   L+HI QLIVRH  LFY  R+QFVP ++  
Sbjct: 2335 KTRLPGGDPKNSTWVKWTKKIIVEEGHTTAQLVHIIQLIVRHPQLFYPSRSQFVPHIILL 2394

Query: 2346 LSRLGLPYNTTAENRRLAIDLAGLVVGWERQRQNEMKLVTESDAPSHSNDGLTCPPGADP 2405
            L ++ L  N TAEN++L+ID+A  ++ WE+ R + ++   +S   S S+   T       
Sbjct: 2395 LPKIALGSNLTAENKKLSIDIADTIIIWEKMRMSNLQ---QSIKTSSSSLPTTTTTTTSS 2454

Query: 2406 KRMVDGSTFPEDSTKRVKVEPGLQSLCVMSPGGASSMPNIE----TPG---STTQPDEEF 2465
             +  D S+ P ++     +  G  S+   S GG ++ PN+     TPG     T  D+E+
Sbjct: 2455 NKPTDSSSLPPNT----PIAEG--SITTPSQGGVAT-PNVSDSTPTPGIHHGATNIDDEY 2514

Query: 2466 KPNAAMEEMIINFLIRVALVIEPKDKEATAMYKQALELLSQALEVWPNANVKFNYLEKLL 2525
            +P  +  E I  FLIR+A            + ++  ELL Q L +WP  N+KF+  EK +
Sbjct: 2515 RPPLSAIEHISLFLIRMA-------SNWYHINEKCSELLRQTLVIWPETNIKFSVFEKPM 2574

Query: 2526 SSIQPSQSKDPSTALAQGLDVMNKVLEKQPHLFVRNNINQISQILEPCFKHKMLDAGKSL 2585
            ++ QP         ++  L ++N + E Q + F+ NN+  + Q L              L
Sbjct: 2575 NTDQPQM-------ISTCLSMLNLIAEYQVNTFIPNNVVALQQSLLQALNSDNAKISSLL 2634

Query: 2586 CSLLRMVFVAYPL-----------------------EGVTTPPDVKLLYQKVDELI---- 2645
             SL + +  A+PL                         +  PP V++     +E++    
Sbjct: 2635 GSLFKKILAAFPLPTNNTTTTTPVSSTTTTEQSSDSSSLPPPPPVQVTKPIPNEMVSFYT 2694

Query: 2646 --------------KNHINNLTAPQTSSEDNTASSIS-FVLLVIKTLTEVQKNLIDPYNL 2705
                          KN+  ++ +      D++ S I  ++ L++K L  + +N +   + 
Sbjct: 2695 FIGTQFEMILGAFDKNYNLSILSNIKVFSDHSESFIDPYISLIVKVLIRLTRNYLSQDSD 2754

Query: 2706 GRILQRLARDMGSSAGSHLRQGQRMDPDSA-----------------------------V 2765
            G       + + SS  +    G      SA                             +
Sbjct: 2755 GGTGSLANKPLSSSGSTSQTGGASQTATSASNVVLKKSNSEIISGLCKTYGFLKTKTTKL 2814

Query: 2766 TSSRQSADVGTVI-----SN----LKSVLKLINERVMLVPECKRSVTQIM---------- 2825
             S +++A + +++     SN    L  ++K+++  + + P    S T ++          
Sbjct: 2815 NSDQRNAFIQSLLVLIERSNDVELLSEIIKVVDYLISISPSPSPSTTPVVTETTIPSTTT 2874

Query: 2826 ---NSLLSEKGTDASVLLCILDVIKGWVEDDFSKMGTSVSSSSFLAPKEIVSFLQKLSQV 2885
                +  +   T  S               + +    + + + FL  KE ++FL KL +V
Sbjct: 2875 TTTTAATTTTTTTPSTTTTAATTTTAPTTTETTTTAATTTITPFLTIKEKINFLIKLGRV 2934

Query: 2886 DKQNFSSSAAEEWDGKYLQLLYEICADSNKYPLSLRQEVFQKVERQFMLGLR-ARDPETR 2945
            D+ + +     E    Y +L+    ++SN    S +QE+ Q +E  FM+GLR   D   R
Sbjct: 2935 DQLSNA-----ELSLSYYKLVLSFYSESNS---SSKQELSQ-LEPCFMMGLRNTVDQGMR 2994

Query: 2946 KKFFTLYHESLGKTLFIRLQYIIQIQDWEALSDVFWLKQGLDLLLAVLVEDKPITLAPNS 3005
            K  F + H+S+G T + RL YII +Q W+ L   +W+K  LDLLLA+L  DK + ++   
Sbjct: 2995 KSLFNILHKSIGTTPYQRLNYIIGVQQWDILGTTYWIKHALDLLLAILPNDKFVKISNFC 3054

Query: 3006 ARLPPLLVSGH------------------------------VADSSAVQPQVNDAQEGLE 3065
            ++LP  L   +                                     Q Q    Q   +
Sbjct: 3055 SKLPTSLKFANRNGNDINQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQHHQQ 3114

Query: 3066 DAPLTFD--------SLVHK-----------HAQF---LNRTSKLQVADLIIPLRELAHT 3125
            + P+  D        S V+K           H Q+   L     L+ ++    LREL   
Sbjct: 3115 EQPMEIDENLVVEQSSSVNKGNEEFKKSLKLHTQWLESLKNEESLKFSEFNENLRELIFI 3174

Query: 3126 DANVAYHLWVLVFPIVWVTLHKEEQVALAKPMISLLSKDYHKK----------------- 3185
            D+++   LW  +F  +W  L KEEQ  L+K +  LLSKDY KK                 
Sbjct: 3175 DSHLVNDLWCHLFSDMWSDLTKEEQFKLSKSLTLLLSKDYTKKVPLVSKPIIPPTSIPIS 3234

Query: 3186 ---------------------------------------QQAS--------RPNVVQALL 3245
                                                   QQ +         PNV++  +
Sbjct: 3235 KPIITTTTSTSSSTSTTTPITTIPLINNSQLITIVTLTQQQTNPIIVPSLREPNVIKTWM 3294

Query: 3246 EGLQLSHPQPRMPSELIKYIGRTYNAWHIALALLESHVM---LFMNETKCS-ESLAELYR 3305
            E L +  P P++P E+I ++G  YN W+ A+ ++E  ++     ++ T  + + L+ LY 
Sbjct: 3295 ETLGMCKPIPKVPIEVISFLGENYNCWYYAIRMIEQQLIDRQKLLDSTDINWDYLSYLYG 3354

Query: 3306 LLNEEDMRCGLWKRKAISAETKAGLSLVQHGYWQRAQILFYQSMVKATQGTYNNNVPKAE 3365
             + E+D+  G+++++    ETK GL L Q   +Q +Q +F  +M K +        P++E
Sbjct: 3355 AIGEKDLLYGIYRKRYQCDETKLGLLLEQFYMFQSSQEVFLSAMNKYS-AVGCKPTPRSE 3414

Query: 3366 MCLWEEQWLYCASQLSQWEALVDFGKSIENYEILLDSLWKVPDWAYMKEHVIPKAQVEET 3425
              LWE+ WL CA +L+QW  + +F K    Y++ ++S WK+P W  +KE++       +T
Sbjct: 3415 NLLWEDHWLECAKRLNQWNFVHEFSKEKNMYDLTIESAWKIPQWNSVKENMKKMMSQGDT 3474

Query: 3426 PKLRLIQAYFSLHDRSTNGVADAENIVGKGVDLALEQWWQLPEMSVHARIPLLQQFQQLV 3485
               +++Q YF  +++  + V  A   +     L L++W  LPE S  +    L + QQ+V
Sbjct: 3475 SIRKILQGYFLTNEKRYHEVDPA---IVTSNQLILDKWVSLPERSFRSHTNSLVEMQQVV 3534

Query: 3486 EVQESSRILVDIANGNKHSGSSVVSVHTNLYADLKDILETWRLRIPNEWDSMTVWCDLLQ 3545
            E+QES  IL +I+N       + +S        +K I   WR R+PN+ + + +W +L+ 
Sbjct: 3535 ELQESVHILKEISNITLSQQPADLSRSFLTSNYIKSIFNIWRERLPNKDEDLLIWFELMA 3594

Query: 3546 WRNEMYNAV--------------------IDAFKDFGTTNS----------QLHHLGFR- 3605
            WR +++N +                           GTT +           ++ + F  
Sbjct: 3595 WRQQVFNIIGTPSMNGGIGANPVTPTNTTTTITNPDGTTTTTTTPLPPPQQPINQIEFAS 3654

Query: 3606 ------DKAWNVNKLAHVARKQGLHDVCVGILEKMYGHSTMEVQEAFVKIREQAKAYLEM 3636
                  + AW +NK +H+ RK  + +VC+  L KM+    +E+ + F+ ++EQ K YL++
Sbjct: 3655 PRYMVLEMAWTMNKYSHIVRKHNIIEVCLNSLSKMF-DLQIELHDIFLNLKEQIKCYLQL 3714

BLAST of MS010599 vs. ExPASy TrEMBL
Match: A0A6J1BWI4 (Non-specific serine/threonine protein kinase OS=Momordica charantia OX=3673 GN=LOC111005968 PE=4 SV=1)

HSP 1 Score: 7655.8 bits (19863), Expect = 0.0e+00
Identity = 3880/3888 (99.79%), Postives = 3884/3888 (99.90%), Query Frame = 0

Query: 1    MSPIQNFELHSRQLVEPELSIQTRLQMATEVRDSLEIAHTPEYLNFLKCYFRAFSIILVQ 60
            MSPIQNFELHSRQLVEPELSIQTRLQMATEVRDSLEIAHTPEYLNFLKCYFRAFSIILVQ
Sbjct: 1    MSPIQNFELHSRQLVEPELSIQTRLQMATEVRDSLEIAHTPEYLNFLKCYFRAFSIILVQ 60

Query: 61   ITKPQYTDNHEHKLRNIVVEILNRLPHSEVLRPFVQDLLKVAMQVLTTDNEENGLICIRI 120
            ITKPQYTDNHEHKLRNIVVEILNRLPHSEVLRPFVQDLLKVAMQVLTTDNEENGLICIRI
Sbjct: 61   ITKPQYTDNHEHKLRNIVVEILNRLPHSEVLRPFVQDLLKVAMQVLTTDNEENGLICIRI 120

Query: 121  IFDLLRNFRPTLENEVQPFLDFVCKIYQNFKLTVSHFFENSAAGGEDIKPMDVSTSTDQT 180
            IFDLLRNFRPTLENEVQPFLDFVCKIYQNFKLTVSHFFENSAAGGEDIKPMDVSTSTDQT
Sbjct: 121  IFDLLRNFRPTLENEVQPFLDFVCKIYQNFKLTVSHFFENSAAGGEDIKPMDVSTSTDQT 180

Query: 181  ITTGYTGTVQLNPSTRSFKIVTESPLVVMFLFQLYSRLVQTNIPVLLPLMVSAISVPGPE 240
            ITTGYTGTVQLNPSTRSFKIVTESPLVVMFLFQLYSRLVQTNIPVLLPLMVSAISVPGPE
Sbjct: 181  ITTGYTGTVQLNPSTRSFKIVTESPLVVMFLFQLYSRLVQTNIPVLLPLMVSAISVPGPE 240

Query: 241  KVPPFLKTHFIELKGAQVKTVSFLTYLLRSSADYIRPHEESICKSIVNLLVTCSDSVSIR 300
            KVPPFLKTHFIELKGAQVKTVSFLTYLLRSSADYIRPHEESICKSIVNLLVTCSDSVSIR
Sbjct: 241  KVPPFLKTHFIELKGAQVKTVSFLTYLLRSSADYIRPHEESICKSIVNLLVTCSDSVSIR 300

Query: 301  KELLVALKHVLGTEYKRGLFPLIDTLLEEKVLVGTGRACYETLRPLAYSLLAEIVHHVRG 360
            KELLVALKHVLGTEYKRGLFPLIDTLLEEKVLVGTGRACYETLRPLAYSLLAEIVHHVRG
Sbjct: 301  KELLVALKHVLGTEYKRGLFPLIDTLLEEKVLVGTGRACYETLRPLAYSLLAEIVHHVRG 360

Query: 361  DLTLSQLSRIIYLFSSNMHDASLSLSIHTTCARLMLNLVEPIFEKGVDQTSMDEARILLG 420
            DLTLSQLSRIIYLFSSNMHDASLSLSIHTTCARLMLNLVEPIFEKGVDQTSMDEARILLG
Sbjct: 361  DLTLSQLSRIIYLFSSNMHDASLSLSIHTTCARLMLNLVEPIFEKGVDQTSMDEARILLG 420

Query: 421  RILDAFVGKFSTFKHTIPQLLEEGEEGKDRANMRSKLELPVQAVLNLQVPVEHSKEVNDC 480
            RILDAFVGKFSTFKHTIPQLLEEGEEGKDRANMRSKLELPVQAVLNLQVPVEHSKEVNDC
Sbjct: 421  RILDAFVGKFSTFKHTIPQLLEEGEEGKDRANMRSKLELPVQAVLNLQVPVEHSKEVNDC 480

Query: 481  KHLIKTLILGMKTIIWSITHAHLPRPQASPSPNGTHPQMLVSPSSNLATPQAFKGMREDE 540
            KHLIKTLILGMKTIIWSITHAHLPRPQASPSPNGTHPQMLVSPSSNLATPQAFKGMREDE
Sbjct: 481  KHLIKTLILGMKTIIWSITHAHLPRPQASPSPNGTHPQMLVSPSSNLATPQAFKGMREDE 540

Query: 541  VCKASGVLKSGVHCLTLFKEKDEEVEMLHLFSQILTVMEPRDLMDMFSLCMPELFDCMIT 600
            VCKASGVLKSGVHCLTLFKEKDEEVEMLHLFSQILTVMEPRDLMDMFSLCMPELFDCMIT
Sbjct: 541  VCKASGVLKSGVHCLTLFKEKDEEVEMLHLFSQILTVMEPRDLMDMFSLCMPELFDCMIT 600

Query: 601  NTQLVHLFSTFLQTPKVYRPFADVLVNFLVSSKLDVLKHPDSPGAKLVLHLFRFVFGAVA 660
            NTQLVHLFSTFLQTPKVYRPFADVLVNFLVSSKLDVLKHPDS GAKLVLHLFRFVFGAVA
Sbjct: 601  NTQLVHLFSTFLQTPKVYRPFADVLVNFLVSSKLDVLKHPDSQGAKLVLHLFRFVFGAVA 660

Query: 661  KAPSDFERILQPHVTVIMEVCVRSATEVERPLGYMQLLRIMFRALAGCKFELLLRDLIPL 720
            KAPSDFERILQPHVTVIMEVCVRSATEVERPLGYMQLLRIMFRALAGCKFELLLRDLIPL
Sbjct: 661  KAPSDFERILQPHVTVIMEVCVRSATEVERPLGYMQLLRIMFRALAGCKFELLLRDLIPL 720

Query: 721  LQPCLNMLLTMFDGPTGEDMRDLLLELCLTLPARLSSLLPHLPRLMKPLVLCLKGSDDLV 780
            LQPCLNMLLTMFDGPTGEDMRDLLLELCLTLPARLSSLLPHLPRLMKPLVLCLKGSDDLV
Sbjct: 721  LQPCLNMLLTMFDGPTGEDMRDLLLELCLTLPARLSSLLPHLPRLMKPLVLCLKGSDDLV 780

Query: 781  SLGLRTLEFWVDSLNPDFLEPSMANVMSEVILALWSHLRPIPYPWGAKALQVLGKLGGRN 840
            SLGLRTLEFWVDSLNPDFLEPSMANVMSEVILALWSHLRPIPYPWGAKALQVLGKLGGRN
Sbjct: 781  SLGLRTLEFWVDSLNPDFLEPSMANVMSEVILALWSHLRPIPYPWGAKALQVLGKLGGRN 840

Query: 841  RRFLKEPLALECKENPEHGLRLILTFEPSTPFLVPLDRCINLAVSTVMNKTGGVDSFYRK 900
            RRFLKEPLALECKENPEHGLRLILTFEPSTPFLVPLDRCINLAVSTVMNKTGGVDSFYRK
Sbjct: 841  RRFLKEPLALECKENPEHGLRLILTFEPSTPFLVPLDRCINLAVSTVMNKTGGVDSFYRK 900

Query: 901  QALKFLRVCLSSQLNLPGNVADDGHTPRQLSTLLVSPVDSSLRRSETPEGKADLGVKTKT 960
            QALKFLRVCLSSQLNLPGNVADDGHTPRQLSTLLVSPVDSSLRRSETPEGKADLGVKTKT
Sbjct: 901  QALKFLRVCLSSQLNLPGNVADDGHTPRQLSTLLVSPVDSSLRRSETPEGKADLGVKTKT 960

Query: 961  QLMAEKSVFKILLMTIIAAGSEEDLHEPKDDFVLNVCRHFAILFHIDSSLNSSPVASASL 1020
            QLMAEKSVFKILLMTIIAAGSEEDLHEPKDDFVLNVCRHFAILFHIDSSLN+SPVASASL
Sbjct: 961  QLMAEKSVFKILLMTIIAAGSEEDLHEPKDDFVLNVCRHFAILFHIDSSLNNSPVASASL 1020

Query: 1021 GSTLLPPNVSANSRLRSSACCNLKELDPLIFLDALVEVLADENRVHAKAALNALNLFSEI 1080
            GSTLLPPNVSANSRLRSSACCNLKELDPLIFLDALVEVLADENRVHAKAALNALNLFSEI
Sbjct: 1021 GSTLLPPNVSANSRLRSSACCNLKELDPLIFLDALVEVLADENRVHAKAALNALNLFSEI 1080

Query: 1081 LLFLARAKQTDVMMTRGPSTPMIVSSPSKSPVYSPPPSVRIPVFEQLLPRLLHCCYGSTW 1140
            LLFLARAKQTDVMMTRGPSTPMIVSSPSKSPVYSPPPSVRIPVFEQLLPRLLHCCYGSTW
Sbjct: 1081 LLFLARAKQTDVMMTRGPSTPMIVSSPSKSPVYSPPPSVRIPVFEQLLPRLLHCCYGSTW 1140

Query: 1141 QAQMGGIMGLGALVGKVTVETLCLFQVRIVRGLVYVLKRLPIYASKEQEETSQVLNQVLR 1200
            QAQMGGIMGLGALVGKVTVETLCLFQVRIVRGLVYVLKRLPIYASKEQEETSQVLNQVLR
Sbjct: 1141 QAQMGGIMGLGALVGKVTVETLCLFQVRIVRGLVYVLKRLPIYASKEQEETSQVLNQVLR 1200

Query: 1201 VVNNVDEANSEPRRQSFHGVVDILASELFNPNSSTIVRKNVQSCLALLASRTGSEVSELL 1260
            VVNNVDEANSEPRRQSFHGVVDILASELFNPNSSTIVRKNVQSCLALLASRTGSEVSELL
Sbjct: 1201 VVNNVDEANSEPRRQSFHGVVDILASELFNPNSSTIVRKNVQSCLALLASRTGSEVSELL 1260

Query: 1261 EPLHQPLLQPLLLRPLRLKTIDQQVGTVTALNFCLALRPPLLKLTQELVNFLQEALQIAE 1320
            EPLHQPLLQPLLLRPLRLKTIDQQVGTVTALNFCLALRPPLLKLTQELVNFLQEALQIAE
Sbjct: 1261 EPLHQPLLQPLLLRPLRLKTIDQQVGTVTALNFCLALRPPLLKLTQELVNFLQEALQIAE 1320

Query: 1321 ADETVWVVKFMNPKIATSLNKLRTACIELLCTTMAWADFKTPNHSELRAKIISMFFKSLT 1380
            ADETVWVVKFMNPKIATSLNKLRTACIELLCTTMAWADFKTPNHSELRAKIISMFFKSLT
Sbjct: 1321 ADETVWVVKFMNPKIATSLNKLRTACIELLCTTMAWADFKTPNHSELRAKIISMFFKSLT 1380

Query: 1381 CRTPEVVAVAKEGLRQVINQQRMPKDLLQGSLRPILVNLAHTKNLSMPLLQGLARLLELL 1440
            CRTPEVVAVAKEGLRQVINQQRMPKDLLQGSLRPILVNLAHTKNLSMPLLQGLARLLELL
Sbjct: 1381 CRTPEVVAVAKEGLRQVINQQRMPKDLLQGSLRPILVNLAHTKNLSMPLLQGLARLLELL 1440

Query: 1441 ASWFNVTLGGKLLEHLKKWLEPEKLAQSQKAWKAGEEPKIAAAIIELFHLLPMAASKFLD 1500
            ASWFNVTLGGKLLEHLKKWLEPEKLAQSQKAWKAGEEPKIAAAIIELFHLLPMAASKFLD
Sbjct: 1441 ASWFNVTLGGKLLEHLKKWLEPEKLAQSQKAWKAGEEPKIAAAIIELFHLLPMAASKFLD 1500

Query: 1501 ELVTLTIDLEGALPPGQVYSEVNSPYRVPLIKFLNRYAPLAVDYFLARLSEPKYFRRFMY 1560
            ELVTLTIDLEGALPPGQVYSEVNSPYRVPLIKFLNRYAPLAVDYFLARLSEPKYFRRFMY
Sbjct: 1501 ELVTLTIDLEGALPPGQVYSEVNSPYRVPLIKFLNRYAPLAVDYFLARLSEPKYFRRFMY 1560

Query: 1561 IIRSDAGQPLREELAKSPQKILASAFPEFAPKSEAALTPGSSTSPAPLSGDEGLVTPDAS 1620
            IIRSDAGQPLREELAKSPQKILASAFPEFAPKSEAALTPGSSTSPAPLSGDEGLVTPDAS
Sbjct: 1561 IIRSDAGQPLREELAKSPQKILASAFPEFAPKSEAALTPGSSTSPAPLSGDEGLVTPDAS 1620

Query: 1621 DPPSAPSSVVSDAYFRGLALIKTLVKLMPGWLQNNRVVFDTLVLVWKSPARIARLHNEQE 1680
            DPPSAPSSVVSDAYFRGLALIKTLVKLMPGWLQNNRVVFDTLVLVWKSPARIARLHNEQE
Sbjct: 1621 DPPSAPSSVVSDAYFRGLALIKTLVKLMPGWLQNNRVVFDTLVLVWKSPARIARLHNEQE 1680

Query: 1681 LNLVQVKESKWLVKCFLNYLRHEKAEVNVLFDILSIFLFHTRIDYTFLKEFYIIEVAEGY 1740
            LNLVQVKESKWLVKCFLNYLRHEKAEVNVLFDILSIFLFHTRIDYTFLKEFYIIEVAEGY
Sbjct: 1681 LNLVQVKESKWLVKCFLNYLRHEKAEVNVLFDILSIFLFHTRIDYTFLKEFYIIEVAEGY 1740

Query: 1741 PPNMKKALLLHFLNLFQSKQLGHDHLVIVMQMLILPMLAHAFQNGQSWEVVDQAIIKTIV 1800
            PPNMKKALLLHFLNLFQSKQLGHDHLVIVMQMLILPMLAHAFQNGQSWEVVDQAIIKTIV
Sbjct: 1741 PPNMKKALLLHFLNLFQSKQLGHDHLVIVMQMLILPMLAHAFQNGQSWEVVDQAIIKTIV 1800

Query: 1801 DKLLDPPEEVSAEYDEPLRIELLQLATLLLKYLQSDLVHHRKELIKFGWNHLKREDSASK 1860
            DKLLDPPEEVSA+YDEPLRIELLQLATLLLKYLQSDLVHHRKELIKFGWNHLKREDSASK
Sbjct: 1801 DKLLDPPEEVSADYDEPLRIELLQLATLLLKYLQSDLVHHRKELIKFGWNHLKREDSASK 1860

Query: 1861 QWAFVNVCHFLEAYQAPEKIILQVFVALLRTCQPENKMLVKQALDILMPALPRRLPLGDS 1920
            QWAFVNVCHFLEAYQAPEKIILQVFVALLRTCQPENKMLVKQALDILMPALPRRLPLGDS
Sbjct: 1861 QWAFVNVCHFLEAYQAPEKIILQVFVALLRTCQPENKMLVKQALDILMPALPRRLPLGDS 1920

Query: 1921 RMPIWIRYTKKILVEEGHSIPNLIHIFQLIVRHSDLFYSCRAQFVPQMVNSLSRLGLPYN 1980
            RMPIWIRYTKKILVEEGHSIPNLIHIFQLIVRHSDLFYSCRAQFVPQMVNSLSRLGLPYN
Sbjct: 1921 RMPIWIRYTKKILVEEGHSIPNLIHIFQLIVRHSDLFYSCRAQFVPQMVNSLSRLGLPYN 1980

Query: 1981 TTAENRRLAIDLAGLVVGWERQRQNEMKLVTESDAPSHSNDGLTCPPGADPKRMVDGSTF 2040
            TTAENRRLAIDLAGLVVGWERQRQNEMKLVTESDAPSHSNDGLTCPPGADPKRMVDGSTF
Sbjct: 1981 TTAENRRLAIDLAGLVVGWERQRQNEMKLVTESDAPSHSNDGLTCPPGADPKRMVDGSTF 2040

Query: 2041 PEDSTKRVKVEPGLQSLCVMSPGGASSMPNIETPGSTTQPDEEFKPNAAMEEMIINFLIR 2100
            PEDSTKRVKVEPGLQSLCVMSPGGASSMPNIETPGSTTQPDEEFKPNAAMEEMIINFLIR
Sbjct: 2041 PEDSTKRVKVEPGLQSLCVMSPGGASSMPNIETPGSTTQPDEEFKPNAAMEEMIINFLIR 2100

Query: 2101 VALVIEPKDKEATAMYKQALELLSQALEVWPNANVKFNYLEKLLSSIQPSQSKDPSTALA 2160
            VALVIEPKDKEATAMYKQALELLSQALEVWPNANVKFNYLEKLLSSIQPSQSKDPSTALA
Sbjct: 2101 VALVIEPKDKEATAMYKQALELLSQALEVWPNANVKFNYLEKLLSSIQPSQSKDPSTALA 2160

Query: 2161 QGLDVMNKVLEKQPHLFVRNNINQISQILEPCFKHKMLDAGKSLCSLLRMVFVAYPLEGV 2220
            QGLDVMNKVLEKQPHLFVRNNINQISQILEPCFKHKMLD GKSLCSLLRMVFVAYPLEGV
Sbjct: 2161 QGLDVMNKVLEKQPHLFVRNNINQISQILEPCFKHKMLDTGKSLCSLLRMVFVAYPLEGV 2220

Query: 2221 TTPPDVKLLYQKVDELIKNHINNLTAPQTSSEDNTASSISFVLLVIKTLTEVQKNLIDPY 2280
            TTPPDVKLLYQKVDELIKNHINNLTAPQTSSEDNTASSISFVLLVIKTLTEVQKNLIDPY
Sbjct: 2221 TTPPDVKLLYQKVDELIKNHINNLTAPQTSSEDNTASSISFVLLVIKTLTEVQKNLIDPY 2280

Query: 2281 NLGRILQRLARDMGSSAGSHLRQGQRMDPDSAVTSSRQSADVGTVISNLKSVLKLINERV 2340
            NLGRILQRLARDMGSSAGSHLRQGQRMDPDSAVTSSRQSADVGTVISNLKSVLKLINERV
Sbjct: 2281 NLGRILQRLARDMGSSAGSHLRQGQRMDPDSAVTSSRQSADVGTVISNLKSVLKLINERV 2340

Query: 2341 MLVPECKRSVTQIMNSLLSEKGTDASVLLCILDVIKGWVEDDFSKMGTSVSSSSFLAPKE 2400
            MLVPECKRSVTQIMNSLLSEKGTDASVLLCILDVIKGWVEDDFSKMGTSVSSSSFLAPKE
Sbjct: 2341 MLVPECKRSVTQIMNSLLSEKGTDASVLLCILDVIKGWVEDDFSKMGTSVSSSSFLAPKE 2400

Query: 2401 IVSFLQKLSQVDKQNFSSSAAEEWDGKYLQLLYEICADSNKYPLSLRQEVFQKVERQFML 2460
            IVSFLQKLSQVDKQNFSSSAAEEWDGKYLQLLYEICADSNKYPLSLRQEVFQKVERQFML
Sbjct: 2401 IVSFLQKLSQVDKQNFSSSAAEEWDGKYLQLLYEICADSNKYPLSLRQEVFQKVERQFML 2460

Query: 2461 GLRARDPETRKKFFTLYHESLGKTLFIRLQYIIQIQDWEALSDVFWLKQGLDLLLAVLVE 2520
            GLRARDPETRKKFFTLYHESLGKTLFIRLQYIIQIQDWEALSDVFWLKQGLDLLLAVLVE
Sbjct: 2461 GLRARDPETRKKFFTLYHESLGKTLFIRLQYIIQIQDWEALSDVFWLKQGLDLLLAVLVE 2520

Query: 2521 DKPITLAPNSARLPPLLVSGHVADSSAVQPQVNDAQEGLEDAPLTFDSLVHKHAQFLNRT 2580
            DKPITLAPNSARLPPLLVSGHVADSSAVQPQVNDAQEGLEDAPLTFDSLVHKHAQFLNRT
Sbjct: 2521 DKPITLAPNSARLPPLLVSGHVADSSAVQPQVNDAQEGLEDAPLTFDSLVHKHAQFLNRT 2580

Query: 2581 SKLQVADLIIPLRELAHTDANVAYHLWVLVFPIVWVTLHKEEQVALAKPMISLLSKDYHK 2640
            SKLQVADLIIPLRELAHTDANVAYHLWVLVFPIVWVTLHKEEQVALAKPMISLLSKDYHK
Sbjct: 2581 SKLQVADLIIPLRELAHTDANVAYHLWVLVFPIVWVTLHKEEQVALAKPMISLLSKDYHK 2640

Query: 2641 KQQASRPNVVQALLEGLQLSHPQPRMPSELIKYIGRTYNAWHIALALLESHVMLFMNETK 2700
            KQQASRPNVVQALLEGLQLSHPQPRMPSELIKYIGRTYNAWHI+LALLESHVMLFMNETK
Sbjct: 2641 KQQASRPNVVQALLEGLQLSHPQPRMPSELIKYIGRTYNAWHISLALLESHVMLFMNETK 2700

Query: 2701 CSESLAELYRLLNEEDMRCGLWKRKAISAETKAGLSLVQHGYWQRAQILFYQSMVKATQG 2760
            CSESLAELYRLLNEEDMRCGLWKRKAISAETKAGLSLVQHGYWQRAQILFYQSMVKATQG
Sbjct: 2701 CSESLAELYRLLNEEDMRCGLWKRKAISAETKAGLSLVQHGYWQRAQILFYQSMVKATQG 2760

Query: 2761 TYNNNVPKAEMCLWEEQWLYCASQLSQWEALVDFGKSIENYEILLDSLWKVPDWAYMKEH 2820
            TYNNNVPKAEMCLWEEQWLYCASQLSQWEALVDFGKSIENYEILLDSLWKVPDWAYMKEH
Sbjct: 2761 TYNNNVPKAEMCLWEEQWLYCASQLSQWEALVDFGKSIENYEILLDSLWKVPDWAYMKEH 2820

Query: 2821 VIPKAQVEETPKLRLIQAYFSLHDRSTNGVADAENIVGKGVDLALEQWWQLPEMSVHARI 2880
            VIPKAQVEETPKLRLIQAYFSLHDRSTNGVADAENIVGKGVDLALEQWWQLPEMSVHARI
Sbjct: 2821 VIPKAQVEETPKLRLIQAYFSLHDRSTNGVADAENIVGKGVDLALEQWWQLPEMSVHARI 2880

Query: 2881 PLLQQFQQLVEVQESSRILVDIANGNKHSGSSVVSVHTNLYADLKDILETWRLRIPNEWD 2940
            PLLQQFQQLVEVQESSRILVDIANGNKHSGSSVVSVHTNLYADLKDILETWRLRIPNEWD
Sbjct: 2881 PLLQQFQQLVEVQESSRILVDIANGNKHSGSSVVSVHTNLYADLKDILETWRLRIPNEWD 2940

Query: 2941 SMTVWCDLLQWRNEMYNAVIDAFKDFGTTNSQLHHLGFRDKAWNVNKLAHVARKQGLHDV 3000
            SMTVWCDLLQWRNEMYNAVIDAFKDFGTTNSQLHHLGFRDKAWNVNKLAHVARKQGLHDV
Sbjct: 2941 SMTVWCDLLQWRNEMYNAVIDAFKDFGTTNSQLHHLGFRDKAWNVNKLAHVARKQGLHDV 3000

Query: 3001 CVGILEKMYGHSTMEVQEAFVKIREQAKAYLEMKGELTSGLNLINSTNLDYFPVKHKAEI 3060
            CVGILEKMYGHSTMEVQEAFVKIREQAKAYLEMKGELTSGLNLINSTNLDYFPVKHKAEI
Sbjct: 3001 CVGILEKMYGHSTMEVQEAFVKIREQAKAYLEMKGELTSGLNLINSTNLDYFPVKHKAEI 3060

Query: 3061 FRLKGDFQLKLSDSEGANHSYSSAISLFKNLPKGWISWGNYCDMAYKESHEEIWLEYAVS 3120
            FRLKGDFQLKLSDSEGANHSYSSAISLFKNLPKGWISWGNYCDMAYKESHEEIWLEYAVS
Sbjct: 3061 FRLKGDFQLKLSDSEGANHSYSSAISLFKNLPKGWISWGNYCDMAYKESHEEIWLEYAVS 3120

Query: 3121 CFLQGIKFGISNSRNHLARVLYLLSFDTPNEPVGRAFDKYLDQIPHWVWLSWIPQLLLSL 3180
            CFLQGIKFGISNSRNHLARVLYLLSFDTPNEPVGRAFDKYLDQIPHWVWLSWIPQLLLSL
Sbjct: 3121 CFLQGIKFGISNSRNHLARVLYLLSFDTPNEPVGRAFDKYLDQIPHWVWLSWIPQLLLSL 3180

Query: 3181 QRTEAPHCKLVLLKIANVYPQALYYWLRTYLLERRDVANKSELGRMAMAQQRMQQNTSSA 3240
            QRTEAPHCKLVLLKIANVYPQALYYWLRTYLLERRDVANKSELGRMAMAQQRMQQNT SA
Sbjct: 3181 QRTEAPHCKLVLLKIANVYPQALYYWLRTYLLERRDVANKSELGRMAMAQQRMQQNTGSA 3240

Query: 3241 GSLGLTDGSSRVAHGGSSTSTDNQVHQGTQSGSGIGSHDGGNSHSQEPERSTGVESSTHA 3300
            GSLGLTDGSSRV HGGSSTSTDNQVHQGTQSGSGIGSHDGGNSHSQEPERSTGVESSTHA
Sbjct: 3241 GSLGLTDGSSRVGHGGSSTSTDNQVHQGTQSGSGIGSHDGGNSHSQEPERSTGVESSTHA 3300

Query: 3301 GNDQSLPQTSSNVNEGTQNALRRSAALGLVGSAASAFDAAKDIMEALRSKHTNLASELEI 3360
            GNDQSLPQTSSNVNEGTQNALRRSAALGLVGSAASAFDAAKDIMEALRSKHTNLASELEI
Sbjct: 3301 GNDQSLPQTSSNVNEGTQNALRRSAALGLVGSAASAFDAAKDIMEALRSKHTNLASELEI 3360

Query: 3361 LLTEIGSRFVTLPEERLLAVVNALLHRCYKYPTATTAEVPQSLKKELSGVCKACFSADAV 3420
            LLTEIGSRFVTLPEERLLAVVNALLHRCYKYPTATTAEVPQSLKKELSGVCKACFSADAV
Sbjct: 3361 LLTEIGSRFVTLPEERLLAVVNALLHRCYKYPTATTAEVPQSLKKELSGVCKACFSADAV 3420

Query: 3421 NKHVDFVREYKQDFERDLDPESTSTFPATLSELTERLKHWKNVLQGNVEDRFPAVLRLED 3480
            NKHVDFVREYKQDFERDLDPESTSTFPATLSELTERLKHWKNVLQGNVEDRFPAVLRLED
Sbjct: 3421 NKHVDFVREYKQDFERDLDPESTSTFPATLSELTERLKHWKNVLQGNVEDRFPAVLRLED 3480

Query: 3481 ESRVLRDFHVVDVEVPGQYFTDQEIAPDHTVKLDRVGADIPIVRRHGSSFRRLTLIGSDG 3540
            ESRVLRDFHVVDVEVPGQYFTDQEIAPDHTVKLDRVGADIPIVRRHGSSFRRLTLIGSDG
Sbjct: 3481 ESRVLRDFHVVDVEVPGQYFTDQEIAPDHTVKLDRVGADIPIVRRHGSSFRRLTLIGSDG 3540

Query: 3541 SQRHFIVQTSLTPNARSDERILQLFRVMNQMFDKHKESRRRHLCIHTPIIIPVWSQVRMV 3600
            SQRHFIVQTSLTPNARSDERILQLFRVMNQMFDKHKESRRRHLCIHTPIIIPVWSQVRMV
Sbjct: 3541 SQRHFIVQTSLTPNARSDERILQLFRVMNQMFDKHKESRRRHLCIHTPIIIPVWSQVRMV 3600

Query: 3601 EDDLMYSTFLEVYENHCARNDQEADLPITYFKEQLNQAISGQIAPEAVLDLRLQAYGDIT 3660
            EDDLMYSTFLEVYENHCARNDQEADLPITYFKEQLNQAISGQIAPEAVLDLRLQAYGDIT
Sbjct: 3601 EDDLMYSTFLEVYENHCARNDQEADLPITYFKEQLNQAISGQIAPEAVLDLRLQAYGDIT 3660

Query: 3661 RNLVNEGIFSQYMYKTLLSGNHMWAFKKQFAIQLALSSFMSYMLQIGGRSPNKIYFAKNT 3720
            RNLVNEGIFSQYMYKTLLSGNHMWAFKKQFAIQLALSSFMSYMLQIGGRSPNKIYFAKNT
Sbjct: 3661 RNLVNEGIFSQYMYKTLLSGNHMWAFKKQFAIQLALSSFMSYMLQIGGRSPNKIYFAKNT 3720

Query: 3721 GKIFQTDFHPAYDTNGMIEFNEPVPFRLTRNMQAFFSHFGVEGLIVSAMCSAAQAVVSPK 3780
            GKIFQTDFHPAYDTNGMIEFNEPVPFRLTRNMQAFFSHFGVEGLIVSAMCSAAQAVVSPK
Sbjct: 3721 GKIFQTDFHPAYDTNGMIEFNEPVPFRLTRNMQAFFSHFGVEGLIVSAMCSAAQAVVSPK 3780

Query: 3781 QNQHLWHQLAMFFRDELLSWSWRRPLGMPLASIAGGGMSPADFKQKVTINVDHVIGRING 3840
            QNQHLWHQLAMFFRDELLSWSWRRPLGMPLASIAGGGMSPADFKQKVTINVDHVIGRING
Sbjct: 3781 QNQHLWHQLAMFFRDELLSWSWRRPLGMPLASIAGGGMSPADFKQKVTINVDHVIGRING 3840

Query: 3841 IAPQYFSEEEENAMDPPQSVQRGVSDLVDAALMPRHLCMMDPTWHPWF 3889
            IAPQYFSEEEENAMDPPQSVQRGVSDLV+AALMPRHLCMMDPTWHPWF
Sbjct: 3841 IAPQYFSEEEENAMDPPQSVQRGVSDLVNAALMPRHLCMMDPTWHPWF 3888

BLAST of MS010599 vs. ExPASy TrEMBL
Match: A0A1S3B1J8 (Non-specific serine/threonine protein kinase OS=Cucumis melo OX=3656 GN=LOC103485125 PE=4 SV=1)

HSP 1 Score: 7353.8 bits (19079), Expect = 0.0e+00
Identity = 3720/3891 (95.61%), Postives = 3801/3891 (97.69%), Query Frame = 0

Query: 1    MSPIQNFELHSRQLVEPELSIQTRLQMATEVRDSLEIAHTPEYLNFLKCYFRAFSIILVQ 60
            MSPIQNFE HSR LVEPEL+IQTRLQMATEVRDSLEIAHTPEYLNFLKCYFRAFS+IL++
Sbjct: 1    MSPIQNFEQHSRHLVEPELNIQTRLQMATEVRDSLEIAHTPEYLNFLKCYFRAFSVILLK 60

Query: 61   ITKPQYTDNHEHKLRNIVVEILNRLPHSEVLRPFVQDLLKVAMQVLTTDNEENGLICIRI 120
            ITKPQ+TD+HEHKLRNIVVEILNRLPHSEVLRPFVQDLLKVAMQVLTTDNEENGLICIRI
Sbjct: 61   ITKPQFTDSHEHKLRNIVVEILNRLPHSEVLRPFVQDLLKVAMQVLTTDNEENGLICIRI 120

Query: 121  IFDLLRNFRPTLENEVQPFLDFVCKIYQNFKLTVSHFFENSAAGGEDIKPMDVSTSTDQT 180
            IFDLLRNFRPTLENEVQPFLDFVCKIYQNFKLTVSHFFEN AA  ED+KPM+VSTS+DQ+
Sbjct: 121  IFDLLRNFRPTLENEVQPFLDFVCKIYQNFKLTVSHFFENPAASVEDVKPMEVSTSSDQS 180

Query: 181  ITTGYTGTVQLNPSTRSFKIVTESPLVVMFLFQLYSRLVQTNIPVLLPLMVSAISVPGPE 240
            + +G TGTVQLNPSTRSFKIVTESPLVVMFLFQLYSRLVQTNIP+LLPLMVSAISVPGPE
Sbjct: 181  MNSGCTGTVQLNPSTRSFKIVTESPLVVMFLFQLYSRLVQTNIPLLLPLMVSAISVPGPE 240

Query: 241  KVPPFLKTHFIELKGAQVKTVSFLTYLLRSSADYIRPHEESICKSIVNLLVTCSDSVSIR 300
            KVPP LKTHFIELKGAQVKTVSFLTYLLRSSADYIRPHEESICKSIVNLLVTCSDSVSIR
Sbjct: 241  KVPPSLKTHFIELKGAQVKTVSFLTYLLRSSADYIRPHEESICKSIVNLLVTCSDSVSIR 300

Query: 301  KELLVALKHVLGTEYKRGLFPLIDTLLEEKVLVGTGRACYETLRPLAYSLLAEIVHHVRG 360
            KELLVALKHVLGTEYKRGLFPLIDTLLEEKV+VGTGRACYETLRPLAYSLLAEIVHHVRG
Sbjct: 301  KELLVALKHVLGTEYKRGLFPLIDTLLEEKVVVGTGRACYETLRPLAYSLLAEIVHHVRG 360

Query: 361  DLTLSQLSRIIYLFSSNMHDASLSLSIHTTCARLMLNLVEPIFEKGVDQTSMDEARILLG 420
            DL+LSQLSRIIYLFSSNMHDASLSLSIHTTCARLMLNLVEPIFEKGVDQ SMDE+RILLG
Sbjct: 361  DLSLSQLSRIIYLFSSNMHDASLSLSIHTTCARLMLNLVEPIFEKGVDQASMDESRILLG 420

Query: 421  RILDAFVGKFSTFKHTIPQLLEEGEEGKDRANMRSKLELPVQAVLNLQVPVEHSKEVNDC 480
            RILD+FVGKFSTFKHTIPQLLEEGEEGKDRAN+RSKLELPVQAVLNLQVPVEHSKEVNDC
Sbjct: 421  RILDSFVGKFSTFKHTIPQLLEEGEEGKDRANLRSKLELPVQAVLNLQVPVEHSKEVNDC 480

Query: 481  KHLIKTLILGMKTIIWSITHAHLPRPQASPSPNGTHPQMLVSPSSNLATPQAFKGMREDE 540
            KHLIKTLI+GMKTI+WSITHAHLPR Q SPSPNGTHPQMLV+ SSNLATPQAFKGMREDE
Sbjct: 481  KHLIKTLIMGMKTIVWSITHAHLPRSQVSPSPNGTHPQMLVNSSSNLATPQAFKGMREDE 540

Query: 541  VCKASGVLKSGVHCLTLFKEKDEEVEMLHLFSQILTVMEPRDLMDMFSLCMPELFDCMIT 600
            VCKASGVLKSGVHCLTLFKEKDEEVEMLHLFSQILT+MEPRDLMDMFSLCMPELFDCMI+
Sbjct: 541  VCKASGVLKSGVHCLTLFKEKDEEVEMLHLFSQILTIMEPRDLMDMFSLCMPELFDCMIS 600

Query: 601  NTQLVHLFSTFLQTPKVYRPFADVLVNFLVSSKLDVLKHPDSPGAKLVLHLFRFVFGAVA 660
            NTQLVHLFSTFLQTPKVYRPFA+VLVNFLVSSKLD+LKHPDSPGAKLVLHLFRFVFGAV+
Sbjct: 601  NTQLVHLFSTFLQTPKVYRPFAEVLVNFLVSSKLDLLKHPDSPGAKLVLHLFRFVFGAVS 660

Query: 661  KAPSDFERILQPHVTVIMEVCVRSATEVERPLGYMQLLRIMFRALAGCKFELLLRDLIPL 720
            KAPSDFERILQPHVTVIMEVCV+SATEVERPLGYMQLLRIMFRALAGCKFELLLRDLI L
Sbjct: 661  KAPSDFERILQPHVTVIMEVCVKSATEVERPLGYMQLLRIMFRALAGCKFELLLRDLISL 720

Query: 721  LQPCLNMLLTMFDGPTGEDMRDLLLELCLTLPARLSSLLPHLPRLMKPLVLCLKGSDDLV 780
            LQPCLNMLLTM DGPTGEDMRDLLLELCLTLPARLSSLLPHLPRLMKPLVLCLKGSD+LV
Sbjct: 721  LQPCLNMLLTMLDGPTGEDMRDLLLELCLTLPARLSSLLPHLPRLMKPLVLCLKGSDELV 780

Query: 781  SLGLRTLEFWVDSLNPDFLEPSMANVMSEVILALWSHLRPIPYPWGAKALQVLGKLGGRN 840
             LGLRTLEFWVDSLNPDFLEPSMA VMSEVILALWSHLRP+PY WGAKALQVLGKLGGRN
Sbjct: 781  GLGLRTLEFWVDSLNPDFLEPSMATVMSEVILALWSHLRPMPYSWGAKALQVLGKLGGRN 840

Query: 841  RRFLKEPLALECKENPEHGLRLILTFEPSTPFLVPLDRCINLAVSTVMNKTGGVDSFYRK 900
            RRFLKEPL LECKENPEHGLRLILTFEPSTPFLVPLDRCINLAVS VMNKTGGVDSFYRK
Sbjct: 841  RRFLKEPLGLECKENPEHGLRLILTFEPSTPFLVPLDRCINLAVSAVMNKTGGVDSFYRK 900

Query: 901  QALKFLRVCLSSQLNLPGNVADDGHTPRQLSTLLVSPVDSSLRRSETPEGKADLGVKTKT 960
            QALKFLRVCLSSQLNLPG VADDG+TPRQLSTLLVS VDSS RRSETPE KADLGVKTKT
Sbjct: 901  QALKFLRVCLSSQLNLPGIVADDGYTPRQLSTLLVSSVDSSWRRSETPEAKADLGVKTKT 960

Query: 961  QLMAEKSVFKILLMTIIAAGSEEDLHEPKDDFVLNVCRHFAILFHIDSSLNSSPVASASL 1020
            QLMAEKSVFK+LLMTIIAAGS+EDL+EPKDDFVLNVCRHFAILFHIDSSLN+ PVASASL
Sbjct: 961  QLMAEKSVFKVLLMTIIAAGSDEDLNEPKDDFVLNVCRHFAILFHIDSSLNNPPVASASL 1020

Query: 1021 GSTLLPPNVSANSRLRSSACCNLKELDPLIFLDALVEVLADENRVHAKAALNALNLFSEI 1080
            GSTLLP NV+ANSRL+SSACCNLKELDPLIFLDALV+VLADENR+HAKAALNALNLFSE+
Sbjct: 1021 GSTLLPSNVNANSRLKSSACCNLKELDPLIFLDALVDVLADENRLHAKAALNALNLFSEM 1080

Query: 1081 LLFLARAKQTDVMMTRGPSTPMIVSSPSKSPVYSPPPSVRIPVFEQLLPRLLHCCYGSTW 1140
            LLFL R KQTDVMMTRGP TPM VSSP  SPVYSPPPSVRIPVFEQLLPRLLHCCYG TW
Sbjct: 1081 LLFLGRGKQTDVMMTRGPGTPMSVSSP-MSPVYSPPPSVRIPVFEQLLPRLLHCCYGCTW 1140

Query: 1141 QAQMGGIMGLGALVGKVTVETLCLFQVRIVRGLVYVLKRLPIYASKEQEETSQVLNQVLR 1200
            QAQMGG+MGLGALVGKVT+ETLC FQV+IVRGLVYVLKRLPIYASKEQEETSQVLN VLR
Sbjct: 1141 QAQMGGVMGLGALVGKVTIETLCHFQVKIVRGLVYVLKRLPIYASKEQEETSQVLNHVLR 1200

Query: 1201 VVNNVDEANSEPRRQSFHGVVDILASELFNPNSSTIVRKNVQSCLALLASRTGSEVSELL 1260
            VVNNVDEANSEPRRQSF GVVD+LASELFNPNSSTIVRKNVQSCLALLASRTGSEVSELL
Sbjct: 1201 VVNNVDEANSEPRRQSFQGVVDVLASELFNPNSSTIVRKNVQSCLALLASRTGSEVSELL 1260

Query: 1261 EPLHQPLLQPLLLRPLRLKTIDQQVGTVTALNFCLALRPPLLKLTQELVNFLQEALQIAE 1320
            EPL+QPLLQPLLLRPLRLKTIDQQVGTVTALNFCLALRPPLLKLTQELVNFLQEALQIAE
Sbjct: 1261 EPLYQPLLQPLLLRPLRLKTIDQQVGTVTALNFCLALRPPLLKLTQELVNFLQEALQIAE 1320

Query: 1321 ADETVWVVKFMNPKIATSLNKLRTACIELLCTTMAWADFKTPNHSELRAKIISMFFKSLT 1380
            ADETVWVVKFMNPK+ATSLNKLRTACIELLCTTMAWADFKTPNHSELRAKIISMFFKSLT
Sbjct: 1321 ADETVWVVKFMNPKVATSLNKLRTACIELLCTTMAWADFKTPNHSELRAKIISMFFKSLT 1380

Query: 1381 CRTPEVVAVAKEGLRQVINQQRMPKDLLQGSLRPILVNLAHTKNLSMPLLQGLARLLELL 1440
            CRTPEVVAVAKEGLRQVINQQRMPKDLLQGSLRPILVNLAHTKNLSMPLLQGLARLLELL
Sbjct: 1381 CRTPEVVAVAKEGLRQVINQQRMPKDLLQGSLRPILVNLAHTKNLSMPLLQGLARLLELL 1440

Query: 1441 ASWFNVTLGGKLLEHLKKWLEPEKLAQSQKAWKAGEEPKIAAAIIELFHLLPMAASKFLD 1500
            ASWFNVTLGGKLLEHLKKWLEPEKLAQ QKAWKAGEEPKIAAAIIELFHLLPMAASKFLD
Sbjct: 1441 ASWFNVTLGGKLLEHLKKWLEPEKLAQIQKAWKAGEEPKIAAAIIELFHLLPMAASKFLD 1500

Query: 1501 ELVTLTIDLEGALPPGQVYSEVNSPYRVPLIKFLNRYAPLAVDYFLARLSEPKYFRRFMY 1560
            ELVTLTIDLEGALPPGQVYSEVNSPYRVPLIKFLNRYAPLAVDYFLARLSEPKYFRRFMY
Sbjct: 1501 ELVTLTIDLEGALPPGQVYSEVNSPYRVPLIKFLNRYAPLAVDYFLARLSEPKYFRRFMY 1560

Query: 1561 IIRSDAGQPLREELAKSPQKILASAFPEFAPKSEAALTPGSSTSPAPLSGDEGLVTP-DA 1620
            IIRSDAGQPLREELAKSPQKILASAFPEF PKSE ALTPGSST PAPLSGDEGLVTP D 
Sbjct: 1561 IIRSDAGQPLREELAKSPQKILASAFPEFVPKSEPALTPGSSTPPAPLSGDEGLVTPSDV 1620

Query: 1621 SDPPSAPSSVVSDAYFRGLALIKTLVKLMPGWLQNNRVVFDTLVLVWKSPARIARLHNEQ 1680
            SDPPSAPS VVSDAYF GL L+KTLVKLMPGWLQ+NRVVFDTLV VWKSPARIARLHNEQ
Sbjct: 1621 SDPPSAPSGVVSDAYFCGLQLVKTLVKLMPGWLQSNRVVFDTLVAVWKSPARIARLHNEQ 1680

Query: 1681 ELNLVQVKESKWLVKCFLNYLRHEKAEVNVLFDILSIFLFHTRIDYTFLKEFYIIEVAEG 1740
            ELNLVQVKESKWLVKCFLNYLRHEKAEVNVLFDILSIFLFHTRIDYTFLKEFYIIEVAEG
Sbjct: 1681 ELNLVQVKESKWLVKCFLNYLRHEKAEVNVLFDILSIFLFHTRIDYTFLKEFYIIEVAEG 1740

Query: 1741 YPPNMKKALLLHFLNLFQSKQLGHDHLVIVMQMLILPMLAHAFQNGQSWEVVDQAIIKTI 1800
            YPPNMKKALLLHFLNLFQSKQLGHDHLV+VMQMLILPMLAHAFQNGQSWEVVDQAIIKTI
Sbjct: 1741 YPPNMKKALLLHFLNLFQSKQLGHDHLVVVMQMLILPMLAHAFQNGQSWEVVDQAIIKTI 1800

Query: 1801 VDKLLDPPEEVSAEYDEPLRIELLQLATLLLKYLQSDLVHHRKELIKFGWNHLKREDSAS 1860
            VDKLLDPPEEV+AEYDEPLRIELLQLATLLLKYLQSDLVHHRKELIKFGWNHLKREDSAS
Sbjct: 1801 VDKLLDPPEEVTAEYDEPLRIELLQLATLLLKYLQSDLVHHRKELIKFGWNHLKREDSAS 1860

Query: 1861 KQWAFVNVCHFLEAYQAPEKIILQVFVALLRTCQPENKMLVKQALDILMPALPRRLPLGD 1920
            KQWAFVNVCHFLEAYQAPEKIILQVFVALLRTCQPENKMLVKQALDILMPALPRRLPLGD
Sbjct: 1861 KQWAFVNVCHFLEAYQAPEKIILQVFVALLRTCQPENKMLVKQALDILMPALPRRLPLGD 1920

Query: 1921 SRMPIWIRYTKKILVEEGHSIPNLIHIFQLIVRHSDLFYSCRAQFVPQMVNSLSRLGLPY 1980
            SRMPIWIRYTKKILVEEGHSIPNLIHIFQLIVRHSDLFYSCRAQFVPQMVNSLSRLGLPY
Sbjct: 1921 SRMPIWIRYTKKILVEEGHSIPNLIHIFQLIVRHSDLFYSCRAQFVPQMVNSLSRLGLPY 1980

Query: 1981 NTTAENRRLAIDLAGLVVGWERQRQNEMKLVTESDAPSHSNDGLT-CPPGADPKRMVDGS 2040
            NTTAENRRLAIDLAGLVVGWERQRQNEMK VTESDAPSH+NDGLT CPPGAD KR+VDGS
Sbjct: 1981 NTTAENRRLAIDLAGLVVGWERQRQNEMKPVTESDAPSHNNDGLTSCPPGADSKRLVDGS 2040

Query: 2041 TFPEDSTKRVKVEPGLQSLCVMSPGGASSMPNIETPGSTTQPDEEFKPNAAMEEMIINFL 2100
            TF EDSTKRVKVEPGLQSLCVMSPGGASSMPNIETPGSTTQPDEEFKPNAAMEEMIINFL
Sbjct: 2041 TFSEDSTKRVKVEPGLQSLCVMSPGGASSMPNIETPGSTTQPDEEFKPNAAMEEMIINFL 2100

Query: 2101 IRVALVIEPKDKEATAMYKQALELLSQALEVWPNANVKFNYLEKLLSSIQPSQSKDPSTA 2160
            IRVALVIEPKDKEATAMYKQALELLSQALEVWPNANVKFNYLEKLLSSIQPSQSKDPSTA
Sbjct: 2101 IRVALVIEPKDKEATAMYKQALELLSQALEVWPNANVKFNYLEKLLSSIQPSQSKDPSTA 2160

Query: 2161 LAQGLDVMNKVLEKQPHLFVRNNINQISQILEPCFKHKMLDAGKSLCSLLRMVFVAYPLE 2220
            LAQGLDVMNKVLEKQPHLFVRNNINQISQILEPCFKHKMLDAGKSLCSLLRMVFVAYPLE
Sbjct: 2161 LAQGLDVMNKVLEKQPHLFVRNNINQISQILEPCFKHKMLDAGKSLCSLLRMVFVAYPLE 2220

Query: 2221 GVTTPPDVKLLYQKVDELIKNHINNLTAPQTSSEDNTASSISFVLLVIKTLTEVQKNLID 2280
            GVTTPPDVKLLYQKVDELIKNHINNLTAPQTSSEDNTASSISFVLLVIKTLTEVQKNLID
Sbjct: 2221 GVTTPPDVKLLYQKVDELIKNHINNLTAPQTSSEDNTASSISFVLLVIKTLTEVQKNLID 2280

Query: 2281 PYNLGRILQRLARDMGSSAGSHLRQGQRMDPDSAVTSSRQSADVGTVISNLKSVLKLINE 2340
            PYNLGRILQRLARDMGSSAGSHLRQGQRMDPDSAVTSSRQSADVGTVI+NLKSVLKLINE
Sbjct: 2281 PYNLGRILQRLARDMGSSAGSHLRQGQRMDPDSAVTSSRQSADVGTVIANLKSVLKLINE 2340

Query: 2341 RVMLVPECKRSVTQIMNSLLSEKGTDASVLLCILDVIKGWVEDDFSKMGTSVSSSSFLAP 2400
            RVMLVPECKRSVTQIMNSLLSEKGTDASVLLCILDVIKGW+EDDFSKMGTSVSSSSFLAP
Sbjct: 2341 RVMLVPECKRSVTQIMNSLLSEKGTDASVLLCILDVIKGWIEDDFSKMGTSVSSSSFLAP 2400

Query: 2401 KEIVSFLQKLSQVDKQNFSSSAAEEWDGKYLQLLYEICADSNKYPLSLRQEVFQKVERQF 2460
            KEIVSFLQKLSQVDKQNF+ SAAEEWDGKYLQLLYEICADSNKYP+SLRQEVFQKVERQF
Sbjct: 2401 KEIVSFLQKLSQVDKQNFAPSAAEEWDGKYLQLLYEICADSNKYPVSLRQEVFQKVERQF 2460

Query: 2461 MLGLRARDPETRKKFFTLYHESLGKTLFIRLQYIIQIQDWEALSDVFWLKQGLDLLLAVL 2520
            MLGLRARDPE RKKFFTLYHESLGKTLFIRLQYIIQIQDWEALSDVFWLKQGLDLLLAVL
Sbjct: 2461 MLGLRARDPEIRKKFFTLYHESLGKTLFIRLQYIIQIQDWEALSDVFWLKQGLDLLLAVL 2520

Query: 2521 VEDKPITLAPNSARLPPLLVSGHVADSSAVQPQVNDAQEGLEDAPLTFDSLVHKHAQFLN 2580
            VEDKPITLAPNSARLPPLLVSGHVADSS V   V D QEG+EDAPLTFDSLV KHAQFLN
Sbjct: 2521 VEDKPITLAPNSARLPPLLVSGHVADSSVVPHPVIDGQEGIEDAPLTFDSLVLKHAQFLN 2580

Query: 2581 RTSKLQVADLIIPLRELAHTDANVAYHLWVLVFPIVWVTLHKEEQVALAKPMISLLSKDY 2640
            R SKLQVADLIIPLRELAH DANVAYHLWVLVFPIVWVTLHKEEQVALAKPMI LLSKDY
Sbjct: 2581 RMSKLQVADLIIPLRELAHNDANVAYHLWVLVFPIVWVTLHKEEQVALAKPMIGLLSKDY 2640

Query: 2641 HKKQQASRPNVVQALLEGLQLSHPQPRMPSELIKYIGRTYNAWHIALALLESHVMLFMNE 2700
            HKKQQA RPNVVQALLEGLQLSHPQPRMPSELIKYIG+TYNAWHIALALLESHVMLFMNE
Sbjct: 2641 HKKQQAQRPNVVQALLEGLQLSHPQPRMPSELIKYIGKTYNAWHIALALLESHVMLFMNE 2700

Query: 2701 TKCSESLAELYRLLNEEDMRCGLWKRKAISAETKAGLSLVQHGYWQRAQILFYQSMVKAT 2760
            TKC+ESLAELYRLLNEEDMRCGLWKRKA +AETKAGLSLVQHGYWQRAQ LFYQSMVKAT
Sbjct: 2701 TKCAESLAELYRLLNEEDMRCGLWKRKANTAETKAGLSLVQHGYWQRAQSLFYQSMVKAT 2760

Query: 2761 QGTYNNNVPKAEMCLWEEQWLYCASQLSQWEALVDFGKSIENYEILLDSLWKVPDWAYMK 2820
            QGTYNN VPKAEMCLWEEQWL CASQLSQWEAL DFGKSIENYEILLDSLWKVPDWAYMK
Sbjct: 2761 QGTYNNTVPKAEMCLWEEQWLSCASQLSQWEALADFGKSIENYEILLDSLWKVPDWAYMK 2820

Query: 2821 EHVIPKAQVEETPKLRLIQAYFSLHDRSTNGVADAENIVGKGVDLALEQWWQLPEMSVHA 2880
            EHVIPKAQVEETPKLRLIQAYFSLHDRS NGVADAENIVGKGVDLALEQWWQLPEMSVHA
Sbjct: 2821 EHVIPKAQVEETPKLRLIQAYFSLHDRSANGVADAENIVGKGVDLALEQWWQLPEMSVHA 2880

Query: 2881 RIPLLQQFQQLVEVQESSRILVDIANGNKHSGSSVVSVHTNLYADLKDILETWRLRIPNE 2940
            RIPLLQQFQQLVEVQESSRILVDIANGNKHSGSSVV VH+NLYADLKDILETWRLRIPNE
Sbjct: 2881 RIPLLQQFQQLVEVQESSRILVDIANGNKHSGSSVVGVHSNLYADLKDILETWRLRIPNE 2940

Query: 2941 WDSMTVWCDLLQWRNEMYNAVIDAFKDFGTTNSQLHHLGFRDKAWNVNKLAHVARKQGLH 3000
            WDSMTVWCDLLQWRNEMYNAVIDAFKDFG TNSQLHHLGFRDKAWNVNKLAHVARKQGL+
Sbjct: 2941 WDSMTVWCDLLQWRNEMYNAVIDAFKDFGNTNSQLHHLGFRDKAWNVNKLAHVARKQGLY 3000

Query: 3001 DVCVGILEKMYGHSTMEVQEAFVKIREQAKAYLEMKGELTSGLNLINSTNLDYFPVKHKA 3060
            DVCV IL+KMYGHSTMEVQEAFVKIREQAKAYLEMKGELTSGLNLINSTNL+YFPVKHKA
Sbjct: 3001 DVCVAILDKMYGHSTMEVQEAFVKIREQAKAYLEMKGELTSGLNLINSTNLEYFPVKHKA 3060

Query: 3061 EIFRLKGDFQLKLSDSEGANHSYSSAISLFKNLPKGWISWGNYCDMAYKESHEEIWLEYA 3120
            EI+RLKGDFQLKLSDSEGAN SYS+AI+LFKNLPKGWISWGNYCDMAYKESH+E WLEYA
Sbjct: 3061 EIYRLKGDFQLKLSDSEGANQSYSNAITLFKNLPKGWISWGNYCDMAYKESHDETWLEYA 3120

Query: 3121 VSCFLQGIKFGISNSRNHLARVLYLLSFDTPNEPVGRAFDKYLDQIPHWVWLSWIPQLLL 3180
            VSCFLQGIKFGISNSRNHLARVLYLLSFDTPNEPVGRAFDK+LDQIPHWVWLSWIPQLLL
Sbjct: 3121 VSCFLQGIKFGISNSRNHLARVLYLLSFDTPNEPVGRAFDKFLDQIPHWVWLSWIPQLLL 3180

Query: 3181 SLQRTEAPHCKLVLLKIANVYPQALYYWLRTYLLERRDVANKSELGRMAMAQQRMQQNTS 3240
            SLQRTEAPHCKLVLLKIANVYPQALYYWLRTYLLERRDVANKSELGRMAMAQQRMQQN +
Sbjct: 3181 SLQRTEAPHCKLVLLKIANVYPQALYYWLRTYLLERRDVANKSELGRMAMAQQRMQQNAA 3240

Query: 3241 SAGSLGLTDGSSRVAHGG-SSTSTDNQVHQGTQSGSGIGSHDGGNSHSQEPERSTGVESS 3300
            SAGSLGL DG SR  HGG SST  DNQVHQGTQSGS IGSHDGGN+HSQEPER+TG +SS
Sbjct: 3241 SAGSLGLADGGSRAGHGGSSSTPADNQVHQGTQSGSAIGSHDGGNAHSQEPERTTGADSS 3300

Query: 3301 THAGNDQSLPQTSSNVNEGTQNALRRSAALGLVGSAASAFDAAKDIMEALRSKHTNLASE 3360
            THAGNDQSLPQ SSNVNEGTQNALRRSAALGLVGSAASAFDAAKDIMEALRSKHTNLASE
Sbjct: 3301 THAGNDQSLPQPSSNVNEGTQNALRRSAALGLVGSAASAFDAAKDIMEALRSKHTNLASE 3360

Query: 3361 LEILLTEIGSRFVTLPEERLLAVVNALLHRCYKYPTATTAEVPQSLKKELSGVCKACFSA 3420
            LEILLTEIGSRFVTLPEERLLAVVNALLHRCYKYPTATTAEVPQSLKKELSGVCKACFSA
Sbjct: 3361 LEILLTEIGSRFVTLPEERLLAVVNALLHRCYKYPTATTAEVPQSLKKELSGVCKACFSA 3420

Query: 3421 DAVNKHVDFVREYKQDFERDLDPESTSTFPATLSELTERLKHWKNVLQGNVEDRFPAVLR 3480
            DAVNKHVDFVREYKQDFERDLDPESTSTFPATLSELTERLKHWKNVLQGNVEDRFPAVL+
Sbjct: 3421 DAVNKHVDFVREYKQDFERDLDPESTSTFPATLSELTERLKHWKNVLQGNVEDRFPAVLK 3480

Query: 3481 LEDESRVLRDFHVVDVEVPGQYFTDQEIAPDHTVKLDRVGADIPIVRRHGSSFRRLTLIG 3540
            LE+ESRVLRDFHVVDVEVPGQYFTDQEIAPDHTVKLDRVGADIPIVRRHGSSFRRLTLIG
Sbjct: 3481 LEEESRVLRDFHVVDVEVPGQYFTDQEIAPDHTVKLDRVGADIPIVRRHGSSFRRLTLIG 3540

Query: 3541 SDGSQRHFIVQTSLTPNARSDERILQLFRVMNQMFDKHKESRRRHLCIHTPIIIPVWSQV 3600
            SDGSQRHFIVQTSLTPNARSDERILQLFRVMNQMFDKHKESRRRHLCIHTPIIIPVWSQV
Sbjct: 3541 SDGSQRHFIVQTSLTPNARSDERILQLFRVMNQMFDKHKESRRRHLCIHTPIIIPVWSQV 3600

Query: 3601 RMVEDDLMYSTFLEVYENHCARNDQEADLPITYFKEQLNQAISGQIAPEAVLDLRLQAYG 3660
            RMVEDDLMYSTFLEVYENHCARNDQEADLPITYFKEQLNQAISGQI PEAV+DLRLQA+G
Sbjct: 3601 RMVEDDLMYSTFLEVYENHCARNDQEADLPITYFKEQLNQAISGQILPEAVVDLRLQAFG 3660

Query: 3661 DITRNLVNEGIFSQYMYKTLLSGNHMWAFKKQFAIQLALSSFMSYMLQIGGRSPNKIYFA 3720
            DITRNLVNEGIFSQYMYKTLLSGNHMWAFKKQFAIQLALSSFMSYMLQIGGRSPNKIYFA
Sbjct: 3661 DITRNLVNEGIFSQYMYKTLLSGNHMWAFKKQFAIQLALSSFMSYMLQIGGRSPNKIYFA 3720

Query: 3721 KNTGKIFQTDFHPAYDTNGMIEFNEPVPFRLTRNMQAFFSHFGVEGLIVSAMCSAAQAVV 3780
            KNTGKIFQTDFHPAYD NGMIEFNEPVPFRLTRNMQAFFS+FGVEGLIVSAMCSAAQAVV
Sbjct: 3721 KNTGKIFQTDFHPAYDANGMIEFNEPVPFRLTRNMQAFFSNFGVEGLIVSAMCSAAQAVV 3780

Query: 3781 SPKQNQHLWHQLAMFFRDELLSWSWRRPLGMPLASIAGGGMSPADFKQKVTINVDHVIGR 3840
            SPKQNQHLWHQLAMFFRDELLSWSWRRPLGMPLASIA GGM+PADFKQKVT NVD VIGR
Sbjct: 3781 SPKQNQHLWHQLAMFFRDELLSWSWRRPLGMPLASIAAGGMNPADFKQKVTTNVDLVIGR 3840

Query: 3841 INGIAPQYFSEEEENAMDPPQSVQRGVSDLVDAALMPRHLCMMDPTWHPWF 3889
            INGIAPQYFSEEEENAMDPPQSVQRGVS+LVDAAL P++LCMMDPTWHPWF
Sbjct: 3841 INGIAPQYFSEEEENAMDPPQSVQRGVSELVDAALQPKNLCMMDPTWHPWF 3890

BLAST of MS010599 vs. ExPASy TrEMBL
Match: A0A0A0KKP1 (Non-specific serine/threonine protein kinase OS=Cucumis sativus OX=3659 GN=Csa_6G505900 PE=4 SV=1)

HSP 1 Score: 7342.3 bits (19049), Expect = 0.0e+00
Identity = 3715/3890 (95.50%), Postives = 3795/3890 (97.56%), Query Frame = 0

Query: 1    MSPIQNFELHSRQLVEPELSIQTRLQMATEVRDSLEIAHTPEYLNFLKCYFRAFSIILVQ 60
            MSPIQNFE HSR LVEPEL+IQTRLQMATEVRDSLEIAHTPEYLNFLKCYFRAFS+IL++
Sbjct: 1    MSPIQNFEQHSRHLVEPELNIQTRLQMATEVRDSLEIAHTPEYLNFLKCYFRAFSVILLK 60

Query: 61   ITKPQYTDNHEHKLRNIVVEILNRLPHSEVLRPFVQDLLKVAMQVLTTDNEENGLICIRI 120
            ITKPQ+TD+HEHKLRNIVVEILNRLPHSEVLRPFVQDLLKVAMQVLTTDNEENGLICIRI
Sbjct: 61   ITKPQFTDSHEHKLRNIVVEILNRLPHSEVLRPFVQDLLKVAMQVLTTDNEENGLICIRI 120

Query: 121  IFDLLRNFRPTLENEVQPFLDFVCKIYQNFKLTVSHFFENSAAGGEDIKPMDVSTSTDQT 180
            IFDLLRNFRPTLENEVQPFLDFVCKIYQNFKLTVSHFFEN +A  ED+KPM+VSTS+DQ+
Sbjct: 121  IFDLLRNFRPTLENEVQPFLDFVCKIYQNFKLTVSHFFENPSASVEDVKPMEVSTSSDQS 180

Query: 181  ITTGYTGTVQLNPSTRSFKIVTESPLVVMFLFQLYSRLVQTNIPVLLPLMVSAISVPGPE 240
            + +G TGTVQLNPSTRSFKIVTESPLVVMFLFQLYSRLV TNIP LLPLMVSAISVPGPE
Sbjct: 181  MNSGCTGTVQLNPSTRSFKIVTESPLVVMFLFQLYSRLVHTNIPHLLPLMVSAISVPGPE 240

Query: 241  KVPPFLKTHFIELKGAQVKTVSFLTYLLRSSADYIRPHEESICKSIVNLLVTCSDSVSIR 300
            KVPP LKTHFIELKGAQVKTVSFLTYLLRSSADYIRPHEESICKSIVNLLVTCSDSVSIR
Sbjct: 241  KVPPSLKTHFIELKGAQVKTVSFLTYLLRSSADYIRPHEESICKSIVNLLVTCSDSVSIR 300

Query: 301  KELLVALKHVLGTEYKRGLFPLIDTLLEEKVLVGTGRACYETLRPLAYSLLAEIVHHVRG 360
            KELLVALKHVLGTEYKRGLFPLIDTLLEEKV+VGTGRACYETLRPLAYSLLAEIVHHVR 
Sbjct: 301  KELLVALKHVLGTEYKRGLFPLIDTLLEEKVVVGTGRACYETLRPLAYSLLAEIVHHVRV 360

Query: 361  DLTLSQLSRIIYLFSSNMHDASLSLSIHTTCARLMLNLVEPIFEKGVDQTSMDEARILLG 420
            DL+L QLSRIIYLFSSNMHDASLSLSIHTTCARLMLNLVEPIFEKGVDQTSMDE+RILLG
Sbjct: 361  DLSLPQLSRIIYLFSSNMHDASLSLSIHTTCARLMLNLVEPIFEKGVDQTSMDESRILLG 420

Query: 421  RILDAFVGKFSTFKHTIPQLLEEGEEGKDRANMRSKLELPVQAVLNLQVPVEHSKEVNDC 480
            RILD+FVGKFSTFKHTIPQLLEEGEEGKDRAN+RSKLELPVQAVLNLQVPVEHSKEVNDC
Sbjct: 421  RILDSFVGKFSTFKHTIPQLLEEGEEGKDRANLRSKLELPVQAVLNLQVPVEHSKEVNDC 480

Query: 481  KHLIKTLILGMKTIIWSITHAHLPRPQASPSPNGTHPQMLVSPSSNLATPQAFKGMREDE 540
            KHLIKTLI+GMKTIIWSITHAHLPR Q SPSPNGTHPQMLV+PSSNLATPQA KGMREDE
Sbjct: 481  KHLIKTLIMGMKTIIWSITHAHLPRSQVSPSPNGTHPQMLVNPSSNLATPQALKGMREDE 540

Query: 541  VCKASGVLKSGVHCLTLFKEKDEEVEMLHLFSQILTVMEPRDLMDMFSLCMPELFDCMIT 600
            VCKASGVLKSGVHCLTLFKEKDEEVEMLHLFSQILT+MEPRDLMDMFSLCMPELFDCMI+
Sbjct: 541  VCKASGVLKSGVHCLTLFKEKDEEVEMLHLFSQILTIMEPRDLMDMFSLCMPELFDCMIS 600

Query: 601  NTQLVHLFSTFLQTPKVYRPFADVLVNFLVSSKLDVLKHPDSPGAKLVLHLFRFVFGAVA 660
            NTQLVHLFSTFLQTPKVYRPFA+VLVNFLVSSKLD+LKHPDSPGAKLVLHLFRFVFGAV+
Sbjct: 601  NTQLVHLFSTFLQTPKVYRPFAEVLVNFLVSSKLDLLKHPDSPGAKLVLHLFRFVFGAVS 660

Query: 661  KAPSDFERILQPHVTVIMEVCVRSATEVERPLGYMQLLRIMFRALAGCKFELLLRDLIPL 720
            KAPSDFERILQPHVTVIMEVCV+SATEVERPLGYMQLLRIMFRALAGCKFELLLRDLI L
Sbjct: 661  KAPSDFERILQPHVTVIMEVCVKSATEVERPLGYMQLLRIMFRALAGCKFELLLRDLISL 720

Query: 721  LQPCLNMLLTMFDGPTGEDMRDLLLELCLTLPARLSSLLPHLPRLMKPLVLCLKGSDDLV 780
            LQPCLNMLLTM DGPTGEDMRDLLLELCLTLPARLSSLLPHLPRLMKPLVLCLKGSD+LV
Sbjct: 721  LQPCLNMLLTMLDGPTGEDMRDLLLELCLTLPARLSSLLPHLPRLMKPLVLCLKGSDELV 780

Query: 781  SLGLRTLEFWVDSLNPDFLEPSMANVMSEVILALWSHLRPIPYPWGAKALQVLGKLGGRN 840
             LGLRTLEFWVDSLNPDFLEPSMA VMSEVILALWSHLRP+PY WGAKALQVLGKLGGRN
Sbjct: 781  GLGLRTLEFWVDSLNPDFLEPSMATVMSEVILALWSHLRPMPYSWGAKALQVLGKLGGRN 840

Query: 841  RRFLKEPLALECKENPEHGLRLILTFEPSTPFLVPLDRCINLAVSTVMNKTGGVDSFYRK 900
            RRFLKEPLALECKENPEHGLRLILTFEPSTPFLVPLDRCINLAVS VMNKTGGVDSFYRK
Sbjct: 841  RRFLKEPLALECKENPEHGLRLILTFEPSTPFLVPLDRCINLAVSAVMNKTGGVDSFYRK 900

Query: 901  QALKFLRVCLSSQLNLPGNVADDGHTPRQLSTLLVSPVDSSLRRSETPEGKADLGVKTKT 960
            QALKFLRVCLSSQLNLPG VADDG+TPRQLSTLLVS VDSS RRSETPE KADLGVKTKT
Sbjct: 901  QALKFLRVCLSSQLNLPGIVADDGYTPRQLSTLLVSSVDSSWRRSETPEAKADLGVKTKT 960

Query: 961  QLMAEKSVFKILLMTIIAAGSEEDLHEPKDDFVLNVCRHFAILFHIDSSLNSSPVASASL 1020
            QLMAEKSVFK+LLMTIIAAGSEEDL+EPKDDFVLNVCRHFAILFHIDSSLN+ PVASAS 
Sbjct: 961  QLMAEKSVFKLLLMTIIAAGSEEDLNEPKDDFVLNVCRHFAILFHIDSSLNNPPVASASH 1020

Query: 1021 GSTLLPPNVSANSRLRSSACCNLKELDPLIFLDALVEVLADENRVHAKAALNALNLFSEI 1080
            GSTLLP NV+ANSRL+SSACCNLKELDPLIFLDALVEVLADENR+HAKAALNALNLFSE+
Sbjct: 1021 GSTLLPSNVNANSRLKSSACCNLKELDPLIFLDALVEVLADENRIHAKAALNALNLFSEM 1080

Query: 1081 LLFLARAKQTDVMMTRGPSTPMIVSSPSKSPVYSPPPSVRIPVFEQLLPRLLHCCYGSTW 1140
            LLFL R KQTDVMMTRGP TPM VSSP  SPVYSPPPSVRIPVFEQLLPRLLHCCYG +W
Sbjct: 1081 LLFLGRGKQTDVMMTRGPGTPMSVSSP-MSPVYSPPPSVRIPVFEQLLPRLLHCCYGCSW 1140

Query: 1141 QAQMGGIMGLGALVGKVTVETLCLFQVRIVRGLVYVLKRLPIYASKEQEETSQVLNQVLR 1200
            QAQMGG++GLGALVGKVTVETLC FQV+IVRGLVYVLKRLPIYASKEQEETSQVLN VLR
Sbjct: 1141 QAQMGGVIGLGALVGKVTVETLCHFQVKIVRGLVYVLKRLPIYASKEQEETSQVLNHVLR 1200

Query: 1201 VVNNVDEANSEPRRQSFHGVVDILASELFNPNSSTIVRKNVQSCLALLASRTGSEVSELL 1260
            VVNNVDEANSEPRRQSF GVVD+LASELFNPNSSTIVRKNVQSCLALLASRTGSEVSELL
Sbjct: 1201 VVNNVDEANSEPRRQSFQGVVDVLASELFNPNSSTIVRKNVQSCLALLASRTGSEVSELL 1260

Query: 1261 EPLHQPLLQPLLLRPLRLKTIDQQVGTVTALNFCLALRPPLLKLTQELVNFLQEALQIAE 1320
            EPL+QPLLQPLLLRPLRLKTIDQQVGTVTALNFCLALRPPLLKLTQELVNFLQEALQIAE
Sbjct: 1261 EPLYQPLLQPLLLRPLRLKTIDQQVGTVTALNFCLALRPPLLKLTQELVNFLQEALQIAE 1320

Query: 1321 ADETVWVVKFMNPKIATSLNKLRTACIELLCTTMAWADFKTPNHSELRAKIISMFFKSLT 1380
            ADETVWVVKFMNPK+ATSLNKLRTACIELLCTTMAWADFKTPNHSELRAKIISMFFKSLT
Sbjct: 1321 ADETVWVVKFMNPKVATSLNKLRTACIELLCTTMAWADFKTPNHSELRAKIISMFFKSLT 1380

Query: 1381 CRTPEVVAVAKEGLRQVINQQRMPKDLLQGSLRPILVNLAHTKNLSMPLLQGLARLLELL 1440
            CRTPEVVAVAKEGLRQVINQQRMPKDLLQGSLRPILVNLAHTKNLSMPLLQGLARLLELL
Sbjct: 1381 CRTPEVVAVAKEGLRQVINQQRMPKDLLQGSLRPILVNLAHTKNLSMPLLQGLARLLELL 1440

Query: 1441 ASWFNVTLGGKLLEHLKKWLEPEKLAQSQKAWKAGEEPKIAAAIIELFHLLPMAASKFLD 1500
            ASWFNVTLGGKLLEHLKKWLEPEKLAQ QKAWKAGEEPKIAAAIIELFHLLPMAASKFLD
Sbjct: 1441 ASWFNVTLGGKLLEHLKKWLEPEKLAQIQKAWKAGEEPKIAAAIIELFHLLPMAASKFLD 1500

Query: 1501 ELVTLTIDLEGALPPGQVYSEVNSPYRVPLIKFLNRYAPLAVDYFLARLSEPKYFRRFMY 1560
            ELVTLTIDLEGALPPGQVYSEVNSPYRVPLIKFLNRYAPLAVDYFLARLSEPKYFRRFMY
Sbjct: 1501 ELVTLTIDLEGALPPGQVYSEVNSPYRVPLIKFLNRYAPLAVDYFLARLSEPKYFRRFMY 1560

Query: 1561 IIRSDAGQPLREELAKSPQKILASAFPEFAPKSEAALTPGSSTSPAPLSGDEGLVTP-DA 1620
            IIRSDAGQPLREELAKSPQKILASAFPEF PKSE ALTPGSST PAPLSGDEGLVTP D 
Sbjct: 1561 IIRSDAGQPLREELAKSPQKILASAFPEFVPKSEPALTPGSSTPPAPLSGDEGLVTPSDV 1620

Query: 1621 SDPPSAPSSVVSDAYFRGLALIKTLVKLMPGWLQNNRVVFDTLVLVWKSPARIARLHNEQ 1680
            SDPPSA SSVV DAYF GLAL+KTLVKLMPGWLQ+NRVVFDTLV VWKSPARIARLHNEQ
Sbjct: 1621 SDPPSASSSVVPDAYFCGLALVKTLVKLMPGWLQSNRVVFDTLVAVWKSPARIARLHNEQ 1680

Query: 1681 ELNLVQVKESKWLVKCFLNYLRHEKAEVNVLFDILSIFLFHTRIDYTFLKEFYIIEVAEG 1740
            ELNLVQVKESKWLVKCFLNYLRHEKAEVNVLFDILSIFLFHTRIDYTFLKEFYIIEVAEG
Sbjct: 1681 ELNLVQVKESKWLVKCFLNYLRHEKAEVNVLFDILSIFLFHTRIDYTFLKEFYIIEVAEG 1740

Query: 1741 YPPNMKKALLLHFLNLFQSKQLGHDHLVIVMQMLILPMLAHAFQNGQSWEVVDQAIIKTI 1800
            YPPNMKKALLLHFLNLFQSKQLGHDHLV+VMQMLILPMLAHAFQNGQSWEVVDQAIIKTI
Sbjct: 1741 YPPNMKKALLLHFLNLFQSKQLGHDHLVVVMQMLILPMLAHAFQNGQSWEVVDQAIIKTI 1800

Query: 1801 VDKLLDPPEEVSAEYDEPLRIELLQLATLLLKYLQSDLVHHRKELIKFGWNHLKREDSAS 1860
            VDKLLDPPEEV+AEYDEPLRIELLQLATLLLKYLQSDLVHHRKELIKFGWNHLKREDSAS
Sbjct: 1801 VDKLLDPPEEVTAEYDEPLRIELLQLATLLLKYLQSDLVHHRKELIKFGWNHLKREDSAS 1860

Query: 1861 KQWAFVNVCHFLEAYQAPEKIILQVFVALLRTCQPENKMLVKQALDILMPALPRRLPLGD 1920
            KQWAFVNVCHFLEAYQAPEKIILQVFVALLRTCQPENKMLVKQALDILMPALPRRLPLGD
Sbjct: 1861 KQWAFVNVCHFLEAYQAPEKIILQVFVALLRTCQPENKMLVKQALDILMPALPRRLPLGD 1920

Query: 1921 SRMPIWIRYTKKILVEEGHSIPNLIHIFQLIVRHSDLFYSCRAQFVPQMVNSLSRLGLPY 1980
            SRMPIWIRYTKKILVEEGHSIPNLIHIFQLIVRHSDLFYSCRAQFVPQMVNSLSRLGLPY
Sbjct: 1921 SRMPIWIRYTKKILVEEGHSIPNLIHIFQLIVRHSDLFYSCRAQFVPQMVNSLSRLGLPY 1980

Query: 1981 NTTAENRRLAIDLAGLVVGWERQRQNEMKLVTESDAPSHSNDGLT-CPPGADPKRMVDGS 2040
            NTTAENRRLAIDLAGLVVGWERQRQNEMK VTESDAPSH+NDGLT CPPGAD KR+VDGS
Sbjct: 1981 NTTAENRRLAIDLAGLVVGWERQRQNEMKPVTESDAPSHNNDGLTSCPPGADSKRLVDGS 2040

Query: 2041 TFPEDSTKRVKVEPGLQSLCVMSPGGASSMPNIETPGSTTQPDEEFKPNAAMEEMIINFL 2100
            TF EDSTKRVKVEPGLQSLCVMSPGGASSMPNIETPGSTTQPDEEFKPNAAMEEMIINFL
Sbjct: 2041 TFSEDSTKRVKVEPGLQSLCVMSPGGASSMPNIETPGSTTQPDEEFKPNAAMEEMIINFL 2100

Query: 2101 IRVALVIEPKDKEATAMYKQALELLSQALEVWPNANVKFNYLEKLLSSIQPSQSKDPSTA 2160
            IRVALVIEPKDKEATAMYKQALELLSQALEVWPNANVKFNYLEKLLSSIQPSQSKDPSTA
Sbjct: 2101 IRVALVIEPKDKEATAMYKQALELLSQALEVWPNANVKFNYLEKLLSSIQPSQSKDPSTA 2160

Query: 2161 LAQGLDVMNKVLEKQPHLFVRNNINQISQILEPCFKHKMLDAGKSLCSLLRMVFVAYPLE 2220
            LAQGLDVMNKVLEKQPHLFVRNNINQISQILEPCFKHKMLDAGKSLCSLLRMVFVAYPLE
Sbjct: 2161 LAQGLDVMNKVLEKQPHLFVRNNINQISQILEPCFKHKMLDAGKSLCSLLRMVFVAYPLE 2220

Query: 2221 GVTTPPDVKLLYQKVDELIKNHINNLTAPQTSSEDNTASSISFVLLVIKTLTEVQKNLID 2280
            GVTTPPDVKLLYQKVDELIKNHINNLTAPQTSSEDNTASSISFVLLVIKTLTEVQKNLID
Sbjct: 2221 GVTTPPDVKLLYQKVDELIKNHINNLTAPQTSSEDNTASSISFVLLVIKTLTEVQKNLID 2280

Query: 2281 PYNLGRILQRLARDMGSSAGSHLRQGQRMDPDSAVTSSRQSADVGTVISNLKSVLKLINE 2340
            PYNLGRILQRLARDMGSSAGSHLRQGQRMDPDSAVTSSRQSADVGTVISNLKSVLKLINE
Sbjct: 2281 PYNLGRILQRLARDMGSSAGSHLRQGQRMDPDSAVTSSRQSADVGTVISNLKSVLKLINE 2340

Query: 2341 RVMLVPECKRSVTQIMNSLLSEKGTDASVLLCILDVIKGWVEDDFSKMGTSVSSSSFLAP 2400
            RVMLVPECKRSVTQIMNSLLSEKGTDASVLLCILDVIKGW+EDDFSKMGTSVSSSSFLAP
Sbjct: 2341 RVMLVPECKRSVTQIMNSLLSEKGTDASVLLCILDVIKGWIEDDFSKMGTSVSSSSFLAP 2400

Query: 2401 KEIVSFLQKLSQVDKQNFSSSAAEEWDGKYLQLLYEICADSNKYPLSLRQEVFQKVERQF 2460
            KEIVSFLQKLSQVDKQNFSSSAAEEWD KYLQLLYEICADSNKYP+SLRQEVFQKVERQF
Sbjct: 2401 KEIVSFLQKLSQVDKQNFSSSAAEEWDEKYLQLLYEICADSNKYPVSLRQEVFQKVERQF 2460

Query: 2461 MLGLRARDPETRKKFFTLYHESLGKTLFIRLQYIIQIQDWEALSDVFWLKQGLDLLLAVL 2520
            MLGLRARDPE RKKFFTLYHESLGKTLFIRLQYIIQIQDWEALSDVFWLKQGLDLLLAVL
Sbjct: 2461 MLGLRARDPEVRKKFFTLYHESLGKTLFIRLQYIIQIQDWEALSDVFWLKQGLDLLLAVL 2520

Query: 2521 VEDKPITLAPNSARLPPLLVSGHVADSSAVQPQVNDAQEGLEDAPLTFDSLVHKHAQFLN 2580
            VEDKPITLAPNSARLPPLLVSGHV DSS V   V D QEG+EDAPLTFDSLV KHAQFLN
Sbjct: 2521 VEDKPITLAPNSARLPPLLVSGHVGDSSVVPHPVIDGQEGIEDAPLTFDSLVLKHAQFLN 2580

Query: 2581 RTSKLQVADLIIPLRELAHTDANVAYHLWVLVFPIVWVTLHKEEQVALAKPMISLLSKDY 2640
            R SKLQVADLIIPLRELAH DANVAYHLWVLVFPIVWVTLHKEEQVALAKPMI LLSKDY
Sbjct: 2581 RMSKLQVADLIIPLRELAHNDANVAYHLWVLVFPIVWVTLHKEEQVALAKPMIGLLSKDY 2640

Query: 2641 HKKQQASRPNVVQALLEGLQLSHPQPRMPSELIKYIGRTYNAWHIALALLESHVMLFMNE 2700
            HKKQQA RPNVVQALLEGLQLSHPQPRMPSELIKYIG+TYNAWHIALALLESHVMLFMNE
Sbjct: 2641 HKKQQAHRPNVVQALLEGLQLSHPQPRMPSELIKYIGKTYNAWHIALALLESHVMLFMNE 2700

Query: 2701 TKCSESLAELYRLLNEEDMRCGLWKRKAISAETKAGLSLVQHGYWQRAQILFYQSMVKAT 2760
            TKC+ESLAELYRLLNEEDMRCGLWKRKA +AETKAGLSLVQHGYWQRAQ LFYQSMVKAT
Sbjct: 2701 TKCAESLAELYRLLNEEDMRCGLWKRKANTAETKAGLSLVQHGYWQRAQSLFYQSMVKAT 2760

Query: 2761 QGTYNNNVPKAEMCLWEEQWLYCASQLSQWEALVDFGKSIENYEILLDSLWKVPDWAYMK 2820
            QGTYNN VPKAEMCLWEEQWL CASQLSQWEAL DFGKSIENYEILLDSLWKVPDWAYMK
Sbjct: 2761 QGTYNNTVPKAEMCLWEEQWLCCASQLSQWEALADFGKSIENYEILLDSLWKVPDWAYMK 2820

Query: 2821 EHVIPKAQVEETPKLRLIQAYFSLHDRSTNGVADAENIVGKGVDLALEQWWQLPEMSVHA 2880
            EHVIPKAQVEETPKLRLIQAYFSLHD+  NGVADAENIVGKGVDLALEQWWQLPEMSVHA
Sbjct: 2821 EHVIPKAQVEETPKLRLIQAYFSLHDKGANGVADAENIVGKGVDLALEQWWQLPEMSVHA 2880

Query: 2881 RIPLLQQFQQLVEVQESSRILVDIANGNKHSGSSVVSVHTNLYADLKDILETWRLRIPNE 2940
            RIPLLQQFQQLVEVQESSRILVDIANGNKHSGSSVV VH+NLYADLKDILETWRLRIPNE
Sbjct: 2881 RIPLLQQFQQLVEVQESSRILVDIANGNKHSGSSVVGVHSNLYADLKDILETWRLRIPNE 2940

Query: 2941 WDSMTVWCDLLQWRNEMYNAVIDAFKDFGTTNSQLHHLGFRDKAWNVNKLAHVARKQGLH 3000
            WD MTVWCDLLQWRNEMYNAVIDAFKDFG TNSQLHHLGFRDKAWNVNKLAHVARKQGL+
Sbjct: 2941 WDGMTVWCDLLQWRNEMYNAVIDAFKDFGNTNSQLHHLGFRDKAWNVNKLAHVARKQGLY 3000

Query: 3001 DVCVGILEKMYGHSTMEVQEAFVKIREQAKAYLEMKGELTSGLNLINSTNLDYFPVKHKA 3060
            DVCV IL+KMYGHSTMEVQEAFVKIREQAKAYLEMKGELTSGLNLINSTNL+YFPVKHKA
Sbjct: 3001 DVCVAILDKMYGHSTMEVQEAFVKIREQAKAYLEMKGELTSGLNLINSTNLEYFPVKHKA 3060

Query: 3061 EIFRLKGDFQLKLSDSEGANHSYSSAISLFKNLPKGWISWGNYCDMAYKESHEEIWLEYA 3120
            EI+RLKGDFQLKLSDSEGAN SYS+AI+LFKNLPKGWISWGNYCDMAYKESH+E WLEYA
Sbjct: 3061 EIYRLKGDFQLKLSDSEGANQSYSNAITLFKNLPKGWISWGNYCDMAYKESHDEAWLEYA 3120

Query: 3121 VSCFLQGIKFGISNSRNHLARVLYLLSFDTPNEPVGRAFDKYLDQIPHWVWLSWIPQLLL 3180
            VSCFLQGIKFGISNSRNHLARVLYLLSFD PNEPVGRAFDK+LDQIPHWVWLSWIPQLLL
Sbjct: 3121 VSCFLQGIKFGISNSRNHLARVLYLLSFDAPNEPVGRAFDKFLDQIPHWVWLSWIPQLLL 3180

Query: 3181 SLQRTEAPHCKLVLLKIANVYPQALYYWLRTYLLERRDVANKSELGRMAMAQQRMQQNTS 3240
            SLQRTEAPHCKLVLLKIANVYPQALYYWLRTYLLERRDVANKSELGRMAMAQQRMQQN +
Sbjct: 3181 SLQRTEAPHCKLVLLKIANVYPQALYYWLRTYLLERRDVANKSELGRMAMAQQRMQQNAA 3240

Query: 3241 SAGSLGLTDGSSRVAHGGSSTSTDNQVHQGTQSGSGIGSHDGGNSHSQEPERSTGVESST 3300
            SAGSLGL DG +R  HGGSST  DNQVHQGTQSGSGIGSHDGGN+HSQEPER+TG +SST
Sbjct: 3241 SAGSLGLADGGARAGHGGSSTPADNQVHQGTQSGSGIGSHDGGNAHSQEPERTTGADSST 3300

Query: 3301 HAGNDQSLPQTSSNVNEGTQNALRRSAALGLVGSAASAFDAAKDIMEALRSKHTNLASEL 3360
            HAGNDQSLPQ SSNVNEGTQNALRRSAALGLVGSAASAFDAAKDIMEALRSKHTNLASEL
Sbjct: 3301 HAGNDQSLPQPSSNVNEGTQNALRRSAALGLVGSAASAFDAAKDIMEALRSKHTNLASEL 3360

Query: 3361 EILLTEIGSRFVTLPEERLLAVVNALLHRCYKYPTATTAEVPQSLKKELSGVCKACFSAD 3420
            EILLTEIGSRFVTLPEERLLAVVNALLHRCYKYPTATTAEVPQSLKKELSGVCKACFSAD
Sbjct: 3361 EILLTEIGSRFVTLPEERLLAVVNALLHRCYKYPTATTAEVPQSLKKELSGVCKACFSAD 3420

Query: 3421 AVNKHVDFVREYKQDFERDLDPESTSTFPATLSELTERLKHWKNVLQGNVEDRFPAVLRL 3480
            AVNKHVDFVREYKQDFERDLDPESTSTFPATLSELTERLKHWKNVLQGNVEDRFPAVL+L
Sbjct: 3421 AVNKHVDFVREYKQDFERDLDPESTSTFPATLSELTERLKHWKNVLQGNVEDRFPAVLKL 3480

Query: 3481 EDESRVLRDFHVVDVEVPGQYFTDQEIAPDHTVKLDRVGADIPIVRRHGSSFRRLTLIGS 3540
            E+ESRVLRDFHVVDVEVPGQYFTDQEIAPDHTVKLDRVGADIPIVRRHGSSFRRLTLIGS
Sbjct: 3481 EEESRVLRDFHVVDVEVPGQYFTDQEIAPDHTVKLDRVGADIPIVRRHGSSFRRLTLIGS 3540

Query: 3541 DGSQRHFIVQTSLTPNARSDERILQLFRVMNQMFDKHKESRRRHLCIHTPIIIPVWSQVR 3600
            DGSQRHFIVQTSLTPNARSDERILQLFRVMNQMFDKHKESRRRHLCIHTPIIIPVWSQVR
Sbjct: 3541 DGSQRHFIVQTSLTPNARSDERILQLFRVMNQMFDKHKESRRRHLCIHTPIIIPVWSQVR 3600

Query: 3601 MVEDDLMYSTFLEVYENHCARNDQEADLPITYFKEQLNQAISGQIAPEAVLDLRLQAYGD 3660
            MVEDDLMYSTFLEVYENHCARNDQEADLPITYFKEQLNQAISGQI PEAV+DLRLQA+GD
Sbjct: 3601 MVEDDLMYSTFLEVYENHCARNDQEADLPITYFKEQLNQAISGQILPEAVVDLRLQAFGD 3660

Query: 3661 ITRNLVNEGIFSQYMYKTLLSGNHMWAFKKQFAIQLALSSFMSYMLQIGGRSPNKIYFAK 3720
            ITRNLVN+GIFSQYMYKTLLSGNHMWAFKKQFAIQLALSSFMSYMLQIGGRSPNKIYFAK
Sbjct: 3661 ITRNLVNDGIFSQYMYKTLLSGNHMWAFKKQFAIQLALSSFMSYMLQIGGRSPNKIYFAK 3720

Query: 3721 NTGKIFQTDFHPAYDTNGMIEFNEPVPFRLTRNMQAFFSHFGVEGLIVSAMCSAAQAVVS 3780
            NTGKIFQTDFHPAYD NGMIEFNEPVPFRLTRNMQAFFS+FGVEGLIVSAMCSAAQAVVS
Sbjct: 3721 NTGKIFQTDFHPAYDANGMIEFNEPVPFRLTRNMQAFFSNFGVEGLIVSAMCSAAQAVVS 3780

Query: 3781 PKQNQHLWHQLAMFFRDELLSWSWRRPLGMPLASIAGGGMSPADFKQKVTINVDHVIGRI 3840
            PKQNQHLWHQLAMFFRDELLSWSWRRPLGMPLASIA GGM+PADFKQKVT NVD VIGRI
Sbjct: 3781 PKQNQHLWHQLAMFFRDELLSWSWRRPLGMPLASIAAGGMNPADFKQKVTTNVDLVIGRI 3840

Query: 3841 NGIAPQYFSEEEENAMDPPQSVQRGVSDLVDAALMPRHLCMMDPTWHPWF 3889
            NGIAPQYFSEEEENAMDPPQSVQRGVS+LVDAAL P++LCMMDPTWHPWF
Sbjct: 3841 NGIAPQYFSEEEENAMDPPQSVQRGVSELVDAALQPKNLCMMDPTWHPWF 3889

BLAST of MS010599 vs. ExPASy TrEMBL
Match: A0A6J1GG63 (Non-specific serine/threonine protein kinase OS=Cucurbita moschata OX=3662 GN=LOC111453643 PE=4 SV=1)

HSP 1 Score: 7328.0 bits (19012), Expect = 0.0e+00
Identity = 3703/3889 (95.22%), Postives = 3794/3889 (97.56%), Query Frame = 0

Query: 1    MSPIQNFELHSRQLVEPELSIQTRLQMATEVRDSLEIAHTPEYLNFLKCYFRAFSIILVQ 60
            MSPIQNFE HSR LVEPEL+IQTRLQMATEVRDSLEIAHTPEYLNFLKCYFRAFS+IL++
Sbjct: 1    MSPIQNFEQHSRHLVEPELNIQTRLQMATEVRDSLEIAHTPEYLNFLKCYFRAFSVILLK 60

Query: 61   ITKPQYTDNHEHKLRNIVVEILNRLPHSEVLRPFVQDLLKVAMQVLTTDNEENGLICIRI 120
            ITKPQ+TDNHEHKLRNIVVEILNRLPHSEVLRPFVQDLLKVAMQVLTTDNEENGLICIRI
Sbjct: 61   ITKPQFTDNHEHKLRNIVVEILNRLPHSEVLRPFVQDLLKVAMQVLTTDNEENGLICIRI 120

Query: 121  IFDLLRNFRPTLENEVQPFLDFVCKIYQNFKLTVSHFFENSAAGGEDIKPMDVSTSTDQT 180
            IFDLLRNFRPTLENEVQPFLDFVCKIYQNFKLTV+HFFEN+AA GED KPMDVS+STDQ 
Sbjct: 121  IFDLLRNFRPTLENEVQPFLDFVCKIYQNFKLTVNHFFENTAAVGEDTKPMDVSSSTDQA 180

Query: 181  ITTGYTGTVQLNPSTRSFKIVTESPLVVMFLFQLYSRLVQTNIPVLLPLMVSAISVPGPE 240
            +TTG TGT QLNPSTRSFKIVTESPLVVMFLFQLYSRLVQTNIP+LLPLMVSAISVPGPE
Sbjct: 181  LTTGCTGTAQLNPSTRSFKIVTESPLVVMFLFQLYSRLVQTNIPLLLPLMVSAISVPGPE 240

Query: 241  KVPPFLKTHFIELKGAQVKTVSFLTYLLRSSADYIRPHEESICKSIVNLLVTCSDSVSIR 300
            KVPP LKTHFIELKGAQVKTVSFLTYLLRSSADYIRPHEESICKSIVNLLVTCSDSVSIR
Sbjct: 241  KVPPSLKTHFIELKGAQVKTVSFLTYLLRSSADYIRPHEESICKSIVNLLVTCSDSVSIR 300

Query: 301  KELLVALKHVLGTEYKRGLFPLIDTLLEEKVLVGTGRACYETLRPLAYSLLAEIVHHVRG 360
            KELLVALKHVLGTEYKRGLFPLIDTLLEEKVLVG GRACYETLRPLAYSLLAEIVHHVRG
Sbjct: 301  KELLVALKHVLGTEYKRGLFPLIDTLLEEKVLVGAGRACYETLRPLAYSLLAEIVHHVRG 360

Query: 361  DLTLSQLSRIIYLFSSNMHDASLSLSIHTTCARLMLNLVEPIFEKGVDQTSMDEARILLG 420
            DL+LSQLSRIIYLFSSNMHDASLSLSIHTTCARLMLNLVEPIFEKGVDQTSMDE+R LLG
Sbjct: 361  DLSLSQLSRIIYLFSSNMHDASLSLSIHTTCARLMLNLVEPIFEKGVDQTSMDESRTLLG 420

Query: 421  RILDAFVGKFSTFKHTIPQLLEEGEEGKDRANMRSKLELPVQAVLNLQVPVEHSKEVNDC 480
            RILDAFVGKFS FKHTIPQLLEEGEEGKDRAN+RSKLELPVQAVLNLQVPVEHSKEVNDC
Sbjct: 421  RILDAFVGKFSAFKHTIPQLLEEGEEGKDRANLRSKLELPVQAVLNLQVPVEHSKEVNDC 480

Query: 481  KHLIKTLILGMKTIIWSITHAHLPRPQASPSPNGTHPQMLVSPSSNLATPQAFKGMREDE 540
            KHLIKTLI+GMKTIIWSITHAHLPR Q SPSPNGTHPQMLV+PSSNLATPQAFKGMREDE
Sbjct: 481  KHLIKTLIMGMKTIIWSITHAHLPRSQVSPSPNGTHPQMLVTPSSNLATPQAFKGMREDE 540

Query: 541  VCKASGVLKSGVHCLTLFKEKDEEVEMLHLFSQILTVMEPRDLMDMFSLCMPELFDCMIT 600
            VCKASGVLKSGVHCLTLFKEKDEEVEMLHLFSQILTVMEPRDLMDMFSLCMPELFDCMI+
Sbjct: 541  VCKASGVLKSGVHCLTLFKEKDEEVEMLHLFSQILTVMEPRDLMDMFSLCMPELFDCMIS 600

Query: 601  NTQLVHLFSTFLQTPKVYRPFADVLVNFLVSSKLDVLKHPDSPGAKLVLHLFRFVFGAVA 660
            NTQLVHLFSTFLQTPKVYRPFADVLVNFLVSSKLDVLKHPDSPGAKLVLHLFRFVFGAV+
Sbjct: 601  NTQLVHLFSTFLQTPKVYRPFADVLVNFLVSSKLDVLKHPDSPGAKLVLHLFRFVFGAVS 660

Query: 661  KAPSDFERILQPHVTVIMEVCVRSATEVERPLGYMQLLRIMFRALAGCKFELLLRDLIPL 720
            KAPSDFERILQPHVTVIMEVCV+SATEVERPLGYMQLLRIMFRALAGCKFELLLRDLIPL
Sbjct: 661  KAPSDFERILQPHVTVIMEVCVKSATEVERPLGYMQLLRIMFRALAGCKFELLLRDLIPL 720

Query: 721  LQPCLNMLLTMFDGPTGEDMRDLLLELCLTLPARLSSLLPHLPRLMKPLVLCLKGSDDLV 780
            LQPCLNMLL M DGPTGEDMRDLLLELCLTLPARLSSLLPHLPRLMKPLVLCLKGSDDLV
Sbjct: 721  LQPCLNMLLIMLDGPTGEDMRDLLLELCLTLPARLSSLLPHLPRLMKPLVLCLKGSDDLV 780

Query: 781  SLGLRTLEFWVDSLNPDFLEPSMANVMSEVILALWSHLRPIPYPWGAKALQVLGKLGGRN 840
             LGLRTLEFWVDSLNPDFLEPSMA+VMSEVILALWSHLRP+PY WGAKALQVLGKLGGRN
Sbjct: 781  GLGLRTLEFWVDSLNPDFLEPSMASVMSEVILALWSHLRPMPYSWGAKALQVLGKLGGRN 840

Query: 841  RRFLKEPLALECKENPEHGLRLILTFEPSTPFLVPLDRCINLAVSTVMNKTGGVDSFYRK 900
            RRFLKEPLALECKENPEHGLRLILTFEP+TPFLVPLDRCINLAVS VMNKTGGVDSFYRK
Sbjct: 841  RRFLKEPLALECKENPEHGLRLILTFEPATPFLVPLDRCINLAVSAVMNKTGGVDSFYRK 900

Query: 901  QALKFLRVCLSSQLNLPGNVADDGHTPRQLSTLLVSPVDSSLRRSETPEGKADLGVKTKT 960
            QALKFLRVCLSSQLNLPG VADDG+TPRQLSTLLVS VDSS R+SET E KADLGVKTKT
Sbjct: 901  QALKFLRVCLSSQLNLPGIVADDGYTPRQLSTLLVSSVDSSWRKSETSEAKADLGVKTKT 960

Query: 961  QLMAEKSVFKILLMTIIAAGSEEDLHEPKDDFVLNVCRHFAILFHIDSSLNSSPVASASL 1020
            QLMAEKSVFK+LLMTIIAAGSEEDL+EPKDDFVLNVCRHFAILFHIDSSLN+ PVASASL
Sbjct: 961  QLMAEKSVFKLLLMTIIAAGSEEDLNEPKDDFVLNVCRHFAILFHIDSSLNNPPVASASL 1020

Query: 1021 GSTLLPPNVSANSRLRSSACCNLKELDPLIFLDALVEVLADENRVHAKAALNALNLFSEI 1080
            GSTLLP NV+ANSRL+SSACCNLKELDPL FLDALVEVLADENR HAKAALNALNLFSE+
Sbjct: 1021 GSTLLPSNVNANSRLKSSACCNLKELDPLTFLDALVEVLADENRFHAKAALNALNLFSEM 1080

Query: 1081 LLFLARAKQTDVMMTRGPSTPMIVSSPSKSPVYSPPPSVRIPVFEQLLPRLLHCCYGSTW 1140
            LLFL R KQTDVMMTRG  +PM VSSP  SP YSPPPSVRIPVFEQLLPRLLHCCYG TW
Sbjct: 1081 LLFLGRGKQTDVMMTRGSGSPMSVSSP-MSPAYSPPPSVRIPVFEQLLPRLLHCCYGCTW 1140

Query: 1141 QAQMGGIMGLGALVGKVTVETLCLFQVRIVRGLVYVLKRLPIYASKEQEETSQVLNQVLR 1200
            QAQMGG+MGLGALVGKVTVETLC FQV+IVRGLVYVLKRLPIYA+KEQEETSQVLN VLR
Sbjct: 1141 QAQMGGVMGLGALVGKVTVETLCHFQVKIVRGLVYVLKRLPIYANKEQEETSQVLNHVLR 1200

Query: 1201 VVNNVDEANSEPRRQSFHGVVDILASELFNPNSSTIVRKNVQSCLALLASRTGSEVSELL 1260
            VVNNVDEANSEPRRQSF  VVD+LASELFNPNSSTIVRKNVQSCLALLASRTGSEVSELL
Sbjct: 1201 VVNNVDEANSEPRRQSFQAVVDVLASELFNPNSSTIVRKNVQSCLALLASRTGSEVSELL 1260

Query: 1261 EPLHQPLLQPLLLRPLRLKTIDQQVGTVTALNFCLALRPPLLKLTQELVNFLQEALQIAE 1320
            EPL+QPLLQPLLLRPLRLKTIDQQVGTVTALNFCLALRPPLLKLTQELVNFLQEALQIAE
Sbjct: 1261 EPLYQPLLQPLLLRPLRLKTIDQQVGTVTALNFCLALRPPLLKLTQELVNFLQEALQIAE 1320

Query: 1321 ADETVWVVKFMNPKIATSLNKLRTACIELLCTTMAWADFKTPNHSELRAKIISMFFKSLT 1380
            ADETVWVVKFMNPKIATSLNKLRTACIELLCTTMAWADFKTP+HSELRAKIISMFFKSLT
Sbjct: 1321 ADETVWVVKFMNPKIATSLNKLRTACIELLCTTMAWADFKTPSHSELRAKIISMFFKSLT 1380

Query: 1381 CRTPEVVAVAKEGLRQVINQQRMPKDLLQGSLRPILVNLAHTKNLSMPLLQGLARLLELL 1440
            CR+PEVVAVAKEGLRQVINQQRMP+DLLQGSLRPILVNLAHTKNLSMPLLQGL RLLELL
Sbjct: 1381 CRSPEVVAVAKEGLRQVINQQRMPRDLLQGSLRPILVNLAHTKNLSMPLLQGLGRLLELL 1440

Query: 1441 ASWFNVTLGGKLLEHLKKWLEPEKLAQSQKAWKAGEEPKIAAAIIELFHLLPMAASKFLD 1500
            ASWFNVTLGGKLL+HLKKWLEPEKLAQSQKAWKAGEEPKIAAAIIELFHLLPMAASKFLD
Sbjct: 1441 ASWFNVTLGGKLLDHLKKWLEPEKLAQSQKAWKAGEEPKIAAAIIELFHLLPMAASKFLD 1500

Query: 1501 ELVTLTIDLEGALPPGQVYSEVNSPYRVPLIKFLNRYAPLAVDYFLARLSEPKYFRRFMY 1560
            ELVTLTI LEGALPPGQVYSEVNSPYR+PLIKFLNRYAPLAVDYFLARLSEPKYFRRFMY
Sbjct: 1501 ELVTLTIGLEGALPPGQVYSEVNSPYRIPLIKFLNRYAPLAVDYFLARLSEPKYFRRFMY 1560

Query: 1561 IIRSDAGQPLREELAKSPQKILASAFPEFAPKSEAALTPGSSTSPAPLSGDEGLVTPDAS 1620
            IIRSDAGQPLREELAKSPQKILASAFPEF PKSEAALTPGSST PAPLSGDEGLVTPD S
Sbjct: 1561 IIRSDAGQPLREELAKSPQKILASAFPEFVPKSEAALTPGSSTPPAPLSGDEGLVTPDVS 1620

Query: 1621 DPPSAPSSVVSDAYFRGLALIKTLVKLMPGWLQNNRVVFDTLVLVWKSPARIARLHNEQE 1680
            D PSAPSSVVSDAYFRGLAL+KTLVKLMPGWLQ+NRVVFDTLV VWKSPARIARLHNEQE
Sbjct: 1621 DTPSAPSSVVSDAYFRGLALVKTLVKLMPGWLQSNRVVFDTLVAVWKSPARIARLHNEQE 1680

Query: 1681 LNLVQVKESKWLVKCFLNYLRHEKAEVNVLFDILSIFLFHTRIDYTFLKEFYIIEVAEGY 1740
            LNLVQVKESKWLVKCFLNYLRHEKAEVNVLFDILSIFLFHTRIDYTFLKEFYIIEVAEGY
Sbjct: 1681 LNLVQVKESKWLVKCFLNYLRHEKAEVNVLFDILSIFLFHTRIDYTFLKEFYIIEVAEGY 1740

Query: 1741 PPNMKKALLLHFLNLFQSKQLGHDHLVIVMQMLILPMLAHAFQNGQSWEVVDQAIIKTIV 1800
            PPNMKKALLLHFLNLFQSKQLGHDHLV+VMQMLILPMLAHAFQNGQSWEVVDQAIIKTIV
Sbjct: 1741 PPNMKKALLLHFLNLFQSKQLGHDHLVVVMQMLILPMLAHAFQNGQSWEVVDQAIIKTIV 1800

Query: 1801 DKLLDPPEEVSAEYDEPLRIELLQLATLLLKYLQSDLVHHRKELIKFGWNHLKREDSASK 1860
            DKLLDPPEEV+AEYDEPLR+ELLQLATLLLKYLQSDLVHHRKELIKFGWNHLKREDSASK
Sbjct: 1801 DKLLDPPEEVTAEYDEPLRVELLQLATLLLKYLQSDLVHHRKELIKFGWNHLKREDSASK 1860

Query: 1861 QWAFVNVCHFLEAYQAPEKIILQVFVALLRTCQPENKMLVKQALDILMPALPRRLPLGDS 1920
            QWAFVNVCHFLEAYQAPEKIILQVFVALLRTCQPENKMLVKQALDILMPALPRRLPLGDS
Sbjct: 1861 QWAFVNVCHFLEAYQAPEKIILQVFVALLRTCQPENKMLVKQALDILMPALPRRLPLGDS 1920

Query: 1921 RMPIWIRYTKKILVEEGHSIPNLIHIFQLIVRHSDLFYSCRAQFVPQMVNSLSRLGLPYN 1980
            RMPIWIRYTKKILVEEGHSIPNLIHIFQLIVRHSDLFYSCRAQFVPQMVNSLSRLGLPYN
Sbjct: 1921 RMPIWIRYTKKILVEEGHSIPNLIHIFQLIVRHSDLFYSCRAQFVPQMVNSLSRLGLPYN 1980

Query: 1981 TTAENRRLAIDLAGLVVGWERQRQNEMKLVTESDAPSHSNDGLT-CPPGADPKRMVDGST 2040
            TTAENRRLAIDLAGLVVGWERQRQNEMK VTESDA SHSNDGLT CPPG DPKR+VDGST
Sbjct: 1981 TTAENRRLAIDLAGLVVGWERQRQNEMKHVTESDALSHSNDGLTSCPPGTDPKRLVDGST 2040

Query: 2041 FPEDSTKRVKVEPGLQSLCVMSPGGASSMPNIETPGSTTQPDEEFKPNAAMEEMIINFLI 2100
            FPEDSTKRVKVEPGL SLCVMSPGGASSMPN+ETPGS TQPDEEFKPNAAMEEMIINFLI
Sbjct: 2041 FPEDSTKRVKVEPGLPSLCVMSPGGASSMPNVETPGSATQPDEEFKPNAAMEEMIINFLI 2100

Query: 2101 RVALVIEPKDKEATAMYKQALELLSQALEVWPNANVKFNYLEKLLSSIQPSQSKDPSTAL 2160
            RVALVIEPKDKEATAMYKQALELLSQALEVWPNANVKFNYLEKLLSSIQPSQSKDPSTAL
Sbjct: 2101 RVALVIEPKDKEATAMYKQALELLSQALEVWPNANVKFNYLEKLLSSIQPSQSKDPSTAL 2160

Query: 2161 AQGLDVMNKVLEKQPHLFVRNNINQISQILEPCFKHKMLDAGKSLCSLLRMVFVAYPLEG 2220
            AQGLDVMNKVLEKQPHLFVRNNINQISQILEPCFKH+MLDAGKSLCSLLRMVFVAYPLEG
Sbjct: 2161 AQGLDVMNKVLEKQPHLFVRNNINQISQILEPCFKHRMLDAGKSLCSLLRMVFVAYPLEG 2220

Query: 2221 VTTPPDVKLLYQKVDELIKNHINNLTAPQTSSEDNTASSISFVLLVIKTLTEVQKNLIDP 2280
            VTTPPDVKLLYQKVDELIKNHINNLTAPQTSSEDNTASSISFVLLVIKTLTEVQKNLIDP
Sbjct: 2221 VTTPPDVKLLYQKVDELIKNHINNLTAPQTSSEDNTASSISFVLLVIKTLTEVQKNLIDP 2280

Query: 2281 YNLGRILQRLARDMGSSAGSHLRQGQRMDPDSAVTSSRQSADVGTVISNLKSVLKLINER 2340
            YNLGRILQRLARDMG SAGSHLRQGQRMDPDSAVTSSRQSADVGTVISNLKSVLKLINER
Sbjct: 2281 YNLGRILQRLARDMGMSAGSHLRQGQRMDPDSAVTSSRQSADVGTVISNLKSVLKLINER 2340

Query: 2341 VMLVPECKRSVTQIMNSLLSEKGTDASVLLCILDVIKGWVEDDFSKMGTSVSSSSFLAPK 2400
            VMLVPECKRSVTQIMNSLLSEKGTDASVLLCILDVIKGW+EDDFSKMGTSVSSSSFLAPK
Sbjct: 2341 VMLVPECKRSVTQIMNSLLSEKGTDASVLLCILDVIKGWIEDDFSKMGTSVSSSSFLAPK 2400

Query: 2401 EIVSFLQKLSQVDKQNFSSSAAEEWDGKYLQLLYEICADSNKYPLSLRQEVFQKVERQFM 2460
            EIVSFLQKLSQVDKQNFSSSAAEEWD KYLQLL+EICADSNKYPLSLRQEVFQKVERQFM
Sbjct: 2401 EIVSFLQKLSQVDKQNFSSSAAEEWDRKYLQLLHEICADSNKYPLSLRQEVFQKVERQFM 2460

Query: 2461 LGLRARDPETRKKFFTLYHESLGKTLFIRLQYIIQIQDWEALSDVFWLKQGLDLLLAVLV 2520
            LGLRARDPE RKKFFTLYHESLGKTLF RLQYIIQ+QDWEALSDVFWLKQGLDLLLAVLV
Sbjct: 2461 LGLRARDPEIRKKFFTLYHESLGKTLFTRLQYIIQVQDWEALSDVFWLKQGLDLLLAVLV 2520

Query: 2521 EDKPITLAPNSARLPPLLVSGHVADSSAVQPQVNDAQEGLEDAPLTFDSLVHKHAQFLNR 2580
            EDKPITLAPNSA+LPPLLVSGHVADSSAVQ  V DAQEG+EDAPLTFDSLV KHAQFLN+
Sbjct: 2521 EDKPITLAPNSAKLPPLLVSGHVADSSAVQHLVMDAQEGIEDAPLTFDSLVLKHAQFLNQ 2580

Query: 2581 TSKLQVADLIIPLRELAHTDANVAYHLWVLVFPIVWVTLHKEEQVALAKPMISLLSKDYH 2640
             SKLQVADLIIPLRELAHTDANVAYHLWVLVFPIVWVTLHKEEQVALAKPMI LLSKDYH
Sbjct: 2581 MSKLQVADLIIPLRELAHTDANVAYHLWVLVFPIVWVTLHKEEQVALAKPMIGLLSKDYH 2640

Query: 2641 KKQQASRPNVVQALLEGLQLSHPQPRMPSELIKYIGRTYNAWHIALALLESHVMLFMNET 2700
            KKQQASRPNVVQALLEGLQLSHPQPRMPSELIKYIG+TYNAWHIALALLESHVMLFMNET
Sbjct: 2641 KKQQASRPNVVQALLEGLQLSHPQPRMPSELIKYIGKTYNAWHIALALLESHVMLFMNET 2700

Query: 2701 KCSESLAELYRLLNEEDMRCGLWKRKAISAETKAGLSLVQHGYWQRAQILFYQSMVKATQ 2760
            KC+ESLAELYRLLNEEDMRCGLWKRKA +AETKAGLSLVQHGYWQRAQ LFYQSMVKATQ
Sbjct: 2701 KCAESLAELYRLLNEEDMRCGLWKRKANTAETKAGLSLVQHGYWQRAQSLFYQSMVKATQ 2760

Query: 2761 GTYNNNVPKAEMCLWEEQWLYCASQLSQWEALVDFGKSIENYEILLDSLWKVPDWAYMKE 2820
            GTYNN VPKAEMCLWEEQWL CASQLSQWEALVDFGKSIENYEILLD+LWKVPDWAYMKE
Sbjct: 2761 GTYNNTVPKAEMCLWEEQWLSCASQLSQWEALVDFGKSIENYEILLDNLWKVPDWAYMKE 2820

Query: 2821 HVIPKAQVEETPKLRLIQAYFSLHDRSTNGVADAENIVGKGVDLALEQWWQLPEMSVHAR 2880
            HVIPKAQVEETPKLRLIQAYF+LHDR+TNGVADAENIVGKGVDLALEQWWQLPEMSVHAR
Sbjct: 2821 HVIPKAQVEETPKLRLIQAYFALHDRTTNGVADAENIVGKGVDLALEQWWQLPEMSVHAR 2880

Query: 2881 IPLLQQFQQLVEVQESSRILVDIANGNKHSGSSVVSVHTNLYADLKDILETWRLRIPNEW 2940
            IPLLQQFQQLVEVQESSR+LVDIANGNK SG+S+  VH+NLYADLKDILETWRLRIPNEW
Sbjct: 2881 IPLLQQFQQLVEVQESSRVLVDIANGNKLSGNSIGGVHSNLYADLKDILETWRLRIPNEW 2940

Query: 2941 DSMTVWCDLLQWRNEMYNAVIDAFKDFGTTNSQLHHLGFRDKAWNVNKLAHVARKQGLHD 3000
            DSMTVWCDLLQWRNEMYNAVIDAFKDFG TNSQLHHLGFRDKAWNVNKLAHVARKQGL+D
Sbjct: 2941 DSMTVWCDLLQWRNEMYNAVIDAFKDFGNTNSQLHHLGFRDKAWNVNKLAHVARKQGLYD 3000

Query: 3001 VCVGILEKMYGHSTMEVQEAFVKIREQAKAYLEMKGELTSGLNLINSTNLDYFPVKHKAE 3060
            VCV IL+ MYGHSTMEVQEAFVKIREQAKAYLEMKGELTSGLNLINSTNL+YFPVKHKAE
Sbjct: 3001 VCVSILDSMYGHSTMEVQEAFVKIREQAKAYLEMKGELTSGLNLINSTNLEYFPVKHKAE 3060

Query: 3061 IFRLKGDFQLKLSDSEGANHSYSSAISLFKNLPKGWISWGNYCDMAYKESHEEIWLEYAV 3120
            IFRLKGDFQLKLSDSEGAN SYS+AI+LFKNLPKGWISWGNYCDMAY+ESH+EIWLEYAV
Sbjct: 3061 IFRLKGDFQLKLSDSEGANQSYSNAITLFKNLPKGWISWGNYCDMAYEESHDEIWLEYAV 3120

Query: 3121 SCFLQGIKFGISNSRNHLARVLYLLSFDTPNEPVGRAFDKYLDQIPHWVWLSWIPQLLLS 3180
            SCFLQGIKFGISNSRNHLARVLYLLSFDTPNEPVGRAFDK+LDQIPHWVWLSWIPQLLLS
Sbjct: 3121 SCFLQGIKFGISNSRNHLARVLYLLSFDTPNEPVGRAFDKFLDQIPHWVWLSWIPQLLLS 3180

Query: 3181 LQRTEAPHCKLVLLKIANVYPQALYYWLRTYLLERRDVANKSELGRMAMAQQRMQQNTSS 3240
            LQRTEAPHCKLVLLKIANVYPQALYYWLRTYLLERRDVANKSELGRMAMAQQRMQQN +S
Sbjct: 3181 LQRTEAPHCKLVLLKIANVYPQALYYWLRTYLLERRDVANKSELGRMAMAQQRMQQNAAS 3240

Query: 3241 AGSLGLTDGSSRVAHGGSSTSTDNQVHQGTQSGSGIGSHDGGNSHSQEPERSTGVESSTH 3300
            AGSLGL DG SR  H GSST TD+QVHQGTQSG+GIGSHDGGN+HSQEPER+TG +S TH
Sbjct: 3241 AGSLGLADGGSRAGHSGSSTPTDSQVHQGTQSGTGIGSHDGGNAHSQEPERTTGADSGTH 3300

Query: 3301 AGNDQSLPQTSSNVNEGTQNALRRSAALGLVGSAASAFDAAKDIMEALRSKHTNLASELE 3360
            AGNDQSLPQ SSNVNEGTQNA RRSAALGLVGSAASAFDAAKDIMEALRSKHTNLASELE
Sbjct: 3301 AGNDQSLPQPSSNVNEGTQNAFRRSAALGLVGSAASAFDAAKDIMEALRSKHTNLASELE 3360

Query: 3361 ILLTEIGSRFVTLPEERLLAVVNALLHRCYKYPTATTAEVPQSLKKELSGVCKACFSADA 3420
            ILLTEIGSRFVTLPEERLLAVVNALLHRCYKYPTATTAEVPQSLKKELSGVCKACFSADA
Sbjct: 3361 ILLTEIGSRFVTLPEERLLAVVNALLHRCYKYPTATTAEVPQSLKKELSGVCKACFSADA 3420

Query: 3421 VNKHVDFVREYKQDFERDLDPESTSTFPATLSELTERLKHWKNVLQGNVEDRFPAVLRLE 3480
            VNKHVDFVREYKQDFERDLDPESTSTFPATLSELTERLKHWKNVLQGNVEDRFPAVL+LE
Sbjct: 3421 VNKHVDFVREYKQDFERDLDPESTSTFPATLSELTERLKHWKNVLQGNVEDRFPAVLKLE 3480

Query: 3481 DESRVLRDFHVVDVEVPGQYFTDQEIAPDHTVKLDRVGADIPIVRRHGSSFRRLTLIGSD 3540
            +ESRVLRDFHVVDVEVPGQYFTDQEIAPDHTVKLDRVGADIPIVRRHGSSFRRLTLIGSD
Sbjct: 3481 EESRVLRDFHVVDVEVPGQYFTDQEIAPDHTVKLDRVGADIPIVRRHGSSFRRLTLIGSD 3540

Query: 3541 GSQRHFIVQTSLTPNARSDERILQLFRVMNQMFDKHKESRRRHLCIHTPIIIPVWSQVRM 3600
            GSQRHFIVQTSLTPNARSDERILQLFRVMNQMFDKHKESRRRHLCIHTPIIIPVWSQVRM
Sbjct: 3541 GSQRHFIVQTSLTPNARSDERILQLFRVMNQMFDKHKESRRRHLCIHTPIIIPVWSQVRM 3600

Query: 3601 VEDDLMYSTFLEVYENHCARNDQEADLPITYFKEQLNQAISGQIAPEAVLDLRLQAYGDI 3660
            VEDDLMYSTFLEVYENHCARNDQEADLPITYFKEQLNQAISGQI PEAV+DLRLQA+GDI
Sbjct: 3601 VEDDLMYSTFLEVYENHCARNDQEADLPITYFKEQLNQAISGQIVPEAVVDLRLQAFGDI 3660

Query: 3661 TRNLVNEGIFSQYMYKTLLSGNHMWAFKKQFAIQLALSSFMSYMLQIGGRSPNKIYFAKN 3720
            TRNLVNEGIFSQYMYKTLLSGNHMWAFKKQFAIQLALSSFMSYMLQIGGRSPNKIYFAKN
Sbjct: 3661 TRNLVNEGIFSQYMYKTLLSGNHMWAFKKQFAIQLALSSFMSYMLQIGGRSPNKIYFAKN 3720

Query: 3721 TGKIFQTDFHPAYDTNGMIEFNEPVPFRLTRNMQAFFSHFGVEGLIVSAMCSAAQAVVSP 3780
            TGKIFQTDFHPAYD +G+IEFNEPVPFRLTRNMQAFFS+FGVEGLIVSAMCSAAQAVVSP
Sbjct: 3721 TGKIFQTDFHPAYDASGVIEFNEPVPFRLTRNMQAFFSNFGVEGLIVSAMCSAAQAVVSP 3780

Query: 3781 KQNQHLWHQLAMFFRDELLSWSWRRPLGMPLASIAGGGMSPADFKQKVTINVDHVIGRIN 3840
            KQN HL HQLAMFFRDELLSWSWRRPLGMPLAS+AGGGM+PADFK KVT NVD VIGRI 
Sbjct: 3781 KQNHHLRHQLAMFFRDELLSWSWRRPLGMPLASLAGGGMNPADFKHKVTTNVDLVIGRIT 3840

Query: 3841 GIAPQYFSEEEENAMDPPQSVQRGVSDLVDAALMPRHLCMMDPTWHPWF 3889
            GI+PQY SEEEENAMDPPQSVQRGVS+LVDAAL P++LCMMDPTWHPWF
Sbjct: 3841 GISPQYVSEEEENAMDPPQSVQRGVSELVDAALQPKNLCMMDPTWHPWF 3888

BLAST of MS010599 vs. ExPASy TrEMBL
Match: A0A6J1IRI9 (Non-specific serine/threonine protein kinase OS=Cucurbita maxima OX=3661 GN=LOC111478710 PE=4 SV=1)

HSP 1 Score: 7317.2 bits (18984), Expect = 0.0e+00
Identity = 3699/3889 (95.11%), Postives = 3790/3889 (97.45%), Query Frame = 0

Query: 1    MSPIQNFELHSRQLVEPELSIQTRLQMATEVRDSLEIAHTPEYLNFLKCYFRAFSIILVQ 60
            MSPIQNFE HSR LVEPEL+IQTRLQMATEVRDSLEIAHTPEYLNFLKCYFRAFS+IL++
Sbjct: 1    MSPIQNFEQHSRHLVEPELNIQTRLQMATEVRDSLEIAHTPEYLNFLKCYFRAFSVILLK 60

Query: 61   ITKPQYTDNHEHKLRNIVVEILNRLPHSEVLRPFVQDLLKVAMQVLTTDNEENGLICIRI 120
            ITKPQ+TDNHEHKLRNIVVEILNRLPHSEVLRPFVQDLLKVAMQVLTTDNEENGLICIRI
Sbjct: 61   ITKPQFTDNHEHKLRNIVVEILNRLPHSEVLRPFVQDLLKVAMQVLTTDNEENGLICIRI 120

Query: 121  IFDLLRNFRPTLENEVQPFLDFVCKIYQNFKLTVSHFFENSAAGGEDIKPMDVSTSTDQT 180
            IFDLLRNFRPTLENEVQPFLDFVCKIYQNFKLTV+HFFEN+AA GED KPMDVSTS+DQ 
Sbjct: 121  IFDLLRNFRPTLENEVQPFLDFVCKIYQNFKLTVNHFFENTAAVGEDTKPMDVSTSSDQA 180

Query: 181  ITTGYTGTVQLNPSTRSFKIVTESPLVVMFLFQLYSRLVQTNIPVLLPLMVSAISVPGPE 240
            +TTG TGT QLNPSTRSFKIVTESPLVVMFLFQLYSRLVQTNIP+LLPLMVSAISVPGPE
Sbjct: 181  LTTGCTGTAQLNPSTRSFKIVTESPLVVMFLFQLYSRLVQTNIPLLLPLMVSAISVPGPE 240

Query: 241  KVPPFLKTHFIELKGAQVKTVSFLTYLLRSSADYIRPHEESICKSIVNLLVTCSDSVSIR 300
            KVPP LKTHFIELKGAQVKTVSFLTYLLRSSADYIRPHEESICKSIVNLLVTCSDSVSIR
Sbjct: 241  KVPPSLKTHFIELKGAQVKTVSFLTYLLRSSADYIRPHEESICKSIVNLLVTCSDSVSIR 300

Query: 301  KELLVALKHVLGTEYKRGLFPLIDTLLEEKVLVGTGRACYETLRPLAYSLLAEIVHHVRG 360
            KELLVALKHVLGTEYKRGLFPLIDTLLEEKVLVG GRACYETLRPLAYSLLAEIVHHVRG
Sbjct: 301  KELLVALKHVLGTEYKRGLFPLIDTLLEEKVLVGAGRACYETLRPLAYSLLAEIVHHVRG 360

Query: 361  DLTLSQLSRIIYLFSSNMHDASLSLSIHTTCARLMLNLVEPIFEKGVDQTSMDEARILLG 420
            DL+LSQLSRIIYLFSSNMHDASLSLSIHTTCARLMLNLVEPIFEKGVDQTSMDE+R LLG
Sbjct: 361  DLSLSQLSRIIYLFSSNMHDASLSLSIHTTCARLMLNLVEPIFEKGVDQTSMDESRTLLG 420

Query: 421  RILDAFVGKFSTFKHTIPQLLEEGEEGKDRANMRSKLELPVQAVLNLQVPVEHSKEVNDC 480
            RILDAFVGKFS FKHTIPQLLEEGEEGKDRAN+RSKLELPVQAVLNLQVPVEHSKEVNDC
Sbjct: 421  RILDAFVGKFSAFKHTIPQLLEEGEEGKDRANLRSKLELPVQAVLNLQVPVEHSKEVNDC 480

Query: 481  KHLIKTLILGMKTIIWSITHAHLPRPQASPSPNGTHPQMLVSPSSNLATPQAFKGMREDE 540
            KHLIKTLI+GMKTIIWSITHAHLPR Q SPSPNGTHPQMLV+PSSNLATPQAFKGMREDE
Sbjct: 481  KHLIKTLIMGMKTIIWSITHAHLPRSQVSPSPNGTHPQMLVTPSSNLATPQAFKGMREDE 540

Query: 541  VCKASGVLKSGVHCLTLFKEKDEEVEMLHLFSQILTVMEPRDLMDMFSLCMPELFDCMIT 600
            VCKASGVLKSGVHCLTLFKEKDEEVEMLHLFSQILTVMEPRDLMDMFSLCMPELFDCMI+
Sbjct: 541  VCKASGVLKSGVHCLTLFKEKDEEVEMLHLFSQILTVMEPRDLMDMFSLCMPELFDCMIS 600

Query: 601  NTQLVHLFSTFLQTPKVYRPFADVLVNFLVSSKLDVLKHPDSPGAKLVLHLFRFVFGAVA 660
            NTQLVHLFSTFLQTPKVYRPFADVLVNFLVSSKLDVLKHPDSPGAKLVLHLFRFVFGAV+
Sbjct: 601  NTQLVHLFSTFLQTPKVYRPFADVLVNFLVSSKLDVLKHPDSPGAKLVLHLFRFVFGAVS 660

Query: 661  KAPSDFERILQPHVTVIMEVCVRSATEVERPLGYMQLLRIMFRALAGCKFELLLRDLIPL 720
            KAPSDFERILQPHVTVIMEVCV+SATEVERPLGYMQLLRIMFRALAGCKFELLLRDLIPL
Sbjct: 661  KAPSDFERILQPHVTVIMEVCVKSATEVERPLGYMQLLRIMFRALAGCKFELLLRDLIPL 720

Query: 721  LQPCLNMLLTMFDGPTGEDMRDLLLELCLTLPARLSSLLPHLPRLMKPLVLCLKGSDDLV 780
            LQPCLNMLL M DGPTGEDMRDLLLELCLTLPARLSSLLPHLPRLMKPLVLCLKGSDDLV
Sbjct: 721  LQPCLNMLLIMLDGPTGEDMRDLLLELCLTLPARLSSLLPHLPRLMKPLVLCLKGSDDLV 780

Query: 781  SLGLRTLEFWVDSLNPDFLEPSMANVMSEVILALWSHLRPIPYPWGAKALQVLGKLGGRN 840
             LGLRTLEFWVDSLNPDFLEPSMA+VMSEVILALWSHLRP+PY WGAKALQVLGKLGGRN
Sbjct: 781  GLGLRTLEFWVDSLNPDFLEPSMASVMSEVILALWSHLRPMPYSWGAKALQVLGKLGGRN 840

Query: 841  RRFLKEPLALECKENPEHGLRLILTFEPSTPFLVPLDRCINLAVSTVMNKTGGVDSFYRK 900
            RRFLKEPLALECKENPEHGLRLILTFEP+TPFLVPLDRCINLAVS VMNKTGGVDSFYRK
Sbjct: 841  RRFLKEPLALECKENPEHGLRLILTFEPATPFLVPLDRCINLAVSAVMNKTGGVDSFYRK 900

Query: 901  QALKFLRVCLSSQLNLPGNVADDGHTPRQLSTLLVSPVDSSLRRSETPEGKADLGVKTKT 960
            QALKFLRVCLSSQLNLPG VADDG+TPRQLSTLLVS VDSS R+SET E KADLGVKTKT
Sbjct: 901  QALKFLRVCLSSQLNLPGIVADDGYTPRQLSTLLVSSVDSSWRKSETSEAKADLGVKTKT 960

Query: 961  QLMAEKSVFKILLMTIIAAGSEEDLHEPKDDFVLNVCRHFAILFHIDSSLNSSPVASASL 1020
            QLMAEKSVFK+LLMTIIAAGSEEDL+EPKDDFVLNVCRHFAILFHIDSSLN+ PVASASL
Sbjct: 961  QLMAEKSVFKLLLMTIIAAGSEEDLNEPKDDFVLNVCRHFAILFHIDSSLNNPPVASASL 1020

Query: 1021 GSTLLPPNVSANSRLRSSACCNLKELDPLIFLDALVEVLADENRVHAKAALNALNLFSEI 1080
            GSTLLP NV+ANSRL+SSACCNLKELDPL FLDALVEVLADENR HAKAALNALNLFSE+
Sbjct: 1021 GSTLLPSNVNANSRLKSSACCNLKELDPLTFLDALVEVLADENRFHAKAALNALNLFSEM 1080

Query: 1081 LLFLARAKQTDVMMTRGPSTPMIVSSPSKSPVYSPPPSVRIPVFEQLLPRLLHCCYGSTW 1140
            LLFL R KQTDVMMTRG  +PM VSSP  SPVYSPPPSVRIPVFEQLLPRLLHCCYG TW
Sbjct: 1081 LLFLGRGKQTDVMMTRGSGSPMSVSSP-MSPVYSPPPSVRIPVFEQLLPRLLHCCYGCTW 1140

Query: 1141 QAQMGGIMGLGALVGKVTVETLCLFQVRIVRGLVYVLKRLPIYASKEQEETSQVLNQVLR 1200
            QAQMGG+MGLGALVGKVTVETLC FQV+IVRGLVYVLKRLPIYA+KEQEETSQVLN VLR
Sbjct: 1141 QAQMGGVMGLGALVGKVTVETLCHFQVKIVRGLVYVLKRLPIYANKEQEETSQVLNHVLR 1200

Query: 1201 VVNNVDEANSEPRRQSFHGVVDILASELFNPNSSTIVRKNVQSCLALLASRTGSEVSELL 1260
            VVNNVDEANSEPRRQSF  VVD+LASELFNPNSSTIVRKNVQSCLALLASRTGSEVSELL
Sbjct: 1201 VVNNVDEANSEPRRQSFQAVVDVLASELFNPNSSTIVRKNVQSCLALLASRTGSEVSELL 1260

Query: 1261 EPLHQPLLQPLLLRPLRLKTIDQQVGTVTALNFCLALRPPLLKLTQELVNFLQEALQIAE 1320
            EPL+QPLLQPLLLRPLRLKTIDQQVGTVTALNFCLALRPPLLKLTQELVNFLQEALQIAE
Sbjct: 1261 EPLYQPLLQPLLLRPLRLKTIDQQVGTVTALNFCLALRPPLLKLTQELVNFLQEALQIAE 1320

Query: 1321 ADETVWVVKFMNPKIATSLNKLRTACIELLCTTMAWADFKTPNHSELRAKIISMFFKSLT 1380
            ADETVWVVKFMNPKIATSLNKLRTACIELLCTTMAWADFKTP+HSELRAKIISMFFKSLT
Sbjct: 1321 ADETVWVVKFMNPKIATSLNKLRTACIELLCTTMAWADFKTPSHSELRAKIISMFFKSLT 1380

Query: 1381 CRTPEVVAVAKEGLRQVINQQRMPKDLLQGSLRPILVNLAHTKNLSMPLLQGLARLLELL 1440
            CR+PEVVAVAKEGLRQVINQQRMP+DLLQGSLRPILVNLAHTKNLSMPLLQGL RLLELL
Sbjct: 1381 CRSPEVVAVAKEGLRQVINQQRMPRDLLQGSLRPILVNLAHTKNLSMPLLQGLGRLLELL 1440

Query: 1441 ASWFNVTLGGKLLEHLKKWLEPEKLAQSQKAWKAGEEPKIAAAIIELFHLLPMAASKFLD 1500
            ASWFNVTLGGKLL+HLKKWLEPEKLAQSQKAWKAGEEPKIAAAIIELFHLLPMAASKFLD
Sbjct: 1441 ASWFNVTLGGKLLDHLKKWLEPEKLAQSQKAWKAGEEPKIAAAIIELFHLLPMAASKFLD 1500

Query: 1501 ELVTLTIDLEGALPPGQVYSEVNSPYRVPLIKFLNRYAPLAVDYFLARLSEPKYFRRFMY 1560
            ELVTLTI LEGALPPGQVYSEVNSPYR+PLIKFLNRYAPLAVDYFLARLSEPKYFRRFMY
Sbjct: 1501 ELVTLTIGLEGALPPGQVYSEVNSPYRIPLIKFLNRYAPLAVDYFLARLSEPKYFRRFMY 1560

Query: 1561 IIRSDAGQPLREELAKSPQKILASAFPEFAPKSEAALTPGSSTSPAPLSGDEGLVTPDAS 1620
            IIRSDAGQPLREELAKSPQKILASAFPEF PKSE ALTPGSST PAPLSGDEGLVTPD S
Sbjct: 1561 IIRSDAGQPLREELAKSPQKILASAFPEFVPKSEVALTPGSSTPPAPLSGDEGLVTPDVS 1620

Query: 1621 DPPSAPSSVVSDAYFRGLALIKTLVKLMPGWLQNNRVVFDTLVLVWKSPARIARLHNEQE 1680
            D PSAPSSVVSDAYFRGLAL+KTLVKLMPGWLQ+NRVVFDTLV VWKSPARIARLHNEQE
Sbjct: 1621 DTPSAPSSVVSDAYFRGLALVKTLVKLMPGWLQSNRVVFDTLVAVWKSPARIARLHNEQE 1680

Query: 1681 LNLVQVKESKWLVKCFLNYLRHEKAEVNVLFDILSIFLFHTRIDYTFLKEFYIIEVAEGY 1740
            LNLVQVKESKWLVKCFLNYLRHEK EVNVLFDILSIFLFHTRIDYTFLKEFYIIEVAEGY
Sbjct: 1681 LNLVQVKESKWLVKCFLNYLRHEKEEVNVLFDILSIFLFHTRIDYTFLKEFYIIEVAEGY 1740

Query: 1741 PPNMKKALLLHFLNLFQSKQLGHDHLVIVMQMLILPMLAHAFQNGQSWEVVDQAIIKTIV 1800
            PPNMKKALLLHFLNLFQSKQLGHDHLV+VMQMLILPMLAHAFQNGQSWEVVDQAIIKTIV
Sbjct: 1741 PPNMKKALLLHFLNLFQSKQLGHDHLVVVMQMLILPMLAHAFQNGQSWEVVDQAIIKTIV 1800

Query: 1801 DKLLDPPEEVSAEYDEPLRIELLQLATLLLKYLQSDLVHHRKELIKFGWNHLKREDSASK 1860
            DKLLDPPEEV+AEYDEPLR+ELLQLATLLLKYLQSDLVHHRKELIKFGWNHLKREDSASK
Sbjct: 1801 DKLLDPPEEVTAEYDEPLRVELLQLATLLLKYLQSDLVHHRKELIKFGWNHLKREDSASK 1860

Query: 1861 QWAFVNVCHFLEAYQAPEKIILQVFVALLRTCQPENKMLVKQALDILMPALPRRLPLGDS 1920
            QWAFVNVCHFLEAYQAPEKIILQVFVALLRTCQPENKMLVKQALDILMPALPRRLPLGDS
Sbjct: 1861 QWAFVNVCHFLEAYQAPEKIILQVFVALLRTCQPENKMLVKQALDILMPALPRRLPLGDS 1920

Query: 1921 RMPIWIRYTKKILVEEGHSIPNLIHIFQLIVRHSDLFYSCRAQFVPQMVNSLSRLGLPYN 1980
            RMPIWIRYTKKILVEEGHSIPNLIHIFQLIVRHSDLFYSCRAQFVPQMVNSLSRLGLPYN
Sbjct: 1921 RMPIWIRYTKKILVEEGHSIPNLIHIFQLIVRHSDLFYSCRAQFVPQMVNSLSRLGLPYN 1980

Query: 1981 TTAENRRLAIDLAGLVVGWERQRQNEMKLVTESDAPSHSNDGLT-CPPGADPKRMVDGST 2040
            TTAENRRLAIDLAGLVVGWERQRQNEMK VTESDA SHSNDGLT CP G DPKR+VDGST
Sbjct: 1981 TTAENRRLAIDLAGLVVGWERQRQNEMKHVTESDALSHSNDGLTSCPSGTDPKRLVDGST 2040

Query: 2041 FPEDSTKRVKVEPGLQSLCVMSPGGASSMPNIETPGSTTQPDEEFKPNAAMEEMIINFLI 2100
            FPEDSTKRVKVEPGL SLCVMSPGGASSMPN+ETPGS TQPDEEFKPNAAMEEMIINFLI
Sbjct: 2041 FPEDSTKRVKVEPGLPSLCVMSPGGASSMPNVETPGSATQPDEEFKPNAAMEEMIINFLI 2100

Query: 2101 RVALVIEPKDKEATAMYKQALELLSQALEVWPNANVKFNYLEKLLSSIQPSQSKDPSTAL 2160
            RVALVIEPKDKEATAMYKQALELLSQALEVWPNANVKFNYLEKLLSSIQPSQSKDPSTAL
Sbjct: 2101 RVALVIEPKDKEATAMYKQALELLSQALEVWPNANVKFNYLEKLLSSIQPSQSKDPSTAL 2160

Query: 2161 AQGLDVMNKVLEKQPHLFVRNNINQISQILEPCFKHKMLDAGKSLCSLLRMVFVAYPLEG 2220
            AQGLDVMNKVLEKQPHLFVRNNINQISQILEPCFKH+MLDAGKSLCSLLRMVFVAYPLEG
Sbjct: 2161 AQGLDVMNKVLEKQPHLFVRNNINQISQILEPCFKHRMLDAGKSLCSLLRMVFVAYPLEG 2220

Query: 2221 VTTPPDVKLLYQKVDELIKNHINNLTAPQTSSEDNTASSISFVLLVIKTLTEVQKNLIDP 2280
            VTTPPDVKLLYQKVDELIKNHINNLTAPQTSSEDNTASSISFVLLVIKTLTEVQKNLIDP
Sbjct: 2221 VTTPPDVKLLYQKVDELIKNHINNLTAPQTSSEDNTASSISFVLLVIKTLTEVQKNLIDP 2280

Query: 2281 YNLGRILQRLARDMGSSAGSHLRQGQRMDPDSAVTSSRQSADVGTVISNLKSVLKLINER 2340
            YNLGRILQRLARDMG SAGSHLRQGQRMDPDSAVTSSRQSADVGTVISNLKSVLKLINER
Sbjct: 2281 YNLGRILQRLARDMGMSAGSHLRQGQRMDPDSAVTSSRQSADVGTVISNLKSVLKLINER 2340

Query: 2341 VMLVPECKRSVTQIMNSLLSEKGTDASVLLCILDVIKGWVEDDFSKMGTSVSSSSFLAPK 2400
            VMLVPECKRSVTQIMNSLLSEKGTDASVLLCILDVIKGW+EDDFSKMGTSVSSSSFLAPK
Sbjct: 2341 VMLVPECKRSVTQIMNSLLSEKGTDASVLLCILDVIKGWIEDDFSKMGTSVSSSSFLAPK 2400

Query: 2401 EIVSFLQKLSQVDKQNFSSSAAEEWDGKYLQLLYEICADSNKYPLSLRQEVFQKVERQFM 2460
            EIVSFLQKLSQVDKQNFSSSAAEEWD KYLQLL+EICADSNKYPLSLRQEVFQKVERQFM
Sbjct: 2401 EIVSFLQKLSQVDKQNFSSSAAEEWDRKYLQLLHEICADSNKYPLSLRQEVFQKVERQFM 2460

Query: 2461 LGLRARDPETRKKFFTLYHESLGKTLFIRLQYIIQIQDWEALSDVFWLKQGLDLLLAVLV 2520
            LGLRARDPE RKKFFTLYHESLGKTLF RLQYIIQIQDWEALSDVFWLKQGLDLLLAVLV
Sbjct: 2461 LGLRARDPEIRKKFFTLYHESLGKTLFTRLQYIIQIQDWEALSDVFWLKQGLDLLLAVLV 2520

Query: 2521 EDKPITLAPNSARLPPLLVSGHVADSSAVQPQVNDAQEGLEDAPLTFDSLVHKHAQFLNR 2580
            EDKPITLAPNSA+LPPLLVSGHVADSSAVQ  V DAQEG+EDAPLTFDSLV KHAQFLNR
Sbjct: 2521 EDKPITLAPNSAKLPPLLVSGHVADSSAVQHLVMDAQEGIEDAPLTFDSLVLKHAQFLNR 2580

Query: 2581 TSKLQVADLIIPLRELAHTDANVAYHLWVLVFPIVWVTLHKEEQVALAKPMISLLSKDYH 2640
             SKLQVADLIIPLRELAHTDANVAYHLWVLVFPIVWVTLHKEEQVALAKPMI LLSKDYH
Sbjct: 2581 MSKLQVADLIIPLRELAHTDANVAYHLWVLVFPIVWVTLHKEEQVALAKPMIGLLSKDYH 2640

Query: 2641 KKQQASRPNVVQALLEGLQLSHPQPRMPSELIKYIGRTYNAWHIALALLESHVMLFMNET 2700
            KKQQASRPNVVQALLEGLQLSHPQPRMPSELIKYIG+TYNAWHIALALLESHVMLFMNET
Sbjct: 2641 KKQQASRPNVVQALLEGLQLSHPQPRMPSELIKYIGKTYNAWHIALALLESHVMLFMNET 2700

Query: 2701 KCSESLAELYRLLNEEDMRCGLWKRKAISAETKAGLSLVQHGYWQRAQILFYQSMVKATQ 2760
            KC+ESLAELYRLLNEEDMRCGLWKRKA +AETKAGLSLVQHGYWQRAQ LFYQSMVKATQ
Sbjct: 2701 KCAESLAELYRLLNEEDMRCGLWKRKANTAETKAGLSLVQHGYWQRAQSLFYQSMVKATQ 2760

Query: 2761 GTYNNNVPKAEMCLWEEQWLYCASQLSQWEALVDFGKSIENYEILLDSLWKVPDWAYMKE 2820
            GTYNN VPKAEMCLWEEQWL CASQLSQWEALVDFGKSIENYEILLD+LWKVPDWAYMKE
Sbjct: 2761 GTYNNTVPKAEMCLWEEQWLSCASQLSQWEALVDFGKSIENYEILLDNLWKVPDWAYMKE 2820

Query: 2821 HVIPKAQVEETPKLRLIQAYFSLHDRSTNGVADAENIVGKGVDLALEQWWQLPEMSVHAR 2880
            HVIPKAQVEETPKLRLIQAYF+LHDR+TNGVADAENIVGKGVDLALEQWWQLPEMSVHAR
Sbjct: 2821 HVIPKAQVEETPKLRLIQAYFALHDRTTNGVADAENIVGKGVDLALEQWWQLPEMSVHAR 2880

Query: 2881 IPLLQQFQQLVEVQESSRILVDIANGNKHSGSSVVSVHTNLYADLKDILETWRLRIPNEW 2940
            IPLLQQFQQLVEVQESSR+LVDIANGNK SG+S+  VH+NLYADLKDILETWRLRIPNEW
Sbjct: 2881 IPLLQQFQQLVEVQESSRVLVDIANGNKLSGNSIGGVHSNLYADLKDILETWRLRIPNEW 2940

Query: 2941 DSMTVWCDLLQWRNEMYNAVIDAFKDFGTTNSQLHHLGFRDKAWNVNKLAHVARKQGLHD 3000
            DSMTVWCDLLQWRNEMYNAVIDAFKDFG TNSQLHHLGFRDKAWNVNKLAHVARKQGL+D
Sbjct: 2941 DSMTVWCDLLQWRNEMYNAVIDAFKDFGNTNSQLHHLGFRDKAWNVNKLAHVARKQGLYD 3000

Query: 3001 VCVGILEKMYGHSTMEVQEAFVKIREQAKAYLEMKGELTSGLNLINSTNLDYFPVKHKAE 3060
            VCV IL+ MYGHSTMEVQEAFVKIREQAKAYLEMKGELTSGLNLINSTNL+YFPVKHKAE
Sbjct: 3001 VCVSILDSMYGHSTMEVQEAFVKIREQAKAYLEMKGELTSGLNLINSTNLEYFPVKHKAE 3060

Query: 3061 IFRLKGDFQLKLSDSEGANHSYSSAISLFKNLPKGWISWGNYCDMAYKESHEEIWLEYAV 3120
            IFRLKGDFQLKLSDSEGAN SYS+AI+LFKNLPKGWISWGNYCDMAY+ESH+EIWLEYAV
Sbjct: 3061 IFRLKGDFQLKLSDSEGANQSYSNAITLFKNLPKGWISWGNYCDMAYEESHDEIWLEYAV 3120

Query: 3121 SCFLQGIKFGISNSRNHLARVLYLLSFDTPNEPVGRAFDKYLDQIPHWVWLSWIPQLLLS 3180
            SCFLQGIKFGISNSRNHLARVLYLLSFDTPNEPVGRAFDK+LDQIPHWVWLSWIPQLLLS
Sbjct: 3121 SCFLQGIKFGISNSRNHLARVLYLLSFDTPNEPVGRAFDKFLDQIPHWVWLSWIPQLLLS 3180

Query: 3181 LQRTEAPHCKLVLLKIANVYPQALYYWLRTYLLERRDVANKSELGRMAMAQQRMQQNTSS 3240
            LQRTEAPHCKLVLLKIANVYPQALYYWLRTYLLERRDVANKSELGRMAMAQQRMQQN +S
Sbjct: 3181 LQRTEAPHCKLVLLKIANVYPQALYYWLRTYLLERRDVANKSELGRMAMAQQRMQQNAAS 3240

Query: 3241 AGSLGLTDGSSRVAHGGSSTSTDNQVHQGTQSGSGIGSHDGGNSHSQEPERSTGVESSTH 3300
            AGSLGL DG SR  H GSST TD+QVHQG QSG+GIGSHDGGN+HSQEPER+TG +S TH
Sbjct: 3241 AGSLGLVDGGSRAGHSGSSTPTDSQVHQGAQSGTGIGSHDGGNAHSQEPERTTGADSGTH 3300

Query: 3301 AGNDQSLPQTSSNVNEGTQNALRRSAALGLVGSAASAFDAAKDIMEALRSKHTNLASELE 3360
            AGNDQSLPQ SSNVNEGTQNA RRSAALGL GSAASAFDAAKDIMEALRSKHTNLASELE
Sbjct: 3301 AGNDQSLPQPSSNVNEGTQNAFRRSAALGLGGSAASAFDAAKDIMEALRSKHTNLASELE 3360

Query: 3361 ILLTEIGSRFVTLPEERLLAVVNALLHRCYKYPTATTAEVPQSLKKELSGVCKACFSADA 3420
            ILLTEIGSRFVTLPEERLLAVVNALLHRCYKYPTATTAEVPQSLKKELSGVCKACFSADA
Sbjct: 3361 ILLTEIGSRFVTLPEERLLAVVNALLHRCYKYPTATTAEVPQSLKKELSGVCKACFSADA 3420

Query: 3421 VNKHVDFVREYKQDFERDLDPESTSTFPATLSELTERLKHWKNVLQGNVEDRFPAVLRLE 3480
            VNKHVDFVREYKQDFERDLDPESTSTFPATLSELTERLKHWKNVLQGNVEDRFPAVL+LE
Sbjct: 3421 VNKHVDFVREYKQDFERDLDPESTSTFPATLSELTERLKHWKNVLQGNVEDRFPAVLKLE 3480

Query: 3481 DESRVLRDFHVVDVEVPGQYFTDQEIAPDHTVKLDRVGADIPIVRRHGSSFRRLTLIGSD 3540
            +ESRVLRDFHVVDVEVPGQYFTDQEIAPDHTVKLDRVGADIPIVRRHGSSFRRLTLIGSD
Sbjct: 3481 EESRVLRDFHVVDVEVPGQYFTDQEIAPDHTVKLDRVGADIPIVRRHGSSFRRLTLIGSD 3540

Query: 3541 GSQRHFIVQTSLTPNARSDERILQLFRVMNQMFDKHKESRRRHLCIHTPIIIPVWSQVRM 3600
            GSQRHFIVQTSLTPNARSDERILQLFRVMNQMFDKHKESRRRHLCIHTPIIIPVWSQVRM
Sbjct: 3541 GSQRHFIVQTSLTPNARSDERILQLFRVMNQMFDKHKESRRRHLCIHTPIIIPVWSQVRM 3600

Query: 3601 VEDDLMYSTFLEVYENHCARNDQEADLPITYFKEQLNQAISGQIAPEAVLDLRLQAYGDI 3660
            VEDDLMYSTFLEVYENHCARNDQEADLPITYFKEQLNQAISGQI PEAV+DLRLQA+GDI
Sbjct: 3601 VEDDLMYSTFLEVYENHCARNDQEADLPITYFKEQLNQAISGQIVPEAVVDLRLQAFGDI 3660

Query: 3661 TRNLVNEGIFSQYMYKTLLSGNHMWAFKKQFAIQLALSSFMSYMLQIGGRSPNKIYFAKN 3720
            TRNLVNEGIFSQYMYKTLLSGNHMWAFKKQFAIQLALSSFMSYMLQIGGRSPNKIYFAKN
Sbjct: 3661 TRNLVNEGIFSQYMYKTLLSGNHMWAFKKQFAIQLALSSFMSYMLQIGGRSPNKIYFAKN 3720

Query: 3721 TGKIFQTDFHPAYDTNGMIEFNEPVPFRLTRNMQAFFSHFGVEGLIVSAMCSAAQAVVSP 3780
            TGKIFQTDFHPAYD +G+IEFNEPVPFRLTRNMQAFFS+FGVEGLIVSAMCSAAQAVVSP
Sbjct: 3721 TGKIFQTDFHPAYDASGVIEFNEPVPFRLTRNMQAFFSNFGVEGLIVSAMCSAAQAVVSP 3780

Query: 3781 KQNQHLWHQLAMFFRDELLSWSWRRPLGMPLASIAGGGMSPADFKQKVTINVDHVIGRIN 3840
            KQ+ HL HQLAMFFRDELLSWSWRRPLGMPLAS+AGGGM+PADF+ KVT NVD VIGRI 
Sbjct: 3781 KQSHHLRHQLAMFFRDELLSWSWRRPLGMPLASLAGGGMNPADFRHKVTTNVDLVIGRIT 3840

Query: 3841 GIAPQYFSEEEENAMDPPQSVQRGVSDLVDAALMPRHLCMMDPTWHPWF 3889
            GI+PQY SEEEENAMDPPQSVQRGVS+LVDAAL P++LCMMDPTWHPWF
Sbjct: 3841 GISPQYVSEEEENAMDPPQSVQRGVSELVDAALQPKNLCMMDPTWHPWF 3888

BLAST of MS010599 vs. TAIR 10
Match: AT2G17930.1 (Phosphatidylinositol 3- and 4-kinase family protein with FAT domain )

HSP 1 Score: 6289.9 bits (16317), Expect = 0.0e+00
Identity = 3175/3893 (81.56%), Postives = 3495/3893 (89.78%), Query Frame = 0

Query: 1    MSPIQNFELHSRQLVEPELSIQTRLQMATEVRDSLEIAHTPEYLNFLKCYFRAFSIILVQ 60
            MSPIQNFE HSR+LV+ +L I TRL+M  EVRDSLEIAHT EYLNFLKCYF AFS+IL+Q
Sbjct: 1    MSPIQNFEQHSRRLVDLDLPIPTRLEMVVEVRDSLEIAHTAEYLNFLKCYFPAFSVILLQ 60

Query: 61   ITKPQYTDNHEHKLRNIVVEILNRLPHSEVLRPFVQDLLKVAMQVLTTDNEENGLICIRI 120
            ITKPQ+ DN EHKLRNIVVEILNRLPHSEVLRPFVQDLLKVAMQVLT DNEENGLICIRI
Sbjct: 61   ITKPQFIDNPEHKLRNIVVEILNRLPHSEVLRPFVQDLLKVAMQVLTADNEENGLICIRI 120

Query: 121  IFDLLRNFRPTLENEVQPFLDFVCKIYQNFKLTVSHFFENSAAGGEDIKPMDVSTSTDQT 180
            IFDLLRNFRPTLENEVQPFLDFVCKIY  F+ TVSHFF+N     E++KPM++ TS+DQ+
Sbjct: 121  IFDLLRNFRPTLENEVQPFLDFVCKIYSIFRFTVSHFFDNVKM--EEVKPMEMPTSSDQS 180

Query: 181  IT-TGYTGTVQLNPSTRSFKIVTESPLVVMFLFQLYSRLVQTNIPVLLPLMVSAISVPGP 240
            +T T   G VQLNPSTRSFKI+TESPLVVMFLFQLYSRLVQTNIP LLPLMV+AISVPGP
Sbjct: 181  LTPTPPIGNVQLNPSTRSFKIITESPLVVMFLFQLYSRLVQTNIPHLLPLMVAAISVPGP 240

Query: 241  EKVPPFLKTHFIELKGAQVKTVSFLTYLLRSSADYIRPHEESICKSIVNLLVTCSDSVSI 300
            E VP  LK  FIELKGAQVKTVSFLTYLL+S A+YIRPHEESICKSIVNLLVTCSDS SI
Sbjct: 241  ENVPSHLKPQFIELKGAQVKTVSFLTYLLKSCAEYIRPHEESICKSIVNLLVTCSDSASI 300

Query: 301  RKELLVALKHVLGTEYKRGLFPLIDTLLEEKVLVGTGRACYETLRPLAYSLLAEIVHHVR 360
            RKELLV+LKHVLGT++KRGLFPLIDTLL+E+VLVGTGRAC+E+LRPLAYSLLAEIVHHVR
Sbjct: 301  RKELLVSLKHVLGTDFKRGLFPLIDTLLDERVLVGTGRACFESLRPLAYSLLAEIVHHVR 360

Query: 361  GDLTLSQLSRIIYLFSSNMHDASLSLSIHTTCARLMLNLVEPIFEKGVDQTSMDEARILL 420
            GDL+L+QLSRIIYLFS NMHD++LSLSIHTTCARLMLNLVEPIFEKGVDQ SMDEARILL
Sbjct: 361  GDLSLAQLSRIIYLFSRNMHDSTLSLSIHTTCARLMLNLVEPIFEKGVDQQSMDEARILL 420

Query: 421  GRILDAFVGKFSTFKHTIPQLLEEGEEGKDRANMRSKLELPVQAVLNLQVPVEHSKEVND 480
            GRILDAFVGKFSTFK TIPQLLEEGE GKDR  +RSKLELPVQAVLNLQVPVEHSKEVND
Sbjct: 421  GRILDAFVGKFSTFKRTIPQLLEEGEVGKDRVTLRSKLELPVQAVLNLQVPVEHSKEVND 480

Query: 481  CKHLIKTLILGMKTIIWSITHAHLPRPQASPSPNGTHPQMLVSPSSNLATPQAFKGMRED 540
            CK+LIKTL++GMKTIIWSITHAHLPRPQ      G +PQ LVS SS    PQ FKGMRED
Sbjct: 481  CKNLIKTLVMGMKTIIWSITHAHLPRPQ------GMNPQALVSQSS---APQGFKGMRED 540

Query: 541  EVCKASGVLKSGVHCLTLFKEKDEEVEMLHLFSQILTVMEPRDLMDMFSLCMPELFDCMI 600
            EV KASGVLKSGVHCL LFKEKDEE EML+LFSQIL +MEPRDLMDMFSLCMPELF+ MI
Sbjct: 541  EVWKASGVLKSGVHCLALFKEKDEEKEMLNLFSQILAIMEPRDLMDMFSLCMPELFESMI 600

Query: 601  TNTQLVHLFSTFLQTPKVYRPFADVLVNFLVSSKLDVLKHPDSPGAKLVLHLFRFVFGAV 660
             N QLV +F+  LQ PKVY+PFADVL+N LVSSKLDVLK+PDS   KLVLHLFR +FGAV
Sbjct: 601  NNNQLVQIFAALLQAPKVYKPFADVLINLLVSSKLDVLKNPDSAATKLVLHLFRCIFGAV 660

Query: 661  AKAPSDFERILQPHVTVIMEVCVRSATEVERPLGYMQLLRIMFRALAGCKFELLLRDLIP 720
             K PSDFERILQ HV VIMEVC+++ATEVE+PLGYMQLLR +FR LAGCK+ELLLRDLIP
Sbjct: 661  TKTPSDFERILQHHVPVIMEVCMKNATEVEKPLGYMQLLRTVFRGLAGCKYELLLRDLIP 720

Query: 721  LLQPCLNMLLTMFDGPTGEDMRDLLLELCLTLPARLSSLLPHLPRLMKPLVLCLKGSDDL 780
            +L PCLN+LLTM +GP GEDM+DLLLELCLTLPARLSSLLP+LPRLMKPLV CL+GSD+L
Sbjct: 721  MLLPCLNILLTMLEGPAGEDMKDLLLELCLTLPARLSSLLPYLPRLMKPLVFCLRGSDEL 780

Query: 781  VSLGLRTLEFWVDSLNPDFLEPSMANVMSEVILALWSHLRPIPYPWGAKALQVLGKLGGR 840
            VSLGLRTLEFWVDSLNPDFLEPSMANVMSEVILALWSHLRP+PYPWG KALQ+LGKLGGR
Sbjct: 781  VSLGLRTLEFWVDSLNPDFLEPSMANVMSEVILALWSHLRPVPYPWGKKALQILGKLGGR 840

Query: 841  NRRFLKEPLALECKENPEHGLRLILTFEPSTPFLVPLDRCINLAVSTVMNKTGGVDSFYR 900
            NRRFLKEPL LECK+NPEHGLRL+LTFEPSTPFLVPLD+ INLAV+ V+ +  G+D +YR
Sbjct: 841  NRRFLKEPLTLECKDNPEHGLRLVLTFEPSTPFLVPLDKFINLAVAAVIQRNHGMDIYYR 900

Query: 901  KQALKFLRVCLSSQLNLPGNVADDGHTPRQLSTLLVSPVDSSLRRSETPEGKADLGVKTK 960
            KQALKFLRVCL SQLNLPG V D G TPRQLSTLL S VDSS  RSE  E KADLGVKTK
Sbjct: 901  KQALKFLRVCLLSQLNLPGCVTDVGQTPRQLSTLLRSSVDSSWHRSEAVEIKADLGVKTK 960

Query: 961  TQLMAEKSVFKILLMTIIAAGSEEDLHEPKDDFVLNVCRHFAILFHIDSSLNSSPVASAS 1020
            TQLMAEKS+FK LL+TI+AA S+ DL +  DDFV N+CRHFAI+ H+D + +++  +++S
Sbjct: 961  TQLMAEKSIFKTLLITILAASSDPDLSDTDDDFVENICRHFAIILHVDYTSSNASTSTSS 1020

Query: 1021 LGSTLLPPNVSANSRLRSSACCNLKELDPLIFLDALVEVLADENRVHAKAALNALNLFSE 1080
            LG ++    +S +SR +S+   NLK+LDPLIFLDALV+VLADENR+HAKAALNALN+F+E
Sbjct: 1021 LGGSV----ISTSSRSKSNQSSNLKQLDPLIFLDALVDVLADENRLHAKAALNALNVFAE 1080

Query: 1081 ILLFLARAKQTDVMMTR-GPSTPMIVSSPSKSPVYSPPPSVRIPVFEQLLPRLLHCCYGS 1140
             LLFLAR K  DV+M R G +  MIVSSPS +PVYSP PSVRIPVFEQLLPRLLH CYGS
Sbjct: 1081 TLLFLARVKHADVLMARGGHNASMIVSSPSTNPVYSPHPSVRIPVFEQLLPRLLHGCYGS 1140

Query: 1141 TWQAQMGGIMGLGALVGKVTVETLCLFQVRIVRGLVYVLKRLPIYASKEQEETSQVLNQV 1200
            TWQAQMGG+MGLGALVGKV VETLC FQV+IVRGLVYVLKRLP+YASKEQEETSQVL Q+
Sbjct: 1141 TWQAQMGGVMGLGALVGKVNVETLCYFQVKIVRGLVYVLKRLPVYASKEQEETSQVLMQI 1200

Query: 1201 LRVVNNVDEANSEPRRQSFHGVVDILASELFNPNSSTIVRKNVQSCLALLASRTGSEVSE 1260
            LRVVNNVDEANSE RR+SF  VV+ LA+ELFNPN+S  VRKNVQ+CLALLASRTGSEV+E
Sbjct: 1201 LRVVNNVDEANSEARRKSFQDVVEYLATELFNPNASIPVRKNVQNCLALLASRTGSEVTE 1260

Query: 1261 LLEPLHQPLLQPLLLRPLRLKTIDQQVGTVTALNFCLALRPPLLKLTQELVNFLQEALQI 1320
            LLEPL+Q LLQPL++RPLR KT+DQQVGTV ALNFCLALRPPLLK+T ELVNFLQEALQI
Sbjct: 1261 LLEPLYQLLLQPLIMRPLRSKTVDQQVGTVAALNFCLALRPPLLKVTPELVNFLQEALQI 1320

Query: 1321 AEADETVWVVKFMNPKIATSLNKLRTACIELLCTTMAWADFKTPNHSELRAKIISMFFKS 1380
            AEADETVW VK MNPK+ TSLN+LRTACIELLCTTMAW DF+T  H+ELRAKIISMFFKS
Sbjct: 1321 AEADETVWAVKLMNPKVLTSLNRLRTACIELLCTTMAWTDFRTQTHNELRAKIISMFFKS 1380

Query: 1381 LTCRTPEVVAVAKEGLRQVINQQRMPKDLLQGSLRPILVNLAHTKNLSMPLLQGLARLLE 1440
            LTCR PE+VAVAKEGLRQVINQQRMPK+LLQ SLRPILVNLAHTKNLSMPLLQGLARLLE
Sbjct: 1381 LTCRAPEIVAVAKEGLRQVINQQRMPKELLQSSLRPILVNLAHTKNLSMPLLQGLARLLE 1440

Query: 1441 LLASWFNVTLGGKLLEHLKKWLEPEKLAQSQKAWKAGEEPKIAAAIIELFHLLPMAASKF 1500
            LL++WFNVTLGGKLLEHLKKWLEPEKLAQSQKAWKAGEEPKIAAAIIELFHLLP AASKF
Sbjct: 1441 LLSNWFNVTLGGKLLEHLKKWLEPEKLAQSQKAWKAGEEPKIAAAIIELFHLLPHAASKF 1500

Query: 1501 LDELVTLTIDLEGALPPGQVYSEVNSPYRVPLIKFLNRYAPLAVDYFLARLSEPKYFRRF 1560
            LDELVTLTIDLE ALPPGQVYSE+NSPYR+PL KFLNRYA LAVDYFL+RLSEPKYFRRF
Sbjct: 1501 LDELVTLTIDLEAALPPGQVYSEINSPYRLPLTKFLNRYAALAVDYFLSRLSEPKYFRRF 1560

Query: 1561 MYIIRSDAGQPLREELAKSPQKILASAFPEFAPKSEAALTPGSSTSPAPLSGDEGLVTP- 1620
            MYIIRSDAGQPLREELAKSPQKIL+ AFPE +PK +  L+  +ST PA  SGDE  ++  
Sbjct: 1561 MYIIRSDAGQPLREELAKSPQKILSYAFPEISPKPDPTLSTTASTPPATSSGDENHISVK 1620

Query: 1621 -DASDPPSAPSSVVSDAYFRGLALIKTLVKLMPGWLQNNRVVFDTLVLVWKSPARIARLH 1680
             ++S+  S  +++ SDAYF+GL LIKT+VKL+P WLQ+NR VFDTLVL+WKSPARI+RL 
Sbjct: 1621 LESSNVASTKANIASDAYFQGLYLIKTMVKLIPSWLQSNRSVFDTLVLIWKSPARISRLQ 1680

Query: 1681 NEQELNLVQVKESKWLVKCFLNYLRHEKAEVNVLFDILSIFLFHTRIDYTFLKEFYIIEV 1740
            NEQELNLVQVKESKWLVKCFLNYLRHEK+EVNVLFDILSIFLFH+RIDYTFLKEFYIIEV
Sbjct: 1681 NEQELNLVQVKESKWLVKCFLNYLRHEKSEVNVLFDILSIFLFHSRIDYTFLKEFYIIEV 1740

Query: 1741 AEGYPPNMKKALLLHFLNLFQSKQLGHDHLVIVMQMLILPMLAHAFQNGQSWEVVDQAII 1800
            AEGYPPNMK+ALLLHFLNLF SKQLGHDHLV  MQMLILPMLAHAFQNGQ+WEV+D  I+
Sbjct: 1741 AEGYPPNMKRALLLHFLNLFHSKQLGHDHLVQAMQMLILPMLAHAFQNGQTWEVIDPDIV 1800

Query: 1801 KTIVDKLLDPPEEVSAEYDEPLRIELLQLATLLLKYLQSDLVHHRKELIKFGWNHLKRED 1860
            KTIV++LLDPPEEVSAEYDEPLRIELLQLATLLLKYLQSDLVHHRKELIKFGWNHLKRED
Sbjct: 1801 KTIVERLLDPPEEVSAEYDEPLRIELLQLATLLLKYLQSDLVHHRKELIKFGWNHLKRED 1860

Query: 1861 SASKQWAFVNVCHFLEAYQAPEKIILQVFVALLRTCQPENKMLVKQALDILMPALPRRLP 1920
            SASKQWAFVNVCHFL+AYQAPEKIILQVFVALLRTCQPENKMLVKQALDILMPALP+RLP
Sbjct: 1861 SASKQWAFVNVCHFLDAYQAPEKIILQVFVALLRTCQPENKMLVKQALDILMPALPKRLP 1920

Query: 1921 LGDSRMPIWIRYTKKILVEEGHSIPNLIHIFQLIVRHSDLFYSCRAQFVPQMVNSLSRLG 1980
            LGDSRMPIWIRYTKKILVEEGHSIPNLIHIF L+VRHSDLFYSCRAQFVPQMVNSLSRLG
Sbjct: 1921 LGDSRMPIWIRYTKKILVEEGHSIPNLIHIFLLVVRHSDLFYSCRAQFVPQMVNSLSRLG 1980

Query: 1981 LPYNTTAENRRLAIDLAGLVVGWERQRQNEMKLVTESDAPSHSNDGLTCPPGADPKRMVD 2040
            LPYNTTAENRRLAI+LAGLVV WERQRQNEMK+VT++D  S   D +    GADPKR  D
Sbjct: 1981 LPYNTTAENRRLAIELAGLVVSWERQRQNEMKMVTDTDGTSQITDEMHTSSGADPKRSTD 2040

Query: 2041 GSTFPEDSTKRVKVEPGLQSLCVMSPGGASSMPNIETPGSTTQPDEEFKPNAAMEEMIIN 2100
            GS   ED +KRVK+EPGLQS+CVMSPGGASS+PN+ETPGS TQPDEEFKPNAAMEEMIIN
Sbjct: 2041 GSATSEDPSKRVKIEPGLQSICVMSPGGASSIPNVETPGSATQPDEEFKPNAAMEEMIIN 2100

Query: 2101 FLIRVALVIEPKDKEATAMYKQALELLSQALEVWPNANVKFNYLEKLLSSIQPSQSKDPS 2160
            FLIRVALVIEPKD+E   MYKQAL+LLSQALEVWP+ANVKFNYLEKLLSS+ PSQS DPS
Sbjct: 2101 FLIRVALVIEPKDRETNTMYKQALDLLSQALEVWPSANVKFNYLEKLLSSMPPSQS-DPS 2160

Query: 2161 TALAQGLDVMNKVLEKQPHLFVRNNINQISQILEPCFKHKMLDAGKSLCSLLRMVFVAYP 2220
            TALAQGLDVMNKVLEKQPHLF+RNNINQISQILEPCFKHKMLDAGKSLCSLL+MVF A+P
Sbjct: 2161 TALAQGLDVMNKVLEKQPHLFIRNNINQISQILEPCFKHKMLDAGKSLCSLLKMVFTAFP 2220

Query: 2221 LEGVTTPPDVKLLYQKVDELIKNHINNLTAPQTSSEDNTASSISFVLLVIKTLTEVQKNL 2280
            L+   TPPD+KLLYQKV+ELI  H+N +TAPQTS +DN+  SISFVLLVIKTL  V KN 
Sbjct: 2221 LDAANTPPDIKLLYQKVNELINKHVNTVTAPQTSGDDNSFGSISFVLLVIKTLANVHKNF 2280

Query: 2281 IDPYNLGRILQRLARDMGSSAGSHLRQGQRMDPDSAVTSSRQSADVGTVISNLKSVLKLI 2340
            +D Y L RILQRLARD+GS+ GSH RQGQR D DSAVTSSRQ+ADVG VI N+KSVL+LI
Sbjct: 2281 VDSYVLVRILQRLARDLGSAVGSHPRQGQRTDSDSAVTSSRQTADVGAVICNIKSVLELI 2340

Query: 2341 NERVMLVPECKRSVTQIMNSLLSEKGTDASVLLCILDVIKGWVEDDFSKMGTSVSSSSFL 2400
            +E VML+ +CKRSVTQI+N+LLSEKGTDASVLLCILD+IK WVEDDFSK G S  S SFL
Sbjct: 2341 DETVMLIADCKRSVTQILNTLLSEKGTDASVLLCILDMIKRWVEDDFSKTGASGLSGSFL 2400

Query: 2401 APKEIVSFLQKLSQVDKQNFSSSAAEEWDGKYLQLLYEICADSNKYPLSLRQEVFQKVER 2460
              K++++FL KLS +DKQ+FSS A EEWD KYLQLLY +CADS KYPL LRQEV  KVER
Sbjct: 2401 TQKDVLTFLNKLSYIDKQHFSSEALEEWDQKYLQLLYGLCADSTKYPLGLRQEVSLKVER 2460

Query: 2461 QFMLGLRARDPETRKKFFTLYHESLGKTLFIRLQYIIQIQDWEALSDVFWLKQGLDLLLA 2520
             FMLGLRA  P  R+KFF LYHESLGKTLF RLQYIIQIQDWEALSDVFWLKQGLDLLLA
Sbjct: 2461 HFMLGLRASHPGMRRKFFLLYHESLGKTLFARLQYIIQIQDWEALSDVFWLKQGLDLLLA 2520

Query: 2521 VLVEDKPITLAPNSARLPPLLVSGHVADSSAVQPQVNDAQEGLEDAPLTFDSLVHKHAQF 2580
            +LVEDKPI+LAPNSAR+ PLL S    D+  +Q Q     EG E+    FDS+V KHAQF
Sbjct: 2521 ILVEDKPISLAPNSARVLPLLPS----DNPGIQHQAPANLEGPEEVTSMFDSIVMKHAQF 2580

Query: 2581 LNRTSKLQVADLIIPLRELAHTDANVAYHLWVLVFPIVWVTLHKEEQVALAKPMISLLSK 2640
            L+ TSKLQVAD++IPLRELAHTDANVAYHLWVLVFPIVWVTL KEEQVALAKPMISLLSK
Sbjct: 2581 LSATSKLQVADVVIPLRELAHTDANVAYHLWVLVFPIVWVTLLKEEQVALAKPMISLLSK 2640

Query: 2641 DYHKKQQASRPNVVQALLEGLQLSHPQPRMPSELIKYIGRTYNAWHIALALLESHVMLFM 2700
            DYHKKQQ  RPNVVQALLEGLQLSHPQPRMPSELIKYIG+TYNAWH+ALALLESHVMLFM
Sbjct: 2641 DYHKKQQGHRPNVVQALLEGLQLSHPQPRMPSELIKYIGKTYNAWHLALALLESHVMLFM 2700

Query: 2701 NETKCSESLAELYRLLNEEDMRCGLWKRKAISAETKAGLSLVQHGYWQRAQILFYQSMVK 2760
            N++KC+ESLAELYRLLNEEDMR GLWK+++I+AET+AGLSLVQHG+WQRAQ LFYQ+MVK
Sbjct: 2701 NDSKCAESLAELYRLLNEEDMRFGLWKKRSITAETRAGLSLVQHGFWQRAQSLFYQAMVK 2760

Query: 2761 ATQGTYNNNVPKAEMCLWEEQWLYCASQLSQWEALVDFGKSIENYEILLDSLWKVPDWAY 2820
            ATQGTYNN VPKAEMCLWEEQWL+CASQLSQW+ALVDFGKSIENYEILLDSLWK+PDWAY
Sbjct: 2761 ATQGTYNNTVPKAEMCLWEEQWLHCASQLSQWDALVDFGKSIENYEILLDSLWKLPDWAY 2820

Query: 2821 MKEHVIPKAQVEETPKLRLIQAYFSLHDRSTNGVADAENIVGKGVDLALEQWWQLPEMSV 2880
            +K+HVIPKAQVEETPKLRL+Q+YF+LHDR++NGV DAEN VGKGVDLALEQWWQLPEMSV
Sbjct: 2821 LKDHVIPKAQVEETPKLRLVQSYFALHDRNSNGVGDAENTVGKGVDLALEQWWQLPEMSV 2880

Query: 2881 HARIPLLQQFQQLVEVQESSRILVDIANGNKHSGSSVVSVHTNLYADLKDILETWRLRIP 2940
            HAR+PLLQQFQQLVEVQES+RI VDIANGNK SG++ V    N YADLKDILETWRLR P
Sbjct: 2881 HARVPLLQQFQQLVEVQESARIHVDIANGNKVSGNTAVGGLGNRYADLKDILETWRLRTP 2940

Query: 2941 NEWDSMTVWCDLLQWRNEMYNAVIDAFKDFGTTNSQLHHLGFRDKAWNVNKLAHVARKQG 3000
            NEWD+MTVW D+LQWRNEMYN VIDAFKDF T+NS LHHLGFRDKAWNVNKLA +ARKQG
Sbjct: 2941 NEWDNMTVWYDMLQWRNEMYNVVIDAFKDFATSNSPLHHLGFRDKAWNVNKLARIARKQG 3000

Query: 3001 LHDVCVGILEKMYGHSTMEVQEAFVKIREQAKAYLEMKGELTSGLNLINSTNLDYFPVKH 3060
            L+DVCV ILEKMYGHSTMEVQEAFVKIREQAKAYLEMKGE  SGLNLINSTNL+YFP K 
Sbjct: 3001 LYDVCVQILEKMYGHSTMEVQEAFVKIREQAKAYLEMKGERASGLNLINSTNLEYFPDKI 3060

Query: 3061 KAEIFRLKGDFQLKLSDSEGANHSYSSAISLFKNLPKGWISWGNYCDMAYKESHEEIWLE 3120
            KAEIFRLKGDF LKL+D+E AN +YS+AI+LFKNLPKGWISWG+YCDMAY+E+ EEIWLE
Sbjct: 3061 KAEIFRLKGDFHLKLNDTESANIAYSNAITLFKNLPKGWISWGSYCDMAYQETQEEIWLE 3120

Query: 3121 YAVSCFLQGIKFGISNSRNHLARVLYLLSFDTPNEPVGRAFDKYLDQIPHWVWLSWIPQL 3180
            YAVSCFLQGI+FG+SNSR+H+ARVLYLLSFDT NEPVGR FDK+LDQ+PHWVWLSWIPQL
Sbjct: 3121 YAVSCFLQGIRFGVSNSRSHIARVLYLLSFDTANEPVGRVFDKHLDQVPHWVWLSWIPQL 3180

Query: 3181 LLSLQRTEAPHCKLVLLKIANVYPQALYYWLRTYLLERRDVANKSELGRMAMAQQRMQQN 3240
            LLSLQRTEAPHCKLVLLKIA V+PQALYYWLRTYLLERRD  NKSELGR+ +A QRMQQN
Sbjct: 3181 LLSLQRTEAPHCKLVLLKIAAVFPQALYYWLRTYLLERRDAVNKSELGRLVLA-QRMQQN 3240

Query: 3241 TSSAGSLGLTDGSSRVAHGGSSTSTDNQVHQGTQSGSGIGSHDGGNSHSQEPERSTGVES 3300
             + AG            HGGS+  ++NQ+HQG Q+    G+HD GN H QE ERST  E+
Sbjct: 3241 ATGAG------------HGGSNLPSENQIHQGAQTSGAGGTHDSGNPHGQESERST-TEN 3300

Query: 3301 STHAGNDQSLPQTSSNVNEGTQNALRRSAALGLVGSAASAFDAAKDIMEALRSKHTNLAS 3360
            + H G+DQ + Q+SS +N+  +N +RR+ A  L  SAA AFDAAKDIMEALR KH NLAS
Sbjct: 3301 NLHPGSDQPMHQSSSAINDNNENTVRRNGA-SLAISAAGAFDAAKDIMEALRGKHNNLAS 3360

Query: 3361 ELEILLTEIGSRFVTLPEERLLAVVNALLHRCYKYPTATTAEVPQSLKKELSGVCKACFS 3420
            ELE+LLTEIGSRFVTLPEERLLAVVNALLHRCYKYPTATTAEVPQ LKKELSGVC+ACFS
Sbjct: 3361 ELEVLLTEIGSRFVTLPEERLLAVVNALLHRCYKYPTATTAEVPQPLKKELSGVCRACFS 3420

Query: 3421 ADAVNKHVDFVREYKQDFERDLDPESTSTFPATLSELTERLKHWKNVLQGNVEDRFPAVL 3480
            ADAV KHV+FV+EYKQDFER LDPEST+TFPATL+ELT RLK WKN+LQ NVEDRFPAVL
Sbjct: 3421 ADAVTKHVEFVKEYKQDFERHLDPESTTTFPATLAELTARLKKWKNILQSNVEDRFPAVL 3480

Query: 3481 RLEDESRVLRDFHVVDVEVPGQYFTDQEIAPDHTVKLDRVGADIPIVRRHGSSFRRLTLI 3540
            RLEDESRVLRDF+VVDVE+PGQYF DQE+APDHTVKLDRVGAD+PIVRRHGSSFRRLTLI
Sbjct: 3481 RLEDESRVLRDFNVVDVEIPGQYFADQEVAPDHTVKLDRVGADVPIVRRHGSSFRRLTLI 3540

Query: 3541 GSDGSQRHFIVQTSLTPNARSDERILQLFRVMNQMFDKHKESRRRHLCIHTPIIIPVWSQ 3600
            GSDGSQ+HFIVQTSLTPNARSDERILQLFRVMNQMFDKHKESRRRH+ IHTPIIIPVWSQ
Sbjct: 3541 GSDGSQKHFIVQTSLTPNARSDERILQLFRVMNQMFDKHKESRRRHIGIHTPIIIPVWSQ 3600

Query: 3601 VRMVEDDLMYSTFLEVYENHCARNDQEADLPITYFKEQLNQAISGQIAPEAVLDLRLQAY 3660
            VRMVEDDLMY+TFLEVYENHCARND+EADLPIT+FKEQLNQAISGQI+ EA+ DLRLQAY
Sbjct: 3601 VRMVEDDLMYNTFLEVYENHCARNDREADLPITHFKEQLNQAISGQISAEAIGDLRLQAY 3660

Query: 3661 GDITRNLVNEGIFSQYMYKTLLSGNHMWAFKKQFAIQLALSSFMSYMLQIGGRSPNKIYF 3720
             DIT+ LVN+ IFSQYMYKTL+SG+HMWAFKKQFA+QLA+SSFMS+MLQIGGRSPNK+ F
Sbjct: 3661 IDITKTLVNDSIFSQYMYKTLMSGSHMWAFKKQFAVQLAVSSFMSFMLQIGGRSPNKVLF 3720

Query: 3721 AKNTGKIFQTDFHPAYDTNGMIEFNEPVPFRLTRNMQAFFSHFGVEGLIVSAMCSAAQAV 3780
            AKNTGK+FQTDFHPAYD NGMIEFNEPVPFRLTRNMQAFFS FGVEGL++S+MCSAAQAV
Sbjct: 3721 AKNTGKMFQTDFHPAYDANGMIEFNEPVPFRLTRNMQAFFSQFGVEGLLMSSMCSAAQAV 3780

Query: 3781 VSPKQNQHLWHQLAMFFRDELLSWSWRRPLGMPLASIAG-GGMSPADFKQKVTINVDHVI 3840
            +S KQN+HL +QLAMFFRDELLSW  RRPLG+P+  + G   ++PA+ K KV  NV+ VI
Sbjct: 3781 ISSKQNEHLRYQLAMFFRDELLSWFGRRPLGVPIPPVGGIATLNPAELKHKVNANVEDVI 3840

Query: 3841 GRINGIAPQYFSEEEENAMDPPQSVQRGVSDLVDAALMPRHLCMMDPTWHPWF 3889
             RI GIAPQYFSEE+EN ++PPQSVQRGV++LV+AAL PR+LCMMDPTWHPWF
Sbjct: 3841 KRIRGIAPQYFSEEDENTVEPPQSVQRGVNELVEAALSPRNLCMMDPTWHPWF 3858

BLAST of MS010599 vs. TAIR 10
Match: AT4G36080.1 (phosphotransferases, alcohol group as acceptor;binding;inositol or phosphatidylinositol kinases )

HSP 1 Score: 6071.1 bits (15749), Expect = 0.0e+00
Identity = 3052/3894 (78.38%), Postives = 3442/3894 (88.39%), Query Frame = 0

Query: 1    MSPIQNFELHSRQLVEPELSIQTRLQMATEVRDSLEIAHTPEYLNFLKCYFRAFSIILVQ 60
            MSPIQNFE HSR+LVEP+L I+ RL M  EVRDSLEI HT EYLNFLKCYFRA S+IL+Q
Sbjct: 1    MSPIQNFEQHSRRLVEPDLPIEERLAMVVEVRDSLEITHTAEYLNFLKCYFRASSVILLQ 60

Query: 61   ITKPQYTDNHEHKLRNIVVEILNRLPHSEVLRPFVQDLLKVAMQVLTTDNEENGLICIRI 120
            ITKPQ+TDN EHKLRNIVVEILNRLPHSEVLRPFVQDLLKVAMQVLTTDNEENGLICIRI
Sbjct: 61   ITKPQFTDNIEHKLRNIVVEILNRLPHSEVLRPFVQDLLKVAMQVLTTDNEENGLICIRI 120

Query: 121  IFDLLRNFRPTLENEVQPFLDFVCKIYQNFKLTVSHFFENSAAGGEDIKPMDVSTSTDQT 180
            IFDLLRNFRPTLENEVQPFLDFVCKIYQNF+LTVSHFFEN     E++KP+++ T +DQ+
Sbjct: 121  IFDLLRNFRPTLENEVQPFLDFVCKIYQNFRLTVSHFFENVKM--EEVKPVEIPTPSDQS 180

Query: 181  IT-TGYTGTVQLNPSTRSFKIVTESPLVVMFLFQLYSRLVQTNIPVLLPLMVSAISVPGP 240
            ++ T  +   Q+NPSTRSFKIVTESPLVVMFLFQLYSRLVQ NIP LLPLMV+AIS+PGP
Sbjct: 181  LSITAPSRNGQINPSTRSFKIVTESPLVVMFLFQLYSRLVQINIPNLLPLMVAAISIPGP 240

Query: 241  EKVPPFLKTHFIELKGAQVKTVSFLTYLLRSSADYIRPHEESICKSIVNLLVTCSDSVSI 300
            EKV   +K  FIELKGAQVKTVSFLTYLL+S A+YI+PHEESICKSIVNLLVTCSDS SI
Sbjct: 241  EKVSSHMKPQFIELKGAQVKTVSFLTYLLKSCAEYIKPHEESICKSIVNLLVTCSDSASI 300

Query: 301  RKELLVALKHVLGTEYKRGLFPLIDTLLEEKVLVGTGRACYETLRPLAYSLLAEIVHHVR 360
            RKELLV+LKHVLGT++KRGLFPLIDTLLEE+VLVGTGRAC+E+LRPLAYSLLAEIVHHVR
Sbjct: 301  RKELLVSLKHVLGTDFKRGLFPLIDTLLEERVLVGTGRACFESLRPLAYSLLAEIVHHVR 360

Query: 361  GDLTLSQLSRIIYLFSSNMHDASLSLSIHTTCARLMLNLVEPIFEKGVDQTSMDEARILL 420
             DL+LSQLSRIIYLFS NMHD++LSL+IHTTCARLMLNLVEPIFEKG+DQ SMDEARILL
Sbjct: 361  ADLSLSQLSRIIYLFSRNMHDSTLSLNIHTTCARLMLNLVEPIFEKGIDQQSMDEARILL 420

Query: 421  GRILDAFVGKFSTFKHTIPQLLEEGEEGKDRANMRSKLELPVQAVLNLQVPVEHSKEVND 480
            GRILDAFVGKF+TFK T+PQLLEEG +GKD+  +RSKLELPVQAVLNLQVP EHSKEVND
Sbjct: 421  GRILDAFVGKFNTFKRTVPQLLEEG-DGKDQITLRSKLELPVQAVLNLQVPAEHSKEVND 480

Query: 481  CKHLIKTLILGMKTIIWSITHAHLPRPQASPSPNGTHPQMLVSPSSNLATPQAFKGMRED 540
            CK+LIKTL++GMKTIIWSITHAHLPRPQ      G HPQ L S SS     Q FKGMRED
Sbjct: 481  CKNLIKTLVMGMKTIIWSITHAHLPRPQ------GMHPQALASQSS---VTQVFKGMRED 540

Query: 541  EVCKASGVLKSGVHCLTLFKEKDEEVEMLHLFSQILTVMEPRDLMDMFSLCMPELFDCMI 600
            EV KASGVLKSGVHCL LFK+KDEE EML+LFSQIL VMEPRDLMDMFS+CMPELF+C+I
Sbjct: 541  EVWKASGVLKSGVHCLALFKDKDEEKEMLNLFSQILAVMEPRDLMDMFSICMPELFECII 600

Query: 601  TNTQLVHLFSTFLQTPKVYRPFADVLVNFLVSSKLDVLKHPDSPGAKLVLHLFRFVFGAV 660
             NTQLV +F+T LQ PKVY+PFADVL+NFLVSSKLDVLK+PDS   KL+LHLFR +FGAV
Sbjct: 601  DNTQLVQIFATLLQAPKVYKPFADVLINFLVSSKLDVLKNPDSAATKLILHLFRCLFGAV 660

Query: 661  AKAPSDFERILQPHVTVIMEVCVRSATEVERPLGYMQLLRIMFRALAGCKFELLLRDLIP 720
            +KAPSDFERILQP V +IMEVC+++ATEVE+PLGYMQLLR +FR LAGCKFELLLRDL+P
Sbjct: 661  SKAPSDFERILQPQVPLIMEVCMKNATEVEKPLGYMQLLRTVFRGLAGCKFELLLRDLVP 720

Query: 721  LLQPCLNMLLTMFDGPTGEDMRDLLLELCLTLPARLSSLLPHLPRLMKPLVLCLKGSDDL 780
            +L PCLN+LLTM +GP GEDMRDLLLEL LTLPARLSSLLP+LPRLM+PLV CL+GSD+L
Sbjct: 721  MLLPCLNILLTMLEGPAGEDMRDLLLELSLTLPARLSSLLPYLPRLMRPLVSCLRGSDEL 780

Query: 781  VSLGLRTLEFWVDSLNPDFLEPSMANVMSEVILALWSHLRPIPYPWGAKALQVLGKLGGR 840
            VSLGLRTLEFWVDSLNPDFLEPSMA VMSEVILALWSHL+P+PYPWG KALQ++GKLGGR
Sbjct: 781  VSLGLRTLEFWVDSLNPDFLEPSMATVMSEVILALWSHLKPVPYPWGGKALQIVGKLGGR 840

Query: 841  NRRFLKEPLALECKENPEHGLRLILTFEPSTPFLVPLDRCINLAVSTVMNKTGGVDSFYR 900
            NRRFLKEPL LECK+NPEHGLRL+LTFEPSTPFLVP+D+ INLAV+ VM K    + +Y+
Sbjct: 841  NRRFLKEPLTLECKDNPEHGLRLVLTFEPSTPFLVPMDKFINLAVAAVMQKNLTTEIYYK 900

Query: 901  KQALKFLRVCLSSQLNLPGNVADDGHTPRQLSTLLVSPVDSSLRRSETPEGKADLGVKTK 960
            KQALKFLRVCL SQLNLPG V D+G T +QLSTLL+S VDS  RRSE+ E +ADLGVKTK
Sbjct: 901  KQALKFLRVCLLSQLNLPGCVTDEGQTTKQLSTLLLSSVDSFWRRSESTEIEADLGVKTK 960

Query: 961  TQLMAEKSVFKILLMTIIAAGSEEDLHEPKDDFVLNVCRHFAILFHIDSSLNSSPVASAS 1020
            TQL+AEKS+FK LL+TIIAA S+ DL +  DDFV+N+CRHFAI+ H D + + +  ++  
Sbjct: 961  TQLIAEKSIFKTLLITIIAASSDPDLSDSDDDFVVNICRHFAIILHGDYTSSYTSTSAGP 1020

Query: 1021 LGSTLLPPNVSANSRLRSSACCNLKELDPLIFLDALVEVLADENRVHAKAALNALNLFSE 1080
            LG +L    +S +S+ +++    LK+LDPLIFLDALV+VLADENR+HAKAAL +LN+F+E
Sbjct: 1021 LGGSL----ISTSSKPKNNWSTYLKQLDPLIFLDALVDVLADENRLHAKAALTSLNVFAE 1080

Query: 1081 ILLFLARAKQTDVMMTRGP-STPMIVSSPSKSPVYSPPPSVRIPVFEQLLPRLLHCCYGS 1140
             LLFLAR K  DV+M RG  S  MIVSSPS +PVYSP PSVRIPVFEQLLPRLLHCCYGS
Sbjct: 1081 TLLFLARIKHADVLMARGAHSASMIVSSPSTNPVYSPHPSVRIPVFEQLLPRLLHCCYGS 1140

Query: 1141 TWQAQMGGIMGLGALVGKVTVETLCLFQVRIVRGLVYVLKRLPIYASKEQEETSQVLNQV 1200
            TWQAQMGG+MGLGALVGKV VETLCLFQV+IVRGLVYV KRLP+YASKEQ+ETSQVL Q+
Sbjct: 1141 TWQAQMGGVMGLGALVGKVNVETLCLFQVKIVRGLVYVQKRLPVYASKEQDETSQVLIQI 1200

Query: 1201 LRVVNNVDEANSEPRRQSFHGVVDILASELFNPNSSTIVRKNVQSCLALLASRTGSEVSE 1260
            LRVVNNVDEAN++ RRQSF  VV+ LA+ELFN N+S  VRKNVQ+CLALLASRTGSEVSE
Sbjct: 1201 LRVVNNVDEANNDARRQSFQDVVEYLATELFNSNASITVRKNVQNCLALLASRTGSEVSE 1260

Query: 1261 LLEPLHQPLLQPLLLRPLRLKTIDQQVGTVTALNFCLALRPPLLKLTQELVNFLQEALQI 1320
            LLEPL+QPLLQPL++RPLR KTIDQQVGTVTALNFCLALRPPLLK+T ELVNFLQEALQI
Sbjct: 1261 LLEPLYQPLLQPLIMRPLRSKTIDQQVGTVTALNFCLALRPPLLKVTPELVNFLQEALQI 1320

Query: 1321 AEADETVWVVKFMNPKIATSLNKLRTACIELLCTTMAWADFKTPNHSELRAKIISMFFKS 1380
            AEADE +W VK M+PK+ TSLN+LRTACIE+LCTTMAWADF+T +H+ELRAKIISMFFKS
Sbjct: 1321 AEADEALWAVKLMSPKVLTSLNRLRTACIEILCTTMAWADFRTQSHNELRAKIISMFFKS 1380

Query: 1381 LTCRTPEVVAVAKEGLRQVINQQRMPKDLLQGSLRPILVNLAHTKNLSMPLLQGLARLLE 1440
            LTCR PE+V VAKEGLRQVINQQRMPK+LLQ SLRPILVNLA TKNL+MPLLQGLARLLE
Sbjct: 1381 LTCRAPEIVTVAKEGLRQVINQQRMPKELLQSSLRPILVNLAQTKNLNMPLLQGLARLLE 1440

Query: 1441 LLASWFNVTLGGKLLEHLKKWLEPEKLAQSQKAWKAGEEPKIAAAIIELFHLLPMAASKF 1500
            LL++WFNVTLG KLLEHLKKWLEPEKLAQSQK+WKAGEEPKIAAAIIELFHLLP+AASKF
Sbjct: 1441 LLSNWFNVTLGCKLLEHLKKWLEPEKLAQSQKSWKAGEEPKIAAAIIELFHLLPLAASKF 1500

Query: 1501 LDELVTLTIDLEGALPPGQVYSEVNSPYRVPLIKFLNRYAPLAVDYFLARLSEPKYFRRF 1560
            LDELVTLTIDLE ALPPGQVYSE+NSPYR+PL KFLNRYA LAVDYFL+RLSEPKYFRRF
Sbjct: 1501 LDELVTLTIDLEAALPPGQVYSEINSPYRLPLTKFLNRYATLAVDYFLSRLSEPKYFRRF 1560

Query: 1561 MYIIRSDAGQPLREELAKSPQKILASAFPEFAPKSEAALTPGSSTSPAPLSGDEGLVTP- 1620
            MYIIRSDAGQPLREELAKSP KIL+ AFPE  PKS+A L+  +ST PA  SGDE   TP 
Sbjct: 1561 MYIIRSDAGQPLREELAKSPHKILSYAFPEILPKSDAILSAAASTPPAASSGDE-KPTPM 1620

Query: 1621 --DASDPPSAPSSVVSDAYFRGLALIKTLVKLMPGWLQNNRVVFDTLVLVWKSPARIARL 1680
              ++S+ PS  S+V SDAYF+GL LIKT+VKL+P WLQ+NR +FD L  +WKS AR +RL
Sbjct: 1621 KSESSNTPSTKSNVASDAYFQGLYLIKTMVKLIPSWLQSNRTIFDALAHLWKSHARTSRL 1680

Query: 1681 HNEQELNLVQVKESKWLVKCFLNYLRHEKAEVNVLFDILSIFLFHTRIDYTFLKEFYIIE 1740
             NEQ L LVQVKESKWLVKCFLNYLRHEK+E+NVLFD+L IFLFH+RIDYTFL+EFYIIE
Sbjct: 1681 QNEQNLTLVQVKESKWLVKCFLNYLRHEKSEMNVLFDVLLIFLFHSRIDYTFLREFYIIE 1740

Query: 1741 VAEGYPPNMKKALLLHFLNLFQSKQLGHDHLVIVMQMLILPMLAHAFQNGQSWEVVDQAI 1800
            VAE YPPNMKKA++LHFLNLFQSKQLGHDHLV  MQMLILPMLAHAFQNGQ+WEV+D  I
Sbjct: 1741 VAEEYPPNMKKAIVLHFLNLFQSKQLGHDHLVQAMQMLILPMLAHAFQNGQTWEVIDPDI 1800

Query: 1801 IKTIVDKLLDPPEEVSAEYDEPLRIELLQLATLLLKYLQSDLVHHRKELIKFGWNHLKRE 1860
            +KTIV++LLDPPEEVSAEYDEPLRIELLQLATLLLKYLQSDLV HRKELIKFGWNHLKRE
Sbjct: 1801 VKTIVERLLDPPEEVSAEYDEPLRIELLQLATLLLKYLQSDLVQHRKELIKFGWNHLKRE 1860

Query: 1861 DSASKQWAFVNVCHFLEAYQAPEKIILQVFVALLRTCQPENKMLVKQALDILMPALPRRL 1920
            DSASKQWAFVNVCHFL+AYQAPEKIILQVFVALLRTCQPENKMLVKQALDILMPALP+RL
Sbjct: 1861 DSASKQWAFVNVCHFLDAYQAPEKIILQVFVALLRTCQPENKMLVKQALDILMPALPKRL 1920

Query: 1921 PLGDSRMPIWIRYTKKILVEEGHSIPNLIHIFQLIVRHSDLFYSCRAQFVPQMVNSLSRL 1980
            PLGDSRMPIWIRYTKKILVEEGHSIPNLIHIFQLIVRHSDLFYSCRAQFVPQMVNSLSRL
Sbjct: 1921 PLGDSRMPIWIRYTKKILVEEGHSIPNLIHIFQLIVRHSDLFYSCRAQFVPQMVNSLSRL 1980

Query: 1981 GLPYNTTAENRRLAIDLAGLVVGWERQRQNEMKLVTESDAPSHSNDGLTCPPGADPKRMV 2040
            GLPYNTTAENRRLAI+LAGLVV WERQRQNE K+VT+ DA S  +DGL    G DPK   
Sbjct: 1981 GLPYNTTAENRRLAIELAGLVVSWERQRQNESKMVTDGDATSEVSDGLHPSSGVDPKLST 2040

Query: 2041 DGSTFPEDSTKRVKVEPGLQSLCVMSPGGASSMPNIETPGSTTQPDEEFKPNAAMEEMII 2100
             GS+  ED +KRVK+EPGL SLCVMSPGGASS+PN+ETPGS TQPDEEFKPNAAMEE+II
Sbjct: 2041 AGSSISEDPSKRVKIEPGLPSLCVMSPGGASSIPNVETPGSATQPDEEFKPNAAMEELII 2100

Query: 2101 NFLIRVALVIEPKDKEATAMYKQALELLSQALEVWPNANVKFNYLEKLLSSIQPSQSKDP 2160
            NFLIRVA+VIEPKD+EA  MYKQAL+ LSQALEVWPNANVKFNYLEKLLSS+ PSQS DP
Sbjct: 2101 NFLIRVAVVIEPKDREANTMYKQALDFLSQALEVWPNANVKFNYLEKLLSSMPPSQS-DP 2160

Query: 2161 STALAQGLDVMNKVLEKQPHLFVRNNINQISQILEPCFKHKMLDAGKSLCSLLRMVFVAY 2220
            STALAQGLDVMNKVLEKQPHLF++NNI+QISQ LE  FKHKMLDAGKSLCSLL+MVF+A+
Sbjct: 2161 STALAQGLDVMNKVLEKQPHLFIKNNISQISQFLELSFKHKMLDAGKSLCSLLKMVFIAF 2220

Query: 2221 PLEGVTTPPDVKLLYQKVDELIKNHINNLTAPQTSSEDNTASSISFVLLVIKTLTEVQKN 2280
            P +G +TPP++KLLYQKV+ELI+ H++ +TA Q S +DN+  S+SFVL+V+KTL EVQK+
Sbjct: 2221 PQDGASTPPEIKLLYQKVNELIQKHVHVVTASQASGDDNSLGSVSFVLVVLKTLAEVQKH 2280

Query: 2281 LIDPYNLGRILQRLARDMGSSAGSHLRQGQRMDPDSAVTSSRQSADVGTVISNLKSVLKL 2340
             +DPY L  ILQRL+RD+G +AG+H RQ QR++         +SADVG V+SN+K VL+L
Sbjct: 2281 FLDPYVLVHILQRLSRDLGLAAGAHPRQSQRIE--------SESADVGAVVSNIKLVLEL 2340

Query: 2341 INERVMLVPECKRSVTQIMNSLLSEKGTDASVLLCILDVIKGWVEDDFSKMGTSVSSSSF 2400
            I+ERVML+ +CKR VTQI+N+LLSEKGTD+S+LLC+LD++K W EDDF K G+S SS +F
Sbjct: 2341 IDERVMLLADCKRPVTQILNTLLSEKGTDSSLLLCVLDMLKRWAEDDFGKKGSSGSSGAF 2400

Query: 2401 LAPKEIVSFLQKLSQVDKQNFSSSAAEEWDGKYLQLLYEICADSNKYPLSLRQEVFQKVE 2460
            L  K+IVSFLQKLSQVDKQ+FSS A +EWD  YLQLLY +CADS KYPL+LRQE+  KVE
Sbjct: 2401 LTQKDIVSFLQKLSQVDKQHFSSVALDEWDKVYLQLLYGLCADSTKYPLALRQEISLKVE 2460

Query: 2461 RQFMLGLRARDPETRKKFFTLYHESLGKTLFIRLQYIIQIQDWEALSDVFWLKQGLDLLL 2520
            R  MLGLRARDP+ R+KFF LYHESLG  LF RLQYIIQ QDWEA+SDVFWLKQGLDLLL
Sbjct: 2461 RHSMLGLRARDPDMRRKFFLLYHESLGNNLFARLQYIIQNQDWEAMSDVFWLKQGLDLLL 2520

Query: 2521 AVLVEDKPITLAPNSARLPPLLVSGHVADSSAVQPQVNDAQEGLEDAPLTFDSLVHKHAQ 2580
            A+L+E+KPITLAPNSAR+ PLL S     +  V  Q     EG E+    FDS+V KH+Q
Sbjct: 2521 AILIEEKPITLAPNSARVVPLLPS----QNPGVHHQPPVMPEGPEEVASMFDSIVMKHSQ 2580

Query: 2581 FLNRTSKLQVADLIIPLRELAHTDANVAYHLWVLVFPIVWVTLHKEEQVALAKPMISLLS 2640
            FL+  SKLQVAD++IPLRELAHTDANVAYHLWVLVFPIVW TLHKEEQ+ALAKPMISLLS
Sbjct: 2581 FLSAASKLQVADVVIPLRELAHTDANVAYHLWVLVFPIVWATLHKEEQIALAKPMISLLS 2640

Query: 2641 KDYHKKQQASRPNVVQALLEGLQLSHPQPRMPSELIKYIGRTYNAWHIALALLESHVMLF 2700
            KDYHKKQQ  RPNVVQALLEGLQLSHPQPRMPSELIKYIG+TYNAWH+AL LLE+HVMLF
Sbjct: 2641 KDYHKKQQGHRPNVVQALLEGLQLSHPQPRMPSELIKYIGKTYNAWHLALTLLETHVMLF 2700

Query: 2701 MNETKCSESLAELYRLLNEEDMRCGLWKRKAISAETKAGLSLVQHGYWQRAQILFYQSMV 2760
             N++KC+ESLAELYRLLNEED R GLWK ++I+ E++AG S+VQHG+WQRAQ LFYQ+MV
Sbjct: 2701 TNDSKCAESLAELYRLLNEEDRRFGLWKSRSITTESRAGFSMVQHGFWQRAQSLFYQAMV 2760

Query: 2761 KATQGTYNNNVPKAEMCLWEEQWLYCASQLSQWEALVDFGKSIENYEILLDSLWKVPDWA 2820
            KATQGTYNN VPK EMCLWEEQWL+CA+QL QW+ALVDFGKS ENYEILLDSLWK PDW 
Sbjct: 2761 KATQGTYNNTVPKTEMCLWEEQWLHCATQLGQWDALVDFGKSTENYEILLDSLWKAPDWT 2820

Query: 2821 YMKEHVIPKAQVEETPKLRLIQAYFSLHDRSTNGVADAENIVGKGVDLALEQWWQLPEMS 2880
            Y+K+HVIPKAQVEETPKLRL+QA FSLH+++ NGV DAENIVGKGVDLALEQWWQLPEMS
Sbjct: 2821 YLKDHVIPKAQVEETPKLRLVQACFSLHEKNANGVGDAENIVGKGVDLALEQWWQLPEMS 2880

Query: 2881 VHARIPLLQQFQQLVEVQESSRILVDIANGNKHSGSSVVSVHTNLYADLKDILETWRLRI 2940
            +HAR+PLLQQFQQLVEVQESSRI VDIANG+K  G++ V    NLYADLKDILETWRLR 
Sbjct: 2881 LHARVPLLQQFQQLVEVQESSRIYVDIANGSKVPGNAAVGGQGNLYADLKDILETWRLRT 2940

Query: 2941 PNEWDSMTVWCDLLQWRNEMYNAVIDAFKDFGTTNSQLHHLGFRDKAWNVNKLAHVARKQ 3000
            PNEWD+MTVW D+LQWRNEMYN VIDAFKDF T+N+ LHHLG+RDKAWNVNKLA +ARKQ
Sbjct: 2941 PNEWDNMTVWYDMLQWRNEMYNVVIDAFKDFVTSNTPLHHLGYRDKAWNVNKLARIARKQ 3000

Query: 3001 GLHDVCVGILEKMYGHSTMEVQEAFVKIREQAKAYLEMKGELTSGLNLINSTNLDYFPVK 3060
            GL+DVCV ILEKMYGHS MEVQEAFVKI+EQAKA+LE KGEL +GLNL+NSTNL++F  K
Sbjct: 3001 GLYDVCVQILEKMYGHSQMEVQEAFVKIKEQAKAHLETKGELATGLNLVNSTNLEFFLAK 3060

Query: 3061 HKAEIFRLKGDFQLKLSDSEGANHSYSSAISLFKNLPKGWISWGNYCDMAYKESHEEIWL 3120
            +KAEIFRLKGDF LKL+D+EGAN +YS+AI+LFKNLPKGWISWGNYCDMAY+++ +EIWL
Sbjct: 3061 NKAEIFRLKGDFHLKLNDTEGANLAYSNAITLFKNLPKGWISWGNYCDMAYQDTQDEIWL 3120

Query: 3121 EYAVSCFLQGIKFGISNSRNHLARVLYLLSFDTPNEPVGRAFDKYLDQIPHWVWLSWIPQ 3180
            EYAVSCFLQGI+FG+SNSR+H+ARVLYLLSFD  NEPVGR FDK+LDQ+PHWVWLSWIPQ
Sbjct: 3121 EYAVSCFLQGIRFGVSNSRSHMARVLYLLSFDPTNEPVGRIFDKHLDQVPHWVWLSWIPQ 3180

Query: 3181 LLLSLQRTEAPHCKLVLLKIANVYPQALYYWLRTYLLERRDVANKSELGRMAMAQQRMQQ 3240
            LL+SLQRTEAPHCKLVL+KIA V+PQALYYWLRTYLLERRD  NKSEL R+ +A QRMQQ
Sbjct: 3181 LLISLQRTEAPHCKLVLMKIAAVFPQALYYWLRTYLLERRDAVNKSELSRVVLA-QRMQQ 3240

Query: 3241 NTSSAGSLGLTDGSSRVAHGGSSTSTDNQVHQGTQSGSGIGSHDGGNSHSQEPERSTGVE 3300
            N     +           HGG +  ++ Q+HQG+Q+   +G+HDGGN H QE ER+T + 
Sbjct: 3241 NVPGVSA----------GHGGGNLPSETQIHQGSQTSGAVGTHDGGNLHVQESERATMI- 3300

Query: 3301 SSTHAGNDQSLPQTSSNVNEGTQNALRRSAALGLVGSAASAFDAAKDIMEALRSKHTNLA 3360
            ++ H+GNDQ + Q+SS                 +  SAA AFDAAKD+MEALRSKH NLA
Sbjct: 3301 NNVHSGNDQPMNQSSS-----------------MAISAAGAFDAAKDVMEALRSKHNNLA 3360

Query: 3361 SELEILLTEIGSRFVTLPEERLLAVVNALLHRCYKYPTATTAEVPQSLKKELSGVCKACF 3420
            SELE+LLTEIGSRFVTLPEERLLAVVNALLHRCYKYPTATTAEVPQ LKKELSGVC+ACF
Sbjct: 3361 SELEVLLTEIGSRFVTLPEERLLAVVNALLHRCYKYPTATTAEVPQPLKKELSGVCRACF 3420

Query: 3421 SADAVNKHVDFVREYKQDFERDLDPESTSTFPATLSELTERLKHWKNVLQGNVEDRFPAV 3480
            SADAV KHV FVREYKQDFERDLDPES S FP TL++LT++LK WKN+LQ NVEDRFP +
Sbjct: 3421 SADAVTKHVAFVREYKQDFERDLDPESNS-FPVTLADLTKKLKDWKNILQSNVEDRFPVL 3480

Query: 3481 LRLEDESRVLRDFHVVDVEVPGQYFTDQEIAPDHTVKLDRVGADIPIVRRHGSSFRRLTL 3540
            LRLEDES+VLRDF+VVDVE+PGQYF DQE+APDHTVKLDRVGADI IVRRHGSS RRLTL
Sbjct: 3481 LRLEDESKVLRDFNVVDVEIPGQYFADQEVAPDHTVKLDRVGADIQIVRRHGSSCRRLTL 3540

Query: 3541 IGSDGSQRHFIVQTSLTPNARSDERILQLFRVMNQMFDKHKESRRRHLCIHTPIIIPVWS 3600
            IGSDGSQ+HFIVQTSLTPNARSDERILQLFRVMNQMFDKHKESRRRHL +HTPIIIPVWS
Sbjct: 3541 IGSDGSQKHFIVQTSLTPNARSDERILQLFRVMNQMFDKHKESRRRHLGLHTPIIIPVWS 3600

Query: 3601 QVRMVEDDLMYSTFLEVYENHCARNDQEADLPITYFKEQLNQAISGQIAPEAVLDLRLQA 3660
            QVRMVEDDLMY+TFLEVYENHC RN +E+DLPITYFKE+LNQAI+GQI+PEA+ DLRLQA
Sbjct: 3601 QVRMVEDDLMYNTFLEVYENHCGRNGRESDLPITYFKEKLNQAITGQISPEAIGDLRLQA 3660

Query: 3661 YGDITRNLVNEGIFSQYMYKTLLSGNHMWAFKKQFAIQLALSSFMSYMLQIGGRSPNKIY 3720
            YG+IT+N+VN+ IFSQYMYKT +SG+H+WAFKKQFA+QLA+S+FMS++LQIGGRSPNKI 
Sbjct: 3661 YGEITKNIVNDTIFSQYMYKTSMSGSHLWAFKKQFAVQLAVSNFMSFILQIGGRSPNKIL 3720

Query: 3721 FAKNTGKIFQTDFHPAYDTNGMIEFNEPVPFRLTRNMQAFFSHFGVEGLIVSAMCSAAQA 3780
            FAKN+GK+FQTDFHP+YD+NGMIE NEPVPFRLTRNM AF SHFGVEG ++S MCSA+QA
Sbjct: 3721 FAKNSGKMFQTDFHPSYDSNGMIELNEPVPFRLTRNMHAFLSHFGVEGPLMSNMCSASQA 3780

Query: 3781 VVSPKQNQHLWHQLAMFFRDELLSWSWRRPLGMPLASIAG-GGMSPADFKQKVTINVDHV 3840
            V S KQN+HL +QLAMFFRDELLSW  RRPLG+P+  +AG   +S  + K KV  NVD V
Sbjct: 3781 VFSSKQNEHLRYQLAMFFRDELLSWFGRRPLGVPIPPVAGIATLSSPELKHKVNSNVDDV 3834

Query: 3841 IGRINGIAPQYFSEEEENAMDPPQSVQRGVSDLVDAALMPRHLCMMDPTWHPWF 3889
            IGRI GIAPQYFSEE+EN+++PPQSVQRGVS+LV+AAL PR+LCMMDPTWHPWF
Sbjct: 3841 IGRIRGIAPQYFSEEDENSVEPPQSVQRGVSELVEAALSPRNLCMMDPTWHPWF 3834

BLAST of MS010599 vs. TAIR 10
Match: AT4G36080.3 (phosphotransferases, alcohol group as acceptor;binding;inositol or phosphatidylinositol kinases )

HSP 1 Score: 6011.0 bits (15593), Expect = 0.0e+00
Identity = 3029/3894 (77.79%), Postives = 3418/3894 (87.78%), Query Frame = 0

Query: 1    MSPIQNFELHSRQLVEPELSIQTRLQMATEVRDSLEIAHTPEYLNFLKCYFRAFSIILVQ 60
            MSPIQNFE HSR+LVEP+L I+ RL M  EVRDSLEI HT EYLNFLKCYFRA S+IL+Q
Sbjct: 1    MSPIQNFEQHSRRLVEPDLPIEERLAMVVEVRDSLEITHTAEYLNFLKCYFRASSVILLQ 60

Query: 61   ITKPQYTDNHEHKLRNIVVEILNRLPHSEVLRPFVQDLLKVAMQVLTTDNEENGLICIRI 120
            ITKPQ+TDN EHKLRNIVVEILNRLPHSEVLRPFVQDLLKVAMQVLTTDNEENGLICIRI
Sbjct: 61   ITKPQFTDNIEHKLRNIVVEILNRLPHSEVLRPFVQDLLKVAMQVLTTDNEENGLICIRI 120

Query: 121  IFDLLRNFRPTLENEVQPFLDFVCKIYQNFKLTVSHFFENSAAGGEDIKPMDVSTSTDQT 180
            IFDLLRNFRPTLENEVQPFLDFVCKIYQNF+LTVSHFFEN     E++KP+++ T +DQ+
Sbjct: 121  IFDLLRNFRPTLENEVQPFLDFVCKIYQNFRLTVSHFFENVKM--EEVKPVEIPTPSDQS 180

Query: 181  IT-TGYTGTVQLNPSTRSFKIVTESPLVVMFLFQLYSRLVQTNIPVLLPLMVSAISVPGP 240
            ++ T  +   Q+NPSTRSFKIVTESPLVVMFLFQLYSRLVQ NIP LLPLMV+AIS+PGP
Sbjct: 181  LSITAPSRNGQINPSTRSFKIVTESPLVVMFLFQLYSRLVQINIPNLLPLMVAAISIPGP 240

Query: 241  EKVPPFLKTHFIELKGAQVKTVSFLTYLLRSSADYIRPHEESICKSIVNLLVTCSDSVSI 300
            EKV   +K  FIELKGAQVKTVSFLTYLL+S A+YI+PHEESICKSIVNLLVTCSDS SI
Sbjct: 241  EKVSSHMKPQFIELKGAQVKTVSFLTYLLKSCAEYIKPHEESICKSIVNLLVTCSDSASI 300

Query: 301  RKELLVALKHVLGTEYKRGLFPLIDTLLEEKVLVGTGRACYETLRPLAYSLLAEIVHHVR 360
            RKELLV+LKHVLGT++KRGLFPLIDTLLEE+VLVGTGRAC+E+LRPLAYSLLAEIVHHVR
Sbjct: 301  RKELLVSLKHVLGTDFKRGLFPLIDTLLEERVLVGTGRACFESLRPLAYSLLAEIVHHVR 360

Query: 361  GDLTLSQLSRIIYLFSSNMHDASLSLSIHTTCARLMLNLVEPIFEKGVDQTSMDEARILL 420
             DL+LSQLSRIIYLFS NMHD++LSL+IHTTCARLMLNLVEPIFEKG+DQ SMDEARILL
Sbjct: 361  ADLSLSQLSRIIYLFSRNMHDSTLSLNIHTTCARLMLNLVEPIFEKGIDQQSMDEARILL 420

Query: 421  GRILDAFVGKFSTFKHTIPQLLEEGEEGKDRANMRSKLELPVQAVLNLQVPVEHSKEVND 480
            GRILDAFVGKF+TFK T+PQLLEEG +GKD+  +RSKLELPVQAVLNLQVP EHSKEVND
Sbjct: 421  GRILDAFVGKFNTFKRTVPQLLEEG-DGKDQITLRSKLELPVQAVLNLQVPAEHSKEVND 480

Query: 481  CKHLIKTLILGMKTIIWSITHAHLPRPQASPSPNGTHPQMLVSPSSNLATPQAFKGMRED 540
            CK+LIKTL++GMKTIIWSITHAHLPRPQ      G HPQ L S SS     Q FKGMRED
Sbjct: 481  CKNLIKTLVMGMKTIIWSITHAHLPRPQ------GMHPQALASQSS---VTQVFKGMRED 540

Query: 541  EVCKASGVLKSGVHCLTLFKEKDEEVEMLHLFSQILTVMEPRDLMDMFSLCMPELFDCMI 600
            EV KASGVLKSGVHCL LFK+KDEE EML+LFSQIL VMEPRDLMDMFS+CMPELF+C+I
Sbjct: 541  EVWKASGVLKSGVHCLALFKDKDEEKEMLNLFSQILAVMEPRDLMDMFSICMPELFECII 600

Query: 601  TNTQLVHLFSTFLQTPKVYRPFADVLVNFLVSSKLDVLKHPDSPGAKLVLHLFRFVFGAV 660
             NTQLV +F+T LQ PKVY+PFADVL+NFLVSSKLDVLK+PDS   KL+LHLFR +FGAV
Sbjct: 601  DNTQLVQIFATLLQAPKVYKPFADVLINFLVSSKLDVLKNPDSAATKLILHLFRCLFGAV 660

Query: 661  AKAPSDFERILQPHVTVIMEVCVRSATEVERPLGYMQLLRIMFRALAGCKFELLLRDLIP 720
            +KAPSDFERILQP V +IMEVC+++ATEVE+PLGYMQLLR +FR LAGCKFELLLRDL+P
Sbjct: 661  SKAPSDFERILQPQVPLIMEVCMKNATEVEKPLGYMQLLRTVFRGLAGCKFELLLRDLVP 720

Query: 721  LLQPCLNMLLTMFDGPTGEDMRDLLLELCLTLPARLSSLLPHLPRLMKPLVLCLKGSDDL 780
            +L PCLN+LLTM +GP GEDMRDLLLEL LTLPARLSSLLP+LPRLM+PLV CL+GSD+L
Sbjct: 721  MLLPCLNILLTMLEGPAGEDMRDLLLELSLTLPARLSSLLPYLPRLMRPLVSCLRGSDEL 780

Query: 781  VSLGLRTLEFWVDSLNPDFLEPSMANVMSEVILALWSHLRPIPYPWGAKALQVLGKLGGR 840
            VSLGLRTLEFWVDSLNPDFLEPSMA VMSEVILALWSHL+P+PYPWG KALQ++GKLGGR
Sbjct: 781  VSLGLRTLEFWVDSLNPDFLEPSMATVMSEVILALWSHLKPVPYPWGGKALQIVGKLGGR 840

Query: 841  NRRFLKEPLALECKENPEHGLRLILTFEPSTPFLVPLDRCINLAVSTVMNKTGGVDSFYR 900
            NRRFLKEPL LECK+NPEHGLRL+LTFEPSTPFLVP+D+ INLAV+ VM K    + +Y+
Sbjct: 841  NRRFLKEPLTLECKDNPEHGLRLVLTFEPSTPFLVPMDKFINLAVAAVMQKNLTTEIYYK 900

Query: 901  KQALKFLRVCLSSQLNLPGNVADDGHTPRQLSTLLVSPVDSSLRRSETPEGKADLGVKTK 960
            KQALKFLRVCL SQLNLPG V D+G T +QLSTLL+S VDS  RRSE+ E +ADLGVKTK
Sbjct: 901  KQALKFLRVCLLSQLNLPGCVTDEGQTTKQLSTLLLSSVDSFWRRSESTEIEADLGVKTK 960

Query: 961  TQLMAEKSVFKILLMTIIAAGSEEDLHEPKDDFVLNVCRHFAILFHIDSSLNSSPVASAS 1020
            TQL+AEKS+FK LL+TIIAA S+ DL +  DDFV+N+CRHFAI+ H D + + +  ++  
Sbjct: 961  TQLIAEKSIFKTLLITIIAASSDPDLSDSDDDFVVNICRHFAIILHGDYTSSYTSTSAGP 1020

Query: 1021 LGSTLLPPNVSANSRLRSSACCNLKELDPLIFLDALVEVLADENRVHAKAALNALNLFSE 1080
            LG +L    +S +S+ +++    LK+LDPLIFLDALV+VLADENR+HAKAAL +LN+F+E
Sbjct: 1021 LGGSL----ISTSSKPKNNWSTYLKQLDPLIFLDALVDVLADENRLHAKAALTSLNVFAE 1080

Query: 1081 ILLFLARAKQTDVMMTRGP-STPMIVSSPSKSPVYSPPPSVRIPVFEQLLPRLLHCCYGS 1140
             LLFLAR K  DV+M RG  S  MIVSSPS +PVYSP PSVRIPVFEQLLPRLLHCCYGS
Sbjct: 1081 TLLFLARIKHADVLMARGAHSASMIVSSPSTNPVYSPHPSVRIPVFEQLLPRLLHCCYGS 1140

Query: 1141 TWQAQMGGIMGLGALVGKVTVETLCLFQVRIVRGLVYVLKRLPIYASKEQEETSQVLNQV 1200
            TWQAQMGG+MGLGALVGKV VETLCLFQV+IVRGLVYV KRLP+YASKEQ+ETSQVL Q+
Sbjct: 1141 TWQAQMGGVMGLGALVGKVNVETLCLFQVKIVRGLVYVQKRLPVYASKEQDETSQVLIQI 1200

Query: 1201 LRVVNNVDEANSEPRRQSFHGVVDILASELFNPNSSTIVRKNVQSCLALLASRTGSEVSE 1260
            LRVVNNVDEAN++ RRQSF  VV+ LA+ELFN N+S  VRKNVQ+CLALLASRTGSEVSE
Sbjct: 1201 LRVVNNVDEANNDARRQSFQDVVEYLATELFNSNASITVRKNVQNCLALLASRTGSEVSE 1260

Query: 1261 LLEPLHQPLLQPLLLRPLRLKTIDQQVGTVTALNFCLALRPPLLKLTQELVNFLQEALQI 1320
            LLEPL+QPLLQPL++RPLR KTIDQQVGTVTALNFCLALRPPLLK+T ELVNFLQEALQI
Sbjct: 1261 LLEPLYQPLLQPLIMRPLRSKTIDQQVGTVTALNFCLALRPPLLKVTPELVNFLQEALQI 1320

Query: 1321 AEADETVWVVKFMNPKIATSLNKLRTACIELLCTTMAWADFKTPNHSELRAKIISMFFKS 1380
            AEADE +W VK M+PK+ TSLN+LRTACIE+LCTTMAWADF+T +H+ELRAKIISMFFKS
Sbjct: 1321 AEADEALWAVKLMSPKVLTSLNRLRTACIEILCTTMAWADFRTQSHNELRAKIISMFFKS 1380

Query: 1381 LTCRTPEVVAVAKEGLRQVINQQRMPKDLLQGSLRPILVNLAHTKNLSMPLLQGLARLLE 1440
            LTCR PE+V VAKEGLRQVINQQRMPK+LLQ SLRPILVNLA TKNL+MPLLQGLARLLE
Sbjct: 1381 LTCRAPEIVTVAKEGLRQVINQQRMPKELLQSSLRPILVNLAQTKNLNMPLLQGLARLLE 1440

Query: 1441 LLASWFNVTLGGKLLEHLKKWLEPEKLAQSQKAWKAGEEPKIAAAIIELFHLLPMAASKF 1500
            LL++WFNVTLG KLLEHLKKWLEPEKLAQSQK+WKAGEEPKIAAAIIELFHLLP+AASKF
Sbjct: 1441 LLSNWFNVTLGCKLLEHLKKWLEPEKLAQSQKSWKAGEEPKIAAAIIELFHLLPLAASKF 1500

Query: 1501 LDELVTLTIDLEGALPPGQVYSEVNSPYRVPLIKFLNRYAPLAVDYFLARLSEPKYFRRF 1560
            LDELVTLTIDLE ALPPGQVYSE+NSPYR+PL K                         F
Sbjct: 1501 LDELVTLTIDLEAALPPGQVYSEINSPYRLPLTK-------------------------F 1560

Query: 1561 MYIIRSDAGQPLREELAKSPQKILASAFPEFAPKSEAALTPGSSTSPAPLSGDEGLVTP- 1620
            MYIIRSDAGQPLREELAKSP KIL+ AFPE  PKS+A L+  +ST PA  SGDE   TP 
Sbjct: 1561 MYIIRSDAGQPLREELAKSPHKILSYAFPEILPKSDAILSAAASTPPAASSGDE-KPTPM 1620

Query: 1621 --DASDPPSAPSSVVSDAYFRGLALIKTLVKLMPGWLQNNRVVFDTLVLVWKSPARIARL 1680
              ++S+ PS  S+V SDAYF+GL LIKT+VKL+P WLQ+NR +FD L  +WKS AR +RL
Sbjct: 1621 KSESSNTPSTKSNVASDAYFQGLYLIKTMVKLIPSWLQSNRTIFDALAHLWKSHARTSRL 1680

Query: 1681 HNEQELNLVQVKESKWLVKCFLNYLRHEKAEVNVLFDILSIFLFHTRIDYTFLKEFYIIE 1740
             NEQ L LVQVKESKWLVKCFLNYLRHEK+E+NVLFD+L IFLFH+RIDYTFL+EFYIIE
Sbjct: 1681 QNEQNLTLVQVKESKWLVKCFLNYLRHEKSEMNVLFDVLLIFLFHSRIDYTFLREFYIIE 1740

Query: 1741 VAEGYPPNMKKALLLHFLNLFQSKQLGHDHLVIVMQMLILPMLAHAFQNGQSWEVVDQAI 1800
            VAE YPPNMKKA++LHFLNLFQSKQLGHDHLV  MQMLILPMLAHAFQNGQ+WEV+D  I
Sbjct: 1741 VAEEYPPNMKKAIVLHFLNLFQSKQLGHDHLVQAMQMLILPMLAHAFQNGQTWEVIDPDI 1800

Query: 1801 IKTIVDKLLDPPEEVSAEYDEPLRIELLQLATLLLKYLQSDLVHHRKELIKFGWNHLKRE 1860
            +KTIV++LLDPPEEVSAEYDEPLRIELLQLATLLLKYLQSDLV HRKELIKFGWNHLKRE
Sbjct: 1801 VKTIVERLLDPPEEVSAEYDEPLRIELLQLATLLLKYLQSDLVQHRKELIKFGWNHLKRE 1860

Query: 1861 DSASKQWAFVNVCHFLEAYQAPEKIILQVFVALLRTCQPENKMLVKQALDILMPALPRRL 1920
            DSASKQWAFVNVCHFL+AYQAPEKIILQVFVALLRTCQPENKMLVKQALDILMPALP+RL
Sbjct: 1861 DSASKQWAFVNVCHFLDAYQAPEKIILQVFVALLRTCQPENKMLVKQALDILMPALPKRL 1920

Query: 1921 PLGDSRMPIWIRYTKKILVEEGHSIPNLIHIFQLIVRHSDLFYSCRAQFVPQMVNSLSRL 1980
            PLGDSRMPIWIRYTKKILVEEGHSIPNLIHIFQLIVRHSDLFYSCRAQFVPQMVNSLSRL
Sbjct: 1921 PLGDSRMPIWIRYTKKILVEEGHSIPNLIHIFQLIVRHSDLFYSCRAQFVPQMVNSLSRL 1980

Query: 1981 GLPYNTTAENRRLAIDLAGLVVGWERQRQNEMKLVTESDAPSHSNDGLTCPPGADPKRMV 2040
            GLPYNTTAENRRLAI+LAGLVV WERQRQNE K+VT+ DA S  +DGL    G DPK   
Sbjct: 1981 GLPYNTTAENRRLAIELAGLVVSWERQRQNESKMVTDGDATSEVSDGLHPSSGVDPKLST 2040

Query: 2041 DGSTFPEDSTKRVKVEPGLQSLCVMSPGGASSMPNIETPGSTTQPDEEFKPNAAMEEMII 2100
             GS+  ED +KRVK+EPGL SLCVMSPGGASS+PN+ETPGS TQPDEEFKPNAAMEE+II
Sbjct: 2041 AGSSISEDPSKRVKIEPGLPSLCVMSPGGASSIPNVETPGSATQPDEEFKPNAAMEELII 2100

Query: 2101 NFLIRVALVIEPKDKEATAMYKQALELLSQALEVWPNANVKFNYLEKLLSSIQPSQSKDP 2160
            NFLIRVA+VIEPKD+EA  MYKQAL+ LSQALEVWPNANVKFNYLEKLLSS+ PSQS DP
Sbjct: 2101 NFLIRVAVVIEPKDREANTMYKQALDFLSQALEVWPNANVKFNYLEKLLSSMPPSQS-DP 2160

Query: 2161 STALAQGLDVMNKVLEKQPHLFVRNNINQISQILEPCFKHKMLDAGKSLCSLLRMVFVAY 2220
            STALAQGLDVMNKVLEKQPHLF++NNI+QISQ LE  FKHKMLDAGKSLCSLL+MVF+A+
Sbjct: 2161 STALAQGLDVMNKVLEKQPHLFIKNNISQISQFLELSFKHKMLDAGKSLCSLLKMVFIAF 2220

Query: 2221 PLEGVTTPPDVKLLYQKVDELIKNHINNLTAPQTSSEDNTASSISFVLLVIKTLTEVQKN 2280
            P +G +TPP++KLLYQKV+ELI+ H++ +TA Q S +DN+  S+SFVL+V+KTL EVQK+
Sbjct: 2221 PQDGASTPPEIKLLYQKVNELIQKHVHVVTASQASGDDNSLGSVSFVLVVLKTLAEVQKH 2280

Query: 2281 LIDPYNLGRILQRLARDMGSSAGSHLRQGQRMDPDSAVTSSRQSADVGTVISNLKSVLKL 2340
             +DPY L  ILQRL+RD+G +AG+H RQ QR++         +SADVG V+SN+K VL+L
Sbjct: 2281 FLDPYVLVHILQRLSRDLGLAAGAHPRQSQRIE--------SESADVGAVVSNIKLVLEL 2340

Query: 2341 INERVMLVPECKRSVTQIMNSLLSEKGTDASVLLCILDVIKGWVEDDFSKMGTSVSSSSF 2400
            I+ERVML+ +CKR VTQI+N+LLSEKGTD+S+LLC+LD++K W EDDF K G+S SS +F
Sbjct: 2341 IDERVMLLADCKRPVTQILNTLLSEKGTDSSLLLCVLDMLKRWAEDDFGKKGSSGSSGAF 2400

Query: 2401 LAPKEIVSFLQKLSQVDKQNFSSSAAEEWDGKYLQLLYEICADSNKYPLSLRQEVFQKVE 2460
            L  K+IVSFLQKLSQVDKQ+FSS A +EWD  YLQLLY +CADS KYPL+LRQE+  KVE
Sbjct: 2401 LTQKDIVSFLQKLSQVDKQHFSSVALDEWDKVYLQLLYGLCADSTKYPLALRQEISLKVE 2460

Query: 2461 RQFMLGLRARDPETRKKFFTLYHESLGKTLFIRLQYIIQIQDWEALSDVFWLKQGLDLLL 2520
            R  MLGLRARDP+ R+KFF LYHESLG  LF RLQYIIQ QDWEA+SDVFWLKQGLDLLL
Sbjct: 2461 RHSMLGLRARDPDMRRKFFLLYHESLGNNLFARLQYIIQNQDWEAMSDVFWLKQGLDLLL 2520

Query: 2521 AVLVEDKPITLAPNSARLPPLLVSGHVADSSAVQPQVNDAQEGLEDAPLTFDSLVHKHAQ 2580
            A+L+E+KPITLAPNSAR+ PLL S     +  V  Q     EG E+    FDS+V KH+Q
Sbjct: 2521 AILIEEKPITLAPNSARVVPLLPS----QNPGVHHQPPVMPEGPEEVASMFDSIVMKHSQ 2580

Query: 2581 FLNRTSKLQVADLIIPLRELAHTDANVAYHLWVLVFPIVWVTLHKEEQVALAKPMISLLS 2640
            FL+  SKLQVAD++IPLRELAHTDANVAYHLWVLVFPIVW TLHKEEQ+ALAKPMISLLS
Sbjct: 2581 FLSAASKLQVADVVIPLRELAHTDANVAYHLWVLVFPIVWATLHKEEQIALAKPMISLLS 2640

Query: 2641 KDYHKKQQASRPNVVQALLEGLQLSHPQPRMPSELIKYIGRTYNAWHIALALLESHVMLF 2700
            KDYHKKQQ  RPNVVQALLEGLQLSHPQPRMPSELIKYIG+TYNAWH+AL LLE+HVMLF
Sbjct: 2641 KDYHKKQQGHRPNVVQALLEGLQLSHPQPRMPSELIKYIGKTYNAWHLALTLLETHVMLF 2700

Query: 2701 MNETKCSESLAELYRLLNEEDMRCGLWKRKAISAETKAGLSLVQHGYWQRAQILFYQSMV 2760
             N++KC+ESLAELYRLLNEED R GLWK ++I+ E++AG S+VQHG+WQRAQ LFYQ+MV
Sbjct: 2701 TNDSKCAESLAELYRLLNEEDRRFGLWKSRSITTESRAGFSMVQHGFWQRAQSLFYQAMV 2760

Query: 2761 KATQGTYNNNVPKAEMCLWEEQWLYCASQLSQWEALVDFGKSIENYEILLDSLWKVPDWA 2820
            KATQGTYNN VPK EMCLWEEQWL+CA+QL QW+ALVDFGKS ENYEILLDSLWK PDW 
Sbjct: 2761 KATQGTYNNTVPKTEMCLWEEQWLHCATQLGQWDALVDFGKSTENYEILLDSLWKAPDWT 2820

Query: 2821 YMKEHVIPKAQVEETPKLRLIQAYFSLHDRSTNGVADAENIVGKGVDLALEQWWQLPEMS 2880
            Y+K+HVIPKAQVEETPKLRL+QA FSLH+++ NGV DAENIVGKGVDLALEQWWQLPEMS
Sbjct: 2821 YLKDHVIPKAQVEETPKLRLVQACFSLHEKNANGVGDAENIVGKGVDLALEQWWQLPEMS 2880

Query: 2881 VHARIPLLQQFQQLVEVQESSRILVDIANGNKHSGSSVVSVHTNLYADLKDILETWRLRI 2940
            +HAR+PLLQQFQQLVEVQESSRI VDIANG+K  G++ V    NLYADLKDILETWRLR 
Sbjct: 2881 LHARVPLLQQFQQLVEVQESSRIYVDIANGSKVPGNAAVGGQGNLYADLKDILETWRLRT 2940

Query: 2941 PNEWDSMTVWCDLLQWRNEMYNAVIDAFKDFGTTNSQLHHLGFRDKAWNVNKLAHVARKQ 3000
            PNEWD+MTVW D+LQWRNEMYN VIDAFKDF T+N+ LHHLG+RDKAWNVNKLA +ARKQ
Sbjct: 2941 PNEWDNMTVWYDMLQWRNEMYNVVIDAFKDFVTSNTPLHHLGYRDKAWNVNKLARIARKQ 3000

Query: 3001 GLHDVCVGILEKMYGHSTMEVQEAFVKIREQAKAYLEMKGELTSGLNLINSTNLDYFPVK 3060
            GL+DVCV ILEKMYGHS MEVQEAFVKI+EQAKA+LE KGEL +GLNL+NSTNL++F  K
Sbjct: 3001 GLYDVCVQILEKMYGHSQMEVQEAFVKIKEQAKAHLETKGELATGLNLVNSTNLEFFLAK 3060

Query: 3061 HKAEIFRLKGDFQLKLSDSEGANHSYSSAISLFKNLPKGWISWGNYCDMAYKESHEEIWL 3120
            +KAEIFRLKGDF LKL+D+EGAN +YS+AI+LFKNLPKGWISWGNYCDMAY+++ +EIWL
Sbjct: 3061 NKAEIFRLKGDFHLKLNDTEGANLAYSNAITLFKNLPKGWISWGNYCDMAYQDTQDEIWL 3120

Query: 3121 EYAVSCFLQGIKFGISNSRNHLARVLYLLSFDTPNEPVGRAFDKYLDQIPHWVWLSWIPQ 3180
            EYAVSCFLQGI+FG+SNSR+H+ARVLYLLSFD  NEPVGR FDK+LDQ+PHWVWLSWIPQ
Sbjct: 3121 EYAVSCFLQGIRFGVSNSRSHMARVLYLLSFDPTNEPVGRIFDKHLDQVPHWVWLSWIPQ 3180

Query: 3181 LLLSLQRTEAPHCKLVLLKIANVYPQALYYWLRTYLLERRDVANKSELGRMAMAQQRMQQ 3240
            LL+SLQRTEAPHCKLVL+KIA V+PQALYYWLRTYLLERRD  NKSEL R+ +A QRMQQ
Sbjct: 3181 LLISLQRTEAPHCKLVLMKIAAVFPQALYYWLRTYLLERRDAVNKSELSRVVLA-QRMQQ 3240

Query: 3241 NTSSAGSLGLTDGSSRVAHGGSSTSTDNQVHQGTQSGSGIGSHDGGNSHSQEPERSTGVE 3300
            N     +           HGG +  ++ Q+HQG+Q+   +G+HDGGN H QE ER+T + 
Sbjct: 3241 NVPGVSA----------GHGGGNLPSETQIHQGSQTSGAVGTHDGGNLHVQESERATMI- 3300

Query: 3301 SSTHAGNDQSLPQTSSNVNEGTQNALRRSAALGLVGSAASAFDAAKDIMEALRSKHTNLA 3360
            ++ H+GNDQ + Q+SS                 +  SAA AFDAAKD+MEALRSKH NLA
Sbjct: 3301 NNVHSGNDQPMNQSSS-----------------MAISAAGAFDAAKDVMEALRSKHNNLA 3360

Query: 3361 SELEILLTEIGSRFVTLPEERLLAVVNALLHRCYKYPTATTAEVPQSLKKELSGVCKACF 3420
            SELE+LLTEIGSRFVTLPEERLLAVVNALLHRCYKYPTATTAEVPQ LKKELSGVC+ACF
Sbjct: 3361 SELEVLLTEIGSRFVTLPEERLLAVVNALLHRCYKYPTATTAEVPQPLKKELSGVCRACF 3420

Query: 3421 SADAVNKHVDFVREYKQDFERDLDPESTSTFPATLSELTERLKHWKNVLQGNVEDRFPAV 3480
            SADAV KHV FVREYKQDFERDLDPES S FP TL++LT++LK WKN+LQ NVEDRFP +
Sbjct: 3421 SADAVTKHVAFVREYKQDFERDLDPESNS-FPVTLADLTKKLKDWKNILQSNVEDRFPVL 3480

Query: 3481 LRLEDESRVLRDFHVVDVEVPGQYFTDQEIAPDHTVKLDRVGADIPIVRRHGSSFRRLTL 3540
            LRLEDES+VLRDF+VVDVE+PGQYF DQE+APDHTVKLDRVGADI IVRRHGSS RRLTL
Sbjct: 3481 LRLEDESKVLRDFNVVDVEIPGQYFADQEVAPDHTVKLDRVGADIQIVRRHGSSCRRLTL 3540

Query: 3541 IGSDGSQRHFIVQTSLTPNARSDERILQLFRVMNQMFDKHKESRRRHLCIHTPIIIPVWS 3600
            IGSDGSQ+HFIVQTSLTPNARSDERILQLFRVMNQMFDKHKESRRRHL +HTPIIIPVWS
Sbjct: 3541 IGSDGSQKHFIVQTSLTPNARSDERILQLFRVMNQMFDKHKESRRRHLGLHTPIIIPVWS 3600

Query: 3601 QVRMVEDDLMYSTFLEVYENHCARNDQEADLPITYFKEQLNQAISGQIAPEAVLDLRLQA 3660
            QVRMVEDDLMY+TFLEVYENHC RN +E+DLPITYFKE+LNQAI+GQI+PEA+ DLRLQA
Sbjct: 3601 QVRMVEDDLMYNTFLEVYENHCGRNGRESDLPITYFKEKLNQAITGQISPEAIGDLRLQA 3660

Query: 3661 YGDITRNLVNEGIFSQYMYKTLLSGNHMWAFKKQFAIQLALSSFMSYMLQIGGRSPNKIY 3720
            YG+IT+N+VN+ IFSQYMYKT +SG+H+WAFKKQFA+QLA+S+FMS++LQIGGRSPNKI 
Sbjct: 3661 YGEITKNIVNDTIFSQYMYKTSMSGSHLWAFKKQFAVQLAVSNFMSFILQIGGRSPNKIL 3720

Query: 3721 FAKNTGKIFQTDFHPAYDTNGMIEFNEPVPFRLTRNMQAFFSHFGVEGLIVSAMCSAAQA 3780
            FAKN+GK+FQTDFHP+YD+NGMIE NEPVPFRLTRNM AF SHFGVEG ++S MCSA+QA
Sbjct: 3721 FAKNSGKMFQTDFHPSYDSNGMIELNEPVPFRLTRNMHAFLSHFGVEGPLMSNMCSASQA 3780

Query: 3781 VVSPKQNQHLWHQLAMFFRDELLSWSWRRPLGMPLASIAG-GGMSPADFKQKVTINVDHV 3840
            V S KQN+HL +QLAMFFRDELLSW  RRPLG+P+  +AG   +S  + K KV  NVD V
Sbjct: 3781 VFSSKQNEHLRYQLAMFFRDELLSWFGRRPLGVPIPPVAGIATLSSPELKHKVNSNVDDV 3809

Query: 3841 IGRINGIAPQYFSEEEENAMDPPQSVQRGVSDLVDAALMPRHLCMMDPTWHPWF 3889
            IGRI GIAPQYFSEE+EN+++PPQSVQRGVS+LV+AAL PR+LCMMDPTWHPWF
Sbjct: 3841 IGRIRGIAPQYFSEEDENSVEPPQSVQRGVSELVEAALSPRNLCMMDPTWHPWF 3809

BLAST of MS010599 vs. TAIR 10
Match: AT4G36080.2 (phosphotransferases, alcohol group as acceptor;binding;inositol or phosphatidylinositol kinases )

HSP 1 Score: 6002.6 bits (15571), Expect = 0.0e+00
Identity = 3027/3894 (77.73%), Postives = 3414/3894 (87.67%), Query Frame = 0

Query: 1    MSPIQNFELHSRQLVEPELSIQTRLQMATEVRDSLEIAHTPEYLNFLKCYFRAFSIILVQ 60
            MSPIQNFE HSR+LVEP+L I+ RL M  EVRDSLEI HT EYLNFLKCYFRA S+IL+Q
Sbjct: 1    MSPIQNFEQHSRRLVEPDLPIEERLAMVVEVRDSLEITHTAEYLNFLKCYFRASSVILLQ 60

Query: 61   ITKPQYTDNHEHKLRNIVVEILNRLPHSEVLRPFVQDLLKVAMQVLTTDNEENGLICIRI 120
            ITKPQ+TDN EHKLRNIVVEILNRLPHSEVLRPFVQDLLKVAMQVLTTDNEENGLICIRI
Sbjct: 61   ITKPQFTDNIEHKLRNIVVEILNRLPHSEVLRPFVQDLLKVAMQVLTTDNEENGLICIRI 120

Query: 121  IFDLLRNFRPTLENEVQPFLDFVCKIYQNFKLTVSHFFENSAAGGEDIKPMDVSTSTDQT 180
            IFDLLRNFRPTLENEVQPFLDFVCKIYQNF+LTVSHFFEN     E++KP+++ T +DQ+
Sbjct: 121  IFDLLRNFRPTLENEVQPFLDFVCKIYQNFRLTVSHFFENVKM--EEVKPVEIPTPSDQS 180

Query: 181  IT-TGYTGTVQLNPSTRSFKIVTESPLVVMFLFQLYSRLVQTNIPVLLPLMVSAISVPGP 240
            ++ T  +   Q+NPSTRSFKIVTESPLVVMFLFQLYSRLVQ NIP LLPLMV+AIS+PGP
Sbjct: 181  LSITAPSRNGQINPSTRSFKIVTESPLVVMFLFQLYSRLVQINIPNLLPLMVAAISIPGP 240

Query: 241  EKVPPFLKTHFIELKGAQVKTVSFLTYLLRSSADYIRPHEESICKSIVNLLVTCSDSVSI 300
            EKV   +K  FIELKGAQVKTVSFLTYLL+S A+YI+PHEESICKSIVNLLVTCSDS SI
Sbjct: 241  EKVSSHMKPQFIELKGAQVKTVSFLTYLLKSCAEYIKPHEESICKSIVNLLVTCSDSASI 300

Query: 301  RKELLVALKHVLGTEYKRGLFPLIDTLLEEKVLVGTGRACYETLRPLAYSLLAEIVHHVR 360
            RKELLV+LKHVLGT++KRGLFPLIDTLLEE+VLVGTGRAC+E+LRPLAYSLLAEIVHHVR
Sbjct: 301  RKELLVSLKHVLGTDFKRGLFPLIDTLLEERVLVGTGRACFESLRPLAYSLLAEIVHHVR 360

Query: 361  GDLTLSQLSRIIYLFSSNMHDASLSLSIHTTCARLMLNLVEPIFEKGVDQTSMDEARILL 420
             DL+LSQLSRIIYLFS NMHD++LSL+IHTTCARLMLNLVEPIFEKG+DQ SMDEARILL
Sbjct: 361  ADLSLSQLSRIIYLFSRNMHDSTLSLNIHTTCARLMLNLVEPIFEKGIDQQSMDEARILL 420

Query: 421  GRILDAFVGKFSTFKHTIPQLLEEGEEGKDRANMRSKLELPVQAVLNLQVPVEHSKEVND 480
            GRILDAFVGKF+TFK T+PQLLEEG +GKD+  +RSKLELPVQAVLNLQVP EHSKEVND
Sbjct: 421  GRILDAFVGKFNTFKRTVPQLLEEG-DGKDQITLRSKLELPVQAVLNLQVPAEHSKEVND 480

Query: 481  CKHLIKTLILGMKTIIWSITHAHLPRPQASPSPNGTHPQMLVSPSSNLATPQAFKGMRED 540
            CK+LIKTL++GMKTIIWSITHAHLPRPQ      G HPQ L S SS     Q FKGMRED
Sbjct: 481  CKNLIKTLVMGMKTIIWSITHAHLPRPQ------GMHPQALASQSS---VTQVFKGMRED 540

Query: 541  EVCKASGVLKSGVHCLTLFKEKDEEVEMLHLFSQILTVMEPRDLMDMFSLCMPELFDCMI 600
            EV KASGVLKSGVHCL LFK+KDEE EML+LFSQIL VMEPRDLMDMFS+CMPELF+C+I
Sbjct: 541  EVWKASGVLKSGVHCLALFKDKDEEKEMLNLFSQILAVMEPRDLMDMFSICMPELFECII 600

Query: 601  TNTQLVHLFSTFLQTPKVYRPFADVLVNFLVSSKLDVLKHPDSPGAKLVLHLFRFVFGAV 660
             NTQLV +F+T LQ PKVY+PFADVL+NFLVSSKLDVLK+PDS   KL+LHLFR +FGAV
Sbjct: 601  DNTQLVQIFATLLQAPKVYKPFADVLINFLVSSKLDVLKNPDSAATKLILHLFRCLFGAV 660

Query: 661  AKAPSDFERILQPHVTVIMEVCVRSATEVERPLGYMQLLRIMFRALAGCKFELLLRDLIP 720
            +KAPSDFERILQP V +IMEVC+++ATEVE+PLGYMQLLR +FR LAGCKFELLLRDL+P
Sbjct: 661  SKAPSDFERILQPQVPLIMEVCMKNATEVEKPLGYMQLLRTVFRGLAGCKFELLLRDLVP 720

Query: 721  LLQPCLNMLLTMFDGPTGEDMRDLLLELCLTLPARLSSLLPHLPRLMKPLVLCLKGSDDL 780
            +L PCLN+LLTM +GP GEDMRDLLLEL LTLPARLSSLLP+LPRLM+PLV CL+GSD+L
Sbjct: 721  MLLPCLNILLTMLEGPAGEDMRDLLLELSLTLPARLSSLLPYLPRLMRPLVSCLRGSDEL 780

Query: 781  VSLGLRTLEFWVDSLNPDFLEPSMANVMSEVILALWSHLRPIPYPWGAKALQVLGKLGGR 840
            VSLGLRTLEFWVDSLNPDFLEPSMA VMSEVILALWSHL+P+PYPWG KALQ++GKLGGR
Sbjct: 781  VSLGLRTLEFWVDSLNPDFLEPSMATVMSEVILALWSHLKPVPYPWGGKALQIVGKLGGR 840

Query: 841  NRRFLKEPLALECKENPEHGLRLILTFEPSTPFLVPLDRCINLAVSTVMNKTGGVDSFYR 900
            NRRFLKEPL LECK+NPEHGLRL+LTFEPSTPFLVP+D+ INLAV+ VM K    + +Y+
Sbjct: 841  NRRFLKEPLTLECKDNPEHGLRLVLTFEPSTPFLVPMDKFINLAVAAVMQKNLTTEIYYK 900

Query: 901  KQALKFLRVCLSSQLNLPGNVADDGHTPRQLSTLLVSPVDSSLRRSETPEGKADLGVKTK 960
            KQALKFLRVCL SQLNLPG V D+G T +QLSTLL+S VDS  RRSE+ E +ADLGVKTK
Sbjct: 901  KQALKFLRVCLLSQLNLPGCVTDEGQTTKQLSTLLLSSVDSFWRRSESTEIEADLGVKTK 960

Query: 961  TQLMAEKSVFKILLMTIIAAGSEEDLHEPKDDFVLNVCRHFAILFHIDSSLNSSPVASAS 1020
            TQL+AEKS+FK LL+TIIAA S+ DL +  DDFV+N+CRHFAI+ H D + + +  ++  
Sbjct: 961  TQLIAEKSIFKTLLITIIAASSDPDLSDSDDDFVVNICRHFAIILHGDYTSSYTSTSAGP 1020

Query: 1021 LGSTLLPPNVSANSRLRSSACCNLKELDPLIFLDALVEVLADENRVHAKAALNALNLFSE 1080
            LG +L    +S +S+ +++    LK+LDPLIFLDALV+VLADENR+HAKAAL +LN+F+E
Sbjct: 1021 LGGSL----ISTSSKPKNNWSTYLKQLDPLIFLDALVDVLADENRLHAKAALTSLNVFAE 1080

Query: 1081 ILLFLARAKQTDVMMTRGP-STPMIVSSPSKSPVYSPPPSVRIPVFEQLLPRLLHCCYGS 1140
             LLFLAR K  DV+M RG  S  MIVSSPS +PVYSP PSVRIPVFEQLLPRLLHCCYGS
Sbjct: 1081 TLLFLARIKHADVLMARGAHSASMIVSSPSTNPVYSPHPSVRIPVFEQLLPRLLHCCYGS 1140

Query: 1141 TWQAQMGGIMGLGALVGKVTVETLCLFQVRIVRGLVYVLKRLPIYASKEQEETSQVLNQV 1200
            TWQAQMGG+MGLGALVGKV VETLCLFQV+IVRGLVYV KRLP+YASKEQ+ETSQVL Q+
Sbjct: 1141 TWQAQMGGVMGLGALVGKVNVETLCLFQVKIVRGLVYVQKRLPVYASKEQDETSQVLIQI 1200

Query: 1201 LRVVNNVDEANSEPRRQSFHGVVDILASELFNPNSSTIVRKNVQSCLALLASRTGSEVSE 1260
            LRVVNNVDEAN++ RRQSF  VV+ LA+ELFN N+S  VRKNVQ+CLALLASRTGSEVSE
Sbjct: 1201 LRVVNNVDEANNDARRQSFQDVVEYLATELFNSNASITVRKNVQNCLALLASRTGSEVSE 1260

Query: 1261 LLEPLHQPLLQPLLLRPLRLKTIDQQVGTVTALNFCLALRPPLLKLTQELVNFLQEALQI 1320
            LLEPL+QPLLQPL++RPLR KTIDQQVGTVTALNFCLALRPPLLK+T ELVNFLQEALQI
Sbjct: 1261 LLEPLYQPLLQPLIMRPLRSKTIDQQVGTVTALNFCLALRPPLLKVTPELVNFLQEALQI 1320

Query: 1321 AEADETVWVVKFMNPKIATSLNKLRTACIELLCTTMAWADFKTPNHSELRAKIISMFFKS 1380
            AEADE +W VK M+PK+ TSLN+LRTACIE+LCTTMAWADF+T +H+ELRAKIISMFFKS
Sbjct: 1321 AEADEALWAVKLMSPKVLTSLNRLRTACIEILCTTMAWADFRTQSHNELRAKIISMFFKS 1380

Query: 1381 LTCRTPEVVAVAKEGLRQVINQQRMPKDLLQGSLRPILVNLAHTKNLSMPLLQGLARLLE 1440
            LTCR PE+V VAKEGLRQVINQQRMPK+LLQ SLRPILVNLA TKNL+MPLLQGLARLLE
Sbjct: 1381 LTCRAPEIVTVAKEGLRQVINQQRMPKELLQSSLRPILVNLAQTKNLNMPLLQGLARLLE 1440

Query: 1441 LLASWFNVTLGGKLLEHLKKWLEPEKLAQSQKAWKAGEEPKIAAAIIELFHLLPMAASKF 1500
            LL++WFNVTLG KLLEHLKKWLEPEKLAQSQK+WKAGEEPKIAAAIIELFHLLP+AASKF
Sbjct: 1441 LLSNWFNVTLGCKLLEHLKKWLEPEKLAQSQKSWKAGEEPKIAAAIIELFHLLPLAASKF 1500

Query: 1501 LDELVTLTIDLEGALPPGQVYSEVNSPYRVPLIKFLNRYAPLAVDYFLARLSEPKYFRRF 1560
            LDELVTLTIDLE ALPPGQV                              LSEPKYFRRF
Sbjct: 1501 LDELVTLTIDLEAALPPGQV------------------------------LSEPKYFRRF 1560

Query: 1561 MYIIRSDAGQPLREELAKSPQKILASAFPEFAPKSEAALTPGSSTSPAPLSGDEGLVTP- 1620
            MYIIRSDAGQPLREELAKSP KIL+ AFPE  PKS+A L+  +ST PA  SGDE   TP 
Sbjct: 1561 MYIIRSDAGQPLREELAKSPHKILSYAFPEILPKSDAILSAAASTPPAASSGDE-KPTPM 1620

Query: 1621 --DASDPPSAPSSVVSDAYFRGLALIKTLVKLMPGWLQNNRVVFDTLVLVWKSPARIARL 1680
              ++S+ PS  S+V SDAYF+GL LIKT+VKL+P WLQ+NR +FD L  +WKS AR +RL
Sbjct: 1621 KSESSNTPSTKSNVASDAYFQGLYLIKTMVKLIPSWLQSNRTIFDALAHLWKSHARTSRL 1680

Query: 1681 HNEQELNLVQVKESKWLVKCFLNYLRHEKAEVNVLFDILSIFLFHTRIDYTFLKEFYIIE 1740
             NEQ L LVQVKESKWLVKCFLNYLRHEK+E+NVLFD+L IFLFH+RIDYTFL+EFYIIE
Sbjct: 1681 QNEQNLTLVQVKESKWLVKCFLNYLRHEKSEMNVLFDVLLIFLFHSRIDYTFLREFYIIE 1740

Query: 1741 VAEGYPPNMKKALLLHFLNLFQSKQLGHDHLVIVMQMLILPMLAHAFQNGQSWEVVDQAI 1800
            VAE YPPNMKKA++LHFLNLFQSKQLGHDHLV  MQMLILPMLAHAFQNGQ+WEV+D  I
Sbjct: 1741 VAEEYPPNMKKAIVLHFLNLFQSKQLGHDHLVQAMQMLILPMLAHAFQNGQTWEVIDPDI 1800

Query: 1801 IKTIVDKLLDPPEEVSAEYDEPLRIELLQLATLLLKYLQSDLVHHRKELIKFGWNHLKRE 1860
            +KTIV++LLDPPEEVSAEYDEPLRIELLQLATLLLKYLQSDLV HRKELIKFGWNHLKRE
Sbjct: 1801 VKTIVERLLDPPEEVSAEYDEPLRIELLQLATLLLKYLQSDLVQHRKELIKFGWNHLKRE 1860

Query: 1861 DSASKQWAFVNVCHFLEAYQAPEKIILQVFVALLRTCQPENKMLVKQALDILMPALPRRL 1920
            DSASKQWAFVNVCHFL+AYQAPEKIILQVFVALLRTCQPENKMLVKQALDILMPALP+RL
Sbjct: 1861 DSASKQWAFVNVCHFLDAYQAPEKIILQVFVALLRTCQPENKMLVKQALDILMPALPKRL 1920

Query: 1921 PLGDSRMPIWIRYTKKILVEEGHSIPNLIHIFQLIVRHSDLFYSCRAQFVPQMVNSLSRL 1980
            PLGDSRMPIWIRYTKKILVEEGHSIPNLIHIFQLIVRHSDLFYSCRAQFVPQMVNSLSRL
Sbjct: 1921 PLGDSRMPIWIRYTKKILVEEGHSIPNLIHIFQLIVRHSDLFYSCRAQFVPQMVNSLSRL 1980

Query: 1981 GLPYNTTAENRRLAIDLAGLVVGWERQRQNEMKLVTESDAPSHSNDGLTCPPGADPKRMV 2040
            GLPYNTTAENRRLAI+LAGLVV WERQRQNE K+VT+ DA S  +DGL    G DPK   
Sbjct: 1981 GLPYNTTAENRRLAIELAGLVVSWERQRQNESKMVTDGDATSEVSDGLHPSSGVDPKLST 2040

Query: 2041 DGSTFPEDSTKRVKVEPGLQSLCVMSPGGASSMPNIETPGSTTQPDEEFKPNAAMEEMII 2100
             GS+  ED +KRVK+EPGL SLCVMSPGGASS+PN+ETPGS TQPDEEFKPNAAMEE+II
Sbjct: 2041 AGSSISEDPSKRVKIEPGLPSLCVMSPGGASSIPNVETPGSATQPDEEFKPNAAMEELII 2100

Query: 2101 NFLIRVALVIEPKDKEATAMYKQALELLSQALEVWPNANVKFNYLEKLLSSIQPSQSKDP 2160
            NFLIRVA+VIEPKD+EA  MYKQAL+ LSQALEVWPNANVKFNYLEKLLSS+ PSQS DP
Sbjct: 2101 NFLIRVAVVIEPKDREANTMYKQALDFLSQALEVWPNANVKFNYLEKLLSSMPPSQS-DP 2160

Query: 2161 STALAQGLDVMNKVLEKQPHLFVRNNINQISQILEPCFKHKMLDAGKSLCSLLRMVFVAY 2220
            STALAQGLDVMNKVLEKQPHLF++NNI+QISQ LE  FKHKMLDAGKSLCSLL+MVF+A+
Sbjct: 2161 STALAQGLDVMNKVLEKQPHLFIKNNISQISQFLELSFKHKMLDAGKSLCSLLKMVFIAF 2220

Query: 2221 PLEGVTTPPDVKLLYQKVDELIKNHINNLTAPQTSSEDNTASSISFVLLVIKTLTEVQKN 2280
            P +G +TPP++KLLYQKV+ELI+ H++ +TA Q S +DN+  S+SFVL+V+KTL EVQK+
Sbjct: 2221 PQDGASTPPEIKLLYQKVNELIQKHVHVVTASQASGDDNSLGSVSFVLVVLKTLAEVQKH 2280

Query: 2281 LIDPYNLGRILQRLARDMGSSAGSHLRQGQRMDPDSAVTSSRQSADVGTVISNLKSVLKL 2340
             +DPY L  ILQRL+RD+G +AG+H RQ QR++         +SADVG V+SN+K VL+L
Sbjct: 2281 FLDPYVLVHILQRLSRDLGLAAGAHPRQSQRIE--------SESADVGAVVSNIKLVLEL 2340

Query: 2341 INERVMLVPECKRSVTQIMNSLLSEKGTDASVLLCILDVIKGWVEDDFSKMGTSVSSSSF 2400
            I+ERVML+ +CKR VTQI+N+LLSEKGTD+S+LLC+LD++K W EDDF K G+S SS +F
Sbjct: 2341 IDERVMLLADCKRPVTQILNTLLSEKGTDSSLLLCVLDMLKRWAEDDFGKKGSSGSSGAF 2400

Query: 2401 LAPKEIVSFLQKLSQVDKQNFSSSAAEEWDGKYLQLLYEICADSNKYPLSLRQEVFQKVE 2460
            L  K+IVSFLQKLSQVDKQ+FSS A +EWD  YLQLLY +CADS KYPL+LRQE+  KVE
Sbjct: 2401 LTQKDIVSFLQKLSQVDKQHFSSVALDEWDKVYLQLLYGLCADSTKYPLALRQEISLKVE 2460

Query: 2461 RQFMLGLRARDPETRKKFFTLYHESLGKTLFIRLQYIIQIQDWEALSDVFWLKQGLDLLL 2520
            R  MLGLRARDP+ R+KFF LYHESLG  LF RLQYIIQ QDWEA+SDVFWLKQGLDLLL
Sbjct: 2461 RHSMLGLRARDPDMRRKFFLLYHESLGNNLFARLQYIIQNQDWEAMSDVFWLKQGLDLLL 2520

Query: 2521 AVLVEDKPITLAPNSARLPPLLVSGHVADSSAVQPQVNDAQEGLEDAPLTFDSLVHKHAQ 2580
            A+L+E+KPITLAPNSAR+ PLL S     +  V  Q     EG E+    FDS+V KH+Q
Sbjct: 2521 AILIEEKPITLAPNSARVVPLLPS----QNPGVHHQPPVMPEGPEEVASMFDSIVMKHSQ 2580

Query: 2581 FLNRTSKLQVADLIIPLRELAHTDANVAYHLWVLVFPIVWVTLHKEEQVALAKPMISLLS 2640
            FL+  SKLQVAD++IPLRELAHTDANVAYHLWVLVFPIVW TLHKEEQ+ALAKPMISLLS
Sbjct: 2581 FLSAASKLQVADVVIPLRELAHTDANVAYHLWVLVFPIVWATLHKEEQIALAKPMISLLS 2640

Query: 2641 KDYHKKQQASRPNVVQALLEGLQLSHPQPRMPSELIKYIGRTYNAWHIALALLESHVMLF 2700
            KDYHKKQQ  RPNVVQALLEGLQLSHPQPRMPSELIKYIG+TYNAWH+AL LLE+HVMLF
Sbjct: 2641 KDYHKKQQGHRPNVVQALLEGLQLSHPQPRMPSELIKYIGKTYNAWHLALTLLETHVMLF 2700

Query: 2701 MNETKCSESLAELYRLLNEEDMRCGLWKRKAISAETKAGLSLVQHGYWQRAQILFYQSMV 2760
             N++KC+ESLAELYRLLNEED R GLWK ++I+ E++AG S+VQHG+WQRAQ LFYQ+MV
Sbjct: 2701 TNDSKCAESLAELYRLLNEEDRRFGLWKSRSITTESRAGFSMVQHGFWQRAQSLFYQAMV 2760

Query: 2761 KATQGTYNNNVPKAEMCLWEEQWLYCASQLSQWEALVDFGKSIENYEILLDSLWKVPDWA 2820
            KATQGTYNN VPK EMCLWEEQWL+CA+QL QW+ALVDFGKS ENYEILLDSLWK PDW 
Sbjct: 2761 KATQGTYNNTVPKTEMCLWEEQWLHCATQLGQWDALVDFGKSTENYEILLDSLWKAPDWT 2820

Query: 2821 YMKEHVIPKAQVEETPKLRLIQAYFSLHDRSTNGVADAENIVGKGVDLALEQWWQLPEMS 2880
            Y+K+HVIPKAQVEETPKLRL+QA FSLH+++ NGV DAENIVGKGVDLALEQWWQLPEMS
Sbjct: 2821 YLKDHVIPKAQVEETPKLRLVQACFSLHEKNANGVGDAENIVGKGVDLALEQWWQLPEMS 2880

Query: 2881 VHARIPLLQQFQQLVEVQESSRILVDIANGNKHSGSSVVSVHTNLYADLKDILETWRLRI 2940
            +HAR+PLLQQFQQLVEVQESSRI VDIANG+K  G++ V    NLYADLKDILETWRLR 
Sbjct: 2881 LHARVPLLQQFQQLVEVQESSRIYVDIANGSKVPGNAAVGGQGNLYADLKDILETWRLRT 2940

Query: 2941 PNEWDSMTVWCDLLQWRNEMYNAVIDAFKDFGTTNSQLHHLGFRDKAWNVNKLAHVARKQ 3000
            PNEWD+MTVW D+LQWRNEMYN VIDAFKDF T+N+ LHHLG+RDKAWNVNKLA +ARKQ
Sbjct: 2941 PNEWDNMTVWYDMLQWRNEMYNVVIDAFKDFVTSNTPLHHLGYRDKAWNVNKLARIARKQ 3000

Query: 3001 GLHDVCVGILEKMYGHSTMEVQEAFVKIREQAKAYLEMKGELTSGLNLINSTNLDYFPVK 3060
            GL+DVCV ILEKMYGHS MEVQEAFVKI+EQAKA+LE KGEL +GLNL+NSTNL++F  K
Sbjct: 3001 GLYDVCVQILEKMYGHSQMEVQEAFVKIKEQAKAHLETKGELATGLNLVNSTNLEFFLAK 3060

Query: 3061 HKAEIFRLKGDFQLKLSDSEGANHSYSSAISLFKNLPKGWISWGNYCDMAYKESHEEIWL 3120
            +KAEIFRLKGDF LKL+D+EGAN +YS+AI+LFKNLPKGWISWGNYCDMAY+++ +EIWL
Sbjct: 3061 NKAEIFRLKGDFHLKLNDTEGANLAYSNAITLFKNLPKGWISWGNYCDMAYQDTQDEIWL 3120

Query: 3121 EYAVSCFLQGIKFGISNSRNHLARVLYLLSFDTPNEPVGRAFDKYLDQIPHWVWLSWIPQ 3180
            EYAVSCFLQGI+FG+SNSR+H+ARVLYLLSFD  NEPVGR FDK+LDQ+PHWVWLSWIPQ
Sbjct: 3121 EYAVSCFLQGIRFGVSNSRSHMARVLYLLSFDPTNEPVGRIFDKHLDQVPHWVWLSWIPQ 3180

Query: 3181 LLLSLQRTEAPHCKLVLLKIANVYPQALYYWLRTYLLERRDVANKSELGRMAMAQQRMQQ 3240
            LL+SLQRTEAPHCKLVL+KIA V+PQALYYWLRTYLLERRD  NKSEL R+ +A QRMQQ
Sbjct: 3181 LLISLQRTEAPHCKLVLMKIAAVFPQALYYWLRTYLLERRDAVNKSELSRVVLA-QRMQQ 3240

Query: 3241 NTSSAGSLGLTDGSSRVAHGGSSTSTDNQVHQGTQSGSGIGSHDGGNSHSQEPERSTGVE 3300
            N     +           HGG +  ++ Q+HQG+Q+   +G+HDGGN H QE ER+T + 
Sbjct: 3241 NVPGVSA----------GHGGGNLPSETQIHQGSQTSGAVGTHDGGNLHVQESERATMI- 3300

Query: 3301 SSTHAGNDQSLPQTSSNVNEGTQNALRRSAALGLVGSAASAFDAAKDIMEALRSKHTNLA 3360
            ++ H+GNDQ + Q+SS                 +  SAA AFDAAKD+MEALRSKH NLA
Sbjct: 3301 NNVHSGNDQPMNQSSS-----------------MAISAAGAFDAAKDVMEALRSKHNNLA 3360

Query: 3361 SELEILLTEIGSRFVTLPEERLLAVVNALLHRCYKYPTATTAEVPQSLKKELSGVCKACF 3420
            SELE+LLTEIGSRFVTLPEERLLAVVNALLHRCYKYPTATTAEVPQ LKKELSGVC+ACF
Sbjct: 3361 SELEVLLTEIGSRFVTLPEERLLAVVNALLHRCYKYPTATTAEVPQPLKKELSGVCRACF 3420

Query: 3421 SADAVNKHVDFVREYKQDFERDLDPESTSTFPATLSELTERLKHWKNVLQGNVEDRFPAV 3480
            SADAV KHV FVREYKQDFERDLDPES S FP TL++LT++LK WKN+LQ NVEDRFP +
Sbjct: 3421 SADAVTKHVAFVREYKQDFERDLDPESNS-FPVTLADLTKKLKDWKNILQSNVEDRFPVL 3480

Query: 3481 LRLEDESRVLRDFHVVDVEVPGQYFTDQEIAPDHTVKLDRVGADIPIVRRHGSSFRRLTL 3540
            LRLEDES+VLRDF+VVDVE+PGQYF DQE+APDHTVKLDRVGADI IVRRHGSS RRLTL
Sbjct: 3481 LRLEDESKVLRDFNVVDVEIPGQYFADQEVAPDHTVKLDRVGADIQIVRRHGSSCRRLTL 3540

Query: 3541 IGSDGSQRHFIVQTSLTPNARSDERILQLFRVMNQMFDKHKESRRRHLCIHTPIIIPVWS 3600
            IGSDGSQ+HFIVQTSLTPNARSDERILQLFRVMNQMFDKHKESRRRHL +HTPIIIPVWS
Sbjct: 3541 IGSDGSQKHFIVQTSLTPNARSDERILQLFRVMNQMFDKHKESRRRHLGLHTPIIIPVWS 3600

Query: 3601 QVRMVEDDLMYSTFLEVYENHCARNDQEADLPITYFKEQLNQAISGQIAPEAVLDLRLQA 3660
            QVRMVEDDLMY+TFLEVYENHC RN +E+DLPITYFKE+LNQAI+GQI+PEA+ DLRLQA
Sbjct: 3601 QVRMVEDDLMYNTFLEVYENHCGRNGRESDLPITYFKEKLNQAITGQISPEAIGDLRLQA 3660

Query: 3661 YGDITRNLVNEGIFSQYMYKTLLSGNHMWAFKKQFAIQLALSSFMSYMLQIGGRSPNKIY 3720
            YG+IT+N+VN+ IFSQYMYKT +SG+H+WAFKKQFA+QLA+S+FMS++LQIGGRSPNKI 
Sbjct: 3661 YGEITKNIVNDTIFSQYMYKTSMSGSHLWAFKKQFAVQLAVSNFMSFILQIGGRSPNKIL 3720

Query: 3721 FAKNTGKIFQTDFHPAYDTNGMIEFNEPVPFRLTRNMQAFFSHFGVEGLIVSAMCSAAQA 3780
            FAKN+GK+FQTDFHP+YD+NGMIE NEPVPFRLTRNM AF SHFGVEG ++S MCSA+QA
Sbjct: 3721 FAKNSGKMFQTDFHPSYDSNGMIELNEPVPFRLTRNMHAFLSHFGVEGPLMSNMCSASQA 3780

Query: 3781 VVSPKQNQHLWHQLAMFFRDELLSWSWRRPLGMPLASIAG-GGMSPADFKQKVTINVDHV 3840
            V S KQN+HL +QLAMFFRDELLSW  RRPLG+P+  +AG   +S  + K KV  NVD V
Sbjct: 3781 VFSSKQNEHLRYQLAMFFRDELLSWFGRRPLGVPIPPVAGIATLSSPELKHKVNSNVDDV 3804

Query: 3841 IGRINGIAPQYFSEEEENAMDPPQSVQRGVSDLVDAALMPRHLCMMDPTWHPWF 3889
            IGRI GIAPQYFSEE+EN+++PPQSVQRGVS+LV+AAL PR+LCMMDPTWHPWF
Sbjct: 3841 IGRIRGIAPQYFSEEDENSVEPPQSVQRGVSELVEAALSPRNLCMMDPTWHPWF 3804

BLAST of MS010599 vs. TAIR 10
Match: AT1G50030.1 (target of rapamycin )

HSP 1 Score: 76.3 bits (186), Expect = 6.2e-13
Identity = 81/357 (22.69%), Postives = 155/357 (43.42%), Query Frame = 0

Query: 3476 LRLEDESRVLRDFHVVDVEVPGQYFTDQEIAPDHTVKLDRVGADIPIVRRHGSSFRRLTL 3535
            L LE  S  L     +++ VPG Y  D  +     V +      + ++       R+LT+
Sbjct: 2030 LDLESVSPELLLCRDLELAVPGTYRADAPV-----VTISSFSRQLVVITSKQRP-RKLTI 2089

Query: 3536 IGSDGSQRHFIVQTSLTPNARSDERILQLFRVMNQMFDKHKESRRRHLCIHTPIIIPVWS 3595
             G+DG    F+++     + R DER++QLF ++N + +  +++  + L I    +IP+  
Sbjct: 2090 HGNDGEDYAFLLKGH--EDLRQDERVMQLFGLVNTLLENSRKTAEKDLSIQRYSVIPLSP 2149

Query: 3596 QVRMV----EDDLMYSTFLEVYENHCARNDQEADLPITYFKEQLNQAISGQIAPEAVLDL 3655
               ++      D ++    E  +      +QE    +++  +  N  +   IA   V + 
Sbjct: 2150 NSGLIGWVPNCDTLHHLIREHRDARKIILNQENKHMLSFAPDYDNLPL---IAKVEVFEY 2209

Query: 3656 RLQ--AYGDITRNLVNEGIFSQYMYKTLLSGNHMWAFKK-QFAIQLALSSFMSYMLQIGG 3715
             L+     D++R L  +   S+           +W  ++  +   LA+ S + Y+L +G 
Sbjct: 2210 ALENTEGNDLSRVLWLKSRSSE-----------VWLERRTNYTRSLAVMSMVGYILGLGD 2269

Query: 3716 RSPNKIYFAKNTGKIFQTDFHPAYDTN-GMIEFNEPVPFRLTRNMQAFFSHFGVEGLIVS 3775
            R P+ +   + +GKI   DF   ++ +    +F E VPFRLTR +       G+EG   S
Sbjct: 2270 RHPSNLMLHRYSGKILHIDFGDCFEASMNREKFPEKVPFRLTRMLVKAMEVSGIEGNFRS 2329

Query: 3776 AMCSAAQAVVSPKQNQHLWHQLAMFFRDELLSWSWRRPLGMPLASIAGGG--MSPAD 3823
               +  Q + + K +  +   +  F  D L++W       +P  ++ G     +PAD
Sbjct: 2330 TCENVMQVLRTNKDS--VMAMMEAFVHDPLINWRLFNFNEVPQLALLGNNNPNAPAD 2362

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_022133382.10.0e+0099.79transformation/transcription domain-associated protein-like [Momordica charantia... [more]
XP_038882073.10.0e+0096.07transformation/transcription domain-associated protein-like [Benincasa hispida][more]
XP_008440816.10.0e+0095.61PREDICTED: transformation/transcription domain-associated protein-like [Cucumis ... [more]
XP_004134864.10.0e+0095.50transformation/transcription domain-associated protein [Cucumis sativus] >KGN489... [more]
XP_022950590.10.0e+0095.22transformation/transcription domain-associated protein-like [Cucurbita moschata][more]
Match NameE-valueIdentityDescription
A0A0R4ITC50.0e+0027.53Transformation/transcription domain-associated protein OS=Danio rerio OX=7955 GN... [more]
Q9Y4A50.0e+0027.16Transformation/transcription domain-associated protein OS=Homo sapiens OX=9606 G... [more]
P388110.0e+0026.16Transcription-associated protein 1 OS=Saccharomyces cerevisiae (strain ATCC 2045... [more]
Q8I8U70.0e+0026.15Transcription-associated protein 1 OS=Drosophila melanogaster OX=7227 GN=Nipped-... [more]
Q54T850.0e+0025.85Probable transcription-associated protein 1 OS=Dictyostelium discoideum OX=44689... [more]
Match NameE-valueIdentityDescription
A0A6J1BWI40.0e+0099.79Non-specific serine/threonine protein kinase OS=Momordica charantia OX=3673 GN=L... [more]
A0A1S3B1J80.0e+0095.61Non-specific serine/threonine protein kinase OS=Cucumis melo OX=3656 GN=LOC10348... [more]
A0A0A0KKP10.0e+0095.50Non-specific serine/threonine protein kinase OS=Cucumis sativus OX=3659 GN=Csa_6... [more]
A0A6J1GG630.0e+0095.22Non-specific serine/threonine protein kinase OS=Cucurbita moschata OX=3662 GN=LO... [more]
A0A6J1IRI90.0e+0095.11Non-specific serine/threonine protein kinase OS=Cucurbita maxima OX=3661 GN=LOC1... [more]
Match NameE-valueIdentityDescription
AT2G17930.10.0e+0081.56Phosphatidylinositol 3- and 4-kinase family protein with FAT domain [more]
AT4G36080.10.0e+0078.38phosphotransferases, alcohol group as acceptor;binding;inositol or phosphatidyli... [more]
AT4G36080.30.0e+0077.79phosphotransferases, alcohol group as acceptor;binding;inositol or phosphatidyli... [more]
AT4G36080.20.0e+0077.73phosphotransferases, alcohol group as acceptor;binding;inositol or phosphatidyli... [more]
AT1G50030.16.2e-1322.69target of rapamycin [more]
InterPro
Analysis Name: InterPro Annotations of Bitter gourd (TR) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableCOILSCoilCoilcoord: 3344..3364
NoneNo IPR availableCOILSCoilCoilcoord: 1302..1322
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 3229..3315
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 2062..2082
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 1594..1628
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 2011..2033
NoneNo IPR availablePANTHERPTHR11139:SF109BNAC09G09620D PROTEINcoord: 24..3886
NoneNo IPR availablePANTHERPTHR11139ATAXIA TELANGIECTASIA MUTATED ATM -RELATEDcoord: 24..3886
NoneNo IPR availableCDDcd05163PIKK_TRRAPcoord: 3513..3802
e-value: 1.39294E-114
score: 362.61
IPR003152FATC domainSMARTSM01343FATC_2coord: 3856..3888
e-value: 2.7E-4
score: 30.2
IPR003152FATC domainPROSITEPS51190FATCcoord: 3856..3888
score: 12.304352
IPR000403Phosphatidylinositol 3-/4-kinase, catalytic domainSMARTSM00146pi3k_hr1_6coord: 3549..3856
e-value: 9.7E-11
score: 50.9
IPR000403Phosphatidylinositol 3-/4-kinase, catalytic domainPFAMPF00454PI3_PI4_kinasecoord: 3555..3801
e-value: 5.3E-25
score: 88.7
IPR000403Phosphatidylinositol 3-/4-kinase, catalytic domainPROSITEPS50290PI3_4_KINASE_3coord: 3543..3752
score: 14.0713
IPR003151PIK-related kinase, FATPFAMPF02259FATcoord: 2803..3141
e-value: 1.7E-41
score: 142.6
IPR011989Armadillo-like helicalGENE3D1.25.10.10coord: 693..1508
e-value: 1.1E-11
score: 45.4
IPR036940Phosphatidylinositol 3-/4-kinase, catalytic domain superfamilyGENE3D1.10.1070.11coord: 3659..3808
e-value: 7.4E-16
score: 60.4
IPR014009PIK-related kinasePROSITEPS51189FATcoord: 2670..3213
score: 21.77673
IPR011009Protein kinase-like domain superfamilySUPERFAMILY56112Protein kinase-like (PK-like)coord: 3449..3838
IPR016024Armadillo-type foldSUPERFAMILY48371ARM repeatcoord: 554..1539
IPR016024Armadillo-type foldSUPERFAMILY48371ARM repeatcoord: 1657..2365
IPR016024Armadillo-type foldSUPERFAMILY48371ARM repeatcoord: 68..446

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
MS010599.1MS010599.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0016301 kinase activity
molecular_function GO:0005515 protein binding