Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
GAATTCTCAAGTTAAATGCATCATGGGAGCATGAATTCTGGGTCATCCAATAACAGTATAATCAGCAACAATGGAGTTCTCAAAAGCAAACACTGCCATCATCATCTACCACTGAATCCAGGAACCTCCAAGCAAGTACTGAAATGGAGGAGAAGCGCCGAGACGCCGGAAATTTACCGGCGAACACCACGGATTCGCCTTCATCGGAGCCGCCTTCGTCTCGCCGTCGAGCTGGAGCTCAGAAGCGAAAGGTCAGCGCTCTCGGTGGCTCTAACTCCTCATCCGCTCCTTCGAAACGCGTTACTCGGGACAAATTTGCTCTTTTGCATCCTCCAAATCACAACGGTCCCTTCACTAGAGCTCGACTTGGCCCTAACAATGGCGCTGGAACAGCATCGGGTAATGCGGCTGAAGGTATCTCCGCCGCGGGATCAGTTAAGGTGGAGGGATCTTTTCTTCATTCCGAAGTTCAGCGTGGAGACACACTGGTCGCCGCGGCGGAGGAATTGAACAAGGCGAGTAGATTGGCGAATTTGGAAGCGTCTTTCGTAGCTGATTTCGAATCTATTAAATCTCGGGGTATGAATGCTCACGTCGTTCCGAATCATTGCGGTGAGTTTGGTTTACACACATGCAAAATCCTGGTTTGTGCCAATTATTTTAGAGGCTTATGGAGATGACTTTCCTGTTGAGAAAATGGAGTAATAGGAGAAAATTTTGAGGGGCTTGATGAAGGAGAGCTCCCCACTTTTCTTTGTTGTGATTAGGAATTCATATCTCAGTAATGCATATCTAACTCTACATTAGGCTTCGTGTATCTGTTTCAATCAAGTTCAGTGCACATTTTTTCACTTTAATTTTGTAAACATTTATTAGCAAATCGAGGGTAGTTTAAAGAAAGGTATATATAAGAACGTGGTGGAAGAAATAATAGTTACATCTTGAGCTCCTTCTGCCTTTCTGAAACTGTGATTGAATGCAAGAACATGAGAAAAATTTAAGCTCTCTGAAAGTTCTTATTATCCGAATCAATTGTGAAGAAGGATGGTTTGAACAACTGGAAATTAGGCCCCTTTCTTAACTTCTAGCTCATTTGTGTTGGCTTTGTTCCAGTTCTCATTTCGTAGAAAAAAACTCTGTCAAAAAGAGGAGAAAATTCAGGTTATTAAATTTTTTAATCTGTGATTTTATTTTGAAACTCAAAGGCGTATTGGGTTTAGGCCTCCAATGCATTTTTGCATGAACCTCTAGATGTAAGCTTTCAACCTTGTAAATTTATTGGTCGTGAATTAGTCATTTTTGCATCGGTGGTAGCGTAAAATTAGAGCTGCACTGCAGTACAACTCTTCTTTAATGGTCCATTAGTTTTCATTAATTTATTTGTGCGGAGGTCGAGAGGTCGAGAGTTTCAGATTGAGAATATAATTACCCTGTAGATAATCTTATGATATGTGAAATTATTTGAATGATGCCTGTTATCTTGGATTGGACAAGTACGTAGACAGATGTTTTTCATGGGCATCGAGAGGGAGTCTCAACCAGTATTTTTTGTTTCCCTTTTTTTGCCTTTCTATTTCTTGTTCATGGTTTGTTTCTTAAAGCTTCTTAATTATCTAAAATATGTATTGAGATTGATTTGACTCAGAATAACCAGCCATGCAATAAGCTTAATTCTTCTTCTTGACAAAAATTTCCTTTCTACCTTTCTAGCCCCTAGAACAAGCTTAATCCAAGGCTTTCTTTCATTAGTAAAGAGATATCCACTAGTTCATTGATTATTGATTTGGTCTTTGTTTTCAGGTTGGTTTTCATGGACAAAAGTCCACCCGATTGAGGAACGCTCGATGCCTTCTTTTTTCAGTGGAAACTCTGGCACTCGAAGCCCTGATATTTATATTGAGATACGTAATTGGATTATGAAAAAATTCCATGCAAATCCTAGTACGCAGATTGAGTCAAAAGATGTATCAGAGATGGAAATCGGAGAACTAGAGGCTAGA
CAGGAGGTGATGGAGTTTCTAGACCATTGGGGTTTGATTAATTTTCACCCTTTCCTATCTGCAGATTCAACTTCAACAAGTGATGTTGATGATGAAAATCAAAAGGACTCTTTGGTTGAGAAGTTGTTTCACTTTGAAACATTAGAATCCTCTCCATCTGTTGTTCCAAAGACTAATGTTACCACCGCTCCACCAAGATTGCTTCGAGAATCTGCAATTTCTGAAGAGATGGTGAGGCCTGAGGGTCCATCTGTTGAGTACCACTGTAACTCGTGCTCTGGTGATTGCTCTCGGAAACGGTACCACTGCCAGAAGCAGGTTGGTTTTCAATTCAAAACTCCTTCCTTAAACAGTTTATTGTTGGAATATCAAATTCGTATTTCAGGAATCGCATGGATGCATAATCATGACATCTAAATTTCATGAGTTGTCTCGCCAGTCAAATGTTATGTATCATTTACATATCAACTAATAGTATTCATCTTTGACAGGCAGATTTTGATTTATGTTCGGAGTGCTTTAACAATGGGAAATTTGATTCTGATATGTCTTCATCAGATTTTATTCTCATGGAGTCTGCTGAGGTTCCTGGTGCTAGTGGAGGTAAGTGGACAGATCAGGAAACTCTCCTCCTCCTTGAGGCTTTAGAGCTTTATAAGGAAAACTGGAATGAGATTGCAGAACATGTGGCCACCAAAACAAAAGCCCAATGTATATTGCACTTCATTCAAATGCCAATTGAAGATAGCTTTCTTGAATCTGAGAACAATGATGAAGTCGGTGCAAAAGAAACTGTTGTTCCACCATCAAATGAAAATGATTCGTCAGTTCCCATGGATATCACTGAATCGATGGATAACAAGACTACTAGAAAAGAGGCCTCAAATGTAGAAAATGCCAGCAAGGAAGATACAGTTGAGGTAAAAGTGGGGCAGGATAATTCAAAATCAAAGGATGTTGAAGTAAAAGCTGCTTTAGACAACTCGAAATCAGAAGATGGTGGTCAGAAGGTTTCTGAAGACATTGCCTTGAATGCTTTGAGGGAGGCATTTGAAGCCATTGGTTATGTATTAACATCTGATCAGCACCCACTTTCATTTTCTGGTGTAGGGAACCCTGTCATGGCACTGGTAAGTTGAAATTTAGAGTATAAGAACTAATGAAATAGGTAGTCTTCCTCATTTTATTTGATCGATCTTGATAATTTTTAATTCATTGGTATATCTCAAATCAATTATAGTAGATTTGTCATTTGTCGGATCAATATATGGTGTTTCCATTGTTTCTCTGAATCAGTTTAGGATTAAGTCTAATGATGTTGATGTTTTTTGCTTTTCATTAAAAAGCATCTAAAAATATAACCACTTTTTATAATTGTGCAGTGAACCAAAAACTTAAGTCATCACCTAAGTAGGTTCCTTGTGTCTGTTGAACTTGGTTTCTGGCATTTTCTTCAATTTTCCAAGTCCCAAAAATAGAAGATGATTGTTCACGTGATGAATCTTTTGGATGACATGGCAGGCATTTTAATTCTTGTTAGCCTATAAGATGCACCAGGTCTGCACTGCAAAGGAGCAGTTTGACGTTCACTTTGTATGAACTTACTACTTTGCTTGGAGGTGCACGTCAACTTTGTTGATTCTAGATTCTTGCTATCTTTTTCCTTTTCTTCTTTTATATTCAATTCAAATAATTTTCTATCCTAGCTAGAAAACGTTGGTTTAGCTTGAACCTGCTTTAGCCTTCTAGAAGTTTTCTGTGAGATATGTGTTTACATGAACCAAGTTTTGACAGTTATCTTTGTTGCAAGTTCTCATTCATTGACTTCTGTTGTTGCCCAGAATTACCTTCAGTTAGGTACTATCATGTTTATCATATTTATAGATGGAGCAGAAATGCTTAAGCCTATACTCTTTGTTCTTATCAGTAATTAGCACTCCCTATAATCATAAATTTTATTGCTGGTTGTTTAAATTATTAAATACTTCCTTTTCAGGCTG
CATTCCTTGCACGCTTAGTTGGATCTGATGTTGCCGGTGCATCAGCTCATTTTTCTTTGAAAAGCATATCTCAGAAATCTCCCAGTTTAGATCTGGCTACAAGACACTGCTTTATTTTAGAAGATCCACTAGATGACAAGGCACAAGCTAATTCAGAGAGGTACTTGACGTTGAGACTTGACATGTTTGTTTCAGTTTAGTTCTATTCTTTGATCACAACTGATACTATTGCAGGGTTGTCAATGTGGAAGCTCAGCAAAATGTCAATGAACAGTGTGAAAAACAGAGGAAAGACAATTCTACTTCAGTCTTAGATGACAGAGCCTTATCAACGACTAACATTGATTACAAAAATGGAGAATCTGAGACAGAGGAAACAACAATGGAGAATAGAAATTCTTCAGATGCTACTAAAGAACACGATCCAATGGTTAATCATGGTTCAGATGGAACAAATAAATTGAAAGAACTGACAGAACCAGAAGCTCAGCAAAATGTCAATGAACAGTGTGAAAAACAGAGGAAAGACAATTCTACTTCAGTCTTAGATGACAGAGCCTTATCAACGACTAACATTGATTACAAAAATGGAGAATCTGTGACAGAGGAAACAACAATGGAGAATAGAAATTCTTCAGATGCTACTAAGGAACACGATCCAATGGTTAATCATGGTTCAGATGGAACAAATAAATTGAAAGAACTGACAGAACCAGAAGTGCCCAAGGATGATAGAACAGGCATTGTTAAGGAAACTGAAAATATGGAATCAAAATTGACAACAAATACATTTGAAAAATTGGGAGAAGAAACTTCTTTTGAAAAGCCATCACAATCTACGTTGTTATCAAAGGATATACATATCTCAGATCTACAGTATGCTGAAAAAACTGAGATTCAGAGACAAGATCCATCTCCTTCTGTCAATACTTCAAAAATAGATGATGTGCCAAATCCTTTACCTTCCGTGAATGAACTTCAGCCACTTTTTGCTGCCACTTCAGTGAAAGTAGCCTCAAGTGATGTAGCTATGGTATCTGATCCTCGAGATAAGAGTGAACCTGCACAAACTGAAACATCTAAATCTTTGGTTGACCAGGGAGCAAGCAAGGTCTCTGATTCTTTGCCCACAGAAGAGAATGCAACTCCACAGCCAGTTAAACAAAATCCAGTTCTTGATAAAGGAACAGGTGCGTGTTTCTTTTGTAGTTTTCCTCTAGTTTTAATTTTGTAGTCTATGGACCTCCCACCAGGCAGGTTACATTTTTGAGTTTTTTAAGAAACTGTTCAAGACTTTCCAAAGCTTGCACGTTCCAAGGGGATGGGGAAATACAAGTTATTTCCTATTGAACAATATTGACTTTGATTTGTTTTTAGGAACTTTATCATGCACTGTGATGTATGTTTCTGAGAAAAAGGATCATCAAGAACCGCAGAAGGAGTTTAGGACTTCTGTTAAGTTTGCTAGGAACTCTGAACTTGGGATGTCCAATGTCTTAACGCCAATTTATTTGGTCGAATCTTAGAGAAAATCTTGCTCATTAAAGGTAGGATAAATTTCTACCTGCCAATAGGAAGATTATTTCCTTCAAATTAGACAAGAGGCTTGGATGGAAGAGTAGGCAAGAGTTCAAAATGGGCTTTGACTATTAACTCTTACCCTTTTATACTACTCACAAGATGAATTGGCAAGAGTTCTCTGGTTTGCTTGGACATTCCTAAATGAAAAAACCTGAAGTGGTCTTGAACTCTGGAACTGAATAAACAAGTCTATGGGCAACACAGAAGAGAAAAGTTTGTTGAAAATAAAAATTATAAGTGAAGGTTCCGAGGTGTTGAGAGAAGAGGGGCTTGAATTGAAAGAATAGTTTGAGAGGTTGCATACCAGAGAAGAATTGCCCAGATAAAAGGTTAAACCTTAATGGTTCCACAAAAAATCCCCTGTACAAGGCAATCAGGAAAGGTTTTTACCCGAGGAAAATCGAAGTTTTTATTTTTAT
TTTTACGGGAGCTAAGACATGGAGGTATTAACCCAACTGCTTGACTTAAAAAAGGTTGCTATAAATATCTTGCAATATCACTTAATTGTGTTATGTGCATGAAGGCTACAGAATGGCACTGCCTTTTATTTGCCTATTGCACCTTTGCAAGAAATTTCTGGGAGTTTGTGCTTGCTGCTTTTGATTGGTTCATGGTGACGGGAAACATCCAAGATCTCTTAGCCATCTTTCTCTTGGAACACCCTTTTGGAATCATTAGAGAATAAGCTTCTTTGGTTATCTATCAAATAGTCTTCTTTTGGAATCATTGGGAAGAAAGGAATAGCTGAATTTTTATTCTCATTGTAGTTTTTCTACCTTAGTCGCTAATTGGAGGGTTTTTTTGCAACTACCTTAGTCGCTATAGTGGCTGGGTTAATCCTTTCTTCTGTACACTTCATTCATCAATAAAATCATATATTTCCTCCAAAGAAAACGGTTTAAACCCAGGAAATTAACTCTTATCTTTCCATAATAGTTCAGTGGAAGGAAGTGTGAAAAATGGATATAAATTGATCAGAAACGATGGAATTTGTGTTGAATCTCTCTTAGAAATTATACAGAAGTACTTAACCACTCTCATCAACTTTTCTCGTCTAGTTCTTCCATCTCCTTTGAATTGGAGGGATTAGATTGGCGCCATGTTCTTTTAACGTGCCACTTAATTCAGGCACATTAGTACAAACGCCAATCCTTATAAGTGCAGCATTAAGGTACTTATACATTGGAAAAAAGTGTTTAGTACTACTATTTCAAGAAATGTATCCACTTTTTTTCATGAGAGATAAATCCTTGTTCTAGTGGTAAAAACGGATGGCAGTATCTTTCTTCTTGGAGAAAGTGTAATTTTCTTGGGGGAATCATACTGTATTCGACAAAACCAATCATAGTGAAGATTCACTTTATTTCTTATCCTTCAAAGTACCCTATGCCAAAGTCCATGACCTCCAATTGGAGTAGCCATCCCAAGTTCCTTTACTCCACCCATCTTCATTAACTTAGATTGGAAGGTCTTCTTGTAACCTCCCTTGGTTGGAATGAGACACCTTCTCCCTTTGCTCATTTGGATTGGTTTTGCAGAAGAGGAAACAAGTTGGACTAGAAATCAAGAGTTTCATGGGAAGAAAGGGGGCTGTAACACTTTTTTTAATAGTATCTCAACTTTGTCTTTGCTTTAATAATATGATGGAGCAACTATTTAAGCAATTTTTTTTTTCCTTCTCCAAGTTTTTAGGGTTTTCGTGGATGAGTTAGAATGGCAGCCTATGCATAGTAGAGATAAGTTTCGGAGATTAAGCATGTAGCGTTTAGTTTTGTTTCGACTTGATTACCTTGTCTGAAGAGTGTTGGGAGAGTATTAAATTGGATGTGTTAAAGGTTAGGAGACATGTTTGAAAGAGTTGTAGAGAATGATTTGAAGGTGATGAGTCCTGCAACATATGCTTTAAGATCACAGAATTACGTTTGGTTCTTTGTGAACAAGATGGTATAATTTCTTGTGCTTTTCTCATCCTTTGTTTTTTCTAGTATAATATATTATGATGAAAACTGTGTTCGATCTGCTGTGCTCATCATTTGTGTGTGTATATATATATATATATCTAAGGACGGGAAAATCCCCGAAAAGAATTAGTTGTTACTATGACACGACCAAGAAGCTAGAAAATGAACGTGGTACAAAAAGTATTGGTGTTGGTGATATCTTCTAAGAGGTCGTTTGGTTTAAGACTTTATAGATGCTTGGAATGAACGGGTATCTGTCATTGATATGCTTGGAAATAAGATTGCTAGGTATAAAAGATTATTGTGTTTAGTTGTGCATGATAACGTTATTTGGAATTTATGAGTACAAATTATGTTTATATTTATTTTATGGAACTAATGGAGATAACCTTATATATAATTTAATAGATTAATTAATGAAACAATAAAAAAATTAATTCAAAATATCACTAAAAACT
AATCTTTTTTAAATCCAAAATTGATTTATTAAAAATAATAAATCCTTTTAATTGGATAATTGAATATGATTTTAGCAGATAAGTTCATTAATTGATATATTTTTGTCGACAAAATTAATTGAATAAATATATGAATATTATTATTTTTAATTTTAATTGATTAATTATGATACATTCATAATGAATGTACTCATCAAAATTAGTCAAAGAAATAGTTAATTGATTATAAATGATCGATTAATTTGTTATAATTGAAAAATAATTAAGTAAATTAATTTTGGTCAACCAAAATCAATTAATTAATTAACTATATTCAAAATAAAATTGAATAACACGAGCAGATCATCAATCTTCTTTCATTTGCTAGGTTCGGCCAGAAGTTCTCTGAGGTTTCAAAATGTTTGTATAGTTCTTTTAGGTTTATATACTGACTGCCTGTGTTATTTATCGTGCCTGTTTATTTGTTCATAACAGATGATAATCAAAGCAAGAACAATGAAGAAGAAAATTCCAAATGTACAAGTAAGAAAGAGGAAAAAGTCGATAAGCTGAAGCGTGCTGCAGTTACGACGCTTGCAGCAGCAGCTGTGAAGGCAAAAGTTCTGGCTAATCAAGAAGAAGATCAAATTCGTCAACTTGCCATGATATTAATCGAGAAACAGGTATATCGATCAAAGTTGGTTTCGATCGATTTTTATTACATTTTAATGAAATTATTACCCCTTCTTTGGGCAGCTGCATAAGTTGGAAAGCAAGTTAGCATTCTTCAACGACATGGACAACGTGTCAATGAGAGTCAGGGAGCAACTGGACAGGTCAAAGCAAAGGCTTTTCCAGGAACGTGCGCAGATAATTGCTGCTCGACTTGGCTTACCTGCTTCGGCATCACGAGGTGTGGCACCGGTGTTGCCAGGAAACAGAATGGCTAGGAACTTCCCAAACTCAGTTCCAAAGCCTCCAATGGGCATGGCGCCCCAAAGGCCACCAACTTCTGGACCACGGGTATAGCTAGCTGCTACTAATCCTAACCCGAAATATGCAACCAACCAACACCAGTACCACAATTTCTGTAAGTTCATTTTTCACCTGCAAATCATCAGGACACACTTTCTTCTGTTGGTTCCAAGTAA
mRNA sequence
GAATTCTCAAGTTAAATGCATCATGGGAGCATGAATTCTGGGTCATCCAATAACAGTATAATCAGCAACAATGGAGTTCTCAAAAGCAAACACTGCCATCATCATCTACCACTGAATCCAGGAACCTCCAAGCAAGTACTGAAATGGAGGAGAAGCGCCGAGACGCCGGAAATTTACCGGCGAACACCACGGATTCGCCTTCATCGGAGCCGCCTTCGTCTCGCCGTCGAGCTGGAGCTCAGAAGCGAAAGGTCAGCGCTCTCGGTGGCTCTAACTCCTCATCCGCTCCTTCGAAACGCGTTACTCGGGACAAATTTGCTCTTTTGCATCCTCCAAATCACAACGGTCCCTTCACTAGAGCTCGACTTGGCCCTAACAATGGCGCTGGAACAGCATCGGGTAATGCGGCTGAAGGTATCTCCGCCGCGGGATCAGTTAAGGTGGAGGGATCTTTTCTTCATTCCGAAGTTCAGCGTGGAGACACACTGGTCGCCGCGGCGGAGGAATTGAACAAGGCGAGTAGATTGGCGAATTTGGAAGCGTCTTTCGTAGCTGATTTCGAATCTATTAAATCTCGGGGTATGAATGCTCACGTCGTTCCGAATCATTGCGGTTGGTTTTCATGGACAAAAGTCCACCCGATTGAGGAACGCTCGATGCCTTCTTTTTTCAGTGGAAACTCTGGCACTCGAAGCCCTGATATTTATATTGAGATACGTAATTGGATTATGAAAAAATTCCATGCAAATCCTAGTACGCAGATTGAGTCAAAAGATGTATCAGAGATGGAAATCGGAGAACTAGAGGCTAGACAGGAGGTGATGGAGTTTCTAGACCATTGGGGTTTGATTAATTTTCACCCTTTCCTATCTGCAGATTCAACTTCAACAAGTGATGTTGATGATGAAAATCAAAAGGACTCTTTGGTTGAGAAGTTGTTTCACTTTGAAACATTAGAATCCTCTCCATCTGTTGTTCCAAAGACTAATGTTACCACCGCTCCACCAAGATTGCTTCGAGAATCTGCAATTTCTGAAGAGATGGTGAGGCCTGAGGGTCCATCTGTTGAGTACCACTGTAACTCGTGCTCTGGTGATTGCTCTCGGAAACGGTACCACTGCCAGAAGCAGGCAGATTTTGATTTATGTTCGGAGTGCTTTAACAATGGGAAATTTGATTCTGATATGTCTTCATCAGATTTTATTCTCATGGAGTCTGCTGAGGTTCCTGGTGCTAGTGGAGGTAAGTGGACAGATCAGGAAACTCTCCTCCTCCTTGAGGCTTTAGAGCTTTATAAGGAAAACTGGAATGAGATTGCAGAACATGTGGCCACCAAAACAAAAGCCCAATGTATATTGCACTTCATTCAAATGCCAATTGAAGATAGCTTTCTTGAATCTGAGAACAATGATGAAGTCGGTGCAAAAGAAACTGTTGTTCCACCATCAAATGAAAATGATTCGTCAGTTCCCATGGATATCACTGAATCGATGGATAACAAGACTACTAGAAAAGAGGCCTCAAATGTAGAAAATGCCAGCAAGGAAGATACAGTTGAGGTAAAAGTGGGGCAGGATAATTCAAAATCAAAGGATGTTGAAGTAAAAGCTGCTTTAGACAACTCGAAATCAGAAGATGGTGGTCAGAAGGTTTCTGAAGACATTGCCTTGAATGCTTTGAGGGAGGCATTTGAAGCCATTGGTTATGTATTAACATCTGATCAGCACCCACTTTCATTTTCTGGTGTAGGGAACCCTGTCATGGCACTGGCTGCATTCCTTGCACGCTTAGTTGGATCTGATGTTGCCGGTGCATCAGCTCATTTTTCTTTGAAAAGCATATCTCAGAAATCTCCCAGTTTAGATCTGGCTACAAGACACTGCTTTATTTTAGAAGATCCACTAGATGACAAGGCACAAGCTAATTCAGAGAGGGTTGTCAATGTGGAAGCTCAGCAAAATGTCAATGAACAGTGTGAAAAACAGAGGAAAGACAATTCT
ACTTCAGTCTTAGATGACAGAGCCTTATCAACGACTAACATTGATTACAAAAATGGAGAATCTGAGACAGAGGAAACAACAATGGAGAATAGAAATTCTTCAGATGCTACTAAAGAACACGATCCAATGGTTAATCATGGTTCAGATGGAACAAATAAATTGAAAGAACTGACAGAACCAGAAGCTCAGCAAAATGTCAATGAACAGTGTGAAAAACAGAGGAAAGACAATTCTACTTCAGTCTTAGATGACAGAGCCTTATCAACGACTAACATTGATTACAAAAATGGAGAATCTGTGACAGAGGAAACAACAATGGAGAATAGAAATTCTTCAGATGCTACTAAGGAACACGATCCAATGGTTAATCATGGTTCAGATGGAACAAATAAATTGAAAGAACTGACAGAACCAGAAGTGCCCAAGGATGATAGAACAGGCATTGTTAAGGAAACTGAAAATATGGAATCAAAATTGACAACAAATACATTTGAAAAATTGGGAGAAGAAACTTCTTTTGAAAAGCCATCACAATCTACGTTGTTATCAAAGGATATACATATCTCAGATCTACAGTATGCTGAAAAAACTGAGATTCAGAGACAAGATCCATCTCCTTCTGTCAATACTTCAAAAATAGATGATGTGCCAAATCCTTTACCTTCCGTGAATGAACTTCAGCCACTTTTTGCTGCCACTTCAGTGAAAGTAGCCTCAAGTGATGTAGCTATGGTATCTGATCCTCGAGATAAGAGTGAACCTGCACAAACTGAAACATCTAAATCTTTGGTTGACCAGGGAGCAAGCAAGGTCTCTGATTCTTTGCCCACAGAAGAGAATGCAACTCCACAGCCAGTTAAACAAAATCCAGTTCTTGATAAAGGAACAGATGATAATCAAAGCAAGAACAATGAAGAAGAAAATTCCAAATGTACAAGTAAGAAAGAGGAAAAAGTCGATAAGCTGAAGCGTGCTGCAGTTACGACGCTTGCAGCAGCAGCTGTGAAGGCAAAAGTTCTGGCTAATCAAGAAGAAGATCAAATTCGTCAACTTGCCATGATATTAATCGAGAAACAGCTGCATAAGTTGGAAAGCAAGTTAGCATTCTTCAACGACATGGACAACGTGTCAATGAGAGTCAGGGAGCAACTGGACAGGTCAAAGCAAAGGCTTTTCCAGGAACGTGCGCAGATAATTGCTGCTCGACTTGGCTTACCTGCTTCGGCATCACGAGGTGTGGCACCGGTGTTGCCAGGAAACAGAATGGCTAGGAACTTCCCAAACTCAGTTCCAAAGCCTCCAATGGGCATGGCGCCCCAAAGGCCACCAACTTCTGGACCACGGGACACACTTTCTTCTGTTGGTTCCAAGTAA
Coding sequence (CDS)
ATGGAGGAGAAGCGCCGAGACGCCGGAAATTTACCGGCGAACACCACGGATTCGCCTTCATCGGAGCCGCCTTCGTCTCGCCGTCGAGCTGGAGCTCAGAAGCGAAAGGTCAGCGCTCTCGGTGGCTCTAACTCCTCATCCGCTCCTTCGAAACGCGTTACTCGGGACAAATTTGCTCTTTTGCATCCTCCAAATCACAACGGTCCCTTCACTAGAGCTCGACTTGGCCCTAACAATGGCGCTGGAACAGCATCGGGTAATGCGGCTGAAGGTATCTCCGCCGCGGGATCAGTTAAGGTGGAGGGATCTTTTCTTCATTCCGAAGTTCAGCGTGGAGACACACTGGTCGCCGCGGCGGAGGAATTGAACAAGGCGAGTAGATTGGCGAATTTGGAAGCGTCTTTCGTAGCTGATTTCGAATCTATTAAATCTCGGGGTATGAATGCTCACGTCGTTCCGAATCATTGCGGTTGGTTTTCATGGACAAAAGTCCACCCGATTGAGGAACGCTCGATGCCTTCTTTTTTCAGTGGAAACTCTGGCACTCGAAGCCCTGATATTTATATTGAGATACGTAATTGGATTATGAAAAAATTCCATGCAAATCCTAGTACGCAGATTGAGTCAAAAGATGTATCAGAGATGGAAATCGGAGAACTAGAGGCTAGACAGGAGGTGATGGAGTTTCTAGACCATTGGGGTTTGATTAATTTTCACCCTTTCCTATCTGCAGATTCAACTTCAACAAGTGATGTTGATGATGAAAATCAAAAGGACTCTTTGGTTGAGAAGTTGTTTCACTTTGAAACATTAGAATCCTCTCCATCTGTTGTTCCAAAGACTAATGTTACCACCGCTCCACCAAGATTGCTTCGAGAATCTGCAATTTCTGAAGAGATGGTGAGGCCTGAGGGTCCATCTGTTGAGTACCACTGTAACTCGTGCTCTGGTGATTGCTCTCGGAAACGGTACCACTGCCAGAAGCAGGCAGATTTTGATTTATGTTCGGAGTGCTTTAACAATGGGAAATTTGATTCTGATATGTCTTCATCAGATTTTATTCTCATGGAGTCTGCTGAGGTTCCTGGTGCTAGTGGAGGTAAGTGGACAGATCAGGAAACTCTCCTCCTCCTTGAGGCTTTAGAGCTTTATAAGGAAAACTGGAATGAGATTGCAGAACATGTGGCCACCAAAACAAAAGCCCAATGTATATTGCACTTCATTCAAATGCCAATTGAAGATAGCTTTCTTGAATCTGAGAACAATGATGAAGTCGGTGCAAAAGAAACTGTTGTTCCACCATCAAATGAAAATGATTCGTCAGTTCCCATGGATATCACTGAATCGATGGATAACAAGACTACTAGAAAAGAGGCCTCAAATGTAGAAAATGCCAGCAAGGAAGATACAGTTGAGGTAAAAGTGGGGCAGGATAATTCAAAATCAAAGGATGTTGAAGTAAAAGCTGCTTTAGACAACTCGAAATCAGAAGATGGTGGTCAGAAGGTTTCTGAAGACATTGCCTTGAATGCTTTGAGGGAGGCATTTGAAGCCATTGGTTATGTATTAACATCTGATCAGCACCCACTTTCATTTTCTGGTGTAGGGAACCCTGTCATGGCACTGGCTGCATTCCTTGCACGCTTAGTTGGATCTGATGTTGCCGGTGCATCAGCTCATTTTTCTTTGAAAAGCATATCTCAGAAATCTCCCAGTTTAGATCTGGCTACAAGACACTGCTTTATTTTAGAAGATCCACTAGATGACAAGGCACAAGCTAATTCAGAGAGGGTTGTCAATGTGGAAGCTCAGCAAAATGTCAATGAACAGTGTGAAAAACAGAGGAAAGACAATTCTACTTCAGTCTTAGATGACAGAGCCTTATCAACGACTAACATTGATTACAAAAATGGAGAATCTGAGACAGAGGAAACAACAATGGAGAATAGAAATTCTTCAGATGCTACTAAAGAACACGATCCAATGGTTAATCATGGTTC
AGATGGAACAAATAAATTGAAAGAACTGACAGAACCAGAAGCTCAGCAAAATGTCAATGAACAGTGTGAAAAACAGAGGAAAGACAATTCTACTTCAGTCTTAGATGACAGAGCCTTATCAACGACTAACATTGATTACAAAAATGGAGAATCTGTGACAGAGGAAACAACAATGGAGAATAGAAATTCTTCAGATGCTACTAAGGAACACGATCCAATGGTTAATCATGGTTCAGATGGAACAAATAAATTGAAAGAACTGACAGAACCAGAAGTGCCCAAGGATGATAGAACAGGCATTGTTAAGGAAACTGAAAATATGGAATCAAAATTGACAACAAATACATTTGAAAAATTGGGAGAAGAAACTTCTTTTGAAAAGCCATCACAATCTACGTTGTTATCAAAGGATATACATATCTCAGATCTACAGTATGCTGAAAAAACTGAGATTCAGAGACAAGATCCATCTCCTTCTGTCAATACTTCAAAAATAGATGATGTGCCAAATCCTTTACCTTCCGTGAATGAACTTCAGCCACTTTTTGCTGCCACTTCAGTGAAAGTAGCCTCAAGTGATGTAGCTATGGTATCTGATCCTCGAGATAAGAGTGAACCTGCACAAACTGAAACATCTAAATCTTTGGTTGACCAGGGAGCAAGCAAGGTCTCTGATTCTTTGCCCACAGAAGAGAATGCAACTCCACAGCCAGTTAAACAAAATCCAGTTCTTGATAAAGGAACAGATGATAATCAAAGCAAGAACAATGAAGAAGAAAATTCCAAATGTACAAGTAAGAAAGAGGAAAAAGTCGATAAGCTGAAGCGTGCTGCAGTTACGACGCTTGCAGCAGCAGCTGTGAAGGCAAAAGTTCTGGCTAATCAAGAAGAAGATCAAATTCGTCAACTTGCCATGATATTAATCGAGAAACAGCTGCATAAGTTGGAAAGCAAGTTAGCATTCTTCAACGACATGGACAACGTGTCAATGAGAGTCAGGGAGCAACTGGACAGGTCAAAGCAAAGGCTTTTCCAGGAACGTGCGCAGATAATTGCTGCTCGACTTGGCTTACCTGCTTCGGCATCACGAGGTGTGGCACCGGTGTTGCCAGGAAACAGAATGGCTAGGAACTTCCCAAACTCAGTTCCAAAGCCTCCAATGGGCATGGCGCCCCAAAGGCCACCAACTTCTGGACCACGGGACACACTTTCTTCTGTTGGTTCCAAGTAA
Protein sequence
MEEKRRDAGNLPANTTDSPSSEPPSSRRRAGAQKRKVSALGGSNSSSAPSKRVTRDKFALLHPPNHNGPFTRARLGPNNGAGTASGNAAEGISAAGSVKVEGSFLHSEVQRGDTLVAAAEELNKASRLANLEASFVADFESIKSRGMNAHVVPNHCGWFSWTKVHPIEERSMPSFFSGNSGTRSPDIYIEIRNWIMKKFHANPSTQIESKDVSEMEIGELEARQEVMEFLDHWGLINFHPFLSADSTSTSDVDDENQKDSLVEKLFHFETLESSPSVVPKTNVTTAPPRLLRESAISEEMVRPEGPSVEYHCNSCSGDCSRKRYHCQKQADFDLCSECFNNGKFDSDMSSSDFILMESAEVPGASGGKWTDQETLLLLEALELYKENWNEIAEHVATKTKAQCILHFIQMPIEDSFLESENNDEVGAKETVVPPSNENDSSVPMDITESMDNKTTRKEASNVENASKEDTVEVKVGQDNSKSKDVEVKAALDNSKSEDGGQKVSEDIALNALREAFEAIGYVLTSDQHPLSFSGVGNPVMALAAFLARLVGSDVAGASAHFSLKSISQKSPSLDLATRHCFILEDPLDDKAQANSERVVNVEAQQNVNEQCEKQRKDNSTSVLDDRALSTTNIDYKNGESETEETTMENRNSSDATKEHDPMVNHGSDGTNKLKELTEPEAQQNVNEQCEKQRKDNSTSVLDDRALSTTNIDYKNGESVTEETTMENRNSSDATKEHDPMVNHGSDGTNKLKELTEPEVPKDDRTGIVKETENMESKLTTNTFEKLGEETSFEKPSQSTLLSKDIHISDLQYAEKTEIQRQDPSPSVNTSKIDDVPNPLPSVNELQPLFAATSVKVASSDVAMVSDPRDKSEPAQTETSKSLVDQGASKVSDSLPTEENATPQPVKQNPVLDKGTDDNQSKNNEEENSKCTSKKEEKVDKLKRAAVTTLAAAAVKAKVLANQEEDQIRQLAMILIEKQLHKLESKLAFFNDMDNVSMRVREQLDRSKQRLFQERAQIIAARLGLPASASRGVAPVLPGNRMARNFPNSVPKPPMGMAPQRPPTSGPRDTLSSVGSK
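The protein sequence above should correspond to a frame-0 translation of the coding sequence under the standard genetic code. The following is a minimal sketch for checking that correspondence, not the pipeline that produced this record. It assumes the standard code applies, and the names `CODON_TABLE`, `translate`, `cds_start`, and `cds_end` are illustrative only. For brevity it translates just the first 24 and last 12 bases of the CDS shown above.

```python
# Standard genetic code. Codons are enumerated with bases in T, C, A, G order
# (TTT, TTC, TTA, TTG, TCT, ...); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence in frame 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3].upper()]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 24 and last 12 bases of the CDS listed above (hypothetical variable names).
cds_start = "ATGGAGGAGAAGCGCCGAGACGCC"
cds_end = "GGTTCCAAGTAA"

print(translate(cds_start))  # compare with the start of the protein sequence
print(translate(cds_end))    # compare with the end of the protein sequence
```

Passing the full CDS to `translate` and comparing the result with the full protein sequence would extend the same check to the whole record.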
Homology
BLAST of CmaCh11G011040 vs. ExPASy Swiss-Prot
Match:
Q8VY05 (SWI/SNF complex subunit SWI3D OS=Arabidopsis thaliana OX=3702 GN=SWI3D PE=1 SV=3)
HSP 1 Score: 655.6 bits (1690), Expect = 9.7e-187
Identity = 474/1096 (43.25%), Positives = 619/1096 (56.48%), Query Frame = 0
Query: 1 MEEKRRD-AGNL--PANTTDSPSSEP-PSSRRRAGAQKRKVSALGGSN-SSSAPSKR-VT 60
MEEKRRD AG L ++ DSP+SEP P+ RRR G KRK +ALGGSN SSAPSKR +T
Sbjct: 1 MEEKRRDSAGTLAFAGSSGDSPASEPMPAPRRRGGGLKRKANALGGSNFFSSAPSKRMLT 60
Query: 61 RDKFALL-HPPNHNGPFTRARLGPNNGAGTASGNAAEGISAAGSVKVEGSFLHSEVQRGD 120
R+K L P HNGP TRAR P+ A G +E ++ A V +G E
Sbjct: 61 REKAMLASFSPVHNGPLTRARQAPSIMPSAADGVKSEVLNVA--VGADGEKPKEE----- 120
Query: 121 TLVAAAEELNKASR-LANLEASFVADFESIKSRGMNAHVVPNHCGWFSWTKVHPIEERSM 180
EE NKA R LEA ADFE+I+SR N HVVPNHCGWFSW K+HP+EERS+
Sbjct: 121 ------EERNKAIREWEALEAKIEADFEAIRSRDSNVHVVPNHCGWFSWEKIHPLEERSL 180
Query: 181 PSFFSGNSGTRSPDIYIEIRNWIMKKFHANPSTQIESKDVSEMEIGELEARQEVMEFLDH 240
PSFF+G R+ ++Y EIRNWIM KFH+NP+ QIE KD++E+E+G+ EA+QEVMEFLD+
Sbjct: 181 PSFFNGKLEGRTSEVYREIRNWIMGKFHSNPNIQIELKDLTELEVGDSEAKQEVMEFLDY 240
Query: 241 WGLINFHPFLSADSTST-SDVDDENQKDSLVEKLFHFETLESSPSVV--PKTNVTTAPPR 300
WGLINFHPF D+ ST SD DD K+SL+ L+ F+ E+ P +V P+ P
Sbjct: 241 WGLINFHPFPPTDTGSTASDHDDLGDKESLLNSLYRFQVDEACPPLVHKPRFTAQATPSG 300
Query: 301 LLRESAISEEMVRPEGPSVEYHCNSCSGDCSRKRYHCQKQADFDLCSECFNNGKFDSDMS 360
L + ++E+++ EGP+VEYHCNSCS DCSRKRYHC KQADFDLC+ECFN+GKF SDMS
Sbjct: 301 LFPDPMAADELLKQEGPAVEYHCNSCSADCSRKRYHCPKQADFDLCTECFNSGKFSSDMS 360
Query: 361 SSDFILMESAEVPGASGGKWTDQETLLLLEALELYKENWNEIAEHVATKTKAQCILHFIQ 420
SSDFILME AE PG GKWTDQETLLLLEALE++KENWNEIAEHVATKTKAQC+LHF+Q
Sbjct: 361 SSDFILMEPAEAPGVGSGKWTDQETLLLLEALEIFKENWNEIAEHVATKTKAQCMLHFLQ 420
Query: 421 MPIEDSFLESENNDEVGAKETVVPPSNENDSSVPMDITESMDNKTTRKEASNVENASK-E 480
MPIED+FL+ + + +K+T +++D+SV D E +NK E ++ + E
Sbjct: 421 MPIEDAFLDQIDYKDPISKDTTDLAVSKDDNSVLKDAPEEAENKKRVDEDETMKEVPEPE 480
Query: 481 DTVEVKVGQDNSKSKDV-----EVKA-----ALDNSKSEDGGQKVSEDIALNALREAFEA 540
D E KV Q++SK D E++A L+ + E + E+IAL AL EAFE
Sbjct: 481 DGNEEKVSQESSKPGDASEETNEMEAEQKTPKLETAIEERCKDEADENIALKALTEAFED 540
Query: 541 IGYVLTSDQHPLSFSGVGNPVMALAAFLARLVGSDVAGASAHFSLKSISQKSPSLDLATR 600
+G+ T + SF+ +GNPVM LAAFL RL GSDVA ASA S+KS+ S L LATR
Sbjct: 541 VGHSSTPEA-SFSFADLGNPVMGLAAFLVRLAGSDVATASARASIKSLHSNSGML-LATR 600
Query: 601 HCFILEDPLDDKAQANSERVVNVEAQQNVNEQCEKQRKDNSTSVLDDRALSTTNIDYKNG 660
HC+ILEDP D+K + + +A+ N DNS
Sbjct: 601 HCYILEDPPDNKKDPTKSKSCSADAEGN---------DDNS------------------- 660
Query: 661 ESETEETTMENRNSSDATKEHDPMVNHGSDGTNKLKELTEPEAQQNVNEQCEKQRKDNST 720
H D +PE EK +K
Sbjct: 661 --------------------------HKDD---------QPE---------EKSKKAEEV 720
Query: 721 SV-LDDRALSTTNIDYKNGESVTEETTMENRNSSDATKEHDPMVNHGSDGTNKLKELTEP 780
S+ DDR + T+ + +SV+EE +R + T KL + E
Sbjct: 721 SLNSDDREMPDTDTGKETQDSVSEEKQPGSRT---------------ENSTTKLDAVQEK 780
Query: 781 EVPKDDRTGIVKETENMESKLTTNTFEKLGEETSFEKPSQSTLLSKDIHISDLQYAEKTE 840
K +TT+ EK PSQ K++ L+ K
Sbjct: 781 RSSK---------------PVTTDNSEK---PVDIICPSQDKCSGKELQ-EPLKDGNKLS 840
Query: 841 IQRQDPSPSVNTSKIDDVPNPLPSVNELQPLFAATSVKVASSDVAMVSDPRDKSEPAQT- 900
+ +D S S + D P AS DV M + + +P
Sbjct: 841 SENKDASQSTVSQSAADASQP-----------------EASRDVEMKDTLQSEKDPEDVV 900
Query: 901 ----ETSKSLVDQGASKVSDSLPTEENATPQPVKQNPVLDKGT-DDNQSKNNEEENSKCT 960
E + ++GA+ V + +++ + QP+ + GT N + ++E C
Sbjct: 901 KTVGEKVQLAKEEGANDVLST--PDKSVSQQPIGSASAPENGTAGGNPNIEGKKEKDICE 954
Query: 961 SKKEE-KVDKLKRAAVTTLAAAAVKAKVLANQEEDQIRQLAMILIEKQLHKLESKLAFFN 1020
K++ ++KLKRAA++ ++AAAVKAK LA QEEDQIRQL+ LIEKQLHKLE+KL+ FN
Sbjct: 961 GTKDKYNIEKLKRAAISAISAAAVKAKNLAKQEEDQIRQLSGSLIEKQLHKLEAKLSIFN 954
Query: 1021 DMDNVSMRVREQLDRSKQRLFQERAQIIAARLGLPASASRGVAPVLPGNRMARNFPNSVP 1067
+ ++++MRVREQL+RS+QRL+ ERAQIIAARLG+P S S + LP NR+A NF N
Sbjct: 1021 EAESLTMRVREQLERSRQRLYHERAQIIAARLGVPPSMSSKAS--LPTNRIAANFANVAQ 954
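The percentages in each HSP summary line appear to be the reported count divided by the alignment length, rounded to two decimals. The sketch below reproduces that arithmetic for the counts given in this report; the function name `percent` is illustrative and is not part of BLAST.

```python
def percent(matches: int, alignment_length: int) -> float:
    """Return matches as a percentage of the alignment length, rounded to 2 decimals."""
    return round(100.0 * matches / alignment_length, 2)


# Counts taken from the HSP summary line for Q8VY05 above.
print(percent(474, 1096))  # identity
print(percent(619, 1096))  # positives
```

The same function can be applied to the counts of any other HSP in this report.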
BLAST of CmaCh11G011040 vs. ExPASy Swiss-Prot
Match:
Q9XI07 (SWI/SNF complex subunit SWI3C OS=Arabidopsis thaliana OX=3702 GN=SWI3C PE=1 SV=1)
HSP 1 Score: 171.4 bits (433), Expect = 5.6e-41
Identity = 150/502 (29.88%), Positives = 221/502 (44.02%), Query Frame = 0
Query: 88 AAEGISAAGSVKVEGS------FLHSEVQRGDTLVAAAEELNKASRLAN---LEASFVAD 147
AAE G + +GS ++Q T+ A + +L ++ ++ + + D
Sbjct: 104 AAERAGLIGETRGQGSLPALENISFGQLQALSTVPADSLDLERSDGSSSAYVISPPPIMD 163
Query: 148 FESIKSR-GMNAHVVPNHCGWFSWTKVHPIEERSMPSFFSGNSGTRSPDIYIEIRNWIMK 207
E + R G HV+P H WF+ V +E + +P FFSG S +P+ Y+E RN I+
Sbjct: 164 GEGVVKRFGDLVHVLPMHSDWFAPNTVDRLERQVVPQFFSGKSPNHTPESYMEFRNAIVS 223
Query: 208 KFHANPSTQIESKDVSEMEIG-ELEARQEVMEFLDHWGLINFHPFLSADSTSTSDVDDEN 267
K+ NP + D + G ++E V FLDHWG+IN+ + DV D
Sbjct: 224 KYVENPEKTLTISDCQGLVDGVDIEDFARVFRFLDHWGIINYCATAQSHPGPLRDVSDV- 283
Query: 268 QKDSLVEKLFHFETLESSPSVV--PKTN-------VTTAPPRLLRESAISEEMVRPEGPS 327
++D+ E L S S++ K N V ++ P L +S + +R
Sbjct: 284 REDTNGEVNVPSAALTSIDSLIKFDKPNCRHKGGEVYSSLPSLDGDSPDLDIRIREH--L 343
Query: 328 VEYHCNSCSGDCSRKRYHCQKQADFDLCSECFNNGKFDSDMSSSDFILMESAEVPG-ASG 387
+ HCN CS + QK+ D LC +CF++G+F S DF+ ++ + G G
Sbjct: 344 CDSHCNHCSRPLPTVYFQSQKKGDILLCCDCFHHGRFVVGHSCLDFVRVDPMKFYGDQDG 403
Query: 388 GKWTDQETLLLLEALELYKENWNEIAEHVATKTKAQCILHFIQMPIEDSFLESENNDEVG 447
WTDQETLLLLEA+ELY ENW +IA+HV +K+KAQCILHF+++P+ED L+ N EV
Sbjct: 404 DNWTDQETLLLLEAVELYNENWVQIADHVGSKSKAQCILHFLRLPVEDGLLD---NVEVS 463
Query: 448 AKETVVPPSNENDSSVPMDITESMDNKTTRKEASNVENASKEDTVEVKVGQDNSKSKDVE 507
P+N D+K T + + + D E
Sbjct: 464 GVTNTENPTN------------GYDHKGTDSNGD--------------LPGYSEQGSDTE 523
Query: 508 VKAALDNSKSEDGGQKVSEDIALNALREAFEAIGYVLTSDQHPLSFSGVGNPVMALAAFL 567
+K L F NPVMAL AFL
Sbjct: 524 IK-----------------------------------------LPFVKSPNPVMALVAFL 532
Query: 568 ARLVGSDVAGASAHFSLKSISQ 569
A VG VA + AH SL +S+
Sbjct: 584 ASAVGPRVAASCAHESLSVLSE 532
HSP 2 Score: 51.6 bits (122), Expect = 6.4e-05
Identity = 37/124 (29.84%), Positives = 69/124 (55.65%), Query Frame = 0
Query: 906 KQNPVLDKGTDDNQSKNNEEENSKCTSKKEEKVDKLKRAAVTTLAAAAVKAKVLANQEED 965
K+ +LD + + ++ DK+ A L+AAA KAK+ A+ EE
Sbjct: 544 KEASLLDGENQQQDGAHKTSSQNGAEAQTPLPQDKVMAAFRAGLSAAATKAKLFADHEER 603
Query: 966 QIRQLAMILIEKQLHKLESKLAFFNDMDNVSMRVREQLDRSKQRLFQERAQIIAARLGLP 1025
+I++L+ ++ QL ++E KL F +++ + M+ EQ+++++QR ERA++++AR G P
Sbjct: 604 EIQRLSANIVNHQLKRMELKLKQFAEIETLLMKECEQVEKTRQRFSAERARMLSARFGSP 663
Query: 1026 ASAS 1030
S
Sbjct: 664 GGIS 667
BLAST of CmaCh11G011040 vs. ExPASy Swiss-Prot
Match:
O14470 (SWI/SNF and RSC complexes subunit ssr2 OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=ssr2 PE=1 SV=3)
HSP 1 Score: 166.0 bits (419), Expect = 2.3e-39
Identity = 109/368 (29.62%), Positives = 180/368 (48.91%), Query Frame = 0
Query: 151 VVPNHCGWFSWTKVHPIEERSMPSFFSGNSGTRSPDIYIEIRNWIMKKFHANPSTQIESK 210
+VP++ GWF +K+H IE RS P FF+G S ++P IY + R++++ + P+ +
Sbjct: 19 IVPSYAGWFDMSKIHDIERRSNPEFFNGKSPLKTPSIYKDYRDFMINSYRLEPNEYLTVT 78
Query: 211 DVSEMEIGELEARQEVMEFLDHWGLINFHPFLSADSTSTSDVDDENQKDSLVEKLF-HFE 270
+G++ A V FL+ WGLIN+ +D E + + + H +
Sbjct: 79 ACRRNLVGDVCAIIRVHAFLEQWGLINY------------QIDPETRPAFRLPPISGHVQ 138
Query: 271 TLESSPSVVPKTNVTTAPPRLLRESAISEEMVRPEG------------------------ 330
+ ++P V + PP + S+ S+E V+ E
Sbjct: 139 AISNTPIVTQEMLAQHPPPSTVGGSS-SQEFVKLEEKHYSPSLNAMEQTSPKEEDEKSDK 198
Query: 331 -PSVEYHCNSCSGDCSRKRYHCQKQADFDLCSECFNNGKFDSDMSSSDFILMESAEVPGA 390
P V+ C +C +CS+ YH K +D+C C+ G+F S +SSDF+ M++ +
Sbjct: 199 VPRVDKVCFTCGVNCSQTWYHNLKNKKYDICPNCYKQGRFSSSFNSSDFLCMDAIDFNHD 258
Query: 391 SGGKWTDQETLLLLEALELYKENWNEIAEHVATKTKAQCILHFIQMPIEDSFLESENNDE 450
W++QETLLLLEA+E Y ++WN+IA HV ++TK QC++HF+Q+PIED + + D
Sbjct: 259 EEKPWSNQETLLLLEAIETYGDDWNQIALHVGSRTKEQCLIHFLQIPIEDPYRQKLQGDF 318
Query: 451 VGAKETVVPPSNENDSSVPMDITESMDNKTTRKEASNVENASKEDTVEVKVGQDNSKSKD 493
K+ + P +EN++ V +T AS V+ KE V Q + +
Sbjct: 319 SPFKKGFL-PFDENENPVLSTLTYL---------ASIVQQGMKERKQNESVKQGETSFGN 363
BLAST of CmaCh11G011040 vs. ExPASy Swiss-Prot
Match:
Q53KK6 (SWI/SNF complex subunit SWI3C homolog OS=Oryza sativa subsp. japonica OX=39947 GN=SWI3C PE=1 SV=1)
HSP 1 Score: 156.4 bits (394), Expect = 1.8e-36
Identity = 95/287 (33.10%), Positives = 153/287 (53.31%), Query Frame = 0
Query: 150 HVVPNHCGWFSWTKVHPIEERSMPSFFSGNSGTRSPDIYIEIRNWIMKKFHANPSTQIES 209
HVVP H WFS VH +E + +P FFSG S +P+ Y+ +RN ++ K+ NPS ++
Sbjct: 185 HVVPKHSDWFSPGIVHRLERQVVPQFFSGKSPGNTPEKYMLLRNKVIAKYLENPSKRLAF 244
Query: 210 KDVSEM--EIGELEARQEVMEFLDHWGLINFHPFLSADST------STSDVDDENQKD-- 269
+ + EL ++ FLD WG+IN +L++ S +TS + +E +
Sbjct: 245 AECQGLVANTAELYDLSRIVRFLDTWGIIN---YLASGSVHRGLRMATSLLREEPTGELQ 304
Query: 270 ------SLVEKLFHFETLESSPSVVPKTNVTTAPPRLLRESAISEEMVRPEGPSVEYHCN 329
++ L F+ + + +++ + + ++ ++E + E C+
Sbjct: 305 LLTAPLKSIDGLILFDRPKCNLQAEDISSLASNSEVVDFDAGLAELDGKIRERLSESSCS 364
Query: 330 SCSGDCSRKRYHCQKQADFDLCSECFNNGKFDSDMSSSDFILMESAEVPGAS-GGKWTDQ 389
C + Y K+AD LCS+CF++ ++ + SS DF ++ + G WTDQ
Sbjct: 365 YCLQPLTSLHYQSLKEADIALCSDCFHDARYITGHSSLDFQRIDGDNDRSENDGDSWTDQ 424
Query: 390 ETLLLLEALELYKENWNEIAEHVATKTKAQCILHFIQMPIEDSFLES 420
ETLLLLE +E Y +NWN IAEHV TK+KAQCI HFI++P+ED LE+
Sbjct: 425 ETLLLLEGIEKYNDNWNNIAEHVGTKSKAQCIYHFIRLPVEDGLLEN 468
HSP 2 Score: 65.9 bits (159), Expect = 3.3e-09
Identity = 48/131 (36.64%), Positives = 80/131 (61.07%), Query Frame = 0
Query: 939 DKLKRAAVTTLAAAAVKAKVLANQEEDQIRQLAMILIEKQLHKLESKLAFFNDMDNVSMR 998
+K+K AA+ L+AAA KAK+ A+QEE +I++L +I QL +LE KL F +++ + ++
Sbjct: 584 EKVKHAAMCGLSAAATKAKLFADQEEREIQRLTATVINHQLKRLELKLKQFAEVETLLLK 643
Query: 999 VREQLDRSKQRLFQERAQIIAARLGLPASASRGVAPVLPG---NRMARNFPNSVPKPPMG 1058
EQ++R +QR+ +R +I++ RL P ++ LPG + M+ N P S+ PMG
Sbjct: 644 ECEQVERIRQRIASDRVRIVSTRLASPGNS-------LPGGSTSTMSSN-PMSMSPRPMG 703
Query: 1059 MAPQRPPTSGP 1067
+ P +S P
Sbjct: 704 VPGSMPQSSMP 706
BLAST of CmaCh11G011040 vs. ExPASy Swiss-Prot
Match:
Q6PDG5 (SWI/SNF complex subunit SMARCC2 OS=Mus musculus OX=10090 GN=Smarcc2 PE=1 SV=2)
HSP 1 Score: 141.4 bits (355), Expect = 6.2e-32
Identity = 84/270 (31.11%), Positives = 141/270 (52.22%), Query Frame = 0
Query: 151 VVPNHCGWFSWTKVHPIEERSMPSFFSGNSGTRSPDIYIEIRNWIMKKFHANPSTQIESK 210
++P++ WF + VH IE R++P FF+G + +++P+IY+ RN+++ + NP + S
Sbjct: 425 IIPSYAAWFDYNSVHAIERRALPEFFNGKNKSKTPEIYLAYRNFMIDTYRLNPQEYLTST 484
Query: 211 DVSEMEIGELEARQEVMEFLDHWGLINFHPFLSADSTSTSDVDDENQKDSL-VEKLFHFE 270
G++ A V FL+ WGLIN+ VD E++ + HF
Sbjct: 485 ACRRNLAGDVCAIMRVHAFLEQWGLINY------------QVDAESRPTPMGPPPTSHFH 544
Query: 271 TLESSPSVVPKTNVTTAPPRLLRESAISEEMVR-PEGPSVEYHCNSCSGDCSRKRYHCQK 330
L +PS + P+ ++S+ S++M+ PE +K
Sbjct: 545 VLADTPS-----GLVPLQPKPPQQSSASQQMLNFPEKGK-------------------EK 604
Query: 331 QADFDLCSECFNNGKFDSDMSSSDFILMESAEVPGASGGKWTDQETLLLLEALELYKENW 390
AD N +DM + + +S A+ +WT+QETLLLLEALE+YK++W
Sbjct: 605 PAD-------MQNFGLRTDMYTKKNVPSKSKAAASAT-REWTEQETLLLLEALEMYKDDW 650
Query: 391 NEIAEHVATKTKAQCILHFIQMPIEDSFLE 419
N+++EHV ++T+ +CILHF+++PIED +LE
Sbjct: 665 NKVSEHVGSRTQDECILHFLRLPIEDPYLE 650
HSP 2 Score: 45.8 bits (107), Expect = 3.5e-03
Identity = 83/298 (27.85%), Positives = 123/298 (41.28%), Query Frame = 0
Query: 788 EETSFEKPSQSTLLSKDIHISDLQYAEKTEIQRQDPSPSVNTSKIDDVPNPLPSVNELQP 847
EE S K T L + H+ ++ A K + DP+ + +S I + P E
Sbjct: 698 EEFSKMKEEVPTAL-VEAHVRKVEEAAKV-TGKADPAFGLESSGIAGTASDEPERIEESG 757
Query: 848 LFAATSVKVASSDVAMVSDPRDKSEPAQTETSKSL-----VDQGASKVSDSLPTEENATP 907
A A+ + +PR+ + E + + D+ K DS E +
Sbjct: 758 TEEARPEGQAADEKKEPKEPREGGGAVEEEAKEEISEVPKKDEEKGKEGDSEKESEKSDG 817
Query: 908 QPVKQNPVLDKGTDDNQS---KNNEEENSKCTSKKEEKVDK--LKRAAVTTLAAAAVKAK 967
P+ +P DK + Q K E + +K E + + L AA LAAAAVKAK
Sbjct: 818 DPI-VDPEKDKEPTEGQEEVLKEVAEPEGERKTKVERDIGEGNLSTAAAAALAAAAVKAK 877
Query: 968 VLANQEEDQIRQLAMILIEKQLHKLESKLAFFNDMDNVSMRVREQLDRSKQRLFQERAQI 1027
LA EE +I+ L +L+E Q+ KLE KL F +++ + R RE L+ +Q+L +R
Sbjct: 878 HLAAVEERKIKSLVALLVETQMKKLEIKLRHFEELETIMDREREALEYQRQQLLADRQAF 937
Query: 1028 IAARLGLPASASR------------GVAPVLPGNRMARNFPNSVPKPPMGMAPQRPPT 1064
+L +R P LP P S P PP G A PPT
Sbjct: 938 HMEQLKYAEMRARQQHFQQMHQQQQQQPPTLP--------PGSQPIPPTGAA--GPPT 982
BLAST of CmaCh11G011040 vs. TAIR 10
Match:
AT4G34430.1 (DNA-binding family protein)
HSP 1 Score: 655.6 bits (1690), Expect = 6.9e-188
Identity = 474/1096 (43.25%), Positives = 619/1096 (56.48%), Query Frame = 0
Query: 1 MEEKRRD-AGNL--PANTTDSPSSEP-PSSRRRAGAQKRKVSALGGSN-SSSAPSKR-VT 60
MEEKRRD AG L ++ DSP+SEP P+ RRR G KRK +ALGGSN SSAPSKR +T
Sbjct: 1 MEEKRRDSAGTLAFAGSSGDSPASEPMPAPRRRGGGLKRKANALGGSNFFSSAPSKRMLT 60
Query: 61 RDKFALL-HPPNHNGPFTRARLGPNNGAGTASGNAAEGISAAGSVKVEGSFLHSEVQRGD 120
R+K L P HNGP TRAR P+ A G +E ++ A V +G E
Sbjct: 61 REKAMLASFSPVHNGPLTRARQAPSIMPSAADGVKSEVLNVA--VGADGEKPKEE----- 120
Query: 121 TLVAAAEELNKASR-LANLEASFVADFESIKSRGMNAHVVPNHCGWFSWTKVHPIEERSM 180
EE NKA R LEA ADFE+I+SR N HVVPNHCGWFSW K+HP+EERS+
Sbjct: 121 ------EERNKAIREWEALEAKIEADFEAIRSRDSNVHVVPNHCGWFSWEKIHPLEERSL 180
Query: 181 PSFFSGNSGTRSPDIYIEIRNWIMKKFHANPSTQIESKDVSEMEIGELEARQEVMEFLDH 240
PSFF+G R+ ++Y EIRNWIM KFH+NP+ QIE KD++E+E+G+ EA+QEVMEFLD+
Sbjct: 181 PSFFNGKLEGRTSEVYREIRNWIMGKFHSNPNIQIELKDLTELEVGDSEAKQEVMEFLDY 240
Query: 241 WGLINFHPFLSADSTST-SDVDDENQKDSLVEKLFHFETLESSPSVV--PKTNVTTAPPR 300
WGLINFHPF D+ ST SD DD K+SL+ L+ F+ E+ P +V P+ P
Sbjct: 241 WGLINFHPFPPTDTGSTASDHDDLGDKESLLNSLYRFQVDEACPPLVHKPRFTAQATPSG 300
Query: 301 LLRESAISEEMVRPEGPSVEYHCNSCSGDCSRKRYHCQKQADFDLCSECFNNGKFDSDMS 360
L + ++E+++ EGP+VEYHCNSCS DCSRKRYHC KQADFDLC+ECFN+GKF SDMS
Sbjct: 301 LFPDPMAADELLKQEGPAVEYHCNSCSADCSRKRYHCPKQADFDLCTECFNSGKFSSDMS 360
Query: 361 SSDFILMESAEVPGASGGKWTDQETLLLLEALELYKENWNEIAEHVATKTKAQCILHFIQ 420
SSDFILME AE PG GKWTDQETLLLLEALE++KENWNEIAEHVATKTKAQC+LHF+Q
Sbjct: 361 SSDFILMEPAEAPGVGSGKWTDQETLLLLEALEIFKENWNEIAEHVATKTKAQCMLHFLQ 420
Query: 421 MPIEDSFLESENNDEVGAKETVVPPSNENDSSVPMDITESMDNKTTRKEASNVENASK-E 480
MPIED+FL+ + + +K+T +++D+SV D E +NK E ++ + E
Sbjct: 421 MPIEDAFLDQIDYKDPISKDTTDLAVSKDDNSVLKDAPEEAENKKRVDEDETMKEVPEPE 480
Query: 481 DTVEVKVGQDNSKSKDV-----EVKA-----ALDNSKSEDGGQKVSEDIALNALREAFEA 540
D E KV Q++SK D E++A L+ + E + E+IAL AL EAFE
Sbjct: 481 DGNEEKVSQESSKPGDASEETNEMEAEQKTPKLETAIEERCKDEADENIALKALTEAFED 540
Query: 541 IGYVLTSDQHPLSFSGVGNPVMALAAFLARLVGSDVAGASAHFSLKSISQKSPSLDLATR 600
+G+ T + SF+ +GNPVM LAAFL RL GSDVA ASA S+KS+ S L LATR
Sbjct: 541 VGHSSTPEA-SFSFADLGNPVMGLAAFLVRLAGSDVATASARASIKSLHSNSGML-LATR 600
Query: 601 HCFILEDPLDDKAQANSERVVNVEAQQNVNEQCEKQRKDNSTSVLDDRALSTTNIDYKNG 660
HC+ILEDP D+K + + +A+ N DNS
Sbjct: 601 HCYILEDPPDNKKDPTKSKSCSADAEGN---------DDNS------------------- 660
Query: 661 ESETEETTMENRNSSDATKEHDPMVNHGSDGTNKLKELTEPEAQQNVNEQCEKQRKDNST 720
H D +PE EK +K
Sbjct: 661 --------------------------HKDD---------QPE---------EKSKKAEEV 720
Query: 721 SV-LDDRALSTTNIDYKNGESVTEETTMENRNSSDATKEHDPMVNHGSDGTNKLKELTEP 780
S+ DDR + T+ + +SV+EE +R + T KL + E
Sbjct: 721 SLNSDDREMPDTDTGKETQDSVSEEKQPGSRT---------------ENSTTKLDAVQEK 780
Query: 781 EVPKDDRTGIVKETENMESKLTTNTFEKLGEETSFEKPSQSTLLSKDIHISDLQYAEKTE 840
K +TT+ EK PSQ K++ L+ K
Sbjct: 781 RSSK---------------PVTTDNSEK---PVDIICPSQDKCSGKELQ-EPLKDGNKLS 840
Query: 841 IQRQDPSPSVNTSKIDDVPNPLPSVNELQPLFAATSVKVASSDVAMVSDPRDKSEPAQT- 900
+ +D S S + D P AS DV M + + +P
Sbjct: 841 SENKDASQSTVSQSAADASQP-----------------EASRDVEMKDTLQSEKDPEDVV 900
Query: 901 ----ETSKSLVDQGASKVSDSLPTEENATPQPVKQNPVLDKGT-DDNQSKNNEEENSKCT 960
E + ++GA+ V + +++ + QP+ + GT N + ++E C
Sbjct: 901 KTVGEKVQLAKEEGANDVLST--PDKSVSQQPIGSASAPENGTAGGNPNIEGKKEKDICE 954
Query: 961 SKKEE-KVDKLKRAAVTTLAAAAVKAKVLANQEEDQIRQLAMILIEKQLHKLESKLAFFN 1020
K++ ++KLKRAA++ ++AAAVKAK LA QEEDQIRQL+ LIEKQLHKLE+KL+ FN
Sbjct: 961 GTKDKYNIEKLKRAAISAISAAAVKAKNLAKQEEDQIRQLSGSLIEKQLHKLEAKLSIFN 954
Query: 1021 DMDNVSMRVREQLDRSKQRLFQERAQIIAARLGLPASASRGVAPVLPGNRMARNFPNSVP 1067
+ ++++MRVREQL+RS+QRL+ ERAQIIAARLG+P S S + LP NR+A NF N
Sbjct: 1021 EAESLTMRVREQLERSRQRLYHERAQIIAARLGVPPSMSSKAS--LPTNRIAANFANVAQ 954
BLAST of CmaCh11G011040 vs. TAIR 10
Match:
AT4G34430.2 (DNA-binding family protein )
HSP 1 Score: 655.6 bits (1690), Expect = 6.9e-188
Identity = 474/1096 (43.25%), Positives = 619/1096 (56.48%), Query Frame = 0
Query: 1 MEEKRRD-AGNL--PANTTDSPSSEP-PSSRRRAGAQKRKVSALGGSN-SSSAPSKR-VT 60
MEEKRRD AG L ++ DSP+SEP P+ RRR G KRK +ALGGSN SSAPSKR +T
Sbjct: 1 MEEKRRDSAGTLAFAGSSGDSPASEPMPAPRRRGGGLKRKANALGGSNFFSSAPSKRMLT 60
Query: 61 RDKFALL-HPPNHNGPFTRARLGPNNGAGTASGNAAEGISAAGSVKVEGSFLHSEVQRGD 120
R+K L P HNGP TRAR P+ A G +E ++ A V +G E
Sbjct: 61 REKAMLASFSPVHNGPLTRARQAPSIMPSAADGVKSEVLNVA--VGADGEKPKEE----- 120
Query: 121 TLVAAAEELNKASR-LANLEASFVADFESIKSRGMNAHVVPNHCGWFSWTKVHPIEERSM 180
EE NKA R LEA ADFE+I+SR N HVVPNHCGWFSW K+HP+EERS+
Sbjct: 121 ------EERNKAIREWEALEAKIEADFEAIRSRDSNVHVVPNHCGWFSWEKIHPLEERSL 180
Query: 181 PSFFSGNSGTRSPDIYIEIRNWIMKKFHANPSTQIESKDVSEMEIGELEARQEVMEFLDH 240
PSFF+G R+ ++Y EIRNWIM KFH+NP+ QIE KD++E+E+G+ EA+QEVMEFLD+
Sbjct: 181 PSFFNGKLEGRTSEVYREIRNWIMGKFHSNPNIQIELKDLTELEVGDSEAKQEVMEFLDY 240
Query: 241 WGLINFHPFLSADSTST-SDVDDENQKDSLVEKLFHFETLESSPSVV--PKTNVTTAPPR 300
WGLINFHPF D+ ST SD DD K+SL+ L+ F+ E+ P +V P+ P
Sbjct: 241 WGLINFHPFPPTDTGSTASDHDDLGDKESLLNSLYRFQVDEACPPLVHKPRFTAQATPSG 300
Query: 301 LLRESAISEEMVRPEGPSVEYHCNSCSGDCSRKRYHCQKQADFDLCSECFNNGKFDSDMS 360
L + ++E+++ EGP+VEYHCNSCS DCSRKRYHC KQADFDLC+ECFN+GKF SDMS
Sbjct: 301 LFPDPMAADELLKQEGPAVEYHCNSCSADCSRKRYHCPKQADFDLCTECFNSGKFSSDMS 360
Query: 361 SSDFILMESAEVPGASGGKWTDQETLLLLEALELYKENWNEIAEHVATKTKAQCILHFIQ 420
SSDFILME AE PG GKWTDQETLLLLEALE++KENWNEIAEHVATKTKAQC+LHF+Q
Sbjct: 361 SSDFILMEPAEAPGVGSGKWTDQETLLLLEALEIFKENWNEIAEHVATKTKAQCMLHFLQ 420
Query: 421 MPIEDSFLESENNDEVGAKETVVPPSNENDSSVPMDITESMDNKTTRKEASNVENASK-E 480
MPIED+FL+ + + +K+T +++D+SV D E +NK E ++ + E
Sbjct: 421 MPIEDAFLDQIDYKDPISKDTTDLAVSKDDNSVLKDAPEEAENKKRVDEDETMKEVPEPE 480
Query: 481 DTVEVKVGQDNSKSKDV-----EVKA-----ALDNSKSEDGGQKVSEDIALNALREAFEA 540
D E KV Q++SK D E++A L+ + E + E+IAL AL EAFE
Sbjct: 481 DGNEEKVSQESSKPGDASEETNEMEAEQKTPKLETAIEERCKDEADENIALKALTEAFED 540
Query: 541 IGYVLTSDQHPLSFSGVGNPVMALAAFLARLVGSDVAGASAHFSLKSISQKSPSLDLATR 600
+G+ T + SF+ +GNPVM LAAFL RL GSDVA ASA S+KS+ S L LATR
Sbjct: 541 VGHSSTPEA-SFSFADLGNPVMGLAAFLVRLAGSDVATASARASIKSLHSNSGML-LATR 600
Query: 601 HCFILEDPLDDKAQANSERVVNVEAQQNVNEQCEKQRKDNSTSVLDDRALSTTNIDYKNG 660
HC+ILEDP D+K + + +A+ N DNS
Sbjct: 601 HCYILEDPPDNKKDPTKSKSCSADAEGN---------DDNS------------------- 660
Query: 661 ESETEETTMENRNSSDATKEHDPMVNHGSDGTNKLKELTEPEAQQNVNEQCEKQRKDNST 720
H D +PE EK +K
Sbjct: 661 --------------------------HKDD---------QPE---------EKSKKAEEV 720
Query: 721 SV-LDDRALSTTNIDYKNGESVTEETTMENRNSSDATKEHDPMVNHGSDGTNKLKELTEP 780
S+ DDR + T+ + +SV+EE +R + T KL + E
Sbjct: 721 SLNSDDREMPDTDTGKETQDSVSEEKQPGSRT---------------ENSTTKLDAVQEK 780
Query: 781 EVPKDDRTGIVKETENMESKLTTNTFEKLGEETSFEKPSQSTLLSKDIHISDLQYAEKTE 840
K +TT+ EK PSQ K++ L+ K
Sbjct: 781 RSSK---------------PVTTDNSEK---PVDIICPSQDKCSGKELQ-EPLKDGNKLS 840
Query: 841 IQRQDPSPSVNTSKIDDVPNPLPSVNELQPLFAATSVKVASSDVAMVSDPRDKSEPAQT- 900
+ +D S S + D P AS DV M + + +P
Sbjct: 841 SENKDASQSTVSQSAADASQP-----------------EASRDVEMKDTLQSEKDPEDVV 900
Query: 901 ----ETSKSLVDQGASKVSDSLPTEENATPQPVKQNPVLDKGT-DDNQSKNNEEENSKCT 960
E + ++GA+ V + +++ + QP+ + GT N + ++E C
Sbjct: 901 KTVGEKVQLAKEEGANDVLST--PDKSVSQQPIGSASAPENGTAGGNPNIEGKKEKDICE 954
Query: 961 SKKEE-KVDKLKRAAVTTLAAAAVKAKVLANQEEDQIRQLAMILIEKQLHKLESKLAFFN 1020
K++ ++KLKRAA++ ++AAAVKAK LA QEEDQIRQL+ LIEKQLHKLE+KL+ FN
Sbjct: 961 GTKDKYNIEKLKRAAISAISAAAVKAKNLAKQEEDQIRQLSGSLIEKQLHKLEAKLSIFN 954
Query: 1021 DMDNVSMRVREQLDRSKQRLFQERAQIIAARLGLPASASRGVAPVLPGNRMARNFPNSVP 1067
+ ++++MRVREQL+RS+QRL+ ERAQIIAARLG+P S S + LP NR+A NF N
Sbjct: 1021 EAESLTMRVREQLERSRQRLYHERAQIIAARLGVPPSMSSKAS--LPTNRIAANFANVAQ 954
BLAST of CmaCh11G011040 vs. TAIR 10
Match:
AT4G34430.3 (DNA-binding family protein )
HSP 1 Score: 653.7 bits (1685), Expect = 2.6e-187
Identity = 473/1096 (43.16%), Positives = 616/1096 (56.20%), Query Frame = 0
Query: 1 MEEKRRD-AGNL--PANTTDSPSSEP-PSSRRRAGAQKRKVSALGGSN-SSSAPSKR-VT 60
MEEKRRD AG L ++ DSP+SEP P+ RRR G KRK +ALGGSN SSAPSKR +T
Sbjct: 1 MEEKRRDSAGTLAFAGSSGDSPASEPMPAPRRRGGGLKRKANALGGSNFFSSAPSKRMLT 60
Query: 61 RDKFALL-HPPNHNGPFTRARLGPNNGAGTASGNAAEGISAAGSVKVEGSFLHSEVQRGD 120
R+K L P HNGP TRAR P+ A G +E ++ A V +G E
Sbjct: 61 REKAMLASFSPVHNGPLTRARQAPSIMPSAADGVKSEVLNVA--VGADGEKPKEE----- 120
Query: 121 TLVAAAEELNKASR-LANLEASFVADFESIKSRGMNAHVVPNHCGWFSWTKVHPIEERSM 180
EE NKA R LEA ADFE+I+SR N HVVPNHCGWFSW K+HP+EERS+
Sbjct: 121 ------EERNKAIREWEALEAKIEADFEAIRSRDSNVHVVPNHCGWFSWEKIHPLEERSL 180
Query: 181 PSFFSGNSGTRSPDIYIEIRNWIMKKFHANPSTQIESKDVSEMEIGELEARQEVMEFLDH 240
PSFF+G R+ ++Y EIRNWIM KFH+NP+ QIE KD++E+E+G+ EA+QEVMEFLD+
Sbjct: 181 PSFFNGKLEGRTSEVYREIRNWIMGKFHSNPNIQIELKDLTELEVGDSEAKQEVMEFLDY 240
Query: 241 WGLINFHPFLSADSTST-SDVDDENQKDSLVEKLFHFETLESSPSVV--PKTNVTTAPPR 300
WGLINFHPF D+ ST SD DD K+SL+ L+ F+ E+ P +V P+ P
Sbjct: 241 WGLINFHPFPPTDTGSTASDHDDLGDKESLLNSLYRFQVDEACPPLVHKPRFTAQATPSG 300
Query: 301 LLRESAISEEMVRPEGPSVEYHCNSCSGDCSRKRYHCQKQADFDLCSECFNNGKFDSDMS 360
L + ++E+++ EGP+VEYHCNSCS DCSRKRYHC KQADFDLC+ECFN+GKF SDMS
Sbjct: 301 LFPDPMAADELLKQEGPAVEYHCNSCSADCSRKRYHCPKQADFDLCTECFNSGKFSSDMS 360
Query: 361 SSDFILMESAEVPGASGGKWTDQETLLLLEALELYKENWNEIAEHVATKTKAQCILHFIQ 420
SSDFILME AE PG GKWTDQETLLLLEALE++KENWNEIAEHVATKTKAQC+LHF+Q
Sbjct: 361 SSDFILMEPAEAPGVGSGKWTDQETLLLLEALEIFKENWNEIAEHVATKTKAQCMLHFLQ 420
Query: 421 MPIEDSFLESENNDEVGAKETVVPPSNENDSSVPMDITESMDNKTTRKEASNVENASK-E 480
MPIED+FL+ + + +K+T +++D+SV D E +NK E ++ + E
Sbjct: 421 MPIEDAFLDQIDYKDPISKDTTDLAVSKDDNSVLKDAPEEAENKKRVDEDETMKEVPEPE 480
Query: 481 DTVEVKVGQDNSKSKDV-----EVKA-----ALDNSKSEDGGQKVSEDIALNALREAFEA 540
D E KV Q++SK D E++A L+ + E + E+IAL AL EAFE
Sbjct: 481 DGNEEKVSQESSKPGDASEETNEMEAEQKTPKLETAIEERCKDEADENIALKALTEAFED 540
Query: 541 IGYVLTSDQHPLSFSGVGNPVMALAAFLARLVGSDVAGASAHFSLKSISQKSPSLDLATR 600
+G+ T + SF+ +GNPVM LAAFL RL GSDVA ASA S+KS+ S L LATR
Sbjct: 541 VGHSSTPEA-SFSFADLGNPVMGLAAFLVRLAGSDVATASARASIKSLHSNSGML-LATR 600
Query: 601 HCFILEDPLDDKAQANSERVVNVEAQQNVNEQCEKQRKDNSTSVLDDRALSTTNIDYKNG 660
HC+ILEDP D+K + + E DNS
Sbjct: 601 HCYILEDPPDNKKDPTKSKSADAEGND-----------DNS------------------- 660
Query: 661 ESETEETTMENRNSSDATKEHDPMVNHGSDGTNKLKELTEPEAQQNVNEQCEKQRKDNST 720
H D +PE EK +K
Sbjct: 661 --------------------------HKDD---------QPE---------EKSKKAEEV 720
Query: 721 SV-LDDRALSTTNIDYKNGESVTEETTMENRNSSDATKEHDPMVNHGSDGTNKLKELTEP 780
S+ DDR + T+ + +SV+EE +R + T KL + E
Sbjct: 721 SLNSDDREMPDTDTGKETQDSVSEEKQPGSRT---------------ENSTTKLDAVQEK 780
Query: 781 EVPKDDRTGIVKETENMESKLTTNTFEKLGEETSFEKPSQSTLLSKDIHISDLQYAEKTE 840
K +TT+ EK PSQ K++ L+ K
Sbjct: 781 RSSK---------------PVTTDNSEK---PVDIICPSQDKCSGKELQ-EPLKDGNKLS 840
Query: 841 IQRQDPSPSVNTSKIDDVPNPLPSVNELQPLFAATSVKVASSDVAMVSDPRDKSEPAQT- 900
+ +D S S + D P AS DV M + + +P
Sbjct: 841 SENKDASQSTVSQSAADASQP-----------------EASRDVEMKDTLQSEKDPEDVV 900
Query: 901 ----ETSKSLVDQGASKVSDSLPTEENATPQPVKQNPVLDKGT-DDNQSKNNEEENSKCT 960
E + ++GA+ V + +++ + QP+ + GT N + ++E C
Sbjct: 901 KTVGEKVQLAKEEGANDVLST--PDKSVSQQPIGSASAPENGTAGGNPNIEGKKEKDICE 952
Query: 961 SKKEE-KVDKLKRAAVTTLAAAAVKAKVLANQEEDQIRQLAMILIEKQLHKLESKLAFFN 1020
K++ ++KLKRAA++ ++AAAVKAK LA QEEDQIRQL+ LIEKQLHKLE+KL+ FN
Sbjct: 961 GTKDKYNIEKLKRAAISAISAAAVKAKNLAKQEEDQIRQLSGSLIEKQLHKLEAKLSIFN 952
Query: 1021 DMDNVSMRVREQLDRSKQRLFQERAQIIAARLGLPASASRGVAPVLPGNRMARNFPNSVP 1067
+ ++++MRVREQL+RS+QRL+ ERAQIIAARLG+P S S + LP NR+A NF N
Sbjct: 1021 EAESLTMRVREQLERSRQRLYHERAQIIAARLGVPPSMSSKAS--LPTNRIAANFANVAQ 952
BLAST of CmaCh11G011040 vs. TAIR 10
Match:
AT4G34430.4 (DNA-binding family protein )
HSP 1 Score: 651.0 bits (1678), Expect = 1.7e-186
Identity = 474/1097 (43.21%), Positives = 619/1097 (56.43%), Query Frame = 0
Query: 1 MEEKRRD-AGNL--PANTTDSPSSEP-PSSRRRAGAQKRKVSALGGSN-SSSAPSKR-VT 60
MEEKRRD AG L ++ DSP+SEP P+ RRR G KRK +ALGGSN SSAPSKR +T
Sbjct: 1 MEEKRRDSAGTLAFAGSSGDSPASEPMPAPRRRGGGLKRKANALGGSNFFSSAPSKRMLT 60
Query: 61 RDKFALL-HPPNHNGPFTRARLGPNNGAGTASGNAAEGISAAGSVKVEGSFLHSEVQRGD 120
R+K L P HNGP TRAR P+ A G +E ++ A V +G E
Sbjct: 61 REKAMLASFSPVHNGPLTRARQAPSIMPSAADGVKSEVLNVA--VGADGEKPKEE----- 120
Query: 121 TLVAAAEELNKASR-LANLEASFVADFESIKSRGMNAHVVPNHCGWFSWTKVHPIEERSM 180
EE NKA R LEA ADFE+I+SR N HVVPNHCGWFSW K+HP+EERS+
Sbjct: 121 ------EERNKAIREWEALEAKIEADFEAIRSRDSNVHVVPNHCGWFSWEKIHPLEERSL 180
Query: 181 PSFFSGNSGTRSPDIYIEIRNWIMKKFHANPSTQIESKDVSEMEIGELEARQEVMEFLDH 240
PSFF+G R+ ++Y EIRNWIM KFH+NP+ QIE KD++E+E+G+ EA+QEVMEFLD+
Sbjct: 181 PSFFNGKLEGRTSEVYREIRNWIMGKFHSNPNIQIELKDLTELEVGDSEAKQEVMEFLDY 240
Query: 241 WGLINFHPFLSADSTST-SDVDDENQKDSLVEKLFHFETLESSPSVV--PKTNVTTAPPR 300
WGLINFHPF D+ ST SD DD K+SL+ L+ F+ E+ P +V P+ P
Sbjct: 241 WGLINFHPFPPTDTGSTASDHDDLGDKESLLNSLYRFQVDEACPPLVHKPRFTAQATPSG 300
Query: 301 LLRESAISEEMVRPEGPSVEYHCNSCSGDCSRKRYHCQKQADFDLCSECFNNGKFDSDMS 360
L + ++E+++ EGP+VEYHCNSCS DCSRKRYHC KQADFDLC+ECFN+GKF SDMS
Sbjct: 301 LFPDPMAADELLKQEGPAVEYHCNSCSADCSRKRYHCPKQADFDLCTECFNSGKFSSDMS 360
Query: 361 SSDFILMESAEVPGASGGKWTDQETLLLLEALELYKENWNEIAEHVATKTKAQCILHFIQ 420
SSDFILME AE PG GKWTDQETLLLLEALE++KENWNEIAEHVATKTKAQC+LHF+Q
Sbjct: 361 SSDFILMEPAEAPGVGSGKWTDQETLLLLEALEIFKENWNEIAEHVATKTKAQCMLHFLQ 420
Query: 421 MPIEDSFLESENNDEVGAKETVVPPSNENDSSVPMDITESMDNKTTRKEASNVENASK-E 480
MPIED+FL+ + + +K+T +++D+SV D E +NK E ++ + E
Sbjct: 421 MPIEDAFLDQIDYKDPISKDTTDLAVSKDDNSVLKDAPEEAENKKRVDEDETMKEVPEPE 480
Query: 481 DTVEVKVGQDNSKSKDV-----EVKA-----ALDNSKSEDGGQKVSEDIALNALREAFEA 540
D E KV Q++SK D E++A L+ + E + E+IAL AL EAFE
Sbjct: 481 DGNEEKVSQESSKPGDASEETNEMEAEQKTPKLETAIEERCKDEADENIALKALTEAFED 540
Query: 541 IGYVLTSDQHPLSFSGVGNPVMALAAFLARLVGSDVAGASAHFSLKSISQKSPSLDLATR 600
+G+ T + SF+ +GNPVM LAAFL RL GSDVA ASA S+KS+ S L LATR
Sbjct: 541 VGHSSTPEA-SFSFADLGNPVMGLAAFLVRLAGSDVATASARASIKSLHSNSGML-LATR 600
Query: 601 HCFILEDPLDDKAQANSERVVNVEAQQNVNEQCEKQRKDNSTSVLDDRALSTTNIDYKNG 660
HC+ILEDP D+K + + +A+ N DNS
Sbjct: 601 HCYILEDPPDNKKDPTKSKSCSADAEGN---------DDNS------------------- 660
Query: 661 ESETEETTMENRNSSDATKEHDPMVNHGSDGTNKLKELTEPEAQQNVNEQCEKQRKDNST 720
H D +PE EK +K
Sbjct: 661 --------------------------HKDD---------QPE---------EKSKKAEEV 720
Query: 721 SV-LDDRALSTTNIDYKNGESVTEETTMENRNSSDATKEHDPMVNHGSDGTNKLKELTEP 780
S+ DDR + T+ + +SV+EE +R + T KL + E
Sbjct: 721 SLNSDDREMPDTDTGKETQDSVSEEKQPGSRT---------------ENSTTKLDAVQEK 780
Query: 781 EVPKDDRTGIVKETENMESKLTTNTFEKLGEETSFEKPSQSTLLSKDIHISDLQYAEKTE 840
K +TT+ EK PSQ K++ L+ K
Sbjct: 781 RSSK---------------PVTTDNSEK---PVDIICPSQDKCSGKELQ-EPLKDGNKLS 840
Query: 841 IQRQDPSPSVNTSKIDDVPNPLPSVNELQPLFAATSVKVASSDVAMVSDPRDKSEPAQT- 900
+ +D S S + D P AS DV M + + +P
Sbjct: 841 SENKDASQSTVSQSAADASQP-----------------EASRDVEMKDTLQSEKDPEDVV 900
Query: 901 ----ETSKSLVDQGASKVSDSLPTEENATPQPVKQNPVLDKGT-DDNQSKNNEEENSKCT 960
E + ++GA+ V + +++ + QP+ + GT N + ++E C
Sbjct: 901 KTVGEKVQLAKEEGANDVLST--PDKSVSQQPIGSASAPENGTAGGNPNIEGKKEKDICE 955
Query: 961 SKKEE-KVDKLKRAAVTTLAAAAVKAKVLANQEEDQIRQLAMILIEK-QLHKLESKLAFF 1020
K++ ++KLKRAA++ ++AAAVKAK LA QEEDQIRQL+ LIEK QLHKLE+KL+ F
Sbjct: 961 GTKDKYNIEKLKRAAISAISAAAVKAKNLAKQEEDQIRQLSGSLIEKQQLHKLEAKLSIF 955
Query: 1021 NDMDNVSMRVREQLDRSKQRLFQERAQIIAARLGLPASASRGVAPVLPGNRMARNFPNSV 1067
N+ ++++MRVREQL+RS+QRL+ ERAQIIAARLG+P S S + LP NR+A NF N
Sbjct: 1021 NEAESLTMRVREQLERSRQRLYHERAQIIAARLGVPPSMSSKAS--LPTNRIAANFANVA 955
BLAST of CmaCh11G011040 vs. TAIR 10
Match:
AT1G21700.1 (SWITCH/sucrose nonfermenting 3C )
HSP 1 Score: 171.4 bits (433), Expect = 3.9e-42
Identity = 150/502 (29.88%), Positives = 221/502 (44.02%), Query Frame = 0
Query: 88 AAEGISAAGSVKVEGS------FLHSEVQRGDTLVAAAEELNKASRLAN---LEASFVAD 147
AAE G + +GS ++Q T+ A + +L ++ ++ + + D
Sbjct: 104 AAERAGLIGETRGQGSLPALENISFGQLQALSTVPADSLDLERSDGSSSAYVISPPPIMD 163
Query: 148 FESIKSR-GMNAHVVPNHCGWFSWTKVHPIEERSMPSFFSGNSGTRSPDIYIEIRNWIMK 207
E + R G HV+P H WF+ V +E + +P FFSG S +P+ Y+E RN I+
Sbjct: 164 GEGVVKRFGDLVHVLPMHSDWFAPNTVDRLERQVVPQFFSGKSPNHTPESYMEFRNAIVS 223
Query: 208 KFHANPSTQIESKDVSEMEIG-ELEARQEVMEFLDHWGLINFHPFLSADSTSTSDVDDEN 267
K+ NP + D + G ++E V FLDHWG+IN+ + DV D
Sbjct: 224 KYVENPEKTLTISDCQGLVDGVDIEDFARVFRFLDHWGIINYCATAQSHPGPLRDVSDV- 283
Query: 268 QKDSLVEKLFHFETLESSPSVV--PKTN-------VTTAPPRLLRESAISEEMVRPEGPS 327
++D+ E L S S++ K N V ++ P L +S + +R
Sbjct: 284 REDTNGEVNVPSAALTSIDSLIKFDKPNCRHKGGEVYSSLPSLDGDSPDLDIRIREH--L 343
Query: 328 VEYHCNSCSGDCSRKRYHCQKQADFDLCSECFNNGKFDSDMSSSDFILMESAEVPG-ASG 387
+ HCN CS + QK+ D LC +CF++G+F S DF+ ++ + G G
Sbjct: 344 CDSHCNHCSRPLPTVYFQSQKKGDILLCCDCFHHGRFVVGHSCLDFVRVDPMKFYGDQDG 403
Query: 388 GKWTDQETLLLLEALELYKENWNEIAEHVATKTKAQCILHFIQMPIEDSFLESENNDEVG 447
WTDQETLLLLEA+ELY ENW +IA+HV +K+KAQCILHF+++P+ED L+ N EV
Sbjct: 404 DNWTDQETLLLLEAVELYNENWVQIADHVGSKSKAQCILHFLRLPVEDGLLD---NVEVS 463
Query: 448 AKETVVPPSNENDSSVPMDITESMDNKTTRKEASNVENASKEDTVEVKVGQDNSKSKDVE 507
P+N D+K T + + + D E
Sbjct: 464 GVTNTENPTN------------GYDHKGTDSNGD--------------LPGYSEQGSDTE 523
Query: 508 VKAALDNSKSEDGGQKVSEDIALNALREAFEAIGYVLTSDQHPLSFSGVGNPVMALAAFL 567
+K L F NPVMAL AFL
Sbjct: 524 IK-----------------------------------------LPFVKSPNPVMALVAFL 532
Query: 568 ARLVGSDVAGASAHFSLKSISQ 569
A VG VA + AH SL +S+
Sbjct: 584 ASAVGPRVAASCAHESLSVLSE 532
HSP 2 Score: 51.6 bits (122), Expect = 4.6e-06
Identity = 37/124 (29.84%), Positives = 69/124 (55.65%), Query Frame = 0
Query: 906 KQNPVLDKGTDDNQSKNNEEENSKCTSKKEEKVDKLKRAAVTTLAAAAVKAKVLANQEED 965
K+ +LD + + ++ DK+ A L+AAA KAK+ A+ EE
Sbjct: 544 KEASLLDGENQQQDGAHKTSSQNGAEAQTPLPQDKVMAAFRAGLSAAATKAKLFADHEER 603
Query: 966 QIRQLAMILIEKQLHKLESKLAFFNDMDNVSMRVREQLDRSKQRLFQERAQIIAARLGLP 1025
+I++L+ ++ QL ++E KL F +++ + M+ EQ+++++QR ERA++++AR G P
Sbjct: 604 EIQRLSANIVNHQLKRMELKLKQFAEIETLLMKECEQVEKTRQRFSAERARMLSARFGSP 663
Query: 1026 ASAS 1030
S
Sbjct: 664 GGIS 667
The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
Q8VY05 | 9.7e-187 | 43.25 | SWI/SNF complex subunit SWI3D OS=Arabidopsis thaliana OX=3702 GN=SWI3D PE=1 SV=3
Q9XI07 | 5.6e-41 | 29.88 | SWI/SNF complex subunit SWI3C OS=Arabidopsis thaliana OX=3702 GN=SWI3C PE=1 SV=1
O14470 | 2.3e-39 | 29.62 | SWI/SNF and RSC complexes subunit ssr2 OS=Schizosaccharomyces pombe (strain 972 ...
Q53KK6 | 1.8e-36 | 33.10 | SWI/SNF complex subunit SWI3C homolog OS=Oryza sativa subsp. japonica OX=39947 G...
Q6PDG5 | 6.2e-32 | 31.11 | SWI/SNF complex subunit SMARCC2 OS=Mus musculus OX=10090 GN=Smarcc2 PE=1 SV=2