MC03g_new0176 (gene) Bitter gourd (Dali-11) v1

Overview
NameMC03g_new0176
Typegene
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionDNA-directed RNA polymerase subunit
LocationMC03: 8067873 .. 8090809 (-)
RNA-Seq ExpressionMC03g_new0176
SyntenyMC03g_new0176
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGAATAGAGCGCAAGTGGAAGGTCTGGTGTTCACTAAAGAACCATACATTGAGGATGTTGGACCTCGTAAAATGTAAGATTAATTGGTTTCTAATCATCATATTAAGATGCTGGTTATTATTAGTCTATAATCTTTTATGTCTATTTTTTTGTTTGGCTCTTTCTGTAGCAAGAGTATGCAGTTTACTACGTTCTCCGGATCTGAAATTAGCAAAATGGCTGAAGTTCAGGTGTACAAAGGCTTATATTATGATACCACTCGGAAACCCATTGAGGGCGGCTTGTTGGATCCTCGAATGGTACAATTGCTTTCTTATATTCTCATGGGTGATTGGAGGTTCATGTTCATTAAGTTTAGTTTTTTTTACCTTTTTTTTTTATTATCAATCATAATAAACAACTACGGACTGAATCTAGCTGCTGTATTTGACTACCACAATTAATGATATACACTAATGCTCTCCCTCAAGATGGAGAAGGAATGGAATTGTTTGGCCCAAGATTGAGAGTTGAGTTGTAGTATTGTATAATGTTGACCTATGTAGAATGAAAGCCAATGTCTATATATTTTAGACGCTCAAGAAAGCTAGATGGGATACAATGTAGAATTAGATGCATAATTTTCATGAATAGCAAGGCTGTAAGTGACAAAATCTAGAAGTTGTTGATCAGTTAACTAACTCATTTGCAACAACTGAAATCTATACTTCCCCAATTTCAATCTTCGTTCGATCGGTTTAGCCTAGTTAAGTTTAAAGCCTCTCAGTGGTTGAGAACTATTATTCTTCTCTTATTTGTTGTAATTGGGGTACTTTTTGTTCCCATTTTTAGTTTTTTCTGTTTATCGTTCACTCTTTCGAGAGTTTGTATCGTTGAACATTTTTGTTCCTTTTTATTATTTCAACAAGAAGTTCGTTTCTTGGTAAAAAAGAAAAATTGAAGTCTATACTAAGCCTTTGGTGAAGATGAGTAGTCATTGGTTGCTTCTTAGCCTTCTGAGAACTGTTGAACTCTGCAAAAAATATCAAAACCACGGGTGATTTGTGAGTGTCTAAATAGAATATGCAACTAGATTTGACCAAATATCTTGAGTCGGACAAAATACATGCTTTTTAAGAGAGAGAAGAGAGCAAACCATTGCTAGGAAACCCAATGTTCTCTCCGTACTCTTCTCCCTCCCCAACCATCAACCAACCAGCCACCCACCCCCACCACCCCACCGCAGCCGGTAACCATGGCCTTCTCTTCCCCTAGCTACCTTATCCCATCCCCTACCCGCCTCTCCCACCCTTCTACCGAACATAATGAACACTTGAAACTACCCTGCCCGTAAAGCTAAGATAGAGTGTAAAACCTTCTCGATCAAGGTTGATGATTTCTTCTGAGGGAGTCGTGCCCGCTTATTCTAGATTGGAAGGGATGTCACCTACTCCCTTTCCCTGACTTGGGCATTCATCTACTGGCTTTCTTCATCTATCAACCTACTGCTAGACTCCTCACTCACACACAAATTCTTCAAAGAACACCGTAGTGATGACTACACCCTTTGGATTGAGAAGTTGAGCAACAAGAAAGGCTATTGGCTTGAGATTAACGAGCTTGACAGTAATGGAGGCAAGAATAGGATTATGCTTCCTTATGGTGTTGGAAAGGATGGATGGAAGGCATTTTACAATCTCCTCCATAACTACCCCTCTGATGACAAACCTAAGGCACACTGCCTTTCTATCGGGGATCATTTAAACCACCCTTCTCTTCAACAGAACCCTATTTCTAATGAGCAGTCGCCGACCATTTACAACCAAACCACAGATTCTCCCTCAAAATCGACCTTGATCCCTAAAATCTTCGAATGGAGTCGTGGATCATTGTTCAATGTCATTCCTTACATGATGATTGGTACTCCATCCTTCAGACCATTCAAGAGTCGCTGAGTGACTTCTGCTCCATCAACCTCATTCACCCTGATATAGCCCTCCTCAGATGTGAAGATGACAAACAAGCCATTGCACTCACCTTCACTGACGATTGGCAAACCTTTGGCCATGCAAACTTCACTTCCTTATATGGAGCACGAAGAATATAAACTCCAATCGTCAAGTTCCATCTTATGGAGTTTGGATCACAATCCGGGATCTTACCCTTCATCTGTGGACCGATACCATCATCCCCTACATCAGTGACAAGTGCAGTGGATTCTTAGAAATATCGAAGGCAGCTTCAAACCTTAGAGATCTTATGGAACTGAGGATTAAGATCAGAAACAACGGCTATGGCTTCATTCCAGCGACTGTAGAGATCCCATCTGCCATTACACATGAAGATGACCTCCTTGTTCACATTGATCCTTTCTTCATTGCCAGTAATTTGATCGGAAATCGTAGACCTGCCAATAAGGCCCCAGGACAACCATACAACAACATTGCCGGAGCTAGCCATCCCCGCTGTCAAACTCCCTCCGGCGCGGTTTGCATCGAGACTGCTGCTATTGGGCTCGAAAACCAAGGTGCAAGGCCATTTGCCTCTGCTGCACGTAATCATTTGACAGGGATGTGGACCCATCACGAAATTCTCTGTCCTTGACAAAAGGAAAAGCACAAGTCAATGACCACCCTGCCTCCTCTGCCCTGCCCCCTACTGTCATGGTACAAAGCCATTAATTATCAATAAAGGTAGGGATCACGTGAGACGTGACTGTACGGGAGAAACACTACCCATCCCCTCTCTTCCAAAAACGGAAACCTCGATTTATCCAAGACAGCCCCACCTTTGATGGCCACGATGAAATGAACTATTTCTCTAGCCCTCTTCCCGAGGATAGTCCAATCCTACCCATCTGCAACAACCCTAATACTCCTATTACCCCCACCATCCTTTTTGGTGATTTGCCAATCCTCGAGGCCACTGACACTCTCTGGTGTAAACCAATTCTTGAAGAAAACTCTTTATTCTGATTCTGCCTTGTACAAAGGAGTGGGTTGGCTATTTATAAGAGACCAACCAAGGACTAAAGCATACTTAACTAAAACGAACTAACTAAAACAGAAAGTACTAAGACTAACTCTTATATAACATAAGGTAACTTAACGGATTACATCAAACTCCCTCTTTGTTGAAGAAAACTCGTTCTCGAGTTTAGTTGGCCAATGTAAATGCGTCTTCAGCTCGATACTTATACAGATCTGCAACATTGAAGACTGGAGTGATCCTTAGATCAGTAGGAAGGTTGATTCGGTATGCATTGTCGCCATACTTTTTAAGTATTTCAAATAGGCCAATCTTTTTGTCACTGAGTTTGCTGTAAGTCCCTGTTGGAAATCTGGATTTCTTTAAATGAACCATGACCAAATCCCCCTCCTCAAATGTCTTCACTCTTCGGTGGGCATCTACTGCTGCTTTGTTGGCCGCTGCAGCCTTGGTAAGAATCTTCGTTACTTCCTTGTGTAGTTCTACTATTCTTTCCGCCATTTCCTCAGCTTCCTGATTCAAGTCAATAGAAGAGGGAAGATGAGCTAGGTCCACAGTAAGCCTAGGTAATTTTGTATAAACAACTTCAAAGGGAGACTTCCCTGTCGAACGGTTGCGCATATGGTTAGATGCAAATTCTGCTTGAGCCATGTAAGGTCCCATTGCTTCGGTTTGCTCCCCCCTAAGCATCGTATTAGATTGCCGAGGGTGCGATTTGTTACCTCCGTCTGCCCATCCGTTTGTGGATGGCTGGTAGTACTAAATTTGAGGCTGGTGTCAAATTTCTTCCATAGGGTTTTCCAAAAACGACTCAGAAATTTGACATCACGGTCGGACACTATTGTTTTTGGTATGCTGTGTAAACGGACAATCTCTCGGAAGAACAGATTAGCCACATATATAGCATCCGAAGTTTTTTTACATGGTAAGAAGGGAGACATTTTACTAAACCGGTCAACCACGACCCAGACAGAGTCATATCCTCGTTGTATTTTTGGTAATCCCAGAACGAAATCCATAGAAAGATCTTCCCAAATAGTTGTAGGGATAGGGAGAGGGCAGTATAATCCCGTATTATGAGATTGACCCTTGGCTTTTTGGCAAATATTGCATCTGTTGACAAAATTATTAACATCTTTTCTCAACTGTGGCAAAAAAAAAAAAAAAACGAGCAGCCACTAGTTCATATGTTTTGTCTCTTGCCTAAATGCCTAGCCAATCCTCCGCTATGTAAATCTTGGATCAATATCTCCCTTAAAGATGTGTGTGGGATACATAAACGGTCCCCTTAAAGAGGTACCCATCAAGGATATGGAAATCATTGTTGTTAACATGCTCATGGCATTGGAGCCAAATAGTATGAAAGTCACTGTCATGTTCATATGTATTTGGTAAATGACCAAAGGCAATTACTCGTCCAGTGAGTAGAGTTAATAGGCTAGCCTTCCTACTAAGAGCATCAATAACTTTGTTAGTTGTTCCAGCTTTATGTTTGATTACAAAATCAAATTTTTGAAGAAAAGCAATCCACCTAGCATGCATTCTATTAATTGTTTTTTGTGTTTGTATGTATTTAAGAGAGAAATGGTCCGTGAATAGTATGAATTCCTTGCCAATTAGATAGCGTTCCCACTGTTTGAGAGTTCTAATTAAAGCAGAAAGTTCTTGTTCGTAAGTACACCACTTTTGTCTAGGCGGGGTAAGCTTTTCACTAAAGTATTCTATTGGGTGGTTCCGTAACTTCCGTCGGGACAACACAGGCCCTATGCCAATTCCTGAAGCATCTACGACAACTTCAAAGGGGTGGGAAAAGTCAGGTAGGGTCAAAATGGGTGTGGAACTAAGCTTTTGTTTCAGAGTTTGGAAGCTAATGTCTTGTTCCTTATCCCAACCGAATTTACCTTTTTTCAGGGCATTTGTTAATGGGGCTGCAATAGAGCTAAAATGTCGGATAAATTTATGATAAAAGGATGCCAAGCCTAGGAAACATTGAATATCCCTAACAGATGTGGGAGTAGGCCAGTTTTGAATAGATTCAACCTTTGAAGGGTCGACCGAGATACCATGTTTTGATATATAAAAGCCTAGGAATGCTATTTCACTAGTGAGGAAGGAGCATTTAGCTAGGTTAATGTATACCTGATTGTCTTGAAGAGCTTGAAATAAGGTTTGTAAGTGTGTTAAGTGATCATCAACTGTTTTACTATAGACAAGTATGTCATTGAAATAAACTACGATAAAGTTATTAAGAAAAGGACGTAAAACCTGGTTCATGAGGCGCATGAAAGTGCTCGGTGTGTTAGATTACCCAAAAGGCATCACGAGCCATTCAAATAGCCCTTAATTTGTTTTGAAGGCTGTTTTCCACTCGTCGCCAGGTCTGATCCGTATTTGGTGATAGCCACTCTTTAGGTCTACCTTTGAAAACAGCTGTGCTCCACTAAGTTGATCCAAAAGGCCAGACTGTCGAGATAGGAAATCTATATTTGATTGTGATCTTGTTTATAGCTCGGCTATCGACACACATCTGCCACGAACCATCTTTTTTAGGTGCCAATAACACAGGCATTGGACAGGGACTAAGGCTCGGTCGTATGTGTCCCTTGAGAAGAAGGTCATTGACTTGGTCTTGTAGAATCTGATACTCAGTGGGACTCGTACGATAGTGTGGTAAATGGGGTAAGGAGGCACCAGGGATAAGGTCGATGCAGTGCTGTATATCACGCAAAGGAGGTAAGCTCGAGGGGTCTTGTGTCAAGTTAGCAAAATTATCCAATAGGTGCTGAATATCCGGGTGATGGTGGGCATGAGGCGTAGGATTTGAGTCACCTTTGACCACCAACACCCATATCAAAGGGTCGCCCTTTTGAATAAATGACTTACCATTAAAAATGGTAAATAGTTGCTTAGAAGGAGAAGGTGATTCTTTCCGGGTCTTGTGATCCAACGTCGAAGGCAATAGAAAAATTTTCTTGCCCATCCAGTGAAATTCATAAGTGTTTTCCCTGCCCTTATGTATAGCTTGGAGATCATATTGCCAAGGACGGCCCAAGAGAATGTGACAAGCATCCATATCGATGACATCACACACAATCTGATCACGATATTGGTTACCAATGCTTAAGGGTAAGAAGAAATGGATGTCTCACCCCCTTTCCGAATCCACGAAACTTTATATGGATGGGTGTGTGCATCAACTTTGAGGTTAAGGGCAAAGACTAACTTATGAGAAACCATGTTCTCACTGCTGCCACAATCGATGATGATGTTGCACACTTTTCCGTTCACAGTACAACGGGTACGGAAGAGGGCATGTCACTGGTTGAGCTGATCGACCTTTGGTGCCAAGAGAATTTTTTGGATAAAACAATTAATGAAATCACCATCATCAGGCTCGACATAGGTGAGTTCTTCCTCCCCAAGGGACTCAATCTCAGCATCACCGGTATCAATGTCATCGACCAATGCCAGGGTTCGTCGTTGCGGACACTCGTCGGACAAGTGACCCACTTGTCTGCAGCGATAACATTTTCCCATTGTCGGACGGACATACTGATTCATCCCTTTCTTGGATACCAAATCAGTGTTCTTGACTGGAGGAAAAGTTTTAGGTTGTTCCTCTGTTACTTTGGAGCTGGAGGCAGAGGAAGAGCTGCCGACAGTGATTGGTTTGCCAGTGTTCGAGGGAGAAAATTTAGAACTAACCGTCTTATCCCAAGCTGTTCGACGAACCGGTTGCCTGCCCCATCTTCGTTCTATCTTTTCTTCAATCTTTGTAGCCATAGTGATAGCATCTGTCAAGAGACCAATGGGTTGGATGTCCAACTGATCTTGAATGTCTTCTCTGAGGCCATCAACAAAGCAGGCAATCTTGTAATCTTCACTTTCGGTTAAGTTTGTTTGGGTGCCTAGACGATGGAATTCTTCGATGTAATCGGCAATGGTCTTAGAACCTTGGTGACATTTCTGATATTGCTGGTACAATAATTGTTCAAAGTTGGACAGTAAAAATCGTTGACGCATGAGGCGCAACATCCGTGGCCAACTACGAATGGGACGCTTACCTAGGCGCCGTAGATTTATTTCCAGCTGATCCCACCAGGCTGAGGCACCAGACTGAAGTTTAAATGCCACAAGCCGTACCTTCTTATCATCCGGTGTATTGGTATAGTCGAAGAAGTTCTCAACGTTCTTTACCCAGTCCAAGAAAGCTTCAACGTCCATCTTACCGTTGAAAGGAGGTAAGTCAATCTTCATTCGAAAGTCTGTACGCATGCCTCCTCTGTTTTGATGTTCGTCCCCCCTGAAACGAGGATTGTGTCGAAAAAAATCCTCCTCCTTGCCAGAAGAGTCTGATTCTTGAAGTCTCAGCTGCCAATCTTGATGTCGTCTTCCACGGCCTTCCATGTTTCTTGGTGGAACCAGAGTTTGGGATCGGTTTCTTGGACGTTCTTGAAGTCCAAATCTTTGTTGGCCCTCACGATTACCCAAATCCTCACGTAACCGTTCTTGTTCCAAAGCAATGTTTTCCAGGTGCTTTGACAAATTTTCCATCAACCAGTGGATAGTCTGCACACTGGAACGAACCTCCCCCAAGGAATCTTCTACCAACAATAAACGATCAGTAGCTGTTCTTGGAGAGAGCGGTTGAGTTGTTGAGAGATCTGCCGCAGGTCCCTTGCCGGAAGGTGTTGTGGACTTTCCTGCCATCAGATCCCAAGGAGTTTTTCGGGGCTCTCATACCAATTTGGTGTAAACCAATTCTCGAAGAAAACTCTATTCTGATTCTGCCTTGTACAAAGGAGTGGGTTGGCTATTTATAAGAGACCAACCAAGGACTAAAGCAGACTTGACTAAAACTAACTAAAACAGAAAGTACTAAGACTAACTCTTATATAACATAAGGTAACTTAACGGATTACATCACTCTCCTGGCCACAAAACCTGCATTGTCACCAAACATAGCCACCTCTAATATATCTACCCTACAACCATTGACTGCAGAACCTATCAGCATCGAATACGAGGACAAGGCTTTGGAAAAAGGCTTGTTTTCACCTAATGGTTTTCGGTAGTATTTAGAAAATTATCTAAAGGAGCCAGGCTTCTGCATTATGGCGGTTCCACAAACAAAACCAAAGACAAAAAAGCCTAATAAGATGGGTAAAACAAAGGTGGGAACCCGTGAGCTCAAAAGCCTCACTTCATCTATCAATTATGAGAAACAGAAGCTTCTTCAGGGGGGCCTTCCATTGCTAAATGAAGCTCCCGTCATGGAATGTTAGAAGGTTGGGCTCGTGGAAGAAGAGAACCCTAATAAAGCAATCCATCTCCCGCCTTAATCCAAACGTGGTGATTCTTCAAGAAACAAAGCTCTCCACTATAGACACCAAAGTATTTCCTCTTTTCTTTTCACTCTTTCGAGAGTGTGTTTTTTTGAACATTTTCATTCCTTTTCATTAATTCAATGAGAAACTTGTTTCTTGTTCAAAAAAAAAAAAGAGACTATAGACACCATCGTTGTAAAGTCCCTGTGGAACGCTCATGGATTTAATTGGACTGCCTTAGATGCAAATGGAACAGCTGGTGGAATTCTCATCCTCTGGAATGATTCAGACTTCACTGCCAATGAGATTATAGAAGATCCCACTCCTCCATCTTCACGTGAAATCCATCTAACTTGCACCATTTACCATTGAGCCAATCAAAGTTCAAGGGTCTTCCACTTTCGAATCTAAATAAAGCCTTATCTGCTAAGAAAGGGTTAACTTAGCAATTCTTCGAGAAGAAACTTTTGAGGGAGGCATCGATTGCCTTTTAGCTATCCTGAGTGCAAAATCTTGAGATAACCATGACATTATCCAAATATGTTTCCACAACTTCATGTTCCTTTTGAACACAATAATAGTTGCTCCCTTTTCTTCGAAAACCTCATCCTCTACGGTAGGGGATGCCATCATACTATCTCTGATCTTAGCCCACTCTGCAAAAGAAAAGTTAGAGGCACCCTACAATACCTTCCCTTTGAAATTCACCAATGATTCTTTTATCATGTCCCAATAGGTTATCCAATCGATCTTTCTAATCCCTGCAGGTACAATTATGAATTTTCTTCCACCCGAAGGGGGCCAAACAACCCATTTCAAAAACCAACCTTTCAAGAGTCTTTTTTCGGAGTCGACAAGTTCTGTTCCCATCCCTTACTTGCATGGAAAATCTTGAATACAAAGGACAATGCAGAAGTTCCACCAAAACATTTTCTAACCACTGAAGTTGGGAAAGTTGAAGAGAAATATGAATATTCTTCCTAGAGTCTTGAGAAAACGGTCACTATCGCACCCAACCCAATAGTAATTGTATTGAATGCTGGAATTTCTCCAATCCATTTGAAAGCAGCCAAAAAGGACGGTGAGCTGGTTAGGAAGACATTGGCAGCGGCTTAGGATAGGAGGACGTCGGCAGCGGAGTAAGGTTAGGAGGGAGTTGGCAGCGGAGGAGGACGTCGGCGACAGATTAGGGTTAGGGGAACGACGGTGTTGGGTTGGCCGACAACACCGAAGGGTAGAGCGGCGGCTAGGGTTAGGGATGGCGAGCAGCGGCGATGGCGAAAAGAGGACCGAACGAGGAACCCTAGCGGCACAGATGAGAGCAGAGGTGAGGGTGGGTAGTGGCGATGGCGGATGGAGGAGAGGCCGGTGGCTAAGATTAGGGTTAGGGTTAGAGCGGCGGCTAGGGTTAGGGTTAGGATTAGAGAGTCGACGACGAATGGAAGAAGACGATAAGGGCTAGGGTTAGGGATAGATGGCAGCTAGGGTTAGGGTTTGCAAGATGCGGTGACGGTGAACGGCGCGGCGGCTAGGGTTAGGGTTGGGAGGGAGAGAGTCGGGGTGAAGCTTCAAAATATCCACTGCTGATTTTTTTAAAAAGTCAGGGAACATTGACAAGGCTGATTCATTGACTATGTTCCAAGATTTTTTCAAAAAATGGAGTTATCAATGCTGTCCAAAATGAGACTTACATTTGTCTCATACCCAAAAAGAAAGAGGCTAAAAAGAAAGAGGCTAAAACAGTGTACGACTATCGACCGATCAGCCTCATCCCATGTGCCTACAAAATACTTGAAAGGGTTCTCCCCGAACGCCTCAAAAAGGTATTGCCTCTCACTATCACTAACCATCAATCTAGAAGGAAGATAGATATTGGATTCCGCCCTTATCGCAAATGAAATCATTGATGAATGGAAAACTTGAAAGTTAGAAGGGGTGGTGATCCAACTGGATCTTGAAAAGGCTTTCGACATGGTAGATTGGGACTTTCTTGATATAATCCTATTTCACAAAGGATTTGGGGAGAAATGGAGGAAATGGATTACGGCCTGTATCTCTACTACAAGCTTCTCCATCATCATAAACGGGAAACCGAGGAGCAAATTCACTGCTACAAGAGGCTTGAGCAAGGGGACCGCCTTTCCCCTTTTCTCTTCATTCTCATCACTGATGTCTTCAGTCGGCTGCTTCATCATGGAGAAGAACAAGGCCATATCTCAGGTTATGTCTCAAATTGTGGCTTGATTAGAATTGATCATCACCTTTTTGCAGATGACACCCTCCTCTTCTCCAAAAAGGATCCCTTCTACATCAAAAACCTACACAAGATTGTTAAGTTTTTTGAGATTGCTTCGGGTCTCAACATCAACTACAACAAATCCAGTGTATTTGGCATCTCTATCTGCACACGGACCATTAACAAAATCACCAAAGCATGGGGCTGTTCATCGGGAAGTTGGCCATGCACTTATTTGGGATTACCACTCAACGGTAACCCAAAAAAGGAAGCTTTCTGGTTGCCAATCATTGAGAAGCTTCATGTTCGGCTTAGTAGATGGAACAACTACTTCATTTCTAATTCATTTCTAAGGGGGGAGGCATACCATTCTTCAATCCATCCTCACGAACCTTCCCACATACTTCTTGTCTCTTTTCAAAATATCTGGCAAAGCGGTAGAGAAGATGGATAAGATTTTTAGAGACTTCTTATGGGAAGGACCAAGAAACAACGGCAAGCCCCATCTCATCAATTGGAATCGCACACAGCTCCCTATCAATGAAGGAGGTCTCGACATTGGGAACATTAAACAGAGGAACCATGCTCTCCTCTCAAAATGGGTGTGGCGCTACATAACCGAACCTAAACAACTATGGAGAAGACTTATCTCAGCAAAGTACTATGGAACCCTCTCAACTCTTTTGCCTATAGCTGCCATCCAATTCTAAAAGGACCTTGGCCTGCGATCACTCTACAAACCAACCTCGTAAAGGCAATAGCCATTTGTCTGGCGGGAGAGAGTAGCCAAATCAGCTTTTGGAATGATAGATGGTTTGCATAGGGCACCATTGCCTCTCTCTTTCCAAATATATACACTCTAGCAATGAATGTGGGAGCTACAATTGGAGACCAATGGTCTAGAACTACTTCCTCATGGAACTATAATTTCTGCAGACCCCTTAAAGAGGAGGAGATTGAAGAACGGATGCAATTATCTGTCTTGCTCAGCCCGATCTGCCCCCCCTTCTTGAGGATCATTTGGAGTTGGCCCCTAGAGCCATCGGGAGAGTTCACCACTAAATATGCCTTCAATCTTCTCAACACTCACCCTGCCCAATCAGCTCCCACCCTTTACCCTTGTATATGCCAAAGAAAATGTCCCAAGAAAATCAAATTCTTTTGTTGGGAACCAGCCATGGGGCTTTAAATACGCATGACAGAATACAGAGGAAGCACCCTTACATATCACTCTCCCCCACTGCTATCACCTCTGTTTTGGAAATTAAGAGACACAAAGTCACCTCTTTATTCTCTGTCCTTTTGCCACACGTTTTTGGAGCAGTGTTCTCTCGGCTTTTGGATGGTCTACAGTCCGTCACTCGCATCCAACCATTATTCTTGATTGTTTTTTGGTGGACCATCCTTTCAAACATGGCAAAGCTATTCTATGAGTTCAAATTGTTCGGGCTTTCTTATGGTCTATATGGTTAGAAAGGAATCGTCGGGTTTTCAATGATAAAACCCTCTCTTTTGGAAACTTTTTTGATAGAGTTCTTGTTTTAGCCTTGGGTTGGTGTGTAATTGTTTCTACCCCTTTCAACAATTACTCTCTTCATGATCTTATTACAAGTTGGAGAGCTTTCTTGTAATTGCCTTTGAGGCCTTTTGTATATTTCATACCATCAATGAAATATTTGTTTCATATAAAAAAAAAGGAAGAGAGAAAACCATCTTGGGTAGTCATTGGTTCAAAGGGGCTAGTGTAAAGTGAGAGGGAATGAGTTCAAACCCCATAATGGCCACCTATCTTAGAAATTAATGTGTTTCCCAATACCAAATGTTGTATGGTCAAGCGGTTGCCTTGTGAGATTAGTCGAGGTGTGTGTAAGCTGGTTTGGACACTCACGAATATCAAAAATAGAGAGAGAGAGGAGAGCATCTAGATGGCTCTTACAAGAAAACCACATTTTTCTCTATCTTCTTTAGTTGCATCGTAGGAACCTGCAGAAGTCAAACTTATGTTGCATTGGCAACAACCTTTGCTTTGCCCCCTCTTTTGTGAGGATTCTAAAGGAGTCAAATTTAACCTGCAGAAAGGAACTCTTGTGTCTAAGTTATGCTGCAGTTTTGTTATTCGTGTAGAAAGGATGTGGAAACCAACCTCTCTTTGTTCATTGTGATTTTGGGGTCAAAGGTTGGCTCTCTATCTTTGGTTTATTTGACCTCAACTTTGCCCCGAAAAAACAGAAAAGATTGACTTCTGGAAGTGCTTTCAGGATGGTGGCTAAAAAACAACTAGAATATTATGGAGAAGTGATATTAGGGATTTTATTTGGTCTTTTTGGTTAGAGAAAGACAATAGGATCTTCAACGATGGTTTAAGTCCTCAAATGTTATTTGGCAAAATGTATCAGCTTATGGCCTCAAATTGCCGTGCCATTCATATAGATTTTTGTAATTATAGCAGCTTTTTAATCAATTTATACCTCCTTGGCTGAGGGATCTCTCTTCCCTCTACCTTTTGGCTGTATTTCCGTTTTTCATTAATGGAAGTCTGTCTCTTTTGAAAAAAAACTTGAAATTACTGTCATCGGTTTTTTTAATTTCATGGTTGATGCTGGGGCCGTGATTGCAAATTTTACGACTTTATTTATTACATATTTTCATTCTCTCTCTCTATCTATTATTTGTTCAGAAAAATATTGTTCTAAATGATTGCTTTTCCTCTTGGGATTGTGTGTCAGGGTCCTGCAAATAAGGGATGTAAATGTTCTACATGCCATGCCAATTTTGGTGACTGCCCAGGACATTATGGATACTTGAGTCTTGCTCTCCCTGTTTTTAATGTTGGTTATTTTACTACGATATTGGAAATTCTAAAGTGCATTTGCAAGGTATTGAACAGATAAGTGAAATGCTTTCCCTGGATTAGTGCAAACTTTCTTGGAATATTATTTCTCATGTCTAAAGTTTCTCTACAGTCATGTTCTCGTATACTTTTGGAAGAGAAGCTTTACAAAGACTTCTTGAGGAAGATGAGAAATCCTAAATTGGAAGCCTTGAGGAAGTGTGAGCTTGCGAAAAAAATCGTAAAGAAGTGTACTACCTTGACAAGTAGCAATAAAAGCATGAAATGCTCCAGATGTGGATATTTAAATGGTTTGGATTTCACCTTTTATTTTGATGTAGACTCGAGCTAAGCCCCATTTATATGTCAATTATAATTTATTCACCTCTGTTTATAAGGCTTCTCTAATTTTTGGATGCTTGTTTTTATTTGTTACCTAAGCTATGTTTGGGACTAGTGAATTCATACTCAACTAAAATCGATTTGCAAATGCACATGCATGATTTACCTGATTATAAAATAAAAAGGCAATCCACTTTGTTTTCATGTGTTCTTGCATGATGTGGTGATCTTTCATTTTCTTATTTGAATCTCAACCATGTGATTGGTTTGCACCTGCATAAAAGTGGGCTTATAATTGATTGAACTCCATTTTTTTAATGGGAAATAGTTTCATTGATAGTATGAAATTACGAAAGAGGGGAAAGATCCCAATCCATGGGAGTTAAAAAAACCTCTCCAACAAGATGAAATAGAAGCTATAGTGATGGAAAAGAAGAGAATATTTACACCAAGTAATAGTTAAAGAAACCATAGCATCAAAAAACCTAACAAAATCTCCTTTTCAATAAAGATTCTCTTATTCCTTTCAATACAAGTACTCCAAGAAAAAGACCGCGACAAAGTTCATCCAAAGAAGCTTCTTTTCCTTCTTCAACGAGTGGCCATGGGAGAAGAGTAGGGTCATGGAAGAGCTGTAGACCACCCAAAAGCTTACAAGGATGCCTCCCAAAAAACTGCTGCAAAGTCACAATATAGGAAGATGTGGCATTTTACAGAGCTTGTAAATATATAGCAGAGTTTGTAATTATATCTCCCTTTTGGTTTCATCTTTCTTTTTATTTAGTTTTTGTTGTTCTTTCATTCAACTCAATGAAAGTCTGGTTTCTCATAAACTGAAAAAGGGCAAGTAATTATATCTCTCTTCCTTTGCGAATTGGAGCAGCTGGAACTTCTGCCATCATTCATGATGGTTAACTCCAAAATAGCTTTCAATCTTCATGGTATAAATAACAATGAAGGGCCCATGACATGATTGTTATCATATTTTTTAACCCTTTACTTCAGAATTCTATTCTTCCAACTTAATTGAAACCATACTTATCCAAAAAAGAAAAAAAAAATTGAAACCATACTTGAACATGCTCGTTAATCTTGTTGGAAATTTGACCTTCTGAACTAGTTGTGTGATCATTTTATTGTTCTGTTAATCAAAATTTGTAGGGGATCTGGGAAATGTTTTGGGTGGTAGGATATAAGTTGTGTGATGTGATATTAGAAATGGATATGAGAAGCCTTGGGAAGTGGGGAGGTTACATTTCTCTTGGAACTCTGCTGTATTTTGTTCTTTCAATTATGACATTACAGATTTTGCCATTGATAGTACACCTAGTCCGAAAGAGAGCATGTCGTTGATAAGAAATATCGGTCTTAGGAGTTAACAGCACTCGTTGAAGAACACATGAAACGATATCACCTTCGTCTGGTGCCGTATAATCAATATCATCAACAATTTCCTCACTGCTGTCCTCTTGTAGGACATCCTCTTCGACCATATTGATTATCTTGCGGTGCGGACATTCATTGGATAAATGACCGGTTTGTCCACACCTAAAACATTTCCCTAATGTAGGGCGATGGTATATATTGCCCTGTTTCTTTCCTGTCACCAAATCAGTGTGGTTAGACTTGACTAAATCAGTGTCCAGAGTTTTACCTTTTGTTGTAGTAACGGCATGGTTGGGAGAGGAAATGTTGTCACCCGTCGAAGTAGGCTTCTTGGAAGTTCCACCTTGTTCCCAATTGGTCCTTCGGGTGTACTGTTTTTTAGATCTATTTCCCACTTGTTCTTCAATTGTTGTGGCCGTAGAGATGGCCTCATTTAAATAACCTATTGGTTGGAGGGCGATCTATTCCTTAATATCCGAACGTAGTCCCCCAATGAATCTTGCCACAAGATGTTGTTGTCCTTCCAACAGATTTGTTCGAGCTCCCAAGCGGTGAAATTCTTCAGAATAGTCAGCAATCGATCTGGATCCCTACCTACAATTTTGATACTGATTATATAAAATTTGTTCAAAATTAACAGGAAGGAAGCGTTCCCGCATGAGTTTCTTCATGCGTTCCCAACTTCGAATGGGTTTTTTACCATAACGATGTCTATTAATTTCCAGCTGATCCCACCACGCCGAAGCTCCACCTTTGAGTTTTAATGCTACAAGTCTTACTTTCTTATGTTCAAGTGTTCCCATGTAACCAAAAAAATTTTCGACATTCTTTACCCAATCAAGAAACCCTTCAATATCCATTTTACCATTAAGCATAGGTAGATCAATTTTCATCTTGTATTCATTACTATCTTGGTTTCTATGTGGGTTAGGATATCTACCAAACCAGTCACCTTCAAAGGCAAGTAGTTCTTCCTCCTCCTCGCTGGAAGAATCCGCATTGAACTCAGTCCGCCGCTGAAACAAAGGATTCTGATTCAGTTCTTGATTGAAATGTCGTGGTTCTTGGAGGTTTCTTGGACCCAAACGGATTGTTTGTGGTCTTCTTGGTGGATCGGCTTGCTGTTCTTGCTGAAATAAATGTCTTCGATCCTCCCTCTGTACTTCTTGATCCAATCTTTCTTGTTGTCCCGTGGTGCGTCGGACCGGGGATCGTTCCACACTTAAAGCATCAAGTTTCTCATGTATAATCTCCAAGATGTGTTTAATCTCGCCAACATCCTATTGGATGGCTTTCACCTCCCCTTCAACCGACAGCAAGCGTTCGGCTGACGATCGTGGCGAGAGAATGGAGGGCTGCTCCATTTCTTTGTTGATGAGGGAAGAGTCATCGGTGATCGACTTTTTTCCCGCCATTTGGTGCCCAGAAATTTGGGTGCTCTGATACCAACTTGATGTACAAACACAATGGTTTCTTGAAGAGAATGAAAGATATCTTTTATTAAGAGATCAGCCTCTATTTATAGAGTTTGGCTAACAACTCACAAAACAGTAAAAACTAGTTACTGTGTAGTAACTAATAACTAAATAAAGAACAAAGCTTGAAAGCTACATCTTAAAGGAAAACCTTAAAATAACAACTAAAACACTACATCACTACCAATGGTGGGTTGAGGAGCTTTCAAGGGTTTTGAAAATGGAGAACATTGCTACATTGGTGGGAGGGATGGGCTAATTGCTATGTAGAAAGGAGCCGAAGCTATCTCGCTTATTAATTAGAGGTAATAAGGGTGCATGAGTTTTTAATGGGGACAACTAACCCTGTGTTCATGAGTGCGTTCCAGTTGGAGCTTTTGAAGTTACAAGGGACTAGTCTCAAGTGAAGCACCATGTTTGCAACTTGGATGGACAATATGAGGTAGGTAATAGATGTTTGTAGACATACTTGAGATGCTTTGCTTCTAAGAAACCAAAGACATTAAATGAGTGACTCAGGTAGAATATTGATAAAACACCTTATACCATGTCATCACTAAAATCACTCCCTCGAAATTGATATATGGCAAGCCACCACCACTACCTTTGATCTGATATGAGTAGTCCTCTATAGATCAGTTGTTGCAAGAACAAGGCACAGTTGTCAGTGACCTTAAACTTAAGCTGCCAACAAGGCCCAAGAGATTCTGACAAGCATTGAAGGGGTGCAACCTATCAAGTTGAGGACATGGTTTGCCTCAAGCTACACTCGTACTACAAGTCTTCGTTGGTAGCCCAATGGAACAAGAAGTTGGCCCCTCAGTCAGATGGAGCTCCTTTTAGGTGTTAGAACGAGTGGTGAGATTGCTGACAAGCTACAATTGCTGTCCTTGATGGAATATATCCTGTTTTTCATATTTTTTAACTTCTTACATATAGGACTGGTTTCCGAGGTCAGAAAACTGACTGGTTTTTTTTTGGTTTTTTGTTTTTTCGCTTAAAACTCCTCATCTGCCTCATCTCTTAAAAAAATTAGCATCTCTAAAACTCCTCATCTGCCTTCTATTTCGTGCCCTTATTTTTTCTTCCTTTTTTTGGATAAGAAACGGATGTATAACGATTTGATTATATAAGATAAGAAACGAATATATTAACACAATATAAAAGAAACAACCTAAAGGTGGGGATGGAAAGACCCACTCCCAAATCTCTAACATTAAGGCCTTCCAATCTTGAAGGATAAAAGATAAGTTGTAATTACAGAAAAAATTTCTTCTGTTTAGGGCACCATAAAGATGTAGTAAGCTGTGCCTTTTTCCCCTCCTTAAAAGCACTTACTGCATTTGACCAAAAAACACTGTAATAGAATGTGTAGGGATGTGCAAATGTTTTTGGACAGAAAAGGGTATGATTGAAGATTGAAGTCGTAGGAAGTGGGGAGGGTGTAAACCTCTTGAAAGTCTGCTGTATTTTGTACTTCCTTTCTTGTGGGTAATATGGGATCCTATATTTCTTCTGTGAGCTCTGTTACAAACACAATTTTATACTGGATTAGAATTGGGTTAAAAATGTTTTAAGTGCTCTTTATCTTAGATCTCTGTGTATACGTGGATGTTATTTGGCATTGTCAGCATGCATTGCATTTTAAGGTGTCCAAGAAATTTGCATATTAGTGCAAGTCTTTGTATATATTTTGATGAAAGAAAGGTTTCATTAATATGTGTTTCCATCATTATGAGAACACGTTTTAGAGTAGAGCATGTATCATTGAGGTGAATGTGGCTGATTTTAATGTCTTTATGACAGCTGTGTACATTCTGGTTTTGAGATCAATAATTATTAGCCATATATTTTATGATTCTATTGTAGGTTCGGTGAAGAAGGCTGTATCTATGTTGGGAATACTACATTATCGTGTGAAATCCAAGGATGCGGGGATGGTATCAGAGGATCTTAGAGCACCCTATAATGTGTCGAATGATATTTTGAACCCTTTTAGAGTTCTTTCTCTTTTTAAAAGGATGACAGATGAGGTATACATTGATAGTATTTAGGATCTTCTTTATATTAACGCACTCATCAATGACACTTATTTCTCTTCATATATATCAATGAAAAGTTGTTTCCTGTTAAAAAAAACTCACTAATGTCACTTGCTTCGATCGTTGTGGCCATTGAAATGAGTTGTGTATTGCATAACTTCTCTAGGTGTGACGCATATTATTTTCCTTTTACTTTCATTGCATTCCATTTTTATTTTCATGACATCCTCCTTTGTGCAGGACTGTGAGTTACTCTTTCTATCAGATAGGCCTGATAATCTCATCATTACTAACGTTGCAGTGCCTCCTATAGCTATCCGTCCTTCTGTAATTATGGATGGTTCACAAAGGTTCTTTTTCTTTCCTTCCCTTCCTTCACATATATTGAACCACAATGAAATTCAAGTTTTTGATTGTATATTTTTTTAAGACATCATATTGTCATTTGTAAACCATCTCAAGGTGGCATTTATAAATGGTTTTTAAAAATTTTCTTTTTCCTGTTGCAGCAATGAAAATGATATAACTGAGAGGTTAAAACGAATCATCCAACAGAATGCAAGTGTTAGCCAAGAGTTATCAACATCAAACTCACTACCTAAATGCCTGGTACATTGTGGACTTTGGTGTTTTTATTTTAAATGATTATATATGTGTGTGTGTGTGTGTGTGTGTATAATTTTTTCATTATTTTGCATTTCTTTTGGATAATAAAGATTGCTTTGGTTAGAATAGTCTTTTGTGGAGCTTCTTCAATATAATAAAGCGGAAGGTTGGATAAGACAAGACTTACAAAGTGTCAAGCGACCTCCTCTCGAGAGTTGAAACCGTTTCCACCTATCCAACCTTTTTTCAATTTTCTCCACAATAGTGCTCCAAAAAGCAACTGATCTCGGATTTCCTCCCAAGGGAAGCCCTAAATACGAGAATGGAAGGGAAGCAATCTGACAATTCAAGGAATCTGCAATCTGACAAACTTTTCCAGATTGTATATTAATGCCACATATTGCTGATTTCTCCCAATTTAACAGAAAAACCAAACATATTTTCATAAACTTTGATTAGCCTGAATGAGATTATCCAACATACCATCATCATCTTTACAGAAGAGGATATCATCTGCAAATTGCACGTGGGGAATGTGACTCTTATCCTTTCCAACCAAAAAGCCTTCAAAGAATTATCCCATGTGAATACTGTCCAAGATGCTATGAAAACATCCCCAACTAGGAGAAAGAGGATCACCTTGCCTCAAACCCCTAGAGGCCAAAATACGACCCTTAGGTCTACCATTAATAAATATGGAGAACTTAGTTCCTCTTACAGAGCCACTGATCCATCTACGCCATTTCTTCCCAAACTGTTTTTGTGTAAGCACATCTTCTAAAAAGGCCCAATCAACCTTATCAAATGCCTTTACAAGGTCTAGCTTTAACAACCAACCACTACTCTTCCGAGCCTTATAATCCTCCACTGCTTCATTGGCTAACAAGATCAGGTCAAGAATATTCCTACCTTCAATGAAGGCGCTTTGATTAATTGAGATTGTAGAAGGCATAACCTTCTTAAGACGCTCTACTAGAACTTTGGCCAGCCCCAGTAGCTTATAAATAGAAGAACAAAGGCTTATTGGCCTAAAGTCCCGAATCTTTGATACCTTTGCCTTTTTTGGAACAAGGCAAATGAAGTTCTCCTTAAGGCAAGCATTAATGTGGCCGTTTATATGAAATTGCTTGAATAACCGGACAAAAATCAGCTTTCTGTGACTGCCAAAACCAAATATAAAATTCTGAGGTAAATCCATCAGGACCCGGTGCTTTGTTTTTGCCAAGATCATCTAATGCCAATTGCCAACCTTATTTCTGATTCTTCAAAGCAACGTTCCAGCCATACCCGATCAAATGAGGAAATGGGTCGCCAATTAATCCTTCTTGGAGAAAATCTGGGACCACTGTTTGCAGAATATAACCTTTCGAAAAAGGTAAAAATTTCTTCTTCTATATCTTTCTGATTGACTAACGTTTGACCGTACTCATTAATCAACTCGAAAATAACCGATTTTCGCTTGCGAGCTGCTAAATATTTGTGGAAGAAGCTAGAATTTTCATCTCTCTCCTTCAGCCATTGAATCTTGTTTTTTTGGAACAAATTCCTTTCGGAATCCACATATAGCATCAGCAGATCCACTTGGAGATCACTCCTCTTTTTAGCTTCCTCATTAGATAATCCAGTCTTCCTCTTTGGAATCCAAAGAAGAAATTTGCTGGAGAATTCTTGTTTATGACTCCTTTTTACTTTTTTCATTCTGCAAGTACCAACTTCGAAGCAAAGACTTTAGATTTCTCAGCTTAGAATTCAGAGAATAACCTGCCCAACCATAGGATTGATCATTAGCTAAGTTAGATTCAATCATACTGGGCATTGAAAGATGAGGAGATTTTGGAGCTTTCGGAGCTTCTTTCTTGCATTTCTTCAGTTACTATTTCAGAAAATGATGATGTTCGTTGTTGGAGTTTGGAAAAGTCGGGCCTCTTCTCCGTTTCTTCTCTGTGCAAGTCTGAAAGAGCTGATTTGCGGGTTTCAAAAGCTCTTCTACTCTCTCTTTGGAGTTCGGGCAGCCCAAAACAAGTTAATATGTTAATTTGGATTCTATTATATGGCAAAGTCAATACAGCAGATATTTTGCAGAGAAAGTCTCCAAAGCTGGCTTTACAGCCATCCATTTGCTTTTTATGTGCAGAAAATGAAGAAAGCTTAAATCATGTCTTCTTTTTCTGCCCATATGCTGCTGTTTGTTGGAATCTTTTTCTCCAGATGTTCCAATTAGTATGGGTGTGGGATCGTGAGGCTGCTAATAATATTTTTCAGATTCTACATGGCGTTGGGTTGGCACCTAAAGCTAATCTACTTTGGATTAATGGATTTAAGGCTATCATCTCAGAATTATGGTTTGGGAGAAATCAAAGGATTTTTGAAGATTCTTGTCGCCCGCCATTGGAATGCTTCAACATTGCCAAATTCAAAGCTTCTCAATGGTGTGCTCTTTCTGATCTGTTCTCTTCGTATTCCCCTAGTATGATTTGTATGAATTGGGAGGCCTTTATTTCTCCTTTGTAGTCTTGTCTTATGTTTTATTTTCTATTTCTTGCTTTCTCTATCACTCCTTCGGGAGTTTGTATCTTTGAACAATTTTGTTCCTTTTCATATATCAATGAAAAGTTCGTATCTTGTTAAAAAAAAAATACAATCTTTGTTAGACAGCCACAAATTAAAGAACTGAAAGGGATTGGACCCCATTGGAAATCACCAGCCTCCAAAATAATAGGAAAATGATCTGAAGTGGTTCTTTCAGCACGAGACGCCTTGGAGTTGGAAAAAACGCTCTCTCAACCATTGGAGAACATGAATCTGTCCAATAAGGAATGATTGGACTCGAGACCAGGCTTAGACCAAGTGTACTTGCCCTTGGCTAAAGGGATTAACACCAAATCTAGCTTGTGAATCAAATTATTAAACCTCTTCATTGCTTTAGATGATCTAAATTATTAAACCTCTTCATTGCTTTAGATGATCTGGTACAGGATGACCTTTCAGAAGGCCACCAGATTGTGTTGAAATCGCCACCAATACACCAACATTTTTCAGATAGCGTCGAAATATCATGGAGTTATTGAAAGAAAAACTTCCTTTCCTTGTAATCTGTCGGACCATAAACGAACCTGAGCCAAATATTGTCAATACTAGAAAAGGAAATAAGAACTGATAGAGAATACCCCCTTGATGACTTCTTTTACACTAATCCGGCCTTCATTCCACATTATTAGAATTCCTCCAGACTTTCCAAGTGCATCAATGGTTGACCAACCTATATCTTTAGAACTCCATAGAGATTTAATCAAAGATGGATCAAACGGTTGAATTTTTGTTTCTTGTAGGCATACCACATCCAGGCTGTGGTTCTGGACAAATTTCTTAATTTTCAACCTTTTTTCATCCTTGCACAGCCCCCTGATATTCCATGATACAATCTTCATTGTGAGTGAGAGATAGGAATTAAATTGATACCCACCTCTTTAAAGAAAGGAGGAATTAAGACCTACTTTTCGTTTCCATTGTCTTCTACAAACAATTTTTGCAAGTCTACCTCAAAATACTGCTCGCATAAAACTTCATCTGGAGTGACGATGGAGTTAGCAACTGGTGATGAATCCGGATTACTAATGCTGAGATCTGATTCATCGTCGCCAAGAGATTCATCAGAAGAGGCAGTCAGGGAAGTTATTATTTTGGTAGAGTTGATCAGTCCATGTGTTAAGGAGGAAGGCAGATGA

mRNA sequence

ATGAATAGAGCGCAAGTGGAAGGTCTGGTGTTCACTAAAGAACCATACATTGAGGATGTTGGACCTCGTAAAATCAAGAGTATGCAGTTTACTACGTTCTCCGGATCTGAAATTAGCAAAATGGCTGAAGTTCAGGTGTACAAAGGCTTATATTATGATACCACTCGGAAACCCATTGAGGGCGGCTTGTTGGATCCTCGAATGGTACAATTGCTTTCTTATATTCTCATGGGTGATTGGAGGACGGTGAGCTGGTTAGGAAGACATTGGCAGCGGCTTAGGATAGGAGGACGTCGGCAGCGGAGTAAGGTTAGGAGGGAGTTGGCAGCGGAGGAGGACGTCGGCGACAGATTAGGGTTAGGGGAACGACGGTGTTGGGTTGGCCGACAACACCGAAGGGTAGAGCGGCGGCTAGGGTTAGGGATGGCGAGCAGCGGCGATGGCGAAAAGAGGACCGAACGAGGAACCCTAGCGGCACAGATGAGAGCAGAGGGTCCTGCAAATAAGGGATGTAAATGTTCTACATGCCATGCCAATTTTGGTGACTGCCCAGGACATTATGGATACTTGAGTCTTGCTCTCCCTGTTTTTAATGTTGGTTATTTTACTACGATATTGGAAATTCTAAAGTGCATTTGCAAGTCATGTTCTCGTATACTTTTGGAAGAGAAGCTTTACAAAGACTTCTTGAGGAAGATGAGAAATCCTAAATTGGAAGCCTTGAGGAAGTGTTCGGTGAAGAAGGCTGTATCTATGTTGGGAATACTACATTATCGTGTGAAATCCAAGGATGCGGGGATGGTATCAGAGGATCTTAGAGCACCCTATAATGTGTCGAATGATATTTTGAACCCTTTTAGAGTTCTTTCTCTTTTTAAAAGGATGACAGATGAGGACTGTGAGTTACTCTTTCTATCAGATAGGCCTGATAATCTCATCATTACTAACGTTGCAGTGCCTCCTATAGCTATCCGTCCTTCTGTAATTATGGATGGTTCACAAAGCAATGAAAATGATATAACTGAGAGGTTAAAACGAATCATCCAACAGAATGCAAGTGTTAGCCAAGAGTTATCAACATCAAACTCACTACCTAAATGCCTGTCTACCTCAAAATACTGCTCGCATAAAACTTCATCTGGAGTGACGATGGAGTTAGCAACTGGTGATGAATCCGGATTACTAATGCTGAGATCTGATTCATCGTCGCCAAGAGATTCATCAGAAGAGGCAGTCAGGGAAGTTATTATTTTGGTAGAGTTGATCAGTCCATGTGTTAAGGAGGAAGGCAGATGA

Coding sequence (CDS)

ATGAATAGAGCGCAAGTGGAAGGTCTGGTGTTCACTAAAGAACCATACATTGAGGATGTTGGACCTCGTAAAATCAAGAGTATGCAGTTTACTACGTTCTCCGGATCTGAAATTAGCAAAATGGCTGAAGTTCAGGTGTACAAAGGCTTATATTATGATACCACTCGGAAACCCATTGAGGGCGGCTTGTTGGATCCTCGAATGGTACAATTGCTTTCTTATATTCTCATGGGTGATTGGAGGACGGTGAGCTGGTTAGGAAGACATTGGCAGCGGCTTAGGATAGGAGGACGTCGGCAGCGGAGTAAGGTTAGGAGGGAGTTGGCAGCGGAGGAGGACGTCGGCGACAGATTAGGGTTAGGGGAACGACGGTGTTGGGTTGGCCGACAACACCGAAGGGTAGAGCGGCGGCTAGGGTTAGGGATGGCGAGCAGCGGCGATGGCGAAAAGAGGACCGAACGAGGAACCCTAGCGGCACAGATGAGAGCAGAGGGTCCTGCAAATAAGGGATGTAAATGTTCTACATGCCATGCCAATTTTGGTGACTGCCCAGGACATTATGGATACTTGAGTCTTGCTCTCCCTGTTTTTAATGTTGGTTATTTTACTACGATATTGGAAATTCTAAAGTGCATTTGCAAGTCATGTTCTCGTATACTTTTGGAAGAGAAGCTTTACAAAGACTTCTTGAGGAAGATGAGAAATCCTAAATTGGAAGCCTTGAGGAAGTGTTCGGTGAAGAAGGCTGTATCTATGTTGGGAATACTACATTATCGTGTGAAATCCAAGGATGCGGGGATGGTATCAGAGGATCTTAGAGCACCCTATAATGTGTCGAATGATATTTTGAACCCTTTTAGAGTTCTTTCTCTTTTTAAAAGGATGACAGATGAGGACTGTGAGTTACTCTTTCTATCAGATAGGCCTGATAATCTCATCATTACTAACGTTGCAGTGCCTCCTATAGCTATCCGTCCTTCTGTAATTATGGATGGTTCACAAAGCAATGAAAATGATATAACTGAGAGGTTAAAACGAATCATCCAACAGAATGCAAGTGTTAGCCAAGAGTTATCAACATCAAACTCACTACCTAAATGCCTGTCTACCTCAAAATACTGCTCGCATAAAACTTCATCTGGAGTGACGATGGAGTTAGCAACTGGTGATGAATCCGGATTACTAATGCTGAGATCTGATTCATCGTCGCCAAGAGATTCATCAGAAGAGGCAGTCAGGGAAGTTATTATTTTGGTAGAGTTGATCAGTCCATGTGTTAAGGAGGAAGGCAGATGA

Protein sequence

MNRAQVEGLVFTKEPYIEDVGPRKIKSMQFTTFSGSEISKMAEVQVYKGLYYDTTRKPIEGGLLDPRMVQLLSYILMGDWRTVSWLGRHWQRLRIGGRRQRSKVRRELAAEEDVGDRLGLGERRCWVGRQHRRVERRLGLGMASSGDGEKRTERGTLAAQMRAEGPANKGCKCSTCHANFGDCPGHYGYLSLALPVFNVGYFTTILEILKCICKSCSRILLEEKLYKDFLRKMRNPKLEALRKCSVKKAVSMLGILHYRVKSKDAGMVSEDLRAPYNVSNDILNPFRVLSLFKRMTDEDCELLFLSDRPDNLIITNVAVPPIAIRPSVIMDGSQSNENDITERLKRIIQQNASVSQELSTSNSLPKCLSTSKYCSHKTSSGVTMELATGDESGLLMLRSDSSSPRDSSEEAVREVIILVELISPCVKEEGR
Homology
BLAST of MC03g_new0176 vs. ExPASy Swiss-Prot
Match: F4JXF9 (DNA-directed RNA polymerase III subunit 1 OS=Arabidopsis thaliana OX=3702 GN=NRPC1 PE=2 SV=1)

HSP 1 Score: 233.0 bits (593), Expect = 6.2e-60
Identity = 153/395 (38.73%), Postives = 190/395 (48.10%), Query Frame = 0

Query: 11  FTKEPYIEDVGPRKIKSMQFTTFSGSEISKMAEVQVYKGLYYDTTRKPIEGGLLDPRMVQ 70
           FTK+PYIEDVGP KIKS+ F+  S  E+ K AEVQV+    YD + KP E GLLDPRM  
Sbjct: 9   FTKKPYIEDVGPLKIKSINFSVLSDLEVMKAAEVQVWNIGLYDHSFKPYENGLLDPRM-- 68

Query: 71  LLSYILMGDWRTVSWLGRHWQRLRIGGRRQRSKVRRELAAEEDVGDRLGLGERRCWVGRQ 130
                                                                       
Sbjct: 69  ------------------------------------------------------------ 128

Query: 131 HRRVERRLGLGMASSGDGEKRTERGTLAAQMRAEGPANKGCKCSTCHANFGDCPGHYGYL 190
                                             GP NK   C+TC  NF +CPGHYGYL
Sbjct: 129 ----------------------------------GPPNKKSICTTCEGNFQNCPGHYGYL 188

Query: 191 SLALPVFNVGYFTTILEILKCICKSCSRILLEEKLYKDFLRKMRNPKLEALRKCSVKKAV 250
            L LPV+NVGYF  IL+ILKCICK CS +LL+EKLY+D LRKMRNP++E L+K  + KAV
Sbjct: 189 KLDLPVYNVGYFNFILDILKCICKRCSNMLLDEKLYEDHLRKMRNPRMEPLKKTELAKAV 248

Query: 251 SM-------------------------------LGILHYRVK--------SKDAGMVSED 310
                                            +GI H R K         K A   ++ 
Sbjct: 249 VKKCSTMASQRIITCKKCGYLNGMVKKIAAQFGIGISHDRSKIHGGEIDECKSAISHTKQ 307

Query: 311 LRAPYNVSNDILNPFRVLSLFKRMTDEDCELLFLSDRPDNLIITNVAVPPIAIRPSVIMD 367
             A  N    +L+P  VL LFKRM+D+DCELL+++ RP+NLIIT + VPP++IRPSV++ 
Sbjct: 309 STAAINPLTYVLDPNLVLGLFKRMSDKDCELLYIAYRPENLIITCMLVPPLSIRPSVMIG 307

BLAST of MC03g_new0176 vs. ExPASy Swiss-Prot
Match: Q86AQ5 (DNA-directed RNA polymerase III subunit rpc1 OS=Dictyostelium discoideum OX=44689 GN=polr3a PE=3 SV=1)

HSP 1 Score: 135.2 bits (339), Expect = 1.8e-30
Identity = 81/229 (35.37%), Postives = 128/229 (55.90%), Query Frame = 0

Query: 165 GPANKGCKCSTCHANFGDCPGHYGYLSLALPVFNVGYFTTILEILKCICKSCSRILLEEK 224
           G ++K   C+TC  +  DC GH+GY+ L LPVF++GY   I+ IL+ ICKSCS ILL E+
Sbjct: 59  GTSDKQAMCTTCGLSIVDCVGHFGYIKLQLPVFHIGYLKNIMNILQMICKSCSTILLNEE 118

Query: 225 LYKDFLRKMRNPKLEALRKCSVKKAVSM-----------------------LGILHYRVK 284
             + +LRKMRN K++ L++ S+ K + +                         I+H + K
Sbjct: 119 KKQYYLRKMRNKKMDNLQRKSLLKKIFLECRKTKECLKCGSTNGMIKKSGAFKIIHEKYK 178

Query: 285 SK------------DAGMVSEDLRAPYNVSNDILNPFRVLSLFKRMTDEDCELLFLS--- 344
            K            DA   + ++++    + D LNP   L+LFK+++ +D E++ +    
Sbjct: 179 GKTESLQDYYALYDDAIKYNPEMKSHIKKAQDDLNPLVALNLFKKISYQDIEIMNMDPVI 238

Query: 345 DRPDNLIITNVAVPPIAIRPSVIMD-GSQSNENDITERLKRIIQQNASV 355
            RP+ LI+T + VPP++IRPSV MD GS +NE+D+T +L  I+  N  +
Sbjct: 239 GRPERLILTYMLVPPVSIRPSVPMDGGSGTNEDDLTMKLSEILHINEHI 287

BLAST of MC03g_new0176 vs. ExPASy Swiss-Prot
Match: O94666 (DNA-directed RNA polymerase III subunit rpc1 OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=rpc1 PE=2 SV=1)

HSP 1 Score: 129.0 bits (323), Expect = 1.3e-28
Identity = 90/275 (32.73%), Postives = 139/275 (50.55%), Query Frame = 0

Query: 153 ERGTLAAQMRAEGPANKGCKCSTCHANFGDCPGHYGYLSLALPVFNVGYFTTILEILKCI 212
           E G L   M   G +NK   C+TC  +  DC GH+GY+ LALPVF++GYF   L IL+ I
Sbjct: 49  EHGALDLHM---GTSNKQINCATCGESMADCMGHFGYVKLALPVFHIGYFKATLTILQNI 108

Query: 213 CKSCSRILLEEKLYKDFLRKMRNPKLEAL----------------RKCS-------VKKA 272
           CK CS +LL ++  + FL+ +R P ++ L                R+CS       V K 
Sbjct: 109 CKDCSSVLLSDQEKRQFLKDLRRPGIDNLRRSQICKRINDHCKKMRRCSKCDAMQGVVKK 168

Query: 273 VSMLGILHYRV----KSKD-----------AGMVSEDLRAPYNVSNDILNPFRVLSLFKR 332
              L I+H R     KS+D           A     +L+   + ++D LNP +VL+LFK+
Sbjct: 169 AGPLKIIHERFRYVRKSQDDEENFRHSFDEALKTIPELKMHLSKAHDDLNPLKVLNLFKQ 228

Query: 333 MTDEDCELLFLS---DRPDNLIITNVAVPPIAIRPSVIMDGSQSNENDITERLKRIIQQN 387
           +T  DCELL +     RP+NL+   V  PP+ IRPSV  +G+ + E+D+T ++  II  +
Sbjct: 229 ITPVDCELLGMDPEHGRPENLLWRYVPAPPVCIRPSVAQEGA-TTEDDLTVKITEIIWTS 288

BLAST of MC03g_new0176 vs. ExPASy Swiss-Prot
Match: Q6BI69 (DNA-directed RNA polymerase III subunit RPC1 OS=Debaryomyces hansenii (strain ATCC 36239 / CBS 767 / BCRC 21394 / JCM 1990 / NBRC 0083 / IGC 2968) OX=284592 GN=RPC1 PE=3 SV=2)

HSP 1 Score: 127.5 bits (319), Expect = 3.7e-28
Identity = 89/268 (33.21%), Postives = 136/268 (50.75%), Query Frame = 0

Query: 155 GTLAAQMRAEGPANKGCKCSTCHANFGDCPGHYGYLSLALPVFNVGYFTTILEILKCICK 214
           G L  +M     AN   +C+TCH N   C GH+G++ LALPVF+VGYF   +++L+CICK
Sbjct: 52  GALDTKMGISSNAN---ECATCHGNLASCHGHFGHIKLALPVFHVGYFKATIQVLQCICK 111

Query: 215 SCSRILLEEKLYKDFLRKMRNPKLEALRK-------------------CS-----VKKAV 274
           +CS +LL+E+  + FL  +R P ++ LR+                   C+     VKKA 
Sbjct: 112 NCSAVLLDEQTKRSFLNDLRRPHIDNLRRMKILKKLLEQCKKQRRCLNCNHVNGVVKKAA 171

Query: 275 SMLG-----ILH--YRVKSKDAGMVSE--------------DLRAPYNVSNDILNPFRVL 334
           S  G     I+H  +R   K A    +              +L       +D LNP +VL
Sbjct: 172 SGAGPAALKIVHDTFRWIGKKATPEKDLWDKEFDEVFSRNPELEKFVKRIHDDLNPLKVL 231

Query: 335 SLFKRMTDEDCELLFLSD----RPDNLIITNVAVPPIAIRPSVIMDGSQSNENDITERLK 374
           +LFK+++  DCELL +      RP+  I   +  PP+ IRPSV+MD +QSNE+D+T +L 
Sbjct: 232 NLFKQISPSDCELLGIDSARGGRPEMYIWRYLPAPPVCIRPSVMMD-AQSNEDDLTIKLT 291

BLAST of MC03g_new0176 vs. ExPASy Swiss-Prot
Match: Q5ZL98 (DNA-directed RNA polymerase III subunit RPC1 OS=Gallus gallus OX=9031 GN=POLR3A PE=2 SV=1)

HSP 1 Score: 122.5 bits (306), Expect = 1.2e-26
Identity = 81/230 (35.22%), Postives = 119/230 (51.74%), Query Frame = 0

Query: 165 GPANKGCKCSTCHANFGDCPGHYGYLSLALPVFNVGYFTTILEILKCICKSCSRILLEEK 224
           G + K   C TC  N  DC GHYGY+ L LP F+VGYF  ++ IL+ ICK+C RI+L  +
Sbjct: 61  GTSEKDRPCETCGKNLADCLGHYGYIDLELPCFHVGYFKAVIGILQMICKTCCRIMLSVE 120

Query: 225 LYKDFLRKMRNPKLEALRKCSVKKAVS-----------------------MLGILHYRVK 284
             K FL  ++ P L  L+K  +KK VS                       +L I+H + K
Sbjct: 121 EKKQFLDYLKRPGLTYLQKRGLKKKVSEKCRKKNTCPYCGAFNGTVKKCGLLKIIHEKYK 180

Query: 285 S----------------KDAGMVSEDLRAPYNVSNDILNPFRVLSLFKRMTDEDCELLFL 344
           +                + A   ++++      + + LNP  VL+LFKR+  ED  LL +
Sbjct: 181 TNKKVVDPIVSTFLQSFETAIEHNKEVEPLLGRAQENLNPLVVLNLFKRIPAEDIPLLLM 240

Query: 345 ---SDRPDNLIITNVAVPPIAIRPSVIMD-GSQSNENDITERLKRIIQQN 352
              + +P +LI+T + VPP+ IRPSV+ D  S +NE+D+T +L  II  N
Sbjct: 241 NPEAGKPSDLILTRLLVPPLCIRPSVVSDLKSGTNEDDLTMKLTEIIFLN 290

BLAST of MC03g_new0176 vs. NCBI nr
Match: XP_038894319.1 (DNA-directed RNA polymerase III subunit 1 isoform X2 [Benincasa hispida])

HSP 1 Score: 457 bits (1175), Expect = 9.97e-147
Identity = 256/398 (64.32%), Postives = 265/398 (66.58%), Query Frame = 0

Query: 1   MNRAQVEGLVFTKEPYIEDVGPRKIKSMQFTTFSGSEISKMAEVQVYKGLYYDTTRKPIE 60
           MNRAQ EGLVFTK+PYIEDVGPRKIKSMQFTTFSGSEISKMAEVQVYKGLYYDTTRKPI+
Sbjct: 1   MNRAQAEGLVFTKQPYIEDVGPRKIKSMQFTTFSGSEISKMAEVQVYKGLYYDTTRKPID 60

Query: 61  GGLLDPRMVQLLSYILMGDWRTVSWLGRHWQRLRIGGRRQRSKVRRELAAEEDVGDRLGL 120
           GGLLDPRM                                                    
Sbjct: 61  GGLLDPRM---------------------------------------------------- 120

Query: 121 GERRCWVGRQHRRVERRLGLGMASSGDGEKRTERGTLAAQMRAEGPANKGCKCSTCHANF 180
                                                       GPANKGCKC+TCHANF
Sbjct: 121 --------------------------------------------GPANKGCKCTTCHANF 180

Query: 181 GDCPGHYGYLSLALPVFNVGYFTTILEILKCICKSCSRILLEEKLYKDFLRKMRNPKLEA 240
           GDCPGHYGYLSLALPVFNVGYFTTILEILKCICKSCSRILLEEKLYKDFLRK+RNPKLE 
Sbjct: 181 GDCPGHYGYLSLALPVFNVGYFTTILEILKCICKSCSRILLEEKLYKDFLRKIRNPKLEP 240

Query: 241 LRKC------------------------------SVKKAVSMLGILHYRVKSKDAGMVSE 300
           LRK                               SVKKAVSMLGILHYR +SKDAG+VSE
Sbjct: 241 LRKSDLVKKIIKKCSTLTTGNKSTKCSRCGYLNGSVKKAVSMLGILHYRSRSKDAGVVSE 300

Query: 301 DLRAPYNVSNDILNPFRVLSLFKRMTDEDCELLFLSDRPDNLIITNVAVPPIAIRPSVIM 360
           DLRAPYNVSNDILNPFRVLSLFKRM+DEDCELLFLSDRP+ LI+TNV VPPIAIRPSVIM
Sbjct: 301 DLRAPYNVSNDILNPFRVLSLFKRMSDEDCELLFLSDRPEKLIVTNVPVPPIAIRPSVIM 302

Query: 361 DGSQSNENDITERLKRIIQQNASVSQELSTSNSLPKCL 368
           DGSQSNENDITERLKRIIQQNASVSQELSTSNS PKCL
Sbjct: 361 DGSQSNENDITERLKRIIQQNASVSQELSTSNSQPKCL 302

BLAST of MC03g_new0176 vs. NCBI nr
Match: XP_038894318.1 (DNA-directed RNA polymerase III subunit 1 isoform X1 [Benincasa hispida])

HSP 1 Score: 457 bits (1175), Expect = 1.12e-144
Identity = 256/398 (64.32%), Postives = 265/398 (66.58%), Query Frame = 0

Query: 1   MNRAQVEGLVFTKEPYIEDVGPRKIKSMQFTTFSGSEISKMAEVQVYKGLYYDTTRKPIE 60
           MNRAQ EGLVFTK+PYIEDVGPRKIKSMQFTTFSGSEISKMAEVQVYKGLYYDTTRKPI+
Sbjct: 1   MNRAQAEGLVFTKQPYIEDVGPRKIKSMQFTTFSGSEISKMAEVQVYKGLYYDTTRKPID 60

Query: 61  GGLLDPRMVQLLSYILMGDWRTVSWLGRHWQRLRIGGRRQRSKVRRELAAEEDVGDRLGL 120
           GGLLDPRM                                                    
Sbjct: 61  GGLLDPRM---------------------------------------------------- 120

Query: 121 GERRCWVGRQHRRVERRLGLGMASSGDGEKRTERGTLAAQMRAEGPANKGCKCSTCHANF 180
                                                       GPANKGCKC+TCHANF
Sbjct: 121 --------------------------------------------GPANKGCKCTTCHANF 180

Query: 181 GDCPGHYGYLSLALPVFNVGYFTTILEILKCICKSCSRILLEEKLYKDFLRKMRNPKLEA 240
           GDCPGHYGYLSLALPVFNVGYFTTILEILKCICKSCSRILLEEKLYKDFLRK+RNPKLE 
Sbjct: 181 GDCPGHYGYLSLALPVFNVGYFTTILEILKCICKSCSRILLEEKLYKDFLRKIRNPKLEP 240

Query: 241 LRKC------------------------------SVKKAVSMLGILHYRVKSKDAGMVSE 300
           LRK                               SVKKAVSMLGILHYR +SKDAG+VSE
Sbjct: 241 LRKSDLVKKIIKKCSTLTTGNKSTKCSRCGYLNGSVKKAVSMLGILHYRSRSKDAGVVSE 300

Query: 301 DLRAPYNVSNDILNPFRVLSLFKRMTDEDCELLFLSDRPDNLIITNVAVPPIAIRPSVIM 360
           DLRAPYNVSNDILNPFRVLSLFKRM+DEDCELLFLSDRP+ LI+TNV VPPIAIRPSVIM
Sbjct: 301 DLRAPYNVSNDILNPFRVLSLFKRMSDEDCELLFLSDRPEKLIVTNVPVPPIAIRPSVIM 302

Query: 361 DGSQSNENDITERLKRIIQQNASVSQELSTSNSLPKCL 368
           DGSQSNENDITERLKRIIQQNASVSQELSTSNS PKCL
Sbjct: 361 DGSQSNENDITERLKRIIQQNASVSQELSTSNSQPKCL 302

BLAST of MC03g_new0176 vs. NCBI nr
Match: XP_008465290.2 (PREDICTED: LOW QUALITY PROTEIN: DNA-directed RNA polymerase III subunit 1 [Cucumis melo])

HSP 1 Score: 456 bits (1172), Expect = 3.00e-144
Identity = 254/398 (63.82%), Postives = 266/398 (66.83%), Query Frame = 0

Query: 1   MNRAQVEGLVFTKEPYIEDVGPRKIKSMQFTTFSGSEISKMAEVQVYKGLYYDTTRKPIE 60
           MNRAQVEGLVFTK+PYIEDVGPRKIKSMQFTTFSG+EISK+AEVQVYKGLYYDTTRKPI+
Sbjct: 1   MNRAQVEGLVFTKQPYIEDVGPRKIKSMQFTTFSGAEISKLAEVQVYKGLYYDTTRKPID 60

Query: 61  GGLLDPRMVQLLSYILMGDWRTVSWLGRHWQRLRIGGRRQRSKVRRELAAEEDVGDRLGL 120
           GGLLDPRM                                                    
Sbjct: 61  GGLLDPRM---------------------------------------------------- 120

Query: 121 GERRCWVGRQHRRVERRLGLGMASSGDGEKRTERGTLAAQMRAEGPANKGCKCSTCHANF 180
                                                       GPANKGCKCSTCHANF
Sbjct: 121 --------------------------------------------GPANKGCKCSTCHANF 180

Query: 181 GDCPGHYGYLSLALPVFNVGYFTTILEILKCICKSCSRILLEEKLYKDFLRKMRNPKLEA 240
           GDCPGHYGY++L+LPVFNVGYFTTILEILKCICKSCSRILLEEKLYKDFLRKMRNPKLEA
Sbjct: 181 GDCPGHYGYVNLSLPVFNVGYFTTILEILKCICKSCSRILLEEKLYKDFLRKMRNPKLEA 240

Query: 241 LRKC------------------------------SVKKAVSMLGILHYRVKSKDAGMVSE 300
           LRKC                              SVKKAVSMLGILHYR +SKDAG+VSE
Sbjct: 241 LRKCDLVKKIIKKCSTLTTGNKSTKCSRCGYLNGSVKKAVSMLGILHYRARSKDAGVVSE 300

Query: 301 DLRAPYNVSNDILNPFRVLSLFKRMTDEDCELLFLSDRPDNLIITNVAVPPIAIRPSVIM 360
           DLRAPYNVSNDILNPFRVL LF+RM+DEDCELLFLSDRP+ LIITNV VPPIAIRPSVIM
Sbjct: 301 DLRAPYNVSNDILNPFRVLCLFQRMSDEDCELLFLSDRPEKLIITNVLVPPIAIRPSVIM 302

Query: 361 DGSQSNENDITERLKRIIQQNASVSQELSTSNSLPKCL 368
           DGSQSNENDITERLKRIIQQNASVSQELSTSNS  KCL
Sbjct: 361 DGSQSNENDITERLKRIIQQNASVSQELSTSNSQAKCL 302

BLAST of MC03g_new0176 vs. NCBI nr
Match: KAG6583479.1 (DNA-directed RNA polymerase III subunit 1, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 449 bits (1156), Expect = 6.33e-142
Identity = 253/398 (63.57%), Postives = 263/398 (66.08%), Query Frame = 0

Query: 1   MNRAQVEGLVFTKEPYIEDVGPRKIKSMQFTTFSGSEISKMAEVQVYKGLYYDTTRKPIE 60
           MNRAQVEGLVFTK+PYIEDVGPRKIKSMQF+TFSGSEISKMAEVQVYKGLYYDTTRKPI+
Sbjct: 1   MNRAQVEGLVFTKQPYIEDVGPRKIKSMQFSTFSGSEISKMAEVQVYKGLYYDTTRKPID 60

Query: 61  GGLLDPRMVQLLSYILMGDWRTVSWLGRHWQRLRIGGRRQRSKVRRELAAEEDVGDRLGL 120
           GGLLDPRM                                                    
Sbjct: 61  GGLLDPRM---------------------------------------------------- 120

Query: 121 GERRCWVGRQHRRVERRLGLGMASSGDGEKRTERGTLAAQMRAEGPANKGCKCSTCHANF 180
                                                       GPANKGCKC+TCHANF
Sbjct: 121 --------------------------------------------GPANKGCKCATCHANF 180

Query: 181 GDCPGHYGYLSLALPVFNVGYFTTILEILKCICKSCSRILLEEKLYKDFLRKMRNPKLEA 240
           GDCPGHYGYLSLALPVFNVGYFTTILEILKCICKSCSRILLEEKLY DFLRKMRNPKLE 
Sbjct: 181 GDCPGHYGYLSLALPVFNVGYFTTILEILKCICKSCSRILLEEKLYNDFLRKMRNPKLEP 240

Query: 241 LRKC------------------------------SVKKAVSMLGILHYRVKSKDAGMVSE 300
           LRK                               SVKKAVSMLGILHYR +SKDAG+VSE
Sbjct: 241 LRKSELAKKIIKKCTALTSNNKSMKCSGCGYLNGSVKKAVSMLGILHYRSRSKDAGVVSE 300

Query: 301 DLRAPYNVSNDILNPFRVLSLFKRMTDEDCELLFLSDRPDNLIITNVAVPPIAIRPSVIM 360
           DLRAPYNVSNDILNPFRVLSLFKRM+D+DCELLFLSDRP+  I+TNV VPPIAIRPSVIM
Sbjct: 301 DLRAPYNVSNDILNPFRVLSLFKRMSDKDCELLFLSDRPEKFIVTNVPVPPIAIRPSVIM 302

Query: 361 DGSQSNENDITERLKRIIQQNASVSQELSTSNSLPKCL 368
           DGSQSNENDITERLKRIIQQNASVSQELSTSNS  KCL
Sbjct: 361 DGSQSNENDITERLKRIIQQNASVSQELSTSNSQLKCL 302

BLAST of MC03g_new0176 vs. NCBI nr
Match: XP_004148776.1 (DNA-directed RNA polymerase III subunit 1 isoform X1 [Cucumis sativus] >KAE8652732.1 hypothetical protein Csa_013764 [Cucumis sativus])

HSP 1 Score: 448 bits (1153), Expect = 1.72e-141
Identity = 251/398 (63.07%), Postives = 265/398 (66.58%), Query Frame = 0

Query: 1   MNRAQVEGLVFTKEPYIEDVGPRKIKSMQFTTFSGSEISKMAEVQVYKGLYYDTTRKPIE 60
           MNRAQVEGLVFTK+PYIEDVGPRKIKSMQFTTFSG+EISK+AEVQVYKGLYYDTTRKPI+
Sbjct: 1   MNRAQVEGLVFTKQPYIEDVGPRKIKSMQFTTFSGAEISKLAEVQVYKGLYYDTTRKPID 60

Query: 61  GGLLDPRMVQLLSYILMGDWRTVSWLGRHWQRLRIGGRRQRSKVRRELAAEEDVGDRLGL 120
           GGLLDPRM                                                    
Sbjct: 61  GGLLDPRM---------------------------------------------------- 120

Query: 121 GERRCWVGRQHRRVERRLGLGMASSGDGEKRTERGTLAAQMRAEGPANKGCKCSTCHANF 180
                                                       GPANKGCKC+TCHANF
Sbjct: 121 --------------------------------------------GPANKGCKCATCHANF 180

Query: 181 GDCPGHYGYLSLALPVFNVGYFTTILEILKCICKSCSRILLEEKLYKDFLRKMRNPKLEA 240
           GDCPGHYGY++LALPVFNVGYFTTILEILKCICKSCSRILLEEKL+KDFLRKMRNPKLEA
Sbjct: 181 GDCPGHYGYVNLALPVFNVGYFTTILEILKCICKSCSRILLEEKLFKDFLRKMRNPKLEA 240

Query: 241 LRKC------------------------------SVKKAVSMLGILHYRVKSKDAGMVSE 300
           LRK                               SVKKAVSMLGILHYR +SKDAG+VSE
Sbjct: 241 LRKVDLVKKIIKKCSTLTTGNKSTRCSRCGYLNGSVKKAVSMLGILHYRARSKDAGVVSE 300

Query: 301 DLRAPYNVSNDILNPFRVLSLFKRMTDEDCELLFLSDRPDNLIITNVAVPPIAIRPSVIM 360
           DLRAPYNVSNDILNPFRVL LF+RM+DEDCELLFLS+RP+ LIITNV VPPIAIRPSVIM
Sbjct: 301 DLRAPYNVSNDILNPFRVLCLFQRMSDEDCELLFLSNRPEKLIITNVLVPPIAIRPSVIM 302

Query: 361 DGSQSNENDITERLKRIIQQNASVSQELSTSNSLPKCL 368
           DGSQSNENDITERLKRIIQQNASVSQELSTSNS  KCL
Sbjct: 361 DGSQSNENDITERLKRIIQQNASVSQELSTSNSQAKCL 302

BLAST of MC03g_new0176 vs. ExPASy TrEMBL
Match: A0A1S3CNX9 (DNA-directed RNA polymerase subunit OS=Cucumis melo OX=3656 GN=LOC103502942 PE=3 SV=1)

HSP 1 Score: 456 bits (1172), Expect = 1.45e-144
Identity = 254/398 (63.82%), Postives = 266/398 (66.83%), Query Frame = 0

Query: 1   MNRAQVEGLVFTKEPYIEDVGPRKIKSMQFTTFSGSEISKMAEVQVYKGLYYDTTRKPIE 60
           MNRAQVEGLVFTK+PYIEDVGPRKIKSMQFTTFSG+EISK+AEVQVYKGLYYDTTRKPI+
Sbjct: 1   MNRAQVEGLVFTKQPYIEDVGPRKIKSMQFTTFSGAEISKLAEVQVYKGLYYDTTRKPID 60

Query: 61  GGLLDPRMVQLLSYILMGDWRTVSWLGRHWQRLRIGGRRQRSKVRRELAAEEDVGDRLGL 120
           GGLLDPRM                                                    
Sbjct: 61  GGLLDPRM---------------------------------------------------- 120

Query: 121 GERRCWVGRQHRRVERRLGLGMASSGDGEKRTERGTLAAQMRAEGPANKGCKCSTCHANF 180
                                                       GPANKGCKCSTCHANF
Sbjct: 121 --------------------------------------------GPANKGCKCSTCHANF 180

Query: 181 GDCPGHYGYLSLALPVFNVGYFTTILEILKCICKSCSRILLEEKLYKDFLRKMRNPKLEA 240
           GDCPGHYGY++L+LPVFNVGYFTTILEILKCICKSCSRILLEEKLYKDFLRKMRNPKLEA
Sbjct: 181 GDCPGHYGYVNLSLPVFNVGYFTTILEILKCICKSCSRILLEEKLYKDFLRKMRNPKLEA 240

Query: 241 LRKC------------------------------SVKKAVSMLGILHYRVKSKDAGMVSE 300
           LRKC                              SVKKAVSMLGILHYR +SKDAG+VSE
Sbjct: 241 LRKCDLVKKIIKKCSTLTTGNKSTKCSRCGYLNGSVKKAVSMLGILHYRARSKDAGVVSE 300

Query: 301 DLRAPYNVSNDILNPFRVLSLFKRMTDEDCELLFLSDRPDNLIITNVAVPPIAIRPSVIM 360
           DLRAPYNVSNDILNPFRVL LF+RM+DEDCELLFLSDRP+ LIITNV VPPIAIRPSVIM
Sbjct: 301 DLRAPYNVSNDILNPFRVLCLFQRMSDEDCELLFLSDRPEKLIITNVLVPPIAIRPSVIM 302

Query: 361 DGSQSNENDITERLKRIIQQNASVSQELSTSNSLPKCL 368
           DGSQSNENDITERLKRIIQQNASVSQELSTSNS  KCL
Sbjct: 361 DGSQSNENDITERLKRIIQQNASVSQELSTSNSQAKCL 302

BLAST of MC03g_new0176 vs. ExPASy TrEMBL
Match: A0A6J1HLT9 (DNA-directed RNA polymerase subunit OS=Cucurbita moschata OX=3662 GN=LOC111464789 PE=3 SV=1)

HSP 1 Score: 448 bits (1153), Expect = 8.35e-142
Identity = 252/398 (63.32%), Postives = 263/398 (66.08%), Query Frame = 0

Query: 1   MNRAQVEGLVFTKEPYIEDVGPRKIKSMQFTTFSGSEISKMAEVQVYKGLYYDTTRKPIE 60
           MN+AQVEGLVFTK+PYIEDVGPRKIKSMQF+TFSGSEISKMAEVQVYKGLYYDTTRKPI+
Sbjct: 1   MNKAQVEGLVFTKQPYIEDVGPRKIKSMQFSTFSGSEISKMAEVQVYKGLYYDTTRKPID 60

Query: 61  GGLLDPRMVQLLSYILMGDWRTVSWLGRHWQRLRIGGRRQRSKVRRELAAEEDVGDRLGL 120
           GGLLDPRM                                                    
Sbjct: 61  GGLLDPRM---------------------------------------------------- 120

Query: 121 GERRCWVGRQHRRVERRLGLGMASSGDGEKRTERGTLAAQMRAEGPANKGCKCSTCHANF 180
                                                       GPANKGCKC+TCHANF
Sbjct: 121 --------------------------------------------GPANKGCKCATCHANF 180

Query: 181 GDCPGHYGYLSLALPVFNVGYFTTILEILKCICKSCSRILLEEKLYKDFLRKMRNPKLEA 240
           GDCPGHYGYLSLALPVFNVGYFTTILEILKCICKSCSRILLEEKLY DFLRKMRNPKLE 
Sbjct: 181 GDCPGHYGYLSLALPVFNVGYFTTILEILKCICKSCSRILLEEKLYNDFLRKMRNPKLEP 240

Query: 241 LRKC------------------------------SVKKAVSMLGILHYRVKSKDAGMVSE 300
           LRK                               SVKKAVSMLGILHYR +SKDAG+VSE
Sbjct: 241 LRKSELAKKIIKKCTALTSNNKSMKCSGCGYLNGSVKKAVSMLGILHYRSRSKDAGVVSE 300

Query: 301 DLRAPYNVSNDILNPFRVLSLFKRMTDEDCELLFLSDRPDNLIITNVAVPPIAIRPSVIM 360
           DLRAPYNVSNDILNPFRVLSLFKRM+D+DCELLFLSDRP+  I+TNV VPPIAIRPSVIM
Sbjct: 301 DLRAPYNVSNDILNPFRVLSLFKRMSDKDCELLFLSDRPEKFIVTNVPVPPIAIRPSVIM 302

Query: 361 DGSQSNENDITERLKRIIQQNASVSQELSTSNSLPKCL 368
           DGSQSNENDITERLKRIIQQNASVSQELSTSNS  KCL
Sbjct: 361 DGSQSNENDITERLKRIIQQNASVSQELSTSNSQLKCL 302

BLAST of MC03g_new0176 vs. ExPASy TrEMBL
Match: A0A6J1I0L0 (DNA-directed RNA polymerase subunit OS=Cucurbita maxima OX=3661 GN=LOC111469417 PE=3 SV=1)

HSP 1 Score: 446 bits (1146), Expect = 8.65e-141
Identity = 250/398 (62.81%), Postives = 262/398 (65.83%), Query Frame = 0

Query: 1   MNRAQVEGLVFTKEPYIEDVGPRKIKSMQFTTFSGSEISKMAEVQVYKGLYYDTTRKPIE 60
           MN+ QVEGLVFTK+PYIEDVGPRKIKSMQF+TFSGSEISKMAEVQVYKGLYYDTTRKPI+
Sbjct: 1   MNKTQVEGLVFTKQPYIEDVGPRKIKSMQFSTFSGSEISKMAEVQVYKGLYYDTTRKPID 60

Query: 61  GGLLDPRMVQLLSYILMGDWRTVSWLGRHWQRLRIGGRRQRSKVRRELAAEEDVGDRLGL 120
           GGLLDPRM                                                    
Sbjct: 61  GGLLDPRM---------------------------------------------------- 120

Query: 121 GERRCWVGRQHRRVERRLGLGMASSGDGEKRTERGTLAAQMRAEGPANKGCKCSTCHANF 180
                                                       GPANKGCKC+TCHANF
Sbjct: 121 --------------------------------------------GPANKGCKCATCHANF 180

Query: 181 GDCPGHYGYLSLALPVFNVGYFTTILEILKCICKSCSRILLEEKLYKDFLRKMRNPKLEA 240
           GDCPGHYGYLSLALPVFNVGYFTTILEILKCICKSCSRILLEEKLY DFLRKMRNPKLE 
Sbjct: 181 GDCPGHYGYLSLALPVFNVGYFTTILEILKCICKSCSRILLEEKLYNDFLRKMRNPKLEP 240

Query: 241 LRKC------------------------------SVKKAVSMLGILHYRVKSKDAGMVSE 300
           LRK                               SVKKAVSMLGILHYR +SKDAG+VSE
Sbjct: 241 LRKSELAKKIIKKCTALTSNNKSMKCSGCGYLNGSVKKAVSMLGILHYRSRSKDAGVVSE 300

Query: 301 DLRAPYNVSNDILNPFRVLSLFKRMTDEDCELLFLSDRPDNLIITNVAVPPIAIRPSVIM 360
           DLRAPYNVSNDILNPFRVLSLFKRM+D+DCE+LFLSDRP+  I+TNV VPPIAIRPSVIM
Sbjct: 301 DLRAPYNVSNDILNPFRVLSLFKRMSDKDCEVLFLSDRPEKFIVTNVPVPPIAIRPSVIM 302

Query: 361 DGSQSNENDITERLKRIIQQNASVSQELSTSNSLPKCL 368
           DGSQSNENDITERLKRIIQQNASVSQELSTSNS  KCL
Sbjct: 361 DGSQSNENDITERLKRIIQQNASVSQELSTSNSQLKCL 302

BLAST of MC03g_new0176 vs. ExPASy TrEMBL
Match: A0A6J1DZJ6 (DNA-directed RNA polymerase OS=Momordica charantia OX=3673 GN=LOC111025743 PE=4 SV=1)

HSP 1 Score: 328 bits (842), Expect = 7.09e-109
Identity = 170/200 (85.00%), Postives = 170/200 (85.00%), Query Frame = 0

Query: 165 GPANKGCKCSTCHANFGDCPGHYGYLSLALPVFNVGYFTTILEILKCICKSCSRILLEEK 224
           GPANKGCKCSTCHANFGDCPGHYGYLSLALPVFNVGYFTTILEILKCICKSCSRILLEEK
Sbjct: 2   GPANKGCKCSTCHANFGDCPGHYGYLSLALPVFNVGYFTTILEILKCICKSCSRILLEEK 61

Query: 225 LYKDFLRKMRNPKLEALRKC------------------------------SVKKAVSMLG 284
           LYKDFLRKMRNPKLEALRKC                              SVKKAVSMLG
Sbjct: 62  LYKDFLRKMRNPKLEALRKCELAKKIVKKCTTLTSSNKSMKCSRCGYLNGSVKKAVSMLG 121

Query: 285 ILHYRVKSKDAGMVSEDLRAPYNVSNDILNPFRVLSLFKRMTDEDCELLFLSDRPDNLII 334
           ILHYRVKSKDAGMVSEDLRAPYNVSNDILNPFRVLSLFKRMTDEDCELLFLSDRPDNLII
Sbjct: 122 ILHYRVKSKDAGMVSEDLRAPYNVSNDILNPFRVLSLFKRMTDEDCELLFLSDRPDNLII 181

BLAST of MC03g_new0176 vs. ExPASy TrEMBL
Match: A0A2I4DDG5 (DNA-directed RNA polymerase subunit OS=Juglans regia OX=51240 GN=LOC108979063 PE=3 SV=1)

HSP 1 Score: 329 bits (844), Expect = 9.24e-98
Identity = 196/407 (48.16%), Postives = 230/407 (56.51%), Query Frame = 0

Query: 1   MNRAQVEGLVFTKEPYIEDVGPRKIKSMQFTTFSGSEISKMAEVQVYKGLYYDTTRKPIE 60
           M  AQ E + FTK PYIEDVGPRKIKSM+F+TFS SEISK+AEVQVYKG YYD  R PIE
Sbjct: 1   MAGAQAEDIAFTKHPYIEDVGPRKIKSMKFSTFSESEISKLAEVQVYKGQYYDAKRNPIE 60

Query: 61  GGLLDPRMVQLLSYILMGDWRTVSWLGRHWQRLRIGGRRQRSKVRRELAAEEDVGDRLGL 120
           GGLLDPRM                                                    
Sbjct: 61  GGLLDPRM---------------------------------------------------- 120

Query: 121 GERRCWVGRQHRRVERRLGLGMASSGDGEKRTERGTLAAQMRAEGPANKGCKCSTCHANF 180
                                                       GPANK C C+TCH +F
Sbjct: 121 --------------------------------------------GPANKNCICATCHGSF 180

Query: 181 GDCPGHYGYLSLALPVFNVGYFTTILEILKCICKSCSRILLEEKLYKDFLRKMRNPKLEA 240
           G+CPGH+GYL LA+PV+NVGY +TIL+ILKCICKSC+ ILLEE L K++L+KMR+PK+E 
Sbjct: 181 GNCPGHFGYLKLAVPVYNVGYMSTILDILKCICKSCAHILLEENLRKEYLKKMRSPKIEP 240

Query: 241 LRKC------------------------------SVKKAVSMLGILHYRVKSKDAGMVSE 300
           L+K                               SVK+A+SM GILH R K  D G V E
Sbjct: 241 LKKTELMKVIVKKCNNLTSGNKAVKCSRCGYVNVSVKRAMSMPGILHERQKCND-GSVEE 300

Query: 301 ---------DLRAPYNVSNDILNPFRVLSLFKRMTDEDCELLFLSDRPDNLIITNVAVPP 360
                    + ++P+NV+  ILNP +VL LFKRM DEDCELL+LSDRP+ LIITN+AVPP
Sbjct: 301 FRSAISHTKESKSPFNVATYILNPVQVLVLFKRMLDEDCELLYLSDRPEKLIITNIAVPP 310

Query: 361 IAIRPSVIMDGSQSNENDITERLKRIIQQNASVSQELSTSNSLPKCL 368
           I IRPSVIMDGSQSNENDITERLK+IIQ NAS+SQEL  ++S  KCL
Sbjct: 361 IPIRPSVIMDGSQSNENDITERLKKIIQANASLSQELLEASSASKCL 310

BLAST of MC03g_new0176 vs. TAIR 10
Match: AT5G60040.1 (nuclear RNA polymerase C1 )

HSP 1 Score: 233.0 bits (593), Expect = 4.4e-61
Identity = 153/395 (38.73%), Postives = 190/395 (48.10%), Query Frame = 0

Query: 11  FTKEPYIEDVGPRKIKSMQFTTFSGSEISKMAEVQVYKGLYYDTTRKPIEGGLLDPRMVQ 70
           FTK+PYIEDVGP KIKS+ F+  S  E+ K AEVQV+    YD + KP E GLLDPRM  
Sbjct: 9   FTKKPYIEDVGPLKIKSINFSVLSDLEVMKAAEVQVWNIGLYDHSFKPYENGLLDPRM-- 68

Query: 71  LLSYILMGDWRTVSWLGRHWQRLRIGGRRQRSKVRRELAAEEDVGDRLGLGERRCWVGRQ 130
                                                                       
Sbjct: 69  ------------------------------------------------------------ 128

Query: 131 HRRVERRLGLGMASSGDGEKRTERGTLAAQMRAEGPANKGCKCSTCHANFGDCPGHYGYL 190
                                             GP NK   C+TC  NF +CPGHYGYL
Sbjct: 129 ----------------------------------GPPNKKSICTTCEGNFQNCPGHYGYL 188

Query: 191 SLALPVFNVGYFTTILEILKCICKSCSRILLEEKLYKDFLRKMRNPKLEALRKCSVKKAV 250
            L LPV+NVGYF  IL+ILKCICK CS +LL+EKLY+D LRKMRNP++E L+K  + KAV
Sbjct: 189 KLDLPVYNVGYFNFILDILKCICKRCSNMLLDEKLYEDHLRKMRNPRMEPLKKTELAKAV 248

Query: 251 SM-------------------------------LGILHYRVK--------SKDAGMVSED 310
                                            +GI H R K         K A   ++ 
Sbjct: 249 VKKCSTMASQRIITCKKCGYLNGMVKKIAAQFGIGISHDRSKIHGGEIDECKSAISHTKQ 307

Query: 311 LRAPYNVSNDILNPFRVLSLFKRMTDEDCELLFLSDRPDNLIITNVAVPPIAIRPSVIMD 367
             A  N    +L+P  VL LFKRM+D+DCELL+++ RP+NLIIT + VPP++IRPSV++ 
Sbjct: 309 STAAINPLTYVLDPNLVLGLFKRMSDKDCELLYIAYRPENLIITCMLVPPLSIRPSVMIG 307

BLAST of MC03g_new0176 vs. TAIR 10
Match: AT5G60040.2 (nuclear RNA polymerase C1 )

HSP 1 Score: 224.9 bits (572), Expect = 1.2e-58
Identity = 153/405 (37.78%), Postives = 190/405 (46.91%), Query Frame = 0

Query: 11  FTKEPYIEDVGPRKIKSMQFTTFSGSEISKMAEVQVYKGLYYDTTRKPIEGGLLDPRMVQ 70
           FTK+PYIEDVGP KIKS+ F+  S  E+ K AEVQV+    YD + KP E GLLDPRM  
Sbjct: 9   FTKKPYIEDVGPLKIKSINFSVLSDLEVMKAAEVQVWNIGLYDHSFKPYENGLLDPRM-- 68

Query: 71  LLSYILMGDWRTVSWLGRHWQRLRIGGRRQRSKVRRELAAEEDVGDRLGLGERRCWVGRQ 130
                                                                       
Sbjct: 69  ------------------------------------------------------------ 128

Query: 131 HRRVERRLGLGMASSGDGEKRTERGTLAAQMRAEGPANKGCKCSTCHANFGDCPGHYGYL 190
                                             GP NK   C+TC  NF +CPGHYGYL
Sbjct: 129 ----------------------------------GPPNKKSICTTCEGNFQNCPGHYGYL 188

Query: 191 SLALPVFNVGYFTTILEILKCICK----------SCSRILLEEKLYKDFLRKMRNPKLEA 250
            L LPV+NVGYF  IL+ILKCICK           CS +LL+EKLY+D LRKMRNP++E 
Sbjct: 189 KLDLPVYNVGYFNFILDILKCICKVTELADYVSLRCSNMLLDEKLYEDHLRKMRNPRMEP 248

Query: 251 LRKCSVKKAVSM-------------------------------LGILHYRVK-------- 310
           L+K  + KAV                                 +GI H R K        
Sbjct: 249 LKKTELAKAVVKKCSTMASQRIITCKKCGYLNGMVKKIAAQFGIGISHDRSKIHGGEIDE 308

Query: 311 SKDAGMVSEDLRAPYNVSNDILNPFRVLSLFKRMTDEDCELLFLSDRPDNLIITNVAVPP 367
            K A   ++   A  N    +L+P  VL LFKRM+D+DCELL+++ RP+NLIIT + VPP
Sbjct: 309 CKSAISHTKQSTAAINPLTYVLDPNLVLGLFKRMSDKDCELLYIAYRPENLIITCMLVPP 317

BLAST of MC03g_new0176 vs. TAIR 10
Match: AT4G35800.1 (RNA polymerase II large subunit )

HSP 1 Score: 119.0 bits (297), Expect = 9.3e-27
Identity = 77/233 (33.05%), Postives = 127/233 (54.51%), Query Frame = 0

Query: 165 GPANKGCKCSTCHANFGDCPGHYGYLSLALPVFNVGYFTTILEILKCICKSCSRILLEEK 224
           G  ++  KC TC AN  +CPGH+GYL LA P+++VG+  T+L I++C+C +CS+IL +E+
Sbjct: 58  GTIDRKVKCETCMANMAECPGHFGYLELAKPMYHVGFMKTVLSIMRCVCFNCSKILADEE 117

Query: 225 LYK-DFLRKMRNPK------LEALR---KCS----------------VKKAVSMLGILHY 284
            +K     K++NPK      L+A +   KC                 VKK+    G    
Sbjct: 118 EHKFKQAMKIKNPKNRLKKILDACKNKTKCDGGDDIDDVQSHSTDEPVKKSRGGCGAQQP 177

Query: 285 RVKSKDAGMVSE-DLRAPYNVSND----------ILNPFRVLSLFKRMTDEDCELLFLSD 344
           ++  +   M++E  ++   N   D           L   RVLS+ KR++D DC+LL  + 
Sbjct: 178 KLTIEGMKMIAEYKIQRKKNDEPDQLPEPAERKQTLGADRVLSVLKRISDADCQLLGFNP 237

Query: 345 ---RPDNLIITNVAVPPIAIRPSVIMDGSQSNENDITERLKRIIQQNASVSQE 358
              RPD +I+  + +PP  +RPSV+MD +  +E+D+T +L  II+ N ++ ++
Sbjct: 238 KFARPDWMILEVLPIPPPPVRPSVMMDATSRSEDDLTHQLAMIIRHNENLKRQ 290

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
F4JXF96.2e-6038.73DNA-directed RNA polymerase III subunit 1 OS=Arabidopsis thaliana OX=3702 GN=NRP... [more]
Q86AQ51.8e-3035.37DNA-directed RNA polymerase III subunit rpc1 OS=Dictyostelium discoideum OX=4468... [more]
O946661.3e-2832.73DNA-directed RNA polymerase III subunit rpc1 OS=Schizosaccharomyces pombe (strai... [more]
Q6BI693.7e-2833.21DNA-directed RNA polymerase III subunit RPC1 OS=Debaryomyces hansenii (strain AT... [more]
Q5ZL981.2e-2635.22DNA-directed RNA polymerase III subunit RPC1 OS=Gallus gallus OX=9031 GN=POLR3A ... [more]
Match NameE-valueIdentityDescription
XP_038894319.19.97e-14764.32DNA-directed RNA polymerase III subunit 1 isoform X2 [Benincasa hispida][more]
XP_038894318.11.12e-14464.32DNA-directed RNA polymerase III subunit 1 isoform X1 [Benincasa hispida][more]
XP_008465290.23.00e-14463.82PREDICTED: LOW QUALITY PROTEIN: DNA-directed RNA polymerase III subunit 1 [Cucum... [more]
KAG6583479.16.33e-14263.57DNA-directed RNA polymerase III subunit 1, partial [Cucurbita argyrosperma subsp... [more]
XP_004148776.11.72e-14163.07DNA-directed RNA polymerase III subunit 1 isoform X1 [Cucumis sativus] >KAE86527... [more]
Match NameE-valueIdentityDescription
A0A1S3CNX91.45e-14463.82DNA-directed RNA polymerase subunit OS=Cucumis melo OX=3656 GN=LOC103502942 PE=3... [more]
A0A6J1HLT98.35e-14263.32DNA-directed RNA polymerase subunit OS=Cucurbita moschata OX=3662 GN=LOC11146478... [more]
A0A6J1I0L08.65e-14162.81DNA-directed RNA polymerase subunit OS=Cucurbita maxima OX=3661 GN=LOC111469417 ... [more]
A0A6J1DZJ67.09e-10985.00DNA-directed RNA polymerase OS=Momordica charantia OX=3673 GN=LOC111025743 PE=4 ... [more]
A0A2I4DDG59.24e-9848.16DNA-directed RNA polymerase subunit OS=Juglans regia OX=51240 GN=LOC108979063 PE... [more]
Match NameE-valueIdentityDescription
AT5G60040.14.4e-6138.73nuclear RNA polymerase C1 [more]
AT5G60040.21.2e-5837.78nuclear RNA polymerase C1 [more]
AT4G35800.19.3e-2733.05RNA polymerase II large subunit [more]
InterPro
Analysis Name: InterPro Annotations of Bitter gourd (Dali-11) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR007080RNA polymerase Rpb1, domain 1PFAMPF04997RNA_pol_Rpb1_1coord: 166..359
e-value: 1.1E-41
score: 143.2
IPR044893RNA polymerase Rpb1, clamp domain superfamilyGENE3D4.10.860.120RNA polymerase II, clamp domaincoord: 133..235
e-value: 7.5E-15
score: 56.7
coord: 15..75
e-value: 1.7E-7
score: 32.9
NoneNo IPR availablePANTHERPTHR19376DNA-DIRECTED RNA POLYMERASEcoord: 150..247
coord: 9..68
NoneNo IPR availablePANTHERPTHR19376:SF57DNA-DIRECTED RNA POLYMERASE SUBUNITcoord: 277..360
NoneNo IPR availablePANTHERPTHR19376:SF57DNA-DIRECTED RNA POLYMERASE SUBUNITcoord: 150..247
NoneNo IPR availablePANTHERPTHR19376:SF57DNA-DIRECTED RNA POLYMERASE SUBUNITcoord: 9..68
NoneNo IPR availablePANTHERPTHR19376DNA-DIRECTED RNA POLYMERASEcoord: 277..360
NoneNo IPR availableSUPERFAMILY64484beta and beta-prime subunits of DNA dependent RNA-polymerasecoord: 23..81
NoneNo IPR availableSUPERFAMILY64484beta and beta-prime subunits of DNA dependent RNA-polymerasecoord: 164..358

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
MC03g_new0176.1MC03g_new0176.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006351 transcription, DNA-templated
molecular_function GO:0003677 DNA binding
molecular_function GO:0003899 DNA-directed 5'-3' RNA polymerase activity