CmaCh02G015690 (gene) Cucurbita maxima (Rimu) v1.1

Overview
Name: CmaCh02G015690
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu) v1.1)
Description: GATA transcription factor 26-like
Location: Cma_Chr02: 8903251 .. 8920933 (+)
RNA-Seq Expression: CmaCh02G015690
Synteny: CmaCh02G015690
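The Location field packs the chromosome, start, end, and strand into one string. A minimal sketch of how it might be parsed is given here; `Location` and `parse_location` are hypothetical names, and the coordinates are assumed to be 1-based and inclusive, which the page does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Parsed genomic interval (hypothetical helper type, not part of the database)."""
    seqid: str
    start: int
    end: int
    strand: str

    @property
    def length(self) -> int:
        # Assumes 1-based, inclusive coordinates.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a string such as 'Cma_Chr02: 8903251 .. 8920933 (+)'."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = m.groups()
    return Location(seqid, int(start), int(end), strand)


loc = parse_location("Cma_Chr02: 8903251 .. 8920933 (+)")
print(loc.seqid, loc.strand, loc.length)
```

Under the inclusive-coordinate assumption the gene spans 17,683 bp on the plus strand of Cma_Chr02.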
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGAAGCTAGAATTGTCTCAGAAAGTAAGGGCACAGTGACCTCCGAGAGAGGGCAATTTCGAGGAATCCGAATGGAGAATCCCTTTACTTTGAAGGTGGGTCAGATATTTACAGGGTTCGGCGTCGGTTGTGGCGTCGGTATCGGCGTTGGCCGCCCCATAAACATGGGTAATTTTCTTTCCTCCATTATCGACTTTCATTCGCCTCCACCAGTTTCTTTCATTAGCTTTCTCTTTCGCACTGACGGATCAAGAAATTTGCCTGTGATTCTAAATACATATTTGTAATTAGTTGGATTGTTCAAGTTTCATTTGAAATCTTAATTTCATCAAATTTAACCTTGAATTTTAAAGTGTTGGAACTAATACCATTTGAAGCTGAGGAGCATTTTTTTCTTTTTGTTCTTTTTAGAGTTGTGGTTAAATTCAATCTCGATAGGAGCTATCTATTTAGAACATTAAAATTATATGTTTCCTAACATGTTGGAGGGCATAGGGTTAGGTGTTGTCCCATAAAGTTTGTGGTTAGCTGTGAGTGGAGCTGAGTAAAACTAGATGCACTCGATCAATTCCTTTTTCAAATCCTGGAGCAGCTGCACGAGCCTGCCATAAATGACCAACGAATAGGAAGAATCGTAGAACAAAATGAGAGGTAGCTAACCAACTTCTAGGAGAGACATAATCGTCTGCAGTAATTTCGGTAGCTATGACTGTTGTTAAATTCTAACTCAATTGTCACTGTTTTGGTTAAGTTTAGGCATTTTAATGTTTGTGCTTTGGCCTTTGATCAGATTTAGGCGTTTTTGTTGTTGGGTTTAGGCTTTTTGCAAATTGCTGATATATGTTTGTTCCTTGGTTCATGGTCATGTCTGCTGGCAGTCTTGCTACTATATGAACGTATGAAACCGATATTCATATATTTTCAGGTGCAATACCTGTCATGAATGAAGTAATGAGTGCCACGAGAGGTGCAACCGATGCACTTTCTGGTGTAACCAGGCATTTAAATAACTCTGTGAGTCCTAACTTTTAAACATTATGCATTTTGTGTTCTTTATTTGCTATATGAGATGTTGGACCATTGAAATTCTCTCAAATATCACATGTTAGCTCAGGAAATTAGGAGCTAAGAACATCCAAGGTGGCATTGGGTGTGGAGTTGGTTTTGGCCATGGTTTCGGTGTCGGTATGTCCAAAAATATCTTCTGAATACCATTTATGTTGTTTACGAGTTTTTGTTCATGTATTGGTCTGTTTAACCGACACGTAGTTTTTTGAATGTTAGGCCTAGCTATCAAGCCTTCATTTCTACAACAAGTTCAATCTTCTGTTATGGTAGGTGGCTGACCTCTTATCCATGATATATTTCCCATTACGACCTCTTATATACATGTATTTGAAGCTTGAAACCCAAATCTGTTAGCAAGCAATGGAGAAGATGGTGACAAAATTAGGTAATAATCCTAATCTTGCAATTAGTCAAAGCGCTGTACCAGTATCACTGCAATCTGCCATGAGCATAACAAATGCTTCGGCTAACAAGCATCCTGTTGCAAGCATTAGAGAGTTTGCAAAAGAAATGCCAGAAACTGCTCCACAAAACCTATCGAGTAGATCGTTTGAAACTCGAACCGAAAAGGTCGTCGACAGTTTCTTGCAGAATCCTGTTTTTAAAGGAGGGGATACTGAACTGCAGGATGAGGTATGATAATACATGCTCGGTTTTCGGTTCTAGACAAGACGAACTCCTGTAGAAACAATGAATGTTTGATGTGGTTTGCTCATTAAATACACTTCTTGCACATGAATTGATCTTGTTATGGAGTCCTGATTGATGTTTCTTGCATTATAGGTTGGACGCCTACGGCTCGAAAACCGTCTCTTTCAAATGGTAAGCTTTTACAATGGCCAGTAAATGTGAGCTGACTTCATTGTTAGAGATGCATCATGGAAAGAACTAGGCTGTATTGACTAAGCTTCTGAATTTGTTTATTCCTAGC
ACAAGTGACCAACTAGTTCCTATTTAATTGTTATTGCTTAAGAAATAAAATGTTCAGACTAATGAATCTTTGCTAGCTTTCAAGAATATGCTTTCCCTCCCACTTCAGTTTACAAATAGCGACTGAAACATTTCTTATGAGAGTGTAGAAACCTCTCTCTAGTAGACGCGTTTTAAAACCATGAGGCTGACGCGCGATACGTAATGGGTTAAAGCGGATAATATCTACTAGCGGTGGGCTTAAGCTTTTATAAATGGTATCAGAGTCAGCCATTGAGCGGTGTGCTAGTGAGGACGTTGACCCCATAGGAGGTGGATTGTGAGATCCCACATTGGTTAGAGGGGAACAAAGCATTCCTTATAAAGGTGTGGAAATCTATCCCTAACAGACACGTTTTAAAATCGTGAGGTTGATGGTGATACGCAACAGGCTAAAGCGGACAAGATCTACTAACAGTAGGCTTAGGCTTTTACAAATGATATCCGGGGTGGATTGTGAGATCCCACATCGGTTGGAGAGGGGAACAAAGCATACCTTATAAGAGTGTAGAAACATCTCCGACGCATTTTAAAACCGTGAGGCTAACGGTGACACATAACGGACTAAAGCAGACAAAATCTATTAGTGGTGGGCTTGGACTATTACAACGAGGATCTCCTCGTTAATCTACCATGTTTTGTTTGTAAATTTTACTCGACTTGTTATGTTGTCTTTATGCAGGTAATGATGCATCAGAAACTTATTCAAGAGCTCAAGGAGGAGAATAACAAGCTTCACCAAATACTAGTGGAAGACCTGAAAATACCACCCAGCAAGCTCCAAGCAAGTAACACAGGTAGAGAATTTTCTCCATGTTCAGGCTGTTTGGAATGCCGAAGAAAGGAAAGGAGAAGAAGAAGCTAATCATTCAATCATTCAATCATTCAATCATTCTAAACGATTTGTTCGACACGGCTGAGCCAAAGCAGCGTTCATTTGATTCTTCATAGTCGTATTCTCTAGTTGCTGCGTTTTCGTGAGCACCCGAACACGAACCGATTGAAAGAATCTGAGGTAACGAGTTCATATCATACTTGAATACGAACAAATGACATCTGAGAAATAGATGCTCTACTGATCCACAAAACTCAGCTACATAGGAACTTGGAGGCTATATTGAGTTCATGAGATAGCCGAGTTTGTTCAAACAAATTAACTGGCCTTTGTTTGAAATTTTGATGGTTCTTGATGCATTTCATCGTCCAAACTGCAAGATTTTGAGTTCGAGTTCTTTGAGTGGTTGATGTAAAGCTATTTTTAACGCGTTTCTTGTGGTGCAAGTAAAAACACATGTTAATCTATCAGCAATAGATGTGTAGTACGTTTATATATATGTGAATAAGTCTGTCACAACATTACTAATATTAATGATATATGAGAGTAAAACCAGAGTTACATAAGAATATCGATTCATGTATCATATTTTTACGAGGAACAAACTTGTAAATGTGACAATCTATTCAGCTACCCACTTTAATGTTCGATAATTTTCAACCTGATCATTTAATATTCATGTCACTGCAATAAACACAATTCAATCACTATAGATTGCTCATATTTCTCCCTAAAATTCAAGGAATTTAGTGATTTTTCTTCCTTAAATTAATATTCAACATTAAAAAATTATGTAAAAATACAAATAAAATGCCAAAAGAAATCCCATTTCTGGAAAATAGTTTGAACGAGAAGGAGCAGAACGGCGAGTTCGAATGATCTAGAAACGTACCTTCCTTAAATAGATATCTCTTATCAGAATTTCGTTTTTCTTTCAGATTTCGGTTCGTCGTCATCGTTAGTTCGAAGCAATCAAAATGGATGTGTGCTCATCTATCTTCGATTCTCAAGCAGGATCCAGAAACCGTTGGAGCTATGATTCTCACAAGAATTCCCGCCAGATTTCGCCGGCCGTTCAATCTCATCTCCAGCAGGTCGGATTCTATATTATTTTCTTTTTATAGTGA
TTTTATCATGTTATCTAGAAGCAGTGGGTAGTGATTTTGAATTTGAATTCCTTTGCTTACCAAACGAACGTTGTTGTTATTGAGCAGATGAGTTTAGAGAAAGAAGATCTAATGAATTAGAAATTTAGTACCTAATTCTTCCTTCATATGGATGTGATTATAATTAACCTTATGAAGTATTGCGAATCTGCAGGTTTACCTTACTCTAGGTTGTGCTTTAGTTGCATCTGCGGCTGGAGCTTATCTTCATATGCTTTGGAACATTGGCGGTGTTCTCACAACACTTGCTAGTATCGGAAGCATCGCATGGCTAATGGTCACTCCTCCATATGAAGAGGTAGATTTCTTTACTTATTTTGTGTTCTCTGTACTGTATCTTTGGTTTATGTCTTGTGATTTTATCATCAGAAAAAGAGGGTTTCTATGTTAATGGGGGCTGCTCTTCTTCAAGGAGCTTCAATTGGTCCTTTGATCAGTGTGGCTATTGAGATTGATCCGAGGTAAGCTATCGGCAGCATACAGTTTTTATTGTCATGATTCTTTTACTCTGCCGTTGAATCAAGCGACTGTACTTTTGTGATTTGGCCTCTGCAATTGACTTGAATTGAATTGTTTTTGTACTGTTAGGCTTTGGGAATGGCTGATCATTTGAAATTACACCATTTTATTGATTTCATAAACTCTGTGTGTTGTTCCTAGTGTTCTGGTCAGTGCATTTGTGGGAACGGCAGTGGCCTTTGGTTGTTTTTCAGCAGCAGCCATGTTCGCAAGGCGCAGAGAATTCCTTTATCTGGGTGGCCTGCTTTCTTCTGGAATATCCATGTTACTCTGGCTGCGTTTTGCTTCCTCTATATTCGGTGGTTCCACTGCCATTTTCAAGTTTGAGGTGTATATATTACTGTTTCAAGCTATTCATGAACAGGAAAGTAGACATTGATCCCCTTGCCTTGCTGCAGCAGGCCTGACCTCAATTTTGTGTTATTCGTCTCATTTGTAGTTGTATTTTGGACTTCTGCTGTTGGTGGGCTACGTGGTGGTTGACACTCAGAAAATAATCGAGAGGGCGCATCTTGGTGATGTGGATTACGTGAAGCACGCATTGACTCTCCTTACTGATTTCATTGGTGCTTTTGTCCGAATTCTCGTTATAATGGTGAGTTGTTCTTCATTTTGGTTGAAGAAACATTTGAATTCTTTGGCCAAACTTTCAAAACAAAACAAGAACAAATAGTCAAATCAAGCATTAGTTAACGATGTTTTTGTTGGTTATTCCACTCACCCAATTCAATGGACTTCACTCTTTTGCAGCTAAAGAACTCCACAGTGAAGAACGAGAAGAAGAAGAAAAAGAGGAGGGGGCTAAATGAACGTTCGTACCCGAAAGCAGTAGCTTTGGTGTTGGGTTAAGTCCCGCAATGAGGGCAACCCAAAATTCTCTCCAAGTAGGATTTGAACTCACGATCAGTTAGTTAACAGCCAATTGATTGGATTTGCCGCGGACACCCGAGAGACCTCCCCAAGGTATCCTTGTAAATCTTGTGAATCGTAGCATGAGTTTTTCTTGAGATTTTCGTTATTAGTACGTAGTTAGAAAGGACTGTGCTAGCTTGTTTGGGGTTGAGCAGAGATCAGATACAGACATTTAGTTTTATCTTGTAATTTTCCCATGAAACACAACCAAAGGTTAGAAAGGTTTTCTTGTGGTGTAAGTCATCTAATTAATTGATATCATTTTCTCTCCTACATCGCCTTGCGAGATTGAAATTAAATTTCCGTCCAAAATATTTAAAATACATTTTGCAATTTAATATTGCATAATAATATAATATTGAAAAAAAAAGAAAAAAACGTTTCTTGTTGTAAATAGCTTAATAGAATATTTTTTCGGCTAACATTCTTATGCGATGCTGTTGTCTAAATTAAATTAATAACAAAATTTGAGTATTTTTTTTTCGAAAAAATGGATCACTTAAAAACAAAATTTATTCAATTTTGAATAG
AAGTGAATCTATATTATTTTTTTTACTCCATAGTTCAAAATTGTTCAATAATCTATTTATTTTATAAGTTATAAAATTGATTGTAGTTGCAAAACTAAATATTACATAAGTATAAATTATTATTAATTAAAATTTAAATATTTTTTGTTGTTATAATTTTTGAATAATTATAAAATTTGTAGAATTTTTTATATAAATTAAAAAATGTTGTTTTCATAATTTCACACTTATAAACGGATGCTTTTAAAAGCTTAAGTTTTTCAATTTATCCTTGAAAATTGAAAATTAACAATGGTGTAAAGATGCATAGCGAAATCATTTTCGACCATATATAAATTATAAGCTCGAAGCAAACCCGCCCCCGCAAGGGAAAGAAACACAGCGTCTCGGCATCACTGTGTTCGTTCCAAATAGATCGATGGCTTCATCTGGTATGCTTCAACCCCCAACCCCATCTTTTGTAATTAAGAAATTCAATTTCTAAATGGGCTGTCATATTTACAGATTCGGCTCTGTTTGACTTCGCAGGGAAATCTGTGTGTTGAATAGTTGGAGGGATTTGGATGGACAAGGTTCTAGAGCTTGGACGAAGAGCCCTCTTCTACGTTAAAGTCCTCTCCGGCTATGAAGAGCGGCGGATTCGATCGTTTAGGTTGGAGCTCGAAAAGCGTCTCAAGAAGGTACTCGGGGTTGGATCTTTTGGTTTTGGTTTTGTTTTGTTATGATTAGGCAATTTCTATTAGGGTTCTTGGTTTTTATGTCAAGGTTTATTGGTATATCATGGTGGTTTTCGCTTTGAAAGGATGGAACTGGTTATTATGTACACTTAAATTCGGATGCAATGAGGGAAAGTTTAAATTTCTCAGCCATTTTGTGCTTGGAACATGAGAACTTTGACATTTTGAAGAACTTTCTATTTGAATGATTGGGAAGCATAGAGTGGGGATTCGAACTTCTGAACTTTTGGTTAAGTATATAATACCTTTGGTTTGATCAATGTTATTCTCAAGGCTACATTCTTGGTCATCGGGCTAACTTGTCCCGATGAAGTCATGAGTATTAGTTTTTGACTAAGAGTTGTGTTAGCTGATTATTACGCTAAGTCAGTAAACTCTAATAAATGCTGCCATAGGCCATTTATAAATTGAATCAGCCTACATTTGGAAGTGATGGTTTTGCACTTGGTTGGTCTTTCCTTTTCTTCATCCCACTTCTTTTTGTTTCTAAAATTTGGAGAACTTAGTTTCAAGAGGCTATCTTTTTTCCTCTTTTCCTTAGAGGACCAGACGGATTACTTATGGTTGGGACAGCTTAACTTGTATGGTTCGAATAGAAAAAATGAGATAGCTTGACATGGGTTTCTTGGGATCTTCACAGAGCAACAGAGAAGCTCTGTTGTCCAAAATAAATTATCTGACTTCTGTGATTTTGTTAAAACTATTTGAAGACGGCTGCTGAGTTTTTCAAACCTTTATTTTTGTGAGATTCTACATTGGTTGGAGAGTAGAACAAAACATTTCTTATAAAGGTGTGGAAACCTCTACCTAATAGATGCGTTTTAAAACCTTGAGGGGAAGCCCAAAGAGGACAATATCTGCTAGCAGCGGGCTTGGACTATTACAAATGGTATTAGAGCCAGACACAGGGCGGTGTGTCAACGGGGACGTTGGGCCCCCAAGGAGGGTGGATTGTGAGATCTCACATTCGTTGGAGAGGAGAACGAATCATTCCTTATAAGGGTGTGGAAACCTCTTCTTAATAGACGCGTTTAAAAACTTTGAGGGGAAGCCCAGAAAGGAAAGCCGAAAGAGGACAATATCTACTAGCAGTTGGCTTGGGCTGTTGTATTTATCAAAACTTAGTTTCCTAATGTGGATATTGAAGTTTTCATGTGTAAGGGACTATATTTTTGGGCGTTTGTTTGAGAATTAAGTAGACCTTTTTCTTTTTTGTACCAAAAACTTGCATCTTCTTTGGCATTTTTCTTTTGATATACGTCT
TCAGCATTGCTTTATCTTAGCTCACGCCTTGCATTGTACCAAAAAAAAGTTAAATATTGAGCATTTATGATGTTGGATGTTAAAAAATATTCAATTCATTCACCAACAAAATTAATATGTTAAACTGGCGTTTCATCCCTTGCTAGGCAGAGGAAAGAAAAGCTGCAATAAGAAAGATACCAGAACAGGCTATCTTAGGAGAAGTTCGACGCATGGTTGAGGAGATGCAGACTCTAAATAAAAAGTTGGAGGAAACCGTAAGTTTTATTTCGTCGGTTTTCCAAAAATATTCATTTTTCTTACAGTGGAATCGATATTAAAGATGTTTCTTCTCCAATTTTCGACCTCATAAAAAAGTACACTTGTTATGAGATCAGAATCATTAAAGAGTTTACCATATTCATAATCTGTGAACATCTGCCTTCATATTTCACCAGTGGTAATCGTGTCGATATTGGCTATCGTCATTTATCAGTTAACAAGTTCTTGTTTGCTATAGTATACCCATTTCTTCTTTGAGGTTAAAAAAAGTGAAAATATATGCAGGAGTCTGCCATTGAAGAGTACTTCAAACCAATCGACAAAGAAGTAGAGACATTAATGAGAGTGCAACTTGAAGGAGAAGAGAGAACAATGAAGAACATGGTTAAAGTGATGCAGCAACAGGCTTTGTTAGAAAAAACTGAGGCAGATAAGGTAGGTAGTACCCTTCAAACTGATAAAAATCAACAAAATCAAGACCCTTCAAAAAACTGAGTCAAGGTGATTTTAGGATAAGATAGTGATGATTTGATTTGATCAGATTAATAAAGTCTGTTATCCTGCAATTCTGGAATTTGGATTTTGGTGCCACCAATGTCATAATATATACACTTCACTCATTGCATAGTCTTATTTATTTACAGAGATATAATTGATTATATCAAGACCTTCCAAGTTTTTTAGTAGTGTTCATCAGAGATGCTTATTATTAATTGGGTAGGGGCCTTACTTCTGACACAGTCCTCTGAGCTCACTCCCACTAGTTCTTAGTGAAGCTAGGTCTATCTAACTAGAGGCCAGCATCCCTCCGAGGAGGGTCCCTTCTTCTTGTGTTCTGCTGGCTTTGAAGATATAACAGCCCAAGTCCACCACGATATTGTTTTTTTTGAACTTTCTCTTCCGGACTTTCTTTAAAATTTTTAAACGCCTCTGCTAGGGAGAGGTTTCCACATTTTTATAAAGAATGCTTTGTTCCTCTCTCCAACCGATGTTGGATCTTACAATCCACTACTCTTAAAAAATCAGTGTCTTCGTTGACATTCGTTTCTCTCTTCAATCGATGTGGGATCGCACAAAAGGGATGGAGCACCATAGTTTGGGCTTATGTGTGTCGAAGTCTTTTTGTGATCATTACCCTTTAGGTATTATTTTGTTTATTGGAGTCCCTTTTAGGTCAATTTGGCTGTTTTTGGTGGGTGGTTTTTTGTGTTACTCTTGTTTTTTTCTCAATGAAAACTTGGTTTTACCTACAAAAAAAGATTCAGAGTTTTCTAGTTAAGCTTGAAAGACCTTTGGATGATGTCTTCAAATATTCTAAAGCCTTGCCACCAAGTCACCCCTTAGAAATAGACTTTATAACACGAGACCTTTTGATTGAATGAGTTACATTCAACTTCGGACCTTAACCCATAAACTCTTCTTATCTTTGAATTGTTCGACTGAAAAACCTATAAAATATTTTGAAAGAAAAGAGGTTTCCCATTCATTAATACTTCTTGTTAGTCTGTCTCTTAAATGTCAATTTTCAACCTCTCTGTTAGGCTCCATTGTTTAAAATTTAAACATTAATTCATGATTAGTCAATTTCAGGTTGCCCTAATTTCAAATGGATAAGGTAAGACTGCCCATATCTGTTTCAATTTTCTATACACCACTTTTTAAGGGATTTTTTTTATTATTAAAAAAGATCCACATTTTTTTTTTCTTTTGAAAATTTTGATGTTTACATGAAAATTATA
CAAATTCTAGAAGAAAACATAAAGAAAATCCTGATCCGGAGATAGTTCATGTAGTATTCAAATTATATAGACATGTATTTTTGACCAATTTTTTATTTTTTATTTTCACGCTTCTTATTTTGGAAAGAGATGGTCTTCATTTTGAACACAATTAGCATTCTTTTCGGATAACTTTGTTGACCTTCTTCGAACTTTTGGGTTGAAAGTTTAGGGTTTATTCTACGCTTAATAAAAGCTAATGAAATCGACACTTAACCCTTAATAAAAGCTAATGAAATTTAGTGTCTCTAAATTTGTAGCATATAGAAGAAGCCTTAGAGTTATTAAAAAAGGGATAAAGGGAACCTTGGAGGCAGGGAAGTGCGTTCTGCTGCAAGGATTGCCCGACATTATTGGATTGTGGCGCCACTTAGGCCGACATCATCCATCCATTTAAGCGCCCCGCCCACGTTTTGCTGATTAAAGCCCTAAAAAAAAGTTGTGGAATTCATGACATTCAAACATCTTTTATTGTTTCCAGTTGGGGGATTCCAAAATTGGTAGGTTAGGATACACAAATCAATTTTTTAACCCACTTTTACGAGCCCGTTCGATATTTTAAGCTCGATGATGTTAAGTAGAACGATTTCAACGAAGGGTTTAGAATGTTTTAGTTTGGAGGGGTAGGTTTAAGATACTTAAAACTAATAATATAATCGGGAGCAAATTTGGATTAGGATATGTTATTTCTTACTTACATTCATAAGTTTAATTTCCCTTGTTCTAACCTTTGTCTCCTTTTTATTAGGATTTTTTGTAGTGAATGTACCAATGATGTCTCGATTAAGTTGTGATGGACGATGGTTGACGACATTTCGAGACTGGAATGACGTCAAGACATATGGGTCATGTCGATTGGAAATCAAGTTGATAAGGTGAGATGCGATGTTGCATTGAGAGACCTTGAGCCTGAGTAGAAACAAGACCGAGTCAAATCACTTAACCTATTGTAGAGGCAAGTCGGTTGTATCACAGACGATTGAATATGTACGCGCAAACAACATGTGTGGTTGACATTCCACACAAGCAATGTTGTTCAGTTGGCGAAGACAAATCTTGCCTAAAATTCAACAAAGCCATGAAGTTGGCAAAAATTATATAGGAGTGTGGAGGATTATTGAGAGTGAGTCCCACGTTGGTTAATTTCCTGAAAAATCATGAGTCGTATGGCTTGAAGCCCAAAGAAAAATCACGAGAGTTTATACTCAAAGTGGATGGAGATTTGTGATTCCTAACAATGAGACTCTAGTTTTTTTTTAAGACTGATCATGAGTTAGTCAAGATCACGACACACAGTTGAAAAAGTTCGACCATGGCAATTAATTTTTTTTTCCCTTAATTCTAAATTTTGTACTTGTTAATAATAATAAAAAATAAAAATAAAAATTGTCCCTCTCATTGAAAACTATATATTATTATTATTATTATTTTGTTAAAAAAACAAAAATTATAGTCTACTAAACTAATATTTGATTATGTTAATTAATGAACAAAGATTAACAGGGTCAAGTTGAAAATTTCAACTCCACTATTTGCAAGCTGTTTACATTAATGAATATTCTAAAAAAAAAAGGATTATTTGATTCAATTTAATTAATTGATCAATTAGTCAATTACTTCTTAATAATCCACTCCTTTCTTTTTTTTAATAATTCAATCACTAACCATTTTTAAAAATACAAAATCATGCAAACAATTTAGGTTTGAAAAAATAACTTTTGATTTTTTAAAAGTTGATTTTAACAAATCCATATTTAAAATTAATTTAAAATCGACCCAATTAAATTTAAGCATATAAAAGTATAAGAATTAAAATGATATTTAATCGAATAATTTTAACAAATCCATATTTTTAATCACTACCCATTTAGATCTAAGTTTTTTTTGGTTGAAAATCAAATATAGCTTGAAATATAATTAAGGAATAATATACATTATCTCATACCATGTGCCCACTCAAC
CTCTGATTAGACAGCCTTTCTCTCTCTAAGAGGGGAGAGAGAAACAATTATATTGGTATATAGAGAGAATTTATGGCCAAAATATATAAATATAGACTTTTAATAATGTAAAAAAAAAAAAAACAGATTTTTTATTTTTATTTTTAATTTTTTTAAAAAAACCCATTATTTATTTTTAATTTTTTAAAAATCATTTGATTTCATTAATTTCCTTATAGATCTTTTATTATTATTATTTTGTTAATAATTTAGGAAAAGTTTATACAACATACTAAAATAATATTCTTTTTTATCAAAATTTATTGGCAAAAAAAAAATATATATATATTTTCCATATCCAATTTCTATTTATACTAAATTCTTAAATCCACAGCGCTCACAAAAATCTTTAAAAAAAAAAAATGGATAATGAAATGAGAGAGTGATATGCCCATAGGAAACGACATGTCGTATCCCGAAGTCTTAATCAATTTATGTAAAACAAATAAATAAAACGAAGTATTTCCTTCTGCAAATAAACTATTTTTTTGGCCTCATAAAATTGAATAATTCCAAGCCATATTAAATAGTTTATTTTCTGTATTTTGAGGGTATTTTTGACAAATAACAAAAAATTTGTGGGGTTTTGTAAATTTTAGAAAATAAAAAAGTCCACAGTCAAATTAGACATTAAGTTATATATTTTAATTAAAAAAAAAAAGGAAAAAAAATTAAACTTTATAATTTCTTCTAATAGTAAACCAAAACATAAAGCAAAGCTGAATAATTTCTTTTTCTTTTTCTTTCTTTTTCTTTTTATTTGTGGTTTATTATTATTATTATTTAAAAACAATTAAAGAATTTAGCGCATAGTCACAAGGCCTCTCTTTCAACCCACCCCCTCCTCTTCCCCCAACCATTTCACCATTCTTTTTCCACAGCCATTTATTAGATTCCTTTAATGGTCTTTTCTTCGCCATTGGATTTCGCTGATTCTTAACCAAAAAACCCTTCTCAATTCCTTCGTTTCTTCTTCATTTTACTCTCTTTTCTCTCATTTTTTGGACTTTTGGACATATAACTCTTCTGGGTCTTGTGCCTTTTTCTTTCTTTCTGCAATTTTTCTCTGCTGGGTTTGTTGTATACAAGTTTTTTTTTTGATACCCAGATGTATATGCTTACAGATTTGTAACAAAACACTCATATTTCATAATTTTCTTGATTTTTTTCTTGTGAATAAATAAGGTATTTAAGTCAAGTTTCTTCGAATTTTCTTTTTCTACTGAACAGTATATTTAAATAAAGAAGAAAAAGAAAAAAAAAGGGGGAAGGGAGAGTGATATGGGAAAGCATGGACCTTGCTGTCACTGTGGAGTTACAAGTGAGTTCTTGTTCCTCCATTTTGTGTATTTGTTTTGTTGTTTTTTTTTTCTTCTTCCTGTTCTGTTTCTTGTTTGGTCGATTTTCAGCTTGGTTTCTAAAAGCTTTTGTTGTTCCAATGCTCTTGTTTGCTTGAATGTGTAAATTCCTTATTGGGTTTGTGAATTAGGAGAGAACTCTATGAAGAAAACAAAATTCAATCTAAGATATATATTGCTCTCTGTATTACTGATGTTGTTTTCCTTAATTTTATCAATGATCCCCTAAGCTTAACATTTATACTGATGTGTTTCCTGAAGTCTGGACTGTTCTTAATGTGCAGCATCATTTCTCACGAGTTAGCATTGATTATGAGCTGAATTAGCATTCATGTTCTTATCTTCTCAGTTTAGCAGGTGACTTGTGATTTAGTACATTGATAAACCTTGTGTGATTCATCATCTAGCTTTCTAATTCGACTTTCGTAGTGAGTTCTGGTGGTTTCGTGTTAGCTGCTGTTTTGATGTTCTTCATGTTGGTTCTTGAATTGTTTGAAGCATTCAAGTTTTAGGATTCTGGTTTAATCAGATGAAAAATACTTTGTTATATTACATCATGAATCTTCTGTACATGTACGTACCTCTTAATTTCCGAGTATCTAA
TGTTCAAGTGCTGGAACAGGCACGCCTCTTTGGCGTAATGGACCTCCTGATAAACCAGTATTATGCAACGCATGTGGATCCCGATGGAGGACGAAGGGAACGCTTGCAAACTACACCCCTTTGCATGCTCGAGCTGATCCTGATGAATATGAAAATCACAGGGTTAAGAGCATTTCAATAAATAAAAACAAAGAAGTGAAACTGCTAAAAAGAAAGCTGCAACAAGCAGATGGATCGATTGGGTGGACGATTCCCGATCAAAGTCAGGGTTACTATAGAGTAGTGGATGAAGATACAAGCAATAGATCCAGTTCTGGGTCAGCCATATCTAATCCTGAGAGCTGTGCTCATTTCAGCAGTGCTGATGCAAGTGACTTGACAGGTTAAACACTACGCAACTTAGAAACATTATGATAGCTTTCATTTTTCATCCTTGCATCAATTTAGTTCTATACTAGAAGATTTTCATAACAACTTAATTCTTCAAACGTAGTTTTCTTCTATTATTCATGTTTGTGAAAGAGCTATATAATCCTATCTCTGACCGTAACCTGCATTTGGCCATCTTAAGGTCCAGCTCAGTCGATAGTGTGGGAGGCCATGGTGCCTTCGAGAAAGAGGACCTGTGTGAATCGTCCAAAGCAATCCCCGGTCGAGAAACTAACTAAAGATCTATATGGTATTTTATGTGAGCAGAGATCTTCCTATTTCTCTGAAGCTTCTGAGGAGGATCTGCTTTTTGAGAGTGAAAAGCCTATGGTCTCTGTTGAGATAGGCCATGGAAGCATTCTCATTAGGCATCCGAGCTCAATAGCTCGAGAAGAGGAGTCTGAGGCTAGCTCGGTTTCAGTTGATAACAAGCAATACTTGGTAAATGAGATCTATTCCCCCCATTCTGCTACTGTTCTTGTATGTAGTGAAAACAAGGGCATGAATTTTCCACCATCTAGGATTGGAAAGATGATGAATCCTTCTGGATCAGGGGTACAACAAGTACAAATTAAAAGGTTTGTCCAATCTACTTTACAAAATTGAAATTTCCCCATTTTTATTATATCGATCGATGTGAAAAGCTGAACGAAATGATGTTTAAGTATGTGAGATGCAAATAATGGACTGTCCCTGCTCTTTTTCATCTCGAAAACACTGATAGGACATAGAATCCATCATGGCTCCAGTGAAAAGGGAACAAAGAAAGAAAAGAAGTATCCTTGCTTGTAATATTGCAAGATACGACACACTTATCGGTCTGAGCGTGGTCTGCTCGTTTGGCTATAGTTGTTTACAAAGAAAAAATAAATTTACTTCGTCCTATATAGGTGCTCAATAGTCCTACAAAGGGCATTGTTTCATTCATCAGGCATAGTTAAGGCTAAAAAGATGTGGCTAAGAGTGTCAAATATTGTAACAACCCAAGCTTTCGGCTCCCAACTAGCAGATATTGTCCTCTTTGGATTTTCTCATTTGGGCTTTCCCTCAAGGTTTTTAAAACGCGTCTACTAGGGAGAGGTTTCAACACCCGTATCAAGAATGTTCCGTTCCTCTTTCCAATCGATGTGAGATCTCATAATCCTCCTCCTTCCAGACCCAACGTCTTCGCTGGCATACCTGCTAGTAGTGGGCTTGAGTTGTTACAAATATATTAGGAAACTAGTGTAAATTTGTTACTGCAAACATTGGAGACAAAAAAAAAAAAAAACAATCACCATTTTTTTCTCTTGAAATTGCAGGGACGACTCTCATCATGAAAGTGCACAAATTCTTGGAAGCCATAACTCGCCCTTATGCGACGTAGATATAAATGTAAGTTGCGTCTTTTATCGTCTTGCTTGATTCCATTAGAAAAGAAACTTTAAGCTTAGGTGTTATGATTCCAAGATTGCGCGTCACTCGTGTTTTTGTTAATATCATTACTTGATGACTGCTTATACCGAAAAAGGATAGAATTCTATTTGATGATAAACTAAGCTTGCGGTGCCTTACTGAATGATCTTTCCTAT
TCTTAGTGGAAATATACTTCGTTTTTGTTTCTAGGACATTATAAACTTCGGAGAGTTTGTGAAACAACTTACAAATGAAGAACAACAGCAGTTAATGAAGTATCTACCACAGATCGATATTGCTGAGCTTCCAGAGACGTAAGAAACAATCAATTTTCTTCTGTTCCTCCCTACAAAACACATCACCACACAGGGAAAAGAAAAAGAATTAGCTCAAGAGGGCAACATCACTTAACTGCACATGGTGCCCCATGTGTGCAAAAAGTATGGATTCCCTATTTTATCTTACCGTGTCAACGACTTGCAATCGCTCCATAATATGAACGATTCAAAGGATGTTATTGCATAGTTAATGTTTGATTTGAGTTTGATAGAGTGCACTACTAAAGAGCATGTTTGATGGATGCTTATGATGTTTTATTTGCAGCCTCAAGAGCATGTTTGATAGCCCCTATTTCAAGGAAAGTTTAACTTCCTTCCAACAACTACTTAGGGAGGGTGTTTTCGACACTTCCTTCCTTGGGACAATGATCGAAGATTGCACGACTCTGAAAATGCTTGTACTATGTAACTCTTCGAAATCCAAATGGGTCGAACGCTATCATCAACTGAAGGTTCGGGCTGATTTTATTGTGATTTGGTTTTTGTTTGATGAGAATGTGGGGAAGAAAATTTTAAATATGCATCTTCACTGAATATAATGTGCCTGCAGAAACGTAAAAATGATGGCGAAGGATCTTTTCTTTCCAACGCCAACACGTCTGTGTCGAGTAACTTCATGAATGTGAAGCGATTGCAGGAGAGCTACAATCAAAATGTTCCTGGTAAACAGAAAATTCTTTGTATGGTAAAATTAAAATGCTATTCTTCTCCGTTTGCAGGATGATAATACTTGTTTCTTCTTAACTTAGATTTAAATGGCTTACATTACGAACTTCGACGCATCATCTTACCGGTGGCTAATCTGCATAGTAAAACTAAGAAACACGTGTTTCTCCCGAATCTGAATTGTAGTATAAGGGCATTCATAGTAAAACTAAGACGATTTTGCCGATACAAAGTTGAACTATGATCTGCTGCAGACTTTATTCCATATAATGCTGTCGTGTTTAAATTCTTTTCAACCGTACTGAGTAAATATTTTCCATCTTCAGAGGTAAAGACTATCATGAAGAGTCCCAAAAGGTTGGTGATGAAGGAAAATAAGGATCCTGGAGAGAATGATGGGTCTTGCTTCAGTCCTAGAAGCTTGTTTGCCTTGCCTACTGATGGAAGCTTTGAATATTTACAATTCATCGAAAGAAGTTCGGATCAAGACCTATTGCTCGATGTGCGATCAAATAATTCATTCCCACAGGCCGAGCTCCTTCACCCGACCTCTCGTACGGGTGGTAGGCAGGCCAGCACATGTAGTAGCTCGGTTCACTCAAATCTTGTACATCACTAACAACTGTATTTTCCATTCATAACCACTGTGCCTGCTGCTATTAGCTAATAGCTATGGCCGGGACTCGCATGTTTTTTGTTGTAAAAGAAAAGTCAGTAAAGGTCGAATGGCATCTTTCGTTTTTCGATTCACCCTTTTTTGAAGTTTGAATGATGGAACCATGACTTTGGGTGAAAATTTTGTATAGGTTAACACAAGATTTGTAACATATATATATGCTTGAATATTATGTATTG

mRNA sequence

ATGGAAGCTAGAATTGTCTCAGAAAGTAAGGGCACAGTGACCTCCGAGAGAGGGCAATTTCGAGGAATCCGAATGGAGAATCCCTTTACTTTGAAGGTGGGTCAGATATTTACAGGGTTCGGCGTCGGTTGTGGCGTCGGTATCGGCGTTGGCCGCCCCATAAACATGGGTGCAATACCTGTCATGAATGAAGTAATGAGTGCCACGAGAGGTGCAACCGATGCACTTTCTGGTGTAACCAGGCATTTAAATAACTCTCTCAGGAAATTAGGAGCTAAGAACATCCAAGGTGGCATTGGGTGTGGAGTTGGTTTTGGCCATGGTTTCGGTGTCGGCCTAGCTATCAAGCCTTCATTTCTACAACAAGTTCAATCTTCTGTTATGCAAGCAATGGAGAAGATGGTGACAAAATTAGGTAATAATCCTAATCTTGCAATTAGTCAAAGCGCTGTACCAGTATCACTGCAATCTGCCATGAGCATAACAAATGCTTCGGCTAACAAGCATCCTGTTGCAAGCATTAGAGAGTTTGCAAAAGAAATGCCAGAAACTGCTCCACAAAACCTATCGAGTAGATCGTTTGAAACTCGAACCGAAAAGGTCGTCGACAGTTTCTTGCAGAATCCTGTTTTTAAAGGAGGGGATACTGAACTGCAGGATGAGGTTGGACGCCTACGGCTCGAAAACCGTCTCTTTCAAATGGTAATGATGCATCAGAAACTTATTCAAGAGCTCAAGGAGGAGAATAACAAGCTTCACCAAATACTAGTGGAAGACCTGAAAATACCACCCAGCAAGCTCCAAGCAAGTAACACAGGATCCAGAAACCGTTGGAGCTATGATTCTCACAAGAATTCCCGCCAGATTTCGCCGGCCGTTCAATCTCATCTCCAGCAGGTTTACCTTACTCTAGGTTGTGCTTTAGTTGCATCTGCGGCTGGAGCTTATCTTCATATGCTTTGGAACATTGGCGGTGTTCTCACAACACTTGCTAGTATCGGAAGCATCGCATGGCTAATGGTCACTCCTCCATATGAAGAGAAAAAGAGGGTTTCTATGTTAATGGGGGCTGCTCTTCTTCAAGGAGCTTCAATTGGTCCTTTGATCAGTGTGGCTATTGAGATTGATCCGAGTGTTCTGGTCAGTGCATTTGTGGGAACGGCAGTGGCCTTTGGTTGTTTTTCAGCAGCAGCCATGTTCGCAAGGCGCAGAGAATTCCTTTATCTGGGTGGCCTGCTTTCTTCTGGAATATCCATGTTACTCTGGCTGCGTTTTGCTTCCTCTATATTCGGTGGTTCCACTGCCATTTTCAAGTTTGAGTTGTATTTTGGACTTCTGCTGTTGGTGGGCTACGTGGTGGTTGACACTCAGAAAATAATCGAGAGGGCGCATCTTGGTGATGTGGATTACCTCGAAGCAAACCCGCCCCCGCAAGGGAAAGAAACACAGCGTCTCGGCATCACTGTGTTCGTTCCAAATAGATCGATGGCTTCATCTGTTGGAGGGATTTGGATGGACAAGGTTCTAGAGCTTGGACGAAGAGCCCTCTTCTACGTTAAAGTCCTCTCCGGCTATGAAGAGCGGCGGATTCGATCGTTTAGGTTGGAGCTCGAAAAGCGTCTCAAGAAGGCAGAGGAAAGAAAAGCTGCAATAAGAAAGATACCAGAACAGGCTATCTTAGGAGAAGTTCGACGCATGGTTGAGGAGATGCAGACTCTAAATAAAAAGTTGGAGGAAACCGAGTCTGCCATTGAAGAGTACTTCAAACCAATCGACAAAGAAGTAGAGACATTAATGAGAGTGCAACTTGAAGGAGAAGAGAGAACAATGAAGAACATGGTTAAAGTGATGCAGCAACAGGCTTTGTTAGAAAAAACTGAGGCAGATAAGGTAGGCACGCCTCTTTGGCGTAATGGACCTCCTGATAAACCAGTATTATGCAACGCATGTGGATCCCGATGGAGGACGAAGGGAACGCTTGCAAACTACACCCCTTTGCA
TGCTCGAGCTGATCCTGATGAATATGAAAATCACAGGGTTAAGAGCATTTCAATAAATAAAAACAAAGAAGTGAAACTGCTAAAAAGAAAGCTGCAACAAGCAGATGGATCGATTGGGTGGACGATTCCCGATCAAAGTCAGGGTTACTATAGAGTAGTGGATGAAGATACAAGCAATAGATCCAGTTCTGGGTCAGCCATATCTAATCCTGAGAGCTGTGCTCATTTCAGCAGTGCTGATGCAAGTGACTTGACAGGTCCAGCTCAGTCGATAGTGTGGGAGGCCATGGTGCCTTCGAGAAAGAGGACCTGTGTGAATCGTCCAAAGCAATCCCCGGTCGAGAAACTAACTAAAGATCTATATGGTATTTTATGTGAGCAGAGATCTTCCTATTTCTCTGAAGCTTCTGAGGAGGATCTGCTTTTTGAGAGTGAAAAGCCTATGGTCTCTGTTGAGATAGGCCATGGAAGCATTCTCATTAGGCATCCGAGCTCAATAGCTCGAGAAGAGGAGTCTGAGGCTAGCTCGGTTTCAGTTGATAACAAGCAATACTTGGTAAATGAGATCTATTCCCCCCATTCTGCTACTGTTCTTGTATGTAGTGAAAACAAGGGCATGAATTTTCCACCATCTAGGATTGGAAAGATGATGAATCCTTCTGGATCAGGGGTACAACAAGTACAAATTAAAAGGGACGACTCTCATCATGAAAGTGCACAAATTCTTGGAAGCCATAACTCGCCCTTATGCGACGTAGATATAAATGACATTATAAACTTCGGAGAGTTTGTGAAACAACTTACAAATGAAGAACAACAGCAGTTAATGAAGTATCTACCACAGATCGATATTGCTGAGCTTCCAGAGACCCTCAAGAGCATGTTTGATAGCCCCTATTTCAAGGAAAGTTTAACTTCCTTCCAACAACTACTTAGGGAGGGTGTTTTCGACACTTCCTTCCTTGGGACAATGATCGAAGATTGCACGACTCTGAAAATGCTTGTACTATGTAACTCTTCGAAATCCAAATGGGTCGAACGCTATCATCAACTGAAGAAACGTAAAAATGATGGCGAAGGATCTTTTCTTTCCAACGCCAACACGTCTGTGTCGAGTAACTTCATGAATGTGAAGCGATTGCAGGAGAGCTACAATCAAAATGTTCCTGAGGTAAAGACTATCATGAAGAGTCCCAAAAGGTTGGTGATGAAGGAAAATAAGGATCCTGGAGAGAATGATGGGTCTTGCTTCAGTCCTAGAAGCTTGTTTGCCTTGCCTACTGATGGAAGCTTTGAATATTTACAATTCATCGAAAGAAGTTCGGATCAAGACCTATTGCTCGATGTGCGATCAAATAATTCATTCCCACAGGCCGAGCTCCTTCACCCGACCTCTCGTACGGGTGGTAGGCAGGCCAGCACATGTAGTAGCTCGGTTCACTCAAATCTTGTACATCACTAACAACTGTATTTTCCATTCATAACCACTGTGCCTGCTGCTATTAGCTAATAGCTATGGCCGGGACTCGCATGTTTTTTGTTGTAAAAGAAAAGTCAGTAAAGGTCGAATGGCATCTTTCGTTTTTCGATTCACCCTTTTTTGAAGTTTGAATGATGGAACCATGACTTTGGGTGAAAATTTTGTATAGGTTAACACAAGATTTGTAACATATATATATGCTTGAATATTATGTATTG

Coding sequence (CDS)

ATGGAAGCTAGAATTGTCTCAGAAAGTAAGGGCACAGTGACCTCCGAGAGAGGGCAATTTCGAGGAATCCGAATGGAGAATCCCTTTACTTTGAAGGTGGGTCAGATATTTACAGGGTTCGGCGTCGGTTGTGGCGTCGGTATCGGCGTTGGCCGCCCCATAAACATGGGTGCAATACCTGTCATGAATGAAGTAATGAGTGCCACGAGAGGTGCAACCGATGCACTTTCTGGTGTAACCAGGCATTTAAATAACTCTCTCAGGAAATTAGGAGCTAAGAACATCCAAGGTGGCATTGGGTGTGGAGTTGGTTTTGGCCATGGTTTCGGTGTCGGCCTAGCTATCAAGCCTTCATTTCTACAACAAGTTCAATCTTCTGTTATGCAAGCAATGGAGAAGATGGTGACAAAATTAGGTAATAATCCTAATCTTGCAATTAGTCAAAGCGCTGTACCAGTATCACTGCAATCTGCCATGAGCATAACAAATGCTTCGGCTAACAAGCATCCTGTTGCAAGCATTAGAGAGTTTGCAAAAGAAATGCCAGAAACTGCTCCACAAAACCTATCGAGTAGATCGTTTGAAACTCGAACCGAAAAGGTCGTCGACAGTTTCTTGCAGAATCCTGTTTTTAAAGGAGGGGATACTGAACTGCAGGATGAGGTTGGACGCCTACGGCTCGAAAACCGTCTCTTTCAAATGGTAATGATGCATCAGAAACTTATTCAAGAGCTCAAGGAGGAGAATAACAAGCTTCACCAAATACTAGTGGAAGACCTGAAAATACCACCCAGCAAGCTCCAAGCAAGTAACACAGGATCCAGAAACCGTTGGAGCTATGATTCTCACAAGAATTCCCGCCAGATTTCGCCGGCCGTTCAATCTCATCTCCAGCAGGTTTACCTTACTCTAGGTTGTGCTTTAGTTGCATCTGCGGCTGGAGCTTATCTTCATATGCTTTGGAACATTGGCGGTGTTCTCACAACACTTGCTAGTATCGGAAGCATCGCATGGCTAATGGTCACTCCTCCATATGAAGAGAAAAAGAGGGTTTCTATGTTAATGGGGGCTGCTCTTCTTCAAGGAGCTTCAATTGGTCCTTTGATCAGTGTGGCTATTGAGATTGATCCGAGTGTTCTGGTCAGTGCATTTGTGGGAACGGCAGTGGCCTTTGGTTGTTTTTCAGCAGCAGCCATGTTCGCAAGGCGCAGAGAATTCCTTTATCTGGGTGGCCTGCTTTCTTCTGGAATATCCATGTTACTCTGGCTGCGTTTTGCTTCCTCTATATTCGGTGGTTCCACTGCCATTTTCAAGTTTGAGTTGTATTTTGGACTTCTGCTGTTGGTGGGCTACGTGGTGGTTGACACTCAGAAAATAATCGAGAGGGCGCATCTTGGTGATGTGGATTACCTCGAAGCAAACCCGCCCCCGCAAGGGAAAGAAACACAGCGTCTCGGCATCACTGTGTTCGTTCCAAATAGATCGATGGCTTCATCTGTTGGAGGGATTTGGATGGACAAGGTTCTAGAGCTTGGACGAAGAGCCCTCTTCTACGTTAAAGTCCTCTCCGGCTATGAAGAGCGGCGGATTCGATCGTTTAGGTTGGAGCTCGAAAAGCGTCTCAAGAAGGCAGAGGAAAGAAAAGCTGCAATAAGAAAGATACCAGAACAGGCTATCTTAGGAGAAGTTCGACGCATGGTTGAGGAGATGCAGACTCTAAATAAAAAGTTGGAGGAAACCGAGTCTGCCATTGAAGAGTACTTCAAACCAATCGACAAAGAAGTAGAGACATTAATGAGAGTGCAACTTGAAGGAGAAGAGAGAACAATGAAGAACATGGTTAAAGTGATGCAGCAACAGGCTTTGTTAGAAAAAACTGAGGCAGATAAGGTAGGCACGCCTCTTTGGCGTAATGGACCTCCTGATAAACCAGTATTATGCAACGCATGTGGATCCCGATGGAGGACGAAGGGAACGCTTGCAAACTACACCCCTTTGCA
TGCTCGAGCTGATCCTGATGAATATGAAAATCACAGGGTTAAGAGCATTTCAATAAATAAAAACAAAGAAGTGAAACTGCTAAAAAGAAAGCTGCAACAAGCAGATGGATCGATTGGGTGGACGATTCCCGATCAAAGTCAGGGTTACTATAGAGTAGTGGATGAAGATACAAGCAATAGATCCAGTTCTGGGTCAGCCATATCTAATCCTGAGAGCTGTGCTCATTTCAGCAGTGCTGATGCAAGTGACTTGACAGGTCCAGCTCAGTCGATAGTGTGGGAGGCCATGGTGCCTTCGAGAAAGAGGACCTGTGTGAATCGTCCAAAGCAATCCCCGGTCGAGAAACTAACTAAAGATCTATATGGTATTTTATGTGAGCAGAGATCTTCCTATTTCTCTGAAGCTTCTGAGGAGGATCTGCTTTTTGAGAGTGAAAAGCCTATGGTCTCTGTTGAGATAGGCCATGGAAGCATTCTCATTAGGCATCCGAGCTCAATAGCTCGAGAAGAGGAGTCTGAGGCTAGCTCGGTTTCAGTTGATAACAAGCAATACTTGGTAAATGAGATCTATTCCCCCCATTCTGCTACTGTTCTTGTATGTAGTGAAAACAAGGGCATGAATTTTCCACCATCTAGGATTGGAAAGATGATGAATCCTTCTGGATCAGGGGTACAACAAGTACAAATTAAAAGGGACGACTCTCATCATGAAAGTGCACAAATTCTTGGAAGCCATAACTCGCCCTTATGCGACGTAGATATAAATGACATTATAAACTTCGGAGAGTTTGTGAAACAACTTACAAATGAAGAACAACAGCAGTTAATGAAGTATCTACCACAGATCGATATTGCTGAGCTTCCAGAGACCCTCAAGAGCATGTTTGATAGCCCCTATTTCAAGGAAAGTTTAACTTCCTTCCAACAACTACTTAGGGAGGGTGTTTTCGACACTTCCTTCCTTGGGACAATGATCGAAGATTGCACGACTCTGAAAATGCTTGTACTATGTAACTCTTCGAAATCCAAATGGGTCGAACGCTATCATCAACTGAAGAAACGTAAAAATGATGGCGAAGGATCTTTTCTTTCCAACGCCAACACGTCTGTGTCGAGTAACTTCATGAATGTGAAGCGATTGCAGGAGAGCTACAATCAAAATGTTCCTGAGGTAAAGACTATCATGAAGAGTCCCAAAAGGTTGGTGATGAAGGAAAATAAGGATCCTGGAGAGAATGATGGGTCTTGCTTCAGTCCTAGAAGCTTGTTTGCCTTGCCTACTGATGGAAGCTTTGAATATTTACAATTCATCGAAAGAAGTTCGGATCAAGACCTATTGCTCGATGTGCGATCAAATAATTCATTCCCACAGGCCGAGCTCCTTCACCCGACCTCTCGTACGGGTGGTAGGCAGGCCAGCACATGTAGTAGCTCGGTTCACTCAAATCTTGTACATCACTAA

Protein sequence

MEARIVSESKGTVTSERGQFRGIRMENPFTLKVGQIFTGFGVGCGVGIGVGRPINMGAIPVMNEVMSATRGATDALSGVTRHLNNSLRKLGAKNIQGGIGCGVGFGHGFGVGLAIKPSFLQQVQSSVMQAMEKMVTKLGNNPNLAISQSAVPVSLQSAMSITNASANKHPVASIREFAKEMPETAPQNLSSRSFETRTEKVVDSFLQNPVFKGGDTELQDEVGRLRLENRLFQMVMMHQKLIQELKEENNKLHQILVEDLKIPPSKLQASNTGSRNRWSYDSHKNSRQISPAVQSHLQQVYLTLGCALVASAAGAYLHMLWNIGGVLTTLASIGSIAWLMVTPPYEEKKRVSMLMGAALLQGASIGPLISVAIEIDPSVLVSAFVGTAVAFGCFSAAAMFARRREFLYLGGLLSSGISMLLWLRFASSIFGGSTAIFKFELYFGLLLLVGYVVVDTQKIIERAHLGDVDYLEANPPPQGKETQRLGITVFVPNRSMASSVGGIWMDKVLELGRRALFYVKVLSGYEERRIRSFRLELEKRLKKAEERKAAIRKIPEQAILGEVRRMVEEMQTLNKKLEETESAIEEYFKPIDKEVETLMRVQLEGEERTMKNMVKVMQQQALLEKTEADKVGTPLWRNGPPDKPVLCNACGSRWRTKGTLANYTPLHARADPDEYENHRVKSISINKNKEVKLLKRKLQQADGSIGWTIPDQSQGYYRVVDEDTSNRSSSGSAISNPESCAHFSSADASDLTGPAQSIVWEAMVPSRKRTCVNRPKQSPVEKLTKDLYGILCEQRSSYFSEASEEDLLFESEKPMVSVEIGHGSILIRHPSSIAREEESEASSVSVDNKQYLVNEIYSPHSATVLVCSENKGMNFPPSRIGKMMNPSGSGVQQVQIKRDDSHHESAQILGSHNSPLCDVDINDIINFGEFVKQLTNEEQQQLMKYLPQIDIAELPETLKSMFDSPYFKESLTSFQQLLREGVFDTSFLGTMIEDCTTLKMLVLCNSSKSKWVERYHQLKKRKNDGEGSFLSNANTSVSSNFMNVKRLQESYNQNVPEVKTIMKSPKRLVMKENKDPGENDGSCFSPRSLFALPTDGSFEYLQFIERSSDQDLLLDVRSNNSFPQAELLHPTSRTGGRQASTCSSSVHSNLVHH
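The relationship between the coding sequence and the protein sequence can be checked with a short translation routine. This is a sketch only: it assumes the standard genetic code (NCBI translation table 1), `translate` is a hypothetical helper name, and the CDS lines shown on this page must be joined into a single string before use.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
_BASES = "TCAG"
_AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence in frame 0; '*' marks a stop codon.

    Codons containing characters other than A, C, G, T become 'X'.
    A trailing partial codon is ignored.
    """
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X")
        for i in range(0, len(cds) - len(cds) % 3, 3)
    )


# First ten codons and last three codons of the CDS shown above.
print(translate("ATGGAAGCTAGAATTGTCTCAGAAAGTAAG"))
print(translate("CATCACTAA"))
```

The first ten codons translate to MEARIVSESK and the final codons to HH followed by a stop, which agrees with the start and end of the protein sequence listed above.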
Homology
BLAST of CmaCh02G015690 vs. ExPASy Swiss-Prot
Match: Q8W4H1 (GATA transcription factor 26 OS=Arabidopsis thaliana OX=3702 GN=GATA26 PE=2 SV=1)

HSP 1 Score: 441.8 bits (1135), Expect = 2.4e-122
Identity = 260/510 (50.98%), Positives = 337/510 (66.08%), Query Frame = 0

Query: 633  TPLWRNGPPDKPVLCNACGSRWRTKGTLANYTPLHARADPDEYENH----RVKSISI-NK 692
            TPLWRNGPP+KPVLCNACGSRWRTKGTL NYTPLHARAD DE ++H    R+KSIS+ NK
Sbjct: 15   TPLWRNGPPEKPVLCNACGSRWRTKGTLVNYTPLHARADGDENDDHHRFQRMKSISLGNK 74

Query: 693  NKEVKLLKRKLQQADGSIGWTIPDQSQGY-YRVVDEDTSNRSSSGSAISNPESCAHFSSA 752
            NKE+K+LKRK  Q +  I   + + S G    V++ED SNRSSSGSA+SN ESCA FSSA
Sbjct: 75   NKEIKMLKRKAIQENIIIKRPVFEFSYGLKAAVIEEDASNRSSSGSAVSNSESCAQFSSA 134

Query: 753  DASDLTGPAQSIVWEAMVPSRKRTCVNRPKQSPVEKLTKDLYGILCEQRSSYFSEASEED 812
            D S    P+QS  W+  VP ++RTCV RPK S VEKLTKDLY IL EQ+SS  S +SEED
Sbjct: 135  DGS----PSQSNAWDTTVPCKRRTCVGRPKSSSVEKLTKDLYNILQEQQSSCLSVSSEED 194

Query: 813  LLFESEKPMVSVEIGHGSILIRHPSSIAREEESEASSVSVDNKQYLVNEIYSPHSATVLV 872
            LLFE+E  MVSVEIGHGS+L+++P S AREEESEASS+S    +  +++ YS HS   + 
Sbjct: 195  LLFENEMSMVSVEIGHGSVLMKNPHSFAREEESEASSLSSIENKSSISDAYS-HSVKRVE 254

Query: 873  CSENKGMNFPPSRIGKMMNPSGSGVQQVQIKRDDSHHESAQILGSHNSPLCDVDINDIIN 932
                +G  +            G  ++Q Q KR  S  E   +LGSH SPLC +D+ D+ N
Sbjct: 255  IGAVRGSYY-----------GGQTIKQEQFKRTKSQTERVHVLGSHGSPLCSIDLKDVFN 314

Query: 933  FGEFVKQLTNEEQQQLMKYLPQIDIAELPETLKSMFDSPYFKESLTSFQQLLREGVFD-T 992
            F EF++Q T EEQ++LM  LPQID  +LP +L+ MF+S  FK++ + FQQL+ +GVFD +
Sbjct: 315  FDEFIEQFTEEEQKKLMNLLPQIDSDDLPHSLRMMFESAQFKDNFSLFQQLIADGVFDVS 374

Query: 993  SFLGTMIEDCTTLKMLVLCNSSKSKWVERYHQLKKRKNDGEGSFLSNANTS---VSSNFM 1052
            S  G  +E+  T K L L + +KS+ VE Y+ LK+R+     S  + + +S   V  N +
Sbjct: 375  SSSGAKLEEIRTFKKLALTDFNKSRLVESYNLLKEREKGTGDSVTTTSKSSIPNVPKNIV 434

Query: 1053 NVKRLQESYNQNVPEVKTIMKSPKRLVMKENKDPGENDGSCFSPRSLFAL--PTDGSFEY 1112
             +KR  E+  Q   E + +M+SPKR++  +     EN+ SCF PRSL ++     GS  +
Sbjct: 435  TIKRRYENQIQVKSESRGLMRSPKRVMKMKASHETENNVSCFRPRSLASVFAQEGGSAVF 494

Query: 1113 LQFIERSSDQD-LLLDVRSNNSFPQAELLH 1130
                  SSDQD LLLD+ SN SFPQAELLH
Sbjct: 495  SYEGNCSSDQDLLLLDLPSNGSFPQAELLH 508
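The percentages in each HSP header are the match counts divided by the alignment length. A small sketch of that arithmetic follows; `hsp_percentages` is a hypothetical helper, and the regular expression simply accepts any label beginning with "Pos" for the conservative-substitution count.

```python
import re


def hsp_percentages(header: str) -> dict[str, float]:
    """Recompute identity/positives percentages from an HSP header line.

    Each value is 100 * matches / alignment_length, rounded to two decimals.
    """
    result: dict[str, float] = {}
    for label, hits, length in re.findall(
        r"(Identity|Pos\w+)\s*=\s*(\d+)/(\d+)", header
    ):
        key = "identity" if label == "Identity" else "positives"
        result[key] = round(100 * int(hits) / int(length), 2)
    return result


# Header values of the first HSP (match Q8W4H1) on this page.
print(hsp_percentages("Identity = 260/510 (50.98%), Positives = 337/510 (66.08%)"))
```

For the first HSP this reproduces the reported 50.98% identity and 66.08% positives over 510 aligned positions.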

BLAST of CmaCh02G015690 vs. ExPASy Swiss-Prot
Match: Q5PP38 (GATA transcription factor 27 OS=Arabidopsis thaliana OX=3702 GN=GATA27 PE=2 SV=1)

HSP 1 Score: 364.0 bits (933), Expect = 6.3e-99
Identity = 231/512 (45.12%), Positives = 311/512 (60.74%), Query Frame = 0

Query: 633  TPLWRNGPPDKPVLCNACGSRWRTKGTLANYTPLHARADPD--EYENHR-----VKSISI 692
            TPLWRNGPP+KPVLCNACGSRWRTKG+L NYTPLHARA+ D  E E+HR     +K +S+
Sbjct: 15   TPLWRNGPPEKPVLCNACGSRWRTKGSLVNYTPLHARAEGDETEIEDHRTQTVMIKGMSL 74

Query: 693  NKNKEVKLLKRKLQQADGSIGWTIPDQSQGYYR-VVDEDTSNRSSSGSAISNPESCAHFS 752
            NK    K+ KRK  Q + ++     +   G+ R  +DE+ SNRSSSGS +SN ESC    
Sbjct: 75   NK----KIPKRKPYQENFTVKRANLEFHTGFKRKALDEEASNRSSSGSVVSNSESC---- 134

Query: 753  SADASDLTGPAQSIVWEAMVPSRKRTCVNRPK-QSPVEKLTKDLYGILCEQRSSYFSEAS 812
                      AQS  W++  P ++RTCV RPK  S VEKLTKDLY IL EQ+SS  S  S
Sbjct: 135  ----------AQSNAWDSTFPCKRRTCVGRPKAASSVEKLTKDLYTILQEQQSSCLSGTS 194

Query: 813  EEDLLFESEKPMVSVEIGHGSILIRHPSSIAREEESEASSVSVDNKQYLVNEIYSPHSAT 872
            EEDLLFE+E PM+   +GHGS+L+R P S AREEESEASS+ V++     ++  S HS  
Sbjct: 195  EEDLLFENETPML---LGHGSVLMRDPHSGAREEESEASSLLVES-----SKSSSVHSVK 254

Query: 873  VLVCSENKGMNFPPSRIGKMMNPSGSGVQQVQIKRDDSHHESAQILGSHNSPLCDVDIND 932
                                    G  ++Q Q+KR  S     Q+LG H+S LC +D+ D
Sbjct: 255  F----------------------GGKAMKQEQVKRSKS-----QVLGRHSSLLCSIDLKD 314

Query: 933  IINFGEFVKQLTNEEQQQLMKYLPQIDIAELPETLKSMFDSPYFKESLTSFQQLLREGVF 992
            + NF EF++  T EEQQ+LMK LPQ+D  + P++L+SMF+S  FKE+L+ FQQL+ +GVF
Sbjct: 315  VFNFDEFIENFTEEEQQKLMKLLPQVDSVDRPDSLRSMFESSQFKENLSLFQQLVADGVF 374

Query: 993  DTSFLGTMIEDCTTLKMLVLCNSSKSKWVERYHQLKKRKNDG---EGSFLSNANTSVSSN 1052
            +T+     +ED  TL  L L + +KS  +E Y+ LK+R+ +      S +S+ + S +++
Sbjct: 375  ETNSSYAKLEDIKTLAKLALSDPNKSHLLESYYMLKRREIEDCVTTTSRVSSLSPSNNNS 434

Query: 1053 FMNVKRLQESYNQNVPEVKTIMKSPKRLV---MKENKDPGENDGSCFSPRSLFALPTDGS 1112
             + ++R  ES NQN  E + +M+SPK ++    K  ++  EN  S F P S       G 
Sbjct: 435  LVTIERPCESLNQNFSETRGVMRSPKEVMKIRSKHTEENLENSVSSFKPVS-----CGGP 468

Query: 1113 FEYLQFIERSSDQDLLLDVRSNNSFPQAELLH 1130
              +       SDQDLLLDV SN SFPQAELL+
Sbjct: 495  LVFSYEDNDISDQDLLLDVPSNGSFPQAELLN 468
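Each HSP header reports a bit score with the raw score in parentheses. The two are related by the usual normalisation S' = (lambda * S - ln K) / ln 2. The page does not state which scoring system was used; the sketch below assumes BLOSUM62 with gap costs 11/1, for which lambda = 0.267 and K = 0.041 are the commonly quoted gapped parameters. `bit_score` is a hypothetical helper name.

```python
import math

# Assumed gapped Karlin-Altschul parameters (BLOSUM62, gap open 11, extend 1).
# These are not given on the page; they are an assumption of this sketch.
LAMBDA = 0.267
K = 0.041


def bit_score(raw_score: int, lam: float = LAMBDA, k: float = K) -> float:
    """Convert a raw alignment score to a bit score: (lam*S - ln k) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


# Raw scores of the HSPs reported above for Q8W4H1 and Q5PP38.
for raw in (1135, 933):
    print(raw, round(bit_score(raw), 1))
```

With these assumed parameters the raw scores 1135 and 933 convert to 441.8 and 364.0 bits, in line with the values shown in the two HSP headers above.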

BLAST of CmaCh02G015690 vs. ExPASy Swiss-Prot
Match: Q9LD45 (Bax inhibitor 1 OS=Arabidopsis thaliana OX=3702 GN=BI-1 PE=1 SV=1)

HSP 1 Score: 296.6 bits (758), Expect = 1.2e-78
Identity = 148/203 (72.91%), Positives = 183/203 (90.15%), Query Frame = 0

Query: 270 SNTGSRNRWSYDSHKNSRQISPAVQSHLQQVYLTLGCALVASAAGAYLHMLWNIGGVLTT 329
           S  GSR+ WSYDS KN RQISPAVQ+HL++VYLTL CALVASA GAYLH+LWNIGG+LTT
Sbjct: 10  SQPGSRS-WSYDSLKNFRQISPAVQNHLKRVYLTLCCALVASAFGAYLHVLWNIGGILTT 69

Query: 330 LASIGSIAWLMVTPPYEEKKRVSMLMGAALLQGASIGPLISVAIEIDPSVLVSAFVGTAV 389
           +  IG++ WL+  PPYE +KR+S+L  +A+L+GAS+GPLI VAI++DPS+L++AFVGTA+
Sbjct: 70  IGCIGTMIWLLSCPPYEHQKRLSLLFVSAVLEGASVGPLIKVAIDVDPSILITAFVGTAI 129

Query: 390 AFGCFSAAAMFARRREFLYLGGLLSSGISMLLWLRFASSIFGGSTAIFKFELYFGLLLLV 449
           AF CFSAAAM ARRRE+LYLGGLLSSG+SML+WL+FASSIFGGS +IFKFELYFGLL+ V
Sbjct: 130 AFVCFSAAAMLARRREYLYLGGLLSSGLSMLMWLQFASSIFGGSASIFKFELYFGLLIFV 189

Query: 450 GYVVVDTQKIIERAHLGDVDYLE 473
           GY+VVDTQ+IIE+AHLGD+DY++
Sbjct: 190 GYMVVDTQEIIEKAHLGDMDYVK 211

BLAST of CmaCh02G015690 vs. ExPASy Swiss-Prot
Match: Q9MBD8 (Bax inhibitor 1 OS=Oryza sativa subsp. japonica OX=39947 GN=BI1 PE=2 SV=1)

HSP 1 Score: 260.0 bits (663), Expect = 1.3e-67
Identity = 131/195 (67.18%), Positives = 163/195 (83.59%), Query Frame = 0

Query: 278 WSYDSHKNSRQISPAVQSHLQQVYLTLGCALVASAAGAYLHMLWNIGGVLTTLASIGSIA 337
           W YDS KN RQISPAVQSHL+ VYLTL  AL ASA GAYLH+  NIGG+LT L  +GSIA
Sbjct: 18  WGYDSLKNFRQISPAVQSHLKLVYLTLCVALAASAVGAYLHVALNIGGMLTMLGCVGSIA 77

Query: 338 WLMVTPPYEEKKRVSMLMGAALLQGASIGPLISVAIEIDPSVLVSAFVGTAVAFGCFSAA 397
           WL   P +EE+KR  +L+ AALL+GAS+GPLI +A++ D S+LV+AFVGTA+AFGCF+ A
Sbjct: 78  WLFSVPVFEERKRFGILLAAALLEGASVGPLIKLAVDFDSSILVTAFVGTAIAFGCFTCA 137

Query: 398 AMFARRREFLYLGGLLSSGISMLLWLRFASSIFGGSTAIFKFELYFGLLLLVGYVVVDTQ 457
           A+ A+RRE+LYLGGLLSSG+S+LLWL+FA+SIFG ST  F FE+YFGLL+ +GY+V DTQ
Sbjct: 138 AIVAKRREYLYLGGLLSSGLSILLWLQFAASIFGHSTGSFMFEVYFGLLIFLGYMVYDTQ 197

Query: 458 KIIERAHLGDVDYLE 473
           +IIERAH GD+DY++
Sbjct: 198 EIIERAHHGDMDYIK 212

BLAST of CmaCh02G015690 vs. ExPASy Swiss-Prot
Match: Q9IA79 (Probable Bax inhibitor 1 OS=Paralichthys olivaceus OX=8255 GN=tmbim6 PE=2 SV=1)

HSP 1 Score: 152.1 bits (383), Expect = 3.7e-35
Identity = 85/197 (43.15%), Positives = 133/197 (67.51%), Query Frame = 0

Query: 279 SYDSHKNSRQISPAVQSHLQQVYLTLGCALVASAAGAYLHMLWNI--GGVLTTLASIGSI 338
           ++DS     QIS + Q HL+ VY +L   +  +AAG+Y+H++  +  GG+L+ L S+G +
Sbjct: 9   NFDSLFKFSQISHSTQVHLKNVYSSLAVCMFVAAAGSYVHVVTRLFQGGMLSVLGSLGMM 68

Query: 339 AWLMVTP--PYEEKKRVSMLMGAALLQGASIGPLISVAIEIDPSVLVSAFVGTAVAFGCF 398
            WL +TP     EKKR+++L G A L G  + P +   I I+PS++V+AF+GT+V F CF
Sbjct: 69  FWLAMTPHNSETEKKRLAILAGFAFLTGVGLCPTLDFVIAINPSIIVTAFLGTSVIFVCF 128

Query: 399 SAAAMFARRREFLYLGGLLSSGISMLLWLRFASSIFGGSTAIFKFELYFGLLLLVGYVVV 458
           + +A++A+RR +L+LGG L SG+S +L+L    ++F GS  +FK  +Y GLL++ G+V+ 
Sbjct: 129 TLSALYAKRRSYLFLGGTLMSGLS-ILFLMSMMNMFFGSVMLFKAHMYLGLLIMCGFVLX 188

Query: 459 DTQKIIERAHLGDVDYL 472
           DTQ IIE+A  GD DY+
Sbjct: 189 DTQLIIEKAENGDKDYV 204

BLAST of CmaCh02G015690 vs. TAIR 10
Match: AT4G17570.3 (GATA transcription factor 26 )

HSP 1 Score: 454.5 bits (1168), Expect = 2.5e-127
Identity = 263/510 (51.57%), Positives = 341/510 (66.86%), Query Frame = 0

Query: 633  TPLWRNGPPDKPVLCNACGSRWRTKGTLANYTPLHARADPDEYENH----RVKSISI-NK 692
            TPLWRNGPP+KPVLCNACGSRWRTKGTL NYTPLHARAD DE ++H    R+KSIS+ NK
Sbjct: 15   TPLWRNGPPEKPVLCNACGSRWRTKGTLVNYTPLHARADGDENDDHHRFQRMKSISLGNK 74

Query: 693  NKEVKLLKRKLQQADGSIGWTIPDQSQGY-YRVVDEDTSNRSSSGSAISNPESCAHFSSA 752
            NKE+K+LKRK  Q +  I   + + S G    V++ED SNRSSSGSA+SN ESCA FSSA
Sbjct: 75   NKEIKMLKRKAIQENIIIKRPVFEFSYGLKAAVIEEDASNRSSSGSAVSNSESCAQFSSA 134

Query: 753  DASDLTGPAQSIVWEAMVPSRKRTCVNRPKQSPVEKLTKDLYGILCEQRSSYFSEASEED 812
            D S+LTGP+QS  W+  VP ++RTCV RPK S VEKLTKDLY IL EQ+SS  S +SEED
Sbjct: 135  DGSELTGPSQSNAWDTTVPCKRRTCVGRPKSSSVEKLTKDLYNILQEQQSSCLSVSSEED 194

Query: 813  LLFESEKPMVSVEIGHGSILIRHPSSIAREEESEASSVSVDNKQYLVNEIYSPHSATVLV 872
            LLFE+E  MVSVEIGHGS+L+++P S AREEESEASS+S    +  +++ YS HS   + 
Sbjct: 195  LLFENEMSMVSVEIGHGSVLMKNPHSFAREEESEASSLSSIENKSSISDAYS-HSVKRVE 254

Query: 873  CSENKGMNFPPSRIGKMMNPSGSGVQQVQIKRDDSHHESAQILGSHNSPLCDVDINDIIN 932
                +G  +            G  ++Q Q KR  S  E   +LGSH SPLC +D+ D+ N
Sbjct: 255  IGAVRGSYY-----------GGQTIKQEQFKRTKSQTERVHVLGSHGSPLCSIDLKDVFN 314

Query: 933  FGEFVKQLTNEEQQQLMKYLPQIDIAELPETLKSMFDSPYFKESLTSFQQLLREGVFD-T 992
            F EF++Q T EEQ++LM  LPQID  +LP +L+ MF+S  FK++ + FQQL+ +GVFD +
Sbjct: 315  FDEFIEQFTEEEQKKLMNLLPQIDSDDLPHSLRMMFESAQFKDNFSLFQQLIADGVFDVS 374

Query: 993  SFLGTMIEDCTTLKMLVLCNSSKSKWVERYHQLKKRKNDGEGSFLSNANTS---VSSNFM 1052
            S  G  +E+  T K L L + +KS+ VE Y+ LK+R+     S  + + +S   V  N +
Sbjct: 375  SSSGAKLEEIRTFKKLALTDFNKSRLVESYNLLKEREKGTGDSVTTTSKSSIPNVPKNIV 434

Query: 1053 NVKRLQESYNQNVPEVKTIMKSPKRLVMKENKDPGENDGSCFSPRSLFAL--PTDGSFEY 1112
             +KR  E+  Q   E + +M+SPKR++  +     EN+ SCF PRSL ++     GS  +
Sbjct: 435  TIKRRYENQIQVKSESRGLMRSPKRVMKMKASHETENNVSCFRPRSLASVFAQEGGSAVF 494

Query: 1113 LQFIERSSDQD-LLLDVRSNNSFPQAELLH 1130
                  SSDQD LLLD+ SN SFPQAELLH
Sbjct: 495  SYEGNCSSDQDLLLLDLPSNGSFPQAELLH 512

BLAST of CmaCh02G015690 vs. TAIR 10
Match: AT4G17570.2 (GATA transcription factor 26 )

HSP 1 Score: 454.5 bits (1168), Expect = 2.5e-127
Identity = 263/510 (51.57%), Positives = 341/510 (66.86%), Query Frame = 0

Query: 633  TPLWRNGPPDKPVLCNACGSRWRTKGTLANYTPLHARADPDEYENH----RVKSISI-NK 692
            TPLWRNGPP+KPVLCNACGSRWRTKGTL NYTPLHARAD DE ++H    R+KSIS+ NK
Sbjct: 27   TPLWRNGPPEKPVLCNACGSRWRTKGTLVNYTPLHARADGDENDDHHRFQRMKSISLGNK 86

Query: 693  NKEVKLLKRKLQQADGSIGWTIPDQSQGY-YRVVDEDTSNRSSSGSAISNPESCAHFSSA 752
            NKE+K+LKRK  Q +  I   + + S G    V++ED SNRSSSGSA+SN ESCA FSSA
Sbjct: 87   NKEIKMLKRKAIQENIIIKRPVFEFSYGLKAAVIEEDASNRSSSGSAVSNSESCAQFSSA 146

Query: 753  DASDLTGPAQSIVWEAMVPSRKRTCVNRPKQSPVEKLTKDLYGILCEQRSSYFSEASEED 812
            D S+LTGP+QS  W+  VP ++RTCV RPK S VEKLTKDLY IL EQ+SS  S +SEED
Sbjct: 147  DGSELTGPSQSNAWDTTVPCKRRTCVGRPKSSSVEKLTKDLYNILQEQQSSCLSVSSEED 206

Query: 813  LLFESEKPMVSVEIGHGSILIRHPSSIAREEESEASSVSVDNKQYLVNEIYSPHSATVLV 872
            LLFE+E  MVSVEIGHGS+L+++P S AREEESEASS+S    +  +++ YS HS   + 
Sbjct: 207  LLFENEMSMVSVEIGHGSVLMKNPHSFAREEESEASSLSSIENKSSISDAYS-HSVKRVE 266

Query: 873  CSENKGMNFPPSRIGKMMNPSGSGVQQVQIKRDDSHHESAQILGSHNSPLCDVDINDIIN 932
                +G  +            G  ++Q Q KR  S  E   +LGSH SPLC +D+ D+ N
Sbjct: 267  IGAVRGSYY-----------GGQTIKQEQFKRTKSQTERVHVLGSHGSPLCSIDLKDVFN 326

Query: 933  FGEFVKQLTNEEQQQLMKYLPQIDIAELPETLKSMFDSPYFKESLTSFQQLLREGVFD-T 992
            F EF++Q T EEQ++LM  LPQID  +LP +L+ MF+S  FK++ + FQQL+ +GVFD +
Sbjct: 327  FDEFIEQFTEEEQKKLMNLLPQIDSDDLPHSLRMMFESAQFKDNFSLFQQLIADGVFDVS 386

Query: 993  SFLGTMIEDCTTLKMLVLCNSSKSKWVERYHQLKKRKNDGEGSFLSNANTS---VSSNFM 1052
            S  G  +E+  T K L L + +KS+ VE Y+ LK+R+     S  + + +S   V  N +
Sbjct: 387  SSSGAKLEEIRTFKKLALTDFNKSRLVESYNLLKEREKGTGDSVTTTSKSSIPNVPKNIV 446

Query: 1053 NVKRLQESYNQNVPEVKTIMKSPKRLVMKENKDPGENDGSCFSPRSLFAL--PTDGSFEY 1112
             +KR  E+  Q   E + +M+SPKR++  +     EN+ SCF PRSL ++     GS  +
Sbjct: 447  TIKRRYENQIQVKSESRGLMRSPKRVMKMKASHETENNVSCFRPRSLASVFAQEGGSAVF 506

Query: 1113 LQFIERSSDQD-LLLDVRSNNSFPQAELLH 1130
                  SSDQD LLLD+ SN SFPQAELLH
Sbjct: 507  SYEGNCSSDQDLLLLDLPSNGSFPQAELLH 524

BLAST of CmaCh02G015690 vs. TAIR 10
Match: AT4G17570.1 (GATA transcription factor 26 )

HSP 1 Score: 441.8 bits (1135), Expect = 1.7e-123
Identity = 260/510 (50.98%), Positives = 337/510 (66.08%), Query Frame = 0

Query: 633  TPLWRNGPPDKPVLCNACGSRWRTKGTLANYTPLHARADPDEYENH----RVKSISI-NK 692
            TPLWRNGPP+KPVLCNACGSRWRTKGTL NYTPLHARAD DE ++H    R+KSIS+ NK
Sbjct: 15   TPLWRNGPPEKPVLCNACGSRWRTKGTLVNYTPLHARADGDENDDHHRFQRMKSISLGNK 74

Query: 693  NKEVKLLKRKLQQADGSIGWTIPDQSQGY-YRVVDEDTSNRSSSGSAISNPESCAHFSSA 752
            NKE+K+LKRK  Q +  I   + + S G    V++ED SNRSSSGSA+SN ESCA FSSA
Sbjct: 75   NKEIKMLKRKAIQENIIIKRPVFEFSYGLKAAVIEEDASNRSSSGSAVSNSESCAQFSSA 134

Query: 753  DASDLTGPAQSIVWEAMVPSRKRTCVNRPKQSPVEKLTKDLYGILCEQRSSYFSEASEED 812
            D S    P+QS  W+  VP ++RTCV RPK S VEKLTKDLY IL EQ+SS  S +SEED
Sbjct: 135  DGS----PSQSNAWDTTVPCKRRTCVGRPKSSSVEKLTKDLYNILQEQQSSCLSVSSEED 194

Query: 813  LLFESEKPMVSVEIGHGSILIRHPSSIAREEESEASSVSVDNKQYLVNEIYSPHSATVLV 872
            LLFE+E  MVSVEIGHGS+L+++P S AREEESEASS+S    +  +++ YS HS   + 
Sbjct: 195  LLFENEMSMVSVEIGHGSVLMKNPHSFAREEESEASSLSSIENKSSISDAYS-HSVKRVE 254

Query: 873  CSENKGMNFPPSRIGKMMNPSGSGVQQVQIKRDDSHHESAQILGSHNSPLCDVDINDIIN 932
                +G  +            G  ++Q Q KR  S  E   +LGSH SPLC +D+ D+ N
Sbjct: 255  IGAVRGSYY-----------GGQTIKQEQFKRTKSQTERVHVLGSHGSPLCSIDLKDVFN 314

Query: 933  FGEFVKQLTNEEQQQLMKYLPQIDIAELPETLKSMFDSPYFKESLTSFQQLLREGVFD-T 992
            F EF++Q T EEQ++LM  LPQID  +LP +L+ MF+S  FK++ + FQQL+ +GVFD +
Sbjct: 315  FDEFIEQFTEEEQKKLMNLLPQIDSDDLPHSLRMMFESAQFKDNFSLFQQLIADGVFDVS 374

Query: 993  SFLGTMIEDCTTLKMLVLCNSSKSKWVERYHQLKKRKNDGEGSFLSNANTS---VSSNFM 1052
            S  G  +E+  T K L L + +KS+ VE Y+ LK+R+     S  + + +S   V  N +
Sbjct: 375  SSSGAKLEEIRTFKKLALTDFNKSRLVESYNLLKEREKGTGDSVTTTSKSSIPNVPKNIV 434

Query: 1053 NVKRLQESYNQNVPEVKTIMKSPKRLVMKENKDPGENDGSCFSPRSLFAL--PTDGSFEY 1112
             +KR  E+  Q   E + +M+SPKR++  +     EN+ SCF PRSL ++     GS  +
Sbjct: 435  TIKRRYENQIQVKSESRGLMRSPKRVMKMKASHETENNVSCFRPRSLASVFAQEGGSAVF 494

Query: 1113 LQFIERSSDQD-LLLDVRSNNSFPQAELLH 1130
                  SSDQD LLLD+ SN SFPQAELLH
Sbjct: 495  SYEGNCSSDQDLLLLDLPSNGSFPQAELLH 508

BLAST of CmaCh02G015690 vs. TAIR 10
Match: AT5G47140.1 (GATA transcription factor 27 )

HSP 1 Score: 364.0 bits (933), Expect = 4.4e-100
Identity = 231/512 (45.12%), Positives = 311/512 (60.74%), Query Frame = 0

Query: 633  TPLWRNGPPDKPVLCNACGSRWRTKGTLANYTPLHARADPD--EYENHR-----VKSISI 692
            TPLWRNGPP+KPVLCNACGSRWRTKG+L NYTPLHARA+ D  E E+HR     +K +S+
Sbjct: 15   TPLWRNGPPEKPVLCNACGSRWRTKGSLVNYTPLHARAEGDETEIEDHRTQTVMIKGMSL 74

Query: 693  NKNKEVKLLKRKLQQADGSIGWTIPDQSQGYYR-VVDEDTSNRSSSGSAISNPESCAHFS 752
            NK    K+ KRK  Q + ++     +   G+ R  +DE+ SNRSSSGS +SN ESC    
Sbjct: 75   NK----KIPKRKPYQENFTVKRANLEFHTGFKRKALDEEASNRSSSGSVVSNSESC---- 134

Query: 753  SADASDLTGPAQSIVWEAMVPSRKRTCVNRPK-QSPVEKLTKDLYGILCEQRSSYFSEAS 812
                      AQS  W++  P ++RTCV RPK  S VEKLTKDLY IL EQ+SS  S  S
Sbjct: 135  ----------AQSNAWDSTFPCKRRTCVGRPKAASSVEKLTKDLYTILQEQQSSCLSGTS 194

Query: 813  EEDLLFESEKPMVSVEIGHGSILIRHPSSIAREEESEASSVSVDNKQYLVNEIYSPHSAT 872
            EEDLLFE+E PM+   +GHGS+L+R P S AREEESEASS+ V++     ++  S HS  
Sbjct: 195  EEDLLFENETPML---LGHGSVLMRDPHSGAREEESEASSLLVES-----SKSSSVHSVK 254

Query: 873  VLVCSENKGMNFPPSRIGKMMNPSGSGVQQVQIKRDDSHHESAQILGSHNSPLCDVDIND 932
                                    G  ++Q Q+KR  S     Q+LG H+S LC +D+ D
Sbjct: 255  F----------------------GGKAMKQEQVKRSKS-----QVLGRHSSLLCSIDLKD 314

Query: 933  IINFGEFVKQLTNEEQQQLMKYLPQIDIAELPETLKSMFDSPYFKESLTSFQQLLREGVF 992
            + NF EF++  T EEQQ+LMK LPQ+D  + P++L+SMF+S  FKE+L+ FQQL+ +GVF
Sbjct: 315  VFNFDEFIENFTEEEQQKLMKLLPQVDSVDRPDSLRSMFESSQFKENLSLFQQLVADGVF 374

Query: 993  DTSFLGTMIEDCTTLKMLVLCNSSKSKWVERYHQLKKRKNDG---EGSFLSNANTSVSSN 1052
            +T+     +ED  TL  L L + +KS  +E Y+ LK+R+ +      S +S+ + S +++
Sbjct: 375  ETNSSYAKLEDIKTLAKLALSDPNKSHLLESYYMLKRREIEDCVTTTSRVSSLSPSNNNS 434

Query: 1053 FMNVKRLQESYNQNVPEVKTIMKSPKRLV---MKENKDPGENDGSCFSPRSLFALPTDGS 1112
             + ++R  ES NQN  E + +M+SPK ++    K  ++  EN  S F P S       G 
Sbjct: 435  LVTIERPCESLNQNFSETRGVMRSPKEVMKIRSKHTEENLENSVSSFKPVS-----CGGP 468

Query: 1113 FEYLQFIERSSDQDLLLDVRSNNSFPQAELLH 1130
              +       SDQDLLLDV SN SFPQAELL+
Sbjct: 495  LVFSYEDNDISDQDLLLDVPSNGSFPQAELLN 468

BLAST of CmaCh02G015690 vs. TAIR 10
Match: AT5G47120.1 (BAX inhibitor 1 )

HSP 1 Score: 296.6 bits (758), Expect = 8.7e-80
Identity = 148/203 (72.91%), Positives = 183/203 (90.15%), Query Frame = 0

Query: 270 SNTGSRNRWSYDSHKNSRQISPAVQSHLQQVYLTLGCALVASAAGAYLHMLWNIGGVLTT 329
           S  GSR+ WSYDS KN RQISPAVQ+HL++VYLTL CALVASA GAYLH+LWNIGG+LTT
Sbjct: 10  SQPGSRS-WSYDSLKNFRQISPAVQNHLKRVYLTLCCALVASAFGAYLHVLWNIGGILTT 69

Query: 330 LASIGSIAWLMVTPPYEEKKRVSMLMGAALLQGASIGPLISVAIEIDPSVLVSAFVGTAV 389
           +  IG++ WL+  PPYE +KR+S+L  +A+L+GAS+GPLI VAI++DPS+L++AFVGTA+
Sbjct: 70  IGCIGTMIWLLSCPPYEHQKRLSLLFVSAVLEGASVGPLIKVAIDVDPSILITAFVGTAI 129

Query: 390 AFGCFSAAAMFARRREFLYLGGLLSSGISMLLWLRFASSIFGGSTAIFKFELYFGLLLLV 449
           AF CFSAAAM ARRRE+LYLGGLLSSG+SML+WL+FASSIFGGS +IFKFELYFGLL+ V
Sbjct: 130 AFVCFSAAAMLARRREYLYLGGLLSSGLSMLMWLQFASSIFGGSASIFKFELYFGLLIFV 189

Query: 450 GYVVVDTQKIIERAHLGDVDYLE 473
           GY+VVDTQ+IIE+AHLGD+DY++
Sbjct: 190 GYMVVDTQEIIEKAHLGDMDYVK 211
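
The percentages on each HSP summary line follow directly from the reported counts (identical or positive positions divided by alignment length). As a minimal sketch, not part of the database output, the summary line can be parsed and the percentages recomputed as below. The function name `parse_hsp_stats` and the returned field names are illustrative assumptions; the pattern accepts both the "Positives" and "Postives" spellings.

```python
import re

# Matches the HSP summary line; "Pos[a-z]+" tolerates the "Postives" spelling.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), Pos[a-z]+ = (\d+)/(\d+) \([\d.]+%\)"
)


def parse_hsp_stats(line):
    """Return counts and recomputed percentages for one HSP line, or None."""
    m = HSP_STATS.search(line)
    if m is None:
        return None
    ident, ident_len, pos, pos_len = (int(g) for g in m.groups())
    return {
        "identical": ident,
        "positive": pos,
        "aligned_length": ident_len,
        # Percentages are recomputed from the counts, not copied from the text.
        "identity_pct": round(100 * ident / ident_len, 2),
        "positive_pct": round(100 * pos / pos_len, 2),
    }


# Summary line of the BAX inhibitor 1 hit (AT5G47120.1) shown above.
stats = parse_hsp_stats(
    "Identity = 148/203 (72.91%), Postives = 183/203 (90.15%), Query Frame = 0"
)
print(stats)
```

Recomputing from the counts gives a quick consistency check against the percentages printed in the record.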

The following BLAST results are available for this feature:
ExPASy Swiss-Prot
Match Name | E-value | Identity (%) | Description
Q8W4H1 | 2.4e-122 | 50.98 | GATA transcription factor 26 OS=Arabidopsis thaliana OX=3702 GN=GATA26 PE=2 SV=1
Q5PP38 | 6.3e-99 | 45.12 | GATA transcription factor 27 OS=Arabidopsis thaliana OX=3702 GN=GATA27 PE=2 SV=1
Q9LD45 | 1.2e-78 | 72.91 | Bax inhibitor 1 OS=Arabidopsis thaliana OX=3702 GN=BI-1 PE=1 SV=1
Q9MBD8 | 1.3e-67 | 67.18 | Bax inhibitor 1 OS=Oryza sativa subsp. japonica OX=39947 GN=BI1 PE=2 SV=1
Q9IA79 | 3.7e-35 | 43.15 | Probable Bax inhibitor 1 OS=Paralichthys olivaceus OX=8255 GN=tmbim6 PE=2 SV=1

TAIR 10
Match Name | E-value | Identity (%) | Description
AT4G17570.3 | 2.5e-127 | 51.57 | GATA transcription factor 26
AT4G17570.2 | 2.5e-127 | 51.57 | GATA transcription factor 26
AT4G17570.1 | 1.7e-123 | 50.98 | GATA transcription factor 26
AT5G47140.1 | 4.4e-100 | 45.12 | GATA transcription factor 27
AT5G47120.1 | 8.7e-80 | 72.91 | BAX inhibitor 1

InterPro
Analysis Name: InterPro Annotations of Cucurbita maxima (Rimu) v1.1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
None | No IPR available | COILS | Coil | Coil | coord: 534..554
None | No IPR available | COILS | Coil | Coil | coord: 235..258
None | No IPR available | COILS | Coil | Coil | coord: 560..583
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 1128..1153
None | No IPR available | PANTHER | PTHR46855:SF14 | GATA TRANSCRIPTION FACTOR 26 | coord: 632..1153
None | No IPR available | CDD | cd10430 | BI-1 | coord: 287..471; e-value: 3.37367E-76; score: 248.291
None | No IPR available | SUPERFAMILY | 57716 | Glucocorticoid receptor-like (DNA-binding domain) | coord: 632..663
IPR000679 | Zinc finger, GATA-type | SMART | SM00401 | GATA_3 | coord: 626..673; e-value: 0.0049; score: 12.9
IPR000679 | Zinc finger, GATA-type | PFAM | PF00320 | GATA | coord: 633..659; e-value: 1.9E-9; score: 36.9
IPR000679 | Zinc finger, GATA-type | CDD | cd00202 | ZnF_GATA | coord: 633..669; e-value: 7.53915E-11; score: 56.6122
IPR013088 | Zinc finger, NHR/GATA-type | GENE3D | 3.30.50.10 | - | coord: 628..698; e-value: 2.0E-8; score: 36.2
IPR006214 | Bax inhibitor 1-related | PFAM | PF01027 | Bax1-I | coord: 294..468; e-value: 2.1E-27; score: 96.3
IPR028020 | ASX, DEUBAD domain | PFAM | PF13919 | ASXH | coord: 900..990; e-value: 1.4E-8; score: 34.9
IPR038108 | RPN13, DEUBAD domain superfamily | GENE3D | 1.10.2020.20 | - | coord: 906..992; e-value: 7.1E-7; score: 31.1
IPR044589 | GATA transcription factor 26/27 | PANTHER | PTHR46855 | OSJNBB0038F03.10 PROTEIN | coord: 632..1153
IPR044867 | DEUBAD domain | PROSITE | PS51916 | DEUBAD | coord: 912..1024; score: 14.293086
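
The `coord:` ranges above use the same protein coordinates as the `Query:` positions in the BLAST alignments, so the two can be compared directly. The sketch below is illustrative only: the helper names `parse_coord` and `overlaps` are assumptions, and the HSP ranges are copied by hand from the first and last `Query:` lines of the AT5G47120.1 and AT4G17570.3 alignments. It checks which Pfam domains fall inside which HSP.

```python
import re

COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def parse_coord(text):
    """Extract (start, end) from an InterPro 'coord: a..b' field, or None."""
    m = COORD.search(text)
    if m is None:
        return None
    return int(m.group(1)), int(m.group(2))


def overlaps(a, b):
    """True if two inclusive (start, end) ranges share at least one position."""
    return a[0] <= b[1] and b[0] <= a[1]


# Pfam rows from the InterPro table above.
domains = {
    "PF01027 Bax1-I": parse_coord("coord: 294..468"),
    "PF00320 GATA": parse_coord("coord: 633..659"),
    "PF13919 ASXH": parse_coord("coord: 900..990"),
}

# Query ranges of two HSPs, read off the alignments by hand.
hsps = {
    "AT5G47120.1 (BAX inhibitor 1)": (270, 473),
    "AT4G17570.3 (GATA transcription factor 26)": (633, 1130),
}

for hsp_name, hsp_range in hsps.items():
    hits = [name for name, rng in domains.items() if overlaps(rng, hsp_range)]
    print(hsp_name, "->", hits)
```

With these ranges, the Bax1-I domain lies within the BAX inhibitor 1 HSP, while the GATA and ASXH domains lie within the GATA transcription factor 26 HSP.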

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
CmaCh02G015690.1 | CmaCh02G015690.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0006355 | regulation of transcription, DNA-templated
cellular_component | GO:0005634 | nucleus
molecular_function | GO:0043565 | sequence-specific DNA binding
molecular_function | GO:0008270 | zinc ion binding