Tan0016231 (gene) Snake gourd v1

Overview
NameTan0016231
Typegene
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionprotein DMR6-LIKE OXYGENASE 2-like
LocationLG06: 27868197 .. 27891790 (-)
RNA-Seq ExpressionTan0016231
SyntenyTan0016231
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
ATTGGAAGAAGAAGCTGCCAATGGCTTCAGTCCAGAAGATTGCCAGCGTCAAATCCATTGCTGAAACACCTAACTTAGCCTCCATTCCCTCCACTTACATCTTCTCCGCCGCCGCCGGTGAAGAAACTGCTCCACAAGCCATGGAAGATTCAATCCCCACCGTTGATTTCGCTCTGCTCACAGCCGGTACCCCCGATCAACGGTCCAAAGTCGTCGACGAGCTTGGCAAGGCCTGTCGAGACTGGGGCTTCTTCATGGTGCGTGTGTAGATGCAAGTTAAATTATACCAAATATCTCTTACTTGAATGAATGAAGTTCGTTTTAATTTGGTTCTTAATGTTTTAACAGTTTTGTCTCTACCATTAATTTTTTGCTTTATTCAAAAATGTTCTTTTGAATGAATATTAAGAGATTTGAGAGTTGGATGGAAACATTTAACATATTAGGAACACAATTAAAACTACCTACATATTCGAGGACATTTTTCATAATATATACTTGGTTTTATCTCTTGTCTCTAACTTATGCATTAGGTTTTAAAAAATATTCCTTATGGATTTTTTTTTTGAGAAAACCTTATGGAAGTTTGTTGAACATAAGTTGTTGTGCATATATATATATATATATATATATATAATTTAGTTTTTGCCCCCTAATCTCATCTATACATTATAAAAGTTTTAAGGCTGAAGTTCTATTTTTGACCTTGAATTTTCAAGAGTTTTTTTTTTTTTACTAGCTTCGAGCCCCCCTTAACTTAAAAAAAAAGATATTTATATTAGTATTATATAATTTGTATTATAAACAAAAATTGGCCAATGACATGAATGACTCCTGATCCTCCTCCTACTTCCACATCTTCCAAATATTTGGCACTTAAGAAAAAAAATTATTATTATTAAAATACATTTGAATTTCTAAGACAGGTTCCTTTGAATAACTCTATCTACAAATATTTACGATTTATAAAAAATAAAATAATAACAAATATATACTATTATATTTTTAGATTTAATTTATTCTTTAAATTATAGTTTGTAACATGCTTGTATTGTAGTTTCAATTTTATCACAATTTAGTTTCTAAATTTTATTGTATAAGAATTTAAGTTTAGTTTATAATTGCGGTAAATTTTAAAATAAGTGCTGTAATTGTGTGTTTTAGAAAAATGAAATGTTGGTAAGTTTTAGTTATAAACCTACCAAAAAAAAATATATATATATATTTTCAGGTGATCAATCATGGAATAGCAGAGAGGGTGAGGAAGGAGATGATGGATTGTTGTAAAGAGTTTTTTGATCTGAGGGAGGAGGAGAAGAGAGTGTACGAAACAAAGCATGTACTCGACCCCATACGATACGGCACCAGCTTCAATCCTCACATAGAGAAAGTGTTGTTATGGAGAGATTATCTCAAAATCATGGTTCATCCTAACTTTCATTCTCCAACTAAACCCCCAACTTTCAGGTCTATTTCTTTTCTTCTCTCTTTTTCTATTCTATAAGAAATATTTTTGAAGTTATTTTATACTTGATGGATTTCAAATGAGTTTCGTTCATCATCCTTTAAAATAGAATGATTTTTTTTTGAAAAGATGCCCTAACGTTTGAAGTAAGTTTCAATTTTGCTACTTTTAAACTTTTAAAATTGTTTAAAAAGTATTACATCTTATGTTAAAAAATGAAAGATAGAAATACAACGCATGTGATGATGAAATTACATATGTGGTTGTGAGTTATATATTATTTAAAGTTTCAACTATGCCCCCTTAGTTATTATTTTAAAGTTTCATTCTAACTTTAAGTTTTGAGTACAACAACACATTTAAGGGAGGGATTCGAACGTTTAGTCATGAGTCATACTTGATGCCAGTTGAACTATGTTTTTGTTGACACTCTAACTTTAAGTTATTTTAATAAGAGTATCATTTTCGTTAAATTTCTAGTTAAAATGAAGACAAAAATGAGATGGGGGCAATAAGCAATAAAAAGAACTCAAGGATATTAGTTGCATTTCAAAATAGTGATAGAATTTAGAGGGAGGAGCAAATTTAGGGTTACGAGTAAAACGTTATTTGTGATCAATAGATGGAAATTAGATTTAGGAAGAAGAGAGGAGAAACAATTGAGAAATGCAAACCCAAGAAATTAATTGAATCTAGGGGACCACTATTTGTGATCAGTACCTACCCTTTTTAATTTTATGGACCATATTTTCGCGCACAATTTTTCTCCCACAAATTTATTATGATTCCACTTTTGTTCTGACCCTACCACTCTTCAATTTTTTCAGAGGGGCAATTTTCATAACGGATGTGGATGCTTGATTTTTTAGGGTTTTGGTACGGATAACCTACTTCCACGAATTTTAATGGTTGATGACCTGCATTCTGAAATTAATGAAATATGACTCATTCTAGCGTGACAATACACGCAAAATTACTCATGAGGGTAATATAAAACGATTTTTCTTGTTCACGCTCTCGTATACGGAAACGTTCTCGCTCTCGTTCTCTCGCTCTCACTCAAATGAGTGGTCGAAAATATGGAGAGAGAAAAAATAGGTCGGCAGCATAACAGCGCTATCCAGATGAATCCAAAATTTTTACTTAACCTGTTTAAGCAAGAAACACTTCTAAGGACTTTTTCAATGCCCGACCTCCCCCCAAACTACAACAAAATAGGCCAAGCTCCAGCAGAACACACAACAGAGGCAACGAAAGCACTAAACAAAAGTTAAACGGGCATGCTGAACAAGGAAACTAAATGAATAACGTCATGCTCAACAACCCAAAGAGTTCAGAAATCCAACGAGCAATAAACAGAAGTCATAGTTCAACTCATAGATACAAAGAGCTTCCTAAGTTCCAAAATAAAGGTTCCTCTAAGACAAGAATGGTCTCATAGCCCCGTCATGTCGCCACCGTCGCTCTCTAGTGAAGGGAATTAGAATCCACACAGCAAAGAAAGATTGATTGAATCGTCATGGTATCATCATGCAACGTGTTTATTAAAGCCGATCTACCTACTAGATCTAGGCTAAACATGCTAAAAATATACAAAGAAGAAAGTTAACTTACTTGTTGTAGATCGATTCAGCAACTTCTCCTCGCAGATCACGAACTCTTCGCGCTCCTCGATCTCCACGAACGAATCCCGAACACTACCAAAATAGATCTTCTTGGTTGTTCTCTTTTGGAGAAGGAAGTTGGTGGGAACTTTTTGTTTTTGGAAGAGGTGGAGAAATGAGAGAATTGAGAGAAAGCACTTTTGGATTCTTAGGTCAAAATCCAAAAAATGACAACTTGGCTTACCAAACCCATCTTACAAAGGGTTTAAATAGTGGAATGATGAAATTCACAACTTCACATAATTACCACTTTACCACAACCATATGTTATGTAAATCTCATTCACATAATTATATAATATAAGATATCTCATATCTTATGTTGCAAATCTCATTTGCAATAATTGTGGATTTAATCAAATTGAATCACATTCAATTTATTTTCTCTCTATCAATTTCTCCAATTAGCCTTTAATTCAACAATTAGGCTAACATATAGTTTATTATGAATCTCATTCATATTAAACTATATATTATATCATCTATATGATATAATTACTCTTAAAGTGAAATTGAACACTTCAATTTCATACCAAAACTTTAACCCTTCTTTATCCTAGATTGAGCCAACCAAGGGACCTAATGGACCTACAAAGGATGAGCTCCAATGATCCGAGGTTAATCACCAAACTCTTTGACCCGGCCAACCAATATTCATTAGCTACAACACACTCCACTAAAGCCTGTAGTCTGTACTCCCATCGATGTAGAGTATTACGTGTCCACCGATATAACCAATGCTCATGAGTCGACCCTTCACAAAGTTGTTCATACACACGTCAGGTCAAATTATCGCTTTACCCTCGTGTCTCATATCTTGTTCCTTAAGTCCCCACTACTCCTCTAATGAACAACATATTGTGATGGTCCAACCATACACAACACCCTTCTCGGGCTAGTGAGAAGGTGGGCGCCCGTTGTCCAAGCCACGAGACAACACTTAAGGAACAACCCTCTAGTTTCCCTATAGGCGGGAACGCAGTGAATTCCATCTCGCGTAATTAGGTTCCCACTCTCTACTTGGTTCTGTCCCGAGAAGATAGGCATATTGGGCGAGTGGATCGGGACCACCTCACCCGTACTAATCAAAGGATGGACCCGTGATGCGAGTCCGTAATACGCTCGAGGATTCGTGTCGAGTCACTAATGGTTATCTACGAATTTATTAGTCTTTTCTTTTATGATGTTATATCGATGAAGTCTAATAATTCACGATGCAGTCTTGTACAATCTCATTGTGCAGGATGCCCCACTCGCATGTCAACCACATGAACGAGTCGAGATCACCTCGTTTGTATCTAATACAAAGTGGGCGCATCCACGATGTGTATCCGAGGATTAGGTCTCCAACCCCATCCTTATATCAGAGCCGTTCGGGTCATTAACTCGAACGTGATCCTCCTTGTGTGTCAACTACACACCGCTCAAGTTCTAGTTTTCTCATATATTCAATGACCCAGAGCTTAGTTTATTGGAAAAGTTTAAAATATTTATGAGACACAAAAAGTGAAGAAAATAATAGCTCTTATTAATTTCAAAACAATATTTTACAAACACGAGATTAGGACAAAAATCCCAACATCCAGCTCACTGTTTGTCCTTACCTGCAACCAAAGACATTAAAGTAGACGGGTGAGTATAAAAATACTCAGTAAGTAGCCCACTCACGTCCAAGCTACAAATTTCACAAGAGCACACACATAGGTAAAACAAGTTCGCAACCAAAGCATCAGGGACTCAAGACTAGACGCTCACGTGTCTACTTCACCCCGAGACTCACCGACGATCTAGAAACCTAGGCGGCCATAGACGGGGACGCACCCCAACGCGAAAGAGTAATCACTCTAACGCCCACACAGAACGCAATAGAGCCTCGACTCTAAAGCCCACACAGATCGCATAAGGGCGACCCACCCAAGTGCCCACACAGAGATCACATAGGGTGCTCGACCCCATGTCCACACAGCTCGCAAAAGGGCGCACTTGACCCTAATGCCCACACAGTTTGCAAAAAGGCGCACTCGACCCTAATGCCCACACAAAGATGCCAGCCTAAAGGTTTTGTCTCGCTCGGGGTCACTCAGTGGGTTTGGTAACATTAGCCTAGGAAGTCAAGCTCAACGTTTCTCCTAGGCTTCCCCTAAACCCTTCCGAGGTCGTCCTGAAACTGAGTCCCTAACCAACAGCCCAACCAGTCAACCACACAGCCCGCCAACCAATCTTTGGCTCGACCACCAAGCTAACCACCCATCTTTTACCTTAACCACCACACAGCTTAACTGCTCGGTTCTAGGGTCAAGAACCCAACGTGCACTACCCCACCGTCGCGAGTCTCACTTAGCAACCAACCACGAGTCAAAAGCAACAAATTCATGATACACAACAGAGTCAAACCCAACACTCAGAAGCAAGTACAAAATATGCACATAACAAACAGGATAATCAGGGTCTCATGCTCCAAGCAACCATTAACATTCACACAACAGTTATAAAGTCAACCACGACGACCCTATACTCAAACTCATTCGCTTACGAGTTTGATACTTAAACCAACAGCTACTTACCTCGAATAAAGTGCGCAAATGGGAGGTCCTTACTTCTGCTAGCAGCCCTAAGCACCTACAGAACATCACAAACGATGCATAAGTTCAGTTGGGAACGAACCTCATGGCCCTACCCTTAAGGCACGACACAAAAGCGAACCCAACCGAGCTAACCGAACACCTTAAACTCATCCAACAGGAGCACATAGCCAAACCAACACATCCTAAGCTTACAATTCAGAACAAAAGATTCGAGAGCAAGTCTTACTCACAGTTACAAGCTGAACCGATTCTCCGCTCGATCTTAGCCTCGAGATTTCTTCCAAAGACCTTACTCCTTAGCTGAAGGATTTGTTCCTGTCCATCAAAACTACACAAAACAGAGAGTGAGAATAATTCGATTAGAGGCAGATCGGTTCGGATCCGGGAGAGGAAAAGAAACGAACTGAGAGGGAGAGGAAACCGAGGGTGAGAGAGAGGGAACGACGGCTAGGTTCCCTGCGAAGAAGAAAGGTGATCGAAAGGGGCGCGCGGCTGACTTTCAATGGGTGTGGGTCTTGATGAACGACAAATAGATGGAGAGAGGGAAAGGGATTGGTGGACGGGGAGGAGGCCGGAAGAGGGAGGGAAGGTATAATGAGAGAGGGACGGATCGAGAGGAAGAAAGGCAGCGGATCGATCGCGGCAGGGACCGAAAACGAGGGAGACGGAGAGGGAGGGAGATCGGGGCGAACGGGCGGCGAGAGAGATGGAGAGAGGGAAGGCGTCGAGGCCGTCGGCTCGGCCTCGACGGACGAGGGCGCGCGGCACCTCTGCGCGACGAAGGAGAGCGCGAAGGGGGGGTGGCAGCTCTCGCGAACTCCTCTCCCTTTCGTCTTCTTTTTTTTTTTTAACAAAACCTAAAACAAGTGAAAATGGAGGAAGGACCAAAATTACCTTGTTTCGCCTCTCCAAAGACGCTCGATGAGAAACTCAAGGCTGCCAACCCAATTTAAAGCGAGATTTAGAAATATTTGGGTCCCTGCAAGATCCAAAAATCAATTTCTTGGCTCAACACTTTTAAAACTAAAACATAAGTCCAAAATTAACTCTGAAAAGAACCAATAACTTACTTGAAATTTACCGGGCGTCACAGAGCGAAAGAGGGAGAGTGCGAGACCTATAGTGTGAAAAAAAGTAAGAGAAAGCTAGAGTAGCAGAAGTAGAGCGAGAGCGTGAAAGCGAGCGAGAATTATAGTGCGATTTTTCTTGTTTTTTTCTTTTCTCCAAAATTTCGACAACCTCATTACTATCATGAGCGAGAGCAAGACTTAAGTGAGAACGAGTCATAAACAAGAGTGAGTCAGAGTATGATGCTCTTATGGTAATTTGGACAGAATGCACATAGGGGTTTTTGTACGGATGACCTCTTTTCGGTTGGATTATCGATAAAAGACCCACATTTTAGATTTAACTACAAATGACCCTTTGATGTCGTCAGCCACTCTCGATCTAGGTACATGTTGGTTATTTTAGCATCATAAATGACTACCGGTCTTGTTCTCCCGTCAACTCCGAAACTATGCTAACGGAACGCTTGATAACGTCATAACTAGAGCTAGAGTGCCACAAGTAAATTGTAGGTATCACTCTCGCTCTAGTTACTTGTGACACTCTAGTTGTAGTTCAAATTAACATAACCTCTATACTTTATGTTTTTTGTATTTTCTAAATACTTTAGGTCAACATATATCACTTATTTAAATTGTACGATTCATACGTTGAAAATTTTTTAATGTTTATTTCAGATCAAAACATTAACTAGCAAAATAAGTTAAAGATGCAAAATTTATAAAGTTGAAAATTTTAAGAATACATAAATAACTCTCACACAAAACTAATTGGTAATAATATTTCAACATCAATTCATATTATAAACGAAATTATCAATAATTCTAAAATATTATAAGATTCATAAATCAAGAATAGATTTAAAAGAAATTCAATCTATAAATTCAACAAAAGTCTAATTCTTTAATAAAAGAAAAGTTAACTAGAGTGAGAGTGTCATAGGTAACAAGAGTTAGAGTGTCATAGGTAACTAGAGTGAGAGTGTCATATGTAACTAGAGTGAGAGTGTCACATGTGACTAGAGCTAGAGTGTCATAGGTAACTAGAGCGAGAGTGACACAAGTAACTAGAGTGAAAGTGACACCTACAATTTGTTGTGCACATCCGAGACCGAGAGTCTCGAAATGCCATGTCGTATTGACTTACCATCACGGGATTTCGAGACTCTCGATCTCGGATGAACACAACAAATTGTAGGTGTCACTTTCACTCTAGTTACATGTGACACTCTCGCTCTAGTTACCTATGACACTCTCACTCTAGTTACATGTGACACTCTCACTCTAGTTACCTGTGACACTCTCACTCTAGTTACCTGTGGCATTCTCACTCTAGTTACCGTTGGAAAATCTTTAAAAGAATGTGTTGTTCTTTAGGAACACACGTCCCGAAAAAAGTGTTCCGAAGGAAATCGTATTGACTTAGCATCACGGGATTTCGAGACTCTCGGTCTCGATGAACACAAAAAATTGTAGGTGTCACTCTCACTCTAGTTACTTGTGGCACTCTCACTCTAGTTACTTGTGGCACTCTCGCTCTAGTTACATGTGACACTCTCACTCTAGTTACATGTGACACTCTCACTCTAGTTACCTATGACACTCTCACTCTAGTTAACTTTTCTTATTAAAGAATTAGACTTTTGTTCAATTTATAGATTGAATTTCTTTTAAATCTATTCTTGATTTATGAATCTTATAATATTTTGAATTATTGATAATTTCGTTTATAATATGAATTGATGTTGAAATATTATTACCAATTAGTTTTGTGTGAGAGTTATTTATGTATTCTTAAAATTTTCAACTTTATAAATTTTGCATCTTTAACTTATTTTGCTAGTTAATGTTTTGATCTGAAATAAACATTAAAAAAATTTCAACGTATGAATCGTACAATTTAAATAAGTGATATATGTTGACCTAAAGTATTTAGAAAATACAAAAAACATAAAGTTTAGAGGTTATGTTAATTTGAACTAGAGGAGTGTCACAAGTAACTAGAGCGAGAGTGATACCTACAATTTACTTGTGGCACTCTAGCTCTAGTTATGACGTTATCAAGCGTTCCGTTAGCATAGTTTAGGAGTTGATGGGAGAACAAGACCGGTAGTCATTTATGATGCTAAAATAACCAACATATACCTAGATCGAGAGTGGCTGACGACATCAAAGGGTCATTTGTAGTTAAATCTAAAATGTGGGTCTTCTATCGATAATCCAACCGAAAAGAAGTATCCGTACAAAAACCCCATGCACATATATAAATGGTCATATGTCATTAATTTAAAAACATGAGTCAAATCACATTAATGCGCGGGAAAAAGAGTCAAATTCCTCCTTTTTTTTTTTTTTTTTCCAATGTGTGACTTTTCTTTCATAGTACATGAGGATTTTTTTCACTGTGCATGACGTCCTTTAAACAACGTCACCCTTTTTCTTTTCCCTTGAAGTGACACCATTTTAAATGTGAATAAAACCTTTTCAAAAAAAAAAAGTGACTAAAACGTTTTGCTTTTTTTTTTTTTACCGTACGTTCATTACCAATATCTGTCCACGTATAACCTCCATTTTACAACTCGCTTGCAACATCACGTTTTTGTCCTCATTTTATAAAGGTTATGAAAGGTTTAATGAAATTAATTATACTTTAAAACTATATGGACATAATTGGTTTTTTCTTAGTATATATTACAACTTTAAATCATATTTTGTCTTCATTATGAAAGGAGTTCTTTTTAGTATATATTATTTGCTAACTTTAAATGTTAAGGTTTAAAATGAGACAAAATCATACATTTACCCGAAAGCTTTCAATATGAACATAACTCAATTATGTACTATCGACCAATATGTTAGAGGTTCGAATCCTCCATCCTCATATTGTCAAAATTAATTTATATCAAAATTAAAGGGCATTTATATAATTTAACCGATAATGATCCATACTTTTTTCATTCATGGTAAGCCATATGTTAGCAACATTTATAGTCAATTTTACGAATTTGAGTGATAAGTTTAAGCTTTAATGGATTTCAAATAAATTTTATATGGACCTTGGAAATAGAATGATCCGATTAAAATATAGTAACTTAAAAAATGTTTTTAACAAAATAATTTTTTATTTAATAAATTTAGTTTTGCTGATTTTCTTCTTTCCTTTTTTTTTCATGAAATATTTTGGAGTATTATTTTATTTTGAGGAAGGAGCATTATTTTATTAAAAGACGAAATATCATAAAAGATTAAAATTAAAATATATCATATGATATGATAATTTGAAATCTTATTATGTTTCATATAATAATAATTTAAAAAGCAAATTGAATATTATAGGTGTAATATCTTGGTTTTTTTTTGTGGAAAAAAATGCATTATTAATATTAAAATCAATAAAAATAAGTTTAAAATTTGATAATATAAAAATACCTTTTTAAAAGAATGATTTCAAATGTAACACACATATTGTATTAATCCCTATTAGAGATGTTTTCAAGTTTTTTTTTAGTAATTACTATATATTTGTTATTGTTTTTATTTTAAAATTTTATGCATGATTAGTATTGTTTAATTTAATTCTTACGAAATAAATCTAATAAGAAGTAATGTGGTTTTAAGATTAGAAGTAATCTCGTTTTAAAATTTTTAAAATGAAAGTTTAATATTTTTTAGTAATTAACTAAAAAAAATAATTAACCAAAAAGGAAATATTGAATCGTTTTACCAATATCATTTTTAAAATGAACATTTTTTTAATTGTTTAATAAAATTTTGTAAATTAGTCACCATTGCAAATACTTCTTTTCATATAATATTTGCCACTATACATTAAATTTTAGGGGAATTATAATCGGTAGCAAAAAATAGTAGGATATTTGCAAATCTAGCCATAGTTTTTTAGATTTACAAATATAACAAAATTTTTTAAAATCTAATTTTCGTTTTTTTGTTATTCCAATTTTACTCTCATTGGTGTATCAGTAGGATATCTGATATACTTAATATTTGGTATATCAGTAGGATATCTAATATACTTAATATTTGTATATCAATAGTATATAAGTAGTATATTTGATATACTTTATATTTGGTATATCAATGGTATATCAGTAGGATATCTGATATACTTAATAATTGGTATATCAATAGTGTATCTGATATACTTTATATTTGATATATATTAATCAACTAGTATATACCTATATATTTGTAACGCCATATGCCTAGGATTTAGATCAAGATTTGGAATCCGGATTTGACCCTAACATGGGATTTCTCCAAGCCAAGCACGCTTAGCTTTGGAGTTCCTATGATTGAACCATAGAAAATTAGGAAGATGCATCTTGTTGGTATATCAATTCTTGTAAGCCTTTTTCATCCATATTTTCATAACTACGAGATCGCTCTCATTCAGATGTGGTCTCGGTTCATTTATGTACCCTTCTTCTTCGAATATCACAATATTAGTTTTATATTACTGATATACTATCGATACACTTCTTTTTAGGTATCATTTAAATGATATACTTTATATATATCATAGCGTATTATGAATATTTTCCATATTTGATATATCACTGATATACTTCATTTTAGGTATATTTATTTCATTGATATCTATAATATTGAGTATATCAGTAATATATCAGTAATATATTATCGAGTATACTAAATATGAATTTCTCATATACTACTAGCCAAATGTATCAATTATAAAAAAGTCTATTGACTAAAACTAGTAAATACAAACCATTAATAAAATAAAAAACTAAAATAAAGTATATCAGTAGTATACCAATGATATACTTAATATTAATTTATGACTTGTATATGAGTTATTGATAGACTATTGATAAACTTTATACAAATATCCGTAGTATATCATTGATATATTTTATATTACGTATGTCTAATTTCATTTTAATATAAGTGAAATGTATCAGTAGCAGATCACTAATATACTTCATATTAGGTATATTTGATACATTGGATATAAATCCCTATTATACTCTTGATATACTTCATATTAGGTATTTTTGATATACTATTAATATACTAAGTACCATAAATTCAAGTTCATCGATATATTTAAGTAAAGTGTATCAGTAATATATCACTGATATACTTCATATTAGGTATATCGTGGGTATATCAACAATATATCATTGATAGACGATTGATATACGTCATATTACAATCACGCAAGATTACACAGAACGAGTATTTTGGACATTTCGTTGACTCCAATTGGGTTGGATCAATTTGTTTTGCTATATTTGCAATAGAAACAAATGGTAGACATGCTTGCAATTGGACATTTATTTTTTTTGCCATTTGTGCAACTGGCCCTAAATTTTATTTATGTTCTTACAACTATCTTTATATCCTCCAAAATATTATTTTCATGATATAAAATTTGGATAAATAAAAACACATTTCTCATAAAAGATTATTCATACATAGTAGGTCAAAGAATATACCATGTATACATAATTTTATTTTCCATCTAATTTTTATTCTACAAAGAATGGTTCATAATGGTCTTTTCCTTCACTTGTAAACAAGTATAGCAAATCAAAGAGGGAGAGTTACACAAGAGAGCAATCTTTTTTTTCTCCTCTTGTGTATGTTTTAATAAGTTGTTTTTTTCAAGAGAACCATTTTTTTCTTTTTTTTCATTTGAATTAAAGTATCATTTTGGTCTCGTACTTTGGATATTTTATTTTATTTTAGTCATTGTACTTTCAAATTTTCAAATTTAATATGCGTACTTTTATAAAGTTTAAAATTATTCTCTACTATTAGTTTATATAATTGTTAACTTTTCTAAAGAAAAATTGTTTCTTATTTTTAGATTGAGTGAATCAATATCTATTTGTTTTTTTAAAACACTTTTTTTTTACTATGGGTGATCTAAACCATAGAAATTATGAAAACCACATATTCATTGTTTTCAAATATGTTTTTTTTTCTTTTTTTAGTACATCAATATTTGGAGGTGGGAAGATTCGAACCCATGACTTGTTAGTCATAAGTCTTACTTGATGTAAATTAAGCTATGCTTTTGTTGGCAAATATGTTTTTTTATTTCAAGTTTTATGTGGAGTAAAGATTTGAACTTCGACCTTTTAGTGGATAATATATGTTTTAATTAGTTGAGTATATTCAAAATGATTTGAAATTTGAATATAGAATTGCACTAAAAAAAGATAAAAGCTAATTCCATAAATATACTCTTCTTACTGTAAACTCATTCCATTTTTTATAAACTAAAATATATGTATATAACATAAAATATTTTAAATTATATAATATTGTAAAATAATCACGATATACAAAATTGATAACATAACATATGTTAATATGTTTATTAGTAATATTTGAGATCCAATCCAAATTAAATATATATATAGTTCGTAATTATCATTATCTATTATTTGGTATTGTCAATTGAACATAGTCAATTAATGCATATATCATTGACCAAAAGGTCAATGATTAAAATCTTCGCACCAACATGTAAACAACACTAAAAAAAAAATCACGTGTAAATATTGAAAAGATTAAAAAACAAACCAGCCCTATTGTTTAAAGAATCACTTGTTTTCAAAATTCTATTTTTCAATCAGTACGTTAATTTTTTTACGACCTAGGTTGCAAGAAAAGTAAAATTTATTTACATATATAGAAGCATAACGACGTGTTTGGCTTTGATTCGTCGCAGAGAGATCTCGAACGAGTACTGCAGAAGAGTTCGAGAAATGGCAAGAGAGTTGATGAGAGGGATCTCAGAGAGCTTGGGATTAGAAAGGTGCAATTTGGAGAGGGCAGTGAATTGGGAAAAATGTTCACAAATTCTTATTGCAAACTTATATCCACCATGTCCACAGCCAGAGCTTGCAATGGGTTTGTCCCCTCACTCTGATCATTGCCTTATCACTGTTCTTCTTCAAAACCAAATTGATGGCCTTCAAATCTTGCATGATGGTAAGTGGGTGAATGTCAATCCCATTCCCAACTCATTTTTGGTCAACACTGCTGATCAACTTGAGGTATTTTTCCTTCTCTTAATCTTCTTGTTTATTTATGACTTTGTGTTTTTTTTCTCTTTCTTTTTTAAATCACCACCACACAAAAGACAAAGATGAAAAAACCAAACTTGATTTTGAGAATGATATGAGATGTTGATATTGAGAATTAATATTATAGATGTGTTTTGGGATACTTCTAGGCTGAGGTATTTTGATTATAAACTCGAGCTATTCTATATATATATATATATAGAAGGATGAATTTAAACGAAACTATTTTGAGACAATGTTAAGAGCATGTTTGGGAGTTATTTTAAAATGGTTAAAATCATTTTTATCATATTCAAAATTATTCCAAAACATACCTTTAATCATCAAAATCAACTAATTAATGTTTGAATTTACATTTTTTAGAATTGTTTGGAGCATTGAGTGGATTATAATAAGATGTGTTATAATAGTTTATAGGTTATAATAATCTGTGGAATTATATAATATTATTTAAAATGAAGATAGTTGTTTTATTTGAGATTATGATAATCTGTGTTTGGGGTGTAAAGTACTTCGTTATTATAACCCACGAAGTTATCTTCAATTTTCTTATATAAAAATTAATTTTGAATAATTAAAAGCATGTTTTGAAGTGATTAAAAGTGATTTTAATCATTTCATAACATTATACCCCCTAACTTTGACTTTTCTTAAAAATACCCGAGTTATCTTCAACTCTACATATATTTTTGTCTTAACTCTCCATATGTTTCTCTTTCTCCTACCCATCCCTCCCTAACTTTACGTGTGTCTTGCTCTCAACTCTACTCTCTCTCCCTCATCTCCATGTTTTCTTTATTGATAAATAATTCAATAAGACCGCCACAAATATAATTATTTTATAATAATCGTTGGATGTTGATTCAATAATTTTTATGTCTCATAATAACTACTTCTATAAATTTAAGATATTTCTAAATTTTACCGTTAAATAGTGTGACTAAGATATTTCTAAATTTTACAGTTAAATAGTGTTAGTCACATTGTAGTGGCTTAATAATTTCCCTTTATCTCTCTCATTCTCCAACCACATCCCATCCCATCCAAATTCTCTCTCTCGCATCAACTTATTCTCTCCCTCCAATCAAATTACTCATAATTCTAAAACTAATGATTTCACAATTTTATTTTAAAACGTTTGTTAATTTATTCCAAAAATTAATTAAAAATTTTATGACTTTACTTTTATTAATAATTAATAATGGGATTTTAAATTCCTAGAATATTAATAGCTAACGTCCCGTTGAAAACAATTTAGTTTTTTATTTTTAAAAGTTAAGTCTATATATATACTACTTTCACACTTAGGTTTCCTTGTTTTGTTATTTATTTTACTTATGTTTTCAAAAATCAAGATAGGTTTTGAAATTTGGCTAAGAGTTCAAATAATGTTTTCTTAAGAATGATGAAACTCATATATGTAAAGAAATGATGAGAAAATAAGTACTATTTTTAAAAAGAAAAACTAAAAACTAAATAGTTATGTAATGGGGTTAAATACTTTTTTTAAGAAAAGTATAAAAATATATTTTAATTAATAATTTAAATGATTTCGTAAAGTTCGACAAATAACAAGTTTTATTAATTTATTTAAAAAATTAATTATGTAGCATATAAAACTTTATCTTTAGTTCTAATTAATAATAGAGGCTTTGCTTGATAACCATTTGGTTTTTTTTTTTTTTTTGGTTTTTGAAAATTAATTAAGCCTAAGAATACTATTTTTACCAATGTGTTTCTATGTTTTATTATCCACCTCGTACCTAGCTTTTCAAAAAAGAAAGTTAAGTTTTGAAAACTAAATAAAATAGTTTTCAAAAACTTGTTTTTACTTTTGAAATTTGACTAGAAACTTAAATGGTACCTTAAGAGATATGGAAATGATTGTAGAGAACATGAAAAGAAATTGTTGAAAAACAAGCATAATTTTCAAAAACTAAAAACCAATAAATGGGTTATCAAATAGGGTCAAAACTTTGGTTTTAAATTTTTGGTTTTTAAAAATTAAGCCTATAAGTACTACTTCTCCTATGAATTTATTTGTTTTATTATCTCCTTTTTACATATATTTCAAAAATCAAACCAAGTTTTGAAAATTATATAAAAAAATAGTTTTTAAAAACTTTTTTTTTTTTAGAATCTAACTAAAAATGTTTCTTATTTTTGGAGAAATTGAGGGAAAAACATGCTTAATTTTTTTTTTTTAAAATGTTATGAAACACGACTTAAATTGAAAATTTTAATAGTTTTGTAATATTCTTAAACTCCTACAACTATATAAAAAGCTCTCTAAAAAAAATAAATTGTCTTCTTCTCTTAGTTTTTTTCTTTCTTCCTCAATTCAATTTTATTAAAATTTTCTAGATTTTGTGTGATAAAACATAATAGCCTAGTGTCACAACCACACTAGAAAAGAGTTGGTTGTGACTCTCACAGGCGCAGACAGTTGCAAGGGTGGCACGCGAGTGTCGCCTGCCAGCCCATACGAGGGTCGAGAGAAAACAAGTGCCATGATGCGATCGAAGAGGACGGTGCATCATCTAACACACCAAGAACAGTGGGCATGAATGCCAAGTTGTGAAACAAGTATCACGGGAAGAAACTATGTCGTCAGTGGTGTCAGTGACACGCCTAGTGGGAGCCAGATGGGGCCCCAGGCCATACCCTCGAAATTCAGGGTTCCATCGTGAGTTTTGACGAGCGGCTCTGGAGTTAAGTCCGTATATAGGCAAGATGGAAGCTCACCAGATTTCGTGTCAATCGGAGTCCAGACGAGCACTCTACGCTGGCATATGACTCATCGCCAAAATGACGTGCATGGGTGTCAAAGCTAATATTTCACATAATCTACTATAGGGGGAGGCATGGTGTTGGCTCTCCACCACCCCTGGGCCTCATGGCCTAAGTGAGCTACCCCATGACACATGCAAGTGTATCATGGTTGGGACGAGTGTCTTGGGGCGTTTTCCTATGGAATGGGCTTTCCAAGGACAGTGTCGGGTGGCACGGGTATACCGACATTGCGTGCTTCTCCATGAGCTAGAGGTTGCACCATAAGGCCTGGTAGCGGCACGCGATACGAAAATGACTCAGAATGGATTTGGAAGGGTCCATGCCCCGATGACTCGAATGGATGGATGTCCAAGAGGCCCCGATCATGAAGTTAAGCCATAGAAGGTTTCGATAGCATGCCCGAGCAAGCAGCTACGTTTGATGTCCTCCCGATGCTCGGGATGGCATCACTATTCATCAGTGGGACCCGATAGTTTTCAGTTGGCTCGATATTTGCCAGTTATGTCTCGGTACGCGTTATAATGGCCCGAAAAGCTCCAGCTTGCAATGTTTGCCCCCGGTAAGCATTATAATGCCCCGGTAAGTCCCGAAATCACCCGACAAGCTCCGATTGAGTGTTTTTAGGCTGTTTTGGCCCTATGAAGGGTCCGAATGAATCTCGGTAAAGTCTTTGATCGGCGTGAGACCTTAGTGGACATGTCTAAGAGCTTGAAGATGATGTCTTGAGAAGCGATAGAGCTTTCTAATGACCTTGGCCTAGTGTAGAGGCAAGGTTAAGCCTTGGGACTTGGCCTAGAGTAGAGGCAAGTCGGCTAGAGTCACGTTCGAGTGAAAATACACATACGGACGACGTGTATGACCAAATGTTATTCCGTTGCATGTTGTAAAAAAGCTTGGTTGAGGGAAAGGAGTTCGCCTAAGGTCCGACGAAGCCACAAAAGGTTGCGATCAGAGAGACGGGGCTTGAGTTCCCCTTAAGGCTGACCATGATTGAGCGAAAAATTATGGCGCCGCACGGTTGAAGACCGTGACATTTGGTATTAGAGCTCTCTTTTGTGCGTGTAGTCGAACAAGTCAAACAAAGACAGACCTAGGATCGTGTTTAAGACTTCACAAGTGCGTCAAGAGTGCCAAGAAAACGACAGGGCAGTTGTGTTGTCGGTATCAGAGAAAGTATGTGGGTCAAAGCCGGACTCGTTGAGTGAGTTCAACCCGTCAATGAGCAAGAACCAAGTCTATTTGCCAAGGGCTTGGGGTACGTTGTGCTAAAGAGAGTTGTTCAGAAGGTGTACCAGGTAGGGTGCCTTCCAAAGAGCCATTTCCATCAGAGGGATAATGTGAATGGTTGCAAGCAAGAAGGTTGTAGTCAGTGCAACATGTGCCGGTCAAGTGCGAAGTTGTGCAGATTGCAAGTCCATGTGGGTGGACGTCTTGGCTGAGTAGAGCCAGTTGAGTTGGGTGGGAGAAGGCCCGAGAGAATGGGTTGCTCGTCCCGTGAGAGTGCTCGTAAGCTAGAGCACATGTGTCCCTTAAGTCATTCCACTGGGCCTAGGTGGGTGACTTCACAGAGTTTTCGAGACCAGTATTAGCTCATGTTGGAGCATCAGACCGGAGACAAGAGCAAGAAGAGAGAGAAGCTGCAGCAAGATCAGAGAAGCATCTTTGCAGCGTTGCCACATGAGGGTGGCCAGTTGAGAGTTGGCTGAAGGGGAGTGCCAACAGAGAAGAGGGTGTTGTCAGTGCGCTTTCAGACTTTGCTACCGTGAGAGTTGGTGCCACGATGAGTCCATTTCGTGCACCGTTATGTGCCGCTCAGAGAATATATGATGTGGTGGCCAGAGTGGCCAAGTGAGTTTAGGGCGCGCTCCCCGTGACTAGGTAAGAGAATGCCTATGCCCTTTGCCATGTCCTGTCTGGTGATTCCCGGGTCGAGTCCTAAAGTTGGGCGAGGTAGGCCCCAGAGAGGGATAGCCCACAGGGTGGCAGTTGTGTCGTGCTTGTGTTTCTGGTCGTCGGGCCACGTCCTAGAGTTAGGCGCAGTAGGTCCCTGGGGTGGTTCAGTTCATGAGTAGTTGACAGTTTTCGTATTCGATCTGATGCACGATTATATTCAGCAAGTCCGTCAGTTGCAAGCCCGAGAAACGGATCGTCATCGGGACGATGTCGAGTTTAAGTGGGAGAGTGTGTTACAATCCATTGTGTGATGATTTGGATTGAACCCCCGAGTAAGTCATCGAGGACGATGACTAGTTTAAGTGGGGGAGAGTGTCACAACCACACTAGAAGAGAGTTGGTTGTGACTCCCACAGGCGCAGACAGTTGCAAGGGTGGCACGCGAGTGTCGCCCGCCATGCCATGCGAGGGCCGAGAGAAGACAAGTGCCATGATGCGACCGAGGAGGACGGTGCATCATCTAACACACCAAGAACAGTGGGCATGAATGCCAAGTTGTGAAACAAGCATCGCGGGAAGAAAGTATGTCATCAGTGGTGTTAGTGACACGCCTAGTGGGGGCCAGATGAGGCCTCGGGCCATACCCTCGAAAGTCGGGGTTCCGTCGTGAGTTTTGACGAGCGGCTCTGGAGTTAAGTCCGTATATGGGCAAGATGGAAGCTCGCCACGTGTCAATCGGAGTCCAGACAAGCGCTCTATGCTGGCGTATGTCTACTCGGCAAAATGACGTTCATGATTGTCAAAACTAATATTTTACATAATCTACTATAGGGGGAAGCATGGTGTTTGCTCTTCACCACCCCTGGGCCTCATGACCTAAGTGAGCTACCCCATGGCACATGTAAGTGTGTCATGGTTGGGGCGAGCGTGTCAAGACGAGTGTCTTGGGCGTCTTCCTATGGAATGGGTTTTTCAAGGACAGTGTCGGGCGACACGGGTGTGCCGACATTGCGCCCTTCTCCATGAGCTAGAGGTTGCACCATAGGGCCCGGTAGCGGCACGCGATACGAAAATGACTCGACATGGATTTGGAAGAGTCCATGCCCCAGTGACTCGGATTGATGGATTTCCGAGAGGCCCCAATCATGAAGTTAAGCCATGGGAAGATTTCAATAGCATGTCCGAGCGAGCAGCTAAGTCTGATGACCTCTGGATGCTCGGGATGGCATCATTATTCATCAGTGGGACCCGATAGTTTCCAGTTGGCTCGATATTTGCCCGTTATGTCCCGGTAAGCGTTATAATGGCTCGAAAAGCTTCGATTTGCAATGTTTGCTCCCGGTAAGCATTATAATGCCCCAGTAAGTCCTGGTAAGTCCCGAAATCACCCGGTAAGCTCCGATTGAGTGTTTTAAGGCCGTTTTGGCCCTATGAAGGGTCCGAATGAGTCTCAGTAAAGTCCGGATCGACGTAAGACCTTTGTGGACATGTCTAAGAGCTTGAAGATGATGTCTTGAGAAGCGACAGAGCTTTCTAATGACCTTGACCTAGTGTAGAGGCAAGGTTAAGCCTTGAGACTTGGCCTAGAGTAGAGGCAAGTCGGCTAGAGTCGCGTTCGAGTGAAAGTACACATACGGGCGACGTGTATGACCAAGTTTTATTCTGCTGCATGTTGTAAAGAAGCTTGGTTGAGGGAAAGGACCTTGCCTATGGTCCAACGAAGCCACAAAAGGTTGCGATCAGAGAGACGGGGCTTGAGTTCCCCTTAAGGATGGCCATGATTGAGCAAAAAATTATGGCACCGCACGGTTGAAGAGTAACGACCGTGATACCTAGCACACTAGTGTGGTAGAACATAATAGCCTAGCACACTAGTGTGTTTTTATAAGAATTACAAAGTAACGTCATCTAGAAAATTATACAATACCGTATAAAAATAGAAGATAAAGATGTTCAAAAAAACAATCTCAACACTGGTATAAGAAGTTTTAAAGTTATTTCATATTTGAAAAATATATATTTCATAATTAAGATGTTTTATTATTGTTTTTTTTTTGGAAGTTATGATGGGAAAAAACCTCTCACTATTACCGAAGAAATAAGCAAAAAATAAAAAATTAAAATAATAACAGATAAATTAAAATAAAACAACAAAATAAGAAATTAAGAATTTACGAGAAAAACTTCAAATTCGAAGAAAAAGATATGGACAGAAAGAAACTTTGCTATATGAAAAATTATTACAATAACACATAATTCTCACTCCAATCTCAATTACAATCCAACCTCAATTACAAAAATTCTCTCCTAAAGCTTTATATCACTCACTCTCTTTTCCCCACGCTCTCAACACAAGAGAATACAAAGAGAGTATTTAATTAGAGCTAGTACATCAAGTTTAAAGTGTTTTTAACTGAGATACTTTAAAACCAAAGATATAGACTCATTTTAGAGTACTTGGAGCCCATTAGTTCCTTGCACATTTTCGATGTGAGATAATTTCACTTCTAATATATTGTCAAAACTCTTATAAATTCCACCTTACAAGATATTTGAAGAATTCACAACTCTTGATGCATCATTGTCTTTATTGACAATCACAATTCTCACCCTAAAGAAGTATAACCTACTCCACCGTAAAAGTAAACCACTTAGAATGTCACATTTCAAGACTTTTTTTTTCGTCTCATGTCGATAAACCTTACTGAAATTTATGGTGGAACTTTCACCTACTTGGTTCATCTGGAAGTTCTTCAGCCATCGACATAACGTCCACCATACATACTTTGCAAAATCGCTAACCGAATGCCCGTGTGCAAATCTGTGGAACTGTTAGCATCACATCCTCTATCATGAAAAAGCGGATATACCATGAGGATGAATTCTGCTTTTCTTCAGAAGAAGTCACCATCCCTCCTGACAAGAGAGACCACGTTAGCATCTGACCTAAAGCATTTTGACAAATTTGTCTTGTTAGGACCATTCTTTTTCATGTGTCCAAAGACTATCCATATTCCCAACAATCACTCTTTTGCATAAAGTTTTTCTACTTTTTCCAATTACTAACTACCAATGTCAAATCCTCTTGTGGCATTTTCTTTCCTCTGAGAGAAGTTTACTAGTAGCCTCTACAAACTTCAAAATCTCATTGTTGTACATCAAGATTGAATTCATGTATTCGTTAGAAGGCAAAAGTGACAAGATCAGCCTAAGAGTCTTATCCTCATCCTTTATTTTCACTTCGATCACCTCCAGCTCAGAGATGGTGCCATTGAGAACACTCAGACGATTTGAGATTTTCATACATTCCTCCATTCACAACGTGTACAACCGATTTGAGATGCTCTTTGACTGATACATCCCTTGAACTTCTCCCAAAGCTCTTTGGTTGTCGAAATTTAATGCACATTTGCAAGAATATTCTTATTAGCTAAATTTAGCTTGATTGCACTTGCAACTCTCATATCTGATTCATACCATTCTTCATTACTGAGGCTGGACTTCTTAGAACTTCAGCTGGAACCACCACAGGACTCCGTTGGACCACCATCACCGCTTAACTTTTCAAGAATGCTACCACTCAATCTTCCCTTCAAGTTGAAGTTCATCATTCCATCAAATTTCTCCACGTCAAGCTTTATGAGTCCAGTAAAGATTGACAGAAAGAAACTCCACTATGTGAAAAATTATTACAATCACACAAAATTCTATCTCCGATCCCAATTATAAAAACACTCTCCCAAAACTTTATATCACTCACACTCTTTTTCCCACTCTCAAACAATACAAAGAGAGAATTTAATTAGAGATAGTACACTAAGGTTAAACTATTTCTAACTGGGATGCTTTGAAACCAAAGCCATAGGTTGATTTTATAGTACTTGAAGCTCATTCACGCACTTGCATCTTTTCGATGTGGGACAATTTAACTTCTAATATATTTTCAAAAATCCTACAAGTTAAATGGTATATTTACCATTGGTTTTAATGTATGATAAAATATATTAGAAATATCACATGCAAGACACGCATTAAAATCCAAGCAGAATACTTGAGTGTAGCCCAGTTGTGGTCAAGTATTCCTCTTTGGTTAGTACATAATCATAAATCACGATTTTCATGTGTGGCTCTACATGTTTAGAAGGAATTTTTTTTTATTTTTTATTTTTGCTCTTCCCAATTGAATATTATTTCGGCCAAAAACATTATTTTGAAATAACCCAACCTAAAATTTTTTTCCAGTCTAACTTCTTAGTTGAAGTTAATCATAGTTAGATGTGATATTGGTCATTTTAACATGAATATGTGTGTGTGTTCTTATCTATATATCTATATATCTATATTTATTTAATTTATATAATATCTATCTAATATATAGAGCCAACAACTATTTAAGATTAGAGGGATTGTTAAATGATTATTTATTTAAAAAATAATTAAAACTATCCTAACATGTCACCTAAACTTTTTAGATTAAGTAGTAATGTAACAAGTAAGGTTCAATAAAATCATAATTTCAGAAATATCATAAGGGCGAAGGTGAAACATAAACGTTTAGGAAAACAACAATAAAATAACCATAATATAGACATGAATAATTTGACGTTTTTCTTCGACACGATAAAAAAAAAAAAAAGTTGATGAATAATATGGAAACTAAATATGCATGTGTTTTGGTGATATAGATCTTGAGCAATGGGAAGTACAAAAGTGTTTTACACAAAGCAATAGTGAACAACAAAGCCACAAGGATGTCTATAGCAATGGCAGTTGGACCATCAGCTGAAACAGTGGTTGGTCCTGTTCCAGAATTGGTTCATCAACAAACCAATCCTCCTTTGTTTAAGAACATCAAATACAAAGATTATTTGGAAACCATGCAGAATGGCAAACTCCAAAACAAATCTACCTTGGATCGTGTTCGTCTACTTTGATGTCGTGTTTTTTAATATTCGAATTTTTAATTATATTTTATTAGTATAAATAGACTATTAGGTCATCTTTTTCTAAGTCATGCAACTATATTCATATCCAACTCTACTATTT

mRNA sequence

ATTGGAAGAAGAAGCTGCCAATGGCTTCAGTCCAGAAGATTGCCAGCGTCAAATCCATTGCTGAAACACCTAACTTAGCCTCCATTCCCTCCACTTACATCTTCTCCGCCGCCGCCGGTGAAGAAACTGCTCCACAAGCCATGGAAGATTCAATCCCCACCGTTGATTTCGCTCTGCTCACAGCCGGTACCCCCGATCAACGGTCCAAAGTCGTCGACGAGCTTGGCAAGGCCTGTCGAGACTGGGGCTTCTTCATGGTGATCAATCATGGAATAGCAGAGAGGGTGAGGAAGGAGATGATGGATTGTTGTAAAGAGTTTTTTGATCTGAGGGAGGAGGAGAAGAGAGTGTACGAAACAAAGCATGTACTCGACCCCATACGATACGGCACCAGCTTCAATCCTCACATAGAGAAAGTGTTGTTATGGAGAGATTATCTCAAAATCATGGTTCATCCTAACTTTCATTCTCCAACTAAACCCCCAACTTTCAGAGAGATCTCGAACGAGTACTGCAGAAGAGTTCGAGAAATGGCAAGAGAGTTGATGAGAGGGATCTCAGAGAGCTTGGGATTAGAAAGGTGCAATTTGGAGAGGGCAGTGAATTGGGAAAAATGTTCACAAATTCTTATTGCAAACTTATATCCACCATGTCCACAGCCAGAGCTTGCAATGGGTTTGTCCCCTCACTCTGATCATTGCCTTATCACTGTTCTTCTTCAAAACCAAATTGATGGCCTTCAAATCTTGCATGATGGTAAGTGGGTGAATGTCAATCCCATTCCCAACTCATTTTTGGTCAACACTGCTGATCAACTTGAGATCTTGAGCAATGGGAAGTACAAAAGTGTTTTACACAAAGCAATAGTGAACAACAAAGCCACAAGGATGTCTATAGCAATGGCAGTTGGACCATCAGCTGAAACAGTGGTTGGTCCTGTTCCAGAATTGGTTCATCAACAAACCAATCCTCCTTTGTTTAAGAACATCAAATACAAAGATTATTTGGAAACCATGCAGAATGGCAAACTCCAAAACAAATCTACCTTGGATCGTGTTCGTCTACTTTGATGTCGTGTTTTTTAATATTCGAATTTTTAATTATATTTTATTAGTATAAATAGACTATTAGGTCATCTTTTTCTAAGTCATGCAACTATATTCATATCCAACTCTACTATTT

Coding sequence (CDS)

ATGGCTTCAGTCCAGAAGATTGCCAGCGTCAAATCCATTGCTGAAACACCTAACTTAGCCTCCATTCCCTCCACTTACATCTTCTCCGCCGCCGCCGGTGAAGAAACTGCTCCACAAGCCATGGAAGATTCAATCCCCACCGTTGATTTCGCTCTGCTCACAGCCGGTACCCCCGATCAACGGTCCAAAGTCGTCGACGAGCTTGGCAAGGCCTGTCGAGACTGGGGCTTCTTCATGGTGATCAATCATGGAATAGCAGAGAGGGTGAGGAAGGAGATGATGGATTGTTGTAAAGAGTTTTTTGATCTGAGGGAGGAGGAGAAGAGAGTGTACGAAACAAAGCATGTACTCGACCCCATACGATACGGCACCAGCTTCAATCCTCACATAGAGAAAGTGTTGTTATGGAGAGATTATCTCAAAATCATGGTTCATCCTAACTTTCATTCTCCAACTAAACCCCCAACTTTCAGAGAGATCTCGAACGAGTACTGCAGAAGAGTTCGAGAAATGGCAAGAGAGTTGATGAGAGGGATCTCAGAGAGCTTGGGATTAGAAAGGTGCAATTTGGAGAGGGCAGTGAATTGGGAAAAATGTTCACAAATTCTTATTGCAAACTTATATCCACCATGTCCACAGCCAGAGCTTGCAATGGGTTTGTCCCCTCACTCTGATCATTGCCTTATCACTGTTCTTCTTCAAAACCAAATTGATGGCCTTCAAATCTTGCATGATGGTAAGTGGGTGAATGTCAATCCCATTCCCAACTCATTTTTGGTCAACACTGCTGATCAACTTGAGATCTTGAGCAATGGGAAGTACAAAAGTGTTTTACACAAAGCAATAGTGAACAACAAAGCCACAAGGATGTCTATAGCAATGGCAGTTGGACCATCAGCTGAAACAGTGGTTGGTCCTGTTCCAGAATTGGTTCATCAACAAACCAATCCTCCTTTGTTTAAGAACATCAAATACAAAGATTATTTGGAAACCATGCAGAATGGCAAACTCCAAAACAAATCTACCTTGGATCGTGTTCGTCTACTTTGA

Protein sequence

MASVQKIASVKSIAETPNLASIPSTYIFSAAAGEETAPQAMEDSIPTVDFALLTAGTPDQRSKVVDELGKACRDWGFFMVINHGIAERVRKEMMDCCKEFFDLREEEKRVYETKHVLDPIRYGTSFNPHIEKVLLWRDYLKIMVHPNFHSPTKPPTFREISNEYCRRVREMARELMRGISESLGLERCNLERAVNWEKCSQILIANLYPPCPQPELAMGLSPHSDHCLITVLLQNQIDGLQILHDGKWVNVNPIPNSFLVNTADQLEILSNGKYKSVLHKAIVNNKATRMSIAMAVGPSAETVVGPVPELVHQQTNPPLFKNIKYKDYLETMQNGKLQNKSTLDRVRLL
Homology
BLAST of Tan0016231 vs. ExPASy Swiss-Prot
Match: Q6Z244 (2-oxoglutarate-dependent dioxygenase 19 OS=Oryza sativa subsp. japonica OX=39947 GN=2ODD19 PE=1 SV=1)

HSP 1 Score: 307.0 bits (785), Expect = 2.7e-82
Identity = 155/329 (47.11%), Postives = 207/329 (62.92%), Query Frame = 0

Query: 22  IPSTYIFSAAAGEETAPQAMEDSIPTVDFALLTAGTPDQRSKVVDELGKACRDWGFFMVI 81
           +PS    SAAA  + +    +  IP VD  +L  G  D+RS+ + +LG+AC DWGFFMV 
Sbjct: 7   LPSHEEQSAAAAADGSATPSQ-GIPVVDLGVLINGAADERSRAIRDLGRACEDWGFFMVT 66

Query: 82  NHGIAERVRKEMMDCCKEFFDLREEEKRVYETKHVLDPIRYGTSFNPHIEKVLLWRDYLK 141
           NHG+ E +R+ +MD CKE F L  EEK+ Y     +DPIR GT F   ++ V   RDYLK
Sbjct: 67  NHGVPEALREAIMDACKELFRLPLEEKKEYMRAKPMDPIRIGTGFYSVVDAVPCRRDYLK 126

Query: 142 IMVHPNFHSPTKPPTFREISNEYCRRVREMARELMRGISESLGLERCNLERAVNWEKCSQ 201
           +  HP FH P KP   REI+ EY    R +  EL + ISESLGL    L  A+N E C Q
Sbjct: 127 MFSHPEFHCPEKPAKLREIATEYATCTRALLLELTKAISESLGLAGGRLSEALNLESCFQ 186

Query: 202 ILIANLYPPCPQP-ELAMGLSPHSDHCLITVLLQNQIDGLQILHDGKWVNVNPIPNSFLV 261
           IL+ N YP C +P E AMGLS HSDH L+T+L QN +DGLQ+ HDG+W+   P+P SF V
Sbjct: 187 ILVGNHYPACSRPDEQAMGLSAHSDHGLLTLLFQNGVDGLQVKHDGEWLLAKPLPGSFFV 246

Query: 262 NTADQLEILSNGKYKSVLHKAIVNNKATRMSIAMAVGPSAETVVGPVPELVHQQTNPPLF 321
              DQLEI++NG+YK VLH+A+V  + +RMS    +GP  +TVV P+PE+         F
Sbjct: 247 IAGDQLEIVTNGRYKGVLHRAVVGGEQSRMSFVSLIGPCMDTVVEPLPEMAADGRGLE-F 306

Query: 322 KNIKYKDYLETMQNGKLQNKSTLDRVRLL 350
           + I+Y+DY+E  Q+  +  K+ LD VR++
Sbjct: 307 RGIRYRDYMEMQQSNSINEKTALDIVRVM 333

BLAST of Tan0016231 vs. ExPASy Swiss-Prot
Match: Q9ZSA7 (Protein DMR6-LIKE OXYGENASE 2 OS=Arabidopsis thaliana OX=3702 GN=DLO2 PE=2 SV=1)

HSP 1 Score: 236.9 bits (603), Expect = 3.5e-61
Identity = 126/329 (38.30%), Postives = 191/329 (58.05%), Query Frame = 0

Query: 22  IPSTYIFSAAAGEETAP-QAMEDSIPTVDFALLTAGTPDQRSKVVDELGKACRDWGFFMV 81
           +PS Y+   +   + +  Q   DSIP +D   L       R+ ++++   AC   GFF +
Sbjct: 18  VPSNYVRPVSDRPKMSEVQTSGDSIPLIDLHDLHG---PNRADIINQFAHACSSCGFFQI 77

Query: 82  INHGIAERVRKEMMDCCKEFFDLREEEKRVYETKHVLDPIRYGTSFNPHIEKVLLWRDYL 141
            NHG+ E   K+MM+  +EFF   E E+  + +       R  TSFN   EKV  WRD+L
Sbjct: 78  KNHGVPEETIKKMMNAAREFFRQSESERVKHYSADTKKTTRLSTSFNVSKEKVSNWRDFL 137

Query: 142 KIMVHP--NFHS--PTKPPTFREISNEYCRRVREMARELMRGISESLGLERCNLERAVNW 201
           ++  +P  +F +  P+ P +FRE++ EY   VR +   L+  ISESLGL +  +   +  
Sbjct: 138 RLHCYPIEDFINEWPSTPISFREVTAEYATSVRALVLTLLEAISESLGLAKDRVSNTIG- 197

Query: 202 EKCSQILIANLYPPCPQPELAMGLSPHSDHCLITVLLQNQIDGLQILHDGKWVNVNPIPN 261
            K  Q +  N YP CPQPEL  GL  H D  LITVLLQ+++ GLQ+  DGKW+ VNP+PN
Sbjct: 198 -KHGQHMAINYYPRCPQPELTYGLPGHKDANLITVLLQDEVSGLQVFKDGKWIAVNPVPN 257

Query: 262 SFLVNTADQLEILSNGKYKSVLHKAIVNNKATRMSIAMAVGPSAETVVGPVPELVH-QQT 321
           +F+VN  DQ++++SN KYKSVLH+A+VN+   R+SI     PS + V+ P  EL++ ++ 
Sbjct: 258 TFIVNLGDQMQVISNEKYKSVLHRAVVNSDMERISIPTFYCPSEDAVISPAQELINEEED 317

Query: 322 NPPLFKNIKYKDYLETMQNGKLQNKSTLD 345
           +P +++N  Y +Y E   +     +S +D
Sbjct: 318 SPAIYRNFTYAEYFEKFWDTAFDTESCID 341

BLAST of Tan0016231 vs. ExPASy Swiss-Prot
Match: Q9ZSA8 (Protein DMR6-LIKE OXYGENASE 1 OS=Arabidopsis thaliana OX=3702 GN=DLO1 PE=1 SV=1)

HSP 1 Score: 236.1 bits (601), Expect = 6.0e-61
Identity = 130/339 (38.35%), Postives = 192/339 (56.64%), Query Frame = 0

Query: 10  VKSIAETPNLASIPSTYIFSAAAGEETAPQAMEDSIPTVDFALLTAGTPDQRSKVVDELG 69
           V+ I++ PNL+ + S+                 DSIP +D   L       R+ +V +L 
Sbjct: 25  VRPISDRPNLSEVESS----------------GDSIPLIDLRDLHG---PNRAVIVQQLA 84

Query: 70  KACRDWGFFMVINHGIAERVRKEMMDCCKEFFDLREEEKRVYETKHVLDPIRYGTSFNPH 129
            AC  +GFF + NHG+ +    +M    +EFF   E E+  + +       R  TSFN  
Sbjct: 85  SACSTYGFFQIKNHGVPDTTVNKMQTVAREFFHQPESERVKHYSADPTKTTRLSTSFNVG 144

Query: 130 IEKVLLWRDYLKIMVHP--NF--HSPTKPPTFREISNEYCRRVREMARELMRGISESLGL 189
            +KVL WRD+L++   P  +F    P+ P +FRE++ EY   VR +   L+  ISESLGL
Sbjct: 145 ADKVLNWRDFLRLHCFPIEDFIEEWPSSPISFREVTAEYATSVRALVLRLLEAISESLGL 204

Query: 190 ERCNLERAVNWEKCSQILIANLYPPCPQPELAMGLSPHSDHCLITVLLQNQIDGLQILHD 249
           E  ++   +   K +Q +  N YPPCP+PEL  GL  H D  +ITVLLQ+Q+ GLQ+  D
Sbjct: 205 ESDHISNILG--KHAQHMAFNYYPPCPEPELTYGLPGHKDPTVITVLLQDQVSGLQVFKD 264

Query: 250 GKWVNVNPIPNSFLVNTADQLEILSNGKYKSVLHKAIVNNKATRMSIAMAVGPSAETVVG 309
            KWV V+PIPN+F+VN  DQ++++SN KYKSVLH+A+VN +  R+SI     PS + V+G
Sbjct: 265 DKWVAVSPIPNTFIVNIGDQMQVISNDKYKSVLHRAVVNTENERLSIPTFYFPSTDAVIG 324

Query: 310 PVPELVHQQTNPPLFKNIKYKDYLETMQNGKLQNKSTLD 345
           P  ELV++Q +  +++   + +Y +   N  L   S LD
Sbjct: 325 PAHELVNEQDSLAIYRTYPFVEYWDKFWNRSLATASCLD 342

BLAST of Tan0016231 vs. ExPASy Swiss-Prot
Match: Q6YYX9 (Probable 2-oxoglutarate-dependent dioxygenase SLC1 OS=Oryza sativa subsp. japonica OX=39947 GN=SLC1 PE=2 SV=1)

HSP 1 Score: 224.2 bits (570), Expect = 2.3e-57
Identity = 122/339 (35.99%), Postives = 190/339 (56.05%), Query Frame = 0

Query: 19  LASIPSTYIFSAA-----AGEETAPQAMEDSIPTVDFALLTAGTPDQRSKVVDELGKACR 78
           +  +P  Y+  A+     A    A       +P VD + L    P +R  V+  L  ACR
Sbjct: 49  ITRLPGNYVLPASDRPGQAAGAAAAAGGSVKLPVVDLSRLR--VPSERGAVLRTLDAACR 108

Query: 79  DWGFFMVINHGIAERVRKEMMDCCKEFFDLREEEKRVYETKHVLDPIRYGTSFNPHIEKV 138
           ++GFF V+NHG+   V   M+D  + FF+L + E+  Y +  V  P+RYGTSFN   + V
Sbjct: 109 EYGFFQVVNHGVGGEVVGGMLDVARRFFELPQPERERYMSADVRAPVRYGTSFNQVRDAV 168

Query: 139 LLWRDYLKIMVHPNF----HSPTKPPTFREISNEYCRRVREMARELMRGISESLGLERCN 198
           L WRD+LK+   P        PT P   RE+++ Y    + +  E+M    E+LG+    
Sbjct: 169 LCWRDFLKLACMPLAAVVESWPTSPADLREVASRYAEANQRVFMEVMEAALEALGVGGGG 228

Query: 199 LERAVNWEKCSQILIANLYPPCPQPELAMGLSPHSDHCLITVLLQNQIDGLQILHDGKWV 258
           +    +    +Q++  N YP CPQPEL +G+ PHSD+  +T++LQ+++ GLQ++H G+W+
Sbjct: 229 VME--DLAAGTQMMTVNCYPECPQPELTLGMPPHSDYGFLTLVLQDEVAGLQVMHAGEWL 288

Query: 259 NVNPIPNSFLVNTADQLEILSNGKYKSVLHKAIVNNKATRMSIAMAVGPSAETVVGPVPE 318
            V+P+P SF+VN  D LEILSNG+Y+SVLH+  VN++  R+S+A     + E VV P PE
Sbjct: 289 TVDPLPGSFVVNVGDHLEILSNGRYRSVLHRVKVNSRRLRVSVASFHSVAPERVVSPAPE 348

Query: 319 LVHQQTNPPLFKNIKYKDYLETMQNGKLQNKSTLDRVRL 349
           L+  + +P  + +     +L  + +    +KS L   RL
Sbjct: 349 LIDDR-HPRRYMDTDLATFLAYLASAAGNHKSFLHSRRL 382

BLAST of Tan0016231 vs. ExPASy Swiss-Prot
Match: Q8W2X5 (Flavanone 3-dioxygenase 2 OS=Oryza sativa subsp. japonica OX=39947 GN=F3H-2 PE=1 SV=1)

HSP 1 Score: 221.1 bits (562), Expect = 2.0e-56
Identity = 131/349 (37.54%), Postives = 189/349 (54.15%), Query Frame = 0

Query: 14  AETPNLASIPSTYIFSAAAGEETAPQAM---------EDSIPTVDFALLTAGTPDQRSKV 73
           AE      + ST +     G+   P++          +  IP VD A     +PD R+ V
Sbjct: 3   AEAEQQHQLLSTAVHDTMPGKYVRPESQRPRLDLVVSDARIPVVDLA-----SPD-RAAV 62

Query: 74  VDELGKACRDWGFFMVINHGIAERVRKEMMDCCKEFFDLREEEKRVYETKHVLDPIRYGT 133
           V  +G ACR  GFF V+NHGI   +   +M+  +EFF L  EEK    +      IR  T
Sbjct: 63  VSAVGDACRTHGFFQVVNHGIDAALIASVMEVGREFFRLPAEEKAKLYSDDPAKKIRLST 122

Query: 134 SFNPHIEKVLLWRDYLKIMVHPNFHS-----PTKPPTFREISNEYCRRVREMARELMRGI 193
           SFN   E V  WRDYL++  +P  H      P+ PP+F+EI   YC  VRE+   L   I
Sbjct: 123 SFNVRKETVHNWRDYLRLHCYP-LHQFVPDWPSNPPSFKEIIGTYCTEVRELGFRLYEAI 182

Query: 194 SESLGLERCNLERAVNWEKCSQILIANLYPPCPQPELAMGLSPHSDHCLITVLL-QNQID 253
           SESLGLE   +   +  ++  Q +  N YP CP+PEL  GL  H+D   +T+LL  +Q+ 
Sbjct: 183 SESLGLEGGYMRETLGEQE--QHMAVNYYPQCPEPELTYGLPAHTDPNALTILLMDDQVA 242

Query: 254 GLQILHDGKWVNVNPIPNSFLVNTADQLEILSNGKYKSVLHKAIVNNKATRMSIAMAVGP 313
           GLQ+L+DGKW+ VNP P + ++N  DQL+ LSNGKY+SV H+A+VN+   RMS+A  + P
Sbjct: 243 GLQVLNDGKWIAVNPQPGALVINIGDQLQALSNGKYRSVWHRAVVNSDRERMSVASFLCP 302

Query: 314 SAETVVGPVPELVHQQTNPPLFKNIKYKDYLETMQNGKLQNKSTLDRVR 348
                +GP  +L+    +P +++N  Y +Y +   +  L  +  L+  R
Sbjct: 303 CNSVELGPAKKLI-TDDSPAVYRNYTYDEYYKKFWSRNLDQEHCLELFR 341

BLAST of Tan0016231 vs. NCBI nr
Match: XP_038886412.1 (protein DMR6-LIKE OXYGENASE 2-like [Benincasa hispida])

HSP 1 Score: 595.1 bits (1533), Expect = 3.9e-166
Identity = 287/350 (82.00%), Postives = 324/350 (92.57%), Query Frame = 0

Query: 1   MASVQKIASVKSIAETPNLASIPSTYIFSAAAGEETAPQAME-DSIPTVDFALLTAGTPD 60
           MAS++K+ASVKSIAETPNLASIPS+YIFS +  ++TAP AM+ DSIPT+DF LLT+GTPD
Sbjct: 1   MASLEKMASVKSIAETPNLASIPSSYIFSGSPDDKTAPIAMDNDSIPTIDFVLLTSGTPD 60

Query: 61  QRSKVVDELGKACRDWGFFMVINHGIAERVRKEMMDCCKEFFDLREEEKRVYETKHVLDP 120
           QRSKVVDELGKACRDWGFFMVINHG+AE++R+EMM+CC+EF+DL EEEKRVYET+HVLDP
Sbjct: 61  QRSKVVDELGKACRDWGFFMVINHGVAEKLREEMMECCEEFYDLTEEEKRVYETEHVLDP 120

Query: 121 IRYGTSFNPHIEKVLLWRDYLKIMVHPNFHSPTKPPTFREISNEYCRRVREMARELMRGI 180
           IRYGTSFNPH+EKV LWRDYLKIMVHPNFH P+KP  FREIS EYC RVREM RELMRGI
Sbjct: 121 IRYGTSFNPHVEKVFLWRDYLKIMVHPNFHFPSKPLKFREISKEYCERVREMGRELMRGI 180

Query: 181 SESLGLERCNLERAVNWEKCSQILIANLYPPCPQPELAMGLSPHSDHCLITVLLQNQIDG 240
           SESLGLER +LER VNWE+CSQILIANLYPPCPQPELAMGLSPHSDHCL+TVLLQNQIDG
Sbjct: 181 SESLGLERLDLERIVNWEECSQILIANLYPPCPQPELAMGLSPHSDHCLLTVLLQNQIDG 240

Query: 241 LQILHDGKWVNVNPIPNSFLVNTADQLEILSNGKYKSVLHKAIVNNKATRMSIAMAVGPS 300
           LQILHD KWVNVNPIPNSFLVNTADQLEILSNG+YKSVLH+A+VN K+ RMS+A+A+GPS
Sbjct: 241 LQILHDHKWVNVNPIPNSFLVNTADQLEILSNGEYKSVLHRAVVNEKSKRMSLAVAIGPS 300

Query: 301 AETVVGPVPELVHQQTNPPLFKNIKYKDYLETMQNGKLQNKSTLDRVRLL 350
           A+T+V P P+L+   T+PPLFK+IKYKDYLETMQ+GKL NKSTLD VRLL
Sbjct: 301 AQTLVAPAPQLL--LTHPPLFKHIKYKDYLETMQSGKLHNKSTLDCVRLL 348

BLAST of Tan0016231 vs. NCBI nr
Match: XP_022149149.1 (protein DMR6-LIKE OXYGENASE 1-like [Momordica charantia])

HSP 1 Score: 554.7 bits (1428), Expect = 5.8e-154
Identity = 264/344 (76.74%), Postives = 301/344 (87.50%), Query Frame = 0

Query: 7   IASVKSIAETPNLASIPSTYIFSAAAGE-ETAPQAMEDSIPTVDFALLTAGTPDQRSKVV 66
           +ASVK+IA+TPNL SIPS+Y+FSA  G  + APQ +EDSIPTVDF+LLT GTPDQR++VV
Sbjct: 30  MASVKTIAQTPNLTSIPSSYVFSADGGAVDAAPQGVEDSIPTVDFSLLTMGTPDQRAEVV 89

Query: 67  DELGKACRDWGFFMVINHGIAERVRKEMMDCCKEFFDLREEEKRVYETKHVLDPIRYGTS 126
           D+LGKAC+DWGFFMVINHG+AE V  EM+D C+EFFDL EEEKR YETKHVLDPIRYGTS
Sbjct: 90  DQLGKACQDWGFFMVINHGVAETVMTEMVDICREFFDLEEEEKREYETKHVLDPIRYGTS 149

Query: 127 FNPHIEKVLLWRDYLKIMVHPNFHSPTKPPTFREISNEYCRRVREMARELMRGISESLGL 186
           FNP +EKV  WRDYLKIMVHPNFH+PTKP  FREIS EYCRRVRE+AREL RGISESLGL
Sbjct: 150 FNPKMEKVFFWRDYLKIMVHPNFHAPTKPSRFREISEEYCRRVRELARELARGISESLGL 209

Query: 187 ERCNLERAVNWEKCSQILIANLYPPCPQPELAMGLSPHSDHCLITVLLQNQIDGLQILHD 246
           E C LE+AVN + CSQIL+ NLYPPCPQ E+AMGL PHSDHCL+T +LQN+I GLQILH 
Sbjct: 210 EGCTLEKAVNLKSCSQILVGNLYPPCPQAEVAMGLPPHSDHCLLTFILQNKICGLQILHQ 269

Query: 247 GKWVNVNPIPNSFLVNTADQLEILSNGKYKSVLHKAIVNNKATRMSIAMAVGPSAETVVG 306
           GKWVNVNPIPNSFLVNTADQLEI SNGKYKS+LH+AIVN KATRMS+ +A+GPS +TVVG
Sbjct: 270 GKWVNVNPIPNSFLVNTADQLEIFSNGKYKSLLHRAIVNKKATRMSVGIAIGPSLDTVVG 329

Query: 307 PVPELVHQQTNPPLFKNIKYKDYLETMQNGKLQNKSTLDRVRLL 350
           P PELVHQ TNPPL+K+IKYKDY+E +Q   LQ+KS LD VRL+
Sbjct: 330 PAPELVHQLTNPPLYKHIKYKDYMELVQTNDLQHKSMLDLVRLV 373

BLAST of Tan0016231 vs. NCBI nr
Match: XP_022141027.1 (protein DMR6-LIKE OXYGENASE 1-like isoform X1 [Momordica charantia] >XP_022159285.1 protein DMR6-LIKE OXYGENASE 1-like isoform X1 [Momordica charantia])

HSP 1 Score: 521.2 bits (1341), Expect = 7.1e-144
Identity = 253/344 (73.55%), Postives = 295/344 (85.76%), Query Frame = 0

Query: 7   IASVKSIAETPNLASIPSTYIFSAAAG-EETAPQAMEDSIPTVDFALLTAGTPDQRSKVV 66
           +ASVK+IAETPNLASIPS+YIFSA  G  + AP+ +EDSIPT+DF+LLT GTPDQR+KVV
Sbjct: 6   VASVKTIAETPNLASIPSSYIFSANDGAAKAAPRGVEDSIPTIDFSLLTMGTPDQRAKVV 65

Query: 67  DELGKACRDWGFFMVINHGIAERVRKEMMDCCKEFFDLREEEKRVYETKHVLDPIRYGTS 126
           DELGKAC+DWGFF+V+NHG+ ERV +EM+D C+EFFDL EEEK  Y+T+HVLDPIRYGTS
Sbjct: 66  DELGKACQDWGFFVVMNHGVEERVMREMIDICREFFDLTEEEKTEYKTEHVLDPIRYGTS 125

Query: 127 FNPHIEKVLLWRDYLKIMVHPNFHSPTKPPTFREISNEYCRRVREMARELMRGISESLGL 186
           FNP  EKVL WRDYLKI VHP FHSPTKPP FREI  EYC+R  EMAREL+RGISESLGL
Sbjct: 126 FNPQKEKVLFWRDYLKIFVHPKFHSPTKPPRFREILEEYCKRSIEMARELVRGISESLGL 185

Query: 187 ERCNLERAVNWEKCSQILIANLYPPCPQPELAMGLSPHSDHCLITVLLQNQIDGLQILHD 246
           E C+LERAV++E CS +  AN YPP PQPELA GL  HSD CL+T+LLQNQI GLQILH 
Sbjct: 186 ETCHLERAVDFESCSTLFAANFYPPYPQPELARGLPSHSDQCLLTLLLQNQISGLQILHQ 245

Query: 247 GKWVNVNPIPNSFLVNTADQLEILSNGKYKSVLHKAIVNNKATRMSIAMAVGPSAETVVG 306
           G WV+VNPIPNSFLVN ADQLEILSNGKYKSVLH+A+VNNKATR+SIA+AVG S ETVV 
Sbjct: 246 GNWVDVNPIPNSFLVNVADQLEILSNGKYKSVLHRAMVNNKATRLSIAVAVGSSPETVVS 305

Query: 307 PVPELVHQQTNPPLFKNIKYKDYLETMQNGKLQNKSTLDRVRLL 350
           P P+L+++ TNPPLFK+IKY DY++ +Q+  LQ+ S LDR+RLL
Sbjct: 306 PAPQLLNKLTNPPLFKHIKYTDYMQMVQSSNLQD-SCLDRIRLL 348

BLAST of Tan0016231 vs. NCBI nr
Match: KAG6576847.1 (2-oxoglutarate-dependent dioxygenase 19, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 517.3 bits (1331), Expect = 1.0e-142
Identity = 244/350 (69.71%), Postives = 294/350 (84.00%), Query Frame = 0

Query: 7   IASVKSIAETPNLASIPSTYIFS-------AAAGEETAPQAMEDSIPTVDFALLTAGTPD 66
           + SVK+IAETPNLASIPS+YIF+        A   + AP ++EDSIPT+DF+LLT GTP 
Sbjct: 3   LPSVKAIAETPNLASIPSSYIFTTSDDSDDVATTADAAPHSVEDSIPTIDFSLLTTGTPH 62

Query: 67  QRSKVVDELGKACRDWGFFMVINHGIAERVRKEMMDCCKEFFDLREEEKRVYETKHVLDP 126
           QRSKVVDELGKAC DWGFFMVINHG+ E + KEM++ C+EFFDL EEEKRVYETKHVLDP
Sbjct: 63  QRSKVVDELGKACHDWGFFMVINHGVGEGLMKEMVEICREFFDLTEEEKRVYETKHVLDP 122

Query: 127 IRYGTSFNPHIEKVLLWRDYLKIMVHPNFHSPTKPPTFREISNEYCRRVREMARELMRGI 186
           IRYGTSFNP +E+V  WRDYLK++VHP FHSP+KP  FREI  EY +R++EMAREL+RGI
Sbjct: 123 IRYGTSFNPKMEEVFFWRDYLKVLVHPKFHSPSKPTRFREILEEYSKRIKEMARELVRGI 182

Query: 187 SESLGLERCNLERAVNWEKCSQILIANLYPPCPQPELAMGLSPHSDHCLITVLLQNQIDG 246
           SESLGLE CNLE+A + E  S +  ANLYPPCPQP+LA GL PHSD CL+TVLLQN + G
Sbjct: 183 SESLGLEACNLEKAADLESSSTLFAANLYPPCPQPQLARGLPPHSDQCLLTVLLQNHVAG 242

Query: 247 LQILHDGKWVNVNPIPNSFLVNTADQLEILSNGKYKSVLHKAIVNNKATRMSIAMAVGPS 306
           LQILHD +W+ VNP+PN  LVNTADQLEI+SNGKYKSVLH+A+VN+KATR+SIAMA+GPS
Sbjct: 243 LQILHDHQWLTVNPVPNVLLVNTADQLEIMSNGKYKSVLHRAMVNDKATRISIAMAIGPS 302

Query: 307 AETVVGPVPELVHQQTNPPLFKNIKYKDYLETMQNGKLQNKSTLDRVRLL 350
           ++T+V P+PEL+H+  +PPLFK+I YKDY+E +Q+ KL+ KS LDRVRLL
Sbjct: 303 SQTLVAPLPELLHKHNSPPLFKSIMYKDYMEMLQSNKLEGKSCLDRVRLL 352

BLAST of Tan0016231 vs. NCBI nr
Match: XP_038904929.1 (protein DMR6-LIKE OXYGENASE 2-like [Benincasa hispida])

HSP 1 Score: 516.2 bits (1328), Expect = 2.3e-142
Identity = 247/346 (71.39%), Postives = 290/346 (83.82%), Query Frame = 0

Query: 7   IASVKSIAETPNLASIPSTYIFSAAAGEET---APQAMEDSIPTVDFALLTAGTPDQRSK 66
           +ASVKS+A+TPNLAS+PS+++F+     ++     Q  EDSIP +D +LL  GTP QR+K
Sbjct: 3   MASVKSLADTPNLASVPSSFMFATDDDSDSVTAVSQGAEDSIPIIDTSLLIDGTPHQRAK 62

Query: 67  VVDELGKACRDWGFFMVINHGIAERVRKEMMDCCKEFFDLREEEKRVYETKHVLDPIRYG 126
           V++ELGKAC DWGFFMV+NHG+ ER+ +EM++ CKEFFDL+EEEKR YETK VLDPIRYG
Sbjct: 63  VINELGKACEDWGFFMVVNHGVGERLMREMVEICKEFFDLKEEEKREYETKSVLDPIRYG 122

Query: 127 TSFNPHIEKVLLWRDYLKIMVHPNFHSPTKPPTFREISNEYCRRVREMARELMRGISESL 186
           TSFNP +EKV  WRDYLKIMVHPNFHSPTKP  FREI  EYC+R+REM REL+RGISESL
Sbjct: 123 TSFNPKVEKVFFWRDYLKIMVHPNFHSPTKPTRFREILEEYCKRIREMTRELVRGISESL 182

Query: 187 GLERCNLERAVNWEKCSQILIANLYPPCPQPELAMGLSPHSDHCLITVLLQNQIDGLQIL 246
           GLE C L++A N E CS +  ANLYPPCPQPELA GL  HSDHCL+T+LLQNQI GLQIL
Sbjct: 183 GLEECCLDKAANLESCSIVFAANLYPPCPQPELARGLPSHSDHCLLTILLQNQIAGLQIL 242

Query: 247 HDGKWVNVNPIPNSFLVNTADQLEILSNGKYKSVLHKAIVNNKATRMSIAMAVGPSAETV 306
           H  KW+NVNPIPNS LVN ADQLEILSNGKYKSVLH+AIVN+KATR+SIAMAVGPS ETV
Sbjct: 243 HHDKWLNVNPIPNSLLVNVADQLEILSNGKYKSVLHRAIVNDKATRISIAMAVGPSLETV 302

Query: 307 VGPVPELVHQQT-NPPLFKNIKYKDYLETMQNGKLQNKSTLDRVRL 349
           VGP P+L+++ T NPPLFKNIKY DY+  +Q+ KLQ KSTLDR+RL
Sbjct: 303 VGPAPQLINKHTNNPPLFKNIKYIDYMGIVQSNKLQGKSTLDRIRL 348

BLAST of Tan0016231 vs. ExPASy TrEMBL
Match: A0A6J1D661 (protein DMR6-LIKE OXYGENASE 1-like OS=Momordica charantia OX=3673 GN=LOC111017638 PE=3 SV=1)

HSP 1 Score: 554.7 bits (1428), Expect = 2.8e-154
Identity = 264/344 (76.74%), Postives = 301/344 (87.50%), Query Frame = 0

Query: 7   IASVKSIAETPNLASIPSTYIFSAAAGE-ETAPQAMEDSIPTVDFALLTAGTPDQRSKVV 66
           +ASVK+IA+TPNL SIPS+Y+FSA  G  + APQ +EDSIPTVDF+LLT GTPDQR++VV
Sbjct: 30  MASVKTIAQTPNLTSIPSSYVFSADGGAVDAAPQGVEDSIPTVDFSLLTMGTPDQRAEVV 89

Query: 67  DELGKACRDWGFFMVINHGIAERVRKEMMDCCKEFFDLREEEKRVYETKHVLDPIRYGTS 126
           D+LGKAC+DWGFFMVINHG+AE V  EM+D C+EFFDL EEEKR YETKHVLDPIRYGTS
Sbjct: 90  DQLGKACQDWGFFMVINHGVAETVMTEMVDICREFFDLEEEEKREYETKHVLDPIRYGTS 149

Query: 127 FNPHIEKVLLWRDYLKIMVHPNFHSPTKPPTFREISNEYCRRVREMARELMRGISESLGL 186
           FNP +EKV  WRDYLKIMVHPNFH+PTKP  FREIS EYCRRVRE+AREL RGISESLGL
Sbjct: 150 FNPKMEKVFFWRDYLKIMVHPNFHAPTKPSRFREISEEYCRRVRELARELARGISESLGL 209

Query: 187 ERCNLERAVNWEKCSQILIANLYPPCPQPELAMGLSPHSDHCLITVLLQNQIDGLQILHD 246
           E C LE+AVN + CSQIL+ NLYPPCPQ E+AMGL PHSDHCL+T +LQN+I GLQILH 
Sbjct: 210 EGCTLEKAVNLKSCSQILVGNLYPPCPQAEVAMGLPPHSDHCLLTFILQNKICGLQILHQ 269

Query: 247 GKWVNVNPIPNSFLVNTADQLEILSNGKYKSVLHKAIVNNKATRMSIAMAVGPSAETVVG 306
           GKWVNVNPIPNSFLVNTADQLEI SNGKYKS+LH+AIVN KATRMS+ +A+GPS +TVVG
Sbjct: 270 GKWVNVNPIPNSFLVNTADQLEIFSNGKYKSLLHRAIVNKKATRMSVGIAIGPSLDTVVG 329

Query: 307 PVPELVHQQTNPPLFKNIKYKDYLETMQNGKLQNKSTLDRVRLL 350
           P PELVHQ TNPPL+K+IKYKDY+E +Q   LQ+KS LD VRL+
Sbjct: 330 PAPELVHQLTNPPLYKHIKYKDYMELVQTNDLQHKSMLDLVRLV 373

BLAST of Tan0016231 vs. ExPASy TrEMBL
Match: A0A6J1CIQ0 (protein DMR6-LIKE OXYGENASE 1-like isoform X1 OS=Momordica charantia OX=3673 GN=LOC111011533 PE=3 SV=1)

HSP 1 Score: 521.2 bits (1341), Expect = 3.4e-144
Identity = 253/344 (73.55%), Postives = 295/344 (85.76%), Query Frame = 0

Query: 7   IASVKSIAETPNLASIPSTYIFSAAAG-EETAPQAMEDSIPTVDFALLTAGTPDQRSKVV 66
           +ASVK+IAETPNLASIPS+YIFSA  G  + AP+ +EDSIPT+DF+LLT GTPDQR+KVV
Sbjct: 6   VASVKTIAETPNLASIPSSYIFSANDGAAKAAPRGVEDSIPTIDFSLLTMGTPDQRAKVV 65

Query: 67  DELGKACRDWGFFMVINHGIAERVRKEMMDCCKEFFDLREEEKRVYETKHVLDPIRYGTS 126
           DELGKAC+DWGFF+V+NHG+ ERV +EM+D C+EFFDL EEEK  Y+T+HVLDPIRYGTS
Sbjct: 66  DELGKACQDWGFFVVMNHGVEERVMREMIDICREFFDLTEEEKTEYKTEHVLDPIRYGTS 125

Query: 127 FNPHIEKVLLWRDYLKIMVHPNFHSPTKPPTFREISNEYCRRVREMARELMRGISESLGL 186
           FNP  EKVL WRDYLKI VHP FHSPTKPP FREI  EYC+R  EMAREL+RGISESLGL
Sbjct: 126 FNPQKEKVLFWRDYLKIFVHPKFHSPTKPPRFREILEEYCKRSIEMARELVRGISESLGL 185

Query: 187 ERCNLERAVNWEKCSQILIANLYPPCPQPELAMGLSPHSDHCLITVLLQNQIDGLQILHD 246
           E C+LERAV++E CS +  AN YPP PQPELA GL  HSD CL+T+LLQNQI GLQILH 
Sbjct: 186 ETCHLERAVDFESCSTLFAANFYPPYPQPELARGLPSHSDQCLLTLLLQNQISGLQILHQ 245

Query: 247 GKWVNVNPIPNSFLVNTADQLEILSNGKYKSVLHKAIVNNKATRMSIAMAVGPSAETVVG 306
           G WV+VNPIPNSFLVN ADQLEILSNGKYKSVLH+A+VNNKATR+SIA+AVG S ETVV 
Sbjct: 246 GNWVDVNPIPNSFLVNVADQLEILSNGKYKSVLHRAMVNNKATRLSIAVAVGSSPETVVS 305

Query: 307 PVPELVHQQTNPPLFKNIKYKDYLETMQNGKLQNKSTLDRVRLL 350
           P P+L+++ TNPPLFK+IKY DY++ +Q+  LQ+ S LDR+RLL
Sbjct: 306 PAPQLLNKLTNPPLFKHIKYTDYMQMVQSSNLQD-SCLDRIRLL 348

BLAST of Tan0016231 vs. ExPASy TrEMBL
Match: A0A6J1JCE3 (protein DMR6-LIKE OXYGENASE 2-like OS=Cucurbita maxima OX=3661 GN=LOC111483195 PE=3 SV=1)

HSP 1 Score: 508.1 bits (1307), Expect = 3.0e-140
Identity = 241/350 (68.86%), Postives = 289/350 (82.57%), Query Frame = 0

Query: 7   IASVKSIAETPNLASIPSTYIFS-------AAAGEETAPQAMEDSIPTVDFALLTAGTPD 66
           + SVK+IAETPNLASIPS+YIF+        AA  + AP  +E SIPT+DF+LLT GT  
Sbjct: 34  LPSVKAIAETPNLASIPSSYIFTTSDDSDDVAAAADAAPHRVEVSIPTIDFSLLTTGTSH 93

Query: 67  QRSKVVDELGKACRDWGFFMVINHGIAERVRKEMMDCCKEFFDLREEEKRVYETKHVLDP 126
           QRSKVV+ELGKAC DWGFFMVINHG+ E + KEM++ C+EFFDL+EEEKR YETKHVLDP
Sbjct: 94  QRSKVVNELGKACHDWGFFMVINHGVGEGLMKEMVEICREFFDLKEEEKREYETKHVLDP 153

Query: 127 IRYGTSFNPHIEKVLLWRDYLKIMVHPNFHSPTKPPTFREISNEYCRRVREMARELMRGI 186
           IRYGTSFNP +E+V  WRDYLK++VHP FHSP KP  FREI  EY +R+REMAREL+RGI
Sbjct: 154 IRYGTSFNPKMEEVFFWRDYLKVLVHPKFHSPPKPTRFREILEEYSKRIREMARELVRGI 213

Query: 187 SESLGLERCNLERAVNWEKCSQILIANLYPPCPQPELAMGLSPHSDHCLITVLLQNQIDG 246
           SESLGLE  NLE+A   E  S +  ANLYPPCPQP+ A GL PHSD CL+TVLLQN + G
Sbjct: 214 SESLGLEAYNLEKAAELESSSTLFAANLYPPCPQPQFARGLPPHSDQCLLTVLLQNHVSG 273

Query: 247 LQILHDGKWVNVNPIPNSFLVNTADQLEILSNGKYKSVLHKAIVNNKATRMSIAMAVGPS 306
           LQILHD +W+ VNP+PN+ LVNTADQLEI+SNGKYKSVLH+A+VN+KATR+SIAMA+GPS
Sbjct: 274 LQILHDHQWLTVNPVPNALLVNTADQLEIMSNGKYKSVLHRAMVNDKATRISIAMAIGPS 333

Query: 307 AETVVGPVPELVHQQTNPPLFKNIKYKDYLETMQNGKLQNKSTLDRVRLL 350
           ++T+V P+PEL+H+  NPPLFK IKYKDY+E +Q+ KL+ KS LDR+RLL
Sbjct: 334 SQTLVAPLPELLHKHNNPPLFKTIKYKDYMEMLQSNKLERKSCLDRLRLL 383

BLAST of Tan0016231 vs. ExPASy TrEMBL
Match: A0A0A0L0R1 (Fe2OG dioxygenase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_4G112650 PE=3 SV=1)

HSP 1 Score: 481.1 bits (1237), Expect = 3.9e-132
Identity = 234/348 (67.24%), Postives = 277/348 (79.60%), Query Frame = 0

Query: 7   IASVKSIAETPNLASIPSTYIFSA-----AAGEETAPQAMEDSIPTVDFALLTAGTPDQR 66
           +ASVKSIA++PNL SIPS++IF+          + + Q  EDSIP +D +LL  GTP QR
Sbjct: 1   MASVKSIADSPNLTSIPSSFIFATDDSFDDVAADASLQGAEDSIPIIDLSLLINGTPQQR 60

Query: 67  SKVVDELGKACRDWGFFMVINHGIAERVRKEMMDCCKEFFDLREEEKRVYETKHVLDPIR 126
           +KVV+ELGKAC DWGFFMV+NHG+ E++ K++M+ C EFF+L+EEEKR YETKHVLDPIR
Sbjct: 61  AKVVNELGKACEDWGFFMVVNHGVEEKLMKDLMEICVEFFELKEEEKREYETKHVLDPIR 120

Query: 127 YGTSFNPHIEKVLLWRDYLKIMVHPNFHSPTKPPTFREISNEYCRRVREMARELMRGISE 186
           YGTSFNP +EK   WRDYLKIMVHP FH+PTKP  FR I  EYC  VREM REL+RGISE
Sbjct: 121 YGTSFNPKMEKAFFWRDYLKIMVHPKFHAPTKPTRFRGILEEYCTSVREMTRELLRGISE 180

Query: 187 SLGLERCNLERAVNWEKCSQILIANLYPPCPQPELAMGLSPHSDHCLITVLLQNQIDGLQ 246
           SLGLE C LE+A + E    +  ANLYPPCPQPELA GL  HSD CL+T+LL NQI GLQ
Sbjct: 181 SLGLEGCFLEKATDLESSLILFAANLYPPCPQPELARGLPSHSDLCLLTILLTNQIAGLQ 240

Query: 247 ILHDGKWVNVNPIPNSFLVNTADQLEILSNGKYKSVLHKAIVNNKATRMSIAMAVGPSAE 306
           ILH  KW NVNPIPNSF++N  DQLEILSNGKY+SVLH+A VN+KATR+SI MAVGPS E
Sbjct: 241 ILHHDKWFNVNPIPNSFIINVGDQLEILSNGKYESVLHRAKVNDKATRISIGMAVGPSHE 300

Query: 307 TVVGPVPELVHQQT-NPPLFKNIKYKDYLETMQNGKLQNKSTLDRVRL 349
           TVVGP P+LV++ T NPP+FK+IKYKDY+E MQ+ +LQ KS LDR RL
Sbjct: 301 TVVGPAPQLVNEDTNNPPMFKSIKYKDYMEIMQSSQLQEKSILDRFRL 348

BLAST of Tan0016231 vs. ExPASy TrEMBL
Match: A0A1S3CJI6 (protein DMR6-LIKE OXYGENASE 2-like OS=Cucumis melo OX=3656 GN=LOC103501162 PE=3 SV=1)

HSP 1 Score: 480.7 bits (1236), Expect = 5.1e-132
Identity = 233/348 (66.95%), Postives = 276/348 (79.31%), Query Frame = 0

Query: 7   IASVKSIAETPNLASIPSTYIFSA-----AAGEETAPQAMEDSIPTVDFALLTAGTPDQR 66
           +ASVKSIA+TPNL SIPS++IF+          + +PQ  EDSIP +D +LL  GTP QR
Sbjct: 1   MASVKSIADTPNLTSIPSSFIFATDDSFDDVTADASPQGAEDSIPIIDLSLLINGTPQQR 60

Query: 67  SKVVDELGKACRDWGFFMVINHGIAERVRKEMMDCCKEFFDLREEEKRVYETKHVLDPIR 126
           +KVV+ELGKAC DWGFFMV+NHG+ E++ K++M+ C EFF+L+EEEKR YETKHVLDPIR
Sbjct: 61  AKVVNELGKACEDWGFFMVVNHGVEEKLMKDLMEICIEFFELKEEEKREYETKHVLDPIR 120

Query: 127 YGTSFNPHIEKVLLWRDYLKIMVHPNFHSPTKPPTFREISNEYCRRVREMARELMRGISE 186
           YGTSFNP +EK   WRDYLKI VHP FH+PTKP  FR I  EYC RVRE  REL+RGISE
Sbjct: 121 YGTSFNPKVEKAFFWRDYLKIKVHPKFHAPTKPTRFRGILEEYCTRVRETTRELVRGISE 180

Query: 187 SLGLERCNLERAVNWEKCSQILIANLYPPCPQPELAMGLSPHSDHCLITVLLQNQIDGLQ 246
           SLGLE C LE+A + E    +  ANLYPPCPQPELA GL  HSD CL+T+L+ N+I GLQ
Sbjct: 181 SLGLEGCFLEKATDLESSLILFAANLYPPCPQPELARGLPSHSDLCLLTILITNEIAGLQ 240

Query: 247 ILHDGKWVNVNPIPNSFLVNTADQLEILSNGKYKSVLHKAIVNNKATRMSIAMAVGPSAE 306
           ILH  KW NVNPIPNS ++N  DQLEILSNGKYKSVLH+A VN+KATR+SI MAVGPS E
Sbjct: 241 ILHHDKWFNVNPIPNSLIINVGDQLEILSNGKYKSVLHRAKVNDKATRISIGMAVGPSHE 300

Query: 307 TVVGPVPELVHQQT-NPPLFKNIKYKDYLETMQNGKLQNKSTLDRVRL 349
           TVVGP P+LV++ T NPP+FK+IKYKDY+E MQ+ +LQ KS LDR RL
Sbjct: 301 TVVGPAPQLVNEHTNNPPMFKSIKYKDYMEIMQSNQLQGKSILDRFRL 348

BLAST of Tan0016231 vs. TAIR 10
Match: AT2G36690.1 (2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein )

HSP 1 Score: 240.7 bits (613), Expect = 1.7e-63
Identity = 130/345 (37.68%), Postives = 202/345 (58.55%), Query Frame = 0

Query: 10  VKSIAETPNLASIPSTYIFS------AAAGEETAPQAMEDSIPTVDFALLTAGTPDQRSK 69
           VK + E   L  +P+ YI+           ++         +P +DFA L       R  
Sbjct: 21  VKHLCEN-GLTKVPTKYIWPEPDRPILTKSDKLIKPNKNLKLPLIDFAELLG---PNRPH 80

Query: 70  VVDELGKACRDWGFFMVINHGIAERVRKEMMDCCKEFFDLREEEKRVYETKHVLDPIRYG 129
           V+  + +AC+ +GFF V+NHG+   V K M+D CK FF+L  EE+  Y +  +  P+RYG
Sbjct: 81  VLRTIAEACKTYGFFQVVNHGMEGDVSKNMIDVCKRFFELPYEERSKYMSSDMSAPVRYG 140

Query: 130 TSFNPHIEKVLLWRDYLKIMVHP----NFHSPTKPPTFREISNEYCRRVREMARELMRGI 189
           TSFN   + V  WRD+LK+  HP      H P+ P  FR  +  Y +  +EM   +++ I
Sbjct: 141 TSFNQIKDNVFCWRDFLKLYAHPLPDYLPHWPSSPSDFRSSAATYAKETKEMFEMMVKAI 200

Query: 190 SESLGLERCNLERAVNWEKCSQILIANLYPPCPQPELAMGLSPHSDHCLITVLLQNQIDG 249
            ESL ++  + E A   E+ SQ+++ N YPPCP+PEL +G+ PHSD+  +T+LLQ++++G
Sbjct: 201 LESLEIDGSD-EAAKELEEGSQVVVVNCYPPCPEPELTLGMPPHSDYGFLTLLLQDEVEG 260

Query: 250 LQILHDGKWVNVNPIPNSFLVNTADQLEILSNGKYKSVLHKAIVNNKATRMSIAMAVGPS 309
           LQIL+  +WV V+PIP SF+VN  D LEI SNG+YKSVLH+ +VN+   R+S+A      
Sbjct: 261 LQILYRDEWVTVDPIPGSFVVNVGDHLEIFSNGRYKSVLHRVLVNSTKPRISVASLHSFP 320

Query: 310 AETVVGPVPELVHQQTNPPLFKNIKYKDYLETMQNGKLQNKSTLD 345
             +VV P P+LV +  NP  + +  +  +L+ + + + + K+ L+
Sbjct: 321 LTSVVKPSPKLVDKH-NPSQYMDTDFTTFLQYITSREPKWKNFLE 359

BLAST of Tan0016231 vs. TAIR 10
Match: AT4G10490.1 (2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein )

HSP 1 Score: 236.9 bits (603), Expect = 2.5e-62
Identity = 126/329 (38.30%), Postives = 191/329 (58.05%), Query Frame = 0

Query: 22  IPSTYIFSAAAGEETAP-QAMEDSIPTVDFALLTAGTPDQRSKVVDELGKACRDWGFFMV 81
           +PS Y+   +   + +  Q   DSIP +D   L       R+ ++++   AC   GFF +
Sbjct: 18  VPSNYVRPVSDRPKMSEVQTSGDSIPLIDLHDLHG---PNRADIINQFAHACSSCGFFQI 77

Query: 82  INHGIAERVRKEMMDCCKEFFDLREEEKRVYETKHVLDPIRYGTSFNPHIEKVLLWRDYL 141
            NHG+ E   K+MM+  +EFF   E E+  + +       R  TSFN   EKV  WRD+L
Sbjct: 78  KNHGVPEETIKKMMNAAREFFRQSESERVKHYSADTKKTTRLSTSFNVSKEKVSNWRDFL 137

Query: 142 KIMVHP--NFHS--PTKPPTFREISNEYCRRVREMARELMRGISESLGLERCNLERAVNW 201
           ++  +P  +F +  P+ P +FRE++ EY   VR +   L+  ISESLGL +  +   +  
Sbjct: 138 RLHCYPIEDFINEWPSTPISFREVTAEYATSVRALVLTLLEAISESLGLAKDRVSNTIG- 197

Query: 202 EKCSQILIANLYPPCPQPELAMGLSPHSDHCLITVLLQNQIDGLQILHDGKWVNVNPIPN 261
            K  Q +  N YP CPQPEL  GL  H D  LITVLLQ+++ GLQ+  DGKW+ VNP+PN
Sbjct: 198 -KHGQHMAINYYPRCPQPELTYGLPGHKDANLITVLLQDEVSGLQVFKDGKWIAVNPVPN 257

Query: 262 SFLVNTADQLEILSNGKYKSVLHKAIVNNKATRMSIAMAVGPSAETVVGPVPELVH-QQT 321
           +F+VN  DQ++++SN KYKSVLH+A+VN+   R+SI     PS + V+ P  EL++ ++ 
Sbjct: 258 TFIVNLGDQMQVISNEKYKSVLHRAVVNSDMERISIPTFYCPSEDAVISPAQELINEEED 317

Query: 322 NPPLFKNIKYKDYLETMQNGKLQNKSTLD 345
           +P +++N  Y +Y E   +     +S +D
Sbjct: 318 SPAIYRNFTYAEYFEKFWDTAFDTESCID 341

BLAST of Tan0016231 vs. TAIR 10
Match: AT4G10500.1 (2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein )

HSP 1 Score: 236.1 bits (601), Expect = 4.2e-62
Identity = 130/339 (38.35%), Postives = 192/339 (56.64%), Query Frame = 0

Query: 10  VKSIAETPNLASIPSTYIFSAAAGEETAPQAMEDSIPTVDFALLTAGTPDQRSKVVDELG 69
           V+ I++ PNL+ + S+                 DSIP +D   L       R+ +V +L 
Sbjct: 25  VRPISDRPNLSEVESS----------------GDSIPLIDLRDLHG---PNRAVIVQQLA 84

Query: 70  KACRDWGFFMVINHGIAERVRKEMMDCCKEFFDLREEEKRVYETKHVLDPIRYGTSFNPH 129
            AC  +GFF + NHG+ +    +M    +EFF   E E+  + +       R  TSFN  
Sbjct: 85  SACSTYGFFQIKNHGVPDTTVNKMQTVAREFFHQPESERVKHYSADPTKTTRLSTSFNVG 144

Query: 130 IEKVLLWRDYLKIMVHP--NF--HSPTKPPTFREISNEYCRRVREMARELMRGISESLGL 189
            +KVL WRD+L++   P  +F    P+ P +FRE++ EY   VR +   L+  ISESLGL
Sbjct: 145 ADKVLNWRDFLRLHCFPIEDFIEEWPSSPISFREVTAEYATSVRALVLRLLEAISESLGL 204

Query: 190 ERCNLERAVNWEKCSQILIANLYPPCPQPELAMGLSPHSDHCLITVLLQNQIDGLQILHD 249
           E  ++   +   K +Q +  N YPPCP+PEL  GL  H D  +ITVLLQ+Q+ GLQ+  D
Sbjct: 205 ESDHISNILG--KHAQHMAFNYYPPCPEPELTYGLPGHKDPTVITVLLQDQVSGLQVFKD 264

Query: 250 GKWVNVNPIPNSFLVNTADQLEILSNGKYKSVLHKAIVNNKATRMSIAMAVGPSAETVVG 309
            KWV V+PIPN+F+VN  DQ++++SN KYKSVLH+A+VN +  R+SI     PS + V+G
Sbjct: 265 DKWVAVSPIPNTFIVNIGDQMQVISNDKYKSVLHRAVVNTENERLSIPTFYFPSTDAVIG 324

Query: 310 PVPELVHQQTNPPLFKNIKYKDYLETMQNGKLQNKSTLD 345
           P  ELV++Q +  +++   + +Y +   N  L   S LD
Sbjct: 325 PAHELVNEQDSLAIYRTYPFVEYWDKFWNRSLATASCLD 342

BLAST of Tan0016231 vs. TAIR 10
Match: AT2G44800.1 (2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein )

HSP 1 Score: 213.0 bits (541), Expect = 3.8e-55
Identity = 113/311 (36.33%), Postives = 178/311 (57.23%), Query Frame = 0

Query: 42  EDSIPTVDFALLTAGTPDQRSKVVDELGKACRDWGFFMVINHGIAERVRKEMMDCCKEFF 101
           E ++P +D +LL    P  RS  + E+  AC+++GFF VINHGI   V  + +D   +FF
Sbjct: 49  ETTLPVIDLSLL--HQPFLRSLAIHEISMACKEFGFFQVINHGIPSSVVNDALDAATQFF 108

Query: 102 DLREEEKRVYETKHVLDPIRYGTSFNPHIEKVLLWRDYLKIMVHPNFH----SPTKPPTF 161
           DL  EEK +  + +V +P+RYGTS N   ++V  WRD++K   HP        P+ PP +
Sbjct: 109 DLPVEEKMLLVSANVHEPVRYGTSLNHSTDRVHYWRDFIKHYSHPLSKWIDMWPSNPPCY 168

Query: 162 REISNEYCRRVREMARELMRGISESLGLERCNLERAVNWEKCSQILIANLYPPCPQPELA 221
           ++   +Y      + ++L+  ISESLGLE+  L+  +  E+ SQ++  N YP CP+PE+A
Sbjct: 169 KDKVGKYAEATHLLHKQLIEAISESLGLEKNYLQEEI--EEGSQVMAVNCYPACPEPEMA 228

Query: 222 MGLSPHSDHCLITVLLQNQIDGLQILHDGK-WVNVNPIPNSFLVNTADQLEILSNGKYKS 281
           +G+ PHSD   +T+LLQ+   GLQI+   K WV V  I  + +V   DQ+E++SNG YKS
Sbjct: 229 LGMPPHSDFSSLTILLQSS-KGLQIMDCNKNWVCVPYIEGALIVQLGDQVEVMSNGIYKS 288

Query: 282 VLHKAIVNNKATRMSIAMAVGPSAETVVGPVPELVHQQTNPPLFKNIKYKDYLETMQNGK 341
           V+H+  VN +  R+S A          + P P+LV+   N P +    + D+L  + +  
Sbjct: 289 VIHRVTVNKEVKRLSFASLHSLPLHKKISPAPKLVN-PNNAPAYGEFSFNDFLNYISSND 348

Query: 342 LQNKSTLDRVR 348
              +  +D ++
Sbjct: 349 FIQERFIDTIK 353

BLAST of Tan0016231 vs. TAIR 10
Match: AT5G24530.1 (2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein )

HSP 1 Score: 213.0 bits (541), Expect = 3.8e-55
Identity = 116/304 (38.16%), Postives = 179/304 (58.88%), Query Frame = 0

Query: 49  DFALLTAGTPDQRSKVVDELGKACRDWGFFMVINHGIAERVRKEMMDCCKEFFDLREEEK 108
           DF L+   + D RS ++ ++ +AC  +GFF VINHG+ +++  EM+   +EFF +  EEK
Sbjct: 37  DFPLIDLSSTD-RSFLIQQIHQACARFGFFQVINHGVNKQIIDEMVSVAREFFSMSMEEK 96

Query: 109 RVYETKHVLDPIRYGTSFNPHIEKVLLWRDYLKIMVHPNFHS-----PTKPPTFREISNE 168
               +       R  TSFN   E+V  WRDYL++  +P  H      P+ PP+F+EI ++
Sbjct: 97  MKLYSDDPTKTTRLSTSFNVKKEEVNNWRDYLRLHCYP-IHKYVNEWPSNPPSFKEIVSK 156

Query: 169 YCRRVREMARELMRGISESLGLERCNLERAVNWEKCSQILIANLYPPCPQPELAMGLSPH 228
           Y R VRE+  ++   ISESLGLE+  +++ +  +   Q +  N YPPCP+PEL  GL  H
Sbjct: 157 YSREVREVGFKIEELISESLGLEKDYMKKVLGEQ--GQHMAVNYYPPCPEPELTYGLPAH 216

Query: 229 SDHCLITVLLQN-QIDGLQILHDGKWVNVNPIPNSFLVNTADQLEILSNGKYKSVLHKAI 288
           +D   +T+LLQ+  + GLQIL DG+W  VNP P++F++N  DQL+ LSNG YKSV H+A+
Sbjct: 217 TDPNALTILLQDTTVCGLQILIDGQWFAVNPHPDAFVINIGDQLQALSNGVYKSVWHRAV 276

Query: 289 VNNKATRMSIAMAVGPSAETVVGPVPEL--VHQQTNPPLFKNIKYKDYLETMQNGKLQNK 345
            N +  R+S+A  + P+   V+ P   L         P++K+  Y +Y +   +  L  +
Sbjct: 277 TNTENPRLSVASFLCPADCAVMSPAKPLWEAEDDETKPVYKDFTYAEYYKKFWSRNLDQE 336

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q6Z2442.7e-8247.112-oxoglutarate-dependent dioxygenase 19 OS=Oryza sativa subsp. japonica OX=39947... [more]
Q9ZSA73.5e-6138.30Protein DMR6-LIKE OXYGENASE 2 OS=Arabidopsis thaliana OX=3702 GN=DLO2 PE=2 SV=1[more]
Q9ZSA86.0e-6138.35Protein DMR6-LIKE OXYGENASE 1 OS=Arabidopsis thaliana OX=3702 GN=DLO1 PE=1 SV=1[more]
Q6YYX92.3e-5735.99Probable 2-oxoglutarate-dependent dioxygenase SLC1 OS=Oryza sativa subsp. japoni... [more]
Q8W2X52.0e-5637.54Flavanone 3-dioxygenase 2 OS=Oryza sativa subsp. japonica OX=39947 GN=F3H-2 PE=1... [more]
Match NameE-valueIdentityDescription
XP_038886412.13.9e-16682.00protein DMR6-LIKE OXYGENASE 2-like [Benincasa hispida][more]
XP_022149149.15.8e-15476.74protein DMR6-LIKE OXYGENASE 1-like [Momordica charantia][more]
XP_022141027.17.1e-14473.55protein DMR6-LIKE OXYGENASE 1-like isoform X1 [Momordica charantia] >XP_02215928... [more]
KAG6576847.11.0e-14269.712-oxoglutarate-dependent dioxygenase 19, partial [Cucurbita argyrosperma subsp. ... [more]
XP_038904929.12.3e-14271.39protein DMR6-LIKE OXYGENASE 2-like [Benincasa hispida][more]
Match NameE-valueIdentityDescription
A0A6J1D6612.8e-15476.74protein DMR6-LIKE OXYGENASE 1-like OS=Momordica charantia OX=3673 GN=LOC11101763... [more]
A0A6J1CIQ03.4e-14473.55protein DMR6-LIKE OXYGENASE 1-like isoform X1 OS=Momordica charantia OX=3673 GN=... [more]
A0A6J1JCE33.0e-14068.86protein DMR6-LIKE OXYGENASE 2-like OS=Cucurbita maxima OX=3661 GN=LOC111483195 P... [more]
A0A0A0L0R13.9e-13267.24Fe2OG dioxygenase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_4G... [more]
A0A1S3CJI65.1e-13266.95protein DMR6-LIKE OXYGENASE 2-like OS=Cucumis melo OX=3656 GN=LOC103501162 PE=3 ... [more]
Match NameE-valueIdentityDescription
AT2G36690.11.7e-6337.682-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein [more]
AT4G10490.12.5e-6238.302-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein [more]
AT4G10500.14.2e-6238.352-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein [more]
AT2G44800.13.8e-5536.332-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein [more]
AT5G24530.13.8e-5538.162-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR026992Non-haem dioxygenase N-terminal domainPFAMPF14226DIOX_Ncoord: 45..151
e-value: 5.5E-24
score: 85.2
IPR027443Isopenicillin N synthase-like superfamilyGENE3D2.60.120.330coord: 4..338
e-value: 1.5E-98
score: 332.2
IPR044861Isopenicillin N synthase-like, Fe(2+) 2OG dioxygenase domainPFAMPF031712OG-FeII_Oxycoord: 201..294
e-value: 8.0E-25
score: 87.3
NoneNo IPR availablePANTHERPTHR10209:SF460FLAVONOL SYNTHASE/FLAVANONE 3-HYDROXYLASEcoord: 9..337
NoneNo IPR availablePANTHERPTHR10209OXIDOREDUCTASE, 2OG-FE II OXYGENASE FAMILY PROTEINcoord: 9..337
NoneNo IPR availableSUPERFAMILY51197Clavaminate synthase-likecoord: 18..339
IPR005123Oxoglutarate/iron-dependent dioxygenasePROSITEPS51471FE2OG_OXYcoord: 198..298
score: 12.868257

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Tan0016231.1Tan0016231.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0046872 metal ion binding
molecular_function GO:0016491 oxidoreductase activity