Sgr029759 (gene) Monk fruit (Qingpiguo) v1

Overview
NameSgr029759
Typegene
OrganismSiraitia grosvenorii (Monk fruit (Qingpiguo) v1)
DescriptionAcyltransferase
Locationtig00153449: 2551184 .. 2573738 (-)
RNA-Seq ExpressionSgr029759
SyntenySgr029759
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGATAAAATCTCAGGACCTTCGGTCACAGTTTTCAAGGGAGGTGCAGGGTCGGCGACCAAATGGCGGATGATGCACATAATCGTAGCCTTGGGTGTTTGGCTCGGCGGCATTCATCTCAATTTTACTCTAGCTCTCATCTCCGTCTTCTACCTCTCCCTGCCCAAAGCCCTCTTGTGAGTTTTGTTTTTCTCTTATTCCCTCGTTCCTTTTTCCTTTTTCCTTTTTCCTTTTCTCTGGCTCTTACAAAAGTGCGTGTTTGCGTGTCTTTCAAATCTCTTTTTTCTTTTCAGGGTCTTCGGGTTATTTTTGGTATTAGTGTTAATTCCTGTCGACGATAAAAGCAAATACGGTCGCTTATTGGCCAGGTATCTTCTGCACCCTCCTCGTTTTCCTTCTTCTCTTTCTTCTTTCATCAATCTCTCTCTCTCTCTCTCTCTCTCTCTCTGCTGACAATCTGCCGTTCGCTTCCAGGTATATATGTAAACATGCTAGCAGCTATTTTCCTGTTACCCTACATGTTGAAGATATACATGCCTTTGATCCGAATCGTGCTTATGGTTAGTGACTGCTATTTTTCTTTTTTTTTCCTCCCATTCAACCATTGTCCTATATTTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCATGTCGTACACGGTGGACTGTTAGTCGTAGGTATTGCAAGATAGATCCTTGGACAATCTTTATAAATTCTTGTTTGCTTTGGAATTGTTGAGCACGTGCTTCTTTTTGGGTGGTTTTCATGTATTCAGTGGCTCTCCTTATCTCTTATCTACGTGGTTGATTAATCATATTATATTGATTGATAAGTTGAATTTAAAATGACAAAAGAAATTTGATATTGATTAACTGTGCAGTTTATATTATTAGCAAACAAAGCATTGTTTACTAGGATGGTGTCATGCAATTTATTTTGGATGTTTTGGAAACTTATAAAGACGAGTTTACATTAAGGAAGGTAGTTCAGACTTTTTTGGCAAGGGTCCAAGATACTTAAGGGTACTTCTTAATTTTTGCTTTTACATAATTTATTCCAAGTAGGAGTCACTAGTGTATTTATTTCTACTTATTTAATAGGAGAGTTTATCCCTTTTTCCATTGCTTCTTTTTTCCTTTAATGTTACATTACTTAACAAATGAGCGAGAAGATGAATGAGAAAGGTAGAAGAAAACAGTAGGCAATCTAGCCCAATAATATATCTAAACTTTCATTGATAATCCCTTTAGGAGAAAACATTATCCCCCAATTACAAGAACTGACCGATAAAACTTAGGCTAAACATTACAAAAAAGTCATCCAACTAGAATTAGTTACATTTTAAGAGTATTTACAAAAAGAAGCATGACTTCTGGCTAAATCCACACTTCCGCTCCAATCTAATGATTTTTCTTTAAAGAGTCATTGATTCCTCTCATTCCAAATCCCCTAAACAACAGCATTTATGGCATTGATCCACGAAAGCTTGGCTTTACTTTTAAATTCATTATGCAAAGAACCTGTAAAATATTATCCTCCTTTGAGCTCTTGTCAAAGACCCAAGATACACTAAATATTTTGAATAGTGTTCCTAGCAATTGCCATTATATATATATATATTTCATGAGAAACAGAGATATATATTTAAGAAAAAAAAGAGAAACAAACTAAAGGTCTTAGGGAGGAGGGATCCCTTAGCCTAAATGATTTCAACTTTTTACAAAAGATTCAAACTTATCTAAGAAAGATCTTGAGTTCCTCTCTTTCCAAAGGAGCCAAGGAAAAGCTCTATACGTGTTTGTTTCTCTTTTTTTTCTTAAATATATATCTCTGTTTCTCATGAAATATATATCTCTATAATGATTTAAACTTTTTACAAAAGATTCAAACTTATCTAAGAAAGATCTTTTTTTTTTATATAAGAAACAATTTCATTGATGATATGAAATTACAAAAGAATGGCTAAAAAGCCCCAATCCAGAGGAGTTACAAAAAACATCTCCAATTGGACAAAAGAGAAGAGAAGCTATAGTATTGGAAGATAGGAGAATATTTGCACCAAGTAATAACCAAAAAACTATAGCATCAAAAAAGCGACCATAGGGGAGCTCCTTATCTTTGAAAATCCTTTGATTCCTTTAATGCCAAGTAAGCCAAAAAAAGGCCTTGATAAAATGCATCCATAAAAGCTTTTTCTCCTTTTTAAAAGGATGACCTGCAAGAGTGTAAGAGAGTAGATAAAAGGGATTATTTGGTAGAGCCGTGGACCAGCTGAAGCATGTTTGCAAATCCCCCCAGAAAGTTGCAGCAAAGGTACAAGAAAGAAATAAATGACTTTGTGATTCAGAATCGCTCTTGCAAATGGGGCACCATTGGGGACAGGCGGGCCATATATGGCTTTCTACGTTGGAGCATGTCTTAGTGTTAATAGCTTTGTGGCTAAGCTCCCATAGAAAGAACTTAACCTTCTTAGGATAGTTGTCCTTCCAAATAGAATCATAAAACAGTGGAAGTGAAACTATCCTTTTAGAAGAAATGTCCCTCGGTAGAGAATTTGTGGAGAATAATCCATTATCTTCCAAATTCCATACCCTTTTGTCTGACACTCAAGTTAAGTGAATGGAGGATAAAAGAAACATTAATTCAGCCCATTCATTGCTTCTGCATCCTTAAGGTTTCTTCTTAGTCTTATGTTCCAACATTCTATTTCTGTCATCCACATATCCTTAACCGAAGCAAGTTGCTTGCGAGAAATGGCAAATAACAAAGGGAACTTTCTAGCCAAAGGAAGATTCAGAAGCCAAACATCATTCCAAAAGGACGTGTCAGCTCCGTCCCCTATCTTGTGGGTCCATCAGTCATGGATAGCTTTCCAGAGGCCCTTGGCAGAGGCAATGGAGTAATATCCAGATCTAATATTCCTTGGCGTGTTTCCGTATTTTGCATCAATAACTTTTCTCCATAAAGCCAATTTCTCCTCTTGATATCTCCAAATCCGTTTTGGTAGGAGGGCCTTGTTTTTGTTTCTAAGGCTGGGTATGCCAAGCCCTCCTTCCTCTAAAGGATAGAGGATCTTATCCCATTTCACGAGGTGAAGACCCGAACTGTCACTATTTCCTTTACATAGAAAATTCCTGAAATTTTTCTCAATAGTGTTGCTCACCTTTTTGGGTATGGTATATAGGGACATATAATAAATCGGGAGGTTTGTTAGAGTGCCTTGGATGAGAGTGAGTCTACCTCCCTTAGAAATAAATGAGTTCTTCCATGTTTGGAGCCTTTTTCCAATTTTCTCAGGTCCCAAAAGGATAGGGAATGCTTGTTTCCATTTAAGGGAAGGCCGAGATACAAATTAGGCCAAAATGCAGCCTTACATCCAAATCTCTGAGCAATGTCTTCAACCTTCCTTTGGTCCACATTTATGCCCATAATTTCAAATTTATGTTGATTAACATTAAGCCTCGAAGCTTCTTCAAAAAATTTGACAATGTTGAACAAGTTATTTAAAGCCCAATCATCAATGGCAGAAAAGAGGATAATATCATCGGCAAGCTGTAGGTGATGAATATTTATGGAGCCCTCACCCAAATCAAAGCCCTTAATCATGTCTTTTGATGCAGCATGAGTCATAATTCTGCTATAGCAGTCCATAACCAAAATGAAGAGAAAAGGTGAGAGAGGATCACTTTGACGAAGACCTCTAGAAGCTTGAAACTTACCCCTAGGGCGCCCATTAATGATGATAGAAAAGTTTGTGGATGATATACAACTTCTAATCCAACTTCTCCATTTTTTACCAAAACCTGTTGCCGCAAGGATTTCTTCAAGAAAGTCCCAATCCACTTTGTCAAAAGCCTTTTCGATGTCTAACTTGATAACAATACCCATCTCTTTTTTCCTTTTCCACTCGTCGATGAGCTCATTTGCAATGAAAGAAGCATCAAGAATTTGTCTACCCGCCACAAAAGCAGATTGGTGTTCAGTAATGGTGTAGGGAAGCACCTCTTTAAGCCTTTTTGATAAGACTCTCGCAATAATCTTATACATGCAGGAAATGAGGCTTATGGGTCGGTAGTCTCCAACAGAACGGGCATCAACCTTTTTAGGAATGAGACAAATATAAGTTTCATTAAGCCTTTAATGATGTTCCAAAATTTTTTAAAGAATTCGGCTGGGAAGCCATTTGGTCCAGGAGTTTTGTCGGTCCCCAACTCACTAACAGCATACCATACTTCCTCTTTGGTGAAGGCCTTCTCAAGCTTTGAGTTTTGATGTGGAGAAATAGGATCCTATTGGATTGGGTGTGGAAGAAATCTGCAACCATCTTTTTTGGAATAAAGGTTTTTATAAAAGTTTATAAATTCCCCCTCAATCTCTTTATCTTTCAATAAACTGCTACCTTCTCTAGAAATAATTTCAAGGATAGTGTTTTTTCTTCTTTTGGGGGCCATGACCCGATGAAAGAAGCTGGAGTTTATATCCCCATCTTTCAGCCATTTGGTTTTACACTTTTGTCTCCTCCTTCGCTGCCGAAGTAAGAAGCTGAGATTTAATAGAGGTTCTTTCTAGACCGTGAATTGGCAAGAGATCCCCATTTTCCTCCATAGAGTCCAGCAAAGAAAGCTCCATAAGAAGTTGATTCTTTTGTGTAGAGATGCATCCAAAGGTCTCCTTGTTCCAAGAGCGAATAATGGATTTTAACTCCTTTAACTTCTGGATAAAACCATGATCGGGCCATCCTCTCATCGGGGTATTGATCCATCAATATTCCACTAGGGGGAAAAAGGAGTGATGTTGTAGCCACATATTTTCAAAGCGAAAGGGTGAGGACCTCATTTGTCAACTCCCAACAAATGCTGATAGGGAAATGATCTGATGTGGCTCTATCCAATCTTTTGATCGTAGCATTCTCAAACTTTGAAAGAAAACCTTCGTGGCCAAAAATTTATCAATAAGGGTGAGTTTGATGGGTCTCTATAGCTAGACCATGTATGAAGCCCATTGGATAGGGGGAGATCATGGAGAGCCATTTGGTCGATGAATCTGTTAAATTTGTGCATGCTACGAGTGGGACGCCTGTTGACAGATTTTTCATGAGACCATTGAGTAACGTTAAAATCTCCACCCAACAACCAGTGGTCCGTGCATAAGAAAGCTAAATCAGATAGCTCCATCCAAAATTGAGATCTTTCCTTAACTTTAGTGGGGCCATAAATCCCTGTCAACCGAAATTTGTAACCATTAGCAAGAGAGATGAGTAAGGATAGAGAGAAGGCACCTTTGGTTATGTCTGAGACGATGAATGATGGGTTGTTCCACATAATGATAATCTCCCCAGATGATCCAATAGATTCCAAGGCAGATCAACCGATATCTCTGGAACTCCATAGAGATTTGATGAAGATTCTGTCTACCAAAGTTGTCTTTGTCTCTTGGAGAATGACTATAGTAGGATTTTGCTTTAAGATGAGATTTTTAATGAGAGCCCTCTTTTCCCACGTGCCCAAGCCTCTAACATTCCAAGATAGAATAATCATAAAAGGATAGCTGACCCACCAATCATCTTAGATGATGATGTTTATCATAGTTGATAGTAGATGCGAGACTAGTTAACTCTCGAATTTCCTTGTTCTGTTTACTCAAGGACTTGTTTTTCTTTGTCGTCGTAGGTAAAGGCAAAATCCCTAGCCCCATCATCTTTATCCACGGAACCACCAATTGCAAGTATTCAAATGGGTGAATAGTTGGGTCATAAAGCAAATGACAATGTTCATCCGTTTGCGCAGAGTCTTGTGGCTGCCCATAAGTTTTTTTCGAAGCTTTTGGATGGGTAATAGCTCTTGGTTTGCAAAAAACTCGGAAGATTCATCTTCTTGTTGACCCATAGTGATCTGGAGAGTTTGGGAACAATATGGAGTGGTTTGTTTAGTTGAATTTTGAATATCTTTCGAGGGAGAAGGAGGGGCAGAGGACAGTGGATAAGGGCTCTAGATATAATTGTCAAGACTCTGGGTCAAGAAGAGAGGGATTGGTAGAGTGTTTGGTACCGGCAATGAGGAAAGTTTTCCTCTCACAGATGTTAACGGCTAATTTTCGTGGGCTAGCAAGGAGCTTAGCTGGGTCGAGTAAGAGAATAGGATGGGAGGATGTGGCGGGTCAGGTTTGGAGCATTCGGTAGGAGAGTCAATGGATGGTTCAATAATAGTGGGCGGGTTCTTTTCGGGTAAGTCCATTTTATTAGCAATAATTGTGTGGCCGTCTTTTTTTATGGAGATGGGTCACTGTGAGGTTGAGTGTATACAGAGACGTCATTAATGATATCTGTACAATGGGGGCTGGATAATTCAAAGGGCGTACTGTATAAGTCAGGTAAAGGTTCACAGTGAGCTGGTGGACACTGGGGCCCCCTAGCCTAGGGAAACGGTCCAAGCAAACCTGCCTTGGGGTCTGGGAGATTAGAAACATTGTCCTTTGTCCTGGGAGGCTTACCGGCGGCAAATCACCACGGATTCCAGCCATGAAACCAAGATGGTGTTCTGTAGTAAAGAAAGGGTGGTCGATTTGGACCGTCAATGTGGATGAGGAAGTCGTGGGAATCCAGACCTCTGCTGGGATGAAACCAAACGGGTTTTGTTTTACTCTAATCGATGCCTCCATAAGATCAAGTCTCGACAAGGTTTTTTTTTGCTATCTCAAGGAAACCTCCACAAGCATTTCCAATAAATTTTAAAGTTTCATCATCCCATCTATCCAAAGGAAGATTTTTTAGTTTGATCCATCATCCATATGAAGGAATGACAAGGGCTCTCTTGTAAGCGTCGGTACTCCATGACAAGAATTTTAAGGGAAAATCTCTAACTTGATACCAGTCCTTTATCTTGCAGAGAATGTTGGCTTTTTCCTCACTTTCACATCTGAGGAGAGCCTGATCAGCAAAGAAAGGGTTTAGAAAGCAAAAATCACTAATGTGTTGTTGTAAAGCCCTCATGATTTTGTATCGATCATCATGGAAATGTTTCCAAATGACAATGATGGAGGAGTCTAAGCAACGGTCGGCATGGATGTTTTTCACCAATAAGTGCTCAGTTGTGCTTATCTCAAGGGGCAAATGGGTGCAGTTCTCTTGTGGCAAAGACTTTTGGGCCATTTCTCCTTTTCTAAGTATGTCACAGTAAGAGGTAGGTGCACCCTTCATGCTAGCATTTACAGACGCACCATTAATGAGAGGGGAAGGAGGAGCAGACGGGTAGTTACTAATGAGATTATAAAAAGCTTGCCAACCTCTTCTTTCTTCCCCAACAGGTATTAAAAGCTTGTTGATGCATCCATTGTTGTCAAGCTTTGCGATTTCCGCCAATGTACCTTTCCTATTGGTGGTTTTCTCTACCATAGAGTTTGGTCATCAATGTGCGTTTCCTTGAAGAACTTTTGATTGAGGGGAGTAGTGAGGAGAGTGTAGAAAGTGTTGGAGATCCATTGTAGAGAGGACCAGTGAATGGAAGTGAAAAAAGCTCTGTCTTTGGTGGTCTAAGTGATTTTTGCTCGGCCACCACGAAATCTCTGATCTACGTCAATGGAGAACTTCTTTCTTTCGATAGTTGCTGATTGGGGGGCTTGTATCTTATCCATCTATGGTGACGAAGGTTGATTCTAAAGAGGCCATGGAAGAGGGGCGACGGATGAGGATTTTAGGATGCTTAGGATGCTTGAGAGGGAAGAGGGTGAGAGCATTTTCCATTTTTTTAACTAAGAAATATCTTGAGTTCCTCTCTTTCCAAAGGAGCCAAGGAAGAGCTCTACACGTGTTTGTCCATAGAATTTTGCCTTTATCTTTAAACCACTCTCCAGAAAGCAATTCTAGTGTCTCATCTTCCACACTATTAGGAAGGCACCAACTAAGATCAAAACAATTGAAGGTGTAGTTCCAACCTTTGTAAGCGTAGGAACAATGAAGGAAAAGGTGGCTGATAGATTCCGCACTATTCTTGCAATTGCACTTCAAAACATAGGACATAAATTGAAGGACCTAACAAGAAAGAAGAAATTTGGGAAAAGGGAGGTTTGGTTGTAGGAAATTTTAGACTTCAAAGTTTGGCTCTCTATTGGGAATGGGTCATTTGTTTGTTTATGGGAAAATCATTGCTTGGGTGATGGTCCCTTTTATTCTCCTTTCCCTCGTCATTTTCACCTCTCTTCTTAGAAGTCCACTTCAGTAGCTTTTATCCTCTTTTTTTTTTTGCTGAGGATTTCACTTTGTACAATTTTGGTTTTAGGAGATCATTGTCCATAAGAGAAACTACGTATGTGTCTTCTTTGGTTTCTATGTTGTGTGAGTTTCAAATTCGCCCCAAGTTTAAAGATATTCATGTTTTAGACCCTAATCCTTTGGGTGTGTTCTCCCATAGATCCTTCTTGTTCTAATTGTGTGCTCTTGATCCTTTGGTTCCATTTGTTTCCCCTCCATTTTCCTTGCTCCAGGAGGTTAAAATCTTCGAAAATGTCAAAATTTTTGCTTGGCTGGTTCTCCATGGGAGGATTAACACTATTTATTAAATTATTATATTCAAAGGTCTTCCTTTCTATCGTTGGAGCCTCAATGGTGTCTTGTGCAAACGTGTTTGTGAAGAATTGGACCATGTGTTGTGAGTTGTTAGTTTGTAGTTCTATTTGGGATAAACTTTTGGAGACTTTTGGTGTGTTTTTGGCTTCCAGTAGATCTAGTTGCTTGATATTGGAGGTGGTGCATCATCCACCGTTTTGGGATAATAGAAGGGCTCTTTGGCATACTGCTTTTTGGGCTATTTTGTGGAATTTCTGGCATGAGAGAAATAGATGAATCTTTAGAGGCATTAAGGTTTCTTGGGGAGGTGTGGTTCACAACTCATTTTAATGCACTCCTTTGGATATCTTTGACCAAAGAGCTTTGTTTTTATCCTTTGTTTTTATTCTTAACAACTGGAGTTTTTTCTCTTTTTAATCTTAGTTTTTTGGTTTTTTTGTTTGGTTGTCTCCTTTTTGTTACTGGCTTCCCTTCTTTAGTATTGATTGCTTTCGTTCTTTCATTCATATTAATAAAAGTTTGGTTTCTCATTTTTTTAAAAAAAATCATAGAACATTTCTTTTGTAATCTTTCCGCATGTTTAAACATTCATACTTGATAACATTATATGTTCATTGGTAATATATTACATTAGTAGGAAACCACAATGTAATAAGTTTTTGCTGTGGTCCTTGTTTCACGGAGGCATCAACTGAATTCAAAGGAGAAATCCTTCGTTCTCCTCCTCTCTGAGGGGTTTGTGCTTTATCAGATACAGTGAAGAAGATCCAAACTGCAAATTATTTTTGTATCTTCCATCCTGGTTGGGAATTCTTGGCTGTCTTGGTGCTCACTCAGTGCTTCTTCATTACATTAAGGATCTCATACTACGATTTGTTGACTTTGAAAACCTTAGCTAAAGTGTTATGATCACTTGGAGTAATGTGGTCAAGGCCACTCTTTTGAGTGTTGGATTTCCTAATTATTGTGAATATTTGTGTGTAAATATTAGTTGTCAATATTGCCTTTATTATTTGGTTTCCTTGTTTGATTTTGTTTCCTTATTTGTAATGGGTCTTTCTCCTATATACGAAGACTTAAGTTGCACATATTAAAATAAAATAGAATATTTTTATTCTCTCAAATTCACATTGTATCATAGCAGTATACCTTAGCTACCTAAAAAACCTAGCCGCCGCCACTTAAAAAAACCCTAGCCGCCACCGCCTTAAAAACCCTAGCTGCTACCGTCGCCTCCTTCAAACAGTGCACTCTACTTCCGGCAGAGGTCTTCGTATATGCAGTCTCCTCGTCCACGTTCGTGTTCTTCGGTTGTGCTCGCAGGTGAAGCCGTTCGTGGTCTTCCGTTGGTATTAACCATTCAGTCCAGTCGTTTACGGTATCAATCGGTTCAGTCGTTCTAGTCGGTTCATCGTTAGCATTCCGCCTGTTTTCTCTCGTTCCTCCGGCGATTTTCTCCATTTGTGTGCCCGCCGTTCATTCTCGCTGTTTGTGCTCACCATTTGTGTGCTCGCATATGTGCCTCCTGTTGGTTTATTTCGACGACTTCATCTGCTTGTGGTTACTTGCACGTGGAGGTCTGTGGTTACTTGCTCGTCTGTTCACGTGTGCATCCAATCCGTATAGTCAGCAAGGTGTTCCTTGCCACCCGATCAAGTTTGTCTTCTTTTCTTGCTACTATGTCTGAGGTAACCGTTGAAATGGTTCGAAATTCGACTACAAGCAAGGGTAATCAGTTGTTGTTCAATGCCAATTTTGTTGACTCAAGTTTATTCAATAAGGAGCAAATTAATCAAATCTTGAAATTGCTACCAACCAATTTATCTTCTAATAAATCAAGTGTTTCCTTGGAACAAACAGGTAATTATTCTCAGGTCTTCTCTTGTAGTAATTCATCTCCATGGATTATTGATTCTAGAGCCTCCGATCATATGACTAGTTCTTCTCGTCTATTTGATTCTTACTTTCCTTATTGCAATGAAAAAATTAGAATTGCAAATGGCAGTTTTTCTTCTATTGTAGGAAAAGGAACCATTAAATTGACTGAATAAATCATTCTACCGTTTATCCTCCATGTTCCCAAATTAGCCTATAACCTTTTATCTGTCTCCAAACTTTCTAGAGATTCTAACTGTCATGTTGTTTTCATTGAATCTCATTGTATTTTTCAAAATCAGGACTCGGAGAAGATGATTGGACGCTTGGATGCTTGATGGCCTCTACTACTTTGATGACGGTCCTTCTAGTAATAAAAAAGCTCAGGTTTTTAGTTGCATTAGTTCTATTTCTACTAAAGAACAAATTATGCTTTGGCATTTAAGACTAGGACATCCCAATTTTTCATATCTCATCCACATAGTTTCCAAATCCTTACAAAGCTTCCAAACCATTCTACTTAATAAATAGTGATGTCTGGGGTGCTTCTAAAATTCAGACTCAAAGTGAAAAAGGTGGTTTGTTGCTTTTATCAATGACCATACTCACCTTTCTTGGGTTTATTAAGACAAAAAAATATGAGGTGAAAATTCTTTTTAAACGCTTTTACAATATGATTGAAACCCAATTTCAAAATAAAATTGGCATTTTGCACTCTAACTATGGAACTGAGTATTTCAATGAATATTTGGGTGATTTTTTGAAAGATAAAGGTATTTTTCATCAGTTCACTTGTCGGGATACTCCTCGACAAAATAGGATTGCTGAAAGACAAAATAGGCATTCACTTGAAGTAGCACGTGCCATAATGTTTTCCATGCATGTTCCCAAATATTTATGGGGGAAAGCAGTTCTCACGGCTGGCTACCTAATAAATAGAATGCTAGTAATGTCCTAAATTTCAAAATGCCTCTTGAGTGTTTTAAAGAAAAATTTCCAACAACTTGCATATATTCTGATTTACCTATCAAAATTTTTGGGTGTACTGCTTATGTTCACATTCCAAGTCACCTTCAATCAAAACTTGATCATAAAGCTGTTAAATGTATCTTCTTGGGTTATTCTTCTAATCAAAAGGGGTAAAAATGCTTTGATCCTCAGACCAAAAAAGTTTATGTGAGTATGGATGTGTCGTTTTTTGAAAAACAATCATATTTTACCCCAAATTCTCTTCAACAAGAGAAACCAAATTTGGAAGAAAATTTTTGGATATTTTTGTTCCTCTTCCTAGTATTATTTGTTTTGCTATTACTAGCTCTTCCATGTCATGTATAGAAGAAACTTCTCATTCAAGGGGAGAAATACACAGAATGATTCTATTGATCGGATTCCTGAACTTAAGGTTTGTACAGACGGATAGTTGCTCAAAGGAACAAAGACCAAATAATTGATCCTTCACAAAACCAATCTGAGACTCCAAGAAATGAAAATGAAACGATTGGTAACCCCTCGTCTATTCCTACTATTCAAAATACTTTACCTGTTGTGTCTGATCTTGATATTCTCATAGCTGTCAGAGAAGGTGTTTGAAGTTGTACTAAATATCCCATTGCAAACTTTTTTCATATCACAATCTATCCAACAATCATAAAGCTTTCACATTTAGATAACTAATTTGTTTGTTCCAAGGAATATACAAAAGCCTTAGAAAATCCAAATTGGAAACTAGCTGTTACGGAGGAGATGAATGCTCTGAAGCAAAATGGAACTTGGGAAATAGTAGATTTGCCAAAAGACAAGAAACCAGTAGGATGTAAATGGGTGTTTACCATAAAATGTAAGGCAGATGGTAGTATCAAGAGGTACAAGGCTAGACTAGTAGCTAAAGACTTCACTCAGACCTATGGGATTGATTATCAGAAGACCTTTGCCCCAGTTGCTAAAATAACTCTATTAGAGTTCTTTTATCTCTTGCAGTTAATCTGGATTGGCCTCTCCACCAACTTGACGTGAAAAATGTCTTTCTCAATGGTGATCTCGAGGAAGAGGTTTTTATGAGCTTACCCACAGGTTTTGAAAGGGAATTTGGATGTAGTAAAATTTGCAGATTAAAGAAATCACTTTATGGCCTCAAACAATCACCAAAGCTTGGTTCGAGCGTTTTGGAAAAGTTATGTCTAGCTATAGATTTCTTCAGAGTCAAGCAGACCATACTATTTTTTATAAACACTGTTATAGTGCTTATATTATTATATTTTGTATTATTGTAAATATCTTAGTATTAGTCATTTGTCTTTTATTGTCCTTTTACATTCTTTGTTTTAGGTTAACCTTGTATACTTGTCTATATATATCTTTTAGTGAATGGAATAGAATTATTGATTCTCTCAAACCTTTAGTCTCTTCATGGTATCAGAGCGCTAGGTCTTTTTCTAAGGCAATTAGGGTTTTGTTAACCTTAGGGTTTTGTGATTTTTCAATCCTTTAGGTTTGAAGTTTGGAACGTTTTCTTGGACTGATTGTTAGTGCTTGTTTTTTACCTTGGTATTGTGAATCTACCGCCGCTGCCACGTTGCCACCGTCGTCGTCGTCTGTTTTCGTCACCGCCCAGCCCAAAAGGAAGTCCAGTCACCGATCTGCAAAACCTGAAGTTTGGTCGAAATTCGACTTGCACGTGGGTCTCACGTGCTTCCGAAATTGCCTCCAACCGACGACGTTTGGTTCTCCGTTCCATCCTCTTGCTATTGGTGTCCATCCTTTAGTCATACCTTGGTCAGGTTGCGGCTATTTTAAGGAAAAACGTTTCGGGTCTATTAGATTTTGTGATTCTGATCAGTTTGGTTCAGTTTTGAAGTTGTGTGCTTGTTTTTTAGGTCATGACTGAGAAGAAATCAGTAGTGACGTCCGAGATGATTCCGATGATGTCAAAAATCACAGAACATAAGTTGAATTGATTTAACTACTATGCATGGAGAACAAACGTTCATCATTTTGTACGGAGCATTGATATGGAGGATCACATGACTGGAAATTCACCGACTGATGGCACCAGAAGAGCTTGGCTGCGGGATGATTCTAGGATGATACAGATCAAAAATTCGATTGAAAGTGAGATTGTAGACCTGGTTAATCATTGTGAGTTTGTTAAAGAATTACTTGAATATTTGGAATTCTTTATTTTGGGAGAGGAAACATCAATAGAATGTTTGATGTACGTAAGGCCTTTACCAACTTGAGATGGGAGACAAATCTCTTACGAGTTATTTTATGGAGTGCAAAAGAACATATGCGGAGTTTAATACGTTGCTACCAGTTAGTACTGATGTGAAAGTACAGTTTGCCTAACGAAAACAGTTAGCAATTATGAGTTTTCTAGTTGGTCTTACACCTCGATTTGATATGGCCAAAGATCAAGTGCTTTCTGGTTCGAAAATCTCATCATTGGAAGAGGCATATACTAGAGTACTTCGCATGAGAAGTCACAAACAGTTACATCATTCCAGTCCAACAGTGCTTTGGTTGGACGAACGAATGAGTACCGAGTAATAGAGGAATAGGTTCAAATAACTCGAGGGAATTATAATACTCAAAAACTGAACTCAGGAGATGTTATGTGTCATTATTGTCATAAGTCAGGCCATACAAAGCGTGATTATAGAAAGCTGTTGAACAAAGGTCAGAGAACTCAGTCTGTACATGTTGCATCTACTTCTGATGATCCCGAAAAGCTAATTATGATTTCTGCAGAAGAATTTGCTAAATTTCAACAGTATCAAGAGTCATTGACGACATCATCCTCTAATCCAATTACAGTCATCGCTGAGTCAAGTAACACAAAAAAATGTTTTCTTTCATCCTTATCCAAATGGGTCATTGATTCTAGTGCACAAATCATATGACAAGTAGTTCCAGTTTATTTTCTACTCTTTCACCATCTTTGCCTAATGTTACTATAGGGATGGAACCACCTCTTTTGTTCTAGGATCAGGCACAGTCCGTCTTACCAACTCTCTTTCATTGACCTCTGTTTTAAATTTGCCACAGTTCTCTTAATTTGATATTTGTTAGTAAGCTTCTTGTGATCTCAATTGATGTGTCTTATTCTTCCCTGGTTATTGCTTATTTCAGGATCTTACGACGAAGAGGACTATTGGTAGAGGGCGTAAATCCGGAGGTCTCTACACATTTGATACACAAATACTTACAGTCATCCCATACTCTAGAGTGTCATCTCCTTTTGAAGAACATTGTTGTTTGGGTCATCCATCTATCTTCGTGTTGAGGAGTCTTCGTCCTCAATTTCATAACTTGTCTTCTTTAGATTGTGAGTCATGTCAATTTGCTAAATTTCATCGTCTATATTCGTATCCTAGAGTCAATAAACGAGCTAGTGCTCCATTTGAGTTAATTCATTCTGATGTTTAGGGTCTTTGTCCTATTGAGTCCAAAAGTGGGATTTGGTATTTTGTTACTTTTGTCGATGATTTTTCTCGTGTAACTTGGCTATATTTAATGAAAAATCGTTCTGAGTTGATTTCTCATTTTCGTAACTTTCATGCTGAAATTCGAACTCAATTTGATGGGTCTCTTAAAGTTCTACGAAGCGATAATGCTAAAGAATATTTCTCTAATATCCTTGGATCTTATTTAGATGAGCATGGTATCCTTCATCAATCCTCATGTGTTGATACTCCATCTCAAAATGGAGTTGCAGAACGGAAAAATAGACATATTCTAGAAACAGCAAAGGTCTTAATGTTTCGAATGCATGTTCCAAAATACTTCTGTGCCGATGCTTTTTCGACGGCTTGTTTCTTAATTAATCGCATGCCTTCATCGATTCTTAAGGGTGAGATACCTTATCATACTTTGTGTCCTACACAACCTTTGTTTTCTATCAAACTTAAAATATTTGGTTGTACTTGTTTTGTTCGGGATGTTTGCCCCCAACTCACAAAATTGGACCCAAATCCTTGAAATTCATTTTCCTTGGTTATTCTCGTGTTCAAGAAAGGTATCGGTGTTATTGTCCTGATCTCAATAAATATCTCGTCTCTCCTGACATTACATTCTTTGAAGATGTTTCTTTCTTTTCATCTTCTTCGAGTAATAGTCAGGGGGAGCGTTTAGAGGAAGACAATGATTTTCTTGTTTATTCAACTTTCTCTTCTTATAAGAAAGTGCCTTTAGCGATACATCTCCCTCTGTACATGATCCTCCTCTCCCACCTATTACTTAGGTTTATTATCGCCGACAACCTCATTTGGTCACATGCCCTATACCAGAGGATTCTTCGTCATTGGATCCAGGAACGAGCATGATCTTCATATTGCTCTAAGCAAAGGTAAACGTCAGTGTACTTATCATGTTTCCTCTTTTGTCTCATATAATCATTTGTCATCTCCTACTTGTTCGTTCATTGCATCTCTTGAGTTTTTATCTATTCCTAAAACTATTCATGAAGCGTTGTCTCATTCTGGTTGGCATGCTGCAATGTTAGAGGAGATGACTGTCTTAGATGACAATGGTACTTGGGATTTAGTTTCTCTTCATGTAGGAAAGAAGCCTATTGGTTGTAAATGGGTGTTTGCCATTAAAGTTAGTCCTAACGGATCTGTCACGCATTGAAAGCTCGTCTTGTTACTAAAGGCTACGCGCAGACTTATGGAGTTGACTATTCTGATACTTTTTCTCCTGTTGCTAAATTGGCTTCTGTCAAGCTATTCATTACGTTGGCATCAATCTATCATTGCCTTGCATCGGCTTGATATTAAAAATGTCTTTCTACATGGTGATCTTCAATAAAATGTGTATATGGAGCAACCACCGGATTTTGTTGCTCAGGGGGAGAATGGAAAGGTATGTCGTCTTCGTAAATTCTTGTATGGTTTAAAGCAAAGTCCACGAGCGTAGTTTGGAAAATTTAGTCAGGTGATTGAGAACTTTGGAATGAAGAAAAGTAAGTCAGATCATTCTGTCTTTTATAAAGGATCTGAGACTAGTGTCATCTTACTAGTTGTATATGTTGATGATATTGTTATTACTGGTAATGATACATTATGTATTCTATCTCTTAAGACTTTTATTCATAGTCAATTCCATACAAAAGATTTGGGAATATTGAAATACTTCTTGGGAATTGAGATAATACGAAATAAGAAGGGAATTCTTTTATCACAAAGAAAATATGTACTTGATTTGTTAACCGAGACAGGGAAGTTAAGTGCTAAGCCACGTAGTACCCCGATGATGCCTAATTTACAGCTCACAAAAGAGGGAGAATTGCTAGAAGATCCTGAGAGGTATAGGAGGTTAGTAGGAAAGCTAAATTATCTTACAGTGACTAGGCCAGACATAACTTATACACTGAGTATTGTGAGTCAATATATGTCTTCTCCTACTGTTGACCATTGGGCCTCATTAGAACAGATTCTATGTTATTTGAAAGCTGCTTCTGGGCGTGGTTTATTATATAGGATTATGGTCATACTAATATTGAATGTTTCTCAAATACTGATTGGGTTGGATCTAAGAAAGACAGAAGATCAACTTCAGGATATTGTGTATTTGTCAGAGGTAATTAGTTTTTTAGAAGAGTAAGAAACAAAATGTGATGTCATGTTGTAGTGCTGAATCAAAATATAGAGCGATGGCACAGTCTGTGTGTGAATTATTGTGGATATATCAACTTCTGACTAAATTGGGATTTGATATCACAACTCCAACCAAACTCTGGTGTGATAATCAAGCAGCTCTCCATATTGCATCTAATCCAGTATTTCATGAACGAACCAAACACATTGAGGTTGATTGTGATTTTGGACGTGAGAAAATACAGCAAGGTTTGGTGTCCACAGGATATGTAAAGATTGGAGAGCAATTAGGAGATATCTTCACTATAGCATTAAATTGAGCATGTATAGATTATCTCTCTAACAAGCTGGGCATGATTAACATATATGCTCCAACTTGAGGGGGAGTGTTATATTGCTTATATTATTATATTTTGTATTATTGTAAATATCTTAGTCTTAGACATTTGTCTTTATTGTCTTTTACATTCTTTGCTTTAGGTTAACCTTGTATACTTGTCTATATATATCTTTTAGTGAATGAATAGAATTATTTATTCTCTCAACCTTTAGTCCCTTCAACTTTGAGAATATAAGATTGCAATTTTGATTGTCTATGTAGATGATATTATTCTCACAAGTAATGATGAAGCAGGTCTATTTGATCTCAAGAAAAGCCTTGCAAATGAGTTTCAAATCAAGGACTTGAGAATGTTGAAATATTTCCTAGGAATGGAATTCGCAAGATCGAAGAAAGGTATTTTTGTTAATCAAAGAAAATATATCCTTGACTTACTCGAAGAAACAAGTTTACTTGGCTACAGGATAGCAGAAACTCCTATTGAGCCTAATATGAAGTTACAAGCAGCAAAAGCAGTAGAGGTAAAGGACAAGGAACAATACTAGAGACTTGTGGGAAGGTTAATTTATTTGTCTTATACACGCCCTGATATTGCATTTGCTGTAAGTGTAGTAAGTCAATTCATGCATGCACCTGGACCAGCTCATTTCGAAGCTACCTATAGAATCTTTAGATATTTAAAAGGAACTCTAGGGAAAGGTATTTTATTCAAAAAGCACAATCACTTCCAAGTGGAAGTTTATACTGATGCAGATTGGGTAGGAAGTGCGACTGATAAAAAATTTACTTCTGCTACTGTTCCTTTGTTGGAGGAAACCTAGTCACGTGGTGAAGCAAAAAACAAAATGTGGTCGCCAGGAGTAGTGCTGAAGCAGAATTCAGAGCTTTAGCTCATGAATTTGTGAAGGTATATGGATCAAAAGGATACTTGAAGAATTGAAGTTTTCTCATACAACACCTATGCGGATTTATTGCGACAATAAGGCTGCCATCTCTATAGCTCATAATCTAGTTCTACATGATAGAACAAAACATATTGAAGTTGACAAGTATTTTATTAAGGAGAAGATTGATGCAAGGAATAATGTATTCCTTATCTTCCGACTACTACAGAGCAAACTGTTGATGTCCTGACTAAGGGACTTCCAAAGAAGCACTTCAATAAACTAATTGACAAGTTGGCTATGGAAGACATCTTTAAACCAGCTTAAGGGGGAGTGTTGGATTTCCTAATTATTGTGAATATTTGTGTGTAAATATTAGTTGTCAATATTGCCTTTATTATTTGATTTCCTTGTTTGATTTTGTTTCTTTATTTGTAAGGGGTTTTCTCCTGTATAAGAAGACTTAGTTGCACATATTAAAAAGATAGAATATTTTTATTCTCTCAAATTCACATGAGATCCAGTTAGAGAGAGGAGAATTTTTCAAGGAGTGGCTAGAAATTTGTTTGAGTCTTGAGATATCGTTTGTTTTATTGCTTCCTCTTCAAGTACTTAAGATTATCCTTTTATAATTACTCCTCTTTGATAACCTGTCAACACAATTGGATTTTTTTTTTTTTAATTTTTCTCGGCTTTAGTGCTACAGGACTCATCTTCTCTTTCACCTTTGTATATCGTTCTTTTCTAATCAATTTCTTTACCCAAAAATAGTTTATATATTCAAATTTTTGTGTATATTCATTTTAAATAGCTAACTTGAATGCCATCATGATAAATATCCACCTTCCATTTCAATGTTTATGCAGCTGAATCTGAAAGATTTTCGCTATTTGATACTCTATTCATCACCATTAATTTAGTTTTTATTTCTCTGAACGAATAATTTGAATCATTGTACTTTTTAGCTAGTCCTTTTCTGGTTAGTGTGTTTCCTAATTTCCTCTTCCCCTTCCCTGGAGAATTACTTTTGCTACTAATCCTAGATTTTTGTGGAAATAAAGACTCTTATTTTGGTGTTCCCCTTTAGCAATTAACTTGAAAAAGATAGCATCAATGCTATAGCTAGTTGCTTGCTTGATTTTAATGGCATTTTTTTATTATTGTTAGCCAGAGTTTTTAGTTTTCTTTCTTCTCACATGATGTATGGTCATTTTCATAATTGATCTCTATTTTGTTTTGTTAGTATAAGGAATTTTATGATGAAAATTTTTAACATTTCATCAAGTAGATGTTTTCTTTTTCATCATGTAGTTCTTGGTTATGAGCCACACTCAGTTTTACCTATTGGTGTTGTTGCGTTGGCCGACCTTACTGGTTTCATGCCTCTCCAAAAATTAAAAGTCCTTGCTAGTTCCGCTGTAAGACTTCTGCCTCTTTATTATAGGCATTTACTAGTATCCTCTGTAACTCCTTATCGTTTCATTATGTGGGGGTTATTGACCAGGTGTTTTATATACCGTTTTTGAGGCACATATGGACATGGATGGGTCTAACGCCAGCAACAAAGAAAAATTTTATCTCCTTTTGGCATCTGGCTATAGTTGTATCATTGTGCCTGGTGGAGTACAAGAGACATTTCATATGGAGCATAATTCGGAGGTCTTTCTCCATCTCTTCTATACCTATAAATTTTAGTATTGATCCTGCCATCTTCATGTAGTTTATTTTCACTTTCCCCCGTTACTAGAATGTTAGTGCTTGTGCACTTGCATACCGGAGGCAAGATAGATAAAATTTTAGTTCTTCCAGTTCGTAGATTTTGAACTTTAACCAGAACAGTATTGGATTTTCTCTCTTTTACTTCTCCGTCCTATTTACTCGTATTAGAAGCGTAGATATAACTTGTGGAGTTACTACTTGAGGCATTTGTAGTTACTTTTCCTTTGTGATTAATGCCAGTTGAGTTTCTTATCTGTTTGCTTTAGTGTGCGCAATTGATGCTGTCTAGTGTCTTGTCTTTGGATTAGGCTGCTTATTGCGGCCTCAGCAAGATAACGTTCTTAGAGGTTCATCAATCACTCCAGACATTTTGACATTTGAACATAGCAGGCGATGAAGACATCGAAATTTTGAGATGATATCTTATCTTTCTTTGTGACTTTTCAATGAAATGTTTCCTATCTAAAAAGCAAGGGTAGAGGACATGTAATTGGTGGAACTTCTTGAGTTCTAAGACCTAAAATTTGCAATATTATCAACAGCTCTTATGATGATTTGCTTCTCTAAGTTCAGTTGTCCTTCAGAACTGAGAGTTCGGTTTGCAATCTTATTTCTCATTCCTCAAGAAGCCTTGCAGGATTGACGACCCCCAAATTTTCACTTTGTAGACTGTCTTCCTGAAGGCCAGACGAGGATTTGTGCGCATAGCGATGGAGACAGGCACACCCCTGGTCCCTGTTTTCTGCTTTGGTCAGGTATTTTTCTTATATCCTCATAGTGTTATCATGTATTTCTTTTCTTATTTAGTGTGCTACAGTCTCATTCTGATTTTAATTTGTATTTTAGAAATTGATTTTAACCAGTTTTCTCCAAATTTATGTAGTCAAGCGTCTATAAGTGGTGGAAGCCTGGTGGGAAGTTCTTCCTCCAATTTTCTAGAGCGATAAAGTTCACACCAATTGTCTTCTGGGGAGTTTTTGGGTATGTGTATGTTTTCTCATGAGTCTTCTCAAATGTCATGCCTCTTCGCTTTCCTCACTATGCACGTTTTTGTCTACACAAGCGGTCATCTTACTAAAAAGAGAAAAAAAAAAAGGTACATCTTACTAAATGGCTGGATTATAGTACTGTTTTAAATTGTGTCATGATTATTGTTCATGGGCTATTCTAAGCATTAACATTTGATGAGAATGTCAGGTTTTTCACCAAAGACTTTCAATGAACGGTTTGCTTGTCCATTTTTGGTGAACTATGTGTTCCCTCTCATGTAATTCATGCCACAAGGCGCATCATGGCATAGTTGATTTTACAGCTTTGTTGATTTCTTGGAAGAGAATTACATTTAGTCATGTAATATATTGATTGTGCACCATAAAAGTTTTAGCTTTGAACGCACAGTTCAGTGAAGGCTTCTCTTGGATATGTGGAAGTTCACGATTAATTATGCCCATCTCACTTCATAAGCACTCCCCTGCAATCAGGGTCACATATTGAATGACCTTTGTTACCGTGTCATTTCTCGACTGTGTATGTTGATCGATTGTTGGCACTGCACTGTCAGTTTTGTATTCATTTTTTCATTTTTTTGGAACTTGAATGAGGAAAATTTAACTTTGCACATCTAATAAAGAGAATTGAACTTCTATGTTGGGTAAATATGTTTTTCAATAGCATGTTACGGTATTAAGATCGACAGCATTGGTGTTTGATCTCTGGTTCCATTCATAGGATAATGTCTTTGCACCGTGTCAATGTCTTGAGTATTGTTCTGGTTTCAGATCTCCCCTTCCTTTTAGGCGATGGATGCATGTGGTGGCAGGGAGTCGACTGTGTATGTTGACTGATTGTTGGCACTGACTGTCAGTTTTGTATTCTTTTTTTCTTTTTTTTGGAGCTTGAATGAGGAAAATTTAAATTTGCACATCTAATAAAGAGAATTGAACTTCGATTGTTGGGTAAATATGTTTTTCAATAGCATGTTACAGTATTGAGATCGATTGCATTGGTGTTTGATCTCTGGTTCCATTCATAGGAAATCTCTTTGCATGGTCGCAACGTCATTGAGTATTGTTCTGGTTTCAGATCTCCCCTTCCTTTTAGGCGACGGATGCATGTGGTGGTAGGGAGACCCATCGAGGTCAAGAAAAATCCAAATCCAACAAGTGACGAGGTATGTATCTCAAAAGCCCGTTTCATTCCATTAGCATATGCCTGCTGCTTCCTTGCTATTCTAAAAGCATCACGCTAGAGTGGGTTGCAATTCTTAACTGAATGAGCATGCATAAGCGTAATATTGACAACAAAGAATGTCTCTTTTTCTTCCCCATTTTTCGATTATAAAACTCCAGCCTCCGAACGACCGGGTTTGGTGGGTGTGGAGGCGGCGGCAGCAGCAGCGTGGAGCCATAAACTGGAATGGAGGGGGACAAAACTAGCTGCTTGATTATGTTTATAATTATCATGGTTTGTTACTTCTCAGGTGCTTGATGTACACGGTCAGTTTGTTGAAGCACTCAAAGATATATTTGAAAGGTACAAAGCACAGCTTGGCTATGATAATCTGCAGCTAAAAGTTCTCTGA

mRNA sequence

ATGGATAAAATCTCAGGACCTTCGGTCACAGTTTTCAAGGGAGGTGCAGGGTCGGCGACCAAATGGCGGATGATGCACATAATCGTAGCCTTGGGTGTTTGGCTCGGCGGCATTCATCTCAATTTTACTCTAGCTCTCATCTCCGTCTTCTACCTCTCCCTGCCCAAAGCCCTCTTGGTCTTCGGGTTATTTTTGGTATTAGTGTTAATTCCTGTCGACGATAAAAGCAAATACGGTCGCTTATTGGCCAGGTATATATGTAAACATGCTAGCAGCTATTTTCCTGTTACCCTACATGTTGAAGATATACATGCCTTTGATCCGAATCGTGCTTATGGTGTTTTATATACCGTTTTTGAGGCACATATGGACATGGATGGGTCTAACGCCAGCAACAAAGAAAAATTTTATCTCCTTTTGGCATCTGGCTATAGTTGTATCATTGTGCCTGGTGGAGTACAAGAGACATTTCATATGGAGCATAATTCGGAGACTGTCTTCCTGAAGGCCAGACGAGGATTTGTGCGCATAGCGATGGAGACAGGCACACCCCTGGTCCCTGTTTTCTGCTTTGGTCAGTCAAGCGTCTATAAGTGGTGGAAGCCTGGTGGGAAGTTCTTCCTCCAATTTTCTAGAGCGATAAAGTTCACACCAATTGTCTTCTGGGGAGTTTTTGGATCTCCCCTTCCTTTTAGGCGACGGATGCATGTGGTGGTAGGGAGACCCATCGAGGTCAAGAAAAATCCAAATCCAACAAGTGACGAGGTGCTTGATGTACACGGTCAGTTTGTTGAAGCACTCAAAGATATATTTGAAAGGTACAAAGCACAGCTTGGCTATGATAATCTGCAGCTAAAAGTTCTCTGA

Coding sequence (CDS)

ATGGATAAAATCTCAGGACCTTCGGTCACAGTTTTCAAGGGAGGTGCAGGGTCGGCGACCAAATGGCGGATGATGCACATAATCGTAGCCTTGGGTGTTTGGCTCGGCGGCATTCATCTCAATTTTACTCTAGCTCTCATCTCCGTCTTCTACCTCTCCCTGCCCAAAGCCCTCTTGGTCTTCGGGTTATTTTTGGTATTAGTGTTAATTCCTGTCGACGATAAAAGCAAATACGGTCGCTTATTGGCCAGGTATATATGTAAACATGCTAGCAGCTATTTTCCTGTTACCCTACATGTTGAAGATATACATGCCTTTGATCCGAATCGTGCTTATGGTGTTTTATATACCGTTTTTGAGGCACATATGGACATGGATGGGTCTAACGCCAGCAACAAAGAAAAATTTTATCTCCTTTTGGCATCTGGCTATAGTTGTATCATTGTGCCTGGTGGAGTACAAGAGACATTTCATATGGAGCATAATTCGGAGACTGTCTTCCTGAAGGCCAGACGAGGATTTGTGCGCATAGCGATGGAGACAGGCACACCCCTGGTCCCTGTTTTCTGCTTTGGTCAGTCAAGCGTCTATAAGTGGTGGAAGCCTGGTGGGAAGTTCTTCCTCCAATTTTCTAGAGCGATAAAGTTCACACCAATTGTCTTCTGGGGAGTTTTTGGATCTCCCCTTCCTTTTAGGCGACGGATGCATGTGGTGGTAGGGAGACCCATCGAGGTCAAGAAAAATCCAAATCCAACAAGTGACGAGGTGCTTGATGTACACGGTCAGTTTGTTGAAGCACTCAAAGATATATTTGAAAGGTACAAAGCACAGCTTGGCTATGATAATCTGCAGCTAAAAGTTCTCTGA

Protein sequence

MDKISGPSVTVFKGGAGSATKWRMMHIIVALGVWLGGIHLNFTLALISVFYLSLPKALLVFGLFLVLVLIPVDDKSKYGRLLARYICKHASSYFPVTLHVEDIHAFDPNRAYGVLYTVFEAHMDMDGSNASNKEKFYLLLASGYSCIIVPGGVQETFHMEHNSETVFLKARRGFVRIAMETGTPLVPVFCFGQSSVYKWWKPGGKFFLQFSRAIKFTPIVFWGVFGSPLPFRRRMHVVVGRPIEVKKNPNPTSDEVLDVHGQFVEALKDIFERYKAQLGYDNLQLKVL
Homology
BLAST of Sgr029759 vs. NCBI nr
Match: KAA0052901.1 (diacylglycerol O-acyltransferase 2 isoform X1 [Cucumis melo var. makuwa])

HSP 1 Score: 480.7 bits (1236), Expect = 8.7e-132
Identity = 242/324 (74.69%), Postives = 260/324 (80.25%), Query Frame = 0

Query: 1   MDKISGPSVTVFKGGAGSATKWRMMHIIVALGVWLGGIHLNFTLALISVFYLSLPKALLV 60
           MDKIS PSVTVFKGG GSATKWRMMHIIVALG+WLGGIHLNF L LIS+FYLSL KALLV
Sbjct: 1   MDKISEPSVTVFKGGGGSATKWRMMHIIVALGIWLGGIHLNFALGLISLFYLSLSKALLV 60

Query: 61  FGLFLVLVLIPVDDKSKYGRLLARYICKHASSYFPVTLHVEDIHAFDPNRAY-------- 120
           F L L+LVLIPVDDKSKYGR+LARYIC++A SYFPVTLHVEDIHAFD NRAY        
Sbjct: 61  FALLLILVLIPVDDKSKYGRVLARYICQNACSYFPVTLHVEDIHAFDTNRAYVFGYEPHS 120

Query: 121 ---------------------------GVLYTVFEAHM-DMDGSNASNKEKFYLLLASGY 180
                                       V YT F  H+    G   + ++ F  LLA+GY
Sbjct: 121 VLPIGVVALADLTGFMPLKKLKVLASSAVFYTPFLRHIWTWMGLTPATRKNFISLLAAGY 180

Query: 181 SCIIVPGGVQETFHMEHNSETVFLKARRGFVRIAMETGTPLVPVFCFGQSSVYKWWKPGG 240
           SCIIVPGGVQETFHMEHNSETVFLK RRGFVRIAME GTPLVPVFCFGQSSVY+WWKPGG
Sbjct: 181 SCIIVPGGVQETFHMEHNSETVFLKTRRGFVRIAMEMGTPLVPVFCFGQSSVYQWWKPGG 240

Query: 241 KFFLQFSRAIKFTPIVFWGVFGSPLPFRRRMHVVVGRPIEVKKNPNPTSDEVLDVHGQFV 289
           KFFLQFSRAIKFTPIVFWGVFGSPLP+RR+MHVVVGRPIEVKKNPNPTSDEVLD+HG+FV
Sbjct: 241 KFFLQFSRAIKFTPIVFWGVFGSPLPYRRQMHVVVGRPIEVKKNPNPTSDEVLDLHGRFV 300

BLAST of Sgr029759 vs. NCBI nr
Match: XP_008448438.1 (PREDICTED: diacylglycerol O-acyltransferase 2 isoform X1 [Cucumis melo])

HSP 1 Score: 480.7 bits (1236), Expect = 8.7e-132
Identity = 242/324 (74.69%), Postives = 260/324 (80.25%), Query Frame = 0

Query: 1   MDKISGPSVTVFKGGAGSATKWRMMHIIVALGVWLGGIHLNFTLALISVFYLSLPKALLV 60
           MDKIS PSVTVFKGG GSATKWRMMHIIVALG+WLGGIHLNF L LIS+FYLSL KALLV
Sbjct: 12  MDKISEPSVTVFKGGGGSATKWRMMHIIVALGIWLGGIHLNFALGLISLFYLSLSKALLV 71

Query: 61  FGLFLVLVLIPVDDKSKYGRLLARYICKHASSYFPVTLHVEDIHAFDPNRAY-------- 120
           F L L+LVLIPVDDKSKYGR+LARYIC++A SYFPVTLHVEDIHAFD NRAY        
Sbjct: 72  FALLLILVLIPVDDKSKYGRVLARYICQNACSYFPVTLHVEDIHAFDTNRAYVFGYEPHS 131

Query: 121 ---------------------------GVLYTVFEAHM-DMDGSNASNKEKFYLLLASGY 180
                                       V YT F  H+    G   + ++ F  LLA+GY
Sbjct: 132 VLPIGVVALADLTGFMPLKKLKVLASSAVFYTPFLRHIWTWMGLTPATRKNFISLLAAGY 191

Query: 181 SCIIVPGGVQETFHMEHNSETVFLKARRGFVRIAMETGTPLVPVFCFGQSSVYKWWKPGG 240
           SCIIVPGGVQETFHMEHNSETVFLK RRGFVRIAME GTPLVPVFCFGQSSVY+WWKPGG
Sbjct: 192 SCIIVPGGVQETFHMEHNSETVFLKTRRGFVRIAMEMGTPLVPVFCFGQSSVYQWWKPGG 251

Query: 241 KFFLQFSRAIKFTPIVFWGVFGSPLPFRRRMHVVVGRPIEVKKNPNPTSDEVLDVHGQFV 289
           KFFLQFSRAIKFTPIVFWGVFGSPLP+RR+MHVVVGRPIEVKKNPNPTSDEVLD+HG+FV
Sbjct: 252 KFFLQFSRAIKFTPIVFWGVFGSPLPYRRQMHVVVGRPIEVKKNPNPTSDEVLDLHGRFV 311

BLAST of Sgr029759 vs. NCBI nr
Match: XP_038903787.1 (diacylglycerol O-acyltransferase 2D isoform X2 [Benincasa hispida])

HSP 1 Score: 478.0 bits (1229), Expect = 5.7e-131
Identity = 239/324 (73.77%), Postives = 259/324 (79.94%), Query Frame = 0

Query: 1   MDKISGPSVTVFKGGAGSATKWRMMHIIVALGVWLGGIHLNFTLALISVFYLSLPKALLV 60
           +DK S PSVTVFKGG GSATKWRMMHIIVA+ +WLGGIHLNF L LIS+FYLSL KALLV
Sbjct: 8   LDKTSRPSVTVFKGGGGSATKWRMMHIIVAIAIWLGGIHLNFVLCLISLFYLSLSKALLV 67

Query: 61  FGLFLVLVLIPVDDKSKYGRLLARYICKHASSYFPVTLHVEDIHAFDPNRAY-------- 120
           F L L+LVLIPVDDKSKYGR+LARYIC++A SYFPVTLHVEDI+AFDPNRAY        
Sbjct: 68  FALLLILVLIPVDDKSKYGRVLARYICQNACSYFPVTLHVEDIYAFDPNRAYVFGYEPHS 127

Query: 121 ---------------------------GVLYTVFEAHM-DMDGSNASNKEKFYLLLASGY 180
                                       V YT F  H+    G   + K+ F  LLA+GY
Sbjct: 128 VLPIGVVALADLTGFMPLKKLKVLASSAVFYTPFLRHIWTWMGLTPATKKNFISLLAAGY 187

Query: 181 SCIIVPGGVQETFHMEHNSETVFLKARRGFVRIAMETGTPLVPVFCFGQSSVYKWWKPGG 240
           SCIIVPGGVQETFHMEHNSETVFLK RRGFVRIAME GTPLVP+FCFGQSSVYKWWKPGG
Sbjct: 188 SCIIVPGGVQETFHMEHNSETVFLKTRRGFVRIAMEMGTPLVPIFCFGQSSVYKWWKPGG 247

Query: 241 KFFLQFSRAIKFTPIVFWGVFGSPLPFRRRMHVVVGRPIEVKKNPNPTSDEVLDVHGQFV 289
           KFFLQFSRAIKFTPIVFWG+FGSPLP+RR MHVVVGRPIEVKKNPNPTSDEVLD+HG+FV
Sbjct: 248 KFFLQFSRAIKFTPIVFWGIFGSPLPYRRPMHVVVGRPIEVKKNPNPTSDEVLDLHGRFV 307

BLAST of Sgr029759 vs. NCBI nr
Match: XP_004146185.3 (LOW QUALITY PROTEIN: diacylglycerol O-acyltransferase 2D [Cucumis sativus])

HSP 1 Score: 473.0 bits (1216), Expect = 1.8e-129
Identity = 238/324 (73.46%), Postives = 258/324 (79.63%), Query Frame = 0

Query: 1   MDKISGPSVTVFKGGAGSATKWRMMHIIVALGVWLGGIHLNFTLALISVFYLSLPKALLV 60
           +DKIS PSVTVFKGG GSATKWRMMHIIVALG+WLGGIHLNF L LIS+FYLSL KALLV
Sbjct: 14  IDKISEPSVTVFKGGVGSATKWRMMHIIVALGIWLGGIHLNFALGLISLFYLSLSKALLV 73

Query: 61  FGLFLVLVLIPVDDKSKYGRLLARYICKHASSYFPVTLHVEDIHAFDPNRAY-------- 120
           F L L+LVLIPVD KSKYGR+LARYIC++A SYFPVTLHVEDIHAFD NRAY        
Sbjct: 74  FALLLILVLIPVDHKSKYGRVLARYICQNACSYFPVTLHVEDIHAFDTNRAYVFGYEPHS 133

Query: 121 ---------------------------GVLYTVFEAHM-DMDGSNASNKEKFYLLLASGY 180
                                       V YT F  H+    G   + ++ F  LLA+GY
Sbjct: 134 VLPIGVVALADLTGFMPLKKLKVLASSAVFYTPFLRHIWTWMGLTPATRKNFISLLAAGY 193

Query: 181 SCIIVPGGVQETFHMEHNSETVFLKARRGFVRIAMETGTPLVPVFCFGQSSVYKWWKPGG 240
           SCIIVPGGVQETFHMEHNSETVFLK RRGFVRIAME GTPLVPVFCFGQSSVY+WWKPGG
Sbjct: 194 SCIIVPGGVQETFHMEHNSETVFLKTRRGFVRIAMEMGTPLVPVFCFGQSSVYQWWKPGG 253

Query: 241 KFFLQFSRAIKFTPIVFWGVFGSPLPFRRRMHVVVGRPIEVKKNPNPTSDEVLDVHGQFV 289
           KFFLQFSRAIKFTPIVFWGVFGSPLP+RR+MHVVVGRPIEVKKNPNPTSDEVLD+HG+FV
Sbjct: 254 KFFLQFSRAIKFTPIVFWGVFGSPLPYRRQMHVVVGRPIEVKKNPNPTSDEVLDLHGRFV 313

BLAST of Sgr029759 vs. NCBI nr
Match: XP_031738801.1 (diacylglycerol O-acyltransferase 2D-like isoform X2 [Cucumis sativus])

HSP 1 Score: 473.0 bits (1216), Expect = 1.8e-129
Identity = 238/324 (73.46%), Postives = 258/324 (79.63%), Query Frame = 0

Query: 1   MDKISGPSVTVFKGGAGSATKWRMMHIIVALGVWLGGIHLNFTLALISVFYLSLPKALLV 60
           +DKIS PSVTVFKGG GSATKWRMMHIIVALG+WLGGIHLNF L LIS+FYLSL KALLV
Sbjct: 14  IDKISEPSVTVFKGGGGSATKWRMMHIIVALGIWLGGIHLNFALGLISLFYLSLSKALLV 73

Query: 61  FGLFLVLVLIPVDDKSKYGRLLARYICKHASSYFPVTLHVEDIHAFDPNRAY-------- 120
           F L L+LVLIPVD KSKYGR+LARYIC++A SYFPVTLHVEDIHAFD NRAY        
Sbjct: 74  FALLLILVLIPVDHKSKYGRVLARYICQNACSYFPVTLHVEDIHAFDTNRAYVFGYEPHS 133

Query: 121 ---------------------------GVLYTVFEAHM-DMDGSNASNKEKFYLLLASGY 180
                                       V YT F  H+    G   + ++ F  LLA+GY
Sbjct: 134 VLPIGVVALADLTGFMPLKKLKVLASSAVFYTPFLRHIWTWMGLTPATRKNFISLLAAGY 193

Query: 181 SCIIVPGGVQETFHMEHNSETVFLKARRGFVRIAMETGTPLVPVFCFGQSSVYKWWKPGG 240
           SCIIVPGGVQETFHMEHNSETVFLK RRGFVRIAME GTPLVPVFCFGQSSVY+WWKPGG
Sbjct: 194 SCIIVPGGVQETFHMEHNSETVFLKTRRGFVRIAMEMGTPLVPVFCFGQSSVYQWWKPGG 253

Query: 241 KFFLQFSRAIKFTPIVFWGVFGSPLPFRRRMHVVVGRPIEVKKNPNPTSDEVLDVHGQFV 289
           KFFLQFSRAIKFTPIVFWGVFGSPLP+RR+MHVVVGRPIEVKKNPNPTSDEVLD+HG+FV
Sbjct: 254 KFFLQFSRAIKFTPIVFWGVFGSPLPYRRQMHVVVGRPIEVKKNPNPTSDEVLDLHGRFV 313

BLAST of Sgr029759 vs. ExPASy Swiss-Prot
Match: K7K424 (Diacylglycerol O-acyltransferase 2D OS=Glycine max OX=3847 GN=DGAT2D PE=1 SV=1)

HSP 1 Score: 346.7 bits (888), Expect = 2.6e-94
Identity = 167/297 (56.23%), Postives = 211/297 (71.04%), Query Frame = 0

Query: 28  IVALGVWLGGIHLNFTLALISVFYLSLPKALLVFGLFLVLVLIPVDDKSKYGRLLARYIC 87
           I+A+ +WLG IH N  L L++VF+L L K+LLVFG     +++P+++KS++GR L+R+IC
Sbjct: 33  ILAMVLWLGAIHFNIALILLAVFFLPLSKSLLVFGFLFGFMVLPINEKSRFGRRLSRFIC 92

Query: 88  KHASSYFPVTLHVEDIHAFDPNRAY----------------------------------- 147
           KHA +YFP+TLHVED+ AFDPNRAY                                   
Sbjct: 93  KHACNYFPITLHVEDMKAFDPNRAYVFGYEPHSVLPIGIVALADHTGFMPLPKVKVLASS 152

Query: 148 GVLYTVFEAHM-DMDGSNASNKEKFYLLLASGYSCIIVPGGVQETFHMEHNSETVFLKAR 207
            V YT F  H+    G   + K+ F  LLASG+SCI++PGGVQE FHM+H +E  FLKAR
Sbjct: 153 TVFYTPFLRHLWTWLGLTPATKKNFISLLASGHSCILIPGGVQEAFHMQHGTEIAFLKAR 212

Query: 208 RGFVRIAMETGTPLVPVFCFGQSSVYKWWKPGGKFFLQFSRAIKFTPIVFWGVFGSPLPF 267
           RGFVR+AM  G PLVPVFCFGQS+VYKWWKPGGK FL+F+RAIKFTPI FWG+FGSPLPF
Sbjct: 213 RGFVRVAMVKGKPLVPVFCFGQSNVYKWWKPGGKLFLKFARAIKFTPICFWGIFGSPLPF 272

Query: 268 RRRMHVVVGRPIEVKKNPNPTSDEVLDVHGQFVEALKDIFERYKAQLGYDNLQLKVL 289
           R  MHVVVGRPIEV KN  PT++EV  +HG FVEAL+D+FER+KA+ GY NL+L+++
Sbjct: 273 RHPMHVVVGRPIEVDKNREPTTEEVAKIHGLFVEALQDLFERHKARAGYPNLELRIV 329

BLAST of Sgr029759 vs. ExPASy Swiss-Prot
Match: A1A442 (Diacylglycerol O-acyltransferase 2 OS=Ricinus communis OX=3988 GN=DGAT2 PE=1 SV=1)

HSP 1 Score: 330.5 bits (846), Expect = 1.9e-89
Identity = 165/301 (54.82%), Postives = 209/301 (69.44%), Query Frame = 0

Query: 24  MMHIIVALGVWLGGIHLNFTLALISVFYLSLPKALLVFGLFLVLVLIPVDDKSKYGRLLA 83
           + H ++AL +W+G IH N  L  IS  +LS P  LL+ G F+VL+ IP+D+ SK GR L 
Sbjct: 40  IFHALLALSIWIGSIHFNLFLLFISYLFLSFPTFLLIVGFFVVLMFIPIDEHSKLGRRLC 99

Query: 84  RYICKHASSYFPVTLHVEDIHAFDPNRAYGVLY----------TVFEAH--------MDM 143
           RY+C+HA S+FPVTLHVED++AF  +RAY   Y          +V   H        M +
Sbjct: 100 RYVCRHACSHFPVTLHVEDMNAFHSDRAYVFGYEPHSVFPLGVSVLSDHFAVLPLPKMKV 159

Query: 144 DGSNA------------------SNKEKFYLLLASGYSCIIVPGGVQETFHMEHNSETVF 203
             SNA                  + K+ F  LLASGYSCI++PGGVQETF+M+H SE  F
Sbjct: 160 LASNAVFRTPVLRHIWTWCGLTSATKKNFTALLASGYSCIVIPGGVQETFYMKHGSEIAF 219

Query: 204 LKARRGFVRIAMETGTPLVPVFCFGQSSVYKWWKPGGKFFLQFSRAIKFTPIVFWGVFGS 263
           LKARRGFVR+AME G PLVPVFCFGQS+VYKWWKP G+ F++ +RAIKF+PIVFWGV GS
Sbjct: 220 LKARRGFVRVAMEMGKPLVPVFCFGQSNVYKWWKPDGELFMKIARAIKFSPIVFWGVLGS 279

Query: 264 PLPFRRRMHVVVGRPIEVKKNPNPTSDEVLDVHGQFVEALKDIFERYKAQLGYDNLQLKV 289
            LP +R MHVVVG+PIEVK+NP PT +EV +V GQFV ALKD+FER+KA++GY +L L++
Sbjct: 280 HLPLQRPMHVVVGKPIEVKQNPQPTVEEVSEVQGQFVAALKDLFERHKARVGYADLTLEI 339

BLAST of Sgr029759 vs. ExPASy Swiss-Prot
Match: Q9ASU1 (Diacylglycerol O-acyltransferase 2 OS=Arabidopsis thaliana OX=3702 GN=DGAT2 PE=1 SV=1)

HSP 1 Score: 323.9 bits (829), Expect = 1.8e-87
Identity = 160/299 (53.51%), Postives = 203/299 (67.89%), Query Frame = 0

Query: 26  HIIVALGVWLGGIHLNFTLALISVFYLSLPKALLVFGLFLVLVLIPVDDKSKYGRLLARY 85
           H I+A+ +WLG IH N  L L S+ +L    +L+V GL  + + IP+D +SKYGR LARY
Sbjct: 17  HSIIAMAIWLGAIHFNVALVLCSLIFLPPSLSLMVLGLLSLFIFIPIDHRSKYGRKLARY 76

Query: 86  ICKHASSYFPVTLHVEDIHAFDPNRAY--------------------------------- 145
           ICKHA +YFPV+L+VED  AF PNRAY                                 
Sbjct: 77  ICKHACNYFPVSLYVEDYEAFQPNRAYVFGYEPHSVLPIGVVALCDLTGFMPIPNIKVLA 136

Query: 146 --GVLYTVFEAHM-DMDGSNASNKEKFYLLLASGYSCIIVPGGVQETFHMEHNSETVFLK 205
              + YT F  H+    G  A++++ F  LL SGYSC++VPGGVQETFHM+H++E VFL 
Sbjct: 137 SSAIFYTPFLRHIWTWLGLTAASRKNFTSLLDSGYSCVLVPGGVQETFHMQHDAENVFLS 196

Query: 206 ARRGFVRIAMETGTPLVPVFCFGQSSVYKWWKPGGKFFLQFSRAIKFTPIVFWGVFGSPL 265
            RRGFVRIAME G+PLVPVFCFGQ+ VYKWWKP    +L+ SRAI+FTPI FWGVFGSPL
Sbjct: 197 RRRGFVRIAMEQGSPLVPVFCFGQARVYKWWKPDCDLYLKLSRAIRFTPICFWGVFGSPL 256

Query: 266 PFRRRMHVVVGRPIEVKKNPNPTSDEVLDVHGQFVEALKDIFERYKAQLGYDNLQLKVL 289
           P R+ MHVVVG+PIEV K   PT +E+   HGQ+VEAL+D+FER+K+++GYD L+LK+L
Sbjct: 257 PCRQPMHVVVGKPIEVTKTLKPTDEEIAKFHGQYVEALRDLFERHKSRVGYD-LELKIL 314

BLAST of Sgr029759 vs. ExPASy Swiss-Prot
Match: Q70VZ7 (2-acylglycerol O-acyltransferase 1 OS=Bos taurus OX=9913 GN=MOGAT1 PE=2 SV=1)

HSP 1 Score: 94.4 bits (233), Expect = 2.3e-18
Identity = 50/148 (33.78%), Postives = 85/148 (57.43%), Query Frame = 0

Query: 143 GYSCIIVPGGVQETFHMEHNSETVFLKARRGFVRIAMETGTPLVPVFCFGQSSVYKW-WK 202
           G   +IV GG +E+        T+F++ R+GFV+IA+  G  LVPVF FG++ ++K    
Sbjct: 179 GNISVIVLGGAEESLDAHPGKFTLFIRQRKGFVKIALTHGAYLVPVFSFGENELFKQVSN 238

Query: 203 PGGKF----------FLQFSRAIKFTPIVFWGVFGSPLPFRRRMHVVVGRPIEVKKNPNP 262
           P G +           + F+  +     +F   FG  +P+R+ +H VVGRPI V++  NP
Sbjct: 239 PEGSWLRNVQEKLQKIMGFALPLFHARGIFQYNFGL-IPYRKPIHTVVGRPIPVRQTLNP 298

Query: 263 TSDEVLDVHGQFVEALKDIFERYKAQLG 280
           TS+++ ++H  ++E L+ +FE +K + G
Sbjct: 299 TSEQIEELHQTYMEELRKLFEEHKGKYG 325

BLAST of Sgr029759 vs. ExPASy Swiss-Prot
Match: Q91ZV4 (2-acylglycerol O-acyltransferase 1 OS=Mus musculus OX=10090 GN=Mogat1 PE=1 SV=2)

HSP 1 Score: 94.4 bits (233), Expect = 2.3e-18
Identity = 54/177 (30.51%), Postives = 96/177 (54.24%), Query Frame = 0

Query: 116 YTVFEAHMDMDGSNASNKEKFYLLLA---SGYSCIIVPGGVQETFHMEHNSETVFLKARR 175
           + +F  ++  +G  + +KE    +L+    G   IIV GG +E       + T+ ++ R+
Sbjct: 149 FPLFREYLMSNGPVSVSKESLSHVLSKDGGGNVSIIVLGGAKEALEAHPGTFTLCIRQRK 208

Query: 176 GFVRIAMETGTPLVPVFCFGQSSVYKWW-KPGGKFF------LQFSRAIKFTPIVFWGVF 235
           GFV++A+  G  LVPVF FG++ +YK    P G +       +  S  +    I   G+F
Sbjct: 209 GFVKMALTHGASLVPVFSFGENDLYKQINNPKGSWLRTIQDAMYDSMGVALPLIYARGIF 268

Query: 236 G---SPLPFRRRMHVVVGRPIEVKKNPNPTSDEVLDVHGQFVEALKDIFERYKAQLG 280
                 +P+R+ ++ VVGRPI V++  NPTS+++ ++H  ++E LK +F  +K + G
Sbjct: 269 QHYFGIMPYRKLIYTVVGRPIPVQQTLNPTSEQIEELHQTYLEELKKLFNEHKGKYG 325

BLAST of Sgr029759 vs. ExPASy TrEMBL
Match: A0A5A7UH84 (Acyltransferase OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold344G00070 PE=3 SV=1)

HSP 1 Score: 480.7 bits (1236), Expect = 4.2e-132
Identity = 242/324 (74.69%), Postives = 260/324 (80.25%), Query Frame = 0

Query: 1   MDKISGPSVTVFKGGAGSATKWRMMHIIVALGVWLGGIHLNFTLALISVFYLSLPKALLV 60
           MDKIS PSVTVFKGG GSATKWRMMHIIVALG+WLGGIHLNF L LIS+FYLSL KALLV
Sbjct: 1   MDKISEPSVTVFKGGGGSATKWRMMHIIVALGIWLGGIHLNFALGLISLFYLSLSKALLV 60

Query: 61  FGLFLVLVLIPVDDKSKYGRLLARYICKHASSYFPVTLHVEDIHAFDPNRAY-------- 120
           F L L+LVLIPVDDKSKYGR+LARYIC++A SYFPVTLHVEDIHAFD NRAY        
Sbjct: 61  FALLLILVLIPVDDKSKYGRVLARYICQNACSYFPVTLHVEDIHAFDTNRAYVFGYEPHS 120

Query: 121 ---------------------------GVLYTVFEAHM-DMDGSNASNKEKFYLLLASGY 180
                                       V YT F  H+    G   + ++ F  LLA+GY
Sbjct: 121 VLPIGVVALADLTGFMPLKKLKVLASSAVFYTPFLRHIWTWMGLTPATRKNFISLLAAGY 180

Query: 181 SCIIVPGGVQETFHMEHNSETVFLKARRGFVRIAMETGTPLVPVFCFGQSSVYKWWKPGG 240
           SCIIVPGGVQETFHMEHNSETVFLK RRGFVRIAME GTPLVPVFCFGQSSVY+WWKPGG
Sbjct: 181 SCIIVPGGVQETFHMEHNSETVFLKTRRGFVRIAMEMGTPLVPVFCFGQSSVYQWWKPGG 240

Query: 241 KFFLQFSRAIKFTPIVFWGVFGSPLPFRRRMHVVVGRPIEVKKNPNPTSDEVLDVHGQFV 289
           KFFLQFSRAIKFTPIVFWGVFGSPLP+RR+MHVVVGRPIEVKKNPNPTSDEVLD+HG+FV
Sbjct: 241 KFFLQFSRAIKFTPIVFWGVFGSPLPYRRQMHVVVGRPIEVKKNPNPTSDEVLDLHGRFV 300

BLAST of Sgr029759 vs. ExPASy TrEMBL
Match: A0A1S3BKJ4 (Acyltransferase OS=Cucumis melo OX=3656 GN=LOC103490635 PE=3 SV=1)

HSP 1 Score: 480.7 bits (1236), Expect = 4.2e-132
Identity = 242/324 (74.69%), Postives = 260/324 (80.25%), Query Frame = 0

Query: 1   MDKISGPSVTVFKGGAGSATKWRMMHIIVALGVWLGGIHLNFTLALISVFYLSLPKALLV 60
           MDKIS PSVTVFKGG GSATKWRMMHIIVALG+WLGGIHLNF L LIS+FYLSL KALLV
Sbjct: 12  MDKISEPSVTVFKGGGGSATKWRMMHIIVALGIWLGGIHLNFALGLISLFYLSLSKALLV 71

Query: 61  FGLFLVLVLIPVDDKSKYGRLLARYICKHASSYFPVTLHVEDIHAFDPNRAY-------- 120
           F L L+LVLIPVDDKSKYGR+LARYIC++A SYFPVTLHVEDIHAFD NRAY        
Sbjct: 72  FALLLILVLIPVDDKSKYGRVLARYICQNACSYFPVTLHVEDIHAFDTNRAYVFGYEPHS 131

Query: 121 ---------------------------GVLYTVFEAHM-DMDGSNASNKEKFYLLLASGY 180
                                       V YT F  H+    G   + ++ F  LLA+GY
Sbjct: 132 VLPIGVVALADLTGFMPLKKLKVLASSAVFYTPFLRHIWTWMGLTPATRKNFISLLAAGY 191

Query: 181 SCIIVPGGVQETFHMEHNSETVFLKARRGFVRIAMETGTPLVPVFCFGQSSVYKWWKPGG 240
           SCIIVPGGVQETFHMEHNSETVFLK RRGFVRIAME GTPLVPVFCFGQSSVY+WWKPGG
Sbjct: 192 SCIIVPGGVQETFHMEHNSETVFLKTRRGFVRIAMEMGTPLVPVFCFGQSSVYQWWKPGG 251

Query: 241 KFFLQFSRAIKFTPIVFWGVFGSPLPFRRRMHVVVGRPIEVKKNPNPTSDEVLDVHGQFV 289
           KFFLQFSRAIKFTPIVFWGVFGSPLP+RR+MHVVVGRPIEVKKNPNPTSDEVLD+HG+FV
Sbjct: 252 KFFLQFSRAIKFTPIVFWGVFGSPLPYRRQMHVVVGRPIEVKKNPNPTSDEVLDLHGRFV 311

BLAST of Sgr029759 vs. ExPASy TrEMBL
Match: A0A0A0L0W7 (Acyltransferase OS=Cucumis sativus OX=3659 GN=Csa_3G00090 PE=3 SV=1)

HSP 1 Score: 473.0 bits (1216), Expect = 8.8e-130
Identity = 238/324 (73.46%), Postives = 258/324 (79.63%), Query Frame = 0

Query: 1   MDKISGPSVTVFKGGAGSATKWRMMHIIVALGVWLGGIHLNFTLALISVFYLSLPKALLV 60
           +DKIS PSVTVFKGG GSATKWRMMHIIVALG+WLGGIHLNF L LIS+FYLSL KALLV
Sbjct: 14  IDKISEPSVTVFKGGGGSATKWRMMHIIVALGIWLGGIHLNFALGLISLFYLSLSKALLV 73

Query: 61  FGLFLVLVLIPVDDKSKYGRLLARYICKHASSYFPVTLHVEDIHAFDPNRAY-------- 120
           F L L+LVLIPVD KSKYGR+LARYIC++A SYFPVTLHVEDIHAFD NRAY        
Sbjct: 74  FALLLILVLIPVDHKSKYGRVLARYICQNACSYFPVTLHVEDIHAFDTNRAYVFGYEPHS 133

Query: 121 ---------------------------GVLYTVFEAHM-DMDGSNASNKEKFYLLLASGY 180
                                       V YT F  H+    G   + ++ F  LLA+GY
Sbjct: 134 VLPIGVVALADLTGFMPLKKLKVLASSAVFYTPFLRHIWTWMGLTPATRKNFISLLAAGY 193

Query: 181 SCIIVPGGVQETFHMEHNSETVFLKARRGFVRIAMETGTPLVPVFCFGQSSVYKWWKPGG 240
           SCIIVPGGVQETFHMEHNSETVFLK RRGFVRIAME GTPLVPVFCFGQSSVY+WWKPGG
Sbjct: 194 SCIIVPGGVQETFHMEHNSETVFLKTRRGFVRIAMEMGTPLVPVFCFGQSSVYQWWKPGG 253

Query: 241 KFFLQFSRAIKFTPIVFWGVFGSPLPFRRRMHVVVGRPIEVKKNPNPTSDEVLDVHGQFV 289
           KFFLQFSRAIKFTPIVFWGVFGSPLP+RR+MHVVVGRPIEVKKNPNPTSDEVLD+HG+FV
Sbjct: 254 KFFLQFSRAIKFTPIVFWGVFGSPLPYRRQMHVVVGRPIEVKKNPNPTSDEVLDLHGRFV 313

BLAST of Sgr029759 vs. ExPASy TrEMBL
Match: A0A6J1E6L6 (Acyltransferase OS=Cucurbita moschata OX=3662 GN=LOC111431182 PE=3 SV=1)

HSP 1 Score: 468.4 bits (1204), Expect = 2.2e-128
Identity = 235/323 (72.76%), Postives = 259/323 (80.19%), Query Frame = 0

Query: 2   DKISGPSVTVFKGGAGSATKWRMMHIIVALGVWLGGIHLNFTLALISVFYLSLPKALLVF 61
           D+ISG +VTVFK G GS TKWR MHI+VALG+WLGGIHLN  LALIS+FYLSLPKALLVF
Sbjct: 4   DEISGSTVTVFKEGGGSPTKWRTMHIMVALGIWLGGIHLNVALALISLFYLSLPKALLVF 63

Query: 62  GLFLVLVLIPVDDKSKYGRLLARYICKHASSYFPVTLHVEDIHAFDPNRAY--------- 121
            L L+LVLIPVDDKSKYGRLLARYIC++ASSYFPVTLHVEDIHAFDPNRAY         
Sbjct: 64  ALLLMLVLIPVDDKSKYGRLLARYICQNASSYFPVTLHVEDIHAFDPNRAYVFGYEPHSV 123

Query: 122 --------------------------GVLYTVFEAHM-DMDGSNASNKEKFYLLLASGYS 181
                                      V YT F  H+    G + + ++ F  LLA+GYS
Sbjct: 124 LPIGVVALADLTGLMPLHKLKVLASSAVFYTPFLRHIWTWMGLSPATRKNFSSLLAAGYS 183

Query: 182 CIIVPGGVQETFHMEHNSETVFLKARRGFVRIAMETGTPLVPVFCFGQSSVYKWWKPGGK 241
           CIIVPGGVQETFHME NSETVFLK RRGFVRIA+E GTPLVPVFCFGQSSVY+WWKPGGK
Sbjct: 184 CIIVPGGVQETFHMERNSETVFLKTRRGFVRIAIEMGTPLVPVFCFGQSSVYQWWKPGGK 243

Query: 242 FFLQFSRAIKFTPIVFWGVFGSPLPFRRRMHVVVGRPIEVKKNPNPTSDEVLDVHGQFVE 289
           FFLQFSRAIKFTPI+FWGV GSPLP+RRRMHVVVG+PIEVKKNPNP+SDEVLD+H QF+E
Sbjct: 244 FFLQFSRAIKFTPIIFWGVLGSPLPYRRRMHVVVGKPIEVKKNPNPSSDEVLDLHRQFIE 303

BLAST of Sgr029759 vs. ExPASy TrEMBL
Match: A0A6J1HQL0 (Acyltransferase OS=Cucurbita maxima OX=3661 GN=LOC111465195 PE=3 SV=1)

HSP 1 Score: 464.2 bits (1193), Expect = 4.1e-127
Identity = 234/323 (72.45%), Postives = 257/323 (79.57%), Query Frame = 0

Query: 2   DKISGPSVTVFKGGAGSATKWRMMHIIVALGVWLGGIHLNFTLALISVFYLSLPKALLVF 61
           D+ISG +VTVFK   GS TKWR MHI+VALG+WLGGIHLN  LALIS+FYLSLPKALLVF
Sbjct: 4   DEISGSTVTVFKEVGGSPTKWRTMHIMVALGIWLGGIHLNVALALISLFYLSLPKALLVF 63

Query: 62  GLFLVLVLIPVDDKSKYGRLLARYICKHASSYFPVTLHVEDIHAFDPNRAY--------- 121
            L L+LVLIPVDDKSKYGRLLARYIC++ASSYFPV LHVEDIHAFDPNRAY         
Sbjct: 64  ALLLILVLIPVDDKSKYGRLLARYICQNASSYFPVNLHVEDIHAFDPNRAYVFGYEPHSV 123

Query: 122 --------------------------GVLYTVFEAHM-DMDGSNASNKEKFYLLLASGYS 181
                                      V YT F  H+    G + + ++ F  LLA+GYS
Sbjct: 124 LPIGVVALADLTGLMPLHKLKVLASSAVFYTPFLRHIWTWMGLSPATRKNFSSLLAAGYS 183

Query: 182 CIIVPGGVQETFHMEHNSETVFLKARRGFVRIAMETGTPLVPVFCFGQSSVYKWWKPGGK 241
           CIIVPGGVQETFHME NSETVFLK RRGFVRIA+E GTPLVPVFCFGQSSVY+WWKPGGK
Sbjct: 184 CIIVPGGVQETFHMERNSETVFLKTRRGFVRIAIEMGTPLVPVFCFGQSSVYQWWKPGGK 243

Query: 242 FFLQFSRAIKFTPIVFWGVFGSPLPFRRRMHVVVGRPIEVKKNPNPTSDEVLDVHGQFVE 289
           FFLQFSRAIKFTPI+FWGV GSPLP+RRRMHVVVG+PIEVKKNPNP+SDEVLD+H QFVE
Sbjct: 244 FFLQFSRAIKFTPIIFWGVLGSPLPYRRRMHVVVGKPIEVKKNPNPSSDEVLDLHRQFVE 303

BLAST of Sgr029759 vs. TAIR 10
Match: AT3G51520.1 (diacylglycerol acyltransferase family )

HSP 1 Score: 323.9 bits (829), Expect = 1.3e-88
Identity = 160/299 (53.51%), Postives = 203/299 (67.89%), Query Frame = 0

Query: 26  HIIVALGVWLGGIHLNFTLALISVFYLSLPKALLVFGLFLVLVLIPVDDKSKYGRLLARY 85
           H I+A+ +WLG IH N  L L S+ +L    +L+V GL  + + IP+D +SKYGR LARY
Sbjct: 17  HSIIAMAIWLGAIHFNVALVLCSLIFLPPSLSLMVLGLLSLFIFIPIDHRSKYGRKLARY 76

Query: 86  ICKHASSYFPVTLHVEDIHAFDPNRAY--------------------------------- 145
           ICKHA +YFPV+L+VED  AF PNRAY                                 
Sbjct: 77  ICKHACNYFPVSLYVEDYEAFQPNRAYVFGYEPHSVLPIGVVALCDLTGFMPIPNIKVLA 136

Query: 146 --GVLYTVFEAHM-DMDGSNASNKEKFYLLLASGYSCIIVPGGVQETFHMEHNSETVFLK 205
              + YT F  H+    G  A++++ F  LL SGYSC++VPGGVQETFHM+H++E VFL 
Sbjct: 137 SSAIFYTPFLRHIWTWLGLTAASRKNFTSLLDSGYSCVLVPGGVQETFHMQHDAENVFLS 196

Query: 206 ARRGFVRIAMETGTPLVPVFCFGQSSVYKWWKPGGKFFLQFSRAIKFTPIVFWGVFGSPL 265
            RRGFVRIAME G+PLVPVFCFGQ+ VYKWWKP    +L+ SRAI+FTPI FWGVFGSPL
Sbjct: 197 RRRGFVRIAMEQGSPLVPVFCFGQARVYKWWKPDCDLYLKLSRAIRFTPICFWGVFGSPL 256

Query: 266 PFRRRMHVVVGRPIEVKKNPNPTSDEVLDVHGQFVEALKDIFERYKAQLGYDNLQLKVL 289
           P R+ MHVVVG+PIEV K   PT +E+   HGQ+VEAL+D+FER+K+++GYD L+LK+L
Sbjct: 257 PCRQPMHVVVGKPIEVTKTLKPTDEEIAKFHGQYVEALRDLFERHKSRVGYD-LELKIL 314

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
KAA0052901.18.7e-13274.69diacylglycerol O-acyltransferase 2 isoform X1 [Cucumis melo var. makuwa][more]
XP_008448438.18.7e-13274.69PREDICTED: diacylglycerol O-acyltransferase 2 isoform X1 [Cucumis melo][more]
XP_038903787.15.7e-13173.77diacylglycerol O-acyltransferase 2D isoform X2 [Benincasa hispida][more]
XP_004146185.31.8e-12973.46LOW QUALITY PROTEIN: diacylglycerol O-acyltransferase 2D [Cucumis sativus][more]
XP_031738801.11.8e-12973.46diacylglycerol O-acyltransferase 2D-like isoform X2 [Cucumis sativus][more]
Match NameE-valueIdentityDescription
K7K4242.6e-9456.23Diacylglycerol O-acyltransferase 2D OS=Glycine max OX=3847 GN=DGAT2D PE=1 SV=1[more]
A1A4421.9e-8954.82Diacylglycerol O-acyltransferase 2 OS=Ricinus communis OX=3988 GN=DGAT2 PE=1 SV=... [more]
Q9ASU11.8e-8753.51Diacylglycerol O-acyltransferase 2 OS=Arabidopsis thaliana OX=3702 GN=DGAT2 PE=1... [more]
Q70VZ72.3e-1833.782-acylglycerol O-acyltransferase 1 OS=Bos taurus OX=9913 GN=MOGAT1 PE=2 SV=1[more]
Q91ZV42.3e-1830.512-acylglycerol O-acyltransferase 1 OS=Mus musculus OX=10090 GN=Mogat1 PE=1 SV=2[more]
Match NameE-valueIdentityDescription
A0A5A7UH844.2e-13274.69Acyltransferase OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold344G0007... [more]
A0A1S3BKJ44.2e-13274.69Acyltransferase OS=Cucumis melo OX=3656 GN=LOC103490635 PE=3 SV=1[more]
A0A0A0L0W78.8e-13073.46Acyltransferase OS=Cucumis sativus OX=3659 GN=Csa_3G00090 PE=3 SV=1[more]
A0A6J1E6L62.2e-12872.76Acyltransferase OS=Cucurbita moschata OX=3662 GN=LOC111431182 PE=3 SV=1[more]
A0A6J1HQL04.1e-12772.45Acyltransferase OS=Cucurbita maxima OX=3661 GN=LOC111465195 PE=3 SV=1[more]
Match NameE-valueIdentityDescription
AT3G51520.11.3e-8853.51diacylglycerol acyltransferase family [more]
InterPro
Analysis Name: InterPro Annotations of Monk fruit (Qingpiguo) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR007130Diacylglycerol acyltransferasePFAMPF03982DAGATcoord: 125..280
e-value: 5.9E-50
score: 169.8
coord: 55..114
e-value: 5.7E-6
score: 25.5
NoneNo IPR availablePANTHERPTHR12317DIACYLGLYCEROL O-ACYLTRANSFERASEcoord: 11..116
coord: 114..287
NoneNo IPR availablePANTHERPTHR12317:SF64TYPE 2 ACYL-COA DIACYLGLYCEROL ACYLTRANSFERASEcoord: 11..116
coord: 114..287

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Sgr029759.1Sgr029759.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0048235 pollen sperm cell differentiation
biological_process GO:0050790 regulation of catalytic activity
biological_process GO:0017157 regulation of exocytosis
biological_process GO:0019432 triglyceride biosynthetic process
cellular_component GO:0005789 endoplasmic reticulum membrane
cellular_component GO:0016021 integral component of membrane
cellular_component GO:0005886 plasma membrane
molecular_function GO:0004144 diacylglycerol O-acyltransferase activity
molecular_function GO:0005096 GTPase activator activity
molecular_function GO:0045159 myosin II binding
molecular_function GO:0019905 syntaxin binding
molecular_function GO:0016747 acyltransferase activity, transferring groups other than amino-acyl groups