CsaV3_3G020720 (gene) Cucumber (Chinese Long) v3

Overview
Name: CsaV3_3G020720
Type: gene
Organism: Cucumis sativus L. var. sativus cv. Chinese Long (Cucumber (Chinese Long) v3)
Description: phenolic glucoside malonyltransferase 1-like
Location: chr3: 16919481 .. 16945633 (-)
RNA-Seq Expression: CsaV3_3G020720
Synteny: CsaV3_3G020720
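The location string in the overview can be split into its parts programmatically. The sketch below is illustrative only: the function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re

def parse_location(text):
    """Split a 'chr: start .. end (strand)' string into its four fields."""
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand

def span_length(start, end):
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1

seqid, start, end, strand = parse_location("chr3: 16919481 .. 16945633 (-)")
print(seqid, strand, span_length(start, end))
```

Under that assumption the feature spans 26153 bases on the minus strand of chr3.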
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATTTCTTGGCCCCTAATACTTAAATTTACAATTATTCGTTCAAAGTATAGAAGTGTTTTTTTCTTTGGAAAACAAAGTGGTTTGTTTTATTCAAATTCACGACTCACCATATTTCTTCCCAACCCTTCAATCACATGTTATATAGAGACAAAGAAGCTTAGCAATCAACTGCCACAACATTTCATGGCGACAAATCATGGGTCCACCGCCATTAAGGTCCTCGAGGTCTGCACGGTAGCTCCTCCGCCCGGATCCACCGTGCCGGTCACTCTTCCCCTCACCTTCTTCGACATCCTCTGGTTTCGCTTCCCTCCCGTCGAACGCCTCTTCTTCTACAAATCTCCGGTGCCATTCTACGTCATCGTTTCAAACCTCAAAAAATCTCTCTCTCTCGTCCTCCAACACTACCTACCCCTCGCCGGAGCAATTGTATGGCCTGAAAATTCCCCCAAACCGGCCGTCGAAACCGCCGTCCGCGACGGCATTGTGCTTACAGTCGCGGAGTCTGAGGACGACTTCGACCATCTCATCGGCGACGGGTTGCGTAAAGAGGCAAAACTTCGGCCGTTGGTGGCGGAGCTTGCAGCAGAGGAGGATCGGGCGGCAGTGGTGGCTGTGCAGGTAACCTGGTTTGGGAATGGTGGATTTAGCATCGGAATAACTTCACATCACGCAGTTCTGGACGGAAGGTCGTCGACTTCTTTCATGAAATCGTGGGCCGGATTGTGTAAGAATTTGGTTGGGGGCGGTGAGATTTTTTGCCCGGCAGCTGAGACGATGCCGTTTTATGATCGAAGCGTGGTGACAGATAAGATGGGACTTGAAGCCATTTATTTGAAATGCTTGTTGGCCCATGAAGGGCCCAACAATAGAAGCTTGAAATTTTGGGACTTCAAAACTCCGCCAGATTCATTCCGAGGTATTTCCAATAATAGTCTTTTATAATCAGTTATTTAAATGTCTACGATAAAATCTAAACGGAAAGAAAAGGGGAGAAAGCCAATTGCAAGTAAAGACTCTGACTTCCTTATTATATTTGATATTGGATACAAATGATCAAATAATGATATTCTAGAATATGTACTAAGTGATACAATGGACCAAGAAAACAAGTAACAGGGCTTGTAAACAATATGACTAATGGGCTATCTTAACACCCCCACAGATCTATATCTGTGGTACCAAAAACAGTCCATTTTTGCTTAGGTTTTCTTAAAGTTGGTAGCAGATAAAGGTTTGGTTAACACATCTACAATTTCGAAGGTGTGAACAAGAAGCCCCTGATTTTGTATAAGGTCCCGAACAAAGTAAATGTCAAGTTCAATTGTACTGCACTTAAATGATTTTGTAAAGCTTAGTTTTAGAGTGCAGTATAGAATTAGCACTCAAGTGTACTGCACTTAAATTGTCATACCACATAACAAGAGGATAAGATATTCTAATGCATAAATATGTAAAAGAGACTGAATTCATACCAACTATGTTGCAAAAAGAGCTAAACATATATGCATTCATTTGTACCTGATGTTGATATGATTATTTGCTTTTTCCGAATCCCAGTTAACTAGATTAACACCAAAATAGACACAGTATCCTAATGTAGATTTTTTATCATCCGGGTCAAAAGCCCAATCAGCATTAGCAAAGCCTACCAAGTTTAATGTAATGACCCAACTCAAGTCATTACTAATAAAACAAATAAAAACTTTTTATTTAAAATAAAAGAAAATACTAAATTTAAAGAAAACCTTTCGAATATTCATTTAAAATCAACAATAAATAAAATGTGTGAAGAAAATAATAAATAAAAATATTATTTAAAATACTAGTTCAGGCCCTCTTTAAAATAAAAGAAATATCTAAAAGTAAATATAAGTTTCTAAGATATTTTTTTGAAAATTGAGTACCGAAAACAATCAATGCTGAATAAAACATATATTGCCCCTCTATGGTGAACCACGGTTTCTTCTCGTCATTTGCAAACTTGTCATTACCTCTAC
CTTAGCCTGTAAAATTAAATATAGAGAAATTGTGAGTATAATTTACACTCAGTAAAGAACCCACTATTAGTCCTTCTAGATGTTTGTTAAATTTATGTTAGAGTCACGTAAGAAGGTACTGCCTAACTATACAGGTTAGACGATGTGTTATATCATACTTGCTCGTCTTTTAGGATGCAAATATACAGGTTAGACGATGTGTTATATCTAATCATTGTGAATGTAGTAATAGTAACTAATCCTCAATTAAGATACACACCTCTCAGTGCTTTTTGCCATACACATTAACCTCTCAGTAATGTGTGACGAGAAGCACTGTTGCATTCTCTTTGCCACTTGCTTAGCTAAAAGTGGTTCTCACGTATTTGGTGATTATGATCTCTACTTTTGCTCTTCTTTTGAATGAACAAGGTTAATATACTCATATTTAAATGAACAAAACTCTTACTACACACAATATAACTTTTTAAGTATCAAATACACCATTCATTACTGAGTAGCTATTCAATCATACATATATGGAAGAGATAAGGAAGATGGAAGAATGGAAGAAGAAATGCATTGCTTTCATACATAATATTTTAGACCTTTCGGCCATAACGTCCTGTTTACAAAGCAACACAATATAAAAACAAGAAAATGGAAATACAATAGAGGTGTTATGCTGCAGAGTTTGATTTTTTTAATTCTCTTTGGCTTTGATTTTTGCCTATATGTGAGGGGGATTAACGGGAGTGATGAATCTCTCTCTCTTTGCTTTAAGCTCTCACCTAAAACTCAAATTAATGGAGAGAAAATGGAGAATGTGAAGAACTTACTTTTTCAGCTAAAGTGTGCACACCTCCTTCATGAGTGAGAAACTTTATTTATAAACACGGAGTGAAACTGACAAACTAGCACTTGCCATTAAGATTCAAAAACGTTGTAGCTACTCTGCGAAGTCTGGCCTGCTTGTCCTTTTCTAACTTTCTCACCAACCTCTTTTGTCTGCTAGCAACTTGATCCCCTAACATCTACTTCCCCGTGCTAAAGAGGTTGGTTGGATTACTTCACGTACCATGTTCTATTCACTGAGCTTCACTCTTCTTTGCTTAATATTATACAATAAGAGTACTAAAACATAATAAAATAAGCATGAAGTTCTTTTGCGGTAAGCTTTTTGCAAAGTAATCAAATCACCTACCTTTTGTCTTTATTTTTTTCTATTTTCATGCGGAAGTCGTCATTATTTTACATAAACGACTATAATAACTTGTATTTCTAAAAGTTATCAATACCTTAGTATTATTTTGATGTAAAGCTTTAAGTTCATATTCCATAGCTTTTTTCTAGTGAGAGTGTTTTAAGGCTTATTTCTCACTATTTGGTTCATGTTGAGTGTAATCAACTAAAAGAACTTTGAGTTTAAAGATTCCACTATTATCTTGGGTCATCATTGGATGAATGGGAATAATTGATGTGGTCTATAGCAGTAGGAATAACTTGATGTCCATTGGTTGGCCAATAATGAAAGTGGGCTTCATCTTGATTCTCTTCAATTTCAGCTTCAGTGGATGAGTTACCTTGAGTCCCCATCTCTAAAGGATGCACAATAGTAGGATTTAGATGATCGTTAGCATAAAAAATGTAGCCTCTCTACACTATATGGCTCATATTTGGATTTTGAAGAGAATGGTGTGGATGAGAGGGCAGTTGACTACTAATTGAGTTCGTGGTTTGGTATGTTTGAGAAATGAGGCATAAGGAAAGGAGTTTTCATTAAAAAAATAAATGTCTGGATATATAGATACTACTTTCAGGTGTAGACATTTATAGCCCTTATGTGATGTGCTATACCTAAGAAAGATACATGGTTGAGATATGAGAGAAAGTTTGTGAGATTGATAGGGTCTCAGATATGGGTAGCATTTGCATTCAAATATTATTAGTGAAGGATAATTAGGTTTTTACTAAATAGCTTCTCCAAGGGACTGAGATTTTGTAAGACAGGGGTAGTAGAA
AAGGCTTCATCCCAAAAATTTAAAGTGTGGCTTGGGATAAGAGAGTAAGTCTCATATCCATGACATGTCTATGTTTTCGCTCCACTATGCTATTTCGTTGTGAGGTATGGGGATAAGTTATCCTATGTTCAATGCCTTTTTGTTGAAGGAAGGGAACAAATGGTTTGAATTCACTACCTCCATTTGTTTGTAGAGATAGTATAGGTCTATTTAAGGATTTCTCTACTAGTGTATTAAACTATTAAAAGGCAAAAAAATGAATCATATTTCAAGTTTAAGAAATAAATGTATGTATATATGCTATAGGCATCTACAAAACTAATGTAATATCTAAACCCATTTCTAGATGTACTATAGGCAGGTCCCCACAAATCACAAACAATAAGTACTAAATGTACTAGATGTCTTCCCATTTTGTATTCATTGAAATATATATATATATATATATATATATATATATATATATATATATATATATATATATATATATATATAACATGTTTATATCTTATCAATCCATCATGTGTTTTCATTCTTCATCAGTCAATATGTGATTTGTGCTTTAGGTTAGTGATAAGTTCTTAATAAGAAATTTTTTATTGTAAGTCTTATCGTGTTAGTTGAGCTATTCTCATCTCTTGGCCTAGCGTAAATCGTCAACTACCCGAGAGGGTAAGTAGCAAGAACATGGCCAACGCGCGCAAGGAAGAGTTTTCTCTTTCTTTGCGAAATAACTTCGATCACTCGTATCCTGAAGGTGAACAAGTGTGGGCACTAGGCGTCGAGAGGTTATTGTCCTTAGTTGTGAAGTCCTATACGATTTATAAGTGAACACTTCTAGTTATATCTTGCCTAATCATATTTAGTCATATGGATCCCGAAAGGTGACCATGTGGGACATGTAATGACTTTGAGAGAAGCACACCTTAATCATAAGCATGGTTGACTAAGTTGTATCGCATCAAGGCTATCATATTACTTAATCGTGTTATAATGCATTGATGTTCCTTCAATAGCAACATTACCGAATTGAACTGGGTCGGGTCGGGTCAAATAGGTTTGGAAAAAGAGGAAGAAAAGTTCGATTTCCACCTTCATCTACTCCGTTGGTTCCGTCTTCCTTCTTCTGTTCAGAAAAGTCTTCCCCAAATCGTTCGCCCAATCTCCAATACTCCTTCTTCCTTCAACAAATCGTTCGCCTAATCTTCGATCCTCCTTATTCCTCCTTATTCCTTCTTCTATTCACGTGGATCTGAGTTCGATTTCGACGATTCTCTGTTCATAATTATGTTTCATGGGTTTTTTTTTTAAAAAAAAATTGAATCTGGGTAGCTGACATATTGAAATGCCCAAATTTTGTTCAAAACCTAGATCTGTGAAACTTCAATTGTTCAAACACTAGTATAATCTTTATGATAAATCAATTGTTGTGGTGATATTGATTGGACAAAGAAAAGAAAACAGAGGAGAAGGAAAAGAAATAGGGAAAAAGATGAAAGAGAAGGATGAAGAAAAAAAAAGAGAGTAACGTGACACTTAAGAAAGAAGGAAGGAACCAAGGGAGGAAGGAAGAGAAGAAGGAAGGAAGAATGAAAGAAGGAAAAAGGAAAAAGGAAAAAGGAAGAAAGAACGAAAGAAGGGAAGAAAGGAAGGAAAGAAAGAAAGGGAAAAAAATAATTTAAGAAAAACAAAGAAAAAGATCTGACTCGACCTGAATTTGGGTCGGGTTGGATCGGGTCATAGCATTGTGGAAACCGTGCACCTCGGGTTGGGTCGGGTTTGGATGAAAATCGACCAGACCCGACCCAGCCCAATTACACCCGTAATAAATTATTTCACTTACCGGAGATACTCACCCTACCATATAACTCTTGAAATTGAAACTTCTGTATGCATATTAGTATTCATATTTTAATCACGTTTTATTATTATATTTTTAGTCTAATATAACTTAAATTTGTTGTGCTAGCTTCTATTTTTGTTCTTCTTACAAACAAACG
ATTAGTGTGATTTTTTCTTATGGTAAAGGTAGGAGTTAGAAGAAATTATATTTTGGGTTAAATTACACGTATTTAGTGTGCGTGCATTTGAAATATATTTTCAGGGATTTAGTTTTAAAAGTAACGTCAATTCTAAAACAAAAATTAGAATGTGTTGGTAGCCAAACAAAACGAGTATATTTATAAAAAAAGATTAATAAAAATGAGTTTTTTTTTAAAAAAAAAATTCTTAAATGAATTCAATTGGACCATTCATCTTTGATAGGTTTAGGCATATGTTTCCTTGATTTCTAAACTTGGAAGATTGAATTAAATGCAATATTTTTACTTCCTTACCTATAGTCTATTTATTTGTCTATTATATATAAATATGTGTATGTTTAAACTTTAAACTTCCAATAATTTTGTAATTTGTTTTACAAAAACTTTTAAAAACAGATTGTTTTACAAAACTTTTAAAAATTGAAAGTTAAAAACTAAATTTGTAAAGAAAATTTAGGAACTAAATACTATGGACTAAATTTTTCATGTAACTACTTTTGTAGTGCTAACTCAAATATGGTGGAACTTGGAATAATAATTCAAACTTTAGAACCCTTAGTGTATAACCAATACTGTCCACAGCCCGCACACAACCTTAACCACTTCTAAACAATAAATAGCACAACCATAACCAAAGATAAGTAACGGAAAAACAACACGGAACTTCACTTTAATTCACTTAAGCTTAAGAGTTCTACAACAATTCAAGATACAAATTCTTAAAGACAGGTAAATATAAGAAATTCAGTCAAAAGCCTCCAAAGTTTTCAATCTTCAACGATCGTTTAGTTTCTAGTTTGATTGTTTAGCTCCGTAAACCATTGCTTACCTTTTTGCGTGGTCATTTTGGAATCGCTTAGCTTCCTAAACGATCGCTTACCTTTCATCACAATCGCTTACCCTTTTAGCTTATTGCTTAGCTTCGTACAACTTGGAAAGGAAAAACTAAAAACGTAAGTCAAAAGACTTAGTGAGTGAAGTTTTAAGAAAACCATTTGGAACACAAATCATAACCCAATCGTTTCATTAAAACAAACTTAATACCTCGTTTTACAAATATATCAGATCGCATAAATATTCATTAGTCTCACATGTTCTTTCGCCTAGAACATAAATCATTCCCTATACCCAAACACAACATCTCTTGTATTTAGACTACTGTGATATCTTTTTCATCAACAGTATCCAAGAATTTTCTAGTTCATTCCTTATTGCTCAGATCGCCAAATACAATATGCTCCTTTTACGCATTAGTCAGCTTTATTCCATAATGTAGTCCTTTTCTACATGATTGTCTTTACTTCTTATGTGTTCATAATAATTAGCTCATTGATAGGAATAAGACTAAGAGTTTGAACACAATTTCACATTTATTTGAAACACATAAAACTATCTTTAAACATATCATATGCTTTCGTTCGAAATTTCTTTATAAACATTAATGTATGAAAATATCATTACAAATCAGCATGAATTTAGACAAAAGTTTTAAAATCTGTTAACACTTCTGGAAAGACACTCACAAACTTAAGTCTCATTTATTTTCAGAACGTAGCTCCTCTCAAGTTCCTCTTCTTGTCTGAACAACTTCTTTGACAGCAAAACCCTCAATTTCCTTCTCTTTGTAAATTCTCATAATTGTTCGTTTCCTTTTGTCTTACCTTCTTATTTAAATTAATCTTAAAATAGATTAGTCACTTCTTAAACTCTTATTAATGTTGTCAGCTCCTACTTAGCCACAAAAATGCATGCAATATGCTTAAGATACCATGAATCCAACTCCACTAACCTGATAATTATAAGCTCAAAAGCGTCTTTAATCTTTCTACACGATCGTTCATCCTTCTAAGTAATTCTTCAATCTTTTACACGATCGTTTAGCCTATTATACGATCATTCAATCCTCTTACAAGATCTTTTATCCTTTTAATGATTGTTTCCTTATTATACGATCATTC
AGCTTTCTTACACGATCGTTTAACTTACTACATGATCATTTACTATCTTATATGATCGTTAGACAATTTTTTTTTAGAACCTTCTATTCAGGCACGAGTCTTACAAGTACCTTGAGTTATTTTGATATGAACATTTTTTTTCATTTCATCCTCTATAAATAGAAACTAAGGTCTCAAAGCTTTTTGACATACAAATATTGTAAGATCAAGGGAATTGTTTCACAAAATTAGTTGAAGTGTGCCTAAGTTAAGCTAAAAACTCATGGATTAGAAAATGTGTATTTCTAGAAAGTATATATTCTAAAATGAAAAAGAAAATACATTTAAAACAAATTAAATAAACACAAAAAAAGTTATAAGAGAGTTGGGAAGAAAATTCATTTAATTAGGATAAAATAGGCCCAAGATTTTTAATTGGGAAGCAAAATAAGATTCTTTTTTCCTTTTAGTTGTAGTAGAGTAAGATTCAACGTCAACTTTAAAGAACAGAAGTTATGCCATTATCACTGGATAACACTCACTTCTATCGGTAAATTAATGAAGAGTAATAATAATAATAAGGAAAAATTGCATAAACAACCCCTAAGCTATGGGCTTGATTGTCATACCCCACCCCGAGCACTTGCTTGCTTGGCCTGAAATGTGTCGTGAAGCCAATTGATATCATCTCCTTCAAGAACGGCACCAACAGGCCCATAACCTAAACTCGTAAACAGTAAACAATAGTGGAAAGTAATACATAACTTATAAACATAACTGAAAGAAACTTCTTTGGTTAAAACAAACTTTATATAAGTTTACACAGACATAACCTTAATTACACAACAAAACCCAATAATAGATTTTCCAAATCCGAAGTCTCACATTACAAGGACGAATTGAACAAAACATGTACGCTAACAACTAAACCCTTCACCAGAAAGATAGCAGCGATTCAGACTCCGAGGACACTGTCTCTACCTGGAAAGTGGGAAAACATTTTGAAGAGTATGAGCTAAATAAGCCCAATGAATGGTAATTTTTCTAAATACTTTTCATAAATCTTTGTAACAGTAATATGTATATCCTTTAAACACTAAAAGATAAATTATAACATAAAAGCTTTAAACCTTAAAGCAATTAGAAAACAATAACATAATCCTGGTTTTTCCTACTCGAAACTCAAGTATCTACTTCTGAGAGGTAAGCGTACTTACCTCAAGTCATTAGCGCCCTTGCCTCATAATGGGTAATTTTAATCGTGCCTTGTTAAGGGCAAATTTTTAAGCTTGCCTAGTTATAAGTAATCTTGTTCCGTTATGAGTACTTTTTTAGATAGCTTGGTTTCGATAAAACATCCTTGCGTGAGCCTTAACAATATTTCTTTGATAAACATCATTTCTAGTAAACCTTAAACATTCCTCAATAAACAGTATGCTTGAATATATACCTAAACAATAACATGCTTGAAAGACTTTGCATAACATAACTTAAACATTTAATCATAAATCATTCACAAATCATGGCTTAAAAAATGCTTCAAATCACTTTAAATAAAACATTTTTCACTCATAGATACAAGCTTAAATCCTTAGCTTATAAATTTGTCCAATCTCCTCTTGTTCTAAAATAATGGTAAAATATTCCAATTAATTCTCTTTTACCAAGAAAATATCAAAAGTCTACTCAGAAAACCCTAAAACCACTAAGACAGCTCAAATTGCCATATGACATTGTCTGACGCACATGGGTCGGCGACGGGGCCTTTGCCTCGCGCTGAGTTGCCCACATGTTAACCTCACAAGAGATTTGCCACGCACGCTGGGCATGTTCGCCTGCTAACCTCACGCACAGACACATCGCGTGCCTCACCTGGTCGCGTGCTATCTTGGTTGCATTCGCCTTCCGCACAAGTCGTTCATCCAATGCCAACCTCCACGCACACTAACCTCTTTGAACATCAAACTGCCTTTCCATCTTGCCTAGAATAACCCATCACTCGGTTAGCCACCAAGTTGCCC
AAAAACTAAATTTTCATGCAGAAAACATGACACGACTTTGAAAATCCAGAACACTTTTAAAGAAACTGATAAACATAAAATTTGAAACTTATAAACTTAAAATTTAGAACGCTTTTGAGGAAACCTCCTTCTATAAAAACACTTATAAACGTCTTAATTGTCTTACCGTAAAATTTTAACTTTAGATTCCAAAAGATGAGCTCTCCAGCCCTCTGGAAGTCCACATTGCACAAATTCACCCAAAACCTGACCTTCCTCTCTTTCTTCAACTCAAAATCAACCCTTGCTCAAAATCATATGGCAGCCCTATTCTAAAAACTTGACTCAATTTATGATCTTTTGAGAATAATTTCAGCCAAAACACACAAGTGGGGAAAATCTTCATATGAAGTAACCTATTTATATTGGAGCATGCATGAACCTTTATTAACGCCTCCTACTAGCAGAAATCTAAGTTAAGCATGCAAGCACACTTGTAAAGCAGTTCACCGCTTCCACCTCTTCATCATCTCGCCTCATGTCCCTTGACACAAAACGCCTAGTGTTAATGCATTGCTAACCTCTTTGCCCTTCTGGCATGCCAAGCTTCTCGCCCAACAGTCTTCTTATATAACCTCTAAACGCCCACAAAACCTCTAAATTACAGCATTTCTCATCCTTTGACACTCATTTAGAGGTTCTTTTTCACGTAGTGCATAGCCAACATGCCAAATAACCTCTAAAACACCTACTACACAACCCTTGGCTCCCAACCTTAAACGCATACTAACCTTTTTAGAGGTTATTCCTTCTCCTCAGTCGCCCAATGCACCGTCAAGTTAACCTCCAAAGCACCTAGTAACACTCATGTAGAGGTTATTTCCATCCTTTTGGCCGTTTTTACTACACAAAGGTAGCCTCTAAACTCTTTGCATTTAAGTCTCAACACTTGCCAAATTTTCCTCTTTTTTCACTAACCATGACACCACTTTTTATAACACTTTTATCCTTAACTCTCTTCAACACTTAAACTTCTATCACTTCAAAAAGCTTATGTTTTCTTCCAAGGTTCGAGGTTTACTTTGATTGCATTCATACCCTTCAACTTTCAATTTGATCAATTACGCTCCGTAGTTTGTTAATTTGCAATTAGCCTCTTTCTTAAATGAATCTTTATTACCTTAATTGTAATCGAACATTTTAAAATTAAAAAGATAGAAATAAAAATAATCTCTCATCACACATCTTTTCCTCTCTCCTCTCTTTTCTCCTTCCATCTACATCTTCATGATCAAAGAAATCATGAAATGACACTACAAAGCAACTACCAAAATATTAGTATCCAAATTTATCTCTCTTTTCAGGCTCACTTTTCTCCCTTTCTAGAGCTCAACAACCATTGTAGAAACCACTACCACTACTGTCATCATCGCCATTACTTTCTTTGTTGCAGAGATCGTTGTTGCCTCTTCAAAATTTCACCTTTACCCTCTTCCTCAACCCTTTTCCAAACTCCATCTTCATCTTTTTTCTTATGCAGCCGACAACTACTTGGATTTGAAATTTGGATTTAGTGTTGTGGATTTAGTTTTAGGAGTTCAGTGTGTGTTCTAGTATATATATATATATATTATATATATAAATTATCCATGTGGTGAGTCATGGTCATTGCCTGTGAGATGCTTAATATGTCTCTAAATTCGCGTGTAGTTACTAAATCATCGTTGAATATAAGTTTTTCCATTGCTTATCCAAGTGTAAGCGAAACAATAGGTTTTCTGGATAAGCAGGTTGAACATAGGAGGAATTGATATCAAAGCTTAGTTCTAAGGGTACTTCTTCATCTAGGTGGTATAAGATAACTATACAGTATAAAATAAATGTGTGTTCAAGCATAGTCTACTATCTACACTAGAGATTCTATAAAAGGTGGTAAGCATGGTGAAAAGATGGGGGAATGAACTAACTTGTGTTAAAGAAACATGAAGGAAGGTCTCTGCAATGTAAGTGCTTAAATCACG
CAATAAGCTGTGATGCGATAAAGATCAATTAATTAGCTTCTACTTAAAGTGTGCACCACTCAGTGTCTTAAATACCACACTTGTTCGCCTCTCAGGATGCAAATGACAACGTTTCATTAAGTGATGACTATCTAGTAGTTGTTACTTATAAGATTTATAACACTTCTCTTTTAAAGGTGGAAATCTCTTAGCGGCAAGTATCCACACTTGTTCTCCTTTCGGGACACAATTGATGGAAATTTACTAGCCATGGTAGAGTTTGTCTCCAAAATCCTCCTTGCGCGCATTAGCTATATTCTTACTACGCTTACTCTCTCGAGTAGTTGGTGACAAGCACTAGTCAAGTAATGCATTATAGCTTAGCTAACGCAATAGACCTTATAAGTTAGACTACTTATCAAGAACTTATCACAAACCTCAACTACCTATAACACTTTGACAAATGAAGAAGTAAAGTGATGATATATTTAAAGTGGATAAATATGCCATAGGCCACTATATATATGAACACTTTACATTACAATGAAACTTAAAACAAACTTTATATAATAATAATTAAGTATAGCCCATCTAAACAACAAAGTACAACAAGTTTGTTATAGACGAATACAAAATAGAGTTCATAAAAGACCAAGATTGTCACTTACAACCCACAACTAACAAACTATAACTCAACTCATGTACAGACCGTAAGTACTTTTAGAACAAGGCGTACACTAATGAAACTAAAAAATACACTAGCATATAAGAGAATAAAGTCAGGCATCGGGTCCACGATCTCTACCTGGAATGTAAGAAATCTTTTTTGGAAAGAGTGAGCTAAGGCTCAGTGGTTGACTAGCTTTTAAATCAAACATATTTAGAAAATCAATGAGTAGTAAACAACTTATCAATCACACTTTAAGCAAAATAAATGCATAGTGCTCAAACTTCAACAACTATGTTCAAATATTTCTCTTTCAGTTGCTATACCTTGAATCTTTGTACTCTAAAGACGAAACTCTAAGCATCAATAGTGCCCTATCTCATGATGTGGCTCAAGTATGAGATGAAAAATAGGCATCCGTAGAGGTCCTAATCTCTACAAAGCATAATGGTGGAAACTCTAGGTATCTATTGAATAGACTCATCATTAGACCTCTGTGCATAGAAGAAACTCTCCTATGGGCTTGCTACAAATCTCTAGATATCTCTCAGGTATGGACTAAAACATACTCAAACAATCGCTTGCTTAAAACGTTATCTCAAACTACAATGACATGTTATACAAATCTCTTGCATTCACTAAGTCCCAAATTTATAAAATAAGCTTTGTTTAAACAACATTTAACCTAAATACTTTCAAAAACTCAACATTCATATCCTTGCTTATAAATCATGTATAAAGAAATATTCTCAGTAAACTTATTAGTTAAACTCATGCTTTTCAAACTTTTATAAACTTGAAACTCATGCTTTGGAAATCAACTATATAAATCAAGTATAGTTCTCAAATATGTTAACTCAAAAGATGCTTTAAACATAAACTTGGTAAATCATGTTTAGGAATCAATTTAAAATTCATTTTTTTCACTTACAGATGGTAGCTAATTCTGTGGCCTATAAATTTGTCCAATCTTTTCTTGGCTTGAAATTAGAGAATGTAAACTCATTTTACACCTCTAGAATTAGAAAACATCAAAATTTAGCCCAAAAATCTCAAAAACACCAAAATTGTCCAAAAATGGTGTTATGGCACGACTAGCTGCTAAGCATGGACGACAGGTAGCATGCACGCACACACAAGCCCATCCTAGGTGTCTGGTCACGTGCTAGCCTCGCGCACTAGACATGTTGCGTGATAACTTGTATATACAAGTTATTGTGACCTTTTATGAAAAATAATGAAGCCTTATCACATAAGAATGAAAGAAAATATGAAAAAAAAAGTAAAGAATTTGATTAACTTGTGTATCACATCTTACAAAAGAACTTTATCCCTGTTTTATAGTGTTTTAAAT
GTCTTATCATAGAATATTAGGCAAAAAGAAGTAAAGCTCGATGATAGAATGTCACGTGGAGAGACAGGTTCAAGTAACAAGGCGACGAGCAATCAAGCTAGGCGGTCTAATGCACTAGCAGACAAATTTGGTTGGTGTGAATCCTAAAAAAAGGACGAAGGGACGTTTGCACGGCGTCGGTGCAACTTTAGTCAGAACCATTTATGATGTGTGCAATGTCCTGTCACAACGCATTCCGCCTACAAATAAAAGACTCTACTTCATAACAAAGTGTTCGATTTTTGGATTGGAGAGCAAGCTCACTTTCTTCTATTCTTTCTTTCTTGAGTTTTAGGTGAGAGCTTAGAACAAAAGGAGTGAGATTTTTTCTCCAACCATTTCCCCTCTCAAATAGGTAAAAAATCAAAGCCAAAGAGTATGAGAAATCCTACTCTGAGGCTTGATTCAGTTCTTTGTATACCATTTTATACTTTTATTGCATTACATTGTAACATGTACTTTATGGCTAAGAGGCTTAAAGTTATTTTTATAATAGTAATGCACTTCCTCTATCATTCTCTAATCTCTCTATTAACATCTACTTGTTGTGAAGTCAAACTAACTCAACAACACCAAGAACTTGACTTAGTTGGTTAACTCCACGGACTTAAGGAATAGTGTTGTTGCTTATGTTAGTGTTTACTTCTTAGTAAGAATCTCTTCATCATGTAACTGCTAAGTCCTCGTTGTGTACAAGTTTTCCTACTATTTGCCCAAATATAAATGAAACAATAGGTTTTTCTGAGTGATCCAGGGTCGAACATAGAGAATTGATATCAAAGCTTACTTCTAGGGTTTACTCATCATTCAGGCAGTATAAAATAATAAACTATACAGTATAAAAAAGTGTAAGTTTAAACTCTATCTACTTATGTACGCTAAATAGTGTAAAGGTGGAAGTATTGAAGATGGTTGCAAGAATGAAATGCAAGAGTGAACTAACTTGTATTAAAGAAGGGGTGAAGGAATGTCTTAATGCTACCAAGTGAGTAAGCTATGAAAGTCTTAATGCGATATGACTTACTTAACTAGTTTATAATTAAGGCAAGCATCTCTCGGTGTCTTTAAATGCACACCTTGCACACCTTTCAGGATTTAAATGACTAAACCTAGCTAAACAAGGTATATTTATAGGGTGAAAACCTAAGGTGTTAAAGTTATAGAGTTTCACGTTTAAGGGTGAAAACCTCTTAGTGCCCAATACCCACACTTGTTCTCCTTTCGGGACACAAAGATGGAAGTTATCTCGCCAAAGTGAAGTTTGTCTCCAAACTCCTCCTTGTGCGTGTTAGTTGTATTCTTGTTACTTACCCTTTTGAGTAGTTGGCGATATCACTTGCCAAGTGATCAAAATAGCTTAGCTAACACGATAAGAATTATAAGCTAAGCATCTTATTAAGAATCTATCATAGATCCTAAACTACAAATAATGCACGAACATATGAACAATAAAGAGATTGATGAATTGCAAAGGCATAAATATATGTTGTATTAAAGAATAAAGGCCTTCAGCCACTATACAATATGAGTAAATACAATGTAAATAAATACAAAAATAAAAAGAATAGAAATGACACTACAAGTTAAAGGGAGATGGAAAATGGGACTTCACTCAGACCGCAAGCTTTCACTCATTGACTAATTGAAGAAGATGAAGAAAAACTCTACAAACGGTGAAAAGGCTATTCCTTTCCACCTATCTCTAGTGGCTGGTCCATTCAGATGTCAAGTTCCCTAGAGCTTGAATTGGCTCATACAAGCTTGCTTTTCTTGGTAGGGATGAAGTTCCTATTATAGACAGATTTAAGTTGGAAATCTATCCTTCTGAGCATGTCAGCGCCAAAAGCTACTTCCATCATCAAGTCATATGAACTCCATCTATCACTTTTGTCTTCTAGTGAACCTTTTTTCGCATTATTACGTGGTCTTCTTGCCTAGTCATTGTGGGAGTTCCTCTATGA
TTCGCTAGCTCATTCTTTTGTTCTTTAGTCTGTGTTAAGGCACTAAAAATAGGGTGAAACAGGCATGGTTGCTTACGTAAAATGTATTTCTAAATATTTATCCATTTACAACATAAGCTAGGCGTTTTAACATGTTTGACATGCGTTTTGGTCCCATTAGTCTATCTAAAAGACCATAAAACTTGTATTTCTACAAGTTATCAATATCCATGCTAAATCCATTTAATTAATTTAGGACCAAAGTAATGTTTTATTAATATTAGAGAATATGAATACGTCTTATGAGTTTTGTTTGTCTAAACACAGATGCCAAAGCATTGTTTATTCAAGAGAGACAATCGTAGAGATAACAATCGCCTGACAAGCGAGAGCACACATTTAGTTAAATAAGCGGCAAAGAGAAGGTGGCAACACACTCTCATTGCCAAGTTCATCCTTTGCATAAGTTGAAGAAATGTTAACGTGTGCGGCAAAAGCATCGAAAGGTTGCTTGCCTTAAATTAGGATAAATTATTCATAATACAAACAAACTGATTAGATGCAAATCATAGTTTAACTTTAAGTATTTTGATCCCAAGAGGCAAGTAAGTATAATGAAGGCACCAAAATGATGCTCACCTTAATTCTAAAGTTAATAGATGATTTGCATCGCCTCAAAATGTTAATACATATTACATTGCAATTAATTAATTAAATATTCCTTCATCTCATGCCTGTAACTTGAGTTCATTTTCTTTCATCTCATGCCTTTAACTTATTATACATAATTAGAGTAAAATAAATGTTCATATTCACACTGTATAGATTATACCGCATAGGTTTAGCTGCGCGATAGGTTCAATAGGAACATTAATTCCCTGTGTTCAACCTTGGACTTACCAGAAAACCTTTCTTCACTTATATTTGGGCAATAGAAAGAAAAACTTTTATTGCACGCATATCATGAGAAGAAAAACAACGCATAGATAGAATTTTCTTCTTTTATCGCATCATCCTGGTATAGTCGCCTAACAGATCGCTCCATTAGAGCTTAAGATATGACATATCTATGAACAATTGTACGCACATAAAACAAACCAAACATCGCACGCCTCATGGTCGCATGACATCGTACTGCGTTGTCACGCTGCTTGAAATATTCCATGCATAAGTTTCTATCATAGATCTATCATGTTGAGCTCTAAGTGAGCGACTATTAGGTGGCTGAATAAGGTTATGCGTTAAAAGGTACATTCATATTCATGTGTTGCTAACTTTTTACAAATATGCATGTCGTGCAAGTTTTTCTCTCTATCGCCCAAGTGTAAGTGAAGAGAGATTTTTGGGTACGTCCAAGATCGAACACAAGAAATTTAAGCTCCTAATTCTCTTACCGCGCAACTAAATCATGCAGTATAAACTATACAGTAAAGTTATGCACAATATACTTTTGTCCTAACTATCTACACTAAGGTGTTCGGTGGTGCAAGGTGAGCATGGCGAGATGATGGCAATGGAAATTGAACTCATGTATAGGCCTGGTGAGGAAGGAAAAGTTTAATTATCTAATTGTAAGGAATTGTGTTTTAATAATTCAAGGCGATATAAAGCATTTATTAACTTCGAATTAGGTTGAATGACCTCTTGGCACCATTGGCATACTTGCTCACCTCTTGGGATCTAAATACACAAAGTTAAGCTATGATTTATATCTAATCATCTTGGATACATTATTAGTACTTTCTCCTTGATTAAAGCATAAAACCTCTCGATTCTTTCGCCACACACATTGGCATCTCTGCAATGTATGCAAAGGATAAACTTGGTGACAAGATTGTATTACTACAATCTTATCGCCACTTGCTTAGCTAAATGCGTGTTTCCACTTGTCAGGTGATTACATACAACAACATTGTTCTTCTCTTGAATAAACAATGTTTTAACCTCGATGTTTAACTCAACAGAACTCATAACAATGACACTATGATTCTTCAGGTATTAAAAACACATTTATTTGTTGA
CAAGTTAACCAAATAGATTGAACATGAAAAGTAATAGACAAATGAGAGAATAAAAGAATGAATGCATTGCTTTCATAAAAATAACTTTAGGCCTCTCGGCCATAATGTACATCTTACAAAACAATACAAGAAAAATGTAAAATGGAGTACAAGAAAAGGTGTAACACCTCAAAATATAATTTATCATACTCTTTGGCTTTGGTTGCCTATTTATTAGGGAAAAGATTAAGGGGATAAACTCTCTCTCCCTCTTTTGTTCTAAGCTCTAACCTAAAACTCAAGGAAAATGAGAAGATGGACAATGTAGGAGCTTGCTCTCTAGCCGAAATGTGCACACACTGTTCTGAAGTGAGAGGCTCTATTTATCCATAAGGGGTGATGTTCATAGGGCATCCCACACATCATTAATTCTCTAAAAAGTTGTACCAGAGTGCCTTGGAGATGTTCTTTTGTCATTTTCTGGTTTTCGCGCCAACCAAATTTGTATGCTAGTGCTTCATATCGCCTAGCTTGTTTGCTTGGCGGCGCATTGGCTGGATGTGTATCCTCGCGTAAATAGTATATCGCTGAGCTTCACTTCTTTTACCTAAAATCATGTTGTAAGAACACTAAAACGCAACAAAGCAGGGTTAAGGTTCTTTTGTAAAATGTTATTCATAGATTAATCAAATCACCAATTTTTTTCCTTGTTTTTTTCTATTCTCATGCGATTACACTTCATTATTTTACATAAATGCTACAATAACTTGTATCTCTACAAGTTATCACACCTCTAAATTTAAAATAATGCTTGTCCTCAAGCGTCATCTTATGATTTTATACATGATTATTTCTTTTGTGCACGTCGCTAAACCTTTCATAAGCTTTTCTAATTTATCTCGCATTCTAGCGTCTCTTCTTTTCTTCGGCTACTCTAAACTTCAAAATATTTCTAGATTCTAAACGTGGCAAGCTTAAGTATCAAAATACCCTTGTTCACTTTCAAAAAAAATTATTTCCCCTTTTTGCGAAATGGCTGTTGTTTCAAAAGTTCTTAGTTCCTTTATTTTTCTTGTGAAAACTTTATTAGTTAAACTATTTATTTGAATTAGGCGTTGAGTGAGAAATCATAGGTACTCTATGAGTTTGTTCCTCACCACAACTATAATTATTATCGACACTTATTTGATGAGGCTTTGCTTCTTATTAACTTTTGGTGAGAGGTGGCATCATAGGAGATCTTATTTCAATCAAATTCAGAATAAAAGAATAAGTTTGTTTTGTTTTGTTTTTTGGAAAAGCTTGAAAATGAAGGGTTCTTTGAGTTTTTTAAAACTTTAGCTCAGACCTCGCACACCCCCAAATTTAGAGTCTCGCAAGGTCCTTATTGCGAAAAAGTGAATGATAAGATGCACTTTGAAAATTTTACAAAAGAGGTTTGAGGTAAAGCTAGCTCGCCTAAAGATTACCAGAAATATTTACTAAGTTCATTTCGCTCAGGAAAGCATCATCGCCAGGTACAAGTTAAATTAAAGTTAGCATAAATTTCATCATGAAAGTCATCCTTGCGAGGAACAAAAGAAAGCATGCAAATAATTCATAAATGAAGTAAACATCAAGAAAAGTAAATCGAATAAAAAGTTAAGAGACATGGACTTAAGATGACTTTGATTAAGCAACTAAATATTTATAATACAAAAGAACGAAAATGACAAAGAAATTTACAAAAACATTCAAAAGATAACACCCTCAAATTTAGTTCACAAGGGGAGGGTCAAGTTCACCGAGAGTGGGATCTGTTGGCTGAATGGAGGGAATGTCGCGATCAAATCTGGCGTGATAACATATTGAGGCATACGTCGTACAAACAATGCATGTGTATATTCCATTTGAGCCACAAATTGTCTGTCATGTCGCTCCAAAATTTGGCGCAACTTATTGGAATCAAAGAGCATTGGATTGGTCAAGTTGTTGAGGGCAAAGGATATTTACGCAACCATTGAGTTTAATTGGAAGACTGT
GGCTTCTAAACCATCCACTTTGGCATTCAACCTCATGATAGTGTTACTGAGGCTGGCAATTTGCTTCTTCATCGCCATAGGGAGAAGGTTCTTTTGTACAATTTTATTCACATATTAATCAAATTGCCTACTTTCTCCCTTATTTTCTTCTATTCTCATGCGATTAGACTTTATTCTGTTACATCAAAGACTACAATAATTTGTATTTCTACAAGTTATCACACGCCCCGTTCGGCTAGCCACTTCGTGCGATCAGAAGGAGGCGTGTCGCCTAGACACCAAACACATACCTCTCTGTGAGATGGTGAGTTGAACGTCTAGTAAGTAACTTCTTTTACCGTCTCTATCGTTGGCCTCCAACTTCGAACTCAGGAGATCTTTACAACTGAACACAATGAAAAATACTAGTACAACCAACTTAGCTTAATCGCCTAGCCTTCAACACTTGAGCTTAAACTACTAAGCGATAAGCATAACATGGATATACAAACAACACAGCGGAAAGATAGAAAAACATAACTTTAATTTTATTGCTTGATTCAACACTTTACAAGTAAATATTTTTCCTCGCAAGCCACTTAAATAAAACCTAAGTTCTAATCTTTGGGTGTTCCTTCCTCCGCTAAAATGTTACTCTTCACGTCAAGGTGTTTAGGGGTATAAGAACCCCTTTCTTATCCTTCTCCTTGATCACCTAGCAACCTATCTTAAAGATAAGACTCATCCCACACACAAAACTAGGATGTTGCCCTCTTCGGACGTAATAACTTTAAACCATCAGGAGATGAATAATTTGATCAGCTTTACCTTTTCTTACTTGGCCTCAACTCCTGATTACCTGAAGAAGAGAAACCTAAACATGTAAGTCAAATACTTAGAGAGTGTAGATGTTAGAAAACCTTTTTGGGAAAACAAGTCAAGCAAAAAACCATTTCTTTTCTCAATTGCATCTTTCAACGCCTCATTTCATAAGTCTCATATTAGCTTTTCTTATTCCTTTCTACTAAGAACATGAATCTTTCCCCCCGCCAAGCTTACTACGATTTACTCTCGCCAGTATCTTGCGATTGAGCTACTGTGATGTTCTCTCCATCCATAGCATTTGGGAAGTTTCCTTATTCATTCTTTTCTTGTTGCATCAGCCATCCACGACGTCATTCACATTATACACACACCACATAAGCATTGCTCAATTGATAGAAATCAGACTAATGGTTTGCACGCATTCATCACAGTCAGCACATAGAAAGCAGTAGTTTTCTCTTTTTAAACCATAACAACAACAATAATAACACAGAGAGATACAGAACGCATTATGAATAACTCAGACTCAGAAATAATCTATTAAGAAAACGTTTACAAAACTTTAGCTTACAAATCACCCATGCAGAAGTGTTTAAGAAAGTCACTCACGTAGGAGTGTAGAGTTTAAGAAAGTCACTCACACAAGCATTTGGTTCTTTTCTCCTCTAAAACCCTCTCCTCGCCTAGAACTTTCTCAACAGAAGCTCAACTACTTACACTGTCTCAACGAAACCTCCAAATAGCAACACTTTTCATAGGTCTTCACTTCTATTTATACTAATCCTTTAGACATGTTACTCTCTTTGTCTTCCAGATTTAAATTATAACTTTACACGTGTCACCTTACTGTCTTGTTGTACTATGCAGCCAAACCACACTCAACATGTATTCCCTCGAGATGCAAACCTTATTATCTCGGGAAACCAAACTCTTCCTCACCTTATCTTCTTGAGAAGCTAAATTCATTTTTCTCGCCTTCTTATTCCTTGCGATAGACCATAACTGCTTACTTCTTTTCTAACTTCTTTTCACGTTACTTCACCCAATTGCCTAACTTTTCTTGCAATAGAAATCCTTCATCGTGCTAACTTTTCCTCCATCACCTTTTGTTAACGTGACTTCCTTTCCTTTGAATTTTATCTTTGGAATGATCAATCTTTTCATTTTAATAGGATCGTCCAACTCCTCGACGACTT
TCTTCTTCAACAAGACTTCATTTTTCGGATATGGGTCTCACACGTGTCCCACTCCGCGTGCGCATCTAAGCACCCATGCACACACATAGCCTAAGTTATGCCCTACGCCATTTGACTATACATTGGCCATGTTGCGTACATGCTTCTAAGACCAAATAGCCTCTTGCCTTCAAAACACGTGATGTCCATACTTGTTCATGCAACATTCATTCATGATTAATTCATGTTAGGCTATAAGTTAGCAACTTAATAGGTGATTAGAACAGGATGATGCGATACGAGAGGTATTTCCTATGTATTCATTGTTGATTTTCCAATGATATGCATGTAGTACAAGTTTTCCTCTCTATTGTCCAAATGTAAGCGAAGCAAGGTTTCCTGGTATGTCCAAGGTCGAACGCAGAGAATTCATGTTCTTAATTGAACTTATCGCGTAGCTAATCCTTATGCGATATACTGTATAGTGTAATTATGCACAATATATTTATCCTAACTATATACACTAAGGTGTAATTGACAGTGCAAGATGATCATGGCGATGAAGTATGTGAACTCATGTAAAAGTCATGGTGTGAAAGAATATTTAACTAATTAATCGCAATGTATTATGTGTTAATATTTTGAGGAGATGCAAGCATCTATTAATTTCAAAATTAAGGCAAGCGTCCTTTCGGCGCCTTTGCCATACTTGCTCGTCTCTCATGATCCAAATACTTTAAGTTAAGCTATTATTTGTATCTAATTAGTTTGTGTGTGTTACAAGCAACTTGTCCTAATTTAAGGCGAGCATCCTTTCGATGCTTTCGCTACACACGTTAACATTTATGTAGTCTCTTCTGTGATGAGATTGTATAGCTACAATCTCTTCATCATCCGCTTAATTAAATACAAATGTTCACTTGTCAGGTGATTACTATCAACACAATTTACTTTTCTCTTGAAAAAAACAAAGCTTTGATGTTTGCGTTTAGCTAAACAAAACATACATCCAAAAGAAATGCATTATTATTCTAAACAGAACTTTAGGAACTAATTAAATACATTGAAAATGGATAGCAAAAGAGAAATGGGAGAGTGGTAGAAGAAATTCATTAATATTAAAAATAGAACTTTAGGCCTCTCGGCCATGTAGTAAATATTACAACTTAATACATGAAAAATATAAAATGGAATACATAAAATTAATCAAGCCTTCATAATATGATTTCTCATATTCCTTAACTTTGAATTTTGCCTGTGACGAGGAGAATGGTAGGGAAAAACTCTCTTTTTCTCCTTTTAAGCTCTCACTTAAAATGTCACACATCAAGGATGGAGGAAAAAGTAAGCTTGCTCTTTGAGATTCAATTTCACACACTTTGTTCTGAAGTAGATGCCTCCATTTATTGGTCGAGAGTGATGTTGACGGGACATTCCACTCCCCCTACTGAGTTGGTTTTGTCCTTTTTTTCTGACATTTTCCCCTACTTCAGATCACCTAGATTATTGTGATCGTTGCCTTGTTGGTTGCATGGGTCTCCTCTCGACGTTCAATCACCAAGCTTTACTTCTTTTTTCTAAAATCATGCAAAAAGACAAGTAAATCGTGATAAAAATAGGGCTAAAATTCTTTTGCAGTCTGCGATACACAAGTTAATCAATTATTCTACTTTTCCAGCATTTTCTTTCATTCTTATGCGATAAGGCTTCATTATTTTACATAAAAGGCTACAATAACTTGTATTTCTACAAGTTATTAGCCCTAAGCGTCACAACTTGATGCCTCGACCACCTTCAACCTTTACCTAGAATCTTGTTTGAATCCAGAAACTTCAATCAATATCCAAAGCTTATAACTTTTAATCTAGCCAAAAAATGGAAAAGTTACTTCTTCAAACTTTCTTAGAATTGTGTTAAATAACTTACCTTAAAATCCAGCTTCAAATTCGAAGTGATATGTTATGTAGTCTTCAAAGAGTCTTCACTACTTAGAATCCTTTGAAAACTGACCTTTCCCTCTT
CCTTACACTTGAACAACATCTTCATCGTGTCCCTTTTGTTTGGGAGAGCCTCACCTTGCCTACTAGACTCCTTTAGCAAACTTCTACTATTAGTTTGAGAGACAACACATGAAGGAATTGAGAGCATCTTCTATTTGAAAAATTATTATTTATAGAGTTCTATGTGTAATCTAGAATTCAGATAAGTATATTTCTTATTTATTGGGAGTCAAAGGAATGTTACATATTTATATAGAGAATAAACTAAACCTTAGAGACTATGTACAATTACAATAAAGGACATATGATATAATTATAAATATATATATATATATATATCATAACACTATGCATGGTCAACTTTGCCACCTTCTACTTGCCAAAATTTGAAGTCTTGCATATTGTAGACCCTTGTTCTCCACTCAACCGACTGCCTTGAATGTCTAAAGGTCTTTCAAAATGCCTGACAACTTAAATCTTCTCCCAAAGCCTCGTAGCAAGGCCATTTGCGTAGCTAACCACATAAAGCCTTTTGCTCTTGGCTGCATGGATCAACATGCATGCACTTCCCTTGGCTTGCCATAACGCCTAGCGTTGACACATGGAGGTTCTCTTACATAACTTATCTTAGCTTGTCTCAACGCCTTGCAATATGGTCACCCTAAACCTTCTCGCTTAGGTGTGCCTCGACCACCTAGGACATACCTAAAGAGGTTCTCATGTGCAACATAAACAAATCCTTGTCTTGACTCCTAGATATGGTCTTAGAGGTTCTCAACACCTCTTGGCCACCTAGTTGTGTCCAAAAACATCTGTTTTTCAACACTTGGTCAAAATTCAATTAGTCTTCCTTTCTTTGTCTTAGCCAATTAACACTCGGGGAGGGGATGGAGAGATTTGAGTTTATAAGATTAGGAAAAAAAAACAAAATTAATTTATTTGATAAATATTTGAGTAGGGTTTTGTGTTGACATAAAAATTTATAGAATAAATTATAAGATTTCTTGGTTATTTATCCATTTAAGTAATATGATAGTGAATCTAGAGCCTACATCCAAATATAAAAGTCACACCATCATTCCATTAAATAACAAAATCCTAACAACAATTAAAAGTAGGTTACTTTTAAAAGTTTTGGAACTAAAACAAACATATTTTACAATATAGAGAACCAAAATAGAAGAAACAAAAACAAAATCCGTAGAGCAGCAGGTAGTCCGTCACATTATTTGGAAACCCCGTTATATGCAAGGTCCATAGAGCACCACCGATATATAAAATGTTAGGAATTTTGAAGTGTGGGTTTGAATGATTGTGGTAATGTAATGTAATATTAAAGAAAATAAATTGTAAATTTGTTGTTGAAAGGCACATTCAAGTTAAGCCCCCAAAATATCCAAAAACTGAAGCAACATGTACTGGAACACCGGAACCCGGCCCAACCGCTGCTACACATCTCCACCTACACGGTGGCGATGGGGTACACGTGGGTGTGCGCCTCCGCCGTGGCTGACGAGGATATTACCATCGCAGTGACGGTGGACGCCCGAGGGAGATTGGACCCACCTCTACCGGCAACATACTTTGGAAACTACGTGGTCGGGCGGTCAACCGCCTTGAAGAGGGGGAAACTGTTTGGGGAAAACGGAGTAATCGCCGCGGTGGAGACGATATCAGAGATGATTAAAAGCTTGAAAGAAGAGGGACCTCTGAAGGGTGCAGAAAACTGGGTTTTGTTGATGACGCAAACTGTTGTAAATAGCGATTACAAGCTGATTTCCACGACTGGGTCGCCGAGATTTGAGGTGTATAGCGTGGATTTCGGTTGGGGGAAACCGGAGAAGGTGGAAGTTGTGTCGATTAACCGAACCGGAGCGGTTTGTATCTCGGAAAGCCGAGACGGCGGCGGAGTGGAACTTGGGTGGACGGCGAAGAGGGATGTTATGGAGAATTTCGCTAAGCTTTTTGCTGGAAGGTCTTCAACAACTTTGAGTTAACGTTGCTTTCTTTAATACCGGAAA
ATAAGCCATAACTTTATCTGTTTTACTTTTTTTTTTTTAATGAAACTTTATGGTTTACTTTTTATCACGTATTTCAATTTTTTAAAAAAGAAAAAATTAAGCAATTATCTTTACTATTTTAATTAACGGTGGCTTAGTTAGAGAATATTTCAT

mRNA sequence

ATGGCGACAAATCATGGGTCCACCGCCATTAAGGTCCTCGAGGTCTGCACGGTAGCTCCTCCGCCCGGATCCACCGTGCCGGTCACTCTTCCCCTCACCTTCTTCGACATCCTCTGGTTTCGCTTCCCTCCCGTCGAACGCCTCTTCTTCTACAAATCTCCGGTGCCATTCTACGTCATCGTTTCAAACCTCAAAAAATCTCTCTCTCTCGTCCTCCAACACTACCTACCCCTCGCCGGAGCAATTGTATGGCCTGAAAATTCCCCCAAACCGGCCGTCGAAACCGCCGTCCGCGACGGCATTGTGCTTACAGTCGCGGAGTCTGAGGACGACTTCGACCATCTCATCGGCGACGGGTTGCGTAAAGAGGCAAAACTTCGGCCGTTGGTGGCGGAGCTTGCAGCAGAGGAGGATCGGGCGGCAGTGGTGGCTGTGCAGGTAACCTGGTTTGGGAATGGTGGATTTAGCATCGGAATAACTTCACATCACGCAGTTCTGGACGGAAGGTCGTCGACTTCTTTCATGAAATCGTGGGCCGGATTGTGTAAGAATTTGGTTGGGGGCGGTGAGATTTTTTGCCCGGCAGCTGAGACGATGCCGTTTTATGATCGAAGCGTGGTGACAGATAAGATGGGACTTGAAGCCATTTATTTGAAATGCTTGTTGGCCCATGAAGGGCCCAACAATAGAAGCTTGAAATTTTGGGACTTCAAAACTCCGCCAGATTCATTCCGAGGCACATTCAAGTTAAGCCCCCAAAATATCCAAAAACTGAAGCAACATGTACTGGAACACCGGAACCCGGCCCAACCGCTGCTACACATCTCCACCTACACGGTGGCGATGGGGTACACGTGGGTGTGCGCCTCCGCCGTGGCTGACGAGGATATTACCATCGCAGTGACGGTGGACGCCCGAGGGAGATTGGACCCACCTCTACCGGCAACATACTTTGGAAACTACGTGGTCGGGCGGTCAACCGCCTTGAAGAGGGGGAAACTGTTTGGGGAAAACGGAGTAATCGCCGCGGTGGAGACGATATCAGAGATGATTAAAAGCTTGAAAGAAGAGGGACCTCTGAAGGGTGCAGAAAACTGGGTTTTGTTGATGACGCAAACTGTTGTAAATAGCGATTACAAGCTGATTTCCACGACTGGGTCGCCGAGATTTGAGGTGTATAGCGTGGATTTCGGTTGGGGGAAACCGGAGAAGGTGGAAGTTGTGTCGATTAACCGAACCGGAGCGGTTTGTATCTCGGAAAGCCGAGACGGCGGCGGAGTGGAACTTGGGTGGACGGCGAAGAGGGATGTTATGGAGAATTTCGCTAAGCTTTTTGCTGGAAGGTCTTCAACAACTTTGAGTTAA

Coding sequence (CDS)

ATGGCGACAAATCATGGGTCCACCGCCATTAAGGTCCTCGAGGTCTGCACGGTAGCTCCTCCGCCCGGATCCACCGTGCCGGTCACTCTTCCCCTCACCTTCTTCGACATCCTCTGGTTTCGCTTCCCTCCCGTCGAACGCCTCTTCTTCTACAAATCTCCGGTGCCATTCTACGTCATCGTTTCAAACCTCAAAAAATCTCTCTCTCTCGTCCTCCAACACTACCTACCCCTCGCCGGAGCAATTGTATGGCCTGAAAATTCCCCCAAACCGGCCGTCGAAACCGCCGTCCGCGACGGCATTGTGCTTACAGTCGCGGAGTCTGAGGACGACTTCGACCATCTCATCGGCGACGGGTTGCGTAAAGAGGCAAAACTTCGGCCGTTGGTGGCGGAGCTTGCAGCAGAGGAGGATCGGGCGGCAGTGGTGGCTGTGCAGGTAACCTGGTTTGGGAATGGTGGATTTAGCATCGGAATAACTTCACATCACGCAGTTCTGGACGGAAGGTCGTCGACTTCTTTCATGAAATCGTGGGCCGGATTGTGTAAGAATTTGGTTGGGGGCGGTGAGATTTTTTGCCCGGCAGCTGAGACGATGCCGTTTTATGATCGAAGCGTGGTGACAGATAAGATGGGACTTGAAGCCATTTATTTGAAATGCTTGTTGGCCCATGAAGGGCCCAACAATAGAAGCTTGAAATTTTGGGACTTCAAAACTCCGCCAGATTCATTCCGAGGCACATTCAAGTTAAGCCCCCAAAATATCCAAAAACTGAAGCAACATGTACTGGAACACCGGAACCCGGCCCAACCGCTGCTACACATCTCCACCTACACGGTGGCGATGGGGTACACGTGGGTGTGCGCCTCCGCCGTGGCTGACGAGGATATTACCATCGCAGTGACGGTGGACGCCCGAGGGAGATTGGACCCACCTCTACCGGCAACATACTTTGGAAACTACGTGGTCGGGCGGTCAACCGCCTTGAAGAGGGGGAAACTGTTTGGGGAAAACGGAGTAATCGCCGCGGTGGAGACGATATCAGAGATGATTAAAAGCTTGAAAGAAGAGGGACCTCTGAAGGGTGCAGAAAACTGGGTTTTGTTGATGACGCAAACTGTTGTAAATAGCGATTACAAGCTGATTTCCACGACTGGGTCGCCGAGATTTGAGGTGTATAGCGTGGATTTCGGTTGGGGGAAACCGGAGAAGGTGGAAGTTGTGTCGATTAACCGAACCGGAGCGGTTTGTATCTCGGAAAGCCGAGACGGCGGCGGAGTGGAACTTGGGTGGACGGCGAAGAGGGATGTTATGGAGAATTTCGCTAAGCTTTTTGCTGGAAGGTCTTCAACAACTTTGAGTTAA

Protein sequence

MATNHGSTAIKVLEVCTVAPPPGSTVPVTLPLTFFDILWFRFPPVERLFFYKSPVPFYVIVSNLKKSLSLVLQHYLPLAGAIVWPENSPKPAVETAVRDGIVLTVAESEDDFDHLIGDGLRKEAKLRPLVAELAAEEDRAAVVAVQVTWFGNGGFSIGITSHHAVLDGRSSTSFMKSWAGLCKNLVGGGEIFCPAAETMPFYDRSVVTDKMGLEAIYLKCLLAHEGPNNRSLKFWDFKTPPDSFRGTFKLSPQNIQKLKQHVLEHRNPAQPLLHISTYTVAMGYTWVCASAVADEDITIAVTVDARGRLDPPLPATYFGNYVVGRSTALKRGKLFGENGVIAAVETISEMIKSLKEEGPLKGAENWVLLMTQTVVNSDYKLISTTGSPRFEVYSVDFGWGKPEKVEVVSINRTGAVCISESRDGGGVELGWTAKRDVMENFAKLFAGRSSTTLS*
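The protein sequence above corresponds to a codon-by-codon translation of the CDS. A minimal sketch of that translation follows, assuming the standard genetic code; the `translate` helper is hypothetical and not part of the database. It is shown on the first six codons of the CDS rather than the full sequence.

```python
import itertools

# Standard genetic code, codons ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(itertools.product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a CDS (length divisible by 3) into one-letter amino acids."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# First 18 bases of the CDS listed above.
print(translate("ATGGCGACAAATCATGGG"))  # prints MATNHG
```

Stop codons are emitted as `*`, matching the trailing `*` in the protein sequence.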
Homology
BLAST of CsaV3_3G020720 vs. NCBI nr
Match: XP_004145869.2 (phenolic glucoside malonyltransferase 1 [Cucumis sativus] >KAE8650582.1 hypothetical protein Csa_010647 [Cucumis sativus])

HSP 1 Score: 920.2 bits (2377), Expect = 6.8e-264
Identity = 454/454 (100.00%), Positives = 454/454 (100.00%), Query Frame = 0

Query: 1   MATNHGSTAIKVLEVCTVAPPPGSTVPVTLPLTFFDILWFRFPPVERLFFYKSPVPFYVI 60
           MATNHGSTAIKVLEVCTVAPPPGSTVPVTLPLTFFDILWFRFPPVERLFFYKSPVPFYVI
Sbjct: 1   MATNHGSTAIKVLEVCTVAPPPGSTVPVTLPLTFFDILWFRFPPVERLFFYKSPVPFYVI 60

Query: 61  VSNLKKSLSLVLQHYLPLAGAIVWPENSPKPAVETAVRDGIVLTVAESEDDFDHLIGDGL 120
           VSNLKKSLSLVLQHYLPLAGAIVWPENSPKPAVETAVRDGIVLTVAESEDDFDHLIGDGL
Sbjct: 61  VSNLKKSLSLVLQHYLPLAGAIVWPENSPKPAVETAVRDGIVLTVAESEDDFDHLIGDGL 120

Query: 121 RKEAKLRPLVAELAAEEDRAAVVAVQVTWFGNGGFSIGITSHHAVLDGRSSTSFMKSWAG 180
           RKEAKLRPLVAELAAEEDRAAVVAVQVTWFGNGGFSIGITSHHAVLDGRSSTSFMKSWAG
Sbjct: 121 RKEAKLRPLVAELAAEEDRAAVVAVQVTWFGNGGFSIGITSHHAVLDGRSSTSFMKSWAG 180

Query: 181 LCKNLVGGGEIFCPAAETMPFYDRSVVTDKMGLEAIYLKCLLAHEGPNNRSLKFWDFKTP 240
           LCKNLVGGGEIFCPAAETMPFYDRSVVTDKMGLEAIYLKCLLAHEGPNNRSLKFWDFKTP
Sbjct: 181 LCKNLVGGGEIFCPAAETMPFYDRSVVTDKMGLEAIYLKCLLAHEGPNNRSLKFWDFKTP 240

Query: 241 PDSFRGTFKLSPQNIQKLKQHVLEHRNPAQPLLHISTYTVAMGYTWVCASAVADEDITIA 300
           PDSFRGTFKLSPQNIQKLKQHVLEHRNPAQPLLHISTYTVAMGYTWVCASAVADEDITIA
Sbjct: 241 PDSFRGTFKLSPQNIQKLKQHVLEHRNPAQPLLHISTYTVAMGYTWVCASAVADEDITIA 300

Query: 301 VTVDARGRLDPPLPATYFGNYVVGRSTALKRGKLFGENGVIAAVETISEMIKSLKEEGPL 360
           VTVDARGRLDPPLPATYFGNYVVGRSTALKRGKLFGENGVIAAVETISEMIKSLKEEGPL
Sbjct: 301 VTVDARGRLDPPLPATYFGNYVVGRSTALKRGKLFGENGVIAAVETISEMIKSLKEEGPL 360

Query: 361 KGAENWVLLMTQTVVNSDYKLISTTGSPRFEVYSVDFGWGKPEKVEVVSINRTGAVCISE 420
           KGAENWVLLMTQTVVNSDYKLISTTGSPRFEVYSVDFGWGKPEKVEVVSINRTGAVCISE
Sbjct: 361 KGAENWVLLMTQTVVNSDYKLISTTGSPRFEVYSVDFGWGKPEKVEVVSINRTGAVCISE 420

Query: 421 SRDGGGVELGWTAKRDVMENFAKLFAGRSSTTLS 455
           SRDGGGVELGWTAKRDVMENFAKLFAGRSSTTLS
Sbjct: 421 SRDGGGVELGWTAKRDVMENFAKLFAGRSSTTLS 454

BLAST of CsaV3_3G020720 vs. NCBI nr
Match: XP_008465393.1 (PREDICTED: phenolic glucoside malonyltransferase 1-like isoform X1 [Cucumis melo])

HSP 1 Score: 831.6 bits (2147), Expect = 3.2e-237
Identity = 406/446 (91.03%), Positives = 428/446 (95.96%), Query Frame = 0

Query: 1   MATNHGSTAIKVLEVCTVAPPPGSTVPVTLPLTFFDILWFRFPPVERLFFYKSPVPFYVI 60
           MAT+HGSTAIKVLE+CTVAPPPGSTVP TLPLTFFDILWFRFPPVERLFFYKSPVPF+VI
Sbjct: 37  MATDHGSTAIKVLEICTVAPPPGSTVPATLPLTFFDILWFRFPPVERLFFYKSPVPFHVI 96

Query: 61  VSNLKKSLSLVLQHYLPLAGAIVWPENSPKPAVETAVRDGIVLTVAESEDDFDHLIGDGL 120
           VSNLKKSLSLVLQHYLPLAGAIVWPENSPKPAVETA  DGIVLTVAES+DDFDHL+GDGL
Sbjct: 97  VSNLKKSLSLVLQHYLPLAGAIVWPENSPKPAVETAAGDGIVLTVAESDDDFDHLVGDGL 156

Query: 121 RKEAKLRPLVAELAAEEDRAAVVAVQVTWFGNGGFSIGITSHHAVLDGRSSTSFMKSWAG 180
           R+EAKL+PLVAELAAEE+RAAVVAVQVTWFGNG FSIGITSHHA+LDGRSSTSFMKSWAG
Sbjct: 157 REEAKLQPLVAELAAEEERAAVVAVQVTWFGNGRFSIGITSHHAILDGRSSTSFMKSWAG 216

Query: 181 LCKNLVGGGEIFCPAAETMPFYDRSVVTDKMGLEAIYLKCLLAHEGPNNRSLKFWDFKTP 240
           LCKNLVGGG+IF PAAETMPFYDRSVVTD +GLEAIYL+C LAHEGPNNRSLKFWD KTP
Sbjct: 217 LCKNLVGGGDIFFPAAETMPFYDRSVVTDNVGLEAIYLECWLAHEGPNNRSLKFWDVKTP 276

Query: 241 PDSFRGTFKLSPQNIQKLKQHVLEHRNPAQPLLHISTYTVAMGYTWVCASAVADEDITIA 300
           PD FRGTFKLSPQ+IQKLKQHVL+HRNP QP  HISTYTVAMGYTWVCASAVADE+I+I 
Sbjct: 277 PDLFRGTFKLSPQDIQKLKQHVLKHRNPVQPPPHISTYTVAMGYTWVCASAVADEEISIG 336

Query: 301 VTVDARGRLDPPLPATYFGNYVVGRSTALKRGKLFGENGVIAAVETISEMIKSLKEEGPL 360
           VT+DARGR+ PPLPATYFGN VVGRSTAL+RGKL GENGVIAAVETISEMIKSLKEEGPL
Sbjct: 337 VTMDARGRVYPPLPATYFGNCVVGRSTALERGKLLGENGVIAAVETISEMIKSLKEEGPL 396

Query: 361 KGAENWVLLMTQTVVNSDYKLISTTGSPRFEVYSVDFGWGKPEKVEVVSINRTGAVCISE 420
           KGAENWVLLMTQTVVN+DYKLIST GSPRFEVYSVDFGWGKPEKVEVVSINRTGAVCISE
Sbjct: 397 KGAENWVLLMTQTVVNNDYKLISTAGSPRFEVYSVDFGWGKPEKVEVVSINRTGAVCISE 456

Query: 421 SRDGGGVELGWTAKRDVMENFAKLFA 446
           SR+GGGVE GWTA+RDVMENFAKLFA
Sbjct: 457 SRNGGGVEHGWTARRDVMENFAKLFA 482

BLAST of CsaV3_3G020720 vs. NCBI nr
Match: XP_038875303.1 (malonyl-CoA:anthocyanidin 5-O-glucoside-6''-O-malonyltransferase-like [Benincasa hispida] >XP_038875311.1 malonyl-CoA:anthocyanidin 5-O-glucoside-6''-O-malonyltransferase-like [Benincasa hispida] >XP_038878095.1 malonyl-CoA:anthocyanidin 5-O-glucoside-6''-O-malonyltransferase-like [Benincasa hispida])

HSP 1 Score: 664.8 bits (1714), Expect = 5.2e-187
Identity = 332/447 (74.27%), Positives = 368/447 (82.33%), Query Frame = 0

Query: 4   NHGSTAIKVLEVCTVAPPPGSTVPVTLPLTFFDILWFRFPPVERLFFYKSPVPFYVIVSN 63
           +HGS  IKVLE+CT++PPPGS VP +LPLTFFDILWFRFPPVERLFFYKSP  F+ I+ N
Sbjct: 5   HHGS--IKVLEICTISPPPGSAVPASLPLTFFDILWFRFPPVERLFFYKSPAAFHAILLN 64

Query: 64  LKKSLSLVLQHYLPLAGAIVWPENSPKPAVETAVRDGIVLTVAESEDDFDHLIGDGLRKE 123
           LK SLS VLQHYLPLAGAIVWPE SPKPAV T   DG+VLTVAES+ DFDHL+ DGLR+E
Sbjct: 65  LKNSLSRVLQHYLPLAGAIVWPETSPKPAVVTTAGDGVVLTVAESDADFDHLVSDGLREE 124

Query: 124 AKLRPLVAELAAEEDRAAVVAVQVTWFGNGGFSIGITSHHAVLDGRSSTSFMKSWAGLCK 183
           AKLRPLVAELA EE+RAAV+AVQVT FG GGFSIGIT+HHA+LDGRSSTSF+KSWAGLCK
Sbjct: 125 AKLRPLVAELAVEEERAAVMAVQVTSFGKGGFSIGITAHHAILDGRSSTSFVKSWAGLCK 184

Query: 184 NLVGGGEIFCPAAETMPFYDRSVVTDKMGLEAIYLKCLLAHEGPNNRSLKFWDFKTPPDS 243
           NLV GGE   PAAETMPFYDRSVV D  GLEAIYLK  LA  GPNNRSLK +D   PPD 
Sbjct: 185 NLVAGGEPVSPAAETMPFYDRSVVADPAGLEAIYLKSWLAMGGPNNRSLKCFDVTIPPDL 244

Query: 244 FRGTFKLSPQNIQKLKQHVLEHRNPAQPLLHISTYTVAMGYTWVCASAV----ADEDITI 303
           FRGTFKL+ QNIQKLKQ VL HR P  P LH+ST+TV M YTWVC S       D +   
Sbjct: 245 FRGTFKLNLQNIQKLKQFVLRHRTPTHPPLHVSTFTVTMAYTWVCTSVADGSPPDGERAF 304

Query: 304 AVTVDARGRLDPPLPATYFGNYVVGRSTALKRGKLFGENGVIAAVETISEMIKSLKEEGP 363
            ++VDARGRLDPPLP TYFGN VV R  AL+R KL G+ GV+AAVE IS+MIKSL+EEGP
Sbjct: 305 GMSVDARGRLDPPLPVTYFGNCVVVRGIALERAKLVGQKGVVAAVEVISDMIKSLEEEGP 364

Query: 364 LKGAENWVLLMTQTVVNSDYKLISTTGSPRFEVYSVDFGWGKPEKVEVVSINRTGAVCIS 423
           L GAENWV L+ +   N ++K IST GSP+FEVYSVDFGWG PEKVEVVSI+ TGAVCIS
Sbjct: 365 LNGAENWVSLIAEGAKN-NWKPISTAGSPKFEVYSVDFGWGTPEKVEVVSIDATGAVCIS 424

Query: 424 ESRDGGGVELGWTAKRDVMENFAKLFA 447
           ESRDGGGVE+GWTAKRDVMENFA +FA
Sbjct: 425 ESRDGGGVEIGWTAKRDVMENFAAVFA 448

BLAST of CsaV3_3G020720 vs. NCBI nr
Match: XP_022948949.1 (phenolic glucoside malonyltransferase 2-like [Cucurbita moschata])

HSP 1 Score: 565.8 bits (1457), Expect = 3.3e-157
Identity = 291/454 (64.10%), Positives = 345/454 (75.99%), Query Frame = 0

Query: 1   MATNHGSTAIKVLEVCTVAPPPGSTVPVTLPLTFFDILWFRFPPVERLFFYKSPVPFYVI 60
           MA  H S  + VL +CTV PP GS VP +LPLTFFDILW RFPPV+R+FFYKS  PF V+
Sbjct: 12  MAEIHPS--VNVLHLCTVPPPHGSLVPFSLPLTFFDILWLRFPPVQRIFFYKSSAPFDVV 71

Query: 61  VSNLKKSLSLVLQHYLPLAGAIVWPENSPKPAVETAVRDGIVLTVAESEDDFDHLIGDGL 120
           VS LK SLS  LQHY PLAGA+VWPENSPKPAV+T + DGI+LT+A+S+  F HL+ DGL
Sbjct: 72  VSTLKNSLSAALQHYPPLAGAVVWPENSPKPAVQTVLGDGILLTLAKSDSKFSHLVSDGL 131

Query: 121 RKEAKLRPLVAELAAEEDRAAVVAVQVTWFGNGGFSIGITSHHAVLDGRSSTSFMKSWAG 180
           R+ A+   LV  L A +DRAAV+A+QVT FG  GF IGITSHHA+LDGR+STSF+K WA 
Sbjct: 132 REAAEFHTLVPRLPAADDRAAVMALQVTSFGTDGFCIGITSHHAILDGRTSTSFVKLWAR 191

Query: 181 LCKNLVGGGEIFCP---AAETMPFYDRSVVTDKMGLEAIYLKCLLAHEGPNNRSLKFWDF 240
           LCKNLV GGE   P   AAETMPFYDRSV+ D  GLE I+L+  LAH G +N+SLKFW  
Sbjct: 192 LCKNLVAGGESAEPVSTAAETMPFYDRSVIVDPRGLEGIFLRDWLAHGGSDNKSLKFWSP 251

Query: 241 KTPPDSFRGTFKLSPQNIQKLKQHVLEHRNPAQPLLHISTYTVAMGYTWVCASAVA---- 300
             P   FRGTFKL+PQNIQKLKQ VL  RNP  P +HIST+TVAM YTWVC +AVA    
Sbjct: 252 SIPQGLFRGTFKLNPQNIQKLKQLVLNRRNPVHPPVHISTFTVAMAYTWVC-TAVADGSP 311

Query: 301 -DEDITIAVTVDARGRLDPPLPATYFGNYVVGRSTALKRGKLFGENGVIAAVETISEMIK 360
            D +I+  ++VDAR  LDPP+PA YFGN +VGR+T  +R KL GENG++ AVE IS+ IK
Sbjct: 312 NDGEISFGLSVDARRWLDPPVPANYFGNCLVGRTTDQERAKLVGENGLVTAVEGISKAIK 371

Query: 361 SLKEEGPLKGAENWVLLMTQTVVNSDYKLISTTGSPRFEVYSVDFGWGKPEKVEVVSINR 420
           SL+E+G L GAE WV L+TQ V  S+ K+++T GSPRFE+YSVDFGWG P KVEVVSI+ 
Sbjct: 372 SLEEKGALDGAEQWVSLLTQ-VSGSNRKMLTTAGSPRFELYSVDFGWGTPAKVEVVSIDE 431

Query: 421 TGAVCISESRDGGGVELGWTAKRDVMENFAKLFA 447
           TGAV + + RD GGVELGW AK+DVME FA  FA
Sbjct: 432 TGAVSVCDGRD-GGVELGWVAKKDVMEAFAAAFA 460

BLAST of CsaV3_3G020720 vs. NCBI nr
Match: KAG6607032.1 (Phenolic glucoside malonyltransferase 2, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 565.8 bits (1457), Expect = 3.3e-157
Identity = 291/454 (64.10%), Positives = 346/454 (76.21%), Query Frame = 0

Query: 1   MATNHGSTAIKVLEVCTVAPPPGSTVPVTLPLTFFDILWFRFPPVERLFFYKSPVPFYVI 60
           MA  H S  + VL +CTV PP GS VP +LPLTFFDILW RFPPV+R+FFYKS  PF V+
Sbjct: 1   MAEIHPS--VNVLHLCTVPPPHGSLVPFSLPLTFFDILWLRFPPVQRIFFYKSSAPFDVV 60

Query: 61  VSNLKKSLSLVLQHYLPLAGAIVWPENSPKPAVETAVRDGIVLTVAESEDDFDHLIGDGL 120
           VS LK SLS  LQHY PLAGA+VWPENSPKPAV+T + DGI+LT+A+S+  F HL+ DGL
Sbjct: 61  VSTLKNSLSAALQHYPPLAGAVVWPENSPKPAVQTVLGDGILLTLAKSDSKFSHLVSDGL 120

Query: 121 RKEAKLRPLVAELAAEEDRAAVVAVQVTWFGNGGFSIGITSHHAVLDGRSSTSFMKSWAG 180
           R+ A+   LV  L A +DRAAV+A+QVT FG  GF IGITSHHA+LDGR+STSF+K WA 
Sbjct: 121 REAAEFHTLVPRLPAADDRAAVMALQVTSFGTDGFCIGITSHHAILDGRTSTSFVKLWAR 180

Query: 181 LCKNLVGGGEIFCP---AAETMPFYDRSVVTDKMGLEAIYLKCLLAHEGPNNRSLKFWDF 240
           LCKNLV GGE   P   AAETMPFYDRSV+ D  GLE I+L+  LAH G +N+SLKFW  
Sbjct: 181 LCKNLVAGGESAEPVSTAAETMPFYDRSVIVDPRGLEGIFLRDWLAHGGSDNKSLKFWSP 240

Query: 241 KTPPDSFRGTFKLSPQNIQKLKQHVLEHRNPAQPLLHISTYTVAMGYTWVCASAVA---- 300
             P   FRGTFKL+PQNIQ+LKQ VL  RNP  P +HIST+TVAM YTWVC +AVA    
Sbjct: 241 SIPQGLFRGTFKLNPQNIQELKQLVLNRRNPVHPPVHISTFTVAMAYTWVC-TAVADGSP 300

Query: 301 -DEDITIAVTVDARGRLDPPLPATYFGNYVVGRSTALKRGKLFGENGVIAAVETISEMIK 360
            D +I+  ++VDAR  LDPP+PA YFGN +VGR+T  +R KL GENG++AAVE IS+ IK
Sbjct: 301 NDGEISFGLSVDARRWLDPPVPANYFGNCLVGRATDQERAKLVGENGLVAAVEGISKAIK 360

Query: 361 SLKEEGPLKGAENWVLLMTQTVVNSDYKLISTTGSPRFEVYSVDFGWGKPEKVEVVSINR 420
           SL+E+G L GAE WV L+TQ V  S+ K+++T GSPRFE+YSVDFGWG P KVEVVSI+ 
Sbjct: 361 SLEEKGALDGAEQWVSLLTQ-VSGSNRKMLTTAGSPRFELYSVDFGWGTPAKVEVVSIDE 420

Query: 421 TGAVCISESRDGGGVELGWTAKRDVMENFAKLFA 447
           TGAV + + RD GGVELGW AK+DVME FA  FA
Sbjct: 421 TGAVSVCDGRD-GGVELGWVAKKDVMEAFAAAFA 449

BLAST of CsaV3_3G020720 vs. ExPASy Swiss-Prot
Match: Q940Z5 (Phenolic glucoside malonyltransferase 1 OS=Arabidopsis thaliana OX=3702 GN=PMAT1 PE=1 SV=1)

HSP 1 Score: 291.6 bits (745), Expect = 1.6e-77
Identity = 182/465 (39.14%), Positives = 258/465 (55.48%), Query Frame = 0

Query: 1   MATNHGSTAIKVLEVCTVAPP-PGSTVPVTLPLTFFDILWFRFPPVERLFFYK---SPVP 60
           M      +++KV++V  V P    S+  +TLPLTFFD+LW++   VER+ FYK   +  P
Sbjct: 1   MVNEEMESSLKVIDVARVTPSNSDSSESLTLPLTFFDLLWYKLHAVERVIFYKLTDASRP 60

Query: 61  FY--VIVSNLKKSLSLVLQHYLPLAGAIVWPENSPKPAVETAVRDGIVLTVAESEDDFDH 120
           F+  VIV NLK SLS  L HYLPLAG +VW    PKP +     D +  TVAES  DF  
Sbjct: 61  FFDSVIVPNLKTSLSSSLSHYLPLAGKLVWEPLDPKPKIVYTPNDAVSFTVAESNADFSR 120

Query: 121 LIGDGLRKEAKLRPLVAELAAEEDRAAVVAVQVTWFGNGGFSIGITSHHAVLDGRSSTSF 180
           L G       +L PLV EL   +D A+ V+ QVT F N GF I + +HHAVLDG+++T+F
Sbjct: 121 LTGKEPFPTTELYPLVPELHVSDDSASAVSFQVTLFPNQGFCISVNAHHAVLDGKTTTNF 180

Query: 181 MKSWAGLCKNLVGGGEIFCPAAETMPFYDRSVVTDKMGLEAIYL-------KCLLAHEGP 240
           +KSWA  CKN     + F P  + +P YDR+V+ D M L+   L       K     + P
Sbjct: 181 LKSWARTCKN----QDSFLP-QDLIPVYDRTVIKDPMDLDTKILNAWHRVAKVFTGGKEP 240

Query: 241 NN-RSLK-FWDFKTPPDSFRGTFKLSPQNIQKLKQHVLEHRNPAQ-----PLLHISTYTV 300
            N +SLK  W  +  PD FR T  L+ ++IQKL++ + +  + +        L +ST+ +
Sbjct: 241 ENPKSLKLLWSPEIGPDVFRYTLNLTREDIQKLRERLKKESSSSSVSSSPKELRLSTFVI 300

Query: 301 AMGYTWVCASAVADED----ITIAVTVDARGRLDPPLPATYFGNYVVG-RSTALKRGKLF 360
              Y   C       D    +     VD R  + PP+P++YFGN V      +L      
Sbjct: 301 VYSYALTCLIKARGGDPSRPVGYGFAVDCRSLMVPPVPSSYFGNCVSACFKMSLTAETFM 360

Query: 361 GENGVIAAVETISEMIKSLKEEGPLKGAENWVLLMTQTVVNSDYKLISTTGSPRFEVYSV 420
            E G +AA   +S+ +++L E   LK  E   +L   T ++   +++S  GS RF VY +
Sbjct: 361 SEEGFLAAARMVSDSVEALDENVALKIPE---ILEGFTTLSPGTQVLSVAGSTRFGVYGL 420

Query: 421 DFGWGKPEKVEVVSINRTGAVCISESRDG-GGVELGWTAKRDVME 440
           DFGWG+PEKV VVSI++  A+  +ESRDG GGVELG++ K+  M+
Sbjct: 421 DFGWGRPEKVVVVSIDQGEAISFAESRDGSGGVELGFSLKKHEMD 457

BLAST of CsaV3_3G020720 vs. ExPASy Swiss-Prot
Match: Q9LJB4 (Malonyl-CoA:anthocyanidin 5-O-glucoside-6''-O-malonyltransferase OS=Arabidopsis thaliana OX=3702 GN=5MAT PE=1 SV=1)

HSP 1 Score: 280.0 bits (715), Expect = 4.7e-74
Identity = 179/455 (39.34%), Positives = 256/455 (56.26%), Query Frame = 0

Query: 7   STAIKVLEVCTVAPPPGSTVPVTLPLTFFDILWFRFPPVERLFFYKSP-VPFYVIVSNLK 66
           ++A+ +LEV  V+PP  S+  +TLPLT+FD+ W +  PV+R+ FY  P +    ++S LK
Sbjct: 5   NSAVNILEVVQVSPP--SSNSLTLPLTYFDLGWLKLHPVDRVLFYHVPELTRSSLISKLK 64

Query: 67  KSLSLVLQHYLPLAGAIVWPENSPKPAVETAV--RDGIVLTVAESEDDFDHLIGDGLRKE 126
            SLS  L HYLPLAG +VW     KP++  +   +D + LTVAES  D  HL GD  R  
Sbjct: 65  SSLSATLLHYLPLAGRLVWDSIKTKPSIVYSPDDKDAVYLTVAESNGDLSHLSGDEPRPA 124

Query: 127 AKLRPLVAELAAEEDRAAVVAVQVTWFGNGGFSIGITSHHAVLDGRSSTSFMKSWAGLCK 186
            +   LV EL   ++ A V+AVQVT+F N GFS+G+T+HHAVLDG+++  F+K+WA  CK
Sbjct: 125 TEFHSLVPELPVSDESARVLAVQVTFFPNQGFSLGVTAHHAVLDGKTTAMFLKAWAHNCK 184

Query: 187 NLVGGGEIFCPAAETMPFYDRSVVTDKMGLEAIYLKCLLAHEGPNNRSLKFWDFK-TPPD 246
                 E      + +P  DR +V D  GLE   L   ++    N  SLK +  K    D
Sbjct: 185 Q-----EQEALPHDLVPSLDRIIVQDPTGLETKLLNRWISASN-NKPSLKLFPSKIIGSD 244

Query: 247 SFRGTFKLSPQNIQKLKQHVLEHRNPAQPLLHISTYTVAMGYTWVCASAVADEDITIAVT 306
             R T++L+ ++I+KL++ V    +  Q  L +ST+ +   Y   C   +   D T  V 
Sbjct: 245 ILRVTYRLTREDIKKLRERVETESHAKQ--LRLSTFVITYAYVITCMVKMRGGDPTRFVC 304

Query: 307 V----DARGRLDPPLPATYFGNYVVGRST-ALKRGKLFGE---NGVIAAVETISEMIKSL 366
           V    D R RL+PPLP T+FGN +VG     +K   +  E    G I AVET++  +  L
Sbjct: 305 VGFASDFRSRLNPPLPPTFFGNCIVGSGDFDVKAEPILEEGEGKGFITAVETLTGWVNGL 364

Query: 367 KEEGPLKGAENWVLLMTQTV--VNSDYKLISTTGSPRFEVYSVDFGWGKPEKVEVVSINR 426
             E      E  +LL  +    +    ++IS  GS R  +Y  DFGWGKP KVE+V+I++
Sbjct: 365 CPE----NIEKNMLLPFEAFKRMEPGRQMISVAGSTRLGIYGSDFGWGKPVKVEIVTIDK 424

Query: 427 TGAVCISESRDG-GGVELGWTAKRDVMENFAKLFA 447
             +V +SES DG GGVE+G   K+D +E F  LF+
Sbjct: 425 DASVSLSESGDGSGGVEVGVCLKKDDVERFGSLFS 445

BLAST of CsaV3_3G020720 vs. ExPASy Swiss-Prot
Match: Q9LRQ8 (Phenolic glucoside malonyltransferase 2 OS=Arabidopsis thaliana OX=3702 GN=PMAT2 PE=1 SV=1)

HSP 1 Score: 266.9 bits (681), Expect = 4.1e-70
Identity = 164/454 (36.12%), Positives = 242/454 (53.30%), Query Frame = 0

Query: 10  IKVLEVCTVAPPPGSTVPVT----LPLTFFDILWFRFPPVERLFFYKSPVP-----FYVI 69
           + V+E   V P   S +       LPLTFFD+ W  F PV+R+FFY+           +I
Sbjct: 3   LHVIETARVTPTDYSVINSANLHKLPLTFFDLPWLLFQPVKRVFFYELTESTRDHFHSII 62

Query: 70  VSNLKKSLSLVLQHYLPLAGAIVWPENSPKPAVETAVRDGIVLTVAESEDDFDHLIGDGL 129
           +  LK SLSL+L++YLPL G I W  N PKP++  +    +++T+AES+ DF HL G G 
Sbjct: 63  LPKLKDSLSLILRNYLPLTGHITWEPNEPKPSIIVSENGVVLVTIAESDADFSHLSGYGQ 122

Query: 130 RKEAKLRPLVAELAAEEDRAAVVAVQVTWFGNGGFSIGITSHHAVLDGRSSTSFMKSWAG 189
           R  ++L  LV +L   +D A   ++Q+T F N GFSIG+ +HHAVLDG++S++F+K+WA 
Sbjct: 123 RPLSELHALVPKLPVSDDSATAFSIQITLFPNQGFSIGVAAHHAVLDGKTSSTFIKAWAQ 182

Query: 190 LCKNLVGGGEIFCPAAETMPFYDRSVVTDKMGLE--AIYLKCLLAHEGPNNRSL-KFWDF 249
           +CK      E+        P YDRS++     L+   I L   L  +  N RSL      
Sbjct: 183 ICKQ-----ELQSMPENLTPSYDRSLIKYPTYLDEKMIELVRSLKEDQTNIRSLTSLPSS 242

Query: 250 KTPPDSFRGTFKLSPQNIQKLKQHVLEHRNPAQPLLHISTYTVAMGYTWVC----ASAVA 309
           K   D    T  LS  +I++L++ V        P LH+ST+ +A  Y W C         
Sbjct: 243 KLGDDVVLATLVLSRADIERLREQV----KNVSPSLHLSTFVIAYAYAWTCFVKARGGNK 302

Query: 310 DEDITIAVTVDARGRLDPPLPATYFGNYVVGRST-ALKRGKLFGENGVIAAVETISEMIK 369
           D  +++    D R RLDP LP TYFGN ++       K  +   E G + A E IS+++K
Sbjct: 303 DRSVSLLFVGDFRDRLDPKLPGTYFGNCMIPVGCYNRKAAEFMEEKGFVTAAEIISDLVK 362

Query: 370 SLKEEGPLKGAENWVLLMTQTVVNSDYKLISTTGSPRFEVYSVDFGWGKPEKVEVVSINR 429
            L        A+ +V   +    ++ +  I+  GS R  VY  DFGWG+P KV++VSI++
Sbjct: 363 GLSSRKIETIADTFVEGFSFQSWSTQFGTIA--GSTRLGVYEADFGWGRPVKVDIVSIDQ 422

Query: 430 TGAVCISESRD-GGGVELGWTAKRDVMENFAKLF 446
             A+ ++E RD  GGVE+G   K+  M++    F
Sbjct: 423 GEAIAMAERRDESGGVEIGMCLKKTEMDSVVSFF 445

BLAST of CsaV3_3G020720 vs. ExPASy Swiss-Prot
Match: Q9FNP9 (Agmatine coumaroyltransferase OS=Arabidopsis thaliana OX=3702 GN=ACT PE=1 SV=1)

HSP 1 Score: 263.8 bits (673), Expect = 3.5e-69
Identity = 164/458 (35.81%), Positives = 242/458 (52.84%), Query Frame = 0

Query: 9   AIKVLEVCTVAPPPGSTVPVTLPLTFFDILWFRFPPVERLFFYK-------SPVPFYVIV 68
           A+KV+++  V+P   S  P+ +PL+FFD+ W +  P E++FFYK         V +  I+
Sbjct: 2   ALKVIKISRVSPATASVDPLIVPLSFFDLQWLKLNPTEQVFFYKLTESSSSRDVFYSSIL 61

Query: 69  SNLKKSLSLVLQHYLPLAGAIVWPENSPKPAVETAVRDGIVLTVAESEDDFDHLIGDGLR 128
             L++SLSL+L H+    G + W    PKP +     D + LTVAE++ DF  + G GLR
Sbjct: 62  PKLERSLSLILTHFRLFTGHLKWDSQDPKPHLVVLSGDTLSLTVAETDADFSRISGRGLR 121

Query: 129 KEAKLRPLVAELAAEEDRAAVVAVQVTWFGNGGFSIGITSHHAVLDGRSSTSFMKSWAGL 188
            E +LRPL+ EL    D  AVV++QVT F   GF IG T+HH VLDG+++  F K+WA  
Sbjct: 122 PELELRPLIPELPIYSDSGAVVSLQVTLFPKQGFCIGTTAHHVVLDGKTAEKFNKAWAHT 181

Query: 189 CKNLVGGGEIFCPAAETMP-FYDRSVVTDKMGLEAIYLKC---LLAHEGPNNRSLKF--- 248
           CK+    G I     + +P   DRSVV    GLE   L+    L   +  N R+LK    
Sbjct: 182 CKH----GTI----PKILPTVLDRSVVNVPAGLEQKMLELLPYLTEDDKENGRTLKLPPV 241

Query: 249 WDFKTPPDSFRGTFKLSPQNIQKLKQHVLEHRNPAQPLLHISTYTVAMGYTWVC----AS 308
            +     +  R T ++SP+NI+KLK+   +    A+  LH+ST+ V   + W C     S
Sbjct: 242 KEINAKDNVLRITIEISPENIEKLKERAKKESTRAE--LHLSTFVVTFAHVWTCMVKARS 301

Query: 309 AVADEDITIAVTVDARGRLDPPLPATYFGNYVVGRSTALKRGKLF-GENGVIAAVETISE 368
              +  +      D R RL+PP+P TYFG  V+       + K F GE+G +  VE +S+
Sbjct: 302 GDPNRPVRFMYAADFRNRLEPPVPVTYFGTCVLAMDFYKYKAKEFMGEDGFVNTVEILSD 361

Query: 369 MIKSLKEEGPLKGAENWVLLMTQT-VVNSDYKLISTTGSPRFEVYSVDFGWGKPEKVEVV 428
            +K L  +G       W +    T  +    +L+   GS +  +Y  DFGWG+P   E +
Sbjct: 362 SVKRLASQGV---ESTWKVYEEGTKTMKWGTQLLVVNGSNQIGMYETDFGWGRPIHTETM 421

Query: 429 SINRTGAVCISESRDG-GGVELGWTAKRDVMENFAKLF 446
           SI +     +S+ RDG GGVE+G + K+  M+ F  LF
Sbjct: 422 SIYKNDEFSMSKRRDGIGGVEIGISLKKLEMDTFLSLF 446

BLAST of CsaV3_3G020720 vs. ExPASy Swiss-Prot
Match: Q589Y0 (Phenolic glucoside malonyltransferase 1 OS=Nicotiana tabacum OX=4097 GN=mat1 PE=1 SV=1)

HSP 1 Score: 261.2 bits (666), Expect = 2.3e-68
Identity = 169/460 (36.74%), Positives = 247/460 (53.70%), Query Frame = 0

Query: 12  VLEVCTVAPPPGSTVPVTLPLTFFDILWFRFPPVERLFFYKSPV--PFYV--IVSNLKKS 71
           V+E C V P PGS   +TLPLT+FD +W  F  + R+ FYK P+  P +V  I+  LK S
Sbjct: 4   VIEQCQVVPSPGSATELTLPLTYFDHVWLAFHRMRRILFYKLPISRPDFVQTIIPTLKDS 63

Query: 72  LSLVLQHYLPLAGAIVWPEN-SPKPAVETAVRDGIVLTVAESEDDFDHLIGDGLRKEAKL 131
           LSL L++YLPLAG +  P++ S  P +     + + +  +ES+ DF++LIG   R     
Sbjct: 64  LSLTLKYYLPLAGNVACPQDWSGYPELRYVTGNSVSVIFSESDMDFNYLIGYHPRNTKDF 123

Query: 132 RPLVAELAAEEDR-----AAVVAVQVTWFGNGGFSIGITSHHAVLDGRSSTSFMKSWAGL 191
              V +LA  +D      A V+A+QVT F N G SIG T+HH   DG +   F+++WA L
Sbjct: 124 YHFVPQLAEPKDAPGVQLAPVLAIQVTLFPNHGISIGFTNHHVAGDGATIVKFVRAWALL 183

Query: 192 CKNLVGGGEIFCPAAETMPFYDRSVVTDKMGLEAIYLKCLLAHEGPNNRSLKFWDFKTPP 251
             N  GG E F  A E +PFYDRSV+ D  G+       +  ++      +K  D  TPP
Sbjct: 184 --NKFGGDEQFL-ANEFIPFYDRSVIKDPNGVGMSIWNEMKKYK----HMMKMSDVVTPP 243

Query: 252 DSFRGTFKLSPQNIQKLKQHVLEHRNPAQPLLHISTYTVAMGYTWVC---ASAVADEDIT 311
           D  RGTF ++  +I KLK  VL  R     L H++++TV   Y W C   + A   E+I 
Sbjct: 244 DKVRGTFIITRHDIGKLKNLVLTRR---PKLTHVTSFTVTCAYVWTCIIKSEAATGEEID 303

Query: 312 ------IAVTVDARGRLDPPLPATYFGNYVVGRSTALKRGKLFGENGVIAAVETISEMIK 371
                      D R + +PPLP +YFGN +VG     ++  L G+ G   AVE I E I+
Sbjct: 304 ENGMEFFGCAADCRAQFNPPLPPSYFGNALVGYVARTRQVDLAGKEGFTIAVELIGEAIR 363

Query: 372 SLKEEGPLKGAENWVL----LMTQTVVNSDYKLISTTGSPRFEVYSVDFGWGKPEKVEVV 431
              ++      E W+L          V++  + +S  GSP+ ++Y+ DFGWG+PEK+E V
Sbjct: 364 KRMKD------EEWILSGSWFKEYDKVDAK-RSLSVAGSPKLDLYAADFGWGRPEKLEFV 423

Query: 432 SINRTGAV--CISESRDG-GGVELGWTAKRDVMENFAKLF 446
           SI+    +   +S+S+D  G +E+G +  +  M  FA +F
Sbjct: 424 SIDNDDGISMSLSKSKDSDGDLEIGLSLSKTRMNAFAAMF 446

BLAST of CsaV3_3G020720 vs. ExPASy TrEMBL
Match: A0A1S3CNS2 (phenolic glucoside malonyltransferase 1-like isoform X1 OS=Cucumis melo OX=3656 GN=LOC103503024 PE=3 SV=1)

HSP 1 Score: 831.6 bits (2147), Expect = 1.5e-237
Identity = 406/446 (91.03%), Positives = 428/446 (95.96%), Query Frame = 0

Query: 1   MATNHGSTAIKVLEVCTVAPPPGSTVPVTLPLTFFDILWFRFPPVERLFFYKSPVPFYVI 60
           MAT+HGSTAIKVLE+CTVAPPPGSTVP TLPLTFFDILWFRFPPVERLFFYKSPVPF+VI
Sbjct: 37  MATDHGSTAIKVLEICTVAPPPGSTVPATLPLTFFDILWFRFPPVERLFFYKSPVPFHVI 96

Query: 61  VSNLKKSLSLVLQHYLPLAGAIVWPENSPKPAVETAVRDGIVLTVAESEDDFDHLIGDGL 120
           VSNLKKSLSLVLQHYLPLAGAIVWPENSPKPAVETA  DGIVLTVAES+DDFDHL+GDGL
Sbjct: 97  VSNLKKSLSLVLQHYLPLAGAIVWPENSPKPAVETAAGDGIVLTVAESDDDFDHLVGDGL 156

Query: 121 RKEAKLRPLVAELAAEEDRAAVVAVQVTWFGNGGFSIGITSHHAVLDGRSSTSFMKSWAG 180
           R+EAKL+PLVAELAAEE+RAAVVAVQVTWFGNG FSIGITSHHA+LDGRSSTSFMKSWAG
Sbjct: 157 REEAKLQPLVAELAAEEERAAVVAVQVTWFGNGRFSIGITSHHAILDGRSSTSFMKSWAG 216

Query: 181 LCKNLVGGGEIFCPAAETMPFYDRSVVTDKMGLEAIYLKCLLAHEGPNNRSLKFWDFKTP 240
           LCKNLVGGG+IF PAAETMPFYDRSVVTD +GLEAIYL+C LAHEGPNNRSLKFWD KTP
Sbjct: 217 LCKNLVGGGDIFFPAAETMPFYDRSVVTDNVGLEAIYLECWLAHEGPNNRSLKFWDVKTP 276

Query: 241 PDSFRGTFKLSPQNIQKLKQHVLEHRNPAQPLLHISTYTVAMGYTWVCASAVADEDITIA 300
           PD FRGTFKLSPQ+IQKLKQHVL+HRNP QP  HISTYTVAMGYTWVCASAVADE+I+I 
Sbjct: 277 PDLFRGTFKLSPQDIQKLKQHVLKHRNPVQPPPHISTYTVAMGYTWVCASAVADEEISIG 336

Query: 301 VTVDARGRLDPPLPATYFGNYVVGRSTALKRGKLFGENGVIAAVETISEMIKSLKEEGPL 360
           VT+DARGR+ PPLPATYFGN VVGRSTAL+RGKL GENGVIAAVETISEMIKSLKEEGPL
Sbjct: 337 VTMDARGRVYPPLPATYFGNCVVGRSTALERGKLLGENGVIAAVETISEMIKSLKEEGPL 396

Query: 361 KGAENWVLLMTQTVVNSDYKLISTTGSPRFEVYSVDFGWGKPEKVEVVSINRTGAVCISE 420
           KGAENWVLLMTQTVVN+DYKLIST GSPRFEVYSVDFGWGKPEKVEVVSINRTGAVCISE
Sbjct: 397 KGAENWVLLMTQTVVNNDYKLISTAGSPRFEVYSVDFGWGKPEKVEVVSINRTGAVCISE 456

Query: 421 SRDGGGVELGWTAKRDVMENFAKLFA 446
           SR+GGGVE GWTA+RDVMENFAKLFA
Sbjct: 457 SRNGGGVEHGWTARRDVMENFAKLFA 482

BLAST of CsaV3_3G020720 vs. ExPASy TrEMBL
Match: A0A6J1GBE9 (phenolic glucoside malonyltransferase 2-like OS=Cucurbita moschata OX=3662 GN=LOC111452446 PE=3 SV=1)

HSP 1 Score: 565.8 bits (1457), Expect = 1.6e-157
Identity = 291/454 (64.10%), Positives = 345/454 (75.99%), Query Frame = 0

Query: 1   MATNHGSTAIKVLEVCTVAPPPGSTVPVTLPLTFFDILWFRFPPVERLFFYKSPVPFYVI 60
           MA  H S  + VL +CTV PP GS VP +LPLTFFDILW RFPPV+R+FFYKS  PF V+
Sbjct: 12  MAEIHPS--VNVLHLCTVPPPHGSLVPFSLPLTFFDILWLRFPPVQRIFFYKSSAPFDVV 71

Query: 61  VSNLKKSLSLVLQHYLPLAGAIVWPENSPKPAVETAVRDGIVLTVAESEDDFDHLIGDGL 120
           VS LK SLS  LQHY PLAGA+VWPENSPKPAV+T + DGI+LT+A+S+  F HL+ DGL
Sbjct: 72  VSTLKNSLSAALQHYPPLAGAVVWPENSPKPAVQTVLGDGILLTLAKSDSKFSHLVSDGL 131

Query: 121 RKEAKLRPLVAELAAEEDRAAVVAVQVTWFGNGGFSIGITSHHAVLDGRSSTSFMKSWAG 180
           R+ A+   LV  L A +DRAAV+A+QVT FG  GF IGITSHHA+LDGR+STSF+K WA 
Sbjct: 132 REAAEFHTLVPRLPAADDRAAVMALQVTSFGTDGFCIGITSHHAILDGRTSTSFVKLWAR 191

Query: 181 LCKNLVGGGEIFCP---AAETMPFYDRSVVTDKMGLEAIYLKCLLAHEGPNNRSLKFWDF 240
           LCKNLV GGE   P   AAETMPFYDRSV+ D  GLE I+L+  LAH G +N+SLKFW  
Sbjct: 192 LCKNLVAGGESAEPVSTAAETMPFYDRSVIVDPRGLEGIFLRDWLAHGGSDNKSLKFWSP 251

Query: 241 KTPPDSFRGTFKLSPQNIQKLKQHVLEHRNPAQPLLHISTYTVAMGYTWVCASAVA---- 300
             P   FRGTFKL+PQNIQKLKQ VL  RNP  P +HIST+TVAM YTWVC +AVA    
Sbjct: 252 SIPQGLFRGTFKLNPQNIQKLKQLVLNRRNPVHPPVHISTFTVAMAYTWVC-TAVADGSP 311

Query: 301 -DEDITIAVTVDARGRLDPPLPATYFGNYVVGRSTALKRGKLFGENGVIAAVETISEMIK 360
            D +I+  ++VDAR  LDPP+PA YFGN +VGR+T  +R KL GENG++ AVE IS+ IK
Sbjct: 312 NDGEISFGLSVDARRWLDPPVPANYFGNCLVGRTTDQERAKLVGENGLVTAVEGISKAIK 371

Query: 361 SLKEEGPLKGAENWVLLMTQTVVNSDYKLISTTGSPRFEVYSVDFGWGKPEKVEVVSINR 420
           SL+E+G L GAE WV L+TQ V  S+ K+++T GSPRFE+YSVDFGWG P KVEVVSI+ 
Sbjct: 372 SLEEKGALDGAEQWVSLLTQ-VSGSNRKMLTTAGSPRFELYSVDFGWGTPAKVEVVSIDE 431

Query: 421 TGAVCISESRDGGGVELGWTAKRDVMENFAKLFA 447
           TGAV + + RD GGVELGW AK+DVME FA  FA
Sbjct: 432 TGAVSVCDGRD-GGVELGWVAKKDVMEAFAAAFA 460

BLAST of CsaV3_3G020720 vs. ExPASy TrEMBL
Match: A0A6J1K841 (phenolic glucoside malonyltransferase 1-like OS=Cucurbita maxima OX=3661 GN=LOC111492563 PE=3 SV=1)

HSP 1 Score: 544.7 bits (1402), Expect = 3.8e-151
Identity = 283/455 (62.20%), Positives = 340/455 (74.73%), Query Frame = 0

Query: 1   MATNHGSTAIKVLEVCTVAPPPGSTVPVTLPLTFFDILWFRFPPVERLFFYKSPVPFYVI 60
           MA  H S  + VL +CTV  P GS +P +LPLTFFDILW RFPPV+R+FFYKS  PF V+
Sbjct: 1   MAEMHPS--VNVLHLCTVPLPHGSLLPFSLPLTFFDILWLRFPPVQRIFFYKSSAPFDVV 60

Query: 61  VSNLKKSLSLVLQHYLPLAGAIVWPENSPKPAVETAVRDGIVLTVAESEDDFDHLIGDGL 120
           VS LK SLS  LQHY PLAGA+VWPENSPKPAV+T + DGI+LT+A+S+  F HL+ D L
Sbjct: 61  VSTLKNSLSAALQHYPPLAGAVVWPENSPKPAVQTVLGDGILLTLAKSDSKFSHLVSDEL 120

Query: 121 RKEAKLRPLVAELAAEEDRAAVVAVQVTWFGNGGFSIGITSHHAVLDGRSSTSFMKSWAG 180
           R+ A+   LV  L A +DRAAV+A+QVT FG  GF IGITSHHA+LDGR+STSF+K WA 
Sbjct: 121 REAAEFHTLVPRLPAADDRAAVMALQVTSFGTEGFCIGITSHHAILDGRTSTSFVKLWAR 180

Query: 181 LCKNLVGGGEIFCP---AAETMPFYDRSVVTDKMGLEAIYLKCLLAHEGPNNRSLKFWDF 240
           LCKNLV GGE   P   A+ETMPFYDRSV+ D  GLE I+L+  LAH G +N+SLKFW  
Sbjct: 181 LCKNLVAGGESAKPGSTASETMPFYDRSVIVDPKGLEGIFLRDWLAHGGSDNKSLKFWPP 240

Query: 241 KTPPDSFRGTFKLSPQNIQKLKQHVLEHRNPAQ-PLLHISTYTVAMGYTWVCASAVA--- 300
               D FRGTFK +PQNIQKLKQ VL   NP   P +HIST+TVAM YTWVC +AVA   
Sbjct: 241 SIQQDLFRGTFKFNPQNIQKLKQLVLNRWNPVHPPPVHISTFTVAMAYTWVC-TAVADGS 300

Query: 301 --DEDITIAVTVDARGRLDPPLPATYFGNYVVGRSTALKRGKLFGENGVIAAVETISEMI 360
             D +I+  ++VDAR  LDPP+PA YFGN +VGR+T  ++ KL GENG++ AVE IS+ I
Sbjct: 301 PHDGEISFGLSVDARRWLDPPVPANYFGNCLVGRTTDQEKAKLVGENGLVTAVEGISKAI 360

Query: 361 KSLKEEGPLKGAENWVLLMTQTVVNSDYKLISTTGSPRFEVYSVDFGWGKPEKVEVVSIN 420
           KSL+E G L GAE WV ++TQ V  S+ K+++T GSPRFE+YSVDFGWG P KVEVVSI+
Sbjct: 361 KSLEENGALDGAEQWVSMLTQ-VAGSNRKMLTTAGSPRFELYSVDFGWGTPAKVEVVSID 420

Query: 421 RTGAVCISESRDGGGVELGWTAKRDVMENFAKLFA 447
            TGAV + + RD GGVELGW AK+DVME FA  FA
Sbjct: 421 GTGAVSVCDGRD-GGVELGWVAKKDVMEAFAAAFA 450

BLAST of CsaV3_3G020720 vs. ExPASy TrEMBL
Match: A0A0A0L9V3 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G280980 PE=3 SV=1)

HSP 1 Score: 504.6 bits (1298), Expect = 4.3e-139
Identity = 246/246 (100.00%), Positives = 246/246 (100.00%), Query Frame = 0

Query: 1   MATNHGSTAIKVLEVCTVAPPPGSTVPVTLPLTFFDILWFRFPPVERLFFYKSPVPFYVI 60
           MATNHGSTAIKVLEVCTVAPPPGSTVPVTLPLTFFDILWFRFPPVERLFFYKSPVPFYVI
Sbjct: 1   MATNHGSTAIKVLEVCTVAPPPGSTVPVTLPLTFFDILWFRFPPVERLFFYKSPVPFYVI 60

Query: 61  VSNLKKSLSLVLQHYLPLAGAIVWPENSPKPAVETAVRDGIVLTVAESEDDFDHLIGDGL 120
           VSNLKKSLSLVLQHYLPLAGAIVWPENSPKPAVETAVRDGIVLTVAESEDDFDHLIGDGL
Sbjct: 61  VSNLKKSLSLVLQHYLPLAGAIVWPENSPKPAVETAVRDGIVLTVAESEDDFDHLIGDGL 120

Query: 121 RKEAKLRPLVAELAAEEDRAAVVAVQVTWFGNGGFSIGITSHHAVLDGRSSTSFMKSWAG 180
           RKEAKLRPLVAELAAEEDRAAVVAVQVTWFGNGGFSIGITSHHAVLDGRSSTSFMKSWAG
Sbjct: 121 RKEAKLRPLVAELAAEEDRAAVVAVQVTWFGNGGFSIGITSHHAVLDGRSSTSFMKSWAG 180

Query: 181 LCKNLVGGGEIFCPAAETMPFYDRSVVTDKMGLEAIYLKCLLAHEGPNNRSLKFWDFKTP 240
           LCKNLVGGGEIFCPAAETMPFYDRSVVTDKMGLEAIYLKCLLAHEGPNNRSLKFWDFKTP
Sbjct: 181 LCKNLVGGGEIFCPAAETMPFYDRSVVTDKMGLEAIYLKCLLAHEGPNNRSLKFWDFKTP 240

Query: 241 PDSFRG 246
           PDSFRG
Sbjct: 241 PDSFRG 246

BLAST of CsaV3_3G020720 vs. ExPASy TrEMBL
Match: A0A1S3CP69 (phenolic glucoside malonyltransferase 1-like isoform X2 OS=Cucumis melo OX=3656 GN=LOC103503024 PE=3 SV=1)

HSP 1 Score: 472.2 bits (1214), Expect = 2.4e-129
Identity = 235/293 (80.20%), Positives = 255/293 (87.03%), Query Frame = 0

Query: 1   MATNHGSTAIKVLEVCTVAPPPGSTVPVTLPLTFFDILWFRFPPVERLFFYKSPVPFYVI 60
           MAT+HGSTAIKVLE+CTVAPPPGSTVP TLPLTFFDILWFRFPPVERLFFYKSPVPF+VI
Sbjct: 37  MATDHGSTAIKVLEICTVAPPPGSTVPATLPLTFFDILWFRFPPVERLFFYKSPVPFHVI 96

Query: 61  VSNLKKSLSLVLQHYLPLAGAIVWPENSPKPAVETAVRDGIVLTVAESEDDFDHLIGDGL 120
           VSNLKKSLSLVLQHYLPLAGAIVWPENSPKPAVETA  DGIVLTVAES+DDFDHL+GDGL
Sbjct: 97  VSNLKKSLSLVLQHYLPLAGAIVWPENSPKPAVETAAGDGIVLTVAESDDDFDHLVGDGL 156

Query: 121 RKEAKLRPLVAELAAEEDRAAVVAVQVTWFGNGGFSIGITSHHAVLDGRSSTSFMKSWAG 180
           R+EAKL+PLVAELAAEE+RAAVVAVQVTWFGNG FSIGITSHHA+LDGRSSTSFMKSWAG
Sbjct: 157 REEAKLQPLVAELAAEEERAAVVAVQVTWFGNGRFSIGITSHHAILDGRSSTSFMKSWAG 216

Query: 181 LCKNLVGGGEIFCPAAETMPFYDRSVVTDKMGLEAIYLKCLLAHEGPNNRSLKFWDFKTP 240
           LCKNLVGGG+IF PAAETMPFYDRSVVTD +GLEAIYL+C LAHEGPNNRSLKFWD KTP
Sbjct: 217 LCKNLVGGGDIFFPAAETMPFYDRSVVTDNVGLEAIYLECWLAHEGPNNRSLKFWDVKTP 276

Query: 241 PDSFRGTFKLSPQNIQKLKQHVLEHRNPAQPLLHISTYTVAMGYTWVCASAVA 294
           PD FRG   +  Q +  + +               +T TVA+  T+V A A+A
Sbjct: 277 PDLFRGALFVPSQGLTTIVE---------------TTVTVAIIATFVAAEAIA 314

BLAST of CsaV3_3G020720 vs. TAIR 10
Match: AT5G39050.1 (HXXXD-type acyl-transferase family protein)

HSP 1 Score: 291.6 bits (745), Expect = 1.1e-78
Identity = 182/465 (39.14%), Positives = 258/465 (55.48%), Query Frame = 0

Query: 1   MATNHGSTAIKVLEVCTVAPP-PGSTVPVTLPLTFFDILWFRFPPVERLFFYK---SPVP 60
           M      +++KV++V  V P    S+  +TLPLTFFD+LW++   VER+ FYK   +  P
Sbjct: 1   MVNEEMESSLKVIDVARVTPSNSDSSESLTLPLTFFDLLWYKLHAVERVIFYKLTDASRP 60

Query: 61  FY--VIVSNLKKSLSLVLQHYLPLAGAIVWPENSPKPAVETAVRDGIVLTVAESEDDFDH 120
           F+  VIV NLK SLS  L HYLPLAG +VW    PKP +     D +  TVAES  DF  
Sbjct: 61  FFDSVIVPNLKTSLSSSLSHYLPLAGKLVWEPLDPKPKIVYTPNDAVSFTVAESNADFSR 120

Query: 121 LIGDGLRKEAKLRPLVAELAAEEDRAAVVAVQVTWFGNGGFSIGITSHHAVLDGRSSTSF 180
           L G       +L PLV EL   +D A+ V+ QVT F N GF I + +HHAVLDG+++T+F
Sbjct: 121 LTGKEPFPTTELYPLVPELHVSDDSASAVSFQVTLFPNQGFCISVNAHHAVLDGKTTTNF 180

Query: 181 MKSWAGLCKNLVGGGEIFCPAAETMPFYDRSVVTDKMGLEAIYL-------KCLLAHEGP 240
           +KSWA  CKN     + F P  + +P YDR+V+ D M L+   L       K     + P
Sbjct: 181 LKSWARTCKN----QDSFLP-QDLIPVYDRTVIKDPMDLDTKILNAWHRVAKVFTGGKEP 240

Query: 241 NN-RSLK-FWDFKTPPDSFRGTFKLSPQNIQKLKQHVLEHRNPAQ-----PLLHISTYTV 300
            N +SLK  W  +  PD FR T  L+ ++IQKL++ + +  + +        L +ST+ +
Sbjct: 241 ENPKSLKLLWSPEIGPDVFRYTLNLTREDIQKLRERLKKESSSSSVSSSPKELRLSTFVI 300

Query: 301 AMGYTWVCASAVADED----ITIAVTVDARGRLDPPLPATYFGNYVVG-RSTALKRGKLF 360
              Y   C       D    +     VD R  + PP+P++YFGN V      +L      
Sbjct: 301 VYSYALTCLIKARGGDPSRPVGYGFAVDCRSLMVPPVPSSYFGNCVSACFKMSLTAETFM 360

Query: 361 GENGVIAAVETISEMIKSLKEEGPLKGAENWVLLMTQTVVNSDYKLISTTGSPRFEVYSV 420
            E G +AA   +S+ +++L E   LK  E   +L   T ++   +++S  GS RF VY +
Sbjct: 361 SEEGFLAAARMVSDSVEALDENVALKIPE---ILEGFTTLSPGTQVLSVAGSTRFGVYGL 420

Query: 421 DFGWGKPEKVEVVSINRTGAVCISESRDG-GGVELGWTAKRDVME 440
           DFGWG+PEKV VVSI++  A+  +ESRDG GGVELG++ K+  M+
Sbjct: 421 DFGWGRPEKVVVVSIDQGEAISFAESRDGSGGVELGFSLKKHEMD 457

BLAST of CsaV3_3G020720 vs. TAIR 10
Match: AT3G29590.1 (HXXXD-type acyl-transferase family protein)

HSP 1 Score: 280.0 bits (715), Expect = 3.3e-75
Identity = 179/455 (39.34%), Positives = 256/455 (56.26%), Query Frame = 0

Query: 7   STAIKVLEVCTVAPPPGSTVPVTLPLTFFDILWFRFPPVERLFFYKSP-VPFYVIVSNLK 66
           ++A+ +LEV  V+PP  S+  +TLPLT+FD+ W +  PV+R+ FY  P +    ++S LK
Sbjct: 5   NSAVNILEVVQVSPP--SSNSLTLPLTYFDLGWLKLHPVDRVLFYHVPELTRSSLISKLK 64

Query: 67  KSLSLVLQHYLPLAGAIVWPENSPKPAVETAV--RDGIVLTVAESEDDFDHLIGDGLRKE 126
            SLS  L HYLPLAG +VW     KP++  +   +D + LTVAES  D  HL GD  R  
Sbjct: 65  SSLSATLLHYLPLAGRLVWDSIKTKPSIVYSPDDKDAVYLTVAESNGDLSHLSGDEPRPA 124

Query: 127 AKLRPLVAELAAEEDRAAVVAVQVTWFGNGGFSIGITSHHAVLDGRSSTSFMKSWAGLCK 186
            +   LV EL   ++ A V+AVQVT+F N GFS+G+T+HHAVLDG+++  F+K+WA  CK
Sbjct: 125 TEFHSLVPELPVSDESARVLAVQVTFFPNQGFSLGVTAHHAVLDGKTTAMFLKAWAHNCK 184

Query: 187 NLVGGGEIFCPAAETMPFYDRSVVTDKMGLEAIYLKCLLAHEGPNNRSLKFWDFK-TPPD 246
                 E      + +P  DR +V D  GLE   L   ++    N  SLK +  K    D
Sbjct: 185 Q-----EQEALPHDLVPSLDRIIVQDPTGLETKLLNRWISASN-NKPSLKLFPSKIIGSD 244

Query: 247 SFRGTFKLSPQNIQKLKQHVLEHRNPAQPLLHISTYTVAMGYTWVCASAVADEDITIAVT 306
             R T++L+ ++I+KL++ V    +  Q  L +ST+ +   Y   C   +   D T  V 
Sbjct: 245 ILRVTYRLTREDIKKLRERVETESHAKQ--LRLSTFVITYAYVITCMVKMRGGDPTRFVC 304

Query: 307 V----DARGRLDPPLPATYFGNYVVGRST-ALKRGKLFGE---NGVIAAVETISEMIKSL 366
           V    D R RL+PPLP T+FGN +VG     +K   +  E    G I AVET++  +  L
Sbjct: 305 VGFASDFRSRLNPPLPPTFFGNCIVGSGDFDVKAEPILEEGEGKGFITAVETLTGWVNGL 364

Query: 367 KEEGPLKGAENWVLLMTQTV--VNSDYKLISTTGSPRFEVYSVDFGWGKPEKVEVVSINR 426
             E      E  +LL  +    +    ++IS  GS R  +Y  DFGWGKP KVE+V+I++
Sbjct: 365 CPE----NIEKNMLLPFEAFKRMEPGRQMISVAGSTRLGIYGSDFGWGKPVKVEIVTIDK 424

Query: 427 TGAVCISESRDG-GGVELGWTAKRDVMENFAKLFA 447
             +V +SES DG GGVE+G   K+D +E F  LF+
Sbjct: 425 DASVSLSESGDGSGGVEVGVCLKKDDVERFGSLFS 445

BLAST of CsaV3_3G020720 vs. TAIR 10
Match: AT5G39090.1 (HXXXD-type acyl-transferase family protein)

HSP 1 Score: 277.7 bits (709), Expect = 1.7e-74
Identity = 174/444 (39.19%), Positives = 242/444 (54.50%), Query Frame = 0

Query: 9   AIKVLEVCTVAPP-PGSTVPVTLPLTFFDILWFRFPPVERLFFYK-----SPVPFYVIVS 68
           ++  + V  V P    S+  +TLPLTFFD+LW +   VER+ FYK       +   VIV 
Sbjct: 4   SLNFIHVSRVTPSNSNSSASLTLPLTFFDLLWLKHKAVERVIFYKLTDVNRSLFDSVIVP 63

Query: 69  NLKKSLSLVLQHYLPLAGAIVWPENSPKPAVETAVRDGIVLTVAESEDDFDHLIGDGLRK 128
           NLK SLS  L HYLPLAG I+W  + PKP +     D +  TVAES  DF  L G     
Sbjct: 64  NLKSSLSSSLSHYLPLAGHIIWEPHDPKPKIVYTQNDAVSFTVAESNSDFSLLTGKEPFS 123

Query: 129 EAKLRPLVAELAAEEDRAAVVAVQVTWFGNGGFSIGITSHHAVLDGRSSTSFMKSWAGLC 188
             +L PLV EL   +D AAVV+ QVT F N GF IG+T+HHAV DG+++T+F+KSWA LC
Sbjct: 124 STELHPLVPELQNSDDSAAVVSFQVTLFPNQGFCIGVTTHHAVSDGKTTTTFLKSWAHLC 183

Query: 189 KNLVGGGEIFCPAAETMPFYDRSVVTDKMGLEAIYLKCLLAHEGPNNRSLKFWDF-KTPP 248
           K+     +      + +PFYDR+V+     ++   LK  + H     +SLK     +   
Sbjct: 184 KH-----QDSSLPDDLIPFYDRTVIKGPPEIDTKVLK--IWHSIHKPKSLKLLPRPEIES 243

Query: 249 DSFRGTFKLSPQNIQKLKQHVLEHRNPAQPLLHISTYTVAMGYTWVCASAVADED----I 308
           D  R TF+L+ +NI+KL+   L+  + +   + +ST+ +   Y + C      +D    +
Sbjct: 244 DVVRYTFELTRENIEKLRDK-LKRESSSFSSVRLSTFVITFSYVFTCLIGSGGDDPNRPV 303

Query: 309 TIAVTVDARGRL-DPPLPATYFGNYVVGR-STALKRGKLFGENGVIAAVETISEMIKSLK 368
                VD R  + DPP+P TYFGN V       L  G   GE G + A   IS+ ++ L 
Sbjct: 304 GYRFAVDCRRLIDDPPIPLTYFGNCVYSAVKIPLDAGMFLGEQGFVVAARLISDSVEELD 363

Query: 369 EEGPLKGAENWVLLMTQTVVNSDYKLISTTGSPRFEVYSVDFGWGKPEKVEVVSINRTGA 428
                K  E   LL T      D + +S  GS RF +Y +DFGWGKP K  +VSI++ G 
Sbjct: 364 SNVAWKIPE---LLETYEKAPVDSQFVSVAGSTRFGIYGLDFGWGKPFKSLLVSIDQRGK 423

Query: 429 VCISESRDG-GGVELGWTAKRDVM 439
           + I+ESRDG GGVE+G++ K+  M
Sbjct: 424 ISIAESRDGSGGVEIGFSLKKQEM 436

BLAST of CsaV3_3G020720 vs. TAIR 10
Match: AT3G29635.1 (HXXXD-type acyl-transferase family protein)

HSP 1 Score: 268.9 bits (686), Expect = 7.7e-72
Identity = 175/458 (38.21%), Positives = 251/458 (54.80%), Query Frame = 0

Query: 9   AIKVLEVCTVAPPPGST----VPVTLPLTFFDILWFRFPPVERLFFYK----SPVPFY-- 68
           A+KV ++  V+P   S+      + LPLTFFD+ W +F P ER+ FYK    S +  +  
Sbjct: 2   ALKVTKISQVSPASNSSNDSANSMVLPLTFFDLRWLQFHPTERVIFYKLIKDSSLESFLS 61

Query: 69  VIVSNLKKSLSLVLQHYLPLAGAIVWPENSPKPAVETAVRDGIVLTVAESEDDFDHLIGD 128
           VI+  L+ SLS+VL+HYLPLAG + W    PKP++  +  D + LTVAES+ DF  + G 
Sbjct: 62  VILPKLELSLSIVLRHYLPLAGRLTWSSQDPKPSIIVSPNDYVSLTVAESDADFSRISGK 121

Query: 129 GLRKEAKLRPLVAELAAEEDRAAVVAVQVTWFGNGGFSIGITSHHAVLDGRSSTSFMKSW 188
           G+R E+++R LV EL+   D  +V+++QVT F N GF IGI SHH+V+DG++   F+KSW
Sbjct: 122 GIRPESEIRSLVPELSLSCDSPSVLSLQVTLFPNQGFCIGIASHHSVMDGKTVVRFIKSW 181

Query: 189 AGLCKNLVGGGEIFCPAAETMPFYDRSVVTDKMGLEA--IYLKCLLAHEGPNNRSLKFWD 248
           A +CK+    G +  P  +  P  DR+V+     L+A  I L    +    + RSLK   
Sbjct: 182 AHICKH----GAMDLP-EDLTPVLDRTVINVPASLDAKIIELLSYFSEVKDSFRSLKLLP 241

Query: 249 FK-TPPDSFRGTFKLSPQNIQKLKQHVLEHRNPAQPLLHISTYTVAMGYTWVCASAVADE 308
            K   PD  R + +L+ +NI+KL++        +   LH+ST+ VA  Y W C       
Sbjct: 242 PKEISPDLVRISLELTRENIEKLREQAKRESARSHHELHLSTFVVANAYLWTCLVKTRGG 301

Query: 309 D----ITIAVTVDARGRLDPPLPATYFGNYV--VGRSTALKRGKLFGENGVIAAVETISE 368
           D    +      D R RLDPP+P  YFGN V  +G     K     GE+G +  VE +S+
Sbjct: 302 DENRPVRFMYAADFRNRLDPPVPEMYFGNCVFPIG-CFGYKANVFLGEDGFVNMVEILSD 361

Query: 369 MIKSLKEEGPLKGAENWVLLMTQT-VVNSDYKLISTTGSPRFEVYSVDFGWGKPEKVEVV 428
            ++S+   G  K      L +  T  V    ++ S  GS +F +Y  DFGWGKP   E+ 
Sbjct: 362 SVRSI---GLRKLETICELYINGTKSVKPGTQIGSIAGSNQFGLYGSDFGWGKPCNSEIA 421

Query: 429 SINRTGAVCISESRD-GGGVELGWTAKRDVMENFAKLF 446
           SI+R  A  +SE RD  GGVE+G   K+  M+ F  LF
Sbjct: 422 SIDRNEAFSMSERRDEPGGVEIGLCLKKCEMDIFIYLF 450

BLAST of CsaV3_3G020720 vs. TAIR 10
Match: AT3G29670.1 (HXXXD-type acyl-transferase family protein)

HSP 1 Score: 266.9 bits (681), Expect = 2.9e-71
Identity = 164/454 (36.12%), Positives = 242/454 (53.30%), Query Frame = 0

Query: 10  IKVLEVCTVAPPPGSTVPVT----LPLTFFDILWFRFPPVERLFFYKSPVP-----FYVI 69
           + V+E   V P   S +       LPLTFFD+ W  F PV+R+FFY+           +I
Sbjct: 3   LHVIETARVTPTDYSVINSANLHKLPLTFFDLPWLLFQPVKRVFFYELTESTRDHFHSII 62

Query: 70  VSNLKKSLSLVLQHYLPLAGAIVWPENSPKPAVETAVRDGIVLTVAESEDDFDHLIGDGL 129
           +  LK SLSL+L++YLPL G I W  N PKP++  +    +++T+AES+ DF HL G G 
Sbjct: 63  LPKLKDSLSLILRNYLPLTGHITWEPNEPKPSIIVSENGVVLVTIAESDADFSHLSGYGQ 122

Query: 130 RKEAKLRPLVAELAAEEDRAAVVAVQVTWFGNGGFSIGITSHHAVLDGRSSTSFMKSWAG 189
           R  ++L  LV +L   +D A   ++Q+T F N GFSIG+ +HHAVLDG++S++F+K+WA 
Sbjct: 123 RPLSELHALVPKLPVSDDSATAFSIQITLFPNQGFSIGVAAHHAVLDGKTSSTFIKAWAQ 182

Query: 190 LCKNLVGGGEIFCPAAETMPFYDRSVVTDKMGLE--AIYLKCLLAHEGPNNRSL-KFWDF 249
           +CK      E+        P YDRS++     L+   I L   L  +  N RSL      
Sbjct: 183 ICKQ-----ELQSMPENLTPSYDRSLIKYPTYLDEKMIELVRSLKEDQTNIRSLTSLPSS 242

Query: 250 KTPPDSFRGTFKLSPQNIQKLKQHVLEHRNPAQPLLHISTYTVAMGYTWVC----ASAVA 309
           K   D    T  LS  +I++L++ V        P LH+ST+ +A  Y W C         
Sbjct: 243 KLGDDVVLATLVLSRADIERLREQV----KNVSPSLHLSTFVIAYAYAWTCFVKARGGNK 302

Query: 310 DEDITIAVTVDARGRLDPPLPATYFGNYVVGRST-ALKRGKLFGENGVIAAVETISEMIK 369
           D  +++    D R RLDP LP TYFGN ++       K  +   E G + A E IS+++K
Sbjct: 303 DRSVSLLFVGDFRDRLDPKLPGTYFGNCMIPVGCYNRKAAEFMEEKGFVTAAEIISDLVK 362

Query: 370 SLKEEGPLKGAENWVLLMTQTVVNSDYKLISTTGSPRFEVYSVDFGWGKPEKVEVVSINR 429
            L        A+ +V   +    ++ +  I+  GS R  VY  DFGWG+P KV++VSI++
Sbjct: 363 GLSSRKIETIADTFVEGFSFQSWSTQFGTIA--GSTRLGVYEADFGWGRPVKVDIVSIDQ 422

Query: 430 TGAVCISESRD-GGGVELGWTAKRDVMENFAKLF 446
             A+ ++E RD  GGVE+G   K+  M++    F
Sbjct: 423 GEAIAMAERRDESGGVEIGMCLKKTEMDSVVSFF 445

The following BLAST results are available for this feature:
Match Name      E-value   Identity (%)  Description
XP_004145869.2  6.8e-264  100.00        phenolic glucoside malonyltransferase 1 [Cucumis sativus] >KAE8650582.1 hypothet...
XP_008465393.1  3.2e-237  91.03         PREDICTED: phenolic glucoside malonyltransferase 1-like isoform X1 [Cucumis melo...
XP_038875303.1  5.2e-187  74.27         malonyl-CoA:anthocyanidin 5-O-glucoside-6''-O-malonyltransferase-like [Benincasa...
XP_022948949.1  3.3e-157  64.10         phenolic glucoside malonyltransferase 2-like [Cucurbita moschata]
KAG6607032.1    3.3e-157  64.10         Phenolic glucoside malonyltransferase 2, partial [Cucurbita argyrosperma subsp. ...

Match Name      E-value   Identity (%)  Description
Q940Z5          1.6e-77   39.14         Phenolic glucoside malonyltransferase 1 OS=Arabidopsis thaliana OX=3702 GN=PMAT1...
Q9LJB4          4.7e-74   39.34         Malonyl-CoA:anthocyanidin 5-O-glucoside-6''-O-malonyltransferase OS=Arabidopsis ...
Q9LRQ8          4.1e-70   36.12         Phenolic glucoside malonyltransferase 2 OS=Arabidopsis thaliana OX=3702 GN=PMAT2...
Q9FNP9          3.5e-69   35.81         Agmatine coumaroyltransferase OS=Arabidopsis thaliana OX=3702 GN=ACT PE=1 SV=1
Q589Y0          2.3e-68   36.74         Phenolic glucoside malonyltransferase 1 OS=Nicotiana tabacum OX=4097 GN=mat1 PE=...

Match Name      E-value   Identity (%)  Description
A0A1S3CNS2      1.5e-237  91.03         phenolic glucoside malonyltransferase 1-like isoform X1 OS=Cucumis melo OX=3656 ...
A0A6J1GBE9      1.6e-157  64.10         phenolic glucoside malonyltransferase 2-like OS=Cucurbita moschata OX=3662 GN=LO...
A0A6J1K841      3.8e-151  62.20         phenolic glucoside malonyltransferase 1-like OS=Cucurbita maxima OX=3661 GN=LOC1...
A0A0A0L9V3      4.3e-139  100.00        Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G280980 PE=3 SV=1
A0A1S3CP69      2.4e-129  80.20         phenolic glucoside malonyltransferase 1-like isoform X2 OS=Cucumis melo OX=3656 ...

Match Name      E-value   Identity (%)  Description
AT5G39050.1     1.1e-78   39.14         HXXXD-type acyl-transferase family protein
AT3G29590.1     3.3e-75   39.34         HXXXD-type acyl-transferase family protein
AT5G39090.1     1.7e-74   39.19         HXXXD-type acyl-transferase family protein
AT3G29635.1     7.7e-72   38.21         HXXXD-type acyl-transferase family protein
AT3G29670.1     2.9e-71   36.12         HXXXD-type acyl-transferase family protein

InterPro
Analysis Name: InterPro Annotations of Cucumber (Chinese Long) v3
Date Performed: 2021-10-25
IPR Term   IPR Description                                            Source       Source Term     Source Description                                       Alignment
IPR023213  Chloramphenicol acetyltransferase-like domain superfamily  GENE3D       3.30.559.10     -                                                        coord: 239..448; e-value: 1.7E-44; score: 153.5
IPR023213  Chloramphenicol acetyltransferase-like domain superfamily  GENE3D       3.30.559.10     -                                                        coord: 9..231; e-value: 1.4E-61; score: 209.9
IPR003480  Transferase                                                PFAM         PF02458         Transferase                                              coord: 25..443; e-value: 3.6E-49; score: 167.6
None       No IPR available                                           PANTHER      PTHR31625       FAMILY NOT NAMED                                         coord: 7..446
None       No IPR available                                           PANTHER      PTHR31625:SF50  MALONYL-COA:ISOFLAVONE 7-O-GLUCOSIDE MALONYLTRANSFERASE  coord: 7..446
None       No IPR available                                           SUPERFAMILY  52777           CoA-dependent acyltransferases                           coord: 273..445
None       No IPR available                                           SUPERFAMILY  52777           CoA-dependent acyltransferases                           coord: 63..205

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CsaV3_3G020720.1  CsaV3_3G020720.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
molecular_function  GO:0016747      acyltransferase activity, transferring groups other than amino-acyl groups