Cp4.1LG04g07760 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
Name: Cp4.1LG04g07760
Type: gene
Organism: Cucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
Description: Glycosyltransferase
Location: Cp4.1LG04: 3830763 .. 3842644 (-)
RNA-Seq Expression: Cp4.1LG04g07760
Synteny: Cp4.1LG04g07760
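
The Location value packs the linkage group, the start and end coordinates, and the strand into one string. A minimal Python sketch for splitting it apart follows. It assumes the `name: start .. end (strand)` layout seen here and assumes the coordinates are 1-based and inclusive. `parse_location` is a hypothetical helper written for illustration, not part of any database API.

```python
import re

# Hypothetical helper: parses a location string laid out like
# "Cp4.1LG04: 3830763 .. 3842644 (-)" (layout assumed from this page).
LOCATION_RE = re.compile(r"^(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)$")

def parse_location(text):
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "seqid": seqid,
        "start": start,
        "end": end,
        "strand": strand,
        # Assumption: both coordinates are inclusive.
        "length": end - start + 1,
    }

loc = parse_location("Cp4.1LG04: 3830763 .. 3842644 (-)")
print(loc["seqid"], loc["strand"], loc["length"])
```

Under the inclusive-coordinate assumption the feature spans 11882 bp on the minus strand of Cp4.1LG04.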
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATATCCACATTCCTCAAAATCAGCCATGATGAATTCTGAAGAAGCTCCTTGCCATGTCTTCCTTGTATGCTATCCCAGCCAAGGACACATCAACCCCACTCTCAGACTTGCCAAGAAACTCGCCGCCGAGGGCCTTCTCGTCACCATTTCCACGGCGGTGCATTTCGGCAAAACACTGCAGAAAGCTGGAAGTATCGGCGCCGGCGACTGTCCCACTCCGGTCGGCAATGGCTTCATCAGATTTGAATTCTTCGAAGATCGCCTCCAAGAGATCAATCCCAAGGATATGAACTTGACCCGCTATAACAACCAGCTCGAGCTCTCCGGCCGGCCGTCGCTCACCGGCCTGATCAAGAACCAAACAGCCGAAAACCGCCCTGTTTCTTGCCTGATTGTGAACCCCTTTTTTCCATGGACATGCGAAGTTGCTAAGGAGCTTGGAATCCCCTGTGCCGTTCTTTGGGTTCAATCATGCTCTGTGTTCTCAATTTACTATCACTGTTTCCACAAATCCGTCCCGTTCCCTTCTGAATTGGAACCCAAAATCGACGTTCATCTCCCAATTTTGCCACTTTTGAAGAACGATGAAATCCCAAGCTTCTTGCATCCAAATAACATCTATGGCGTTTTGGGGAAAGTTCTGTTATCCCAATTCAGTAAATTATCAATACCCTTTTGTATTTTGATGGATAGTTTCGATGAACTCGAGAAAGATATCATCAGTTACATGTCTAATATCATTCCTTTGAAACCCATTGGCCCATTGTTCTTAAACCCACAAAATGTGGAAACAGAGGTCTCTGCCGACTGCTTAAAAGCAGAGGATTGTATGGAATGGCTAAACTCGAAGCCCCCACAATCCGTTGTGTACGTTTCATTTGGAAGCATCGTTTATTTGAAACAAGAGCAAATAGACGAGCTCGCTTATGGACTGTGCAATTCAGGGCTCTCATTCTTATGGGTTATGAAGCCGCCCAATGAAGCTCTCGGGTTAAAGGGCCATATTTTGCCGGAAGGGGTAATGGAGAAAGCAGGGGAGAGGGGAAAAGTGGTGCAATGGAGTTCGCAAGAGAGGGTTTTGTCGCATGAATCTGTTGGGTGTTTTATGACGCATTGTGGGTGGAATTCGTCGGTGGAAGCCATCGGCTGCGGCGTGCCGGTGGTGGCGTTTCCGCAGTGGGGGGATCAGGTGACCAATGCTAAGTTTTTGGTGGAGGATTATGGAGTTGGGGTGAGGCTGTCGCGTGGAGCAGAGGCAAATGAGTTGATTTCAAGGGATGAGATTGTGAGATGCATATCGGAGGTGATGACCCGCGATAGCAGCGGAGGAGAATTCAGGCGGAATGCTTTGAAATTGAAGCAGGCAGCGGCGGCGGCCGTGGTGGACGGCGGATCCTCCCACAAGAACATCCAACAGTTTGTTAATGAGACTAAGAAGAGATGTGTGAGAGGCAACTTTTAGTGAAAAGAGTTCGGTCAGAAATTGATGGAATTTGGCATCAAATACAATAAATATTAATACTATTTTAATAAAAATTAAGATAAAACAGAAAATATGGTTCTATTATATATAAATAGTTTTTTTTTTAATTAAAAAAACAAAAAAAAACAACTACAATATATTAATGTGAAAATTTTCCTACAAAACATTATTAAAAAAAAAAACTTGAATCGAAGAACAACCGCTCGTTCACTGTCATGTTCATTCGTCAAGCTTGGATTCTTGATCAGTCAACACGTTAGTTCTGATACTCAAGTTTTTATGATTAGTAACTTCTAAATCCATCTTTTATGATTGACATTTTATTAGAGTTTTTTGTATGACGTTTATCTCGGGATGCTATGGAGTTTCTTGGGTATTTTGACTAACTATGCTTATCATAAATTTTTACGTGCAAGACTCTCTTTCAACCAAATACAACTTATCTACTAAAATAGTTCAGAGTCGAGGATATTCATGGTCCTACCTAAACCACTCTAAGTTTTCAACCTTCAATAAA
TAAAAACTGTGATAGTAATATTGTAACTTCATTGTTTAGTCAAACAACCCGCCCGCATGTATTAGAAGATGATAATTATGGATGAATTATTATTTAAATTAGTATAGGTCCGGGGATTCTAAATTTGTAAGGAGTTCTTTCACGTCAAAATAGTGGAAGTTTTGATATTCATGGTAAGTAAATTATTTATGTTGTAGATTAGTTTTTGTTTTTATTGTGCATAATTTATTTAAATTATTATTATTATTTTTTTTAATTCCAAAATTTATCTACTCAGACAAAATTTTGACCGTAAAGGAGTTGAATTATGTTAGTACGTTACAAGTTTCGCTATTTGATTTGTTTGAATGATTATTGATTGATTTGTGTTTGTAAACTTTTAGTATATTTTTATTTAAGATTGTAGTGAATGATTTTGTATGGTAGATAAATAAAAAAAAATTAAAAAAATTCAACAATTCAACAACTCAACGCGAGAATAGAGGGTTGGATCAGGAAACAATTTTGGGTTGGGTTAGGTTATCAATCCAACCAACCCAAACTCTTAGATTGGTCCAAAAAATTTCGTCAACTTAACCCAACCTAACCTATGTACACCTCTATTAGTGACAGGGTAGCTCTAGAACGCTCTCCCACCTTAACCGGCAGCCTAAGAACACCTAGGAAAAGAAGGAGAAAGGGGTAAGTATAAAAATACTCAGTAAGTAACCTGTTTGTAGGCTTTCGTTGCATCCTTAACCATGTCCACGAACTTTTCTCTTGGCCCTAAAAAATTCGTGCCTAGCTCTACTGCTATTTGAGCATCTGGGGAGGCCTTAAAGTTTCGAGCGATTCTCTGATCTGACTCTTACTGATATATTAAGGAATCACTACAAGGGGAGTAAGATTCGGATTACCTTGTTGATCGAATATCTCAAGGCAAGAATATTGTTTGAGATTCGAATCACTCCACAAGAAAGATCGATTATGTCTAGCTTGAATGATTCTTGTTGATCAAATATCTCAACGCAAGAACACTTGTTTGAGATTCGAATCACTCCACAAGTAAGATTGATCATGTCGAGCTTGAATGATTCTACATGCAACCTAAACTACATAGAATTGCAAAGAAACTTAGTCATTGGCTAAGAAAGCACAAATGCTCTTTCTACTATATTTACAAGTCTACTTACAAATACAACTCCTTTCTATAGTCTCAACAACTATTAAAGGCATTCCAAGAGGTGCAACATTCATACTTAATGGTTATAATTAACCATTATATAATTGTAACCTGAAGTAAATAAAAAGTCTTAAAATACATTAATGAAATACAATAACTCTAAATTACTCTAAATTGTAATCCACCCGAAATTTATAATATGAAACTTCCTTCTTCTTCAATGTGGCATGAATTGAAATATCTTTTGATAATTTCAACAATATTTTCTTCACATCTTCGTTGAAGCATATTGTATGATTGATGTCTCTTGGTTCATATCTCTTACCCCTTGGGGCCTTTCTGTACGAGCACAAATTACAACATTTTCTAACACAATAGAGAACGTTAAAATAACATATATTGTAGAAATCTCATCATATGGAGCATTCGTGTGAGTTTTCATAACATAAAATGCATTCTTATCCAAAATTTCCGTTCTAAAGTCCTAATCATGCATATTCTAACCACTTTAAACAAGGTATGTTTGGCGTGTCAGAATGTCCTATCAATCCCTACAAATAGTACTTGAGAAAATCTAAATGTTTACTTATATGGATGAAGCGAGACTTTTCGAGCTAGCTTTTCTGTGTGTTAGGGGCTACAAATTTTGTGAAATTTCACTGGTAAGATATAATTCAAGTTGAATTAAGGTTAGCATCAAAACTAGAGTGAAAAAACAGTCAATGGATGTTAAACAAGACAAAAAGGGGAACTCAGAAGCACTCACGTGTGAATGGAGAGTATCTCACGCACACAACTACGCGCGAGTACATATGCATGTTTGGGAGGGTGCATGCGC
ACTCCTTGCGCGACACGTGGTGTGTACAGACAAGCGTACTTGAGTCGAGTCATATCAGATCAGGAAGCCGGTTCAGGTCGATACTATCAAATCGAATTGTTTCTCGAATCTCAGTTTGGGTTGATCCTCTAGATTTGCAATGCGTGGCCAACCCTTTGTCATACGCATTCTTTCACCCTATGTCCAACACTTGTTCTCGCCTGCACCAACCGCGTTCCACCTCTGGCCATTTGACATGTATATGTGCATCATGTTCAACCCAAATTCATCGCAATTTTCACAATTTTTTCTATTTTCTTCGATTTTTTTTAATCTTTTTCTACATAACTCTAATCAACTTAGTTGTAGAGTTTTAATCGATCTTTCTACCTAAATTATTTTCATTCCATTTCATCAATTATCGAGCGATATAAACACATATAAAATTCTTCTTACCGTGTCTTGAGATTCGCTAGAACTCCTCTTTTCGGTCCAAATTGACTCAACATTTCTTCATGAATGTTGTGCAAAATCCCCTCAAGAACAACATACAAATTTTAAAATCTAACTTTTCCAAAATCTCATAGATACGAGTTTTTGCTTTGGCTCGAGTATATGTGCATCATTTTAATTCTCATTTAGGCTCGAGTTTTACAGAAACATAGCATTTAAGATAGTCATGAATCCCAATATACTCACGATTCAATCACATGGGATCCACGAGTGGGTGAGGTGTGTGCACATGGAATCATCCACTAGTCCGTTTTTACAGGTTTGACAACTTCAATGTCAATTCTAATTTTACCCAAATCTTTCTTAGAGTTCTCAGTTATCTCTATTTTCTCCCAATATTATTAACCATTTATTTTGCCGTGTACCGTAAGAATTTAACGTTTAAAAAGTTTGTGATATCTTATATTGGTTGGGGGAGAGAACGAAACACCATCTATAAAGGTGTGGAAACCTTTCCTTATCGAAGGGTTATTAAAGCCTTGATGGAAAGTTCGATAAAAAAAGTCAAAAAAGAACAATATCGTGGATTTGGGCACTTACAAAAGTACACGGTTTTTCGAAAAAAAAAATGCCAATTAACCCTCCTAATCAAACGCAGAACGAAACACCATTTATAAGGATGTGGAAACCTTTCCTTATCGAACGGTTTTTAAAGCCTCGACCCGAAGTCCGATAAAAAAAAAAAGTAAAAAAAATAACAATATCTAGTAACGGGGGATTTAGGTAGTTATAAAAGTACACTGTTTTTCGGGAAAAAAATTTGCCAATTAACCCTCTAACCAATCACACTTCACAAGGGCTAATTACACACCCAGCACGTGGAATTGTGAGATCCAAGCTTCCATGCAGCTGCTGTCATTCCATGTGTTGTTTTTATTGGCATGTTACAATCTAGCTGGAAGATGCTAATGGTGGGTTTACCAAACACTGCATTTTGGACTCTTTGGACGCACAAAGAGTATTTCAAACACACGTCGCCGTCAACTTCACAGTTTTAAAACACGTGTACTATAGAGAGCGACTAGCCTAATCCGACTACTGTCTCGCACCATCTGATGACTGGCTCTGATATCATTTGTAATAGTCTAAGCTTAACGCTAGCAGATATTGTTCGTTTTAGCCTATTACGTAGGGTCGTCAACCTCACGATTTTAAAACATGTTTACTAAACAGAGATTTTCACACTAATACTCAAACTATTGTCATATTCTATGGAAAGTTTTCAAACTTTTATAAAACGTCAATAATAGTGGTTTTTTTTTTTTTTTTTTTTCCAATGATTTAATGTCTAATATATTTTATTTCTAGATTTTTTAAAAAAAATATTTAATTTTAATCATTAAGATTCGAGATATGTTTGTGATCGAAAAAATTATTGAGTAAGATCTTACTCAAAGTTCATAAAATAAGTGTTAATTTATTTTTAAATATTTAATTAAAACATGTAAATTAAAAACTATTTTAAAATGTAGTTGAAGAATAAATAATAAAAAAATTATTATAATGTA
AAAACGACAACGTAGCACACACATGTTTCACGTGCTGCGGCCTAGGGACAAGGAAATTAAAATCCACTGCGTTCGTGCACACGGAAAAGAATGGTCCTCCACAATCACACGTCACTTATTTAAGTTCAATGGACGAAAAATACAAAAATGTTCATTGATGGTGAGGCTATTTGTTGGGAATGAGTCTTTCCCCTAGTAGTAGAAGAATTGGAACCTATGGAAGAAGAATTTGGTGGTGAAGATAGCTCATATTAGACCATTTTTTAGGATGTATATTTATTTAGGAACGTTCGGATCACTGTAGTAGTGAAAGATATGACTTCTGAGAAGGCTTGGAGTGGCGTAAAACCTAATGTTGATTATTTTCGAGTTTTTGGATGCATTGGTCATGTTCGTGTATTAGACGTTAAGATAAATAAGTTAGATGATGAAAGTTTTCGGTGTGTGTTGTGCTGCTAGGGATTAGTGAAGAGTTTAAATCATATAGACTCTATGATTCAGTGTCCAAGAAAATAGTTGTGAGCATATGTGTGATTTTTTAAGAAAGTAAGTGTTGGAATTGGGGGAAAGTAATGAAGAAGCTAGACTTGATACCTTCGAATGGGAAGATAGTAATATAGAAGGAAGCGAACATGACCAAAGTCAGGAAGGATCTGAAGAGGAGGTGGTAGTAGAGAAGAAGGGGAAGTTAGTATATCTTTCAGTAAGTCATCTGAATCAAATTCTTCAACATCTGAAGAAAGCACACTCGAAGTGAGGAGCAGAAGAGCGCCATATGGATGGAAGATTATGTGAGCGGAAATTTTTTTTCTGAAGAAGCTGTGCACAACAATTTAGTTCTGTTAACCTCAATTATCTTTGAAGAAGCTGTTCAAAATTCAAAATAGAGAGCTACAATGGACTTGGAGATAGAAGCCATCGAACGGAAAAGAGACTTGGGAATTGACAGATTAGGAATGAAGAAGATTGGAGTAAACTGACTTTTCAAAACCAAACTCAAGGAAAATGGCAATATTGACGATTATAAGGTTAGGTTGGTAGTAAAAGGTTATGCACAACAACATAGTATAGACTATATCGAGGTGTTTGCACCTATGACTAGGTGGGATACTATGTTGAGGATGGTTGGGAGAGAGTCCCACATTGGCTAATTTAGAGAATGATCATGGGTTTATAGGTAAAGAATACATCTCCATTGATATGAGACATTTTAGGGAAACCAAAAGCAAAGCCATGAGAGCTTGTGCTCAAAGTGGACAATATCATACAATTGTGGAGAGTCGTGATTCCTAACATACTATTCGAATGATAATTGCTTTAGCAGCTTGGAATAGTTCGAACGTGCATCAGCTTGACGTGAAAAGTGTTTTCTTATATGCAAGGTTAAAAGAAGCCATGTTTGTTGAGCAACCACAAGGTTATGACAATCCCCACGGCATGGAACAGTCGAATTGAAAACCACAAGGCTATGATAAAAAAAAGGTATACAAATTGAAGAAGACATTATATGACCTTAAACAAGCTCCATATGCATGGTACAGTCGAATTGAGCAACTGTGACCTTACTTAGGACCTTGTTCTTGAGATTAGGACCGGTTGAAGAAGGAGCCAAGATCAATGCTACCATGTATAAACAATTGACTAGAAGCCTTATATATCTAATTGCAACAAGGCTGGATTTAAAGCTTTCGGTATCCATTTTTTAAAGAAAATTCCAAGATCGAAGAAGAAGAGTTTTAGGATATAATATTTTATTTAAATGATATTTCTCTCACACACTAAATACATATTTTTCTTTTTATGTTTACCATATTTAACCTATTAAATATTAACTTGGAAGTGTTCAAGTTAAATACATCTTCTCCCTTTATATTTAATATGTAAAAATATATTTATCTTTTTGTCTTTACTATATTAAATATTAATTTAAATTAAAAGTATTAAATATCAATTTGAATTCTTCCAATTAAACTTTTTTTTTTATATATATTTAATATATA
TAAGTCAGAGCTCATAACTCACTCAAGATTAAGAACTCACTCAAGATTAAGAATGAGCCTAATATGATCCTTTAATAAATTTATAAAAAGAGATTAAATATTTTATAATATAGTCTTAAACAAATTCATTGTATAGGTAATCCGCTCACAATTCACTCACAATTAAGATAAAATACTTAACCTTTTTTATATACGATAGATACTTAGAGATTTTTATTTTATATTTAAAATTAGTTTAAATCACGTCTTTAAAATAATAAATGATCACCTTAATCATTATCCTAATAAATATTTTATTAAATAATTACAAGTTGGATGATCTTAATTGACTAAAATTAAACATTATGGAAATCTCAAAAATAAAAAATACTAAATTTTAAGATATAATGACCAAATAGAGTTGAAACCCAAAACCTACCGATTAAAACTTAGAATATATTTTTATTTTATACCACCTTGATGTCTTTGGTTTTCCTACCATATTTTATATATATATCACTTTCCGACCTTGTTTTTCACGGGATCTAAGTTGTCTCGCACCTCTACCCCTTCACAATAAATGCATCAATACCGTGATAAAATTTGAGAATAGAGTGGTTCAATGTTACTATATTTCAAAATTACTTCCCTCTTCGCCCGAGCATGGTTCTTTATTGAAACGTACAATAACAGGGTGTTGTTTGGCTTATCTAAACGTTTTGGATCCTTGATGAGTTCTTCTTTGGAGACTCGTCCTTCGTATGGTGTAAACAGGAAATTCCAAATCTCACCGACAGTAATTTTTTCGATAACGACTCTTCCTCCTCTCTCAAATTCTCACTCTTTCTTTTCTTGTAGCGAGGTTTTGGGAAGGGTTGATGTCTGTTTCATGTGTCGTAGGAGTCGAGAGAAGTGTGGTTTGGTTGAGGAGGCGAGAATTTCAGTGCCTCATAAATTAATTAATGGGTATATGGTTGAGAAATGTCATGAAATATTAATTTATTAATATGACAACATTTGATTTGACACTATTATTAATCAATTGGTTAGATGTATTTTATATTAAAATTACGATTATGTATGATTGAGAATTAAGCATCTACCACCTTTAAAATATTTCATTATTATTATTTTATTTTTGCTATGATTTTAGATTGAATTTTATAAATAAAATGATGTCTATTTAGTAGCATTTCAACATACATGTAATTACTCGATTGCCTTTTGGTGCAACCAAAATGAGTTCCAAAGCCTGTCTCCCCCATGTCTTCCTCGTCAGCTTCCCCGGCCAAGGCCATATCAACCCCATGCTCCGCCTCGGCAAGAAACTCGCCGCCGCGGGCCTCCTCGTTACCTTCTCCACCTCCGTCCAACTCGGGTCCCAGATGAAGAACGCTGGGAGCATCTCTGACCACCCGACACCCCTCGGCGATGGCTTCCTCCGCTTCGAATTCTTCGACGACGGCCGAACCGACACCACCCCGACACTCACCTACGACGAATACATGGTGCAGCTCCAACGCCTAGGCGCCATCTCCCTCCGCCAAATATTAGAGAACCAAATGAAAGAAAACCGCCCGGTCTCTTGCGTTATTGGGAACCCTTTTGTGCCTTGGGTTATTGACTTGGCCGACAACCTCGGAATCTCCTCCGCCGTCTTTTGGGTCCAATCATGTTCTGTTTTTTCCGTTTACTATCACCATTTTCGTGGAGCTGTCCCATTCCCTTCTCAAACACAACCAAATCTCGACGTGAAATTACCCTTTTTGCCCCTTTTGAAGTCCGATGAAATCCCAAGCTTCTTGACTCCAAATGACTCTCATCAAGCTATTGGGAAGGACATTTTGAGGCAATTTTCGAATCTCTCCAAACCCTTTTGTATATTAATGGATACTTTTGAAGAGTTGGAGGCTGAGGTCATAAACGACATGTCGAAAAATTTTCCGATCAAGGCGGTGGGGCCTTTGTTTAAGATTTGTAGTGAAATGGAAACGAAGATTCGTGGAGATTGCATGAAAGCT
GCTGATGAGTGTATTGAGTGGCTCGACTCGAAGCCTATCGGATCGGTGGTTTACGTGTCGTTTGGAAGTGTGGTGTTTTTGAAACAAGACCAGATTGATGAGATTGCTTATGCGCTTCATAGTTCGGGGTTTTCTTTCTTGTGGGTTTTGAAACCGCCTTCCGTACATCTTGGAGCCGACCGCCATGTTCTTCCTCTCGAGGTTCGTAAATTTTAATGTTTTTGTTTTGAGATTCTAATCACGATCATGTCAAGTTTGAATTATTATAAACATCCAACCTAAAAATTAGTATAAACATCCAAACTAAACGGTATACAATTGCAACAAAACTTAACTACGGACTAAAAGAATGCACAAATACTCCTTTTACTGTATGTTTTAAGTCTATTTTACAAAGACAATAAATAGGATTAAAGTAGTTATGATATAGAAGAGTCGAATCTAAGAACATTTATAAATGTCTCGATATGCAAACTCTTAGTTCACAAACTTAAAAACCCCACATGATTGAAAATCTTGAAAATTTGATCTAACAATCTTATTGAAAAATAAAATATTTAGATCTAAATTTTCAAGTAACTCAGAAGACAAGGAATCACTCTAAGAGAACATTGGGATCGAATATCTCAAGACAAAATACTTGTTTGAGATTCAAATACTCCACAAACAAGATTGTTCATGTTAAGTTTGAATGATTCTAAACGTCTAACCTAAACTGTGCAACAAAACTTAGCTCTTGACTAAAAGAAAACACAAATGTTATTTTTATTGCATTTTTTAGGTCTCTTTTACAAATACAATATACATGACTTTATATAGTCTCAAAATAAACTCTTGACCTTCTATGAGACATTTCAAGAGTTTCGTAATCTTTCTATTTTATAACACTAATTAGCTACTATGTAAATGTAACCTAAAAATAAATAAAAAAAGTCTTAAAGCTTGGTATATGAAACACCATAAATCTAGTTTTCAATTATGCATTAAAAAAATTTGTAACCATTTAAGAATAAATAAAGTCCAACACGAAGTATCTTGAAGTCATTGATTGCATAAAAGAATTTTGATTCTTCTTAACATGGCATTAATTGCAACTCGGTTTAACTTGACCTGAAACTTGACTTCATTTAATGTAACTTGACTTTATCTTATGGTAATTTGGATAATATTCTTCACAACTTTCTTGAAGTTCATATTGTATGATTAAGATGTCTTGGTTCGTGTCAATTTTAGTAAAAAGAAACCCTCTTTTCAAATTATCGTGTTATTTATTTCGTTTCTTTCTATTTCGATTATTCATTTGAGTGCGTTTAGTAAATTTTTCGGTTCCATGTTGGGGGAGGATTCAAACTTTTGATCATTGTTTTCAAAATTATGTACCTTTAGTATTGATATTGGGTATACTTAAAGTTGACCAATACCTGCCATAAATTGATGGTTTGCATTCACATTGGTTGGTATGTAGGTGGTGGAAGAGATGGGAGAGAGAGGGAAGGTGGTTGAATGGAGTCCACAAGAACAAGTGCTCTCACACCCATCATTGGCATGTTTCCTCACACACTGTGGTTGGAACTCATCTGTGGAGGCCATGAGCTTAGGGGTCCCGATGGTCGCATTTCCCCAATGGGGGGATCAGGTCACCAATGCCAAGTTCCTTGTCGACGTCTTCGGCGTCGGCCTCCGCCTGTCCCGTGGCGCCAATGAAGATAGGCTAATACAAAGAGATGAGATTGAGACGTGCCTGAGAGAAGCCATGGAAGGCCCAAGGGCGGTGGAGATTAGACAGAACGCTTTGAAGCAGCAAAAGGCGGCGGAGAAGGCGGTGGCTGACGGCGGCTCCTCCGATCGAAATATTAAGGACTTCATCGATGAGATTCGAAAA

mRNA sequence

ATATCCACATTCCTCAAAATCAGCCATGATGAATTCTGAAGAAGCTCCTTGCCATGTCTTCCTTGTATGCTATCCCAGCCAAGGACACATCAACCCCACTCTCAGACTTGCCAAGAAACTCGCCGCCGAGGGCCTTCTCGTCACCATTTCCACGGCGGTGCATTTCGGCAAAACACTGCAGAAAGCTGGAAGTATCGGCGCCGGCGACTGTCCCACTCCGGTCGGCAATGGCTTCATCAGATTTGAATTCTTCGAAGATCGCCTCCAAGAGATCAATCCCAAGGATATGAACTTGACCCGCTATAACAACCAGCTCGAGCTCTCCGGCCGGCCGTCGCTCACCGGCCTGATCAAGAACCAAACAGCCGAAAACCGCCCTGTTTCTTGCCTGATTGTGAACCCCTTTTTTCCATGGACATGCGAAGTTGCTAAGGAGCTTGGAATCCCCTGTGCCGTTCTTTGGGTTCAATCATGCTCTGTGTTCTCAATTTACTATCACTGTTTCCACAAATCCGTCCCGTTCCCTTCTGAATTGGAACCCAAAATCGACGTTCATCTCCCAATTTTGCCACTTTTGAAGAACGATGAAATCCCAAGCTTCTTGCATCCAAATAACATCTATGGCGTTTTGGGGAAAGTTCTGTTATCCCAATTCAGTAAATTATCAATACCCTTTTGTATTTTGATGGATAGTTTCGATGAACTCGAGAAAGATATCATCAGTTACATGTCTAATATCATTCCTTTGAAACCCATTGGCCCATTGTTCTTAAACCCACAAAATGTGGAAACAGAGGTCTCTGCCGACTGCTTAAAAGCAGAGGATTGTATGGAATGGCTAAACTCGAAGCCCCCACAATCCGTTGTGTACGTTTCATTTGGAAGCATCGTTTATTTGAAACAAGAGCAAATAGACGAGCTCGCTTATGGACTGTGCAATTCAGGGCTCTCATTCTTATGGGTTATGAAGCCGCCCAATGAAGCTCTCGGGTTAAAGGGCCATATTTTGCCGGAAGGGGTAATGGAGAAAGCAGGGGAGAGGGGAAAAGTGGTGCAATGGAGTTCGCAAGAGAGGGTTTTGTCGCATGAATCTGTTGGGTGTTTTATGACGCATTGTGGGTGGAATTCGTCGGTGGAAGCCATCGGCTGCGGCGTGCCGGTGGTGGCGTTTCCGCAGTGGGGGGATCAGGTGACCAATGCTAAGTTTTTGGTGGAGGATTATGGAGTTGGGGTGAGGCTGTCGCGTGGAGCAGAGGCAAATGAGTTGATTTCAAGGGATGAGATTGTGAGATGCATATCGGAGGTGATGACCCGCGATAGCAGCGGAGGAGAATTCAGGCGGAATGCTTTGAAATTGAAGCAGGCAGCGGCGGCGGCCGTGGTGGACGGCGGATCCTCCCACAAGAACATCCAACATTCCAAAGCCTGTCTCCCCCATGTCTTCCTCGTCAGCTTCCCCGGCCAAGGCCATATCAACCCCATGCTCCGCCTCGGCAAGAAACTCGCCGCCGCGGGCCTCCTCGTTACCTTCTCCACCTCCGTCCAACTCGGGTCCCAGATGAAGAACGCTGGGAGCATCTCTGACCACCCGACACCCCTCGGCGATGGCTTCCTCCGCTTCGAATTCTTCGACGACGGCCGAACCGACACCACCCCGACACTCACCTACGACGAATACATGGTGCAGCTCCAACGCCTAGGCGCCATCTCCCTCCGCCAAATATTAGAGAACCAAATGAAAGAAAACCGCCCGGTCTCTTGCGTTATTGGGAACCCTTTTGTGCCTTGGGTTATTGACTTGGCCGACAACCTCGGAATCTCCTCCGCCGTCTTTTGGGTCCAATCATGTTCTGTTTTTTCCGTTTACTATCACCATTTTCGTGGAGCTGTCCCATTCCCTTCTCAAACACAACCAAATCTCGACGTGAAATTACCCTTTTTGCCCCTTTTGAAGTCCGATGAAATCCCAAGCTTCTTGACTCCAAATGACTCTCATCAAG
CTATTGGGAAGGACATTTTGAGGCAATTTTCGAATCTCTCCAAACCCTTTTGTATATTAATGGATACTTTTGAAGAGTTGGAGGCTGAGGTCATAAACGACATGTCGAAAAATTTTCCGATCAAGGCGGTGGGGCCTTTGTTTAAGATTTGTAGTGAAATGGAAACGAAGATTCGTGGAGATTGCATGAAAGCTGCTGATGAGTGTATTGAGTGGCTCGACTCGAAGCCTATCGGATCGGTGGTTTACGTGTCGTTTGGAAGTGTGGTGTTTTTGAAACAAGACCAGATTGATGAGATTGCTTATGCGCTTCATAGTTCGGGGTTTTCTTTCTTGTGGGTTTTGAAACCGCCTTCCGTACATCTTGGAGCCGACCGCCATGTTCTTCCTCTCGAGGTGGTGGAAGAGATGGGAGAGAGAGGGAAGGTGGTTGAATGGAGTCCACAAGAACAAGTGCTCTCACACCCATCATTGGCATGTTTCCTCACACACTGTGGTTGGAACTCATCTGTGGAGGCCATGAGCTTAGGGGTCCCGATGGTCGCATTTCCCCAATGGGGGGATCAGGTCACCAATGCCAAGTTCCTTGTCGACGTCTTCGGCGTCGGCCTCCGCCTGTCCCGTGGCGCCAATGAAGATAGGCTAATACAAAGAGATGAGATTGAGACGTGCCTGAGAGAAGCCATGGAAGGCCCAAGGGCGGTGGAGATTAGACAGAACGCTTTGAAGCAGCAAAAGGCGGCGGAGAAGGCGGTGGCTGACGGCGGCTCCTCCGATCGAAATATTAAGGACTTCATCGATGAGATTCGAAAA

Coding sequence (CDS)

ATGATGAATTCTGAAGAAGCTCCTTGCCATGTCTTCCTTGTATGCTATCCCAGCCAAGGACACATCAACCCCACTCTCAGACTTGCCAAGAAACTCGCCGCCGAGGGCCTTCTCGTCACCATTTCCACGGCGGTGCATTTCGGCAAAACACTGCAGAAAGCTGGAAGTATCGGCGCCGGCGACTGTCCCACTCCGGTCGGCAATGGCTTCATCAGATTTGAATTCTTCGAAGATCGCCTCCAAGAGATCAATCCCAAGGATATGAACTTGACCCGCTATAACAACCAGCTCGAGCTCTCCGGCCGGCCGTCGCTCACCGGCCTGATCAAGAACCAAACAGCCGAAAACCGCCCTGTTTCTTGCCTGATTGTGAACCCCTTTTTTCCATGGACATGCGAAGTTGCTAAGGAGCTTGGAATCCCCTGTGCCGTTCTTTGGGTTCAATCATGCTCTGTGTTCTCAATTTACTATCACTGTTTCCACAAATCCGTCCCGTTCCCTTCTGAATTGGAACCCAAAATCGACGTTCATCTCCCAATTTTGCCACTTTTGAAGAACGATGAAATCCCAAGCTTCTTGCATCCAAATAACATCTATGGCGTTTTGGGGAAAGTTCTGTTATCCCAATTCAGTAAATTATCAATACCCTTTTGTATTTTGATGGATAGTTTCGATGAACTCGAGAAAGATATCATCAGTTACATGTCTAATATCATTCCTTTGAAACCCATTGGCCCATTGTTCTTAAACCCACAAAATGTGGAAACAGAGGTCTCTGCCGACTGCTTAAAAGCAGAGGATTGTATGGAATGGCTAAACTCGAAGCCCCCACAATCCGTTGTGTACGTTTCATTTGGAAGCATCGTTTATTTGAAACAAGAGCAAATAGACGAGCTCGCTTATGGACTGTGCAATTCAGGGCTCTCATTCTTATGGGTTATGAAGCCGCCCAATGAAGCTCTCGGGTTAAAGGGCCATATTTTGCCGGAAGGGGTAATGGAGAAAGCAGGGGAGAGGGGAAAAGTGGTGCAATGGAGTTCGCAAGAGAGGGTTTTGTCGCATGAATCTGTTGGGTGTTTTATGACGCATTGTGGGTGGAATTCGTCGGTGGAAGCCATCGGCTGCGGCGTGCCGGTGGTGGCGTTTCCGCAGTGGGGGGATCAGGTGACCAATGCTAAGTTTTTGGTGGAGGATTATGGAGTTGGGGTGAGGCTGTCGCGTGGAGCAGAGGCAAATGAGTTGATTTCAAGGGATGAGATTGTGAGATGCATATCGGAGGTGATGACCCGCGATAGCAGCGGAGGAGAATTCAGGCGGAATGCTTTGAAATTGAAGCAGGCAGCGGCGGCGGCCGTGGTGGACGGCGGATCCTCCCACAAGAACATCCAACATTCCAAAGCCTGTCTCCCCCATGTCTTCCTCGTCAGCTTCCCCGGCCAAGGCCATATCAACCCCATGCTCCGCCTCGGCAAGAAACTCGCCGCCGCGGGCCTCCTCGTTACCTTCTCCACCTCCGTCCAACTCGGGTCCCAGATGAAGAACGCTGGGAGCATCTCTGACCACCCGACACCCCTCGGCGATGGCTTCCTCCGCTTCGAATTCTTCGACGACGGCCGAACCGACACCACCCCGACACTCACCTACGACGAATACATGGTGCAGCTCCAACGCCTAGGCGCCATCTCCCTCCGCCAAATATTAGAGAACCAAATGAAAGAAAACCGCCCGGTCTCTTGCGTTATTGGGAACCCTTTTGTGCCTTGGGTTATTGACTTGGCCGACAACCTCGGAATCTCCTCCGCCGTCTTTTGGGTCCAATCATGTTCTGTTTTTTCCGTTTACTATCACCATTTTCGTGGAGCTGTCCCATTCCCTTCTCAAACACAACCAAATCTCGACGTGAAATTACCCTTTTTGCCCCTTTTGAAGTCCGATGAAATCCCAAGCTTCTTGACTCCAAATGACTCTCATCAAGCTATTGGGAAGGACATTTTGAGGCA
ATTTTCGAATCTCTCCAAACCCTTTTGTATATTAATGGATACTTTTGAAGAGTTGGAGGCTGAGGTCATAAACGACATGTCGAAAAATTTTCCGATCAAGGCGGTGGGGCCTTTGTTTAAGATTTGTAGTGAAATGGAAACGAAGATTCGTGGAGATTGCATGAAAGCTGCTGATGAGTGTATTGAGTGGCTCGACTCGAAGCCTATCGGATCGGTGGTTTACGTGTCGTTTGGAAGTGTGGTGTTTTTGAAACAAGACCAGATTGATGAGATTGCTTATGCGCTTCATAGTTCGGGGTTTTCTTTCTTGTGGGTTTTGAAACCGCCTTCCGTACATCTTGGAGCCGACCGCCATGTTCTTCCTCTCGAGGTGGTGGAAGAGATGGGAGAGAGAGGGAAGGTGGTTGAATGGAGTCCACAAGAACAAGTGCTCTCACACCCATCATTGGCATGTTTCCTCACACACTGTGGTTGGAACTCATCTGTGGAGGCCATGAGCTTAGGGGTCCCGATGGTCGCATTTCCCCAATGGGGGGATCAGGTCACCAATGCCAAGTTCCTTGTCGACGTCTTCGGCGTCGGCCTCCGCCTGTCCCGTGGCGCCAATGAAGATAGGCTAATACAAAGAGATGAGATTGAGACGTGCCTGAGAGAAGCCATGGAAGGCCCAAGGGCGGTGGAGATTAGACAGAACGCTTTGAAGCAGCAAAAGGCGGCGGAGAAGGCGGTGGCTGACGGCGGCTCCTCCGATCGAAATATTAAGGACTTCATCGATGAGATTCGAAAA

Protein sequence

MMNSEEAPCHVFLVCYPSQGHINPTLRLAKKLAAEGLLVTISTAVHFGKTLQKAGSIGAGDCPTPVGNGFIRFEFFEDRLQEINPKDMNLTRYNNQLELSGRPSLTGLIKNQTAENRPVSCLIVNPFFPWTCEVAKELGIPCAVLWVQSCSVFSIYYHCFHKSVPFPSELEPKIDVHLPILPLLKNDEIPSFLHPNNIYGVLGKVLLSQFSKLSIPFCILMDSFDELEKDIISYMSNIIPLKPIGPLFLNPQNVETEVSADCLKAEDCMEWLNSKPPQSVVYVSFGSIVYLKQEQIDELAYGLCNSGLSFLWVMKPPNEALGLKGHILPEGVMEKAGERGKVVQWSSQERVLSHESVGCFMTHCGWNSSVEAIGCGVPVVAFPQWGDQVTNAKFLVEDYGVGVRLSRGAEANELISRDEIVRCISEVMTRDSSGGEFRRNALKLKQAAAAAVVDGGSSHKNIQHSKACLPHVFLVSFPGQGHINPMLRLGKKLAAAGLLVTFSTSVQLGSQMKNAGSISDHPTPLGDGFLRFEFFDDGRTDTTPTLTYDEYMVQLQRLGAISLRQILENQMKENRPVSCVIGNPFVPWVIDLADNLGISSAVFWVQSCSVFSVYYHHFRGAVPFPSQTQPNLDVKLPFLPLLKSDEIPSFLTPNDSHQAIGKDILRQFSNLSKPFCILMDTFEELEAEVINDMSKNFPIKAVGPLFKICSEMETKIRGDCMKAADECIEWLDSKPIGSVVYVSFGSVVFLKQDQIDEIAYALHSSGFSFLWVLKPPSVHLGADRHVLPLEVVEEMGERGKVVEWSPQEQVLSHPSLACFLTHCGWNSSVEAMSLGVPMVAFPQWGDQVTNAKFLVDVFGVGLRLSRGANEDRLIQRDEIETCLREAMEGPRAVEIRQNALKQQKAAEKAVADGGSSDRNIKDFIDEIRK
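
The protein sequence is the codon-by-codon translation of the CDS. The sketch below shows that relationship on the first 21 nucleotides of the CDS, assuming the standard genetic code (NCBI translation table 1). `translate` is a hypothetical helper for illustration; it does not handle ambiguous bases such as N.

```python
from itertools import product

# Standard genetic code (assumed), laid out in TCAG order for each codon position.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a coding sequence in frame 0; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# First seven codons of the CDS listed above.
print(translate("ATGATGAATTCTGAAGAAGCT"))  # MMNSEEA
```

The output matches the first seven residues of the protein sequence above. Translating the final 18 nucleotides of the CDS the same way gives IDEIRK, the last six residues.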
Homology
BLAST of Cp4.1LG04g07760 vs. ExPASy Swiss-Prot
Match: A0A193AU77 (Gallate 1-beta-glucosyltransferase 84A24 OS=Punica granatum OX=22663 GN=UGT84A24 PE=1 SV=1)

HSP 1 Score: 582.8 bits (1501), Expect = 6.9e-165
Identity = 271/462 (58.66%), Positives = 359/462 (77.71%), Query Frame = 0

Query: 469 LPHVFLVSFPGQGHINPMLRLGKKLAAAGLLVTFSTSVQLGSQMKNAGSISDHPTPLGDG 528
           L HVFLVSFPGQGH+NP+LRLGK+LA+ GLLVTF+T   +G QM+ A +I + P+P+GDG
Sbjct: 6   LVHVFLVSFPGQGHVNPLLRLGKRLASKGLLVTFTTPESIGKQMRKASNIGEEPSPIGDG 65

Query: 529 FLRFEFFDDGRTDTTP-TLTYDEYMVQLQRLGAISLRQILENQMKENRPVSCVIGNPFVP 588
           F+RFEFF+DG  +  P     D+Y+ QL+++G   + ++++   ++NRPVSC+I NPF+P
Sbjct: 66  FIRFEFFEDGWDEDEPRRQDLDQYLPQLEKVGKEVIPRMIKKNEEQNRPVSCLINNPFIP 125

Query: 589 WVIDLADNLGISSAVFWVQSCSVFSVYYHHFRGAVPFPSQTQPNLDVKLPFLPLLKSDEI 648
           WV D+A++LG+ SA+ WVQSC+ F+ YYH++ G VPFPS++   +DV+LP +PLLK DE+
Sbjct: 126 WVSDVAESLGLPSAMLWVQSCACFAAYYHYYHGLVPFPSESAMEIDVQLPCMPLLKHDEV 185

Query: 649 PSFLTPNDSHQAIGKDILRQFSNLSKPFCILMDTFEELEAEVINDMSKNFPIKAVGPLFK 708
           PSFL P   +  + + I+ Q+ NL KPFC+LMDTF+ELE E+I  MSK  PIK VGPLFK
Sbjct: 186 PSFLYPTTPYPFLRRAIMGQYKNLDKPFCVLMDTFQELEHEIIEYMSKICPIKTVGPLFK 245

Query: 709 ICSEMETKIRGDCMKAADECIEWLDSKPIGSVVYVSFGSVVFLKQDQIDEIAYALHSSGF 768
                   +RGD MK AD+CI WLDSKP  SVVYVSFGSVV+LKQDQ DEIA+ L +SG 
Sbjct: 246 NPKAPNANVRGDFMK-ADDCISWLDSKPPASVVYVSFGSVVYLKQDQWDEIAFGLLNSGL 305

Query: 769 SFLWVLKPPSVHLGADRHVLPLEVVEEMGERGKVVEWSPQEQVLSHPSLACFLTHCGWNS 828
           +FLWV+KPP    G     LP   +E+ G++GKVV+WSPQEQVL+HPS+ACF+THCGWNS
Sbjct: 306 NFLWVMKPPHKDSGYQLLTLPEGFLEKAGDKGKVVQWSPQEQVLAHPSVACFVTHCGWNS 365

Query: 829 SVEAMSLGVPMVAFPQWGDQVTNAKFLVDVFGVGLRLSRGANEDRLIQRDEIETCLREAM 888
           S+EA+S G+P+VAFPQWGDQVT+AK+LVDVF VG+R+ RG  E++LI RD +E CL EA 
Sbjct: 366 SMEALSSGMPVVAFPQWGDQVTDAKYLVDVFKVGVRMCRGEAENKLIMRDVVEKCLLEAT 425

Query: 889 EGPRAVEIRQNALKQQKAAEKAVADGGSSDRNIKDFIDEIRK 930
            GP+A E+++NALK + AAE AVA+GGSSDRNI+ F+DE+++
Sbjct: 426 VGPKAAEVKENALKWKAAAEAAVAEGGSSDRNIQAFVDEVKR 466
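
The percentages in an HSP summary line are the match counts divided by the alignment length. A short sketch that recomputes them from the line itself follows, assuming the `label = n/len (pct%)` layout used in the summaries on this page. `hsp_percentages` is a hypothetical helper for illustration.

```python
import re

# Matches "<label> = <count>/<alignment length>" pairs in an HSP summary line.
FRACTION_RE = re.compile(r"(\w+)\s*=\s*(\d+)/(\d+)")

def hsp_percentages(line):
    """Return {label: percentage}, rounded to two decimals."""
    return {label: round(100 * int(count) / int(length), 2)
            for label, count, length in FRACTION_RE.findall(line)}

line = "Identity = 271/462 (58.66%), Positives = 359/462 (77.71%), Query Frame = 0"
print(hsp_percentages(line))
```

For the first Swiss-Prot hit this reproduces the reported 58.66% and 77.71%. The `Query Frame = 0` field is skipped because it carries no fraction.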

BLAST of Cp4.1LG04g07760 vs. ExPASy Swiss-Prot
Match: A0A193AUF6 (Gallate 1-beta-glucosyltransferase 84A23 OS=Punica granatum OX=22663 GN=UGT84A23 PE=1 SV=1)

HSP 1 Score: 582.4 bits (1500), Expect = 9.0e-165
Identity = 272/466 (58.37%), Positives = 357/466 (76.61%), Query Frame = 0

Query: 465 SKACLPHVFLVSFPGQGHINPMLRLGKKLAAAGLLVTFSTSVQLGSQMKNAGSISDHPTP 524
           S++ L HVFLVSFPGQGH+NP+LRLGK+LA+ GLLVTF+T   +G QM+ A +ISD P P
Sbjct: 3   SESSLVHVFLVSFPGQGHVNPLLRLGKRLASKGLLVTFTTPESIGKQMRKASNISDQPAP 62

Query: 525 LGDGFLRFEFFDDGRTDTTP-TLTYDEYMVQLQRLGAISLRQILENQMKENRPVSCVIGN 584
           +GDGF+RFEFF+DG  +  P     D+Y+ QL+++G + + Q+++   ++ RPVSC+I N
Sbjct: 63  VGDGFIRFEFFEDGWDEDEPRRQDLDQYLPQLEKVGKVLIPQMIQKNAEQGRPVSCLINN 122

Query: 585 PFVPWVIDLADNLGISSAVFWVQSCSVFSVYYHHFRGAVPFPSQTQPNLDVKLPFLPLLK 644
           PF+PWV D+A+ LG+ SA+ WVQSC+ F  YYH++ G VPFPS+    +DV+LP +PLLK
Sbjct: 123 PFIPWVSDVAETLGLPSAMLWVQSCACFLAYYHYYHGLVPFPSENAMEIDVQLPSMPLLK 182

Query: 645 SDEIPSFLTPNDSHQAIGKDILRQFSNLSKPFCILMDTFEELEAEVINDMSKNFPIKAVG 704
            DE+PSFL P   +  + + IL Q+ NL KPFCILMDTF+ELE E+I   SK  PIK VG
Sbjct: 183 HDEVPSFLYPTTPYPFLRRAILGQYKNLEKPFCILMDTFQELEHEIIEYTSKICPIKTVG 242

Query: 705 PLFKICSEMETKIRGDCMKAADECIEWLDSKPIGSVVYVSFGSVVFLKQDQIDEIAYALH 764
           PLFK      T ++GD MK AD+CI WLDSKP  SVVYVSFGSVV+LKQDQ DEIAY L 
Sbjct: 243 PLFKNPKAPNTTVKGDFMK-ADDCIGWLDSKPASSVVYVSFGSVVYLKQDQWDEIAYGLL 302

Query: 765 SSGFSFLWVLKPPSVHLGADRHVLPLEVVEEMGERGKVVEWSPQEQVLSHPSLACFLTHC 824
           +SG +FLWV+KPP    G     LP   +E+ G+RGKVV+WSPQEQVL+HP+ ACF+THC
Sbjct: 303 NSGVNFLWVMKPPHKDSGYTVLTLPEGFLEKAGDRGKVVQWSPQEQVLAHPATACFVTHC 362

Query: 825 GWNSSVEAMSLGVPMVAFPQWGDQVTNAKFLVDVFGVGLRLSRGANEDRLIQRDEIETCL 884
           GWNSS+EA++ G+P+VAFPQWGDQVT+AK+LVD F VG+R+ RG  ED+LI RD +E CL
Sbjct: 363 GWNSSMEALTSGMPVVAFPQWGDQVTDAKYLVDEFKVGVRMCRGEAEDKLITRDVVEQCL 422

Query: 885 REAMEGPRAVEIRQNALKQQKAAEKAVADGGSSDRNIKDFIDEIRK 930
           REA +GP+A E+++NALK + AAE +  +GGSSDRN++ F+DE+++
Sbjct: 423 REATQGPKAAEMKKNALKWKAAAEASFVEGGSSDRNLQAFVDEVKR 467

BLAST of Cp4.1LG04g07760 vs. ExPASy Swiss-Prot
Match: V5LLZ9 (Gallate 1-beta-glucosyltransferase OS=Quercus robur OX=38942 GN=UGT84A13 PE=1 SV=1)

HSP 1 Score: 568.5 bits (1464), Expect = 1.3e-160
Identity = 269/461 (58.35%), Positives = 352/461 (76.36%), Query Frame = 0

Query: 469 LPHVFLVSFPGQGHINPMLRLGKKLAAAGLLVTFSTSVQLGSQMKNAGSISDHPTPLGDG 528
           L HVFLVSFPGQGH+NP+LRLGK+LAA GLLVTFST   +G QM+ A +I+D P P+G+G
Sbjct: 6   LVHVFLVSFPGQGHVNPLLRLGKRLAAKGLLVTFSTPESIGKQMRKASNITDEPAPVGEG 65

Query: 529 FLRFEFFDDGRTDTTP-TLTYDEYMVQLQRLGAISLRQILENQMKENRPVSCVIGNPFVP 588
           F+RFEFF+DG  +  P     D+Y+ QL+ +G   + +++    +  RPVSC+I NPF+P
Sbjct: 66  FIRFEFFEDGWDEDEPRRQDLDQYLPQLELIGKDIIPKMIRKNAEMGRPVSCLINNPFIP 125

Query: 589 WVIDLADNLGISSAVFWVQSCSVFSVYYHHFRGAVPFPSQTQPNLDVKLPFLPLLKSDEI 648
           WV D+A++LG+ SA+ WVQSC+ F  YYH++ G VPFPS+ +P +D++LP +PLLK DE 
Sbjct: 126 WVSDVAESLGLPSAMLWVQSCACFCAYYHYYHGLVPFPSEAEPFIDIQLPCMPLLKYDET 185

Query: 649 PSFLTPNDSHQAIGKDILRQFSNLSKPFCILMDTFEELEAEVINDMSKNFPIKAVGPLFK 708
           PSFL P   +  + + IL Q+ NL KPFCILMDTF+ELE EVI  MSK  PIK VGPLFK
Sbjct: 186 PSFLYPTTPYPFLRRAILGQYGNLDKPFCILMDTFQELEHEVIEFMSKICPIKTVGPLFK 245

Query: 709 ICSEMETKIRGDCMKAADECIEWLDSKPIGSVVYVSFGSVVFLKQDQIDEIAYALHSSGF 768
              +    +RGD MK AD+C+EWLDSKP  SVVY+SFGSVV+L Q Q+DEIA+ L  SG 
Sbjct: 246 -NPKAPNSVRGDFMK-ADDCLEWLDSKPPQSVVYISFGSVVYLTQKQVDEIAFGLLQSGV 305

Query: 769 SFLWVLKPPSVHLGADRHVLPLEVVEEMGERGKVVEWSPQEQVLSHPSLACFLTHCGWNS 828
           SFLWV+KPP    G +  VLP   +E+ G+ G+VV+WSPQEQVL+HPS+ACF+THCGWNS
Sbjct: 306 SFLWVMKPPHKDAGLELLVLPDGFLEKAGDNGRVVQWSPQEQVLAHPSVACFVTHCGWNS 365

Query: 829 SVEAMSLGVPMVAFPQWGDQVTNAKFLVDVFGVGLRLSRGANEDRLIQRDEIETCLREAM 888
           ++E+++ G+P+VAFPQWGDQVT+A +LVDVF  G+R+ RG  E+R+I RDE+E CL EA 
Sbjct: 366 TMESLTSGMPVVAFPQWGDQVTDAVYLVDVFKTGVRMCRGEAENRVITRDEVEKCLLEAT 425

Query: 889 EGPRAVEIRQNALKQQKAAEKAVADGGSSDRNIKDFIDEIR 929
            GP+AVE++QNA K + AAE A ++GGSSDRNI+ F+DE+R
Sbjct: 426 VGPKAVEMKQNASKWKAAAEAAFSEGGSSDRNIQAFVDEVR 464

BLAST of Cp4.1LG04g07760 vs. ExPASy Swiss-Prot
Match: Q9MB73 (Limonoid UDP-glucosyltransferase OS=Citrus unshiu OX=55188 PE=2 SV=1)

HSP 1 Score: 565.8 bits (1457), Expect = 8.7e-160
Identity = 263/462 (56.93%), Positives = 350/462 (75.76%), Query Frame = 0

Query: 469 LPHVFLVSFPGQGHINPMLRLGKKLAAAGLLVTFSTSVQLGSQMKNAGSISDHPTPLGDG 528
           L HV LVSFPG GH+NP+LRLG+ LA+ G  +T +T    G QM+ AG+ +  PTP+GDG
Sbjct: 6   LVHVLLVSFPGHGHVNPLLRLGRLLASKGFFLTLTTPESFGKQMRKAGNFTYEPTPVGDG 65

Query: 529 FLRFEFFDDGRTDTTPTL-TYDEYMVQLQRLGAISLRQILENQMKENRPVSCVIGNPFVP 588
           F+RFEFF+DG  +  P     D+YM QL+ +G   + +I++   +E RPVSC+I NPF+P
Sbjct: 66  FIRFEFFEDGWDEDDPRREDLDQYMAQLELIGKQVIPKIIKKSAEEYRPVSCLINNPFIP 125

Query: 589 WVIDLADNLGISSAVFWVQSCSVFSVYYHHFRGAVPFPSQTQPNLDVKLPFLPLLKSDEI 648
           WV D+A++LG+ SA+ WVQSC+ F+ YYH+F G VPFPS+ +P +DV+LP +PLLK DE+
Sbjct: 126 WVSDVAESLGLPSAMLWVQSCACFAAYYHYFHGLVPFPSEKEPEIDVQLPCMPLLKHDEM 185

Query: 649 PSFLTPNDSHQAIGKDILRQFSNLSKPFCILMDTFEELEAEVINDMSKNFPIKAVGPLFK 708
           PSFL P+  +  + + IL Q+ NL KPFCIL+DTF ELE E+I+ M+K  PIK VGPLFK
Sbjct: 186 PSFLHPSTPYPFLRRAILGQYENLGKPFCILLDTFYELEKEIIDYMAKICPIKPVGPLFK 245

Query: 709 ICSEMETKIRGDCMKAADECIEWLDSKPIGSVVYVSFGSVVFLKQDQIDEIAYALHSSGF 768
                   +R DCMK  DECI+WLD KP  SVVY+SFG+VV+LKQ+Q++EI YAL +SG 
Sbjct: 246 NPKAPTLTVRDDCMK-PDECIDWLDKKPPSSVVYISFGTVVYLKQEQVEEIGYALLNSGI 305

Query: 769 SFLWVLKPPSVHLGADRHVLPLEVVEEMGERGKVVEWSPQEQVLSHPSLACFLTHCGWNS 828
           SFLWV+KPP    G     LP   +E++G++GKVV+WSPQE+VL+HPS+ACF+THCGWNS
Sbjct: 306 SFLWVMKPPPEDSGVKIVDLPDGFLEKVGDKGKVVQWSPQEKVLAHPSVACFVTHCGWNS 365

Query: 829 SVEAMSLGVPMVAFPQWGDQVTNAKFLVDVFGVGLRLSRGANEDRLIQRDEIETCLREAM 888
           ++E+++ GVP++ FPQWGDQVT+A +L DVF  GLRL RG  E+R+I RDE+E CL EA 
Sbjct: 366 TMESLASGVPVITFPQWGDQVTDAMYLCDVFKTGLRLCRGEAENRIISRDEVEKCLLEAT 425

Query: 889 EGPRAVEIRQNALKQQKAAEKAVADGGSSDRNIKDFIDEIRK 930
            GP+AV + +NALK +K AE+AVADGGSSDRNI+ F+DE+R+
Sbjct: 426 AGPKAVALEENALKWKKEAEEAVADGGSSDRNIQAFVDEVRR 466

BLAST of Cp4.1LG04g07760 vs. ExPASy Swiss-Prot
Match: Q2V6K1 (Putative UDP-glucose glucosyltransferase OS=Fragaria ananassa OX=3747 GN=GT5 PE=2 SV=1)

HSP 1 Score: 562.0 bits (1447), Expect = 1.3e-158
Identity = 275/463 (59.40%), Positives = 350/463 (75.59%), Query Frame = 0

Query: 471 HVFLVSFPGQGHINPMLRLGKKLAAAGLLVTFSTSVQLGSQMKNA-GSISDHPTPLGDGF 530
           H+FLV +P QGHINPMLRLGK LAA GLLVTFST+   G++M+NA G + +HPTP+G+GF
Sbjct: 10  HIFLVCYPAQGHINPMLRLGKYLAAKGLLVTFSTTEDYGNKMRNANGIVDNHPTPVGNGF 69

Query: 531 LRFEFFDDGRTD-TTPTLTYDEYMVQ-LQRLGAISLRQILENQMKE-NRPVSCVIGNPFV 590
           +RFEFFDD   D   P  T  E+ V  L+++G   +  +++   +E    VSC++ NPF+
Sbjct: 70  IRFEFFDDSLPDPDDPRRTNLEFYVPLLEKVGKELVTGMIKKHGEEGGARVSCLVNNPFI 129

Query: 591 PWVIDLADNLGISSAVFWVQSCSVFSVYYHHFRGAVPFPSQTQPNLDVKLPFLPLLKSDE 650
           PWV D+A  LGI  A  W+QSC+VFS Y+H+    V FP++ +P LDV+LP  PLLK DE
Sbjct: 130 PWVCDVATELGIPCATLWIQSCAVFSAYFHYNAETVKFPTEAEPELDVQLPSTPLLKHDE 189

Query: 651 IPSFLTPNDSHQAIGKDILRQFSNLSKPFCILMDTFEELEAEVINDMSKNFPIKAVGPLF 710
           IPSFL P D +  +G+ IL QF  LSK   ILMDT +ELE E++ +MSK   +K VGPLF
Sbjct: 190 IPSFLHPFDPYAILGRAILGQFKKLSKSSYILMDTIQELEPEIVEEMSKVCLVKPVGPLF 249

Query: 711 KICSEMETKIRGDCMKAADECIEWLDSKPIGSVVYVSFGSVVFLKQDQIDEIAYALHSSG 770
           KI     T IRGD +K AD+C++WL SKP  SVVY+SFGS+V+LKQ+Q+DEIA+ L SSG
Sbjct: 250 KIPEATNTTIRGDLIK-ADDCLDWLSSKPPASVVYISFGSIVYLKQEQVDEIAHGLLSSG 309

Query: 771 FSFLWVLKPPSVHLGADRHVLPLEVVEEMGERGKVVEWSPQEQVLSHPSLACFLTHCGWN 830
            SFLWV++PP    G D HVLP   +E++G+ GK+V+WSPQEQVL+HPSLACFLTHCGWN
Sbjct: 310 VSFLWVMRPPRKAAGVDMHVLPEGFLEKVGDNGKLVQWSPQEQVLAHPSLACFLTHCGWN 369

Query: 831 SSVEAMSLGVPMVAFPQWGDQVTNAKFLVDVFGVGLRLSRGANEDRLIQRDEIETCLREA 890
           SSVEA++LGVP+V FPQWGDQVTNAK+LVDVFGVGLRL RG  E+RL+ RDE+E CL EA
Sbjct: 370 SSVEALTLGVPVVTFPQWGDQVTNAKYLVDVFGVGLRLCRGVAENRLVLRDEVEKCLLEA 429

Query: 891 MEGPRAVEIRQNALKQQKAAEKAVADGGSSDRNIKDFIDEIRK 930
             G +AV+++ NALK +K AE+AVA+GGSS RN+ DFIDEI +
Sbjct: 430 TVGEKAVQLKHNALKWKKVAEEAVAEGGSSQRNLHDFIDEIAR 471

BLAST of Cp4.1LG04g07760 vs. NCBI nr
Match: KJB32309.1 (hypothetical protein B456_005G234600 [Gossypium raimondii])

HSP 1 Score: 986 bits (2550), Expect = 0.0
Identity = 476/942 (50.53%), Positives = 653/942 (69.32%), Query Frame = 0

Query: 19  QGHINPTLRLAKKLAAEGLLVTISTAVHFGKTLQKAGSIGAGDCPTPVGNGFIRFEFFED 78
           QGH+NP LRL K+LA++GL VT+ST   F + + +A +I   D P PVG+GF++F  FED
Sbjct: 4   QGHVNPLLRLGKRLASKGLFVTLSTPKGFAQKMAEANNI-TDDHPIPVGDGFLQFGSFED 63

Query: 79  RLQEINPKDMNLTRYNNQLELSGRPSLTGLIKNQTAENRPVSCLIVNPFFPWTCEVAKEL 138
              + +P+  +L +Y +QLEL+G+P+++ +I+    +NRPVSCLI NPF PW  +VA+ L
Sbjct: 64  GWDDDDPRRAHLDQYMHQLELAGKPAISAMIERYAEQNRPVSCLINNPFIPWASDVAESL 123

Query: 139 GIPCAVLWVQSCSVFSIYYHCFHKSVPFPSELEPKIDVHLPILPLLKNDEIPSFLHPNNI 198
           GIP A+LWVQSC+ F+ YYH  H  V FP+E +P+IDV LP +PLLK+DE+PSFLHP+  
Sbjct: 124 GIPSAMLWVQSCACFAAYYHYNHGLVTFPTETDPEIDVQLPSMPLLKHDEVPSFLHPSTP 183

Query: 199 YGVLGKVLLSQFSKLSIPFCILMDSFDELEKDIISYMSNIIPLKPIGPLFLNPQNVETEV 258
           +  L   +L QF KL   FC+LMD+F ELE +++ YMS    +K +GPLF  P+     +
Sbjct: 184 FAYLRTAILGQFKKLDKQFCVLMDTFQELEPEMVEYMSKFCLIKTVGPLFKYPEVPNNTI 243

Query: 259 SADCLKAEDCMEWLNSKPPQSVVYVSFGSIVYLKQEQIDELAYGLCNSGLSFLWVMKPPN 318
             D +K +DC+EWL+SKP  SV+Y+SFG++VYLKQEQ+DE+A  L  +G+S+LWVMKPP 
Sbjct: 244 RCDIMKPDDCIEWLDSKPAASVIYISFGTVVYLKQEQVDEIAEALLATGISYLWVMKPPA 303

Query: 319 EALGLKGHILPEGVMEKAGERGKVVQWSSQERVLSHESVGCFMTHCGWNSSVEAIGCGVP 378
           +  GL  H LPEG +EK G+ GKVVQWS Q++VL H SV CF++HCGWNS++EA+ CGVP
Sbjct: 304 KESGLPIHTLPEGFLEKVGDNGKVVQWSPQDKVLIHPSVSCFVSHCGWNSTMEALSCGVP 363

Query: 379 VVAFPQWGDQVTNAKFLVEDYGVGVRLSRGAEANELISRDEIVRCISEVMTRDSSGGEFR 438
           +VAFPQWGDQVTNA +LV+ +  GVR+ RG   N +I ++E+ +C  E  T      + +
Sbjct: 364 IVAFPQWGDQVTNAVYLVDVFKTGVRMGRGEAENRIIPKEEVAKCFVEA-TVGPKAKDLK 423

Query: 439 RNALKLKQAAAAAVVDGGSSHKNIQH-----SKACL-------------------PHVFL 498
           RNALK K AA A +  GGSS +NIQ       + C                    P V +
Sbjct: 424 RNALKWKAAAEAVMAGGGSSDRNIQAFINEVRRRCTSTDNDAATMNFVNKHSPTEPTVVI 483

Query: 499 VSF------PGQGHINPMLRLGKKLAAAGLLVTFSTSVQLGSQMKNAGSISDHPT-PLGD 558
            +         QGH+NP+LRLGK+LA+ GLL+T ST    G QM  A +I+D    P+GD
Sbjct: 484 NALIASALRKAQGHVNPLLRLGKRLASKGLLITLSTPKVFGKQMAKANNITDDQLIPVGD 543

Query: 559 GFLRFEFFDDGRTDTTPTLTY-DEYMVQLQRLGAISLRQILENQMKENRPVSCVIGNPFV 618
           GFLRFE F DG  D  P   + D+YM QL+  G  ++  +++   ++NRPVSC+I NP++
Sbjct: 544 GFLRFESFQDGWDDDDPRRAHLDQYMHQLELAGKPAISAMIKRYAEQNRPVSCLINNPYI 603

Query: 619 PWVIDLADNLGISSAVFWVQSCSVFSVYYHHFRGAVPFPSQTQPNLDVKLPFLPLLKSDE 678
           PW  D+A++LGI SA+ WVQSC+ F+ YYH+  G VPFP++T P +DV+LP +PLLK DE
Sbjct: 604 PWASDVAESLGIPSAMLWVQSCACFAAYYHYNHGLVPFPTETDPEIDVQLPSMPLLKHDE 663

Query: 679 IPSFLTPNDSHQAIGKDILRQFSNLSKPFCILMDTFEELEAEVINDMSKNFPIKAVGPLF 738
           +PS+L P+     +   IL QF  L KPFC+L+DTF+ELE E++  MSK   IK VGPL 
Sbjct: 664 VPSYLRPSTPFAFLRTAILGQFKKLDKPFCVLIDTFQELEPEIVEYMSKFCLIKTVGPLV 723

Query: 739 KICSEMETKIRGDCMKAADECIEWLDSKPIGSVVYVSFGSVVFLKQDQIDEIAYALHSSG 798
           K      + IR D MK  D+CIEWLDSKP  SV+Y+SFG+VV+LKQ+Q+DEIA AL ++G
Sbjct: 724 KYPEVPNSTIRCDMMKP-DDCIEWLDSKPASSVIYISFGTVVYLKQEQVDEIAKALLATG 783

Query: 799 FSFLWVLKPPSVHLGADRHVLPLEVVEEMGERGKVVEWSPQEQVLSHPSLACFLTHCGWN 858
            SFLWV+KPP+   G   H LP   +E++G+ GK++ WSPQ +VL+HPS++CF++HCGWN
Sbjct: 784 ISFLWVMKPPAKEFGLPFHTLPEGFLEKVGDNGKILLWSPQVKVLTHPSISCFMSHCGWN 843

Query: 859 SSVEAMSLGVPMVAFPQWGDQVTNAKFLVDVFGVGLRLSRGANEDRLIQRDEIETCLREA 918
           S +E +S GVP++AFPQWGDQVTNA +LVDVF  GLR+ RG  +  +  ++E+  C  EA
Sbjct: 844 SVLETLSCGVPIIAFPQWGDQVTNAVYLVDVFKTGLRMGRGKGKKGITPKEEVAKCFVEA 903

Query: 919 MEGPRAVEIRQNALKQQKAAEKAVADGGSSDRNIKDFIDEIR 928
             G +A +++ NALK + AAE+A+ADGGSSDRN++ FIDE++
Sbjct: 904 TLGLKAKDLKSNALKWKLAAEEAIADGGSSDRNMQTFIDEVK 942

BLAST of Cp4.1LG04g07760 vs. NCBI nr
Match: KAB2042771.1 (hypothetical protein ES319_D02G239900v1 [Gossypium barbadense])

HSP 1 Score: 983 bits (2542), Expect = 0.0
Identity = 476/942 (50.53%), Positives = 652/942 (69.21%), Query Frame = 0

Query: 19  QGHINPTLRLAKKLAAEGLLVTISTAVHFGKTLQKAGSIGAGDCPTPVGNGFIRFEFFED 78
           QGH+NP LRL K+LA++GL VT+ST   F + + +A +I   D P PVG+GF++F  FED
Sbjct: 4   QGHVNPLLRLGKRLASKGLFVTLSTPKGFAQKMAEANNI-TDDHPIPVGDGFLQFGSFED 63

Query: 79  RLQEINPKDMNLTRYNNQLELSGRPSLTGLIKNQTAENRPVSCLIVNPFFPWTCEVAKEL 138
              + +P+  +L +Y +QLEL+G+P+++ +I+    +NRPVSCLI NPF PW  +VA+ L
Sbjct: 64  GWDDDDPRRAHLDQYMHQLELAGKPAISAMIERYAEQNRPVSCLINNPFIPWASDVAESL 123

Query: 139 GIPCAVLWVQSCSVFSIYYHCFHKSVPFPSELEPKIDVHLPILPLLKNDEIPSFLHPNNI 198
           GIP A+LWVQSC+ F+ YYH  H  V FP+E +P+IDV LP +PLLK+DE+PSFLHP+  
Sbjct: 124 GIPSAMLWVQSCACFAAYYHYNHGLVKFPTETDPEIDVQLPSMPLLKHDEVPSFLHPSTP 183

Query: 199 YGVLGKVLLSQFSKLSIPFCILMDSFDELEKDIISYMSNIIPLKPIGPLFLNPQNVETEV 258
           +  L   +L QF KL   FC+LMD+F ELE +I+ YMS    +K +G LF  P+     +
Sbjct: 184 FAFLRTAILGQFKKLDKQFCVLMDTFQELEPEIVEYMSKFCLIKTVGSLFKYPEVPNNTI 243

Query: 259 SADCLKAEDCMEWLNSKPPQSVVYVSFGSIVYLKQEQIDELAYGLCNSGLSFLWVMKPPN 318
             D +K +DC+EWL+SKP  SV+Y+SFG++VYLKQEQ+DE+A  L  +G+S+LWVMKPP 
Sbjct: 244 RCDIMKPDDCIEWLDSKPAASVIYISFGTVVYLKQEQVDEIAEALLATGISYLWVMKPPA 303

Query: 319 EALGLKGHILPEGVMEKAGERGKVVQWSSQERVLSHESVGCFMTHCGWNSSVEAIGCGVP 378
           +  GL  H LPEG +EK G+ GKVVQWS Q++VL H SV CF++HCGWNS++EA+ CGVP
Sbjct: 304 KESGLPIHTLPEGFLEKVGDNGKVVQWSPQDKVLIHPSVSCFVSHCGWNSTMEALSCGVP 363

Query: 379 VVAFPQWGDQVTNAKFLVEDYGVGVRLSRGAEANELISRDEIVRCISEVMTRDSSGGEFR 438
           +VAFPQWGDQVTNA +LV+ +  GVR+  G   N +I ++E+ +C  E  T      + +
Sbjct: 364 IVAFPQWGDQVTNAVYLVDVFKTGVRMGGGEAENRIIPKEEVAKCFVEA-TVGPKAKDLK 423

Query: 439 RNALKLKQAAAAAVVDGGSSHKNIQH-----SKACL-------------------PHVFL 498
           RNALK K AA AA+  GGSS +NIQ       + C                    P V +
Sbjct: 424 RNALKWKAAAEAAMAGGGSSDRNIQAFINEVRRRCTSTDNDAATMNFVNKHSPTEPTVVI 483

Query: 499 VSF------PGQGHINPMLRLGKKLAAAGLLVTFSTSVQLGSQMKNAGSISDHPT-PLGD 558
            +         QGH+NP+LRLGK+LA+ GLL+T ST    G QM  A +I+D    P+GD
Sbjct: 484 NALIASALRKAQGHVNPLLRLGKRLASKGLLITLSTPKVFGKQMAKANNITDDQLIPVGD 543

Query: 559 GFLRFEFFDDGRTDTTPTLTY-DEYMVQLQRLGAISLRQILENQMKENRPVSCVIGNPFV 618
           GFLRFE F DG  D  P   + D+YM QL+  G  ++  +++   ++NRPVSC+I NP++
Sbjct: 544 GFLRFESFQDGWDDDDPRRAHLDQYMHQLELAGKPAISAMIKRYAEQNRPVSCLINNPYI 603

Query: 619 PWVIDLADNLGISSAVFWVQSCSVFSVYYHHFRGAVPFPSQTQPNLDVKLPFLPLLKSDE 678
           PW  D+A++LGI SA+ WVQSC+ F+ YYH+  G VPFP++T P +DV+LP +PLLK DE
Sbjct: 604 PWASDVAESLGIPSAMLWVQSCACFAAYYHYNHGLVPFPTETDPEIDVQLPSMPLLKHDE 663

Query: 679 IPSFLTPNDSHQAIGKDILRQFSNLSKPFCILMDTFEELEAEVINDMSKNFPIKAVGPLF 738
           +PS+L P+     +   IL QF  L KPFC+L+DTF+ELE E++  MSK   IK VGPL 
Sbjct: 664 VPSYLRPSTPFAFLRTAILGQFKKLDKPFCVLIDTFQELEPEIVEYMSKFCLIKTVGPLV 723

Query: 739 KICSEMETKIRGDCMKAADECIEWLDSKPIGSVVYVSFGSVVFLKQDQIDEIAYALHSSG 798
           K      + IR D MK  D+CIEWLDSKP  SV+Y+SFG+VV+LKQ+Q+DEIA AL ++G
Sbjct: 724 KYPEVPNSTIRCDMMKP-DDCIEWLDSKPASSVIYISFGTVVYLKQEQVDEIAKALLATG 783

Query: 799 FSFLWVLKPPSVHLGADRHVLPLEVVEEMGERGKVVEWSPQEQVLSHPSLACFLTHCGWN 858
            SFLWV+KPP+   G   H LP   +E++G+ GK++ WSPQ +VL+HPS++CF++HCGWN
Sbjct: 784 ISFLWVMKPPAKEFGLPFHTLPEGFLEKVGDNGKILLWSPQVKVLTHPSISCFMSHCGWN 843

Query: 859 SSVEAMSLGVPMVAFPQWGDQVTNAKFLVDVFGVGLRLSRGANEDRLIQRDEIETCLREA 918
           S +E +S GVP++AFPQWGDQVTNA +LVDVF  GLR+ RG  +  +  ++E+  C  EA
Sbjct: 844 SVLETLSCGVPIIAFPQWGDQVTNAVYLVDVFKTGLRMGRGKGKKGITPKEEVAKCFVEA 903

Query: 919 MEGPRAVEIRQNALKQQKAAEKAVADGGSSDRNIKDFIDEIR 928
             G +A +++ NALK + AAE+A+ADGGSSDRN++ FIDE++
Sbjct: 904 TLGLKAKDLKSNALKWKLAAEEAIADGGSSDRNMQTFIDEVK 942

BLAST of Cp4.1LG04g07760 vs. NCBI nr
Match: XP_023530167.1 (putative UDP-glucose glucosyltransferase [Cucurbita pepo subsp. pepo])

HSP 1 Score: 952 bits (2462), Expect = 0.0
Identity = 463/463 (100.00%), Positives = 463/463 (100.00%), Query Frame = 0

Query: 1   MMNSEEAPCHVFLVCYPSQGHINPTLRLAKKLAAEGLLVTISTAVHFGKTLQKAGSIGAG 60
           MMNSEEAPCHVFLVCYPSQGHINPTLRLAKKLAAEGLLVTISTAVHFGKTLQKAGSIGAG
Sbjct: 1   MMNSEEAPCHVFLVCYPSQGHINPTLRLAKKLAAEGLLVTISTAVHFGKTLQKAGSIGAG 60

Query: 61  DCPTPVGNGFIRFEFFEDRLQEINPKDMNLTRYNNQLELSGRPSLTGLIKNQTAENRPVS 120
           DCPTPVGNGFIRFEFFEDRLQEINPKDMNLTRYNNQLELSGRPSLTGLIKNQTAENRPVS
Sbjct: 61  DCPTPVGNGFIRFEFFEDRLQEINPKDMNLTRYNNQLELSGRPSLTGLIKNQTAENRPVS 120

Query: 121 CLIVNPFFPWTCEVAKELGIPCAVLWVQSCSVFSIYYHCFHKSVPFPSELEPKIDVHLPI 180
           CLIVNPFFPWTCEVAKELGIPCAVLWVQSCSVFSIYYHCFHKSVPFPSELEPKIDVHLPI
Sbjct: 121 CLIVNPFFPWTCEVAKELGIPCAVLWVQSCSVFSIYYHCFHKSVPFPSELEPKIDVHLPI 180

Query: 181 LPLLKNDEIPSFLHPNNIYGVLGKVLLSQFSKLSIPFCILMDSFDELEKDIISYMSNIIP 240
           LPLLKNDEIPSFLHPNNIYGVLGKVLLSQFSKLSIPFCILMDSFDELEKDIISYMSNIIP
Sbjct: 181 LPLLKNDEIPSFLHPNNIYGVLGKVLLSQFSKLSIPFCILMDSFDELEKDIISYMSNIIP 240

Query: 241 LKPIGPLFLNPQNVETEVSADCLKAEDCMEWLNSKPPQSVVYVSFGSIVYLKQEQIDELA 300
           LKPIGPLFLNPQNVETEVSADCLKAEDCMEWLNSKPPQSVVYVSFGSIVYLKQEQIDELA
Sbjct: 241 LKPIGPLFLNPQNVETEVSADCLKAEDCMEWLNSKPPQSVVYVSFGSIVYLKQEQIDELA 300

Query: 301 YGLCNSGLSFLWVMKPPNEALGLKGHILPEGVMEKAGERGKVVQWSSQERVLSHESVGCF 360
           YGLCNSGLSFLWVMKPPNEALGLKGHILPEGVMEKAGERGKVVQWSSQERVLSHESVGCF
Sbjct: 301 YGLCNSGLSFLWVMKPPNEALGLKGHILPEGVMEKAGERGKVVQWSSQERVLSHESVGCF 360

Query: 361 MTHCGWNSSVEAIGCGVPVVAFPQWGDQVTNAKFLVEDYGVGVRLSRGAEANELISRDEI 420
           MTHCGWNSSVEAIGCGVPVVAFPQWGDQVTNAKFLVEDYGVGVRLSRGAEANELISRDEI
Sbjct: 361 MTHCGWNSSVEAIGCGVPVVAFPQWGDQVTNAKFLVEDYGVGVRLSRGAEANELISRDEI 420

Query: 421 VRCISEVMTRDSSGGEFRRNALKLKQAAAAAVVDGGSSHKNIQ 463
           VRCISEVMTRDSSGGEFRRNALKLKQAAAAAVVDGGSSHKNIQ
Sbjct: 421 VRCISEVMTRDSSGGEFRRNALKLKQAAAAAVVDGGSSHKNIQ 463

BLAST of Cp4.1LG04g07760 vs. NCBI nr
Match: TYI94980.1 (hypothetical protein E1A91_D02G245100v1 [Gossypium mustelinum])

HSP 1 Score: 947 bits (2447), Expect = 0.0
Identity = 456/900 (50.67%), Positives = 623/900 (69.22%), Query Frame = 0

Query: 61  DCPTPVGNGFIRFEFFEDRLQEINPKDMNLTRYNNQLELSGRPSLTGLIKNQTAENRPVS 120
           D P PVG+GF++F  FED   + +P+  +L +Y +QLEL+G+P+++ +I+    +NRPVS
Sbjct: 10  DHPIPVGDGFLQFGSFEDGWDDDDPRRAHLDQYMHQLELAGKPAISAMIERYAEQNRPVS 69

Query: 121 CLIVNPFFPWTCEVAKELGIPCAVLWVQSCSVFSIYYHCFHKSVPFPSELEPKIDVHLPI 180
           CLI NPF PW  +VA+ LGIP A+LWVQSC+ F+ YYH  H  V FP+E +P+IDV LP 
Sbjct: 70  CLINNPFIPWASDVAESLGIPSAMLWVQSCACFAAYYHYNHGLVKFPTETDPEIDVQLPS 129

Query: 181 LPLLKNDEIPSFLHPNNIYGVLGKVLLSQFSKLSIPFCILMDSFDELEKDIISYMSNIIP 240
           +PLLK+DE+PSFLHP+  +  L   +L QF KL   FC+LMD+F ELE +I+ YMS    
Sbjct: 130 MPLLKHDEVPSFLHPSTPFAFLRTAILGQFKKLDKQFCVLMDTFQELEPEIVEYMSKFCL 189

Query: 241 LKPIGPLFLNPQNVETEVSADCLKAEDCMEWLNSKPPQSVVYVSFGSIVYLKQEQIDELA 300
           +K +GPLF  P+     +  D +K +DC+EWL+SKP  SV+Y+SFG++VYLKQEQ+DE+A
Sbjct: 190 IKTVGPLFKYPEVPNNTIRCDIMKPDDCIEWLDSKPAASVIYISFGTVVYLKQEQVDEIA 249

Query: 301 YGLCNSGLSFLWVMKPPNEALGLKGHILPEGVMEKAGERGKVVQWSSQERVLSHESVGCF 360
             L  +G+S+LWVMKPP +  GL  H LPEG +EK G+ GKVVQWS Q++VL H SV CF
Sbjct: 250 EALLATGISYLWVMKPPAKESGLPIHTLPEGFLEKVGDNGKVVQWSPQDKVLIHPSVSCF 309

Query: 361 MTHCGWNSSVEAIGCGVPVVAFPQWGDQVTNAKFLVEDYGVGVRLSRGAEANELISRDEI 420
           ++HCGWNS++EA+ CGVP+VAFPQWGDQVTNA +LV+ +  GVR+  G   N +I ++E+
Sbjct: 310 VSHCGWNSTMEALSCGVPIVAFPQWGDQVTNAVYLVDVFKTGVRMGGGEAENRIIPKEEV 369

Query: 421 VRCISEVMTRDSSGGEFRRNALKLKQAAAAAVVDGGSSHKNIQH-----SKACL------ 480
            +C  E  T      + +RNALK K AA AA+  GGSS +NIQ       + C       
Sbjct: 370 AKCFVEA-TVGPKAKDLKRNALKWKAAAEAAMAGGGSSDRNIQAFINEVRRRCTSTDNDA 429

Query: 481 -------------PHVFLVSF------PGQGHINPMLRLGKKLAAAGLLVTFSTSVQLGS 540
                        P V + +         QGH+NP+LRLGK+LA+ GLL+T ST    G 
Sbjct: 430 ATMNFVNKHSPTEPTVVINALIASALRKAQGHVNPLLRLGKRLASKGLLITLSTPKVFGK 489

Query: 541 QMKNAGSISDHPT-PLGDGFLRFEFFDDGRTDTTPTLTY-DEYMVQLQRLGAISLRQILE 600
           QM  A +I+D    P+GDGFLRFE F DG  D  P   + D+YM QL+  G  ++  +++
Sbjct: 490 QMAKANNITDDQLIPVGDGFLRFESFQDGWDDDDPRRAHLDQYMHQLELAGKPAISAMIK 549

Query: 601 NQMKENRPVSCVIGNPFVPWVIDLADNLGISSAVFWVQSCSVFSVYYHHFRGAVPFPSQT 660
              ++NRPVSC+I NP++PW  D+A++LGI SA+ WVQSC+ F+ YYH+  G VPFP++T
Sbjct: 550 RYAEQNRPVSCLINNPYIPWASDVAESLGIPSAMLWVQSCACFAAYYHYNHGLVPFPTET 609

Query: 661 QPNLDVKLPFLPLLKSDEIPSFLTPNDSHQAIGKDILRQFSNLSKPFCILMDTFEELEAE 720
            P +DV+LP +PLLK DE+PS+L P+     +   IL QF  L KPFC+L+DTF+ELE E
Sbjct: 610 DPEIDVQLPSMPLLKHDEVPSYLRPSTPFAFLRTAILGQFKKLDKPFCVLIDTFQELEPE 669

Query: 721 VINDMSKNFPIKAVGPLFKICSEMETKIRGDCMKAADECIEWLDSKPIGSVVYVSFGSVV 780
           ++  MSK   IK VGPL K      + IR D MK  D+CIEWLDSKP  SV+Y+SFG+V 
Sbjct: 670 IVEYMSKFCLIKTVGPLVKYPEVPNSTIRCDMMKP-DDCIEWLDSKPASSVIYISFGTVF 729

Query: 781 FLKQDQIDEIAYALHSSGFSFLWVLKPPSVHLGADRHVLPLEVVEEMGERGKVVEWSPQE 840
           +LKQ+Q+DEIA AL ++G SFLWV+KPP+   G   H LP   +E++G+ GK++ WSPQ 
Sbjct: 730 YLKQEQVDEIAKALLATGISFLWVMKPPAKEFGLPFHTLPEGFLEKVGDNGKILLWSPQV 789

Query: 841 QVLSHPSLACFLTHCGWNSSVEAMSLGVPMVAFPQWGDQVTNAKFLVDVFGVGLRLSRGA 900
           +VL+HPS++CF++HCGWNS +E +S GVP++AFPQWGDQVTNA +LVDVF  GLR+ RG 
Sbjct: 790 KVLTHPSISCFMSHCGWNSVLETLSCGVPIIAFPQWGDQVTNAVYLVDVFKTGLRMGRGK 849

Query: 901 NEDRLIQRDEIETCLREAMEGPRAVEIRQNALKQQKAAEKAVADGGSSDRNIKDFIDEIR 928
            +  +  ++E+  C  EA  G +A +++ NALK + AAE+A+ADGGSSDRN++ FIDE++
Sbjct: 850 GKKGITPKEEVAKCFVEATLGLKAKDLKSNALKWKLAAEEAIADGGSSDRNMQTFIDEVK 907

BLAST of Cp4.1LG04g07760 vs. NCBI nr
Match: XP_023532006.1 (gallate 1-beta-glucosyltransferase-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 942 bits (2436), Expect = 0.0
Identity = 465/465 (100.00%), Positives = 465/465 (100.00%), Query Frame = 0

Query: 465 SKACLPHVFLVSFPGQGHINPMLRLGKKLAAAGLLVTFSTSVQLGSQMKNAGSISDHPTP 524
           SKACLPHVFLVSFPGQGHINPMLRLGKKLAAAGLLVTFSTSVQLGSQMKNAGSISDHPTP
Sbjct: 3   SKACLPHVFLVSFPGQGHINPMLRLGKKLAAAGLLVTFSTSVQLGSQMKNAGSISDHPTP 62

Query: 525 LGDGFLRFEFFDDGRTDTTPTLTYDEYMVQLQRLGAISLRQILENQMKENRPVSCVIGNP 584
           LGDGFLRFEFFDDGRTDTTPTLTYDEYMVQLQRLGAISLRQILENQMKENRPVSCVIGNP
Sbjct: 63  LGDGFLRFEFFDDGRTDTTPTLTYDEYMVQLQRLGAISLRQILENQMKENRPVSCVIGNP 122

Query: 585 FVPWVIDLADNLGISSAVFWVQSCSVFSVYYHHFRGAVPFPSQTQPNLDVKLPFLPLLKS 644
           FVPWVIDLADNLGISSAVFWVQSCSVFSVYYHHFRGAVPFPSQTQPNLDVKLPFLPLLKS
Sbjct: 123 FVPWVIDLADNLGISSAVFWVQSCSVFSVYYHHFRGAVPFPSQTQPNLDVKLPFLPLLKS 182

Query: 645 DEIPSFLTPNDSHQAIGKDILRQFSNLSKPFCILMDTFEELEAEVINDMSKNFPIKAVGP 704
           DEIPSFLTPNDSHQAIGKDILRQFSNLSKPFCILMDTFEELEAEVINDMSKNFPIKAVGP
Sbjct: 183 DEIPSFLTPNDSHQAIGKDILRQFSNLSKPFCILMDTFEELEAEVINDMSKNFPIKAVGP 242

Query: 705 LFKICSEMETKIRGDCMKAADECIEWLDSKPIGSVVYVSFGSVVFLKQDQIDEIAYALHS 764
           LFKICSEMETKIRGDCMKAADECIEWLDSKPIGSVVYVSFGSVVFLKQDQIDEIAYALHS
Sbjct: 243 LFKICSEMETKIRGDCMKAADECIEWLDSKPIGSVVYVSFGSVVFLKQDQIDEIAYALHS 302

Query: 765 SGFSFLWVLKPPSVHLGADRHVLPLEVVEEMGERGKVVEWSPQEQVLSHPSLACFLTHCG 824
           SGFSFLWVLKPPSVHLGADRHVLPLEVVEEMGERGKVVEWSPQEQVLSHPSLACFLTHCG
Sbjct: 303 SGFSFLWVLKPPSVHLGADRHVLPLEVVEEMGERGKVVEWSPQEQVLSHPSLACFLTHCG 362

Query: 825 WNSSVEAMSLGVPMVAFPQWGDQVTNAKFLVDVFGVGLRLSRGANEDRLIQRDEIETCLR 884
           WNSSVEAMSLGVPMVAFPQWGDQVTNAKFLVDVFGVGLRLSRGANEDRLIQRDEIETCLR
Sbjct: 363 WNSSVEAMSLGVPMVAFPQWGDQVTNAKFLVDVFGVGLRLSRGANEDRLIQRDEIETCLR 422

Query: 885 EAMEGPRAVEIRQNALKQQKAAEKAVADGGSSDRNIKDFIDEIRK 929
           EAMEGPRAVEIRQNALKQQKAAEKAVADGGSSDRNIKDFIDEIRK
Sbjct: 423 EAMEGPRAVEIRQNALKQQKAAEKAVADGGSSDRNIKDFIDEIRK 467

BLAST of Cp4.1LG04g07760 vs. ExPASy TrEMBL
Match: A0A0D2RKC6 (Uncharacterized protein OS=Gossypium raimondii OX=29730 GN=B456_005G234600 PE=3 SV=1)

HSP 1 Score: 986 bits (2550), Expect = 0.0
Identity = 476/942 (50.53%), Positives = 653/942 (69.32%), Query Frame = 0

Query: 19  QGHINPTLRLAKKLAAEGLLVTISTAVHFGKTLQKAGSIGAGDCPTPVGNGFIRFEFFED 78
           QGH+NP LRL K+LA++GL VT+ST   F + + +A +I   D P PVG+GF++F  FED
Sbjct: 4   QGHVNPLLRLGKRLASKGLFVTLSTPKGFAQKMAEANNI-TDDHPIPVGDGFLQFGSFED 63

Query: 79  RLQEINPKDMNLTRYNNQLELSGRPSLTGLIKNQTAENRPVSCLIVNPFFPWTCEVAKEL 138
              + +P+  +L +Y +QLEL+G+P+++ +I+    +NRPVSCLI NPF PW  +VA+ L
Sbjct: 64  GWDDDDPRRAHLDQYMHQLELAGKPAISAMIERYAEQNRPVSCLINNPFIPWASDVAESL 123

Query: 139 GIPCAVLWVQSCSVFSIYYHCFHKSVPFPSELEPKIDVHLPILPLLKNDEIPSFLHPNNI 198
           GIP A+LWVQSC+ F+ YYH  H  V FP+E +P+IDV LP +PLLK+DE+PSFLHP+  
Sbjct: 124 GIPSAMLWVQSCACFAAYYHYNHGLVTFPTETDPEIDVQLPSMPLLKHDEVPSFLHPSTP 183

Query: 199 YGVLGKVLLSQFSKLSIPFCILMDSFDELEKDIISYMSNIIPLKPIGPLFLNPQNVETEV 258
           +  L   +L QF KL   FC+LMD+F ELE +++ YMS    +K +GPLF  P+     +
Sbjct: 184 FAYLRTAILGQFKKLDKQFCVLMDTFQELEPEMVEYMSKFCLIKTVGPLFKYPEVPNNTI 243

Query: 259 SADCLKAEDCMEWLNSKPPQSVVYVSFGSIVYLKQEQIDELAYGLCNSGLSFLWVMKPPN 318
             D +K +DC+EWL+SKP  SV+Y+SFG++VYLKQEQ+DE+A  L  +G+S+LWVMKPP 
Sbjct: 244 RCDIMKPDDCIEWLDSKPAASVIYISFGTVVYLKQEQVDEIAEALLATGISYLWVMKPPA 303

Query: 319 EALGLKGHILPEGVMEKAGERGKVVQWSSQERVLSHESVGCFMTHCGWNSSVEAIGCGVP 378
           +  GL  H LPEG +EK G+ GKVVQWS Q++VL H SV CF++HCGWNS++EA+ CGVP
Sbjct: 304 KESGLPIHTLPEGFLEKVGDNGKVVQWSPQDKVLIHPSVSCFVSHCGWNSTMEALSCGVP 363

Query: 379 VVAFPQWGDQVTNAKFLVEDYGVGVRLSRGAEANELISRDEIVRCISEVMTRDSSGGEFR 438
           +VAFPQWGDQVTNA +LV+ +  GVR+ RG   N +I ++E+ +C  E  T      + +
Sbjct: 364 IVAFPQWGDQVTNAVYLVDVFKTGVRMGRGEAENRIIPKEEVAKCFVEA-TVGPKAKDLK 423

Query: 439 RNALKLKQAAAAAVVDGGSSHKNIQH-----SKACL-------------------PHVFL 498
           RNALK K AA A +  GGSS +NIQ       + C                    P V +
Sbjct: 424 RNALKWKAAAEAVMAGGGSSDRNIQAFINEVRRRCTSTDNDAATMNFVNKHSPTEPTVVI 483

Query: 499 VSF------PGQGHINPMLRLGKKLAAAGLLVTFSTSVQLGSQMKNAGSISDHPT-PLGD 558
            +         QGH+NP+LRLGK+LA+ GLL+T ST    G QM  A +I+D    P+GD
Sbjct: 484 NALIASALRKAQGHVNPLLRLGKRLASKGLLITLSTPKVFGKQMAKANNITDDQLIPVGD 543

Query: 559 GFLRFEFFDDGRTDTTPTLTY-DEYMVQLQRLGAISLRQILENQMKENRPVSCVIGNPFV 618
           GFLRFE F DG  D  P   + D+YM QL+  G  ++  +++   ++NRPVSC+I NP++
Sbjct: 544 GFLRFESFQDGWDDDDPRRAHLDQYMHQLELAGKPAISAMIKRYAEQNRPVSCLINNPYI 603

Query: 619 PWVIDLADNLGISSAVFWVQSCSVFSVYYHHFRGAVPFPSQTQPNLDVKLPFLPLLKSDE 678
           PW  D+A++LGI SA+ WVQSC+ F+ YYH+  G VPFP++T P +DV+LP +PLLK DE
Sbjct: 604 PWASDVAESLGIPSAMLWVQSCACFAAYYHYNHGLVPFPTETDPEIDVQLPSMPLLKHDE 663

Query: 679 IPSFLTPNDSHQAIGKDILRQFSNLSKPFCILMDTFEELEAEVINDMSKNFPIKAVGPLF 738
           +PS+L P+     +   IL QF  L KPFC+L+DTF+ELE E++  MSK   IK VGPL 
Sbjct: 664 VPSYLRPSTPFAFLRTAILGQFKKLDKPFCVLIDTFQELEPEIVEYMSKFCLIKTVGPLV 723

Query: 739 KICSEMETKIRGDCMKAADECIEWLDSKPIGSVVYVSFGSVVFLKQDQIDEIAYALHSSG 798
           K      + IR D MK  D+CIEWLDSKP  SV+Y+SFG+VV+LKQ+Q+DEIA AL ++G
Sbjct: 724 KYPEVPNSTIRCDMMKP-DDCIEWLDSKPASSVIYISFGTVVYLKQEQVDEIAKALLATG 783

Query: 799 FSFLWVLKPPSVHLGADRHVLPLEVVEEMGERGKVVEWSPQEQVLSHPSLACFLTHCGWN 858
            SFLWV+KPP+   G   H LP   +E++G+ GK++ WSPQ +VL+HPS++CF++HCGWN
Sbjct: 784 ISFLWVMKPPAKEFGLPFHTLPEGFLEKVGDNGKILLWSPQVKVLTHPSISCFMSHCGWN 843

Query: 859 SSVEAMSLGVPMVAFPQWGDQVTNAKFLVDVFGVGLRLSRGANEDRLIQRDEIETCLREA 918
           S +E +S GVP++AFPQWGDQVTNA +LVDVF  GLR+ RG  +  +  ++E+  C  EA
Sbjct: 844 SVLETLSCGVPIIAFPQWGDQVTNAVYLVDVFKTGLRMGRGKGKKGITPKEEVAKCFVEA 903

Query: 919 MEGPRAVEIRQNALKQQKAAEKAVADGGSSDRNIKDFIDEIR 928
             G +A +++ NALK + AAE+A+ADGGSSDRN++ FIDE++
Sbjct: 904 TLGLKAKDLKSNALKWKLAAEEAIADGGSSDRNMQTFIDEVK 942

BLAST of Cp4.1LG04g07760 vs. ExPASy TrEMBL
Match: A0A5J5SMY2 (Uncharacterized protein OS=Gossypium barbadense OX=3634 GN=ES319_D02G239900v1 PE=3 SV=1)

HSP 1 Score: 983 bits (2542), Expect = 0.0
Identity = 476/942 (50.53%), Positives = 652/942 (69.21%), Query Frame = 0

Query: 19  QGHINPTLRLAKKLAAEGLLVTISTAVHFGKTLQKAGSIGAGDCPTPVGNGFIRFEFFED 78
           QGH+NP LRL K+LA++GL VT+ST   F + + +A +I   D P PVG+GF++F  FED
Sbjct: 4   QGHVNPLLRLGKRLASKGLFVTLSTPKGFAQKMAEANNI-TDDHPIPVGDGFLQFGSFED 63

Query: 79  RLQEINPKDMNLTRYNNQLELSGRPSLTGLIKNQTAENRPVSCLIVNPFFPWTCEVAKEL 138
              + +P+  +L +Y +QLEL+G+P+++ +I+    +NRPVSCLI NPF PW  +VA+ L
Sbjct: 64  GWDDDDPRRAHLDQYMHQLELAGKPAISAMIERYAEQNRPVSCLINNPFIPWASDVAESL 123

Query: 139 GIPCAVLWVQSCSVFSIYYHCFHKSVPFPSELEPKIDVHLPILPLLKNDEIPSFLHPNNI 198
           GIP A+LWVQSC+ F+ YYH  H  V FP+E +P+IDV LP +PLLK+DE+PSFLHP+  
Sbjct: 124 GIPSAMLWVQSCACFAAYYHYNHGLVKFPTETDPEIDVQLPSMPLLKHDEVPSFLHPSTP 183

Query: 199 YGVLGKVLLSQFSKLSIPFCILMDSFDELEKDIISYMSNIIPLKPIGPLFLNPQNVETEV 258
           +  L   +L QF KL   FC+LMD+F ELE +I+ YMS    +K +G LF  P+     +
Sbjct: 184 FAFLRTAILGQFKKLDKQFCVLMDTFQELEPEIVEYMSKFCLIKTVGSLFKYPEVPNNTI 243

Query: 259 SADCLKAEDCMEWLNSKPPQSVVYVSFGSIVYLKQEQIDELAYGLCNSGLSFLWVMKPPN 318
             D +K +DC+EWL+SKP  SV+Y+SFG++VYLKQEQ+DE+A  L  +G+S+LWVMKPP 
Sbjct: 244 RCDIMKPDDCIEWLDSKPAASVIYISFGTVVYLKQEQVDEIAEALLATGISYLWVMKPPA 303

Query: 319 EALGLKGHILPEGVMEKAGERGKVVQWSSQERVLSHESVGCFMTHCGWNSSVEAIGCGVP 378
           +  GL  H LPEG +EK G+ GKVVQWS Q++VL H SV CF++HCGWNS++EA+ CGVP
Sbjct: 304 KESGLPIHTLPEGFLEKVGDNGKVVQWSPQDKVLIHPSVSCFVSHCGWNSTMEALSCGVP 363

Query: 379 VVAFPQWGDQVTNAKFLVEDYGVGVRLSRGAEANELISRDEIVRCISEVMTRDSSGGEFR 438
           +VAFPQWGDQVTNA +LV+ +  GVR+  G   N +I ++E+ +C  E  T      + +
Sbjct: 364 IVAFPQWGDQVTNAVYLVDVFKTGVRMGGGEAENRIIPKEEVAKCFVEA-TVGPKAKDLK 423

Query: 439 RNALKLKQAAAAAVVDGGSSHKNIQH-----SKACL-------------------PHVFL 498
           RNALK K AA AA+  GGSS +NIQ       + C                    P V +
Sbjct: 424 RNALKWKAAAEAAMAGGGSSDRNIQAFINEVRRRCTSTDNDAATMNFVNKHSPTEPTVVI 483

Query: 499 VSF------PGQGHINPMLRLGKKLAAAGLLVTFSTSVQLGSQMKNAGSISDHPT-PLGD 558
            +         QGH+NP+LRLGK+LA+ GLL+T ST    G QM  A +I+D    P+GD
Sbjct: 484 NALIASALRKAQGHVNPLLRLGKRLASKGLLITLSTPKVFGKQMAKANNITDDQLIPVGD 543

Query: 559 GFLRFEFFDDGRTDTTPTLTY-DEYMVQLQRLGAISLRQILENQMKENRPVSCVIGNPFV 618
           GFLRFE F DG  D  P   + D+YM QL+  G  ++  +++   ++NRPVSC+I NP++
Sbjct: 544 GFLRFESFQDGWDDDDPRRAHLDQYMHQLELAGKPAISAMIKRYAEQNRPVSCLINNPYI 603

Query: 619 PWVIDLADNLGISSAVFWVQSCSVFSVYYHHFRGAVPFPSQTQPNLDVKLPFLPLLKSDE 678
           PW  D+A++LGI SA+ WVQSC+ F+ YYH+  G VPFP++T P +DV+LP +PLLK DE
Sbjct: 604 PWASDVAESLGIPSAMLWVQSCACFAAYYHYNHGLVPFPTETDPEIDVQLPSMPLLKHDE 663

Query: 679 IPSFLTPNDSHQAIGKDILRQFSNLSKPFCILMDTFEELEAEVINDMSKNFPIKAVGPLF 738
           +PS+L P+     +   IL QF  L KPFC+L+DTF+ELE E++  MSK   IK VGPL 
Sbjct: 664 VPSYLRPSTPFAFLRTAILGQFKKLDKPFCVLIDTFQELEPEIVEYMSKFCLIKTVGPLV 723

Query: 739 KICSEMETKIRGDCMKAADECIEWLDSKPIGSVVYVSFGSVVFLKQDQIDEIAYALHSSG 798
           K      + IR D MK  D+CIEWLDSKP  SV+Y+SFG+VV+LKQ+Q+DEIA AL ++G
Sbjct: 724 KYPEVPNSTIRCDMMKP-DDCIEWLDSKPASSVIYISFGTVVYLKQEQVDEIAKALLATG 783

Query: 799 FSFLWVLKPPSVHLGADRHVLPLEVVEEMGERGKVVEWSPQEQVLSHPSLACFLTHCGWN 858
            SFLWV+KPP+   G   H LP   +E++G+ GK++ WSPQ +VL+HPS++CF++HCGWN
Sbjct: 784 ISFLWVMKPPAKEFGLPFHTLPEGFLEKVGDNGKILLWSPQVKVLTHPSISCFMSHCGWN 843

Query: 859 SSVEAMSLGVPMVAFPQWGDQVTNAKFLVDVFGVGLRLSRGANEDRLIQRDEIETCLREA 918
           S +E +S GVP++AFPQWGDQVTNA +LVDVF  GLR+ RG  +  +  ++E+  C  EA
Sbjct: 844 SVLETLSCGVPIIAFPQWGDQVTNAVYLVDVFKTGLRMGRGKGKKGITPKEEVAKCFVEA 903

Query: 919 MEGPRAVEIRQNALKQQKAAEKAVADGGSSDRNIKDFIDEIR 928
             G +A +++ NALK + AAE+A+ADGGSSDRN++ FIDE++
Sbjct: 904 TLGLKAKDLKSNALKWKLAAEEAIADGGSSDRNMQTFIDEVK 942

BLAST of Cp4.1LG04g07760 vs. ExPASy TrEMBL
Match: A0A5D2W1U2 (Uncharacterized protein OS=Gossypium mustelinum OX=34275 GN=E1A91_D02G245100v1 PE=3 SV=1)

HSP 1 Score: 947 bits (2447), Expect = 0.0
Identity = 456/900 (50.67%), Positives = 623/900 (69.22%), Query Frame = 0

Query: 61  DCPTPVGNGFIRFEFFEDRLQEINPKDMNLTRYNNQLELSGRPSLTGLIKNQTAENRPVS 120
           D P PVG+GF++F  FED   + +P+  +L +Y +QLEL+G+P+++ +I+    +NRPVS
Sbjct: 10  DHPIPVGDGFLQFGSFEDGWDDDDPRRAHLDQYMHQLELAGKPAISAMIERYAEQNRPVS 69

Query: 121 CLIVNPFFPWTCEVAKELGIPCAVLWVQSCSVFSIYYHCFHKSVPFPSELEPKIDVHLPI 180
           CLI NPF PW  +VA+ LGIP A+LWVQSC+ F+ YYH  H  V FP+E +P+IDV LP 
Sbjct: 70  CLINNPFIPWASDVAESLGIPSAMLWVQSCACFAAYYHYNHGLVKFPTETDPEIDVQLPS 129

Query: 181 LPLLKNDEIPSFLHPNNIYGVLGKVLLSQFSKLSIPFCILMDSFDELEKDIISYMSNIIP 240
           +PLLK+DE+PSFLHP+  +  L   +L QF KL   FC+LMD+F ELE +I+ YMS    
Sbjct: 130 MPLLKHDEVPSFLHPSTPFAFLRTAILGQFKKLDKQFCVLMDTFQELEPEIVEYMSKFCL 189

Query: 241 LKPIGPLFLNPQNVETEVSADCLKAEDCMEWLNSKPPQSVVYVSFGSIVYLKQEQIDELA 300
           +K +GPLF  P+     +  D +K +DC+EWL+SKP  SV+Y+SFG++VYLKQEQ+DE+A
Sbjct: 190 IKTVGPLFKYPEVPNNTIRCDIMKPDDCIEWLDSKPAASVIYISFGTVVYLKQEQVDEIA 249

Query: 301 YGLCNSGLSFLWVMKPPNEALGLKGHILPEGVMEKAGERGKVVQWSSQERVLSHESVGCF 360
             L  +G+S+LWVMKPP +  GL  H LPEG +EK G+ GKVVQWS Q++VL H SV CF
Sbjct: 250 EALLATGISYLWVMKPPAKESGLPIHTLPEGFLEKVGDNGKVVQWSPQDKVLIHPSVSCF 309

Query: 361 MTHCGWNSSVEAIGCGVPVVAFPQWGDQVTNAKFLVEDYGVGVRLSRGAEANELISRDEI 420
           ++HCGWNS++EA+ CGVP+VAFPQWGDQVTNA +LV+ +  GVR+  G   N +I ++E+
Sbjct: 310 VSHCGWNSTMEALSCGVPIVAFPQWGDQVTNAVYLVDVFKTGVRMGGGEAENRIIPKEEV 369

Query: 421 VRCISEVMTRDSSGGEFRRNALKLKQAAAAAVVDGGSSHKNIQH-----SKACL------ 480
            +C  E  T      + +RNALK K AA AA+  GGSS +NIQ       + C       
Sbjct: 370 AKCFVEA-TVGPKAKDLKRNALKWKAAAEAAMAGGGSSDRNIQAFINEVRRRCTSTDNDA 429

Query: 481 -------------PHVFLVSF------PGQGHINPMLRLGKKLAAAGLLVTFSTSVQLGS 540
                        P V + +         QGH+NP+LRLGK+LA+ GLL+T ST    G 
Sbjct: 430 ATMNFVNKHSPTEPTVVINALIASALRKAQGHVNPLLRLGKRLASKGLLITLSTPKVFGK 489

Query: 541 QMKNAGSISDHPT-PLGDGFLRFEFFDDGRTDTTPTLTY-DEYMVQLQRLGAISLRQILE 600
           QM  A +I+D    P+GDGFLRFE F DG  D  P   + D+YM QL+  G  ++  +++
Sbjct: 490 QMAKANNITDDQLIPVGDGFLRFESFQDGWDDDDPRRAHLDQYMHQLELAGKPAISAMIK 549

Query: 601 NQMKENRPVSCVIGNPFVPWVIDLADNLGISSAVFWVQSCSVFSVYYHHFRGAVPFPSQT 660
              ++NRPVSC+I NP++PW  D+A++LGI SA+ WVQSC+ F+ YYH+  G VPFP++T
Sbjct: 550 RYAEQNRPVSCLINNPYIPWASDVAESLGIPSAMLWVQSCACFAAYYHYNHGLVPFPTET 609

Query: 661 QPNLDVKLPFLPLLKSDEIPSFLTPNDSHQAIGKDILRQFSNLSKPFCILMDTFEELEAE 720
            P +DV+LP +PLLK DE+PS+L P+     +   IL QF  L KPFC+L+DTF+ELE E
Sbjct: 610 DPEIDVQLPSMPLLKHDEVPSYLRPSTPFAFLRTAILGQFKKLDKPFCVLIDTFQELEPE 669

Query: 721 VINDMSKNFPIKAVGPLFKICSEMETKIRGDCMKAADECIEWLDSKPIGSVVYVSFGSVV 780
           ++  MSK   IK VGPL K      + IR D MK  D+CIEWLDSKP  SV+Y+SFG+V 
Sbjct: 670 IVEYMSKFCLIKTVGPLVKYPEVPNSTIRCDMMKP-DDCIEWLDSKPASSVIYISFGTVF 729

Query: 781 FLKQDQIDEIAYALHSSGFSFLWVLKPPSVHLGADRHVLPLEVVEEMGERGKVVEWSPQE 840
           +LKQ+Q+DEIA AL ++G SFLWV+KPP+   G   H LP   +E++G+ GK++ WSPQ 
Sbjct: 730 YLKQEQVDEIAKALLATGISFLWVMKPPAKEFGLPFHTLPEGFLEKVGDNGKILLWSPQV 789

Query: 841 QVLSHPSLACFLTHCGWNSSVEAMSLGVPMVAFPQWGDQVTNAKFLVDVFGVGLRLSRGA 900
           +VL+HPS++CF++HCGWNS +E +S GVP++AFPQWGDQVTNA +LVDVF  GLR+ RG 
Sbjct: 790 KVLTHPSISCFMSHCGWNSVLETLSCGVPIIAFPQWGDQVTNAVYLVDVFKTGLRMGRGK 849

Query: 901 NEDRLIQRDEIETCLREAMEGPRAVEIRQNALKQQKAAEKAVADGGSSDRNIKDFIDEIR 928
            +  +  ++E+  C  EA  G +A +++ NALK + AAE+A+ADGGSSDRN++ FIDE++
Sbjct: 850 GKKGITPKEEVAKCFVEATLGLKAKDLKSNALKWKLAAEEAIADGGSSDRNMQTFIDEVK 907

BLAST of Cp4.1LG04g07760 vs. ExPASy TrEMBL
Match: A0A6J1EIC9 (Glycosyltransferase OS=Cucurbita moschata OX=3662 GN=LOC111434621 PE=3 SV=1)

HSP 1 Score: 927 bits (2395), Expect = 0.0
Identity = 449/463 (96.98%), Positives = 457/463 (98.70%), Query Frame = 0

Query: 1   MMNSEEAPCHVFLVCYPSQGHINPTLRLAKKLAAEGLLVTISTAVHFGKTLQKAGSIGAG 60
           MM SEEAPCHVFLVCYPSQGHINPTLRLAKKLAAEGLLVTISTAVHFGKTLQKAGSIGAG
Sbjct: 1   MMYSEEAPCHVFLVCYPSQGHINPTLRLAKKLAAEGLLVTISTAVHFGKTLQKAGSIGAG 60

Query: 61  DCPTPVGNGFIRFEFFEDRLQEINPKDMNLTRYNNQLELSGRPSLTGLIKNQTAENRPVS 120
           DCPTPVGNGFIRFEFFED LQEINPKDMNLTRYNNQLELSGRPSLTGLIKNQTAENRPVS
Sbjct: 61  DCPTPVGNGFIRFEFFEDGLQEINPKDMNLTRYNNQLELSGRPSLTGLIKNQTAENRPVS 120

Query: 121 CLIVNPFFPWTCEVAKELGIPCAVLWVQSCSVFSIYYHCFHKSVPFPSELEPKIDVHLPI 180
           CLI+NPFFPWTCEVAKEL IPCAVLWVQSC+VFSIYYHCFHKSVPFPSELEPKIDVHLPI
Sbjct: 121 CLILNPFFPWTCEVAKELEIPCAVLWVQSCAVFSIYYHCFHKSVPFPSELEPKIDVHLPI 180

Query: 181 LPLLKNDEIPSFLHPNNIYGVLGKVLLSQFSKLSIPFCILMDSFDELEKDIISYMSNIIP 240
           LPLLKNDEIPSFLHPNNIYGVLGKVLLSQFSKLSIPFCILMD+FDELEKDII+YMSNIIP
Sbjct: 181 LPLLKNDEIPSFLHPNNIYGVLGKVLLSQFSKLSIPFCILMDTFDELEKDIINYMSNIIP 240

Query: 241 LKPIGPLFLNPQNVETEVSADCLKAEDCMEWLNSKPPQSVVYVSFGSIVYLKQEQIDELA 300
           LKPIGPLFLNPQNVETEVS DCLKAEDCMEWLNSKPPQSVVYVSFGSIVYLKQEQIDELA
Sbjct: 241 LKPIGPLFLNPQNVETEVSVDCLKAEDCMEWLNSKPPQSVVYVSFGSIVYLKQEQIDELA 300

Query: 301 YGLCNSGLSFLWVMKPPNEALGLKGHILPEGVMEKAGERGKVVQWSSQERVLSHESVGCF 360
           YGLCNSG SFLWVMKPPNEALGLKGHILPEGVMEKAGERGKVVQWSSQERVLSHESVGCF
Sbjct: 301 YGLCNSGFSFLWVMKPPNEALGLKGHILPEGVMEKAGERGKVVQWSSQERVLSHESVGCF 360

Query: 361 MTHCGWNSSVEAIGCGVPVVAFPQWGDQVTNAKFLVEDYGVGVRLSRGAEANELISRDEI 420
           MTHCGWNSSVEAIG GVPVVAFPQWGDQVTNAKFLVED+GVGVRLSRGAEANELISRDEI
Sbjct: 361 MTHCGWNSSVEAIGTGVPVVAFPQWGDQVTNAKFLVEDFGVGVRLSRGAEANELISRDEI 420

Query: 421 VRCISEVMTRDSSGGEFRRNALKLKQAAAAAVVDGGSSHKNIQ 463
           VRCISEVMTRD+SGGEFRRNALKLKQAAAAAVVDGG+SH+NIQ
Sbjct: 421 VRCISEVMTRDNSGGEFRRNALKLKQAAAAAVVDGGASHQNIQ 463

BLAST of Cp4.1LG04g07760 vs. ExPASy TrEMBL
Match: A0A6J1JDZ1 (Glycosyltransferase OS=Cucurbita maxima OX=3661 GN=LOC111486011 PE=3 SV=1)

HSP 1 Score: 923 bits (2386), Expect = 0.0
Identity = 456/465 (98.06%), Positives = 460/465 (98.92%), Query Frame = 0

Query: 465 SKACLPHVFLVSFPGQGHINPMLRLGKKLAAAGLLVTFSTSVQLGSQMKNAGSISDHPTP 524
           SKACLPHVFLVSFPGQGHINPMLRLGKKLAAAGLLVTFSTS QLGSQMKNAGSIS HPT 
Sbjct: 3   SKACLPHVFLVSFPGQGHINPMLRLGKKLAAAGLLVTFSTSAQLGSQMKNAGSISGHPTR 62

Query: 525 LGDGFLRFEFFDDGRTDTTPTLTYDEYMVQLQRLGAISLRQILENQMKENRPVSCVIGNP 584
           LGDGFLRFEFFDDGRTDTTPTLTYD+YMVQLQRLGAISLRQILENQMKENRPVSCVIGNP
Sbjct: 63  LGDGFLRFEFFDDGRTDTTPTLTYDKYMVQLQRLGAISLRQILENQMKENRPVSCVIGNP 122

Query: 585 FVPWVIDLADNLGISSAVFWVQSCSVFSVYYHHFRGAVPFPSQTQPNLDVKLPFLPLLKS 644
           FVPWVIDLADNLGISSAVFWVQSCSVFSVYYHHFRGAVPFPSQTQPNLDVKLPFLPLLKS
Sbjct: 123 FVPWVIDLADNLGISSAVFWVQSCSVFSVYYHHFRGAVPFPSQTQPNLDVKLPFLPLLKS 182

Query: 645 DEIPSFLTPNDSHQAIGKDILRQFSNLSKPFCILMDTFEELEAEVINDMSKNFPIKAVGP 704
           DEIPSFLTPNDS+QAIGKDILRQFSNLSKPFCILMDTFEELEAEVINDMSKNFPIKAVGP
Sbjct: 183 DEIPSFLTPNDSYQAIGKDILRQFSNLSKPFCILMDTFEELEAEVINDMSKNFPIKAVGP 242

Query: 705 LFKICSEMETKIRGDCMKAADECIEWLDSKPIGSVVYVSFGSVVFLKQDQIDEIAYALHS 764
           LFKICSEMETKIRGDCMKAADECIEWLDSKPIGSVVYVSFGSVV+LKQDQIDEIAYALHS
Sbjct: 243 LFKICSEMETKIRGDCMKAADECIEWLDSKPIGSVVYVSFGSVVYLKQDQIDEIAYALHS 302

Query: 765 SGFSFLWVLKPPSVHLGADRHVLPLEVVEEMGERGKVVEWSPQEQVLSHPSLACFLTHCG 824
           SGFSFLWVLKPP VHLGA+RHVLPLEVVEEMGERGKVVEWSPQEQVLSHPSLACFLTHCG
Sbjct: 303 SGFSFLWVLKPPYVHLGANRHVLPLEVVEEMGERGKVVEWSPQEQVLSHPSLACFLTHCG 362

Query: 825 WNSSVEAMSLGVPMVAFPQWGDQVTNAKFLVDVFGVGLRLSRGANEDRLIQRDEIETCLR 884
           WNSSVEAMSLGVPMVAFPQWGDQVTNAKFLVDVFGVGLRLSRGANEDRLIQRDEIE CLR
Sbjct: 363 WNSSVEAMSLGVPMVAFPQWGDQVTNAKFLVDVFGVGLRLSRGANEDRLIQRDEIERCLR 422

Query: 885 EAMEGPRAVEIRQNALKQQKAAEKAVADGGSSDRNIKDFIDEIRK 929
           EAMEGPRAVEIRQNALKQQKAAEKAVADGGSSDRNIKDFIDEIRK
Sbjct: 423 EAMEGPRAVEIRQNALKQQKAAEKAVADGGSSDRNIKDFIDEIRK 467

BLAST of Cp4.1LG04g07760 vs. TAIR 10
Match: AT4G15480.1 (UDP-Glycosyltransferase superfamily protein)

HSP 1 Score: 503.1 bits (1294), Expect = 4.9e-142
Identity = 238/460 (51.74%), Positives = 328/460 (71.30%), Query Frame = 0

Query: 471 HVFLVSFPGQGHINPMLRLGKKLAAAGLLVTFSTSVQLGSQMKNAGSISD-HPTPLGDGF 530
           HV LVSF GQGH+NP+LRLGK +A+ GLLVTF T+   G +M+ A  I D    P+G G 
Sbjct: 19  HVMLVSFQGQGHVNPLLRLGKLIASKGLLVTFVTTELWGKKMRQANKIVDGELKPVGSGS 78

Query: 531 LRFEFFD-DGRTDTTPTLTYDEYMVQLQRLGAISLRQILENQMKENRPVSCVIGNPFVPW 590
           +RFEFFD +   D      +  Y+  L+ +G   + +++    + N PVSC+I NPF+PW
Sbjct: 79  IRFEFFDEEWAEDDDRRADFSLYIAHLESVGIREVSKLVRRYEEANEPVSCLINNPFIPW 138

Query: 591 VIDLADNLGISSAVFWVQSCSVFSVYYHHFRGAVPFPSQTQPNLDVKLPFLPLLKSDEIP 650
           V  +A+   I  AV WVQSC+ FS YYH+  G+V FP++T+P LDVKLP +P+LK+DEIP
Sbjct: 139 VCHVAEEFNIPCAVLWVQSCACFSAYYHYQDGSVSFPTETEPELDVKLPCVPVLKNDEIP 198

Query: 651 SFLTPNDSHQAIGKDILRQFSNLSKPFCILMDTFEELEAEVINDMSKNFPIKAVGPLFKI 710
           SFL P+       + IL QF NLSK FC+L+D+F+ LE EVI+ MS   P+K VGPLFK+
Sbjct: 199 SFLHPSSRFTGFRQAILGQFKNLSKSFCVLIDSFDSLEQEVIDYMSSLCPVKTVGPLFKV 258

Query: 711 CSEMETKIRGDCMKAADECIEWLDSKPIGSVVYVSFGSVVFLKQDQIDEIAYALHSSGFS 770
              + + + GD  K+ D+C+EWLDS+P  SVVY+SFG+V +LKQ+QI+EIA+ +  SG S
Sbjct: 259 ARTVTSDVSGDICKSTDKCLEWLDSRPKSSVVYISFGTVAYLKQEQIEEIAHGVLKSGLS 318

Query: 771 FLWVLKPPSVHLGADRHVLPLEVVEEMGE-RGKVVEWSPQEQVLSHPSLACFLTHCGWNS 830
           FLWV++PP   L  + HVLP E+ E   + +G +V+W PQEQVLSHPS+ACF+THCGWNS
Sbjct: 319 FLWVIRPPPHDLKVETHVLPQELKESSAKGKGMIVDWCPQEQVLSHPSVACFVTHCGWNS 378

Query: 831 SVEAMSLGVPMVAFPQWGDQVTNAKFLVDVFGVGLRLSRGANEDRLIQRDEIETCLREAM 890
           ++E++S GVP+V  PQWGDQVT+A +L+DVF  G+RL RGA E+R++ R+E+   L EA 
Sbjct: 379 TMESLSSGVPVVCCPQWGDQVTDAVYLIDVFKTGVRLGRGATEERVVPREEVAEKLLEAT 438

Query: 891 EGPRAVEIRQNALKQQKAAEKAVADGGSSDRNIKDFIDEI 928
            G +A E+R+NALK +  AE AVA GGSSD+N ++F++++
Sbjct: 439 VGEKAEELRKNALKWKAEAEAAVAPGGSSDKNFREFVEKL 478

BLAST of Cp4.1LG04g07760 vs. TAIR 10
Match: AT4G15500.1 (UDP-Glycosyltransferase superfamily protein)

HSP 1 Score: 478.0 bits (1229), Expect = 1.7e-134
Identity = 238/470 (50.64%), Positives = 334/470 (71.06%), Query Frame = 0

Query: 466 KACLPHVFLVSFPGQGHINPMLRLGKKLAAAGLLVTFSTSVQ-LGSQMKNAGSISDHP-T 525
           ++ LPHV LVSFPGQGHI+P+LRLGK +A+ GL+VTF T+ + LG +M+ A +I D    
Sbjct: 4   ESSLPHVMLVSFPGQGHISPLLRLGKIIASKGLIVTFVTTEEPLGKKMRQANNIQDGVLK 63

Query: 526 PLGDGFLRFEFFDDGRTDTTPTLTYDEYMVQLQRLGAISLRQILENQMK--ENRPVSCVI 585
           P+G GFLRFEFF+DG         Y E    LQ+   +S ++ ++N +K  E +PV C+I
Sbjct: 64  PVGLGFLRFEFFEDG-------FVYKEDFDLLQKSLEVSGKREIKNLVKKYEKQPVRCLI 123

Query: 586 GNPFVPWVIDLADNLGISSAVFWVQSCSVFSVYYHHFRGAVPFPSQTQPNLDVKLPFLPL 645
            N FVPWV D+A+ L I SAV WVQSC+  + YY++    V FP++T+P + V +PF PL
Sbjct: 124 NNAFVPWVCDIAEELQIPSAVLWVQSCACLAAYYYYHHQLVKFPTETEPEITVDVPFKPL 183

Query: 646 -LKSDEIPSFLTPNDSHQAIGKDILRQFSNLSKPFCILMDTFEELEAEVINDMSKNFP-- 705
            LK DEIPSFL P+    +IG  IL Q   L KPF +L++TF+ELE + I+ MS+  P  
Sbjct: 184 TLKHDEIPSFLHPSSPLSSIGGTILEQIKRLHKPFSVLIETFQELEKDTIDHMSQLCPQV 243

Query: 706 -IKAVGPLFKICSEMETKIRGDCMKAADECIEWLDSKPIGSVVYVSFGSVVFLKQDQIDE 765
               +GPLF +   + + I+GD  K   +CIEWLDS+   SVVY+SFG++ FLKQ+QIDE
Sbjct: 244 NFNPIGPLFTMAKTIRSDIKGDISKPDSDCIEWLDSREPSSVVYISFGTLAFLKQNQIDE 303

Query: 766 IAYALHSSGFSFLWVLKPPSVHLGADRHVLPLEVVEEMGERGKVVEWSPQEQVLSHPSLA 825
           IA+ + +SG S LWVL+PP   L  + HVLPL    E+ E+GK+VEW  QE+VL+HP++A
Sbjct: 304 IAHGILNSGLSCLWVLRPPLEGLAIEPHVLPL----ELEEKGKIVEWCQQEKVLAHPAVA 363

Query: 826 CFLTHCGWNSSVEAMSLGVPMVAFPQWGDQVTNAKFLVDVFGVGLRLSRGANEDRLIQRD 885
           CFL+HCGWNS++EA++ GVP++ FPQWGDQVTNA +++DVF  GLRLSRGA+++R++ R+
Sbjct: 364 CFLSHCGWNSTMEALTSGVPVICFPQWGDQVTNAVYMIDVFKTGLRLSRGASDERIVPRE 423

Query: 886 EIETCLREAMEGPRAVEIRQNALKQQKAAEKAVADGGSSDRNIKDFIDEI 928
           E+   L EA  G +AVE+R+NA + ++ AE AVA GG+S+RN ++F+D++
Sbjct: 424 EVAERLLEATVGEKAVELRENARRWKEEAESAVAYGGTSERNFQEFVDKL 462

BLAST of Cp4.1LG04g07760 vs. TAIR 10
Match: AT4G15490.1 (UDP-Glycosyltransferase superfamily protein)

HSP 1 Score: 462.6 bits (1189), Expect = 7.4e-130
Identity = 229/466 (49.14%), Positives = 323/466 (69.31%), Query Frame = 0

Query: 471 HVFLVSFPGQGHINPMLRLGKKLAAAGLLVTF-STSVQLGSQMKNAGSISDHP-TPLGDG 530
           HV LVSFPGQGH+NP+LRLGK +A+ GLLVTF +T    G +M+ A  I D    P+G G
Sbjct: 8   HVMLVSFPGQGHVNPLLRLGKLIASKGLLVTFVTTEKPWGKKMRQANKIQDGVLKPVGLG 67

Query: 531 FLRFEFFDDG-RTDTTPTLTYDEYMVQLQRLGAISLRQILENQMKENRPVSCVIGNPFVP 590
           F+RFEFF DG   D      +D +   L+ +G   ++ +++   KE  PV+C+I N FVP
Sbjct: 68  FIRFEFFSDGFADDDEKRFDFDAFRPHLEAVGKQEIKNLVKRYNKE--PVTCLINNAFVP 127

Query: 591 WVIDLADNLGISSAVFWVQSCSVFSVYYHHFRGAVPFPSQTQPNLDVKLPFLPLLKSDEI 650
           WV D+A+ L I SAV WVQSC+  + YY++    V FP++T+P++ V++P LPLLK DEI
Sbjct: 128 WVCDVAEELHIPSAVLWVQSCACLTAYYYYHHRLVKFPTKTEPDISVEIPCLPLLKHDEI 187

Query: 651 PSFLTPNDSHQAIGK---DILRQFSNLSKPFCILMDTFEELEAEVINDMSKNFP---IKA 710
           PSFL P+  + A G    D L++F N  K F + +DTF ELE ++++ MS+  P   I  
Sbjct: 188 PSFLHPSSPYTAFGDIILDQLKRFEN-HKSFYLFIDTFRELEKDIMDHMSQLCPQAIISP 247

Query: 711 VGPLFKICSEMETKIRGDCMKAADECIEWLDSKPIGSVVYVSFGSVVFLKQDQIDEIAYA 770
           VGPLFK+   + + ++GD  + A +C+EWLDS+   SVVY+SFG++  LKQ+Q++EIA+ 
Sbjct: 248 VGPLFKMAQTLSSDVKGDISEPASDCMEWLDSREPSSVVYISFGTIANLKQEQMEEIAHG 307

Query: 771 LHSSGFSFLWVLKPPSVHLGADRHVLPLEVVEEMGERGKVVEWSPQEQVLSHPSLACFLT 830
           + SSG S LWV++PP      + HVLP     E+ E+GK+VEW PQE+VL+HP++ACFL+
Sbjct: 308 VLSSGLSVLWVVRPPMEGTFVEPHVLP----RELEEKGKIVEWCPQERVLAHPAIACFLS 367

Query: 831 HCGWNSSVEAMSLGVPMVAFPQWGDQVTNAKFLVDVFGVGLRLSRGANEDRLIQRDEIET 890
           HCGWNS++EA++ GVP+V FPQWGDQVT+A +L DVF  G+RL RGA E+ ++ R+ +  
Sbjct: 368 HCGWNSTMEALTAGVPVVCFPQWGDQVTDAVYLADVFKTGVRLGRGAAEEMIVSREVVAE 427

Query: 891 CLREAMEGPRAVEIRQNALKQQKAAEKAVADGGSSDRNIKDFIDEI 928
            L EA  G +AVE+R+NA + +  AE AVADGGSSD N K+F+D++
Sbjct: 428 KLLEATVGEKAVELRENARRWKAEAEAAVADGGSSDMNFKEFVDKL 466
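
The identity and positives percentages in an HSP header are the match counts divided by the alignment length. The following is a minimal Python sketch that parses the header of the AT4G15490.1 hit above and recomputes both values. The function name `parse_hsp_stats` and the regular expression are illustrative assumptions, not part of any BLAST tool.

```python
import re

# Pattern for the "Identity = a/b (p%), Positives = c/d (q%)" part of an HSP header.
# "Posi?tives" also accepts the "Postives" spelling that appears in some report exports.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)

def parse_hsp_stats(line):
    """Return (identity_pct, positives_pct) recomputed from the raw counts."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("no HSP statistics found")
    ident, ident_len, _, pos, pos_len, _ = m.groups()
    return (
        round(100 * int(ident) / int(ident_len), 2),
        round(100 * int(pos) / int(pos_len), 2),
    )

# Header text copied from the AT4G15490.1 hit.
header = "Identity = 229/466 (49.14%), Positives = 323/466 (69.31%), Query Frame = 0"
identity_pct, positives_pct = parse_hsp_stats(header)
print(identity_pct, positives_pct)
```

The recomputed values can be compared with the percentages printed in the header as a quick consistency check on a parsed report.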

BLAST of Cp4.1LG04g07760 vs. TAIR 10
Match: AT3G21560.1 (UDP-Glycosyltransferase superfamily protein )

HSP 1 Score: 455.7 bits (1171), Expect = 9.0e-128
Identity = 220/474 (46.41%), Positives = 327/474 (68.99%), Query Frame = 0

Query: 462 IQHSKACLPHVFLVSFPGQGHINPMLRLGKKLAAAGLLVTFSTSVQLGSQMKNAGSISDH 521
           ++ S    PHV LVSFPGQGH+NP+LRLGK LA+ GLL+TF T+   G +M+ +  I D 
Sbjct: 3   LESSPPLPPHVMLVSFPGQGHVNPLLRLGKLLASKGLLITFVTTESWGKKMRISNKIQDR 62

Query: 522 P-TPLGDGFLRFEFFDDG--RTDTTPTLTYDEYMVQLQRLGAISLRQILENQMK-ENRPV 581
              P+G G+LR++FFDDG    D             L+ +G   ++ +++   +   +PV
Sbjct: 63  VLKPVGKGYLRYDFFDDGLPEDDEASRTNLTILRPHLELVGKREIKNLVKRYKEVTKQPV 122

Query: 582 SCVIGNPFVPWVIDLADNLGISSAVFWVQSCSVFSVYYHHFRGAVPFPSQTQPNLDVKLP 641
           +C+I NPFV WV D+A++L I  AV WVQSC+  + YY++    V FP++T+P +DV++ 
Sbjct: 123 TCLINNPFVSWVCDVAEDLQIPCAVLWVQSCACLAAYYYYHHNLVDFPTKTEPEIDVQIS 182

Query: 642 FLPLLKSDEIPSFLTPNDSHQAIGKDILRQFSNLSKPFCILMDTFEELEAEVINDMSK-N 701
            +PLLK DEIPSF+ P+  H A+ + I+ Q   L K F I +DTF  LE ++I+ MS  +
Sbjct: 183 GMPLLKHDEIPSFIHPSSPHSALREVIIDQIKRLHKTFSIFIDTFNSLEKDIIDHMSTLS 242

Query: 702 FP--IKAVGPLFKICSEME-TKIRGDCMKAADECIEWLDSKPIGSVVYVSFGSVVFLKQD 761
            P  I+ +GPL+K+   +    ++ +  +  D C+EWLDS+P+ SVVY+SFG+V +LKQ+
Sbjct: 243 LPGVIRPLGPLYKMAKTVAYDVVKVNISEPTDPCMEWLDSQPVSSVVYISFGTVAYLKQE 302

Query: 762 QIDEIAYALHSSGFSFLWVLKPPSVHLGADRHVLPLEVVEEMGERGKVVEWSPQEQVLSH 821
           QIDEIAY + ++  +FLWV++   +    ++HVLP    EE+  +GK+VEW  QE+VLSH
Sbjct: 303 QIDEIAYGVLNADVTFLWVIRQQELGFNKEKHVLP----EEVKGKGKIVEWCSQEKVLSH 362

Query: 822 PSLACFLTHCGWNSSVEAMSLGVPMVAFPQWGDQVTNAKFLVDVFGVGLRLSRGANEDRL 881
           PS+ACF+THCGWNS++EA+S GVP V FPQWGDQVT+A +++DV+  G+RLSRG  E+RL
Sbjct: 363 PSVACFVTHCGWNSTMEAVSSGVPTVCFPQWGDQVTDAVYMIDVWKTGVRLSRGEAEERL 422

Query: 882 IQRDEIETCLREAMEGPRAVEIRQNALKQQKAAEKAVADGGSSDRNIKDFIDEI 928
           + R+E+   LRE  +G +A+E+++NALK ++ AE AVA GGSSDRN++ F++++
Sbjct: 423 VPREEVAERLREVTKGEKAIELKKNALKWKEEAEAAVARGGSSDRNLEKFVEKL 472

BLAST of Cp4.1LG04g07760 vs. TAIR 10
Match: AT1G05680.1 (Uridine diphosphate glycosyltransferase 74E2 )

HSP 1 Score: 319.3 bits (817), Expect = 1.0e-86
Identity = 174/465 (37.42%), Positives = 272/465 (58.49%), Query Frame = 0

Query: 471 HVFLVSFPGQGHINPMLRLGKKLAAAGLLVTFSTSVQLGSQMKNAGSISDHPTP----LG 530
           H+ ++ FPGQGHI PM +  K+LA+ GL +T                +SD P+P      
Sbjct: 6   HLIVLPFPGQGHITPMSQFCKRLASKGLKLTLVL-------------VSDKPSPPYKTEH 65

Query: 531 DGFLRFEFFDDGRTDTTPTLTYDEYMVQLQRLGAISLRQILENQMKENRPVSCVIGNPFV 590
           D    F   +  +    P    D+YM +++     +L +++E+      P   ++ +  +
Sbjct: 66  DSITVFPISNGFQEGEEPLQDLDDYMERVETSIKNTLPKLVEDMKLSGNPPRAIVYDSTM 125

Query: 591 PWVIDLADNLGISSAVFWVQSCSVFSVYYHHFRGAVPFPSQTQPNLDV-KLPFLPLLKSD 650
           PW++D+A + G+S AVF+ Q   V ++YYH F+G+   PS    +  +   P  P+L ++
Sbjct: 126 PWLLDVAHSYGLSGAVFFTQPWLVTAIYYHVFKGSFSVPSTKYGHSTLASFPSFPMLTAN 185

Query: 651 EIPSFLTPNDSHQAIGKDILRQFSNLSKPFCILMDTFEELEAEVINDMSKNFPIKAVGP- 710
           ++PSFL  + S+  I + ++ Q SN+ +   +L +TF++LE +++  +   +P+  +GP 
Sbjct: 186 DLPSFLCESSSYPNILRIVVDQLSNIDRVDIVLCNTFDKLEEKLLKWVQSLWPVLNIGPT 245

Query: 711 -----LFKICSEMETKIRGDCMKAADECIEWLDSKPIGSVVYVSFGSVVFLKQDQIDEIA 770
                L K  SE +            EC+EWL+SK   SVVY+SFGS+V LK+DQ+ E+A
Sbjct: 246 VPSMYLDKRLSEDKNYGFSLFNAKVAECMEWLNSKEPNSVVYLSFGSLVILKEDQMLELA 305

Query: 771 YALHSSGFSFLWVLKPPSVHLGADRHVLPLEVVEEMGERGKVVEWSPQEQVLSHPSLACF 830
             L  SG  FLWV++        + H LP   VEE+GE+G +V WSPQ  VL+H S+ CF
Sbjct: 306 AGLKQSGRFFLWVVRE------TETHKLPRNYVEEIGEKGLIVSWSPQLDVLAHKSIGCF 365

Query: 831 LTHCGWNSSVEAMSLGVPMVAFPQWGDQVTNAKFLVDVFGVGLRLSRGANEDRLIQRDEI 890
           LTHCGWNS++E +SLGVPM+  P W DQ TNAKF+ DV+ VG+R+   A  D  ++R+EI
Sbjct: 366 LTHCGWNSTLEGLSLGVPMIGMPHWTDQPTNAKFMQDVWKVGVRVK--AEGDGFVRREEI 425

Query: 891 ETCLREAMEGPRAVEIRQNALKQQKAAEKAVADGGSSDRNIKDFI 925
              + E MEG +  EIR+NA K +  A++AV++GGSSD++I +F+
Sbjct: 426 MRSVEEVMEGEKGKEIRKNAEKWKVLAQEAVSEGGSSDKSINEFV 449

The following BLAST results are available for this feature:
Match Name      E-value   Identity (%)  Description
A0A193AU77      6.9e-165  58.66         Gallate 1-beta-glucosyltransferase 84A24 OS=Punica granatum OX=22663 GN=UGT84A24...
A0A193AUF6      9.0e-165  58.37         Gallate 1-beta-glucosyltransferase 84A23 OS=Punica granatum OX=22663 GN=UGT84A23...
V5LLZ9          1.3e-160  58.35         Gallate 1-beta-glucosyltransferase OS=Quercus robur OX=38942 GN=UGT84A13 PE=1 SV...
Q9MB73          8.7e-160  56.93         Limonoid UDP-glucosyltransferase OS=Citrus unshiu OX=55188 PE=2 SV=1
Q2V6K1          1.3e-158  59.40         Putative UDP-glucose glucosyltransferase OS=Fragaria ananassa OX=3747 GN=GT5 PE=...

Match Name      E-value   Identity (%)  Description
KJB32309.1      0.0       50.53         hypothetical protein B456_005G234600 [Gossypium raimondii]
KAB2042771.1    0.0       50.53         hypothetical protein ES319_D02G239900v1 [Gossypium barbadense]
XP_023530167.1  0.0       100.00        putative UDP-glucose glucosyltransferase [Cucurbita pepo subsp. pepo]
TYI94980.1      0.0       50.67         hypothetical protein E1A91_D02G245100v1 [Gossypium mustelinum]
XP_023532006.1  0.0       100.00        gallate 1-beta-glucosyltransferase-like [Cucurbita pepo subsp. pepo]

Match Name      E-value   Identity (%)  Description
A0A0D2RKC6      0.0       50.53         Uncharacterized protein OS=Gossypium raimondii OX=29730 GN=B456_005G234600 PE=3 ...
A0A5J5SMY2      0.0       50.53         Uncharacterized protein OS=Gossypium barbadense OX=3634 GN=ES319_D02G239900v1 PE...
A0A5D2W1U2      0.0       50.67         Uncharacterized protein OS=Gossypium mustelinum OX=34275 GN=E1A91_D02G245100v1 P...
A0A6J1EIC9      0.0       96.98         Glycosyltransferase OS=Cucurbita moschata OX=3662 GN=LOC111434621 PE=3 SV=1
A0A6J1JDZ1      0.0       98.06         Glycosyltransferase OS=Cucurbita maxima OX=3661 GN=LOC111486011 PE=3 SV=1

Match Name      E-value   Identity (%)  Description
AT4G15480.1     4.9e-142  51.74         UDP-Glycosyltransferase superfamily protein
AT4G15500.1     1.7e-134  50.64         UDP-Glycosyltransferase superfamily protein
AT4G15490.1     7.4e-130  49.14         UDP-Glycosyltransferase superfamily protein
AT3G21560.1     9.0e-128  46.41         UDP-Glycosyltransferase superfamily protein
AT1G05680.1     1.0e-86   37.42         Uridine diphosphate glycosyltransferase 74E2

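
The Arabidopsis (TAIR 10) hits listed above can be ranked or filtered by E-value and percent identity. The sketch below does this in Python with values copied from that table. The function name `filter_hits` and both cutoffs (`max_evalue`, `min_identity`) are hypothetical choices for illustration, not thresholds used by the database.

```python
# (accession, E-value, percent identity) copied from the TAIR 10 summary table.
tair_hits = [
    ("AT4G15480.1", 4.9e-142, 51.74),
    ("AT4G15500.1", 1.7e-134, 50.64),
    ("AT4G15490.1", 7.4e-130, 49.14),
    ("AT3G21560.1", 9.0e-128, 46.41),
    ("AT1G05680.1", 1.0e-86, 37.42),
]

def filter_hits(hits, max_evalue=1e-100, min_identity=45.0):
    """Keep hits at or below the E-value cutoff and at or above the identity cutoff.

    Both cutoffs are illustrative assumptions. The result is sorted by
    ascending E-value, so the most significant hit comes first.
    """
    kept = [h for h in hits if h[1] <= max_evalue and h[2] >= min_identity]
    return sorted(kept, key=lambda h: h[1])

strong = filter_hits(tair_hits)
for accession, evalue, identity in strong:
    print(f"{accession}\t{evalue:.1e}\t{identity:.2f}")
```

With these example cutoffs the UGT74E2 hit (AT1G05680.1) is dropped and the other four hits are retained.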
InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR Term   IPR Description                                 Source       Source Term      Source Description                              Alignment
None       No IPR available                                GENE3D       3.40.50.2000     Glycogen Phosphorylase B;                       coord: 257..450, e-value: 2.7E-135, score: 454.0; coord: 715..908, e-value: 2.2E-137, score: 460.9
None       No IPR available                                GENE3D       3.40.50.2000     Glycogen Phosphorylase B;                       coord: 475..920, e-value: 2.2E-137, score: 460.9
None       No IPR available                                GENE3D       3.40.50.2000     Glycogen Phosphorylase B;                       coord: 14..462, e-value: 2.7E-135, score: 454.0
None       No IPR available                                MOBIDB_LITE  mobidb-lite      disorder_prediction                             coord: 906..929
None       No IPR available                                MOBIDB_LITE  mobidb-lite      disorder_prediction                             coord: 914..929
None       No IPR available                                PANTHER      PTHR11926:SF986  UDP-GLYCOSYLTRANSFERASE 84A1                    coord: 7..464
None       No IPR available                                PANTHER      PTHR11926:SF986  UDP-GLYCOSYLTRANSFERASE 84A1                    coord: 470..928
None       No IPR available                                PANTHER      PTHR11926        GLUCOSYL/GLUCURONOSYL TRANSFERASES              coord: 470..928; coord: 7..464
None       No IPR available                                SUPERFAMILY  53756            UDP-Glycosyltransferase/glycogen phosphorylase  coord: 470..928
None       No IPR available                                SUPERFAMILY  53756            UDP-Glycosyltransferase/glycogen phosphorylase  coord: 9..462
IPR002213  UDP-glucuronosyl/UDP-glucosyltransferase        PFAM         PF00201          UDPGT                                           coord: 726..859, e-value: 2.4E-23, score: 82.7; coord: 265..407, e-value: 3.3E-20, score: 72.4
IPR002213  UDP-glucuronosyl/UDP-glucosyltransferase        CDD          cd03784          GT1_Gtf-like                                    coord: 470..911, e-value: 1.80272E-80, score: 264.799
IPR002213  UDP-glucuronosyl/UDP-glucosyltransferase        CDD          cd03784          GT1_Gtf-like                                    coord: 10..440, e-value: 3.11169E-75, score: 250.546
IPR035595  UDP-glycosyltransferase family, conserved site  PROSITE      PS00375          UDPGT                                           coord: 804..847
IPR035595  UDP-glycosyltransferase family, conserved site  PROSITE      PS00375          UDPGT                                           coord: 345..388
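
Each alignment entry in the InterPro table above carries a `coord: start..end` range. The sketch below extracts those ranges in Python and computes the length of each match, using the two Pfam PF00201 ranges from the table as input. It assumes the coordinates are 1-based and inclusive, which the page does not state; the names `parse_coords` and `span_length` are hypothetical.

```python
import re

# Matches "coord: 726..859" style ranges as they appear in the Alignment column.
COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")

def parse_coords(text):
    """Return every (start, end) pair found in an Alignment cell."""
    return [(int(start), int(end)) for start, end in COORD.findall(text)]

def span_length(start, end):
    """Length of a range, assuming 1-based inclusive coordinates (an assumption)."""
    return end - start + 1

# Alignment text for the Pfam PF00201 (UDPGT) row, copied from the table.
pfam_alignment = "coord: 726..859 e-value: 2.4E-23 score: 82.7 coord: 265..407 e-value: 3.3E-20 score: 72.4"

pfam_spans = parse_coords(pfam_alignment)
pfam_lengths = [span_length(s, e) for s, e in pfam_spans]
print(pfam_spans, pfam_lengths)
```

The same two helpers apply unchanged to the other rows, for example to compare the extent of the PANTHER and SUPERFAMILY matches.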

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG04g07760.1  Cp4.1LG04g07760.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
molecular_function  GO:0008194      UDP-glycosyltransferase activity