ClCG02G012220 (gene) Watermelon (Charleston Gray) v2.5

Overview
NameClCG02G012220
Typegene
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionGlycosyltransferase
LocationCG_Chr02: 25503529 .. 25505346 (-)
RNA-Seq ExpressionClCG02G012220
SyntenyClCG02G012220
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
TTTGTTTTCAACCAAACCTTATATATGAATGTACAATTTAAAAAAAAAGATATCCCATATATATGAATGTACATAAGAATGTTAAAAACAATCAAGATCCAATATTTTTAAGCAAGAACATAAGCATCAAACTTGAAACCATGTCCATGCTACCACAAAACAAAGAATTCAGAATCACCGTCCTCCCGCTGTTCGCCTCCGGCCACATAATTCCGATCATCGACATGGCCAGGCTCTTTGCCCGCCACGGTGCCACCGTCACCATCATCGCCACCGAGTCCAACGCCTCCATTTTCCAAAACAACATCGACCATGACTTTGCCGCCGGATTCAAAATTCAGACCCACATTGTTAGCTTCCCTGGAGCCGAGGTTGGCCTCGCCCCCGGCATCGAAAACTACAGCGATGTCTCTTCTCGCCACCTCCAAGCCAAAATCTATCAAGCCTTTCTCATTCTTGACAAACTTATAGACCAGGTTCCTTCCACCTTCTTTAATTGGTTTTTAGTTTAATGCTTATGCGTAACCTGTAAGGTCATTTTTTTCTCCGTATCGTAATTTTATGGTAGATGATCATTCCGGCAACTCGACCAGACTGCATTCTGAGCGACCTGTCGCATCCGTGGACGACGGATACGGCAGAGAGGCTCGGGGTGCCGCGGCTGGTGTTCTCGGTGTCGAATTTCATGGCATACTCTGCAGAGCACTCTGTTATGCAACATTCTCCTCACCAGAAAGTAGCCTCAGACACAGAGGAATTCGAAATCCCAGGATTACCCCACCACATTCAAATGACCAAATCCCAGCAGCCGGAATTTCTTCTCCGACGAGACCGCTTCACGGCGATGATGGAGAGTTACAAGGAAGCAGAGAGAAGAAGCTACGGAACTGTAATGAACACATTTTATGAGCTGGATGGGGTTTATTTAGAGCATTACAAAAAGATAACTGGAATCAAAGCTTGGGGATTAGGCCCAGTTTCATTGGCAGTGAACAAAAATCTGAGAGAAAAAATTGAAAGGGGAAACAAATCGGGAATGGAGAGTGAAGAGCTAGTGAAATGGTTGAATTCCAAGGAACCAAACTCTGTTTTGTTTGTGAGTTTTGGGAGTATGACTAGGTTTCCGCCGCCGCAAATGGCTGAGATTGCACATGGGCTTGAAGATTCCGGCATAAATTTCATATGGGTTATTCGAAACAAGGACAAAAACGACAGTGGAGAGGCGCCAGAGGGGCTGCCGGAGGGGTTCGAACAGATGATTAAGAATAAAAACAGGGGATTCATTGTTCGGATTTGGGCGCCGCAACTTTTGATTTTGGAGCACCCATCGACGGGGGGTTTCTTGACGCACTGTGGGTGGAATTCGTCCATTGAGGGGATCAGCGCCGGTCAGCCGATGGTGACGTGGCCGGTAAGCTCCGAGCAGTTTTATAATGAGAAGCTTCTGACGGAGGTGTTGCAGGTGGGGGTTCCGGTAGGGGCGCGGCGGTGGTGGAATATGAGCGATGAGATGAAGGAGATTGTGAGTAGAGAGAATGTGGAAAAGGGCGTGGGGTTTCTTATGGGGGCGACGGAGGAGGCGGCGGCAATTAGAGAGCGGGCGAAACAGCTCGGGGCTGCTGCGAACAGGGCAGTTCAAAGCGGCGGCTCGTCGGAGAACAATTTGATATCGTTGATGAAAGAATTGAGGTCAATTAAGGTTAACGATAAGGATTAAAATGTCTCTCTCCACACAAATGTTTAGCCTCCAATTTTATGAAAATGTCAATAAAAATGATATTTCTGTAGAAATAGTATAATAAAAATAATATAAAAAAG

mRNA sequence

TTTGTTTTCAACCAAACCTTATATATGAATGTACAATTTAAAAAAAAAGATATCCCATATATATGAATGTACATAAGAATGTTAAAAACAATCAAGATCCAATATTTTTAAGCAAGAACATAAGCATCAAACTTGAAACCATGTCCATGCTACCACAAAACAAAGAATTCAGAATCACCGTCCTCCCGCTGTTCGCCTCCGGCCACATAATTCCGATCATCGACATGGCCAGGCTCTTTGCCCGCCACGGTGCCACCGTCACCATCATCGCCACCGAGTCCAACGCCTCCATTTTCCAAAACAACATCGACCATGACTTTGCCGCCGGATTCAAAATTCAGACCCACATTGTTAGCTTCCCTGGAGCCGAGGTTGGCCTCGCCCCCGGCATCGAAAACTACAGCGATGTCTCTTCTCGCCACCTCCAAGCCAAAATCTATCAAGCCTTTCTCATTCTTGACAAACTTATAGACCAGATGATCATTCCGGCAACTCGACCAGACTGCATTCTGAGCGACCTGTCGCATCCGTGGACGACGGATACGGCAGAGAGGCTCGGGGTGCCGCGGCTGGTGTTCTCGGTGTCGAATTTCATGGCATACTCTGCAGAGCACTCTGTTATGCAACATTCTCCTCACCAGAAAGTAGCCTCAGACACAGAGGAATTCGAAATCCCAGGATTACCCCACCACATTCAAATGACCAAATCCCAGCAGCCGGAATTTCTTCTCCGACGAGACCGCTTCACGGCGATGATGGAGAGTTACAAGGAAGCAGAGAGAAGAAGCTACGGAACTGTAATGAACACATTTTATGAGCTGGATGGGGTTTATTTAGAGCATTACAAAAAGATAACTGGAATCAAAGCTTGGGGATTAGGCCCAGTTTCATTGGCAGTGAACAAAAATCTGAGAGAAAAAATTGAAAGGGGAAACAAATCGGGAATGGAGAGTGAAGAGCTAGTGAAATGGTTGAATTCCAAGGAACCAAACTCTGTTTTGTTTGTGAGTTTTGGGAGTATGACTAGGTTTCCGCCGCCGCAAATGGCTGAGATTGCACATGGGCTTGAAGATTCCGGCATAAATTTCATATGGGTTATTCGAAACAAGGACAAAAACGACAGTGGAGAGGCGCCAGAGGGGCTGCCGGAGGGGTTCGAACAGATGATTAAGAATAAAAACAGGGGATTCATTGTTCGGATTTGGGCGCCGCAACTTTTGATTTTGGAGCACCCATCGACGGGGGGTTTCTTGACGCACTGTGGGTGGAATTCGTCCATTGAGGGGATCAGCGCCGGTCAGCCGATGGTGACGTGGCCGGTAAGCTCCGAGCAGTTTTATAATGAGAAGCTTCTGACGGAGGTGTTGCAGGTGGGGGTTCCGGTAGGGGCGCGGCGGTGGTGGAATATGAGCGATGAGATGAAGGAGATTGTGAGTAGAGAGAATGTGGAAAAGGGCGTGGGGTTTCTTATGGGGGCGACGGAGGAGGCGGCGGCAATTAGAGAGCGGGCGAAACAGCTCGGGGCTGCTGCGAACAGGGCAGTTCAAAGCGGCGGCTCGTCGGAGAACAATTTGATATCGTTGATGAAAGAATTGAGGTCAATTAAGGTTAACGATAAGGATTAAAATGTCTCTCTCCACACAAATGTTTAGCCTCCAATTTTATGAAAATGTCAATAAAAATGATATTTCTGTAGAAATAGTATAATAAAAATAATATAAAAAAG

Coding sequence (CDS)

ATGAATGTACATAAGAATGTTAAAAACAATCAAGATCCAATATTTTTAAGCAAGAACATAAGCATCAAACTTGAAACCATGTCCATGCTACCACAAAACAAAGAATTCAGAATCACCGTCCTCCCGCTGTTCGCCTCCGGCCACATAATTCCGATCATCGACATGGCCAGGCTCTTTGCCCGCCACGGTGCCACCGTCACCATCATCGCCACCGAGTCCAACGCCTCCATTTTCCAAAACAACATCGACCATGACTTTGCCGCCGGATTCAAAATTCAGACCCACATTGTTAGCTTCCCTGGAGCCGAGGTTGGCCTCGCCCCCGGCATCGAAAACTACAGCGATGTCTCTTCTCGCCACCTCCAAGCCAAAATCTATCAAGCCTTTCTCATTCTTGACAAACTTATAGACCAGATGATCATTCCGGCAACTCGACCAGACTGCATTCTGAGCGACCTGTCGCATCCGTGGACGACGGATACGGCAGAGAGGCTCGGGGTGCCGCGGCTGGTGTTCTCGGTGTCGAATTTCATGGCATACTCTGCAGAGCACTCTGTTATGCAACATTCTCCTCACCAGAAAGTAGCCTCAGACACAGAGGAATTCGAAATCCCAGGATTACCCCACCACATTCAAATGACCAAATCCCAGCAGCCGGAATTTCTTCTCCGACGAGACCGCTTCACGGCGATGATGGAGAGTTACAAGGAAGCAGAGAGAAGAAGCTACGGAACTGTAATGAACACATTTTATGAGCTGGATGGGGTTTATTTAGAGCATTACAAAAAGATAACTGGAATCAAAGCTTGGGGATTAGGCCCAGTTTCATTGGCAGTGAACAAAAATCTGAGAGAAAAAATTGAAAGGGGAAACAAATCGGGAATGGAGAGTGAAGAGCTAGTGAAATGGTTGAATTCCAAGGAACCAAACTCTGTTTTGTTTGTGAGTTTTGGGAGTATGACTAGGTTTCCGCCGCCGCAAATGGCTGAGATTGCACATGGGCTTGAAGATTCCGGCATAAATTTCATATGGGTTATTCGAAACAAGGACAAAAACGACAGTGGAGAGGCGCCAGAGGGGCTGCCGGAGGGGTTCGAACAGATGATTAAGAATAAAAACAGGGGATTCATTGTTCGGATTTGGGCGCCGCAACTTTTGATTTTGGAGCACCCATCGACGGGGGGTTTCTTGACGCACTGTGGGTGGAATTCGTCCATTGAGGGGATCAGCGCCGGTCAGCCGATGGTGACGTGGCCGGTAAGCTCCGAGCAGTTTTATAATGAGAAGCTTCTGACGGAGGTGTTGCAGGTGGGGGTTCCGGTAGGGGCGCGGCGGTGGTGGAATATGAGCGATGAGATGAAGGAGATTGTGAGTAGAGAGAATGTGGAAAAGGGCGTGGGGTTTCTTATGGGGGCGACGGAGGAGGCGGCGGCAATTAGAGAGCGGGCGAAACAGCTCGGGGCTGCTGCGAACAGGGCAGTTCAAAGCGGCGGCTCGTCGGAGAACAATTTGATATCGTTGATGAAAGAATTGAGGTCAATTAAGGTTAACGATAAGGATTAA

Protein sequence

MNVHKNVKNNQDPIFLSKNISIKLETMSMLPQNKEFRITVLPLFASGHIIPIIDMARLFARHGATVTIIATESNASIFQNNIDHDFAAGFKIQTHIVSFPGAEVGLAPGIENYSDVSSRHLQAKIYQAFLILDKLIDQMIIPATRPDCILSDLSHPWTTDTAERLGVPRLVFSVSNFMAYSAEHSVMQHSPHQKVASDTEEFEIPGLPHHIQMTKSQQPEFLLRRDRFTAMMESYKEAERRSYGTVMNTFYELDGVYLEHYKKITGIKAWGLGPVSLAVNKNLREKIERGNKSGMESEELVKWLNSKEPNSVLFVSFGSMTRFPPPQMAEIAHGLEDSGINFIWVIRNKDKNDSGEAPEGLPEGFEQMIKNKNRGFIVRIWAPQLLILEHPSTGGFLTHCGWNSSIEGISAGQPMVTWPVSSEQFYNEKLLTEVLQVGVPVGARRWWNMSDEMKEIVSRENVEKGVGFLMGATEEAAAIRERAKQLGAAANRAVQSGGSSENNLISLMKELRSIKVNDKD
Homology
BLAST of ClCG02G012220 vs. NCBI nr
Match: XP_038900859.1 (soyasapogenol B glucuronide galactosyltransferase-like [Benincasa hispida])

HSP 1 Score: 866.7 bits (2238), Expect = 1.0e-247
Identity = 425/491 (86.56%), Postives = 455/491 (92.67%), Query Frame = 0

Query: 29  MLPQNKEFRITVLPLFASGHIIPIIDMARLFARHGATVTIIATESNASIFQNNIDHDFAA 88
           MLP+N EFRITVLPLFASGH+IPIIDMARLFA HGATVTII TESNA +FQN+ID DF A
Sbjct: 1   MLPENTEFRITVLPLFASGHVIPIIDMARLFAHHGATVTIITTESNACVFQNSIDRDFEA 60

Query: 89  GFKIQTHIVSFPGAEVGLAPGIENYSDVSSRHLQAKIYQAFLILDKLIDQMIIPATRPDC 148
           GFKIQTHIV+FPGAEVGLAPGIENYSDVSSRHLQAKIYQAFLILDKLIDQ IIPAT+PDC
Sbjct: 61  GFKIQTHIVTFPGAEVGLAPGIENYSDVSSRHLQAKIYQAFLILDKLIDQEIIPATQPDC 120

Query: 149 ILSDLSHPWTTDTAERLGVPRLVFSVSNFMAYSAEHSVMQHSPHQKVASDTEEFEIPGLP 208
           ILSDLSH WTTDTAERLGVPRLV SVSNFMAYSAEHSVMQH P QKVASDTE FEIPGLP
Sbjct: 121 ILSDLSHSWTTDTAERLGVPRLVVSVSNFMAYSAEHSVMQHHPEQKVASDTEAFEIPGLP 180

Query: 209 HHIQMTKSQQPEFLLRRDRFTAMMESYKEAERRSYGTVMNTFYELDGVYLEHYKKITGIK 268
           H IQMTKSQQPEF ++R+ FTAM+E YKEAERRSYGTVMNTFYELDGVYLEHYKKI GIK
Sbjct: 181 HRIQMTKSQQPEFFIQRNAFTAMLERYKEAERRSYGTVMNTFYELDGVYLEHYKKIIGIK 240

Query: 269 AWGLGPVSLAVNKNLREKIERGNKSGMESEELVKWLNSKEPNSVLFVSFGSMTRFPPPQM 328
           AWG+GPVSLAVNK+++ K ERGNKS +ESEEL++WLNSKEPNSVL+VSFGSM RFPPPQM
Sbjct: 241 AWGIGPVSLAVNKDMKGKSERGNKSNVESEELLEWLNSKEPNSVLYVSFGSMVRFPPPQM 300

Query: 329 AEIAHGLEDSGINFIWVIRNKDKNDSGEAPEGLPEGFEQMIKNKNRGFIVRIWAPQLLIL 388
           AEIAHGLEDSGINFIWVIRNK KND GE  EGLPEGFE+ IKNKNRG I+RIWAPQLLIL
Sbjct: 301 AEIAHGLEDSGINFIWVIRNKGKNDGGEEEEGLPEGFEERIKNKNRGLIIRIWAPQLLIL 360

Query: 389 EHPSTGGFLTHCGWNSSIEGISAGQPMVTWPVSSEQFYNEKLLTEVLQVGVPVGARRWWN 448
           EHPSTGGFLTHCGWNSSIEGISAG+PMVTWPVSSEQFY EKLLTEVLQVGVPVGA+ WWN
Sbjct: 361 EHPSTGGFLTHCGWNSSIEGISAGKPMVTWPVSSEQFYTEKLLTEVLQVGVPVGAQWWWN 420

Query: 449 MSDEMKEIVSRENVEKGVGFLMGATEEAAAIRERAKQLGAAANRAVQSGGSSENNLISLM 508
           M++EMKEIVSRE V KGVGFLMGAT+EA AIR+RA+QLGAAANRAVQSGGSSE NL+SLM
Sbjct: 421 MNEEMKEIVSREKVGKGVGFLMGATKEAVAIRKRAEQLGAAANRAVQSGGSSEQNLVSLM 480

Query: 509 KELRSIKVNDK 520
           KELR++K   K
Sbjct: 481 KELRAVKFKGK 491

BLAST of ClCG02G012220 vs. NCBI nr
Match: XP_022149559.1 (soyasapogenol B glucuronide galactosyltransferase-like [Momordica charantia])

HSP 1 Score: 775.8 bits (2002), Expect = 2.4e-220
Identity = 384/490 (78.37%), Postives = 428/490 (87.35%), Query Frame = 0

Query: 29  MLPQNKEFRITVLPLFASGHIIPIIDMARLFARHGATVTIIATESNASIFQNNIDHDFAA 88
           M+ +N+E RITVLPLFASGHIIPI+DMARLFARHGA VTII TESNA  FQN++  DFAA
Sbjct: 1   MVLENEELRITVLPLFASGHIIPIVDMARLFARHGAAVTIITTESNARSFQNDVARDFAA 60

Query: 89  GFKIQTHIVSFPGAEVGLAPGIENYSDVSSRHLQAKIYQAFLILDKLIDQMIIPATRPDC 148
           G+KIQT  V FP AEVGL PGIEN+SDV SR LQ KIY+AFLIL+K IDQ+IIP TRPDC
Sbjct: 61  GYKIQTRTVPFPAAEVGLPPGIENFSDVVSRDLQGKIYRAFLILEKQIDQVIIPETRPDC 120

Query: 149 ILSDLSHPWTTDTAERLGVPRLVFSVSNFMAYSAEHSVMQHSPHQKVASDTEEFEIPGLP 208
           ILSDLS+ WTTDTA RLGVPRLVF VSNFMAYSAEHSV+QH+PHQKV SD E FE+PGLP
Sbjct: 121 ILSDLSYGWTTDTAARLGVPRLVFFVSNFMAYSAEHSVLQHAPHQKVTSDFETFELPGLP 180

Query: 209 HHIQMTKSQQPEFLLRRDRFTAMMESYKEAERRSYGTVMNTFYELDGVYLEHYKKITGIK 268
           H IQMTKSQQPEFL++R +FT M+E YKEAERRSYG V NTFYELDGVYLEHYK+  GIK
Sbjct: 181 HKIQMTKSQQPEFLVQRSQFTEMIEKYKEAERRSYGIVTNTFYELDGVYLEHYKRTIGIK 240

Query: 269 AWGLGPVSLAVNKNLREKIERGNKSGMESEELVKWLNSKEPNSVLFVSFGSMTRFPPPQM 328
           AWGLGPVSLAVNK+L  KI+RGNKSGMES EL+ WLNSKEPNSVL+VSFGSMTRFP  Q+
Sbjct: 241 AWGLGPVSLAVNKDLIGKIDRGNKSGMESGELLDWLNSKEPNSVLYVSFGSMTRFPAAQI 300

Query: 329 AEIAHGLEDSGINFIWVIRNKDKNDSGEAPEGLPEGFEQ-MIKNKNRGFIVRIWAPQLLI 388
           AEIAHGLE +G NFIWVIR K++N+ GEA EGLPEGFE+ +++ K +G IVRIWAPQLLI
Sbjct: 301 AEIAHGLESAGRNFIWVIRKKNENEGGEAEEGLPEGFEERVVREKKKGLIVRIWAPQLLI 360

Query: 389 LEHPSTGGFLTHCGWNSSIEGISAGQPMVTWPVSSEQFYNEKLLTEVLQVGVPVGARRWW 448
           LEHPSTGGFLTHCGWNSSIEG+S GQPMVTWPVSSEQFYNEKLLTEVL+VGVPVGARRWW
Sbjct: 361 LEHPSTGGFLTHCGWNSSIEGVSTGQPMVTWPVSSEQFYNEKLLTEVLRVGVPVGARRWW 420

Query: 449 NMSDEMKE--IVSRENVEKGVGFLMGATEEAAAIRERAKQLGAAANRAVQSGGSSENNLI 508
           NMSDEM+E  IV RE V   VGFLMG  EEAAAIR RAK+LGAAA RAV  GGSSE N++
Sbjct: 421 NMSDEMEEEDIVGREEVAAAVGFLMGEAEEAAAIRRRAKELGAAAKRAVSEGGSSEKNVV 480

Query: 509 SLMKELRSIK 516
           S+++ELRS+K
Sbjct: 481 SVIEELRSLK 490

BLAST of ClCG02G012220 vs. NCBI nr
Match: KAF3972753.1 (hypothetical protein CMV_003760 [Castanea mollissima])

HSP 1 Score: 512.3 bits (1318), Expect = 4.9e-141
Identity = 241/481 (50.10%), Postives = 343/481 (71.31%), Query Frame = 0

Query: 35  EFRITVLPLFASGHIIPIIDMARLFARHGATVTIIATESNASIFQNNIDHDFAAGFKIQT 94
           + ++  LP   SGH+IP++D+ARLFA HG  VTII T +NA +FQ  ID D  +G +I+T
Sbjct: 7   QLKVFFLPFLVSGHMIPMVDLARLFAMHGVNVTIINTPANALLFQKAIDRDANSGHQIKT 66

Query: 95  HIVSFPGAEVGLAPGIENYSDVSSRHLQAKIYQAFLILDKLIDQMIIPATRPDCILSDLS 154
           HI+ FP A+VGL  GIEN++ ++S  +  K++ A  +L K I+Q+    T PDCI++D+ 
Sbjct: 67  HILEFPSAQVGLPEGIENFNMITSHGMSHKLHYALSLLQKPIEQLFQEMT-PDCIITDMF 126

Query: 155 HPWTTDTAERLGVPRLVFSVSNFMAYSAEHSVMQHSPHQKVASDTEEFEIPGLPHHIQMT 214
           +PWT D+A +LG+PRLVF  + + +  A  S+ QH+PH  V S+ + F +P LP  I+MT
Sbjct: 127 YPWTVDSAAKLGIPRLVFHTAGYFSQCAASSIKQHAPHLSVNSNADTFLLPDLPDTIEMT 186

Query: 215 KSQQPEFLLRRDRFTAMMESYKEAERRSYGTVMNTFYELDGVYLEHYKKITGIKAWGLGP 274
             Q P ++  +D +  +M+  +E+ER+SYG V+N+F+EL+  Y +HY  ITGIKAW +GP
Sbjct: 187 TLQLPRWVRTQDGYAQLMDRIRESERQSYGAVVNSFHELESAYEDHYTSITGIKAWSVGP 246

Query: 275 VSLAVNKNLREKIERGNKSGMESEELVKWLNSKEPNSVLFVSFGSMTRFPPPQMAEIAHG 334
           VSL VN++  +K+ERGNK   +  E + WLNSKE NSVL+VSFGS+ +F   Q+ E+AHG
Sbjct: 247 VSLCVNRDAADKVERGNKVAPDEHEWLNWLNSKECNSVLYVSFGSLNKFSTAQLIELAHG 306

Query: 335 LEDSGINFIWVIRNKDKNDSGEAPEGLPEGFEQMIKNKNRGFIVRIWAPQLLILEHPSTG 394
           L+ S   FIWV+R KDK+      EG  E FE+ IK  NRG I+R WAPQLLILEHP+ G
Sbjct: 307 LDASSHQFIWVLRQKDKDQD----EGWLEDFEKHIKESNRGLIIRDWAPQLLILEHPAIG 366

Query: 395 GFLTHCGWNSSIEGISAGQPMVTWPVSSEQFYNEKLLTEVLQVGVPVGARRWWNMSDEMK 454
           G +THCGWNS +EG++AG PM+TWP+ +EQFYNEK +T+V+++GV VG   W    +E +
Sbjct: 367 GLVTHCGWNSILEGVTAGLPMITWPLYAEQFYNEKFVTQVIKIGVSVGVTEWRQWDEEAR 426

Query: 455 EIVSRENVEKGVGFLMGATEEAAAIRERAKQLGAAANRAVQSGGSSENNLISLMKELRSI 514
           E+V RE +EK V FLMG+  +AA ++ RA++L  AA RA+Q+ GSS++NL+SL+KEL+S+
Sbjct: 427 EVVKREEIEKAVIFLMGSGVQAAEMKNRARELRNAAKRAIQNSGSSQSNLMSLIKELKSL 482

Query: 515 K 516
           K
Sbjct: 487 K 482

BLAST of ClCG02G012220 vs. NCBI nr
Match: XP_030923367.1 (soyasapogenol B glucuronide galactosyltransferase-like [Quercus lobata])

HSP 1 Score: 512.3 bits (1318), Expect = 4.9e-141
Identity = 241/481 (50.10%), Postives = 342/481 (71.10%), Query Frame = 0

Query: 35  EFRITVLPLFASGHIIPIIDMARLFARHGATVTIIATESNASIFQNNIDHDFAAGFKIQT 94
           + ++  LP   SGH+IP++D+ARLFA HG  VTII T +NA +FQ  ID D  +G +I+T
Sbjct: 7   QLKVFFLPFLVSGHMIPMVDLARLFAMHGVNVTIITTPANALLFQKAIDRDANSGLQIKT 66

Query: 95  HIVSFPGAEVGLAPGIENYSDVSSRHLQAKIYQAFLILDKLIDQMIIPATRPDCILSDLS 154
           HI+ FP A+VGL  GIEN++ ++S  +  K+Y A  +L K I+Q+    T PDCI++D+ 
Sbjct: 67  HILEFPSAQVGLPEGIENFNTITSHGMSHKLYHALSLLQKPIEQLFQEMT-PDCIITDMF 126

Query: 155 HPWTTDTAERLGVPRLVFSVSNFMAYSAEHSVMQHSPHQKVASDTEEFEIPGLPHHIQMT 214
           +PWT D+A +LG+PRLVF  + + +  A  S+ Q++PH  V S+ + F +P LP  I+MT
Sbjct: 127 YPWTVDSAAKLGIPRLVFHTAGYFSQCAASSIKQYAPHLSVNSNADTFLLPDLPDTIEMT 186

Query: 215 KSQQPEFLLRRDRFTAMMESYKEAERRSYGTVMNTFYELDGVYLEHYKKITGIKAWGLGP 274
             Q P ++  +D +  +M+  +E+ER+SYG V+N+F+EL+  Y EHY  ITGIKAW +GP
Sbjct: 187 TLQLPRWVRTQDGYAQLMDRIRESERQSYGAVVNSFHELESAYEEHYASITGIKAWSVGP 246

Query: 275 VSLAVNKNLREKIERGNKSGMESEELVKWLNSKEPNSVLFVSFGSMTRFPPPQMAEIAHG 334
           VSL VN++  +K+ERGNK   +  E + WLNSKE +SVL+VSFGS+ +F   Q+ E+AHG
Sbjct: 247 VSLCVNRDAADKVERGNKVAPKENEWLNWLNSKECSSVLYVSFGSLNKFSTAQLIELAHG 306

Query: 335 LEDSGINFIWVIRNKDKNDSGEAPEGLPEGFEQMIKNKNRGFIVRIWAPQLLILEHPSTG 394
           L+ S   FIWV+R KDK+      EG  E FE+ IK  NRG I+R WAPQLLILEH + G
Sbjct: 307 LDASSHQFIWVVRQKDKDQD----EGWLEDFEKHIKESNRGLIIRDWAPQLLILEHQAIG 366

Query: 395 GFLTHCGWNSSIEGISAGQPMVTWPVSSEQFYNEKLLTEVLQVGVPVGARRWWNMSDEMK 454
           G +THCGWNS +EG++AG PM+TWP+ +EQFYNEK +T+V+++GV VG   W    +E +
Sbjct: 367 GLVTHCGWNSILEGVTAGLPMITWPLYAEQFYNEKFVTQVIKIGVSVGVTEWRQWDEEAR 426

Query: 455 EIVSRENVEKGVGFLMGATEEAAAIRERAKQLGAAANRAVQSGGSSENNLISLMKELRSI 514
           E+V RE +EK V FLMG+  +AAA++ +A++LG AA RA+QS GSS++NL S++KEL S+
Sbjct: 427 EVVKREEIEKAVIFLMGSGVQAAAMKNQARELGNAARRAIQSSGSSQSNLTSMIKELTSL 482

Query: 515 K 516
           K
Sbjct: 487 K 482

BLAST of ClCG02G012220 vs. NCBI nr
Match: XP_030949641.1 (soyasapogenol B glucuronide galactosyltransferase-like [Quercus lobata])

HSP 1 Score: 504.2 bits (1297), Expect = 1.3e-138
Identity = 243/483 (50.31%), Postives = 348/483 (72.05%), Query Frame = 0

Query: 35  EFRITVLPLF-ASGHIIPIIDMARLFARHGATVTIIATESNASIFQNNIDHDFAAGFKIQ 94
           E ++  LP F A GH+IP++D ARLFA HG  VTII T +NA +FQ  ID +  +G +I+
Sbjct: 4   ELKVIFLPFFLAPGHLIPVVDTARLFAMHGVNVTIITTPANALLFQKAIDRNANSGHQIK 63

Query: 95  THIVSFPGAEVGLAPGIENYSDVSSRHLQAKIYQAFLILDKLIDQMIIPATRPDCILSDL 154
           TH++ FP  +VGL  GIEN++ V+S  + +K++    +L   I+Q +    +PDCI+SD+
Sbjct: 64  THVLQFPSDQVGLPQGIENFNTVTSLGMTSKLFHGLSLLQPQIEQ-LFQDMQPDCIVSDM 123

Query: 155 SHPWTTDTAERLGVPRLVFSVSNFMAYSAEHSVMQHSPHQKVASDTEEFEIPGLPHHIQM 214
            +PWT D+A +LG+PRL+  V+ + +  A++ V Q+ PH+ V SDT+ F +PGLP+ I+M
Sbjct: 124 FYPWTVDSAAKLGIPRLLLYVTCYFSLCAQNCVQQYKPHESVNSDTDLFLLPGLPNKIEM 183

Query: 215 TKSQQPEFLLRRDRFTAMMESYKEAERRSYGTVM-NTFYELDGVYLEHYKKITGIKAWGL 274
           T+ Q PE+L   + +T +M+  KE+ERRSYG+++ N+FYEL+G Y E +K   GI+ W +
Sbjct: 184 TRLQLPEWLRTPNGYTQLMDKIKESERRSYGSILANSFYELEGAYEELHKNSMGIRTWSV 243

Query: 275 GPVSLAVNKNLREKIERGNKSGMESEELVKWLNSKEPNSVLFVSFGSMTRFPPPQMAEIA 334
           GPVSL VNK++ +K+ERGNK+ +E  EL+ WLN+KE NSVL++ FGS ++FP  Q+ E+A
Sbjct: 244 GPVSLRVNKDVADKVERGNKAAVEEHELLNWLNAKECNSVLYICFGSSSKFPTAQLIEMA 303

Query: 335 HGLEDSGINFIWVIRNKDKNDSGEAPEGLPEGFEQMIKNKNRGFIVRIWAPQLLILEHPS 394
           HGLE SG  FIWV+R KD    G+  EG    FE+ +K  NRG I+R WAPQ+LILEHP+
Sbjct: 304 HGLEASGHQFIWVVRQKD----GDQSEGWLGDFEKRMKESNRGLIIRGWAPQILILEHPA 363

Query: 395 TGGFLTHCGWNSSIEGISAGQPMVTWPVSSEQFYNEKLLTEVLQVGVPVGARRWWNMSDE 454
            GG +THCG NS +EG++AG PM+ WP+ +EQFY EKL+TEVL++GV VG + W   ++E
Sbjct: 364 IGGQVTHCGSNSLLEGVTAGLPMIAWPLYAEQFYLEKLVTEVLKIGVAVGKKEWSIWAEE 423

Query: 455 MKEIVSRENVEKGVGFLMGATEEAAAIRERAKQLGAAANRAVQSGGSSENNLISLMKELR 514
            KE+V R N+EK V FLMG+ EEAA +R RAK+LG AA +AV+S GSS++N + L+  L+
Sbjct: 424 TKEVVKRNNIEKAVKFLMGSGEEAAEMRNRAKELGNAARKAVESRGSSQSNFMGLISGLK 481

Query: 515 SIK 516
           S+K
Sbjct: 484 SLK 481

BLAST of ClCG02G012220 vs. ExPASy Swiss-Prot
Match: D4Q9Z4 (Soyasapogenol B glucuronide galactosyltransferase OS=Glycine max OX=3847 GN=GmSGT2 PE=1 SV=1)

HSP 1 Score: 472.2 bits (1214), Expect = 7.4e-132
Identity = 224/484 (46.28%), Postives = 338/484 (69.83%), Query Frame = 0

Query: 35  EFRITVLPLFASGHIIPIIDMARLFARHGATVTIIATESNASIFQNNIDHDFAAGFKIQT 94
           E +   LP  ++ HIIP++DMARLFA H   VTII T  NA++FQ +ID D + G  I+T
Sbjct: 7   ELKSIFLPFLSTSHIIPLVDMARLFALHDVDVTIITTAHNATVFQKSIDLDASRGRPIRT 66

Query: 95  HIVSFPGAEVGLAPGIENYSDVSSRHLQAKIYQAFLILDKLIDQMIIPATRPDCILSDLS 154
           H+V+FP A+VGL  GIE ++  + R +  +IY    +L ++ ++ +    +PD I++D+ 
Sbjct: 67  HVVNFPAAQVGLPVGIEAFNVDTPREMTPRIYMGLSLLQQVFEK-LFHDLQPDFIVTDMF 126

Query: 155 HPWTTDTAERLGVPRLVFSVSNFMAYSAEHSVMQHSPHQKVASDTEEFEIPGLPHHIQMT 214
           HPW+ D A +LG+PR++F  ++++A SA HSV Q++PH +   DT++F +PGLP +++MT
Sbjct: 127 HPWSVDAAAKLGIPRIMFHGASYLARSAAHSVEQYAPHLEAKFDTDKFVLPGLPDNLEMT 186

Query: 215 KSQQPEFLLRRDRFTAMMESYKEAERRSYGTVMNTFYELDGVYLEHYKKITGIKAWGLGP 274
           + Q P++L   +++T +M + K++E++SYG++ N+FY+L+  Y EHYK I G K+WG+GP
Sbjct: 187 RLQLPDWLRSPNQYTELMRTIKQSEKKSYGSLFNSFYDLESAYYEHYKSIMGTKSWGIGP 246

Query: 275 VSLAVNKNLREKIERG-NKSGMESEELVKWLNSKEPNSVLFVSFGSMTRFPPPQMAEIAH 334
           VSL  N++ ++K  RG  K   E E  +KWLNSK  +SVL+VSFGS+ +FP  Q+ EIA 
Sbjct: 247 VSLWANQDAQDKAARGYAKEEEEKEGWLKWLNSKAESSVLYVSFGSINKFPYSQLVEIAR 306

Query: 335 GLEDSGINFIWVIRNKDKNDSGEAPEGLPEGFEQMIKNKNRGFIVRIWAPQLLILEHPST 394
            LEDSG +FIWV+R   KND GE    L E FE+ +K  N+G+++  WAPQLLILE+P+ 
Sbjct: 307 ALEDSGHDFIWVVR---KNDGGEGDNFLEE-FEKRMKESNKGYLIWGWAPQLLILENPAI 366

Query: 395 GGFLTHCGWNSSIEGISAGQPMVTWPVSSEQFYNEKLLTEVLQVGVPVGARRWWNMSDEM 454
           GG +THCGWN+ +E ++AG PM TWP+ +E F+NEKL+ +VL++GVPVGA+ W N ++  
Sbjct: 367 GGLVTHCGWNTVVESVNAGLPMATWPLFAEHFFNEKLVVDVLKIGVPVGAKEWRNWNEFG 426

Query: 455 KEIVSRENVEKGVGFLMGATEEAAAIRERAKQLGAAANRAVQSGGSSENNLISLMKELRS 514
            E+V RE +   +  LM   EE   +R+RAK+L  AA  A++ GGSS NN+  L++EL+ 
Sbjct: 427 SEVVKREEIGNAIASLMSEEEEDGGMRKRAKELSVAAKSAIKVGGSSHNNMKELIRELKE 485

Query: 515 IKVN 518
           IK++
Sbjct: 487 IKLS 485

BLAST of ClCG02G012220 vs. ExPASy Swiss-Prot
Match: Q9AT54 (Scopoletin glucosyltransferase OS=Nicotiana tabacum OX=4097 GN=TOGT1 PE=1 SV=1)

HSP 1 Score: 399.1 bits (1024), Expect = 7.9e-110
Identity = 200/472 (42.37%), Postives = 302/472 (63.98%), Query Frame = 0

Query: 42  PLFASGHIIPIIDMARLFARHGATVTIIATESNASIFQNNIDHDFAAGFKIQTHIVSFPG 101
           P+ A GH+IP +DMA+LFA  G   TII T  N  +F   I  +   G +I+  ++ FP 
Sbjct: 10  PVMAHGHMIPTLDMAKLFASRGVKATIITTPLNEFVFSKAIQRNKHLGIEIEIRLIKFPA 69

Query: 102 AEVGLAPGIENYSDVSSRHLQAKIYQAFLILDKLIDQMIIPATRPDCILSDLSHPWTTDT 161
            E GL    E    + S       ++A  ++ + ++Q +I   RPDC++SD+  PWTTDT
Sbjct: 70  VENGLPEECERLDQIPSDEKLPNFFKAVAMMQEPLEQ-LIEECRPDCLISDMFLPWTTDT 129

Query: 162 AERLGVPRLVFSVSNFMAYSAEHSVMQHSPHQKVASDTEEFEIPGLPHHIQMTKSQQPEF 221
           A +  +PR+VF  ++F A   E+SV  + P + V+SD+E F +P LPH I++T++Q   F
Sbjct: 130 AAKFNIPRIVFHGTSFFALCVENSVRLNKPFKNVSSDSETFVVPDLPHEIKLTRTQVSPF 189

Query: 222 LLRRDR--FTAMMESYKEAERRSYGTVMNTFYELDGVYLEHYKKITGIKAWGLGPVSLAV 281
               +    T M+++ +E++ +SYG V N+FYEL+  Y+EHY K+ G +AW +GP+S+  
Sbjct: 190 ERSGEETAMTRMIKTVRESDSKSYGVVFNSFYELETDYVEHYTKVLGRRAWAIGPLSMC- 249

Query: 282 NKNLREKIERGNKSGMESEELVKWLNSKEPNSVLFVSFGSMTRFPPPQMAEIAHGLEDSG 341
           N+++ +K ERG KS ++  E +KWL+SK+P+SV++V FGS+  F   Q+ E+A G+E SG
Sbjct: 250 NRDIEDKAERGKKSSIDKHECLKWLDSKKPSSVVYVCFGSVANFTASQLHELAMGIEASG 309

Query: 342 INFIWVIRNKDKNDSGEAPEGLPEGFEQMIKNKNRGFIVRIWAPQLLILEHPSTGGFLTH 401
             FIWV+R +  N+     + LPEGFE+  + K +G I+R WAPQ+LIL+H S G F+TH
Sbjct: 310 QEFIWVVRTELDNE-----DWLPEGFEE--RTKEKGLIIRGWAPQVLILDHESVGAFVTH 369

Query: 402 CGWNSSIEGISAGQPMVTWPVSSEQFYNEKLLTEVLQVGVPVGARRWWNMSDEMKEIVSR 461
           CGWNS++EG+S G PMVTWPV +EQF+NEKL+TEVL+ G  VG+ +W   + E    V R
Sbjct: 370 CGWNSTLEGVSGGVPMVTWPVFAEQFFNEKLVTEVLKTGAGVGSIQWKRSASEG---VKR 429

Query: 462 ENVEKGVGFLMGATEEAAAIRERAKQLGAAANRAVQSGGSSENNLISLMKEL 512
           E + K +  +M  +EEA   R RAK     A +A++ GGSS   L +L++++
Sbjct: 430 EAIAKAIKRVM-VSEEADGFRNRAKAYKEMARKAIEEGGSSYTGLTTLLEDI 468

BLAST of ClCG02G012220 vs. ExPASy Swiss-Prot
Match: Q2V6J9 (UDP-glucose flavonoid 3-O-glucosyltransferase 7 OS=Fragaria ananassa OX=3747 GN=GT7 PE=1 SV=1)

HSP 1 Score: 397.5 bits (1020), Expect = 2.3e-109
Identity = 213/487 (43.74%), Postives = 303/487 (62.22%), Query Frame = 0

Query: 34  KEFRITVLPLFASGHIIPIIDMARLFARHGATVTIIATESNASIFQNNIDHDFAAGFKIQ 93
           ++  I  LP  A GH IP+ D+A+LF+ HGA  TI+ T  NA +F            +I+
Sbjct: 9   QQLHIFFLPFMARGHSIPLTDIAKLFSSHGARCTIVTTPLNAPLFSKATQRG-----EIE 68

Query: 94  THIVSFPGAEVGLAPGIENYSDVSSRHLQAKIYQAFLILDKLIDQMIIPATRPDCILSDL 153
             ++ FP AE GL    E+   ++++ +  K  +A  +++   ++ I+   RP C+++D 
Sbjct: 69  LVLIKFPSAEAGLPQDCESADLITTQDMLGKFVKATFLIEPHFEK-ILDEHRPHCLVADA 128

Query: 154 SHPWTTDTAERLGVPRLVFSVSNFMAYSAEHSVMQHSPHQKVASDTEEFEIPGLPHHIQM 213
              W TD A +  +PRL F  + F A  A  SVM + PH  ++SD+E F IP LP  I+M
Sbjct: 129 FFTWATDVAAKFRIPRLYFHGTGFFALCASLSVMMYQPHSNLSSDSESFVIPNLPDEIKM 188

Query: 214 TKSQQPEFLLRRDRFTAMMESYKEAERRSYGTVMNTFYELDGVYLEHYKKITGIKAWGLG 273
           T+SQ P F      F  M+++  E E RSYG ++N+FYEL+  Y  HY+K+ G KAW +G
Sbjct: 189 TRSQLPVF-PDESEFMKMLKASIEIEERSYGVIVNSFYELEPAYANHYRKVFGRKAWHIG 248

Query: 274 PVSLAVNKNLREKIERGN--KSGMESEELVKWLNSKEPNSVLFVSFGSMTRFPPPQMAEI 333
           PVS   NK + +K ERG+   S  E  E +KWL+SK+P SV++VSFGSM RF   Q+ EI
Sbjct: 249 PVSFC-NKAIEDKAERGSIKSSTAEKHECLKWLDSKKPRSVVYVSFGSMVRFADSQLLEI 308

Query: 334 AHGLEDSGINFIWVIRNKDKNDSGEAPEGLPEGFEQMIKNKNRGFIVRIWAPQLLILEHP 393
           A GLE SG +FIWV++ + K    E  E LPEGFE+ ++ K  G I+R WAPQ+LILEH 
Sbjct: 309 ATGLEASGQDFIWVVKKEKK----EVEEWLPEGFEKRMEGK--GLIIRDWAPQVLILEHE 368

Query: 394 STGGFLTHCGWNSSIEGISAGQPMVTWPVSSEQFYNEKLLTEVLQVGVPVGARRW----W 453
           + G F+THCGWNS +E +SAG PM+TWPV  EQFYNEKL+TE+ ++GVPVG+ +W     
Sbjct: 369 AIGAFVTHCGWNSILEAVSAGVPMITWPVFGEQFYNEKLVTEIHRIGVPVGSEKWALSFV 428

Query: 454 NMSDEMKEIVSRENVEKGVGFLMGATEEAAAIRERAKQLGAAANRAVQSGGSSENNLISL 513
           +++ E +  V RE +E+ V  +M   +EA   R R K+LG  A RAV+ GGSS  +L +L
Sbjct: 429 DVNAETEGRVRREAIEEAVTRIM-VGDEAVETRSRVKELGENARRAVEEGGSSFLDLSAL 480

Query: 514 MKELRSI 515
           + EL  +
Sbjct: 489 VGELNDL 480

BLAST of ClCG02G012220 vs. ExPASy Swiss-Prot
Match: Q7Y232 (UDP-glycosyltransferase 73B4 OS=Arabidopsis thaliana OX=3702 GN=UGT73B4 PE=2 SV=1)

HSP 1 Score: 389.0 bits (998), Expect = 8.2e-107
Identity = 212/495 (42.83%), Postives = 311/495 (62.83%), Query Frame = 0

Query: 34  KEFRITVLPLFASGHIIPIIDMARLFARHGATVTIIATESNASIFQNNIDHDFAAGFKIQ 93
           ++  I   P  A GH+IP++DMA+LFAR GA  T++ T  NA I +  I+      FK+Q
Sbjct: 4   EQIHILFFPFMAHGHMIPLLDMAKLFARRGAKSTLLTTPINAKILEKPIE-----AFKVQ 63

Query: 94  T-------HIVSFPGAEVGLAPGIENYSDVSS--RHLQAKIYQAFLILDKLIDQMI---I 153
                    I++FP  E+GL  G EN   ++S  +     ++  FL   K + Q +   I
Sbjct: 64  NPDLEIGIKILNFPCVELGLPEGCENRDFINSYQKSDSFDLFLKFLFSTKYMKQQLESFI 123

Query: 154 PATRPDCILSDLSHPWTTDTAERLGVPRLVFSVSNFMAYSAEHSVMQHSPHQKVASDTEE 213
             T+P  +++D+  PW T++AE++GVPRLVF  ++  A    +++  H PH+KVAS +  
Sbjct: 124 ETTKPSALVADMFFPWATESAEKIGVPRLVFHGTSSFALCCSYNMRIHKPHKKVASSSTP 183

Query: 214 FEIPGLPHHIQMTKSQQPEFLLRRDRFTAMMESYKEAERRSYGTVMNTFYELDGVYLEHY 273
           F IPGLP  I +T+  Q         F    +  +E+E  S+G ++N+FYEL+  Y + Y
Sbjct: 184 FVIPGLPGDIVITE-DQANVTNEETPFGKFWKEVRESETSSFGVLVNSFYELESSYADFY 243

Query: 274 KKITGIKAWGLGPVSLAVNKNLREKIERGNKSGMESEELVKWLNSKEPNSVLFVSFGSMT 333
           +     KAW +GP+SL+ N+ + EK  RG K+ ++ +E +KWL+SK P SV+++SFGS T
Sbjct: 244 RSFVAKKAWHIGPLSLS-NRGIAEKAGRGKKANIDEQECLKWLDSKTPGSVVYLSFGSGT 303

Query: 334 RFPPPQMAEIAHGLEDSGINFIWVI-RNKDKNDSGEAPEGLPEGFEQMIKNKNRGFIVRI 393
             P  Q+ EIA GLE SG NFIWV+ +N+++  +GE  + LP+GFE+  +NK +G I+R 
Sbjct: 304 GLPNEQLLEIAFGLEGSGQNFIWVVSKNENQVGTGENEDWLPKGFEE--RNKGKGLIIRG 363

Query: 394 WAPQLLILEHPSTGGFLTHCGWNSSIEGISAGQPMVTWPVSSEQFYNEKLLTEVLQVGVP 453
           WAPQ+LIL+H + GGF+THCGWNS++EGI+AG PMVTWP+ +EQFYNEKLLT+VL++GV 
Sbjct: 364 WAPQVLILDHKAIGGFVTHCGWNSTLEGIAAGLPMVTWPMGAEQFYNEKLLTKVLRIGVN 423

Query: 454 VGARRWWNMSDEMKEIVSRENVEKGVGFLMGATEEAAAIRERAKQLGAAANRAVQSGGSS 513
           VGA           +++SR  VEK V  ++G  E+A   R RAK+LG  A  AV+ GGSS
Sbjct: 424 VGATELVKKG----KLISRAQVEKAVREVIGG-EKAEERRLRAKELGEMAKAAVEEGGSS 483

Query: 514 ENNLISLMKELRSIK 516
            N++   M+EL   K
Sbjct: 484 YNDVNKFMEELNGRK 484

BLAST of ClCG02G012220 vs. ExPASy Swiss-Prot
Match: Q94C57 (UDP-glucosyl transferase 73B2 OS=Arabidopsis thaliana OX=3702 GN=UGT73B2 PE=1 SV=1)

HSP 1 Score: 386.3 bits (991), Expect = 5.3e-106
Identity = 218/491 (44.40%), Postives = 308/491 (62.73%), Query Frame = 0

Query: 33  NKEFRITVLPLFASGHIIPIIDMARLFARHGATVTIIATESNASIFQNNID--HDFAAGF 92
           +++  +   P  A GH+IP +DMA+LF+  GA  TI+ T  N+ I Q  ID   +   G 
Sbjct: 7   HRKLHVMFFPFMAYGHMIPTLDMAKLFSSRGAKSTILTTSLNSKILQKPIDTFKNLNPGL 66

Query: 93  KIQTHIVSFPGAEVGLAPGIENYSDVSSRHLQAK---IYQAFLILDKLIDQM--IIPATR 152
           +I   I +FP  E+GL  G EN    +S +   K   I + F       DQ+  ++  TR
Sbjct: 67  EIDIQIFNFPCVELGLPEGCENVDFFTSNNNDDKNEMIVKFFFSTRFFKDQLEKLLGTTR 126

Query: 153 PDCILSDLSHPWTTDTAERLGVPRLVFSVSNFMAYSAEHSVMQHSPHQKVASDTEEFEIP 212
           PDC+++D+  PW T+ A +  VPRLVF  + + +  A + +  H P ++VAS +E F IP
Sbjct: 127 PDCLIADMFFPWATEAAGKFNVPRLVFHGTGYFSLCAGYCIGVHKPQKRVASSSEPFVIP 186

Query: 213 GLPHHIQMTKSQQPEFLLRRDRFTAM---MESYKEAERRSYGTVMNTFYELDGVYLEHYK 272
            LP +I +T+ Q    ++  D  + M   M   +E+E +S G V+N+FYEL+  Y + YK
Sbjct: 187 ELPGNIVITEEQ----IIDGDGESDMGKFMTEVRESEVKSSGVVLNSFYELEHDYADFYK 246

Query: 273 KITGIKAWGLGPVSLAVNKNLREKIERGNKSGMESEELVKWLNSKEPNSVLFVSFGSMTR 332
                +AW +GP+S+  N+   EK ERG K+ ++  E +KWL+SK+PNSV++VSFGS+  
Sbjct: 247 SCVQKRAWHIGPLSV-YNRGFEEKAERGKKANIDEAECLKWLDSKKPNSVIYVSFGSVAF 306

Query: 333 FPPPQMAEIAHGLEDSGINFIWVIRNKDKNDSGEAPEGLPEGFEQMIKNKNRGFIVRIWA 392
           F   Q+ EIA GLE SG +FIWV+R K K+D     E LPEGFE+ +K K  G I+R WA
Sbjct: 307 FKNEQLFEIAAGLEASGTSFIWVVR-KTKDD---REEWLPEGFEERVKGK--GMIIRGWA 366

Query: 393 PQLLILEHPSTGGFLTHCGWNSSIEGISAGQPMVTWPVSSEQFYNEKLLTEVLQVGVPVG 452
           PQ+LIL+H +TGGF+THCGWNS +EG++AG PMVTWPV +EQFYNEKL+T+VL+ GV VG
Sbjct: 367 PQVLILDHQATGGFVTHCGWNSLLEGVAAGLPMVTWPVGAEQFYNEKLVTQVLRTGVSVG 426

Query: 453 ARRWWNMSDEMKEIVSRENVEKGVGFLMGATEEAAAIRERAKQLGAAANRAVQSGGSSEN 512
           A +  +M   M + +SRE V+K V  ++ A E A   R RAK+L A A  AV+ GGSS N
Sbjct: 427 ASK--HMKVMMGDFISREKVDKAVREVL-AGEAAEERRRRAKKLAAMAKAAVEEGGSSFN 483

Query: 513 NLISLMKELRS 514
           +L S M+E  S
Sbjct: 487 DLNSFMEEFSS 483

BLAST of ClCG02G012220 vs. ExPASy TrEMBL
Match: A0A6J1D7D4 (Glycosyltransferase OS=Momordica charantia OX=3673 GN=LOC111017964 PE=3 SV=1)

HSP 1 Score: 775.8 bits (2002), Expect = 1.1e-220
Identity = 384/490 (78.37%), Postives = 428/490 (87.35%), Query Frame = 0

Query: 29  MLPQNKEFRITVLPLFASGHIIPIIDMARLFARHGATVTIIATESNASIFQNNIDHDFAA 88
           M+ +N+E RITVLPLFASGHIIPI+DMARLFARHGA VTII TESNA  FQN++  DFAA
Sbjct: 1   MVLENEELRITVLPLFASGHIIPIVDMARLFARHGAAVTIITTESNARSFQNDVARDFAA 60

Query: 89  GFKIQTHIVSFPGAEVGLAPGIENYSDVSSRHLQAKIYQAFLILDKLIDQMIIPATRPDC 148
           G+KIQT  V FP AEVGL PGIEN+SDV SR LQ KIY+AFLIL+K IDQ+IIP TRPDC
Sbjct: 61  GYKIQTRTVPFPAAEVGLPPGIENFSDVVSRDLQGKIYRAFLILEKQIDQVIIPETRPDC 120

Query: 149 ILSDLSHPWTTDTAERLGVPRLVFSVSNFMAYSAEHSVMQHSPHQKVASDTEEFEIPGLP 208
           ILSDLS+ WTTDTA RLGVPRLVF VSNFMAYSAEHSV+QH+PHQKV SD E FE+PGLP
Sbjct: 121 ILSDLSYGWTTDTAARLGVPRLVFFVSNFMAYSAEHSVLQHAPHQKVTSDFETFELPGLP 180

Query: 209 HHIQMTKSQQPEFLLRRDRFTAMMESYKEAERRSYGTVMNTFYELDGVYLEHYKKITGIK 268
           H IQMTKSQQPEFL++R +FT M+E YKEAERRSYG V NTFYELDGVYLEHYK+  GIK
Sbjct: 181 HKIQMTKSQQPEFLVQRSQFTEMIEKYKEAERRSYGIVTNTFYELDGVYLEHYKRTIGIK 240

Query: 269 AWGLGPVSLAVNKNLREKIERGNKSGMESEELVKWLNSKEPNSVLFVSFGSMTRFPPPQM 328
           AWGLGPVSLAVNK+L  KI+RGNKSGMES EL+ WLNSKEPNSVL+VSFGSMTRFP  Q+
Sbjct: 241 AWGLGPVSLAVNKDLIGKIDRGNKSGMESGELLDWLNSKEPNSVLYVSFGSMTRFPAAQI 300

Query: 329 AEIAHGLEDSGINFIWVIRNKDKNDSGEAPEGLPEGFEQ-MIKNKNRGFIVRIWAPQLLI 388
           AEIAHGLE +G NFIWVIR K++N+ GEA EGLPEGFE+ +++ K +G IVRIWAPQLLI
Sbjct: 301 AEIAHGLESAGRNFIWVIRKKNENEGGEAEEGLPEGFEERVVREKKKGLIVRIWAPQLLI 360

Query: 389 LEHPSTGGFLTHCGWNSSIEGISAGQPMVTWPVSSEQFYNEKLLTEVLQVGVPVGARRWW 448
           LEHPSTGGFLTHCGWNSSIEG+S GQPMVTWPVSSEQFYNEKLLTEVL+VGVPVGARRWW
Sbjct: 361 LEHPSTGGFLTHCGWNSSIEGVSTGQPMVTWPVSSEQFYNEKLLTEVLRVGVPVGARRWW 420

Query: 449 NMSDEMKE--IVSRENVEKGVGFLMGATEEAAAIRERAKQLGAAANRAVQSGGSSENNLI 508
           NMSDEM+E  IV RE V   VGFLMG  EEAAAIR RAK+LGAAA RAV  GGSSE N++
Sbjct: 421 NMSDEMEEEDIVGREEVAAAVGFLMGEAEEAAAIRRRAKELGAAAKRAVSEGGSSEKNVV 480

Query: 509 SLMKELRSIK 516
           S+++ELRS+K
Sbjct: 481 SVIEELRSLK 490

BLAST of ClCG02G012220 vs. ExPASy TrEMBL
Match: A0A2N9I9F6 (Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS48797 PE=4 SV=1)

HSP 1 Score: 535.4 bits (1378), Expect = 2.6e-148
Identity = 260/487 (53.39%), Postives = 364/487 (74.74%), Query Frame = 0

Query: 32  QNKEFRITVLPLF-ASGHIIPIIDMARLFARHGATVTIIATESNASIFQNNIDHDFAAGF 91
           Q  + ++  LP F   GH+IP++D ARLFARHG +VTII T +NA +FQ  ID D  +G 
Sbjct: 4   QVDKLKVIFLPFFLVPGHLIPLVDTARLFARHGVSVTIITTTANALLFQRAIDCDANSGH 63

Query: 92  KIQTHIVSFPGAEVGLAPGIENYSDVSSRHLQAKIYQAFLILDKLIDQMIIPATRPDCIL 151
           +I TH++ FP A+VGL  GIENY+ ++S    +K+     +L K I+Q +    RPDCI+
Sbjct: 64  QINTHVLQFPSAQVGLPEGIENYNTMTSNDTNSKLLHGLSLLRKPIEQ-LFQDMRPDCIV 123

Query: 152 SDLSHPWTTDTAERLGVPRLVFSVSNFMAYSAEHSVMQHSPHQKVASDTEEFEIPGLPHH 211
           SD+ +PWT ++A RLG+PRLVF V+++ ++ AE  + Q+ PHQ V SDTE F IPGLP+ 
Sbjct: 124 SDMFYPWTVESAARLGIPRLVFHVTSYFSFCAETCIEQYKPHQSVNSDTEPFLIPGLPNK 183

Query: 212 IQMTKSQQPEFLLRRDRFTAMMESYKEAERRSYGTVMNTFYELDGVYLEHYKKITGIKAW 271
           I+MT+ + P+++  +DRFT ++   KE+ERRSYG ++N+FYEL+G Y E +K+  GIKAW
Sbjct: 184 IEMTRLKLPDWVKTQDRFTQLLNIIKESERRSYGAIVNSFYELEGGYEELHKRNMGIKAW 243

Query: 272 GLGPVSLAVNKNLREKIERGNK-SGMESEELVKWLNSKEPNSVLFVSFGSMTRFPPPQMA 331
            +GPVSL VNK++ +K+ERGNK +  E +E +KWLN+KE NSVL+VSFGSMT+FP PQ+ 
Sbjct: 244 SVGPVSLWVNKDVADKVERGNKVAPPEEQEWLKWLNAKECNSVLYVSFGSMTKFPTPQLI 303

Query: 332 EIAHGLEDSGINFIWVIRNKDKNDSGEAPEGLPEGFEQMIK-NKNRGFIVRIWAPQLLIL 391
           E+AHGLE SG  FIWV+  KDK+      EG  E F++ +K +K+RG I+R WAPQLLIL
Sbjct: 304 EMAHGLEASGHQFIWVVPKKDKDQD----EGWLEDFQKRMKESKHRGLIIRGWAPQLLIL 363

Query: 392 EHPSTGGFLTHCGWNSSIEGISAGQPMVTWPVSSEQFYNEKLLTEVLQVGVPVGARRWWN 451
           EHP+ GG +THCGWNS +EG++AG PM+TWP+ +EQFY+EKL+TEVL++GV VG + W  
Sbjct: 364 EHPAIGGQVTHCGWNSFLEGVTAGLPMITWPLFAEQFYHEKLVTEVLKIGVAVGKKEWSR 423

Query: 452 MSDEMKEIVSRENVEKGVGFLMGATEEAAAIRERAKQLGAAANRAVQSGGSSENNLISLM 511
            ++E KE+V RE++EK V FLMG+ EEA  +++RA++LG AA RAVQ+GGSS++N + L+
Sbjct: 424 WANEAKEVVKREDIEKAVKFLMGSAEEATEMKKRARELGNAARRAVQTGGSSQSNFMDLI 483

Query: 512 KELRSIK 516
            EL+S+K
Sbjct: 484 NELKSLK 485

BLAST of ClCG02G012220 vs. ExPASy TrEMBL
Match: A0A2N9FF75 (Glycosyltransferase OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS13306 PE=3 SV=1)

HSP 1 Score: 518.1 bits (1333), Expect = 4.3e-143
Identity = 248/484 (51.24%), Postives = 346/484 (71.49%), Query Frame = 0

Query: 32  QNKEFRITVLPLFASGHIIPIIDMARLFARHGATVTIIATESNASIFQNNIDHDFAAGFK 91
           Q  + +   LP    GH+IP++D  RLFA HG  VTII T +NA +FQ  ID D ++G +
Sbjct: 4   QADQLKAIFLPFLVPGHMIPLVDTGRLFAMHGVNVTIITTPANALLFQKAIDRDASSGHQ 63

Query: 92  IQTHIVSFPGAEVGLAPGIENYSDVSSRHLQAKIYQAFLILDKLIDQMIIPATRPDCILS 151
           I+THI+ FP A+V L  GIEN++ ++S  +  K+Y A  +L K I+Q +    RPDCI++
Sbjct: 64  IKTHILEFPSAQVSLPKGIENFNMITSPDMSHKLYYAVSLLQKPIEQ-LFQDMRPDCIVT 123

Query: 152 DLSHPWTTDTAERLGVPRLVFSVSNFMAYSAEHSVMQHSPHQKVASDTEEFEIPGLPHHI 211
           D+ +PWT D+A +LG+PRLVF  +++ +  A   + Q++PHQ V S+T+ F +PGLP+ I
Sbjct: 124 DMFYPWTVDSANKLGIPRLVFHGTSYFSLCAASCIKQYAPHQSVKSNTDTFLLPGLPNKI 183

Query: 212 QMTKSQQPEFLLRRDRFTAMMESYKEAERRSYGTVMNTFYELDGVYLEHYKKITGIKAWG 271
           +MT SQ P ++   + +T +M+  KE+E+RSYG VMN+F+EL+  Y EHYK + GIKAW 
Sbjct: 184 EMTTSQLPRWVRTPEAYTQLMDKIKESEQRSYGAVMNSFHELESAYEEHYKSVMGIKAWS 243

Query: 272 LGPVSLAVNKNLREKIERGNKSGMESEELVKWLNSKEPNSVLFVSFGSMTRFPPPQMAEI 331
           +GP+SL  N +  +K+ERGNK+  E+E L  WLNSKE NSVL+VSFGS+ +F   Q+ E+
Sbjct: 244 VGPISLWANSDATDKVERGNKATTENEWL-NWLNSKECNSVLYVSFGSLNKFSTSQLIEL 303

Query: 332 AHGLEDSGINFIWVIRNKDKNDSGEAPEGLPEGFEQMIKNKNRGFIVRIWAPQLLILEHP 391
           AHGLE S   FIWV+R K+K++     EG    FE+ IK  NRG I+  WAPQLLILEHP
Sbjct: 304 AHGLEASNHQFIWVVRLKNKDED----EGWLRDFEKRIKESNRGLIIWDWAPQLLILEHP 363

Query: 392 STGGFLTHCGWNSSIEGISAGQPMVTWPVSSEQFYNEKLLTEVLQVGVPVGARRWWNMSD 451
           + GG +THCGWNS +EG++AG PM+TWP+ +EQFYNEKL+T+V+++GV VG + W  M +
Sbjct: 364 AIGGLVTHCGWNSILEGVTAGLPMITWPLYAEQFYNEKLVTDVIKIGVAVGVKEWRKMDE 423

Query: 452 EMKEIVSRENVEKGVGFLMGATEEAAAIRERAKQLGAAANRAVQSGGSSENNLISLMKEL 511
           E KE V RE +EK V FLMG+  EAA ++ RA++LG AA  AVQSGGSS++NL+ L+KEL
Sbjct: 424 EAKETVKREEIEKAVTFLMGSGVEAAEMKNRARELGNAARSAVQSGGSSQSNLMGLIKEL 481

Query: 512 RSIK 516
           +S+K
Sbjct: 484 KSLK 481

BLAST of ClCG02G012220 vs. ExPASy TrEMBL
Match: A0A7N2R028 (Uncharacterized protein OS=Quercus lobata OX=97700 PE=4 SV=1)

HSP 1 Score: 504.2 bits (1297), Expect = 6.5e-139
Identity = 243/483 (50.31%), Postives = 348/483 (72.05%), Query Frame = 0

Query: 35  EFRITVLPLF-ASGHIIPIIDMARLFARHGATVTIIATESNASIFQNNIDHDFAAGFKIQ 94
           E ++  LP F A GH+IP++D ARLFA HG  VTII T +NA +FQ  ID +  +G +I+
Sbjct: 4   ELKVIFLPFFLAPGHLIPVVDTARLFAMHGVNVTIITTPANALLFQKAIDRNANSGHQIK 63

Query: 95  THIVSFPGAEVGLAPGIENYSDVSSRHLQAKIYQAFLILDKLIDQMIIPATRPDCILSDL 154
           TH++ FP  +VGL  GIEN++ V+S  + +K++    +L   I+Q +    +PDCI+SD+
Sbjct: 64  THVLQFPSDQVGLPQGIENFNTVTSLGMTSKLFHGLSLLQPQIEQ-LFQDMQPDCIVSDM 123

Query: 155 SHPWTTDTAERLGVPRLVFSVSNFMAYSAEHSVMQHSPHQKVASDTEEFEIPGLPHHIQM 214
            +PWT D+A +LG+PRL+  V+ + +  A++ V Q+ PH+ V SDT+ F +PGLP+ I+M
Sbjct: 124 FYPWTVDSAAKLGIPRLLLYVTCYFSLCAQNCVQQYKPHESVNSDTDLFLLPGLPNKIEM 183

Query: 215 TKSQQPEFLLRRDRFTAMMESYKEAERRSYGTVM-NTFYELDGVYLEHYKKITGIKAWGL 274
           T+ Q PE+L   + +T +M+  KE+ERRSYG+++ N+FYEL+G Y E +K   GI+ W +
Sbjct: 184 TRLQLPEWLRTPNGYTQLMDKIKESERRSYGSILANSFYELEGAYEELHKNSMGIRTWSV 243

Query: 275 GPVSLAVNKNLREKIERGNKSGMESEELVKWLNSKEPNSVLFVSFGSMTRFPPPQMAEIA 334
           GPVSL VNK++ +K+ERGNK+ +E  EL+ WLN+KE NSVL++ FGS ++FP  Q+ E+A
Sbjct: 244 GPVSLRVNKDVADKVERGNKAAVEEHELLNWLNAKECNSVLYICFGSSSKFPTAQLIEMA 303

Query: 335 HGLEDSGINFIWVIRNKDKNDSGEAPEGLPEGFEQMIKNKNRGFIVRIWAPQLLILEHPS 394
           HGLE SG  FIWV+R KD    G+  EG    FE+ +K  NRG I+R WAPQ+LILEHP+
Sbjct: 304 HGLEASGHQFIWVVRQKD----GDQSEGWLGDFEKRMKESNRGLIIRGWAPQILILEHPA 363

Query: 395 TGGFLTHCGWNSSIEGISAGQPMVTWPVSSEQFYNEKLLTEVLQVGVPVGARRWWNMSDE 454
            GG +THCG NS +EG++AG PM+ WP+ +EQFY EKL+TEVL++GV VG + W   ++E
Sbjct: 364 IGGQVTHCGSNSLLEGVTAGLPMIAWPLYAEQFYLEKLVTEVLKIGVAVGKKEWSIWAEE 423

Query: 455 MKEIVSRENVEKGVGFLMGATEEAAAIRERAKQLGAAANRAVQSGGSSENNLISLMKELR 514
            KE+V R N+EK V FLMG+ EEAA +R RAK+LG AA +AV+S GSS++N + L+  L+
Sbjct: 424 TKEVVKRNNIEKAVKFLMGSGEEAAEMRNRAKELGNAARKAVESRGSSQSNFMGLISGLK 481

Query: 515 SIK 516
           S+K
Sbjct: 484 SLK 481

BLAST of ClCG02G012220 vs. ExPASy TrEMBL
Match: A0A5A7TQD0 (Glycosyltransferase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold123G00860 PE=3 SV=1)

HSP 1 Score: 503.1 bits (1294), Expect = 1.4e-138
Identity = 230/484 (47.52%), Postives = 341/484 (70.45%), Query Frame = 0

Query: 35  EFRITVLPLFASGHIIPIIDMARLFARHGATVTIIATESNASIFQNNIDHDFAAGFKIQT 94
           E  I  LP  + GH++P++DMA LFA+ GAT TI+ TE+NA++F   I  D  AG +I+ 
Sbjct: 8   ELDIMFLPFVSHGHLLPMVDMAMLFAKLGATATIVTTEANAALFHTKIHRDRVAGSRIRL 67

Query: 95  HIVSFPGAEVGLAPGIENYSDVSSRHLQAKIYQAFLILDKLIDQMIIPATRPDCILSDLS 154
           H + +P AEVGL+P I+N S  +   + +K++Q FL+L   + + +I   RPDCI+SD+ 
Sbjct: 68  HTIPWPAAEVGLSPAIQNLSTATPMTM-SKVFQVFLMLQPQL-RGLIHEMRPDCIISDVF 127

Query: 155 HPWTTDTAERLGVPRLVFSVSNFMAYSAEHSVMQHSPHQKVASDTEEFEIPGLPHHIQMT 214
           +PWT+D A  LG+PRL F+ S++  Y AE  + +H PH +V S+ E+F++PGLP  ++M 
Sbjct: 128 YPWTSDVAAELGIPRLAFNGSSYFGYCAEQCMKEHKPHLEVESNNEKFKLPGLPDVVEMM 187

Query: 215 KSQQPEFLLRRDRFTAMMESYKEAERRSYGTVMNTFYELDGVYLEHYKKITGIKAWGLGP 274
           +S+ P ++ R D F+ +++  +E+E+R YG +MN+FYEL+G Y EH  KI GIK W +GP
Sbjct: 188 RSELPSWIAREDDFSRLLDVIRESEKRCYGMLMNSFYELEGSYEEHSNKIIGIKTWSIGP 247

Query: 275 VSLAVNKNLREKIERGNKSGMESEELVKWLNSKEPNSVLFVSFGSMTRFPPPQMAEIAHG 334
           VSL  NK + +K  RG    +++  L++WLN KEPNSVL+++FGS+ +  P Q+ EIAH 
Sbjct: 248 VSLLANKEIEDKESRGGNPNIQTTNLLQWLNEKEPNSVLYINFGSLVQMNPNQLTEIAHA 307

Query: 335 LEDSGINFIWVIRNKDKNDSGE--APEGLPEGFEQMIKNKNRGFIVRIWAPQLLILEHPS 394
           ++ S  NFIWVI+   K+D  +    +GLP+GFE+ +    +G I++ WAPQL+ILEH S
Sbjct: 308 IQKSSQNFIWVIKKNSKDDDEDNIVNKGLPKGFEERMSKTKKGLIIKGWAPQLMILEHKS 367

Query: 395 TGGFLTHCGWNSSIEGISAGQPMVTWPVSSEQFYNEKLLTEVLQVGVPVGARRWWNMSDE 454
            GGFLTHCGWNS +EGIS+G PM+TWP+ +EQFYNEKLL EV+++GV VG+++WW + +E
Sbjct: 368 VGGFLTHCGWNSILEGISSGLPMITWPLFAEQFYNEKLLVEVVKIGVGVGSKKWWYLGEE 427

Query: 455 MKEIVSRENVEKGVGFLMGATEEAAAIRERAKQLGAAANRAVQSGGSSENNLISLMKELR 514
            +EI+ RE + K + FLMG + EA  +R+RA+++G AA  +V  GG+S  NL+SL KEL+
Sbjct: 428 EQEIIKREEIGKAIAFLMGESVEALEMRKRAREMGEAAKTSVNCGGASHINLVSLFKELQ 487

Query: 515 SIKV 517
             K+
Sbjct: 488 ETKL 489

BLAST of ClCG02G012220 vs. TAIR 10
Match: AT2G15490.1 (UDP-glycosyltransferase 73B4 )

HSP 1 Score: 389.0 bits (998), Expect = 5.8e-108
Identity = 212/495 (42.83%), Postives = 311/495 (62.83%), Query Frame = 0

Query: 34  KEFRITVLPLFASGHIIPIIDMARLFARHGATVTIIATESNASIFQNNIDHDFAAGFKIQ 93
           ++  I   P  A GH+IP++DMA+LFAR GA  T++ T  NA I +  I+      FK+Q
Sbjct: 4   EQIHILFFPFMAHGHMIPLLDMAKLFARRGAKSTLLTTPINAKILEKPIE-----AFKVQ 63

Query: 94  T-------HIVSFPGAEVGLAPGIENYSDVSS--RHLQAKIYQAFLILDKLIDQMI---I 153
                    I++FP  E+GL  G EN   ++S  +     ++  FL   K + Q +   I
Sbjct: 64  NPDLEIGIKILNFPCVELGLPEGCENRDFINSYQKSDSFDLFLKFLFSTKYMKQQLESFI 123

Query: 154 PATRPDCILSDLSHPWTTDTAERLGVPRLVFSVSNFMAYSAEHSVMQHSPHQKVASDTEE 213
             T+P  +++D+  PW T++AE++GVPRLVF  ++  A    +++  H PH+KVAS +  
Sbjct: 124 ETTKPSALVADMFFPWATESAEKIGVPRLVFHGTSSFALCCSYNMRIHKPHKKVASSSTP 183

Query: 214 FEIPGLPHHIQMTKSQQPEFLLRRDRFTAMMESYKEAERRSYGTVMNTFYELDGVYLEHY 273
           F IPGLP  I +T+  Q         F    +  +E+E  S+G ++N+FYEL+  Y + Y
Sbjct: 184 FVIPGLPGDIVITE-DQANVTNEETPFGKFWKEVRESETSSFGVLVNSFYELESSYADFY 243

Query: 274 KKITGIKAWGLGPVSLAVNKNLREKIERGNKSGMESEELVKWLNSKEPNSVLFVSFGSMT 333
           +     KAW +GP+SL+ N+ + EK  RG K+ ++ +E +KWL+SK P SV+++SFGS T
Sbjct: 244 RSFVAKKAWHIGPLSLS-NRGIAEKAGRGKKANIDEQECLKWLDSKTPGSVVYLSFGSGT 303

Query: 334 RFPPPQMAEIAHGLEDSGINFIWVI-RNKDKNDSGEAPEGLPEGFEQMIKNKNRGFIVRI 393
             P  Q+ EIA GLE SG NFIWV+ +N+++  +GE  + LP+GFE+  +NK +G I+R 
Sbjct: 304 GLPNEQLLEIAFGLEGSGQNFIWVVSKNENQVGTGENEDWLPKGFEE--RNKGKGLIIRG 363

Query: 394 WAPQLLILEHPSTGGFLTHCGWNSSIEGISAGQPMVTWPVSSEQFYNEKLLTEVLQVGVP 453
           WAPQ+LIL+H + GGF+THCGWNS++EGI+AG PMVTWP+ +EQFYNEKLLT+VL++GV 
Sbjct: 364 WAPQVLILDHKAIGGFVTHCGWNSTLEGIAAGLPMVTWPMGAEQFYNEKLLTKVLRIGVN 423

Query: 454 VGARRWWNMSDEMKEIVSRENVEKGVGFLMGATEEAAAIRERAKQLGAAANRAVQSGGSS 513
           VGA           +++SR  VEK V  ++G  E+A   R RAK+LG  A  AV+ GGSS
Sbjct: 424 VGATELVKKG----KLISRAQVEKAVREVIGG-EKAEERRLRAKELGEMAKAAVEEGGSS 483

Query: 514 ENNLISLMKELRSIK 516
            N++   M+EL   K
Sbjct: 484 YNDVNKFMEELNGRK 484

BLAST of ClCG02G012220 vs. TAIR 10
Match: AT2G15490.3 (UDP-glycosyltransferase 73B4 )

HSP 1 Score: 386.7 bits (992), Expect = 2.9e-107
Identity = 213/494 (43.12%), Postives = 310/494 (62.75%), Query Frame = 0

Query: 34  KEFRITVLPLFASGHIIPIIDMARLFARHGATVTIIATESNASIFQNNIDHDFAAGFKIQ 93
           ++  I   P  A GH+IP++DMA+LFAR GA  T++ T  NA I +  I+      FK+Q
Sbjct: 4   EQIHILFFPFMAHGHMIPLLDMAKLFARRGAKSTLLTTPINAKILEKPIE-----AFKVQ 63

Query: 94  T-------HIVSFPGAEVGLAPGIENYSDVSS--RHLQAKIYQAFLILDKLIDQMI---I 153
                    I++FP  E+GL  G EN   ++S  +     ++  FL   K + Q +   I
Sbjct: 64  NPDLEIGIKILNFPCVELGLPEGCENRDFINSYQKSDSFDLFLKFLFSTKYMKQQLESFI 123

Query: 154 PATRPDCILSDLSHPWTTDTAERLGVPRLVFSVSNFMAYSAEHSVMQHSPHQKVASDTEE 213
             T+P  +++D+  PW T++AE++GVPRLVF  ++  A    +++  H PH+KVAS +  
Sbjct: 124 ETTKPSALVADMFFPWATESAEKIGVPRLVFHGTSSFALCCSYNMRIHKPHKKVASSSTP 183

Query: 214 FEIPGLPHHIQMTKSQQPEFLLRRDRFTAMMESYKEAERRSYGTVMNTFYELDGVYLEHY 273
           F IPGLP  I +T+  Q         F    +  +E+E  S+G ++N+FYEL+  Y + Y
Sbjct: 184 FVIPGLPGDIVITE-DQANVTNEETPFGKFWKEVRESETSSFGVLVNSFYELESSYADFY 243

Query: 274 KKITGIKAWGLGPVSLAVNKNLREKIERGNKSGMESEELVKWLNSKEPNSVLFVSFGSMT 333
           +     KAW +GP+SL+ N+ + EK  RG K+ ++ +E +KWL+SK P SV+++SFGS T
Sbjct: 244 RSFVAKKAWHIGPLSLS-NRGIAEKAGRGKKANIDEQECLKWLDSKTPGSVVYLSFGSGT 303

Query: 334 RFPPPQMAEIAHGLEDSGINFIWVIRNKDKNDSGEAPEGLPEGFEQMIKNKNRGFIVRIW 393
             P  Q+ EIA GLE SG NFIWV+ +K++N  GE  + LP+GFE+  +NK +G I+R W
Sbjct: 304 GLPNEQLLEIAFGLEGSGQNFIWVV-SKNEN-QGENEDWLPKGFEE--RNKGKGLIIRGW 363

Query: 394 APQLLILEHPSTGGFLTHCGWNSSIEGISAGQPMVTWPVSSEQFYNEKLLTEVLQVGVPV 453
           APQ+LIL+H + GGF+THCGWNS++EGI+AG PMVTWP+ +EQFYNEKLLT+VL++GV V
Sbjct: 364 APQVLILDHKAIGGFVTHCGWNSTLEGIAAGLPMVTWPMGAEQFYNEKLLTKVLRIGVNV 423

Query: 454 GARRWWNMSDEMKEIVSRENVEKGVGFLMGATEEAAAIRERAKQLGAAANRAVQSGGSSE 513
           GA           +++SR  VEK V  ++G  E+A   R RAK+LG  A  AV+ GGSS 
Sbjct: 424 GATELVKKG----KLISRAQVEKAVREVIGG-EKAEERRLRAKELGEMAKAAVEEGGSSY 481

Query: 514 NNLISLMKELRSIK 516
           N++   M+EL   K
Sbjct: 484 NDVNKFMEELNGRK 481

BLAST of ClCG02G012220 vs. TAIR 10
Match: AT4G34135.1 (UDP-glucosyltransferase 73B2 )

HSP 1 Score: 386.3 bits (991), Expect = 3.8e-107
Identity = 218/491 (44.40%), Postives = 308/491 (62.73%), Query Frame = 0

Query: 33  NKEFRITVLPLFASGHIIPIIDMARLFARHGATVTIIATESNASIFQNNID--HDFAAGF 92
           +++  +   P  A GH+IP +DMA+LF+  GA  TI+ T  N+ I Q  ID   +   G 
Sbjct: 7   HRKLHVMFFPFMAYGHMIPTLDMAKLFSSRGAKSTILTTSLNSKILQKPIDTFKNLNPGL 66

Query: 93  KIQTHIVSFPGAEVGLAPGIENYSDVSSRHLQAK---IYQAFLILDKLIDQM--IIPATR 152
           +I   I +FP  E+GL  G EN    +S +   K   I + F       DQ+  ++  TR
Sbjct: 67  EIDIQIFNFPCVELGLPEGCENVDFFTSNNNDDKNEMIVKFFFSTRFFKDQLEKLLGTTR 126

Query: 153 PDCILSDLSHPWTTDTAERLGVPRLVFSVSNFMAYSAEHSVMQHSPHQKVASDTEEFEIP 212
           PDC+++D+  PW T+ A +  VPRLVF  + + +  A + +  H P ++VAS +E F IP
Sbjct: 127 PDCLIADMFFPWATEAAGKFNVPRLVFHGTGYFSLCAGYCIGVHKPQKRVASSSEPFVIP 186

Query: 213 GLPHHIQMTKSQQPEFLLRRDRFTAM---MESYKEAERRSYGTVMNTFYELDGVYLEHYK 272
            LP +I +T+ Q    ++  D  + M   M   +E+E +S G V+N+FYEL+  Y + YK
Sbjct: 187 ELPGNIVITEEQ----IIDGDGESDMGKFMTEVRESEVKSSGVVLNSFYELEHDYADFYK 246

Query: 273 KITGIKAWGLGPVSLAVNKNLREKIERGNKSGMESEELVKWLNSKEPNSVLFVSFGSMTR 332
                +AW +GP+S+  N+   EK ERG K+ ++  E +KWL+SK+PNSV++VSFGS+  
Sbjct: 247 SCVQKRAWHIGPLSV-YNRGFEEKAERGKKANIDEAECLKWLDSKKPNSVIYVSFGSVAF 306

Query: 333 FPPPQMAEIAHGLEDSGINFIWVIRNKDKNDSGEAPEGLPEGFEQMIKNKNRGFIVRIWA 392
           F   Q+ EIA GLE SG +FIWV+R K K+D     E LPEGFE+ +K K  G I+R WA
Sbjct: 307 FKNEQLFEIAAGLEASGTSFIWVVR-KTKDD---REEWLPEGFEERVKGK--GMIIRGWA 366

Query: 393 PQLLILEHPSTGGFLTHCGWNSSIEGISAGQPMVTWPVSSEQFYNEKLLTEVLQVGVPVG 452
           PQ+LIL+H +TGGF+THCGWNS +EG++AG PMVTWPV +EQFYNEKL+T+VL+ GV VG
Sbjct: 367 PQVLILDHQATGGFVTHCGWNSLLEGVAAGLPMVTWPVGAEQFYNEKLVTQVLRTGVSVG 426

Query: 453 ARRWWNMSDEMKEIVSRENVEKGVGFLMGATEEAAAIRERAKQLGAAANRAVQSGGSSEN 512
           A +  +M   M + +SRE V+K V  ++ A E A   R RAK+L A A  AV+ GGSS N
Sbjct: 427 ASK--HMKVMMGDFISREKVDKAVREVL-AGEAAEERRRRAKKLAAMAKAAVEEGGSSFN 483

Query: 513 NLISLMKELRS 514
           +L S M+E  S
Sbjct: 487 DLNSFMEEFSS 483

BLAST of ClCG02G012220 vs. TAIR 10
Match: AT2G15480.1 (UDP-glucosyl transferase 73B5 )

HSP 1 Score: 380.6 bits (976), Expect = 2.1e-105
Identity = 207/490 (42.24%), Postives = 304/490 (62.04%), Query Frame = 0

Query: 33  NKEFRITVLPLFASGHIIPIIDMARLFARHGATVTIIATESNASIFQNNID--HDFAAGF 92
           ++   I   P  A GH+IPI+DMA+LF+R GA  T++ T  NA IF+  I+   +     
Sbjct: 6   SERIHILFFPFMAQGHMIPILDMAKLFSRRGAKSTLLTTPINAKIFEKPIEAFKNQNPDL 65

Query: 93  KIQTHIVSFPGAEVGLAPGIENYSDVSS--RHLQAKIYQAFLILDKLIDQMI---IPATR 152
           +I   I +FP  E+GL  G EN   ++S  +     ++  FL   K + Q +   I  T+
Sbjct: 66  EIGIKIFNFPCVELGLPEGCENADFINSYQKSDSGDLFLKFLFSTKYMKQQLESFIETTK 125

Query: 153 PDCILSDLSHPWTTDTAERLGVPRLVFSVSNFMAYSAEHSVMQHSPHQKVASDTEEFEIP 212
           P  +++D+  PW T++AE+LGVPRLVF  ++F +    +++  H PH+KVA+ +  F IP
Sbjct: 126 PSALVADMFFPWATESAEKLGVPRLVFHGTSFFSLCCSYNMRIHKPHKKVATSSTPFVIP 185

Query: 213 GLPHHIQMTKSQQPEFLLRRDRFTAMMESYKEAERRSYGTVMNTFYELDGVYLEHYKKIT 272
           GLP  I +T+  Q             M+  +E+E  S+G ++N+FYEL+  Y + Y+   
Sbjct: 186 GLPGDIVITE-DQANVAKEETPMGKFMKEVRESETNSFGVLVNSFYELESAYADFYRSFV 245

Query: 273 GIKAWGLGPVSLAVNKNLREKIERGNKSGMESEELVKWLNSKEPNSVLFVSFGSMTRFPP 332
             +AW +GP+SL+ N+ L EK  RG K+ ++ +E +KWL+SK P SV+++SFGS T F  
Sbjct: 246 AKRAWHIGPLSLS-NRELGEKARRGKKANIDEQECLKWLDSKTPGSVVYLSFGSGTNFTN 305

Query: 333 PQMAEIAHGLEDSGINFIWVIRNKDKNDSGEAPEGLPEGFEQMIKNKNRGFIVRIWAPQL 392
            Q+ EIA GLE SG +FIWV+R  +  + G+  E LPEGF++  +   +G I+  WAPQ+
Sbjct: 306 DQLLEIAFGLEGSGQSFIWVVRKNE--NQGDNEEWLPEGFKE--RTTGKGLIIPGWAPQV 365

Query: 393 LILEHPSTGGFLTHCGWNSSIEGISAGQPMVTWPVSSEQFYNEKLLTEVLQVGVPVGARR 452
           LIL+H + GGF+THCGWNS+IEGI+AG PMVTWP+ +EQFYNEKLLT+VL++GV VGA  
Sbjct: 366 LILDHKAIGGFVTHCGWNSAIEGIAAGLPMVTWPMGAEQFYNEKLLTKVLRIGVNVGATE 425

Query: 453 WWNMSDEMKEIVSRENVEKGVGFLMGATEEAAAIRERAKQLGAAANRAVQSGGSSENNLI 512
                    +++SR  VEK V  ++G  E+A   R  AK+LG  A  AV+ GGSS N++ 
Sbjct: 426 LVKKG----KLISRAQVEKAVREVIGG-EKAEERRLWAKKLGEMAKAAVEEGGSSYNDVN 484

Query: 513 SLMKELRSIK 516
             M+EL   K
Sbjct: 486 KFMEELNGRK 484

BLAST of ClCG02G012220 vs. TAIR 10
Match: AT2G15480.2 (UDP-glucosyl transferase 73B5 )

HSP 1 Score: 380.6 bits (976), Expect = 2.1e-105
Identity = 208/499 (41.68%), Postives = 305/499 (61.12%), Query Frame = 0

Query: 33  NKEFRITVLPLFASGHIIPIIDMARLFARHGATVTIIATESNASIFQNNID--HDFAAGF 92
           ++   I   P  A GH+IPI+DMA+LF+R GA  T++ T  NA IF+  I+   +     
Sbjct: 6   SERIHILFFPFMAQGHMIPILDMAKLFSRRGAKSTLLTTPINAKIFEKPIEAFKNQNPDL 65

Query: 93  KIQTHIVSFPGAEVGLAPGIENYSDVSS--RHLQAKIYQAFLILDKLIDQMI---IPATR 152
           +I   I +FP  E+GL  G EN   ++S  +     ++  FL   K + Q +   I  T+
Sbjct: 66  EIGIKIFNFPCVELGLPEGCENADFINSYQKSDSGDLFLKFLFSTKYMKQQLESFIETTK 125

Query: 153 PDCILSDLSHPWTTDTAERLGVPRLVFSVSNFMAYSAEHSVMQHSPHQKVASDTEEFEIP 212
           P  +++D+  PW T++AE+LGVPRLVF  ++F +    +++  H PH+KVA+ +  F IP
Sbjct: 126 PSALVADMFFPWATESAEKLGVPRLVFHGTSFFSLCCSYNMRIHKPHKKVATSSTPFVIP 185

Query: 213 GLPHHIQMTKSQQPEFLLRRDRFTAMMESYKEAERRSYGTVMNTFYELDGVYLEHYKKIT 272
           GLP  I +T+  Q             M+  +E+E  S+G ++N+FYEL+  Y + Y+   
Sbjct: 186 GLPGDIVITE-DQANVAKEETPMGKFMKEVRESETNSFGVLVNSFYELESAYADFYRSFV 245

Query: 273 GIKAWGLGPVSLAVNKNLREKIERGNKSGMESEELVKWLNSKEPNSVLFVSFGSMTRFPP 332
             +AW +GP+SL+ N+ L EK  RG K+ ++ +E +KWL+SK P SV+++SFGS T F  
Sbjct: 246 AKRAWHIGPLSLS-NRELGEKARRGKKANIDEQECLKWLDSKTPGSVVYLSFGSGTNFTN 305

Query: 333 PQMAEIAHGLEDSGINFIWVIRNKDKNDSGEAPEGLPEGFEQMIKNKNRGFIVRIWAPQL 392
            Q+ EIA GLE SG +FIWV+R  +  + G+  E LPEGF++  +   +G I+  WAPQ+
Sbjct: 306 DQLLEIAFGLEGSGQSFIWVVRKNE--NQGDNEEWLPEGFKE--RTTGKGLIIPGWAPQV 365

Query: 393 LILEHPSTGGFLTHCGWNSSIEGISAGQPMVTWPVSSEQFYNEKLLTEVLQVGVPVGARR 452
           LIL+H + GGF+THCGWNS+IEGI+AG PMVTWP+ +EQFYNEKLLT+VL++GV VGA  
Sbjct: 366 LILDHKAIGGFVTHCGWNSAIEGIAAGLPMVTWPMGAEQFYNEKLLTKVLRIGVNVGATE 425

Query: 453 WWNMSDEMKEIVSRENVEKGVGFLMGAT---------EEAAAIRERAKQLGAAANRAVQS 512
                    +++SR  VEK V  ++G           E+A   R RAK+LG  A  AV+ 
Sbjct: 426 LVKKG----KLISRAQVEKAVREVIGGEKAVREVIGGEKAEERRLRAKELGEMAKAAVEE 485

Query: 513 GGSSENNLISLMKELRSIK 516
           GGSS N++   M+EL   K
Sbjct: 486 GGSSYNDVNKFMEELNGRK 494

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038900859.11.0e-24786.56soyasapogenol B glucuronide galactosyltransferase-like [Benincasa hispida][more]
XP_022149559.12.4e-22078.37soyasapogenol B glucuronide galactosyltransferase-like [Momordica charantia][more]
KAF3972753.14.9e-14150.10hypothetical protein CMV_003760 [Castanea mollissima][more]
XP_030923367.14.9e-14150.10soyasapogenol B glucuronide galactosyltransferase-like [Quercus lobata][more]
XP_030949641.11.3e-13850.31soyasapogenol B glucuronide galactosyltransferase-like [Quercus lobata][more]
Match NameE-valueIdentityDescription
D4Q9Z47.4e-13246.28Soyasapogenol B glucuronide galactosyltransferase OS=Glycine max OX=3847 GN=GmSG... [more]
Q9AT547.9e-11042.37Scopoletin glucosyltransferase OS=Nicotiana tabacum OX=4097 GN=TOGT1 PE=1 SV=1[more]
Q2V6J92.3e-10943.74UDP-glucose flavonoid 3-O-glucosyltransferase 7 OS=Fragaria ananassa OX=3747 GN=... [more]
Q7Y2328.2e-10742.83UDP-glycosyltransferase 73B4 OS=Arabidopsis thaliana OX=3702 GN=UGT73B4 PE=2 SV=... [more]
Q94C575.3e-10644.40UDP-glucosyl transferase 73B2 OS=Arabidopsis thaliana OX=3702 GN=UGT73B2 PE=1 SV... [more]
Match NameE-valueIdentityDescription
A0A6J1D7D41.1e-22078.37Glycosyltransferase OS=Momordica charantia OX=3673 GN=LOC111017964 PE=3 SV=1[more]
A0A2N9I9F62.6e-14853.39Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS48797 PE=4 SV=1[more]
A0A2N9FF754.3e-14351.24Glycosyltransferase OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS13306 PE=3 SV=1[more]
A0A7N2R0286.5e-13950.31Uncharacterized protein OS=Quercus lobata OX=97700 PE=4 SV=1[more]
A0A5A7TQD01.4e-13847.52Glycosyltransferase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold123G... [more]
Match NameE-valueIdentityDescription
AT2G15490.15.8e-10842.83UDP-glycosyltransferase 73B4 [more]
AT2G15490.32.9e-10743.12UDP-glycosyltransferase 73B4 [more]
AT4G34135.13.8e-10744.40UDP-glucosyltransferase 73B2 [more]
AT2G15480.12.1e-10542.24UDP-glucosyl transferase 73B5 [more]
AT2G15480.22.1e-10541.68UDP-glucosyl transferase 73B5 [more]
InterPro
Analysis Name: InterPro Annotations of Watermelon (Charleston Gray) v2.5
Date Performed: 2022-01-31
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002213UDP-glucuronosyl/UDP-glucosyltransferasePFAMPF00201UDPGTcoord: 296..432
e-value: 1.7E-19
score: 70.0
IPR002213UDP-glucuronosyl/UDP-glucosyltransferaseCDDcd03784GT1_Gtf-likecoord: 37..493
e-value: 3.98867E-68
score: 222.812
NoneNo IPR availableGENE3D3.40.50.2000Glycogen Phosphorylase B;coord: 290..492
e-value: 3.2E-114
score: 384.4
NoneNo IPR availableGENE3D3.40.50.2000Glycogen Phosphorylase B;coord: 40..504
e-value: 3.2E-114
score: 384.4
NoneNo IPR availablePANTHERPTHR48049GLYCOSYLTRANSFERASEcoord: 32..516
NoneNo IPR availablePANTHERPTHR48049:SF18SOYASAPOGENOL B GLUCURONIDE GALACTOSYLTRANSFERASE-RELATEDcoord: 32..516
NoneNo IPR availableSUPERFAMILY53756UDP-Glycosyltransferase/glycogen phosphorylasecoord: 37..511
IPR035595UDP-glycosyltransferase family, conserved sitePROSITEPS00375UDPGTcoord: 381..424

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
ClCG02G012220.2ClCG02G012220.2mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0008194 UDP-glycosyltransferase activity