CsGy4G020580 (gene) Cucumber (Gy14) v2.1

Overview
Name: CsGy4G020580
Type: gene
Organism: Cucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
Description: UDP-glucosyltransferase
Location: Gy14Chr4: 27432879 .. 27435933 (-)
RNA-Seq Expression: CsGy4G020580
Synteny: CsGy4G020580
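
The Location field in the overview packs the chromosome, coordinates, and strand into one string. A minimal sketch of how it could be split apart follows. The function name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the page does not state.

```python
import re

# Pattern for strings such as "Gy14Chr4: 27432879 .. 27435933 (-)".
LOCATION_RE = re.compile(r"^(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)$")


def parse_location(text):
    """Split a location string into sequence id, start, end, strand, and length."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    seqid, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "seqid": seqid,
        "start": start,
        "end": end,
        "strand": strand,
        # Assumption: 1-based inclusive coordinates, so both ends are counted.
        "length": end - start + 1,
    }


loc = parse_location("Gy14Chr4: 27432879 .. 27435933 (-)")
print(loc["seqid"], loc["strand"], loc["length"])
```

Under that coordinate assumption the feature spans 3055 bp on the minus strand of Gy14Chr4.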
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

CAAAATACTGATTTCTAAAAATACGAAATTAACAATACTTATCTAGTTCTCTATAAATAAAAAAAACAGCAAAGCAATGACATCATCTACATCCCATTAAATTAGGTTTTAAGAATGATTTCTGAATTTGTCCTCTGTATACTTCACCTAGTGCTCAATATATACAATTGATTTACATTTTGAATTTGGGAATAAAGATGTAAAAACAGGAAGAGATAATTTAGATAAAGAATAGAGGTATGGGCTGATTTGACAGGTTGAGGAGGATTTCAGTTGAACAAAACATTTTAAACTTTCCTAGAAGGTACGAATTCTTGATGACATGACTTTTTGTCCATCTCACCTTTCATCATTTTCTCCTAACGCTTACAAATTTCTATGCTGTCCATCCCTTACCCTTAACCTTCAATTATAAACATATCATCAATCAAAATATAAACTTAAACCATTTTAAATAATAAAATAATAATAAGGACCTTTTTTTTTTAATTATAAAAAGCCATTAATTCAACCCATCTTCTTCCCCCAAAATGAAGAAACTGGAGCTCATCTTCATCCCCACCCCGATCATCGGCCACCTGACCTCCGCCCTCCAACTCGCCCACCTTCTCGTAACTCGACACCCTTTTCTCTCTATCACTATTTTCATCATTAAAATCCCTTTCCCCACCAGATCTGCCGATCAAATTCAATCTCTCTGTTCCTCCTATGCCAACCATCGCCTCCGATTCTTCACTCTCCCGGAGCAACCCATCCCCGGCAACACTAATAAAACCACCATCTTAAAACCCCTCGTCGAATCCCAGAAACAAAATGTCGCCGACGCCGTCGCCAATCTCATCGCCGCGCCGGATTCCCCTACACTCGCTGGCTTCGTCGTCGACATGTTCTGCATTCCAATGCTGGATGTAGCCAAACAATTTTCTGTTCCCACTTTCGTCTTCTACACTTCCAGCGCTTCCTTCCTTGCCCTTCTCTTCCATCTTCAAGAACTCTACGACTATGAATTCAATCACGACATGGACCAATTGCTCAACTCTGTAACAGAGTTTGCCCTCCCGGGTTTCAAAAATCCGATTCCGAGGAAAGTGATTTCCACCATTTTTTACGATAAGGAAACGATTGAATGGGCGCACAATCTCACTCGCAAGTTCAGAGAAGCAAGTGGGTTTTTAGTAAACACATTTTCCGAGCTCGAATCCGGTGCGATTAACTGGTTCGCCAATCAGAATCTCCCTCCGGTGTACGCCGTTGGACCTATTTTGAATGTGAAGGAAAAAAATCCCCAAATCGAACGGGATGAGATTTTGAAGTGGTTAGACGAGCAGCCACCGTCGTCGGTGGTGCTCCTCTGTTTCGGATCAATGGGAATCTTCAATGAATCTCAAACCAAAGAGATTGCAGATGCCTTAGAGCGAAGTGGAGTCCGATTCATCTGGTCCATACGGCAGGTACCACCGGAGAGTGTTCTGCCGGAAGGATTCGTGGATCGGACGAGCGGAATGGGAAAAGTGATGGGGTGGGCGCCGCAGATGGAAATATTGGAACATCCGGCGACGGGAGGGTTTGTGTCGCATTGCGGGTGGAATTCAGTGCTGGAGAGTTTGTGGAACGGTGTGGCAGTGGCGACGTGGCCGATGTATGCGGAGCAGCAGTTGAATGCGTTTCACATGGCGGTGGAGTTGGGAGTGGGGGTGGAGGTGTCCTTGGATTATAGTATGGTGGGGGCGGCGGAGGGGGAGTTGAGGGCGGATAAGATTGAGGCGGGGATACGAAAGTTGATGGAAGGTTCAGAGGAGATGAAGAAGGGGGTGATGGTTAAGAGTGAGGAAAGTAAGAAGGCAACAATGGAGGATGGATCTTCTTTCAATGATCTTAATCGTTTTATCGATCATGTGTTTCATAAAATTAATACTTGTTGATTTTAGCTGTGTGTTTCATCAAGCATGTCATCAAGAATGAATGGGAGAATATAGTATCCATGAAAACATGTTTGAAGA
TCAAAGAAAAAATAAAAGAGACATGCATATGCATACTAAGAAATATAACAATGTTTGATTGCTTGCAGTGTAATATGGTCACCATTTTGACAATTGGGTCGACCTATTTGGATAAATTTGTGAGCCGGACCCAATAGGTGTTAATCCAATAGGTGTTAATCCAATAGGTCAGTCCAATTCAACATTGGACTCAGACTTTAATTCAAAGAGACAGTCACAATGATTGGGTCATGAACTTGACTATTGGGTTTGGGCCTTCACTTGTAGCCCATGGTTGATCTCTACTTAAGTTGTTGTTTGCGTTTGTCACATAACTGTCTACTCGTAATAATAGAGAAAGTGTATATAAGATATCAATTTGGGTTCATCGCATAGCCTTCATAACTGTTCACCAAATATCTTTTAACTTGCTAATTTAAGTTGATCATTCCTAAAATCTATAATGCAATTAGGTTATTGGTGAAAATTGTTAGTCTCTTTAATTGAAAAATATATAAATTTTATATGATAATGTGATTAAGATTATAGGAAGGAATAAGATGTTTAATAAGTAACAAACTTGGTTTTGGAAGAATATGAAACAATAATTATGTGTAAAGTAGAGGTAGAAAAAAGAAAGAGAAGCCGTAGGAGGGAAACTCCATTTTATATAATAGAAAAGGCATGATTGATATATTCTCTTTATGTTATGAAAAATGATATATGTAGAGGTCAATGGATGTGGGTACCAAAGTCAAGCAAAAGCATTGAACACAAAGGATCATAATGGCCCCATCGCCATCTATATTGAATTTAAATCTTTGACCAAACAGACCACAAAACAAACCATGCTCATCCACATGGGGGTTTCCCAATATTAAAATTATCCTTTCAAAATATATATACTACTACAAAATTTCTCATCAATTAAAATTCTCAAAACAGAGGCTAACTAAGTTGAAAGTATATTTTTTAAAGGTTTAAAATCTTTAACTTTTAACCAACCACATGAGTAGTTTGTAAATATTTAATATTTGAAACTAATTATCAAAATCTTTGTCACAATGTAAATGGCC

mRNA sequence

CAAAATACTGATTTCTAAAAATACGAAATTAACAATACTTATCTAGTTCTCTATAAATAAAAAAAACAGCAAAGCAATGACATCATCTACATCCCATTAAATTAGGTTTTAAGAATGATTTCTGAATTTGTCCTCTGTATACTTCACCTAGTGCTCAATATATACAATTGATTTACATTTTGAATTTGGGAATAAAGATGTAAAAACAGGAAGAGATAATTTAGATAAAGAATAGAGGTATGGGCTGATTTGACAGGTTGAGGAGGATTTCAGTTGAACAAAACATTTTAAACTTTCCTAGAAGGTACGAATTCTTGATGACATGACTTTTTGTCCATCTCACCTTTCATCATTTTCTCCTAACGCTTACAAATTTCTATGCTGTCCATCCCTTACCCTTAACCTTCAATTATAAACATATCATCAATCAAAATATAAACTTAAACCATTTTAAATAATAAAATAATAATAAGGACCTTTTTTTTTTAATTATAAAAAGCCATTAATTCAACCCATCTTCTTCCCCCAAAATGAAGAAACTGGAGCTCATCTTCATCCCCACCCCGATCATCGGCCACCTGACCTCCGCCCTCCAACTCGCCCACCTTCTCGTAACTCGACACCCTTTTCTCTCTATCACTATTTTCATCATTAAAATCCCTTTCCCCACCAGATCTGCCGATCAAATTCAATCTCTCTGTTCCTCCTATGCCAACCATCGCCTCCGATTCTTCACTCTCCCGGAGCAACCCATCCCCGGCAACACTAATAAAACCACCATCTTAAAACCCCTCGTCGAATCCCAGAAACAAAATGTCGCCGACGCCGTCGCCAATCTCATCGCCGCGCCGGATTCCCCTACACTCGCTGGCTTCGTCGTCGACATGTTCTGCATTCCAATGCTGGATGTAGCCAAACAATTTTCTGTTCCCACTTTCGTCTTCTACACTTCCAGCGCTTCCTTCCTTGCCCTTCTCTTCCATCTTCAAGAACTCTACGACTATGAATTCAATCACGACATGGACCAATTGCTCAACTCTGTAACAGAGTTTGCCCTCCCGGGTTTCAAAAATCCGATTCCGAGGAAAGTGATTTCCACCATTTTTTACGATAAGGAAACGATTGAATGGGCGCACAATCTCACTCGCAAGTTCAGAGAAGCAAGTGGGTTTTTAGTAAACACATTTTCCGAGCTCGAATCCGGTGCGATTAACTGGTTCGCCAATCAGAATCTCCCTCCGGTGTACGCCGTTGGACCTATTTTGAATGTGAAGGAAAAAAATCCCCAAATCGAACGGGATGAGATTTTGAAGTGGTTAGACGAGCAGCCACCGTCGTCGGTGGTGCTCCTCTGTTTCGGATCAATGGGAATCTTCAATGAATCTCAAACCAAAGAGATTGCAGATGCCTTAGAGCGAAGTGGAGTCCGATTCATCTGGTCCATACGGCAGGTACCACCGGAGAGTGTTCTGCCGGAAGGATTCGTGGATCGGACGAGCGGAATGGGAAAAGTGATGGGGTGGGCGCCGCAGATGGAAATATTGGAACATCCGGCGACGGGAGGGTTTGTGTCGCATTGCGGGTGGAATTCAGTGCTGGAGAGTTTGTGGAACGGTGTGGCAGTGGCGACGTGGCCGATGTATGCGGAGCAGCAGTTGAATGCGTTTCACATGGCGGTGGAGTTGGGAGTGGGGGTGGAGGTGTCCTTGGATTATAGTATGGTGGGGGCGGCGGAGGGGGAGTTGAGGGCGGATAAGATTGAGGCGGGGATACGAAAGTTGATGGAAGGTTCAGAGGAGATGAAGAAGGGGGTGATGGTTAAGAGTGAGGAAAGTAAGAAGGCAACAATGGAGGATGGATCTTCTTTCAATGATCTTAATCGTTTTATCGATCATGTGTTTCATAAAATTAATACTTGTTGATTTTAGCTGTGTGTTTCATCAAGCATGTCATCAAGAATGAATGGGAGAATATAGTATCCATGAAAACATGTTTGAAGA
TCAAAGAAAAAATAAAAGAGACATGCATATGCATACTAAGAAATATAACAATGTTTGATTGCTTGCAGTGTAATATGGTCACCATTTTGACAATTGGGTCGACCTATTTGGATAAATTTGTGAGCCGGACCCAATAGGTGTTAATCCAATAGGTGTTAATCCAATAGGTCAGTCCAATTCAACATTGGACTCAGACTTTAATTCAAAGAGACAGTCACAATGATTGGGTCATGAACTTGACTATTGGGTTTGGGCCTTCACTTGTAGCCCATGGTTGATCTCTACTTAAGTTGTTGTTTGCGTTTGTCACATAACTGTCTACTCGTAATAATAGAGAAAGTGTATATAAGATATCAATTTGGGTTCATCGCATAGCCTTCATAACTGTTCACCAAATATCTTTTAACTTGCTAATTTAAGTTGATCATTCCTAAAATCTATAATGCAATTAGGTTATTGGTGAAAATTGTTAGTCTCTTTAATTGAAAAATATATAAATTTTATATGATAATGTGATTAAGATTATAGGAAGGAATAAGATGTTTAATAAGTAACAAACTTGGTTTTGGAAGAATATGAAACAATAATTATGTGTAAAGTAGAGGTAGAAAAAAGAAAGAGAAGCCGTAGGAGGGAAACTCCATTTTATATAATAGAAAAGGCATGATTGATATATTCTCTTTATGTTATGAAAAATGATATATGTAGAGGTCAATGGATGTGGGTACCAAAGTCAAGCAAAAGCATTGAACACAAAGGATCATAATGGCCCCATCGCCATCTATATTGAATTTAAATCTTTGACCAAACAGACCACAAAACAAACCATGCTCATCCACATGGGGGTTTCCCAATATTAAAATTATCCTTTCAAAATATATATACTACTACAAAATTTCTCATCAATTAAAATTCTCAAAACAGAGGCTAACTAAGTTGAAAGTATATTTTTTAAAGGTTTAAAATCTTTAACTTTTAACCAACCACATGAGTAGTTTGTAAATATTTAATATTTGAAACTAATTATCAAAATCTTTGTCACAATGTAAATGGCC

Coding sequence (CDS)

ATGAAGAAACTGGAGCTCATCTTCATCCCCACCCCGATCATCGGCCACCTGACCTCCGCCCTCCAACTCGCCCACCTTCTCGTAACTCGACACCCTTTTCTCTCTATCACTATTTTCATCATTAAAATCCCTTTCCCCACCAGATCTGCCGATCAAATTCAATCTCTCTGTTCCTCCTATGCCAACCATCGCCTCCGATTCTTCACTCTCCCGGAGCAACCCATCCCCGGCAACACTAATAAAACCACCATCTTAAAACCCCTCGTCGAATCCCAGAAACAAAATGTCGCCGACGCCGTCGCCAATCTCATCGCCGCGCCGGATTCCCCTACACTCGCTGGCTTCGTCGTCGACATGTTCTGCATTCCAATGCTGGATGTAGCCAAACAATTTTCTGTTCCCACTTTCGTCTTCTACACTTCCAGCGCTTCCTTCCTTGCCCTTCTCTTCCATCTTCAAGAACTCTACGACTATGAATTCAATCACGACATGGACCAATTGCTCAACTCTGTAACAGAGTTTGCCCTCCCGGGTTTCAAAAATCCGATTCCGAGGAAAGTGATTTCCACCATTTTTTACGATAAGGAAACGATTGAATGGGCGCACAATCTCACTCGCAAGTTCAGAGAAGCAAGTGGGTTTTTAGTAAACACATTTTCCGAGCTCGAATCCGGTGCGATTAACTGGTTCGCCAATCAGAATCTCCCTCCGGTGTACGCCGTTGGACCTATTTTGAATGTGAAGGAAAAAAATCCCCAAATCGAACGGGATGAGATTTTGAAGTGGTTAGACGAGCAGCCACCGTCGTCGGTGGTGCTCCTCTGTTTCGGATCAATGGGAATCTTCAATGAATCTCAAACCAAAGAGATTGCAGATGCCTTAGAGCGAAGTGGAGTCCGATTCATCTGGTCCATACGGCAGGTACCACCGGAGAGTGTTCTGCCGGAAGGATTCGTGGATCGGACGAGCGGAATGGGAAAAGTGATGGGGTGGGCGCCGCAGATGGAAATATTGGAACATCCGGCGACGGGAGGGTTTGTGTCGCATTGCGGGTGGAATTCAGTGCTGGAGAGTTTGTGGAACGGTGTGGCAGTGGCGACGTGGCCGATGTATGCGGAGCAGCAGTTGAATGCGTTTCACATGGCGGTGGAGTTGGGAGTGGGGGTGGAGGTGTCCTTGGATTATAGTATGGTGGGGGCGGCGGAGGGGGAGTTGAGGGCGGATAAGATTGAGGCGGGGATACGAAAGTTGATGGAAGGTTCAGAGGAGATGAAGAAGGGGGTGATGGTTAAGAGTGAGGAAAGTAAGAAGGCAACAATGGAGGATGGATCTTCTTTCAATGATCTTAATCGTTTTATCGATCATGTGTTTCATAAAATTAATACTTGTTGA

Protein sequence

MKKLELIFIPTPIIGHLTSALQLAHLLVTRHPFLSITIFIIKIPFPTRSADQIQSLCSSYANHRLRFFTLPEQPIPGNTNKTTILKPLVESQKQNVADAVANLIAAPDSPTLAGFVVDMFCIPMLDVAKQFSVPTFVFYTSSASFLALLFHLQELYDYEFNHDMDQLLNSVTEFALPGFKNPIPRKVISTIFYDKETIEWAHNLTRKFREASGFLVNTFSELESGAINWFANQNLPPVYAVGPILNVKEKNPQIERDEILKWLDEQPPSSVVLLCFGSMGIFNESQTKEIADALERSGVRFIWSIRQVPPESVLPEGFVDRTSGMGKVMGWAPQMEILEHPATGGFVSHCGWNSVLESLWNGVAVATWPMYAEQQLNAFHMAVELGVGVEVSLDYSMVGAAEGELRADKIEAGIRKLMEGSEEMKKGVMVKSEESKKATMEDGSSFNDLNRFIDHVFHKINTC*
Homology
BLAST of CsGy4G020580 vs. ExPASy Swiss-Prot
Match: Q66PF3 (Putative UDP-glucose flavonoid 3-O-glucosyltransferase 3 OS=Fragaria ananassa OX=3747 GN=GT3 PE=2 SV=1)

HSP 1 Score: 416.4 bits (1069), Expect = 4.3e-115
Identity = 235/482 (48.76%), Positives = 311/482 (64.52%), Query Frame = 0

Query: 2   KKLELIFIPTPIIGHLTSALQLAHLLVTRHPFLSITIFIIKIPFPTRSAD-QIQSLC--S 61
           K  EL+ IP+P IGHL S L++A LLV+R   L IT+ I+  P  ++  D  +QSL   S
Sbjct: 3   KPAELVLIPSPGIGHLVSTLEIAKLLVSRDDKLFITVLIMHFPAVSKGTDAYVQSLADSS 62

Query: 62  SYANHRLRFFTLPEQPIPGNTNKT-TILKPLVESQKQNVADAVANLIAAPDSPT--LAGF 121
           S  + R+ F  LP   +          L   VESQ+ +V DAVANL    DS T  LAGF
Sbjct: 63  SPISQRINFINLPHTNMDHTEGSVRNSLVGFVESQQPHVKDAVANL---RDSKTTRLAGF 122

Query: 122 VVDMFCIPMLDVAKQFSVPTFVFYTSSASFLALLFHLQELYDYEFNHDMDQLLNSVTEFA 181
           VVDMFC  M++VA Q  VP++VF+TS A+ L LLFHLQEL D ++N D  +  +S  E  
Sbjct: 123 VVDMFCTTMINVANQLGVPSYVFFTSGAATLGLLFHLQELRD-QYNKDCTEFKDSDAELI 182

Query: 182 LPGFKNPIPRKVISTIFYDKETIEWAHNLTRKFREASGFLVNTFSELESGAINWFANQ-N 241
           +P F NP+P KV+      K++ E   N+ ++FRE  G LVNTF++LES A++  ++   
Sbjct: 183 IPSFFNPLPAKVLPGRMLVKDSAEPFLNVIKRFRETKGILVNTFTDLESHALHALSSDAE 242

Query: 242 LPPVYAVGPILNVKEKNPQIERDE------ILKWLDEQPPSSVVLLCFGSMGIFNESQTK 301
           +PPVY VGP+LN+     +++ DE      ILKWLD+QPP SVV LCFGSMG F+ESQ +
Sbjct: 243 IPPVYPVGPLLNLNSNESRVDSDEVKKKNDILKWLDDQPPLSVVFLCFGSMGSFDESQVR 302

Query: 302 EIADALERSGVRFIWSIRQVPP-------------ESVLPEGFVDRTSGMGKVMGWAPQM 361
           EIA+ALE +G RF+WS+R+ PP               VLPEGF+DRT G+GKV+GWAPQ+
Sbjct: 303 EIANALEHAGHRFLWSLRRSPPTGKVAFPSDYDDHTGVLPEGFLDRTGGIGKVIGWAPQV 362

Query: 362 EILEHPATGGFVSHCGWNSVLESLWNGVAVATWPMYAEQQLNAFHMAVELGVGVEVSLDY 421
            +L HP+ GGFVSHCGWNS LESLW+GV VATWP+YAEQQLNAF    EL + VE+ + Y
Sbjct: 363 AVLAHPSVGGFVSHCGWNSTLESLWHGVPVATWPLYAEQQLNAFQPVKELELAVEIDMSY 422

Query: 422 SMVGAAEGELRADKIEAGIRKLME-GSEEMKKGVMVKSEESKKATMEDGSSFNDLNRFID 457
                +   + A +IE GIR++ME  S +++K V   SE+ KKA M+ GSS+  L  FID
Sbjct: 423 R--SKSPVLVSAKEIERGIREVMELDSSDIRKRVKEMSEKGKKALMDGGSSYTSLGHFID 478
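
Each HSP header reports identity and positive counts as a fraction of the alignment length plus a percentage. A minimal sketch for recomputing those percentages from a header line follows. The function name `hsp_percentages` is hypothetical, and the pattern matches any `label = n/m (x%)` pair rather than relying on a particular label spelling.

```python
import re

# Matches pairs such as "Identity = 235/482 (48.76%)".
STAT_RE = re.compile(r"(\w+) = (\d+)/(\d+) \((\d+\.\d+)%\)")


def hsp_percentages(line):
    """Recompute each 'label = n/m' pair on an HSP header line as a percentage."""
    stats = {}
    for label, matched, aligned, _reported in STAT_RE.findall(line):
        stats[label] = round(100 * int(matched) / int(aligned), 2)
    return stats


header = "Identity = 235/482 (48.76%), Positives = 311/482 (64.52%), Query Frame = 0"
for label, percent in hsp_percentages(header).items():
    print(f"{label}: {percent:.2f}%")
```

The denominator is the alignment length including gaps, which is why it can exceed the 463-residue query length.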

BLAST of CsGy4G020580 vs. ExPASy Swiss-Prot
Match: D3THI6 (UDP-glycosyltransferase 71A15 OS=Malus domestica OX=3750 GN=UGT71A15 PE=1 SV=1)

HSP 1 Score: 411.4 bits (1056), Expect = 1.4e-113
Identity = 223/477 (46.75%), Positives = 309/477 (64.78%), Query Frame = 0

Query: 5   ELIFIPTPIIGHLTSALQLAHLLVTRHPFLSITIFIIKIPF--PTRSADQIQSLCSSYAN 64
           +L+F+P P IGH+ S +++A  L  R   L IT+ ++K+P+  P  + D       S  +
Sbjct: 6   QLVFVPAPGIGHIVSTVEMAKQLAARDDQLFITVLVMKLPYAQPFTNTD-------SSIS 65

Query: 65  HRLRFFTLPE-QPIPGN--TNKTTILKPLVESQKQNVADAVANLIAAPD------SPTLA 124
           HR+ F  LPE QP   +   N  +  +  VE+ K +V DAV N++   D       P LA
Sbjct: 66  HRINFVNLPEAQPDKQDIVPNPGSFFRMFVENHKSHVRDAVINVLPESDQSESTSKPRLA 125

Query: 125 GFVVDMFCIPMLDVAKQFSVPTFVFYTSSASFLALLFHLQELYDYEFNHDMDQLLNSVTE 184
           GFV+DMF   ++DVA +F VP+++F+TS+AS LAL+ H Q L D E   D+ +L +S  E
Sbjct: 126 GFVLDMFSASLIDVANEFKVPSYLFFTSNASALALMSHFQSLRD-EGGIDITELTSSTAE 185

Query: 185 FALPGFKNPIPRKVISTIFYDKETIEWAHNLTRKFREASGFLVNTFSELESGAINWF-AN 244
            A+P F NP P  V+     D E+ +   N   K+++  G LVNTF ELES A+++  + 
Sbjct: 186 LAVPSFINPYPAAVLPGSLLDMESTKSTLNHVSKYKQTKGILVNTFMELESHALHYLDSG 245

Query: 245 QNLPPVYAVGPILNVKEKNPQIERDEILKWLDEQPPSSVVLLCFGSMGIFNESQTKEIAD 304
             +PPVY VGP+LN+K  + + +  +IL+WLD+QPP SVV LCFGSMG F E+Q KEIA 
Sbjct: 246 DKIPPVYPVGPLLNLKSSD-EDKASDILRWLDDQPPFSVVFLCFGSMGSFGEAQVKEIAC 305

Query: 305 ALERSGVRFIWSIRQVPPE-------------SVLPEGFVDRTSGMGKVMGWAPQMEILE 364
           ALE SG RF+WS+R+ PP+             +VLPEGF+DRT+ +GKV+GWAPQ  IL 
Sbjct: 306 ALEHSGHRFLWSLRRPPPQGKRAMPSDYEDLKTVLPEGFLDRTATVGKVIGWAPQAAILG 365

Query: 365 HPATGGFVSHCGWNSVLESLWNGVAVATWPMYAEQQLNAFHMAVELGVGVEVSLDYSMVG 424
           HPATGGFVSHCGWNS LESLWNGV +A WP+YAEQ LNAF + VELG+ VE+ +DY    
Sbjct: 366 HPATGGFVSHCGWNSTLESLWNGVPIAAWPLYAEQNLNAFQLVVELGLAVEIKMDYRR-- 425

Query: 425 AAEGELRADKIEAGIRKLMEGSEEMKKGVMVKSEESKKATMEDGSSFNDLNRFIDHV 457
            ++  + A+ IE GIR++ME   +++K V   SE+SKKA ++ GSS++ L RFID +
Sbjct: 426 DSDVVVSAEDIERGIRRVMELDSDVRKRVKEMSEKSKKALVDGGSSYSSLGRFIDKI 471

BLAST of CsGy4G020580 vs. ExPASy Swiss-Prot
Match: D3UAG1 (UDP-glycosyltransferase 71A16 OS=Pyrus communis OX=23211 GN=UGT71A16 PE=1 SV=1)

HSP 1 Score: 409.5 bits (1051), Expect = 5.2e-113
Identity = 220/478 (46.03%), Positives = 308/478 (64.44%), Query Frame = 0

Query: 2   KKLELIFIPTPIIGHLTSALQLAHLLVTRHPFLSITIFIIKIPFPTRSADQIQSLCSSYA 61
           +  +L+F+P P IGH+ S +++A  LV R   L IT+ ++K+P+     DQ  +   S  
Sbjct: 3   RSAQLVFVPAPGIGHIVSTVEMAKQLVARDDQLFITVLVMKLPY-----DQPFTNTDSSI 62

Query: 62  NHRLRFFTLPEQPIPGN---TNKTTILKPLVESQKQNVADAVANLIAAPD------SPTL 121
           +HR+ F  LPE  +       N  +  +  VE+ K +V DAV NL+   D       P L
Sbjct: 63  SHRINFVNLPEAQLDKQDTVPNPGSFFRMFVENHKTHVRDAVINLLPESDQSESTSKPRL 122

Query: 122 AGFVVDMFCIPMLDVAKQFSVPTFVFYTSSASFLALLFHLQELYDYEFNHDMDQLLNSVT 181
           AGFV+DMF   ++DVA +F VP++VF+TS++S LALL H Q L D E   D+ +L +S  
Sbjct: 123 AGFVLDMFSASLIDVANEFEVPSYVFFTSNSSTLALLSHFQSLRD-EGGIDITELTSSTA 182

Query: 182 EFALPGFKNPIPRKVISTIFYDKETIEWAHNLTRKFREASGFLVNTFSELESGAINWF-A 241
           E A+P F NP P  V+   F DKE+ +   N   ++++  G LVNTF ELES A+++  +
Sbjct: 183 ELAVPSFINPYPVAVLPGSFLDKESTKSTLNNVGRYKQTKGILVNTFLELESHALHYLDS 242

Query: 242 NQNLPPVYAVGPILNVKEKNPQIERDEILKWLDEQPPSSVVLLCFGSMGIFNESQTKEIA 301
              +PPVY VGP+LN+K  + + +  +IL+WLD+QPP SVV LCFGSMG F ++Q KEIA
Sbjct: 243 GVKIPPVYPVGPLLNLKSSH-EDKGSDILRWLDDQPPLSVVFLCFGSMGSFGDAQVKEIA 302

Query: 302 DALERSGVRFIWSIRQVPP-------------ESVLPEGFVDRTSGMGKVMGWAPQMEIL 361
             LE SG RF+WS+RQ P              ++VLPEGF+DRT+ +G+V+GWAPQ  IL
Sbjct: 303 CTLEHSGHRFLWSLRQPPSKGKRALPSDYADLKTVLPEGFLDRTATVGRVIGWAPQAAIL 362

Query: 362 EHPATGGFVSHCGWNSVLESLWNGVAVATWPMYAEQQLNAFHMAVELGVGVEVSLDYSMV 421
            HPA GGFVSHCGWNS LES+WNGV +A WPMYAEQ +NAF + VELG+ VE+ +DY   
Sbjct: 363 GHPAIGGFVSHCGWNSTLESIWNGVPIAAWPMYAEQNMNAFQLVVELGLAVEIKMDYRK- 422

Query: 422 GAAEGELRADKIEAGIRKLMEGSEEMKKGVMVKSEESKKATMEDGSSFNDLNRFIDHV 457
             ++  + A+ IE GIR++ME   +++K V   SE+SKKA ++ GSS++ L RFID +
Sbjct: 423 -DSDVVVSAEDIERGIRQVMELDSDVRKRVKEMSEKSKKALVDGGSSYSSLGRFIDQI 471

BLAST of CsGy4G020580 vs. ExPASy Swiss-Prot
Match: Q2V6K0 (UDP-glucose flavonoid 3-O-glucosyltransferase 6 OS=Fragaria ananassa OX=3747 GN=GT6 PE=1 SV=1)

HSP 1 Score: 407.9 bits (1047), Expect = 1.5e-112
Identity = 224/479 (46.76%), Positives = 303/479 (63.26%), Query Frame = 0

Query: 2   KKLELIFIPTPIIGHLTSALQLAHLLVTRHPFLSITIFIIKIPFPTRSAD-QIQSLC--S 61
           K  ELIFIP P IGH+ S +++A LL+ R   L ITI I+K PF    +D  I+SL    
Sbjct: 3   KASELIFIPIPGIGHIVSTVEIAKLLLCRDDNLFITILIMKFPFTADGSDVYIKSLAVDP 62

Query: 62  SYANHRLRFFTLPEQPIPGNTNKTTILKPLVESQKQNVADAVANLIAAPDSPT-LAGFVV 121
           S    R+RF  LP++   G     T     ++S K +V DAV  L+      T +AGFV+
Sbjct: 63  SLKTQRIRFVNLPQEHFQG--TGATGFFTFIDSHKSHVKDAVTRLMETKSETTRIAGFVI 122

Query: 122 DMFCIPMLDVAKQFSVPTFVFYTSSASFLALLFHLQELYDYEFNHDMDQLLNSVTEFALP 181
           DMFC  M+D+A +F +P++VFYTS A+ L L+FHLQ L D E N D  +  +S  E  + 
Sbjct: 123 DMFCTGMIDLANEFGLPSYVFYTSGAADLGLMFHLQALRDEE-NKDCTEFKDSDAELVVS 182

Query: 182 GFKNPIP-RKVISTIFYDKETIEWAHNLTRKFREASGFLVNTFSELESGAINWFANQ-NL 241
            F NP+P  +V+ ++ ++KE   +  N  +++RE  G LVNTF ELE  AI   ++   +
Sbjct: 183 SFVNPLPAARVLPSVVFEKEGGNFFLNFAKRYRETKGILVNTFLELEPHAIQSLSSDGKI 242

Query: 242 PPVYAVGPILNVKEKNPQI------ERDEILKWLDEQPPSSVVLLCFGSMGIFNESQTKE 301
            PVY VGPILNVK +  Q+      ++ +IL+WLD+QPPSSVV LCFGSMG F E Q KE
Sbjct: 243 LPVYPVGPILNVKSEGNQVSSEKSKQKSDILEWLDDQPPSSVVFLCFGSMGCFGEDQVKE 302

Query: 302 IADALERSGVRFIWSIRQVPPE------------SVLPEGFVDRTSGMGKVMGWAPQMEI 361
           IA ALE+ G+RF+WS+RQ   E            +VLPEGF+DRT+ +GKV+GWAPQ+ I
Sbjct: 303 IAHALEQGGIRFLWSLRQPSKEKIGFPSDYTDYKAVLPEGFLDRTTDLGKVIGWAPQLAI 362

Query: 362 LEHPATGGFVSHCGWNSVLESLWNGVAVATWPMYAEQQLNAFHMAVELGVGVEVSLDYSM 421
           L HPA GGFVSHCGWNS LES+W GV +ATWP YAEQQ+NAF +  EL + VE+ + Y  
Sbjct: 363 LAHPAVGGFVSHCGWNSTLESIWYGVPIATWPFYAEQQVNAFELVKELKLAVEIDMGYRK 422

Query: 422 VGAAEGELRADKIEAGIRKLMEGSEEMKKGVMVKSEESKKATMEDGSSFNDLNRFIDHV 457
                  +  + IE GI+++ME   E++K V   S+ S+KA  EDGSS++ L RF+D +
Sbjct: 423 DSGV--IVSRENIEKGIKEVMEQESELRKRVKEMSQMSRKALEEDGSSYSSLGRFLDQI 476

BLAST of CsGy4G020580 vs. ExPASy Swiss-Prot
Match: Q6VAB2 (UDP-glycosyltransferase 71E1 OS=Stevia rebaudiana OX=55670 GN=UGT71E1 PE=2 SV=1)

HSP 1 Score: 390.6 bits (1002), Expect = 2.5e-107
Identity = 217/477 (45.49%), Positives = 300/477 (62.89%), Query Frame = 0

Query: 1   MKKLELIFIPTPIIGHLTSALQLAHLLVTRHPFLSITIFIIKIPFPTRSADQIQSLCSSY 60
           M   EL+FIP+P  GHL   ++LA LL+ R   LS+TI ++ +    +   + +    S 
Sbjct: 1   MSTSELVFIPSPGAGHLPPTVELAKLLLHRDQRLSVTIIVMNLWLGPKHNTEARPCVPS- 60

Query: 61  ANHRLRFFTLP-EQPIPGNTNKTTILKPLVESQKQNVADAVANLIAAPDSPTLAGFVVDM 120
               LRF  +P ++      +  T +   VE  K  V D V  +I + DS  LAGFV+DM
Sbjct: 61  ----LRFVDIPCDESTMALISPNTFISAFVEHHKPRVRDIVRGIIES-DSVRLAGFVLDM 120

Query: 121 FCIPMLDVAKQFSVPTFVFYTSSASFLALLFHLQELYDYEFNHDMDQLLNSVTEFALPGF 180
           FC+PM DVA +F VP++ ++TS A+ L L+FHLQ   D+E  +D  +L NS TE ++P +
Sbjct: 121 FCMPMSDVANEFGVPSYNYFTSGAATLGLMFHLQWKRDHE-GYDATELKNSDTELSVPSY 180

Query: 181 KNPIPRKVISTIFYDKE-TIEWAHNLTRKFREASGFLVNTFSELESGAINWFANQN--LP 240
            NP+P KV+  +  DKE   +   +L  + RE+ G +VN+   +E  A+ + ++ N  +P
Sbjct: 181 VNPVPAKVLPEVVLDKEGGSKMFLDLAERIRESKGIIVNSCQAIERHALEYLSSNNNGIP 240

Query: 241 PVYAVGPILNVKEKNPQIERDEILKWLDEQPPSSVVLLCFGSMGIFNESQTKEIADALER 300
           PV+ VGPILN++ K    + DEI++WL+EQP SSVV LCFGSMG FNE Q KEIA A+ER
Sbjct: 241 PVFPVGPILNLENKKDDAKTDEIMRWLNEQPESSVVFLCFGSMGSFNEKQVKEIAVAIER 300

Query: 301 SGVRFIWSIRQVPP-------------ESVLPEGFVDRTSGMGKVMGWAPQMEILEHPAT 360
           SG RF+WS+R+  P             E VLPEGF+ RTS +GKV+GWAPQM +L HP+ 
Sbjct: 301 SGHRFLWSLRRPTPKEKIEFPKEYENLEEVLPEGFLKRTSSIGKVIGWAPQMAVLSHPSV 360

Query: 361 GGFVSHCGWNSVLESLWNGVAVATWPMYAEQQLNAFHMAVELGVGVEVSLDY---SMVGA 420
           GGFVSHCGWNS LES+W GV +A WP+YAEQ LNAF + VELG+  E+ +DY   +  G 
Sbjct: 361 GGFVSHCGWNSTLESMWCGVPMAAWPLYAEQTLNAFLLVVELGLAAEIRMDYRTDTKAGY 420

Query: 421 AEG-ELRADKIEAGIRKLMEGSEEMKKGVMVKSEESKKATMEDGSSFNDLNRFIDHV 457
             G E+  ++IE GIRKLM   E   K   VK E+S+ A +E GSS+  + +FI+HV
Sbjct: 421 DGGMEVTVEEIEDGIRKLMSDGEIRNKVKDVK-EKSRAAVVEGGSSYASIGKFIEHV 469

BLAST of CsGy4G020580 vs. NCBI nr
Match: XP_004146061.3 (anthocyanidin 3-O-glucosyltransferase 2 [Cucumis sativus] >KAE8649802.1 hypothetical protein Csa_012969 [Cucumis sativus])

HSP 1 Score: 921 bits (2381), Expect = 0.0
Identity = 463/463 (100.00%), Positives = 463/463 (100.00%), Query Frame = 0

Query: 1   MKKLELIFIPTPIIGHLTSALQLAHLLVTRHPFLSITIFIIKIPFPTRSADQIQSLCSSY 60
           MKKLELIFIPTPIIGHLTSALQLAHLLVTRHPFLSITIFIIKIPFPTRSADQIQSLCSSY
Sbjct: 1   MKKLELIFIPTPIIGHLTSALQLAHLLVTRHPFLSITIFIIKIPFPTRSADQIQSLCSSY 60

Query: 61  ANHRLRFFTLPEQPIPGNTNKTTILKPLVESQKQNVADAVANLIAAPDSPTLAGFVVDMF 120
           ANHRLRFFTLPEQPIPGNTNKTTILKPLVESQKQNVADAVANLIAAPDSPTLAGFVVDMF
Sbjct: 61  ANHRLRFFTLPEQPIPGNTNKTTILKPLVESQKQNVADAVANLIAAPDSPTLAGFVVDMF 120

Query: 121 CIPMLDVAKQFSVPTFVFYTSSASFLALLFHLQELYDYEFNHDMDQLLNSVTEFALPGFK 180
           CIPMLDVAKQFSVPTFVFYTSSASFLALLFHLQELYDYEFNHDMDQLLNSVTEFALPGFK
Sbjct: 121 CIPMLDVAKQFSVPTFVFYTSSASFLALLFHLQELYDYEFNHDMDQLLNSVTEFALPGFK 180

Query: 181 NPIPRKVISTIFYDKETIEWAHNLTRKFREASGFLVNTFSELESGAINWFANQNLPPVYA 240
           NPIPRKVISTIFYDKETIEWAHNLTRKFREASGFLVNTFSELESGAINWFANQNLPPVYA
Sbjct: 181 NPIPRKVISTIFYDKETIEWAHNLTRKFREASGFLVNTFSELESGAINWFANQNLPPVYA 240

Query: 241 VGPILNVKEKNPQIERDEILKWLDEQPPSSVVLLCFGSMGIFNESQTKEIADALERSGVR 300
           VGPILNVKEKNPQIERDEILKWLDEQPPSSVVLLCFGSMGIFNESQTKEIADALERSGVR
Sbjct: 241 VGPILNVKEKNPQIERDEILKWLDEQPPSSVVLLCFGSMGIFNESQTKEIADALERSGVR 300

Query: 301 FIWSIRQVPPESVLPEGFVDRTSGMGKVMGWAPQMEILEHPATGGFVSHCGWNSVLESLW 360
           FIWSIRQVPPESVLPEGFVDRTSGMGKVMGWAPQMEILEHPATGGFVSHCGWNSVLESLW
Sbjct: 301 FIWSIRQVPPESVLPEGFVDRTSGMGKVMGWAPQMEILEHPATGGFVSHCGWNSVLESLW 360

Query: 361 NGVAVATWPMYAEQQLNAFHMAVELGVGVEVSLDYSMVGAAEGELRADKIEAGIRKLMEG 420
           NGVAVATWPMYAEQQLNAFHMAVELGVGVEVSLDYSMVGAAEGELRADKIEAGIRKLMEG
Sbjct: 361 NGVAVATWPMYAEQQLNAFHMAVELGVGVEVSLDYSMVGAAEGELRADKIEAGIRKLMEG 420

Query: 421 SEEMKKGVMVKSEESKKATMEDGSSFNDLNRFIDHVFHKINTC 463
           SEEMKKGVMVKSEESKKATMEDGSSFNDLNRFIDHVFHKINTC
Sbjct: 421 SEEMKKGVMVKSEESKKATMEDGSSFNDLNRFIDHVFHKINTC 463

BLAST of CsGy4G020580 vs. NCBI nr
Match: TYK28034.1 (anthocyanidin 3-O-glucosyltransferase 2-like [Cucumis melo var. makuwa])

HSP 1 Score: 832 bits (2149), Expect = 1.25e-302
Identity = 417/462 (90.26%), Positives = 439/462 (95.02%), Query Frame = 0

Query: 1   MKKLELIFIPTPIIGHLTSALQLAHLLVTRHPFLSITIFIIKIPFPTRSADQIQSLCSSY 60
           MKK+ELIFIPTP+IGHLTSALQLAHLLV+RHPFLSITIFI K+PFPTRSA QIQSLCSSY
Sbjct: 1   MKKVELIFIPTPVIGHLTSALQLAHLLVSRHPFLSITIFIFKVPFPTRSAHQIQSLCSSY 60

Query: 61  ANHRLRFFTLPEQPIPGNTNKTTILKPLVESQKQNVADAVANLIAAPDSPTLAGFVVDMF 120
           AN RLRFFTLPEQPIP ++ K TILKPLVESQKQN+ADAVANLIAAPDSPTLAGFVVDMF
Sbjct: 61  ANDRLRFFTLPEQPIPADSKKATILKPLVESQKQNIADAVANLIAAPDSPTLAGFVVDMF 120

Query: 121 CIPMLDVAKQFSVPTFVFYTSSASFLALLFHLQELYDYEFNHDMDQLLNSVTEFALPGFK 180
           CIPM+DV KQFSVPTFVFYTSSASFLALLFHLQELYD+EFNHDMDQLLNS TEFA+PG K
Sbjct: 121 CIPMVDVTKQFSVPTFVFYTSSASFLALLFHLQELYDHEFNHDMDQLLNSATEFAVPGLK 180

Query: 181 NPIPRKVISTIFYDKETIEWAHNLTRKFREASGFLVNTFSELESGAINWFANQNLPPVYA 240
           NPIPRKVIS++F+DKET EWAHNLTRKFREASGFLVNTF ELESGAINWF  QNLPPVYA
Sbjct: 181 NPIPRKVISSMFFDKETNEWAHNLTRKFREASGFLVNTFFELESGAINWFGKQNLPPVYA 240

Query: 241 VGPILNVKEKNPQIERDEILKWLDEQPPSSVVLLCFGSMGIFNESQTKEIADALERSGVR 300
           VGPILN+KEK+PQ +RDEILKWLDEQPPSSVVLLCFGSMG+FNESQ+KEIADALERSGVR
Sbjct: 241 VGPILNLKEKDPQ-KRDEILKWLDEQPPSSVVLLCFGSMGMFNESQSKEIADALERSGVR 300

Query: 301 FIWSIRQVPPESVLPEGFVDRTSGMGKVMGWAPQMEILEHPATGGFVSHCGWNSVLESLW 360
           FIWSIRQVPPESVLPEGFV RT G GKVMGWAPQMEILEHPATGGFVSHCGWNSVLESLW
Sbjct: 301 FIWSIRQVPPESVLPEGFVGRTRGRGKVMGWAPQMEILEHPATGGFVSHCGWNSVLESLW 360

Query: 361 NGVAVATWPMYAEQQLNAFHMAVELGVGVEVSLDYSMVGAAEGELRADKIEAGIRKLMEG 420
           +GVAVATWPMYAEQQLNAF MAVELGV VEVSLDYSMVGA E ELRA+KIEAGIRKLMEG
Sbjct: 361 SGVAVATWPMYAEQQLNAFQMAVELGVAVEVSLDYSMVGAGEEELRAEKIEAGIRKLMEG 420

Query: 421 SEEMKKGVMVKSEESKKATMEDGSSFNDLNRFIDHVFHKINT 462
           SEE+KK VMVKSEESKKATMEDGSSFNDLNRFI+HVFH INT
Sbjct: 421 SEEIKKAVMVKSEESKKATMEDGSSFNDLNRFINHVFHNINT 461

BLAST of CsGy4G020580 vs. NCBI nr
Match: KAA0066887.1 (anthocyanidin 3-O-glucosyltransferase 2-like [Cucumis melo var. makuwa])

HSP 1 Score: 827 bits (2137), Expect = 7.50e-301
Identity = 415/460 (90.22%), Positives = 437/460 (95.00%), Query Frame = 0

Query: 1   MKKLELIFIPTPIIGHLTSALQLAHLLVTRHPFLSITIFIIKIPFPTRSADQIQSLCSSY 60
           MKK+ELIFIPTP+IGHLTSALQLAHLLV+RHPFLSITIFI K+PFPTRSA QIQSLCSSY
Sbjct: 1   MKKVELIFIPTPVIGHLTSALQLAHLLVSRHPFLSITIFIFKVPFPTRSAHQIQSLCSSY 60

Query: 61  ANHRLRFFTLPEQPIPGNTNKTTILKPLVESQKQNVADAVANLIAAPDSPTLAGFVVDMF 120
           AN RLRFFTLPEQPIP ++ K TILKPLVESQKQN+ADAVANLIAAPDSPTLAGFVVDMF
Sbjct: 61  ANDRLRFFTLPEQPIPADSKKATILKPLVESQKQNIADAVANLIAAPDSPTLAGFVVDMF 120

Query: 121 CIPMLDVAKQFSVPTFVFYTSSASFLALLFHLQELYDYEFNHDMDQLLNSVTEFALPGFK 180
           CIPM+DV KQFSVPTFVFYTSSASFLALLFHLQELYD+EFNHDMDQLLNS TEFA+PG K
Sbjct: 121 CIPMVDVTKQFSVPTFVFYTSSASFLALLFHLQELYDHEFNHDMDQLLNSATEFAVPGLK 180

Query: 181 NPIPRKVISTIFYDKETIEWAHNLTRKFREASGFLVNTFSELESGAINWFANQNLPPVYA 240
           NPIPRKVIS++F+DKET EWAHNLTRKFREASGFLVNTF ELESGAINWF  QNLPPVYA
Sbjct: 181 NPIPRKVISSMFFDKETNEWAHNLTRKFREASGFLVNTFFELESGAINWFGKQNLPPVYA 240

Query: 241 VGPILNVKEKNPQIERDEILKWLDEQPPSSVVLLCFGSMGIFNESQTKEIADALERSGVR 300
           VGPILN+KEK+PQ +RDEILKWLDEQPPSSVVLLCFGSMG+FNESQ+KEIADALERSGVR
Sbjct: 241 VGPILNLKEKDPQ-KRDEILKWLDEQPPSSVVLLCFGSMGMFNESQSKEIADALERSGVR 300

Query: 301 FIWSIRQVPPESVLPEGFVDRTSGMGKVMGWAPQMEILEHPATGGFVSHCGWNSVLESLW 360
           FIWSIRQVPPESVLPEGFV RT G GKVMGWAPQMEILEHPATGGFVSHCGWNSVLESLW
Sbjct: 301 FIWSIRQVPPESVLPEGFVGRTRGRGKVMGWAPQMEILEHPATGGFVSHCGWNSVLESLW 360

Query: 361 NGVAVATWPMYAEQQLNAFHMAVELGVGVEVSLDYSMVGAAEGELRADKIEAGIRKLMEG 420
           +GVAVATWPMYAEQQLNAF MAVELGV VEVSLDYSMVGA E ELRA+KIEAGIRKLMEG
Sbjct: 361 SGVAVATWPMYAEQQLNAFQMAVELGVAVEVSLDYSMVGAGEEELRAEKIEAGIRKLMEG 420

Query: 421 SEEMKKGVMVKSEESKKATMEDGSSFNDLNRFIDHVFHKI 460
           SEE+KK VMVKSEESKKATMEDGSSFNDLNRFI+HVFH I
Sbjct: 421 SEEIKKAVMVKSEESKKATMEDGSSFNDLNRFINHVFHNI 459

BLAST of CsGy4G020580 vs. NCBI nr
Match: KAA0066888.1 (anthocyanidin 3-O-glucosyltransferase 2-like [Cucumis melo var. makuwa])

HSP 1 Score: 822 bits (2123), Expect = 1.18e-298
Identity = 413/462 (89.39%), Positives = 435/462 (94.16%), Query Frame = 0

Query: 1   MKKLELIFIPTPIIGHLTSALQLAHLLVTRHPFLSITIFIIKIPFPTRSADQIQSLCSSY 60
           MKK+ELIFIPTP I HLTSA+QLAHLL++ HPFLSITIFI K PFPTRS  Q+QSLCSS 
Sbjct: 1   MKKVELIFIPTPTISHLTSAIQLAHLLLSPHPFLSITIFIFKDPFPTRSPHQMQSLCSSS 60

Query: 61  ANHRLRFFTLPEQPIPGNTNKTTILKPLVESQKQNVADAVANLIAAPDSPTLAGFVVDMF 120
           AN RLRFFTLPEQPIPG+  K TILKPLVE QKQNVADAVANLIAAPDSPTLAGFVVDMF
Sbjct: 61  ANDRLRFFTLPEQPIPGDAKKVTILKPLVEYQKQNVADAVANLIAAPDSPTLAGFVVDMF 120

Query: 121 CIPMLDVAKQFSVPTFVFYTSSASFLALLFHLQELYDYEFNHDMDQLLNSVTEFALPGFK 180
           CIPM+DVAKQFSVPTFVFYTSSASFL+LLFHLQELYD+EFNH+MDQLLNS TEFA+PGFK
Sbjct: 121 CIPMVDVAKQFSVPTFVFYTSSASFLSLLFHLQELYDHEFNHNMDQLLNSATEFAVPGFK 180

Query: 181 NPIPRKVISTIFYDKETIEWAHNLTRKFREASGFLVNTFSELESGAINWFANQNLPPVYA 240
           NPIPRKVIST+FYDKET EWA ++ RKF EASGFLVNTFSELESGAINWFANQNLPPVYA
Sbjct: 181 NPIPRKVISTMFYDKETNEWAFDIARKFGEASGFLVNTFSELESGAINWFANQNLPPVYA 240

Query: 241 VGPILNVKEKNPQIERDEILKWLDEQPPSSVVLLCFGSMGIFNESQTKEIADALERSGVR 300
           VGPILN+KEKNPQI+RDEILKWLDEQPPSSVVLLCFGSMG+FNESQ+KEIADALERSGVR
Sbjct: 241 VGPILNLKEKNPQIKRDEILKWLDEQPPSSVVLLCFGSMGMFNESQSKEIADALERSGVR 300

Query: 301 FIWSIRQVPPESVLPEGFVDRTSGMGKVMGWAPQMEILEHPATGGFVSHCGWNSVLESLW 360
           FIWSIRQVPPESVLPEGFV RT G GKVMGWAPQMEILEHPATGGFVSHCGWNSVLESLW
Sbjct: 301 FIWSIRQVPPESVLPEGFVGRTRGRGKVMGWAPQMEILEHPATGGFVSHCGWNSVLESLW 360

Query: 361 NGVAVATWPMYAEQQLNAFHMAVELGVGVEVSLDYSMVGAAEGELRADKIEAGIRKLMEG 420
           +GVAVATWPMYAEQQLNAF MAVELGV VEVSLDYSMVGA E ELRA+KIEAGIRKLMEG
Sbjct: 361 SGVAVATWPMYAEQQLNAFQMAVELGVAVEVSLDYSMVGAGEEELRAEKIEAGIRKLMEG 420

Query: 421 SEEMKKGVMVKSEESKKATMEDGSSFNDLNRFIDHVFHKINT 462
           SEE+KK VMVKSEESKKATMEDGSSFNDLNRFI+HVFH INT
Sbjct: 421 SEEIKKAVMVKSEESKKATMEDGSSFNDLNRFINHVFHNINT 462

BLAST of CsGy4G020580 vs. NCBI nr
Match: XP_038900058.1 (anthocyanidin 3-O-glucosyltransferase 2-like [Benincasa hispida])

HSP 1 Score: 763 bits (1971), Expect = 2.18e-275
Identity = 389/469 (82.94%), Positives = 415/469 (88.49%), Query Frame = 0

Query: 1   MKKLELIFIPTPIIGHLTSALQLAHLLVTRHPFLSITIFIIKIPFPTRSADQIQSLCSSY 60
           MKK ELIFIPTPIIGHLTSA+QLA+LLV RHPFLSITI I K+PFPTRSA  IQSLCSS 
Sbjct: 1   MKKAELIFIPTPIIGHLTSAVQLANLLVNRHPFLSITILIFKVPFPTRSAALIQSLCSSS 60

Query: 61  ANHRLRFFTLPEQPIPGNTNKTTILKPLVESQKQNVADAVANLIAAPDSPTLAGFVVDMF 120
              RLRF  LPEQPIP +T KTTILKPLVESQKQNVADAVANL A PDSPTLAGFVVDMF
Sbjct: 61  TTDRLRFINLPEQPIPDDTKKTTILKPLVESQKQNVADAVANLTAVPDSPTLAGFVVDMF 120

Query: 121 CIPMLDVAKQFSVPTFVFYTSSASFLALLFHLQELYDYEFNHDMDQLLNSVTEFALPGFK 180
           CIPM+DVA QFSVPTFVFYTSSASFLALLFHLQELYD EFNHDMDQLLNS TEF +PGFK
Sbjct: 121 CIPMVDVANQFSVPTFVFYTSSASFLALLFHLQELYDGEFNHDMDQLLNSATEFTVPGFK 180

Query: 181 NPIPRKVISTIFYDKETIEWAHNLTRKFREASGFLVNTFSELESGAINWFANQNLPPVYA 240
           NPIPRK+IST+F D+ET EWAH LTRKFREASGFLVNTFSELESG I WFANQNLPP+YA
Sbjct: 181 NPIPRKIISTMFIDRETTEWAHYLTRKFREASGFLVNTFSELESGPITWFANQNLPPLYA 240

Query: 241 VGPILNVKEKNPQIE---RDEILKWLDEQPPSSVVLLCFGSMGIFNESQTKEIADALERS 300
           +GPILN+K+KNPQIE   R+ ILKWLDEQPPSSVVLLCFGSMG FNESQTKEIADALER+
Sbjct: 241 IGPILNLKKKNPQIEETEREAILKWLDEQPPSSVVLLCFGSMGSFNESQTKEIADALERT 300

Query: 301 GVRFIWSIRQVPPESVLPEGFVDRTSGMGKVMGWAPQMEILEHPATGGFVSHCGWNSVLE 360
           G+RF+WSIRQ PPESVLPEGFVDR +G+GKVMGWAPQ EILEHPATGGFVSHCGWNSVLE
Sbjct: 301 GLRFVWSIRQDPPESVLPEGFVDRMAGIGKVMGWAPQAEILEHPATGGFVSHCGWNSVLE 360

Query: 361 SLWNGVAVATWPMYAEQQLNAFHMAVELGVGVEVSLDYSMVG----AAEGELRADKIEAG 420
           SLWNGVAVATWPMYAEQQLNAF MAVELGV V VSLDYSM+     AA   L ++KIEAG
Sbjct: 361 SLWNGVAVATWPMYAEQQLNAFEMAVELGVAVVVSLDYSMMAEEEAAAAARLTSEKIEAG 420

Query: 421 IRKLMEGSEEMKKGVMVKSEESKKATMEDGSSFNDLNRFIDHVFHKINT 462
           IRKLMEGSE++KK + VKS ESKKA  E GSSFNDLNRFI+ V H INT
Sbjct: 421 IRKLMEGSEDIKKAMKVKSAESKKAITEGGSSFNDLNRFINRVVHNINT 469

BLAST of CsGy4G020580 vs. ExPASy TrEMBL
Match: A0A0A0KZA0 (UDP-glucosyltransferase OS=Cucumis sativus OX=3659 GN=Csa_4G618500 PE=4 SV=1)

HSP 1 Score: 917 bits (2370), Expect = 0.0
Identity = 461/463 (99.57%), Positives = 462/463 (99.78%), Query Frame = 0

Query: 1   MKKLELIFIPTPIIGHLTSALQLAHLLVTRHPFLSITIFIIKIPFPTRSADQIQSLCSSY 60
           MKKLELIFIPTPIIGHLTSALQLAHLLVTRHPFLSITIFIIKIPFPTRSADQIQSLCSSY
Sbjct: 1   MKKLELIFIPTPIIGHLTSALQLAHLLVTRHPFLSITIFIIKIPFPTRSADQIQSLCSSY 60

Query: 61  ANHRLRFFTLPEQPIPGNTNKTTILKPLVESQKQNVADAVANLIAAPDSPTLAGFVVDMF 120
           ANHRLRFFTLPEQPIPGNTNKTTILKPLVESQKQNVADAVANLIAAPDSPTLAGFVVDMF
Sbjct: 61  ANHRLRFFTLPEQPIPGNTNKTTILKPLVESQKQNVADAVANLIAAPDSPTLAGFVVDMF 120

Query: 121 CIPMLDVAKQFSVPTFVFYTSSASFLALLFHLQELYDYEFNHDMDQLLNSVTEFALPGFK 180
           CIPMLDVAKQFSVPTFVFYTSSASFLALLFHLQELYDYEFNHDMDQLLNSVTEFALPGFK
Sbjct: 121 CIPMLDVAKQFSVPTFVFYTSSASFLALLFHLQELYDYEFNHDMDQLLNSVTEFALPGFK 180

Query: 181 NPIPRKVISTIFYDKETIEWAHNLTRKFREASGFLVNTFSELESGAINWFANQNLPPVYA 240
           NPIPRKVISTIFYDKETIEWAHNLTRKFREASGFLVNTFSELESGAINWFANQNLPPVYA
Sbjct: 181 NPIPRKVISTIFYDKETIEWAHNLTRKFREASGFLVNTFSELESGAINWFANQNLPPVYA 240

Query: 241 VGPILNVKEKNPQIERDEILKWLDEQPPSSVVLLCFGSMGIFNESQTKEIADALERSGVR 300
           VGPILNVKEKNPQIERDEILKWLDEQPPSSVVLLCFGSMGIFNESQTKEIADALERSGVR
Sbjct: 241 VGPILNVKEKNPQIERDEILKWLDEQPPSSVVLLCFGSMGIFNESQTKEIADALERSGVR 300

Query: 301 FIWSIRQVPPESVLPEGFVDRTSGMGKVMGWAPQMEILEHPATGGFVSHCGWNSVLESLW 360
           FIWSIRQVPPESVLPEGFVDRTSGMGKV+GWAPQMEILEHPATGGFVSHCGWNSVLESLW
Sbjct: 301 FIWSIRQVPPESVLPEGFVDRTSGMGKVVGWAPQMEILEHPATGGFVSHCGWNSVLESLW 360

Query: 361 NGVAVATWPMYAEQQLNAFHMAVELGVGVEVSLDYSMVGAAEGELRADKIEAGIRKLMEG 420
           NGVA ATWPMYAEQQLNAFHMAVELGVGVEVSLDYSMVGAAEGELRADKIEAGIRKLMEG
Sbjct: 361 NGVAGATWPMYAEQQLNAFHMAVELGVGVEVSLDYSMVGAAEGELRADKIEAGIRKLMEG 420

Query: 421 SEEMKKGVMVKSEESKKATMEDGSSFNDLNRFIDHVFHKINTC 463
           SEEMKKGVMVKSEESKKATMEDGSSFNDLNRFIDHVFHKINTC
Sbjct: 421 SEEMKKGVMVKSEESKKATMEDGSSFNDLNRFIDHVFHKINTC 463

BLAST of CsGy4G020580 vs. ExPASy TrEMBL
Match: A0A5D3DWY1 (Anthocyanidin 3-O-glucosyltransferase 2-like OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold384G002840 PE=4 SV=1)

HSP 1 Score: 832 bits (2149), Expect = 6.04e-303
Identity = 417/462 (90.26%), Positives = 439/462 (95.02%), Query Frame = 0

Query: 1   MKKLELIFIPTPIIGHLTSALQLAHLLVTRHPFLSITIFIIKIPFPTRSADQIQSLCSSY 60
           MKK+ELIFIPTP+IGHLTSALQLAHLLV+RHPFLSITIFI K+PFPTRSA QIQSLCSSY
Sbjct: 1   MKKVELIFIPTPVIGHLTSALQLAHLLVSRHPFLSITIFIFKVPFPTRSAHQIQSLCSSY 60

Query: 61  ANHRLRFFTLPEQPIPGNTNKTTILKPLVESQKQNVADAVANLIAAPDSPTLAGFVVDMF 120
           AN RLRFFTLPEQPIP ++ K TILKPLVESQKQN+ADAVANLIAAPDSPTLAGFVVDMF
Sbjct: 61  ANDRLRFFTLPEQPIPADSKKATILKPLVESQKQNIADAVANLIAAPDSPTLAGFVVDMF 120

Query: 121 CIPMLDVAKQFSVPTFVFYTSSASFLALLFHLQELYDYEFNHDMDQLLNSVTEFALPGFK 180
           CIPM+DV KQFSVPTFVFYTSSASFLALLFHLQELYD+EFNHDMDQLLNS TEFA+PG K
Sbjct: 121 CIPMVDVTKQFSVPTFVFYTSSASFLALLFHLQELYDHEFNHDMDQLLNSATEFAVPGLK 180

Query: 181 NPIPRKVISTIFYDKETIEWAHNLTRKFREASGFLVNTFSELESGAINWFANQNLPPVYA 240
           NPIPRKVIS++F+DKET EWAHNLTRKFREASGFLVNTF ELESGAINWF  QNLPPVYA
Sbjct: 181 NPIPRKVISSMFFDKETNEWAHNLTRKFREASGFLVNTFFELESGAINWFGKQNLPPVYA 240

Query: 241 VGPILNVKEKNPQIERDEILKWLDEQPPSSVVLLCFGSMGIFNESQTKEIADALERSGVR 300
           VGPILN+KEK+PQ +RDEILKWLDEQPPSSVVLLCFGSMG+FNESQ+KEIADALERSGVR
Sbjct: 241 VGPILNLKEKDPQ-KRDEILKWLDEQPPSSVVLLCFGSMGMFNESQSKEIADALERSGVR 300

Query: 301 FIWSIRQVPPESVLPEGFVDRTSGMGKVMGWAPQMEILEHPATGGFVSHCGWNSVLESLW 360
           FIWSIRQVPPESVLPEGFV RT G GKVMGWAPQMEILEHPATGGFVSHCGWNSVLESLW
Sbjct: 301 FIWSIRQVPPESVLPEGFVGRTRGRGKVMGWAPQMEILEHPATGGFVSHCGWNSVLESLW 360

Query: 361 NGVAVATWPMYAEQQLNAFHMAVELGVGVEVSLDYSMVGAAEGELRADKIEAGIRKLMEG 420
           +GVAVATWPMYAEQQLNAF MAVELGV VEVSLDYSMVGA E ELRA+KIEAGIRKLMEG
Sbjct: 361 SGVAVATWPMYAEQQLNAFQMAVELGVAVEVSLDYSMVGAGEEELRAEKIEAGIRKLMEG 420

Query: 421 SEEMKKGVMVKSEESKKATMEDGSSFNDLNRFIDHVFHKINT 462
           SEE+KK VMVKSEESKKATMEDGSSFNDLNRFI+HVFH INT
Sbjct: 421 SEEIKKAVMVKSEESKKATMEDGSSFNDLNRFINHVFHNINT 461

BLAST of CsGy4G020580 vs. ExPASy TrEMBL
Match: A0A5A7VIB9 (Anthocyanidin 3-O-glucosyltransferase 2-like OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold271G002940 PE=4 SV=1)

HSP 1 Score: 827 bits (2137), Expect = 3.63e-301
Identity = 415/460 (90.22%), Positives = 437/460 (95.00%), Query Frame = 0

Query: 1   MKKLELIFIPTPIIGHLTSALQLAHLLVTRHPFLSITIFIIKIPFPTRSADQIQSLCSSY 60
           MKK+ELIFIPTP+IGHLTSALQLAHLLV+RHPFLSITIFI K+PFPTRSA QIQSLCSSY
Sbjct: 1   MKKVELIFIPTPVIGHLTSALQLAHLLVSRHPFLSITIFIFKVPFPTRSAHQIQSLCSSY 60

Query: 61  ANHRLRFFTLPEQPIPGNTNKTTILKPLVESQKQNVADAVANLIAAPDSPTLAGFVVDMF 120
           AN RLRFFTLPEQPIP ++ K TILKPLVESQKQN+ADAVANLIAAPDSPTLAGFVVDMF
Sbjct: 61  ANDRLRFFTLPEQPIPADSKKATILKPLVESQKQNIADAVANLIAAPDSPTLAGFVVDMF 120

Query: 121 CIPMLDVAKQFSVPTFVFYTSSASFLALLFHLQELYDYEFNHDMDQLLNSVTEFALPGFK 180
           CIPM+DV KQFSVPTFVFYTSSASFLALLFHLQELYD+EFNHDMDQLLNS TEFA+PG K
Sbjct: 121 CIPMVDVTKQFSVPTFVFYTSSASFLALLFHLQELYDHEFNHDMDQLLNSATEFAVPGLK 180

Query: 181 NPIPRKVISTIFYDKETIEWAHNLTRKFREASGFLVNTFSELESGAINWFANQNLPPVYA 240
           NPIPRKVIS++F+DKET EWAHNLTRKFREASGFLVNTF ELESGAINWF  QNLPPVYA
Sbjct: 181 NPIPRKVISSMFFDKETNEWAHNLTRKFREASGFLVNTFFELESGAINWFGKQNLPPVYA 240

Query: 241 VGPILNVKEKNPQIERDEILKWLDEQPPSSVVLLCFGSMGIFNESQTKEIADALERSGVR 300
           VGPILN+KEK+PQ +RDEILKWLDEQPPSSVVLLCFGSMG+FNESQ+KEIADALERSGVR
Sbjct: 241 VGPILNLKEKDPQ-KRDEILKWLDEQPPSSVVLLCFGSMGMFNESQSKEIADALERSGVR 300

Query: 301 FIWSIRQVPPESVLPEGFVDRTSGMGKVMGWAPQMEILEHPATGGFVSHCGWNSVLESLW 360
           FIWSIRQVPPESVLPEGFV RT G GKVMGWAPQMEILEHPATGGFVSHCGWNSVLESLW
Sbjct: 301 FIWSIRQVPPESVLPEGFVGRTRGRGKVMGWAPQMEILEHPATGGFVSHCGWNSVLESLW 360

Query: 361 NGVAVATWPMYAEQQLNAFHMAVELGVGVEVSLDYSMVGAAEGELRADKIEAGIRKLMEG 420
           +GVAVATWPMYAEQQLNAF MAVELGV VEVSLDYSMVGA E ELRA+KIEAGIRKLMEG
Sbjct: 361 SGVAVATWPMYAEQQLNAFQMAVELGVAVEVSLDYSMVGAGEEELRAEKIEAGIRKLMEG 420

Query: 421 SEEMKKGVMVKSEESKKATMEDGSSFNDLNRFIDHVFHKI 460
           SEE+KK VMVKSEESKKATMEDGSSFNDLNRFI+HVFH I
Sbjct: 421 SEEIKKAVMVKSEESKKATMEDGSSFNDLNRFINHVFHNI 459

BLAST of CsGy4G020580 vs. ExPASy TrEMBL
Match: A0A5A7VGP8 (Anthocyanidin 3-O-glucosyltransferase 2-like OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold271G002950 PE=4 SV=1)

HSP 1 Score: 822 bits (2123), Expect = 5.73e-299
Identity = 413/462 (89.39%), Positives = 435/462 (94.16%), Query Frame = 0

Query: 1   MKKLELIFIPTPIIGHLTSALQLAHLLVTRHPFLSITIFIIKIPFPTRSADQIQSLCSSY 60
           MKK+ELIFIPTP I HLTSA+QLAHLL++ HPFLSITIFI K PFPTRS  Q+QSLCSS 
Sbjct: 1   MKKVELIFIPTPTISHLTSAIQLAHLLLSPHPFLSITIFIFKDPFPTRSPHQMQSLCSSS 60

Query: 61  ANHRLRFFTLPEQPIPGNTNKTTILKPLVESQKQNVADAVANLIAAPDSPTLAGFVVDMF 120
           AN RLRFFTLPEQPIPG+  K TILKPLVE QKQNVADAVANLIAAPDSPTLAGFVVDMF
Sbjct: 61  ANDRLRFFTLPEQPIPGDAKKVTILKPLVEYQKQNVADAVANLIAAPDSPTLAGFVVDMF 120

Query: 121 CIPMLDVAKQFSVPTFVFYTSSASFLALLFHLQELYDYEFNHDMDQLLNSVTEFALPGFK 180
           CIPM+DVAKQFSVPTFVFYTSSASFL+LLFHLQELYD+EFNH+MDQLLNS TEFA+PGFK
Sbjct: 121 CIPMVDVAKQFSVPTFVFYTSSASFLSLLFHLQELYDHEFNHNMDQLLNSATEFAVPGFK 180

Query: 181 NPIPRKVISTIFYDKETIEWAHNLTRKFREASGFLVNTFSELESGAINWFANQNLPPVYA 240
           NPIPRKVIST+FYDKET EWA ++ RKF EASGFLVNTFSELESGAINWFANQNLPPVYA
Sbjct: 181 NPIPRKVISTMFYDKETNEWAFDIARKFGEASGFLVNTFSELESGAINWFANQNLPPVYA 240

Query: 241 VGPILNVKEKNPQIERDEILKWLDEQPPSSVVLLCFGSMGIFNESQTKEIADALERSGVR 300
           VGPILN+KEKNPQI+RDEILKWLDEQPPSSVVLLCFGSMG+FNESQ+KEIADALERSGVR
Sbjct: 241 VGPILNLKEKNPQIKRDEILKWLDEQPPSSVVLLCFGSMGMFNESQSKEIADALERSGVR 300

Query: 301 FIWSIRQVPPESVLPEGFVDRTSGMGKVMGWAPQMEILEHPATGGFVSHCGWNSVLESLW 360
           FIWSIRQVPPESVLPEGFV RT G GKVMGWAPQMEILEHPATGGFVSHCGWNSVLESLW
Sbjct: 301 FIWSIRQVPPESVLPEGFVGRTRGRGKVMGWAPQMEILEHPATGGFVSHCGWNSVLESLW 360

Query: 361 NGVAVATWPMYAEQQLNAFHMAVELGVGVEVSLDYSMVGAAEGELRADKIEAGIRKLMEG 420
           +GVAVATWPMYAEQQLNAF MAVELGV VEVSLDYSMVGA E ELRA+KIEAGIRKLMEG
Sbjct: 361 SGVAVATWPMYAEQQLNAFQMAVELGVAVEVSLDYSMVGAGEEELRAEKIEAGIRKLMEG 420

Query: 421 SEEMKKGVMVKSEESKKATMEDGSSFNDLNRFIDHVFHKINT 462
           SEE+KK VMVKSEESKKATMEDGSSFNDLNRFI+HVFH INT
Sbjct: 421 SEEIKKAVMVKSEESKKATMEDGSSFNDLNRFINHVFHNINT 462

BLAST of CsGy4G020580 vs. ExPASy TrEMBL
Match: A0A6J1IV63 (anthocyanidin 3-O-glucosyltransferase 2-like OS=Cucurbita maxima OX=3661 GN=LOC111480888 PE=4 SV=1)

HSP 1 Score: 697 bits (1798), Expect = 3.05e-249
Identity = 356/467 (76.23%), Positives = 400/467 (85.65%), Query Frame = 0

Query: 1   MKKLELIFIPTPIIGHLTSALQLAHLLVTRHPFLSITIFIIKIPFPTRSADQIQSLCSSY 60
           MKKLEL+FIPTP+IGHLT+A+ LAHLL TRHP LSITI IIK+PFPT+SA  IQSLCSS 
Sbjct: 1   MKKLELVFIPTPLIGHLTAAVHLAHLLTTRHPPLSITILIIKLPFPTKSAPLIQSLCSSS 60

Query: 61  ANHRLRFFTLPEQPIPGNTNKTTILKPLVESQKQNVADAVANLIAA----PDSPTLAGFV 120
           A+ R+RF TLPEQPIP  T +T +L PLV+SQK NVA AVA+LI+A    PDSPTLAGFV
Sbjct: 61  ASDRIRFITLPEQPIPEGTKRTLLLDPLVQSQKLNVATAVADLISAADGGPDSPTLAGFV 120

Query: 121 VDMFCIPMLDVAKQFSVPTFVFYTSSASFLALLFHLQELYDYEFNHDMDQLLNSVTEFAL 180
           VDMFCIPM+DVA QF VPTFVFYTSSASFLALLFHLQELYD EFNHDMD+LLNS TEFA+
Sbjct: 121 VDMFCIPMVDVANQFGVPTFVFYTSSASFLALLFHLQELYDNEFNHDMDRLLNSATEFAV 180

Query: 181 PGFKNPIPRKVISTIFYDKETIEWAHNLTRKFREASGFLVNTFSELESGAINWFANQNLP 240
             F+NPIPRKVIST+F D+E  EW H LTR++REA+GFL+NTFSELE  AI WFA Q LP
Sbjct: 181 LCFRNPIPRKVISTMFIDREATEWTHALTRRYREANGFLINTFSELELDAIRWFAEQRLP 240

Query: 241 PVYAVGPILNVKEKNPQI--ERDEILKWLDEQPPSSVVLLCFGSMGIFNESQTKEIADAL 300
           PVYAVGPILN+  KNPQI    +EI+KWLDEQPPSSVV LCFG+MG FNESQTKEIA+AL
Sbjct: 241 PVYAVGPILNLN-KNPQIGESEEEIMKWLDEQPPSSVVFLCFGTMGSFNESQTKEIAEAL 300

Query: 301 ERSGVRFIWSIRQVPPESVLPEGFVDRTSGMGKVMGWAPQMEILEHPATGGFVSHCGWNS 360
           ER+GVRF+W+IRQ PPESVLPEGF+DRT G+GKV+GWAPQMEIL+HPATGGFVSHCGWNS
Sbjct: 301 ERTGVRFLWAIRQTPPESVLPEGFIDRTGGIGKVIGWAPQMEILKHPATGGFVSHCGWNS 360

Query: 361 VLESLWNGVAVATWPMYAEQQLNAFHMAVELGVGVEVSLDYSMVGAAEGE-----LRADK 420
           VLESLWN VAVATWPMYAEQQ+NAF M VELGV VEVSLDYSM+ A E E     LRA+K
Sbjct: 361 VLESLWNAVAVATWPMYAEQQVNAFEMVVELGVAVEVSLDYSMMVAEEEEEEEAVLRAEK 420

Query: 421 IEAGIRKLMEGSEEMKKGVMVKSEESKKATMEDGSSFNDLNRFIDHV 456
           IE  IR+LME S+E+K+ +MVK EESKKA ME GSSFN LNRFID +
Sbjct: 421 IEGAIRRLMERSDELKRALMVKGEESKKAMMESGSSFNALNRFIDAI 466

BLAST of CsGy4G020580 vs. TAIR 10
Match: AT3G21780.1 (UDP-glucosyl transferase 71B6)

HSP 1 Score: 387.5 bits (994), Expect = 1.5e-107
Identity = 223/482 (46.27%), Positives = 300/482 (62.24%), Query Frame = 0

Query: 3   KLELIFIPTPIIGHLTSALQLAHLLVTRHPFLSITIFIIKIPFPTRSADQIQSLCSSYAN 62
           K+EL+FIP+P I HL + +++A  LV ++  LSIT+ II   F +++   I SL S   N
Sbjct: 2   KIELVFIPSPAISHLMATVEMAEQLVDKNDNLSITVIIIS--FSSKNTSMITSLTS---N 61

Query: 63  HRLRFFTLPEQPIPGNTNKTTILKPL---VESQKQNVADAVANLI--AAPDSPTLAGFVV 122
           +RLR+     + I G   + T LK     ++S K  V DAVA L+    PD+P LAGFVV
Sbjct: 62  NRLRY-----EIISGGDQQPTELKATDSHIQSLKPLVRDAVAKLVDSTLPDAPRLAGFVV 121

Query: 123 DMFCIPMLDVAKQFSVPTFVFYTSSASFLALLFHLQELYDYEFNHDMDQLLNSVTEFALP 182
           DM+C  M+DVA +F VP+++FYTS+A FL LL H+Q +YD E  +DM +L +S  E  +P
Sbjct: 122 DMYCTSMIDVANEFGVPSYLFYTSNAGFLGLLLHIQFMYDAEDIYDMSELEDSDVELVVP 181

Query: 183 GFKNPIPRKVISTIFYDKETIEWAHNLTRKFREASGFLVNTFSELESGAINWFANQNLPP 242
              +P P K +  IF  KE + +     R+FRE  G LVNT  +LE  A+ + +N N+P 
Sbjct: 182 SLTSPYPLKCLPYIFKSKEWLTFFVTQARRFRETKGILVNTVPDLEPQALTFLSNGNIPR 241

Query: 243 VYAVGPILNVKEKNPQI---ERDEILKWLDEQPPSSVVLLCFGSMGIFNESQTKEIADAL 302
            Y VGP+L++K  N      ++ EIL+WLDEQPP SVV LCFGSMG F+E Q +E A AL
Sbjct: 242 AYPVGPLLHLKNVNCDYVDKKQSEILRWLDEQPPRSVVFLCFGSMGGFSEEQVRETALAL 301

Query: 303 ERSGVRFIWSIRQVPP-------------ESVLPEGFVDRTSGMGKVMGWAPQMEILEHP 362
           +RSG RF+WS+R+  P             E +LPEGF DRT+  GKV+GWA Q+ IL  P
Sbjct: 302 DRSGHRFLWSLRRASPNILREPPGEFTNLEEILPEGFFDRTANRGKVIGWAEQVAILAKP 361

Query: 363 ATGGFVSHCGWNSVLESLWNGVAVATWPMYAEQQLNAFHMAVELGVGVEVSLDYS---MV 422
           A GGFVSH GWNS LESLW GV +A WP+YAEQ+ NAF M  ELG+ VE+   +    ++
Sbjct: 362 AIGGFVSHGGWNSTLESLWFGVPMAIWPLYAEQKFNAFEMVEELGLAVEIKKHWRGDLLL 421

Query: 423 GAAEGELRADKIEAGIRKLMEGSEEMKKGVMVKSEESKKATMEDGSSFNDLNRFIDHVFH 461
           G +E  + A++IE GI  LME   +++K V   SE+   A M+ GSS   L RFI  V  
Sbjct: 422 GRSE-IVTAEEIEKGIICLMEQDSDVRKRVNEISEKCHVALMDGGSSETALKRFIQDVTE 472

BLAST of CsGy4G020580 vs. TAIR 10
Match: AT3G21760.1 (UDP-Glycosyltransferase superfamily protein)

HSP 1 Score: 380.2 bits (975), Expect = 2.4e-105
Identity = 225/490 (45.92%), Positives = 302/490 (61.63%), Query Frame = 0

Query: 3   KLELIFIPTPIIGHLTSALQLAHLLVTRHPFLSITIFII--KIPFPTRSADQIQSLCSSY 62
           KLEL+FIP+P  GHL   +++A L V R   LSITI II     F + ++    +  SS 
Sbjct: 2   KLELVFIPSPGDGHLRPLVEVAKLHVDRDDHLSITIIIIPQMHGFSSSNSSSYIASLSSD 61

Query: 63  ANHRLRF--FTLPEQPIPGNTNKTTI-----LKPLVESQKQNVADAVANLIAAPDSPT-L 122
           +  RL +   ++P++P   +T           KP V++  + + D        PDSP+ L
Sbjct: 62  SEERLSYNVLSVPDKPDSDDTKPHFFDYIDNFKPQVKATVEKLTDP-----GPPDSPSRL 121

Query: 123 AGFVVDMFCIPMLDVAKQFSVPTFVFYTSSASFLALLFHLQELYDYEFNHDMDQLLNS-V 182
           AGFVVDMFC+ M+DVA +F VP+++FYTS+A+FL L  H++ LYD + N+D+  L +S  
Sbjct: 122 AGFVVDMFCMMMIDVANEFGVPSYMFYTSNATFLGLQVHVEYLYDVK-NYDVSDLKDSDT 181

Query: 183 TEFALPGFKNPIPRKVISTIFYDKETIEWAHNLTRKFREASGFLVNTFSELESGAINWFA 242
           TE  +P    P+P K   ++   KE +      TR+FRE  G LVNTF+ELE  A+ +F+
Sbjct: 182 TELEVPCLTRPLPVKCFPSVLLTKEWLPVMFRQTRRFRETKGILVNTFAELEPQAMKFFS 241

Query: 243 --NQNLPPVYAVGPILNVKEKNPQIERD---EILKWLDEQPPSSVVLLCFGSMGIFNESQ 302
             +  LP VY VGP++N+K   P    D   EIL+WLDEQP  SVV LCFGSMG F E Q
Sbjct: 242 GVDSPLPTVYTVGPVMNLKINGPNSSDDKQSEILRWLDEQPRKSVVFLCFGSMGGFREGQ 301

Query: 303 TKEIADALERSGVRFIWSIRQVPP-------------ESVLPEGFVDRTSGMGKVMGWAP 362
            KEIA ALERSG RF+WS+R+  P             E +LPEGF++RT+ +GK++GWAP
Sbjct: 302 AKEIAIALERSGHRFVWSLRRAQPKGSIGPPEEFTNLEEILPEGFLERTAEIGKIVGWAP 361

Query: 363 QMEILEHPATGGFVSHCGWNSVLESLWNGVAVATWPMYAEQQLNAFHMAVELGVGVEV-- 422
           Q  IL +PA GGFVSHCGWNS LESLW GV +ATWP+YAEQQ+NAF M  ELG+ VEV  
Sbjct: 362 QSAILANPAIGGFVSHCGWNSTLESLWFGVPMATWPLYAEQQVNAFEMVEELGLAVEVRN 421

Query: 423 SLDYSMVGAAEGELRADKIEAGIRKLMEGSEEMKKGVMVKSEESKKATMEDGSSFNDLNR 462
           S     + A +  + A++IE GIR LME   +++  V   SE+S  A M+ GSS   L +
Sbjct: 422 SFRGDFMAADDELMTAEEIERGIRCLMEQDSDVRSRVKEMSEKSHVALMDGGSSHVALLK 481

BLAST of CsGy4G020580 vs. TAIR 10
Match: AT3G21750.1 (UDP-glucosyl transferase 71B1)

HSP 1 Score: 369.0 bits (946), Expect = 5.6e-102
Identity = 201/482 (41.70%), Positives = 292/482 (60.58%), Query Frame = 0

Query: 3   KLELIFIPTPIIGHLTSALQLAHLLVTRHPFLSITIFIIKIPFPTRSADQIQSLCSSYAN 62
           K+EL+FIP+P +GH+ +   LA LLV     LS+T+ +I    P+R +D   S   + + 
Sbjct: 2   KVELVFIPSPGVGHIRATTALAKLLVASDNRLSVTLIVI----PSRVSDDASSSVYTNSE 61

Query: 63  HRLRFFTLPEQPIPGNTNKTTILKPLVESQK---QNVADAVANLIAAPDSPTLAGFVVDM 122
            RLR+  LP +      ++TT L   ++SQK   + V   VA  ++      LAG VVDM
Sbjct: 62  DRLRYILLPAR------DQTTDLVSYIDSQKPQVRAVVSKVAGDVSTRSDSRLAGIVVDM 121

Query: 123 FCIPMLDVAKQFSVPTFVFYTSSASFLALLFHLQELYDYEFNHDMDQLLNSVTEFALPGF 182
           FC  M+D+A +F++  ++FYTS+AS+L L FH+Q LYD E   D+ +  ++  +F +P  
Sbjct: 122 FCTSMIDIADEFNLSAYIFYTSNASYLGLQFHVQSLYD-EKELDVSEFKDTEMKFDVPTL 181

Query: 183 KNPIPRKVISTIFYDKETIEWAHNLTRKFREASGFLVNTFSELESGAINWFA----NQNL 242
             P P K + ++  +K+   +     R FR   G LVN+ +++E  A+++F+    N N+
Sbjct: 182 TQPFPAKCLPSVMLNKKWFPYVLGRARSFRATKGILVNSVADMEPQALSFFSGGNGNTNI 241

Query: 243 PPVYAVGPILNVKEKNPQIERDEILKWLDEQPPSSVVLLCFGSMGIFNESQTKEIADALE 302
           PPVYAVGPI++++    + +R EIL WL EQP  SVV LCFGSMG F+E Q +EIA ALE
Sbjct: 242 PPVYAVGPIMDLESSGDEEKRKEILHWLKEQPTKSVVFLCFGSMGGFSEEQAREIAVALE 301

Query: 303 RSGVRFIWSIRQVPP---------------ESVLPEGFVDRTSGMGKVMGWAPQMEILEH 362
           RSG RF+WS+R+  P               E +LP+GF+DRT  +GK++ WAPQ+++L  
Sbjct: 302 RSGHRFLWSLRRASPVGNKSNPPPGEFTNLEEILPKGFLDRTVEIGKIISWAPQVDVLNS 361

Query: 363 PATGGFVSHCGWNSVLESLWNGVAVATWPMYAEQQLNAFHMAVELGVGVEVSLDYSMVGA 422
           PA G FV+HCGWNS+LESLW GV +A WP+YAEQQ NAFHM  ELG+  EV  +Y     
Sbjct: 362 PAIGAFVTHCGWNSILESLWFGVPMAAWPIYAEQQFNAFHMVDELGLAAEVKKEYRRDFL 421

Query: 423 AEGE--LRADKIEAGIRKLMEGSEEMKKGVMVKSEESKKATMEDGSSFNDLNRFIDHVFH 461
            E    + AD+IE GI+  ME   +M+K VM   ++   A ++ GSS   L +F+  V  
Sbjct: 422 VEEPEIVTADEIERGIKCAMEQDSKMRKRVMEMKDKLHVALVDGGSSNCALKKFVQDVVD 472

BLAST of CsGy4G020580 vs. TAIR 10
Match: AT4G15280.1 (UDP-glucosyl transferase 71B5)

HSP 1 Score: 362.8 bits (930), Expect = 4.0e-100
Identity = 212/480 (44.17%), Positives = 286/480 (59.58%), Query Frame = 0

Query: 3   KLELIFIPTPIIGHLTSALQLAHLLVTRHPFLSITIFIIKIPFPTRSADQ-IQSLCSSYA 62
           K+EL+FIP P IGHL   ++LA  L+     LSITI II   F    A   I SL +   
Sbjct: 2   KIELVFIPLPGIGHLRPTVKLAKQLIGSENRLSITIIIIPSRFDAGDASACIASLTTLSQ 61

Query: 63  NHRLRF--FTLPEQPIPGNTNKTTILKPLVESQKQNVADAVANLIAAPDSPTLAGFVVDM 122
           + RL +   ++ +QP P +       +  +E QK  V DAVA  I  P +  LAGFVVDM
Sbjct: 62  DDRLHYESISVAKQP-PTSDPDPVPAQVYIEKQKTKVRDAVAARIVDP-TRKLAGFVVDM 121

Query: 123 FCIPMLDVAKQFSVPTFVFYTSSASFLALLFHLQELYDYEFNHDMDQLLNSVTEFALPGF 182
           FC  M+DVA +F VP ++ YTS+A+FL  + H+Q++YD +  +D+ +L NSVTE   P  
Sbjct: 122 FCSSMIDVANEFGVPCYMVYTSNATFLGTMLHVQQMYDQK-KYDVSELENSVTELEFPSL 181

Query: 183 KNPIPRKVISTIFYDKETIEWAHNLTRKFREASGFLVNTFSELESGAINWF--ANQNLPP 242
             P P K +  I   KE +  +    R FR+  G LVNT +ELE  A+  F     +LP 
Sbjct: 182 TRPYPVKCLPHILTSKEWLPLSLAQARCFRKMKGILVNTVAELEPHALKMFNINGDDLPQ 241

Query: 243 VYAVGPILNVKEKNPQIER-DEILKWLDEQPPSSVVLLCFGSMGIFNESQTKEIADALER 302
           VY VGP+L+++  N   E+  EIL+WLDEQP  SVV LCFGS+G F E QT+E A AL+R
Sbjct: 242 VYPVGPVLHLENGNDDDEKQSEILRWLDEQPSKSVVFLCFGSLGGFTEEQTRETAVALDR 301

Query: 303 SGVRFIWSIRQVPP-------------ESVLPEGFVDRTSGMGKVMGWAPQMEILEHPAT 362
           SG RF+W +R   P             E VLPEGF++RT   GKV+GWAPQ+ +LE PA 
Sbjct: 302 SGQRFLWCLRHASPNIKTDRPRDYTNLEEVLPEGFLERTLDRGKVIGWAPQVAVLEKPAI 361

Query: 363 GGFVSHCGWNSVLESLWNGVAVATWPMYAEQQLNAFHMAVELGVGVEVS--LDYSMVGAA 422
           GGFV+HCGWNS+LESLW GV + TWP+YAEQ++NAF M  ELG+ VE+   L   +    
Sbjct: 362 GGFVTHCGWNSILESLWFGVPMVTWPLYAEQKVNAFEMVEELGLAVEIRKYLKGDLFAGE 421

Query: 423 EGELRADKIEAGIRKLMEGSEEMKKGVMVKSEESKKATMEDGSSFNDLNRFIDHVFHKIN 462
              + A+ IE  IR++ME   +++  V   +E+   A M+ GSS   L +FI  V   ++
Sbjct: 422 METVTAEDIERAIRRVMEQDSDVRNNVKEMAEKCHFALMDGGSSKAALEKFIQDVIENMD 478

BLAST of CsGy4G020580 vs. TAIR 10
Match: AT3G21790.1 (UDP-Glycosyltransferase superfamily protein)

HSP 1 Score: 361.3 bits (926), Expect = 1.2e-99
Identity = 212/487 (43.53%), Positives = 294/487 (60.37%), Query Frame = 0

Query: 3   KLELIFIPTPIIGHLTSALQLAHLLVTRHPFLSITIFIIKIPFPTR----SADQIQSLCS 62
           K EL+FIP P IGHL S +++A LLV R   LSI++ I  +PF +     ++D I +L +
Sbjct: 2   KFELVFIPYPGIGHLRSTVEMAKLLVDRETRLSISVII--LPFISEGEVGASDYIAALSA 61

Query: 63  SYANHRLRFFTLPEQPIPGNTNKTTILKPLVESQKQNVADAVANLI----AAPDSPTLAG 122
           S +N+RLR+  +     P  T + T ++  +++Q+  V   VA L+    + PDSP +AG
Sbjct: 62  S-SNNRLRYEVISAVDQP--TIEMTTIEIHMKNQEPKVRSTVAKLLEDYSSKPDSPKIAG 121

Query: 123 FVVDMFCIPMLDVAKQFSVPTFVFYTSSASFLALLFHLQELYDYEFNHDMDQ--LLNSVT 182
           FV+DMFC  M+DVA +F  P+++FYTSSA  L++ +H+Q L D E  +D+ +    +S  
Sbjct: 122 FVLDMFCTSMVDVANEFGFPSYMFYTSSAGILSVTYHVQMLCD-ENKYDVSENDYADSEA 181

Query: 183 EFALPGFKNPIPRKVISTIFYDKETIEWAHNLTRKFREASGFLVNTFSELESGAINWFAN 242
               P    P P K +         +    N  RKFRE  G LVNT +ELE   + + ++
Sbjct: 182 VLNFPSLSRPYPVKCLPHALAANMWLPVFVNQARKFREMKGILVNTVAELEPYVLKFLSS 241

Query: 243 QNLPPVYAVGPIL---NVKEKNPQIERDEILKWLDEQPPSSVVLLCFGSMGIFNESQTKE 302
            + PPVY VGP+L   N ++ +   +R EI++WLD+QPPSSVV LCFGSMG F E Q +E
Sbjct: 242 SDTPPVYPVGPLLHLENQRDDSKDEKRLEIIRWLDQQPPSSVVFLCFGSMGGFGEEQVRE 301

Query: 303 IADALERSGVRFIWSIRQVPP-------------ESVLPEGFVDRTSGMGKVMGWAPQME 362
           IA ALERSG RF+WS+R+  P             E VLPEGF DRT  +GKV+GWAPQ+ 
Sbjct: 302 IAIALERSGHRFLWSLRRASPNIFKELPGEFTNLEEVLPEGFFDRTKDIGKVIGWAPQVA 361

Query: 363 ILEHPATGGFVSHCGWNSVLESLWNGVAVATWPMYAEQQLNAFHMAVELGVGVEVSLDY- 422
           +L +PA GGFV+HCGWNS LESLW GV  A WP+YAEQ+ NAF M  ELG+ VE+   + 
Sbjct: 362 VLANPAIGGFVTHCGWNSTLESLWFGVPTAAWPLYAEQKFNAFLMVEELGLAVEIRKYWR 421

Query: 423 --SMVGAAEGELRADKIEAGIRKLMEGSEEMKKGVMVKSEESKKATMEDGSSFNDLNRFI 461
              + G     + A++IE  I  LME   +++K V   SE+   A M+ GSS   L +FI
Sbjct: 422 GEHLAGLPTATVTAEEIEKAIMCLMEQDSDVRKRVKDMSEKCHVALMDGGSSRTALQKFI 481

The following BLAST results are available for this feature:
Match Name  E-value   Identity  Description
Q66PF3      4.3e-115  48.76     Putative UDP-glucose flavonoid 3-O-glucosyltransferase 3 OS=Fragaria ananassa OX...
D3THI6      1.4e-113  46.75     UDP-glycosyltransferase 71A15 OS=Malus domestica OX=3750 GN=UGT71A15 PE=1 SV=1
D3UAG1      5.2e-113  46.03     UDP-glycosyltransferase 71A16 OS=Pyrus communis OX=23211 GN=UGT71A16 PE=1 SV=1
Q2V6K0      1.5e-112  46.76     UDP-glucose flavonoid 3-O-glucosyltransferase 6 OS=Fragaria ananassa OX=3747 GN=...
Q6VAB2      2.5e-107  45.49     UDP-glycosyltransferase 71E1 OS=Stevia rebaudiana OX=55670 GN=UGT71E1 PE=2 SV=1
Match Name      E-value    Identity  Description
XP_004146061.3  0.0        100.00    anthocyanidin 3-O-glucosyltransferase 2 [Cucumis sativus] >KAE8649802.1 hypothet...
TYK28034.1      1.25e-302  90.26     anthocyanidin 3-O-glucosyltransferase 2-like [Cucumis melo var. makuwa]
KAA0066887.1    7.50e-301  90.22     anthocyanidin 3-O-glucosyltransferase 2-like [Cucumis melo var. makuwa]
KAA0066888.1    1.18e-298  89.39     anthocyanidin 3-O-glucosyltransferase 2-like [Cucumis melo var. makuwa]
XP_038900058.1  2.18e-275  82.94     anthocyanidin 3-O-glucosyltransferase 2-like [Benincasa hispida]
Match Name  E-value    Identity  Description
A0A0A0KZA0  0.0        99.57     UDP-glucosyltransferase OS=Cucumis sativus OX=3659 GN=Csa_4G618500 PE=4 SV=1
A0A5D3DWY1  6.04e-303  90.26     Anthocyanidin 3-O-glucosyltransferase 2-like OS=Cucumis melo var. makuwa OX=1194...
A0A5A7VIB9  3.63e-301  90.22     Anthocyanidin 3-O-glucosyltransferase 2-like OS=Cucumis melo var. makuwa OX=1194...
A0A5A7VGP8  5.73e-299  89.39     Anthocyanidin 3-O-glucosyltransferase 2-like OS=Cucumis melo var. makuwa OX=1194...
A0A6J1IV63  3.05e-249  76.23     anthocyanidin 3-O-glucosyltransferase 2-like OS=Cucurbita maxima OX=3661 GN=LOC1...
Match Name   E-value   Identity  Description
AT3G21780.1  1.5e-107  46.27     UDP-glucosyl transferase 71B6
AT3G21760.1  2.4e-105  45.92     UDP-Glycosyltransferase superfamily protein
AT3G21750.1  5.6e-102  41.70     UDP-glucosyl transferase 71B1
AT4G15280.1  4.0e-100  44.17     UDP-glucosyl transferase 71B5
AT3G21790.1  1.2e-99   43.53     UDP-Glycosyltransferase superfamily protein
InterPro
Analysis Name: InterPro Annotations of Cucumber (Gy14) v2.1
Date Performed: 2021-10-25
IPR Term   IPR Description                           Source       Source Term     Source Description                              Alignment
IPR002213  UDP-glucuronosyl/UDP-glucosyltransferase  PFAM         PF00201         UDPGT                                           coord: 258..391; e-value: 2.5E-25; score: 89.3
IPR002213  UDP-glucuronosyl/UDP-glucosyltransferase  CDD          cd03784         GT1_Gtf-like                                    coord: 7..445; e-value: 4.73322E-70; score: 226.279
None       No IPR available                          GENE3D       3.40.50.2000    Glycogen Phosphorylase B;                       coord: 7..449; e-value: 5.5E-136; score: 456.1
None       No IPR available                          GENE3D       3.40.50.2000    Glycogen Phosphorylase B;                       coord: 252..437; e-value: 5.5E-136; score: 456.1
None       No IPR available                          PANTHER      PTHR48049:SF48  UDP-GLYCOSYLTRANSFERASE 71B2                    coord: 4..457
None       No IPR available                          PANTHER      PTHR48049       GLYCOSYLTRANSFERASE                             coord: 4..457
None       No IPR available                          SUPERFAMILY  53756           UDP-Glycosyltransferase/glycogen phosphorylase  coord: 2..456

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
CsGy4G020580.2  CsGy4G020580.2  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
molecular_function  GO:0008194      UDP-glycosyltransferase activity