HG10001558 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10001558
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: UDP-glycosyltransferase 87A1-like
Location: Chr09: 18131444 .. 18132890 (-)
RNA-Seq Expression: HG10001558
Synteny: HG10001558
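
The Location value above can be read as chromosome, start, end and strand. The following is a minimal sketch of deriving the gene span from it; the helper name `parse_location` and its regular expression are illustrative assumptions and not part of the database, and the coordinates are assumed to be 1-based and inclusive.

```python
import re

def parse_location(text):
    """Hypothetical helper: parse 'Chr09: 18131444 .. 18132890 (-)'."""
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand

chrom, start, end, strand = parse_location("Chr09: 18131444 .. 18132890 (-)")

# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(chrom, strand, span)
```

Under that assumption the span is 1447 bp on the minus strand of Chr09.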
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGATCCGATCAGTGGTTCTGCAGCACCCAAACGGATTCATCTGGCGGCGTTGCCGTACCCCGGCAGAGGCCACATCAACGCTCTCATGAATCTCTGCAAGCTCCTCTCTCTAAAAAATCCCAACATTCTCATCTCCTTTATTGTCACTAACGAGTGGCTCACCTTCCTCGTCGCCGATCCCAAACCCCAAAACATCCAATTCGCCACTTTCCCCAATGTTATCCCCTCTGAGCTCTGCCGCACCAACGACTTCCCCGATTTCGTCCGATCCATCCATACCCATATGGAGGCTCCCGTTGAGACTCTACTCCGCCGCCTCGAACCGCCGCCGATTGCCATCCTCGCCGATGCCTTCGTCACTTGGGCTGTCCAATTGGGGCAACGCCTCAATGTTCTGGTCGCTTCACTCTGGCCCATGTCGGCTACTGTTTTCTCCATCCTTTACCATTTCGACCTTCTCAAGGAAAATGGGCATTTTCCAGCCGATCTCTTAGGTAATTGTAAAATCTATTTGATGGAAATTGCGCTTTTGTGATCGGAATTTAAGTTCTTTCTTACTCTGTTTTGAAGAGCGGGGAGAAGAGATTGTCGATTACTTCCCTGGAGTGTCGAAGATTCGTCTTGCAGATTTGCCGTCTTTCTTCTCCGGCGATGGTCTCCAAAGCGTCGAATTCGCCGTGAACTCCGCCCGTTCTGTCGACAAATCCCAATTTCTCATCTCCACCTCTGTTTACGAGCTTGAATCCTCTGTTATCGACTCCTTAAAAGCTAAATTTCCCTTCCCGGTCTACACCATCGGACCCAGTACTCCATATTTTGAGCTAGAATGCTCCGTCCCAAACGGCGGCACCAACGACTATCTCCGGTGGTTGGACTCCCAAGCAGAGGGCTCTGTTTTGTACATTTCGCAGGGGAGTTTTCTTTCAGTTTCTAGCGCCCAAATGGACGAGATCGTCGCTGGTGTGAAAGCTAGCGGCGTCCGATTCTTGTGGGTGGCACGTGGAGATGACGGTCGGTTGAAGGACGTGGATAGAGAAACTGGGATGGTGGTTGGATGGTGCGACCAATTGAGGGTTCTGTGCCATCGAGCTGTTGGTGGGTTTTGGACTCACGGCGGTTGGAATTCGACTCTGGAAGGGATTTTTGCGGGCGTTCCGATGCTTGCTTGGCCGATATTTTGGGATCAATTTCCGAACAGTAAGAATATTGCGGAGGATTGGAAAGTTGGGGTCCGATTTAAAACTGTTGGGGGTAAGGATTTGGTGAGGAGATTGGAAATTGCAGAGTTTGTGAAGAGATTTATGAACTCAGAGAGCGTTGAAGGAAGGGAGATGAGAAACAGAGTGTCGGAGTTTCAAGAGATTTGCCAGCTAGCGGTGGCGAAAGGTGGTTCCTCTGATTCCAACATTGATTCATTTCTCAATCATATTTCAGGAAAGTTATGA

mRNA sequence

ATGGATCCGATCAGTGGTTCTGCAGCACCCAAACGGATTCATCTGGCGGCGTTGCCGTACCCCGGCAGAGGCCACATCAACGCTCTCATGAATCTCTGCAAGCTCCTCTCTCTAAAAAATCCCAACATTCTCATCTCCTTTATTGTCACTAACGAGTGGCTCACCTTCCTCGTCGCCGATCCCAAACCCCAAAACATCCAATTCGCCACTTTCCCCAATGTTATCCCCTCTGAGCTCTGCCGCACCAACGACTTCCCCGATTTCGTCCGATCCATCCATACCCATATGGAGGCTCCCGTTGAGACTCTACTCCGCCGCCTCGAACCGCCGCCGATTGCCATCCTCGCCGATGCCTTCGTCACTTGGGCTGTCCAATTGGGGCAACGCCTCAATGTTCTGGTCGCTTCACTCTGGCCCATGTCGGCTACTGTTTTCTCCATCCTTTACCATTTCGACCTTCTCAAGGAAAATGGGCATTTTCCAGCCGATCTCTTAGAGCGGGGAGAAGAGATTGTCGATTACTTCCCTGGAGTGTCGAAGATTCGTCTTGCAGATTTGCCGTCTTTCTTCTCCGGCGATGGTCTCCAAAGCGTCGAATTCGCCGTGAACTCCGCCCGTTCTGTCGACAAATCCCAATTTCTCATCTCCACCTCTGTTTACGAGCTTGAATCCTCTGTTATCGACTCCTTAAAAGCTAAATTTCCCTTCCCGGTCTACACCATCGGACCCAGTACTCCATATTTTGAGCTAGAATGCTCCGTCCCAAACGGCGGCACCAACGACTATCTCCGGTGGTTGGACTCCCAAGCAGAGGGCTCTGTTTTGTACATTTCGCAGGGGAGTTTTCTTTCAGTTTCTAGCGCCCAAATGGACGAGATCGTCGCTGGTGTGAAAGCTAGCGGCGTCCGATTCTTGTGGGTGGCACGTGGAGATGACGGTCGGTTGAAGGACGTGGATAGAGAAACTGGGATGGTGGTTGGATGGTGCGACCAATTGAGGGTTCTGTGCCATCGAGCTGTTGGTGGGTTTTGGACTCACGGCGGTTGGAATTCGACTCTGGAAGGGATTTTTGCGGGCGTTCCGATGCTTGCTTGGCCGATATTTTGGGATCAATTTCCGAACAGTAAGAATATTGCGGAGGATTGGAAAGTTGGGGTCCGATTTAAAACTGTTGGGGGTAAGGATTTGGTGAGGAGATTGGAAATTGCAGAGTTTGTGAAGAGATTTATGAACTCAGAGAGCGTTGAAGGAAGGGAGATGAGAAACAGAGTGTCGGAGTTTCAAGAGATTTGCCAGCTAGCGGTGGCGAAAGGTGGTTCCTCTGATTCCAACATTGATTCATTTCTCAATCATATTTCAGGAAAGTTATGA

Coding sequence (CDS)

ATGGATCCGATCAGTGGTTCTGCAGCACCCAAACGGATTCATCTGGCGGCGTTGCCGTACCCCGGCAGAGGCCACATCAACGCTCTCATGAATCTCTGCAAGCTCCTCTCTCTAAAAAATCCCAACATTCTCATCTCCTTTATTGTCACTAACGAGTGGCTCACCTTCCTCGTCGCCGATCCCAAACCCCAAAACATCCAATTCGCCACTTTCCCCAATGTTATCCCCTCTGAGCTCTGCCGCACCAACGACTTCCCCGATTTCGTCCGATCCATCCATACCCATATGGAGGCTCCCGTTGAGACTCTACTCCGCCGCCTCGAACCGCCGCCGATTGCCATCCTCGCCGATGCCTTCGTCACTTGGGCTGTCCAATTGGGGCAACGCCTCAATGTTCTGGTCGCTTCACTCTGGCCCATGTCGGCTACTGTTTTCTCCATCCTTTACCATTTCGACCTTCTCAAGGAAAATGGGCATTTTCCAGCCGATCTCTTAGAGCGGGGAGAAGAGATTGTCGATTACTTCCCTGGAGTGTCGAAGATTCGTCTTGCAGATTTGCCGTCTTTCTTCTCCGGCGATGGTCTCCAAAGCGTCGAATTCGCCGTGAACTCCGCCCGTTCTGTCGACAAATCCCAATTTCTCATCTCCACCTCTGTTTACGAGCTTGAATCCTCTGTTATCGACTCCTTAAAAGCTAAATTTCCCTTCCCGGTCTACACCATCGGACCCAGTACTCCATATTTTGAGCTAGAATGCTCCGTCCCAAACGGCGGCACCAACGACTATCTCCGGTGGTTGGACTCCCAAGCAGAGGGCTCTGTTTTGTACATTTCGCAGGGGAGTTTTCTTTCAGTTTCTAGCGCCCAAATGGACGAGATCGTCGCTGGTGTGAAAGCTAGCGGCGTCCGATTCTTGTGGGTGGCACGTGGAGATGACGGTCGGTTGAAGGACGTGGATAGAGAAACTGGGATGGTGGTTGGATGGTGCGACCAATTGAGGGTTCTGTGCCATCGAGCTGTTGGTGGGTTTTGGACTCACGGCGGTTGGAATTCGACTCTGGAAGGGATTTTTGCGGGCGTTCCGATGCTTGCTTGGCCGATATTTTGGGATCAATTTCCGAACAGTAAGAATATTGCGGAGGATTGGAAAGTTGGGGTCCGATTTAAAACTGTTGGGGGTAAGGATTTGGTGAGGAGATTGGAAATTGCAGAGTTTGTGAAGAGATTTATGAACTCAGAGAGCGTTGAAGGAAGGGAGATGAGAAACAGAGTGTCGGAGTTTCAAGAGATTTGCCAGCTAGCGGTGGCGAAAGGTGGTTCCTCTGATTCCAACATTGATTCATTTCTCAATCATATTTCAGGAAAGTTATGA

Protein sequence

MDPISGSAAPKRIHLAALPYPGRGHINALMNLCKLLSLKNPNILISFIVTNEWLTFLVADPKPQNIQFATFPNVIPSELCRTNDFPDFVRSIHTHMEAPVETLLRRLEPPPIAILADAFVTWAVQLGQRLNVLVASLWPMSATVFSILYHFDLLKENGHFPADLLERGEEIVDYFPGVSKIRLADLPSFFSGDGLQSVEFAVNSARSVDKSQFLISTSVYELESSVIDSLKAKFPFPVYTIGPSTPYFELECSVPNGGTNDYLRWLDSQAEGSVLYISQGSFLSVSSAQMDEIVAGVKASGVRFLWVARGDDGRLKDVDRETGMVVGWCDQLRVLCHRAVGGFWTHGGWNSTLEGIFAGVPMLAWPIFWDQFPNSKNIAEDWKVGVRFKTVGGKDLVRRLEIAEFVKRFMNSESVEGREMRNRVSEFQEICQLAVAKGGSSDSNIDSFLNHISGKL
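
The protein sequence corresponds to a translation of the CDS. As a minimal sketch, assuming the standard genetic code and reading frame 0 (the `translate` function is an illustration, not the pipeline used to build this record), the first and last codons of the CDS can be checked like this:

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G
# (third position varies fastest). '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First seven codons and last four codons of the CDS listed above.
print(translate("ATGGATCCGATCAGTGGTTCT"))  # MDPISGS
print(translate("GGAAAGTTATGA"))           # GKL
```

The two fragments give the start (MDPISGS) and end (GKL) of the protein sequence; passing the full CDS to `translate` would be the way to check the whole sequence.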
Homology
BLAST of HG10001558 vs. NCBI nr
Match: XP_038902528.1 (UDP-glycosyltransferase 87A1-like [Benincasa hispida])

HSP 1 Score: 811.6 bits (2095), Expect = 3.4e-231
Identity = 400/456 (87.72%), Positives = 421/456 (92.32%), Query Frame = 0

Query: 1   MDPISGSAAPKRIHLAALPYPGRGHINALMNLCKLLSLKNPNILISFIVTNEWLTFLVAD 60
           MDP+S     KRIHLAALPYPGRGHINALMNLCKLLSLKNPNILISFIVT+EW TFL AD
Sbjct: 1   MDPMS-----KRIHLAALPYPGRGHINALMNLCKLLSLKNPNILISFIVTDEWFTFLAAD 60

Query: 61  PKPQNIQFATFPNVIPSELCRTNDFPDFVRSIHTHMEAPVETLLRRLEPPPIAILADAFV 120
           PKPQNI FATFPNVIPSEL R +DFP F+RSI THMEAPVETLLRRL+PPP AILAD F+
Sbjct: 61  PKPQNIHFATFPNVIPSELRRADDFPGFIRSIQTHMEAPVETLLRRLDPPPTAILADTFL 120

Query: 121 TWAVQLGQRLNVLVASLWPMSATVFSILYHFDLLKENGHFPADLLERGEEIVDYFPGVSK 180
           TWAVQLG+ LNV VASLWPMSATVFSILYHFDLL+ENGHFPADL ERGEEIVDYFPGVSK
Sbjct: 121 TWAVQLGKCLNVPVASLWPMSATVFSILYHFDLLEENGHFPADLSERGEEIVDYFPGVSK 180

Query: 181 IRLADLPSFFSGDGLQSVEFAVNSARSVDKSQFLISTSVYELESSVIDSLKAKFPFPVYT 240
           IRLADLPSFFSG+GLQSV FAV SAR VDKSQFLISTSVYELESSVID+LK KFPFPVYT
Sbjct: 181 IRLADLPSFFSGNGLQSVGFAVKSARFVDKSQFLISTSVYELESSVIDTLKEKFPFPVYT 240

Query: 241 IGPSTPYFELECSVPNGGTNDYLRWLDSQAEGSVLYISQGSFLSVSSAQMDEIVAGVKAS 300
           IGPSTPYFELE S     TNDYLRWLDSQAEGSVLY+SQGS+LSVSSAQMDEIVAGVKAS
Sbjct: 241 IGPSTPYFELESS----ATNDYLRWLDSQAEGSVLYVSQGSYLSVSSAQMDEIVAGVKAS 300

Query: 301 GVRFLWVARGDDGRLKDVDRETGMVVGWCDQLRVLCHRAVGGFWTHGGWNSTLEGIFAGV 360
           GVRFLWVARGDD R KDVDRETGMVVGWCDQLRVLCHRA+GGFWTHGGWNSTLEG+FAGV
Sbjct: 301 GVRFLWVARGDDDRFKDVDRETGMVVGWCDQLRVLCHRAIGGFWTHGGWNSTLEGVFAGV 360

Query: 361 PMLAWPIFWDQFPNSKNIAEDWKVGVRFKTVGGKDLVRRLEIAEFVKRFMNSESVEGREM 420
           PMLAWPIFWDQFPNSK IAEDWKVGVRFK VGG+DLV R+EIAEFVKRFMNSES+EGREM
Sbjct: 361 PMLAWPIFWDQFPNSKKIAEDWKVGVRFKAVGGRDLVSRVEIAEFVKRFMNSESIEGREM 420

Query: 421 RNRVSEFQEICQLAVAKGGSSDSNIDSFLNHISGKL 457
           RNRVSE QEIC+ AVAKGGSSDSNID+FL+HIS +L
Sbjct: 421 RNRVSELQEICRRAVAKGGSSDSNIDAFLSHISVEL 447
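
The percentages in the HSP summary line are the match counts divided by the alignment length. A small sketch of that arithmetic for the hit above (the function name `percent` is an assumption made for illustration):

```python
def percent(count, alignment_length):
    """Return count / alignment_length as a percentage with two decimals."""
    return round(100 * count / alignment_length, 2)

# Values taken from the first HSP: 400 identical and 421 similar
# positions over an alignment of 456 columns.
identity = percent(400, 456)
positives = percent(421, 456)
print(identity, positives)  # 87.72 92.32
```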

BLAST of HG10001558 vs. NCBI nr
Match: XP_008463842.1 (PREDICTED: UDP-glycosyltransferase 87A1-like [Cucumis melo] >KAA0035258.1 UDP-glycosyltransferase 87A1-like [Cucumis melo var. makuwa] >TYK07577.1 UDP-glycosyltransferase 87A1-like [Cucumis melo var. makuwa])

HSP 1 Score: 802.4 bits (2071), Expect = 2.1e-228
Identity = 392/454 (86.34%), Positives = 414/454 (91.19%), Query Frame = 0

Query: 1   MDPISGSAAPKRIHLAALPYPGRGHINALMNLCKLLSLKNPNILISFIVTNEWLTFLVAD 60
           MDP+      KR+HLAALPYPGRGHINALMN CKLLSLKNPNI ISFIVT EWL+FL AD
Sbjct: 1   MDPVR-----KRVHLAALPYPGRGHINALMNFCKLLSLKNPNISISFIVTEEWLSFLAAD 60

Query: 61  PKPQNIQFATFPNVIPSELCRTNDFPDFVRSIHTHMEAPVETLLRRLEPPPIAILADAFV 120
           PKP NI F T PNVIPSEL R NDFP F+RS+ THMEAPVETLLRRLEPPP AI+AD F 
Sbjct: 61  PKPPNIHFVTIPNVIPSELHRANDFPGFIRSVQTHMEAPVETLLRRLEPPPTAIIADTFG 120

Query: 121 TWAVQLGQRLNVLVASLWPMSATVFSILYHFDLLKENGHFPADLLERGEEIVDYFPGVSK 180
           TWAVQLG+RL+V VASLWPMSATVFSILYHFDLLKENGHFPADL ERGEEIVDYFPGVSK
Sbjct: 121 TWAVQLGKRLDVPVASLWPMSATVFSILYHFDLLKENGHFPADLSERGEEIVDYFPGVSK 180

Query: 181 IRLADLPSFFSGDGLQSVEFAVNSARSVDKSQFLISTSVYELESSVIDSLKAKFPFPVYT 240
           IRLADLPSFFSG+GLQS+EFAV SARSVDK+QFLISTSVYELESSV+DSLKAKFPFPVYT
Sbjct: 181 IRLADLPSFFSGNGLQSIEFAVKSARSVDKAQFLISTSVYELESSVLDSLKAKFPFPVYT 240

Query: 241 IGPSTPYFELECSVPNGGTNDYLRWLDSQAEGSVLYISQGSFLSVSSAQMDEIVAGVKAS 300
           IGPSTPYFELE SV    +NDYLRWLDSQ +GSVLY+SQGSFLSVS+AQMDEI+AGVKAS
Sbjct: 241 IGPSTPYFELESSV----SNDYLRWLDSQTDGSVLYVSQGSFLSVSNAQMDEIIAGVKAS 300

Query: 301 GVRFLWVARGDDGRLKDVDRETGMVVGWCDQLRVLCHRAVGGFWTHGGWNSTLEGIFAGV 360
           GVRFLWVARGDD R KDVDRETGMVVGWCDQLRVLCHRAVGGFWTHGGWNSTLEG+FAGV
Sbjct: 301 GVRFLWVARGDDDRWKDVDRETGMVVGWCDQLRVLCHRAVGGFWTHGGWNSTLEGVFAGV 360

Query: 361 PMLAWPIFWDQFPNSKNIAEDWKVGVRFKTVGGKDLVRRLEIAEFVKRFMNSESVEGREM 420
           PML WPIFWDQFPNSK IAEDWKVGVRFK  GGKDLVRR EIAEFVK+FMNSESVE +EM
Sbjct: 361 PMLVWPIFWDQFPNSKKIAEDWKVGVRFKGAGGKDLVRREEIAEFVKKFMNSESVESKEM 420

Query: 421 RNRVSEFQEICQLAVAKGGSSDSNIDSFLNHISG 455
           R RVSEFQEIC+ AVAKGGSSDSNID+FLNHISG
Sbjct: 421 RKRVSEFQEICRRAVAKGGSSDSNIDAFLNHISG 445

BLAST of HG10001558 vs. NCBI nr
Match: XP_004143169.1 (UDP-glycosyltransferase 87A1 [Cucumis sativus] >KGN47051.1 hypothetical protein Csa_020768 [Cucumis sativus])

HSP 1 Score: 793.5 bits (2048), Expect = 9.6e-226
Identity = 393/456 (86.18%), Positives = 414/456 (90.79%), Query Frame = 0

Query: 1   MDPISGSAAPKRIHLAALPYPGRGHINALMNLCKLLSLKNPNILISFIVTNEWLTFLVAD 60
           MDPIS     KRIHLAALPYPGRGHINAL+N CK+LSLK+PNI ISFIVT+EWLTFL AD
Sbjct: 1   MDPIS-----KRIHLAALPYPGRGHINALINFCKILSLKSPNISISFIVTDEWLTFLAAD 60

Query: 61  PKPQNIQFATFPNVIPSELCRTNDFPDFVRSIHTHMEAPVETLLRRLEPPPIAILADAFV 120
           PKP NI F TFPNVIPSEL R NDFP FVRSI THMEAPVETLLRRL PPP AI+AD FV
Sbjct: 61  PKPPNIHFVTFPNVIPSELHRANDFPGFVRSIQTHMEAPVETLLRRLHPPPTAIIADTFV 120

Query: 121 TWAVQLGQRLNVLVASLWPMSATVFSILYHFDLLKENGHFPADLLERGEEIVDYFPGVSK 180
            WAVQLG+RL+V VASLWPMSATVFSILYHFDLLKENGHFPADL ERGEEIVDYFPGVSK
Sbjct: 121 YWAVQLGKRLDVPVASLWPMSATVFSILYHFDLLKENGHFPADLSERGEEIVDYFPGVSK 180

Query: 181 IRLADLPSFFSGDGLQSVEFAVNSARSVDKSQFLISTSVYELESSVIDSLKAKFPFPVYT 240
           IRLADLPSFFSG+GLQ++ F+V SARSVDK+QFLISTSVYELESSVIDSLKA FPFPVYT
Sbjct: 181 IRLADLPSFFSGNGLQTLGFSVKSARSVDKAQFLISTSVYELESSVIDSLKANFPFPVYT 240

Query: 241 IGPSTPYFELECSVPNGGTNDYLRWLDSQAEGSVLYISQGSFLSVSSAQMDEIVAGVKAS 300
           IGPSTPYFELE S     +NDYL+WLDSQAEGSVLYISQGSFLSVS+ QMDEIVAGVKAS
Sbjct: 241 IGPSTPYFELESS----ASNDYLQWLDSQAEGSVLYISQGSFLSVSNTQMDEIVAGVKAS 300

Query: 301 GVRFLWVARGDDGRLKDVDRETGMVVGWCDQLRVLCHRAVGGFWTHGGWNSTLEGIFAGV 360
           GVRFLWVARGDD R KDVDRETGMVVGWCDQLRVLCH AVGGFWTHGGWNST+EG+FAGV
Sbjct: 301 GVRFLWVARGDDDRWKDVDRETGMVVGWCDQLRVLCHGAVGGFWTHGGWNSTVEGVFAGV 360

Query: 361 PMLAWPIFWDQFPNSKNIAEDWKVGVRFKTVGGKDLVRRLEIAEFVKRFMNSESVEGREM 420
           PML WPIFWDQFPNSK IAEDW+VGVRFK VGGKDLVRR EIAEFVKRFMNSESVEG+EM
Sbjct: 361 PMLVWPIFWDQFPNSKKIAEDWQVGVRFKGVGGKDLVRREEIAEFVKRFMNSESVEGKEM 420

Query: 421 RNRVSEFQEICQLAVAKGGSSDSNIDSFLNHISGKL 457
           R RVSEFQEIC+ AVAKGGSSDSNID+FL HISG L
Sbjct: 421 RKRVSEFQEICRGAVAKGGSSDSNIDAFLKHISGDL 447

BLAST of HG10001558 vs. NCBI nr
Match: XP_022948231.1 (UDP-glycosyltransferase 87A2-like [Cucurbita moschata] >XP_022948232.1 UDP-glycosyltransferase 87A2-like [Cucurbita moschata])

HSP 1 Score: 736.1 bits (1899), Expect = 1.8e-208
Identity = 362/456 (79.39%), Positives = 399/456 (87.50%), Query Frame = 0

Query: 1   MDPISGSAAPKRIHLAALPYPGRGHINALMNLCKLLSLKNPNILISFIVTNEWLTFLVAD 60
           MDPI GSA  K+ HLAA+PYPGRGH+NALMNLCKLLSLKNPNILISFIVT+EWLTFL  +
Sbjct: 1   MDPI-GSATTKQTHLAAVPYPGRGHVNALMNLCKLLSLKNPNILISFIVTDEWLTFLAGE 60

Query: 61  PKPQNIQFATFPNVIPSELCRTNDFPDFVRSIHTHMEAPVETLLRRLEPPPIAILADAFV 120
           PKPQNI+FAT PNVIPSE+ R NDFP F+RS+++ MEAP  TLL RL PPP AI+ADAF+
Sbjct: 61  PKPQNIRFATIPNVIPSEIGRANDFPGFIRSVNSDMEAPTNTLLTRLHPPPTAIVADAFL 120

Query: 121 TWAVQLGQRLNVLVASLWPMSATVFSILYHFDLLKENGHFPADLLERGEEIVDYFPGVSK 180
           TW VQLG  L + VASLWPMS TVFSILYHF+LL+ENG FPA+L ERGE+IVDYFPGVSK
Sbjct: 121 TWMVQLGNNLCIPVASLWPMSVTVFSILYHFELLQENGDFPAELSERGEQIVDYFPGVSK 180

Query: 181 IRLADLPSFFSGDGLQSVEFAVNSARSVDKSQFLISTSVYELESSVIDSLKAKFPFPVYT 240
           IRLADLPSFFSG+G++ V  AV SARSVD SQFLISTSVYELESSVID+LKAKFP P+YT
Sbjct: 181 IRLADLPSFFSGNGVKVVGAAVKSARSVDNSQFLISTSVYELESSVIDALKAKFPIPIYT 240

Query: 241 IGPSTPYFELECSV-PNGG-TNDYLRWLDSQAEGSVLYISQGSFLSVSSAQMDEIVAGVK 300
           IGPS PYFELE SV  NGG   +YLRWLDSQ EGSVLYISQGSFLSVSSAQM+EI+AGVK
Sbjct: 241 IGPSAPYFELETSVKDNGGDPKNYLRWLDSQTEGSVLYISQGSFLSVSSAQMEEIIAGVK 300

Query: 301 ASGVRFLWVARGDDGRLKDVDRETGMVVGWCDQLRVLCHRAVGGFWTHGGWNSTLEGIFA 360
            SGVRFLWVARGDDGRLKDVD ETGMVV WCDQL+VLCH AVGGFWTHGGWNSTLEG+FA
Sbjct: 301 DSGVRFLWVARGDDGRLKDVDGETGMVVEWCDQLKVLCHSAVGGFWTHGGWNSTLEGVFA 360

Query: 361 GVPMLAWPIFWDQFPNSKNIAEDWKVGVRFKTVGGKDLVRRLEIAEFVKRFMNSESVEGR 420
           GVPMLAWPIFWDQ PNSK I EDWKVGVRF+ VGG++LV R EIAE VKRFMN E+VEGR
Sbjct: 361 GVPMLAWPIFWDQIPNSKKIVEDWKVGVRFQAVGGRNLVGREEIAETVKRFMNPENVEGR 420

Query: 421 EMRNRVSEFQEICQLAVAKGGSSDSNIDSFLNHISG 455
           EMR RVSE ++ C+ AVA+GGSSDSNID+FL  I G
Sbjct: 421 EMRKRVSELRDACRRAVARGGSSDSNIDAFLGDICG 455

BLAST of HG10001558 vs. NCBI nr
Match: XP_023532234.1 (UDP-glycosyltransferase 87A2-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 733.0 bits (1891), Expect = 1.5e-207
Identity = 361/456 (79.17%), Positives = 400/456 (87.72%), Query Frame = 0

Query: 1   MDPISGSAAPKRIHLAALPYPGRGHINALMNLCKLLSLKNPNILISFIVTNEWLTFLVAD 60
           MDPIS SA  K+ HLAA+PYPGRGH+NALMNLCKLLSLKNPNILISFIVT+EWLTFL  +
Sbjct: 1   MDPIS-SATTKQTHLAAVPYPGRGHVNALMNLCKLLSLKNPNILISFIVTDEWLTFLAGE 60

Query: 61  PKPQNIQFATFPNVIPSELCRTNDFPDFVRSIHTHMEAPVETLLRRLEPPPIAILADAFV 120
           PKPQNI+FAT PNVIPSEL R NDFP F+RS+++ MEAP +TLL RL PP  AI+ADAF+
Sbjct: 61  PKPQNIRFATIPNVIPSELGRANDFPGFIRSVNSDMEAPTDTLLTRLHPPLTAIVADAFL 120

Query: 121 TWAVQLGQRLNVLVASLWPMSATVFSILYHFDLLKENGHFPADLLERGEEIVDYFPGVSK 180
            W VQLG  L + VASLWPMS TVFSILYHF+LL+ENG FPA+L ERGE+IVDYFPGVSK
Sbjct: 121 PWMVQLGNNLRIPVASLWPMSVTVFSILYHFELLQENGDFPAELSERGEQIVDYFPGVSK 180

Query: 181 IRLADLPSFFSGDGLQSVEFAVNSARSVDKSQFLISTSVYELESSVIDSLKAKFPFPVYT 240
           IRLADLPSFFSG+G++ V  AV SARSVD SQFLISTSVYELESSVID+LKAKFP P+YT
Sbjct: 181 IRLADLPSFFSGNGVKVVGGAVKSARSVDNSQFLISTSVYELESSVIDALKAKFPIPIYT 240

Query: 241 IGPSTPYFELECSV-PNGG-TNDYLRWLDSQAEGSVLYISQGSFLSVSSAQMDEIVAGVK 300
           IGPS PYFELE SV  NGG + +YLRWLDSQAEGSVLYISQGSFLSVSSAQM+EI+AGVK
Sbjct: 241 IGPSAPYFELETSVKDNGGDSKNYLRWLDSQAEGSVLYISQGSFLSVSSAQMEEIIAGVK 300

Query: 301 ASGVRFLWVARGDDGRLKDVDRETGMVVGWCDQLRVLCHRAVGGFWTHGGWNSTLEGIFA 360
            SGVRFLWVARGDDGRLKDVD ETGMVV WCDQL+VLCH AVGGFWTHGGWNSTLEG+FA
Sbjct: 301 DSGVRFLWVARGDDGRLKDVDGETGMVVEWCDQLKVLCHNAVGGFWTHGGWNSTLEGVFA 360

Query: 361 GVPMLAWPIFWDQFPNSKNIAEDWKVGVRFKTVGGKDLVRRLEIAEFVKRFMNSESVEGR 420
           GVPMLAWPIFWDQ PNSK I EDWKVGVRF+ VGG++LV R EIAE VKRFMN ++VEGR
Sbjct: 361 GVPMLAWPIFWDQIPNSKKIVEDWKVGVRFQAVGGRNLVGREEIAETVKRFMNPDNVEGR 420

Query: 421 EMRNRVSEFQEICQLAVAKGGSSDSNIDSFLNHISG 455
           EMR RVSE ++ C+ AVA+GGSSDSNID+FL  I G
Sbjct: 421 EMRKRVSELRDTCRRAVARGGSSDSNIDAFLRDICG 455

BLAST of HG10001558 vs. ExPASy Swiss-Prot
Match: O64733 (UDP-glycosyltransferase 87A2 OS=Arabidopsis thaliana OX=3702 GN=UGT87A2 PE=1 SV=1)

HSP 1 Score: 467.6 bits (1202), Expect = 1.6e-130
Identity = 241/457 (52.74%), Positives = 312/457 (68.27%), Query Frame = 0

Query: 1   MDPISGSAAPKRI-HLAALPYPGRGHINALMNLCKLLSLKNPNILISFIVTNEWLTFLVA 60
           MDP    + P +  H+ A+PYPGRGHIN +MNLCK L  + PN+ ++F+VT EWL F+  
Sbjct: 1   MDP--NESPPNQFRHVVAMPYPGRGHINPMMNLCKRLVRRYPNLHVTFVVTEEWLGFIGP 60

Query: 61  DPKPQNIQFATFPNVIPSELCRTNDFPDFVRSIHTHMEAPVETLLRRL-EPPPIAILADA 120
           DPKP  I F+T PN+IPSEL R  DF  F+ +++T +E P E LL  L  PPP  I AD 
Sbjct: 61  DPKPDRIHFSTLPNLIPSELVRAKDFIGFIDAVYTRLEEPFEKLLDSLNSPPPSVIFADT 120

Query: 121 FVTWAVQLGQRLNVLVASLWPMSATVFSILYHFDLLKENGHFPADLLERGEEIVDYFPGV 180
           +V WAV++G++ N+ V SLW MSAT+ S   H DLL  +GH   +  E  EE+VDY PG+
Sbjct: 121 YVIWAVRVGRKRNIPVVSLWTMSATILSFFLHSDLLISHGHALFEPSE--EEVVDYVPGL 180

Query: 181 SKIRLADLPSFFSGDGLQSVEFAVNSARSVDKSQFLISTSVYELESSVIDSLKAKFPFPV 240
           S  +L DLP  F G   +  + A      +  ++ L+ T+ YELE   ID+  +K   PV
Sbjct: 181 SPTKLRDLPPIFDGYSDRVFKTAKLCFDELPGARSLLFTTAYELEHKAIDAFTSKLDIPV 240

Query: 241 YTIGPSTPYFELECSVPNGGTNDYLRWLDSQAEGSVLYISQGSFLSVSSAQMDEIVAGVK 300
           Y IGP  P+ EL     N   N Y++WL+ Q EGSVLYISQGSFLSVS AQM+EIV G++
Sbjct: 241 YAIGPLIPFEELSVQNDNKEPN-YIQWLEEQPEGSVLYISQGSFLSVSEAQMEEIVKGLR 300

Query: 301 ASGVRFLWVARGDDGRLKD-VDRETGMVVGWCDQLRVLCHRAVGGFWTHGGWNSTLEGIF 360
            SGVRFLWVARG + +LK+ ++   G+VV WCDQLRVLCH+AVGGFWTH G+NSTLEGI+
Sbjct: 301 ESGVRFLWVARGGELKLKEALEGSLGVVVSWCDQLRVLCHKAVGGFWTHCGFNSTLEGIY 360

Query: 361 AGVPMLAWPIFWDQFPNSKNIAEDWKVGVRF-KTVGGKDLVRRLEIAEFVKRFMNSESVE 420
           +GVPMLA+P+FWDQ  N+K I EDW+VG+R  +T   + L+ R EI E VKRFM+ ES E
Sbjct: 361 SGVPMLAFPLFWDQILNAKMIVEDWRVGMRIERTKKNELLIGREEIKEVVKRFMDRESEE 420

Query: 421 GREMRNRVSEFQEICQLAVAKGGSSDSNIDSFLNHIS 454
           G+EMR R  +  EI + AVAK GSS+ NID F+ HI+
Sbjct: 421 GKEMRRRACDLSEISRGAVAKSGSSNVNIDEFVRHIT 452

BLAST of HG10001558 vs. ExPASy Swiss-Prot
Match: O64732 (UDP-glycosyltransferase 87A1 OS=Arabidopsis thaliana OX=3702 GN=UGT87A1 PE=2 SV=1)

HSP 1 Score: 462.6 bits (1189), Expect = 5.1e-129
Identity = 231/439 (52.62%), Positives = 303/439 (69.02%), Query Frame = 0

Query: 18  LPYPGRGHINALMNLCKLLSLKNPNILISFIVTNEWLTFLVADPKPQNIQFATFPNVIPS 77
           +P+PGRGHIN ++NLCK L  ++PN+ ++F+VT EWL F+ +DPKP  I FAT PN+IPS
Sbjct: 1   MPWPGRGHINPMLNLCKSLVRRDPNLTVTFVVTEEWLGFIGSDPKPNRIHFATLPNIIPS 60

Query: 78  ELCRTNDFPDFVRSIHTHMEAPVETLLRRLEPPPIAILADAFVTWAVQLGQRLNVLVASL 137
           EL R NDF  F+ ++ T +E P E LL RL  PP AI+AD ++ WAV++G + N+ VAS 
Sbjct: 61  ELVRANDFIAFIDAVLTRLEEPFEQLLDRLNSPPTAIIADTYIIWAVRVGTKRNIPVASF 120

Query: 138 WPMSATVFSILYHFDLLKENGHFPADLLE-RGEEIVDYFPGVSKIRLADLPSFFSGDGLQ 197
           W  SAT+ S+  + DLL  +GHFP +  E + +EIVDY PG+S  RL+DL     G   Q
Sbjct: 121 WTTSATILSLFINSDLLASHGHFPIEPSESKLDEIVDYIPGLSPTRLSDL-QILHGYSHQ 180

Query: 198 SVEFAVNSARSVDKSQFLISTSVYELESSVIDSLKAKFPFPVYTIGPSTPYFELECSVPN 257
                  S   + K+++L+  S YELE   ID   +KF FPVY+ GP  P  EL     N
Sbjct: 181 VFNIFKKSFGELYKAKYLLFPSAYELEPKAIDFFTSKFDFPVYSTGPLIPLEELSVGNEN 240

Query: 258 GGTNDYLRWLDSQAEGSVLYISQGSFLSVSSAQMDEIVAGVKASGVRFLWVARGDDGRLK 317
               DY +WLD Q E SVLYISQGSFLSVS AQM+EIV GV+ +GV+F WVARG + +LK
Sbjct: 241 REL-DYFKWLDEQPESSVLYISQGSFLSVSEAQMEEIVVGVREAGVKFFWVARGGELKLK 300

Query: 318 D-VDRETGMVVGWCDQLRVLCHRAVGGFWTHGGWNSTLEGIFAGVPMLAWPIFWDQFPNS 377
           + ++   G+VV WCDQLRVLCH A+GGFWTH G+NSTLEGI +GVP+L +P+FWDQF N+
Sbjct: 301 EALEGSLGVVVSWCDQLRVLCHAAIGGFWTHCGYNSTLEGICSGVPLLTFPVFWDQFLNA 360

Query: 378 KNIAEDWKVGVRFKTVGGKD-LVRRLEIAEFVKRFMNSESVEGREMRNRVSEFQEICQLA 437
           K I E+W+VG+  +     + L+   EI E VKRFM+ ES EG+EMR R  +  EIC+ A
Sbjct: 361 KMIVEEWRVGMGIERKKQMELLIVSDEIKELVKRFMDGESEEGKEMRRRTCDLSEICRGA 420

Query: 438 VAKGGSSDSNIDSFLNHIS 454
           VAKGGSSD+NID+F+  I+
Sbjct: 421 VAKGGSSDANIDAFIKDIT 437

BLAST of HG10001558 vs. ExPASy Swiss-Prot
Match: Q9SJL0 (UDP-glycosyltransferase 86A1 OS=Arabidopsis thaliana OX=3702 GN=UGT86A1 PE=2 SV=1)

HSP 1 Score: 237.7 bits (605), Expect = 2.7e-61
Identity = 148/485 (30.52%), Positives = 246/485 (50.72%), Query Frame = 0

Query: 8   AAPKRIHLAALPYPGRGHINALMNLCKLLSLKNPNILISFIVTNE-------------WL 67
           A  ++ H+  +PYP +GH+   ++L   + L +    I+F+ T+                
Sbjct: 4   AKSRKPHIMMIPYPLQGHVIPFVHLA--IKLASHGFTITFVNTDSIHHHISTAHQDDAGD 63

Query: 68  TFLVADPKPQ-NIQFATFPNVIPSELCRTNDFPDFVRSI----HTHMEAPVETLLRRLEP 127
            F  A    Q +I++ T  +  P +  R+ +   F   I      H++  +  L RR +P
Sbjct: 64  IFSAARSSGQHDIRYTTVSDGFPLDFDRSLNHDQFFEGILHVFSAHVDDLIAKLSRRDDP 123

Query: 128 PPIAILADAFVTWAVQLGQRLNVLVASLWPMSATVFSILYHFDLLKENGHFPADLLERGE 187
           P   ++AD F  W+  +  + N++  S W   A V ++ YH DLL  NGHF +  L+  +
Sbjct: 124 PVTCLIADTFYVWSSMICDKHNLVNVSFWTEPALVLNLYYHMDLLISNGHFKS--LDNRK 183

Query: 188 EIVDYFPGVSKIRLADLPSFFSGD----GLQSVEFAV--NSARSVDKSQFLISTSVYELE 247
           +++DY PGV  I   DL S+           +V + +   + + V ++ F++  +V ELE
Sbjct: 184 DVIDYVPGVKAIEPKDLMSYLQVSDKDVDTNTVVYRILFKAFKDVKRADFVVCNTVQELE 243

Query: 248 SSVIDSLKAKFPFPVYTIGPSTPYFELECSVPNG--GTNDYLRWLDSQAEGSVLYISQGS 307
              + +L+AK   PVY IG   P F  +  VP      +D   WL  +  GSVLY+S GS
Sbjct: 244 PDSLSALQAK--QPVYAIG---PVFSTDSVVPTSLWAESDCTEWLKGRPTGSVLYVSFGS 303

Query: 308 FLSVSSAQMDEIVAGVKASGVRFLWVARGD----------DGRLKDVDRETGMVVGWCDQ 367
           +  V   ++ EI  G+  SG+ F+WV R D               D  ++ G+VV WC Q
Sbjct: 304 YAHVGKKEIVEIAHGLLLSGISFIWVLRPDIVGSNVPDFLPAGFVDQAQDRGLVVQWCCQ 363

Query: 368 LRVLCHRAVGGFWTHGGWNSTLEGIFAGVPMLAWPIFWDQFPNSKNIAEDWKVGVRFKTV 427
           + V+ + AVGGF+TH GWNS LE ++ G+P+L +P+  DQF N K + +DW +G+    +
Sbjct: 364 MEVISNPAVGGFFTHCGWNSILESVWCGLPLLCYPLLTDQFTNRKLVVDDWCIGI---NL 423

Query: 428 GGKDLVRRLEIAEFVKRFMNSESVEGREMRNRVSEFQEICQLAVAKGGSSDSNIDSFLNH 457
             K  + R +++  VKR MN E+    E+RN V + +   + AV   GSS++N + F++ 
Sbjct: 424 CEKKTITRDQVSANVKRLMNGET--SSELRNNVEKVKRHLKDAVTTVGSSETNFNLFVSE 474

BLAST of HG10001558 vs. ExPASy Swiss-Prot
Match: Q9M9E7 (UDP-glycosyltransferase 85A4 OS=Arabidopsis thaliana OX=3702 GN=UGT85A4 PE=2 SV=1)

HSP 1 Score: 223.4 bits (568), Expect = 5.2e-57
Identity = 148/486 (30.45%), Positives = 247/486 (50.82%), Query Frame = 0

Query: 6   GSAAPKRIHLAALPYPGRGHINALMNLCKLLSLKNPNILISFIVTNEWLTFLVADPKPQ- 65
           G ++ ++ H   +PYP +GHIN ++ L KLL  +     ++F+ T+     ++    P  
Sbjct: 5   GGSSSQKPHAMCIPYPAQGHINPMLKLAKLLHAR--GFHVTFVNTDYNHRRILQSRGPHA 64

Query: 66  -----NIQFATFPNVIP-SELCRTNDFPDFVRSIHTHMEAPVETLLRRLE-----PPPIA 125
                + +F T P+ +P +++    D    + S   +  AP + L+ RL      PP   
Sbjct: 65  LNGLPSFRFETIPDGLPWTDVDAKQDMLKLIDSTINNCLAPFKDLILRLNSGSDIPPVSC 124

Query: 126 ILADAFVTWAVQLGQRLNVLVASLWPMSATVFSILYHFDLLKENGHFP----ADLLERGE 185
           I++DA +++ +   + L + V  LW  SAT   +  H+  L E    P    +DL +  E
Sbjct: 125 IISDASMSFTIDAAEELKIPVVLLWTNSATALILYLHYQKLIEKEIIPLKDSSDLKKHLE 184

Query: 186 EIVDYFPGVSKIRLADLPSFFSGDGLQS--VEFAVNSARSVDKSQFLISTSVYELESSVI 245
             +D+ P + KI+L D P F +    Q   + F ++    + ++  +   +  +LE +V+
Sbjct: 185 TEIDWIPSMKKIKLKDFPDFVTTTNPQDPMISFILHVTGRIKRASAIFINTFEKLEHNVL 244

Query: 246 DSLKAKFPFPVYTIGP----STPYFELECSVPNGGTN------DYLRWLDSQAEGSVLYI 305
            SL++  P  +Y++GP         +    +   G N      + L WLD++AE +V+Y+
Sbjct: 245 LSLRSLLP-QIYSVGPFQILENREIDKNSEIRKLGLNLWEEETESLDWLDTKAEKAVIYV 304

Query: 306 SQGSFLSVSSAQMDEIVAGVKASGVRFLWVAR-----GDDGRLK----DVDRETGMVV-G 365
           + GS   ++S Q+ E   G+  SG  FLWV R     GDD  L        +  GM++ G
Sbjct: 305 NFGSLTVLTSEQILEFAWGLARSGKEFLWVVRSGMVDGDDSILPAEFLSETKNRGMLIKG 364

Query: 366 WCDQLRVLCHRAVGGFWTHGGWNSTLEGIFAGVPMLAWPIFWDQFPNSKNIAEDWKVGVR 425
           WC Q +VL H A+GGF TH GWNSTLE ++AGVPM+ WP F DQ  N K   EDW +G+ 
Sbjct: 365 WCSQEKVLSHPAIGGFLTHCGWNSTLESLYAGVPMICWPFFADQLTNRKFCCEDWGIGME 424

Query: 426 FKTVGGKDLVRRLEIAEFVKRFMNSESVEGREMRNRVSEFQEICQLAVAKG-GSSDSNID 453
                G++ V+R  +   VK  M+ E  +G+ +R +V E++ + + A A   GSS  N +
Sbjct: 425 I----GEE-VKRERVETVVKELMDGE--KGKRLREKVVEWRRLAEEASAPPLGSSYVNFE 480

BLAST of HG10001558 vs. ExPASy Swiss-Prot
Match: F8WLS6 (7-deoxyloganetin glucosyltransferase OS=Catharanthus roseus OX=4058 GN=UGT85A23 PE=1 SV=1)

HSP 1 Score: 223.0 bits (567), Expect = 6.8e-57
Identity = 150/494 (30.36%), Positives = 239/494 (48.38%), Query Frame = 0

Query: 1   MDPISGSAAPKRIHLAALPYPGRGHINALMNLCKLLSLKNPNILISFIVTNEWLTFLVAD 60
           M  +S S   K+ H   +PYP +GHIN ++ L KLL  K     I+F+ T      L+  
Sbjct: 1   MGSLSSSDYSKKPHAVCIPYPAQGHINPMLKLAKLLHYK--GFHITFVNTEFNHKRLLKS 60

Query: 61  PKPQNI------QFATFPNVI-PSELCRTNDFPDFVRSIHTHMEAPVETLLRRLE----- 120
               ++      QF T P+ + PS++  T D P    S  TH   P + LL++L      
Sbjct: 61  RGSDSLKGLHSFQFKTIPDGLPPSDVDATQDIPSLCESTTTHCLVPFKQLLQKLNDTSSS 120

Query: 121 --PPPIAILADAFVTWAVQLGQRLNVLVASLWPMSATVFSILYHFDLLKENGHFP---AD 180
             PP   +++DA +++ +   Q L++     W  SA       H+  L + G  P   A 
Sbjct: 121 EVPPVSCVVSDAVMSFTISAAQELDIPEVLFWTPSACGVLGYMHYAQLIDKGLTPLKDAS 180

Query: 181 LLERG--EEIVDYFPGVSKIRLADLPSFF--SGDGLQSVEFAVNSARSVDKSQFLISTSV 240
               G  ++++D+ PG+  IRL DLP+F   +      ++F +       K+  ++  + 
Sbjct: 181 YFSNGFLDQVLDWIPGMEGIRLRDLPTFLRTTNPDEYMIKFILQETERSKKASAIVLNTF 240

Query: 241 YELESSVIDSLKAKFPFPVYTIGPSTPYFEL--ECSVPNGGTN------DYLRWLDSQAE 300
            ELES VIDSL    P P+Y IGP         + S+   G+N      + L WLD++  
Sbjct: 241 QELESEVIDSLSTLLP-PIYPIGPLQILQNQVDDESLKVLGSNLWKEEPECLEWLDTKDP 300

Query: 301 GSVLYISQGSFLSVSSAQMDEIVAGVKASGVRFLWVARGD---------DGRLKDVDRET 360
            SV+Y++ GS   +++ Q+ E   G+  S   FLW+ R D              +  +E 
Sbjct: 301 NSVVYVNFGSITVMTNDQLIEFAWGLANSKQNFLWIIRPDLISGESSILGEEFVEETKER 360

Query: 361 GMVVGWCDQLRVLCHRAVGGFWTHGGWNSTLEGIFAGVPMLAWPIFWDQFPNSKNIAEDW 420
           G++  WC Q +V+ H A+GGF TH GWNST+E I +GVPM+ WP F +Q  N +     W
Sbjct: 361 GLIASWCHQEQVINHPAIGGFLTHNGWNSTIESISSGVPMICWPFFAEQQTNCRFCCNKW 420

Query: 421 KVGVRFKTVGGKDLVRRLEIAEFVKRFMNSESVEGREMRNRVSEFQEICQLAVAK-GGSS 456
            +G+   +      V+R E+   VK  M  E  +G+EM+ +  E++ I ++   K  GSS
Sbjct: 421 GIGMEINSD-----VKRDEVESLVKELMVGE--KGKEMKKKALEWKNIAEVTTTKPDGSS 480

BLAST of HG10001558 vs. ExPASy TrEMBL
Match: A0A5D3C6P9 (UDP-glycosyltransferase 87A1-like OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold852G00150 PE=4 SV=1)

HSP 1 Score: 802.4 bits (2071), Expect = 1.0e-228
Identity = 392/454 (86.34%), Positives = 414/454 (91.19%), Query Frame = 0

Query: 1   MDPISGSAAPKRIHLAALPYPGRGHINALMNLCKLLSLKNPNILISFIVTNEWLTFLVAD 60
           MDP+      KR+HLAALPYPGRGHINALMN CKLLSLKNPNI ISFIVT EWL+FL AD
Sbjct: 1   MDPVR-----KRVHLAALPYPGRGHINALMNFCKLLSLKNPNISISFIVTEEWLSFLAAD 60

Query: 61  PKPQNIQFATFPNVIPSELCRTNDFPDFVRSIHTHMEAPVETLLRRLEPPPIAILADAFV 120
           PKP NI F T PNVIPSEL R NDFP F+RS+ THMEAPVETLLRRLEPPP AI+AD F 
Sbjct: 61  PKPPNIHFVTIPNVIPSELHRANDFPGFIRSVQTHMEAPVETLLRRLEPPPTAIIADTFG 120

Query: 121 TWAVQLGQRLNVLVASLWPMSATVFSILYHFDLLKENGHFPADLLERGEEIVDYFPGVSK 180
           TWAVQLG+RL+V VASLWPMSATVFSILYHFDLLKENGHFPADL ERGEEIVDYFPGVSK
Sbjct: 121 TWAVQLGKRLDVPVASLWPMSATVFSILYHFDLLKENGHFPADLSERGEEIVDYFPGVSK 180

Query: 181 IRLADLPSFFSGDGLQSVEFAVNSARSVDKSQFLISTSVYELESSVIDSLKAKFPFPVYT 240
           IRLADLPSFFSG+GLQS+EFAV SARSVDK+QFLISTSVYELESSV+DSLKAKFPFPVYT
Sbjct: 181 IRLADLPSFFSGNGLQSIEFAVKSARSVDKAQFLISTSVYELESSVLDSLKAKFPFPVYT 240

Query: 241 IGPSTPYFELECSVPNGGTNDYLRWLDSQAEGSVLYISQGSFLSVSSAQMDEIVAGVKAS 300
           IGPSTPYFELE SV    +NDYLRWLDSQ +GSVLY+SQGSFLSVS+AQMDEI+AGVKAS
Sbjct: 241 IGPSTPYFELESSV----SNDYLRWLDSQTDGSVLYVSQGSFLSVSNAQMDEIIAGVKAS 300

Query: 301 GVRFLWVARGDDGRLKDVDRETGMVVGWCDQLRVLCHRAVGGFWTHGGWNSTLEGIFAGV 360
           GVRFLWVARGDD R KDVDRETGMVVGWCDQLRVLCHRAVGGFWTHGGWNSTLEG+FAGV
Sbjct: 301 GVRFLWVARGDDDRWKDVDRETGMVVGWCDQLRVLCHRAVGGFWTHGGWNSTLEGVFAGV 360

Query: 361 PMLAWPIFWDQFPNSKNIAEDWKVGVRFKTVGGKDLVRRLEIAEFVKRFMNSESVEGREM 420
           PML WPIFWDQFPNSK IAEDWKVGVRFK  GGKDLVRR EIAEFVK+FMNSESVE +EM
Sbjct: 361 PMLVWPIFWDQFPNSKKIAEDWKVGVRFKGAGGKDLVRREEIAEFVKKFMNSESVESKEM 420

Query: 421 RNRVSEFQEICQLAVAKGGSSDSNIDSFLNHISG 455
           R RVSEFQEIC+ AVAKGGSSDSNID+FLNHISG
Sbjct: 421 RKRVSEFQEICRRAVAKGGSSDSNIDAFLNHISG 445

BLAST of HG10001558 vs. ExPASy TrEMBL
Match: A0A1S3CLR3 (UDP-glycosyltransferase 87A1-like OS=Cucumis melo OX=3656 GN=LOC103501885 PE=4 SV=1)

HSP 1 Score: 802.4 bits (2071), Expect = 1.0e-228
Identity = 392/454 (86.34%), Positives = 414/454 (91.19%), Query Frame = 0

Query: 1   MDPISGSAAPKRIHLAALPYPGRGHINALMNLCKLLSLKNPNILISFIVTNEWLTFLVAD 60
           MDP+      KR+HLAALPYPGRGHINALMN CKLLSLKNPNI ISFIVT EWL+FL AD
Sbjct: 1   MDPVR-----KRVHLAALPYPGRGHINALMNFCKLLSLKNPNISISFIVTEEWLSFLAAD 60

Query: 61  PKPQNIQFATFPNVIPSELCRTNDFPDFVRSIHTHMEAPVETLLRRLEPPPIAILADAFV 120
           PKP NI F T PNVIPSEL R NDFP F+RS+ THMEAPVETLLRRLEPPP AI+AD F 
Sbjct: 61  PKPPNIHFVTIPNVIPSELHRANDFPGFIRSVQTHMEAPVETLLRRLEPPPTAIIADTFG 120

Query: 121 TWAVQLGQRLNVLVASLWPMSATVFSILYHFDLLKENGHFPADLLERGEEIVDYFPGVSK 180
           TWAVQLG+RL+V VASLWPMSATVFSILYHFDLLKENGHFPADL ERGEEIVDYFPGVSK
Sbjct: 121 TWAVQLGKRLDVPVASLWPMSATVFSILYHFDLLKENGHFPADLSERGEEIVDYFPGVSK 180

Query: 181 IRLADLPSFFSGDGLQSVEFAVNSARSVDKSQFLISTSVYELESSVIDSLKAKFPFPVYT 240
           IRLADLPSFFSG+GLQS+EFAV SARSVDK+QFLISTSVYELESSV+DSLKAKFPFPVYT
Sbjct: 181 IRLADLPSFFSGNGLQSIEFAVKSARSVDKAQFLISTSVYELESSVLDSLKAKFPFPVYT 240

Query: 241 IGPSTPYFELECSVPNGGTNDYLRWLDSQAEGSVLYISQGSFLSVSSAQMDEIVAGVKAS 300
           IGPSTPYFELE SV    +NDYLRWLDSQ +GSVLY+SQGSFLSVS+AQMDEI+AGVKAS
Sbjct: 241 IGPSTPYFELESSV----SNDYLRWLDSQTDGSVLYVSQGSFLSVSNAQMDEIIAGVKAS 300

Query: 301 GVRFLWVARGDDGRLKDVDRETGMVVGWCDQLRVLCHRAVGGFWTHGGWNSTLEGIFAGV 360
           GVRFLWVARGDD R KDVDRETGMVVGWCDQLRVLCHRAVGGFWTHGGWNSTLEG+FAGV
Sbjct: 301 GVRFLWVARGDDDRWKDVDRETGMVVGWCDQLRVLCHRAVGGFWTHGGWNSTLEGVFAGV 360

Query: 361 PMLAWPIFWDQFPNSKNIAEDWKVGVRFKTVGGKDLVRRLEIAEFVKRFMNSESVEGREM 420
           PML WPIFWDQFPNSK IAEDWKVGVRFK  GGKDLVRR EIAEFVK+FMNSESVE +EM
Sbjct: 361 PMLVWPIFWDQFPNSKKIAEDWKVGVRFKGAGGKDLVRREEIAEFVKKFMNSESVESKEM 420

Query: 421 RNRVSEFQEICQLAVAKGGSSDSNIDSFLNHISG 455
           R RVSEFQEIC+ AVAKGGSSDSNID+FLNHISG
Sbjct: 421 RKRVSEFQEICRRAVAKGGSSDSNIDAFLNHISG 445

BLAST of HG10001558 vs. ExPASy TrEMBL
Match: A0A0A0KH18 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G181560 PE=4 SV=1)

HSP 1 Score: 793.5 bits (2048), Expect = 4.7e-226
Identity = 393/456 (86.18%), Positives = 414/456 (90.79%), Query Frame = 0

Query: 1   MDPISGSAAPKRIHLAALPYPGRGHINALMNLCKLLSLKNPNILISFIVTNEWLTFLVAD 60
           MDPIS     KRIHLAALPYPGRGHINAL+N CK+LSLK+PNI ISFIVT+EWLTFL AD
Sbjct: 1   MDPIS-----KRIHLAALPYPGRGHINALINFCKILSLKSPNISISFIVTDEWLTFLAAD 60

Query: 61  PKPQNIQFATFPNVIPSELCRTNDFPDFVRSIHTHMEAPVETLLRRLEPPPIAILADAFV 120
           PKP NI F TFPNVIPSEL R NDFP FVRSI THMEAPVETLLRRL PPP AI+AD FV
Sbjct: 61  PKPPNIHFVTFPNVIPSELHRANDFPGFVRSIQTHMEAPVETLLRRLHPPPTAIIADTFV 120

Query: 121 TWAVQLGQRLNVLVASLWPMSATVFSILYHFDLLKENGHFPADLLERGEEIVDYFPGVSK 180
            WAVQLG+RL+V VASLWPMSATVFSILYHFDLLKENGHFPADL ERGEEIVDYFPGVSK
Sbjct: 121 YWAVQLGKRLDVPVASLWPMSATVFSILYHFDLLKENGHFPADLSERGEEIVDYFPGVSK 180

Query: 181 IRLADLPSFFSGDGLQSVEFAVNSARSVDKSQFLISTSVYELESSVIDSLKAKFPFPVYT 240
           IRLADLPSFFSG+GLQ++ F+V SARSVDK+QFLISTSVYELESSVIDSLKA FPFPVYT
Sbjct: 181 IRLADLPSFFSGNGLQTLGFSVKSARSVDKAQFLISTSVYELESSVIDSLKANFPFPVYT 240

Query: 241 IGPSTPYFELECSVPNGGTNDYLRWLDSQAEGSVLYISQGSFLSVSSAQMDEIVAGVKAS 300
           IGPSTPYFELE S     +NDYL+WLDSQAEGSVLYISQGSFLSVS+ QMDEIVAGVKAS
Sbjct: 241 IGPSTPYFELESS----ASNDYLQWLDSQAEGSVLYISQGSFLSVSNTQMDEIVAGVKAS 300

Query: 301 GVRFLWVARGDDGRLKDVDRETGMVVGWCDQLRVLCHRAVGGFWTHGGWNSTLEGIFAGV 360
           GVRFLWVARGDD R KDVDRETGMVVGWCDQLRVLCH AVGGFWTHGGWNST+EG+FAGV
Sbjct: 301 GVRFLWVARGDDDRWKDVDRETGMVVGWCDQLRVLCHGAVGGFWTHGGWNSTVEGVFAGV 360

Query: 361 PMLAWPIFWDQFPNSKNIAEDWKVGVRFKTVGGKDLVRRLEIAEFVKRFMNSESVEGREM 420
           PML WPIFWDQFPNSK IAEDW+VGVRFK VGGKDLVRR EIAEFVKRFMNSESVEG+EM
Sbjct: 361 PMLVWPIFWDQFPNSKKIAEDWQVGVRFKGVGGKDLVRREEIAEFVKRFMNSESVEGKEM 420

Query: 421 RNRVSEFQEICQLAVAKGGSSDSNIDSFLNHISGKL 457
           R RVSEFQEIC+ AVAKGGSSDSNID+FL HISG L
Sbjct: 421 RKRVSEFQEICRGAVAKGGSSDSNIDAFLKHISGDL 447

BLAST of HG10001558 vs. ExPASy TrEMBL
Match: A0A6J1G8M4 (UDP-glycosyltransferase 87A2-like OS=Cucurbita moschata OX=3662 GN=LOC111451869 PE=4 SV=1)

HSP 1 Score: 736.1 bits (1899), Expect = 8.8e-209
Identity = 362/456 (79.39%), Positives = 399/456 (87.50%), Query Frame = 0

Query: 1   MDPISGSAAPKRIHLAALPYPGRGHINALMNLCKLLSLKNPNILISFIVTNEWLTFLVAD 60
           MDPI GSA  K+ HLAA+PYPGRGH+NALMNLCKLLSLKNPNILISFIVT+EWLTFL  +
Sbjct: 1   MDPI-GSATTKQTHLAAVPYPGRGHVNALMNLCKLLSLKNPNILISFIVTDEWLTFLAGE 60

Query: 61  PKPQNIQFATFPNVIPSELCRTNDFPDFVRSIHTHMEAPVETLLRRLEPPPIAILADAFV 120
           PKPQNI+FAT PNVIPSE+ R NDFP F+RS+++ MEAP  TLL RL PPP AI+ADAF+
Sbjct: 61  PKPQNIRFATIPNVIPSEIGRANDFPGFIRSVNSDMEAPTNTLLTRLHPPPTAIVADAFL 120

Query: 121 TWAVQLGQRLNVLVASLWPMSATVFSILYHFDLLKENGHFPADLLERGEEIVDYFPGVSK 180
           TW VQLG  L + VASLWPMS TVFSILYHF+LL+ENG FPA+L ERGE+IVDYFPGVSK
Sbjct: 121 TWMVQLGNNLCIPVASLWPMSVTVFSILYHFELLQENGDFPAELSERGEQIVDYFPGVSK 180

Query: 181 IRLADLPSFFSGDGLQSVEFAVNSARSVDKSQFLISTSVYELESSVIDSLKAKFPFPVYT 240
           IRLADLPSFFSG+G++ V  AV SARSVD SQFLISTSVYELESSVID+LKAKFP P+YT
Sbjct: 181 IRLADLPSFFSGNGVKVVGAAVKSARSVDNSQFLISTSVYELESSVIDALKAKFPIPIYT 240

Query: 241 IGPSTPYFELECSV-PNGG-TNDYLRWLDSQAEGSVLYISQGSFLSVSSAQMDEIVAGVK 300
           IGPS PYFELE SV  NGG   +YLRWLDSQ EGSVLYISQGSFLSVSSAQM+EI+AGVK
Sbjct: 241 IGPSAPYFELETSVKDNGGDPKNYLRWLDSQTEGSVLYISQGSFLSVSSAQMEEIIAGVK 300

Query: 301 ASGVRFLWVARGDDGRLKDVDRETGMVVGWCDQLRVLCHRAVGGFWTHGGWNSTLEGIFA 360
            SGVRFLWVARGDDGRLKDVD ETGMVV WCDQL+VLCH AVGGFWTHGGWNSTLEG+FA
Sbjct: 301 DSGVRFLWVARGDDGRLKDVDGETGMVVEWCDQLKVLCHSAVGGFWTHGGWNSTLEGVFA 360

Query: 361 GVPMLAWPIFWDQFPNSKNIAEDWKVGVRFKTVGGKDLVRRLEIAEFVKRFMNSESVEGR 420
           GVPMLAWPIFWDQ PNSK I EDWKVGVRF+ VGG++LV R EIAE VKRFMN E+VEGR
Sbjct: 361 GVPMLAWPIFWDQIPNSKKIVEDWKVGVRFQAVGGRNLVGREEIAETVKRFMNPENVEGR 420

Query: 421 EMRNRVSEFQEICQLAVAKGGSSDSNIDSFLNHISG 455
           EMR RVSE ++ C+ AVA+GGSSDSNID+FL  I G
Sbjct: 421 EMRKRVSELRDACRRAVARGGSSDSNIDAFLGDICG 455
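
The percentages on each HSP statistics line follow directly from the match counts (for the hit above, 362 identical and 399 similar positions out of 456 aligned columns). A minimal Python sketch, assuming the line layout shown on this page, parses such a line and recomputes both values; the function name `parse_hsp_stats` and the dictionary keys are hypothetical and not part of any BLAST tool.

```python
import re

# Pattern for the HSP statistics line as printed on this page.
# "Pos\w+" tolerates spelling variants of the "Positives" label.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Pos\w+ = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return recomputed and reported identity/positives percentages."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    return {
        # Recomputed from the raw counts: matches / alignment length.
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positives_pct": round(100 * int(pos) / int(pos_len), 2),
        # Values as printed in the report, for comparison.
        "reported_identity_pct": float(ident_pct),
        "reported_positives_pct": float(pos_pct),
    }


line = "Identity = 362/456 (79.39%), Positives = 399/456 (87.50%), Query Frame = 0"
stats = parse_hsp_stats(line)
print(stats)
```

The denominator is the alignment length including gap columns, which is why it can differ from the length of either sequence.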

BLAST of HG10001558 vs. ExPASy TrEMBL
Match: A0A6J1L0A4 (UDP-glycosyltransferase 87A2 OS=Cucurbita maxima OX=3661 GN=LOC111499872 PE=4 SV=1)

HSP 1 Score: 726.1 bits (1873), Expect = 9.2e-206
Identity = 358/456 (78.51%), Positives = 395/456 (86.62%), Query Frame = 0

Query: 1   MDPISGSAAPKRIHLAALPYPGRGHINALMNLCKLLSLKNPNILISFIVTNEWLTFLVAD 60
           MDPIS SA  K+ HLAA+PYPGRGH+NALMNLCKLLSLKNPNILISFIVT EWLTFL  +
Sbjct: 1   MDPIS-SATTKQTHLAAVPYPGRGHVNALMNLCKLLSLKNPNILISFIVTEEWLTFLAGE 60

Query: 61  PKPQNIQFATFPNVIPSELCRTNDFPDFVRSIHTHMEAPVETLLRRLEPPPIAILADAFV 120
            KPQNI+FAT PNVIPSEL R NDFP F+RS+++ +EAP  TLL RL PPP AI+ADAF+
Sbjct: 61  SKPQNIRFATIPNVIPSELGRANDFPGFIRSVNSDLEAPTNTLLTRLHPPPTAIVADAFL 120

Query: 121 TWAVQLGQRLNVLVASLWPMSATVFSILYHFDLLKENGHFPADLLERGEEIVDYFPGVSK 180
            W VQLG  L + VASLWPMS TVFSILYHF+LL+E+G FPA+L ERGE IVDYFPGVSK
Sbjct: 121 PWVVQLGNNLRIPVASLWPMSVTVFSILYHFELLQEHGDFPAELSERGEHIVDYFPGVSK 180

Query: 181 IRLADLPSFFSGDGLQSVEFAVNSARSVDKSQFLISTSVYELESSVIDSLKAKFPFPVYT 240
           IRLADLPSFFSG+G++ V  A  SARSVD SQFLISTSVYELESSVID+LKAKFP P+YT
Sbjct: 181 IRLADLPSFFSGNGVKVVGAAEKSARSVDNSQFLISTSVYELESSVIDALKAKFPIPIYT 240

Query: 241 IGPSTPYFELECSV-PNGG-TNDYLRWLDSQAEGSVLYISQGSFLSVSSAQMDEIVAGVK 300
           IGPSTPYFELE SV  NGG   +YLRWLDSQAEGSVLYISQGSFLSVSSAQM+EI+AGVK
Sbjct: 241 IGPSTPYFELETSVKDNGGDPKNYLRWLDSQAEGSVLYISQGSFLSVSSAQMEEIIAGVK 300

Query: 301 ASGVRFLWVARGDDGRLKDVDRETGMVVGWCDQLRVLCHRAVGGFWTHGGWNSTLEGIFA 360
            SGVRFLWVARGDDG LKDVD ETGMVV WC+QL+VLCH A+GGFWTHGGWNSTLEG+FA
Sbjct: 301 DSGVRFLWVARGDDGWLKDVDEETGMVVEWCNQLKVLCHNAIGGFWTHGGWNSTLEGVFA 360

Query: 361 GVPMLAWPIFWDQFPNSKNIAEDWKVGVRFKTVGGKDLVRRLEIAEFVKRFMNSESVEGR 420
           GVPMLAWPIFWDQ PNSK I EDWKVGVRF+ VGG++LV R EIAE VKRFMN E+VEGR
Sbjct: 361 GVPMLAWPIFWDQIPNSKKIVEDWKVGVRFEAVGGRNLVGREEIAETVKRFMNPENVEGR 420

Query: 421 EMRNRVSEFQEICQLAVAKGGSSDSNIDSFLNHISG 455
           EMR RVSE +E C+ AVA+GGSSDSNID+FL  I G
Sbjct: 421 EMRKRVSELRETCRRAVARGGSSDSNIDAFLGDICG 455

BLAST of HG10001558 vs. TAIR 10
Match: AT2G30140.2 (UDP-Glycosyltransferase superfamily protein )

HSP 1 Score: 471.1 bits (1211), Expect = 1.0e-132
Identity = 242/457 (52.95%), Positives = 312/457 (68.27%), Query Frame = 0

Query: 1   MDPISGSAAPKRI-HLAALPYPGRGHINALMNLCKLLSLKNPNILISFIVTNEWLTFLVA 60
           MDP    + P +  H+ A+PYPGRGHIN +MNLCK L  + PN+ ++F+VT EWL F+  
Sbjct: 1   MDP--NESPPNQFRHVVAMPYPGRGHINPMMNLCKRLVRRYPNLHVTFVVTEEWLGFIGP 60

Query: 61  DPKPQNIQFATFPNVIPSELCRTNDFPDFVRSIHTHMEAPVETLLRRL-EPPPIAILADA 120
           DPKP  I F+T PN+IPSEL R  DF  F+ +++T +E P E LL  L  PPP  I AD 
Sbjct: 61  DPKPDRIHFSTLPNLIPSELVRAKDFIGFIDAVYTRLEEPFEKLLDSLNSPPPSVIFADT 120

Query: 121 FVTWAVQLGQRLNVLVASLWPMSATVFSILYHFDLLKENGHFPADLLERGEEIVDYFPGV 180
           +V WAV++G++ N+ V SLW MSAT+ S   H DLL  +GH    L E  EE+VDY PG+
Sbjct: 121 YVIWAVRVGRKRNIPVVSLWTMSATILSFFLHSDLLISHGH---ALFEPSEEVVDYVPGL 180

Query: 181 SKIRLADLPSFFSGDGLQSVEFAVNSARSVDKSQFLISTSVYELESSVIDSLKAKFPFPV 240
           S  +L DLP  F G   +  + A      +  ++ L+ T+ YELE   ID+  +K   PV
Sbjct: 181 SPTKLRDLPPIFDGYSDRVFKTAKLCFDELPGARSLLFTTAYELEHKAIDAFTSKLDIPV 240

Query: 241 YTIGPSTPYFELECSVPNGGTNDYLRWLDSQAEGSVLYISQGSFLSVSSAQMDEIVAGVK 300
           Y IGP  P+ EL     N   N Y++WL+ Q EGSVLYISQGSFLSVS AQM+EIV G++
Sbjct: 241 YAIGPLIPFEELSVQNDNKEPN-YIQWLEEQPEGSVLYISQGSFLSVSEAQMEEIVKGLR 300

Query: 301 ASGVRFLWVARGDDGRLKD-VDRETGMVVGWCDQLRVLCHRAVGGFWTHGGWNSTLEGIF 360
            SGVRFLWVARG + +LK+ ++   G+VV WCDQLRVLCH+AVGGFWTH G+NSTLEGI+
Sbjct: 301 ESGVRFLWVARGGELKLKEALEGSLGVVVSWCDQLRVLCHKAVGGFWTHCGFNSTLEGIY 360

Query: 361 AGVPMLAWPIFWDQFPNSKNIAEDWKVGVRF-KTVGGKDLVRRLEIAEFVKRFMNSESVE 420
           +GVPMLA+P+FWDQ  N+K I EDW+VG+R  +T   + L+ R EI E VKRFM+ ES E
Sbjct: 361 SGVPMLAFPLFWDQILNAKMIVEDWRVGMRIERTKKNELLIGREEIKEVVKRFMDRESEE 420

Query: 421 GREMRNRVSEFQEICQLAVAKGGSSDSNIDSFLNHIS 454
           G+EMR R  +  EI + AVAK GSS+ NID F+ HI+
Sbjct: 421 GKEMRRRACDLSEISRGAVAKSGSSNVNIDEFVRHIT 451

BLAST of HG10001558 vs. TAIR 10
Match: AT2G30140.1 (UDP-Glycosyltransferase superfamily protein )

HSP 1 Score: 467.6 bits (1202), Expect = 1.1e-131
Identity = 241/457 (52.74%), Positives = 312/457 (68.27%), Query Frame = 0

Query: 1   MDPISGSAAPKRI-HLAALPYPGRGHINALMNLCKLLSLKNPNILISFIVTNEWLTFLVA 60
           MDP    + P +  H+ A+PYPGRGHIN +MNLCK L  + PN+ ++F+VT EWL F+  
Sbjct: 1   MDP--NESPPNQFRHVVAMPYPGRGHINPMMNLCKRLVRRYPNLHVTFVVTEEWLGFIGP 60

Query: 61  DPKPQNIQFATFPNVIPSELCRTNDFPDFVRSIHTHMEAPVETLLRRL-EPPPIAILADA 120
           DPKP  I F+T PN+IPSEL R  DF  F+ +++T +E P E LL  L  PPP  I AD 
Sbjct: 61  DPKPDRIHFSTLPNLIPSELVRAKDFIGFIDAVYTRLEEPFEKLLDSLNSPPPSVIFADT 120

Query: 121 FVTWAVQLGQRLNVLVASLWPMSATVFSILYHFDLLKENGHFPADLLERGEEIVDYFPGV 180
           +V WAV++G++ N+ V SLW MSAT+ S   H DLL  +GH   +  E  EE+VDY PG+
Sbjct: 121 YVIWAVRVGRKRNIPVVSLWTMSATILSFFLHSDLLISHGHALFEPSE--EEVVDYVPGL 180

Query: 181 SKIRLADLPSFFSGDGLQSVEFAVNSARSVDKSQFLISTSVYELESSVIDSLKAKFPFPV 240
           S  +L DLP  F G   +  + A      +  ++ L+ T+ YELE   ID+  +K   PV
Sbjct: 181 SPTKLRDLPPIFDGYSDRVFKTAKLCFDELPGARSLLFTTAYELEHKAIDAFTSKLDIPV 240

Query: 241 YTIGPSTPYFELECSVPNGGTNDYLRWLDSQAEGSVLYISQGSFLSVSSAQMDEIVAGVK 300
           Y IGP  P+ EL     N   N Y++WL+ Q EGSVLYISQGSFLSVS AQM+EIV G++
Sbjct: 241 YAIGPLIPFEELSVQNDNKEPN-YIQWLEEQPEGSVLYISQGSFLSVSEAQMEEIVKGLR 300

Query: 301 ASGVRFLWVARGDDGRLKD-VDRETGMVVGWCDQLRVLCHRAVGGFWTHGGWNSTLEGIF 360
            SGVRFLWVARG + +LK+ ++   G+VV WCDQLRVLCH+AVGGFWTH G+NSTLEGI+
Sbjct: 301 ESGVRFLWVARGGELKLKEALEGSLGVVVSWCDQLRVLCHKAVGGFWTHCGFNSTLEGIY 360

Query: 361 AGVPMLAWPIFWDQFPNSKNIAEDWKVGVRF-KTVGGKDLVRRLEIAEFVKRFMNSESVE 420
           +GVPMLA+P+FWDQ  N+K I EDW+VG+R  +T   + L+ R EI E VKRFM+ ES E
Sbjct: 361 SGVPMLAFPLFWDQILNAKMIVEDWRVGMRIERTKKNELLIGREEIKEVVKRFMDRESEE 420

Query: 421 GREMRNRVSEFQEICQLAVAKGGSSDSNIDSFLNHIS 454
           G+EMR R  +  EI + AVAK GSS+ NID F+ HI+
Sbjct: 421 GKEMRRRACDLSEISRGAVAKSGSSNVNIDEFVRHIT 452

BLAST of HG10001558 vs. TAIR 10
Match: AT2G30150.1 (UDP-Glycosyltransferase superfamily protein )

HSP 1 Score: 462.6 bits (1189), Expect = 3.6e-130
Identity = 231/439 (52.62%), Positives = 303/439 (69.02%), Query Frame = 0

Query: 18  LPYPGRGHINALMNLCKLLSLKNPNILISFIVTNEWLTFLVADPKPQNIQFATFPNVIPS 77
           +P+PGRGHIN ++NLCK L  ++PN+ ++F+VT EWL F+ +DPKP  I FAT PN+IPS
Sbjct: 1   MPWPGRGHINPMLNLCKSLVRRDPNLTVTFVVTEEWLGFIGSDPKPNRIHFATLPNIIPS 60

Query: 78  ELCRTNDFPDFVRSIHTHMEAPVETLLRRLEPPPIAILADAFVTWAVQLGQRLNVLVASL 137
           EL R NDF  F+ ++ T +E P E LL RL  PP AI+AD ++ WAV++G + N+ VAS 
Sbjct: 61  ELVRANDFIAFIDAVLTRLEEPFEQLLDRLNSPPTAIIADTYIIWAVRVGTKRNIPVASF 120

Query: 138 WPMSATVFSILYHFDLLKENGHFPADLLE-RGEEIVDYFPGVSKIRLADLPSFFSGDGLQ 197
           W  SAT+ S+  + DLL  +GHFP +  E + +EIVDY PG+S  RL+DL     G   Q
Sbjct: 121 WTTSATILSLFINSDLLASHGHFPIEPSESKLDEIVDYIPGLSPTRLSDL-QILHGYSHQ 180

Query: 198 SVEFAVNSARSVDKSQFLISTSVYELESSVIDSLKAKFPFPVYTIGPSTPYFELECSVPN 257
                  S   + K+++L+  S YELE   ID   +KF FPVY+ GP  P  EL     N
Sbjct: 181 VFNIFKKSFGELYKAKYLLFPSAYELEPKAIDFFTSKFDFPVYSTGPLIPLEELSVGNEN 240

Query: 258 GGTNDYLRWLDSQAEGSVLYISQGSFLSVSSAQMDEIVAGVKASGVRFLWVARGDDGRLK 317
               DY +WLD Q E SVLYISQGSFLSVS AQM+EIV GV+ +GV+F WVARG + +LK
Sbjct: 241 REL-DYFKWLDEQPESSVLYISQGSFLSVSEAQMEEIVVGVREAGVKFFWVARGGELKLK 300

Query: 318 D-VDRETGMVVGWCDQLRVLCHRAVGGFWTHGGWNSTLEGIFAGVPMLAWPIFWDQFPNS 377
           + ++   G+VV WCDQLRVLCH A+GGFWTH G+NSTLEGI +GVP+L +P+FWDQF N+
Sbjct: 301 EALEGSLGVVVSWCDQLRVLCHAAIGGFWTHCGYNSTLEGICSGVPLLTFPVFWDQFLNA 360

Query: 378 KNIAEDWKVGVRFKTVGGKD-LVRRLEIAEFVKRFMNSESVEGREMRNRVSEFQEICQLA 437
           K I E+W+VG+  +     + L+   EI E VKRFM+ ES EG+EMR R  +  EIC+ A
Sbjct: 361 KMIVEEWRVGMGIERKKQMELLIVSDEIKELVKRFMDGESEEGKEMRRRTCDLSEICRGA 420

Query: 438 VAKGGSSDSNIDSFLNHIS 454
           VAKGGSSD+NID+F+  I+
Sbjct: 421 VAKGGSSDANIDAFIKDIT 437

BLAST of HG10001558 vs. TAIR 10
Match: AT2G36970.1 (UDP-Glycosyltransferase superfamily protein )

HSP 1 Score: 237.7 bits (605), Expect = 1.9e-62
Identity = 148/485 (30.52%), Positives = 246/485 (50.72%), Query Frame = 0

Query: 8   AAPKRIHLAALPYPGRGHINALMNLCKLLSLKNPNILISFIVTNE-------------WL 67
           A  ++ H+  +PYP +GH+   ++L   + L +    I+F+ T+                
Sbjct: 4   AKSRKPHIMMIPYPLQGHVIPFVHLA--IKLASHGFTITFVNTDSIHHHISTAHQDDAGD 63

Query: 68  TFLVADPKPQ-NIQFATFPNVIPSELCRTNDFPDFVRSI----HTHMEAPVETLLRRLEP 127
            F  A    Q +I++ T  +  P +  R+ +   F   I      H++  +  L RR +P
Sbjct: 64  IFSAARSSGQHDIRYTTVSDGFPLDFDRSLNHDQFFEGILHVFSAHVDDLIAKLSRRDDP 123

Query: 128 PPIAILADAFVTWAVQLGQRLNVLVASLWPMSATVFSILYHFDLLKENGHFPADLLERGE 187
           P   ++AD F  W+  +  + N++  S W   A V ++ YH DLL  NGHF +  L+  +
Sbjct: 124 PVTCLIADTFYVWSSMICDKHNLVNVSFWTEPALVLNLYYHMDLLISNGHFKS--LDNRK 183

Query: 188 EIVDYFPGVSKIRLADLPSFFSGD----GLQSVEFAV--NSARSVDKSQFLISTSVYELE 247
           +++DY PGV  I   DL S+           +V + +   + + V ++ F++  +V ELE
Sbjct: 184 DVIDYVPGVKAIEPKDLMSYLQVSDKDVDTNTVVYRILFKAFKDVKRADFVVCNTVQELE 243

Query: 248 SSVIDSLKAKFPFPVYTIGPSTPYFELECSVPNG--GTNDYLRWLDSQAEGSVLYISQGS 307
              + +L+AK   PVY IG   P F  +  VP      +D   WL  +  GSVLY+S GS
Sbjct: 244 PDSLSALQAK--QPVYAIG---PVFSTDSVVPTSLWAESDCTEWLKGRPTGSVLYVSFGS 303

Query: 308 FLSVSSAQMDEIVAGVKASGVRFLWVARGD----------DGRLKDVDRETGMVVGWCDQ 367
           +  V   ++ EI  G+  SG+ F+WV R D               D  ++ G+VV WC Q
Sbjct: 304 YAHVGKKEIVEIAHGLLLSGISFIWVLRPDIVGSNVPDFLPAGFVDQAQDRGLVVQWCCQ 363

Query: 368 LRVLCHRAVGGFWTHGGWNSTLEGIFAGVPMLAWPIFWDQFPNSKNIAEDWKVGVRFKTV 427
           + V+ + AVGGF+TH GWNS LE ++ G+P+L +P+  DQF N K + +DW +G+    +
Sbjct: 364 MEVISNPAVGGFFTHCGWNSILESVWCGLPLLCYPLLTDQFTNRKLVVDDWCIGI---NL 423

Query: 428 GGKDLVRRLEIAEFVKRFMNSESVEGREMRNRVSEFQEICQLAVAKGGSSDSNIDSFLNH 457
             K  + R +++  VKR MN E+    E+RN V + +   + AV   GSS++N + F++ 
Sbjct: 424 CEKKTITRDQVSANVKRLMNGET--SSELRNNVEKVKRHLKDAVTTVGSSETNFNLFVSE 474

BLAST of HG10001558 vs. TAIR 10
Match: AT1G78270.1 (UDP-glucosyl transferase 85A4 )

HSP 1 Score: 223.4 bits (568), Expect = 3.7e-58
Identity = 148/486 (30.45%), Positives = 247/486 (50.82%), Query Frame = 0

Query: 6   GSAAPKRIHLAALPYPGRGHINALMNLCKLLSLKNPNILISFIVTNEWLTFLVADPKPQ- 65
           G ++ ++ H   +PYP +GHIN ++ L KLL  +     ++F+ T+     ++    P  
Sbjct: 5   GGSSSQKPHAMCIPYPAQGHINPMLKLAKLLHAR--GFHVTFVNTDYNHRRILQSRGPHA 64

Query: 66  -----NIQFATFPNVIP-SELCRTNDFPDFVRSIHTHMEAPVETLLRRLE-----PPPIA 125
                + +F T P+ +P +++    D    + S   +  AP + L+ RL      PP   
Sbjct: 65  LNGLPSFRFETIPDGLPWTDVDAKQDMLKLIDSTINNCLAPFKDLILRLNSGSDIPPVSC 124

Query: 126 ILADAFVTWAVQLGQRLNVLVASLWPMSATVFSILYHFDLLKENGHFP----ADLLERGE 185
           I++DA +++ +   + L + V  LW  SAT   +  H+  L E    P    +DL +  E
Sbjct: 125 IISDASMSFTIDAAEELKIPVVLLWTNSATALILYLHYQKLIEKEIIPLKDSSDLKKHLE 184

Query: 186 EIVDYFPGVSKIRLADLPSFFSGDGLQS--VEFAVNSARSVDKSQFLISTSVYELESSVI 245
             +D+ P + KI+L D P F +    Q   + F ++    + ++  +   +  +LE +V+
Sbjct: 185 TEIDWIPSMKKIKLKDFPDFVTTTNPQDPMISFILHVTGRIKRASAIFINTFEKLEHNVL 244

Query: 246 DSLKAKFPFPVYTIGP----STPYFELECSVPNGGTN------DYLRWLDSQAEGSVLYI 305
            SL++  P  +Y++GP         +    +   G N      + L WLD++AE +V+Y+
Sbjct: 245 LSLRSLLP-QIYSVGPFQILENREIDKNSEIRKLGLNLWEEETESLDWLDTKAEKAVIYV 304

Query: 306 SQGSFLSVSSAQMDEIVAGVKASGVRFLWVAR-----GDDGRLK----DVDRETGMVV-G 365
           + GS   ++S Q+ E   G+  SG  FLWV R     GDD  L        +  GM++ G
Sbjct: 305 NFGSLTVLTSEQILEFAWGLARSGKEFLWVVRSGMVDGDDSILPAEFLSETKNRGMLIKG 364

Query: 366 WCDQLRVLCHRAVGGFWTHGGWNSTLEGIFAGVPMLAWPIFWDQFPNSKNIAEDWKVGVR 425
           WC Q +VL H A+GGF TH GWNSTLE ++AGVPM+ WP F DQ  N K   EDW +G+ 
Sbjct: 365 WCSQEKVLSHPAIGGFLTHCGWNSTLESLYAGVPMICWPFFADQLTNRKFCCEDWGIGME 424

Query: 426 FKTVGGKDLVRRLEIAEFVKRFMNSESVEGREMRNRVSEFQEICQLAVAKG-GSSDSNID 453
                G++ V+R  +   VK  M+ E  +G+ +R +V E++ + + A A   GSS  N +
Sbjct: 425 I----GEE-VKRERVETVVKELMDGE--KGKRLREKVVEWRRLAEEASAPPLGSSYVNFE 480

The following BLAST results are available for this feature:

Match Name      E-value   Identity (%)  Description
XP_038902528.1  3.4e-231  87.72         UDP-glycosyltransferase 87A1-like [Benincasa hispida]
XP_008463842.1  2.1e-228  86.34         PREDICTED: UDP-glycosyltransferase 87A1-like [Cucumis melo] >KAA0035258.1 UDP-gl...
XP_004143169.1  9.6e-226  86.18         UDP-glycosyltransferase 87A1 [Cucumis sativus] >KGN47051.1 hypothetical protein ...
XP_022948231.1  1.8e-208  79.39         UDP-glycosyltransferase 87A2-like [Cucurbita moschata] >XP_022948232.1 UDP-glyco...
XP_023532234.1  1.5e-207  79.17         UDP-glycosyltransferase 87A2-like [Cucurbita pepo subsp. pepo]

Match Name      E-value   Identity (%)  Description
O64733          1.6e-130  52.74         UDP-glycosyltransferase 87A2 OS=Arabidopsis thaliana OX=3702 GN=UGT87A2 PE=1 SV=...
O64732          5.1e-129  52.62         UDP-glycosyltransferase 87A1 OS=Arabidopsis thaliana OX=3702 GN=UGT87A1 PE=2 SV=...
Q9SJL0          2.7e-61   30.52         UDP-glycosyltransferase 86A1 OS=Arabidopsis thaliana OX=3702 GN=UGT86A1 PE=2 SV=...
Q9M9E7          5.2e-57   30.45         UDP-glycosyltransferase 85A4 OS=Arabidopsis thaliana OX=3702 GN=UGT85A4 PE=2 SV=...
F8WLS6          6.8e-57   30.36         7-deoxyloganetin glucosyltransferase OS=Catharanthus roseus OX=4058 GN=UGT85A23 ...

Match Name      E-value   Identity (%)  Description
A0A5D3C6P9      1.0e-228  86.34         UDP-glycosyltransferase 87A1-like OS=Cucumis melo var. makuwa OX=1194695 GN=E567...
A0A1S3CLR3      1.0e-228  86.34         UDP-glycosyltransferase 87A1-like OS=Cucumis melo OX=3656 GN=LOC103501885 PE=4 S...
A0A0A0KH18      4.7e-226  86.18         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G181560 PE=4 SV=1
A0A6J1G8M4      8.8e-209  79.39         UDP-glycosyltransferase 87A2-like OS=Cucurbita moschata OX=3662 GN=LOC111451869 ...
A0A6J1L0A4      9.2e-206  78.51         UDP-glycosyltransferase 87A2 OS=Cucurbita maxima OX=3661 GN=LOC111499872 PE=4 SV...

Match Name      E-value   Identity (%)  Description
AT2G30140.2     1.0e-132  52.95         UDP-Glycosyltransferase superfamily protein
AT2G30140.1     1.1e-131  52.74         UDP-Glycosyltransferase superfamily protein
AT2G30150.1     3.6e-130  52.62         UDP-Glycosyltransferase superfamily protein
AT2G36970.1     1.9e-62   30.52         UDP-Glycosyltransferase superfamily protein
AT1G78270.1     3.7e-58   30.45         UDP-glucosyl transferase 85A4

InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term   IPR Description                           Source       Source Term     Source Description                              Alignment
None       No IPR available                          GENE3D       3.40.50.2000    Glycogen Phosphorylase B;                       coord: 14..439, e-value: 2.5E-114, score: 384.7
None       No IPR available                          GENE3D       3.40.50.2000    Glycogen Phosphorylase B;                       coord: 257..433, e-value: 2.5E-114, score: 384.7
None       No IPR available                          PANTHER      PTHR48047:SF80  UDP-GLYCOSYLTRANSFERASE 87A1-LIKE               coord: 11..454
None       No IPR available                          PANTHER      PTHR48047       GLYCOSYLTRANSFERASE                             coord: 11..454
None       No IPR available                          SUPERFAMILY  53756           UDP-Glycosyltransferase/glycogen phosphorylase  coord: 12..452
IPR002213  UDP-glucuronosyl/UDP-glucosyltransferase  PFAM         PF00201         UDPGT                                           coord: 249..401, e-value: 3.4E-20, score: 72.3
IPR002213  UDP-glucuronosyl/UDP-glucosyltransferase  CDD          cd03784         GT1_Gtf-like                                    coord: 14..451, e-value: 3.1415E-66, score: 215.878
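
Each Alignment entry gives the stretch of the protein covered by the domain match. As a small illustration, the Python sketch below extracts the two positions from a `coord:` entry and reports the span length. It assumes the coordinates are 1-based, inclusive residue positions, which this page does not state; the helper name `coord_span` is hypothetical.

```python
import re

# Matches entries such as "coord: 249..401".
COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def coord_span(text):
    """Return (start, end, length) for a 'coord: start..end' entry.

    Assumes 1-based, inclusive positions, so length = end - start + 1.
    """
    m = COORD.search(text)
    if m is None:
        raise ValueError("no coord entry found in %r" % text)
    start, end = int(m.group(1)), int(m.group(2))
    return start, end, end - start + 1


# The PFAM PF00201 (UDPGT) match listed above.
pfam_span = coord_span("coord: 249..401")
print(pfam_span)
```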

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
HG10001558.1  HG10001558.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
molecular_function  GO:0008194      UDP-glycosyltransferase activity