CmaCh01G020540 (gene) Cucurbita maxima (Rimu) v1.1

Overview
Name: CmaCh01G020540
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu) v1.1)
Description: Glycosyltransferase family 92 protein
Location: Cma_Chr01: 13045384 .. 13049037 (+)
RNA-Seq Expression: CmaCh01G020540
Synteny: CmaCh01G020540
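The Location field packs the chromosome, start, end, and strand into one string. A minimal sketch of how it could be parsed is shown here; the function name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the page itself does not state.

```python
import re

def parse_location(text):
    """Split a location string such as 'Cma_Chr01: 13045384 .. 13049037 (+)'.

    Assumption: coordinates are 1-based and inclusive, so the
    span length is end - start + 1.
    """
    match = re.search(r"(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)", text)
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    chrom, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "chrom": chrom,
        "start": start,
        "end": end,
        "strand": strand,
        "length": end - start + 1,
    }

loc = parse_location("Cma_Chr01: 13045384 .. 13049037 (+)")
print(loc["chrom"], loc["strand"], loc["length"])
```

Under the inclusive-coordinate assumption, the gene spans 3654 bp on the plus strand of Cma_Chr01.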
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

GGAAAAAAAAAAAAAAGAAAAAGAAAAAGAAAAAAGAGCAATGAATACACTAAATATGATATGATATGCGCATATGGCCACTAATCTCATCGCACCGACAGACANAGCAAAAGAGACTGGCGGAGGCCAAGGACTGAAGTTCCCCTTTCTGCCTTCATGAGGAAAGACGGTCCGGCTACCTCCATCGGCGACGCCCAAGTCGGCAAACTCTTCACCTGCTTCGAGACCAAGCCTCTGGTCGCCACATGTCTCGCCCTCACTCTTGTCATGTTCCTCTGGAATCTTCCACCTTACTACCAGAATCTCCTCTTCACCACCGCCCGCTCCTGCTCCGCCCCCACCAACACCGCCGACGCCGCCCTCGCCAATCTCACCATTCCTCCCAATACCTCTCTCCCCTTTACTGCATCCGCCGCTGCCAAAAAGTACTCCACCACCCTCCCAACCGATCCCAACAACCGAATTTTCCAGCCATTTGGTAATGCTGCGGCTCTATTCGTTTTAATGGGAGCCTATAGAGGCGGACCACGCACTTTTGCCATCGTCGGCCTTGCTTCCAAACCCATTCACGTCTACGGCCATCCTTGGTACAAATGCGAGTGGATCTTCAACAATGGATCTTCCATTAGGGCCAAGGCCTACAAGATTCTTCCCGATTGGGGCTATGGCCGTGTCTACACCGTCGTCGTCGTCAATTGCACTTTTCCCCTCAATCCCAATCACGACAATTCGGGTGGAAAGCTTACGGTTAATGCCTACTACGGTCAGTCTCAGAAAAAGTACGAAAAGTTCACTGCTCTTGAGGAATTGCCAGGCTCTTACAACGCATCCAAGTTCCGTCCTCCTTATGACTACGAATATTTGTACTGCGGTTCGTCTCTCTACGGGAACCTCAGTGCCGCCAGGATCAGAGAATGGATGGCATACCATGCCTGGTTCTTCGGGTCTAAATCCCATTTTGTGTTTCACGACGCCGGCGGGGTGTCCCCAGAGGTCAGGGCTGTCCTCGAGCCGTGGGTGCGAGCTGGAAGGGTCACAATTCAGGACATCAGAGCACAGTCAGAGTACGATGGGTATTACTATAATCAGTTTCTGGTAGTGAACGACTGCCTCCACCGGTACCGACACGCGGCAAACTGGACATTTTATTTCGATGTGGACGAGTACATATATTTGCCCGAGGGGAGTAGCTTGGAGTCCGTGTTGGAAGAGTTTTCCGCATTTACTCAATTTACAATCGAGCAGAACCCAATGTCCAGTATGCTGTGTTTGAACGATTCCGCTCAAAATTACTCCAGGTACGTGTTCGATCTAATTATGTTTGGAAGATACTTGTTGGATCTGAGGTCAGGTAGGATGTAGATAGGGATGTATGAATTGATATATATATATTTGATCCCCTTGTCAGGAAATGGGGGTTTGAAAAGCTGCTGTTTAAAGACATAAAGTCTGGAATCTGGAGAGACCGGAAGTACGCGATACAAGCCAAGAACGCGTATGCTACGGGGGTACACATGTCGGAAAATGTGATTGGAAATACAACGCACAAAACAGAGTCCAAGATTAGATACTATCATTACCACAACTCCATCATGGTGAGAGGGGAACTGTGCAGGGAATTCCTCCCCAACTCCGCCATTCATAACGTCACAATCTTCAACCAAACCCCTTTTGTGTACGACGACAAGATGAAGAAGCTTGCTGACACCATTAAGGAGTTCGAGCGCCACGCCATCGGTACTGCTAATGCGAAGCTCTTCTCCTGATGTGATTACAGCTCGAGAACATCCATCCGCTTTCATCATCTTCCAAGTTGCAAATTTGGGTGTAGGAATGTGTGAATTGTAAGTAGAAGTCGTGAATGTATGTGGTAGACAGAATTGGGGGCAAATCCGATTGGAGTGGCTGGAGTTTCTGAATGTGGATGTAATGATGGGGGAGGGGAATTATATTTGCCCAAACTTTACTCCTCAGTTACTAACTTACTCCTCCTCCGTTAC
TAATCTACCCTGGTACTGGTGGTGGTAAAAGAACAGTGACTTTGAATCCAGTGATAGAAAGGGATACATAGTTGCTTTAAGATTGTAATGAGTAGATTAAGGGACTTTTTTTTTTCTTTTCATTCTTTCTACTTTGAGCTCTAGCCTTGCAATTTTAAGGGACTTTTTTCTTTTTTCTTTTTTCTTTTCTTTTCGTGTATTATGAATATGATTATGAATGGGTTGAGACAGAGAAAATTGGTGAGGGGTAGTTTGTGCCTTCCTCTGCTTCCGCTTCTTTGGAGCGTGGAAGTGGAAGTACAGGGAAAGAGAACTGTGTCCTTACCGTTGCTACTTACTATGCGTATGTAAATCAACTCCACAGTAACAAAAGTGGATTCTTTGATATTTGCTTTCTGAGGACCTAAAGTAATAAAAGGCAATAATTATATATAATTACCGTTGTGTCCTTTGAATTGTTGTAATTAATTATGGTCTTTTATCTTCTTATGTGACAATTGTTATGGAAATTACTTTATTTAGAAACTTAAAAAACATCAGACAAGAAGAAGGTAGTGTGTTTTTAAATAAATAAATAAGGTCATAAATAAGAATGATGTTGATGGATTATGCATTGATTAGTAAGGGCATTTTTCAATTTGTGTGCTCGGGGAACAAGATTGTGGTCATCTTTTATTTTTATTTTTGTGTGGGTGTGTGTTGGTTTGGCGTGTCCGTTGACTACGTCGGGTTTCTTTTTGATTGGTATTAGCATTAAATGGAAAGGCCGCGTTGAAGGTAGTTTTATGACGTAGACGATGACCAATTTTCTGTTTCCTCATCGCTTACCTCGCACCAGCATCACATTTCAATAAATGATCCATCCCCTCCTGGTTATACCAATATCCTTCTCCTTCTCCTTCTCCTTCTCCTTCTCCTCCTCGGCTCCTTCTCCTTCTCCTTCTCCTTCTCCTTCTCCTTCTCCTCCTCGGCTCCTCCTCCCATTCGCAATTTCTCATTTCTGACTGCCCATAATCCCACAAAATTCCGTGTCTGAAACGTAGACAGACATCAGCAACAGCTAGGCTTGATACTGATGGGTGCCTGTTTATCCGACTGCCTCAATCATCCCAAACCCTCTTCTGTTTCTCCACCTCCTCCCACCGCCAAAGTGATCTCTTTGCAAGGCCATCTCCGCGAATACCCTGTTCCCATCTCCGTCTCCCGCGTTCTCCAGACCGAAAACTCCTCTTCTTCACTTTCCGACTCCTTTCTTTGCAACTCCGACCGCTTATACTACGATGACTTCATCCCCCCTTTGCCACTCGATGAACAGCTTCTCCCTAATCAGATCTATTTCCTCCTTCCTTCCTCCAACCTCCACCACCGATTGAGCGCCTCCCAAATGGCTGCCTTGGCTGTCAAAGCCAGCCTCGCCCTCCAAAATGCCTCCCCCAACGATCGCCGTAAAAAGGGTCGTGTTTCTCCCCTCCTCAACCTCTCGGATTCCGACCACATCATCTCCAAGGAGCCCTCCAAAAAGAACGCCGCCGCGGATACCTCTGCCTCACCCTCCGTTAGAAAATTGCAGAGATTGACGTCCAGAAGAGCAAAAATGGCTGTTCGTTCCTTTAAACTCAAATTGAGCACCATCTATGAAGGCGCCGTTCTGTAG

mRNA sequence

GGAAAAAAAAAAAAAAGAAAAAGAAAAAGAAAAAAGAGCAATGAATACACTAAATATGATATGATATGCGCATATGGCCACTAATCTCATCGCACCGACAGACANAGCAAAAGAGACTGGCGGAGGCCAAGGACTGAAGTTCCCCTTTCTGCCTTCATGAGGAAAGACGGTCCGGCTACCTCCATCGGCGACGCCCAAGTCGGCAAACTCTTCACCTGCTTCGAGACCAAGCCTCTGGTCGCCACATGTCTCGCCCTCACTCTTGTCATGTTCCTCTGGAATCTTCCACCTTACTACCAGAATCTCCTCTTCACCACCGCCCGCTCCTGCTCCGCCCCCACCAACACCGCCGACGCCGCCCTCGCCAATCTCACCATTCCTCCCAATACCTCTCTCCCCTTTACTGCATCCGCCGCTGCCAAAAAGTACTCCACCACCCTCCCAACCGATCCCAACAACCGAATTTTCCAGCCATTTGGTAATGCTGCGGCTCTATTCGTTTTAATGGGAGCCTATAGAGGCGGACCACGCACTTTTGCCATCGTCGGCCTTGCTTCCAAACCCATTCACGTCTACGGCCATCCTTGGTACAAATGCGAGTGGATCTTCAACAATGGATCTTCCATTAGGGCCAAGGCCTACAAGATTCTTCCCGATTGGGGCTATGGCCGTGTCTACACCGTCGTCGTCGTCAATTGCACTTTTCCCCTCAATCCCAATCACGACAATTCGGGTGGAAAGCTTACGGTTAATGCCTACTACGGTCAGTCTCAGAAAAAGTACGAAAAGTTCACTGCTCTTGAGGAATTGCCAGGCTCTTACAACGCATCCAAGTTCCGTCCTCCTTATGACTACGAATATTTGTACTGCGGTTCGTCTCTCTACGGGAACCTCAGTGCCGCCAGGATCAGAGAATGGATGGCATACCATGCCTGGTTCTTCGGGTCTAAATCCCATTTTGTGTTTCACGACGCCGGCGGGGTGTCCCCAGAGGTCAGGGCTGTCCTCGAGCCGTGGGTGCGAGCTGGAAGGGTCACAATTCAGGACATCAGAGCACAGTCAGAGTACGATGGGTATTACTATAATCAGTTTCTGGTAGTGAACGACTGCCTCCACCGGTACCGACACGCGGCAAACTGGACATTTTATTTCGATGTGGACGAGTACATATATTTGCCCGAGGGGAGTAGCTTGGAGTCCGTGTTGGAAGAGTTTTCCGCATTTACTCAATTTACAATCGAGCAGAACCCAATGTCCAGTATGCTGTGTTTGAACGATTCCGCTCAAAATTACTCCAGGAAATGGGGGTTTGAAAAGCTGCTGTTTAAAGACATAAAGTCTGGAATCTGGAGAGACCGGAAGTACGCGATACAAGCCAAGAACGCGTATGCTACGGGGGTACACATGTCGGAAAATGTGATTGGAAATACAACGCACAAAACAGAGTCCAAGATTAGATACTATCATTACCACAACTCCATCATGGTGAGAGGGGAACTGTGCAGGGAATTCCTCCCCAACTCCGCCATTCATAACGTCACAATCTTCAACCAAACCCCTTTTGTGTACGACGACAAGATGAAGAAGCTTGCTGACACCATTAAGGAGTTCGAGCGCCACGCCATCGACAGACATCAGCAACAGCTAGGCTTGATACTGATGGGTGCCTGTTTATCCGACTGCCTCAATCATCCCAAACCCTCTTCTGTTTCTCCACCTCCTCCCACCGCCAAAGTGATCTCTTTGCAAGGCCATCTCCGCGAATACCCTGTTCCCATCTCCGTCTCCCGCGTTCTCCAGACCGAAAACTCCTCTTCTTCACTTTCCGACTCCTTTCTTTGCAACTCCGACCGCTTATACTACGATGACTTCATCCCCCCTTTGCCACTCGATGAACAGCTTCTCCCTAATCAGATCTATTTCCTCCTTCCTTCCTCCAACCTCCACCACCGATTGAGCGCCTCCCAAATGGCTGCCTTGGCTGTCAAAGCCAGCCTCGC
CCTCCAAAATGCCTCCCCCAACGATCGCCGTAAAAAGGGTCGTGTTTCTCCCCTCCTCAACCTCTCGGATTCCGACCACATCATCTCCAAGGAGCCCTCCAAAAAGAACGCCGCCGCGGATACCTCTGCCTCACCCTCCGTTAGAAAATTGCAGAGATTGACGTCCAGAAGAGCAAAAATGGCTGTTCGTTCCTTTAAACTCAAATTGAGCACCATCTATGAAGGCGCCGTTCTGTAG

Coding sequence (CDS)

ATGAGGAAAGACGGTCCGGCTACCTCCATCGGCGACGCCCAAGTCGGCAAACTCTTCACCTGCTTCGAGACCAAGCCTCTGGTCGCCACATGTCTCGCCCTCACTCTTGTCATGTTCCTCTGGAATCTTCCACCTTACTACCAGAATCTCCTCTTCACCACCGCCCGCTCCTGCTCCGCCCCCACCAACACCGCCGACGCCGCCCTCGCCAATCTCACCATTCCTCCCAATACCTCTCTCCCCTTTACTGCATCCGCCGCTGCCAAAAAGTACTCCACCACCCTCCCAACCGATCCCAACAACCGAATTTTCCAGCCATTTGGTAATGCTGCGGCTCTATTCGTTTTAATGGGAGCCTATAGAGGCGGACCACGCACTTTTGCCATCGTCGGCCTTGCTTCCAAACCCATTCACGTCTACGGCCATCCTTGGTACAAATGCGAGTGGATCTTCAACAATGGATCTTCCATTAGGGCCAAGGCCTACAAGATTCTTCCCGATTGGGGCTATGGCCGTGTCTACACCGTCGTCGTCGTCAATTGCACTTTTCCCCTCAATCCCAATCACGACAATTCGGGTGGAAAGCTTACGGTTAATGCCTACTACGGTCAGTCTCAGAAAAAGTACGAAAAGTTCACTGCTCTTGAGGAATTGCCAGGCTCTTACAACGCATCCAAGTTCCGTCCTCCTTATGACTACGAATATTTGTACTGCGGTTCGTCTCTCTACGGGAACCTCAGTGCCGCCAGGATCAGAGAATGGATGGCATACCATGCCTGGTTCTTCGGGTCTAAATCCCATTTTGTGTTTCACGACGCCGGCGGGGTGTCCCCAGAGGTCAGGGCTGTCCTCGAGCCGTGGGTGCGAGCTGGAAGGGTCACAATTCAGGACATCAGAGCACAGTCAGAGTACGATGGGTATTACTATAATCAGTTTCTGGTAGTGAACGACTGCCTCCACCGGTACCGACACGCGGCAAACTGGACATTTTATTTCGATGTGGACGAGTACATATATTTGCCCGAGGGGAGTAGCTTGGAGTCCGTGTTGGAAGAGTTTTCCGCATTTACTCAATTTACAATCGAGCAGAACCCAATGTCCAGTATGCTGTGTTTGAACGATTCCGCTCAAAATTACTCCAGGAAATGGGGGTTTGAAAAGCTGCTGTTTAAAGACATAAAGTCTGGAATCTGGAGAGACCGGAAGTACGCGATACAAGCCAAGAACGCGTATGCTACGGGGGTACACATGTCGGAAAATGTGATTGGAAATACAACGCACAAAACAGAGTCCAAGATTAGATACTATCATTACCACAACTCCATCATGGTGAGAGGGGAACTGTGCAGGGAATTCCTCCCCAACTCCGCCATTCATAACGTCACAATCTTCAACCAAACCCCTTTTGTGTACGACGACAAGATGAAGAAGCTTGCTGACACCATTAAGGAGTTCGAGCGCCACGCCATCGACAGACATCAGCAACAGCTAGGCTTGATACTGATGGGTGCCTGTTTATCCGACTGCCTCAATCATCCCAAACCCTCTTCTGTTTCTCCACCTCCTCCCACCGCCAAAGTGATCTCTTTGCAAGGCCATCTCCGCGAATACCCTGTTCCCATCTCCGTCTCCCGCGTTCTCCAGACCGAAAACTCCTCTTCTTCACTTTCCGACTCCTTTCTTTGCAACTCCGACCGCTTATACTACGATGACTTCATCCCCCCTTTGCCACTCGATGAACAGCTTCTCCCTAATCAGATCTATTTCCTCCTTCCTTCCTCCAACCTCCACCACCGATTGAGCGCCTCCCAAATGGCTGCCTTGGCTGTCAAAGCCAGCCTCGCCCTCCAAAATGCCTCCCCCAACGATCGCCGTAAAAAGGGTCGTGTTTCTCCCCTCCTCAACCTCTCGGATTCCGACCACATCATCTCCAAGGAGCCCTCCAAAAAGAACGCCGCCGCGGATACCTCTGCCTCACCCTCCGTTAGAAAATTGCAGAG
ATTGACGTCCAGAAGAGCAAAAATGGCTGTTCGTTCCTTTAAACTCAAATTGAGCACCATCTATGAAGGCGCCGTTCTGTAG

Protein sequence

MRKDGPATSIGDAQVGKLFTCFETKPLVATCLALTLVMFLWNLPPYYQNLLFTTARSCSAPTNTADAALANLTIPPNTSLPFTASAAAKKYSTTLPTDPNNRIFQPFGNAAALFVLMGAYRGGPRTFAIVGLASKPIHVYGHPWYKCEWIFNNGSSIRAKAYKILPDWGYGRVYTVVVVNCTFPLNPNHDNSGGKLTVNAYYGQSQKKYEKFTALEELPGSYNASKFRPPYDYEYLYCGSSLYGNLSAARIREWMAYHAWFFGSKSHFVFHDAGGVSPEVRAVLEPWVRAGRVTIQDIRAQSEYDGYYYNQFLVVNDCLHRYRHAANWTFYFDVDEYIYLPEGSSLESVLEEFSAFTQFTIEQNPMSSMLCLNDSAQNYSRKWGFEKLLFKDIKSGIWRDRKYAIQAKNAYATGVHMSENVIGNTTHKTESKIRYYHYHNSIMVRGELCREFLPNSAIHNVTIFNQTPFVYDDKMKKLADTIKEFERHAIDRHQQQLGLILMGACLSDCLNHPKPSSVSPPPPTAKVISLQGHLREYPVPISVSRVLQTENSSSSLSDSFLCNSDRLYYDDFIPPLPLDEQLLPNQIYFLLPSSNLHHRLSASQMAALAVKASLALQNASPNDRRKKGRVSPLLNLSDSDHIISKEPSKKNAAADTSASPSVRKLQRLTSRRAKMAVRSFKLKLSTIYEGAVL
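The protein sequence above corresponds to a translation of the coding sequence. The sketch below illustrates that relationship with the standard genetic code. It is an illustration only, not the annotation pipeline used to build this record; the helper name `translate` is hypothetical, and codons containing ambiguous bases are reported as `X` by assumption.

```python
# Standard genetic code (NCBI translation table 1), bases ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper().replace("U", "T")
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        # Codons with ambiguous bases (e.g. N) are not in the table: emit X.
        amino_acid = CODON_TABLE.get(cds[pos:pos + 3], "X")
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

# First eight codons of the CDS listed above.
print(translate("ATGAGGAAAGACGGTCCGGCTACC"))  # MRKDGPAT
```

The same function applied to the last five codons of the CDS (`GGCGCCGTTCTGTAG`) yields `GAVL`, which matches the end of the protein sequence; the trailing `TAG` is the stop codon.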
Homology
BLAST of CmaCh01G020540 vs. ExPASy Swiss-Prot
Match: O22807 (Galactan beta-1,4-galactosyltransferase GALS1 OS=Arabidopsis thaliana OX=3702 GN=GALS1 PE=2 SV=2)

HSP 1 Score: 739.6 bits (1908), Expect = 3.3e-212
Identity = 349/472 (73.94%), Positives = 404/472 (85.59%), Query Frame = 0

Query: 21  CFETKPLVATCLALTLVMFLWNLPPYYQNLLFTTARSCSAPTNTADAAL---ANLTIPPN 80
           CFE KP++AT LAL+LVM +WNLPPYY NL+ +TAR CSA T T    L   +N T   N
Sbjct: 16  CFEKKPIIATLLALSLVMIVWNLPPYYHNLI-STARPCSAVTTTTTTTLLSSSNFTSAEN 75

Query: 81  --TSLPFTASAAAKKYSTTLPTDPNNRIFQPFGNAAALFVLMGAYRGGPRTFAIVGLASK 140
             TSL  T +AA++KY +T P+DPN R+FQPFGNAAALFVLMGAYRGGP TF+++GLASK
Sbjct: 76  FTTSLSTTTAAASQKYDST-PSDPNKRVFQPFGNAAALFVLMGAYRGGPTTFSVIGLASK 135

Query: 141 PIHVYGHPWYKCEWIFNNGSSIRAKAYKILPDWGYGRVYTVVVVNCTFPLNPNHDNSGGK 200
           PIHVYG PWYKCEWI NNG+SIRAKA KILPDWGYGRVYTVVVVNCTF  NPN DN+GGK
Sbjct: 136 PIHVYGKPWYKCEWISNNGTSIRAKAQKILPDWGYGRVYTVVVVNCTFNSNPNSDNTGGK 195

Query: 201 LTVNAYYGQSQKKYEKFTALEELPGSYNASKFRPPYDYEYLYCGSSLYGNLSAARIREWM 260
           L +NAYY +S K +E+FT LEE  G Y+ SK+ PPY Y+YLYCGSSLYGN+SA+R+REWM
Sbjct: 196 LILNAYYNESPKLFERFTTLEESAGIYDESKYSPPYQYDYLYCGSSLYGNVSASRMREWM 255

Query: 261 AYHAWFFGSKSHFVFHDAGGVSPEVRAVLEPWVRAGRVTIQDIRAQSEYDGYYYNQFLVV 320
           AYHAWFFG KSHFVFHDAGGVSPEVR VLEPW+RAGRVT+Q+IR QS+YDGYYYNQFL+V
Sbjct: 256 AYHAWFFGDKSHFVFHDAGGVSPEVRKVLEPWIRAGRVTVQNIRDQSQYDGYYYNQFLIV 315

Query: 321 NDCLHRYRHAANWTFYFDVDEYIYLPEGSSLESVLEEFSAFTQFTIEQNPMSSMLCLNDS 380
           NDCLHRYR+AANWTF+FDVDEYIYLP G++LESVL+EFS  TQFTIEQNPMSS+LC+NDS
Sbjct: 316 NDCLHRYRYAANWTFFFDVDEYIYLPHGNTLESVLDEFSVNTQFTIEQNPMSSVLCINDS 375

Query: 381 AQNYSRKWGFEKLLFKDIKSGIWRDRKYAIQAKNAYATGVHMSENVIGNTTHKTESKIRY 440
           +Q+Y R+WGFEKLLFKD ++ I RDRKYAIQAKNA+ATGVHMSEN++G T HKTE+KIRY
Sbjct: 376 SQDYPRQWGFEKLLFKDSRTKIRRDRKYAIQAKNAFATGVHMSENIVGKTLHKTETKIRY 435

Query: 441 YHYHNSIMVRGELCREFLPNSAIHNVTIFNQTPFVYDDKMKKLADTIKEFER 488
           YHYHN+I V  ELCRE LPNSA   VT++N+ P+VYDD MKKL  TIKEFE+
Sbjct: 436 YHYHNTITVHEELCREMLPNSAKKKVTLYNKLPYVYDDNMKKLVKTIKEFEQ 485
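Each HSP summary line reports identities and positives as a count over the alignment length, followed by the percentage. A minimal sketch for reading such a line and recomputing the percentages is given here. The function name `hsp_percentages` is hypothetical, and the pattern is written only against the line layout used in this record, not against BLAST output in general.

```python
import re

def hsp_percentages(summary_line):
    """Recompute identity and positives percentages from an HSP summary line.

    The pattern accepts both 'Positives' and the misspelling 'Postives'.
    Returns (identity_pct, positives_pct), each rounded to two decimals.
    """
    match = re.search(
        r"Identity = (\d+)/(\d+).*?Posi?tives = (\d+)/(\d+)", summary_line
    )
    if match is None:
        raise ValueError(f"unrecognized HSP summary line: {summary_line!r}")
    identical, length_a, positive, length_b = map(int, match.groups())
    return (
        round(100 * identical / length_a, 2),
        round(100 * positive / length_b, 2),
    )

line = "Identity = 349/472 (73.94%), Positives = 404/472 (85.59%), Query Frame = 0"
print(hsp_percentages(line))  # (73.94, 85.59)
```

For the GALS1 match, 349 identical positions over 472 aligned columns gives 73.94%, and 404 positive positions gives 85.59%, in agreement with the reported values.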

BLAST of CmaCh01G020540 vs. ExPASy Swiss-Prot
Match: Q9LTZ9 (Galactan beta-1,4-galactosyltransferase GALS2 OS=Arabidopsis thaliana OX=3702 GN=GALS2 PE=2 SV=1)

HSP 1 Score: 424.1 bits (1089), Expect = 3.1e-117
Identity = 212/399 (53.13%), Positives = 262/399 (65.66%), Query Frame = 0

Query: 102 RIFQPFGNAAALFVLMGAYRGGPRTFAIVGLASKPIHVYGHPWYKCEWIFNNGSS--IRA 161
           R F  +G AA  FVLM AYRGG  TFA++GL+SKP+HVY HP Y+CEWI  N S   I  
Sbjct: 116 RTFTGYGWAAYNFVLMNAYRGGVNTFAVIGLSSKPLHVYSHPTYRCEWIPLNQSDNRILT 175

Query: 162 KAYKILPDWGYGRVYTVVVVNCTFPLNP--NHDNSGGKLTVNAYYGQSQKKY-EKFTALE 221
              KIL DWGYGRVYT VVVNCTFP N   N  N+GG L ++A  G + +   +    L 
Sbjct: 176 DGTKILTDWGYGRVYTTVVVNCTFPSNTVINPKNTGGTLLLHATTGDTDRNITDSIPVLT 235

Query: 222 ELPGSYN----ASKFRPPYDYEYLYCGSSLYGNLSAARIREWMAYHAWFFGSKSHFVFHD 281
           E P + +     S  R    Y+YLYCGSSLYGNLS  RIREW+AYH  FFG +SHFV HD
Sbjct: 236 ETPNTVDFALYESNLRRREKYDYLYCGSSLYGNLSPQRIREWIAYHVRFFGERSHFVLHD 295

Query: 282 AGGVSPEVRAVLEPWVRAGRVTIQDIRAQSEYDGYYYNQFLVVNDCLHRYRHAANWTFYF 341
           AGG++ EV  VL+PW+  GRVT+ DIR Q  +DGYY+NQF+VVNDCLHRYR  A W F+F
Sbjct: 296 AGGITEEVFEVLKPWIELGRVTVHDIREQERFDGYYHNQFMVVNDCLHRYRFMAKWMFFF 355

Query: 342 DVDEYIYLPEGSSLESVLEEFSAFTQFTIEQNPMSSMLCLN-DSAQNYSRKWGFEKLLFK 401
           DVDE+IY+P  SS+ SV+     ++QFTIEQ PMSS LC + D      RKWGFEKL ++
Sbjct: 356 DVDEFIYVPAKSSISSVMVSLEEYSQFTIEQMPMSSQLCYDGDGPARTYRKWGFEKLAYR 415

Query: 402 DIKSGIWRDRKYAIQAKNAYATGVHMSENVIGNTTHKTESKIRYYHYHNSIMVRGELCRE 461
           D+K    RDRKYA+Q +N +ATGVHMS+++ G T H+ E KIRY+HYH SI  R E CR 
Sbjct: 416 DVKKVPRRDRKYAVQPRNVFATGVHMSQHLQGKTYHRAEGKIRYFHYHGSISQRREPCRH 475

Query: 462 FLPNSAIHNVTIFNQTPFVYDDKMKKLADTIKEFERHAI 491
               + I    +    P+V D  M+ +   +K FE   I
Sbjct: 476 LYNGTRI----VHENNPYVLDTTMRDIGLAVKTFEIRTI 510

BLAST of CmaCh01G020540 vs. ExPASy Swiss-Prot
Match: O65431 (Galactan beta-1,4-galactosyltransferase GALS3 OS=Arabidopsis thaliana OX=3702 GN=GALS3 PE=2 SV=1)

HSP 1 Score: 414.1 bits (1063), Expect = 3.2e-114
Identity = 220/477 (46.12%), Positives = 294/477 (61.64%), Query Frame = 0

Query: 27  LVATCLALTLVMFLWNLPPYYQNLLFTTARSCSAPTNTADAALANLTIPPNTSLPFTASA 86
           L+  C   TL+ F+    P   +L  +  R C +  ++A       T+  ++S P    +
Sbjct: 35  LLVLCTLATLLPFI----PSSFSLSTSDFRFCISRFSSAVPLNTTTTVEESSSSP----S 94

Query: 87  AAKKYSTTLPTDPNNRIFQPFGNAAALFVLMGAYRGGPRTFAIVGLASKPIHVYGHPWYK 146
             K     L      R F  +G+AA  FV M AYRGG  +FA++GL+SKP+HVYGHP Y+
Sbjct: 95  PEKNLDRVLDNGVIKRTFTGYGSAAYNFVSMSAYRGGVNSFAVIGLSSKPLHVYGHPSYR 154

Query: 147 CEWIFNNGSS--IRAKAYKILPDWGYGRVYTVVVVNCTF----PLNPNHDNSGGKLTVNA 206
           CEW+  + +   I    +KIL DWGYGR+YT VVVNCTF     +NP   NSGG L ++A
Sbjct: 155 CEWVSLDPTQDPISTTGFKILTDWGYGRIYTTVVVNCTFSSISAVNP--QNSGGTLILHA 214

Query: 207 YYGQ-SQKKYEKFTALEELPGS-----YNASKFRPPYDYEYLYCGSSLYGNLSAARIREW 266
             G  +    +  + L E P S     YN++K    YD  YLYCGSSLYGNLS  R+REW
Sbjct: 215 TTGDPTLNLTDSISVLTEPPKSVDFDLYNSTKKTKKYD--YLYCGSSLYGNLSPQRVREW 274

Query: 267 MAYHAWFFGSKSHFVFHDAGGVSPEVRAVLEPWVRAGRVTIQDIRAQSEYDGYYYNQFLV 326
           +AYH  FFG +SHFV HDAGG+  EV  VL+PW+  GRVT+ DIR Q  +DGYY+NQF++
Sbjct: 275 IAYHVRFFGERSHFVLHDAGGIHEEVFEVLKPWIELGRVTLHDIRDQERFDGYYHNQFMI 334

Query: 327 VNDCLHRYRHAANWTFYFDVDEYIYLPEGSSLESVLEEFSAFTQFTIEQNPMSSMLCLN- 386
           VNDCLHRYR    W F+FDVDE++++P   ++ SV+E    ++QFTIEQ PMSS +C + 
Sbjct: 335 VNDCLHRYRFMTKWMFFFDVDEFLHVPVKETISSVMESLEEYSQFTIEQMPMSSRICYSG 394

Query: 387 DSAQNYSRKWGFEKLLFKDIKSGIWRDRKYAIQAKNAYATGVHMSENVIGNTTHKTESKI 446
           D      RKWG EKL ++D+K    RDRKYA+Q +N +ATGVHMS+N+ G T HK ESKI
Sbjct: 395 DGPARTYRKWGIEKLAYRDVKKVPRRDRKYAVQPENVFATGVHMSQNLQGKTYHKAESKI 454

Query: 447 RYYHYHNSIMVRGELCREFLPNSAIHNVTIFNQTPFVYDDKMKKLADTIKEFERHAI 491
           RY+HYH SI  R E CR+   +S +    +F  TP+V D  +  +   ++ FE   I
Sbjct: 455 RYFHYHGSISQRREPCRQLFNDSRV----VFENTPYVLDTTICDVGLAVRTFELRTI 495

BLAST of CmaCh01G020540 vs. ExPASy TrEMBL
Match: A0A6J1J3E1 (Glycosyltransferase family 92 protein OS=Cucurbita maxima OX=3661 GN=LOC111480985 PE=3 SV=1)

HSP 1 Score: 1023.1 bits (2644), Expect = 5.5e-295
Identity = 490/490 (100.00%), Positives = 490/490 (100.00%), Query Frame = 0

Query: 1   MRKDGPATSIGDAQVGKLFTCFETKPLVATCLALTLVMFLWNLPPYYQNLLFTTARSCSA 60
           MRKDGPATSIGDAQVGKLFTCFETKPLVATCLALTLVMFLWNLPPYYQNLLFTTARSCSA
Sbjct: 1   MRKDGPATSIGDAQVGKLFTCFETKPLVATCLALTLVMFLWNLPPYYQNLLFTTARSCSA 60

Query: 61  PTNTADAALANLTIPPNTSLPFTASAAAKKYSTTLPTDPNNRIFQPFGNAAALFVLMGAY 120
           PTNTADAALANLTIPPNTSLPFTASAAAKKYSTTLPTDPNNRIFQPFGNAAALFVLMGAY
Sbjct: 61  PTNTADAALANLTIPPNTSLPFTASAAAKKYSTTLPTDPNNRIFQPFGNAAALFVLMGAY 120

Query: 121 RGGPRTFAIVGLASKPIHVYGHPWYKCEWIFNNGSSIRAKAYKILPDWGYGRVYTVVVVN 180
           RGGPRTFAIVGLASKPIHVYGHPWYKCEWIFNNGSSIRAKAYKILPDWGYGRVYTVVVVN
Sbjct: 121 RGGPRTFAIVGLASKPIHVYGHPWYKCEWIFNNGSSIRAKAYKILPDWGYGRVYTVVVVN 180

Query: 181 CTFPLNPNHDNSGGKLTVNAYYGQSQKKYEKFTALEELPGSYNASKFRPPYDYEYLYCGS 240
           CTFPLNPNHDNSGGKLTVNAYYGQSQKKYEKFTALEELPGSYNASKFRPPYDYEYLYCGS
Sbjct: 181 CTFPLNPNHDNSGGKLTVNAYYGQSQKKYEKFTALEELPGSYNASKFRPPYDYEYLYCGS 240

Query: 241 SLYGNLSAARIREWMAYHAWFFGSKSHFVFHDAGGVSPEVRAVLEPWVRAGRVTIQDIRA 300
           SLYGNLSAARIREWMAYHAWFFGSKSHFVFHDAGGVSPEVRAVLEPWVRAGRVTIQDIRA
Sbjct: 241 SLYGNLSAARIREWMAYHAWFFGSKSHFVFHDAGGVSPEVRAVLEPWVRAGRVTIQDIRA 300

Query: 301 QSEYDGYYYNQFLVVNDCLHRYRHAANWTFYFDVDEYIYLPEGSSLESVLEEFSAFTQFT 360
           QSEYDGYYYNQFLVVNDCLHRYRHAANWTFYFDVDEYIYLPEGSSLESVLEEFSAFTQFT
Sbjct: 301 QSEYDGYYYNQFLVVNDCLHRYRHAANWTFYFDVDEYIYLPEGSSLESVLEEFSAFTQFT 360

Query: 361 IEQNPMSSMLCLNDSAQNYSRKWGFEKLLFKDIKSGIWRDRKYAIQAKNAYATGVHMSEN 420
           IEQNPMSSMLCLNDSAQNYSRKWGFEKLLFKDIKSGIWRDRKYAIQAKNAYATGVHMSEN
Sbjct: 361 IEQNPMSSMLCLNDSAQNYSRKWGFEKLLFKDIKSGIWRDRKYAIQAKNAYATGVHMSEN 420

Query: 421 VIGNTTHKTESKIRYYHYHNSIMVRGELCREFLPNSAIHNVTIFNQTPFVYDDKMKKLAD 480
           VIGNTTHKTESKIRYYHYHNSIMVRGELCREFLPNSAIHNVTIFNQTPFVYDDKMKKLAD
Sbjct: 421 VIGNTTHKTESKIRYYHYHNSIMVRGELCREFLPNSAIHNVTIFNQTPFVYDDKMKKLAD 480

Query: 481 TIKEFERHAI 491
           TIKEFERHAI
Sbjct: 481 TIKEFERHAI 490

BLAST of CmaCh01G020540 vs. ExPASy TrEMBL
Match: A0A6J1FQW1 (Glycosyltransferase family 92 protein OS=Cucurbita moschata OX=3662 GN=LOC111446100 PE=3 SV=1)

HSP 1 Score: 1012.3 bits (2616), Expect = 9.7e-292
Identity = 481/490 (98.16%), Positives = 488/490 (99.59%), Query Frame = 0

Query: 1   MRKDGPATSIGDAQVGKLFTCFETKPLVATCLALTLVMFLWNLPPYYQNLLFTTARSCSA 60
           MRKDGPATSIGDAQVGKLFTCFETKPLVATCLALTLVMFLWNLPPYYQNLLFTTARSCSA
Sbjct: 1   MRKDGPATSIGDAQVGKLFTCFETKPLVATCLALTLVMFLWNLPPYYQNLLFTTARSCSA 60

Query: 61  PTNTADAALANLTIPPNTSLPFTASAAAKKYSTTLPTDPNNRIFQPFGNAAALFVLMGAY 120
           PTNT DAALANLTIPPNTSLPFTASAAAKKYSTTLPTDPNNRIFQPFGNAAALFVLMGAY
Sbjct: 61  PTNTTDAALANLTIPPNTSLPFTASAAAKKYSTTLPTDPNNRIFQPFGNAAALFVLMGAY 120

Query: 121 RGGPRTFAIVGLASKPIHVYGHPWYKCEWIFNNGSSIRAKAYKILPDWGYGRVYTVVVVN 180
           RGGPRTFA+VGLASKPIHVYGHPWYKCEW+FNNGSSIRAKAYKILPDWGYGRVYTVVVVN
Sbjct: 121 RGGPRTFAVVGLASKPIHVYGHPWYKCEWLFNNGSSIRAKAYKILPDWGYGRVYTVVVVN 180

Query: 181 CTFPLNPNHDNSGGKLTVNAYYGQSQKKYEKFTALEELPGSYNASKFRPPYDYEYLYCGS 240
           CTFPLNPNHDNSGGKLTVNAYYGQSQKKYEKFTALEELPGSYNASKFRPPYDYEYLYCGS
Sbjct: 181 CTFPLNPNHDNSGGKLTVNAYYGQSQKKYEKFTALEELPGSYNASKFRPPYDYEYLYCGS 240

Query: 241 SLYGNLSAARIREWMAYHAWFFGSKSHFVFHDAGGVSPEVRAVLEPWVRAGRVTIQDIRA 300
           SLYGNLSAARIREWMAYHAWFFGSKSHFVFHDAGGVSPEVRAVLEPWVRAGRVTIQDIRA
Sbjct: 241 SLYGNLSAARIREWMAYHAWFFGSKSHFVFHDAGGVSPEVRAVLEPWVRAGRVTIQDIRA 300

Query: 301 QSEYDGYYYNQFLVVNDCLHRYRHAANWTFYFDVDEYIYLPEGSSLESVLEEFSAFTQFT 360
           QSEYDGYYYNQFLVVNDCLHRYRHAANWTFYFDVDEY+YLPEGSSLESVLEEFS +TQFT
Sbjct: 301 QSEYDGYYYNQFLVVNDCLHRYRHAANWTFYFDVDEYMYLPEGSSLESVLEEFSVYTQFT 360

Query: 361 IEQNPMSSMLCLNDSAQNYSRKWGFEKLLFKDIKSGIWRDRKYAIQAKNAYATGVHMSEN 420
           IEQNPMSSMLCLNDSAQ+YSRKWGFEKLLFKDIKSGIWRDRKYAIQAKNAYATGVHMSEN
Sbjct: 361 IEQNPMSSMLCLNDSAQSYSRKWGFEKLLFKDIKSGIWRDRKYAIQAKNAYATGVHMSEN 420

Query: 421 VIGNTTHKTESKIRYYHYHNSIMVRGELCREFLPNSAIHNVTIFNQTPFVYDDKMKKLAD 480
           VIGNTTHKTES+IRYYHYHNSIMVRGELCREFLPN+AIHNVTIFNQTPFVYDDKMKKLAD
Sbjct: 421 VIGNTTHKTESRIRYYHYHNSIMVRGELCREFLPNTAIHNVTIFNQTPFVYDDKMKKLAD 480

Query: 481 TIKEFERHAI 491
           TIKEFERHAI
Sbjct: 481 TIKEFERHAI 490

BLAST of CmaCh01G020540 vs. ExPASy TrEMBL
Match: A0A6J1D6M5 (Glycosyltransferase family 92 protein OS=Momordica charantia OX=3673 GN=LOC111017506 PE=3 SV=1)

HSP 1 Score: 896.0 bits (2314), Expect = 1.0e-256
Identity = 426/495 (86.06%), Positives = 452/495 (91.31%), Query Frame = 0

Query: 1   MRKDGPATSIGDAQVGKLFTCFETKPLVATCLALTLVMFLWNLPPYYQNLLFTTARSCSA 60
           MRKDGP  S+  A V KLFTCFE +PLVATCLALTLVM LWNLPPYYQNLLFT +RSCSA
Sbjct: 1   MRKDGPPASLAGANVSKLFTCFEARPLVATCLALTLVMLLWNLPPYYQNLLFTPSRSCSA 60

Query: 61  PTNTADAALA---NLTIPPNTSLPFTASAAAKKYS--TTLPTDPNNRIFQPFGNAAALFV 120
           PTNT   +++   NLTI PN SLPFTASAAAKKYS  T +  DPN R+FQPFGNAAALFV
Sbjct: 61  PTNTTTTSISTHTNLTIAPNGSLPFTASAAAKKYSTPTAIRPDPNKRLFQPFGNAAALFV 120

Query: 121 LMGAYRGGPRTFAIVGLASKPIHVYGHPWYKCEWIFNNGSSIRAKAYKILPDWGYGRVYT 180
           LMGAYRGGPRTFA+VGLASKPIHVYGHPWYKCEWIFNNGSSIRAKAYKILPDWGYGRVYT
Sbjct: 121 LMGAYRGGPRTFAVVGLASKPIHVYGHPWYKCEWIFNNGSSIRAKAYKILPDWGYGRVYT 180

Query: 181 VVVVNCTFPLNPNHDNSGGKLTVNAYYGQSQKKYEKFTALEELPGSYNASKFRPPYDYEY 240
           VVV+NCTFPLNPN DNSGGKLT+NAYYGQS +KYEKFT LEEL GSYN SK+ PP+DYEY
Sbjct: 181 VVVINCTFPLNPNQDNSGGKLTINAYYGQSPRKYEKFTTLEELAGSYNHSKYXPPFDYEY 240

Query: 241 LYCGSSLYGNLSAARIREWMAYHAWFFGSKSHFVFHDAGGVSPEVRAVLEPWVRAGRVTI 300
           LYCGSSLYGNLSAARIREWMAYHAWFFG KSHFVFHDAGGVSPEVRAVLEPWVR GRVT+
Sbjct: 241 LYCGSSLYGNLSAARIREWMAYHAWFFGPKSHFVFHDAGGVSPEVRAVLEPWVRDGRVTV 300

Query: 301 QDIRAQSEYDGYYYNQFLVVNDCLHRYRHAANWTFYFDVDEYIYLPEGSSLESVLEEFSA 360
           QDI  Q+EYDGYYYNQFL+VNDCLHRYRHAANWTFYFDVDEYIYLPEG++LESVLEE S 
Sbjct: 301 QDIIGQTEYDGYYYNQFLIVNDCLHRYRHAANWTFYFDVDEYIYLPEGNTLESVLEELSE 360

Query: 361 FTQFTIEQNPMSSMLCLNDSAQNYSRKWGFEKLLFKDIKSGIWRDRKYAIQAKNAYATGV 420
           +TQFTIEQNPMSS+LCLNDS+QNYSRKWGFEKLLFKD K+GIWRDRKYAIQAKNAYATGV
Sbjct: 361 YTQFTIEQNPMSSVLCLNDSSQNYSRKWGFEKLLFKDSKTGIWRDRKYAIQAKNAYATGV 420

Query: 421 HMSENVIGNTTHKTESKIRYYHYHNSIMVRGELCREFLPNSAIHNVTIFNQTPFVYDDKM 480
           HMSENVIG TTHKTESKIRYYHYHNSIMVRGELCREFL  SAI+NVTIFN+ PFVYDDKM
Sbjct: 421 HMSENVIGKTTHKTESKIRYYHYHNSIMVRGELCREFLSISAINNVTIFNKVPFVYDDKM 480

Query: 481 KKLADTIKEFERHAI 491
           KKL DTIKEFE + I
Sbjct: 481 KKLGDTIKEFELNVI 495

BLAST of CmaCh01G020540 vs. ExPASy TrEMBL
Match: A0A067KKI4 (Glycosyltransferase family 92 protein OS=Jatropha curcas OX=180498 GN=JCGZ_08012 PE=3 SV=1)

HSP 1 Score: 793.5 bits (2048), Expect = 7.1e-226
Identity = 381/492 (77.44%), Positives = 423/492 (85.98%), Query Frame = 0

Query: 1   MRKD-GPATSIGDAQVGKLFTCFETKPLVATCLALTLVMFLWNLPPYYQNLLFTTARSCS 60
           MRKD  P +S      GKL  CFETKPLVAT LALTLVM LWNLPPYYQNLL TT RSCS
Sbjct: 1   MRKDCPPLSSFAGGTAGKLSLCFETKPLVATVLALTLVMLLWNLPPYYQNLLSTT-RSCS 60

Query: 61  AP-TNTADAALANLTIPPNTSLPFTASAAAKKYSTTLPTDPNNRIFQPFGNAAALFVLMG 120
           AP  +TA    +N +  P TS   T S + +KYST + TDPN RIFQ +GNAAALFV MG
Sbjct: 61  APAASTASLIASNASSLPITSYASTTSVSEQKYSTPVVTDPNKRIFQAYGNAAALFVQMG 120

Query: 121 AYRGGPRTFAIVGLASKPIHVYGHPWYKCEWIFNNGSSIRAKAYKILPDWGYGRVYTVVV 180
           AYRGGPRTFA+VGLASKPIHV+G PWYKCEWI NNGSS+RAKAYK+LPDWGYGRVYTVVV
Sbjct: 121 AYRGGPRTFAVVGLASKPIHVFGRPWYKCEWISNNGSSLRAKAYKMLPDWGYGRVYTVVV 180

Query: 181 VNCTFPLNPNHDNSGGKLTVNAYYGQSQKKYEKFTALEELPGSYNASKFRPPYDYEYLYC 240
           VNCTF +NPN DN+GGKL +NAYYG+SQ+KYEKF ALEE PGSYN SK+ PPY YEYLYC
Sbjct: 181 VNCTFSVNPNEDNAGGKLMLNAYYGESQRKYEKFVALEEAPGSYNESKYHPPYQYEYLYC 240

Query: 241 GSSLYGNLSAARIREWMAYHAWFFGSKSHFVFHDAGGVSPEVRAVLEPWVRAGRVTIQDI 300
           GSSLYGNLSAAR+REWMAYHAWFFGS SHFVFHDAGGVSPEVRA LEPWVRAGR T+QDI
Sbjct: 241 GSSLYGNLSAARMREWMAYHAWFFGSSSHFVFHDAGGVSPEVRAALEPWVRAGRATVQDI 300

Query: 301 RAQSEYDGYYYNQFLVVNDCLHRYRHAANWTFYFDVDEYIYLPEGSSLESVLEEFSAFTQ 360
           R Q+E+DGYYYNQFLVVNDCLHRYR+AANWTFYFDVDEYIYLP G++LESVL+EFS +TQ
Sbjct: 301 RGQAEFDGYYYNQFLVVNDCLHRYRYAANWTFYFDVDEYIYLPLGNTLESVLKEFSDYTQ 360

Query: 361 FTIEQNPMSSMLCLNDSAQNYSRKWGFEKLLFKDIKSGIWRDRKYAIQAKNAYATGVHMS 420
           FTIEQNPMSS+LCLNDS ++YSR+WGFEKLLF++ ++GI RDRKYAIQAK A+ATGVHMS
Sbjct: 361 FTIEQNPMSSVLCLNDSTRDYSREWGFEKLLFRESRTGIRRDRKYAIQAKKAFATGVHMS 420

Query: 421 ENVIGNTTHKTESKIRYYHYHNSIMVRGELCREFLPNSAIHNVTIFNQTPFVYDDKMKKL 480
           ENV+G T HKTE KIRYYHYHNSI V GELCREFLP SA  NVT++++ P+VYDD MKKL
Sbjct: 421 ENVVGKTLHKTEDKIRYYHYHNSITVPGELCREFLPPSAKKNVTLYDKLPYVYDDNMKKL 480

Query: 481 ADTIKEFERHAI 491
           A TIKEFER  I
Sbjct: 481 AATIKEFERKTI 491

BLAST of CmaCh01G020540 vs. ExPASy TrEMBL
Match: A0A6A1UUH7 (Glycosyltransferase family 92 protein OS=Morella rubra OX=262757 GN=CJ030_MR8G001580 PE=3 SV=1)

HSP 1 Score: 790.0 bits (2039), Expect = 7.8e-225
Identity = 382/500 (76.40%), Positives = 426/500 (85.20%), Query Frame = 0

Query: 1   MRKDGPATSIGDAQVGKLFTCFETKPLVATCLALTLVMFLWNLPPYYQNLLFTTARSCSA 60
           MRKDGP  +     V K+ +CFETK ++ T LAL LVM +WN+PPYYQNLL TT+ SCS+
Sbjct: 1   MRKDGPPPT-----VAKMVSCFETKTILITFLALALVMLMWNIPPYYQNLLSTTSSSCSS 60

Query: 61  PTNTADA---ALANLTIPPNTSLPFTASAAAKKYST--TLP-TDPNNRIFQPFGNAAALF 120
             +TA A     AN ++P +T+ P     A +KYST  T+P  DPN R+FQPFGNAAALF
Sbjct: 61  DASTATALSITAANASLPNSTASP----VAEQKYSTPKTIPNADPNKRVFQPFGNAAALF 120

Query: 121 VLMGAYRGGPRTFAIVGLASKPIHVYGHPWYKCEWIFNNGSSIRAKAYKILPDWGYGRVY 180
           VLMGAYRGGPRTFA+VGLASKPIHVYG PWYKCEWI NNGSSIR+ AYK+LPDWGYGRVY
Sbjct: 121 VLMGAYRGGPRTFAVVGLASKPIHVYGRPWYKCEWISNNGSSIRSIAYKMLPDWGYGRVY 180

Query: 181 TVVVVNCTFPLNPNHDNSGGKLTVNAYYGQSQKKYEKFTALEELPGSYNASKFRPPYDYE 240
           TVVVVNCTFP+NPN DN GGKLT+NAYYG S +KYEKFTALEE PGSYN SK+ PPYDYE
Sbjct: 181 TVVVVNCTFPVNPNQDNFGGKLTINAYYGPSPRKYEKFTALEEAPGSYNESKYHPPYDYE 240

Query: 241 YLYCGSSLYGNLSAARIREWMAYHAWFFGSKSHFVFHDAGGVSPEVRAVLEPWVRAGRVT 300
           YLYCGSSLYGNLSAARIREWMAYHAWFFG KS+FVFHDA G+SPEVRA LEPWVRAGR T
Sbjct: 241 YLYCGSSLYGNLSAARIREWMAYHAWFFGPKSYFVFHDADGLSPEVRAALEPWVRAGRAT 300

Query: 301 IQDIRAQSEYDGYYYNQFLVVNDCLHRYRHAANWTFYFDVDEYIYLPEGSSLESVLEEFS 360
           IQDIR Q+E+DGYYYNQFLVVNDCLHRYRH+ANWTFYFDVDEYIYLPEGS+LESVL EFS
Sbjct: 301 IQDIRGQAEFDGYYYNQFLVVNDCLHRYRHSANWTFYFDVDEYIYLPEGSTLESVLNEFS 360

Query: 361 AFTQFTIEQNPMSSMLCLNDSAQNYSRKWGFEKLLFKDIKSGIWRDRKYAIQAKNAYATG 420
            +TQFTIEQNPMSS+LCLNDS Q+YSR+WGFEKLLF+D ++GI RDRKYAIQAKNAYATG
Sbjct: 361 DYTQFTIEQNPMSSVLCLNDSTQDYSRQWGFEKLLFRDSRTGIRRDRKYAIQAKNAYATG 420

Query: 421 VHMSENVIGNTTHKTESKIRYYHYHNSIMVRGELCREFLPNSAIHNVTIFNQTPFVYDDK 480
           VHMSENVIG T HKTE+KIRYYHYHNSI V GE CR+FLP SA  NVT+F + P+VYDD 
Sbjct: 421 VHMSENVIGKTLHKTETKIRYYHYHNSISVMGEPCRKFLPPSAKTNVTLFEKLPYVYDDN 480

Query: 481 MKKLADTIKEFERHAIDRHQ 495
           MK+LA TIKEFER +I   Q
Sbjct: 481 MKQLAKTIKEFERKSIGNLQ 491

BLAST of CmaCh01G020540 vs. NCBI nr
Match: XP_022982004.1 (galactan beta-1,4-galactosyltransferase GALS1 [Cucurbita maxima])

HSP 1 Score: 1023.1 bits (2644), Expect = 1.1e-294
Identity = 490/490 (100.00%), Positives = 490/490 (100.00%), Query Frame = 0

Query: 1   MRKDGPATSIGDAQVGKLFTCFETKPLVATCLALTLVMFLWNLPPYYQNLLFTTARSCSA 60
           MRKDGPATSIGDAQVGKLFTCFETKPLVATCLALTLVMFLWNLPPYYQNLLFTTARSCSA
Sbjct: 1   MRKDGPATSIGDAQVGKLFTCFETKPLVATCLALTLVMFLWNLPPYYQNLLFTTARSCSA 60

Query: 61  PTNTADAALANLTIPPNTSLPFTASAAAKKYSTTLPTDPNNRIFQPFGNAAALFVLMGAY 120
           PTNTADAALANLTIPPNTSLPFTASAAAKKYSTTLPTDPNNRIFQPFGNAAALFVLMGAY
Sbjct: 61  PTNTADAALANLTIPPNTSLPFTASAAAKKYSTTLPTDPNNRIFQPFGNAAALFVLMGAY 120

Query: 121 RGGPRTFAIVGLASKPIHVYGHPWYKCEWIFNNGSSIRAKAYKILPDWGYGRVYTVVVVN 180
           RGGPRTFAIVGLASKPIHVYGHPWYKCEWIFNNGSSIRAKAYKILPDWGYGRVYTVVVVN
Sbjct: 121 RGGPRTFAIVGLASKPIHVYGHPWYKCEWIFNNGSSIRAKAYKILPDWGYGRVYTVVVVN 180

Query: 181 CTFPLNPNHDNSGGKLTVNAYYGQSQKKYEKFTALEELPGSYNASKFRPPYDYEYLYCGS 240
           CTFPLNPNHDNSGGKLTVNAYYGQSQKKYEKFTALEELPGSYNASKFRPPYDYEYLYCGS
Sbjct: 181 CTFPLNPNHDNSGGKLTVNAYYGQSQKKYEKFTALEELPGSYNASKFRPPYDYEYLYCGS 240

Query: 241 SLYGNLSAARIREWMAYHAWFFGSKSHFVFHDAGGVSPEVRAVLEPWVRAGRVTIQDIRA 300
           SLYGNLSAARIREWMAYHAWFFGSKSHFVFHDAGGVSPEVRAVLEPWVRAGRVTIQDIRA
Sbjct: 241 SLYGNLSAARIREWMAYHAWFFGSKSHFVFHDAGGVSPEVRAVLEPWVRAGRVTIQDIRA 300

Query: 301 QSEYDGYYYNQFLVVNDCLHRYRHAANWTFYFDVDEYIYLPEGSSLESVLEEFSAFTQFT 360
           QSEYDGYYYNQFLVVNDCLHRYRHAANWTFYFDVDEYIYLPEGSSLESVLEEFSAFTQFT
Sbjct: 301 QSEYDGYYYNQFLVVNDCLHRYRHAANWTFYFDVDEYIYLPEGSSLESVLEEFSAFTQFT 360

Query: 361 IEQNPMSSMLCLNDSAQNYSRKWGFEKLLFKDIKSGIWRDRKYAIQAKNAYATGVHMSEN 420
           IEQNPMSSMLCLNDSAQNYSRKWGFEKLLFKDIKSGIWRDRKYAIQAKNAYATGVHMSEN
Sbjct: 361 IEQNPMSSMLCLNDSAQNYSRKWGFEKLLFKDIKSGIWRDRKYAIQAKNAYATGVHMSEN 420

Query: 421 VIGNTTHKTESKIRYYHYHNSIMVRGELCREFLPNSAIHNVTIFNQTPFVYDDKMKKLAD 480
           VIGNTTHKTESKIRYYHYHNSIMVRGELCREFLPNSAIHNVTIFNQTPFVYDDKMKKLAD
Sbjct: 421 VIGNTTHKTESKIRYYHYHNSIMVRGELCREFLPNSAIHNVTIFNQTPFVYDDKMKKLAD 480

Query: 481 TIKEFERHAI 491
           TIKEFERHAI
Sbjct: 481 TIKEFERHAI 490

BLAST of CmaCh01G020540 vs. NCBI nr
Match: XP_023525128.1 (galactan beta-1,4-galactosyltransferase GALS1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1016.5 bits (2627), Expect = 1.1e-292
Identity = 485/490 (98.98%), Positives = 487/490 (99.39%), Query Frame = 0

Query: 1   MRKDGPATSIGDAQVGKLFTCFETKPLVATCLALTLVMFLWNLPPYYQNLLFTTARSCSA 60
           MRKDGPATSIGDAQVGKLFTCFETKPLVATCLALTLVMFLWNLPPYYQNLLFTTARSCSA
Sbjct: 1   MRKDGPATSIGDAQVGKLFTCFETKPLVATCLALTLVMFLWNLPPYYQNLLFTTARSCSA 60

Query: 61  PTNTADAALANLTIPPNTSLPFTASAAAKKYSTTLPTDPNNRIFQPFGNAAALFVLMGAY 120
           PTNT DAALANLTIPPNTSLPFTASAAAKKYSTTLPTDPNNRIFQPFGNAAALFVLMGAY
Sbjct: 61  PTNTTDAALANLTIPPNTSLPFTASAAAKKYSTTLPTDPNNRIFQPFGNAAALFVLMGAY 120

Query: 121 RGGPRTFAIVGLASKPIHVYGHPWYKCEWIFNNGSSIRAKAYKILPDWGYGRVYTVVVVN 180
           RGGPRTFA+VGLASKPIHVYGHPWYKCEWIFNNGSSIRAKAYKILPDWGYGRVYTVVVVN
Sbjct: 121 RGGPRTFAVVGLASKPIHVYGHPWYKCEWIFNNGSSIRAKAYKILPDWGYGRVYTVVVVN 180

Query: 181 CTFPLNPNHDNSGGKLTVNAYYGQSQKKYEKFTALEELPGSYNASKFRPPYDYEYLYCGS 240
           CTFPLNPNHDNSGGKLTVNAYYGQSQKKYEKFTALEELPGSYNASKFRPPYDYEYLYCGS
Sbjct: 181 CTFPLNPNHDNSGGKLTVNAYYGQSQKKYEKFTALEELPGSYNASKFRPPYDYEYLYCGS 240

Query: 241 SLYGNLSAARIREWMAYHAWFFGSKSHFVFHDAGGVSPEVRAVLEPWVRAGRVTIQDIRA 300
           SLYGNLSAARIREWMAYHAWFFGSKSHFVFHDAGGVSPEVRAVLEPWVRAGRVTIQDIRA
Sbjct: 241 SLYGNLSAARIREWMAYHAWFFGSKSHFVFHDAGGVSPEVRAVLEPWVRAGRVTIQDIRA 300

Query: 301 QSEYDGYYYNQFLVVNDCLHRYRHAANWTFYFDVDEYIYLPEGSSLESVLEEFSAFTQFT 360
           QSEYDGYYYNQFLVVNDCLHRYRHAANWTFYFDVDEYIYLPEGSSLESVLEEFS +TQFT
Sbjct: 301 QSEYDGYYYNQFLVVNDCLHRYRHAANWTFYFDVDEYIYLPEGSSLESVLEEFSVYTQFT 360

Query: 361 IEQNPMSSMLCLNDSAQNYSRKWGFEKLLFKDIKSGIWRDRKYAIQAKNAYATGVHMSEN 420
           IEQNPMSSMLCLNDSAQNYSRKWGFEKLLFKDIKSGIWRDRKYAIQAKNAYATGVHMSEN
Sbjct: 361 IEQNPMSSMLCLNDSAQNYSRKWGFEKLLFKDIKSGIWRDRKYAIQAKNAYATGVHMSEN 420

Query: 421 VIGNTTHKTESKIRYYHYHNSIMVRGELCREFLPNSAIHNVTIFNQTPFVYDDKMKKLAD 480
           VIGNTTHKTESKIRYYHYHNSIMVRGELCREFLPNSAIHNVTIFNQ PFVYDDKMKKLAD
Sbjct: 421 VIGNTTHKTESKIRYYHYHNSIMVRGELCREFLPNSAIHNVTIFNQAPFVYDDKMKKLAD 480

Query: 481 TIKEFERHAI 491
           TIKEFERHAI
Sbjct: 481 TIKEFERHAI 490

BLAST of CmaCh01G020540 vs. NCBI nr
Match: KAG6608669.1 (Galactan beta-1,4-galactosyltransferase GALS1, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1013.1 bits (2618), Expect = 1.2e-291
Identity = 482/490 (98.37%), Positives = 487/490 (99.39%), Query Frame = 0

Query: 1   MRKDGPATSIGDAQVGKLFTCFETKPLVATCLALTLVMFLWNLPPYYQNLLFTTARSCSA 60
           MRKD PATSIGDAQVGKLFTCFETKPLVATCLALTLVMFLWNLPPYYQNLLFTTARSCSA
Sbjct: 1   MRKDAPATSIGDAQVGKLFTCFETKPLVATCLALTLVMFLWNLPPYYQNLLFTTARSCSA 60

Query: 61  PTNTADAALANLTIPPNTSLPFTASAAAKKYSTTLPTDPNNRIFQPFGNAAALFVLMGAY 120
           PTNT DAALANLTIPPNTSLPFTASAAAKKYSTTLPTDPNNRIFQPFGNAAALFVLMGAY
Sbjct: 61  PTNTTDAALANLTIPPNTSLPFTASAAAKKYSTTLPTDPNNRIFQPFGNAAALFVLMGAY 120

Query: 121 RGGPRTFAIVGLASKPIHVYGHPWYKCEWIFNNGSSIRAKAYKILPDWGYGRVYTVVVVN 180
           RGGPRTFA+VGLASKPIHVYGHPWYKCEW+FNNGSSIRAKAYKILPDWGYGRVYTVVVVN
Sbjct: 121 RGGPRTFAVVGLASKPIHVYGHPWYKCEWLFNNGSSIRAKAYKILPDWGYGRVYTVVVVN 180

Query: 181 CTFPLNPNHDNSGGKLTVNAYYGQSQKKYEKFTALEELPGSYNASKFRPPYDYEYLYCGS 240
           CTFPLNPNHDNSGGKLTVNAYYGQSQKKYEKFTALEELPGSYNASKFRPPYDYEYLYCGS
Sbjct: 181 CTFPLNPNHDNSGGKLTVNAYYGQSQKKYEKFTALEELPGSYNASKFRPPYDYEYLYCGS 240

Query: 241 SLYGNLSAARIREWMAYHAWFFGSKSHFVFHDAGGVSPEVRAVLEPWVRAGRVTIQDIRA 300
           SLYGNLSAARIREWMAYHAWFFGSKSHFVFHDAGGVSPEVRAVLEPWVRAGRVTIQDIRA
Sbjct: 241 SLYGNLSAARIREWMAYHAWFFGSKSHFVFHDAGGVSPEVRAVLEPWVRAGRVTIQDIRA 300

Query: 301 QSEYDGYYYNQFLVVNDCLHRYRHAANWTFYFDVDEYIYLPEGSSLESVLEEFSAFTQFT 360
           QSEYDGYYYNQFLVVNDCLHRYRHAANWTFYFDVDEYIYLPEGSSLESVLEEFS +TQFT
Sbjct: 301 QSEYDGYYYNQFLVVNDCLHRYRHAANWTFYFDVDEYIYLPEGSSLESVLEEFSVYTQFT 360

Query: 361 IEQNPMSSMLCLNDSAQNYSRKWGFEKLLFKDIKSGIWRDRKYAIQAKNAYATGVHMSEN 420
           IEQNPMSSMLCLNDSAQNYSRKWGFEKLLFKDIKSGIWRDRKYAIQAKNAYATGVHMSEN
Sbjct: 361 IEQNPMSSMLCLNDSAQNYSRKWGFEKLLFKDIKSGIWRDRKYAIQAKNAYATGVHMSEN 420

Query: 421 VIGNTTHKTESKIRYYHYHNSIMVRGELCREFLPNSAIHNVTIFNQTPFVYDDKMKKLAD 480
           VIGNTTHKTES+IRYYHYHNSIMVRGELCREFLPN+AIHNVTIFNQTPFVYDDKMKKLAD
Sbjct: 421 VIGNTTHKTESRIRYYHYHNSIMVRGELCREFLPNTAIHNVTIFNQTPFVYDDKMKKLAD 480

Query: 481 TIKEFERHAI 491
           TIKEFERHAI
Sbjct: 481 TIKEFERHAI 490

BLAST of CmaCh01G020540 vs. NCBI nr
Match: XP_022940530.1 (galactan beta-1,4-galactosyltransferase GALS1 [Cucurbita moschata])

HSP 1 Score: 1012.3 bits (2616), Expect = 2.0e-291
Identity = 481/490 (98.16%), Positives = 488/490 (99.59%), Query Frame = 0

Query: 1   MRKDGPATSIGDAQVGKLFTCFETKPLVATCLALTLVMFLWNLPPYYQNLLFTTARSCSA 60
           MRKDGPATSIGDAQVGKLFTCFETKPLVATCLALTLVMFLWNLPPYYQNLLFTTARSCSA
Sbjct: 1   MRKDGPATSIGDAQVGKLFTCFETKPLVATCLALTLVMFLWNLPPYYQNLLFTTARSCSA 60

Query: 61  PTNTADAALANLTIPPNTSLPFTASAAAKKYSTTLPTDPNNRIFQPFGNAAALFVLMGAY 120
           PTNT DAALANLTIPPNTSLPFTASAAAKKYSTTLPTDPNNRIFQPFGNAAALFVLMGAY
Sbjct: 61  PTNTTDAALANLTIPPNTSLPFTASAAAKKYSTTLPTDPNNRIFQPFGNAAALFVLMGAY 120

Query: 121 RGGPRTFAIVGLASKPIHVYGHPWYKCEWIFNNGSSIRAKAYKILPDWGYGRVYTVVVVN 180
           RGGPRTFA+VGLASKPIHVYGHPWYKCEW+FNNGSSIRAKAYKILPDWGYGRVYTVVVVN
Sbjct: 121 RGGPRTFAVVGLASKPIHVYGHPWYKCEWLFNNGSSIRAKAYKILPDWGYGRVYTVVVVN 180

Query: 181 CTFPLNPNHDNSGGKLTVNAYYGQSQKKYEKFTALEELPGSYNASKFRPPYDYEYLYCGS 240
           CTFPLNPNHDNSGGKLTVNAYYGQSQKKYEKFTALEELPGSYNASKFRPPYDYEYLYCGS
Sbjct: 181 CTFPLNPNHDNSGGKLTVNAYYGQSQKKYEKFTALEELPGSYNASKFRPPYDYEYLYCGS 240

Query: 241 SLYGNLSAARIREWMAYHAWFFGSKSHFVFHDAGGVSPEVRAVLEPWVRAGRVTIQDIRA 300
           SLYGNLSAARIREWMAYHAWFFGSKSHFVFHDAGGVSPEVRAVLEPWVRAGRVTIQDIRA
Sbjct: 241 SLYGNLSAARIREWMAYHAWFFGSKSHFVFHDAGGVSPEVRAVLEPWVRAGRVTIQDIRA 300

Query: 301 QSEYDGYYYNQFLVVNDCLHRYRHAANWTFYFDVDEYIYLPEGSSLESVLEEFSAFTQFT 360
           QSEYDGYYYNQFLVVNDCLHRYRHAANWTFYFDVDEY+YLPEGSSLESVLEEFS +TQFT
Sbjct: 301 QSEYDGYYYNQFLVVNDCLHRYRHAANWTFYFDVDEYMYLPEGSSLESVLEEFSVYTQFT 360

Query: 361 IEQNPMSSMLCLNDSAQNYSRKWGFEKLLFKDIKSGIWRDRKYAIQAKNAYATGVHMSEN 420
           IEQNPMSSMLCLNDSAQ+YSRKWGFEKLLFKDIKSGIWRDRKYAIQAKNAYATGVHMSEN
Sbjct: 361 IEQNPMSSMLCLNDSAQSYSRKWGFEKLLFKDIKSGIWRDRKYAIQAKNAYATGVHMSEN 420

Query: 421 VIGNTTHKTESKIRYYHYHNSIMVRGELCREFLPNSAIHNVTIFNQTPFVYDDKMKKLAD 480
           VIGNTTHKTES+IRYYHYHNSIMVRGELCREFLPN+AIHNVTIFNQTPFVYDDKMKKLAD
Sbjct: 421 VIGNTTHKTESRIRYYHYHNSIMVRGELCREFLPNTAIHNVTIFNQTPFVYDDKMKKLAD 480

Query: 481 TIKEFERHAI 491
           TIKEFERHAI
Sbjct: 481 TIKEFERHAI 490

BLAST of CmaCh01G020540 vs. NCBI nr
Match: KAG7037984.1 (Galactan beta-1,4-galactosyltransferase GALS1 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1011.1 bits (2613), Expect = 4.5e-291
Identity = 482/490 (98.37%), Positives = 487/490 (99.39%), Query Frame = 0

Query: 1   MRKDGPATSIGDAQVGKLFTCFETKPLVATCLALTLVMFLWNLPPYYQNLLFTTARSCSA 60
           MRKDGPATSIGDAQVGKLFTCFETKPLVATCLALTLVMFLWNLPPYYQNLLFTTARSCSA
Sbjct: 1   MRKDGPATSIGDAQVGKLFTCFETKPLVATCLALTLVMFLWNLPPYYQNLLFTTARSCSA 60

Query: 61  PTNTADAALANLTIPPNTSLPFTASAAAKKYSTTLPTDPNNRIFQPFGNAAALFVLMGAY 120
           PTNT DAALANLTIPPNTSLPFTASAAAKKYSTTLPTDPNNRIFQPFGNAAALFVLMGAY
Sbjct: 61  PTNTTDAALANLTIPPNTSLPFTASAAAKKYSTTLPTDPNNRIFQPFGNAAALFVLMGAY 120

Query: 121 RGGPRTFAIVGLASKPIHVYGHPWYKCEWIFNNGSSIRAKAYKILPDWGYGRVYTVVVVN 180
           RGGPRTFA+VGLASKPIHVYGHPWYKCEW+FNNGSSIRAKAYKILPDWGYGRVYTVVVVN
Sbjct: 121 RGGPRTFAVVGLASKPIHVYGHPWYKCEWLFNNGSSIRAKAYKILPDWGYGRVYTVVVVN 180

Query: 181 CTFPLNPNHDNSGGKLTVNAYYGQSQKKYEKFTALEELPGSYNASKFRPPYDYEYLYCGS 240
           CTFPLNPNHDNSGGKLTVNAYYGQSQKKYEKFTALEELPGSYNASKFRPPYDYEYLYCGS
Sbjct: 181 CTFPLNPNHDNSGGKLTVNAYYGQSQKKYEKFTALEELPGSYNASKFRPPYDYEYLYCGS 240

Query: 241 SLYGNLSAARIREWMAYHAWFFGSKSHFVFHDAGGVSPEVRAVLEPWVRAGRVTIQDIRA 300
           SLYGNLSAARIREWMAYHAWFFGSKSHFVFHDAGGVSPEVRAVLEPWVRAGRVTIQDIRA
Sbjct: 241 SLYGNLSAARIREWMAYHAWFFGSKSHFVFHDAGGVSPEVRAVLEPWVRAGRVTIQDIRA 300

Query: 301 QSEYDGYYYNQFLVVNDCLHRYRHAANWTFYFDVDEYIYLPEGSSLESVLEEFSAFTQFT 360
           QSEYDGYYYNQFLVVNDCLHRYRHAANWTFYFDVDEYIYLPEGSSLESVLEEFS +TQFT
Sbjct: 301 QSEYDGYYYNQFLVVNDCLHRYRHAANWTFYFDVDEYIYLPEGSSLESVLEEFSVYTQFT 360

Query: 361 IEQNPMSSMLCLNDSAQNYSRKWGFEKLLFKDIKSGIWRDRKYAIQAKNAYATGVHMSEN 420
           IEQNPMSSMLCLNDSAQNYSRKWGFEKLLFKDIKSGIWRDRKYAIQAKNAYATGVHMSEN
Sbjct: 361 IEQNPMSSMLCLNDSAQNYSRKWGFEKLLFKDIKSGIWRDRKYAIQAKNAYATGVHMSEN 420

Query: 421 VIGNTTHKTESKIRYYHYHNSIMVRGELCREFLPNSAIHNVTIFNQTPFVYDDKMKKLAD 480
           VIGNTTHKTES+IRYYHYHNSIMVRGELCREFLPN+AI NVTIFNQTPFVYDDKMKKLAD
Sbjct: 421 VIGNTTHKTESRIRYYHYHNSIMVRGELCREFLPNTAILNVTIFNQTPFVYDDKMKKLAD 480

Query: 481 TIKEFERHAI 491
           TIKEFERHAI
Sbjct: 481 TIKEFERHAI 490

BLAST of CmaCh01G020540 vs. TAIR 10
Match: AT2G33570.1 (Domain of unknown function (DUF23))

HSP 1 Score: 739.6 bits (1908), Expect = 2.3e-213
Identity = 349/472 (73.94%), Positives = 404/472 (85.59%), Query Frame = 0

Query: 21  CFETKPLVATCLALTLVMFLWNLPPYYQNLLFTTARSCSAPTNTADAAL---ANLTIPPN 80
           CFE KP++AT LAL+LVM +WNLPPYY NL+ +TAR CSA T T    L   +N T   N
Sbjct: 16  CFEKKPIIATLLALSLVMIVWNLPPYYHNLI-STARPCSAVTTTTTTTLLSSSNFTSAEN 75

Query: 81  --TSLPFTASAAAKKYSTTLPTDPNNRIFQPFGNAAALFVLMGAYRGGPRTFAIVGLASK 140
             TSL  T +AA++KY +T P+DPN R+FQPFGNAAALFVLMGAYRGGP TF+++GLASK
Sbjct: 76  FTTSLSTTTAAASQKYDST-PSDPNKRVFQPFGNAAALFVLMGAYRGGPTTFSVIGLASK 135

Query: 141 PIHVYGHPWYKCEWIFNNGSSIRAKAYKILPDWGYGRVYTVVVVNCTFPLNPNHDNSGGK 200
           PIHVYG PWYKCEWI NNG+SIRAKA KILPDWGYGRVYTVVVVNCTF  NPN DN+GGK
Sbjct: 136 PIHVYGKPWYKCEWISNNGTSIRAKAQKILPDWGYGRVYTVVVVNCTFNSNPNSDNTGGK 195

Query: 201 LTVNAYYGQSQKKYEKFTALEELPGSYNASKFRPPYDYEYLYCGSSLYGNLSAARIREWM 260
           L +NAYY +S K +E+FT LEE  G Y+ SK+ PPY Y+YLYCGSSLYGN+SA+R+REWM
Sbjct: 196 LILNAYYNESPKLFERFTTLEESAGIYDESKYSPPYQYDYLYCGSSLYGNVSASRMREWM 255

Query: 261 AYHAWFFGSKSHFVFHDAGGVSPEVRAVLEPWVRAGRVTIQDIRAQSEYDGYYYNQFLVV 320
           AYHAWFFG KSHFVFHDAGGVSPEVR VLEPW+RAGRVT+Q+IR QS+YDGYYYNQFL+V
Sbjct: 256 AYHAWFFGDKSHFVFHDAGGVSPEVRKVLEPWIRAGRVTVQNIRDQSQYDGYYYNQFLIV 315

Query: 321 NDCLHRYRHAANWTFYFDVDEYIYLPEGSSLESVLEEFSAFTQFTIEQNPMSSMLCLNDS 380
           NDCLHRYR+AANWTF+FDVDEYIYLP G++LESVL+EFS  TQFTIEQNPMSS+LC+NDS
Sbjct: 316 NDCLHRYRYAANWTFFFDVDEYIYLPHGNTLESVLDEFSVNTQFTIEQNPMSSVLCINDS 375

Query: 381 AQNYSRKWGFEKLLFKDIKSGIWRDRKYAIQAKNAYATGVHMSENVIGNTTHKTESKIRY 440
           +Q+Y R+WGFEKLLFKD ++ I RDRKYAIQAKNA+ATGVHMSEN++G T HKTE+KIRY
Sbjct: 376 SQDYPRQWGFEKLLFKDSRTKIRRDRKYAIQAKNAFATGVHMSENIVGKTLHKTETKIRY 435

Query: 441 YHYHNSIMVRGELCREFLPNSAIHNVTIFNQTPFVYDDKMKKLADTIKEFER 488
           YHYHN+I V  ELCRE LPNSA   VT++N+ P+VYDD MKKL  TIKEFE+
Sbjct: 436 YHYHNTITVHEELCREMLPNSAKKKVTLYNKLPYVYDDNMKKLVKTIKEFEQ 485

BLAST of CmaCh01G020540 vs. TAIR 10
Match: AT5G44670.1 (Domain of unknown function (DUF23))

HSP 1 Score: 424.1 bits (1089), Expect = 2.2e-118
Identity = 212/399 (53.13%), Positives = 262/399 (65.66%), Query Frame = 0

Query: 102 RIFQPFGNAAALFVLMGAYRGGPRTFAIVGLASKPIHVYGHPWYKCEWIFNNGSS--IRA 161
           R F  +G AA  FVLM AYRGG  TFA++GL+SKP+HVY HP Y+CEWI  N S   I  
Sbjct: 116 RTFTGYGWAAYNFVLMNAYRGGVNTFAVIGLSSKPLHVYSHPTYRCEWIPLNQSDNRILT 175

Query: 162 KAYKILPDWGYGRVYTVVVVNCTFPLNP--NHDNSGGKLTVNAYYGQSQKKY-EKFTALE 221
              KIL DWGYGRVYT VVVNCTFP N   N  N+GG L ++A  G + +   +    L 
Sbjct: 176 DGTKILTDWGYGRVYTTVVVNCTFPSNTVINPKNTGGTLLLHATTGDTDRNITDSIPVLT 235

Query: 222 ELPGSYN----ASKFRPPYDYEYLYCGSSLYGNLSAARIREWMAYHAWFFGSKSHFVFHD 281
           E P + +     S  R    Y+YLYCGSSLYGNLS  RIREW+AYH  FFG +SHFV HD
Sbjct: 236 ETPNTVDFALYESNLRRREKYDYLYCGSSLYGNLSPQRIREWIAYHVRFFGERSHFVLHD 295

Query: 282 AGGVSPEVRAVLEPWVRAGRVTIQDIRAQSEYDGYYYNQFLVVNDCLHRYRHAANWTFYF 341
           AGG++ EV  VL+PW+  GRVT+ DIR Q  +DGYY+NQF+VVNDCLHRYR  A W F+F
Sbjct: 296 AGGITEEVFEVLKPWIELGRVTVHDIREQERFDGYYHNQFMVVNDCLHRYRFMAKWMFFF 355

Query: 342 DVDEYIYLPEGSSLESVLEEFSAFTQFTIEQNPMSSMLCLN-DSAQNYSRKWGFEKLLFK 401
           DVDE+IY+P  SS+ SV+     ++QFTIEQ PMSS LC + D      RKWGFEKL ++
Sbjct: 356 DVDEFIYVPAKSSISSVMVSLEEYSQFTIEQMPMSSQLCYDGDGPARTYRKWGFEKLAYR 415

Query: 402 DIKSGIWRDRKYAIQAKNAYATGVHMSENVIGNTTHKTESKIRYYHYHNSIMVRGELCRE 461
           D+K    RDRKYA+Q +N +ATGVHMS+++ G T H+ E KIRY+HYH SI  R E CR 
Sbjct: 416 DVKKVPRRDRKYAVQPRNVFATGVHMSQHLQGKTYHRAEGKIRYFHYHGSISQRREPCRH 475

Query: 462 FLPNSAIHNVTIFNQTPFVYDDKMKKLADTIKEFERHAI 491
               + I    +    P+V D  M+ +   +K FE   I
Sbjct: 476 LYNGTRI----VHENNPYVLDTTMRDIGLAVKTFEIRTI 510

BLAST of CmaCh01G020540 vs. TAIR 10
Match: AT4G20170.1 (Domain of unknown function (DUF23))

HSP 1 Score: 414.1 bits (1063), Expect = 2.3e-115
Identity = 220/477 (46.12%), Positives = 294/477 (61.64%), Query Frame = 0

Query: 27  LVATCLALTLVMFLWNLPPYYQNLLFTTARSCSAPTNTADAALANLTIPPNTSLPFTASA 86
           L+  C   TL+ F+    P   +L  +  R C +  ++A       T+  ++S P    +
Sbjct: 35  LLVLCTLATLLPFI----PSSFSLSTSDFRFCISRFSSAVPLNTTTTVEESSSSP----S 94

Query: 87  AAKKYSTTLPTDPNNRIFQPFGNAAALFVLMGAYRGGPRTFAIVGLASKPIHVYGHPWYK 146
             K     L      R F  +G+AA  FV M AYRGG  +FA++GL+SKP+HVYGHP Y+
Sbjct: 95  PEKNLDRVLDNGVIKRTFTGYGSAAYNFVSMSAYRGGVNSFAVIGLSSKPLHVYGHPSYR 154

Query: 147 CEWIFNNGSS--IRAKAYKILPDWGYGRVYTVVVVNCTF----PLNPNHDNSGGKLTVNA 206
           CEW+  + +   I    +KIL DWGYGR+YT VVVNCTF     +NP   NSGG L ++A
Sbjct: 155 CEWVSLDPTQDPISTTGFKILTDWGYGRIYTTVVVNCTFSSISAVNP--QNSGGTLILHA 214

Query: 207 YYGQ-SQKKYEKFTALEELPGS-----YNASKFRPPYDYEYLYCGSSLYGNLSAARIREW 266
             G  +    +  + L E P S     YN++K    YD  YLYCGSSLYGNLS  R+REW
Sbjct: 215 TTGDPTLNLTDSISVLTEPPKSVDFDLYNSTKKTKKYD--YLYCGSSLYGNLSPQRVREW 274

Query: 267 MAYHAWFFGSKSHFVFHDAGGVSPEVRAVLEPWVRAGRVTIQDIRAQSEYDGYYYNQFLV 326
           +AYH  FFG +SHFV HDAGG+  EV  VL+PW+  GRVT+ DIR Q  +DGYY+NQF++
Sbjct: 275 IAYHVRFFGERSHFVLHDAGGIHEEVFEVLKPWIELGRVTLHDIRDQERFDGYYHNQFMI 334

Query: 327 VNDCLHRYRHAANWTFYFDVDEYIYLPEGSSLESVLEEFSAFTQFTIEQNPMSSMLCLN- 386
           VNDCLHRYR    W F+FDVDE++++P   ++ SV+E    ++QFTIEQ PMSS +C + 
Sbjct: 335 VNDCLHRYRFMTKWMFFFDVDEFLHVPVKETISSVMESLEEYSQFTIEQMPMSSRICYSG 394

Query: 387 DSAQNYSRKWGFEKLLFKDIKSGIWRDRKYAIQAKNAYATGVHMSENVIGNTTHKTESKI 446
           D      RKWG EKL ++D+K    RDRKYA+Q +N +ATGVHMS+N+ G T HK ESKI
Sbjct: 395 DGPARTYRKWGIEKLAYRDVKKVPRRDRKYAVQPENVFATGVHMSQNLQGKTYHKAESKI 454

Query: 447 RYYHYHNSIMVRGELCREFLPNSAIHNVTIFNQTPFVYDDKMKKLADTIKEFERHAI 491
           RY+HYH SI  R E CR+   +S +    +F  TP+V D  +  +   ++ FE   I
Sbjct: 455 RYFHYHGSISQRREPCRQLFNDSRV----VFENTPYVLDTTICDVGLAVRTFELRTI 495

BLAST of CmaCh01G020540 vs. TAIR 10
Match: AT1G76600.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: N-terminal protein myristoylation; LOCATED IN: nucleolus, nucleus; EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 15 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G21010.1); Has 220 Blast hits to 220 proteins in 14 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 220; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 158.7 bits (400), Expect = 1.7e-38
Identity = 107/218 (49.08%), Positives = 140/218 (64.22%), Query Frame = 0

Query: 502 MGACLSDCLNHPKPSSVSPPPPTAKVISLQGHLREYPVPISVSRVLQTENSSSSLSDS-- 561
           MG C+S   N    SS      TAK++++ G LREY VP+  S+VL++E++SSS S S  
Sbjct: 1   MGLCVSVNRNEYVSSST-----TAKIVTINGDLREYDVPVLASQVLESESTSSSSSSSSS 60

Query: 562 --FLCNSDRLYYDDFIPPLPLDEQLLPNQIYFLLPSSNLHHRLSASQMAALAVKASLALQ 621
             FLCNSD LYYDDFIP +  DE L  NQIYF+LP S   +RLSAS MAALAVKAS+A++
Sbjct: 61  SYFLCNSDSLYYDDFIPAIESDEILQANQIYFVLPISKRQYRLSASDMAALAVKASVAIE 120

Query: 622 NAS--PNDRRKKGRVSPLLNLSD-SDHIIS------------------KEPSKKNAAADT 681
            A+   N RR+ GR+SP++ L+  +D+ I+                  K P++     DT
Sbjct: 121 KAAGKKNRRRRSGRISPVVTLNQANDNRIAAVNNRIGGEATNMMMQKGKLPNRTTPFKDT 180

Query: 682 ---SASPSVRKLQRLTSRRAKMAVRSFKLKLSTIYEGA 692
              S S SVRKL+R TS RAK+AVRSF+L+LSTIYEG+
Sbjct: 181 NGYSRSGSVRKLKRYTSGRAKLAVRSFRLRLSTIYEGS 213

BLAST of CmaCh01G020540 vs. TAIR 10
Match: AT1G21010.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: N-terminal protein myristoylation; LOCATED IN: cellular_component unknown; EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 14 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G76600.1); Has 206 Blast hits to 206 proteins in 13 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 206; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 147.1 bits (370), Expect = 5.1e-35
Identity = 92/195 (47.18%), Positives = 128/195 (65.64%), Query Frame = 0

Query: 523 PTAKVISLQGHLREYPVPISVSRVLQTE-------NSSSSLSDSFLCNSDRLYYDDFIPP 582
           PT K++++ G LREY VP+  S+VL+ E       +SSS  S  F+C+SD LYYDDFIP 
Sbjct: 16  PTVKIVTVNGDLREYNVPVIASQVLEAESAAAYSSSSSSRPSSYFICDSDSLYYDDFIPA 75

Query: 583 LPLDEQLLPNQIYFLLPSSNLHHRLSASQMAALAVKASLALQNASPND--RRKKGRVSPL 642
           +  +E L  +QIYF+LP S    RL+AS MAALAVKAS+A+QN+   +  RRKK R+SP+
Sbjct: 76  IKSEEPLQADQIYFVLPISKRQSRLTASDMAALAVKASVAIQNSVKKESRRRKKVRISPV 135

Query: 643 LNLSDSDHIISKEPSKK---------------NAAADTSASPSVRKLQRLTSRRAKMAVR 694
           + L+ S+  ++   S+                 A++  + S SVR L+R TS+RAK+AVR
Sbjct: 136 MMLTGSNDSVNGNGSETTVKKGRPFVSKTAPVKASSGINRSGSVRNLRRYTSKRAKLAVR 195

The following BLAST results are available for this feature:
Match Name      E-value   Identity  Description
O22807          3.3e-212  73.94     Galactan beta-1,4-galactosyltransferase GALS1 OS=Arabidopsis thaliana OX=3702 GN...
Q9LTZ9          3.1e-117  53.13     Galactan beta-1,4-galactosyltransferase GALS2 OS=Arabidopsis thaliana OX=3702 GN...
O65431          3.2e-114  46.12     Galactan beta-1,4-galactosyltransferase GALS3 OS=Arabidopsis thaliana OX=3702 GN...

Match Name      E-value   Identity  Description
A0A6J1J3E1      5.5e-295  100.00    Glycosyltransferase family 92 protein OS=Cucurbita maxima OX=3661 GN=LOC11148098...
A0A6J1FQW1      9.7e-292  98.16     Glycosyltransferase family 92 protein OS=Cucurbita moschata OX=3662 GN=LOC111446...
A0A6J1D6M5      1.0e-256  86.06     Glycosyltransferase family 92 protein OS=Momordica charantia OX=3673 GN=LOC11101...
A0A067KKI4      7.1e-226  77.44     Glycosyltransferase family 92 protein OS=Jatropha curcas OX=180498 GN=JCGZ_08012...
A0A6A1UUH7      7.8e-225  76.40     Glycosyltransferase family 92 protein OS=Morella rubra OX=262757 GN=CJ030_MR8G00...

Match Name      E-value   Identity  Description
XP_022982004.1  1.1e-294  100.00    galactan beta-1,4-galactosyltransferase GALS1 [Cucurbita maxima]
XP_023525128.1  1.1e-292  98.98     galactan beta-1,4-galactosyltransferase GALS1 [Cucurbita pepo subsp. pepo]
KAG6608669.1    1.2e-291  98.37     Galactan beta-1,4-galactosyltransferase GALS1, partial [Cucurbita argyrosperma s...
XP_022940530.1  2.0e-291  98.16     galactan beta-1,4-galactosyltransferase GALS1 [Cucurbita moschata]
KAG7037984.1    4.5e-291  98.37     Galactan beta-1,4-galactosyltransferase GALS1 [Cucurbita argyrosperma subsp. arg...

Match Name      E-value   Identity  Description
AT2G33570.1     2.3e-213  73.94     Domain of unknown function (DUF23)
AT5G44670.1     2.2e-118  53.13     Domain of unknown function (DUF23)
AT4G20170.1     2.3e-115  46.12     Domain of unknown function (DUF23)
AT1G76600.1     1.7e-38   49.08     unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: N-termin...
AT1G21010.1     5.1e-35   47.18     unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: N-termin...
InterPro
Analysis Name: InterPro Annotations of Cucurbita maxima (Rimu) v1.1
Date Performed: 2021-10-25
IPR Term   IPR Description                             Source   Source Term     Source Description                             Alignment
IPR008166  Glycosyltransferase family 92               PFAM     PF01697         Glyco_transf_92                                coord: 234..449; e-value: 4.9E-33; score: 114.9
IPR025322  Protein of unknown function DUF4228, plant  PFAM     PF14009         DUF4228                                        coord: 502..689; e-value: 1.0E-28; score: 101.0
None       No IPR available                            PANTHER  PTHR21461:SF56  GALACTAN BETA-1,4-GALACTOSYLTRANSFERASE GALS1  coord: 14..492
None       No IPR available                            PANTHER  PTHR21461       UNCHARACTERIZED                                coord: 14..492

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmaCh01G020540.1  CmaCh01G020540.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
cellular_component  GO:0016021      integral component of membrane
molecular_function  GO:0016757      glycosyltransferase activity