Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonfive_prime_UTRCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
GCCTCTGCCTCATTACCGACCAACAAACTGCCGACAAATATCAGCTTCCGATCACCGGACTCGTTCTTCTATTCCTCTGTTCTTCCGTTTTCGAACTCAAAAGAAAGATCCATTTGAACGGCTGGTTCACATTGCTTTTACTTCTACCTTCTGACTCACAAGGTATTGATTTTTGCAGCAGATCTCTACGGCTCTGCTTCGCACTGTTGATCGATTGCTGTTGCCTGATGGTCTTGCTTGAAGTGGAATCTATAACTTCAATAGCTTTAATTTCTACTTCGCTTGAAATATGATGTTCAGATACTCGGTTTCTGTTTAGTTTTAGTTTAGCTCATTTCCTGTATGTAGTTTAATTAGTAGTTTAGATCCTTCGCTCTGTTAATCGATTTCTGGCGCCTGATAGTGTTGCTTGAAGTAGAATCCATAGCTTCAATAGCTTTGATTTTTATTTTTATTCAAATATGTTGCTTAGATACTCGGATTCTGTTTAGCTTTAGCTCATTTCCTATATGTACTTGAATTTGTAGTTAAGATCCTTCCTTAGGGATTCTTCAATTGTACAAGCTGTAAGTTCTAAGTGCTTCTCGCGAAATTGGGTTTTCACAAGCCCCCTGGCTTAAATTATACCAATGTGCATGTGAGTTAAGAGCTACCATTACTCAGTTCGAGCACTTTTGGTCGATTGCTTGGTAGCTACAAATGAAAAGGAGTTCTTGTATCGTAGTTTCTCTTTCAGCTTCATAGTATAAATCTGGTCCAAAATTTAGGTTTGTGTCTATAACTTGAGAAATTGGGACCAAGAAGCTCTGGACAGCATATGAGTTTTGTTTCCGTGCTGTAACCTTCCGTGAGGCCCGTGTCGGTCAAATGCATGTTATTTTCGCATGTGTAGTCCATTTTCTTACAGAGGTAGTCTAGATTTTGATGATCCATTTTGATTTGTGTCAATTATGATTATGTATCATTTGAAGCTTAATTCAAGTGCTGAGCTTAGCATCCTCTTGTAATTCACAGGGAGTTTCTTAAAATGAAGTGGTAGCAATCATAACTGTAAAAGAGAAGAAAAATGAAGAGGTGGTATGGAGGAACATTGATACTGGCACTTGCCACAATCTTAGCTTTGCGTTATGGCCTTATGAATATCCAGCCTAAAAAGCAATCGGCATATGATTTTTTCAGAAATCATCCGACCAAAGATTCTCATAGTAAAAACAGTGACTCTTTGGAGGCTGAAGTAGTAAAAACATCAGAGCGGCCTCATCTTATTCATATTGAAGGACTTCGTTATCTAATTGCTCCAGATAATATTACAAAGCGAGCGTCAGAGGCCTTACTTCTGTGGTCTCATATGCACCCCCTGCTGATGAGGTCTGATGCTTTACCTGAAACAATACAAGGAGTTAAAGAGGCTTCCACAGCGTGGAATGATTTATTGTCAGCTATTAAGGCAGAAAAGACCATTATAGTTGGCAATATGTCGAAGGGTGAAATATGCCCTTCCTCTGTTACCTCACCTGACAAAATTGCACCAACTGGGGGAATCGTTCTTGAGATCCCTTGTGGTTTAGTTGAGGATTCTTCTATAACCCTTGTTGGCATACCTAATGGACAGCAAGGGGGCTTCCAGATTGAACTGTTAGGCTCTCAGGCTTCCGGAGAGCCAAATCGCCCTATTATCTTGCATTACAATGTCAGTTTGCCTGGTGATAATATGTCTGAGGAATCATTTATAGTTCAAAATACATGGACTGATGAACTTAAGTGGGGCAAAGAGGAGAGATGTCCAACTCATCTGTCAGCAAGCTCTCATCAAGGTATATTTAGATGGCATGTTTGTCAGTTGCCCATTTGCCATGAAATTTCCAGCCAAGATTAACACATCTTATGTGTAGGTGAAGAAGCCAAATTAATGTAATATTCTGCACGAACTGAATATACCTGAGTCTATATTACCTATCAGTTCTTGTCTTTCTACATACGATACATTAGATAATGATAAGAACCTACTGTGTTATTCTAGGATGCAGGCATTGAGAAAGTTAAAAAAATGCTTTCGAGTTTTGACTTATGCATGTATATGGAGTTGGTGTAGTACTTCATGGCTTGTTGATAGATTGATCTTAAATGCAGTCTGCTGTATCTTTTGTTGCTTACAGATGAAATTCAGTAATATATCCTCTGGTTGCTGATATATTTTGGAAATAAATGTGGCAGTTGATGGACTTGTTCTTTGTAATGAGCGTGTTCTCCGAAGCACAGGAGCAGAAAATATCAGTATGCATCATAATAATGGTAATACCGTAACTAATGTTTCCAGAGGGCAATCTCATGAAAGTACCAACTTTCCATTCATAGAGGGGAATTTGTTCACTGCAACATTGTGGATTGGTTTGGAAGGATTCCATATGAATGTCAATGGACGACATGAAACCTCATTTGAATATAGGGAGGTAAGGTATGAATTATGCAATCATTTTGTTAATATGATGAACTAATGAAACCAATAAATATGTTTCAATTTTGAGCTGAACTTATGGAATGCTACGTGAAAATTCTGCAGAAACTTGAACCGTGGACAGTCAATCAAGTCAAGGTAACAGGTGGTCTGGATCTTCTCTCTTCCTTTGCTAAAGGCTTACCAGTCTTTGAAGATCATGATTTTATTAACTCTTCCCACCTTGGAGCTCCTCCTATTCCGAAGAAAAGACTTCTGATGCTGGTCGGGGTTTTTTCTACTGGAAATAATTTTAAGCGTCGTATGGCATTGAGAAGGACTTGGATGCAATATGAGGTTGTACGTAGTGGTGATGTAGCGGTCCGATTTTTCATAGGCTTTGTAAGTGATGCAATCTGCGATTCTCATAGTGGTTTATTTTGACCCTATAATCTCTGAATCTCCCATGAACTTGTGGCTTTTATTCTTCTCATTTTCTAGCCTATAATCTGAAGTCTTCGACTTTTCTTTTTATTTTATTTAATGACCGAGATGTGTTCTTTTTGGGAATTGTCTTCAATGCCGATTTCTCCCATGGATAAGCCGCTCGGCATTGGAGACTGTTCTTACTGGCTCTGTTATACAGTTTTTTGACAATCTAACTTTGACATGTTTCAGCTTTGAAGTCACAAGCAGTTCTCACAAAGAAAATCAATAGTAAAGCTAAAAGGATCTCAACAAATAAATGTTGCTAGTCTAGATTTATAATTGATTATTTTAATTATTTTTTTTGTATGCTCTTGATCTGACATTGATGCACACATTTTATACAATTTTCGGTATTTATTATCGCTTGATATTGGATAATTTACTGCTGCAGGACAAGAATGCACAAGTAAATTGGGAGCTCTGGAGAGAAGTGGAAGCTTATGGTGATATTCAGTTGATGCCTTTTGTTGATTATTACAGTCTGATCACTTTGAAAACAATTGCAATTTGCATTTTTGGGGTAAGTGTATACTTAAAACAACTGAGCCTTTCACTTGTTTTGTTATTTGAAACTATTCGCATGGACCTTTTGCCTTCCCTGCATAAGTTATCTGAAATCTATCACTTTTCATTTTAACAACTTCAGACCAAGATCCTTCCTGCAAAATATATCATGAAGACAGATGATGATGCGTTTGTTAGAATTGATGAAGTTCTTTCTGGACTAAAGAGCAGACCAGCTTCTGGCCTTCTGTATGGTCTTATTTCCTTTGATTCATCACCCGATAGAGATAAAGACAGCAAGTGGCATATTAGTATGGAGGTATCTGACTCAAAACTTGATCTTATATGTTGGTCTCGTTTGTTTTATTATGATAGCAACTCTTGCTGTGTGTAGCATTGTTTTTGTAATTTATTAAACTAAATTCAAAAAGTCAGACCAACTAATCTGTTCAAGTTTTGGATAGAAAATGAAATTGGCTGGGGGTTGAAAATAGGACCTGCTGCTATTACTAAACTTAAATGTGTAAACAAATACGTTGAGTTGAGTGTTGTTTTGTATGAAATAGTGCTTCATGTGAGATTCCGCATCGGTTGGAGAGGGAAACGAAGCATTCGTTATAAGAGTGTGGAAACCTCTCTTTAACGGACGCGTTTTAAAAACCTTGAGGGGAACCCCGAAAGAGAAAACACAAAGAGAACACTATTTGCTAGTGGTAGGCTTGGGCTGTTATAGTTGGCATCAGAGCTAGACATCGGGTGGTGTGCCAGCGAGGATGTTGGCTCCCAAGGGAGTAGATTGTGAGATCCCATATCGGTTGGAGAGGGAAACAAAACATTCTTTGTAAGGGTGTGGAAACCTCTCCCTAACCGATGAGTTTTAAAAACCTTGAGAGGTTCGGACAATATCTGATAGCAGGTGGACTTGGGCGGTTACACTCAACTAAGAAGTCAAACCCAACAAGTACGACTGTTATATTATACCATATGAAACTATTACTAAACTCAAAAGGTTTAAGTTAATGGGTCAAAATATCTTTAATCATTTATACATGTTCTTTTACATACAGGAATGGCCAAATGCGACATACCCTCCGTGGGCGCATGGTCCAGGCTACGTCATATCACGAGACATCGCTAAATTCATTGTCCGAGGCCACCAGAGTAGAGCCCTCAAGGTACAAATACAAGCAACCACCTTGCTTTTTCCTATGCATTGCAAATTCAAAATGGCTCTTCTTATGCTGAACTATTTTTGTCTATGTTCAGCTTTTTAAGCTGGAAGATGTTGCAATGGGCATATGGATTGAGCAATTCAGCAAGGGTGGCAAGGAGGTACAGTACATAAATGAAGAAAGGTTTTACAACTCCGGCTGTGAAGCCAATTACATTCTTGCTCATTACCAAAGCCCAAGATTGGTACTATGCCTTTGGGAAAGGCTGCAAAAACAATTTGATTCCACTTGCTGTGATTAGTATTTATTTGATAATCATGGGTTTGTTTTATGGAATACTTTTGATTTATGTGTTTATGGGAGAGAATACTTTTTTCTAGTTCAACCATGGTGGGAAGGAGGGACATCCGATAAAATTGGAACGATACAGAGAAGATTAACATGGCCCCTACGCAAGGATGACAGGCACAAATCGAGAAATGGTATTAGAGCCAAACATTGGACTGTGTGCCCGCGAGGATGCTAGTCCTTCAAGGAGGGTTTAAGGAGGGTGGATTGTGAGATCACATGTCGGTTGGAGAGGGGAACTAAGCATTCCTTAGGGGTGCAAGTGAAAACCTTTCACGAATAGAAGCATTTTATAATCTTGAGGGGAAGCCTAGAAGAGTTGACAAGAATATATACACCTATATTTCGTAAAATAATATGCATTACAGAATCGAATACAACATATTATCACCATATGTGACACTTTCGCCTCATACGGCTCCGTCCTCGGTTAAACATAAAAACTCCACCAACCCACGTCAAAAACCCTAACCTCAATTAATAGTCTCAGGTCGGTTCGGTTTATCTTCAAAAATGAGATTGGGTCGATTTCTGTGGTTTTTTAGAA
mRNA sequence
GCCTCTGCCTCATTACCGACCAACAAACTGCCGACAAATATCAGCTTCCGATCACCGGACTCGTTCTTCTATTCCTCTGTTCTTCCGTTTTCGAACTCAAAAGAAAGATCCATTTGAACGGCTGGTTCACATTGCTTTTACTTCTACCTTCTGACTCACAAGGGAGTTTCTTAAAATGAAGTGGTAGCAATCATAACTGTAAAAGAGAAGAAAAATGAAGAGGTGGTATGGAGGAACATTGATACTGGCACTTGCCACAATCTTAGCTTTGCGTTATGGCCTTATGAATATCCAGCCTAAAAAGCAATCGGCATATGATTTTTTCAGAAATCATCCGACCAAAGATTCTCATAGTAAAAACAGTGACTCTTTGGAGGCTGAAGTAGTAAAAACATCAGAGCGGCCTCATCTTATTCATATTGAAGGACTTCGTTATCTAATTGCTCCAGATAATATTACAAAGCGAGCGTCAGAGGCCTTACTTCTGTGGTCTCATATGCACCCCCTGCTGATGAGGTCTGATGCTTTACCTGAAACAATACAAGGAGTTAAAGAGGCTTCCACAGCGTGGAATGATTTATTGTCAGCTATTAAGGCAGAAAAGACCATTATAGTTGGCAATATGTCGAAGGGTGAAATATGCCCTTCCTCTGTTACCTCACCTGACAAAATTGCACCAACTGGGGGAATCGTTCTTGAGATCCCTTGTGGTTTAGTTGAGGATTCTTCTATAACCCTTGTTGGCATACCTAATGGACAGCAAGGGGGCTTCCAGATTGAACTGTTAGGCTCTCAGGCTTCCGGAGAGCCAAATCGCCCTATTATCTTGCATTACAATGTCAGTTTGCCTGGTGATAATATGTCTGAGGAATCATTTATAGTTCAAAATACATGGACTGATGAACTTAAGTGGGGCAAAGAGGAGAGATGTCCAACTCATCTGTCAGCAAGCTCTCATCAAGTTGATGGACTTGTTCTTTGTAATGAGCGTGTTCTCCGAAGCACAGGAGCAGAAAATATCAGTATGCATCATAATAATGGTAATACCGTAACTAATGTTTCCAGAGGGCAATCTCATGAAAGTACCAACTTTCCATTCATAGAGGGGAATTTGTTCACTGCAACATTGTGGATTGGTTTGGAAGGATTCCATATGAATGTCAATGGACGACATGAAACCTCATTTGAATATAGGGAGAAACTTGAACCGTGGACAGTCAATCAAGTCAAGGTAACAGGTGGTCTGGATCTTCTCTCTTCCTTTGCTAAAGGCTTACCAGTCTTTGAAGATCATGATTTTATTAACTCTTCCCACCTTGGAGCTCCTCCTATTCCGAAGAAAAGACTTCTGATGCTGGTCGGGGTTTTTTCTACTGGAAATAATTTTAAGCGTCGTATGGCATTGAGAAGGACTTGGATGCAATATGAGGTTGTACGTAGTGGTGATGTAGCGGTCCGATTTTTCATAGGCTTTGACAAGAATGCACAAGTAAATTGGGAGCTCTGGAGAGAAGTGGAAGCTTATGGTGATATTCAGTTGATGCCTTTTGTTGATTATTACAGTCTGATCACTTTGAAAACAATTGCAATTTGCATTTTTGGGACCAAGATCCTTCCTGCAAAATATATCATGAAGACAGATGATGATGCGTTTGTTAGAATTGATGAAGTTCTTTCTGGACTAAAGAGCAGACCAGCTTCTGGCCTTCTGTATGGTCTTATTTCCTTTGATTCATCACCCGATAGAGATAAAGACAGCAAGTGGCATATTAGTATGGAGGAATGGCCAAATGCGACATACCCTCCGTGGGCGCATGGTCCAGGCTACGTCATATCACGAGACATCGCTAAATTCATTGTCCGAGGCCACCAGAGTAGAGCCCTCAAGCTTTTTAAGCTGGAAGATGTTGCAATGGGCATATGGATTGAGCAATTCAGCAAGGGTGGCAAGGAGGTACAGTACATAAATGAAGAAAGGTTTTACAACTCCGGCTGTGAAGCCAATTACATTCTTGCTCATTACCAAAGCCCAAGATTGGTACTATGCCTTTGGGAAAGGCTGCAAAAACAATTTGATTCCACTTGCTAA
Coding sequence (CDS)
ATGAAGAGGTGGTATGGAGGAACATTGATACTGGCACTTGCCACAATCTTAGCTTTGCGTTATGGCCTTATGAATATCCAGCCTAAAAAGCAATCGGCATATGATTTTTTCAGAAATCATCCGACCAAAGATTCTCATAGTAAAAACAGTGACTCTTTGGAGGCTGAAGTAGTAAAAACATCAGAGCGGCCTCATCTTATTCATATTGAAGGACTTCGTTATCTAATTGCTCCAGATAATATTACAAAGCGAGCGTCAGAGGCCTTACTTCTGTGGTCTCATATGCACCCCCTGCTGATGAGGTCTGATGCTTTACCTGAAACAATACAAGGAGTTAAAGAGGCTTCCACAGCGTGGAATGATTTATTGTCAGCTATTAAGGCAGAAAAGACCATTATAGTTGGCAATATGTCGAAGGGTGAAATATGCCCTTCCTCTGTTACCTCACCTGACAAAATTGCACCAACTGGGGGAATCGTTCTTGAGATCCCTTGTGGTTTAGTTGAGGATTCTTCTATAACCCTTGTTGGCATACCTAATGGACAGCAAGGGGGCTTCCAGATTGAACTGTTAGGCTCTCAGGCTTCCGGAGAGCCAAATCGCCCTATTATCTTGCATTACAATGTCAGTTTGCCTGGTGATAATATGTCTGAGGAATCATTTATAGTTCAAAATACATGGACTGATGAACTTAAGTGGGGCAAAGAGGAGAGATGTCCAACTCATCTGTCAGCAAGCTCTCATCAAGTTGATGGACTTGTTCTTTGTAATGAGCGTGTTCTCCGAAGCACAGGAGCAGAAAATATCAGTATGCATCATAATAATGGTAATACCGTAACTAATGTTTCCAGAGGGCAATCTCATGAAAGTACCAACTTTCCATTCATAGAGGGGAATTTGTTCACTGCAACATTGTGGATTGGTTTGGAAGGATTCCATATGAATGTCAATGGACGACATGAAACCTCATTTGAATATAGGGAGAAACTTGAACCGTGGACAGTCAATCAAGTCAAGGTAACAGGTGGTCTGGATCTTCTCTCTTCCTTTGCTAAAGGCTTACCAGTCTTTGAAGATCATGATTTTATTAACTCTTCCCACCTTGGAGCTCCTCCTATTCCGAAGAAAAGACTTCTGATGCTGGTCGGGGTTTTTTCTACTGGAAATAATTTTAAGCGTCGTATGGCATTGAGAAGGACTTGGATGCAATATGAGGTTGTACGTAGTGGTGATGTAGCGGTCCGATTTTTCATAGGCTTTGACAAGAATGCACAAGTAAATTGGGAGCTCTGGAGAGAAGTGGAAGCTTATGGTGATATTCAGTTGATGCCTTTTGTTGATTATTACAGTCTGATCACTTTGAAAACAATTGCAATTTGCATTTTTGGGACCAAGATCCTTCCTGCAAAATATATCATGAAGACAGATGATGATGCGTTTGTTAGAATTGATGAAGTTCTTTCTGGACTAAAGAGCAGACCAGCTTCTGGCCTTCTGTATGGTCTTATTTCCTTTGATTCATCACCCGATAGAGATAAAGACAGCAAGTGGCATATTAGTATGGAGGAATGGCCAAATGCGACATACCCTCCGTGGGCGCATGGTCCAGGCTACGTCATATCACGAGACATCGCTAAATTCATTGTCCGAGGCCACCAGAGTAGAGCCCTCAAGCTTTTTAAGCTGGAAGATGTTGCAATGGGCATATGGATTGAGCAATTCAGCAAGGGTGGCAAGGAGGTACAGTACATAAATGAAGAAAGGTTTTACAACTCCGGCTGTGAAGCCAATTACATTCTTGCTCATTACCAAAGCCCAAGATTGGTACTATGCCTTTGGGAAAGGCTGCAAAAACAATTTGATTCCACTTGCTAA
Protein sequence
MKRWYGGTLILALATILALRYGLMNIQPKKQSAYDFFRNHPTKDSHSKNSDSLEAEVVKTSERPHLIHIEGLRYLIAPDNITKRASEALLLWSHMHPLLMRSDALPETIQGVKEASTAWNDLLSAIKAEKTIIVGNMSKGEICPSSVTSPDKIAPTGGIVLEIPCGLVEDSSITLVGIPNGQQGGFQIELLGSQASGEPNRPIILHYNVSLPGDNMSEESFIVQNTWTDELKWGKEERCPTHLSASSHQVDGLVLCNERVLRSTGAENISMHHNNGNTVTNVSRGQSHESTNFPFIEGNLFTATLWIGLEGFHMNVNGRHETSFEYREKLEPWTVNQVKVTGGLDLLSSFAKGLPVFEDHDFINSSHLGAPPIPKKRLLMLVGVFSTGNNFKRRMALRRTWMQYEVVRSGDVAVRFFIGFDKNAQVNWELWREVEAYGDIQLMPFVDYYSLITLKTIAICIFGTKILPAKYIMKTDDDAFVRIDEVLSGLKSRPASGLLYGLISFDSSPDRDKDSKWHISMEEWPNATYPPWAHGPGYVISRDIAKFIVRGHQSRALKLFKLEDVAMGIWIEQFSKGGKEVQYINEERFYNSGCEANYILAHYQSPRLVLCLWERLQKQFDSTC
Homology
BLAST of CmaCh06G015780 vs. ExPASy Swiss-Prot
Match:
Q9ASW1 (Hydroxyproline O-galactosyltransferase GALT3 OS=Arabidopsis thaliana OX=3702 GN=GALT3 PE=2 SV=1)
HSP 1 Score: 672.5 bits (1734), Expect = 4.5e-192
Identity = 348/632 (55.06%), Postives = 437/632 (69.15%), Query Frame = 0
Query: 1 MKRWYGGTLILALATILALRYGLMNIQPKKQSAYDFFRNHPTKDSHSKNSDSLEAEVV-K 60
M+ W G I+ L I +RY +QS + +H+ + S+E E V +
Sbjct: 19 MRDWSVGVSIMVLTLIFIIRY--------EQSDH----------THTVDDSSIEGESVHE 78
Query: 61 TSERPHLIHIEGLRYLIAPDNI--TKRASEALLLWSHMHPLLMRSDALPETIQGVKEAST 120
+++PH + +E L YL + + + S +L+WS M P L R DALPET QG++EA+
Sbjct: 79 PAKKPHFMTLEDLDYLFSNKSFFGEEEVSNGMLVWSRMRPFLERPDALPETAQGIEEATL 138
Query: 121 AWNDLLSAIKAEKTIIVGNMSKGE---ICPSSVTSPDK-IAPTGGIVLEIPCGLVEDSSI 180
A L+ I EK M E ICP VT+ DK ++ ++LE+PCGL+EDSSI
Sbjct: 139 AMKGLVLEINREKRAYSSGMVSKEIRRICPDFVTAFDKDLSGLSHVLLELPCGLIEDSSI 198
Query: 181 TLVGIPNGQQGGFQIELLGSQASGEPNRPIILHYNVSLPGDNMSEESFIVQNTWTDELKW 240
TLVGIP+ FQI+L+GS SGE RPIIL YNV N S+ S IVQNTWT++L W
Sbjct: 199 TLVGIPDEHSSSFQIQLVGSGLSGETRRPIILRYNV-----NFSKPS-IVQNTWTEKLGW 258
Query: 241 GKEERCPTHLSASSHQVDGLVLCNERVLRSTGAENISMHHNNGNTVTNVSRGQSHESTNF 300
G EERC H S +H VD L LCN++ R IS +N + +S + NF
Sbjct: 259 GNEERCQYHGSLKNHLVDELPLCNKQTGRI-----ISEKSSNDDATMELSL----SNANF 318
Query: 301 PFIEGNLFTATLWIGLEGFHMNVNGRHETSFEYREKLEPWTVNQVKVTGGLDLLSSFAKG 360
PF++G+ FTA LW GLEGFHM +NGRHETSF YREKLEPW V+ VKV+GGL +LS A
Sbjct: 319 PFLKGSPFTAALWFGLEGFHMTINGRHETSFAYREKLEPWLVSAVKVSGGLKILSVLATR 378
Query: 361 LPVFEDH-DFINSSHLGAPPIPKKRLLMLVGVFSTGNNFKRRMALRRTWMQYEVVRSGDV 420
LP+ +DH I L AP + R+ +LVGVFSTGNNFKRRMALRR+WMQYE VRSG V
Sbjct: 379 LPIPDDHASLIIEEKLKAPSLSGTRIELLVGVFSTGNNFKRRMALRRSWMQYEAVRSGKV 438
Query: 421 AVRFFIGFDKNAQVNWELWREVEAYGDIQLMPFVDYYSLITLKTIAICIFGTKILPAKYI 480
AVRF IG N +VN E+WRE +AYGDIQ MPFVDYY L++LKT+A+CI GTK++PAKYI
Sbjct: 439 AVRFLIGLHTNEKVNLEMWRESKAYGDIQFMPFVDYYGLLSLKTVALCILGTKVIPAKYI 498
Query: 481 MKTDDDAFVRIDEVLSGLKSRPASGLLYGLISFDSSPDRDKDSKWHISMEEWPNATYPPW 540
MKTDDDAFVRIDE+LS L+ RP+S LLYGLISFDSSPDR++ SKW I EEWP +YPPW
Sbjct: 499 MKTDDDAFVRIDELLSSLEERPSSALLYGLISFDSSPDREQGSKWFIPKEEWPLDSYPPW 558
Query: 541 AHGPGYVISRDIAKFIVRGHQSRALKLFKLEDVAMGIWIEQFSKGGKEVQYINEERFYNS 600
AHGPGY+IS DIAKF+V+GH+ R L LFKLEDVAMGIWI+QF++ K V+YIN++RF+NS
Sbjct: 559 AHGPGYIISHDIAKFVVKGHRQRDLGLFKLEDVAMGIWIQQFNQTIKRVKYINDKRFHNS 617
Query: 601 GCEANYILAHYQSPRLVLCLWERLQKQFDSTC 625
C++NYIL HYQ+PRL+LCLWE+LQK+ S C
Sbjct: 619 DCKSNYILVHYQTPRLILCLWEKLQKENQSIC 617
BLAST of CmaCh06G015780 vs. ExPASy Swiss-Prot
Match:
Q8L7F9 (Beta-1,3-galactosyltransferase GALT1 OS=Arabidopsis thaliana OX=3702 GN=GALT1 PE=1 SV=1)
HSP 1 Score: 536.6 bits (1381), Expect = 3.8e-151
Identity = 285/639 (44.60%), Postives = 390/639 (61.03%), Query Frame = 0
Query: 1 MKRWYGGTLILALATILAL-RYGLMNIQPKK----QSAYDFFRNHPTKDSH--SKNSDSL 60
MKR+YGG L++++ L + RY +N +K +A + T
Sbjct: 1 MKRFYGGLLVVSMCMFLTVYRYVDLNTPVEKPYITAAASVVVTPNTTLPMEWLRITLPDF 60
Query: 61 EAEVVKTSERPHLIHIEGLRYLIAPDNITKRASEALLLWSHMHPLLMRSDALPETIQGVK 120
E T E I + L N++K E LL W+ + L+ + +L + +K
Sbjct: 61 MKEARNTQEAISGDDIAVVSGLFVEQNVSKEEREPLLTWNRLESLVDNAQSLVNGVDAIK 120
Query: 121 EASTAWNDLLSAIKAEKTIIVG----NMSKGEICPSSVTSPDKIAPTG-GIVLEIPCGLV 180
EA W L+SA++A+K + V K E+CP ++ + G + L+IPCGL
Sbjct: 121 EAGIVWESLVSAVEAKKLVDVNENQTRKGKEELCPQFLSKMNATEADGSSLKLQIPCGLT 180
Query: 181 EDSSITLVGIPNGQQGGFQIELLGSQASGEPNRPIILHYNVSLPGDNMSEESFIVQNTWT 240
+ SSIT++GIP+G G F+I+L G GEP+ PII+HYNV L GD +E+ IVQN+WT
Sbjct: 181 QGSSITVIGIPDGLVGSFRIDLTGQPLPGEPDPPIIVHYNVRLLGDKSTEDPVIVQNSWT 240
Query: 241 DELKWGKEERCPTHLSASSHQVDGLVLCNERVLRSTGAENISMHHNNGNTVTNVSRGQSH 300
WG EERCP + +VD L CN+ V + + +N + V+R S
Sbjct: 241 ASQDWGAEERCPKFDPDMNKKVDDLDECNKMVGGEINRTSSTSLQSNTSRGVPVAREASK 300
Query: 301 ESTNFPFIEGNLFTATLWIGLEGFHMNVNGRHETSFEYREKLEPWTVNQVKVTGGLDLLS 360
FPF +G L ATL +G EG M V+G+H TSF +R+ LEPW V+++++TG L+S
Sbjct: 301 HEKYFPFKQGFLSVATLRVGTEGMQMTVDGKHITSFAFRDTLEPWLVSEIRITGDFRLIS 360
Query: 361 SFAKGLPVFEDHD-FINSSHLGAPPI-PKKRLLMLVGVFSTGNNFKRRMALRRTWMQYEV 420
A GLP E+ + ++ L +P + P + L +++GVFST NNFKRRMA+RRTWMQY+
Sbjct: 361 ILASGLPTSEESEHVVDLEALKSPTLSPLRPLDLVIGVFSTANNFKRRMAVRRTWMQYDD 420
Query: 421 VRSGDVAVRFFIGFDKNAQVNWELWREVEAYGDIQLMPFVDYYSLITLKTIAICIFGTKI 480
VRSG VAVRFF+G K+ VN ELW E YGD+QLMPFVDYYSLI+ KT+AICIFGT++
Sbjct: 421 VRSGRVAVRFFVGLHKSPLVNLELWNEARTYGDVQLMPFVDYYSLISWKTLAICIFGTEV 480
Query: 481 LPAKYIMKTDDDAFVRIDEVLSGLK-SRPASGLLYGLISFDSSPDRDKDSKWHISMEEWP 540
AK+IMKTDDDAFVR+DEVL L + GL+YGLI+ DS P R+ DSKW+IS EEWP
Sbjct: 481 DSAKFIMKTDDDAFVRVDEVLLSLSMTNNTRGLIYGLINSDSQPIRNPDSKWYISYEEWP 540
Query: 541 NATYPPWAHGPGYVISRDIAKFIVRGHQSRALKLFKLEDVAMGIWIEQFSKGGKEVQYIN 600
YPPWAHGPGY++SRDIA+ + + + LK+FKLEDVAMGIWI + +K G E Y N
Sbjct: 541 EEKYPPWAHGPGYIVSRDIAESVGKLFKEGNLKMFKLEDVAMGIWIAELTKHGLEPHYEN 600
Query: 601 EERFYNSGCEANYILAHYQSPRLVLCLWERLQKQFDSTC 625
+ R + GC+ Y++AHYQSP + CLW + Q+ S C
Sbjct: 601 DGRIISDGCKDGYVVAHYQSPAEMTCLWRKYQETKRSLC 639
BLAST of CmaCh06G015780 vs. ExPASy Swiss-Prot
Match:
Q9LV16 (Hydroxyproline O-galactosyltransferase GALT6 OS=Arabidopsis thaliana OX=3702 GN=GALT6 PE=2 SV=2)
HSP 1 Score: 347.4 bits (890), Expect = 3.3e-94
Identity = 200/507 (39.45%), Postives = 287/507 (56.61%), Query Frame = 0
Query: 143 CPSSVTSPDKIAPTGGIVLEIPCGLVEDSSITLVGIPNG-------------------QQ 202
C SV+ G ++E+PCGL S IT+VG P +
Sbjct: 171 CSLSVSLTGSDLLKRGNIMELPCGLTLGSHITVVGKPRAAHSEKDPKISMLKEGDEAVKV 230
Query: 203 GGFQIELLGSQASGEPNRPIILHYNVSLPGDNMSEESFIVQNTWTDELKWGKEERCPTHL 262
F++EL G +A P ILH N L GD S + I QNT ++WG +RC
Sbjct: 231 SQFKLELQGLKAVEGEEPPRILHLNPRLKGD-WSGKPVIEQNT-CYRMQWGSAQRCEGWR 290
Query: 263 SASSHQ-VDGLVLCNERVLRSTGAENISMHHNNGNTVTN--VSR--GQSHEST---NFPF 322
S + VDG V C E+ R ++I+ + + +SR G+S + T FPF
Sbjct: 291 SRDDEETVDGQVKC-EKWARD---DSITSKEEESSKAASWWLSRLIGRSKKVTVEWPFPF 350
Query: 323 IEGNLFTATLWIGLEGFHMNVNGRHETSFEYREKLEPWTVNQVKVTGGLDLLSSFAKGLP 382
LF TL GLEG+H++V+G+H TSF YR + + G +D+ S FA LP
Sbjct: 351 TVDKLFVLTLSAGLEGYHVSVDGKHVTSFPYRTGFTLEDATGLTINGDIDVHSVFAGSLP 410
Query: 383 V----FEDHDFIN-SSHLGAPPIPKKRLLMLVGVFSTGNNFKRRMALRRTWMQYEVVRSG 442
F + SS+ AP +P +++ M +G+ S GN+F RMA+RR+WMQ+++V+S
Sbjct: 411 TSHPSFSPQRHLELSSNWQAPSLPDEQVDMFIGILSAGNHFAERMAVRRSWMQHKLVKSS 470
Query: 443 DVAVRFFIGFDKNAQVNWELWREVEAYGDIQLMPFVDYYSLITLKTIAICIFGTKILPAK 502
V RFF+ +VN EL +E E +GDI ++P++D Y L+ LKT+AIC +G L AK
Sbjct: 471 KVVARFFVALHSRKEVNVELKKEAEFFGDIVIVPYMDSYDLVVLKTVAICEYGAHQLAAK 530
Query: 503 YIMKTDDDAFVRIDEVLSGLKSRPASGLLY-GLISFDSSPDRDKDSKWHISMEEWPNATY 562
+IMK DDD FV++D VLS K P LY G I++ P R KW ++ EEWP Y
Sbjct: 531 FIMKCDDDTFVQVDAVLSEAKKTPTDRSLYIGNINYYHKPLR--QGKWSVTYEEWPEEDY 590
Query: 563 PPWAHGPGYVISRDIAKFIVRGHQSRALKLFKLEDVAMGIWIEQFSKGGKEVQYINEERF 617
PP+A+GPGY++S DI++FIV+ + L++FK+EDV++G+W+EQF+ G K V YI+ RF
Sbjct: 591 PPYANGPGYILSNDISRFIVKEFEKHKLRMFKMEDVSVGMWVEQFNNGTKPVDYIHSLRF 650
BLAST of CmaCh06G015780 vs. ExPASy Swiss-Prot
Match:
Q8GXG6 (Hydroxyproline O-galactosyltransferase GALT4 OS=Arabidopsis thaliana OX=3702 GN=GALT4 PE=2 SV=2)
HSP 1 Score: 337.8 bits (865), Expect = 2.6e-91
Identity = 198/538 (36.80%), Postives = 290/538 (53.90%), Query Frame = 0
Query: 119 WNDLLSA-IKAEKTIIVGNMSKGEICPSSVTSPDKIAPTGGIVLEIPCGLVEDSSITLVG 178
W+ L S IK +K + + K CP V+ + +L +PCGL S IT+V
Sbjct: 147 WDGLDSGLIKPDKAPVKTRIEK---CPDMVSVSESEFVNRSRILVLPCGLTLGSHITVVA 206
Query: 179 IPN-----------GQQGGFQIELLGSQASGEPNRPIILHYNVSLPGDNMSEESFIVQNT 238
P+ F +EL G +A + P ILH+N + GD S I QNT
Sbjct: 207 TPHWAHVEKDGDKTAMVSQFMMELQGLKAVDGEDPPRILHFNPRIKGD-WSGRPVIEQNT 266
Query: 239 WTDELKWGKEERCPTHLSASSHQ-VDGLVLCNERVLRSTGAENISMHHNNG--------- 298
++WG RC S+ + VDG V C ER R NNG
Sbjct: 267 -CYRMQWGSGLRCDGRESSDDEEYVDGEVKC-ERWKRDDDDGG-----NNGDDFDESKKT 326
Query: 299 ---NTVTNVSRGQSHESTNFPFIEGNLFTATLWIGLEGFHMNVNGRHETSFEYREKLEPW 358
N + + ++PF EG LF TL G+EG+H++VNGRH TSF YR
Sbjct: 327 WWLNRLMGRRKKMITHDWDYPFAEGKLFVLTLRAGMEGYHISVNGRHITSFPYRTGFVLE 386
Query: 359 TVNQVKVTGGLDLLSSFAKGLPVFEDHDFINSSHL------GAPPIPKKRLLMLVGVFST 418
+ V G +D+ S +A LP + F HL AP +P+K + + +G+ S
Sbjct: 387 DATGLAVKGNIDVHSVYAASLP-STNPSFAPQKHLEMQRIWKAPSLPQKPVELFIGILSA 446
Query: 419 GNNFKRRMALRRTWMQYEVVRSGDVAVRFFIGFDKNAQVNWELWREVEAYGDIQLMPFVD 478
GN+F RMA+R++WMQ ++VRS V RFF+ +VN +L +E E +GDI ++P++D
Sbjct: 447 GNHFAERMAVRKSWMQQKLVRSSKVVARFFVALHARKEVNVDLKKEAEYFGDIVIVPYMD 506
Query: 479 YYSLITLKTIAICIFGTKILPAKYIMKTDDDAFVRIDEVL-SGLKSRPASGLLYGLISFD 538
+Y L+ LKT+AIC +G + AKY+MK DDD FVR+D V+ K + L G I+F+
Sbjct: 507 HYDLVVLKTVAICEYGVNTVAAKYVMKCDDDTFVRVDAVIQEAEKVKGRESLYIGNINFN 566
Query: 539 SSPDRDKDSKWHISMEEWPNATYPPWAHGPGYVISRDIAKFIVRGHQSRALKLFKLEDVA 598
P R KW ++ EEWP YPP+A+GPGY++S D+AKFIV + + L+LFK+EDV+
Sbjct: 567 HKPLR--TGKWAVTFEEWPEEYYPPYANGPGYILSYDVAKFIVDDFEQKRLRLFKMEDVS 626
Query: 599 MGIWIEQFSKGGKEVQYINEERFYNSGCEANYILAHYQSPRLVLCLWERLQKQFDSTC 625
MG+W+E+F++ + V ++ +F GC +Y AHYQSPR ++C+W++LQ+ C
Sbjct: 627 MGMWVEKFNE-TRPVAVVHSLKFCQFGCIEDYFTAHYQSPRQMICMWDKLQRLGKPQC 669
BLAST of CmaCh06G015780 vs. ExPASy Swiss-Prot
Match:
A7XDQ9 (Hydroxyproline O-galactosyltransferase GALT2 OS=Arabidopsis thaliana OX=3702 GN=GALT2 PE=1 SV=1)
HSP 1 Score: 337.4 bits (864), Expect = 3.4e-91
Identity = 201/544 (36.95%), Postives = 282/544 (51.84%), Query Frame = 0
Query: 116 STAWNDL----LSAIKAEKTIIVGNMSKGEICPSSVTSPDKIAPTGGIVLEIPCGLVEDS 175
S AW D+ + I +I G K E CPS ++ ++ +PCGL S
Sbjct: 147 SKAWEDVDKFEVDKINESASIFEG---KVESCPSQISMNGDDLNKANRIMLLPCGLAAGS 206
Query: 176 SITLVGIPNGQQ-------------------GGFQIELLGSQASGEPNRPIILHYNVSLP 235
SIT++G P F +EL G + P ILH N +
Sbjct: 207 SITILGTPQYAHKESVPQRSRLTRSYGMVLVSQFMVELQGLKTGDGEYPPKILHLNPRIK 266
Query: 236 GDNMSEESFIVQNTWTDELKWGKEERCPTHLSASSHQVDGLVLCNERVLRSTGAENISMH 295
GD + I NT ++WG +RC + S D LV R + T + I M
Sbjct: 267 GD-WNHRPVIEHNT-CYRMQWGVAQRCDG--TPSKKDADVLVDGFRRCEKWTQNDIIDMV 326
Query: 296 HNNGNTVTN-----VSRGQSHEST-NFPFIEGNLFTATLWIGLEGFHMNVNGRHETSFEY 355
+ + T+ + R Q E T +FPF EG +F TL G++GFH+NV GRH +SF Y
Sbjct: 327 DSKESKTTSWFKRFIGREQKPEVTWSFPFAEGKVFVLTLRAGIDGFHINVGGRHVSSFPY 386
Query: 356 REKLEPWTVNQVKVTGGLDLLSSFAKGL----PVFEDHDFIN-SSHLGAPPIPKKRLLML 415
R + VTG +D+ S A L P F I SS APP+P +
Sbjct: 387 RPGFTIEDATGLAVTGDVDIHSIHATSLSTSHPSFSPQKAIEFSSEWKAPPLPGTPFRLF 446
Query: 416 VGVFSTGNNFKRRMALRRTWMQYEVVRSGDVAVRFFIGFDKNAQVNWELWREVEAYGDIQ 475
+GV S N+F RMA+R+TWMQ+ ++S DV RFF+ + +VN L +E E +GDI
Sbjct: 447 MGVLSATNHFSERMAVRKTWMQHPSIKSSDVVARFFVALNPRKEVNAMLKKEAEYFGDIV 506
Query: 476 LMPFVDYYSLITLKTIAICIFGTKILPAKYIMKTDDDAFVRIDEVLSGLKS-RPASGLLY 535
++PF+D Y L+ LKTIAIC FG + + A YIMK DDD F+R++ +L + P L
Sbjct: 507 ILPFMDRYELVVLKTIAICEFGVQNVTAPYIMKCDDDTFIRVESILKQIDGVSPEKSLYM 566
Query: 536 GLISFDSSPDRDKDSKWHISMEEWPNATYPPWAHGPGYVISRDIAKFIVRGHQSRALKLF 595
G ++ P R KW ++ EEWP A YPP+A+GPGY+IS +IAK+IV + L+LF
Sbjct: 567 GNLNLRHRPLR--TGKWTVTWEEWPEAVYPPYANGPGYIISSNIAKYIVSQNSRHKLRLF 626
Query: 596 KLEDVAMGIWIEQFSKGGKEVQYINEERFYNSGCEANYILAHYQSPRLVLCLWERLQKQF 625
K+EDV+MG+W+EQF+ + V+Y + +F GC NY AHYQSP ++CLW+ L K
Sbjct: 627 KMEDVSMGLWVEQFNASMQPVEYSHSWKFCQYGCTLNYYTAHYQSPSQMMCLWDNLLKGR 681
BLAST of CmaCh06G015780 vs. TAIR 10
Match:
AT3G06440.1 (Galactosyltransferase family protein )
HSP 1 Score: 672.5 bits (1734), Expect = 3.2e-193
Identity = 348/632 (55.06%), Postives = 437/632 (69.15%), Query Frame = 0
Query: 1 MKRWYGGTLILALATILALRYGLMNIQPKKQSAYDFFRNHPTKDSHSKNSDSLEAEVV-K 60
M+ W G I+ L I +RY +QS + +H+ + S+E E V +
Sbjct: 19 MRDWSVGVSIMVLTLIFIIRY--------EQSDH----------THTVDDSSIEGESVHE 78
Query: 61 TSERPHLIHIEGLRYLIAPDNI--TKRASEALLLWSHMHPLLMRSDALPETIQGVKEAST 120
+++PH + +E L YL + + + S +L+WS M P L R DALPET QG++EA+
Sbjct: 79 PAKKPHFMTLEDLDYLFSNKSFFGEEEVSNGMLVWSRMRPFLERPDALPETAQGIEEATL 138
Query: 121 AWNDLLSAIKAEKTIIVGNMSKGE---ICPSSVTSPDK-IAPTGGIVLEIPCGLVEDSSI 180
A L+ I EK M E ICP VT+ DK ++ ++LE+PCGL+EDSSI
Sbjct: 139 AMKGLVLEINREKRAYSSGMVSKEIRRICPDFVTAFDKDLSGLSHVLLELPCGLIEDSSI 198
Query: 181 TLVGIPNGQQGGFQIELLGSQASGEPNRPIILHYNVSLPGDNMSEESFIVQNTWTDELKW 240
TLVGIP+ FQI+L+GS SGE RPIIL YNV N S+ S IVQNTWT++L W
Sbjct: 199 TLVGIPDEHSSSFQIQLVGSGLSGETRRPIILRYNV-----NFSKPS-IVQNTWTEKLGW 258
Query: 241 GKEERCPTHLSASSHQVDGLVLCNERVLRSTGAENISMHHNNGNTVTNVSRGQSHESTNF 300
G EERC H S +H VD L LCN++ R IS +N + +S + NF
Sbjct: 259 GNEERCQYHGSLKNHLVDELPLCNKQTGRI-----ISEKSSNDDATMELSL----SNANF 318
Query: 301 PFIEGNLFTATLWIGLEGFHMNVNGRHETSFEYREKLEPWTVNQVKVTGGLDLLSSFAKG 360
PF++G+ FTA LW GLEGFHM +NGRHETSF YREKLEPW V+ VKV+GGL +LS A
Sbjct: 319 PFLKGSPFTAALWFGLEGFHMTINGRHETSFAYREKLEPWLVSAVKVSGGLKILSVLATR 378
Query: 361 LPVFEDH-DFINSSHLGAPPIPKKRLLMLVGVFSTGNNFKRRMALRRTWMQYEVVRSGDV 420
LP+ +DH I L AP + R+ +LVGVFSTGNNFKRRMALRR+WMQYE VRSG V
Sbjct: 379 LPIPDDHASLIIEEKLKAPSLSGTRIELLVGVFSTGNNFKRRMALRRSWMQYEAVRSGKV 438
Query: 421 AVRFFIGFDKNAQVNWELWREVEAYGDIQLMPFVDYYSLITLKTIAICIFGTKILPAKYI 480
AVRF IG N +VN E+WRE +AYGDIQ MPFVDYY L++LKT+A+CI GTK++PAKYI
Sbjct: 439 AVRFLIGLHTNEKVNLEMWRESKAYGDIQFMPFVDYYGLLSLKTVALCILGTKVIPAKYI 498
Query: 481 MKTDDDAFVRIDEVLSGLKSRPASGLLYGLISFDSSPDRDKDSKWHISMEEWPNATYPPW 540
MKTDDDAFVRIDE+LS L+ RP+S LLYGLISFDSSPDR++ SKW I EEWP +YPPW
Sbjct: 499 MKTDDDAFVRIDELLSSLEERPSSALLYGLISFDSSPDREQGSKWFIPKEEWPLDSYPPW 558
Query: 541 AHGPGYVISRDIAKFIVRGHQSRALKLFKLEDVAMGIWIEQFSKGGKEVQYINEERFYNS 600
AHGPGY+IS DIAKF+V+GH+ R L LFKLEDVAMGIWI+QF++ K V+YIN++RF+NS
Sbjct: 559 AHGPGYIISHDIAKFVVKGHRQRDLGLFKLEDVAMGIWIQQFNQTIKRVKYINDKRFHNS 617
Query: 601 GCEANYILAHYQSPRLVLCLWERLQKQFDSTC 625
C++NYIL HYQ+PRL+LCLWE+LQK+ S C
Sbjct: 619 DCKSNYILVHYQTPRLILCLWEKLQKENQSIC 617
BLAST of CmaCh06G015780 vs. TAIR 10
Match:
AT3G06440.2 (Galactosyltransferase family protein )
HSP 1 Score: 571.2 bits (1471), Expect = 9.9e-163
Identity = 303/565 (53.63%), Postives = 379/565 (67.08%), Query Frame = 0
Query: 1 MKRWYGGTLILALATILALRYGLMNIQPKKQSAYDFFRNHPTKDSHSKNSDSLEAEVV-K 60
M+ W G I+ L I +RY +QS + +H+ + S+E E V +
Sbjct: 19 MRDWSVGVSIMVLTLIFIIRY--------EQSDH----------THTVDDSSIEGESVHE 78
Query: 61 TSERPHLIHIEGLRYLIAPDNI--TKRASEALLLWSHMHPLLMRSDALPETIQGVKEAST 120
+++PH + +E L YL + + + S +L+WS M P L R DALPET QG++EA+
Sbjct: 79 PAKKPHFMTLEDLDYLFSNKSFFGEEEVSNGMLVWSRMRPFLERPDALPETAQGIEEATL 138
Query: 121 AWNDLLSAIKAEKTIIVGNMSKGE---ICPSSVTSPDK-IAPTGGIVLEIPCGLVEDSSI 180
A L+ I EK M E ICP VT+ DK ++ ++LE+PCGL+EDSSI
Sbjct: 139 AMKGLVLEINREKRAYSSGMVSKEIRRICPDFVTAFDKDLSGLSHVLLELPCGLIEDSSI 198
Query: 181 TLVGIPNGQQGGFQIELLGSQASGEPNRPIILHYNVSLPGDNMSEESFIVQNTWTDELKW 240
TLVGIP+ FQI+L+GS SGE RPIIL YNV N S+ S IVQNTWT++L W
Sbjct: 199 TLVGIPDEHSSSFQIQLVGSGLSGETRRPIILRYNV-----NFSKPS-IVQNTWTEKLGW 258
Query: 241 GKEERCPTHLSASSHQVDGLVLCNERVLRSTGAENISMHHNNGNTVTNVSRGQSHESTNF 300
G EERC H S +H VD L LCN++ R IS +N + +S + NF
Sbjct: 259 GNEERCQYHGSLKNHLVDELPLCNKQTGRI-----ISEKSSNDDATMELSL----SNANF 318
Query: 301 PFIEGNLFTATLWIGLEGFHMNVNGRHETSFEYREKLEPWTVNQVKVTGGLDLLSSFAKG 360
PF++G+ FTA LW GLEGFHM +NGRHETSF YREKLEPW V+ VKV+GGL +LS A
Sbjct: 319 PFLKGSPFTAALWFGLEGFHMTINGRHETSFAYREKLEPWLVSAVKVSGGLKILSVLATR 378
Query: 361 LPVFEDH-DFINSSHLGAPPIPKKRLLMLVGVFSTGNNFKRRMALRRTWMQYEVVRSGDV 420
LP+ +DH I L AP + R+ +LVGVFSTGNNFKRRMALRR+WMQYE VRSG V
Sbjct: 379 LPIPDDHASLIIEEKLKAPSLSGTRIELLVGVFSTGNNFKRRMALRRSWMQYEAVRSGKV 438
Query: 421 AVRFFIGFDKNAQVNWELWREVEAYGDIQLMPFVDYYSLITLKTIAICIFGTKILPAKYI 480
AVRF IG N +VN E+WRE +AYGDIQ MPFVDYY L++LKT+A+CI GTK++PAKYI
Sbjct: 439 AVRFLIGLHTNEKVNLEMWRESKAYGDIQFMPFVDYYGLLSLKTVALCILGTKVIPAKYI 498
Query: 481 MKTDDDAFVRIDEVLSGLKSRPASGLLYGLISFDSSPDRDKDSKWHISMEEWPNATYPPW 540
MKTDDDAFVRIDE+LS L+ RP+S LLYGLISFDSSPDR++ SKW I EEWP +YPPW
Sbjct: 499 MKTDDDAFVRIDELLSSLEERPSSALLYGLISFDSSPDREQGSKWFIPKEEWPLDSYPPW 550
Query: 541 AHGPGYVISRDIAKFIVRGHQSRAL 558
AHGPGY+IS DIAKF+V+GH+ R L
Sbjct: 559 AHGPGYIISHDIAKFVVKGHRQRDL 550
BLAST of CmaCh06G015780 vs. TAIR 10
Match:
AT1G26810.1 (galactosyltransferase1 )
HSP 1 Score: 536.6 bits (1381), Expect = 2.7e-152
Identity = 285/639 (44.60%), Postives = 390/639 (61.03%), Query Frame = 0
Query: 1 MKRWYGGTLILALATILAL-RYGLMNIQPKK----QSAYDFFRNHPTKDSH--SKNSDSL 60
MKR+YGG L++++ L + RY +N +K +A + T
Sbjct: 1 MKRFYGGLLVVSMCMFLTVYRYVDLNTPVEKPYITAAASVVVTPNTTLPMEWLRITLPDF 60
Query: 61 EAEVVKTSERPHLIHIEGLRYLIAPDNITKRASEALLLWSHMHPLLMRSDALPETIQGVK 120
E T E I + L N++K E LL W+ + L+ + +L + +K
Sbjct: 61 MKEARNTQEAISGDDIAVVSGLFVEQNVSKEEREPLLTWNRLESLVDNAQSLVNGVDAIK 120
Query: 121 EASTAWNDLLSAIKAEKTIIVG----NMSKGEICPSSVTSPDKIAPTG-GIVLEIPCGLV 180
EA W L+SA++A+K + V K E+CP ++ + G + L+IPCGL
Sbjct: 121 EAGIVWESLVSAVEAKKLVDVNENQTRKGKEELCPQFLSKMNATEADGSSLKLQIPCGLT 180
Query: 181 EDSSITLVGIPNGQQGGFQIELLGSQASGEPNRPIILHYNVSLPGDNMSEESFIVQNTWT 240
+ SSIT++GIP+G G F+I+L G GEP+ PII+HYNV L GD +E+ IVQN+WT
Sbjct: 181 QGSSITVIGIPDGLVGSFRIDLTGQPLPGEPDPPIIVHYNVRLLGDKSTEDPVIVQNSWT 240
Query: 241 DELKWGKEERCPTHLSASSHQVDGLVLCNERVLRSTGAENISMHHNNGNTVTNVSRGQSH 300
WG EERCP + +VD L CN+ V + + +N + V+R S
Sbjct: 241 ASQDWGAEERCPKFDPDMNKKVDDLDECNKMVGGEINRTSSTSLQSNTSRGVPVAREASK 300
Query: 301 ESTNFPFIEGNLFTATLWIGLEGFHMNVNGRHETSFEYREKLEPWTVNQVKVTGGLDLLS 360
FPF +G L ATL +G EG M V+G+H TSF +R+ LEPW V+++++TG L+S
Sbjct: 301 HEKYFPFKQGFLSVATLRVGTEGMQMTVDGKHITSFAFRDTLEPWLVSEIRITGDFRLIS 360
Query: 361 SFAKGLPVFEDHD-FINSSHLGAPPI-PKKRLLMLVGVFSTGNNFKRRMALRRTWMQYEV 420
A GLP E+ + ++ L +P + P + L +++GVFST NNFKRRMA+RRTWMQY+
Sbjct: 361 ILASGLPTSEESEHVVDLEALKSPTLSPLRPLDLVIGVFSTANNFKRRMAVRRTWMQYDD 420
Query: 421 VRSGDVAVRFFIGFDKNAQVNWELWREVEAYGDIQLMPFVDYYSLITLKTIAICIFGTKI 480
VRSG VAVRFF+G K+ VN ELW E YGD+QLMPFVDYYSLI+ KT+AICIFGT++
Sbjct: 421 VRSGRVAVRFFVGLHKSPLVNLELWNEARTYGDVQLMPFVDYYSLISWKTLAICIFGTEV 480
Query: 481 LPAKYIMKTDDDAFVRIDEVLSGLK-SRPASGLLYGLISFDSSPDRDKDSKWHISMEEWP 540
AK+IMKTDDDAFVR+DEVL L + GL+YGLI+ DS P R+ DSKW+IS EEWP
Sbjct: 481 DSAKFIMKTDDDAFVRVDEVLLSLSMTNNTRGLIYGLINSDSQPIRNPDSKWYISYEEWP 540
Query: 541 NATYPPWAHGPGYVISRDIAKFIVRGHQSRALKLFKLEDVAMGIWIEQFSKGGKEVQYIN 600
YPPWAHGPGY++SRDIA+ + + + LK+FKLEDVAMGIWI + +K G E Y N
Sbjct: 541 EEKYPPWAHGPGYIVSRDIAESVGKLFKEGNLKMFKLEDVAMGIWIAELTKHGLEPHYEN 600
Query: 601 EERFYNSGCEANYILAHYQSPRLVLCLWERLQKQFDSTC 625
+ R + GC+ Y++AHYQSP + CLW + Q+ S C
Sbjct: 601 DGRIISDGCKDGYVVAHYQSPAEMTCLWRKYQETKRSLC 639
BLAST of CmaCh06G015780 vs. TAIR 10
Match:
AT5G62620.1 (Galactosyltransferase family protein )
HSP 1 Score: 347.4 bits (890), Expect = 2.3e-95
Identity = 200/507 (39.45%), Postives = 287/507 (56.61%), Query Frame = 0
Query: 143 CPSSVTSPDKIAPTGGIVLEIPCGLVEDSSITLVGIPNG-------------------QQ 202
C SV+ G ++E+PCGL S IT+VG P +
Sbjct: 171 CSLSVSLTGSDLLKRGNIMELPCGLTLGSHITVVGKPRAAHSEKDPKISMLKEGDEAVKV 230
Query: 203 GGFQIELLGSQASGEPNRPIILHYNVSLPGDNMSEESFIVQNTWTDELKWGKEERCPTHL 262
F++EL G +A P ILH N L GD S + I QNT ++WG +RC
Sbjct: 231 SQFKLELQGLKAVEGEEPPRILHLNPRLKGD-WSGKPVIEQNT-CYRMQWGSAQRCEGWR 290
Query: 263 SASSHQ-VDGLVLCNERVLRSTGAENISMHHNNGNTVTN--VSR--GQSHEST---NFPF 322
S + VDG V C E+ R ++I+ + + +SR G+S + T FPF
Sbjct: 291 SRDDEETVDGQVKC-EKWARD---DSITSKEEESSKAASWWLSRLIGRSKKVTVEWPFPF 350
Query: 323 IEGNLFTATLWIGLEGFHMNVNGRHETSFEYREKLEPWTVNQVKVTGGLDLLSSFAKGLP 382
LF TL GLEG+H++V+G+H TSF YR + + G +D+ S FA LP
Sbjct: 351 TVDKLFVLTLSAGLEGYHVSVDGKHVTSFPYRTGFTLEDATGLTINGDIDVHSVFAGSLP 410
Query: 383 V----FEDHDFIN-SSHLGAPPIPKKRLLMLVGVFSTGNNFKRRMALRRTWMQYEVVRSG 442
F + SS+ AP +P +++ M +G+ S GN+F RMA+RR+WMQ+++V+S
Sbjct: 411 TSHPSFSPQRHLELSSNWQAPSLPDEQVDMFIGILSAGNHFAERMAVRRSWMQHKLVKSS 470
Query: 443 DVAVRFFIGFDKNAQVNWELWREVEAYGDIQLMPFVDYYSLITLKTIAICIFGTKILPAK 502
V RFF+ +VN EL +E E +GDI ++P++D Y L+ LKT+AIC +G L AK
Sbjct: 471 KVVARFFVALHSRKEVNVELKKEAEFFGDIVIVPYMDSYDLVVLKTVAICEYGAHQLAAK 530
Query: 503 YIMKTDDDAFVRIDEVLSGLKSRPASGLLY-GLISFDSSPDRDKDSKWHISMEEWPNATY 562
+IMK DDD FV++D VLS K P LY G I++ P R KW ++ EEWP Y
Sbjct: 531 FIMKCDDDTFVQVDAVLSEAKKTPTDRSLYIGNINYYHKPLR--QGKWSVTYEEWPEEDY 590
Query: 563 PPWAHGPGYVISRDIAKFIVRGHQSRALKLFKLEDVAMGIWIEQFSKGGKEVQYINEERF 617
PP+A+GPGY++S DI++FIV+ + L++FK+EDV++G+W+EQF+ G K V YI+ RF
Sbjct: 591 PPYANGPGYILSNDISRFIVKEFEKHKLRMFKMEDVSVGMWVEQFNNGTKPVDYIHSLRF 650
BLAST of CmaCh06G015780 vs. TAIR 10
Match:
AT1G27120.1 (Galactosyltransferase family protein )
HSP 1 Score: 337.8 bits (865), Expect = 1.8e-92
Identity = 198/538 (36.80%), Postives = 290/538 (53.90%), Query Frame = 0
Query: 119 WNDLLSA-IKAEKTIIVGNMSKGEICPSSVTSPDKIAPTGGIVLEIPCGLVEDSSITLVG 178
W+ L S IK +K + + K CP V+ + +L +PCGL S IT+V
Sbjct: 147 WDGLDSGLIKPDKAPVKTRIEK---CPDMVSVSESEFVNRSRILVLPCGLTLGSHITVVA 206
Query: 179 IPN-----------GQQGGFQIELLGSQASGEPNRPIILHYNVSLPGDNMSEESFIVQNT 238
P+ F +EL G +A + P ILH+N + GD S I QNT
Sbjct: 207 TPHWAHVEKDGDKTAMVSQFMMELQGLKAVDGEDPPRILHFNPRIKGD-WSGRPVIEQNT 266
Query: 239 WTDELKWGKEERCPTHLSASSHQ-VDGLVLCNERVLRSTGAENISMHHNNG--------- 298
++WG RC S+ + VDG V C ER R NNG
Sbjct: 267 -CYRMQWGSGLRCDGRESSDDEEYVDGEVKC-ERWKRDDDDGG-----NNGDDFDESKKT 326
Query: 299 ---NTVTNVSRGQSHESTNFPFIEGNLFTATLWIGLEGFHMNVNGRHETSFEYREKLEPW 358
N + + ++PF EG LF TL G+EG+H++VNGRH TSF YR
Sbjct: 327 WWLNRLMGRRKKMITHDWDYPFAEGKLFVLTLRAGMEGYHISVNGRHITSFPYRTGFVLE 386
Query: 359 TVNQVKVTGGLDLLSSFAKGLPVFEDHDFINSSHL------GAPPIPKKRLLMLVGVFST 418
+ V G +D+ S +A LP + F HL AP +P+K + + +G+ S
Sbjct: 387 DATGLAVKGNIDVHSVYAASLP-STNPSFAPQKHLEMQRIWKAPSLPQKPVELFIGILSA 446
Query: 419 GNNFKRRMALRRTWMQYEVVRSGDVAVRFFIGFDKNAQVNWELWREVEAYGDIQLMPFVD 478
GN+F RMA+R++WMQ ++VRS V RFF+ +VN +L +E E +GDI ++P++D
Sbjct: 447 GNHFAERMAVRKSWMQQKLVRSSKVVARFFVALHARKEVNVDLKKEAEYFGDIVIVPYMD 506
Query: 479 YYSLITLKTIAICIFGTKILPAKYIMKTDDDAFVRIDEVL-SGLKSRPASGLLYGLISFD 538
+Y L+ LKT+AIC +G + AKY+MK DDD FVR+D V+ K + L G I+F+
Sbjct: 507 HYDLVVLKTVAICEYGVNTVAAKYVMKCDDDTFVRVDAVIQEAEKVKGRESLYIGNINFN 566
Query: 539 SSPDRDKDSKWHISMEEWPNATYPPWAHGPGYVISRDIAKFIVRGHQSRALKLFKLEDVA 598
P R KW ++ EEWP YPP+A+GPGY++S D+AKFIV + + L+LFK+EDV+
Sbjct: 567 HKPLR--TGKWAVTFEEWPEEYYPPYANGPGYILSYDVAKFIVDDFEQKRLRLFKMEDVS 626
Query: 599 MGIWIEQFSKGGKEVQYINEERFYNSGCEANYILAHYQSPRLVLCLWERLQKQFDSTC 625
MG+W+E+F++ + V ++ +F GC +Y AHYQSPR ++C+W++LQ+ C
Sbjct: 627 MGMWVEKFNE-TRPVAVVHSLKFCQFGCIEDYFTAHYQSPRQMICMWDKLQRLGKPQC 669
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Q9ASW1 | 4.5e-192 | 55.06 | Hydroxyproline O-galactosyltransferase GALT3 OS=Arabidopsis thaliana OX=3702 GN=... | [more] |
Q8L7F9 | 3.8e-151 | 44.60 | Beta-1,3-galactosyltransferase GALT1 OS=Arabidopsis thaliana OX=3702 GN=GALT1 PE... | [more] |
Q9LV16 | 3.3e-94 | 39.45 | Hydroxyproline O-galactosyltransferase GALT6 OS=Arabidopsis thaliana OX=3702 GN=... | [more] |
Q8GXG6 | 2.6e-91 | 36.80 | Hydroxyproline O-galactosyltransferase GALT4 OS=Arabidopsis thaliana OX=3702 GN=... | [more] |
A7XDQ9 | 3.4e-91 | 36.95 | Hydroxyproline O-galactosyltransferase GALT2 OS=Arabidopsis thaliana OX=3702 GN=... | [more] |