Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonfive_prime_UTRCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
GCTCCGGCGAAACTAGAACAAAAGCCGGAAGAAAGATACTCAGTTCGTTAATTGAAAGGAAATTACTGGAGTTGCTATGAAAATCCGGTCGGTGAAGCTCCGAGAAGCCCACAAGGCAGCCAGCAATGGCAAGGCTTCGTTTTGCTCCGTCCTGTGGGATCAACAAGCGTCCCATATTGTCACTGCTTCGTCTTCCGAATCCGCCATATCCATCCATGATTCTCTTCTTCCTTCAAATAACCCGAAGATCATTCATCATCATCGCGAAGGGGTCACGGCTCTGGCTCTTAGCCCCAATTCCACCTGCCTCGCTTCTGGATCCATGGATCGATCCGTCAAGCTCTACAAGTTTCCCGGTATTTCTTCTGGCATTCGCTAAGCTGAACTTATATTATGATTCTTTTGGTTTTTCTAATTGTCTCTAATGTGGAATCTCTGAATTTCAAAAGCTATGCTTAGGTATTGGAAATGTTAGTTTTACTATGAACTGCTTCGATTAGGTTTGGATTATGAATGATAACTTTCAGATCTGTCGAAATCTTGAATGTTGCCTCATTTTCTTTTGAGTTGGTTTGTTTCCCACAGAAATTTATTTTAGGATCAAGCTTGTTAGCTTCATAGTTCTTTGGGAATTATGTTTCTGTTAATATTACGAGGTTCGATGGTTTCAGGTGGAGAATTTGAGACCAATATCACAAGATTTACGCTGCCAATACGTACTCTTGCGTTTAATAAGTCCGGAAGCCTCTTGGCAGCAGCTGGCGAGGACGATGGTATTAAGCTAATTAACACGATTGATGGTTCAATTGCTAGGGTTCTGAAAGGACATAAAGCAGCCGTTACTGGGTTGGCGTTTGACCCCAATAGTGAGTACATGGCATCGGTGGATTCATTTGGAACTGTAATTTTCTGGGAACTCCAATCTGGAAGTATAATACATAACCTTAAAGGCATAGCTCCTAGCACAGGTACGGACCCTTCCGTTATGAATGTTTTATGCTGGAGCCCTGACGGAGAGATGTTAGCAGTACCAGGATTAAGAAATGATGTTGTGATGTATGATAGAGATACTGCTGAAAAGCTATTTTCATTGAGGGGAGATCACACCCAGCCTATTTGTTTCCTGTCATGGTCACCTAACGGAAAATACATGGCTACTTCTTCCTTGGATAGGCAAATTCTGATATGGGATGTTGATCAAAAGCTGGATATTGATAGACAGAAATTTGACGAGAGAATATGTTGCATGGCATGGAAGCCAATTGGCAATGCGTTGGCTGTTATTGATGTCATGGGTAAGTATGGAGTGTGGGAATCAGTTGTTCCATCTTCTATGAAATCTCCAACTGAAGATATCCCAAAATTGCAATCAAGGAATAGCAACGGCTTGCTCTTGTTTGATGAAGAGGATGAGGAGCCTAGTGAACATGGCAATTTGAGTGATCTTGGAGAAGATAGCCTTTATGAATCCGAATTTACAACTCGAAAAAGGTTGCGTAAGCATTCAACAAATGAAGAAATCCTGGATGAGGCTGATAGTGAAGATTTTAGCCTTCTCCCAAAGTCGGAATCCTATAAAATGTCACGTCGTGTTAAAAAATATAAGCCAGATAATATTGATGAGGGAGATATAAACATTGCAACATATAGTAAATTGAAACTACGGGAGGCATTTCAACCAGGAGCTACTCCCTTGCAGCCAGGAAAGAAACGCTTTTTGTGTTACAATATGCTTGGAAGTATAACAACTTTTGAACATGACGGGTACTCCCACATAGAGGTAATCCTGAGCCTTCAACTTAAATCTTGTAGGTTTCTGCGTATTGTTGCTTGCTCTCATTCATGTGCTAGCACTTGGTAGGAATCAAATTCGTGAGGGATATGAAACAACTTTTATCTATATTCTAAATGACTGATGCTGTCCCTTTTCAGATAGATTTCCATGACACTGGCAGTGGCCCGCGAGTTCCTTCAATGAATGATCATTTTGGTTTTACAATGGCTGCACTGAATGAAAATGGTAGCGTCTTTGCAAACCCTTGTAAGGGTGAGAAGAATATGAGCACTCTAATGTATCGCCCATTTGGTAGCTGGGCAAATAACAGCGAGGTTAGAATGTTTTTATTTATAGTTTACAAGTTTCTCAGCTTCATACGAATGGTGCAACTTTCTATGGCTACTAAACTATGCTGACTTTGATATCTACTGTAAAAACTCAGTGGTCGATGAGATTTGATGGAGAAGAAGTTAAGGTCGTGGCAGTTGGCACCAGATGGGTGGCTGCATTTACAAGTCTTAATTATCTTCGCATCTTCACAGATGGTGGTTTACAGGTCTGTTGGACAGATTTCTTAACTAGCTTCCTATTCGCAACTTATGCCCTCTTTTCACCACATGTTTAAATGCTATAGTGTTCACATTCAGAAAACTTTTTTATTGCAAATTAAGAACCAAATTACCATATGATATATTACACAAAACCAGGAGAGATGATTCAGAATGCAGCAAACGATTATATTGACTTTGATGTGTATTTGTTTGTTAGCTTTAAGAAGCTAGGAACTTGAGTTAAACGCCAATTGTTTCCTTAACTAATAAGGAACTTACTACCATTAATTTTCTTCAATATATTTAAATTTTATATTGAATAATAATGTTGCTTAATGGCTTTATTTCAGAAACATATTCTTTCTCTGGATGGGCCAGTGGTGACTGCATCTGGCTTCAAGGATGAACTTGCATTTGTGACTCATTCTTCAACCTGTCTTCCCTCAAACGATCAGGTCTATACTCAATCAGACTTTTCTCTATTTTGAGCATGATTGATTCGTGGAATCTATGCATAGGGTGTCTGATGTCTATCTTACGAATTTCTATTGGACATTTGGAAAGTCTTTATTTGTTTGAGTTGGAGCATATTTTTAGCGCTCTTCATAGTATATTGTAGGACGGCCATAGTGAATAAAAGTTTATGATGGTGGAATTATAATCTTGAAGATGTAATTCTCTAACCTGAACTTCATTCAAAATATGCAGATGCTAGAGTTCAGAATATTAAACATCTCTAACGAAACACAGCTCTTAAGAGGACATTTGCCATTAACTCCAGGATCACATCTTGCATGGTTTGGTTTTAGTGAAGAAGGAAAATTGAGTTCCTACGATTCCAAGGTACGTTAGCTCTCGAGTAGTTTGTTGATATTCTTCAGTTGGTTATTGTGAACTATGTATCTTATTTTCCTACCGTTATTTATAGGGTGTACTCAGAGTGTTTACAAGTCAATATGGTGGCAGTTGGGTACCATTATTCAGGTTGATCAACATGCTATTTTTTTTTGGGTATATCTATTAAAAGTATAATTCACATCCTACCTTCTTGTACTCCTGTCTGAAGTTCAGATTTGTTGTGCAGTGCTCGTAAAGAGAGGAAGTCGGATGAAAAATATTGGGTGGTTGGGTTGAACACAAGCAAGTTGTTCTGTGTGATATGCAAAAATTCTGAGTCATACCCACAGGTAACAGAATTTCTTCTCCCCCAAACTCATTTACCCTGTATCATTGCTGAGCGTAATTTCTGTCTGCTTCAGGTGACACCTAAACCAATTCTTACTCTACTAAATCTTTCATTTCCTCTTGCGCTCTCTGACCTGGGTGCAGAAGCACTTGAGAATGAGTTTATGATGAATAATATGCACCTCACACAGGTCAGTTTGATACTTCCCAATCTTGTTTTAATTCTGTAGCTTTTAACTCTATGGCTATTAAGATCATCACTTATGTCAACATCATTTAGCGTGGAAGCTATGAGGAATAGAAACTAATTTTTTAATTGACATCGTTCTTTCATTTTTTTTTTCTTTCTAAGTTACTACTTTCATTTGCTTTCACTGTCCAAGTAATGTAAATGTATTCACACAGATCCATGGGAGAATGGAGGAAATGGCATTACTTGGGTTGTACGACACAGAACTAGATGATGAGGCATTCTGTATTGAAGCTGCTCAAGACAGATGCATCTTAAGGCTCATTGCTTCCTGCTGCAACGGTTGGGATTTGCAACTTACTTTAATGTCTTAGACATTTTGCATTTCATTTTCCTTGATGGCTTTTTCCTTTGACAAGGAATTAAAAATTACGATCTGAGGTTTCATCTAACTGAAGATGGTGAAATTTTTCCAGGCGACAAGCTTGTCAGAGCTTCCGAACTAGTAAAACTTTTGTCGCTGGAGAAATCAGTGAAGGGTGCAATTAAGCTGGTTACTGCTTTGAAGCTTCCTAACTTGGCGGAACGTTTCAATGCCATATTAGAGGTAGGACAAACTCCCAGAAACAGCCTCAAATCCTTTGATCTAGCAGTGGGGTCTCATATTTACATATATAATATATATATATATGTAATCCTTTCATCAACTTTTACTCTTTTTCTAATATCAGGAAAGGTTGCTTAATGAAGCCAAAGGGACAATGGAAACCACCACCCTTTCAAGATCTAACTGCAGTGGATCTGTGTTACCTAACGCTGGAAGCAGCAGTAGTACACTAAATTGTGCAGTAAAAGGCAGAAGTTCAGAAGTTACTCGTCCATCCTCACCTAAAGGCCCAATACTCTCTCTCTCTGCACCTTTGTTTACAAAGAAACTGAAGTCCGACGGAGCAAAATTCAACGATGGAAGAACAGAGGATAAACAAAGTTCAGGGGTGGTGAAGAAGGGAACAGAAAATAGTGGTGGTGACACAAATGTAGCAGCGGCAGATGTGAAGAAAGCAGTGTCATTGAGTAAGATAGAGTCCACTACCCTTGAAACTAATCAACATTTGAATTCATGTAACAGTCAGAAGGTGAAGGCAGAAGATGTGAATCAACAAGCTCAATGTACTCGCCCCCGAATCCGTTCTTGAAGTCATCAATCAAGTAAAAGCAGTTTTTTTTTTTTTTTTTTATAGTTGCATTTGTTGTTTAGAATGTGTAAATTATGAGGTAGGAATGAGAATGTACAGTTCTATTATAGGTAGGTAGCTTAAGTGGCTAAACCTTTCGTCAATTGTGACGTTATGAGGTCTGTAAGCATATCTCCATCTTCATCCTCTCCATAAAAATGAGGGAGGCTTCAAATTTTCACATTACTCCATTCTAATTTCAACCTGTCCAACTTTTCCATCC
mRNA sequence
GCTCCGGCGAAACTAGAACAAAAGCCGGAAGAAAGATACTCAGTTCGTTAATTGAAAGGAAATTACTGGAGTTGCTATGAAAATCCGGTCGGTGAAGCTCCGAGAAGCCCACAAGGCAGCCAGCAATGGCAAGGCTTCGTTTTGCTCCGTCCTGTGGGATCAACAAGCGTCCCATATTGTCACTGCTTCGTCTTCCGAATCCGCCATATCCATCCATGATTCTCTTCTTCCTTCAAATAACCCGAAGATCATTCATCATCATCGCGAAGGGGTCACGGCTCTGGCTCTTAGCCCCAATTCCACCTGCCTCGCTTCTGGATCCATGGATCGATCCGTCAAGCTCTACAAGTTTCCCGGTGGAGAATTTGAGACCAATATCACAAGATTTACGCTGCCAATACGTACTCTTGCGTTTAATAAGTCCGGAAGCCTCTTGGCAGCAGCTGGCGAGGACGATGGTATTAAGCTAATTAACACGATTGATGGTTCAATTGCTAGGGTTCTGAAAGGACATAAAGCAGCCGTTACTGGGTTGGCGTTTGACCCCAATAGTGAGTACATGGCATCGGTGGATTCATTTGGAACTGTAATTTTCTGGGAACTCCAATCTGGAAGTATAATACATAACCTTAAAGGCATAGCTCCTAGCACAGGTACGGACCCTTCCGTTATGAATGTTTTATGCTGGAGCCCTGACGGAGAGATGTTAGCAGTACCAGGATTAAGAAATGATGTTGTGATGTATGATAGAGATACTGCTGAAAAGCTATTTTCATTGAGGGGAGATCACACCCAGCCTATTTGTTTCCTGTCATGGTCACCTAACGGAAAATACATGGCTACTTCTTCCTTGGATAGGCAAATTCTGATATGGGATGTTGATCAAAAGCTGGATATTGATAGACAGAAATTTGACGAGAGAATATGTTGCATGGCATGGAAGCCAATTGGCAATGCGTTGGCTGTTATTGATGTCATGGGTAAGTATGGAGTGTGGGAATCAGTTGTTCCATCTTCTATGAAATCTCCAACTGAAGATATCCCAAAATTGCAATCAAGGAATAGCAACGGCTTGCTCTTGTTTGATGAAGAGGATGAGGAGCCTAGTGAACATGGCAATTTGAGTGATCTTGGAGAAGATAGCCTTTATGAATCCGAATTTACAACTCGAAAAAGGTTGCGTAAGCATTCAACAAATGAAGAAATCCTGGATGAGGCTGATAGTGAAGATTTTAGCCTTCTCCCAAAGTCGGAATCCTATAAAATGTCACGTCGTGTTAAAAAATATAAGCCAGATAATATTGATGAGGGAGATATAAACATTGCAACATATAGTAAATTGAAACTACGGGAGGCATTTCAACCAGGAGCTACTCCCTTGCAGCCAGGAAAGAAACGCTTTTTGTGTTACAATATGCTTGGAAGTATAACAACTTTTGAACATGACGGGTACTCCCACATAGAGATAGATTTCCATGACACTGGCAGTGGCCCGCGAGTTCCTTCAATGAATGATCATTTTGGTTTTACAATGGCTGCACTGAATGAAAATGGTAGCGTCTTTGCAAACCCTTGTAAGGGTGAGAAGAATATGAGCACTCTAATGTATCGCCCATTTGGTAGCTGGGCAAATAACAGCGAGTGGTCGATGAGATTTGATGGAGAAGAAGTTAAGGTCGTGGCAGTTGGCACCAGATGGGTGGCTGCATTTACAAGTCTTAATTATCTTCGCATCTTCACAGATGGTGGTTTACAGAAACATATTCTTTCTCTGGATGGGCCAGTGGTGACTGCATCTGGCTTCAAGGATGAACTTGCATTTGTGACTCATTCTTCAACCTGTCTTCCCTCAAACGATCAGATGCTAGAGTTCAGAATATTAAACATCTCTAACGAAACACAGCTCTTAAGAGGACATTTGCCATTAACTCCAGGATCACATCTTGCATGGTTTGGTTTTAGTGAAGAAGGAAAATTGAGTTCCTACGATTCCAAGGGTGTACTCAGAGTGTTTACAAGTCAATATGGTGGCAGTTGGGTACCATTATTCAGTGCTCGTAAAGAGAGGAAGTCGGATGAAAAATATTGGGTGGTTGGGTTGAACACAAGCAAGTTGTTCTGTGTGATATGCAAAAATTCTGAGTCATACCCACAGGTGACACCTAAACCAATTCTTACTCTACTAAATCTTTCATTTCCTCTTGCGCTCTCTGACCTGGGTGCAGAAGCACTTGAGAATGAGTTTATGATGAATAATATGCACCTCACACAGATCCATGGGAGAATGGAGGAAATGGCATTACTTGGGTTGTACGACACAGAACTAGATGATGAGGCATTCTGTATTGAAGCTGCTCAAGACAGATGCATCTTAAGGCTCATTGCTTCCTGCTGCAACGGCGACAAGCTTGTCAGAGCTTCCGAACTAGTAAAACTTTTGTCGCTGGAGAAATCAGTGAAGGGTGCAATTAAGCTGGTTACTGCTTTGAAGCTTCCTAACTTGGCGGAACGTTTCAATGCCATATTAGAGGAAAGGTTGCTTAATGAAGCCAAAGGGACAATGGAAACCACCACCCTTTCAAGATCTAACTGCAGTGGATCTGTGTTACCTAACGCTGGAAGCAGCAGTAGTACACTAAATTGTGCAGTAAAAGGCAGAAGTTCAGAAGTTACTCGTCCATCCTCACCTAAAGGCCCAATACTCTCTCTCTCTGCACCTTTGTTTACAAAGAAACTGAAGTCCGACGGAGCAAAATTCAACGATGGAAGAACAGAGGATAAACAAAGTTCAGGGGTGGTGAAGAAGGGAACAGAAAATAGTGGTGGTGACACAAATGTAGCAGCGGCAGATGTGAAGAAAGCAGTGTCATTGAGTAAGATAGAGTCCACTACCCTTGAAACTAATCAACATTTGAATTCATGTAACAGTCAGAAGGTGAAGGCAGAAGATGTGAATCAACAAGCTCAATGTACTCGCCCCCGAATCCGTTCTTGAAGTCATCAATCAAGTAAAAGCAGTTTTTTTTTTTTTTTTTTATAGTTGCATTTGTTGTTTAGAATGTGTAAATTATGAGGTAGGAATGAGAATGTACAGTTCTATTATAGGTAGGTAGCTTAAGTGGCTAAACCTTTCGTCAATTGTGACGTTATGAGGTCTGTAAGCATATCTCCATCTTCATCCTCTCCATAAAAATGAGGGAGGCTTCAAATTTTCACATTACTCCATTCTAATTTCAACCTGTCCAACTTTTCCATCC
Coding sequence (CDS)
ATGAAAATCCGGTCGGTGAAGCTCCGAGAAGCCCACAAGGCAGCCAGCAATGGCAAGGCTTCGTTTTGCTCCGTCCTGTGGGATCAACAAGCGTCCCATATTGTCACTGCTTCGTCTTCCGAATCCGCCATATCCATCCATGATTCTCTTCTTCCTTCAAATAACCCGAAGATCATTCATCATCATCGCGAAGGGGTCACGGCTCTGGCTCTTAGCCCCAATTCCACCTGCCTCGCTTCTGGATCCATGGATCGATCCGTCAAGCTCTACAAGTTTCCCGGTGGAGAATTTGAGACCAATATCACAAGATTTACGCTGCCAATACGTACTCTTGCGTTTAATAAGTCCGGAAGCCTCTTGGCAGCAGCTGGCGAGGACGATGGTATTAAGCTAATTAACACGATTGATGGTTCAATTGCTAGGGTTCTGAAAGGACATAAAGCAGCCGTTACTGGGTTGGCGTTTGACCCCAATAGTGAGTACATGGCATCGGTGGATTCATTTGGAACTGTAATTTTCTGGGAACTCCAATCTGGAAGTATAATACATAACCTTAAAGGCATAGCTCCTAGCACAGGTACGGACCCTTCCGTTATGAATGTTTTATGCTGGAGCCCTGACGGAGAGATGTTAGCAGTACCAGGATTAAGAAATGATGTTGTGATGTATGATAGAGATACTGCTGAAAAGCTATTTTCATTGAGGGGAGATCACACCCAGCCTATTTGTTTCCTGTCATGGTCACCTAACGGAAAATACATGGCTACTTCTTCCTTGGATAGGCAAATTCTGATATGGGATGTTGATCAAAAGCTGGATATTGATAGACAGAAATTTGACGAGAGAATATGTTGCATGGCATGGAAGCCAATTGGCAATGCGTTGGCTGTTATTGATGTCATGGGTAAGTATGGAGTGTGGGAATCAGTTGTTCCATCTTCTATGAAATCTCCAACTGAAGATATCCCAAAATTGCAATCAAGGAATAGCAACGGCTTGCTCTTGTTTGATGAAGAGGATGAGGAGCCTAGTGAACATGGCAATTTGAGTGATCTTGGAGAAGATAGCCTTTATGAATCCGAATTTACAACTCGAAAAAGGTTGCGTAAGCATTCAACAAATGAAGAAATCCTGGATGAGGCTGATAGTGAAGATTTTAGCCTTCTCCCAAAGTCGGAATCCTATAAAATGTCACGTCGTGTTAAAAAATATAAGCCAGATAATATTGATGAGGGAGATATAAACATTGCAACATATAGTAAATTGAAACTACGGGAGGCATTTCAACCAGGAGCTACTCCCTTGCAGCCAGGAAAGAAACGCTTTTTGTGTTACAATATGCTTGGAAGTATAACAACTTTTGAACATGACGGGTACTCCCACATAGAGATAGATTTCCATGACACTGGCAGTGGCCCGCGAGTTCCTTCAATGAATGATCATTTTGGTTTTACAATGGCTGCACTGAATGAAAATGGTAGCGTCTTTGCAAACCCTTGTAAGGGTGAGAAGAATATGAGCACTCTAATGTATCGCCCATTTGGTAGCTGGGCAAATAACAGCGAGTGGTCGATGAGATTTGATGGAGAAGAAGTTAAGGTCGTGGCAGTTGGCACCAGATGGGTGGCTGCATTTACAAGTCTTAATTATCTTCGCATCTTCACAGATGGTGGTTTACAGAAACATATTCTTTCTCTGGATGGGCCAGTGGTGACTGCATCTGGCTTCAAGGATGAACTTGCATTTGTGACTCATTCTTCAACCTGTCTTCCCTCAAACGATCAGATGCTAGAGTTCAGAATATTAAACATCTCTAACGAAACACAGCTCTTAAGAGGACATTTGCCATTAACTCCAGGATCACATCTTGCATGGTTTGGTTTTAGTGAAGAAGGAAAATTGAGTTCCTACGATTCCAAGGGTGTACTCAGAGTGTTTACAAGTCAATATGGTGGCAGTTGGGTACCATTATTCAGTGCTCGTAAAGAGAGGAAGTCGGATGAAAAATATTGGGTGGTTGGGTTGAACACAAGCAAGTTGTTCTGTGTGATATGCAAAAATTCTGAGTCATACCCACAGGTGACACCTAAACCAATTCTTACTCTACTAAATCTTTCATTTCCTCTTGCGCTCTCTGACCTGGGTGCAGAAGCACTTGAGAATGAGTTTATGATGAATAATATGCACCTCACACAGATCCATGGGAGAATGGAGGAAATGGCATTACTTGGGTTGTACGACACAGAACTAGATGATGAGGCATTCTGTATTGAAGCTGCTCAAGACAGATGCATCTTAAGGCTCATTGCTTCCTGCTGCAACGGCGACAAGCTTGTCAGAGCTTCCGAACTAGTAAAACTTTTGTCGCTGGAGAAATCAGTGAAGGGTGCAATTAAGCTGGTTACTGCTTTGAAGCTTCCTAACTTGGCGGAACGTTTCAATGCCATATTAGAGGAAAGGTTGCTTAATGAAGCCAAAGGGACAATGGAAACCACCACCCTTTCAAGATCTAACTGCAGTGGATCTGTGTTACCTAACGCTGGAAGCAGCAGTAGTACACTAAATTGTGCAGTAAAAGGCAGAAGTTCAGAAGTTACTCGTCCATCCTCACCTAAAGGCCCAATACTCTCTCTCTCTGCACCTTTGTTTACAAAGAAACTGAAGTCCGACGGAGCAAAATTCAACGATGGAAGAACAGAGGATAAACAAAGTTCAGGGGTGGTGAAGAAGGGAACAGAAAATAGTGGTGGTGACACAAATGTAGCAGCGGCAGATGTGAAGAAAGCAGTGTCATTGAGTAAGATAGAGTCCACTACCCTTGAAACTAATCAACATTTGAATTCATGTAACAGTCAGAAGGTGAAGGCAGAAGATGTGAATCAACAAGCTCAATGTACTCGCCCCCGAATCCGTTCTTGA
Protein sequence
MKIRSVKLREAHKAASNGKASFCSVLWDQQASHIVTASSSESAISIHDSLLPSNNPKIIHHHREGVTALALSPNSTCLASGSMDRSVKLYKFPGGEFETNITRFTLPIRTLAFNKSGSLLAAAGEDDGIKLINTIDGSIARVLKGHKAAVTGLAFDPNSEYMASVDSFGTVIFWELQSGSIIHNLKGIAPSTGTDPSVMNVLCWSPDGEMLAVPGLRNDVVMYDRDTAEKLFSLRGDHTQPICFLSWSPNGKYMATSSLDRQILIWDVDQKLDIDRQKFDERICCMAWKPIGNALAVIDVMGKYGVWESVVPSSMKSPTEDIPKLQSRNSNGLLLFDEEDEEPSEHGNLSDLGEDSLYESEFTTRKRLRKHSTNEEILDEADSEDFSLLPKSESYKMSRRVKKYKPDNIDEGDINIATYSKLKLREAFQPGATPLQPGKKRFLCYNMLGSITTFEHDGYSHIEIDFHDTGSGPRVPSMNDHFGFTMAALNENGSVFANPCKGEKNMSTLMYRPFGSWANNSEWSMRFDGEEVKVVAVGTRWVAAFTSLNYLRIFTDGGLQKHILSLDGPVVTASGFKDELAFVTHSSTCLPSNDQMLEFRILNISNETQLLRGHLPLTPGSHLAWFGFSEEGKLSSYDSKGVLRVFTSQYGGSWVPLFSARKERKSDEKYWVVGLNTSKLFCVICKNSESYPQVTPKPILTLLNLSFPLALSDLGAEALENEFMMNNMHLTQIHGRMEEMALLGLYDTELDDEAFCIEAAQDRCILRLIASCCNGDKLVRASELVKLLSLEKSVKGAIKLVTALKLPNLAERFNAILEERLLNEAKGTMETTTLSRSNCSGSVLPNAGSSSSTLNCAVKGRSSEVTRPSSPKGPILSLSAPLFTKKLKSDGAKFNDGRTEDKQSSGVVKKGTENSGGDTNVAAADVKKAVSLSKIESTTLETNQHLNSCNSQKVKAEDVNQQAQCTRPRIRS
Homology
BLAST of CmaCh11G002340 vs. ExPASy Swiss-Prot
Match:
O75717 (WD repeat and HMG-box DNA-binding protein 1 OS=Homo sapiens OX=9606 GN=WDHD1 PE=1 SV=1)
HSP 1 Score: 289.7 bits (740), Expect = 1.3e-76
Identity = 256/982 (26.07%), Postives = 458/982 (46.64%), Query Frame = 0
Query: 25 VLWDQQASHIVTASSSESAISIHDSLLPSNNPKIIHHHREGVTALALSPNSTCLASGSMD 84
V +D S IVT S D L ++PK I+ G A + + S L + +
Sbjct: 19 VCFDDSGSFIVTCGSDGDVRIWED--LDDDDPKFIN---VGEKAYSCALKSGKLVTAVSN 78
Query: 85 RSVKLYKFPGGEFETNITRFTLPIRTLAFNKSGSLLAAAGEDDGIKLINTIDGSIARVLK 144
+++++ FP G + +TRFT + FN G+ +AA D +K+++ +D S + +
Sbjct: 79 NTIQVHTFPEGVPDGILTRFTTNANHVVFNGDGTKIAAGSSDFLVKIVDVMDSSQQKTFR 138
Query: 145 GHKAAVTGLAFDPNSEYMASVDSFGTVIFWEL--QSGSIIHNLKGIAPSTGTDPSVMNVL 204
GH A V L+FDP ++AS G+V W++ Q+ +I L S+ L
Sbjct: 139 GHDAPVLSLSFDPKDIFLASASCDGSVRVWQISDQTCAISWPLLQKCNDVINAKSICR-L 198
Query: 205 CWSP-DGEMLAVPGLRNDVVMYDRDTAEKLFSLRGDH-TQPICFLSWSPNGKYMATSSLD 264
W P G++LA+P + V +Y R++ F L + +Q + ++WSP G+Y+A S++
Sbjct: 199 AWQPKSGKLLAIP-VEKSVKLYRRESWSHQFDLSDNFISQTLNIVTWSPCGQYLAAGSIN 258
Query: 265 RQILIWDVDQKLDIDRQKFDE--RICCMAWKPIGNALAVIDVMGKYGVWESVVPSSMKSP 324
I++W+V+ K ++R K ++ IC +AW P ++ D G G+ E+V S K+
Sbjct: 259 GLIIVWNVETKDCMERVKHEKGYAICGLAWHPTCGRISYTDAEGNLGLLENVCDPSGKTS 318
Query: 325 TEDIPKLQSRNSNGL-----------LLFDEEDEEPSEHGNLSDLGEDSLYESEFTTRKR 384
+ + ++ N L L D E PS + + ED + R R
Sbjct: 319 SSKVSSRVEKDYNDLFDGDDMSNAGDFLNDNAVEIPSFSKGIINDDEDDEDLMMASGRPR 378
Query: 385 LRKHSTNEEILDEADSEDFSLLPKSESYKMSRRVKKYKPDNIDEGDIN---IATYSK--- 444
R H + D+ +S D S+L K + K + ++ EG I+ + T +
Sbjct: 379 QRSHI----LEDDENSVDISML------KTGSSLLKEEEEDGQEGSIHNLPLVTSQRPFY 438
Query: 445 -----LKLREAFQPGATPLQPGKKRFLCYNMLGSITTFEHDGYSHIEIDFHDTGSGPRVP 504
++ FQ G+TPL RF+ +N +G I + + + I+++FHDT S
Sbjct: 439 DGPMPTPRQKPFQSGSTPLHL-THRFMVWNSIGIIRCYNDEQDNAIDVEFHDT-SIHHAT 498
Query: 505 SMNDHFGFTMAALNENGSVFANPCKGEKNMSTLMY-RPFGSWANNSEWSMRF-DGEEVKV 564
+++ +T+A L+ + A C+ +++ ++ F SW ++ EW + E+++
Sbjct: 499 HLSNTLNYTIADLSHEAILLA--CESTDELASKLHCLHFSSWDSSKEWIIDLPQNEDIEA 558
Query: 565 VAVGTRWVAAFTSLNYLRIFTDGGLQKHILSLDGPVVTASGFKDELAFVTHSSTCLPSND 624
+ +G W AA TS LR+FT GG+QK + SL GPVV+ +G ++L V H T D
Sbjct: 559 ICLGQGWAAAATSALLLRLFTIGGVQKEVFSLAGPVVSMAGHGEQLFIVYHRGTGF-DGD 618
Query: 625 QMLEFRILNI-SNETQLLRGH-LPLTPGSHLAWFGFSEEGKLSSYDSKGVLRVFTSQYGG 684
Q L ++L + + Q+L G LPLT S+LAW GFS EG DS+G++R+ G
Sbjct: 619 QCLGVQLLELGKKKKQILHGDPLPLTRKSYLAWIGFSAEGTPCYVDSEGIVRMLNRGLGN 678
Query: 685 SWVPLFSARKERK-SDEKYWVVGL--NTSKLFCVICKNSESYPQVTPKPILTLLNLSFPL 744
+W P+ + R+ K + YWVVG+ N +L C+ CK S +P P+P + +L+ P
Sbjct: 679 TWTPICNTREHCKGKSDHYWVVGIHENPQQLRCIPCKGSR-FPPTLPRPAVAILSFKLPY 738
Query: 745 ALSDLGAEALENEFMMNNMHLTQIHGRMEEMALLGL-YDTELDDEAFCIEAAQDRCILRL 804
+E +F + + H ++ +A G Y+ ++A Q ++++
Sbjct: 739 CQIATEKGQMEEQFWRSVI----FHNHLDYLAKNGYEYEESTKNQA---TKEQQELLMKM 798
Query: 805 IASCCNGDKLVRASELVKLLSLEKSVKGAIKLVTALKLPNLAERFNAILEERLLNEAKGT 864
+A C ++ R EL L++ + +V AIK + + LA++ + + E+ E T
Sbjct: 799 LALSCKLEREFRCVELADLMT-QNAVNLAIKYASRSRKLILAQKLSELAVEKAA-ELTAT 858
Query: 865 METTTLSRSNCSGSVLPNAGSSSSTLNCAVKGRSSEVTRPSSPKGPILSLSAPLFTKKLK 924
+ + NAG S++ + ++V + G P K +
Sbjct: 859 QVEEEEEEEDFRKKL--NAGYSNTATEWSQPRFRNQVEEDAEDSGEADDEEKPEIHKPGQ 918
Query: 925 SDGAKFNDGRTEDKQSSGVVKKGTENSGGDTNVAAADVKKAVSLSKIESTTLETNQHLNS 971
+ +K + ++ SG V ++ V+A+ + A+S++ ST + N +S
Sbjct: 919 NSFSK-STNSSDVSAKSGAVTFSSQGRVNPFKVSASSKEPAMSMNSARSTNILDNMGKSS 965
BLAST of CmaCh11G002340 vs. ExPASy Swiss-Prot
Match:
O13046 (WD repeat and HMG-box DNA-binding protein 1 OS=Xenopus laevis OX=8355 GN=wdhd1 PE=1 SV=1)
HSP 1 Score: 288.5 bits (737), Expect = 2.8e-76
Identity = 228/836 (27.27%), Postives = 398/836 (47.61%), Query Frame = 0
Query: 25 VLWDQQASHIVTASSSESAISIHDSLLPSNNPKIIHHHREGVTALALSPNSTCLASGSMD 84
V +D + +VT S+ I I +S L ++PK I G A + + + + + + +
Sbjct: 19 VCFDDSGNFLVTC-GSDGDIRIWES-LDDDDPKSI---SIGEKAYSFALKNGKVVTAASN 78
Query: 85 RSVKLYKFPGGEFETNITRFTLPIRTLAFNKSGSLLAAAGEDDGIKLINTIDGSIARVLK 144
+++L+ FP GE + +TRFT + FN G+ +AA D +K++ D + + L+
Sbjct: 79 NAIQLHTFPDGEPDGILTRFTTNANHVVFNTDGTRIAAGSGDFLVKVLQVEDSTQQKTLR 138
Query: 145 GHKAAVTGLAFDPNSEYMASVDSFGTVIFWELQSGSIIHNLKGIAPSTGTDPSVMNV--- 204
GH A V ++FDP Y+AS G+V W++ + + P V N
Sbjct: 139 GHSAPVLSVSFDPKDIYLASASCDGSVRIWKISD----QTCEAVLPLLEKCNDVFNAKSI 198
Query: 205 --LCWSP-DGEMLAVPGLRNDVVMYDRDTAEKLFSLRGDH-TQPICFLSWSPNGKYMATS 264
L W P G+ +A+P + V +YDRD+ + + +L D TQP+ ++WSP G+Y+
Sbjct: 199 CRLAWQPKSGKFVAIP-VGKAVHLYDRDSLKNICTLSDDFITQPVNIVAWSPCGQYLVAG 258
Query: 265 SLDRQILIWDVDQKLDIDRQKFDE--RICCMAWKPIGNALAVIDVMGKYGVWESVVPSSM 324
S+D I+ W++ K ++R K ++ IC +AW P +A D G G+ E V +
Sbjct: 259 SVDGCIVAWNIATKACLERIKHEKGYTICALAWHPHLPQIAYTDNEGNLGLLEDVCQGDV 318
Query: 325 KSPTEDIPKLQSRNSNGLLLFDEEDEEP-------SEHGNLSDLGEDSLYESEFTTRKRL 384
K P+ + ++++ + LFD +D+E G ++D +D + + T R R
Sbjct: 319 KQPSAKVSSAETKDYDE--LFDGDDDEDFLNGDMIGHEGAVNDEDDDDNF-TALTGRPRN 378
Query: 385 RKHSTNEEILDEADSEDFSLLPKSESYKMSRRVKKYKPDNIDEGDINIATYSKLKL---- 444
R I D+ S D S K+ D+ N + + +KL
Sbjct: 379 R-----GAIFDDDISSDV------PSLKLVGNENPVVEDDQASSVQNFTSVASVKLSYNG 438
Query: 445 ------REAFQPGATPLQPGKKRFLCYNMLGSITTFEHDGYSHIEIDFHDTGSGPRVPSM 504
++ FQ G+TP+ RF+ +N +G I + + + I+++FHDT + +
Sbjct: 439 PMPTPQQKPFQSGSTPVHL-MHRFMVWNSVGVIRCYNDEQDNAIDVEFHDTSIHHAI-HL 498
Query: 505 NDHFGFTMAALNENGSVFANPCK-GEKNMSTLMYRPFGSWANNSEWSMRF-DGEEVKVVA 564
+ T+A +++ + A C+ E+ S L F SW + EW + GE ++ +
Sbjct: 499 TNSLNHTLADVSQEAVLLA--CETTEELASKLQCLHFSSWDTSKEWMVDMPKGENIQAIC 558
Query: 565 VGTRWVAAFTSLNYLRIFTDGGLQKHILSLDGPVVTASGFKDELAFVTHSSTCLPSNDQM 624
+G WVA TS +RIF+ GG+QK ++SL GPVV + ++L V H DQ
Sbjct: 559 LGQGWVACATSALLIRIFSVGGVQKELISLFGPVVCMASHGEQLIVVYHRGMGF-DGDQC 618
Query: 625 LEFRILNI-SNETQLLRGH-LPLTPGSHLAWFGFSEEGKLSSYDSKGVLRVFTSQYGGSW 684
L ++L + + Q+L G LPL+ S+L+W GF+ EG DS+G++R+ G +W
Sbjct: 619 LGVQLLELGKKKKQVLHGDPLPLSRKSYLSWLGFTAEGSPCYVDSEGIVRLLNRSLGDTW 678
Query: 685 VPLFSARKERK-SDEKYWVVGL--NTSKLFCVICKNSESYPQVTPKPILTLLNLSFPLAL 744
VP+ + R+ K + YWVVG+ N ++ C+ CK S +P P+P + +L + P
Sbjct: 679 VPICNTREHCKGKSDHYWVVGIHENPQQVRCIPCKGSR-FPPTLPRPAVAVLPFNLPYCQ 738
Query: 745 SDLGAEALENEFMMNNMHLTQIHGRMEEMALLGLYDTELDDEAFCIEA--AQDRCILRLI 804
+E ++ +QI + Y+ DE F EA Q ++++
Sbjct: 739 ITTEKGQMEEQYWR-----SQIFSNHSDYLSKHGYEC---DENFKAEAQKMQQELLMKMF 798
Query: 805 ASCCNGDKLVRASELVKLLSLEKSVKGAIKLVTALKLPNLAERFNAILEERLLNEA 826
A C ++ R EL + ++ + + AIK + K LA+R + + E+ +A
Sbjct: 799 ALSCKLEREFRCMELAEFMT-QNVMNLAIKYASRSKRLILAQRLSEMALEKAAEQA 815
BLAST of CmaCh11G002340 vs. ExPASy Swiss-Prot
Match:
P59328 (WD repeat and HMG-box DNA-binding protein 1 OS=Mus musculus OX=10090 GN=Wdhd1 PE=1 SV=2)
HSP 1 Score: 287.3 bits (734), Expect = 6.3e-76
Identity = 249/971 (25.64%), Postives = 441/971 (45.42%), Query Frame = 0
Query: 25 VLWDQQASHIVTASSSESAISIHDSLLPSNNPKIIHHHREGVTALALSPNSTCLASGSMD 84
V +D S+IVT S D L ++PK ++ G A + + + L + +
Sbjct: 19 VCFDDSGSYIVTCGSDGDVRMWED--LDDDDPKSVN---VGEKAFSCALKNGKLVTAVSN 78
Query: 85 RSVKLYKFPGGEFETNITRFTLPIRTLAFNKSGSLLAAAGEDDGIKLINTIDGSIARVLK 144
+V++Y FP G + +TRFT + FN +G+ +AA D +K+++ +D S + +
Sbjct: 79 NTVQVYTFPEGVPDGILTRFTTNANHVVFNGAGNKIAAGSSDFLVKVVDVMDNSQQQTFR 138
Query: 145 GHKAAVTGLAFDPNSEYMASVDSFGTVIFWELQSGSIIHNLKGIAPSTG-TDPSVMNVLC 204
GH A V L+FDP ++AS GTV W + + + + S + + L
Sbjct: 139 GHDAPVLSLSFDPKDIFLASASCDGTVRVWNISDQTCAVSWPVLQKSNDVVNAKSICRLA 198
Query: 205 WSPD-GEMLAVPGLRNDVVMYDRDTAEKLFSLRGDH-TQPICFLSWSPNGKYMATSSLDR 264
W P G++LAVP + V +Y R+T F L +Q + ++WSP G+Y+A +++
Sbjct: 199 WQPKAGKLLAVP-VEKSVKLYRRETWSNPFDLSDSSISQTLNIVTWSPCGQYLAAGAING 258
Query: 265 QILIWDVDQKLDIDRQKFDE--RICCMAWKPIGNALAVIDVMGKYGVWESVVPSSMKSPT 324
I++W+V+ K ++R K ++ IC +AW P + + DV G GV E+V S K +
Sbjct: 259 LIVVWNVETKDCMERVKHEKGYAICGLAWHPTCSRICYTDVEGNLGVLENVCDLSGKVSS 318
Query: 325 EDIPKLQSRNSNGLLLFDEEDEEPSEHGNLSDLGEDSLYESEFTTRKRLRKHSTNEEILD 384
+ ++ N LFD +D + D D+ E ++ + + N++I+
Sbjct: 319 NKVSSRVEKDYND--LFDGDDT-----SSAGDFLNDNAVEIPSFSKGIINEDDDNDDIML 378
Query: 385 EADSEDFSLLPKSESYKMSRRVKKYKPDNIDEGDINIATYSKLKLREAFQPGATPLQPGK 444
AD D S M + +K + D+ +I ++ + F G P K
Sbjct: 379 AAD-HDLGDDENSVDVTMLKADLSHKEEGDDDQARSIHNLPLIRPQRPFYDGPMPTPRQK 438
Query: 445 ------------KRFLCYNMLGSITTFEHDGYSHIEIDFHDTGSGPRVPSMNDHFGFTMA 504
RF+ +N +G I + D S I+++FHDT +N F +TM
Sbjct: 439 PFQSSSTPLHLSHRFMVWNSVGIIRCYNDDQDSAIDVEFHDTSIHHATHLLN-AFNYTMG 498
Query: 505 ALNENGSVFANPCKGEKNMSTLMY-RPFGSWANNSEWSMRF-DGEEVKVVAVGTRWVAAF 564
L+ + A C+ +++ ++ F SW ++ EW + E+++ + +G W AA
Sbjct: 499 TLSHEAILLA--CESADELASKLHCLHFSSWDSSKEWMVDMPQNEDIEAICLGLGWAAAA 558
Query: 565 TSLNYLRIFTDGGLQKHILSLDGPVVTASGFKDELAFVTHSSTCLPSNDQMLEFRILNIS 624
T+ LR+FT GG+QK + L GPVV+ +G ++L V H T DQ L ++L +
Sbjct: 559 TTALLLRLFTIGGVQKEVFCLPGPVVSMAGHGEQLCIVYHRGTGF-DGDQCLGVQLLELG 618
Query: 625 -NETQLLRGH-LPLTPGSHLAWFGFSEEGKLSSYDSKGVLRVFTSQYGGSWVPLFSARKE 684
+ Q+L G LPLT S+L W GFS EG DS+G +R+ G +W P+ + R+
Sbjct: 619 RKKNQVLHGDPLPLTRKSYLTWLGFSAEGTPCYVDSEGCVRMLNRGLGNTWTPVCNIREH 678
Query: 685 RK-SDEKYWVVGL--NTSKLFCVICKNSESYPQVTPKPILTLLNLSFPLALSDLGAEALE 744
K + YWVVG+ N +L C+ CK S +P P+P + +L+ P + +E
Sbjct: 679 CKGKSDHYWVVGIHENPQQLRCIPCKGSR-FPPTLPRPAVAILSFKLPYCQTSTEKGQME 738
Query: 745 NEFMMNNMHLTQIHGRMEEMALLGL-YDTELDDEAFCIEAAQDRCILRLIASCCNGDKLV 804
+F H H ++ +A G Y+ + ++A Q +++++A C ++
Sbjct: 739 EQF----WHSVLFHNYLDYLAKNGYDYEESIKNQAV---KEQQELLMKMLALSCKLEREF 798
Query: 805 RASELVKLLSLEKSVKGAIKLVTALKLPNLAERFNAILEERLLNEAKGTMETTTLSRSNC 864
R EL L++ + +V AIK + + LA++ + + E+ A ET +
Sbjct: 799 RCVELADLMT-QNAVHLAIKYASRSRKLILAQKLSELAAEK----AAELAETQSEEEKEE 858
Query: 865 SGSVLPNAGSSSSTLNCAVKGRSSEVTRPSSPKGPILSLSAPLFTKKLKSDGAKFNDGRT 924
NAG S +T + + R + + +S P + F +
Sbjct: 859 DFREKLNAGYSHTTTEWS-RPRVRSQVEDAEDREDTVSEEKP---ESHNHGQNLFQSANS 918
Query: 925 EDKQS--SGVVKKGTENSGGDTNVAAADVKKAVSLSKIESTTLETNQHLNSCNSQKVKAE 969
D + SG V ++ V + + AVS + S + + + +S S +
Sbjct: 919 SDTPALKSGAVFSSSQGWVNPFKVVVSSKEPAVSANSTRSANILDSMNKSSRKSTSLNRM 954
BLAST of CmaCh11G002340 vs. ExPASy Swiss-Prot
Match:
Q9C107 (Minichromosome loss protein 1 OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=mcl1 PE=1 SV=1)
HSP 1 Score: 241.9 bits (616), Expect = 3.0e-62
Identity = 205/790 (25.95%), Postives = 352/790 (44.56%), Query Frame = 0
Query: 56 PKIIHHHREGVTALALSPNSTCLASGSMDRSVKLYKFPGGEFETNITRFTLPIRTLAFNK 115
P I +H++ +T +A++ N C S D +V +Y T + R TLPIR +A++
Sbjct: 48 PDSIDNHQDPITGIAVAENYFCTC--SEDATVCVYPIDSPTEHTLLARTTLPIRDVAYSV 107
Query: 116 SGSLLAAAGEDDGIKLINTIDGSIARVLKGHKAAVTGLAFDPNSEYMASVDSFGTVIFWE 175
G+ +A A ++ +K++++ D S L+ KA+ + + PN ++A G + F++
Sbjct: 108 DGNWIAIASDETAVKVVSSTDSSQIFSLRPAKASNKHVTYSPNGNFLAVSSCNGILYFYD 167
Query: 176 LQSGSIIHNLKGIAPSTGTDPSVMNVLCWSPDGEMLAVPGLRNDVVMYDRDTAEKLFS-L 235
Q+ +I L S + + + W P AV + V + D L+ L
Sbjct: 168 TQTRELIKFLTNTIASLEAESEICSKAAWHPKNGTFAVASTDHFVSVISPDDWLPLYKLL 227
Query: 236 RGDHTQPICFLSWSPNGKYMATSSLDRQILIWDVDQKLDIDRQKFDERICCMAWKPIGNA 295
++ + +SWS NG Y+A S ILIWD Q ++ + + +AW+P N
Sbjct: 228 PKENHSGVTDISWSSNGMYIAASFKKGGILIWDT-QSHEVVVELPYSTVVALAWQPFENV 287
Query: 296 LAVIDVMGKYGVWESVVPSSM----KSPTEDIPKLQSRNSNGLLLFDEEDEEPSEHGNLS 355
L+ G V+P S+ PT+ + +S+N L D + + N++
Sbjct: 288 LSFTTNQGILYSCPDVIPKSILKEENDPTKPLTSSKSKNRTSKELDDLFGSDDEQSQNVN 347
Query: 356 DL-GEDSLYESEFTTRKRLRKHSTNEEILDEADSEDFSLLPKSESYKMSRRVKKYKPDNI 415
DL G + E+EF L + D D +L K + + R I
Sbjct: 348 DLDGNSANEENEFINHDGLDSSLDLDGDSYMVDENDLNLAKKRKQKALIDRT-----TTI 407
Query: 416 DEGDINIATYSKLKLREA-----FQPGATPLQPGKKRFLCYNMLGSITTFEHDG-YSHIE 475
+ G SK +L +A G+TP Q G +R+LC N++G I T + D ++ I
Sbjct: 408 ENGS------SKRRLLQASIHKPVHTGSTPWQ-GNRRYLCLNLVGFIWTVQQDAEHNTIT 467
Query: 476 IDFHDTGSGPRVPSMNDHFGFTMAALNENGSVFANPCKGEKNMSTLMYRPFGSWANNSEW 535
++FHD + + ++D F MA L+ G+++A+P E + + Y+ W+ SEW
Sbjct: 468 VEFHDETTHRKYHFVDDQ-KFEMACLDHEGALYASPAT-ESSPGVIYYKAHVDWSRKSEW 527
Query: 536 SMR--FDGEEVKVVAVGTRWVAAFTSLNYLRIFTDGGLQKHI-LSLDGPVVTASGFKDEL 595
+M + E +++ + V TS Y+R+F+ G I S P V S F+D +
Sbjct: 528 AMALPMENESPVTISLSSSVVLVCTSAGYVRVFSRQGFPISIHRSKHLPFVACSSFQDTI 587
Query: 596 AFVTHSSTCLPSNDQMLEFRILNISNETQLLRGH-LPLTPGSHLAWFGFSEEGKLSSYDS 655
+ + N +++ + I +IS + L G + L P L FS+ G YDS
Sbjct: 588 ITIANDGLSSDGNSRLV-YSIEDISRDEMLQTGDGVALPPQGTLESVFFSDVGDPYIYDS 647
Query: 656 KGVLRV---FTSQYGGSWVPLFSAR--KERKS-DEKYWVVGLNTSKLFCVICKNSESYPQ 715
GVL V + W+P+ + RKS E YW V + ++ C++ K + YP
Sbjct: 648 TGVLLVLMHWRIPGQAKWIPVLDTNELERRKSRQESYWPVTVADNQFHCILLKGASRYPY 707
Query: 716 VTPKPILTLLNLSFPLALSDLGAE----ALENEFMMNNMHLTQIHGRMEEMALLGLYDTE 775
P+P+ T + P ++ A LE + N + LT + +G D
Sbjct: 708 F-PRPMFTEFDFRIPCNTNNPDASTSVPVLEELQLRNKLFLTLLEDS------IGDGDVT 767
Query: 776 LDDEAFC--IEAAQDRCILRLIASCCNGDKLVRASELVKLLSLEKSVKGAIKLVTALKLP 818
D++ +EA D+ +L+LI C +++ R EL K L S+ A K+ L
Sbjct: 768 EDEKISIARLEANIDKALLQLIQKACLEERIERVYELTKTLRRTTSIAAAQKIALHHSLT 812
BLAST of CmaCh11G002340 vs. ExPASy Swiss-Prot
Match:
Q8YV57 (Uncharacterized WD repeat-containing protein all2124 OS=Nostoc sp. (strain PCC 7120 / SAG 25.82 / UTEX 2576) OX=103690 GN=all2124 PE=4 SV=1)
HSP 1 Score: 120.2 bits (300), Expect = 1.3e-25
Identity = 73/272 (26.84%), Postives = 142/272 (52.21%), Query Frame = 0
Query: 29 QQASHIVTASSS---ESAISIHDSLLPSNNPKIIHHHREGVTALALSPNSTCLASGSMDR 88
QQ +H++ ++ + ++ +L + H++GV ++++S + +ASGS+D+
Sbjct: 1035 QQVNHVIAVPNNLKLATVTTLQQALFEMQERNRLEGHKDGVISISISRDGQTIASGSLDK 1094
Query: 89 SVKLYKFPGGEFETNITRFTLPIRTLAFNKSGSLLAAAGEDDGIKLINTIDGSIARVLKG 148
++KL+ G F T + + +++F+ G +A+ G D IKL T DG++ + + G
Sbjct: 1095 TIKLWSRDGRLFRT-LNGHEDAVYSVSFSPDGQTIASGGSDKTIKLWQTSDGTLLKTITG 1154
Query: 149 HKAAVTGLAFDPNSEYMASVDSFGTVIFWELQSGSIIHNLKGIAPSTGTDPSVMNVLCWS 208
H+ V + F P+ + +AS S ++ W+ SG ++ L TG V+ V +S
Sbjct: 1155 HEQTVNNVYFSPDGKNLASASSDHSIKLWDTTSGQLLMTL------TGHSAGVITVR-FS 1214
Query: 209 PDGEMLAVPGLRNDVVMYDRDTAEKLFSLRGDHTQPICFLSWSPNGKYMATSSLDRQILI 268
PDG+ +A V ++ R + L +L G H + LS+SP+GK +A++S D+ I +
Sbjct: 1215 PDGQTIAAGSEDKTVKLWHRQDGKLLKTLNG-HQDWVNSLSFSPDGKTLASASADKTIKL 1274
Query: 269 WDV-DQKLDIDRQKFDERICCMAWKPIGNALA 297
W + D KL + ++ + + + G A+A
Sbjct: 1275 WRIADGKLVKTLKGHNDSVWDVNFSSDGKAIA 1297
BLAST of CmaCh11G002340 vs. TAIR 10
Match:
AT3G42660.1 (transducin family protein / WD-40 repeat family protein )
HSP 1 Score: 1070.5 bits (2767), Expect = 8.3e-313
Identity = 554/919 (60.28%), Postives = 681/919 (74.10%), Query Frame = 0
Query: 1 MKIRSVKLREAHKAASNGKASFCSVLWDQQASHIVTASSSESAISIHDSLLPSN-NPKII 60
MK RS+KLREAHK G A+FCS+LWD +A H VT+SSS+ +IS+HD L S P I+
Sbjct: 1 MKSRSLKLREAHKV--GGSAAFCSILWDHKAEHFVTSSSSDPSISVHDGLSTSTLPPTIL 60
Query: 61 HHHREGVTALALSPNSTCLASGSMDRSVKLYKFPGGEFETNITRFTLPIRTLAFNKSGSL 120
HH++GVT+LALS +ST LASGS+D VKLYKFP GEF+TNITRFTLPIR LAFN SGSL
Sbjct: 61 RHHQDGVTSLALSNDSTLLASGSIDHCVKLYKFPSGEFQTNITRFTLPIRVLAFNGSGSL 120
Query: 121 LAAAGEDDGIKLINTIDGSIARVLKGHKAAVTGLAFDPNSEYMASVDSFGTVIFWELQSG 180
LAAAG+D+GIKLINT DGSI RVLKGHK VTGL F PN E +AS+D+ GTV+ WELQ+G
Sbjct: 121 LAAAGDDEGIKLINTFDGSIVRVLKGHKGPVTGLDFHPNGELLASIDTTGTVLCWELQNG 180
Query: 181 SIIHNLKGIAPSTGTDPSVMNVLCWSPDGEMLAVPGLRNDVVMYDRDTAEKLFSLRGDHT 240
+ LKG+AP TG + S++N+ WSPDG LAVPGLRNDVVMYDR T EKLF+LRGDH
Sbjct: 181 VVSFTLKGVAPDTGFNTSIVNIPRWSPDGRTLAVPGLRNDVVMYDRFTGEKLFALRGDHL 240
Query: 241 QPICFLSWSPNGKYMATSSLDRQILIWDVDQKLDIDRQKFDERICCMAWKPIGNALAVID 300
+ IC+L+W+PNGKY+ATS LD+Q+L+WDVD+K DIDR KF+ERICCM+WKP GNAL+VID
Sbjct: 241 EAICYLTWAPNGKYIATSGLDKQVLLWDVDKKQDIDRHKFEERICCMSWKPNGNALSVID 300
Query: 301 VMGKYGVWESVVPSSMKSPTEDIPKLQSRNSNGLLLFDEEDEEPSEHGNLS---DLGEDS 360
G+YGVWES+VPSSM SPT +P + + N +L FD+E EE + S +G+
Sbjct: 301 AKGRYGVWESLVPSSMLSPTVGVPDIVPKKRNEILDFDDEVEEEIYRASESLDDAMGDSD 360
Query: 361 LYESEFTTRKRLRKHSTNEEILDEADSE--DFSLLPKSESY-KMSRRVKKYKPDNIDEGD 420
ES T+RKRLRK + +E +D+A E D S LP + Y K S R + K
Sbjct: 361 DGESHHTSRKRLRKKTLIDEDVDDAYEELNDGSSLPSASEYRKKSHRGHREKQGARSGAF 420
Query: 421 INIATYSKLKLREAFQPGATPLQPGKKRFLCYNMLGSITTFEHDGYSHIEIDFHDTGSGP 480
I+ +K K++ +FQPGATP +PGK+ FLCYNMLG ITT EH+G S IE DFHDTG GP
Sbjct: 421 KGISASTKYKMQSSFQPGATPPEPGKRTFLCYNMLGCITTIEHEGNSRIETDFHDTGRGP 480
Query: 481 RVPSMNDHFGFTMAALNENGSVFANPCKGEKNMSTLMYRPFGSWANNSEWSMRFDGEEVK 540
RV SM D +GFTMA++NE G VFANPCKGEKNMS LMYRPF SWA+NSEW+MRF+GEEVK
Sbjct: 481 RVSSMIDIYGFTMASINETGCVFANPCKGEKNMSVLMYRPFRSWASNSEWTMRFEGEEVK 540
Query: 541 VVAVGTRWVAAFTSLNYLRIFTDGGLQKHILSLDGPVVTASGFKDELAFVTHSSTCLPSN 600
VVA G+ WVAA TSLN LR+F++GGLQKHILSLDGPVVTA G KD LA VTH S CLPSN
Sbjct: 541 VVANGSGWVAAVTSLNLLRVFSEGGLQKHILSLDGPVVTAVGCKDHLAVVTHVSDCLPSN 600
Query: 601 DQMLEFRILNISNETQLLRGHLPLTPGSHLAWFGFSEEGKLSSYDSKGVLRVFTSQYGGS 660
+Q++EFR+ NIS TQ L+G + LTPGS L W GFSEEG LSSYDS+GVLRVFTSQYGGS
Sbjct: 601 EQVMEFRVFNISKMTQELKGRVALTPGSRLTWIGFSEEGSLSSYDSEGVLRVFTSQYGGS 660
Query: 661 WVPLFSARKERKSDEKYWVVGLNTSKLFCVICKNSESYPQVTPKPILTLLNLSFPLALSD 720
W+P+FS KE+K +E YWVVGLNTS L+C+ CK +E +PQVTPKPILT+L+LS PLA SD
Sbjct: 661 WIPVFSTSKEKKQEENYWVVGLNTSSLYCIACKYAEMFPQVTPKPILTILDLSLPLASSD 720
Query: 721 LGAEALENEFMMNNMHLTQIHGRMEEMALLGLYDTELDDEAFCIEAAQDRCILRLIASCC 780
LGA +LENE ++ + L + ++++MAL+G+ T L+DEAF +E +QD+CILRLI+SCC
Sbjct: 721 LGAASLENELILKQLRLYETQRKVDDMALVGVDTTALEDEAFDLEVSQDKCILRLISSCC 780
Query: 781 NGDKLVRASELVKLLSLEKSVKGAIKLVTALKLPNLAERFNAILEERLLNEAKGTMETTT 840
+ D RASEL++LL+LEKS++ AI LVT LKLP LAE+F++ILEERLL EA T
Sbjct: 781 SSDSFARASELMELLTLEKSMRAAITLVTKLKLPFLAEKFSSILEERLLEEASEAAVTNP 840
Query: 841 LSRSNCSGSVLPNAGSSSSTLNCAVKGRSSEVTRPSSPKGPILSLSAPLFTKKLK-SDGA 900
N G V+ S N ++SE T + K LSAP KK K S+G
Sbjct: 841 ALNPN--GEVVTRV--ESKVQNPPASIQTSENTE-AVMKSSATKLSAPTLLKKSKVSEGL 900
Query: 901 KFNDGRT-EDKQSSGVVKK 911
K +T +DK +K+
Sbjct: 901 KLGKEQTKKDKSDDAKIKE 912
BLAST of CmaCh11G002340 vs. TAIR 10
Match:
AT5G67320.1 (WD-40 repeat family protein )
HSP 1 Score: 87.8 bits (216), Expect = 5.2e-17
Identity = 71/306 (23.20%), Postives = 134/306 (43.79%), Query Frame = 0
Query: 3 IRSVKLREAHKAASNGKASFCSVL-WDQQASHIVTASSSESA--ISIHDSLLPSNNPKII 62
I ++ L+ A K SN K+ + L W+ + + + T S A +++ L+ + +
Sbjct: 308 INALILKHA-KGKSNEKSKDVTTLDWNGEGTLLATGSCDGQARIWTLNGELIST-----L 367
Query: 63 HHHREGVTALALSPNSTCLASGSMDRSVKLYKFPGGEFETNITRFTLPIRTLAFNKSGSL 122
H+ + +L + L +GS+DR+ ++ E++ + P + + + S
Sbjct: 368 SKHKGPIFSLKWNKKGDYLLTGSVDRTAVVWDVKAEEWKQQFEFHSGPTLDVDWRNNVS- 427
Query: 123 LAAAGEDDGIKLINTIDGSIARVLKGHKAAVTGLAFDPNSEYMASVDSFGTVIFWELQSG 182
A + D I L + A+ GH+ V + +DP +AS T W ++
Sbjct: 428 FATSSTDSMIYLCKIGETRPAKTFTGHQGEVNCVKWDPTGSLLASCSDDSTAKIWNIKQS 487
Query: 183 SIIHNLKGIAPSTGTDPSVMNVLCWSPDGE---------MLAVPGLRNDVVMYDRDTAEK 242
+ +H+L+ T + WSP G LA + V ++D + +
Sbjct: 488 TFVHDLREHTKEIYT-------IRWSPTGPGTNNPNKQLTLASASFDSTVKLWDAELGKM 547
Query: 243 LFSLRGDHTQPICFLSWSPNGKYMATSSLDRQILIWDVDQKLDIDRQKFDERICCMAWKP 297
L S G H +P+ L++SPNG+Y+A+ SLD+ I IW + + + + I + W
Sbjct: 548 LCSFNG-HREPVYSLAFSPNGEYIASGSLDKSIHIWSIKEGKIVKTYTGNGGIFEVCWNK 598
BLAST of CmaCh11G002340 vs. TAIR 10
Match:
AT3G49660.1 (Transducin/WD40 repeat-like superfamily protein )
HSP 1 Score: 85.1 bits (209), Expect = 3.4e-16
Identity = 54/226 (23.89%), Postives = 100/226 (44.25%), Query Frame = 0
Query: 49 SLLPSNNPKIIHHHREGVTALALSPNSTCLASGSMDRSVKLYKF-----PGGEFETNITR 108
S P + + + H V+++ S + LAS S D++++ Y P E T
Sbjct: 10 SFTPYVHSQTLTSHNRAVSSVKFSSDGRLLASASADKTIRTYTINTINDPIAEPVQEFTG 69
Query: 109 FTLPIRTLAFNKSGSLLAAAGEDDGIKLINTIDGSIARVLKGHKAAVTGLAFDPNSEYMA 168
I +AF+ + +A +D +KL + GS+ + L GH + F+P S +
Sbjct: 70 HENGISDVAFSSDARFIVSASDDKTLKLWDVETGSLIKTLIGHTNYAFCVNFNPQSNMIV 129
Query: 169 SVDSFGTVIFWELQSGSIIHNLKGIA-PSTGTDPSVMNVLCWSPDGEMLAVPGLRNDVVM 228
S TV W++ +G + L + P T D ++ DG ++ +
Sbjct: 130 SGSFDETVRIWDVTTGKCLKVLPAHSDPVTAVD--------FNRDGSLIVSSSYDGLCRI 189
Query: 229 YDRDTAEKLFSLRGDHTQPICFLSWSPNGKYMATSSLDRQILIWDV 269
+D T + +L D P+ F+ +SPNGK++ +LD + +W++
Sbjct: 190 WDSGTGHCVKTLIDDENPPVSFVRFSPNGKFILVGTLDNTLRLWNI 227
BLAST of CmaCh11G002340 vs. TAIR 10
Match:
AT1G11160.1 (Transducin/WD40 repeat-like superfamily protein )
HSP 1 Score: 82.8 bits (203), Expect = 1.7e-15
Identity = 58/209 (27.75%), Postives = 90/209 (43.06%), Query Frame = 0
Query: 62 HREGVTALAL-SPNSTCLASGSMDRSVKLYKFPGGEFETNITRFTLPIRTLAFNKSGSLL 121
H V L++ S L +G D V L+ ++ T P+ ++AFN L+
Sbjct: 14 HSGNVNCLSIGKKTSRLLLTGGDDYKVNLWSIGKTTSPMSLCGHTSPVDSVAFNSEEVLV 73
Query: 122 AAAGEDDGIKLINTIDGSIARVLKGHKAAVTGLAFDPNSEYMASVDSFGTVIFWELQSGS 181
A IKL + + + R GH++ + + F P E++AS S + W+ +
Sbjct: 74 LAGASSGVIKLWDLEESKMVRAFTGHRSNCSAVEFHPFGEFLASGSSDTNLRVWDTRKKG 133
Query: 182 IIHNLKGIAPSTGTDPSVMNVLCWSPDGEMLAVPGLRNDVVMYDRDTAEKLFSLRGDHTQ 241
I KG T + +SPDG + GL N V ++D TA KL H
Sbjct: 134 CIQTYKGHTRGIST-------IEFSPDGRWVVSGGLDNVVKVWDL-TAGKLLHEFKCHEG 193
Query: 242 PICFLSWSPNGKYMATSSLDRQILIWDVD 270
PI L + P +AT S DR + WD++
Sbjct: 194 PIRSLDFHPLEFLLATGSADRTVKFWDLE 214
BLAST of CmaCh11G002340 vs. TAIR 10
Match:
AT2G41500.1 (WD-40 repeat family protein / small nuclear ribonucleoprotein Prp4p-related )
HSP 1 Score: 81.3 bits (199), Expect = 4.9e-15
Identity = 60/219 (27.40%), Postives = 101/219 (46.12%), Query Frame = 0
Query: 53 SNNPKIIHHHREGVTALALSPNSTCLASGSMDRSVKLYKFPG---GEFETNITRFTLPIR 112
+N ++ H+E T + SP CLA+ S DR+ KL+K G FE ++ R +
Sbjct: 288 TNTIAVLKDHKERATDVVFSPVDDCLATASADRTAKLWKTDGTLLQTFEGHLDR----LA 347
Query: 113 TLAFNKSGSLLAAAGEDDGIKLINTIDGSIARVLKGHKAAVTGLAFDPNSEYMASVDSFG 172
+AF+ SG L D +L + G+ + +GH +V G+AF + AS
Sbjct: 348 RVAFHPSGKYLGTTSYDKTWRLWDINTGAELLLQEGHSRSVYGIAFQQDGALAASCGLDS 407
Query: 173 TVIFWELQSGSIIHNLKG-IAPSTGTDPSVMNVLCWSPDGEMLAVPGLRNDVVMYDRDTA 232
W+L++G I +G I P + +SP+G LA G N ++D
Sbjct: 408 LARVWDLRTGRSILVFQGHIKPVFSVN--------FSPNGYHLASGGEDNQCRIWDLRMR 467
Query: 233 EKLFSLRGDHTQPICFLSWSP-NGKYMATSSLDRQILIW 267
+ L+ + H + + + P G ++AT+S D ++ IW
Sbjct: 468 KSLYIIPA-HANLVSQVKYEPQEGYFLATASYDMKVNIW 493
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
O75717 | 1.3e-76 | 26.07 | WD repeat and HMG-box DNA-binding protein 1 OS=Homo sapiens OX=9606 GN=WDHD1 PE=... | [more] |
O13046 | 2.8e-76 | 27.27 | WD repeat and HMG-box DNA-binding protein 1 OS=Xenopus laevis OX=8355 GN=wdhd1 P... | [more] |
P59328 | 6.3e-76 | 25.64 | WD repeat and HMG-box DNA-binding protein 1 OS=Mus musculus OX=10090 GN=Wdhd1 PE... | [more] |
Q9C107 | 3.0e-62 | 25.95 | Minichromosome loss protein 1 OS=Schizosaccharomyces pombe (strain 972 / ATCC 24... | [more] |
Q8YV57 | 1.3e-25 | 26.84 | Uncharacterized WD repeat-containing protein all2124 OS=Nostoc sp. (strain PCC 7... | [more] |
Match Name | E-value | Identity | Description | |
AT3G42660.1 | 8.3e-313 | 60.28 | transducin family protein / WD-40 repeat family protein | [more] |
AT5G67320.1 | 5.2e-17 | 23.20 | WD-40 repeat family protein | [more] |
AT3G49660.1 | 3.4e-16 | 23.89 | Transducin/WD40 repeat-like superfamily protein | [more] |
AT1G11160.1 | 1.7e-15 | 27.75 | Transducin/WD40 repeat-like superfamily protein | [more] |
AT2G41500.1 | 4.9e-15 | 27.40 | WD-40 repeat family protein / small nuclear ribonucleoprotein Prp4p-related | [more] |