Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: polypeptide, exon, CDS
ATGCCGGGAATTGTCATGGATGAAATTAATGAAGAAAGGGCTGTGAATAGACATAACGGTGCTTCAATTCATATGGAAGAGTCGTATGGAAACAAGTCGCCTAGGAGCGGGTTGAGTGTTCAGAGCCATGGAAGTGTTAATGTTGATTTCCATGTTGATGGCTTGGTTGACACCTCTATTGAGAAGCTTTATGAAAATGTCTATGATATGCAAAGTTCAGATCAGTCTCCTTCAAGGCGTAGCTTTGGATCAGATGGTGGGGAATCCAGGATTGATTCCGAACTGAATCATCTTGTAGGAGGAGAGATGAGGGAGGTAGAGATAATAAAGGAGGAAGAAGACATTGTAGAGAGGACTGAAAATGATTTTCCCAGTGATTCTGTGAAAGATTTACCATCTGTAGAGATTAAGAGCACAGAAAATTCCCAACCTGGAAGCTCAAAACGCCTTTCTTCTGGAAAAAAAGCTTCTCACTTGCAATTGAATCACGAGACATCTCCAAAATCAAGTTCCAGTGTCAAGGATTTGTCTGATAAGTCCCCTATTAGCAGAAAGAATGAAAAGAGTTCGAAAAAAACTAGTCCGGTTGCTTCTAACTCGAAGAAACAAAAAGATTCACCTTTGAGAGGCTCGAAAATACTTAATGGAACCGAGGATTTCAATGAATCAATGATGGATAATCCTGATCTAGGACCCTATCTGCTTAAGCAAGCTAGGAGTTTAGTTTCTTCAGGGGAGAATCTGCAGAAAGCTCTCTTATTAGCTCTTCGTGCTGCAAAAGCTTTTGAACTATCTGCAAATGGGAAACCGAGCTTGGAACTCGTTATGTGTTTGCACGTTACAGCAGCTATTTACTGTAGTCTAGGCCAGTACAGTGAGGCAATTCCTCTGTTGGAGCATTCCATTGAGATTCCTGCCATCAAGGAAGGCCAGGAACATGCGCTGGCCAAATTTGCAGGTCACATGCAGCTAGGTGATACCTATGCAATGCTGGGCCTGCTGGAAAATTCTCTAATCTGTTATACAACTGGGTTAGAGGTGCAGAAACAAGTCCTTGGAGAAGCAGACCCCAGAGTTGGTGAGACCTATAGGTATTTAGCTGAAGCCCATGTTCAAGCTTTGCAATTTGATGAGGCTGAGAAGTTTTGTCAAATGGCTCTCAATATACACAAAAAGAACGTCGGTCCTGCTTCTCTTGAGGAGGCTGCGGATAGGAGGCTTATGGGTCTCATATGTGAAACAAAAGGAGATCATGAAGCTGCGCTCGAGCATCTAGTTTTAGCCAGCATGGCTATGGTGGCGAATGGCCAAGAGACTGATGTGGCTGCAGTTGATTGCAGTATTGGAGATTCGTACTTATCCCTGTCCCGTTACGACGAGGCTGTTTTTGCTTACCAGAAAGCTCTCACTGTTTACAAGACCACCAAGGGAGAAAACCATCCAGCAGTTGGTTCGGTATATGTTCGTCTTGCTGATTTATACAACAAAACTGGAAAAACAAGGGAGTCGGTATCGTACTGTGAAAACGCCCTTCGAATTTATGAAAAGCCTATCTCTGGGATTCCTCCTGAGGAGATTGCTAGTGGTCTTACTAATGTTGCTGCTATTTATGAATCAATGAATGAAGCTGAGCAAGCGGTCAAATTATTGCACAAGGCACTGAAGATATATAGTAATGCCCCTGGACAGCAAAGCACGATTGCTGGAATTGAAGCCCAGATGGGTGTGTTGTATTATATGTTAGGGAACTATTCTGAATCTTATGACTCCTTCAAGAACGCTATTCCAAAGCTTCGCAACAGCGGAGAGAAAAAATCTGCTTTCTTTGGGATAGCTCTTAATCAAATGGGGCTTGCTTGTGTTCAGAAATATGCTATTAACGAAGCTGTGGAGCTATTTGAAGAAGCCAAGAGCATACTAGAAAAAGAATACGGGCCTTATCATCCCGACACGCTCGGAGTCTATAGCAACCTTGCTGGAGCATATGATGCGATTGGCAG
GTAATGTAACCATCCTTCTATCACCATACAATGATCAATACTGTAATTGAATTTGTCCTTTTCCCATAATCTCTGTCTGGAACTATGAAAATCTCCATAACCAAGTCCTTTTCGTTATTGTGCCACCGATATTCAGTATTTTCTGTGAGATCCCAAATCGGTTGGAGAGGGGAACGAAACATTTCTTGTAAGGATGTGAAAGCCTCTCTTCAGTGGACACGTTTTAAAACTTTGAGGGGAAAACTCAAAGAGGACAATATTTGCTAGCGGTGAGTTTGGGTTGGTACAAATGGTATCAGAACTAGACACCGGGCGGTGTGCCAACGAGGACGTTGGGCCTCCGAGGAGGGTGGATTGTGAGATCCCATATCGGTTGGACAGGGGAACAAAGCATTCCTTAGAAGGGTGGGGAACCTCTCCCTAGTGACGCATTTTAAAACCTTGAGGGAAAATTCGGAAGGGAAAGCCCAAAGAAGACAATATCTACTAACGGTGGGTTTGGGCGATTACATTTTCATTAAGAAATGTTTAGAAATCTAGTCCTTTGAGCAACAGATGATGCAACTCCACACTTTTTTCATACATTGATCAGTTCTCCTCCCATGACATTATACAGGTTGGATGATGCAATTGAAATGTTGGAGTATGTTGTTGGCACGAGAGAGGAAAAACTCGGGACTGCCAATCCTGATGTCGACGATGAGAAGAGGAGGTTGTCTGAATTGTTGAAAGAAGCTGGTAGAGTCCGGAGCCGAAAGGCGAGATCGCTCGAGACTCTTCTTGATGCCAATACTCATCCTGTAAACAGTAAAGGTATTAAGGTTTGATAAAAGATTATACACATCACGGGAGCAACATCGTTGCCGAGATTAGATCCGATTCAAAATCATGGCTAATTATGGTCGAGTAGATACCTGATTCTTGAGGTTGTTCGAGTGCTAGAATGTGGCGTTTCATTTTGTTTGCATTGTCGATTGCTTAATCTTGTATTGGTGTATGAAGTAATATTGTTCTGTATATTTTTACACGGAAGTCGTGACATTATGATTTTGTATATCACATTCTTCTGGGGTTATAAGATTGTTTGATACAGGTTTTTAATGAGAACTCCGAGGATGATGGTCCTGCGTTTGTACCATAAATAACGAGACGTACATCTTAACTAGTTAAACTATATTTATGTTAAAATTTGATAACATTCGTTTATTATTGGAGCATATATATAATATGTCCCGTAGCTCTGGTGAAAAGGGAAGGCCTAGCAGATCGTGTGTCTAATGGTGCAGGATTCTTCGAAAAAGTGCGGGTTGACTCGGAGACCTGGGACTTTGGCTTAGCGACGAATGAAGGGGAGCTTAAAGAGTTTTCTCCCTCAGTGGTTTATGTAGTGGTTGGACGGTCAGAACTTAGTAGTTAACATTCCATAAGCGATTTGTAGAGTTTAATAAGAATTTTTGCCTAGAATCAAATATTGAGTAATAAATGCAAATTAATAATGTATAATAAGCAGTTAAAATCTAAAAATACAATTAATTTAAAAAAAAAAAGTTAAACTCTAGACTAGACAAATGTTCATACTTTTACTTTTTTGTTCAATTAACAAAAATAAAAAGGTTTTTTTTTTTTTAAATTCACAAATCAAAATAAAATTGATCCAACAATGAAAAATAAATATCTTTTTTCGAATATATTAATTACCCTCGAAACCATTCGGTCTGCCTCGTTCGCGCGTATTACTTCTCTTTCGTGTTCATTGCTCTCGCCCACTAGATAGGTTTCTTTCTTCCACTCACAATTCCTCCTGTCGCTGACTGTTACAACATCTCGTTTGCTTCGTGGAAGGGATTGTACCTGTAACTATGGTTGAAAAATCTTCTAGTTCGTCATCTATTCTGGTTGATCAGCCAGTTGTAAGTTCGAAATCAATCTCATTGTTCGTTTTCCCTCTTCTTTTTGCCTGTTCGATGTTCTGCAGCTTGTAAATGAATTGGCGTATTCGT
TTATCTTTATTGGAATTTCGATTGCTTAGGTTCCAGGAGATGTGGTCCTCGACCTCTCAAACATGACCAACGAAACCATCAAGCTTGGAGGCGGTCTTCGACAGGTGTCTGTTAGGGTTTTATTGTTTTCAATGTTTGTGCATCTCCATATATGTCTCTGATCCCGTTTTAGCAACCATTAATTTCCTCAACGATTGTAGATTTAGTTTTCATAAACTTTTGAGCGGACAATCGAATGTACTTCAAATGTCGTTTAGTTTTCGACAGTCAAAGTTCGCATTGTTACATTTGCAGTCGTGTTGTTTACGGGATTGTTTACCTTCATTTTTTTTCTTTCTAAAAATAATCTTTCAGGACCATGATGCTATTTCTGTCGCCAAAGTTGGAAAATTAAGGTTCTCAAAACCAAACAAATATTGGGTTGAAAGCTCGCAGAAAAGGGTCAGTAGCACCGACTTTCTTCAATCGATTATGGGGATCCTTATATTGTTTGTTGTTTCAGTTAACTTTCGTATTGTGAACTTCAACCGCTAGTTTCGGTGGGTTTCTTCAATTGAAACGACTATCAATATTATTCCCGTGTTTAATTTTAATAAAATGTGCTATTTTCCAGTATGTGCCATGTGCAGAAGATTGCGTACTTGGAATCGTGGTTGACTCTAGATCCGATGTAAGTGCTTCTTTATTATTGTTTGAATTAAATGTATTTGTTCAATTACGAAATAACTGGCTACTACTCTCTCTGATTGATTTTCTTGGATGGTTTGACCGGTATCCATGTCTAATGAGTCTTAGGAAACCAATGATGATTTTTCTGACTACCTTATTAATTATCATAGTATACTTTTTTTTTACTTTGTATGTATGCAGAATTTTCTTGTTGACATAAAAGGTCCTTCATTGGCCTTTCTTCCTGTTCTTGCATTTGAAGGAGGTACAAGACGAAACATTCCCAAGTTTGAGGTGTGTATAAACTTTTGTGCACATATTTTTGTTATATTAATCATACGACCTATAGTTGTTCCTGTCTTCCTTCTATAGAAGCTTTTATGCGTTGTCAGTTCATTTATTATCCCGAGAGGTTATGCTTATCTGTTTCTTCTTTCATCTTTTTTGGTGGGGATGGTTAAGAAACATCTTTTGCATATCAGTAAAGTTGTATTCTCTTACCCTTTTGAATTTGGTTTATTTTTCTTTGCAATCTACTTACCTTAACTCGGGGATATGTATCTATCATAAGTGTATTCAATTGAAGTTACCAGTTTCACTTTTTTATCCTCCTTAAACATGCCTTCTTGATTCCCTGTCTTGTAGATGGGTGCCCTTCTTTATGTGCGGGTGGTGAAGGCAAACCCAGGTATGAATCCCGAGCTGGCATGCACTGATGGTTAGTTAAATCCTCCCTCTCTCAGCATCCAATAAATCGTTTGAAGGTTAAAACAGTCGATCGATGTAATATAGTGATGTTGCTGTGAACACATCCTCTCAAGAGGGGTGGGCAGGGCCTGCATACTTTATGCAAGTTCATGTTTCTCTCCTCAGTTTTTGGTTGATGTGATTTACATGGAATTTGCCTCATGTGAGTATCATTTTTCTTGATATAGCTAGTGGGAAAGCAGCTGGATTTGGCCACCTAAAGGATGGTTACATCTTTGACTGTTCAACTGGCTTATCAAGAATGTAATCTAAATTGCTTTTCTTAATTGCTCTTTCCGTCCCTGTTAAGTCAGTCTTCTAATCATTCATTTACATTGTTGATTTGGTCCTCTCTTCTAGGCTTCTAAGCTCGCCAACATGTCCTGTTCTTGAATCTCTTGGGAAGAAGCTTTCATTCGAGACGGCAGTTGGTATAAATGGCCGAGTTTGGGTATGTTCCTTCAATCCTTAGATTTTTGTTGCTAGACATTGGTCGTTACTCTGTATAATTCTCTTCCCATGCTGTCTTCTAGCACATATTAACCTTATATTTTCATCAACAAAAATTTTCTTTATTTGAAATA
TATATTTCTTTGTGAGTTCTAACACATGGGGTCGATAATATATATTTTAATCAATTGAACTATATTTATATTGGCCTTACACGAGAGTCATATTAATAATATTAATATGTTGTTGAAATTAGTCTTCACTCTTGGCATTGCCCTTCACTTTTACTGTTCTTTGAAATTTGAAAATGTAATTTCTTGTATCAATTTTACTAATAAATAAACTTTCTTATCTTGCTCATCTCTACTTTTCAGGTCAATGCAGATTCCCCGTCCACAACCATTGTTGTTTCTAATGCAATAATGAACTCAGAAACTCTGAGTGGGGTCCAACAGAGAATCATGGTGGACAAGCTCCTAAATAATTTAAAGCTATCAAGTTAA
mRNA sequence
ATGCCGGGAATTGTCATGGATGAAATTAATGAAGAAAGGGCTGTGAATAGACATAACGGTGCTTCAATTCATATGGAAGAGTCGTATGGAAACAAGTCGCCTAGGAGCGGGTTGAGTGTTCAGAGCCATGGAAGTGTTAATGTTGATTTCCATGTTGATGGCTTGGTTGACACCTCTATTGAGAAGCTTTATGAAAATGTCTATGATATGCAAAGTTCAGATCAGTCTCCTTCAAGGCGTAGCTTTGGATCAGATGGTGGGGAATCCAGGATTGATTCCGAACTGAATCATCTTGTAGGAGGAGAGATGAGGGAGGTAGAGATAATAAAGGAGGAAGAAGACATTGTAGAGAGGACTGAAAATGATTTTCCCAGTGATTCTGTGAAAGATTTACCATCTGTAGAGATTAAGAGCACAGAAAATTCCCAACCTGGAAGCTCAAAACGCCTTTCTTCTGGAAAAAAAGCTTCTCACTTGCAATTGAATCACGAGACATCTCCAAAATCAAGTTCCAGTGTCAAGGATTTGTCTGATAAGTCCCCTATTAGCAGAAAGAATGAAAAGAGTTCGAAAAAAACTAGTCCGGTTGCTTCTAACTCGAAGAAACAAAAAGATTCACCTTTGAGAGGCTCGAAAATACTTAATGGAACCGAGGATTTCAATGAATCAATGATGGATAATCCTGATCTAGGACCCTATCTGCTTAAGCAAGCTAGGAGTTTAGTTTCTTCAGGGGAGAATCTGCAGAAAGCTCTCTTATTAGCTCTTCGTGCTGCAAAAGCTTTTGAACTATCTGCAAATGGGAAACCGAGCTTGGAACTCGTTATGTGTTTGCACGTTACAGCAGCTATTTACTGTAGTCTAGGCCAGTACAGTGAGGCAATTCCTCTGTTGGAGCATTCCATTGAGATTCCTGCCATCAAGGAAGGCCAGGAACATGCGCTGGCCAAATTTGCAGGTCACATGCAGCTAGGTGATACCTATGCAATGCTGGGCCTGCTGGAAAATTCTCTAATCTGTTATACAACTGGGTTAGAGGTGCAGAAACAAGTCCTTGGAGAAGCAGACCCCAGAGTTGGTGAGACCTATAGGTATTTAGCTGAAGCCCATGTTCAAGCTTTGCAATTTGATGAGGCTGAGAAGTTTTGTCAAATGGCTCTCAATATACACAAAAAGAACGTCGGTCCTGCTTCTCTTGAGGAGGCTGCGGATAGGAGGCTTATGGGTCTCATATGTGAAACAAAAGGAGATCATGAAGCTGCGCTCGAGCATCTAGTTTTAGCCAGCATGGCTATGGTGGCGAATGGCCAAGAGACTGATGTGGCTGCAGTTGATTGCAGTATTGGAGATTCGTACTTATCCCTGTCCCGTTACGACGAGGCTGTTTTTGCTTACCAGAAAGCTCTCACTGTTTACAAGACCACCAAGGGAGAAAACCATCCAGCAGTTGGTTCGGTATATGTTCGTCTTGCTGATTTATACAACAAAACTGGAAAAACAAGGGAGTCGGTATCGTACTGTGAAAACGCCCTTCGAATTTATGAAAAGCCTATCTCTGGGATTCCTCCTGAGGAGATTGCTAGTGGTCTTACTAATGTTGCTGCTATTTATGAATCAATGAATGAAGCTGAGCAAGCGGTCAAATTATTGCACAAGGCACTGAAGATATATAGTAATGCCCCTGGACAGCAAAGCACGATTGCTGGAATTGAAGCCCAGATGGGTGTGTTGTATTATATGTTAGGGAACTATTCTGAATCTTATGACTCCTTCAAGAACGCTATTCCAAAGCTTCGCAACAGCGGAGAGAAAAAATCTGCTTTCTTTGGGATAGCTCTTAATCAAATGGGGCTTGCTTGTGTTCAGAAATATGCTATTAACGAAGCTGTGGAGCTATTTGAAGAAGCCAAGAGCATACTAGAAAAAGAATACGGGCCTTATCATCCCGACACGCTCGGAGTCTATAGCAACCTTGCTGGAGCATATGATGCGATTGGCAG
GTTGGATGATGCAATTGAAATGTTGGAGTATGTTGTTGGCACGAGAGAGGAAAAACTCGGGACTGCCAATCCTGATGTCGACGATGAGAAGAGGAGGTTGTCTGAATTGTTGAAAGAAGCTGGTAGAGTCCGGAGCCGAAAGGCGAGATCGCTCGAGACTCTTCTTGATGCCAATACTCATCCTGTAAACAGTAAAGGGATTGTACCTGTAACTATGGTTGAAAAATCTTCTAGTTCGTCATCTATTCTGGTTGATCAGCCAGTTGTTCCAGGAGATGTGGTCCTCGACCTCTCAAACATGACCAACGAAACCATCAAGCTTGGAGGCGGTCTTCGACAGGACCATGATGCTATTTCTGTCGCCAAAGTTGGAAAATTAAGGTTCTCAAAACCAAACAAATATTGGGTTGAAAGCTCGCAGAAAAGGTATGTGCCATGTGCAGAAGATTGCGTACTTGGAATCGTGGTTGACTCTAGATCCGATAATTTTCTTGTTGACATAAAAGGTCCTTCATTGGCCTTTCTTCCTGTTCTTGCATTTGAAGGAGGTACAAGACGAAACATTCCCAAGTTTGAGATGGGTGCCCTTCTTTATGTGCGGGTGGTGAAGGCAAACCCAGGTATGAATCCCGAGCTGGCATGCACTGATGCTAGTGGGAAAGCAGCTGGATTTGGCCACCTAAAGGATGGTTACATCTTTGACTGTTCAACTGGCTTATCAAGAATGCTTCTAAGCTCGCCAACATGTCCTGTTCTTGAATCTCTTGGGAAGAAGCTTTCATTCGAGACGGCAGTTGGTATAAATGGCCGAGTTTGGGTCAATGCAGATTCCCCGTCCACAACCATTGTTGTTTCTAATGCAATAATGAACTCAGAAACTCTGAGTGGGGTCCAACAGAGAATCATGGTGGACAAGCTCCTAAATAATTTAAAGCTATCAAGTTAA
Coding sequence (CDS)
ATGCCGGGAATTGTCATGGATGAAATTAATGAAGAAAGGGCTGTGAATAGACATAACGGTGCTTCAATTCATATGGAAGAGTCGTATGGAAACAAGTCGCCTAGGAGCGGGTTGAGTGTTCAGAGCCATGGAAGTGTTAATGTTGATTTCCATGTTGATGGCTTGGTTGACACCTCTATTGAGAAGCTTTATGAAAATGTCTATGATATGCAAAGTTCAGATCAGTCTCCTTCAAGGCGTAGCTTTGGATCAGATGGTGGGGAATCCAGGATTGATTCCGAACTGAATCATCTTGTAGGAGGAGAGATGAGGGAGGTAGAGATAATAAAGGAGGAAGAAGACATTGTAGAGAGGACTGAAAATGATTTTCCCAGTGATTCTGTGAAAGATTTACCATCTGTAGAGATTAAGAGCACAGAAAATTCCCAACCTGGAAGCTCAAAACGCCTTTCTTCTGGAAAAAAAGCTTCTCACTTGCAATTGAATCACGAGACATCTCCAAAATCAAGTTCCAGTGTCAAGGATTTGTCTGATAAGTCCCCTATTAGCAGAAAGAATGAAAAGAGTTCGAAAAAAACTAGTCCGGTTGCTTCTAACTCGAAGAAACAAAAAGATTCACCTTTGAGAGGCTCGAAAATACTTAATGGAACCGAGGATTTCAATGAATCAATGATGGATAATCCTGATCTAGGACCCTATCTGCTTAAGCAAGCTAGGAGTTTAGTTTCTTCAGGGGAGAATCTGCAGAAAGCTCTCTTATTAGCTCTTCGTGCTGCAAAAGCTTTTGAACTATCTGCAAATGGGAAACCGAGCTTGGAACTCGTTATGTGTTTGCACGTTACAGCAGCTATTTACTGTAGTCTAGGCCAGTACAGTGAGGCAATTCCTCTGTTGGAGCATTCCATTGAGATTCCTGCCATCAAGGAAGGCCAGGAACATGCGCTGGCCAAATTTGCAGGTCACATGCAGCTAGGTGATACCTATGCAATGCTGGGCCTGCTGGAAAATTCTCTAATCTGTTATACAACTGGGTTAGAGGTGCAGAAACAAGTCCTTGGAGAAGCAGACCCCAGAGTTGGTGAGACCTATAGGTATTTAGCTGAAGCCCATGTTCAAGCTTTGCAATTTGATGAGGCTGAGAAGTTTTGTCAAATGGCTCTCAATATACACAAAAAGAACGTCGGTCCTGCTTCTCTTGAGGAGGCTGCGGATAGGAGGCTTATGGGTCTCATATGTGAAACAAAAGGAGATCATGAAGCTGCGCTCGAGCATCTAGTTTTAGCCAGCATGGCTATGGTGGCGAATGGCCAAGAGACTGATGTGGCTGCAGTTGATTGCAGTATTGGAGATTCGTACTTATCCCTGTCCCGTTACGACGAGGCTGTTTTTGCTTACCAGAAAGCTCTCACTGTTTACAAGACCACCAAGGGAGAAAACCATCCAGCAGTTGGTTCGGTATATGTTCGTCTTGCTGATTTATACAACAAAACTGGAAAAACAAGGGAGTCGGTATCGTACTGTGAAAACGCCCTTCGAATTTATGAAAAGCCTATCTCTGGGATTCCTCCTGAGGAGATTGCTAGTGGTCTTACTAATGTTGCTGCTATTTATGAATCAATGAATGAAGCTGAGCAAGCGGTCAAATTATTGCACAAGGCACTGAAGATATATAGTAATGCCCCTGGACAGCAAAGCACGATTGCTGGAATTGAAGCCCAGATGGGTGTGTTGTATTATATGTTAGGGAACTATTCTGAATCTTATGACTCCTTCAAGAACGCTATTCCAAAGCTTCGCAACAGCGGAGAGAAAAAATCTGCTTTCTTTGGGATAGCTCTTAATCAAATGGGGCTTGCTTGTGTTCAGAAATATGCTATTAACGAAGCTGTGGAGCTATTTGAAGAAGCCAAGAGCATACTAGAAAAAGAATACGGGCCTTATCATCCCGACACGCTCGGAGTCTATAGCAACCTTGCTGGAGCATATGATGCGATTGGCAG
GTTGGATGATGCAATTGAAATGTTGGAGTATGTTGTTGGCACGAGAGAGGAAAAACTCGGGACTGCCAATCCTGATGTCGACGATGAGAAGAGGAGGTTGTCTGAATTGTTGAAAGAAGCTGGTAGAGTCCGGAGCCGAAAGGCGAGATCGCTCGAGACTCTTCTTGATGCCAATACTCATCCTGTAAACAGTAAAGGGATTGTACCTGTAACTATGGTTGAAAAATCTTCTAGTTCGTCATCTATTCTGGTTGATCAGCCAGTTGTTCCAGGAGATGTGGTCCTCGACCTCTCAAACATGACCAACGAAACCATCAAGCTTGGAGGCGGTCTTCGACAGGACCATGATGCTATTTCTGTCGCCAAAGTTGGAAAATTAAGGTTCTCAAAACCAAACAAATATTGGGTTGAAAGCTCGCAGAAAAGGTATGTGCCATGTGCAGAAGATTGCGTACTTGGAATCGTGGTTGACTCTAGATCCGATAATTTTCTTGTTGACATAAAAGGTCCTTCATTGGCCTTTCTTCCTGTTCTTGCATTTGAAGGAGGTACAAGACGAAACATTCCCAAGTTTGAGATGGGTGCCCTTCTTTATGTGCGGGTGGTGAAGGCAAACCCAGGTATGAATCCCGAGCTGGCATGCACTGATGCTAGTGGGAAAGCAGCTGGATTTGGCCACCTAAAGGATGGTTACATCTTTGACTGTTCAACTGGCTTATCAAGAATGCTTCTAAGCTCGCCAACATGTCCTGTTCTTGAATCTCTTGGGAAGAAGCTTTCATTCGAGACGGCAGTTGGTATAAATGGCCGAGTTTGGGTCAATGCAGATTCCCCGTCCACAACCATTGTTGTTTCTAATGCAATAATGAACTCAGAAACTCTGAGTGGGGTCCAACAGAGAATCATGGTGGACAAGCTCCTAAATAATTTAAAGCTATCAAGTTAA
Protein sequence
MPGIVMDEINEERAVNRHNGASIHMEESYGNKSPRSGLSVQSHGSVNVDFHVDGLVDTSIEKLYENVYDMQSSDQSPSRRSFGSDGGESRIDSELNHLVGGEMREVEIIKEEEDIVERTENDFPSDSVKDLPSVEIKSTENSQPGSSKRLSSGKKASHLQLNHETSPKSSSSVKDLSDKSPISRKNEKSSKKTSPVASNSKKQKDSPLRGSKILNGTEDFNESMMDNPDLGPYLLKQARSLVSSGENLQKALLLALRAAKAFELSANGKPSLELVMCLHVTAAIYCSLGQYSEAIPLLEHSIEIPAIKEGQEHALAKFAGHMQLGDTYAMLGLLENSLICYTTGLEVQKQVLGEADPRVGETYRYLAEAHVQALQFDEAEKFCQMALNIHKKNVGPASLEEAADRRLMGLICETKGDHEAALEHLVLASMAMVANGQETDVAAVDCSIGDSYLSLSRYDEAVFAYQKALTVYKTTKGENHPAVGSVYVRLADLYNKTGKTRESVSYCENALRIYEKPISGIPPEEIASGLTNVAAIYESMNEAEQAVKLLHKALKIYSNAPGQQSTIAGIEAQMGVLYYMLGNYSESYDSFKNAIPKLRNSGEKKSAFFGIALNQMGLACVQKYAINEAVELFEEAKSILEKEYGPYHPDTLGVYSNLAGAYDAIGRLDDAIEMLEYVVGTREEKLGTANPDVDDEKRRLSELLKEAGRVRSRKARSLETLLDANTHPVNSKGIVPVTMVEKSSSSSSILVDQPVVPGDVVLDLSNMTNETIKLGGGLRQDHDAISVAKVGKLRFSKPNKYWVESSQKRYVPCAEDCVLGIVVDSRSDNFLVDIKGPSLAFLPVLAFEGGTRRNIPKFEMGALLYVRVVKANPGMNPELACTDASGKAAGFGHLKDGYIFDCSTGLSRMLLSSPTCPVLESLGKKLSFETAVGINGRVWVNADSPSTTIVVSNAIMNSETLSGVQQRIMVDKLLNNLKLSS
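The protein sequence above should be recoverable from the CDS by in-frame translation. The following is a minimal sketch, assuming the standard nuclear genetic code (the page does not state which translation table was used); the names `CODON_TABLE` and `translate` are hypothetical and not part of the source database.

```python
# Standard genetic code, with codons enumerated in TCAG order
# (first base, then second base, then third base).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":  # stop codon: end of the polypeptide
            break
        protein.append(amino_acid)
    return "".join(protein)


# First eight codons of the CDS listed above.
print(translate("ATGCCGGGAATTGTCATGGATGAA"))  # MPGIVMDE
```

Passing the full CDS (with the line break removed) should give the listed protein sequence if the assumption about the genetic code holds. Ambiguous bases such as N are not handled in this sketch and would raise a `KeyError`.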
Homology
BLAST of CmaCh11G000970 vs. ExPASy Swiss-Prot
Match:
F4HSX9 (Protein KINESIN LIGHT CHAIN-RELATED 3 OS=Arabidopsis thaliana OX=3702 GN=KLCR3 PE=2 SV=1)
HSP 1 Score: 759.6 bits (1960), Expect = 4.4e-218
Identity = 416/684 (60.82%), Positives = 519/684 (75.88%), Query Frame = 0
Query: 44 GSVNVDF-HVDGLVDTSIEKLYENVYDMQSSDQSPSRRSFGSDGGESRIDSELNHLVGGE 103
GSVN + D + DT+IE+L +N+ ++QSS+QSPSR+SFGS G ES+IDS+L HL GE
Sbjct: 4 GSVNESHSNADQMFDTTIEELCKNLCELQSSNQSPSRQSFGSYGDESKIDSDLQHLALGE 63
Query: 104 MREVEIIKEEEDIVERTENDFPSDSVKDLPSVEIKSTENSQPGSSKRLSSGKKASHLQLN 163
MR+++I+++E D D V ++KS ++ L+
Sbjct: 64 MRDIDILEDEGD----------EDEVAKPEEFDVKSNSSN------------------LD 123
Query: 164 HETSPKSSSSVKDLSDKSPISRKNEKSSKKTSPVASNSKKQKDSPLRGSKILNGTEDFNE 223
E P+ + +K+ K++ +K+K + G+K+ NG E E
Sbjct: 124 LEVMPRDME-----------KQTGKKNVTKSNVGVGGMRKKK---VGGTKLQNGNE---E 183
Query: 224 SMMDNPDLGPYLLKQARSLVSSGENLQKALLLALRAAKAFELSA-NGKPSLELVMCLHVT 283
+N +L +LL QAR+LVSSG++ KAL L RAAK FE SA NGKP LE +MCLHVT
Sbjct: 184 PSSENVELARFLLNQARNLVSSGDSTHKALELTHRAAKLFEASAENGKPCLEWIMCLHVT 243
Query: 284 AAIYCSLGQYSEAIPLLEHSIEIPAIKEGQEHALAKFAGHMQLGDTYAMLGLLENSLICY 343
AA++C L +Y+EAIP+L+ S+EIP ++EG+EHALAKFAG MQLGDTYAM+G LE+S+ CY
Sbjct: 244 AAVHCKLKEYNEAIPVLQRSVEIPVVEEGEEHALAKFAGLMQLGDTYAMVGQLESSISCY 303
Query: 344 TTGLEVQKQVLGEADPRVGETYRYLAEAHVQALQFDEAEKFCQMALNIHKKNVGPASLEE 403
T GL +QK+VLGE DPRVGET RYLAEA VQAL+FDEA++ C+ AL+IH+++ P S+ E
Sbjct: 304 TEGLNIQKKVLGENDPRVGETCRYLAEALVQALRFDEAQQVCETALSIHRESGLPGSIAE 363
Query: 404 AADRRLMGLICETKGDHEAALEHLVLASMAMVANGQETDVAAVDCSIGDSYLSLSRYDEA 463
AADRRLMGLICETKGDHE ALEHLVLASMAM ANGQE++VA VD SIGDSYLSLSR+DEA
Sbjct: 364 AADRRLMGLICETKGDHENALEHLVLASMAMAANGQESEVAFVDTSIGDSYLSLSRFDEA 423
Query: 464 VFAYQKALTVYKTTKGENHPAVGSVYVRLADLYNKTGKTRESVSYCENALRIYEKPISGI 523
+ AYQK+LT KT KGENHPAVGSVY+RLADLYN+TGK RE+ SYCENALRIYE I
Sbjct: 424 ICAYQKSLTALKTAKGENHPAVGSVYIRLADLYNRTGKVREAKSYCENALRIYESHNLEI 483
Query: 524 PPEEIASGLTNVAAIYESMNEAEQAVKLLHKALKIYSNAPGQQSTIAGIEAQMGVLYYML 583
PEEIASGLT+++ I ESMNE EQA+ LL KALKIY+++PGQ+ IAGIEAQMGVLYYM+
Sbjct: 484 SPEEIASGLTDISVICESMNEVEQAITLLQKALKIYADSPGQKIMIAGIEAQMGVLYYMM 543
Query: 584 GNYSESYDSFKNAIPKLRNSGEKKSAFFGIALNQMGLACVQKYAINEAVELFEEAKSILE 643
G Y ESY++FK+AI KLR +G+K+S FFGIALNQMGLAC+Q AI EAVELFEEAK ILE
Sbjct: 544 GKYMESYNTFKSAISKLRATGKKQSTFFGIALNQMGLACIQLDAIEEAVELFEEAKCILE 603
Query: 644 KEYGPYHPDTLGVYSNLAGAYDAIGRLDDAIEMLEYVVGTREEKLGTANPDVDDEKRRLS 703
+E GPYHP+TLG+YSNLAGAYDAIGRLDDAI++L +VVG REEKLGTANP +DEKRRL+
Sbjct: 604 QECGPYHPETLGLYSNLAGAYDAIGRLDDAIKLLGHVVGVREEKLGTANPVTEDEKRRLA 642
Query: 704 ELLKEAGRVRSRKARSLETLLDAN 726
+LLKEAG V RKA+SL+TL+D++
Sbjct: 664 QLLKEAGNVTGRKAKSLKTLIDSD 642
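The percentages in the HSP statistics line are the number of matching columns divided by the alignment length. As a small worked check, assuming the counts reported for this first hit (416 identical and 519 positive-scoring columns out of 684), the figures can be reproduced as follows; the helper name `percent` is hypothetical.

```python
def percent(matches, aligned_length):
    """Return matches as a percentage of the alignment length, to 2 decimals."""
    return round(100.0 * matches / aligned_length, 2)


print(percent(416, 684))  # identical columns -> 60.82
print(percent(519, 684))  # positive-scoring columns -> 75.88
```

Note that the denominator is the length of the aligned region (including gap columns), not the full length of the query protein.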
BLAST of CmaCh11G000970 vs. ExPASy Swiss-Prot
Match:
Q9LII8 (Protein KINESIN LIGHT CHAIN-RELATED 2 OS=Arabidopsis thaliana OX=3702 GN=KLCR2 PE=1 SV=1)
HSP 1 Score: 745.3 bits (1923), Expect = 8.5e-214
Identity = 422/712 (59.27%), Positives = 513/712 (72.05%), Query Frame = 0
Query: 26 EESYGNKSPRSGLSVQSHGSVNVDFHVDGLVDTSIEKLYENVYDMQSS-DQSPSRRSFGS 85
++S SPRS LS ++D +DG ++ SIE+LY NV +M+SS DQSPSR SF S
Sbjct: 12 DDSALQASPRSPLS-------SIDLAIDGAMNASIEQLYHNVCEMESSDDQSPSRASFIS 71
Query: 86 DGGESRIDSELNHLVGGEMREVEIIKEEEDIVERTENDFPSDSVKDLPSVEIKSTENSQP 145
G ESRID EL HLVG E E ++E I+E+ E +S
Sbjct: 72 YGAESRIDLELRHLVGDVGEEGE--SKKEIILEKKE----------------ESNGEGSL 131
Query: 146 GSSKRLSSGKKASHLQLNHETSPKSSSSVKDLSDKSPISRKNEKSSKKTSPVASNSKKQK 205
K LS+GKK + +TSP + K P SR + + S V+ +
Sbjct: 132 SQKKPLSNGKKVA------KTSPN--------NPKMPGSRISSRKSPDLGKVSVDE---- 191
Query: 206 DSPLRGSKILNGTEDFNESMMDNPDLGPYLLKQARSLVSSGENLQKALLLALRAAKAFEL 265
++P+LG LLKQAR LVSSGENL KAL LALRA K FE
Sbjct: 192 ---------------------ESPELGVVLLKQARELVSSGENLNKALDLALRAVKVFEK 251
Query: 266 SANGKP--SLELVMCLHVTAAIYCSLGQYSEAIPLLEHSIEIPAIKEGQEHALAKFAGHM 325
G+ L LVM LH+ AAIY LG+Y++A+P+LE SIEIP I++G++HALAKFAG M
Sbjct: 252 CGEGEKQLGLNLVMSLHILAAIYAGLGRYNDAVPVLERSIEIPMIEDGEDHALAKFAGCM 311
Query: 326 QLGDTYAMLGLLENSLICYTTGLEVQKQVLGEADPRVGETYRYLAEAHVQALQFDEAEKF 385
QLGD Y ++G +ENS++ YT GLE+Q+QVLGE+D RVGET RYLAEAHVQA+QF+EA +
Sbjct: 312 QLGDMYGLMGQVENSIMLYTAGLEIQRQVLGESDARVGETCRYLAEAHVQAMQFEEASRL 371
Query: 386 CQMALNIHKKN--VGPASLEEAADRRLMGLICETKGDHEAALEHLVLASMAMVANGQETD 445
CQMAL+IHK+N AS+EEAADR+LMGLIC+ KGD+E ALEH VLASMAM + D
Sbjct: 372 CQMALDIHKENGAAATASIEEAADRKLMGLICDAKGDYEVALEHYVLASMAMSSQNHRED 431
Query: 446 VAAVDCSIGDSYLSLSRYDEAVFAYQKALTVYKTTKGENHPAVGSVYVRLADLYNKTGKT 505
VAAVDCSIGD+Y+SL+R+DEA+FAYQKAL V+K KGE H +V VYVRLADLYNK GKT
Sbjct: 432 VAAVDCSIGDAYMSLARFDEAIFAYQKALAVFKQGKGETHSSVALVYVRLADLYNKIGKT 491
Query: 506 RESVSYCENALRIYEKPISGIPPEEIASGLTNVAAIYESMNEAEQAVKLLHKALKIYSNA 565
R+S SYCENAL+IY KP G P EE+A+G ++AIY+SMNE +QA+KLL +ALKIY+NA
Sbjct: 492 RDSKSYCENALKIYLKPTPGTPMEEVATGFIEISAIYQSMNELDQALKLLRRALKIYANA 551
Query: 566 PGQQSTIAGIEAQMGVLYYMLGNYSESYDSFKNAIPKLRNSGEKKSAFFGIALNQMGLAC 625
PGQQ+TIAGIEAQMGV+ YM+GNYSESYD FK+AI K RNSGEKK+A FGIALNQMGLAC
Sbjct: 552 PGQQNTIAGIEAQMGVVTYMMGNYSESYDIFKSAISKFRNSGEKKTALFGIALNQMGLAC 611
Query: 626 VQKYAINEAVELFEEAKSILEKEYGPYHPDTLGVYSNLAGAYDAIGRLDDAIEMLEYVVG 685
VQ+YAINEA +LFEEAK+ILEKE GPYHPDTL VYSNLAG YDA+GRLDDAIE+LEYVVG
Sbjct: 612 VQRYAINEAADLFEEAKTILEKECGPYHPDTLAVYSNLAGTYDAMGRLDDAIEILEYVVG 659
Query: 686 TREEKLGTANPDVDDEKRRLSELLKEAGRVRSRKARSLETLLDANTHPVNSK 733
TREEKLGTANP+V+DEK+RL+ LLKEAGR RS++ R+L TLLD N N +
Sbjct: 672 TREEKLGTANPEVEDEKQRLAALLKEAGRGRSKRNRALLTLLDNNPEIANGQ 659
BLAST of CmaCh11G000970 vs. ExPASy Swiss-Prot
Match:
O81629 (Protein KINESIN LIGHT CHAIN-RELATED 1 OS=Arabidopsis thaliana OX=3702 GN=KLCR1 PE=1 SV=1)
HSP 1 Score: 641.7 bits (1654), Expect = 1.3e-182
Identity = 327/561 (58.29%), Positives = 429/561 (76.47%), Query Frame = 0
Query: 179 KSPISRKNEKSSKKTSPVASNSKKQKDSPLRGSKILNGTEDFNESMMDNPDLGPYLLKQA 238
++P+ + + ++ P + S +KDSP S D ++ +DNPDLGP+LLK A
Sbjct: 34 RTPMKKTPSSTPSRSKPSPNRSTGKKDSPTVSSSTA-AVIDVDDPSLDNPDLGPFLLKLA 93
Query: 239 RSLVSSGENLQKALLLALRAAKAFEL-----------SANGKPSLELVMCLHVTAAIYCS 298
R ++SGE KAL A+RA K+FE ++G P L+L M LHV AAIYCS
Sbjct: 94 RDAIASGEGPNKALDYAIRATKSFERCCAAVAPPIPGGSDGGPVLDLAMSLHVLAAIYCS 153
Query: 299 LGQYSEAIPLLEHSIEIPAIKEGQEHALAKFAGHMQLGDTYAMLGLLENSLICYTTGLEV 358
LG++ EA+P LE +I++P G +H+LA F+GHMQLGDT +MLG ++ S+ CY GL++
Sbjct: 154 LGRFDEAVPPLERAIQVPDPTRGPDHSLAAFSGHMQLGDTLSMLGQIDRSIACYEEGLKI 213
Query: 359 QKQVLGEADPRVGETYRYLAEAHVQALQFDEAEKFCQMALNIHKKNVGPASLEEAADRRL 418
Q Q LG+ DPRVGET RYLAEA+VQA+QF++AE+ C+ L IH+ + PASLEEAADRRL
Sbjct: 214 QIQTLGDTDPRVGETCRYLAEAYVQAMQFNKAEELCKKTLEIHRAHSEPASLEEAADRRL 273
Query: 419 MGLICETKGDHEAALEHLVLASMAMVANGQETDVAAVDCSIGDSYLSLSRYDEAVFAYQK 478
M +ICE KGD+E ALEHLVLASMAM+A+GQE++VA++D SIG+ Y+SL R+DEAVF+YQK
Sbjct: 274 MAIICEAKGDYENALEHLVLASMAMIASGQESEVASIDVSIGNIYMSLCRFDEAVFSYQK 333
Query: 479 ALTVYKTTKGENHPAVGSVYVRLADLYNKTGKTRESVSYCENALRIYEKPISGIPPEEIA 538
ALTV+K +KGE HP V SV+VRLA+LY++TGK RES SYCENALRIY KP+ G EEIA
Sbjct: 334 ALTVFKASKGETHPTVASVFVRLAELYHRTGKLRESKSYCENALRIYNKPVPGTTVEEIA 393
Query: 539 SGLTNVAAIYESMNEAEQAVKLLHKALKIYSNAPGQQSTIAGIEAQMGVLYYMLGNYSES 598
GLT ++AIYES++E E+A+KLL K++K+ + PGQQS IAG+EA+MGV+YY +G Y ++
Sbjct: 394 GGLTEISAIYESVDEPEEALKLLQKSMKLLEDKPGQQSAIAGLEARMGVMYYTVGRYEDA 453
Query: 599 YDSFKNAIPKLRNSGEKKSAFFGIALNQMGLACVQKYAINEAVELFEEAKSILEKEYGPY 658
++F++A+ KLR +GE KSAFFG+ LNQMGLACVQ + I+EA ELFEEA+ ILE+E GP
Sbjct: 454 RNAFESAVTKLRAAGE-KSAFFGVVLNQMGLACVQLFKIDEAGELFEEARGILEQERGPC 513
Query: 659 HPDTLGVYSNLAGAYDAIGRLDDAIEMLEYVVGTREEKLGTANPDVDDEKRRLSELLKEA 718
DTLGVYSNLA YDA+GR++DAIE+LE V+ REEKLGTANPD +DEK+RL+ELLKEA
Sbjct: 514 DQDTLGVYSNLAATYDAMGRIEDAIEILEQVLKLREEKLGTANPDFEDEKKRLAELLKEA 573
Query: 719 GRVRSRKARSLETLLDANTHP 729
GR R+ KA+SL+ L+D N P
Sbjct: 574 GRSRNYKAKSLQNLIDPNARP 592
BLAST of CmaCh11G000970 vs. ExPASy Swiss-Prot
Match:
Q7KWX9 (Putative exosome complex component rrp40 OS=Dictyostelium discoideum OX=44689 GN=exosc3 PE=3 SV=1)
HSP 1 Score: 186.8 bits (473), Expect = 1.2e-45
Identity = 100/227 (44.05%), Positives = 138/227 (60.79%), Query Frame = 0
Query: 750 LVDQPVVPGDVVLDLSNMTNETIKLGGGLRQDHDAISVAKVGKLRFSKPNK-YWVESSQK 809
L DQ VVPGDV+ + ++ +++G GL Q D + K G LR+SK ++ YW+E+ QK
Sbjct: 4 LKDQFVVPGDVIGKIGDL---KVRIGPGLLQTKDTVLATKAGVLRYSKFHRFYWIENEQK 63
Query: 810 RYVPCAEDCVLGIVVDSRSDNFLVDIKGPSLAFLPVLAFEGGTRRNIPKFEMGALLYVRV 869
RYVP ED V+G +++ +++F VDI A L +FEG T+ N P +G L+Y RV
Sbjct: 64 RYVPQVEDMVIGTIIEKHAESFKVDIGSSCSALLSAYSFEGATKSNKPLLNVGNLIYCRV 123
Query: 870 VKANPGMNPELACTDASGKAAGFGHLKDGYIFDCSTGLSRMLLSSPTCPVLESLGKKLSF 929
AN M PE+ C KA GFG L GY+ +CS GLS LLS C +L+ LGK + +
Sbjct: 124 TVANRDMEPEVVCLSQKQKAEGFGQLIGGYMLNCSLGLSHYLLSE-DCFLLQILGKHIPY 183
Query: 930 ETAVGINGRVWVNADSPSTTIVVSNAIMNSETLSGVQQRIMVDKLLN 976
E AVG+NGRVW+N+ S TIVVSN I NS+ + Q + K L+
Sbjct: 184 EIAVGVNGRVWINSGSNHNTIVVSNTIYNSQYIQDDQIEPFILKSLS 226
BLAST of CmaCh11G000970 vs. ExPASy Swiss-Prot
Match:
Q8IPX7 (Exosome complex component RRP40 OS=Drosophila melanogaster OX=7227 GN=Rrp40 PE=1 SV=1)
HSP 1 Score: 164.1 bits (414), Expect = 8.1e-39
Identity = 87/224 (38.84%), Positives = 131/224 (58.48%), Query Frame = 0
Query: 755 VVPGDVVLDLSNMT-NETIKLGGGLRQDHDAISVAKVGKLRFSKPNKYWVESSQKRYVPC 814
V+PG+ + + + ++ + LG GLR+ D + +K G LR +P +WV++ Q+RY+P
Sbjct: 8 VMPGERIAAIEELAKSKRVILGPGLRRLDDTVVASKAGPLRHKEPGTFWVDNYQRRYIPA 67
Query: 815 AEDCVLGIVVDSRSDNFLVDIKGPSLAFLPVLAFEGGTRRNIPKFEMGALLYVRVVKANP 874
D +LGIV D + VDI A + LAFE +++N P G L+Y RV+ A+
Sbjct: 68 RGDLILGIVRAKAGDLYRVDIGATDTASISYLAFEAASKKNRPDLIPGDLIYARVLNASA 127
Query: 875 GMNPELACTDASGKAAGFGHLKDGYIFDCSTGLSRMLLSSPTCPVLESLGKKLSFETAVG 934
+ PEL C ++ GK+ G L DG+ F CS L RMLL CPVL +L ++L +E AVG
Sbjct: 128 DIEPELVCVNSVGKSGKLGVLTDGFFFKCSLNLGRMLLRE-NCPVLAALTRELPYEIAVG 187
Query: 935 INGRVWVNADSPSTTIVVSNAIMNSETLSGVQQRIMVDKLLNNL 978
+NGR+W+ A S T+ ++NAI E SG + +DK+ NL
Sbjct: 188 VNGRIWLKAHSLKETVALANAISALEQ-SGCAE---IDKICGNL 226
BLAST of CmaCh11G000970 vs. TAIR 10
Match:
AT1G27500.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)
HSP 1 Score: 759.6 bits (1960), Expect = 3.1e-219
Identity = 416/684 (60.82%), Positives = 519/684 (75.88%), Query Frame = 0
Query: 44 GSVNVDF-HVDGLVDTSIEKLYENVYDMQSSDQSPSRRSFGSDGGESRIDSELNHLVGGE 103
GSVN + D + DT+IE+L +N+ ++QSS+QSPSR+SFGS G ES+IDS+L HL GE
Sbjct: 4 GSVNESHSNADQMFDTTIEELCKNLCELQSSNQSPSRQSFGSYGDESKIDSDLQHLALGE 63
Query: 104 MREVEIIKEEEDIVERTENDFPSDSVKDLPSVEIKSTENSQPGSSKRLSSGKKASHLQLN 163
MR+++I+++E D D V ++KS ++ L+
Sbjct: 64 MRDIDILEDEGD----------EDEVAKPEEFDVKSNSSN------------------LD 123
Query: 164 HETSPKSSSSVKDLSDKSPISRKNEKSSKKTSPVASNSKKQKDSPLRGSKILNGTEDFNE 223
E P+ + +K+ K++ +K+K + G+K+ NG E E
Sbjct: 124 LEVMPRDME-----------KQTGKKNVTKSNVGVGGMRKKK---VGGTKLQNGNE---E 183
Query: 224 SMMDNPDLGPYLLKQARSLVSSGENLQKALLLALRAAKAFELSA-NGKPSLELVMCLHVT 283
+N +L +LL QAR+LVSSG++ KAL L RAAK FE SA NGKP LE +MCLHVT
Sbjct: 184 PSSENVELARFLLNQARNLVSSGDSTHKALELTHRAAKLFEASAENGKPCLEWIMCLHVT 243
Query: 284 AAIYCSLGQYSEAIPLLEHSIEIPAIKEGQEHALAKFAGHMQLGDTYAMLGLLENSLICY 343
AA++C L +Y+EAIP+L+ S+EIP ++EG+EHALAKFAG MQLGDTYAM+G LE+S+ CY
Sbjct: 244 AAVHCKLKEYNEAIPVLQRSVEIPVVEEGEEHALAKFAGLMQLGDTYAMVGQLESSISCY 303
Query: 344 TTGLEVQKQVLGEADPRVGETYRYLAEAHVQALQFDEAEKFCQMALNIHKKNVGPASLEE 403
T GL +QK+VLGE DPRVGET RYLAEA VQAL+FDEA++ C+ AL+IH+++ P S+ E
Sbjct: 304 TEGLNIQKKVLGENDPRVGETCRYLAEALVQALRFDEAQQVCETALSIHRESGLPGSIAE 363
Query: 404 AADRRLMGLICETKGDHEAALEHLVLASMAMVANGQETDVAAVDCSIGDSYLSLSRYDEA 463
AADRRLMGLICETKGDHE ALEHLVLASMAM ANGQE++VA VD SIGDSYLSLSR+DEA
Sbjct: 364 AADRRLMGLICETKGDHENALEHLVLASMAMAANGQESEVAFVDTSIGDSYLSLSRFDEA 423
Query: 464 VFAYQKALTVYKTTKGENHPAVGSVYVRLADLYNKTGKTRESVSYCENALRIYEKPISGI 523
+ AYQK+LT KT KGENHPAVGSVY+RLADLYN+TGK RE+ SYCENALRIYE I
Sbjct: 424 ICAYQKSLTALKTAKGENHPAVGSVYIRLADLYNRTGKVREAKSYCENALRIYESHNLEI 483
Query: 524 PPEEIASGLTNVAAIYESMNEAEQAVKLLHKALKIYSNAPGQQSTIAGIEAQMGVLYYML 583
PEEIASGLT+++ I ESMNE EQA+ LL KALKIY+++PGQ+ IAGIEAQMGVLYYM+
Sbjct: 484 SPEEIASGLTDISVICESMNEVEQAITLLQKALKIYADSPGQKIMIAGIEAQMGVLYYMM 543
Query: 584 GNYSESYDSFKNAIPKLRNSGEKKSAFFGIALNQMGLACVQKYAINEAVELFEEAKSILE 643
G Y ESY++FK+AI KLR +G+K+S FFGIALNQMGLAC+Q AI EAVELFEEAK ILE
Sbjct: 544 GKYMESYNTFKSAISKLRATGKKQSTFFGIALNQMGLACIQLDAIEEAVELFEEAKCILE 603
Query: 644 KEYGPYHPDTLGVYSNLAGAYDAIGRLDDAIEMLEYVVGTREEKLGTANPDVDDEKRRLS 703
+E GPYHP+TLG+YSNLAGAYDAIGRLDDAI++L +VVG REEKLGTANP +DEKRRL+
Sbjct: 604 QECGPYHPETLGLYSNLAGAYDAIGRLDDAIKLLGHVVGVREEKLGTANPVTEDEKRRLA 642
Query: 704 ELLKEAGRVRSRKARSLETLLDAN 726
+LLKEAG V RKA+SL+TL+D++
Sbjct: 664 QLLKEAGNVTGRKAKSLKTLIDSD 642
BLAST of CmaCh11G000970 vs. TAIR 10
Match:
AT3G27960.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)
HSP 1 Score: 745.3 bits (1923), Expect = 6.0e-215
Identity = 422/712 (59.27%), Positives = 513/712 (72.05%), Query Frame = 0
Query: 26 EESYGNKSPRSGLSVQSHGSVNVDFHVDGLVDTSIEKLYENVYDMQSS-DQSPSRRSFGS 85
++S SPRS LS ++D +DG ++ SIE+LY NV +M+SS DQSPSR SF S
Sbjct: 12 DDSALQASPRSPLS-------SIDLAIDGAMNASIEQLYHNVCEMESSDDQSPSRASFIS 71
Query: 86 DGGESRIDSELNHLVGGEMREVEIIKEEEDIVERTENDFPSDSVKDLPSVEIKSTENSQP 145
G ESRID EL HLVG E E ++E I+E+ E +S
Sbjct: 72 YGAESRIDLELRHLVGDVGEEGE--SKKEIILEKKE----------------ESNGEGSL 131
Query: 146 GSSKRLSSGKKASHLQLNHETSPKSSSSVKDLSDKSPISRKNEKSSKKTSPVASNSKKQK 205
K LS+GKK + +TSP + K P SR + + S V+ +
Sbjct: 132 SQKKPLSNGKKVA------KTSPN--------NPKMPGSRISSRKSPDLGKVSVDE---- 191
Query: 206 DSPLRGSKILNGTEDFNESMMDNPDLGPYLLKQARSLVSSGENLQKALLLALRAAKAFEL 265
++P+LG LLKQAR LVSSGENL KAL LALRA K FE
Sbjct: 192 ---------------------ESPELGVVLLKQARELVSSGENLNKALDLALRAVKVFEK 251
Query: 266 SANGKP--SLELVMCLHVTAAIYCSLGQYSEAIPLLEHSIEIPAIKEGQEHALAKFAGHM 325
G+ L LVM LH+ AAIY LG+Y++A+P+LE SIEIP I++G++HALAKFAG M
Sbjct: 252 CGEGEKQLGLNLVMSLHILAAIYAGLGRYNDAVPVLERSIEIPMIEDGEDHALAKFAGCM 311
Query: 326 QLGDTYAMLGLLENSLICYTTGLEVQKQVLGEADPRVGETYRYLAEAHVQALQFDEAEKF 385
QLGD Y ++G +ENS++ YT GLE+Q+QVLGE+D RVGET RYLAEAHVQA+QF+EA +
Sbjct: 312 QLGDMYGLMGQVENSIMLYTAGLEIQRQVLGESDARVGETCRYLAEAHVQAMQFEEASRL 371
Query: 386 CQMALNIHKKN--VGPASLEEAADRRLMGLICETKGDHEAALEHLVLASMAMVANGQETD 445
CQMAL+IHK+N AS+EEAADR+LMGLIC+ KGD+E ALEH VLASMAM + D
Sbjct: 372 CQMALDIHKENGAAATASIEEAADRKLMGLICDAKGDYEVALEHYVLASMAMSSQNHRED 431
Query: 446 VAAVDCSIGDSYLSLSRYDEAVFAYQKALTVYKTTKGENHPAVGSVYVRLADLYNKTGKT 505
VAAVDCSIGD+Y+SL+R+DEA+FAYQKAL V+K KGE H +V VYVRLADLYNK GKT
Sbjct: 432 VAAVDCSIGDAYMSLARFDEAIFAYQKALAVFKQGKGETHSSVALVYVRLADLYNKIGKT 491
Query: 506 RESVSYCENALRIYEKPISGIPPEEIASGLTNVAAIYESMNEAEQAVKLLHKALKIYSNA 565
R+S SYCENAL+IY KP G P EE+A+G ++AIY+SMNE +QA+KLL +ALKIY+NA
Sbjct: 492 RDSKSYCENALKIYLKPTPGTPMEEVATGFIEISAIYQSMNELDQALKLLRRALKIYANA 551
Query: 566 PGQQSTIAGIEAQMGVLYYMLGNYSESYDSFKNAIPKLRNSGEKKSAFFGIALNQMGLAC 625
PGQQ+TIAGIEAQMGV+ YM+GNYSESYD FK+AI K RNSGEKK+A FGIALNQMGLAC
Sbjct: 552 PGQQNTIAGIEAQMGVVTYMMGNYSESYDIFKSAISKFRNSGEKKTALFGIALNQMGLAC 611
Query: 626 VQKYAINEAVELFEEAKSILEKEYGPYHPDTLGVYSNLAGAYDAIGRLDDAIEMLEYVVG 685
VQ+YAINEA +LFEEAK+ILEKE GPYHPDTL VYSNLAG YDA+GRLDDAIE+LEYVVG
Sbjct: 612 VQRYAINEAADLFEEAKTILEKECGPYHPDTLAVYSNLAGTYDAMGRLDDAIEILEYVVG 659
Query: 686 TREEKLGTANPDVDDEKRRLSELLKEAGRVRSRKARSLETLLDANTHPVNSK 733
TREEKLGTANP+V+DEK+RL+ LLKEAGR RS++ R+L TLLD N N +
Sbjct: 672 TREEKLGTANPEVEDEKQRLAALLKEAGRGRSKRNRALLTLLDNNPEIANGQ 659
BLAST of CmaCh11G000970 vs. TAIR 10
Match:
AT4G10840.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)
HSP 1 Score: 641.7 bits (1654), Expect = 9.4e-184
Identity = 327/561 (58.29%), Positives = 429/561 (76.47%), Query Frame = 0
Query: 179 KSPISRKNEKSSKKTSPVASNSKKQKDSPLRGSKILNGTEDFNESMMDNPDLGPYLLKQA 238
++P+ + + ++ P + S +KDSP S D ++ +DNPDLGP+LLK A
Sbjct: 34 RTPMKKTPSSTPSRSKPSPNRSTGKKDSPTVSSSTA-AVIDVDDPSLDNPDLGPFLLKLA 93
Query: 239 RSLVSSGENLQKALLLALRAAKAFEL-----------SANGKPSLELVMCLHVTAAIYCS 298
R ++SGE KAL A+RA K+FE ++G P L+L M LHV AAIYCS
Sbjct: 94 RDAIASGEGPNKALDYAIRATKSFERCCAAVAPPIPGGSDGGPVLDLAMSLHVLAAIYCS 153
Query: 299 LGQYSEAIPLLEHSIEIPAIKEGQEHALAKFAGHMQLGDTYAMLGLLENSLICYTTGLEV 358
LG++ EA+P LE +I++P G +H+LA F+GHMQLGDT +MLG ++ S+ CY GL++
Sbjct: 154 LGRFDEAVPPLERAIQVPDPTRGPDHSLAAFSGHMQLGDTLSMLGQIDRSIACYEEGLKI 213
Query: 359 QKQVLGEADPRVGETYRYLAEAHVQALQFDEAEKFCQMALNIHKKNVGPASLEEAADRRL 418
Q Q LG+ DPRVGET RYLAEA+VQA+QF++AE+ C+ L IH+ + PASLEEAADRRL
Sbjct: 214 QIQTLGDTDPRVGETCRYLAEAYVQAMQFNKAEELCKKTLEIHRAHSEPASLEEAADRRL 273
Query: 419 MGLICETKGDHEAALEHLVLASMAMVANGQETDVAAVDCSIGDSYLSLSRYDEAVFAYQK 478
M +ICE KGD+E ALEHLVLASMAM+A+GQE++VA++D SIG+ Y+SL R+DEAVF+YQK
Sbjct: 274 MAIICEAKGDYENALEHLVLASMAMIASGQESEVASIDVSIGNIYMSLCRFDEAVFSYQK 333
Query: 479 ALTVYKTTKGENHPAVGSVYVRLADLYNKTGKTRESVSYCENALRIYEKPISGIPPEEIA 538
ALTV+K +KGE HP V SV+VRLA+LY++TGK RES SYCENALRIY KP+ G EEIA
Sbjct: 334 ALTVFKASKGETHPTVASVFVRLAELYHRTGKLRESKSYCENALRIYNKPVPGTTVEEIA 393
Query: 539 SGLTNVAAIYESMNEAEQAVKLLHKALKIYSNAPGQQSTIAGIEAQMGVLYYMLGNYSES 598
GLT ++AIYES++E E+A+KLL K++K+ + PGQQS IAG+EA+MGV+YY +G Y ++
Sbjct: 394 GGLTEISAIYESVDEPEEALKLLQKSMKLLEDKPGQQSAIAGLEARMGVMYYTVGRYEDA 453
Query: 599 YDSFKNAIPKLRNSGEKKSAFFGIALNQMGLACVQKYAINEAVELFEEAKSILEKEYGPY 658
++F++A+ KLR +GE KSAFFG+ LNQMGLACVQ + I+EA ELFEEA+ ILE+E GP
Sbjct: 454 RNAFESAVTKLRAAGE-KSAFFGVVLNQMGLACVQLFKIDEAGELFEEARGILEQERGPC 513
Query: 659 HPDTLGVYSNLAGAYDAIGRLDDAIEMLEYVVGTREEKLGTANPDVDDEKRRLSELLKEA 718
DTLGVYSNLA YDA+GR++DAIE+LE V+ REEKLGTANPD +DEK+RL+ELLKEA
Sbjct: 514 DQDTLGVYSNLAATYDAMGRIEDAIEILEQVLKLREEKLGTANPDFEDEKKRLAELLKEA 573
Query: 719 GRVRSRKARSLETLLDANTHP 729
GR R+ KA+SL+ L+D N P
Sbjct: 574 GRSRNYKAKSLQNLIDPNARP 592
BLAST of CmaCh11G000970 vs. TAIR 10
Match:
AT4G10840.2 (Tetratricopeptide repeat (TPR)-like superfamily protein)
HSP 1 Score: 564.3 bits (1453), Expect = 1.9e-160
Identity = 287/500 (57.40%), Positives = 378/500 (75.60%), Query Frame = 0
Query: 179 KSPISRKNEKSSKKTSPVASNSKKQKDSPLRGSKILNGTEDFNESMMDNPDLGPYLLKQA 238
++P+ + + ++ P + S +KDSP S D ++ +DNPDLGP+LLK A
Sbjct: 34 RTPMKKTPSSTPSRSKPSPNRSTGKKDSPTVSSSTA-AVIDVDDPSLDNPDLGPFLLKLA 93
Query: 239 RSLVSSGENLQKALLLALRAAKAFEL-----------SANGKPSLELVMCLHVTAAIYCS 298
R ++SGE KAL A+RA K+FE ++G P L+L M LHV AAIYCS
Sbjct: 94 RDAIASGEGPNKALDYAIRATKSFERCCAAVAPPIPGGSDGGPVLDLAMSLHVLAAIYCS 153
Query: 299 LGQYSEAIPLLEHSIEIPAIKEGQEHALAKFAGHMQLGDTYAMLGLLENSLICYTTGLEV 358
LG++ EA+P LE +I++P G +H+LA F+GHMQLGDT +MLG ++ S+ CY GL++
Sbjct: 154 LGRFDEAVPPLERAIQVPDPTRGPDHSLAAFSGHMQLGDTLSMLGQIDRSIACYEEGLKI 213
Query: 359 QKQVLGEADPRVGETYRYLAEAHVQALQFDEAEKFCQMALNIHKKNVGPASLEEAADRRL 418
Q Q LG+ DPRVGET RYLAEA+VQA+QF++AE+ C+ L IH+ + PASLEEAADRRL
Sbjct: 214 QIQTLGDTDPRVGETCRYLAEAYVQAMQFNKAEELCKKTLEIHRAHSEPASLEEAADRRL 273
Query: 419 MGLICETKGDHEAALEHLVLASMAMVANGQETDVAAVDCSIGDSYLSLSRYDEAVFAYQK 478
M +ICE KGD+E ALEHLVLASMAM+A+GQE++VA++D SIG+ Y+SL R+DEAVF+YQK
Sbjct: 274 MAIICEAKGDYENALEHLVLASMAMIASGQESEVASIDVSIGNIYMSLCRFDEAVFSYQK 333
Query: 479 ALTVYKTTKGENHPAVGSVYVRLADLYNKTGKTRESVSYCENALRIYEKPISGIPPEEIA 538
ALTV+K +KGE HP V SV+VRLA+LY++TGK RES SYCENALRIY KP+ G EEIA
Sbjct: 334 ALTVFKASKGETHPTVASVFVRLAELYHRTGKLRESKSYCENALRIYNKPVPGTTVEEIA 393
Query: 539 SGLTNVAAIYESMNEAEQAVKLLHKALKIYSNAPGQQSTIAGIEAQMGVLYYMLGNYSES 598
GLT ++AIYES++E E+A+KLL K++K+ + PGQQS IAG+EA+MGV+YY +G Y ++
Sbjct: 394 GGLTEISAIYESVDEPEEALKLLQKSMKLLEDKPGQQSAIAGLEARMGVMYYTVGRYEDA 453
Query: 599 YDSFKNAIPKLRNSGEKKSAFFGIALNQMGLACVQKYAINEAVELFEEAKSILEKEYGPY 658
++F++A+ KLR +GE KSAFFG+ LNQMGLACVQ + I+EA ELFEEA+ ILE+E GP
Sbjct: 454 RNAFESAVTKLRAAGE-KSAFFGVVLNQMGLACVQLFKIDEAGELFEEARGILEQERGPC 513
Query: 659 HPDTLGVYSNLAGAYDAIGR 668
DTLGVYSNLA YDA+GR
Sbjct: 514 DQDTLGVYSNLAATYDAMGR 531
BLAST of CmaCh11G000970 vs. TAIR 10
Match:
AT2G25355.1 (PNAS-3 related)
HSP 1 Score: 350.9 bits (899), Expect = 3.3e-96
Identity = 171/234 (73.08%), Positives = 197/234 (84.19%), Query Frame = 0
Query: 744 SSSSSILVDQPVVPGDVVLDLSNMTNETIKLGGGLRQDHDAISVAKVGKLRFSKPNKYWV 803
S+S + L+DQ VVPGDVVLDLSNMTN+TIKLG GLRQD++ ISV + GKLR+SKPNKYWV
Sbjct: 6 STSPTSLIDQTVVPGDVVLDLSNMTNQTIKLGSGLRQDNEVISVMRAGKLRYSKPNKYWV 65
Query: 804 ESSQKRYVPCAEDCVLGIVVDSRSDNFLVDIKGPSLAFLPVLAFEGGTRRNIPKFEMGAL 863
ESS KRYVP ED VLGIVVD + +N+ +DIKGP LA LPVLAFEG RRN PKFE+ L
Sbjct: 66 ESSHKRYVPRPEDHVLGIVVDCKGENYWIDIKGPQLALLPVLAFEGANRRNYPKFEVSTL 125
Query: 864 LYVRVVKANPGMNPELACTDASGKAAGFGHLKDGYIFDCSTGLSRMLLSSPTCPVLESLG 923
LY RVVK N GMNPEL+C D SGKAAGFG LKDG++F+ STGLSRMLLSSPTCPVLE+LG
Sbjct: 126 LYTRVVKTNTGMNPELSCVDESGKAAGFGPLKDGFMFETSTGLSRMLLSSPTCPVLEALG 185
Query: 924 KKLSFETAVGINGRVWVNADSPSTTIVVSNAIMNSETLSGVQQRIMVDKLLNNL 978
KKLSFETA G+NGR WV+A +P I+V+NA+MNSETLSG QQRIMV+KLL +
Sbjct: 186 KKLSFETAFGLNGRCWVHAAAPRIVIIVANALMNSETLSGTQQRIMVEKLLEKI 239
The following BLAST results are available for this feature:
ExPASy Swiss-Prot
Match Name | E-value | Identity (%) | Description
F4HSX9 | 4.4e-218 | 60.82 | Protein KINESIN LIGHT CHAIN-RELATED 3 OS=Arabidopsis thaliana OX=3702 GN=KLCR3 P...
Q9LII8 | 8.5e-214 | 59.27 | Protein KINESIN LIGHT CHAIN-RELATED 2 OS=Arabidopsis thaliana OX=3702 GN=KLCR2 P...
O81629 | 1.3e-182 | 58.29 | Protein KINESIN LIGHT CHAIN-RELATED 1 OS=Arabidopsis thaliana OX=3702 GN=KLCR1 P...
Q7KWX9 | 1.2e-45 | 44.05 | Putative exosome complex component rrp40 OS=Dictyostelium discoideum OX=44689 GN...
Q8IPX7 | 8.1e-39 | 38.84 | Exosome complex component RRP40 OS=Drosophila melanogaster OX=7227 GN=Rrp40 PE=1...
TAIR 10
Match Name | E-value | Identity (%) | Description
AT1G27500.1 | 3.1e-219 | 60.82 | Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G27960.1 | 6.0e-215 | 59.27 | Tetratricopeptide repeat (TPR)-like superfamily protein
AT4G10840.1 | 9.4e-184 | 58.29 | Tetratricopeptide repeat (TPR)-like superfamily protein
AT4G10840.2 | 1.9e-160 | 57.40 | Tetratricopeptide repeat (TPR)-like superfamily protein
AT2G25355.1 | 3.3e-96 | 73.08 | PNAS-3 related
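For downstream use, the pipe-delimited summary rows can be read into records and ranked. The sketch below assumes each row carries the match name, E-value, percent identity and description as its first four fields, and ignores any trailing fields; `parse_hit` and the `rows` sample (two rows copied from the TAIR 10 summary) are illustrative only.

```python
def parse_hit(row):
    """Split one pipe-delimited summary row into a typed record."""
    fields = [field.strip() for field in row.split("|")]
    name, evalue, identity, description = fields[:4]
    return {
        "name": name,
        "evalue": float(evalue),      # e.g. "3.1e-219" parses as a float
        "identity": float(identity),  # percent identity
        "description": description,
    }


rows = [
    "AT2G25355.1 | 3.3e-96 | 73.08 | PNAS-3 related",
    "AT1G27500.1 | 3.1e-219 | 60.82 | Tetratricopeptide repeat (TPR)-like superfamily protein",
]
hits = sorted((parse_hit(row) for row in rows), key=lambda hit: hit["evalue"])
print(hits[0]["name"])  # AT1G27500.1
```

Sorting by E-value rather than by identity matters here: AT2G25355.1 has the highest percent identity (73.08) but a much weaker E-value, because its alignment covers only a short C-terminal region of the query.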