Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGAATTCATCAAGCTTCCATGATAATGCTCCTGGGTCATCGAATTCTTATGATGGGTATTCCAATATCAGTTTTGTTACGCAATCGCTTTCTCGATCGAAGTCTGGTATTACGAGGGCGAGAATGACGAAGGTGAGGAGGCAAACGAGTTCCCTGGATTTGAGGTCTGCGGCGGTCCCGGAAACGCTTCGGCCGTTCACCGGGAATTCATTTCCGGCGGCATTTTCGGGTCAGGATTCTATCTTTGGTAAATCTGAGTCGGGTGTCATTGGGAATCAGCCATTTGTATTTGGAGAAAATAGGAGCAGAACCAGCTCGAATTTGGAGAGGGCAGAGAGGGAAGTTTTGGATGGAATGAAGAAATTGAATGTCGAAAGTGGAGGTGATGAAGCCATTGAATCAACGCTTCCGGATTACTTGAGGAAGCTGAACATTGAAGAAGGGCAAGGTAATTCTGCACGAATTCATAAAACTAGAAATGAAGGGGCTAAATCTGGTTTGTGGGACAGTAAAGTTGGTAATTCTATAGTTTCAGAACTCCCCAACAAGATGCAGCATTTGAACATTGAAGGTATTGTAGAGCTTAAGACACCAGATGTAAAAACAAAAACATTTTCTGCTGGAATCAGTGAAAATTTTCAATTCAGTGCTCAGAGCGATCCAATTAGAGAATTCAGGCCAAAGAGTAGGAGTGAAAGATATAATTCTACCATGTTTCGGTTGCGCGTTGATCAGGAGACTCAGGAGAGACACAAGGCATCTGAATTCTATTCACCAATGGATGTTTCGCCATATCAAGAAACACTGGCTCGTGGTCGGGTTTCTCGTGAAAATTCTCTGACATCCAATGAATCGCTTAATCTCGACAACAACTCCGTTGTATTTGATGAATCGATACCTGAGGTCTTAAATGATGTTATAGATGAAGATCTGGTTAATGCTACAGAAAGCTTAAATATGAGTGAACCTGGTTGTTCTGCAACTGAAGTGGAAGAATGTGAAGGCACTCTTTACCATTCTAATATAAACCATGGCGCTGAAGGTCCTCTAGAAGAATCTGTTTCAGGGGTGGATACTGAAAGCTACAAATCTGCAAATGAAGAGTTGGATATGAGTGGTGGTCTAGCTGCTATTTCAGGAGAAACAGAAGCCAATTCGAGCGTGAAATTCGAGAGGCAGGATAGCGATGGCAAGAATCAATTTAGTTTTGCTTCAAATTCAGAGGATGGATCGAGATCAAACTTTACATTTGCTGCCTCTTCTGCTGTTCGAGGTCAATTATCTGCATCCAAACGCCAATACAATAAGAAAAGTTGGGGAAAAGTTGGTCAAGATTCTTATATTTCCCCAACAATTGCTGTTGAAGTTTCATTGCCATCCTCTTCTGGGCAATTTGTTACATTTTCTGGGAATCCTTCACCAATATCATCTCAAATGAGTCAGAAGGGAGTTGATTCTTGGATGAACAAAGGCCAGGAGATGAAGCAAGAGTCGGTTTCTACGATAGCAGCAACTGTTGCTGCCCAGGAGGCTTGTGAAAAGTGGAGACTGAGGTTAGGCTTTTGATACTTGTGATTTCTTCTTGCCTTTATACTTTCTTGTGTCAAGATATCCTTTATACTTCATTGTCAGCTTCAAGATTCTTTCTCTTTTGTTTGCATAACATGTTACAGAGGCAACCAGGCATATGCAAGTGGTGATTTATCCAAGGCTGAGGACCATTACACTCGAGGAGTGAATTGTATATCAAAAGATGAGTCATCTAGAAGCTGTCTCAGGGCTTTGATGCTTTGTTACAGCAATCGTGCAGCAACAAGAATGTCTCTTGGAAGATTGCGAGACGCAATCAGTGACTGTACAATGGCTGCTGCTATAGATCCCAGCTTTTATAAAGTGTACCTTAGAGCTGCAAAGTAAGGGCTCGCTGAAGTAGATTTTGAAAATTATGTTCATAACATATCACTGAAGTACTATCTTATAAATTAGCTGTTACCTTGGCCTTGGGGAAATTGACGATGCGGTACAGTTCTTCAAGAGATGCCTGCAACCTGGGAATGATATCTATGTGGACCGGAAAATTGTAGTGGAAGCCTCGCATGGTTTGCAAAATGCTCAGGTAACAGATTTTCTTTCTAAGATGTGATAACTTTTAGTTTTTTGTTCCTTTTGACATGCGTCCTGTCCTATCATCTTTCATGGAAGTTCAATTTGTGCAACAAAGTAATGGATAATCTAAAAGCGTAAGCTGATGGGTTGTAGGAATTCAATCTTATAACAATACTTTAATACTCTTCACTTGTAGGCTTGAAAATTTACACAAGACTTAGCTAGTGGTTATCAATATTTATTAGAGATGAAACAACATTATAGGGTTTGAACAAAGGGCCTCAAACTCTGATGTTATGTTTAGTCGTCAATCAACTAAAAAGCTTAAGCTTTTGGGTTCGTTGTACCAGTACTTTGAAGGGGGTGGATTGGGGGTCCCACATCGATTGGAGAAGGGAACAAATGCTAACGAGGATGTTGGGCCCCAAAGGAGAGGGTGGATTGTGAGATCCCATATCGGATGGGGAAGAGAACGAAACATTTTTTATAAAGGTGTGGAAATCTCTTTTTAATAGATGCGTTTTAAAAACCTTGAGGGAAAGCTCGAAAGAGAGCCCAAAGAGAACAATACCGGCTAGCGATGGGCTTGAGTCATTACAATAACTGAACCTCCATAGTTTTTCAACCTTTTTGTTTCCTTTGATAGTGACCAAAATTGTCGAGGAAGGTATATATGGTTTTTTGCTTCACTTCTAATAGGCGTGATGTGATCTATATGACCAGATCTTACTGTTTCTACTGACTTGCAGAAAGCGTCTGAATGCATGAAACGTTTAGCTGAACTTCAGCTAAGAAGCACATCCACTGATATGCAGAGTGTTTTGGAATTAATTTCAGAGGCTTTGGTAATAAGCTCATGCTCCGAAAAATTAATTGAAATGAAAGCAGAGGCTCTTTTCATGGTTTGTTGCACTTCTCTAATGTTATCGTCGTTTTCTGTATTTTTTCCCTCGATTTCCTCCCTTTCCCTTTGAATTCTTTTGGGAGATCATATGAGATTTTACTGCGAGTAAACTAACTTTCATTTTCATGCATCATAACTCAATGGTTTCTGGGTTTTTATCACTTTCTTTTCTTTAAAAAAAAAAATTCTGCAGCTTCGGAGATATGAGGAGGTGATTCAGTTTTGTGAGCATACCGTAGATTCCGCTGAAAAGAATTCTCCTTCAGAAGATATTGTCAGTCAGACCTCAAATCTGGATTCTTCTGAAATCTCAAAGAAGTTGTACTTTAGGATTTGGCGATGCCGCTTGACTCTCAAGTGCTACTTCCTTCTGGGAAAACTTGAGGAGGGTCTGGCTTCTTTAACGCAAGAGGAGGAAGTGCCTACAGTGATTGGGTAATTTATTATGCTTCCTACTACTTTTGAAGAAGAAACAAGTATTTCACTGATAAAGAAGTAAATTGCAAAGAAAATAAACTTGAAAGGAAAAAGAACATTTTTCACTTTTCCAGTTTGACAATTTTAAGTTCCAGGAATGGAAGAAAGTTTTTAGAATCATCAATACCATTGGCCGTTACCATGAGGGAGCTCGTACGTCACAAGGTATCATAAGTCCACTGCCTTAGAGGCTCCTCCATCCTATCAGCCATAACTTGATCATATCCATATATCATTTTCTAATGCCTTAATATCTTTTAGTCTCATAATTGCATTATATCCCCATACCTTTTTTTCCTCCTTAAATGAGTGCAGGATGCTGGGAATGAAGCATTTCAGGCAGGAAGGAATTCAGAAGCCATTGAACACTACACAGCTGCTTTGGCGTGCAATGTCGAGTCACGTCCTTTCTCAGCTTTTTGTTTCTACAGTCGGGCTGCTGCATACAAAGCTCAGGGCCAAGTTATTGATGCTATTGCGGATTGTAGTCTTGCCATAGCCCTCGATGAAGAATTTTTCAAGGTAAGCTTTCTCTTTCTTGTGTTTATACTGGGCACCGAGCTGTCAACATAAATATCCCTCAATTCTTCTTGGTTTCTCAAAGAAAAATTTGAGTACCATTATAGATGAACACTAACAAGTATGTTCTGATAGAGCTTTTATATTCACAGGCAATTTCTAGACGGGCCACTTTGTATGAAATGATTAGAGACTATGGTCAAGCAGCTAATGATCTCCAGAAGTTTATATTGCTTTTTTCTAAGGAATTAGAGAAGACCTATCAGTTTGGAAATTCCGATAGATCGAGTACCAGTGCAAATGATTTGAGACAAGCTCGTCTCCACCTTGCAGAAGTTGAAGAAGAGTCGAGAAAGGAAATTCCATTGGACATGTACCTTATTCTGTAAGTATCACCCATCATTTAACGAATATTCAGATAATAAAAGGAAACAAAGACATCCCTCTTTCCCCTTCCAGTAGGATGGGAATTCTGAGAAAGGTCCTTTAATTTTAGGTTATGAGAATAACGGTAATTTACCTTTGTGGCAGGGGAGTTGATCCATCTGCATCTTCAGCTGAAATTAAGAAGGCATACAGGAAAGCTGCTCTCAGATACCATCCAGACAAGGTTTAATTTTGTTTACCTTGAAGATATGCTTATTTGTGTCTGTGCTTTCTTAATATGGGTTGTTCGTTCGTTTAGGCTGGTCAGTCCTTGGCTAGAGCAGACAACGGAGATAACGTACTATGGAAGGATATAGCTGGAGGAGTCCCCATGGATGCTGATAAACTTTTTAAAATGATTGGAGAGGCATATAATGTACTCTCAGATCCTATTAAGGTACCCAAAAGGGAGTTCTTCCCCAACTTTTTTATAGAAATTTCAAGAGAAAGGTTTCTTCATTGTCGACCATTATTTTCATTCTCTTTCTGCCTCAAGTTGCAATATGTCCATGCATTGCTTAGATCGAGAAGTACCCCCAACATCCTCCAGTGGCAACTCGAGTTTTACACGAGTCGCTCTTGCATATCATGTTAAGCTATGTTATCGAGCTCGATACATTTTATCAGATGATTGTAGCCTAAACTGATCTGTTTTTTCAGCGCTCCCGATATGATGCGGAAGAAGAGATGAGAACTGCCCAAAAGAAACGCAATGCAAGCAGCACCCCTAGATCACATACAGATGTTCATCAAGGTCATCAGTTCGAAAGAAGTAGTGTTAGGCCTCAGTGGCAAGATCTATGGAGATCTTATGGTTCTCGCGGATCAGAATTTACTCGATCAACCAGGTATCCATAAGAACTATGCGAACACATCAGGATATAATTCAGGCTTTCAAATGGCTAGGGTGCACCAAGTAATTGATCATTCTATCAAGAAATCCAGCTGCAGTTTGTCCCCTCAGGGTGGGACCGCCAAAAGTTGA
mRNA sequence
ATGAATTCATCAAGCTTCCATGATAATGCTCCTGGGTCATCGAATTCTTATGATGGGTATTCCAATATCAGTTTTGTTACGCAATCGCTTTCTCGATCGAAGTCTGGTATTACGAGGGCGAGAATGACGAAGGTGAGGAGGCAAACGAGTTCCCTGGATTTGAGGTCTGCGGCGGTCCCGGAAACGCTTCGGCCGTTCACCGGGAATTCATTTCCGGCGGCATTTTCGGGTCAGGATTCTATCTTTGGTAAATCTGAGTCGGGTGTCATTGGGAATCAGCCATTTGTATTTGGAGAAAATAGGAGCAGAACCAGCTCGAATTTGGAGAGGGCAGAGAGGGAAGTTTTGGATGGAATGAAGAAATTGAATGTCGAAAGTGGAGGTGATGAAGCCATTGAATCAACGCTTCCGGATTACTTGAGGAAGCTGAACATTGAAGAAGGGCAAGGTAATTCTGCACGAATTCATAAAACTAGAAATGAAGGGGCTAAATCTGGTTTGTGGGACAGTAAAGTTGGTAATTCTATAGTTTCAGAACTCCCCAACAAGATGCAGCATTTGAACATTGAAGGTATTGTAGAGCTTAAGACACCAGATGTAAAAACAAAAACATTTTCTGCTGGAATCAGTGAAAATTTTCAATTCAGTGCTCAGAGCGATCCAATTAGAGAATTCAGGCCAAAGAGTAGGAGTGAAAGATATAATTCTACCATGTTTCGGTTGCGCGTTGATCAGGAGACTCAGGAGAGACACAAGGCATCTGAATTCTATTCACCAATGGATGTTTCGCCATATCAAGAAACACTGGCTCGTGGTCGGGTTTCTCGTGAAAATTCTCTGACATCCAATGAATCGCTTAATCTCGACAACAACTCCGTTGTATTTGATGAATCGATACCTGAGGTCTTAAATGATGTTATAGATGAAGATCTGGTTAATGCTACAGAAAGCTTAAATATGAGTGAACCTGGTTGTTCTGCAACTGAAGTGGAAGAATGTGAAGGCACTCTTTACCATTCTAATATAAACCATGGCGCTGAAGGTCCTCTAGAAGAATCTGTTTCAGGGGTGGATACTGAAAGCTACAAATCTGCAAATGAAGAGTTGGATATGAGTGGTGGTCTAGCTGCTATTTCAGGAGAAACAGAAGCCAATTCGAGCGTGAAATTCGAGAGGCAGGATAGCGATGGCAAGAATCAATTTAGTTTTGCTTCAAATTCAGAGGATGGATCGAGATCAAACTTTACATTTGCTGCCTCTTCTGCTGTTCGAGGTCAATTATCTGCATCCAAACGCCAATACAATAAGAAAAGTTGGGGAAAAGTTGGTCAAGATTCTTATATTTCCCCAACAATTGCTGTTGAAGTTTCATTGCCATCCTCTTCTGGGCAATTTGTTACATTTTCTGGGAATCCTTCACCAATATCATCTCAAATGAGTCAGAAGGGAGTTGATTCTTGGATGAACAAAGGCCAGGAGATGAAGCAAGAGTCGGTTTCTACGATAGCAGCAACTGTTGCTGCCCAGGAGGCTTGTGAAAAGTGGAGACTGAGAGGCAACCAGGCATATGCAAGTGGTGATTTATCCAAGGCTGAGGACCATTACACTCGAGGAGTGAATTGTATATCAAAAGATGAGTCATCTAGAAGCTGTCTCAGGGCTTTGATGCTTTGTTACAGCAATCGTGCAGCAACAAGAATGTCTCTTGGAAGATTGCGAGACGCAATCAGTGACTGTACAATGGCTGCTGCTATAGATCCCAGCTTTTATAAAGTGTACCTTAGAGCTGCAAACTGTTACCTTGGCCTTGGGGAAATTGACGATGCGGTACAGTTCTTCAAGAGATGCCTGCAACCTGGGAATGATATCTATGTGGACCGGAAAATTGTAGTGGAAGCCTCGCATGGTTTGCAAAATGCTCAGAAAGCGTCTGAATGCATGAAACGTTTAGCTGAACTTCAGCTAAGAAGCACATCCACTGATATGCAGAGTGTTTTGGAATTAATTTCAGAGGCTTTGGTAATAAGCTCATGCTCCGAAAAATTAATTGAAATGAAAGCAGAGGCTCTTTTCATGCTTCGGAGATATGAGGAGGTGATTCAGTTTTGTGAGCATACCGTAGATTCCGCTGAAAAGAATTCTCCTTCAGAAGATATTGTCAGTCAGACCTCAAATCTGGATTCTTCTGAAATCTCAAAGAAGTTGTACTTTAGGATTTGGCGATGCCGCTTGACTCTCAAGTGCTACTTCCTTCTGGGAAAACTTGAGGAGGGTCTGGCTTCTTTAACGCAAGAGGAGGAAGTGCCTACAGTGATTGGGAATGGAAGAAAGTTTTTAGAATCATCAATACCATTGGCCGTTACCATGAGGGAGCTCGTACGTCACAAGGATGCTGGGAATGAAGCATTTCAGGCAGGAAGGAATTCAGAAGCCATTGAACACTACACAGCTGCTTTGGCGTGCAATGTCGAGTCACGTCCTTTCTCAGCTTTTTGTTTCTACAGTCGGGCTGCTGCATACAAAGCTCAGGGCCAAGTTATTGATGCTATTGCGGATTGTAGTCTTGCCATAGCCCTCGATGAAGAATTTTTCAAGGCAATTTCTAGACGGGCCACTTTGTATGAAATGATTAGAGACTATGGTCAAGCAGCTAATGATCTCCAGAAGTTTATATTGCTTTTTTCTAAGGAATTAGAGAAGACCTATCAGTTTGGAAATTCCGATAGATCGAGTACCAGTGCAAATGATTTGAGACAAGCTCGTCTCCACCTTGCAGAAGTTGAAGAAGAGTCGAGAAAGGAAATTCCATTGGACATGTACCTTATTCTGGGAGTTGATCCATCTGCATCTTCAGCTGAAATTAAGAAGGCATACAGGAAAGCTGCTCTCAGATACCATCCAGACAAGGCTGGTCAGTCCTTGGCTAGAGCAGACAACGGAGATAACGTACTATGGAAGGATATAGCTGGAGGAGTCCCCATGGATGCTGATAAACTTTTTAAAATGATTGGAGAGGCATATAATGTACTCTCAGATCCTATTAAGCGCTCCCGATATGATGCGGAAGAAGAGATGAGAACTGCCCAAAAGAAACGCAATGCAAGCAGCACCCCTAGATCACATACAGATGTTCATCAAGGTCATCAGTTCGAAAGAAGTAGTGTTAGGCCTCAGTGGCAAGATCTATGGAGATCTTATGGTTCTCGCGGATCAGAATTTACTCGATCAACCAGGGTGCACCAAGTAATTGATCATTCTATCAAGAAATCCAGCTGCAGTTTGTCCCCTCAGGGTGGGACCGCCAAAAGTTGA
Coding sequence (CDS)
ATGAATTCATCAAGCTTCCATGATAATGCTCCTGGGTCATCGAATTCTTATGATGGGTATTCCAATATCAGTTTTGTTACGCAATCGCTTTCTCGATCGAAGTCTGGTATTACGAGGGCGAGAATGACGAAGGTGAGGAGGCAAACGAGTTCCCTGGATTTGAGGTCTGCGGCGGTCCCGGAAACGCTTCGGCCGTTCACCGGGAATTCATTTCCGGCGGCATTTTCGGGTCAGGATTCTATCTTTGGTAAATCTGAGTCGGGTGTCATTGGGAATCAGCCATTTGTATTTGGAGAAAATAGGAGCAGAACCAGCTCGAATTTGGAGAGGGCAGAGAGGGAAGTTTTGGATGGAATGAAGAAATTGAATGTCGAAAGTGGAGGTGATGAAGCCATTGAATCAACGCTTCCGGATTACTTGAGGAAGCTGAACATTGAAGAAGGGCAAGGTAATTCTGCACGAATTCATAAAACTAGAAATGAAGGGGCTAAATCTGGTTTGTGGGACAGTAAAGTTGGTAATTCTATAGTTTCAGAACTCCCCAACAAGATGCAGCATTTGAACATTGAAGGTATTGTAGAGCTTAAGACACCAGATGTAAAAACAAAAACATTTTCTGCTGGAATCAGTGAAAATTTTCAATTCAGTGCTCAGAGCGATCCAATTAGAGAATTCAGGCCAAAGAGTAGGAGTGAAAGATATAATTCTACCATGTTTCGGTTGCGCGTTGATCAGGAGACTCAGGAGAGACACAAGGCATCTGAATTCTATTCACCAATGGATGTTTCGCCATATCAAGAAACACTGGCTCGTGGTCGGGTTTCTCGTGAAAATTCTCTGACATCCAATGAATCGCTTAATCTCGACAACAACTCCGTTGTATTTGATGAATCGATACCTGAGGTCTTAAATGATGTTATAGATGAAGATCTGGTTAATGCTACAGAAAGCTTAAATATGAGTGAACCTGGTTGTTCTGCAACTGAAGTGGAAGAATGTGAAGGCACTCTTTACCATTCTAATATAAACCATGGCGCTGAAGGTCCTCTAGAAGAATCTGTTTCAGGGGTGGATACTGAAAGCTACAAATCTGCAAATGAAGAGTTGGATATGAGTGGTGGTCTAGCTGCTATTTCAGGAGAAACAGAAGCCAATTCGAGCGTGAAATTCGAGAGGCAGGATAGCGATGGCAAGAATCAATTTAGTTTTGCTTCAAATTCAGAGGATGGATCGAGATCAAACTTTACATTTGCTGCCTCTTCTGCTGTTCGAGGTCAATTATCTGCATCCAAACGCCAATACAATAAGAAAAGTTGGGGAAAAGTTGGTCAAGATTCTTATATTTCCCCAACAATTGCTGTTGAAGTTTCATTGCCATCCTCTTCTGGGCAATTTGTTACATTTTCTGGGAATCCTTCACCAATATCATCTCAAATGAGTCAGAAGGGAGTTGATTCTTGGATGAACAAAGGCCAGGAGATGAAGCAAGAGTCGGTTTCTACGATAGCAGCAACTGTTGCTGCCCAGGAGGCTTGTGAAAAGTGGAGACTGAGAGGCAACCAGGCATATGCAAGTGGTGATTTATCCAAGGCTGAGGACCATTACACTCGAGGAGTGAATTGTATATCAAAAGATGAGTCATCTAGAAGCTGTCTCAGGGCTTTGATGCTTTGTTACAGCAATCGTGCAGCAACAAGAATGTCTCTTGGAAGATTGCGAGACGCAATCAGTGACTGTACAATGGCTGCTGCTATAGATCCCAGCTTTTATAAAGTGTACCTTAGAGCTGCAAACTGTTACCTTGGCCTTGGGGAAATTGACGATGCGGTACAGTTCTTCAAGAGATGCCTGCAACCTGGGAATGATATCTATGTGGACCGGAAAATTGTAGTGGAAGCCTCGCATGGTTTGCAAAATGCTCAGAAAGCGTCTGAATGCATGAAACGTTTAGCTGAACTTCAGCTAAGAAGCACATCCACTGATATGCAGAGTGTTTTGGAATTAATTTCAGAGGCTTTGGTAATAAGCTCATGCTCCGAAAAATTAATTGAAATGAAAGCAGAGGCTCTTTTCATGCTTCGGAGATATGAGGAGGTGATTCAGTTTTGTGAGCATACCGTAGATTCCGCTGAAAAGAATTCTCCTTCAGAAGATATTGTCAGTCAGACCTCAAATCTGGATTCTTCTGAAATCTCAAAGAAGTTGTACTTTAGGATTTGGCGATGCCGCTTGACTCTCAAGTGCTACTTCCTTCTGGGAAAACTTGAGGAGGGTCTGGCTTCTTTAACGCAAGAGGAGGAAGTGCCTACAGTGATTGGGAATGGAAGAAAGTTTTTAGAATCATCAATACCATTGGCCGTTACCATGAGGGAGCTCGTACGTCACAAGGATGCTGGGAATGAAGCATTTCAGGCAGGAAGGAATTCAGAAGCCATTGAACACTACACAGCTGCTTTGGCGTGCAATGTCGAGTCACGTCCTTTCTCAGCTTTTTGTTTCTACAGTCGGGCTGCTGCATACAAAGCTCAGGGCCAAGTTATTGATGCTATTGCGGATTGTAGTCTTGCCATAGCCCTCGATGAAGAATTTTTCAAGGCAATTTCTAGACGGGCCACTTTGTATGAAATGATTAGAGACTATGGTCAAGCAGCTAATGATCTCCAGAAGTTTATATTGCTTTTTTCTAAGGAATTAGAGAAGACCTATCAGTTTGGAAATTCCGATAGATCGAGTACCAGTGCAAATGATTTGAGACAAGCTCGTCTCCACCTTGCAGAAGTTGAAGAAGAGTCGAGAAAGGAAATTCCATTGGACATGTACCTTATTCTGGGAGTTGATCCATCTGCATCTTCAGCTGAAATTAAGAAGGCATACAGGAAAGCTGCTCTCAGATACCATCCAGACAAGGCTGGTCAGTCCTTGGCTAGAGCAGACAACGGAGATAACGTACTATGGAAGGATATAGCTGGAGGAGTCCCCATGGATGCTGATAAACTTTTTAAAATGATTGGAGAGGCATATAATGTACTCTCAGATCCTATTAAGCGCTCCCGATATGATGCGGAAGAAGAGATGAGAACTGCCCAAAAGAAACGCAATGCAAGCAGCACCCCTAGATCACATACAGATGTTCATCAAGGTCATCAGTTCGAAAGAAGTAGTGTTAGGCCTCAGTGGCAAGATCTATGGAGATCTTATGGTTCTCGCGGATCAGAATTTACTCGATCAACCAGGGTGCACCAAGTAATTGATCATTCTATCAAGAAATCCAGCTGCAGTTTGTCCCCTCAGGGTGGGACCGCCAAAAGTTGA
Protein sequence
MNSSSFHDNAPGSSNSYDGYSNISFVTQSLSRSKSGITRARMTKVRRQTSSLDLRSAAVPETLRPFTGNSFPAAFSGQDSIFGKSESGVIGNQPFVFGENRSRTSSNLERAEREVLDGMKKLNVESGGDEAIESTLPDYLRKLNIEEGQGNSARIHKTRNEGAKSGLWDSKVGNSIVSELPNKMQHLNIEGIVELKTPDVKTKTFSAGISENFQFSAQSDPIREFRPKSRSERYNSTMFRLRVDQETQERHKASEFYSPMDVSPYQETLARGRVSRENSLTSNESLNLDNNSVVFDESIPEVLNDVIDEDLVNATESLNMSEPGCSATEVEECEGTLYHSNINHGAEGPLEESVSGVDTESYKSANEELDMSGGLAAISGETEANSSVKFERQDSDGKNQFSFASNSEDGSRSNFTFAASSAVRGQLSASKRQYNKKSWGKVGQDSYISPTIAVEVSLPSSSGQFVTFSGNPSPISSQMSQKGVDSWMNKGQEMKQESVSTIAATVAAQEACEKWRLRGNQAYASGDLSKAEDHYTRGVNCISKDESSRSCLRALMLCYSNRAATRMSLGRLRDAISDCTMAAAIDPSFYKVYLRAANCYLGLGEIDDAVQFFKRCLQPGNDIYVDRKIVVEASHGLQNAQKASECMKRLAELQLRSTSTDMQSVLELISEALVISSCSEKLIEMKAEALFMLRRYEEVIQFCEHTVDSAEKNSPSEDIVSQTSNLDSSEISKKLYFRIWRCRLTLKCYFLLGKLEEGLASLTQEEEVPTVIGNGRKFLESSIPLAVTMRELVRHKDAGNEAFQAGRNSEAIEHYTAALACNVESRPFSAFCFYSRAAAYKAQGQVIDAIADCSLAIALDEEFFKAISRRATLYEMIRDYGQAANDLQKFILLFSKELEKTYQFGNSDRSSTSANDLRQARLHLAEVEEESRKEIPLDMYLILGVDPSASSAEIKKAYRKAALRYHPDKAGQSLARADNGDNVLWKDIAGGVPMDADKLFKMIGEAYNVLSDPIKRSRYDAEEEMRTAQKKRNASSTPRSHTDVHQGHQFERSSVRPQWQDLWRSYGSRGSEFTRSTRVHQVIDHSIKKSSCSLSPQGGTAKS
Homology
BLAST of CmaCh02G012620 vs. ExPASy Swiss-Prot
Match:
Q99615 (DnaJ homolog subfamily C member 7 OS=Homo sapiens OX=9606 GN=DNAJC7 PE=1 SV=2)
HSP 1 Score: 164.9 bits (416), Expect = 5.3e-39
Identity = 149/518 (28.76%), Postives = 234/518 (45.17%), Query Frame = 0
Query: 508 AQEACEKWRLRGNQAYASGDLSKAEDHYTRGVNCISKDESSRSCLRALMLCYSNRAATRM 567
A+ E ++ +GN YA D ++A ++YT+ ++ K+ S Y NRAAT M
Sbjct: 24 AKREAETFKEQGNAYYAKKDYNEAYNYYTKAIDMCPKNAS----------YYGNRAATLM 83
Query: 568 SLGRLRDAISDCTMAAAIDPSFYKVYLRAANCYLGLGEIDDAVQFFKRCLQPGNDIYVDR 627
LGR R+A+ D + +D SF + +LR C+L LG A + F+R L+ +D
Sbjct: 84 MLGRFREALGDAQQSVRLDDSFVRGHLREGKCHLSLGNAMAACRSFQRALE------LDH 143
Query: 628 KIVVEASHGLQNAQKASECMKRLAELQLRSTSTDMQSVLELISEALVISSCSEKLIEMKA 687
K +A +NA E +++AE D + V+ + AL + + +KA
Sbjct: 144 K-NAQAQQEFKNANAVME-YEKIAETDFE--KRDFRKVVFCMDRALEFAPACHRFKILKA 203
Query: 688 EALFMLRRYEEVIQFCEHTVDSAEKNSPSEDIVSQTSNLDSSEISKKLYFRIWRCRLTLK 747
E L ML RY E ++ + S +DS+ + LY R C
Sbjct: 204 ECLAMLGRYPE-----------------AQSVASDILRMDSTN-ADALYVR-GLCLYYED 263
Query: 748 CYFLLGKLEEGLASLTQEEEVPTVIGNGRKFLESSIPLAVTMRELVRHKDAGNEAFQAGR 807
C + + + E + K L++ K+ GN+AF+ G
Sbjct: 264 CIEKAVQFFVQALRMAPDHEKACIACRNAKALKAK-------------KEDGNKAFKEGN 323
Query: 808 NSEAIEHYTAALACNVESRPFSAFCFYSRAAAYKAQGQVIDAIADCSLAIALDEEFFKAI 867
A E YT AL + + +A + +R ++ DAI DC+ A+ LD+ + KA
Sbjct: 324 YKLAYELYTEALGIDPNNIKTNAKLYCNRGTVNSKLRKLDDAIEDCTNAVKLDDTYIKAY 383
Query: 868 SRRATLYEMIRDYGQAANDLQKFILLFSKELEKTYQFGNSDRSSTSANDLRQARLHLAEV 927
RRA Y Y +A D EK YQ ++++ L+ A+L E+
Sbjct: 384 LRRAQCYMDTEQYEEAVRD-----------YEKVYQ---TEKTKEHKQLLKNAQL---EL 443
Query: 928 EEESRKEIPLDMYLILGVDPSASSAEIKKAYRKAALRYHPDKAGQSLARADNGDNVLWKD 987
++ RK D Y ILGVD +AS EIKKAYRK AL +HPD+ + A
Sbjct: 444 KKSKRK----DYYKILGVDKNASEDEIKKAYRKRALMHHPDRHSGASAE----------- 453
Query: 988 IAGGVPMDADKLFKMIGEAYNVLSDPIKRSRYDAEEEM 1026
V + +K FK +GEA+ +LSDP K++RYD+ +++
Sbjct: 504 ----VQKEEEKKFKEVGEAFTILSDPKKKTRYDSGQDL 453
BLAST of CmaCh02G012620 vs. ExPASy Swiss-Prot
Match:
Q9QYI3 (DnaJ homolog subfamily C member 7 OS=Mus musculus OX=10090 GN=Dnajc7 PE=1 SV=2)
HSP 1 Score: 163.3 bits (412), Expect = 1.6e-38
Identity = 149/518 (28.76%), Postives = 234/518 (45.17%), Query Frame = 0
Query: 508 AQEACEKWRLRGNQAYASGDLSKAEDHYTRGVNCISKDESSRSCLRALMLCYSNRAATRM 567
A+ E ++ +GN YA D ++A ++YT+ ++ + S Y NRAAT M
Sbjct: 24 AKREAESFKEQGNAYYAKKDYNEAYNYYTKAIDMCPNNAS----------YYGNRAATLM 83
Query: 568 SLGRLRDAISDCTMAAAIDPSFYKVYLRAANCYLGLGEIDDAVQFFKRCLQPGNDIYVDR 627
LGR R+A+ D + +D SF + +LR C+L LG A + F+R L+ +D
Sbjct: 84 MLGRFREALGDAQQSVRLDDSFVRGHLREGKCHLSLGNAMAACRSFQRALE------LDH 143
Query: 628 KIVVEASHGLQNAQKASECMKRLAELQLRSTSTDMQSVLELISEALVISSCSEKLIEMKA 687
K +A +NA E +++AE+ D + V+ + AL + + +KA
Sbjct: 144 K-NAQAQQEFKNANAVME-YEKIAEVDFE--KRDFRKVVFCMDRALEFAPACHRFKILKA 203
Query: 688 EALFMLRRYEEVIQFCEHTVDSAEKNSPSEDIVSQTSNLDSSEISKKLYFRIWRCRLTLK 747
E L ML RY E ++ + S +DS+ + LY R C
Sbjct: 204 ECLAMLGRYPE-----------------AQFVASDILRMDSTN-ADALYVR-GLCLYYED 263
Query: 748 CYFLLGKLEEGLASLTQEEEVPTVIGNGRKFLESSIPLAVTMRELVRHKDAGNEAFQAGR 807
C + + + E V K L++ K+ GN+AF+ G
Sbjct: 264 CIEKAVQFFVQALRMAPDHEKACVACRNAKALKAK-------------KEDGNKAFKEGN 323
Query: 808 NSEAIEHYTAALACNVESRPFSAFCFYSRAAAYKAQGQVIDAIADCSLAIALDEEFFKAI 867
A E YT AL + + +A + +R Q+ DAI DC+ A+ LD+ + KA
Sbjct: 324 YKLAYELYTEALGIDPNNIKTNAKLYCNRGTVNSKLRQLEDAIEDCTNAVKLDDTYIKAY 383
Query: 868 SRRATLYEMIRDYGQAANDLQKFILLFSKELEKTYQFGNSDRSSTSANDLRQARLHLAEV 927
RRA Y + +A D EK YQ ++++ L+ A+L E+
Sbjct: 384 LRRAQCYMDTEQFEEAVRD-----------YEKVYQ---TEKTKEHKQLLKNAQL---EL 443
Query: 928 EEESRKEIPLDMYLILGVDPSASSAEIKKAYRKAALRYHPDKAGQSLARADNGDNVLWKD 987
++ RK D Y ILGVD +AS EIKKAYRK AL +HPD+ + A
Sbjct: 444 KKSKRK----DYYKILGVDKNASEDEIKKAYRKRALMHHPDRHSGASAE----------- 453
Query: 988 IAGGVPMDADKLFKMIGEAYNVLSDPIKRSRYDAEEEM 1026
V + +K FK +GEA+ +LSDP K++RYD+ +++
Sbjct: 504 ----VQKEEEKKFKEVGEAFTILSDPKKKTRYDSGQDL 453
BLAST of CmaCh02G012620 vs. ExPASy Swiss-Prot
Match:
Q5R8D8 (DnaJ homolog subfamily C member 7 OS=Pongo abelii OX=9601 GN=DNAJC7 PE=2 SV=1)
HSP 1 Score: 162.5 bits (410), Expect = 2.6e-38
Identity = 148/518 (28.57%), Postives = 230/518 (44.40%), Query Frame = 0
Query: 508 AQEACEKWRLRGNQAYASGDLSKAEDHYTRGVNCISKDESSRSCLRALMLCYSNRAATRM 567
A+ E ++ +GN YA D ++A ++YT+ ++ K+ S Y NRAAT M
Sbjct: 24 AKREAETFKEQGNAYYAKKDYNEAYNYYTKAIDMCPKNAS----------YYGNRAATLM 83
Query: 568 SLGRLRDAISDCTMAAAIDPSFYKVYLRAANCYLGLGEIDDAVQFFKRCLQPGNDIYVDR 627
LGR R+A+ D + +D SF + LR C+L LG A + F+R L+ +D
Sbjct: 84 MLGRFREALGDAQQSVRLDDSFVRGRLREGKCHLSLGNAMAACRSFQRALE------LDH 143
Query: 628 KIVVEASHGLQNAQKASECMKRLAELQLRSTSTDMQSVLELISEALVISSCSEKLIEMKA 687
K +A +NA E +++AE D + V+ + AL + + +KA
Sbjct: 144 K-NAQAQQEFKNANAVME-YEKIAETDFE--KRDFRKVVFCMDRALEFAPACHRFKILKA 203
Query: 688 EALFMLRRYEEVIQFCEHTVDSAEKNSPSEDIVSQTSNLDSSEISKKLYFRIWRCRLTLK 747
E L ML RY E ++ + S +DS+ + LY R C
Sbjct: 204 ECLAMLGRYPE-----------------AQSVASDILRMDSTN-ADALYVR-GLCLYYED 263
Query: 748 CYFLLGKLEEGLASLTQEEEVPTVIGNGRKFLESSIPLAVTMRELVRHKDAGNEAFQAGR 807
C + + + E + K L++ K+ GN+AF+ G
Sbjct: 264 CIEKAVQFFVQALRMAPDHEKACIACRNAKALKAK-------------KEDGNKAFKEGN 323
Query: 808 NSEAIEHYTAALACNVESRPFSAFCFYSRAAAYKAQGQVIDAIADCSLAIALDEEFFKAI 867
A E YT AL + + +A + +R ++ DAI DC+ A+ LD+ + KA
Sbjct: 324 YKLAYELYTEALGIDPNNIKTNAKLYCNRGTVNSKLRKLDDAIEDCTNAVKLDDTYIKAY 383
Query: 868 SRRATLYEMIRDYGQAANDLQKFILLFSKELEKTYQFGNSDRSSTSANDLRQARLHLAEV 927
RRA Y Y +A D EK YQ ++++ L+ A+L L
Sbjct: 384 LRRAQCYMDTEQYEEAVRD-----------YEKVYQ---TEKTKEHKQLLKSAQLEL--- 443
Query: 928 EEESRKEIPLDMYLILGVDPSASSAEIKKAYRKAALRYHPDKAGQSLARADNGDNVLWKD 987
+K D Y ILGVD +AS EIKKAYRK AL +HPD+ + A
Sbjct: 444 ----KKSKRRDYYKILGVDKNASEDEIKKAYRKRALMHHPDRHSGASAE----------- 453
Query: 988 IAGGVPMDADKLFKMIGEAYNVLSDPIKRSRYDAEEEM 1026
V + +K FK +GEA+ +LSDP K++RYD+ +++
Sbjct: 504 ----VQKEEEKKFKEVGEAFTILSDPKKKTRYDSGQDL 453
BLAST of CmaCh02G012620 vs. ExPASy Swiss-Prot
Match:
Q54IP0 (DnaJ homolog subfamily C member 7 homolog OS=Dictyostelium discoideum OX=44689 GN=dnajc7 PE=1 SV=1)
HSP 1 Score: 119.4 bits (298), Expect = 2.6e-25
Identity = 133/515 (25.83%), Postives = 213/515 (41.36%), Query Frame = 0
Query: 513 EKWRLRGNQAYASGDLSKAEDHYTRGVNCISKDESSRSCLRALMLCYSNRAATRMSL--- 572
E+ + +GN + A YT+ + E S + A Y NRAA +++
Sbjct: 4 EECKTQGNNYFKQSQYMDAIRCYTQAI------ELSNGTIAAY---YGNRAAAYLAICTK 63
Query: 573 GRLRDAISDCTMAAAIDPSFYKVYLRAANCYLGLGEIDDAVQFFKRCLQPGNDIYVDRKI 632
L+D+I D A ++ SF K Y RA+ Y+ L + D A R L ++ R
Sbjct: 64 SSLQDSIKDSLKAIELERSFIKGYTRASKAYIHLAQYDQAASIIVRGL-----VFDPRN- 123
Query: 633 VVEASHGLQNAQKASECMKRLAELQLRSTSTDMQSVLELISEALVISSCSEKLIEMKAEA 692
+ LQ + + ++ L ++ S L I L S + +L +KA
Sbjct: 124 ----NELLQEKNQIDSIQRTISSLTKEKALSNPSSSLNQIENVLSQSKYNTQLQVLKARV 183
Query: 693 LFMLRRYEEVIQFCEHTVDSAEKNSPSEDIVSQTSNLDSSEISKKLYFRIWRCRLTLKCY 752
L L++Y P + T + S + LY R L +
Sbjct: 184 LIELKQY------------------PQASNLMTTLLQEDSRNPEYLYVR--GLSLYYQNN 243
Query: 753 FLLGKLEEGLASLTQEEEVPTVIGNGRKFLESSIPLAVTMRELVRHKDAGNEAFQAGRNS 812
F L L+ SLT + + + ES + L +R + K GNE FQ+
Sbjct: 244 FPLA-LQHFQNSLTYDPD----------YSESRVALK-RLRSIESKKKEGNEYFQSKNYQ 303
Query: 813 EAIEHYTAALACNVESRPFSAFCFYSRAAAYKAQGQVIDAIADCSLAIALDEEFFKAISR 872
A + +T AL+ + + ++ + +RAAA ++ +AI DC+ A+ +D + KA R
Sbjct: 304 AAYDSFTEALSIDPKLETMNSQLYSNRAAALVHLNRISEAINDCTSAVTIDPNYGKAYIR 363
Query: 873 RATLYEMIRDYGQAANDLQKFILLFSKELEKTYQFGNSDRSSTSANDLRQARLHLAEVEE 932
RA +Y A D +K + + + + + ++ E +
Sbjct: 364 RAQCQMKQENYEDAVRDYEK--------------------AQSLDPENGELQRNIKEAKI 423
Query: 933 ESRKEIPLDMYLILGVDPSASSAEIKKAYRKAALRYHPDKAGQSLARADNGDNVLWKDIA 992
+K + D Y ILGV A EIKKAYRK AL+YHPDK Q +
Sbjct: 424 AHKKSLRKDYYKILGVSKEAGETEIKKAYRKLALQYHPDKNNQ---------------LP 432
Query: 993 GGVPMDADKLFKMIGEAYNVLSDPIKRSRYDAEEE 1025
A+K+FK IGEAY+VLSD K+ +YD ++
Sbjct: 484 EEEKAQAEKMFKDIGEAYSVLSDEKKKRQYDMGQD 432
BLAST of CmaCh02G012620 vs. ExPASy Swiss-Prot
Match:
Q9HGM9 (DnaJ homolog subfamily C member 7 homolog OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=SPBC543.02c PE=4 SV=1)
HSP 1 Score: 109.4 bits (272), Expect = 2.7e-22
Identity = 126/534 (23.60%), Postives = 209/534 (39.14%), Query Frame = 0
Query: 488 MNKGQEMKQESVSTIAATVAAQEACEKWRLRGNQAYASGDLSKAEDHYTRGVNCISKDES 547
MN G E +QE E EK + GN Y ++A YT ++ S
Sbjct: 9 MNAGTESQQEPA----------ELAEKQKAIGNAFYKEKKYAEAIKAYTEAIDLGSDS-- 68
Query: 548 SRSCLRALMLCYSNRAATRMSLGRLRDAISDCTMAAAIDPSFYKVYLRAANCYLGLGEID 607
AL + YSNRAAT M +G A+ D + I P K R Y GL ++
Sbjct: 69 ------ALAIYYSNRAATYMQIGEFELALCDAKQSDRIKPDVPKTQSRIRQAYEGLSILN 128
Query: 608 DAVQFFKRCLQPGNDIYVDRKIVVEASHGLQNAQKASECMKRLAELQLRSTSTDMQSVLE 667
+A ++Y+ K A + L Q+ ++ ST+ S +
Sbjct: 129 EA------------EVYLKNKQAGLALNALDRLQR-----------RIDSTTQPPMSWMY 188
Query: 668 LISEALVISSCSEKLIEMKAEALFMLRRYEEVIQFCEHTVDSAEKNSPSEDIVSQTSNLD 727
L ++ + + ++ ++ + L + + E + + + +N+ + + LD
Sbjct: 189 LKAQVYIFQNDMDRAQKIAHDVLRLNPKNVEALVLRGKVMYYSGENAKAITHFQEALKLD 248
Query: 728 SSEISKKLYFRIWRCRLTLKCYFLLGKLEEGLASLTQEEEVPTVIGNGRKFLESSIPLAV 787
+ K F+
Sbjct: 249 PDCTTAKTLFK------------------------------------------------- 308
Query: 788 TMRELVRHKDAGNEAFQAGRNSEAIEHYTAALACNVESRPFSAFCFYSRAAAYKAQGQVI 847
+R+L K+ GN+ F+ G +A E Y+ AL + +++ A + +RA +
Sbjct: 309 QVRKLENTKNQGNDLFRQGNYQDAYEKYSEALQIDPDNKETVAKLYMNRATVLLRLKRPE 368
Query: 848 DAIADCSLAIALDEEFFKAISRRATLYEMIRDYGQAANDLQKFILLFSKELEKTYQFGNS 907
+A++D A+A+D + K + RA +E + + +A D+Q I L
Sbjct: 369 EALSDSDNALAIDSSYLKGLKVRAKAHEALEKWEEAVRDVQSAIEL-------------- 412
Query: 908 DRSSTSANDLRQARLHLAEVEEESRKEIPLDMYLILGVDPSASSAEIKKAYRKAALRYHP 967
++ AN ++ R E+++ RK D Y ILGV A+ EIKKAYRK AL YHP
Sbjct: 429 --DASDANLRQELRRLQLELKKSKRK----DHYKILGVSKEATDIEIKKAYRKLALVYHP 412
Query: 968 DKAGQSLARADNGDNVLWKDIAGGVPMDADKLFKMIGEAYNVLSDPIKRSRYDA 1022
DK N N ++A+ FK +GEAY +LSDP R R+D+
Sbjct: 489 DK---------NAGN-----------LEAEARFKEVGEAYTILSDPESRRRFDS 412
BLAST of CmaCh02G012620 vs. TAIR 10
Match:
AT5G12430.1 (Heat shock protein DnaJ with tetratricopeptide repeat )
HSP 1 Score: 711.4 bits (1835), Expect = 1.1e-204
Identity = 497/1205 (41.24%), Postives = 685/1205 (56.85%), Query Frame = 0
Query: 4 SSFHDNAPGSSNSYDGYSN----ISFVTQSLSRSKSGITRARMTKVRRQTSSLDLRSAAV 63
S F + P S + SN SF + RS SG+++ R +KVRRQ S +L+ +
Sbjct: 2 SKFGELNPAFSGAGRSSSNNNPDASFNSAPFPRSSSGLSKPRFSKVRRQVKSQNLKPSGT 61
Query: 64 PETL--RPFTGNSFPAAFSGQDSIFGKSESGVIGNQPFVFGENRSRTSSNLERAERE--- 123
++L + F F +FSG D + G N+ FVFG SS++++ + +
Sbjct: 62 SDSLPGQSFNPFHFRGSFSG-DPTPSEIGFGRSSNEGFVFG-----GSSHVDKLQSDEKI 121
Query: 124 ---VLDGMKKLNVESGGDEAIESTLPDYLRKLN------IEEGQGNSA------------ 183
V++ M++L +ES G S LP+ ++ LN +++G NS
Sbjct: 122 GIRVMEEMERLKIESEGK---ASRLPEDMQNLNSSFSFGVKKGSNNSVFATVELPTLLSN 181
Query: 184 ---------------------------RIHKTRNEGAKSGLWDSKVGNSIVS-ELPNKM- 243
+ +N KS + VG I+S +L K+
Sbjct: 182 KLIIDSSSRSTGHVIQESMEKLNISERGTDQKQNNNVKSKVSMDYVGEKILSDDLSRKLS 241
Query: 244 -------------------------------------------QHLNIE----------- 303
Q+LN
Sbjct: 242 VGSMTTDGNHSGDSFQGSVNEKKVHDFNSSCPMNYSFVGTEPSQNLNARNVHDVSSTVNT 301
Query: 304 --------------GIVELKTPDVKTKTFSAGISENFQFSAQSDPIREFRPKSRSERYNS 363
G +E KTP+ K FS+ + + F+A+ D + R
Sbjct: 302 SDFNFVSNQDSVKTGFMEFKTPNSKVNPFSS-LDQKLGFNAKKDSVGATTRARRKGGKQP 361
Query: 364 TMFRLRVDQE--------TQERHKASEFYSPMDVSPYQETLARGRVSRENSLTSNESLNL 423
+L + +E ++A E YSPMD+SPY+ET V RE
Sbjct: 362 VKVQLNIGREFAFAESAIPNGSNEAPEAYSPMDISPYEET----EVCRE----------- 421
Query: 424 DNNSVVFDESIPEVL-NDVIDEDLVNATESLNMSEPGCSATEVEECEGTLYHSNINHGAE 483
F IP N + D +LV ATE + ++E EV + +++ E
Sbjct: 422 ------FSADIPPTAPNYLFDAELVAATERMEINE----GDEVNNYQAEEFNTGNCADHE 481
Query: 484 GPLEESVSGVDTESYKSANEELDMSGGLAAISGETEANSSVKFERQDSDGKNQFSFASNS 543
+S+SG +TES+KSA EE++ S A + E+E S K +R+++D + ++
Sbjct: 482 DLAGDSISGAETESFKSAAEEMETSSETFATASESEVTSRYKSDRKEND-----DHSLSN 541
Query: 544 EDGSRSNFTFAAS--SAVRGQLSASKRQYNKKSWGKVGQDSYISPTIAVEVSLPSSSGQF 603
D + S+FTF+AS S V+G LS SKR KK+ K+GQD YI + +LP S Q
Sbjct: 542 TDAASSSFTFSASSFSGVQGPLSTSKRINRKKNPIKLGQDPYI---LIPNATLPLKSSQH 601
Query: 604 VTFSGNPSPISS-QMSQKGVDSWMNKGQEMKQESVSTIAATV--AAQEACEKWRLRGNQA 663
+G S S+ + S++ + ++K + I V AAQEACEKWRLRGN A
Sbjct: 602 SPLTGVQSHFSTGKPSERDPLTRLHKPINNSVMEKARIEKDVSNAAQEACEKWRLRGNNA 661
Query: 664 YASGDLSKAEDHYTRGVNCISKDESSRSCLRALMLCYSNRAATRMSLGRLRDAISDCTMA 723
Y GDLS+AE+ YT+G++ + + E+SR+CLRALMLCYSNRAATRM+LGR+R+AI+DCTMA
Sbjct: 662 YKIGDLSRAEESYTQGIDSVPRIETSRNCLRALMLCYSNRAATRMALGRMREAIADCTMA 721
Query: 724 AAIDPSFYKVYLRAANCYLGLGEIDDAVQFFKRCLQPGNDIYVDRKIVVEASHGLQNAQK 783
++ID +F KV +RAANCYL LGEI+DA ++FK+CLQ G+DI VDRKI+VEAS GLQ AQ+
Sbjct: 722 SSIDSNFLKVQVRAANCYLSLGEIEDASRYFKKCLQSGSDICVDRKIIVEASEGLQKAQR 781
Query: 784 ASECMKRLA-ELQLRSTSTDMQSVLELISEALVISSCSEKLIEMKAEALFMLRRYEEVIQ 843
SECM LQLR T TD + LE++ ++L+IS+ SEKL+ MK EAL ML +Y+ I+
Sbjct: 782 VSECMHEAGRRLQLR-TLTDAEKALEILEDSLLISTYSEKLLTMKGEALLMLEKYDAAIK 841
Query: 844 FCEHTVDSAEKNSPSEDIVSQTSNLDSSEISKKLYFRIWRCRLTLKCYFLLGKLEEGLAS 903
CE TVD A KNSP DS + K + FRIW+C L LK F +GKLEE +AS
Sbjct: 842 LCEQTVDLAGKNSPP----------DSHDTPKDINFRIWQCHLMLKSSFYMGKLEEAIAS 901
Query: 904 LTQEEEVPTVI-GNGRKFLESSIPLAVTMRELVRHKDAGNEAFQAGRNSEAIEHYTAALA 963
L ++E++ + G K LESSIPLA T+REL+R K AGNEAFQ+GR++EA+EHYTAALA
Sbjct: 902 LEKQEQLLSATKREGNKTLESSIPLAATIRELLRLKAAGNEAFQSGRHTEAVEHYTAALA 961
Query: 964 CNVESRPFSAFCFYSRAAAYKAQGQVIDAIADCSLAIALDEEFFKAISRRATLYEMIRDY 1023
CNVESRPF+A CF +RAAAYKA GQ DAIADCSLAIALD+ + KAISRRATL+EMIRDY
Sbjct: 962 CNVESRPFTAVCFCNRAAAYKALGQFSDAIADCSLAIALDQNYSKAISRRATLFEMIRDY 1021
Query: 1024 GQAANDLQKFILLFSKELEKTYQFGNSDRSSTSANDLRQARLHLAEVEEESRKEIPLDMY 1066
GQAA+D+++++ + +K++E+ G DRS++ +ND+RQAR+ L+E+EE+SRKE LDMY
Sbjct: 1022 GQAASDMERYVNILTKQMEEKTS-GTLDRSTSMSNDIRQARIRLSELEEKSRKENSLDMY 1081
BLAST of CmaCh02G012620 vs. TAIR 10
Match:
AT2G41520.1 (Heat shock protein DnaJ with tetratricopeptide repeat )
HSP 1 Score: 465.7 bits (1197), Expect = 1.0e-130
Identity = 301/699 (43.06%), Postives = 426/699 (60.94%), Query Frame = 0
Query: 381 ETEANSSVKFERQDSDGKNQFSFAS-------NSEDGSRS---NFTFAASSA---VRGQL 440
ET S E DS N F AS +ED + NF+F+AS++ +R +
Sbjct: 442 ETPLAPSHSREHIDSRSSNDFKVASARDSSLFTAEDHGSTCIPNFSFSASTSQETIRHKK 501
Query: 441 SASKRQYNKKSWGKVGQDSYISPTIAVEVSLPSSSGQFVTFSGNPSPISSQMSQKGVDSW 500
+ ++Y +K V SLP S+ +++ M +
Sbjct: 502 LQAVKKYRRK----------------VNNSLPKSN------------LNATMRNNQENQP 561
Query: 501 MNKGQEMKQESVSTIAATVAAQEACEKWRLRGNQAYASGDLSKAEDHYTRGVNCISKDES 560
+N GQ KQ+S +T + CE WRLRGNQAY +G +SKAE+ YT G+N ++
Sbjct: 562 VNTGQ-AKQDS----GSTSMMPDVCEVWRLRGNQAYKNGYMSKAEECYTHGINSSPSKDN 621
Query: 561 SRSCLRALMLCYSNRAATRMSLGRLRDAISDCTMAAAIDPSFYKVYLRAANCYLGLGEID 620
S ++ L LCY NRAA R+SLGRLR+AISDC MAA++DPS+ K Y+RAANC+L LGE+
Sbjct: 622 SEYSVKPLALCYGNRAAARISLGRLREAISDCEMAASLDPSYIKAYMRAANCHLVLGELG 681
Query: 621 DAVQFFKRCLQPGNDIYVDRKIVVEASHGLQNAQKASECMKRLAELQLRSTSTDMQSVLE 680
AVQ+F +C++ + + +DR+ +EA+ GLQ AQ+ ++ + + T L
Sbjct: 682 SAVQYFNKCMKSTSSVCLDRRTTIEAAEGLQQAQRVADFTSCASIFLEKRTPDGASDALV 741
Query: 681 LISEALVISSCSEKLIEMKAEALFMLRRYEEVIQFCEHTVDSAEKNSPSEDIVSQTSNLD 740
I+ AL ISSCS+KL++MKAEALFM+RRY+EVI+ CE+T+ +AE+N S I T+
Sbjct: 742 PIANALSISSCSDKLLQMKAEALFMIRRYKEVIELCENTLQTAERNFVSAGIGGTTNVNG 801
Query: 741 SSEISKKLYFRIWRCRLTLKCYFLLGKLEEGLASLTQEEEVP-TVIGNGRKFLESSIPLA 800
L +WR K +F LG LE+ L L + ++V T N + ES L
Sbjct: 802 LGSTYHSLI--VWRWNKISKSHFYLGNLEKALDILEKLQQVEYTCNENQEECRESPASLV 861
Query: 801 VTMRELVRHKDAGNEAFQAGRNSEAIEHYTAALACNVESRPFSAFCFYSRAAAYKAQGQV 860
T+ EL+R+K+AGNEA + + EA+E YTAAL+ NV+SRPF+A CF +RAAA +A Q+
Sbjct: 862 ATISELLRYKNAGNEAVRDRKYMEAVEQYTAALSRNVDSRPFAAICFCNRAAANQALVQI 921
Query: 861 IDAIADCSLAIALDEEFFKAISRRATLYEMIRDYGQAANDLQKFILLFSKELEKTYQFGN 920
DAIADCSLA+ALDE + KA+SRRATL+EMIRDY QAA+DLQ+ I + K+ +KT
Sbjct: 922 ADAIADCSLAMALDENYTKAVSRRATLHEMIRDYDQAASDLQRLISILVKQSDKTKTPET 981
Query: 921 SDRSSTSANDLRQARLHLAEVEEESRKEIPLDMYLILGVDPSASSAEIKKAYRKAALRYH 980
S ++S +L+QAR L+ +EE+S++ I LD +LI+GV S S+A+IKKAYRKAALR+H
Sbjct: 982 SVDRASSRKELKQARQRLSVMEEKSKEGIHLDFFLIMGVKTSDSAADIKKAYRKAALRHH 1041
Query: 981 PDKAGQSLARADNGDNVLWKDIAGGVPMDADKLFKMIGEAYNVLSDPIKRSRYDAEEEMR 1040
PDKA Q L R+++ + K+I V AD+LFKMIGEAY+VLSDP KRS Y+ EEE+R
Sbjct: 1042 PDKAAQILVRSES-EGPWLKEILEEVHKGADRLFKMIGEAYSVLSDPTKRSDYELEEEIR 1100
Query: 1041 TAQKKRNASSTPRSHTDVHQGHQFERSSVRPQWQDLWRS 1066
A+ R + + ++ +Q + R W+D WR+
Sbjct: 1102 KARASRESYRSRKAAEASSPPYQ----TSRRYWKDSWRT 1100
BLAST of CmaCh02G012620 vs. TAIR 10
Match:
AT2G41520.2 (Heat shock protein DnaJ with tetratricopeptide repeat )
HSP 1 Score: 416.0 bits (1068), Expect = 9.4e-116
Identity = 283/699 (40.49%), Postives = 402/699 (57.51%), Query Frame = 0
Query: 381 ETEANSSVKFERQDSDGKNQFSFAS-------NSEDGSRS---NFTFAASSA---VRGQL 440
ET S E DS N F AS +ED + NF+F+AS++ +R +
Sbjct: 442 ETPLAPSHSREHIDSRSSNDFKVASARDSSLFTAEDHGSTCIPNFSFSASTSQETIRHKK 501
Query: 441 SASKRQYNKKSWGKVGQDSYISPTIAVEVSLPSSSGQFVTFSGNPSPISSQMSQKGVDSW 500
+ ++Y +K V SLP S+ +++ M +
Sbjct: 502 LQAVKKYRRK----------------VNNSLPKSN------------LNATMRNNQENQP 561
Query: 501 MNKGQEMKQESVSTIAATVAAQEACEKWRLRGNQAYASGDLSKAEDHYTRGVNCISKDES 560
+N GQ KQ+S +T + CE WRLRGNQAY +G +SKAE+ YT G+N ++
Sbjct: 562 VNTGQ-AKQDS----GSTSMMPDVCEVWRLRGNQAYKNGYMSKAEECYTHGINSSPSKDN 621
Query: 561 SRSCLRALMLCYSNRAATRMSLGRLRDAISDCTMAAAIDPSFYKVYLRAANCYLGLGEID 620
S ++ L LCY NRAA R+SLGRLR+AISDC MAA++DPS+ K Y+RAANC+L LGE+
Sbjct: 622 SEYSVKPLALCYGNRAAARISLGRLREAISDCEMAASLDPSYIKAYMRAANCHLVLGELG 681
Query: 621 DAVQFFKRCLQPGNDIYVDRKIVVEASHGLQNAQKASECMKRLAELQLRSTSTDMQSVLE 680
AVQ+F +C++ + + +DR+ +EA+ GLQ AQ+ ++ + + T L
Sbjct: 682 SAVQYFNKCMKSTSSVCLDRRTTIEAAEGLQQAQRVADFTSCASIFLEKRTPDGASDALV 741
Query: 681 LISEALVISSCSEKLIEMKAEALFMLRRYEEVIQFCEHTVDSAEKNSPSEDIVSQTSNLD 740
I+ AL ISSCS+KL++MKAEALFM+RRY+EVI+ CE+T+ +AE+N S I T+
Sbjct: 742 PIANALSISSCSDKLLQMKAEALFMIRRYKEVIELCENTLQTAERNFVSAGIGGTTNVNG 801
Query: 741 SSEISKKLYFRIWRCRLTLKCYFLLGKLEEGLASLTQEEEVP-TVIGNGRKFLESSIPLA 800
L +WR K +F LG LE+ L L + ++V T N + ES L
Sbjct: 802 LGSTYHSLI--VWRWNKISKSHFYLGNLEKALDILEKLQQVEYTCNENQEECRESPASLV 861
Query: 801 VTMRELVRHKDAGNEAFQAGRNSEAIEHYTAALACNVESRPFSAFCFYSRAAAYKAQGQV 860
T+ EL+R+K+A A CF +RAAA +A Q+
Sbjct: 862 ATISELLRYKNA-------------------------------AICFCNRAAANQALVQI 921
Query: 861 IDAIADCSLAIALDEEFFKAISRRATLYEMIRDYGQAANDLQKFILLFSKELEKTYQFGN 920
DAIADCSLA+ALDE + KA+SRRATL+EMIRDY QAA+DLQ+ I + K+ +KT
Sbjct: 922 ADAIADCSLAMALDENYTKAVSRRATLHEMIRDYDQAASDLQRLISILVKQSDKTKTPET 981
Query: 921 SDRSSTSANDLRQARLHLAEVEEESRKEIPLDMYLILGVDPSASSAEIKKAYRKAALRYH 980
S ++S +L+QAR L+ +EE+S++ I LD +LI+GV S S+A+IKKAYRKAALR+H
Sbjct: 982 SVDRASSRKELKQARQRLSVMEEKSKEGIHLDFFLIMGVKTSDSAADIKKAYRKAALRHH 1041
Query: 981 PDKAGQSLARADNGDNVLWKDIAGGVPMDADKLFKMIGEAYNVLSDPIKRSRYDAEEEMR 1040
PDKA Q L R+++ + K+I V AD+LFKMIGEAY+VLSDP KRS Y+ EEE+R
Sbjct: 1042 PDKAAQILVRSES-EGPWLKEILEEVHKGADRLFKMIGEAYSVLSDPTKRSDYELEEEIR 1069
Query: 1041 TAQKKRNASSTPRSHTDVHQGHQFERSSVRPQWQDLWRS 1066
A+ R + + ++ +Q + R W+D WR+
Sbjct: 1102 KARASRESYRSRKAAEASSPPYQ----TSRRYWKDSWRT 1069
BLAST of CmaCh02G012620 vs. TAIR 10
Match:
AT2G47440.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )
HSP 1 Score: 90.9 bits (224), Expect = 6.9e-18
Identity = 104/439 (23.69%), Postives = 190/439 (43.28%), Query Frame = 0
Query: 647 MKRLAELQLRSTSTDMQSVLELISEALVISSCSEKLIEMKAEALFMLRRYEEVIQFCEHT 706
+K L + D+ S L L+ AL IS E +E+KA +L LRR+++V +
Sbjct: 26 IKDATTLMASEEANDVASALHLLDAALSISPRLETALELKARSLLFLRRFKDVADMLQDY 85
Query: 707 VDS---------AEKNSPSEDIVSQTSNLDS---------SEISKKLYFRI--------- 766
+ S + + S S D ++ S+ S S++ KK+ I
Sbjct: 86 IPSLKLDDEGSASSQGSSSSDGINLLSDASSPGSFKCFSVSDLKKKVMAGICKKCDKEGQ 145
Query: 767 WRCRLTLKCYFLLGKLEEGLASL-------TQEEEVPTVIGNGRKFL------------- 826
WR + + LG +E+ + L + E ++ + FL
Sbjct: 146 WRYVVLGQACCHLGLMEDAMVLLQTGKRLASAEFRRRSICWSDDSFLLLSESSSASSPPP 205
Query: 827 --ESSIPLAVTMRELVRHKDAGNEAFQAGRNSEAIEHYTAAL-ACNVESRPFSAFCFYSR 886
E+ L ++ L+R + A A AG SE+I H++ + + F A C+ R
Sbjct: 206 ESENFTHLLAHIKLLLRRRAAAIAALDAGLFSESIRHFSKIVDGRRPAPQGFLAECYMHR 265
Query: 887 AAAYKAQGQVIDAIADCSLAIALDEEFFKAISRRATLYEMIRDYGQAANDLQKFILLFSK 946
AAAY++ G++ +AIADC+ +AL+ +A+ RA L E +R + + +DL+ LL++
Sbjct: 266 AAAYRSAGRIAEAIADCNKTLALEPSCIQALETRAALLETVRCFPDSLHDLEHLKLLYNT 325
Query: 947 ELEKTYQFGNS-DRSSTSANDLRQARLHLAEVEEESRKEIP------LDMYLILGVDPSA 1006
L G R + ++ L ++ +++I +D Y ++GV
Sbjct: 326 ILRDRKLPGPVWKRHNVKYREIPGKLCVLTTKTQKLKQKIANGETGNVDYYGLIGVRRGC 385
Query: 1007 SSAEIKKAYRKAALRYHPDKAGQSLARADNGDNVLWKDIAGGVPMDADKLFKMIGEAYNV 1029
+ +E+ +A+ LRY PD+A + R + D + M + L+++I + Y
Sbjct: 386 TRSELDRAHLLLCLRYKPDRASSFIERCEFTDQNDVDSVRDRAKMSSLLLYRLIQKGYTA 445
BLAST of CmaCh02G012620 vs. TAIR 10
Match:
AT3G62570.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )
HSP 1 Score: 89.0 bits (219), Expect = 2.6e-17
Identity = 119/487 (24.44%), Postives = 195/487 (40.04%), Query Frame = 0
Query: 647 MKRLAELQLRSTSTDMQSVLELISEALVISSCSEKLIEMKAEALFMLRRYEEVIQFCEHT 706
+K L D+ S + L+ AL IS SE +E+KA +L LRR+++V+ +
Sbjct: 26 IKDARSLMESEEQNDVASAIHLLDAALSISPRSETALELKARSLLFLRRFKDVVDMLQDY 85
Query: 707 VDSAEKNSPSED----IVSQTSNLDSSEISKKLYF------------------------- 766
+ S + ED + + SS++S+KL
Sbjct: 86 IPSLKLAVNEEDGSYSYEGSSYSSSSSQLSRKLLSDSSPRRDSSFKCFSVSYLKKKIMAG 145
Query: 767 --------RIWRCRLTLKCYFLLGKLEEGLASLTQEEEVPTV------------------ 826
+ WR + + LG +E+ L L + + TV
Sbjct: 146 ICKNRDQDKQWRYVVLGQACCHLGLMEDALVLLQTGKRLATVEFRRLSVSLSDDSVSLLL 205
Query: 827 -------IGNGRKF-------LESSIPLAVTMRELVRHKDAGNEAFQAGRNSEAIEHYTA 886
+ F E+ L + L+R + AG AF AG +++I H++
Sbjct: 206 SESSSSSSSSSYAFPPRKVSECETVTNLLAHTKNLLRRRSAGFAAFDAGLFADSIRHFSK 265
Query: 887 ALACNVESRP--FSAFCFYSRAAAYKAQGQVIDAIADCSLAIALDEEFFKAISRRATLYE 946
L P F A C+ RAAAYK+ G++ +AIADC+ +AL+ A+ RATL E
Sbjct: 266 ILDGRRRPAPQGFLADCYMHRAAAYKSAGKIAEAIADCNKTLALEPSCIHALETRATLLE 325
Query: 947 MIRDYGQAANDLQKFILLFSKELEKTYQFGNSDRSSTSANDLRQARLHLAEVEEESRK-- 1006
+R + +DL+ +L++ L G + R+ L E+ +S+K
Sbjct: 326 TVRCLPDSLHDLEHLKILYNTILRDRKLPGPPWKRHNV--KYREIPGKLCELTTKSKKLK 385
Query: 1007 ------EI-PLDMYLILGVDPSASSAEIKKAYRKAALRYHPDKAGQSLARADNGDNVLWK 1054
EI +D Y ++GV + +E+ +A LR+ PDKA + R D D
Sbjct: 386 AKMANGEIGNVDYYGLVGVRRGCTRSELDRANLLLCLRHKPDKALAFMERCDFFDQSEIS 445
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Q99615 | 5.3e-39 | 28.76 | DnaJ homolog subfamily C member 7 OS=Homo sapiens OX=9606 GN=DNAJC7 PE=1 SV=2 | [more] |
Q9QYI3 | 1.6e-38 | 28.76 | DnaJ homolog subfamily C member 7 OS=Mus musculus OX=10090 GN=Dnajc7 PE=1 SV=2 | [more] |
Q5R8D8 | 2.6e-38 | 28.57 | DnaJ homolog subfamily C member 7 OS=Pongo abelii OX=9601 GN=DNAJC7 PE=2 SV=1 | [more] |
Q54IP0 | 2.6e-25 | 25.83 | DnaJ homolog subfamily C member 7 homolog OS=Dictyostelium discoideum OX=44689 G... | [more] |
Q9HGM9 | 2.7e-22 | 23.60 | DnaJ homolog subfamily C member 7 homolog OS=Schizosaccharomyces pombe (strain 9... | [more] |
Match Name | E-value | Identity | Description | |
AT5G12430.1 | 1.1e-204 | 41.24 | Heat shock protein DnaJ with tetratricopeptide repeat | [more] |
AT2G41520.1 | 1.0e-130 | 43.06 | Heat shock protein DnaJ with tetratricopeptide repeat | [more] |
AT2G41520.2 | 9.4e-116 | 40.49 | Heat shock protein DnaJ with tetratricopeptide repeat | [more] |
AT2G47440.1 | 6.9e-18 | 23.69 | Tetratricopeptide repeat (TPR)-like superfamily protein | [more] |
AT3G62570.1 | 2.6e-17 | 24.44 | Tetratricopeptide repeat (TPR)-like superfamily protein | [more] |