Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonfive_prime_UTRpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
TTTTCTAGTTTAGGTACCGTTCTCGCATCCCTTCCATTTACCACCACGATACCCGCCGCCGCCCTCACCTGGTTCTGCTCTGCCTCCGTGGCTTATAGCAACAGCAATTTCCCTAAACCAATCTTCAATGGCGTACGAAGGAGGAAACCTCAAGTCCACCTCCATTAATGGGGTTAAGCTCTACACTGTAGCATCCCAACAACGCTCTCTTGCTACTTGGATCAATCCCAAGAAGCTGCGAGCTCTGCGCAAAGACAAGGGTTGTTAACTAGTTTTCCCTTCCCCAGTTCAATTTGTTATTCTCGTCTTAGTACAAAACTTCGTTTATTGAACTTCTAACTCGAAACTCGTTTTATTAGCCTCTATTCGATTATTTGCTTTAGGGAAGCTCATATTATCAATGAAGTTTTGTTTCTTGTTTAAAAATAAAGAGAAGAAACAAACTAATGTTCGATGAAAATATGCTTGATAAATGAATGCTTATTGATCTAATAAGAGGGAAATAGTGACGATTCTATAATGCACAGTGTAACACTTCCTTGTTCTTGAATCGGGTTTGACCATTATCTAGTTGGCCAGTAAATGCTCTTACGGTATCTATCTTTTCTTCCGGTTCCTCCCATCTTTGAAGTGTTTATTGGTGTTTAGTTATCGGTTTTTTCTTTCCGTAGCTAGGGTTAAATTATGGGATTCCTCCTCGCTACATTATGTTAAAAAGAATATTTTCTGAATCTTTATTGTTGCTCTTTCCACCCTTGTTTTTACATTGTTCTGCTGTTATCAAACAACCTTTTATATATGCATATGCAATTTGCCCAAGTATGTAGATTACATGGAAACCGAGAATGTAGTTTTTCTCTATAATGGTAGCCAAGGTTTTCCTTGTTCCCTCAAAGCATTATTGGGTTCTGCCATCAAATTGCATCTTGCTCTTATGTTGTAACAACTGTGTTCTTCCATCCTGTTTACTGGCATGGTGGATTGGCTGCTTTGTGATCAACGGACAATTTCCCGTTTTGGGAATTGAACAAGCAGGAATCTTTGTGATCTCGGTTATTGCTTTCTTGTTCTTTTTGGTACTGGTGTGCTGGTTCTGTGGTTTCTTGCTTTGAGTAAATACTCTACTTATAATGTTTGTTGCTCATGTAACGTGATTTCTGCACTATTTACTTAGGTTTTACCAATAATTTCATTATAAAACCTTTCTTCCTCCCAGCGTTCATCTGCAACATTGCCGTCCTGTAAAAACAATTGATTATTTTTATATTTTCTATCATCCATTCTTGTTTGTAATTTTGTTCGTTGATGTGTCAGATTACACGTCAAGGGTGGACTTGATCCAGGATTTGAGGTTTGACATTGCATCAAGCAAAATAAAAGCAACCCCCGACGGAGAGTTTCTTATAGCATCAGGTGATTTTAGGCAAGCTAAAATTAGCATCAGTAATCAGTAAGAAGATTCATAAGAAATACTGATTTTTTGTAGTATCTATTGTTACTTATGTATTGCTGTAAATAGGTATCTATCCACCGCAAGTTAAAGTATATGAACTGAGGGAACTTTCATTGAAGTTCAAAAGACATTTTGATTCAGAGATAATCGACTTCCAGGTAAATTTTATGCTGGAGAGCTCTGAGTTGGGTTGGAAAGGTGTGTTGTATTCTTTAAGATTGCATGGAGTTGGTTAAATTTTTATTCAGAATTAGAAACAGAAGCATATCTGTTCTTCCTGATAAGTAGGGAGCTTATTTTTATTTTAAATGATATGACCTTTATGCAATATAAACTCTTGTATGTCCAAATATCATTTTTTGAGCATAGGACAATGAATTGCTAAAAACAGACGTGTTTTTAGGCTTTAACATGTACTTGGACTAGGCTAACATCTAATTGTCTAGTCATCTTCTTGTCAAATCTTAGTCGTAAAATATTATTCCTGTGTGTGGTTTCTTATTAAAAAGTATTATTATGCATGTGAAGAGTGCGTGCGATGAATTGAGTAGAATTTATGCTGTATGGAAGGAAAGCAATAAATTAGTATTTGCATTTTTACAAAATGCCTGTAACTAATACTCTTGTAGTGTAACTTTATATGGAGTTGTATTACTCTTGAGATGGACATCGATTCCTCATAGACAAAACTAAGGGGTTCTTTTAGGCTGAAATCTCAGTTTGCCTTGGATATCTCTGATTTCAAACCGTTTGGCATGATCAGATGCATAGTGGTTGATATATTTTTTTTTAAAAAGGTTAATTTCCGTTTGACATTATGTATATAATATAAACAAAAGTTTCTATGGACTTTAATGAAAGAACAAATTAAGAGGGCCCCCTAAAATAATTTTAAAAAATTTGCGTAATTGTTCTTCCGTGCTATTATTTTATTTTCCTTGTGCCAATTATTCTTCTTATCTGTTGCACACATGTTTCCCCTTTTTGGAATGGATTTGAAGGTGGGAATGGAAGACAGGTTTAATATTTGATGAGGCATTGATTTTGCGGATAGTTTTTTTCCTTCAGAATTTTGCAATATTAACCGAATTTAATGTGTTTTTTCCTCTCTCCTATATAGATACTGGATGATGACTACTCAAAGTTGGCATTTATGTGTGCTGATCGTTCTATTGTTCTCCATGCTAAATATGGAAAACATCACACTTTGCGGATACCAAGGTTAATTCCCCCCCACCCAATTTTGTTGTAAATTAGCTCTGATCATGTTACGGGTTTATTTTATCAATAAGAGAATACTATTTACAAAGAGCACAAATGGGTACAGAAGACTAAACGCTTACCAAAATAGCTAGCACATGTAAGCCTCCAAATGATCATTAATTAAACGAACACTGTGATTGCCAAAGAATTTCTTGTGATCGCAACTCCATCTTGAGGTTGTAACTATTCCTTTTATGAATAAGAAGCATGGCTTGATCAGGGACCGAGTCCATTCTAAAGTAGAAATCTAACTAGGTCCGAAATCCATCTAGAAAAGTTTCAATCCCTTGGAATATCCTCTAATTCATGTAGAATCCCATGCTTCAAAGGTGAGCATTATCGCATTAGCTGAAGGGAGAGCCTGAGATTAGTTGTGGATTGAAGCTGTTTCATGGCTGTCTTAGAAATGATACCAGCTATGCCAAAAATGCCATAGAGTTTAGAGACCAGCAAGCTGCCACAAAAGGAGCGTCTATTCACCAAGTCTTGGAGTATTGATAGGCATTTAAACTATTGAAAAGTCATTTAATGAAGCAAATGGCAGAATGCTTGAAAAAACTCAAGACTTCTTGAACTTTTGAATTTGGGCTCTTATTTAGTTTTTCCTTTCCAATCGCCTCAAGTGACGTCTCTCAACTTTTAGAATCTCTAGCAAGGTCGATCTTTTTTTTATTTTTTATTTTTTATTTTTAAATGTGAGAGACCTTCTCTGGAGTGCGGCAAGGCTTACCTAGTCATTTAGAAGATTTTCATTTTTCTCCTTGGATACGGATGGCTTGGGATTGATAGTCTTCACTATCTCTTCTTGCTTAGTGAGGGTGGAATTCTTCATGCCATTTAGTTGCGGAACTAAATCTTGATAAAATATTTATAACTCATGAAAGGCTAGTTTTTGTAGCTTCCTCGTGGTCTTCCTTGGATGAGTATATTTGAAATCTTGATCTTTCTTTGATTTCCATTTTATCCCTTTGATTTTGTCAGCTTGCCTTGTAAAATTATTTCATCAATAAAATAGCTTCTTATTTAAAGAAAGACAGACAGATTTGAGATATGGCTACAAATTGATCTTGCTCTAGTTTTATGGTTTCATTTTTCTTGTGGTTTATGAACGAGTAAACTCTAATATTTTTTTGTATCTATGCCATCAACAGGATGGGAAGGAATATTGAATATGACAACTGGTCTGCTGACTTGCTTTGTGCTGCATCCTCTCCAGATGTGTACAGAATCAGTTTGCAGCAGGTTTTAGTTTTTTTCTTGTCTTTTTTAAGTCATTGCAGACTCAACTTCGTTAAATTTTCTACTAAATTACACTAGGATACATAAGAAAATTCCATGAAATGTTTCTGACATAATGTTCTTGCAAAGAGGAATAACGGTATGCAACTTACGAGTTAAGTGACACTTTGTTGAGAGGTTCTTTGGCAGCTTTGGAGAAATAATGAATGCATGGCTTAATCGAACTTAGATAGTCTGTTGTCAGATTGTATTCTTTTTAGAAAAAAGGTTCTATGCATATATAAATATTCGAAGGTATCTTATTTAGGAATGCTTGATCTTTTTCCCCTATGAAATGCTAATAGTATTCATAAAAACAAATGCAACTTCAAATTTTAAAAAGATAAAACGAAGCAAACAATCACTACATAGGGTCACATGCAAACATTTTGTTATTGGTTCTATATTTTTATATAACTTCAAATATGGCAATGTGTTTGCTGTTGTGTTCATCGAACTTTAAATTAGCTTTTATTGTGTTATTGGCCATTCATAAGTCTCCATTTTTTTGGAGTCCTTTTCATTTATCTTTTCGAAAGAAGAATACAGCCTTAAAATGAAGAAATATCTATGATTCTAAGCACAGAAAAGGTAAGTGAAAGAAAGAAAAAAGGGAAAACCCAAAGAGAGTTTAACTGAGAATTGTAATATGTATGAATGAAAAGCCAACTATAACTGTTTACAATAAGGGTAAATAGAAGTAAAGGTCCAGCCCAACTTCTTGTGTGCTGAAACTCCAATGGTCTTTTGTGTTTGCACATTTTGTATATCGATTGATTAAAAGCTAATGTGTCATGTGCAATGACTTTCTGAATTTATTATGATTTGTGGTTTTCCAAGAATGTTGTTATGTGAGCTTGGAAGTTAAATTTATGATAGTTTTACATTTAAGTCTAGAAATGACCATTTTTGTTTGTTGTAGGGGCGGTTTCTTCCACCTCTCAACACAGAATCCCCAGCAATAAATGTTGTTTCTCGAAGGTCAGTTAATCTCATTCTTGTTATCATACTCTTCTCGTGTCAATTGATTGATTTTCTTTATTGAGTAGTGAATCACGTAGTTTCATATATTTTATCCAAGTTAAAAGTATGTTCAACAATTGTAGCAAGCTTCATGGGATAGTTGCTTGTGGGGGTGTAGATGGAGCTGTAGAATGTTTTGACACCAGAACGAAGTTATCTTCAATTGGAAGAATTGATGCTATTGCACCTGCAGGAGATAAGGACCAGGTAATTCTCCTTGTTGACCTATGGTAAAATCTTGTTCTGGGTAGTATGCCAACAACGAGTGTTCGATCCATTGTGAATAATTACATACATATCAACGTGACGGGAAGCATCTTGATGGTTACTTATGTTTTTAAATGACAGTTTTATTCTTTTTTTTAGTTAGGCTTCAGTGTTTTGTTGCAATGGCTTTTATTTAAATACTAATTTATGGCCCAATTGTAAACTCTCAAGGTTTCAATGAGATCTCGTGCAGAAGTGAAACGTTTAATCAACAAAATACTGAATATTATATTTTTGTGATGCAGGAGGTTACTGCATTAGCATTTGATGATATTGGTGGATTCCAAATGGCCGTCGGCAGTAGCTCAGGAAAGGTAAAGCTTCATCTCCTTAGTTGTGGTGTTCTTGGAATTGAAATCTATATGAATTTCCTGAATATTGAATGATTGCATCAAGGATGAAAATCTAAATTAACAACGGTGATGGCAGACTCTGGTTGGAGCTTTCCACTTTTTGTAAATAGCCTACACATAGGGCCCAAAAAAGAATGCGCTACCCCCGGCCCTCTTTTCAGTTAAAGAAAATTCAAGCACTGTGATTATTACATTTTGTTTTGGTCTTGACGTGCTTTGAAAAAGATAGAGATATGATTTGTACTAGAACACTGACGATAGCCGCCCCACCGGGAACACATTCACTACTGGTTCTCTTTTTTCTTATTGATGTATTATGGATGCAGAAGTGCATAAGTTTTTGTTGGCTACCTTCTTCTTACTGTTTGTTTTTGCCCATCCTTTTGGGGCCTGTAACAGGATTAGGAAATAAGTAACGCTACACATGAAACTTTCATCATCAGCACACTCATGCCTAAGCCAACTAAAAAACAATTAAAACTATTCCTTCCTTCATCTCCTTCATTTGCGTGTATTTAATAAATCATAACCAATTCCTACCCACAAACTCCTTGTTCAATGACTAACAAACCCTTAATATCTTGGTAATATTTCTAACTTCTACCCTAATCGTGATGAATCTCATTTAAATAAGGATAAGGAATCCAACCATCCCATAGTTAAAGTGAACACAAAAACCTCCCCAATGGATTTGAATAGAAAAAAGGAGAATAGTTACAAGTGCCTCTACTTAGAGAAACACCACCTAGCAGTCAGAAAAGGAAACACTTCCTCAAAAACTGCAAAATGTATCTCCAAAAGTTTTACTGCTCCTGATCATGTCCAAGTTTTTCTACCACGGTTTTTGAGAAGTACTGCAGTTCCATAATTTCCTGTATCTTCGGATATCAATTTTTATTTATATTTTGACCTGATTGATATGATCACATCACATGCAGATTTTGATATACGATTTGCGTTCATCAGATCCTATAAGAATCAAAGATCACATGTGAGTTTACCTTCTTTCATTAAATTTAGAGAGTTTTAACCATCACGTACTGAAAAGTAATATGATTATTTTGCAGGTACGACAGTCCAATTCTGAACATTAAGTGGCATAGCACTCTTAATTCTGAAAAACCAAAAATGATTACCACTGACAAGCACATTGTTAGGATTTGGGACCCAGATACTGTAAGAAATCTCTACCTCTGTTTTTACAGTTTATTAAAAGACAAAAATAAAACTTCGAGCAACATGCATTTAGCTAGGTGTAAATCTCTTTTTACTAAAGTCAATGGTATGTCTTTGACATTACAATGGTAGATTGCTGTGTATTAAACATTATATGTATTCTGAATTTTTGAGAATTATTTTAATTTGTTAATTAAAGGCTTTATTTGTTAACTGTTCTTTTGTTATTAAACCACATGTTCTGTTTCTCATTTTCTTTTGATTAATAGCAAAACTTGTATTCAGTCATTGGTAGACAAACCAAAATTCCTGATATCTTCTTTCAATTATGAAAATAATGTGTTTTAGGGAGAAGGAATGACCAGCATTGAGCCCACGCTTGGTCCAATAAATGATACATGTGTATTCAAGGACAGTGGATTGATGCTACTAGCTTTGAATTCAAGTCAAATACCTTCTTACTTTCTGCCTGCGCTGGGACCTGCACCAAAGTGGTGCTCTTATTTGGAGAATTTGACGGTGAGCTTTTGGAACTTATTCTCTTTTGTATTCCTTTGTTTGGGATCTCGGATTAGTAATGCTAATACTTTATTTGTCTTAACGTAGGAGGAGTTGGAGGAGGGTGCAGAAACAACTATATATGATGATTTCAAGTTCTTGACAAAAGAAGAACTTGAGAGGCTAAATTTGACAAACCTGATTGGGACTAATCTGCTAAGAGCTTACCTTCATGGTTTCTTTATTGATTATCGCTTGTATAAAAAGGTAATTTTATGGTTTGTAGCATAGCGGTTAAATTTATTGTTTTATCTGCTTAGGAGTAAATTTATGGTTTTCTTTAGAGTATGACATAGTATTAACTACCTGAGTATTAAATACATGAGAAATGTTGCTTAATTTTGATAGATCGTGGGGTTACCGAATTACAGTACCCTGAGAAATTTTTGTATCATGTTGATTGAGAATATTCGAAGTGCTGAGCTAGTTACTTTTTGTTATGTATGCTTGTGACTTTGCAGGCAAAATCACTGGCGGATCCTTTTGCATATGATGCTTACATAGAACAAAGGAAAAAAGAGAAACTAGATGAGGAGCGTGCCAACCGAATTACGGTAAACCATTTTTAGTCCCAGAAGTTTATAATATTTGATTACCTCAGTTTTAATCAATAGGTTTATATTACCGAAGAATTTATGACAAACTTCCATGCGGTCTGTTGGAGAGTTCTTTCCAAATAATTATACTCTAATAATAAATCTGCTTTATGTGTGAATGAGTTTTCTATCTTTTTTTTTTTTTTTTTTTTTTAGGGGNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTTTTTTTTTTTTTTTTTTTTTTTAATGGGATTTTCCCCTATACTTGGACTTCGTAAGTGAAGGAGGGTTGACGCAGTCTTTATGGAATCTACTATTTTGGCTTGTCATTATTTTAATATGTTTTTAATGTAAATTGTATTTTTAACTACTTTTTTGCTTCAGGTGAAAAGAAAATTACCCAAAGTCAACAGGCGCCTCGCAAATCAAATCCTTGAAGAGGAAGAAGCTGGAACGGAAAAGAAGGATGAAGAGGATGTCAATAAGACCAAGAAGGCATCAAAGAAGAAGAAAGGGCTCAGTAGTGAAATCTTTCAAGATGAACGTTTTGCGAATATGTTTAAGAATGAGGTATGACTCGGTTCTTAGATGCTCCCTCTCATTTTTTTCTCCACATTTCTATTGGTTGTTTGCAATATGTCACGAAAAATCAAGGAAGAAAAAAAAAATTCTAGGCTTCATCTGCAAGATGTCTAAGGTCAAATTACACATGCGTATCTATTTGTTTAGTGGGGATGTTTATTAGCAAAATATCAAGAAAAAGAAAGTTGTGGCGTAGGGTGTTCGCTTGAATTTTACTCATTTAAATGTAGTTTCCTTGTTTTTTGTTGGATCATATTATAGCTCTTGTTCATCTAATGTTGGATCATGTTATGCTACAGAACTTCGAGATCAATGAGCTTTCACCAGAGTATCTGGCCCTGCATCCAGTGGCTTCTACGAAGCAGCCATCTTTGGTGGAAGAGCATTTTGAACCTGTCTTGGACGACAGTGAACAAAGCTTAAGTAATTCTGATGCCTCAGTTGAATCAGACTTTGAAGATGAACCCAGCAGAGATAAGCATAATAAGGCGCGAGCTCCAAAGTGAGTTTTTTCAACTCATAATTATTTTATATACTTTTTTCTTGGGAGCTAAAAGTTTTCCGATCTTCTAACCTATTAAAAGAAAGCTGTATAATCTAATACAAATTTTCCATCAAGGAAAGGAAATGCGTACTGTACTAACTACAAGTTAATATCTTTCTCTTCATGAGTTCATATGAATTCTTTGTGCTAAAGTCGGTATCTTTGGAGGGGATAGATAGCGTTTAATAATTTTATTGATGTGAATGTGTAGATTGTATGAGGTGAAGGATGAAAAGCATGCTGAGGCGTTCTGGAACAGTGTATCACTTGCTAAGGAGGAAAGGCTCACTATGGAGGAGAGAATTGCTGCTATGGGAGACAATAAGCAAGGTTCTGGAATTCTAAATGAAGTTAAATTGGGACCAGGAGGGTCGCGAGAGATTTCGTTTAAGCCAAGAAGCTCTGCCAAGTACATGGAAGATGATGACGATGAAGGCCCACGAAAGAAGAATCGGAGTGCCGAATTTTATGGTCCAAAGGCCAATAAATCAGGTTCCCGAGGTGGAATGAGACATGGGAGCAGCATAGGTGGAAACAACAGAGGTAGAGGTAGAGGTAGAAGAGGGCGCCGGTGAATGATTCACACTGGTGTATCATATGAAGAAAAATATATGAAACAGTTGCAGCCTTAAGCCATGAAATTAGATTTGTGCTTTGTCTTCCTTACCGAGAAGGAAGATAAGAGGAATGAGACGTCGACATGGTGAGAGCATAGAGTATTCCCATCTCACCCATGAGAAGATTTTATAAACTGTAAATTTTGTTAGGTTAGTTTTAAGTTATTAATATTGTTTCACTTGAATGAGTAGAGCCTGTTCTTTATTAATATGATCGAGAACCCAATTGAAAGAATGAAACTTTATTAATCGCTGGATCGATTGTACATGTAGTTTAGCAAACAGATTGTGTTTTAAACTGAAGTAGCAAGATAAAG
mRNA sequence
TTTTCTAGTTTAGGTACCGTTCTCGCATCCCTTCCATTTACCACCACGATACCCGCCGCCGCCCTCACCTGGTTCTGCTCTGCCTCCGTGGCTTATAGCAACAGCAATTTCCCTAAACCAATCTTCAATGGCGTACGAAGGAGGAAACCTCAAGTCCACCTCCATTAATGGGGTTAAGCTCTACACTGTAGCATCCCAACAACGCTCTCTTGCTACTTGGATCAATCCCAAGAAGCTGCGAGCTCTGCGCAAAGACAAGGATTACACGTCAAGGGTGGACTTGATCCAGGATTTGAGGTTTGACATTGCATCAAGCAAAATAAAAGCAACCCCCGACGGAGAGTTTCTTATAGCATCAGGTATCTATCCACCGCAAGTTAAAGTATATGAACTGAGGGAACTTTCATTGAAGTTCAAAAGACATTTTGATTCAGAGATAATCGACTTCCAGATACTGGATGATGACTACTCAAAGTTGGCATTTATGTGTGCTGATCGTTCTATTGTTCTCCATGCTAAATATGGAAAACATCACACTTTGCGGATACCAAGGATGGGAAGGAATATTGAATATGACAACTGGTCTGCTGACTTGCTTTGTGCTGCATCCTCTCCAGATGTGTACAGAATCAGTTTGCAGCAGGGGCGGTTTCTTCCACCTCTCAACACAGAATCCCCAGCAATAAATGTTGTTTCTCGAAGCAAGCTTCATGGGATAGTTGCTTGTGGGGGTGTAGATGGAGCTGTAGAATGTTTTGACACCAGAACGAAGTTATCTTCAATTGGAAGAATTGATGCTATTGCACCTGCAGGAGATAAGGACCAGGAGGTTACTGCATTAGCATTTGATGATATTGGTGGATTCCAAATGGCCGTCGGCAGTAGCTCAGGAAAGGCAAAATCACTGGCGGATCCTTTTGCATATGATGCTTACATAGAACAAAGGAAAAAAGAGAAACTAGATGAGGAGCGTGCCAACCGAATTACGGTGAAAAGAAAATTACCCAAAGTCAACAGGCGCCTCGCAAATCAAATCCTTGAAGAGGAAGAAGCTGGAACGGAAAAGAAGGATGAAGAGGATGTCAATAAGACCAAGAAGGCATCAAAGAAGAAGAAAGGGCTCAGTAGTGAAATCTTTCAAGATGAACGTTTTGCGAATATGTTTAAGAATGAGAACTTCGAGATCAATGAGCTTTCACCAGAGTATCTGGCCCTGCATCCAGTGGCTTCTACGAAGCAGCCATCTTTGGTGGAAGAGCATTTTGAACCTGTCTTGGACGACAGTGAACAAAGCTTAAGTAATTCTGATGCCTCAGTTGAATCAGACTTTGAAGATGAACCCAGCAGAGATAAGCATAATAAGGCGCGAGCTCCAAAATTGTATGAGGTGAAGGATGAAAAGCATGCTGAGGCGTTCTGGAACAGTGTATCACTTGCTAAGGAGGAAAGGCTCACTATGGAGGAGAGAATTGCTGCTATGGGAGACAATAAGCAAGGTTCTGGAATTCTAAATGAAGTTAAATTGGGACCAGGAGGGTCGCGAGAGATTTCGTTTAAGCCAAGAAGCTCTGCCAAGTACATGGAAGATGATGACGATGAAGGCCCACGAAAGAAGAATCGGAGTGCCGAATTTTATGGTCCAAAGGCCAATAAATCAGGTTCCCGAGGTGGAATGAGACATGGGAGCAGCATAGGTGGAAACAACAGAGGTAGAGGTAGAGGTAGAAGAGGGCGCCGGTGAATGATTCACACTGGTGTATCATATGAAGAAAAATATATGAAACAGTTGCAGCCTTAAGCCATGAAATTAGATTTGTGCTTTGTCTTCCTTACCGAGAAGGAAGATAAGAGGAATGAGACGTCGACATGGTGAGAGCATAGAGTATTCCCATCTCACCCATGAGAAGATTTTATAAACTGTAAATTTTGTTAGGTTAGTTTTAAGTTATTAATATTGTTTCACTTGAATGAGTAGAGCCTGTTCTTTATTAATATGATCGAGAACCCAATTGAAAGAATGAAACTTTATTAATCGCTGGATCGATTGTACATGTAGTTTAGCAAACAGATTGTGTTTTAAACTGAAGTAGCAAGATAAAG
Coding sequence (CDS)
ATGGCGTACGAAGGAGGAAACCTCAAGTCCACCTCCATTAATGGGGTTAAGCTCTACACTGTAGCATCCCAACAACGCTCTCTTGCTACTTGGATCAATCCCAAGAAGCTGCGAGCTCTGCGCAAAGACAAGGATTACACGTCAAGGGTGGACTTGATCCAGGATTTGAGGTTTGACATTGCATCAAGCAAAATAAAAGCAACCCCCGACGGAGAGTTTCTTATAGCATCAGGTATCTATCCACCGCAAGTTAAAGTATATGAACTGAGGGAACTTTCATTGAAGTTCAAAAGACATTTTGATTCAGAGATAATCGACTTCCAGATACTGGATGATGACTACTCAAAGTTGGCATTTATGTGTGCTGATCGTTCTATTGTTCTCCATGCTAAATATGGAAAACATCACACTTTGCGGATACCAAGGATGGGAAGGAATATTGAATATGACAACTGGTCTGCTGACTTGCTTTGTGCTGCATCCTCTCCAGATGTGTACAGAATCAGTTTGCAGCAGGGGCGGTTTCTTCCACCTCTCAACACAGAATCCCCAGCAATAAATGTTGTTTCTCGAAGCAAGCTTCATGGGATAGTTGCTTGTGGGGGTGTAGATGGAGCTGTAGAATGTTTTGACACCAGAACGAAGTTATCTTCAATTGGAAGAATTGATGCTATTGCACCTGCAGGAGATAAGGACCAGGAGGTTACTGCATTAGCATTTGATGATATTGGTGGATTCCAAATGGCCGTCGGCAGTAGCTCAGGAAAGGCAAAATCACTGGCGGATCCTTTTGCATATGATGCTTACATAGAACAAAGGAAAAAAGAGAAACTAGATGAGGAGCGTGCCAACCGAATTACGGTGAAAAGAAAATTACCCAAAGTCAACAGGCGCCTCGCAAATCAAATCCTTGAAGAGGAAGAAGCTGGAACGGAAAAGAAGGATGAAGAGGATGTCAATAAGACCAAGAAGGCATCAAAGAAGAAGAAAGGGCTCAGTAGTGAAATCTTTCAAGATGAACGTTTTGCGAATATGTTTAAGAATGAGAACTTCGAGATCAATGAGCTTTCACCAGAGTATCTGGCCCTGCATCCAGTGGCTTCTACGAAGCAGCCATCTTTGGTGGAAGAGCATTTTGAACCTGTCTTGGACGACAGTGAACAAAGCTTAAGTAATTCTGATGCCTCAGTTGAATCAGACTTTGAAGATGAACCCAGCAGAGATAAGCATAATAAGGCGCGAGCTCCAAAATTGTATGAGGTGAAGGATGAAAAGCATGCTGAGGCGTTCTGGAACAGTGTATCACTTGCTAAGGAGGAAAGGCTCACTATGGAGGAGAGAATTGCTGCTATGGGAGACAATAAGCAAGGTTCTGGAATTCTAAATGAAGTTAAATTGGGACCAGGAGGGTCGCGAGAGATTTCGTTTAAGCCAAGAAGCTCTGCCAAGTACATGGAAGATGATGACGATGAAGGCCCACGAAAGAAGAATCGGAGTGCCGAATTTTATGGTCCAAAGGCCAATAAATCAGGTTCCCGAGGTGGAATGAGACATGGGAGCAGCATAGGTGGAAACAACAGAGGTAGAGGTAGAGGTAGAAGAGGGCGCCGGTGA
Protein sequence
MAYEGGNLKSTSINGVKLYTVASQQRSLATWINPKKLRALRKDKDYTSRVDLIQDLRFDIASSKIKATPDGEFLIASGIYPPQVKVYELRELSLKFKRHFDSEIIDFQILDDDYSKLAFMCADRSIVLHAKYGKHHTLRIPRMGRNIEYDNWSADLLCAASSPDVYRISLQQGRFLPPLNTESPAINVVSRSKLHGIVACGGVDGAVECFDTRTKLSSIGRIDAIAPAGDKDQEVTALAFDDIGGFQMAVGSSSGKAKSLADPFAYDAYIEQRKKEKLDEERANRITVKRKLPKVNRRLANQILEEEEAGTEKKDEEDVNKTKKASKKKKGLSSEIFQDERFANMFKNENFEINELSPEYLALHPVASTKQPSLVEEHFEPVLDDSEQSLSNSDASVESDFEDEPSRDKHNKARAPKLYEVKDEKHAEAFWNSVSLAKEERLTMEERIAAMGDNKQGSGILNEVKLGPGGSREISFKPRSSAKYMEDDDDEGPRKKNRSAEFYGPKANKSGSRGGMRHGSSIGGNNRGRGRGRRGRR
Homology
BLAST of CmoCh06G000160 vs. ExPASy Swiss-Prot
Match:
Q7T0Q5 (Nucleolar protein 10 OS=Xenopus laevis OX=8355 GN=nol10 PE=2 SV=1)
HSP 1 Score: 206.1 bits (523), Expect = 1.0e-51
Identity = 179/686 (26.09%), Postives = 296/686 (43.15%), Query Frame = 0
Query: 8 LKSTSINGVKLYTVASQQRSLATWINPKKLRAL-RKDKDYTSRVDLIQDLRFDIASSKIK 67
++ +++N VK+Y + S +SL W++ +K RAL +KD D R++LIQD S+ IK
Sbjct: 1 MQVSNVNDVKIYNL-SCGKSLPEWLSDRKKRALQKKDVDVRRRIELIQDFEMPTVSTNIK 60
Query: 68 ATPDGEFLIASGIYPPQVKVYELRELSLKFKRHFDSEIIDFQILDDDYSKLAFMCADRSI 127
+ DG++++A+G Y P+++ Y+ +LSLKF+R DSE+I F IL +DYSK+ F+ +DR +
Sbjct: 61 VSRDGQYIMAAGTYKPRIRCYDTYQLSLKFERCLDSEVIKFDILSEDYSKIVFLQSDRYV 120
Query: 128 VLHAKYGKHHTLRIPRMGRNIEYDNWSADLLCAASSPDVYRISLQQGRFLPPLNTESPAI 187
LH+++G+++ LRIP+ GR+ Y S DL +S +VYR++L+QGR+L L TE+ I
Sbjct: 121 ELHSQHGRYYRLRIPKFGRDFAYHYPSCDLYFVGASSEVYRLNLEQGRYLNSLQTEASQI 180
Query: 188 NVVSRSKLHGIVACGGVDGAVECFDTRTKLSSIGRIDAIAPAGDKDQEVTAL----AFDD 247
NV + H + A G +G VEC+D RT+ S +G +D + D EV L A
Sbjct: 181 NVCDINPTHHLFAAGTTEGRVECWDPRTR-SRVGLLDCALSSVTADMEVEGLPSVSALKF 240
Query: 248 IGGFQMAVGSSSG----------------------------------------------- 307
G MAVG+S+G
Sbjct: 241 NGPLHMAVGTSTGQVLLYDLRSNRPVIVKDHQYGLPIKSIQFHSALDLVISADSRIIKMW 300
Query: 308 ------------------------------------------------------------ 367
Sbjct: 301 NKDNGKIFTSIEPEADVNDVCLYPNSGMLFTANEAPKMNVYYIPALGPAPRWCSFLDNLT 360
Query: 368 ----------------------------------------------------KAKSLADP 427
K K++ +P
Sbjct: 361 EELEENPENTVYDDYKFVTRKELDELGLSHLIGSPMLRAYMHGFFMDIRLYHKVKAMVNP 420
Query: 428 FAYDAYIEQRKKEKLDEERANRITVKRKLPKVNRRLANQILEEEEAGTEKKDEEDVNKTK 487
FAY+ Y +++ ++K++E RA R+ +K KLPKVN+ LA ++ E+EE +E+ ++K K
Sbjct: 421 FAYEEYKKEKIRQKIEETRAQRVQIK-KLPKVNKELALKLYEDEE------EEKQLSKKK 480
Query: 488 KASKKKKGLSSEIFQDERFANMFKNENFEINELSPEYLALHPVAS---------TKQPSL 499
K KK I D+RF MF+N +F++++ S EY L+P+ S K
Sbjct: 481 KKQKK----MPNILTDDRFKVMFENPDFQVDQESEEYRLLNPLVSKISEKRKKKLKILEK 540
BLAST of CmoCh06G000160 vs. ExPASy Swiss-Prot
Match:
Q6NVM6 (Nucleolar protein 10 OS=Xenopus tropicalis OX=8364 GN=nol10 PE=2 SV=1)
HSP 1 Score: 199.1 bits (505), Expect = 1.2e-49
Identity = 172/686 (25.07%), Postives = 291/686 (42.42%), Query Frame = 0
Query: 8 LKSTSINGVKLYTVASQQRSLATWINPKKLRAL-RKDKDYTSRVDLIQDLRFDIASSKIK 67
++ +++N VK+Y + S +SL W++ +K RAL +KD D R++LIQD S+ IK
Sbjct: 1 MQVSNVNDVKIYNL-SCGKSLPEWLSDRKKRALQKKDVDVRRRIELIQDFEMPTVSTNIK 60
Query: 68 ATPDGEFLIASGIYPPQVKVYELRELSLKFKRHFDSEIIDFQILDDDYSKLAFMCADRSI 127
+ DG++++A+G Y P+V+ Y+ +LSLKF+R D+E++ F IL +DYSK+ F+ +DR +
Sbjct: 61 VSRDGQYIMAAGTYKPRVRCYDTYQLSLKFERCLDAEVVKFDILSEDYSKIVFLQSDRYV 120
Query: 128 VLHAKYGKHHTLRIPRMGRNIEYDNWSADLLCAASSPDVYRISLQQGRFLPPLNTESPAI 187
LH+++G+++ LRIP+ GR+ Y S DL +S +VYR++L+QGR+L L T++ I
Sbjct: 121 ELHSQHGRYYRLRIPKFGRDFAYHYPSCDLYFVGASSEVYRLNLEQGRYLNSLQTDASQI 180
Query: 188 NVVSRSKLHGIVACGGVDGAVECFDTRTKLSSIGRIDAIAPAGDKDQEVTAL----AFDD 247
NV + H + A G +G VEC+D RT+ + +G +D + D EV L A
Sbjct: 181 NVCDINPAHHLFAAGTTEGKVECWDPRTR-NRVGLLDCALSSVTADMEVEGLPSVSALKF 240
Query: 248 IGGFQMAVGSSSG----------------------------------------------- 307
G MAVG+S+G
Sbjct: 241 HGPLHMAVGTSTGQVVLYDLRSNRPLIAKDHQYGLPIKSIQFHSALDLVISADSRIIKMW 300
Query: 308 ------------------------------------------------------------ 367
Sbjct: 301 NKDNGKIFTSIEPEADVNDVCLYPNSGMLFTANEAPKMNVYYIPALGPAPRWCSFLDNLT 360
Query: 368 ----------------------------------------------------KAKSLADP 427
K K++ +P
Sbjct: 361 EELEENPESTVYDDYKFVTRKELDELGLSHLIGSPLLRAYMHGFFMDIRLYHKVKAMVNP 420
Query: 428 FAYDAYIEQRKKEKLDEERANRITVKRKLPKVNRRLANQILEEEEAGTEKKDEEDVNKTK 487
FAY+ Y +++ ++K++E RA R+ +K KLPKVN+ LA ++ EEEE ++K
Sbjct: 421 FAYEEYKKEKIRQKIEEARAQRVQIK-KLPKVNKELALKLYEEEEELSQK---------- 480
Query: 488 KASKKKKGLSSEIFQDERFANMFKNENFEINELSPEYLALHPVAS--------------- 499
KKK+ I D+RF MF+N +F++++ S EY L+P+ S
Sbjct: 481 ---KKKQKKMPNILSDDRFKVMFENPDFQVDQESEEYRLLNPLVSKISEKRKKKLKILEK 540
BLAST of CmoCh06G000160 vs. ExPASy Swiss-Prot
Match:
Q802W4 (Nucleolar protein 10 OS=Danio rerio OX=7955 GN=nol10 PE=2 SV=1)
HSP 1 Score: 196.4 bits (498), Expect = 8.0e-49
Identity = 185/742 (24.93%), Postives = 309/742 (41.64%), Query Frame = 0
Query: 8 LKSTSINGVKLYTVASQQRSLATWINPKKLRAL-RKDKDYTSRVDLIQDLRFDIASSKIK 67
++ +S+N VK+Y + S +SL W++ +K R L +KD D R++LIQD + I+
Sbjct: 1 MQVSSVNNVKIYNL-SHGKSLPEWLSDRKKRVLQKKDVDIQRRIELIQDFEMPTVCTSIR 60
Query: 68 ATPDGEFLIASGIYPPQVKVYELRELSLKFKRHFDSEIIDFQILDDDYSKLAFMCADRSI 127
+ DG++++A+G Y P+V+ Y+ +LSLKF+R DS+++ F IL DDYSKL F+ DR +
Sbjct: 61 VSRDGQYILAAGTYKPRVRCYDTYQLSLKFERCLDSDVVTFDILSDDYSKLVFLHIDRYV 120
Query: 128 VLHAKYGKHHTLRIPRMGRNIEYDNWSADLLCAASSPDVYRISLQQGRFLPPLNTESPAI 187
H+++G ++ RIP+ GR+ Y + S DL +S +V+R++L+QGRFL L T++ +
Sbjct: 121 EFHSQHGHYYKTRIPKFGRDFSYHSPSCDLYFVGASSEVFRLNLEQGRFLNSLQTDAAEM 180
Query: 188 NVVSRSKLHGIVACGGVDGAVECFDTRTKLSSIGRIDAIAPAGDKDQ-----EVTALAFD 247
NV + +H + A G ++G V+C+D R + A++ D + V+AL F+
Sbjct: 181 NVCDINPVHQLFAAGTLEGRVDCWDPRVRTRVAALDCALSSITDNTEVEGLPSVSALKFN 240
Query: 248 DIGGFQMAVGSSSG---------------------------------------------- 307
D G +AVG+S+G
Sbjct: 241 DSLG--LAVGTSTGQILVYDLRSSRPLLVKDHYYGLPIKSLHFHNSLDLVLSADSKIIKM 300
Query: 308 ------------------------------------------------------------ 367
Sbjct: 301 WNKDNGKVFSSIEPQANINDVCLYPASGMLFTANEDPKMNTFYIPALGPAPRWCSFLDNL 360
Query: 368 -----------------------------------------------------KAKSLAD 427
K K++ +
Sbjct: 361 TEELEENPESTIYDDYKFVTRKDLESLGLAHLIGSPLLRAYMHGFFMDIRLYHKVKTMVN 420
Query: 428 PFAYDAYIEQRKKEKLDEERANRITVKRKLPKVNRRLANQILEEEEAGTEKKDEEDVNKT 487
PFAY+ Y + + ++K++E RA R+ +K KLPKVN+ LA +++ EED T
Sbjct: 421 PFAYEEYRKDKIRQKIEESRAQRVQLK-KLPKVNKELALKLM-----------EEDTELT 480
Query: 488 KKASKKKKGLSSEIFQDERFANMFKNENFEINELSPEYLALHP----VASTKQPSLVEEH 535
K KKK ++ + D+RF MF+N +++++E S E+ L+P VA ++ SL+EE
Sbjct: 481 NKKKKKKANVAGNLLMDDRFKVMFENPDYQVDERSEEFRLLNPIISNVAERRRKSLLEEE 540
BLAST of CmoCh06G000160 vs. ExPASy Swiss-Prot
Match:
Q66H99 (Nucleolar protein 10 OS=Rattus norvegicus OX=10116 GN=Nol10 PE=2 SV=1)
HSP 1 Score: 194.9 bits (494), Expect = 2.3e-48
Identity = 173/686 (25.22%), Postives = 292/686 (42.57%), Query Frame = 0
Query: 8 LKSTSINGVKLYTVASQQRSLATWINPKKLRAL-RKDKDYTSRVDLIQDLRFDIASSKIK 67
++ +S+N VK+Y++ S +SL W++ +K RAL +KD D R++LIQD + IK
Sbjct: 1 MQVSSLNEVKIYSL-SCGKSLPEWLSDRKKRALQKKDVDVRRRIELIQDFEMPTVCTTIK 60
Query: 68 ATPDGEFLIASGIYPPQVKVYELRELSLKFKRHFDSEIIDFQILDDDYSKLAFMCADRSI 127
+ DG++++A+G Y P+V+ Y+ +LSLKF+R DSE++ F+IL DDYSK+ F+ DR I
Sbjct: 61 VSKDGQYILATGTYKPRVRCYDTYQLSLKFERCLDSEVVTFEILSDDYSKIVFLHNDRYI 120
Query: 128 VLHAKYGKHHTLRIPRMGRNIEYDNWSADLLCAASSPDVYRISLQQGRFLPPLNTESPAI 187
H++ G ++ RIP+ GR+ Y S DL +S +VYR++L+QGR+L PL T++
Sbjct: 121 EFHSQSGFYYKTRIPKFGRDFSYHYPSCDLYFVGASSEVYRLNLEQGRYLNPLQTDAAEN 180
Query: 188 NVVSRSKLHGIVACGGVDGAVECFDTRTKLSSIGRIDAIAPAGDKDQEVTAL----AFDD 247
NV + +HG+ A G ++G VEC+D R + +G +D + D E+ +L A
Sbjct: 181 NVCDINTVHGLFATGTIEGRVECWDPRVR-KRVGVLDCALNSVTADSEINSLPTISALKF 240
Query: 248 IGGFQMAVGSSSG----------------------------------------------- 307
G MAVG+S+G
Sbjct: 241 NGALSMAVGTSTGQVLLYDLRSDKPLLVKDHQYGLPIKSVHFQDSLDLVLSADSRIVKMW 300
Query: 308 ------------------------------------------------------------ 367
Sbjct: 301 NKDSGKIFTSLEPEHDLNDVCLYPSSGMILTANESPKMGIYYIPVLGPAPRWCSFLDNLT 360
Query: 368 ----------------------------------------------------KAKSLADP 427
K K + +P
Sbjct: 361 EELEENPESTVYDDYKFVTKKDLENLGLTHLIGSPFLRAYMHGFFMDIRLYHKVKLMVNP 420
Query: 428 FAYDAYIEQRKKEKLDEERANRITVKRKLPKVNRRLANQILEEEEAGTEKKDEEDVNKTK 487
FAY+ Y + + ++K++E RA R+ +K KLPKVN+ LA +++EEEE K K
Sbjct: 421 FAYEEYRKDKIRQKIEETRAQRVQLK-KLPKVNKELALKLIEEEE-----------EKQK 480
Query: 488 KASKKKKGLSSEIFQDERFANMFKNENFEINELSPEYLALHPVAS------TKQPSLVEE 499
KKK I D+RF MF+N +F+++E S E+ L+P+ S KQ L+E+
Sbjct: 481 STLKKKVKSLPNILTDDRFKVMFENPDFQVDEDSEEFRLLNPLVSRISEKRKKQLRLLEQ 540
BLAST of CmoCh06G000160 vs. ExPASy Swiss-Prot
Match:
Q9BSC4 (Nucleolar protein 10 OS=Homo sapiens OX=9606 GN=NOL10 PE=1 SV=1)
HSP 1 Score: 193.7 bits (491), Expect = 5.2e-48
Identity = 173/689 (25.11%), Postives = 293/689 (42.53%), Query Frame = 0
Query: 8 LKSTSINGVKLYTVASQQRSLATWINPKKLRAL-RKDKDYTSRVDLIQDLRFDIASSKIK 67
++ +S+N VK+Y++ S +SL W++ +K RAL +KD D R++LIQD + IK
Sbjct: 1 MQVSSLNEVKIYSL-SCGKSLPEWLSDRKKRALQKKDVDVRRRIELIQDFEMPTVCTTIK 60
Query: 68 ATPDGEFLIASGIYPPQVKVYELRELSLKFKRHFDSEIIDFQILDDDYSKLAFMCADRSI 127
+ DG++++A+G Y P+V+ Y+ +LSLKF+R DSE++ F+IL DDYSK+ F+ DR I
Sbjct: 61 VSKDGQYILATGTYKPRVRCYDTYQLSLKFERCLDSEVVTFEILSDDYSKIVFLHNDRYI 120
Query: 128 VLHAKYGKHHTLRIPRMGRNIEYDNWSADLLCAASSPDVYRISLQQGRFLPPLNTESPAI 187
H++ G ++ RIP+ GR+ Y S DL +S +VYR++L+QGR+L PL T++
Sbjct: 121 EFHSQSGFYYKTRIPKFGRDFSYHYPSCDLYFVGASSEVYRLNLEQGRYLNPLQTDAAEN 180
Query: 188 NVVSRSKLHGIVACGGVDGAVECFDTRTKLSSIGRIDAIAPAGDKDQEVTAL----AFDD 247
NV + +HG+ A G ++G VEC+D RT+ + +G +D + D E+ +L A
Sbjct: 181 NVCDINSVHGLFATGTIEGRVECWDPRTR-NRVGLLDCALNSVTADSEINSLPTISALKF 240
Query: 248 IGGFQMAVGSSSG----------------------------------------------- 307
G MAVG+++G
Sbjct: 241 NGALTMAVGTTTGQVLLYDLRSDKPLLVKDHQYGLPIKSVHFQDSLDLILSADSRIVKMW 300
Query: 308 ------------------------------------------------------------ 367
Sbjct: 301 NKNSGKIFTSLEPEHDLNDVCLYPNSGMLLTANETPKMGIYYIPVLGPAPRWCSFLDNLT 360
Query: 368 ----------------------------------------------------KAKSLADP 427
K K + +P
Sbjct: 361 EELEENPESTVYDDYKFVTKKDLENLGLTHLIGSPFLRAYMHGFFMDIRLYHKVKLMVNP 420
Query: 428 FAYDAYIEQRKKEKLDEERANRITVKRKLPKVNRRLANQILEEEEAGTEKKDEEDVNKTK 487
FAY+ Y + + ++K++E RA R+ +K KLPKVN+ LA +++EEEE K K
Sbjct: 421 FAYEEYRKDKIRQKIEETRAQRVQLK-KLPKVNKELALKLIEEEE-----------EKQK 480
Query: 488 KASKKKKGLSSEIFQDERFANMFKNENFEINELSPEYLALHPVAS------------TKQ 499
KKK I D+RF MF+N +F+++E S E+ L+P+ S +Q
Sbjct: 481 STWKKKVKSLPNILTDDRFKVMFENPDFQVDEESEEFRLLNPLVSKISEKRKKKLRLLEQ 540
BLAST of CmoCh06G000160 vs. ExPASy TrEMBL
Match:
A0A6J1EZS1 (nucleolar protein 10 OS=Cucurbita moschata OX=3662 GN=LOC111440847 PE=3 SV=1)
HSP 1 Score: 963.4 bits (2489), Expect = 4.0e-277
Identity = 537/701 (76.60%), Postives = 537/701 (76.60%), Query Frame = 0
Query: 1 MAYEGGNLKSTSINGVKLYTVASQQRSLATWINPKKLRALRKDKDYTSRVDLIQDLRFDI 60
MAYEGGNLKSTSINGVKLYTVASQQRSLATWINPKKLRALRKDKDYTSRVDLIQDLRFDI
Sbjct: 1 MAYEGGNLKSTSINGVKLYTVASQQRSLATWINPKKLRALRKDKDYTSRVDLIQDLRFDI 60
Query: 61 ASSKIKATPDGEFLIASGIYPPQVKVYELRELSLKFKRHFDSEIIDFQILDDDYSKLAFM 120
ASSKIKATPDGEFLIASGIYPPQVKVYELRELSLKFKRHFDSEIIDFQILDDDYSKLAFM
Sbjct: 61 ASSKIKATPDGEFLIASGIYPPQVKVYELRELSLKFKRHFDSEIIDFQILDDDYSKLAFM 120
Query: 121 CADRSIVLHAKYGKHHTLRIPRMGRNIEYDNWSADLLCAASSPDVYRISLQQGRFLPPLN 180
CADRSIVLHAKYGKHHTLRIPRMGRNIEYDNWSADLLCAASSPDVYRISLQQGRFLPPLN
Sbjct: 121 CADRSIVLHAKYGKHHTLRIPRMGRNIEYDNWSADLLCAASSPDVYRISLQQGRFLPPLN 180
Query: 181 TESPAINVVSRSKLHGIVACGGVDGAVECFDTRTKLSSIGRIDAIAPAGDKDQEVTALAF 240
TESPAINVVSRSKLHGIVACGGVDGAVECFDTRTKLSSIGRIDAIAPAGDKDQEVTALAF
Sbjct: 181 TESPAINVVSRSKLHGIVACGGVDGAVECFDTRTKLSSIGRIDAIAPAGDKDQEVTALAF 240
Query: 241 DDIGGFQMAVGSSSG--------------------------------------------- 300
DDIGGFQMAVGSSSG
Sbjct: 241 DDIGGFQMAVGSSSGKILIYDLRSSDPIRIKDHMYDSPILNIKWHSTLNSEKPKMITTDK 300
Query: 301 ------------------------------------------------------------ 360
Sbjct: 301 HIVRIWDPDTGEGMTSIEPTLGPINDTCVFKDSGLMLLALNSSQIPSYFLPALGPAPKWC 360
Query: 361 -----------------------------------------------------------K 420
K
Sbjct: 361 SYLENLTEELEEGAETTIYDDFKFLTKEELERLNLTNLIGTNLLRAYLHGFFIDYRLYKK 420
Query: 421 AKSLADPFAYDAYIEQRKKEKLDEERANRITVKRKLPKVNRRLANQILEEEEAGTEKKDE 480
AKSLADPFAYDAYIEQRKKEKLDEERANRITVKRKLPKVNRRLANQILEEEEAGTEKKDE
Sbjct: 421 AKSLADPFAYDAYIEQRKKEKLDEERANRITVKRKLPKVNRRLANQILEEEEAGTEKKDE 480
Query: 481 EDVNKTKKASKKKKGLSSEIFQDERFANMFKNENFEINELSPEYLALHPVASTKQPSLVE 538
EDVNKTKKASKKKKGLSSEIFQDERFANMFKNENFEINELSPEYLALHPVASTKQPSLVE
Sbjct: 481 EDVNKTKKASKKKKGLSSEIFQDERFANMFKNENFEINELSPEYLALHPVASTKQPSLVE 540
BLAST of CmoCh06G000160 vs. ExPASy TrEMBL
Match:
A0A6J1I2M0 (nucleolar protein 10 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111469983 PE=3 SV=1)
HSP 1 Score: 928.7 bits (2399), Expect = 1.1e-266
Identity = 524/703 (74.54%), Postives = 528/703 (75.11%), Query Frame = 0
Query: 1 MAYEGGNLKSTSINGVKLYTVASQQRSLATWINPKKLRALRKDKDYTSRVDLIQDLRFDI 60
MAYEGGNLKSTSINGVKLYTVASQQRSLATWINPKKLRALRKDKDYTSRVDLIQDLRFDI
Sbjct: 1 MAYEGGNLKSTSINGVKLYTVASQQRSLATWINPKKLRALRKDKDYTSRVDLIQDLRFDI 60
Query: 61 ASSKIKATPDGEFLIASGIYPPQVKVYELRELSLKFKRHFDSEIIDFQILDDDYSKLAFM 120
ASSKIKATPDGEFLIASGIYPPQVKVYELRELSLKFKRHFDSEIIDFQILDDDYSKLAFM
Sbjct: 61 ASSKIKATPDGEFLIASGIYPPQVKVYELRELSLKFKRHFDSEIIDFQILDDDYSKLAFM 120
Query: 121 CADRSIVLHAKYGKHHTLRIPRMGRNIEYDNWSADLLCAASSPDVYRISLQQGRFLPPLN 180
CADRSIVLHAKYGKHHTLRIPRMGRNIEYDNWSADLLCAASSPDVYRISLQ+G+FLPPLN
Sbjct: 121 CADRSIVLHAKYGKHHTLRIPRMGRNIEYDNWSADLLCAASSPDVYRISLQRGQFLPPLN 180
Query: 181 TESPAINVVSRSKLHGIVACGGVDGAVECFDTRTKLSSIGRIDAIAPAGDKDQEVTALAF 240
TESPAINVVSRSKLHGIVACGGVDGAVECFDTRTKLSSIGRIDAIAPAGDKDQEVTALAF
Sbjct: 181 TESPAINVVSRSKLHGIVACGGVDGAVECFDTRTKLSSIGRIDAIAPAGDKDQEVTALAF 240
Query: 241 DDIGGFQMAVGSSSG--------------------------------------------- 300
DDIGGFQMAVGSSSG
Sbjct: 241 DDIGGFQMAVGSSSGKILIYDLRSSDPIRIKDHMYDSPILNIKWHSTLNSEKPKMITTDK 300
Query: 301 ------------------------------------------------------------ 360
Sbjct: 301 HIVRIWDPDTGEGMTSIEPTLGPINDTCVFKDSGLMLLALNSSQIPSYFLPALGPAPKWC 360
Query: 361 -----------------------------------------------------------K 420
K
Sbjct: 361 SYLENLTEELEEGAETTIYDDFKFLTKEELERLNLTNLIGTNLLRAYLHGFFIDYRLYKK 420
Query: 421 AKSLADPFAYDAYIEQRKKEKLDEERANRITVKRKLPKVNRRLANQILEEEEAGTEKKDE 480
AKSLADPFAYDAYIEQRKKEKLDEERANRITVKRKLPKVNRRLANQILEEEEA TEKKDE
Sbjct: 421 AKSLADPFAYDAYIEQRKKEKLDEERANRITVKRKLPKVNRRLANQILEEEEAETEKKDE 480
Query: 481 EDVNKTKKASKKKKGLSSEIFQDERFANMFKNENFEINELSPEYLALHPVASTKQPSLVE 538
EDVNK KKASKKKKGLSSEIFQDERFANMFKNENFEINELSPEYLALHPVASTKQPSLVE
Sbjct: 481 EDVNKAKKASKKKKGLSSEIFQDERFANMFKNENFEINELSPEYLALHPVASTKQPSLVE 540
BLAST of CmoCh06G000160 vs. ExPASy TrEMBL
Match:
A0A0A0K1Z0 (NUC153 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_7G058640 PE=3 SV=1)
HSP 1 Score: 852.8 bits (2202), Expect = 7.6e-244
Identity = 487/715 (68.11%), Postives = 515/715 (72.03%), Query Frame = 0
Query: 1 MAYEGGNLKSTSINGVKLYTVASQQRSLATWINPKKLRALRKDKDYTSRVDLIQDLRFDI 60
MAYEGGNLKSTSINGVKLYTVAS+QRS+A+W+NPKKLRALRKDKDYTSRVDL+QDLRFDI
Sbjct: 1 MAYEGGNLKSTSINGVKLYTVASEQRSVASWLNPKKLRALRKDKDYTSRVDLVQDLRFDI 60
Query: 61 ASSKIKATPDGEFLIASGIYPPQVKVYELRELSLKFKRHFDSEIIDFQILDDDYSKLAFM 120
A+SKIKATPDGEFLIASGIYPPQVKVYELRELSLKFKRHFDSEIIDFQILDDDYSKLAFM
Sbjct: 61 ATSKIKATPDGEFLIASGIYPPQVKVYELRELSLKFKRHFDSEIIDFQILDDDYSKLAFM 120
Query: 121 CADRSIVLHAKYGKHHTLRIPRMGRNIEYDNWSADLLCAASSPDVYRISLQQGRFLPPLN 180
CADRSIVLHAKYGKHH+LRIPRMGRNIE+D WSADLLCAASSPD+YRISLQQGRFLPPLN
Sbjct: 121 CADRSIVLHAKYGKHHSLRIPRMGRNIEFDYWSADLLCAASSPDLYRISLQQGRFLPPLN 180
Query: 181 TESPAINVVSRSKLHGIVACGGVDGAVECFDTRTKLSSIGRIDAIAPAGDKDQEVTALAF 240
TESPAINVVSRSKLHGI+ACGGVDGAVECFDTRTKLSSIGRIDA+APAGDKDQEVTALAF
Sbjct: 181 TESPAINVVSRSKLHGIIACGGVDGAVECFDTRTKLSSIGRIDAVAPAGDKDQEVTALAF 240
Query: 241 DDIGGFQMAVGSSSG--------------------------------------------- 300
DDIGGFQMAVGSSSG
Sbjct: 241 DDIGGFQMAVGSSSGKVLIYDLRSSDPIRIKDHMYDSPILDIKWHSTLNSEKPKMITTDK 300
Query: 301 ------------------------------------------------------------ 360
Sbjct: 301 HVVRIWDPDTGEGMTSIEPTLGPINDTCVFKDSGLMLLALNSSQIPSYFLPALGPAPKWC 360
Query: 361 -----------------------------------------------------------K 420
K
Sbjct: 361 SYLENLTEELEENAQPTIYDDFKFVTKEELGRLNLTNLIGTNLLRAYLHGFFIDYRLYKK 420
Query: 421 AKSLADPFAYDAYIEQRKKEKLDEERANRITVKRKLPKVNRRLANQILEEEEAGTEKKDE 480
AK+L DPFAYDAYIEQRKKEKLDEERANRITVKRKLPKVNRRLANQILEEEEA TEKK E
Sbjct: 421 AKALVDPFAYDAYIEQRKKEKLDEERANRITVKRKLPKVNRRLANQILEEEEAETEKK-E 480
Query: 481 EDVNKTKKASKKKKGLSSEIFQDERFANMFKNENFEINELSPEYLALHPVASTKQPSLVE 538
EDVNKTKKASKKKK LSSEIFQDERF NMFKNENFEI+ELS EYLALHP+ASTKQPSL+E
Sbjct: 481 EDVNKTKKASKKKKALSSEIFQDERFTNMFKNENFEIDELSQEYLALHPMASTKQPSLME 540
BLAST of CmoCh06G000160 vs. ExPASy TrEMBL
Match:
A0A1S3C010 (nucleolar protein 10 OS=Cucumis melo OX=3656 GN=LOC103495447 PE=3 SV=1)
HSP 1 Score: 851.3 bits (2198), Expect = 2.2e-243
Identity = 484/709 (68.27%), Postives = 514/709 (72.50%), Query Frame = 0
Query: 1 MAYEGGNLKSTSINGVKLYTVASQQRSLATWINPKKLRALRKDKDYTSRVDLIQDLRFDI 60
MAYEGGNLKSTSINGVKLYTVASQQRS+ATW+NPKKLRALRKDKDYTSRVDL+QDLRFDI
Sbjct: 1 MAYEGGNLKSTSINGVKLYTVASQQRSVATWLNPKKLRALRKDKDYTSRVDLVQDLRFDI 60
Query: 61 ASSKIKATPDGEFLIASGIYPPQVKVYELRELSLKFKRHFDSEIIDFQILDDDYSKLAFM 120
A+SKIKATPDGEFLIASGIYPPQVKVYELRELSLKFKRHFDSEIIDFQILDDDYSKLAFM
Sbjct: 61 ATSKIKATPDGEFLIASGIYPPQVKVYELRELSLKFKRHFDSEIIDFQILDDDYSKLAFM 120
Query: 121 CADRSIVLHAKYGKHHTLRIPRMGRNIEYDNWSADLLCAASSPDVYRISLQQGRFLPPLN 180
CADRSIVLHAKYGKHH+LRIPR+GRNIE+D WSADLLCAASSPD+YRISLQQGRFLPPLN
Sbjct: 121 CADRSIVLHAKYGKHHSLRIPRVGRNIEFDYWSADLLCAASSPDLYRISLQQGRFLPPLN 180
Query: 181 TESPAINVVSRSKLHGIVACGGVDGAVECFDTRTKLSSIGRIDAIAPAGDKDQEVTALAF 240
TESPAINVVSRSKLHGIVACGG+DGAVECFDTRTKLSSIGR+DAIAPAGDKDQEVTALAF
Sbjct: 181 TESPAINVVSRSKLHGIVACGGIDGAVECFDTRTKLSSIGRLDAIAPAGDKDQEVTALAF 240
Query: 241 DDIGGFQMAVGSSSG--------------------------------------------- 300
DDI GFQMAVGSSSG
Sbjct: 241 DDISGFQMAVGSSSGKVLIYDLRSSDPIRIKDHLYDSPILDIKWHSTLNSEKPKMITTDK 300
Query: 301 ------------------------------------------------------------ 360
Sbjct: 301 HIVRIWDPDTGEGMTSIEPTLGPINDTCVFKDSGLMLLALNSSQIPSYFLPALGPAPKWC 360
Query: 361 -----------------------------------------------------------K 420
K
Sbjct: 361 SYLENLTEELEENAQPTIYDDFKFVTKEELGRLNLTNLIGTNLLRAYLHGFFIDYRLYKK 420
Query: 421 AKSLADPFAYDAYIEQRKKEKLDEERANRITVKRKLPKVNRRLANQILEEEEAGTEKKDE 480
AK+L DPFAYDAYIEQRKKEKLDEERANRITVKRKLPKVNRRLANQILEEEEA TEKK E
Sbjct: 421 AKALVDPFAYDAYIEQRKKEKLDEERANRITVKRKLPKVNRRLANQILEEEEAETEKK-E 480
Query: 481 EDVNKTKKASKKKKGLSSEIFQDERFANMFKNENFEINELSPEYLALHPVASTKQPSLVE 538
ED+NKTKKASKKKKGL+SEIFQDERFANMFKNENFEI+ELS EYLALHP+ASTKQPSLVE
Sbjct: 481 EDINKTKKASKKKKGLNSEIFQDERFANMFKNENFEIDELSQEYLALHPMASTKQPSLVE 540
BLAST of CmoCh06G000160 vs. ExPASy TrEMBL
Match:
A0A6J1DWS5 (nucleolar protein 10 OS=Momordica charantia OX=3673 GN=LOC111025246 PE=3 SV=1)
HSP 1 Score: 820.1 bits (2117), Expect = 5.5e-234
Identity = 472/709 (66.57%), Postives = 501/709 (70.66%), Query Frame = 0
Query: 1 MAYEGGNLKSTSINGVKLYTVASQQRSLATWINPKKLRALRKDKDYTSRVDLIQDLRFDI 60
MAYEGGNLKSTSINGVKLYTVASQQRSLATW++PKK+RALRKDKDYTSRVDL+QDLRF+I
Sbjct: 1 MAYEGGNLKSTSINGVKLYTVASQQRSLATWLDPKKMRALRKDKDYTSRVDLVQDLRFEI 60
Query: 61 ASSKIKATPDGEFLIASGIYPPQVKVYELRELSLKFKRHFDSEIIDFQILDDDYSKLAFM 120
A+SKIKATPDGEFLIASGIYPPQVKVYELRELSLKFKRHFDSEIIDFQILDDDYSKLAFM
Sbjct: 61 ATSKIKATPDGEFLIASGIYPPQVKVYELRELSLKFKRHFDSEIIDFQILDDDYSKLAFM 120
Query: 121 CADRSIVLHAKYGKHHTLRIPRMGRNIEYDNWSADLLCAASSPDVYRISLQQGRFLPPLN 180
CADRSIVLHAKYGKH++LRIPR+GR+IE+D WSADLLCAASSPDVYRI+LQQGRFL PLN
Sbjct: 121 CADRSIVLHAKYGKHYSLRIPRVGRDIEFDYWSADLLCAASSPDVYRINLQQGRFLSPLN 180
Query: 181 TESPAINVVSRSKLHGIVACGGVDGAVECFDTRTKLSSIGRIDAIAPAGDKDQEVTALAF 240
TES AINVVSRSK+HGIVACGG DGAVECFDTRTKLSSIGRIDA+APAGDK QEVTALAF
Sbjct: 181 TESSAINVVSRSKIHGIVACGGEDGAVECFDTRTKLSSIGRIDAVAPAGDKAQEVTALAF 240
Query: 241 DDIGGFQMAVGSSSG--------------------------------------------- 300
DDIGGFQMAVGSSSG
Sbjct: 241 DDIGGFQMAVGSSSGKVLIYDLRSSVPIRIKDHMYDSPILDIKWHSTLNSEKPKMITTDK 300
Query: 301 ------------------------------------------------------------ 360
Sbjct: 301 HIVRIWDPDTGEGMTSIEPTLGPINDTCVFKDSGLMLLALNSSQIPSYFLPALGPAPKWC 360
Query: 361 -----------------------------------------------------------K 420
K
Sbjct: 361 SYLENLTEELEEGAQTTIYDDFKFLTKEELERLNLTNLIGTNLLRAYLHGFFIDYRLYKK 420
Query: 421 AKSLADPFAYDAYIEQRKKEKLDEERANRITVKRKLPKVNRRLANQILEEEEAGTEKKDE 480
AKSL DPFAYDAYIEQRKKEKLDEERANRIT+KRKLPKVNRRLANQILE+EEA EKKDE
Sbjct: 421 AKSLVDPFAYDAYIEQRKKEKLDEERANRITMKRKLPKVNRRLANQILEDEEAEKEKKDE 480
Query: 481 -------EDVNKTKKASKKKKGLSSEIFQDERFANMFKNENFEINELSPEYLALHPVAST 538
+DVNKTKKASKKKKG SSEIFQDERF+NMFKNENFEI+E S EYLALHPV+ST
Sbjct: 481 DTEKEKKDDVNKTKKASKKKKGFSSEIFQDERFSNMFKNENFEIDEFSQEYLALHPVSST 540
BLAST of CmoCh06G000160 vs. TAIR 10
Match:
AT3G56990.1 (embryo sac development arrest 7 )
HSP 1 Score: 498.0 bits (1281), Expect = 9.2e-141
Identity = 324/709 (45.70%), Postives = 402/709 (56.70%), Query Frame = 0
Query: 1 MAYEGGNLKSTSINGVKLYTVASQQRSLATWINPKKLRALRKDKDYTSRVDLIQDLRFDI 60
M G LKSTSINGVKLY V+S ++ TW+NPKK RALRK+ Y RV+LIQ+L+F+
Sbjct: 1 MTSYGDRLKSTSINGVKLYNVSSAP-NVPTWLNPKKQRALRKNPHYMQRVELIQELKFET 60
Query: 61 ASSKIKATPDGEFLIASGIYPPQVKVYELRELSLKFKRHFDSEIIDFQILDDDYSKLAFM 120
A+++IKATPDGE+LIASGIYPPQVKVYEL +L+LKF+RH DSEI+DF+ILDDD+SKLAF+
Sbjct: 61 ATTRIKATPDGEYLIASGIYPPQVKVYELNQLALKFERHLDSEIVDFEILDDDFSKLAFL 120
Query: 121 CADRSIVLHAKYGKHHTLRIPRMGRNIEYDNWSADLLCAASSPDVYRISLQQGRFLPPLN 180
CADRSI LHAKYGKHHTLRIPRMGR++ YD+WS DLLCAASSPD+YRI+L+QGRFL PL+
Sbjct: 121 CADRSINLHAKYGKHHTLRIPRMGRDMTYDSWSCDLLCAASSPDLYRINLEQGRFLSPLS 180
Query: 181 TESPAINVVSRSKLHGIVACGGVDGAVECFDTRTKLSSIGRIDAIAPAGDKDQEVTALAF 240
T+SPA+NVVSRS LHG+VACGG DGAVE FD R K SS RI+A+ GD EVTA+ F
Sbjct: 181 TQSPALNVVSRSNLHGLVACGGEDGAVEFFDMRMK-SSAARINAVTHGGDAAAEVTAIEF 240
Query: 241 DDIGGFQMAVGSSSG--------------------------------------------- 300
DD G Q+AVGSS+G
Sbjct: 241 DDSEGLQVAVGSSAGKVFIYDLRTSTPIRVKDHMYESPILNIKWQRTLNTQQPKLITTDK 300
Query: 301 ------------------------------------------------------------ 360
Sbjct: 301 HIVRIWDPNTGEGMTSIEPTQGGINDICVFRGSGLMLLALDSSLIPSYFIPELGPAPKWC 360
Query: 361 -----------------------------------------------------------K 420
K
Sbjct: 361 SPLENLTEELEESAQTTIYDNYKFLAMEDLEKLQLTHLIGTDLLKASMHGYFINYHLYKK 420
Query: 421 AKSLADPFAYDAYIEQRKKEKLDEERANRITVKRKLPKVNRRLANQILEEEEAGTEKKDE 480
A ++ +PFA+D Y+E+RK+EKL+E+R RIT KR+LPKVNR LA + L +E+ E K
Sbjct: 421 ALAVIEPFAFDDYLERRKQEKLEEQRTQRITKKRRLPKVNRDLAAR-LHGDESEEENKTA 480
Query: 481 EDVNKTKKASKKKKG-LSSEIFQDERFANMFKNENFEINELSPEYLALHPVAST-KQPSL 537
ED TKK KKKK L+ E F D RF +MF+N +F+I++ S EY LHPVAS+ KQPSL
Sbjct: 481 EDGEATKKVLKKKKPILTDEHFVDGRFGSMFQNPDFQIDKDSYEYGVLHPVASSKKQPSL 540
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Q7T0Q5 | 1.0e-51 | 26.09 | Nucleolar protein 10 OS=Xenopus laevis OX=8355 GN=nol10 PE=2 SV=1 | [more] |
Q6NVM6 | 1.2e-49 | 25.07 | Nucleolar protein 10 OS=Xenopus tropicalis OX=8364 GN=nol10 PE=2 SV=1 | [more] |
Q802W4 | 8.0e-49 | 24.93 | Nucleolar protein 10 OS=Danio rerio OX=7955 GN=nol10 PE=2 SV=1 | [more] |
Q66H99 | 2.3e-48 | 25.22 | Nucleolar protein 10 OS=Rattus norvegicus OX=10116 GN=Nol10 PE=2 SV=1 | [more] |
Q9BSC4 | 5.2e-48 | 25.11 | Nucleolar protein 10 OS=Homo sapiens OX=9606 GN=NOL10 PE=1 SV=1 | [more] |
Match Name | E-value | Identity | Description | |
A0A6J1EZS1 | 4.0e-277 | 76.60 | nucleolar protein 10 OS=Cucurbita moschata OX=3662 GN=LOC111440847 PE=3 SV=1 | [more] |
A0A6J1I2M0 | 1.1e-266 | 74.54 | nucleolar protein 10 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111469983 PE=3... | [more] |
A0A0A0K1Z0 | 7.6e-244 | 68.11 | NUC153 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_7G058640 PE=3... | [more] |
A0A1S3C010 | 2.2e-243 | 68.27 | nucleolar protein 10 OS=Cucumis melo OX=3656 GN=LOC103495447 PE=3 SV=1 | [more] |
A0A6J1DWS5 | 5.5e-234 | 66.57 | nucleolar protein 10 OS=Momordica charantia OX=3673 GN=LOC111025246 PE=3 SV=1 | [more] |
Match Name | E-value | Identity | Description | |
AT3G56990.1 | 9.2e-141 | 45.70 | embryo sac development arrest 7 | [more] |