Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exon, five_prime_UTR, polypeptide, CDS, three_prime_UTR
AGCCGTCTTTATTTCCATAGAAAGTCATGGCGCCTCTTTTGCTTGTTTTCACCTTCCAACAAAAAACCCACTGGGCATAAATAATAACACCTACTAGGGCATATGAGCAGGGTTTATCGAACAGTTCTGCGTCTCTAAAACCACCCTCCATTTTCCCGCGCTCCACCTTCGTTGCCCCATCTTCCCTCTTCCCCTTTCGGCGGCACCTTCATTTTGCCACTCACAGTGCGCCGCCATGTTGTCCTCCAAGCAGAGCCTCCGCATTATCGAGTCCGCGCTTCTGGGTCCGTCCCCTCCGTCCCCATCGCAGAGGGTCGAGCTCTTACACGCCATTCACAATTCTATGCCTGCTTTTCGTTCGCTTCTTCAATTCCCAGTACGTATGCCTTTGTATTTCGTCGTTTGCTGATCTGGGTCTGTGACATTTTGTGATTTAGTACCTTGTATTTGTCTGTTGGTGTCCTGATTTTGTTGAATTTTGGGTTGTAGTCGCCGAAAGCTTCAGACCGTGCACAAGTTCAATCTAAAGAAGTCAGGCGGCCGGATTCTTCAACGATAACGCTCGATGATCAAGACGTCGTAATTGTAAATTCTTTGTCCATTTGTTTTCTTTTCCGGGATGTCGATCAGAATTTGAGTATGTGACGTGGGTTTTCATGGAATGTTGATTTGGTTGCTTATTTTTTGCACGACGACAGCTTAAGATACTAGTTTAAGTAGTTAACCTTCTTCCTCTGTTTCTTCTGCTTCGGTTTGGTGGGAGAATCAAGAAGTAGTCTTTGACCCTGTTCATCCTTAGATTGGTGTATCATTGTGGCTATATTTTTATGTATTGCTTTATTCGTCCAAGTTATGAATGATATGAACCATAAATTCTTTCCCGTTTTTCGTTATGGCACGACATGACTGTTATATTGGCGTTATTTGATTTCACTTCCACGCTTGTTTGTAGACTTTAAAGTTGAGTGATGACCTCCATCTGAATGAAATAGAATGCGTGCATCTACTTGTTGCTGCACATCAAGAGGTATACCTCTCTCCCTTATTTGTTTACTTGTTATCATATTTTACATCAGCTTTATTTGTCTCCCTTTCTATAACTTGTTTTTCTTGTTCTTTGCAGTGGGCTTTAATGGGACGAGATCCTTCAGAGATTTTCCACCTTGCAGCAGGGCTTTGGTATACAGAGAGAAGAGATCTAATAATGTCACTCTATACACTACTAAGGGTATTAATATTATCATTTTGTTTGTATCCTAATTTTTGGACATGCAATTGTGTTTTTTCTTGCATACTGTTAATTATATTCTTTTCTTTGCAGGCTGTTGTACTTGATCAGGGGCTTGAAGCTGGTCTTGTATCTGACATCCAAAGACACCTGGAAGATCTTATTAATAGTGGCCTACGACAACGATTGATTTCTTTGATAAAGGTTGGATCTTTGTAGTCCCTATTTTAGGTCATGCCATATGTGTAAACCTTCTGCTGAAGGTTGGAATCATGTTACTGTTACTACTATGTTATAACAGTAGCTTAGTATTTAAATTTATTGAATTTGCCTGTATTTTGTGATCTTTTAACACTTGACCGTATGAATATTTTGTTTCGTATTAATCTTGTGAGATACAGAAACACATGGTTGTTAGCCCAAACATTACAACTGAAAATAAGAAATCGATATGCTTCAAGTGTCTAGGATTAATTGTGAACGCATAATGGCTGACCATTAGGAACAAGTATTTCCATGACTATTTTCAGTAAAAGGTTCCATCTTTCTTGTTTTAGGAACTCAATCGAGAAGAACCAGCTGGTTTGGGTGGACCTAGCTGTGAGCGCTACATTCTTGATTCCAGAGGTGCTCTTGTTGAGCGTCGGGCTGTGGTTTGTCGGGAAAGACTTATACTAGGACATTGTCTTGTCCTTTCTATTCTTGTTGTTCGAATAGGTAAGCCGTGGAGGACTTTTTTTAATTGATCAATGCCATTGGTAGTAGGTTGAAG
GTTTCCTATGTGTGCTGCTACGATTCACTGTCTTGCTTCATGTGGTGCTTGCTAGTAATATTTGATAGTTCAAGATGTATTTTTTTCCTCCATCGAGGTAGTCATTGTTATTGGATACTTGCTGACGTTAGACGTAGAGCTTTAGTTTAGATGATTTGATAGTGCATTTTATATTTTTCTAGTGAATGAAACTTAGTTAAAAATGTGTGAAATGATAACTTCTCCTTGGTAATTAATCTTTAATGATAAGAAACATAAATTTATTATTGGAAAACATTTGAAAGTTTTCAATTGTGAGAAACACAGGTTAATTAATGGAAAATATACTGCAAGTAATATGGCAAGAAACTCCAGTTAATAATCTGACAGCTCCATGATTTCTCTTTTATCAGGTCCAAAGGATGTGAGGGATCTCTTTTCTGTTTTAAAAGACTGCGCTGCTGAGCTTAATGAGACCAAAACTCCCATAAAGCTTCAGGCATGTTATTAATATTTTGATTACTGTAGAATGTAAAGAAAAGACCCACTTGTTTCCTGTCTCTCAGAAGTAATGTGGGTGCACACCAGTGTTGTATGTGTATATTCTTCTAGAGACATTTTCTTGTTCTGTCCTTCATTACTTTTCTTGCTAATTCTGAATGTATTTTCTTTTGCAGATTGTATTTAGTCTCCTTTTCTCGATCATTATTGCCTTTGTATCGGATGCTTTGAGCGCTGTTCCAAATAAAGCATCTATATTGTCCAGTGATGCTTCTTTTAGAAATGAATTTCAGGATACCGTGAGTTTATCAAGATAGTACTGAAACTTCTGTGGTTAGGTTTTTGATACACGCACACATTTTGTTAACATTTTAAACGAGCTTGTAATCTTTGTATAGGTGATGGTGTCTGGAAATAATCCCACAGTGGAAGGATTTGTTGATGCTGTTAGGTTTGCTTGGACTGTTCATTTGCTGTTAATACACGACATGGTTGATGCAAGAGAAGCTGTTCCCAGTGCTTCCCCAAAAGATTTAGATCACCTACAATCGTGCTTGGAAGTTATTTTCTCCCATAATGCTTTCCAGTTCATGCTTCAAGAAGTTATTCAGACTGCGGCTTACCAGGTTTGTATCTTTCAAGAAGTTATGCATCTTATAGGCAGCAGAACATTTTCTTTTGGAATTCTTAAAGTATATTTGCCCCGACATTCTTATCTTGTTGCAAGTACTTTGTTTTGTCTTCTTTTTCCCCTTGATGTTTCACCAATGCTTGTGGGAGATCCCTCAACTCTTTTTAATTGAAAACAGTTTGTTTGCCTATGGACGTGAAGTTATTAGGTGCTACTACTTGTTACGTATATTTATTCCTAATGGGTGTCGGTGGTAAGGGCTAGAGACAATTAGAAATATCCTATAAGGTCTGGGGAGTTCTAGCTCTGTATTTGATAACCTTAGAATAAGTGAGTAATTTACAATCCTTGTTGTCTCTTTTGAAAAAACTAAGTCGTTTATATTTTTGACCGTGCCTTCTTTAGTAGTCATTTGTACAAGTTAGATTATTTTGTCTTCTAATATCTTCTATTGGTTATGTTGGCAGAATGATGATGAGGACATGACCTATATGTACAATGCTTATCTGCACAAACTGGTTTCTTGTTTTTTATCTCATCCTCTGGCTAGGGACAAGGTCAGTCGGGAGATTTCCTCCTAAAATTTATAGCTTTCTTGTTTAATTTTGGCCATTGGCTCACTCCTTTGTTAAGTGCTTCATATATATATATATATATATATATATATAAATATCAGACCTGTACAGGTTAGAATTAAAACAATTTATTATCTAGCAAGTGAATGAGCTAGTGTATGGGTTTCTAGCCTTGCCTGCTAGTGTATGTGATATGTGGAAATCACTACAAGGGGAGTAAGATTCGGATTACCTTGTCGATCGAATATCTCAAGGCAAGAATATTGTTTGAGATTTGAATCACTCCACAAGCAAGATCGATCATGTCTAGCCTG
AATGATTCTTGTTGATCAAATATCTCAAGGCAAGAACACTTGATTGAGATTCGAATCACTCTACAAGCAAGATTGATCATGTCGAGCTTGAATGATTCTACATGCAATCTAAACTACGTAGAATTGCAAAGAAACTTAGCCATTGGCTAAAGAAGAGCACAAATGCTTCTTTTACTATATTTTCCAAGTCTACTTACAAATACAACATACATGGCTTCATATTAGTATGGGTTTCTAGCCTTGCCTCATTGCAGTTGGATCTAACTGAAAGTATGTATTAATAAATTTGATTGGTTTTTTTATGGTATGAAGGTGAAAGAGTCAAAGGACCGAGCTATGAACACGCTAAGTCAGTTTCGTGCTACCGGGTCACAAGATTTCATGCATGATGGTGATTCAAGTTCCCATCAAGCTAGCGAAACTCTTCCTTCACCTTTTGTCTCCCTCTTGAACTTTGTTAGTGAAATTTACCGGGTAAGCTTTTATTGAAGTACTTGAAAGTTTTGAGATGCTGATAAGATAAGAGTTAAATAGGTGACAAGCTGGTGGTTTGATATTTTTAGAAAGAAAATATTTCATGTGTAGTCATGGAATTAACTCCTTAGCAACTATTAGGTCCATGCTACTATAGATCCAACTCAAAATAACTTGCTATAGCAAGAAATTTAGAATGGTGCTAAATGTTCTTCGTGAAAGTAGCTAATAAATCACATATGCTGCATATGAATTAGCTATGATTATTGAGCTTGAATTTCTTTCAATTTATTCAGAAAGAACCAGAACTGTTATCAAGCAATGATGTCTTGTGGACATTTGCAAATTTCGCTGGGGAGGATCATACAAATTTTCAAACTTTGGTGGCATTCTTGAACATGCTGAGTACCTTAGTATGTGATGCACGCTCTTTTAAGACTTTTTAATTATTAAGCTGCTCACCAATTCTTTTCACGAGAATGTACTTTACTATTAATTATTGTCTTTTATCAATCATTTCAGGCTTGTAATGAAGAGGGTGCTTCAAGGGTTTTTGAGTTACTTCAGGGAAAAGCATTCCGATCTGTTGGATGGACCACCTTATTTGATTGCCTATCAATTTATGATGAAAAATTTAGACAGTCCCTTCAGACTGCTGGGGCCCTGTTACCAGAGTTTCAGGAAGGGGATGCAAAAGCGCTTGTTGCGTATTTGAATGTTCTTCAAAAGGTAATACCTTTTTCTTACTATATTGGTCTTAAACTTTTATTCAGTTACCCACACTTTCATGAGATTGTAAAGTTTTTCTTAATCTATGACGATTTTTTGTGGAGCTTCATCTGCATTCTACTTGCTAGACTAGCTGTCTGTCATTAGAATGGTTATCTTGCATAATTATCTATCATTAGTATGGTTGTCTAGCATATTATCTATCATTAGTATGGTTGTCTAGCAAGTAGCACGCAACAACAGGTCCACAAAAGTAATAGTCACGGAGAAGAAGTCTGAATTGAAAGGGATTAAGACCTTAATATTATTAGTAAAAACAAAATGAGCAGGCAAATAGTTCTATAGCAATAATAACTTTTCATGGTAAATCCTAGTTGAAATCTATATGTCAATTTTTGAGTACAAATTGCGCAAATCACCCCCCAAAGTATTTTAGTTGTTACAATTGAAGGCTCCAATCCAAAAGAACTACACCTAAAGAATGATTCAAATAGATTATGGAATAAATTTCGTTGTTTCTTACTGATCATCATTGTTGGTTGGTTTCTGATGAGATACAGGTTGTGGAGAATGGGAATCCTGTTGAAAGAAAGAACTGGTTTCCTGATATTGAACCACTGTTCAAACTTCTTAGCTATGAAAATGTTCCTCCTTATCTGAAGGTTATTGCACTTTGGTTTGTTGTGTTTTGAGTAAAAAATATGTTTGCACAAGATTTCTTATATTTTTTGGAATTTAGATGCTTCAATGAGAATTTCGTTGATGATTTTGATCAAGTTTTTGATATCATAGGGTGC
TTTGAGGAACGCAATCGCATCCTTCATTCAAGTCTCTTCTGACTTGAAGGATATCATTTGGTGCTATCTTGAGCAGTATGATTTACCTGTTCTTGTTGCATCCCATATTCAGAATGGCACAAAGTCGATTACTTCTCAGGTTATTCATCTCTGATGACCCTTTTAGATATGTAAAATATTCTATCGTTTTTAGTTTTGAAATTGTTTTTTTTTTGAGCTGCAACTAGGAGATTTTTGTTGCTAGCGTGGATATTTAGCATGCTTATATTCACTTAATGTAAATTGAAAGTGTGGCACTGGCATGATAGTGATTTCAATATGCTAATGAGTTGTCCTGTGTCGAATAACCGACAAAATAAAGATGCACTGTTTTGCATTTGATGAAAAAAAAAAACGGATATTGGCTTATATAAGCACCGTGTTTGTTGGCATTTTCAAACATGATAATTTTAGTGATGGAACCTTGTTGCACTGAGATTACACGTGAAGAAAAGATGTTGGTTTATGAAAGCACCGTGTCTTTGCAGGTTTATGATATGCAGTTCGAGGTAAATGAAATTGAAGCAAGACAGGAGAGATATCCATCAACCATATCTTTCCTTAACCTGTTAAATGCTCTTATTGCTAAAGAAAGTGATCTAAGTGATAGAGGGCGTAGGTAATGTTTATTGGTTATAATATACACTTGACTCGGTTTTATTATTTATTCTAAGGTTTGAAAGTTTTGCTGCAGATTTGTTGGAATATTTAGGTTCATTTATGATCATGTTTTTGGGCCATTCCCTCAACGAGCCTATGCGGATGCTGCTGAGAAATGGCAGTTAGTTGTTGCTTGCCTACAACATTTTAATATGTGAGTGATAGTGTGGGTTTAAACTTCAAAGTTGAAGATTTTGCTTGTGTTTATGGGCTTTCCTTGTTGTCCTCCCTGCTTCTTGTCTCTCTTTGGAGATTGGTGTCCAAAGATTTTTGTATGATCCTATGTATTCTTTCATTTTTCTCGATGGTGATGAAGGCTTGGATTTTCTTCTTGAAAAACCTTACAATTATGGGGTTAATCAAATTAAAAATGCACATTTGATTGCTCAATGTGCAGGATTTTGAAAATGTATGACATTAATGAAGAGGACGTTGACGTTGTTATTGATCGATCACAATCATCAATGGATACACAATCATCTTCACTTCAAACTCAGCTACCAGTACTTGAGTTGCTAAAAGTATGTATCTATTACTATTACTTCATTTGATCCATATGTCAAAGAGTTCCTTTTGATTGTTTTCTATTGATAGAAGTGTGTTTTTATTCATACACAATGAGTCATGGCATGATGATATTTATTACATATTTCACCAACAATTTCTGTTGTGGAGTCAATGAATTTAGAGGAACAAATTATGAGAAGGGTTCATACACAATGACTCATTGGTTTAAAATTTCAAAGAGTTTCTTGGTAGAAGCCTTTTTTGTCTATGTTTGAAATAGGTTTCTAAAAAGCATTGCGGGACTTTTATCGTTTTTCTCGTCAAAGTCTTTGATTGTTTAAGTCAGCATTTCGTTGGCTCGATGGTACCACATGATATCTTTCAGTTCCTGGATATTATGTTTGCTTGTATTCCAATTTCTGCGGAAATGTCTTGAAGTTTTCTTTTTGTTGGTTAGGATTTCATGAGCGGAAAATCTGTTTTTAGAAACATCATGGGGATTCTTCTGCCCGGTGTCAATTCTCTCATAACCGAAAGGACCAGCCAAATTTATGGCCAACTTTTGGAGAAGTCTGTGGAGCTTTCTCTTGAAATAATGATACTTGTATTAGAGAAGGATTTACTTCTGGCTGATTACTGGCGCCCTCTATATCAGGTAATTGTCTCATCTGTTGCTGTTGTTAACCTCCGTTTATTTACGCGTATGAATTAGCGAATTGGTGGATAAGACCACAAATAATTTTTCATTGAAGACTAAAATATAGAAATAGAAAGTTATGGAAAAAATGATATAG
GCCAAGTTGTCTAGTTTCATTTCGTTGTGTATGTGGCTCATCCGGTTCCGTTAACTATGACTTCTAAATGGCTCCGGAACCAAGTATTTAATAAATGACAATTAATTTTGTTTTTACCGCATTACGCTTCTGTGTTTATATGCATTGGAAAACCTGAACCTGTAGTTTGATCATGTACATATCTTCAGATGTATTTCTGGTACTCTTTATGTCCACTGAGATGATAAATTGCTTCTGGATTTGACCTAATTCTTGCAGCCCTTGGAAGTTATCCTCTCTCAAGATCACAGTCAAATTGTTGCGCTGTTGGAGTATGTCAGATATGATTTTCATCCCAAGATTCAGCAGTTATCTATCAAAATCATGAGTATCTTAAGGCACTTCCATCTTCTTTTTCTCTATTTCCATTTTGTGACATTCAATATGAAAACATAGAATAATTTGTGATTTTCTAAAGATGTTATTTGTTGGTTGTAGTTCTCGTATGGTTGGGCTCGTGCAATTACTACTAAAATCTAATACTGCCAGCTCTTTAGTTGAGGATTATGCGTCCTGCCTCGAGTTAAGATCTGAAGAGTGTCATGTAATTGAAAACAGTGGAGATGATCCTGGTGTTCTTATAATGCAGGTTTGTTTTTCCATTTTTGTAATCTCAAGTTTGATTTCTTATTCTTTGATACCCGAAACCTTCGGAGTCTGGTTCTTTCTGCACCTTGCTCCTTCAATGCTAACTAATTAACGGATCACACATCATCATTTATATGCCCATTTCTGCAGCTTCTTATTGACAACATTAGCCGACCTGCTCCAAATGTTACACACTTGCTACTTAAATTTAACCTTGAGACATCCATCGAGCGGACAATTTTACAGCCAAAATTTCATTATAGGTGATTTTATAACAAGCTAAGATATTTTGTTCACTTTTGCATTGTTAAGTCACTGTACTATATTAGTTGTACTTTCCTTTGTAGTTGCTTGAAGGTTGTTCTGGAGATTTTAGAGAAGCTTTCAAATCCTGAAGTCAATGCCTTACTTTTTGAGTTTGGTTTTCAGGTTAGATGTTGTGGATTATGTATTTCTAATCAGGGGGATGTAACAGCTTTAGCGTACTGTCTTTTTTTCCGTTGTCCTTTCACACTCACCCCGCCCCCGAACCCTTTGTCATAAGTCTAAAGTGATTTTCTTTTACAGCTTCTTTATGAGCTATGCTTAGATCCCCTAACATCTGGACCAGTCATGGATCTCTTGAGCAACAAGAAGTACTATTTCTTCGTTAAGGTATGAACACCTGCTTCTGCGTATGGAAAGATCTTCTGCTCTGTTTGGGGCTTGCATGATAAAGCCAAAATTGTCACATTTTTAACAAAAAAAGATTCTGTTGATCGTAATGTCTCATTTGCTTAAGTTCTATTCACTTGTTGTATATTTTCCACCCAACCTTCCATCCTTGATACCTTCTTCATGCTTGTAATGGCAGCACTTGGATACAATTGGAGTTGTTCCTCTTCCCAAGAGAAACAACCACACTCTTCGTGTCAGCTCCCTTCATCAGGTTTGTCTTTCTGTGATGAATAGCTGTACTTCTTAACCATGCCCATGCTTTGCAATTTGTATTTTTCCTGCATTTTCTAATTGACCATTTACCATTAAGATCTGGAGACGTGACTATTTATTTTTCTTTGTTCAAGAGAGCATGGTTGCTTAAGCTTCTAGCAATTGAGTTGCATGCTGCTGACTTGAGTAGCCCGATCCATCGAGAGGCATGTCAGAGTATTCTTGCGCACCTGTACGGGCAGGAAATAGATGTTGGATCAGTTCCAGTCTTCTCACTTCAACATCATGTGGTGGATCCTGGAACTAGAACTATGAGTAAGAGCAAGGTACTTTATTCGAGATTGACGTTTTATCTTCTTATGTACAGTAAGTGGACAGTTCTCTTCTTTTTGTTAAGCTAATTTTTTCGCATTGTTGCCCTGTAGATGTTACTGCCCATGATT
ATGACACAAATGAAATTGACCATTTTCTTGTAACTAGGTTCAACTTTTGAAGCAATATTGGGTTGACTTGAGTTATCTTTTAAATAATTTTTTTCCTGAATTTTTTGCGTCCAGGCGTTGGAGCTACTTGAGGTTGTTCAGTTCCGAACTCCCGACACCTCAATCAAGCTTCCCCAAATTGTTTCGAATTTGAAGTACGAATTGCTGGTATGTTCTAACATGCTGGTTTCCTTGTTTTTCCTTTGCATTCTTTGTGATGCCCCACATTGGTTGGGGAGGAGAACAAACCACCATTTATAAGGGTGTGGAAACCTTCCCCTAGCAGACGCATTTTAAAGCCTTGAGGGCGAGCCCGAAAGGGCTCCCAAAGAGGACAATATCTGCTAGCGGTGTATCTGGGTCGTTAGATTCTTGGCATTGGACACTGTTCTTATTCTTGTTTTATAGACAAAGGACATACTTGGGAATCCTTCAACTTCGCAAAAGGGTGGGATCTACCATTATTCAGAGCGTGGCGATCGCCTGATTGACCTTACTTCCTTTTGTGATAAACTTTGGCAGGTTGGTGAAGGACAACATCAGTATCTTCTTTTGATTAAAAAATGACAAACTTCAAAAATATGTTCATTTGTTTCAGAAATTTAATGCTGATAATCCCCAACTGAATAACGTTGGGAGGGAAGCTGAGCTGGAAGAAGTAAAAGAGACGATTCAACAGTTTCTACGATGGGGATGGAAGTACAACAAAAATCTCGAGGAACAGGCTGCACAACTTCATATGTTAACTAGCTGGTCACAAACCATCGAGGTTTATCGTTCTTTCCTCTATTTGTGTTAGTTGCATGCTTCTTTTAGTTTAGATTTTAACATTATGTTTTCAATAGTTTCTAATGATAAACAGTTTGCTTTTATAGGTTACTGTGTCCAGAAGAATTTCATCACTTGAAAATAGATCCGATATTCTATTTCAGTATGTTCATATCTATACCTTGTTCTTTCTTCCTGCTTTTCACATTTTTAACAGTTCATTGTTGTGAACAGAGTTCTTGATGCTTCCTTGAGCGCTTCTGCTTCTCCAGATTGTTCTTTGAAGATGGCATATCTTTTATGTCAGGTTTGTTCATCCTTCCCCATTGCTAGTGTTTAACTTTCTACATCTTTCAATTATCCTAGTTGCTGATGCCCTTTTACAGGTTGCACTGACATGCATGGCCAAGCTAAGGGATGAAAGATATTCATCTCCTGGTGGTTTAAATGCTGATAGTGTCTCTTGTCTTGATATTATCATGGTGAAGCAAATATCTAATGGAGCATGTCACTCTATTTTACTAAAGCTCATCATGGCAATTCTTAGAAATGAATCATCAGAAGCTTTGAGGAGACGGTGTGGTTAATATAACTTTTGCTGGTATTGGCTGTGTGGCTTATGAGTTGGAATCTAATGTTAAGCTTTTGCATCTATCTACAGCCAATATGCTTTGCTTCTCAGCTACCTTCAGTATTGTCAAAATATGCTGGATCCGGACGTTCCAACAACGATTCTGCAAGTTTTACTTCAAAATGAGCAAGATGGAGAGGATGTAGATTTACAAAAGGTTGTCAACTTAGTGCTTTGGTGCATAAACACTTCTCAAACTTCCACTAGTTTCTCTTGAAGATCAAAACATAATTCTCTGCTTTACAGATTGACAAGGACCAGGCCGAACTTGCTCATGCTAATTTTTCAATTCTACGTAAAGAAGCTCAGTCTATCTTGAATGTGGTGAGTTGCTTGTTCTTGAGATATGAAACCAACTAAATCCTGGTTGATTTGCGTCGTTATTGACCTTAGCTTGTTATTTTTCGTGTTCCTTTAATGCTCTATATATGTAGTTTTCTTCCATAGAATTCTTTCGTCTTCTTTAATCTTTTGATCTTAATCCTTAGGTTATCAAGGATGCAACTCAAGGCAGTGAACCAGGGAAGACTATATCGCTCTACGTACTCGATGCATTAATCTG
TATAGATCATGACAGATTTTTCCTGAACCAACTCCAGAACAGGGGATTTTTGAAGTCTTGTTTGGTCAGCATAAGTAACGTTTCACTTCAGGTGGTTGTTTCCTCTCTTCAGTGAAATAGTTCTGATATATTTATTAGCTGATATTGGTCCAACTCATAAAATGGAAACCCTTTAGGATGGCGTGCACTCTTTCGATTCATTGCAACGAGCATGCACCCTCGAGGCTGAACTTGCCTTATTGTCGAGGATTAGCCACAAGTACGGAAAATTTGGGGCTCAGCTTCTCTTTTCCACAGGCGCATTGGAGCACCTTGATTCATGCAGGGCAATCAATTTACAGGTGATTAACTGTCTACTATGTTATTGGATTTGTCTTAGAAGATTTAGTTTAATGAAGTGCTGTAGCAGGGAAATTTGAGGTGGGTTGATATGAAGCCTCATAGAGATGTAGCAGGGAATTTTAACAAGCAACAAGCTATCGTAACTCCGATTTTGAGACTGTTGTTTTCTATGACGTCATTAGTTGACACGTCCGAGTTTTTTGAGGTCAGCATACACTAAACAGACATTCTTTTTTTTGCTTAATCAGTTAATGTTCAAGTTTTCCTTATTAGACCCTGACCTTCGACCTCTATTACGAATATGCGACATTTATTTGAACGATTTGTTTGAATCTATTGTGATTTGATGTATATATTGTAGGTTAAAAATAAAATTGTTCGAGAAGCTATAGATTTTATTAAACGACACCAACGAGTATTTGATCAAATACTTGGGGAGGACGTATCTGAAGCCGATGATTCGACGTTGGAGCAGATTAATCTTCTCGTTGCTTCGCTTAGCAAGGTAACAACCTTGATATATTAGAAAATTAAGATCTTTTTGTTGACTAACATTTCAATAATTTTTGTTTCTGGGAAGGTTTGGCCATATGAGGAGACTGATGAATATGGTTTTGTTCAAAGTCTCTTCCAACTGATGCATTCGCTTTTCTCTCGTGAATTGGATTCTCGTACTCCTGGCCCAGCAGTTAAATTGCTTAAGGTTTGTGCTTTTTGTTTGTTTATTTATAGAATGGTTCTTTTTTTCCTGGAAAATTAAACTTATCATGCTTTTTGTTCTTCAGAACCGGAGAAGCTCAGAACTTCACTCCATTCAGCTCAACTTCAGCTTGATCTCTTATTTATATTTTCTAGTAACAAGGAAATCTCTAAGATTGCAGGTACTGTTCTTTGCCAAAAATGACTGACACGACTATAGTTTGGAATTTAACTCTCTGATGTGATCGTTCAGGTCTCTGGTACTTCCTCGAGCCATAACTCTCCTGTTCGGTCTCAACGTCCCTCGTTGGATTTGCTTGGTACTCTTCTGAACTCTACGACGATTACTCTTGAAAAAGCAGCTGAAGAAAGATTGTTGCTATTAAACAAGGTTTGGCTGCTACATTGTATTTCCCCCTTTCCTGGTTGTATATGGTTCATCCAATGCAACAATATCATGGTTTGAGGTCTTGCAGATTCGAGACATAAATGAACTGTCGAGACAGGAGGTCGAGGAAATTATCGTACTGTGTCTTGGTGAAGACTTTGCCTCGTTATCTGATAACATCCAGAGAAGGTTCCTTTCTCCACCTACTAATCATACTTTTTTTAGTCCATTTTCCGCGTAGTTTTCGGACTTGCCCTCTTATTTCGAAGATAGATTGTGGGCAAATGTGATATCCCACGTTGGTTGGGGAGGAGAACGAAACACCCTTTATAAGGGTGTGGAAACCTTTCCCTAGAGGCACGTTTTAAAGCCCAAAGAGGACAATATTTGCTAGCGGTGGGCTTGGACCGTTACACCAAAAGTTTAATACAACTATATATAACGTGTAGATGTGAGGAATATAGATACATATGAATGGTTTGATGTTGTTATGCAGGAGATATATTGCAATGATTGAAATGTGCAAGATTGTTGGAAATAAAAGTGAGATGATTACATTGCTACTCCCTCT
GGCTGAGTATGTGTTAAATGTGATGCTTATTCATTTTCAAGACAGGTGAGTTAAATGTGATATGATATGCATATACTTGGGAACAGTGTGAGGTTGAAAATGATATCTATGTTTGGAGTTTGCAGCAGTGTTATTCCTGATGGGAATGCAAATATTAAAGCAATTGCATATCATGGGGAGTTGGAGTCAGGACATGAAATAAGTTCATTGTGTGGGAAATTGATTCCCGTCTTAGAAAGGCTTGAGTTGCTGAGTGAGGTATTCTTCTGTCTATCACACTCTTTTACTACTTCCAATATACTTGTTACTGAAAATTTAAACTAGTAGAGAGTTGTACCTCGGGACGTAAGGAATCCATTAGTACGAGGCCTTTTGGGGAGGTTCAAAGCAAATTCACGAGAGGTTATGCTCAAAGTGGACAATATCATACCATTGTGGAGAGTCGTGATTCCTAACATGGTATCAGCGCCATACCCTTAACTTAGCCTTAAGTTGCACAAGTGTTGAGCAAATGGTGTAGTTTGTTTGAGGGCTCTAGAGAAATAGTGAAGCCGGGGATAGAGGGCTCTATAAAGTAGTTGGTTCCTTGCTAAGAGAGTTAATATATATGGTGTTTTGTTTTGTTTTGTTTGTGTGTTCCAGAATAAAATTGGAGAGAATCTGAAGGTGTTTGGGAGATTGGTGAGTTCAGTGAAGGAGGTGGCGATTCAGAAGTTGGATGTATGAGGTTAATTGGTTGATGTTAAATGATGATATCATTTTATTTAAGTTTTAACAAAATATGTAATTAATTATTTTTGTTAATAGTAAAGTATTTATTTTTGCATTGGAAC
mRNA sequence
AGCCGTCTTTATTTCCATAGAAAGTCATGGCGCCTCTTTTGCTTGTTTTCACCTTCCAACAAAAAACCCACTGGGCATAAATAATAACACCTACTAGGGCATATGAGCAGGGTTTATCGAACAGTTCTGCGTCTCTAAAACCACCCTCCATTTTCCCGCGCTCCACCTTCGTTGCCCCATCTTCCCTCTTCCCCTTTCGGCGGCACCTTCATTTTGCCACTCACAGTGCGCCGCCATGTTGTCCTCCAAGCAGAGCCTCCGCATTATCGAGTCCGCGCTTCTGGGTCCGTCCCCTCCGTCCCCATCGCAGAGGGTCGAGCTCTTACACGCCATTCACAATTCTATGCCTGCTTTTCGTTCGCTTCTTCAATTCCCATCGCCGAAAGCTTCAGACCGTGCACAAGTTCAATCTAAAGAAGTCAGGCGGCCGGATTCTTCAACGATAACGCTCGATGATCAAGACGTCGTAATTACTTTAAAGTTGAGTGATGACCTCCATCTGAATGAAATAGAATGCGTGCATCTACTTGTTGCTGCACATCAAGAGTGGGCTTTAATGGGACGAGATCCTTCAGAGATTTTCCACCTTGCAGCAGGGCTTTGGTATACAGAGAGAAGAGATCTAATAATGTCACTCTATACACTACTAAGGGCTGTTGTACTTGATCAGGGGCTTGAAGCTGGTCTTGTATCTGACATCCAAAGACACCTGGAAGATCTTATTAATAGTGGCCTACGACAACGATTGATTTCTTTGATAAAGGAACTCAATCGAGAAGAACCAGCTGGTTTGGGTGGACCTAGCTGTGAGCGCTACATTCTTGATTCCAGAGGTGCTCTTGTTGAGCGTCGGGCTGTGGTTTGTCGGGAAAGACTTATACTAGGACATTGTCTTGTCCTTTCTATTCTTGTTGTTCGAATAGGTCCAAAGGATGTGAGGGATCTCTTTTCTGTTTTAAAAGACTGCGCTGCTGAGCTTAATGAGACCAAAACTCCCATAAAGCTTCAGATTGTATTTAGTCTCCTTTTCTCGATCATTATTGCCTTTGTATCGGATGCTTTGAGCGCTGTTCCAAATAAAGCATCTATATTGTCCAGTGATGCTTCTTTTAGAAATGAATTTCAGGATACCGTGATGGTGTCTGGAAATAATCCCACAGTGGAAGGATTTGTTGATGCTGTTAGGTTTGCTTGGACTGTTCATTTGCTGTTAATACACGACATGGTTGATGCAAGAGAAGCTGTTCCCAGTGCTTCCCCAAAAGATTTAGATCACCTACAATCGTGCTTGGAAGTTATTTTCTCCCATAATGCTTTCCAGTTCATGCTTCAAGAAGTTATTCAGACTGCGGCTTACCAGAATGATGATGAGGACATGACCTATATGTACAATGCTTATCTGCACAAACTGGTTTCTTGTTTTTTATCTCATCCTCTGGCTAGGGACAAGGTGAAAGAGTCAAAGGACCGAGCTATGAACACGCTAAGTCAGTTTCGTGCTACCGGGTCACAAGATTTCATGCATGATGGTGATTCAAGTTCCCATCAAGCTAGCGAAACTCTTCCTTCACCTTTTGTCTCCCTCTTGAACTTTGTTAGTGAAATTTACCGGAAAGAACCAGAACTGTTATCAAGCAATGATGTCTTGTGGACATTTGCAAATTTCGCTGGGGAGGATCATACAAATTTTCAAACTTTGGTGGCATTCTTGAACATGCTGAGTACCTTAGCTTGTAATGAAGAGGGTGCTTCAAGGGTTTTTGAGTTACTTCAGGGAAAAGCATTCCGATCTGTTGGATGGACCACCTTATTTGATTGCCTATCAATTTATGATGAAAAATTTAGACAGTCCCTTCAGACTGCTGGGGCCCTGTTACCAGAGTTTCAGGAAGGGGATGCAAAAGCGCTTGTTGCGTATTTGAATGTTCTTCAAAAGGTTGTGGAGAATGGGAATCCTGTTGAAAGAAAGAACTGGTTTCCTGATATTGAACCACTGTTCA
AACTTCTTAGCTATGAAAATGTTCCTCCTTATCTGAAGGGTGCTTTGAGGAACGCAATCGCATCCTTCATTCAAGTCTCTTCTGACTTGAAGGATATCATTTGGTGCTATCTTGAGCAGTATGATTTACCTGTTCTTGTTGCATCCCATATTCAGAATGGCACAAAGTCGATTACTTCTCAGGTTTATGATATGCAGTTCGAGGTAAATGAAATTGAAGCAAGACAGGAGAGATATCCATCAACCATATCTTTCCTTAACCTGTTAAATGCTCTTATTGCTAAAGAAAGTGATCTAAGTGATAGAGGGCGTAGATTTGTTGGAATATTTAGGTTCATTTATGATCATGTTTTTGGGCCATTCCCTCAACGAGCCTATGCGGATGCTGCTGAGAAATGGCAGTTAGTTGTTGCTTGCCTACAACATTTTAATATGATTTTGAAAATGTATGACATTAATGAAGAGGACGTTGACGTTGTTATTGATCGATCACAATCATCAATGGATACACAATCATCTTCACTTCAAACTCAGCTACCAGTACTTGAGTTGCTAAAAGATTTCATGAGCGGAAAATCTGTTTTTAGAAACATCATGGGGATTCTTCTGCCCGGTGTCAATTCTCTCATAACCGAAAGGACCAGCCAAATTTATGGCCAACTTTTGGAGAAGTCTGTGGAGCTTTCTCTTGAAATAATGATACTTGTATTAGAGAAGGATTTACTTCTGGCTGATTACTGGCGCCCTCTATATCAGCCCTTGGAAGTTATCCTCTCTCAAGATCACAGTCAAATTGTTGCGCTGTTGGAGTATGTCAGATATGATTTTCATCCCAAGATTCAGCAGTTATCTATCAAAATCATGAGTATCTTAAGTTCTCGTATGGTTGGGCTCGTGCAATTACTACTAAAATCTAATACTGCCAGCTCTTTAGTTGAGGATTATGCGTCCTGCCTCGAGTTAAGATCTGAAGAGTGTCATGTAATTGAAAACAGTGGAGATGATCCTGGTGTTCTTATAATGCAGCTTCTTATTGACAACATTAGCCGACCTGCTCCAAATGTTACACACTTGCTACTTAAATTTAACCTTGAGACATCCATCGAGCGGACAATTTTACAGCCAAAATTTCATTATAGTTGCTTGAAGGTTGTTCTGGAGATTTTAGAGAAGCTTTCAAATCCTGAAGTCAATGCCTTACTTTTTGAGTTTGGTTTTCAGCTTCTTTATGAGCTATGCTTAGATCCCCTAACATCTGGACCAGTCATGGATCTCTTGAGCAACAAGAAGTACTATTTCTTCGTTAAGCACTTGGATACAATTGGAGTTGTTCCTCTTCCCAAGAGAAACAACCACACTCTTCGTGTCAGCTCCCTTCATCAGAGAGCATGGTTGCTTAAGCTTCTAGCAATTGAGTTGCATGCTGCTGACTTGAGTAGCCCGATCCATCGAGAGGCATGTCAGAGTATTCTTGCGCACCTGTACGGGCAGGAAATAGATGTTGGATCAGTTCCAGTCTTCTCACTTCAACATCATGTGGTGGATCCTGGAACTAGAACTATGAGTAAGAGCAAGGCGTTGGAGCTACTTGAGGTTGTTCAGTTCCGAACTCCCGACACCTCAATCAAGCTTCCCCAAATTGTTTCGAATTTGAAGTACGAATTGCTGACAAAGGACATACTTGGGAATCCTTCAACTTCGCAAAAGGGTGGGATCTACCATTATTCAGAGCGTGGCGATCGCCTGATTGACCTTACTTCCTTTTGTGATAAACTTTGGCAGAAATTTAATGCTGATAATCCCCAACTGAATAACGTTGGGAGGGAAGCTGAGCTGGAAGAAGTAAAAGAGACGATTCAACAGTTTCTACGATGGGGATGGAAGTACAACAAAAATCTCGAGGAACAGGCTGCACAACTTCATATGTTAACTAGCTGGTCACAAACCATCGAGGTTACTGTGTCCAGAAGAATTTCATCACTTGAAAATAGATCCGATATT
CTATTTCAAGTTCTTGATGCTTCCTTGAGCGCTTCTGCTTCTCCAGATTGTTCTTTGAAGATGGCATATCTTTTATGTCAGGTTGCACTGACATGCATGGCCAAGCTAAGGGATGAAAGATATTCATCTCCTGGTGGTTTAAATGCTGATAGTGTCTCTTGTCTTGATATTATCATGGTGAAGCAAATATCTAATGGAGCATGTCACTCTATTTTACTAAAGCTCATCATGGCAATTCTTAGAAATGAATCATCAGAAGCTTTGAGGAGACGCCAATATGCTTTGCTTCTCAGCTACCTTCAGTATTGTCAAAATATGCTGGATCCGGACGTTCCAACAACGATTCTGCAAGTTTTACTTCAAAATGAGCAAGATGGAGAGGATGTAGATTTACAAAAGATTGACAAGGACCAGGCCGAACTTGCTCATGCTAATTTTTCAATTCTACGTAAAGAAGCTCAGTCTATCTTGAATGTGGTTATCAAGGATGCAACTCAAGGCAGTGAACCAGGGAAGACTATATCGCTCTACGTACTCGATGCATTAATCTGTATAGATCATGACAGATTTTTCCTGAACCAACTCCAGAACAGGGGATTTTTGAAGTCTTGTTTGGTCAGCATAAGTAACGTTTCACTTCAGGATGGCGTGCACTCTTTCGATTCATTGCAACGAGCATGCACCCTCGAGGCTGAACTTGCCTTATTGTCGAGGATTAGCCACAAGTACGGAAAATTTGGGGCTCAGCTTCTCTTTTCCACAGGCGCATTGGAGCACCTTGATTCATGCAGGGCAATCAATTTACAGGGAAATTTGAGGTGGGTTGATATGAAGCCTCATAGAGATGTAGCAGGGAATTTTAACAAGCAACAAGCTATCGTAACTCCGATTTTGAGACTGTTGTTTTCTATGACGTCATTAGTTGACACGTCCGAGTTTTTTGAGGTTAAAAATAAAATTGTTCGAGAAGCTATAGATTTTATTAAACGACACCAACGAGTATTTGATCAAATACTTGGGGAGGACGTATCTGAAGCCGATGATTCGACGTTGGAGCAGATTAATCTTCTCGTTGCTTCGCTTAGCAAGGTTTGGCCATATGAGGAGACTGATGAATATGGTTTTGTTCAAAGTCTCTTCCAACTGATGCATTCGCTTTTCTCTCGTGAATTGGATTCTCGTACTCCTGGCCCAGCAGTTAAATTGCTTAAGAACCGGAGAAGCTCAGAACTTCACTCCATTCAGCTCAACTTCAGCTTGATCTCTTATTTATATTTTCTAGTAACAAGGAAATCTCTAAGATTGCAGGTCTCTGGTACTTCCTCGAGCCATAACTCTCCTGTTCGGTCTCAACGTCCCTCGTTGGATTTGCTTGGTACTCTTCTGAACTCTACGACGATTACTCTTGAAAAAGCAGCTGAAGAAAGATTGTTGCTATTAAACAAGATTCGAGACATAAATGAACTGTCGAGACAGGAGGTCGAGGAAATTATCGTACTGTGTCTTGGTGAAGACTTTGCCTCGTTATCTGATAACATCCAGAGAAGGAGATATATTGCAATGATTGAAATGTGCAAGATTGTTGGAAATAAAAGTGAGATGATTACATTGCTACTCCCTCTGGCTGAGTATGTGTTAAATGTGATGCTTATTCATTTTCAAGACAGCAGTGTTATTCCTGATGGGAATGCAAATATTAAAGCAATTGCATATCATGGGGAGTTGGAGTCAGGACATGAAATAAGTTCATTGTGTGGGAAATTGATTCCCGTCTTAGAAAGGCTTGAGTTGCTGAGTGAGAATAAAATTGGAGAGAATCTGAAGGTGTTTGGGAGATTGGTGAGTTCAGTGAAGGAGGTGGCGATTCAGAAGTTGGATGTATGAGGTTAATTGGTTGATGTTAAATGATGATATCATTTTATTTAAGTTTTAACAAAATATGTAATTAATTATTTTTGTTAATAGTAAAGTATTTATTTTTGCATTGGAAC
Coding sequence (CDS)
ATGTTGTCCTCCAAGCAGAGCCTCCGCATTATCGAGTCCGCGCTTCTGGGTCCGTCCCCTCCGTCCCCATCGCAGAGGGTCGAGCTCTTACACGCCATTCACAATTCTATGCCTGCTTTTCGTTCGCTTCTTCAATTCCCATCGCCGAAAGCTTCAGACCGTGCACAAGTTCAATCTAAAGAAGTCAGGCGGCCGGATTCTTCAACGATAACGCTCGATGATCAAGACGTCGTAATTACTTTAAAGTTGAGTGATGACCTCCATCTGAATGAAATAGAATGCGTGCATCTACTTGTTGCTGCACATCAAGAGTGGGCTTTAATGGGACGAGATCCTTCAGAGATTTTCCACCTTGCAGCAGGGCTTTGGTATACAGAGAGAAGAGATCTAATAATGTCACTCTATACACTACTAAGGGCTGTTGTACTTGATCAGGGGCTTGAAGCTGGTCTTGTATCTGACATCCAAAGACACCTGGAAGATCTTATTAATAGTGGCCTACGACAACGATTGATTTCTTTGATAAAGGAACTCAATCGAGAAGAACCAGCTGGTTTGGGTGGACCTAGCTGTGAGCGCTACATTCTTGATTCCAGAGGTGCTCTTGTTGAGCGTCGGGCTGTGGTTTGTCGGGAAAGACTTATACTAGGACATTGTCTTGTCCTTTCTATTCTTGTTGTTCGAATAGGTCCAAAGGATGTGAGGGATCTCTTTTCTGTTTTAAAAGACTGCGCTGCTGAGCTTAATGAGACCAAAACTCCCATAAAGCTTCAGATTGTATTTAGTCTCCTTTTCTCGATCATTATTGCCTTTGTATCGGATGCTTTGAGCGCTGTTCCAAATAAAGCATCTATATTGTCCAGTGATGCTTCTTTTAGAAATGAATTTCAGGATACCGTGATGGTGTCTGGAAATAATCCCACAGTGGAAGGATTTGTTGATGCTGTTAGGTTTGCTTGGACTGTTCATTTGCTGTTAATACACGACATGGTTGATGCAAGAGAAGCTGTTCCCAGTGCTTCCCCAAAAGATTTAGATCACCTACAATCGTGCTTGGAAGTTATTTTCTCCCATAATGCTTTCCAGTTCATGCTTCAAGAAGTTATTCAGACTGCGGCTTACCAGAATGATGATGAGGACATGACCTATATGTACAATGCTTATCTGCACAAACTGGTTTCTTGTTTTTTATCTCATCCTCTGGCTAGGGACAAGGTGAAAGAGTCAAAGGACCGAGCTATGAACACGCTAAGTCAGTTTCGTGCTACCGGGTCACAAGATTTCATGCATGATGGTGATTCAAGTTCCCATCAAGCTAGCGAAACTCTTCCTTCACCTTTTGTCTCCCTCTTGAACTTTGTTAGTGAAATTTACCGGAAAGAACCAGAACTGTTATCAAGCAATGATGTCTTGTGGACATTTGCAAATTTCGCTGGGGAGGATCATACAAATTTTCAAACTTTGGTGGCATTCTTGAACATGCTGAGTACCTTAGCTTGTAATGAAGAGGGTGCTTCAAGGGTTTTTGAGTTACTTCAGGGAAAAGCATTCCGATCTGTTGGATGGACCACCTTATTTGATTGCCTATCAATTTATGATGAAAAATTTAGACAGTCCCTTCAGACTGCTGGGGCCCTGTTACCAGAGTTTCAGGAAGGGGATGCAAAAGCGCTTGTTGCGTATTTGAATGTTCTTCAAAAGGTTGTGGAGAATGGGAATCCTGTTGAAAGAAAGAACTGGTTTCCTGATATTGAACCACTGTTCAAACTTCTTAGCTATGAAAATGTTCCTCCTTATCTGAAGGGTGCTTTGAGGAACGCAATCGCATCCTTCATTCAAGTCTCTTCTGACTTGAAGGATATCATTTGGTGCTATCTTGAGCAGTATGATTTACCTGTTCTTGTTGCATCCCATATTCAGAATGGCACAAAGTCGATTACTTCTCAGGTTTATGATATGCAGTTCGAGGTAAATGAAATTGAAGCAAGACAGGAGAGATA
TCCATCAACCATATCTTTCCTTAACCTGTTAAATGCTCTTATTGCTAAAGAAAGTGATCTAAGTGATAGAGGGCGTAGATTTGTTGGAATATTTAGGTTCATTTATGATCATGTTTTTGGGCCATTCCCTCAACGAGCCTATGCGGATGCTGCTGAGAAATGGCAGTTAGTTGTTGCTTGCCTACAACATTTTAATATGATTTTGAAAATGTATGACATTAATGAAGAGGACGTTGACGTTGTTATTGATCGATCACAATCATCAATGGATACACAATCATCTTCACTTCAAACTCAGCTACCAGTACTTGAGTTGCTAAAAGATTTCATGAGCGGAAAATCTGTTTTTAGAAACATCATGGGGATTCTTCTGCCCGGTGTCAATTCTCTCATAACCGAAAGGACCAGCCAAATTTATGGCCAACTTTTGGAGAAGTCTGTGGAGCTTTCTCTTGAAATAATGATACTTGTATTAGAGAAGGATTTACTTCTGGCTGATTACTGGCGCCCTCTATATCAGCCCTTGGAAGTTATCCTCTCTCAAGATCACAGTCAAATTGTTGCGCTGTTGGAGTATGTCAGATATGATTTTCATCCCAAGATTCAGCAGTTATCTATCAAAATCATGAGTATCTTAAGTTCTCGTATGGTTGGGCTCGTGCAATTACTACTAAAATCTAATACTGCCAGCTCTTTAGTTGAGGATTATGCGTCCTGCCTCGAGTTAAGATCTGAAGAGTGTCATGTAATTGAAAACAGTGGAGATGATCCTGGTGTTCTTATAATGCAGCTTCTTATTGACAACATTAGCCGACCTGCTCCAAATGTTACACACTTGCTACTTAAATTTAACCTTGAGACATCCATCGAGCGGACAATTTTACAGCCAAAATTTCATTATAGTTGCTTGAAGGTTGTTCTGGAGATTTTAGAGAAGCTTTCAAATCCTGAAGTCAATGCCTTACTTTTTGAGTTTGGTTTTCAGCTTCTTTATGAGCTATGCTTAGATCCCCTAACATCTGGACCAGTCATGGATCTCTTGAGCAACAAGAAGTACTATTTCTTCGTTAAGCACTTGGATACAATTGGAGTTGTTCCTCTTCCCAAGAGAAACAACCACACTCTTCGTGTCAGCTCCCTTCATCAGAGAGCATGGTTGCTTAAGCTTCTAGCAATTGAGTTGCATGCTGCTGACTTGAGTAGCCCGATCCATCGAGAGGCATGTCAGAGTATTCTTGCGCACCTGTACGGGCAGGAAATAGATGTTGGATCAGTTCCAGTCTTCTCACTTCAACATCATGTGGTGGATCCTGGAACTAGAACTATGAGTAAGAGCAAGGCGTTGGAGCTACTTGAGGTTGTTCAGTTCCGAACTCCCGACACCTCAATCAAGCTTCCCCAAATTGTTTCGAATTTGAAGTACGAATTGCTGACAAAGGACATACTTGGGAATCCTTCAACTTCGCAAAAGGGTGGGATCTACCATTATTCAGAGCGTGGCGATCGCCTGATTGACCTTACTTCCTTTTGTGATAAACTTTGGCAGAAATTTAATGCTGATAATCCCCAACTGAATAACGTTGGGAGGGAAGCTGAGCTGGAAGAAGTAAAAGAGACGATTCAACAGTTTCTACGATGGGGATGGAAGTACAACAAAAATCTCGAGGAACAGGCTGCACAACTTCATATGTTAACTAGCTGGTCACAAACCATCGAGGTTACTGTGTCCAGAAGAATTTCATCACTTGAAAATAGATCCGATATTCTATTTCAAGTTCTTGATGCTTCCTTGAGCGCTTCTGCTTCTCCAGATTGTTCTTTGAAGATGGCATATCTTTTATGTCAGGTTGCACTGACATGCATGGCCAAGCTAAGGGATGAAAGATATTCATCTCCTGGTGGTTTAAATGCTGATAGTGTCTCTTGTCTTGATATTATCATGGTGAAGCAAATATCTAATGGAGCATGTCACTCTATTTTACTAAAGCTCATCATGGCAA
TTCTTAGAAATGAATCATCAGAAGCTTTGAGGAGACGCCAATATGCTTTGCTTCTCAGCTACCTTCAGTATTGTCAAAATATGCTGGATCCGGACGTTCCAACAACGATTCTGCAAGTTTTACTTCAAAATGAGCAAGATGGAGAGGATGTAGATTTACAAAAGATTGACAAGGACCAGGCCGAACTTGCTCATGCTAATTTTTCAATTCTACGTAAAGAAGCTCAGTCTATCTTGAATGTGGTTATCAAGGATGCAACTCAAGGCAGTGAACCAGGGAAGACTATATCGCTCTACGTACTCGATGCATTAATCTGTATAGATCATGACAGATTTTTCCTGAACCAACTCCAGAACAGGGGATTTTTGAAGTCTTGTTTGGTCAGCATAAGTAACGTTTCACTTCAGGATGGCGTGCACTCTTTCGATTCATTGCAACGAGCATGCACCCTCGAGGCTGAACTTGCCTTATTGTCGAGGATTAGCCACAAGTACGGAAAATTTGGGGCTCAGCTTCTCTTTTCCACAGGCGCATTGGAGCACCTTGATTCATGCAGGGCAATCAATTTACAGGGAAATTTGAGGTGGGTTGATATGAAGCCTCATAGAGATGTAGCAGGGAATTTTAACAAGCAACAAGCTATCGTAACTCCGATTTTGAGACTGTTGTTTTCTATGACGTCATTAGTTGACACGTCCGAGTTTTTTGAGGTTAAAAATAAAATTGTTCGAGAAGCTATAGATTTTATTAAACGACACCAACGAGTATTTGATCAAATACTTGGGGAGGACGTATCTGAAGCCGATGATTCGACGTTGGAGCAGATTAATCTTCTCGTTGCTTCGCTTAGCAAGGTTTGGCCATATGAGGAGACTGATGAATATGGTTTTGTTCAAAGTCTCTTCCAACTGATGCATTCGCTTTTCTCTCGTGAATTGGATTCTCGTACTCCTGGCCCAGCAGTTAAATTGCTTAAGAACCGGAGAAGCTCAGAACTTCACTCCATTCAGCTCAACTTCAGCTTGATCTCTTATTTATATTTTCTAGTAACAAGGAAATCTCTAAGATTGCAGGTCTCTGGTACTTCCTCGAGCCATAACTCTCCTGTTCGGTCTCAACGTCCCTCGTTGGATTTGCTTGGTACTCTTCTGAACTCTACGACGATTACTCTTGAAAAAGCAGCTGAAGAAAGATTGTTGCTATTAAACAAGATTCGAGACATAAATGAACTGTCGAGACAGGAGGTCGAGGAAATTATCGTACTGTGTCTTGGTGAAGACTTTGCCTCGTTATCTGATAACATCCAGAGAAGGAGATATATTGCAATGATTGAAATGTGCAAGATTGTTGGAAATAAAAGTGAGATGATTACATTGCTACTCCCTCTGGCTGAGTATGTGTTAAATGTGATGCTTATTCATTTTCAAGACAGCAGTGTTATTCCTGATGGGAATGCAAATATTAAAGCAATTGCATATCATGGGGAGTTGGAGTCAGGACATGAAATAAGTTCATTGTGTGGGAAATTGATTCCCGTCTTAGAAAGGCTTGAGTTGCTGAGTGAGAATAAAATTGGAGAGAATCTGAAGGTGTTTGGGAGATTGGTGAGTTCAGTGAAGGAGGTGGCGATTCAGAAGTTGGATGTATGA
Protein sequence
MLSSKQSLRIIESALLGPSPPSPSQRVELLHAIHNSMPAFRSLLQFPSPKASDRAQVQSKEVRRPDSSTITLDDQDVVITLKLSDDLHLNEIECVHLLVAAHQEWALMGRDPSEIFHLAAGLWYTERRDLIMSLYTLLRAVVLDQGLEAGLVSDIQRHLEDLINSGLRQRLISLIKELNREEPAGLGGPSCERYILDSRGALVERRAVVCRERLILGHCLVLSILVVRIGPKDVRDLFSVLKDCAAELNETKTPIKLQIVFSLLFSIIIAFVSDALSAVPNKASILSSDASFRNEFQDTVMVSGNNPTVEGFVDAVRFAWTVHLLLIHDMVDAREAVPSASPKDLDHLQSCLEVIFSHNAFQFMLQEVIQTAAYQNDDEDMTYMYNAYLHKLVSCFLSHPLARDKVKESKDRAMNTLSQFRATGSQDFMHDGDSSSHQASETLPSPFVSLLNFVSEIYRKEPELLSSNDVLWTFANFAGEDHTNFQTLVAFLNMLSTLACNEEGASRVFELLQGKAFRSVGWTTLFDCLSIYDEKFRQSLQTAGALLPEFQEGDAKALVAYLNVLQKVVENGNPVERKNWFPDIEPLFKLLSYENVPPYLKGALRNAIASFIQVSSDLKDIIWCYLEQYDLPVLVASHIQNGTKSITSQVYDMQFEVNEIEARQERYPSTISFLNLLNALIAKESDLSDRGRRFVGIFRFIYDHVFGPFPQRAYADAAEKWQLVVACLQHFNMILKMYDINEEDVDVVIDRSQSSMDTQSSSLQTQLPVLELLKDFMSGKSVFRNIMGILLPGVNSLITERTSQIYGQLLEKSVELSLEIMILVLEKDLLLADYWRPLYQPLEVILSQDHSQIVALLEYVRYDFHPKIQQLSIKIMSILSSRMVGLVQLLLKSNTASSLVEDYASCLELRSEECHVIENSGDDPGVLIMQLLIDNISRPAPNVTHLLLKFNLETSIERTILQPKFHYSCLKVVLEILEKLSNPEVNALLFEFGFQLLYELCLDPLTSGPVMDLLSNKKYYFFVKHLDTIGVVPLPKRNNHTLRVSSLHQRAWLLKLLAIELHAADLSSPIHREACQSILAHLYGQEIDVGSVPVFSLQHHVVDPGTRTMSKSKALELLEVVQFRTPDTSIKLPQIVSNLKYELLTKDILGNPSTSQKGGIYHYSERGDRLIDLTSFCDKLWQKFNADNPQLNNVGREAELEEVKETIQQFLRWGWKYNKNLEEQAAQLHMLTSWSQTIEVTVSRRISSLENRSDILFQVLDASLSASASPDCSLKMAYLLCQVALTCMAKLRDERYSSPGGLNADSVSCLDIIMVKQISNGACHSILLKLIMAILRNESSEALRRRQYALLLSYLQYCQNMLDPDVPTTILQVLLQNEQDGEDVDLQKIDKDQAELAHANFSILRKEAQSILNVVIKDATQGSEPGKTISLYVLDALICIDHDRFFLNQLQNRGFLKSCLVSISNVSLQDGVHSFDSLQRACTLEAELALLSRISHKYGKFGAQLLFSTGALEHLDSCRAINLQGNLRWVDMKPHRDVAGNFNKQQAIVTPILRLLFSMTSLVDTSEFFEVKNKIVREAIDFIKRHQRVFDQILGEDVSEADDSTLEQINLLVASLSKVWPYEETDEYGFVQSLFQLMHSLFSRELDSRTPGPAVKLLKNRRSSELHSIQLNFSLISYLYFLVTRKSLRLQVSGTSSSHNSPVRSQRPSLDLLGTLLNSTTITLEKAAEERLLLLNKIRDINELSRQEVEEIIVLCLGEDFASLSDNIQRRRYIAMIEMCKIVGNKSEMITLLLPLAEYVLNVMLIHFQDSSVIPDGNANIKAIAYHGELESGHEISSLCGKLIPVLERLELLSENKIGENLKVFGRLVSSVKEVAIQKLDV
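The relationship between the coding sequence and the protein sequence above can be checked with a short translation sketch. This is a minimal illustration, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene. The names `CODON_TABLE` and `translate` are hypothetical helpers introduced here and are not part of the database.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
# Assumption: the standard table applies to this nuclear gene.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    # Remove whitespace so a sequence wrapped over several lines can be pasted in.
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon terminates translation
            break
        protein.append(aa)
    return "".join(protein)


# First six codons of the CDS listed above.
print(translate("ATGTTGTCCTCCAAGCAG"))
# Last three codons of the CDS listed above, including the TGA stop.
print(translate("GATGTATGA"))
```

Passing the full CDS, with its wrapped lines joined, to `translate` should yield the protein sequence listed above if the standard-code assumption holds.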
Homology
BLAST of CmoCh12G012390 vs. ExPASy Swiss-Prot
Match:
F4KBW6 (Nuclear pore complex protein NUP205 OS=Arabidopsis thaliana OX=3702 GN=NUP205 PE=1 SV=1)
HSP 1 Score: 2358.2 bits (6110), Expect = 0.0e+00
Identity = 1210/1885 (64.19%), Positives = 1504/1885 (79.79%), Query Frame = 0
Query: 1 MLSSKQSLRIIESALLGPSPPSPSQRVELLHAIHNSMPAFRSLLQFPSPKASDRAQVQSK 60
M+S K + I+ S+LLG S P+P+QR+EL HAI NS P+ ++LL FP PK SDRAQVQSK
Sbjct: 1 MVSPKDLVAIVHSSLLGTSRPTPTQRIELTHAIRNSFPSLQNLLSFPPPKPSDRAQVQSK 60
Query: 61 EVRRPDSSTITLDDQDVVITLKLSDDLHLNEIECVHLLVAAHQEWALMGRDPSEIFHLAA 120
E+R PDS I+LDDQD+ I+LKLSD+LHLNEI+ V LLV+++QEW LMGRDP EI LA
Sbjct: 61 EIRLPDSLPISLDDQDIAISLKLSDELHLNEIDSVRLLVSSNQEWGLMGRDPLEIQRLAT 120
Query: 121 GLWYTERRDLIMSLYTLLRAVVLDQGLEAGLVSDIQRHLEDLINSGLRQRLISLIKELNR 180
GLWYT RRDL +LYTLLRAVVLD+GLE L++DIQ LE+LI +GLRQRLI+LIKELNR
Sbjct: 121 GLWYTGRRDLTSTLYTLLRAVVLDEGLEPDLIADIQGLLEELIEAGLRQRLITLIKELNR 180
Query: 181 EEPAGLGGPSCERYILDSRGALVERRAVVCRERLILGHCLVLSILVVRIGPKDVRDLFSV 240
E+P GLGGP CERY++DSRGALVERRAVV RERLILGHCLVLSILV R G KDV+D++ +
Sbjct: 181 EDPTGLGGPLCERYLIDSRGALVERRAVVQRERLILGHCLVLSILVDRPGSKDVKDIYYI 240
Query: 241 LKDCAAELNETKTPIKLQIVFSLLFSIIIAFVSDALSAVPNKASILSSDASFRNEFQDTV 300
LKD AA+L E I QI FSLLFS+II FVSDA+S + +K+S++S DASFR +FQD V
Sbjct: 241 LKDNAAQLTEGNDTISSQITFSLLFSLIITFVSDAISRLSDKSSMISQDASFRTDFQDIV 300
Query: 301 MVSGNNPTVEGFVDAVRFAWTVHLLLIHDMVDAREAVPSASPKDLDHLQSCLEVIFSHNA 360
M SG++PT +GF+ +R AW VHL+LIHD + + + +AS D+ H+ SCLE IFS N
Sbjct: 301 MASGSDPTADGFIGGIRLAWAVHLMLIHDGISGMDTISTASTTDMGHICSCLESIFSKNV 360
Query: 361 FQFMLQEVIQTAAYQNDDEDMTYMYNAYLHKLVSCFLSHPLARDKVKESKDRAMNTLSQF 420
FQF+L V++TAAYQND+ED+ Y+YNAYLHKL SCFLSHP+ARDKVKESKD AM+ L+ +
Sbjct: 361 FQFLLDNVLRTAAYQNDEEDIIYIYNAYLHKLASCFLSHPIARDKVKESKDMAMSVLNSY 420
Query: 421 RATGSQDFMHDGDSSSHQASETLPSPFVSLLNFVSEIYRKEPELLSSNDVLWTFANFAGE 480
R + D S P PF+SL+ F KEPELLS NDVLWTF NFAGE
Sbjct: 421 RTSDPL------DGSMQTEESDRPLPFISLMEF------KEPELLSGNDVLWTFVNFAGE 480
Query: 481 DHTNFQTLVAFLNMLSTLACNEEGASRVFELLQGKAFRSVGWTTLFDCLSIYDEKFRQSL 540
DHTNF+TLVAFL ML TLA +EGAS+V+ELL+G +FRS+GW TLFDC+ IYDEKF+QSL
Sbjct: 481 DHTNFKTLVAFLEMLCTLASTQEGASKVYELLRGTSFRSIGWPTLFDCIRIYDEKFKQSL 540
Query: 541 QTAGALLPEFQEGDAKALVAYLNVLQKVVENGNPVERKNWFPDIEPLFKLLSYENVPPYL 600
QTAGA++PEF EGDAKALVAYLNVLQKVVENGNP ERKNWFPDIEP FKLL YEN+PPYL
Sbjct: 541 QTAGAMMPEFLEGDAKALVAYLNVLQKVVENGNPTERKNWFPDIEPFFKLLGYENIPPYL 600
Query: 601 KGALRNAIASFIQVSSDLKDIIWCYLEQYDLPVLVASHIQNGTKSITSQVYDMQFEVNEI 660
KGALR IA+F+ V +++D IW +LEQYDLPV+V S + G +SQVYDMQFE+NE+
Sbjct: 601 KGALRKTIAAFVNVFPEMRDSIWAFLEQYDLPVVVGSQV--GKSDQSSQVYDMQFELNEV 660
Query: 661 EARQERYPSTISFLNLLNALIAKESDLSDRGRRFVGIFRFIYDHVFGPFPQRAYADAAEK 720
EAR+E+YPSTISFLNL+NALIA E D++DRGR RAY+D EK
Sbjct: 661 EARREQYPSTISFLNLINALIAGEKDVNDRGR-------------------RAYSDPCEK 720
Query: 721 WQLVVACLQHFNMILKMYDINEEDVDVVIDRSQSSMDTQSSSLQTQLPVLELLKDFMSGK 780
WQLVVACLQHF+MIL MYDI EED+D + + ++SSLQTQLP++ELLKDFMSGK
Sbjct: 721 WQLVVACLQHFHMILSMYDIQEEDLDGFTEHPHFLVSLETSSLQTQLPIIELLKDFMSGK 780
Query: 781 SVFRNIMGILLPGVNSLITERTSQIYGQLLEKSVELSLEIMILVLEKDLLLADYWRPLYQ 840
+++RN+MGIL GVNS+I+ER S+ YG++LEK+V+LSLEI++LV EKDLL++D WRPLYQ
Sbjct: 781 ALYRNLMGILQVGVNSIISERLSKTYGKILEKAVQLSLEILLLVFEKDLLVSDVWRPLYQ 840
Query: 841 PLEVILSQDHSQIVALLEYVRYDFHPKIQQLSIKIMSIL-SSRMVGLVQLLLKSNTASSL 900
PL++ILSQDH+QI+ALLEYVRYD P+IQ+ SIKIM+IL SR+VGLV +L+K + A+SL
Sbjct: 841 PLDIILSQDHNQIIALLEYVRYDSLPQIQRSSIKIMNILRCSRLVGLVPMLIKIDAANSL 900
Query: 901 VEDYASCLELRSEECHVIENSGDDPGVLIMQLLIDNISRPAPNVTHLLLKFNLETSIERT 960
+EDYA+CLE R EE V+ENS DD GVLIMQLL+DNI+RPAP++THLLLKF+L+ +E T
Sbjct: 901 IEDYAACLEGRLEEGEVVENSCDDLGVLIMQLLVDNINRPAPSITHLLLKFDLDAPVEGT 960
Query: 961 ILQPKFHYSCLKVVLEILEKLSNPEVNALLFEFGFQLLYELCLDPLTSGPVMDLLSNKKY 1020
+LQPKFHYSCLKV+LE+LEKL NP++N LLFEFGFQLL EL LDPLTSGP MDLLS+KKY
Sbjct: 961 VLQPKFHYSCLKVILEMLEKLPNPDINFLLFEFGFQLLCELNLDPLTSGPTMDLLSSKKY 1020
Query: 1021 YFFVKHLDTIGVVPLPKRN-NHTLRVSSLHQRAWLLKLLAIELHAADLSSPIHREACQSI 1080
FF++HLDTIGV LPKR+ + LR+SSLHQRAWLLKLLAI LH SS H EACQSI
Sbjct: 1021 QFFLQHLDTIGVATLPKRSGSQALRISSLHQRAWLLKLLAIALHTGSGSSSAHLEACQSI 1080
Query: 1081 LAHLYGQEIDVGSVPVFSLQHHVVD----PGTRTMSKSKALELLEVVQFRTPDTSIKLPQ 1140
L+HL+G+E+ + FS + D GT ++SKSKAL LLE++QFR+PD S++LPQ
Sbjct: 1081 LSHLFGREVTEAANEPFSSSTYPQDGLDYAGTSSISKSKALALLEILQFRSPDASMQLPQ 1140
Query: 1141 IVSNLKYELLTKDILGNPSTSQKGGIYHYSERGDRLIDLTSFCDKLWQKFNADNPQLNNV 1200
IVS+LKY+ L +DILGN TS G IY+YSERGDRLIDL+SF +KLWQK ++ P +++
Sbjct: 1141 IVSSLKYDSLVEDILGNRDTSVSGSIYYYSERGDRLIDLSSFSNKLWQKLHSGFPLVDSF 1200
Query: 1201 GREAELEEVKETIQQFLRWGWKYNKNLEEQAAQLHMLTSWSQTIEVTVSRRISSLENRSD 1260
AEL EV+ETIQQ L+WGWKYN+NLEEQAAQLHML WSQ +EV+ RRISSL+NRS+
Sbjct: 1201 PNVAELSEVRETIQQLLKWGWKYNRNLEEQAAQLHMLAGWSQIVEVSACRRISSLDNRSE 1260
Query: 1261 ILFQVLDASLSASASPDCSLKMAYLLCQVALTCMAKLRDERYSSPGGLNADSVSCLDIIM 1320
IL+++LDASLSASASPDCSLKMA++L QVALTC+AKLRD+R+S G L++D+V+CLD++M
Sbjct: 1261 ILYRILDASLSASASPDCSLKMAFVLTQVALTCIAKLRDDRFSFQGALSSDTVTCLDVMM 1320
Query: 1321 VKQISNGACHSILLKLIMAILRNESSEALRRRQYALLLSYLQYCQNMLDPDVPTTILQVL 1380
VK +S GACHS+L KL+MAILR+ESSE+LRRRQYALLLSY QYCQ+M+ DVPT+++Q L
Sbjct: 1321 VKHLSTGACHSVLFKLVMAILRHESSESLRRRQYALLLSYFQYCQHMIALDVPTSVVQFL 1380
Query: 1381 LQNEQDGEDVDLQKIDKDQAELAHANFSILRKEAQSILNVVIKDATQGSEPGKTISLYVL 1440
L NEQDGED+D+QKIDK+QA+LA ANF I++KEAQ IL++VIKDA+QGSE GKTISLYVL
Sbjct: 1381 LLNEQDGEDLDIQKIDKEQADLARANFFIIKKEAQGILDLVIKDASQGSEFGKTISLYVL 1440
Query: 1441 DALICIDHDRFFLNQLQNRGFLKSCLVSISNVSLQDGVHSFDSLQRACTLEAELALLSRI 1500
+AL+CIDH+R+FL+QLQ+RGF++SCL SISN+S QDG H +S QRACTLEAELALL RI
Sbjct: 1441 EALVCIDHERYFLSQLQSRGFIRSCLGSISNISYQDGTHLLESQQRACTLEAELALLLRI 1500
Query: 1501 SHKYGKFGAQLLFSTGALEHLDSCRAINLQGNLRWVDMKPHRDVAGNFNKQQAIVTPILR 1560
SHKYGK G Q+LFS GALEH+ SCRAI+ +GN+R VDMK DV N KQ+ I+T +LR
Sbjct: 1501 SHKYGKSGGQVLFSMGALEHIASCRAISFKGNMRRVDMKLQSDVGYNVQKQRTIITAVLR 1560
Query: 1561 LLFSMTSLVDTSEFFEVKNKIVREAIDFIKRHQRVFDQILGEDVSEADDSTLEQINLLVA 1620
L+F++TSLV+TSEFFE +NKIVR+ ++FIK HQ +FDQ+L ED ++ADD +EQI L V
Sbjct: 1561 LVFALTSLVETSEFFEGRNKIVRDVVEFIKGHQSLFDQLLREDFTQADDLLMEQIILAVG 1620
Query: 1621 SLSKVWPYEETDEYGFVQSLFQLMHSLFSRELDSRTPGPAVKLLKNRRSSELHSIQLNFS 1680
LSKVWP+EE D YGFVQ LF +M LF P +L + SEL QL FS
Sbjct: 1621 ILSKVWPFEENDGYGFVQGLFDMMSKLF-------IASPIKSILS--QGSELKLSQLRFS 1680
Query: 1681 LISYLYFLVTRKSLRLQVSGTSSSHNSPVRSQRPSLDLLGTLLNSTTITLEKAAEERLLL 1740
L SYLYFLVT+ SLRLQVS S +S + ++P+L LL +LL+ T +LE+AAE++ LL
Sbjct: 1681 LTSYLYFLVTKNSLRLQVS--DDSLDSSTKLRQPTLLLLASLLSHVTDSLERAAEKKSLL 1740
Query: 1741 LNKIRDINELSRQEVEEIIVLCLGEDFASLSDNIQRRRYIAMIEMCKIVGNKSEMITLLL 1800
L+KIRDINELSRQ+V+ II +C +++ + SDNI +RRYIAM+EMC+IVGN+ ++ITLLL
Sbjct: 1741 LHKIRDINELSRQDVDAIIKICDSQEYVTPSDNIHKRRYIAMVEMCQIVGNRDQLITLLL 1800
Query: 1801 PLAEYVLNVMLIHFQDSSVIPDGNANIKAIAYHGELESGHEISSLCGKLIPVLERLELLS 1860
LAE+VLN++LIH QD SV ++N + +Y + E++ LCGKL P ++RL LL+
Sbjct: 1801 QLAEHVLNIILIHLQDRSV----SSNERG-SYGSKSHIQQEVTDLCGKLSPTIDRLALLN 1836
Query: 1861 ENKIGENLKVFGRLVSSVKEVAIQK 1880
E K+G NLKVF RL ++VKE+AIQK
Sbjct: 1861 EGKVGHNLKVFQRLATTVKEMAIQK 1836
BLAST of CmoCh12G012390 vs. ExPASy Swiss-Prot
Match:
Q92621 (Nuclear pore complex protein Nup205 OS=Homo sapiens OX=9606 GN=NUP205 PE=1 SV=3)
HSP 1 Score: 236.5 bits (602), Expect = 2.5e-60
Identity = 400/1744 (22.94%), Positives = 704/1744 (40.37%), Query Frame = 0
Query: 38 PAFRSLLQFPSPKASDRAQVQSKEVR----RPDSSTITLDDQDVVITLKLSDDLHLNEIE 97
P F SL + P +VQ + T L +Q + LSD + E+
Sbjct: 48 PDFISLFKNPPKNVQQHEKVQKASTEGVAIQGQQGTRLLPEQLIKEAFILSDLFDIGELA 107
Query: 98 CVHLLVAA-HQEWALMGRDPSEIFHLAAGLWYTERRDLIMSLYTLL---RAVVLDQGLEA 157
V LL+A HQ+ G + A L++ +R + SL L+ R L
Sbjct: 108 AVELLLAGEHQQPHFPGLTRGLV---AVLLYWDGKRCIANSLKALIQSRRGKTWTLELSP 167
Query: 158 GLVSDIQRHLEDLINSGLRQRLISLIKELN-------REEPAGLGGPSCERYILDSRGAL 217
L S R ++L+ GL ++++L+ +++ + GLG + + D L
Sbjct: 168 ELASMTTRFTDELMEQGLTYKVLTLVSQIDVNNEFEKLQRERGLGSEKHRKEVSD----L 227
Query: 218 VERRAVVCRERLILGHCLVLSILVVRIGPKDVRDLFSVLKDCAAELNETKTPIKLQIVFS 277
++ CR+ L L +G +D L L+ E N + + L ++ +
Sbjct: 228 IKE----CRQS--LAESLFAWACQSPLGKEDTLLLIGHLERVTVEANGSLDAVNLALLMA 287
Query: 278 LLFSIIIAFVSDALSA---VPNKASILSSD---ASFRNEFQDTVMVSGNNPTVEGFVDAV 337
LL+ I+F+ + + ++ +L+ A+ + QD+ + + G V
Sbjct: 288 LLYCFDISFIEQSTEERDDMIHQLPLLTEKQYIATIHSRLQDSQLWK-----LPGLQATV 347
Query: 338 RFAWTVHLLLIHDMVDAREAVPSASPKDLDHLQSCLEVIFSHNAFQFMLQEVIQTAAYQN 397
R AW + L I + D +A + + ++ E+ + N F F+++ V+ + +
Sbjct: 348 RLAWALALRGISQLPDV-----TALAEFTEADEAMAELAIADNVFLFLMESVVVSEYFYQ 407
Query: 398 DDEDMTYMYNAYLHKLVSCFLSHPLARDKVKESKDRAMNTLSQFRATGSQDFMHDGDSSS 457
++ Y +H L++ FL+ L KVK+ ++ RA +H
Sbjct: 408 EE-----FYIRRVHNLITDFLA--LMPMKVKQLRN---------RADEDARMIHMSMQMG 467
Query: 458 HQASETLPSPFVSLLNFVSEIYRKEP-------------ELLSSNDVLWTFANFAGEDHT 517
++ +L L+ + E+Y+K P E L + ++ ++ A +
Sbjct: 468 NEPPISLRRDLEHLMLLIGELYKKNPFHLELALEYWCPTEPLQTPTIMGSYLGVAHQRPP 527
Query: 518 NFQTL-----------------VAFLNMLSTLACNEEGASRVFELL-----------QGK 577
Q + + +L ML LA + A F LL QG
Sbjct: 528 QRQVVLSKFVRQMGDLLPPTIYIPYLKMLQGLANGPQCAHYCFSLLKVNGSSHVENIQGA 587
Query: 578 AFRSVGWTTLFDCLSIYDEKFRQSLQTAGAL----LPE--FQEGDAKALVAYLNVLQKVV 637
V W F L +Y E R+ L +A ++ LP + + L+A+L + ++
Sbjct: 588 GGSPVSWEHFFHSLMLYHEHLRKDLPSADSVQYRHLPSRGITQKEQDGLIAFLQLTSTII 647
Query: 638 ---ENGNPV--ERKNWFPDIEPLFKLLSYENVPPYLKGALRNAIASFIQVSSDLKDIIWC 697
EN E W P + L L ++PP LK L +A+F + S ++ +W
Sbjct: 648 TWSENARLALCEHPQWTPVVVILGLLQC--SIPPVLKAELLKTLAAFGK-SPEIAASLWQ 707
Query: 698 YLEQYDLPVLVASHIQNGTKSITSQVYDMQFEVNEIEARQERYPSTISFLNLLNALIAKE 757
LE + V Q Q ++ E+NEIE+R E YP T +F L++ L+
Sbjct: 708 SLEYTQILQTVRIPSQR-------QAIGIEVELNEIESRCEEYPLTRAFCQLISTLVESS 767
Query: 758 --SDLSD--RGRRFVGIFRFIYDHVFGPFPQRAYADAAEKWQLVVACLQHFNMILKMYDI 817
S+L R F +F+ D VF F RAY AAEKW++ L+ F +L+ Y+
Sbjct: 768 FPSNLGAGLRPPGFDPYLQFLRDSVFLRFRTRAYRRAAEKWEVAEVVLEVFYKLLRDYEP 827
Query: 818 NEED-VDVVIDRSQSSMDTQSSSLQTQLPVLELLKDFMSGKSVFRNIMGILLPGVNSLIT 877
ED VD ++ + + + P L+ ++ + + +L GV L T
Sbjct: 828 QLEDFVDQFVELQGEEI------IAYKPPGFSLMYHLLNESPMLELALSLLEEGVKQLDT 887
Query: 878 ERTSQIYGQL-LEKSVELSLEIMILVLEKDLLLADYWRP-----LYQPLEVIL------S 937
+ G+ LEK+V+ L ++ L L+K+ L D R + PLE +L +
Sbjct: 888 --YAPFPGKKHLEKAVQHCLALLNLTLQKENLFMDLLRESQLALIVCPLEQLLQGINPRT 947
Query: 938 QDHSQIVALLEYVRY-DFHPKIQQLSIKIMSILSSRMVGLVQLL----LKSNTASSLVED 997
+ +V + Y+ + + +P++ S KI+ +S ++L+ + + L+
Sbjct: 948 KKADNVVNIARYLYHGNTNPELAFESAKILCCISCNSNIQIKLVGDFTHDQSISQKLMAG 1007
Query: 998 YASCLELRSEECHVIENSGD-----------DPGVLIMQLLIDNISRPAPNVTHLLLKFN 1057
+ CL+ E V G + + I+ LLI ++ PN+ LL F
Sbjct: 1008 FVECLDCEDAEEFVRLEEGSELEKKLVAIRHETRIHILNLLITSLECNPPNLALYLLGFE 1067
Query: 1058 LETSIERTILQPK----FHYSCLKVVLEILEKLSNPEVNAL-------LFEFGFQLLYEL 1117
L+ + T LQ +CL +L ILEK + + L E +Q++Y+L
Sbjct: 1068 LKKPVSTTNLQDPGVLGCPRTCLHAILNILEKGTEGRTGPVAVRESPQLAELCYQVIYQL 1127
Query: 1118 CLDPLTSGPVMDLLSNKKYYFFVKHLDTIGVVPLPKRNNHTLRVSSLHQRAWLLKLLAIE 1177
C TSGP M L + + F + +P +N +S L+Q +WL+K +IE
Sbjct: 1128 CACSDTSGPTMRYLRTSQDFLF----SQLQYLPF---SNKEYEISMLNQMSWLMKTASIE 1187
Query: 1178 LHAADLSSPIHREACQSILAHLYGQEIDVGSVPVFSLQHHVVDPG----------TRTMS 1237
L L+ R Q +L HL ++ V P + + D T T
Sbjct: 1188 LRVTSLNR--QRSHTQRLL-HLLLDDMPV--KPYSDGEGGIEDENRSVSGFLHFDTATKV 1247
Query: 1238 KSKALELLEVVQFRTPDTSIKLPQIVSNLKYELLTKDILGNPSTSQK-GGIYHYSERGDR 1297
+ K L +L+ + F S ++P E L D Q H + RG
Sbjct: 1248 RRKILNILDSIDF-----SQEIP--------EPLQLDFFDRAQIEQVIANCEHKNLRGQT 1307
Query: 1298 LIDLTSFCDKLWQKFNADNPQLNNVGREAELEEVKETIQQFLRWGWKYNKNLEEQAAQLH 1357
+ ++ L + NA L + + + E I L++ NK L+ A+ H
Sbjct: 1308 VCNVKLLHRVLVAEVNA----LQGMAAIGQRPLLMEEISTVLQYVVGRNKLLQCLHAKRH 1367
Query: 1358 MLTSWSQTIEVTVS---RRISSLENRS----DILFQVLDASLSASASPDCSLKMA-YLLC 1417
L SW Q +E+ ++ + + E+R DIL V D L A+ + +A +
Sbjct: 1368 ALESWRQLVEIILTACPQDLIQAEDRQLIIRDILQDVHDKILDDEAAQELMPVVAGAVFT 1427
Query: 1418 QVALTCMAKLRDERYSSP-GGLNADSVSCLD------------IIMVKQISNGACHSILL 1477
A A L +++ +S G A LD ++ I + + + IL
Sbjct: 1428 LTAHLSQAVLTEQKETSVLGPAEAHYAFMLDSCFTSPPPEENPLVGFASIGDSSLYIILK 1487
Query: 1478 KLIMAILRNESS-EALRRRQYALLLSYLQYCQNMLDPDVPTTILQVLLQNEQDGEDVDLQ 1537
KL+ IL+ + +R Y LL YLQ Q +PD + + + EDV
Sbjct: 1488 KLLDFILKTGGGFQRVRTHLYGSLLYYLQIAQRPDEPDTLEAAKKTMWERLTAPEDV--- 1547
Query: 1538 KIDKDQAELAHANFSILRKEAQSILNVVIKDATQGSEPGKTISLYVLDALICIDHDRFFL 1597
++L N +I+ +++ VV +DA G E G+ ++L +LD ++ +D + +L
Sbjct: 1548 -----FSKLQRENIAIIESYGAALMEVVCRDACDGHEIGRMLALALLDRIVSVDKQQQWL 1607
Query: 1598 NQLQNRGFLKSCLVSI--SNVSLQDGVHSFDSLQRAC-TLEAELALLSRISHKYGKFGAQ 1619
L N G+LK + S+ + +LQ + L +A T E+++A L+R++ + GA
Sbjct: 1608 LYLSNSGYLKVLVDSLVEDDRTLQSLLTPQPPLLKALYTYESKMAFLTRVAKI--QQGAL 1667
BLAST of CmoCh12G012390 vs. ExPASy TrEMBL
Match:
A0A6J1FI96 (nuclear pore complex protein NUP205 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111444365 PE=3 SV=1)
HSP 1 Score: 3654.4 bits (9475), Expect = 0.0e+00
Identity = 1882/1882 (100.00%), Positives = 1882/1882 (100.00%), Query Frame = 0
Query: 1 MLSSKQSLRIIESALLGPSPPSPSQRVELLHAIHNSMPAFRSLLQFPSPKASDRAQVQSK 60
MLSSKQSLRIIESALLGPSPPSPSQRVELLHAIHNSMPAFRSLLQFPSPKASDRAQVQSK
Sbjct: 1 MLSSKQSLRIIESALLGPSPPSPSQRVELLHAIHNSMPAFRSLLQFPSPKASDRAQVQSK 60
Query: 61 EVRRPDSSTITLDDQDVVITLKLSDDLHLNEIECVHLLVAAHQEWALMGRDPSEIFHLAA 120
EVRRPDSSTITLDDQDVVITLKLSDDLHLNEIECVHLLVAAHQEWALMGRDPSEIFHLAA
Sbjct: 61 EVRRPDSSTITLDDQDVVITLKLSDDLHLNEIECVHLLVAAHQEWALMGRDPSEIFHLAA 120
Query: 121 GLWYTERRDLIMSLYTLLRAVVLDQGLEAGLVSDIQRHLEDLINSGLRQRLISLIKELNR 180
GLWYTERRDLIMSLYTLLRAVVLDQGLEAGLVSDIQRHLEDLINSGLRQRLISLIKELNR
Sbjct: 121 GLWYTERRDLIMSLYTLLRAVVLDQGLEAGLVSDIQRHLEDLINSGLRQRLISLIKELNR 180
Query: 181 EEPAGLGGPSCERYILDSRGALVERRAVVCRERLILGHCLVLSILVVRIGPKDVRDLFSV 240
EEPAGLGGPSCERYILDSRGALVERRAVVCRERLILGHCLVLSILVVRIGPKDVRDLFSV
Sbjct: 181 EEPAGLGGPSCERYILDSRGALVERRAVVCRERLILGHCLVLSILVVRIGPKDVRDLFSV 240
Query: 241 LKDCAAELNETKTPIKLQIVFSLLFSIIIAFVSDALSAVPNKASILSSDASFRNEFQDTV 300
LKDCAAELNETKTPIKLQIVFSLLFSIIIAFVSDALSAVPNKASILSSDASFRNEFQDTV
Sbjct: 241 LKDCAAELNETKTPIKLQIVFSLLFSIIIAFVSDALSAVPNKASILSSDASFRNEFQDTV 300
Query: 301 MVSGNNPTVEGFVDAVRFAWTVHLLLIHDMVDAREAVPSASPKDLDHLQSCLEVIFSHNA 360
MVSGNNPTVEGFVDAVRFAWTVHLLLIHDMVDAREAVPSASPKDLDHLQSCLEVIFSHNA
Sbjct: 301 MVSGNNPTVEGFVDAVRFAWTVHLLLIHDMVDAREAVPSASPKDLDHLQSCLEVIFSHNA 360
Query: 361 FQFMLQEVIQTAAYQNDDEDMTYMYNAYLHKLVSCFLSHPLARDKVKESKDRAMNTLSQF 420
FQFMLQEVIQTAAYQNDDEDMTYMYNAYLHKLVSCFLSHPLARDKVKESKDRAMNTLSQF
Sbjct: 361 FQFMLQEVIQTAAYQNDDEDMTYMYNAYLHKLVSCFLSHPLARDKVKESKDRAMNTLSQF 420
Query: 421 RATGSQDFMHDGDSSSHQASETLPSPFVSLLNFVSEIYRKEPELLSSNDVLWTFANFAGE 480
RATGSQDFMHDGDSSSHQASETLPSPFVSLLNFVSEIYRKEPELLSSNDVLWTFANFAGE
Sbjct: 421 RATGSQDFMHDGDSSSHQASETLPSPFVSLLNFVSEIYRKEPELLSSNDVLWTFANFAGE 480
Query: 481 DHTNFQTLVAFLNMLSTLACNEEGASRVFELLQGKAFRSVGWTTLFDCLSIYDEKFRQSL 540
DHTNFQTLVAFLNMLSTLACNEEGASRVFELLQGKAFRSVGWTTLFDCLSIYDEKFRQSL
Sbjct: 481 DHTNFQTLVAFLNMLSTLACNEEGASRVFELLQGKAFRSVGWTTLFDCLSIYDEKFRQSL 540
Query: 541 QTAGALLPEFQEGDAKALVAYLNVLQKVVENGNPVERKNWFPDIEPLFKLLSYENVPPYL 600
QTAGALLPEFQEGDAKALVAYLNVLQKVVENGNPVERKNWFPDIEPLFKLLSYENVPPYL
Sbjct: 541 QTAGALLPEFQEGDAKALVAYLNVLQKVVENGNPVERKNWFPDIEPLFKLLSYENVPPYL 600
Query: 601 KGALRNAIASFIQVSSDLKDIIWCYLEQYDLPVLVASHIQNGTKSITSQVYDMQFEVNEI 660
KGALRNAIASFIQVSSDLKDIIWCYLEQYDLPVLVASHIQNGTKSITSQVYDMQFEVNEI
Sbjct: 601 KGALRNAIASFIQVSSDLKDIIWCYLEQYDLPVLVASHIQNGTKSITSQVYDMQFEVNEI 660
Query: 661 EARQERYPSTISFLNLLNALIAKESDLSDRGRRFVGIFRFIYDHVFGPFPQRAYADAAEK 720
EARQERYPSTISFLNLLNALIAKESDLSDRGRRFVGIFRFIYDHVFGPFPQRAYADAAEK
Sbjct: 661 EARQERYPSTISFLNLLNALIAKESDLSDRGRRFVGIFRFIYDHVFGPFPQRAYADAAEK 720
Query: 721 WQLVVACLQHFNMILKMYDINEEDVDVVIDRSQSSMDTQSSSLQTQLPVLELLKDFMSGK 780
WQLVVACLQHFNMILKMYDINEEDVDVVIDRSQSSMDTQSSSLQTQLPVLELLKDFMSGK
Sbjct: 721 WQLVVACLQHFNMILKMYDINEEDVDVVIDRSQSSMDTQSSSLQTQLPVLELLKDFMSGK 780
Query: 781 SVFRNIMGILLPGVNSLITERTSQIYGQLLEKSVELSLEIMILVLEKDLLLADYWRPLYQ 840
SVFRNIMGILLPGVNSLITERTSQIYGQLLEKSVELSLEIMILVLEKDLLLADYWRPLYQ
Sbjct: 781 SVFRNIMGILLPGVNSLITERTSQIYGQLLEKSVELSLEIMILVLEKDLLLADYWRPLYQ 840
Query: 841 PLEVILSQDHSQIVALLEYVRYDFHPKIQQLSIKIMSILSSRMVGLVQLLLKSNTASSLV 900
PLEVILSQDHSQIVALLEYVRYDFHPKIQQLSIKIMSILSSRMVGLVQLLLKSNTASSLV
Sbjct: 841 PLEVILSQDHSQIVALLEYVRYDFHPKIQQLSIKIMSILSSRMVGLVQLLLKSNTASSLV 900
Query: 901 EDYASCLELRSEECHVIENSGDDPGVLIMQLLIDNISRPAPNVTHLLLKFNLETSIERTI 960
EDYASCLELRSEECHVIENSGDDPGVLIMQLLIDNISRPAPNVTHLLLKFNLETSIERTI
Sbjct: 901 EDYASCLELRSEECHVIENSGDDPGVLIMQLLIDNISRPAPNVTHLLLKFNLETSIERTI 960
Query: 961 LQPKFHYSCLKVVLEILEKLSNPEVNALLFEFGFQLLYELCLDPLTSGPVMDLLSNKKYY 1020
LQPKFHYSCLKVVLEILEKLSNPEVNALLFEFGFQLLYELCLDPLTSGPVMDLLSNKKYY
Sbjct: 961 LQPKFHYSCLKVVLEILEKLSNPEVNALLFEFGFQLLYELCLDPLTSGPVMDLLSNKKYY 1020
Query: 1021 FFVKHLDTIGVVPLPKRNNHTLRVSSLHQRAWLLKLLAIELHAADLSSPIHREACQSILA 1080
FFVKHLDTIGVVPLPKRNNHTLRVSSLHQRAWLLKLLAIELHAADLSSPIHREACQSILA
Sbjct: 1021 FFVKHLDTIGVVPLPKRNNHTLRVSSLHQRAWLLKLLAIELHAADLSSPIHREACQSILA 1080
Query: 1081 HLYGQEIDVGSVPVFSLQHHVVDPGTRTMSKSKALELLEVVQFRTPDTSIKLPQIVSNLK 1140
HLYGQEIDVGSVPVFSLQHHVVDPGTRTMSKSKALELLEVVQFRTPDTSIKLPQIVSNLK
Sbjct: 1081 HLYGQEIDVGSVPVFSLQHHVVDPGTRTMSKSKALELLEVVQFRTPDTSIKLPQIVSNLK 1140
Query: 1141 YELLTKDILGNPSTSQKGGIYHYSERGDRLIDLTSFCDKLWQKFNADNPQLNNVGREAEL 1200
YELLTKDILGNPSTSQKGGIYHYSERGDRLIDLTSFCDKLWQKFNADNPQLNNVGREAEL
Sbjct: 1141 YELLTKDILGNPSTSQKGGIYHYSERGDRLIDLTSFCDKLWQKFNADNPQLNNVGREAEL 1200
Query: 1201 EEVKETIQQFLRWGWKYNKNLEEQAAQLHMLTSWSQTIEVTVSRRISSLENRSDILFQVL 1260
EEVKETIQQFLRWGWKYNKNLEEQAAQLHMLTSWSQTIEVTVSRRISSLENRSDILFQVL
Sbjct: 1201 EEVKETIQQFLRWGWKYNKNLEEQAAQLHMLTSWSQTIEVTVSRRISSLENRSDILFQVL 1260
Query: 1261 DASLSASASPDCSLKMAYLLCQVALTCMAKLRDERYSSPGGLNADSVSCLDIIMVKQISN 1320
DASLSASASPDCSLKMAYLLCQVALTCMAKLRDERYSSPGGLNADSVSCLDIIMVKQISN
Sbjct: 1261 DASLSASASPDCSLKMAYLLCQVALTCMAKLRDERYSSPGGLNADSVSCLDIIMVKQISN 1320
Query: 1321 GACHSILLKLIMAILRNESSEALRRRQYALLLSYLQYCQNMLDPDVPTTILQVLLQNEQD 1380
GACHSILLKLIMAILRNESSEALRRRQYALLLSYLQYCQNMLDPDVPTTILQVLLQNEQD
Sbjct: 1321 GACHSILLKLIMAILRNESSEALRRRQYALLLSYLQYCQNMLDPDVPTTILQVLLQNEQD 1380
Query: 1381 GEDVDLQKIDKDQAELAHANFSILRKEAQSILNVVIKDATQGSEPGKTISLYVLDALICI 1440
GEDVDLQKIDKDQAELAHANFSILRKEAQSILNVVIKDATQGSEPGKTISLYVLDALICI
Sbjct: 1381 GEDVDLQKIDKDQAELAHANFSILRKEAQSILNVVIKDATQGSEPGKTISLYVLDALICI 1440
Query: 1441 DHDRFFLNQLQNRGFLKSCLVSISNVSLQDGVHSFDSLQRACTLEAELALLSRISHKYGK 1500
DHDRFFLNQLQNRGFLKSCLVSISNVSLQDGVHSFDSLQRACTLEAELALLSRISHKYGK
Sbjct: 1441 DHDRFFLNQLQNRGFLKSCLVSISNVSLQDGVHSFDSLQRACTLEAELALLSRISHKYGK 1500
Query: 1501 FGAQLLFSTGALEHLDSCRAINLQGNLRWVDMKPHRDVAGNFNKQQAIVTPILRLLFSMT 1560
FGAQLLFSTGALEHLDSCRAINLQGNLRWVDMKPHRDVAGNFNKQQAIVTPILRLLFSMT
Sbjct: 1501 FGAQLLFSTGALEHLDSCRAINLQGNLRWVDMKPHRDVAGNFNKQQAIVTPILRLLFSMT 1560
Query: 1561 SLVDTSEFFEVKNKIVREAIDFIKRHQRVFDQILGEDVSEADDSTLEQINLLVASLSKVW 1620
SLVDTSEFFEVKNKIVREAIDFIKRHQRVFDQILGEDVSEADDSTLEQINLLVASLSKVW
Sbjct: 1561 SLVDTSEFFEVKNKIVREAIDFIKRHQRVFDQILGEDVSEADDSTLEQINLLVASLSKVW 1620
Query: 1621 PYEETDEYGFVQSLFQLMHSLFSRELDSRTPGPAVKLLKNRRSSELHSIQLNFSLISYLY 1680
PYEETDEYGFVQSLFQLMHSLFSRELDSRTPGPAVKLLKNRRSSELHSIQLNFSLISYLY
Sbjct: 1621 PYEETDEYGFVQSLFQLMHSLFSRELDSRTPGPAVKLLKNRRSSELHSIQLNFSLISYLY 1680
Query: 1681 FLVTRKSLRLQVSGTSSSHNSPVRSQRPSLDLLGTLLNSTTITLEKAAEERLLLLNKIRD 1740
FLVTRKSLRLQVSGTSSSHNSPVRSQRPSLDLLGTLLNSTTITLEKAAEERLLLLNKIRD
Sbjct: 1681 FLVTRKSLRLQVSGTSSSHNSPVRSQRPSLDLLGTLLNSTTITLEKAAEERLLLLNKIRD 1740
Query: 1741 INELSRQEVEEIIVLCLGEDFASLSDNIQRRRYIAMIEMCKIVGNKSEMITLLLPLAEYV 1800
INELSRQEVEEIIVLCLGEDFASLSDNIQRRRYIAMIEMCKIVGNKSEMITLLLPLAEYV
Sbjct: 1741 INELSRQEVEEIIVLCLGEDFASLSDNIQRRRYIAMIEMCKIVGNKSEMITLLLPLAEYV 1800
Query: 1801 LNVMLIHFQDSSVIPDGNANIKAIAYHGELESGHEISSLCGKLIPVLERLELLSENKIGE 1860
LNVMLIHFQDSSVIPDGNANIKAIAYHGELESGHEISSLCGKLIPVLERLELLSENKIGE
Sbjct: 1801 LNVMLIHFQDSSVIPDGNANIKAIAYHGELESGHEISSLCGKLIPVLERLELLSENKIGE 1860
Query: 1861 NLKVFGRLVSSVKEVAIQKLDV 1883
NLKVFGRLVSSVKEVAIQKLDV
Sbjct: 1861 NLKVFGRLVSSVKEVAIQKLDV 1882
BLAST of CmoCh12G012390 vs. ExPASy TrEMBL
Match:
A0A6J1FDF5 (nuclear pore complex protein NUP205 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111444365 PE=3 SV=1)
HSP 1 Score: 3647.8 bits (9458), Expect = 0.0e+00
Identity = 1881/1882 (99.95%), Positives = 1881/1882 (99.95%), Query Frame = 0
Query: 1 MLSSKQSLRIIESALLGPSPPSPSQRVELLHAIHNSMPAFRSLLQFPSPKASDRAQVQSK 60
MLSSKQSLRIIESALLGPSPPSPSQRVELLHAIHNSMPAFRSLLQFPSPKASDRAQVQSK
Sbjct: 1 MLSSKQSLRIIESALLGPSPPSPSQRVELLHAIHNSMPAFRSLLQFPSPKASDRAQVQSK 60
Query: 61 EVRRPDSSTITLDDQDVVITLKLSDDLHLNEIECVHLLVAAHQEWALMGRDPSEIFHLAA 120
EVRRPDSSTITLDDQDVVITLKLSDDLHLNEIECVHLLVAAHQEWALMGRDPSEIFHLAA
Sbjct: 61 EVRRPDSSTITLDDQDVVITLKLSDDLHLNEIECVHLLVAAHQEWALMGRDPSEIFHLAA 120
Query: 121 GLWYTERRDLIMSLYTLLRAVVLDQGLEAGLVSDIQRHLEDLINSGLRQRLISLIKELNR 180
GLWYTERRDLIMSLYTLLRAVVLDQGLEAGLVSDIQRHLEDLINSGLRQRLISLIKELNR
Sbjct: 121 GLWYTERRDLIMSLYTLLRAVVLDQGLEAGLVSDIQRHLEDLINSGLRQRLISLIKELNR 180
Query: 181 EEPAGLGGPSCERYILDSRGALVERRAVVCRERLILGHCLVLSILVVRIGPKDVRDLFSV 240
EEPAGLGGPSCERYILDSRGALVERRAVVCRERLILGHCLVLSILVVRIGPKDVRDLFSV
Sbjct: 181 EEPAGLGGPSCERYILDSRGALVERRAVVCRERLILGHCLVLSILVVRIGPKDVRDLFSV 240
Query: 241 LKDCAAELNETKTPIKLQIVFSLLFSIIIAFVSDALSAVPNKASILSSDASFRNEFQDTV 300
LKDCAAELNETKTPIKLQIVFSLLFSIIIAFVSDALSAVPNKASILSSDASFRNEFQDTV
Sbjct: 241 LKDCAAELNETKTPIKLQIVFSLLFSIIIAFVSDALSAVPNKASILSSDASFRNEFQDTV 300
Query: 301 MVSGNNPTVEGFVDAVRFAWTVHLLLIHDMVDAREAVPSASPKDLDHLQSCLEVIFSHNA 360
MVSGNNPTVEGFVDAVRFAWTVHLLLIHDMVDAREAVPSASPKDLDHLQSCLEVIFSHNA
Sbjct: 301 MVSGNNPTVEGFVDAVRFAWTVHLLLIHDMVDAREAVPSASPKDLDHLQSCLEVIFSHNA 360
Query: 361 FQFMLQEVIQTAAYQNDDEDMTYMYNAYLHKLVSCFLSHPLARDKVKESKDRAMNTLSQF 420
FQFMLQEVIQTAAYQNDDEDMTYMYNAYLHKLVSCFLSHPLARDKVKESKDRAMNTLSQF
Sbjct: 361 FQFMLQEVIQTAAYQNDDEDMTYMYNAYLHKLVSCFLSHPLARDKVKESKDRAMNTLSQF 420
Query: 421 RATGSQDFMHDGDSSSHQASETLPSPFVSLLNFVSEIYRKEPELLSSNDVLWTFANFAGE 480
RATGSQDFMHDGDSSSHQASETLPSPFVSLLNFVSEIYRKEPELLSSNDVLWTFANFAGE
Sbjct: 421 RATGSQDFMHDGDSSSHQASETLPSPFVSLLNFVSEIYRKEPELLSSNDVLWTFANFAGE 480
Query: 481 DHTNFQTLVAFLNMLSTLACNEEGASRVFELLQGKAFRSVGWTTLFDCLSIYDEKFRQSL 540
DHTNFQTLVAFLNMLSTLACNEEGASRVFELLQGKAFRSVGWTTLFDCLSIYDEKFRQSL
Sbjct: 481 DHTNFQTLVAFLNMLSTLACNEEGASRVFELLQGKAFRSVGWTTLFDCLSIYDEKFRQSL 540
Query: 541 QTAGALLPEFQEGDAKALVAYLNVLQKVVENGNPVERKNWFPDIEPLFKLLSYENVPPYL 600
QTAGALLPEFQEGDAKALVAYLNVLQKVVENGNPVERKNWFPDIEPLFKLLSYENVPPYL
Sbjct: 541 QTAGALLPEFQEGDAKALVAYLNVLQKVVENGNPVERKNWFPDIEPLFKLLSYENVPPYL 600
Query: 601 KGALRNAIASFIQVSSDLKDIIWCYLEQYDLPVLVASHIQNGTKSITSQVYDMQFEVNEI 660
KGALRNAIASFIQVSSDLKDIIWCYLEQYDLPVLVASHIQNGTKSITSQVYDMQFEVNEI
Sbjct: 601 KGALRNAIASFIQVSSDLKDIIWCYLEQYDLPVLVASHIQNGTKSITSQVYDMQFEVNEI 660
Query: 661 EARQERYPSTISFLNLLNALIAKESDLSDRGRRFVGIFRFIYDHVFGPFPQRAYADAAEK 720
EARQERYPSTISFLNLLNALIAKESDLSDRGRRFVGIFRFIYDHVFGPFPQRAYADAAEK
Sbjct: 661 EARQERYPSTISFLNLLNALIAKESDLSDRGRRFVGIFRFIYDHVFGPFPQRAYADAAEK 720
Query: 721 WQLVVACLQHFNMILKMYDINEEDVDVVIDRSQSSMDTQSSSLQTQLPVLELLKDFMSGK 780
WQLVVACLQHFNMILKMYDINEEDVDVVIDRSQSSMDTQSSSLQTQLPVLELLKDFMSGK
Sbjct: 721 WQLVVACLQHFNMILKMYDINEEDVDVVIDRSQSSMDTQSSSLQTQLPVLELLKDFMSGK 780
Query: 781 SVFRNIMGILLPGVNSLITERTSQIYGQLLEKSVELSLEIMILVLEKDLLLADYWRPLYQ 840
SVFRNIMGILLPGVNSLITERTSQIYGQLLEKSVELSLEIMILVLEKDLLLADYWRPLYQ
Sbjct: 781 SVFRNIMGILLPGVNSLITERTSQIYGQLLEKSVELSLEIMILVLEKDLLLADYWRPLYQ 840
Query: 841 PLEVILSQDHSQIVALLEYVRYDFHPKIQQLSIKIMSILSSRMVGLVQLLLKSNTASSLV 900
PLEVILSQDHSQIVALLEYVRYDFHPKIQQLSIKIMSILSSRMVGLVQLLLKSNTASSLV
Sbjct: 841 PLEVILSQDHSQIVALLEYVRYDFHPKIQQLSIKIMSILSSRMVGLVQLLLKSNTASSLV 900
Query: 901 EDYASCLELRSEECHVIENSGDDPGVLIMQLLIDNISRPAPNVTHLLLKFNLETSIERTI 960
EDYASCLELRSEECHVIENSGDDPGVLIMQLLIDNISRPAPNVTHLLLKFNLETSIERTI
Sbjct: 901 EDYASCLELRSEECHVIENSGDDPGVLIMQLLIDNISRPAPNVTHLLLKFNLETSIERTI 960
Query: 961 LQPKFHYSCLKVVLEILEKLSNPEVNALLFEFGFQLLYELCLDPLTSGPVMDLLSNKKYY 1020
LQPKFHYSCLKVVLEILEKLSNPEVNALLFEFGFQLLYELCLDPLTSGPVMDLLSNKKYY
Sbjct: 961 LQPKFHYSCLKVVLEILEKLSNPEVNALLFEFGFQLLYELCLDPLTSGPVMDLLSNKKYY 1020
Query: 1021 FFVKHLDTIGVVPLPKRNNHTLRVSSLHQRAWLLKLLAIELHAADLSSPIHREACQSILA 1080
FFVKHLDTIGVVPLPKRNNHTLRVSSLHQRAWLLKLLAIELHAADLSSPIHREACQSILA
Sbjct: 1021 FFVKHLDTIGVVPLPKRNNHTLRVSSLHQRAWLLKLLAIELHAADLSSPIHREACQSILA 1080
Query: 1081 HLYGQEIDVGSVPVFSLQHHVVDPGTRTMSKSKALELLEVVQFRTPDTSIKLPQIVSNLK 1140
HLYGQEIDVGSVPVFSLQHHVVDPGTRTMSKSKALELLEVVQFRTPDTSIKLPQIVSNLK
Sbjct: 1081 HLYGQEIDVGSVPVFSLQHHVVDPGTRTMSKSKALELLEVVQFRTPDTSIKLPQIVSNLK 1140
Query: 1141 YELLTKDILGNPSTSQKGGIYHYSERGDRLIDLTSFCDKLWQKFNADNPQLNNVGREAEL 1200
YELLTKDILGNPSTSQKGGIYHYSERGDRLIDLTSFCDKLWQKFNADNPQLNNVGREAEL
Sbjct: 1141 YELLTKDILGNPSTSQKGGIYHYSERGDRLIDLTSFCDKLWQKFNADNPQLNNVGREAEL 1200
Query: 1201 EEVKETIQQFLRWGWKYNKNLEEQAAQLHMLTSWSQTIEVTVSRRISSLENRSDILFQVL 1260
EEVKETIQQFLRWGWKYNKNLEEQAAQLHMLTSWSQTIEVTVSRRISSLENRSDILFQVL
Sbjct: 1201 EEVKETIQQFLRWGWKYNKNLEEQAAQLHMLTSWSQTIEVTVSRRISSLENRSDILFQVL 1260
Query: 1261 DASLSASASPDCSLKMAYLLCQVALTCMAKLRDERYSSPGGLNADSVSCLDIIMVKQISN 1320
DASLSASASPDCSLKMAYLLCQVALTCMAKLRDERYSSPGGLNADSVSCLDIIMVKQISN
Sbjct: 1261 DASLSASASPDCSLKMAYLLCQVALTCMAKLRDERYSSPGGLNADSVSCLDIIMVKQISN 1320
Query: 1321 GACHSILLKLIMAILRNESSEALRRRQYALLLSYLQYCQNMLDPDVPTTILQVLLQNEQD 1380
GACHSILLKLIMAILRNESSEALRRRQYALLLSYLQYCQNMLDPDVPTTILQVLLQNEQD
Sbjct: 1321 GACHSILLKLIMAILRNESSEALRRRQYALLLSYLQYCQNMLDPDVPTTILQVLLQNEQD 1380
Query: 1381 GEDVDLQKIDKDQAELAHANFSILRKEAQSILNVVIKDATQGSEPGKTISLYVLDALICI 1440
GEDVDLQKIDKDQAELAHANFSILRKEAQSILNVVIKDATQGSEPGKTISLYVLDALICI
Sbjct: 1381 GEDVDLQKIDKDQAELAHANFSILRKEAQSILNVVIKDATQGSEPGKTISLYVLDALICI 1440
Query: 1441 DHDRFFLNQLQNRGFLKSCLVSISNVSLQDGVHSFDSLQRACTLEAELALLSRISHKYGK 1500
DHDRFFLNQLQNRGFLKSCLVSISNVSLQDGVHSFDSLQRACTLEAELALLSRISHKYGK
Sbjct: 1441 DHDRFFLNQLQNRGFLKSCLVSISNVSLQDGVHSFDSLQRACTLEAELALLSRISHKYGK 1500
Query: 1501 FGAQLLFSTGALEHLDSCRAINLQGNLRWVDMKPHRDVAGNFNKQQAIVTPILRLLFSMT 1560
FGAQLLFSTGALEHLDSCRAINLQGNLRWVDMKPHRDVAGNFNKQQAIVTPILRLLFSMT
Sbjct: 1501 FGAQLLFSTGALEHLDSCRAINLQGNLRWVDMKPHRDVAGNFNKQQAIVTPILRLLFSMT 1560
Query: 1561 SLVDTSEFFEVKNKIVREAIDFIKRHQRVFDQILGEDVSEADDSTLEQINLLVASLSKVW 1620
SLVDTSEFFEVKNKIVREAIDFIKRHQRVFDQILGEDVSEADDSTLEQINLLVASLSKVW
Sbjct: 1561 SLVDTSEFFEVKNKIVREAIDFIKRHQRVFDQILGEDVSEADDSTLEQINLLVASLSKVW 1620
Query: 1621 PYEETDEYGFVQSLFQLMHSLFSRELDSRTPGPAVKLLKNRRSSELHSIQLNFSLISYLY 1680
PYEETDEYGFVQSLFQLMHSLFSRELDSRTPGPAVKLLKNRRSSELHSIQLNFSLISYLY
Sbjct: 1621 PYEETDEYGFVQSLFQLMHSLFSRELDSRTPGPAVKLLKNRRSSELHSIQLNFSLISYLY 1680
Query: 1681 FLVTRKSLRLQVSGTSSSHNSPVRSQRPSLDLLGTLLNSTTITLEKAAEERLLLLNKIRD 1740
FLVTRKSLRLQVSGTSSSHNSPVRSQRPSLDLLGTLLNSTTITLEKAAEERLLLLNKIRD
Sbjct: 1681 FLVTRKSLRLQVSGTSSSHNSPVRSQRPSLDLLGTLLNSTTITLEKAAEERLLLLNKIRD 1740
Query: 1741 INELSRQEVEEIIVLCLGEDFASLSDNIQRRRYIAMIEMCKIVGNKSEMITLLLPLAEYV 1800
INELSRQEVEEIIVLCLGEDFASLSDNIQRRRYIAMIEMCKIVGNKSEMITLLLPLAEYV
Sbjct: 1741 INELSRQEVEEIIVLCLGEDFASLSDNIQRRRYIAMIEMCKIVGNKSEMITLLLPLAEYV 1800
Query: 1801 LNVMLIHFQDSSVIPDGNANIKAIAYHGELESGHEISSLCGKLIPVLERLELLSENKIGE 1860
LNVMLIHFQD SVIPDGNANIKAIAYHGELESGHEISSLCGKLIPVLERLELLSENKIGE
Sbjct: 1801 LNVMLIHFQD-SVIPDGNANIKAIAYHGELESGHEISSLCGKLIPVLERLELLSENKIGE 1860
Query: 1861 NLKVFGRLVSSVKEVAIQKLDV 1883
NLKVFGRLVSSVKEVAIQKLDV
Sbjct: 1861 NLKVFGRLVSSVKEVAIQKLDV 1881
BLAST of CmoCh12G012390 vs. ExPASy TrEMBL
Match:
A0A6J1HLX9 (nuclear pore complex protein NUP205 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111465407 PE=3 SV=1)
HSP 1 Score: 3617.0 bits (9378), Expect = 0.0e+00
Identity = 1862/1882 (98.94%), Positives = 1873/1882 (99.52%), Query Frame = 0
Query: 1 MLSSKQSLRIIESALLGPSPPSPSQRVELLHAIHNSMPAFRSLLQFPSPKASDRAQVQSK 60
MLSSKQSLRIIESALLGP+PPSPSQRVELLHAIHNSMPAFRSLLQFPSPKASDRAQVQSK
Sbjct: 1 MLSSKQSLRIIESALLGPAPPSPSQRVELLHAIHNSMPAFRSLLQFPSPKASDRAQVQSK 60
Query: 61 EVRRPDSSTITLDDQDVVITLKLSDDLHLNEIECVHLLVAAHQEWALMGRDPSEIFHLAA 120
EVRRPDSSTITLDDQDV ITLKLSDDLHLNEIECVHLLVAAHQEWALMGRDPSEIFHLAA
Sbjct: 61 EVRRPDSSTITLDDQDVEITLKLSDDLHLNEIECVHLLVAAHQEWALMGRDPSEIFHLAA 120
Query: 121 GLWYTERRDLIMSLYTLLRAVVLDQGLEAGLVSDIQRHLEDLINSGLRQRLISLIKELNR 180
GLWYTERRDLIMSLYTLLRAVVLDQGLEAGLVSDIQR+LEDLI SGLRQRLISLIKELNR
Sbjct: 121 GLWYTERRDLIMSLYTLLRAVVLDQGLEAGLVSDIQRNLEDLIISGLRQRLISLIKELNR 180
Query: 181 EEPAGLGGPSCERYILDSRGALVERRAVVCRERLILGHCLVLSILVVRIGPKDVRDLFSV 240
EEPAGLGGPSCERYILDSRGALVERRAVVCRERLILGHCLVLSILVVRIGPKDV+DLFSV
Sbjct: 181 EEPAGLGGPSCERYILDSRGALVERRAVVCRERLILGHCLVLSILVVRIGPKDVKDLFSV 240
Query: 241 LKDCAAELNETKTPIKLQIVFSLLFSIIIAFVSDALSAVPNKASILSSDASFRNEFQDTV 300
LKDCAAELNETKTPIKLQIVFSLLFSIIIAFVSDALSAVPNKASILSSDASFRNEFQDTV
Sbjct: 241 LKDCAAELNETKTPIKLQIVFSLLFSIIIAFVSDALSAVPNKASILSSDASFRNEFQDTV 300
Query: 301 MVSGNNPTVEGFVDAVRFAWTVHLLLIHDMVDAREAVPSASPKDLDHLQSCLEVIFSHNA 360
MVSGNNPTVEGFVDAVRFAWTVHLLLIHDMVDAREAVPSASPKDLDHLQSCLEVIFSHNA
Sbjct: 301 MVSGNNPTVEGFVDAVRFAWTVHLLLIHDMVDAREAVPSASPKDLDHLQSCLEVIFSHNA 360
Query: 361 FQFMLQEVIQTAAYQNDDEDMTYMYNAYLHKLVSCFLSHPLARDKVKESKDRAMNTLSQF 420
FQFMLQEVIQTAAYQNDDEDMTYMYNAYLHKLVSCFLSHPLARDKVKESKDRAMNTLSQF
Sbjct: 361 FQFMLQEVIQTAAYQNDDEDMTYMYNAYLHKLVSCFLSHPLARDKVKESKDRAMNTLSQF 420
Query: 421 RATGSQDFMHDGDSSSHQASETLPSPFVSLLNFVSEIYRKEPELLSSNDVLWTFANFAGE 480
RATGSQDFMHD DSSSHQASETLPSPFVSLLNFVSEIYRKEPELLSSNDVLWTFANFAGE
Sbjct: 421 RATGSQDFMHDSDSSSHQASETLPSPFVSLLNFVSEIYRKEPELLSSNDVLWTFANFAGE 480
Query: 481 DHTNFQTLVAFLNMLSTLACNEEGASRVFELLQGKAFRSVGWTTLFDCLSIYDEKFRQSL 540
DHTNFQTLVAFLNMLSTLACNEEGASRVFELL+GKAFRSVGWTTLFDCLSIYDEKFRQSL
Sbjct: 481 DHTNFQTLVAFLNMLSTLACNEEGASRVFELLKGKAFRSVGWTTLFDCLSIYDEKFRQSL 540
Query: 541 QTAGALLPEFQEGDAKALVAYLNVLQKVVENGNPVERKNWFPDIEPLFKLLSYENVPPYL 600
QTAGALLPEFQEGDAKALVAYLNVLQKVVENGNPVERKNWFPDIEPLFKLLSYENVPPYL
Sbjct: 541 QTAGALLPEFQEGDAKALVAYLNVLQKVVENGNPVERKNWFPDIEPLFKLLSYENVPPYL 600
Query: 601 KGALRNAIASFIQVSSDLKDIIWCYLEQYDLPVLVASHIQNGTKSITSQVYDMQFEVNEI 660
KGALRNAI SFIQVSSDLKDIIW YLEQYDLPVLVASHIQNGTKSITSQVYDMQFEVNEI
Sbjct: 601 KGALRNAITSFIQVSSDLKDIIWRYLEQYDLPVLVASHIQNGTKSITSQVYDMQFEVNEI 660
Query: 661 EARQERYPSTISFLNLLNALIAKESDLSDRGRRFVGIFRFIYDHVFGPFPQRAYADAAEK 720
EARQERYPSTISFLNLLNALIAKESDLSDRGRRFVGIFRFIYDHVFGPFPQRAYADAAEK
Sbjct: 661 EARQERYPSTISFLNLLNALIAKESDLSDRGRRFVGIFRFIYDHVFGPFPQRAYADAAEK 720
Query: 721 WQLVVACLQHFNMILKMYDINEEDVDVVIDRSQSSMDTQSSSLQTQLPVLELLKDFMSGK 780
WQLVVACLQHFNMILKMYDINEEDVDVVIDRSQSSMDTQSSSLQTQLPV ELLKDFMSGK
Sbjct: 721 WQLVVACLQHFNMILKMYDINEEDVDVVIDRSQSSMDTQSSSLQTQLPVFELLKDFMSGK 780
Query: 781 SVFRNIMGILLPGVNSLITERTSQIYGQLLEKSVELSLEIMILVLEKDLLLADYWRPLYQ 840
SVFRNIMGILLPGVNSLITERTSQIYGQLLEKSVELSLEIMILVLEKDLLLADYWRPLYQ
Sbjct: 781 SVFRNIMGILLPGVNSLITERTSQIYGQLLEKSVELSLEIMILVLEKDLLLADYWRPLYQ 840
Query: 841 PLEVILSQDHSQIVALLEYVRYDFHPKIQQLSIKIMSILSSRMVGLVQLLLKSNTASSLV 900
PLEVILSQDHSQIVALLEYVRYDFHPKIQQLSIKIMSILSSRMVGLVQLLLKSNTASSLV
Sbjct: 841 PLEVILSQDHSQIVALLEYVRYDFHPKIQQLSIKIMSILSSRMVGLVQLLLKSNTASSLV 900
Query: 901 EDYASCLELRSEECHVIENSGDDPGVLIMQLLIDNISRPAPNVTHLLLKFNLETSIERTI 960
EDYASCLELRSEECHVIE+SGDDPGVLIMQLLIDNISRPAPNVTHLLLKFNLETSIERTI
Sbjct: 901 EDYASCLELRSEECHVIESSGDDPGVLIMQLLIDNISRPAPNVTHLLLKFNLETSIERTI 960
Query: 961 LQPKFHYSCLKVVLEILEKLSNPEVNALLFEFGFQLLYELCLDPLTSGPVMDLLSNKKYY 1020
LQPKFHYSCLKVVLEILEKLSNPEVNALLFEFGFQLLYELCLDPLTSGPVMDLLSNKKYY
Sbjct: 961 LQPKFHYSCLKVVLEILEKLSNPEVNALLFEFGFQLLYELCLDPLTSGPVMDLLSNKKYY 1020
Query: 1021 FFVKHLDTIGVVPLPKRNNHTLRVSSLHQRAWLLKLLAIELHAADLSSPIHREACQSILA 1080
FFVKHLDTIGVVPLPKRNNHTLRVSSLHQRAWLLKLLAIELHAADLSSPIHREACQSILA
Sbjct: 1021 FFVKHLDTIGVVPLPKRNNHTLRVSSLHQRAWLLKLLAIELHAADLSSPIHREACQSILA 1080
Query: 1081 HLYGQEIDVGSVPVFSLQHHVVDPGTRTMSKSKALELLEVVQFRTPDTSIKLPQIVSNLK 1140
HLYGQEIDVGSVPVFSLQHHVVDPGTRTMSKSKALELLEVVQFRTPDTSIKLPQIVSNLK
Sbjct: 1081 HLYGQEIDVGSVPVFSLQHHVVDPGTRTMSKSKALELLEVVQFRTPDTSIKLPQIVSNLK 1140
Query: 1141 YELLTKDILGNPSTSQKGGIYHYSERGDRLIDLTSFCDKLWQKFNADNPQLNNVGREAEL 1200
YELLTKDILGNPSTSQKGGIYHYSERGDRLIDLTSFCDKLWQKFNADNPQLNNVGREAEL
Sbjct: 1141 YELLTKDILGNPSTSQKGGIYHYSERGDRLIDLTSFCDKLWQKFNADNPQLNNVGREAEL 1200
Query: 1201 EEVKETIQQFLRWGWKYNKNLEEQAAQLHMLTSWSQTIEVTVSRRISSLENRSDILFQVL 1260
EEVKETIQQFLRWGWKYNKNLEEQAAQLHMLTSWSQTIEVTVSRRISSLENRSDILFQVL
Sbjct: 1201 EEVKETIQQFLRWGWKYNKNLEEQAAQLHMLTSWSQTIEVTVSRRISSLENRSDILFQVL 1260
Query: 1261 DASLSASASPDCSLKMAYLLCQVALTCMAKLRDERYSSPGGLNADSVSCLDIIMVKQISN 1320
DASLSASASPDCSLKMAYLLCQVALTCMAKLRDERYSSPGGLNADSVSCLDIIMVKQISN
Sbjct: 1261 DASLSASASPDCSLKMAYLLCQVALTCMAKLRDERYSSPGGLNADSVSCLDIIMVKQISN 1320
Query: 1321 GACHSILLKLIMAILRNESSEALRRRQYALLLSYLQYCQNMLDPDVPTTILQVLLQNEQD 1380
GACHSILLKLIMAILRNESSEALRRRQYALLLSYLQYCQNMLDPDVPTTILQVLLQNEQD
Sbjct: 1321 GACHSILLKLIMAILRNESSEALRRRQYALLLSYLQYCQNMLDPDVPTTILQVLLQNEQD 1380
Query: 1381 GEDVDLQKIDKDQAELAHANFSILRKEAQSILNVVIKDATQGSEPGKTISLYVLDALICI 1440
GEDVDLQKIDKDQAELAHANFSILRKEAQSILNVVIKDATQGSEPGKTISLYVLDALICI
Sbjct: 1381 GEDVDLQKIDKDQAELAHANFSILRKEAQSILNVVIKDATQGSEPGKTISLYVLDALICI 1440
Query: 1441 DHDRFFLNQLQNRGFLKSCLVSISNVSLQDGVHSFDSLQRACTLEAELALLSRISHKYGK 1500
DHDRFFLNQLQNRGFLKSCLVSISNVSLQDGVHSFDSLQRACTLEAELALLSRISHKYGK
Sbjct: 1441 DHDRFFLNQLQNRGFLKSCLVSISNVSLQDGVHSFDSLQRACTLEAELALLSRISHKYGK 1500
Query: 1501 FGAQLLFSTGALEHLDSCRAINLQGNLRWVDMKPHRDVAGNFNKQQAIVTPILRLLFSMT 1560
FGAQLLFSTGALEHLDSCRA+NLQGNLRWVDMKPHRDVAG+FNKQQAIVTPILRLLFSMT
Sbjct: 1501 FGAQLLFSTGALEHLDSCRAVNLQGNLRWVDMKPHRDVAGSFNKQQAIVTPILRLLFSMT 1560
Query: 1561 SLVDTSEFFEVKNKIVREAIDFIKRHQRVFDQILGEDVSEADDSTLEQINLLVASLSKVW 1620
SLVDTSEFFEVKNKIVREAIDFIKRHQRVFDQILGEDVSEADDSTLEQINLLVASLSKVW
Sbjct: 1561 SLVDTSEFFEVKNKIVREAIDFIKRHQRVFDQILGEDVSEADDSTLEQINLLVASLSKVW 1620
Query: 1621 PYEETDEYGFVQSLFQLMHSLFSRELDSRTPGPAVKLLKNRRSSELHSIQLNFSLISYLY 1680
PYEETDEYGFVQSLFQLMHSLFSRELDSRT GPAVKLLKNRRSSELHSIQLNFSLISYLY
Sbjct: 1621 PYEETDEYGFVQSLFQLMHSLFSRELDSRTSGPAVKLLKNRRSSELHSIQLNFSLISYLY 1680
Query: 1681 FLVTRKSLRLQVSGTSSSHNSPVRSQRPSLDLLGTLLNSTTITLEKAAEERLLLLNKIRD 1740
FLVTRKSLRLQVSGTSSSHNSPVRSQRPSLDLLGTLLNS TITLEKAAEERLLLLNKIRD
Sbjct: 1681 FLVTRKSLRLQVSGTSSSHNSPVRSQRPSLDLLGTLLNSMTITLEKAAEERLLLLNKIRD 1740
Query: 1741 INELSRQEVEEIIVLCLGEDFASLSDNIQRRRYIAMIEMCKIVGNKSEMITLLLPLAEYV 1800
INELSRQEVEEIIVLCLGEDFASLSDNIQRRRYIAMIEMCKIVGNKSEMITLLLPLAEYV
Sbjct: 1741 INELSRQEVEEIIVLCLGEDFASLSDNIQRRRYIAMIEMCKIVGNKSEMITLLLPLAEYV 1800
Query: 1801 LNVMLIHFQDSSVIPDGNANIKAIAYHGELESGHEISSLCGKLIPVLERLELLSENKIGE 1860
LNVMLIHFQDS+VIPDGNANIKAI+YHGELESGHEISSLCGKLIP+LERLELLSENKIGE
Sbjct: 1801 LNVMLIHFQDSTVIPDGNANIKAISYHGELESGHEISSLCGKLIPILERLELLSENKIGE 1860
Query: 1861 NLKVFGRLVSSVKEVAIQKLDV 1883
N+KVFGRL SSVKEVAIQKLDV
Sbjct: 1861 NVKVFGRLASSVKEVAIQKLDV 1882
BLAST of CmoCh12G012390 vs. ExPASy TrEMBL
Match:
A0A6J1HKK4 (nuclear pore complex protein NUP205 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111465407 PE=3 SV=1)
HSP 1 Score: 3611.6 bits (9364), Expect = 0.0e+00
Identity = 1862/1882 (98.94%), Positives = 1872/1882 (99.47%), Query Frame = 0
Query: 1 MLSSKQSLRIIESALLGPSPPSPSQRVELLHAIHNSMPAFRSLLQFPSPKASDRAQVQSK 60
MLSSKQSLRIIESALLGP+PPSPSQRVELLHAIHNSMPAFRSLLQFPSPKASDRAQVQSK
Sbjct: 1 MLSSKQSLRIIESALLGPAPPSPSQRVELLHAIHNSMPAFRSLLQFPSPKASDRAQVQSK 60
Query: 61 EVRRPDSSTITLDDQDVVITLKLSDDLHLNEIECVHLLVAAHQEWALMGRDPSEIFHLAA 120
EVRRPDSSTITLDDQDV ITLKLSDDLHLNEIECVHLLVAAHQEWALMGRDPSEIFHLAA
Sbjct: 61 EVRRPDSSTITLDDQDVEITLKLSDDLHLNEIECVHLLVAAHQEWALMGRDPSEIFHLAA 120
Query: 121 GLWYTERRDLIMSLYTLLRAVVLDQGLEAGLVSDIQRHLEDLINSGLRQRLISLIKELNR 180
GLWYTERRDLIMSLYTLLRAVVLDQGLEAGLVSDIQR+LEDLI SGLRQRLISLIKELNR
Sbjct: 121 GLWYTERRDLIMSLYTLLRAVVLDQGLEAGLVSDIQRNLEDLIISGLRQRLISLIKELNR 180
Query: 181 EEPAGLGGPSCERYILDSRGALVERRAVVCRERLILGHCLVLSILVVRIGPKDVRDLFSV 240
EEPAGLGGPSCERYILDSRGALVERRAVVCRERLILGHCLVLSILVVRIGPKDV+DLFSV
Sbjct: 181 EEPAGLGGPSCERYILDSRGALVERRAVVCRERLILGHCLVLSILVVRIGPKDVKDLFSV 240
Query: 241 LKDCAAELNETKTPIKLQIVFSLLFSIIIAFVSDALSAVPNKASILSSDASFRNEFQDTV 300
LKDCAAELNETKTPIKLQIVFSLLFSIIIAFVSDALSAVPNKASILSSDASFRNEFQDTV
Sbjct: 241 LKDCAAELNETKTPIKLQIVFSLLFSIIIAFVSDALSAVPNKASILSSDASFRNEFQDTV 300
Query: 301 MVSGNNPTVEGFVDAVRFAWTVHLLLIHDMVDAREAVPSASPKDLDHLQSCLEVIFSHNA 360
MVSGNNPTVEGFVDAVRFAWTVHLLLIHDMVDAREAVPSASPKDLDHLQSCLEVIFSHNA
Sbjct: 301 MVSGNNPTVEGFVDAVRFAWTVHLLLIHDMVDAREAVPSASPKDLDHLQSCLEVIFSHNA 360
Query: 361 FQFMLQEVIQTAAYQNDDEDMTYMYNAYLHKLVSCFLSHPLARDKVKESKDRAMNTLSQF 420
FQFMLQEVIQTAAYQNDDEDMTYMYNAYLHKLVSCFLSHPLARDKVKESKDRAMNTLSQF
Sbjct: 361 FQFMLQEVIQTAAYQNDDEDMTYMYNAYLHKLVSCFLSHPLARDKVKESKDRAMNTLSQF 420
Query: 421 RATGSQDFMHDGDSSSHQASETLPSPFVSLLNFVSEIYRKEPELLSSNDVLWTFANFAGE 480
RATGSQDFMHD DSSSHQASETLPSPFVSLLNFVSEIYRKEPELLSSNDVLWTFANFAGE
Sbjct: 421 RATGSQDFMHDSDSSSHQASETLPSPFVSLLNFVSEIYRKEPELLSSNDVLWTFANFAGE 480
Query: 481 DHTNFQTLVAFLNMLSTLACNEEGASRVFELLQGKAFRSVGWTTLFDCLSIYDEKFRQSL 540
DHTNFQTLVAFLNMLSTLACNEEGASRVFELL+GKAFRSVGWTTLFDCLSIYDEKFRQSL
Sbjct: 481 DHTNFQTLVAFLNMLSTLACNEEGASRVFELLKGKAFRSVGWTTLFDCLSIYDEKFRQSL 540
Query: 541 QTAGALLPEFQEGDAKALVAYLNVLQKVVENGNPVERKNWFPDIEPLFKLLSYENVPPYL 600
QTAGALLPEFQEGDAKALVAYLNVLQKVVENGNPVERKNWFPDIEPLFKLLSYENVPPYL
Sbjct: 541 QTAGALLPEFQEGDAKALVAYLNVLQKVVENGNPVERKNWFPDIEPLFKLLSYENVPPYL 600
Query: 601 KGALRNAIASFIQVSSDLKDIIWCYLEQYDLPVLVASHIQNGTKSITSQVYDMQFEVNEI 660
KGALRNAI SFIQVSSDLKDIIW YLEQYDLPVLVASHIQNGTKSITSQVYDMQFEVNEI
Sbjct: 601 KGALRNAITSFIQVSSDLKDIIWRYLEQYDLPVLVASHIQNGTKSITSQVYDMQFEVNEI 660
Query: 661 EARQERYPSTISFLNLLNALIAKESDLSDRGRRFVGIFRFIYDHVFGPFPQRAYADAAEK 720
EARQERYPSTISFLNLLNALIAKESDLSDRGRRFVGIFRFIYDHVFGPFPQRAYADAAEK
Sbjct: 661 EARQERYPSTISFLNLLNALIAKESDLSDRGRRFVGIFRFIYDHVFGPFPQRAYADAAEK 720
Query: 721 WQLVVACLQHFNMILKMYDINEEDVDVVIDRSQSSMDTQSSSLQTQLPVLELLKDFMSGK 780
WQLVVACLQHFNMILKMYDINEEDVDVVIDRSQSSMDTQSSSLQTQLPV ELLKDFMSGK
Sbjct: 721 WQLVVACLQHFNMILKMYDINEEDVDVVIDRSQSSMDTQSSSLQTQLPVFELLKDFMSGK 780
Query: 781 SVFRNIMGILLPGVNSLITERTSQIYGQLLEKSVELSLEIMILVLEKDLLLADYWRPLYQ 840
SVFRNIMGILLPGVNSLITERTSQIYGQLLEKSVELSLEIMILVLEKDLLLADYWRPLYQ
Sbjct: 781 SVFRNIMGILLPGVNSLITERTSQIYGQLLEKSVELSLEIMILVLEKDLLLADYWRPLYQ 840
Query: 841 PLEVILSQDHSQIVALLEYVRYDFHPKIQQLSIKIMSILSSRMVGLVQLLLKSNTASSLV 900
PLEVILSQDHSQIVALLEYVRYDFHPKIQQLSIKIMSILSSRMVGLVQLLLKSNTASSLV
Sbjct: 841 PLEVILSQDHSQIVALLEYVRYDFHPKIQQLSIKIMSILSSRMVGLVQLLLKSNTASSLV 900
Query: 901 EDYASCLELRSEECHVIENSGDDPGVLIMQLLIDNISRPAPNVTHLLLKFNLETSIERTI 960
EDYASCLELRSEECHVIE+SGDDPGVLIMQLLIDNISRPAPNVTHLLLKFNLETSIERTI
Sbjct: 901 EDYASCLELRSEECHVIESSGDDPGVLIMQLLIDNISRPAPNVTHLLLKFNLETSIERTI 960
Query: 961 LQPKFHYSCLKVVLEILEKLSNPEVNALLFEFGFQLLYELCLDPLTSGPVMDLLSNKKYY 1020
LQPKFHYSCLKVVLEILEKLSNPEVNALLFEFGFQLLYELCLDPLTSGPVMDLLSNKKYY
Sbjct: 961 LQPKFHYSCLKVVLEILEKLSNPEVNALLFEFGFQLLYELCLDPLTSGPVMDLLSNKKYY 1020
Query: 1021 FFVKHLDTIGVVPLPKRNNHTLRVSSLHQRAWLLKLLAIELHAADLSSPIHREACQSILA 1080
FFVKHLDTIGVVPLPKRNNHTLRVSSLHQRAWLLKLLAIELHAADLSSPIHREACQSILA
Sbjct: 1021 FFVKHLDTIGVVPLPKRNNHTLRVSSLHQRAWLLKLLAIELHAADLSSPIHREACQSILA 1080
Query: 1081 HLYGQEIDVGSVPVFSLQHHVVDPGTRTMSKSKALELLEVVQFRTPDTSIKLPQIVSNLK 1140
HLYGQEIDVGSVPVFSLQHHVVDPGTRTMSKSKALELLEVVQFRTPDTSIKLPQIVSNLK
Sbjct: 1081 HLYGQEIDVGSVPVFSLQHHVVDPGTRTMSKSKALELLEVVQFRTPDTSIKLPQIVSNLK 1140
Query: 1141 YELLTKDILGNPSTSQKGGIYHYSERGDRLIDLTSFCDKLWQKFNADNPQLNNVGREAEL 1200
YELLTKDILGNPSTSQKGGIYHYSERGDRLIDLTSFCDKLWQKFNADNPQLNNVGREAEL
Sbjct: 1141 YELLTKDILGNPSTSQKGGIYHYSERGDRLIDLTSFCDKLWQKFNADNPQLNNVGREAEL 1200
Query: 1201 EEVKETIQQFLRWGWKYNKNLEEQAAQLHMLTSWSQTIEVTVSRRISSLENRSDILFQVL 1260
EEVKETIQQFLRWGWKYNKNLEEQAAQLHMLTSWSQTIEVTVSRRISSLENRSDILFQVL
Sbjct: 1201 EEVKETIQQFLRWGWKYNKNLEEQAAQLHMLTSWSQTIEVTVSRRISSLENRSDILFQVL 1260
Query: 1261 DASLSASASPDCSLKMAYLLCQVALTCMAKLRDERYSSPGGLNADSVSCLDIIMVKQISN 1320
DASLSASASPDCSLKMAYLLCQVALTCMAKLRDERYSSPGGLNADSVSCLDIIMVKQISN
Sbjct: 1261 DASLSASASPDCSLKMAYLLCQVALTCMAKLRDERYSSPGGLNADSVSCLDIIMVKQISN 1320
Query: 1321 GACHSILLKLIMAILRNESSEALRRRQYALLLSYLQYCQNMLDPDVPTTILQVLLQNEQD 1380
GACHSILLKLIMAILRNESSEALRRRQYALLLSYLQYCQNMLDPDVPTTILQVLLQNEQD
Sbjct: 1321 GACHSILLKLIMAILRNESSEALRRRQYALLLSYLQYCQNMLDPDVPTTILQVLLQNEQD 1380
Query: 1381 GEDVDLQKIDKDQAELAHANFSILRKEAQSILNVVIKDATQGSEPGKTISLYVLDALICI 1440
GEDVDLQKIDKDQAELAHANFSILRKEAQSILNVVIKDATQGSEPGKTISLYVLDALICI
Sbjct: 1381 GEDVDLQKIDKDQAELAHANFSILRKEAQSILNVVIKDATQGSEPGKTISLYVLDALICI 1440
Query: 1441 DHDRFFLNQLQNRGFLKSCLVSISNVSLQDGVHSFDSLQRACTLEAELALLSRISHKYGK 1500
DHDRFFLNQLQNRGFLKSCLVSISNVSLQDGVHSFDSLQRACTLEAELALLSRISHKYGK
Sbjct: 1441 DHDRFFLNQLQNRGFLKSCLVSISNVSLQDGVHSFDSLQRACTLEAELALLSRISHKYGK 1500
Query: 1501 FGAQLLFSTGALEHLDSCRAINLQGNLRWVDMKPHRDVAGNFNKQQAIVTPILRLLFSMT 1560
FGAQLLFSTGALEHLDSCRA+NLQGNLRWVDMKPHRDVAG+FNKQQAIVTPILRLLFSMT
Sbjct: 1501 FGAQLLFSTGALEHLDSCRAVNLQGNLRWVDMKPHRDVAGSFNKQQAIVTPILRLLFSMT 1560
Query: 1561 SLVDTSEFFEVKNKIVREAIDFIKRHQRVFDQILGEDVSEADDSTLEQINLLVASLSKVW 1620
SLVDTSEFFEVKNKIVREAIDFIKRHQRVFDQILGEDVSEADDSTLEQINLLVASLSKVW
Sbjct: 1561 SLVDTSEFFEVKNKIVREAIDFIKRHQRVFDQILGEDVSEADDSTLEQINLLVASLSKVW 1620
Query: 1621 PYEETDEYGFVQSLFQLMHSLFSRELDSRTPGPAVKLLKNRRSSELHSIQLNFSLISYLY 1680
PYEETDEYGFVQSLFQLMHSLFSRELDSRT GPAVKLLKNRRSSELHSIQLNFSLISYLY
Sbjct: 1621 PYEETDEYGFVQSLFQLMHSLFSRELDSRTSGPAVKLLKNRRSSELHSIQLNFSLISYLY 1680
Query: 1681 FLVTRKSLRLQVSGTSSSHNSPVRSQRPSLDLLGTLLNSTTITLEKAAEERLLLLNKIRD 1740
FLVTRKSLRLQVSGTSSSHNSPVRSQRPSLDLLGTLLNS TITLEKAAEERLLLLNKIRD
Sbjct: 1681 FLVTRKSLRLQVSGTSSSHNSPVRSQRPSLDLLGTLLNSMTITLEKAAEERLLLLNKIRD 1740
Query: 1741 INELSRQEVEEIIVLCLGEDFASLSDNIQRRRYIAMIEMCKIVGNKSEMITLLLPLAEYV 1800
INELSRQEVEEIIVLCLGEDFASLSDNIQRRRYIAMIEMCKIVGNKSEMITLLLPLAEYV
Sbjct: 1741 INELSRQEVEEIIVLCLGEDFASLSDNIQRRRYIAMIEMCKIVGNKSEMITLLLPLAEYV 1800
Query: 1801 LNVMLIHFQDSSVIPDGNANIKAIAYHGELESGHEISSLCGKLIPVLERLELLSENKIGE 1860
LNVMLIHFQD SVIPDGNANIKAI+YHGELESGHEISSLCGKLIP+LERLELLSENKIGE
Sbjct: 1801 LNVMLIHFQD-SVIPDGNANIKAISYHGELESGHEISSLCGKLIPILERLELLSENKIGE 1860
Query: 1861 NLKVFGRLVSSVKEVAIQKLDV 1883
N+KVFGRL SSVKEVAIQKLDV
Sbjct: 1861 NVKVFGRLASSVKEVAIQKLDV 1881
BLAST of CmoCh12G012390 vs. ExPASy TrEMBL
Match:
A0A6J1CPX3 (nuclear pore complex protein NUP205 OS=Momordica charantia OX=3673 GN=LOC111013061 PE=3 SV=1)
HSP 1 Score: 3375.1 bits (8750), Expect = 0.0e+00
Identity = 1717/1881 (91.28%), Positives = 1805/1881 (95.96%), Query Frame = 0
Query: 1 MLSSKQSLRIIESALLGPSPPSPSQRVELLHAIHNSMPAFRSLLQFPSPKASDRAQVQSK 60
M+SSKQSL IIESALLGPSPPSPSQRVELLHAIH SMPAFRSLLQFP PKASDRAQVQS+
Sbjct: 1 MISSKQSLHIIESALLGPSPPSPSQRVELLHAIHESMPAFRSLLQFPPPKASDRAQVQSR 60
Query: 61 EVRRPDSSTITLDDQDVVITLKLSDDLHLNEIECVHLLVAAHQEWALMGRDPSEIFHLAA 120
EVR PDSSTITLDDQDV I LKLSDDLHLNEI+CV LLVAAHQEWALMGR+PSEIF LAA
Sbjct: 61 EVRCPDSSTITLDDQDVQIALKLSDDLHLNEIDCVRLLVAAHQEWALMGREPSEIFRLAA 120
Query: 121 GLWYTERRDLIMSLYTLLRAVVLDQGLEAGLVSDIQRHLEDLINSGLRQRLISLIKELNR 180
GLWYTERRDLIMSLYTLLRAVVLDQGLEAGL+SDIQRHLEDLIN+GLRQRLI+LIKELNR
Sbjct: 121 GLWYTERRDLIMSLYTLLRAVVLDQGLEAGLISDIQRHLEDLINNGLRQRLITLIKELNR 180
Query: 181 EEPAGLGGPSCERYILDSRGALVERRAVVCRERLILGHCLVLSILVVRIGPKDVRDLFSV 240
EEPAGLGGPSCERYILDSRGALVER+AVVCRERLI+GHCLVLS+LVVRIG KDVR LFSV
Sbjct: 181 EEPAGLGGPSCERYILDSRGALVERQAVVCRERLIIGHCLVLSVLVVRIGAKDVRGLFSV 240
Query: 241 LKDCAAELNETKTPIKLQIVFSLLFSIIIAFVSDALSAVPNKASILSSDASFRNEFQDTV 300
LKDCAAELN+TK PIKLQIVFSLLFS IIAF+SDALSAVPNKAS+LSSDASFRNEFQDTV
Sbjct: 241 LKDCAAELNKTKAPIKLQIVFSLLFSTIIAFISDALSAVPNKASLLSSDASFRNEFQDTV 300
Query: 301 MVSGNNPTVEGFVDAVRFAWTVHLLLIHDMVDAREAVPSASPKDLDHLQSCLEVIFSHNA 360
M SGN+PTVEGFV+AVRFAWTVHLLLIHDM+DAREA+P+AS KDLD+LQSCL+VIFSHNA
Sbjct: 301 MASGNDPTVEGFVNAVRFAWTVHLLLIHDMIDAREAIPNASSKDLDNLQSCLDVIFSHNA 360
Query: 361 FQFMLQEVIQTAAYQNDDEDMTYMYNAYLHKLVSCFLSHPLARDKVKESKDRAMNTLSQF 420
FQF+LQEVIQTAAYQNDDEDM YMYNAYLHKLV+CFLSHPLARDKVKESKDRAM+ LSQF
Sbjct: 361 FQFLLQEVIQTAAYQNDDEDMIYMYNAYLHKLVTCFLSHPLARDKVKESKDRAMHMLSQF 420
Query: 421 RATGSQDFMHDGDSSSHQASETLPSPFVSLLNFVSEIYRKEPELLSSNDVLWTFANFAGE 480
RATGSQDFMHDGD+SSHQASET+P PFVSLL FVSEIY+KEPELLSSNDVLWTFANFAGE
Sbjct: 421 RATGSQDFMHDGDTSSHQASETIPLPFVSLLEFVSEIYQKEPELLSSNDVLWTFANFAGE 480
Query: 481 DHTNFQTLVAFLNMLSTLACNEEGASRVFELLQGKAFRSVGWTTLFDCLSIYDEKFRQSL 540
DH+NFQTLVAFLNMLSTLACNEEGASRVFELLQGKAFRSVGWTTLFDCLSIYDEKFRQSL
Sbjct: 481 DHSNFQTLVAFLNMLSTLACNEEGASRVFELLQGKAFRSVGWTTLFDCLSIYDEKFRQSL 540
Query: 541 QTAGALLPEFQEGDAKALVAYLNVLQKVVENGNPVERKNWFPDIEPLFKLLSYENVPPYL 600
QTAGA+LPEFQEGDAKALVAYLNVLQKVVENGNPVERKNWF DIEPLFKLLSYENVPPYL
Sbjct: 541 QTAGAMLPEFQEGDAKALVAYLNVLQKVVENGNPVERKNWFSDIEPLFKLLSYENVPPYL 600
Query: 601 KGALRNAIASFIQVSSDLKDIIWCYLEQYDLPVLVASHIQNGTKSITSQVYDMQFEVNEI 660
KGALRNAIASFIQVSS+ KDIIW YLE+YDLPVLVASHIQNGTK IT+QVYDMQFE+NEI
Sbjct: 601 KGALRNAIASFIQVSSESKDIIWSYLERYDLPVLVASHIQNGTKPITAQVYDMQFELNEI 660
Query: 661 EARQERYPSTISFLNLLNALIAKESDLSDRGRRFVGIFRFIYDHVFGPFPQRAYADAAEK 720
EARQERYPST+SFLNLLNALIAKE DLSDRG RFVGIFRFIYDHVFGPFPQRAYADAAEK
Sbjct: 661 EARQERYPSTVSFLNLLNALIAKERDLSDRGLRFVGIFRFIYDHVFGPFPQRAYADAAEK 720
Query: 721 WQLVVACLQHFNMILKMYDINEEDVDVVIDRSQSSMDTQSSSLQTQLPVLELLKDFMSGK 780
WQLVVACLQHF M+LKMYDI EED+D++ D+SQSSM+ QSSSL TQLPVLELLKDFMSGK
Sbjct: 721 WQLVVACLQHFIMVLKMYDIKEEDIDIITDQSQSSMEAQSSSLHTQLPVLELLKDFMSGK 780
Query: 781 SVFRNIMGILLPGVNSLITERTSQIYGQLLEKSVELSLEIMILVLEKDLLLADYWRPLYQ 840
S+FRNIMGILLPGVNSLITERTSQIYGQLLEKSVELSLEI+ILVLEKDLLLADYWRPLYQ
Sbjct: 781 SIFRNIMGILLPGVNSLITERTSQIYGQLLEKSVELSLEIIILVLEKDLLLADYWRPLYQ 840
Query: 841 PLEVILSQDHSQIVALLEYVRYDFHPKIQQLSIKIMSILSSRMVGLVQLLLKSNTASSLV 900
PLEVILSQDH QIVALLEYVRYDF PKIQQ SIKIMSILSSRMVGLVQLLLKSNTASSLV
Sbjct: 841 PLEVILSQDHGQIVALLEYVRYDFQPKIQQFSIKIMSILSSRMVGLVQLLLKSNTASSLV 900
Query: 901 EDYASCLELRSEECHVIENSGDDPGVLIMQLLIDNISRPAPNVTHLLLKFNLETSIERTI 960
EDYASCLELRSEE H+IENSGDDPGVLIMQLLIDNISRPAP+VTHLLLKFNLE SIERT+
Sbjct: 901 EDYASCLELRSEESHIIENSGDDPGVLIMQLLIDNISRPAPSVTHLLLKFNLEISIERTV 960
Query: 961 LQPKFHYSCLKVVLEILEKLSNPEVNALLFEFGFQLLYELCLDPLTSGPVMDLLSNKKYY 1020
LQPKFHYSCLKVVLEIL+KLSNPEVNALL+EFGFQLLYELCLDPLTSGPVMDLLSNKKYY
Sbjct: 961 LQPKFHYSCLKVVLEILDKLSNPEVNALLYEFGFQLLYELCLDPLTSGPVMDLLSNKKYY 1020
Query: 1021 FFVKHLDTIGVVPLPKRNNHTLRVSSLHQRAWLLKLLAIELHAADLSSPIHREACQSILA 1080
FF+KHLDTIGVVPLPKRNNHTLRVSSLHQRAWLLKLLAIELHAADLSSPIHREACQSI+
Sbjct: 1021 FFIKHLDTIGVVPLPKRNNHTLRVSSLHQRAWLLKLLAIELHAADLSSPIHREACQSIIG 1080
Query: 1081 HLYGQEI-DVGSVPVFSLQHHVVDPGTRTMSKSKALELLEVVQFRTPDTSIKLPQIVSNL 1140
HLYGQ I D GSVP FSLQHHVVDPGTRT SKSKALELLEVVQFRTPDTSIKLPQIVSNL
Sbjct: 1081 HLYGQGIVDAGSVPFFSLQHHVVDPGTRTTSKSKALELLEVVQFRTPDTSIKLPQIVSNL 1140
Query: 1141 KYELLTKDILGNPSTSQKGGIYHYSERGDRLIDLTSFCDKLWQKFNADNPQLNNVGREAE 1200
KYELLTKDILGNPSTSQKGGIY+YSERGDRL+DLTSFCDKLWQKFN++NPQLNNVG EAE
Sbjct: 1141 KYELLTKDILGNPSTSQKGGIYYYSERGDRLVDLTSFCDKLWQKFNSNNPQLNNVGNEAE 1200
Query: 1201 LEEVKETIQQFLRWGWKYNKNLEEQAAQLHMLTSWSQTIEVTVSRRISSLENRSDILFQV 1260
LEEVKETIQQFLRWGWKYNKNLEEQAAQLHMLTSWSQTIEVTVSRRISSLENRSDILFQ+
Sbjct: 1201 LEEVKETIQQFLRWGWKYNKNLEEQAAQLHMLTSWSQTIEVTVSRRISSLENRSDILFQL 1260
Query: 1261 LDASLSASASPDCSLKMAYLLCQVALTCMAKLRDERYSSPGGLNADSVSCLDIIMVKQIS 1320
LDASLSASASPDCSLKMAYLLCQVALTCMAKLRDERYS PGG N+DSVSCLDIIMVKQIS
Sbjct: 1261 LDASLSASASPDCSLKMAYLLCQVALTCMAKLRDERYSCPGGSNSDSVSCLDIIMVKQIS 1320
Query: 1321 NGACHSILLKLIMAILRNESSEALRRRQYALLLSYLQYCQNMLDPDVPTTILQVLLQNEQ 1380
NGACHSILLKLIMAILRNESSEALRRRQYALLLSYLQYCQNMLDPDVPTT+LQVLL NEQ
Sbjct: 1321 NGACHSILLKLIMAILRNESSEALRRRQYALLLSYLQYCQNMLDPDVPTTVLQVLLLNEQ 1380
Query: 1381 DGEDVDLQKIDKDQAELAHANFSILRKEAQSILNVVIKDATQGSEPGKTISLYVLDALIC 1440
DGEDVDLQKIDKDQAELAHANFSILRKEAQSIL+VVIKDATQGSEPGKTISLY+LDALIC
Sbjct: 1381 DGEDVDLQKIDKDQAELAHANFSILRKEAQSILDVVIKDATQGSEPGKTISLYILDALIC 1440
Query: 1441 IDHDRFFLNQLQNRGFLKSCLVSISNVSLQDGVHSFDSLQRACTLEAELALLSRISHKYG 1500
IDHDRFFLNQLQ+RGFLKSCL+SISNVSLQDG HSFDSLQRACTLEAELALL RISHKYG
Sbjct: 1441 IDHDRFFLNQLQSRGFLKSCLISISNVSLQDGAHSFDSLQRACTLEAELALLLRISHKYG 1500
Query: 1501 KFGAQLLFSTGALEHLDSCRAINLQGNLRWVDMKPHRDVAGNFNKQQAIVTPILRLLFSM 1560
K GAQLLFS GALEHL SCRA+NLQG+LRW+DMKPHRDVAGN NKQQ IVTPILRLLFS+
Sbjct: 1501 KNGAQLLFSMGALEHLASCRAVNLQGSLRWIDMKPHRDVAGNINKQQTIVTPILRLLFSL 1560
Query: 1561 TSLVDTSEFFEVKNKIVREAIDFIKRHQRVFDQILGEDVSEADDSTLEQINLLVASLSKV 1620
TSLVDTS+FFEVKNKIVRE +DFIK HQR+FDQILGEDVSEADD T+EQINLLV SL KV
Sbjct: 1561 TSLVDTSDFFEVKNKIVREVVDFIKGHQRLFDQILGEDVSEADDFTMEQINLLVGSLGKV 1620
Query: 1621 WPYEETDEYGFVQSLFQLMHSLFSRELDSRTPGPAVKLLKNRRSSELHSIQLNFSLISYL 1680
WPYEETDEYGFVQSLFQLMHSLFSRE DS T GP VKLLKNRRSSEL+SI+LNFSL SYL
Sbjct: 1621 WPYEETDEYGFVQSLFQLMHSLFSRESDSLTSGPGVKLLKNRRSSELYSIELNFSLSSYL 1680
Query: 1681 YFLVTRKSLRLQVSGTSSSHNSPVRSQRPSLDLLGTLLNSTTITLEKAAEERLLLLNKIR 1740
YFLVTRKSLRL+VSG SS ++SPVR+QRPSLDLLG LLNSTT TLE+AAEER LLLNKIR
Sbjct: 1681 YFLVTRKSLRLEVSGASSGYSSPVRAQRPSLDLLGILLNSTTTTLERAAEERSLLLNKIR 1740
Query: 1741 DINELSRQEVEEIIVLCLGEDFASLSDNIQRRRYIAMIEMCKIVGNKSEMITLLLPLAEY 1800
DINELSRQEVEEIIV C+G+DFASLSD+IQRRRY+AMIEMCK+VGNK++MITLLLPLAEY
Sbjct: 1741 DINELSRQEVEEIIVRCVGDDFASLSDDIQRRRYVAMIEMCKVVGNKNQMITLLLPLAEY 1800
Query: 1801 VLNVMLIHFQDSSVIPDGNANIKAIAYHGELESGHEISSLCGKLIPVLERLELLSENKIG 1860
VLNVMLIHFQDS+VIPDGNAN+KAI+YH E S EI+SL GKLIP+LERLELLSENKIG
Sbjct: 1801 VLNVMLIHFQDSTVIPDGNANVKAISYHAESRSAQEITSLSGKLIPILERLELLSENKIG 1860
Query: 1861 ENLKVFGRLVSSVKEVAIQKL 1881
NLKVF RLV+S+KE AIQKL
Sbjct: 1861 HNLKVFRRLVTSLKETAIQKL 1881
BLAST of CmoCh12G012390 vs. TAIR 10
Match:
AT5G51200.1 (Protein of unknown function (DUF3414))
HSP 1 Score: 2358.2 bits (6110), Expect = 0.0e+00
Identity = 1210/1885 (64.19%), Positives = 1504/1885 (79.79%), Query Frame = 0
Query: 1 MLSSKQSLRIIESALLGPSPPSPSQRVELLHAIHNSMPAFRSLLQFPSPKASDRAQVQSK 60
M+S K + I+ S+LLG S P+P+QR+EL HAI NS P+ ++LL FP PK SDRAQVQSK
Sbjct: 1 MVSPKDLVAIVHSSLLGTSRPTPTQRIELTHAIRNSFPSLQNLLSFPPPKPSDRAQVQSK 60
Query: 61 EVRRPDSSTITLDDQDVVITLKLSDDLHLNEIECVHLLVAAHQEWALMGRDPSEIFHLAA 120
E+R PDS I+LDDQD+ I+LKLSD+LHLNEI+ V LLV+++QEW LMGRDP EI LA
Sbjct: 61 EIRLPDSLPISLDDQDIAISLKLSDELHLNEIDSVRLLVSSNQEWGLMGRDPLEIQRLAT 120
Query: 121 GLWYTERRDLIMSLYTLLRAVVLDQGLEAGLVSDIQRHLEDLINSGLRQRLISLIKELNR 180
GLWYT RRDL +LYTLLRAVVLD+GLE L++DIQ LE+LI +GLRQRLI+LIKELNR
Sbjct: 121 GLWYTGRRDLTSTLYTLLRAVVLDEGLEPDLIADIQGLLEELIEAGLRQRLITLIKELNR 180
Query: 181 EEPAGLGGPSCERYILDSRGALVERRAVVCRERLILGHCLVLSILVVRIGPKDVRDLFSV 240
E+P GLGGP CERY++DSRGALVERRAVV RERLILGHCLVLSILV R G KDV+D++ +
Sbjct: 181 EDPTGLGGPLCERYLIDSRGALVERRAVVQRERLILGHCLVLSILVDRPGSKDVKDIYYI 240
Query: 241 LKDCAAELNETKTPIKLQIVFSLLFSIIIAFVSDALSAVPNKASILSSDASFRNEFQDTV 300
LKD AA+L E I QI FSLLFS+II FVSDA+S + +K+S++S DASFR +FQD V
Sbjct: 241 LKDNAAQLTEGNDTISSQITFSLLFSLIITFVSDAISRLSDKSSMISQDASFRTDFQDIV 300
Query: 301 MVSGNNPTVEGFVDAVRFAWTVHLLLIHDMVDAREAVPSASPKDLDHLQSCLEVIFSHNA 360
M SG++PT +GF+ +R AW VHL+LIHD + + + +AS D+ H+ SCLE IFS N
Sbjct: 301 MASGSDPTADGFIGGIRLAWAVHLMLIHDGISGMDTISTASTTDMGHICSCLESIFSKNV 360
Query: 361 FQFMLQEVIQTAAYQNDDEDMTYMYNAYLHKLVSCFLSHPLARDKVKESKDRAMNTLSQF 420
FQF+L V++TAAYQND+ED+ Y+YNAYLHKL SCFLSHP+ARDKVKESKD AM+ L+ +
Sbjct: 361 FQFLLDNVLRTAAYQNDEEDIIYIYNAYLHKLASCFLSHPIARDKVKESKDMAMSVLNSY 420
Query: 421 RATGSQDFMHDGDSSSHQASETLPSPFVSLLNFVSEIYRKEPELLSSNDVLWTFANFAGE 480
R + D S P PF+SL+ F KEPELLS NDVLWTF NFAGE
Sbjct: 421 RTSDPL------DGSMQTEESDRPLPFISLMEF------KEPELLSGNDVLWTFVNFAGE 480
Query: 481 DHTNFQTLVAFLNMLSTLACNEEGASRVFELLQGKAFRSVGWTTLFDCLSIYDEKFRQSL 540
DHTNF+TLVAFL ML TLA +EGAS+V+ELL+G +FRS+GW TLFDC+ IYDEKF+QSL
Sbjct: 481 DHTNFKTLVAFLEMLCTLASTQEGASKVYELLRGTSFRSIGWPTLFDCIRIYDEKFKQSL 540
Query: 541 QTAGALLPEFQEGDAKALVAYLNVLQKVVENGNPVERKNWFPDIEPLFKLLSYENVPPYL 600
QTAGA++PEF EGDAKALVAYLNVLQKVVENGNP ERKNWFPDIEP FKLL YEN+PPYL
Sbjct: 541 QTAGAMMPEFLEGDAKALVAYLNVLQKVVENGNPTERKNWFPDIEPFFKLLGYENIPPYL 600
Query: 601 KGALRNAIASFIQVSSDLKDIIWCYLEQYDLPVLVASHIQNGTKSITSQVYDMQFEVNEI 660
KGALR IA+F+ V +++D IW +LEQYDLPV+V S + G +SQVYDMQFE+NE+
Sbjct: 601 KGALRKTIAAFVNVFPEMRDSIWAFLEQYDLPVVVGSQV--GKSDQSSQVYDMQFELNEV 660
Query: 661 EARQERYPSTISFLNLLNALIAKESDLSDRGRRFVGIFRFIYDHVFGPFPQRAYADAAEK 720
EAR+E+YPSTISFLNL+NALIA E D++DRGR RAY+D EK
Sbjct: 661 EARREQYPSTISFLNLINALIAGEKDVNDRGR-------------------RAYSDPCEK 720
Query: 721 WQLVVACLQHFNMILKMYDINEEDVDVVIDRSQSSMDTQSSSLQTQLPVLELLKDFMSGK 780
WQLVVACLQHF+MIL MYDI EED+D + + ++SSLQTQLP++ELLKDFMSGK
Sbjct: 721 WQLVVACLQHFHMILSMYDIQEEDLDGFTEHPHFLVSLETSSLQTQLPIIELLKDFMSGK 780
Query: 781 SVFRNIMGILLPGVNSLITERTSQIYGQLLEKSVELSLEIMILVLEKDLLLADYWRPLYQ 840
+++RN+MGIL GVNS+I+ER S+ YG++LEK+V+LSLEI++LV EKDLL++D WRPLYQ
Sbjct: 781 ALYRNLMGILQVGVNSIISERLSKTYGKILEKAVQLSLEILLLVFEKDLLVSDVWRPLYQ 840
Query: 841 PLEVILSQDHSQIVALLEYVRYDFHPKIQQLSIKIMSIL-SSRMVGLVQLLLKSNTASSL 900
PL++ILSQDH+QI+ALLEYVRYD P+IQ+ SIKIM+IL SR+VGLV +L+K + A+SL
Sbjct: 841 PLDIILSQDHNQIIALLEYVRYDSLPQIQRSSIKIMNILRCSRLVGLVPMLIKIDAANSL 900
Query: 901 VEDYASCLELRSEECHVIENSGDDPGVLIMQLLIDNISRPAPNVTHLLLKFNLETSIERT 960
+EDYA+CLE R EE V+ENS DD GVLIMQLL+DNI+RPAP++THLLLKF+L+ +E T
Sbjct: 901 IEDYAACLEGRLEEGEVVENSCDDLGVLIMQLLVDNINRPAPSITHLLLKFDLDAPVEGT 960
Query: 961 ILQPKFHYSCLKVVLEILEKLSNPEVNALLFEFGFQLLYELCLDPLTSGPVMDLLSNKKY 1020
+LQPKFHYSCLKV+LE+LEKL NP++N LLFEFGFQLL EL LDPLTSGP MDLLS+KKY
Sbjct: 961 VLQPKFHYSCLKVILEMLEKLPNPDINFLLFEFGFQLLCELNLDPLTSGPTMDLLSSKKY 1020
Query: 1021 YFFVKHLDTIGVVPLPKRN-NHTLRVSSLHQRAWLLKLLAIELHAADLSSPIHREACQSI 1080
FF++HLDTIGV LPKR+ + LR+SSLHQRAWLLKLLAI LH SS H EACQSI
Sbjct: 1021 QFFLQHLDTIGVATLPKRSGSQALRISSLHQRAWLLKLLAIALHTGSGSSSAHLEACQSI 1080
Query: 1081 LAHLYGQEIDVGSVPVFSLQHHVVD----PGTRTMSKSKALELLEVVQFRTPDTSIKLPQ 1140
L+HL+G+E+ + FS + D GT ++SKSKAL LLE++QFR+PD S++LPQ
Sbjct: 1081 LSHLFGREVTEAANEPFSSSTYPQDGLDYAGTSSISKSKALALLEILQFRSPDASMQLPQ 1140
Query: 1141 IVSNLKYELLTKDILGNPSTSQKGGIYHYSERGDRLIDLTSFCDKLWQKFNADNPQLNNV 1200
IVS+LKY+ L +DILGN TS G IY+YSERGDRLIDL+SF +KLWQK ++ P +++
Sbjct: 1141 IVSSLKYDSLVEDILGNRDTSVSGSIYYYSERGDRLIDLSSFSNKLWQKLHSGFPLVDSF 1200
Query: 1201 GREAELEEVKETIQQFLRWGWKYNKNLEEQAAQLHMLTSWSQTIEVTVSRRISSLENRSD 1260
AEL EV+ETIQQ L+WGWKYN+NLEEQAAQLHML WSQ +EV+ RRISSL+NRS+
Sbjct: 1201 PNVAELSEVRETIQQLLKWGWKYNRNLEEQAAQLHMLAGWSQIVEVSACRRISSLDNRSE 1260
Query: 1261 ILFQVLDASLSASASPDCSLKMAYLLCQVALTCMAKLRDERYSSPGGLNADSVSCLDIIM 1320
IL+++LDASLSASASPDCSLKMA++L QVALTC+AKLRD+R+S G L++D+V+CLD++M
Sbjct: 1261 ILYRILDASLSASASPDCSLKMAFVLTQVALTCIAKLRDDRFSFQGALSSDTVTCLDVMM 1320
Query: 1321 VKQISNGACHSILLKLIMAILRNESSEALRRRQYALLLSYLQYCQNMLDPDVPTTILQVL 1380
VK +S GACHS+L KL+MAILR+ESSE+LRRRQYALLLSY QYCQ+M+ DVPT+++Q L
Sbjct: 1321 VKHLSTGACHSVLFKLVMAILRHESSESLRRRQYALLLSYFQYCQHMIALDVPTSVVQFL 1380
Query: 1381 LQNEQDGEDVDLQKIDKDQAELAHANFSILRKEAQSILNVVIKDATQGSEPGKTISLYVL 1440
L NEQDGED+D+QKIDK+QA+LA ANF I++KEAQ IL++VIKDA+QGSE GKTISLYVL
Sbjct: 1381 LLNEQDGEDLDIQKIDKEQADLARANFFIIKKEAQGILDLVIKDASQGSEFGKTISLYVL 1440
Query: 1441 DALICIDHDRFFLNQLQNRGFLKSCLVSISNVSLQDGVHSFDSLQRACTLEAELALLSRI 1500
+AL+CIDH+R+FL+QLQ+RGF++SCL SISN+S QDG H +S QRACTLEAELALL RI
Sbjct: 1441 EALVCIDHERYFLSQLQSRGFIRSCLGSISNISYQDGTHLLESQQRACTLEAELALLLRI 1500
Query: 1501 SHKYGKFGAQLLFSTGALEHLDSCRAINLQGNLRWVDMKPHRDVAGNFNKQQAIVTPILR 1560
SHKYGK G Q+LFS GALEH+ SCRAI+ +GN+R VDMK DV N KQ+ I+T +LR
Sbjct: 1501 SHKYGKSGGQVLFSMGALEHIASCRAISFKGNMRRVDMKLQSDVGYNVQKQRTIITAVLR 1560
Query: 1561 LLFSMTSLVDTSEFFEVKNKIVREAIDFIKRHQRVFDQILGEDVSEADDSTLEQINLLVA 1620
L+F++TSLV+TSEFFE +NKIVR+ ++FIK HQ +FDQ+L ED ++ADD +EQI L V
Sbjct: 1561 LVFALTSLVETSEFFEGRNKIVRDVVEFIKGHQSLFDQLLREDFTQADDLLMEQIILAVG 1620
Query: 1621 SLSKVWPYEETDEYGFVQSLFQLMHSLFSRELDSRTPGPAVKLLKNRRSSELHSIQLNFS 1680
LSKVWP+EE D YGFVQ LF +M LF P +L + SEL QL FS
Sbjct: 1621 ILSKVWPFEENDGYGFVQGLFDMMSKLF-------IASPIKSILS--QGSELKLSQLRFS 1680
Query: 1681 LISYLYFLVTRKSLRLQVSGTSSSHNSPVRSQRPSLDLLGTLLNSTTITLEKAAEERLLL 1740
L SYLYFLVT+ SLRLQVS S +S + ++P+L LL +LL+ T +LE+AAE++ LL
Sbjct: 1681 LTSYLYFLVTKNSLRLQVS--DDSLDSSTKLRQPTLLLLASLLSHVTDSLERAAEKKSLL 1740
Query: 1741 LNKIRDINELSRQEVEEIIVLCLGEDFASLSDNIQRRRYIAMIEMCKIVGNKSEMITLLL 1800
L+KIRDINELSRQ+V+ II +C +++ + SDNI +RRYIAM+EMC+IVGN+ ++ITLLL
Sbjct: 1741 LHKIRDINELSRQDVDAIIKICDSQEYVTPSDNIHKRRYIAMVEMCQIVGNRDQLITLLL 1800
Query: 1801 PLAEYVLNVMLIHFQDSSVIPDGNANIKAIAYHGELESGHEISSLCGKLIPVLERLELLS 1860
LAE+VLN++LIH QD SV ++N + +Y + E++ LCGKL P ++RL LL+
Sbjct: 1801 QLAEHVLNIILIHLQDRSV----SSNERG-SYGSKSHIQQEVTDLCGKLSPTIDRLALLN 1836
Query: 1861 ENKIGENLKVFGRLVSSVKEVAIQK 1880
E K+G NLKVF RL ++VKE+AIQK
Sbjct: 1861 EGKVGHNLKVFQRLATTVKEMAIQK 1836
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
F4KBW6 | 0.0e+00 | 64.19 | Nuclear pore complex protein NUP205 OS=Arabidopsis thaliana OX=3702 GN=NUP205 PE...
Q92621 | 2.5e-60 | 22.94 | Nuclear pore complex protein Nup205 OS=Homo sapiens OX=9606 GN=NUP205 PE=1 SV=3
Match Name | E-value | Identity | Description
A0A6J1FI96 | 0.0e+00 | 100.00 | nuclear pore complex protein NUP205 isoform X1 OS=Cucurbita moschata OX=3662 GN=...
A0A6J1FDF5 | 0.0e+00 | 99.95 | nuclear pore complex protein NUP205 isoform X2 OS=Cucurbita moschata OX=3662 GN=...
A0A6J1HLX9 | 0.0e+00 | 98.94 | nuclear pore complex protein NUP205 isoform X1 OS=Cucurbita maxima OX=3661 GN=LO...
A0A6J1HKK4 | 0.0e+00 | 98.94 | nuclear pore complex protein NUP205 isoform X2 OS=Cucurbita maxima OX=3661 GN=LO...
A0A6J1CPX3 | 0.0e+00 | 91.28 | nuclear pore complex protein NUP205 OS=Momordica charantia OX=3673 GN=LOC1110130...
Match Name | E-value | Identity | Description
AT5G51200.1 | 0.0e+00 | 64.19 | Protein of unknown function (DUF3414)
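
The Identity column in the tables above matches the percentage printed on each HSP summary line, which is the number of identical residues divided by the alignment length. As a quick consistency check, the sketch below parses such a summary line and recomputes both percentages. The helper names `parse_hsp_stats` and `recomputed_pct` are hypothetical and are not part of BLAST or of this database. The regular expression is an assumption based only on the line layout shown on this page, and it accepts both the "Postives" and "Positives" spellings.

```python
import re

# Pattern for the HSP summary line as it appears on this page (an assumption
# about the layout, not an official BLAST output specification).
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return (count, alignment_length, reported_percent) for identity and positives."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identity": (int(ident), int(ident_len), float(ident_pct)),
        "positives": (int(pos), int(pos_len), float(pos_pct)),
    }


def recomputed_pct(count, length):
    """Percentage of aligned positions, rounded to two decimals like the report."""
    return round(100.0 * count / length, 2)


if __name__ == "__main__":
    # Summary line copied from the Arabidopsis (TAIR 10) hit above.
    line = "Identity = 1210/1885 (64.19%), Positives = 1504/1885 (79.79%), Query Frame = 0"
    for name, (count, length, reported) in parse_hsp_stats(line).items():
        print(name, reported, recomputed_pct(count, length))
```

Comparing the recomputed value with the reported one is a cheap way to catch copy or extraction errors before the numbers are reused elsewhere.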