Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
CTCGATCGCACTGGATTACTCCGATCTCCCGCCGGAGTCTCCCGCTTTCAGCTTTCATTTCTTTCACTTTCACAATCACAAGCCAAAAAGCTTAAACCCTAAACCCCGAAATCCAAAATCTTCTTCTTTTTCTTCATTTGATCTTTGTATTATGCAAAAATGTTCTCTCCTGGTACGAAGAGACGCAATTTAAGCTCTCGGACTGACCGAAGTTCGGCGACAGCCGTGTCCGATTCTCCGATTACACCGCTCTCCGCCGCCCGGAAGCCAGTTCTCGATAATCTCGTTCCCAACCGCCCCGGCACCGGCACTCCGGCTCCGTGGGCTCCTCGCTTGTCCGTACTTGCCAGGTATTATTTATTGTTACAGCTTTAATGTTGCTCATTTCAGTGTTTCAACTCTGTAATGATGTTCGATTCGTGATTGTGTTCGTTTTAGGTGTAGTTAAGTTCGTTTCTGTGGGTTATAGATAACAACTTAATGGAGCCGAACGGTTATTTTCATTACCATGGGATGTCACTTTTGCGGTTCTTTCCTGTGTCTGGTAATTGGAGTGGGTTGATGGCTTTTTAGTGGTATTTTGATAATTGTTGGAATGCGTTCTTCAAGAATAATCTGACGAAGTATAATGTGCCTTCATTTTTTTCCTTTTCTCTCTATTGATTTGTGAGAATGGAGAGTGCTTGTAAAATTATGTGTTGAGAGATGCTTTATTAGTTTCCTTAAGCTGGCACTAATGAATGAGTTACTCGTAAAATCTAAACTTACCTACCCTTCTCGAGTTGACTATAACCACAACTTCTCATTCACTCATCAATTGCTTCTGGCAGCACAACTTACCCCCATGAATGTTTAATTAAATCTTTTTATCTTCCTAGAATTTCACCAGCTAATAGAAGTTCTAAGGAAGATGAGACAGATCCAGTCAGGCCTGTCTATGTTGGAGAGTTCCCCCAAGTGGTTCGTGATGAGCAGGCCAGTCTGGTACAGCAGTTTGCCACAAGTAATGATGTTCAATGCAGCAACGAATAGTTTATAGTTGTTGATGCTACTTTTCCTGGGATAAATGATTGATTTTTTTCCTTGCAGGTGGTGCAAGCATGTCGGGTGGAATGGATGCTGAAACATCCCTTGCTTGGATTATATGCCGAGACAAACTCTTTCTTTGGACATATTTGTTACCTGTGGCCACCATGAAGTGTGTCGTTCGTGAACTTCCGAAAAGAATTTTAGAAAGCAAAGATATTGGTAGAAATGACAGCGATCATTGGTTGCTCAGTATTGTTAGTTGGGATAACCAAAATCAAAGTTCAAGAAAGTCAGCTAAACATCAAAGTTCTGTTGGCATTATAATTTGTAATAAGAAAACAGGAGCTGTTGTGTATTGGCCTGATATCTTCTCGGATGGAGGAACTACTCCAGTTACCTGTCTGACATCTTCTAATGAGCCAGCTGTGATTTCTTCAATTTTTGATGGGAAGAGCACTTCTCATAGGCATCGAAGTTTGAATAAACCAAGGACATTCAACTCTTTGATTGCCTCTGCAGTCCCCGAGTCCCAGTACGTTTGTGTTGCCATTGCATGTAGTTCAAATGGCCAGCTTTGGCAGTACCGCTGCAGTCCTATGGGAATTCAATGTACTGAAGTACCTCAGGATATATGTAGTATTCGCAGTCAAGAGGATGGTAGCAATCAACATTTTGCGAGTGATGGGTATCCTAGGTCTCTAACTTGGTCTCGTTCTCATCTTCAACCAGATAAATTTAACAGAAAGTTTTTGCTTCTGACAGATCACGAGATACAGTGTTTTAGTCTTAAACTTTTTCCTGACGTACAAGTGTCCAAGATCTGGTCTTATGAAATTGTCGGTACAGATAGTGATTTGGGTATTAAAAAAGACCTAGCTGGTCAGAAAAGGATCTGGCCACTTGATTTGCAGGAAGATGAACGAGGTGCAGTTATCACCATTTTAGTTGCTACATTCTGTAAGGATCGCATT
AGTAGTTCAAGTTATATACAATATTCCCTTCTTACCTTGCAATACAAATCTGGGGCAGGAATAGAGGCTAGTGGTGACAAGAGGATATTGGAGAAAAAGGCTCCAATCCAAGTAATAATTCCAAAAGCAAGAGTAGAAAATGAAGATTTTTTGTTCTCCATGAGACTTAGGGTTGGGGGAAAGCCTTCGGGATCTGCCCTCATTCTTTCAGGTGATGGAACGGCAACTGTCTCCCATTACTATAGAAGCTCCACTCTACTATATCAATTTGACTTACCTTATGACGCTGGTAAAGTATTAGATGCTTCAGTTCTTCCATCTACGGAGCATGGTGAAGGAGCATGGGTTGTTCTAACTGAGAAAGCAGGAATATGGGCAATACCCGTTAAAGCTATTGTCCTTGGTGGAGTTGAACCTCCTGAACGAAGTTTGTCACGTAGGGGGAGTTCAAATGAACGATCTGTGCAAGATGACACAAGAAACCTCAATTTGTCTGGCAACATTGCTAGTACAAGGGGCAGTTTTGAGGTGCAGGATGTCGTGGACAAGAAAAAGACTACTGTGGCTGGGATTGGACATCGAGTGGCTCGAGATGAAGAAGCAGAAGCTTTGCTGCGTCAGCTTTTCCATGATTTCCTGTCATCTGGTCAAGTAAATAACTCTTTTGAGAAGCTAAAGAATTCTGGTGCATTTGATAGGGAGGATGAAACAAATGTTTTTACACGGATGAGCAAGTCAATTATTGATACTTTGGCCAAACATTGGACTACAACGAGAGGTGCTGAGATTGTGTCGATGACCGTTGTTTCTACTCAGCTGATGGACAAGCAGCAGAAGCACGAAAAGTTTCTCCAGTTTCTTGCTTTGTCAAAGTGCCACGAAGAGTTGTGTTCTAGACAAAGTATGTTTTGACTCATTTAATGCAAATTATTTTTGTGGTATGGCGGTAGATATCTATGGCTGTTAATTTTTCACCTCATATGTATTTTCTGTTATCATGTTTGTTATATTGGCATATATGGTCCAGATGCTCCTTCTGTGTCTTCGGATGTATATTCTAGGGCAATAAATTTCTATTTTCTAATTATTCCTGGCACGGGTTTGAAGTGACGATGGAGTTGGTTTGGTTTGGGACTCAATTTTTGTTATTGGATCGGGTAGGTAATCTTGTGAAGAGAAAATATCAAATTCAAACTACTGAACTTAAACTATGGTCCTTATATGTGTAGGAAATTCTTTGCAAATTATCCTGGAGCATGGAGAAAAGCTTGCTGCTATGATTCAGTTAAGGGAACTGCAGAACACAATTTGCCAAAACCGTTCGACTGGACTTGGTTCCTCAAGTTCTAATTCAGAAACTCAGATGTCAGGCGGTCTATGGGATCTTATTCAATTTGTTGGTGAGCGAGCTCGGCGAAACACTGTTCTTCTAATGGATAGAGATAATCCTGAAGTGTTTTACAGTAAAGTTTCTGAGCTAGAAGAAGTATTTTATTGCTTAGAAAGGCAGCTAGATTATGTGGTAAGTGCAGATGATTCATGTGTGGTCCAAAACCAGAGGGCTTGTGAACTCTCAGAAGCATGTGTTACTATCATGCGTGCAGCTGTGCATTACAGAAATGAGCATCAGTTATGGTATCCACCATCTGAAGGCTTAACACCTTGGTATAGTCAACCGGTTGTACGGAATGGGCTATGGCACATTGCTTCTCTCATGCTTCAGCTTTTAAATGAAGTTTCTGAGCTTGATACATCTGCAAAATCTGATTTATATTGCTGCCTGGAACTATTGACTGAAGTTCTTCTTGAGGCACATGCTGGTGCTGTCACTGCAAAGGCAGAGCGAGGAGAAAAAACTGAAAGTCTTTTACATGAATTTTGGAGTAGAAGGGATGCACTCCTCAGCTCTCTTTATAAAAGAGTAAAAGATTCTGTGGAAGCGGAGCTTAAGGTTATTTTCTTGTGTGATGTGTGCTTCCTTGTGATTGAGGTTCTAGATC
TGAATCAATATTAACACATGTTTTTGCAGGATTTTAGGGGAGGTTTGGTGGAGAAAAATGTAGAAATCCTCAGAAAGAATTCGTCACGTCTGTTATCTGTTGCAAAACAGCATGAATGCTACAGCATTTTATGGAATATCTGCTGTGACCTTAATGATCCAGAGCTACTCAGGAAACTTATGGTATGTATTTTTCTCTGATTTCTATGCTTCTACTATACCATTTGATTTTTGCAATTTTTTTAATGGTACTCTTTGTATCATATGGTGCTTGCATTTGATTGTTTATCATGTATGGTGATGCAGCATGAGAGCATGGGGCCTAAAGGAGGGTTTAGCTACTTTGTTTTTAAAAGACTTTATGAAAATAAACAGTTCTCCAAGCTTTTAAGACTCGGAGAGGAGTTCCATGAAGAACTGTTGATATTTTTGAAGGAGCATCCGGATCTTCTGTGGCTTCATGAGCTTTTCCTTCATCAATTTTTCTCAGCTTCAGACACTCTCCATGAATCAGCTCTGTCTGGAGATGACAGGCTTGTTTCGCCCCCTGAAGTTGAGGGAGAATTTGAATCTGACCATTGTAATTTTGAATTAAGATTGGCAGACAGAAAACGCCTTTTATACCTCTCGAAGATAGCTTTAATGGCAGGTACTGTTTTGCTTTCGTCATGTCTGCATTTATCCTTATGCTTTCCCGCAGTTCTTTGTTCTTTTCACTGTTTACATCCTTTCTTACGTTCTTAACATATGTATTTGTGCATTAAGTCAGATAGAAATATAATTGTGCCATGTGGAGTGGAGATTGTTACTCCTGAGTTCTCTTCTACCCGTGTGATTGCTGTATAAAATTTGCTAGAGGATGCTTCTTTTACAATTTTATATAAGAAACCTATAATTCAGAAATAATTAGTATGGAAGGGATGGAAATAGTCAATCTTAGCTGACTAGCTGACTTCATTAGGTGTTGCTAATTCCTACCTCCATGCTAATTGGGAATGTGCTCCCCATTTCACATGCGTATGTATAATCCCAAAATGGCCTAATAGTTTAACTTTATGCTATGTGTCATATTTGAGAATTAGTGATCGAATGTTTATCTTCATTAATTATTTACATTTGTAATTCAGCTTCTCCTAAGTCCTAACCTAGGGTTTGTAGGTTTTTTAAAAGCCTACGACAGTGATGAAAAAAAATGGCAGAAATACTATGTAGATCATCATTTGAGATATATCCTCAATATATGCTAGTATCTTTCTTTACTGATATGTAATGGTCGTTGGCGGCTTCTGAAACCAAATTGTCTTTATTTCTACTGTTCTAATATAGCAGCAGCAGGTCAAAATGCTGAGTACGAGAGTAAGTTGATGCGTATTGAGGCTGATGCAAAGATACTTAAATTACAGGTTGGTTTTCTTTATGATATAGTGAGCTTGATTGAACCTGCTTAAATCCTTATATTTTTTCAGTTCTGATTTTTGGATGATTTCTTCTCATCCTATGATATATAACCTTCATTGAATTATTGACTTTTGTCTTATTCTGATCTTAAACTATGGAACTTATATTTTCTTAATATGTTATGCTTTAGAAGAACCTATACTTTATTACTAAACTTTTTTTAACTATTAGAGATTGAAGAAGAGAAGGGTCATTTTATAATTATCAGAGATTAGTTGAGATACGTGCAAGCTAGCTTGAACGCTTATGGATATCAAATATATATCTTTTTTGTTATGGAAATAATTCATAGGATTGAGCTTAAAAAATATGCTGGTCTTCCTTTTTAGGCACGGGGAGGACTACCCAAAATATTTGAATGTGATTGGTGTAACTAATCAAGCTCTTCTCCCTTTTGATCTTATGAACAATAAAGTTTCTAGCTTACCTTGTTCATTGCCTGGCGGTTCCATACCTGATGGTTGATTCGTCCTCTCTTGTAGGAAGAAATTTTAGATCTCTATCATGCCGTTGAAACAGAGCAGCAGCTCGACTGCAAGCTCC
TCCACCCCGACGGCCTTATTCAACTATGTCTCAAAGGTGAAAACCCAGCACTCTCATTGATAGCCTTTGACATATTTGCCTGGACCAGCACCTCATTCCGAGAAACCCACCGAAAGCTTTTGGAGGAATGCTGGAAAAACGCTGCGGATCAAGATGACTGGAATCGACTATATCAAGTATCAGTAGCAGAAGGATGGAGCGACGAGGAGACACTAAAAAAGCTGAGAGAAACAACTCTGTTCAAGGCTTCAAGCAGGTGTTATGGACATGGAGCAACAGAAGTATTTGGAGATGGATTTGATGTGGCATTACCCCTAAGACAAGAAAATGAAATTGCTGAAAGTTCCTCATTGAAGAACTGTGCAGGTTCTGTAGAGGCAATTCTGATGCAACACAAGCATTTTCCTGAAGCAGGGAAGTTAATGGTGACTGCCATTATGTTGGGACTGGATTTGGAAGATGATCCTATTTTAATGGAATAAATCACTCTACAAACACTATCCAATTTTTTACTCTAACTTTCTTGAATTCTGCATCCTCCAAATTCTGTATATTTTTTTAGGTTTGATTGAGATTTTTCGAAAATGACGTGTCACTATGTCGGAATTGTATCAGTGTCTTTCTTGTGTTTGCATTCATGCTTCCATAGGAGGAAATAATTGTAGCATCATCATATAGATGATAGAGGTGATGGAAATGGTCAATATTTGTTCAAAATGTCAATATTTTGTTTTCAGAATTTGGCTTCCATAAGTACATATTATATTTTAGATTAATTATCCGTGTAGTTGTAATTTGTAATTTCTAACTTCAAAAAATAAAAAAGGTTGATTTTCACACGCAACCGCGTTACCAAAAATAAATTGGC
mRNA sequence
CTCGATCGCACTGGATTACTCCGATCTCCCGCCGGAGTCTCCCGCTTTCAGCTTTCATTTCTTTCACTTTCACAATCACAAGCCAAAAAGCTTAAACCCTAAACCCCGAAATCCAAAATCTTCTTCTTTTTCTTCATTTGATCTTTGTATTATGCAAAAATGTTCTCTCCTGGTACGAAGAGACGCAATTTAAGCTCTCGGACTGACCGAAGTTCGGCGACAGCCGTGTCCGATTCTCCGATTACACCGCTCTCCGCCGCCCGGAAGCCAGTTCTCGATAATCTCGTTCCCAACCGCCCCGGCACCGGCACTCCGGCTCCGTGGGCTCCTCGCTTGTCCGTACTTGCCAGAATTTCACCAGCTAATAGAAGTTCTAAGGAAGATGAGACAGATCCAGTCAGGCCTGTCTATGTTGGAGAGTTCCCCCAAGTGGTTCGTGATGAGCAGGCCAGTCTGGTACAGCAGTTTGCCACAAGTGGTGCAAGCATGTCGGGTGGAATGGATGCTGAAACATCCCTTGCTTGGATTATATGCCGAGACAAACTCTTTCTTTGGACATATTTGTTACCTGTGGCCACCATGAAGTGTGTCGTTCGTGAACTTCCGAAAAGAATTTTAGAAAGCAAAGATATTGGTAGAAATGACAGCGATCATTGGTTGCTCAGTATTGTTAGTTGGGATAACCAAAATCAAAGTTCAAGAAAGTCAGCTAAACATCAAAGTTCTGTTGGCATTATAATTTGTAATAAGAAAACAGGAGCTGTTGTGTATTGGCCTGATATCTTCTCGGATGGAGGAACTACTCCAGTTACCTGTCTGACATCTTCTAATGAGCCAGCTGTGATTTCTTCAATTTTTGATGGGAAGAGCACTTCTCATAGGCATCGAAGTTTGAATAAACCAAGGACATTCAACTCTTTGATTGCCTCTGCAGTCCCCGAGTCCCAGTACGTTTGTGTTGCCATTGCATGTAGTTCAAATGGCCAGCTTTGGCAGTACCGCTGCAGTCCTATGGGAATTCAATGTACTGAAGTACCTCAGGATATATGTAGTATTCGCAGTCAAGAGGATGGTAGCAATCAACATTTTGCGAGTGATGGGTATCCTAGGTCTCTAACTTGGTCTCGTTCTCATCTTCAACCAGATAAATTTAACAGAAAGTTTTTGCTTCTGACAGATCACGAGATACAGTGTTTTAGTCTTAAACTTTTTCCTGACGTACAAGTGTCCAAGATCTGGTCTTATGAAATTGTCGGTACAGATAGTGATTTGGGTATTAAAAAAGACCTAGCTGGTCAGAAAAGGATCTGGCCACTTGATTTGCAGGAAGATGAACGAGGTGCAGTTATCACCATTTTAGTTGCTACATTCTGTAAGGATCGCATTAGTAGTTCAAGTTATATACAATATTCCCTTCTTACCTTGCAATACAAATCTGGGGCAGGAATAGAGGCTAGTGGTGACAAGAGGATATTGGAGAAAAAGGCTCCAATCCAAGTAATAATTCCAAAAGCAAGAGTAGAAAATGAAGATTTTTTGTTCTCCATGAGACTTAGGGTTGGGGGAAAGCCTTCGGGATCTGCCCTCATTCTTTCAGGTGATGGAACGGCAACTGTCTCCCATTACTATAGAAGCTCCACTCTACTATATCAATTTGACTTACCTTATGACGCTGGTAAAGTATTAGATGCTTCAGTTCTTCCATCTACGGAGCATGGTGAAGGAGCATGGGTTGTTCTAACTGAGAAAGCAGGAATATGGGCAATACCCGTTAAAGCTATTGTCCTTGGTGGAGTTGAACCTCCTGAACGAAGTTTGTCACGTAGGGGGAGTTCAAATGAACGATCTGTGCAAGATGACACAAGAAACCTCAATTTGTCTGGCAACATTGCTAGTACAAGGGGCAGTTTTGAGGTGCAGGATGTCGTGGACAAGAAAAAGACTACTGTGGCTGGGATTGGACATCGAGTGGCTCGAGATGAAGAAGCAGAAGCTTTGCT
GCGTCAGCTTTTCCATGATTTCCTGTCATCTGGTCAAGTAAATAACTCTTTTGAGAAGCTAAAGAATTCTGGTGCATTTGATAGGGAGGATGAAACAAATGTTTTTACACGGATGAGCAAGTCAATTATTGATACTTTGGCCAAACATTGGACTACAACGAGAGGTGCTGAGATTGTGTCGATGACCGTTGTTTCTACTCAGCTGATGGACAAGCAGCAGAAGCACGAAAAGTTTCTCCAGTTTCTTGCTTTGTCAAAGTGCCACGAAGAGTTGTGTTCTAGACAAAGAAATTCTTTGCAAATTATCCTGGAGCATGGAGAAAAGCTTGCTGCTATGATTCAGTTAAGGGAACTGCAGAACACAATTTGCCAAAACCGTTCGACTGGACTTGGTTCCTCAAGTTCTAATTCAGAAACTCAGATGTCAGGCGGTCTATGGGATCTTATTCAATTTGTTGGTGAGCGAGCTCGGCGAAACACTGTTCTTCTAATGGATAGAGATAATCCTGAAGTGTTTTACAGTAAAGTTTCTGAGCTAGAAGAAGTATTTTATTGCTTAGAAAGGCAGCTAGATTATGTGGTAAGTGCAGATGATTCATGTGTGGTCCAAAACCAGAGGGCTTGTGAACTCTCAGAAGCATGTGTTACTATCATGCGTGCAGCTGTGCATTACAGAAATGAGCATCAGTTATGGTATCCACCATCTGAAGGCTTAACACCTTGGTATAGTCAACCGGTTGTACGGAATGGGCTATGGCACATTGCTTCTCTCATGCTTCAGCTTTTAAATGAAGTTTCTGAGCTTGATACATCTGCAAAATCTGATTTATATTGCTGCCTGGAACTATTGACTGAAGTTCTTCTTGAGGCACATGCTGGTGCTGTCACTGCAAAGGCAGAGCGAGGAGAAAAAACTGAAAGTCTTTTACATGAATTTTGGAGTAGAAGGGATGCACTCCTCAGCTCTCTTTATAAAAGAGTAAAAGATTCTGTGGAAGCGGAGCTTAAGGATTTTAGGGGAGGTTTGGTGGAGAAAAATGTAGAAATCCTCAGAAAGAATTCGTCACGTCTGTTATCTGTTGCAAAACAGCATGAATGCTACAGCATTTTATGGAATATCTGCTGTGACCTTAATGATCCAGAGCTACTCAGGAAACTTATGCATGAGAGCATGGGGCCTAAAGGAGGGTTTAGCTACTTTGTTTTTAAAAGACTTTATGAAAATAAACAGTTCTCCAAGCTTTTAAGACTCGGAGAGGAGTTCCATGAAGAACTGTTGATATTTTTGAAGGAGCATCCGGATCTTCTGTGGCTTCATGAGCTTTTCCTTCATCAATTTTTCTCAGCTTCAGACACTCTCCATGAATCAGCTCTGTCTGGAGATGACAGGCTTGTTTCGCCCCCTGAAGTTGAGGGAGAATTTGAATCTGACCATTGTAATTTTGAATTAAGATTGGCAGACAGAAAACGCCTTTTATACCTCTCGAAGATAGCTTTAATGGCAGCAGCAGCAGGTCAAAATGCTGAGTACGAGAGTAAGTTGATGCGTATTGAGGCTGATGCAAAGATACTTAAATTACAGGAAGAAATTTTAGATCTCTATCATGCCGTTGAAACAGAGCAGCAGCTCGACTGCAAGCTCCTCCACCCCGACGGCCTTATTCAACTATGTCTCAAAGGTGAAAACCCAGCACTCTCATTGATAGCCTTTGACATATTTGCCTGGACCAGCACCTCATTCCGAGAAACCCACCGAAAGCTTTTGGAGGAATGCTGGAAAAACGCTGCGGATCAAGATGACTGGAATCGACTATATCAAGTATCAGTAGCAGAAGGATGGAGCGACGAGGAGACACTAAAAAAGCTGAGAGAAACAACTCTGTTCAAGGCTTCAAGCAGGTGTTATGGACATGGAGCAACAGAAGTATTTGGAGATGGATTTGATGTGGCATTACCCCTAAGACAAGAAAATGAAATTGCTGAAAGTTCCTCATTGAAGA
ACTGTGCAGGTTCTGTAGAGGCAATTCTGATGCAACACAAGCATTTTCCTGAAGCAGGGAAGTTAATGGTGACTGCCATTATGTTGGGACTGGATTTGGAAGATGATCCTATTTTAATGGAATAAATCACTCTACAAACACTATCCAATTTTTTACTCTAACTTTCTTGAATTCTGCATCCTCCAAATTCTGTATATTTTTTTAGGTTTGATTGAGATTTTTCGAAAATGACGTGTCACTATGTCGGAATTGTATCAGTGTCTTTCTTGTGTTTGCATTCATGCTTCCATAGGAGGAAATAATTGTAGCATCATCATATAGATGATAGAGGTGATGGAAATGGTCAATATTTGTTCAAAATGTCAATATTTTGTTTTCAGAATTTGGCTTCCATAAGTACATATTATATTTTAGATTAATTATCCGTGTAGTTGTAATTTGTAATTTCTAACTTCAAAAAATAAAAAAGGTTGATTTTCACACGCAACCGCGTTACCAAAAATAAATTGGC
Coding sequence (CDS)
ATGTTCTCTCCTGGTACGAAGAGACGCAATTTAAGCTCTCGGACTGACCGAAGTTCGGCGACAGCCGTGTCCGATTCTCCGATTACACCGCTCTCCGCCGCCCGGAAGCCAGTTCTCGATAATCTCGTTCCCAACCGCCCCGGCACCGGCACTCCGGCTCCGTGGGCTCCTCGCTTGTCCGTACTTGCCAGAATTTCACCAGCTAATAGAAGTTCTAAGGAAGATGAGACAGATCCAGTCAGGCCTGTCTATGTTGGAGAGTTCCCCCAAGTGGTTCGTGATGAGCAGGCCAGTCTGGTACAGCAGTTTGCCACAAGTGGTGCAAGCATGTCGGGTGGAATGGATGCTGAAACATCCCTTGCTTGGATTATATGCCGAGACAAACTCTTTCTTTGGACATATTTGTTACCTGTGGCCACCATGAAGTGTGTCGTTCGTGAACTTCCGAAAAGAATTTTAGAAAGCAAAGATATTGGTAGAAATGACAGCGATCATTGGTTGCTCAGTATTGTTAGTTGGGATAACCAAAATCAAAGTTCAAGAAAGTCAGCTAAACATCAAAGTTCTGTTGGCATTATAATTTGTAATAAGAAAACAGGAGCTGTTGTGTATTGGCCTGATATCTTCTCGGATGGAGGAACTACTCCAGTTACCTGTCTGACATCTTCTAATGAGCCAGCTGTGATTTCTTCAATTTTTGATGGGAAGAGCACTTCTCATAGGCATCGAAGTTTGAATAAACCAAGGACATTCAACTCTTTGATTGCCTCTGCAGTCCCCGAGTCCCAGTACGTTTGTGTTGCCATTGCATGTAGTTCAAATGGCCAGCTTTGGCAGTACCGCTGCAGTCCTATGGGAATTCAATGTACTGAAGTACCTCAGGATATATGTAGTATTCGCAGTCAAGAGGATGGTAGCAATCAACATTTTGCGAGTGATGGGTATCCTAGGTCTCTAACTTGGTCTCGTTCTCATCTTCAACCAGATAAATTTAACAGAAAGTTTTTGCTTCTGACAGATCACGAGATACAGTGTTTTAGTCTTAAACTTTTTCCTGACGTACAAGTGTCCAAGATCTGGTCTTATGAAATTGTCGGTACAGATAGTGATTTGGGTATTAAAAAAGACCTAGCTGGTCAGAAAAGGATCTGGCCACTTGATTTGCAGGAAGATGAACGAGGTGCAGTTATCACCATTTTAGTTGCTACATTCTGTAAGGATCGCATTAGTAGTTCAAGTTATATACAATATTCCCTTCTTACCTTGCAATACAAATCTGGGGCAGGAATAGAGGCTAGTGGTGACAAGAGGATATTGGAGAAAAAGGCTCCAATCCAAGTAATAATTCCAAAAGCAAGAGTAGAAAATGAAGATTTTTTGTTCTCCATGAGACTTAGGGTTGGGGGAAAGCCTTCGGGATCTGCCCTCATTCTTTCAGGTGATGGAACGGCAACTGTCTCCCATTACTATAGAAGCTCCACTCTACTATATCAATTTGACTTACCTTATGACGCTGGTAAAGTATTAGATGCTTCAGTTCTTCCATCTACGGAGCATGGTGAAGGAGCATGGGTTGTTCTAACTGAGAAAGCAGGAATATGGGCAATACCCGTTAAAGCTATTGTCCTTGGTGGAGTTGAACCTCCTGAACGAAGTTTGTCACGTAGGGGGAGTTCAAATGAACGATCTGTGCAAGATGACACAAGAAACCTCAATTTGTCTGGCAACATTGCTAGTACAAGGGGCAGTTTTGAGGTGCAGGATGTCGTGGACAAGAAAAAGACTACTGTGGCTGGGATTGGACATCGAGTGGCTCGAGATGAAGAAGCAGAAGCTTTGCTGCGTCAGCTTTTCCATGATTTCCTGTCATCTGGTCAAGTAAATAACTCTTTTGAGAAGCTAAAGAATTCTGGTGCATTTGATAGGGAGGATGAAACAAATGTTTTTACACGGATGAGCAAGTCAATTATTGATACTTTGGCCAAACATTGGACTACAAC
GAGAGGTGCTGAGATTGTGTCGATGACCGTTGTTTCTACTCAGCTGATGGACAAGCAGCAGAAGCACGAAAAGTTTCTCCAGTTTCTTGCTTTGTCAAAGTGCCACGAAGAGTTGTGTTCTAGACAAAGAAATTCTTTGCAAATTATCCTGGAGCATGGAGAAAAGCTTGCTGCTATGATTCAGTTAAGGGAACTGCAGAACACAATTTGCCAAAACCGTTCGACTGGACTTGGTTCCTCAAGTTCTAATTCAGAAACTCAGATGTCAGGCGGTCTATGGGATCTTATTCAATTTGTTGGTGAGCGAGCTCGGCGAAACACTGTTCTTCTAATGGATAGAGATAATCCTGAAGTGTTTTACAGTAAAGTTTCTGAGCTAGAAGAAGTATTTTATTGCTTAGAAAGGCAGCTAGATTATGTGGTAAGTGCAGATGATTCATGTGTGGTCCAAAACCAGAGGGCTTGTGAACTCTCAGAAGCATGTGTTACTATCATGCGTGCAGCTGTGCATTACAGAAATGAGCATCAGTTATGGTATCCACCATCTGAAGGCTTAACACCTTGGTATAGTCAACCGGTTGTACGGAATGGGCTATGGCACATTGCTTCTCTCATGCTTCAGCTTTTAAATGAAGTTTCTGAGCTTGATACATCTGCAAAATCTGATTTATATTGCTGCCTGGAACTATTGACTGAAGTTCTTCTTGAGGCACATGCTGGTGCTGTCACTGCAAAGGCAGAGCGAGGAGAAAAAACTGAAAGTCTTTTACATGAATTTTGGAGTAGAAGGGATGCACTCCTCAGCTCTCTTTATAAAAGAGTAAAAGATTCTGTGGAAGCGGAGCTTAAGGATTTTAGGGGAGGTTTGGTGGAGAAAAATGTAGAAATCCTCAGAAAGAATTCGTCACGTCTGTTATCTGTTGCAAAACAGCATGAATGCTACAGCATTTTATGGAATATCTGCTGTGACCTTAATGATCCAGAGCTACTCAGGAAACTTATGCATGAGAGCATGGGGCCTAAAGGAGGGTTTAGCTACTTTGTTTTTAAAAGACTTTATGAAAATAAACAGTTCTCCAAGCTTTTAAGACTCGGAGAGGAGTTCCATGAAGAACTGTTGATATTTTTGAAGGAGCATCCGGATCTTCTGTGGCTTCATGAGCTTTTCCTTCATCAATTTTTCTCAGCTTCAGACACTCTCCATGAATCAGCTCTGTCTGGAGATGACAGGCTTGTTTCGCCCCCTGAAGTTGAGGGAGAATTTGAATCTGACCATTGTAATTTTGAATTAAGATTGGCAGACAGAAAACGCCTTTTATACCTCTCGAAGATAGCTTTAATGGCAGCAGCAGCAGGTCAAAATGCTGAGTACGAGAGTAAGTTGATGCGTATTGAGGCTGATGCAAAGATACTTAAATTACAGGAAGAAATTTTAGATCTCTATCATGCCGTTGAAACAGAGCAGCAGCTCGACTGCAAGCTCCTCCACCCCGACGGCCTTATTCAACTATGTCTCAAAGGTGAAAACCCAGCACTCTCATTGATAGCCTTTGACATATTTGCCTGGACCAGCACCTCATTCCGAGAAACCCACCGAAAGCTTTTGGAGGAATGCTGGAAAAACGCTGCGGATCAAGATGACTGGAATCGACTATATCAAGTATCAGTAGCAGAAGGATGGAGCGACGAGGAGACACTAAAAAAGCTGAGAGAAACAACTCTGTTCAAGGCTTCAAGCAGGTGTTATGGACATGGAGCAACAGAAGTATTTGGAGATGGATTTGATGTGGCATTACCCCTAAGACAAGAAAATGAAATTGCTGAAAGTTCCTCATTGAAGAACTGTGCAGGTTCTGTAGAGGCAATTCTGATGCAACACAAGCATTTTCCTGAAGCAGGGAAGTTAATGGTGACTGCCATTATGTTGGGACTGGATTTGGAAGATGATCCTATTTTAATGGAATAA
Protein sequence
MFSPGTKRRNLSSRTDRSSATAVSDSPITPLSAARKPVLDNLVPNRPGTGTPAPWAPRLSVLARISPANRSSKEDETDPVRPVYVGEFPQVVRDEQASLVQQFATSGASMSGGMDAETSLAWIICRDKLFLWTYLLPVATMKCVVRELPKRILESKDIGRNDSDHWLLSIVSWDNQNQSSRKSAKHQSSVGIIICNKKTGAVVYWPDIFSDGGTTPVTCLTSSNEPAVISSIFDGKSTSHRHRSLNKPRTFNSLIASAVPESQYVCVAIACSSNGQLWQYRCSPMGIQCTEVPQDICSIRSQEDGSNQHFASDGYPRSLTWSRSHLQPDKFNRKFLLLTDHEIQCFSLKLFPDVQVSKIWSYEIVGTDSDLGIKKDLAGQKRIWPLDLQEDERGAVITILVATFCKDRISSSSYIQYSLLTLQYKSGAGIEASGDKRILEKKAPIQVIIPKARVENEDFLFSMRLRVGGKPSGSALILSGDGTATVSHYYRSSTLLYQFDLPYDAGKVLDASVLPSTEHGEGAWVVLTEKAGIWAIPVKAIVLGGVEPPERSLSRRGSSNERSVQDDTRNLNLSGNIASTRGSFEVQDVVDKKKTTVAGIGHRVARDEEAEALLRQLFHDFLSSGQVNNSFEKLKNSGAFDREDETNVFTRMSKSIIDTLAKHWTTTRGAEIVSMTVVSTQLMDKQQKHEKFLQFLALSKCHEELCSRQRNSLQIILEHGEKLAAMIQLRELQNTICQNRSTGLGSSSSNSETQMSGGLWDLIQFVGERARRNTVLLMDRDNPEVFYSKVSELEEVFYCLERQLDYVVSADDSCVVQNQRACELSEACVTIMRAAVHYRNEHQLWYPPSEGLTPWYSQPVVRNGLWHIASLMLQLLNEVSELDTSAKSDLYCCLELLTEVLLEAHAGAVTAKAERGEKTESLLHEFWSRRDALLSSLYKRVKDSVEAELKDFRGGLVEKNVEILRKNSSRLLSVAKQHECYSILWNICCDLNDPELLRKLMHESMGPKGGFSYFVFKRLYENKQFSKLLRLGEEFHEELLIFLKEHPDLLWLHELFLHQFFSASDTLHESALSGDDRLVSPPEVEGEFESDHCNFELRLADRKRLLYLSKIALMAAAAGQNAEYESKLMRIEADAKILKLQEEILDLYHAVETEQQLDCKLLHPDGLIQLCLKGENPALSLIAFDIFAWTSTSFRETHRKLLEECWKNAADQDDWNRLYQVSVAEGWSDEETLKKLRETTLFKASSRCYGHGATEVFGDGFDVALPLRQENEIAESSSLKNCAGSVEAILMQHKHFPEAGKLMVTAIMLGLDLEDDPILME
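The coding sequence and the protein sequence listed above are related by translation. The following is a minimal sketch of that step, assuming the standard nuclear genetic code (NCBI translation table 1), which the page itself does not state. The names `CODON_TABLE` and `translate` are illustrative and not part of any database tool.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1), with codons
# enumerated in T, C, A, G order at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        # Codons containing ambiguous bases are reported as 'X'.
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 27 nucleotides of the CDS listed above.
print(translate("ATGTTCTCTCCTGGTACGAAGAGACGC"))  # MFSPGTKRR
```

The printed peptide matches the first nine residues of the protein sequence above. Passing the full CDS (with its line break removed) should reproduce the whole protein under the same assumption.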
Homology
BLAST of MC04g0262 vs. ExPASy Swiss-Prot
Match:
F4IGA5 (Nuclear pore complex protein NUP133 OS=Arabidopsis thaliana OX=3702 GN=NUP133 PE=1 SV=1)
HSP 1 Score: 1407.9 bits (3643), Expect = 0.0e+00
Identity = 753/1326 (56.79%), Positives = 942/1326 (71.04%), Query Frame = 0
Query: 1 MFSPGTKRRNLSSRTDRSSATAV--SDSPITPLSAARKPVLDNLVPNRPGTGTPAPWAPR 60
MFSP TKR SSR +++ V DSP+TP + R +N + +RP TGTPAPWAPR
Sbjct: 1 MFSPLTKRAKQSSRNEKTPRNRVPPPDSPVTPATQNR----NNFISDRPATGTPAPWAPR 60
Query: 61 LSVLARISPANRSSKEDETDPVRPVYVGEFPQVVRDEQASLVQQFATSGASMSGGMDAET 120
LSVLAR+SP N K ++D ++PV+VGEFPQ++RDEQ+ A +SGGMD ET
Sbjct: 61 LSVLARVSPGNNGDKGVDSDQLKPVFVGEFPQLLRDEQS------YPGDACVSGGMDKET 120
Query: 121 SLAWIICRDKLFLWTYLLPVATMKCVVRELPKRIL--ESKDIGRNDSDHWLLSIVSWDNQ 180
L+W I K+F+W++L + + KCVV ELP +L E G D WL+++VSWD
Sbjct: 121 CLSWFITGSKVFVWSHLTTLPSRKCVVLELPVVVLVNEESGSGLQDGKSWLVNVVSWDTS 180
Query: 181 NQSSRKSAKHQSSVGIIICNKKTGAVVYWPDIFSDGGTTPVTCLTSSNEPAVISSIFDGK 240
++ ++++ +S VG+++CN+KT AVVYW DIFS P + +I +G
Sbjct: 181 AGAATRASRSRSPVGVVMCNRKTRAVVYWSDIFSGQEAAP-----AEKARHLIKRQSNGI 240
Query: 241 STSHRHRSLNKPRTFNSLIASAVPESQYVCVAIACSSNGQLWQYRCSPMGIQCTEVPQDI 300
+S S NSLI +AV ++ +C+AIACSSNG+LWQ+ CSP G++ +V +I
Sbjct: 241 RSSRAENS-----DLNSLITTAVAAAERLCIAIACSSNGELWQFTCSPTGVKSNQVQLNI 300
Query: 301 CSIRSQEDGSNQHFASDGYPRSLTWSRSHLQPDKFNRKFLLLTDHEIQCFSLKLFPDVQV 360
S S+GYPRSL W S + +FL+LTD +I CF+++ +PD+ V
Sbjct: 301 SS----------SSVSEGYPRSLIWRFSQGLARESCWEFLMLTDCDIHCFTIEPYPDLTV 360
Query: 361 SKIWSYEIVGTDSDLGIKKDLAGQKRIWPLDLQEDERGAVITILVATFCKDRISSSSYIQ 420
S++W +EIVGTD D GIKKD+A QK+IWPLDLQ D++G VIT+LVAT C DR SSSSY Q
Sbjct: 361 SEVWQHEIVGTDGDSGIKKDIASQKQIWPLDLQVDDQGKVITVLVATICMDRASSSSYTQ 420
Query: 421 YSLLTLQYKSGAGIEASGDKRILEKKAPIQVIIPKARVENEDFLFSMRLRVGGKPSGSAL 480
YSLLTLQ+KS ++++LEK+ PIQVIIPKARVE++DFLFSMRLRVGG+P GSA+
Sbjct: 421 YSLLTLQHKSEMRFADGREEKVLEKQGPIQVIIPKARVEDKDFLFSMRLRVGGRPPGSAI 480
Query: 481 ILSGDGTATVSHYYRSSTLLYQFDLPYDAGKVLDASVLPST-EHGEGAWVVLTEKAGIWA 540
ILSGDGTATV + + SST LY+FDLPYDAGKVLDASVL ST EH GAW VLTEKAG+WA
Sbjct: 481 ILSGDGTATVCYCHGSSTRLYKFDLPYDAGKVLDASVLSSTDEHEYGAWTVLTEKAGVWA 540
Query: 541 IPVKAIVLGGVEPPERSLSRRGSSNERSVQDDTRNLNLSGNIASTRGSFEVQDVVDKKKT 600
IP KA+VLGGVEPPERSLSR+ SSNERS +D+TR + + R + ++Q++ DK
Sbjct: 541 IPEKAVVLGGVEPPERSLSRKNSSNERSTRDETRVTPYGVDRTAGRENSDIQNIEDKGNP 600
Query: 601 TVAGIGHRVARDEEAEALLRQLFHDFLSSGQVNNSFEKLKNSGAFDREDETNVFTRMSKS 660
+ G + ARDEE+EALL QLF FL SG+V+ S EKL SGAFDR+ E NVF R SKS
Sbjct: 601 KM-GFTRQTARDEESEALLGQLFEGFLLSGKVDGSLEKLSQSGAFDRDGEANVFARKSKS 660
Query: 661 IIDTLAKHWTTTRGAEIVSMTVVSTQLMDKQQKHEKFLQFLALSKCHEELCSRQRNSLQI 720
I+DTLAKHWTTTRGAEIV+MTV+S+QL++KQQKHE FL FLALSKCHEELCS+QR+SLQI
Sbjct: 661 IVDTLAKHWTTTRGAEIVAMTVISSQLVEKQQKHENFLHFLALSKCHEELCSKQRHSLQI 720
Query: 721 ILEHGEKLAAMIQLRELQNTICQNRSTGLGSSSSNSETQMSGGLWDLIQFVGERARRNTV 780
ILE+GEKLAAMIQLRELQN I QNRS GS + SE Q+S LWDLIQFVGERARRNTV
Sbjct: 721 ILENGEKLAAMIQLRELQNMINQNRSARFGSPQAGSEDQVSCALWDLIQFVGERARRNTV 780
Query: 781 LLMDRDNPEVFYSKVSELEEVFYCLERQLDYVVSADDSCVVQNQRACELSEACVTIMRAA 840
LLMDRDN EVFYSKVSELEEVFYCL RQL+Y++ AD Q QRACELS ACVTI++ A
Sbjct: 781 LLMDRDNAEVFYSKVSELEEVFYCLNRQLEYIIRADQPLGTQLQRACELSNACVTILQTA 840
Query: 841 VHYRNEHQLWYPPSEGLTPWYSQPVVRNGLWHIASLMLQLLNEVSELDTSAKSDLYCCLE 900
+ Y+NEHQ+WYPP EGL PW+SQ VV NGLW IAS ML LL E S +D SAKSD+Y LE
Sbjct: 841 LDYKNEHQMWYPPLEGLIPWHSQTVVCNGLWCIASFMLHLLTEASRIDISAKSDIYTHLE 900
Query: 901 LLTEVLLEAHAGAVTAKAERGEKTESLLHEFWSRRDALLSSLYKRVKDSVEAELKDFRGG 960
+LTEVLLEA AG+ AK ER E+ + LL+E+W+RRD + SLY++ K+ +EAE++ R
Sbjct: 901 VLTEVLLEACAGSTFAKLEREEENKGLLNEYWTRRDTIFDSLYRQAKEFMEAEIQGIRER 960
Query: 961 LVEKNVEILRKNSSRLLSVAKQHECYSILWNICCDLNDPELLRKLMHESMGPKGGFSYFV 1020
+ +I R S L+S+AK+H Y I+W IC DLND LLR LMHE +GP+GGFSYFV
Sbjct: 961 TEATDEDIFRNRCSNLISIAKRHAGYKIMWKICYDLNDTGLLRNLMHEGVGPQGGFSYFV 1020
Query: 1021 FKRLYENKQFSKLLRLGEEFHEELLIFLKEHPDLLWLHELFLHQFFSASDTLHESALSGD 1080
F++LY+ KQFSKLLRLGEEF +ELLIFLK H DL+WLH++FLHQF SASDTLH ALS D
Sbjct: 1021 FQQLYDMKQFSKLLRLGEEFQDELLIFLKRHSDLVWLHQVFLHQFSSASDTLHTLALSQD 1080
Query: 1081 DRLVSPPEVEGEFESDHCNFELRLADRKRLLYLSKIALMAAAAGQNAEYESKLMRIEADA 1140
+ S VE + + + ADRKR L LSKIA + A ++A+ ESK+ RIEAD
Sbjct: 1081 EE--SMTTVEERTGPEPEDVQPTFADRKRFLNLSKIAYV---ADKDADSESKVKRIEADL 1140
Query: 1141 KILKLQEEILDLYHAVETEQQLDCKLLHPDGLIQLCLKGENPALSLIAFDIFAWTSTSFR 1200
+LKLQEEI E +L P+ LI+ CL + ++ AF++FAWTS+SFR
Sbjct: 1141 NLLKLQEEITKALPNGEARN----RLFRPEELIETCLNIQGRWTAIKAFEVFAWTSSSFR 1200
Query: 1201 ETHRKLLEECWKNAADQDDWNRLYQVSVAEGWSDEETLKKLRETTLFKASSRCYGHGATE 1260
E HR LLEECW+NAADQDDW+R +Q S EGWS+EETL+ LR T LF+AS RCYG
Sbjct: 1201 ENHRSLLEECWRNAADQDDWDRHHQASTNEGWSEEETLQNLRNTALFQASKRCYGPTRVN 1260
Query: 1261 VFGDGFDVALPLRQENEIAESSSLKNCAGSVEAILMQHKHFPEAGKLMVTAIMLGLDLED 1320
F F LPLR+EN ++ SVE +LM HK F EAGKLM+TAIMLG +E+
Sbjct: 1261 TFDGDFAQVLPLRRENP-------EDSTSSVEDVLMSHKDFAEAGKLMLTAIMLGC-VEE 1278
Query: 1321 DPILME 1322
+ I+ E
Sbjct: 1321 EGIVAE 1278
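The Identity and Positives percentages in each BLAST summary line are the match count divided by the alignment length. The short sketch below rechecks that arithmetic using the counts reported on this page. The helper name `percent` is hypothetical and is not part of BLAST.

```python
def percent(matches: int, alignment_length: int) -> str:
    """Return matches/alignment_length as a percentage with two decimals."""
    return f"{100 * matches / alignment_length:.2f}"


# Counts copied from the summary lines on this page.
print(percent(753, 1326))   # Swiss-Prot F4IGA5, identity
print(percent(942, 1326))   # Swiss-Prot F4IGA5, positives
print(percent(1320, 1321))  # XP_022135834.1, identity
print(percent(1318, 1321))  # XP_022135835.1, identity
```

Note that the denominator is the alignment length including gap columns (1326 for the Arabidopsis hit), not the query length.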
BLAST of MC04g0262 vs. NCBI nr
Match:
XP_022135833.1 (nuclear pore complex protein NUP133 isoform X1 [Momordica charantia])
HSP 1 Score: 2628 bits (6812), Expect = 0.0
Identity = 1321/1321 (100.00%), Positives = 1321/1321 (100.00%), Query Frame = 0
Query: 1 MFSPGTKRRNLSSRTDRSSATAVSDSPITPLSAARKPVLDNLVPNRPGTGTPAPWAPRLS 60
MFSPGTKRRNLSSRTDRSSATAVSDSPITPLSAARKPVLDNLVPNRPGTGTPAPWAPRLS
Sbjct: 1 MFSPGTKRRNLSSRTDRSSATAVSDSPITPLSAARKPVLDNLVPNRPGTGTPAPWAPRLS 60
Query: 61 VLARISPANRSSKEDETDPVRPVYVGEFPQVVRDEQASLVQQFATSGASMSGGMDAETSL 120
VLARISPANRSSKEDETDPVRPVYVGEFPQVVRDEQASLVQQFATSGASMSGGMDAETSL
Sbjct: 61 VLARISPANRSSKEDETDPVRPVYVGEFPQVVRDEQASLVQQFATSGASMSGGMDAETSL 120
Query: 121 AWIICRDKLFLWTYLLPVATMKCVVRELPKRILESKDIGRNDSDHWLLSIVSWDNQNQSS 180
AWIICRDKLFLWTYLLPVATMKCVVRELPKRILESKDIGRNDSDHWLLSIVSWDNQNQSS
Sbjct: 121 AWIICRDKLFLWTYLLPVATMKCVVRELPKRILESKDIGRNDSDHWLLSIVSWDNQNQSS 180
Query: 181 RKSAKHQSSVGIIICNKKTGAVVYWPDIFSDGGTTPVTCLTSSNEPAVISSIFDGKSTSH 240
RKSAKHQSSVGIIICNKKTGAVVYWPDIFSDGGTTPVTCLTSSNEPAVISSIFDGKSTSH
Sbjct: 181 RKSAKHQSSVGIIICNKKTGAVVYWPDIFSDGGTTPVTCLTSSNEPAVISSIFDGKSTSH 240
Query: 241 RHRSLNKPRTFNSLIASAVPESQYVCVAIACSSNGQLWQYRCSPMGIQCTEVPQDICSIR 300
RHRSLNKPRTFNSLIASAVPESQYVCVAIACSSNGQLWQYRCSPMGIQCTEVPQDICSIR
Sbjct: 241 RHRSLNKPRTFNSLIASAVPESQYVCVAIACSSNGQLWQYRCSPMGIQCTEVPQDICSIR 300
Query: 301 SQEDGSNQHFASDGYPRSLTWSRSHLQPDKFNRKFLLLTDHEIQCFSLKLFPDVQVSKIW 360
SQEDGSNQHFASDGYPRSLTWSRSHLQPDKFNRKFLLLTDHEIQCFSLKLFPDVQVSKIW
Sbjct: 301 SQEDGSNQHFASDGYPRSLTWSRSHLQPDKFNRKFLLLTDHEIQCFSLKLFPDVQVSKIW 360
Query: 361 SYEIVGTDSDLGIKKDLAGQKRIWPLDLQEDERGAVITILVATFCKDRISSSSYIQYSLL 420
SYEIVGTDSDLGIKKDLAGQKRIWPLDLQEDERGAVITILVATFCKDRISSSSYIQYSLL
Sbjct: 361 SYEIVGTDSDLGIKKDLAGQKRIWPLDLQEDERGAVITILVATFCKDRISSSSYIQYSLL 420
Query: 421 TLQYKSGAGIEASGDKRILEKKAPIQVIIPKARVENEDFLFSMRLRVGGKPSGSALILSG 480
TLQYKSGAGIEASGDKRILEKKAPIQVIIPKARVENEDFLFSMRLRVGGKPSGSALILSG
Sbjct: 421 TLQYKSGAGIEASGDKRILEKKAPIQVIIPKARVENEDFLFSMRLRVGGKPSGSALILSG 480
Query: 481 DGTATVSHYYRSSTLLYQFDLPYDAGKVLDASVLPSTEHGEGAWVVLTEKAGIWAIPVKA 540
DGTATVSHYYRSSTLLYQFDLPYDAGKVLDASVLPSTEHGEGAWVVLTEKAGIWAIPVKA
Sbjct: 481 DGTATVSHYYRSSTLLYQFDLPYDAGKVLDASVLPSTEHGEGAWVVLTEKAGIWAIPVKA 540
Query: 541 IVLGGVEPPERSLSRRGSSNERSVQDDTRNLNLSGNIASTRGSFEVQDVVDKKKTTVAGI 600
IVLGGVEPPERSLSRRGSSNERSVQDDTRNLNLSGNIASTRGSFEVQDVVDKKKTTVAGI
Sbjct: 541 IVLGGVEPPERSLSRRGSSNERSVQDDTRNLNLSGNIASTRGSFEVQDVVDKKKTTVAGI 600
Query: 601 GHRVARDEEAEALLRQLFHDFLSSGQVNNSFEKLKNSGAFDREDETNVFTRMSKSIIDTL 660
GHRVARDEEAEALLRQLFHDFLSSGQVNNSFEKLKNSGAFDREDETNVFTRMSKSIIDTL
Sbjct: 601 GHRVARDEEAEALLRQLFHDFLSSGQVNNSFEKLKNSGAFDREDETNVFTRMSKSIIDTL 660
Query: 661 AKHWTTTRGAEIVSMTVVSTQLMDKQQKHEKFLQFLALSKCHEELCSRQRNSLQIILEHG 720
AKHWTTTRGAEIVSMTVVSTQLMDKQQKHEKFLQFLALSKCHEELCSRQRNSLQIILEHG
Sbjct: 661 AKHWTTTRGAEIVSMTVVSTQLMDKQQKHEKFLQFLALSKCHEELCSRQRNSLQIILEHG 720
Query: 721 EKLAAMIQLRELQNTICQNRSTGLGSSSSNSETQMSGGLWDLIQFVGERARRNTVLLMDR 780
EKLAAMIQLRELQNTICQNRSTGLGSSSSNSETQMSGGLWDLIQFVGERARRNTVLLMDR
Sbjct: 721 EKLAAMIQLRELQNTICQNRSTGLGSSSSNSETQMSGGLWDLIQFVGERARRNTVLLMDR 780
Query: 781 DNPEVFYSKVSELEEVFYCLERQLDYVVSADDSCVVQNQRACELSEACVTIMRAAVHYRN 840
DNPEVFYSKVSELEEVFYCLERQLDYVVSADDSCVVQNQRACELSEACVTIMRAAVHYRN
Sbjct: 781 DNPEVFYSKVSELEEVFYCLERQLDYVVSADDSCVVQNQRACELSEACVTIMRAAVHYRN 840
Query: 841 EHQLWYPPSEGLTPWYSQPVVRNGLWHIASLMLQLLNEVSELDTSAKSDLYCCLELLTEV 900
EHQLWYPPSEGLTPWYSQPVVRNGLWHIASLMLQLLNEVSELDTSAKSDLYCCLELLTEV
Sbjct: 841 EHQLWYPPSEGLTPWYSQPVVRNGLWHIASLMLQLLNEVSELDTSAKSDLYCCLELLTEV 900
Query: 901 LLEAHAGAVTAKAERGEKTESLLHEFWSRRDALLSSLYKRVKDSVEAELKDFRGGLVEKN 960
LLEAHAGAVTAKAERGEKTESLLHEFWSRRDALLSSLYKRVKDSVEAELKDFRGGLVEKN
Sbjct: 901 LLEAHAGAVTAKAERGEKTESLLHEFWSRRDALLSSLYKRVKDSVEAELKDFRGGLVEKN 960
Query: 961 VEILRKNSSRLLSVAKQHECYSILWNICCDLNDPELLRKLMHESMGPKGGFSYFVFKRLY 1020
VEILRKNSSRLLSVAKQHECYSILWNICCDLNDPELLRKLMHESMGPKGGFSYFVFKRLY
Sbjct: 961 VEILRKNSSRLLSVAKQHECYSILWNICCDLNDPELLRKLMHESMGPKGGFSYFVFKRLY 1020
Query: 1021 ENKQFSKLLRLGEEFHEELLIFLKEHPDLLWLHELFLHQFFSASDTLHESALSGDDRLVS 1080
ENKQFSKLLRLGEEFHEELLIFLKEHPDLLWLHELFLHQFFSASDTLHESALSGDDRLVS
Sbjct: 1021 ENKQFSKLLRLGEEFHEELLIFLKEHPDLLWLHELFLHQFFSASDTLHESALSGDDRLVS 1080
Query: 1081 PPEVEGEFESDHCNFELRLADRKRLLYLSKIALMAAAAGQNAEYESKLMRIEADAKILKL 1140
PPEVEGEFESDHCNFELRLADRKRLLYLSKIALMAAAAGQNAEYESKLMRIEADAKILKL
Sbjct: 1081 PPEVEGEFESDHCNFELRLADRKRLLYLSKIALMAAAAGQNAEYESKLMRIEADAKILKL 1140
Query: 1141 QEEILDLYHAVETEQQLDCKLLHPDGLIQLCLKGENPALSLIAFDIFAWTSTSFRETHRK 1200
QEEILDLYHAVETEQQLDCKLLHPDGLIQLCLKGENPALSLIAFDIFAWTSTSFRETHRK
Sbjct: 1141 QEEILDLYHAVETEQQLDCKLLHPDGLIQLCLKGENPALSLIAFDIFAWTSTSFRETHRK 1200
Query: 1201 LLEECWKNAADQDDWNRLYQVSVAEGWSDEETLKKLRETTLFKASSRCYGHGATEVFGDG 1260
LLEECWKNAADQDDWNRLYQVSVAEGWSDEETLKKLRETTLFKASSRCYGHGATEVFGDG
Sbjct: 1201 LLEECWKNAADQDDWNRLYQVSVAEGWSDEETLKKLRETTLFKASSRCYGHGATEVFGDG 1260
Query: 1261 FDVALPLRQENEIAESSSLKNCAGSVEAILMQHKHFPEAGKLMVTAIMLGLDLEDDPILM 1320
FDVALPLRQENEIAESSSLKNCAGSVEAILMQHKHFPEAGKLMVTAIMLGLDLEDDPILM
Sbjct: 1261 FDVALPLRQENEIAESSSLKNCAGSVEAILMQHKHFPEAGKLMVTAIMLGLDLEDDPILM 1320
BLAST of MC04g0262 vs. NCBI nr
Match:
XP_022135834.1 (nuclear pore complex protein NUP133 isoform X2 [Momordica charantia])
HSP 1 Score: 2622 bits (6796), Expect = 0.0
Identity = 1320/1321 (99.92%), Positives = 1320/1321 (99.92%), Query Frame = 0
Query: 1 MFSPGTKRRNLSSRTDRSSATAVSDSPITPLSAARKPVLDNLVPNRPGTGTPAPWAPRLS 60
MFSPGTKRRNLSSRTDRSSATAVSDSPITPLSAARKPVLDNLVPNRPGTGTPAPWAPRLS
Sbjct: 1 MFSPGTKRRNLSSRTDRSSATAVSDSPITPLSAARKPVLDNLVPNRPGTGTPAPWAPRLS 60
Query: 61 VLARISPANRSSKEDETDPVRPVYVGEFPQVVRDEQASLVQQFATSGASMSGGMDAETSL 120
VLARISPANRSSKEDETDPVRPVYVGEFPQVVRDEQASLVQQFATSGASMSGGMDAETSL
Sbjct: 61 VLARISPANRSSKEDETDPVRPVYVGEFPQVVRDEQASLVQQFATSGASMSGGMDAETSL 120
Query: 121 AWIICRDKLFLWTYLLPVATMKCVVRELPKRILESKDIGRNDSDHWLLSIVSWDNQNQSS 180
AWIICRDKLFLWTYLLPVATMKCVVRELPKRILESKDIGRNDSDHWLLSIVSWDNQNQSS
Sbjct: 121 AWIICRDKLFLWTYLLPVATMKCVVRELPKRILESKDIGRNDSDHWLLSIVSWDNQNQSS 180
Query: 181 RKSAKHQSSVGIIICNKKTGAVVYWPDIFSDGGTTPVTCLTSSNEPAVISSIFDGKSTSH 240
RKSAKHQSSVGIIICNKKTGAVVYWPDIFSDGGTTPVTCLTSSNEPAVISSIFDGKSTSH
Sbjct: 181 RKSAKHQSSVGIIICNKKTGAVVYWPDIFSDGGTTPVTCLTSSNEPAVISSIFDGKSTSH 240
Query: 241 RHRSLNKPRTFNSLIASAVPESQYVCVAIACSSNGQLWQYRCSPMGIQCTEVPQDICSIR 300
RHRSLNKPRTFNSLIASAVPESQYVCVAIACSSNGQLWQYRCSPMGIQCTEVPQDICSIR
Sbjct: 241 RHRSLNKPRTFNSLIASAVPESQYVCVAIACSSNGQLWQYRCSPMGIQCTEVPQDICSIR 300
Query: 301 SQEDGSNQHFASDGYPRSLTWSRSHLQPDKFNRKFLLLTDHEIQCFSLKLFPDVQVSKIW 360
SQEDGSNQHFASDGYPRSLTWSRSHLQPDKFNRKFLLLTDHEIQCFSLKLFPDVQVSKIW
Sbjct: 301 SQEDGSNQHFASDGYPRSLTWSRSHLQPDKFNRKFLLLTDHEIQCFSLKLFPDVQVSKIW 360
Query: 361 SYEIVGTDSDLGIKKDLAGQKRIWPLDLQEDERGAVITILVATFCKDRISSSSYIQYSLL 420
SYEIVGTDSDLGIKKDLAGQKRIWPLDLQEDERGAVITILVATFCKDRISSSSYIQYSLL
Sbjct: 361 SYEIVGTDSDLGIKKDLAGQKRIWPLDLQEDERGAVITILVATFCKDRISSSSYIQYSLL 420
Query: 421 TLQYKSGAGIEASGDKRILEKKAPIQVIIPKARVENEDFLFSMRLRVGGKPSGSALILSG 480
TLQYKSGAGIEASGDKRILEKKAPIQVIIPKARVENEDFLFSMRLRVGGKPSGSALILSG
Sbjct: 421 TLQYKSGAGIEASGDKRILEKKAPIQVIIPKARVENEDFLFSMRLRVGGKPSGSALILSG 480
Query: 481 DGTATVSHYYRSSTLLYQFDLPYDAGKVLDASVLPSTEHGEGAWVVLTEKAGIWAIPVKA 540
DGTATVSHYYRSSTLLYQFDLPYDAGKVLDASVLPSTEHGEGAWVVLTEKAGIWAIPVKA
Sbjct: 481 DGTATVSHYYRSSTLLYQFDLPYDAGKVLDASVLPSTEHGEGAWVVLTEKAGIWAIPVKA 540
Query: 541 IVLGGVEPPERSLSRRGSSNERSVQDDTRNLNLSGNIASTRGSFEVQDVVDKKKTTVAGI 600
IVLGGVEPPERSLSRRGSSNERSVQDDTRNLNLSGNIASTRGSFEVQDVVDKKKTTVAGI
Sbjct: 541 IVLGGVEPPERSLSRRGSSNERSVQDDTRNLNLSGNIASTRGSFEVQDVVDKKKTTVAGI 600
Query: 601 GHRVARDEEAEALLRQLFHDFLSSGQVNNSFEKLKNSGAFDREDETNVFTRMSKSIIDTL 660
GHRVARDEEAEALLRQLFHDFLSSGQVNNSFEKLKNSGAFDREDETNVFTRMSKSIIDTL
Sbjct: 601 GHRVARDEEAEALLRQLFHDFLSSGQVNNSFEKLKNSGAFDREDETNVFTRMSKSIIDTL 660
Query: 661 AKHWTTTRGAEIVSMTVVSTQLMDKQQKHEKFLQFLALSKCHEELCSRQRNSLQIILEHG 720
AKHWTTTRGAEIVSMTVVSTQLMDKQQKHEKFLQFLALSKCHEELCSRQRNSLQIILEHG
Sbjct: 661 AKHWTTTRGAEIVSMTVVSTQLMDKQQKHEKFLQFLALSKCHEELCSRQRNSLQIILEHG 720
Query: 721 EKLAAMIQLRELQNTICQNRSTGLGSSSSNSETQMSGGLWDLIQFVGERARRNTVLLMDR 780
EKLAAMIQLRELQNTICQNRSTGLGSSSSNSETQMSGGLWDLIQFVGERARRNTVLLMDR
Sbjct: 721 EKLAAMIQLRELQNTICQNRSTGLGSSSSNSETQMSGGLWDLIQFVGERARRNTVLLMDR 780
Query: 781 DNPEVFYSKVSELEEVFYCLERQLDYVVSADDSCVVQNQRACELSEACVTIMRAAVHYRN 840
DNPEVFYSKVSELEEVFYCLERQLDYVVSADDSCVVQNQRACELSEACVTIMRAAVHYRN
Sbjct: 781 DNPEVFYSKVSELEEVFYCLERQLDYVVSADDSCVVQNQRACELSEACVTIMRAAVHYRN 840
Query: 841 EHQLWYPPSEGLTPWYSQPVVRNGLWHIASLMLQLLNEVSELDTSAKSDLYCCLELLTEV 900
EHQLWYPPSEGLTPWYSQPVVRNGLWHIASLMLQLLNEVSELDTSAKSDLYCCLELLTEV
Sbjct: 841 EHQLWYPPSEGLTPWYSQPVVRNGLWHIASLMLQLLNEVSELDTSAKSDLYCCLELLTEV 900
Query: 901 LLEAHAGAVTAKAERGEKTESLLHEFWSRRDALLSSLYKRVKDSVEAELKDFRGGLVEKN 960
LLEAHAGAVTAKAERGEKTESLLHEFWSRRDALLSSLYKRVKDSVEAELKDFRGGLVEKN
Sbjct: 901 LLEAHAGAVTAKAERGEKTESLLHEFWSRRDALLSSLYKRVKDSVEAELKDFRGGLVEKN 960
Query: 961 VEILRKNSSRLLSVAKQHECYSILWNICCDLNDPELLRKLMHESMGPKGGFSYFVFKRLY 1020
VEILRKNSSRLLSVAKQHECYSILWNICCDLNDPELLRKLMHESMGPKGGFSYFVFKRLY
Sbjct: 961 VEILRKNSSRLLSVAKQHECYSILWNICCDLNDPELLRKLMHESMGPKGGFSYFVFKRLY 1020
Query: 1021 ENKQFSKLLRLGEEFHEELLIFLKEHPDLLWLHELFLHQFFSASDTLHESALSGDDRLVS 1080
ENKQFSKLLRLGEEFHEELLIFLKEHPDLLWLHELFLHQFFSASDTLHESALSGDDRLVS
Sbjct: 1021 ENKQFSKLLRLGEEFHEELLIFLKEHPDLLWLHELFLHQFFSASDTLHESALSGDDRLVS 1080
Query: 1081 PPEVEGEFESDHCNFELRLADRKRLLYLSKIALMAAAAGQNAEYESKLMRIEADAKILKL 1140
PPEVEGEFESDHCNFELRLADRKRLLYLSKIALMAAA GQNAEYESKLMRIEADAKILKL
Sbjct: 1081 PPEVEGEFESDHCNFELRLADRKRLLYLSKIALMAAA-GQNAEYESKLMRIEADAKILKL 1140
Query: 1141 QEEILDLYHAVETEQQLDCKLLHPDGLIQLCLKGENPALSLIAFDIFAWTSTSFRETHRK 1200
QEEILDLYHAVETEQQLDCKLLHPDGLIQLCLKGENPALSLIAFDIFAWTSTSFRETHRK
Sbjct: 1141 QEEILDLYHAVETEQQLDCKLLHPDGLIQLCLKGENPALSLIAFDIFAWTSTSFRETHRK 1200
Query: 1201 LLEECWKNAADQDDWNRLYQVSVAEGWSDEETLKKLRETTLFKASSRCYGHGATEVFGDG 1260
LLEECWKNAADQDDWNRLYQVSVAEGWSDEETLKKLRETTLFKASSRCYGHGATEVFGDG
Sbjct: 1201 LLEECWKNAADQDDWNRLYQVSVAEGWSDEETLKKLRETTLFKASSRCYGHGATEVFGDG 1260
Query: 1261 FDVALPLRQENEIAESSSLKNCAGSVEAILMQHKHFPEAGKLMVTAIMLGLDLEDDPILM 1320
FDVALPLRQENEIAESSSLKNCAGSVEAILMQHKHFPEAGKLMVTAIMLGLDLEDDPILM
Sbjct: 1261 FDVALPLRQENEIAESSSLKNCAGSVEAILMQHKHFPEAGKLMVTAIMLGLDLEDDPILM 1320
BLAST of MC04g0262 vs. NCBI nr
Match:
XP_022135835.1 (nuclear pore complex protein NUP133 isoform X3 [Momordica charantia])
HSP 1 Score: 2618 bits (6786), Expect = 0.0
Identity = 1318/1321 (99.77%), Positives = 1318/1321 (99.77%), Query Frame = 0
Query: 1 MFSPGTKRRNLSSRTDRSSATAVSDSPITPLSAARKPVLDNLVPNRPGTGTPAPWAPRLS 60
MFSPGTKRRNLSSRTDRSSATAVSDSPITPLSAARKPVLDNLVPNRPGTGTPAPWAPRLS
Sbjct: 1 MFSPGTKRRNLSSRTDRSSATAVSDSPITPLSAARKPVLDNLVPNRPGTGTPAPWAPRLS 60
Query: 61 VLARISPANRSSKEDETDPVRPVYVGEFPQVVRDEQASLVQQFATSGASMSGGMDAETSL 120
VLARISPANRSSKEDETDPVRPVYVGEFPQVVRDEQASLVQQFATSGASMSGGMDAETSL
Sbjct: 61 VLARISPANRSSKEDETDPVRPVYVGEFPQVVRDEQASLVQQFATSGASMSGGMDAETSL 120
Query: 121 AWIICRDKLFLWTYLLPVATMKCVVRELPKRILESKDIGRNDSDHWLLSIVSWDNQNQSS 180
AWIICRDKLFLWTYLLPVATMKCVVRELPKRILESKDIGRNDSDHWLLSIVSWDNQNQSS
Sbjct: 121 AWIICRDKLFLWTYLLPVATMKCVVRELPKRILESKDIGRNDSDHWLLSIVSWDNQNQSS 180
Query: 181 RKSAKHQSSVGIIICNKKTGAVVYWPDIFSDGGTTPVTCLTSSNEPAVISSIFDGKSTSH 240
RKSAKHQSSVGIIICNKKTGAVVYWPDIFSDGGTTPVTCLTSSNEPAVISSIFDGKSTSH
Sbjct: 181 RKSAKHQSSVGIIICNKKTGAVVYWPDIFSDGGTTPVTCLTSSNEPAVISSIFDGKSTSH 240
Query: 241 RHRSLNKPRTFNSLIASAVPESQYVCVAIACSSNGQLWQYRCSPMGIQCTEVPQDICSIR 300
RHRSLNKPRTFNSLIASAVPESQYVCVAIACSSNGQLWQYRCSPMGIQCTEVPQDICSIR
Sbjct: 241 RHRSLNKPRTFNSLIASAVPESQYVCVAIACSSNGQLWQYRCSPMGIQCTEVPQDICSIR 300
Query: 301 SQEDGSNQHFASDGYPRSLTWSRSHLQPDKFNRKFLLLTDHEIQCFSLKLFPDVQVSKIW 360
SQEDGSNQHFASDGYPRSLTWSRSHLQPDKFNRKFLLLTDHEIQCFSLKLFPDVQVSKIW
Sbjct: 301 SQEDGSNQHFASDGYPRSLTWSRSHLQPDKFNRKFLLLTDHEIQCFSLKLFPDVQVSKIW 360
Query: 361 SYEIVGTDSDLGIKKDLAGQKRIWPLDLQEDERGAVITILVATFCKDRISSSSYIQYSLL 420
SYEIVGTDSDLGIKKDLAGQKRIWPLDLQEDERGAVITILVATFCKDRISSSSYIQYSLL
Sbjct: 361 SYEIVGTDSDLGIKKDLAGQKRIWPLDLQEDERGAVITILVATFCKDRISSSSYIQYSLL 420
Query: 421 TLQYKSGAGIEASGDKRILEKKAPIQVIIPKARVENEDFLFSMRLRVGGKPSGSALILSG 480
TLQYKSGAGIEASGDKRILEKKAPIQVIIPKARVENEDFLFSMRLRVGGKPSGSALILSG
Sbjct: 421 TLQYKSGAGIEASGDKRILEKKAPIQVIIPKARVENEDFLFSMRLRVGGKPSGSALILSG 480
Query: 481 DGTATVSHYYRSSTLLYQFDLPYDAGKVLDASVLPSTEHGEGAWVVLTEKAGIWAIPVKA 540
DGTATVSHYYRSSTLLYQFDLPYDAGKVLDASVLPSTEHGEGAWVVLTEKAGIWAIPVKA
Sbjct: 481 DGTATVSHYYRSSTLLYQFDLPYDAGKVLDASVLPSTEHGEGAWVVLTEKAGIWAIPVKA 540
Query: 541 IVLGGVEPPERSLSRRGSSNERSVQDDTRNLNLSGNIASTRGSFEVQDVVDKKKTTVAGI 600
IVLGGVEPPERSLSRRGSSNERSVQDDTRNLNLSGNIASTRGSFEVQDVVDKKKTTVAGI
Sbjct: 541 IVLGGVEPPERSLSRRGSSNERSVQDDTRNLNLSGNIASTRGSFEVQDVVDKKKTTVAGI 600
Query: 601 GHRVARDEEAEALLRQLFHDFLSSGQVNNSFEKLKNSGAFDREDETNVFTRMSKSIIDTL 660
GHRVARDEEAEALLRQLFHDFLSSGQVNNSFEKLKNSGAFDREDETNVFTRMSKSIIDTL
Sbjct: 601 GHRVARDEEAEALLRQLFHDFLSSGQVNNSFEKLKNSGAFDREDETNVFTRMSKSIIDTL 660
Query: 661 AKHWTTTRGAEIVSMTVVSTQLMDKQQKHEKFLQFLALSKCHEELCSRQRNSLQIILEHG 720
AKHWTTTRGAEIVSMTVVSTQLMDKQQKHEKFLQFLALSKCHEELCSRQRNSLQIILEHG
Sbjct: 661 AKHWTTTRGAEIVSMTVVSTQLMDKQQKHEKFLQFLALSKCHEELCSRQRNSLQIILEHG 720
Query: 721 EKLAAMIQLRELQNTICQNRSTGLGSSSSNSETQMSGGLWDLIQFVGERARRNTVLLMDR 780
EKLAAMIQLRELQNTICQNRSTGLGSSSSNSETQMSGGLWDLIQFVGERARRNTVLLMDR
Sbjct: 721 EKLAAMIQLRELQNTICQNRSTGLGSSSSNSETQMSGGLWDLIQFVGERARRNTVLLMDR 780
Query: 781 DNPEVFYSKVSELEEVFYCLERQLDYVVSADDSCVVQNQRACELSEACVTIMRAAVHYRN 840
DNPEVFYSKVSELEEVFYCLERQLDYVVSADDSCVVQNQRACELSEACVTIMRAAVHYRN
Sbjct: 781 DNPEVFYSKVSELEEVFYCLERQLDYVVSADDSCVVQNQRACELSEACVTIMRAAVHYRN 840
Query: 841 EHQLWYPPSEGLTPWYSQPVVRNGLWHIASLMLQLLNEVSELDTSAKSDLYCCLELLTEV 900
EHQLWYPPSEGLTPWYSQPVVRNGLWHIASLMLQLLNEVSELDTSAKSDLYCCLELLTEV
Sbjct: 841 EHQLWYPPSEGLTPWYSQPVVRNGLWHIASLMLQLLNEVSELDTSAKSDLYCCLELLTEV 900
Query: 901 LLEAHAGAVTAKAERGEKTESLLHEFWSRRDALLSSLYKRVKDSVEAELKDFRGGLVEKN 960
LLEAHAGAVTAKAERGEKTESLLHEFWSRRDALLSSLYKRVKDSVEAELKDFRGGLVEKN
Sbjct: 901 LLEAHAGAVTAKAERGEKTESLLHEFWSRRDALLSSLYKRVKDSVEAELKDFRGGLVEKN 960
Query: 961 VEILRKNSSRLLSVAKQHECYSILWNICCDLNDPELLRKLMHESMGPKGGFSYFVFKRLY 1020
VEILRKNSSRLLSVAKQHECYSILWNICCDLNDPELLRKLMHESMGPKGGFSYFVFKRLY
Sbjct: 961 VEILRKNSSRLLSVAKQHECYSILWNICCDLNDPELLRKLMHESMGPKGGFSYFVFKRLY 1020
Query: 1021 ENKQFSKLLRLGEEFHEELLIFLKEHPDLLWLHELFLHQFFSASDTLHESALSGDDRLVS 1080
ENKQFSKLLRLGEEFHEELLIFLKEHPDLLWLHELFLHQFFSASDTLHESALSGDDRLVS
Sbjct: 1021 ENKQFSKLLRLGEEFHEELLIFLKEHPDLLWLHELFLHQFFSASDTLHESALSGDDRLVS 1080
Query: 1081 PPEVEGEFESDHCNFELRLADRKRLLYLSKIALMAAAAGQNAEYESKLMRIEADAKILKL 1140
PPEVEGEFESDHCNFELRLADRKRLLYLSKIALMA GQNAEYESKLMRIEADAKILKL
Sbjct: 1081 PPEVEGEFESDHCNFELRLADRKRLLYLSKIALMA---GQNAEYESKLMRIEADAKILKL 1140
Query: 1141 QEEILDLYHAVETEQQLDCKLLHPDGLIQLCLKGENPALSLIAFDIFAWTSTSFRETHRK 1200
QEEILDLYHAVETEQQLDCKLLHPDGLIQLCLKGENPALSLIAFDIFAWTSTSFRETHRK
Sbjct: 1141 QEEILDLYHAVETEQQLDCKLLHPDGLIQLCLKGENPALSLIAFDIFAWTSTSFRETHRK 1200
Query: 1201 LLEECWKNAADQDDWNRLYQVSVAEGWSDEETLKKLRETTLFKASSRCYGHGATEVFGDG 1260
LLEECWKNAADQDDWNRLYQVSVAEGWSDEETLKKLRETTLFKASSRCYGHGATEVFGDG
Sbjct: 1201 LLEECWKNAADQDDWNRLYQVSVAEGWSDEETLKKLRETTLFKASSRCYGHGATEVFGDG 1260
Query: 1261 FDVALPLRQENEIAESSSLKNCAGSVEAILMQHKHFPEAGKLMVTAIMLGLDLEDDPILM 1320
FDVALPLRQENEIAESSSLKNCAGSVEAILMQHKHFPEAGKLMVTAIMLGLDLEDDPILM
Sbjct: 1261 FDVALPLRQENEIAESSSLKNCAGSVEAILMQHKHFPEAGKLMVTAIMLGLDLEDDPILM 1318
BLAST of MC04g0262 vs. NCBI nr
Match:
XP_038887917.1 (nuclear pore complex protein NUP133 isoform X1 [Benincasa hispida])
HSP 1 Score: 2332 bits (6043), Expect = 0.0
Identity = 1176/1326 (88.69%), Positives = 1238/1326 (93.36%), Query Frame = 0
Query: 1 MFSPGTKRRNLSSRTDRSSATAVSDSPITPLSAARKPVLDNLVPNRPGTGTPAPWAPRLS 60
MFSPGTKRRNLSSRTDRSSA A+SDSPITPLSA RKPVLDNLVPNRPGTGTPAPWAPRLS
Sbjct: 1 MFSPGTKRRNLSSRTDRSSAPALSDSPITPLSAVRKPVLDNLVPNRPGTGTPAPWAPRLS 60
Query: 61 VLARISPANRSSKEDETDPVRPVYVGEFPQVVRDEQASLVQQFATSGASMSGGMDAETSL 120
VLARISPANR KEDETDPV+PVYVGEFPQVVRDEQASLVQQF T GASMSGGMDA+TSL
Sbjct: 61 VLARISPANRCDKEDETDPVKPVYVGEFPQVVRDEQASLVQQFVTCGASMSGGMDAKTSL 120
Query: 121 AWIICRDKLFLWTYLLPVATMKCVVRELPKRILESKDIGRNDSDHWLLSIVSWDNQNQSS 180
AWIICRDKLFLWTYLLPVATMKCVV ELPKRI++SKDIGRN++D+WLLS+VSWD+QNQSS
Sbjct: 121 AWIICRDKLFLWTYLLPVATMKCVVCELPKRIVDSKDIGRNNNDNWLLSVVSWDSQNQSS 180
Query: 181 RKSAKHQSSVGIIICNKKTGAVVYWPDIFSDGGTTPVTCLTSSNEPAVISSIFDGKSTSH 240
RKS KHQ SVGIIICNKKTGAVVYWPDIFSD GT PVTCLTSS+EPA ISS DGKSTSH
Sbjct: 181 RKSVKHQHSVGIIICNKKTGAVVYWPDIFSDEGTAPVTCLTSSHEPAAISSFSDGKSTSH 240
Query: 241 RHRSLNKPRTFNSLIASAVPESQYVCVAIACSSNGQLWQYRCSPMGIQCTEVPQDICSIR 300
R++SLN+PRTFNSLIAS VP+SQ VCVA+ACSSNGQLWQY C PMGIQCT+V QDIC +
Sbjct: 241 RNQSLNRPRTFNSLIASMVPDSQNVCVALACSSNGQLWQYHCCPMGIQCTKVSQDICGLH 300
Query: 301 SQEDGSNQHFASDGYPRSLTWSRSHLQPDKFNRKFLLLTDHEIQCFSLKLFPDVQVSKIW 360
SQEDGS+Q+ +DGYPRSL+WS SHLQ DKFNRKFLLLTDHEIQCF LKLFPDVQVSK+W
Sbjct: 301 SQEDGSSQYLVNDGYPRSLSWSHSHLQLDKFNRKFLLLTDHEIQCFCLKLFPDVQVSKLW 360
Query: 361 SYEIVGTDSDLGIKKDLAGQKRIWPLDLQEDERGAVITILVATFCKDRISSSSYIQYSLL 420
SYEIVGTD+DL IKKDLAGQKRIWPLDLQEDE GAVITILVAT CKDRISSSSYIQYSLL
Sbjct: 361 SYEIVGTDNDLCIKKDLAGQKRIWPLDLQEDEGGAVITILVATLCKDRISSSSYIQYSLL 420
Query: 421 TLQYKSGAGIEASGDKRILEKKAPIQVIIPKARVENEDFLFSMRLRVGGKPSGSALILSG 480
TLQYK GA I+A+GDKRILEKKAPIQVIIPKARVEN+DFLFSMRLRVGGKPSGSALILSG
Sbjct: 421 TLQYKYGAEIDANGDKRILEKKAPIQVIIPKARVENDDFLFSMRLRVGGKPSGSALILSG 480
Query: 481 DGTATVSHYYRSSTLLYQFDLPYDAGKVLDASVLPSTEHGEGAWVVLTEKAGIWAIPVKA 540
DGTATVSHYYRSSTLLYQFDLPYDAGKVLDASVLPSTEHGEGAWVVLTEKAGIWAIPVKA
Sbjct: 481 DGTATVSHYYRSSTLLYQFDLPYDAGKVLDASVLPSTEHGEGAWVVLTEKAGIWAIPVKA 540
Query: 541 IVLGGVEPPERSLSRRGSSNERSVQDDTRNLNLSGNIASTRGSFEVQDVVDKKKTTVAGI 600
IVLGGVEPPERSLSRRGSSNERSVQDDTR+LN SGNIASTRG+F+VQDVVD+KK T+AGI
Sbjct: 541 IVLGGVEPPERSLSRRGSSNERSVQDDTRSLNFSGNIASTRGTFDVQDVVDRKKATMAGI 600
Query: 601 GHRVARDEEAEALLRQLFHDFLSSGQVNNSFEKLKNSGAFDREDETNVFTRMSKSIIDTL 660
HR ARDEE+EALLRQLFHDFLSS QVNNSFEKLKNSGAFDREDETNVFTRMSKSI+DTL
Sbjct: 601 SHRTARDEESEALLRQLFHDFLSSSQVNNSFEKLKNSGAFDREDETNVFTRMSKSIVDTL 660
Query: 661 AKHWTTTRGAEIVSMTVVSTQLMDKQQKHEKFLQFLALSKCHEELCSRQRNSLQIILEHG 720
AKHWTTTRGAEIVSMTVVSTQLM+KQQKHEKFLQFLALSKCHEELCSRQRNSLQIILEHG
Sbjct: 661 AKHWTTTRGAEIVSMTVVSTQLMEKQQKHEKFLQFLALSKCHEELCSRQRNSLQIILEHG 720
Query: 721 EKLAAMIQLRELQNTICQNRSTGLGSSSSNSETQMSGGLWDLIQFVGERARRNTVLLMDR 780
EKLAAMIQLRELQNTICQNRSTGLG SSNSET MSG LWDLIQFVGERARRNTVLLMDR
Sbjct: 721 EKLAAMIQLRELQNTICQNRSTGLGFLSSNSETPMSGALWDLIQFVGERARRNTVLLMDR 780
Query: 781 DNPEVFYSKVSELEEVFYCLERQLDYVVSADDSCVVQNQRACELSEACVTIMRAAVHYRN 840
DN EVFYSKVSELEEVF+CLE+QLDY+VSAD+S V+QNQRA ELS+ACVTIM AAVHYRN
Sbjct: 781 DNTEVFYSKVSELEEVFHCLEKQLDYLVSADESYVIQNQRASELSKACVTIMHAAVHYRN 840
Query: 841 EHQLWYPPSEGLTPWYSQPVVRNGLWHIASLMLQLLNEVSELDTSAKSDLYCCLELLTEV 900
EHQLWYPPSEGLTPWYSQ VVRNGLW IASLMLQLLNEVSELDTSAKSDLYC LELLTEV
Sbjct: 841 EHQLWYPPSEGLTPWYSQLVVRNGLWRIASLMLQLLNEVSELDTSAKSDLYCYLELLTEV 900
Query: 901 LLEAHAGAVTAKAERGEKTESLLHEFWSRRDALLSSLYKRVKDSVEAELKDFRGGLVEKN 960
LLEAHAGAVTAKAERGEKT+SLLHEFWSRRD+LLSSLY+R+K SVEAE KDFRG LVE+
Sbjct: 901 LLEAHAGAVTAKAERGEKTDSLLHEFWSRRDSLLSSLYQRIKKSVEAEHKDFRGDLVEQK 960
Query: 961 VEILRKNSSRLLSVAKQHECYSILWNICCDLNDPELLRKLMHESMGPKGGFSYFVFKRLY 1020
VE LRK+SSRLLSVAKQHECYSILW ICCDLNDPELLRK MHESMGPKGGFSYFVFK+L+
Sbjct: 961 VESLRKHSSRLLSVAKQHECYSILWEICCDLNDPELLRKHMHESMGPKGGFSYFVFKKLH 1020
Query: 1021 ENKQFSKLLRLGEEFHEELLIFLKEHPDLLWLHELFLHQFFSASDTLHESALSGDDRLVS 1080
ENKQFSKLLRLGEEFHEELLIFLKEHPDLLWLHELFLHQFFSASDTLH ALS D VS
Sbjct: 1021 ENKQFSKLLRLGEEFHEELLIFLKEHPDLLWLHELFLHQFFSASDTLHALALSEGDGPVS 1080
Query: 1081 PPEVEGEFESDHCNFELRLADRKRLLYLSKIALMAAAAGQNAEYESKLMRIEADAKILKL 1140
E+ E ESDHCN ELRLADRKR+LYLSKI+LMAAAAG+NAEYESKLMRIEADAKILKL
Sbjct: 1081 SLELGTEVESDHCNLELRLADRKRILYLSKISLMAAAAGKNAEYESKLMRIEADAKILKL 1140
Query: 1141 QEEILDLYHAVETEQQLDCKLLHPDGLIQLCLKGENPALSLIAFDIFAWTSTSFRETHRK 1200
QE ILDLYHAVETEQQLD +LLHPD LIQLCLK ENP L L+AFDIFAWTSTSFRETHRK
Sbjct: 1141 QEAILDLYHAVETEQQLDRELLHPDRLIQLCLKAENPTLLLMAFDIFAWTSTSFRETHRK 1200
Query: 1201 LLEECWKNAADQDDWNRLYQVSVAEGWSDEETLKKLRETTLFKASSRCYGHGATEVFGDG 1260
LLEECWKN ADQDDWNRLYQVSVAEGWSDEET+K LRETTLFKASSRCYGHGA E+FG+G
Sbjct: 1201 LLEECWKNVADQDDWNRLYQVSVAEGWSDEETIKNLRETTLFKASSRCYGHGAAEMFGEG 1260
Query: 1261 FDVALPLRQENEIAESSSLKNCAGSVEAILMQHKHFPEAGKLMVTAIMLGLD-----LED 1320
FDV LPLRQEN + S LK+ GSVEAILMQHKHFPEAGKLMVTAIMLG+D LED
Sbjct: 1261 FDVVLPLRQEN-LEGGSILKDSIGSVEAILMQHKHFPEAGKLMVTAIMLGVDDYDNTLED 1320
BLAST of MC04g0262 vs. NCBI nr
Match:
XP_023554346.1 (nuclear pore complex protein NUP133 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 2329 bits (6035), Expect = 0.0
Identity = 1172/1330 (88.12%), Positives = 1241/1330 (93.31%), Query Frame = 0
Query: 1 MFSPGTKRRNLSSRTDRSSATAVSDSPITPLSAARKPVLDNLVPNRPGTGTPAPWAPRLS 60
MFSPGTKRRN SSRT RS A A+SDSPITP+SA RKPVLDNL+PNRPGTGTPAPWAPRLS
Sbjct: 1 MFSPGTKRRNSSSRTGRSLAPALSDSPITPISAVRKPVLDNLIPNRPGTGTPAPWAPRLS 60
Query: 61 VLARISPANRSSKEDETDPVRPVYVGEFPQVVRDEQASLVQQFATSGASMSGGMDAETSL 120
VLARISP NRS KEDETDPVRPVYVGEFPQVVRDEQASLVQQFATSGAS++GGMDAETSL
Sbjct: 61 VLARISPVNRSDKEDETDPVRPVYVGEFPQVVRDEQASLVQQFATSGASVAGGMDAETSL 120
Query: 121 AWIICRDKLFLWTYLLPVATMKCVVRELPKRILESKDIGRNDSDHWLLSIVSWDNQNQSS 180
AWIICRDKLFLWTYLLPVATMKCVV ELPKRIL+SKDIGRN++DHWLLS+VSWD+QNQSS
Sbjct: 121 AWIICRDKLFLWTYLLPVATMKCVVHELPKRILDSKDIGRNNNDHWLLSVVSWDSQNQSS 180
Query: 181 RKSAKHQSSVGIIICNKKTGAVVYWPDIFSDGGTTPVTCLTSSNEPAVISS---IFDGKS 240
RKS KHQ+SV IIICNKKTGAVVYWPDIFSDGG+TP+TCLTSS+EPA ISS IFDGKS
Sbjct: 181 RKSVKHQNSVAIIICNKKTGAVVYWPDIFSDGGSTPITCLTSSHEPAAISSKTSIFDGKS 240
Query: 241 TSHRHRSLNKPRTFNSLIASAVPESQYVCVAIACSSNGQLWQYRCSPMGIQCTEVPQDIC 300
TS ++ N+P TFNSLIA+AVP+SQYVCVA+ACSSNGQLWQYRCSPMGIQCTEVPQDIC
Sbjct: 241 TSLGNQRPNRPCTFNSLIAAAVPDSQYVCVALACSSNGQLWQYRCSPMGIQCTEVPQDIC 300
Query: 301 SIRSQEDGSNQHFASDGYPRSLTWSRSHLQPDKFNRKFLLLTDHEIQCFSLKLFPDVQVS 360
R Q+DGS Q+ SDGYPRSLTWSRSHLQ DKFNRKFLLLTDHEIQCF LKLFPD+QVS
Sbjct: 301 GFRCQDDGSCQNLVSDGYPRSLTWSRSHLQLDKFNRKFLLLTDHEIQCFCLKLFPDLQVS 360
Query: 361 KIWSYEIVGTDSDLGIKKDLAGQKRIWPLDLQEDERGAVITILVATFCKDRISSSSYIQY 420
K+WSYEIVGTDSDLGIKKDLAGQKRIWPLDLQEDE+GAVITILVATFCKDRISSSSYIQY
Sbjct: 361 KLWSYEIVGTDSDLGIKKDLAGQKRIWPLDLQEDEQGAVITILVATFCKDRISSSSYIQY 420
Query: 421 SLLTLQYKSGAGIEASGDKRILEKKAPIQVIIPKARVENEDFLFSMRLRVGGKPSGSALI 480
+LLTLQYKSGAGIEASGDKRILEKKAPIQVIIPKARVENEDFLFSMRLRVGGKPSGSALI
Sbjct: 421 TLLTLQYKSGAGIEASGDKRILEKKAPIQVIIPKARVENEDFLFSMRLRVGGKPSGSALI 480
Query: 481 LSGDGTATVSHYYRSSTLLYQFDLPYDAGKVLDASVLPSTEHGEGAWVVLTEKAGIWAIP 540
LSGDGTATVSHYYRSSTLLYQFDLPYDAGKVLDASVLPSTEHGEGAWVVLTEKAGIWAIP
Sbjct: 481 LSGDGTATVSHYYRSSTLLYQFDLPYDAGKVLDASVLPSTEHGEGAWVVLTEKAGIWAIP 540
Query: 541 VKAIVLGGVEPPERSLSRRGSSNERSVQDDTRNLNLSGNIASTRGSFEVQDVVDKKKTTV 600
VKAIVLGGVEPPERSLSRRGSSNERSVQDD+R+LN SGNIASTRGS EVQDVVD+KK T+
Sbjct: 541 VKAIVLGGVEPPERSLSRRGSSNERSVQDDSRSLNFSGNIASTRGSLEVQDVVDRKKATM 600
Query: 601 AGIGHRVARDEEAEALLRQLFHDFLSSGQVNNSFEKLKNSGAFDREDETNVFTRMSKSII 660
+G+ HR ARDEE+EALLRQLFHDFLSSGQVNNSFEKLKNSGAFDREDETNVFTRMSKSI+
Sbjct: 601 SGMAHRTARDEESEALLRQLFHDFLSSGQVNNSFEKLKNSGAFDREDETNVFTRMSKSIV 660
Query: 661 DTLAKHWTTTRGAEIVSMTVVSTQLMDKQQKHEKFLQFLALSKCHEELCSRQRNSLQIIL 720
DTLAKHWTTTRGAEIVSMTVVSTQL+DKQQKHEKFLQFLALSKCHEELCSRQRNSLQIIL
Sbjct: 661 DTLAKHWTTTRGAEIVSMTVVSTQLIDKQQKHEKFLQFLALSKCHEELCSRQRNSLQIIL 720
Query: 721 EHGEKLAAMIQLRELQNTICQNRSTGLGSSSSNSETQMSGGLWDLIQFVGERARRNTVLL 780
HGEKLAAMIQLRELQNTI QNRS GL S SSNSET MSG LWDLIQFVGERARRNTVLL
Sbjct: 721 GHGEKLAAMIQLRELQNTIFQNRSNGLSSLSSNSETPMSGALWDLIQFVGERARRNTVLL 780
Query: 781 MDRDNPEVFYSKVSELEEVFYCLERQLDYVVSADDSCVVQNQRACELSEACVTIMRAAVH 840
MDRDN EVFYSKVSELEEVF CLERQLDYVVSAD+S VQNQRACE+S+ACVTIMRAAV
Sbjct: 781 MDRDNTEVFYSKVSELEEVFNCLERQLDYVVSADESYAVQNQRACEISKACVTIMRAAVQ 840
Query: 841 YRNEHQLWYPPSEGLTPWYSQPVVRNGLWHIASLMLQLLNEVSELDTSAKSDLYCCLELL 900
YRNEHQLWYPPSEGLTPWYSQ VVRNGLWHIASLMLQLLNEVSELDTSAKSDLYC LELL
Sbjct: 841 YRNEHQLWYPPSEGLTPWYSQLVVRNGLWHIASLMLQLLNEVSELDTSAKSDLYCYLELL 900
Query: 901 TEVLLEAHAGAVTAKAERGEKTESLLHEFWSRRDALLSSLYKRVKDSVEAELKDFRGGLV 960
TEVLLE+HAGAVTAKAERGEKTE LLHEFWSRRD+LLSSLY+R+KDSVEAE KDFRG LV
Sbjct: 901 TEVLLESHAGAVTAKAERGEKTEGLLHEFWSRRDSLLSSLYQRIKDSVEAEHKDFRGDLV 960
Query: 961 EKNVEILRKNSSRLLSVAKQHECYSILWNICCDLNDPELLRKLMHESMGPKGGFSYFVFK 1020
E+ VE LRK+SSRLL+VAKQHECYSILW+ICCDLND ELLR LMHESMGPKGGFSYFVFK
Sbjct: 961 EQRVESLRKHSSRLLAVAKQHECYSILWSICCDLNDSELLRNLMHESMGPKGGFSYFVFK 1020
Query: 1021 RLYENKQFSKLLRLGEEFHEELLIFLKEHPDLLWLHELFLHQFFSASDTLHESALSGDDR 1080
+L+ENKQFSKLLRLGEEFHEELLIFLKEHPDLLWLHELFLHQFFSASDTLHE ALS +
Sbjct: 1021 KLHENKQFSKLLRLGEEFHEELLIFLKEHPDLLWLHELFLHQFFSASDTLHELALSEGEG 1080
Query: 1081 LVSPPEVEGEFESDHCNFELRLADRKRLLYLSKIALMAAAAGQNAEYESKLMRIEADAKI 1140
VSPPEVE E ESDHCN EL+LADRKRLLYLSKIALMAAAAG+N EYESKLMRIEADAKI
Sbjct: 1081 PVSPPEVETEVESDHCNLELKLADRKRLLYLSKIALMAAAAGRNTEYESKLMRIEADAKI 1140
Query: 1141 LKLQEEILDLYHAVETEQQLDCKLLHPDGLIQLCLKGENPALSLIAFDIFAWTSTSFRET 1200
LKLQE ILD A+ETEQ+LDC+LLHPD LIQLCLK +NP LSL+AFDIFAWTSTSFRET
Sbjct: 1141 LKLQEAILDPSLAIETEQKLDCELLHPDRLIQLCLKSKNPTLSLMAFDIFAWTSTSFRET 1200
Query: 1201 HRKLLEECWKNAADQDDWNRLYQVSVAEGWSDEETLKKLRETTLFKASSRCYGHGATEVF 1260
HRKLLEECWKN ADQDDWN+LY+ SVAEGWSDEET++ LRET LFKASSRCYGHGA EVF
Sbjct: 1201 HRKLLEECWKNVADQDDWNQLYEASVAEGWSDEETMRNLRETALFKASSRCYGHGAAEVF 1260
Query: 1261 GDG-FDVALPLRQENEIAESSSLKNCAGSVEAILMQHKHFPEAGKLMVTAIMLGLD---- 1320
G+G F+ LPLRQEN S +K+C GSVEAILMQHKHFPEAGKLMVTAIMLG++
Sbjct: 1261 GEGGFNAVLPLRQENLEGGSIMVKDCVGSVEAILMQHKHFPEAGKLMVTAIMLGVEDYDN 1320
BLAST of MC04g0262 vs. ExPASy TrEMBL
Match:
A0A6J1C5Z7 (nuclear pore complex protein NUP133 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111007690 PE=3 SV=1)
HSP 1 Score: 2628 bits (6812), Expect = 0.0
Identity = 1321/1321 (100.00%), Positives = 1321/1321 (100.00%), Query Frame = 0
Query: 1 MFSPGTKRRNLSSRTDRSSATAVSDSPITPLSAARKPVLDNLVPNRPGTGTPAPWAPRLS 60
MFSPGTKRRNLSSRTDRSSATAVSDSPITPLSAARKPVLDNLVPNRPGTGTPAPWAPRLS
Sbjct: 1 MFSPGTKRRNLSSRTDRSSATAVSDSPITPLSAARKPVLDNLVPNRPGTGTPAPWAPRLS 60
Query: 61 VLARISPANRSSKEDETDPVRPVYVGEFPQVVRDEQASLVQQFATSGASMSGGMDAETSL 120
VLARISPANRSSKEDETDPVRPVYVGEFPQVVRDEQASLVQQFATSGASMSGGMDAETSL
Sbjct: 61 VLARISPANRSSKEDETDPVRPVYVGEFPQVVRDEQASLVQQFATSGASMSGGMDAETSL 120
Query: 121 AWIICRDKLFLWTYLLPVATMKCVVRELPKRILESKDIGRNDSDHWLLSIVSWDNQNQSS 180
AWIICRDKLFLWTYLLPVATMKCVVRELPKRILESKDIGRNDSDHWLLSIVSWDNQNQSS
Sbjct: 121 AWIICRDKLFLWTYLLPVATMKCVVRELPKRILESKDIGRNDSDHWLLSIVSWDNQNQSS 180
Query: 181 RKSAKHQSSVGIIICNKKTGAVVYWPDIFSDGGTTPVTCLTSSNEPAVISSIFDGKSTSH 240
RKSAKHQSSVGIIICNKKTGAVVYWPDIFSDGGTTPVTCLTSSNEPAVISSIFDGKSTSH
Sbjct: 181 RKSAKHQSSVGIIICNKKTGAVVYWPDIFSDGGTTPVTCLTSSNEPAVISSIFDGKSTSH 240
Query: 241 RHRSLNKPRTFNSLIASAVPESQYVCVAIACSSNGQLWQYRCSPMGIQCTEVPQDICSIR 300
RHRSLNKPRTFNSLIASAVPESQYVCVAIACSSNGQLWQYRCSPMGIQCTEVPQDICSIR
Sbjct: 241 RHRSLNKPRTFNSLIASAVPESQYVCVAIACSSNGQLWQYRCSPMGIQCTEVPQDICSIR 300
Query: 301 SQEDGSNQHFASDGYPRSLTWSRSHLQPDKFNRKFLLLTDHEIQCFSLKLFPDVQVSKIW 360
SQEDGSNQHFASDGYPRSLTWSRSHLQPDKFNRKFLLLTDHEIQCFSLKLFPDVQVSKIW
Sbjct: 301 SQEDGSNQHFASDGYPRSLTWSRSHLQPDKFNRKFLLLTDHEIQCFSLKLFPDVQVSKIW 360
Query: 361 SYEIVGTDSDLGIKKDLAGQKRIWPLDLQEDERGAVITILVATFCKDRISSSSYIQYSLL 420
SYEIVGTDSDLGIKKDLAGQKRIWPLDLQEDERGAVITILVATFCKDRISSSSYIQYSLL
Sbjct: 361 SYEIVGTDSDLGIKKDLAGQKRIWPLDLQEDERGAVITILVATFCKDRISSSSYIQYSLL 420
Query: 421 TLQYKSGAGIEASGDKRILEKKAPIQVIIPKARVENEDFLFSMRLRVGGKPSGSALILSG 480
TLQYKSGAGIEASGDKRILEKKAPIQVIIPKARVENEDFLFSMRLRVGGKPSGSALILSG
Sbjct: 421 TLQYKSGAGIEASGDKRILEKKAPIQVIIPKARVENEDFLFSMRLRVGGKPSGSALILSG 480
Query: 481 DGTATVSHYYRSSTLLYQFDLPYDAGKVLDASVLPSTEHGEGAWVVLTEKAGIWAIPVKA 540
DGTATVSHYYRSSTLLYQFDLPYDAGKVLDASVLPSTEHGEGAWVVLTEKAGIWAIPVKA
Sbjct: 481 DGTATVSHYYRSSTLLYQFDLPYDAGKVLDASVLPSTEHGEGAWVVLTEKAGIWAIPVKA 540
Query: 541 IVLGGVEPPERSLSRRGSSNERSVQDDTRNLNLSGNIASTRGSFEVQDVVDKKKTTVAGI 600
IVLGGVEPPERSLSRRGSSNERSVQDDTRNLNLSGNIASTRGSFEVQDVVDKKKTTVAGI
Sbjct: 541 IVLGGVEPPERSLSRRGSSNERSVQDDTRNLNLSGNIASTRGSFEVQDVVDKKKTTVAGI 600
Query: 601 GHRVARDEEAEALLRQLFHDFLSSGQVNNSFEKLKNSGAFDREDETNVFTRMSKSIIDTL 660
GHRVARDEEAEALLRQLFHDFLSSGQVNNSFEKLKNSGAFDREDETNVFTRMSKSIIDTL
Sbjct: 601 GHRVARDEEAEALLRQLFHDFLSSGQVNNSFEKLKNSGAFDREDETNVFTRMSKSIIDTL 660
Query: 661 AKHWTTTRGAEIVSMTVVSTQLMDKQQKHEKFLQFLALSKCHEELCSRQRNSLQIILEHG 720
AKHWTTTRGAEIVSMTVVSTQLMDKQQKHEKFLQFLALSKCHEELCSRQRNSLQIILEHG
Sbjct: 661 AKHWTTTRGAEIVSMTVVSTQLMDKQQKHEKFLQFLALSKCHEELCSRQRNSLQIILEHG 720
Query: 721 EKLAAMIQLRELQNTICQNRSTGLGSSSSNSETQMSGGLWDLIQFVGERARRNTVLLMDR 780
EKLAAMIQLRELQNTICQNRSTGLGSSSSNSETQMSGGLWDLIQFVGERARRNTVLLMDR
Sbjct: 721 EKLAAMIQLRELQNTICQNRSTGLGSSSSNSETQMSGGLWDLIQFVGERARRNTVLLMDR 780
Query: 781 DNPEVFYSKVSELEEVFYCLERQLDYVVSADDSCVVQNQRACELSEACVTIMRAAVHYRN 840
DNPEVFYSKVSELEEVFYCLERQLDYVVSADDSCVVQNQRACELSEACVTIMRAAVHYRN
Sbjct: 781 DNPEVFYSKVSELEEVFYCLERQLDYVVSADDSCVVQNQRACELSEACVTIMRAAVHYRN 840
Query: 841 EHQLWYPPSEGLTPWYSQPVVRNGLWHIASLMLQLLNEVSELDTSAKSDLYCCLELLTEV 900
EHQLWYPPSEGLTPWYSQPVVRNGLWHIASLMLQLLNEVSELDTSAKSDLYCCLELLTEV
Sbjct: 841 EHQLWYPPSEGLTPWYSQPVVRNGLWHIASLMLQLLNEVSELDTSAKSDLYCCLELLTEV 900
Query: 901 LLEAHAGAVTAKAERGEKTESLLHEFWSRRDALLSSLYKRVKDSVEAELKDFRGGLVEKN 960
LLEAHAGAVTAKAERGEKTESLLHEFWSRRDALLSSLYKRVKDSVEAELKDFRGGLVEKN
Sbjct: 901 LLEAHAGAVTAKAERGEKTESLLHEFWSRRDALLSSLYKRVKDSVEAELKDFRGGLVEKN 960
Query: 961 VEILRKNSSRLLSVAKQHECYSILWNICCDLNDPELLRKLMHESMGPKGGFSYFVFKRLY 1020
VEILRKNSSRLLSVAKQHECYSILWNICCDLNDPELLRKLMHESMGPKGGFSYFVFKRLY
Sbjct: 961 VEILRKNSSRLLSVAKQHECYSILWNICCDLNDPELLRKLMHESMGPKGGFSYFVFKRLY 1020
Query: 1021 ENKQFSKLLRLGEEFHEELLIFLKEHPDLLWLHELFLHQFFSASDTLHESALSGDDRLVS 1080
ENKQFSKLLRLGEEFHEELLIFLKEHPDLLWLHELFLHQFFSASDTLHESALSGDDRLVS
Sbjct: 1021 ENKQFSKLLRLGEEFHEELLIFLKEHPDLLWLHELFLHQFFSASDTLHESALSGDDRLVS 1080
Query: 1081 PPEVEGEFESDHCNFELRLADRKRLLYLSKIALMAAAAGQNAEYESKLMRIEADAKILKL 1140
PPEVEGEFESDHCNFELRLADRKRLLYLSKIALMAAAAGQNAEYESKLMRIEADAKILKL
Sbjct: 1081 PPEVEGEFESDHCNFELRLADRKRLLYLSKIALMAAAAGQNAEYESKLMRIEADAKILKL 1140
Query: 1141 QEEILDLYHAVETEQQLDCKLLHPDGLIQLCLKGENPALSLIAFDIFAWTSTSFRETHRK 1200
QEEILDLYHAVETEQQLDCKLLHPDGLIQLCLKGENPALSLIAFDIFAWTSTSFRETHRK
Sbjct: 1141 QEEILDLYHAVETEQQLDCKLLHPDGLIQLCLKGENPALSLIAFDIFAWTSTSFRETHRK 1200
Query: 1201 LLEECWKNAADQDDWNRLYQVSVAEGWSDEETLKKLRETTLFKASSRCYGHGATEVFGDG 1260
LLEECWKNAADQDDWNRLYQVSVAEGWSDEETLKKLRETTLFKASSRCYGHGATEVFGDG
Sbjct: 1201 LLEECWKNAADQDDWNRLYQVSVAEGWSDEETLKKLRETTLFKASSRCYGHGATEVFGDG 1260
Query: 1261 FDVALPLRQENEIAESSSLKNCAGSVEAILMQHKHFPEAGKLMVTAIMLGLDLEDDPILM 1320
FDVALPLRQENEIAESSSLKNCAGSVEAILMQHKHFPEAGKLMVTAIMLGLDLEDDPILM
Sbjct: 1261 FDVALPLRQENEIAESSSLKNCAGSVEAILMQHKHFPEAGKLMVTAIMLGLDLEDDPILM 1320
BLAST of MC04g0262 vs. ExPASy TrEMBL
Match:
A0A6J1C2K7 (nuclear pore complex protein NUP133 isoform X2 OS=Momordica charantia OX=3673 GN=LOC111007690 PE=3 SV=1)
HSP 1 Score: 2622 bits (6796), Expect = 0.0
Identity = 1320/1321 (99.92%), Positives = 1320/1321 (99.92%), Query Frame = 0
Query: 1 MFSPGTKRRNLSSRTDRSSATAVSDSPITPLSAARKPVLDNLVPNRPGTGTPAPWAPRLS 60
MFSPGTKRRNLSSRTDRSSATAVSDSPITPLSAARKPVLDNLVPNRPGTGTPAPWAPRLS
Sbjct: 1 MFSPGTKRRNLSSRTDRSSATAVSDSPITPLSAARKPVLDNLVPNRPGTGTPAPWAPRLS 60
Query: 61 VLARISPANRSSKEDETDPVRPVYVGEFPQVVRDEQASLVQQFATSGASMSGGMDAETSL 120
VLARISPANRSSKEDETDPVRPVYVGEFPQVVRDEQASLVQQFATSGASMSGGMDAETSL
Sbjct: 61 VLARISPANRSSKEDETDPVRPVYVGEFPQVVRDEQASLVQQFATSGASMSGGMDAETSL 120
Query: 121 AWIICRDKLFLWTYLLPVATMKCVVRELPKRILESKDIGRNDSDHWLLSIVSWDNQNQSS 180
AWIICRDKLFLWTYLLPVATMKCVVRELPKRILESKDIGRNDSDHWLLSIVSWDNQNQSS
Sbjct: 121 AWIICRDKLFLWTYLLPVATMKCVVRELPKRILESKDIGRNDSDHWLLSIVSWDNQNQSS 180
Query: 181 RKSAKHQSSVGIIICNKKTGAVVYWPDIFSDGGTTPVTCLTSSNEPAVISSIFDGKSTSH 240
RKSAKHQSSVGIIICNKKTGAVVYWPDIFSDGGTTPVTCLTSSNEPAVISSIFDGKSTSH
Sbjct: 181 RKSAKHQSSVGIIICNKKTGAVVYWPDIFSDGGTTPVTCLTSSNEPAVISSIFDGKSTSH 240
Query: 241 RHRSLNKPRTFNSLIASAVPESQYVCVAIACSSNGQLWQYRCSPMGIQCTEVPQDICSIR 300
RHRSLNKPRTFNSLIASAVPESQYVCVAIACSSNGQLWQYRCSPMGIQCTEVPQDICSIR
Sbjct: 241 RHRSLNKPRTFNSLIASAVPESQYVCVAIACSSNGQLWQYRCSPMGIQCTEVPQDICSIR 300
Query: 301 SQEDGSNQHFASDGYPRSLTWSRSHLQPDKFNRKFLLLTDHEIQCFSLKLFPDVQVSKIW 360
SQEDGSNQHFASDGYPRSLTWSRSHLQPDKFNRKFLLLTDHEIQCFSLKLFPDVQVSKIW
Sbjct: 301 SQEDGSNQHFASDGYPRSLTWSRSHLQPDKFNRKFLLLTDHEIQCFSLKLFPDVQVSKIW 360
Query: 361 SYEIVGTDSDLGIKKDLAGQKRIWPLDLQEDERGAVITILVATFCKDRISSSSYIQYSLL 420
SYEIVGTDSDLGIKKDLAGQKRIWPLDLQEDERGAVITILVATFCKDRISSSSYIQYSLL
Sbjct: 361 SYEIVGTDSDLGIKKDLAGQKRIWPLDLQEDERGAVITILVATFCKDRISSSSYIQYSLL 420
Query: 421 TLQYKSGAGIEASGDKRILEKKAPIQVIIPKARVENEDFLFSMRLRVGGKPSGSALILSG 480
TLQYKSGAGIEASGDKRILEKKAPIQVIIPKARVENEDFLFSMRLRVGGKPSGSALILSG
Sbjct: 421 TLQYKSGAGIEASGDKRILEKKAPIQVIIPKARVENEDFLFSMRLRVGGKPSGSALILSG 480
Query: 481 DGTATVSHYYRSSTLLYQFDLPYDAGKVLDASVLPSTEHGEGAWVVLTEKAGIWAIPVKA 540
DGTATVSHYYRSSTLLYQFDLPYDAGKVLDASVLPSTEHGEGAWVVLTEKAGIWAIPVKA
Sbjct: 481 DGTATVSHYYRSSTLLYQFDLPYDAGKVLDASVLPSTEHGEGAWVVLTEKAGIWAIPVKA 540
Query: 541 IVLGGVEPPERSLSRRGSSNERSVQDDTRNLNLSGNIASTRGSFEVQDVVDKKKTTVAGI 600
IVLGGVEPPERSLSRRGSSNERSVQDDTRNLNLSGNIASTRGSFEVQDVVDKKKTTVAGI
Sbjct: 541 IVLGGVEPPERSLSRRGSSNERSVQDDTRNLNLSGNIASTRGSFEVQDVVDKKKTTVAGI 600
Query: 601 GHRVARDEEAEALLRQLFHDFLSSGQVNNSFEKLKNSGAFDREDETNVFTRMSKSIIDTL 660
GHRVARDEEAEALLRQLFHDFLSSGQVNNSFEKLKNSGAFDREDETNVFTRMSKSIIDTL
Sbjct: 601 GHRVARDEEAEALLRQLFHDFLSSGQVNNSFEKLKNSGAFDREDETNVFTRMSKSIIDTL 660
Query: 661 AKHWTTTRGAEIVSMTVVSTQLMDKQQKHEKFLQFLALSKCHEELCSRQRNSLQIILEHG 720
AKHWTTTRGAEIVSMTVVSTQLMDKQQKHEKFLQFLALSKCHEELCSRQRNSLQIILEHG
Sbjct: 661 AKHWTTTRGAEIVSMTVVSTQLMDKQQKHEKFLQFLALSKCHEELCSRQRNSLQIILEHG 720
Query: 721 EKLAAMIQLRELQNTICQNRSTGLGSSSSNSETQMSGGLWDLIQFVGERARRNTVLLMDR 780
EKLAAMIQLRELQNTICQNRSTGLGSSSSNSETQMSGGLWDLIQFVGERARRNTVLLMDR
Sbjct: 721 EKLAAMIQLRELQNTICQNRSTGLGSSSSNSETQMSGGLWDLIQFVGERARRNTVLLMDR 780
Query: 781 DNPEVFYSKVSELEEVFYCLERQLDYVVSADDSCVVQNQRACELSEACVTIMRAAVHYRN 840
DNPEVFYSKVSELEEVFYCLERQLDYVVSADDSCVVQNQRACELSEACVTIMRAAVHYRN
Sbjct: 781 DNPEVFYSKVSELEEVFYCLERQLDYVVSADDSCVVQNQRACELSEACVTIMRAAVHYRN 840
Query: 841 EHQLWYPPSEGLTPWYSQPVVRNGLWHIASLMLQLLNEVSELDTSAKSDLYCCLELLTEV 900
EHQLWYPPSEGLTPWYSQPVVRNGLWHIASLMLQLLNEVSELDTSAKSDLYCCLELLTEV
Sbjct: 841 EHQLWYPPSEGLTPWYSQPVVRNGLWHIASLMLQLLNEVSELDTSAKSDLYCCLELLTEV 900
Query: 901 LLEAHAGAVTAKAERGEKTESLLHEFWSRRDALLSSLYKRVKDSVEAELKDFRGGLVEKN 960
LLEAHAGAVTAKAERGEKTESLLHEFWSRRDALLSSLYKRVKDSVEAELKDFRGGLVEKN
Sbjct: 901 LLEAHAGAVTAKAERGEKTESLLHEFWSRRDALLSSLYKRVKDSVEAELKDFRGGLVEKN 960
Query: 961 VEILRKNSSRLLSVAKQHECYSILWNICCDLNDPELLRKLMHESMGPKGGFSYFVFKRLY 1020
VEILRKNSSRLLSVAKQHECYSILWNICCDLNDPELLRKLMHESMGPKGGFSYFVFKRLY
Sbjct: 961 VEILRKNSSRLLSVAKQHECYSILWNICCDLNDPELLRKLMHESMGPKGGFSYFVFKRLY 1020
Query: 1021 ENKQFSKLLRLGEEFHEELLIFLKEHPDLLWLHELFLHQFFSASDTLHESALSGDDRLVS 1080
ENKQFSKLLRLGEEFHEELLIFLKEHPDLLWLHELFLHQFFSASDTLHESALSGDDRLVS
Sbjct: 1021 ENKQFSKLLRLGEEFHEELLIFLKEHPDLLWLHELFLHQFFSASDTLHESALSGDDRLVS 1080
Query: 1081 PPEVEGEFESDHCNFELRLADRKRLLYLSKIALMAAAAGQNAEYESKLMRIEADAKILKL 1140
PPEVEGEFESDHCNFELRLADRKRLLYLSKIALMAAA GQNAEYESKLMRIEADAKILKL
Sbjct: 1081 PPEVEGEFESDHCNFELRLADRKRLLYLSKIALMAAA-GQNAEYESKLMRIEADAKILKL 1140
Query: 1141 QEEILDLYHAVETEQQLDCKLLHPDGLIQLCLKGENPALSLIAFDIFAWTSTSFRETHRK 1200
QEEILDLYHAVETEQQLDCKLLHPDGLIQLCLKGENPALSLIAFDIFAWTSTSFRETHRK
Sbjct: 1141 QEEILDLYHAVETEQQLDCKLLHPDGLIQLCLKGENPALSLIAFDIFAWTSTSFRETHRK 1200
Query: 1201 LLEECWKNAADQDDWNRLYQVSVAEGWSDEETLKKLRETTLFKASSRCYGHGATEVFGDG 1260
LLEECWKNAADQDDWNRLYQVSVAEGWSDEETLKKLRETTLFKASSRCYGHGATEVFGDG
Sbjct: 1201 LLEECWKNAADQDDWNRLYQVSVAEGWSDEETLKKLRETTLFKASSRCYGHGATEVFGDG 1260
Query: 1261 FDVALPLRQENEIAESSSLKNCAGSVEAILMQHKHFPEAGKLMVTAIMLGLDLEDDPILM 1320
FDVALPLRQENEIAESSSLKNCAGSVEAILMQHKHFPEAGKLMVTAIMLGLDLEDDPILM
Sbjct: 1261 FDVALPLRQENEIAESSSLKNCAGSVEAILMQHKHFPEAGKLMVTAIMLGLDLEDDPILM 1320
BLAST of MC04g0262 vs. ExPASy TrEMBL
Match:
A0A6J1C261 (nuclear pore complex protein NUP133 isoform X3 OS=Momordica charantia OX=3673 GN=LOC111007690 PE=3 SV=1)
HSP 1 Score: 2618 bits (6786), Expect = 0.0
Identity = 1318/1321 (99.77%), Positives = 1318/1321 (99.77%), Query Frame = 0
Query: 1 MFSPGTKRRNLSSRTDRSSATAVSDSPITPLSAARKPVLDNLVPNRPGTGTPAPWAPRLS 60
MFSPGTKRRNLSSRTDRSSATAVSDSPITPLSAARKPVLDNLVPNRPGTGTPAPWAPRLS
Sbjct: 1 MFSPGTKRRNLSSRTDRSSATAVSDSPITPLSAARKPVLDNLVPNRPGTGTPAPWAPRLS 60
Query: 61 VLARISPANRSSKEDETDPVRPVYVGEFPQVVRDEQASLVQQFATSGASMSGGMDAETSL 120
VLARISPANRSSKEDETDPVRPVYVGEFPQVVRDEQASLVQQFATSGASMSGGMDAETSL
Sbjct: 61 VLARISPANRSSKEDETDPVRPVYVGEFPQVVRDEQASLVQQFATSGASMSGGMDAETSL 120
Query: 121 AWIICRDKLFLWTYLLPVATMKCVVRELPKRILESKDIGRNDSDHWLLSIVSWDNQNQSS 180
AWIICRDKLFLWTYLLPVATMKCVVRELPKRILESKDIGRNDSDHWLLSIVSWDNQNQSS
Sbjct: 121 AWIICRDKLFLWTYLLPVATMKCVVRELPKRILESKDIGRNDSDHWLLSIVSWDNQNQSS 180
Query: 181 RKSAKHQSSVGIIICNKKTGAVVYWPDIFSDGGTTPVTCLTSSNEPAVISSIFDGKSTSH 240
RKSAKHQSSVGIIICNKKTGAVVYWPDIFSDGGTTPVTCLTSSNEPAVISSIFDGKSTSH
Sbjct: 181 RKSAKHQSSVGIIICNKKTGAVVYWPDIFSDGGTTPVTCLTSSNEPAVISSIFDGKSTSH 240
Query: 241 RHRSLNKPRTFNSLIASAVPESQYVCVAIACSSNGQLWQYRCSPMGIQCTEVPQDICSIR 300
RHRSLNKPRTFNSLIASAVPESQYVCVAIACSSNGQLWQYRCSPMGIQCTEVPQDICSIR
Sbjct: 241 RHRSLNKPRTFNSLIASAVPESQYVCVAIACSSNGQLWQYRCSPMGIQCTEVPQDICSIR 300
Query: 301 SQEDGSNQHFASDGYPRSLTWSRSHLQPDKFNRKFLLLTDHEIQCFSLKLFPDVQVSKIW 360
SQEDGSNQHFASDGYPRSLTWSRSHLQPDKFNRKFLLLTDHEIQCFSLKLFPDVQVSKIW
Sbjct: 301 SQEDGSNQHFASDGYPRSLTWSRSHLQPDKFNRKFLLLTDHEIQCFSLKLFPDVQVSKIW 360
Query: 361 SYEIVGTDSDLGIKKDLAGQKRIWPLDLQEDERGAVITILVATFCKDRISSSSYIQYSLL 420
SYEIVGTDSDLGIKKDLAGQKRIWPLDLQEDERGAVITILVATFCKDRISSSSYIQYSLL
Sbjct: 361 SYEIVGTDSDLGIKKDLAGQKRIWPLDLQEDERGAVITILVATFCKDRISSSSYIQYSLL 420
Query: 421 TLQYKSGAGIEASGDKRILEKKAPIQVIIPKARVENEDFLFSMRLRVGGKPSGSALILSG 480
TLQYKSGAGIEASGDKRILEKKAPIQVIIPKARVENEDFLFSMRLRVGGKPSGSALILSG
Sbjct: 421 TLQYKSGAGIEASGDKRILEKKAPIQVIIPKARVENEDFLFSMRLRVGGKPSGSALILSG 480
Query: 481 DGTATVSHYYRSSTLLYQFDLPYDAGKVLDASVLPSTEHGEGAWVVLTEKAGIWAIPVKA 540
DGTATVSHYYRSSTLLYQFDLPYDAGKVLDASVLPSTEHGEGAWVVLTEKAGIWAIPVKA
Sbjct: 481 DGTATVSHYYRSSTLLYQFDLPYDAGKVLDASVLPSTEHGEGAWVVLTEKAGIWAIPVKA 540
Query: 541 IVLGGVEPPERSLSRRGSSNERSVQDDTRNLNLSGNIASTRGSFEVQDVVDKKKTTVAGI 600
IVLGGVEPPERSLSRRGSSNERSVQDDTRNLNLSGNIASTRGSFEVQDVVDKKKTTVAGI
Sbjct: 541 IVLGGVEPPERSLSRRGSSNERSVQDDTRNLNLSGNIASTRGSFEVQDVVDKKKTTVAGI 600
Query: 601 GHRVARDEEAEALLRQLFHDFLSSGQVNNSFEKLKNSGAFDREDETNVFTRMSKSIIDTL 660
GHRVARDEEAEALLRQLFHDFLSSGQVNNSFEKLKNSGAFDREDETNVFTRMSKSIIDTL
Sbjct: 601 GHRVARDEEAEALLRQLFHDFLSSGQVNNSFEKLKNSGAFDREDETNVFTRMSKSIIDTL 660
Query: 661 AKHWTTTRGAEIVSMTVVSTQLMDKQQKHEKFLQFLALSKCHEELCSRQRNSLQIILEHG 720
AKHWTTTRGAEIVSMTVVSTQLMDKQQKHEKFLQFLALSKCHEELCSRQRNSLQIILEHG
Sbjct: 661 AKHWTTTRGAEIVSMTVVSTQLMDKQQKHEKFLQFLALSKCHEELCSRQRNSLQIILEHG 720
Query: 721 EKLAAMIQLRELQNTICQNRSTGLGSSSSNSETQMSGGLWDLIQFVGERARRNTVLLMDR 780
EKLAAMIQLRELQNTICQNRSTGLGSSSSNSETQMSGGLWDLIQFVGERARRNTVLLMDR
Sbjct: 721 EKLAAMIQLRELQNTICQNRSTGLGSSSSNSETQMSGGLWDLIQFVGERARRNTVLLMDR 780
Query: 781 DNPEVFYSKVSELEEVFYCLERQLDYVVSADDSCVVQNQRACELSEACVTIMRAAVHYRN 840
DNPEVFYSKVSELEEVFYCLERQLDYVVSADDSCVVQNQRACELSEACVTIMRAAVHYRN
Sbjct: 781 DNPEVFYSKVSELEEVFYCLERQLDYVVSADDSCVVQNQRACELSEACVTIMRAAVHYRN 840
Query: 841 EHQLWYPPSEGLTPWYSQPVVRNGLWHIASLMLQLLNEVSELDTSAKSDLYCCLELLTEV 900
EHQLWYPPSEGLTPWYSQPVVRNGLWHIASLMLQLLNEVSELDTSAKSDLYCCLELLTEV
Sbjct: 841 EHQLWYPPSEGLTPWYSQPVVRNGLWHIASLMLQLLNEVSELDTSAKSDLYCCLELLTEV 900
Query: 901 LLEAHAGAVTAKAERGEKTESLLHEFWSRRDALLSSLYKRVKDSVEAELKDFRGGLVEKN 960
LLEAHAGAVTAKAERGEKTESLLHEFWSRRDALLSSLYKRVKDSVEAELKDFRGGLVEKN
Sbjct: 901 LLEAHAGAVTAKAERGEKTESLLHEFWSRRDALLSSLYKRVKDSVEAELKDFRGGLVEKN 960
Query: 961 VEILRKNSSRLLSVAKQHECYSILWNICCDLNDPELLRKLMHESMGPKGGFSYFVFKRLY 1020
VEILRKNSSRLLSVAKQHECYSILWNICCDLNDPELLRKLMHESMGPKGGFSYFVFKRLY
Sbjct: 961 VEILRKNSSRLLSVAKQHECYSILWNICCDLNDPELLRKLMHESMGPKGGFSYFVFKRLY 1020
Query: 1021 ENKQFSKLLRLGEEFHEELLIFLKEHPDLLWLHELFLHQFFSASDTLHESALSGDDRLVS 1080
ENKQFSKLLRLGEEFHEELLIFLKEHPDLLWLHELFLHQFFSASDTLHESALSGDDRLVS
Sbjct: 1021 ENKQFSKLLRLGEEFHEELLIFLKEHPDLLWLHELFLHQFFSASDTLHESALSGDDRLVS 1080
Query: 1081 PPEVEGEFESDHCNFELRLADRKRLLYLSKIALMAAAAGQNAEYESKLMRIEADAKILKL 1140
PPEVEGEFESDHCNFELRLADRKRLLYLSKIALMA GQNAEYESKLMRIEADAKILKL
Sbjct: 1081 PPEVEGEFESDHCNFELRLADRKRLLYLSKIALMA---GQNAEYESKLMRIEADAKILKL 1140
Query: 1141 QEEILDLYHAVETEQQLDCKLLHPDGLIQLCLKGENPALSLIAFDIFAWTSTSFRETHRK 1200
QEEILDLYHAVETEQQLDCKLLHPDGLIQLCLKGENPALSLIAFDIFAWTSTSFRETHRK
Sbjct: 1141 QEEILDLYHAVETEQQLDCKLLHPDGLIQLCLKGENPALSLIAFDIFAWTSTSFRETHRK 1200
Query: 1201 LLEECWKNAADQDDWNRLYQVSVAEGWSDEETLKKLRETTLFKASSRCYGHGATEVFGDG 1260
LLEECWKNAADQDDWNRLYQVSVAEGWSDEETLKKLRETTLFKASSRCYGHGATEVFGDG
Sbjct: 1201 LLEECWKNAADQDDWNRLYQVSVAEGWSDEETLKKLRETTLFKASSRCYGHGATEVFGDG 1260
Query: 1261 FDVALPLRQENEIAESSSLKNCAGSVEAILMQHKHFPEAGKLMVTAIMLGLDLEDDPILM 1320
FDVALPLRQENEIAESSSLKNCAGSVEAILMQHKHFPEAGKLMVTAIMLGLDLEDDPILM
Sbjct: 1261 FDVALPLRQENEIAESSSLKNCAGSVEAILMQHKHFPEAGKLMVTAIMLGLDLEDDPILM 1318
BLAST of MC04g0262 vs. ExPASy TrEMBL
Match:
A0A6J1GNL3 (nuclear pore complex protein NUP133 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111455584 PE=3 SV=1)
HSP 1 Score: 2328 bits (6034), Expect = 0.0
Identity = 1172/1330 (88.12%), Positives = 1244/1330 (93.53%), Query Frame = 0
Query: 1 MFSPGTKRRNLSSRTDRSSATAVSDSPITPLSAARKPVLDNLVPNRPGTGTPAPWAPRLS 60
MFSPGTKRRN SSRT RS A A+SDSPITP+SA RKPVLDNLVPNRPGTGTPAPWAPRLS
Sbjct: 1 MFSPGTKRRNSSSRTGRSLAPALSDSPITPISAVRKPVLDNLVPNRPGTGTPAPWAPRLS 60
Query: 61 VLARISPANRSSKEDETDPVRPVYVGEFPQVVRDEQASLVQQFATSGASMSGGMDAETSL 120
VLARISP NRS KEDETDPVRPVYVGEFPQVVRDEQASLVQQFATSGAS++GGMDAETSL
Sbjct: 61 VLARISPVNRSDKEDETDPVRPVYVGEFPQVVRDEQASLVQQFATSGASVAGGMDAETSL 120
Query: 121 AWIICRDKLFLWTYLLPVATMKCVVRELPKRILESKDIGRNDSDHWLLSIVSWDNQNQSS 180
AWIICRDKLFLWTYLLPVATMKCVVRELPKRIL+SKDIGRN++DHWLLS+VSWD+QNQSS
Sbjct: 121 AWIICRDKLFLWTYLLPVATMKCVVRELPKRILDSKDIGRNNNDHWLLSVVSWDSQNQSS 180
Query: 181 RKSAKHQSSVGIIICNKKTGAVVYWPDIFSDGGTTPVTCLTSSNEPAVISS---IFDGKS 240
RKS KHQ+SV IIICNKKTGA+VYWPDIFSDGG+TP+TCLTSS+EPA ISS IFDGKS
Sbjct: 181 RKSVKHQNSVAIIICNKKTGAIVYWPDIFSDGGSTPITCLTSSHEPAAISSKTSIFDGKS 240
Query: 241 TSHRHRSLNKPRTFNSLIASAVPESQYVCVAIACSSNGQLWQYRCSPMGIQCTEVPQDIC 300
TS ++SLN+P TFNSLIA+AVP+SQYVCVA+ACSSNGQLWQYRCSPMGIQCTEVPQDIC
Sbjct: 241 TSLGNQSLNRPCTFNSLIAAAVPDSQYVCVALACSSNGQLWQYRCSPMGIQCTEVPQDIC 300
Query: 301 SIRSQEDGSNQHFASDGYPRSLTWSRSHLQPDKFNRKFLLLTDHEIQCFSLKLFPDVQVS 360
+R Q+DGS Q+ SDGYPRSLTWSRSHLQ DKFNRKFLLLTDHEIQCF LKLFPD+QVS
Sbjct: 301 GLRCQDDGSCQNLVSDGYPRSLTWSRSHLQLDKFNRKFLLLTDHEIQCFCLKLFPDLQVS 360
Query: 361 KIWSYEIVGTDSDLGIKKDLAGQKRIWPLDLQEDERGAVITILVATFCKDRISSSSYIQY 420
K+WSYEIVGTDSDLGIKKDLAGQKRIWPLDLQEDE+G VITILVATFCKDRISSSSYIQY
Sbjct: 361 KLWSYEIVGTDSDLGIKKDLAGQKRIWPLDLQEDEQGTVITILVATFCKDRISSSSYIQY 420
Query: 421 SLLTLQYKSGAGIEASGDKRILEKKAPIQVIIPKARVENEDFLFSMRLRVGGKPSGSALI 480
+LLTLQYKSGAGIEASGDKRILEKKAPIQVIIPKARVENE+FLFSMRLRVGGKPSGSALI
Sbjct: 421 TLLTLQYKSGAGIEASGDKRILEKKAPIQVIIPKARVENENFLFSMRLRVGGKPSGSALI 480
Query: 481 LSGDGTATVSHYYRSSTLLYQFDLPYDAGKVLDASVLPSTEHGEGAWVVLTEKAGIWAIP 540
LSGDGTATVSHYYRSSTLLYQFDLPYDAGKVLDASVLPSTEHGEGAWVVLTEKAGIWAIP
Sbjct: 481 LSGDGTATVSHYYRSSTLLYQFDLPYDAGKVLDASVLPSTEHGEGAWVVLTEKAGIWAIP 540
Query: 541 VKAIVLGGVEPPERSLSRRGSSNERSVQDDTRNLNLSGNIASTRGSFEVQDVVDKKKTTV 600
VKAIVLGGVEPPERSLSRRGSSNERSVQDD R+LN SGNIASTRGS EVQDVVD+KK T+
Sbjct: 541 VKAIVLGGVEPPERSLSRRGSSNERSVQDDKRSLNFSGNIASTRGSLEVQDVVDRKKATM 600
Query: 601 AGIGHRVARDEEAEALLRQLFHDFLSSGQVNNSFEKLKNSGAFDREDETNVFTRMSKSII 660
+G+ HR ARDEE+EALLRQLFHDFLSSGQVNNSFEKLKNSGAFDREDETNVFTRMSKSI+
Sbjct: 601 SGMAHRTARDEESEALLRQLFHDFLSSGQVNNSFEKLKNSGAFDREDETNVFTRMSKSIV 660
Query: 661 DTLAKHWTTTRGAEIVSMTVVSTQLMDKQQKHEKFLQFLALSKCHEELCSRQRNSLQIIL 720
DTLAKHWTTTRGAEIVSMTVVSTQL+DKQQKHEKFLQFLALSKCHEELCSRQRNSLQIIL
Sbjct: 661 DTLAKHWTTTRGAEIVSMTVVSTQLIDKQQKHEKFLQFLALSKCHEELCSRQRNSLQIIL 720
Query: 721 EHGEKLAAMIQLRELQNTICQNRSTGLGSSSSNSETQMSGGLWDLIQFVGERARRNTVLL 780
HGEKLAAMIQLRELQNTI QNRS GLGS SSNSET MSG LWDLIQFVGERARRNTVLL
Sbjct: 721 GHGEKLAAMIQLRELQNTIFQNRSNGLGSLSSNSETPMSGALWDLIQFVGERARRNTVLL 780
Query: 781 MDRDNPEVFYSKVSELEEVFYCLERQLDYVVSADDSCVVQNQRACELSEACVTIMRAAVH 840
MDRDN EVFYSKVSELEEVF CLERQLDYVVSAD+S VQNQRACE+S+ACVTIMRAAV
Sbjct: 781 MDRDNTEVFYSKVSELEEVFNCLERQLDYVVSADESYAVQNQRACEISKACVTIMRAAVQ 840
Query: 841 YRNEHQLWYPPSEGLTPWYSQPVVRNGLWHIASLMLQLLNEVSELDTSAKSDLYCCLELL 900
YRNEHQLWYPPSEGLTPWYSQ VVRNGLWHIASLMLQLLNEVSELDTSAKSDLYC LELL
Sbjct: 841 YRNEHQLWYPPSEGLTPWYSQLVVRNGLWHIASLMLQLLNEVSELDTSAKSDLYCYLELL 900
Query: 901 TEVLLEAHAGAVTAKAERGEKTESLLHEFWSRRDALLSSLYKRVKDSVEAELKDFRGGLV 960
TEVLLE+HAGAVTAKAERGEKTE LLHEFWSRRD+LLSSLY+R+KDSVEAE KDFRG LV
Sbjct: 901 TEVLLESHAGAVTAKAERGEKTEGLLHEFWSRRDSLLSSLYQRIKDSVEAEHKDFRGDLV 960
Query: 961 EKNVEILRKNSSRLLSVAKQHECYSILWNICCDLNDPELLRKLMHESMGPKGGFSYFVFK 1020
E+ VE LRK+SSRLL+VAKQHECY+ILW+ICCDLND ELLR LMHESMGPKGGFSYFVFK
Sbjct: 961 EQRVESLRKHSSRLLAVAKQHECYNILWSICCDLNDSELLRNLMHESMGPKGGFSYFVFK 1020
Query: 1021 RLYENKQFSKLLRLGEEFHEELLIFLKEHPDLLWLHELFLHQFFSASDTLHESALSGDDR 1080
+L+ENKQFSKLLRLGEEFHEELLIFLKEHPDLLWLHELFLHQFFSASDTLHE ALS D
Sbjct: 1021 KLHENKQFSKLLRLGEEFHEELLIFLKEHPDLLWLHELFLHQFFSASDTLHELALSEGDG 1080
Query: 1081 LVSPPEVEGEFESDHCNFELRLADRKRLLYLSKIALMAAAAGQNAEYESKLMRIEADAKI 1140
VSPPEVE E ESD CN EL+LADRKRLLYLSKIALMAAAAG+N EY+SKLMRIEADAKI
Sbjct: 1081 PVSPPEVETEVESDRCNLELKLADRKRLLYLSKIALMAAAAGRNTEYDSKLMRIEADAKI 1140
Query: 1141 LKLQEEILDLYHAVETEQQLDCKLLHPDGLIQLCLKGENPALSLIAFDIFAWTSTSFRET 1200
LKLQE ILD A+ETEQ+LDC+LLHPD LIQLCLK +NP LSL+AFDIFAWTSTSFRET
Sbjct: 1141 LKLQEAILDPCLAIETEQKLDCELLHPDRLIQLCLKSKNPTLSLMAFDIFAWTSTSFRET 1200
Query: 1201 HRKLLEECWKNAADQDDWNRLYQVSVAEGWSDEETLKKLRETTLFKASSRCYGHGATEVF 1260
HRKLLEECWKN ADQDDWN+LY+ SVAEGWSDEET++ LRET LFKASSRCYGHGA E+F
Sbjct: 1201 HRKLLEECWKNVADQDDWNQLYEASVAEGWSDEETMRNLRETGLFKASSRCYGHGAAELF 1260
Query: 1261 GDG-FDVALPLRQENEIAESSSLKNCAGSVEAILMQHKHFPEAGKLMVTAIMLGLD---- 1320
G+G FD LPLRQEN + S +K+C GSVEAILMQHKHFPEAGKLMVTAIMLG++
Sbjct: 1261 GEGGFDAVLPLRQEN-LEGSIMVKDCVGSVEAILMQHKHFPEAGKLMVTAIMLGVEDYDN 1320
BLAST of MC04g0262 vs. ExPASy TrEMBL
Match:
A0A6J1GNM9 (nuclear pore complex protein NUP133 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111455584 PE=3 SV=1)
HSP 1 Score: 2322 bits (6018), Expect = 0.0
Identity = 1171/1330 (88.05%), Positives = 1243/1330 (93.46%), Query Frame = 0
Query: 1 MFSPGTKRRNLSSRTDRSSATAVSDSPITPLSAARKPVLDNLVPNRPGTGTPAPWAPRLS 60
MFSPGTKRRN SSRT RS A A+SDSPITP+SA RKPVLDNLVPNRPGTGTPAPWAPRLS
Sbjct: 1 MFSPGTKRRNSSSRTGRSLAPALSDSPITPISAVRKPVLDNLVPNRPGTGTPAPWAPRLS 60
Query: 61 VLARISPANRSSKEDETDPVRPVYVGEFPQVVRDEQASLVQQFATSGASMSGGMDAETSL 120
VLARISP NRS KEDETDPVRPVYVGEFPQVVRDEQASLVQQFATSGAS++GGMDAETSL
Sbjct: 61 VLARISPVNRSDKEDETDPVRPVYVGEFPQVVRDEQASLVQQFATSGASVAGGMDAETSL 120
Query: 121 AWIICRDKLFLWTYLLPVATMKCVVRELPKRILESKDIGRNDSDHWLLSIVSWDNQNQSS 180
AWIICRDKLFLWTYLLPVATMKCVVRELPKRIL+SKDIGRN++DHWLLS+VSWD+QNQSS
Sbjct: 121 AWIICRDKLFLWTYLLPVATMKCVVRELPKRILDSKDIGRNNNDHWLLSVVSWDSQNQSS 180
Query: 181 RKSAKHQSSVGIIICNKKTGAVVYWPDIFSDGGTTPVTCLTSSNEPAVISS---IFDGKS 240
RKS KHQ+SV IIICNKKTGA+VYWPDIFSDGG+TP+TCLTSS+EPA ISS IFDGKS
Sbjct: 181 RKSVKHQNSVAIIICNKKTGAIVYWPDIFSDGGSTPITCLTSSHEPAAISSKTSIFDGKS 240
Query: 241 TSHRHRSLNKPRTFNSLIASAVPESQYVCVAIACSSNGQLWQYRCSPMGIQCTEVPQDIC 300
TS ++SLN+P TFNSLIA+AVP+SQYVCVA+ACSSNGQLWQYRCSPMGIQCTEVPQDIC
Sbjct: 241 TSLGNQSLNRPCTFNSLIAAAVPDSQYVCVALACSSNGQLWQYRCSPMGIQCTEVPQDIC 300
Query: 301 SIRSQEDGSNQHFASDGYPRSLTWSRSHLQPDKFNRKFLLLTDHEIQCFSLKLFPDVQVS 360
+R Q+DGS Q+ SDGYPRSLTWSRSHLQ DKFNRKFLLLTDHEIQCF LKLFPD+QVS
Sbjct: 301 GLRCQDDGSCQNLVSDGYPRSLTWSRSHLQLDKFNRKFLLLTDHEIQCFCLKLFPDLQVS 360
Query: 361 KIWSYEIVGTDSDLGIKKDLAGQKRIWPLDLQEDERGAVITILVATFCKDRISSSSYIQY 420
K+WSYEIVGTDSDLGIKKDLAGQKRIWPLDLQEDE+G VITILVATFCKDRISSSSYIQY
Sbjct: 361 KLWSYEIVGTDSDLGIKKDLAGQKRIWPLDLQEDEQGTVITILVATFCKDRISSSSYIQY 420
Query: 421 SLLTLQYKSGAGIEASGDKRILEKKAPIQVIIPKARVENEDFLFSMRLRVGGKPSGSALI 480
+LLTLQYKSGAGIEASGDKRILEKKAPIQVIIPKARVENE+FLFSMRLRVGGKPSGSALI
Sbjct: 421 TLLTLQYKSGAGIEASGDKRILEKKAPIQVIIPKARVENENFLFSMRLRVGGKPSGSALI 480
Query: 481 LSGDGTATVSHYYRSSTLLYQFDLPYDAGKVLDASVLPSTEHGEGAWVVLTEKAGIWAIP 540
LSGDGTATVSHYYRSSTLLYQFDLPYDAGKVLDASVLPSTEHGEGAWVVLTEKAGIWAIP
Sbjct: 481 LSGDGTATVSHYYRSSTLLYQFDLPYDAGKVLDASVLPSTEHGEGAWVVLTEKAGIWAIP 540
Query: 541 VKAIVLGGVEPPERSLSRRGSSNERSVQDDTRNLNLSGNIASTRGSFEVQDVVDKKKTTV 600
VKAIVLGGVEPPERSLSRRGSSNERSVQDD R+LN SGNIASTRGS EVQDVVD+KK T+
Sbjct: 541 VKAIVLGGVEPPERSLSRRGSSNERSVQDDKRSLNFSGNIASTRGSLEVQDVVDRKKATM 600
Query: 601 AGIGHRVARDEEAEALLRQLFHDFLSSGQVNNSFEKLKNSGAFDREDETNVFTRMSKSII 660
+G+ HR ARDEE+EALLRQLFHDFLSSGQVNNSFEKLKNSGAFDREDETNVFTRMSKSI+
Sbjct: 601 SGMAHRTARDEESEALLRQLFHDFLSSGQVNNSFEKLKNSGAFDREDETNVFTRMSKSIV 660
Query: 661 DTLAKHWTTTRGAEIVSMTVVSTQLMDKQQKHEKFLQFLALSKCHEELCSRQRNSLQIIL 720
DTLAKHWTTTRGAEIVSMTVVSTQL+DKQQKHEKFLQFLALSKCHEELCSRQRNSLQIIL
Sbjct: 661 DTLAKHWTTTRGAEIVSMTVVSTQLIDKQQKHEKFLQFLALSKCHEELCSRQRNSLQIIL 720
Query: 721 EHGEKLAAMIQLRELQNTICQNRSTGLGSSSSNSETQMSGGLWDLIQFVGERARRNTVLL 780
HGEKLAAMIQLRELQNTI QNRS GLGS SSNSET MSG LWDLIQFVGERARRNTVLL
Sbjct: 721 GHGEKLAAMIQLRELQNTIFQNRSNGLGSLSSNSETPMSGALWDLIQFVGERARRNTVLL 780
Query: 781 MDRDNPEVFYSKVSELEEVFYCLERQLDYVVSADDSCVVQNQRACELSEACVTIMRAAVH 840
MDRDN EVFYSKVSELEEVF CLERQLDYVVSAD+S VQNQRACE+S+ACVTIMRAAV
Sbjct: 781 MDRDNTEVFYSKVSELEEVFNCLERQLDYVVSADESYAVQNQRACEISKACVTIMRAAVQ 840
Query: 841 YRNEHQLWYPPSEGLTPWYSQPVVRNGLWHIASLMLQLLNEVSELDTSAKSDLYCCLELL 900
YRNEHQLWYPPSEGLTPWYSQ VVRNGLWHIASLMLQLLNEVSELDTSAKSDLYC LELL
Sbjct: 841 YRNEHQLWYPPSEGLTPWYSQLVVRNGLWHIASLMLQLLNEVSELDTSAKSDLYCYLELL 900
Query: 901 TEVLLEAHAGAVTAKAERGEKTESLLHEFWSRRDALLSSLYKRVKDSVEAELKDFRGGLV 960
TEVLLE+HAGAVTAKAERGEKTE LLHEFWSRRD+LLSSLY+R+KDSVEAE KDFRG LV
Sbjct: 901 TEVLLESHAGAVTAKAERGEKTEGLLHEFWSRRDSLLSSLYQRIKDSVEAEHKDFRGDLV 960
Query: 961 EKNVEILRKNSSRLLSVAKQHECYSILWNICCDLNDPELLRKLMHESMGPKGGFSYFVFK 1020
E+ VE LRK+SSRLL+VAKQHECY+ILW+ICCDLND ELLR LMHESMGPKGGFSYFVFK
Sbjct: 961 EQRVESLRKHSSRLLAVAKQHECYNILWSICCDLNDSELLRNLMHESMGPKGGFSYFVFK 1020
Query: 1021 RLYENKQFSKLLRLGEEFHEELLIFLKEHPDLLWLHELFLHQFFSASDTLHESALSGDDR 1080
+L+ENKQFSKLLRLGEEFHEELLIFLKEHPDLLWLHELFLHQFFSASDTLHE ALS D
Sbjct: 1021 KLHENKQFSKLLRLGEEFHEELLIFLKEHPDLLWLHELFLHQFFSASDTLHELALSEGDG 1080
Query: 1081 LVSPPEVEGEFESDHCNFELRLADRKRLLYLSKIALMAAAAGQNAEYESKLMRIEADAKI 1140
VSPPEVE E ESD CN EL+LADRKRLLYLSKIALMAAA G+N EY+SKLMRIEADAKI
Sbjct: 1081 PVSPPEVETEVESDRCNLELKLADRKRLLYLSKIALMAAA-GRNTEYDSKLMRIEADAKI 1140
Query: 1141 LKLQEEILDLYHAVETEQQLDCKLLHPDGLIQLCLKGENPALSLIAFDIFAWTSTSFRET 1200
LKLQE ILD A+ETEQ+LDC+LLHPD LIQLCLK +NP LSL+AFDIFAWTSTSFRET
Sbjct: 1141 LKLQEAILDPCLAIETEQKLDCELLHPDRLIQLCLKSKNPTLSLMAFDIFAWTSTSFRET 1200
Query: 1201 HRKLLEECWKNAADQDDWNRLYQVSVAEGWSDEETLKKLRETTLFKASSRCYGHGATEVF 1260
HRKLLEECWKN ADQDDWN+LY+ SVAEGWSDEET++ LRET LFKASSRCYGHGA E+F
Sbjct: 1201 HRKLLEECWKNVADQDDWNQLYEASVAEGWSDEETMRNLRETGLFKASSRCYGHGAAELF 1260
Query: 1261 GDG-FDVALPLRQENEIAESSSLKNCAGSVEAILMQHKHFPEAGKLMVTAIMLGLD---- 1320
G+G FD LPLRQEN + S +K+C GSVEAILMQHKHFPEAGKLMVTAIMLG++
Sbjct: 1261 GEGGFDAVLPLRQEN-LEGSIMVKDCVGSVEAILMQHKHFPEAGKLMVTAIMLGVEDYDN 1320
BLAST of MC04g0262 vs. TAIR 10
Match:
AT2G05120.1 (Nucleoporin, Nup133/Nup155-like)
HSP 1 Score: 1407.9 bits (3643), Expect = 0.0e+00
Identity = 753/1326 (56.79%), Positives = 942/1326 (71.04%), Query Frame = 0
Query: 1 MFSPGTKRRNLSSRTDRSSATAV--SDSPITPLSAARKPVLDNLVPNRPGTGTPAPWAPR 60
MFSP TKR SSR +++ V DSP+TP + R +N + +RP TGTPAPWAPR
Sbjct: 1 MFSPLTKRAKQSSRNEKTPRNRVPPPDSPVTPATQNR----NNFISDRPATGTPAPWAPR 60
Query: 61 LSVLARISPANRSSKEDETDPVRPVYVGEFPQVVRDEQASLVQQFATSGASMSGGMDAET 120
LSVLAR+SP N K ++D ++PV+VGEFPQ++RDEQ+ A +SGGMD ET
Sbjct: 61 LSVLARVSPGNNGDKGVDSDQLKPVFVGEFPQLLRDEQS------YPGDACVSGGMDKET 120
Query: 121 SLAWIICRDKLFLWTYLLPVATMKCVVRELPKRIL--ESKDIGRNDSDHWLLSIVSWDNQ 180
L+W I K+F+W++L + + KCVV ELP +L E G D WL+++VSWD
Sbjct: 121 CLSWFITGSKVFVWSHLTTLPSRKCVVLELPVVVLVNEESGSGLQDGKSWLVNVVSWDTS 180
Query: 181 NQSSRKSAKHQSSVGIIICNKKTGAVVYWPDIFSDGGTTPVTCLTSSNEPAVISSIFDGK 240
++ ++++ +S VG+++CN+KT AVVYW DIFS P + +I +G
Sbjct: 181 AGAATRASRSRSPVGVVMCNRKTRAVVYWSDIFSGQEAAP-----AEKARHLIKRQSNGI 240
Query: 241 STSHRHRSLNKPRTFNSLIASAVPESQYVCVAIACSSNGQLWQYRCSPMGIQCTEVPQDI 300
+S S NSLI +AV ++ +C+AIACSSNG+LWQ+ CSP G++ +V +I
Sbjct: 241 RSSRAENS-----DLNSLITTAVAAAERLCIAIACSSNGELWQFTCSPTGVKSNQVQLNI 300
Query: 301 CSIRSQEDGSNQHFASDGYPRSLTWSRSHLQPDKFNRKFLLLTDHEIQCFSLKLFPDVQV 360
S S+GYPRSL W S + +FL+LTD +I CF+++ +PD+ V
Sbjct: 301 SS----------SSVSEGYPRSLIWRFSQGLARESCWEFLMLTDCDIHCFTIEPYPDLTV 360
Query: 361 SKIWSYEIVGTDSDLGIKKDLAGQKRIWPLDLQEDERGAVITILVATFCKDRISSSSYIQ 420
S++W +EIVGTD D GIKKD+A QK+IWPLDLQ D++G VIT+LVAT C DR SSSSY Q
Sbjct: 361 SEVWQHEIVGTDGDSGIKKDIASQKQIWPLDLQVDDQGKVITVLVATICMDRASSSSYTQ 420
Query: 421 YSLLTLQYKSGAGIEASGDKRILEKKAPIQVIIPKARVENEDFLFSMRLRVGGKPSGSAL 480
YSLLTLQ+KS ++++LEK+ PIQVIIPKARVE++DFLFSMRLRVGG+P GSA+
Sbjct: 421 YSLLTLQHKSEMRFADGREEKVLEKQGPIQVIIPKARVEDKDFLFSMRLRVGGRPPGSAI 480
Query: 481 ILSGDGTATVSHYYRSSTLLYQFDLPYDAGKVLDASVLPST-EHGEGAWVVLTEKAGIWA 540
ILSGDGTATV + + SST LY+FDLPYDAGKVLDASVL ST EH GAW VLTEKAG+WA
Sbjct: 481 ILSGDGTATVCYCHGSSTRLYKFDLPYDAGKVLDASVLSSTDEHEYGAWTVLTEKAGVWA 540
Query: 541 IPVKAIVLGGVEPPERSLSRRGSSNERSVQDDTRNLNLSGNIASTRGSFEVQDVVDKKKT 600
IP KA+VLGGVEPPERSLSR+ SSNERS +D+TR + + R + ++Q++ DK
Sbjct: 541 IPEKAVVLGGVEPPERSLSRKNSSNERSTRDETRVTPYGVDRTAGRENSDIQNIEDKGNP 600
Query: 601 TVAGIGHRVARDEEAEALLRQLFHDFLSSGQVNNSFEKLKNSGAFDREDETNVFTRMSKS 660
+ G + ARDEE+EALL QLF FL SG+V+ S EKL SGAFDR+ E NVF R SKS
Sbjct: 601 KM-GFTRQTARDEESEALLGQLFEGFLLSGKVDGSLEKLSQSGAFDRDGEANVFARKSKS 660
Query: 661 IIDTLAKHWTTTRGAEIVSMTVVSTQLMDKQQKHEKFLQFLALSKCHEELCSRQRNSLQI 720
I+DTLAKHWTTTRGAEIV+MTV+S+QL++KQQKHE FL FLALSKCHEELCS+QR+SLQI
Sbjct: 661 IVDTLAKHWTTTRGAEIVAMTVISSQLVEKQQKHENFLHFLALSKCHEELCSKQRHSLQI 720
Query: 721 ILEHGEKLAAMIQLRELQNTICQNRSTGLGSSSSNSETQMSGGLWDLIQFVGERARRNTV 780
ILE+GEKLAAMIQLRELQN I QNRS GS + SE Q+S LWDLIQFVGERARRNTV
Sbjct: 721 ILENGEKLAAMIQLRELQNMINQNRSARFGSPQAGSEDQVSCALWDLIQFVGERARRNTV 780
Query: 781 LLMDRDNPEVFYSKVSELEEVFYCLERQLDYVVSADDSCVVQNQRACELSEACVTIMRAA 840
LLMDRDN EVFYSKVSELEEVFYCL RQL+Y++ AD Q QRACELS ACVTI++ A
Sbjct: 781 LLMDRDNAEVFYSKVSELEEVFYCLNRQLEYIIRADQPLGTQLQRACELSNACVTILQTA 840
Query: 841 VHYRNEHQLWYPPSEGLTPWYSQPVVRNGLWHIASLMLQLLNEVSELDTSAKSDLYCCLE 900
+ Y+NEHQ+WYPP EGL PW+SQ VV NGLW IAS ML LL E S +D SAKSD+Y LE
Sbjct: 841 LDYKNEHQMWYPPLEGLIPWHSQTVVCNGLWCIASFMLHLLTEASRIDISAKSDIYTHLE 900
Query: 901 LLTEVLLEAHAGAVTAKAERGEKTESLLHEFWSRRDALLSSLYKRVKDSVEAELKDFRGG 960
+LTEVLLEA AG+ AK ER E+ + LL+E+W+RRD + SLY++ K+ +EAE++ R
Sbjct: 901 VLTEVLLEACAGSTFAKLEREEENKGLLNEYWTRRDTIFDSLYRQAKEFMEAEIQGIRER 960
Query: 961 LVEKNVEILRKNSSRLLSVAKQHECYSILWNICCDLNDPELLRKLMHESMGPKGGFSYFV 1020
+ +I R S L+S+AK+H Y I+W IC DLND LLR LMHE +GP+GGFSYFV
Sbjct: 961 TEATDEDIFRNRCSNLISIAKRHAGYKIMWKICYDLNDTGLLRNLMHEGVGPQGGFSYFV 1020
Query: 1021 FKRLYENKQFSKLLRLGEEFHEELLIFLKEHPDLLWLHELFLHQFFSASDTLHESALSGD 1080
F++LY+ KQFSKLLRLGEEF +ELLIFLK H DL+WLH++FLHQF SASDTLH ALS D
Sbjct: 1021 FQQLYDMKQFSKLLRLGEEFQDELLIFLKRHSDLVWLHQVFLHQFSSASDTLHTLALSQD 1080
Query: 1081 DRLVSPPEVEGEFESDHCNFELRLADRKRLLYLSKIALMAAAAGQNAEYESKLMRIEADA 1140
+ S VE + + + ADRKR L LSKIA + A ++A+ ESK+ RIEAD
Sbjct: 1081 EE--SMTTVEERTGPEPEDVQPTFADRKRFLNLSKIAYV---ADKDADSESKVKRIEADL 1140
Query: 1141 KILKLQEEILDLYHAVETEQQLDCKLLHPDGLIQLCLKGENPALSLIAFDIFAWTSTSFR 1200
+LKLQEEI E +L P+ LI+ CL + ++ AF++FAWTS+SFR
Sbjct: 1141 NLLKLQEEITKALPNGEARN----RLFRPEELIETCLNIQGRWTAIKAFEVFAWTSSSFR 1200
Query: 1201 ETHRKLLEECWKNAADQDDWNRLYQVSVAEGWSDEETLKKLRETTLFKASSRCYGHGATE 1260
E HR LLEECW+NAADQDDW+R +Q S EGWS+EETL+ LR T LF+AS RCYG
Sbjct: 1201 ENHRSLLEECWRNAADQDDWDRHHQASTNEGWSEEETLQNLRNTALFQASKRCYGPTRVN 1260
Query: 1261 VFGDGFDVALPLRQENEIAESSSLKNCAGSVEAILMQHKHFPEAGKLMVTAIMLGLDLED 1320
F F LPLR+EN ++ SVE +LM HK F EAGKLM+TAIMLG +E+
Sbjct: 1261 TFDGDFAQVLPLRRENP-------EDSTSSVEDVLMSHKDFAEAGKLMLTAIMLGC-VEE 1278
Query: 1321 DPILME 1322
+ I+ E
Sbjct: 1321 EGIVAE 1278
BLAST of MC04g0262 vs. TAIR 10
Match:
AT2G05120.2 (Nucleoporin, Nup133/Nup155-like)
HSP 1 Score: 1318.5 bits (3411), Expect = 0.0e+00
Identity = 722/1326 (54.45%), Positives = 905/1326 (68.25%), Query Frame = 0
Query: 1 MFSPGTKRRNLSSRTDRSSATAV--SDSPITPLSAARKPVLDNLVPNRPGTGTPAPWAPR 60
MFSP TKR SSR +++ V DSP+TP + R +N + +RP TGTPAPWAPR
Sbjct: 1 MFSPLTKRAKQSSRNEKTPRNRVPPPDSPVTPATQNR----NNFISDRPATGTPAPWAPR 60
Query: 61 LSVLARISPANRSSKEDETDPVRPVYVGEFPQVVRDEQASLVQQFATSGASMSGGMDAET 120
LSVLAR+SP N K ++D ++PV+VGEFPQ++RDEQ+ A +SGGMD ET
Sbjct: 61 LSVLARVSPGNNGDKGVDSDQLKPVFVGEFPQLLRDEQS------YPGDACVSGGMDKET 120
Query: 121 SLAWIICRDKLFLWTYLLPVATMKCVVRELPKRIL--ESKDIGRNDSDHWLLSIVSWDNQ 180
L+W I K+F+W++L + + KCVV ELP +L E G D WL+++VSWD
Sbjct: 121 CLSWFITGSKVFVWSHLTTLPSRKCVVLELPVVVLVNEESGSGLQDGKSWLVNVVSWDTS 180
Query: 181 NQSSRKSAKHQSSVGIIICNKKTGAVVYWPDIFSDGGTTPVTCLTSSNEPAVISSIFDGK 240
++ ++++ +S VG+++CN+KT AV + +I +G
Sbjct: 181 AGAATRASRSRSPVGVVMCNRKTRAV-------------------AEKARHLIKRQSNGI 240
Query: 241 STSHRHRSLNKPRTFNSLIASAVPESQYVCVAIACSSNGQLWQYRCSPMGIQCTEVPQDI 300
+S S NSLI +AV ++ +C+AIACSSNG+LWQ+ CSP G++ +V +I
Sbjct: 241 RSSRAENS-----DLNSLITTAVAAAERLCIAIACSSNGELWQFTCSPTGVKSNQVQLNI 300
Query: 301 CSIRSQEDGSNQHFASDGYPRSLTWSRSHLQPDKFNRKFLLLTDHEIQCFSLKLFPDVQV 360
S S+GYPRSL W S + +FL+LTD +I CF+++ +PD+ V
Sbjct: 301 SS----------SSVSEGYPRSLIWRFSQGLARESCWEFLMLTDCDIHCFTIEPYPDLTV 360
Query: 361 SKIWSYEIVGTDSDLGIKKDLAGQKRIWPLDLQEDERGAVITILVATFCKDRISSSSYIQ 420
S++W +EIVGTD D GIKKD+A QK+IWPLDLQ D++G VIT+LVAT C DR SSSSY Q
Sbjct: 361 SEVWQHEIVGTDGDSGIKKDIASQKQIWPLDLQVDDQGKVITVLVATICMDRASSSSYTQ 420
Query: 421 YSLLTLQYKSGAGIEASGDKRILEKKAPIQVIIPKARVENEDFLFSMRLRVGGKPSGSAL 480
YSLLTLQ+KS ++++LEK+ PIQVIIPKARVE++DFLFSMRLRVGG+P GSA+
Sbjct: 421 YSLLTLQHKSEMRFADGREEKVLEKQGPIQVIIPKARVEDKDFLFSMRLRVGGRPPGSAI 480
Query: 481 ILSGDGTATVSHYYRSSTLLYQFDLPYDAGKVLDASVLPST-EHGEGAWVVLTEKAGIWA 540
ILSGDGTATV + + SST LY+FDLPYDAGKVLDASVL ST EH GAW VLTEKAG+WA
Sbjct: 481 ILSGDGTATVCYCHGSSTRLYKFDLPYDAGKVLDASVLSSTDEHEYGAWTVLTEKAGVWA 540
Query: 541 IPVKAIVLGGVEPPERSLSRRGSSNERSVQDDTRNLNLSGNIASTRGSFEVQDVVDKKKT 600
IP KA+VLGGVEPPERSLSR+ SSNERS +D+TR + + R + ++Q++ DK
Sbjct: 541 IPEKAVVLGGVEPPERSLSRKNSSNERSTRDETRVTPYGVDRTAGRENSDIQNIEDKGNP 600
Query: 601 TVAGIGHRVARDEEAEALLRQLFHDFLSSGQVNNSFEKLKNSGAFDREDETNVFTRMSKS 660
+ G + ARDEE+EALL QLF FL SG+V+ S EKL SGAFDR+ E NVF R SKS
Sbjct: 601 KM-GFTRQTARDEESEALLGQLFEGFLLSGKVDGSLEKLSQSGAFDRDGEANVFARKSKS 660
Query: 661 IIDTLAKHWTTTRGAEIVSMTVVSTQLMDKQQKHEKFLQFLALSKCHEELCSRQRNSLQI 720
I+DTLAKHWTTTRGAEIV+MTV+S+QL++KQQKHE FL FLALSKCHEELCS+QR+SLQI
Sbjct: 661 IVDTLAKHWTTTRGAEIVAMTVISSQLVEKQQKHENFLHFLALSKCHEELCSKQRHSLQI 720
Query: 721 ILEHGEKLAAMIQLRELQNTICQNRSTGLGSSSSNSETQMSGGLWDLIQFVGERARRNTV 780
ILE+GEKLAAMIQLRELQN I QNRS GS + SE Q+S LWDLIQFVGERARRNTV
Sbjct: 721 ILENGEKLAAMIQLRELQNMINQNRSARFGSPQAGSEDQVSCALWDLIQFVGERARRNTV 780
Query: 781 LLMDRDNPEVFYSKVSELEEVFYCLERQLDYVVSADDSCVVQNQRACELSEACVTIMRAA 840
LLMDRDN EVFYSKVSELEEVFYCL RQL+Y++ AD Q QRACELS ACVTI++ A
Sbjct: 781 LLMDRDNAEVFYSKVSELEEVFYCLNRQLEYIIRADQPLGTQLQRACELSNACVTILQTA 840
Query: 841 VHYRNEHQLWYPPSEGLTPWYSQPVVRNGLWHIASLMLQLLNEVSELDTSAKSDLYCCLE 900
+ Y+NEHQ+WYPP EGL PW+SQ VV NGLW IAS ML LL E S +D SAKSD+Y LE
Sbjct: 841 LDYKNEHQMWYPPLEGLIPWHSQTVVCNGLWCIASFMLHLLTEASRIDISAKSDIYTHLE 900
Query: 901 LLTEVLLEAHAGAVTAKAERGEKTESLLHEFWSRRDALLSSLYKRVKDSVEAELKDFRGG 960
+LTEVLLEA AG+ AK ER E+ + LL+E+W+RRD + SLY++ K+ +EAE++
Sbjct: 901 VLTEVLLEACAGSTFAKLEREEENKGLLNEYWTRRDTIFDSLYRQAKEFMEAEIQ----- 960
Query: 961 LVEKNVEILRKNSSRLLSVAKQHECYSILWNICCDLNDPELLRKLMHESMGPKGGFSYFV 1020
HE +GP+GGFSYFV
Sbjct: 961 ----------------------------------------------HEGVGPQGGFSYFV 1020
Query: 1021 FKRLYENKQFSKLLRLGEEFHEELLIFLKEHPDLLWLHELFLHQFFSASDTLHESALSGD 1080
F++LY+ KQFSKLLRLGEEF +ELLIFLK H DL+WLH++FLHQF SASDTLH ALS D
Sbjct: 1021 FQQLYDMKQFSKLLRLGEEFQDELLIFLKRHSDLVWLHQVFLHQFSSASDTLHTLALSQD 1080
Query: 1081 DRLVSPPEVEGEFESDHCNFELRLADRKRLLYLSKIALMAAAAGQNAEYESKLMRIEADA 1140
+ S VE + + + ADRKR L LSKIA + A ++A+ ESK+ RIEAD
Sbjct: 1081 EE--SMTTVEERTGPEPEDVQPTFADRKRFLNLSKIAYV---ADKDADSESKVKRIEADL 1140
Query: 1141 KILKLQEEILDLYHAVETEQQLDCKLLHPDGLIQLCLKGENPALSLIAFDIFAWTSTSFR 1200
+LKLQEEI E +L P+ LI+ CL + ++ AF++FAWTS+SFR
Sbjct: 1141 NLLKLQEEITKALPNGEARN----RLFRPEELIETCLNIQGRWTAIKAFEVFAWTSSSFR 1200
Query: 1201 ETHRKLLEECWKNAADQDDWNRLYQVSVAEGWSDEETLKKLRETTLFKASSRCYGHGATE 1260
E HR LLEECW+NAADQDDW+R +Q S EGWS+EETL+ LR T LF+AS RCYG
Sbjct: 1201 ENHRSLLEECWRNAADQDDWDRHHQASTNEGWSEEETLQNLRNTALFQASKRCYGPTRVN 1213
Query: 1261 VFGDGFDVALPLRQENEIAESSSLKNCAGSVEAILMQHKHFPEAGKLMVTAIMLGLDLED 1320
F F LPLR+EN ++ SVE +LM HK F EAGKLM+TAIMLG +E+
Sbjct: 1261 TFDGDFAQVLPLRRENP-------EDSTSSVEDVLMSHKDFAEAGKLMLTAIMLGC-VEE 1213
Query: 1321 DPILME 1322
+ I+ E
Sbjct: 1321 EGIVAE 1213
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
F4IGA5 | 0.0e+00 | 56.79 | Nuclear pore complex protein NUP133 OS=Arabidopsis thaliana OX=3702 GN=NUP133 PE...
Match Name | E-value | Identity | Description
XP_022135833.1 | 0.0 | 100.00 | nuclear pore complex protein NUP133 isoform X1 [Momordica charantia]
XP_022135834.1 | 0.0 | 99.92 | nuclear pore complex protein NUP133 isoform X2 [Momordica charantia]
XP_022135835.1 | 0.0 | 99.77 | nuclear pore complex protein NUP133 isoform X3 [Momordica charantia]
XP_038887917.1 | 0.0 | 88.69 | nuclear pore complex protein NUP133 isoform X1 [Benincasa hispida]
XP_023554346.1 | 0.0 | 88.12 | nuclear pore complex protein NUP133 [Cucurbita pepo subsp. pepo]
Match Name | E-value | Identity | Description
A0A6J1C5Z7 | 0.0 | 100.00 | nuclear pore complex protein NUP133 isoform X1 OS=Momordica charantia OX=3673 GN...
A0A6J1C2K7 | 0.0 | 99.92 | nuclear pore complex protein NUP133 isoform X2 OS=Momordica charantia OX=3673 GN...
A0A6J1C261 | 0.0 | 99.77 | nuclear pore complex protein NUP133 isoform X3 OS=Momordica charantia OX=3673 GN...
A0A6J1GNL3 | 0.0 | 88.12 | nuclear pore complex protein NUP133 isoform X1 OS=Cucurbita moschata OX=3662 GN=...
A0A6J1GNM9 | 0.0 | 88.05 | nuclear pore complex protein NUP133 isoform X2 OS=Cucurbita moschata OX=3662 GN=...
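The Identity column in the tables above appears to be the identical-residue count divided by the alignment length, as reported on each hit's "Identity = n/N (p%)" line (for example 1171/1330 for A0A6J1GNM9 and 753/1326 for AT2G05120.1). The sketch below recomputes those percentages from a header line. The function name parse_hsp_header and the regular expression are illustrative assumptions, not part of BLAST or of this database.

```python
import re

# Hypothetical helper for the "Identity = n/N (p%), Positives = m/N (q%)" lines
# shown on this page. The pattern also accepts the "Postives" spelling that
# some exports of these pages contain.
HEADER_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), Pos[a-z]+ = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_header(line):
    """Return the counts and recomputed percentages from one HSP header line."""
    m = HEADER_RE.search(line)
    if m is None:
        raise ValueError("not an HSP identity line")
    ident, length, ident_pct, pos, pos_length, pos_pct = m.groups()
    return {
        "identical": int(ident),
        "positives": int(pos),
        "length": int(length),
        # Percentages recomputed from the raw counts, rounded to two decimals.
        "identity_pct": round(100 * int(ident) / int(length), 2),
        "positives_pct": round(100 * int(pos) / int(pos_length), 2),
        # Percentages as printed on the page, kept for comparison.
        "reported_identity_pct": float(ident_pct),
        "reported_positives_pct": float(pos_pct),
    }


if __name__ == "__main__":
    hit = parse_hsp_header(
        "Identity = 1171/1330 (88.05%), Positives = 1243/1330 (93.46%), Query Frame = 0"
    )
    print(hit["identity_pct"], hit["positives_pct"])
```

Comparing identity_pct with reported_identity_pct is a quick consistency check when copying values from the alignments into a summary table.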