Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: polypeptideCDSinitialstart_codonintroninternalterminalstop_codon
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCGTCCGTTGATTCGCGGCATTCCATTTCTTTAACTCCTATTATATTGGAAGACTCTTACGAAGGGGAGCATGTTGAAACCAACGATTACTACTTCGAAAAGATCGGCGAACCTGTTCCTGTCAAGCTCAATGACTCCATTTTTGATCCTGGAAGTCCTCCTTCCCAGCCTCTTGCTGTGTCTGAGAGTTTTGGTCTCATATTCGTTGCCCATTCGTCTGGTTGGTAATTTCAATTGCTTCCCCTTTGTTGTGAGTACTGTTGTTTTTCTGTGACTTTTTTTTGAAGAATTTTGTTTGTTTAGGGTTTTTTGTGGTGAGGACCAAGGATGTAGTTGCTTCAGCTAAGGAGATGAAAAACGGGGGAACTGGTTCTTCAATCCAGGATTTGAGTATTGTGGATGTTTCCGTCGGGAAAGTTCATGTTCTTGCACTTTCCAATGATAATTCTTTTCTTGCTGCCGTCGTAGCTGGTGATGTTCGTCTTTTTTCAGTTGACTCGCTGCTTGATAAGGTAGTGCTTTTAGCTGAAGCTTGTCTTAATTTCAAAGCGCCGTTCCCCTGAAATGTCATTTTGGTTATTTGCATCAATGGGGAAAATATCACAATCATTGGTTATGTACGTTACTGAATTAAATTGCTGCACGGAGTAATTACATGGTTTTTCTCTCTGGAAGTTACAGTAGTATTTTGATGAAACCTCTAACTCGCCTTAAGATCGAAATCCTCTAATCTCTCAGCAAGTTTTCCATTTATTTAACGCTGTTTCACTCTTATTGCTAATGGTTTTTTGTTTTGTTTAGTTATTTTTCTTGCACGTCATTTTCTTATATTTCATTGCTGCTATGCAGGCTGAAAAACCCTATTTCTCTTGTTCAACAACTGATTCCAGTTGCATCAAAGACTTCAAATGGACCAGAAAGCCGGAAAATTCTTATCTGGTTCTTTCAAAACATGGAAAGTTATACCAAGGATCGGCTAGTGGTCCTTTTAAACATATCATGCACGATATTGATGCTGGTACGCTGTGTACTTTTGTACAGTAACTTATGTATATAATTCTTAATGGCAAATTTAAGTGGCGTTTGATAAGTTTCTTACATTTTATGTTAAATGACTGTATATTTCATTGACGATGGTGAGGACTATTGTTTTCACACTTCTATTAAACTCGGTTTGCTCATCAAAATACATTATTTAATATTGAAATATTGCATGTTGTTGCTTTTGCCCCAATTTTTTTTATCCTCGAGTAATAGGTAGTTTATCTTTTAGCTCGTGTTTCAATGTGTATATCAATGTATTATCTTATATGGTGGTGCAAACATCATATTTTCCGTCTACACCATTTGTCAGAAGCATCACACTCTGCAGTAAAAAGCTTCAAAATGTCAGTTAAAAAAAGTACAGTTGCCATGGTCATGGATATCTTGGGCTATTAAAAGGCTTATTTAGAATTTCTACTTCAAGAAGACGAATCTTTTTACTACGTATATGCAAGGAGAGATGCTGTATTTCCCTGAACTTTCTATTTTTTCCTGTGAAATTATCCAGTTCTGGGCGTCCCTTCATAAAAATTAGACGAGCTCTCTAAAATTTGGATCTTTCTACTCAATTCTTCTAGTTACTTCATCTATTGTATATCCTTGTATATTTCATTTAATCAGTGAGACTTTGTTTGTCCATATATGTTTTGTCCTTGTATAGTGAACTATTTTAGTTATTGTATTCATTTTGTTCTTTTTTCCCCCCCACATATACTATGCAGTGTCTTTGGCAGAGAAGTTCTTATTTGTTTGTATTTTCTTGCCCCTGCTTTTTTGCTAATTTATTATTATTATTATTTTATGAAAGCAATCCATTTACATGAATTATTATGAGCAAAGCTTATATATGTGGAGCTGCTATTTTTGAACATTTTAAAAATCTTATTTTGGATTTTAGTTGAATGCAGTGTGAAAGGAAAATTCATTGCTGTGGCTAAAAAGGACACTCTTTCCATTTTCTCATATAAATTCAAGGAACGACTGTCCATGTCACTCTTGCCAAGTTCAGGGAATGGTGACACTGATACGGACTTTGCAGTGAAAGGTTTTCTCTCTCTCTCTCTCTCTCTCTCTCTCTGTGCGCGTGTGTGTGTGTAAAAATGATGTCTGTAAAGTGTTGAAACGTGAAGGTTTCATAACTTTCATCATTCTTTTTATTTTTCCTTTAGGTGAGTGATGTCTCTTATATTTTACTTTTGTGATCTATCAGAGTGGAACACAAACTTAGTGCTTCTAAGGTTAGCTACAAGTTGAGTTTGAGTTGTTAGCATGGAAAGGAGTACCTTGCGTCCTTCCTCAGTAATCTCAGCATACTTTTTTATTCTTATTTTAACCCTATTTAGTTTGGTTACTATCGTGGGATAGATTTGATTGCCAATGATTAGTGGTAAAGGAAAAAAAAATTCTTGATTGGGATAACTGTCATTGTTATATGGGGGTTATTGACTGTTGGATGATGAAAGTTCCACATCGGCTAATTTAGGGAATGATCATGGGGTTTATGATAAAAAAATACTCTCTCCATTGGTATGAGGCCTTTTGGGGAAGCCTAAAGCAAAGCCATGAAAGCTTATGCTCAAAGTGGACAATATCATACCATTGTGGAGAGTCGTGTTCATCTAACATGGTACCAGAGTCATGCCCTAAACTTAGTCGTGCCAATAGATTGGTAAATCCTCAAATGTCGAACAAAGGACTCCAAAAGAAAAGGAGTCGAGTGTCCTCGAAGGCATAGTAAAAAACGACTAAGACTCCAAAAAAAAAAAAGGAGTTGAGCCTCGATTAAGGGGAAGCGTACTTTGTTCGAAGGGAGGTGTTGGATGATGAAAGTCCCACATTGACTAATTTAGGGAATGATCATGGGGTTTATGATCAAAGAATACTCTTTCCATTGGTATGAGGCCTTTTGGGGAAGTCCAAAGCAAAGCCACGAGAGCTTATGCTCAAAGTGGACAATATCATACTATTGTGGAGAGTCGTGTTCATCTAACATTGACTAGAGAGAAATCATGGAAATTTTGAAGACATTGCTTCTTCTCCCATGCAATTTTAGATTGGGATAATTGTCATTGTCATTAGTTATTTATTTAAATTTTCAAATATTATTTTTCGCGGTGGTGGAGGAACTGGATCGGAAAAAAGAGGGATTCCAGTTGTAAGATAAGCCAAAGTTTCATTGAGATAAGTGAAAAAACACTAACAAGCTTACAAACTAAGACAAAAGGAGCTAAAATAAATCTCCAATTGCAGGGTTTTGGGTACTCTATGGCTAGTCTGGGAATTTGCCTCTTCCTTCCGCTTATGTAATTCTTTTTAATTACTATGTCGTTCCTTCTCCAAAATAAAAATATAACTAAATGAGGTTTCCAATAGAACAAGGAATTATTATACAAGTTAAATTCTAACTAAATTGTAGTAATCCAGAAAATTTAAGAAAAGTAAAAAACAAATATCTGAAAACTAATGATAATATTATGTAATAAACAACAAAAATACTTTTTTTACTCTTAATGACAACCTTATATTTATCATTCTAGAGAATACTTCTGGTACGATGTTTTAACTTGATTTTTCCTGATTTGTCAGTTGACTCTATCAAGTGGGTTCGTGCTGATTGTATCATTATAGGATGCTTTCAAGTGGCTGCAACAGGCGATGAAGAAGATTACTTTGTCCAAGTTATCAGAAGTAAAGATGGAAAAATTACTGACGTGAGTAGCTGTGATTTTCTTTATTTCTTTTTCTTTTTATTAATTTTTTCAATTTTTTTCTCCTTTTTCCATTTCTATTTTCGTGAGGAAGGGTTGTGTTCATGATTGTGCTTATACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTCTGTTTTATGGAGCATTTGGCTTGAAAGGAATGAAATAATTTTTAGAGGCTATAAGAGGTTTTGGAAAGAGGTGTGAGATCTTTTGGAAAGAGGTGTGAGATCTTGCCAAATTTAGGCATTTGTAAATTAAAAATTTAAGGACCTTTGTAATTATCATAGTCTAAGCCTTGTTCTTCTGCACGGAGTCCCTTTTTTTGGTTAGGTCCATTGTTGTTGGAATAATTTCTTTTGGCTGTTGTTTTTTTGCTAGCCCTTTTATATTCCTTCTTTCTTATTGAAACCTTGGTTTCTTGATAAAAAAAATCTCGTACACTAGTTTAATTGAGACATAAGTTGGTTTACCAATTGGTATAGTTTATCTCATTGGGAAATAAAAACATTTATAGGAGAAGTAAATCCTTGAATGTGGAAAGAAAAATAATGCGCTTGATTGTTTCTTTCCAAGGATTCTTGCCAGAACTTGTACCCCATTAGAGAACAACTCAAAAGGATGAGGACCGTGCTTACTCACAATAATCCTACAGCAAAGGGAATTTGATTCGAGAGGGGTCACGAAAGCCATATTCCTAAGAGAGCTCTATAACAAACTCAAGATTCCCAAGTTCTAATCCACTAAGATTGACCGACTTTCCCATTGTCTCCCAATTCACTAGGTGCGCTCCTCCCCTCTCTTCAACCTCTGCCCAAAGGAAATCTCGCATTGACTTCTTATTAACCTTATACACTCTGCTAGGAGAATGAAAAAGGGAAAAGAAGTAAATCGGGATACTACTAATTACTGAGCTAATGAGAGTTGCTGTATCACCTTTACAGAAAAAGGTAGTTGAGCTTTATTAGATTGGCCATGTGCAGTCTTCTTAGTAGCAAATGAGTAGTTTAGATCAAAACAAGGTTGATGGTTGTTTGAGGGAACCTTAGCAGTCGTTTTCAAGGAAATGATTGCTAAGGTTACGAATACATTGATTAAGAAGTGAGGAAGATCATGAATAATTTTTTTTCTTTCTCGTGTCTTGACAAAATCTTATATCTGTGGATGGTGTATTATGTTGTTATTATTATTATTAATATTTTTACCCCAAGCATTTTTTATATATATTGAGAACGTTTGCATACTTTCCCTGTAATTAGTGCCCTAGATTGAAGATCAATCACCTTCCTTTTATAGGTTATAAATCTGTTACTTATTGTTCTGATCTTCTGAATGATAGTAACAGCCGAGAGTTATAATAATTTTCGTATTTGTTTCCACAAACTCATACTAAATTGAATGTAATTTGCTTAACAAGAATTATGAAACAGTATCATTAACTGATCAACATGTTGTAGTGAATGTGTTATTGATTTATTTATTTTATATGTATTTTGGTAGATTAATTTTTGTCACCCTCTTTTGTAGGTTTCCTCAAATAAAGTCTTGTTATCGTTCCATGATATATATTCAGGTTTCACTCCAGACATTTTGCCTGTTGAAACTGGGCCTTGTTTATTATTGAGTTATTTAGATAAATGGTATGTAGAAATCACTTGGTCTCATTAAAGCTCTGAATTGAATTTTACATGGAATTCCTCTTTTTTTATTTTTATTTAGAGAACTTTTCCCGTTACTTTGGGGTTATTCTTTAATTCTGATCCTACATGTCAACTTTATTTATTACCTTTCTTGATTTGGGTGCAGCAAGCTCGCAATTGTTGCCAATAGGAACAATACAGATCAGCATATTGTGTTGCTTGGTTGGTTGCAAGAGGTTGAGAATGAAGTTGCTGTTATTGATATTGAAAGAGATAAGTCACTCCCGAGGATTGAGCTTCAAGGTTAGGGATCTCGTGATTTGACTACCACAGTTCATTAGCTTATCTTGTTTCCATTTCCCCCCTCCCTAGGAATGTGTCCCATTTTAATGCATTGAGTCATCAAATGAGCTATGATTTGCATTTCCTAATATAAATGTAGTCTAAAAGACATTTCTCTGGATGTGTATTTGTGTATCATCTCTTTAGAGTTTAATTGTTTGAAGAATCAAATAAGAAACATATTTTCTTTATATTATATTATGACTTTTCCTGCCTTGTGATTTTTTGATCTATTTATGGCTTCCTTAGAGTTGGTTTTTCATGACTTAAATATTCTCTTGGATTGGAGTCCTTTTCTTGTTTGTTAGGCCAGTCCTGTTTGGCTACTTTTAGGCTGTTTTTATTTTAGTCTTGTTGGCCAATGTTGGTCCTCTTTTATTCTTTCTATTTTCTTCTAAATGAAAGTTTGGCTTCTCAATACAAAATGATTACAATGCGGTCGATCAATTTATTTTGGGACATTATCCAATTAATATACTTGTGCTTTTCTTGAATTTTCTAAGAGTTGGTTCCGTTCTGTTTTCTTATAGCATTTCACTATTTATTTGATTTTCCCTGGTGTTCTCAAGTGCACACACAGAGGAAAGTATCGTTTCCTGAAATTTAGTGTTGGGTTGCAGGGCTTGTTACCATTAGAAATTTTTGATTGCCTTTACAGTCAGTTTTTATTTACACACCTGTGTGAGATTAGGTTTTAATTTTTGTACTTCTGCAGTATCTCTTCTTCTGTGATATAATTCTATCTCTTTCTTGTTTCAGACAACGGTGACGATAATTTGGTAATGGGGCTGTGCATTGATCGAGTTTCTCTCCCTGGAAAGGTGGAAGTCCAAGTTGGAAATGAAGAGATTAGAGAAGTCTCGCCATATTGCACTCTCTTGTGTCTTACTCTAGAGGGAAAACTCATTCTGTTTCATTTTTCTAGGTACTGCCATATCTGTTTTGAGACCTTGCTTGTTAGTGTTACCCAAGACCAAAGTCTATTTACCCCCTTGTTAATAGCAGCCTAAATATTATGATTATAGCTCCTGATTTTCTTTTACTATGTTTTTTTTTGAACTTTTCATGCAGTGCTAATGAATCTGAAGCTTCAGATGAGACTGTTTCTGCTTGTGATGAGGAAGAGGAAGACGATACTGTAGTGCCTACTGATGATCAGCCTCAGCTCTTTTCTAATATTGATCAGCGTCCAGTATCTAAAGTAGATGAGAGTCCAGTTATTACCAGAGAGAGTAATGCTAAAAGCCAGCAAATGGATTCTTTTGCTTTTTCACAACCATTGAAGCCTTCTACCTTGGAGAGACCCAACAACGAGATTGGGAATTTCGCTAAGCCTGTTAAAAGTTTTACTGGTCTTGGATCTGTTGCTTTTTCGGGGAAATCTGTGGACGTGCCTAGCCAATCATTGAAGTCTTCTATCTTGGAGAGACCCAACAATGAGATTGGGAATTTTGATATGCCTTTTGATAAATTTACTGGTCTCGGATCTGTTGCTTTTTCGGGGCAATCTGTGGACATGCCTAGCCAATCATTAAAGCCTTCTTTCTTGGAGAGACCCAACAATCAGATTGGGAATTTTGATAAGCCTGTTCAGAAATTTACTGGCCTTGGGTCTGTTGCTTTTTCGGAGCAATCTGCGAATGTTCCTAGCCATCCCTTTCTCAATGTTAAAGAATCAACGATAAAGCAAAGTTCGGGTGCTGCAAATGCTTTCACAGGTTTTGCTGGAAAGCCTTTTCAACCGAAGGATGTTCCAAGTACATTAACACAAAGTGGGAGACAAGTAAGTGCAGGTGCTGGTAAAATTGAATCCTTACCAGTGATACAGAGCTCGCAAGTATCTTTGCAAGACAACTTCTCGTTGGGTAAAATTTCTAACAAGAAGCAAGATGGTTCAGAGCGAAATTACGGCAACGTCCCCTTGGCAAAACCAGTAAGTGAAGCTATGTAGGAAGGATAATATTTTTCTATATCTCAATGAGTAGAGAAGAATCATATTTATACAAAGAGACACCCAATCAAATAAGCAAAAAATATGCTGTCAATCTAAACCCATTAGGAAAAGAAAAATACAGTTATAAACATATACAAGGTAACAATCCGAAGTATAAAGCCCTAATTTACAGCTTGATTTTCTAGAGTAAGGTCTAAATGAAATTTATTCAAATAACTTCTCAATATGTTCTATTGTAGGAATGACTTGGTTTAAAGATCTGAATATTATTATTATTATAATTGATACATCCATTATGACTTGGTCTGCTATTCTTTTGCTTGCTTATACTCCAATCCATCTTCATCGGGAAAAGAAATTTATAAAAGAAAAATACACCTCCACTCCATTTCTAATAGCATAAAAGGAAAAGGGGATAGAATACTTGCATGAGCATTCTTGTGCCTATATTTTCCTCTTAGGAATTATTTTTTTTTTCGTTTCCCCCCATTTAGGAACAGCTTGAACTAGAAGATGAGATTTCCTCTGTACATTTTTATGCTTGGGCTGAATAATGGTCCCTGTATGTGGTTAACAAAATAACGAGGCACATGTCCCTTCATTTATGGTTATATTGTATCTTTACCATATAATTAGTCGATTAACATCTCACTGTTTTGTTTATTGGTGTGTAGATAGTGGTTGTGACGTATTTTCTGACAGGTGTCTTCTCCCTTTGCCAGTGTGTGGGCAACTAAAAAGTAGCCTACCTCCAAGCTATAGCCCCCTAGCCAACTTTAATATTATTTCTTTTCTTCAGATTCTCTTGGTGGGGGGTGGAGAGTTTAGGAACACATGAAGGTTTTCATTTCTAGGGTAAGCTTTGTCGTTTGAAAGAGTATCTGGAAACATGGAATCATGAGGTTTTTGAGGACATGAAGATCAAGAAACAGGAATGTGTTGAATATAATTGTTGTGTTAGATGGATTGGAAGTGGTGGGTTCCTTAAGCGATGATCAAAAGGGGAAAGGTTGTCTCTAAAAGTGAAGTTTGAAGAGGTGCTTAAAAGTGAGACTATTAGTTGGACGCAACAGGCTGAGATTAAATGGGCTAGGAGAAGGATTAGAAATATTATTGGTTCACGGTTAGCAAAGATGGGAGAACCTTGTGAGAAGATAAAGAGATTTAAGAGGAGATTGTCTTTCTTTTTTACCCCATCTGTGGACCGGGCATTTCACCTACTAGGTCTTTCCTTGAAAGTTTCGATTGGTCCCCCATTTAGCTGGTGATAGGGCTGATTTGGATAACTGATTTGGATAACACTTTCTCTTGAGGAGATTAGAAGAGTGGTCTTTGGTTTTGACAAAGATAAAGCCCTTGCTGGTGGTCTTGCCTTGGCCATTTTCTTGGATTATTAGGTTTGTATTAAAGATCTTATGTGGAAAGTTTTTGTGTAACGACTTGATTTTCAATATCTCGAATCATAGGTCACCACGTACAGTCAAGGGTAAAAAGGGCGGTAAAATCTTCTTTTGTAAAACAGGAGACACAAGGAATTTTAAATTTAAATATACAGACAAAGCCGACATAAAAGTAGTTTAAAGGTAATATCCGCGGAGTCAATTCAAAACGGTTTACAAAAATACATATGTTTCGAGAAGTAGTATTTAAAATGATAATAAAATGACAAGAAGGAAGACGACTCGATCTAAAGGGCACTCCTTGTGGCTGCATGGCTCTGCACGCTCCTGTCATTAGTAGTGGTCTACACTCAATCTGAAAAATAAAGAATAATAGGGATGAGTATAAAAATACTCGGTAAGAAACCTACTTGTAGGCTCTTATCAGACTTAGTTCTATACACGTAAACTTAGGCTATGTGCTCAGAACTATCTCCAGCAATAGCCAAACTGTGGCTTAACTTTTGTACCAAATTTTGTAGGTCATTGTGTAACTAACCCTTAGCATACCGTGGCTTAACGTTCGTACCAAATTTTGTATGTCATTGTGTAACTAACCCTTAGCATACCTCCTTTAAGGAGTACACAATCCCTCTGAGGTTCACAACCTAGAGGTTTTCTATTTTCTCTTGGCTCTAGGTTTCACGTATGTTTAGGACCTTCACCATTCTTGCACAAAGTGTGCCTCGAGTCGTTGTGGCCCTTGGAAAGGCCCTAGGATAATACTCGTAAGTCTGGAGGAGCCCTAGTTACTCGAAATGGTGACCGTTCTCCATCCATATCTACATATTGAACTCTAATCAGTGGATTCTCCCTAGTGGTGCCTGACAACATCTACCCATTCCTAATATCAGTAAACTACCTGTCACTATGCCTTTCCAGGCTGGAATAGTAGACTACTTACTCATGTACCCTCATCGGGTTAGAACAGTAAAACTACCTGTCTTAAGAACAATAATTACCTACTACTTTGCCTTTTCAAGCTAGAATAGTGAAACAACCTGTCACTACGCCACTCAGGCTAGAACAGTAAAAAGGCCATGGCTCTCCCCACCAAGCACCATCGATAGTACTTCAGTAGCCCTTTATGACTCTTGAATTCCTCTATGGATGACAACTAAGCCATGTGTAACAAATCAACCATAACCTCTGTTGGTTAGTTCATGAATAGGGGTTGCGCCCTATCTGTCCCTACAAGGTACCTGGTCAAACCAAGGAGGCTCTGGAACTCATGACTAGGTACTAGGACCAAAATACGCGAAACGAGAGCTCATAGTCTATGGAAACCATAACTAGCGCTATGACACATAACTATAGTCAAACTCATAATTAGACAATAATAAGCAATCCAAACCCTAAATCATGCATTATAAACGTATAAGGCATACAACATGCTCATCAAGCTCATTAAGCGATAATAATCCAATCTTAGCATACTTGAAAGCATAAGAGCCTAATGATCTATCAGAGTTTACAAATCATGCGTGTAGAAGCTATATAATTGAAAATAGGACTAAGTTGTAAACTGTTCATCTAAGCACAATTATCATGAGACTAGGCCAAACTAGGCTTCTAACATAATACTCAATCATTGCCCTCAATCCTTATTGACTTATGCAAGTATTTCTTAGAAACTTGCTCCATATGGTTACTTACATGATTGTGAACTTCCCGAGCTTGTCGGGTCTTCGAGGATGTTCCAAATTTCTCCAAAATTCCTCAATATGTCCTAAATTAAGTCAAAAGAATAAAACTAGTGAGAATGACTTGCTTCAAATTCGTTAAAGAGAAGGTTGAGAAACTAACCGTTAGACTATACGGTCATCTTCTCCAACTTGCGTGCGTACCGTTCTGCTTTCCCCTTTTTTTTTTTTACTTTTTTTAAGAATCTGTTATTTATTTTCTTAAAATTCCAGGTGTTACATTTTGTGAGTTTTATGAAAGAGGAATTTTGAACAACTCTATTGAAACTTTCGCTATGTGGTTCTTAAGAAGGAAAAAGTGATTAGGGTTAAGGATTTTAGGCCAATTAGTTTAACTTCTAGTGTTGTATAAAATCATACCAGATTAATTAAGGAAAGTTCCCCGAAGAAAATTTCATATTTTAGGAGGCCTTCATTTCTTGAAAACAAATCTTAGAACAGGCTCTTAGTGTCAACGAGGCCATTGAGAATTATTGAAGTCAAAAGCGAGAAAGTTGTAGATGCGTCATACTCCAAGATGGTTAGATTATGCTCAATTAGAGTTACTCAATTTACATCTTAACTCTATATTCCATGTGTATGACATAGATTATAACTTTTTTTGCAGCCAAGTACTGTTTTTCGTGAAAAGAGTCGACGACTATTGTTTGCTTGTTTTTGTGCTGTTGTGTGAGATTTTGGAGGGAAAGACACAATAGGATTTTTAGATTGAAAGATCGGAGAAGGAGGTGGTGATCCTTCATGAGACATCATACCTCTATGGGTTTTCGTGTAAAATTTTTTGTAATTATTCTTAGGCCTTATTTTACTTGATTTGATCCCCATCTTGAAGTAGTAGAACTCTTATGGAGTATTACTTTTGTCTTCCCCTTGTATCCTCTCATGTTTTCTGTAAGCTTGATTTTCTATATATTTGTGTGTATATATATTTTGTTTCTTTCAGATTTTTTTCTGGCTTCTTCAATGCTTCCTATTCTCTACAGATGACTGAAATGTGCGAAGGGCTGGACATGCTTCTAGAATCTATAGAAGAGCCGGGTGGTTTTTTGGATGCCTGCACTACTTTCCAGAAAAGCTCCGTTGAAGCTTTGGAGCTTGGCTTAGCGACTCTTTCAGATCAATGTCAAATATGGAGGGTAATTATCAATTATTATTTTGTGTTCTTTAGTTTGTTAATTATTTTTCCTGTACTTTGGCTGACTGAGAGTATGGTTATTAATTTGTATAGCCTCACTATTTATTGCTAAAAGAAATGTACGGTATGAGAAAAGATTGCTAGTATTCCCTGTCCTATGGCCAATGCCAGTACTGGATATCAACCTTGAAGAAACTAATTATGTTCGACTTGAATCATTTGAGAAAGAACTATTTAACCTATTCAAGGCTTATTTCTTATGTAATTTTACTATGTTTTGTTCTTTTTTTCTCTCACTTTTGGTGTTTGTATCTTTTGAGCTTTATTCTCGTTTCATATCTTCAATGAAACATTTCGTTTTGTATTTCCAAAATAAATAAAGTATAATAGGCAGTTGATTTGCAAATTACTTGAGCTTAGAGAATATCAGAAGTAACCCAGGTTTCAAGTGTTCTCTTGGAGTGTGACTCAGAGAAGACTTACTACTATGACTAGTTGCATGCATTTCTTTATTGATCGTTTCCTCGATTTGCTGTTTATATGACTTTTAATGGGGATTCTGTAGATATCTAACGTTGATTATTCTTCTGGTTAGTATGTTTGGAATAACTTGTTGTTTTGAATTTAATTTTTCTTGCTTTGATGACATTAGTTGGTGAATTGGATGTAAAAAGTAATAGAATTTTTTTAGAACTAGAATAAAAAGAAGGTTTATGTTGATGAATGCCCTTTTGATTTTCTCATATCAAGTTTAGAAAATATTCCTCTATTGTTTAGTTCTTTCGATTTGAAATACTTGCTTTAATTTTGGTTTCTTTTTTTTTGAGCCCCTTTTTCCTTTTCTATTTCTCCCACAATCCTTTTCCTTGTTATCAATACTGTTATCTTACACGTGAATTTCATTTTTCCTTCTCTTTGCAGCGCACAATGACTGAGCGTGCACAGGAGGTACAAAATCTCTTTGACAGAACGGTTGAAGGTATTTAAACCTCAGCTTCTTGTTCGTTTATACTACATGATTGATGGCGACGTTGTTGGCATAGAAGATTCCATGTTGATGTAGCTAATATTGGAAAAGTGCATTTGTTTAGTATGCATCGAATATTTGAATGTCACAAAGAAGTAGAATAAGAAAGTTCCATGTATATCCCTTCCAAACGTTGAGGCTAATCTCATTAGTTATCACGGGAATTTGATTGACGATACGGGGATCCTCCCGTCATCACGTTCACTTGGTTGAATATCAACATGTAGGTTATAATCAATTAATTTCTGGTCATAATTTCATATGTATTTTACATTTTTAGTTTATTTGGTTTCAGTTTTGTCAAAGAAAACATACATTGAAGGTATTGTTACGCAAGCTTCTGACAGCAACTATTGGGAACATTGGGATCGCCAAAAGTTAAGTTCTGAATTAGAGCTAAAACGACAGCGCATTTTGCAGATGAATCAGGTAGGTTATCGATTGGTTCGAAAACAAATGAATTCCCTCAATTGGAGATGTGATGTTTCCATTTGGTTCTTCTCTTTTCCAGAATATGACTAACCAGTTAATTGAGTTAGAGAGACATTTTAATGGCCTTGAGCTGAATAAGTTCGGTGGAAATGATGAAATTCAAGTTAACGAAAGAGCTCTTCAAAGGAAATTTGGATCTTCGAGGTACGTCGAATTTTATTCTGTGAGAAATTTATTAGCTTTTTTGCCGTTTGCTTTTCAGCTGGTCGTTTTGAGTAGCCCGACACACGTTTCTTCCTCAAAATTTGTTTCAGGCAAAGTCATTCCTTACATAGTTTGAATAACATAATGGGATCTCAATTAGCAGCAGCTCAACTTCTTTCTGATAATCTATCAAAACAAATTGCTACACTCAATATTGAATCGCCCTCTTCGAAAAGGCAGAGTATCACGAAGGAATTGTTCGAGACTATTGGAATTACTTATGATGCTTCTTTCAGTTCCCCAAATGTGAATAAAATTCCAGAAACTTCTAGTAAGAAGCTTTTACTTTCTGCTGATTCTTTTTCAAGTAAAGATACATCGAGAAGGAAACAGCGGAGTGGAGCGAAAATTTCTGAAACGGAAACCGGGAGAAGGAGAAGAGACTCACTTGACAGGGTACTGTGTTTCCTTCAATCCCTATTTTCATTTTTTTTTACCTGTGTTTCATATGGAACTTTGCTGTCAATTTAAAATAAGATCTATATTTTTTATTAGTAAAATAAACTTGGATTAAGTGCCATACATGGAAGATGGTTCAGAACTTGGCAAGTTGTTCTCTGACCACCTTTCTTCATAATGCCCATTCATGGATTTCTTGTATGGAATGTTGTTGGAACAATGAGGATGCATATGGCCTGACACATGCCTCTCCCGCCTCTCCCTACAATACATTCTGAGACATTTTAGGTATAGTAGGAGTAGACATGATTTTAGCGAACGGCCATATGATCTTGTGGAGCATTCGTAAATTTGCAAACTTTCATTTTTTATATAGATGAACATAACTAGAGTTGATTGTTAAAACTCACTCCGGAACCCAGACTTTTGAGGTTTATGACTTGGGGCTCTAGTGGCCTTTTTCGAGCCCGTATCTGACATTCATGGCAACAATGTTGCTTCTAATCTTGCTTATGCCTCAGCTTGGCTTTCATGCACACTCTTTTTTGTGTGTTGGATGATGCACCATCCTCCTCGGTCATATCATGACACTTGCTACCTCTGCCCTCACGTGGTCGTATGGGCGCCACTCGCTTTCTGCCTATTTGCACCTGGAGGTCACAATTAGTTGTGATATTCTCCCCCACTTATACTAGTCATCGTTCTCGATGACTTTCTCGATGACATAATCGCATGCGGATTGGAACAAACTCCACTCATGCGTTCAAGAATGCACCCTTCCAACTAAGTGTTTTATGTATATTTCTATCTTCATCCACCCAAGTTGATTACCCATCTGTCCGAATTTAACATGCTTCCATCTACCTGGATCTGAATTTAACATGAATCCCCTGTAAAATTGGTGCAGAACCTGGCTAGCGTTCAACCTCCGAAAACCACCGTTAAGCGGATGATCTTGCAAGGAACACCATTGTCCAATGAGAAACAATTTTGTTCTCCCACTCTTGAAGGACCAGCAACCGTTGCTCGTCCAGCTAGTCGCATACCATCGTCTATGCTATCATCATCATCTAAAAATGCAGGTATGATCCCTAACATGAAGCATAATCAGAAGGTTTCTATAGTCATTTGTTTACTGATTCTCTGTTATAAAGTATTTTTCTTGTTCTTGAAGTTTTGAATGTTTTGGCTTATATTTATATTCGATATCCAGAACAAGGCTCCGAGAACCCCGCAACGCCTTTCTCATGGGCTAGCTCTCCTAGACAGAAATTCCAACCACTGCAAAAAACTAATGGTACAGCACCATCTCCTCTGCCAGTATTCCAATCATCTCATGAAATGGTGAAAAAAAGTAATAGTGAAGCGTACAGTGCGGCTTCAGAAAACAAATTTGCAGAGGTCACTTATCCTGAGAAGTCAAAAGCTTCTGATTTCTTCTCACTCGCTAGAAGCGACTCGGTCCAGAAATCTAATATGAACTTCGAGCAGAAATCATCTATCTTCGTAACATCATCTAAACCGATGTCCACACCGAAAGATTCCATTGAAACCTTGAATCCGAACAGTCAGAAAACTGCTAACGTAAAGGAGAGGCTTACAACTCCAAGTCCACTTTTTGGATCTGCAAATAAGCCTGAACCTGTATCTGTTGGTACGACATCTTCTTTGGTTCCGACCGTTGATGTACTGAGAAAGACTGAAGAAAAAAAACCGCCGACCGTGTTTTCACCATCAGTTCCAGCACCAGCACCTGTAAATACTCCTCCAAGTGCGTCGACATTTTTGGGATCTCCGCTAAGCAAATCATTTCCAAGTCCTGCTGCTGTTGTAGATCTCAATAAACCTCTGTCAACATCAACCCAATCGAGCTTCGCCTCTCCGGTTGTTTCTGTTTCTGATTCCCTATTTCAGGCACCTAAGATGGTATCACCACCATCTAATCTATCTTCCTTGAATCCTACATTGGTGTCCTCGAGTAAAGAACAACCGATGCCGAAATCAGATGCTGATACTGAAAAGCAAGCACCGGCTTCAAAGCCCGAGTCCCGTGAACTGAAGCTTCAACCTTCTGTAACACTTGCTGTTGGAAATCATGTAGAGCCAACTTCTGTAACCCAGACGGTTTCCAAAGATGTGGGAGGACATGTTCCATTTGTAGTAGCGGATGCTCAACCACAACAGTCATCTGCTGCTTTTGTTCCATTACCTACACCAAACTCGACTTCTAAGGCTGCTGCAAATGGTAAAAGTGAAACTTCAGATGCTTTGATTACTCAGGATGACGATATGGACGAGGAGGCCCCAGAGACGAATAACGTCGAGTTTAGTTTGAGCAGCTTGGGAGGATTTGGAACTACCTCTACGCCTATGTCGAATGCTCCTAAACCAAATCCATTTGGTGGTTCGTTTGGCAATGTGAATGCAACCTCAATGAACTCTTCCTTTACTATGGCTTCTCCTCCAAGTGGAGAGTTGTTTCGGCCTGCATCGTTTAGCTTCCAATCTCCGCTGGCTTCGCAAGCAGCATCACAACCGACGAATTCAGTTGCATTCTCTGGTAGCTTTGGCTCTGGAATGGCTACTCAAGCTTCCGCTCAAGGCGGGTTTGGTCAGCCTGCTCAGATTGGAGTAGGGCAGCAAGCACTGGGTACTGTTCTTGGTTCATTTGGACAATCAAGACAGCTTGGTCCTAGTCTACCTGGAACTGCTTCAGGATCCCCTGGCGGTTTTAATGGTGGTGGCTTTACTAGTGTGAAACCTGTTGGTGGTGGTTTTGCCGGTGTTGGTTCAGGTGGTGGCGGTGGTTTTGGTGGTGGTGGTTTTGCTGGTGCAGCCTCTACCGGTGGAGGATTTGCTGGTGCTTCTCCCCCAACGGGAGGTTTTGCAGGTGCTACCGGTGGAGGTTTTGCAGGTGCTGCAGGCGGAGGTTTTGCAGGTGCTGCAGGCGGAGGTTTTGCAGGTGCTGCAGGCGGAGGTTTTGCCGGTGCTGCAGGTGGATTCGGGGCGTTCGGCAACCAGCAAGGAAGCGGCGGGTTCTCGGCTTTTGGCGCTGCTCCGGGTGGATCAGGAGGAACTGGAAAACCTCCTGAACTTTTCACCCAGATTAGAAAGTAG
mRNA sequence
ATGGCGTCCGTTGATTCGCGGCATTCCATTTCTTTAACTCCTATTATATTGGAAGACTCTTACGAAGGGGAGCATGTTGAAACCAACGATTACTACTTCGAAAAGATCGGCGAACCTGTTCCTGTCAAGCTCAATGACTCCATTTTTGATCCTGGAAGTCCTCCTTCCCAGCCTCTTGCTGTGTCTGAGAGTTTTGGTCTCATATTCGTTGCCCATTCGTCTGGGTTTTTTGTGGTGAGGACCAAGGATGTAGTTGCTTCAGCTAAGGAGATGAAAAACGGGGGAACTGGTTCTTCAATCCAGGATTTGAGTATTGTGGATGTTTCCGTCGGGAAAGTTCATGTTCTTGCACTTTCCAATGATAATTCTTTTCTTGCTGCCGTCGTAGCTGGTGATGTTCGTCTTTTTTCAGTTGACTCGCTGCTTGATAAGGCTGAAAAACCCTATTTCTCTTGTTCAACAACTGATTCCAGTTGCATCAAAGACTTCAAATGGACCAGAAAGCCGGAAAATTCTTATCTGGTTCTTTCAAAACATGGAAAGTTATACCAAGGATCGGCTAGTGGTCCTTTTAAACATATCATGCACGATATTGATGCTGTTGAATGCAGTGTGAAAGGAAAATTCATTGCTGTGGCTAAAAAGGACACTCTTTCCATTTTCTCATATAAATTCAAGGAACGACTGTCCATGTCACTCTTGCCAAGTTCAGGGAATGGTGACACTGATACGGACTTTGCAGTGAAAGTTGACTCTATCAAGTGGGTTCGTGCTGATTGTATCATTATAGGATGCTTTCAAGTGGCTGCAACAGGCGATGAAGAAGATTACTTTGTCCAAGTTATCAGAAGTAAAGATGGAAAAATTACTGACGTTTCCTCAAATAAAGTCTTGTTATCGTTCCATGATATATATTCAGGTTTCACTCCAGACATTTTGCCTGTTGAAACTGGGCCTTGTTTATTATTGAGTTATTTAGATAAATGCAAGCTCGCAATTGTTGCCAATAGGAACAATACAGATCAGCATATTGTGTTGCTTGGTTGGTTGCAAGAGGTTGAGAATGAAGTTGCTGTTATTGATATTGAAAGAGATAAGTCACTCCCGAGGATTGAGCTTCAAGACAACGGTGACGATAATTTGGTAATGGGGCTGTGCATTGATCGAGTTTCTCTCCCTGGAAAGGTGGAAGTCCAAGTTGGAAATGAAGAGATTAGAGAAGTCTCGCCATATTGCACTCTCTTGTGTCTTACTCTAGAGGGAAAACTCATTCTGTTTCATTTTTCTAGTGCTAATGAATCTGAAGCTTCAGATGAGACTGTTTCTGCTTGTGATGAGGAAGAGGAAGACGATACTGTAGTGCCTACTGATGATCAGCCTCAGCTCTTTTCTAATATTGATCAGCGTCCAGTATCTAAAGTAGATGAGAGTCCAGTTATTACCAGAGAGAGTAATGCTAAAAGCCAGCAAATGGATTCTTTTGCTTTTTCACAACCATTGAAGCCTTCTACCTTGGAGAGACCCAACAACGAGATTGGGAATTTCGCTAAGCCTGTTAAAAGTTTTACTGGTCTTGGATCTGTTGCTTTTTCGGGGAAATCTGTGGACGTGCCTAGCCAATCATTGAAGTCTTCTATCTTGGAGAGACCCAACAATGAGATTGGGAATTTTGATATGCCTTTTGATAAATTTACTGGTCTCGGATCTGTTGCTTTTTCGGGGCAATCTGTGGACATGCCTAGCCAATCATTAAAGCCTTCTTTCTTGGAGAGACCCAACAATCAGATTGGGAATTTTGATAAGCCTGTTCAGAAATTTACTGGCCTTGGGTCTGTTGCTTTTTCGGAGCAATCTGCGAATGTTCCTAGCCATCCCTTTCTCAATGTTAAAGAATCAACGATAAAGCAAAGTTCGGGTGCTGCAAATGCTTTCACAGGTTTTGCTGGAAAGCCTTTTCAACCGAAGGATGTTCCAAGTACATTAACACAAAGTGGGAGACAAGTAAGTGCAGGTGCTGGTAAAATTGAATCCTTACCAGTGATACAGAGCTCGCAAGTATCTTTGCAAGACAACTTCTCGTTGGGTAAAATTTCTAACAAGAAGCAAGATGGTTCAGAGCGAAATTACGGCAACGTCCCCTTGGCAAAACCAATGACTGAAATGTGCGAAGGGCTGGACATGCTTCTAGAATCTATAGAAGAGCCGGGTGGTTTTTTGGATGCCTGCACTACTTTCCAGAAAAGCTCCGTTGAAGCTTTGGAGCTTGGCTTAGCGACTCTTTCAGATCAATGTCAAATATGGAGGCGCACAATGACTGAGCGTGCACAGGAGGTACAAAATCTCTTTGACAGAACGGTTGAAGTTTTGTCAAAGAAAACATACATTGAAGGTATTGTTACGCAAGCTTCTGACAGCAACTATTGGGAACATTGGGATCGCCAAAAGTTAAGTTCTGAATTAGAGCTAAAACGACAGCGCATTTTGCAGATGAATCAGAATATGACTAACCAGTTAATTGAGTTAGAGAGACATTTTAATGGCCTTGAGCTGAATAAGTTCGGTGGAAATGATGAAATTCAAGTTAACGAAAGAGCTCTTCAAAGGAAATTTGGATCTTCGAGGCAAAGTCATTCCTTACATAGTTTGAATAACATAATGGGATCTCAATTAGCAGCAGCTCAACTTCTTTCTGATAATCTATCAAAACAAATTGCTACACTCAATATTGAATCGCCCTCTTCGAAAAGGCAGAGTATCACGAAGGAATTGTTCGAGACTATTGGAATTACTTATGATGCTTCTTTCAGTTCCCCAAATGTGAATAAAATTCCAGAAACTTCTAGTAAGAAGCTTTTACTTTCTGCTGATTCTTTTTCAAGTAAAGATACATCGAGAAGGAAACAGCGGAGTGGAGCGAAAATTTCTGAAACGGAAACCGGGAGAAGGAGAAGAGACTCACTTGACAGGAACCTGGCTAGCGTTCAACCTCCGAAAACCACCGTTAAGCGGATGATCTTGCAAGGAACACCATTGTCCAATGAGAAACAATTTTGTTCTCCCACTCTTGAAGGACCAGCAACCGTTGCTCGTCCAGCTAGTCGCATACCATCGTCTATGCTATCATCATCATCTAAAAATGCAGAACAAGGCTCCGAGAACCCCGCAACGCCTTTCTCATGGGCTAGCTCTCCTAGACAGAAATTCCAACCACTGCAAAAAACTAATGGTACAGCACCATCTCCTCTGCCAGTATTCCAATCATCTCATGAAATGGTGAAAAAAAGTAATAGTGAAGCGTACAGTGCGGCTTCAGAAAACAAATTTGCAGAGGTCACTTATCCTGAGAAGTCAAAAGCTTCTGATTTCTTCTCACTCGCTAGAAGCGACTCGGTCCAGAAATCTAATATGAACTTCGAGCAGAAATCATCTATCTTCGTAACATCATCTAAACCGATGTCCACACCGAAAGATTCCATTGAAACCTTGAATCCGAACAGTCAGAAAACTGCTAACGTAAAGGAGAGGCTTACAACTCCAAGTCCACTTTTTGGATCTGCAAATAAGCCTGAACCTGTATCTGTTGGTACGACATCTTCTTTGGTTCCGACCGTTGATGTACTGAGAAAGACTGAAGAAAAAAAACCGCCGACCGTGTTTTCACCATCAGTTCCAGCACCAGCACCTGTAAATACTCCTCCAAGTGCGTCGACATTTTTGGGATCTCCGCTAAGCAAATCATTTCCAAGTCCTGCTGCTGTTGTAGATCTCAATAAACCTCTGTCAACATCAACCCAATCGAGCTTCGCCTCTCCGGTTGTTTCTGTTTCTGATTCCCTATTTCAGGCACCTAAGATGGTATCACCACCATCTAATCTATCTTCCTTGAATCCTACATTGGTGTCCTCGAGTAAAGAACAACCGATGCCGAAATCAGATGCTGATACTGAAAAGCAAGCACCGGCTTCAAAGCCCGAGTCCCGTGAACTGAAGCTTCAACCTTCTGTAACACTTGCTGTTGGAAATCATGTAGAGCCAACTTCTGTAACCCAGACGGTTTCCAAAGATGTGGGAGGACATGTTCCATTTGTAGTAGCGGATGCTCAACCACAACAGTCATCTGCTGCTTTTGTTCCATTACCTACACCAAACTCGACTTCTAAGGCTGCTGCAAATGGTAAAAGTGAAACTTCAGATGCTTTGATTACTCAGGATGACGATATGGACGAGGAGGCCCCAGAGACGAATAACGTCGAGTTTAGTTTGAGCAGCTTGGGAGGATTTGGAACTACCTCTACGCCTATGTCGAATGCTCCTAAACCAAATCCATTTGGTGGTTCGTTTGGCAATGTGAATGCAACCTCAATGAACTCTTCCTTTACTATGGCTTCTCCTCCAAGTGGAGAGTTGTTTCGGCCTGCATCGTTTAGCTTCCAATCTCCGCTGGCTTCGCAAGCAGCATCACAACCGACGAATTCAGTTGCATTCTCTGGTAGCTTTGGCTCTGGAATGGCTACTCAAGCTTCCGCTCAAGGCGGGTTTGGTCAGCCTGCTCAGATTGGAGTAGGGCAGCAAGCACTGGGTACTGTTCTTGGTTCATTTGGACAATCAAGACAGCTTGGTCCTAGTCTACCTGGAACTGCTTCAGGATCCCCTGGCGGTTTTAATGGTGGTGGCTTTACTAGTGTGAAACCTGTTGGTGGTGGTTTTGCCGGTGTTGGTTCAGGTGGTGGCGGTGGTTTTGGTGGTGGTGGTTTTGCTGGTGCAGCCTCTACCGGTGGAGGATTTGCTGGTGCTTCTCCCCCAACGGGAGGTTTTGCAGGTGCTACCGGTGGAGGTTTTGCAGGTGCTGCAGGCGGAGGTTTTGCAGGTGCTGCAGGCGGAGGTTTTGCAGGTGCTGCAGGCGGAGGTTTTGCCGGTGCTGCAGGTGGATTCGGGGCGTTCGGCAACCAGCAAGGAAGCGGCGGGTTCTCGGCTTTTGGCGCTGCTCCGGGTGGATCAGGAGGAACTGGAAAACCTCCTGAACTTTTCACCCAGATTAGAAAGTAG
Coding sequence (CDS)
ATGGCGTCCGTTGATTCGCGGCATTCCATTTCTTTAACTCCTATTATATTGGAAGACTCTTACGAAGGGGAGCATGTTGAAACCAACGATTACTACTTCGAAAAGATCGGCGAACCTGTTCCTGTCAAGCTCAATGACTCCATTTTTGATCCTGGAAGTCCTCCTTCCCAGCCTCTTGCTGTGTCTGAGAGTTTTGGTCTCATATTCGTTGCCCATTCGTCTGGGTTTTTTGTGGTGAGGACCAAGGATGTAGTTGCTTCAGCTAAGGAGATGAAAAACGGGGGAACTGGTTCTTCAATCCAGGATTTGAGTATTGTGGATGTTTCCGTCGGGAAAGTTCATGTTCTTGCACTTTCCAATGATAATTCTTTTCTTGCTGCCGTCGTAGCTGGTGATGTTCGTCTTTTTTCAGTTGACTCGCTGCTTGATAAGGCTGAAAAACCCTATTTCTCTTGTTCAACAACTGATTCCAGTTGCATCAAAGACTTCAAATGGACCAGAAAGCCGGAAAATTCTTATCTGGTTCTTTCAAAACATGGAAAGTTATACCAAGGATCGGCTAGTGGTCCTTTTAAACATATCATGCACGATATTGATGCTGTTGAATGCAGTGTGAAAGGAAAATTCATTGCTGTGGCTAAAAAGGACACTCTTTCCATTTTCTCATATAAATTCAAGGAACGACTGTCCATGTCACTCTTGCCAAGTTCAGGGAATGGTGACACTGATACGGACTTTGCAGTGAAAGTTGACTCTATCAAGTGGGTTCGTGCTGATTGTATCATTATAGGATGCTTTCAAGTGGCTGCAACAGGCGATGAAGAAGATTACTTTGTCCAAGTTATCAGAAGTAAAGATGGAAAAATTACTGACGTTTCCTCAAATAAAGTCTTGTTATCGTTCCATGATATATATTCAGGTTTCACTCCAGACATTTTGCCTGTTGAAACTGGGCCTTGTTTATTATTGAGTTATTTAGATAAATGCAAGCTCGCAATTGTTGCCAATAGGAACAATACAGATCAGCATATTGTGTTGCTTGGTTGGTTGCAAGAGGTTGAGAATGAAGTTGCTGTTATTGATATTGAAAGAGATAAGTCACTCCCGAGGATTGAGCTTCAAGACAACGGTGACGATAATTTGGTAATGGGGCTGTGCATTGATCGAGTTTCTCTCCCTGGAAAGGTGGAAGTCCAAGTTGGAAATGAAGAGATTAGAGAAGTCTCGCCATATTGCACTCTCTTGTGTCTTACTCTAGAGGGAAAACTCATTCTGTTTCATTTTTCTAGTGCTAATGAATCTGAAGCTTCAGATGAGACTGTTTCTGCTTGTGATGAGGAAGAGGAAGACGATACTGTAGTGCCTACTGATGATCAGCCTCAGCTCTTTTCTAATATTGATCAGCGTCCAGTATCTAAAGTAGATGAGAGTCCAGTTATTACCAGAGAGAGTAATGCTAAAAGCCAGCAAATGGATTCTTTTGCTTTTTCACAACCATTGAAGCCTTCTACCTTGGAGAGACCCAACAACGAGATTGGGAATTTCGCTAAGCCTGTTAAAAGTTTTACTGGTCTTGGATCTGTTGCTTTTTCGGGGAAATCTGTGGACGTGCCTAGCCAATCATTGAAGTCTTCTATCTTGGAGAGACCCAACAATGAGATTGGGAATTTTGATATGCCTTTTGATAAATTTACTGGTCTCGGATCTGTTGCTTTTTCGGGGCAATCTGTGGACATGCCTAGCCAATCATTAAAGCCTTCTTTCTTGGAGAGACCCAACAATCAGATTGGGAATTTTGATAAGCCTGTTCAGAAATTTACTGGCCTTGGGTCTGTTGCTTTTTCGGAGCAATCTGCGAATGTTCCTAGCCATCCCTTTCTCAATGTTAAAGAATCAACGATAAAGCAAAGTTCGGGTGCTGCAAATGCTTTCACAGGTTTTGCTGGAAAGCCTTTTCAACCGAAGGATGTTCCAAGTACATTAACACAAAGTGGGAGACAAGTAAGTGCAGGTGCTGGTAAAATTGAATCCTTACCAGTGATACAGAGCTCGCAAGTATCTTTGCAAGACAACTTCTCGTTGGGTAAAATTTCTAACAAGAAGCAAGATGGTTCAGAGCGAAATTACGGCAACGTCCCCTTGGCAAAACCAATGACTGAAATGTGCGAAGGGCTGGACATGCTTCTAGAATCTATAGAAGAGCCGGGTGGTTTTTTGGATGCCTGCACTACTTTCCAGAAAAGCTCCGTTGAAGCTTTGGAGCTTGGCTTAGCGACTCTTTCAGATCAATGTCAAATATGGAGGCGCACAATGACTGAGCGTGCACAGGAGGTACAAAATCTCTTTGACAGAACGGTTGAAGTTTTGTCAAAGAAAACATACATTGAAGGTATTGTTACGCAAGCTTCTGACAGCAACTATTGGGAACATTGGGATCGCCAAAAGTTAAGTTCTGAATTAGAGCTAAAACGACAGCGCATTTTGCAGATGAATCAGAATATGACTAACCAGTTAATTGAGTTAGAGAGACATTTTAATGGCCTTGAGCTGAATAAGTTCGGTGGAAATGATGAAATTCAAGTTAACGAAAGAGCTCTTCAAAGGAAATTTGGATCTTCGAGGCAAAGTCATTCCTTACATAGTTTGAATAACATAATGGGATCTCAATTAGCAGCAGCTCAACTTCTTTCTGATAATCTATCAAAACAAATTGCTACACTCAATATTGAATCGCCCTCTTCGAAAAGGCAGAGTATCACGAAGGAATTGTTCGAGACTATTGGAATTACTTATGATGCTTCTTTCAGTTCCCCAAATGTGAATAAAATTCCAGAAACTTCTAGTAAGAAGCTTTTACTTTCTGCTGATTCTTTTTCAAGTAAAGATACATCGAGAAGGAAACAGCGGAGTGGAGCGAAAATTTCTGAAACGGAAACCGGGAGAAGGAGAAGAGACTCACTTGACAGGAACCTGGCTAGCGTTCAACCTCCGAAAACCACCGTTAAGCGGATGATCTTGCAAGGAACACCATTGTCCAATGAGAAACAATTTTGTTCTCCCACTCTTGAAGGACCAGCAACCGTTGCTCGTCCAGCTAGTCGCATACCATCGTCTATGCTATCATCATCATCTAAAAATGCAGAACAAGGCTCCGAGAACCCCGCAACGCCTTTCTCATGGGCTAGCTCTCCTAGACAGAAATTCCAACCACTGCAAAAAACTAATGGTACAGCACCATCTCCTCTGCCAGTATTCCAATCATCTCATGAAATGGTGAAAAAAAGTAATAGTGAAGCGTACAGTGCGGCTTCAGAAAACAAATTTGCAGAGGTCACTTATCCTGAGAAGTCAAAAGCTTCTGATTTCTTCTCACTCGCTAGAAGCGACTCGGTCCAGAAATCTAATATGAACTTCGAGCAGAAATCATCTATCTTCGTAACATCATCTAAACCGATGTCCACACCGAAAGATTCCATTGAAACCTTGAATCCGAACAGTCAGAAAACTGCTAACGTAAAGGAGAGGCTTACAACTCCAAGTCCACTTTTTGGATCTGCAAATAAGCCTGAACCTGTATCTGTTGGTACGACATCTTCTTTGGTTCCGACCGTTGATGTACTGAGAAAGACTGAAGAAAAAAAACCGCCGACCGTGTTTTCACCATCAGTTCCAGCACCAGCACCTGTAAATACTCCTCCAAGTGCGTCGACATTTTTGGGATCTCCGCTAAGCAAATCATTTCCAAGTCCTGCTGCTGTTGTAGATCTCAATAAACCTCTGTCAACATCAACCCAATCGAGCTTCGCCTCTCCGGTTGTTTCTGTTTCTGATTCCCTATTTCAGGCACCTAAGATGGTATCACCACCATCTAATCTATCTTCCTTGAATCCTACATTGGTGTCCTCGAGTAAAGAACAACCGATGCCGAAATCAGATGCTGATACTGAAAAGCAAGCACCGGCTTCAAAGCCCGAGTCCCGTGAACTGAAGCTTCAACCTTCTGTAACACTTGCTGTTGGAAATCATGTAGAGCCAACTTCTGTAACCCAGACGGTTTCCAAAGATGTGGGAGGACATGTTCCATTTGTAGTAGCGGATGCTCAACCACAACAGTCATCTGCTGCTTTTGTTCCATTACCTACACCAAACTCGACTTCTAAGGCTGCTGCAAATGGTAAAAGTGAAACTTCAGATGCTTTGATTACTCAGGATGACGATATGGACGAGGAGGCCCCAGAGACGAATAACGTCGAGTTTAGTTTGAGCAGCTTGGGAGGATTTGGAACTACCTCTACGCCTATGTCGAATGCTCCTAAACCAAATCCATTTGGTGGTTCGTTTGGCAATGTGAATGCAACCTCAATGAACTCTTCCTTTACTATGGCTTCTCCTCCAAGTGGAGAGTTGTTTCGGCCTGCATCGTTTAGCTTCCAATCTCCGCTGGCTTCGCAAGCAGCATCACAACCGACGAATTCAGTTGCATTCTCTGGTAGCTTTGGCTCTGGAATGGCTACTCAAGCTTCCGCTCAAGGCGGGTTTGGTCAGCCTGCTCAGATTGGAGTAGGGCAGCAAGCACTGGGTACTGTTCTTGGTTCATTTGGACAATCAAGACAGCTTGGTCCTAGTCTACCTGGAACTGCTTCAGGATCCCCTGGCGGTTTTAATGGTGGTGGCTTTACTAGTGTGAAACCTGTTGGTGGTGGTTTTGCCGGTGTTGGTTCAGGTGGTGGCGGTGGTTTTGGTGGTGGTGGTTTTGCTGGTGCAGCCTCTACCGGTGGAGGATTTGCTGGTGCTTCTCCCCCAACGGGAGGTTTTGCAGGTGCTACCGGTGGAGGTTTTGCAGGTGCTGCAGGCGGAGGTTTTGCAGGTGCTGCAGGCGGAGGTTTTGCAGGTGCTGCAGGCGGAGGTTTTGCCGGTGCTGCAGGTGGATTCGGGGCGTTCGGCAACCAGCAAGGAAGCGGCGGGTTCTCGGCTTTTGGCGCTGCTCCGGGTGGATCAGGAGGAACTGGAAAACCTCCTGAACTTTTCACCCAGATTAGAAAGTAG
Protein sequence
MASVDSRHSISLTPIILEDSYEGEHVETNDYYFEKIGEPVPVKLNDSIFDPGSPPSQPLAVSESFGLIFVAHSSGFFVVRTKDVVASAKEMKNGGTGSSIQDLSIVDVSVGKVHVLALSNDNSFLAAVVAGDVRLFSVDSLLDKAEKPYFSCSTTDSSCIKDFKWTRKPENSYLVLSKHGKLYQGSASGPFKHIMHDIDAVECSVKGKFIAVAKKDTLSIFSYKFKERLSMSLLPSSGNGDTDTDFAVKVDSIKWVRADCIIIGCFQVAATGDEEDYFVQVIRSKDGKITDVSSNKVLLSFHDIYSGFTPDILPVETGPCLLLSYLDKCKLAIVANRNNTDQHIVLLGWLQEVENEVAVIDIERDKSLPRIELQDNGDDNLVMGLCIDRVSLPGKVEVQVGNEEIREVSPYCTLLCLTLEGKLILFHFSSANESEASDETVSACDEEEEDDTVVPTDDQPQLFSNIDQRPVSKVDESPVITRESNAKSQQMDSFAFSQPLKPSTLERPNNEIGNFAKPVKSFTGLGSVAFSGKSVDVPSQSLKSSILERPNNEIGNFDMPFDKFTGLGSVAFSGQSVDMPSQSLKPSFLERPNNQIGNFDKPVQKFTGLGSVAFSEQSANVPSHPFLNVKESTIKQSSGAANAFTGFAGKPFQPKDVPSTLTQSGRQVSAGAGKIESLPVIQSSQVSLQDNFSLGKISNKKQDGSERNYGNVPLAKPMTEMCEGLDMLLESIEEPGGFLDACTTFQKSSVEALELGLATLSDQCQIWRRTMTERAQEVQNLFDRTVEVLSKKTYIEGIVTQASDSNYWEHWDRQKLSSELELKRQRILQMNQNMTNQLIELERHFNGLELNKFGGNDEIQVNERALQRKFGSSRQSHSLHSLNNIMGSQLAAAQLLSDNLSKQIATLNIESPSSKRQSITKELFETIGITYDASFSSPNVNKIPETSSKKLLLSADSFSSKDTSRRKQRSGAKISETETGRRRRDSLDRNLASVQPPKTTVKRMILQGTPLSNEKQFCSPTLEGPATVARPASRIPSSMLSSSSKNAEQGSENPATPFSWASSPRQKFQPLQKTNGTAPSPLPVFQSSHEMVKKSNSEAYSAASENKFAEVTYPEKSKASDFFSLARSDSVQKSNMNFEQKSSIFVTSSKPMSTPKDSIETLNPNSQKTANVKERLTTPSPLFGSANKPEPVSVGTTSSLVPTVDVLRKTEEKKPPTVFSPSVPAPAPVNTPPSASTFLGSPLSKSFPSPAAVVDLNKPLSTSTQSSFASPVVSVSDSLFQAPKMVSPPSNLSSLNPTLVSSSKEQPMPKSDADTEKQAPASKPESRELKLQPSVTLAVGNHVEPTSVTQTVSKDVGGHVPFVVADAQPQQSSAAFVPLPTPNSTSKAAANGKSETSDALITQDDDMDEEAPETNNVEFSLSSLGGFGTTSTPMSNAPKPNPFGGSFGNVNATSMNSSFTMASPPSGELFRPASFSFQSPLASQAASQPTNSVAFSGSFGSGMATQASAQGGFGQPAQIGVGQQALGTVLGSFGQSRQLGPSLPGTASGSPGGFNGGGFTSVKPVGGGFAGVGSGGGGGFGGGGFAGAASTGGGFAGASPPTGGFAGATGGGFAGAAGGGFAGAAGGGFAGAAGGGFAGAAGGFGAFGNQQGSGGFSAFGAAPGGSGGTGKPPELFTQIRK
Homology
BLAST of Csor.00g300930 vs. ExPASy Swiss-Prot
Match:
F4I1T7 (Nuclear pore complex protein NUP214 OS=Arabidopsis thaliana OX=3702 GN=NUP214 PE=1 SV=1)
HSP 1 Score: 869.4 bits (2245), Expect = 6.7e-251
Identity = 730/1859 (39.27%), Postives = 988/1859 (53.15%), Query Frame = 0
Query: 12 LTPIILEDSYEGEHVETNDYYFEKIGEPVPVKLNDSIFDPGSPPSQPLAVSESFGLIFVA 71
++ + +E+ EG+ + TNDYYFE+IGEP+ +K +D+ +D +PPSQPLA+SE ++FVA
Sbjct: 1 MSRVEIEEDTEGDRISTNDYYFERIGEPISIKEDDAQYDLENPPSQPLAISERHAVLFVA 60
Query: 72 HSSGFFVVRTKDVVASAKEMKNGGTGSSIQDLSIVDVSVGKVHVLALSNDNSFLAAVVAG 131
HSSGFFV RT DV++++K G IQDLS+VDV VG V +L+LS D+S LA VA
Sbjct: 61 HSSGFFVGRTNDVISASKNSNGNGDKVFIQDLSLVDVPVGDVRILSLSADDSILAVTVAA 120
Query: 132 DVRLFSVDSLLDKAEKPYFSCSTTDSSCIKDFKWTRKPENSYLVLSKHGKLYQGSASGPF 191
D+ FSVDSLL K KP FS S +S +KDF+W R ++SYLVLS GKL+ G + P
Sbjct: 121 DIHFFSVDSLLKKDAKPSFSYSPDESGFVKDFRWRRNDKHSYLVLSNTGKLFHGIDNAPP 180
Query: 192 KHIMHDIDAVECSVKGKFIAVAKKDTLSIFSYKFKERLSMSLLPSSGNGDTDTDFAVKVD 251
+H+M +DAVE S KG +IAVA+ ++L IFS KF E+ ++L S GD+D D VKVD
Sbjct: 181 RHVMDAVDAVEWSSKGSYIAVAQDNSLRIFSSKFNEKRCIALSFDSWIGDSDEDCFVKVD 240
Query: 252 SIKWVRADCIIIGCFQVAATGDEEDYFVQVIRSKDGKITDVSSNKVLLSFHDIYSGFTPD 311
SI+WVR +CI++GCFQ+ G EE+Y VQVIRS DGKI+D S+N V LSF D++ D
Sbjct: 241 SIRWVRNNCILLGCFQL-IEGREENYLVQVIRSPDGKISDGSTNLVALSFSDLFPCSMDD 300
Query: 312 ILPVETGPCLLLSYLDKCKLAIVANRNNTDQHIVLLGWLQ-EVENEVAVIDIERDKSLPR 371
++PV GP LL SY+D+CKLA+ ANR + D+HIVLL W + ++ V+V+DI+R+ LPR
Sbjct: 301 LVPVGVGPHLLFSYIDQCKLAVTANRKSIDEHIVLLDWSSGDDKSAVSVVDIDRETFLPR 360
Query: 372 IELQDNGDDNLVMGLCIDRVSLPGKVEVQVGNEEIREVSPYCTLLCLTLEGKLILFHFSS 431
I LQ+N DDN VMGLCIDRVS+ G V V+ G++E++E+ PY L+CLTLEGKL++F+ +S
Sbjct: 361 IGLQENNDDNTVMGLCIDRVSIEGTVNVRSGDDELKELQPYFVLVCLTLEGKLVMFNVAS 420
Query: 432 ANESEASDETVSACDEEEEDDTVVPTDDQPQLFSNIDQRPVSKVDESPVITRESNAKSQQ 491
AS +T A + ED +D S+ + ++ I +++ K
Sbjct: 421 VAGRPASSDTDLASSSDIEDAYTPLIEDDLSKQSSEKHQQLN-------IAVQNDQKHLN 480
Query: 492 MDSFAFSQPLKPSTLERPNNEIGNFAKPVKSFTGLGSVAFSGKSVDVPSQSLKSSILERP 551
+ F+ Q L + + E + V ++ K + V + +S I
Sbjct: 481 TEKFSTEQRLPNENIF--SKEFESVKSSVSGDNNKKQEPYAEKPLQV-EDAQQSMIPRLS 540
Query: 552 NNEIGNFDMPF----DKFTGLGSVA---------FSGQSVDMPSQSLKPS-----FLERP 611
G M +KF G G QS M Q+ S F P
Sbjct: 541 GTSFGQLPMSLGYDTNKFAGFGPALPVSEKLQKDIFAQSNSMHLQANVESKSTAAFFGSP 600
Query: 612 NNQIGNFDKPVQKFTGLGSVAFSEQSANVPSHPFLNVKESTIKQSSGAANAFTGFAGKPF 671
Q P + S S + S PF +++++ KQS + TG+ P
Sbjct: 601 GLQNAILQSPQNTSSQPWSSGKSVSPPDFVSGPFPSMRDTQHKQS---VQSGTGYVNPPM 660
Query: 672 QPKDVPSTLTQSGR---------------QVSAGAGKIESLPVIQSSQVSLQDNFSLGKI 731
KD + ++GR + G KIE +P I++SQ+S Q S K
Sbjct: 661 SIKDKSVQVIETGRVSALSNLSPLLGQNQDTNEGVEKIEPIPSIRASQLSQQVKSSFEKS 720
Query: 732 SNKKQDGS---------ERNYGNVPLAKPMTEMCEGLDMLLESIEEPGGFLDACTTFQKS 791
++ +Q + E N N P + EM +D LL+SIE PGGF D+C KS
Sbjct: 721 ASHQQHKTPLSTGPLRLEHNMSNQP--SNINEMAREMDTLLQSIEGPGGFKDSCAFILKS 780
Query: 792 SVEALELGLATLSDQCQIWRRTMTERAQEVQNLFDRTVEVLSKKTYIEGIVTQASDSNYW 851
+VE LE GL +L+ +CQ W+ T+ E+ E+Q+L D+T++VL+KKTY+EG+ Q +D+ YW
Sbjct: 781 NVEELEQGLESLAGKCQTWKSTIHEQQAEIQHLLDKTIQVLAKKTYMEGMYKQTADNQYW 840
Query: 852 EHWDRQKLSSELELKRQRILQMNQNMTNQLIELERHFNGLELNKFGGNDEIQVNERALQR 911
+ W+RQKL+ ELE KRQ I+++N+++T+QLIELER+FN LEL+++ + V R +
Sbjct: 841 QLWNRQKLNPELEAKRQHIMKLNKDLTHQLIELERYFNRLELDRYNEDGGHPVARRGVPN 900
Query: 912 KFGSSRQSHSLHSLNNIMGSQLAAAQLLSDNLSKQIATLNIESPSSKRQSITKELFETIG 971
+ SR+ SLHSL+N M SQLAAA+ LS+ LSKQ+ L I+SP K ++ +ELFETIG
Sbjct: 901 RSAPSRRVQSLHSLHNTMSSQLAAAEQLSECLSKQMTYLKIDSPVKK--NVKQELFETIG 960
Query: 972 ITYDASFSSPNVNKIPETSS-KKLLLSADSFSSKDTSRRKQRSGAKISETETGRRRRDSL 1031
I YDASFSSP+ K SS K LLLS+ S SR++Q S K S+ ET RRRR+SL
Sbjct: 961 IPYDASFSSPDAVKAKNASSAKNLLLSSIPASINQQSRQRQSSAMKNSDPETARRRRESL 1020
Query: 1032 DR---NLASVQPPKTTVKRMIL---------QGTPLSNEKQFCSPTLEGPAT-VARPASR 1091
DR N A+ +PPKTTVKRM+L Q T LS + + T + V AS
Sbjct: 1021 DRVIFNWAAFEPPKTTVKRMLLQEQQKTGMNQQTVLSERLRSANNTQDRSLLHVKDHASP 1080
Query: 1092 IPSSMLSSSSKNAEQGSENPATPFSW-----------------ASSPRQKFQPLQKTNGT 1151
+ SS + SE +TPF AS P + + +N T
Sbjct: 1081 VVSSNKGIMESFQQDTSEAQSTPFKTRPPMPQSNSPFTISPISASKPSFNWSGNKSSNTT 1140
Query: 1152 ------APSPL------------------PVFQSSHEMVKKSNSE-AYSAASENKFAEVT 1211
APS + PV + E +K E +S A N F E
Sbjct: 1141 SYAEESAPSQIKDTRTVSQPGGSSFLPKRPVASTVLEQTEKKAGEFKFSEAKANAFVETA 1200
Query: 1212 ------YPEKSKASDFFS--------------LARSDSVQKSNMNFEQKSSI------FV 1271
S SDF S S KS F SSI F
Sbjct: 1201 AGSVQRLSTTSSGSDFESSKGFGAQFSTMSSGAPASSFSSKSLFGFNSSSSIPGDKFTFP 1260
Query: 1272 TSSKPMS-TPKDSIETLNPNSQKTANVKERLTTPSPL-FGSANKPEPVSVGTTSSL---- 1331
+ P+S TP DS TL S + + P+ + SA P+ SV +TS++
Sbjct: 1261 AVTAPLSGTPLDSTSTLFTASSAPVSSSSQDPVPASIPISSAPVPQTFSVTSTSTVSATG 1320
Query: 1332 --VPTVDVLRKTEEKKPPTVFSPSVPAPAP-------VNTP---PSASTFLGSPLSKS-- 1391
VP L T K +PS P+P+P N P PS+ + S +S
Sbjct: 1321 FNVPFGKPL--TSVKVDLNQAAPSTPSPSPGPTAGFTFNLPALSPSSPEMVSSSTGQSSL 1380
Query: 1392 FPSPAAVVDLNKPLSTSTQSSFASPVVSVSDSL-----------FQAPKMVSPPSNLSSL 1451
FP A ++ +++T S S + S SL FQ+P++ +P S +
Sbjct: 1381 FPPSAPTSQVSSDQASATSSLTDSSRLFSSTSLSSTPPITPPDAFQSPQVSTPSSAVPIT 1440
Query: 1452 NPTLVSSSKEQPMPKS-----DADTEKQAPASKPESRELKLQPSVTLAVGNHVEPTSVTQ 1511
P VS K+ S + + A A+K ++ L ++ ++ G V P S +
Sbjct: 1441 EP--VSEPKKPEAQSSSILSTQSTVDSVANATKTQNEPLPVKSEIS-NPGTTVTPVSSSG 1500
Query: 1512 TVSKDVGGHVPFVVA----------DAQPQQSSAAFVPLPTPNSTSKAAANGKSETSDAL 1571
+S G + + +QPQQ S+ P P + TS A+ E D +
Sbjct: 1501 FLSGFSSGTQSSLASMAAPSFSWPGSSQPQQLSSTPAPFPASSPTS---ASPFGEKKDIV 1560
Query: 1572 ITQDDDMDEEAPE-TNNVEFSLSSLGGFGTTSTPMSNAPKPNPFGGSFGNVNATSMNSSF 1631
TQ+D+MDEEAPE + E S+ S GGFG STP APK NPFGG FGN T+ N F
Sbjct: 1561 DTQEDEMDEEAPEASQTTELSMGSFGGFGLGSTPNPGAPKTNPFGGPFGNATTTTSN-PF 1620
Query: 1632 TMASPPSGELFRPASFSFQSPLASQAASQPTNSVAFSGSFGSGMATQASAQGGFGQPAQI 1682
M + PSGELF+PASF+FQ+P SQ A GSF S +Q AQ GFGQP+QI
Sbjct: 1621 NM-TVPSGELFKPASFNFQNPQPSQPAG--------FGSF-SVTPSQTPAQSGFGQPSQI 1680
BLAST of Csor.00g300930 vs. NCBI nr
Match:
KAG6573777.1 (Nuclear pore complex protein 214, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 3142 bits (8147), Expect = 0.0
Identity = 1681/1681 (100.00%), Postives = 1681/1681 (100.00%), Query Frame = 0
Query: 1 MASVDSRHSISLTPIILEDSYEGEHVETNDYYFEKIGEPVPVKLNDSIFDPGSPPSQPLA 60
MASVDSRHSISLTPIILEDSYEGEHVETNDYYFEKIGEPVPVKLNDSIFDPGSPPSQPLA
Sbjct: 1 MASVDSRHSISLTPIILEDSYEGEHVETNDYYFEKIGEPVPVKLNDSIFDPGSPPSQPLA 60
Query: 61 VSESFGLIFVAHSSGFFVVRTKDVVASAKEMKNGGTGSSIQDLSIVDVSVGKVHVLALSN 120
VSESFGLIFVAHSSGFFVVRTKDVVASAKEMKNGGTGSSIQDLSIVDVSVGKVHVLALSN
Sbjct: 61 VSESFGLIFVAHSSGFFVVRTKDVVASAKEMKNGGTGSSIQDLSIVDVSVGKVHVLALSN 120
Query: 121 DNSFLAAVVAGDVRLFSVDSLLDKAEKPYFSCSTTDSSCIKDFKWTRKPENSYLVLSKHG 180
DNSFLAAVVAGDVRLFSVDSLLDKAEKPYFSCSTTDSSCIKDFKWTRKPENSYLVLSKHG
Sbjct: 121 DNSFLAAVVAGDVRLFSVDSLLDKAEKPYFSCSTTDSSCIKDFKWTRKPENSYLVLSKHG 180
Query: 181 KLYQGSASGPFKHIMHDIDAVECSVKGKFIAVAKKDTLSIFSYKFKERLSMSLLPSSGNG 240
KLYQGSASGPFKHIMHDIDAVECSVKGKFIAVAKKDTLSIFSYKFKERLSMSLLPSSGNG
Sbjct: 181 KLYQGSASGPFKHIMHDIDAVECSVKGKFIAVAKKDTLSIFSYKFKERLSMSLLPSSGNG 240
Query: 241 DTDTDFAVKVDSIKWVRADCIIIGCFQVAATGDEEDYFVQVIRSKDGKITDVSSNKVLLS 300
DTDTDFAVKVDSIKWVRADCIIIGCFQVAATGDEEDYFVQVIRSKDGKITDVSSNKVLLS
Sbjct: 241 DTDTDFAVKVDSIKWVRADCIIIGCFQVAATGDEEDYFVQVIRSKDGKITDVSSNKVLLS 300
Query: 301 FHDIYSGFTPDILPVETGPCLLLSYLDKCKLAIVANRNNTDQHIVLLGWLQEVENEVAVI 360
FHDIYSGFTPDILPVETGPCLLLSYLDKCKLAIVANRNNTDQHIVLLGWLQEVENEVAVI
Sbjct: 301 FHDIYSGFTPDILPVETGPCLLLSYLDKCKLAIVANRNNTDQHIVLLGWLQEVENEVAVI 360
Query: 361 DIERDKSLPRIELQDNGDDNLVMGLCIDRVSLPGKVEVQVGNEEIREVSPYCTLLCLTLE 420
DIERDKSLPRIELQDNGDDNLVMGLCIDRVSLPGKVEVQVGNEEIREVSPYCTLLCLTLE
Sbjct: 361 DIERDKSLPRIELQDNGDDNLVMGLCIDRVSLPGKVEVQVGNEEIREVSPYCTLLCLTLE 420
Query: 421 GKLILFHFSSANESEASDETVSACDEEEEDDTVVPTDDQPQLFSNIDQRPVSKVDESPVI 480
GKLILFHFSSANESEASDETVSACDEEEEDDTVVPTDDQPQLFSNIDQRPVSKVDESPVI
Sbjct: 421 GKLILFHFSSANESEASDETVSACDEEEEDDTVVPTDDQPQLFSNIDQRPVSKVDESPVI 480
Query: 481 TRESNAKSQQMDSFAFSQPLKPSTLERPNNEIGNFAKPVKSFTGLGSVAFSGKSVDVPSQ 540
TRESNAKSQQMDSFAFSQPLKPSTLERPNNEIGNFAKPVKSFTGLGSVAFSGKSVDVPSQ
Sbjct: 481 TRESNAKSQQMDSFAFSQPLKPSTLERPNNEIGNFAKPVKSFTGLGSVAFSGKSVDVPSQ 540
Query: 541 SLKSSILERPNNEIGNFDMPFDKFTGLGSVAFSGQSVDMPSQSLKPSFLERPNNQIGNFD 600
SLKSSILERPNNEIGNFDMPFDKFTGLGSVAFSGQSVDMPSQSLKPSFLERPNNQIGNFD
Sbjct: 541 SLKSSILERPNNEIGNFDMPFDKFTGLGSVAFSGQSVDMPSQSLKPSFLERPNNQIGNFD 600
Query: 601 KPVQKFTGLGSVAFSEQSANVPSHPFLNVKESTIKQSSGAANAFTGFAGKPFQPKDVPST 660
KPVQKFTGLGSVAFSEQSANVPSHPFLNVKESTIKQSSGAANAFTGFAGKPFQPKDVPST
Sbjct: 601 KPVQKFTGLGSVAFSEQSANVPSHPFLNVKESTIKQSSGAANAFTGFAGKPFQPKDVPST 660
Query: 661 LTQSGRQVSAGAGKIESLPVIQSSQVSLQDNFSLGKISNKKQDGSERNYGNVPLAKPMTE 720
LTQSGRQVSAGAGKIESLPVIQSSQVSLQDNFSLGKISNKKQDGSERNYGNVPLAKPMTE
Sbjct: 661 LTQSGRQVSAGAGKIESLPVIQSSQVSLQDNFSLGKISNKKQDGSERNYGNVPLAKPMTE 720
Query: 721 MCEGLDMLLESIEEPGGFLDACTTFQKSSVEALELGLATLSDQCQIWRRTMTERAQEVQN 780
MCEGLDMLLESIEEPGGFLDACTTFQKSSVEALELGLATLSDQCQIWRRTMTERAQEVQN
Sbjct: 721 MCEGLDMLLESIEEPGGFLDACTTFQKSSVEALELGLATLSDQCQIWRRTMTERAQEVQN 780
Query: 781 LFDRTVEVLSKKTYIEGIVTQASDSNYWEHWDRQKLSSELELKRQRILQMNQNMTNQLIE 840
LFDRTVEVLSKKTYIEGIVTQASDSNYWEHWDRQKLSSELELKRQRILQMNQNMTNQLIE
Sbjct: 781 LFDRTVEVLSKKTYIEGIVTQASDSNYWEHWDRQKLSSELELKRQRILQMNQNMTNQLIE 840
Query: 841 LERHFNGLELNKFGGNDEIQVNERALQRKFGSSRQSHSLHSLNNIMGSQLAAAQLLSDNL 900
LERHFNGLELNKFGGNDEIQVNERALQRKFGSSRQSHSLHSLNNIMGSQLAAAQLLSDNL
Sbjct: 841 LERHFNGLELNKFGGNDEIQVNERALQRKFGSSRQSHSLHSLNNIMGSQLAAAQLLSDNL 900
Query: 901 SKQIATLNIESPSSKRQSITKELFETIGITYDASFSSPNVNKIPETSSKKLLLSADSFSS 960
SKQIATLNIESPSSKRQSITKELFETIGITYDASFSSPNVNKIPETSSKKLLLSADSFSS
Sbjct: 901 SKQIATLNIESPSSKRQSITKELFETIGITYDASFSSPNVNKIPETSSKKLLLSADSFSS 960
Query: 961 KDTSRRKQRSGAKISETETGRRRRDSLDRNLASVQPPKTTVKRMILQGTPLSNEKQFCSP 1020
KDTSRRKQRSGAKISETETGRRRRDSLDRNLASVQPPKTTVKRMILQGTPLSNEKQFCSP
Sbjct: 961 KDTSRRKQRSGAKISETETGRRRRDSLDRNLASVQPPKTTVKRMILQGTPLSNEKQFCSP 1020
Query: 1021 TLEGPATVARPASRIPSSMLSSSSKNAEQGSENPATPFSWASSPRQKFQPLQKTNGTAPS 1080
TLEGPATVARPASRIPSSMLSSSSKNAEQGSENPATPFSWASSPRQKFQPLQKTNGTAPS
Sbjct: 1021 TLEGPATVARPASRIPSSMLSSSSKNAEQGSENPATPFSWASSPRQKFQPLQKTNGTAPS 1080
Query: 1081 PLPVFQSSHEMVKKSNSEAYSAASENKFAEVTYPEKSKASDFFSLARSDSVQKSNMNFEQ 1140
PLPVFQSSHEMVKKSNSEAYSAASENKFAEVTYPEKSKASDFFSLARSDSVQKSNMNFEQ
Sbjct: 1081 PLPVFQSSHEMVKKSNSEAYSAASENKFAEVTYPEKSKASDFFSLARSDSVQKSNMNFEQ 1140
Query: 1141 KSSIFVTSSKPMSTPKDSIETLNPNSQKTANVKERLTTPSPLFGSANKPEPVSVGTTSSL 1200
KSSIFVTSSKPMSTPKDSIETLNPNSQKTANVKERLTTPSPLFGSANKPEPVSVGTTSSL
Sbjct: 1141 KSSIFVTSSKPMSTPKDSIETLNPNSQKTANVKERLTTPSPLFGSANKPEPVSVGTTSSL 1200
Query: 1201 VPTVDVLRKTEEKKPPTVFSPSVPAPAPVNTPPSASTFLGSPLSKSFPSPAAVVDLNKPL 1260
VPTVDVLRKTEEKKPPTVFSPSVPAPAPVNTPPSASTFLGSPLSKSFPSPAAVVDLNKPL
Sbjct: 1201 VPTVDVLRKTEEKKPPTVFSPSVPAPAPVNTPPSASTFLGSPLSKSFPSPAAVVDLNKPL 1260
Query: 1261 STSTQSSFASPVVSVSDSLFQAPKMVSPPSNLSSLNPTLVSSSKEQPMPKSDADTEKQAP 1320
STSTQSSFASPVVSVSDSLFQAPKMVSPPSNLSSLNPTLVSSSKEQPMPKSDADTEKQAP
Sbjct: 1261 STSTQSSFASPVVSVSDSLFQAPKMVSPPSNLSSLNPTLVSSSKEQPMPKSDADTEKQAP 1320
Query: 1321 ASKPESRELKLQPSVTLAVGNHVEPTSVTQTVSKDVGGHVPFVVADAQPQQSSAAFVPLP 1380
ASKPESRELKLQPSVTLAVGNHVEPTSVTQTVSKDVGGHVPFVVADAQPQQSSAAFVPLP
Sbjct: 1321 ASKPESRELKLQPSVTLAVGNHVEPTSVTQTVSKDVGGHVPFVVADAQPQQSSAAFVPLP 1380
Query: 1381 TPNSTSKAAANGKSETSDALITQDDDMDEEAPETNNVEFSLSSLGGFGTTSTPMSNAPKP 1440
TPNSTSKAAANGKSETSDALITQDDDMDEEAPETNNVEFSLSSLGGFGTTSTPMSNAPKP
Sbjct: 1381 TPNSTSKAAANGKSETSDALITQDDDMDEEAPETNNVEFSLSSLGGFGTTSTPMSNAPKP 1440
Query: 1441 NPFGGSFGNVNATSMNSSFTMASPPSGELFRPASFSFQSPLASQAASQPTNSVAFSGSFG 1500
NPFGGSFGNVNATSMNSSFTMASPPSGELFRPASFSFQSPLASQAASQPTNSVAFSGSFG
Sbjct: 1441 NPFGGSFGNVNATSMNSSFTMASPPSGELFRPASFSFQSPLASQAASQPTNSVAFSGSFG 1500
Query: 1501 SGMATQASAQGGFGQPAQIGVGQQALGTVLGSFGQSRQLGPSLPGTASGSPGGFNGGGFT 1560
SGMATQASAQGGFGQPAQIGVGQQALGTVLGSFGQSRQLGPSLPGTASGSPGGFNGGGFT
Sbjct: 1501 SGMATQASAQGGFGQPAQIGVGQQALGTVLGSFGQSRQLGPSLPGTASGSPGGFNGGGFT 1560
Query: 1561 SVKPVGGGFAGVGSGGGGGFGGGGFAGAASTGGGFAGASPPTGGFAGATGGGFAGAAGGG 1620
SVKPVGGGFAGVGSGGGGGFGGGGFAGAASTGGGFAGASPPTGGFAGATGGGFAGAAGGG
Sbjct: 1561 SVKPVGGGFAGVGSGGGGGFGGGGFAGAASTGGGFAGASPPTGGFAGATGGGFAGAAGGG 1620
Query: 1621 FAGAAGGGFAGAAGGGFAGAAGGFGAFGNQQGSGGFSAFGAAPGGSGGTGKPPELFTQIR 1680
FAGAAGGGFAGAAGGGFAGAAGGFGAFGNQQGSGGFSAFGAAPGGSGGTGKPPELFTQIR
Sbjct: 1621 FAGAAGGGFAGAAGGGFAGAAGGFGAFGNQQGSGGFSAFGAAPGGSGGTGKPPELFTQIR 1680
BLAST of Csor.00g300930 vs. NCBI nr
Match:
KAG7012851.1 (Nuclear pore complex protein, partial [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 3140 bits (8140), Expect = 0.0
Identity = 1679/1681 (99.88%), Postives = 1680/1681 (99.94%), Query Frame = 0
Query: 1 MASVDSRHSISLTPIILEDSYEGEHVETNDYYFEKIGEPVPVKLNDSIFDPGSPPSQPLA 60
MASVDSRHSISLTPI+LEDSYEGEHVETNDYYFEKIGEPVPVKLNDSIFDPGSPPSQPLA
Sbjct: 1 MASVDSRHSISLTPIVLEDSYEGEHVETNDYYFEKIGEPVPVKLNDSIFDPGSPPSQPLA 60
Query: 61 VSESFGLIFVAHSSGFFVVRTKDVVASAKEMKNGGTGSSIQDLSIVDVSVGKVHVLALSN 120
VSESFGLIFVAH SGFFVVRTKDVVASAKEMKNGGTGSSIQDLSIVDVSVGKVHVLALSN
Sbjct: 61 VSESFGLIFVAHLSGFFVVRTKDVVASAKEMKNGGTGSSIQDLSIVDVSVGKVHVLALSN 120
Query: 121 DNSFLAAVVAGDVRLFSVDSLLDKAEKPYFSCSTTDSSCIKDFKWTRKPENSYLVLSKHG 180
DNSFLAAVVAGDVRLFSVDSLLDKAEKPYFSCSTTDSSCIKDFKWTRKPENSYLVLSKHG
Sbjct: 121 DNSFLAAVVAGDVRLFSVDSLLDKAEKPYFSCSTTDSSCIKDFKWTRKPENSYLVLSKHG 180
Query: 181 KLYQGSASGPFKHIMHDIDAVECSVKGKFIAVAKKDTLSIFSYKFKERLSMSLLPSSGNG 240
KLYQGSASGPFKHIMHDIDAVECSVKGKFIAVAKKDTLSIFSYKFKERLSMSLLPSSGNG
Sbjct: 181 KLYQGSASGPFKHIMHDIDAVECSVKGKFIAVAKKDTLSIFSYKFKERLSMSLLPSSGNG 240
Query: 241 DTDTDFAVKVDSIKWVRADCIIIGCFQVAATGDEEDYFVQVIRSKDGKITDVSSNKVLLS 300
DTDTDFAVKVDSIKWVRADCIIIGCFQVAATGDEEDYFVQVIRSKDGKITDVSSNKVLLS
Sbjct: 241 DTDTDFAVKVDSIKWVRADCIIIGCFQVAATGDEEDYFVQVIRSKDGKITDVSSNKVLLS 300
Query: 301 FHDIYSGFTPDILPVETGPCLLLSYLDKCKLAIVANRNNTDQHIVLLGWLQEVENEVAVI 360
FHDIYSGFTPDILPVETGPCLLLSYLDKCKLAIVANRNNTDQHIVLLGWLQEVENEVAVI
Sbjct: 301 FHDIYSGFTPDILPVETGPCLLLSYLDKCKLAIVANRNNTDQHIVLLGWLQEVENEVAVI 360
Query: 361 DIERDKSLPRIELQDNGDDNLVMGLCIDRVSLPGKVEVQVGNEEIREVSPYCTLLCLTLE 420
DIERDKSLPRIELQDNGDDNLVMGLCIDRVSLPGKVEVQVGNEEIREVSPYCTLLCLTLE
Sbjct: 361 DIERDKSLPRIELQDNGDDNLVMGLCIDRVSLPGKVEVQVGNEEIREVSPYCTLLCLTLE 420
Query: 421 GKLILFHFSSANESEASDETVSACDEEEEDDTVVPTDDQPQLFSNIDQRPVSKVDESPVI 480
GKLILFHFSSANESEASDETVSACDEEEEDDTVVPTDDQPQLFSNIDQRPVSKVDESPVI
Sbjct: 421 GKLILFHFSSANESEASDETVSACDEEEEDDTVVPTDDQPQLFSNIDQRPVSKVDESPVI 480
Query: 481 TRESNAKSQQMDSFAFSQPLKPSTLERPNNEIGNFAKPVKSFTGLGSVAFSGKSVDVPSQ 540
TRESNAKSQQMDSFAFSQPLKPSTLERPNNEIGNFAKPVKSFTGLGSVAFSGKSVDVPSQ
Sbjct: 481 TRESNAKSQQMDSFAFSQPLKPSTLERPNNEIGNFAKPVKSFTGLGSVAFSGKSVDVPSQ 540
Query: 541 SLKSSILERPNNEIGNFDMPFDKFTGLGSVAFSGQSVDMPSQSLKPSFLERPNNQIGNFD 600
SLKSSILERPNNEIGNFDMPFDKFTGLGSVAFSGQSVDMPSQSLKPSFLERPNNQIGNFD
Sbjct: 541 SLKSSILERPNNEIGNFDMPFDKFTGLGSVAFSGQSVDMPSQSLKPSFLERPNNQIGNFD 600
Query: 601 KPVQKFTGLGSVAFSEQSANVPSHPFLNVKESTIKQSSGAANAFTGFAGKPFQPKDVPST 660
KPVQKFTGLGSVAFSEQSANVPSHPFLNVKESTIKQSSGAANAFTGFAGKPFQPKDVPST
Sbjct: 601 KPVQKFTGLGSVAFSEQSANVPSHPFLNVKESTIKQSSGAANAFTGFAGKPFQPKDVPST 660
Query: 661 LTQSGRQVSAGAGKIESLPVIQSSQVSLQDNFSLGKISNKKQDGSERNYGNVPLAKPMTE 720
LTQSGRQVSAGAGKIESLPVIQSSQVSLQDNFSLGKISNKKQDGSERNYGNVPLAKPMTE
Sbjct: 661 LTQSGRQVSAGAGKIESLPVIQSSQVSLQDNFSLGKISNKKQDGSERNYGNVPLAKPMTE 720
Query: 721 MCEGLDMLLESIEEPGGFLDACTTFQKSSVEALELGLATLSDQCQIWRRTMTERAQEVQN 780
MCEGLDMLLESIEEPGGFLDACTTFQKSSVEALELGLATLSDQCQIWRRTMTERAQEVQN
Sbjct: 721 MCEGLDMLLESIEEPGGFLDACTTFQKSSVEALELGLATLSDQCQIWRRTMTERAQEVQN 780
Query: 781 LFDRTVEVLSKKTYIEGIVTQASDSNYWEHWDRQKLSSELELKRQRILQMNQNMTNQLIE 840
LFDRTVEVLSKKTYIEGIVTQASDSNYWEHWDRQKLSSELELKRQRILQMNQNMTNQLIE
Sbjct: 781 LFDRTVEVLSKKTYIEGIVTQASDSNYWEHWDRQKLSSELELKRQRILQMNQNMTNQLIE 840
Query: 841 LERHFNGLELNKFGGNDEIQVNERALQRKFGSSRQSHSLHSLNNIMGSQLAAAQLLSDNL 900
LERHFNGLELNKFGGNDEIQVNERALQRKFGSSRQSHSLHSLNNIMGSQLAAAQLLSDNL
Sbjct: 841 LERHFNGLELNKFGGNDEIQVNERALQRKFGSSRQSHSLHSLNNIMGSQLAAAQLLSDNL 900
Query: 901 SKQIATLNIESPSSKRQSITKELFETIGITYDASFSSPNVNKIPETSSKKLLLSADSFSS 960
SKQIATLNIESPSSKRQSITKELFETIGITYDASFSSPNVNKIPETSSKKLLLSADSFSS
Sbjct: 901 SKQIATLNIESPSSKRQSITKELFETIGITYDASFSSPNVNKIPETSSKKLLLSADSFSS 960
Query: 961 KDTSRRKQRSGAKISETETGRRRRDSLDRNLASVQPPKTTVKRMILQGTPLSNEKQFCSP 1020
KDTSRRKQRSGAKISETETGRRRRDSLDRNLASVQPPKTTVKRMILQGTPLSNEKQFCSP
Sbjct: 961 KDTSRRKQRSGAKISETETGRRRRDSLDRNLASVQPPKTTVKRMILQGTPLSNEKQFCSP 1020
Query: 1021 TLEGPATVARPASRIPSSMLSSSSKNAEQGSENPATPFSWASSPRQKFQPLQKTNGTAPS 1080
TLEGPATVARPASRIPSSMLSSSSKNAEQGSENPATPFSWASSPRQKFQPLQKTNGTAPS
Sbjct: 1021 TLEGPATVARPASRIPSSMLSSSSKNAEQGSENPATPFSWASSPRQKFQPLQKTNGTAPS 1080
Query: 1081 PLPVFQSSHEMVKKSNSEAYSAASENKFAEVTYPEKSKASDFFSLARSDSVQKSNMNFEQ 1140
PLPVFQSSHEMVKKSNSEAYSAASENKFAEVTYPEKSKASDFFSLARSDSVQKSNMNFEQ
Sbjct: 1081 PLPVFQSSHEMVKKSNSEAYSAASENKFAEVTYPEKSKASDFFSLARSDSVQKSNMNFEQ 1140
Query: 1141 KSSIFVTSSKPMSTPKDSIETLNPNSQKTANVKERLTTPSPLFGSANKPEPVSVGTTSSL 1200
KSSIFVTSSKPMSTPKDSIETLNPNSQKTANVKERLTTPSPLFGSANKPEPVSVGTTSSL
Sbjct: 1141 KSSIFVTSSKPMSTPKDSIETLNPNSQKTANVKERLTTPSPLFGSANKPEPVSVGTTSSL 1200
Query: 1201 VPTVDVLRKTEEKKPPTVFSPSVPAPAPVNTPPSASTFLGSPLSKSFPSPAAVVDLNKPL 1260
VPTVDVLRKTEEKKPPTVFSPSVPAPAPVNTPPSASTFLGSPLSKSFPSPAAVVDLNKPL
Sbjct: 1201 VPTVDVLRKTEEKKPPTVFSPSVPAPAPVNTPPSASTFLGSPLSKSFPSPAAVVDLNKPL 1260
Query: 1261 STSTQSSFASPVVSVSDSLFQAPKMVSPPSNLSSLNPTLVSSSKEQPMPKSDADTEKQAP 1320
STSTQSSFASPVVSVSDSLFQAPKMVSPPSNLSSLNPTLVSSSKEQPMPKSDADTEKQAP
Sbjct: 1261 STSTQSSFASPVVSVSDSLFQAPKMVSPPSNLSSLNPTLVSSSKEQPMPKSDADTEKQAP 1320
Query: 1321 ASKPESRELKLQPSVTLAVGNHVEPTSVTQTVSKDVGGHVPFVVADAQPQQSSAAFVPLP 1380
ASKPESRELKLQPSVTLAVGNHVEPTSVTQTVSKDVGGHVPFVVADAQPQQSSAAFVPLP
Sbjct: 1321 ASKPESRELKLQPSVTLAVGNHVEPTSVTQTVSKDVGGHVPFVVADAQPQQSSAAFVPLP 1380
Query: 1381 TPNSTSKAAANGKSETSDALITQDDDMDEEAPETNNVEFSLSSLGGFGTTSTPMSNAPKP 1440
TPNSTSKAAANGKSETSDALITQDDDMDEEAPETNNVEFSLSSLGGFGTTSTPMSNAPKP
Sbjct: 1381 TPNSTSKAAANGKSETSDALITQDDDMDEEAPETNNVEFSLSSLGGFGTTSTPMSNAPKP 1440
Query: 1441 NPFGGSFGNVNATSMNSSFTMASPPSGELFRPASFSFQSPLASQAASQPTNSVAFSGSFG 1500
NPFGGSFGNVNATSMNSSFTMASPPSGELFRPASFSFQSPLASQAASQPTNSVAFSGSFG
Sbjct: 1441 NPFGGSFGNVNATSMNSSFTMASPPSGELFRPASFSFQSPLASQAASQPTNSVAFSGSFG 1500
Query: 1501 SGMATQASAQGGFGQPAQIGVGQQALGTVLGSFGQSRQLGPSLPGTASGSPGGFNGGGFT 1560
SGMATQASAQGGFGQPAQIGVGQQALGTVLGSFGQSRQLGPSLPGTASGSPGGFNGGGFT
Sbjct: 1501 SGMATQASAQGGFGQPAQIGVGQQALGTVLGSFGQSRQLGPSLPGTASGSPGGFNGGGFT 1560
Query: 1561 SVKPVGGGFAGVGSGGGGGFGGGGFAGAASTGGGFAGASPPTGGFAGATGGGFAGAAGGG 1620
SVKPVGGGFAGVGSGGGGGFGGGGFAGAASTGGGFAGASPPTGGFAGATGGGFAGAAGGG
Sbjct: 1561 SVKPVGGGFAGVGSGGGGGFGGGGFAGAASTGGGFAGASPPTGGFAGATGGGFAGAAGGG 1620
Query: 1621 FAGAAGGGFAGAAGGGFAGAAGGFGAFGNQQGSGGFSAFGAAPGGSGGTGKPPELFTQIR 1680
FAGAAGGGFAGAAGGGFAGAAGGFGAFGNQQGSGGFSAFGAAPGGSGGTGKPPELFTQIR
Sbjct: 1621 FAGAAGGGFAGAAGGGFAGAAGGFGAFGNQQGSGGFSAFGAAPGGSGGTGKPPELFTQIR 1680
BLAST of Csor.00g300930 vs. NCBI nr
Match:
XP_022945173.1 (nuclear pore complex protein NUP214 isoform X1 [Cucurbita moschata])
HSP 1 Score: 3055 bits (7921), Expect = 0.0
Identity = 1643/1687 (97.39%), Postives = 1654/1687 (98.04%), Query Frame = 0
Query: 1 MASVDSRHSISLTPIILEDSYEGEHVETNDYYFEKIGEPVPVKLNDSIFDPGSPPSQPLA 60
MASVDSRHSISLTPI LEDSYEGEHVETNDYYFEKIGEPVPVKLNDSIFDPGSPPSQPLA
Sbjct: 1 MASVDSRHSISLTPIALEDSYEGEHVETNDYYFEKIGEPVPVKLNDSIFDPGSPPSQPLA 60
Query: 61 VSESFGLIFVAHSSGFFVVRTKDVVASAKEMKNGGTGSSIQDLSIVDVSVGKVHVLALSN 120
VSESFGLIFVAH SGFFVVRT DVVASAKEMKNGGTGSSIQDLSIVDVSVGKVHVLALSN
Sbjct: 61 VSESFGLIFVAHLSGFFVVRTTDVVASAKEMKNGGTGSSIQDLSIVDVSVGKVHVLALSN 120
Query: 121 DNSFLAAVVAGDVRLFSVDSLLDKAEKPYFSCSTTDSSCIKDFKWTRKPENSYLVLSKHG 180
DNSFLAAVVAGDV LFSVDSLLDKAEKPYFSCSTTDSSCIKDFKWTRKPENSYLVLSKHG
Sbjct: 121 DNSFLAAVVAGDVHLFSVDSLLDKAEKPYFSCSTTDSSCIKDFKWTRKPENSYLVLSKHG 180
Query: 181 KLYQGSASGPFKHIMHDIDAVECSVKGKFIAVAKKDTLSIFSYKFKERLSMSLLPSSGNG 240
KLYQGSAS PFKHIMHDIDAVECSVKGKFIAVAKKDTLSIFSYKFKERLSMSLLPSSGNG
Sbjct: 181 KLYQGSASVPFKHIMHDIDAVECSVKGKFIAVAKKDTLSIFSYKFKERLSMSLLPSSGNG 240
Query: 241 DTDTDFAVKVDSIKWVRADCIIIGCFQVAATGDEEDYFVQVIRSKDGKITDVSSNKVLLS 300
DTDTDFA+KVDSIKWVRADCIIIGCFQV ATGDEEDYFVQVIRSKDGKITDVSSNKVLLS
Sbjct: 241 DTDTDFAMKVDSIKWVRADCIIIGCFQVTATGDEEDYFVQVIRSKDGKITDVSSNKVLLS 300
Query: 301 FHDIYSGFTPDILPVETGPCLLLSYLDKCKLAIVANRNNTDQHIVLLGWLQEVENEVAVI 360
FHDIYSGFTPDILPVETGPCLLLSYLDKCKLAIVANRNNTDQHIVLLGWLQEVENEVAVI
Sbjct: 301 FHDIYSGFTPDILPVETGPCLLLSYLDKCKLAIVANRNNTDQHIVLLGWLQEVENEVAVI 360
Query: 361 DIERDKSLPRIELQDNGDDNLVMGLCIDRVSLPGKVEVQVGNEEIREVSPYCTLLCLTLE 420
DIERDKSLPRIELQDNGDDNLVMGLCIDRVSLPGKVEVQVGNEEIREVSPYCTLLCLTLE
Sbjct: 361 DIERDKSLPRIELQDNGDDNLVMGLCIDRVSLPGKVEVQVGNEEIREVSPYCTLLCLTLE 420
Query: 421 GKLILFHFSSANESEASDETVSACDEEEEDDTVVPTDDQPQLFSNIDQRPVSKVDESPVI 480
GKL+LFHFSSANESEASDETVSACDEEEEDDTVVPTDDQPQLFSNIDQRPVS+VDESPVI
Sbjct: 421 GKLLLFHFSSANESEASDETVSACDEEEEDDTVVPTDDQPQLFSNIDQRPVSEVDESPVI 480
Query: 481 TRESNAKSQQMDSFAFSQPLKPSTLERPNNEIGNFAKPVKSFTGLGSVAFSGKSVDVPSQ 540
TRESNAKSQQMDSFAFSQPLKPSTLERPNNEIGNF KPVKSFTGLGSVAFSG+SVDVPSQ
Sbjct: 481 TRESNAKSQQMDSFAFSQPLKPSTLERPNNEIGNFTKPVKSFTGLGSVAFSGQSVDVPSQ 540
Query: 541 SLKSSILERPNNEIGNFDMPFDKFTGLGSVAFSGQSVDMPSQSLKPSFLERPNNQIGNFD 600
SLKSSILERPNNEIGNF+ PF KFTGLGSVAFSGQSVDM +QSLKPSFLERPNNQIGNFD
Sbjct: 541 SLKSSILERPNNEIGNFNKPFHKFTGLGSVAFSGQSVDMANQSLKPSFLERPNNQIGNFD 600
Query: 601 KPVQKFTGLGSVAFSEQSANVPSHPFLNVKESTIKQSSGAANAFTGFAGKPFQPKDVPST 660
KPVQKFTGLGSVAFSEQSA+VPSHPFLNVKESTIKQSSGAANAFTGFAGKPFQPKDVPST
Sbjct: 601 KPVQKFTGLGSVAFSEQSADVPSHPFLNVKESTIKQSSGAANAFTGFAGKPFQPKDVPST 660
Query: 661 LTQSGRQVSAGAGKIESLPVIQSSQVSLQDNFSLGKISNKKQDGSERNYGNVPLAKPMTE 720
LTQSGRQVSAGAGKIESLPVIQSSQVSLQDNFSLGKISNKKQDGSERNYGNVPLAKPMTE
Sbjct: 661 LTQSGRQVSAGAGKIESLPVIQSSQVSLQDNFSLGKISNKKQDGSERNYGNVPLAKPMTE 720
Query: 721 MCEGLDMLLESIEEPGGFLDACTTFQKSSVEALELGLATLSDQCQIWRRTMTERAQEVQN 780
MCEGLDMLLESIEEPGGFLDACTTFQKSSVEALELGLATLSDQCQIWRRTMTERAQEVQN
Sbjct: 721 MCEGLDMLLESIEEPGGFLDACTTFQKSSVEALELGLATLSDQCQIWRRTMTERAQEVQN 780
Query: 781 LFDRTVEVLSKKTYIEGIVTQASDSNYWEHWDRQKLSSELELKRQRILQMNQNMTNQLIE 840
LFDRTVEVLSKKTYIEGIVTQASDSNYWEHWDRQKLSSELELKRQRILQMNQNMTNQLIE
Sbjct: 781 LFDRTVEVLSKKTYIEGIVTQASDSNYWEHWDRQKLSSELELKRQRILQMNQNMTNQLIE 840
Query: 841 LERHFNGLELNKFGGNDEIQVNERALQRKFGSSRQSHSLHSLNNIMGSQLAAAQLLSDNL 900
LERHFNGLELNKFGGN+EIQVNER LQRKFGSSRQSHSLHSLNNIMGSQLAAAQLLSDNL
Sbjct: 841 LERHFNGLELNKFGGNEEIQVNERTLQRKFGSSRQSHSLHSLNNIMGSQLAAAQLLSDNL 900
Query: 901 SKQIATLNIESPSSKRQSITKELFETIGITYDASFSSPNVNKIPETSSKKLLLSADSFSS 960
SKQIATLNIESPSSKRQSITKELFETIGITYDASFSSPNVNKIPETSSKKLLLSADSFSS
Sbjct: 901 SKQIATLNIESPSSKRQSITKELFETIGITYDASFSSPNVNKIPETSSKKLLLSADSFSS 960
Query: 961 KDTSRRKQRSGAKISETETGRRRRDSLDRNLASVQPPKTTVKRMILQGTPLSNEKQFCSP 1020
KDTSRRKQRSGAKISETETGRRRRDSLDRNLASVQPPKTTVKRMILQGTPLSNEKQF SP
Sbjct: 961 KDTSRRKQRSGAKISETETGRRRRDSLDRNLASVQPPKTTVKRMILQGTPLSNEKQFRSP 1020
Query: 1021 TLEGPATVARPASRIPSSMLSSSSKNAEQGSENPATPFSWASSPRQKFQPLQKTNGTAPS 1080
TLEGPAT+ARPASRIPSSMLSSSSKNAEQGS+NPATPFSWAS PRQKFQPLQKTNGTAPS
Sbjct: 1021 TLEGPATIARPASRIPSSMLSSSSKNAEQGSKNPATPFSWASPPRQKFQPLQKTNGTAPS 1080
Query: 1081 PLPVFQSSHEMVKKSNSEAYSAASENKFAEVTYPEKSKASDFFSLARSDSVQKSNMNFEQ 1140
LPVFQSSHEMVKKSNSEAYSAASENKFAEVTYPEKSKASDFFSLARSDSVQKSNMNFEQ
Sbjct: 1081 SLPVFQSSHEMVKKSNSEAYSAASENKFAEVTYPEKSKASDFFSLARSDSVQKSNMNFEQ 1140
Query: 1141 KSSIFVTSSKPMSTPKDSIETLNPNSQKTANVKERLTTPSPLFGSANKPEPVSVGTTSSL 1200
KSSIFVTSSKPMSTPKDSIETLNPNSQKTANVKERLTTPSPLFGSANKPEPVSVGTTSSL
Sbjct: 1141 KSSIFVTSSKPMSTPKDSIETLNPNSQKTANVKERLTTPSPLFGSANKPEPVSVGTTSSL 1200
Query: 1201 VPTVDVLRKTEEKKPPTVFSPSVPAPAPVNTPPSAST-FLGSPLSKSFPSPAAVVDLNKP 1260
VPTVD LRKTEEKKPPTVFSPSVPAPAPVNTPPSAST F GSPLSKSFPSPAAVVDLNKP
Sbjct: 1201 VPTVDGLRKTEEKKPPTVFSPSVPAPAPVNTPPSASTLFSGSPLSKSFPSPAAVVDLNKP 1260
Query: 1261 LSTSTQSSFASPVVSVSDSLFQAPKMVSPPSNLSSLNPTLVSSSKEQPMPKSDADTEKQA 1320
LSTSTQSSFASPVVSVSDSLFQAPKMVSPPSNLSSLNPTLVSSSKEQPMPKSDADTEKQA
Sbjct: 1261 LSTSTQSSFASPVVSVSDSLFQAPKMVSPPSNLSSLNPTLVSSSKEQPMPKSDADTEKQA 1320
Query: 1321 PASKPESRELKLQPSVTLAVGNHVEPTSVTQTVSKDVGGHVPFVVADAQPQQSSAAFVPL 1380
PASKPESRELKLQPSVTLAVGNHVEPTSVTQTVSKDVGGHVPFVVADAQPQQS AAFVPL
Sbjct: 1321 PASKPESRELKLQPSVTLAVGNHVEPTSVTQTVSKDVGGHVPFVVADAQPQQSPAAFVPL 1380
Query: 1381 PTPNSTSKAAANGKSETSDALITQDDDMDEEAPETNNVEFSLSSLGGFGTTSTPMSNAPK 1440
PTPNSTSKAAANGKSETSDALITQDDDMDEEAPETNNVEFSLSSLGGFGTTSTPMSNAPK
Sbjct: 1381 PTPNSTSKAAANGKSETSDALITQDDDMDEEAPETNNVEFSLSSLGGFGTTSTPMSNAPK 1440
Query: 1441 PNPFGGSFGNVNATSMNSSFTMASPPSGELFRPASFSFQSPLASQAASQPTNSVAFSGSF 1500
PNPFGGSFGNVNATSMNSSFT ASPPSGELFRPASFSFQSPLASQAASQPTN+VAFSGSF
Sbjct: 1441 PNPFGGSFGNVNATSMNSSFTTASPPSGELFRPASFSFQSPLASQAASQPTNAVAFSGSF 1500
Query: 1501 GSGMATQASAQGGFGQPAQIGVGQQALGTVLGSFGQSRQLGPSLPGTASGSPGGFNGGGF 1560
GSGMATQA AQGGFGQPAQIGVGQQALGTVLGSFGQSRQLGPSLPGTASGSPGGFNGGGF
Sbjct: 1501 GSGMATQAPAQGGFGQPAQIGVGQQALGTVLGSFGQSRQLGPSLPGTASGSPGGFNGGGF 1560
Query: 1561 TSVKPVGGGFAGVGSGGGGGFGGGGF-----AGAASTGGGFAGASPPTGGFAGATGGGFA 1620
TSVKPVGGGFAGVGSGGGGGFGGGGF AGAASTGGGFAGASPPTGGFAGATGGGFA
Sbjct: 1561 TSVKPVGGGFAGVGSGGGGGFGGGGFSGGGFAGAASTGGGFAGASPPTGGFAGATGGGFA 1620
Query: 1621 GAAGGGFAGAAGGGFAGAAGGGFAGAAGGFGAFGNQQGSGGFSAFGAAPGGSGGTGKPPE 1680
GAAGGGFAGAAGGGFA GAAGGFGAFGNQQGSGGFSAFGAAPGGSGGTGKPPE
Sbjct: 1621 GAAGGGFAGAAGGGFA--------GAAGGFGAFGNQQGSGGFSAFGAAPGGSGGTGKPPE 1679
BLAST of Csor.00g300930 vs. NCBI nr
Match:
XP_022945174.1 (nuclear pore complex protein NUP214 isoform X2 [Cucurbita moschata])
HSP 1 Score: 3029 bits (7854), Expect = 0.0
Identity = 1627/1687 (96.44%), Postives = 1638/1687 (97.10%), Query Frame = 0
Query: 1 MASVDSRHSISLTPIILEDSYEGEHVETNDYYFEKIGEPVPVKLNDSIFDPGSPPSQPLA 60
MASVDSRHSISLTPI LEDSYEGEHVETNDYYFEKIGEPVPVKLNDSIFDPGSPPSQPLA
Sbjct: 1 MASVDSRHSISLTPIALEDSYEGEHVETNDYYFEKIGEPVPVKLNDSIFDPGSPPSQPLA 60
Query: 61 VSESFGLIFVAHSSGFFVVRTKDVVASAKEMKNGGTGSSIQDLSIVDVSVGKVHVLALSN 120
VSESFGLIFVAH SGFFVVRT DVVASAKEMKNGGTGSSIQDLSIVDVSVGKVHVLALSN
Sbjct: 61 VSESFGLIFVAHLSGFFVVRTTDVVASAKEMKNGGTGSSIQDLSIVDVSVGKVHVLALSN 120
Query: 121 DNSFLAAVVAGDVRLFSVDSLLDKAEKPYFSCSTTDSSCIKDFKWTRKPENSYLVLSKHG 180
DNSFLAAVVAGDV LFSVDSLLDKAEKPYFSCSTTDSSCIKDFKWTRKPENSYLVLSKHG
Sbjct: 121 DNSFLAAVVAGDVHLFSVDSLLDKAEKPYFSCSTTDSSCIKDFKWTRKPENSYLVLSKHG 180
Query: 181 KLYQGSASGPFKHIMHDIDAVECSVKGKFIAVAKKDTLSIFSYKFKERLSMSLLPSSGNG 240
KLYQGSAS PFKHIMHDIDAVECSVKGKFIAVAKKDTLSIFSYKFKERLSMSLLPSSGNG
Sbjct: 181 KLYQGSASVPFKHIMHDIDAVECSVKGKFIAVAKKDTLSIFSYKFKERLSMSLLPSSGNG 240
Query: 241 DTDTDFAVKVDSIKWVRADCIIIGCFQVAATGDEEDYFVQVIRSKDGKITDVSSNKVLLS 300
DTDTDFA+KVDSIKWVRADCIIIGCFQV ATGDEEDYFVQVIRSKDGKITDVSSNKVLLS
Sbjct: 241 DTDTDFAMKVDSIKWVRADCIIIGCFQVTATGDEEDYFVQVIRSKDGKITDVSSNKVLLS 300
Query: 301 FHDIYSGFTPDILPVETGPCLLLSYLDKCKLAIVANRNNTDQHIVLLGWLQEVENEVAVI 360
FHDIYSGFTPDILPVETGPCLLLSYLDKCKLAIVANRNNTDQHIVLLGWLQEVENEVAVI
Sbjct: 301 FHDIYSGFTPDILPVETGPCLLLSYLDKCKLAIVANRNNTDQHIVLLGWLQEVENEVAVI 360
Query: 361 DIERDKSLPRIELQDNGDDNLVMGLCIDRVSLPGKVEVQVGNEEIREVSPYCTLLCLTLE 420
DIERDKSLPRIELQDNGDDNLVMGLCIDRVSLPGKVEVQVGNEEIREVSPYCTLLCLTLE
Sbjct: 361 DIERDKSLPRIELQDNGDDNLVMGLCIDRVSLPGKVEVQVGNEEIREVSPYCTLLCLTLE 420
Query: 421 GKLILFHFSSANESEASDETVSACDEEEEDDTVVPTDDQPQLFSNIDQRPVSKVDESPVI 480
GKL+LFHFSSANESEASDETVSACDEEEEDDTVVPTDDQPQLFSNIDQRPVS+VDESPVI
Sbjct: 421 GKLLLFHFSSANESEASDETVSACDEEEEDDTVVPTDDQPQLFSNIDQRPVSEVDESPVI 480
Query: 481 TRESNAKSQQMDSFAFSQPLKPSTLERPNNEIGNFAKPVKSFTGLGSVAFSGKSVDVPSQ 540
TRESNAKSQQMDSFAFSQPLKPSTLERPNNEIGNF KPVKSFTGLGSVAFSG+SVDVPSQ
Sbjct: 481 TRESNAKSQQMDSFAFSQPLKPSTLERPNNEIGNFTKPVKSFTGLGSVAFSGQSVDVPSQ 540
Query: 541 SLKSSILERPNNEIGNFDMPFDKFTGLGSVAFSGQSVDMPSQSLKPSFLERPNNQIGNFD 600
SLKSSILERPNNEIGNF+ PF KFTGLGSVAFSGQSVDM +QSLKPSFLERPNNQIGNFD
Sbjct: 541 SLKSSILERPNNEIGNFNKPFHKFTGLGSVAFSGQSVDMANQSLKPSFLERPNNQIGNFD 600
Query: 601 KPVQKFTGLGSVAFSEQSANVPSHPFLNVKESTIKQSSGAANAFTGFAGKPFQPKDVPST 660
KPVQKFTGLGSVAFSEQSA+VPSHPFLNVKESTIKQSSGAANAFTGFAGKPFQPKDVPST
Sbjct: 601 KPVQKFTGLGSVAFSEQSADVPSHPFLNVKESTIKQSSGAANAFTGFAGKPFQPKDVPST 660
Query: 661 LTQSGRQVSAGAGKIESLPVIQSSQVSLQDNFSLGKISNKKQDGSERNYGNVPLAKPMTE 720
LTQSGRQVSAGAGKIESLPVIQSSQVSLQDNFSLGKISNKKQDGSERNYGNVPLAKPMTE
Sbjct: 661 LTQSGRQVSAGAGKIESLPVIQSSQVSLQDNFSLGKISNKKQDGSERNYGNVPLAKPMTE 720
Query: 721 MCEGLDMLLESIEEPGGFLDACTTFQKSSVEALELGLATLSDQCQIWRRTMTERAQEVQN 780
MCEGLDMLLESIEEPGGFLDACTTFQKSSVEALELGLATLSDQCQIWRRTMTERAQEVQN
Sbjct: 721 MCEGLDMLLESIEEPGGFLDACTTFQKSSVEALELGLATLSDQCQIWRRTMTERAQEVQN 780
Query: 781 LFDRTVEVLSKKTYIEGIVTQASDSNYWEHWDRQKLSSELELKRQRILQMNQNMTNQLIE 840
LFDRTVEVLSKKTYIEGIVTQASDSNYWEHWDRQKLSSELELKRQRILQMNQNMTNQLIE
Sbjct: 781 LFDRTVEVLSKKTYIEGIVTQASDSNYWEHWDRQKLSSELELKRQRILQMNQNMTNQLIE 840
Query: 841 LERHFNGLELNKFGGNDEIQVNERALQRKFGSSRQSHSLHSLNNIMGSQLAAAQLLSDNL 900
LERHFNGLELNKFGGN+EIQVNER LQRKFGSSRQSHSLHSLNNIMGSQLAAAQLLSDNL
Sbjct: 841 LERHFNGLELNKFGGNEEIQVNERTLQRKFGSSRQSHSLHSLNNIMGSQLAAAQLLSDNL 900
Query: 901 SKQIATLNIESPSSKRQSITKELFETIGITYDASFSSPNVNKIPETSSKKLLLSADSFSS 960
SKQIATLNIESPSSKRQSITKELFETIGITYDASFSSPNVNKIPETSSKKLLLSADSFSS
Sbjct: 901 SKQIATLNIESPSSKRQSITKELFETIGITYDASFSSPNVNKIPETSSKKLLLSADSFSS 960
Query: 961 KDTSRRKQRSGAKISETETGRRRRDSLDRNLASVQPPKTTVKRMILQGTPLSNEKQFCSP 1020
KDTSRRKQRSGAKISETETGRRRRDSLDRNLASVQPPKTTVKRMILQGTPLSNEKQF SP
Sbjct: 961 KDTSRRKQRSGAKISETETGRRRRDSLDRNLASVQPPKTTVKRMILQGTPLSNEKQFRSP 1020
Query: 1021 TLEGPATVARPASRIPSSMLSSSSKNAEQGSENPATPFSWASSPRQKFQPLQKTNGTAPS 1080
TLEGPAT+ARPASRIPSSMLSSSSKNAEQGS+NPATPFSWAS PRQKFQPLQKTNGTAPS
Sbjct: 1021 TLEGPATIARPASRIPSSMLSSSSKNAEQGSKNPATPFSWASPPRQKFQPLQKTNGTAPS 1080
Query: 1081 PLPVFQSSHEMVKKSNSEAYSAASENKFAEVTYPEKSKASDFFSLARSDSVQKSNMNFEQ 1140
LPVFQSSHEMVKKSNSEAYSAASENKFAEVTYPEKSKASDFFSLARSDSVQKSNMNFEQ
Sbjct: 1081 SLPVFQSSHEMVKKSNSEAYSAASENKFAEVTYPEKSKASDFFSLARSDSVQKSNMNFEQ 1140
Query: 1141 KSSIFVTSSKPMSTPKDSIETLNPNSQKTANVKERLTTPSPLFGSANKPEPVSVGTTSSL 1200
KSSIFVTSSKPMSTPKDSIETLNPNSQKTANVKERLTTPSPLFGSANKPEPVSVGTTSSL
Sbjct: 1141 KSSIFVTSSKPMSTPKDSIETLNPNSQKTANVKERLTTPSPLFGSANKPEPVSVGTTSSL 1200
Query: 1201 VPTVDVLRKTEEKKPPTVFSPSVPAPAPVNTPPSAST-FLGSPLSKSFPSPAAVVDLNKP 1260
VPTVD LRKTEEKKPPTVFSPSVPAPAPVNTPPSAST F GSPLSKSFPSPAAVVDLNKP
Sbjct: 1201 VPTVDGLRKTEEKKPPTVFSPSVPAPAPVNTPPSASTLFSGSPLSKSFPSPAAVVDLNKP 1260
Query: 1261 LSTSTQSSFASPVVSVSDSLFQAPKMVSPPSNLSSLNPTLVSSSKEQPMPKSDADTEKQA 1320
LSTSTQSSFASPVVSVSDSLFQAPKMVSPPSNLSSLNPTLVSSSKEQPMPKSDADTEKQA
Sbjct: 1261 LSTSTQSSFASPVVSVSDSLFQAPKMVSPPSNLSSLNPTLVSSSKEQPMPKSDADTEKQA 1320
Query: 1321 PASKPESRELKLQPSVTLAVGNHVEPTSVTQTVSKDVGGHVPFVVADAQPQQSSAAFVPL 1380
PASKPESRELKLQPSVTLAVGNHVEPTSVTQTVSKDVGGHVPFVVADAQPQQS AAFVPL
Sbjct: 1321 PASKPESRELKLQPSVTLAVGNHVEPTSVTQTVSKDVGGHVPFVVADAQPQQSPAAFVPL 1380
Query: 1381 PTPNSTSKAAANGKSETSDALITQDDDMDEEAPETNNVEFSLSSLGGFGTTSTPMSNAPK 1440
PTPNSTSKAAANGKSETSDALITQDDDMDEEAPETNNVEFSLSSLGGFGTTSTPMSNAPK
Sbjct: 1381 PTPNSTSKAAANGKSETSDALITQDDDMDEEAPETNNVEFSLSSLGGFGTTSTPMSNAPK 1440
Query: 1441 PNPFGGSFGNVNATSMNSSFTMASPPSGELFRPASFSFQSPLASQAASQPTNSVAFSGSF 1500
PNPFGGSFGNVNATSMNSSFT ASPPSGELFRPASFSFQSPLASQAASQPTN+VAFSGSF
Sbjct: 1441 PNPFGGSFGNVNATSMNSSFTTASPPSGELFRPASFSFQSPLASQAASQPTNAVAFSGSF 1500
Query: 1501 GSGMATQASAQGGFGQPAQIGVGQQALGTVLGSFGQSRQLGPSLPGTASGSPGGFNGGGF 1560
GSGMATQA AQGGFGQPAQIGVGQQALGTVLGSFGQSRQLGPSLPGTASGSPGGFNGGGF
Sbjct: 1501 GSGMATQAPAQGGFGQPAQIGVGQQALGTVLGSFGQSRQLGPSLPGTASGSPGGFNGGGF 1560
Query: 1561 TSVKPVGGGFAGVGSGGGGGFGGGGF-----AGAASTGGGFAGASPPTGGFAGATGGGFA 1620
TSVKPVGGGFAGVGSGGGGGFGGGGF AGAASTGGGFAGASPPTGGFAGA
Sbjct: 1561 TSVKPVGGGFAGVGSGGGGGFGGGGFSGGGFAGAASTGGGFAGASPPTGGFAGA------ 1620
Query: 1621 GAAGGGFAGAAGGGFAGAAGGGFAGAAGGFGAFGNQQGSGGFSAFGAAPGGSGGTGKPPE 1680
AGGGFAGAAGGFGAFGNQQGSGGFSAFGAAPGGSGGTGKPPE
Sbjct: 1621 ------------------AGGGFAGAAGGFGAFGNQQGSGGFSAFGAAPGGSGGTGKPPE 1663
BLAST of Csor.00g300930 vs. NCBI nr
Match:
XP_023541587.1 (nuclear pore complex protein NUP214 isoform X1 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 3024 bits (7841), Expect = 0.0
Identity = 1628/1689 (96.39%), Postives = 1644/1689 (97.34%), Query Frame = 0
Query: 1 MASVDSRHSISLTPIILEDSYEGEHVETNDYYFEKIGEPVPVKLNDSIFDPGSPPSQPLA 60
MASVDSRHSIS T + LEDSYEGEHVETNDYYFEKIGEPVPVKLNDSIFDPGSPPSQPLA
Sbjct: 1 MASVDSRHSISSTHVALEDSYEGEHVETNDYYFEKIGEPVPVKLNDSIFDPGSPPSQPLA 60
Query: 61 VSESFGLIFVAHSSGFFVVRTKDVVASAKEMKNGGTGSSIQDLSIVDVSVGKVHVLALSN 120
VSESFGLIFVAH SGFFVVRTKDVVASAKEMKNGGTGSSIQDLSIVDVSVGKVHVLALSN
Sbjct: 61 VSESFGLIFVAHLSGFFVVRTKDVVASAKEMKNGGTGSSIQDLSIVDVSVGKVHVLALSN 120
Query: 121 DNSFLAAVVAGDVRLFSVDSLLDKAEKPYFSCSTTDSSCIKDFKWTRKPENSYLVLSKHG 180
DNSFLAAVVAGDV LFSVDSLLDKAEKPYFSCSTTDSSCIKDFKWTRKPENSYLVLSKHG
Sbjct: 121 DNSFLAAVVAGDVHLFSVDSLLDKAEKPYFSCSTTDSSCIKDFKWTRKPENSYLVLSKHG 180
Query: 181 KLYQGSASGPFKHIMHDIDAVECSVKGKFIAVAKKDTLSIFSYKFKERLSMSLLPSSGNG 240
KLYQGSASGPFKHIMHDIDAVECSVKGKFIAVAKKDT +IFSYKFKERLSMSLLPS GNG
Sbjct: 181 KLYQGSASGPFKHIMHDIDAVECSVKGKFIAVAKKDTFTIFSYKFKERLSMSLLPSLGNG 240
Query: 241 DTDTDFAVKVDSIKWVRADCIIIGCFQVAATGDEEDYFVQVIRSKDGKITDVSSNKVLLS 300
DTDTDFAVKVDSIKWVRADCIIIGCFQV ATGDEEDYFVQVIRSKDGKITDVSSNKVLLS
Sbjct: 241 DTDTDFAVKVDSIKWVRADCIIIGCFQVTATGDEEDYFVQVIRSKDGKITDVSSNKVLLS 300
Query: 301 FHDIYSGFTPDILPVETGPCLLLSYLDKCKLAIVANRNNTDQHIVLLGWLQEVENEVAVI 360
FHDIYSGFTPDILPVETGPCLLLSYLDKCKLAIVANRNNTDQHIVLLGWLQEVENEVAVI
Sbjct: 301 FHDIYSGFTPDILPVETGPCLLLSYLDKCKLAIVANRNNTDQHIVLLGWLQEVENEVAVI 360
Query: 361 DIERDKSLPRIELQDNGDDNLVMGLCIDRVSLPGKVEVQVGNEEIREVSPYCTLLCLTLE 420
DIERDKSLPRIELQDNGDDNLVMGLCIDRVSLPGKVEVQVGNEEIREVSPYCTLLCLTLE
Sbjct: 361 DIERDKSLPRIELQDNGDDNLVMGLCIDRVSLPGKVEVQVGNEEIREVSPYCTLLCLTLE 420
Query: 421 GKLILFHFSSANESEASDETVSACDEEEEDDTVVPTDDQPQLFSNIDQRPVSKVDESPVI 480
GKLILFHFSSANESEASDETVSACDEEEEDDTVVPTDDQPQLFSNIDQRPVSKVDESPVI
Sbjct: 421 GKLILFHFSSANESEASDETVSACDEEEEDDTVVPTDDQPQLFSNIDQRPVSKVDESPVI 480
Query: 481 TRESNAKSQQMDSFAFSQPLKPSTLERPNNEIGNFAKPVKSFTGLGSVAFSGKSVDVPSQ 540
TRESNAKSQQMDSFAFSQPLKPSTLERPNNEIGNFAKP K+FTGLGSVAFSG+SVDVPSQ
Sbjct: 481 TRESNAKSQQMDSFAFSQPLKPSTLERPNNEIGNFAKPAKNFTGLGSVAFSGQSVDVPSQ 540
Query: 541 SLKSSILERPNNEIGNFDMPFDKFTGLGSVAFSGQSVDMPSQSLKPSFLERPNNQIGNFD 600
+LKSSILERPNNEIGNF+ PF KFTGLGSVAFSGQSVDMP+QSLKPSFLERPNNQIGNFD
Sbjct: 541 TLKSSILERPNNEIGNFNKPFHKFTGLGSVAFSGQSVDMPNQSLKPSFLERPNNQIGNFD 600
Query: 601 KPVQKFTGLGSVAFSEQSANVPSHPFLNVKESTIKQSSGAANAFTGFAGKPFQPKDVPST 660
KPVQKFTGLGSVAFSEQSA+VPSHPFLNVKEST+KQSSGAANAFTGFAGKPFQPKDVPST
Sbjct: 601 KPVQKFTGLGSVAFSEQSADVPSHPFLNVKESTVKQSSGAANAFTGFAGKPFQPKDVPST 660
Query: 661 LTQSGRQVSAGAGKIESLPVIQSSQVSLQDNFSLGKISNKKQDGSERNYGNVPLAKPMTE 720
LTQSGRQVSAGAGKIESLPVIQSSQVSLQDNFSLGKISNKKQDGSERNYGNVPLAKPMTE
Sbjct: 661 LTQSGRQVSAGAGKIESLPVIQSSQVSLQDNFSLGKISNKKQDGSERNYGNVPLAKPMTE 720
Query: 721 MCEGLDMLLESIEEPGGFLDACTTFQKSSVEALELGLATLSDQCQIWRRTMTERAQEVQN 780
MCEGLDMLLESIEEPGGFLDACTTFQKSSVEALELGLATLSDQCQIWRRTMTERAQEVQ
Sbjct: 721 MCEGLDMLLESIEEPGGFLDACTTFQKSSVEALELGLATLSDQCQIWRRTMTERAQEVQY 780
Query: 781 LFDRTVEVLSKKTYIEGIVTQASDSNYWEHWDRQKLSSELELKRQRILQMNQNMTNQLIE 840
LFDRTVEVLSKKTYIEGIVTQASDSNYWEHWDRQKLSSELELKRQ ILQMNQNMTNQLIE
Sbjct: 781 LFDRTVEVLSKKTYIEGIVTQASDSNYWEHWDRQKLSSELELKRQHILQMNQNMTNQLIE 840
Query: 841 LERHFNGLELNKFGGNDEIQVNERALQRKFGSSRQSHSLHSLNNIMGSQLAAAQLLSDNL 900
LERHFNGLELNKFGGNDE QVNERALQRKFGSSRQSHSLHSLNNIMGSQLAAAQLLSDNL
Sbjct: 841 LERHFNGLELNKFGGNDETQVNERALQRKFGSSRQSHSLHSLNNIMGSQLAAAQLLSDNL 900
Query: 901 SKQIATLNIESPSSKRQSITKELFETIGITYDASFSSPNVNKIPETSSKKLLLSADSFSS 960
SKQIATLNIESPSSKRQSITKELF+TIGITYDASFSSPNVNKIPETSSKKLLLSADSFSS
Sbjct: 901 SKQIATLNIESPSSKRQSITKELFDTIGITYDASFSSPNVNKIPETSSKKLLLSADSFSS 960
Query: 961 KDTSRRKQRSGAKISETETGRRRRDSLDRNLASVQPPKTTVKRMILQGTPLSNEKQFCSP 1020
KDTSRRKQRSGAKISETETGRRRRDSLDRNLASVQPPKTTV+RMILQGTPLSNEK+F SP
Sbjct: 961 KDTSRRKQRSGAKISETETGRRRRDSLDRNLASVQPPKTTVQRMILQGTPLSNEKEFRSP 1020
Query: 1021 TLEGPATVARPASRIPSSMLSSSSKNAEQGSENPATPFSWASSPRQKFQPLQKTNGTAPS 1080
TLEGPATVARPASRI SSMLSSSSKNAEQGSENPATPFSWAS PRQKFQP QKTNGTAPS
Sbjct: 1021 TLEGPATVARPASRIASSMLSSSSKNAEQGSENPATPFSWASPPRQKFQPPQKTNGTAPS 1080
Query: 1081 PLPVFQSSHEMVKKSNSEAYSAASENKFAEVTYPEKSKASDFFSLARSDSVQKSNMNFEQ 1140
PLPVFQSSHEM+KKSNSEAYSAASENKFAEVTYPEKSKASDFFSLARSDSVQKSNMNFEQ
Sbjct: 1081 PLPVFQSSHEMLKKSNSEAYSAASENKFAEVTYPEKSKASDFFSLARSDSVQKSNMNFEQ 1140
Query: 1141 KSSIFVTSSKPMSTPKDSIETLNPNSQKTANVKERLTTPSPLFGSANKPEPVSVGTTSSL 1200
KSSIFVTSSKPMSTPKDSIETLNPNSQKTANVKERLTTPSPLFGSANKPEP SVGTTSSL
Sbjct: 1141 KSSIFVTSSKPMSTPKDSIETLNPNSQKTANVKERLTTPSPLFGSANKPEPTSVGTTSSL 1200
Query: 1201 VPTVDVLRKTEEKKPPTVFSPSVPAPAPVNTPPSAST-FLGSPLSKSFPSPAA--VVDLN 1260
VP VD LRKTEEKKPPTVFSPSV APAPVNTP SAST F GSPLSKSFPSPAA VVDLN
Sbjct: 1201 VPIVDGLRKTEEKKPPTVFSPSVSAPAPVNTPSSASTLFSGSPLSKSFPSPAAAAVVDLN 1260
Query: 1261 KPLSTSTQSSFASPVVSVSDSLFQAPKMVSPPSNLSSLNPTLVSSSKEQPMPKSDADTEK 1320
KPLSTSTQSSFA PVVSVSDSLFQAPKMVSPPSNLSSLNPTLVSSSKEQPMPKSDADTEK
Sbjct: 1261 KPLSTSTQSSFAFPVVSVSDSLFQAPKMVSPPSNLSSLNPTLVSSSKEQPMPKSDADTEK 1320
Query: 1321 QAPASKPESRELKLQPSVTLAVGNHVEPTSVTQTVSKDVGGHVPFVVADAQPQQSSAAFV 1380
QAPASKPESRELKLQPSVTLAVGNHVEPTSVTQTVSKDVGGHVPFV ADAQPQQSSAAFV
Sbjct: 1321 QAPASKPESRELKLQPSVTLAVGNHVEPTSVTQTVSKDVGGHVPFVTADAQPQQSSAAFV 1380
Query: 1381 PLPTPNSTSKAAANGKSETSDALITQDDDMDEEAPETNNVEFSLSSLGGFGTTSTPMSNA 1440
PLPTPNST K +ANGKSETSDAL+TQDDDMDEEAPETNNVEFSLSSLGGFGTTSTPMSNA
Sbjct: 1381 PLPTPNSTPKVSANGKSETSDALVTQDDDMDEEAPETNNVEFSLSSLGGFGTTSTPMSNA 1440
Query: 1441 PKPNPFGGSFGNVNATSMNSSFTMASPPSGELFRPASFSFQSPLASQAASQPTNSVAFSG 1500
PKPNPFGGSFGNVNATSMNSSFTMASPPSGELFRPASFSFQSPLASQAASQPTNSVAFSG
Sbjct: 1441 PKPNPFGGSFGNVNATSMNSSFTMASPPSGELFRPASFSFQSPLASQAASQPTNSVAFSG 1500
Query: 1501 SFGSGMATQASAQGGFGQPAQIGVGQQALGTVLGSFGQSRQLGPSLPGTASGSPGGFNGG 1560
SFGSGMATQA AQGGFGQPAQIGVGQQALGTVLGSFGQSRQLGPSLPGTA+GSPGGFNGG
Sbjct: 1501 SFGSGMATQAPAQGGFGQPAQIGVGQQALGTVLGSFGQSRQLGPSLPGTATGSPGGFNGG 1560
Query: 1561 GFTSVKPVGGGFAGVGSGGGGGFGGGGF-----AGAASTGGGFAGASPPTGGFAGATGGG 1620
GFTSVKPVGGGFAGVGSGGGGGFGGGGF AGAASTGGGFAGASPPTGGFAGATGGG
Sbjct: 1561 GFTSVKPVGGGFAGVGSGGGGGFGGGGFSGGGFAGAASTGGGFAGASPPTGGFAGATGGG 1620
Query: 1621 FAGAAGGGFAGAAGGGFAGAAGGGFAGAAGGFGAFGNQQGSGGFSAFGAAPGGSGGTGKP 1680
FAGAAGGGFAGAAGGGFA GAAGGFGAFGNQQGSGGFSAFGAAPGGSGGTGKP
Sbjct: 1621 FAGAAGGGFAGAAGGGFA--------GAAGGFGAFGNQQGSGGFSAFGAAPGGSGGTGKP 1680
BLAST of Csor.00g300930 vs. ExPASy TrEMBL
Match:
A0A6J1G030 (nuclear pore complex protein NUP214 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111449495 PE=4 SV=1)
HSP 1 Score: 3055 bits (7921), Expect = 0.0
Identity = 1643/1687 (97.39%), Postives = 1654/1687 (98.04%), Query Frame = 0
Query: 1 MASVDSRHSISLTPIILEDSYEGEHVETNDYYFEKIGEPVPVKLNDSIFDPGSPPSQPLA 60
MASVDSRHSISLTPI LEDSYEGEHVETNDYYFEKIGEPVPVKLNDSIFDPGSPPSQPLA
Sbjct: 1 MASVDSRHSISLTPIALEDSYEGEHVETNDYYFEKIGEPVPVKLNDSIFDPGSPPSQPLA 60
Query: 61 VSESFGLIFVAHSSGFFVVRTKDVVASAKEMKNGGTGSSIQDLSIVDVSVGKVHVLALSN 120
VSESFGLIFVAH SGFFVVRT DVVASAKEMKNGGTGSSIQDLSIVDVSVGKVHVLALSN
Sbjct: 61 VSESFGLIFVAHLSGFFVVRTTDVVASAKEMKNGGTGSSIQDLSIVDVSVGKVHVLALSN 120
Query: 121 DNSFLAAVVAGDVRLFSVDSLLDKAEKPYFSCSTTDSSCIKDFKWTRKPENSYLVLSKHG 180
DNSFLAAVVAGDV LFSVDSLLDKAEKPYFSCSTTDSSCIKDFKWTRKPENSYLVLSKHG
Sbjct: 121 DNSFLAAVVAGDVHLFSVDSLLDKAEKPYFSCSTTDSSCIKDFKWTRKPENSYLVLSKHG 180
Query: 181 KLYQGSASGPFKHIMHDIDAVECSVKGKFIAVAKKDTLSIFSYKFKERLSMSLLPSSGNG 240
KLYQGSAS PFKHIMHDIDAVECSVKGKFIAVAKKDTLSIFSYKFKERLSMSLLPSSGNG
Sbjct: 181 KLYQGSASVPFKHIMHDIDAVECSVKGKFIAVAKKDTLSIFSYKFKERLSMSLLPSSGNG 240
Query: 241 DTDTDFAVKVDSIKWVRADCIIIGCFQVAATGDEEDYFVQVIRSKDGKITDVSSNKVLLS 300
DTDTDFA+KVDSIKWVRADCIIIGCFQV ATGDEEDYFVQVIRSKDGKITDVSSNKVLLS
Sbjct: 241 DTDTDFAMKVDSIKWVRADCIIIGCFQVTATGDEEDYFVQVIRSKDGKITDVSSNKVLLS 300
Query: 301 FHDIYSGFTPDILPVETGPCLLLSYLDKCKLAIVANRNNTDQHIVLLGWLQEVENEVAVI 360
FHDIYSGFTPDILPVETGPCLLLSYLDKCKLAIVANRNNTDQHIVLLGWLQEVENEVAVI
Sbjct: 301 FHDIYSGFTPDILPVETGPCLLLSYLDKCKLAIVANRNNTDQHIVLLGWLQEVENEVAVI 360
Query: 361 DIERDKSLPRIELQDNGDDNLVMGLCIDRVSLPGKVEVQVGNEEIREVSPYCTLLCLTLE 420
DIERDKSLPRIELQDNGDDNLVMGLCIDRVSLPGKVEVQVGNEEIREVSPYCTLLCLTLE
Sbjct: 361 DIERDKSLPRIELQDNGDDNLVMGLCIDRVSLPGKVEVQVGNEEIREVSPYCTLLCLTLE 420
Query: 421 GKLILFHFSSANESEASDETVSACDEEEEDDTVVPTDDQPQLFSNIDQRPVSKVDESPVI 480
GKL+LFHFSSANESEASDETVSACDEEEEDDTVVPTDDQPQLFSNIDQRPVS+VDESPVI
Sbjct: 421 GKLLLFHFSSANESEASDETVSACDEEEEDDTVVPTDDQPQLFSNIDQRPVSEVDESPVI 480
Query: 481 TRESNAKSQQMDSFAFSQPLKPSTLERPNNEIGNFAKPVKSFTGLGSVAFSGKSVDVPSQ 540
TRESNAKSQQMDSFAFSQPLKPSTLERPNNEIGNF KPVKSFTGLGSVAFSG+SVDVPSQ
Sbjct: 481 TRESNAKSQQMDSFAFSQPLKPSTLERPNNEIGNFTKPVKSFTGLGSVAFSGQSVDVPSQ 540
Query: 541 SLKSSILERPNNEIGNFDMPFDKFTGLGSVAFSGQSVDMPSQSLKPSFLERPNNQIGNFD 600
SLKSSILERPNNEIGNF+ PF KFTGLGSVAFSGQSVDM +QSLKPSFLERPNNQIGNFD
Sbjct: 541 SLKSSILERPNNEIGNFNKPFHKFTGLGSVAFSGQSVDMANQSLKPSFLERPNNQIGNFD 600
Query: 601 KPVQKFTGLGSVAFSEQSANVPSHPFLNVKESTIKQSSGAANAFTGFAGKPFQPKDVPST 660
KPVQKFTGLGSVAFSEQSA+VPSHPFLNVKESTIKQSSGAANAFTGFAGKPFQPKDVPST
Sbjct: 601 KPVQKFTGLGSVAFSEQSADVPSHPFLNVKESTIKQSSGAANAFTGFAGKPFQPKDVPST 660
Query: 661 LTQSGRQVSAGAGKIESLPVIQSSQVSLQDNFSLGKISNKKQDGSERNYGNVPLAKPMTE 720
LTQSGRQVSAGAGKIESLPVIQSSQVSLQDNFSLGKISNKKQDGSERNYGNVPLAKPMTE
Sbjct: 661 LTQSGRQVSAGAGKIESLPVIQSSQVSLQDNFSLGKISNKKQDGSERNYGNVPLAKPMTE 720
Query: 721 MCEGLDMLLESIEEPGGFLDACTTFQKSSVEALELGLATLSDQCQIWRRTMTERAQEVQN 780
MCEGLDMLLESIEEPGGFLDACTTFQKSSVEALELGLATLSDQCQIWRRTMTERAQEVQN
Sbjct: 721 MCEGLDMLLESIEEPGGFLDACTTFQKSSVEALELGLATLSDQCQIWRRTMTERAQEVQN 780
Query: 781 LFDRTVEVLSKKTYIEGIVTQASDSNYWEHWDRQKLSSELELKRQRILQMNQNMTNQLIE 840
LFDRTVEVLSKKTYIEGIVTQASDSNYWEHWDRQKLSSELELKRQRILQMNQNMTNQLIE
Sbjct: 781 LFDRTVEVLSKKTYIEGIVTQASDSNYWEHWDRQKLSSELELKRQRILQMNQNMTNQLIE 840
Query: 841 LERHFNGLELNKFGGNDEIQVNERALQRKFGSSRQSHSLHSLNNIMGSQLAAAQLLSDNL 900
LERHFNGLELNKFGGN+EIQVNER LQRKFGSSRQSHSLHSLNNIMGSQLAAAQLLSDNL
Sbjct: 841 LERHFNGLELNKFGGNEEIQVNERTLQRKFGSSRQSHSLHSLNNIMGSQLAAAQLLSDNL 900
Query: 901 SKQIATLNIESPSSKRQSITKELFETIGITYDASFSSPNVNKIPETSSKKLLLSADSFSS 960
SKQIATLNIESPSSKRQSITKELFETIGITYDASFSSPNVNKIPETSSKKLLLSADSFSS
Sbjct: 901 SKQIATLNIESPSSKRQSITKELFETIGITYDASFSSPNVNKIPETSSKKLLLSADSFSS 960
Query: 961 KDTSRRKQRSGAKISETETGRRRRDSLDRNLASVQPPKTTVKRMILQGTPLSNEKQFCSP 1020
KDTSRRKQRSGAKISETETGRRRRDSLDRNLASVQPPKTTVKRMILQGTPLSNEKQF SP
Sbjct: 961 KDTSRRKQRSGAKISETETGRRRRDSLDRNLASVQPPKTTVKRMILQGTPLSNEKQFRSP 1020
Query: 1021 TLEGPATVARPASRIPSSMLSSSSKNAEQGSENPATPFSWASSPRQKFQPLQKTNGTAPS 1080
TLEGPAT+ARPASRIPSSMLSSSSKNAEQGS+NPATPFSWAS PRQKFQPLQKTNGTAPS
Sbjct: 1021 TLEGPATIARPASRIPSSMLSSSSKNAEQGSKNPATPFSWASPPRQKFQPLQKTNGTAPS 1080
Query: 1081 PLPVFQSSHEMVKKSNSEAYSAASENKFAEVTYPEKSKASDFFSLARSDSVQKSNMNFEQ 1140
LPVFQSSHEMVKKSNSEAYSAASENKFAEVTYPEKSKASDFFSLARSDSVQKSNMNFEQ
Sbjct: 1081 SLPVFQSSHEMVKKSNSEAYSAASENKFAEVTYPEKSKASDFFSLARSDSVQKSNMNFEQ 1140
Query: 1141 KSSIFVTSSKPMSTPKDSIETLNPNSQKTANVKERLTTPSPLFGSANKPEPVSVGTTSSL 1200
KSSIFVTSSKPMSTPKDSIETLNPNSQKTANVKERLTTPSPLFGSANKPEPVSVGTTSSL
Sbjct: 1141 KSSIFVTSSKPMSTPKDSIETLNPNSQKTANVKERLTTPSPLFGSANKPEPVSVGTTSSL 1200
Query: 1201 VPTVDVLRKTEEKKPPTVFSPSVPAPAPVNTPPSAST-FLGSPLSKSFPSPAAVVDLNKP 1260
VPTVD LRKTEEKKPPTVFSPSVPAPAPVNTPPSAST F GSPLSKSFPSPAAVVDLNKP
Sbjct: 1201 VPTVDGLRKTEEKKPPTVFSPSVPAPAPVNTPPSASTLFSGSPLSKSFPSPAAVVDLNKP 1260
Query: 1261 LSTSTQSSFASPVVSVSDSLFQAPKMVSPPSNLSSLNPTLVSSSKEQPMPKSDADTEKQA 1320
LSTSTQSSFASPVVSVSDSLFQAPKMVSPPSNLSSLNPTLVSSSKEQPMPKSDADTEKQA
Sbjct: 1261 LSTSTQSSFASPVVSVSDSLFQAPKMVSPPSNLSSLNPTLVSSSKEQPMPKSDADTEKQA 1320
Query: 1321 PASKPESRELKLQPSVTLAVGNHVEPTSVTQTVSKDVGGHVPFVVADAQPQQSSAAFVPL 1380
PASKPESRELKLQPSVTLAVGNHVEPTSVTQTVSKDVGGHVPFVVADAQPQQS AAFVPL
Sbjct: 1321 PASKPESRELKLQPSVTLAVGNHVEPTSVTQTVSKDVGGHVPFVVADAQPQQSPAAFVPL 1380
Query: 1381 PTPNSTSKAAANGKSETSDALITQDDDMDEEAPETNNVEFSLSSLGGFGTTSTPMSNAPK 1440
PTPNSTSKAAANGKSETSDALITQDDDMDEEAPETNNVEFSLSSLGGFGTTSTPMSNAPK
Sbjct: 1381 PTPNSTSKAAANGKSETSDALITQDDDMDEEAPETNNVEFSLSSLGGFGTTSTPMSNAPK 1440
Query: 1441 PNPFGGSFGNVNATSMNSSFTMASPPSGELFRPASFSFQSPLASQAASQPTNSVAFSGSF 1500
PNPFGGSFGNVNATSMNSSFT ASPPSGELFRPASFSFQSPLASQAASQPTN+VAFSGSF
Sbjct: 1441 PNPFGGSFGNVNATSMNSSFTTASPPSGELFRPASFSFQSPLASQAASQPTNAVAFSGSF 1500
Query: 1501 GSGMATQASAQGGFGQPAQIGVGQQALGTVLGSFGQSRQLGPSLPGTASGSPGGFNGGGF 1560
GSGMATQA AQGGFGQPAQIGVGQQALGTVLGSFGQSRQLGPSLPGTASGSPGGFNGGGF
Sbjct: 1501 GSGMATQAPAQGGFGQPAQIGVGQQALGTVLGSFGQSRQLGPSLPGTASGSPGGFNGGGF 1560
Query: 1561 TSVKPVGGGFAGVGSGGGGGFGGGGF-----AGAASTGGGFAGASPPTGGFAGATGGGFA 1620
TSVKPVGGGFAGVGSGGGGGFGGGGF AGAASTGGGFAGASPPTGGFAGATGGGFA
Sbjct: 1561 TSVKPVGGGFAGVGSGGGGGFGGGGFSGGGFAGAASTGGGFAGASPPTGGFAGATGGGFA 1620
Query: 1621 GAAGGGFAGAAGGGFAGAAGGGFAGAAGGFGAFGNQQGSGGFSAFGAAPGGSGGTGKPPE 1680
GAAGGGFAGAAGGGFA GAAGGFGAFGNQQGSGGFSAFGAAPGGSGGTGKPPE
Sbjct: 1621 GAAGGGFAGAAGGGFA--------GAAGGFGAFGNQQGSGGFSAFGAAPGGSGGTGKPPE 1679
BLAST of Csor.00g300930 vs. ExPASy TrEMBL
Match:
A0A6J1G089 (nuclear pore complex protein NUP214 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111449495 PE=4 SV=1)
HSP 1 Score: 3029 bits (7854), Expect = 0.0
Identity = 1627/1687 (96.44%), Postives = 1638/1687 (97.10%), Query Frame = 0
Query: 1 MASVDSRHSISLTPIILEDSYEGEHVETNDYYFEKIGEPVPVKLNDSIFDPGSPPSQPLA 60
MASVDSRHSISLTPI LEDSYEGEHVETNDYYFEKIGEPVPVKLNDSIFDPGSPPSQPLA
Sbjct: 1 MASVDSRHSISLTPIALEDSYEGEHVETNDYYFEKIGEPVPVKLNDSIFDPGSPPSQPLA 60
Query: 61 VSESFGLIFVAHSSGFFVVRTKDVVASAKEMKNGGTGSSIQDLSIVDVSVGKVHVLALSN 120
VSESFGLIFVAH SGFFVVRT DVVASAKEMKNGGTGSSIQDLSIVDVSVGKVHVLALSN
Sbjct: 61 VSESFGLIFVAHLSGFFVVRTTDVVASAKEMKNGGTGSSIQDLSIVDVSVGKVHVLALSN 120
Query: 121 DNSFLAAVVAGDVRLFSVDSLLDKAEKPYFSCSTTDSSCIKDFKWTRKPENSYLVLSKHG 180
DNSFLAAVVAGDV LFSVDSLLDKAEKPYFSCSTTDSSCIKDFKWTRKPENSYLVLSKHG
Sbjct: 121 DNSFLAAVVAGDVHLFSVDSLLDKAEKPYFSCSTTDSSCIKDFKWTRKPENSYLVLSKHG 180
Query: 181 KLYQGSASGPFKHIMHDIDAVECSVKGKFIAVAKKDTLSIFSYKFKERLSMSLLPSSGNG 240
KLYQGSAS PFKHIMHDIDAVECSVKGKFIAVAKKDTLSIFSYKFKERLSMSLLPSSGNG
Sbjct: 181 KLYQGSASVPFKHIMHDIDAVECSVKGKFIAVAKKDTLSIFSYKFKERLSMSLLPSSGNG 240
Query: 241 DTDTDFAVKVDSIKWVRADCIIIGCFQVAATGDEEDYFVQVIRSKDGKITDVSSNKVLLS 300
DTDTDFA+KVDSIKWVRADCIIIGCFQV ATGDEEDYFVQVIRSKDGKITDVSSNKVLLS
Sbjct: 241 DTDTDFAMKVDSIKWVRADCIIIGCFQVTATGDEEDYFVQVIRSKDGKITDVSSNKVLLS 300
Query: 301 FHDIYSGFTPDILPVETGPCLLLSYLDKCKLAIVANRNNTDQHIVLLGWLQEVENEVAVI 360
FHDIYSGFTPDILPVETGPCLLLSYLDKCKLAIVANRNNTDQHIVLLGWLQEVENEVAVI
Sbjct: 301 FHDIYSGFTPDILPVETGPCLLLSYLDKCKLAIVANRNNTDQHIVLLGWLQEVENEVAVI 360
Query: 361 DIERDKSLPRIELQDNGDDNLVMGLCIDRVSLPGKVEVQVGNEEIREVSPYCTLLCLTLE 420
DIERDKSLPRIELQDNGDDNLVMGLCIDRVSLPGKVEVQVGNEEIREVSPYCTLLCLTLE
Sbjct: 361 DIERDKSLPRIELQDNGDDNLVMGLCIDRVSLPGKVEVQVGNEEIREVSPYCTLLCLTLE 420
Query: 421 GKLILFHFSSANESEASDETVSACDEEEEDDTVVPTDDQPQLFSNIDQRPVSKVDESPVI 480
GKL+LFHFSSANESEASDETVSACDEEEEDDTVVPTDDQPQLFSNIDQRPVS+VDESPVI
Sbjct: 421 GKLLLFHFSSANESEASDETVSACDEEEEDDTVVPTDDQPQLFSNIDQRPVSEVDESPVI 480
Query: 481 TRESNAKSQQMDSFAFSQPLKPSTLERPNNEIGNFAKPVKSFTGLGSVAFSGKSVDVPSQ 540
TRESNAKSQQMDSFAFSQPLKPSTLERPNNEIGNF KPVKSFTGLGSVAFSG+SVDVPSQ
Sbjct: 481 TRESNAKSQQMDSFAFSQPLKPSTLERPNNEIGNFTKPVKSFTGLGSVAFSGQSVDVPSQ 540
Query: 541 SLKSSILERPNNEIGNFDMPFDKFTGLGSVAFSGQSVDMPSQSLKPSFLERPNNQIGNFD 600
SLKSSILERPNNEIGNF+ PF KFTGLGSVAFSGQSVDM +QSLKPSFLERPNNQIGNFD
Sbjct: 541 SLKSSILERPNNEIGNFNKPFHKFTGLGSVAFSGQSVDMANQSLKPSFLERPNNQIGNFD 600
Query: 601 KPVQKFTGLGSVAFSEQSANVPSHPFLNVKESTIKQSSGAANAFTGFAGKPFQPKDVPST 660
KPVQKFTGLGSVAFSEQSA+VPSHPFLNVKESTIKQSSGAANAFTGFAGKPFQPKDVPST
Sbjct: 601 KPVQKFTGLGSVAFSEQSADVPSHPFLNVKESTIKQSSGAANAFTGFAGKPFQPKDVPST 660
Query: 661 LTQSGRQVSAGAGKIESLPVIQSSQVSLQDNFSLGKISNKKQDGSERNYGNVPLAKPMTE 720
LTQSGRQVSAGAGKIESLPVIQSSQVSLQDNFSLGKISNKKQDGSERNYGNVPLAKPMTE
Sbjct: 661 LTQSGRQVSAGAGKIESLPVIQSSQVSLQDNFSLGKISNKKQDGSERNYGNVPLAKPMTE 720
Query: 721 MCEGLDMLLESIEEPGGFLDACTTFQKSSVEALELGLATLSDQCQIWRRTMTERAQEVQN 780
MCEGLDMLLESIEEPGGFLDACTTFQKSSVEALELGLATLSDQCQIWRRTMTERAQEVQN
Sbjct: 721 MCEGLDMLLESIEEPGGFLDACTTFQKSSVEALELGLATLSDQCQIWRRTMTERAQEVQN 780
Query: 781 LFDRTVEVLSKKTYIEGIVTQASDSNYWEHWDRQKLSSELELKRQRILQMNQNMTNQLIE 840
LFDRTVEVLSKKTYIEGIVTQASDSNYWEHWDRQKLSSELELKRQRILQMNQNMTNQLIE
Sbjct: 781 LFDRTVEVLSKKTYIEGIVTQASDSNYWEHWDRQKLSSELELKRQRILQMNQNMTNQLIE 840
Query: 841 LERHFNGLELNKFGGNDEIQVNERALQRKFGSSRQSHSLHSLNNIMGSQLAAAQLLSDNL 900
LERHFNGLELNKFGGN+EIQVNER LQRKFGSSRQSHSLHSLNNIMGSQLAAAQLLSDNL
Sbjct: 841 LERHFNGLELNKFGGNEEIQVNERTLQRKFGSSRQSHSLHSLNNIMGSQLAAAQLLSDNL 900
Query: 901 SKQIATLNIESPSSKRQSITKELFETIGITYDASFSSPNVNKIPETSSKKLLLSADSFSS 960
SKQIATLNIESPSSKRQSITKELFETIGITYDASFSSPNVNKIPETSSKKLLLSADSFSS
Sbjct: 901 SKQIATLNIESPSSKRQSITKELFETIGITYDASFSSPNVNKIPETSSKKLLLSADSFSS 960
Query: 961 KDTSRRKQRSGAKISETETGRRRRDSLDRNLASVQPPKTTVKRMILQGTPLSNEKQFCSP 1020
KDTSRRKQRSGAKISETETGRRRRDSLDRNLASVQPPKTTVKRMILQGTPLSNEKQF SP
Sbjct: 961 KDTSRRKQRSGAKISETETGRRRRDSLDRNLASVQPPKTTVKRMILQGTPLSNEKQFRSP 1020
Query: 1021 TLEGPATVARPASRIPSSMLSSSSKNAEQGSENPATPFSWASSPRQKFQPLQKTNGTAPS 1080
TLEGPAT+ARPASRIPSSMLSSSSKNAEQGS+NPATPFSWAS PRQKFQPLQKTNGTAPS
Sbjct: 1021 TLEGPATIARPASRIPSSMLSSSSKNAEQGSKNPATPFSWASPPRQKFQPLQKTNGTAPS 1080
Query: 1081 PLPVFQSSHEMVKKSNSEAYSAASENKFAEVTYPEKSKASDFFSLARSDSVQKSNMNFEQ 1140
LPVFQSSHEMVKKSNSEAYSAASENKFAEVTYPEKSKASDFFSLARSDSVQKSNMNFEQ
Sbjct: 1081 SLPVFQSSHEMVKKSNSEAYSAASENKFAEVTYPEKSKASDFFSLARSDSVQKSNMNFEQ 1140
Query: 1141 KSSIFVTSSKPMSTPKDSIETLNPNSQKTANVKERLTTPSPLFGSANKPEPVSVGTTSSL 1200
KSSIFVTSSKPMSTPKDSIETLNPNSQKTANVKERLTTPSPLFGSANKPEPVSVGTTSSL
Sbjct: 1141 KSSIFVTSSKPMSTPKDSIETLNPNSQKTANVKERLTTPSPLFGSANKPEPVSVGTTSSL 1200
Query: 1201 VPTVDVLRKTEEKKPPTVFSPSVPAPAPVNTPPSAST-FLGSPLSKSFPSPAAVVDLNKP 1260
VPTVD LRKTEEKKPPTVFSPSVPAPAPVNTPPSAST F GSPLSKSFPSPAAVVDLNKP
Sbjct: 1201 VPTVDGLRKTEEKKPPTVFSPSVPAPAPVNTPPSASTLFSGSPLSKSFPSPAAVVDLNKP 1260
Query: 1261 LSTSTQSSFASPVVSVSDSLFQAPKMVSPPSNLSSLNPTLVSSSKEQPMPKSDADTEKQA 1320
LSTSTQSSFASPVVSVSDSLFQAPKMVSPPSNLSSLNPTLVSSSKEQPMPKSDADTEKQA
Sbjct: 1261 LSTSTQSSFASPVVSVSDSLFQAPKMVSPPSNLSSLNPTLVSSSKEQPMPKSDADTEKQA 1320
Query: 1321 PASKPESRELKLQPSVTLAVGNHVEPTSVTQTVSKDVGGHVPFVVADAQPQQSSAAFVPL 1380
PASKPESRELKLQPSVTLAVGNHVEPTSVTQTVSKDVGGHVPFVVADAQPQQS AAFVPL
Sbjct: 1321 PASKPESRELKLQPSVTLAVGNHVEPTSVTQTVSKDVGGHVPFVVADAQPQQSPAAFVPL 1380
Query: 1381 PTPNSTSKAAANGKSETSDALITQDDDMDEEAPETNNVEFSLSSLGGFGTTSTPMSNAPK 1440
PTPNSTSKAAANGKSETSDALITQDDDMDEEAPETNNVEFSLSSLGGFGTTSTPMSNAPK
Sbjct: 1381 PTPNSTSKAAANGKSETSDALITQDDDMDEEAPETNNVEFSLSSLGGFGTTSTPMSNAPK 1440
Query: 1441 PNPFGGSFGNVNATSMNSSFTMASPPSGELFRPASFSFQSPLASQAASQPTNSVAFSGSF 1500
PNPFGGSFGNVNATSMNSSFT ASPPSGELFRPASFSFQSPLASQAASQPTN+VAFSGSF
Sbjct: 1441 PNPFGGSFGNVNATSMNSSFTTASPPSGELFRPASFSFQSPLASQAASQPTNAVAFSGSF 1500
Query: 1501 GSGMATQASAQGGFGQPAQIGVGQQALGTVLGSFGQSRQLGPSLPGTASGSPGGFNGGGF 1560
GSGMATQA AQGGFGQPAQIGVGQQALGTVLGSFGQSRQLGPSLPGTASGSPGGFNGGGF
Sbjct: 1501 GSGMATQAPAQGGFGQPAQIGVGQQALGTVLGSFGQSRQLGPSLPGTASGSPGGFNGGGF 1560
Query: 1561 TSVKPVGGGFAGVGSGGGGGFGGGGF-----AGAASTGGGFAGASPPTGGFAGATGGGFA 1620
TSVKPVGGGFAGVGSGGGGGFGGGGF AGAASTGGGFAGASPPTGGFAGA
Sbjct: 1561 TSVKPVGGGFAGVGSGGGGGFGGGGFSGGGFAGAASTGGGFAGASPPTGGFAGA------ 1620
Query: 1621 GAAGGGFAGAAGGGFAGAAGGGFAGAAGGFGAFGNQQGSGGFSAFGAAPGGSGGTGKPPE 1680
AGGGFAGAAGGFGAFGNQQGSGGFSAFGAAPGGSGGTGKPPE
Sbjct: 1621 ------------------AGGGFAGAAGGFGAFGNQQGSGGFSAFGAAPGGSGGTGKPPE 1663
BLAST of Csor.00g300930 vs. ExPASy TrEMBL
Match:
A0A6J1HNV2 (nuclear pore complex protein NUP214 isoform X3 OS=Cucurbita maxima OX=3661 GN=LOC111466378 PE=4 SV=1)
HSP 1 Score: 2968 bits (7694), Expect = 0.0
Identity = 1593/1687 (94.43%), Postives = 1614/1687 (95.67%), Query Frame = 0
Query: 1 MASVDSRHSISLTPIILEDSYEGEHVETNDYYFEKIGEPVPVKLNDSIFDPGSPPSQPLA 60
MASVDSRHS S TPI LEDSYEGEHVETNDYYFEKIGEPVPVKLNDSIFDPGSPPSQPLA
Sbjct: 1 MASVDSRHSTSSTPIPLEDSYEGEHVETNDYYFEKIGEPVPVKLNDSIFDPGSPPSQPLA 60
Query: 61 VSESFGLIFVAHSSGFFVVRTKDVVASAKEMKNGGTGSSIQDLSIVDVSVGKVHVLALSN 120
VSESFGLIFVAH SGFF VRTKDVVASAKEMKNGGTGSSIQDLSIVDVSVGKVHVLALSN
Sbjct: 61 VSESFGLIFVAHLSGFFAVRTKDVVASAKEMKNGGTGSSIQDLSIVDVSVGKVHVLALSN 120
Query: 121 DNSFLAAVVAGDVRLFSVDSLLDKAEKPYFSCSTTDSSCIKDFKWTRKPENSYLVLSKHG 180
DNSFLAAVVAGDV LF VDSLLDK E+P FSCSTTDSSCIKDFKWTRKPENSYLVLSKHG
Sbjct: 121 DNSFLAAVVAGDVHLFLVDSLLDKGEEPSFSCSTTDSSCIKDFKWTRKPENSYLVLSKHG 180
Query: 181 KLYQGSASGPFKHIMHDIDAVECSVKGKFIAVAKKDTLSIFSYKFKERLSMSLLPSSGNG 240
KLYQGSASGPFKHIMHDIDAVECSVKGKFIAVAKKDTL++FSYKFKERLSMSLLPS GNG
Sbjct: 181 KLYQGSASGPFKHIMHDIDAVECSVKGKFIAVAKKDTLTVFSYKFKERLSMSLLPSLGNG 240
Query: 241 DTDTDFAVKVDSIKWVRADCIIIGCFQVAATGDEEDYFVQVIRSKDGKITDVSSNKVLLS 300
DTDTDFAVKVDSIKWVRADCIIIGCFQV ATGDEEDYFVQVIRSKDGKITDVSSNKVLLS
Sbjct: 241 DTDTDFAVKVDSIKWVRADCIIIGCFQVTATGDEEDYFVQVIRSKDGKITDVSSNKVLLS 300
Query: 301 FHDIYSGFTPDILPVETGPCLLLSYLDKCKLAIVANRNNTDQHIVLLGWLQEVENEVAVI 360
FHDIYSGFTPDILPVETGPCLLLSYLDKCKLAIVANRNNTDQHIVLLGWLQEVENEVAVI
Sbjct: 301 FHDIYSGFTPDILPVETGPCLLLSYLDKCKLAIVANRNNTDQHIVLLGWLQEVENEVAVI 360
Query: 361 DIERDKSLPRIELQDNGDDNLVMGLCIDRVSLPGKVEVQVGNEEIREVSPYCTLLCLTLE 420
DIERDKSLPRIELQDNGDDNLVMGLCIDRVSLPGKVEVQVGNEEIREVSPYCTLLCLTLE
Sbjct: 361 DIERDKSLPRIELQDNGDDNLVMGLCIDRVSLPGKVEVQVGNEEIREVSPYCTLLCLTLE 420
Query: 421 GKLILFHFSSANESEASDETVSACDEEEEDDTVVPTDDQPQLFSNIDQRPVSKVDESPVI 480
GKLILFHFSSANESEASDETVSACDEEEED+TVVPTDDQPQLFSNIDQRPVSKVD SPVI
Sbjct: 421 GKLILFHFSSANESEASDETVSACDEEEEDETVVPTDDQPQLFSNIDQRPVSKVDGSPVI 480
Query: 481 TRESNAKSQQMDSFAFSQPLKPSTLERPNNEIGNFAKPVKSFTGLGSVAFSGKSVDVPSQ 540
TRESNAKSQQMDS AFSQPLKPSTLERPNNEIGNFAKPVK+FTGLGSVAFSG+SVDVPSQ
Sbjct: 481 TRESNAKSQQMDSLAFSQPLKPSTLERPNNEIGNFAKPVKNFTGLGSVAFSGQSVDVPSQ 540
Query: 541 SLKSSILERPNNEIGNFDMPFDKFTGLGSVAFSGQSVDMPSQSLKPSFLERPNNQIGNFD 600
LKSSILERPNNEIGNF+ PF KFTGLGSVAFSGQSVDMP++SLKPSFLERPNNQIGNFD
Sbjct: 541 PLKSSILERPNNEIGNFNKPFHKFTGLGSVAFSGQSVDMPNESLKPSFLERPNNQIGNFD 600
Query: 601 KPVQKFTGLGSVAFSEQSANVPSHPFLNVKESTIKQSSGAANAFTGFAGKPFQPKDVPST 660
KPVQKFTGLGSVAFSEQS +VPSHPFLNVKESTIK SSGAANAFTGFAGKPFQPKDVPST
Sbjct: 601 KPVQKFTGLGSVAFSEQSVDVPSHPFLNVKESTIKHSSGAANAFTGFAGKPFQPKDVPST 660
Query: 661 LTQSGRQVSAGAGKIESLPVIQSSQVSLQDNFSLGKISNKKQDGSERNYGNVPLAKPMTE 720
LTQSGRQVSAGAGKIESLPVIQSSQVSLQDNFSLGKISNKKQDGSERNYGNVPLAKPM E
Sbjct: 661 LTQSGRQVSAGAGKIESLPVIQSSQVSLQDNFSLGKISNKKQDGSERNYGNVPLAKPMNE 720
Query: 721 MCEGLDMLLESIEEPGGFLDACTTFQKSSVEALELGLATLSDQCQIWRRTMTERAQEVQN 780
MCEGLDMLLESIEEPGGFLDACTTFQKSSVEAL LGLATLSDQCQIWRRTMTERAQEVQN
Sbjct: 721 MCEGLDMLLESIEEPGGFLDACTTFQKSSVEALGLGLATLSDQCQIWRRTMTERAQEVQN 780
Query: 781 LFDRTVEVLSKKTYIEGIVTQASDSNYWEHWDRQKLSSELELKRQRILQMNQNMTNQLIE 840
LFDRTVEVLSKKTYIEGIVTQASDSNYW+HWDRQKLSSELELKRQRILQMNQNMTNQLIE
Sbjct: 781 LFDRTVEVLSKKTYIEGIVTQASDSNYWDHWDRQKLSSELELKRQRILQMNQNMTNQLIE 840
Query: 841 LERHFNGLELNKFGGNDEIQVNERALQRKFGSSRQSHSLHSLNNIMGSQLAAAQLLSDNL 900
LERHFNGLELN FGGN+EIQVNER LQRKFGSSRQSHSLHSLNNIMGSQLAAAQLLSDNL
Sbjct: 841 LERHFNGLELNTFGGNEEIQVNERTLQRKFGSSRQSHSLHSLNNIMGSQLAAAQLLSDNL 900
Query: 901 SKQIATLNIESPSSKRQSITKELFETIGITYDASFSSPNVNKIPETSSKKLLLSADSFSS 960
SKQIATLNI+SPSSKRQSITKELFETIGITYDASFSSPNVNKIPETSSKKLLLSADSFSS
Sbjct: 901 SKQIATLNIKSPSSKRQSITKELFETIGITYDASFSSPNVNKIPETSSKKLLLSADSFSS 960
Query: 961 KDTSRRKQRSGAKISETETGRRRRDSLDRNLASVQPPKTTVKRMILQGTPLSNEKQFCSP 1020
KDTSRRKQRSGAKISETETGRRRRDSLDRNLAS+QPPKTTVKRMILQGTPLSNEKQF SP
Sbjct: 961 KDTSRRKQRSGAKISETETGRRRRDSLDRNLASIQPPKTTVKRMILQGTPLSNEKQFRSP 1020
Query: 1021 TLEGPATVARPASRIPSSMLSSSSKNAEQGSENPATPFSWASSPRQKFQPLQKTNGTAPS 1080
TLEGPATVARPA RIPSSMLSSSSKNAEQGSENPATPFSWAS PRQKFQPLQKTNGTAPS
Sbjct: 1021 TLEGPATVARPAGRIPSSMLSSSSKNAEQGSENPATPFSWASPPRQKFQPLQKTNGTAPS 1080
Query: 1081 PLPVFQSSHEMVKKSNSEAYSAASENKFAEVTYPEKSKASDFFSLARSDSVQKSNMNFEQ 1140
PLPV+QSSHEMVKKSNSEAYSAASENKFAEVTYPEKSKASDFFSLARSDSVQKSNMNFEQ
Sbjct: 1081 PLPVYQSSHEMVKKSNSEAYSAASENKFAEVTYPEKSKASDFFSLARSDSVQKSNMNFEQ 1140
Query: 1141 KSSIFVTSSKPMSTPKDSIETLNPNSQKTANVKERLTTPSPLFGSANKPEPVSVGTTSSL 1200
KSS FVTSSKPMSTPKDSIETLNPNSQKTANVKERLTTPSPLFG+ANKPEP SVGTTSSL
Sbjct: 1141 KSSFFVTSSKPMSTPKDSIETLNPNSQKTANVKERLTTPSPLFGAANKPEPASVGTTSSL 1200
Query: 1201 VPTVDVLRKTEEKKPPTVFSPSVPAPAPVNTPPSAST-FLGSPLSKSFPSPAAVVDLNKP 1260
VPTVD LRKTEEKKPPTVFSPSVPA PVNTP SAST F GSPLSKSFPSPAAVVDLNKP
Sbjct: 1201 VPTVDELRKTEEKKPPTVFSPSVPASVPVNTPSSASTLFSGSPLSKSFPSPAAVVDLNKP 1260
Query: 1261 LSTSTQSSFASPVVSVSDSLFQAPKMVSPPSNLSSLNPTLVSSSKEQPMPKSDADTEKQA 1320
LSTSTQSSFASPVVSVSDSLFQAPKMVSPPS LSSLNP+LVSSSKEQP+PKSDADTEKQA
Sbjct: 1261 LSTSTQSSFASPVVSVSDSLFQAPKMVSPPSTLSSLNPSLVSSSKEQPIPKSDADTEKQA 1320
Query: 1321 PASKPESRELKLQPSVTLAVGNHVEPTSVTQTVSKDVGGHVPFVVADAQPQQSSAAFVPL 1380
ASKPE RELKLQPSVTLAVGNHVEPTSVTQTVSKDVGGHVP V+ADAQPQQSSAAFVPL
Sbjct: 1321 QASKPEFRELKLQPSVTLAVGNHVEPTSVTQTVSKDVGGHVPSVIADAQPQQSSAAFVPL 1380
Query: 1381 PTPNSTSKAAANGKSETSDALITQDDDMDEEAPETNNVEFSLSSLGGFGTTSTPMSNAPK 1440
P+PNST K +ANGKSETSDALITQDDDMDEEAPETNNVEFSLSSLGGFGTTSTPMSNAPK
Sbjct: 1381 PSPNSTPKVSANGKSETSDALITQDDDMDEEAPETNNVEFSLSSLGGFGTTSTPMSNAPK 1440
Query: 1441 PNPFGGSFGNVNATSMNSSFTMASPPSGELFRPASFSFQSPLASQAASQPTNSVAFSGSF 1500
PNPFGGSFGN NATSMNSSFT ASPPSGELFRPASFSFQSPLASQAASQPTNSVAFS SF
Sbjct: 1441 PNPFGGSFGNANATSMNSSFTTASPPSGELFRPASFSFQSPLASQAASQPTNSVAFSSSF 1500
Query: 1501 GSGMATQASAQGGFGQPAQIGVGQQALGTVLGSFGQSRQLGPSLPGTASGSPGGFNGGGF 1560
GSGMATQA QGGFGQPAQIGVGQQALGTVLGSFGQSRQLGPSLPGTASGSPGGFNGGGF
Sbjct: 1501 GSGMATQAPTQGGFGQPAQIGVGQQALGTVLGSFGQSRQLGPSLPGTASGSPGGFNGGGF 1560
Query: 1561 TSVKPVGGGFAGVGSGGGGGFGGGGF-----AGAASTGGGFAGASPPTGGFAGATGGGFA 1620
TSVKPVGGGFAGVGSGGGGGFGGGGF AGAASTGGGFAGASPPTGGFAGA
Sbjct: 1561 TSVKPVGGGFAGVGSGGGGGFGGGGFGGGGFAGAASTGGGFAGASPPTGGFAGA------ 1620
Query: 1621 GAAGGGFAGAAGGGFAGAAGGGFAGAAGGFGAFGNQQGSGGFSAFGAAPGGSGGTGKPPE 1680
AGGGFAGAAGGFGAFGNQQGSGGFSAFGAAPGGSGGTGKPPE
Sbjct: 1621 ------------------AGGGFAGAAGGFGAFGNQQGSGGFSAFGAAPGGSGGTGKPPE 1663
BLAST of Csor.00g300930 vs. ExPASy TrEMBL
Match:
A0A6J1HUR6 (nuclear pore complex protein NUP214 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111466378 PE=4 SV=1)
HSP 1 Score: 2967 bits (7693), Expect = 0.0
Identity = 1597/1684 (94.83%), Postives = 1620/1684 (96.20%), Query Frame = 0
Query: 1 MASVDSRHSISLTPIILEDSYEGEHVETNDYYFEKIGEPVPVKLNDSIFDPGSPPSQPLA 60
MASVDSRHS S TPI LEDSYEGEHVETNDYYFEKIGEPVPVKLNDSIFDPGSPPSQPLA
Sbjct: 1 MASVDSRHSTSSTPIPLEDSYEGEHVETNDYYFEKIGEPVPVKLNDSIFDPGSPPSQPLA 60
Query: 61 VSESFGLIFVAHSSGFFVVRTKDVVASAKEMKNGGTGSSIQDLSIVDVSVGKVHVLALSN 120
VSESFGLIFVAH SGFF VRTKDVVASAKEMKNGGTGSSIQDLSIVDVSVGKVHVLALSN
Sbjct: 61 VSESFGLIFVAHLSGFFAVRTKDVVASAKEMKNGGTGSSIQDLSIVDVSVGKVHVLALSN 120
Query: 121 DNSFLAAVVAGDVRLFSVDSLLDKAEKPYFSCSTTDSSCIKDFKWTRKPENSYLVLSKHG 180
DNSFLAAVVAGDV LF VDSLLDK E+P FSCSTTDSSCIKDFKWTRKPENSYLVLSKHG
Sbjct: 121 DNSFLAAVVAGDVHLFLVDSLLDKGEEPSFSCSTTDSSCIKDFKWTRKPENSYLVLSKHG 180
Query: 181 KLYQGSASGPFKHIMHDIDAVECSVKGKFIAVAKKDTLSIFSYKFKERLSMSLLPSSGNG 240
KLYQGSASGPFKHIMHDIDAVECSVKGKFIAVAKKDTL++FSYKFKERLSMSLLPS GNG
Sbjct: 181 KLYQGSASGPFKHIMHDIDAVECSVKGKFIAVAKKDTLTVFSYKFKERLSMSLLPSLGNG 240
Query: 241 DTDTDFAVKVDSIKWVRADCIIIGCFQVAATGDEEDYFVQVIRSKDGKITDVSSNKVLLS 300
DTDTDFAVKVDSIKWVRADCIIIGCFQV ATGDEEDYFVQVIRSKDGKITDVSSNKVLLS
Sbjct: 241 DTDTDFAVKVDSIKWVRADCIIIGCFQVTATGDEEDYFVQVIRSKDGKITDVSSNKVLLS 300
Query: 301 FHDIYSGFTPDILPVETGPCLLLSYLDKCKLAIVANRNNTDQHIVLLGWLQEVENEVAVI 360
FHDIYSGFTPDILPVETGPCLLLSYLDKCKLAIVANRNNTDQHIVLLGWLQEVENEVAVI
Sbjct: 301 FHDIYSGFTPDILPVETGPCLLLSYLDKCKLAIVANRNNTDQHIVLLGWLQEVENEVAVI 360
Query: 361 DIERDKSLPRIELQDNGDDNLVMGLCIDRVSLPGKVEVQVGNEEIREVSPYCTLLCLTLE 420
DIERDKSLPRIELQDNGDDNLVMGLCIDRVSLPGKVEVQVGNEEIREVSPYCTLLCLTLE
Sbjct: 361 DIERDKSLPRIELQDNGDDNLVMGLCIDRVSLPGKVEVQVGNEEIREVSPYCTLLCLTLE 420
Query: 421 GKLILFHFSSANESEASDETVSACDEEEEDDTVVPTDDQPQLFSNIDQRPVSKVDESPVI 480
GKLILFHFSSANESEASDETVSACDEEEED+TVVPTDDQPQLFSNIDQRPVSKVD SPVI
Sbjct: 421 GKLILFHFSSANESEASDETVSACDEEEEDETVVPTDDQPQLFSNIDQRPVSKVDGSPVI 480
Query: 481 TRESNAKSQQMDSFAFSQPLKPSTLERPNNEIGNFAKPVKSFTGLGSVAFSGKSVDVPSQ 540
TRESNAKSQQMDS AFSQPLKPSTLERPNNEIGNFAKPVK+FTGLGSVAFSG+SVDVPSQ
Sbjct: 481 TRESNAKSQQMDSLAFSQPLKPSTLERPNNEIGNFAKPVKNFTGLGSVAFSGQSVDVPSQ 540
Query: 541 SLKSSILERPNNEIGNFDMPFDKFTGLGSVAFSGQSVDMPSQSLKPSFLERPNNQIGNFD 600
LKSSILERPNNEIGNF+ PF KFTGLGSVAFSGQSVDMP++SLKPSFLERPNNQIGNFD
Sbjct: 541 PLKSSILERPNNEIGNFNKPFHKFTGLGSVAFSGQSVDMPNESLKPSFLERPNNQIGNFD 600
Query: 601 KPVQKFTGLGSVAFSEQSANVPSHPFLNVKESTIKQSSGAANAFTGFAGKPFQPKDVPST 660
KPVQKFTGLGSVAFSEQS +VPSHPFLNVKESTIK SSGAANAFTGFAGKPFQPKDVPST
Sbjct: 601 KPVQKFTGLGSVAFSEQSVDVPSHPFLNVKESTIKHSSGAANAFTGFAGKPFQPKDVPST 660
Query: 661 LTQSGRQVSAGAGKIESLPVIQSSQVSLQDNFSLGKISNKKQDGSERNYGNVPLAKPMTE 720
LTQSGRQVSAGAGKIESLPVIQSSQVSLQDNFSLGKISNKKQDGSERNYGNVPLAKPM E
Sbjct: 661 LTQSGRQVSAGAGKIESLPVIQSSQVSLQDNFSLGKISNKKQDGSERNYGNVPLAKPMNE 720
Query: 721 MCEGLDMLLESIEEPGGFLDACTTFQKSSVEALELGLATLSDQCQIWRRTMTERAQEVQN 780
MCEGLDMLLESIEEPGGFLDACTTFQKSSVEAL LGLATLSDQCQIWRRTMTERAQEVQN
Sbjct: 721 MCEGLDMLLESIEEPGGFLDACTTFQKSSVEALGLGLATLSDQCQIWRRTMTERAQEVQN 780
Query: 781 LFDRTVEVLSKKTYIEGIVTQASDSNYWEHWDRQKLSSELELKRQRILQMNQNMTNQLIE 840
LFDRTVEVLSKKTYIEGIVTQASDSNYW+HWDRQKLSSELELKRQRILQMNQNMTNQLIE
Sbjct: 781 LFDRTVEVLSKKTYIEGIVTQASDSNYWDHWDRQKLSSELELKRQRILQMNQNMTNQLIE 840
Query: 841 LERHFNGLELNKFGGNDEIQVNERALQRKFGSSRQSHSLHSLNNIMGSQLAAAQLLSDNL 900
LERHFNGLELN FGGN+EIQVNER LQRKFGSSRQSHSLHSLNNIMGSQLAAAQLLSDNL
Sbjct: 841 LERHFNGLELNTFGGNEEIQVNERTLQRKFGSSRQSHSLHSLNNIMGSQLAAAQLLSDNL 900
Query: 901 SKQIATLNIESPSSKRQSITKELFETIGITYDASFSSPNVNKIPETSSKKLLLSADSFSS 960
SKQIATLNI+SPSSKRQSITKELFETIGITYDASFSSPNVNKIPETSSKKLLLSADSFSS
Sbjct: 901 SKQIATLNIKSPSSKRQSITKELFETIGITYDASFSSPNVNKIPETSSKKLLLSADSFSS 960
Query: 961 KDTSRRKQRSGAKISETETGRRRRDSLDRNLASVQPPKTTVKRMILQGTPLSNEKQFCSP 1020
KDTSRRKQRSGAKISETETGRRRRDSLDRNLAS+QPPKTTVKRMILQGTPLSNEKQF SP
Sbjct: 961 KDTSRRKQRSGAKISETETGRRRRDSLDRNLASIQPPKTTVKRMILQGTPLSNEKQFRSP 1020
Query: 1021 TLEGPATVARPASRIPSSMLSSSSKNAEQGSENPATPFSWASSPRQKFQPLQKTNGTAPS 1080
TLEGPATVARPA RIPSSMLSSSSKNAEQGSENPATPFSWAS PRQKFQPLQKTNGTAPS
Sbjct: 1021 TLEGPATVARPAGRIPSSMLSSSSKNAEQGSENPATPFSWASPPRQKFQPLQKTNGTAPS 1080
Query: 1081 PLPVFQSSHEMVKKSNSEAYSAASENKFAEVTYPEKSKASDFFSLARSDSVQKSNMNFEQ 1140
PLPV+QSSHEMVKKSNSEAYSAASENKFAEVTYPEKSKASDFFSLARSDSVQKSNMNFEQ
Sbjct: 1081 PLPVYQSSHEMVKKSNSEAYSAASENKFAEVTYPEKSKASDFFSLARSDSVQKSNMNFEQ 1140
Query: 1141 KSSIFVTSSKPMSTPKDSIETLNPNSQKTANVKERLTTPSPLFGSANKPEPVSVGTTSSL 1200
KSS FVTSSKPMSTPKDSIETLNPNSQKTANVKERLTTPSPLFG+ANKPEP SVGTTSSL
Sbjct: 1141 KSSFFVTSSKPMSTPKDSIETLNPNSQKTANVKERLTTPSPLFGAANKPEPASVGTTSSL 1200
Query: 1201 VPTVDVLRKTEEKKPPTVFSPSVPAPAPVNTPPSAST-FLGSPLSKSFPSPAAVVDLNKP 1260
VPTVD LRKTEEKKPPTVFSPSVPA PVNTP SAST F GSPLSKSFPSPAAVVDLNKP
Sbjct: 1201 VPTVDELRKTEEKKPPTVFSPSVPASVPVNTPSSASTLFSGSPLSKSFPSPAAVVDLNKP 1260
Query: 1261 LSTSTQSSFASPVVSVSDSLFQAPKMVSPPSNLSSLNPTLVSSSKEQPMPKSDADTEKQA 1320
LSTSTQSSFASPVVSVSDSLFQAPKMVSPPS LSSLNP+LVSSSKEQP+PKSDADTEKQA
Sbjct: 1261 LSTSTQSSFASPVVSVSDSLFQAPKMVSPPSTLSSLNPSLVSSSKEQPIPKSDADTEKQA 1320
Query: 1321 PASKPESRELKLQPSVTLAVGNHVEPTSVTQTVSKDVGGHVPFVVADAQPQQSSAAFVPL 1380
ASKPE RELKLQPSVTLAVGNHVEPTSVTQTVSKDVGGHVP V+ADAQPQQSSAAFVPL
Sbjct: 1321 QASKPEFRELKLQPSVTLAVGNHVEPTSVTQTVSKDVGGHVPSVIADAQPQQSSAAFVPL 1380
Query: 1381 PTPNSTSKAAANGKSETSDALITQDDDMDEEAPETNNVEFSLSSLGGFGTTSTPMSNAPK 1440
P+PNST K +ANGKSETSDALITQDDDMDEEAPETNNVEFSLSSLGGFGTTSTPMSNAPK
Sbjct: 1381 PSPNSTPKVSANGKSETSDALITQDDDMDEEAPETNNVEFSLSSLGGFGTTSTPMSNAPK 1440
Query: 1441 PNPFGGSFGNVNATSMNSSFTMASPPSGELFRPASFSFQSPLASQAASQPTNSVAFSGSF 1500
PNPFGGSFGN NATSMNSSFT ASPPSGELFRPASFSFQSPLASQAASQPTNSVAFS SF
Sbjct: 1441 PNPFGGSFGNANATSMNSSFTTASPPSGELFRPASFSFQSPLASQAASQPTNSVAFSSSF 1500
Query: 1501 GSGMATQASAQGGFGQPAQIGVGQQALGTVLGSFGQSRQLGPSLPGTASGSPGGFNGGGF 1560
GSGMATQA QGGFGQPAQIGVGQQALGTVLGSFGQSRQLGPSLPGTASGSPGGFNGGGF
Sbjct: 1501 GSGMATQAPTQGGFGQPAQIGVGQQALGTVLGSFGQSRQLGPSLPGTASGSPGGFNGGGF 1560
Query: 1561 TSVKPVGGGFAGVGSGGGGGFGGGGFAGAASTGGGFAGASPPTGGFAGA--TGGGFAGAA 1620
TSVKPVGGGFAGVGSGGGGGFGGGGF G GGGFA A+ GGFAGA TGGGFAGA+
Sbjct: 1561 TSVKPVGGGFAGVGSGGGGGFGGGGFGGGGFGGGGFAAAASTGGGFAGAASTGGGFAGAS 1620
Query: 1621 GGGFAGAAGGGFAGAAGGGFAGAAGGFGAFGNQQGSGGFSAFGAAPGGSGGTGKPPELFT 1680
GGFAGAAGGGFAGAAGGFGAFGNQQGSGGFSAFGAAPGGSGGTGKPPELFT
Sbjct: 1621 ------PPTGGFAGAAGGGFAGAAGGFGAFGNQQGSGGFSAFGAAPGGSGGTGKPPELFT 1678
BLAST of Csor.00g300930 vs. ExPASy TrEMBL
Match:
A0A6J1HQ79 (nuclear pore complex protein NUP214 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111466378 PE=4 SV=1)
HSP 1 Score: 2966 bits (7689), Expect = 0.0
Identity = 1593/1692 (94.15%), Postives = 1614/1692 (95.39%), Query Frame = 0
Query: 1 MASVDSRHSISLTPIILEDSYEGEHVETNDYYFEKIGEPVPVKLNDSIFDPGSPPSQPLA 60
MASVDSRHS S TPI LEDSYEGEHVETNDYYFEKIGEPVPVKLNDSIFDPGSPPSQPLA
Sbjct: 1 MASVDSRHSTSSTPIPLEDSYEGEHVETNDYYFEKIGEPVPVKLNDSIFDPGSPPSQPLA 60
Query: 61 VSESFGLIFVAHSSGFFVVRTKDVVASAKEMKNGGTGSSIQDLSIVDVSVGKVHVLALSN 120
VSESFGLIFVAH SGFF VRTKDVVASAKEMKNGGTGSSIQDLSIVDVSVGKVHVLALSN
Sbjct: 61 VSESFGLIFVAHLSGFFAVRTKDVVASAKEMKNGGTGSSIQDLSIVDVSVGKVHVLALSN 120
Query: 121 DNSFLAAVVAGDVRLFSVDSLLDKAEKPYFSCSTTDSSCIKDFKWTRKPENSYLVLSKHG 180
DNSFLAAVVAGDV LF VDSLLDK E+P FSCSTTDSSCIKDFKWTRKPENSYLVLSKHG
Sbjct: 121 DNSFLAAVVAGDVHLFLVDSLLDKGEEPSFSCSTTDSSCIKDFKWTRKPENSYLVLSKHG 180
Query: 181 KLYQGSASGPFKHIMHDIDAVECSVKGKFIAVAKKDTLSIFSYKFKERLSMSLLPSSGNG 240
KLYQGSASGPFKHIMHDIDAVECSVKGKFIAVAKKDTL++FSYKFKERLSMSLLPS GNG
Sbjct: 181 KLYQGSASGPFKHIMHDIDAVECSVKGKFIAVAKKDTLTVFSYKFKERLSMSLLPSLGNG 240
Query: 241 DTDTDFAVKVDSIKWVRADCIIIGCFQVAATGDEEDYFVQVIRSKDGKITDVSSNKVLLS 300
DTDTDFAVKVDSIKWVRADCIIIGCFQV ATGDEEDYFVQVIRSKDGKITDVSSNKVLLS
Sbjct: 241 DTDTDFAVKVDSIKWVRADCIIIGCFQVTATGDEEDYFVQVIRSKDGKITDVSSNKVLLS 300
Query: 301 FHDIYSGFTPDILPVETGPCLLLSYLDKCKLAIVANRNNTDQHIVLLGWLQEVENEVAVI 360
FHDIYSGFTPDILPVETGPCLLLSYLDKCKLAIVANRNNTDQHIVLLGWLQEVENEVAVI
Sbjct: 301 FHDIYSGFTPDILPVETGPCLLLSYLDKCKLAIVANRNNTDQHIVLLGWLQEVENEVAVI 360
Query: 361 DIERDKSLPRIELQDNGDDNLVMGLCIDRVSLPGKVEVQVGNEEIREVSPYCTLLCLTLE 420
DIERDKSLPRIELQDNGDDNLVMGLCIDRVSLPGKVEVQVGNEEIREVSPYCTLLCLTLE
Sbjct: 361 DIERDKSLPRIELQDNGDDNLVMGLCIDRVSLPGKVEVQVGNEEIREVSPYCTLLCLTLE 420
Query: 421 GKLILFHFSSANESEASDETVSACDEEEEDDTVVPTDDQPQLFSNIDQRPVSKVDESPVI 480
GKLILFHFSSANESEASDETVSACDEEEED+TVVPTDDQPQLFSNIDQRPVSKVD SPVI
Sbjct: 421 GKLILFHFSSANESEASDETVSACDEEEEDETVVPTDDQPQLFSNIDQRPVSKVDGSPVI 480
Query: 481 TRESNAKSQQMDSFAFSQPLKPSTLERPNNEIGNFAKPVKSFTGLGSVAFSGKSVDVPSQ 540
TRESNAKSQQMDS AFSQPLKPSTLERPNNEIGNFAKPVK+FTGLGSVAFSG+SVDVPSQ
Sbjct: 481 TRESNAKSQQMDSLAFSQPLKPSTLERPNNEIGNFAKPVKNFTGLGSVAFSGQSVDVPSQ 540
Query: 541 SLKSSILERPNNEIGNFDMPFDKFTGLGSVAFSGQSVDMPSQSLKPSFLERPNNQIGNFD 600
LKSSILERPNNEIGNF+ PF KFTGLGSVAFSGQSVDMP++SLKPSFLERPNNQIGNFD
Sbjct: 541 PLKSSILERPNNEIGNFNKPFHKFTGLGSVAFSGQSVDMPNESLKPSFLERPNNQIGNFD 600
Query: 601 KPVQKFTGLGSVAFSEQSANVPSHPFLNVKESTIKQSSGAANAFTGFAGKPFQPKDVPST 660
KPVQKFTGLGSVAFSEQS +VPSHPFLNVKESTIK SSGAANAFTGFAGKPFQPKDVPST
Sbjct: 601 KPVQKFTGLGSVAFSEQSVDVPSHPFLNVKESTIKHSSGAANAFTGFAGKPFQPKDVPST 660
Query: 661 LTQSGRQVSAGAGKIESLPVIQSSQVSLQDNFSLGKISNKKQDGSERNYGNVPLAKPMTE 720
LTQSGRQVSAGAGKIESLPVIQSSQVSLQDNFSLGKISNKKQDGSERNYGNVPLAKPM E
Sbjct: 661 LTQSGRQVSAGAGKIESLPVIQSSQVSLQDNFSLGKISNKKQDGSERNYGNVPLAKPMNE 720
Query: 721 MCEGLDMLLESIEEPGGFLDACTTFQKSSVEALELGLATLSDQCQIWRRTMTERAQEVQN 780
MCEGLDMLLESIEEPGGFLDACTTFQKSSVEAL LGLATLSDQCQIWRRTMTERAQEVQN
Sbjct: 721 MCEGLDMLLESIEEPGGFLDACTTFQKSSVEALGLGLATLSDQCQIWRRTMTERAQEVQN 780
Query: 781 LFDRTVEVLSKKTYIEGIVTQASDSNYWEHWDRQKLSSELELKRQRILQMNQNMTNQLIE 840
LFDRTVEVLSKKTYIEGIVTQASDSNYW+HWDRQKLSSELELKRQRILQMNQNMTNQLIE
Sbjct: 781 LFDRTVEVLSKKTYIEGIVTQASDSNYWDHWDRQKLSSELELKRQRILQMNQNMTNQLIE 840
Query: 841 LERHFNGLELNKFGGNDEIQVNERALQRKFGSSRQSHSLHSLNNIMGSQLAAAQLLSDNL 900
LERHFNGLELN FGGN+EIQVNER LQRKFGSSRQSHSLHSLNNIMGSQLAAAQLLSDNL
Sbjct: 841 LERHFNGLELNTFGGNEEIQVNERTLQRKFGSSRQSHSLHSLNNIMGSQLAAAQLLSDNL 900
Query: 901 SKQIATLNIESPSSKRQSITKELFETIGITYDASFSSPNVNKIPETSSKKLLLSADSFSS 960
SKQIATLNI+SPSSKRQSITKELFETIGITYDASFSSPNVNKIPETSSKKLLLSADSFSS
Sbjct: 901 SKQIATLNIKSPSSKRQSITKELFETIGITYDASFSSPNVNKIPETSSKKLLLSADSFSS 960
Query: 961 KDTSRRKQRSGAKISETETGRRRRDSLDRNLASVQPPKTTVKRMILQGTPLSNEKQFCSP 1020
KDTSRRKQRSGAKISETETGRRRRDSLDRNLAS+QPPKTTVKRMILQGTPLSNEKQF SP
Sbjct: 961 KDTSRRKQRSGAKISETETGRRRRDSLDRNLASIQPPKTTVKRMILQGTPLSNEKQFRSP 1020
Query: 1021 TLEGPATVARPASRIPSSMLSSSSKNAEQGSENPATPFSWASSPRQKFQPLQKTNGTAPS 1080
TLEGPATVARPA RIPSSMLSSSSKNAEQGSENPATPFSWAS PRQKFQPLQKTNGTAPS
Sbjct: 1021 TLEGPATVARPAGRIPSSMLSSSSKNAEQGSENPATPFSWASPPRQKFQPLQKTNGTAPS 1080
Query: 1081 PLPVFQSSHEMVKKSNSEAYSAASENKFAEVTYPEKSKASDFFSLARSDSVQKSNMNFEQ 1140
PLPV+QSSHEMVKKSNSEAYSAASENKFAEVTYPEKSKASDFFSLARSDSVQKSNMNFEQ
Sbjct: 1081 PLPVYQSSHEMVKKSNSEAYSAASENKFAEVTYPEKSKASDFFSLARSDSVQKSNMNFEQ 1140
Query: 1141 KSSIFVTSSKPMSTPKDSIETLNPNSQKTANVKERLTTPSPLFGSANKPEPVSVGTTSSL 1200
KSS FVTSSKPMSTPKDSIETLNPNSQKTANVKERLTTPSPLFG+ANKPEP SVGTTSSL
Sbjct: 1141 KSSFFVTSSKPMSTPKDSIETLNPNSQKTANVKERLTTPSPLFGAANKPEPASVGTTSSL 1200
Query: 1201 VPTVDVLRKTEEKKPPTVFSPSVPAPAPVNTPPSAST-FLGSPLSKSFPSPAAVVDLNKP 1260
VPTVD LRKTEEKKPPTVFSPSVPA PVNTP SAST F GSPLSKSFPSPAAVVDLNKP
Sbjct: 1201 VPTVDELRKTEEKKPPTVFSPSVPASVPVNTPSSASTLFSGSPLSKSFPSPAAVVDLNKP 1260
Query: 1261 LSTSTQSSFASPVVSVSDSLFQAPKMVSPPSNLSSLNPTLVSSSKEQPMPKSDADTEKQA 1320
LSTSTQSSFASPVVSVSDSLFQAPKMVSPPS LSSLNP+LVSSSKEQP+PKSDADTEKQA
Sbjct: 1261 LSTSTQSSFASPVVSVSDSLFQAPKMVSPPSTLSSLNPSLVSSSKEQPIPKSDADTEKQA 1320
Query: 1321 PASKPESRELKLQPSVTLAVGNHVEPTSVTQTVSKDVGGHVPFVVADAQPQQSSAAFVPL 1380
ASKPE RELKLQPSVTLAVGNHVEPTSVTQTVSKDVGGHVP V+ADAQPQQSSAAFVPL
Sbjct: 1321 QASKPEFRELKLQPSVTLAVGNHVEPTSVTQTVSKDVGGHVPSVIADAQPQQSSAAFVPL 1380
Query: 1381 PTPNSTSKAAANGKSETSDALITQDDDMDEEAPETNNVEFSLSSLGGFGTTSTPMSNAPK 1440
P+PNST K +ANGKSETSDALITQDDDMDEEAPETNNVEFSLSSLGGFGTTSTPMSNAPK
Sbjct: 1381 PSPNSTPKVSANGKSETSDALITQDDDMDEEAPETNNVEFSLSSLGGFGTTSTPMSNAPK 1440
Query: 1441 PNPFGGSFGNVNATSMNSSFTMASPPSGELFRPASFSFQSPLASQAASQPTNSVAFSGSF 1500
PNPFGGSFGN NATSMNSSFT ASPPSGELFRPASFSFQSPLASQAASQPTNSVAFS SF
Sbjct: 1441 PNPFGGSFGNANATSMNSSFTTASPPSGELFRPASFSFQSPLASQAASQPTNSVAFSSSF 1500
Query: 1501 GSGMATQASAQGGFGQPAQIGVGQQALGTVLGSFGQSRQLGPSLPGTASGSPGGFNGGGF 1560
GSGMATQA QGGFGQPAQIGVGQQALGTVLGSFGQSRQLGPSLPGTASGSPGGFNGGGF
Sbjct: 1501 GSGMATQAPTQGGFGQPAQIGVGQQALGTVLGSFGQSRQLGPSLPGTASGSPGGFNGGGF 1560
Query: 1561 TSVKPVGGGFAGVGSGGGGGFGGGGF----------AGAASTGGGFAGASPPTGGFAGAT 1620
TSVKPVGGGFAGVGSGGGGGFGGGGF AGAASTGGGFAGASPPTGGFAGA
Sbjct: 1561 TSVKPVGGGFAGVGSGGGGGFGGGGFGGGGFGGGGFAGAASTGGGFAGASPPTGGFAGA- 1620
Query: 1621 GGGFAGAAGGGFAGAAGGGFAGAAGGGFAGAAGGFGAFGNQQGSGGFSAFGAAPGGSGGT 1680
AGGGFAGAAGGFGAFGNQQGSGGFSAFGAAPGGSGGT
Sbjct: 1621 -----------------------AGGGFAGAAGGFGAFGNQQGSGGFSAFGAAPGGSGGT 1668
BLAST of Csor.00g300930 vs. TAIR 10
Match:
AT1G55540.1 (Nuclear pore complex protein )
HSP 1 Score: 874.8 bits (2259), Expect = 1.1e-253
Identity = 730/1856 (39.33%), Postives = 988/1856 (53.23%), Query Frame = 0
Query: 12 LTPIILEDSYEGEHVETNDYYFEKIGEPVPVKLNDSIFDPGSPPSQPLAVSESFGLIFVA 71
++ + +E+ EG+ + TNDYYFE+IGEP+ +K +D+ +D +PPSQPLA+SE ++FVA
Sbjct: 1 MSRVEIEEDTEGDRISTNDYYFERIGEPISIKEDDAQYDLENPPSQPLAISERHAVLFVA 60
Query: 72 HSSGFFVVRTKDVVASAKEMKNGGTGSSIQDLSIVDVSVGKVHVLALSNDNSFLAAVVAG 131
HSSGFFV RT DV++++K G IQDLS+VDV VG V +L+LS D+S LA VA
Sbjct: 61 HSSGFFVGRTNDVISASKNSNGNGDKVFIQDLSLVDVPVGDVRILSLSADDSILAVTVAA 120
Query: 132 DVRLFSVDSLLDKAEKPYFSCSTTDSSCIKDFKWTRKPENSYLVLSKHGKLYQGSASGPF 191
D+ FSVDSLL K KP FS S +S +KDF+W R ++SYLVLS GKL+ G + P
Sbjct: 121 DIHFFSVDSLLKKDAKPSFSYSPDESGFVKDFRWRRNDKHSYLVLSNTGKLFHGIDNAPP 180
Query: 192 KHIMHDIDAVECSVKGKFIAVAKKDTLSIFSYKFKERLSMSLLPSSGNGDTDTDFAVKVD 251
+H+M +DAVE S KG +IAVA+ ++L IFS KF E+ ++L S GD+D D VKVD
Sbjct: 181 RHVMDAVDAVEWSSKGSYIAVAQDNSLRIFSSKFNEKRCIALSFDSWIGDSDEDCFVKVD 240
Query: 252 SIKWVRADCIIIGCFQVAATGDEEDYFVQVIRSKDGKITDVSSNKVLLSFHDIYSGFTPD 311
SI+WVR +CI++GCFQ+ G EE+Y VQVIRS DGKI+D S+N V LSF D++ D
Sbjct: 241 SIRWVRNNCILLGCFQL-IEGREENYLVQVIRSPDGKISDGSTNLVALSFSDLFPCSMDD 300
Query: 312 ILPVETGPCLLLSYLDKCKLAIVANRNNTDQHIVLLGWLQ-EVENEVAVIDIERDKSLPR 371
++PV GP LL SY+D+CKLA+ ANR + D+HIVLL W + ++ V+V+DI+R+ LPR
Sbjct: 301 LVPVGVGPHLLFSYIDQCKLAVTANRKSIDEHIVLLDWSSGDDKSAVSVVDIDRETFLPR 360
Query: 372 IELQDNGDDNLVMGLCIDRVSLPGKVEVQVGNEEIREVSPYCTLLCLTLEGKLILFHFSS 431
I LQ+N DDN VMGLCIDRVS+ G V V+ G++E++E+ PY L+CLTLEGKL++F+ +S
Sbjct: 361 IGLQENNDDNTVMGLCIDRVSIEGTVNVRSGDDELKELQPYFVLVCLTLEGKLVMFNVAS 420
Query: 432 ANESEASDETVSACDEEEEDDTVVPTDDQPQLFSNIDQRPVSKVDESPVITRESNAKSQQ 491
AS +T A + ED +D S+ + ++ I +++ K
Sbjct: 421 VAGRPASSDTDLASSSDIEDAYTPLIEDDLSKQSSEKHQQLN-------IAVQNDQKHLN 480
Query: 492 MDSFAFSQPLKPSTLERPNNEIGNFAKPVKSFTGLGSVAFSGKSVDVPSQSLKSSILERP 551
+ F+ Q L + + E + V ++ K + V + +S I
Sbjct: 481 TEKFSTEQRLPNENIF--SKEFESVKSSVSGDNNKKQEPYAEKPLQV-EDAQQSMIPRLS 540
Query: 552 NNEIGNFDMPF----DKFTGLGSVA---------FSGQSVDMPSQSLKPS-----FLERP 611
G M +KF G G QS M Q+ S F P
Sbjct: 541 GTSFGQLPMSLGYDTNKFAGFGPALPVSEKLQKDIFAQSNSMHLQANVESKSTAAFFGSP 600
Query: 612 NNQIGNFDKPVQKFTGLGSVAFSEQSANVPSHPFLNVKESTIKQSSGAANAFTGFAGKPF 671
Q P + S S + S PF +++++ KQS + TG+ P
Sbjct: 601 GLQNAILQSPQNTSSQPWSSGKSVSPPDFVSGPFPSMRDTQHKQS---VQSGTGYVNPPM 660
Query: 672 QPKDVPSTLTQSGR---------------QVSAGAGKIESLPVIQSSQVSLQDNFSLGKI 731
KD + ++GR + G KIE +P I++SQ+S Q S K
Sbjct: 661 SIKDKSVQVIETGRVSALSNLSPLLGQNQDTNEGVEKIEPIPSIRASQLSQQVKSSFEKS 720
Query: 732 SNKKQDGS---------ERNYGNVPLAKPMTEMCEGLDMLLESIEEPGGFLDACTTFQKS 791
++ +Q + E N N P + EM +D LL+SIE PGGF D+C KS
Sbjct: 721 ASHQQHKTPLSTGPLRLEHNMSNQP--SNINEMAREMDTLLQSIEGPGGFKDSCAFILKS 780
Query: 792 SVEALELGLATLSDQCQIWRRTMTERAQEVQNLFDRTVEVLSKKTYIEGIVTQASDSNYW 851
+VE LE GL +L+ +CQ W+ T+ E+ E+Q+L D+T++VL+KKTY+EG+ Q +D+ YW
Sbjct: 781 NVEELEQGLESLAGKCQTWKSTIHEQQAEIQHLLDKTIQVLAKKTYMEGMYKQTADNQYW 840
Query: 852 EHWDRQKLSSELELKRQRILQMNQNMTNQLIELERHFNGLELNKFGGNDEIQVNERALQR 911
+ W+RQKL+ ELE KRQ I+++N+++T+QLIELER+FN LEL+++ + V R +
Sbjct: 841 QLWNRQKLNPELEAKRQHIMKLNKDLTHQLIELERYFNRLELDRYNEDGGHPVARRGVPN 900
Query: 912 KFGSSRQSHSLHSLNNIMGSQLAAAQLLSDNLSKQIATLNIESPSSKRQSITKELFETIG 971
+ SR+ SLHSL+N M SQLAAA+ LS+ LSKQ+ L I+SP K ++ +ELFETIG
Sbjct: 901 RSAPSRRVQSLHSLHNTMSSQLAAAEQLSECLSKQMTYLKIDSPVKK--NVKQELFETIG 960
Query: 972 ITYDASFSSPNVNKIPETSS-KKLLLSADSFSSKDTSRRKQRSGAKISETETGRRRRDSL 1031
I YDASFSSP+ K SS K LLLS+ S SR++Q S K S+ ET RRRR+SL
Sbjct: 961 IPYDASFSSPDAVKAKNASSAKNLLLSSIPASINQQSRQRQSSAMKNSDPETARRRRESL 1020
Query: 1032 DRNLASVQPPKTTVKRMIL---------QGTPLSNEKQFCSPTLEGPAT-VARPASRIPS 1091
DRN A+ +PPKTTVKRM+L Q T LS + + T + V AS + S
Sbjct: 1021 DRNWAAFEPPKTTVKRMLLQEQQKTGMNQQTVLSERLRSANNTQDRSLLHVKDHASPVVS 1080
Query: 1092 SMLSSSSKNAEQGSENPATPFSW-----------------ASSPRQKFQPLQKTNGT--- 1151
S + SE +TPF AS P + + +N T
Sbjct: 1081 SNKGIMESFQQDTSEAQSTPFKTRPPMPQSNSPFTISPISASKPSFNWSGNKSSNTTSYA 1140
Query: 1152 ---APSPL------------------PVFQSSHEMVKKSNSE-AYSAASENKFAEVT--- 1211
APS + PV + E +K E +S A N F E
Sbjct: 1141 EESAPSQIKDTRTVSQPGGSSFLPKRPVASTVLEQTEKKAGEFKFSEAKANAFVETAAGS 1200
Query: 1212 ---YPEKSKASDFFS--------------LARSDSVQKSNMNFEQKSSI------FVTSS 1271
S SDF S S KS F SSI F +
Sbjct: 1201 VQRLSTTSSGSDFESSKGFGAQFSTMSSGAPASSFSSKSLFGFNSSSSIPGDKFTFPAVT 1260
Query: 1272 KPMS-TPKDSIETLNPNSQKTANVKERLTTPSPL-FGSANKPEPVSVGTTSSL------V 1331
P+S TP DS TL S + + P+ + SA P+ SV +TS++ V
Sbjct: 1261 APLSGTPLDSTSTLFTASSAPVSSSSQDPVPASIPISSAPVPQTFSVTSTSTVSATGFNV 1320
Query: 1332 PTVDVLRKTEEKKPPTVFSPSVPAPAP-------VNTP---PSASTFLGSPLSKS--FPS 1391
P L T K +PS P+P+P N P PS+ + S +S FP
Sbjct: 1321 PFGKPL--TSVKVDLNQAAPSTPSPSPGPTAGFTFNLPALSPSSPEMVSSSTGQSSLFPP 1380
Query: 1392 PAAVVDLNKPLSTSTQSSFASPVVSVSDSL-----------FQAPKMVSPPSNLSSLNPT 1451
A ++ +++T S S + S SL FQ+P++ +P S + P
Sbjct: 1381 SAPTSQVSSDQASATSSLTDSSRLFSSTSLSSTPPITPPDAFQSPQVSTPSSAVPITEP- 1440
Query: 1452 LVSSSKEQPMPKS-----DADTEKQAPASKPESRELKLQPSVTLAVGNHVEPTSVTQTVS 1511
VS K+ S + + A A+K ++ L ++ ++ G V P S + +S
Sbjct: 1441 -VSEPKKPEAQSSSILSTQSTVDSVANATKTQNEPLPVKSEIS-NPGTTVTPVSSSGFLS 1500
Query: 1512 KDVGGHVPFVVA----------DAQPQQSSAAFVPLPTPNSTSKAAANGKSETSDALITQ 1571
G + + +QPQQ S+ P P + TS A+ E D + TQ
Sbjct: 1501 GFSSGTQSSLASMAAPSFSWPGSSQPQQLSSTPAPFPASSPTS---ASPFGEKKDIVDTQ 1560
Query: 1572 DDDMDEEAPE-TNNVEFSLSSLGGFGTTSTPMSNAPKPNPFGGSFGNVNATSMNSSFTMA 1631
+D+MDEEAPE + E S+ S GGFG STP APK NPFGG FGN T+ N F M
Sbjct: 1561 EDEMDEEAPEASQTTELSMGSFGGFGLGSTPNPGAPKTNPFGGPFGNATTTTSN-PFNM- 1620
Query: 1632 SPPSGELFRPASFSFQSPLASQAASQPTNSVAFSGSFGSGMATQASAQGGFGQPAQIGVG 1682
+ PSGELF+PASF+FQ+P SQ A GSF S +Q AQ GFGQP+QIG G
Sbjct: 1621 TVPSGELFKPASFNFQNPQPSQPAG--------FGSF-SVTPSQTPAQSGFGQPSQIGGG 1680
BLAST of Csor.00g300930 vs. TAIR 10
Match:
AT1G55540.2 (Nuclear pore complex protein )
HSP 1 Score: 869.4 bits (2245), Expect = 4.7e-252
Identity = 730/1859 (39.27%), Postives = 988/1859 (53.15%), Query Frame = 0
Query: 12 LTPIILEDSYEGEHVETNDYYFEKIGEPVPVKLNDSIFDPGSPPSQPLAVSESFGLIFVA 71
++ + +E+ EG+ + TNDYYFE+IGEP+ +K +D+ +D +PPSQPLA+SE ++FVA
Sbjct: 1 MSRVEIEEDTEGDRISTNDYYFERIGEPISIKEDDAQYDLENPPSQPLAISERHAVLFVA 60
Query: 72 HSSGFFVVRTKDVVASAKEMKNGGTGSSIQDLSIVDVSVGKVHVLALSNDNSFLAAVVAG 131
HSSGFFV RT DV++++K G IQDLS+VDV VG V +L+LS D+S LA VA
Sbjct: 61 HSSGFFVGRTNDVISASKNSNGNGDKVFIQDLSLVDVPVGDVRILSLSADDSILAVTVAA 120
Query: 132 DVRLFSVDSLLDKAEKPYFSCSTTDSSCIKDFKWTRKPENSYLVLSKHGKLYQGSASGPF 191
D+ FSVDSLL K KP FS S +S +KDF+W R ++SYLVLS GKL+ G + P
Sbjct: 121 DIHFFSVDSLLKKDAKPSFSYSPDESGFVKDFRWRRNDKHSYLVLSNTGKLFHGIDNAPP 180
Query: 192 KHIMHDIDAVECSVKGKFIAVAKKDTLSIFSYKFKERLSMSLLPSSGNGDTDTDFAVKVD 251
+H+M +DAVE S KG +IAVA+ ++L IFS KF E+ ++L S GD+D D VKVD
Sbjct: 181 RHVMDAVDAVEWSSKGSYIAVAQDNSLRIFSSKFNEKRCIALSFDSWIGDSDEDCFVKVD 240
Query: 252 SIKWVRADCIIIGCFQVAATGDEEDYFVQVIRSKDGKITDVSSNKVLLSFHDIYSGFTPD 311
SI+WVR +CI++GCFQ+ G EE+Y VQVIRS DGKI+D S+N V LSF D++ D
Sbjct: 241 SIRWVRNNCILLGCFQL-IEGREENYLVQVIRSPDGKISDGSTNLVALSFSDLFPCSMDD 300
Query: 312 ILPVETGPCLLLSYLDKCKLAIVANRNNTDQHIVLLGWLQ-EVENEVAVIDIERDKSLPR 371
++PV GP LL SY+D+CKLA+ ANR + D+HIVLL W + ++ V+V+DI+R+ LPR
Sbjct: 301 LVPVGVGPHLLFSYIDQCKLAVTANRKSIDEHIVLLDWSSGDDKSAVSVVDIDRETFLPR 360
Query: 372 IELQDNGDDNLVMGLCIDRVSLPGKVEVQVGNEEIREVSPYCTLLCLTLEGKLILFHFSS 431
I LQ+N DDN VMGLCIDRVS+ G V V+ G++E++E+ PY L+CLTLEGKL++F+ +S
Sbjct: 361 IGLQENNDDNTVMGLCIDRVSIEGTVNVRSGDDELKELQPYFVLVCLTLEGKLVMFNVAS 420
Query: 432 ANESEASDETVSACDEEEEDDTVVPTDDQPQLFSNIDQRPVSKVDESPVITRESNAKSQQ 491
AS +T A + ED +D S+ + ++ I +++ K
Sbjct: 421 VAGRPASSDTDLASSSDIEDAYTPLIEDDLSKQSSEKHQQLN-------IAVQNDQKHLN 480
Query: 492 MDSFAFSQPLKPSTLERPNNEIGNFAKPVKSFTGLGSVAFSGKSVDVPSQSLKSSILERP 551
+ F+ Q L + + E + V ++ K + V + +S I
Sbjct: 481 TEKFSTEQRLPNENIF--SKEFESVKSSVSGDNNKKQEPYAEKPLQV-EDAQQSMIPRLS 540
Query: 552 NNEIGNFDMPF----DKFTGLGSVA---------FSGQSVDMPSQSLKPS-----FLERP 611
G M +KF G G QS M Q+ S F P
Sbjct: 541 GTSFGQLPMSLGYDTNKFAGFGPALPVSEKLQKDIFAQSNSMHLQANVESKSTAAFFGSP 600
Query: 612 NNQIGNFDKPVQKFTGLGSVAFSEQSANVPSHPFLNVKESTIKQSSGAANAFTGFAGKPF 671
Q P + S S + S PF +++++ KQS + TG+ P
Sbjct: 601 GLQNAILQSPQNTSSQPWSSGKSVSPPDFVSGPFPSMRDTQHKQS---VQSGTGYVNPPM 660
Query: 672 QPKDVPSTLTQSGR---------------QVSAGAGKIESLPVIQSSQVSLQDNFSLGKI 731
KD + ++GR + G KIE +P I++SQ+S Q S K
Sbjct: 661 SIKDKSVQVIETGRVSALSNLSPLLGQNQDTNEGVEKIEPIPSIRASQLSQQVKSSFEKS 720
Query: 732 SNKKQDGS---------ERNYGNVPLAKPMTEMCEGLDMLLESIEEPGGFLDACTTFQKS 791
++ +Q + E N N P + EM +D LL+SIE PGGF D+C KS
Sbjct: 721 ASHQQHKTPLSTGPLRLEHNMSNQP--SNINEMAREMDTLLQSIEGPGGFKDSCAFILKS 780
Query: 792 SVEALELGLATLSDQCQIWRRTMTERAQEVQNLFDRTVEVLSKKTYIEGIVTQASDSNYW 851
+VE LE GL +L+ +CQ W+ T+ E+ E+Q+L D+T++VL+KKTY+EG+ Q +D+ YW
Sbjct: 781 NVEELEQGLESLAGKCQTWKSTIHEQQAEIQHLLDKTIQVLAKKTYMEGMYKQTADNQYW 840
Query: 852 EHWDRQKLSSELELKRQRILQMNQNMTNQLIELERHFNGLELNKFGGNDEIQVNERALQR 911
+ W+RQKL+ ELE KRQ I+++N+++T+QLIELER+FN LEL+++ + V R +
Sbjct: 841 QLWNRQKLNPELEAKRQHIMKLNKDLTHQLIELERYFNRLELDRYNEDGGHPVARRGVPN 900
Query: 912 KFGSSRQSHSLHSLNNIMGSQLAAAQLLSDNLSKQIATLNIESPSSKRQSITKELFETIG 971
+ SR+ SLHSL+N M SQLAAA+ LS+ LSKQ+ L I+SP K ++ +ELFETIG
Sbjct: 901 RSAPSRRVQSLHSLHNTMSSQLAAAEQLSECLSKQMTYLKIDSPVKK--NVKQELFETIG 960
Query: 972 ITYDASFSSPNVNKIPETSS-KKLLLSADSFSSKDTSRRKQRSGAKISETETGRRRRDSL 1031
I YDASFSSP+ K SS K LLLS+ S SR++Q S K S+ ET RRRR+SL
Sbjct: 961 IPYDASFSSPDAVKAKNASSAKNLLLSSIPASINQQSRQRQSSAMKNSDPETARRRRESL 1020
Query: 1032 DR---NLASVQPPKTTVKRMIL---------QGTPLSNEKQFCSPTLEGPAT-VARPASR 1091
DR N A+ +PPKTTVKRM+L Q T LS + + T + V AS
Sbjct: 1021 DRVIFNWAAFEPPKTTVKRMLLQEQQKTGMNQQTVLSERLRSANNTQDRSLLHVKDHASP 1080
Query: 1092 IPSSMLSSSSKNAEQGSENPATPFSW-----------------ASSPRQKFQPLQKTNGT 1151
+ SS + SE +TPF AS P + + +N T
Sbjct: 1081 VVSSNKGIMESFQQDTSEAQSTPFKTRPPMPQSNSPFTISPISASKPSFNWSGNKSSNTT 1140
Query: 1152 ------APSPL------------------PVFQSSHEMVKKSNSE-AYSAASENKFAEVT 1211
APS + PV + E +K E +S A N F E
Sbjct: 1141 SYAEESAPSQIKDTRTVSQPGGSSFLPKRPVASTVLEQTEKKAGEFKFSEAKANAFVETA 1200
Query: 1212 ------YPEKSKASDFFS--------------LARSDSVQKSNMNFEQKSSI------FV 1271
S SDF S S KS F SSI F
Sbjct: 1201 AGSVQRLSTTSSGSDFESSKGFGAQFSTMSSGAPASSFSSKSLFGFNSSSSIPGDKFTFP 1260
Query: 1272 TSSKPMS-TPKDSIETLNPNSQKTANVKERLTTPSPL-FGSANKPEPVSVGTTSSL---- 1331
+ P+S TP DS TL S + + P+ + SA P+ SV +TS++
Sbjct: 1261 AVTAPLSGTPLDSTSTLFTASSAPVSSSSQDPVPASIPISSAPVPQTFSVTSTSTVSATG 1320
Query: 1332 --VPTVDVLRKTEEKKPPTVFSPSVPAPAP-------VNTP---PSASTFLGSPLSKS-- 1391
VP L T K +PS P+P+P N P PS+ + S +S
Sbjct: 1321 FNVPFGKPL--TSVKVDLNQAAPSTPSPSPGPTAGFTFNLPALSPSSPEMVSSSTGQSSL 1380
Query: 1392 FPSPAAVVDLNKPLSTSTQSSFASPVVSVSDSL-----------FQAPKMVSPPSNLSSL 1451
FP A ++ +++T S S + S SL FQ+P++ +P S +
Sbjct: 1381 FPPSAPTSQVSSDQASATSSLTDSSRLFSSTSLSSTPPITPPDAFQSPQVSTPSSAVPIT 1440
Query: 1452 NPTLVSSSKEQPMPKS-----DADTEKQAPASKPESRELKLQPSVTLAVGNHVEPTSVTQ 1511
P VS K+ S + + A A+K ++ L ++ ++ G V P S +
Sbjct: 1441 EP--VSEPKKPEAQSSSILSTQSTVDSVANATKTQNEPLPVKSEIS-NPGTTVTPVSSSG 1500
Query: 1512 TVSKDVGGHVPFVVA----------DAQPQQSSAAFVPLPTPNSTSKAAANGKSETSDAL 1571
+S G + + +QPQQ S+ P P + TS A+ E D +
Sbjct: 1501 FLSGFSSGTQSSLASMAAPSFSWPGSSQPQQLSSTPAPFPASSPTS---ASPFGEKKDIV 1560
Query: 1572 ITQDDDMDEEAPE-TNNVEFSLSSLGGFGTTSTPMSNAPKPNPFGGSFGNVNATSMNSSF 1631
TQ+D+MDEEAPE + E S+ S GGFG STP APK NPFGG FGN T+ N F
Sbjct: 1561 DTQEDEMDEEAPEASQTTELSMGSFGGFGLGSTPNPGAPKTNPFGGPFGNATTTTSN-PF 1620
Query: 1632 TMASPPSGELFRPASFSFQSPLASQAASQPTNSVAFSGSFGSGMATQASAQGGFGQPAQI 1682
M + PSGELF+PASF+FQ+P SQ A GSF S +Q AQ GFGQP+QI
Sbjct: 1621 NM-TVPSGELFKPASFNFQNPQPSQPAG--------FGSF-SVTPSQTPAQSGFGQPSQI 1680
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
F4I1T7 | 6.7e-251 | 39.27 | Nuclear pore complex protein NUP214 OS=Arabidopsis thaliana OX=3702 GN=NUP214 PE... | [more] |
Match Name | E-value | Identity | Description | |
KAG6573777.1 | 0.0 | 100.00 | Nuclear pore complex protein 214, partial [Cucurbita argyrosperma subsp. sororia... | [more] |
KAG7012851.1 | 0.0 | 99.88 | Nuclear pore complex protein, partial [Cucurbita argyrosperma subsp. argyrosperm... | [more] |
XP_022945173.1 | 0.0 | 97.39 | nuclear pore complex protein NUP214 isoform X1 [Cucurbita moschata] | [more] |
XP_022945174.1 | 0.0 | 96.44 | nuclear pore complex protein NUP214 isoform X2 [Cucurbita moschata] | [more] |
XP_023541587.1 | 0.0 | 96.39 | nuclear pore complex protein NUP214 isoform X1 [Cucurbita pepo subsp. pepo] | [more] |
Match Name | E-value | Identity | Description | |
A0A6J1G030 | 0.0 | 97.39 | nuclear pore complex protein NUP214 isoform X1 OS=Cucurbita moschata OX=3662 GN=... | [more] |
A0A6J1G089 | 0.0 | 96.44 | nuclear pore complex protein NUP214 isoform X2 OS=Cucurbita moschata OX=3662 GN=... | [more] |
A0A6J1HNV2 | 0.0 | 94.43 | nuclear pore complex protein NUP214 isoform X3 OS=Cucurbita maxima OX=3661 GN=LO... | [more] |
A0A6J1HUR6 | 0.0 | 94.83 | nuclear pore complex protein NUP214 isoform X1 OS=Cucurbita maxima OX=3661 GN=LO... | [more] |
A0A6J1HQ79 | 0.0 | 94.15 | nuclear pore complex protein NUP214 isoform X2 OS=Cucurbita maxima OX=3661 GN=LO... | [more] |