Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: polypeptideCDSinitialstart_codonintroninternalterminalstop_codon
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGTCGTGGTTTAGAGCAGCGGTGATCAGGGCGGTGGAAGCGGGTGCCGGAGGGAAGGACAATATCACCCGCACTGTCCGTAACTATGCTGGCACCGTCGTTCACCATGCAGGAAATGCTGTTGTGGAGGGCGCCAAGATCATCCAAGACCGCATTGTAATTTCATTCTCACTTCTCCACTCAATTCTCCCTGTTTTAAATGGCGTTTGGTTCTTGACATCGTTATGAGTATTGGAACATTTATTACTAGGGCATGGAACCTTACGATATTGGTCTCTGTTTAGTGTGCGGCTTTGTATATTTAGTTATATTACGTCTGCGCGCATTACCAGTTATCGCTTCGTATCCGCGTGTGTTACTTGTTACAGTTTCTGTTTCACTAACGAGTTAAGTTCCGGGTTTGATTTGTGAGGCAGGGACCAAGGAATATGCAGGGATTTAAACAGACTGTAAAAAGGCTGGAAGAAATTTCTGTTTCGTCTAGAGGCGTAGAAAGAGTTCAGTTATTGAGAAGGTGGCTAGTTGCGCTCAAAGAGGTGGATAGATTTTCGTCAGGTTCAATCGAGGGTGATATAAACAGTCCGACAGATCAGCCTAACGATGAAACTAAAGATTCTCCGAAAAATCCTACACTGGTGCGTATATCAATTAGCGTTCTTGTATTTCCTGAACTTTTTTGTGTGTCTGAGTTGATTGAAGGATTTGTAGTTGTTTTTTGTTCGTTTTCTCCCCTGGCGCTGAACGTCGTGAAATGCTGTTTGATATCTTGCGCAGGTCTATTATGTAGACCCAGATATGGGAGGCGAGCTGAAAACGTTTCGTGATGTATTTCTTACAAGCCAAGCTCTAGAAGGCATTACGTTGTCTATGGTGAGTTAAATTGAATACATGCCAATACCTTTTTGCTTGCAAAACATTGGTATACGAAAATATGCTAGGTAGTTCCTAATATATCAGCAGCATTCATTTACTTCTCTCTCTGTTGGAATGATGTAATTACTTGATGAGCATAAAAGGCCTAAAATTTTAGCTTCTATTCATTTTCTATGTATGTATTGGTACTCTTTCCCCTCACCCGGGAATGGAGTGATTTCTTGTTCTGAACCACTTTTTGTGGGCTGGAGTTTAACCACACGATAACCCTTTTCTATGACTCAGATTCTTGAAGAAGCTACCGATGAAGAAGAATCACTACTTCTAGAGATATATGGGTGAATATTCATTCTCAGTTACACCTTACATGTTCTTTATGCCCTGCCGCTGATTTCAGAAAGTTTTCTCACGTTACTTGTGCAGACGAGAAGAAACAATATTAGAAATTTCTATGATCTTTTCCACACTCATATCATTAGACTTAAGTTCCTTTTGTCGTGGCTATATTCAGGCTGTGTCTTTTAGGAGGAAAGGAAGTACATCAAGAAATAATGATGAATGTACACAATTTGGCAAATGCATTTTCAGAATACCAAGATGAAGCACTGGTAATGTCATGAAGGCATGAATTAATTTCAGCATAAAATTAGCATGGATAGAGCAGTCATTTCCCTCTTGCACTGCCATTTTTAGGTAAAACGAGAAGAACTGCTCCAACATGTCCAAGATGCAATTGCAGGGCTGAAAATAAATGCTGATTTTGATAGGTGGTGAATAAGCATATCTTTCATATTTCATTCATTGGTCTTCATTAAATTAGATATCTGATATATTATGTACGCATGGATTTTTGTGGATAGGATTACAACTTAACTAAATTGAAATCTATACAACAAGGATGAAGTGGAAGTTAGCTCTAAGGAGAGCTGCTCTAACGAGACTTCCAAGTATGTATGATAGAGAAAGGGAATGCACATGGAGGAAGTTGGAGTTAGTGTCCTAAGAGAGCATACAGTCAAAATTTTTCATAAACCAATCCCTATCTCGCTTAGTGTCATTAATTTATCTTTGTTTTCCTCAATAAAGCAAAAAATATTTATATTTTTTTCTATTCTTTTAAATGTGATAGAATAGCAACCACTCTCTTTTACTATACAACCTTGAGCTCATCATAAGAAAGTGAAAATTGCTATAGAATCCACCATAAGAGAAAGGAACAGCTCACACTGAATGAAAGGATTAAAAAAAGCTTTGATACACTGATGGTTGTGTTAGACTGATTGATTGATTCTGATATCCAATTTTGTATCACTGAAGCCTGTGAAAAGATATTTTGATGTTTTGTTCTTCCTAGCTATCATTTTATATGATCCATGTGATGCCCTAGATACAATCTATGTAAACTTCTATTTGTTTATTCCTATTCTTATTTTTAATTTTTTGTTGTTATTCTTCTTGCTCTTAATTTTTTTTAAACAAGAAACAACACCTTTCATTAATGCCTGAAAGATTATAAAAGAGTTCAAGAATTTAAAGTCTACAAAGTGTGGGCTTCAGAAAATAAAAGAACATAGCTATTCCAACCAGATACTATTTTTGGGAAAGATAATTTTGGGTGGTTCACAGATGCTAAAAGGATAGGAAGTTCATGGAGCCTTTTGTGGAATATCATGAAAAGCTTGAAACTGCAGTTGAGATTTTTTGTAAGATAAAAGTTGGTAATGGTATCAAAACGTCTCTCTGTTAGGGTTCATGTCAACCTGTTGGCGGCAAGTTACACAAGAAATTTTGGCCTTTCAAAGGAAAAAGATTCATTGGTTTCTGAGGCATGGGAGGAAAGTAGTAGCTTGTGGAAAGTTTTCCTAAGAGGAAATTTGAAGGATGAAGTAGTGGATGACTATTGTGAACTATTAAAAAATTAATTGAAGGTTTCTTATTGAGAAAATGAAAGGGAAAATAGTTAGGTTAGATTCTTCTAAAATCTTAGAATATTGAGTTGGTGGTTTTCGTTGCCTTAAAGACTATTATGGAAATTAAAGGGTCTGAGAAAAGCGAACATTTTGGCATGGCTAGACTTTTTGGAAGGCTATACGTGGTTGATAAAATCCAGCAGATACTAACTTAATAAATAGATACTTGCGCGAGTAGGTCTCTCCTAAAGGCTCCATTGACCAAAGTCTTCTGACTTTATTGTTTAGGAACGCTCCATCTAACACATCTGGAGAGTTAGTTTCCCCCCTTCACTTTAACAATCATTAGGATTTTTTCTCAAGCTCAAAAGCTAGTTTCTTGCGTTCTCATTTCAAAGCTGAGTGGTGGTTCAGTTTTTGAGTGAGCCACCATATAAGTTCTTGGCAACATGATATTCAGAAGCATATCGAATCCAGTCATCCTCCTGAAACAGATACTAGTCATCAGAACTTAATCTCAATTTAAGAGCTTGTTCTGTAGTTTCAGTTCTTTTATCTTAAATTTTGATTCAATGATTTAATAATGCTGATGCTTGGTCAATTGGAGGAGAAAGGAATGGATAATCTTTAAACTATTTGCACCATATAATTTTGAGAGGTGGCATAAAGGTTAATATAGTGAAAGTTCAAGTAATGTTTTCTTGGCCATTTCCTGGGAGCGTATGAAAACTTATAGGATTTTTGGGTCTACTATGCTACCCCTGAAAATTTGTGCGCTAGGTTGCAGGCCCATTAACAGAGCAGTTGAAGGAAAACTAGTTTGGGTGGAATGGCAAAGTAGAGACAACACAAGATTCTTATTCAGTGGGACGGTCAAGGTATCAAGGAAGCGACGTGGAAGCCAAAAGTAGCTATTCAGTTTTCAACCTTTCTCCCACAAGGACAAGGTGAATCTATCGGTGTGGGTATTGATAATCCTCCAATCCAGTTTTTCTGACAGTAGGTGTGGGAATTGATAATCCTCCAATCCAGTTTTTCTGACAGTAGGTGTGGGACTCGTATGATGGATTAGGAAAATAAAGAGTGGCTGAAAATGTTAACTTGGGAAGTTATCGGCATGAAGTTACGGTAAGTTACAGATTGTTACGGCCTTATGGGTAGTTAATCACTAATGTGGGATTTGATCTTATCTATAATGTTGTTAGTTAATTAGTTGTGGATTTAATAACTTCGTTAGGGAAAGGGGTGAGAAGAGCTTTTCCTTAATTATGTTTAACATACTACAGAGTGTTTTCCGACCTCTCGAATACCTGAATTTGATTGTGTTATATTCCTCCTTTGACTAACTATAAAAGTTCTCTTGTGATTGAGCTAACAATGTGCTAATTTGTGGCTGATCCCATAGTGTATAGGGCAACAAGCCTTTCAATATACTATGTTGTTTAATTAAAGTTCATTGTGTAGGTTTCCAAAATTCTCAATTGGTGTTTTTCATTGAGTGGATTAACAGTATAATTCTTCTAATCCCTTTTCAAACAAATTCATTTATTCTTCCAATTTCCTGCTCTCAGAATAGATGCTAAAGCATGTAGTTTGAAAAAAACACTTGATGAAAAGAAAGGAGAGATGCCACCTTCAAGTGGAGCTCGAGATAACACATCTGATGATAAAACCACATCTTCCAAGGTGGTTATGATTTATGAATTTATTTTATATCTATAGACATGGATATGAGCCATGTATTAAATGTGGTCGTTTTTCTACTTTACCATCTTTGATTCTTTACTTTAACTTCAATGTTTTAGCAAGTATATCTTTGCTATTACTTGTTATCCTTTTTGAGTTTTAAGGAAGTATTATATTACATTAGATCTCATTATGTAAGTTACAGGTTCTGCAAGAAATACTTTCACAAGTTCAATTATACTCCAAGTTGGAGGAGCTCTTACTAAAGAAGAAGCTGTTCAACGAAGTGGATTCCCCACAGCTTCATGCTGAGAAGGTCCGTTCTTCTCTCCCTGGCTTTAGGGTTAACAAAATGGGATAAGGGAACTAACTAATCTAATGCTTTTGGTTGGTACTGGAGTTAGATTGCATATAACAAACTTCCGTGCATGTTAAGCACATCTGCCATTTTCAATACTTATATTTCCCCCCTTAAATCTTAAAATTTCTACATACGATGGTATCAACTTATATTTTGTATATCTCATTAGTACAATTTACTTATTAAACAAAATGGGATAAGGGAAAATTTTGAGATGAGAAGCATGTTCCTCAATGATGTAAGATCTTTGTTCCTAATGGAAGATTGGTAGTATGCACCATCATGGTGATGTCTCCATGCTCTTCATGTTCTTCAAGTTGATATAAGTTGTTTAAAAAAACAAAAGTTGGTAATGAGAGAACTTCTAGTCCTGTTATTTATTTATTGATTGATTGTTGATGCTTTTACCATTATACAACAACAACCAATTGATTTTGCAGGTTGACAAACTGAGAATTTTGTCAGAATCTTTGGCCAATTCTACCTTGAAAGCTGAAAAGCGTATTGTTGACCAAAGGTTTAACATATTACTAATGCCGATTTATGTCTTGCACGTATTTATTCGTTGTTGCTTTTGTGTTTGTTGCTCTTATAAGGGTTCCATGTGCAATATGTCATTTTTGTGACATTATTTTTTAATATGATACAAACTCCCATTACTCATGAAAGAACACAAAATGTGAGGACAAGAGATTCTCTTTCAAGGTCAAGGAGTTAAAACATTGCTCTCATTGCCAATATAACAAAACACACTAGTTACAAAATAATTTACTACGAGAGCACCAAAGAGTGGCCAAATCTTTCACGATTTTCTCTAAAAACATCATTTTTCATTTAGAAATTCTGGAAAGCTATACTTCCTTCCTAGCAAAATCTCCAACTTAAGCCATTCACTACATTGATCCAAAGGATATTTTCCCTCTCTCGGGTGTGACTACTTGGAACTCGCTCTCTTACTCTTTGCATACTTCGAATTAATGAGAAAAACATTAAGAGATTCTTAAAATTATAAGTACTAAAACCTCCACTTGCAGGAGGAGGGTCAATAAAGTATGTTTCATTCAATTGCACAGGGGGGGAACGGAAGTTGTCATAAAAATAGACAACTCAAGACTCAAAGCAAAATTACCTAAATTTTTTCTGGACTTGGTTTACGAGATGGGAATGTATTGTGGATGAATATTCCATGTTGTCGAAACACTAGTTTGGGAATTCAGGCCAAGAGATCAATGATTGTTCAAAAGGCAAAAAAAAAAAAAAAAAAAAAAAGGCTAGGATTAAAAGTGGGAGCTAGTTAGATACCAAACCTACCTACTACTAAATGTTTTGGTGTATGGTTGTAAGAAAAGCAAAATAGTTTGTTCGTCTTTTTCTTCTTTTTAATAACGACAGCAACAATAAAGACAACTCAAGGCATAGTTGTAAAAAAGATAGCTGAATAAAAAATATTTTTTACAGTGCCACTTCTTTTGGAGGACAATGTCTATTATGTTCTGTTGACTCAAAAAAAGAGTTATATGATATATCTAAGAAGTTAGCCAACATTGGTGTTGGGAAATAGCCTCTAAGCAGATGTCCATGTATGTGTTTTGTAACTTGGGTTTTTATTACTTTTTAATTAGGTTAATTCATTACATTTTTTCATCTGTAATATTCAAATGATTTATCAGATGCCAAGCGGCTTCCCTCACAAGAGAAATAAATTCAAAAACCATTCAGATTTGGTGGAAAATTTTTCTTCGTATGTCCTCAATTTCATTCTAGATGTCAGTACATCTCAATTTTGAAAGGAATATCACTAGGTTTTCTTTATTTGGAATTATATCTGTATTTATTTTTGTTTCACACTGGTTAGTTTCTCTTCTACTTGAACTTGTTTTATGGAGAGTGCATCAATTTCAAAGTGGTGGACATGCTTAAGGGATGATATTCTTTATCAAAGGATCGCTTGAAAATGGGGAGGAGGTTTGGATTTCAGAATTTAAACGTCCAAACTATTATTAATAAAAAAGTTTTGGGATGAAATTAAAAACTTGCATGGGTATGTTTTGGATACTTGGTGGTTAGGAGGGTACATTTATGTTAAAAGATGGGGGTACATGATAGATCTAGTGGGAGTAGATTTTTTTAAAAGCATGAGGAAGTCTAACGAGATGATCTTTTAACTCCCTGGCTGAACCCAAAATAGAACTTACTTTACCTTCCTACCCTCGATGTTAATCCTTTTATCTCCCTCAACCTTTAATTACAAACTTTAATTCACTCTTAATCTGCCTACTACTAACCACTTGGGCAATGTTTTAAGGAATTTGAATGTCATCAAGGCTGGTTGGAAAGGCTTGGAAATTTGCAGATAATTTTTAGAGGAGATAAAATTGTACATAGATGGTTAGTGCGGCCCCTTTTTTTAGCTACCAAATCATAATTATGGGTTAGGTGAGGGGTCTCATTGGTGACCCTGATGTGCTTCTTGGTCATCTGAGGCATTTCTTGGGTAATTAGTGTAATGCTTCCATATTAATTTGAACACTTCTTTTCCCTAGAGTTCGTAAGAGCCTTCTCTTGGGTATGTAAAATGTTGAGGTTCGTCTTCAATCATTATATCATATTTTTACATTGAAAGTTCGTGGTCTCGGTGAACTTCAGAATGACTTTTGGATTGGTGATCATAGGAGTGATCAAATTCATATTTTTTTTTCTTCTAAATGTATAATCTATCTTTGGTTCTAAATCCCACAAATAGATGGTGATCCTTATCGCTATTCTCTCCTTAAACATTAGTTTCTTTGTAATAGGGTTGTTCTACAGAGAAGGTCTCATTCGTAGAAATATCTTGGAGTTAGAAAGAGACAAAACATGCTTCCAAGTGAACATATTTCCTCGTGGAAATGTTCCAATCCTCCTACCTAAAGCATTTGCAACTTGAAATGGGGGGACACTTAAGAAAAAGGCAAACTAGGTCCAAAGACTTAATAACGCTGTTCATATATATTTGGTAGAATTAGGTACGCATGAAGTTCTCTCATCTGACTTCCTTTTCCACTGCATTTATATCTATTGCATCACACGTAGAAGTCATATAGGTTGCAGTTTGTACATAATACTAGAAAAGAATTTGTGAAATTCCATGACTTTCCGCAAGCTGCTTGTTGAAAAGTAAGTCAATTGTGAAACTGTTAGGCTTTCTTTGGGGAATTCATATAAATGTTTTCTTGTGTCATGGTTGATATTTTTTATAGGCTAGTGATAGGCAGGTGGCCATTTCGGTTTTTCGTGCAAAAATAAAGGCATCGAAATGAAAACAAGAGATATTATGCTCCTCAAGTGAAACTGTCAATCTATAAGCATTAATGTTTTGTCATGCTAAAATATTAAGCTGTTTCAAGAATCTTAATTGGCATTTCAAGTAGAATCGATTATGAAAGAATGAGAGAATTGATATGATTTATTATAAAGCTGGATTTATACTTGTCCTGTGACAGCATCATGCGCAAACGGGCGAACTTGGGTTAAGGGTGTCTTGAAGTGGAGATTCATCTCTGATTCTCTTGTTGAATGCTTTCTTATATTTTCTGCTTGTCATTTGCAGAGAGCAGAGGGAGGATGCACTCAACTTTCGGTTAGCTAAGTCCGAGGAAATGGTCCAAGCTGAGAAGGTAATCTTCCTTCTAAAATAGATAATTATCCCTTAGAGATGTTGATAGACCCAACATAAAATATCAGAAAGCTACTGGTAAGGACTTTTGCTTTTGGGCTATATAGATTCGTTTGTGTAATTTGCTTTTGATATCTTGCTATCAAATTTCTGGTTTTGTCAATTTCTTGACGCTTCTGTCTGTGCCAGATTTTTACTTGTTTTCTCTTCCTCATTATTATTATTATTTTGTAGTTATCAATCAAAGTTAAATTGAAGAGTTTGGATAAAATTATTCAAAAATATTTTCTAACCTCTTAGCCTTTTGTTATAGTAACGATTGGACTACAGTATGCATATAATCTTAGTACCATTTTTATGAAATAAATATTTTTGGTAGAAATTAGAAATATAGTACAAGACCTGTAATTTTAGTTTTTCTTTCATGGATAAAATAACATAATGATGAAATATATTATTATAATCTCAAAAATTAGTAATTTGGAAAATTTTCATTTTACAATATTTTGGAAAATTAAAATATCATAAATATGCATATAAAAGTATTATTGTTTTCTATGTGTTTTTAGTTTAGCCATAACTCTTGTCTAGGATAATATTTACATATTTGTATTAAATTTAGAAAAACACTGAAAAAATAAATATATTATAAAACAACTAATACTCAGAAGTAAATAATAAAACCAAAAAACTCAAACAAATTTATATTTCAACCATTGATTTGATACAATTAAACAAAAACTGAAGTTCAATCATAATTGGAGAGTGGATTTAAAAATAAGTTTACATCAAAGTTAAAACCGAAAAATAAGAAATAATAGACTTTTAAAATAATCTTACATCCCAGTAATTGGTTCAGTACATTTGAATCAAACATTGGAGTTTATTTTGAAATTTAAAAAAAAAAATTCTAAAAAATCAGGTAACATACGTTGGAAATAGGGATGTTCTTTTTTCTTGCAGGAAATCCTCTTTTCGGCGAAGAATGGGGAAAATCCACGTCAGCTAAACAGGAATGGAGACAAGGAACATTCCTTGTCCCCGCCCCCGTCCTCTCCCCAGTTGGCATTTTTTAGTAAACAAATTATTCGTTATACTATTATATATAATTTTTAAATTAAAATTTTTATTATTCAAGGACTTAGATAATTTCTTATGCTTGTAAAAGAAATAATAATAATCATTTTATATGGAAAATATCTTAGAAAACTATGATTCCTATTCAAATTTTCCAATTTATGATGATGAGAACTGTTTCCGAGGAATTCTGGGGTTATATTACTCTGATTTCTAGTTGGTCTTCCTCCCTTCTTTTCACTTCTTTCCCCTTTTGGTATATTTTATTTATCAATGAAATTTGTCCCTTGCTCGTAAAAGATTTATTTATAATTGGCACAACAGCTTAGTTGCATGCTTGAATGACGGGTGATTCCTATCATGATTGTGCTAGTATACTTATCCATTTCATGACTTATTGTAGTCAAACTTTATATTTGTATTTTTCAGGAATTGACATATGATATTAGAGAACTTGAGAACCAAAAGGATAAACTGGAAGCAGAACTGGAAAAGGTTGGGTTTCTTTTTCTGTCATGTTCAATAGACTCTTTCTTGTGACTGCAATCCGTTTATTTTTTTCTGTTGTTTCGTACATTCATTCTCTTGCCAATCTAGTTTCCTTACTTCTATGATTACTTGCTGTGCTTTCATTTTTTCCATATCTTTTCCAACGGGAAGTTTGCGATATACGCTTTATGAGTTACTATAGTATGTGTTCAATGAGAATCATTGATGTCTCAACAGCCAAGAATGAATACTCTGTAGCTTCTCCCTCCTCAAGATAGAATGAAACCAACTCCAGTACACGAGGCACTAACAGGAGCTTCAAACCGGTGAGAAATCTAGCAAAAACCAAGACTGTTGTGGTATATAGATCATCTTCTTCATGCGTTCAACGCTCTGGAATGAAAACAAGTCTAAAAAAAGTTTGGTATGCAAGTTGAGAAGTTTTTATACGGATTAAAAGAGTCTCCTTGTGCTTGGTTTTACAGATTTGCCAAAGCTATGATTAAAAGTAGTTATTATCAATGTCCAGTTGATCATCTATATTTGTGAAATCCTCATACAAAAAAACTACAATCTGAATTGTATATGTAGATAATCTTATCATTGCAGAAGATGATATATGGGAAATTCTTAACCTGAAAAAGATGCTAGCAACTGAGTTTGAGATCAAGAACTTGAGAAATTTGAGGTACTCTTTAGGAATGAAGGTGGCATGATCTATGGATGCTATTGTCATTTCTCAGATACATTCTAGACTTGTTAAAGGAAACCAGAAATCTTGGGTGTAAACTTGCTGAAGCACCTATCGATCCATAGGCCAGCACCAAAGTGAAGAGATTTGTCCAATTCACGAGTGCATGTATCAGGAAAGTTCATTTACATGTCACATATCAAACCAGACATTGCATATTCCGTAAGCTTTGTTAGCAGCACGTTAATAGTCCAAATGAACATAATCTTAGGGTTACTAACAGAATATTAGGATACCTGAATGGTACTTCAGGTCAAGAGTAATTTTTTAAGAAGTTATGAAACAAAATGGTAGCACTCTATACTAATGCCATTTGGGCTTGGGAATTAACTTATAGAAGGTTTATATCTGGATATAGCCCTTATGTATGGGGAAACTTAGTCACTTAGAGAACCAAAAAGCATTAAGTTAAAATTAGTGAAGATTTGGACCATCTCTTCCTCCTATGTTTTTTGGCTCAGCTATTATGGTGTGGAAGATTCTGAAGCCTTCAATTTCACCTGTTATTCCGAGAAGCATTCAGAACTTGATTATCAATTCCCCTGCAGTCGGATGAGTGAATTTTGGTTCCAGTAACCTTTAGATTCTTATATTCTCTTATTGATGGACTGAGTTGCGTACAAATATTTGTTTCTCGTCTGGGCTATAGGTGTCCAATCTCCTCCTTTGTTAACTTCATTCCATCACTGGAATAGAGATATTATTTCCTCTTAAAAAAAATGAATAAAATAAAATGTAGAATTGGATTTTCATTGTCAGTTTATCCAATATTCACTGTGCAGCCTTTTCTGCTTCTTTGGGGTGCGATCTTCACCATAATACTTGAACTTTGCTTTTACGTATCAGCTTTTGTGAGCATGTATATGTTTCAGGTCAACACATTGCTGTCTTCTGCTCGTATGCGCCTTCATAATGCAAGAGAAGAAAGAGAGCATTTTGATGAAGCGAGTAGCCAAATGCTTGTGCACTTGAAGACAAAGGTAATCAAGCTGTTGGTTGTGCTCTTGTTCGAACTAATTTCTTCTTTCAATAAAGTGTCAAATTGAATGCCTTTTTTTTTTCTTCTCCCTTTCATTAGTTTCTCTAATGATTTTCTTTGGTTATAGGAAGATGAGCTGTTCAAATCTGTTGCTTCATATAAAGTAGAAGCCACTGCTGTAAATGCATGTAAAAACTTTTTAGAGAATACATGGGACCTACGGATATCCCAAAGACAAAAAACGGAGGAGCTTGTTGAGTATGTTAACCTTTTTGGTTAAAATATTATTTGAGTTACGTTGCTTGTTTTGGAACCAAAGAGATGGAATCAGCTACCATGTCTTTCCCTTTAAGTTCTTTCTATTCTAGTATCAGTAAACTGCTTGATCCTTTGTTTCTTCACTTTTTCAAGTGGTGAGCTGGAAAAATATGGAGATTATTTTGTGAAGTTGGTCATCAGCCTTCTCTCTTCTTACAAGGTACCAGTTATAAGGCTATCTAGTTCACTGATTTTATCTTCAAGAGATGATTTACTGTTTTCTGATTGTCGTGCACATTGGCCTACTATTCTCATTAGGAAAAACTGGAGCCCTCACTTTCTTCAATCAGGAAACTTGAGGAAAACTTGAGCTCGATGAAAGAGTATAGTTCTTCTGTTAAAAAAAAAAAACTATTCTTATTGAGAATAACTTTGATATAATTAACCAGGCTGCTGAGACCAACTTTCCTCAGGTCAGATGTCTCGCCCAATACAGATGATGGAAGCTTACATGTCGATCAACAACGGAGAAAACTCGAAGAGGAATATTTGGATATCGAATCCAAGGTAGTATTATGAATGCTTAGAACTTTCTGTTCACTCATAGGGTGTTCATTTGATCCCGTATTTCAAAACTATGTTGCATTTTCCACTCACAATCTCTAATTTGTGGCTATGTTTTGCTTTATGTCTTTGTATTGCATTATCTAATTTAATCTCTCTGGTGTGTCTCTTTCCTGGCTTTTCTTTACTGGCCAGTTTGTTTCCACCTTAAGTACTGTGGATACAGTACGAATGCAATTCTATGAAACAAAAGGAGTTGTCAGGTACTAAAATAGTGTTGACTAGACAAGGAGTTTTCTACTTTTATCTGCCCCTTGACTACTTGACTAATAATTTAGGCACTTTTGGCCACACGCTGAACTTTTGATCCAGGAATTTTGACGAGAAGGTACAAGAGTTATTTGATGCGCTTGAAAAGATAAAACAAGAATTTGAATCCATCAAGAGACCAAAACTGTTGATTGAAACCACAAGGCAAAGGTCAGAGTTAGCGGTCAATGAAAAGTCACATATAGATTCAAGTTCATCTGAACAGACAGCCGAAGTTCGAAGACTAAAATTTGAAGATATTAATGACTCCTTAGCAAAGGGAACAAAGAATTTTAGCCTGAAAGCAGAAATGGCAAAACCAGATTCGACTGAGGAGGTCGATACAGTTGACTCAAACGAGGAGATAAATGACTGGGAATTTGATGAACTTGGAAGGGACTATGATGCAGCAGCTTCCAACGACCAAAGAAGATGA
mRNA sequence
ATGTCGTGGTTTAGAGCAGCGGTGATCAGGGCGGTGGAAGCGGGTGCCGGAGGGAAGGACAATATCACCCGCACTGTCCGTAACTATGCTGGCACCGTCGTTCACCATGCAGGAAATGCTGTTGTGGAGGGCGCCAAGATCATCCAAGACCGCATTGGACCAAGGAATATGCAGGGATTTAAACAGACTGTAAAAAGGCTGGAAGAAATTTCTGTTTCGTCTAGAGGCGTAGAAAGAGTTCAGTTATTGAGAAGGTGGCTAGTTGCGCTCAAAGAGGTGGATAGATTTTCGTCAGGTTCAATCGAGGGTGATATAAACAGTCCGACAGATCAGCCTAACGATGAAACTAAAGATTCTCCGAAAAATCCTACACTGGTCTATTATGTAGACCCAGATATGGGAGGCGAGCTGAAAACGTTTCGTGATGTATTTCTTACAAGCCAAGCTCTAGAAGGCATTACGTTGTCTATGGTTCTGCAAGAAATACTTTCACAAGTTCAATTATACTCCAAGTTGGAGGAGCTCTTACTAAAGAAGAAGCTGTTCAACGAAGTGGATTCCCCACAGCTTCATGCTGAGAAGGTTGACAAACTGAGAATTTTGTCAGAATCTTTGGCCAATTCTACCTTGAAAGCTGAAAAGCGTATTGTTGACCAAAGAGAGCAGAGGGAGGATGCACTCAACTTTCGGTTAGCTAAGTCCGAGGAAATGGTCCAAGCTGAGAAGGAATTGACATATGATATTAGAGAACTTGAGAACCAAAAGGATAAACTGGAAGCAGAACTGGAAAAGGTCAACACATTGCTGTCTTCTGCTCGTATGCGCCTTCATAATGCAAGAGAAGAAAGAGAGCATTTTGATGAAGCGAGTAGCCAAATGCTTGTGCACTTGAAGACAAAGGAAGATGAGCTGTTCAAATCTGTTGCTTCATATAAAGTAGAAGCCACTGCTGTAAATGCATGTAAAAACTTTTTAGAGAATACATGGGACCTACGGATATCCCAAAGACAAAAAACGGAGGAGCTTGTTGAGCTGCTGAGACCAACTTTCCTCAGGTCAGATGTCTCGCCCAATACAGATGATGGAAGCTTACATGTCGATCAACAACGGAGAAAACTCGAAGAGGAATATTTGGATATCGAATCCAAGTTTGTTTCCACCTTAAGTACTGTGGATACAGTACGAATGCAATTCTATGAAACAAAAGGAGTTGTCAGGAATTTTGACGAGAAGGTACAAGAGTTATTTGATGCGCTTGAAAAGATAAAACAAGAATTTGAATCCATCAAGAGACCAAAACTGTTGATTGAAACCACAAGGCAAAGGTCAGAGTTAGCGGTCAATGAAAAGTCACATATAGATTCAAGTTCATCTGAACAGACAGCCGAAGTTCGAAGACTAAAATTTGAAGATATTAATGACTCCTTAGCAAAGGGAACAAAGAATTTTAGCCTGAAAGCAGAAATGGCAAAACCAGATTCGACTGAGGAGGTCGATACAGTTGACTCAAACGAGGAGATAAATGACTGGGAATTTGATGAACTTGGAAGGGACTATGATGCAGCAGCTTCCAACGACCAAAGAAGATGA
Coding sequence (CDS)
ATGTCGTGGTTTAGAGCAGCGGTGATCAGGGCGGTGGAAGCGGGTGCCGGAGGGAAGGACAATATCACCCGCACTGTCCGTAACTATGCTGGCACCGTCGTTCACCATGCAGGAAATGCTGTTGTGGAGGGCGCCAAGATCATCCAAGACCGCATTGGACCAAGGAATATGCAGGGATTTAAACAGACTGTAAAAAGGCTGGAAGAAATTTCTGTTTCGTCTAGAGGCGTAGAAAGAGTTCAGTTATTGAGAAGGTGGCTAGTTGCGCTCAAAGAGGTGGATAGATTTTCGTCAGGTTCAATCGAGGGTGATATAAACAGTCCGACAGATCAGCCTAACGATGAAACTAAAGATTCTCCGAAAAATCCTACACTGGTCTATTATGTAGACCCAGATATGGGAGGCGAGCTGAAAACGTTTCGTGATGTATTTCTTACAAGCCAAGCTCTAGAAGGCATTACGTTGTCTATGGTTCTGCAAGAAATACTTTCACAAGTTCAATTATACTCCAAGTTGGAGGAGCTCTTACTAAAGAAGAAGCTGTTCAACGAAGTGGATTCCCCACAGCTTCATGCTGAGAAGGTTGACAAACTGAGAATTTTGTCAGAATCTTTGGCCAATTCTACCTTGAAAGCTGAAAAGCGTATTGTTGACCAAAGAGAGCAGAGGGAGGATGCACTCAACTTTCGGTTAGCTAAGTCCGAGGAAATGGTCCAAGCTGAGAAGGAATTGACATATGATATTAGAGAACTTGAGAACCAAAAGGATAAACTGGAAGCAGAACTGGAAAAGGTCAACACATTGCTGTCTTCTGCTCGTATGCGCCTTCATAATGCAAGAGAAGAAAGAGAGCATTTTGATGAAGCGAGTAGCCAAATGCTTGTGCACTTGAAGACAAAGGAAGATGAGCTGTTCAAATCTGTTGCTTCATATAAAGTAGAAGCCACTGCTGTAAATGCATGTAAAAACTTTTTAGAGAATACATGGGACCTACGGATATCCCAAAGACAAAAAACGGAGGAGCTTGTTGAGCTGCTGAGACCAACTTTCCTCAGGTCAGATGTCTCGCCCAATACAGATGATGGAAGCTTACATGTCGATCAACAACGGAGAAAACTCGAAGAGGAATATTTGGATATCGAATCCAAGTTTGTTTCCACCTTAAGTACTGTGGATACAGTACGAATGCAATTCTATGAAACAAAAGGAGTTGTCAGGAATTTTGACGAGAAGGTACAAGAGTTATTTGATGCGCTTGAAAAGATAAAACAAGAATTTGAATCCATCAAGAGACCAAAACTGTTGATTGAAACCACAAGGCAAAGGTCAGAGTTAGCGGTCAATGAAAAGTCACATATAGATTCAAGTTCATCTGAACAGACAGCCGAAGTTCGAAGACTAAAATTTGAAGATATTAATGACTCCTTAGCAAAGGGAACAAAGAATTTTAGCCTGAAAGCAGAAATGGCAAAACCAGATTCGACTGAGGAGGTCGATACAGTTGACTCAAACGAGGAGATAAATGACTGGGAATTTGATGAACTTGGAAGGGACTATGATGCAGCAGCTTCCAACGACCAAAGAAGATGA
Protein sequence
MSWFRAAVIRAVEAGAGGKDNITRTVRNYAGTVVHHAGNAVVEGAKIIQDRIGPRNMQGFKQTVKRLEEISVSSRGVERVQLLRRWLVALKEVDRFSSGSIEGDINSPTDQPNDETKDSPKNPTLVYYVDPDMGGELKTFRDVFLTSQALEGITLSMVLQEILSQVQLYSKLEELLLKKKLFNEVDSPQLHAEKVDKLRILSESLANSTLKAEKRIVDQREQREDALNFRLAKSEEMVQAEKELTYDIRELENQKDKLEAELEKVNTLLSSARMRLHNAREEREHFDEASSQMLVHLKTKEDELFKSVASYKVEATAVNACKNFLENTWDLRISQRQKTEELVELLRPTFLRSDVSPNTDDGSLHVDQQRRKLEEEYLDIESKFVSTLSTVDTVRMQFYETKGVVRNFDEKVQELFDALEKIKQEFESIKRPKLLIETTRQRSELAVNEKSHIDSSSSEQTAEVRRLKFEDINDSLAKGTKNFSLKAEMAKPDSTEEVDTVDSNEEINDWEFDELGRDYDAAASNDQRR
Homology
BLAST of Csor.00g200600 vs. NCBI nr
Match:
KAG6603685.1 (hypothetical protein SDJN03_04294, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 984 bits (2544), Expect = 0.0
Identity = 529/529 (100.00%), Postives = 529/529 (100.00%), Query Frame = 0
Query: 1 MSWFRAAVIRAVEAGAGGKDNITRTVRNYAGTVVHHAGNAVVEGAKIIQDRIGPRNMQGF 60
MSWFRAAVIRAVEAGAGGKDNITRTVRNYAGTVVHHAGNAVVEGAKIIQDRIGPRNMQGF
Sbjct: 1 MSWFRAAVIRAVEAGAGGKDNITRTVRNYAGTVVHHAGNAVVEGAKIIQDRIGPRNMQGF 60
Query: 61 KQTVKRLEEISVSSRGVERVQLLRRWLVALKEVDRFSSGSIEGDINSPTDQPNDETKDSP 120
KQTVKRLEEISVSSRGVERVQLLRRWLVALKEVDRFSSGSIEGDINSPTDQPNDETKDSP
Sbjct: 61 KQTVKRLEEISVSSRGVERVQLLRRWLVALKEVDRFSSGSIEGDINSPTDQPNDETKDSP 120
Query: 121 KNPTLVYYVDPDMGGELKTFRDVFLTSQALEGITLSMVLQEILSQVQLYSKLEELLLKKK 180
KNPTLVYYVDPDMGGELKTFRDVFLTSQALEGITLSMVLQEILSQVQLYSKLEELLLKKK
Sbjct: 121 KNPTLVYYVDPDMGGELKTFRDVFLTSQALEGITLSMVLQEILSQVQLYSKLEELLLKKK 180
Query: 181 LFNEVDSPQLHAEKVDKLRILSESLANSTLKAEKRIVDQREQREDALNFRLAKSEEMVQA 240
LFNEVDSPQLHAEKVDKLRILSESLANSTLKAEKRIVDQREQREDALNFRLAKSEEMVQA
Sbjct: 181 LFNEVDSPQLHAEKVDKLRILSESLANSTLKAEKRIVDQREQREDALNFRLAKSEEMVQA 240
Query: 241 EKELTYDIRELENQKDKLEAELEKVNTLLSSARMRLHNAREEREHFDEASSQMLVHLKTK 300
EKELTYDIRELENQKDKLEAELEKVNTLLSSARMRLHNAREEREHFDEASSQMLVHLKTK
Sbjct: 241 EKELTYDIRELENQKDKLEAELEKVNTLLSSARMRLHNAREEREHFDEASSQMLVHLKTK 300
Query: 301 EDELFKSVASYKVEATAVNACKNFLENTWDLRISQRQKTEELVELLRPTFLRSDVSPNTD 360
EDELFKSVASYKVEATAVNACKNFLENTWDLRISQRQKTEELVELLRPTFLRSDVSPNTD
Sbjct: 301 EDELFKSVASYKVEATAVNACKNFLENTWDLRISQRQKTEELVELLRPTFLRSDVSPNTD 360
Query: 361 DGSLHVDQQRRKLEEEYLDIESKFVSTLSTVDTVRMQFYETKGVVRNFDEKVQELFDALE 420
DGSLHVDQQRRKLEEEYLDIESKFVSTLSTVDTVRMQFYETKGVVRNFDEKVQELFDALE
Sbjct: 361 DGSLHVDQQRRKLEEEYLDIESKFVSTLSTVDTVRMQFYETKGVVRNFDEKVQELFDALE 420
Query: 421 KIKQEFESIKRPKLLIETTRQRSELAVNEKSHIDSSSSEQTAEVRRLKFEDINDSLAKGT 480
KIKQEFESIKRPKLLIETTRQRSELAVNEKSHIDSSSSEQTAEVRRLKFEDINDSLAKGT
Sbjct: 421 KIKQEFESIKRPKLLIETTRQRSELAVNEKSHIDSSSSEQTAEVRRLKFEDINDSLAKGT 480
Query: 481 KNFSLKAEMAKPDSTEEVDTVDSNEEINDWEFDELGRDYDAAASNDQRR 529
KNFSLKAEMAKPDSTEEVDTVDSNEEINDWEFDELGRDYDAAASNDQRR
Sbjct: 481 KNFSLKAEMAKPDSTEEVDTVDSNEEINDWEFDELGRDYDAAASNDQRR 529
BLAST of Csor.00g200600 vs. NCBI nr
Match:
XP_022950927.1 (myosin-6-like [Cucurbita moschata])
HSP 1 Score: 903 bits (2333), Expect = 0.0
Identity = 525/686 (76.53%), Postives = 526/686 (76.68%), Query Frame = 0
Query: 1 MSWFRAAVIRAVEAGAGGKDNITRTVRNYAGTVVHHAGNAVVEGAKIIQDRIGPRNMQGF 60
MSWFRAAVIRAVEAGAGGKDNITRTVRNYAGTVVHHAGNAVVEGAKIIQDRIGPRNMQGF
Sbjct: 1 MSWFRAAVIRAVEAGAGGKDNITRTVRNYAGTVVHHAGNAVVEGAKIIQDRIGPRNMQGF 60
Query: 61 KQTVKRLEEISVSSRGVERVQLLRRWLVALKEVDRFSSGSIEGDINSPTDQPNDETKDSP 120
KQTVKRLEEISVSSRGVERVQLLRRWLVALKEVDRFSSGSIEGD NSPTDQPNDETKDSP
Sbjct: 61 KQTVKRLEEISVSSRGVERVQLLRRWLVALKEVDRFSSGSIEGDKNSPTDQPNDETKDSP 120
Query: 121 KNPTLVYYVDPDMGGELKTFRDVFLTSQALEGITLSM----------------------- 180
KNPTLVYYVDPDMGGELKTFRDVFLTSQALEGITLSM
Sbjct: 121 KNPTLVYYVDPDMGGELKTFRDVFLTSQALEGITLSMILEEATDEEESLLLEIYGLCLLG 180
Query: 181 ------------------------------------------------------------ 240
Sbjct: 181 GKEVHQEIMMNVHNLANAFSEYQDEALVKREELLQHVQDAIAGLKINADFDRIDAKACSL 240
Query: 241 ------------------------------VLQEILSQVQLYSKLEELLLKKKLFNEVDS 300
VLQEILSQVQLYSKLEELLLKKKLFNEVDS
Sbjct: 241 KKTLDEKKGEMPPSSGDRDNTSDDKTTSSKVLQEILSQVQLYSKLEELLLKKKLFNEVDS 300
Query: 301 PQLHAEKVDKLRILSESLANSTLKAEKRIVDQREQREDALNFRLAKSEEMVQAEKELTYD 360
PQLHAEKVDKLRILSESLANSTLKAEKRIVDQREQREDALNFRLAKSEEMVQAEKELTYD
Sbjct: 301 PQLHAEKVDKLRILSESLANSTLKAEKRIVDQREQREDALNFRLAKSEEMVQAEKELTYD 360
Query: 361 IRELENQKDKLEAELEKVNTLLSSARMRLHNAREEREHFDEASSQMLVHLKTKEDELFKS 420
IRELENQKDKLEAELEKVNTLLSSARMRLHNAREEREHFDEASSQMLVHLKTKEDELFKS
Sbjct: 361 IRELENQKDKLEAELEKVNTLLSSARMRLHNAREEREHFDEASSQMLVHLKTKEDELFKS 420
Query: 421 VASYKVEATAVNACKNFLENTWDLRISQRQKTEELV------------------------ 480
VASYKVEATAVNACKNFLENTW+LRISQRQKTEELV
Sbjct: 421 VASYKVEATAVNACKNFLENTWELRISQRQKTEELVDGELEKYGDYFVKLVISLLSSYKE 480
Query: 481 --------------------ELLRPTFLRSDVSPNTDDGSLHVDQQRRKLEEEYLDIESK 529
ELLRPTFLRSDVSPNTDDGSLHVDQQRRKLEEEYLDIESK
Sbjct: 481 KLEPSLSSIRKLEENLSSMKELLRPTFLRSDVSPNTDDGSLHVDQQRRKLEEEYLDIESK 540
BLAST of Csor.00g200600 vs. NCBI nr
Match:
XP_023544996.1 (myosin-6-like [Cucurbita pepo subsp. pepo])
HSP 1 Score: 887 bits (2293), Expect = 0.0
Identity = 515/678 (75.96%), Postives = 520/678 (76.70%), Query Frame = 0
Query: 1 MSWFRAAVIRAVEAGAGGKDNITRTVRNYAGTVVHHAGNAVVEGAKIIQDRIGPRNMQGF 60
MSWFRAAVIRAVEAGAGGKDNITRTVRNYA TVVHHAGNAVVEGAKIIQDRIGPRNMQGF
Sbjct: 1 MSWFRAAVIRAVEAGAGGKDNITRTVRNYADTVVHHAGNAVVEGAKIIQDRIGPRNMQGF 60
Query: 61 KQTVKRLEEISVSSRGVERVQLLRRWLVALKEVDRFSSGSIEGDINSPTDQPNDETKDSP 120
KQTVKRLEEISVSSRGVERVQLLRRWLVALKEVDRFSSGS EGD NSPTDQPNDET+DSP
Sbjct: 61 KQTVKRLEEISVSSRGVERVQLLRRWLVALKEVDRFSSGSSEGDKNSPTDQPNDETRDSP 120
Query: 121 KNPTLVYYVDPDMGGELKTFRDVFLTSQALEGITLSM----------------------- 180
KNPTLVYYVDPDMGGELKTFRDVFLTSQALEGITLSM
Sbjct: 121 KNPTLVYYVDPDMGGELKTFRDVFLTSQALEGITLSMILEEATDEEESLLLEIYGLCLLG 180
Query: 181 ------------------------------------------------------------ 240
Sbjct: 181 GKEVHQEIMMNVHNLANAFSEYQDEALVKREELLQHVQDAIAGLKINADFDRIDAKACSL 240
Query: 241 ------------------------------VLQEILSQVQLYSKLEELLLKKKLFNEVDS 300
VLQEILSQVQLYSKLEELLLKKKLFNEVDS
Sbjct: 241 KKTLDEKKGEMPPSSGDRDNTSDDKTTSSKVLQEILSQVQLYSKLEELLLKKKLFNEVDS 300
Query: 301 PQLHAEKVDKLRILSESLANSTLKAEKRIVDQREQREDALNFRLAKSEEMVQAEKELTYD 360
PQLHAEKVDKLRILSESLANSTLKAEKRIVDQREQREDALNFRLAKSEEMVQAEKELTYD
Sbjct: 301 PQLHAEKVDKLRILSESLANSTLKAEKRIVDQREQREDALNFRLAKSEEMVQAEKELTYD 360
Query: 361 IRELENQKDKLEAELEKVNTLLSSARMRLHNAREEREHFDEASSQMLVHLKTKEDELFKS 420
IRELENQKD LEAELEKVNTLLSSARMRLHNAREEREHFDEASSQMLVHLKTKEDELFKS
Sbjct: 361 IRELENQKDHLEAELEKVNTLLSSARMRLHNAREEREHFDEASSQMLVHLKTKEDELFKS 420
Query: 421 VASYKVEATAVNACKNFLENTWDLRISQRQKTEELV-----------------------E 480
VASYKVEATAVNACKNFLENTWDLRISQRQKTEELV E
Sbjct: 421 VASYKVEATAVNACKNFLENTWDLRISQRQKTEELVDGELEKYGDYFVKLVISLLSSYKE 480
Query: 481 LLRPTFL-------------RSDVSPNTDDGSLHVDQQRRKLEEEYLDIESKFVSTLSTV 529
L P+ SDVSPNTDDGSLHVDQ+RRKLEEEYLDIESKFVSTLSTV
Sbjct: 481 KLEPSLSSIRKLEENLSSMKESDVSPNTDDGSLHVDQERRKLEEEYLDIESKFVSTLSTV 540
BLAST of Csor.00g200600 vs. NCBI nr
Match:
XP_022978391.1 (myosin-6-like [Cucurbita maxima])
HSP 1 Score: 880 bits (2275), Expect = 0.0
Identity = 516/686 (75.22%), Postives = 518/686 (75.51%), Query Frame = 0
Query: 1 MSWFRAAVIRAVEAGAGGKDNITRTVRNYAGTVVHHAGNAVVEGAKIIQDRIGPRNMQGF 60
MSWFRAAVIRAVEAGAGGKDNITRTVRNYAGTVVHHAGNAVVEGAKIIQDRIG RNMQGF
Sbjct: 1 MSWFRAAVIRAVEAGAGGKDNITRTVRNYAGTVVHHAGNAVVEGAKIIQDRIGSRNMQGF 60
Query: 61 KQTVKRLEEISVSSRGVERVQLLRRWLVALKEVDRFSSGSIEGDINSPTDQPNDETKDSP 120
KQTVKRLEEISVSSRGVERVQLLRRWLVALKEVDRFSSGSIEGD NSPTDQ NDETKDSP
Sbjct: 61 KQTVKRLEEISVSSRGVERVQLLRRWLVALKEVDRFSSGSIEGDKNSPTDQLNDETKDSP 120
Query: 121 KNPTLVYYVDPDMGGELKTFRDVFLTSQALEGITLSM----------------------- 180
KNPTLVYYVDPDMGGELKTFRDVFLTSQALEGITL M
Sbjct: 121 KNPTLVYYVDPDMGGELKTFRDVFLTSQALEGITLHMILEEATDEEESLLLEIYGLCLLG 180
Query: 181 ------------------------------------------------------------ 240
Sbjct: 181 GKEVHQEIMMNVHNLANAFSEYQDEALVKREELLQHVQDAIAGLKINADFDRIDAKACSL 240
Query: 241 ------------------------------VLQEILSQVQLYSKLEELLLKKKLFNEVDS 300
VLQEILSQVQLYSKLEELLLKKKLFNEVDS
Sbjct: 241 KKTLDEKKGEMPPSSGDRDNTSDDKTTSSKVLQEILSQVQLYSKLEELLLKKKLFNEVDS 300
Query: 301 PQLHAEKVDKLRILSESLANSTLKAEKRIVDQREQREDALNFRLAKSEEMVQAEKELTYD 360
PQLHAEKVDKLRILSESLANSTLKAEKRIVDQREQREDALNFRLAKSEEMVQAEKELTYD
Sbjct: 301 PQLHAEKVDKLRILSESLANSTLKAEKRIVDQREQREDALNFRLAKSEEMVQAEKELTYD 360
Query: 361 IRELENQKDKLEAELEKVNTLLSSARMRLHNAREEREHFDEASSQMLVHLKTKEDELFKS 420
IRELENQKD+LEAELEKVNTLLSSARMRLHNAREEREHFDEASSQMLVHLKTKEDELFKS
Sbjct: 361 IRELENQKDQLEAELEKVNTLLSSARMRLHNAREEREHFDEASSQMLVHLKTKEDELFKS 420
Query: 421 VASYKVEATAVNACKNFLENTWDLRISQRQKTEELV------------------------ 480
VASYKVEATAVNACKN LENTWDLRISQRQKTEELV
Sbjct: 421 VASYKVEATAVNACKNLLENTWDLRISQRQKTEELVDGELEKYGDYFVKLVISLLSSYKE 480
Query: 481 --------------------ELLRPTFLRSDVSPNTDDGSLHVDQQRRKLEEEYLDIESK 529
ELLRPTFLRSDVSPNTDDGSLHVDQQRRKLEEEYL IESK
Sbjct: 481 KLEPSLSSIRKLEENLSSMKELLRPTFLRSDVSPNTDDGSLHVDQQRRKLEEEYLYIESK 540
BLAST of Csor.00g200600 vs. NCBI nr
Match:
KAG7033862.1 (hypothetical protein SDJN02_03587 [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 848 bits (2190), Expect = 1.95e-303
Identity = 511/732 (69.81%), Postives = 515/732 (70.36%), Query Frame = 0
Query: 1 MSWFRAAVIRAVEAGAGGKDNITRTVRNYAGTVVHHAGNAVVEGAKIIQDRIGPRNMQGF 60
MSWFRAAVIRAVEAGAGGKDNITRTVRNYAGTVVHHAGNAVVEGAKIIQDRIGPRNMQGF
Sbjct: 1 MSWFRAAVIRAVEAGAGGKDNITRTVRNYAGTVVHHAGNAVVEGAKIIQDRIGPRNMQGF 60
Query: 61 KQTVKRLEEISVSSRGVERVQLLRRWLVALKEVDRFSSGSIEGDINSPTDQPNDETKDSP 120
KQTVKRLEEISVSSRGVERVQLLRRWLVALKEVDRFSSGSIEGDINSPTDQPNDETKDSP
Sbjct: 61 KQTVKRLEEISVSSRGVERVQLLRRWLVALKEVDRFSSGSIEGDINSPTDQPNDETKDSP 120
Query: 121 KNPTLVYYVDPDMGGELKTFRDVFLTSQALEGITLSM----------------------- 180
KNPTLVYYVDPDMGGELKTFRDVFLTSQALEGITLSM
Sbjct: 121 KNPTLVYYVDPDMGGELKTFRDVFLTSQALEGITLSMILEEATDEEESLLLEIYGLCLLG 180
Query: 181 ------------------------------------------------------------ 240
Sbjct: 181 GKEVHQEIMMNVHNLANAFSEYQDEALVKREELLQHVQDAIAGLKINADFDRIDAKACSL 240
Query: 241 ------------------------------VLQEILSQVQLYSKLEELLLKKKLFNEVDS 300
VLQEILSQVQLYSKLEELLLKKKLFNEVDS
Sbjct: 241 KKTLDEKKGEMPPSSGARDNTSDDKTTSSKVLQEILSQVQLYSKLEELLLKKKLFNEVDS 300
Query: 301 PQLHAEKVDKLRILSESLANSTLKAEKRIVDQREQREDALNFRLAKSEEMVQAEKELTY- 360
PQLHAEKVDKLRILSESLANSTLKAEKRIVDQREQREDALNFRLAKSEEMVQAEKE+ +
Sbjct: 301 PQLHAEKVDKLRILSESLANSTLKAEKRIVDQREQREDALNFRLAKSEEMVQAEKEILFS 360
Query: 361 ----------------------------------------DIRELENQKDKLEAELE--- 420
DI E+ N K L E E
Sbjct: 361 AKNGENPRQLNRNGDKEHSLSPPPSSPQLAFFNNLIIAEDDIWEILNLKKMLATEFEIKN 420
Query: 421 ----------KVNTLLSSARMRLHNAREEREHFDEASSQMLVHLKTKEDELFKSVASYKV 480
KVNTLLSSARMRLHNAREEREHFDEASSQMLVHLKTKEDELFKSVASYKV
Sbjct: 421 LRNLRYSLGMKVNTLLSSARMRLHNAREEREHFDEASSQMLVHLKTKEDELFKSVASYKV 480
Query: 481 EATAVNACKNFLENTWDLRISQRQKTEELV-----------------------ELLRPTF 529
EATAVNACKNFLENTWDLRISQRQKTEELV E L P+
Sbjct: 481 EATAVNACKNFLENTWDLRISQRQKTEELVDGELEKYGDYFVKLVISLLSSYKEKLEPSL 540
BLAST of Csor.00g200600 vs. ExPASy TrEMBL
Match:
A0A6J1GG85 (myosin-6-like OS=Cucurbita moschata OX=3662 GN=LOC111453873 PE=4 SV=1)
HSP 1 Score: 903 bits (2333), Expect = 0.0
Identity = 525/686 (76.53%), Postives = 526/686 (76.68%), Query Frame = 0
Query: 1 MSWFRAAVIRAVEAGAGGKDNITRTVRNYAGTVVHHAGNAVVEGAKIIQDRIGPRNMQGF 60
MSWFRAAVIRAVEAGAGGKDNITRTVRNYAGTVVHHAGNAVVEGAKIIQDRIGPRNMQGF
Sbjct: 1 MSWFRAAVIRAVEAGAGGKDNITRTVRNYAGTVVHHAGNAVVEGAKIIQDRIGPRNMQGF 60
Query: 61 KQTVKRLEEISVSSRGVERVQLLRRWLVALKEVDRFSSGSIEGDINSPTDQPNDETKDSP 120
KQTVKRLEEISVSSRGVERVQLLRRWLVALKEVDRFSSGSIEGD NSPTDQPNDETKDSP
Sbjct: 61 KQTVKRLEEISVSSRGVERVQLLRRWLVALKEVDRFSSGSIEGDKNSPTDQPNDETKDSP 120
Query: 121 KNPTLVYYVDPDMGGELKTFRDVFLTSQALEGITLSM----------------------- 180
KNPTLVYYVDPDMGGELKTFRDVFLTSQALEGITLSM
Sbjct: 121 KNPTLVYYVDPDMGGELKTFRDVFLTSQALEGITLSMILEEATDEEESLLLEIYGLCLLG 180
Query: 181 ------------------------------------------------------------ 240
Sbjct: 181 GKEVHQEIMMNVHNLANAFSEYQDEALVKREELLQHVQDAIAGLKINADFDRIDAKACSL 240
Query: 241 ------------------------------VLQEILSQVQLYSKLEELLLKKKLFNEVDS 300
VLQEILSQVQLYSKLEELLLKKKLFNEVDS
Sbjct: 241 KKTLDEKKGEMPPSSGDRDNTSDDKTTSSKVLQEILSQVQLYSKLEELLLKKKLFNEVDS 300
Query: 301 PQLHAEKVDKLRILSESLANSTLKAEKRIVDQREQREDALNFRLAKSEEMVQAEKELTYD 360
PQLHAEKVDKLRILSESLANSTLKAEKRIVDQREQREDALNFRLAKSEEMVQAEKELTYD
Sbjct: 301 PQLHAEKVDKLRILSESLANSTLKAEKRIVDQREQREDALNFRLAKSEEMVQAEKELTYD 360
Query: 361 IRELENQKDKLEAELEKVNTLLSSARMRLHNAREEREHFDEASSQMLVHLKTKEDELFKS 420
IRELENQKDKLEAELEKVNTLLSSARMRLHNAREEREHFDEASSQMLVHLKTKEDELFKS
Sbjct: 361 IRELENQKDKLEAELEKVNTLLSSARMRLHNAREEREHFDEASSQMLVHLKTKEDELFKS 420
Query: 421 VASYKVEATAVNACKNFLENTWDLRISQRQKTEELV------------------------ 480
VASYKVEATAVNACKNFLENTW+LRISQRQKTEELV
Sbjct: 421 VASYKVEATAVNACKNFLENTWELRISQRQKTEELVDGELEKYGDYFVKLVISLLSSYKE 480
Query: 481 --------------------ELLRPTFLRSDVSPNTDDGSLHVDQQRRKLEEEYLDIESK 529
ELLRPTFLRSDVSPNTDDGSLHVDQQRRKLEEEYLDIESK
Sbjct: 481 KLEPSLSSIRKLEENLSSMKELLRPTFLRSDVSPNTDDGSLHVDQQRRKLEEEYLDIESK 540
BLAST of Csor.00g200600 vs. ExPASy TrEMBL
Match:
A0A6J1IMJ3 (myosin-6-like OS=Cucurbita maxima OX=3661 GN=LOC111478393 PE=4 SV=1)
HSP 1 Score: 880 bits (2275), Expect = 0.0
Identity = 516/686 (75.22%), Postives = 518/686 (75.51%), Query Frame = 0
Query: 1 MSWFRAAVIRAVEAGAGGKDNITRTVRNYAGTVVHHAGNAVVEGAKIIQDRIGPRNMQGF 60
MSWFRAAVIRAVEAGAGGKDNITRTVRNYAGTVVHHAGNAVVEGAKIIQDRIG RNMQGF
Sbjct: 1 MSWFRAAVIRAVEAGAGGKDNITRTVRNYAGTVVHHAGNAVVEGAKIIQDRIGSRNMQGF 60
Query: 61 KQTVKRLEEISVSSRGVERVQLLRRWLVALKEVDRFSSGSIEGDINSPTDQPNDETKDSP 120
KQTVKRLEEISVSSRGVERVQLLRRWLVALKEVDRFSSGSIEGD NSPTDQ NDETKDSP
Sbjct: 61 KQTVKRLEEISVSSRGVERVQLLRRWLVALKEVDRFSSGSIEGDKNSPTDQLNDETKDSP 120
Query: 121 KNPTLVYYVDPDMGGELKTFRDVFLTSQALEGITLSM----------------------- 180
KNPTLVYYVDPDMGGELKTFRDVFLTSQALEGITL M
Sbjct: 121 KNPTLVYYVDPDMGGELKTFRDVFLTSQALEGITLHMILEEATDEEESLLLEIYGLCLLG 180
Query: 181 ------------------------------------------------------------ 240
Sbjct: 181 GKEVHQEIMMNVHNLANAFSEYQDEALVKREELLQHVQDAIAGLKINADFDRIDAKACSL 240
Query: 241 ------------------------------VLQEILSQVQLYSKLEELLLKKKLFNEVDS 300
VLQEILSQVQLYSKLEELLLKKKLFNEVDS
Sbjct: 241 KKTLDEKKGEMPPSSGDRDNTSDDKTTSSKVLQEILSQVQLYSKLEELLLKKKLFNEVDS 300
Query: 301 PQLHAEKVDKLRILSESLANSTLKAEKRIVDQREQREDALNFRLAKSEEMVQAEKELTYD 360
PQLHAEKVDKLRILSESLANSTLKAEKRIVDQREQREDALNFRLAKSEEMVQAEKELTYD
Sbjct: 301 PQLHAEKVDKLRILSESLANSTLKAEKRIVDQREQREDALNFRLAKSEEMVQAEKELTYD 360
Query: 361 IRELENQKDKLEAELEKVNTLLSSARMRLHNAREEREHFDEASSQMLVHLKTKEDELFKS 420
IRELENQKD+LEAELEKVNTLLSSARMRLHNAREEREHFDEASSQMLVHLKTKEDELFKS
Sbjct: 361 IRELENQKDQLEAELEKVNTLLSSARMRLHNAREEREHFDEASSQMLVHLKTKEDELFKS 420
Query: 421 VASYKVEATAVNACKNFLENTWDLRISQRQKTEELV------------------------ 480
VASYKVEATAVNACKN LENTWDLRISQRQKTEELV
Sbjct: 421 VASYKVEATAVNACKNLLENTWDLRISQRQKTEELVDGELEKYGDYFVKLVISLLSSYKE 480
Query: 481 --------------------ELLRPTFLRSDVSPNTDDGSLHVDQQRRKLEEEYLDIESK 529
ELLRPTFLRSDVSPNTDDGSLHVDQQRRKLEEEYL IESK
Sbjct: 481 KLEPSLSSIRKLEENLSSMKELLRPTFLRSDVSPNTDDGSLHVDQQRRKLEEEYLYIESK 540
BLAST of Csor.00g200600 vs. ExPASy TrEMBL
Match:
A0A1S3BI64 (uncharacterized protein LOC103489834 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103489834 PE=4 SV=1)
HSP 1 Score: 748 bits (1931), Expect = 3.43e-265
Identity = 447/682 (65.54%), Postives = 488/682 (71.55%), Query Frame = 0
Query: 1 MSWFRAAVIRAVEAGAGGKDNITRTVRNYAGTVVHHAGNAVVEGAKIIQDRIGPRNMQGF 60
MSW RAAVIRAVEAGAGGKDNITRTVRN AGTVV+HAGNAVVEGAKIIQDRIGPRNMQGF
Sbjct: 1 MSWLRAAVIRAVEAGAGGKDNITRTVRNVAGTVVYHAGNAVVEGAKIIQDRIGPRNMQGF 60
Query: 61 KQTVKRLEEISVSSRGVERVQLLRRWLVALKEVDRFSSGSIEGDINSPTDQPNDETKDSP 120
KQTVKRLEEISVSSRG+ERVQLLRRWLVALKEVDRFS GSIEG NSPTDQ N+E KDSP
Sbjct: 61 KQTVKRLEEISVSSRGIERVQLLRRWLVALKEVDRFSLGSIEGGKNSPTDQLNEENKDSP 120
Query: 121 KNPTLVYYVDPDMGGELKTFRDVFLTSQALEGITLSMVL--------------------- 180
K PTLVYYVDPDMGGELKTFRDVFLTSQALEGITLSM+L
Sbjct: 121 KKPTLVYYVDPDMGGELKTFRDVFLTSQALEGITLSMILEEPNDEEESLLLEIYGLCLSG 180
Query: 181 ------------------------------------------------------------ 240
Sbjct: 181 GKEVRQAVMTSVHNLAKAFSEYQDEILVKREELLQYVQDAIAGLKINADFDRIDAKACSL 240
Query: 241 --------------------------------QEILSQVQLYSKLEELLLKKKLFNEVDS 300
QEILSQVQL SKLEELLLKKKLFN+ DS
Sbjct: 241 KETLDENHEELPLSREDQDNTSDGETRASKILQEILSQVQLCSKLEELLLKKKLFNDGDS 300
Query: 301 PQLHAEKVDKLRILSESLANSTLKAEKRIVDQREQREDALNFRLAKSEEMVQAEKELTYD 360
PQLHAEKV+KLRILSESLANSTLKAEKRIVD REQ+E+ALNFR+AKS+EMVQAEKELT D
Sbjct: 301 PQLHAEKVEKLRILSESLANSTLKAEKRIVDHREQKEEALNFRVAKSKEMVQAEKELTDD 360
Query: 361 IRELENQKDKLEAELEKVNTLLSSARMRLHNAREEREHFDEASSQMLVHLKTKEDELFKS 420
I ELENQKD+LEAEL+KVNTLLS+ARMRLHNAREEREHFDEAS+Q+LVHLKTKEDELFKS
Sbjct: 361 IGELENQKDRLEAELKKVNTLLSAARMRLHNAREEREHFDEASNQILVHLKTKEDELFKS 420
Query: 421 VASYKVEATAVNACKNFLENTWDLRISQRQKTEELVE----------------------- 480
VASYKVEA AVNACKNFLE+TW+L+ISQRQ EE V+
Sbjct: 421 VASYKVEAGAVNACKNFLEHTWNLQISQRQLKEEHVDGELEKYGDYFVKLVISLLSSYKG 480
Query: 481 LLRP-------------TFLRSDVSPNTDDGSLHVDQQRRKLEEEYLDIESKFVSTLSTV 529
L P + SDVSP+ DD SL+V +QRRKLEEEYLD+ESKFVSTLSTV
Sbjct: 481 KLEPALSCIRKLEENLSSMKESDVSPDIDDRSLNVHKQRRKLEEEYLDMESKFVSTLSTV 540
BLAST of Csor.00g200600 vs. ExPASy TrEMBL
Match:
A0A1S3BI59 (myosin-11 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103489834 PE=4 SV=1)
HSP 1 Score: 743 bits (1919), Expect = 2.36e-263
Identity = 447/683 (65.45%), Postives = 488/683 (71.45%), Query Frame = 0
Query: 1 MSWFRAAVIRAVEAGAGGKDNITRTVRNYAGTVVHHAGNAVVEGAKIIQDRIGPRNMQGF 60
MSW RAAVIRAVEAGAGGKDNITRTVRN AGTVV+HAGNAVVEGAKIIQDRIGPRNMQGF
Sbjct: 1 MSWLRAAVIRAVEAGAGGKDNITRTVRNVAGTVVYHAGNAVVEGAKIIQDRIGPRNMQGF 60
Query: 61 KQTVKRLEEISVSSRGVERVQLLRRWLVALKEVDRFSSGSIEGDINSPTDQPNDETKDSP 120
KQTVKRLEEISVSSRG+ERVQLLRRWLVALKEVDRFS GSIEG NSPTDQ N+E KDSP
Sbjct: 61 KQTVKRLEEISVSSRGIERVQLLRRWLVALKEVDRFSLGSIEGGKNSPTDQLNEENKDSP 120
Query: 121 KNPTLVYYVDPDMGGELKTFRDVFLTSQALEGITLSMVL--------------------- 180
K PTLVYYVDPDMGGELKTFRDVFLTSQALEGITLSM+L
Sbjct: 121 KKPTLVYYVDPDMGGELKTFRDVFLTSQALEGITLSMILEEPNDEEESLLLEIYGLCLSG 180
Query: 181 ------------------------------------------------------------ 240
Sbjct: 181 GKEVRQAVMTSVHNLAKAFSEYQDEILVKREELLQYVQDAIAGLKINADFDRIDAKACSL 240
Query: 241 --------------------------------QEILSQVQLYSKLEELLLKKKLFNEVDS 300
QEILSQVQL SKLEELLLKKKLFN+ DS
Sbjct: 241 KETLDENHEELPLSREDQDNTSDGETRASKILQEILSQVQLCSKLEELLLKKKLFNDGDS 300
Query: 301 PQLHAEKVDKLRILSESLANSTLKAEKRIVDQ-REQREDALNFRLAKSEEMVQAEKELTY 360
PQLHAEKV+KLRILSESLANSTLKAEKRIVD REQ+E+ALNFR+AKS+EMVQAEKELT
Sbjct: 301 PQLHAEKVEKLRILSESLANSTLKAEKRIVDHSREQKEEALNFRVAKSKEMVQAEKELTD 360
Query: 361 DIRELENQKDKLEAELEKVNTLLSSARMRLHNAREEREHFDEASSQMLVHLKTKEDELFK 420
DI ELENQKD+LEAEL+KVNTLLS+ARMRLHNAREEREHFDEAS+Q+LVHLKTKEDELFK
Sbjct: 361 DIGELENQKDRLEAELKKVNTLLSAARMRLHNAREEREHFDEASNQILVHLKTKEDELFK 420
Query: 421 SVASYKVEATAVNACKNFLENTWDLRISQRQKTEELVE---------------------- 480
SVASYKVEA AVNACKNFLE+TW+L+ISQRQ EE V+
Sbjct: 421 SVASYKVEAGAVNACKNFLEHTWNLQISQRQLKEEHVDGELEKYGDYFVKLVISLLSSYK 480
Query: 481 -LLRP-------------TFLRSDVSPNTDDGSLHVDQQRRKLEEEYLDIESKFVSTLST 529
L P + SDVSP+ DD SL+V +QRRKLEEEYLD+ESKFVSTLST
Sbjct: 481 GKLEPALSCIRKLEENLSSMKESDVSPDIDDRSLNVHKQRRKLEEEYLDMESKFVSTLST 540
BLAST of Csor.00g200600 vs. ExPASy TrEMBL
Match:
A0A6J1CRU2 (uncharacterized protein LOC111013735 isoform X2 OS=Momordica charantia OX=3673 GN=LOC111013735 PE=4 SV=1)
HSP 1 Score: 685 bits (1767), Expect = 1.82e-240
Identity = 422/683 (61.79%), Postives = 471/683 (68.96%), Query Frame = 0
Query: 1 MSWFRAAVIRAVEAGAGGKDNITRTVRNYAGTVVHHAGNAVVEGAKIIQDRIGPRNMQGF 60
MSW RAAVI+AVEAGAGGKDN+TRTVR+YAGTVV+HAGNAVVEGAKIIQDRIGPRNMQGF
Sbjct: 1 MSWLRAAVIKAVEAGAGGKDNLTRTVRSYAGTVVYHAGNAVVEGAKIIQDRIGPRNMQGF 60
Query: 61 KQTVKRLEEISVSSRGVERVQLLRRWLVALKEVDRFSSGSIEGDINSPTDQPNDETKDSP 120
KQTVKRLEEISVSSRG+ERVQLLRRWLVALKEVDRFS GSIEG+ NSPTDQ NDE KDSP
Sbjct: 61 KQTVKRLEEISVSSRGMERVQLLRRWLVALKEVDRFSLGSIEGNKNSPTDQLNDENKDSP 120
Query: 121 KNPTLVYYVDPDMGGELKTFRDVFLTSQALEGITLSMVLQE------------------- 180
K PTLVYYVDPDMGGEL+TFRDVFLTSQALEGITLSM+L+E
Sbjct: 121 KKPTLVYYVDPDMGGELRTFRDVFLTSQALEGITLSMILEEPNDEEESLLLEIYGLCLSG 180
Query: 181 ------------------------------------------------------------ 240
Sbjct: 181 GKEIRQAVMTSVHNLAKAFSEYQDEVLVKREELLQYVQDAISGLKINADFDRIDAKACSL 240
Query: 241 ----------------------------------ILSQVQLYSKLEELLLKKKLFNEVDS 300
ILSQVQL SKLEELL KKKLFNE DS
Sbjct: 241 KETLDEKKEELTPSRGDHDDSLNDETTTSKILKEILSQVQLCSKLEELLRKKKLFNEGDS 300
Query: 301 PQLHAEKVDKLRILSESLANSTLKAEKRIVDQREQREDALNFRLAKSEEMVQAEKELTYD 360
PQLHAEKV+KLR+LSESLANSTLKAEKRIVD REQ+E+ALNFR+AKS+EMVQAEKE T +
Sbjct: 301 PQLHAEKVEKLRVLSESLANSTLKAEKRIVDHREQKEEALNFRVAKSKEMVQAEKEFTDE 360
Query: 361 IRELENQKDKLEAELEKVNTLLSSARMRLHNAREEREHFDEASSQMLVHLKTKEDELFKS 420
I ELE QKD+LEAEL+KVNTLLSSARMRLHNAREEREHFDEAS+Q+LVHLKTKEDELFKS
Sbjct: 361 IGELEKQKDQLEAELKKVNTLLSSARMRLHNAREEREHFDEASNQILVHLKTKEDELFKS 420
Query: 421 VASYKVEATAVNACKNFLENTWDLRISQRQKTEELV-----------------------E 480
VASYKVEA AVN+C +FLE+TW+L+ISQRQ E+ V E
Sbjct: 421 VASYKVEAGAVNSCIHFLEHTWNLQISQRQLKEKNVDGELEKYGDYFMKLVISLLSSYKE 480
Query: 481 LLRPTFL-------------RSDVSPNTDDGSLHVDQQRRKLEEEYLDIESKFVSTLSTV 529
L P+ SD SPN DDGSL+VD+QRRKLEEEYLDIESKFVSTLSTV
Sbjct: 481 KLEPSLSCIRKLEENLCSMKESDASPNIDDGSLNVDKQRRKLEEEYLDIESKFVSTLSTV 540
BLAST of Csor.00g200600 vs. TAIR 10
Match:
AT2G37370.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G13560.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )
HSP 1 Score: 368.6 bits (945), Expect = 8.3e-102
Identity = 268/679 (39.47%), Postives = 369/679 (54.34%), Query Frame = 0
Query: 1 MSWFRAAVIRAVEAGAGGKDNITRTVRNYAGTVVHHAGNAVVEGAKIIQDRIGPRNMQGF 60
MSW R+AV +AVE GGK+NITRTVRNYA +VV AGNAV EGAK+IQDRIG RN++ F
Sbjct: 1 MSWLRSAVNKAVE--VGGKNNITRTVRNYADSVVLTAGNAVSEGAKLIQDRIGSRNVKSF 60
Query: 61 KQTVKRLEEISVSSRGVERVQLLRRWLVALKEVDRFSSGSIEGDINSPTDQPNDETKDSP 120
VKRLEE+SVSSRG ERVQLLRRWLVAL+E++R S + + N TD + +++DSP
Sbjct: 61 SLAVKRLEEVSVSSRGSERVQLLRRWLVALREIERMSYSCFDNN-NHKTDD-HTQSEDSP 120
Query: 121 KNPTLVYYVDPDMGGELKTFRDVFLTSQALEGITLSMVLQ-------------------- 180
KN + VYYVDP + GE TFRDVFL S+ALEG+ LSM+L+
Sbjct: 121 KNFSTVYYVDPGLPGEPMTFRDVFLHSEALEGMVLSMILEAPNEEEVQLLLELFGLCLSG 180
Query: 181 ------------------------------------------------------------ 240
Sbjct: 181 EKEVHEAVIQNVQDLATVFLKYKDEVLAKREELLQYVQGAIGGLKLSADIARIDIEAHTL 240
Query: 241 -------------------------------EILSQVQLYSKLEELLLKKKLFNEVDSPQ 300
EIL QV+ +SKLE LLL+KK + D+ Q
Sbjct: 241 MEKLDKTKVKVLEHASSEDASKTAASTEALREILEQVRTFSKLEALLLRKKSLHNGDTLQ 300
Query: 301 LHAEKVDKLRILSESLANSTLKAEKRIVDQREQREDALNFRLAKSEEMVQAEKELTYDIR 360
H EKVDKL++LSESL NST KAEKRI+D R Q+E+AL++R++K+ E+ Q EK++ +++
Sbjct: 301 RHIEKVDKLKVLSESLLNSTSKAEKRIMDHRSQKEEALSYRVSKTTEVGQLEKDVAAELK 360
Query: 361 ELENQKDKLEAELEKVNTLLSSARMRLHNAREEREHFDEASSQMLVHLKTKEDELFKSVA 420
+LE K+ LEAEL++VNT ++SAR RL NA+EERE FD AS+++L+HLK+KE+EL +S+
Sbjct: 361 KLEILKEDLEAELKRVNTSITSARARLRNAQEEREQFDNASNEILMHLKSKEEELTRSIT 420
Query: 421 SYKVEATAVNACKNFLENTWDLRISQRQKTEE----------------LVELLRPTFLRS 480
S +VEA VN FLE+TW L+ Q+ + +V+LL +F +
Sbjct: 421 SCRVEADVVNKWIKFLEDTWILQSKFSQQKDNQVSGEMERYSDHFIDLIVQLL--SFYKE 480
Query: 481 DVSPN----------------------TDDGSLHVDQQRRKLEEEYLDIESKFVSTLSTV 526
+ P D+ + R++LE+EYLD+E+KFV+TLS V
Sbjct: 481 QLDPYIPKIRGVVASLEPSKGLEAEKIIDNKDTKPFESRKQLEKEYLDLEAKFVTTLSVV 540
BLAST of Csor.00g200600 vs. TAIR 10
Match:
AT5G13560.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G37370.1); Has 12055 Blast hits to 8846 proteins in 811 species: Archae - 217; Bacteria - 1046; Metazoa - 6104; Fungi - 1115; Plants - 528; Viruses - 14; Other Eukaryotes - 3031 (source: NCBI BLink). )
HSP 1 Score: 323.2 bits (827), Expect = 4.0e-88
Identity = 247/682 (36.22%), Postives = 346/682 (50.73%), Query Frame = 0
Query: 1 MSWFRAAVIRAVEAGAGGKDNITRTVRNYAGTVVHHAGNAVVEGAKIIQDRIGPRNMQGF 60
MSW R AV +AVE G + NITRTV+NYA +VV HAG AV EGAK+ QDRIG +
Sbjct: 1 MSWLRTAVNKAVE--VGNRKNITRTVKNYADSVVQHAGQAVAEGAKLFQDRIGVGAYKSV 60
Query: 61 KQTVKRLEEISVSSRGVERVQLLRRWLVALKEVDRFSSGSIEGDINSPTDQPNDETKDSP 120
QT++RLEE +VS RG ER L+ RWL LKE+DR + S++ S +Q D
Sbjct: 61 HQTIQRLEEAAVSYRGQERALLITRWLSVLKEIDRATDSSLKDKQLSSEEQ---LASDEA 120
Query: 121 KNPTLVYYVDPDMGGELKTFRDVFLTSQALEGITLSMVLQ-------------------- 180
K V Y DPD+GGE FRDVFL SQALEGI LSM+++
Sbjct: 121 KKREWVLYYDPDIGGEPLNFRDVFLQSQALEGIVLSMIIEPPHDEEITLLLEMFGLCLNG 180
Query: 181 ------------------------------------------------------------ 240
Sbjct: 181 GKEVHDAIVSSMQDLATVFSSYKDEVLVKQDELLQFAQNAITGLKINAEMLRIDAEASDL 240
Query: 241 --------------------------------EILSQVQLYSKLEELLLKKKLFNEVDSP 300
E L++++L S+LE LL++K+ + DSP
Sbjct: 241 RKKLEKMNASQIPQESEDKEHKETPLTIEAFKETLAKIRLCSRLEGLLIRKRQLSNGDSP 300
Query: 301 QLHAEKVDKLRILSESLANSTLKAEKRIVDQREQREDALNFRLAKSEEMVQAEKELTYDI 360
+HA+KVDKLR+L ESLANST KAEKRI + R Q+E+AL R+ K+ E + EKEL +I
Sbjct: 301 DIHAQKVDKLRVLLESLANSTSKAEKRISENRLQKEEALKARVVKANETGEKEKELGAEI 360
Query: 361 RELENQKDKLEAELEKVNTLLSSARMRLHNAREEREHFDEASSQMLVHLKTKEDELFKSV 420
+LE Q+D+LEA+L++VN L++A+ R NA EER+ F EA++Q++ HLKTK+D+L KSV
Sbjct: 361 AQLEKQRDELEADLKRVNLSLAAAQARFRNATEERDQFGEANNQIIAHLKTKDDDLSKSV 420
Query: 421 ASYKVEATAVNACKNFLENTWDLRIS-----QRQKTEEL--------------------- 480
+ K EA + NFLE+TW L+ S +Q +EL
Sbjct: 421 VACKKEAEVIKTWINFLEDTWLLQCSHIETKDKQTLDELEKHEDYFSDVALNILSVYKKE 480
Query: 481 -----------VELLRPTFLRSDVSPNTDDGSLHVDQQRRKLEEEYLDIESKFVSTLSTV 519
VE L+ S+ PN D G V R+ LEEEY+D E+K ++T S V
Sbjct: 481 VAPLISRIENYVENLKNLGPGSEKPPNADQGDNQVSNPRKILEEEYIDYETKIITTFSIV 540
BLAST of Csor.00g200600 vs. TAIR 10
Match:
AT1G63300.1 (Myosin heavy chain-related protein )
HSP 1 Score: 43.1 bits (100), Expect = 8.0e-04
Identity = 72/330 (21.82%), Postives = 151/330 (45.76%), Query Frame = 0
Query: 176 LLKKKLFNEVDSPQLHAEKVDKLRILSESLA--NSTLKAEKRIVD---QREQREDALNFR 235
+L++K+ + + +++ D+L I E LA LK + + ++ Q ++ L +
Sbjct: 466 ILEQKITDLYNEIEIYKRDKDELEIQMEQLALDYEILKQQNHDISYKLEQSQLQEQLKIQ 525
Query: 236 LAKSEEMVQAEKELTYDIRELENQKDKLEAELEKVNTLLSSARMRLHNAREEREHFDEAS 295
S +V D+ ELENQ + LEAEL+K + S + R+ E
Sbjct: 526 YECSSSLV--------DVTELENQVESLEAELKKQSEEFSESLCRI----------KELE 585
Query: 296 SQMLVHLKTKEDELFKSVASYKVEATAVNACKNFLENTWDLRISQRQKTEELVELLRPT- 355
SQM +T E+E+ K ++ + AV K + Q Q+ + E LR T
Sbjct: 586 SQM----ETLEEEMEKQAQVFEADIDAVTRGK----------VEQEQRAIQAEETLRKTR 645
Query: 356 FLRSDVSPNTDDGSLHVDQQRRKLEEEYLDIESKFVSTLSTVDTVRMQFYETKGVVRNFD 415
+ + V+ D + +Q ++ + E + ++ + +RMQ + + ++++ +
Sbjct: 646 WKNASVAGKLQDEFKRLSEQ---MDSMFTSNEKMAMKAMTEANELRMQKRQLEEMIKDAN 705
Query: 416 EKVQ--------ELFDALEKIKQEFESIKRPKLLIETTRQRSELAVNEKSHIDSSSSEQT 475
++++ +L + EK+ + ++R ++E ++S N+K H + ++
Sbjct: 706 DELRANQAEYEAKLHELSEKLSFKTSQMER---MLENLDEKSNEIDNQKRHEEDVTANLN 755
Query: 476 AEVRRLKFEDINDSLAKGTKNFSLKAEMAK 492
E++ LK E+I ++L K + L+AE A+
Sbjct: 766 QEIKILK-EEI-ENLKKNQDSLMLQAEQAE 755
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A6J1GG85 | 0.0 | 76.53 | myosin-6-like OS=Cucurbita moschata OX=3662 GN=LOC111453873 PE=4 SV=1 | [more] |
A0A6J1IMJ3 | 0.0 | 75.22 | myosin-6-like OS=Cucurbita maxima OX=3661 GN=LOC111478393 PE=4 SV=1 | [more] |
A0A1S3BI64 | 3.43e-265 | 65.54 | uncharacterized protein LOC103489834 isoform X2 OS=Cucumis melo OX=3656 GN=LOC10... | [more] |
A0A1S3BI59 | 2.36e-263 | 65.45 | myosin-11 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103489834 PE=4 SV=1 | [more] |
A0A6J1CRU2 | 1.82e-240 | 61.79 | uncharacterized protein LOC111013735 isoform X2 OS=Momordica charantia OX=3673 G... | [more] |
Match Name | E-value | Identity | Description | |
AT2G37370.1 | 8.3e-102 | 39.47 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... | [more] |
AT5G13560.1 | 4.0e-88 | 36.22 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... | [more] |
AT1G63300.1 | 8.0e-04 | 21.82 | Myosin heavy chain-related protein | [more] |