Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
CAGGAACATGTGAAGTTGAAATTCAAACGGTTACATTTTAATCAATAAATGATAAATCTTATGTTAATTATAAATTGAAACTCATGAAATTATAAATCGATTCAAATAAAAATAACGACTATAACACGTGGGAATGAAAACTAGAATTCACGTTCTTTTTTATCAGAAATATATGGTTTAATTCAAGGGATAATGGCAGGAGAGTAATTAGAATGAATCACGTGGACCAAATTATCCGCTCGCTAGTCCTCTCCGGGTCTCGTCTCAGACTCAGACTCAGACATAACCCAAATCCAATACTAATTACTCCCCCACCAACTGCTCGCTTTTTTCCTCTCTTCTCCGCCGTCATCCGCCCCTTGCCCTACCTCCGGCCGCCGCCTCCTCCGAGATCCACTCCTCTACGCCTTCCCTTTCTCTTTTCTTCTCTTTCCTTCTGCATTTCGTCTTCGCCGGGTTATCTCTGCTGTTTCACTCTCAATCCCTGATTCGGCTGTTCTTTCTTTTTTCCTTTTACTGTGGGTAATGCGTTGCTGTTAGGGTTTTGATTGCGCAGTTGAGTGGGGGATACCGCTGGAAGATTTGAAATTTAGGGTTTTTTCGGGGGGGAAAAATTTGACTTGGGGTTCTTTTGGTGTTTGTTGGGATTTCGAATTAGGGTTTTCAGTGTGATTCAATATGCCGAGGGGTTCGAGGCACAAATCTACTAGGCATGGTTTGAAGGATGCTAGGGACTCCTCCGACTCGGAGAATGATTCAAGCCTGAGGGATCGGAAGGGCAAAGAGAGTGGGAGTAGGGTATTCAAGGACTCGGCCTCTAGTGAGAAACGCAGATTCGATTCAAAGGATGCAAAAGACTTCTACGCTTCGGAGAATCTCGAGACGGAAGAGCATGGACATTCCAAGCGGCGCAAGGAAAGGTATGATGAGGGAACCACTGATAGGTGGAATGGGGGAAGCGACGATGAGCTTGGTGTTCCTTCTAAAAAGTCAAAACCATCGGTGGATTCAAAGAGCAAGAGAAGGGACGAGAGTGTGGGATTGCTGCAGTGTGATGGTGAAGAACTCAGGAAGAGTAGTGGTAAGGGTGAGGGAAGGCATCGGGAGTCCAGCCGGAAGGAGGGCAGGAATGGTGGAGGGGATAGGGATAGGGATAGGGAGAGGGAGAGGGAGAAGGAGAAGGAGAAGGAAAGGAAAGGAAGAGAAGGGAGAAGTGACAGAAGTGAAGAACACCGTGTAGAAAAGCAAGTGGAAAAGAACACAGGTCAGGCATCGAATCACCAGTTAACTCTATCATCACAGTCGCAACTCATGTATCCATTTCTCTTCCCAACTTAAGCATCCATGCGTTTTTCCATTGAACATGCTCTTTATCTTTACTGATTAGCATTCACGATTCTTTAAGTCTCGCTTCTGGGTATAAAGGGTTGATTTGTTTGCTGACGCTGGTTTTGTAGTGTTCTCTGTTGGTTCCATAATTTACTGATATGGTTGTACTTTTGTTTTTAGGCTGCCATATTTTAGGCTTTACATGCTCATTATTTTTTGGGTACTTAGGATACTATGATGGTAACCTCCTCGACCCCTTACGGGGTGGTCTTACACCCCGTCCCCTAGATTTCTGTACGTGGTGTGGAGTTAAAAAGGATGGTAGGATTACAGTAAGATCTTCTCATTTGCTGATTTGGTGAAGTATGAATGTAAAAGTCTCCTGAGTGTGTGCTTTGGTGAGATTCTGCTGTGAAGACTATCATACCATTGTTTAAACTTCGAGGAGGAGTTACGGAAGGACAACCAACCCTCTGGCCGCCTCTGCATTCCCATGGGAAAGAAGAGTTTTGTCTGTCGTGAATGGTTTAAAGGCACACCTTGGGTTGCTTTTCCTTCTTGCAGGAAACTTCACATAGAAAACCAAGCTCTGTCTGTTTGAGTCCTGCACCCAAAGAGTGTCTTCTCCCTGTCAAGTTCTAGGGAGTGAGTAGGTCAGCAAAATAATCTAC
CACCCATACTGCAAGCACTACCAAGGGAATTTGGATCTGTATGATTTACTCCTCTCCTCTAGCTCGACAATCTTACCCCTTCTTTCACAACTACTTTAGAGCCAAAGGTTGTGTGTTCAATCCTACAAAAATACAAAGGAATTTCCCTACCCTCCCTTTCGTAAGTCATCATGAAAGCTTCTCTTAGTATACAGCTTTTCTGCCTACAATCGTGTCATTGTTAATGGCACCATATATTTCAGGTTTTGACATACCTTATGCCAGCACACATAATGGTTTGATTGATTAACGGTGACTTTTGAGTTAATTTTAAGTGGGGATATAAGTTTCTAGATTTTATTCTATTTCTTCTTATGTCTCAATTTTCTAATGTAATGTATAATGTACAGCTTAAGAGTCTAAATTCTATTTCTCCTTTTAACTATGTTTTCAATTTCTATTTCCTTGCATTTCATTGGGGCTGTAGAATTCTGATTTTGTTTGATCAATGTTCAATTTTGTCACTTTACAAACTGATCAATATTCATCATATTTGAGCTGTGTATGTTCCTGATTTGCTAAATTGGTTAGTCTTAGGGATTGGATTATACTTGGTTTCGAAGAAGTATTGTTATTTTTTTTTTTAATTTCAATTTGTACTAATGTGCTTTAATTTTGTGCAAAATTCGGATCTGAAGGATTAATTAATTGCATTTTCTTCTACAGCTTTGCCTACTTTCCATATATCAGGATAACCAATTATTACCCTCTTTCTCTCATGCACATATTCCTGCACTAGGATATAGGTGAAAAAGTTTTATTTATCAGGTTTTGTGTTAGCCGGAGCTGATTGTGCCTGTTTGGTATCATTTGCTTATGGTATCGATGGATTGTTTGGGGCTCTATTGGAAATATTATTTTCCTTTTGGAATCTAATTCGTTTTGGTCAAAGAAATCTAATTTTATTTGAAAAAAATTATCATTTGCATTATTTAACATTATGATACTCTGGCTCCTAGGCAACTTGGAGTCACATGTCATGCTAAATATCTCCTATCAAGCTGTTCATCTTTTATTTGTTCTTTTTTCTAAAATTTAGTTGAAATATTACCCTTAAAAAGTAATCTCGTATGAATTTGGTATTATTTGTTTACATTTTTTGAAATGCTCTATTTTTAGTGCTGGTTTTCATACTTAGGAGAAGTCTTAACTATTCTTTTTATACTTCTTCCCATGGTCTATATCTTCTTTGATACCATTGTATCATTAACTGAGCATTAGTTTACAACTTATTATCATTTTATGGAAACTTGTGTAAACCATCTAAGTTGTGACGTGTTGCACCTTAAATGCAGATAATGTGTTGCAGAGCCCTGGACTAGAGAATCACCTGGAGACACGAGTTAGGAAGAGAGCTGGTTCTTTTGATGGGGATAAGCATAAAGATGATATAGGAGATGCTGAAAATAGACAGATTTCTTCAAAGAATGATGCTGTGAAGGATGGTAGACGAAAGAGTGAGAAGCACAAGGATGAGAGAAATAGGGAGAAGTACCGGGAAGATGCTGATAGGGATGGCAAGCAAAGAGATGAGCAACTTGTAAAAGATCACATTAGCAGGTCAAATGACAGAGATCTGAGAGATGAGAAAGATGCTATAGATATGCATCATAAGAGAAACAAGCCTCAAGATAGTGATCCTGATCGAGAGGTAACCAAGGCCAAACATGAAGGTGATTTAGATGCTAGGCGTGATCAAGATCACGATCGAGATCGCCATCATGCATATGAACGTGATCGTGATCATGATCAAGAGAGTAGGCGTAGACGCGACCGCGATCGTGGTCGTGACCGTGACCGTGACTATGATCGAGATGGGAGGCGAAATCGCAGTCGAAGTCGTGCTCGTGACCGTTACTCTGATTATGAATGTGATGTTGACCGTGATGGTTCACATTTTGAGGATCAATACACAAAATATGCTGATAGTAGGGGAAGGAAAAGATCTCCAAATGATCACG
TTGATTCTGTTGATGCTAGATCTAAGAGTTTGAAGAATAGTCACCATTCAAACGAAGAAAAGAAGTCTTTGAGCAATGATAAAGTGGACTCAGATGCTGAGAGAGGAAGATCTCAATCGCGATCACGTCATGCAGATGTTAGTTTAAGCAGCCATAGACGGAAGAATTCACCCAGTTCTCTGTCACGTGTTGGCATGGATGAATACAGGTTGCAGCACTTTTTTATTTTATTTTATTTTTATTGTAACTCTAGGTATATGGTGTGTTTGAAGTCCTTGTTGCATGAAAAAAATTGTTCAACGATGTTTCTTCATTTATTTGTTTGAAGTTCTCTGTCACGTGTTGGCATGAAAAAAGGCATATTTGCATCTATTTTTCTCTAGCTAGTTTCTATTGTTCTATGTATTACATTCTTCATTGTACTGCATTGATAGGTTAATAAGCAACTATGTATTGGTACCTCACATCTTTTATGCTTTTGCAATAGAATGACTTTTCTAATTTCTTTCAGTTATATTTGATAACTTATCTTCTGGTCGTCTGTCATTGACAAATGTAAAATTTTCAAGCGAGGAATTTTTTCAAAAGAAGGAGAAAGCAAAATATTTTGAGCAATTTCTTCTCCAACATATTTTCATTTGCGTTAAGATAAAGGGTTGTAGTTGTAAATCTGTTATGATTATGTGGTGTTTTTTTTTTTAATAATTATTGTTTTTATGGATTTAATTATAATAGCTGGTAGTTTTTCCTGCTAGGTTCCATATTAAGAGCTAGGCTGTTTGGATTTTCTTGGCTTTGAGTTTTAATTTTTTTATCAAGAGAAAGAGCAGAGAGAAGACAAAGAACATGTCAAGCCCACAAAAATGGAGCAAAACTTTTGTTTTTTCTATTTTTCCTTGTGAAAGGCAAAGGAATTAATGCACACTAGTTGGATCCGAAAAGGTACTGACAAGGGGGTGCCTTACAGACAGTAGTAGTGAAACTTTCTGTCCTCACTGTAGTAAACCATAAAGAAGCTCTTTGTTGTTTATTTATTTATTTATTTTTATAAAAGGAGTTCTTACCACATCTTCCTGCATCGTGTTATTCTTGATCTTGGTTAGGATCTGGGGTAGGAAAAGACAGGGTTCCTTGTTGTCTATGAAAAATTCTTTGTGCCTTACATCTTGTGGCACGAGTTTTCTTTTTCCTTTTTTTCCCAAAGGATTTCTTCTTGATATGAAACGGTGCAACGATTAACGACATGATTTTGTGCTATGCGTCCTTGTTCAACTCCTTAACTATGTCTCAGCCATAAAAGTTCATTGATTAATTTTGTCATGAGTTAGTCAAAAGGATTTATTATATTTTTAAGGATTCTAATTGGTTTGTTGTAAGTGGGTATTTGACTGCATATTGCAATAGTTAACAAAGCAATATTATCCATTTTTTCTCCATTTTAGGCATCAAGATCAGGAGGATTTGAGAGACCGATACCCTAAGAAAGAAGAAAGGTCCAAATCCATTTCTACTAGAGATAAAAGTGGTTTTTCAGGAGTACAAGAAAAGGGTTCCAAGTACACATATGTGGAGAAACCCAGTGAAGCAGACGGTGGCAATGCTATTGAGCTGTCACGAGAAAGGTCTTTAAATTCTAAGGTATCTATCACCATCAAGCTCATTGGAAAGATCTCTTTCATTGATGAATGACCTTGGACATGCTAACAGATCTTATGTTTTGCAGAATATCGACATTGAAGAAAGTGGACGAAGGTGCAGTACCTCAATTGATAACAAAGACCTCTCTTCTAATAAGGATAGGCTTAGCTGGGATTTACCAGGAGAGAAGCCTCTGATGGATGAGTCACCTCAGGCAGAGTCCTTCTATAGCAAAGCTAGTCAGAGCCATCCATCACCATTCCATCCACGCCCTGCTTTTAGGGGTGGACTTGACAGTCCTTTTGATGGTTCACTAGAAGATGATAGTAGACTCAATTCTAATGGTCGTTTCCGAAGGAATA
ATGATCAAAATTTGGGCAGAGTACATGGCAACACTTGGAGAGGTGTTCCAAACTGGACAGCACCACTACCAAATGGCTTTATCCCCTTCCAGCACGGACCTCCTCCTCATGGAAGTTTCCAGTCAATGATGCCACAGTTTCCAGCTCCCCCTTTGTTTGGTATCAGACCTCCACTTGAAATCAATCACTCTGGAATTCCCTATCGGATGCCTGATGCTGAAAGATTTTCCAGTCACATGCATCCACTAGGGTGGCAGAATATGTTGGATGGTTCAAGCCCTTCTCACTTACATGGATGGGATGGAAACAATGGTATCTTTAGGGATGAATCTCACATTTATGGTGGAGCTGAATGGGAAGAGAACAGGCAGATGGTGAATGGTCGAGGATGGGAGTCCAAAGCTGACATGTGGAAGAGACAGAGTGGTGGCCCGAAAAGGGAATTGCCTTCCCAATTCCAGAAGGATGAGCGTTTGGTGCAGGATCCTGTTGATGATGTATCTAGTAGAGAGGCTTGTGATGAGAGCACCAATACTATTTTGACAAAAACTGTTGAAATGAGGCCTAATATCCCTTCTGCAAAAGAAAGTCCCAACACTCCTGAACTTCTCTCTGAAACACCGGCTCCAGTTAGACGGTCAATGGACGATAATTCTAAACTTAGTTGTTCTTACCTTTCTAAGCTTAAGATTTCCGCAGAACTTGCACATCCTGATTTGTACCACCAGTGTCAGAGGTTAATGGATATTGAGAACTGTGCAACTGCAGATGAGGAAACTGCTGCTTACATAGTTCTCGAGGTAAAGTCCTGGACAAAGTTTCATCATGCCACAATACCACCTATGTCTGTTTATGCCTTTCTAATGATTATATGAATTTAATTTGTTATTTATGATTGCACTATTATCTTCTTGACAGGGGGGCATGAGAGCCGTGTTCATCTCTTCAAATGGTGTGCATCAATCTCTTTCCCATCCAAACAAGAACCCAGGTTTTCAGGTATAATACATGGCTGCATTATTTGTAAGCATTATAAGTGTCCGTTTTCCTCATGTGTTCAAAAGTTGAACGGCTACATAACTGTTGTTTTAGTTATAGTTTATGATATTTGATCATCGTCCTGATTGGTTCTATGCTTCTTTTGTTCGTAAAACTTTGACTTGTACATGTTAAAGATATTAATGGTTGTGAATGATGCTCAGAAGTTATATAGCACTTCATAATTGTGGTGTTGAAAATTTTGCAGCGTGCAATGGACTTGTACAAAAAGCAGAGAATGGAAATGAAGGAAATGAAAGTTGTTTCTGGGGGAAAATTGGATGGTATTTTGGCATCCTCTGAGAGGAGGCTTGAAGAGCAAGGCTTGAATTTCAATAATGAAGAAGTTAAGGTTCCTGTTTCAACTGTTGGTGCGGAAATGGTGCAACCGCCCATATTGGCCACTGGTGATAAAGCAGTCGTTGAGTCGACTGCTGCATTGGGGAAATCGGAGGATTTGGCTTCAACTGCCAGTCAGGAGGAGGTGAAGTGTCTTGAAAACTCAGAGGAGACATTGCCAATTACCAAATCAACAGAAATGGATGTGATGGATTTGGAGCAGGAGCAGGTGAACTTAGACGTGGAAAAGGATACTGTCAAACCGAGCGACAATGTATCGGTCAATGACACCGATAAGGGGATTGTGAATGGCAAAGATTCTTGTTTTGACAATGCAGTGACTGTGAGTGGTCCTTTATCTTTTGCGGATGAAATACCCGAGACTTGTGAGGGTTTGATGCCTATGCCTATTTCAATTGGGTCTGAGTCATTAATTTTGAATAGGATACATCATTCTCCTGAAAGTACACATTGAAACAATTTTAATATCCGTCACCTTTTCTTATTTCTTATCTAGTTTTTAAGTTATTGATATTTCGATTCTTGTTGCTTCTTCTTCCTGCAAGGAATAAAATTTCCTACTGTTCTGCACCTTGGTGCCTTGTCTTGAGCGTGCTTTTCTT
TTTTCCTCAGTTTATATTTTTATGCTTCATAAGTTCCAG
mRNA sequence
CAGGAACATGTGAAGTTGAAATTCAAACGGTTACATTTTAATCAATAAATGATAAATCTTATGTTAATTATAAATTGAAACTCATGAAATTATAAATCGATTCAAATAAAAATAACGACTATAACACGTGGGAATGAAAACTAGAATTCACGTTCTTTTTTATCAGAAATATATGGTTTAATTCAAGGGATAATGGCAGGAGAGTAATTAGAATGAATCACGTGGACCAAATTATCCGCTCGCTAGTCCTCTCCGGGTCTCGTCTCAGACTCAGACTCAGACATAACCCAAATCCAATACTAATTACTCCCCCACCAACTGCTCGCTTTTTTCCTCTCTTCTCCGCCGTCATCCGCCCCTTGCCCTACCTCCGGCCGCCGCCTCCTCCGAGATCCACTCCTCTACGCCTTCCCTTTCTCTTTTCTTCTCTTTCCTTCTGCATTTCGTCTTCGCCGGGTTATCTCTGCTGTTTCACTCTCAATCCCTGATTCGGCTGTTCTTTCTTTTTTCCTTTTACTGTGGGTAATGCGTTGCTGTTAGGGTTTTGATTGCGCAGTTGAGTGGGGGATACCGCTGGAAGATTTGAAATTTAGGGTTTTTTCGGGGGGGAAAAATTTGACTTGGGGTTCTTTTGGTGTTTGTTGGGATTTCGAATTAGGGTTTTCAGTGTGATTCAATATGCCGAGGGGTTCGAGGCACAAATCTACTAGGCATGGTTTGAAGGATGCTAGGGACTCCTCCGACTCGGAGAATGATTCAAGCCTGAGGGATCGGAAGGGCAAAGAGAGTGGGAGTAGGGTATTCAAGGACTCGGCCTCTAGTGAGAAACGCAGATTCGATTCAAAGGATGCAAAAGACTTCTACGCTTCGGAGAATCTCGAGACGGAAGAGCATGGACATTCCAAGCGGCGCAAGGAAAGGTATGATGAGGGAACCACTGATAGGTGGAATGGGGGAAGCGACGATGAGCTTGGTGTTCCTTCTAAAAAGTCAAAACCATCGGTGGATTCAAAGAGCAAGAGAAGGGACGAGAGTGTGGGATTGCTGCAGTGTGATGGTGAAGAACTCAGGAAGAGTAGTGGTAAGGGTGAGGGAAGGCATCGGGAGTCCAGCCGGAAGGAGGGCAGGAATGGTGGAGGGGATAGGGATAGGGATAGGGAGAGGGAGAGGGAGAAGGAGAAGGAGAAGGAAAGGAAAGGAAGAGAAGGGAGAAGTGACAGAAGTGAAGAACACCGTGTAGAAAAGCAAGTGGAAAAGAACACAGATAATGTGTTGCAGAGCCCTGGACTAGAGAATCACCTGGAGACACGAGTTAGGAAGAGAGCTGGTTCTTTTGATGGGGATAAGCATAAAGATGATATAGGAGATGCTGAAAATAGACAGATTTCTTCAAAGAATGATGCTGTGAAGGATGGTAGACGAAAGAGTGAGAAGCACAAGGATGAGAGAAATAGGGAGAAGTACCGGGAAGATGCTGATAGGGATGGCAAGCAAAGAGATGAGCAACTTGTAAAAGATCACATTAGCAGGTCAAATGACAGAGATCTGAGAGATGAGAAAGATGCTATAGATATGCATCATAAGAGAAACAAGCCTCAAGATAGTGATCCTGATCGAGAGGTAACCAAGGCCAAACATGAAGGTGATTTAGATGCTAGGCGTGATCAAGATCACGATCGAGATCGCCATCATGCATATGAACGTGATCGTGATCATGATCAAGAGAGTAGGCGTAGACGCGACCGCGATCGTGGTCGTGACCGTGACCGTGACTATGATCGAGATGGGAGGCGAAATCGCAGTCGAAGTCGTGCTCGTGACCGTTACTCTGATTATGAATGTGATGTTGACCGTGATGGTTCACATTTTGAGGATCAATACACAAAATATGCTGATAGTAGGGGAAGGAAAAGATCTCCAAATGATCACGTTGATTCTGTTGATGCTAGATCTAAGAGTTTGAAGAATAGTCACCATTCAAACGAAGAAAAGAAGTCTTT
GAGCAATGATAAAGTGGACTCAGATGCTGAGAGAGGAAGATCTCAATCGCGATCACGTCATGCAGATGTTAGTTTAAGCAGCCATAGACGGAAGAATTCACCCAGTTCTCTGTCACGTGTTGGCATGGATGAATACAGGCATCAAGATCAGGAGGATTTGAGAGACCGATACCCTAAGAAAGAAGAAAGGTCCAAATCCATTTCTACTAGAGATAAAAGTGGTTTTTCAGGAGTACAAGAAAAGGGTTCCAAGTACACATATGTGGAGAAACCCAGTGAAGCAGACGGTGGCAATGCTATTGAGCTGTCACGAGAAAGGTCTTTAAATTCTAAGAATATCGACATTGAAGAAAGTGGACGAAGGTGCAGTACCTCAATTGATAACAAAGACCTCTCTTCTAATAAGGATAGGCTTAGCTGGGATTTACCAGGAGAGAAGCCTCTGATGGATGAGTCACCTCAGGCAGAGTCCTTCTATAGCAAAGCTAGTCAGAGCCATCCATCACCATTCCATCCACGCCCTGCTTTTAGGGGTGGACTTGACAGTCCTTTTGATGGTTCACTAGAAGATGATAGTAGACTCAATTCTAATGGTCGTTTCCGAAGGAATAATGATCAAAATTTGGGCAGAGTACATGGCAACACTTGGAGAGGTGTTCCAAACTGGACAGCACCACTACCAAATGGCTTTATCCCCTTCCAGCACGGACCTCCTCCTCATGGAAGTTTCCAGTCAATGATGCCACAGTTTCCAGCTCCCCCTTTGTTTGGTATCAGACCTCCACTTGAAATCAATCACTCTGGAATTCCCTATCGGATGCCTGATGCTGAAAGATTTTCCAGTCACATGCATCCACTAGGGTGGCAGAATATGTTGGATGGTTCAAGCCCTTCTCACTTACATGGATGGGATGGAAACAATGGTATCTTTAGGGATGAATCTCACATTTATGGTGGAGCTGAATGGGAAGAGAACAGGCAGATGGTGAATGGTCGAGGATGGGAGTCCAAAGCTGACATGTGGAAGAGACAGAGTGGTGGCCCGAAAAGGGAATTGCCTTCCCAATTCCAGAAGGATGAGCGTTTGGTGCAGGATCCTGTTGATGATGTATCTAGTAGAGAGGCTTGTGATGAGAGCACCAATACTATTTTGACAAAAACTGTTGAAATGAGGCCTAATATCCCTTCTGCAAAAGAAAGTCCCAACACTCCTGAACTTCTCTCTGAAACACCGGCTCCAGTTAGACGGTCAATGGACGATAATTCTAAACTTAGTTGTTCTTACCTTTCTAAGCTTAAGATTTCCGCAGAACTTGCACATCCTGATTTGTACCACCAGTGTCAGAGGTTAATGGATATTGAGAACTGTGCAACTGCAGATGAGGAAACTGCTGCTTACATAGTTCTCGAGGGGGGCATGAGAGCCGTGTTCATCTCTTCAAATGGTGTGCATCAATCTCTTTCCCATCCAAACAAGAACCCAGGTTTTCAGCGTGCAATGGACTTGTACAAAAAGCAGAGAATGGAAATGAAGGAAATGAAAGTTGTTTCTGGGGGAAAATTGGATGGTATTTTGGCATCCTCTGAGAGGAGGCTTGAAGAGCAAGGCTTGAATTTCAATAATGAAGAAGTTAAGGTTCCTGTTTCAACTGTTGGTGCGGAAATGGTGCAACCGCCCATATTGGCCACTGGTGATAAAGCAGTCGTTGAGTCGACTGCTGCATTGGGGAAATCGGAGGATTTGGCTTCAACTGCCAGTCAGGAGGAGGTGAAGTGTCTTGAAAACTCAGAGGAGACATTGCCAATTACCAAATCAACAGAAATGGATGTGATGGATTTGGAGCAGGAGCAGGTGAACTTAGACGTGGAAAAGGATACTGTCAAACCGAGCGACAATGTATCGGTCAATGACACCGATAAGGGGATTGTGAATGGCAAAGATTCTTGTTTTGACAATGCAGTGACTGTGAGTGGTCCTTTATCTTTTGCGGATGAAATAC
CCGAGACTTGTGAGGGTTTGATGCCTATGCCTATTTCAATTGGGTCTGAGTCATTAATTTTGAATAGGATACATCATTCTCCTGAAAGTACACATTGAAACAATTTTAATATCCGTCACCTTTTCTTATTTCTTATCTAGTTTTTAAGTTATTGATATTTCGATTCTTGTTGCTTCTTCTTCCTGCAAGGAATAAAATTTCCTACTGTTCTGCACCTTGGTGCCTTGTCTTGAGCGTGCTTTTCTTTTTTCCTCAGTTTATATTTTTATGCTTCATAAGTTCCAG
Coding sequence (CDS)
ATGCCGAGGGGTTCGAGGCACAAATCTACTAGGCATGGTTTGAAGGATGCTAGGGACTCCTCCGACTCGGAGAATGATTCAAGCCTGAGGGATCGGAAGGGCAAAGAGAGTGGGAGTAGGGTATTCAAGGACTCGGCCTCTAGTGAGAAACGCAGATTCGATTCAAAGGATGCAAAAGACTTCTACGCTTCGGAGAATCTCGAGACGGAAGAGCATGGACATTCCAAGCGGCGCAAGGAAAGGTATGATGAGGGAACCACTGATAGGTGGAATGGGGGAAGCGACGATGAGCTTGGTGTTCCTTCTAAAAAGTCAAAACCATCGGTGGATTCAAAGAGCAAGAGAAGGGACGAGAGTGTGGGATTGCTGCAGTGTGATGGTGAAGAACTCAGGAAGAGTAGTGGTAAGGGTGAGGGAAGGCATCGGGAGTCCAGCCGGAAGGAGGGCAGGAATGGTGGAGGGGATAGGGATAGGGATAGGGAGAGGGAGAGGGAGAAGGAGAAGGAGAAGGAAAGGAAAGGAAGAGAAGGGAGAAGTGACAGAAGTGAAGAACACCGTGTAGAAAAGCAAGTGGAAAAGAACACAGATAATGTGTTGCAGAGCCCTGGACTAGAGAATCACCTGGAGACACGAGTTAGGAAGAGAGCTGGTTCTTTTGATGGGGATAAGCATAAAGATGATATAGGAGATGCTGAAAATAGACAGATTTCTTCAAAGAATGATGCTGTGAAGGATGGTAGACGAAAGAGTGAGAAGCACAAGGATGAGAGAAATAGGGAGAAGTACCGGGAAGATGCTGATAGGGATGGCAAGCAAAGAGATGAGCAACTTGTAAAAGATCACATTAGCAGGTCAAATGACAGAGATCTGAGAGATGAGAAAGATGCTATAGATATGCATCATAAGAGAAACAAGCCTCAAGATAGTGATCCTGATCGAGAGGTAACCAAGGCCAAACATGAAGGTGATTTAGATGCTAGGCGTGATCAAGATCACGATCGAGATCGCCATCATGCATATGAACGTGATCGTGATCATGATCAAGAGAGTAGGCGTAGACGCGACCGCGATCGTGGTCGTGACCGTGACCGTGACTATGATCGAGATGGGAGGCGAAATCGCAGTCGAAGTCGTGCTCGTGACCGTTACTCTGATTATGAATGTGATGTTGACCGTGATGGTTCACATTTTGAGGATCAATACACAAAATATGCTGATAGTAGGGGAAGGAAAAGATCTCCAAATGATCACGTTGATTCTGTTGATGCTAGATCTAAGAGTTTGAAGAATAGTCACCATTCAAACGAAGAAAAGAAGTCTTTGAGCAATGATAAAGTGGACTCAGATGCTGAGAGAGGAAGATCTCAATCGCGATCACGTCATGCAGATGTTAGTTTAAGCAGCCATAGACGGAAGAATTCACCCAGTTCTCTGTCACGTGTTGGCATGGATGAATACAGGCATCAAGATCAGGAGGATTTGAGAGACCGATACCCTAAGAAAGAAGAAAGGTCCAAATCCATTTCTACTAGAGATAAAAGTGGTTTTTCAGGAGTACAAGAAAAGGGTTCCAAGTACACATATGTGGAGAAACCCAGTGAAGCAGACGGTGGCAATGCTATTGAGCTGTCACGAGAAAGGTCTTTAAATTCTAAGAATATCGACATTGAAGAAAGTGGACGAAGGTGCAGTACCTCAATTGATAACAAAGACCTCTCTTCTAATAAGGATAGGCTTAGCTGGGATTTACCAGGAGAGAAGCCTCTGATGGATGAGTCACCTCAGGCAGAGTCCTTCTATAGCAAAGCTAGTCAGAGCCATCCATCACCATTCCATCCACGCCCTGCTTTTAGGGGTGGACTTGACAGTCCTTTTGATGGTTCACTAGAAGATGATAGTAGACTCAATTCTAATGGTCGTTTCCGAAGGAATAATGATCAAAATTTGGGCAGAGTACATGGCAACACTTGGAGAGGTGTTCCAAACTGGACAGCACCACT
ACCAAATGGCTTTATCCCCTTCCAGCACGGACCTCCTCCTCATGGAAGTTTCCAGTCAATGATGCCACAGTTTCCAGCTCCCCCTTTGTTTGGTATCAGACCTCCACTTGAAATCAATCACTCTGGAATTCCCTATCGGATGCCTGATGCTGAAAGATTTTCCAGTCACATGCATCCACTAGGGTGGCAGAATATGTTGGATGGTTCAAGCCCTTCTCACTTACATGGATGGGATGGAAACAATGGTATCTTTAGGGATGAATCTCACATTTATGGTGGAGCTGAATGGGAAGAGAACAGGCAGATGGTGAATGGTCGAGGATGGGAGTCCAAAGCTGACATGTGGAAGAGACAGAGTGGTGGCCCGAAAAGGGAATTGCCTTCCCAATTCCAGAAGGATGAGCGTTTGGTGCAGGATCCTGTTGATGATGTATCTAGTAGAGAGGCTTGTGATGAGAGCACCAATACTATTTTGACAAAAACTGTTGAAATGAGGCCTAATATCCCTTCTGCAAAAGAAAGTCCCAACACTCCTGAACTTCTCTCTGAAACACCGGCTCCAGTTAGACGGTCAATGGACGATAATTCTAAACTTAGTTGTTCTTACCTTTCTAAGCTTAAGATTTCCGCAGAACTTGCACATCCTGATTTGTACCACCAGTGTCAGAGGTTAATGGATATTGAGAACTGTGCAACTGCAGATGAGGAAACTGCTGCTTACATAGTTCTCGAGGGGGGCATGAGAGCCGTGTTCATCTCTTCAAATGGTGTGCATCAATCTCTTTCCCATCCAAACAAGAACCCAGGTTTTCAGCGTGCAATGGACTTGTACAAAAAGCAGAGAATGGAAATGAAGGAAATGAAAGTTGTTTCTGGGGGAAAATTGGATGGTATTTTGGCATCCTCTGAGAGGAGGCTTGAAGAGCAAGGCTTGAATTTCAATAATGAAGAAGTTAAGGTTCCTGTTTCAACTGTTGGTGCGGAAATGGTGCAACCGCCCATATTGGCCACTGGTGATAAAGCAGTCGTTGAGTCGACTGCTGCATTGGGGAAATCGGAGGATTTGGCTTCAACTGCCAGTCAGGAGGAGGTGAAGTGTCTTGAAAACTCAGAGGAGACATTGCCAATTACCAAATCAACAGAAATGGATGTGATGGATTTGGAGCAGGAGCAGGTGAACTTAGACGTGGAAAAGGATACTGTCAAACCGAGCGACAATGTATCGGTCAATGACACCGATAAGGGGATTGTGAATGGCAAAGATTCTTGTTTTGACAATGCAGTGACTGTGAGTGGTCCTTTATCTTTTGCGGATGAAATACCCGAGACTTGTGAGGGTTTGATGCCTATGCCTATTTCAATTGGGTCTGAGTCATTAATTTTGAATAGGATACATCATTCTCCTGAAAGTACACATTGA
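The mRNA sequence above carries untranslated sequence on both sides of the coding sequence. A minimal Python sketch of how the two listings relate, assuming the CDS occurs as one contiguous substring of the mRNA. The function name `split_transcript` and the short demonstration strings are hypothetical and are not part of the database record.

```python
def split_transcript(mrna, cds):
    """Split an mRNA into (5' UTR, CDS, 3' UTR) by locating the CDS inside it.

    Assumes the CDS is a contiguous substring of the mRNA; the first
    occurrence is used.
    """
    mrna = mrna.upper()
    cds = cds.upper()
    start = mrna.find(cds)
    if start < 0:
        raise ValueError("CDS not found in mRNA")
    end = start + len(cds)
    return mrna[:start], mrna[start:end], mrna[end:]


# Hypothetical toy transcript: 6 nt upstream, a 12 nt ORF, 6 nt downstream.
toy_cds = "ATGCCGAGGTGA"
toy_mrna = "TTCAAT" + toy_cds + "AACAAT"
utr5, cds, utr3 = split_transcript(toy_mrna, toy_cds)
print(len(utr5), len(cds), len(utr3))
```

Applied to the full mRNA and CDS strings from this section (with line breaks removed), the same call would report the lengths of the two untranslated regions.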
Protein sequence
MPRGSRHKSTRHGLKDARDSSDSENDSSLRDRKGKESGSRVFKDSASSEKRRFDSKDAKDFYASENLETEEHGHSKRRKERYDEGTTDRWNGGSDDELGVPSKKSKPSVDSKSKRRDESVGLLQCDGEELRKSSGKGEGRHRESSRKEGRNGGGDRDRDREREREKEKEKERKGREGRSDRSEEHRVEKQVEKNTDNVLQSPGLENHLETRVRKRAGSFDGDKHKDDIGDAENRQISSKNDAVKDGRRKSEKHKDERNREKYREDADRDGKQRDEQLVKDHISRSNDRDLRDEKDAIDMHHKRNKPQDSDPDREVTKAKHEGDLDARRDQDHDRDRHHAYERDRDHDQESRRRRDRDRGRDRDRDYDRDGRRNRSRSRARDRYSDYECDVDRDGSHFEDQYTKYADSRGRKRSPNDHVDSVDARSKSLKNSHHSNEEKKSLSNDKVDSDAERGRSQSRSRHADVSLSSHRRKNSPSSLSRVGMDEYRHQDQEDLRDRYPKKEERSKSISTRDKSGFSGVQEKGSKYTYVEKPSEADGGNAIELSRERSLNSKNIDIEESGRRCSTSIDNKDLSSNKDRLSWDLPGEKPLMDESPQAESFYSKASQSHPSPFHPRPAFRGGLDSPFDGSLEDDSRLNSNGRFRRNNDQNLGRVHGNTWRGVPNWTAPLPNGFIPFQHGPPPHGSFQSMMPQFPAPPLFGIRPPLEINHSGIPYRMPDAERFSSHMHPLGWQNMLDGSSPSHLHGWDGNNGIFRDESHIYGGAEWEENRQMVNGRGWESKADMWKRQSGGPKRELPSQFQKDERLVQDPVDDVSSREACDESTNTILTKTVEMRPNIPSAKESPNTPELLSETPAPVRRSMDDNSKLSCSYLSKLKISAELAHPDLYHQCQRLMDIENCATADEETAAYIVLEGGMRAVFISSNGVHQSLSHPNKNPGFQRAMDLYKKQRMEMKEMKVVSGGKLDGILASSERRLEEQGLNFNNEEVKVPVSTVGAEMVQPPILATGDKAVVESTAALGKSEDLASTASQEEVKCLENSEETLPITKSTEMDVMDLEQEQVNLDVEKDTVKPSDNVSVNDTDKGIVNGKDSCFDNAVTVSGPLSFADEIPETCEGLMPMPISIGSESLILNRIHHSPESTH
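The protein sequence follows from the coding sequence by codon-wise translation. The sketch below illustrates this with the standard genetic code, which is assumed here because the record does not name a translation table. `translate` is a hypothetical helper, and only the first and last few codons of the CDS are used as input.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons of the CDS listed above.
print(translate("ATGCCGAGGGGTTCGAGGCACAAATCTACT"))
# Last eight codons of the CDS, including the TGA stop codon.
print(translate("CATTCTCCTGAAAGTACACATTGA"))
```

The two fragments translate to the first ten and last seven residues of the protein sequence. As a consistency check, a protein of 1139 residues implies a CDS of 1139 × 3 + 3 = 3420 nucleotides once the stop codon is counted.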
Homology
BLAST of MC06g1386 vs. NCBI nr
Match:
XP_022158031.1 (uncharacterized protein LOC111024614 [Momordica charantia])
HSP 1 Score: 2154 bits (5582), Expect = 0.0
Identity = 1139/1139 (100.00%), Positives = 1139/1139 (100.00%), Query Frame = 0
Query: 1 MPRGSRHKSTRHGLKDARDSSDSENDSSLRDRKGKESGSRVFKDSASSEKRRFDSKDAKD 60
MPRGSRHKSTRHGLKDARDSSDSENDSSLRDRKGKESGSRVFKDSASSEKRRFDSKDAKD
Sbjct: 1 MPRGSRHKSTRHGLKDARDSSDSENDSSLRDRKGKESGSRVFKDSASSEKRRFDSKDAKD 60
Query: 61 FYASENLETEEHGHSKRRKERYDEGTTDRWNGGSDDELGVPSKKSKPSVDSKSKRRDESV 120
FYASENLETEEHGHSKRRKERYDEGTTDRWNGGSDDELGVPSKKSKPSVDSKSKRRDESV
Sbjct: 61 FYASENLETEEHGHSKRRKERYDEGTTDRWNGGSDDELGVPSKKSKPSVDSKSKRRDESV 120
Query: 121 GLLQCDGEELRKSSGKGEGRHRESSRKEGRNGGGDRDRDREREREKEKEKERKGREGRSD 180
GLLQCDGEELRKSSGKGEGRHRESSRKEGRNGGGDRDRDREREREKEKEKERKGREGRSD
Sbjct: 121 GLLQCDGEELRKSSGKGEGRHRESSRKEGRNGGGDRDRDREREREKEKEKERKGREGRSD 180
Query: 181 RSEEHRVEKQVEKNTDNVLQSPGLENHLETRVRKRAGSFDGDKHKDDIGDAENRQISSKN 240
RSEEHRVEKQVEKNTDNVLQSPGLENHLETRVRKRAGSFDGDKHKDDIGDAENRQISSKN
Sbjct: 181 RSEEHRVEKQVEKNTDNVLQSPGLENHLETRVRKRAGSFDGDKHKDDIGDAENRQISSKN 240
Query: 241 DAVKDGRRKSEKHKDERNREKYREDADRDGKQRDEQLVKDHISRSNDRDLRDEKDAIDMH 300
DAVKDGRRKSEKHKDERNREKYREDADRDGKQRDEQLVKDHISRSNDRDLRDEKDAIDMH
Sbjct: 241 DAVKDGRRKSEKHKDERNREKYREDADRDGKQRDEQLVKDHISRSNDRDLRDEKDAIDMH 300
Query: 301 HKRNKPQDSDPDREVTKAKHEGDLDARRDQDHDRDRHHAYERDRDHDQESRRRRDRDRGR 360
HKRNKPQDSDPDREVTKAKHEGDLDARRDQDHDRDRHHAYERDRDHDQESRRRRDRDRGR
Sbjct: 301 HKRNKPQDSDPDREVTKAKHEGDLDARRDQDHDRDRHHAYERDRDHDQESRRRRDRDRGR 360
Query: 361 DRDRDYDRDGRRNRSRSRARDRYSDYECDVDRDGSHFEDQYTKYADSRGRKRSPNDHVDS 420
DRDRDYDRDGRRNRSRSRARDRYSDYECDVDRDGSHFEDQYTKYADSRGRKRSPNDHVDS
Sbjct: 361 DRDRDYDRDGRRNRSRSRARDRYSDYECDVDRDGSHFEDQYTKYADSRGRKRSPNDHVDS 420
Query: 421 VDARSKSLKNSHHSNEEKKSLSNDKVDSDAERGRSQSRSRHADVSLSSHRRKNSPSSLSR 480
VDARSKSLKNSHHSNEEKKSLSNDKVDSDAERGRSQSRSRHADVSLSSHRRKNSPSSLSR
Sbjct: 421 VDARSKSLKNSHHSNEEKKSLSNDKVDSDAERGRSQSRSRHADVSLSSHRRKNSPSSLSR 480
Query: 481 VGMDEYRHQDQEDLRDRYPKKEERSKSISTRDKSGFSGVQEKGSKYTYVEKPSEADGGNA 540
VGMDEYRHQDQEDLRDRYPKKEERSKSISTRDKSGFSGVQEKGSKYTYVEKPSEADGGNA
Sbjct: 481 VGMDEYRHQDQEDLRDRYPKKEERSKSISTRDKSGFSGVQEKGSKYTYVEKPSEADGGNA 540
Query: 541 IELSRERSLNSKNIDIEESGRRCSTSIDNKDLSSNKDRLSWDLPGEKPLMDESPQAESFY 600
IELSRERSLNSKNIDIEESGRRCSTSIDNKDLSSNKDRLSWDLPGEKPLMDESPQAESFY
Sbjct: 541 IELSRERSLNSKNIDIEESGRRCSTSIDNKDLSSNKDRLSWDLPGEKPLMDESPQAESFY 600
Query: 601 SKASQSHPSPFHPRPAFRGGLDSPFDGSLEDDSRLNSNGRFRRNNDQNLGRVHGNTWRGV 660
SKASQSHPSPFHPRPAFRGGLDSPFDGSLEDDSRLNSNGRFRRNNDQNLGRVHGNTWRGV
Sbjct: 601 SKASQSHPSPFHPRPAFRGGLDSPFDGSLEDDSRLNSNGRFRRNNDQNLGRVHGNTWRGV 660
Query: 661 PNWTAPLPNGFIPFQHGPPPHGSFQSMMPQFPAPPLFGIRPPLEINHSGIPYRMPDAERF 720
PNWTAPLPNGFIPFQHGPPPHGSFQSMMPQFPAPPLFGIRPPLEINHSGIPYRMPDAERF
Sbjct: 661 PNWTAPLPNGFIPFQHGPPPHGSFQSMMPQFPAPPLFGIRPPLEINHSGIPYRMPDAERF 720
Query: 721 SSHMHPLGWQNMLDGSSPSHLHGWDGNNGIFRDESHIYGGAEWEENRQMVNGRGWESKAD 780
SSHMHPLGWQNMLDGSSPSHLHGWDGNNGIFRDESHIYGGAEWEENRQMVNGRGWESKAD
Sbjct: 721 SSHMHPLGWQNMLDGSSPSHLHGWDGNNGIFRDESHIYGGAEWEENRQMVNGRGWESKAD 780
Query: 781 MWKRQSGGPKRELPSQFQKDERLVQDPVDDVSSREACDESTNTILTKTVEMRPNIPSAKE 840
MWKRQSGGPKRELPSQFQKDERLVQDPVDDVSSREACDESTNTILTKTVEMRPNIPSAKE
Sbjct: 781 MWKRQSGGPKRELPSQFQKDERLVQDPVDDVSSREACDESTNTILTKTVEMRPNIPSAKE 840
Query: 841 SPNTPELLSETPAPVRRSMDDNSKLSCSYLSKLKISAELAHPDLYHQCQRLMDIENCATA 900
SPNTPELLSETPAPVRRSMDDNSKLSCSYLSKLKISAELAHPDLYHQCQRLMDIENCATA
Sbjct: 841 SPNTPELLSETPAPVRRSMDDNSKLSCSYLSKLKISAELAHPDLYHQCQRLMDIENCATA 900
Query: 901 DEETAAYIVLEGGMRAVFISSNGVHQSLSHPNKNPGFQRAMDLYKKQRMEMKEMKVVSGG 960
DEETAAYIVLEGGMRAVFISSNGVHQSLSHPNKNPGFQRAMDLYKKQRMEMKEMKVVSGG
Sbjct: 901 DEETAAYIVLEGGMRAVFISSNGVHQSLSHPNKNPGFQRAMDLYKKQRMEMKEMKVVSGG 960
Query: 961 KLDGILASSERRLEEQGLNFNNEEVKVPVSTVGAEMVQPPILATGDKAVVESTAALGKSE 1020
KLDGILASSERRLEEQGLNFNNEEVKVPVSTVGAEMVQPPILATGDKAVVESTAALGKSE
Sbjct: 961 KLDGILASSERRLEEQGLNFNNEEVKVPVSTVGAEMVQPPILATGDKAVVESTAALGKSE 1020
Query: 1021 DLASTASQEEVKCLENSEETLPITKSTEMDVMDLEQEQVNLDVEKDTVKPSDNVSVNDTD 1080
DLASTASQEEVKCLENSEETLPITKSTEMDVMDLEQEQVNLDVEKDTVKPSDNVSVNDTD
Sbjct: 1021 DLASTASQEEVKCLENSEETLPITKSTEMDVMDLEQEQVNLDVEKDTVKPSDNVSVNDTD 1080
Query: 1081 KGIVNGKDSCFDNAVTVSGPLSFADEIPETCEGLMPMPISIGSESLILNRIHHSPESTH 1139
KGIVNGKDSCFDNAVTVSGPLSFADEIPETCEGLMPMPISIGSESLILNRIHHSPESTH
Sbjct: 1081 KGIVNGKDSCFDNAVTVSGPLSFADEIPETCEGLMPMPISIGSESLILNRIHHSPESTH 1139
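The identity and positives percentages in each hit header are the match counts divided by the alignment length, which can exceed the 1139-residue query length when the alignment contains gaps. A short Python sketch that recomputes them from the counts reported for the hits in this section; the helper name `percent` is hypothetical.

```python
def percent(matches, aligned_length):
    """Return matches as a percentage of aligned_length, to two decimals."""
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    return round(100.0 * matches / aligned_length, 2)


# (identities, positives, alignment length) copied from the hit headers.
hits = {
    "XP_022158031.1": (1139, 1139, 1139),
    "XP_038876328.1": (955, 1016, 1191),
    "XP_008437591.1": (958, 1021, 1221),
    "XP_031740997.1": (956, 1025, 1211),
}

for accession, (identities, positives, length) in hits.items():
    print(accession, percent(identities, length), percent(positives, length))
```

The recomputed values agree with the percentages printed in the corresponding headers.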
BLAST of MC06g1386 vs. NCBI nr
Match:
XP_038876328.1 (LOW QUALITY PROTEIN: filaggrin [Benincasa hispida])
HSP 1 Score: 1714 bits (4440), Expect = 0.0
Identity = 955/1191 (80.18%), Positives = 1016/1191 (85.31%), Query Frame = 0
Query: 1 MPRGSRHKSTRHGLKDARDSSDSENDSSLRDRKGKESGSRVFKDSASSEKRRFDSKDAKD 60
MPRGSRHKSTRHGLKDAR+SSDSENDS+LRDRKGKESGSRV KDSASSEKRRFDSKD K+
Sbjct: 1 MPRGSRHKSTRHGLKDARESSDSENDSTLRDRKGKESGSRVLKDSASSEKRRFDSKDTKE 60
Query: 61 FYASENLETEEHGHSKRRKERYDEGTTDRWNGGSDDELGVPSKKSKPSVDSKSKRRDESV 120
FY SENLETEEHGHSKRRKERYDEGTTDRWNGGSDDELGVPSKKSKPSVDSKSKRRDESV
Sbjct: 61 FYGSENLETEEHGHSKRRKERYDEGTTDRWNGGSDDELGVPSKKSKPSVDSKSKRRDESV 120
Query: 121 GLLQCDGEELRKSSGKGEGRHRESSRKEGRNGGGDRDRDR-------------------E 180
GL Q DGEEL+KSSGKGEGRHRESSRKEGRNGGG+R+RDR E
Sbjct: 121 GL-QGDGEELKKSSGKGEGRHRESSRKEGRNGGGERERDRDRDRDRDRDRDRDREGEGGE 180
Query: 181 REREKEKEKERKGREGRSDR---SEEHRVEKQVEKNTDNVLQSPGLENHLETRVRKRAGS 240
RERE+E+EK+RKGREGRSDR SEE RVEKQVEKNT+NVL SPGLENHLE RVRK AGS
Sbjct: 181 REREREREKDRKGREGRSDRGVASEELRVEKQVEKNTENVLHSPGLENHLEIRVRKGAGS 240
Query: 241 FDGDKHKDDIGDAENRQISSKNDAVKDGRRKSEKHKDERNREKYREDADRDGKQRDEQLV 300
FDGDK KDDIGD ENRQ+SSKND VKD RRKSEK+KDERNREKYRED DRDGK+RDEQLV
Sbjct: 241 FDGDKRKDDIGDVENRQLSSKNDTVKDVRRKSEKYKDERNREKYREDVDRDGKERDEQLV 300
Query: 301 KDHISRSNDRDLRDEKDAIDMHHKRNKPQDSDPDREVTKAKHEGDLDARRDQDHDRDRHH 360
KDHISRSNDRDLRDEKDA+DMHHKRNKPQDSD DREVTKAK EGDLDA RD
Sbjct: 301 KDHISRSNDRDLRDEKDAMDMHHKRNKPQDSDLDREVTKAKREGDLDAMRD--------- 360
Query: 361 AYERDRDHDQESRRRRDRDRGRDRDRDYDRDGRRNRSRSRARDRYSDYECDVDRDGSHFE 420
HDQESRRRR DRGRDRDRD+DRDGRRNRSRSRARDRYSDYECDVDRDGSH E
Sbjct: 361 -------HDQESRRRR--DRGRDRDRDHDRDGRRNRSRSRARDRYSDYECDVDRDGSHLE 420
Query: 421 DQYTKYADSRGRKRSPNDHVDSVDARSKSLKNSHHSNEEKKSLSNDKVDSDAERGRSQSR 480
DQYTKY DSRGRKRSPNDH DSVDARSKSLKNSHH+N+EKKSLSNDKVDSDAERGRSQSR
Sbjct: 421 DQYTKYVDSRGRKRSPNDHDDSVDARSKSLKNSHHANDEKKSLSNDKVDSDAERGRSQSR 480
Query: 481 SRHADVSLSSHRRKNSPSSLSRVGMDEYRHQDQEDLRDRYPKKEERSKSISTRDKSGFSG 540
SRH DV+LSSHRRK+SPSSLSRVG DEYRHQDQEDL+DRYPKKE+RSKSISTRDK SG
Sbjct: 481 SRHVDVNLSSHRRKSSPSSLSRVGTDEYRHQDQEDLKDRYPKKEDRSKSISTRDKGVLSG 540
Query: 541 VQEKGSKYTYVEKPSEADGGNAIELSRERSLNSKNIDIEESGRRCSTSIDNKDLSSNKDR 600
VQEKGSKY+Y EKPSE +GGNA EL R+RSLNSKN+DIEESGRR +TSID KDLSSNKDR
Sbjct: 541 VQEKGSKYSYSEKPSETEGGNATELLRDRSLNSKNVDIEESGRRHNTSIDAKDLSSNKDR 600
Query: 601 LSWDLPGEKPLMDESPQAESFYSKASQSHPSPFHPRPAFRGGLDSPFDGSLEDDSRLNSN 660
SWD+ GEKPLMD+S QAES+YSK SQ++PSPFHPRPAFRGG+D PFDGSL+DD RLNSN
Sbjct: 601 HSWDIQGEKPLMDDSSQAESYYSKGSQNNPSPFHPRPAFRGGVDIPFDGSLDDDGRLNSN 660
Query: 661 GRFRRNNDQNLGRVHGNTWRGVPNWTAPLPNGFIPFQHGPPPHGSFQSMMPQFPAPPLFG 720
RFRR +D NLGRVHGNTWRGVPNW+APLPNGFIPFQHGPPPHGSFQ MPQFPAPPLFG
Sbjct: 661 NRFRRGSDPNLGRVHGNTWRGVPNWSAPLPNGFIPFQHGPPPHGSFQLNMPQFPAPPLFG 720
Query: 721 IRPPLEINHSGIPYRMPDAERFSSHMHPLGWQNMLDGSSPSHLHGWDGNNGIFRDESHIY 780
IRPPLEINHSGI YRMPDAERFSSHMH LGWQNMLDGSSPSHLHGWDGNNGIFRDESHIY
Sbjct: 721 IRPPLEINHSGIHYRMPDAERFSSHMHSLGWQNMLDGSSPSHLHGWDGNNGIFRDESHIY 780
Query: 781 GGAEWEENRQMVNGRGWESKADMWKRQSGGPKRELPSQFQKDERLVQDPVDDVSSREACD 840
GAEW+ENRQMVNGRGW+SK +MWKRQSG KRELPSQFQKDER VQDPVDDVSSRE CD
Sbjct: 781 SGAEWDENRQMVNGRGWDSKTEMWKRQSGSLKRELPSQFQKDERSVQDPVDDVSSREVCD 840
Query: 841 ESTNTILTKTVEMRPNIPSAKESPNTPELLSETPAPVRRSMDDNSKLSCSYLSKLKISAE 900
ES +TILTKT E+RPNIPSAKESPNTPEL SETP P+RRSMDDNSKLSCSYLSKLKIS E
Sbjct: 841 ESADTILTKTAEIRPNIPSAKESPNTPELFSETPTPLRRSMDDNSKLSCSYLSKLKISTE 900
Query: 901 LAHPDLYHQCQRLMDIENCATADEETAAYIVLEGGMRAVFISSNGVHQSLSHPNKNPGFQ 960
LAHPDLYHQCQRLMDIE+ TADEETAAYIVLEGG+RAV ISSN VHQSL HP+KN FQ
Sbjct: 901 LAHPDLYHQCQRLMDIEHSVTADEETAAYIVLEGGLRAVSISSNSVHQSLFHPDKNSVFQ 960
Query: 961 RAMDLYKKQRMEMKEMKVVSGGK--------------LDGILASSERRLEEQGLNFNNEE 1020
AMDLYKKQRMEMKEM+VVSGG + G LASSER LEE+ +FN+EE
Sbjct: 961 HAMDLYKKQRMEMKEMQVVSGGMPSSERRLEEKGMQVVSGGLASSERELEEKAFDFNDEE 1020
Query: 1021 VKVPVSTVGAEMVQPPILATGDKAVVESTAALGKSEDLASTASQEEVKCLENSEETLPIT 1080
VK P+STV EM Q PI TG VE A GK ED+ASTASQEEVKCLENSEE+LPIT
Sbjct: 1021 VKAPISTVDEEMEQTPIKTTGADKEVEVADARGKLEDVASTASQEEVKCLENSEESLPIT 1080
Query: 1081 KSTEMDVMDLEQEQVNLDVEKDTVK-PSDNVSVNDTDK-------GIVNGKDS------- 1139
TE+ VM + Q NLD EKDTV +DN+ V+DTDK GI N KDS
Sbjct: 1081 NPTEV-VMIASEHQENLDAEKDTVVVANDNIPVDDTDKFSNNDVKGIANSKDSTRRGVGN 1140
BLAST of MC06g1386 vs. NCBI nr
Match:
XP_008437591.1 (PREDICTED: uncharacterized protein DDB_G0283697 [Cucumis melo])
HSP 1 Score: 1706 bits (4418), Expect = 0.0
Identity = 958/1221 (78.46%), Positives = 1021/1221 (83.62%), Query Frame = 0
Query: 1 MPRGSRHKSTRHGLKDARDSSDSENDSSLRDRKGKESGSRVFKDSASSEKRRFDSKDAKD 60
MPRGSRHKSTRHGLKDA +SSDSENDS++RDRKGKESGSRV KDSASSEKRRFDSKD K+
Sbjct: 1 MPRGSRHKSTRHGLKDAMESSDSENDSTIRDRKGKESGSRVLKDSASSEKRRFDSKDTKE 60
Query: 61 FYASENLETEEHGHSKRRKERYDEGTTDRWNGGSDDELGVPSKKSKPSVDSKSKRRDESV 120
FY SENLETEEHGHSKRRKERYDEGTTDRWNGGSDDELGVPSKKSKPSVDSKSKRRDESV
Sbjct: 61 FYGSENLETEEHGHSKRRKERYDEGTTDRWNGGSDDELGVPSKKSKPSVDSKSKRRDESV 120
Query: 121 GLLQCDGEELRKSSGKGEGRHRESSRKEGRNGGG-------------------------- 180
GL Q DGEEL+KSSGKGEGRHRESSRKEGRNGGG
Sbjct: 121 GL-QGDGEELKKSSGKGEGRHRESSRKEGRNGGGERERERERERERDRDRDRDRDRDRDR 180
Query: 181 --DRDRDREREREKE----------------KEKERKGREGRSDR---SEEHRVEKQVEK 240
DRDRDRERERE+E KEK+RKGREGRSDR SEE RVEKQVEK
Sbjct: 181 DRDRDRDREREREREREREREREREREREREKEKDRKGREGRSDRGIASEELRVEKQVEK 240
Query: 241 NTDNVLQSPGLENHLETRVRKRAGSFDGDKHKDDIGDAENRQISSKNDAVKDGRRKSEKH 300
NT+NVL SPGLENHLE R RK AGSFDGDKHKDD GD ENRQ+SSKND VKDGRRKSEK+
Sbjct: 241 NTENVLHSPGLENHLEARGRKGAGSFDGDKHKDDAGDVENRQLSSKNDTVKDGRRKSEKY 300
Query: 301 KDERNREKYREDADRDGKQRDEQLVKDHISRSNDRDLRDEKDAIDMHHKRNKPQDSDPDR 360
KDERNREKYRED DRDGK+RDEQLVK+HISRSNDRDLRDEKDA+DMHHKRNKPQDSD DR
Sbjct: 301 KDERNREKYREDVDRDGKERDEQLVKEHISRSNDRDLRDEKDAMDMHHKRNKPQDSDIDR 360
Query: 361 EVTKAKHEGDLDARRDQDHDRDRHHAYERDRDHDQESRRRRDRDRGRDRDRDYDRDGRRN 420
E+TKAK +GDLD RDQDHDR HH YERD HDQESRRRR DRGRDRDR++DRDGRRN
Sbjct: 361 EITKAKRDGDLDVMRDQDHDR--HHGYERD--HDQESRRRR--DRGRDRDREHDRDGRRN 420
Query: 421 RSRSRARDRYSDYECDVDRDGSHFEDQYTKYADSRGRKRSPNDHVDSVDARSKSLKNSHH 480
RSRSRARDRYSDYECDVDRDGSH EDQY+KY DSRGRKRSPNDH DSVDARSKSLKNSHH
Sbjct: 421 RSRSRARDRYSDYECDVDRDGSHLEDQYSKYVDSRGRKRSPNDHDDSVDARSKSLKNSHH 480
Query: 481 SNEEKKSLSNDKVDSDAERGRSQSRSRHADVSLSSHRRKNSPSSLSRVGMDEYRHQDQED 540
+N+EKKSLSNDKVDSDAERG SQSRSRH DV+LSSHRRK+SPSSLSRVG DEYRHQDQED
Sbjct: 481 ANDEKKSLSNDKVDSDAERGISQSRSRHGDVNLSSHRRKSSPSSLSRVGTDEYRHQDQED 540
Query: 541 LRDRYPKKEERSKSISTRDKSGFSGVQEKGSKYTYVEKPSEADGGNAIELSRERSLNSKN 600
LRDRYPKKEERSKSISTRDK SGVQEKGSKY+Y EKPSE +GGNA EL R+RSLNSKN
Sbjct: 541 LRDRYPKKEERSKSISTRDKGVLSGVQEKGSKYSYSEKPSETEGGNATELLRDRSLNSKN 600
Query: 601 IDIEESGRRCSTSIDNKDLSSNKDRLSWDLPGEKPLMDESPQAESFYSKASQSHPSPFHP 660
+DIEESGRR +TSID KDLSSNKDR SWD+ GEKPLMD+S QAES+YSK SQS+PSPFH
Sbjct: 601 VDIEESGRRHNTSIDAKDLSSNKDRHSWDIQGEKPLMDDSSQAESYYSKGSQSNPSPFHS 660
Query: 661 RPAFRGGLDSPFDGSLEDDSRLNSNGRFRRNNDQNLGRVHGNTWRGVPNWTAPLPNGFIP 720
RPAFRGG+D PFDGSL+DD RLNSN RFRR ND NLGRVHGN+WRGVPNW+APLPNGFIP
Sbjct: 661 RPAFRGGVDIPFDGSLDDDGRLNSNSRFRRGNDPNLGRVHGNSWRGVPNWSAPLPNGFIP 720
Query: 721 FQHGPPPHGSFQSMMPQFPAPPLFGIRPPLEINHSGIPYRMPDAERFSSHMHPLGWQNML 780
FQHGPPPHGSFQS+MPQFPAPPLFGIRPPLEINHSGI YRMPDAERFSSHMH LGWQNML
Sbjct: 721 FQHGPPPHGSFQSIMPQFPAPPLFGIRPPLEINHSGIHYRMPDAERFSSHMHSLGWQNML 780
Query: 781 DGSSPSHLHGWDGNNGIFRDESHIYGGAEWEENRQMVNGRGWESKADMWKRQSGGPKREL 840
DGSSPSHLHGWDGNNGIFRDESHIY GAEW+ENRQMVNGRGWESK +MWKRQSG KREL
Sbjct: 781 DGSSPSHLHGWDGNNGIFRDESHIYSGAEWDENRQMVNGRGWESKPEMWKRQSGSLKREL 840
Query: 841 PSQFQKDERLVQDPVDDVSSREACDESTNTILTKTVEMRPNIPSAKESPNTPELLSETPA 900
PSQFQKDER VQD VDDVSSREACDEST T+LTKT E+RPNIPSAKESPNTPEL SETPA
Sbjct: 841 PSQFQKDERSVQDLVDDVSSREACDESTETVLTKTAEIRPNIPSAKESPNTPELFSETPA 900
Query: 901 PVRRSMDDNSKLSCSYLSKLKISAELAHPDLYHQCQRLMDIENCATADEETAAYIVLEGG 960
P+RRSMDDNSKLSCSYLSKLKIS ELAHPDLYHQC RLMDIE+CATADEETA YIVLEGG
Sbjct: 901 PLRRSMDDNSKLSCSYLSKLKISTELAHPDLYHQCLRLMDIEHCATADEETATYIVLEGG 960
Query: 961 MRAVFISSNGVHQSLSHPNKNPGFQRAMDLYKKQRMEMKEMKVVSGGKLDGILASSERRL 1020
MRAV ISS+ QSL HP+KN FQ AMDLYKKQRMEMKEM+VVS G + SSERRL
Sbjct: 961 MRAVSISSSSARQSLFHPDKNSVFQHAMDLYKKQRMEMKEMQVVSEG-----ITSSERRL 1020
Query: 1021 EEQGL-------------------NFNNEEVKVPVSTVGAEMVQPPILATGDKAVVESTA 1080
EE+G+ +FNN EVK P ST EM Q PI G VE+T
Sbjct: 1021 EEKGMQVVSGEMAASEMKLEGTAFDFNNGEVKTPDSTADVEMEQTPIKTVGVDEEVETTE 1080
Query: 1081 ALGKSEDLASTASQEEVKCLENSEETLPITKSTEMDVMDLEQEQVNLDVEKDTV-KPSDN 1139
ALGK E +AST SQEEVKCLENSEE+LP + E+D++D EQ+ VNLD EKDTV DN
Sbjct: 1081 ALGKLEAMASTGSQEEVKCLENSEESLPNSNLIEVDMIDSEQQVVNLDAEKDTVFMAKDN 1140
BLAST of MC06g1386 vs. NCBI nr
Match:
XP_031740997.1 (uncharacterized protein DDB_G0283697 [Cucumis sativus] >KAE8647802.1 hypothetical protein Csa_000310 [Cucumis sativus])
HSP 1 Score: 1706 bits (4417), Expect = 0.0
Identity = 956/1211 (78.94%), Positives = 1025/1211 (84.64%), Query Frame = 0
Query: 1 MPRGSRHKSTRHGLKDARDSSDSENDSSLRDRKGKESGSRVFKDSASSEKRRFDSKDAKD 60
MPRGSRHKSTRHGLKDAR+SSDSENDS++RDRKGKESGSRV KDSASSEKRRFDSKD K+
Sbjct: 1 MPRGSRHKSTRHGLKDARESSDSENDSTVRDRKGKESGSRVLKDSASSEKRRFDSKDTKE 60
Query: 61 FYASENLETEEHGHSKRRKERYDEGTTDRWNGGSDDELGVPSKKSKPSVDSKSKRRDESV 120
FY SENLETEEHGHSKRRKERYDEGTTDRWNGGSDDELGVPSKKSKPSVDSKSKRRDESV
Sbjct: 61 FYGSENLETEEHGHSKRRKERYDEGTTDRWNGGSDDELGVPSKKSKPSVDSKSKRRDESV 120
Query: 121 GLLQCDGEELRKSSGKGEGRHRESSRKEGRNGGGDRDRDR-------------------- 180
GL Q GEEL+KSSGKGEGRHRESSRKEGRNGGG+R+RDR
Sbjct: 121 GL-QGGGEELKKSSGKGEGRHRESSRKEGRNGGGERERDRDRDRDRDRDRDRDRERERER 180
Query: 181 ------------------EREREKEKEKERKGREGRSDR---SEEHRVEKQVEKNTDNVL 240
EREREKEKEK+RKGREGRSDR SEE RVEKQVEKN +NVL
Sbjct: 181 EREREREREREREREREREREREKEKEKDRKGREGRSDRGIASEELRVEKQVEKNAENVL 240
Query: 241 QSPGLENHLETRVRKRAGSFDGDKHKDDIGDAENRQISSKNDAVKDGRRKSEKHKDERNR 300
SPGLENHLETR RK AGSFDGDKHKDD GD ENRQ+SSKND VKDGRRKSEK+KDERNR
Sbjct: 241 HSPGLENHLETRGRKGAGSFDGDKHKDDAGDVENRQLSSKNDTVKDGRRKSEKYKDERNR 300
Query: 301 EKYREDADRDGKQRDEQLVKDHISRSNDRDLRDEKDAIDMHHKRNKPQDSDPDREVTKAK 360
EKYRED DRDGK+RDEQLVK+HISRSNDRDLRDEKDA+DMHHKRNKPQDSD DRE+TKAK
Sbjct: 301 EKYREDVDRDGKERDEQLVKEHISRSNDRDLRDEKDAMDMHHKRNKPQDSDIDREITKAK 360
Query: 361 HEGDLDARRDQDHDRDRHHAYERDRDHDQESRRRRDRDRGRDRDRDYDRDGRRNRSRSRA 420
+GDLDA RDQDHDR HH YERD HDQESRRRR DRGRDRDR++DRDGRRNRSRSRA
Sbjct: 361 RDGDLDAMRDQDHDR--HHGYERD--HDQESRRRR--DRGRDRDREHDRDGRRNRSRSRA 420
Query: 421 RDRYSDYECDVDRDGSHFEDQYTKYADSRGRKRSPNDHVDSVDARSKSLKNSHHSNEEKK 480
RDRYSDYECD+DRDGSH EDQYTKY DSRGRKRSPNDH DSVDARSKSLKNSHH+N+EKK
Sbjct: 421 RDRYSDYECDLDRDGSHLEDQYTKYVDSRGRKRSPNDHDDSVDARSKSLKNSHHANDEKK 480
Query: 481 SLSNDKVDSDAERGRSQSRSRHADVSLSSHRRKNSPSSLSRVGMDEYRHQDQEDLRDRYP 540
SLSNDKVDSDAERG SQSRSRH DV+LSSHRRK+SPSSLSRVG DEYRHQDQEDLRDRYP
Sbjct: 481 SLSNDKVDSDAERGISQSRSRHGDVNLSSHRRKSSPSSLSRVGTDEYRHQDQEDLRDRYP 540
Query: 541 KKEERSKSISTRDKSGFSGVQEKGSKYTYVEKPSEADGGNAIELSRERSLNSKNIDIEES 600
KKEERSKSISTRDK SGVQEKGSKY+Y EKPSE +G NA EL R+RSLNSKN+DIEES
Sbjct: 541 KKEERSKSISTRDKGILSGVQEKGSKYSYSEKPSETEGSNATELLRDRSLNSKNVDIEES 600
Query: 601 GRRCSTSIDNKDLSSNKDRLSWDLPGEKPLMDESPQAESFYS-KASQSHPSPFHPRPAFR 660
GRR +TSID KDLSSNKDR SWD+ GEKPLMD+ QAES+YS K SQS+PSPFH RPAFR
Sbjct: 601 GRRHNTSIDAKDLSSNKDRHSWDIQGEKPLMDDPSQAESYYSSKGSQSNPSPFHSRPAFR 660
Query: 661 GGLDSPFDGSLEDDSRLNSNGRFRRNNDQNLGRVHGNTWRGVPNWTAPLPNGFIPFQHGP 720
GG+D PFDGSL+DD RLNSN RFRR ND NLGRVHGN+WRGVPNW+APLPNGFIPFQHGP
Sbjct: 661 GGVDIPFDGSLDDDGRLNSNSRFRRGNDPNLGRVHGNSWRGVPNWSAPLPNGFIPFQHGP 720
Query: 721 PPHGSFQSMMPQFPAPPLFGIRPPLEINHSGIPYRMPDAERFSSHMHPLGWQNMLDGSSP 780
PPHGSFQS+MPQFPAPPLFGIRPPLEINHSGI YRMPDAERFSSHMH LGWQNMLDGSSP
Sbjct: 721 PPHGSFQSIMPQFPAPPLFGIRPPLEINHSGIHYRMPDAERFSSHMHSLGWQNMLDGSSP 780
Query: 781 SHLHGWDGNNGIFRDESHIYGGAEWEENRQMVNGRGWESKADMWKRQSGGPKRELPSQFQ 840
SHLHGWDGNNGIFRDESHIY GAEW+ENRQMVNGRGWESK +MWKRQSG KRELPSQFQ
Sbjct: 781 SHLHGWDGNNGIFRDESHIYNGAEWDENRQMVNGRGWESKPEMWKRQSGSLKRELPSQFQ 840
Query: 841 KDERLVQDPVDDVSSREACDESTNTILTKTVEMRPNIPSAKESPNTPELLSETPAPVRRS 900
KDER V D VDDVSSREACDEST+T+LTKT E+RPNIPSAKESPNTPEL SETPAP+R+S
Sbjct: 841 KDERSVHDLVDDVSSREACDESTDTVLTKTAEIRPNIPSAKESPNTPELFSETPAPLRQS 900
Query: 901 MDDNSKLSCSYLSKLKISAELAHPDLYHQCQRLMDIENCATADEETAAYIVLEGGMRAVF 960
MDDNSKLSCSYLSKLKIS ELAHPDLYHQC RLMDIE+CATADEETAAYIVLEGGMRAV
Sbjct: 901 MDDNSKLSCSYLSKLKISTELAHPDLYHQCLRLMDIEHCATADEETAAYIVLEGGMRAVS 960
Query: 961 ISSNGVHQSLSHPNKNPGFQRAMDLYKKQRMEMKEMKVVSGG------KLD--------G 1020
ISS+ HQSL HP+KN FQ AMDLYKKQRMEMKEM+VVS G +L+ G
Sbjct: 961 ISSSSAHQSLFHPDKNSIFQHAMDLYKKQRMEMKEMQVVSEGITSSERRLEEKEMEVVCG 1020
Query: 1021 ILASSERRLEEQGLNFNNEEVKVPVSTVGAEMVQPPILATGDKAVVESTAALGKSEDLAS 1080
+A+SE +LEE+ +FNN EVKVP STV EM Q PI G VE+T ALGK ED+AS
Sbjct: 1021 EMAASETKLEEKTFDFNNGEVKVPDSTVDVEMEQAPIKTAGVDEEVETTEALGKLEDIAS 1080
Query: 1081 TASQEEVKCLENSEETLPITKSTEMDVMDLEQEQVNLDVEKDTVK-PSDNVSVNDTDK-- 1139
T SQEEVKCLEN EE+LP + S E+D++D EQ VNL+ EKDT+ DN VND+DK
Sbjct: 1081 TGSQEEVKCLENPEESLPNSNSIEVDMIDSEQLVVNLEAEKDTIFIAKDNTPVNDSDKFN 1140
BLAST of MC06g1386 vs. NCBI nr
Match:
XP_022922431.1 (uncharacterized protein LOC111430427 isoform X2 [Cucurbita moschata])
HSP 1 Score: 1649 bits (4269), Expect = 0.0
Identity = 939/1200 (78.25%), Positives = 1019/1200 (84.92%), Query Frame = 0
Query: 1 MPRGSRHKSTRHGLKDARDSSDSENDSSLRDRKGKESGSRVFKDSASSEKRRFDSKDAKD 60
MPRGSRHKS+RHGLKDA++SSDSENDS+LRDRKGKESGSRV KDSASSEKRRF+SKD+K+
Sbjct: 1 MPRGSRHKSSRHGLKDAKESSDSENDSTLRDRKGKESGSRVMKDSASSEKRRFESKDSKE 60
Query: 61 FYASENLETEEHGHSKRRKERYDEGTTDRWNGGSDDELGVPSKKSKPSVDSKSKRRDESV 120
FY SENLE EEHGHSKRRKERYDEGTTDRWNGGSDDELGVPSKKSK VDSKSKRRDESV
Sbjct: 61 FYGSENLEMEEHGHSKRRKERYDEGTTDRWNGGSDDELGVPSKKSKTLVDSKSKRRDESV 120
Query: 121 GLLQCDGEELRKSSGKGEGRHRESSRKEGRNGGGDRDRDREREREKEKEKERKGREGRSD 180
G Q DGEE +KSSGKGEGRHRESSRKEGRNGGG+R+R+RERERE+EK+ RKGREGRSD
Sbjct: 121 GF-QGDGEEHKKSSGKGEGRHRESSRKEGRNGGGEREREREREREREKD--RKGREGRSD 180
Query: 181 R---SEEHRVEKQVEKNTDNVLQSPGLENHLETRVRKRAGSFDGDKHKDDIGDAENRQIS 240
R SE+ RVEKQVEKN++NVL SPGLENHLE RVRKR GSFDGDKHKDDIGD +NRQ+S
Sbjct: 181 RGVASEDLRVEKQVEKNSENVLHSPGLENHLEIRVRKRTGSFDGDKHKDDIGDVDNRQLS 240
Query: 241 SKNDAVKDGRRKSEKHKDERNREKYREDADRDGKQRDEQLVKDHISRSNDRDLRDEKDAI 300
SKND VKDGRRKSEK+KDERNREKYRED DRDGK+R+E LVKDHISRSNDRDLRDEKDA+
Sbjct: 241 SKNDTVKDGRRKSEKYKDERNREKYREDVDRDGKERNE-LVKDHISRSNDRDLRDEKDAM 300
Query: 301 DMHHKRNKPQDSDPDREVTKAKHEGDLDARRDQDHDRDRHHAYERDRDHDQESRRRRDRD 360
DMHHKRNKPQDSDPDREVTKAK EGD+DA RDQDHDR HHAYERD H+QESRRRRDRD
Sbjct: 301 DMHHKRNKPQDSDPDREVTKAKREGDIDAMRDQDHDR--HHAYERD--HEQESRRRRDRD 360
Query: 361 R--GRDRDRDYDRDGRRNRSRSRARDRYSDYECDVDRDGSHFEDQYTKYADSRGRKRSPN 420
R GRDRDRD+DRD RR+RSRSRARDRYSDYECDVDRDGSHF+DQYTKY DSRGRKRSPN
Sbjct: 361 RDRGRDRDRDHDRDSRRHRSRSRARDRYSDYECDVDRDGSHFDDQYTKYVDSRGRKRSPN 420
Query: 421 DHVDSVDARSKSLKNSHHSNEEKKSLSNDKVDSDAERGRSQSRSRHADVSLSSHRRKNSP 480
DH DSVDARSKSLKNSHH+N+EKKSLSNDKVDSDAERGRSQSRSRH DVSLSSHRRK+SP
Sbjct: 421 DHDDSVDARSKSLKNSHHANDEKKSLSNDKVDSDAERGRSQSRSRHGDVSLSSHRRKSSP 480
Query: 481 SSLSRVGMDEYRHQDQEDLRDRYPKKEERSKSISTRDKSGFSGVQEKGSKYTYVEKPSEA 540
SS SRV DEYRHQDQEDLRDRYPKKEERSKSISTRDK S VQEKGSKYTY EKPSE
Sbjct: 481 SSHSRVVTDEYRHQDQEDLRDRYPKKEERSKSISTRDKGVLSVVQEKGSKYTYSEKPSEI 540
Query: 541 DGGNAIELSRERSLNSKNIDIEESGRRCSTSIDNKDLSSNKDRLSWDLPGEKPLMDESPQ 600
+GGNA EL R+R+LNSKN+DIEESGRR + SID KDLSSNKDR SWD+ GEKP+MD+S Q
Sbjct: 541 EGGNATELLRDRTLNSKNVDIEESGRRHNNSIDAKDLSSNKDRHSWDIQGEKPVMDDSSQ 600
Query: 601 AESFYSKASQSHPSPFHPRPAFRGGLDSPFDGSLEDDSRLNSNGRFRRNNDQNLGRVHGN 660
ES+YSK SQS+PSPFHPRPAFRGG+D PFDGSL+DD RLNSN RFRR ND N+GRVHGN
Sbjct: 601 VESYYSKGSQSNPSPFHPRPAFRGGVDIPFDGSLDDDGRLNSNSRFRRGNDPNMGRVHGN 660
Query: 661 TWRGVPNWTAPLPNGFIPFQHGPPPHGSFQSMMPQFPAPPLFGIRPPLEINHSGIPYRMP 720
TWRGVPNWTAPLPNGFIPFQHGPPPHGSFQS+MPQFPAPP+FGIRPPL+INHSGI YRMP
Sbjct: 661 TWRGVPNWTAPLPNGFIPFQHGPPPHGSFQSLMPQFPAPPMFGIRPPLDINHSGIHYRMP 720
Query: 721 DAERFSSHMHPLGWQNMLDGSSPSHLHGWDGNNGIFRDESHIYGGAEWEENRQMVNGRGW 780
DA+RFSSHMHPLGWQNMLDGSSPSHLHGWD NNGIFRDESHIY GAEW+ENRQMVNGRGW
Sbjct: 721 DADRFSSHMHPLGWQNMLDGSSPSHLHGWDANNGIFRDESHIYNGAEWDENRQMVNGRGW 780
Query: 781 ESKADMWKRQSGGPKRELPSQFQKDERLVQDPVDDVSSREACDESTNTILTKTVEMRPNI 840
+SKA+MWKRQSG KRE+PSQFQKDER VQDPVDDVSS+E DE+ +T+LTKT E+RPNI
Sbjct: 781 DSKAEMWKRQSGSLKREIPSQFQKDERSVQDPVDDVSSKEIFDENADTVLTKTSEIRPNI 840
Query: 841 PSAKESPNTPELLSETPAPVRRSMDDNSKLSCSYLSKLKISAELAHPDLYHQCQRLMDIE 900
PSAKESPNTPELLSETPAP+ RSMDDNSKLSCSYLSKL IS ELA PDLY QCQRLMDIE
Sbjct: 841 PSAKESPNTPELLSETPAPLSRSMDDNSKLSCSYLSKLNISTELALPDLYQQCQRLMDIE 900
Query: 901 NCATADEETAAYIVLEGGMRAVFISSNGVHQSLSHPNKNPGFQRAMDLYKKQRMEMKEMK 960
+CATADEETAAYIVLEGGMRAV +SSN SL PNKN FQ AMDLYKKQR EMKEM+
Sbjct: 901 HCATADEETAAYIVLEGGMRAVSVSSNSAQISLFRPNKNSVFQHAMDLYKKQRTEMKEMQ 960
Query: 961 VVSGG----------KLDGI------LASSERRLEEQGLNFNNEEVKVPVSTVGAEMVQP 1020
+S + G+ +A SER+ EE GLNF NEEVK PVSTV AEM Q
Sbjct: 961 AISREMPSSERMLEEEQQGMQVVSRGMAFSERKHEEMGLNFKNEEVKAPVSTVDAEMTQA 1020
Query: 1021 PILATG-DKAV-------------VESTAALGKSEDLASTASQEEVKCLENSEETLPITK 1080
PI TG D A+ VE+ AALG+ EDLAS A++E VKCLENSEE++PIT
Sbjct: 1021 PIKTTGVDNAIEADAALGKLEDLAVEADAALGELEDLASPATRE-VKCLENSEESVPITN 1080
Query: 1081 STEMDVMDLEQEQVNLDVEKDTVK-PSDNVSVN-------DTDKGIVNGKDS-------- 1139
STE+D+MD EQ NLD EKDT+ SDN VN D KGIVNGK+S
Sbjct: 1081 STEVDMMDSEQ-PANLDAEKDTIVIASDNTPVNNINESSNDDMKGIVNGKESPGCGVGNS 1140
BLAST of MC06g1386 vs. ExPASy TrEMBL
Match:
A0A6J1DZU4 (uncharacterized protein LOC111024614 OS=Momordica charantia OX=3673 GN=LOC111024614 PE=4 SV=1)
HSP 1 Score: 2154 bits (5582), Expect = 0.0
Identity = 1139/1139 (100.00%), Positives = 1139/1139 (100.00%), Query Frame = 0
Query: 1 MPRGSRHKSTRHGLKDARDSSDSENDSSLRDRKGKESGSRVFKDSASSEKRRFDSKDAKD 60
MPRGSRHKSTRHGLKDARDSSDSENDSSLRDRKGKESGSRVFKDSASSEKRRFDSKDAKD
Sbjct: 1 MPRGSRHKSTRHGLKDARDSSDSENDSSLRDRKGKESGSRVFKDSASSEKRRFDSKDAKD 60
Query: 61 FYASENLETEEHGHSKRRKERYDEGTTDRWNGGSDDELGVPSKKSKPSVDSKSKRRDESV 120
FYASENLETEEHGHSKRRKERYDEGTTDRWNGGSDDELGVPSKKSKPSVDSKSKRRDESV
Sbjct: 61 FYASENLETEEHGHSKRRKERYDEGTTDRWNGGSDDELGVPSKKSKPSVDSKSKRRDESV 120
Query: 121 GLLQCDGEELRKSSGKGEGRHRESSRKEGRNGGGDRDRDREREREKEKEKERKGREGRSD 180
GLLQCDGEELRKSSGKGEGRHRESSRKEGRNGGGDRDRDREREREKEKEKERKGREGRSD
Sbjct: 121 GLLQCDGEELRKSSGKGEGRHRESSRKEGRNGGGDRDRDREREREKEKEKERKGREGRSD 180
Query: 181 RSEEHRVEKQVEKNTDNVLQSPGLENHLETRVRKRAGSFDGDKHKDDIGDAENRQISSKN 240
RSEEHRVEKQVEKNTDNVLQSPGLENHLETRVRKRAGSFDGDKHKDDIGDAENRQISSKN
Sbjct: 181 RSEEHRVEKQVEKNTDNVLQSPGLENHLETRVRKRAGSFDGDKHKDDIGDAENRQISSKN 240
Query: 241 DAVKDGRRKSEKHKDERNREKYREDADRDGKQRDEQLVKDHISRSNDRDLRDEKDAIDMH 300
DAVKDGRRKSEKHKDERNREKYREDADRDGKQRDEQLVKDHISRSNDRDLRDEKDAIDMH
Sbjct: 241 DAVKDGRRKSEKHKDERNREKYREDADRDGKQRDEQLVKDHISRSNDRDLRDEKDAIDMH 300
Query: 301 HKRNKPQDSDPDREVTKAKHEGDLDARRDQDHDRDRHHAYERDRDHDQESRRRRDRDRGR 360
HKRNKPQDSDPDREVTKAKHEGDLDARRDQDHDRDRHHAYERDRDHDQESRRRRDRDRGR
Sbjct: 301 HKRNKPQDSDPDREVTKAKHEGDLDARRDQDHDRDRHHAYERDRDHDQESRRRRDRDRGR 360
Query: 361 DRDRDYDRDGRRNRSRSRARDRYSDYECDVDRDGSHFEDQYTKYADSRGRKRSPNDHVDS 420
DRDRDYDRDGRRNRSRSRARDRYSDYECDVDRDGSHFEDQYTKYADSRGRKRSPNDHVDS
Sbjct: 361 DRDRDYDRDGRRNRSRSRARDRYSDYECDVDRDGSHFEDQYTKYADSRGRKRSPNDHVDS 420
Query: 421 VDARSKSLKNSHHSNEEKKSLSNDKVDSDAERGRSQSRSRHADVSLSSHRRKNSPSSLSR 480
VDARSKSLKNSHHSNEEKKSLSNDKVDSDAERGRSQSRSRHADVSLSSHRRKNSPSSLSR
Sbjct: 421 VDARSKSLKNSHHSNEEKKSLSNDKVDSDAERGRSQSRSRHADVSLSSHRRKNSPSSLSR 480
Query: 481 VGMDEYRHQDQEDLRDRYPKKEERSKSISTRDKSGFSGVQEKGSKYTYVEKPSEADGGNA 540
VGMDEYRHQDQEDLRDRYPKKEERSKSISTRDKSGFSGVQEKGSKYTYVEKPSEADGGNA
Sbjct: 481 VGMDEYRHQDQEDLRDRYPKKEERSKSISTRDKSGFSGVQEKGSKYTYVEKPSEADGGNA 540
Query: 541 IELSRERSLNSKNIDIEESGRRCSTSIDNKDLSSNKDRLSWDLPGEKPLMDESPQAESFY 600
IELSRERSLNSKNIDIEESGRRCSTSIDNKDLSSNKDRLSWDLPGEKPLMDESPQAESFY
Sbjct: 541 IELSRERSLNSKNIDIEESGRRCSTSIDNKDLSSNKDRLSWDLPGEKPLMDESPQAESFY 600
Query: 601 SKASQSHPSPFHPRPAFRGGLDSPFDGSLEDDSRLNSNGRFRRNNDQNLGRVHGNTWRGV 660
SKASQSHPSPFHPRPAFRGGLDSPFDGSLEDDSRLNSNGRFRRNNDQNLGRVHGNTWRGV
Sbjct: 601 SKASQSHPSPFHPRPAFRGGLDSPFDGSLEDDSRLNSNGRFRRNNDQNLGRVHGNTWRGV 660
Query: 661 PNWTAPLPNGFIPFQHGPPPHGSFQSMMPQFPAPPLFGIRPPLEINHSGIPYRMPDAERF 720
PNWTAPLPNGFIPFQHGPPPHGSFQSMMPQFPAPPLFGIRPPLEINHSGIPYRMPDAERF
Sbjct: 661 PNWTAPLPNGFIPFQHGPPPHGSFQSMMPQFPAPPLFGIRPPLEINHSGIPYRMPDAERF 720
Query: 721 SSHMHPLGWQNMLDGSSPSHLHGWDGNNGIFRDESHIYGGAEWEENRQMVNGRGWESKAD 780
SSHMHPLGWQNMLDGSSPSHLHGWDGNNGIFRDESHIYGGAEWEENRQMVNGRGWESKAD
Sbjct: 721 SSHMHPLGWQNMLDGSSPSHLHGWDGNNGIFRDESHIYGGAEWEENRQMVNGRGWESKAD 780
Query: 781 MWKRQSGGPKRELPSQFQKDERLVQDPVDDVSSREACDESTNTILTKTVEMRPNIPSAKE 840
MWKRQSGGPKRELPSQFQKDERLVQDPVDDVSSREACDESTNTILTKTVEMRPNIPSAKE
Sbjct: 781 MWKRQSGGPKRELPSQFQKDERLVQDPVDDVSSREACDESTNTILTKTVEMRPNIPSAKE 840
Query: 841 SPNTPELLSETPAPVRRSMDDNSKLSCSYLSKLKISAELAHPDLYHQCQRLMDIENCATA 900
SPNTPELLSETPAPVRRSMDDNSKLSCSYLSKLKISAELAHPDLYHQCQRLMDIENCATA
Sbjct: 841 SPNTPELLSETPAPVRRSMDDNSKLSCSYLSKLKISAELAHPDLYHQCQRLMDIENCATA 900
Query: 901 DEETAAYIVLEGGMRAVFISSNGVHQSLSHPNKNPGFQRAMDLYKKQRMEMKEMKVVSGG 960
DEETAAYIVLEGGMRAVFISSNGVHQSLSHPNKNPGFQRAMDLYKKQRMEMKEMKVVSGG
Sbjct: 901 DEETAAYIVLEGGMRAVFISSNGVHQSLSHPNKNPGFQRAMDLYKKQRMEMKEMKVVSGG 960
Query: 961 KLDGILASSERRLEEQGLNFNNEEVKVPVSTVGAEMVQPPILATGDKAVVESTAALGKSE 1020
KLDGILASSERRLEEQGLNFNNEEVKVPVSTVGAEMVQPPILATGDKAVVESTAALGKSE
Sbjct: 961 KLDGILASSERRLEEQGLNFNNEEVKVPVSTVGAEMVQPPILATGDKAVVESTAALGKSE 1020
Query: 1021 DLASTASQEEVKCLENSEETLPITKSTEMDVMDLEQEQVNLDVEKDTVKPSDNVSVNDTD 1080
DLASTASQEEVKCLENSEETLPITKSTEMDVMDLEQEQVNLDVEKDTVKPSDNVSVNDTD
Sbjct: 1021 DLASTASQEEVKCLENSEETLPITKSTEMDVMDLEQEQVNLDVEKDTVKPSDNVSVNDTD 1080
Query: 1081 KGIVNGKDSCFDNAVTVSGPLSFADEIPETCEGLMPMPISIGSESLILNRIHHSPESTH 1139
KGIVNGKDSCFDNAVTVSGPLSFADEIPETCEGLMPMPISIGSESLILNRIHHSPESTH
Sbjct: 1081 KGIVNGKDSCFDNAVTVSGPLSFADEIPETCEGLMPMPISIGSESLILNRIHHSPESTH 1139
BLAST of MC06g1386 vs. ExPASy TrEMBL
Match:
A0A1S3AUZ1 (uncharacterized protein DDB_G0283697 OS=Cucumis melo OX=3656 GN=LOC103482960 PE=4 SV=1)
HSP 1 Score: 1706 bits (4418), Expect = 0.0
Identity = 958/1221 (78.46%), Positives = 1021/1221 (83.62%), Query Frame = 0
Query: 1 MPRGSRHKSTRHGLKDARDSSDSENDSSLRDRKGKESGSRVFKDSASSEKRRFDSKDAKD 60
MPRGSRHKSTRHGLKDA +SSDSENDS++RDRKGKESGSRV KDSASSEKRRFDSKD K+
Sbjct: 1 MPRGSRHKSTRHGLKDAMESSDSENDSTIRDRKGKESGSRVLKDSASSEKRRFDSKDTKE 60
Query: 61 FYASENLETEEHGHSKRRKERYDEGTTDRWNGGSDDELGVPSKKSKPSVDSKSKRRDESV 120
FY SENLETEEHGHSKRRKERYDEGTTDRWNGGSDDELGVPSKKSKPSVDSKSKRRDESV
Sbjct: 61 FYGSENLETEEHGHSKRRKERYDEGTTDRWNGGSDDELGVPSKKSKPSVDSKSKRRDESV 120
Query: 121 GLLQCDGEELRKSSGKGEGRHRESSRKEGRNGGG-------------------------- 180
GL Q DGEEL+KSSGKGEGRHRESSRKEGRNGGG
Sbjct: 121 GL-QGDGEELKKSSGKGEGRHRESSRKEGRNGGGERERERERERERDRDRDRDRDRDRDR 180
Query: 181 --DRDRDREREREKE----------------KEKERKGREGRSDR---SEEHRVEKQVEK 240
DRDRDRERERE+E KEK+RKGREGRSDR SEE RVEKQVEK
Sbjct: 181 DRDRDRDREREREREREREREREREREREREKEKDRKGREGRSDRGIASEELRVEKQVEK 240
Query: 241 NTDNVLQSPGLENHLETRVRKRAGSFDGDKHKDDIGDAENRQISSKNDAVKDGRRKSEKH 300
NT+NVL SPGLENHLE R RK AGSFDGDKHKDD GD ENRQ+SSKND VKDGRRKSEK+
Sbjct: 241 NTENVLHSPGLENHLEARGRKGAGSFDGDKHKDDAGDVENRQLSSKNDTVKDGRRKSEKY 300
Query: 301 KDERNREKYREDADRDGKQRDEQLVKDHISRSNDRDLRDEKDAIDMHHKRNKPQDSDPDR 360
KDERNREKYRED DRDGK+RDEQLVK+HISRSNDRDLRDEKDA+DMHHKRNKPQDSD DR
Sbjct: 301 KDERNREKYREDVDRDGKERDEQLVKEHISRSNDRDLRDEKDAMDMHHKRNKPQDSDIDR 360
Query: 361 EVTKAKHEGDLDARRDQDHDRDRHHAYERDRDHDQESRRRRDRDRGRDRDRDYDRDGRRN 420
E+TKAK +GDLD RDQDHDR HH YERD HDQESRRRR DRGRDRDR++DRDGRRN
Sbjct: 361 EITKAKRDGDLDVMRDQDHDR--HHGYERD--HDQESRRRR--DRGRDRDREHDRDGRRN 420
Query: 421 RSRSRARDRYSDYECDVDRDGSHFEDQYTKYADSRGRKRSPNDHVDSVDARSKSLKNSHH 480
RSRSRARDRYSDYECDVDRDGSH EDQY+KY DSRGRKRSPNDH DSVDARSKSLKNSHH
Sbjct: 421 RSRSRARDRYSDYECDVDRDGSHLEDQYSKYVDSRGRKRSPNDHDDSVDARSKSLKNSHH 480
Query: 481 SNEEKKSLSNDKVDSDAERGRSQSRSRHADVSLSSHRRKNSPSSLSRVGMDEYRHQDQED 540
+N+EKKSLSNDKVDSDAERG SQSRSRH DV+LSSHRRK+SPSSLSRVG DEYRHQDQED
Sbjct: 481 ANDEKKSLSNDKVDSDAERGISQSRSRHGDVNLSSHRRKSSPSSLSRVGTDEYRHQDQED 540
Query: 541 LRDRYPKKEERSKSISTRDKSGFSGVQEKGSKYTYVEKPSEADGGNAIELSRERSLNSKN 600
LRDRYPKKEERSKSISTRDK SGVQEKGSKY+Y EKPSE +GGNA EL R+RSLNSKN
Sbjct: 541 LRDRYPKKEERSKSISTRDKGVLSGVQEKGSKYSYSEKPSETEGGNATELLRDRSLNSKN 600
Query: 601 IDIEESGRRCSTSIDNKDLSSNKDRLSWDLPGEKPLMDESPQAESFYSKASQSHPSPFHP 660
+DIEESGRR +TSID KDLSSNKDR SWD+ GEKPLMD+S QAES+YSK SQS+PSPFH
Sbjct: 601 VDIEESGRRHNTSIDAKDLSSNKDRHSWDIQGEKPLMDDSSQAESYYSKGSQSNPSPFHS 660
Query: 661 RPAFRGGLDSPFDGSLEDDSRLNSNGRFRRNNDQNLGRVHGNTWRGVPNWTAPLPNGFIP 720
RPAFRGG+D PFDGSL+DD RLNSN RFRR ND NLGRVHGN+WRGVPNW+APLPNGFIP
Sbjct: 661 RPAFRGGVDIPFDGSLDDDGRLNSNSRFRRGNDPNLGRVHGNSWRGVPNWSAPLPNGFIP 720
Query: 721 FQHGPPPHGSFQSMMPQFPAPPLFGIRPPLEINHSGIPYRMPDAERFSSHMHPLGWQNML 780
FQHGPPPHGSFQS+MPQFPAPPLFGIRPPLEINHSGI YRMPDAERFSSHMH LGWQNML
Sbjct: 721 FQHGPPPHGSFQSIMPQFPAPPLFGIRPPLEINHSGIHYRMPDAERFSSHMHSLGWQNML 780
Query: 781 DGSSPSHLHGWDGNNGIFRDESHIYGGAEWEENRQMVNGRGWESKADMWKRQSGGPKREL 840
DGSSPSHLHGWDGNNGIFRDESHIY GAEW+ENRQMVNGRGWESK +MWKRQSG KREL
Sbjct: 781 DGSSPSHLHGWDGNNGIFRDESHIYSGAEWDENRQMVNGRGWESKPEMWKRQSGSLKREL 840
Query: 841 PSQFQKDERLVQDPVDDVSSREACDESTNTILTKTVEMRPNIPSAKESPNTPELLSETPA 900
PSQFQKDER VQD VDDVSSREACDEST T+LTKT E+RPNIPSAKESPNTPEL SETPA
Sbjct: 841 PSQFQKDERSVQDLVDDVSSREACDESTETVLTKTAEIRPNIPSAKESPNTPELFSETPA 900
Query: 901 PVRRSMDDNSKLSCSYLSKLKISAELAHPDLYHQCQRLMDIENCATADEETAAYIVLEGG 960
P+RRSMDDNSKLSCSYLSKLKIS ELAHPDLYHQC RLMDIE+CATADEETA YIVLEGG
Sbjct: 901 PLRRSMDDNSKLSCSYLSKLKISTELAHPDLYHQCLRLMDIEHCATADEETATYIVLEGG 960
Query: 961 MRAVFISSNGVHQSLSHPNKNPGFQRAMDLYKKQRMEMKEMKVVSGGKLDGILASSERRL 1020
MRAV ISS+ QSL HP+KN FQ AMDLYKKQRMEMKEM+VVS G + SSERRL
Sbjct: 961 MRAVSISSSSARQSLFHPDKNSVFQHAMDLYKKQRMEMKEMQVVSEG-----ITSSERRL 1020
Query: 1021 EEQGL-------------------NFNNEEVKVPVSTVGAEMVQPPILATGDKAVVESTA 1080
EE+G+ +FNN EVK P ST EM Q PI G VE+T
Sbjct: 1021 EEKGMQVVSGEMAASEMKLEGTAFDFNNGEVKTPDSTADVEMEQTPIKTVGVDEEVETTE 1080
Query: 1081 ALGKSEDLASTASQEEVKCLENSEETLPITKSTEMDVMDLEQEQVNLDVEKDTV-KPSDN 1139
ALGK E +AST SQEEVKCLENSEE+LP + E+D++D EQ+ VNLD EKDTV DN
Sbjct: 1081 ALGKLEAMASTGSQEEVKCLENSEESLPNSNLIEVDMIDSEQQVVNLDAEKDTVFMAKDN 1140
BLAST of MC06g1386 vs. ExPASy TrEMBL
Match:
A0A0A0KJV1 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G139460 PE=4 SV=1)
HSP 1 Score: 1653 bits (4281), Expect = 0.0
Identity = 956/1347 (70.97%), Positives = 1025/1347 (76.10%), Query Frame = 0
Query: 1 MPRGSRHKSTRHGLKDARDSSDSENDSSLRDRKGKESGSRVFKDSASSEKRRFDSKDAKD 60
MPRGSRHKSTRHGLKDAR+SSDSENDS++RDRKGKESGSRV KDSASSEKRRFDSKD K+
Sbjct: 1 MPRGSRHKSTRHGLKDARESSDSENDSTVRDRKGKESGSRVLKDSASSEKRRFDSKDTKE 60
Query: 61 FYASENLETEEHGHSKRRKERYDEGTTDRWNGGSDDELGVPSKKSKPSVDSKSKRRDESV 120
FY SENLETEEHGHSKRRKERYDEGTTDRWNGGSDDELGVPSKKSKPSVDSKSKRRDESV
Sbjct: 61 FYGSENLETEEHGHSKRRKERYDEGTTDRWNGGSDDELGVPSKKSKPSVDSKSKRRDESV 120
Query: 121 GLLQCDGEELRKSSGKGEGRHRESSRKEGRNGGGDRDRDR-------------------- 180
GL Q GEEL+KSSGKGEGRHRESSRKEGRNGGG+R+RDR
Sbjct: 121 GL-QGGGEELKKSSGKGEGRHRESSRKEGRNGGGERERDRDRDRDRDRDRDRDRDRDRDR 180
Query: 181 ------------------------------------------------------------ 240
Sbjct: 181 DRDRDRDRDRDRDRDRDRDRDRDRDRDRDRDRDRDRDRDRDRDRDRDRDRDRDRDRDRDR 240
Query: 241 ------------------------------------------------------------ 300
Sbjct: 241 DRDRDRDRDRDRDRDRDRDRDRDRDRDRDRDRDRDRDRDRDRDRDRDRDRDRDRDRDRDR 300
Query: 301 ----------------------------------EREREKEKEKERKGREGRSDR---SE 360
EREREKEKEK+RKGREGRSDR SE
Sbjct: 301 DRDRDRDRDRDRDRDRDRDREREREREREREREREREREKEKEKDRKGREGRSDRGIASE 360
Query: 361 EHRVEKQVEKNTDNVLQSPGLENHLETRVRKRAGSFDGDKHKDDIGDAENRQISSKNDAV 420
E RVEKQVEKN +NVL SPGLENHLETR RK AGSFDGDKHKDD GD ENRQ+SSKND V
Sbjct: 361 ELRVEKQVEKNAENVLHSPGLENHLETRGRKGAGSFDGDKHKDDAGDVENRQLSSKNDTV 420
Query: 421 KDGRRKSEKHKDERNREKYREDADRDGKQRDEQLVKDHISRSNDRDLRDEKDAIDMHHKR 480
KDGRRKSEK+KDERNREKYRED DRDGK+RDEQLVK+HISRSNDRDLRDEKDA+DMHHKR
Sbjct: 421 KDGRRKSEKYKDERNREKYREDVDRDGKERDEQLVKEHISRSNDRDLRDEKDAMDMHHKR 480
Query: 481 NKPQDSDPDREVTKAKHEGDLDARRDQDHDRDRHHAYERDRDHDQESRRRRDRDRGRDRD 540
NKPQDSD DRE+TKAK +GDLDA RDQDHDR HH YERD HDQESRRRR DRGRDRD
Sbjct: 481 NKPQDSDIDREITKAKRDGDLDAMRDQDHDR--HHGYERD--HDQESRRRR--DRGRDRD 540
Query: 541 RDYDRDGRRNRSRSRARDRYSDYECDVDRDGSHFEDQYTKYADSRGRKRSPNDHVDSVDA 600
R++DRDGRRNRSRSRARDRYSDYECD+DRDGSH EDQYTKY DSRGRKRSPNDH DSVDA
Sbjct: 541 REHDRDGRRNRSRSRARDRYSDYECDLDRDGSHLEDQYTKYVDSRGRKRSPNDHDDSVDA 600
Query: 601 RSKSLKNSHHSNEEKKSLSNDKVDSDAERGRSQSRSRHADVSLSSHRRKNSPSSLSRVGM 660
RSKSLKNSHH+N+EKKSLSNDKVDSDAERG SQSRSRH DV+LSSHRRK+SPSSLSRVG
Sbjct: 601 RSKSLKNSHHANDEKKSLSNDKVDSDAERGISQSRSRHGDVNLSSHRRKSSPSSLSRVGT 660
Query: 661 DEYRHQDQEDLRDRYPKKEERSKSISTRDKSGFSGVQEKGSKYTYVEKPSEADGGNAIEL 720
DEYRHQDQEDLRDRYPKKEERSKSISTRDK SGVQEKGSKY+Y EKPSE +G NA EL
Sbjct: 661 DEYRHQDQEDLRDRYPKKEERSKSISTRDKGILSGVQEKGSKYSYSEKPSETEGSNATEL 720
Query: 721 SRERSLNSKNIDIEESGRRCSTSIDNKDLSSNKDRLSWDLPGEKPLMDESPQAESFYS-K 780
R+RSLNSKN+DIEESGRR +TSID KDLSSNKDR SWD+ GEKPLMD+ QAES+YS K
Sbjct: 721 LRDRSLNSKNVDIEESGRRHNTSIDAKDLSSNKDRHSWDIQGEKPLMDDPSQAESYYSSK 780
Query: 781 ASQSHPSPFHPRPAFRGGLDSPFDGSLEDDSRLNSNGRFRRNNDQNLGRVHGNTWRGVPN 840
SQS+PSPFH RPAFRGG+D PFDGSL+DD RLNSN RFRR ND NLGRVHGN+WRGVPN
Sbjct: 781 GSQSNPSPFHSRPAFRGGVDIPFDGSLDDDGRLNSNSRFRRGNDPNLGRVHGNSWRGVPN 840
Query: 841 WTAPLPNGFIPFQHGPPPHGSFQSMMPQFPAPPLFGIRPPLEINHSGIPYRMPDAERFSS 900
W+APLPNGFIPFQHGPPPHGSFQS+MPQFPAPPLFGIRPPLEINHSGI YRMPDAERFSS
Sbjct: 841 WSAPLPNGFIPFQHGPPPHGSFQSIMPQFPAPPLFGIRPPLEINHSGIHYRMPDAERFSS 900
Query: 901 HMHPLGWQNMLDGSSPSHLHGWDGNNGIFRDESHIYGGAEWEENRQMVNGRGWESKADMW 960
HMH LGWQNMLDGSSPSHLHGWDGNNGIFRDESHIY GAEW+ENRQMVNGRGWESK +MW
Sbjct: 901 HMHSLGWQNMLDGSSPSHLHGWDGNNGIFRDESHIYNGAEWDENRQMVNGRGWESKPEMW 960
Query: 961 KRQSGGPKRELPSQFQKDERLVQDPVDDVSSREACDESTNTILTKTVEMRPNIPSAKESP 1020
KRQSG KRELPSQFQKDER V D VDDVSSREACDEST+T+LTKT E+RPNIPSAKESP
Sbjct: 961 KRQSGSLKRELPSQFQKDERSVHDLVDDVSSREACDESTDTVLTKTAEIRPNIPSAKESP 1020
Query: 1021 NTPELLSETPAPVRRSMDDNSKLSCSYLSKLKISAELAHPDLYHQCQRLMDIENCATADE 1080
NTPEL SETPAP+R+SMDDNSKLSCSYLSKLKIS ELAHPDLYHQC RLMDIE+CATADE
Sbjct: 1021 NTPELFSETPAPLRQSMDDNSKLSCSYLSKLKISTELAHPDLYHQCLRLMDIEHCATADE 1080
Query: 1081 ETAAYIVLEGGMRAVFISSNGVHQSLSHPNKNPGFQRAMDLYKKQRMEMKEMKVVSGG-- 1139
ETAAYIVLEGGMRAV ISS+ HQSL HP+KN FQ AMDLYKKQRMEMKEM+VVS G
Sbjct: 1081 ETAAYIVLEGGMRAVSISSSSAHQSLFHPDKNSIFQHAMDLYKKQRMEMKEMQVVSEGIT 1140
BLAST of MC06g1386 vs. ExPASy TrEMBL
Match:
A0A6J1E442 (uncharacterized protein LOC111430427 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111430427 PE=4 SV=1)
HSP 1 Score: 1649 bits (4269), Expect = 0.0
Identity = 939/1200 (78.25%), Positives = 1019/1200 (84.92%), Query Frame = 0
Query: 1 MPRGSRHKSTRHGLKDARDSSDSENDSSLRDRKGKESGSRVFKDSASSEKRRFDSKDAKD 60
MPRGSRHKS+RHGLKDA++SSDSENDS+LRDRKGKESGSRV KDSASSEKRRF+SKD+K+
Sbjct: 1 MPRGSRHKSSRHGLKDAKESSDSENDSTLRDRKGKESGSRVMKDSASSEKRRFESKDSKE 60
Query: 61 FYASENLETEEHGHSKRRKERYDEGTTDRWNGGSDDELGVPSKKSKPSVDSKSKRRDESV 120
FY SENLE EEHGHSKRRKERYDEGTTDRWNGGSDDELGVPSKKSK VDSKSKRRDESV
Sbjct: 61 FYGSENLEMEEHGHSKRRKERYDEGTTDRWNGGSDDELGVPSKKSKTLVDSKSKRRDESV 120
Query: 121 GLLQCDGEELRKSSGKGEGRHRESSRKEGRNGGGDRDRDREREREKEKEKERKGREGRSD 180
G Q DGEE +KSSGKGEGRHRESSRKEGRNGGG+R+R+RERERE+EK+ RKGREGRSD
Sbjct: 121 GF-QGDGEEHKKSSGKGEGRHRESSRKEGRNGGGEREREREREREREKD--RKGREGRSD 180
Query: 181 R---SEEHRVEKQVEKNTDNVLQSPGLENHLETRVRKRAGSFDGDKHKDDIGDAENRQIS 240
R SE+ RVEKQVEKN++NVL SPGLENHLE RVRKR GSFDGDKHKDDIGD +NRQ+S
Sbjct: 181 RGVASEDLRVEKQVEKNSENVLHSPGLENHLEIRVRKRTGSFDGDKHKDDIGDVDNRQLS 240
Query: 241 SKNDAVKDGRRKSEKHKDERNREKYREDADRDGKQRDEQLVKDHISRSNDRDLRDEKDAI 300
SKND VKDGRRKSEK+KDERNREKYRED DRDGK+R+E LVKDHISRSNDRDLRDEKDA+
Sbjct: 241 SKNDTVKDGRRKSEKYKDERNREKYREDVDRDGKERNE-LVKDHISRSNDRDLRDEKDAM 300
Query: 301 DMHHKRNKPQDSDPDREVTKAKHEGDLDARRDQDHDRDRHHAYERDRDHDQESRRRRDRD 360
DMHHKRNKPQDSDPDREVTKAK EGD+DA RDQDHDR HHAYERD H+QESRRRRDRD
Sbjct: 301 DMHHKRNKPQDSDPDREVTKAKREGDIDAMRDQDHDR--HHAYERD--HEQESRRRRDRD 360
Query: 361 R--GRDRDRDYDRDGRRNRSRSRARDRYSDYECDVDRDGSHFEDQYTKYADSRGRKRSPN 420
R GRDRDRD+DRD RR+RSRSRARDRYSDYECDVDRDGSHF+DQYTKY DSRGRKRSPN
Sbjct: 361 RDRGRDRDRDHDRDSRRHRSRSRARDRYSDYECDVDRDGSHFDDQYTKYVDSRGRKRSPN 420
Query: 421 DHVDSVDARSKSLKNSHHSNEEKKSLSNDKVDSDAERGRSQSRSRHADVSLSSHRRKNSP 480
DH DSVDARSKSLKNSHH+N+EKKSLSNDKVDSDAERGRSQSRSRH DVSLSSHRRK+SP
Sbjct: 421 DHDDSVDARSKSLKNSHHANDEKKSLSNDKVDSDAERGRSQSRSRHGDVSLSSHRRKSSP 480
Query: 481 SSLSRVGMDEYRHQDQEDLRDRYPKKEERSKSISTRDKSGFSGVQEKGSKYTYVEKPSEA 540
SS SRV DEYRHQDQEDLRDRYPKKEERSKSISTRDK S VQEKGSKYTY EKPSE
Sbjct: 481 SSHSRVVTDEYRHQDQEDLRDRYPKKEERSKSISTRDKGVLSVVQEKGSKYTYSEKPSEI 540
Query: 541 DGGNAIELSRERSLNSKNIDIEESGRRCSTSIDNKDLSSNKDRLSWDLPGEKPLMDESPQ 600
+GGNA EL R+R+LNSKN+DIEESGRR + SID KDLSSNKDR SWD+ GEKP+MD+S Q
Sbjct: 541 EGGNATELLRDRTLNSKNVDIEESGRRHNNSIDAKDLSSNKDRHSWDIQGEKPVMDDSSQ 600
Query: 601 AESFYSKASQSHPSPFHPRPAFRGGLDSPFDGSLEDDSRLNSNGRFRRNNDQNLGRVHGN 660
ES+YSK SQS+PSPFHPRPAFRGG+D PFDGSL+DD RLNSN RFRR ND N+GRVHGN
Sbjct: 601 VESYYSKGSQSNPSPFHPRPAFRGGVDIPFDGSLDDDGRLNSNSRFRRGNDPNMGRVHGN 660
Query: 661 TWRGVPNWTAPLPNGFIPFQHGPPPHGSFQSMMPQFPAPPLFGIRPPLEINHSGIPYRMP 720
TWRGVPNWTAPLPNGFIPFQHGPPPHGSFQS+MPQFPAPP+FGIRPPL+INHSGI YRMP
Sbjct: 661 TWRGVPNWTAPLPNGFIPFQHGPPPHGSFQSLMPQFPAPPMFGIRPPLDINHSGIHYRMP 720
Query: 721 DAERFSSHMHPLGWQNMLDGSSPSHLHGWDGNNGIFRDESHIYGGAEWEENRQMVNGRGW 780
DA+RFSSHMHPLGWQNMLDGSSPSHLHGWD NNGIFRDESHIY GAEW+ENRQMVNGRGW
Sbjct: 721 DADRFSSHMHPLGWQNMLDGSSPSHLHGWDANNGIFRDESHIYNGAEWDENRQMVNGRGW 780
Query: 781 ESKADMWKRQSGGPKRELPSQFQKDERLVQDPVDDVSSREACDESTNTILTKTVEMRPNI 840
+SKA+MWKRQSG KRE+PSQFQKDER VQDPVDDVSS+E DE+ +T+LTKT E+RPNI
Sbjct: 781 DSKAEMWKRQSGSLKREIPSQFQKDERSVQDPVDDVSSKEIFDENADTVLTKTSEIRPNI 840
Query: 841 PSAKESPNTPELLSETPAPVRRSMDDNSKLSCSYLSKLKISAELAHPDLYHQCQRLMDIE 900
PSAKESPNTPELLSETPAP+ RSMDDNSKLSCSYLSKL IS ELA PDLY QCQRLMDIE
Sbjct: 841 PSAKESPNTPELLSETPAPLSRSMDDNSKLSCSYLSKLNISTELALPDLYQQCQRLMDIE 900
Query: 901 NCATADEETAAYIVLEGGMRAVFISSNGVHQSLSHPNKNPGFQRAMDLYKKQRMEMKEMK 960
+CATADEETAAYIVLEGGMRAV +SSN SL PNKN FQ AMDLYKKQR EMKEM+
Sbjct: 901 HCATADEETAAYIVLEGGMRAVSVSSNSAQISLFRPNKNSVFQHAMDLYKKQRTEMKEMQ 960
Query: 961 VVSGG----------KLDGI------LASSERRLEEQGLNFNNEEVKVPVSTVGAEMVQP 1020
+S + G+ +A SER+ EE GLNF NEEVK PVSTV AEM Q
Sbjct: 961 AISREMPSSERMLEEEQQGMQVVSRGMAFSERKHEEMGLNFKNEEVKAPVSTVDAEMTQA 1020
Query: 1021 PILATG-DKAV-------------VESTAALGKSEDLASTASQEEVKCLENSEETLPITK 1080
PI TG D A+ VE+ AALG+ EDLAS A++E VKCLENSEE++PIT
Sbjct: 1021 PIKTTGVDNAIEADAALGKLEDLAVEADAALGELEDLASPATRE-VKCLENSEESVPITN 1080
Query: 1081 STEMDVMDLEQEQVNLDVEKDTVK-PSDNVSVN-------DTDKGIVNGKDS-------- 1139
STE+D+MD EQ NLD EKDT+ SDN VN D KGIVNGK+S
Sbjct: 1081 STEVDMMDSEQ-PANLDAEKDTIVIASDNTPVNNINESSNDDMKGIVNGKESPGCGVGNS 1140
BLAST of MC06g1386 vs. ExPASy TrEMBL
Match:
A0A6J1I6E2 (uncharacterized protein LOC111471538 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111471538 PE=4 SV=1)
HSP 1 Score: 1647 bits (4264), Expect = 0.0
Identity = 936/1219 (76.78%), Positives = 1017/1219 (83.43%), Query Frame = 0
Query: 1 MPRGSRHKSTRHGLKDARDSSDSENDSSLRDRKGKESGSRVFKDSASSEKRRFDSKDAKD 60
MPRGSRHKS+R GLKDA++SSDSENDS+LRDRKGKESGSRV KDSASSEKRRF+SKD+K+
Sbjct: 1 MPRGSRHKSSRQGLKDAKESSDSENDSTLRDRKGKESGSRVMKDSASSEKRRFESKDSKE 60
Query: 61 FYASENLETEEHGHSKRRKERYDEGTTDRWNGGSDDELGVPSKKSKPSVDSKSKRRDESV 120
FY SENLE EEHGHSKRRKERYDEGTTDRWNGGSDDELGVPSKKSK VDSKSKRRDESV
Sbjct: 61 FYGSENLEMEEHGHSKRRKERYDEGTTDRWNGGSDDELGVPSKKSKTLVDSKSKRRDESV 120
Query: 121 GLLQCDGEELRKSSGKGEGRHRESSRKEGRNGGGDRDRDREREREKEKEKERKGREGRSD 180
G DGEE +KSSGKGEGRHRESSRKEGRNGGG+R+R+RERERE+EK+ RKGREGRSD
Sbjct: 121 GF-HGDGEEHKKSSGKGEGRHRESSRKEGRNGGGEREREREREREREKD--RKGREGRSD 180
Query: 181 R---SEEHRVEKQVEKNTDNVLQSPGLENHLETRVRKRAGSFDGDKHKDDIGDAENRQIS 240
R SE+ RVEKQVEKN++NVL SPGLENHLE RVRKR GSFDGDKHKDDIGD +NRQ+S
Sbjct: 181 RGVASEDLRVEKQVEKNSENVLHSPGLENHLEIRVRKRTGSFDGDKHKDDIGDVDNRQLS 240
Query: 241 SKNDAVKDGRRKSEKHKDERNREKYREDADRDGKQRDEQLVKDHISRSNDRDLRDEKDAI 300
SKND VKDGRRKSEK+KDERNREKYRED DRDGK+R EQLVKDHISRSNDRDLRDEKDA+
Sbjct: 241 SKNDTVKDGRRKSEKYKDERNREKYREDVDRDGKERHEQLVKDHISRSNDRDLRDEKDAM 300
Query: 301 DMHHKRNKPQDSDPDREVTKAKHEGDLDARRDQDHDRDRHHAYERDRDHDQESRRRRDRD 360
DMHHKRNKPQDSDPDREVTKAK EGD+DA RDQDHDR HHAYERD H+QESRRRR D
Sbjct: 301 DMHHKRNKPQDSDPDREVTKAKREGDIDAMRDQDHDR--HHAYERD--HEQESRRRR--D 360
Query: 361 RGRDRDRDYDRDGRRNRSRSRARDRYSDYECDVDRDGSHFEDQYTKYADSRGRKRSPNDH 420
RGRDRDRD DRD RR+RSRSRARDRYSDYECDVDRDG HF+DQYTKY DSRGRKRSPNDH
Sbjct: 361 RGRDRDRDRDRDSRRHRSRSRARDRYSDYECDVDRDGYHFDDQYTKYVDSRGRKRSPNDH 420
Query: 421 VDSVDARSKSLKNSHHSNEEKKSLSNDKVDSDAERGRSQSRSRHADVSLSSHRRKNSPSS 480
DSVDARSKSLKNSHH+N+EKKSLSNDKVDSDAERGRSQSRSRH DVSLSSHRRK+SPSS
Sbjct: 421 DDSVDARSKSLKNSHHANDEKKSLSNDKVDSDAERGRSQSRSRHGDVSLSSHRRKSSPSS 480
Query: 481 LSRVGMDEYRHQDQEDLRDRYPKKEERSKSISTRDKSGFSGVQEKGSKYTYVEKPSEADG 540
SRV DEYRHQDQEDLRDRYPKKE+RSKSISTRDK S VQEKGSKYTY EKPSE +G
Sbjct: 481 HSRVVTDEYRHQDQEDLRDRYPKKEDRSKSISTRDKGVLSVVQEKGSKYTYSEKPSEIEG 540
Query: 541 GNAIELSRERSLNSKNIDIEESGRRCSTSIDNKDLSSNKDRLSWDLPGEKPLMDESPQAE 600
GNA E+ R+R+LNSKN+DIEESGRR + SID KDLSSNKDR SWD+ GEKP+MD+S Q E
Sbjct: 541 GNATEMLRDRTLNSKNVDIEESGRRHNNSIDAKDLSSNKDRHSWDIQGEKPVMDDSSQVE 600
Query: 601 SFYSKASQSHPSPFHPRPAFRGGLDSPFDGSLEDDSRLNSNGRFRRNNDQNLGRVHGNTW 660
S+YSK SQS+PSPFHPRPAFRGG+D PFDGSL+DD RLNSN FRR ND N+GRVHGNTW
Sbjct: 601 SYYSKGSQSNPSPFHPRPAFRGGVDIPFDGSLDDDGRLNSNSHFRRGNDPNMGRVHGNTW 660
Query: 661 RGVPNWTAPLPNGFIPFQHGPPPHGSFQSMMPQFPAPPLFGIRPPLEINHSGIPYRMPDA 720
RGVPNWTAPLPNGFIPFQHGPPPHGSFQS+MPQFPAPP+FGIRPPL+INHSGI YRMPDA
Sbjct: 661 RGVPNWTAPLPNGFIPFQHGPPPHGSFQSLMPQFPAPPMFGIRPPLDINHSGIHYRMPDA 720
Query: 721 ERFSSHMHPLGWQNMLDGSSPSHLHGWDGNNGIFRDESHIYGGAEWEENRQMVNGRGWES 780
+RFSSHMHPLGWQNMLDGSSPSHLHGWD NNGIFRDESHIY GAEW+ENRQMVNGRGW+S
Sbjct: 721 DRFSSHMHPLGWQNMLDGSSPSHLHGWDANNGIFRDESHIYNGAEWDENRQMVNGRGWDS 780
Query: 781 KADMWKRQSGGPKRELPSQFQKDERLVQDPVDDVSSREACDESTNTILTKTVEMRPNIPS 840
KA+MWKRQSG KRE+PSQFQKDERLVQDPVDDVSS+E CDE+ +T+LTKT E+RPNIPS
Sbjct: 781 KAEMWKRQSGSLKREIPSQFQKDERLVQDPVDDVSSKEICDENADTVLTKTAEIRPNIPS 840
Query: 841 AKESPNTPELLSETPAPVRRSMDDNSKLSCSYLSKLKISAELAHPDLYHQCQRLMDIENC 900
AKESPNTPELLSETPAP+ RSMDDNSKLSCSYLSKLKIS ELA PDLY QCQRLMDIE+C
Sbjct: 841 AKESPNTPELLSETPAPLSRSMDDNSKLSCSYLSKLKISTELALPDLYQQCQRLMDIEHC 900
Query: 901 ATADEETAAYIVLEGGMRAVFISSNGVHQSLSHPNKNPGFQRAMDLYKKQRMEMKEMKVV 960
ATADEETAAYIVLEGGMRAV +SSN SL PNKN FQ AMDLYKKQR EMKEM+ +
Sbjct: 901 ATADEETAAYIVLEGGMRAVSVSSNSAQISLFRPNKNSVFQHAMDLYKKQRTEMKEMQAI 960
Query: 961 SGGK---------------LDGILASSERRLEEQGLNFNNEEVKVPVSTVGAEMVQPPIL 1020
S + G +A SER+ EE+G NFNNEEVK PVSTV AEM Q PI
Sbjct: 961 SREMPFSERMLVEEQGMQVVSGGMAFSERKHEEKGFNFNNEEVKAPVSTVDAEMTQAPIK 1020
Query: 1021 ATG-DKAV-------------VESTAALGKSEDLASTASQEEVKCLENSEETLPITKSTE 1080
TG DKA+ VE+ AALG+ EDLAS A++E VKCLENSEE++P T STE
Sbjct: 1021 TTGVDKAIEADAALGKLEDLAVEADAALGELEDLASPATRE-VKCLENSEESVPTTNSTE 1080
Query: 1081 MDVMDLEQEQVNLDVEKDTV------KPSDNV--SVNDTD-KGIVNGKDS---------- 1139
+ +MD EQ Q NLD EKDT+ P +N+ S ND D KGIVNGKDS
Sbjct: 1081 VVMMDSEQ-QANLDAEKDTIVIANDNTPVNNINESSNDDDMKGIVNGKDSPRCDELSNNN 1140
BLAST of MC06g1386 vs. TAIR 10
Match:
AT5G53440.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cytosol; EXPRESSED IN: 25 plant structures; EXPRESSED DURING: 15 growth stages; Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )
HSP 1 Score: 429.5 bits (1103), Expect = 8.5e-120
Identity = 439/1260 (34.84%), Positives = 649/1260 (51.51%), Query Frame = 0
Query: 1 MPRGSRHKSTRHGLKDA-RDSSDSENDSSLRDRKGKESGS---RVFKDSASSEKRRFDSK 60
MPR +RHKS++H KDA ++ SDSE ++SL+++K KE S RV K+S S +KR
Sbjct: 1 MPRSTRHKSSKH--KDATKEYSDSEKETSLKEKKSKEESSTTVRVSKESGSGDKR----- 60
Query: 61 DAKDFYASENLETEEH---GHSKRRKERYDEGTTDRWNGGSDDELGVPSKKSKPSVDSKS 120
K++Y S N E E SKRRK + E +DRWN G DD+ G SKK+K S KS
Sbjct: 61 --KEYYDSVNGEYYEEYTSSSSKRRKGKSGESGSDRWN-GKDDDKGESSKKTKVS-SEKS 120
Query: 121 KRRDESVGLLQCDGEELRKSSGKGEGRHRESSRKEGRNGGGDRDRDREREREKEKEKERK 180
++RDE DGEE +KSSGK +G+HRESSR+E ++ +KEK+RK
Sbjct: 121 RKRDEG------DGEETKKSSGKSDGKHRESSRRE--------------SKDVDKEKDRK 180
Query: 181 GREGRSDR---SEEHRVEKQVEKNTDNVLQ----SPGLENHLETRV-RKRAGSFDGDKHK 240
+EG+SD+ ++H K T++ Q SPG EN+ E R RKR GDKH
Sbjct: 181 YKEGKSDKFYDGDDHHKSKAGSDKTESKAQDHARSPGTENYTEKRSRRKRDDHGTGDKHH 240
Query: 241 DDIGDAENRQISSKNDAVKDGRRKSEKHKDERNREKYREDADRDG-KQRDEQLVKDHISR 300
D+ D +R ++S +D +KDG+ K EK +D+ +K ED + G KQRD++ K+H+ R
Sbjct: 241 DNSDDVGDRVLTSGDDYIKDGKHKGEKSRDKYREDKEEEDIKQKGDKQRDDRPTKEHL-R 300
Query: 301 SNDRDLRDEK----------------DAIDMHHKRNKPQDSD-------PDREVTKAKHE 360
S+++ RDE +D +H+R + +D D DRE T+ +
Sbjct: 301 SDEKLTRDESKKKSKFQDNDHGHEPDSELDGYHERERNRDYDRESDRNERDRERTR---D 360
Query: 361 GDLDARRDQDHDRDRHHAYERDR---DHDQESRRRRDRDRGRDRDRDYDRDGRRNRSRSR 420
D D RD+D DRDR +RDR +HD+ R DRDR RDRDRD++RD +R + R
Sbjct: 361 RDRDYERDRDRDRDRDRERDRDRRDYEHDRYHDRDWDRDRSRDRDRDHERDRTHDREKDR 420
Query: 421 ARDRY-------SDYECDVDRDGSHFEDQYTKYADSRGRKRSPN--DHVDSV-DARSKSL 480
+RD Y SD E D DRD S +DQ +Y D R +RSP+ D+ D + +RS +
Sbjct: 421 SRDYYHDGKRSKSDRERDNDRDVSRLDDQSGRYKDRRDGRRSPDYQDYQDVITGSRSSRV 480
Query: 481 KNSHHSNEEKKSLSNDKVDSDAERGRSQSRSRHADVSLSSHRRKNSPSSLSRVGMDEYRH 540
+ ++ LS+ V E G + + S +R + E
Sbjct: 481 EPDGDMTRPERQLSSSVVQE--ENGNASDQITKGASSREVAELSGGSERGTRQKVSEKTA 540
Query: 541 QDQEDLRDRYPKKEERSKSISTRDKSGFSGVQEKGSKYTYVEKPSEADGGNAIELSRERS 600
++ + +P + + S R + E+ T +E+ GG
Sbjct: 541 NMEDGVLGEFPAERSFAAKASPRP------MVERSPSSTSLERRYNNRGG---------- 600
Query: 601 LNSKNIDIEESGRRCSTSIDNKDLSSNKDRLSWDLPGEKPLMDESPQAE-SFYSKASQSH 660
++I++EE+G R + +D S+ ++ E+ L+DE+ QAE SF +KA+Q++
Sbjct: 601 -ARRSIEVEETGHRNNA----RDYSATEE--------ERHLVDETSQAELSFNNKANQNN 660
Query: 661 PSPFHPRPAFRGGLDSPFDGSLEDDSRLNSNGRFRRNN-DQNLGRVHGNTWRGVPNWTAP 720
S F PRP R G+ SP G E+D+R+N+ GR++R D +GR N WRGVP+W +P
Sbjct: 661 -SSFPPRPESRSGVSSPRVGPREEDNRVNTGGRYKRGGVDAMMGRGQSNMWRGVPSWPSP 720
Query: 721 LPNGFIPFQHGPPPHGSFQSMMPQFPAPPLFGIRPPLEINHSGIPYRMPDAERFSSHMHP 780
L NG+ PFQH PPHG+FQ+MMPQFP+P LFG+RP +E+NH GI Y +PDAERFS HM P
Sbjct: 721 LSNGYFPFQH-VPPHGAFQTMMPQFPSPALFGVRPSMEMNHQGISYHIPDAERFSGHMRP 780
Query: 781 LGWQNMLDGSSPSHLHGWDGN-NGIFRDESHIYGGAEWEENRQMVNGRGWESKADMWKRQ 840
LGWQNM+D S SH+HG+ G+ + RDES++YGG+EW++NR+M NGRGWES AD WK +
Sbjct: 781 LGWQNMMDSSGASHMHGFFGDMSNSVRDESNMYGGSEWDQNRRM-NGRGWESGADEWKSR 840
Query: 841 SGGPKRELPSQFQKDERLVQDPVDDVSSREACDESTNTILTKTVEMRPNIPS-AKE-SPN 900
+G E+ S KD+ Q DD S S N K+VE N+ S AKE +
Sbjct: 841 NGDASMEVSSMSVKDDNSAQ-VADDESLGGQTSHSDNN-RAKSVEAGSNLTSPAKELHAS 900
Query: 901 TPELLSETPA--PVRRSMDDNSKLSCSYLSKLKISAELAHPDLYHQCQRLMDIENCATAD 960
+P+ + E A PV ++D+ + YLSKL +SA LA +L +C L+ E D
Sbjct: 901 SPKTMEEVAADDPVSETIDNTERYCRHYLSKLDVSAGLADAEL-RKCISLLIGEEHLAMD 960
Query: 961 EETAAYIVL-EGGMRAVFISSNGVHQSLSHPNKNPG-FQRAMDLYKKQRMEMKEM----- 1020
+ TA ++ L EGG R +SN + P++N FQ AMD YK+QR E+K +
Sbjct: 961 DGTAVFVNLKEGGKRVTKSNSNSLKALSLFPSQNSSVFQIAMDFYKEQRFEIKGLPNVKN 1020
Query: 1021 ------------KVVSGGKLD-------GILASSERRLEEQGLNFNNEEVKVPVSTVGAE 1080
KV + L+ I A+ + + + + +E++ S GA+
Sbjct: 1021 HEAPQVPPSNLVKVENNDDLNDARNGNSSIEATDMKIADVSDSDTSQKELQKVSSNAGAK 1080
Query: 1081 M--------VQPPILATGDKAV--VESTAALGKSEDLAS--TASQEEVKCLENSE----- 1140
M P +A+ V S G E +AS EE L++ E
Sbjct: 1081 METETRDEGSSSPNPDNSPEALNAVSSDHIEGSEEAMASDHIEGSEEAVALDHIEGDEQE 1140
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
XP_022158031.1 | 0.0 | 100.00 | uncharacterized protein LOC111024614 [Momordica charantia]
XP_038876328.1 | 0.0 | 80.18 | LOW QUALITY PROTEIN: filaggrin [Benincasa hispida]
XP_008437591.1 | 0.0 | 78.46 | PREDICTED: uncharacterized protein DDB_G0283697 [Cucumis melo]
XP_031740997.1 | 0.0 | 78.94 | uncharacterized protein DDB_G0283697 [Cucumis sativus] >KAE8647802.1 hypothetica...
XP_022922431.1 | 0.0 | 78.25 | uncharacterized protein LOC111430427 isoform X2 [Cucurbita moschata]
Match Name | E-value | Identity | Description
A0A6J1DZU4 | 0.0 | 100.00 | uncharacterized protein LOC111024614 OS=Momordica charantia OX=3673 GN=LOC111024...
A0A1S3AUZ1 | 0.0 | 78.46 | uncharacterized protein DDB_G0283697 OS=Cucumis melo OX=3656 GN=LOC103482960 PE=...
A0A0A0KJV1 | 0.0 | 70.97 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G139460 PE=4 SV=1
A0A6J1E442 | 0.0 | 78.25 | uncharacterized protein LOC111430427 isoform X2 OS=Cucurbita moschata OX=3662 GN...
A0A6J1I6E2 | 0.0 | 76.78 | uncharacterized protein LOC111471538 isoform X2 OS=Cucurbita maxima OX=3661 GN=L...
Match Name | E-value | Identity | Description
AT5G53440.1 | 8.5e-120 | 34.84 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
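The Identity column in these summary tables appears to be the ratio reported on each HSP header line: identical positions divided by alignment length, for example 956/1211 = 78.94% for the Cucumis sativus hit. The following is a minimal Python sketch for recomputing those percentages from an HSP header line. The function name `parse_hsp_summary` and the returned dictionary keys are hypothetical, chosen only for illustration, and the pattern assumes the header layout shown on this page.

```python
import re

# Matches HSP header lines as they appear on this page, e.g.
# "Identity = 956/1211 (78.94%), Postives = 1025/1211 (84.64%), Query Frame = 0".
# "Posi?tives" accepts both "Positives" and the "Postives" spelling.
HSP_LINE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_summary(line):
    """Return counts and recomputed percentages, or None if the line does not match."""
    match = HSP_LINE.search(line)
    if match is None:
        return None
    identical, ident_len, ident_pct, positives, pos_len, pos_pct = match.groups()
    return {
        "identical": int(identical),
        "positives": int(positives),
        "alignment_length": int(ident_len),
        # Percentages recomputed from the raw counts.
        "identity_pct": round(100 * int(identical) / int(ident_len), 2),
        "positives_pct": round(100 * int(positives) / int(pos_len), 2),
        # Percentages as printed in the header, kept for cross-checking.
        "reported_identity_pct": float(ident_pct),
        "reported_positives_pct": float(pos_pct),
    }


if __name__ == "__main__":
    header = "Identity = 439/1260 (34.84%), Postives = 649/1260 (51.51%), Query Frame = 0"
    print(parse_hsp_summary(header))
```

Comparing the recomputed and reported percentages is a quick consistency check when copying values from the alignments into a table.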