Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonfive_prime_UTRCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
AGAGGACCAAAAACAACACGGAAATCTGAGAGGCAAATTGAAATTGTTCCTGTGTTCCATCTGGGTCCCAAAACCTGGTTCGGCACCTGCATCTCTCTTCCTCACAATGTATGGCGGCCCATCCAAGCTCGCTCGGGCCGGCGGCGGCGCTGGCCGCGGAGCCAGCGGAAAGCGGCCGCCCTCCTCTTTTCCTCTACCACCTGCTCACCGCCCCTCCGGCCGTCTCTCTCTCGGCGGCGGTGGCGCCGGTTCTGGCGCAAATCCTCGGAATCGAACCTCCACCGCAGCCAAATCCGAAGCCCCTCTATCCGTCGAGGAGAATTTCAGTCTCGTTACCGGTAACAATCCTTTGGCTTTTGCTATGATAATTCGGTTGGCTCCCGACTTGATCGAAGAGATCAAGCGGGTTGAGTCGCTGGGAGGAACTCCGAGAATTAAGTTTGATGCGAATGCCAAGAATTCTAGTGGTAATGTAAGTTTTCTGTATTACTTGTTCTTTACTCTAGTCATCCACATCATGGGATAATGAAGATTTTTTTTTTTTTTTAAAGATTATTTGAACTGTGTTAAGTATTTCAAATACGCTGTCGAATATGTGCACAAGTAGCTTATGTGGATTATGCTTCTGTTCGCTATTTGTTGGATTTTCCCTGTGGAATCTCTTTGTGCAACCGCTACTTCGTTACATATATTTTTGTCTAGATGGTCTTATTTAGTTGGAATTTACCGCCAATTTGAGCATTGCTATAATCTATTCTTTTGTAACATTATTGTGTTTACCTTTTTTATTTGGCAACAGTTAGCTATGGGCTGAATTGGTGAGTATGAATACTGTGAGGTCAAGGGTTAGATTACCCAATTCAGTAGAAACATAGAAAACCATGTAAGTACGAAAAATATCAGTAGGAATAGTTTATCAGATATATCAGCTTTTATGTGTCTATGATAATATAGGTTTTTCATGAGGATATATTGGAGTTCGTGTCCCAACTGACAATTGGGGCATGGCCAATTTAAAAGTTTCAAGCTGGAGATCTACACTTGATAATACTTGCTGCATTTATAACCTAAATTTTGAGCATTTTTTTCTTATTTTGGTAGACATGATTATTTAATGAATTGTGAAAGTGCTGCAATTGATGAGGTGTCAGTTGTATTCTTACAATGTGCACAACATGCTTATAAAGAATATGCAATTTATGATGATTTAATAGGAATTAGAGTGAAAGAGGTAAGTTGAAAGGGAAAACTCTAATTCTCTCTGTGTTTACTGGGGAGCTTTATCTTGAGAATATTGCTATCTCTAGGATCTGTTGAAATCCCCCATCAACCAGAAGTTTATTTATTTATTTTACTCTTAATTTTTTAAAATAAACTTTAGCACGAAAAGTTATTGGTATAATTTGAAAAGATTTCCAACAGGAGGGATAATAAAGATTAAGGCATGATACCAACAGGAGGAAGAGTAAGTGTCCTATTGTGATTCTAGTAGGAAAAGGTAGGTATAACAGGCGTTCTACAATTATATAAAAACAGACTAACAACCCAAGATTTCCCTGACATCTGTTGTTATTTTTTTGATTGAATACTCCATCATTTGTGGATACCAAAAAATACTTGATGCCATATGAGCAAGTTAAAAATAGTTGGATATAGTATAAAAGGAATACTCTTCTATATTTCTTCCAAAATCTTCACATCCAAGAAAGAGAGATGTTGGATTGCATACTCTGGGTTTTGGTGATGTTGCTGAACTCTCTAGGATCTACCCCTTTTTTCTCGATATCCTGATTCTTCTGTTTGTTAGCAGATGTGGAGATAAGTGGAGCTAGGAGTTGTTCCTATATTTATGTGAAAATGGGATGAAACACATCCTATACCTTACTTGCAAAGTGCTATGCTGTTTTCATTTTATCAATGAGTTTTTATCATACAAAAATCTCTCTTTTGATTTGTTTTTTTTTTGGAAGAAAGATATTTACTAGTTTGCTATAGGCTTCTCATTCTGCATCCCAAGAAAATTATTATTATTTTTATTGGAAATAGTCTTCATCTCTATAGGACAATATTTAAGGAGTAGAGGTCTTATCCTGTACGAAAAATGAAGAAATCATATAAAGGCTATCCACGCCTTGTGCAGTTAAAAGAAAGGAATACATATAAAATTATATTAATGATGCATAAACATGAAACCGATGGTCTATGTTGCCATCTAGTCTTGATCATCTCCAATTGTGCATATTCAAATTTAAATAGTTGTCTTCTCCTCAATCTCTATCATGTGCATTTTTCTTTCTATTGATAAGAAGGGTCTAAAAGTCTAGAGGAATTGCCTCTTTCGGCAATTTTTTTTATCACTTTACCCTTCCAATGAATAGCTGGGAAGGTGATAACAATCAGTGTAAAGGCTGAATACGAACAATACTGGATTTTCTTCCTGTTCTACTCCCCCCCTCCCAGGGCAGGTTTGTGCTAAAGTCATAATATATGTATGGATCTGACATATTCTCATCAAATCAATGTTTCCTGTTTGTCAATTGTTACAATTATACATGCACCTTAAGATCTAATTCTGCTTCCCTGTTTGAAGTTTTTTTATTTATTATTTATTATTTATTATTATTATTATTATTATTTTGAGGGCTTTGGCTATTTAAGATTGTTGTACCTGAACTGAAATTTTACACCAGGTCATTGATGTTGGGGGTAAAGAGTTTAGGTTCACATGGTCACGTGAAGTTGGTGATCTTTGTGACATATACGAAGAACGTAAAAGTGGCGAAGATGGAAGTGGTTTGCTTGTTGAATCAGGCAATGCTTGGAGAAAAGTGAATGTGCAGCGTATCTTGGATGAATCAACTACAAACCATGTTAAGAAGTTGTCTGAGGAAGCTGAACGAAGATCTAAATCTCGCAGGTATTTCTTTTGTTAATTTTAATAGAAATCAAATTATTTTGTTCAAGTTCTTATCAAATTGGTAGAAGTTTTAAAGCTGACAAAAGGAATACAAGCTTCCAAAATCTTCGCGAGGCCTGGCATTGTCCAAAATTATTCTTAAAAAAATCTCATCTCAAAACTTCCTAACTCACTCCATTCATAGAATTCAACTCTAAGCAAATTACCAAATAATTACAAATATGCCACTGCTAATAGCATATCCTTACTAATATTCTTTACAAAATGATGATTAGATTCAATTTTGAAGTTATTCGATGTTATTAGGTTTAATTGAGACTTCTTTTCTAAAACAAGAGACAAATTTCATAGATAATTGTGTTAGACAAGAAGGTAAAAACCAAACAAAAGTCAAGGAGTTTGACGAAGATTGTTGATTAGCTATCATCATAGAATTAGGAATATTGCAACAAGACTAGAAATGCAGGCCTACGTAAGAAGAGGCCTCAAATACAACAAAATCCCCAACTTATTCTCACTTGAACCAAATGCCTCCGAAAATCCTTTTGTTTCTCTCCAACAAAATCCTTTTGACTAGAAACGGATTACTGCAAATTTTATGCGGACTCTCTCTTGTCCTTGAATACATTGCTCAAAAGAAGCTGATGAGCATAGCATGAATCATCTAGGCAAGACGTAGCTTCTTCTCTGCATAACTTCGTTGGTTTTAACACCTTGAAGAATCACTTCCTACAGAAAGACCTTAAACATTTTAGGGAGTTTAGACTTCCATTCTTTTCGCTCAGCTTTTGAAAAATGAAAGTCTCTGTAGCAGCCAGCAGGAAGAGAGAAACAACTGAGACGAACAAAACAAAATCAGTGGAGTAATTGCAGATTGACATTTTAATAATAAATTGATTTTTAAGCGTACTTAGACATGAGTAATTGCATTGTTCAGAATATAGTTGCTTAATGCTTCTGCTTCTAATAGAAAACAACTACAGATGACTTAAGCTATTTTTTTATTTAATTAAATTTGGACATGAAACTATAGTATATTCCTTTGTTTAGAAGTATTGGAATGCTCTTAAAAATAAAAATTATGTTTAGTTTGTGGATGTTTTACCATTCTTCTAAGCCATAATTAGTTTTATTTATAAGTTCTATAATAATATTATGATCAACTACTCTTATAATATCTGGCCTTCCTCTCAAACCAAAAACCGGAAGAAAAGGATTTTGATGATTGCTATGTAGAACTGATAGTTTGTGAACCTTTTACTTTCATGATTTTTAAACTATAGAAAGGTTTTGTATATTGTATTAGAGATATCTGTGATGGTGTTTAGTATTATCTTTAGCTCTCTCACATGCATGCACATCTGTTGTGTTTCTGGCTACAGAGCTATTGTCTTAGAACCTGGGAATCCATCTATGAAGAATCAAATAAAGCAATTGGCTGCTGCCGAATGTATGCATCTCTTGAATCACTTTTTTTTTTTTTTTTTGTGTTACTGTTATCTGATTGCATTACTGCAAAAACTGCACCTTATGAGTTGCTCAAGTCTTTACTAATACCTAGCTTGCTTGCAGCTAATCCATGGAGGATGCATTATAAGAATAAGAAAGAGCCTCCATTTAAAAAGCAGAAAAACGAATTGTCTCAAGGTGATTGCCTGTGTACGCATATTGAATTCTTTCTTCTTATTCCATTTTCTTTTCTATATGCTTTATGCTTCATCTTTTGACATAGAGCGTTTTCTCACTTTTTGAGTGATTTCTTTCTCTTTGATGTATACTGCGTCTCCTGATGGACAATAATGTCAGTTAAGCTTAGTAATTGAGTACTAATATTTGTCATTGCAGTTGGGCCTCCAAAATCTACATTTAAGCCTGGCATGTCATCAGTACCTGCTTCCAAGGAGAGGCTATCATCTTCACCTGTTCCATCTCCACCCGAGCAATCTGGTGCTCCAATATCTCAATTCGGATCTGCAAATCCCACTAAGACTCATTGTATTGCAGAAGATATTAAACCTCGACAACCAGCTAAGATTAATGCTGCTGCTAGCAGTGAGAAGGAAATTCCAACCAAAGCCGCAAAAGGAGTTCTGGAAGCACCAGGACAGGAAGTGAATGCCGGAGCTAAACCAACAGATTTGCAAGGAATGTTGTATAATTTACTCTTGGATAACCCCAAGGGGATGAGTTTGAAGGTACGCTCAAGTAAATAGATAATTTTAATTGAAATATGGATTCCTATATAAAAATAATGTTGGATGTTTCCTGTTCGTTGTGTGTTTGTTTCCTTAAACAAGTCTGTCAATTCTATTCAAAGCATGAATTTACTGTTTTCTCCCTCTCTTTCTCTCTCGTGTAGGCATTGGAGAAAGCTGTTGGCGATAAAATCCCAAATTCTGTAAAAAAGATTGAGCCAATCATTAAAAAAGTAAGTGATGTAAATGTTAAATTATTAGAAATCCAATTCATACTTAGGTCAGTATTTATTCAGTATTTCAATGAATCTCATATACTCCCTAAATATGTTGTCTTGAATAGTAATTCAATTAAAAAGATCCAACAAGTGAACTATCAATTTGAAATGGGGAGATATTAGAAATCCAATTCTTATATTCTTGTTTAATTTGACTATTAGATTGCAACCTACCAAGCTCCAGGGAGATATTGTTTGAAGTCAGAAGTTGAGTTGGAAGGCTCTAAAAAGCCTTCATCTGAAGGTGAAAGGTACTTCCTTCAACGTTTGACGGGCTTATTTCATCTGGTTGACTTAAACAGTTTTTGAGAATGTTACTTTAAATGCATTCAAACTTACCTCTTTGAGGAAGTAGAGATATCTCAACTATATTGTCTTGATATTCCAATGTATTAATCCGTTGTTAAGACATTGCTACTTTCTTGAAGAGGTTGTAGGTTCAAATCCTCATATGTGTTAGTAATATTTTTTGTTACACAAGAAAAAAAAAATGTGAAAAAATAAAAACAAAAGAACTCCATTAGAGAGTAATTACTGTGTCTCTTCGGGGTTGTCTCTTCTTCCTCTTCCTTCCTTTTGTTAAGGAGGTCCCGCTCCGTCCAAAGCTTCTAATAAATAATAGCATTCAAGGTCGAATGGCAAAGATCGAATGAGGGAAGGATTGATTCACAGGTCTAGCTGCTCCGTATCAATTCATCTTTAGTAGCAAAAGCTCTTCCTTCTTCTCTCTTATCTGTATTCTTCCATATCTGTCTTCTTCCATGTCTTCCTTCTTTCTCTGACATTCCAGCTTTTTCTATTTTCTTTGTTTACAAACGCTTTTATACGCTCTTATTAATATTAAGCCAAGTCAGTCCTTTTGCAGCTGCCGGTGTATTTGATATTTATGGATTGCTCGATATTCATCTAGGAATCTAATTCACGTTTTGTCTTTCACAGCTCTCCTTTAGTCAGCCATCAACAAACCCCGGTACATGAAGACTTCCATGATCAACCTGTTCCAGAATCGCAATTAGAAGCAAGACATGTCATTGAATTGGAGGAAAAGGTAGAAACCTCTCAAGCAAACAAAGAATCAAATTTCTTGGAGAAAAATGGCATCCAACAGAATTCACCCGATCCTTTTGCTGAGAAAAAAGGCTCTGAAAATAGCGAAGGCCAGGCAGCTAGTTCTTCTGACAATGAAAGTGACAGTGATTCTGAAAGTGATAGTAGTGATAGTGGAAGTGATAGTGGGAACCGTAGTAGGAGTAGAAGTCGAAGCCCCGTGGGTAGTGGGAGTGGGAGTAGCAGTGATAGTGAAAGTGATGCACCTTCTAATAGCAAGGAGGGTTCTGATGAGGATGTGGATATCATGACTAGTGATGATGACAAAGAACCCAAGAATAAATTGCAAGCTTCCGTACAGGGTTTCTCTGCGTCTCCTGCTGCTTGGAAAAGTCCAGATGGTGGGGCTGTGTTGAACATAGACGATGAGAAGGAAGATGGTCACGAATCTGATGCAATTGACATCGAGAAAGATTCTTCTGATGATGAGCCAGAAGCTAAAATTGATGATCGTAGTTTACCTCCTACAGGAGAAGGTGGAAGACCTGTGGAAGAATCAAGATCCTTGTCACCATACCCTGATGAATTCCAAGAGCGCCAAAACTTTATTGGGAGTTTGTTTGAGGACAGGGAAAATACTGTTGTGGAAAGTTCCAGGCATGAACAATCTGACAGCACAGATAGGATATCTAAAGGCAAGTCTAAAAGGAGCTCTGAGTTGGAGTGCTTTGAAGAGAACGCTGTTCATACTAAGAGATTAAAATTAGAAAGCTCATCTCAACAACCTGTTTCTGGTAATTGGGGAGCCCAATTACAGAGTTCTCGCAATTTATCTCCTAGTAAACTCAACAGAGATTCTGCAAGGAACCCTACCAGTCAAGTTACTAATAAAGGTGAGTTGAAGGGCAATTCTGATTTTAGACCAAAAATGGGAAACAAAGAAATAGTTTCAGAAAAAAATTGTTCAGATGTTTCACAAGCAAGTTGGAGGCCCCATGATCAAAGTGGAGTGAGGGCTGTAGATACAGCAGTTAGACCCGACAAGCATGGTGAGAGCATTGGACGTGGCGGTAAACACAGTGAAAAGGGTGGTCATGCTAATGAAAGTTTTCATGCGTATAAAGATAGATTTTATGGAAATGTTGAAAATGAAGGGATGAATGAGAAAAAAGTTTCAAGAAATTCTAGATCTGGTGGTCCAGGAGACAAACAGATACAACCCTGTGACTCCCATCTTAGTAAACCAGGTGACATAGTTGGAAAATTCAAAGATGGCAAAACGTTTTCAAGTTCGCAGATGGGGTACTCACCAAGGGATAATAATAATAGAATTAGTGCCGACAGGTCCCCAGTTAATGGAAAAGGCCGTATTCTCCAAAGAGAGCATTCAGACCTTGAATTAGGTGAACTTCGTGAGCCCTTTCCTGAGGAAGTATTGGGTAAAAAGAAATTTGAAAGAAATAATTCATCGAAACAGTTGGAGAACAAAGGGCACACTTCAGATATCTGGAGTTCAGAGTTAAATAAAGGAAAATCTAATTTGAAGGCTAGTTTAGATAATGGAAAGCGGTCCTCACCCCATATAAGTACCAAGTTTCCAAGCAATCCAGAAGTCTCAAATCAAAAGAAGATTTCAGAACATAAAGTTGAAGATTTGACGAGGGTAAACCACCGGCCTCCGCAGTCTCATCCACAAGGACCACAATATAGTTCAAGAGTAGATCACGTTGAAGTTGAAAAGCCGGTTGATGCAAATGTAAAACCTAATCAAGGGATTGGTCCAGAAAGCTGTGGGGAAAGCAACAGGAAAGCATCTGTTGGCATTTCCCAGCTGCATGATATGAAACGAGAACAGCTTCCCTCAAAAAAAGGAAGTAAAAGACAAGCACCTAATCAAATAACTGAAGTTACTGATGCACTAAAGAACCCGATATCAGCTGAGCATGAAAATAGTGATCTAAAGAGAAGAGATTCTTCTTCAGATGAAAACAGTTGTTCATATTCCAAGTATGAAAAGGACGAGCCAGAGTTGAAGGGAGCAATCAAAGATTTCTCTCAGTAAGTTTCGTTTGTAATTCATTTGAATTGTTGAGGCTACTAATTCTGTCTCACTTGTTCTTGTATTTTTTTCATTAATATCTGTGATTACTGATTAAAATGTATACTTGTTTTTAGGTACAAGGAATATGTACAGGAGTATCGTGATAAATATGAATGTTACCTGTCCTTGAACAAAATCCTAGAAAGCTACAGGTAATGCTACTTCTTTTCGTTAATGTCCAATTTTTATCCGTTCCTAGACGATGTTGCAGTCTGCCTCCTAATGTCCTGTTATTATTTTTAAGCAAGTTTGATGATTTATTTGTGTCCTGGTATGCTGTTCCTGTAAAGAAAAGATCATTCCGTTTTTAAGCATCGATCAATTATGTAGGGCTGAGTTCTGCAAACTCGGGAAGGAGCTTGATTCTTCTAGGGGACAAAATTCAGACAAATACTTTAACCTTTTAGAACAGCTGAAAGAATCTTATCGGCTGTGTTCAACGGTATGGCTGTTTCCTAGTATATTTGCTACAAGTTACTTGTTAATATTAGTAGTTTTTCATGTTCAAGTATATATTTCTCAATGTTTACTCTGGAAAATTTCTGATGTTTCATCCAGCTAGTGCTCATTGTAGTCTAGGTTTGTGTAAGCTTTTGGTTTATATTGAAAAATTTGGACTCCAGAGGGAGAGGGAGTTCATGATTGGAAGTAAAAGGTTAGGGATAGTTTAGGATTAAGGTTGACAGTAGATTCAGATGCAGCAAGGCATGGTTGACGGAATTGGTTAGAGGCAACACTTTCTCTATCTCGGTTGATTGGGTAAATTTATTTGTTCCCGTTATTTGAATATGGGTTCTATCAAAGATGGGTTTCATGGCTTGTAGTTGATATTAGCATGAATATCTTTTGTATACCTGAATCTGAATTAGCATCTTTATCTTTTTCTCTTATATTATTTTTCCTGTATAATGAAGTTGACTCAAATAAAGAGTAATTTGCAGAGGCATAAGAGGTTGAAAAAGATATTCGTTGTTCTCCACGAAGAGCTGAAGGTGATTAACATCAACTTACAAATCCACATCACCATCTTGCTTTTGTGGTTCTAACTTTTATCTCTAATTTTGTTATTTTCTTAACATGTATTTTATTGTTGCAGCATCTAAAGGAAAGGATTAAAGATTTTGCACAAACTTATGCGAAGGATTGAGATTGGATGTGTTTCTATTCGAGCAATCGGACGCTCTTTAGACGATCATGTCTGACTTGGGGTAGAGGTCGTTAAGAGGTAGAGGTGGATGCAAGAACTGCATCAATGAATTTCTGTGCATAGTAAAATGAGTCAAATCTTTTTTTTTTTTGCTTGCTACCTTCTCCCCAACCTCAATATAGAAACATGTACATTTTTTCTTTTTGCTCTCTCCCCTCTCTCTCTTGGAATGCATTTTCCTTTATTTAGCAACGTAGAGAGGTTGGACTTTTGTTTTAGTTTAGCCTGACCTCTTCCTGTATAGACTTGTGAGGCTTCTTTTGAGTATGGATATAATCTACTTTTGTAGGTTGTACAAATTTTGAACGTGTAAAAAATGTTCCCAATAACTGTGAATATGCTAATGTCATTTGCGATCATATAGGGAATTCTCTTGGAGTTGTG
mRNA sequence
AGAGGACCAAAAACAACACGGAAATCTGAGAGGCAAATTGAAATTGTTCCTGTGTTCCATCTGGGTCCCAAAACCTGGTTCGGCACCTGCATCTCTCTTCCTCACAATGTATGGCGGCCCATCCAAGCTCGCTCGGGCCGGCGGCGGCGCTGGCCGCGGAGCCAGCGGAAAGCGGCCGCCCTCCTCTTTTCCTCTACCACCTGCTCACCGCCCCTCCGGCCGTCTCTCTCTCGGCGGCGGTGGCGCCGGTTCTGGCGCAAATCCTCGGAATCGAACCTCCACCGCAGCCAAATCCGAAGCCCCTCTATCCGTCGAGGAGAATTTCAGTCTCGTTACCGGTAACAATCCTTTGGCTTTTGCTATGATAATTCGGTTGGCTCCCGACTTGATCGAAGAGATCAAGCGGGTTGAGTCGCTGGGAGGAACTCCGAGAATTAAGTTTGATGCGAATGCCAAGAATTCTAGTGGTAATGTCATTGATGTTGGGGGTAAAGAGTTTAGGTTCACATGGTCACGTGAAGTTGGTGATCTTTGTGACATATACGAAGAACGTAAAAGTGGCGAAGATGGAAGTGGTTTGCTTGTTGAATCAGGCAATGCTTGGAGAAAAGTGAATGTGCAGCGTATCTTGGATGAATCAACTACAAACCATGTTAAGAAGTTGTCTGAGGAAGCTGAACGAAGATCTAAATCTCGCAGAGCTATTGTCTTAGAACCTGGGAATCCATCTATGAAGAATCAAATAAAGCAATTGGCTGCTGCCGAATCTAATCCATGGAGGATGCATTATAAGAATAAGAAAGAGCCTCCATTTAAAAAGCAGAAAAACGAATTGTCTCAAGTTGGGCCTCCAAAATCTACATTTAAGCCTGGCATGTCATCAGTACCTGCTTCCAAGGAGAGGCTATCATCTTCACCTGTTCCATCTCCACCCGAGCAATCTGGTGCTCCAATATCTCAATTCGGATCTGCAAATCCCACTAAGACTCATTGTATTGCAGAAGATATTAAACCTCGACAACCAGCTAAGATTAATGCTGCTGCTAGCAGTGAGAAGGAAATTCCAACCAAAGCCGCAAAAGGAGTTCTGGAAGCACCAGGACAGGAAGTGAATGCCGGAGCTAAACCAACAGATTTGCAAGGAATGTTGTATAATTTACTCTTGGATAACCCCAAGGGGATGAGTTTGAAGGCATTGGAGAAAGCTGTTGGCGATAAAATCCCAAATTCTGTAAAAAAGATTGAGCCAATCATTAAAAAAATTGCAACCTACCAAGCTCCAGGGAGATATTGTTTGAAGTCAGAAGTTGAGTTGGAAGGCTCTAAAAAGCCTTCATCTGAAGGTGAAAGCTCTCCTTTAGTCAGCCATCAACAAACCCCGGTACATGAAGACTTCCATGATCAACCTGTTCCAGAATCGCAATTAGAAGCAAGACATGTCATTGAATTGGAGGAAAAGGTAGAAACCTCTCAAGCAAACAAAGAATCAAATTTCTTGGAGAAAAATGGCATCCAACAGAATTCACCCGATCCTTTTGCTGAGAAAAAAGGCTCTGAAAATAGCGAAGGCCAGGCAGCTAGTTCTTCTGACAATGAAAGTGACAGTGATTCTGAAAGTGATAGTAGTGATAGTGGAAGTGATAGTGGGAACCGTAGTAGGAGTAGAAGTCGAAGCCCCGTGGGTAGTGGGAGTGGGAGTAGCAGTGATAGTGAAAGTGATGCACCTTCTAATAGCAAGGAGGGTTCTGATGAGGATGTGGATATCATGACTAGTGATGATGACAAAGAACCCAAGAATAAATTGCAAGCTTCCGTACAGGGTTTCTCTGCGTCTCCTGCTGCTTGGAAAAGTCCAGATGGTGGGGCTGTGTTGAACATAGACGATGAGAAGGAAGATGGTCACGAATCTGATGCAATTGACATCGAGAAAGATTCTTCTGATGATGAGCCAGAAGCTAAAATTGATGATCGTAGTTTACCTCCTACAGGAGAAGGTGGAAGACCTGTGGAAGAATCAAGATCCTTGTCACCATACCCTGATGAATTCCAAGAGCGCCAAAACTTTATTGGGAGTTTGTTTGAGGACAGGGAAAATACTGTTGTGGAAAGTTCCAGGCATGAACAATCTGACAGCACAGATAGGATATCTAAAGGCAAGTCTAAAAGGAGCTCTGAGTTGGAGTGCTTTGAAGAGAACGCTGTTCATACTAAGAGATTAAAATTAGAAAGCTCATCTCAACAACCTGTTTCTGGTAATTGGGGAGCCCAATTACAGAGTTCTCGCAATTTATCTCCTAGTAAACTCAACAGAGATTCTGCAAGGAACCCTACCAGTCAAGTTACTAATAAAGGTGAGTTGAAGGGCAATTCTGATTTTAGACCAAAAATGGGAAACAAAGAAATAGTTTCAGAAAAAAATTGTTCAGATGTTTCACAAGCAAGTTGGAGGCCCCATGATCAAAGTGGAGTGAGGGCTGTAGATACAGCAGTTAGACCCGACAAGCATGGTGAGAGCATTGGACGTGGCGGTAAACACAGTGAAAAGGGTGGTCATGCTAATGAAAGTTTTCATGCGTATAAAGATAGATTTTATGGAAATGTTGAAAATGAAGGGATGAATGAGAAAAAAGTTTCAAGAAATTCTAGATCTGGTGGTCCAGGAGACAAACAGATACAACCCTGTGACTCCCATCTTAGTAAACCAGGTGACATAGTTGGAAAATTCAAAGATGGCAAAACGTTTTCAAGTTCGCAGATGGGGTACTCACCAAGGGATAATAATAATAGAATTAGTGCCGACAGGTCCCCAGTTAATGGAAAAGGCCGTATTCTCCAAAGAGAGCATTCAGACCTTGAATTAGGTGAACTTCGTGAGCCCTTTCCTGAGGAAGTATTGGGTAAAAAGAAATTTGAAAGAAATAATTCATCGAAACAGTTGGAGAACAAAGGGCACACTTCAGATATCTGGAGTTCAGAGTTAAATAAAGGAAAATCTAATTTGAAGGCTAGTTTAGATAATGGAAAGCGGTCCTCACCCCATATAAGTACCAAGTTTCCAAGCAATCCAGAAGTCTCAAATCAAAAGAAGATTTCAGAACATAAAGTTGAAGATTTGACGAGGGTAAACCACCGGCCTCCGCAGTCTCATCCACAAGGACCACAATATAGTTCAAGAGTAGATCACGTTGAAGTTGAAAAGCCGGTTGATGCAAATGTAAAACCTAATCAAGGGATTGGTCCAGAAAGCTGTGGGGAAAGCAACAGGAAAGCATCTGTTGGCATTTCCCAGCTGCATGATATGAAACGAGAACAGCTTCCCTCAAAAAAAGGAAGTAAAAGACAAGCACCTAATCAAATAACTGAAGTTACTGATGCACTAAAGAACCCGATATCAGCTGAGCATGAAAATAGTGATCTAAAGAGAAGAGATTCTTCTTCAGATGAAAACAGTTGTTCATATTCCAAGTATGAAAAGGACGAGCCAGAGTTGAAGGGAGCAATCAAAGATTTCTCTCAGTACAAGGAATATGTACAGGAGTATCGTGATAAATATGAATGTTACCTGTCCTTGAACAAAATCCTAGAAAGCTACAGGGCTGAGTTCTGCAAACTCGGGAAGGAGCTTGATTCTTCTAGGGGACAAAATTCAGACAAATACTTTAACCTTTTAGAACAGCTGAAAGAATCTTATCGGCTGTGTTCAACGAGGCATAAGAGGTTGAAAAAGATATTCGTTGTTCTCCACGAAGAGCTGAAGCATCTAAAGGAAAGGATTAAAGATTTTGCACAAACTTATGCGAAGGATTGAGATTGGATGTGTTTCTATTCGAGCAATCGGACGCTCTTTAGACGATCATGTCTGACTTGGGGTAGAGGTCGTTAAGAGGTAGAGGTGGATGCAAGAACTGCATCAATGAATTTCTGTGCATAGTAAAATGAGTCAAATCTTTTTTTTTTTTGCTTGCTACCTTCTCCCCAACCTCAATATAGAAACATGTACATTTTTTCTTTTTGCTCTCTCCCCTCTCTCTCTTGGAATGCATTTTCCTTTATTTAGCAACGTAGAGAGGTTGGACTTTTGTTTTAGTTTAGCCTGACCTCTTCCTGTATAGACTTGTGAGGCTTCTTTTGAGTATGGATATAATCTACTTTTGTAGGTTGTACAAATTTTGAACGTGTAAAAAATGTTCCCAATAACTGTGAATATGCTAATGTCATTTGCGATCATATAGGGAATTCTCTTGGAGTTGTG
Coding sequence (CDS)
ATGTATGGCGGCCCATCCAAGCTCGCTCGGGCCGGCGGCGGCGCTGGCCGCGGAGCCAGCGGAAAGCGGCCGCCCTCCTCTTTTCCTCTACCACCTGCTCACCGCCCCTCCGGCCGTCTCTCTCTCGGCGGCGGTGGCGCCGGTTCTGGCGCAAATCCTCGGAATCGAACCTCCACCGCAGCCAAATCCGAAGCCCCTCTATCCGTCGAGGAGAATTTCAGTCTCGTTACCGGTAACAATCCTTTGGCTTTTGCTATGATAATTCGGTTGGCTCCCGACTTGATCGAAGAGATCAAGCGGGTTGAGTCGCTGGGAGGAACTCCGAGAATTAAGTTTGATGCGAATGCCAAGAATTCTAGTGGTAATGTCATTGATGTTGGGGGTAAAGAGTTTAGGTTCACATGGTCACGTGAAGTTGGTGATCTTTGTGACATATACGAAGAACGTAAAAGTGGCGAAGATGGAAGTGGTTTGCTTGTTGAATCAGGCAATGCTTGGAGAAAAGTGAATGTGCAGCGTATCTTGGATGAATCAACTACAAACCATGTTAAGAAGTTGTCTGAGGAAGCTGAACGAAGATCTAAATCTCGCAGAGCTATTGTCTTAGAACCTGGGAATCCATCTATGAAGAATCAAATAAAGCAATTGGCTGCTGCCGAATCTAATCCATGGAGGATGCATTATAAGAATAAGAAAGAGCCTCCATTTAAAAAGCAGAAAAACGAATTGTCTCAAGTTGGGCCTCCAAAATCTACATTTAAGCCTGGCATGTCATCAGTACCTGCTTCCAAGGAGAGGCTATCATCTTCACCTGTTCCATCTCCACCCGAGCAATCTGGTGCTCCAATATCTCAATTCGGATCTGCAAATCCCACTAAGACTCATTGTATTGCAGAAGATATTAAACCTCGACAACCAGCTAAGATTAATGCTGCTGCTAGCAGTGAGAAGGAAATTCCAACCAAAGCCGCAAAAGGAGTTCTGGAAGCACCAGGACAGGAAGTGAATGCCGGAGCTAAACCAACAGATTTGCAAGGAATGTTGTATAATTTACTCTTGGATAACCCCAAGGGGATGAGTTTGAAGGCATTGGAGAAAGCTGTTGGCGATAAAATCCCAAATTCTGTAAAAAAGATTGAGCCAATCATTAAAAAAATTGCAACCTACCAAGCTCCAGGGAGATATTGTTTGAAGTCAGAAGTTGAGTTGGAAGGCTCTAAAAAGCCTTCATCTGAAGGTGAAAGCTCTCCTTTAGTCAGCCATCAACAAACCCCGGTACATGAAGACTTCCATGATCAACCTGTTCCAGAATCGCAATTAGAAGCAAGACATGTCATTGAATTGGAGGAAAAGGTAGAAACCTCTCAAGCAAACAAAGAATCAAATTTCTTGGAGAAAAATGGCATCCAACAGAATTCACCCGATCCTTTTGCTGAGAAAAAAGGCTCTGAAAATAGCGAAGGCCAGGCAGCTAGTTCTTCTGACAATGAAAGTGACAGTGATTCTGAAAGTGATAGTAGTGATAGTGGAAGTGATAGTGGGAACCGTAGTAGGAGTAGAAGTCGAAGCCCCGTGGGTAGTGGGAGTGGGAGTAGCAGTGATAGTGAAAGTGATGCACCTTCTAATAGCAAGGAGGGTTCTGATGAGGATGTGGATATCATGACTAGTGATGATGACAAAGAACCCAAGAATAAATTGCAAGCTTCCGTACAGGGTTTCTCTGCGTCTCCTGCTGCTTGGAAAAGTCCAGATGGTGGGGCTGTGTTGAACATAGACGATGAGAAGGAAGATGGTCACGAATCTGATGCAATTGACATCGAGAAAGATTCTTCTGATGATGAGCCAGAAGCTAAAATTGATGATCGTAGTTTACCTCCTACAGGAGAAGGTGGAAGACCTGTGGAAGAATCAAGATCCTTGTCACCATACCCTGATGAATTCCAAGAGCGCCAAAACTTTATTGGGAGTTTGTTTGAGGACAGGGAAAATACTGTTGTGGAAAGTTCCAGGCATGAACAATCTGACAGCACAGATAGGATATCTAAAGGCAAGTCTAAAAGGAGCTCTGAGTTGGAGTGCTTTGAAGAGAACGCTGTTCATACTAAGAGATTAAAATTAGAAAGCTCATCTCAACAACCTGTTTCTGGTAATTGGGGAGCCCAATTACAGAGTTCTCGCAATTTATCTCCTAGTAAACTCAACAGAGATTCTGCAAGGAACCCTACCAGTCAAGTTACTAATAAAGGTGAGTTGAAGGGCAATTCTGATTTTAGACCAAAAATGGGAAACAAAGAAATAGTTTCAGAAAAAAATTGTTCAGATGTTTCACAAGCAAGTTGGAGGCCCCATGATCAAAGTGGAGTGAGGGCTGTAGATACAGCAGTTAGACCCGACAAGCATGGTGAGAGCATTGGACGTGGCGGTAAACACAGTGAAAAGGGTGGTCATGCTAATGAAAGTTTTCATGCGTATAAAGATAGATTTTATGGAAATGTTGAAAATGAAGGGATGAATGAGAAAAAAGTTTCAAGAAATTCTAGATCTGGTGGTCCAGGAGACAAACAGATACAACCCTGTGACTCCCATCTTAGTAAACCAGGTGACATAGTTGGAAAATTCAAAGATGGCAAAACGTTTTCAAGTTCGCAGATGGGGTACTCACCAAGGGATAATAATAATAGAATTAGTGCCGACAGGTCCCCAGTTAATGGAAAAGGCCGTATTCTCCAAAGAGAGCATTCAGACCTTGAATTAGGTGAACTTCGTGAGCCCTTTCCTGAGGAAGTATTGGGTAAAAAGAAATTTGAAAGAAATAATTCATCGAAACAGTTGGAGAACAAAGGGCACACTTCAGATATCTGGAGTTCAGAGTTAAATAAAGGAAAATCTAATTTGAAGGCTAGTTTAGATAATGGAAAGCGGTCCTCACCCCATATAAGTACCAAGTTTCCAAGCAATCCAGAAGTCTCAAATCAAAAGAAGATTTCAGAACATAAAGTTGAAGATTTGACGAGGGTAAACCACCGGCCTCCGCAGTCTCATCCACAAGGACCACAATATAGTTCAAGAGTAGATCACGTTGAAGTTGAAAAGCCGGTTGATGCAAATGTAAAACCTAATCAAGGGATTGGTCCAGAAAGCTGTGGGGAAAGCAACAGGAAAGCATCTGTTGGCATTTCCCAGCTGCATGATATGAAACGAGAACAGCTTCCCTCAAAAAAAGGAAGTAAAAGACAAGCACCTAATCAAATAACTGAAGTTACTGATGCACTAAAGAACCCGATATCAGCTGAGCATGAAAATAGTGATCTAAAGAGAAGAGATTCTTCTTCAGATGAAAACAGTTGTTCATATTCCAAGTATGAAAAGGACGAGCCAGAGTTGAAGGGAGCAATCAAAGATTTCTCTCAGTACAAGGAATATGTACAGGAGTATCGTGATAAATATGAATGTTACCTGTCCTTGAACAAAATCCTAGAAAGCTACAGGGCTGAGTTCTGCAAACTCGGGAAGGAGCTTGATTCTTCTAGGGGACAAAATTCAGACAAATACTTTAACCTTTTAGAACAGCTGAAAGAATCTTATCGGCTGTGTTCAACGAGGCATAAGAGGTTGAAAAAGATATTCGTTGTTCTCCACGAAGAGCTGAAGCATCTAAAGGAAAGGATTAAAGATTTTGCACAAACTTATGCGAAGGATTGA
Protein sequence
MYGGPSKLARAGGGAGRGASGKRPPSSFPLPPAHRPSGRLSLGGGGAGSGANPRNRTSTAAKSEAPLSVEENFSLVTGNNPLAFAMIIRLAPDLIEEIKRVESLGGTPRIKFDANAKNSSGNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLVESGNAWRKVNVQRILDESTTNHVKKLSEEAERRSKSRRAIVLEPGNPSMKNQIKQLAAAESNPWRMHYKNKKEPPFKKQKNELSQVGPPKSTFKPGMSSVPASKERLSSSPVPSPPEQSGAPISQFGSANPTKTHCIAEDIKPRQPAKINAAASSEKEIPTKAAKGVLEAPGQEVNAGAKPTDLQGMLYNLLLDNPKGMSLKALEKAVGDKIPNSVKKIEPIIKKIATYQAPGRYCLKSEVELEGSKKPSSEGESSPLVSHQQTPVHEDFHDQPVPESQLEARHVIELEEKVETSQANKESNFLEKNGIQQNSPDPFAEKKGSENSEGQAASSSDNESDSDSESDSSDSGSDSGNRSRSRSRSPVGSGSGSSSDSESDAPSNSKEGSDEDVDIMTSDDDKEPKNKLQASVQGFSASPAAWKSPDGGAVLNIDDEKEDGHESDAIDIEKDSSDDEPEAKIDDRSLPPTGEGGRPVEESRSLSPYPDEFQERQNFIGSLFEDRENTVVESSRHEQSDSTDRISKGKSKRSSELECFEENAVHTKRLKLESSSQQPVSGNWGAQLQSSRNLSPSKLNRDSARNPTSQVTNKGELKGNSDFRPKMGNKEIVSEKNCSDVSQASWRPHDQSGVRAVDTAVRPDKHGESIGRGGKHSEKGGHANESFHAYKDRFYGNVENEGMNEKKVSRNSRSGGPGDKQIQPCDSHLSKPGDIVGKFKDGKTFSSSQMGYSPRDNNNRISADRSPVNGKGRILQREHSDLELGELREPFPEEVLGKKKFERNNSSKQLENKGHTSDIWSSELNKGKSNLKASLDNGKRSSPHISTKFPSNPEVSNQKKISEHKVEDLTRVNHRPPQSHPQGPQYSSRVDHVEVEKPVDANVKPNQGIGPESCGESNRKASVGISQLHDMKREQLPSKKGSKRQAPNQITEVTDALKNPISAEHENSDLKRRDSSSDENSCSYSKYEKDEPELKGAIKDFSQYKEYVQEYRDKYECYLSLNKILESYRAEFCKLGKELDSSRGQNSDKYFNLLEQLKESYRLCSTRHKRLKKIFVVLHEELKHLKERIKDFAQTYAKD
Homology
BLAST of CmoCh01G000740 vs. ExPASy TrEMBL
Match:
A0A6J1GAK2 (dentin sialophosphoprotein isoform X3 OS=Cucurbita moschata OX=3662 GN=LOC111452430 PE=4 SV=1)
HSP 1 Score: 2337.0 bits (6055), Expect = 0.0e+00
Identity = 1233/1233 (100.00%), Postives = 1233/1233 (100.00%), Query Frame = 0
Query: 1 MYGGPSKLARAGGGAGRGASGKRPPSSFPLPPAHRPSGRLSLGGGGAGSGANPRNRTSTA 60
MYGGPSKLARAGGGAGRGASGKRPPSSFPLPPAHRPSGRLSLGGGGAGSGANPRNRTSTA
Sbjct: 1 MYGGPSKLARAGGGAGRGASGKRPPSSFPLPPAHRPSGRLSLGGGGAGSGANPRNRTSTA 60
Query: 61 AKSEAPLSVEENFSLVTGNNPLAFAMIIRLAPDLIEEIKRVESLGGTPRIKFDANAKNSS 120
AKSEAPLSVEENFSLVTGNNPLAFAMIIRLAPDLIEEIKRVESLGGTPRIKFDANAKNSS
Sbjct: 61 AKSEAPLSVEENFSLVTGNNPLAFAMIIRLAPDLIEEIKRVESLGGTPRIKFDANAKNSS 120
Query: 121 GNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLVESGNAWRKVNVQRILDESTT 180
GNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLVESGNAWRKVNVQRILDESTT
Sbjct: 121 GNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLVESGNAWRKVNVQRILDESTT 180
Query: 181 NHVKKLSEEAERRSKSRRAIVLEPGNPSMKNQIKQLAAAESNPWRMHYKNKKEPPFKKQK 240
NHVKKLSEEAERRSKSRRAIVLEPGNPSMKNQIKQLAAAESNPWRMHYKNKKEPPFKKQK
Sbjct: 181 NHVKKLSEEAERRSKSRRAIVLEPGNPSMKNQIKQLAAAESNPWRMHYKNKKEPPFKKQK 240
Query: 241 NELSQVGPPKSTFKPGMSSVPASKERLSSSPVPSPPEQSGAPISQFGSANPTKTHCIAED 300
NELSQVGPPKSTFKPGMSSVPASKERLSSSPVPSPPEQSGAPISQFGSANPTKTHCIAED
Sbjct: 241 NELSQVGPPKSTFKPGMSSVPASKERLSSSPVPSPPEQSGAPISQFGSANPTKTHCIAED 300
Query: 301 IKPRQPAKINAAASSEKEIPTKAAKGVLEAPGQEVNAGAKPTDLQGMLYNLLLDNPKGMS 360
IKPRQPAKINAAASSEKEIPTKAAKGVLEAPGQEVNAGAKPTDLQGMLYNLLLDNPKGMS
Sbjct: 301 IKPRQPAKINAAASSEKEIPTKAAKGVLEAPGQEVNAGAKPTDLQGMLYNLLLDNPKGMS 360
Query: 361 LKALEKAVGDKIPNSVKKIEPIIKKIATYQAPGRYCLKSEVELEGSKKPSSEGESSPLVS 420
LKALEKAVGDKIPNSVKKIEPIIKKIATYQAPGRYCLKSEVELEGSKKPSSEGESSPLVS
Sbjct: 361 LKALEKAVGDKIPNSVKKIEPIIKKIATYQAPGRYCLKSEVELEGSKKPSSEGESSPLVS 420
Query: 421 HQQTPVHEDFHDQPVPESQLEARHVIELEEKVETSQANKESNFLEKNGIQQNSPDPFAEK 480
HQQTPVHEDFHDQPVPESQLEARHVIELEEKVETSQANKESNFLEKNGIQQNSPDPFAEK
Sbjct: 421 HQQTPVHEDFHDQPVPESQLEARHVIELEEKVETSQANKESNFLEKNGIQQNSPDPFAEK 480
Query: 481 KGSENSEGQAASSSDNESDSDSESDSSDSGSDSGNRSRSRSRSPVGSGSGSSSDSESDAP 540
KGSENSEGQAASSSDNESDSDSESDSSDSGSDSGNRSRSRSRSPVGSGSGSSSDSESDAP
Sbjct: 481 KGSENSEGQAASSSDNESDSDSESDSSDSGSDSGNRSRSRSRSPVGSGSGSSSDSESDAP 540
Query: 541 SNSKEGSDEDVDIMTSDDDKEPKNKLQASVQGFSASPAAWKSPDGGAVLNIDDEKEDGHE 600
SNSKEGSDEDVDIMTSDDDKEPKNKLQASVQGFSASPAAWKSPDGGAVLNIDDEKEDGHE
Sbjct: 541 SNSKEGSDEDVDIMTSDDDKEPKNKLQASVQGFSASPAAWKSPDGGAVLNIDDEKEDGHE 600
Query: 601 SDAIDIEKDSSDDEPEAKIDDRSLPPTGEGGRPVEESRSLSPYPDEFQERQNFIGSLFED 660
SDAIDIEKDSSDDEPEAKIDDRSLPPTGEGGRPVEESRSLSPYPDEFQERQNFIGSLFED
Sbjct: 601 SDAIDIEKDSSDDEPEAKIDDRSLPPTGEGGRPVEESRSLSPYPDEFQERQNFIGSLFED 660
Query: 661 RENTVVESSRHEQSDSTDRISKGKSKRSSELECFEENAVHTKRLKLESSSQQPVSGNWGA 720
RENTVVESSRHEQSDSTDRISKGKSKRSSELECFEENAVHTKRLKLESSSQQPVSGNWGA
Sbjct: 661 RENTVVESSRHEQSDSTDRISKGKSKRSSELECFEENAVHTKRLKLESSSQQPVSGNWGA 720
Query: 721 QLQSSRNLSPSKLNRDSARNPTSQVTNKGELKGNSDFRPKMGNKEIVSEKNCSDVSQASW 780
QLQSSRNLSPSKLNRDSARNPTSQVTNKGELKGNSDFRPKMGNKEIVSEKNCSDVSQASW
Sbjct: 721 QLQSSRNLSPSKLNRDSARNPTSQVTNKGELKGNSDFRPKMGNKEIVSEKNCSDVSQASW 780
Query: 781 RPHDQSGVRAVDTAVRPDKHGESIGRGGKHSEKGGHANESFHAYKDRFYGNVENEGMNEK 840
RPHDQSGVRAVDTAVRPDKHGESIGRGGKHSEKGGHANESFHAYKDRFYGNVENEGMNEK
Sbjct: 781 RPHDQSGVRAVDTAVRPDKHGESIGRGGKHSEKGGHANESFHAYKDRFYGNVENEGMNEK 840
Query: 841 KVSRNSRSGGPGDKQIQPCDSHLSKPGDIVGKFKDGKTFSSSQMGYSPRDNNNRISADRS 900
KVSRNSRSGGPGDKQIQPCDSHLSKPGDIVGKFKDGKTFSSSQMGYSPRDNNNRISADRS
Sbjct: 841 KVSRNSRSGGPGDKQIQPCDSHLSKPGDIVGKFKDGKTFSSSQMGYSPRDNNNRISADRS 900
Query: 901 PVNGKGRILQREHSDLELGELREPFPEEVLGKKKFERNNSSKQLENKGHTSDIWSSELNK 960
PVNGKGRILQREHSDLELGELREPFPEEVLGKKKFERNNSSKQLENKGHTSDIWSSELNK
Sbjct: 901 PVNGKGRILQREHSDLELGELREPFPEEVLGKKKFERNNSSKQLENKGHTSDIWSSELNK 960
Query: 961 GKSNLKASLDNGKRSSPHISTKFPSNPEVSNQKKISEHKVEDLTRVNHRPPQSHPQGPQY 1020
GKSNLKASLDNGKRSSPHISTKFPSNPEVSNQKKISEHKVEDLTRVNHRPPQSHPQGPQY
Sbjct: 961 GKSNLKASLDNGKRSSPHISTKFPSNPEVSNQKKISEHKVEDLTRVNHRPPQSHPQGPQY 1020
Query: 1021 SSRVDHVEVEKPVDANVKPNQGIGPESCGESNRKASVGISQLHDMKREQLPSKKGSKRQA 1080
SSRVDHVEVEKPVDANVKPNQGIGPESCGESNRKASVGISQLHDMKREQLPSKKGSKRQA
Sbjct: 1021 SSRVDHVEVEKPVDANVKPNQGIGPESCGESNRKASVGISQLHDMKREQLPSKKGSKRQA 1080
Query: 1081 PNQITEVTDALKNPISAEHENSDLKRRDSSSDENSCSYSKYEKDEPELKGAIKDFSQYKE 1140
PNQITEVTDALKNPISAEHENSDLKRRDSSSDENSCSYSKYEKDEPELKGAIKDFSQYKE
Sbjct: 1081 PNQITEVTDALKNPISAEHENSDLKRRDSSSDENSCSYSKYEKDEPELKGAIKDFSQYKE 1140
Query: 1141 YVQEYRDKYECYLSLNKILESYRAEFCKLGKELDSSRGQNSDKYFNLLEQLKESYRLCST 1200
YVQEYRDKYECYLSLNKILESYRAEFCKLGKELDSSRGQNSDKYFNLLEQLKESYRLCST
Sbjct: 1141 YVQEYRDKYECYLSLNKILESYRAEFCKLGKELDSSRGQNSDKYFNLLEQLKESYRLCST 1200
Query: 1201 RHKRLKKIFVVLHEELKHLKERIKDFAQTYAKD 1234
RHKRLKKIFVVLHEELKHLKERIKDFAQTYAKD
Sbjct: 1201 RHKRLKKIFVVLHEELKHLKERIKDFAQTYAKD 1233
BLAST of CmoCh01G000740 vs. ExPASy TrEMBL
Match:
A0A6J1GAJ0 (dentin sialophosphoprotein isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111452430 PE=4 SV=1)
HSP 1 Score: 2331.2 bits (6040), Expect = 0.0e+00
Identity = 1233/1237 (99.68%), Postives = 1233/1237 (99.68%), Query Frame = 0
Query: 1 MYGGPSKLARAGGGAGRGASGKRPPSSFPLPPAHRPSGRLSLGGGGAGSGANPRNRTSTA 60
MYGGPSKLARAGGGAGRGASGKRPPSSFPLPPAHRPSGRLSLGGGGAGSGANPRNRTSTA
Sbjct: 1 MYGGPSKLARAGGGAGRGASGKRPPSSFPLPPAHRPSGRLSLGGGGAGSGANPRNRTSTA 60
Query: 61 AKSEAPLSVEENFSLVTGNNPLAFAMIIRLAPDLIEEIKRVESLGGTPRIKFDANAKNSS 120
AKSEAPLSVEENFSLVTGNNPLAFAMIIRLAPDLIEEIKRVESLGGTPRIKFDANAKNSS
Sbjct: 61 AKSEAPLSVEENFSLVTGNNPLAFAMIIRLAPDLIEEIKRVESLGGTPRIKFDANAKNSS 120
Query: 121 GNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLVESGNAWRKVNVQRILDESTT 180
GNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLVESGNAWRKVNVQRILDESTT
Sbjct: 121 GNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLVESGNAWRKVNVQRILDESTT 180
Query: 181 NHVKKLSEEAERRSKSRRAIVLEPGNPSMKNQIKQLAAAESNPWRMHYKNKKEPPFKKQK 240
NHVKKLSEEAERRSKSRRAIVLEPGNPSMKNQIKQLAAAESNPWRMHYKNKKEPPFKKQK
Sbjct: 181 NHVKKLSEEAERRSKSRRAIVLEPGNPSMKNQIKQLAAAESNPWRMHYKNKKEPPFKKQK 240
Query: 241 NELSQVGPPKSTFKPGMSSVPASKERLSSSPVPSPPEQSGAPISQFGSANPTKTHCIAED 300
NELSQVGPPKSTFKPGMSSVPASKERLSSSPVPSPPEQSGAPISQFGSANPTKTHCIAED
Sbjct: 241 NELSQVGPPKSTFKPGMSSVPASKERLSSSPVPSPPEQSGAPISQFGSANPTKTHCIAED 300
Query: 301 IKPRQPAKINAAASSEKEIPTKAAKGVLEAPGQEVNAGAKPTDLQGMLYNLLLDNPKGMS 360
IKPRQPAKINAAASSEKEIPTKAAKGVLEAPGQEVNAGAKPTDLQGMLYNLLLDNPKGMS
Sbjct: 301 IKPRQPAKINAAASSEKEIPTKAAKGVLEAPGQEVNAGAKPTDLQGMLYNLLLDNPKGMS 360
Query: 361 LKALEKAVGDKIPNSVKKIEPIIKKIATYQAPGRYCLKSEVELEGSKKPSSEGESSPLVS 420
LKALEKAVGDKIPNSVKKIEPIIKKIATYQAPGRYCLKSEVELEGSKKPSSEGESSPLVS
Sbjct: 361 LKALEKAVGDKIPNSVKKIEPIIKKIATYQAPGRYCLKSEVELEGSKKPSSEGESSPLVS 420
Query: 421 HQQTPVHEDFHDQPVPESQLEARHVIELEEKVETSQANKESNFLEKNGIQQNSPDPFAEK 480
HQQTPVHEDFHDQPVPESQLEARHVIELEEKVETSQANKESNFLEKNGIQQNSPDPFAEK
Sbjct: 421 HQQTPVHEDFHDQPVPESQLEARHVIELEEKVETSQANKESNFLEKNGIQQNSPDPFAEK 480
Query: 481 KGSENSEGQAASSSDNESDSDSESDSSDSGSDSGNRSRSRSRSPVGSGSGSSSDSESDAP 540
KGSENSEGQAASSSDNESDSDSESDSSDSGSDSGNRSRSRSRSPVGSGSGSSSDSESDAP
Sbjct: 481 KGSENSEGQAASSSDNESDSDSESDSSDSGSDSGNRSRSRSRSPVGSGSGSSSDSESDAP 540
Query: 541 SNSKEGSDEDVDIMTSDDDKEPKNKLQASVQGFSASPAAWKSPDGGAVLNIDDEKEDGHE 600
SNSKEGSDEDVDIMTSDDDKEPKNKLQASVQGFSASPAAWKSPDGGAVLNIDDEKEDGHE
Sbjct: 541 SNSKEGSDEDVDIMTSDDDKEPKNKLQASVQGFSASPAAWKSPDGGAVLNIDDEKEDGHE 600
Query: 601 SDAIDIEKDSSDDEPEAKIDDRSLPPTGEGGRPVEESRSLSPYPDEFQERQNFIGSLFED 660
SDAIDIEKDSSDDEPEAKIDDRSLPPTGEGGRPVEESRSLSPYPDEFQERQNFIGSLFED
Sbjct: 601 SDAIDIEKDSSDDEPEAKIDDRSLPPTGEGGRPVEESRSLSPYPDEFQERQNFIGSLFED 660
Query: 661 RENTVVESSRHEQSDSTDRISKGKSKRSSELECFEENAVHTKRLKLESSSQQPVSGNWGA 720
RENTVVESSRHEQSDSTDRISKGKSKRSSELECFEENAVHTKRLKLESSSQQPVSGNWGA
Sbjct: 661 RENTVVESSRHEQSDSTDRISKGKSKRSSELECFEENAVHTKRLKLESSSQQPVSGNWGA 720
Query: 721 QLQSSRNLSPSKLNRDSARNPTSQVTNKGELKGNSDFRPKMGNKEIVSEKNCSDVSQASW 780
QLQSSRNLSPSKLNRDSARNPTSQVTNKGELKGNSDFRPKMGNKEIVSEKNCSDVSQASW
Sbjct: 721 QLQSSRNLSPSKLNRDSARNPTSQVTNKGELKGNSDFRPKMGNKEIVSEKNCSDVSQASW 780
Query: 781 RPHDQSGVRAVDTAVRPDKHGESIGRGGKHSEKGGHANESFHAYKDRFYGNVENEGMNEK 840
RPHDQSGVRAVDTAVRPDKHGESIGRGGKHSEKGGHANESFHAYKDRFYGNVENEGMNEK
Sbjct: 781 RPHDQSGVRAVDTAVRPDKHGESIGRGGKHSEKGGHANESFHAYKDRFYGNVENEGMNEK 840
Query: 841 KVSRNSRSGGPGDKQIQPCDSHLSKPGDIVGKFKDGKTFSSSQMGYSPRDNNNRISADRS 900
KVSRNSRSGGPGDKQIQPCDSHLSKPGDIVGKFKDGKTFSSSQMGYSPRDNNNRISADRS
Sbjct: 841 KVSRNSRSGGPGDKQIQPCDSHLSKPGDIVGKFKDGKTFSSSQMGYSPRDNNNRISADRS 900
Query: 901 PVNGKGRILQREHSDLELGELREPFPEEVLGKKKFERNNSSKQLENKGHTSDIWSSELNK 960
PVNGKGRILQREHSDLELGELREPFPEEVLGKKKFERNNSSKQLENKGHTSDIWSSELNK
Sbjct: 901 PVNGKGRILQREHSDLELGELREPFPEEVLGKKKFERNNSSKQLENKGHTSDIWSSELNK 960
Query: 961 GKSNLKASLDNGKRSSPHISTKFPSNPEVSNQKKISEHKVEDLTRVNHRPPQSHPQGPQY 1020
GKSNLKASLDNGKRSSPHISTKFPSNPEVSNQKKISEHKVEDLTRVNHRPPQSHPQGPQY
Sbjct: 961 GKSNLKASLDNGKRSSPHISTKFPSNPEVSNQKKISEHKVEDLTRVNHRPPQSHPQGPQY 1020
Query: 1021 SSRVDHVEVEKPVDANVKPNQGIGPESCGESNRKASVGISQLHDMKREQLPSKKGSKRQA 1080
SSRVDHVEVEKPVDANVKPNQGIGPESCGESNRKASVGISQLHDMKREQLPSKKGSKRQA
Sbjct: 1021 SSRVDHVEVEKPVDANVKPNQGIGPESCGESNRKASVGISQLHDMKREQLPSKKGSKRQA 1080
Query: 1081 PNQITEVTDALKNPISAEHENSDLKRRDSSSDENSCSYSKYEKDEPELKGAIKDFSQYKE 1140
PNQITEVTDALKNPISAEHENSDLKRRDSSSDENSCSYSKYEKDEPELKGAIKDFSQYKE
Sbjct: 1081 PNQITEVTDALKNPISAEHENSDLKRRDSSSDENSCSYSKYEKDEPELKGAIKDFSQYKE 1140
Query: 1141 YVQEYRDKYECYLSLNKILESYRAEFCKLGKELDSSRGQNSDKYFNLLEQLKESYRLCST 1200
YVQEYRDKYECYLSLNKILESYRAEFCKLGKELDSSRGQNSDKYFNLLEQLKESYRLCST
Sbjct: 1141 YVQEYRDKYECYLSLNKILESYRAEFCKLGKELDSSRGQNSDKYFNLLEQLKESYRLCST 1200
Query: 1201 ----RHKRLKKIFVVLHEELKHLKERIKDFAQTYAKD 1234
RHKRLKKIFVVLHEELKHLKERIKDFAQTYAKD
Sbjct: 1201 SNLQRHKRLKKIFVVLHEELKHLKERIKDFAQTYAKD 1237
BLAST of CmoCh01G000740 vs. ExPASy TrEMBL
Match:
A0A6J1GB91 (dentin sialophosphoprotein isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111452430 PE=4 SV=1)
HSP 1 Score: 2329.3 bits (6035), Expect = 0.0e+00
Identity = 1233/1242 (99.28%), Postives = 1233/1242 (99.28%), Query Frame = 0
Query: 1 MYGGPSKLARAGGGAGRGASGKRPPSSFPLPPAHRPSGRLSLGGGGAGSGANPRNRTSTA 60
MYGGPSKLARAGGGAGRGASGKRPPSSFPLPPAHRPSGRLSLGGGGAGSGANPRNRTSTA
Sbjct: 1 MYGGPSKLARAGGGAGRGASGKRPPSSFPLPPAHRPSGRLSLGGGGAGSGANPRNRTSTA 60
Query: 61 AKSEAPLSVEENFSLVTGNNPLAFAMIIRLAPDLIEEIKRVESLGGTPRIKFDANAKNSS 120
AKSEAPLSVEENFSLVTGNNPLAFAMIIRLAPDLIEEIKRVESLGGTPRIKFDANAKNSS
Sbjct: 61 AKSEAPLSVEENFSLVTGNNPLAFAMIIRLAPDLIEEIKRVESLGGTPRIKFDANAKNSS 120
Query: 121 GNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLVESGNAWRKVNVQRILDESTT 180
GNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLVESGNAWRKVNVQRILDESTT
Sbjct: 121 GNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLVESGNAWRKVNVQRILDESTT 180
Query: 181 NHVKKLSEEAERRSKSRRAIVLEPGNPSMKNQIKQLAAAESNPWRMHYKNKKEPPFKKQK 240
NHVKKLSEEAERRSKSRRAIVLEPGNPSMKNQIKQLAAAESNPWRMHYKNKKEPPFKKQK
Sbjct: 181 NHVKKLSEEAERRSKSRRAIVLEPGNPSMKNQIKQLAAAESNPWRMHYKNKKEPPFKKQK 240
Query: 241 NELSQVGPPKSTFKPGMSSVPASKERLSSSPVPSPPEQSGAPISQFGSANPTKTHCIAED 300
NELSQVGPPKSTFKPGMSSVPASKERLSSSPVPSPPEQSGAPISQFGSANPTKTHCIAED
Sbjct: 241 NELSQVGPPKSTFKPGMSSVPASKERLSSSPVPSPPEQSGAPISQFGSANPTKTHCIAED 300
Query: 301 IKPRQPAKINAAASSEKEIPTKAAKGVLEAPGQEVNAGAKPTDLQGMLYNLLLDNPKGMS 360
IKPRQPAKINAAASSEKEIPTKAAKGVLEAPGQEVNAGAKPTDLQGMLYNLLLDNPKGMS
Sbjct: 301 IKPRQPAKINAAASSEKEIPTKAAKGVLEAPGQEVNAGAKPTDLQGMLYNLLLDNPKGMS 360
Query: 361 LKALEKAVGDKIPNSVKKIEPIIKKIATYQAPGRYCLKSEVELEGSKKPSSEGESSPLVS 420
LKALEKAVGDKIPNSVKKIEPIIKKIATYQAPGRYCLKSEVELEGSKKPSSEGESSPLVS
Sbjct: 361 LKALEKAVGDKIPNSVKKIEPIIKKIATYQAPGRYCLKSEVELEGSKKPSSEGESSPLVS 420
Query: 421 HQQTPVHEDFHDQPVPESQLEARHVIELEEKVETSQANKESNFLEKNGIQQNSPDPFAEK 480
HQQTPVHEDFHDQPVPESQLEARHVIELEEKVETSQANKESNFLEKNGIQQNSPDPFAEK
Sbjct: 421 HQQTPVHEDFHDQPVPESQLEARHVIELEEKVETSQANKESNFLEKNGIQQNSPDPFAEK 480
Query: 481 KGSENSEGQAASSSDNESDSDSESDSSDSGSDSGNRSRSRSRSPVGSGSGSSSDSESDAP 540
KGSENSEGQAASSSDNESDSDSESDSSDSGSDSGNRSRSRSRSPVGSGSGSSSDSESDAP
Sbjct: 481 KGSENSEGQAASSSDNESDSDSESDSSDSGSDSGNRSRSRSRSPVGSGSGSSSDSESDAP 540
Query: 541 SNSKEGSDEDVDIMTSDDDKEPKNKLQASVQGFSASPAAWKSPDGGAVLNIDDEKEDGHE 600
SNSKEGSDEDVDIMTSDDDKEPKNKLQASVQGFSASPAAWKSPDGGAVLNIDDEKEDGHE
Sbjct: 541 SNSKEGSDEDVDIMTSDDDKEPKNKLQASVQGFSASPAAWKSPDGGAVLNIDDEKEDGHE 600
Query: 601 SDAIDIEKDSSDDEPEAKIDDRSLPPTGEGGRPVEESRSLSPYPDEFQERQNFIGSLFED 660
SDAIDIEKDSSDDEPEAKIDDRSLPPTGEGGRPVEESRSLSPYPDEFQERQNFIGSLFED
Sbjct: 601 SDAIDIEKDSSDDEPEAKIDDRSLPPTGEGGRPVEESRSLSPYPDEFQERQNFIGSLFED 660
Query: 661 RENTVVESSRHEQSDSTDRISKGKSKRSSELECFEENAVHTKRLKLESSSQQPVSGNWGA 720
RENTVVESSRHEQSDSTDRISKGKSKRSSELECFEENAVHTKRLKLESSSQQPVSGNWGA
Sbjct: 661 RENTVVESSRHEQSDSTDRISKGKSKRSSELECFEENAVHTKRLKLESSSQQPVSGNWGA 720
Query: 721 QLQSSRNLSPSKLNRDSARNPTSQVTNKGELKGNSDFRPKMGNKEIVSEKNCSDVSQASW 780
QLQSSRNLSPSKLNRDSARNPTSQVTNKGELKGNSDFRPKMGNKEIVSEKNCSDVSQASW
Sbjct: 721 QLQSSRNLSPSKLNRDSARNPTSQVTNKGELKGNSDFRPKMGNKEIVSEKNCSDVSQASW 780
Query: 781 RPHDQSGVRAVDTAVRPDKHGESIGRGGKHSEKGGHANESFHAYKDRFYGNVENEGMNEK 840
RPHDQSGVRAVDTAVRPDKHGESIGRGGKHSEKGGHANESFHAYKDRFYGNVENEGMNEK
Sbjct: 781 RPHDQSGVRAVDTAVRPDKHGESIGRGGKHSEKGGHANESFHAYKDRFYGNVENEGMNEK 840
Query: 841 KVSRNSRSGGPGDKQIQPCDSHLSKPGDIVGKFKDGKTFSSSQMGYSPRDNNNRISADRS 900
KVSRNSRSGGPGDKQIQPCDSHLSKPGDIVGKFKDGKTFSSSQMGYSPRDNNNRISADRS
Sbjct: 841 KVSRNSRSGGPGDKQIQPCDSHLSKPGDIVGKFKDGKTFSSSQMGYSPRDNNNRISADRS 900
Query: 901 PVNGKGRILQREHSDLELGELREPFPEEVLGKKKFERNNSSKQLENKGHTSDIWSSELNK 960
PVNGKGRILQREHSDLELGELREPFPEEVLGKKKFERNNSSKQLENKGHTSDIWSSELNK
Sbjct: 901 PVNGKGRILQREHSDLELGELREPFPEEVLGKKKFERNNSSKQLENKGHTSDIWSSELNK 960
Query: 961 GKSNLKASLDNGKRSSPHISTKFPSNPEVSNQKKISEHKVEDLTRVNHRPPQSHPQGPQY 1020
GKSNLKASLDNGKRSSPHISTKFPSNPEVSNQKKISEHKVEDLTRVNHRPPQSHPQGPQY
Sbjct: 961 GKSNLKASLDNGKRSSPHISTKFPSNPEVSNQKKISEHKVEDLTRVNHRPPQSHPQGPQY 1020
Query: 1021 SSRVDHVEVEKPVDANVKPNQGIGPESCGESNRKASVGISQLHDMKREQLPSKKGSKRQA 1080
SSRVDHVEVEKPVDANVKPNQGIGPESCGESNRKASVGISQLHDMKREQLPSKKGSKRQA
Sbjct: 1021 SSRVDHVEVEKPVDANVKPNQGIGPESCGESNRKASVGISQLHDMKREQLPSKKGSKRQA 1080
Query: 1081 PNQITEVTDALKNPISAEHENSDLKRRDSSSDENSCSYSKYEKDEPELKGAIKDFSQYKE 1140
PNQITEVTDALKNPISAEHENSDLKRRDSSSDENSCSYSKYEKDEPELKGAIKDFSQYKE
Sbjct: 1081 PNQITEVTDALKNPISAEHENSDLKRRDSSSDENSCSYSKYEKDEPELKGAIKDFSQYKE 1140
Query: 1141 YVQEYRDKYECYLSLNKILESYRAEFCKLGKELDSSRGQNSDKYFNLLEQLKESYRLCST 1200
YVQEYRDKYECYLSLNKILESYRAEFCKLGKELDSSRGQNSDKYFNLLEQLKESYRLCST
Sbjct: 1141 YVQEYRDKYECYLSLNKILESYRAEFCKLGKELDSSRGQNSDKYFNLLEQLKESYRLCST 1200
Query: 1201 ---------RHKRLKKIFVVLHEELKHLKERIKDFAQTYAKD 1234
RHKRLKKIFVVLHEELKHLKERIKDFAQTYAKD
Sbjct: 1201 LTQIKSNLQRHKRLKKIFVVLHEELKHLKERIKDFAQTYAKD 1242
BLAST of CmoCh01G000740 vs. ExPASy TrEMBL
Match:
A0A6J1KCU5 (dentin sialophosphoprotein isoform X3 OS=Cucurbita maxima OX=3661 GN=LOC111492713 PE=4 SV=1)
HSP 1 Score: 2277.3 bits (5900), Expect = 0.0e+00
Identity = 1204/1233 (97.65%), Postives = 1215/1233 (98.54%), Query Frame = 0
Query: 1 MYGGPSKLARAGGGAGRGASGKRPPSSFPLPPAHRPSGRLSLGGGGAGSGANPRNRTSTA 60
MYGGPSKLARAGGGAGRGASGKRPPSSFPLPPAHRPSGRLSLGGGGAGSGANPRNRTSTA
Sbjct: 1 MYGGPSKLARAGGGAGRGASGKRPPSSFPLPPAHRPSGRLSLGGGGAGSGANPRNRTSTA 60
Query: 61 AKSEAPLSVEENFSLVTGNNPLAFAMIIRLAPDLIEEIKRVESLGGTPRIKFDANAKNSS 120
AKSEAP SVEENFSLVTGNNPLAFAMIIRLAPDLIEEIKRVES GGTPRIKFDANAKNSS
Sbjct: 61 AKSEAPQSVEENFSLVTGNNPLAFAMIIRLAPDLIEEIKRVESQGGTPRIKFDANAKNSS 120
Query: 121 GNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLVESGNAWRKVNVQRILDESTT 180
GNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLVESGNAWRKVNVQRILDESTT
Sbjct: 121 GNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLVESGNAWRKVNVQRILDESTT 180
Query: 181 NHVKKLSEEAERRSKSRRAIVLEPGNPSMKNQIKQLAAAESNPWRMHYKNKKEPPFKKQK 240
NHVKKLSEEAERRSKSRRAIVLEPGN SMKNQIKQLAAAESNPWRMHYKNKKEPPFKKQK
Sbjct: 181 NHVKKLSEEAERRSKSRRAIVLEPGNLSMKNQIKQLAAAESNPWRMHYKNKKEPPFKKQK 240
Query: 241 NELSQVGPPKSTFKPGMSSVPASKERLSSSPVPSPPEQSGAPISQFGSANPTKTHCIAED 300
NELSQVGPPKSTFKPGMSSVPASKERLSSSPVPSPPEQ GAPISQFGSANPTKTHCIAED
Sbjct: 241 NELSQVGPPKSTFKPGMSSVPASKERLSSSPVPSPPEQFGAPISQFGSANPTKTHCIAED 300
Query: 301 IKPRQPAKINAAASSEKEIPTKAAKGVLEAPGQEVNAGAKPTDLQGMLYNLLLDNPKGMS 360
IKPRQPAKINAAASSEKEIPTKAAKGVLEAPGQEVNAGAKPTDLQGMLYNLLLDNPKGMS
Sbjct: 301 IKPRQPAKINAAASSEKEIPTKAAKGVLEAPGQEVNAGAKPTDLQGMLYNLLLDNPKGMS 360
Query: 361 LKALEKAVGDKIPNSVKKIEPIIKKIATYQAPGRYCLKSEVELEGSKKPSSEGESSPLVS 420
LKALEKAVGDKIPNSVKKIEPIIKKIATYQAPGRYCLKSEVELEGSKKPSSEGESSPLVS
Sbjct: 361 LKALEKAVGDKIPNSVKKIEPIIKKIATYQAPGRYCLKSEVELEGSKKPSSEGESSPLVS 420
Query: 421 HQQTPVHEDFHDQPVPESQLEARHVIELEEKVETSQANKESNFLEKNGIQQNSPDPFAEK 480
HQQTPVHEDFHDQPVPESQLEARHVIELEEKVETSQANKESNFLEKNGIQQNSPDPFAEK
Sbjct: 421 HQQTPVHEDFHDQPVPESQLEARHVIELEEKVETSQANKESNFLEKNGIQQNSPDPFAEK 480
Query: 481 KGSENSEGQAASSSDNESDSDSESDSSDSGSDSGNRSRSRSRSPVGSGSGSSSDSESDAP 540
KGSENSEG+AA+SSDNESDSDSESDSSDSGSDSGNRSRSRSRSPVGSGSGSSSDSESDAP
Sbjct: 481 KGSENSEGEAATSSDNESDSDSESDSSDSGSDSGNRSRSRSRSPVGSGSGSSSDSESDAP 540
Query: 541 SNSKEGSDEDVDIMTSDDDKEPKNKLQASVQGFSASPAAWKSPDGGAVLNIDDEKEDGHE 600
SNSKEGSDEDVDIMTSDDDKEPK+KLQASVQGFSASPAAWKSPDGGAVLNIDDEKEDGHE
Sbjct: 541 SNSKEGSDEDVDIMTSDDDKEPKHKLQASVQGFSASPAAWKSPDGGAVLNIDDEKEDGHE 600
Query: 601 SDAIDIEKDSSDDEPEAKIDDRSLPPTGEGGRPVEESRSLSPYPDEFQERQNFIGSLFED 660
SDAIDIEKDSSDDEPEAKIDDRSLPPTGEGGR VEESRSLSPYPDEFQERQNFIGSLFED
Sbjct: 601 SDAIDIEKDSSDDEPEAKIDDRSLPPTGEGGRTVEESRSLSPYPDEFQERQNFIGSLFED 660
Query: 661 RENTVVESSRHEQSDSTDRISKGKSKRSSELECFEENAVHTKRLKLESSSQQPVSGNWGA 720
RENTVV+S RHEQSDSTDRISKGKSKRSSELECFEENAVHTKRLKLESSSQQPVSGNWG
Sbjct: 661 RENTVVDSGRHEQSDSTDRISKGKSKRSSELECFEENAVHTKRLKLESSSQQPVSGNWGV 720
Query: 721 QLQSSRNLSPSKLNRDSARNPTSQVTNKGELKGNSDFRPKMGNKEIVSEKNCSDVSQASW 780
QLQSSRNLSPSKLNRDSARNPT+QVTNKGELKGNSDFRPKMGNKEIVSEKNCSDVSQASW
Sbjct: 721 QLQSSRNLSPSKLNRDSARNPTNQVTNKGELKGNSDFRPKMGNKEIVSEKNCSDVSQASW 780
Query: 781 RPHDQSGVRAVDTAVRPDKHGESIGRGGKHSEKGGHANESFHAYKDRFYGNVENEGMNEK 840
RPHDQSGVRAVDTAVRPDKHGESIGRGGKHSEKGGHANESFHAYKDRFYGN ENE MNEK
Sbjct: 781 RPHDQSGVRAVDTAVRPDKHGESIGRGGKHSEKGGHANESFHAYKDRFYGNAENEEMNEK 840
Query: 841 KVSRNSRSGGPGDKQIQPCDSHLSKPGDIVGKFKDGKTFSSSQMGYSPRDNNNRISADRS 900
KVSRNSRSGGPGDKQIQP DSHLSKPGDIVGKFKDGKTF SSQMGYSPRDNNNRISADRS
Sbjct: 841 KVSRNSRSGGPGDKQIQPSDSHLSKPGDIVGKFKDGKTFPSSQMGYSPRDNNNRISADRS 900
Query: 901 PVNGKGRILQREHSDLELGELREPFPEEVLGKKKFERNNSSKQLENKGHTSDIWSSELNK 960
PVNGKGRILQREHSDLELGELREPFPEE LGKKKFERNNSSKQLENK HTSDIWSSELNK
Sbjct: 901 PVNGKGRILQREHSDLELGELREPFPEEALGKKKFERNNSSKQLENKEHTSDIWSSELNK 960
Query: 961 GKSNLKASLDNGKRSSPHISTKFPSNPEVSNQKKISEHKVEDLTRVNHRPPQSHPQGPQY 1020
GKSNLKASLDNGKRSSPHISTKFPSNPE SN+KKISEHKVEDLTR+NHRPPQSHPQG QY
Sbjct: 961 GKSNLKASLDNGKRSSPHISTKFPSNPEGSNKKKISEHKVEDLTRINHRPPQSHPQGSQY 1020
Query: 1021 SSRVDHVEVEKPVDANVKPNQGIGPESCGESNRKASVGISQLHDMKREQLPSKKGSKRQA 1080
SSRVDHVEVEKPVDANVK NQGIGPESCGESNRKASVG+SQLHDMKREQLPSKKGSKRQA
Sbjct: 1021 SSRVDHVEVEKPVDANVKLNQGIGPESCGESNRKASVGMSQLHDMKREQLPSKKGSKRQA 1080
Query: 1081 PNQITEVTDALKNPISAEHENSDLKRRDSSSDENSCSYSKYEKDEPELKGAIKDFSQYKE 1140
PNQI EVTDALKNPISAEHENSDLKRRDSSSDENSCSYSKYEKDEPELKGAIKDFSQYKE
Sbjct: 1081 PNQIIEVTDALKNPISAEHENSDLKRRDSSSDENSCSYSKYEKDEPELKGAIKDFSQYKE 1140
Query: 1141 YVQEYRDKYECYLSLNKILESYRAEFCKLGKELDSSRGQNSDKYFNLLEQLKESYRLCST 1200
YVQEYRDKYECYLSLNKILESYRAEFCKLGKELDS+RGQ+SDKYFNLL QLKESYRLCST
Sbjct: 1141 YVQEYRDKYECYLSLNKILESYRAEFCKLGKELDSARGQDSDKYFNLLGQLKESYRLCST 1200
Query: 1201 RHKRLKKIFVVLHEELKHLKERIKDFAQTYAKD 1234
RHKRLKKIF+VLHEELKHLKERIKDFAQTYAKD
Sbjct: 1201 RHKRLKKIFIVLHEELKHLKERIKDFAQTYAKD 1233
BLAST of CmoCh01G000740 vs. ExPASy TrEMBL
Match:
A0A6J1KF98 (dentin sialophosphoprotein isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111492713 PE=4 SV=1)
HSP 1 Score: 2271.5 bits (5885), Expect = 0.0e+00
Identity = 1204/1237 (97.33%), Postives = 1215/1237 (98.22%), Query Frame = 0
Query: 1 MYGGPSKLARAGGGAGRGASGKRPPSSFPLPPAHRPSGRLSLGGGGAGSGANPRNRTSTA 60
MYGGPSKLARAGGGAGRGASGKRPPSSFPLPPAHRPSGRLSLGGGGAGSGANPRNRTSTA
Sbjct: 1 MYGGPSKLARAGGGAGRGASGKRPPSSFPLPPAHRPSGRLSLGGGGAGSGANPRNRTSTA 60
Query: 61 AKSEAPLSVEENFSLVTGNNPLAFAMIIRLAPDLIEEIKRVESLGGTPRIKFDANAKNSS 120
AKSEAP SVEENFSLVTGNNPLAFAMIIRLAPDLIEEIKRVES GGTPRIKFDANAKNSS
Sbjct: 61 AKSEAPQSVEENFSLVTGNNPLAFAMIIRLAPDLIEEIKRVESQGGTPRIKFDANAKNSS 120
Query: 121 GNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLVESGNAWRKVNVQRILDESTT 180
GNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLVESGNAWRKVNVQRILDESTT
Sbjct: 121 GNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLVESGNAWRKVNVQRILDESTT 180
Query: 181 NHVKKLSEEAERRSKSRRAIVLEPGNPSMKNQIKQLAAAESNPWRMHYKNKKEPPFKKQK 240
NHVKKLSEEAERRSKSRRAIVLEPGN SMKNQIKQLAAAESNPWRMHYKNKKEPPFKKQK
Sbjct: 181 NHVKKLSEEAERRSKSRRAIVLEPGNLSMKNQIKQLAAAESNPWRMHYKNKKEPPFKKQK 240
Query: 241 NELSQVGPPKSTFKPGMSSVPASKERLSSSPVPSPPEQSGAPISQFGSANPTKTHCIAED 300
NELSQVGPPKSTFKPGMSSVPASKERLSSSPVPSPPEQ GAPISQFGSANPTKTHCIAED
Sbjct: 241 NELSQVGPPKSTFKPGMSSVPASKERLSSSPVPSPPEQFGAPISQFGSANPTKTHCIAED 300
Query: 301 IKPRQPAKINAAASSEKEIPTKAAKGVLEAPGQEVNAGAKPTDLQGMLYNLLLDNPKGMS 360
IKPRQPAKINAAASSEKEIPTKAAKGVLEAPGQEVNAGAKPTDLQGMLYNLLLDNPKGMS
Sbjct: 301 IKPRQPAKINAAASSEKEIPTKAAKGVLEAPGQEVNAGAKPTDLQGMLYNLLLDNPKGMS 360
Query: 361 LKALEKAVGDKIPNSVKKIEPIIKKIATYQAPGRYCLKSEVELEGSKKPSSEGESSPLVS 420
LKALEKAVGDKIPNSVKKIEPIIKKIATYQAPGRYCLKSEVELEGSKKPSSEGESSPLVS
Sbjct: 361 LKALEKAVGDKIPNSVKKIEPIIKKIATYQAPGRYCLKSEVELEGSKKPSSEGESSPLVS 420
Query: 421 HQQTPVHEDFHDQPVPESQLEARHVIELEEKVETSQANKESNFLEKNGIQQNSPDPFAEK 480
HQQTPVHEDFHDQPVPESQLEARHVIELEEKVETSQANKESNFLEKNGIQQNSPDPFAEK
Sbjct: 421 HQQTPVHEDFHDQPVPESQLEARHVIELEEKVETSQANKESNFLEKNGIQQNSPDPFAEK 480
Query: 481 KGSENSEGQAASSSDNESDSDSESDSSDSGSDSGNRSRSRSRSPVGSGSGSSSDSESDAP 540
KGSENSEG+AA+SSDNESDSDSESDSSDSGSDSGNRSRSRSRSPVGSGSGSSSDSESDAP
Sbjct: 481 KGSENSEGEAATSSDNESDSDSESDSSDSGSDSGNRSRSRSRSPVGSGSGSSSDSESDAP 540
Query: 541 SNSKEGSDEDVDIMTSDDDKEPKNKLQASVQGFSASPAAWKSPDGGAVLNIDDEKEDGHE 600
SNSKEGSDEDVDIMTSDDDKEPK+KLQASVQGFSASPAAWKSPDGGAVLNIDDEKEDGHE
Sbjct: 541 SNSKEGSDEDVDIMTSDDDKEPKHKLQASVQGFSASPAAWKSPDGGAVLNIDDEKEDGHE 600
Query: 601 SDAIDIEKDSSDDEPEAKIDDRSLPPTGEGGRPVEESRSLSPYPDEFQERQNFIGSLFED 660
SDAIDIEKDSSDDEPEAKIDDRSLPPTGEGGR VEESRSLSPYPDEFQERQNFIGSLFED
Sbjct: 601 SDAIDIEKDSSDDEPEAKIDDRSLPPTGEGGRTVEESRSLSPYPDEFQERQNFIGSLFED 660
Query: 661 RENTVVESSRHEQSDSTDRISKGKSKRSSELECFEENAVHTKRLKLESSSQQPVSGNWGA 720
RENTVV+S RHEQSDSTDRISKGKSKRSSELECFEENAVHTKRLKLESSSQQPVSGNWG
Sbjct: 661 RENTVVDSGRHEQSDSTDRISKGKSKRSSELECFEENAVHTKRLKLESSSQQPVSGNWGV 720
Query: 721 QLQSSRNLSPSKLNRDSARNPTSQVTNKGELKGNSDFRPKMGNKEIVSEKNCSDVSQASW 780
QLQSSRNLSPSKLNRDSARNPT+QVTNKGELKGNSDFRPKMGNKEIVSEKNCSDVSQASW
Sbjct: 721 QLQSSRNLSPSKLNRDSARNPTNQVTNKGELKGNSDFRPKMGNKEIVSEKNCSDVSQASW 780
Query: 781 RPHDQSGVRAVDTAVRPDKHGESIGRGGKHSEKGGHANESFHAYKDRFYGNVENEGMNEK 840
RPHDQSGVRAVDTAVRPDKHGESIGRGGKHSEKGGHANESFHAYKDRFYGN ENE MNEK
Sbjct: 781 RPHDQSGVRAVDTAVRPDKHGESIGRGGKHSEKGGHANESFHAYKDRFYGNAENEEMNEK 840
Query: 841 KVSRNSRSGGPGDKQIQPCDSHLSKPGDIVGKFKDGKTFSSSQMGYSPRDNNNRISADRS 900
KVSRNSRSGGPGDKQIQP DSHLSKPGDIVGKFKDGKTF SSQMGYSPRDNNNRISADRS
Sbjct: 841 KVSRNSRSGGPGDKQIQPSDSHLSKPGDIVGKFKDGKTFPSSQMGYSPRDNNNRISADRS 900
Query: 901 PVNGKGRILQREHSDLELGELREPFPEEVLGKKKFERNNSSKQLENKGHTSDIWSSELNK 960
PVNGKGRILQREHSDLELGELREPFPEE LGKKKFERNNSSKQLENK HTSDIWSSELNK
Sbjct: 901 PVNGKGRILQREHSDLELGELREPFPEEALGKKKFERNNSSKQLENKEHTSDIWSSELNK 960
Query: 961 GKSNLKASLDNGKRSSPHISTKFPSNPEVSNQKKISEHKVEDLTRVNHRPPQSHPQGPQY 1020
GKSNLKASLDNGKRSSPHISTKFPSNPE SN+KKISEHKVEDLTR+NHRPPQSHPQG QY
Sbjct: 961 GKSNLKASLDNGKRSSPHISTKFPSNPEGSNKKKISEHKVEDLTRINHRPPQSHPQGSQY 1020
Query: 1021 SSRVDHVEVEKPVDANVKPNQGIGPESCGESNRKASVGISQLHDMKREQLPSKKGSKRQA 1080
SSRVDHVEVEKPVDANVK NQGIGPESCGESNRKASVG+SQLHDMKREQLPSKKGSKRQA
Sbjct: 1021 SSRVDHVEVEKPVDANVKLNQGIGPESCGESNRKASVGMSQLHDMKREQLPSKKGSKRQA 1080
Query: 1081 PNQITEVTDALKNPISAEHENSDLKRRDSSSDENSCSYSKYEKDEPELKGAIKDFSQYKE 1140
PNQI EVTDALKNPISAEHENSDLKRRDSSSDENSCSYSKYEKDEPELKGAIKDFSQYKE
Sbjct: 1081 PNQIIEVTDALKNPISAEHENSDLKRRDSSSDENSCSYSKYEKDEPELKGAIKDFSQYKE 1140
Query: 1141 YVQEYRDKYECYLSLNKILESYRAEFCKLGKELDSSRGQNSDKYFNLLEQLKESYRLCST 1200
YVQEYRDKYECYLSLNKILESYRAEFCKLGKELDS+RGQ+SDKYFNLL QLKESYRLCST
Sbjct: 1141 YVQEYRDKYECYLSLNKILESYRAEFCKLGKELDSARGQDSDKYFNLLGQLKESYRLCST 1200
Query: 1201 ----RHKRLKKIFVVLHEELKHLKERIKDFAQTYAKD 1234
RHKRLKKIF+VLHEELKHLKERIKDFAQTYAKD
Sbjct: 1201 SNLQRHKRLKKIFIVLHEELKHLKERIKDFAQTYAKD 1237
BLAST of CmoCh01G000740 vs. NCBI nr
Match:
XP_022948917.1 (dentin sialophosphoprotein isoform X3 [Cucurbita moschata])
HSP 1 Score: 2337.0 bits (6055), Expect = 0.0e+00
Identity = 1233/1233 (100.00%), Postives = 1233/1233 (100.00%), Query Frame = 0
Query: 1 MYGGPSKLARAGGGAGRGASGKRPPSSFPLPPAHRPSGRLSLGGGGAGSGANPRNRTSTA 60
MYGGPSKLARAGGGAGRGASGKRPPSSFPLPPAHRPSGRLSLGGGGAGSGANPRNRTSTA
Sbjct: 1 MYGGPSKLARAGGGAGRGASGKRPPSSFPLPPAHRPSGRLSLGGGGAGSGANPRNRTSTA 60
Query: 61 AKSEAPLSVEENFSLVTGNNPLAFAMIIRLAPDLIEEIKRVESLGGTPRIKFDANAKNSS 120
AKSEAPLSVEENFSLVTGNNPLAFAMIIRLAPDLIEEIKRVESLGGTPRIKFDANAKNSS
Sbjct: 61 AKSEAPLSVEENFSLVTGNNPLAFAMIIRLAPDLIEEIKRVESLGGTPRIKFDANAKNSS 120
Query: 121 GNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLVESGNAWRKVNVQRILDESTT 180
GNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLVESGNAWRKVNVQRILDESTT
Sbjct: 121 GNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLVESGNAWRKVNVQRILDESTT 180
Query: 181 NHVKKLSEEAERRSKSRRAIVLEPGNPSMKNQIKQLAAAESNPWRMHYKNKKEPPFKKQK 240
NHVKKLSEEAERRSKSRRAIVLEPGNPSMKNQIKQLAAAESNPWRMHYKNKKEPPFKKQK
Sbjct: 181 NHVKKLSEEAERRSKSRRAIVLEPGNPSMKNQIKQLAAAESNPWRMHYKNKKEPPFKKQK 240
Query: 241 NELSQVGPPKSTFKPGMSSVPASKERLSSSPVPSPPEQSGAPISQFGSANPTKTHCIAED 300
NELSQVGPPKSTFKPGMSSVPASKERLSSSPVPSPPEQSGAPISQFGSANPTKTHCIAED
Sbjct: 241 NELSQVGPPKSTFKPGMSSVPASKERLSSSPVPSPPEQSGAPISQFGSANPTKTHCIAED 300
Query: 301 IKPRQPAKINAAASSEKEIPTKAAKGVLEAPGQEVNAGAKPTDLQGMLYNLLLDNPKGMS 360
IKPRQPAKINAAASSEKEIPTKAAKGVLEAPGQEVNAGAKPTDLQGMLYNLLLDNPKGMS
Sbjct: 301 IKPRQPAKINAAASSEKEIPTKAAKGVLEAPGQEVNAGAKPTDLQGMLYNLLLDNPKGMS 360
Query: 361 LKALEKAVGDKIPNSVKKIEPIIKKIATYQAPGRYCLKSEVELEGSKKPSSEGESSPLVS 420
LKALEKAVGDKIPNSVKKIEPIIKKIATYQAPGRYCLKSEVELEGSKKPSSEGESSPLVS
Sbjct: 361 LKALEKAVGDKIPNSVKKIEPIIKKIATYQAPGRYCLKSEVELEGSKKPSSEGESSPLVS 420
Query: 421 HQQTPVHEDFHDQPVPESQLEARHVIELEEKVETSQANKESNFLEKNGIQQNSPDPFAEK 480
HQQTPVHEDFHDQPVPESQLEARHVIELEEKVETSQANKESNFLEKNGIQQNSPDPFAEK
Sbjct: 421 HQQTPVHEDFHDQPVPESQLEARHVIELEEKVETSQANKESNFLEKNGIQQNSPDPFAEK 480
Query: 481 KGSENSEGQAASSSDNESDSDSESDSSDSGSDSGNRSRSRSRSPVGSGSGSSSDSESDAP 540
KGSENSEGQAASSSDNESDSDSESDSSDSGSDSGNRSRSRSRSPVGSGSGSSSDSESDAP
Sbjct: 481 KGSENSEGQAASSSDNESDSDSESDSSDSGSDSGNRSRSRSRSPVGSGSGSSSDSESDAP 540
Query: 541 SNSKEGSDEDVDIMTSDDDKEPKNKLQASVQGFSASPAAWKSPDGGAVLNIDDEKEDGHE 600
SNSKEGSDEDVDIMTSDDDKEPKNKLQASVQGFSASPAAWKSPDGGAVLNIDDEKEDGHE
Sbjct: 541 SNSKEGSDEDVDIMTSDDDKEPKNKLQASVQGFSASPAAWKSPDGGAVLNIDDEKEDGHE 600
Query: 601 SDAIDIEKDSSDDEPEAKIDDRSLPPTGEGGRPVEESRSLSPYPDEFQERQNFIGSLFED 660
SDAIDIEKDSSDDEPEAKIDDRSLPPTGEGGRPVEESRSLSPYPDEFQERQNFIGSLFED
Sbjct: 601 SDAIDIEKDSSDDEPEAKIDDRSLPPTGEGGRPVEESRSLSPYPDEFQERQNFIGSLFED 660
Query: 661 RENTVVESSRHEQSDSTDRISKGKSKRSSELECFEENAVHTKRLKLESSSQQPVSGNWGA 720
RENTVVESSRHEQSDSTDRISKGKSKRSSELECFEENAVHTKRLKLESSSQQPVSGNWGA
Sbjct: 661 RENTVVESSRHEQSDSTDRISKGKSKRSSELECFEENAVHTKRLKLESSSQQPVSGNWGA 720
Query: 721 QLQSSRNLSPSKLNRDSARNPTSQVTNKGELKGNSDFRPKMGNKEIVSEKNCSDVSQASW 780
QLQSSRNLSPSKLNRDSARNPTSQVTNKGELKGNSDFRPKMGNKEIVSEKNCSDVSQASW
Sbjct: 721 QLQSSRNLSPSKLNRDSARNPTSQVTNKGELKGNSDFRPKMGNKEIVSEKNCSDVSQASW 780
Query: 781 RPHDQSGVRAVDTAVRPDKHGESIGRGGKHSEKGGHANESFHAYKDRFYGNVENEGMNEK 840
RPHDQSGVRAVDTAVRPDKHGESIGRGGKHSEKGGHANESFHAYKDRFYGNVENEGMNEK
Sbjct: 781 RPHDQSGVRAVDTAVRPDKHGESIGRGGKHSEKGGHANESFHAYKDRFYGNVENEGMNEK 840
Query: 841 KVSRNSRSGGPGDKQIQPCDSHLSKPGDIVGKFKDGKTFSSSQMGYSPRDNNNRISADRS 900
KVSRNSRSGGPGDKQIQPCDSHLSKPGDIVGKFKDGKTFSSSQMGYSPRDNNNRISADRS
Sbjct: 841 KVSRNSRSGGPGDKQIQPCDSHLSKPGDIVGKFKDGKTFSSSQMGYSPRDNNNRISADRS 900
Query: 901 PVNGKGRILQREHSDLELGELREPFPEEVLGKKKFERNNSSKQLENKGHTSDIWSSELNK 960
PVNGKGRILQREHSDLELGELREPFPEEVLGKKKFERNNSSKQLENKGHTSDIWSSELNK
Sbjct: 901 PVNGKGRILQREHSDLELGELREPFPEEVLGKKKFERNNSSKQLENKGHTSDIWSSELNK 960
Query: 961 GKSNLKASLDNGKRSSPHISTKFPSNPEVSNQKKISEHKVEDLTRVNHRPPQSHPQGPQY 1020
GKSNLKASLDNGKRSSPHISTKFPSNPEVSNQKKISEHKVEDLTRVNHRPPQSHPQGPQY
Sbjct: 961 GKSNLKASLDNGKRSSPHISTKFPSNPEVSNQKKISEHKVEDLTRVNHRPPQSHPQGPQY 1020
Query: 1021 SSRVDHVEVEKPVDANVKPNQGIGPESCGESNRKASVGISQLHDMKREQLPSKKGSKRQA 1080
SSRVDHVEVEKPVDANVKPNQGIGPESCGESNRKASVGISQLHDMKREQLPSKKGSKRQA
Sbjct: 1021 SSRVDHVEVEKPVDANVKPNQGIGPESCGESNRKASVGISQLHDMKREQLPSKKGSKRQA 1080
Query: 1081 PNQITEVTDALKNPISAEHENSDLKRRDSSSDENSCSYSKYEKDEPELKGAIKDFSQYKE 1140
PNQITEVTDALKNPISAEHENSDLKRRDSSSDENSCSYSKYEKDEPELKGAIKDFSQYKE
Sbjct: 1081 PNQITEVTDALKNPISAEHENSDLKRRDSSSDENSCSYSKYEKDEPELKGAIKDFSQYKE 1140
Query: 1141 YVQEYRDKYECYLSLNKILESYRAEFCKLGKELDSSRGQNSDKYFNLLEQLKESYRLCST 1200
YVQEYRDKYECYLSLNKILESYRAEFCKLGKELDSSRGQNSDKYFNLLEQLKESYRLCST
Sbjct: 1141 YVQEYRDKYECYLSLNKILESYRAEFCKLGKELDSSRGQNSDKYFNLLEQLKESYRLCST 1200
Query: 1201 RHKRLKKIFVVLHEELKHLKERIKDFAQTYAKD 1234
RHKRLKKIFVVLHEELKHLKERIKDFAQTYAKD
Sbjct: 1201 RHKRLKKIFVVLHEELKHLKERIKDFAQTYAKD 1233
BLAST of CmoCh01G000740 vs. NCBI nr
Match:
XP_022948916.1 (dentin sialophosphoprotein isoform X2 [Cucurbita moschata])
HSP 1 Score: 2331.2 bits (6040), Expect = 0.0e+00
Identity = 1233/1237 (99.68%), Postives = 1233/1237 (99.68%), Query Frame = 0
Query: 1 MYGGPSKLARAGGGAGRGASGKRPPSSFPLPPAHRPSGRLSLGGGGAGSGANPRNRTSTA 60
MYGGPSKLARAGGGAGRGASGKRPPSSFPLPPAHRPSGRLSLGGGGAGSGANPRNRTSTA
Sbjct: 1 MYGGPSKLARAGGGAGRGASGKRPPSSFPLPPAHRPSGRLSLGGGGAGSGANPRNRTSTA 60
Query: 61 AKSEAPLSVEENFSLVTGNNPLAFAMIIRLAPDLIEEIKRVESLGGTPRIKFDANAKNSS 120
AKSEAPLSVEENFSLVTGNNPLAFAMIIRLAPDLIEEIKRVESLGGTPRIKFDANAKNSS
Sbjct: 61 AKSEAPLSVEENFSLVTGNNPLAFAMIIRLAPDLIEEIKRVESLGGTPRIKFDANAKNSS 120
Query: 121 GNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLVESGNAWRKVNVQRILDESTT 180
GNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLVESGNAWRKVNVQRILDESTT
Sbjct: 121 GNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLVESGNAWRKVNVQRILDESTT 180
Query: 181 NHVKKLSEEAERRSKSRRAIVLEPGNPSMKNQIKQLAAAESNPWRMHYKNKKEPPFKKQK 240
NHVKKLSEEAERRSKSRRAIVLEPGNPSMKNQIKQLAAAESNPWRMHYKNKKEPPFKKQK
Sbjct: 181 NHVKKLSEEAERRSKSRRAIVLEPGNPSMKNQIKQLAAAESNPWRMHYKNKKEPPFKKQK 240
Query: 241 NELSQVGPPKSTFKPGMSSVPASKERLSSSPVPSPPEQSGAPISQFGSANPTKTHCIAED 300
NELSQVGPPKSTFKPGMSSVPASKERLSSSPVPSPPEQSGAPISQFGSANPTKTHCIAED
Sbjct: 241 NELSQVGPPKSTFKPGMSSVPASKERLSSSPVPSPPEQSGAPISQFGSANPTKTHCIAED 300
Query: 301 IKPRQPAKINAAASSEKEIPTKAAKGVLEAPGQEVNAGAKPTDLQGMLYNLLLDNPKGMS 360
IKPRQPAKINAAASSEKEIPTKAAKGVLEAPGQEVNAGAKPTDLQGMLYNLLLDNPKGMS
Sbjct: 301 IKPRQPAKINAAASSEKEIPTKAAKGVLEAPGQEVNAGAKPTDLQGMLYNLLLDNPKGMS 360
Query: 361 LKALEKAVGDKIPNSVKKIEPIIKKIATYQAPGRYCLKSEVELEGSKKPSSEGESSPLVS 420
LKALEKAVGDKIPNSVKKIEPIIKKIATYQAPGRYCLKSEVELEGSKKPSSEGESSPLVS
Sbjct: 361 LKALEKAVGDKIPNSVKKIEPIIKKIATYQAPGRYCLKSEVELEGSKKPSSEGESSPLVS 420
Query: 421 HQQTPVHEDFHDQPVPESQLEARHVIELEEKVETSQANKESNFLEKNGIQQNSPDPFAEK 480
HQQTPVHEDFHDQPVPESQLEARHVIELEEKVETSQANKESNFLEKNGIQQNSPDPFAEK
Sbjct: 421 HQQTPVHEDFHDQPVPESQLEARHVIELEEKVETSQANKESNFLEKNGIQQNSPDPFAEK 480
Query: 481 KGSENSEGQAASSSDNESDSDSESDSSDSGSDSGNRSRSRSRSPVGSGSGSSSDSESDAP 540
KGSENSEGQAASSSDNESDSDSESDSSDSGSDSGNRSRSRSRSPVGSGSGSSSDSESDAP
Sbjct: 481 KGSENSEGQAASSSDNESDSDSESDSSDSGSDSGNRSRSRSRSPVGSGSGSSSDSESDAP 540
Query: 541 SNSKEGSDEDVDIMTSDDDKEPKNKLQASVQGFSASPAAWKSPDGGAVLNIDDEKEDGHE 600
SNSKEGSDEDVDIMTSDDDKEPKNKLQASVQGFSASPAAWKSPDGGAVLNIDDEKEDGHE
Sbjct: 541 SNSKEGSDEDVDIMTSDDDKEPKNKLQASVQGFSASPAAWKSPDGGAVLNIDDEKEDGHE 600
Query: 601 SDAIDIEKDSSDDEPEAKIDDRSLPPTGEGGRPVEESRSLSPYPDEFQERQNFIGSLFED 660
SDAIDIEKDSSDDEPEAKIDDRSLPPTGEGGRPVEESRSLSPYPDEFQERQNFIGSLFED
Sbjct: 601 SDAIDIEKDSSDDEPEAKIDDRSLPPTGEGGRPVEESRSLSPYPDEFQERQNFIGSLFED 660
Query: 661 RENTVVESSRHEQSDSTDRISKGKSKRSSELECFEENAVHTKRLKLESSSQQPVSGNWGA 720
RENTVVESSRHEQSDSTDRISKGKSKRSSELECFEENAVHTKRLKLESSSQQPVSGNWGA
Sbjct: 661 RENTVVESSRHEQSDSTDRISKGKSKRSSELECFEENAVHTKRLKLESSSQQPVSGNWGA 720
Query: 721 QLQSSRNLSPSKLNRDSARNPTSQVTNKGELKGNSDFRPKMGNKEIVSEKNCSDVSQASW 780
QLQSSRNLSPSKLNRDSARNPTSQVTNKGELKGNSDFRPKMGNKEIVSEKNCSDVSQASW
Sbjct: 721 QLQSSRNLSPSKLNRDSARNPTSQVTNKGELKGNSDFRPKMGNKEIVSEKNCSDVSQASW 780
Query: 781 RPHDQSGVRAVDTAVRPDKHGESIGRGGKHSEKGGHANESFHAYKDRFYGNVENEGMNEK 840
RPHDQSGVRAVDTAVRPDKHGESIGRGGKHSEKGGHANESFHAYKDRFYGNVENEGMNEK
Sbjct: 781 RPHDQSGVRAVDTAVRPDKHGESIGRGGKHSEKGGHANESFHAYKDRFYGNVENEGMNEK 840
Query: 841 KVSRNSRSGGPGDKQIQPCDSHLSKPGDIVGKFKDGKTFSSSQMGYSPRDNNNRISADRS 900
KVSRNSRSGGPGDKQIQPCDSHLSKPGDIVGKFKDGKTFSSSQMGYSPRDNNNRISADRS
Sbjct: 841 KVSRNSRSGGPGDKQIQPCDSHLSKPGDIVGKFKDGKTFSSSQMGYSPRDNNNRISADRS 900
Query: 901 PVNGKGRILQREHSDLELGELREPFPEEVLGKKKFERNNSSKQLENKGHTSDIWSSELNK 960
PVNGKGRILQREHSDLELGELREPFPEEVLGKKKFERNNSSKQLENKGHTSDIWSSELNK
Sbjct: 901 PVNGKGRILQREHSDLELGELREPFPEEVLGKKKFERNNSSKQLENKGHTSDIWSSELNK 960
Query: 961 GKSNLKASLDNGKRSSPHISTKFPSNPEVSNQKKISEHKVEDLTRVNHRPPQSHPQGPQY 1020
GKSNLKASLDNGKRSSPHISTKFPSNPEVSNQKKISEHKVEDLTRVNHRPPQSHPQGPQY
Sbjct: 961 GKSNLKASLDNGKRSSPHISTKFPSNPEVSNQKKISEHKVEDLTRVNHRPPQSHPQGPQY 1020
Query: 1021 SSRVDHVEVEKPVDANVKPNQGIGPESCGESNRKASVGISQLHDMKREQLPSKKGSKRQA 1080
SSRVDHVEVEKPVDANVKPNQGIGPESCGESNRKASVGISQLHDMKREQLPSKKGSKRQA
Sbjct: 1021 SSRVDHVEVEKPVDANVKPNQGIGPESCGESNRKASVGISQLHDMKREQLPSKKGSKRQA 1080
Query: 1081 PNQITEVTDALKNPISAEHENSDLKRRDSSSDENSCSYSKYEKDEPELKGAIKDFSQYKE 1140
PNQITEVTDALKNPISAEHENSDLKRRDSSSDENSCSYSKYEKDEPELKGAIKDFSQYKE
Sbjct: 1081 PNQITEVTDALKNPISAEHENSDLKRRDSSSDENSCSYSKYEKDEPELKGAIKDFSQYKE 1140
Query: 1141 YVQEYRDKYECYLSLNKILESYRAEFCKLGKELDSSRGQNSDKYFNLLEQLKESYRLCST 1200
YVQEYRDKYECYLSLNKILESYRAEFCKLGKELDSSRGQNSDKYFNLLEQLKESYRLCST
Sbjct: 1141 YVQEYRDKYECYLSLNKILESYRAEFCKLGKELDSSRGQNSDKYFNLLEQLKESYRLCST 1200
Query: 1201 ----RHKRLKKIFVVLHEELKHLKERIKDFAQTYAKD 1234
RHKRLKKIFVVLHEELKHLKERIKDFAQTYAKD
Sbjct: 1201 SNLQRHKRLKKIFVVLHEELKHLKERIKDFAQTYAKD 1237
BLAST of CmoCh01G000740 vs. NCBI nr
Match:
XP_022948915.1 (dentin sialophosphoprotein isoform X1 [Cucurbita moschata])
HSP 1 Score: 2329.3 bits (6035), Expect = 0.0e+00
Identity = 1233/1242 (99.28%), Postives = 1233/1242 (99.28%), Query Frame = 0
Query: 1 MYGGPSKLARAGGGAGRGASGKRPPSSFPLPPAHRPSGRLSLGGGGAGSGANPRNRTSTA 60
MYGGPSKLARAGGGAGRGASGKRPPSSFPLPPAHRPSGRLSLGGGGAGSGANPRNRTSTA
Sbjct: 1 MYGGPSKLARAGGGAGRGASGKRPPSSFPLPPAHRPSGRLSLGGGGAGSGANPRNRTSTA 60
Query: 61 AKSEAPLSVEENFSLVTGNNPLAFAMIIRLAPDLIEEIKRVESLGGTPRIKFDANAKNSS 120
AKSEAPLSVEENFSLVTGNNPLAFAMIIRLAPDLIEEIKRVESLGGTPRIKFDANAKNSS
Sbjct: 61 AKSEAPLSVEENFSLVTGNNPLAFAMIIRLAPDLIEEIKRVESLGGTPRIKFDANAKNSS 120
Query: 121 GNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLVESGNAWRKVNVQRILDESTT 180
GNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLVESGNAWRKVNVQRILDESTT
Sbjct: 121 GNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLVESGNAWRKVNVQRILDESTT 180
Query: 181 NHVKKLSEEAERRSKSRRAIVLEPGNPSMKNQIKQLAAAESNPWRMHYKNKKEPPFKKQK 240
NHVKKLSEEAERRSKSRRAIVLEPGNPSMKNQIKQLAAAESNPWRMHYKNKKEPPFKKQK
Sbjct: 181 NHVKKLSEEAERRSKSRRAIVLEPGNPSMKNQIKQLAAAESNPWRMHYKNKKEPPFKKQK 240
Query: 241 NELSQVGPPKSTFKPGMSSVPASKERLSSSPVPSPPEQSGAPISQFGSANPTKTHCIAED 300
NELSQVGPPKSTFKPGMSSVPASKERLSSSPVPSPPEQSGAPISQFGSANPTKTHCIAED
Sbjct: 241 NELSQVGPPKSTFKPGMSSVPASKERLSSSPVPSPPEQSGAPISQFGSANPTKTHCIAED 300
Query: 301 IKPRQPAKINAAASSEKEIPTKAAKGVLEAPGQEVNAGAKPTDLQGMLYNLLLDNPKGMS 360
IKPRQPAKINAAASSEKEIPTKAAKGVLEAPGQEVNAGAKPTDLQGMLYNLLLDNPKGMS
Sbjct: 301 IKPRQPAKINAAASSEKEIPTKAAKGVLEAPGQEVNAGAKPTDLQGMLYNLLLDNPKGMS 360
Query: 361 LKALEKAVGDKIPNSVKKIEPIIKKIATYQAPGRYCLKSEVELEGSKKPSSEGESSPLVS 420
LKALEKAVGDKIPNSVKKIEPIIKKIATYQAPGRYCLKSEVELEGSKKPSSEGESSPLVS
Sbjct: 361 LKALEKAVGDKIPNSVKKIEPIIKKIATYQAPGRYCLKSEVELEGSKKPSSEGESSPLVS 420
Query: 421 HQQTPVHEDFHDQPVPESQLEARHVIELEEKVETSQANKESNFLEKNGIQQNSPDPFAEK 480
HQQTPVHEDFHDQPVPESQLEARHVIELEEKVETSQANKESNFLEKNGIQQNSPDPFAEK
Sbjct: 421 HQQTPVHEDFHDQPVPESQLEARHVIELEEKVETSQANKESNFLEKNGIQQNSPDPFAEK 480
Query: 481 KGSENSEGQAASSSDNESDSDSESDSSDSGSDSGNRSRSRSRSPVGSGSGSSSDSESDAP 540
KGSENSEGQAASSSDNESDSDSESDSSDSGSDSGNRSRSRSRSPVGSGSGSSSDSESDAP
Sbjct: 481 KGSENSEGQAASSSDNESDSDSESDSSDSGSDSGNRSRSRSRSPVGSGSGSSSDSESDAP 540
Query: 541 SNSKEGSDEDVDIMTSDDDKEPKNKLQASVQGFSASPAAWKSPDGGAVLNIDDEKEDGHE 600
SNSKEGSDEDVDIMTSDDDKEPKNKLQASVQGFSASPAAWKSPDGGAVLNIDDEKEDGHE
Sbjct: 541 SNSKEGSDEDVDIMTSDDDKEPKNKLQASVQGFSASPAAWKSPDGGAVLNIDDEKEDGHE 600
Query: 601 SDAIDIEKDSSDDEPEAKIDDRSLPPTGEGGRPVEESRSLSPYPDEFQERQNFIGSLFED 660
SDAIDIEKDSSDDEPEAKIDDRSLPPTGEGGRPVEESRSLSPYPDEFQERQNFIGSLFED
Sbjct: 601 SDAIDIEKDSSDDEPEAKIDDRSLPPTGEGGRPVEESRSLSPYPDEFQERQNFIGSLFED 660
Query: 661 RENTVVESSRHEQSDSTDRISKGKSKRSSELECFEENAVHTKRLKLESSSQQPVSGNWGA 720
RENTVVESSRHEQSDSTDRISKGKSKRSSELECFEENAVHTKRLKLESSSQQPVSGNWGA
Sbjct: 661 RENTVVESSRHEQSDSTDRISKGKSKRSSELECFEENAVHTKRLKLESSSQQPVSGNWGA 720
Query: 721 QLQSSRNLSPSKLNRDSARNPTSQVTNKGELKGNSDFRPKMGNKEIVSEKNCSDVSQASW 780
QLQSSRNLSPSKLNRDSARNPTSQVTNKGELKGNSDFRPKMGNKEIVSEKNCSDVSQASW
Sbjct: 721 QLQSSRNLSPSKLNRDSARNPTSQVTNKGELKGNSDFRPKMGNKEIVSEKNCSDVSQASW 780
Query: 781 RPHDQSGVRAVDTAVRPDKHGESIGRGGKHSEKGGHANESFHAYKDRFYGNVENEGMNEK 840
RPHDQSGVRAVDTAVRPDKHGESIGRGGKHSEKGGHANESFHAYKDRFYGNVENEGMNEK
Sbjct: 781 RPHDQSGVRAVDTAVRPDKHGESIGRGGKHSEKGGHANESFHAYKDRFYGNVENEGMNEK 840
Query: 841 KVSRNSRSGGPGDKQIQPCDSHLSKPGDIVGKFKDGKTFSSSQMGYSPRDNNNRISADRS 900
KVSRNSRSGGPGDKQIQPCDSHLSKPGDIVGKFKDGKTFSSSQMGYSPRDNNNRISADRS
Sbjct: 841 KVSRNSRSGGPGDKQIQPCDSHLSKPGDIVGKFKDGKTFSSSQMGYSPRDNNNRISADRS 900
Query: 901 PVNGKGRILQREHSDLELGELREPFPEEVLGKKKFERNNSSKQLENKGHTSDIWSSELNK 960
PVNGKGRILQREHSDLELGELREPFPEEVLGKKKFERNNSSKQLENKGHTSDIWSSELNK
Sbjct: 901 PVNGKGRILQREHSDLELGELREPFPEEVLGKKKFERNNSSKQLENKGHTSDIWSSELNK 960
Query: 961 GKSNLKASLDNGKRSSPHISTKFPSNPEVSNQKKISEHKVEDLTRVNHRPPQSHPQGPQY 1020
GKSNLKASLDNGKRSSPHISTKFPSNPEVSNQKKISEHKVEDLTRVNHRPPQSHPQGPQY
Sbjct: 961 GKSNLKASLDNGKRSSPHISTKFPSNPEVSNQKKISEHKVEDLTRVNHRPPQSHPQGPQY 1020
Query: 1021 SSRVDHVEVEKPVDANVKPNQGIGPESCGESNRKASVGISQLHDMKREQLPSKKGSKRQA 1080
SSRVDHVEVEKPVDANVKPNQGIGPESCGESNRKASVGISQLHDMKREQLPSKKGSKRQA
Sbjct: 1021 SSRVDHVEVEKPVDANVKPNQGIGPESCGESNRKASVGISQLHDMKREQLPSKKGSKRQA 1080
Query: 1081 PNQITEVTDALKNPISAEHENSDLKRRDSSSDENSCSYSKYEKDEPELKGAIKDFSQYKE 1140
PNQITEVTDALKNPISAEHENSDLKRRDSSSDENSCSYSKYEKDEPELKGAIKDFSQYKE
Sbjct: 1081 PNQITEVTDALKNPISAEHENSDLKRRDSSSDENSCSYSKYEKDEPELKGAIKDFSQYKE 1140
Query: 1141 YVQEYRDKYECYLSLNKILESYRAEFCKLGKELDSSRGQNSDKYFNLLEQLKESYRLCST 1200
YVQEYRDKYECYLSLNKILESYRAEFCKLGKELDSSRGQNSDKYFNLLEQLKESYRLCST
Sbjct: 1141 YVQEYRDKYECYLSLNKILESYRAEFCKLGKELDSSRGQNSDKYFNLLEQLKESYRLCST 1200
Query: 1201 ---------RHKRLKKIFVVLHEELKHLKERIKDFAQTYAKD 1234
RHKRLKKIFVVLHEELKHLKERIKDFAQTYAKD
Sbjct: 1201 LTQIKSNLQRHKRLKKIFVVLHEELKHLKERIKDFAQTYAKD 1242
BLAST of CmoCh01G000740 vs. NCBI nr
Match:
KAG6606728.1 (hypothetical protein SDJN03_00070, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 2321.2 bits (6014), Expect = 0.0e+00
Identity = 1223/1233 (99.19%), Postives = 1229/1233 (99.68%), Query Frame = 0
Query: 1 MYGGPSKLARAGGGAGRGASGKRPPSSFPLPPAHRPSGRLSLGGGGAGSGANPRNRTSTA 60
MYGGPSKLARAGGGAGRGASGKRPPSSFPLPPAHRPSGRLSLGGGGAGSGANPRNRTSTA
Sbjct: 1 MYGGPSKLARAGGGAGRGASGKRPPSSFPLPPAHRPSGRLSLGGGGAGSGANPRNRTSTA 60
Query: 61 AKSEAPLSVEENFSLVTGNNPLAFAMIIRLAPDLIEEIKRVESLGGTPRIKFDANAKNSS 120
AKSEAPLSVEENFSLVTGNNPLAFAMIIRLAPDLIEEIKRVES GGTPRIKFDANAKNSS
Sbjct: 61 AKSEAPLSVEENFSLVTGNNPLAFAMIIRLAPDLIEEIKRVESQGGTPRIKFDANAKNSS 120
Query: 121 GNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLVESGNAWRKVNVQRILDESTT 180
GNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLVESGNAWRKVNVQRILDESTT
Sbjct: 121 GNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLVESGNAWRKVNVQRILDESTT 180
Query: 181 NHVKKLSEEAERRSKSRRAIVLEPGNPSMKNQIKQLAAAESNPWRMHYKNKKEPPFKKQK 240
NHVKKLSEEAERRSKSRRAIVLEPGNPSMKNQIKQLAAAESNPWRMHYKNKKEPPFKKQK
Sbjct: 181 NHVKKLSEEAERRSKSRRAIVLEPGNPSMKNQIKQLAAAESNPWRMHYKNKKEPPFKKQK 240
Query: 241 NELSQVGPPKSTFKPGMSSVPASKERLSSSPVPSPPEQSGAPISQFGSANPTKTHCIAED 300
NELSQVGPPKSTFKPGMSSVPASKERLSSSPVPSPPEQSGAPISQFGSANPTKTHCIAED
Sbjct: 241 NELSQVGPPKSTFKPGMSSVPASKERLSSSPVPSPPEQSGAPISQFGSANPTKTHCIAED 300
Query: 301 IKPRQPAKINAAASSEKEIPTKAAKGVLEAPGQEVNAGAKPTDLQGMLYNLLLDNPKGMS 360
IKPRQPAKINAAASSEKEIPTKAAKGVLEAPGQEVNAGAKPTDLQGMLYNLLLDNPKGMS
Sbjct: 301 IKPRQPAKINAAASSEKEIPTKAAKGVLEAPGQEVNAGAKPTDLQGMLYNLLLDNPKGMS 360
Query: 361 LKALEKAVGDKIPNSVKKIEPIIKKIATYQAPGRYCLKSEVELEGSKKPSSEGESSPLVS 420
LKALEKAVGDKIPNSVKKIEPIIKKIATYQAPGRYCLKSEVELEGSKKPSSEGESSPLVS
Sbjct: 361 LKALEKAVGDKIPNSVKKIEPIIKKIATYQAPGRYCLKSEVELEGSKKPSSEGESSPLVS 420
Query: 421 HQQTPVHEDFHDQPVPESQLEARHVIELEEKVETSQANKESNFLEKNGIQQNSPDPFAEK 480
HQQTPVHEDFHDQPVPESQLEARHVIELEEKVETSQANKESNFLEKNGIQQNSPDPFAEK
Sbjct: 421 HQQTPVHEDFHDQPVPESQLEARHVIELEEKVETSQANKESNFLEKNGIQQNSPDPFAEK 480
Query: 481 KGSENSEGQAASSSDNESDSDSESDSSDSGSDSGNRSRSRSRSPVGSGSGSSSDSESDAP 540
KGSENSEGQAASSSDNESDSDSESDSSDSGSDSGNRSRSRSRSPVGSGSGSSSDSESDAP
Sbjct: 481 KGSENSEGQAASSSDNESDSDSESDSSDSGSDSGNRSRSRSRSPVGSGSGSSSDSESDAP 540
Query: 541 SNSKEGSDEDVDIMTSDDDKEPKNKLQASVQGFSASPAAWKSPDGGAVLNIDDEKEDGHE 600
SNSKEGSDEDVDIMTSDDDKEPKNKLQASVQGFSASPA WKSPDGGAVLNIDDEKEDGHE
Sbjct: 541 SNSKEGSDEDVDIMTSDDDKEPKNKLQASVQGFSASPAPWKSPDGGAVLNIDDEKEDGHE 600
Query: 601 SDAIDIEKDSSDDEPEAKIDDRSLPPTGEGGRPVEESRSLSPYPDEFQERQNFIGSLFED 660
SDAIDIEKDSSDDEPEAKIDDRSLPPTGEGGRPVEESRSLSPYPDEFQERQNFIGSLFED
Sbjct: 601 SDAIDIEKDSSDDEPEAKIDDRSLPPTGEGGRPVEESRSLSPYPDEFQERQNFIGSLFED 660
Query: 661 RENTVVESSRHEQSDSTDRISKGKSKRSSELECFEENAVHTKRLKLESSSQQPVSGNWGA 720
RENTVV+S+RHEQSDSTDRISKGKSKRSSELECFEENAVHTKRLKLESSSQQPVSGNWGA
Sbjct: 661 RENTVVDSARHEQSDSTDRISKGKSKRSSELECFEENAVHTKRLKLESSSQQPVSGNWGA 720
Query: 721 QLQSSRNLSPSKLNRDSARNPTSQVTNKGELKGNSDFRPKMGNKEIVSEKNCSDVSQASW 780
QLQSSRNLSPSKLNRDS+RNPTSQVTNKGELKGNSDFRPKMGNKEIVSEKNCSDVSQASW
Sbjct: 721 QLQSSRNLSPSKLNRDSSRNPTSQVTNKGELKGNSDFRPKMGNKEIVSEKNCSDVSQASW 780
Query: 781 RPHDQSGVRAVDTAVRPDKHGESIGRGGKHSEKGGHANESFHAYKDRFYGNVENEGMNEK 840
RPHDQSGVRAVDTAVRPDKHGESIGRGGKHSEKGGHANESFHAYKDRFYGNVENEGMNEK
Sbjct: 781 RPHDQSGVRAVDTAVRPDKHGESIGRGGKHSEKGGHANESFHAYKDRFYGNVENEGMNEK 840
Query: 841 KVSRNSRSGGPGDKQIQPCDSHLSKPGDIVGKFKDGKTFSSSQMGYSPRDNNNRISADRS 900
KVSRNSRSGGPGDKQIQPCDSHLSKPGDIVGKFKDGKTFSSSQMGYSPRDNNNRISADRS
Sbjct: 841 KVSRNSRSGGPGDKQIQPCDSHLSKPGDIVGKFKDGKTFSSSQMGYSPRDNNNRISADRS 900
Query: 901 PVNGKGRILQREHSDLELGELREPFPEEVLGKKKFERNNSSKQLENKGHTSDIWSSELNK 960
PVNGKGRILQREHSDLELGELREPFPEE LGKKKFERNNSSKQLENKGHTSDIWSSELNK
Sbjct: 901 PVNGKGRILQREHSDLELGELREPFPEEALGKKKFERNNSSKQLENKGHTSDIWSSELNK 960
Query: 961 GKSNLKASLDNGKRSSPHISTKFPSNPEVSNQKKISEHKVEDLTRVNHRPPQSHPQGPQY 1020
GKSNLKASLDNGKRSSPHISTKFPSNPEVSN+KKISEHKVEDLTRVNHRPPQSHPQGPQY
Sbjct: 961 GKSNLKASLDNGKRSSPHISTKFPSNPEVSNKKKISEHKVEDLTRVNHRPPQSHPQGPQY 1020
Query: 1021 SSRVDHVEVEKPVDANVKPNQGIGPESCGESNRKASVGISQLHDMKREQLPSKKGSKRQA 1080
SSRVDHVEVEKPVDANVKPNQGIGPESCGESNRKASVGISQLHDMKREQLPSKKGSKRQA
Sbjct: 1021 SSRVDHVEVEKPVDANVKPNQGIGPESCGESNRKASVGISQLHDMKREQLPSKKGSKRQA 1080
Query: 1081 PNQITEVTDALKNPISAEHENSDLKRRDSSSDENSCSYSKYEKDEPELKGAIKDFSQYKE 1140
PNQITEVTDALKNPISAEHENSDLKRRDSSSDENSCSYSKYEKDEPELKGAIKDFSQYKE
Sbjct: 1081 PNQITEVTDALKNPISAEHENSDLKRRDSSSDENSCSYSKYEKDEPELKGAIKDFSQYKE 1140
Query: 1141 YVQEYRDKYECYLSLNKILESYRAEFCKLGKELDSSRGQNSDKYFNLLEQLKESYRLCST 1200
YVQEYRDKYECYLSLNKILESYRAEFCKLGKELDSSRGQ+SDKYFNLL QLKESYRLCST
Sbjct: 1141 YVQEYRDKYECYLSLNKILESYRAEFCKLGKELDSSRGQDSDKYFNLLGQLKESYRLCST 1200
Query: 1201 RHKRLKKIFVVLHEELKHLKERIKDFAQTYAKD 1234
RHKRLKKIF+VLHEELKHLKERIKDFAQTYAKD
Sbjct: 1201 RHKRLKKIFIVLHEELKHLKERIKDFAQTYAKD 1233
BLAST of CmoCh01G000740 vs. NCBI nr
Match:
XP_023524753.1 (dentin sialophosphoprotein-like isoform X2 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 2303.5 bits (5968), Expect = 0.0e+00
Identity = 1214/1233 (98.46%), Postives = 1223/1233 (99.19%), Query Frame = 0
Query: 1 MYGGPSKLARAGGGAGRGASGKRPPSSFPLPPAHRPSGRLSLGGGGAGSGANPRNRTSTA 60
MYGGPSKLARAGGGAGRGASGKRPPSSFPLPPAHRPSGRLSLGGGGAGSGANPRNRTS A
Sbjct: 1 MYGGPSKLARAGGGAGRGASGKRPPSSFPLPPAHRPSGRLSLGGGGAGSGANPRNRTSAA 60
Query: 61 AKSEAPLSVEENFSLVTGNNPLAFAMIIRLAPDLIEEIKRVESLGGTPRIKFDANAKNSS 120
AKSEAP SVEENFSLVTGNNPLAFAMIIRLAPDLIEEIKRVES GGTPRIKFDANAKNSS
Sbjct: 61 AKSEAPQSVEENFSLVTGNNPLAFAMIIRLAPDLIEEIKRVESQGGTPRIKFDANAKNSS 120
Query: 121 GNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLVESGNAWRKVNVQRILDESTT 180
GNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLVESGNAWRKVNVQRILDESTT
Sbjct: 121 GNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLVESGNAWRKVNVQRILDESTT 180
Query: 181 NHVKKLSEEAERRSKSRRAIVLEPGNPSMKNQIKQLAAAESNPWRMHYKNKKEPPFKKQK 240
NHVKKLSEEAERRSKSRRAIVLEPGNPSMKNQIKQLAAAESNPWRMHYKNKKEPPFKKQK
Sbjct: 181 NHVKKLSEEAERRSKSRRAIVLEPGNPSMKNQIKQLAAAESNPWRMHYKNKKEPPFKKQK 240
Query: 241 NELSQVGPPKSTFKPGMSSVPASKERLSSSPVPSPPEQSGAPISQFGSANPTKTHCIAED 300
NELSQVGPPKSTFKPGMSSVPASKERLSSSPVPSPPEQSGAPISQFGSANPTKTHC AED
Sbjct: 241 NELSQVGPPKSTFKPGMSSVPASKERLSSSPVPSPPEQSGAPISQFGSANPTKTHCTAED 300
Query: 301 IKPRQPAKINAAASSEKEIPTKAAKGVLEAPGQEVNAGAKPTDLQGMLYNLLLDNPKGMS 360
IKPRQPAKINAAASSEK+IPTKAAKGVLEAPGQEVNAGAKPTDLQGMLYNLLLDNPKGMS
Sbjct: 301 IKPRQPAKINAAASSEKDIPTKAAKGVLEAPGQEVNAGAKPTDLQGMLYNLLLDNPKGMS 360
Query: 361 LKALEKAVGDKIPNSVKKIEPIIKKIATYQAPGRYCLKSEVELEGSKKPSSEGESSPLVS 420
LKALEKAVGDKIPNSVKKIEPIIKKIATYQAPGRYCLKSEVE+EGSKKPSSEGESSPLVS
Sbjct: 361 LKALEKAVGDKIPNSVKKIEPIIKKIATYQAPGRYCLKSEVEVEGSKKPSSEGESSPLVS 420
Query: 421 HQQTPVHEDFHDQPVPESQLEARHVIELEEKVETSQANKESNFLEKNGIQQNSPDPFAEK 480
HQQTPVHEDFHDQPVPESQLEARHVIELEEKVETSQANKESNFLEKNGIQQNSPDPFAEK
Sbjct: 421 HQQTPVHEDFHDQPVPESQLEARHVIELEEKVETSQANKESNFLEKNGIQQNSPDPFAEK 480
Query: 481 KGSENSEGQAASSSDNESDSDSESDSSDSGSDSGNRSRSRSRSPVGSGSGSSSDSESDAP 540
KGSENSEGQAASSSDNESDSDSESDSSDSGSDSGNRSRSRSRSPVGSGSGSSSDSESDAP
Sbjct: 481 KGSENSEGQAASSSDNESDSDSESDSSDSGSDSGNRSRSRSRSPVGSGSGSSSDSESDAP 540
Query: 541 SNSKEGSDEDVDIMTSDDDKEPKNKLQASVQGFSASPAAWKSPDGGAVLNIDDEKEDGHE 600
SNSKEGSDEDVDIMTSDDDKEPKNKLQASVQGFSASPAAWKSPDGGAVLNIDDEKEDGHE
Sbjct: 541 SNSKEGSDEDVDIMTSDDDKEPKNKLQASVQGFSASPAAWKSPDGGAVLNIDDEKEDGHE 600
Query: 601 SDAIDIEKDSSDDEPEAKIDDRSLPPTGEGGRPVEESRSLSPYPDEFQERQNFIGSLFED 660
SDAIDIEKDSSDDEPEAKIDDRSLPPT EGGRPVEESRSLSPYPDEFQERQNFIGSLFED
Sbjct: 601 SDAIDIEKDSSDDEPEAKIDDRSLPPTVEGGRPVEESRSLSPYPDEFQERQNFIGSLFED 660
Query: 661 RENTVVESSRHEQSDSTDRISKGKSKRSSELECFEENAVHTKRLKLESSSQQPVSGNWGA 720
RENTVV+S+RHEQSDSTDR+SKGKSKRSSELECFEENAVHTKRLKLESSSQQPVSGNWGA
Sbjct: 661 RENTVVDSARHEQSDSTDRMSKGKSKRSSELECFEENAVHTKRLKLESSSQQPVSGNWGA 720
Query: 721 QLQSSRNLSPSKLNRDSARNPTSQVTNKGELKGNSDFRPKMGNKEIVSEKNCSDVSQASW 780
QLQSSRNLSPSKLNRDSARNPTSQVTNKGELKGNSDFRPKMGNKEIVSEKNCSDVSQASW
Sbjct: 721 QLQSSRNLSPSKLNRDSARNPTSQVTNKGELKGNSDFRPKMGNKEIVSEKNCSDVSQASW 780
Query: 781 RPHDQSGVRAVDTAVRPDKHGESIGRGGKHSEKGGHANESFHAYKDRFYGNVENEGMNEK 840
RPHDQSGVRAVDTAVRPDKHGESIGRGGKHSEKGGHANESFHAYKDRFYGN ENEGMNEK
Sbjct: 781 RPHDQSGVRAVDTAVRPDKHGESIGRGGKHSEKGGHANESFHAYKDRFYGNAENEGMNEK 840
Query: 841 KVSRNSRSGGPGDKQIQPCDSHLSKPGDIVGKFKDGKTFSSSQMGYSPRDNNNRISADRS 900
KVSRNSRSGGPGDKQIQPCDSHLSKPGDIVGKFKDGKTFSSSQMGYSPRDNNNRISADRS
Sbjct: 841 KVSRNSRSGGPGDKQIQPCDSHLSKPGDIVGKFKDGKTFSSSQMGYSPRDNNNRISADRS 900
Query: 901 PVNGKGRILQREHSDLELGELREPFPEEVLGKKKFERNNSSKQLENKGHTSDIWSSELNK 960
PVNGKGRILQREHSDLELGELREPFPEE LGKKKFERNNSSKQLENK HTSDIWSSELNK
Sbjct: 901 PVNGKGRILQREHSDLELGELREPFPEEALGKKKFERNNSSKQLENKEHTSDIWSSELNK 960
Query: 961 GKSNLKASLDNGKRSSPHISTKFPSNPEVSNQKKISEHKVEDLTRVNHRPPQSHPQGPQY 1020
GKSNLKASLDNGKRSSPHISTKFPSNPE SN+KKISEHKVEDLTR+NHRPPQSHPQGPQY
Sbjct: 961 GKSNLKASLDNGKRSSPHISTKFPSNPEGSNKKKISEHKVEDLTRINHRPPQSHPQGPQY 1020
Query: 1021 SSRVDHVEVEKPVDANVKPNQGIGPESCGESNRKASVGISQLHDMKREQLPSKKGSKRQA 1080
SSRVDHVEVEKPVDANVKPNQGIGPESCGESNRKASVGISQLHDMKREQLPSKKGSKRQA
Sbjct: 1021 SSRVDHVEVEKPVDANVKPNQGIGPESCGESNRKASVGISQLHDMKREQLPSKKGSKRQA 1080
Query: 1081 PNQITEVTDALKNPISAEHENSDLKRRDSSSDENSCSYSKYEKDEPELKGAIKDFSQYKE 1140
PNQITEVTDALKNPISAEHENSDLKRRDSSSDENSCSYSKYEKDEPELKGAIKDFSQYKE
Sbjct: 1081 PNQITEVTDALKNPISAEHENSDLKRRDSSSDENSCSYSKYEKDEPELKGAIKDFSQYKE 1140
Query: 1141 YVQEYRDKYECYLSLNKILESYRAEFCKLGKELDSSRGQNSDKYFNLLEQLKESYRLCST 1200
YVQEYRDKYECYLSLNKILESYRAEFCKLGKELDSSRGQ+SDKYFNLL QLKESYRLCST
Sbjct: 1141 YVQEYRDKYECYLSLNKILESYRAEFCKLGKELDSSRGQDSDKYFNLLGQLKESYRLCST 1200
Query: 1201 RHKRLKKIFVVLHEELKHLKERIKDFAQTYAKD 1234
RHKRLKKIF+VLHEELKHLKERIKDFAQTYAKD
Sbjct: 1201 RHKRLKKIFIVLHEELKHLKERIKDFAQTYAKD 1233
BLAST of CmoCh01G000740 vs. TAIR 10
Match:
AT3G21290.1 (dentin sialophosphoprotein-related )
HSP 1 Score: 604.4 bits (1557), Expect = 2.1e-172
Identity = 518/1291 (40.12%), Postives = 708/1291 (54.84%), Query Frame = 0
Query: 1 MYGGPSKLARAGGGAGRGASGKRPPSSFPLPPAHR---PSGRLSLGGGGAGSGANPRNRT 60
M+ G SK GG G G+ R +SFP PP +R P GR+S GGGG GS A +
Sbjct: 1 MFKGSSKRGGRGGSGGGGSGPSRNRNSFP-PPTNRHPSPIGRMSSGGGGGGSAAPRQRSN 60
Query: 61 STAAKSEAPL-----SVEENFSLVTGNNPLAFAMIIRLAPDLIEEIKRVESLGGTPRIKF 120
ST+ K+ A +VEE F+LV + AF MIIRL+PDL++EIKRVE+ GG +IKF
Sbjct: 61 STSVKAAASTTVSSRTVEETFNLVPRESSSAFGMIIRLSPDLVDEIKRVEAQGGAAKIKF 120
Query: 121 DANAKNSSGNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLVESGNAWRKVNVQ 180
DA NS+ N+I+VGGKEF+FTWS E G+LCDIYEE +SGEDG+GLL+E+G AWRK+NV
Sbjct: 121 DAFPNNSTENIINVGGKEFKFTWSGEKGELCDIYEEHQSGEDGNGLLIEAGCAWRKLNVL 180
Query: 181 RILDESTTNHVKKLSEEAERRSKSRRAIVLEPGNPSMKNQIKQLAAAESNPWRMHYKNKK 240
R LDESTT+H+K S EAE+R+KSR+AIVL+PGNPS+ KQLA AE +PWRM K KK
Sbjct: 181 RTLDESTTSHMKMRSVEAEQRTKSRKAIVLDPGNPSV---TKQLAHAEGSPWRMSNKQKK 240
Query: 241 EPPFKKQKNELS--QVGPPKSTFKPGMSSVPASKERLSSSPVPSPPEQSGAPISQFGSAN 300
EPP KK+K + VG PK +F+PG +S P K RLS+SP PSP Q P +G N
Sbjct: 241 EPPPKKRKVDPPPVPVGGPKPSFRPG-ASTPTMKNRLSASPGPSPSNQYNTP--PYGIGN 300
Query: 301 PTKTHCIAEDIKPRQ-PAKINAAASSEKEIPTKAAKGVL-EAPGQEVNAGAKPTDLQGML 360
KTH E++ P Q ++N EKE P+ VL + G+E K DLQ +L
Sbjct: 301 MAKTHAANENVTPVQTKGRVNMI---EKE-PSAWKNNVLRDTSGREAINVNKEIDLQSLL 360
Query: 361 YNLLLDNPKGMSLKALEKAVGDKIPNSVKKIEPIIKKIATYQAPGRYCLKSEVELEGSKK 420
++L + P MSLKALEKAVGDK+PN KKIEPI+K+IA +QAP RY LK E ELE KK
Sbjct: 361 VDILKEAP--MSLKALEKAVGDKVPNPAKKIEPILKRIANFQAP-RYFLKPEAELESYKK 420
Query: 421 PSSEGESSPLVSHQQ-TPVHEDFHDQ-PVPESQLEARHVIELEEKVETSQANKESNFLEK 480
S + SSP HQQ PV E DQ PVP + + E+ E S + +E+
Sbjct: 421 HSPDSGSSP--EHQQLLPVTECSRDQLPVPGRNNTEKFSL-CEQNGEGSLDCLPVHLVEQ 480
Query: 481 NGIQQN------SPDPFAEKKGSENSEGQAASSSDNESDSDSESDSSDSGSDSGNRSRSR 540
Q+N SP F E+K SEN E QA SS SDSDS+SD+SDSGSD
Sbjct: 481 LSTQENVDIEHHSPGIFHEEKRSENREAQARSS----SDSDSDSDNSDSGSD-------- 540
Query: 541 SRSPVGSGSGSSSDSESDAPSNSKEGSDEDVDIMTSDDDKEPKNKLQASVQ------GFS 600
S+S GS SGSSSDSE A SNSK+GSDEDVDIM SD D+EP Q+ Q G
Sbjct: 541 SKSAAGSDSGSSSDSE--ASSNSKDGSDEDVDIM-SDGDREPLLTTQSLEQDAIDLPGHG 600
Query: 601 ASPAAWKSPDGGAVLNIDDEKE-----DGHESDAIDIEKDSSDDEPEAKIDDR------- 660
+S + + AV +ID DGH SD +D+E +SSD+ + D +
Sbjct: 601 SSAVEIEGHNSDAV-DIDGHDSDAVDIDGHGSDTVDVEGNSSDEGHGSDADRKKNSDNNW 660
Query: 661 ------SLPPTGEGGRPVEESRSLSPYPDEFQERQNFIGSLFEDRENTVVESSRHEQSDS 720
PT G + + D +ERQNFIG LF+D ENT + ++++ D
Sbjct: 661 KMETTTGTSPTANGEVGISGQEHFTSGHDNLRERQNFIGQLFDDTENTTKNNFKNDKRDI 720
Query: 721 TDRISKGKSKRSSELECFEENAVHTKRLKLESSSQQPVSGNWGAQLQSSRNLSPSKLNRD 780
++R+ K +++++ + E + + + H K K +S +Q S +++D
Sbjct: 721 SERLGKDQNQKALDFEHYSQKSAHEKNRKSQSCNQL------------------SAVSKD 780
Query: 781 SARNPTSQVTNKGELKGNSDFRPKMGNKEIVSEKNCSDVSQASWRPHDQSGVRAVDTAVR 840
S + ELK +++ R ++ I + S H +S
Sbjct: 781 SQHS---------ELKYDAELRNASASQTIDPLRGLLKSSIEKSNRHGKS---------- 840
Query: 841 PDKHGESIGRGGKHSEKGGH------ANESFHAYKDRFYGNVENEGMNEKKVSRNSRSGG 900
+KH +++G K S+KG H ++ S A++D N ++ + K RN + G
Sbjct: 841 -NKHSDALGNVRK-SDKGDHFPLEMLSSRSGKAFRD----NQRDDVHLKNKFPRNKKDGE 900
Query: 901 PGDKQIQPCDSHLSKPGDIVGKFKDGKTFSSSQMGYSPRDNNNRISADRSPVNGKGRILQ 960
+ P ++ KP ++ G KD K S +G SP D+ A + P G G +LQ
Sbjct: 901 SAIRPSLPTETSDRKPDELDGSDKDPKNVSGLSIGSSPLDSQRTYLA-KLP-KGNGPVLQ 960
Query: 961 REHSDLELGELREPFPEEVLGKKKFERNNSSKQLENKGHTSDIWSSELNKGKSNLKASLD 1020
++ S+LELGEL EP E+ K E S +Q K TS+ K +D
Sbjct: 961 KQVSELELGELPEPLGEDT-ALKPIEEKTSFRQSNLKPSTSE-------------KLGID 1020
Query: 1021 NGKRSSPHISTKFPSNPEVSNQKKISEHKVEDLTRVNHRPPQSHPQG-----PQYSSRVD 1080
+ KR S +K + P N +EH VED R QSH Q + SS+
Sbjct: 1021 SDKRRSKKSDSKKAAPPHTVNGS--NEHVVEDSERSQKWALQSHGQNLTGTDTEISSQNK 1080
Query: 1081 HVE--VEKPVDANVKPNQGIGPESCGESNRKASVGISQLHDMKREQLP-SKKGSKRQAPN 1140
++E K + + G E GE+N+K V H KR S + SKR +
Sbjct: 1081 NLEDAAYKSRQKDSRARVGNSVEGYGETNKKTPV---VKHGSKRASTSRSSRESKRH--S 1140
Query: 1141 QITEVTDALKNPISAEHENSDLKRRDSSSDENSCSYSKYEKDEPELKGAIKDFSQYKEYV 1200
++ + K+ S + +++ +S E SY KYEK PELKG I D QYK Y+
Sbjct: 1141 SVSNSINGHKDATSIPGGSVVREKQMTSFGEEDSSYLKYEKASPELKGPISDHLQYKAYM 1192
Query: 1201 QEYRDKYECYLSLNKILESYRAEFCKLGKELDSSRGQNSDKYFNLLEQLKESYRLCSTRH 1234
QEY DKY+ Y S+NKILES+R +F KLG++L ++G++ ++Y ++EQ+KESY RH
Sbjct: 1201 QEYNDKYDSYHSINKILESHRNDFQKLGQDLGFAKGRDVERYNKIVEQIKESYCKYGERH 1192
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A6J1GAK2 | 0.0e+00 | 100.00 | dentin sialophosphoprotein isoform X3 OS=Cucurbita moschata OX=3662 GN=LOC111452... | [more] |
A0A6J1GAJ0 | 0.0e+00 | 99.68 | dentin sialophosphoprotein isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111452... | [more] |
A0A6J1GB91 | 0.0e+00 | 99.28 | dentin sialophosphoprotein isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111452... | [more] |
A0A6J1KCU5 | 0.0e+00 | 97.65 | dentin sialophosphoprotein isoform X3 OS=Cucurbita maxima OX=3661 GN=LOC11149271... | [more] |
A0A6J1KF98 | 0.0e+00 | 97.33 | dentin sialophosphoprotein isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC11149271... | [more] |
Match Name | E-value | Identity | Description | |
XP_022948917.1 | 0.0e+00 | 100.00 | dentin sialophosphoprotein isoform X3 [Cucurbita moschata] | [more] |
XP_022948916.1 | 0.0e+00 | 99.68 | dentin sialophosphoprotein isoform X2 [Cucurbita moschata] | [more] |
XP_022948915.1 | 0.0e+00 | 99.28 | dentin sialophosphoprotein isoform X1 [Cucurbita moschata] | [more] |
KAG6606728.1 | 0.0e+00 | 99.19 | hypothetical protein SDJN03_00070, partial [Cucurbita argyrosperma subsp. sorori... | [more] |
XP_023524753.1 | 0.0e+00 | 98.46 | dentin sialophosphoprotein-like isoform X2 [Cucurbita pepo subsp. pepo] | [more] |
Match Name | E-value | Identity | Description | |
AT3G21290.1 | 2.1e-172 | 40.12 | dentin sialophosphoprotein-related | [more] |