Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: polypeptideexonCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGAAGGTTGAGAAGACGGTGTGGCAGTGGCTTTCCATTGCCGAGCTTAGAAAAGCTTCAGAAGGGTTTCAGTGGCGGACTAGGGAATTGAATTTGACCAAGTCATTTGAGGAGATTTGATCATTCATGGCTGAAGGGCCATTGTCATGGTTCGTTTGAGTTTGACACGGAACAGAGGTCGGACAAATGATTTTGTATGGCAATGGAATTGGTCAAATCGAACTGAATATTGCCGCCAAACGTTGCTGCTTAGTACTTTAATTGTATCCTAAACTTTTAATGTGTCCACTATTAGAACTAAATAATAATAATAATAATAAGTTTGAATTATAAAATTAAAGAAAAAAATACCTTCAACCTTGAAATTTATTTTAAAAAGTTCTATATTTCTAAAAGTTGTGATATTACTCCTCAATTTTTACCAAAGTACTTAAATACATAATTTGACGTACATGCAATAACTTCTAACTTCATATGATTTCTTCTTTTTTTTTTTTTTTTTCGGTATAAAATCTATTTCTTGAATATATGCCCAATATTAGAACTTAGAAGTTAGTTAGCGGAAGTACCAATACGAAGGCCGTATTTTGTTATTTTTTATTTTTTGAAACTATATTAAGGCCGTATTGGGCTTTATAGATAGGCCCATAAAATTTGAAAGTATGATAAAACATGAATCCGAAATATAGCCTAAGAAAATCACAAAACAGACCCGATAATACATAACTCGTTGAAATTGGTAAATAGAGGGACTCGGAATTGTTCGTGTGTCTCTTCGTCTGGGTCCAAAACCTGGTTCGACCACCTTCCGTCGTCTTCTTCTCTCTCTTCCATCACAATGTATGGCGGCCCATCCAAGCTCGGCCGACCCGGCGGCGGCGCGGGACGGGGAGCCGGAGGAAAGCGCCCACACTCTTCTTTTCCTCTACCACCTTCTCACCGCCCCTCCGGCCGTCTCTCTCTTGGCGGCGGTGGCGCCGGTTCTGTCGCCAATCCTCGGAATCGAACCACCACCGCAACCACATCCGAAGCCCCTCAATCCGTCGAGGAGAATTTCAGTCTCGTTACTGGTAACAACCCCTTGGCTTTTGCCATGATAATTCGGCTGGCTCCGGATTTGATCGATGAGATCAAGCGGGTCGAGGCGCAGGGCGGGACTCCCAGGATTAAGTTTGATGCCAACGCCAAGAATTCTAGTGGTAATGTAAGTTTTCTATAATACTTGTTCTTTAGTTACTCATCCAAATCATGTGATAATGAAGATTTCTATGCTCTTGTGTATTTTCTTTTAGATGATTTGAACTTGTGTCAAGTACTTCAACTATGCTGTTGGATGTGTACACAAGTAGGTTGTACAGATTATGCTTTTGTTCACTATTTGTTGGATTTCAATGGGGGCAGTTACTTTGACACTAAGTTGATGGGTTTTATTTGATTTGTGGAATCGCTTTGTGCAACTGCTACTTCATTGCATATATTTTGGTCTTTATGATCCTTATGCAGTGGAATTTGTACTCAATTTGAGTATTGCTTGATTATTTATTCTTTTGTAGCATTATTGTGTTTACATTTTTTATTGGGTGCCAGTTAGCTGTTTTCTGAATTGGTGAATACGAATACCATAAGGTCAAGGATTAGATTACACAATTGAGTAGCAACATGGAAATAAGTATGAATTGGTGCCAGTTAGCTGTTTTCTAAATAAGTATTTGTTTGACAAATATCAGTAGGAATACTATAATCAGATTATATCATCAGTGCTTTTACGTGTCTTTGATGCATATACGGTTTTGTTGAAGGGTATACTGGAGTTCTTGGTCCACTGACAATTGGGGCATAGCCACGAAAACTTTTTTAAGCTGTTGGTATACACTTGATAATACTTGCTGCACTTGTAGCCTAAAACTTGAGCTTGTTTTTAGACATCATAATTTAGTGATTCAAGGAAGTGCTGCCACTGTTGGGGCCTTGGTTTTAGTCTTAGAATGTGCACAATATGGTTATAAAGAATATGCAATTTATGATGGATGTGGAGTAAATTTTAGTAGGAAGTATTCAAGAGAGACAAGTTGAAAGGGATAACTCTAATTCTCTCTCTGCTTATTGGGGAGCTTTATCTTGATAAAGTTACTATCTCAAGGACTTGATGAAATTCCCTATCATATTCTCTGGTGTTGTGAGTTTGCAAGAGTTGTGTGAGATTTTTTATTTCAAACGTTTGGGTTGTCGCTTGCTTGGCATAGGGTTTGTAGTGATATGATTCGGGATTTCCTCCTCCATCTGCCTTTTCATGAGAAGGGGTGCTTCTTATGGATGGTTGGGGTGTGTGCTATTCTTTGAGTTCTATGGGAGGAGCGGAATAATAGGTTGTTTAGGAGGTTTGAGAGGGGATCTTTTGAGGTTTAGTCTATTGTTAGATATCGTGTTTCTATTTGGGCTTCGGTTGAAGACTTTTTGTAACTACTATTTAGGCTTTATTTTGTATAATTGGAGTCCCTTTTTTTAGAGGGAATCCTTTTTCTTTTGGTAGGTTTGGTTTTTGTTTGCCTCTATATTCTTTCATTTTATCTCAAAGAAAGTTGTTTCTATTAAAAAAATGAAGCAAGAATTATTTATTTATTTTACTATTATATTTTAAAATTTTGCTTTACGCTGATAAATTCATGGTGTAATTTGAAAAGCTGTCCTGTAAGAAAACATACGGAGGGGTGGGTAGGAATTACAATTATATGAAAAGCTAACAACCCTAAATTTCCCTACCATCAGTGGTTCTATAAATCAAGTACTCAATAATTTTGTGGGTGCCGAAATAATAGATGTCATATGAACTAAAAGTAGTCAGTTTTGGGAAAAACTAAAAAACTATTTTCTTTTATATTCTTCCAAAAATCTCCACACCAAGAAAGAGAGACGTTGATTTTTGGTGACAGTCAAGATTGGGATAGATAGAGAAGGAAAGGCCTTTTTGATGAAGAAGAGTTTCTAAAGATAGGTGTCGGGTTCAAATGGCTTTACCATTTGAATTCTTGAAAAATTGTTGGAACGCCATTAAAAGTGAACCCATGTTAGTGTTCCAAGGGTCTTTGAGAATCAGGTGTCAATTCATGGTTTCAAATAAATTGCTTGTGGAGTTTTGAACACTAGAGTACTAAGAGCATTACACCTTTCCAATCATTATGGAAGTTGGGGATCTCCATTGACGTTTGATATTCCAATCCTTCCTAACTTTGAGAATATGAGGTTGCTTCATCACGATTTCATTTCGTCACTTTGCTAGAGCAGAATGGCTGGACCTATTAGGATGGCTAGGGCCTAGGGCAAGGGTTGGCGATGGAATAGAAAAATTGTAAGGCCTTGTTTTGATTGCATATGCTAGGTTTTGGTTATGTTCTTTGAACACTCTAAGATCTATCATTTTTTTCTTGATATCCCGATCTTCTATTTTTGTGAAAATGGGATGAAACACATCCTACACCTTCCGTGCAAAGTGCTGTCCTGTTTATCAATGAATTTGTATCGTACAAAAATGTCCTTCTTTTGAGCAGACTTACTAGGCTTCTCAATCTGCATCGCCTCAGGAAAAAAAAATATTTTAATTACTATTATTATTGGAAAATGGAAATAGTCTACTTCTCTACTGGAAAATCTTCAAGGAGTCTGATCCCATACAGAAATAATTTTAAGGCTCTCCAAGCTTTGAGCAATTAAAAGAAAGGAATACATATAAATTTACAAGTTAATGATACTAAAACATGTGAACATAGTTCTACGTTGCCATCTAGTCTTGAATCTTGTTAATGCTCTAATTGTGGATATTCAAATTTAAATTGGGTTAAAATTCACTTTTAATACTCAAACTTCTGTGGAGGTAACAATTTAATCTCTGGACTTTGCTTTGTAACAATTTTGTGCTTGTATTTTTAATTTCGTAACAATTTAGTCCTTGAAATTTAGTATGTAACAATTTAGTTTGTGTACTTTTAAATTTGTAACAATTTAATCCTTGTTGTGAAAAGTTCCATCGAAACTAAATGTCAATTTTTATTATTTAACTATTTAGACTTTGTTGCTTATAAACATATTGAGTCGTTAATTAGTTTATTGATATACCTATATAAGAAATTCCATTAAATATGAACAATAATTTTCACAAGGAGGACTATATCATCATAGTCTTTAAAGTGCATAGACTAAATTGTTACAAAATTGTAAGTAAAAGGACTAAATTTTACAAATCAAAGCTTAGGGACTAAATTGTACTTCCATGAAAGTTCAGGGATATTATTAACCCTTAAATTATTCTCCTCTCCTCAAGCTTTATCCTGCACATTTTCTTTCTATTGATAAGAAGGGTCTAAAAGTCTTGAGGAACTGCCCCTCACCCCCCCCCCCCCCCCCGCCCCCCCCCCCCCCCCCGCCCCCCCCCCCCCCCCCGCCCCCCCCCCCCCCCCCGCCCCCCCCCCCCCCCCCGCCCCCCCCCCCCCCCCCGCCCCCCCCCCCCCCCCCGCCCCCCCCCCCCCCCCCGCCCCCCCCCCCCCCCCCGCCCCCCCCCCCCCCCCCGCCCCCCCCCCCCCCCCCGCCCCCCCCCCCCCCCCCGCCCCCCCCCCCCCCCCCGCCCCCCCCCCCCCCCCCGCCCCCCCCCCCCCCACACCGCCCCCAACCCTCCACACACAACTGTGCTAAAGTCATAATATGTGCATGAATCTGACATATTCTCATTTCTCATTATATCAATGTTTCCTGTCAGTTCATGCTATAATCATCCATGCACCTTATGATCTAAATTCTGCTACTATGTTTGAAGCTATTTTATGAGGTGCCGACGATTTAAGATTGTTGTACTTGTACTGAAACTTGGCATCAGGTCATTGATGTTGGTGGGAAAGAGTTTAGGTTCACATGGTCACGTGAAGTTGGTGATCTTTGTGACATATATGAAGAACGTAAAAGTGGAGAAGATGGAAGTGGTTTGCTTATTGAATCAGGCAATGCTTGGAGAAAACTGAATGTTCAGCGTATCTTGGACGAATCAACTACAAACCATGTTAAGAAGTTGTCTGAAGAAGCTGAACGTAGATCTAAGTCTCGTAGGTATGCAGTCCTGTATTGGGGATGTTTCTTGCTTCTCTACTTTCGGCAATTAGCACATGTGATCTAACTGCATTTTGCCCATCTCAATTTATAGATTATCTAAACTTCAACGCATTTTTATAAAACTAAGTGCGGCTAGTTTATGTGCATATCTTAATGCTCACACACTTGATAGTTTTTGCACCATTACATGAACTGTAATTTATAACGGCAAAGAAATATGTTAGGAGTATTAGTACTAACATATATGTAATCACCTAGAAGTTTATCAGAACTGTGTAAATTGGGTGAGTTAGGAGGTTTGTGGAGGAGATTTCTTTGTGTATTTGGTATCTACCAAACCTCTCAGAGAACTTGGGAGCTTGTACTTCTTTTGTCAATTTTAATGCAAATAGATTTATGTTTTTCAAGTTCTCTACTTGCTTATATATTGTGGAATTAATTGTTTAGCATCTTGGGCATCTCTACTTGGTGCCTGGGAGGGTAGATGATTGGATTATGGAAGGCCTAAATGCTTGGAATTTGATGGGAAGAGCCAAAATTCTTGGGACTTGTGCTTTTTGAGCTATTTTGTGGCGTATTTGGAAGGAGAGGAATGTGAGGTCTATCGAAGATAACTCTTTATGTTTTATTTTCTTTTGCGATGTTGTACAAAATGTAGTGTCTTGGTGGATATCTGTACAAAATTCTCTTGTAATTACAACTGGATGATTAATAATGATTGGTTTAGCTCTCTATGTAGAGTTCTGTGTAGCGGATTCCCTTTGCCCCTGACCCTTATGGTGTTCTTGTTTTTTTTTATATGTATATATTTCTCTCGTATCTTTAAAGAAAAAGAAAGGAAGCCCACCCAACCACAAAGACTCAAAGTCCCAAACTTATTCCCACATGGGTCAAATGCCTTGTAAAATCATTTTGTTTCTCCAACAAAATCCTTTTAATTAGAAGTGGAGTACTGCATGTTCAATGCAGATAACCTTCTTGCCCTTGAATATGTGGTCTCAAAGAAGCTGATGAGCTTAGCAGAACTATCTAGGCAAGATGGAGCTTCTTCTCTGCATAACTCTGTTGGTGTCAACACCTTGAAGAATCTCTGCCCATAGAGAGACCTTGAACTCTTTAGGGAGTTTAGACTTCCATTCAGGTCTTGAGACATTAAAGTCCTCTGTATCAGCTAGCAGGAAGAGAGAATGAACAGAGGAGGAACCGGAGGACTTGTGCTCCTCAAGGGTTTGAGAAGGTTTACAGTTAAATGACACTCATTTTCAAATTCTCCGTCCATAATTTACTTCTGAACCAAATATCCCAACAAAGGGAACCTTCCTTCTACACTTTCTTAACGTCAGCTACCACACAAAGGTTTTTAAATTCATCATAGAAATGGATTTGTTCCACTAGGGATCTGCCCAAACCTCACAGAGGGGCTCCATCAAGTAACCTAAAGGTGATGTATTGCTTTTGCAATATGGCTCCTAAGATTTTTGCTTGGGGAGTAGGGGCAAGTGTTAGTGGGCCATCTAAATAATACCTACTGTACTTCCTAACAATGAACTTTTGCCAAAGGGAGTGCAGTTCTCTAGAAACTTATACTGCATCTCACGAGAAGCTATGTTTTTCTGTTTTACATTTGCCATGTTAAGCCCCCTCAAGACAAACAAAATAAAATCAAGGGAGTAATTGGCACACAGAATCATTCACGAGTACTGGCATTTTAATATAAATTAATTTATAAACATACGCATGATGTTATTTAAAAAATTTGTTCAGAATATAATTGCTTAATGCTTCTGTTTCTAACATAACATAACTACAGATGACATTCAAGCTATTTTTCTATGTAAATAAATCTGGACATGAAACTATAGTGTATTCCTTTGTTTGGCAGTTGGAATGATCTTCAATTAAAATTTTATTTGAGTTTTGGTCTTAATTTTATTTTAAATTAGAATTAATTTATAGTTTAGTTTTATTTGTAAGTTTAATGATAATATTACAGTCAACTTCTTTTCTAATATTTGTCCTCCCTCTCAAACCAAAATTCAGAGGAAAAAAATGCTTTTGATGATTGAGATGTCGCACTGACAGTTTGTGAAACCTTTTACTTTTATGGGTATACCACTTCTAAACTATAGAGAGATTTTGGTTTTTGTGATAGAGTGATGTGTGATTATGTTTAATACCATCTCTAGCTCTCTCACGTTCATTCACATGTTTTGTGTTTCTTGCTGCAGAGCTATTGTCTTAGAACCTGGGAATCCTTCTATGAAGAATCAAATAAAGCAATTGGCTGCTGCCGAAGGTATGCATTTTGTAAGGATTTGTTTAGTGGATAATTTATATTATTTTATCATTGGCATGAATAGGAGATTGACTCATTCTTATATCTGGTAAAGTTGCCTTATCAACCTGCCTTAATCACTATTTCTTTTTTTTTTTTCCTTCATCTTTTTTGTTGTTGCTGTTACTGTTATCTGATTGTATTACTGCAAAAACAGATCCTTCTATATTGAGATCAAGCCTCTACTAATACCTTGCTTGCTTGCAGCTAATCCATGGAGGCACTTTAAGAATAAGAAAGAGCCTCCCTTTAAAAAGCAGAAAAACGAATTGTCTCAAGGTGATTGATTACATTTACACATATTGAGTTCTTCAATTTCTTTTTGCAAGTTTTTATGCTTTATTCTTCCTCATTTGGCATATAAATTTTTCTCCCTCTTTGAGTGTTTTCTTTCTGACATATACTGTTTTCTCCTGATGGACATTAATGTCAATTAACCCTCGTTGTGGAGTACTGATACAAGATGTCATTGCAGTTGGGCCTCCAAAATCTACATATAAACCTGGCATGTCATCGTTACCTGCTTCGAAGGATAGGCTATCATCTTCACCTATTCCATCGCCACCCGAGCAATCCGGTGCTCCAGTATCTCAATTTGGATCTGCAAACTCCACGAAGACTCATGCATTTGCAGAAGATATTAGACCTCGACTACCTGCTAAGATTAATGCTGCTGCTAGCAGCGAGAAGGAAGTCCCGACCAAAGCTACAAAAGGAGTACTTGAAACGCCAGGACTGGAAGGGAATAGTGGAGCTAAACCAACAGACTTGCAAGGAATGTTGTATAATTTACTTTTGGAGAATCCCAAGGGGATGAGTTTGAAGGTACACAAGTAAATTGATGACTTCTACAGTTAGGAACTTAGGATGTTATTTTTCATCTATCATAATTGAGGTATAGATTCCTAATTTAAAAATAATGTTTGATGTTTCCTGTTCTGTATGTTTGTTCCCAAATTCATCAGTTCTGTTAACAAATATGGGTTTTACTATTTTCTACTTCTCTCTCAATGTTAACTCTTCTGTCACTCACTCTCCCTTCCTTCCCCTCCCTCCCTCTTGTAGGCCTTGGAGAAGGCTGTTGGCGATAAAATCCCAAATGCTGTAAAAAAGATCGAGCCAATCATTAAAAAAGTAAGTGATGTAGATGTTAAATTATTAGAAATAATGCATAATCTACTGGGTCAGTGTTTATTCAGTCATCTATTTTTTTCTTTTATTATTTATATATATATATATAAGAAAGAATGTAGAATTGAACAAAAAAAGAGAAAGAACAGCCTAGGGGTCAGGCGTAGAGAGAACCCCACCCAAAAGAATAAGAAAGCTTTCTCGTTTATTCACAATTGTGGAAAGATTGTAGTTACAAAAGAATTTCCTGAATATTATTTAGTCATTTCAATGAATCTCTTAATACTCCCTACATATGTTGTCTTGAATAGTACTTCAATTAAAAAGATCCAACAAATGAACTATCAATTGAAACAAGGAGGAAATGACGAACATGAACTTAACCCTGTTCTTCGAACTCATAATAAATTAACTGTGCTCCTGAAATCTTCCTACATTGTTGTAGTTCTGAATTCTGATATCTTCCATGCATGCTGAAGTTATTGTTCTTATATTCTTGTTTGATCTTGACTATTAGATTGCGACCTACCAAGCTCCGGGGAGATATTGTTTGAAATCAGGAGTTGAGTTGGAAGGCTCTAAAAAGCCTTCATCCGAAGGTGAAAGGTACTTCAACATTTAAAGAGCTGATTTCATCTGGTTGACGTGAACAGTTATTTGCAAAACCTTAAGTCAGGTTGTTTCTTCCAGTTACCATTTCCGTTTTTGGTCAGACAGCCTCAACTGTCTCATTTTTTGAGAATATTACATTACATACCGGATCCAAACCTACAACCTATAACCTATAGAGGGAGTAAAAAAGCTTCAAGACAACCTCAAATGTCTCATTTTTTTAGAATATTACATTACATGCTGGATTCAAACTTATAATCTCTTAGAGGAGGTTTTGGGTTTAATCCATGGTGGTCACCTATCTAGAATTTAATATCCTACGAATTTATTTGACACCCAAATGTTGTAGGGTTAGGCGGGTTGTCCGTGAGATTAGTCGAGGTGCATGTAAGTTGGCTCATAAACTCACGGATATAAAAAAAAATCTCTTAGAGGAAGTAAAAATGCTTCAACACAACCTCAAATGTCTCATTTTTTTAGAATATTACATCAAACTCATAACCTCTTAGATGAAGTAGATATGCCTCAACTATCTTCTCGTCATGGTTTGGTTTTATTGATTATCAATAAACTAAGCTTTATGAGGAGAAAAAAATGATGGGTAGAAAGCTTTAGTTTCAAGGGTTTATAATAAACAAAACAACTTCATGTGAACATAATGTTCAATGGTTAAAACATTTGTACCTCTAAGAGATTGTAGGCTCAAATCTCTACATGTTGTAATATTTTCTATACAAAAAATAAAAGAACAAGTAACTTCATTAGGAGTCTAATGTACTAGCCAGTGCATTATATATGGATTGCTCGATTTTCATCAAGAGTCTAATTCATGTCTTGTACTTTTACAGCTCTCCTTTGATCAGCCATCATCAAACTCCACCTGTACATGAAGGCCTCCCAGATCAAATAACTGCTCCAGAATTGCAGTTAGAAGCAAGATGTGGCATTGAATTGGAGGAAAAGGTAGAAACCTCTCAAGCTAATAAAGAATCAAATTTCTTGGAGAAAAATTGCATCCCACAGCATTCACCCGATCCATTTGCTGAGAAAAAAGGTTCTGAAAATAGTGAAGGCCAGCCAGCTAGCTCTTCTGACAACGAAAGTGACAGTGATTCTGAAAGTGATAGCAGTGACAGTGGAAGTGATAGTGGGAATCATAGTAGGAGTAGAAGCCGAAGCCCCATGGGTAGTGGGAGTGGGAGTAGCAGTGATAGTGAAAGTGACGCACCTTCCAACAGCAAGGAGGGTTCTGATGAGGATGTGGATATCATGACCAGTGATGATGACAAAGAATCCAAGCATAAATTGCAAGCTTCTGCGCAGGGATTCTCTACATCTCCTGCTGCTTGGAAAAGTCCAGATGGTGGGGCTGTGCAGATTGTGGATGATGAGAAGGAAGATGGACAAGAATCTGATGCAATCGATATTGAGAAAGATTCTTCTGATGATGAGCCAGATGCTAAAATTGATGATCCCAGTTTACTTCCAATTGAAGAATGTGGAAGACCTGTGGAAGAACCAAGATCCTTTTCACCATACCCTGATGAATTCCAAGAACGCCAGAATTTTATTGGGAGTTTGTTTGAGGATAGGGAAAATACTGTTCTGGACGGTGGCAGGCATGAACAATCTGACAGCACAGGTCGGATATCTAAAGGCAAGTCTAAAAGGAGTTCTGATTTGGAGTGCTTAGAAGAGAAATCTGATCATACCAAGAGATTAAAATCAGAAAGCTTAGCTCAACAACCAGTTTCTGGTAATTGGGGAGCCCAATTACAGAGTCCTCAGAGTCTATCTCCTAGTAAACTCAATAGAGATTCCATCAGAAATCCTACCAGTCAAGTTACTAATAAAGGTGAAGTCAAAGGCAATTCTGATTATAGACCAAAAAAGGGAAACAAAGAAACAGTTCCAGAAAAAAATAGTTCAGATGTTTCACAAGCAGGTTGGAGGCCCCATGACCAAAGTGGAGTGAGGGCTGTAGATACAGCAGCCAGAGCCGAGAAGCATGGTGATATTGGACGTGGCACTAAACACACTGACAAGGGTGGTCATGCCAATGAAAGTTTTCATGTGTTTAAAGATACATTTTATGGAAATGCTGAAAATGAAGGGACAAAGGAGAAAAAGGTGTCAAAAAATTCTAGATCAGGCGGTCCAGGGGATAAACAGATACAATCTTTTGACTCCCATCATGGTAAACCTGGTGAAATAGTTGGAAAATTCAAAGAAGGCCAAACATTTTCAAGCTCGCAGATGGGGTATTCACCAAGGGATAATAATAATAGAGTAAGTGCCAACAGGTCCCCTGTTAATGGAAAAGGCCGAATTCTCCAAAGAGAGCTTTCAGACCTGGAGTTAGGTGAACTTCGCGAGCCTTTCCATGAGGAAGCACGGGGTAGAAAGAAATTTGAAAGAAACAACTCCTTGAAACAGTTGGAGAATAAAGAAAACACTACAGATATCTGGAGTTCAGATTTAAGTAAAGGAAAATCTAATTTGAAGGCCAGTTTAGAATATGGTAAGCGGTCCTCACCCGATGTAAGTACCAAGTTTCCCAGCAATCTGGAAGGCTCAAATAAAAAGAAAAATTCAGAACATATAGTTGAAGATTCAACCAGGCTTAATCACCGGTCTCTGCAGTCTTATCCACAGTATAATTCAAGAGTAGATCATGTTGAAGTCGATAAGTCAGTGGATACCAATGTAAAACCTAATCAAGGGATTGGTCCAGAAAGCTGTGGGGAAAGCAATAGAAAAGCATCTGTTGGCATTTCCCAGCTGAATGATATGAAAAGAGAACAGCTTCCCTCAAAAAAAGGAAGTAAAAAGCTAGCACCTAATCCAATAGCTGAAGTTACTGATGCACAAAAGAACCCAGTATCAGCCGAGCGTGAAAATAGTGATCCAAAGAGGAGAGATTCTTCTTCAGACGAAAATAGTTGTTCATATTCCAAGTATGAAAAGGACGAGCCAGAGTTGAAGGGAGCAATCAAGGATTTTTCTCAGTAAGTTCCATCTATAACTCGTTCAAGTTCTTTTACTTTTGTTATTTGTGTCTGATAACTGATGAAAAGATGTACTTATTATTAGGTACAAGGAATATGTACAGGAGTATCATGATAAATATGAATCATACCTATCCTTGAACAAAATCCTAGAAAGCTACAGGTAATACAACTTTTCATTAATGTCCGATTTTAATCCATTCCTACACATGTTGCAGTCTGCCTGCCTCAATGTCCGGTTATTATGTTTAAACAAGTTTGATGATTTATTTGTGTTCTGGTATGTTGTTCCTGTAAAGAAACATTATTAAGTTTTTTAGCATCGATTGATCTATGTAGGGCTGAGTTCTGCAAACTCGGAAAGGAACTTGATTCTGCTAGAGGACAAGATTCGGAGAAATACTTTAATGTCTTAGGACAGTTGAAAGAATCCTATCGGCTGTGTTCAACGGTATGGTCACCTATCCTAGTATCTTTACTTTAAGTTTGCAGTTATATTAGTAGTTTCTCATGCTCCAGTATATGTTTCTCCATCTTTAGGACTTAGTGGATTAATTTGACATTAGTAATTTCCCTTGCACAAAATCCCAATGTTTACTCTGGCAAAGCTCTGAGTTCTGATATTTCATTCAGCTAGTGCTCGTTGTTTAGGCTTGTGTGATCTTTTGGTTCATATTGAGAAATTGGTATCCCAGAGGGGGCAGAGGGGGAGAGGGAATTCGTGATCGGAAGTAAAAGCTTTAGGATTGTATAGGATTAAGGTTGACAGTACATCGAGGCAGCAAGGCATGGTTGACGGTATTGGTAGAGACAATACTTTGTTTATCTCTGTTGATTGGATAACCTCGTGTTCAGGTTATTGGAATCTGGATTAGATGGGTCACATGGCTTTTACTTGATGTAGGCATTAATATGTTTTGTATGACTTGATCTGTATGCACATCACATTTTTTCCTCTTTGTTATACAAAAATTATCTTATATGGTGAAGTTGACTCAAAAAAAGATTAATTTGCAGAGGCACAAGAGGTTGAAAAAAATATTCATTGTTCTCCACGAAGAGCTGAAGGTGATTAAGATTAACCTACTAATCTACATTACCTTCATGCTTTTTTGTAGTTCTAATTTTTATCTCTAGTTTGATTTTCTCAATATGTATTATATTGTTGCAGCATATAAAGGAAAGGATTAGAGATTTTGCACAAATTTATGCAAAAGATTAAGAGTAGATGTGCCTCTCTTTGCAACGGAATGTCTTTTAGACAACCATTTTTCAACTCGAGGTAGGTTATTAAAAGGTCGAGGTGGAGAACTGCATCAGTGAATTTCCATGTATAGAAAATTATTCAGTTTCTTGATACCTTCTCCGCAACCCCCCCAATATAGGAGGAACATGTAAATTTTTTTTTTGCTCTCCCTCTCTCTCTCTC
mRNA sequence
ATGAAGGTTGAGAAGACGGTGTGGCAGTGGCTTTCCATTGCCGAGCTTAGAAAAGCTTCAGAAGGGTTTCAGTGGCGGACTAGGGAATTGAATTTGACCAAAGGGACTCGGAATTGTTCGTGTGTCTCTTCGTCTGGGTCCAAAACCTGGTTCGACCACCTTCCGTCGTCTTCTTCTCTCTCTTCCATCACAATGTATGGCGGCCCATCCAAGCTCGGCCGACCCGGCGGCGGCGCGGGACGGGGAGCCGGAGGAAAGCGCCCACACTCTTCTTTTCCTCTACCACCTTCTCACCGCCCCTCCGGCCGTCTCTCTCTTGGCGGCGGTGGCGCCGGTTCTGTCGCCAATCCTCGGAATCGAACCACCACCGCAACCACATCCGAAGCCCCTCAATCCGTCGAGGAGAATTTCAGTCTCGTTACTGGTAACAACCCCTTGGCTTTTGCCATGATAATTCGGCTGGCTCCGGATTTGATCGATGAGATCAAGCGGGTCGAGGCGCAGGGCGGGACTCCCAGGATTAAGTTTGATGCCAACGCCAAGAATTCTAGTGGTAATGTCATTGATGTTGGTGGGAAAGAGTTTAGGTTCACATGGTCACGTGAAGTTGGTGATCTTTGTGACATATATGAAGAACGTAAAAGTGGAGAAGATGGAAGTGGTTTGCTTATTGAATCAGGCAATGCTTGGAGAAAACTGAATGTTCAGCGTATCTTGGACGAATCAACTACAAACCATGTTAAGAAGTTGTCTGAAGAAGCTGAACGTAGATCTAAGTCTCGTAGAGCTATTGTCTTAGAACCTGGGAATCCTTCTATGAAGAATCAAATAAAGCAATTGGCTGCTGCCGAAGCTAATCCATGGAGGCACTTTAAGAATAAGAAAGAGCCTCCCTTTAAAAAGCAGAAAAACGAATTGTCTCAAGTTGGGCCTCCAAAATCTACATATAAACCTGGCATGTCATCGTTACCTGCTTCGAAGGATAGGCTATCATCTTCACCTATTCCATCGCCACCCGAGCAATCCGGTGCTCCAGTATCTCAATTTGGATCTGCAAACTCCACGAAGACTCATGCATTTGCAGAAGATATTAGACCTCGACTACCTGCTAAGATTAATGCTGCTGCTAGCAGCGAGAAGGAAGTCCCGACCAAAGCTACAAAAGGAGTACTTGAAACGCCAGGACTGGAAGGGAATAGTGGAGCTAAACCAACAGACTTGCAAGGAATGTTGTATAATTTACTTTTGGAGAATCCCAAGGGGATGAGTTTGAAGGCCTTGGAGAAGGCTGTTGGCGATAAAATCCCAAATGCTGTAAAAAAGATCGAGCCAATCATTAAAAAAATTGCGACCTACCAAGCTCCGGGGAGATATTGTTTGAAATCAGGAGTTGAGTTGGAAGGCTCTAAAAAGCCTTCATCCGAAGGTGAAAGCTCTCCTTTGATCAGCCATCATCAAACTCCACCTGTACATGAAGGCCTCCCAGATCAAATAACTGCTCCAGAATTGCAGTTAGAAGCAAGATGTGGCATTGAATTGGAGGAAAAGGTAGAAACCTCTCAAGCTAATAAAGAATCAAATTTCTTGGAGAAAAATTGCATCCCACAGCATTCACCCGATCCATTTGCTGAGAAAAAAGGTTCTGAAAATAGTGAAGGCCAGCCAGCTAGCTCTTCTGACAACGAAAGTGACAGTGATTCTGAAAGTGATAGCAGTGACAGTGGAAGTGATAGTGGGAATCATAGTAGGAGTAGAAGCCGAAGCCCCATGGGTAGTGGGAGTGGGAGTAGCAGTGATAGTGAAAGTGACGCACCTTCCAACAGCAAGGAGGGTTCTGATGAGGATGTGGATATCATGACCAGTGATGATGACAAAGAATCCAAGCATAAATTGCAAGCTTCTGCGCAGGGATTCTCTACATCTCCTGCTGCTTGGAAAAGTCCAGATGGTGGGGCTGTGCAGATTGTGGATGATGAGAAGGAAGATGGACAAGAATCTGATGCAATCGATATTGAGAAAGATTCTTCTGATGATGAGCCAGATGCTAAAATTGATGATCCCAGTTTACTTCCAATTGAAGAATGTGGAAGACCTGTGGAAGAACCAAGATCCTTTTCACCATACCCTGATGAATTCCAAGAACGCCAGAATTTTATTGGGAGTTTGTTTGAGGATAGGGAAAATACTGTTCTGGACGGTGGCAGGCATGAACAATCTGACAGCACAGGTCGGATATCTAAAGGCAAGTCTAAAAGGAGTTCTGATTTGGAGTGCTTAGAAGAGAAATCTGATCATACCAAGAGATTAAAATCAGAAAGCTTAGCTCAACAACCAGTTTCTGGTAATTGGGGAGCCCAATTACAGAGTCCTCAGAGTCTATCTCCTAGTAAACTCAATAGAGATTCCATCAGAAATCCTACCAGTCAAGTTACTAATAAAGGTGAAGTCAAAGGCAATTCTGATTATAGACCAAAAAAGGGAAACAAAGAAACAGTTCCAGAAAAAAATAGTTCAGATGTTTCACAAGCAGGTTGGAGGCCCCATGACCAAAGTGGAGTGAGGGCTGTAGATACAGCAGCCAGAGCCGAGAAGCATGGTGATATTGGACGTGGCACTAAACACACTGACAAGGGTGGTCATGCCAATGAAAGTTTTCATGTGTTTAAAGATACATTTTATGGAAATGCTGAAAATGAAGGGACAAAGGAGAAAAAGGTGTCAAAAAATTCTAGATCAGGCGGTCCAGGGGATAAACAGATACAATCTTTTGACTCCCATCATGGTAAACCTGGTGAAATAGTTGGAAAATTCAAAGAAGGCCAAACATTTTCAAGCTCGCAGATGGGGTATTCACCAAGGGATAATAATAATAGAGTAAGTGCCAACAGGTCCCCTGTTAATGGAAAAGGCCGAATTCTCCAAAGAGAGCTTTCAGACCTGGAGTTAGGTGAACTTCGCGAGCCTTTCCATGAGGAAGCACGGGGTAGAAAGAAATTTGAAAGAAACAACTCCTTGAAACAGTTGGAGAATAAAGAAAACACTACAGATATCTGGAGTTCAGATTTAAGTAAAGGAAAATCTAATTTGAAGGCCAGTTTAGAATATGGTAAGCGGTCCTCACCCGATGTAAGTACCAAGTTTCCCAGCAATCTGGAAGGCTCAAATAAAAAGAAAAATTCAGAACATATAGTTGAAGATTCAACCAGGCTTAATCACCGGTCTCTGCAGTCTTATCCACAGTATAATTCAAGAGTAGATCATGTTGAAGTCGATAAGTCAGTGGATACCAATGTAAAACCTAATCAAGGGATTGGTCCAGAAAGCTGTGGGGAAAGCAATAGAAAAGCATCTGTTGGCATTTCCCAGCTGAATGATATGAAAAGAGAACAGCTTCCCTCAAAAAAAGGAAGTAAAAAGCTAGCACCTAATCCAATAGCTGAAGTTACTGATGCACAAAAGAACCCAGTATCAGCCGAGCGTGAAAATAGTGATCCAAAGAGGAGAGATTCTTCTTCAGACGAAAATAGTTGTTCATATTCCAAGTATGAAAAGGACGAGCCAGAGTTGAAGGGAGCAATCAAGGATTTTTCTCAGTACAAGGAATATGTACAGGAGTATCATGATAAATATGAATCATACCTATCCTTGAACAAAATCCTAGAAAGCTACAGGGCTGAGTTCTGCAAACTCGGAAAGGAACTTGATTCTGCTAGAGGACAAGATTCGGAGAAATACTTTAATGTCTTAGGACAGTTGAAAGAATCCTATCGGCTGTGTTCAACGAGGCACAAGAGGTTGAAAAAAATATTCATTGTTCTCCACGAAGAGCTGAAGCATATAAAGGAAAGGATTAGAGATTTTGCACAAATTTATGCAAAAGATTAAGAGTAGATGTGCCTCTCTTTGCAACGGAATGTCTTTTAGACAACCATTTTTCAACTCGAGGTAGGTTATTAAAAGGTCGAGGTGGAGAACTGCATCAGTGAATTTCCATGTATAGAAAATTATTCAGTTTCTTGATACCTTCTCCGCAACCCCCCCAATATAGGAGGAACATGTAAATTTTTTTTTTGCTCTCCCTCTCTCTCTCTC
Coding sequence (CDS)
ATGAAGGTTGAGAAGACGGTGTGGCAGTGGCTTTCCATTGCCGAGCTTAGAAAAGCTTCAGAAGGGTTTCAGTGGCGGACTAGGGAATTGAATTTGACCAAAGGGACTCGGAATTGTTCGTGTGTCTCTTCGTCTGGGTCCAAAACCTGGTTCGACCACCTTCCGTCGTCTTCTTCTCTCTCTTCCATCACAATGTATGGCGGCCCATCCAAGCTCGGCCGACCCGGCGGCGGCGCGGGACGGGGAGCCGGAGGAAAGCGCCCACACTCTTCTTTTCCTCTACCACCTTCTCACCGCCCCTCCGGCCGTCTCTCTCTTGGCGGCGGTGGCGCCGGTTCTGTCGCCAATCCTCGGAATCGAACCACCACCGCAACCACATCCGAAGCCCCTCAATCCGTCGAGGAGAATTTCAGTCTCGTTACTGGTAACAACCCCTTGGCTTTTGCCATGATAATTCGGCTGGCTCCGGATTTGATCGATGAGATCAAGCGGGTCGAGGCGCAGGGCGGGACTCCCAGGATTAAGTTTGATGCCAACGCCAAGAATTCTAGTGGTAATGTCATTGATGTTGGTGGGAAAGAGTTTAGGTTCACATGGTCACGTGAAGTTGGTGATCTTTGTGACATATATGAAGAACGTAAAAGTGGAGAAGATGGAAGTGGTTTGCTTATTGAATCAGGCAATGCTTGGAGAAAACTGAATGTTCAGCGTATCTTGGACGAATCAACTACAAACCATGTTAAGAAGTTGTCTGAAGAAGCTGAACGTAGATCTAAGTCTCGTAGAGCTATTGTCTTAGAACCTGGGAATCCTTCTATGAAGAATCAAATAAAGCAATTGGCTGCTGCCGAAGCTAATCCATGGAGGCACTTTAAGAATAAGAAAGAGCCTCCCTTTAAAAAGCAGAAAAACGAATTGTCTCAAGTTGGGCCTCCAAAATCTACATATAAACCTGGCATGTCATCGTTACCTGCTTCGAAGGATAGGCTATCATCTTCACCTATTCCATCGCCACCCGAGCAATCCGGTGCTCCAGTATCTCAATTTGGATCTGCAAACTCCACGAAGACTCATGCATTTGCAGAAGATATTAGACCTCGACTACCTGCTAAGATTAATGCTGCTGCTAGCAGCGAGAAGGAAGTCCCGACCAAAGCTACAAAAGGAGTACTTGAAACGCCAGGACTGGAAGGGAATAGTGGAGCTAAACCAACAGACTTGCAAGGAATGTTGTATAATTTACTTTTGGAGAATCCCAAGGGGATGAGTTTGAAGGCCTTGGAGAAGGCTGTTGGCGATAAAATCCCAAATGCTGTAAAAAAGATCGAGCCAATCATTAAAAAAATTGCGACCTACCAAGCTCCGGGGAGATATTGTTTGAAATCAGGAGTTGAGTTGGAAGGCTCTAAAAAGCCTTCATCCGAAGGTGAAAGCTCTCCTTTGATCAGCCATCATCAAACTCCACCTGTACATGAAGGCCTCCCAGATCAAATAACTGCTCCAGAATTGCAGTTAGAAGCAAGATGTGGCATTGAATTGGAGGAAAAGGTAGAAACCTCTCAAGCTAATAAAGAATCAAATTTCTTGGAGAAAAATTGCATCCCACAGCATTCACCCGATCCATTTGCTGAGAAAAAAGGTTCTGAAAATAGTGAAGGCCAGCCAGCTAGCTCTTCTGACAACGAAAGTGACAGTGATTCTGAAAGTGATAGCAGTGACAGTGGAAGTGATAGTGGGAATCATAGTAGGAGTAGAAGCCGAAGCCCCATGGGTAGTGGGAGTGGGAGTAGCAGTGATAGTGAAAGTGACGCACCTTCCAACAGCAAGGAGGGTTCTGATGAGGATGTGGATATCATGACCAGTGATGATGACAAAGAATCCAAGCATAAATTGCAAGCTTCTGCGCAGGGATTCTCTACATCTCCTGCTGCTTGGAAAAGTCCAGATGGTGGGGCTGTGCAGATTGTGGATGATGAGAAGGAAGATGGACAAGAATCTGATGCAATCGATATTGAGAAAGATTCTTCTGATGATGAGCCAGATGCTAAAATTGATGATCCCAGTTTACTTCCAATTGAAGAATGTGGAAGACCTGTGGAAGAACCAAGATCCTTTTCACCATACCCTGATGAATTCCAAGAACGCCAGAATTTTATTGGGAGTTTGTTTGAGGATAGGGAAAATACTGTTCTGGACGGTGGCAGGCATGAACAATCTGACAGCACAGGTCGGATATCTAAAGGCAAGTCTAAAAGGAGTTCTGATTTGGAGTGCTTAGAAGAGAAATCTGATCATACCAAGAGATTAAAATCAGAAAGCTTAGCTCAACAACCAGTTTCTGGTAATTGGGGAGCCCAATTACAGAGTCCTCAGAGTCTATCTCCTAGTAAACTCAATAGAGATTCCATCAGAAATCCTACCAGTCAAGTTACTAATAAAGGTGAAGTCAAAGGCAATTCTGATTATAGACCAAAAAAGGGAAACAAAGAAACAGTTCCAGAAAAAAATAGTTCAGATGTTTCACAAGCAGGTTGGAGGCCCCATGACCAAAGTGGAGTGAGGGCTGTAGATACAGCAGCCAGAGCCGAGAAGCATGGTGATATTGGACGTGGCACTAAACACACTGACAAGGGTGGTCATGCCAATGAAAGTTTTCATGTGTTTAAAGATACATTTTATGGAAATGCTGAAAATGAAGGGACAAAGGAGAAAAAGGTGTCAAAAAATTCTAGATCAGGCGGTCCAGGGGATAAACAGATACAATCTTTTGACTCCCATCATGGTAAACCTGGTGAAATAGTTGGAAAATTCAAAGAAGGCCAAACATTTTCAAGCTCGCAGATGGGGTATTCACCAAGGGATAATAATAATAGAGTAAGTGCCAACAGGTCCCCTGTTAATGGAAAAGGCCGAATTCTCCAAAGAGAGCTTTCAGACCTGGAGTTAGGTGAACTTCGCGAGCCTTTCCATGAGGAAGCACGGGGTAGAAAGAAATTTGAAAGAAACAACTCCTTGAAACAGTTGGAGAATAAAGAAAACACTACAGATATCTGGAGTTCAGATTTAAGTAAAGGAAAATCTAATTTGAAGGCCAGTTTAGAATATGGTAAGCGGTCCTCACCCGATGTAAGTACCAAGTTTCCCAGCAATCTGGAAGGCTCAAATAAAAAGAAAAATTCAGAACATATAGTTGAAGATTCAACCAGGCTTAATCACCGGTCTCTGCAGTCTTATCCACAGTATAATTCAAGAGTAGATCATGTTGAAGTCGATAAGTCAGTGGATACCAATGTAAAACCTAATCAAGGGATTGGTCCAGAAAGCTGTGGGGAAAGCAATAGAAAAGCATCTGTTGGCATTTCCCAGCTGAATGATATGAAAAGAGAACAGCTTCCCTCAAAAAAAGGAAGTAAAAAGCTAGCACCTAATCCAATAGCTGAAGTTACTGATGCACAAAAGAACCCAGTATCAGCCGAGCGTGAAAATAGTGATCCAAAGAGGAGAGATTCTTCTTCAGACGAAAATAGTTGTTCATATTCCAAGTATGAAAAGGACGAGCCAGAGTTGAAGGGAGCAATCAAGGATTTTTCTCAGTACAAGGAATATGTACAGGAGTATCATGATAAATATGAATCATACCTATCCTTGAACAAAATCCTAGAAAGCTACAGGGCTGAGTTCTGCAAACTCGGAAAGGAACTTGATTCTGCTAGAGGACAAGATTCGGAGAAATACTTTAATGTCTTAGGACAGTTGAAAGAATCCTATCGGCTGTGTTCAACGAGGCACAAGAGGTTGAAAAAAATATTCATTGTTCTCCACGAAGAGCTGAAGCATATAAAGGAAAGGATTAGAGATTTTGCACAAATTTATGCAAAAGATTAA
Protein sequence
MKVEKTVWQWLSIAELRKASEGFQWRTRELNLTKGTRNCSCVSSSGSKTWFDHLPSSSSLSSITMYGGPSKLGRPGGGAGRGAGGKRPHSSFPLPPSHRPSGRLSLGGGGAGSVANPRNRTTTATTSEAPQSVEENFSLVTGNNPLAFAMIIRLAPDLIDEIKRVEAQGGTPRIKFDANAKNSSGNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLIESGNAWRKLNVQRILDESTTNHVKKLSEEAERRSKSRRAIVLEPGNPSMKNQIKQLAAAEANPWRHFKNKKEPPFKKQKNELSQVGPPKSTYKPGMSSLPASKDRLSSSPIPSPPEQSGAPVSQFGSANSTKTHAFAEDIRPRLPAKINAAASSEKEVPTKATKGVLETPGLEGNSGAKPTDLQGMLYNLLLENPKGMSLKALEKAVGDKIPNAVKKIEPIIKKIATYQAPGRYCLKSGVELEGSKKPSSEGESSPLISHHQTPPVHEGLPDQITAPELQLEARCGIELEEKVETSQANKESNFLEKNCIPQHSPDPFAEKKGSENSEGQPASSSDNESDSDSESDSSDSGSDSGNHSRSRSRSPMGSGSGSSSDSESDAPSNSKEGSDEDVDIMTSDDDKESKHKLQASAQGFSTSPAAWKSPDGGAVQIVDDEKEDGQESDAIDIEKDSSDDEPDAKIDDPSLLPIEECGRPVEEPRSFSPYPDEFQERQNFIGSLFEDRENTVLDGGRHEQSDSTGRISKGKSKRSSDLECLEEKSDHTKRLKSESLAQQPVSGNWGAQLQSPQSLSPSKLNRDSIRNPTSQVTNKGEVKGNSDYRPKKGNKETVPEKNSSDVSQAGWRPHDQSGVRAVDTAARAEKHGDIGRGTKHTDKGGHANESFHVFKDTFYGNAENEGTKEKKVSKNSRSGGPGDKQIQSFDSHHGKPGEIVGKFKEGQTFSSSQMGYSPRDNNNRVSANRSPVNGKGRILQRELSDLELGELREPFHEEARGRKKFERNNSLKQLENKENTTDIWSSDLSKGKSNLKASLEYGKRSSPDVSTKFPSNLEGSNKKKNSEHIVEDSTRLNHRSLQSYPQYNSRVDHVEVDKSVDTNVKPNQGIGPESCGESNRKASVGISQLNDMKREQLPSKKGSKKLAPNPIAEVTDAQKNPVSAERENSDPKRRDSSSDENSCSYSKYEKDEPELKGAIKDFSQYKEYVQEYHDKYESYLSLNKILESYRAEFCKLGKELDSARGQDSEKYFNVLGQLKESYRLCSTRHKRLKKIFIVLHEELKHIKERIRDFAQIYAKD
Homology
BLAST of CcUC07G141330 vs. NCBI nr
Match:
XP_038883601.1 (dentin sialophosphoprotein isoform X1 [Benincasa hispida])
HSP 1 Score: 2118.2 bits (5487), Expect = 0.0e+00
Identity = 1131/1231 (91.88%), Postives = 1165/1231 (94.64%), Query Frame = 0
Query: 65 MYGGPSKLGRPGGGAGRGAGGKRPHSSFPLPPSHRPSGRLSLGGGGAGSVANPRNRTTTA 124
MYGG SKLGRPGGGAGRGAGGKRPHSSFPLPPSHRPSGRLSLGGGG SV+NPRNRTTTA
Sbjct: 1 MYGGASKLGRPGGGAGRGAGGKRPHSSFPLPPSHRPSGRLSLGGGGPVSVSNPRNRTTTA 60
Query: 125 TTSEAPQSVEENFSLVTGNNPLAFAMIIRLAPDLIDEIKRVEAQGGTPRIKFDANAKNSS 184
TTSEAPQSVEENFSLVTGNNPLAFAMIIRLAPDLIDEIKRVEAQGGTPRIKFDANAKNSS
Sbjct: 61 TTSEAPQSVEENFSLVTGNNPLAFAMIIRLAPDLIDEIKRVEAQGGTPRIKFDANAKNSS 120
Query: 185 GNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLIESGNAWRKLNVQRILDESTT 244
GNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLIESGNAWRKLNVQRILDESTT
Sbjct: 121 GNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLIESGNAWRKLNVQRILDESTT 180
Query: 245 NHVKKLSEEAERRSKSRRAIVLEPGNPSMKNQIKQLAAAEANPWRHFKNKKEPPFKKQKN 304
NHVKKLSEEAERRSKSRRAIVLEPGNPSMKNQIKQLAAAEANPWRHFKNKKEPPFKKQKN
Sbjct: 181 NHVKKLSEEAERRSKSRRAIVLEPGNPSMKNQIKQLAAAEANPWRHFKNKKEPPFKKQKN 240
Query: 305 ELSQVGPPKSTYKPGMSSLPASKDRLSSSPIPSPPEQSGAPVSQFGSANSTKTHAFAEDI 364
ELSQVGPPKSTYKPG+SSLPASKDRLSSSPIPSPPEQSGAPVS FGSAN+TKTH EDI
Sbjct: 241 ELSQVGPPKSTYKPGISSLPASKDRLSSSPIPSPPEQSGAPVSHFGSANTTKTHVITEDI 300
Query: 365 RPRLPAKINAAASSEKEVPTKATKGVLETPGLEGNSGAKPTDLQGMLYNLLLENPKGMSL 424
RPRLPAK+NAAASSEKE+ TKA KGVLETPG EGNSGAK TDLQGMLYNLLLENPKGMSL
Sbjct: 301 RPRLPAKVNAAASSEKEISTKAAKGVLETPGQEGNSGAKTTDLQGMLYNLLLENPKGMSL 360
Query: 425 KALEKAVGDKIPNAVKKIEPIIKKIATYQAPGRYCLKSGVELEGSKKPSSEGESSPLISH 484
KALEKAVGDKIPNAVKKIEPIIKKIATYQAPGRYCLKSGVELEGSKKPSSEGESSPLISH
Sbjct: 361 KALEKAVGDKIPNAVKKIEPIIKKIATYQAPGRYCLKSGVELEGSKKPSSEGESSPLISH 420
Query: 485 HQTPPVHEGLPDQITAPELQLEARCGIELEEKVETSQANKESNFLEKNCIPQHSPDPFAE 544
HQ PPVHE LPDQITAPELQLEAR GIELEEKVETSQANK+SNFLEKN I QHSPD FAE
Sbjct: 421 HQNPPVHEDLPDQITAPELQLEARSGIELEEKVETSQANKKSNFLEKNGIQQHSPDLFAE 480
Query: 545 KKGSENSEGQPASSSDNESDSDSESDSSDSGSDSGNHSRSRSRSPMGSGSGSSSDSESDA 604
KKGSENSE Q ASSSDNESDSDSESDSSDSGSDSGNHSRSRSRSP+GSGSGSSSDSES+
Sbjct: 481 KKGSENSERQAASSSDNESDSDSESDSSDSGSDSGNHSRSRSRSPVGSGSGSSSDSESEV 540
Query: 605 PSNSKEGSDEDVDIMTSDDDKESKHKLQASAQGFSTSPAAWKSPDGGAVQIVDDEKEDGQ 664
PSNSKEGSDEDVDIMTSDDDKESKHKLQASAQGFSTSPAAWKSPDGGAVQI+DDEKEDGQ
Sbjct: 541 PSNSKEGSDEDVDIMTSDDDKESKHKLQASAQGFSTSPAAWKSPDGGAVQIIDDEKEDGQ 600
Query: 665 ESDAIDIEKDSSDDEPDAKIDDPSLLPIEECGRPVEEPRSFSPYPDEFQERQNFIGSLFE 724
ESDAIDIE DSSDDEPDAKIDD S LPIE GR VEEPRSFSPYPDEFQERQNFIGSLFE
Sbjct: 601 ESDAIDIENDSSDDEPDAKIDDRSFLPIEG-GRLVEEPRSFSPYPDEFQERQNFIGSLFE 660
Query: 725 DRENTVLDGGRHEQSDSTGRISKGKSKRSSDLECLEEKSDHTKRLKSESLAQQPVSGNWG 784
DR+NTV+D GRHEQSDSTG+ISKGKSKRSSDLECLEEKSDHTKRLKSESLAQQPVS
Sbjct: 661 DRDNTVVDSGRHEQSDSTGQISKGKSKRSSDLECLEEKSDHTKRLKSESLAQQPVS---- 720
Query: 785 AQLQSPQSLSPSKLNRDSIRNPTSQVTNKGEVKGNSDYRPKKGNKETVPEKNSSDVSQAG 844
SK RDS+RNPTSQVTNKGEVKGNSD+RPKKG+KETV EKNSSDVSQAG
Sbjct: 721 -----------SKHGRDSVRNPTSQVTNKGEVKGNSDFRPKKGHKETVSEKNSSDVSQAG 780
Query: 845 WRPHDQS-GVRAVDTAARAEKHGDIGRGTKHTDKGGHANESFHVFKDTFYGNAENEGTKE 904
WRPHDQS GVRAVDTAAR +KHGDIGRGTKHT+K GHANE+FH+FKDTF+GNAENEGTKE
Sbjct: 781 WRPHDQSGGVRAVDTAARTDKHGDIGRGTKHTEKSGHANENFHMFKDTFHGNAENEGTKE 840
Query: 905 KKVSKNSRSGGPGDKQIQSFDSHHGKPGEIVGKFKEGQTFSSSQMGYSPRDNNNRVSANR 964
KKVSKNSRSGGPGDKQIQ FDSHH KPGEIVGKFK+GQTFSSSQMGYSPRDNNNR+SANR
Sbjct: 841 KKVSKNSRSGGPGDKQIQPFDSHHSKPGEIVGKFKDGQTFSSSQMGYSPRDNNNRISANR 900
Query: 965 SPVNGKGRILQRELSDLELGELREPFHEEARGRKKFERNNSLKQLENKENTTDIWSSDLS 1024
SPVNGKGRILQRELSDLELGELR+PF EE+RG+KKFERNNSLKQLENKE+TTDIW SDLS
Sbjct: 901 SPVNGKGRILQRELSDLELGELRDPFPEESRGKKKFERNNSLKQLENKESTTDIWGSDLS 960
Query: 1025 KGKSNLKASLEYGKRSSPDVSTKFPSNLEGSNKKKNSEHIVEDSTRLNHRSLQSYPQYNS 1084
+GKSNLK SLEYGKRS P VSTKFPSN EGSNKKK SEHIVEDSTRLN RSLQS+PQYNS
Sbjct: 961 RGKSNLKTSLEYGKRSPPHVSTKFPSNPEGSNKKKTSEHIVEDSTRLNQRSLQSHPQYNS 1020
Query: 1085 RVDHVEVDKSVDTNVKPNQGIGPESCGESNRKASVGISQLNDMKREQLPSKKGSKKLAPN 1144
RVDHVEVDKS+ NVKPNQGIGPE CGESNRKASVGISQLNDMKREQLPSKKGSK+LAPN
Sbjct: 1021 RVDHVEVDKSIAANVKPNQGIGPEGCGESNRKASVGISQLNDMKREQLPSKKGSKRLAPN 1080
Query: 1145 PIAEVTDAQKNPVSAERENSDPKRRDSSSDENSCSYSKYEKDEPELKGAIKDFSQYKEYV 1204
PI EVT+A KNPVSAERENSDPKRRDSSSDENSCSYSKYEKDEPELKGAIKDFSQY+EYV
Sbjct: 1081 PITEVTEALKNPVSAERENSDPKRRDSSSDENSCSYSKYEKDEPELKGAIKDFSQYEEYV 1140
Query: 1205 QEYHDKYESYLSLNKILESYRAEFCKLGKELDSARGQDSEKYFNVLGQLKESYRLCSTRH 1264
QEYHDKYESYLSLNKILESYRAEFCKLGKELDSARGQDSEKYFNVLGQLKESYRLCS RH
Sbjct: 1141 QEYHDKYESYLSLNKILESYRAEFCKLGKELDSARGQDSEKYFNVLGQLKESYRLCSARH 1200
Query: 1265 KRLKKIFIVLHEELKHIKERIRDFAQIYAKD 1295
KRLKKIFIVLHEELKHIK+RIRDFA+ AKD
Sbjct: 1201 KRLKKIFIVLHEELKHIKDRIRDFARTSAKD 1215
BLAST of CcUC07G141330 vs. NCBI nr
Match:
XP_004146856.1 (dentin sialophosphoprotein isoform X1 [Cucumis sativus] >KGN59835.1 hypothetical protein Csa_002066 [Cucumis sativus])
HSP 1 Score: 2094.7 bits (5426), Expect = 0.0e+00
Identity = 1115/1230 (90.65%), Postives = 1149/1230 (93.41%), Query Frame = 0
Query: 65 MYGGPSKLGRPGGGAGRGAGGKRPHSSFPLPPSHRPSGRLSLGGGGAGSVANPRNRTTTA 124
MYGGPSKLGRPGGGAGRG GKRPHSSFPLPPSHRPSGRLSLGGG AGSV+NPRNRTTTA
Sbjct: 1 MYGGPSKLGRPGGGAGRGHAGKRPHSSFPLPPSHRPSGRLSLGGGAAGSVSNPRNRTTTA 60
Query: 125 TTSEAPQSVEENFSLVTGNNPLAFAMIIRLAPDLIDEIKRVEAQGGTPRIKFDANAKNSS 184
TTSEA QS EENFSLVTGNNPLAF MIIRLAPDLIDEIKRVEAQGGTPRIKFDANA NSS
Sbjct: 61 TTSEASQSAEENFSLVTGNNPLAFGMIIRLAPDLIDEIKRVEAQGGTPRIKFDANANNSS 120
Query: 185 GNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLIESGNAWRKLNVQRILDESTT 244
GNVIDVGGKEFRFTWSRE GD C+IYEERKSGEDGSGLLIESGN WRKLNV R+LDESTT
Sbjct: 121 GNVIDVGGKEFRFTWSRERGDSCEIYEERKSGEDGSGLLIESGNCWRKLNVHRVLDESTT 180
Query: 245 NHVKKLSEEAERRSKSRRAIVLEPGNPSMKNQIKQLAAAEANPWRHFKNKKEPPFKKQKN 304
NHVKKLSEEAER+SKSRRAIVLEPGNPSMKNQIKQLAAAEANPWRHFKNKKEPPFKKQKN
Sbjct: 181 NHVKKLSEEAERKSKSRRAIVLEPGNPSMKNQIKQLAAAEANPWRHFKNKKEPPFKKQKN 240
Query: 305 ELSQVGPPKSTYKPGMSSLPASKDRLSSSPIPSPPEQSGAPVSQFGSANSTKTHAFAEDI 364
ELSQVGPPKSTYKPGM SLPASKDRLSSSPIP PPEQ GAPVSQFGSAN++KTH AEDI
Sbjct: 241 ELSQVGPPKSTYKPGMPSLPASKDRLSSSPIPLPPEQFGAPVSQFGSANTSKTHVIAEDI 300
Query: 365 RPRLPAKINAAASSEKEVPTKATKGVLETPGLEGNSGAKPTDLQGMLYNLLLENPKGMSL 424
RPR+PAKIN AAS+EKE+PT A KGVLETPG EGNSG KPTDLQGMLYNLLLENPKGMSL
Sbjct: 301 RPRVPAKINPAASNEKEIPTIAPKGVLETPGQEGNSGTKPTDLQGMLYNLLLENPKGMSL 360
Query: 425 KALEKAVGDKIPNAVKKIEPIIKKIATYQAPGRYCLKSGVELEGSKKPSSEGESSPLISH 484
KALEKAVGDKIPNAVKKIEPIIKKIATYQAPGRY LKSGV LEGSKKP+SEGESSPLISH
Sbjct: 361 KALEKAVGDKIPNAVKKIEPIIKKIATYQAPGRYLLKSGVGLEGSKKPTSEGESSPLISH 420
Query: 485 HQTPPVHEGLPDQITAPELQLEARCGIELEEKVETSQANKESNFLEKNCIPQHSPDPFAE 544
HQT VHE LPDQ APELQLEARCG++LEEKVETSQANKESNFLE N I Q PDPFAE
Sbjct: 421 HQT-SVHEDLPDQTNAPELQLEARCGMDLEEKVETSQANKESNFLETNGIQQ--PDPFAE 480
Query: 545 KKGSENSEGQPASSSDNESDSDSESDSSDSGSDSGNHSRSRSRSPMGSGSGSSSDSESDA 604
KK SENSEGQ ASSSDNESDSDS+SDSSDSGSDSGNHSRSRSRSP+GSGSGSSSDSESD
Sbjct: 481 KKSSENSEGQAASSSDNESDSDSDSDSSDSGSDSGNHSRSRSRSPVGSGSGSSSDSESDG 540
Query: 605 PSNSKEGSDEDVDIMTSDDDKESKHKLQASAQGFSTSPAAWKSPDGGAVQIVDDEKEDGQ 664
PSNS+EGSD DVDIMTSDDDKESK KLQAS QGFSTSPAAWKSPDGG VQI+DDEKEDGQ
Sbjct: 541 PSNSQEGSDVDVDIMTSDDDKESKQKLQASVQGFSTSPAAWKSPDGGPVQIIDDEKEDGQ 600
Query: 665 ESDAIDIEKDSSDDEPDAKIDDPSLLPIEECGRPVEEPRSFSPYPDEFQERQNFIGSLFE 724
E DAIDIEKDSSDDEPDAKID SLLP EE RPVEEPRSFSPYPDEFQERQNFIGSLFE
Sbjct: 601 EYDAIDIEKDSSDDEPDAKIDGRSLLPTEEGVRPVEEPRSFSPYPDEFQERQNFIGSLFE 660
Query: 725 DRENTVLDGGRHEQSDSTGRISKGKSKRSSDLECLEEKSDHTKRLKSESLAQQPVSGNWG 784
DREN V+D RHEQSDSTGRISKGKSKRSSDLECLEEKSDHTKRLKSESLAQQPVSGNWG
Sbjct: 661 DRENNVVDSARHEQSDSTGRISKGKSKRSSDLECLEEKSDHTKRLKSESLAQQPVSGNWG 720
Query: 785 AQLQSPQSLSPSKLNRDSIRNPTSQVTNKGEVKGNSDYRPKKGNKETVPEKNSSDVSQAG 844
QLQSP++LSPSKLNRDS+RN TSQVTNKGE+KGNSD+RPKKGNKETV EKNSSDVSQAG
Sbjct: 721 VQLQSPRNLSPSKLNRDSVRNLTSQVTNKGEIKGNSDFRPKKGNKETVSEKNSSDVSQAG 780
Query: 845 WRPHDQSGVRAVDTAARAEKHGDIGRGTKHTDKGGHANESFHVFKDTFYGNAENEGTKEK 904
WRPHDQSGVRAVDTA RA+KHGDIGRGTKHT+K GHANE+FHVFKDTFYGN +NEGTKEK
Sbjct: 781 WRPHDQSGVRAVDTATRADKHGDIGRGTKHTEKSGHANENFHVFKDTFYGNPDNEGTKEK 840
Query: 905 KVSKNSRSGGPGDKQIQSFDSHHGKPGEIVGKFKEGQTFSSSQMGYSPRDNNNRVSANRS 964
KVSKNSRSGGPGDKQIQ DSHH KPGEIVGKFK+GQTFSSSQMGYSPRDNNNRVSANRS
Sbjct: 841 KVSKNSRSGGPGDKQIQPLDSHHSKPGEIVGKFKDGQTFSSSQMGYSPRDNNNRVSANRS 900
Query: 965 PVNGKGRILQRELSDLELGELREPFHEEARGRKKFERNNSLKQLENKENTTDIWSSDLSK 1024
PVNGKGRILQRE SDLELGELREPFHEEARG+KKFERNNSLKQLENKENTTDIW SDL+K
Sbjct: 901 PVNGKGRILQREPSDLELGELREPFHEEARGKKKFERNNSLKQLENKENTTDIWGSDLNK 960
Query: 1025 GKSNLKASLEYGKRSSPDVSTKFPSNLEGSNKKKNSEHIVEDSTRLNHRSLQSYPQYNSR 1084
GKSNLKASLEYGKRSSP VSTKFPSN EGSNKKKNSEHIVEDS R+N+RSL S+ QYNSR
Sbjct: 961 GKSNLKASLEYGKRSSPHVSTKFPSNPEGSNKKKNSEHIVEDSNRINNRSLLSHSQYNSR 1020
Query: 1085 VDHVEVDKSVDTNVKPNQGIGPESCGESNRKASVGISQLNDMKREQLPSKKGSKKLAPNP 1144
+DH EVDKS D NVKPNQG GPE ESNRKASVGISQLND KREQ PSKKGSK+ APNP
Sbjct: 1021 IDHAEVDKSADGNVKPNQGNGPEGYVESNRKASVGISQLNDTKREQPPSKKGSKRQAPNP 1080
Query: 1145 IAEVTDAQKNPVSAERENSDPKRRDSSSDENSCSYSKYEKDEPELKGAIKDFSQYKEYVQ 1204
I EVTD KNPVSAERENSDPKRRDSSSDENSCSYSKYEKDEPELKGAIKDFSQYKEYVQ
Sbjct: 1081 ITEVTDGLKNPVSAERENSDPKRRDSSSDENSCSYSKYEKDEPELKGAIKDFSQYKEYVQ 1140
Query: 1205 EYHDKYESYLSLNKILESYRAEFCKLGKELDSARGQDSEKYFNVLGQLKESYRLCSTRHK 1264
EYHDKYESYLSLNKILESYR EFCKLGKELDSARGQDSEKYFNVLGQLKESYRLCSTRHK
Sbjct: 1141 EYHDKYESYLSLNKILESYRTEFCKLGKELDSARGQDSEKYFNVLGQLKESYRLCSTRHK 1200
Query: 1265 RLKKIFIVLHEELKHIKERIRDFAQIYAKD 1295
RLKKIFIVLHEELKHIKERIRDF Q YAKD
Sbjct: 1201 RLKKIFIVLHEELKHIKERIRDFVQTYAKD 1227
BLAST of CcUC07G141330 vs. NCBI nr
Match:
XP_008447590.1 (PREDICTED: dentin sialophosphoprotein isoform X1 [Cucumis melo])
HSP 1 Score: 2090.8 bits (5416), Expect = 0.0e+00
Identity = 1113/1231 (90.41%), Postives = 1149/1231 (93.34%), Query Frame = 0
Query: 65 MYGGPSKLGRPGGGAGRGAGGKRPHSSFPLPPSHRPSGRLSLGGGGAGSVANPRNRTTTA 124
MYGGPSKLGRPGGGAGRG GGKRPHSSFPLPPSHRPSGRLSLGGG AGS +NPRNRTTTA
Sbjct: 1 MYGGPSKLGRPGGGAGRGHGGKRPHSSFPLPPSHRPSGRLSLGGGAAGSASNPRNRTTTA 60
Query: 125 TTSEAPQSVEENFSLVTGNNPLAFAMIIRLAPDLIDEIKRVEAQGGTPRIKFDANAKNSS 184
TTSEA QS EENFSLVTGNNPLAF MIIRLAPDLIDEIKRVEAQGGTPRIKFDA A NSS
Sbjct: 61 TTSEASQSTEENFSLVTGNNPLAFGMIIRLAPDLIDEIKRVEAQGGTPRIKFDAIANNSS 120
Query: 185 GNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLIESGNAWRKLNVQRILDESTT 244
GNVIDVGGKEFRFTWSRE GD C+IYEERKSGEDGSGLLIESGN WRKLNV R+LDESTT
Sbjct: 121 GNVIDVGGKEFRFTWSRERGDSCEIYEERKSGEDGSGLLIESGNCWRKLNVHRVLDESTT 180
Query: 245 NHVKKLSEEAERRSKSRRAIVLEPGNPSMKNQIKQLAAAEANPWRHFKNKKEPPFKKQKN 304
NHVKKLSEEAER+SKSRRAIVLEPGNPSMKNQIKQLAAAEANPWRHFKNKKEPPFKKQKN
Sbjct: 181 NHVKKLSEEAERKSKSRRAIVLEPGNPSMKNQIKQLAAAEANPWRHFKNKKEPPFKKQKN 240
Query: 305 ELSQVGPPKSTYKPGMSSLPASKDRLSSSPIPSPPEQSGAPVSQFGSANSTKTHAFAEDI 364
ELSQVGPPKSTYKPGM SLPASKDRLSSSPIP PPEQ G PVSQFGSAN+ KTH AEDI
Sbjct: 241 ELSQVGPPKSTYKPGMPSLPASKDRLSSSPIPLPPEQFGGPVSQFGSANTNKTHVIAEDI 300
Query: 365 RPRLPAKINAAASSEKEVPTKATKGVLETPGLEGNSGAKPTDLQGMLYNLLLENPKGMSL 424
RPR+PAKIN AAS+EKE+ T A KGVLETPG EGNSGAKPTDLQGMLYNLLLENPKGMSL
Sbjct: 301 RPRVPAKINPAASNEKEILTIAPKGVLETPGQEGNSGAKPTDLQGMLYNLLLENPKGMSL 360
Query: 425 KALEKAVGDKIPNAVKKIEPIIKKIATYQAPGRYCLKSGVELEGSKKPSSEGESSPLISH 484
KALEKAVGDKIPNAVKKIEPIIKKIATYQAPGRYCLKSGVELEGSKKP+SEGESSPL+SH
Sbjct: 361 KALEKAVGDKIPNAVKKIEPIIKKIATYQAPGRYCLKSGVELEGSKKPTSEGESSPLVSH 420
Query: 485 HQTPPVHEGLPDQITAPELQLEARCGIELEEKVETSQANKESNFLEKNCIPQHSPDPFAE 544
HQT VHE LPDQI APELQLEA CGI+LEEKVETSQANKESNFLEKN I Q PDPFAE
Sbjct: 421 HQT-SVHEDLPDQINAPELQLEAGCGIDLEEKVETSQANKESNFLEKNGIQQ--PDPFAE 480
Query: 545 KKGSENSEGQPASSSDNESDSDSESDSSDSGSDSGNHSRSRSRSPMGSGSGSSSDSESDA 604
KKGSENSEGQ ASSSDN SDSDS+SDSSDSGSDSGNHSRSRSRSP+GSGSGSSSDSESD
Sbjct: 481 KKGSENSEGQAASSSDNVSDSDSDSDSSDSGSDSGNHSRSRSRSPVGSGSGSSSDSESDG 540
Query: 605 PSNSKEGSDEDVDIMTSDDDKESKHKLQASAQGFSTSPAAWKSPDGGAVQIVDDEKEDGQ 664
PSNS+EGSDEDVDIMTSDDDKESKHKLQAS QGFSTSPAAWKSPDGG VQI+DDEKEDGQ
Sbjct: 541 PSNSQEGSDEDVDIMTSDDDKESKHKLQASVQGFSTSPAAWKSPDGGPVQIIDDEKEDGQ 600
Query: 665 ESDAIDIEKDSSDDEPDAKIDDPSLLPIEECGRPVEEPRSFSPYPDEFQERQNFIGSLFE 724
E DAIDIEKDSSDDEPDAK+D SLLP EE GRPVEEPRSFSPYPDEFQERQNFIGSLFE
Sbjct: 601 EYDAIDIEKDSSDDEPDAKVDGRSLLPTEEVGRPVEEPRSFSPYPDEFQERQNFIGSLFE 660
Query: 725 DRENTVLDGGRHEQSDSTGRISKGKSKRSSDLECLEEKSDHTKRLKSESLAQQPVSGNWG 784
DREN V D RHEQSDSTGRISKGKSKRSSDLECLEEK+DHTKRLKSESLAQQPVSGNWG
Sbjct: 661 DRENNVADSARHEQSDSTGRISKGKSKRSSDLECLEEKADHTKRLKSESLAQQPVSGNWG 720
Query: 785 AQLQSPQSLSPSKLNRDSIRNPTSQVTNKGEVKGNSDYRPKKGNKETVPEKNSSDVSQAG 844
QLQSP++LSPSKLNRDS+RN TSQVTNKGE+KGNSD+RPKKGNKETV EKNSSDV QAG
Sbjct: 721 VQLQSPRNLSPSKLNRDSVRNLTSQVTNKGEIKGNSDFRPKKGNKETVSEKNSSDVPQAG 780
Query: 845 WRPHDQS-GVRAVDTAARAEKHGDIGRGTKHTDKGGHANESFHVFKDTFYGNAENEGTKE 904
WRPHDQS GVRAVDTA RA+KHGDIGRGTKH +K GHANE+FHVFKDTFYGNA+NEGTKE
Sbjct: 781 WRPHDQSGGVRAVDTATRADKHGDIGRGTKHIEKSGHANENFHVFKDTFYGNADNEGTKE 840
Query: 905 KKVSKNSRSGGPGDKQIQSFDSHHGKPGEIVGKFKEGQTFSSSQMGYSPRDNNNRVSANR 964
KKVSKNSRSGGPGDK IQ FDSH KPGEIVGKFK+GQTFSSSQMGYSPRDNNNRVSANR
Sbjct: 841 KKVSKNSRSGGPGDKHIQPFDSHQSKPGEIVGKFKDGQTFSSSQMGYSPRDNNNRVSANR 900
Query: 965 SPVNGKGRILQRELSDLELGELREPFHEEARGRKKFERNNSLKQLENKENTTDIWSSDLS 1024
SPVNGKGRILQRE SDLELGELREPF EEARG+KKFERNNSLKQLENKENTTDIW SDL+
Sbjct: 901 SPVNGKGRILQREPSDLELGELREPFPEEARGKKKFERNNSLKQLENKENTTDIWGSDLN 960
Query: 1025 KGKSNLKASLEYGKRSSPDVSTKFPSNLEGSNKKKNSEHIVEDSTRLNHRSLQSYPQYNS 1084
KGKSNLKASLEYGKRSSP VSTKFPSN EGSNKKKNSEH+VEDS RLN+RSL S+ QYNS
Sbjct: 961 KGKSNLKASLEYGKRSSPHVSTKFPSNPEGSNKKKNSEHMVEDSNRLNNRSLLSHSQYNS 1020
Query: 1085 RVDHVEVDKSVDTNVKPNQGIGPESCGESNRKASVGISQLNDMKREQLPSKKGSKKLAPN 1144
R+DH EVDKSVD NV+PNQG GPE ESNRKASVGISQLND KREQLPSKKGSK+ APN
Sbjct: 1021 RIDHAEVDKSVDGNVRPNQGNGPEGYVESNRKASVGISQLNDTKREQLPSKKGSKRQAPN 1080
Query: 1145 PIAEVTDAQKNPVSAERENSDPKRRDSSSDENSCSYSKYEKDEPELKGAIKDFSQYKEYV 1204
PI EVTD KNP+SAE ENSDPKRRDSSSDENSCSYSKYEKDEPELKGAIKDFSQYKEYV
Sbjct: 1081 PITEVTDGLKNPISAEHENSDPKRRDSSSDENSCSYSKYEKDEPELKGAIKDFSQYKEYV 1140
Query: 1205 QEYHDKYESYLSLNKILESYRAEFCKLGKELDSARGQDSEKYFNVLGQLKESYRLCSTRH 1264
QEYHDKYESYLSLNKILESYR EFCKLGKELDSARGQDSEKYFNVLGQLKESYRLCSTRH
Sbjct: 1141 QEYHDKYESYLSLNKILESYRTEFCKLGKELDSARGQDSEKYFNVLGQLKESYRLCSTRH 1200
Query: 1265 KRLKKIFIVLHEELKHIKERIRDFAQIYAKD 1295
KRLKKIFIVLHEELKHIKERIRDF Q YAKD
Sbjct: 1201 KRLKKIFIVLHEELKHIKERIRDFVQTYAKD 1228
BLAST of CcUC07G141330 vs. NCBI nr
Match:
XP_023524753.1 (dentin sialophosphoprotein-like isoform X2 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 1992.2 bits (5160), Expect = 0.0e+00
Identity = 1071/1235 (86.72%), Postives = 1136/1235 (91.98%), Query Frame = 0
Query: 65 MYGGPSKLGRPGGGAGRGAGGKRPHSSFPLPPSHRPSGRLSLGGGGAGSVANPRNRTTTA 124
MYGGPSKL R GGGAGRGA GKRP SSFPLPP+HRPSGRLSLGGGGAGS ANPRNRT+ A
Sbjct: 1 MYGGPSKLARAGGGAGRGASGKRPPSSFPLPPAHRPSGRLSLGGGGAGSGANPRNRTSAA 60
Query: 125 TTSEAPQSVEENFSLVTGNNPLAFAMIIRLAPDLIDEIKRVEAQGGTPRIKFDANAKNSS 184
SEAPQSVEENFSLVTGNNPLAFAMIIRLAPDLI+EIKRVE+QGGTPRIKFDANAKNSS
Sbjct: 61 AKSEAPQSVEENFSLVTGNNPLAFAMIIRLAPDLIEEIKRVESQGGTPRIKFDANAKNSS 120
Query: 185 GNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLIESGNAWRKLNVQRILDESTT 244
GNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLL+ESGNAWRK+NVQRILDESTT
Sbjct: 121 GNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLVESGNAWRKVNVQRILDESTT 180
Query: 245 NHVKKLSEEAERRSKSRRAIVLEPGNPSMKNQIKQLAAAEANPWR-HFKNKKEPPFKKQK 304
NHVKKLSEEAERRSKSRRAIVLEPGNPSMKNQIKQLAAAE+NPWR H+KNKKEPPFKKQK
Sbjct: 181 NHVKKLSEEAERRSKSRRAIVLEPGNPSMKNQIKQLAAAESNPWRMHYKNKKEPPFKKQK 240
Query: 305 NELSQVGPPKSTYKPGMSSLPASKDRLSSSPIPSPPEQSGAPVSQFGSANSTKTHAFAED 364
NELSQVGPPKST+KPGMSS+PASK+RLSSSP+PSPPEQSGAP+SQFGSAN TKTH AED
Sbjct: 241 NELSQVGPPKSTFKPGMSSVPASKERLSSSPVPSPPEQSGAPISQFGSANPTKTHCTAED 300
Query: 365 IRPRLPAKINAAASSEKEVPTKATKGVLETPGLEGNSGAKPTDLQGMLYNLLLENPKGMS 424
I+PR PAKINAAASSEK++PTKA KGVLE PG E N+GAKPTDLQGMLYNLLL+NPKGMS
Sbjct: 301 IKPRQPAKINAAASSEKDIPTKAAKGVLEAPGQEVNAGAKPTDLQGMLYNLLLDNPKGMS 360
Query: 425 LKALEKAVGDKIPNAVKKIEPIIKKIATYQAPGRYCLKSGVELEGSKKPSSEGESSPLIS 484
LKALEKAVGDKIPN+VKKIEPIIKKIATYQAPGRYCLKS VE+EGSKKPSSEGESSPL+S
Sbjct: 361 LKALEKAVGDKIPNSVKKIEPIIKKIATYQAPGRYCLKSEVEVEGSKKPSSEGESSPLVS 420
Query: 485 HHQTPPVHEGLPDQITAPELQLEARCGIELEEKVETSQANKESNFLEKNCIPQHSPDPFA 544
H QT PVHE DQ PE QLEAR IELEEKVETSQANKESNFLEKN I Q+SPDPFA
Sbjct: 421 HQQT-PVHEDFHDQ-PVPESQLEARHVIELEEKVETSQANKESNFLEKNGIQQNSPDPFA 480
Query: 545 EKKGSENSEGQPASSSDNESDSDSESDSSDSGSDSGNHSRSRSRSPMGSGSGSSSDSESD 604
EKKGSENSEGQ ASSSDNESDSDSESDSSDSGSDSGN SRSRSRSP+GSGSGSSSDSESD
Sbjct: 481 EKKGSENSEGQAASSSDNESDSDSESDSSDSGSDSGNRSRSRSRSPVGSGSGSSSDSESD 540
Query: 605 APSNSKEGSDEDVDIMTSDDDKESKHKLQASAQGFSTSPAAWKSPDGGAVQIVDDEKEDG 664
APSNSKEGSDEDVDIMTSDDDKE K+KLQAS QGFS SPAAWKSPDGGAV +DDEKEDG
Sbjct: 541 APSNSKEGSDEDVDIMTSDDDKEPKNKLQASVQGFSASPAAWKSPDGGAVLNIDDEKEDG 600
Query: 665 QESDAIDIEKDSSDDEPDAKIDDPSLLPIEECGRPVEEPRSFSPYPDEFQERQNFIGSLF 724
ESDAIDIEKDSSDDEP+AKIDD SL P E GRPVEE RS SPYPDEFQERQNFIGSLF
Sbjct: 601 HESDAIDIEKDSSDDEPEAKIDDRSLPPTVEGGRPVEESRSLSPYPDEFQERQNFIGSLF 660
Query: 725 EDRENTVLDGGRHEQSDSTGRISKGKSKRSSDLECLEEKSDHTKRLKSESLAQQPVSGNW 784
EDRENTV+D RHEQSDST R+SKGKSKRSS+LEC EE + HTKRLK ES +QQPVSGNW
Sbjct: 661 EDRENTVVDSARHEQSDSTDRMSKGKSKRSSELECFEENAVHTKRLKLESSSQQPVSGNW 720
Query: 785 GAQLQSPQSLSPSKLNRDSIRNPTSQVTNKGEVKGNSDYRPKKGNKETVPEKNSSDVSQA 844
GAQLQS ++LSPSKLNRDS RNPTSQVTNKGE+KGNSD+RPK GNKE V EKN SDVSQA
Sbjct: 721 GAQLQSSRNLSPSKLNRDSARNPTSQVTNKGELKGNSDFRPKMGNKEIVSEKNCSDVSQA 780
Query: 845 GWRPHDQSGVRAVDTAARAEKHGD-IGRGTKHTDKGGHANESFHVFKDTFYGNAENEGTK 904
WRPHDQSGVRAVDTA R +KHG+ IGRG KH++KGGHANESFH +KD FYGNAENEG
Sbjct: 781 SWRPHDQSGVRAVDTAVRPDKHGESIGRGGKHSEKGGHANESFHAYKDRFYGNAENEGMN 840
Query: 905 EKKVSKNSRSGGPGDKQIQSFDSHHGKPGEIVGKFKEGQTFSSSQMGYSPRDNNNRVSAN 964
EKKVS+NSRSGGPGDKQIQ DSH KPG+IVGKFK+G+TFSSSQMGYSPRDNNNR+SA+
Sbjct: 841 EKKVSRNSRSGGPGDKQIQPCDSHLSKPGDIVGKFKDGKTFSSSQMGYSPRDNNNRISAD 900
Query: 965 RSPVNGKGRILQRELSDLELGELREPFHEEARGRKKFERNNSLKQLENKENTTDIWSSDL 1024
RSPVNGKGRILQRE SDLELGELREPF EEA G+KKFERNNS KQLENKE+T+DIWSS+L
Sbjct: 901 RSPVNGKGRILQREHSDLELGELREPFPEEALGKKKFERNNSSKQLENKEHTSDIWSSEL 960
Query: 1025 SKGKSNLKASLEYGKRSSPDVSTKFPSNLEGSNKKKNSEHIVEDSTRLNHRSLQSY---P 1084
+KGKSNLKASL+ GKRSSP +STKFPSN EGSNKKK SEH VED TR+NHR QS+ P
Sbjct: 961 NKGKSNLKASLDNGKRSSPHISTKFPSNPEGSNKKKISEHKVEDLTRINHRPPQSHPQGP 1020
Query: 1085 QYNSRVDHVEVDKSVDTNVKPNQGIGPESCGESNRKASVGISQLNDMKREQLPSKKGSKK 1144
QY+SRVDHVEV+K VD NVKPNQGIGPESCGESNRKASVGISQL+DMKREQLPSKKGSK+
Sbjct: 1021 QYSSRVDHVEVEKPVDANVKPNQGIGPESCGESNRKASVGISQLHDMKREQLPSKKGSKR 1080
Query: 1145 LAPNPIAEVTDAQKNPVSAERENSDPKRRDSSSDENSCSYSKYEKDEPELKGAIKDFSQY 1204
APN I EVTDA KNP+SAE ENSD KRRDSSSDENSCSYSKYEKDEPELKGAIKDFSQY
Sbjct: 1081 QAPNQITEVTDALKNPISAEHENSDLKRRDSSSDENSCSYSKYEKDEPELKGAIKDFSQY 1140
Query: 1205 KEYVQEYHDKYESYLSLNKILESYRAEFCKLGKELDSARGQDSEKYFNVLGQLKESYRLC 1264
KEYVQEY DKYE YLSLNKILESYRAEFCKLGKELDS+RGQDS+KYFN+LGQLKESYRLC
Sbjct: 1141 KEYVQEYRDKYECYLSLNKILESYRAEFCKLGKELDSSRGQDSDKYFNLLGQLKESYRLC 1200
Query: 1265 STRHKRLKKIFIVLHEELKHIKERIRDFAQIYAKD 1295
STRHKRLKKIFIVLHEELKH+KERI+DFAQ YAKD
Sbjct: 1201 STRHKRLKKIFIVLHEELKHLKERIKDFAQTYAKD 1233
BLAST of CcUC07G141330 vs. NCBI nr
Match:
XP_023524752.1 (dentin sialophosphoprotein-like isoform X1 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 1986.5 bits (5145), Expect = 0.0e+00
Identity = 1071/1239 (86.44%), Postives = 1136/1239 (91.69%), Query Frame = 0
Query: 65 MYGGPSKLGRPGGGAGRGAGGKRPHSSFPLPPSHRPSGRLSLGGGGAGSVANPRNRTTTA 124
MYGGPSKL R GGGAGRGA GKRP SSFPLPP+HRPSGRLSLGGGGAGS ANPRNRT+ A
Sbjct: 1 MYGGPSKLARAGGGAGRGASGKRPPSSFPLPPAHRPSGRLSLGGGGAGSGANPRNRTSAA 60
Query: 125 TTSEAPQSVEENFSLVTGNNPLAFAMIIRLAPDLIDEIKRVEAQGGTPRIKFDANAKNSS 184
SEAPQSVEENFSLVTGNNPLAFAMIIRLAPDLI+EIKRVE+QGGTPRIKFDANAKNSS
Sbjct: 61 AKSEAPQSVEENFSLVTGNNPLAFAMIIRLAPDLIEEIKRVESQGGTPRIKFDANAKNSS 120
Query: 185 GNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLIESGNAWRKLNVQRILDESTT 244
GNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLL+ESGNAWRK+NVQRILDESTT
Sbjct: 121 GNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLVESGNAWRKVNVQRILDESTT 180
Query: 245 NHVKKLSEEAERRSKSRRAIVLEPGNPSMKNQIKQLAAAEANPWR-HFKNKKEPPFKKQK 304
NHVKKLSEEAERRSKSRRAIVLEPGNPSMKNQIKQLAAAE+NPWR H+KNKKEPPFKKQK
Sbjct: 181 NHVKKLSEEAERRSKSRRAIVLEPGNPSMKNQIKQLAAAESNPWRMHYKNKKEPPFKKQK 240
Query: 305 NELSQVGPPKSTYKPGMSSLPASKDRLSSSPIPSPPEQSGAPVSQFGSANSTKTHAFAED 364
NELSQVGPPKST+KPGMSS+PASK+RLSSSP+PSPPEQSGAP+SQFGSAN TKTH AED
Sbjct: 241 NELSQVGPPKSTFKPGMSSVPASKERLSSSPVPSPPEQSGAPISQFGSANPTKTHCTAED 300
Query: 365 IRPRLPAKINAAASSEKEVPTKATKGVLETPGLEGNSGAKPTDLQGMLYNLLLENPKGMS 424
I+PR PAKINAAASSEK++PTKA KGVLE PG E N+GAKPTDLQGMLYNLLL+NPKGMS
Sbjct: 301 IKPRQPAKINAAASSEKDIPTKAAKGVLEAPGQEVNAGAKPTDLQGMLYNLLLDNPKGMS 360
Query: 425 LKALEKAVGDKIPNAVKKIEPIIKKIATYQAPGRYCLKSGVELEGSKKPSSEGESSPLIS 484
LKALEKAVGDKIPN+VKKIEPIIKKIATYQAPGRYCLKS VE+EGSKKPSSEGESSPL+S
Sbjct: 361 LKALEKAVGDKIPNSVKKIEPIIKKIATYQAPGRYCLKSEVEVEGSKKPSSEGESSPLVS 420
Query: 485 HHQTPPVHEGLPDQITAPELQLEARCGIELEEKVETSQANKESNFLEKNCIPQHSPDPFA 544
H QT PVHE DQ PE QLEAR IELEEKVETSQANKESNFLEKN I Q+SPDPFA
Sbjct: 421 HQQT-PVHEDFHDQ-PVPESQLEARHVIELEEKVETSQANKESNFLEKNGIQQNSPDPFA 480
Query: 545 EKKGSENSEGQPASSSDNESDSDSESDSSDSGSDSGNHSRSRSRSPMGSGSGSSSDSESD 604
EKKGSENSEGQ ASSSDNESDSDSESDSSDSGSDSGN SRSRSRSP+GSGSGSSSDSESD
Sbjct: 481 EKKGSENSEGQAASSSDNESDSDSESDSSDSGSDSGNRSRSRSRSPVGSGSGSSSDSESD 540
Query: 605 APSNSKEGSDEDVDIMTSDDDKESKHKLQASAQGFSTSPAAWKSPDGGAVQIVDDEKEDG 664
APSNSKEGSDEDVDIMTSDDDKE K+KLQAS QGFS SPAAWKSPDGGAV +DDEKEDG
Sbjct: 541 APSNSKEGSDEDVDIMTSDDDKEPKNKLQASVQGFSASPAAWKSPDGGAVLNIDDEKEDG 600
Query: 665 QESDAIDIEKDSSDDEPDAKIDDPSLLPIEECGRPVEEPRSFSPYPDEFQERQNFIGSLF 724
ESDAIDIEKDSSDDEP+AKIDD SL P E GRPVEE RS SPYPDEFQERQNFIGSLF
Sbjct: 601 HESDAIDIEKDSSDDEPEAKIDDRSLPPTVEGGRPVEESRSLSPYPDEFQERQNFIGSLF 660
Query: 725 EDRENTVLDGGRHEQSDSTGRISKGKSKRSSDLECLEEKSDHTKRLKSESLAQQPVSGNW 784
EDRENTV+D RHEQSDST R+SKGKSKRSS+LEC EE + HTKRLK ES +QQPVSGNW
Sbjct: 661 EDRENTVVDSARHEQSDSTDRMSKGKSKRSSELECFEENAVHTKRLKLESSSQQPVSGNW 720
Query: 785 GAQLQSPQSLSPSKLNRDSIRNPTSQVTNKGEVKGNSDYRPKKGNKETVPEKNSSDVSQA 844
GAQLQS ++LSPSKLNRDS RNPTSQVTNKGE+KGNSD+RPK GNKE V EKN SDVSQA
Sbjct: 721 GAQLQSSRNLSPSKLNRDSARNPTSQVTNKGELKGNSDFRPKMGNKEIVSEKNCSDVSQA 780
Query: 845 GWRPHDQSGVRAVDTAARAEKHGD-IGRGTKHTDKGGHANESFHVFKDTFYGNAENEGTK 904
WRPHDQSGVRAVDTA R +KHG+ IGRG KH++KGGHANESFH +KD FYGNAENEG
Sbjct: 781 SWRPHDQSGVRAVDTAVRPDKHGESIGRGGKHSEKGGHANESFHAYKDRFYGNAENEGMN 840
Query: 905 EKKVSKNSRSGGPGDKQIQSFDSHHGKPGEIVGKFKEGQTFSSSQMGYSPRDNNNRVSAN 964
EKKVS+NSRSGGPGDKQIQ DSH KPG+IVGKFK+G+TFSSSQMGYSPRDNNNR+SA+
Sbjct: 841 EKKVSRNSRSGGPGDKQIQPCDSHLSKPGDIVGKFKDGKTFSSSQMGYSPRDNNNRISAD 900
Query: 965 RSPVNGKGRILQRELSDLELGELREPFHEEARGRKKFERNNSLKQLENKENTTDIWSSDL 1024
RSPVNGKGRILQRE SDLELGELREPF EEA G+KKFERNNS KQLENKE+T+DIWSS+L
Sbjct: 901 RSPVNGKGRILQREHSDLELGELREPFPEEALGKKKFERNNSSKQLENKEHTSDIWSSEL 960
Query: 1025 SKGKSNLKASLEYGKRSSPDVSTKFPSNLEGSNKKKNSEHIVEDSTRLNHRSLQSY---P 1084
+KGKSNLKASL+ GKRSSP +STKFPSN EGSNKKK SEH VED TR+NHR QS+ P
Sbjct: 961 NKGKSNLKASLDNGKRSSPHISTKFPSNPEGSNKKKISEHKVEDLTRINHRPPQSHPQGP 1020
Query: 1085 QYNSRVDHVEVDKSVDTNVKPNQGIGPESCGESNRKASVGISQLNDMKREQLPSKKGSKK 1144
QY+SRVDHVEV+K VD NVKPNQGIGPESCGESNRKASVGISQL+DMKREQLPSKKGSK+
Sbjct: 1021 QYSSRVDHVEVEKPVDANVKPNQGIGPESCGESNRKASVGISQLHDMKREQLPSKKGSKR 1080
Query: 1145 LAPNPIAEVTDAQKNPVSAERENSDPKRRDSSSDENSCSYSKYEKDEPELKGAIKDFSQY 1204
APN I EVTDA KNP+SAE ENSD KRRDSSSDENSCSYSKYEKDEPELKGAIKDFSQY
Sbjct: 1081 QAPNQITEVTDALKNPISAEHENSDLKRRDSSSDENSCSYSKYEKDEPELKGAIKDFSQY 1140
Query: 1205 KEYVQEYHDKYESYLSLNKILESYRAEFCKLGKELDSARGQDSEKYFNVLGQLKESYRLC 1264
KEYVQEY DKYE YLSLNKILESYRAEFCKLGKELDS+RGQDS+KYFN+LGQLKESYRLC
Sbjct: 1141 KEYVQEYRDKYECYLSLNKILESYRAEFCKLGKELDSSRGQDSDKYFNLLGQLKESYRLC 1200
Query: 1265 ST----RHKRLKKIFIVLHEELKHIKERIRDFAQIYAKD 1295
ST RHKRLKKIFIVLHEELKH+KERI+DFAQ YAKD
Sbjct: 1201 STSNLQRHKRLKKIFIVLHEELKHLKERIKDFAQTYAKD 1237
BLAST of CcUC07G141330 vs. ExPASy TrEMBL
Match:
A0A0A0LCU6 (Occludin_ELL domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_3G849910 PE=4 SV=1)
HSP 1 Score: 2094.7 bits (5426), Expect = 0.0e+00
Identity = 1115/1230 (90.65%), Postives = 1149/1230 (93.41%), Query Frame = 0
Query: 65 MYGGPSKLGRPGGGAGRGAGGKRPHSSFPLPPSHRPSGRLSLGGGGAGSVANPRNRTTTA 124
MYGGPSKLGRPGGGAGRG GKRPHSSFPLPPSHRPSGRLSLGGG AGSV+NPRNRTTTA
Sbjct: 1 MYGGPSKLGRPGGGAGRGHAGKRPHSSFPLPPSHRPSGRLSLGGGAAGSVSNPRNRTTTA 60
Query: 125 TTSEAPQSVEENFSLVTGNNPLAFAMIIRLAPDLIDEIKRVEAQGGTPRIKFDANAKNSS 184
TTSEA QS EENFSLVTGNNPLAF MIIRLAPDLIDEIKRVEAQGGTPRIKFDANA NSS
Sbjct: 61 TTSEASQSAEENFSLVTGNNPLAFGMIIRLAPDLIDEIKRVEAQGGTPRIKFDANANNSS 120
Query: 185 GNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLIESGNAWRKLNVQRILDESTT 244
GNVIDVGGKEFRFTWSRE GD C+IYEERKSGEDGSGLLIESGN WRKLNV R+LDESTT
Sbjct: 121 GNVIDVGGKEFRFTWSRERGDSCEIYEERKSGEDGSGLLIESGNCWRKLNVHRVLDESTT 180
Query: 245 NHVKKLSEEAERRSKSRRAIVLEPGNPSMKNQIKQLAAAEANPWRHFKNKKEPPFKKQKN 304
NHVKKLSEEAER+SKSRRAIVLEPGNPSMKNQIKQLAAAEANPWRHFKNKKEPPFKKQKN
Sbjct: 181 NHVKKLSEEAERKSKSRRAIVLEPGNPSMKNQIKQLAAAEANPWRHFKNKKEPPFKKQKN 240
Query: 305 ELSQVGPPKSTYKPGMSSLPASKDRLSSSPIPSPPEQSGAPVSQFGSANSTKTHAFAEDI 364
ELSQVGPPKSTYKPGM SLPASKDRLSSSPIP PPEQ GAPVSQFGSAN++KTH AEDI
Sbjct: 241 ELSQVGPPKSTYKPGMPSLPASKDRLSSSPIPLPPEQFGAPVSQFGSANTSKTHVIAEDI 300
Query: 365 RPRLPAKINAAASSEKEVPTKATKGVLETPGLEGNSGAKPTDLQGMLYNLLLENPKGMSL 424
RPR+PAKIN AAS+EKE+PT A KGVLETPG EGNSG KPTDLQGMLYNLLLENPKGMSL
Sbjct: 301 RPRVPAKINPAASNEKEIPTIAPKGVLETPGQEGNSGTKPTDLQGMLYNLLLENPKGMSL 360
Query: 425 KALEKAVGDKIPNAVKKIEPIIKKIATYQAPGRYCLKSGVELEGSKKPSSEGESSPLISH 484
KALEKAVGDKIPNAVKKIEPIIKKIATYQAPGRY LKSGV LEGSKKP+SEGESSPLISH
Sbjct: 361 KALEKAVGDKIPNAVKKIEPIIKKIATYQAPGRYLLKSGVGLEGSKKPTSEGESSPLISH 420
Query: 485 HQTPPVHEGLPDQITAPELQLEARCGIELEEKVETSQANKESNFLEKNCIPQHSPDPFAE 544
HQT VHE LPDQ APELQLEARCG++LEEKVETSQANKESNFLE N I Q PDPFAE
Sbjct: 421 HQT-SVHEDLPDQTNAPELQLEARCGMDLEEKVETSQANKESNFLETNGIQQ--PDPFAE 480
Query: 545 KKGSENSEGQPASSSDNESDSDSESDSSDSGSDSGNHSRSRSRSPMGSGSGSSSDSESDA 604
KK SENSEGQ ASSSDNESDSDS+SDSSDSGSDSGNHSRSRSRSP+GSGSGSSSDSESD
Sbjct: 481 KKSSENSEGQAASSSDNESDSDSDSDSSDSGSDSGNHSRSRSRSPVGSGSGSSSDSESDG 540
Query: 605 PSNSKEGSDEDVDIMTSDDDKESKHKLQASAQGFSTSPAAWKSPDGGAVQIVDDEKEDGQ 664
PSNS+EGSD DVDIMTSDDDKESK KLQAS QGFSTSPAAWKSPDGG VQI+DDEKEDGQ
Sbjct: 541 PSNSQEGSDVDVDIMTSDDDKESKQKLQASVQGFSTSPAAWKSPDGGPVQIIDDEKEDGQ 600
Query: 665 ESDAIDIEKDSSDDEPDAKIDDPSLLPIEECGRPVEEPRSFSPYPDEFQERQNFIGSLFE 724
E DAIDIEKDSSDDEPDAKID SLLP EE RPVEEPRSFSPYPDEFQERQNFIGSLFE
Sbjct: 601 EYDAIDIEKDSSDDEPDAKIDGRSLLPTEEGVRPVEEPRSFSPYPDEFQERQNFIGSLFE 660
Query: 725 DRENTVLDGGRHEQSDSTGRISKGKSKRSSDLECLEEKSDHTKRLKSESLAQQPVSGNWG 784
DREN V+D RHEQSDSTGRISKGKSKRSSDLECLEEKSDHTKRLKSESLAQQPVSGNWG
Sbjct: 661 DRENNVVDSARHEQSDSTGRISKGKSKRSSDLECLEEKSDHTKRLKSESLAQQPVSGNWG 720
Query: 785 AQLQSPQSLSPSKLNRDSIRNPTSQVTNKGEVKGNSDYRPKKGNKETVPEKNSSDVSQAG 844
QLQSP++LSPSKLNRDS+RN TSQVTNKGE+KGNSD+RPKKGNKETV EKNSSDVSQAG
Sbjct: 721 VQLQSPRNLSPSKLNRDSVRNLTSQVTNKGEIKGNSDFRPKKGNKETVSEKNSSDVSQAG 780
Query: 845 WRPHDQSGVRAVDTAARAEKHGDIGRGTKHTDKGGHANESFHVFKDTFYGNAENEGTKEK 904
WRPHDQSGVRAVDTA RA+KHGDIGRGTKHT+K GHANE+FHVFKDTFYGN +NEGTKEK
Sbjct: 781 WRPHDQSGVRAVDTATRADKHGDIGRGTKHTEKSGHANENFHVFKDTFYGNPDNEGTKEK 840
Query: 905 KVSKNSRSGGPGDKQIQSFDSHHGKPGEIVGKFKEGQTFSSSQMGYSPRDNNNRVSANRS 964
KVSKNSRSGGPGDKQIQ DSHH KPGEIVGKFK+GQTFSSSQMGYSPRDNNNRVSANRS
Sbjct: 841 KVSKNSRSGGPGDKQIQPLDSHHSKPGEIVGKFKDGQTFSSSQMGYSPRDNNNRVSANRS 900
Query: 965 PVNGKGRILQRELSDLELGELREPFHEEARGRKKFERNNSLKQLENKENTTDIWSSDLSK 1024
PVNGKGRILQRE SDLELGELREPFHEEARG+KKFERNNSLKQLENKENTTDIW SDL+K
Sbjct: 901 PVNGKGRILQREPSDLELGELREPFHEEARGKKKFERNNSLKQLENKENTTDIWGSDLNK 960
Query: 1025 GKSNLKASLEYGKRSSPDVSTKFPSNLEGSNKKKNSEHIVEDSTRLNHRSLQSYPQYNSR 1084
GKSNLKASLEYGKRSSP VSTKFPSN EGSNKKKNSEHIVEDS R+N+RSL S+ QYNSR
Sbjct: 961 GKSNLKASLEYGKRSSPHVSTKFPSNPEGSNKKKNSEHIVEDSNRINNRSLLSHSQYNSR 1020
Query: 1085 VDHVEVDKSVDTNVKPNQGIGPESCGESNRKASVGISQLNDMKREQLPSKKGSKKLAPNP 1144
+DH EVDKS D NVKPNQG GPE ESNRKASVGISQLND KREQ PSKKGSK+ APNP
Sbjct: 1021 IDHAEVDKSADGNVKPNQGNGPEGYVESNRKASVGISQLNDTKREQPPSKKGSKRQAPNP 1080
Query: 1145 IAEVTDAQKNPVSAERENSDPKRRDSSSDENSCSYSKYEKDEPELKGAIKDFSQYKEYVQ 1204
I EVTD KNPVSAERENSDPKRRDSSSDENSCSYSKYEKDEPELKGAIKDFSQYKEYVQ
Sbjct: 1081 ITEVTDGLKNPVSAERENSDPKRRDSSSDENSCSYSKYEKDEPELKGAIKDFSQYKEYVQ 1140
Query: 1205 EYHDKYESYLSLNKILESYRAEFCKLGKELDSARGQDSEKYFNVLGQLKESYRLCSTRHK 1264
EYHDKYESYLSLNKILESYR EFCKLGKELDSARGQDSEKYFNVLGQLKESYRLCSTRHK
Sbjct: 1141 EYHDKYESYLSLNKILESYRTEFCKLGKELDSARGQDSEKYFNVLGQLKESYRLCSTRHK 1200
Query: 1265 RLKKIFIVLHEELKHIKERIRDFAQIYAKD 1295
RLKKIFIVLHEELKHIKERIRDF Q YAKD
Sbjct: 1201 RLKKIFIVLHEELKHIKERIRDFVQTYAKD 1227
BLAST of CcUC07G141330 vs. ExPASy TrEMBL
Match:
A0A1S3BIQ1 (dentin sialophosphoprotein isoform X1 OS=Cucumis melo OX=3656 GN=LOC103490006 PE=4 SV=1)
HSP 1 Score: 2090.8 bits (5416), Expect = 0.0e+00
Identity = 1113/1231 (90.41%), Postives = 1149/1231 (93.34%), Query Frame = 0
Query: 65 MYGGPSKLGRPGGGAGRGAGGKRPHSSFPLPPSHRPSGRLSLGGGGAGSVANPRNRTTTA 124
MYGGPSKLGRPGGGAGRG GGKRPHSSFPLPPSHRPSGRLSLGGG AGS +NPRNRTTTA
Sbjct: 1 MYGGPSKLGRPGGGAGRGHGGKRPHSSFPLPPSHRPSGRLSLGGGAAGSASNPRNRTTTA 60
Query: 125 TTSEAPQSVEENFSLVTGNNPLAFAMIIRLAPDLIDEIKRVEAQGGTPRIKFDANAKNSS 184
TTSEA QS EENFSLVTGNNPLAF MIIRLAPDLIDEIKRVEAQGGTPRIKFDA A NSS
Sbjct: 61 TTSEASQSTEENFSLVTGNNPLAFGMIIRLAPDLIDEIKRVEAQGGTPRIKFDAIANNSS 120
Query: 185 GNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLIESGNAWRKLNVQRILDESTT 244
GNVIDVGGKEFRFTWSRE GD C+IYEERKSGEDGSGLLIESGN WRKLNV R+LDESTT
Sbjct: 121 GNVIDVGGKEFRFTWSRERGDSCEIYEERKSGEDGSGLLIESGNCWRKLNVHRVLDESTT 180
Query: 245 NHVKKLSEEAERRSKSRRAIVLEPGNPSMKNQIKQLAAAEANPWRHFKNKKEPPFKKQKN 304
NHVKKLSEEAER+SKSRRAIVLEPGNPSMKNQIKQLAAAEANPWRHFKNKKEPPFKKQKN
Sbjct: 181 NHVKKLSEEAERKSKSRRAIVLEPGNPSMKNQIKQLAAAEANPWRHFKNKKEPPFKKQKN 240
Query: 305 ELSQVGPPKSTYKPGMSSLPASKDRLSSSPIPSPPEQSGAPVSQFGSANSTKTHAFAEDI 364
ELSQVGPPKSTYKPGM SLPASKDRLSSSPIP PPEQ G PVSQFGSAN+ KTH AEDI
Sbjct: 241 ELSQVGPPKSTYKPGMPSLPASKDRLSSSPIPLPPEQFGGPVSQFGSANTNKTHVIAEDI 300
Query: 365 RPRLPAKINAAASSEKEVPTKATKGVLETPGLEGNSGAKPTDLQGMLYNLLLENPKGMSL 424
RPR+PAKIN AAS+EKE+ T A KGVLETPG EGNSGAKPTDLQGMLYNLLLENPKGMSL
Sbjct: 301 RPRVPAKINPAASNEKEILTIAPKGVLETPGQEGNSGAKPTDLQGMLYNLLLENPKGMSL 360
Query: 425 KALEKAVGDKIPNAVKKIEPIIKKIATYQAPGRYCLKSGVELEGSKKPSSEGESSPLISH 484
KALEKAVGDKIPNAVKKIEPIIKKIATYQAPGRYCLKSGVELEGSKKP+SEGESSPL+SH
Sbjct: 361 KALEKAVGDKIPNAVKKIEPIIKKIATYQAPGRYCLKSGVELEGSKKPTSEGESSPLVSH 420
Query: 485 HQTPPVHEGLPDQITAPELQLEARCGIELEEKVETSQANKESNFLEKNCIPQHSPDPFAE 544
HQT VHE LPDQI APELQLEA CGI+LEEKVETSQANKESNFLEKN I Q PDPFAE
Sbjct: 421 HQT-SVHEDLPDQINAPELQLEAGCGIDLEEKVETSQANKESNFLEKNGIQQ--PDPFAE 480
Query: 545 KKGSENSEGQPASSSDNESDSDSESDSSDSGSDSGNHSRSRSRSPMGSGSGSSSDSESDA 604
KKGSENSEGQ ASSSDN SDSDS+SDSSDSGSDSGNHSRSRSRSP+GSGSGSSSDSESD
Sbjct: 481 KKGSENSEGQAASSSDNVSDSDSDSDSSDSGSDSGNHSRSRSRSPVGSGSGSSSDSESDG 540
Query: 605 PSNSKEGSDEDVDIMTSDDDKESKHKLQASAQGFSTSPAAWKSPDGGAVQIVDDEKEDGQ 664
PSNS+EGSDEDVDIMTSDDDKESKHKLQAS QGFSTSPAAWKSPDGG VQI+DDEKEDGQ
Sbjct: 541 PSNSQEGSDEDVDIMTSDDDKESKHKLQASVQGFSTSPAAWKSPDGGPVQIIDDEKEDGQ 600
Query: 665 ESDAIDIEKDSSDDEPDAKIDDPSLLPIEECGRPVEEPRSFSPYPDEFQERQNFIGSLFE 724
E DAIDIEKDSSDDEPDAK+D SLLP EE GRPVEEPRSFSPYPDEFQERQNFIGSLFE
Sbjct: 601 EYDAIDIEKDSSDDEPDAKVDGRSLLPTEEVGRPVEEPRSFSPYPDEFQERQNFIGSLFE 660
Query: 725 DRENTVLDGGRHEQSDSTGRISKGKSKRSSDLECLEEKSDHTKRLKSESLAQQPVSGNWG 784
DREN V D RHEQSDSTGRISKGKSKRSSDLECLEEK+DHTKRLKSESLAQQPVSGNWG
Sbjct: 661 DRENNVADSARHEQSDSTGRISKGKSKRSSDLECLEEKADHTKRLKSESLAQQPVSGNWG 720
Query: 785 AQLQSPQSLSPSKLNRDSIRNPTSQVTNKGEVKGNSDYRPKKGNKETVPEKNSSDVSQAG 844
QLQSP++LSPSKLNRDS+RN TSQVTNKGE+KGNSD+RPKKGNKETV EKNSSDV QAG
Sbjct: 721 VQLQSPRNLSPSKLNRDSVRNLTSQVTNKGEIKGNSDFRPKKGNKETVSEKNSSDVPQAG 780
Query: 845 WRPHDQS-GVRAVDTAARAEKHGDIGRGTKHTDKGGHANESFHVFKDTFYGNAENEGTKE 904
WRPHDQS GVRAVDTA RA+KHGDIGRGTKH +K GHANE+FHVFKDTFYGNA+NEGTKE
Sbjct: 781 WRPHDQSGGVRAVDTATRADKHGDIGRGTKHIEKSGHANENFHVFKDTFYGNADNEGTKE 840
Query: 905 KKVSKNSRSGGPGDKQIQSFDSHHGKPGEIVGKFKEGQTFSSSQMGYSPRDNNNRVSANR 964
KKVSKNSRSGGPGDK IQ FDSH KPGEIVGKFK+GQTFSSSQMGYSPRDNNNRVSANR
Sbjct: 841 KKVSKNSRSGGPGDKHIQPFDSHQSKPGEIVGKFKDGQTFSSSQMGYSPRDNNNRVSANR 900
Query: 965 SPVNGKGRILQRELSDLELGELREPFHEEARGRKKFERNNSLKQLENKENTTDIWSSDLS 1024
SPVNGKGRILQRE SDLELGELREPF EEARG+KKFERNNSLKQLENKENTTDIW SDL+
Sbjct: 901 SPVNGKGRILQREPSDLELGELREPFPEEARGKKKFERNNSLKQLENKENTTDIWGSDLN 960
Query: 1025 KGKSNLKASLEYGKRSSPDVSTKFPSNLEGSNKKKNSEHIVEDSTRLNHRSLQSYPQYNS 1084
KGKSNLKASLEYGKRSSP VSTKFPSN EGSNKKKNSEH+VEDS RLN+RSL S+ QYNS
Sbjct: 961 KGKSNLKASLEYGKRSSPHVSTKFPSNPEGSNKKKNSEHMVEDSNRLNNRSLLSHSQYNS 1020
Query: 1085 RVDHVEVDKSVDTNVKPNQGIGPESCGESNRKASVGISQLNDMKREQLPSKKGSKKLAPN 1144
R+DH EVDKSVD NV+PNQG GPE ESNRKASVGISQLND KREQLPSKKGSK+ APN
Sbjct: 1021 RIDHAEVDKSVDGNVRPNQGNGPEGYVESNRKASVGISQLNDTKREQLPSKKGSKRQAPN 1080
Query: 1145 PIAEVTDAQKNPVSAERENSDPKRRDSSSDENSCSYSKYEKDEPELKGAIKDFSQYKEYV 1204
PI EVTD KNP+SAE ENSDPKRRDSSSDENSCSYSKYEKDEPELKGAIKDFSQYKEYV
Sbjct: 1081 PITEVTDGLKNPISAEHENSDPKRRDSSSDENSCSYSKYEKDEPELKGAIKDFSQYKEYV 1140
Query: 1205 QEYHDKYESYLSLNKILESYRAEFCKLGKELDSARGQDSEKYFNVLGQLKESYRLCSTRH 1264
QEYHDKYESYLSLNKILESYR EFCKLGKELDSARGQDSEKYFNVLGQLKESYRLCSTRH
Sbjct: 1141 QEYHDKYESYLSLNKILESYRTEFCKLGKELDSARGQDSEKYFNVLGQLKESYRLCSTRH 1200
Query: 1265 KRLKKIFIVLHEELKHIKERIRDFAQIYAKD 1295
KRLKKIFIVLHEELKHIKERIRDF Q YAKD
Sbjct: 1201 KRLKKIFIVLHEELKHIKERIRDFVQTYAKD 1228
BLAST of CcUC07G141330 vs. ExPASy TrEMBL
Match:
A0A6J1KCU5 (dentin sialophosphoprotein isoform X3 OS=Cucurbita maxima OX=3661 GN=LOC111492713 PE=4 SV=1)
HSP 1 Score: 1979.9 bits (5128), Expect = 0.0e+00
Identity = 1067/1235 (86.40%), Postives = 1131/1235 (91.58%), Query Frame = 0
Query: 65 MYGGPSKLGRPGGGAGRGAGGKRPHSSFPLPPSHRPSGRLSLGGGGAGSVANPRNRTTTA 124
MYGGPSKL R GGGAGRGA GKRP SSFPLPP+HRPSGRLSLGGGGAGS ANPRNRT+TA
Sbjct: 1 MYGGPSKLARAGGGAGRGASGKRPPSSFPLPPAHRPSGRLSLGGGGAGSGANPRNRTSTA 60
Query: 125 TTSEAPQSVEENFSLVTGNNPLAFAMIIRLAPDLIDEIKRVEAQGGTPRIKFDANAKNSS 184
SEAPQSVEENFSLVTGNNPLAFAMIIRLAPDLI+EIKRVE+QGGTPRIKFDANAKNSS
Sbjct: 61 AKSEAPQSVEENFSLVTGNNPLAFAMIIRLAPDLIEEIKRVESQGGTPRIKFDANAKNSS 120
Query: 185 GNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLIESGNAWRKLNVQRILDESTT 244
GNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLL+ESGNAWRK+NVQRILDESTT
Sbjct: 121 GNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLVESGNAWRKVNVQRILDESTT 180
Query: 245 NHVKKLSEEAERRSKSRRAIVLEPGNPSMKNQIKQLAAAEANPWR-HFKNKKEPPFKKQK 304
NHVKKLSEEAERRSKSRRAIVLEPGN SMKNQIKQLAAAE+NPWR H+KNKKEPPFKKQK
Sbjct: 181 NHVKKLSEEAERRSKSRRAIVLEPGNLSMKNQIKQLAAAESNPWRMHYKNKKEPPFKKQK 240
Query: 305 NELSQVGPPKSTYKPGMSSLPASKDRLSSSPIPSPPEQSGAPVSQFGSANSTKTHAFAED 364
NELSQVGPPKST+KPGMSS+PASK+RLSSSP+PSPPEQ GAP+SQFGSAN TKTH AED
Sbjct: 241 NELSQVGPPKSTFKPGMSSVPASKERLSSSPVPSPPEQFGAPISQFGSANPTKTHCIAED 300
Query: 365 IRPRLPAKINAAASSEKEVPTKATKGVLETPGLEGNSGAKPTDLQGMLYNLLLENPKGMS 424
I+PR PAKINAAASSEKE+PTKA KGVLE PG E N+GAKPTDLQGMLYNLLL+NPKGMS
Sbjct: 301 IKPRQPAKINAAASSEKEIPTKAAKGVLEAPGQEVNAGAKPTDLQGMLYNLLLDNPKGMS 360
Query: 425 LKALEKAVGDKIPNAVKKIEPIIKKIATYQAPGRYCLKSGVELEGSKKPSSEGESSPLIS 484
LKALEKAVGDKIPN+VKKIEPIIKKIATYQAPGRYCLKS VELEGSKKPSSEGESSPL+S
Sbjct: 361 LKALEKAVGDKIPNSVKKIEPIIKKIATYQAPGRYCLKSEVELEGSKKPSSEGESSPLVS 420
Query: 485 HHQTPPVHEGLPDQITAPELQLEARCGIELEEKVETSQANKESNFLEKNCIPQHSPDPFA 544
H QT PVHE DQ PE QLEAR IELEEKVETSQANKESNFLEKN I Q+SPDPFA
Sbjct: 421 HQQT-PVHEDFHDQ-PVPESQLEARHVIELEEKVETSQANKESNFLEKNGIQQNSPDPFA 480
Query: 545 EKKGSENSEGQPASSSDNESDSDSESDSSDSGSDSGNHSRSRSRSPMGSGSGSSSDSESD 604
EKKGSENSEG+ A+SSDNESDSDSESDSSDSGSDSGN SRSRSRSP+GSGSGSSSDSESD
Sbjct: 481 EKKGSENSEGEAATSSDNESDSDSESDSSDSGSDSGNRSRSRSRSPVGSGSGSSSDSESD 540
Query: 605 APSNSKEGSDEDVDIMTSDDDKESKHKLQASAQGFSTSPAAWKSPDGGAVQIVDDEKEDG 664
APSNSKEGSDEDVDIMTSDDDKE KHKLQAS QGFS SPAAWKSPDGGAV +DDEKEDG
Sbjct: 541 APSNSKEGSDEDVDIMTSDDDKEPKHKLQASVQGFSASPAAWKSPDGGAVLNIDDEKEDG 600
Query: 665 QESDAIDIEKDSSDDEPDAKIDDPSLLPIEECGRPVEEPRSFSPYPDEFQERQNFIGSLF 724
ESDAIDIEKDSSDDEP+AKIDD SL P E GR VEE RS SPYPDEFQERQNFIGSLF
Sbjct: 601 HESDAIDIEKDSSDDEPEAKIDDRSLPPTGEGGRTVEESRSLSPYPDEFQERQNFIGSLF 660
Query: 725 EDRENTVLDGGRHEQSDSTGRISKGKSKRSSDLECLEEKSDHTKRLKSESLAQQPVSGNW 784
EDRENTV+D GRHEQSDST RISKGKSKRSS+LEC EE + HTKRLK ES +QQPVSGNW
Sbjct: 661 EDRENTVVDSGRHEQSDSTDRISKGKSKRSSELECFEENAVHTKRLKLESSSQQPVSGNW 720
Query: 785 GAQLQSPQSLSPSKLNRDSIRNPTSQVTNKGEVKGNSDYRPKKGNKETVPEKNSSDVSQA 844
G QLQS ++LSPSKLNRDS RNPT+QVTNKGE+KGNSD+RPK GNKE V EKN SDVSQA
Sbjct: 721 GVQLQSSRNLSPSKLNRDSARNPTNQVTNKGELKGNSDFRPKMGNKEIVSEKNCSDVSQA 780
Query: 845 GWRPHDQSGVRAVDTAARAEKHGD-IGRGTKHTDKGGHANESFHVFKDTFYGNAENEGTK 904
WRPHDQSGVRAVDTA R +KHG+ IGRG KH++KGGHANESFH +KD FYGNAENE
Sbjct: 781 SWRPHDQSGVRAVDTAVRPDKHGESIGRGGKHSEKGGHANESFHAYKDRFYGNAENEEMN 840
Query: 905 EKKVSKNSRSGGPGDKQIQSFDSHHGKPGEIVGKFKEGQTFSSSQMGYSPRDNNNRVSAN 964
EKKVS+NSRSGGPGDKQIQ DSH KPG+IVGKFK+G+TF SSQMGYSPRDNNNR+SA+
Sbjct: 841 EKKVSRNSRSGGPGDKQIQPSDSHLSKPGDIVGKFKDGKTFPSSQMGYSPRDNNNRISAD 900
Query: 965 RSPVNGKGRILQRELSDLELGELREPFHEEARGRKKFERNNSLKQLENKENTTDIWSSDL 1024
RSPVNGKGRILQRE SDLELGELREPF EEA G+KKFERNNS KQLENKE+T+DIWSS+L
Sbjct: 901 RSPVNGKGRILQREHSDLELGELREPFPEEALGKKKFERNNSSKQLENKEHTSDIWSSEL 960
Query: 1025 SKGKSNLKASLEYGKRSSPDVSTKFPSNLEGSNKKKNSEHIVEDSTRLNHRSLQSYP--- 1084
+KGKSNLKASL+ GKRSSP +STKFPSN EGSNKKK SEH VED TR+NHR QS+P
Sbjct: 961 NKGKSNLKASLDNGKRSSPHISTKFPSNPEGSNKKKISEHKVEDLTRINHRPPQSHPQGS 1020
Query: 1085 QYNSRVDHVEVDKSVDTNVKPNQGIGPESCGESNRKASVGISQLNDMKREQLPSKKGSKK 1144
QY+SRVDHVEV+K VD NVK NQGIGPESCGESNRKASVG+SQL+DMKREQLPSKKGSK+
Sbjct: 1021 QYSSRVDHVEVEKPVDANVKLNQGIGPESCGESNRKASVGMSQLHDMKREQLPSKKGSKR 1080
Query: 1145 LAPNPIAEVTDAQKNPVSAERENSDPKRRDSSSDENSCSYSKYEKDEPELKGAIKDFSQY 1204
APN I EVTDA KNP+SAE ENSD KRRDSSSDENSCSYSKYEKDEPELKGAIKDFSQY
Sbjct: 1081 QAPNQIIEVTDALKNPISAEHENSDLKRRDSSSDENSCSYSKYEKDEPELKGAIKDFSQY 1140
Query: 1205 KEYVQEYHDKYESYLSLNKILESYRAEFCKLGKELDSARGQDSEKYFNVLGQLKESYRLC 1264
KEYVQEY DKYE YLSLNKILESYRAEFCKLGKELDSARGQDS+KYFN+LGQLKESYRLC
Sbjct: 1141 KEYVQEYRDKYECYLSLNKILESYRAEFCKLGKELDSARGQDSDKYFNLLGQLKESYRLC 1200
Query: 1265 STRHKRLKKIFIVLHEELKHIKERIRDFAQIYAKD 1295
STRHKRLKKIFIVLHEELKH+KERI+DFAQ YAKD
Sbjct: 1201 STRHKRLKKIFIVLHEELKHLKERIKDFAQTYAKD 1233
BLAST of CcUC07G141330 vs. ExPASy TrEMBL
Match:
A0A6J1GAK2 (dentin sialophosphoprotein isoform X3 OS=Cucurbita moschata OX=3662 GN=LOC111452430 PE=4 SV=1)
HSP 1 Score: 1974.9 bits (5115), Expect = 0.0e+00
Identity = 1064/1235 (86.15%), Postives = 1130/1235 (91.50%), Query Frame = 0
Query: 65 MYGGPSKLGRPGGGAGRGAGGKRPHSSFPLPPSHRPSGRLSLGGGGAGSVANPRNRTTTA 124
MYGGPSKL R GGGAGRGA GKRP SSFPLPP+HRPSGRLSLGGGGAGS ANPRNRT+TA
Sbjct: 1 MYGGPSKLARAGGGAGRGASGKRPPSSFPLPPAHRPSGRLSLGGGGAGSGANPRNRTSTA 60
Query: 125 TTSEAPQSVEENFSLVTGNNPLAFAMIIRLAPDLIDEIKRVEAQGGTPRIKFDANAKNSS 184
SEAP SVEENFSLVTGNNPLAFAMIIRLAPDLI+EIKRVE+ GGTPRIKFDANAKNSS
Sbjct: 61 AKSEAPLSVEENFSLVTGNNPLAFAMIIRLAPDLIEEIKRVESLGGTPRIKFDANAKNSS 120
Query: 185 GNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLIESGNAWRKLNVQRILDESTT 244
GNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLL+ESGNAWRK+NVQRILDESTT
Sbjct: 121 GNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLVESGNAWRKVNVQRILDESTT 180
Query: 245 NHVKKLSEEAERRSKSRRAIVLEPGNPSMKNQIKQLAAAEANPWR-HFKNKKEPPFKKQK 304
NHVKKLSEEAERRSKSRRAIVLEPGNPSMKNQIKQLAAAE+NPWR H+KNKKEPPFKKQK
Sbjct: 181 NHVKKLSEEAERRSKSRRAIVLEPGNPSMKNQIKQLAAAESNPWRMHYKNKKEPPFKKQK 240
Query: 305 NELSQVGPPKSTYKPGMSSLPASKDRLSSSPIPSPPEQSGAPVSQFGSANSTKTHAFAED 364
NELSQVGPPKST+KPGMSS+PASK+RLSSSP+PSPPEQSGAP+SQFGSAN TKTH AED
Sbjct: 241 NELSQVGPPKSTFKPGMSSVPASKERLSSSPVPSPPEQSGAPISQFGSANPTKTHCIAED 300
Query: 365 IRPRLPAKINAAASSEKEVPTKATKGVLETPGLEGNSGAKPTDLQGMLYNLLLENPKGMS 424
I+PR PAKINAAASSEKE+PTKA KGVLE PG E N+GAKPTDLQGMLYNLLL+NPKGMS
Sbjct: 301 IKPRQPAKINAAASSEKEIPTKAAKGVLEAPGQEVNAGAKPTDLQGMLYNLLLDNPKGMS 360
Query: 425 LKALEKAVGDKIPNAVKKIEPIIKKIATYQAPGRYCLKSGVELEGSKKPSSEGESSPLIS 484
LKALEKAVGDKIPN+VKKIEPIIKKIATYQAPGRYCLKS VELEGSKKPSSEGESSPL+S
Sbjct: 361 LKALEKAVGDKIPNSVKKIEPIIKKIATYQAPGRYCLKSEVELEGSKKPSSEGESSPLVS 420
Query: 485 HHQTPPVHEGLPDQITAPELQLEARCGIELEEKVETSQANKESNFLEKNCIPQHSPDPFA 544
H QT PVHE DQ PE QLEAR IELEEKVETSQANKESNFLEKN I Q+SPDPFA
Sbjct: 421 HQQT-PVHEDFHDQ-PVPESQLEARHVIELEEKVETSQANKESNFLEKNGIQQNSPDPFA 480
Query: 545 EKKGSENSEGQPASSSDNESDSDSESDSSDSGSDSGNHSRSRSRSPMGSGSGSSSDSESD 604
EKKGSENSEGQ ASSSDNESDSDSESDSSDSGSDSGN SRSRSRSP+GSGSGSSSDSESD
Sbjct: 481 EKKGSENSEGQAASSSDNESDSDSESDSSDSGSDSGNRSRSRSRSPVGSGSGSSSDSESD 540
Query: 605 APSNSKEGSDEDVDIMTSDDDKESKHKLQASAQGFSTSPAAWKSPDGGAVQIVDDEKEDG 664
APSNSKEGSDEDVDIMTSDDDKE K+KLQAS QGFS SPAAWKSPDGGAV +DDEKEDG
Sbjct: 541 APSNSKEGSDEDVDIMTSDDDKEPKNKLQASVQGFSASPAAWKSPDGGAVLNIDDEKEDG 600
Query: 665 QESDAIDIEKDSSDDEPDAKIDDPSLLPIEECGRPVEEPRSFSPYPDEFQERQNFIGSLF 724
ESDAIDIEKDSSDDEP+AKIDD SL P E GRPVEE RS SPYPDEFQERQNFIGSLF
Sbjct: 601 HESDAIDIEKDSSDDEPEAKIDDRSLPPTGEGGRPVEESRSLSPYPDEFQERQNFIGSLF 660
Query: 725 EDRENTVLDGGRHEQSDSTGRISKGKSKRSSDLECLEEKSDHTKRLKSESLAQQPVSGNW 784
EDRENTV++ RHEQSDST RISKGKSKRSS+LEC EE + HTKRLK ES +QQPVSGNW
Sbjct: 661 EDRENTVVESSRHEQSDSTDRISKGKSKRSSELECFEENAVHTKRLKLESSSQQPVSGNW 720
Query: 785 GAQLQSPQSLSPSKLNRDSIRNPTSQVTNKGEVKGNSDYRPKKGNKETVPEKNSSDVSQA 844
GAQLQS ++LSPSKLNRDS RNPTSQVTNKGE+KGNSD+RPK GNKE V EKN SDVSQA
Sbjct: 721 GAQLQSSRNLSPSKLNRDSARNPTSQVTNKGELKGNSDFRPKMGNKEIVSEKNCSDVSQA 780
Query: 845 GWRPHDQSGVRAVDTAARAEKHGD-IGRGTKHTDKGGHANESFHVFKDTFYGNAENEGTK 904
WRPHDQSGVRAVDTA R +KHG+ IGRG KH++KGGHANESFH +KD FYGN ENEG
Sbjct: 781 SWRPHDQSGVRAVDTAVRPDKHGESIGRGGKHSEKGGHANESFHAYKDRFYGNVENEGMN 840
Query: 905 EKKVSKNSRSGGPGDKQIQSFDSHHGKPGEIVGKFKEGQTFSSSQMGYSPRDNNNRVSAN 964
EKKVS+NSRSGGPGDKQIQ DSH KPG+IVGKFK+G+TFSSSQMGYSPRDNNNR+SA+
Sbjct: 841 EKKVSRNSRSGGPGDKQIQPCDSHLSKPGDIVGKFKDGKTFSSSQMGYSPRDNNNRISAD 900
Query: 965 RSPVNGKGRILQRELSDLELGELREPFHEEARGRKKFERNNSLKQLENKENTTDIWSSDL 1024
RSPVNGKGRILQRE SDLELGELREPF EE G+KKFERNNS KQLENK +T+DIWSS+L
Sbjct: 901 RSPVNGKGRILQREHSDLELGELREPFPEEVLGKKKFERNNSSKQLENKGHTSDIWSSEL 960
Query: 1025 SKGKSNLKASLEYGKRSSPDVSTKFPSNLEGSNKKKNSEHIVEDSTRLNHRSLQSY---P 1084
+KGKSNLKASL+ GKRSSP +STKFPSN E SN+KK SEH VED TR+NHR QS+ P
Sbjct: 961 NKGKSNLKASLDNGKRSSPHISTKFPSNPEVSNQKKISEHKVEDLTRVNHRPPQSHPQGP 1020
Query: 1085 QYNSRVDHVEVDKSVDTNVKPNQGIGPESCGESNRKASVGISQLNDMKREQLPSKKGSKK 1144
QY+SRVDHVEV+K VD NVKPNQGIGPESCGESNRKASVGISQL+DMKREQLPSKKGSK+
Sbjct: 1021 QYSSRVDHVEVEKPVDANVKPNQGIGPESCGESNRKASVGISQLHDMKREQLPSKKGSKR 1080
Query: 1145 LAPNPIAEVTDAQKNPVSAERENSDPKRRDSSSDENSCSYSKYEKDEPELKGAIKDFSQY 1204
APN I EVTDA KNP+SAE ENSD KRRDSSSDENSCSYSKYEKDEPELKGAIKDFSQY
Sbjct: 1081 QAPNQITEVTDALKNPISAEHENSDLKRRDSSSDENSCSYSKYEKDEPELKGAIKDFSQY 1140
Query: 1205 KEYVQEYHDKYESYLSLNKILESYRAEFCKLGKELDSARGQDSEKYFNVLGQLKESYRLC 1264
KEYVQEY DKYE YLSLNKILESYRAEFCKLGKELDS+RGQ+S+KYFN+L QLKESYRLC
Sbjct: 1141 KEYVQEYRDKYECYLSLNKILESYRAEFCKLGKELDSSRGQNSDKYFNLLEQLKESYRLC 1200
Query: 1265 STRHKRLKKIFIVLHEELKHIKERIRDFAQIYAKD 1295
STRHKRLKKIF+VLHEELKH+KERI+DFAQ YAKD
Sbjct: 1201 STRHKRLKKIFVVLHEELKHLKERIKDFAQTYAKD 1233
BLAST of CcUC07G141330 vs. ExPASy TrEMBL
Match:
A0A6J1KF98 (dentin sialophosphoprotein isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111492713 PE=4 SV=1)
HSP 1 Score: 1974.1 bits (5113), Expect = 0.0e+00
Identity = 1067/1239 (86.12%), Postives = 1131/1239 (91.28%), Query Frame = 0
Query: 65 MYGGPSKLGRPGGGAGRGAGGKRPHSSFPLPPSHRPSGRLSLGGGGAGSVANPRNRTTTA 124
MYGGPSKL R GGGAGRGA GKRP SSFPLPP+HRPSGRLSLGGGGAGS ANPRNRT+TA
Sbjct: 1 MYGGPSKLARAGGGAGRGASGKRPPSSFPLPPAHRPSGRLSLGGGGAGSGANPRNRTSTA 60
Query: 125 TTSEAPQSVEENFSLVTGNNPLAFAMIIRLAPDLIDEIKRVEAQGGTPRIKFDANAKNSS 184
SEAPQSVEENFSLVTGNNPLAFAMIIRLAPDLI+EIKRVE+QGGTPRIKFDANAKNSS
Sbjct: 61 AKSEAPQSVEENFSLVTGNNPLAFAMIIRLAPDLIEEIKRVESQGGTPRIKFDANAKNSS 120
Query: 185 GNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLIESGNAWRKLNVQRILDESTT 244
GNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLL+ESGNAWRK+NVQRILDESTT
Sbjct: 121 GNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLVESGNAWRKVNVQRILDESTT 180
Query: 245 NHVKKLSEEAERRSKSRRAIVLEPGNPSMKNQIKQLAAAEANPWR-HFKNKKEPPFKKQK 304
NHVKKLSEEAERRSKSRRAIVLEPGN SMKNQIKQLAAAE+NPWR H+KNKKEPPFKKQK
Sbjct: 181 NHVKKLSEEAERRSKSRRAIVLEPGNLSMKNQIKQLAAAESNPWRMHYKNKKEPPFKKQK 240
Query: 305 NELSQVGPPKSTYKPGMSSLPASKDRLSSSPIPSPPEQSGAPVSQFGSANSTKTHAFAED 364
NELSQVGPPKST+KPGMSS+PASK+RLSSSP+PSPPEQ GAP+SQFGSAN TKTH AED
Sbjct: 241 NELSQVGPPKSTFKPGMSSVPASKERLSSSPVPSPPEQFGAPISQFGSANPTKTHCIAED 300
Query: 365 IRPRLPAKINAAASSEKEVPTKATKGVLETPGLEGNSGAKPTDLQGMLYNLLLENPKGMS 424
I+PR PAKINAAASSEKE+PTKA KGVLE PG E N+GAKPTDLQGMLYNLLL+NPKGMS
Sbjct: 301 IKPRQPAKINAAASSEKEIPTKAAKGVLEAPGQEVNAGAKPTDLQGMLYNLLLDNPKGMS 360
Query: 425 LKALEKAVGDKIPNAVKKIEPIIKKIATYQAPGRYCLKSGVELEGSKKPSSEGESSPLIS 484
LKALEKAVGDKIPN+VKKIEPIIKKIATYQAPGRYCLKS VELEGSKKPSSEGESSPL+S
Sbjct: 361 LKALEKAVGDKIPNSVKKIEPIIKKIATYQAPGRYCLKSEVELEGSKKPSSEGESSPLVS 420
Query: 485 HHQTPPVHEGLPDQITAPELQLEARCGIELEEKVETSQANKESNFLEKNCIPQHSPDPFA 544
H QT PVHE DQ PE QLEAR IELEEKVETSQANKESNFLEKN I Q+SPDPFA
Sbjct: 421 HQQT-PVHEDFHDQ-PVPESQLEARHVIELEEKVETSQANKESNFLEKNGIQQNSPDPFA 480
Query: 545 EKKGSENSEGQPASSSDNESDSDSESDSSDSGSDSGNHSRSRSRSPMGSGSGSSSDSESD 604
EKKGSENSEG+ A+SSDNESDSDSESDSSDSGSDSGN SRSRSRSP+GSGSGSSSDSESD
Sbjct: 481 EKKGSENSEGEAATSSDNESDSDSESDSSDSGSDSGNRSRSRSRSPVGSGSGSSSDSESD 540
Query: 605 APSNSKEGSDEDVDIMTSDDDKESKHKLQASAQGFSTSPAAWKSPDGGAVQIVDDEKEDG 664
APSNSKEGSDEDVDIMTSDDDKE KHKLQAS QGFS SPAAWKSPDGGAV +DDEKEDG
Sbjct: 541 APSNSKEGSDEDVDIMTSDDDKEPKHKLQASVQGFSASPAAWKSPDGGAVLNIDDEKEDG 600
Query: 665 QESDAIDIEKDSSDDEPDAKIDDPSLLPIEECGRPVEEPRSFSPYPDEFQERQNFIGSLF 724
ESDAIDIEKDSSDDEP+AKIDD SL P E GR VEE RS SPYPDEFQERQNFIGSLF
Sbjct: 601 HESDAIDIEKDSSDDEPEAKIDDRSLPPTGEGGRTVEESRSLSPYPDEFQERQNFIGSLF 660
Query: 725 EDRENTVLDGGRHEQSDSTGRISKGKSKRSSDLECLEEKSDHTKRLKSESLAQQPVSGNW 784
EDRENTV+D GRHEQSDST RISKGKSKRSS+LEC EE + HTKRLK ES +QQPVSGNW
Sbjct: 661 EDRENTVVDSGRHEQSDSTDRISKGKSKRSSELECFEENAVHTKRLKLESSSQQPVSGNW 720
Query: 785 GAQLQSPQSLSPSKLNRDSIRNPTSQVTNKGEVKGNSDYRPKKGNKETVPEKNSSDVSQA 844
G QLQS ++LSPSKLNRDS RNPT+QVTNKGE+KGNSD+RPK GNKE V EKN SDVSQA
Sbjct: 721 GVQLQSSRNLSPSKLNRDSARNPTNQVTNKGELKGNSDFRPKMGNKEIVSEKNCSDVSQA 780
Query: 845 GWRPHDQSGVRAVDTAARAEKHGD-IGRGTKHTDKGGHANESFHVFKDTFYGNAENEGTK 904
WRPHDQSGVRAVDTA R +KHG+ IGRG KH++KGGHANESFH +KD FYGNAENE
Sbjct: 781 SWRPHDQSGVRAVDTAVRPDKHGESIGRGGKHSEKGGHANESFHAYKDRFYGNAENEEMN 840
Query: 905 EKKVSKNSRSGGPGDKQIQSFDSHHGKPGEIVGKFKEGQTFSSSQMGYSPRDNNNRVSAN 964
EKKVS+NSRSGGPGDKQIQ DSH KPG+IVGKFK+G+TF SSQMGYSPRDNNNR+SA+
Sbjct: 841 EKKVSRNSRSGGPGDKQIQPSDSHLSKPGDIVGKFKDGKTFPSSQMGYSPRDNNNRISAD 900
Query: 965 RSPVNGKGRILQRELSDLELGELREPFHEEARGRKKFERNNSLKQLENKENTTDIWSSDL 1024
RSPVNGKGRILQRE SDLELGELREPF EEA G+KKFERNNS KQLENKE+T+DIWSS+L
Sbjct: 901 RSPVNGKGRILQREHSDLELGELREPFPEEALGKKKFERNNSSKQLENKEHTSDIWSSEL 960
Query: 1025 SKGKSNLKASLEYGKRSSPDVSTKFPSNLEGSNKKKNSEHIVEDSTRLNHRSLQSYP--- 1084
+KGKSNLKASL+ GKRSSP +STKFPSN EGSNKKK SEH VED TR+NHR QS+P
Sbjct: 961 NKGKSNLKASLDNGKRSSPHISTKFPSNPEGSNKKKISEHKVEDLTRINHRPPQSHPQGS 1020
Query: 1085 QYNSRVDHVEVDKSVDTNVKPNQGIGPESCGESNRKASVGISQLNDMKREQLPSKKGSKK 1144
QY+SRVDHVEV+K VD NVK NQGIGPESCGESNRKASVG+SQL+DMKREQLPSKKGSK+
Sbjct: 1021 QYSSRVDHVEVEKPVDANVKLNQGIGPESCGESNRKASVGMSQLHDMKREQLPSKKGSKR 1080
Query: 1145 LAPNPIAEVTDAQKNPVSAERENSDPKRRDSSSDENSCSYSKYEKDEPELKGAIKDFSQY 1204
APN I EVTDA KNP+SAE ENSD KRRDSSSDENSCSYSKYEKDEPELKGAIKDFSQY
Sbjct: 1081 QAPNQIIEVTDALKNPISAEHENSDLKRRDSSSDENSCSYSKYEKDEPELKGAIKDFSQY 1140
Query: 1205 KEYVQEYHDKYESYLSLNKILESYRAEFCKLGKELDSARGQDSEKYFNVLGQLKESYRLC 1264
KEYVQEY DKYE YLSLNKILESYRAEFCKLGKELDSARGQDS+KYFN+LGQLKESYRLC
Sbjct: 1141 KEYVQEYRDKYECYLSLNKILESYRAEFCKLGKELDSARGQDSDKYFNLLGQLKESYRLC 1200
Query: 1265 ST----RHKRLKKIFIVLHEELKHIKERIRDFAQIYAKD 1295
ST RHKRLKKIFIVLHEELKH+KERI+DFAQ YAKD
Sbjct: 1201 STSNLQRHKRLKKIFIVLHEELKHLKERIKDFAQTYAKD 1237
BLAST of CcUC07G141330 vs. TAIR 10
Match:
AT3G21290.1 (dentin sialophosphoprotein-related )
HSP 1 Score: 617.1 bits (1590), Expect = 3.3e-176
Identity = 512/1300 (39.38%), Postives = 705/1300 (54.23%), Query Frame = 0
Query: 65 MYGGPSKLGRPGGGAGRGAGGKRPHSSFPLPPSHRPS--GRLSLGGGGAGSVANPRNRTT 124
M+ G SK G GG G G+G R +SFP P + PS GR+S GGGG GS A PR R+
Sbjct: 1 MFKGSSKRGGRGGSGGGGSGPSRNRNSFPPPTNRHPSPIGRMSSGGGGGGSAA-PRQRSN 60
Query: 125 T------ATTSEAPQSVEENFSLVTGNNPLAFAMIIRLAPDLIDEIKRVEAQGGTPRIKF 184
+ A+T+ + ++VEE F+LV + AF MIIRL+PDL+DEIKRVEAQGG +IKF
Sbjct: 61 STSVKAAASTTVSSRTVEETFNLVPRESSSAFGMIIRLSPDLVDEIKRVEAQGGAAKIKF 120
Query: 185 DANAKNSSGNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLIESGNAWRKLNVQ 244
DA NS+ N+I+VGGKEF+FTWS E G+LCDIYEE +SGEDG+GLLIE+G AWRKLNV
Sbjct: 121 DAFPNNSTENIINVGGKEFKFTWSGEKGELCDIYEEHQSGEDGNGLLIEAGCAWRKLNVL 180
Query: 245 RILDESTTNHVKKLSEEAERRSKSRRAIVLEPGNPSMKNQIKQLAAAEANPWR-HFKNKK 304
R LDESTT+H+K S EAE+R+KSR+AIVL+PGNPS+ KQLA AE +PWR K KK
Sbjct: 181 RTLDESTTSHMKMRSVEAEQRTKSRKAIVLDPGNPSV---TKQLAHAEGSPWRMSNKQKK 240
Query: 305 EPPFKKQKNELS--QVGPPKSTYKPGMSSLPASKDRLSSSPIPSPPEQSGAPVSQFGSAN 364
EPP KK+K + VG PK +++PG +S P K+RLS+SP PSP Q P +G N
Sbjct: 241 EPPPKKRKVDPPPVPVGGPKPSFRPG-ASTPTMKNRLSASPGPSPSNQYNTP--PYGIGN 300
Query: 365 STKTHAFAEDIRP-RLPAKINAAASSEKEVPTKATKGVLETPGLEGNSGAKPTDLQGMLY 424
KTHA E++ P + ++N EKE + +T G E + K DLQ +L
Sbjct: 301 MAKTHAANENVTPVQTKGRVNMI---EKEPSAWKNNVLRDTSGREAINVNKEIDLQSLLV 360
Query: 425 NLLLENPKGMSLKALEKAVGDKIPNAVKKIEPIIKKIATYQAPGRYCLKSGVELEGSKKP 484
++L E P MSLKALEKAVGDK+PN KKIEPI+K+IA +QAP RY LK ELE KK
Sbjct: 361 DILKEAP--MSLKALEKAVGDKVPNPAKKIEPILKRIANFQAP-RYFLKPEAELESYKKH 420
Query: 485 SSEGESSPLISHHQTPPVHEGLPDQITAPELQLEARCGIELEEKVETSQANKESNFLEKN 544
S + SSP H Q PV E DQ+ P G EK + N E + +
Sbjct: 421 SPDSGSSP--EHQQLLPVTECSRDQLPVP--------GRNNTEKFSLCEQNGEGSL---D 480
Query: 545 CIP----------------QHSPDPFAEKKGSENSEGQPASSSDNESDSDSESDSSDSGS 604
C+P HSP F E+K SEN E Q SS SDSDS+SD+SDSGS
Sbjct: 481 CLPVHLVEQLSTQENVDIEHHSPGIFHEEKRSENREAQARSS----SDSDSDSDNSDSGS 540
Query: 605 DSGNHSRSRSRSPMGSGSGSSSDSESDAPSNSKEGSDEDVDIMTSDDDKESKHKLQASAQ 664
D S+S GS SGSSSDSE A SNSK+GSDEDVDIM SD D+E Q+ Q
Sbjct: 541 D--------SKSAAGSDSGSSSDSE--ASSNSKDGSDEDVDIM-SDGDREPLLTTQSLEQ 600
Query: 665 ------GFSTSPAAWKSPDGGAVQIVDDEKE----DGQESDAIDIEKDSSDD----EPDA 724
G +S + + AV I + + DG SD +D+E +SSD+ + D
Sbjct: 601 DAIDLPGHGSSAVEIEGHNSDAVDIDGHDSDAVDIDGHGSDTVDVEGNSSDEGHGSDADR 660
Query: 725 KIDDPSLLPIE-----------ECGRPVEEPRSFSPYPDEFQERQNFIGSLFEDRENTVL 784
K + + +E E G + F+ D +ERQNFIG LF+D ENT
Sbjct: 661 KKNSDNNWKMETTTGTSPTANGEVG--ISGQEHFTSGHDNLRERQNFIGQLFDDTENTTK 720
Query: 785 DGGRHEQSDSTGRISKGKSKRSSDLECLEEKSDHTKRLKSESLAQQPVSGNWGAQLQSPQ 844
+ ++++ D + R+ K +++++ D E +KS H K KS+S Q
Sbjct: 721 NNFKNDKRDISERLGKDQNQKALDFEHYSQKSAHEKNRKSQSCNQL-------------- 780
Query: 845 SLSPSKLNRDSIRNPTSQVTNKGEVKGNSDYRPKKGNKETVPEKNSSDVSQAGWRPHDQS 904
S +++DS E+K +++ R ++ P + +S
Sbjct: 781 ----SAVSKDS---------QHSELKYDAELRNASASQTIDPLRGLL-----------KS 840
Query: 905 GVRAVDTAARAEKHGDIGRGTKHTDKGGH------ANESFHVFKDTFYGNAENEGTKEKK 964
+ + ++ KH D + +DKG H ++ S F+D N ++ + K
Sbjct: 841 SIEKSNRHGKSNKHSDALGNVRKSDKGDHFPLEMLSSRSGKAFRD----NQRDDVHLKNK 900
Query: 965 VSKNSRSGGPGDKQIQSFDSHHGKPGEIVGKFKEGQTFSSSQMGYSPRDNNNRVSANRSP 1024
+N + G + ++ KP E+ G K+ + S +G SP D+ A
Sbjct: 901 FPRNKKDGESAIRPSLPTETSDRKPDELDGSDKDPKNVSGLSIGSSPLDSQRTYLAKLP- 960
Query: 1025 VNGKGRILQRELSDLELGELREPFHEEARGRKKFERNNSLKQLENKENTTDIWSSDLSKG 1084
G G +LQ+++S+LELGEL EP E+ K E S +Q K +T++ D K
Sbjct: 961 -KGNGPVLQKQVSELELGELPEPLGEDT-ALKPIEEKTSFRQSNLKPSTSEKLGIDSDKR 1020
Query: 1085 KSNLKASLEYGKRSSPDVSTKFPSNLEGSNKKKNSEHIVEDSTRLNHRSLQSYPQYNSRV 1144
+S S K+++P P + GSN EH+VEDS R +LQS+ Q +
Sbjct: 1021 RSKKSDS----KKAAP------PHTVNGSN-----EHVVEDSERSQKWALQSHGQNLTGT 1080
Query: 1145 DHVEVD-----------KSVDTNVKPNQGIGPESCGESNRKASVGISQLNDMKREQLPSK 1204
D E+ KS + + G E GE+N+K V + + S+
Sbjct: 1081 D-TEISSQNKNLEDAAYKSRQKDSRARVGNSVEGYGETNKKTPV----VKHGSKRASTSR 1140
Query: 1205 KGSKKLAPNPIAEVTDAQKNPVSAERENSDPKRRDSSSDENSCSYSKYEKDEPELKGAIK 1264
+ + ++ + K+ S + +++ +S E SY KYEK PELKG I
Sbjct: 1141 SSRESKRHSSVSNSINGHKDATSIPGGSVVREKQMTSFGEEDSSYLKYEKASPELKGPIS 1192
Query: 1265 DFSQYKEYVQEYHDKYESYLSLNKILESYRAEFCKLGKELDSARGQDSEKYFNVLGQLKE 1295
D QYK Y+QEY+DKY+SY S+NKILES+R +F KLG++L A+G+D E+Y ++ Q+KE
Sbjct: 1201 DHLQYKAYMQEYNDKYDSYHSINKILESHRNDFQKLGQDLGFAKGRDVERYNKIVEQIKE 1192
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
XP_038883601.1 | 0.0e+00 | 91.88 | dentin sialophosphoprotein isoform X1 [Benincasa hispida] | [more] |
XP_004146856.1 | 0.0e+00 | 90.65 | dentin sialophosphoprotein isoform X1 [Cucumis sativus] >KGN59835.1 hypothetical... | [more] |
XP_008447590.1 | 0.0e+00 | 90.41 | PREDICTED: dentin sialophosphoprotein isoform X1 [Cucumis melo] | [more] |
XP_023524753.1 | 0.0e+00 | 86.72 | dentin sialophosphoprotein-like isoform X2 [Cucurbita pepo subsp. pepo] | [more] |
XP_023524752.1 | 0.0e+00 | 86.44 | dentin sialophosphoprotein-like isoform X1 [Cucurbita pepo subsp. pepo] | [more] |
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A0A0LCU6 | 0.0e+00 | 90.65 | Occludin_ELL domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_3G84991... | [more] |
A0A1S3BIQ1 | 0.0e+00 | 90.41 | dentin sialophosphoprotein isoform X1 OS=Cucumis melo OX=3656 GN=LOC103490006 PE... | [more] |
A0A6J1KCU5 | 0.0e+00 | 86.40 | dentin sialophosphoprotein isoform X3 OS=Cucurbita maxima OX=3661 GN=LOC11149271... | [more] |
A0A6J1GAK2 | 0.0e+00 | 86.15 | dentin sialophosphoprotein isoform X3 OS=Cucurbita moschata OX=3662 GN=LOC111452... | [more] |
A0A6J1KF98 | 0.0e+00 | 86.12 | dentin sialophosphoprotein isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC11149271... | [more] |
Match Name | E-value | Identity | Description | |
AT3G21290.1 | 3.3e-176 | 39.38 | dentin sialophosphoprotein-related | [more] |