Cla97C01G004270 (gene) Watermelon (97103) v2.5

Overview
Name: Cla97C01G004270
Type: gene
Organism: Citrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
Description: Protein of unknown function, DUF547
Location: Cla97Chr01: 4166043 .. 4177256 (-)
RNA-Seq Expression: Cla97C01G004270
Synteny: Cla97C01G004270
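
The Location field gives a chromosome, a start and end coordinate, and a strand. The following is a minimal sketch of how the genomic span could be derived from it, assuming the coordinates are 1-based and inclusive (a common convention for genome annotation records, but not stated on this page). The function name parse_location is hypothetical and not part of any database API.

```python
import re

def parse_location(text):
    """Parse 'Chrom: start .. end (strand)' into its parts plus a span length.

    Assumes 1-based, inclusive coordinates (an assumption, see above).
    """
    m = re.search(r"(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)", text)
    if m is None:
        raise ValueError("unrecognized location string: %r" % text)
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "chrom": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        "length": end - start + 1,  # inclusive span in base pairs
    }

loc = parse_location("Cla97Chr01: 4166043 .. 4177256 (-)")
print(loc["length"])  # 11214
```

Under that assumption the gene spans 11,214 bp on the minus strand of Cla97Chr01.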
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAGTGGATTCGATATGCATATGCGTGGTGAAGAAACAGCTTCTGGGAAGAGAGAACTTAGAGATTATTTAGCGTCTCAACGTGTTCATTACCGCCATAGGCGATCTAGAAGGTTTTATTTGCACGATTTGCATCTTTGCTCGTAACTAGATTGCTAATAGAGGACGTTTTCTTGTTTATCTGTTGCATTTTGATTTGTTTATTGTTTGGATGAGTGCATAGAGAATAAGATTTTTGCTTCCTTTCTTCTTGTTTGATTTTTAGAAACTGGAGAATGGAGAGTGGAGAGATTTGTGGGCAAATTAATGTATGTTTCGCATTTTCACCTTCACCATTTTGTTGATGTTGAATGTTGAAAGCTGGATCCAATTGGCAGGGAACTAAACTTGAACAGAGAAAAACGTGGGGATGGCAATAAGTTGTTGGTCACTTAATGTTCTCATTTTGTTCTTTCTTTTTTACTTGCACAAAGTTTCGTCATAAAGATTTTATAGAATCTATGAATTGATAAATTCATTTCGTCACTCTATAACGATCATTACGTCACGTCACTAACCGATGATCCTTTTGTTTTTCTTCTACTCCTCTTCAGTTCTTCAGACAGGAACTCCAATGTCTTTAGAGGTGGGGTTCTGCATTCTAATAGCAAGAATGATCGGAGTGACACACAGGTGAGGCAACAAATCCTAGTTGTCGTGCTCATATTAGTGTATTGTTTTTATATGTGTGCATCCATGATGACTATCTCGCGAGCTATTGGATCTTGTATGTCAGTTGTGATTATAATGACTTCGATGATAATTTGAAGTGAATGTTGTAATAACAGGCATCCCCACTTTCTGCAAGTGGTATCAGAGCACCAAGTCCACTACATGAGAGGTCTACAGATATCAATGATAATTCATCAACTAAACAGCGAGCGTCTTTGGAAAATGATGTAAGCATCTGGTCTCTCTCTTACTCATGTTCCATGTGCAGTTGGATTCCATATGTGTTATGTTCTGAGGCAATCTATTTCAAGATATCAACAACTCTGGATGGTCAAGTCCGAAGTGTCAAACAGTTCAATTGCCAGTTATGTGGAAAATATTTTAACATCTCAGTTTGTACACATACAAATTGCAGATCGAGTTGCTACAGCTGCGCTTGCAACAAGAGAGATCTATGCGAAGTATGCTAGAAAGGGCAATGGGTCGTGCATCAAGTACTTTATCTCCCGGGCATAGGCACTTAGCCCAGGTAATTCTTTCACTTTTTTCATGTATGTATAGAAGTATCATTAAGATGCTTATGATCGTCCAAATGGTTAAGAACTTAAGAGGTCCTTTTCTAGTAGGCATTTCTAAATTCAGTGGTGGGAGATTGAATCTTGAATTCAAACATCTCCTAGTAGTTTATCACGTCACGTTTATAACAGATTATTTAATTTCATACCTTCCCTAACTGAAGTGCATTTGTCCCAGACGAAGGATTTGATCTCAGAAATTGAATTACTTGAAGAAGAGGTCGCAAACCGTGAACAGCATGTGCTCTCTCTCTATAGAAGTATTTTTGAAAATTGTGTTAGTAAGCCATCTTCTCAGCAAAATTCAGTCACAGCCTCTCCAGCTCATGGGAAGCATGAATCAAGAAAACACCCCAGTATCATTTCAAGTGCGTTTTGTTCGTCGAGGAAGTTTCCTTTGGGACCTTTGCAACCTTTCTCTGTAAATGACTTGGGAAAAAGAACCTCAAATGCTGGTCCTAATTCCTTGTTTGGAGGTAAAAGCGACATAAGTATGGGGAAAACTTCAGGCACTGCAAAGGTACACTTAATAGTTAATATGTAGGTTGTTGTGTTTGTCCTGGTTTGCTAAAAAGTAAGTTTTGCATTGATCTAAGAATATCCGGTATAACTGTTAAGTTCGATCCTTCCTATTATGGCATCATTAGTTGAAAACCAGACAGGGATATCAAGCAGTGACTATTTGTAGAATATTATAAATTTATAAATATGACA
ACTCCTTCAAACAAAAACAGAACCAAAGAAGGTTTCTTGCCCCATGGACACATCATTCAGTATTCTACATGTTGATTTTTTTTGCATTTTGTAGTCATAACCTGCTGTGGCAAGTATTGAAATGGTCTCATCAGAGTAATTTCTTGTCAGGTTCGTGAAGCCTTTTCGCAGGCGAAGAGAACTTCTCTGCAAACTCTAAAAGACCATCTTTTTGAGTGTCCAAGTAAATTATCAGAGGAGATGGTGAGGTGCATGGCTTTTATATACTGCTCTCTTCATAGAGTGGCATCAAACAAGGCTAAAAAAAAGGCAGGTTCCTTTCCTGAAGTTAAACAGCCCCAATGTGGACCGGCGGAGGAACAATTTGGGGGTGGGAAAGCAATGCTGGAAATACATTGCATATCAACCAATAACAGCCAGTTCTCTCGTGCTTCATACGCAATCAACAATTATAGGTAGATTTCTTGCTTGGAAATGTAATTCCCTACAGGGGTAGGGGACGAGGTAGAAAAAGAAGACTTAATTGGTCCTTATCTTCTGAAAATTCGATTAACAATTTTTTTTTTTTTTTTTAATGAATGTCTGGGACTCATGCTCTGATGCCATTTACCTTTGCTTGCCCTTAGCAGTTAACCATCCCGACATAACTTTTCTTCTAATGTCTGGGAGATATTTGATTCAAGCTTTATCTATTCAATTTTAGAGAGAGAGTAAATTAATGGATCTTTTTTTTTTTTTATTGGTTTCAGAGTATTAGTTGAGCAGCTGGAAAATGTGAATGTCAGTAAGATGGGGATCGATGCTCAAACTGCATTCTGGATTAATGTGTATAATGCTCTTCTTATGCATGTAAGATCAGCTTTGCGACATTTCCAGTTCACACTTCTTATTTTGTTGATCTTATTTTGCCTGTCTTTTTCAGGCCTATTTGGCATATGGAATCCCTCATGGCTCTCTAAGAAGGTTGGCTTTGTTCCATAAGGTATTTCATTTCCTGCGGTACTTGAAACATGCAGGCATATAAATATCTTCGTGTACCAAAGAAATTTGAGATAGATATTTGAGAGTTACCAATATGTGTTATAATGATCTTGAAATATATTAGGCCGACAATTTAAGGGGTTTCGTTGGCTTCAAAATTTGTCTTTTCTTTTCATCCACACACTGTTCCTTCTAAAATCTTTGTGCTCGTGTCCTGTGAGTGAATTTAGTTGTTTTAGATAGACAAGCATTGTTGATGGGCTTTTCAGTATAGTGCAGGCTGCTTACAACATCGGTGGCCATATCATCAGTGCAAATGCAATAGAGCAATCAATTTTTTTCTTCAAATCTCCCCGAATAGGATGGGTATGCACTATTATCTTAAGCTTTTGGCGCTGTTGTGATAAGTTGATAACTAGTCCTGGAGTTCCTAATTTTTCGAACACCCTGTTAAAAAAGATGTAAAGTAAAGTTTTTGTTTTGACTATGTTTTTGTATAAACTACAGTGGCTTGAAACTATCATTTCAACTGCGCTGAGGAAGAAGTCTGGGGAAGAAAGGCAACTAATCTCTTCAAAATTGGGCCTTCATAGTCCTCAACCTCTTGTTTGCTTTGGCCTCTGCACTGGTGCCTCTTCAGATCCTGTGGTAAATTATTTCAATCTCGATTGGCTTTTTGACAAAATGGATCTGTTTTTGTCCTTTCAAACCAAATTTAATAGATAGAAAATCAAATAGCTTTTCTCAGAAACATAATAAAACAAATTTATGTGAGGCTTCCTACCGTGATTACTTACTCCTGGAGGGGTAAGAAAGTAATTTTTCAGGAATAAGTTAGTGATTGGCAGAAGGCTTTTGTAGGGTTTTGGGTGTGTGAGGTGAGTGTGGGAATTTTGGTGTTAGGCTGATTGTGGGTTTGCGGAGAGAACCAGCTCTCTCCAATAGCTGGAGTACTTTGTTCATTCTCATTTTGGTGATTAATTTATAAGAAAATTTCTTTTTCCTGAGGTACATACAAG
TTTATCTTTATCATTTCCAAGAATTTTGTTGTTTTCTGAACTCAGTATGTTGCAAAGAATTACTTAGTAGCTGGCTTGTTTAAATTTGTAGCTGAAAGTGTACACTGCATCAAATATTAAAGAGGAACTGGAGGTGGCCAAAAGGGATTTTCTCCAAGCAAATATAGTTGTGAAGAAGTCAAAGAAAGTATTCCTACCAAAGGTGCTCGAGAGATTTGCACGTGAAGCATCCATCAGCTCAGAAGAACTCCCGAAATGGGTTTCTGAAAATGTCGACGGAAAACTCCACGAGTCCATACAGAAATGTACGGAACATCGGACTGGCAAGAAGGCATCTCAGATCATTGAGTGGCTACCTTACAGCTCAAGGTTCCGGTATGTATTTTTTCCCAATCTAACTGAAAAGCCATGGTGGTCGTAAGATTTATAATGTCCTTAACTTGTCGTTTTGAGGCTTCGAATATAGTAGGACCAAAAATTTTCTGCTAATCTGAGGGGAAATGGATTTAGGTAGTTGAAAGTACTCACCATTTGATTGTGTATATATATTGGGTCTTGTTGTTGTAGCTTTCCGAGCTGTTCTTCAACAAAAACACTGCATTTTGAACTCTCCCTCTGTTTATGCAAAGCCAATTTGTTGTTTCCTTTCAACATTGTGAAAACGGTTCAAATTCTATATTTCAATATCTTTGAAACAGATCTTTAGGAGAATGAAGCAATGTGATGATTCGATTTCAACTTAACAATATAAATCCTCCGACTTGTTCTAAAATGTCGAAGCATTTTGAGTAGTTTGTTGATACAGATAAAGTCAATGAAAAGAAATGTAATGGGGATTACAAACTTAGTCCAAAGCTCCCAATCCTAACATCGACTTGGGATTACAACCTATCCCATAATCTATTACATTCCACCCTCCCAAATAAACCCTTACTTCACAAAGACTCATTTGACTTCGTTTCACATTCTCCTTCATTCTTACTTTGACATTTTCTAAAACAAAAAATAGATACTTTTCTAAGGAAAAAAAATAAGTTGACTTAAAAAATAAGTTGTTTAGAAAATGAATTACAATATTAAAAATTCATTGCTAAATATTTTTTTCAAAAACAGAATCGTAGAAACAAAATTTTTATCAAATGATAGGTGTAGAGTGCTATATTTCTTCCATTGAGATATTGAAAAGCTTATTTTCATTTTGGTTTTAATTTTATGTGCTATAATTTTCTTGATACTTGATTTTTAAGAAAACAAAAACATTTGTGAGATTATCAATTAATTTGAAGAATGAAACAGAAAAGGTTTCCCAAAATAAGTTCTAATTGTTCAAACAAAGGATGAACAATGGGATTTGAAAACAAAAGATCAACGACCATTATTCCACTAAGAAGATGTACATATCTTTATGATATTTTTCACCACTTGGAATGGAAATATTGAATTTAAAATTTCCTTAAAAAAGAAAAAAAATCTAAATCCATCCATCTTTTTAAAGCCCTAAAGAAAGTAAATAGGCAAGTTTGCCTAGGTGCCAAGATAAACAAAGGGAAATAACAATGAGATTTGTAAAAAAAAATCTACCCACTTAATATTGTAAAAAAAATTCTACCCACTCATTTACAAAGGCGTGTTGACTTTTTTTCTTCTTGGTGGTTGGCTTCCATCTTCATCTATGGAAGGCGTCGGGACATTTGTTAGTAAGTTTCAAGTAAGGGTGTGCAACAGTTGGTCGGAGTCTATTTTTGACAAAACCATCACCGAACTGACTATGGTCGGTTTAATAAATGTTCAAACTGATTTCGATCGTCAAAGAGTGAAAACCGACCTTCGATCGATATTTATTTTAATGATTGTCATTTATTTCCTTTCAGGAAAAAAAAATTGAAATATTGACCCAAATCTACCAAAAATCTTTCTTTTAAAAAAATGATTACAAGTTTAGGTCCTTAATTTTAAAATTTGTGTCTATTTAGTCATTGAACGTTAAAAAGTGTAATAAA
TTTTTAAAATTTTTAGACTTAATTATGTGTCTATTATATTTAAATGAGATTTTATGTCTGTACTCTAAATTTTTAATTAGGTTTTTGGTAAATCTACAACTTTTTAAAAATATTAAATAGGTCAATGACTTATTAGTAAAATTTTTTAAATTTAAGAACCTATCCGACGCAAAATTGAAAGCTTAAGGATCTATTTAACACTTATTAAAGTTTAGAGACCTATGACATACAACATTAAATGTTCATGGACATATTAAACATTTATTAAACTTTAGGAACTAAAAATACACACAATTCTTAAGGTTCAAGTACTAAAATTTTAATTCAACCTAAAAAAAAAAAGAAAAAAGAAAAAAGAAAAGAAAGTTTGTCTACCATGGAAAACAAATGGGGTTGGAAAGTGGTCAACTTCTCTATGGAAATCTTTGTTGAGGCACTTTAACAACAATAGTTGCAATTTGTTCAACTTTTGCATTATTTTTCTTGTCATCTCCTCCTAATTAGTGATGCAATCATCATTATTCATTAGACAAATCAACCCTCAAATATATCAAACCAAACTTAATGTGGTTCACCTTCTTCAAACCTAGATTTGTCTTTAATTATTTTGGATTATATGTGAAAAAGAAAGGGATGTACATTCCAATTTATCTTCTTAAAATTTTATTATTTTAAAGTTTTACCATAATTACTAGGTTTTCATTAGGGTAATTGTTTTAAATGACAAAATTCTTGCAGATATTTTTAAATATAGCAAAATTTCACTGTCTATCAGTGTTTATCATTAATCGATAATGATAAATGATGGTAATCTATTAGGGTTTATAACTGATATAATGATATTTAAGTTTTTTTTTTACCTAGTTGTAATTTTAGTTTTTTTTGGTTATTTTTATTTTTATTTTTTTACATTCCATTACATTTTAAATATAAGATGGAGATACATTTTTCATGTAACTATTAGGGTTTTTTTTTAATTTATATTTTTTCTATTATTATTAATTATATTTTTATTTATTTTTAAGTTTTCTCCCTTGATAAAATTCCGTGAAACTTGTCGAATATTACTAGTCAAGCTCTGAAAGATAAGGTCTCTCCTATGCTAAATATTTTCATATTGCAAGTATATATATATATATCTTTAAAAAAACTATTTTTTGGGCATATGACGTAGGCAATTATGTTGTCATTATTATTATTGTTTTTATTTCTGTATGAAATCTGGACTTGATCCTATATTAAGTAATTTAAATGTCTAATCAGTACTATTTTTTATTACTGAACATAACTCAACTAACATAAAGTATATATCATCAACCAAGAGGTTAGATATTTGAATCTCTCATCTTGTTGAACTAAAATAAAAGCTAGTTCACTTAAAAAAGAGTTTGTTATAAAAAATACATTTAAGCTTTACCATTTTTGTTATAAAAACATTCCTATATTTCTAAAAGTTGCAATATTTTTCATGATTTTATCATAAATATTTGGAAACTACCCTTGAAGTAAAAGCCTTTAGAATTTTAATTTGGATAGAAATTAGGACATGAAATGATATGTTGGTAGCTTACAATAGTGTGGATATTCAAATTTTTGGTCTAGGTCAACGCTCATTATTCTAAAGAATTTATATTCCACGAGTAATTTTGAAACGGTTATGAAATGTTAAAAGTATTAATACAATATTTGACTTAAGAAACAAGTAAGAAAGTTTAGCAATATTATTATAATCTAGCCTTATTTGTATACTACCATTTATCCCCAGAAACTTAGTCACAATACTTTCAAAATGCCTATTTTGGGTCGCTATACTTTAAATTTTGACAATTTTGGTCCATTTATAAAATTTTAACACAAATTTTATCCATAATAAAAGCTCAATAAATAAAATTTATAATTACATCATGGTAAGAAATTTGATAAATAAAAGGTCATGATAAAAATATTATAAAAATAAATAAGATAAAATTAAAATGAAAGGACCAAAATAATTATTATTA
AAAAATATGAAACGAAAATAAATATTCTAAAAATATAAACATAGTAATTGAAATAAACAAAGTCGAGTAACAAAGATGTTTTATAATTATATAGGTCTAAGTAAATAAAAGTTGAAGCTCCAAAAACAAAAATAATATTTAAACATTGAAAGTATATCACCTAAATTTATAATTTTTTTAAAAAAAATTAAATTAACAATGAAAATTTAACTCTCTAATTTTTAGTTCAGATATATATACTATAACATTTTCAACAATATATGTTGGTTAGCAATGTAATTTAATTTAAATTTAAAATAAAGTCGAAGGTACGTATGGACAATAGTAACATTTAAACCTTATTAATATAGATTTATAATTTTTCTGATGAACAATAAATTATAAGATCGCAGTGTTTGGTGAGCAAACTGAGTGAAACTTCGGCCAAACCTTACACCAAATCACTTCACTAATCTTTTTCTCCAAAACCCACTTTTTTATTTTTTATTTTATTTTATGTACAAACCCACTACTTGTTCTAATTCTGGTTTCCATTACATATTAATAAAAATTTATTACATTTTTCTCCTAATACAATCTTAGATTATATGTAAAGTATAAAATAAAATTAAAAAATCCTAAACCTTTATTTCTTTCGAAAATTGTAATGTTATTTAGACTTAAATGTTTAAAAAATTATGTGAAGGTTACTTAGAATAGTGTGACAATATAAATTTTCATTTGAGATGATAATTATTACACCGTTTTAGGTTTCAAATTTCAAAACATTTTTAAAAATACTCGTTTATATGCATAGTTATATTACCTATAACATTGGATAAAAAATCGTAGTTTATGTCTTAAATTTTGATGTGAAAAATGTTATACTTGACTAAAAACCTATATAATTAGAAATTTAGCAATAAGAACCAGTGGCTAAAATAACACAAGCATTTAAGTCTAAAAAACAAAATAGTACTTAAAATTGAAGGGAAAAAAGAGCAGTTTCAATTTCACAATTACATTTATAGTTGAGTTTTTTTTTTTTTTTTTTTTTTGTTGGGTCTTACATTCTTTCTAAATGCTTCTTTTATAACCACCACCAAGAAAAAAAAAAAAATTCAAAACAACTTAGTACGATAAAAAGTTTCAAATTTCTCCCCCATCCATGGTTGAACTCAAAAGAAAAAAAAAGTCAAAATATTATTTTATTCTCTATATCTATATTCACTTTAGTCCCTTAGCTTCTATTAAAGTTTAAATTGAGTCATTCAAAATTTTTTTAATCAAAATTGGTTAAATATAGCAACAATATTCCTACAAGAAAAAACAATATGAGATATATTTTCCATAATTTTAAAAGGAATGCCAAGAATCACTATTTTTTTAAAAAGAAAAAAAAATGGAAAAAATCAACAACAAACTAATTGCATAGGCTGTGCATGAGTTAATTTTAAAGTTTATCAAAACATGAAAAAATAGAATGAAACTCAAAGTAAGGATCCATTTGAATTAACTTAAGAAATAAATGTTTTTAAAAAATTCATTTTCATTTAAACACTTTAAACAAAATGTTTAAAATAAAAACACTTTTGTTGAGTGGTTGTCAAACATTCGATTTTTTTTTTCCTTTTCAAAATAACTTACTTTTTAATTTAAACATTTTAAAAATCATTCCAAATACACCTAAAAGAATTAACTTTAAAAAAAAAAAAAAATTTGACCTACATATGAAAAAAAAACAAAACAACTAATAAAAAATATTAATTTTTTAAATTTTGTAATAAAGCAATGGTTTGTTTACCAAACAAAAACCAAAATAAAACAACATAGCAAAGTCAGAATCTACAATATAAAGGCAAGCACGCCAGAATCTTCTTAACTTTCCCTCATCCTTCTTTGTTTTTCTCTTCCCCAAATCCAATTTTCCTTCTTTATAATTTCTCTATAATCAAATTATTTATTTTATCCATTGAATTTTCATATTCTATATAAAATATATATTATTATTAAAAAAAAAA
AAAAAAAAAAAAAAAAAAGAAGAAGAAGAAGAAGAAAATAGATCGGTTTATTATAATTGATAAATACCTTTTTATTTCCCTTTGGATTTCTCATGTCCCGATAACTGCCAAATAATATCACCATGTTTCCGTTTTTCCGTTATTACCCCTTCTTCTTCTTCTTCTACCTTCTCTCCTCTGTTTTTCTCTTCTTCCCTGTTTTTCCCGCCAGAACTTTGCCGGACTTCACCACTCTCGACGCCGATAACAACAACTACGCATGGAGAAACTTCGCTCGGTTCCTCGACGCTGGCAAAGGCAGCGAAGTGAACGGCATGTCGGAACTGAAGAAGTATTTGAATCGATTCGGTTACCTTCCGATTCCTCCTCAAAACAACTTCTCCGATTTCTTCGACGATCAATTCGTATCGGCGTTGATTCTCTATCAGAATCGCTTAGGTTTATCAGTCACTGGAAAACTCGATTCCGAAACAATCGCAAGCATCATGTCGCCTAGATGCGGAATGAGTGACCTAATTAAAATCAACAACAACAACACAACAATTCACTCAACGCGTCGATACGCTTTCTTCAACGGCCAACCGAGATGGATTCGATCCTCAACTCTAACGTACGCTCTCTCACCAGATTACACAATCGAATACCTAACTTCATCAGAAATCCGCAAGGTCGTTCGACGATCGTTCTCGCGGTGGTCCGCAGTGATTCCGTTGAACTTCACCGAATCCTCTGATTACGAATCGTCAGATATCCGAATCGGATTCTACCGCGGCGATCACGGCGATGGAGAGGCGTTCGACGGAGTATTAGGCGTTTTGGCACACGCGTTTTCACCGGAAAACGGAAGGCTGCACTTAGATGCGGCGGAGCGCTGGGCGGTGGATTTCGAGCAGGAGAAATCGAAAGTAGCTGTGGATTTGGAATCGGTTGTAACGCATGAGATAGGGCATGTTCTTGGACTGGCTCATTCCGCAGTGAAGGAATCTGTGATGTATCCAAGTTTGAGTCCGCGAGGGAAGAAAGTGGACCTTAGGATCGATGACGTAGAAGGAATTCAGTATTTGTATGGTACGAACCCTAATTTCAAATTGAAATCCTTCTTGGAATCCGAAAAATCTATCAACAGTGGATCATCATCTTCTTCAATCAACACTAATTTCTTCTTCTTACTGTTATTCTACTTGTTGGTTTGGGTTGGGTCTCTGTTTTTCTGA

mRNA sequence

ATGAGTGGATTCGATATGCATATGCGTGGTGAAGAAACAGCTTCTGGGAAGAGAGAACTTAGAGATTATTTAGCGTCTCAACGTGTTCATTACCGCCATAGGCGATCTAGAAGTTCTTCAGACAGGAACTCCAATGTCTTTAGAGGTGGGGTTCTGCATTCTAATAGCAAGAATGATCGGAGTGACACACAGGCATCCCCACTTTCTGCAAGTGGTATCAGAGCACCAAGTCCACTACATGAGAGGTCTACAGATATCAATGATAATTCATCAACTAAACAGCGAGCGTCTTTGGAAAATGATATCGAGTTGCTACAGCTGCGCTTGCAACAAGAGAGATCTATGCGAAGTATGCTAGAAAGGGCAATGGGTCGTGCATCAAGTACTTTATCTCCCGGGCATAGGCACTTAGCCCAGACGAAGGATTTGATCTCAGAAATTGAATTACTTGAAGAAGAGGTCGCAAACCGTGAACAGCATGTGCTCTCTCTCTATAGAAGTATTTTTGAAAATTGTGTTAGTAAGCCATCTTCTCAGCAAAATTCAGTCACAGCCTCTCCAGCTCATGGGAAGCATGAATCAAGAAAACACCCCAGTATCATTTCAAGTGCGTTTTGTTCGTCGAGGAAGTTTCCTTTGGGACCTTTGCAACCTTTCTCTGTAAATGACTTGGGAAAAAGAACCTCAAATGCTGGTCCTAATTCCTTGTTTGGAGGTAAAAGCGACATAAGTATGGGGAAAACTTCAGGCACTGCAAAGGTTCGTGAAGCCTTTTCGCAGGCGAAGAGAACTTCTCTGCAAACTCTAAAAGACCATCTTTTTGAGTGTCCAAGTAAATTATCAGAGGAGATGGTGAGGTGCATGGCTTTTATATACTGCTCTCTTCATAGAGTGGCATCAAACAAGGCTAAAAAAAAGGCAGGTTCCTTTCCTGAAGTTAAACAGCCCCAATGTGGACCGGCGGAGGAACAATTTGGGGGTGGGAAAGCAATGCTGGAAATACATTGCATATCAACCAATAACAGCCAGTTCTCTCGTGCTTCATACGCAATCAACAATTATAGAGTATTAGTTGAGCAGCTGGAAAATGTGAATGTCAGTAAGATGGGGATCGATGCTCAAACTGCATTCTGGATTAATGTGTATAATGCTCTTCTTATGCATGCCTATTTGGCATATGGAATCCCTCATGGCTCTCTAAGAAGGTTGGCTTTGTTCCATAAGGCTGCTTACAACATCGGTGGCCATATCATCAGTGCAAATGCAATAGAGCAATCAATTTTTTTCTTCAAATCTCCCCGAATAGGATGGTGGCTTGAAACTATCATTTCAACTGCGCTGAGGAAGAAGTCTGGGGAAGAAAGGCAACTAATCTCTTCAAAATTGGGCCTTCATAGTCCTCAACCTCTTGTTTGCTTTGGCCTCTGCACTGGTGCCTCTTCAGATCCTGTGCTGAAAGTGTACACTGCATCAAATATTAAAGAGGAACTGGAGGTGGCCAAAAGGGATTTTCTCCAAGCAAATATAGTTGTGAAGAAGTCAAAGAAAGTATTCCTACCAAAGGTGCTCGAGAGATTTGCACGTGAAGCATCCATCAGCTCAGAAGAACTCCCGAAATGGGTTTCTGAAAATGTCGACGGAAAACTCCACGAGTCCATACAGAAATGTACGGAACATCGGACTGGCAAGAAGGCATCTCAGATCATTGAGTGGCTACCTTACAGCTCAAGGTTCCGAACTTTGCCGGACTTCACCACTCTCGACGCCGATAACAACAACTACGCATGGAGAAACTTCGCTCGGTTCCTCGACGCTGGCAAAGGCAGCGAAGTGAACGGCATGTCGGAACTGAAGAAGTATTTGAATCGATTCGGTTACCTTCCGATTCCTCCTCAAAACAACTTCTCCGATTTCTTCGACGATCAATTCGTATCGGCGTTGATTCTCTATCAGAATCGCTTAGGTTTATCAGTCACTGGAAAACTCGATTCCGAAACAAT
CGCAAGCATCATGTCGCCTAGATGCGGAATGAGTGACCTAATTAAAATCAACAACAACAACACAACAATTCACTCAACGCGTCGATACGCTTTCTTCAACGGCCAACCGAGATGGATTCGATCCTCAACTCTAACGTACGCTCTCTCACCAGATTACACAATCGAATACCTAACTTCATCAGAAATCCGCAAGGTCGTTCGACGATCGTTCTCGCGGTGGTCCGCAGTGATTCCGTTGAACTTCACCGAATCCTCTGATTACGAATCGTCAGATATCCGAATCGGATTCTACCGCGGCGATCACGGCGATGGAGAGGCGTTCGACGGAGTATTAGGCGTTTTGGCACACGCGTTTTCACCGGAAAACGGAAGGCTGCACTTAGATGCGGCGGAGCGCTGGGCGGTGGATTTCGAGCAGGAGAAATCGAAAGTAGCTGTGGATTTGGAATCGGTTGTAACGCATGAGATAGGGCATGTTCTTGGACTGGCTCATTCCGCAGTGAAGGAATCTGTGATGTATCCAAGTTTGAGTCCGCGAGGGAAGAAAGTGGACCTTAGGATCGATGACGTAGAAGGAATTCAGTATTTGTATGGTACGAACCCTAATTTCAAATTGAAATCCTTCTTGGAATCCGAAAAATCTATCAACAGTGGATCATCATCTTCTTCAATCAACACTAATTTCTTCTTCTTACTGTTATTCTACTTGTTGGTTTGGGTTGGGTCTCTGTTTTTCTGA

Coding sequence (CDS)

ATGAGTGGATTCGATATGCATATGCGTGGTGAAGAAACAGCTTCTGGGAAGAGAGAACTTAGAGATTATTTAGCGTCTCAACGTGTTCATTACCGCCATAGGCGATCTAGAAGTTCTTCAGACAGGAACTCCAATGTCTTTAGAGGTGGGGTTCTGCATTCTAATAGCAAGAATGATCGGAGTGACACACAGGCATCCCCACTTTCTGCAAGTGGTATCAGAGCACCAAGTCCACTACATGAGAGGTCTACAGATATCAATGATAATTCATCAACTAAACAGCGAGCGTCTTTGGAAAATGATATCGAGTTGCTACAGCTGCGCTTGCAACAAGAGAGATCTATGCGAAGTATGCTAGAAAGGGCAATGGGTCGTGCATCAAGTACTTTATCTCCCGGGCATAGGCACTTAGCCCAGACGAAGGATTTGATCTCAGAAATTGAATTACTTGAAGAAGAGGTCGCAAACCGTGAACAGCATGTGCTCTCTCTCTATAGAAGTATTTTTGAAAATTGTGTTAGTAAGCCATCTTCTCAGCAAAATTCAGTCACAGCCTCTCCAGCTCATGGGAAGCATGAATCAAGAAAACACCCCAGTATCATTTCAAGTGCGTTTTGTTCGTCGAGGAAGTTTCCTTTGGGACCTTTGCAACCTTTCTCTGTAAATGACTTGGGAAAAAGAACCTCAAATGCTGGTCCTAATTCCTTGTTTGGAGGTAAAAGCGACATAAGTATGGGGAAAACTTCAGGCACTGCAAAGGTTCGTGAAGCCTTTTCGCAGGCGAAGAGAACTTCTCTGCAAACTCTAAAAGACCATCTTTTTGAGTGTCCAAGTAAATTATCAGAGGAGATGGTGAGGTGCATGGCTTTTATATACTGCTCTCTTCATAGAGTGGCATCAAACAAGGCTAAAAAAAAGGCAGGTTCCTTTCCTGAAGTTAAACAGCCCCAATGTGGACCGGCGGAGGAACAATTTGGGGGTGGGAAAGCAATGCTGGAAATACATTGCATATCAACCAATAACAGCCAGTTCTCTCGTGCTTCATACGCAATCAACAATTATAGAGTATTAGTTGAGCAGCTGGAAAATGTGAATGTCAGTAAGATGGGGATCGATGCTCAAACTGCATTCTGGATTAATGTGTATAATGCTCTTCTTATGCATGCCTATTTGGCATATGGAATCCCTCATGGCTCTCTAAGAAGGTTGGCTTTGTTCCATAAGGCTGCTTACAACATCGGTGGCCATATCATCAGTGCAAATGCAATAGAGCAATCAATTTTTTTCTTCAAATCTCCCCGAATAGGATGGTGGCTTGAAACTATCATTTCAACTGCGCTGAGGAAGAAGTCTGGGGAAGAAAGGCAACTAATCTCTTCAAAATTGGGCCTTCATAGTCCTCAACCTCTTGTTTGCTTTGGCCTCTGCACTGGTGCCTCTTCAGATCCTGTGCTGAAAGTGTACACTGCATCAAATATTAAAGAGGAACTGGAGGTGGCCAAAAGGGATTTTCTCCAAGCAAATATAGTTGTGAAGAAGTCAAAGAAAGTATTCCTACCAAAGGTGCTCGAGAGATTTGCACGTGAAGCATCCATCAGCTCAGAAGAACTCCCGAAATGGGTTTCTGAAAATGTCGACGGAAAACTCCACGAGTCCATACAGAAATGTACGGAACATCGGACTGGCAAGAAGGCATCTCAGATCATTGAGTGGCTACCTTACAGCTCAAGGTTCCGAACTTTGCCGGACTTCACCACTCTCGACGCCGATAACAACAACTACGCATGGAGAAACTTCGCTCGGTTCCTCGACGCTGGCAAAGGCAGCGAAGTGAACGGCATGTCGGAACTGAAGAAGTATTTGAATCGATTCGGTTACCTTCCGATTCCTCCTCAAAACAACTTCTCCGATTTCTTCGACGATCAATTCGTATCGGCGTTGATTCTCTATCAGAATCGCTTAGGTTTATCAGTCACTGGAAAACTCGATTCCGAAACAAT
CGCAAGCATCATGTCGCCTAGATGCGGAATGAGTGACCTAATTAAAATCAACAACAACAACACAACAATTCACTCAACGCGTCGATACGCTTTCTTCAACGGCCAACCGAGATGGATTCGATCCTCAACTCTAACGTACGCTCTCTCACCAGATTACACAATCGAATACCTAACTTCATCAGAAATCCGCAAGGTCGTTCGACGATCGTTCTCGCGGTGGTCCGCAGTGATTCCGTTGAACTTCACCGAATCCTCTGATTACGAATCGTCAGATATCCGAATCGGATTCTACCGCGGCGATCACGGCGATGGAGAGGCGTTCGACGGAGTATTAGGCGTTTTGGCACACGCGTTTTCACCGGAAAACGGAAGGCTGCACTTAGATGCGGCGGAGCGCTGGGCGGTGGATTTCGAGCAGGAGAAATCGAAAGTAGCTGTGGATTTGGAATCGGTTGTAACGCATGAGATAGGGCATGTTCTTGGACTGGCTCATTCCGCAGTGAAGGAATCTGTGATGTATCCAAGTTTGAGTCCGCGAGGGAAGAAAGTGGACCTTAGGATCGATGACGTAGAAGGAATTCAGTATTTGTATGGTACGAACCCTAATTTCAAATTGAAATCCTTCTTGGAATCCGAAAAATCTATCAACAGTGGATCATCATCTTCTTCAATCAACACTAATTTCTTCTTCTTACTGTTATTCTACTTGTTGGTTTGGGTTGGGTCTCTGTTTTTCTGA

Protein sequence

MSGFDMHMRGEETASGKRELRDYLASQRVHYRHRRSRSSSDRNSNVFRGGVLHSNSKNDRSDTQASPLSASGIRAPSPLHERSTDINDNSSTKQRASLENDIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQNSVTASPAHGKHESRKHPSIISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPNSLFGGKSDISMGKTSGTAKVREAFSQAKRTSLQTLKDHLFECPSKLSEEMVRCMAFIYCSLHRVASNKAKKKAGSFPEVKQPQCGPAEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLENVNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHGSLRRLALFHKAAYNIGGHIISANAIEQSIFFFKSPRIGWWLETIISTALRKKSGEERQLISSKLGLHSPQPLVCFGLCTGASSDPVLKVYTASNIKEELEVAKRDFLQANIVVKKSKKVFLPKVLERFAREASISSEELPKWVSENVDGKLHESIQKCTEHRTGKKASQIIEWLPYSSRFRTLPDFTTLDADNNNYAWRNFARFLDAGKGSEVNGMSELKKYLNRFGYLPIPPQNNFSDFFDDQFVSALILYQNRLGLSVTGKLDSETIASIMSPRCGMSDLIKINNNNTTIHSTRRYAFFNGQPRWIRSSTLTYALSPDYTIEYLTSSEIRKVVRRSFSRWSAVIPLNFTESSDYESSDIRIGFYRGDHGDGEAFDGVLGVLAHAFSPENGRLHLDAAERWAVDFEQEKSKVAVDLESVVTHEIGHVLGLAHSAVKESVMYPSLSPRGKKVDLRIDDVEGIQYLYGTNPNFKLKSFLESEKSINSGSSSSSINTNFFFLLLFYLLVWVGSLFF
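
The protein sequence above corresponds to the CDS read in frame from its first base. As an illustrative sketch, assuming the standard nuclear genetic code (NCBI translation table 1), the relationship can be checked on short stretches of the CDS. The helper name translate is hypothetical; this is not necessarily how the database derived its protein record.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 36 bases of the CDS listed above.
print(translate("ATGAGTGGATTCGATATGCATATGCGTGGTGAAGAA"))  # MSGFDMHMRGEE
```

The first twelve residues (MSGFDMHMRGEE) and the last four before the TGA stop codon (SLFF, from TCTCTGTTTTTCTGA) agree with the protein sequence shown on this page.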
Homology
BLAST of Cla97C01G004270 vs. NCBI nr
Match: XP_011654811.1 (uncharacterized protein LOC101204173 isoform X1 [Cucumis sativus] >KAE8647898.1 hypothetical protein Csa_000413 [Cucumis sativus])

HSP 1 Score: 1068.9 bits (2763), Expect = 2.4e-308
Identity = 551/579 (95.16%), Positives = 560/579 (96.72%), Query Frame = 0

Query: 1   MSGFDMHMRGEETASGKRELRDYLASQRVHYRHRRSRSSSDRNSNVFRGGVLHSNSKNDR 60
           MSG DMHMRGEE+ASGKRELRDYLASQRVH RHRRSRSSSD+NSN FRG  LHSNSKNDR
Sbjct: 1   MSGLDMHMRGEESASGKRELRDYLASQRVHSRHRRSRSSSDKNSNGFRGASLHSNSKNDR 60

Query: 61  SDTQASPLSASGIRAPSPLHERSTDINDNSSTKQRASLENDIELLQLRLQQERSMRSMLE 120
           SD QASPLS SGIRA SPLHE+STD NDNSSTKQRASLENDIELLQLRLQQERSMRSMLE
Sbjct: 61  SDAQASPLSTSGIRARSPLHEQSTDFNDNSSTKQRASLENDIELLQLRLQQERSMRSMLE 120

Query: 121 RAMGRASSTLSPGHRHLAQTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQ 180
           RAMGRASSTLSPGHRH AQTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQ
Sbjct: 121 RAMGRASSTLSPGHRHFAQTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQ 180

Query: 181 NSVTASPAHGKHESRKHPSIISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPNSLFGGK 240
           NSVTASPAHGKHESRKHPSIISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPNSLFG K
Sbjct: 181 NSVTASPAHGKHESRKHPSIISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPNSLFGSK 240

Query: 241 SDISMGKTSGTAKVREAFSQAKRTSLQTLKDHLFECPSKLSEEMVRCMAFIYCSLHRVAS 300
           SDIS GKTSGTAKVREAFSQ KRTSL++LKDHLFECPSKLSEEMVRCMAFIYCSLHRVAS
Sbjct: 241 SDISTGKTSGTAKVREAFSQMKRTSLRSLKDHLFECPSKLSEEMVRCMAFIYCSLHRVAS 300

Query: 301 NKAKKKAGSFPEVKQPQCGPAEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQ 360
           NKA+KKAGSFP+VKQPQCGP EEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQ
Sbjct: 301 NKAQKKAGSFPKVKQPQCGPVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQ 360

Query: 361 LENVNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHGSLRRLALFHKAAYNIGGHIISA 420
           LE VNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHGSLRRLALFHKAAYNIGGHIISA
Sbjct: 361 LEKVNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHGSLRRLALFHKAAYNIGGHIISA 420

Query: 421 NAIEQSIFFFKSPRIGWWLETIISTALRKKSGEERQLISSKLGLHSPQPLVCFGLCTGAS 480
           NAIEQSIFFFKSPRIGWWLETIISTALRKKSGEERQLISSKLGL SPQPLVCFGLCTGAS
Sbjct: 421 NAIEQSIFFFKSPRIGWWLETIISTALRKKSGEERQLISSKLGLPSPQPLVCFGLCTGAS 480

Query: 481 SDPVLKVYTASNIKEELEVAKRDFLQANIVVKKSKKVFLPKVLERFAREASISSEELPKW 540
           SDPVLKVYTASN+KEELEVAKRDFLQANIVVKKSKKVFLPKVLERFAREASISS+ELPKW
Sbjct: 481 SDPVLKVYTASNVKEELEVAKRDFLQANIVVKKSKKVFLPKVLERFAREASISSDELPKW 540

Query: 541 VSENVDGKLHESIQKCTEHRTGKKASQIIEWLPYSSRFR 580
           VSENVDGKL ESIQKC EHRTGKK SQIIEWLPYSSRFR
Sbjct: 541 VSENVDGKLQESIQKCMEHRTGKKTSQIIEWLPYSSRFR 579
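
The percentages in each HSP summary line are the ratio of matching (or positively scoring) columns to the alignment length. A small sketch of recomputing them from the summary line follows; the function name hsp_percentages is hypothetical, and the regular expression is an assumption about the line layout seen on this page rather than a general BLAST parser.

```python
import re

def hsp_percentages(summary_line):
    """Return {label: (count, alignment_length, percent)} for each 'label = a/b' field."""
    result = {}
    for label, count, total in re.findall(r"(\w+) = (\d+)/(\d+)", summary_line):
        count, total = int(count), int(total)
        result[label] = (count, total, round(100.0 * count / total, 2))
    return result

line = "Identity = 551/579 (95.16%), Positives = 560/579 (96.72%), Query Frame = 0"
stats = hsp_percentages(line)
print(stats["Identity"][2])   # 95.16
print(stats["Positives"][2])  # 96.72
```

For the first hit, 551 identical columns out of 579 aligned columns gives 95.16% identity, and 560 out of 579 gives 96.72% positives, matching the reported values.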

BLAST of Cla97C01G004270 vs. NCBI nr
Match: XP_008437070.1 (PREDICTED: uncharacterized protein LOC103482606 isoform X2 [Cucumis melo])

HSP 1 Score: 1065.1 bits (2753), Expect = 3.4e-307
Identity = 550/579 (94.99%), Positives = 560/579 (96.72%), Query Frame = 0

Query: 1   MSGFDMHMRGEETASGKRELRDYLASQRVHYRHRRSRSSSDRNSNVFRGGVLHSNSKNDR 60
           MSG DMHMRGEE+ASGKRELRDYLASQRVH RHRRSRSSSD+NSN FRGG LHSNSKNDR
Sbjct: 1   MSGLDMHMRGEESASGKRELRDYLASQRVHSRHRRSRSSSDKNSNGFRGGSLHSNSKNDR 60

Query: 61  SDTQASPLSASGIRAPSPLHERSTDINDNSSTKQRASLENDIELLQLRLQQERSMRSMLE 120
           SD QASPLS SGIRA SPLHE+STD NDNSSTKQRASLENDIELLQLRLQQERSMRSMLE
Sbjct: 61  SDAQASPLSTSGIRARSPLHEQSTDFNDNSSTKQRASLENDIELLQLRLQQERSMRSMLE 120

Query: 121 RAMGRASSTLSPGHRHLAQTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQ 180
           RAMGRASSTLSPGHRHLAQTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQ
Sbjct: 121 RAMGRASSTLSPGHRHLAQTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQ 180

Query: 181 NSVTASPAHGKHESRKHPSIISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPNSLFGGK 240
           NSVTASPAHGKHESRKHPSIISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPNSLFGGK
Sbjct: 181 NSVTASPAHGKHESRKHPSIISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPNSLFGGK 240

Query: 241 SDISMGKTSGTAKVREAFSQAKRTSLQTLKDHLFECPSKLSEEMVRCMAFIYCSLHRVAS 300
           SDIS GKTSGTAKVREAFSQ KRTSL++LKDHLFECPSKLSEEMVRCMAFIYCSLHRVAS
Sbjct: 241 SDISTGKTSGTAKVREAFSQMKRTSLRSLKDHLFECPSKLSEEMVRCMAFIYCSLHRVAS 300

Query: 301 NKAKKKAGSFPEVKQPQCGPAEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQ 360
           NKA+KKAGSFP+VKQPQ  P EEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQ
Sbjct: 301 NKAQKKAGSFPQVKQPQREPVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQ 360

Query: 361 LENVNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHGSLRRLALFHKAAYNIGGHIISA 420
           LE VNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHGSLRRLALFHKAAYNIGGHIISA
Sbjct: 361 LEKVNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHGSLRRLALFHKAAYNIGGHIISA 420

Query: 421 NAIEQSIFFFKSPRIGWWLETIISTALRKKSGEERQLISSKLGLHSPQPLVCFGLCTGAS 480
           NAIEQSIFFFKSPRIGWWLETIISTALRKKSGEERQLISSKLGL SPQPLVCFGLCTGAS
Sbjct: 421 NAIEQSIFFFKSPRIGWWLETIISTALRKKSGEERQLISSKLGLPSPQPLVCFGLCTGAS 480

Query: 481 SDPVLKVYTASNIKEELEVAKRDFLQANIVVKKSKKVFLPKVLERFAREASISSEELPKW 540
           SDPVLKVYTASN+KEELE AKRDFLQANIVVKKSKKVFLPKVLERFAREASISS+ELPKW
Sbjct: 481 SDPVLKVYTASNVKEELEAAKRDFLQANIVVKKSKKVFLPKVLERFAREASISSDELPKW 540

Query: 541 VSENVDGKLHESIQKCTEHRTGKKASQIIEWLPYSSRFR 580
           VS+NVDGKL ESIQKC EHRTGKK SQIIEWLPYSSRFR
Sbjct: 541 VSDNVDGKLQESIQKCMEHRTGKKTSQIIEWLPYSSRFR 579

BLAST of Cla97C01G004270 vs. NCBI nr
Match: XP_038874743.1 (uncharacterized protein LOC120067282 isoform X1 [Benincasa hispida])

HSP 1 Score: 1061.6 bits (2744), Expect = 3.8e-306
Identity = 547/579 (94.47%), Positives = 560/579 (96.72%), Query Frame = 0

Query: 1   MSGFDMHMRGEETASGKRELRDYLASQRVHYRHRRSRSSSDRNSNVFRGGVLHSNSKNDR 60
           MSGFDMHMRGEE+AS KRELRD+LASQRVH  HRRSRSSSDRNSNVFRGGVLHS+SKNDR
Sbjct: 1   MSGFDMHMRGEESASAKRELRDFLASQRVHSSHRRSRSSSDRNSNVFRGGVLHSDSKNDR 60

Query: 61  SDTQASPLSASGIRAPSPLHERSTDINDNSSTKQRASLENDIELLQLRLQQERSMRSMLE 120
           SD QASPLS SGIRA SPLHE S ++NDNSS+KQRASLENDIELLQLRLQQERSMRSMLE
Sbjct: 61  SDAQASPLSTSGIRARSPLHESSKNLNDNSSSKQRASLENDIELLQLRLQQERSMRSMLE 120

Query: 121 RAMGRASSTLSPGHRHLAQTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQ 180
           RAMGRASSTLSPGHRHLAQTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQ
Sbjct: 121 RAMGRASSTLSPGHRHLAQTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQ 180

Query: 181 NSVTASPAHGKHESRKHPSIISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPNSLFGGK 240
           NSVTASPAHGKHESRKHPSIISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPNSLFGGK
Sbjct: 181 NSVTASPAHGKHESRKHPSIISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPNSLFGGK 240

Query: 241 SDISMGKTSGTAKVREAFSQAKRTSLQTLKDHLFECPSKLSEEMVRCMAFIYCSLHRVAS 300
           SD+S GKTSGTAKVREAFSQ KR SL+TLKDHLFECPSKLSEEMVRCMAFIYCSLHR AS
Sbjct: 241 SDVSTGKTSGTAKVREAFSQVKRNSLRTLKDHLFECPSKLSEEMVRCMAFIYCSLHRTAS 300

Query: 301 NKAKKKAGSFPEVKQPQCGPAEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQ 360
           NKA+KKAGSFP+VKQPQCGP EEQFGG KAMLEIHCIST+NSQFSRASYAINNYRVLVEQ
Sbjct: 301 NKAQKKAGSFPKVKQPQCGPVEEQFGGVKAMLEIHCISTHNSQFSRASYAINNYRVLVEQ 360

Query: 361 LENVNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHGSLRRLALFHKAAYNIGGHIISA 420
           LE VNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHGSLRRLALFHKAAYNIGGHIISA
Sbjct: 361 LEKVNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHGSLRRLALFHKAAYNIGGHIISA 420

Query: 421 NAIEQSIFFFKSPRIGWWLETIISTALRKKSGEERQLISSKLGLHSPQPLVCFGLCTGAS 480
           NAIEQSIFFFKSPRIGWWLETIISTALRKKSGEERQLISSKLGL SPQPLVCFGLCTGAS
Sbjct: 421 NAIEQSIFFFKSPRIGWWLETIISTALRKKSGEERQLISSKLGLPSPQPLVCFGLCTGAS 480

Query: 481 SDPVLKVYTASNIKEELEVAKRDFLQANIVVKKSKKVFLPKVLERFAREASISSEELPKW 540
           SDPVLKVYTASN+KEELEVAKRDFLQANIVVKKSKKVFLPKVLERFAREASI S+EL KW
Sbjct: 481 SDPVLKVYTASNVKEELEVAKRDFLQANIVVKKSKKVFLPKVLERFAREASIGSDELLKW 540

Query: 541 VSENVDGKLHESIQKCTEHRTGKKASQIIEWLPYSSRFR 580
           VSENVDGKLHESIQKC EHRTGKKASQIIEWLPYSSRFR
Sbjct: 541 VSENVDGKLHESIQKCMEHRTGKKASQIIEWLPYSSRFR 579

BLAST of Cla97C01G004270 vs. NCBI nr
Match: XP_016903702.1 (PREDICTED: uncharacterized protein LOC103482606 isoform X1 [Cucumis melo])

HSP 1 Score: 1052.4 bits (2720), Expect = 2.3e-303
Identity = 550/601 (91.51%), Positives = 560/601 (93.18%), Query Frame = 0

Query: 1   MSGFDMHMRGEETASGKRELRDYLASQRVHYRHRRSRSSSDRNSNVFRGGVLHSNSKNDR 60
           MSG DMHMRGEE+ASGKRELRDYLASQRVH RHRRSRSSSD+NSN FRGG LHSNSKNDR
Sbjct: 1   MSGLDMHMRGEESASGKRELRDYLASQRVHSRHRRSRSSSDKNSNGFRGGSLHSNSKNDR 60

Query: 61  SDTQASPLSASGIRAPSPLHERSTDINDNSSTKQRASLENDIELLQLRLQQERSMRSMLE 120
           SD QASPLS SGIRA SPLHE+STD NDNSSTKQRASLENDIELLQLRLQQERSMRSMLE
Sbjct: 61  SDAQASPLSTSGIRARSPLHEQSTDFNDNSSTKQRASLENDIELLQLRLQQERSMRSMLE 120

Query: 121 RAMGRASSTLSPGHRHLA----------------------QTKDLISEIELLEEEVANRE 180
           RAMGRASSTLSPGHRHLA                      QTKDLISEIELLEEEVANRE
Sbjct: 121 RAMGRASSTLSPGHRHLAQELFCKVGVPSCQGSFLLFTFYQTKDLISEIELLEEEVANRE 180

Query: 181 QHVLSLYRSIFENCVSKPSSQQNSVTASPAHGKHESRKHPSIISSAFCSSRKFPLGPLQP 240
           QHVLSLYRSIFENCVSKPSSQQNSVTASPAHGKHESRKHPSIISSAFCSSRKFPLGPLQP
Sbjct: 181 QHVLSLYRSIFENCVSKPSSQQNSVTASPAHGKHESRKHPSIISSAFCSSRKFPLGPLQP 240

Query: 241 FSVNDLGKRTSNAGPNSLFGGKSDISMGKTSGTAKVREAFSQAKRTSLQTLKDHLFECPS 300
           FSVNDLGKRTSNAGPNSLFGGKSDIS GKTSGTAKVREAFSQ KRTSL++LKDHLFECPS
Sbjct: 241 FSVNDLGKRTSNAGPNSLFGGKSDISTGKTSGTAKVREAFSQMKRTSLRSLKDHLFECPS 300

Query: 301 KLSEEMVRCMAFIYCSLHRVASNKAKKKAGSFPEVKQPQCGPAEEQFGGGKAMLEIHCIS 360
           KLSEEMVRCMAFIYCSLHRVASNKA+KKAGSFP+VKQPQ  P EEQFGGGKAMLEIHCIS
Sbjct: 301 KLSEEMVRCMAFIYCSLHRVASNKAQKKAGSFPQVKQPQREPVEEQFGGGKAMLEIHCIS 360

Query: 361 TNNSQFSRASYAINNYRVLVEQLENVNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHG 420
           TNNSQFSRASYAINNYRVLVEQLE VNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHG
Sbjct: 361 TNNSQFSRASYAINNYRVLVEQLEKVNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHG 420

Query: 421 SLRRLALFHKAAYNIGGHIISANAIEQSIFFFKSPRIGWWLETIISTALRKKSGEERQLI 480
           SLRRLALFHKAAYNIGGHIISANAIEQSIFFFKSPRIGWWLETIISTALRKKSGEERQLI
Sbjct: 421 SLRRLALFHKAAYNIGGHIISANAIEQSIFFFKSPRIGWWLETIISTALRKKSGEERQLI 480

Query: 481 SSKLGLHSPQPLVCFGLCTGASSDPVLKVYTASNIKEELEVAKRDFLQANIVVKKSKKVF 540
           SSKLGL SPQPLVCFGLCTGASSDPVLKVYTASN+KEELE AKRDFLQANIVVKKSKKVF
Sbjct: 481 SSKLGLPSPQPLVCFGLCTGASSDPVLKVYTASNVKEELEAAKRDFLQANIVVKKSKKVF 540

Query: 541 LPKVLERFAREASISSEELPKWVSENVDGKLHESIQKCTEHRTGKKASQIIEWLPYSSRF 580
           LPKVLERFAREASISS+ELPKWVS+NVDGKL ESIQKC EHRTGKK SQIIEWLPYSSRF
Sbjct: 541 LPKVLERFAREASISSDELPKWVSDNVDGKLQESIQKCMEHRTGKKTSQIIEWLPYSSRF 600

BLAST of Cla97C01G004270 vs. NCBI nr
Match: KAG6579407.1 (hypothetical protein SDJN03_23855, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 998.8 bits (2581), Expect = 3.0e-287
Identity = 519/579 (89.64%), Positives = 539/579 (93.09%), Query Frame = 0

Query: 1   MSGFDMHMRGEETASGKRELRDYLASQRVHYRHRRSRSSSDRNSNVFRGGVLHSNSKNDR 60
           MSGFDMH+RGEE ASG RELRDYLAS  VH RHRRSRSSSDRNSNV RGGVLHSNSKN R
Sbjct: 1   MSGFDMHLRGEEKASGNRELRDYLASHHVHARHRRSRSSSDRNSNVIRGGVLHSNSKNGR 60

Query: 61  SDTQASPLSASGIRAPSPLHERSTDINDNSSTKQRASLENDIELLQLRLQQERSMRSMLE 120
           SD QASPLS SGIRA SPLHE +T+ NDNSS+K RASLENDIELLQLRLQQERSMRSMLE
Sbjct: 61  SDKQASPLSTSGIRARSPLHEGATNFNDNSSSKHRASLENDIELLQLRLQQERSMRSMLE 120

Query: 121 RAMGRASSTLSPGHRHLAQTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQ 180
           RAMGRASSTLSPGHRHLAQTKDLI+EIELLEEEVANREQHVLSLYRSIFENCVSK SSQQ
Sbjct: 121 RAMGRASSTLSPGHRHLAQTKDLITEIELLEEEVANREQHVLSLYRSIFENCVSKTSSQQ 180

Query: 181 NSVTASPAHGKHESRKHPSIISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPNSLFGGK 240
           +SVT SPAHGKHES+KHPSIISSAFCSSRKFPLGPLQPFSVNDLGKR SNAGPNSL GGK
Sbjct: 181 SSVTVSPAHGKHESKKHPSIISSAFCSSRKFPLGPLQPFSVNDLGKRASNAGPNSLLGGK 240

Query: 241 SDISMGKTSGTAKVREAFSQAKRTSLQTLKDHLFECPSKLSEEMVRCMAFIYCSLHRVAS 300
            DIS GK SG AKVREA SQ K+TSL+TLKDHLFECPSKLSEEMVRCMAFIYCSLHRVAS
Sbjct: 241 GDISTGKISGPAKVREALSQVKKTSLRTLKDHLFECPSKLSEEMVRCMAFIYCSLHRVAS 300

Query: 301 NKAKKKAGSFPEVKQPQCGPAEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQ 360
           N AKKKA SF +VK+P+ GP EEQ G  KAMLEIHCISTNN+QFSRASYAINNYRVLVEQ
Sbjct: 301 NNAKKKASSFAKVKRPESGPVEEQCGDVKAMLEIHCISTNNTQFSRASYAINNYRVLVEQ 360

Query: 361 LENVNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHGSLRRLALFHKAAYNIGGHIISA 420
           LE VNVSKM IDAQTAFWINVYNALLMHAYLAYGIPHGSLRRLALFHKAAYNIGG IISA
Sbjct: 361 LEKVNVSKMEIDAQTAFWINVYNALLMHAYLAYGIPHGSLRRLALFHKAAYNIGGRIISA 420

Query: 421 NAIEQSIFFFKSPRIGWWLETIISTALRKKSGEERQLISSKLGLHSPQPLVCFGLCTGAS 480
           NAIEQSIF FKSPRIGWWLETIISTALRKKSGEERQLISSKLGL S QPLVCFGLCTGAS
Sbjct: 421 NAIEQSIFSFKSPRIGWWLETIISTALRKKSGEERQLISSKLGLPSLQPLVCFGLCTGAS 480

Query: 481 SDPVLKVYTASNIKEELEVAKRDFLQANIVVKKSKKVFLPKVLERFAREASISSEELPKW 540
           SDPVLKVYTASN+KEELE+AKR+FLQANIVVKKSKKVFLPKVLERFAREASISS+ELPKW
Sbjct: 481 SDPVLKVYTASNVKEELELAKREFLQANIVVKKSKKVFLPKVLERFAREASISSDELPKW 540

Query: 541 VSENVDGKLHESIQKCTEHRTGKKASQIIEWLPYSSRFR 580
           +SENVDGKLHESIQKC + +TGKKAS IIEWLPYSSRFR
Sbjct: 541 ISENVDGKLHESIQKCMDLQTGKKASHIIEWLPYSSRFR 579

BLAST of Cla97C01G004270 vs. ExPASy Swiss-Prot
Match: O23507 (Metalloendoproteinase 1-MMP OS=Arabidopsis thaliana OX=3702 GN=1MMP PE=1 SV=1)

HSP 1 Score: 332.8 bits (852), Expect = 1.2e-89
Identity = 170/322 (52.80%), Positives = 228/322 (70.81%), Query Frame = 0

Query: 592 NNYAWRNFARFLDAGKGSEVNGMSELKKYLNRFGYLPIPPQNNFSDFFDDQFVSALILYQ 651
           +N  W +F+R +D   GS V+G+SELK+YL+RFGY+       FSD FD    SA+ LYQ
Sbjct: 48  SNSTWHDFSRLVDVQIGSHVSGVSELKRYLHRFGYVN-DGSEIFSDVFDGPLESAISLYQ 107

Query: 652 NRLGLSVTGKLDSETIASIMSPRCGMSDLIKINNNNTTIHSTRRYAFFNGQPRWIRSSTL 711
             LGL +TG+LD+ T+  +  PRCG+SD   +  NN  +H+T  Y +FNG+P+W R  TL
Sbjct: 108 ENLGLPITGRLDTSTVTLMSLPRCGVSD-THMTINNDFLHTTAHYTYFNGKPKWNR-DTL 167

Query: 712 TYALSPDYTIEYLTSSEIRKVVRRSFSRWSAVIPLNFTESSDYESSDIRIGFYRGDHGDG 771
           TYA+S  + ++YLTS +++ V RR+FS+WS+VIP++F E  D+ ++D++IGFY GDHGDG
Sbjct: 168 TYAISKTHKLDYLTSEDVKTVFRRAFSQWSSVIPVSFEEVDDFTTADLKIGFYAGDHGDG 227

Query: 772 EAFDGVLGVLAHAFSPENGRLHLDAAERWAVDFE-QEKSKVAVDLESVVTHEIGHVLGLA 831
             FDGVLG LAHAF+PENGRLHLDAAE W VD + +  S+VAVDLESV THEIGH+LGL 
Sbjct: 228 LPFDGVLGTLAHAFAPENGRLHLDAAETWIVDDDLKGSSEVAVDLESVATHEIGHLLGLG 287

Query: 832 HSAVKESVMYPSLSPRGKKVDLRIDDVEGIQYLYGTNPNFKLKSFLESEKSINSGSSSSS 891
           HS+ + +VMYPSL PR KKVDL +DDV G+  LYG NP  +L S  +SE SI +G+ S  
Sbjct: 288 HSSQESAVMYPSLRPRTKKVDLTVDDVAGVLKLYGPNPKLRLDSLTQSEDSIKNGTVSHR 347

Query: 892 INTNFFFLLLFYLLVWVGSLFF 913
             +  F   + Y+L+ VG + F
Sbjct: 348 FLSGNF---IGYVLLVVGLILF 363

BLAST of Cla97C01G004270 vs. ExPASy Swiss-Prot
Match: Q8GWW6 (Metalloendoproteinase 4-MMP OS=Arabidopsis thaliana OX=3702 GN=4MMP PE=1 SV=1)

HSP 1 Score: 316.6 bits (810), Expect = 9.1e-85
Identity = 156/275 (56.73%), Positives = 196/275 (71.27%), Query Frame = 0

Query: 616 ELKKYLNRFGYLPIPPQNNFSDFFDDQFVSALILYQNRLGLSVTGKLDSETIASIMSPRC 675
           E+K++L ++GYL   PQN  SD  D  F  AL+ YQ  LGL +TGK DS+T++ I+ PRC
Sbjct: 52  EIKRHLQQYGYL---PQNKESD--DVSFEQALVRYQKNLGLPITGKPDSDTLSQILLPRC 111

Query: 676 GMSDLIKINNNNTTIHSTRRYAFFNGQPRWIRS--STLTYALSPDYTIEYLTSSEIRKVV 735
           G  D   +       H+ ++Y +F G+PRW R     LTYA S +    YL  ++IR+V 
Sbjct: 112 GFPD--DVEPKTAPFHTGKKYVYFPGRPRWTRDVPLKLTYAFSQENLTPYLAPTDIRRVF 171

Query: 736 RRSFSRWSAVIPLNFTESSDYESSDIRIGFYRGDHGDGEAFDGVLGVLAHAFSPENGRLH 795
           RR+F +W++VIP++F E+ DY  +DI+IGF+ GDHGDGE FDGVLGVLAH FSPENGRLH
Sbjct: 172 RRAFGKWASVIPVSFIETEDYVIADIKIGFFNGDHGDGEPFDGVLGVLAHTFSPENGRLH 231

Query: 796 LDAAERWAVDFEQEKSKVAVDLESVVTHEIGHVLGLAHSAVKESVMYPSLSPRGKKVDLR 855
           LD AE WAVDF++EKS VAVDLESV  HEIGHVLGL HS+VK++ MYP+L PR KKV+L 
Sbjct: 232 LDKAETWAVDFDEEKSSVAVDLESVAVHEIGHVLGLGHSSVKDAAMYPTLKPRSKKVNLN 291

Query: 856 IDDVEGIQYLYGTNPNFKLKSFLESEKSINSGSSS 889
           +DDV G+Q LYGTNPNF L S L SE S N    S
Sbjct: 292 MDDVVGVQSLYGTNPNFTLNSLLASETSTNLADGS 319

BLAST of Cla97C01G004270 vs. ExPASy Swiss-Prot
Match: O04529 (Metalloendoproteinase 2-MMP OS=Arabidopsis thaliana OX=3702 GN=2MMP PE=1 SV=1)

HSP 1 Score: 282.7 bits (722), Expect = 1.5e-74
Identity = 144/293 (49.15%), Positives = 185/293 (63.14%), Query Frame = 0

Query: 596 WRNFARFLDAGKGSEVNGMSELKKYLNRFGYLPIPPQNNFSDFFDDQFVSALILYQNRLG 655
           W  F+ F     G  V+G+  +KKY  RFGY+P     NF+D FDD   +A+ LYQ    
Sbjct: 41  WDAFSNFTGCHHGQNVDGLYRIKKYFQRFGYIPETFSGNFTDDFDDILKAAVELYQTNFN 100

Query: 656 LSVTGKLDSETIASIMSPRCGMSDLI--------------KINNNNTTIHSTRRYAFFNG 715
           L+VTG+LD+ TI  I+ PRCG  D++              ++N + T +H+ +RY  F G
Sbjct: 101 LNVTGELDALTIQHIVIPRCGNPDVVNGTSLMHGGRRKTFEVNFSRTHLHAVKRYTLFPG 160

Query: 716 QPRWIRS-STLTYALSPDYTIEYLTSSEIRKVVRRSFSRWSAVIPLNFTESSDYESSDIR 775
           +PRW R+   LTYA  P   +    + E++ V  R+F RWS V  LNFT S  + +SDI 
Sbjct: 161 EPRWPRNRRDLTYAFDPKNPL----TEEVKSVFSRAFGRWSDVTALNFTLSESFSTSDIT 220

Query: 776 IGFYRGDHGDGEAFDGVLGVLAHAFSPENGRLHLDAAERWAVDFEQE---KSKVAVDLES 835
           IGFY GDHGDGE FDGVLG LAHAFSP +G+ HLDA E W V  + +       AVDLES
Sbjct: 221 IGFYTGDHGDGEPFDGVLGTLAHAFSPPSGKFHLDADENWVVSGDLDSFLSVTAAVDLES 280

Query: 836 VVTHEIGHVLGLAHSAVKESVMYPSLSPRGKKVDLRIDDVEGIQYLYGTNPNF 871
           V  HEIGH+LGL HS+V+ES+MYP+++   +KVDL  DDVEGIQYLYG NPNF
Sbjct: 281 VAVHEIGHLLGLGHSSVEESIMYPTITTGKRKVDLTNDDVEGIQYLYGANPNF 329

BLAST of Cla97C01G004270 vs. ExPASy Swiss-Prot
Match: Q9ZUJ5 (Metalloendoproteinase 5-MMP OS=Arabidopsis thaliana OX=3702 GN=5MMP PE=1 SV=1)

HSP 1 Score: 266.2 bits (679), Expect = 1.4e-69
Identity = 144/329 (43.77%), Positives = 196/329 (59.57%), Query Frame = 0

Query: 573 PYSSRFRT----LPDFTTLDADNNNYAWRNFARFLDAGKGSEVNGMSELKKYLNRFGYLP 632
           P S++F T    +P    L+A  N  AW  F++      G  +NG+S+LK+Y  RFGY  
Sbjct: 17  PISAKFYTNVSSIPPLQFLNATQN--AWETFSKLAGCHIGENINGLSKLKQYFRRFGY-- 76

Query: 633 IPPQNNFSDFFDDQFVSALILYQNRLGLSVTGKLDSETIASIMSPRCGMSDLI---KINN 692
           I    N +D FDD   SA+  YQ    L VTGKLDS T+  I+ PRCG  DLI      N
Sbjct: 77  ITTTGNCTDDFDDVLQSAINTYQKNFNLKVTGKLDSSTLRQIVKPRCGNPDLIDGVSEMN 136

Query: 693 NNTTIHSTRRYAFFNGQPRW-IRSSTLTYALSPDYTIEYLTSSEIRKVVRRSFSRWSAVI 752
               + +T +Y+FF G+PRW  R   LTYA +P   +    + E+++V  R+F+RW+ V 
Sbjct: 137 GGKILRTTEKYSFFPGKPRWPKRKRDLTYAFAPQNNL----TDEVKRVFSRAFTRWAEVT 196

Query: 753 PLNFTESSDYESSDIRIGFYRGDHGDGEAFDGVLGVLAHAFSPENGRLHLDAAERWAVDF 812
           PLNFT S     +DI IGF+ G+HGDGE FDG +G LAHA SP  G LHLD  E W +  
Sbjct: 197 PLNFTRSESILRADIVIGFFSGEHGDGEPFDGAMGTLAHASSPPTGMLHLDGDEDWLISN 256

Query: 813 EQEKSKV-----AVDLESVVTHEIGHVLGLAHSAVKESVMYPSLSPRGKKVDLRIDDVEG 872
            +   ++      VDLESV  HEIGH+LGL HS+V++++M+P++S   +KV+L  DD+EG
Sbjct: 257 GEISRRILPVTTVVDLESVAVHEIGHLLGLGHSSVEDAIMFPAISGGDRKVELAKDDIEG 316

Query: 873 IQYLYGTNPNFKLKSFLESEKSINSGSSS 889
           IQ+LYG NPN        S +S ++G  S
Sbjct: 317 IQHLYGGNPNGDGGGSKPSRESQSTGGDS 337

BLAST of Cla97C01G004270 vs. ExPASy Swiss-Prot
Match: Q5XF51 (Metalloendoproteinase 3-MMP OS=Arabidopsis thaliana OX=3702 GN=3MMP PE=1 SV=1)

HSP 1 Score: 246.9 bits (629), Expect = 8.8e-64
Identity = 143/348 (41.09%), Positives = 196/348 (56.32%), Query Frame = 0

Query: 596 WRNFARFLDAGKGSEVNGMSELKKYLNRFGYL-PIPPQNNFSDFFDDQFVSALILYQNRL 655
           W +F  F     G + +G+  LK+Y   FGY+       NF+D FDD   +A+ +YQ   
Sbjct: 43  WNSFLNFTGCHAGKKYDGLYMLKQYFQHFGYITETNLSGNFTDDFDDILKNAVEMYQRNF 102

Query: 656 GLSVTGKLDSETIASIMSPRCGMSDLIKINNNNTTIHSTRR------------------Y 715
            L+VTG LD  T+  ++ PRCG  D++   N  +T+HS R+                  Y
Sbjct: 103 QLNVTGVLDELTLKHVVIPRCGNPDVV---NGTSTMHSGRKTFEVSFAGRGQRFHAVKHY 162

Query: 716 AFFNGQPRWIRS-STLTYALSPDYTIEYLTSSEIRKVVRRSFSRWSAVIPLNFTESSDYE 775
           +FF G+PRW R+   LTYA  P   +    + E++ V  R+F+RW  V PL FT    + 
Sbjct: 163 SFFPGEPRWPRNRRDLTYAFDPRNAL----TEEVKSVFSRAFTRWEEVTPLTFTRVERFS 222

Query: 776 SSDIRIGFYRGDHGDGEAFDGVLGVLAHAFSPENGRLHLDAAERWAVDFEQEKSKV---- 835
           +SDI IGFY G+HGDGE FDG +  LAHAFSP  G  HLD  E W V  E     +    
Sbjct: 223 TSDISIGFYSGEHGDGEPFDGPMRTLAHAFSPPTGHFHLDGEENWIVSGEGGDGFISVSE 282

Query: 836 AVDLESVVTHEIGHVLGLAHSAVKESVMYPSLSPRGKKVDLRIDDVEGIQYLYGTNPNFK 895
           AVDLESV  HEIGH+LGL HS+V+ S+MYP++    +KVDL  DDVEG+QYLYG NPNF 
Sbjct: 283 AVDLESVAVHEIGHLLGLGHSSVEGSIMYPTIRTGRRKVDLTTDDVEGVQYLYGANPNFN 342

Query: 896 -LKSFLESEKSINSGSS--------SSSINTN---FFFLLLFYLLVWV 908
             +S   S +  ++G S        S S+ TN   ++F ++F L +++
Sbjct: 343 GSRSPPPSTQQRDTGDSGAPGRSDGSRSVLTNLLQYYFWIIFGLFLYL 383

BLAST of Cla97C01G004270 vs. ExPASy TrEMBL
Match: A0A1S3AT95 (uncharacterized protein LOC103482606 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103482606 PE=4 SV=1)

HSP 1 Score: 1065.1 bits (2753), Expect = 1.7e-307
Identity = 550/579 (94.99%), Positives = 560/579 (96.72%), Query Frame = 0

Query: 1   MSGFDMHMRGEETASGKRELRDYLASQRVHYRHRRSRSSSDRNSNVFRGGVLHSNSKNDR 60
           MSG DMHMRGEE+ASGKRELRDYLASQRVH RHRRSRSSSD+NSN FRGG LHSNSKNDR
Sbjct: 1   MSGLDMHMRGEESASGKRELRDYLASQRVHSRHRRSRSSSDKNSNGFRGGSLHSNSKNDR 60

Query: 61  SDTQASPLSASGIRAPSPLHERSTDINDNSSTKQRASLENDIELLQLRLQQERSMRSMLE 120
           SD QASPLS SGIRA SPLHE+STD NDNSSTKQRASLENDIELLQLRLQQERSMRSMLE
Sbjct: 61  SDAQASPLSTSGIRARSPLHEQSTDFNDNSSTKQRASLENDIELLQLRLQQERSMRSMLE 120

Query: 121 RAMGRASSTLSPGHRHLAQTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQ 180
           RAMGRASSTLSPGHRHLAQTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQ
Sbjct: 121 RAMGRASSTLSPGHRHLAQTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQ 180

Query: 181 NSVTASPAHGKHESRKHPSIISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPNSLFGGK 240
           NSVTASPAHGKHESRKHPSIISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPNSLFGGK
Sbjct: 181 NSVTASPAHGKHESRKHPSIISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPNSLFGGK 240

Query: 241 SDISMGKTSGTAKVREAFSQAKRTSLQTLKDHLFECPSKLSEEMVRCMAFIYCSLHRVAS 300
           SDIS GKTSGTAKVREAFSQ KRTSL++LKDHLFECPSKLSEEMVRCMAFIYCSLHRVAS
Sbjct: 241 SDISTGKTSGTAKVREAFSQMKRTSLRSLKDHLFECPSKLSEEMVRCMAFIYCSLHRVAS 300

Query: 301 NKAKKKAGSFPEVKQPQCGPAEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQ 360
           NKA+KKAGSFP+VKQPQ  P EEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQ
Sbjct: 301 NKAQKKAGSFPQVKQPQREPVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQ 360

Query: 361 LENVNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHGSLRRLALFHKAAYNIGGHIISA 420
           LE VNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHGSLRRLALFHKAAYNIGGHIISA
Sbjct: 361 LEKVNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHGSLRRLALFHKAAYNIGGHIISA 420

Query: 421 NAIEQSIFFFKSPRIGWWLETIISTALRKKSGEERQLISSKLGLHSPQPLVCFGLCTGAS 480
           NAIEQSIFFFKSPRIGWWLETIISTALRKKSGEERQLISSKLGL SPQPLVCFGLCTGAS
Sbjct: 421 NAIEQSIFFFKSPRIGWWLETIISTALRKKSGEERQLISSKLGLPSPQPLVCFGLCTGAS 480

Query: 481 SDPVLKVYTASNIKEELEVAKRDFLQANIVVKKSKKVFLPKVLERFAREASISSEELPKW 540
           SDPVLKVYTASN+KEELE AKRDFLQANIVVKKSKKVFLPKVLERFAREASISS+ELPKW
Sbjct: 481 SDPVLKVYTASNVKEELEAAKRDFLQANIVVKKSKKVFLPKVLERFAREASISSDELPKW 540

Query: 541 VSENVDGKLHESIQKCTEHRTGKKASQIIEWLPYSSRFR 580
           VS+NVDGKL ESIQKC EHRTGKK SQIIEWLPYSSRFR
Sbjct: 541 VSDNVDGKLQESIQKCMEHRTGKKTSQIIEWLPYSSRFR 579

BLAST of Cla97C01G004270 vs. ExPASy TrEMBL
Match: A0A1S4E657 (uncharacterized protein LOC103482606 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103482606 PE=4 SV=1)

HSP 1 Score: 1052.4 bits (2720), Expect = 1.1e-303
Identity = 550/601 (91.51%), Positives = 560/601 (93.18%), Query Frame = 0

Query: 1   MSGFDMHMRGEETASGKRELRDYLASQRVHYRHRRSRSSSDRNSNVFRGGVLHSNSKNDR 60
           MSG DMHMRGEE+ASGKRELRDYLASQRVH RHRRSRSSSD+NSN FRGG LHSNSKNDR
Sbjct: 1   MSGLDMHMRGEESASGKRELRDYLASQRVHSRHRRSRSSSDKNSNGFRGGSLHSNSKNDR 60

Query: 61  SDTQASPLSASGIRAPSPLHERSTDINDNSSTKQRASLENDIELLQLRLQQERSMRSMLE 120
           SD QASPLS SGIRA SPLHE+STD NDNSSTKQRASLENDIELLQLRLQQERSMRSMLE
Sbjct: 61  SDAQASPLSTSGIRARSPLHEQSTDFNDNSSTKQRASLENDIELLQLRLQQERSMRSMLE 120

Query: 121 RAMGRASSTLSPGHRHLA----------------------QTKDLISEIELLEEEVANRE 180
           RAMGRASSTLSPGHRHLA                      QTKDLISEIELLEEEVANRE
Sbjct: 121 RAMGRASSTLSPGHRHLAQELFCKVGVPSCQGSFLLFTFYQTKDLISEIELLEEEVANRE 180

Query: 181 QHVLSLYRSIFENCVSKPSSQQNSVTASPAHGKHESRKHPSIISSAFCSSRKFPLGPLQP 240
           QHVLSLYRSIFENCVSKPSSQQNSVTASPAHGKHESRKHPSIISSAFCSSRKFPLGPLQP
Sbjct: 181 QHVLSLYRSIFENCVSKPSSQQNSVTASPAHGKHESRKHPSIISSAFCSSRKFPLGPLQP 240

Query: 241 FSVNDLGKRTSNAGPNSLFGGKSDISMGKTSGTAKVREAFSQAKRTSLQTLKDHLFECPS 300
           FSVNDLGKRTSNAGPNSLFGGKSDIS GKTSGTAKVREAFSQ KRTSL++LKDHLFECPS
Sbjct: 241 FSVNDLGKRTSNAGPNSLFGGKSDISTGKTSGTAKVREAFSQMKRTSLRSLKDHLFECPS 300

Query: 301 KLSEEMVRCMAFIYCSLHRVASNKAKKKAGSFPEVKQPQCGPAEEQFGGGKAMLEIHCIS 360
           KLSEEMVRCMAFIYCSLHRVASNKA+KKAGSFP+VKQPQ  P EEQFGGGKAMLEIHCIS
Sbjct: 301 KLSEEMVRCMAFIYCSLHRVASNKAQKKAGSFPQVKQPQREPVEEQFGGGKAMLEIHCIS 360

Query: 361 TNNSQFSRASYAINNYRVLVEQLENVNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHG 420
           TNNSQFSRASYAINNYRVLVEQLE VNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHG
Sbjct: 361 TNNSQFSRASYAINNYRVLVEQLEKVNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHG 420

Query: 421 SLRRLALFHKAAYNIGGHIISANAIEQSIFFFKSPRIGWWLETIISTALRKKSGEERQLI 480
           SLRRLALFHKAAYNIGGHIISANAIEQSIFFFKSPRIGWWLETIISTALRKKSGEERQLI
Sbjct: 421 SLRRLALFHKAAYNIGGHIISANAIEQSIFFFKSPRIGWWLETIISTALRKKSGEERQLI 480

Query: 481 SSKLGLHSPQPLVCFGLCTGASSDPVLKVYTASNIKEELEVAKRDFLQANIVVKKSKKVF 540
           SSKLGL SPQPLVCFGLCTGASSDPVLKVYTASN+KEELE AKRDFLQANIVVKKSKKVF
Sbjct: 481 SSKLGLPSPQPLVCFGLCTGASSDPVLKVYTASNVKEELEAAKRDFLQANIVVKKSKKVF 540

Query: 541 LPKVLERFAREASISSEELPKWVSENVDGKLHESIQKCTEHRTGKKASQIIEWLPYSSRF 580
           LPKVLERFAREASISS+ELPKWVS+NVDGKL ESIQKC EHRTGKK SQIIEWLPYSSRF
Sbjct: 541 LPKVLERFAREASISSDELPKWVSDNVDGKLQESIQKCMEHRTGKKTSQIIEWLPYSSRF 600

BLAST of Cla97C01G004270 vs. ExPASy TrEMBL
Match: A0A6J1I221 (uncharacterized protein LOC111469131 OS=Cucurbita maxima OX=3661 GN=LOC111469131 PE=4 SV=1)

HSP 1 Score: 996.5 bits (2575), Expect = 7.3e-287
Identity = 518/579 (89.46%), Positives = 538/579 (92.92%), Query Frame = 0

Query: 1   MSGFDMHMRGEETASGKRELRDYLASQRVHYRHRRSRSSSDRNSNVFRGGVLHSNSKNDR 60
           MSGFDMH+RGEE ASG RELRDYLAS  VH RHRRSRSSSDRNSNV RGGVLHSNSKN R
Sbjct: 1   MSGFDMHLRGEEKASGNRELRDYLASHHVHARHRRSRSSSDRNSNVIRGGVLHSNSKNGR 60

Query: 61  SDTQASPLSASGIRAPSPLHERSTDINDNSSTKQRASLENDIELLQLRLQQERSMRSMLE 120
           SD QASPLS SGIRA SPLHE +T+ NDNSS+K RASLENDIELLQLRLQQERSMRSMLE
Sbjct: 61  SDKQASPLSTSGIRARSPLHEGATNFNDNSSSKHRASLENDIELLQLRLQQERSMRSMLE 120

Query: 121 RAMGRASSTLSPGHRHLAQTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQ 180
           RAMGRASSTLSPGHRHLAQTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSK SSQQ
Sbjct: 121 RAMGRASSTLSPGHRHLAQTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKTSSQQ 180

Query: 181 NSVTASPAHGKHESRKHPSIISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPNSLFGGK 240
           +SVT SPAHGKHES+KHPSIISSAFCSSRKFPLGPLQPFSVN+LGKR SNAGPNSL GGK
Sbjct: 181 SSVTVSPAHGKHESKKHPSIISSAFCSSRKFPLGPLQPFSVNNLGKRASNAGPNSLLGGK 240

Query: 241 SDISMGKTSGTAKVREAFSQAKRTSLQTLKDHLFECPSKLSEEMVRCMAFIYCSLHRVAS 300
            DIS  K SG AKVRE  SQ K+TSL+TLKDHLFECPSKLSEEMVRCMAFIYCSLHRVAS
Sbjct: 241 GDISTEKFSGPAKVREVLSQVKKTSLRTLKDHLFECPSKLSEEMVRCMAFIYCSLHRVAS 300

Query: 301 NKAKKKAGSFPEVKQPQCGPAEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQ 360
           N AKKKA SFP+VK+P+ GP EEQ G  KAMLEIHCISTNN+QFSRASYAINNYRVLVEQ
Sbjct: 301 NNAKKKASSFPKVKRPESGPVEEQCGDVKAMLEIHCISTNNTQFSRASYAINNYRVLVEQ 360

Query: 361 LENVNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHGSLRRLALFHKAAYNIGGHIISA 420
           LE VNVSKM IDAQTAFWINVYNALLMHAYLAYGIPHGSLRRLALFHKAAYNIGG IISA
Sbjct: 361 LEKVNVSKMEIDAQTAFWINVYNALLMHAYLAYGIPHGSLRRLALFHKAAYNIGGRIISA 420

Query: 421 NAIEQSIFFFKSPRIGWWLETIISTALRKKSGEERQLISSKLGLHSPQPLVCFGLCTGAS 480
           NAIEQSIF FKSPRIGWWLETIISTALRKKSGEERQLISSKLGL S QPLVCFGLCTGAS
Sbjct: 421 NAIEQSIFSFKSPRIGWWLETIISTALRKKSGEERQLISSKLGLPSLQPLVCFGLCTGAS 480

Query: 481 SDPVLKVYTASNIKEELEVAKRDFLQANIVVKKSKKVFLPKVLERFAREASISSEELPKW 540
           SDPVLKVYTASN+KEELE+AKR+FLQANIVVKKSKKVFLPKVLERFAREASISS+ELPKW
Sbjct: 481 SDPVLKVYTASNVKEELELAKREFLQANIVVKKSKKVFLPKVLERFAREASISSDELPKW 540

Query: 541 VSENVDGKLHESIQKCTEHRTGKKASQIIEWLPYSSRFR 580
           +SENVDGKLHESIQKC + +TGKKAS IIEWLPYSSRFR
Sbjct: 541 ISENVDGKLHESIQKCVDLQTGKKASHIIEWLPYSSRFR 579

BLAST of Cla97C01G004270 vs. ExPASy TrEMBL
Match: A0A6J1E2A4 (uncharacterized protein LOC111430177 OS=Cucurbita moschata OX=3662 GN=LOC111430177 PE=4 SV=1)

HSP 1 Score: 995.0 bits (2571), Expect = 2.1e-286
Identity = 517/579 (89.29%), Positives = 537/579 (92.75%), Query Frame = 0

Query: 1   MSGFDMHMRGEETASGKRELRDYLASQRVHYRHRRSRSSSDRNSNVFRGGVLHSNSKNDR 60
           MSGFDMH+RGEE ASG RELRDYLAS  VH RHRRSRSSSDRNSNV RGGVLHSNSKN R
Sbjct: 1   MSGFDMHLRGEEKASGNRELRDYLASHHVHARHRRSRSSSDRNSNVIRGGVLHSNSKNGR 60

Query: 61  SDTQASPLSASGIRAPSPLHERSTDINDNSSTKQRASLENDIELLQLRLQQERSMRSMLE 120
           SD QASPLS SGIRA SPLHE +T+ NDNS +K RASLENDIELLQLRLQQERSMRSMLE
Sbjct: 61  SDKQASPLSTSGIRARSPLHEGATNFNDNSCSKHRASLENDIELLQLRLQQERSMRSMLE 120

Query: 121 RAMGRASSTLSPGHRHLAQTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQ 180
           RAMGRASSTLSPGHRHLAQTKDLI+EIELLEEEVANREQHVLSLYRSIFENCVSK SSQQ
Sbjct: 121 RAMGRASSTLSPGHRHLAQTKDLITEIELLEEEVANREQHVLSLYRSIFENCVSKTSSQQ 180

Query: 181 NSVTASPAHGKHESRKHPSIISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPNSLFGGK 240
           +SVT SPAHGKHES+KHPSIISSAFCSSRKFPLGPLQPFSVNDLGKR SNAGPNSL GGK
Sbjct: 181 SSVTVSPAHGKHESKKHPSIISSAFCSSRKFPLGPLQPFSVNDLGKRASNAGPNSLLGGK 240

Query: 241 SDISMGKTSGTAKVREAFSQAKRTSLQTLKDHLFECPSKLSEEMVRCMAFIYCSLHRVAS 300
            DIS GK SG AKVREA S  K+TSL+TLKDHLFECPSKLSEEMVRCMAFIYCSLHRVAS
Sbjct: 241 GDISTGKISGPAKVREALSHVKKTSLRTLKDHLFECPSKLSEEMVRCMAFIYCSLHRVAS 300

Query: 301 NKAKKKAGSFPEVKQPQCGPAEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQ 360
           N AKKKA SF +VK+P+ GP EEQ G  KAMLEIHCISTNN+QFSRASYAINNYRVLVEQ
Sbjct: 301 NNAKKKASSFAKVKRPESGPVEEQCGDVKAMLEIHCISTNNTQFSRASYAINNYRVLVEQ 360

Query: 361 LENVNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHGSLRRLALFHKAAYNIGGHIISA 420
           LE VNVSKM IDAQTAFWINVYNALLMHAYLAYGIPHGSLRRLALFHKAAYNIGG IISA
Sbjct: 361 LEKVNVSKMEIDAQTAFWINVYNALLMHAYLAYGIPHGSLRRLALFHKAAYNIGGRIISA 420

Query: 421 NAIEQSIFFFKSPRIGWWLETIISTALRKKSGEERQLISSKLGLHSPQPLVCFGLCTGAS 480
           NAIEQSIF FKSPRIGWWLETIISTALRKKSGEERQLISSKLGL S QPLVCFGLCTGAS
Sbjct: 421 NAIEQSIFSFKSPRIGWWLETIISTALRKKSGEERQLISSKLGLPSLQPLVCFGLCTGAS 480

Query: 481 SDPVLKVYTASNIKEELEVAKRDFLQANIVVKKSKKVFLPKVLERFAREASISSEELPKW 540
           SDPVLKVYTASN+KEELE+AKR+FLQANIVVKKSKKVFLPKVLERFAREASISS+ELPKW
Sbjct: 481 SDPVLKVYTASNVKEELELAKREFLQANIVVKKSKKVFLPKVLERFAREASISSDELPKW 540

Query: 541 VSENVDGKLHESIQKCTEHRTGKKASQIIEWLPYSSRFR 580
           +SENVDGKLHESIQKC + +TGKKAS IIEWLPYSSRFR
Sbjct: 541 ISENVDGKLHESIQKCMDLQTGKKASHIIEWLPYSSRFR 579

BLAST of Cla97C01G004270 vs. ExPASy TrEMBL
Match: A0A6J1DMR5 (uncharacterized protein LOC111021990 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111021990 PE=4 SV=1)

HSP 1 Score: 994.6 bits (2570), Expect = 2.8e-286
Identity = 522/580 (90.00%), Positives = 542/580 (93.45%), Query Frame = 0

Query: 1   MSGFDMHMRGEETASGKRELRDYLASQRVHYRHRRSRSSSDRNSNVFRGGVLHSNSKNDR 60
           MSGFD  MRGEET +GKRELRDYLASQRVH RHRRSRSSSDRNSNVFRGGVLHSN KND+
Sbjct: 1   MSGFD--MRGEETGAGKRELRDYLASQRVHARHRRSRSSSDRNSNVFRGGVLHSNKKNDQ 60

Query: 61  SDTQASPLSASGIRAPSPLHERSTDINDNSSTKQRASLENDIELLQLRLQQERSMRSMLE 120
           SD QASPLS SGIRA SPLHE ST  NDNSS+K RASLENDIELLQLRLQQERSMRSMLE
Sbjct: 61  SDAQASPLSTSGIRAQSPLHESSTKFNDNSSSKHRASLENDIELLQLRLQQERSMRSMLE 120

Query: 121 RAMGRASSTLSPGHRHLAQTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQ 180
           RAMGRASSTLSPGHRHLAQTKDLI+EIELLEEEVANREQHVLSLYRSIFE CVSKPSSQQ
Sbjct: 121 RAMGRASSTLSPGHRHLAQTKDLITEIELLEEEVANREQHVLSLYRSIFEQCVSKPSSQQ 180

Query: 181 NSVTASPAHGKHESRKHPSIISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPNSLFGGK 240
           NSVTASPAHGKHESRKHPS+ISSAFCSS+KFPLGPLQPFSVNDLGKRTSNAGPNSLFGGK
Sbjct: 181 NSVTASPAHGKHESRKHPSVISSAFCSSKKFPLGPLQPFSVNDLGKRTSNAGPNSLFGGK 240

Query: 241 SDISMGKT-SGTAKVREAFSQAKRTSLQTLKDHLFECPSKLSEEMVRCMAFIYCSLHRVA 300
           S+I+ GKT SGT+KVRE  SQ KRTSL+TLKDHLFECPSKLSEEMVRCMA IYCSLHRVA
Sbjct: 241 SNINTGKTSSGTSKVRETISQVKRTSLRTLKDHLFECPSKLSEEMVRCMADIYCSLHRVA 300

Query: 301 SNKAKKKAGSFPEVKQPQCGPAEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVE 360
           SNKA+KK GS P+VKQPQCGP EEQ   GKAMLEIH ISTNNSQFSRAS+AIN YRVLVE
Sbjct: 301 SNKAQKKRGSLPDVKQPQCGPLEEQCVSGKAMLEIHWISTNNSQFSRASFAINTYRVLVE 360

Query: 361 QLENVNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHGSLRRLALFHKAAYNIGGHIIS 420
           QLE VNVSKM IDAQTAFWINVYNALLMHAYLAYGIP  SLRRLALFHKAAYNIGGHIIS
Sbjct: 361 QLEKVNVSKMEIDAQTAFWINVYNALLMHAYLAYGIPQSSLRRLALFHKAAYNIGGHIIS 420

Query: 421 ANAIEQSIFFFKSPRIGWWLETIISTALRKKSGEERQLISSKLGLHSPQPLVCFGLCTGA 480
           ANAIEQSIF FK+PRIGWWLETIISTALRKKSGEERQLISSKLGL SPQPLVCFGLCTGA
Sbjct: 421 ANAIEQSIFCFKTPRIGWWLETIISTALRKKSGEERQLISSKLGLPSPQPLVCFGLCTGA 480

Query: 481 SSDPVLKVYTASNIKEELEVAKRDFLQANIVVKKSKKVFLPKVLERFAREASISSEELPK 540
           SSDPVLKVYTASN+KEELEVAKRDFLQANIVVKKSKKVFLPKVLERFAREASIS +EL K
Sbjct: 481 SSDPVLKVYTASNVKEELEVAKRDFLQANIVVKKSKKVFLPKVLERFAREASISPDELLK 540

Query: 541 WVSENVDGKLHESIQKCTEHRTGKKASQIIEWLPYSSRFR 580
            VS+NVD +LH+SIQKC +HRTGKKASQIIEWLPYSSRFR
Sbjct: 541 RVSDNVDVELHDSIQKCMDHRTGKKASQIIEWLPYSSRFR 578

BLAST of Cla97C01G004270 vs. TAIR 10
Match: AT5G47380.1 (Protein of unknown function, DUF547 )

HSP 1 Score: 517.3 bits (1331), Expect = 2.5e-146
Identity = 304/592 (51.35%), Positives = 399/592 (67.40%), Query Frame = 0

Query: 1   MSGFDMHMRGEETASGKRELRDYLASQRVHYRHRRSRSSSDRNSNVFRGGVLHSNSKNDR 60
           M GFD++  G +    +R   D       H   R   +SS+R+ +    G   S S N+ 
Sbjct: 1   MGGFDLNKDGNK----QRRNGDSWHCLDSHKHGRSKSASSERDLHTSGNGA--SQSANNF 60

Query: 61  SDTQASPLSASGIRAPSPLHERSTDINDNSSTKQRASLENDIELLQLRLQQERSMRSMLE 120
           +  QAS +  +  + P PLH       +N S+  RASLE D+E L LRLQQE+SMR +LE
Sbjct: 61  TRMQASSVQTTANKRPKPLHNCQMLTKNNVSSNDRASLERDVEQLHLRLQQEKSMRMVLE 120

Query: 121 RAMGRASSTLSPGHRHLA-QTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQ 180
           RAMGRASS+LSPGHRH A Q  +LI+EIELLE EV NRE HVLSLYRSIFE  VS+  S+
Sbjct: 121 RAMGRASSSLSPGHRHFAGQANELITEIELLEAEVTNREHHVLSLYRSIFEQTVSRAPSE 180

Query: 181 QNSVTASPAHG-KHESRKH-PSIISSAFCSSRKFPLGPLQPF-SVNDLGKRTSNAGPNSL 240
           Q+S  +SPAH  K   RK  P++IS+AFCSS  FPL P     ++ D  ++TS    +S 
Sbjct: 181 QSSSISSPAHHIKQPPRKQDPNVISNAFCSSNNFPLKPWHAMVTLKDSSRKTSKKDQSSQ 240

Query: 241 FGGKSDISMGKTSGTAKVREAFSQ----AKRTSLQTLKDHLFECPSKLSEEMVRCMAFIY 300
           F  ++ I    TS +++ +  F +     K  S +TLKDHL++CP+KLSE+MV+CM+ +Y
Sbjct: 241 FQFRNCIP-STTSCSSQAKSHFLKDSVTVKSPSQRTLKDHLYQCPNKLSEDMVKCMSSVY 300

Query: 301 ----CSLHRVASNKAKKKAGSFPEVKQPQCGPAEEQFGGGKAMLEIHCISTNNSQFSRAS 360
               CS       K      S   V  P+    E++    ++M+E+  IS++  +FS+ +
Sbjct: 301 FWLCCSAMSADPEKRILSRSSTSNVIIPKNIMNEDRAWSCRSMVEVSWISSDKKRFSQVT 360

Query: 361 YAINNYRVLVEQLENVNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHGSLRRLALFHK 420
           YAINNYR+LVEQLE V +++M  +A+ AFWIN+YNALLMHAYLAYG+P  SLRRLALFHK
Sbjct: 361 YAINNYRLLVEQLERVTINQMEGNAKLAFWINIYNALLMHAYLAYGVPAHSLRRLALFHK 420

Query: 421 AAYNIGGHIISANAIEQSIFFFKSPRIGWWLETIISTALRKKSGEERQLISSKLGLHSPQ 480
           +AYNIGGHII+AN IE SIF F++PR G WLETIISTALRKK  E++  + S   L  P+
Sbjct: 421 SAYNIGGHIINANTIEYSIFCFQTPRNGRWLETIISTALRKKPAEDK--VKSMFSLDKPE 480

Query: 481 PLVCFGLCTGASSDPVLKVYTASNIKEELEVAKRDFLQANIVVKKSKKVFLPKVLERFAR 540
           PLVCF LC GA SDPVLK YTASN+KEEL+ +KR+FL AN+VVK  KKV LPK++ERF +
Sbjct: 481 PLVCFALCIGALSDPVLKAYTASNVKEELDASKREFLGANVVVKMQKKVLLPKIIERFTK 540

Query: 541 EASISSEELPKWVSENVDGKLHESIQKCTEHR-TGKKASQIIEWLPYSSRFR 580
           EAS+S ++L +W+ +N D KL ESIQKC + +   KKASQ++EWLPYSS+FR
Sbjct: 541 EASLSFDDLMRWLIDNADEKLGESIQKCVQGKPNNKKASQVVEWLPYSSKFR 583

BLAST of Cla97C01G004270 vs. TAIR 10
Match: AT4G16640.1 (Matrixin family protein )

HSP 1 Score: 332.8 bits (852), Expect = 8.7e-91
Identity = 170/322 (52.80%), Positives = 228/322 (70.81%), Query Frame = 0

Query: 592 NNYAWRNFARFLDAGKGSEVNGMSELKKYLNRFGYLPIPPQNNFSDFFDDQFVSALILYQ 651
           +N  W +F+R +D   GS V+G+SELK+YL+RFGY+       FSD FD    SA+ LYQ
Sbjct: 48  SNSTWHDFSRLVDVQIGSHVSGVSELKRYLHRFGYVN-DGSEIFSDVFDGPLESAISLYQ 107

Query: 652 NRLGLSVTGKLDSETIASIMSPRCGMSDLIKINNNNTTIHSTRRYAFFNGQPRWIRSSTL 711
             LGL +TG+LD+ T+  +  PRCG+SD   +  NN  +H+T  Y +FNG+P+W R  TL
Sbjct: 108 ENLGLPITGRLDTSTVTLMSLPRCGVSD-THMTINNDFLHTTAHYTYFNGKPKWNR-DTL 167

Query: 712 TYALSPDYTIEYLTSSEIRKVVRRSFSRWSAVIPLNFTESSDYESSDIRIGFYRGDHGDG 771
           TYA+S  + ++YLTS +++ V RR+FS+WS+VIP++F E  D+ ++D++IGFY GDHGDG
Sbjct: 168 TYAISKTHKLDYLTSEDVKTVFRRAFSQWSSVIPVSFEEVDDFTTADLKIGFYAGDHGDG 227

Query: 772 EAFDGVLGVLAHAFSPENGRLHLDAAERWAVDFE-QEKSKVAVDLESVVTHEIGHVLGLA 831
             FDGVLG LAHAF+PENGRLHLDAAE W VD + +  S+VAVDLESV THEIGH+LGL 
Sbjct: 228 LPFDGVLGTLAHAFAPENGRLHLDAAETWIVDDDLKGSSEVAVDLESVATHEIGHLLGLG 287

Query: 832 HSAVKESVMYPSLSPRGKKVDLRIDDVEGIQYLYGTNPNFKLKSFLESEKSINSGSSSSS 891
           HS+ + +VMYPSL PR KKVDL +DDV G+  LYG NP  +L S  +SE SI +G+ S  
Sbjct: 288 HSSQESAVMYPSLRPRTKKVDLTVDDVAGVLKLYGPNPKLRLDSLTQSEDSIKNGTVSHR 347

Query: 892 INTNFFFLLLFYLLVWVGSLFF 913
             +  F   + Y+L+ VG + F
Sbjct: 348 FLSGNF---IGYVLLVVGLILF 363

BLAST of Cla97C01G004270 vs. TAIR 10
Match: AT2G45040.1 (Matrixin family protein )

HSP 1 Score: 316.6 bits (810), Expect = 6.4e-86
Identity = 156/275 (56.73%), Positives = 196/275 (71.27%), Query Frame = 0

Query: 616 ELKKYLNRFGYLPIPPQNNFSDFFDDQFVSALILYQNRLGLSVTGKLDSETIASIMSPRC 675
           E+K++L ++GYL   PQN  SD  D  F  AL+ YQ  LGL +TGK DS+T++ I+ PRC
Sbjct: 52  EIKRHLQQYGYL---PQNKESD--DVSFEQALVRYQKNLGLPITGKPDSDTLSQILLPRC 111

Query: 676 GMSDLIKINNNNTTIHSTRRYAFFNGQPRWIRS--STLTYALSPDYTIEYLTSSEIRKVV 735
           G  D   +       H+ ++Y +F G+PRW R     LTYA S +    YL  ++IR+V 
Sbjct: 112 GFPD--DVEPKTAPFHTGKKYVYFPGRPRWTRDVPLKLTYAFSQENLTPYLAPTDIRRVF 171

Query: 736 RRSFSRWSAVIPLNFTESSDYESSDIRIGFYRGDHGDGEAFDGVLGVLAHAFSPENGRLH 795
           RR+F +W++VIP++F E+ DY  +DI+IGF+ GDHGDGE FDGVLGVLAH FSPENGRLH
Sbjct: 172 RRAFGKWASVIPVSFIETEDYVIADIKIGFFNGDHGDGEPFDGVLGVLAHTFSPENGRLH 231

Query: 796 LDAAERWAVDFEQEKSKVAVDLESVVTHEIGHVLGLAHSAVKESVMYPSLSPRGKKVDLR 855
           LD AE WAVDF++EKS VAVDLESV  HEIGHVLGL HS+VK++ MYP+L PR KKV+L 
Sbjct: 232 LDKAETWAVDFDEEKSSVAVDLESVAVHEIGHVLGLGHSSVKDAAMYPTLKPRSKKVNLN 291

Query: 856 IDDVEGIQYLYGTNPNFKLKSFLESEKSINSGSSS 889
           +DDV G+Q LYGTNPNF L S L SE S N    S
Sbjct: 292 MDDVVGVQSLYGTNPNFTLNSLLASETSTNLADGS 319

BLAST of Cla97C01G004270 vs. TAIR 10
Match: AT1G70170.1 (matrix metalloproteinase )

HSP 1 Score: 282.7 bits (722), Expect = 1.0e-75
Identity = 144/293 (49.15%), Positives = 185/293 (63.14%), Query Frame = 0

Query: 596 WRNFARFLDAGKGSEVNGMSELKKYLNRFGYLPIPPQNNFSDFFDDQFVSALILYQNRLG 655
           W  F+ F     G  V+G+  +KKY  RFGY+P     NF+D FDD   +A+ LYQ    
Sbjct: 41  WDAFSNFTGCHHGQNVDGLYRIKKYFQRFGYIPETFSGNFTDDFDDILKAAVELYQTNFN 100

Query: 656 LSVTGKLDSETIASIMSPRCGMSDLI--------------KINNNNTTIHSTRRYAFFNG 715
           L+VTG+LD+ TI  I+ PRCG  D++              ++N + T +H+ +RY  F G
Sbjct: 101 LNVTGELDALTIQHIVIPRCGNPDVVNGTSLMHGGRRKTFEVNFSRTHLHAVKRYTLFPG 160

Query: 716 QPRWIRS-STLTYALSPDYTIEYLTSSEIRKVVRRSFSRWSAVIPLNFTESSDYESSDIR 775
           +PRW R+   LTYA  P   +    + E++ V  R+F RWS V  LNFT S  + +SDI 
Sbjct: 161 EPRWPRNRRDLTYAFDPKNPL----TEEVKSVFSRAFGRWSDVTALNFTLSESFSTSDIT 220

Query: 776 IGFYRGDHGDGEAFDGVLGVLAHAFSPENGRLHLDAAERWAVDFEQE---KSKVAVDLES 835
           IGFY GDHGDGE FDGVLG LAHAFSP +G+ HLDA E W V  + +       AVDLES
Sbjct: 221 IGFYTGDHGDGEPFDGVLGTLAHAFSPPSGKFHLDADENWVVSGDLDSFLSVTAAVDLES 280

Query: 836 VVTHEIGHVLGLAHSAVKESVMYPSLSPRGKKVDLRIDDVEGIQYLYGTNPNF 871
           V  HEIGH+LGL HS+V+ES+MYP+++   +KVDL  DDVEGIQYLYG NPNF
Sbjct: 281 VAVHEIGHLLGLGHSSVEESIMYPTITTGKRKVDLTNDDVEGIQYLYGANPNF 329

BLAST of Cla97C01G004270 vs. TAIR 10
Match: AT1G59970.1 (Matrixin family protein )

HSP 1 Score: 266.2 bits (679), Expect = 1.0e-70
Identity = 144/329 (43.77%), Positives = 196/329 (59.57%), Query Frame = 0

Query: 573 PYSSRFRT----LPDFTTLDADNNNYAWRNFARFLDAGKGSEVNGMSELKKYLNRFGYLP 632
           P S++F T    +P    L+A  N  AW  F++      G  +NG+S+LK+Y  RFGY  
Sbjct: 17  PISAKFYTNVSSIPPLQFLNATQN--AWETFSKLAGCHIGENINGLSKLKQYFRRFGY-- 76

Query: 633 IPPQNNFSDFFDDQFVSALILYQNRLGLSVTGKLDSETIASIMSPRCGMSDLI---KINN 692
           I    N +D FDD   SA+  YQ    L VTGKLDS T+  I+ PRCG  DLI      N
Sbjct: 77  ITTTGNCTDDFDDVLQSAINTYQKNFNLKVTGKLDSSTLRQIVKPRCGNPDLIDGVSEMN 136

Query: 693 NNTTIHSTRRYAFFNGQPRW-IRSSTLTYALSPDYTIEYLTSSEIRKVVRRSFSRWSAVI 752
               + +T +Y+FF G+PRW  R   LTYA +P   +    + E+++V  R+F+RW+ V 
Sbjct: 137 GGKILRTTEKYSFFPGKPRWPKRKRDLTYAFAPQNNL----TDEVKRVFSRAFTRWAEVT 196

Query: 753 PLNFTESSDYESSDIRIGFYRGDHGDGEAFDGVLGVLAHAFSPENGRLHLDAAERWAVDF 812
           PLNFT S     +DI IGF+ G+HGDGE FDG +G LAHA SP  G LHLD  E W +  
Sbjct: 197 PLNFTRSESILRADIVIGFFSGEHGDGEPFDGAMGTLAHASSPPTGMLHLDGDEDWLISN 256

Query: 813 EQEKSKV-----AVDLESVVTHEIGHVLGLAHSAVKESVMYPSLSPRGKKVDLRIDDVEG 872
            +   ++      VDLESV  HEIGH+LGL HS+V++++M+P++S   +KV+L  DD+EG
Sbjct: 257 GEISRRILPVTTVVDLESVAVHEIGHLLGLGHSSVEDAIMFPAISGGDRKVELAKDDIEG 316

Query: 873 IQYLYGTNPNFKLKSFLESEKSINSGSSS 889
           IQ+LYG NPN        S +S ++G  S
Sbjct: 317 IQHLYGGNPNGDGGGSKPSRESQSTGGDS 337
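
Each HSP statistics line above reports identical and positive positions as a count over the alignment length, followed by the percentage. The sketch below recomputes those percentages from the counts as a consistency check. It is illustrative only: the function name check_hsp_stats and the 0.01 tolerance are assumptions made here, not part of the database or of BLAST, and the pattern accepts both spellings of the "Positives" label.

```python
import re

# Matches lines such as:
#   Identity = 170/322 (52.80%), Positives = 228/322 (70.81%), Query Frame = 0
# "Posi?tives" also accepts the misspelled label "Postives".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def check_hsp_stats(line):
    """Recompute identity/positives percentages from an HSP statistics line.

    Hypothetical helper: returns the recomputed values (rounded to two
    decimals) and whether they agree with the reported percentages.
    """
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    identity = 100 * int(ident) / int(ident_len)
    positives = 100 * int(pos) / int(pos_len)
    consistent = (
        abs(identity - float(ident_pct)) < 0.01
        and abs(positives - float(pos_pct)) < 0.01
    )
    return {
        "identity": round(identity, 2),
        "positives": round(positives, 2),
        "consistent": consistent,
    }


if __name__ == "__main__":
    line = "Identity = 170/322 (52.80%), Positives = 228/322 (70.81%), Query Frame = 0"
    print(check_hsp_stats(line))
```

For the 1-MMP hit, 170 identical positions over an alignment length of 322 gives 52.80%, matching the reported value.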

The following BLAST results are available for this feature:
Match Name      E-value   Identity  Description
XP_011654811.1  2.4e-308  95.16     uncharacterized protein LOC101204173 isoform X1 [Cucumis sativus] >KAE8647898.1 ...
XP_008437070.1  3.4e-307  94.99     PREDICTED: uncharacterized protein LOC103482606 isoform X2 [Cucumis melo]
XP_038874743.1  3.8e-306  94.47     uncharacterized protein LOC120067282 isoform X1 [Benincasa hispida]
XP_016903702.1  2.3e-303  91.51     PREDICTED: uncharacterized protein LOC103482606 isoform X1 [Cucumis melo]
KAG6579407.1    3.0e-287  89.64     hypothetical protein SDJN03_23855, partial [Cucurbita argyrosperma subsp. sorori...

Match Name  E-value  Identity  Description
O23507      1.2e-89  52.80     Metalloendoproteinase 1-MMP OS=Arabidopsis thaliana OX=3702 GN=1MMP PE=1 SV=1
Q8GWW6      9.1e-85  56.73     Metalloendoproteinase 4-MMP OS=Arabidopsis thaliana OX=3702 GN=4MMP PE=1 SV=1
O04529      1.5e-74  49.15     Metalloendoproteinase 2-MMP OS=Arabidopsis thaliana OX=3702 GN=2MMP PE=1 SV=1
Q9ZUJ5      1.4e-69  43.77     Metalloendoproteinase 5-MMP OS=Arabidopsis thaliana OX=3702 GN=5MMP PE=1 SV=1
Q5XF51      8.8e-64  41.09     Metalloendoproteinase 3-MMP OS=Arabidopsis thaliana OX=3702 GN=3MMP PE=1 SV=1

Match Name  E-value   Identity  Description
A0A1S3AT95  1.7e-307  94.99     uncharacterized protein LOC103482606 isoform X2 OS=Cucumis melo OX=3656 GN=LOC10...
A0A1S4E657  1.1e-303  91.51     uncharacterized protein LOC103482606 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10...
A0A6J1I221  7.3e-287  89.46     uncharacterized protein LOC111469131 OS=Cucurbita maxima OX=3661 GN=LOC111469131...
A0A6J1E2A4  2.1e-286  89.29     uncharacterized protein LOC111430177 OS=Cucurbita moschata OX=3662 GN=LOC1114301...
A0A6J1DMR5  2.8e-286  90.00     uncharacterized protein LOC111021990 isoform X1 OS=Momordica charantia OX=3673 G...

Match Name   E-value   Identity  Description
AT5G47380.1  2.5e-146  51.35     Protein of unknown function, DUF547
AT4G16640.1  8.7e-91   52.80     Matrixin family protein
AT2G45040.1  6.4e-86   56.73     Matrixin family protein
AT1G70170.1  1.0e-75   49.15     matrix metalloproteinase
AT1G59970.1  1.0e-70   43.77     Matrixin family protein
InterPro
Analysis Name: InterPro Annotations of Watermelon (97103) v2.5
Date Performed: 2022-01-31
IPR Term   IPR Description                                 Source       Source Term     Source Description                              Alignment
None       No IPR available                                COILS        Coil            Coil                                            coord: 95..122
None       No IPR available                                COILS        Coil            Coil                                            coord: 140..160
None       No IPR available                                PIRSR        PIRSR001191-1   PIRSR001191-1                                   coord: 611..883; e-value: 3.8E-63; score: 211.6
None       No IPR available                                MOBIDB_LITE  mobidb-lite     disorder_prediction                             coord: 82..98
None       No IPR available                                MOBIDB_LITE  mobidb-lite     disorder_prediction                             coord: 176..197
None       No IPR available                                MOBIDB_LITE  mobidb-lite     disorder_prediction                             coord: 42..71
None       No IPR available                                MOBIDB_LITE  mobidb-lite     disorder_prediction                             coord: 27..98
None       No IPR available                                PANTHER      PTHR23054       UNCHARACTERIZED                                 coord: 16..579
None       No IPR available                                PANTHER      PTHR23054:SF26  EXPRESSED PROTEIN                               coord: 16..579
None       No IPR available                                SUPERFAMILY  55486           Metalloproteases ("zincins"), catalytic domain  coord: 702..868
IPR021190  Peptidase M10A                                  PRINTS       PR00138         MATRIXIN                                        coord: 757..785; score: 57.06
IPR021190  Peptidase M10A                                  PRINTS       PR00138         MATRIXIN                                        coord: 670..683; score: 36.85
IPR021190  Peptidase M10A                                  PRINTS       PR00138         MATRIXIN                                        coord: 818..843; score: 51.03
IPR021190  Peptidase M10A                                  PRINTS       PR00138         MATRIXIN                                        coord: 733..748; score: 44.82
IPR021190  Peptidase M10A                                  PRINTS       PR00138         MATRIXIN                                        coord: 852..865; score: 63.94
IPR006026  Peptidase, metallopeptidase                     SMART        SM00235         col_5                                           coord: 701..866; e-value: 3.6E-41; score: 152.7
IPR025757  Ternary complex factor MIP1, leucine-zipper     PFAM         PF14389         Lzipper-MIP1                                    coord: 92..170; e-value: 3.1E-20; score: 72.3
IPR024079  Metallopeptidase, catalytic domain superfamily  GENE3D       3.40.390.10     Collagenase (Catalytic Domain)                  coord: 603..871; e-value: 3.9E-77; score: 261.0
IPR006869  Domain of unknown function DUF547               PFAM         PF04784         DUF547                                          coord: 368..505; e-value: 7.9E-33; score: 113.2
IPR001818  Peptidase M10, metallopeptidase                 PFAM         PF00413         Peptidase_M10                                   coord: 704..865; e-value: 6.4E-49; score: 165.8
IPR002477  Peptidoglycan binding-like                      PFAM         PF01471         PG_binding_1                                    coord: 613..669; e-value: 5.0E-11; score: 42.6
IPR033739  Peptidase M10A, catalytic domain                CDD          cd04278         ZnMc_MMP                                        coord: 704..865; e-value: 8.33179E-65; score: 212.835
IPR036365  PGBD-like superfamily                           SUPERFAMILY  47090           PGBD-like                                       coord: 617..680
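
The four Pfam hits in the InterPro table can be ordered along the protein to read off the domain layout. The sketch below does this with the coordinates copied from the table. It assumes the coordinates are 1-based, inclusive residue positions; the names PFAM_HITS, domain_layout and overlapping_pairs are hypothetical and exist only for this illustration.

```python
# Pfam hits copied from the InterPro table as (name, start, end).
# Assumption: coordinates are 1-based and inclusive residue positions.
PFAM_HITS = [
    ("DUF547", 368, 505),
    ("Peptidase_M10", 704, 865),
    ("Lzipper-MIP1", 92, 170),
    ("PG_binding_1", 613, 669),
]


def domain_layout(hits):
    """Return the hits sorted by start coordinate (N- to C-terminus)."""
    return sorted(hits, key=lambda hit: hit[1])


def overlapping_pairs(hits):
    """Return name pairs whose coordinate ranges overlap."""
    ordered = domain_layout(hits)
    pairs = []
    for i, (name_a, _, end_a) in enumerate(ordered):
        for name_b, start_b, _ in ordered[i + 1:]:
            if start_b <= end_a:
                pairs.append((name_a, name_b))
    return pairs


if __name__ == "__main__":
    for name, start, end in domain_layout(PFAM_HITS):
        print(f"{name}\t{start}..{end}\t{end - start + 1} residues")
    print("overlaps:", overlapping_pairs(PFAM_HITS))
```

With these coordinates the order is Lzipper-MIP1, DUF547, PG_binding_1, Peptidase_M10, and no two Pfam hits overlap.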

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cla97C01G004270.2  Cla97C01G004270.2  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0006508      proteolysis
cellular_component  GO:0031012      extracellular matrix
molecular_function  GO:0004222      metalloendopeptidase activity
molecular_function  GO:0008270      zinc ion binding
molecular_function  GO:0008237      metallopeptidase activity