CmoCh08G005720 (gene) Cucurbita moschata (Rifu) v1

Overview
Name: CmoCh08G005720
Type: gene
Organism: Cucurbita moschata (Cucurbita moschata (Rifu) v1)
Description: aspartyl protease family protein At5g10770-like
Location: Cmo_Chr08: 3507877 .. 3517101 (+)
RNA-Seq Expression: CmoCh08G005720
Synteny: CmoCh08G005720
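
The Location field above gives a chromosome, a start and end coordinate, and a strand. A minimal sketch of how the genomic span could be derived from it, assuming the coordinates are 1-based and inclusive (the usual GFF convention; the record itself does not state this). The function name `parse_location` is hypothetical and not part of any database API.

```python
import re

def parse_location(text):
    """Parse a location string such as 'Cmo_Chr08: 3507877 .. 3517101 (+)'.

    Assumes 1-based, inclusive coordinates, so length = end - start + 1.
    """
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "seqid": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        "length": end - start + 1,  # inclusive span in base pairs
    }

loc = parse_location("Cmo_Chr08: 3507877 .. 3517101 (+)")
print(loc["seqid"], loc["strand"], loc["length"])
```

Under that assumption the gene spans 9225 bp on the plus strand of Cmo_Chr08.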
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ACAAAAGTGAGGGCTACACAAAGGTCAGCCCTCCATCTTTTGTTCCCATCCTAAACAAACCCTTCACTTCTCATCTCCCCATGGAGACTACAAGATCCCTGCATCCCCTTCTTCTTCTTCCTCTTCTTTTCGTCCTCGTTGACGCTCGTTCGAGCTCGATCGACGCCGTTAGTGCCTTTCACCAGACGCTTGTTCTTAATGGGCAGAAACTTCCATTGATGGATATGAAAATACCCGCAACTGAGTGCATCTTTCACAACCCAAGTCAGTGTTCTTCCCAAAAAATGGGGGGAAATCTCTTAATTGAAGTTTAATTATGTGTTGATTGTGATTAATCAATTGGGATCTGTTTTTCTTTTTTCTCTGCTTTTGGTATGATTTTGGGCTTTTTGTTTTTAATATTTTCTTTATTAAAAGATTTATATTGGGTTTAATTTCATTTTTGGGATTTTATATATATATATTTTTTTTTCTTTTTTTCCCTACAATTTTTTTTAGTAAAGACATTATAAATATTGAAAATTTTTTATCTACTAAATTTCCAAATTCTCCAGATTTTACCTCAACCATTTAATTCAATGATTTAATGAATAAAATATTATAATTATTTTAATTTTTTAATTGAAATTTTAAATGTAAATATATATATATTTTTTCTTTTTTTTTTTGTGCCAGTTTTGCATACCTACTGAATCCAATTTTATTAAAAAAAAAAAAAATTATTGACACAATTTTATTAAAAAATAATAGAAATTTTAAATCAAATTTAGATTATTATGTATATCAAATTAAAATTAATACATTATTATTAGTAAAAACGACCAAAAAAAACGCGTATTGATATAGATATACATATAGTTTTCTAGAACGCGTGTTGATATTGATATACATATATGTAGTATTTGTTTAAATTTTATAATTTTTGTAATTGATAATTGTGACGGAATTATTGGAATTTTGAATTTACAGGAGTGGAGAAGGAAACAGCGACCTTTGAAATGAAAGAAAGAGACTACTGTTCAGGCAATATAAAAGACCGGGACAAGAATCTCCAAGACCGCCTAGTCCTCGACCAAATTCACGTCGACTCCCTGCTATCCCGATTCAAATACTCCACTTCCCTCTTCACCCCCCACGACATCTCCGACACCCACCTCCCCTTAACCCTCGGCACCAGCCTCCAAACCCTCAACTACCTCGTCACCGTCCGCATCGGCCCTCAAAACCTAACCCTCATCGTCGACACCGGCAGCGACCTCACCTGGGTCCAATGCCTCCCTTGCCACCTCTGTTACAACCAACAACAACCCCTCTTCGATCCCTCAAACTCCCCTTCATTTCTCTCCCTCCCCTGTAACTCCAGCAACTGTTCTGCTTTCCAACCCACAGCTGGAGGCTCCGGCGCTTGTACCAACGGCAGTTCAAATCCCTGCGATTACGAGGTTAACTACGGCGACGGCTCTTACTCCCGTGGAGACCTCGGATTTGAAACCCTGAACCTGGGGAATGTTTCCGTTGAGAATTTTATATTTGGGTGTGGCCGGAATAACAAGGGCTTGTTCGGTGGAACCTCCGGATTAATGGGTTTTGGTAGAAGCGAACTCTCCGTTGTTTCTCAAACTTCCTCTGCTTACGGCGGCGTTTTTTCCTACTGTTTGCCGTCGACTGAAACCGGTTCTTCAGGTTCTTTAACAATGGGGACTGGAGATTTCTCAAATTTCCGAAACATTTCCCCAATTTCCTACACGAAAATGGTTCCAAATCCACAGATGTCGAATTTCTACACACTGAATCTGACCGGAATTACCGTTGGTGGGTTGAAATTAGTGGTGCCGCGTTTGGCTCCGAGTAAGGGAGTTTTGAGCTTACTCGATTCAGGGACGGTGATTACCAGGTTGCCTCCGTCGGTATACCAAGCTTTGAAGGAGGAATTTGTGAGGCAATTTTCTGGGTATCCAACGGCGGCTGGATACTCGCTGTTGGACACGTGTTATAATCTT
AGTGGGTTGAAAGAAGTGAAAACTCCGAATGTGAAGCTTCATTTTGAAGGCGAGGGAGAGATGAGTGTGGATGTTGGGGGGCTTTTTTACTATGTGAAATCTGATGGGTCTCAGATTTGTTTGGCGTTTGCGAGTTTGGCGGATGAAGATCAGATTGGGATTATTGGGAGTTATCAGCAGAAGAATCAGAGGGTTATTTATAATTTGAAGGAATCCAAGGTGGGTTTTGCAGGTGAGCGTTGCAGTTTCTAGCTCTGTTGGGAGAGATTTTCCCGGGAAAGTTGGAATGGATTTTCCCGGGAAAGTTGGAATGGATTTTCCCGGGAAAGTTGGGGTGGAGAAACGGGGCTTGTTGTTGTACTGTTGTAAAGGGTGGGGAGAATTTGGGTTTCTATATATTTCATTTTTTACTCTCTAAATCTGAATTCAAATATTAAATTAAAATCTGGAACTACTAGTTAGGGGTTTTTTTGTCTTTTTGTTGTGATCCATATAGTTAGGGGTATTTTCGTCCTTTTGTCATGGTCAAGTAGCATCTAGTTTTTTTTTTTTTTTTTTTTTTATCGTCCAAATTTATATAATTNTCTTTAAAAAATTTCTCTATCCTCGTTTTCTAGAGCCTTCTTCTTAAATCGAGACTAGAGGCTTGAACGTACGGCTAACATAATAACGAATTGACTTAGCTCACAATATATAACTCAGAAACGTATAGTACTCTATTCAGTAATTCTCATATGGCACATATGGCTTCCTCTCCTTTGCCTTTCTCCCATCAATCAATACTACTAAGCGAAGAATTGGAAATTTCCAATTTTTTTTTTTTTTTTTTTTATGAAAATGGTGGAAAAAAAATAAGAGAGATTTAGTTAGGGGTATTTTCATCCTTTTATTATTTTCAATTAATTTATTAAATGAGTAATTTAAATTTAAAATAAAAACATTAATTACTCAAATTTGTATTTCTAATTAGAAGACCTTAATTAAATTATATCAAGCCAAGAGTAGATAAATGAAAAAATAATAATAAATAATTATGAAAAATCATATCTAACTAAATAATTAAAAGGAGGGCTATAGAAACTGATTTCCTCTTTACGGCAAAGTATTTAATAAAATTATTATGTCTATTTTTTAATAATGTAATTTAAAAGAGAATATATATTAAATTTATTAAATACAAAAATACAAACAAACAACTTCCATTTGTCTCCCTAAATATTTTCTTACCATTTCTTTTCTCCCTTTCCTTTAAAATTGCGGTTTTCCATTGGACGAATCGGTCATTTCGTCTCTCTCTCTCTCTCGTTCTTCCTCTTCTTCGCGCACTTCCCTCTTACCACTCCACCTCCTCTGCTCGTTTCAATGGCGAAGATCTCACACTCACCCCACAAAATTTCCATTTCCACCACCGCTGATCCATCTCACTCTTCTCCAATCCATCAGAGCAAGGCTCTCAAGCGCACGCGTAAGTCCGTTCGTCGTGATGCTCCCGCTCAGCGTAGTTCTGTATACCGCGGCGTCACCAGGTTAGATTAGCCTTTGTTGATTTGGTTGAAGTCGAATGAATCGATTGATTTGAGTTCTCTCTGTTTTCTATGTTTTTTTTTTCTTTTGCTTGAATCTATTTGGTTTGCTTTAGGCATAGATGGACGGGACGGTATGAGGCGCACCTGTGGGATAAGAATTCTTGGAATGAAGGACAGAATAAGAAAGGAAGACAAGGTTCTGTAGTTATCTGTTTGTTTTTTCTTTTTGAATCTGTAAGTTGAGGCGATTCTAATTTTTTTGTTCTTTGTTCTTTTGATTCATGCCTCTGTATTGCTGTGCGTGGCTTATTGATGAAATTTTGAATCGAAACGCGCAGTGTATTTAGGTATGGTCTGCGATTGGAACTTGAACTTGTGATTTAATGTTGAAACAGAGCTATCGATCGGGTTAAGTATGTGTGCTATTGTTTACTTCTGATCTACAGGAGCGTACGATGATGAAGAAGCGGCGGC
TCATGCTTACGACCTTGCAGCGCTCAAGTACTGGGGAGCAGAAACTGTTATTAACTTTCCAGTAATTTTCCAATCGATTGATCGTTGTTCTGCACTCACTTGTTTTTCTTTTGGCTATGGAAATTAAGAAATTTGTTGAAGCTGTGTTCATTCTCTGATTTGATCATTTGCAGCGATTAACATACCAAGACGAGCTTAAAGAAATGGAAGGCCAATCAAGAGAAGAGTATATTAGATATTTGAGAAGGTAATAACCATTTTATTCTCATCTTAAACTTCAAAATAGAATCCTTATCCATTTTTATGAGTTCAATTAACAGGAAGAGTAGCGGCTTTTCTCGCGGTGTTTCAAAGTACAGAGGTGTAGCAAGGTACCTAACCTCGATCACATTAAAGTTTAGAGCTACAATTTTATTCAACGAAAACATGTTCATATATAAATAAATACATGTGACACTGTTTTGCCTACTATTTTAACAAGTAATCTAAGTTCGTTATACCATTGTTTACTTAACCTTCAATTTTGTTCATGAGATTATGGTTCTTAAATCTAGTTTATAGTATCAAAGGTATTGCAATTAGTAAATTTGTTGCTTCTGGCAGACAATCTTATGCATCAAACACTAATTTCATACCTTACTCCTAAGTCAAACACTCCCTATTACCTACTTATACTTACCCTTCATACACATATATGAAACACATGTGATGTGTTGCGTGTGAAGGTTTCTTTCTTAACTTTCTTCGTCGCCATTTTCGGGTTTGTTTTAGCCATCAATTTGGGAAATATCTCTTTCCACTTACAGTAGGGACAATTAATGATATCACCATTCAACCACTACTAACCCACTGGGAAGGAGGGGAAGAACATTTCCATATGTGCATAGTGTTTTTTTATTTTCTTAATTTATATTCTTGATTTTATTTTTCTCAATCTTTTTATTATGTCGTGGCAATTGATATGCAAATACCTTAAATGTTCAAATACTTTTTATCTTTGTAATTATGAGAAATAAAAAAACAGGGAAATGCAGGCACGTGGCTACGGGTATGTAACACAATTTTCTTTGGTCAAAGCTGCTTTAAGGAGCAATTATCATAATTCTTTTTCCACATTTGAATTATTGAATGATAAAGATAATGAATTTATTATTTCAGCATAAAATTTATACAACTCGTAGAAATAGGAGTAGTTTTAGTTTAATTGAAGAAATTGAATAGTTAAACCATATTTCTGCTTCTGAAGATAGAATCCGTTACCATTTTTGGGAGTGTTGAGGATTATTGGGAGTGTGTCCCACATCGGTTAATTTAATGGAAAGTCATGAGTTTTAAGTGGTGAATACTATCCCCATTGGTATGAGGTGTTTTGGGGAAGCCCAGAGCAAAGCTATGAGAACTTGTGTTTGAAGTGGACAATATCATAACACAATTCCTAATAGGAAATTATAATGCATTTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTCTCAGGCACCATCATAATGGGAGATGGGAAGCTCGAATCGGCAGGGTGTTCGGCAACAAATATCTTTACCTTGGAACATACGGTATTTTTTCTCTTTTACTGCACAAACCCAACTATTAACTGTTGAAAACAGACTAAATCTACAGTTTAATTAAAGAAAAAACCACGTAGAAGCTAAAAAGATATAAAGGGGGAAAATAAAAAACAATTGTTTCTTTAAATTATAGGTTTTTCATGATGTGGTCAACAGCCTCACTTCATTATATTATTATAGAGTTGTCATCACACATCATA
CCATTTTTGTGTCTTATTTCACGCATCACACTTTAATATTTTGAATGAATAAGAATAGTTTTATTTCTTTTGTTTGTTAGTTTCAATATGTCATCTAATCTTTTCATTTCATAGAATAATTTATATTATCGTCTGCCTTCATATTCTATTTTTTTTTTTACTGTCCAAACCCATAAATTTTTTTTTACTGCCTCTATATTCTATATAATGTGTATTTGTTATACCTTTCAATAAAAACCGTTCTCTCATCCCGATTATAAAGCTACATCATCGCCCTTTTAAAGAAGTATATAATAAAATAAAGTGTTTGAAAACTGTGGTAATTTATCCAACATAAAATTTAGTTACTATCTTTGGACTTTTCCTCTCGGACTTTTCCTATAGACTTTAAAAGACGTATGCTAGGAGAAAATGCTAAGGGAATATGCTAGGGGAAGGTTTTTTTCCCTTGTGGGAAAGTTTTTTCCCTTCCAAACTTTCCCTCAAGGCTTTAAGACGCGTTTGCTAGAGGAAAGTTTCCACACTCTTATAAATGGAGATTTGTTCTCCTCCCAACCAATGTGGGACATTCACAATCCACCTCCCTTCGAGGCCCAGCGTCTTCGCTGGCACTCTTTCCTTCCTCCAATCGATGTGGAACCGCCCCCAAATGTACCCTCTTTGGGGCCAGCGTCCTTACTGGCACACCGTCTCATGTCCCCCCCTTCGAGGAACAGCGAGAAGGCTGGCAAACGTCCGGTGCCTGGCTCTGATACCGTTTGTAACGCTCCAGATCCATCTCTAGCAGATATTGTCGTTTTTGGGCTCTCCCTTTTGGACTTCCCTTCAAAACTTTAAATTTGTTCTTATTTCCAACCAACCTGGGACATCGCATTTTCAAACTTTTATTTATTTAAAAAATATTTTCACCTAGACTTCCTAAGGTTTAATTTTAAAGAGATGATTATCAAATTGTGTTCATGTGTTATGTTCTTAAAAAGTATATTAATTTTGTATAGTAAATAAATCTATCCATGTTACTTTTATTCAACAAAAATTTATTTATTTTTCCTTAAAAAAAGCTAAAATATTTAAACCATATACGCAACTAAATCTAGTAGTTGGCATGTAGATCACCAAACCTCAACTGCAGCATGGCCAGCTATTTAAGATTTATTCATTTCGACACATGATATCATAAATTCATTGTCTTCTCACCAAAACTTGCATACATGTTATTCCGGGGAAGTTAGAAATTAATATCTAATTTATTATTATTATTGTTATTTCTATTTGCTTTATTTTCACTTTACAGCTACGCAAGAAGAAGCTGCTAGAGCTTACGACCTGGCGGCAATAGAACATCGTGGCCTTAACGCCGTTACAAATTTCGATATCAGCCGTTACATCAAATGTCTCCGTCCAGGGGAACAGCATATCCCAGACAATAACCGTCCATCAAGTCCAAACGCCGGCGACACTGCCTCAGAATTTGACCCCAAATCCTTTCTCGAAATTACCTTTCCGTCGCAATCTTCGAGCTCCGACCAACCAACCACCGCGCCGGAACCTCACGGCGGCCTCCCCTCATCATCGTCGGCGACGTTGGAGCTGCTAATCCATTCGTCAAAATTCAAACACATACTAGAAAGGACATCCGCCGCTGAGACTCCACAGACGCTGCCGGAATCCGTCCGGCCGCGCCGCTGTATACCGGACGACATTCAAACCTACTTCGATTGTAGTACTCAAGATTCTGACGACATTGCCGGAAGCGACGACGGAATTTTCGGGTACCTAAACTCGTATTTTCCTTCATCAGCATCAGTTTTTCATAGCGCACTCGATGCTTAATTAGGTGTAATCAAGAAAGGAGATGGAAATGAGGAAATAACTTGGGCTGGCCATGGCCATGGCGGATTACGACGATCACCAGAAACGGCGACAATGGAGGGCATGGCCTGAGGTACGCAATTAATTAAAATTATAGAAATTTTGGTTTTTACTTTTGTGAAGCCTTC
TCTTTGATTCACAAGAATGGGGTAAGAAAACGTTTAATCTTTTTCCCCGATAGAAATTATCTGCATGCTACCGTAGAAATCTTGGGTGCATCCGGAATTGAGGTTGGAAGTTTAGCAAAAATCACTCATTTCTAAATTATAATAAGAAGGGCTTAGAGTCTATCTCATGGATTCATGGGTGAGATTTTATTTTTGTAAATATAAATAAAATACATGGTAAGAATGAAAAGTATCGGAAATTTCAAAAACCCTCTTTAAAAATCATCTTAGTCATGTAAAATTGCTTCTATTGTTATTATTATTTATTTTTGTAGTTTTAAAAACTTATTCGAAATTACTTTATTTCTTTTAAAGCGATTCTATTTTAAAATCATCGTAGTCTATAATTTCTTTACAAAAATAATTATTTTAATCCTTAGCATAAGATAAAAAAAGAATAGTTAGAATATAGTTTAAACCTTTATTTTTTCAGAATATTATGACAAAAAAAAAAGATTAAAATAATCAAAACTATAATAATTTAGTTTCGAATTATTTGATGTAATACTTTTTCCCATAGTTCAGAAATTCAATCCAAGTAAATTATTATGTTAAAGCTTGTGGTCACATAGGCTTCTTAGTTCTTGCTGACTTTAGGCATCCCTTTTATTATGCAGGGTCCGGGCTACACACTTGGCGAGTGTTTGTGATTTGGTTAGCTGGAAATTGAAAGTCTTGCCGGTATACCAGCTCAACTCCTCCAGGAAAATTGATTGTAAGTATAATAAATTTGTTCGATTTTGTCAAATTATTAAAGTTCCTAAAGTTCGAACGTATTTTGTATAATTTAATCTTATGTTTAGATAAGCTATATTCTCCTTTTAGGCCAATTGATCCTCCAAAAGAGAAACTTGATTCACCAAAAGCATTAAAGCAAAACATTGTTAATCATCAAAAGAGAAACTCTAAGTCAAGAGAATTTACAAATACATTAGAGTTGCATCGATGTTTGGAAAGTCAGGGCATTGCAAGGAAGGTTGTTCAGAGATTGAACAGCAAAAGAGCTGAGAAGCAGCCAAGGATTATCATACTGAGCAACAAAGCTTTGAATCTATGGAAGCCAGGCATATGCAAAATGCTTGGTAACCTGTCAATGGATACCTGCCAAAACACAAAACACCCCATAATTCTTATCAACATCCCACAAGTCTACGAGCTAAGACAAGTCGATAGACCTTTTAGATAG

mRNA sequence

ACAAAAGTGAGGGCTACACAAAGGTCAGCCCTCCATCTTTTGTTCCCATCCTAAACAAACCCTTCACTTCTCATCTCCCCATGGAGACTACAAGATCCCTGCATCCCCTTCTTCTTCTTCCTCTTCTTTTCGTCCTCGTTGACGCTCGTTCGAGCTCGATCGACGCCGTTAGTGCCTTTCACCAGACGCTTGTTCTTAATGGGCAGAAACTTCCATTGATGGATATGAAAATACCCGCAACTGAGTGCATCTTTCACAACCCAAGAGTGGAGAAGGAAACAGCGACCTTTGAAATGAAAGAAAGAGACTACTGTTCAGGCAATATAAAAGACCGGGACAAGAATCTCCAAGACCGCCTAGTCCTCGACCAAATTCACGTCGACTCCCTGCTATCCCGATTCAAATACTCCACTTCCCTCTTCACCCCCCACGACATCTCCGACACCCACCTCCCCTTAACCCTCGGCACCAGCCTCCAAACCCTCAACTACCTCGTCACCGTCCGCATCGGCCCTCAAAACCTAACCCTCATCGTCGACACCGGCAGCGACCTCACCTGGGTCCAATGCCTCCCTTGCCACCTCTGTTACAACCAACAACAACCCCTCTTCGATCCCTCAAACTCCCCTTCATTTCTCTCCCTCCCCTGTAACTCCAGCAACTGTTCTGCTTTCCAACCCACAGCTGGAGGCTCCGGCGCTTGTACCAACGGCAGTTCAAATCCCTGCGATTACGAGGTTAACTACGGCGACGGCTCTTACTCCCGTGGAGACCTCGGATTTGAAACCCTGAACCTGGGGAATGTTTCCGTTGAGAATTTTATATTTGGGTGTGGCCGGAATAACAAGGGCTTGTTCGGTGGAACCTCCGGATTAATGGGTTTTGGTAGAAGCGAACTCTCCGTTGTTTCTCAAACTTCCTCTGCTTACGGCGGCGTTTTTTCCTACTGTTTGCCGTCGACTGAAACCGGTTCTTCAGGTTCTTTAACAATGGGGACTGGAGATTTCTCAAATTTCCGAAACATTTCCCCAATTTCCTACACGAAAATGGTTCCAAATCCACAGATGTCGAATTTCTACACACTGAATCTGACCGGAATTACCGTTGGTGGGTTGAAATTAGTGGTGCCGCGTTTGGCTCCGAGTAAGGGAGTTTTGAGCTTACTCGATTCAGGGACGGTGATTACCAGGTTGCCTCCGTCGGTATACCAAGCTTTGAAGGAGGAATTTGTGAGGCAATTTTCTGGGTATCCAACGGCGGCTGGATACTCGCTGTTGGACACGTGTTATAATCTTAGTGGGTTGAAAGAAGTGAAAACTCCGAATGTGAAGCTTCATTTTGAAGGCGAGGGAGAGATGAGTGTGGATGTTGGGGGGCTTTTTTACTATGTGAAATCTGATGGGTCTCAGATTTGTTTGGCGTTTGCGAGTTTGGCGGATGAAGATCAGATTGGGATTATTGGGAGTTATCAGCAGAAGAATCAGAGGGTTATTTATAATTTGAAGGAATCCAAGAGCAAGGCTCTCAAGCGCACGCGTAAGTCCGTTCGTCGTGATGCTCCCGCTCAGCGTAGTTCTGTATACCGCGGCGTCACCAGGCATAGATGGACGGGACGGTATGAGGCGCACCTGTGGGATAAGAATTCTTGGAATGAAGGACAGAATAAGAAAGGAAGACAAGGAGCGTACGATGATGAAGAAGCGGCGGCTCATGCTTACGACCTTGCAGCGCTCAAGTACTGGGGAGCAGAAACTGTTATTAACTTTCCACGATTAACATACCAAGACGAGCTTAAAGAAATGGAAGGCCAATCAAGAGAAGAGTATATTAGATATTTGAGAAGGAAGAGTAGCGGCTTTTCTCGCGGTGTTTCAAAGTACAGAGCTACGCAAGAAGAAGCTGCTAGAGCTTACGACCTGGCGGCAATAGAACATCGTGGCCTTAACGCCGTTACAAATTTCGATATCAGCCGTTACATCAAATGTCTCCGTCCAGGGGAA
CAGCATATCCCAGACAATAACCGTCCATCAAGTCCAAACGCCGGCGACACTGCCTCAGAATTTGACCCCAAATCCTTTCTCGAAATTACCTTTCCGTCGCAATCTTCGAGCTCCGACCAACCAACCACCGCGCCGGAACCTCACGGCGGCCTCCCCTCATCATCGTCGGCGACGTTGGAGCTGCTAATCCATTCGTCAAAATTCAAACACATACTAGAAAGGACATCCGCCGCTGAGACTCCACAGACGCTGCCGGAATCCGTCCGGCCGCGCCGCTGTATACCGGACGACATTCAAACCTACTTCGATTGTAGTACTCAAGATTCTGACGACATTGCCGGAAGCGACGACGGAATTTTCGGGGTCCGGGCTACACACTTGGCGAGTGTTTGTGATTTGGTTAGCTGGAAATTGAAAGTCTTGCCGGTATACCAGCTCAACTCCTCCAGGAAAATTGATTATAAGCTATATTCTCCTTTTAGGCCAATTGATCCTCCAAAAGAGAAACTTGATTCACCAAAAGCATTAAAGCAAAACATTGTTAATCATCAAAAGAGAAACTCTAAGTCAAGAGAATTTACAAATACATTAGAGTTGCATCGATGTTTGGAAAGTCAGGGCATTGCAAGGAAGGTTGTTCAGAGATTGAACAGCAAAAGAGCTGAGAAGCAGCCAAGGATTATCATACTGAGCAACAAAGCTTTGAATCTATGGAAGCCAGGCATATGCAAAATGCTTGGTAACCTGTCAATGGATACCTGCCAAAACACAAAACACCCCATAATTCTTATCAACATCCCACAAGTCTACGAGCTAAGACAAGTCGATAGACCTTTTAGATAG

Coding sequence (CDS)

ATGGAGACTACAAGATCCCTGCATCCCCTTCTTCTTCTTCCTCTTCTTTTCGTCCTCGTTGACGCTCGTTCGAGCTCGATCGACGCCGTTAGTGCCTTTCACCAGACGCTTGTTCTTAATGGGCAGAAACTTCCATTGATGGATATGAAAATACCCGCAACTGAGTGCATCTTTCACAACCCAAGAGTGGAGAAGGAAACAGCGACCTTTGAAATGAAAGAAAGAGACTACTGTTCAGGCAATATAAAAGACCGGGACAAGAATCTCCAAGACCGCCTAGTCCTCGACCAAATTCACGTCGACTCCCTGCTATCCCGATTCAAATACTCCACTTCCCTCTTCACCCCCCACGACATCTCCGACACCCACCTCCCCTTAACCCTCGGCACCAGCCTCCAAACCCTCAACTACCTCGTCACCGTCCGCATCGGCCCTCAAAACCTAACCCTCATCGTCGACACCGGCAGCGACCTCACCTGGGTCCAATGCCTCCCTTGCCACCTCTGTTACAACCAACAACAACCCCTCTTCGATCCCTCAAACTCCCCTTCATTTCTCTCCCTCCCCTGTAACTCCAGCAACTGTTCTGCTTTCCAACCCACAGCTGGAGGCTCCGGCGCTTGTACCAACGGCAGTTCAAATCCCTGCGATTACGAGGTTAACTACGGCGACGGCTCTTACTCCCGTGGAGACCTCGGATTTGAAACCCTGAACCTGGGGAATGTTTCCGTTGAGAATTTTATATTTGGGTGTGGCCGGAATAACAAGGGCTTGTTCGGTGGAACCTCCGGATTAATGGGTTTTGGTAGAAGCGAACTCTCCGTTGTTTCTCAAACTTCCTCTGCTTACGGCGGCGTTTTTTCCTACTGTTTGCCGTCGACTGAAACCGGTTCTTCAGGTTCTTTAACAATGGGGACTGGAGATTTCTCAAATTTCCGAAACATTTCCCCAATTTCCTACACGAAAATGGTTCCAAATCCACAGATGTCGAATTTCTACACACTGAATCTGACCGGAATTACCGTTGGTGGGTTGAAATTAGTGGTGCCGCGTTTGGCTCCGAGTAAGGGAGTTTTGAGCTTACTCGATTCAGGGACGGTGATTACCAGGTTGCCTCCGTCGGTATACCAAGCTTTGAAGGAGGAATTTGTGAGGCAATTTTCTGGGTATCCAACGGCGGCTGGATACTCGCTGTTGGACACGTGTTATAATCTTAGTGGGTTGAAAGAAGTGAAAACTCCGAATGTGAAGCTTCATTTTGAAGGCGAGGGAGAGATGAGTGTGGATGTTGGGGGGCTTTTTTACTATGTGAAATCTGATGGGTCTCAGATTTGTTTGGCGTTTGCGAGTTTGGCGGATGAAGATCAGATTGGGATTATTGGGAGTTATCAGCAGAAGAATCAGAGGGTTATTTATAATTTGAAGGAATCCAAGAGCAAGGCTCTCAAGCGCACGCGTAAGTCCGTTCGTCGTGATGCTCCCGCTCAGCGTAGTTCTGTATACCGCGGCGTCACCAGGCATAGATGGACGGGACGGTATGAGGCGCACCTGTGGGATAAGAATTCTTGGAATGAAGGACAGAATAAGAAAGGAAGACAAGGAGCGTACGATGATGAAGAAGCGGCGGCTCATGCTTACGACCTTGCAGCGCTCAAGTACTGGGGAGCAGAAACTGTTATTAACTTTCCACGATTAACATACCAAGACGAGCTTAAAGAAATGGAAGGCCAATCAAGAGAAGAGTATATTAGATATTTGAGAAGGAAGAGTAGCGGCTTTTCTCGCGGTGTTTCAAAGTACAGAGCTACGCAAGAAGAAGCTGCTAGAGCTTACGACCTGGCGGCAATAGAACATCGTGGCCTTAACGCCGTTACAAATTTCGATATCAGCCGTTACATCAAATGTCTCCGTCCAGGGGAACAGCATATCCCAGACAATAACCGTCCATCAAGTCCAAACGCCGGCGACACTGCCTCAGAATTTGACCCCAAATCCTTTCT
CGAAATTACCTTTCCGTCGCAATCTTCGAGCTCCGACCAACCAACCACCGCGCCGGAACCTCACGGCGGCCTCCCCTCATCATCGTCGGCGACGTTGGAGCTGCTAATCCATTCGTCAAAATTCAAACACATACTAGAAAGGACATCCGCCGCTGAGACTCCACAGACGCTGCCGGAATCCGTCCGGCCGCGCCGCTGTATACCGGACGACATTCAAACCTACTTCGATTGTAGTACTCAAGATTCTGACGACATTGCCGGAAGCGACGACGGAATTTTCGGGGTCCGGGCTACACACTTGGCGAGTGTTTGTGATTTGGTTAGCTGGAAATTGAAAGTCTTGCCGGTATACCAGCTCAACTCCTCCAGGAAAATTGATTATAAGCTATATTCTCCTTTTAGGCCAATTGATCCTCCAAAAGAGAAACTTGATTCACCAAAAGCATTAAAGCAAAACATTGTTAATCATCAAAAGAGAAACTCTAAGTCAAGAGAATTTACAAATACATTAGAGTTGCATCGATGTTTGGAAAGTCAGGGCATTGCAAGGAAGGTTGTTCAGAGATTGAACAGCAAAAGAGCTGAGAAGCAGCCAAGGATTATCATACTGAGCAACAAAGCTTTGAATCTATGGAAGCCAGGCATATGCAAAATGCTTGGTAACCTGTCAATGGATACCTGCCAAAACACAAAACACCCCATAATTCTTATCAACATCCCACAAGTCTACGAGCTAAGACAAGTCGATAGACCTTTTAGATAG

Protein sequence

METTRSLHPLLLLPLLFVLVDARSSSIDAVSAFHQTLVLNGQKLPLMDMKIPATECIFHNPRVEKETATFEMKERDYCSGNIKDRDKNLQDRLVLDQIHVDSLLSRFKYSTSLFTPHDISDTHLPLTLGTSLQTLNYLVTVRIGPQNLTLIVDTGSDLTWVQCLPCHLCYNQQQPLFDPSNSPSFLSLPCNSSNCSAFQPTAGGSGACTNGSSNPCDYEVNYGDGSYSRGDLGFETLNLGNVSVENFIFGCGRNNKGLFGGTSGLMGFGRSELSVVSQTSSAYGGVFSYCLPSTETGSSGSLTMGTGDFSNFRNISPISYTKMVPNPQMSNFYTLNLTGITVGGLKLVVPRLAPSKGVLSLLDSGTVITRLPPSVYQALKEEFVRQFSGYPTAAGYSLLDTCYNLSGLKEVKTPNVKLHFEGEGEMSVDVGGLFYYVKSDGSQICLAFASLADEDQIGIIGSYQQKNQRVIYNLKESKSKALKRTRKSVRRDAPAQRSSVYRGVTRHRWTGRYEAHLWDKNSWNEGQNKKGRQGAYDDEEAAAHAYDLAALKYWGAETVINFPRLTYQDELKEMEGQSREEYIRYLRRKSSGFSRGVSKYRATQEEAARAYDLAAIEHRGLNAVTNFDISRYIKCLRPGEQHIPDNNRPSSPNAGDTASEFDPKSFLEITFPSQSSSSDQPTTAPEPHGGLPSSSSATLELLIHSSKFKHILERTSAAETPQTLPESVRPRRCIPDDIQTYFDCSTQDSDDIAGSDDGIFGVRATHLASVCDLVSWKLKVLPVYQLNSSRKIDYKLYSPFRPIDPPKEKLDSPKALKQNIVNHQKRNSKSREFTNTLELHRCLESQGIARKVVQRLNSKRAEKQPRIIILSNKALNLWKPGICKMLGNLSMDTCQNTKHPIILINIPQVYELRQVDRPFR
Homology
BLAST of CmoCh08G005720 vs. ExPASy Swiss-Prot
Match: Q8S9J6 (Aspartyl protease family protein At5g10770 OS=Arabidopsis thaliana OX=3702 GN=At5g10770 PE=2 SV=1)

HSP 1 Score: 285.4 bits (729), Expect = 2.3e-75
Identity = 180/430 (41.86%), Positives = 250/430 (58.14%), Query Frame = 0

Query: 53  ATECIFHNPRVEKETATFEMKER-DYCS--GNIKDRDKNLQDRLVLDQIHVDSLLSRFKY 112
           ++ C+  +PR     ++  +  R   CS   N K    +  + L LDQ  V+S+ S  K 
Sbjct: 46  SSSCVL-SPRASTTKSSLHVTHRHGTCSRLNNGKATSPDHVEILRLDQARVNSIHS--KL 105

Query: 113 STSLFTPH--DISDTHLPLTLGTSLQTLNYLVTVRIG-PQN-LTLIVDTGSDLTWVQCLP 172
           S  L T H  +   T LP   G++L + NY+VTV +G P+N L+LI DTGSDLTW QC P
Sbjct: 106 SKKLATDHVSESKSTDLPAKDGSTLGSGNYIVTVGLGTPKNDLSLIFDTGSDLTWTQCQP 165

Query: 173 C-HLCYNQQQPLFDPSNSPSFLSLPCNSSNCSAFQPTAGGSGACTNGSSNPCDYEVNYGD 232
           C   CY+Q++P+F+PS S S+ ++ C+S+ C +     G +G+C   S++ C Y + YGD
Sbjct: 166 CVRTCYDQKEPIFNPSKSTSYYNVSCSSAACGSLSSATGNAGSC---SASNCIYGIQYGD 225

Query: 233 GSYSRGDLGFETLNLGNVSV-ENFIFGCGRNNKGLFGGTSGLMGFGRSELSVVSQTSSAY 292
            S+S G L  E   L N  V +   FGCG NN+GLF G +GL+G GR +LS  SQT++AY
Sbjct: 226 QSFSVGFLAKEKFTLTNSDVFDGVYFGCGENNQGLFTGVAGLLGLGRDKLSFPSQTATAY 285

Query: 293 GGVFSYCLPSTETGSSGSLTMGTGDFSNFRNISPISYTKMVPNPQMSNFYTLNLTGITVG 352
             +FSYCLPS+    +G LT G+   S     +PIS          ++FY LN+  ITVG
Sbjct: 286 NKIFSYCLPSS-ASYTGHLTFGSAGISRSVKFTPISTI-----TDGTSFYGLNIVAITVG 345

Query: 353 GLKLVVPRLAPSKGVLSLLDSGTVITRLPPSVYQALKEEFVRQFSGYPTAAGYSLLDTCY 412
           G KL +P    S    +L+DSGTVITRLPP  Y AL+  F  + S YPT +G S+LDTC+
Sbjct: 346 GQKLPIPSTVFSTPG-ALIDSGTVITRLPPKAYAALRSSFKAKMSKYPTTSGVSILDTCF 405

Query: 413 NLSGLKEVKTPNVKLHFEGEGEMSVDVGGLFYYVKSDGSQICLAFASLADEDQIGIIGSY 472
           +LSG K V  P V   F G   + +   G+FY  K   SQ+CLAFA  +D+    I G+ 
Sbjct: 406 DLSGFKTVTIPKVAFSFSGGAVVELGSKGIFYVFKI--SQVCLAFAGNSDDSNAAIFGNV 460

Query: 473 QQKNQRVIYN 474
           QQ+   V+Y+
Sbjct: 466 QQQTLEVVYD 460
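
The percentages on each HSP summary line are the match counts divided by the alignment length (for the HSP above, 180 identical and 250 positive positions out of 430 aligned columns). A small sketch that recomputes them from the summary line; the helper name `hsp_percentages` is hypothetical, and the parsing assumes the `Label = hits/total` layout used in this record.

```python
import re

def hsp_percentages(line):
    """Recompute percentages from a BLAST HSP summary line.

    Every 'Label = hits/total' pair is picked up; fields without a
    slash (such as 'Query Frame = 0') are ignored.
    """
    stats = {}
    for label, hits, total in re.findall(r"(\w+) = (\d+)/(\d+)", line):
        stats[label] = round(100 * int(hits) / int(total), 2)
    return stats

summary = "Identity = 180/430 (41.86%), Positives = 250/430 (58.14%), Query Frame = 0"
print(hsp_percentages(summary))
```

The recomputed values agree with the figures printed in parentheses, which is a quick way to confirm that a summary line was copied without corruption.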

BLAST of CmoCh08G005720 vs. ExPASy Swiss-Prot
Match: Q94AN4 (AP2-like ethylene-responsive transcription factor At1g16060 OS=Arabidopsis thaliana OX=3702 GN=At1g16060 PE=2 SV=1)

HSP 1 Score: 250.8 bits (639), Expect = 6.2e-65
Identity = 163/312 (52.24%), Positives = 189/312 (60.58%), Query Frame = 0

Query: 483 KRTRKSVRRDAPAQRSSVYRGVTRHRWTGRYEAHLWDKNSWNEGQNKKGRQ---GAYDDE 542
           KR R+S  RDAP QRSSV+RGVTRHRWTGRYEAHLWDKNSWNE Q KKGRQ   GAYD+E
Sbjct: 41  KRKRRSQPRDAPPQRSSVHRGVTRHRWTGRYEAHLWDKNSWNETQTKKGRQVYLGAYDEE 100

Query: 543 EAAAHAYDLAALKYWGAETVINFPRLTYQDELKEMEGQSREEYIRYLRRKSSGFSRGVSK 602
           +AAA AYDLAALKYWG +T++NFP   Y++++KEME QS+EEYI  LRRKSSGFSRGVSK
Sbjct: 101 DAAARAYDLAALKYWGRDTILNFPLCNYEEDIKEMESQSKEEYIGSLRRKSSGFSRGVSK 160

Query: 603 YR-----------------------------ATQEEAARAYDLAAIEHRGLNAVTNFDIS 662
           YR                             ATQEEAA AYD+AAIE+RGLNAVTNFDIS
Sbjct: 161 YRGVAKHHHNGRWEARIGRVFGNKYLYLGTYATQEEAAIAYDIAAIEYRGLNAVTNFDIS 220

Query: 663 RYIKCLRPGEQHIPDNNRPSSPNAGDTASEFDPKSFLEITFPSQSSSSDQPTTAPEPHGG 722
           RY+K   P       NN   SP++ D +    P    +++  SQSSS D      +    
Sbjct: 221 RYLKLPVPENPIDTANNLLESPHS-DLSPFIKPNHESDLS-QSQSSSEDNDDRKTK---- 280

Query: 723 LPSSSSATLELLIHSSKFKHILERTSAAETPQTLPESVRPRRCIPDDIQTYFDCSTQDSD 762
           L  SS    E +I                 P T PE   PRR  P+DIQTYF C  Q+S 
Sbjct: 281 LLKSSPLVAEEVI----------------GPSTPPEIAPPRRSFPEDIQTYFGC--QNSG 328

BLAST of CmoCh08G005720 vs. ExPASy Swiss-Prot
Match: A0JPZ8 (AP2-like ethylene-responsive transcription factor At1g79700 OS=Arabidopsis thaliana OX=3702 GN=At1g79700 PE=2 SV=1)

HSP 1 Score: 250.8 bits (639), Expect = 6.2e-65
Identity = 160/309 (51.78%), Positives = 186/309 (60.19%), Query Frame = 0

Query: 476 ESKSKAL--KRTRKSVRRDAPAQRSSVYRGVTRHRWTGRYEAHLWDKNSWNEGQNKKGRQ 535
           ES S AL  KR RKS  R+AP QRSS YRGVTRHRWTGRYEAHLWDKNSWN+ Q KKGRQ
Sbjct: 26  ESASIALTSKRKRKSPPRNAPLQRSSPYRGVTRHRWTGRYEAHLWDKNSWNDTQTKKGRQ 85

Query: 536 ---GAYDDEEAAAHAYDLAALKYWGAETVINFPRLTYQDELKEMEGQSREEYIRYLRRKS 595
              GAYD+EEAAA AYDLAALKYWG +T++NFP  +Y +++KEMEGQS+EEYI  LRRKS
Sbjct: 86  VYLGAYDEEEAAARAYDLAALKYWGRDTLLNFPLPSYDEDVKEMEGQSKEEYIGSLRRKS 145

Query: 596 SGFSRGVSKYR-------------------ATQEEAARAYDLAAIEHRGLNAVTNFDISR 655
           SGFSRGVSKYR                   ATQEEAA AYD+AAIE+RGLNAVTNFD+SR
Sbjct: 146 SGFSRGVSKYRGVARHHHNGRWEARIGRVFATQEEAAIAYDIAAIEYRGLNAVTNFDVSR 205

Query: 656 YIKCLRPGEQHIPDNNRPSSPNAGDTASEFDPKSFLEITFPSQSSSSDQPTTAPEPHGGL 715
           Y+                 +PNA    ++ D K    I  PS+   S     +P      
Sbjct: 206 YL-----------------NPNAAADKADSDSK---PIRSPSREPESSDDNKSP------ 265

Query: 716 PSSSSATLELLIHSSKFKHILERTSAAETPQTLPESVRPRRCIPDDIQTYFDCSTQDSDD 761
                          K + ++E       P T PE +  RR  PDDIQTYF C  QDS  
Sbjct: 266 ---------------KSEEVIE-------PSTSPEVIPTRRSFPDDIQTYFGC--QDSGK 284

BLAST of CmoCh08G005720 vs. ExPASy Swiss-Prot
Match: Q9LEW3 (Aspartyl protease AED1 OS=Arabidopsis thaliana OX=3702 GN=AED1 PE=2 SV=1)

HSP 1 Score: 236.9 bits (603), Expect = 9.2e-61
Identity = 149/402 (37.06%), Positives = 224/402 (55.72%), Query Frame = 0

Query: 78  CSGNIKDRDKNLQDRLVLDQIHVDSLLSRFKYSTSLFTPHDISDTHLPLTLGTSLQTLNY 137
           CS    D   +  + +  DQ  V+S+ S+     S     +   T LP   G +L + NY
Sbjct: 74  CSHLSSDARVDHDEIIRRDQARVESIYSKLS-KNSANEVSEAKSTELPAKSGITLGSGNY 133

Query: 138 LVTVRIG--PQNLTLIVDTGSDLTWVQCLPC-HLCYNQQQPLFDPSNSPSFLSLPCNSSN 197
           +VT+ IG    +L+L+ DTGSDLTW QC PC   CY+Q++P F+PS+S ++ ++ C+S  
Sbjct: 134 IVTIGIGTPKHDLSLVFDTGSDLTWTQCEPCLGSCYSQKEPKFNPSSSSTYQNVSCSSPM 193

Query: 198 CSAFQPTAGGSGACTNGSSNPCDYEVNYGDGSYSRGDLGFETLNLGNVSV-ENFIFGCGR 257
           C         + +C   S++ C Y + YGD S+++G L  E   L N  V E+  FGCG 
Sbjct: 194 CE-------DAESC---SASNCVYSIVYGDKSFTQGFLAKEKFTLTNSDVLEDVYFGCGE 253

Query: 258 NNKGLFGGTSGLMGFGRSELSVVSQTSSAYGGVFSYCLPSTETGSSGSLTMGTGDFSNFR 317
           NN+GLF G +GL+G G  +LS+ +QT++ Y  +FSYCLPS  + S+G LT G+   S   
Sbjct: 254 NNQGLFDGVAGLLGLGPGKLSLPAQTTTTYNNIFSYCLPSFTSNSTGHLTFGSAGISESV 313

Query: 318 NISPISYTKMVPNPQMSNFYTLNLTGITVGGLKL-VVPRLAPSKGVLSLLDSGTVITRLP 377
             +PIS      N      Y +++ GI+VG  +L + P    ++G  +++DSGTV TRLP
Sbjct: 314 KFTPISSFPSAFN------YGIDIIGISVGDKELAITPNSFSTEG--AIIDSGTVFTRLP 373

Query: 378 PSVYQALKEEFVRQFSGYPTAAGYSLLDTCYNLSGLKEVKTPNVKLHFEGEGEMSVDVGG 437
             VY  L+  F  + S Y + +GY L DTCY+ +GL  V  P +   F G   + +D  G
Sbjct: 374 TKVYAELRSVFKEKMSSYKSTSGYGLFDTCYDFTGLDTVTYPTIAFSFAGSTVVELDGSG 433

Query: 438 LFYYVKSDGSQICLAFASLADEDQIGIIGSYQQKNQRVIYNL 475
           +   +K   SQ+CLAFA   ++D   I G+ QQ    V+Y++
Sbjct: 434 ISLPIKI--SQVCLAFA--GNDDLPAIFGNVQQTTLDVVYDV 452

BLAST of CmoCh08G005720 vs. ExPASy Swiss-Prot
Match: Q9LHE3 (Protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Arabidopsis thaliana OX=3702 GN=ASPG2 PE=2 SV=1)

HSP 1 Score: 236.1 bits (601), Expect = 1.6e-60
Identity = 164/487 (33.68%), Positives = 243/487 (49.90%), Query Frame = 0

Query: 11  LLLPLLFVLVD-----ARSSSIDAVSAFHQTLVLNGQKLPLMDMKIPATECIFHNPRVEK 70
           +LLPL F  +      + SSSI    +F    +++  + PL    + AT   F+N     
Sbjct: 1   MLLPLFFFFLHLHLHLSSSSSI----SFPDFQIIDVLQPPL---TVTATLPDFNNTHFSD 60

Query: 71  ETA---TFEMKERD-YCSGNIKDRDKNLQDRLVLDQIHVDSLLSRF--KYSTSLFTPHDI 130
           E++   T  +  RD + S   ++    L  R+  D   V ++L R   K   S  + +++
Sbjct: 61  ESSSKYTLRLLHRDRFPSVTYRNHHHRLHARMRRDTDRVSAILRRISGKVIPSSDSRYEV 120

Query: 131 SDTHLPLTLGTSLQTLNYLVTVRIG--PQNLTLIVDTGSDLTWVQCLPCHLCYNQQQPLF 190
           +D    +  G    +  Y V + +G  P++  +++D+GSD+ WVQC PC LCY Q  P+F
Sbjct: 121 NDFGSDIVSGMDQGSGEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQPCKLCYKQSDPVF 180

Query: 191 DPSNSPSFLSLPCNSSNCSAFQPTAGGSGACTNGSSNPCDYEVNYGDGSYSRGDLGFETL 250
           DP+ S S+  + C SS C   + +   SG C         YEV YGDGSY++G L  ETL
Sbjct: 181 DPAKSGSYTGVSCGSSVCDRIENSGCHSGGCR--------YEVMYGDGSYTKGTLALETL 240

Query: 251 NLGNVSVENFIFGCGRNNKGLFGGTSGLMGFGRSELSVVSQTSSAYGGVFSYCLPSTETG 310
                 V N   GCG  N+G+F G +GL+G G   +S V Q S   GG F YCL S  T 
Sbjct: 241 TFAKTVVRNVAMGCGHRNRGMFIGAAGLLGIGGGSMSFVGQLSGQTGGAFGYCLVSRGTD 300

Query: 311 SSGSLTMGTGDFSNFRNISPI--SYTKMVPNPQMSNFYTLNLTGITVGGLKLVVPRLAPS 370
           S+GSL  G       R   P+  S+  +V NP+  +FY + L G+ VGG+++ +P     
Sbjct: 301 STGSLVFG-------REALPVGASWVPLVRNPRAPSFYYVGLKGLGVGGVRIPLP----- 360

Query: 371 KGVLSL---------LDSGTVITRLPPSVYQALKEEFVRQFSGYPTAAGYSLLDTCYNLS 430
            GV  L         +D+GT +TRLP + Y A ++ F  Q +  P A+G S+ DTCY+LS
Sbjct: 361 DGVFDLTETGDGGVVMDTGTAVTRLPTAAYVAFRDGFKSQTANLPRASGVSIFDTCYDLS 420

Query: 431 GLKEVKTPNVKLHFEGEGEMSVDVGGLFYYVKSDGSQICLAFASLADEDQIGIIGSYQQK 474
           G   V+ P V  +F  EG +       F     D    C AFA  A    + IIG+ QQ+
Sbjct: 421 GFVSVRVPTVSFYFT-EGPVLTLPARNFLMPVDDSGTYCFAFA--ASPTGLSIIGNIQQE 457

BLAST of CmoCh08G005720 vs. ExPASy TrEMBL
Match: A0A6J1H6S6 (aspartyl protease family protein At5g10770-like OS=Cucurbita moschata OX=3662 GN=LOC111460718 PE=3 SV=1)

HSP 1 Score: 953.4 bits (2463), Expect = 7.1e-274
Identity = 478/478 (100.00%), Positives = 478/478 (100.00%), Query Frame = 0

Query: 1   METTRSLHPLLLLPLLFVLVDARSSSIDAVSAFHQTLVLNGQKLPLMDMKIPATECIFHN 60
           METTRSLHPLLLLPLLFVLVDARSSSIDAVSAFHQTLVLNGQKLPLMDMKIPATECIFHN
Sbjct: 1   METTRSLHPLLLLPLLFVLVDARSSSIDAVSAFHQTLVLNGQKLPLMDMKIPATECIFHN 60

Query: 61  PRVEKETATFEMKERDYCSGNIKDRDKNLQDRLVLDQIHVDSLLSRFKYSTSLFTPHDIS 120
           PRVEKETATFEMKERDYCSGNIKDRDKNLQDRLVLDQIHVDSLLSRFKYSTSLFTPHDIS
Sbjct: 61  PRVEKETATFEMKERDYCSGNIKDRDKNLQDRLVLDQIHVDSLLSRFKYSTSLFTPHDIS 120

Query: 121 DTHLPLTLGTSLQTLNYLVTVRIGPQNLTLIVDTGSDLTWVQCLPCHLCYNQQQPLFDPS 180
           DTHLPLTLGTSLQTLNYLVTVRIGPQNLTLIVDTGSDLTWVQCLPCHLCYNQQQPLFDPS
Sbjct: 121 DTHLPLTLGTSLQTLNYLVTVRIGPQNLTLIVDTGSDLTWVQCLPCHLCYNQQQPLFDPS 180

Query: 181 NSPSFLSLPCNSSNCSAFQPTAGGSGACTNGSSNPCDYEVNYGDGSYSRGDLGFETLNLG 240
           NSPSFLSLPCNSSNCSAFQPTAGGSGACTNGSSNPCDYEVNYGDGSYSRGDLGFETLNLG
Sbjct: 181 NSPSFLSLPCNSSNCSAFQPTAGGSGACTNGSSNPCDYEVNYGDGSYSRGDLGFETLNLG 240

Query: 241 NVSVENFIFGCGRNNKGLFGGTSGLMGFGRSELSVVSQTSSAYGGVFSYCLPSTETGSSG 300
           NVSVENFIFGCGRNNKGLFGGTSGLMGFGRSELSVVSQTSSAYGGVFSYCLPSTETGSSG
Sbjct: 241 NVSVENFIFGCGRNNKGLFGGTSGLMGFGRSELSVVSQTSSAYGGVFSYCLPSTETGSSG 300

Query: 301 SLTMGTGDFSNFRNISPISYTKMVPNPQMSNFYTLNLTGITVGGLKLVVPRLAPSKGVLS 360
           SLTMGTGDFSNFRNISPISYTKMVPNPQMSNFYTLNLTGITVGGLKLVVPRLAPSKGVLS
Sbjct: 301 SLTMGTGDFSNFRNISPISYTKMVPNPQMSNFYTLNLTGITVGGLKLVVPRLAPSKGVLS 360

Query: 361 LLDSGTVITRLPPSVYQALKEEFVRQFSGYPTAAGYSLLDTCYNLSGLKEVKTPNVKLHF 420
           LLDSGTVITRLPPSVYQALKEEFVRQFSGYPTAAGYSLLDTCYNLSGLKEVKTPNVKLHF
Sbjct: 361 LLDSGTVITRLPPSVYQALKEEFVRQFSGYPTAAGYSLLDTCYNLSGLKEVKTPNVKLHF 420

Query: 421 EGEGEMSVDVGGLFYYVKSDGSQICLAFASLADEDQIGIIGSYQQKNQRVIYNLKESK 479
           EGEGEMSVDVGGLFYYVKSDGSQICLAFASLADEDQIGIIGSYQQKNQRVIYNLKESK
Sbjct: 421 EGEGEMSVDVGGLFYYVKSDGSQICLAFASLADEDQIGIIGSYQQKNQRVIYNLKESK 478

BLAST of CmoCh08G005720 vs. ExPASy TrEMBL
Match: A0A6J1KR96 (aspartyl protease family protein At5g10770-like OS=Cucurbita maxima OX=3661 GN=LOC111497952 PE=3 SV=1)

HSP 1 Score: 888.3 bits (2294), Expect = 2.8e-254
Identity = 447/481 (92.93%), Positives = 461/481 (95.84%), Query Frame = 0

Query: 1   METTRSLHP---LLLLPLLFVLVDARSSSIDAVSAFHQTLVLNGQKLPLMDMKIPATECI 60
           MET RSLH    LLLLPL+FVLVDARSSSIDA+SAF+QTLVLN QKLPLMDMKIPAT+CI
Sbjct: 1   METARSLHSLLLLLLLPLIFVLVDARSSSIDAISAFYQTLVLNEQKLPLMDMKIPATDCI 60

Query: 61  FHNPRVEKETATFEMKERDYCSGNIKDRDKNLQDRLVLDQIHVDSLLSRFKYSTSLFTPH 120
           FH PRVEKETATFEMKERDYCSGNIK RDKNLQDRL+LD+IHVDSLLSRFKY+TSLFTPH
Sbjct: 61  FHKPRVEKETATFEMKERDYCSGNIKHRDKNLQDRLILDEIHVDSLLSRFKYATSLFTPH 120

Query: 121 DISDTHLPLTLGTSLQTLNYLVTVRIGPQNLTLIVDTGSDLTWVQCLPCHLCYNQQQPLF 180
           DISDTHLPLTLGTSLQTLNYLVTVRIGPQNLTLIVDTGSDLTWVQCLPCHLCYNQQQPLF
Sbjct: 121 DISDTHLPLTLGTSLQTLNYLVTVRIGPQNLTLIVDTGSDLTWVQCLPCHLCYNQQQPLF 180

Query: 181 DPSNSPSFLSLPCNSSNCSAFQPTAGGSGACTNGSSNPCDYEVNYGDGSYSRGDLGFETL 240
           DPSNSPSF+ LPCNS+NCSAFQPTAGGSGACTNGSSNPCDYEVNYGDGSYSRGDLGFETL
Sbjct: 181 DPSNSPSFVPLPCNSNNCSAFQPTAGGSGACTNGSSNPCDYEVNYGDGSYSRGDLGFETL 240

Query: 241 NLGNVSVENFIFGCGRNNKGLFGGTSGLMGFGRSELSVVSQTSSAYGGVFSYCLPSTETG 300
            LGNVSVENFIFGCGRNNKGLFGGTSGL+GFGRSELSVVSQ+SS YGGVFSYCLPSTETG
Sbjct: 241 KLGNVSVENFIFGCGRNNKGLFGGTSGLIGFGRSELSVVSQSSSVYGGVFSYCLPSTETG 300

Query: 301 SSGSLTMGTGDFSNFRNISPISYTKMVPNPQMSNFYTLNLTGITVGGLKLVVPRLAPSKG 360
           SSGSLT+G GDFSNFRNISPISYTKMVPNPQMSNFYTLNLTGITVGGLKLVVPRLA S G
Sbjct: 301 SSGSLTLGAGDFSNFRNISPISYTKMVPNPQMSNFYTLNLTGITVGGLKLVVPRLASSNG 360

Query: 361 VLSLLDSGTVITRLPPSVYQALKEEFVRQFSGYPTAAGYSLLDTCYNLSGLKEVKTPNVK 420
           VLSLLDSGTVITRLPPSVY+ALKEEF RQFS Y TA GYSLLDTCYNLSGLK+VKTPNVK
Sbjct: 361 VLSLLDSGTVITRLPPSVYKALKEEFERQFSVYRTAPGYSLLDTCYNLSGLKQVKTPNVK 420

Query: 421 LHFEGEGEMSVDVGGLFYYVKSDGSQICLAFASLADEDQIGIIGSYQQKNQRVIYNLKES 479
           LHFE EGEMSVDVGGLFYYVKSDGSQICLAFASLADE QIGIIGSYQQKNQRVIYNLKES
Sbjct: 421 LHFEDEGEMSVDVGGLFYYVKSDGSQICLAFASLADEYQIGIIGSYQQKNQRVIYNLKES 480

BLAST of CmoCh08G005720 vs. ExPASy TrEMBL
Match: A0A0A0K8J2 (Peptidase A1 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_7G431320 PE=3 SV=1)

HSP 1 Score: 605.9 bits (1561), Expect = 2.8e-169
Identity = 311/486 (63.99%), Positives = 370/486 (76.13%), Query Frame = 0

Query: 1   METTRSLH------PLLLLPLLFVLVDARSSSIDAVSA--FHQTLVLNGQKLPLMDMKIP 60
           ME ++SLH       LLLLPLL + VDARSSS +  +     + L+   Q  P  +    
Sbjct: 1   MEISKSLHFPLSLLLLLLLPLLSIGVDARSSSFNLGNGDNHEKGLLQLFQNFPWKEHGEA 60

Query: 61  ATECIFHNPRVEKETATFEMKERDYCSGNIKDRDKNLQDRLVLDQIHVDSLLSRFKYSTS 120
              CIF  P++ K   T EMK+RDYCSG I D +K  Q+R++LD I+V+SL S FK +  
Sbjct: 61  VVNCIFQKPKITKGITTLEMKQRDYCSGKITDWEKIFQNRIILDAINVNSLFSHFKSAIF 120

Query: 121 LFTPHDISDTHLPLTLGTSLQTLNYLVTVRIGPQNLTLIVDTGSDLTWVQCLPCHLCYNQ 180
               H +SD+ +P++ G  LQTLNY+VTV IG QN TLIVDTGSDLTWVQCLPC LCYNQ
Sbjct: 121 PGQTHQLSDSQIPISSGARLQTLNYIVTVGIGGQNSTLIVDTGSDLTWVQCLPCRLCYNQ 180

Query: 181 QQPLFDPSNSPSFLSLPCNSSNCSAFQPTAGGSGACTNGSSNPCDYEVNYGDGSYSRGDL 240
           Q+PLF+PSNS SFLSLPCNS  C A QPTAG SG C+N +S  CDY+++YGDGSYSRG+L
Sbjct: 181 QEPLFNPSNSSSFLSLPCNSPTCVALQPTAGSSGLCSNKNSTSCDYQIDYGDGSYSRGEL 240

Query: 241 GFETLNLGNVSVENFIFGCGRNNKGLFGGTSGLMGFGRSELSVVSQTSSAYGGVFSYCLP 300
           GFE L LG   ++NFIFGCGRNNKGLFGG SGLMG  RSELS+VSQTSS +G VFSYCLP
Sbjct: 241 GFEKLTLGKTEIDNFIFGCGRNNKGLFGGASGLMGLARSELSLVSQTSSLFGSVFSYCLP 300

Query: 301 STETGSSGSLTMGTGDFSNFRNISPISYTKMVPNPQMSNFYTLNLTGITVGGLKLVVPRL 360
           +T  GSSGSLT+G  DFSNF+NISPISYT+M+ NPQMSNFY LNLTGI++GG+ L VPRL
Sbjct: 301 TTGVGSSGSLTLGGADFSNFKNISPISYTRMIQNPQMSNFYFLNLTGISIGGVNLNVPRL 360

Query: 361 APSKGVLSLLDSGTVITRLPPSVYQALKEEFVRQFSGYPTAAGYSLLDTCYNLSGLKEVK 420
           + ++GVLSLLDSGTVITRL PS+Y+A K EF +QFSGY T  G+S+L+TC+NL+G +EV 
Sbjct: 361 SSNEGVLSLLDSGTVITRLSPSIYKAFKAEFEKQFSGYRTTPGFSILNTCFNLTGYEEVN 420

Query: 421 TPNVKLHFEGEGEMSVDVGGLFYYVKSDGSQICLAFASLADEDQIGIIGSYQQKNQRVIY 479
            P VK  FEG  EM VDV G+FY+VKSD SQICLAFASL  EDQ  IIG+YQQKNQRVIY
Sbjct: 421 IPTVKFIFEGNAEMIVDVEGVFYFVKSDASQICLAFASLGYEDQTMIIGNYQQKNQRVIY 480

BLAST of CmoCh08G005720 vs. ExPASy TrEMBL
Match: A0A1S3CDQ0 (aspartyl protease family protein At5g10770 OS=Cucumis melo OX=3656 GN=LOC103499859 PE=3 SV=1)

HSP 1 Score: 605.1 bits (1559), Expect = 4.8e-169
Identity = 306/484 (63.22%), Positives = 371/484 (76.65%), Query Frame = 0

Query: 1   METTRSLH------PLLLLPLLFVLVDARSSSIDAVSAFHQTLVLNGQKLPLMDMKIPAT 60
           ME ++SLH       LLLLPLLF++VDARSS  +  +   + L+   Q  P  +      
Sbjct: 3   MEVSKSLHFPLSLLFLLLLPLLFIIVDARSSVGNGGNYHEKGLLQLFQNFPWKEHGEAVV 62

Query: 61  ECIFHNPRVEKETATFEMKERDYCSGNIKDRDKNLQDRLVLDQIHVDSLLSRFKYSTSLF 120
            CIF  P++ K   T EMK+RDYCSG I D +K  Q+R++LD I+V+SLLS  K +    
Sbjct: 63  NCIFQKPKITKGITTLEMKQRDYCSGKITDLEKIFQNRIILDAINVNSLLSHVKSAIFPG 122

Query: 121 TPHDISDTHLPLTLGTSLQTLNYLVTVRIGPQNLTLIVDTGSDLTWVQCLPCHLCYNQQQ 180
             H +SD+ +P++ G  LQTLNY+VTV IG QN TLIVDTGSDLTWVQCLPC LCYNQQ+
Sbjct: 123 QTHQLSDSQIPISSGARLQTLNYIVTVGIGGQNSTLIVDTGSDLTWVQCLPCRLCYNQQE 182

Query: 181 PLFDPSNSPSFLSLPCNSSNCSAFQPTAGGSGACTNGSSNPCDYEVNYGDGSYSRGDLGF 240
           PLF+PSNS SFLSLPC+S  C A QPTAG SG C+N +S  CDY+++YGDGSYSRG+LG+
Sbjct: 183 PLFNPSNSSSFLSLPCSSPTCLALQPTAGSSGLCSNKNSTSCDYQIDYGDGSYSRGELGY 242

Query: 241 ETLNLGNVSVENFIFGCGRNNKGLFGGTSGLMGFGRSELSVVSQTSSAYGGVFSYCLPST 300
           E L LG   ++NFIFGCGRNNKGLFGG SGLMG  RSELS+VSQTSS +G +FSYCLP+T
Sbjct: 243 EKLTLGKTEIDNFIFGCGRNNKGLFGGASGLMGLARSELSLVSQTSSVFGSIFSYCLPTT 302

Query: 301 ETGSSGSLTMGTGDFSNFRNISPISYTKMVPNPQMSNFYTLNLTGITVGGLKLVVPRLAP 360
             GSSGSLT+G  DFS+F+NISPISYT+M+ NPQMSNFY LNLTGI++GG+ L VPRL+ 
Sbjct: 303 GVGSSGSLTLGGTDFSSFKNISPISYTRMIQNPQMSNFYFLNLTGISIGGVNLNVPRLSS 362

Query: 361 SKGVLSLLDSGTVITRLPPSVYQALKEEFVRQFSGYPTAAGYSLLDTCYNLSGLKEVKTP 420
           ++GVLSLLDSGTVITRL PS+Y+A K EF +QFSGY T  G+S+L+TC+NL+G +EV  P
Sbjct: 363 NEGVLSLLDSGTVITRLSPSIYKAFKAEFEKQFSGYRTTPGFSILNTCFNLTGYEEVNIP 422

Query: 421 NVKLHFEGEGEMSVDVGGLFYYVKSDGSQICLAFASLADEDQIGIIGSYQQKNQRVIYNL 479
            VK  FEG  EM VDV G+FY+VKSD SQICLAFASL  EDQ  IIG+YQQKNQRV+YN 
Sbjct: 423 TVKFIFEGNAEMIVDVEGVFYFVKSDASQICLAFASLGYEDQTMIIGNYQQKNQRVVYNS 482
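
The percentages in each HSP header follow directly from the counts next to them (identical or positive positions divided by the alignment length). The sketch below is an illustration only and not part of the database's tooling: it parses the two header lines of the A0A1S3CDQ0 hit above and recomputes both percentages. The function name `parse_hsp` and the regular expressions are assumptions written for the layout used on this page.

```python
import re

# Patterns for the two HSP header lines as laid out on this page (assumed layout).
# "Pos\w*tives" accepts both the "Positives" and the "Postives" spelling.
SCORE_RE = re.compile(
    r"Score:\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*(\S+)"
)
IDENT_RE = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\),\s*"
    r"Pos\w*tives\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)"
)


def parse_hsp(score_line, identity_line):
    """Extract scores and recompute percent identity/positives for one HSP."""
    s = SCORE_RE.search(score_line)
    i = IDENT_RE.search(identity_line)
    if s is None or i is None:
        raise ValueError("lines do not look like an HSP header")
    identical, aligned = int(i.group(1)), int(i.group(2))
    positive = int(i.group(4))
    return {
        "bits": float(s.group(1)),
        "raw_score": int(s.group(2)),
        "expect": float(s.group(3)),
        "aligned_length": aligned,
        # Recomputed from the counts, rounded like the page's two decimals.
        "identity_pct": round(100.0 * identical / aligned, 2),
        "positive_pct": round(100.0 * positive / aligned, 2),
        # Values as printed in the header, kept for comparison.
        "reported_identity_pct": float(i.group(3)),
        "reported_positive_pct": float(i.group(6)),
    }


hsp = parse_hsp(
    "HSP 1 Score: 605.1 bits (1559), Expect = 4.8e-169",
    "Identity = 306/484 (63.22%), Positives = 371/484 (76.65%), Query Frame = 0",
)
print(hsp["identity_pct"], hsp["positive_pct"])
```

For this hit the recomputed values agree with the printed ones (306/484 and 371/484).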

BLAST of CmoCh08G005720 vs. ExPASy TrEMBL
Match: A0A5A7UYY6 (Aspartyl protease family protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold339G001610 PE=3 SV=1)

HSP 1 Score: 602.1 bits (1551), Expect = 4.0e-168
Identity = 308/485 (63.51%), Positives = 372/485 (76.70%), Query Frame = 0

Query: 1   METTRSLH-PL----LLLPLLFVLVDARSSS--IDAVSAFHQTLVLNGQKLPLMDMKIPA 60
           ME ++SLH PL    LLLPLL ++VDARSSS  +   S   + L+   Q  P  +     
Sbjct: 3   MEVSKSLHFPLSLLFLLLPLLSIIVDARSSSFGVGNGSNHEKGLLQLFQNFPWKEHGEAV 62

Query: 61  TECIFHNPRVEKETATFEMKERDYCSGNIKDRDKNLQDRLVLDQIHVDSLLSRFKYSTSL 120
             CIF  P++ K   T EMK+RDYCSG I D +K  Q+R++LD I+V+SLLS  K +   
Sbjct: 63  VNCIFQKPKITKGITTLEMKQRDYCSGKITDLEKIFQNRIILDAINVNSLLSHVKSAIFP 122

Query: 121 FTPHDISDTHLPLTLGTSLQTLNYLVTVRIGPQNLTLIVDTGSDLTWVQCLPCHLCYNQQ 180
              H +SD+ +P++ G  LQTLNY+VTV IG QN TLIVDTGSDLTWVQCLPC LCYNQQ
Sbjct: 123 GQTHQLSDSQIPISSGARLQTLNYIVTVGIGGQNSTLIVDTGSDLTWVQCLPCRLCYNQQ 182

Query: 181 QPLFDPSNSPSFLSLPCNSSNCSAFQPTAGGSGACTNGSSNPCDYEVNYGDGSYSRGDLG 240
           +PLF+PSNS SFLSLPC+S  C A QPTAG SG C+N +S  CDY+++YGDGSYSRG+LG
Sbjct: 183 EPLFNPSNSSSFLSLPCSSPTCLALQPTAGSSGLCSNKNSTSCDYQIDYGDGSYSRGELG 242

Query: 241 FETLNLGNVSVENFIFGCGRNNKGLFGGTSGLMGFGRSELSVVSQTSSAYGGVFSYCLPS 300
           +E L LG   ++NFIFGCGRNNKGLFGG SGLMG  RSELS+VSQTSS +G +FSYCLP+
Sbjct: 243 YEKLTLGKTEIDNFIFGCGRNNKGLFGGASGLMGLARSELSLVSQTSSVFGSIFSYCLPT 302

Query: 301 TETGSSGSLTMGTGDFSNFRNISPISYTKMVPNPQMSNFYTLNLTGITVGGLKLVVPRLA 360
           T  GSSGSLT+G  DFS+F+NISPISYT+M+ NPQMSNFY LNLTGI++GG+ L VPRL+
Sbjct: 303 TGVGSSGSLTLGGTDFSSFKNISPISYTRMIQNPQMSNFYFLNLTGISIGGVNLNVPRLS 362

Query: 361 PSKGVLSLLDSGTVITRLPPSVYQALKEEFVRQFSGYPTAAGYSLLDTCYNLSGLKEVKT 420
            ++GVLSLLDSGTVITRL PS+Y+A K EF +QFSGY T  G+S+L+TC+NL+G +EV  
Sbjct: 363 SNEGVLSLLDSGTVITRLSPSIYKAFKAEFEKQFSGYRTTPGFSILNTCFNLTGYEEVNI 422

Query: 421 PNVKLHFEGEGEMSVDVGGLFYYVKSDGSQICLAFASLADEDQIGIIGSYQQKNQRVIYN 479
           P VK  FEG  EM VDV G+FY+VKSD SQICLAFASL  EDQ  IIG+YQQKNQRV+YN
Sbjct: 423 PTVKFIFEGNAEMIVDVEGVFYFVKSDASQICLAFASLGYEDQTMIIGNYQQKNQRVVYN 482

BLAST of CmoCh08G005720 vs. NCBI nr
Match: XP_022959733.1 (aspartyl protease family protein At5g10770-like [Cucurbita moschata])

HSP 1 Score: 953.4 bits (2463), Expect = 1.5e-273
Identity = 478/478 (100.00%), Positives = 478/478 (100.00%), Query Frame = 0

Query: 1   METTRSLHPLLLLPLLFVLVDARSSSIDAVSAFHQTLVLNGQKLPLMDMKIPATECIFHN 60
           METTRSLHPLLLLPLLFVLVDARSSSIDAVSAFHQTLVLNGQKLPLMDMKIPATECIFHN
Sbjct: 1   METTRSLHPLLLLPLLFVLVDARSSSIDAVSAFHQTLVLNGQKLPLMDMKIPATECIFHN 60

Query: 61  PRVEKETATFEMKERDYCSGNIKDRDKNLQDRLVLDQIHVDSLLSRFKYSTSLFTPHDIS 120
           PRVEKETATFEMKERDYCSGNIKDRDKNLQDRLVLDQIHVDSLLSRFKYSTSLFTPHDIS
Sbjct: 61  PRVEKETATFEMKERDYCSGNIKDRDKNLQDRLVLDQIHVDSLLSRFKYSTSLFTPHDIS 120

Query: 121 DTHLPLTLGTSLQTLNYLVTVRIGPQNLTLIVDTGSDLTWVQCLPCHLCYNQQQPLFDPS 180
           DTHLPLTLGTSLQTLNYLVTVRIGPQNLTLIVDTGSDLTWVQCLPCHLCYNQQQPLFDPS
Sbjct: 121 DTHLPLTLGTSLQTLNYLVTVRIGPQNLTLIVDTGSDLTWVQCLPCHLCYNQQQPLFDPS 180

Query: 181 NSPSFLSLPCNSSNCSAFQPTAGGSGACTNGSSNPCDYEVNYGDGSYSRGDLGFETLNLG 240
           NSPSFLSLPCNSSNCSAFQPTAGGSGACTNGSSNPCDYEVNYGDGSYSRGDLGFETLNLG
Sbjct: 181 NSPSFLSLPCNSSNCSAFQPTAGGSGACTNGSSNPCDYEVNYGDGSYSRGDLGFETLNLG 240

Query: 241 NVSVENFIFGCGRNNKGLFGGTSGLMGFGRSELSVVSQTSSAYGGVFSYCLPSTETGSSG 300
           NVSVENFIFGCGRNNKGLFGGTSGLMGFGRSELSVVSQTSSAYGGVFSYCLPSTETGSSG
Sbjct: 241 NVSVENFIFGCGRNNKGLFGGTSGLMGFGRSELSVVSQTSSAYGGVFSYCLPSTETGSSG 300

Query: 301 SLTMGTGDFSNFRNISPISYTKMVPNPQMSNFYTLNLTGITVGGLKLVVPRLAPSKGVLS 360
           SLTMGTGDFSNFRNISPISYTKMVPNPQMSNFYTLNLTGITVGGLKLVVPRLAPSKGVLS
Sbjct: 301 SLTMGTGDFSNFRNISPISYTKMVPNPQMSNFYTLNLTGITVGGLKLVVPRLAPSKGVLS 360

Query: 361 LLDSGTVITRLPPSVYQALKEEFVRQFSGYPTAAGYSLLDTCYNLSGLKEVKTPNVKLHF 420
           LLDSGTVITRLPPSVYQALKEEFVRQFSGYPTAAGYSLLDTCYNLSGLKEVKTPNVKLHF
Sbjct: 361 LLDSGTVITRLPPSVYQALKEEFVRQFSGYPTAAGYSLLDTCYNLSGLKEVKTPNVKLHF 420

Query: 421 EGEGEMSVDVGGLFYYVKSDGSQICLAFASLADEDQIGIIGSYQQKNQRVIYNLKESK 479
           EGEGEMSVDVGGLFYYVKSDGSQICLAFASLADEDQIGIIGSYQQKNQRVIYNLKESK
Sbjct: 421 EGEGEMSVDVGGLFYYVKSDGSQICLAFASLADEDQIGIIGSYQQKNQRVIYNLKESK 478

BLAST of CmoCh08G005720 vs. NCBI nr
Match: KAG6593355.1 (Aspartyl protease family protein, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 919.5 bits (2375), Expect = 2.4e-263
Identity = 464/481 (96.47%), Positives = 468/481 (97.30%), Query Frame = 0

Query: 1   METTRSLHP---LLLLPLLFVLVDARSSSIDAVSAFHQTLVLNGQKLPLMDMKIPATECI 60
           METTRSLHP   LLLLPLLFVLVDARSSSIDA+SAFHQTLVLN QKLPLMDMKIPATECI
Sbjct: 1   METTRSLHPLLLLLLLPLLFVLVDARSSSIDAISAFHQTLVLNKQKLPLMDMKIPATECI 60

Query: 61  FHNPRVEKETATFEMKERDYCSGNIKDRDKNLQDRLVLDQIHVDSLLSRFKYSTSLFTPH 120
           FH PRVEKETATFEMKERDYCSGN+KDRDKNLQDRLVLD+IHVDSLLSRFKYS SLFTPH
Sbjct: 61  FHKPRVEKETATFEMKERDYCSGNMKDRDKNLQDRLVLDKIHVDSLLSRFKYSISLFTPH 120

Query: 121 DISDTHLPLTLGTSLQTLNYLVTVRIGPQNLTLIVDTGSDLTWVQCLPCHLCYNQQQPLF 180
            ISDTHLPLTLGTSLQTLNYLVTVRIGPQNLTLIVDTGSDLTWVQCLPCHLCYNQQQPLF
Sbjct: 121 GISDTHLPLTLGTSLQTLNYLVTVRIGPQNLTLIVDTGSDLTWVQCLPCHLCYNQQQPLF 180

Query: 181 DPSNSPSFLSLPCNSSNCSAFQPTAGGSGACTNGSSNPCDYEVNYGDGSYSRGDLGFETL 240
           DPSNSPSF+SLPCNSSNCSAFQPTAGGSGACTNGSSNPCDYEVNYGDGSYSRGDLGFETL
Sbjct: 181 DPSNSPSFVSLPCNSSNCSAFQPTAGGSGACTNGSSNPCDYEVNYGDGSYSRGDLGFETL 240

Query: 241 NLGNVSVENFIFGCGRNNKGLFGGTSGLMGFGRSELSVVSQTSSAYGGVFSYCLPSTETG 300
            LGNVSVENFIFGCGRNNKGLFGGTSGLMGFGRSELSVVSQTSS YGGVFSYCLPSTETG
Sbjct: 241 KLGNVSVENFIFGCGRNNKGLFGGTSGLMGFGRSELSVVSQTSSVYGGVFSYCLPSTETG 300

Query: 301 SSGSLTMGTGDFSNFRNISPISYTKMVPNPQMSNFYTLNLTGITVGGLKLVVPRLAPSKG 360
           SSGSLTMG GDFSNFRNISPISYTKMVPNPQMSNFYTLNLTGITVGGLKLVVPRLAPSKG
Sbjct: 301 SSGSLTMGAGDFSNFRNISPISYTKMVPNPQMSNFYTLNLTGITVGGLKLVVPRLAPSKG 360

Query: 361 VLSLLDSGTVITRLPPSVYQALKEEFVRQFSGYPTAAGYSLLDTCYNLSGLKEVKTPNVK 420
           VLSLLDSGTVITRLPPSVYQALKEEFVRQFSGYPTAAGYSLLDTCYNLSGLKEVKTPNVK
Sbjct: 361 VLSLLDSGTVITRLPPSVYQALKEEFVRQFSGYPTAAGYSLLDTCYNLSGLKEVKTPNVK 420

Query: 421 LHFEGEGEMSVDVGGLFYYVKSDGSQICLAFASLADEDQIGIIGSYQQKNQRVIYNLKES 479
           LHFE EG MSVDVGGLFYYVKSDGSQICLAFASLADE QIGIIGSYQQKNQRVIYNLKES
Sbjct: 421 LHFEDEGVMSVDVGGLFYYVKSDGSQICLAFASLADEHQIGIIGSYQQKNQRVIYNLKES 480

BLAST of CmoCh08G005720 vs. NCBI nr
Match: XP_023004737.1 (aspartyl protease family protein At5g10770-like [Cucurbita maxima])

HSP 1 Score: 888.3 bits (2294), Expect = 5.8e-254
Identity = 447/481 (92.93%), Positives = 461/481 (95.84%), Query Frame = 0

Query: 1   METTRSLHP---LLLLPLLFVLVDARSSSIDAVSAFHQTLVLNGQKLPLMDMKIPATECI 60
           MET RSLH    LLLLPL+FVLVDARSSSIDA+SAF+QTLVLN QKLPLMDMKIPAT+CI
Sbjct: 1   METARSLHSLLLLLLLPLIFVLVDARSSSIDAISAFYQTLVLNEQKLPLMDMKIPATDCI 60

Query: 61  FHNPRVEKETATFEMKERDYCSGNIKDRDKNLQDRLVLDQIHVDSLLSRFKYSTSLFTPH 120
           FH PRVEKETATFEMKERDYCSGNIK RDKNLQDRL+LD+IHVDSLLSRFKY+TSLFTPH
Sbjct: 61  FHKPRVEKETATFEMKERDYCSGNIKHRDKNLQDRLILDEIHVDSLLSRFKYATSLFTPH 120

Query: 121 DISDTHLPLTLGTSLQTLNYLVTVRIGPQNLTLIVDTGSDLTWVQCLPCHLCYNQQQPLF 180
           DISDTHLPLTLGTSLQTLNYLVTVRIGPQNLTLIVDTGSDLTWVQCLPCHLCYNQQQPLF
Sbjct: 121 DISDTHLPLTLGTSLQTLNYLVTVRIGPQNLTLIVDTGSDLTWVQCLPCHLCYNQQQPLF 180

Query: 181 DPSNSPSFLSLPCNSSNCSAFQPTAGGSGACTNGSSNPCDYEVNYGDGSYSRGDLGFETL 240
           DPSNSPSF+ LPCNS+NCSAFQPTAGGSGACTNGSSNPCDYEVNYGDGSYSRGDLGFETL
Sbjct: 181 DPSNSPSFVPLPCNSNNCSAFQPTAGGSGACTNGSSNPCDYEVNYGDGSYSRGDLGFETL 240

Query: 241 NLGNVSVENFIFGCGRNNKGLFGGTSGLMGFGRSELSVVSQTSSAYGGVFSYCLPSTETG 300
            LGNVSVENFIFGCGRNNKGLFGGTSGL+GFGRSELSVVSQ+SS YGGVFSYCLPSTETG
Sbjct: 241 KLGNVSVENFIFGCGRNNKGLFGGTSGLIGFGRSELSVVSQSSSVYGGVFSYCLPSTETG 300

Query: 301 SSGSLTMGTGDFSNFRNISPISYTKMVPNPQMSNFYTLNLTGITVGGLKLVVPRLAPSKG 360
           SSGSLT+G GDFSNFRNISPISYTKMVPNPQMSNFYTLNLTGITVGGLKLVVPRLA S G
Sbjct: 301 SSGSLTLGAGDFSNFRNISPISYTKMVPNPQMSNFYTLNLTGITVGGLKLVVPRLASSNG 360

Query: 361 VLSLLDSGTVITRLPPSVYQALKEEFVRQFSGYPTAAGYSLLDTCYNLSGLKEVKTPNVK 420
           VLSLLDSGTVITRLPPSVY+ALKEEF RQFS Y TA GYSLLDTCYNLSGLK+VKTPNVK
Sbjct: 361 VLSLLDSGTVITRLPPSVYKALKEEFERQFSVYRTAPGYSLLDTCYNLSGLKQVKTPNVK 420

Query: 421 LHFEGEGEMSVDVGGLFYYVKSDGSQICLAFASLADEDQIGIIGSYQQKNQRVIYNLKES 479
           LHFE EGEMSVDVGGLFYYVKSDGSQICLAFASLADE QIGIIGSYQQKNQRVIYNLKES
Sbjct: 421 LHFEDEGEMSVDVGGLFYYVKSDGSQICLAFASLADEYQIGIIGSYQQKNQRVIYNLKES 480

BLAST of CmoCh08G005720 vs. NCBI nr
Match: KAG7025700.1 (Aspartyl protease family protein, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 793.1 bits (2047), Expect = 2.5e-225
Identity = 396/407 (97.30%), Positives = 398/407 (97.79%), Query Frame = 0

Query: 72  MKERDYCSGNIKDRDKNLQDRLVLDQIHVDSLLSRFKYSTSLFTPHDISDTHLPLTLGTS 131
           MKERDYCSGN KDRDKNLQDRLVLD+IHVDSLLSRFKYS SLFTPHDISDTHLPLTLGTS
Sbjct: 1   MKERDYCSGNKKDRDKNLQDRLVLDKIHVDSLLSRFKYSISLFTPHDISDTHLPLTLGTS 60

Query: 132 LQTLNYLVTVRIGPQNLTLIVDTGSDLTWVQCLPCHLCYNQQQPLFDPSNSPSFLSLPCN 191
           LQTLNYLVTVRIGPQNLTLIVDTGSDLTWVQCLPCHLCYNQQQPLFDPSNSPSF+ LPCN
Sbjct: 61  LQTLNYLVTVRIGPQNLTLIVDTGSDLTWVQCLPCHLCYNQQQPLFDPSNSPSFVFLPCN 120

Query: 192 SSNCSAFQPTAGGSGACTNGSSNPCDYEVNYGDGSYSRGDLGFETLNLGNVSVENFIFGC 251
           SSNCSAFQPTAGGSGACTNGSSNPCDYEVNYGDGSYSRGDLGFETL LGNVSVENFIFGC
Sbjct: 121 SSNCSAFQPTAGGSGACTNGSSNPCDYEVNYGDGSYSRGDLGFETLKLGNVSVENFIFGC 180

Query: 252 GRNNKGLFGGTSGLMGFGRSELSVVSQTSSAYGGVFSYCLPSTETGSSGSLTMGTGDFSN 311
           GRNNKGLFGGTSGLMGFGRSELSVVSQTSS YGGVFSYCLPSTETGSSGSLTMG GDFSN
Sbjct: 181 GRNNKGLFGGTSGLMGFGRSELSVVSQTSSVYGGVFSYCLPSTETGSSGSLTMGAGDFSN 240

Query: 312 FRNISPISYTKMVPNPQMSNFYTLNLTGITVGGLKLVVPRLAPSKGVLSLLDSGTVITRL 371
           FRNISPISYTKMVPNPQMSNFYTLNLTGITVGGLKLVVPRLAPSKGVLSLLDSGTVITRL
Sbjct: 241 FRNISPISYTKMVPNPQMSNFYTLNLTGITVGGLKLVVPRLAPSKGVLSLLDSGTVITRL 300

Query: 372 PPSVYQALKEEFVRQFSGYPTAAGYSLLDTCYNLSGLKEVKTPNVKLHFEGEGEMSVDVG 431
           PPSVYQALKEEFVRQFSGYPTAAGYSLLDTCYNLSGLKEVKTPNVKLHFE EG MSVDVG
Sbjct: 301 PPSVYQALKEEFVRQFSGYPTAAGYSLLDTCYNLSGLKEVKTPNVKLHFEDEGVMSVDVG 360

Query: 432 GLFYYVKSDGSQICLAFASLADEDQIGIIGSYQQKNQRVIYNLKESK 479
           GLFYYVKSDGSQICLAFASLADE QIGIIGSYQQKNQRVIYNLKESK
Sbjct: 361 GLFYYVKSDGSQICLAFASLADEHQIGIIGSYQQKNQRVIYNLKESK 407

BLAST of CmoCh08G005720 vs. NCBI nr
Match: XP_004135889.2 (aspartyl protease family protein At5g10770 [Cucumis sativus] >KGN45199.1 hypothetical protein Csa_015932 [Cucumis sativus])

HSP 1 Score: 605.9 bits (1561), Expect = 5.8e-169
Identity = 311/486 (63.99%), Positives = 370/486 (76.13%), Query Frame = 0

Query: 1   METTRSLH------PLLLLPLLFVLVDARSSSIDAVSA--FHQTLVLNGQKLPLMDMKIP 60
           ME ++SLH       LLLLPLL + VDARSSS +  +     + L+   Q  P  +    
Sbjct: 1   MEISKSLHFPLSLLLLLLLPLLSIGVDARSSSFNLGNGDNHEKGLLQLFQNFPWKEHGEA 60

Query: 61  ATECIFHNPRVEKETATFEMKERDYCSGNIKDRDKNLQDRLVLDQIHVDSLLSRFKYSTS 120
              CIF  P++ K   T EMK+RDYCSG I D +K  Q+R++LD I+V+SL S FK +  
Sbjct: 61  VVNCIFQKPKITKGITTLEMKQRDYCSGKITDWEKIFQNRIILDAINVNSLFSHFKSAIF 120

Query: 121 LFTPHDISDTHLPLTLGTSLQTLNYLVTVRIGPQNLTLIVDTGSDLTWVQCLPCHLCYNQ 180
               H +SD+ +P++ G  LQTLNY+VTV IG QN TLIVDTGSDLTWVQCLPC LCYNQ
Sbjct: 121 PGQTHQLSDSQIPISSGARLQTLNYIVTVGIGGQNSTLIVDTGSDLTWVQCLPCRLCYNQ 180

Query: 181 QQPLFDPSNSPSFLSLPCNSSNCSAFQPTAGGSGACTNGSSNPCDYEVNYGDGSYSRGDL 240
           Q+PLF+PSNS SFLSLPCNS  C A QPTAG SG C+N +S  CDY+++YGDGSYSRG+L
Sbjct: 181 QEPLFNPSNSSSFLSLPCNSPTCVALQPTAGSSGLCSNKNSTSCDYQIDYGDGSYSRGEL 240

Query: 241 GFETLNLGNVSVENFIFGCGRNNKGLFGGTSGLMGFGRSELSVVSQTSSAYGGVFSYCLP 300
           GFE L LG   ++NFIFGCGRNNKGLFGG SGLMG  RSELS+VSQTSS +G VFSYCLP
Sbjct: 241 GFEKLTLGKTEIDNFIFGCGRNNKGLFGGASGLMGLARSELSLVSQTSSLFGSVFSYCLP 300

Query: 301 STETGSSGSLTMGTGDFSNFRNISPISYTKMVPNPQMSNFYTLNLTGITVGGLKLVVPRL 360
           +T  GSSGSLT+G  DFSNF+NISPISYT+M+ NPQMSNFY LNLTGI++GG+ L VPRL
Sbjct: 301 TTGVGSSGSLTLGGADFSNFKNISPISYTRMIQNPQMSNFYFLNLTGISIGGVNLNVPRL 360

Query: 361 APSKGVLSLLDSGTVITRLPPSVYQALKEEFVRQFSGYPTAAGYSLLDTCYNLSGLKEVK 420
           + ++GVLSLLDSGTVITRL PS+Y+A K EF +QFSGY T  G+S+L+TC+NL+G +EV 
Sbjct: 361 SSNEGVLSLLDSGTVITRLSPSIYKAFKAEFEKQFSGYRTTPGFSILNTCFNLTGYEEVN 420

Query: 421 TPNVKLHFEGEGEMSVDVGGLFYYVKSDGSQICLAFASLADEDQIGIIGSYQQKNQRVIY 479
            P VK  FEG  EM VDV G+FY+VKSD SQICLAFASL  EDQ  IIG+YQQKNQRVIY
Sbjct: 421 IPTVKFIFEGNAEMIVDVEGVFYFVKSDASQICLAFASLGYEDQTMIIGNYQQKNQRVIY 480

BLAST of CmoCh08G005720 vs. TAIR 10
Match: AT1G79720.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 458.4 bits (1178), Expect = 1.4e-128
Identity = 243/472 (51.48%), Positives = 321/472 (68.01%), Query Frame = 0

Query: 10  LLLLPLLFVLVDARSSSIDAVSAFHQTLVLNGQKLPLMDMKIPATECIFHNPRVEKETAT 69
           L L PLL V +   S  +  V       V N    P    +  +T C   +    +E+ T
Sbjct: 9   LSLAPLLLVFLFLLSCVVHGVDEKKILSVHNNIWSPKKSYE-ASTSCFSRSLGKGRESTT 68

Query: 70  FEMKERDYCSGNIKDRDKNLQDRLVLDQIHVDSLLSRFKYSTSLFTPHDISDTHLPLTLG 129
            EMK R+ CSG   D  K ++  LVLD I V SL  + K  TS  T   +S+T +PLT G
Sbjct: 69  LEMKHRELCSGKTIDLGKKMRRALVLDNIRVQSLQLKIKAMTSSTTEQSVSETQIPLTSG 128

Query: 130 TSLQTLNYLVTVRIGPQNLTLIVDTGSDLTWVQCLPCHLCYNQQQPLFDPSNSPSFLSLP 189
             L++LNY+VTV +G +N++LIVDTGSDLTWVQC PC  CYNQQ PL+DPS S S+ ++ 
Sbjct: 129 IKLESLNYIVTVELGGKNMSLIVDTGSDLTWVQCQPCRSCYNQQGPLYDPSVSSSYKTVF 188

Query: 190 CNSSNCSAFQPTAGGSGAC--TNG-SSNPCDYEVNYGDGSYSRGDLGFETLNLGNVSVEN 249
           CNSS C         SG C   NG    PC+Y V+YGDGSY+RGDL  E++ LG+  +EN
Sbjct: 189 CNSSTCQDLVAATSNSGPCGGNNGVVKTPCEYVVSYGDGSYTRGDLASESILLGDTKLEN 248

Query: 250 FIFGCGRNNKGLFGGTSGLMGFGRSELSVVSQTSSAYGGVFSYCLPSTETGSSGSLTMGT 309
           F+FGCGRNNKGLFGG+SGLMG GRS +S+VSQT   + GVFSYCLPS E G+SGSL+ G 
Sbjct: 249 FVFGCGRNNKGLFGGSSGLMGLGRSSVSLVSQTLKTFNGVFSYCLPSLEDGASGSLSFG- 308

Query: 310 GDFSNFRNISPISYTKMVPNPQMSNFYTLNLTGITVGGLKLVVPRLAPSKGVLSLLDSGT 369
            D S + N + +SYT +V NPQ+ +FY LNLTG ++GG++L     + S G   L+DSGT
Sbjct: 309 NDSSVYTNSTSVSYTPLVQNPQLRSFYILNLTGASIGGVEL----KSSSFGRGILIDSGT 368

Query: 370 VITRLPPSVYQALKEEFVRQFSGYPTAAGYSLLDTCYNLSGLKEVKTPNVKLHFEGEGEM 429
           VITRLPPS+Y+A+K EF++QFSG+PTA GYS+LDTC+NL+  +++  P +K+ F+G  E+
Sbjct: 369 VITRLPPSIYKAVKIEFLKQFSGFPTAPGYSILDTCFNLTSYEDISIPIIKMIFQGNAEL 428

Query: 430 SVDVGGLFYYVKSDGSQICLAFASLADEDQIGIIGSYQQKNQRVIYNLKESK 479
            VDV G+FY+VK D S +CLA ASL+ E+++GIIG+YQQKNQRVIY+  + +
Sbjct: 429 EVDVTGVFYFVKPDASLVCLALASLSYENEVGIIGNYQQKNQRVIYDTTQER 474

BLAST of CmoCh08G005720 vs. TAIR 10
Match: AT5G10770.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 285.4 bits (729), Expect = 1.6e-76
Identity = 180/430 (41.86%), Positives = 250/430 (58.14%), Query Frame = 0

Query: 53  ATECIFHNPRVEKETATFEMKER-DYCS--GNIKDRDKNLQDRLVLDQIHVDSLLSRFKY 112
           ++ C+  +PR     ++  +  R   CS   N K    +  + L LDQ  V+S+ S  K 
Sbjct: 46  SSSCVL-SPRASTTKSSLHVTHRHGTCSRLNNGKATSPDHVEILRLDQARVNSIHS--KL 105

Query: 113 STSLFTPH--DISDTHLPLTLGTSLQTLNYLVTVRIG-PQN-LTLIVDTGSDLTWVQCLP 172
           S  L T H  +   T LP   G++L + NY+VTV +G P+N L+LI DTGSDLTW QC P
Sbjct: 106 SKKLATDHVSESKSTDLPAKDGSTLGSGNYIVTVGLGTPKNDLSLIFDTGSDLTWTQCQP 165

Query: 173 C-HLCYNQQQPLFDPSNSPSFLSLPCNSSNCSAFQPTAGGSGACTNGSSNPCDYEVNYGD 232
           C   CY+Q++P+F+PS S S+ ++ C+S+ C +     G +G+C   S++ C Y + YGD
Sbjct: 166 CVRTCYDQKEPIFNPSKSTSYYNVSCSSAACGSLSSATGNAGSC---SASNCIYGIQYGD 225

Query: 233 GSYSRGDLGFETLNLGNVSV-ENFIFGCGRNNKGLFGGTSGLMGFGRSELSVVSQTSSAY 292
            S+S G L  E   L N  V +   FGCG NN+GLF G +GL+G GR +LS  SQT++AY
Sbjct: 226 QSFSVGFLAKEKFTLTNSDVFDGVYFGCGENNQGLFTGVAGLLGLGRDKLSFPSQTATAY 285

Query: 293 GGVFSYCLPSTETGSSGSLTMGTGDFSNFRNISPISYTKMVPNPQMSNFYTLNLTGITVG 352
             +FSYCLPS+    +G LT G+   S     +PIS          ++FY LN+  ITVG
Sbjct: 286 NKIFSYCLPSS-ASYTGHLTFGSAGISRSVKFTPISTI-----TDGTSFYGLNIVAITVG 345

Query: 353 GLKLVVPRLAPSKGVLSLLDSGTVITRLPPSVYQALKEEFVRQFSGYPTAAGYSLLDTCY 412
           G KL +P    S    +L+DSGTVITRLPP  Y AL+  F  + S YPT +G S+LDTC+
Sbjct: 346 GQKLPIPSTVFSTPG-ALIDSGTVITRLPPKAYAALRSSFKAKMSKYPTTSGVSILDTCF 405

Query: 413 NLSGLKEVKTPNVKLHFEGEGEMSVDVGGLFYYVKSDGSQICLAFASLADEDQIGIIGSY 472
           +LSG K V  P V   F G   + +   G+FY  K   SQ+CLAFA  +D+    I G+ 
Sbjct: 406 DLSGFKTVTIPKVAFSFSGGAVVELGSKGIFYVFKI--SQVCLAFAGNSDDSNAAIFGNV 460

Query: 473 QQKNQRVIYN 474
           QQ+   V+Y+
Sbjct: 466 QQQTLEVVYD 460

BLAST of CmoCh08G005720 vs. TAIR 10
Match: AT1G79700.1 (Integrase-type DNA-binding superfamily protein )

HSP 1 Score: 250.8 bits (639), Expect = 4.4e-66
Identity = 160/309 (51.78%), Positives = 186/309 (60.19%), Query Frame = 0

Query: 476 ESKSKAL--KRTRKSVRRDAPAQRSSVYRGVTRHRWTGRYEAHLWDKNSWNEGQNKKGRQ 535
           ES S AL  KR RKS  R+AP QRSS YRGVTRHRWTGRYEAHLWDKNSWN+ Q KKGRQ
Sbjct: 26  ESASIALTSKRKRKSPPRNAPLQRSSPYRGVTRHRWTGRYEAHLWDKNSWNDTQTKKGRQ 85

Query: 536 ---GAYDDEEAAAHAYDLAALKYWGAETVINFPRLTYQDELKEMEGQSREEYIRYLRRKS 595
              GAYD+EEAAA AYDLAALKYWG +T++NFP  +Y +++KEMEGQS+EEYI  LRRKS
Sbjct: 86  VYLGAYDEEEAAARAYDLAALKYWGRDTLLNFPLPSYDEDVKEMEGQSKEEYIGSLRRKS 145

Query: 596 SGFSRGVSKYR-------------------ATQEEAARAYDLAAIEHRGLNAVTNFDISR 655
           SGFSRGVSKYR                   ATQEEAA AYD+AAIE+RGLNAVTNFD+SR
Sbjct: 146 SGFSRGVSKYRGVARHHHNGRWEARIGRVFATQEEAAIAYDIAAIEYRGLNAVTNFDVSR 205

Query: 656 YIKCLRPGEQHIPDNNRPSSPNAGDTASEFDPKSFLEITFPSQSSSSDQPTTAPEPHGGL 715
           Y+                 +PNA    ++ D K    I  PS+   S     +P      
Sbjct: 206 YL-----------------NPNAAADKADSDSK---PIRSPSREPESSDDNKSP------ 265

Query: 716 PSSSSATLELLIHSSKFKHILERTSAAETPQTLPESVRPRRCIPDDIQTYFDCSTQDSDD 761
                          K + ++E       P T PE +  RR  PDDIQTYF C  QDS  
Sbjct: 266 ---------------KSEEVIE-------PSTSPEVIPTRRSFPDDIQTYFGC--QDSGK 284

BLAST of CmoCh08G005720 vs. TAIR 10
Match: AT1G16060.1 (ARIA-interacting double AP2 domain protein )

HSP 1 Score: 250.8 bits (639), Expect = 4.4e-66
Identity = 163/312 (52.24%), Positives = 189/312 (60.58%), Query Frame = 0

Query: 483 KRTRKSVRRDAPAQRSSVYRGVTRHRWTGRYEAHLWDKNSWNEGQNKKGRQ---GAYDDE 542
           KR R+S  RDAP QRSSV+RGVTRHRWTGRYEAHLWDKNSWNE Q KKGRQ   GAYD+E
Sbjct: 41  KRKRRSQPRDAPPQRSSVHRGVTRHRWTGRYEAHLWDKNSWNETQTKKGRQVYLGAYDEE 100

Query: 543 EAAAHAYDLAALKYWGAETVINFPRLTYQDELKEMEGQSREEYIRYLRRKSSGFSRGVSK 602
           +AAA AYDLAALKYWG +T++NFP   Y++++KEME QS+EEYI  LRRKSSGFSRGVSK
Sbjct: 101 DAAARAYDLAALKYWGRDTILNFPLCNYEEDIKEMESQSKEEYIGSLRRKSSGFSRGVSK 160

Query: 603 YR-----------------------------ATQEEAARAYDLAAIEHRGLNAVTNFDIS 662
           YR                             ATQEEAA AYD+AAIE+RGLNAVTNFDIS
Sbjct: 161 YRGVAKHHHNGRWEARIGRVFGNKYLYLGTYATQEEAAIAYDIAAIEYRGLNAVTNFDIS 220

Query: 663 RYIKCLRPGEQHIPDNNRPSSPNAGDTASEFDPKSFLEITFPSQSSSSDQPTTAPEPHGG 722
           RY+K   P       NN   SP++ D +    P    +++  SQSSS D      +    
Sbjct: 221 RYLKLPVPENPIDTANNLLESPHS-DLSPFIKPNHESDLS-QSQSSSEDNDDRKTK---- 280

Query: 723 LPSSSSATLELLIHSSKFKHILERTSAAETPQTLPESVRPRRCIPDDIQTYFDCSTQDSD 762
           L  SS    E +I                 P T PE   PRR  P+DIQTYF C  Q+S 
Sbjct: 281 LLKSSPLVAEEVI----------------GPSTPPEIAPPRRSFPEDIQTYFGC--QNSG 328

BLAST of CmoCh08G005720 vs. TAIR 10
Match: AT1G79700.2 (Integrase-type DNA-binding superfamily protein )

HSP 1 Score: 246.9 bits (629), Expect = 6.3e-65
Identity = 160/319 (50.16%), Positives = 186/319 (58.31%), Query Frame = 0

Query: 476 ESKSKAL--KRTRKSVRRDAPAQRSSVYRGVTRHRWTGRYEAHLWDKNSWNEGQNKKGRQ 535
           ES S AL  KR RKS  R+AP QRSS YRGVTRHRWTGRYEAHLWDKNSWN+ Q KKGRQ
Sbjct: 26  ESASIALTSKRKRKSPPRNAPLQRSSPYRGVTRHRWTGRYEAHLWDKNSWNDTQTKKGRQ 85

Query: 536 ---GAYDDEEAAAHAYDLAALKYWGAETVINFPRLTYQDELKEMEGQSREEYIRYLRRKS 595
              GAYD+EEAAA AYDLAALKYWG +T++NFP  +Y +++KEMEGQS+EEYI  LRRKS
Sbjct: 86  VYLGAYDEEEAAARAYDLAALKYWGRDTLLNFPLPSYDEDVKEMEGQSKEEYIGSLRRKS 145

Query: 596 SGFSRGVSKYR-----------------------------ATQEEAARAYDLAAIEHRGL 655
           SGFSRGVSKYR                             ATQEEAA AYD+AAIE+RGL
Sbjct: 146 SGFSRGVSKYRGVARHHHNGRWEARIGRVFGNKYLYLGTYATQEEAAIAYDIAAIEYRGL 205

Query: 656 NAVTNFDISRYIKCLRPGEQHIPDNNRPSSPNAGDTASEFDPKSFLEITFPSQSSSSDQP 715
           NAVTNFD+SRY+                 +PNA    ++ D K    I  PS+   S   
Sbjct: 206 NAVTNFDVSRYL-----------------NPNAAADKADSDSK---PIRSPSREPESSDD 265

Query: 716 TTAPEPHGGLPSSSSATLELLIHSSKFKHILERTSAAETPQTLPESVRPRRCIPDDIQTY 761
             +P                     K + ++E       P T PE +  RR  PDDIQTY
Sbjct: 266 NKSP---------------------KSEEVIE-------PSTSPEVIPTRRSFPDDIQTY 294

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
Q8S9J6 | 2.3e-75 | 41.86 | Aspartyl protease family protein At5g10770 OS=Arabidopsis thaliana OX=3702 GN=At...
Q94AN4 | 6.2e-65 | 52.24 | AP2-like ethylene-responsive transcription factor At1g16060 OS=Arabidopsis thali...
A0JPZ8 | 6.2e-65 | 51.78 | AP2-like ethylene-responsive transcription factor At1g79700 OS=Arabidopsis thali...
Q9LEW3 | 9.2e-61 | 37.06 | Aspartyl protease AED1 OS=Arabidopsis thaliana OX=3702 GN=AED1 PE=2 SV=1
Q9LHE3 | 1.6e-60 | 33.68 | Protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Arabidopsis thaliana OX=3702 GN=ASP...

Match Name | E-value | Identity | Description
A0A6J1H6S6 | 7.1e-274 | 100.00 | aspartyl protease family protein At5g10770-like OS=Cucurbita moschata OX=3662 GN...
A0A6J1KR96 | 2.8e-254 | 92.93 | aspartyl protease family protein At5g10770-like OS=Cucurbita maxima OX=3661 GN=L...
A0A0A0K8J2 | 2.8e-169 | 63.99 | Peptidase A1 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_7G43132...
A0A1S3CDQ0 | 4.8e-169 | 63.22 | aspartyl protease family protein At5g10770 OS=Cucumis melo OX=3656 GN=LOC1034998...
A0A5A7UYY6 | 4.0e-168 | 63.51 | Aspartyl protease family protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27...

Match Name | E-value | Identity | Description
XP_022959733.1 | 1.5e-273 | 100.00 | aspartyl protease family protein At5g10770-like [Cucurbita moschata]
KAG6593355.1 | 2.4e-263 | 96.47 | Aspartyl protease family protein, partial [Cucurbita argyrosperma subsp. sororia...
XP_023004737.1 | 5.8e-254 | 92.93 | aspartyl protease family protein At5g10770-like [Cucurbita maxima]
KAG7025700.1 | 2.5e-225 | 97.30 | Aspartyl protease family protein, partial [Cucurbita argyrosperma subsp. argyros...
XP_004135889.2 | 5.8e-169 | 63.99 | aspartyl protease family protein At5g10770 [Cucumis sativus] >KGN45199.1 hypothe...

Match Name | E-value | Identity | Description
AT1G79720.1 | 1.4e-128 | 51.48 | Eukaryotic aspartyl protease family protein
AT5G10770.1 | 1.6e-76 | 41.86 | Eukaryotic aspartyl protease family protein
AT1G79700.1 | 4.4e-66 | 51.78 | Integrase-type DNA-binding superfamily protein
AT1G16060.1 | 4.4e-66 | 52.24 | ARIA-interacting double AP2 domain protein
AT1G79700.2 | 6.3e-65 | 50.16 | Integrase-type DNA-binding superfamily protein
InterPro
Analysis Name: InterPro Annotations of Cucurbita moschata (Rifu) v1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR001471 | AP2/ERF domain | PRINTS | PR00367 | ETHRSPELEMNT | coord: 501..512; score: 45.3
IPR001471 | AP2/ERF domain | PRINTS | PR00367 | ETHRSPELEMNT | coord: 610..630; score: 49.87
IPR001471 | AP2/ERF domain | SMART | SM00380 | rav1_2 | coord: 500..569; e-value: 2.1E-16; score: 70.5
IPR001471 | AP2/ERF domain | SMART | SM00380 | rav1_2 | coord: 584..634; e-value: 7.5E-4; score: 19.2
IPR001471 | AP2/ERF domain | PFAM | PF00847 | AP2 | coord: 499..555; e-value: 1.2E-8; score: 35.1
IPR001471 | AP2/ERF domain | PROSITE | PS51032 | AP2_ERF | coord: 552..628; score: 8.833962
IPR001471 | AP2/ERF domain | PROSITE | PS51032 | AP2_ERF | coord: 500..563; score: 18.399595
IPR036955 | AP2/ERF domain superfamily | GENE3D | 3.30.730.10 | AP2/ERF domain | coord: 500..564; e-value: 4.1E-21; score: 76.9
IPR036955 | AP2/ERF domain superfamily | GENE3D | 3.30.730.10 | AP2/ERF domain | coord: 581..630; e-value: 2.0E-7; score: 33.0
IPR021109 | Aspartic peptidase domain superfamily | GENE3D | 2.40.70.10 | Acid Proteases | coord: 115..305; e-value: 2.9E-48; score: 166.4
IPR021109 | Aspartic peptidase domain superfamily | GENE3D | 2.40.70.10 | Acid Proteases | coord: 310..485; e-value: 1.0E-36; score: 128.3
IPR021109 | Aspartic peptidase domain superfamily | SUPERFAMILY | 50630 | Acid proteases | coord: 131..477
IPR032861 | Xylanase inhibitor, N-terminal | PFAM | PF14543 | TAXi_N | coord: 137..305; e-value: 1.4E-49; score: 168.7
IPR032799 | Xylanase inhibitor, C-terminal | PFAM | PF14541 | TAXi_C | coord: 332..477; e-value: 3.1E-24; score: 85.6
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 639..659
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 644..658
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 671..695
None | No IPR available | PANTHER | PTHR13683:SF827 | SUBFAMILY NOT NAMED | coord: 12..478
IPR001461 | Aspartic peptidase A1 family | PANTHER | PTHR13683 | ASPARTYL PROTEASES | coord: 12..478
IPR001969 | Aspartic peptidase, active site | PROSITE | PS00141 | ASP_PROTEASE | coord: 150..161
IPR033121 | Peptidase family A1 domain | PROSITE | PS51767 | PEPTIDASE_A1 | coord: 137..483; score: 37.462425
IPR033873 | CND41-like | CDD | cd05472 | cnd41_like | coord: 136..472; e-value: 4.45674E-124; score: 375.841
IPR016177 | DNA-binding domain superfamily | SUPERFAMILY | 54171 | DNA-binding domain | coord: 602..630
IPR016177 | DNA-binding domain superfamily | SUPERFAMILY | 54171 | DNA-binding domain | coord: 499..564
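
The InterPro coordinates above cluster into two stretches of the predicted protein. As a rough illustration, the sketch below merges overlapping hit coordinates into contiguous regions. The grouping of hits into an "aspartic peptidase" set and an "AP2/ERF" set is an assumption made here from the IPR descriptions, and `merge_intervals` is a hypothetical helper, not something provided by InterPro or the database.

```python
def merge_intervals(intervals):
    """Merge overlapping (start, end) pairs (1-based, inclusive) into regions."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1]:
            # Overlaps the previous region: extend it if this hit reaches further.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


# Coordinates copied from the InterPro table; the two group labels are
# an assumption based on the IPR descriptions, not an annotation from the page.
hits = {
    "aspartic peptidase": [
        (115, 305), (310, 485), (131, 477), (137, 305), (332, 477),
        (150, 161), (137, 483), (136, 472), (12, 478),
    ],
    "AP2/ERF": [
        (501, 512), (610, 630), (500, 569), (584, 634), (499, 555),
        (552, 628), (500, 563), (500, 564), (581, 630), (602, 630),
        (499, 564),
    ],
}

regions = {name: merge_intervals(coords) for name, coords in hits.items()}
for name, spans in regions.items():
    print(name, spans)
```

Under these assumptions the peptidase-related hits collapse to a single region, 12..485, and the AP2/ERF-related hits to 499..634, with no overlap between the two.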

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
CmoCh08G005720.1 | CmoCh08G005720.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0006508 | proteolysis
biological_process | GO:0006355 | regulation of transcription, DNA-templated
cellular_component | GO:0005634 | nucleus
molecular_function | GO:0004190 | aspartic-type endopeptidase activity
molecular_function | GO:0003677 | DNA binding
molecular_function | GO:0003700 | DNA-binding transcription factor activity
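
For downstream use, the GO assignments above can be held as a simple mapping from category to terms. The following is a minimal sketch with the rows typed in by hand from the table. The names `GO_ROWS` and `group_by_category` are illustrative and do not come from any GO library.

```python
from collections import defaultdict

# Rows copied from the GO table: (category, accession, term name).
GO_ROWS = [
    ("biological_process", "GO:0006508", "proteolysis"),
    ("biological_process", "GO:0006355", "regulation of transcription, DNA-templated"),
    ("cellular_component", "GO:0005634", "nucleus"),
    ("molecular_function", "GO:0004190", "aspartic-type endopeptidase activity"),
    ("molecular_function", "GO:0003677", "DNA binding"),
    ("molecular_function", "GO:0003700", "DNA-binding transcription factor activity"),
]


def group_by_category(rows):
    """Return {category: [(accession, name), ...]} preserving row order."""
    grouped = defaultdict(list)
    for category, accession, name in rows:
        grouped[category].append((accession, name))
    return dict(grouped)


go_terms = group_by_category(GO_ROWS)
for category, terms in go_terms.items():
    print(category, [accession for accession, _ in terms])
```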