ClCG01G000040 (gene) Watermelon (Charleston Gray) v2.5

Overview
NameClCG01G000040
Typegene
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionEndopeptidase S2P
LocationCG_Chr01: 32189 .. 44744 (-)
RNA-Seq ExpressionClCG01G000040
SyntenyClCG01G000040
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
AAACAACCCATATTTTCAATTTTATTCAATTTTATTATGTTTATACCGTTCGCTTCGAAATATCACGACTGAGGGAAGGAGGGGGACATGGTGGCGGCACAATGTCAGGAAGATGGAGGCGAAGTAAGAGACCCATGGCTCAGGCCGACGCTCACGCTCATATTCCTCCCTTACTTCCGCTTACCACAAGTCGCAAAGGTCTTTCAAATGCCATTTCCTGCTGGTATTGCGATTGTAAGATCACCTCCTTCAACGAACTAATTTTCCAATTTGGACGAAGACATGCCAGGTTTCTCTTTCAATTTCAATCGTCTGTTGTGGTTTTACTGTCCGTGGAGCACTTTATCAGTACTCTTTTTTTTTTTTCTTTCCTGTGTGTTCAGCAGTGTCATCATCTTTTTTCCTATTAGATTGAGCTGATGCCAAATTGTTTGCTTATTTATTTTACATTTGTGTGTAACATTGCAGGGTTCTGAGAACGTGGTTCTCGATTGGGATTGGTTTTTCTCTTGCTGCTCTTGCTGTTGTTGTCACGGTTTCATCCGACGCTTTCTTTTTGTTGTTTATTGTATATTTTTTTAAAAAAATTCTTGTTATTCTAGCTGTTTTCATCTGAGTGCTTTCAAACCTACTGTACATCCGAAATGTCCAAATATGTAGAGATGACTGGGTCATAATAGGAGTACAAAACGGAGAAACTCACAGGAAATGATCACAAGAAGCAACACTTTTCTTATTGAAAAAGCGTTCAAGAGTCAGCCTACATATATAGGTGGTTTCTCAACAGAAAAACAAACGGCAACAGCAGAAGCGAAATAATTAACTCTTCTAACTCAAATGATAAAATTAGTTATATTACAACAAAATGGCAGCTCATCAGTGGGCTGTCTTAATATCCCCCTGCAAGCCAATGTCTGAAGCTTCACACACATGAAGTTCTCCGCAAATGTGGGAAAGCTTGATCTGAAAGTGGCTTGGTAAATATATCAGCTACTTGAGGCTTGATCTGCGGCCGGGAGATGATGAACAATGAGTTTTTGTTGTAAACTAGATCTCGAATGAAATTCATGACCAATTCAATATGTTAAGCCTGAGAATGCAATATAGGATTGGCACTAAGATAAACCTCTATCAAGTTATCATACCATAGTGTGGTCAACTTGGAAAGAGGAATATGAAGTTCTTTGAGCAAAGAAGATAGCCAAATTAACTCTAACACAACATGTTAGGTTTCTAATCTCCCTGCACTCAGTAAAACCCAACAAGGACAGGTAAAAGTAGGGGAATATGATCCCTGTTACGATATCATCCAATTCAAGTAGAAATTCCAAGGCAAGTTCACAATCACAATGAAACGTACAACCACAAAATAGTCCAAAGAGCAACGAGAAAGAAAATAGCAGAAGATGAACAACACAAAGGAACACTTTCGAGAGAGCTGGAGATTCTCTCCTATCTTCAGCCTTTAGCCAAGATCTTCTCTCTCAATCATAGACCATCAAAGCGCTTCTTAGCCTCCTCAATGGTTCCCACTTGTAGCAAATTCCCTTGGTCAACGTTCATATCTCATTTGTCCCTTGTGCAATTACTCTATTACCCCTTTGAGAATACATATATAGGATTTGAGTCTCATAATACCCCTGTTTTGAAACTCATCTTGTCCTTAAGGTGGAAAAGAGGAAACCGTTTCTTCATAAAATCGACTCCCATGTAGCCTCACTTTCAAGTAAGTTCATCCACTTCACTAACCATTCATTAGTTGCCATTTCTCCATTCCATCGAACTCCCAAGATTGATTCTGGCTTCTCTTGCAACTCAAAATTCTTCGGTGATCCGGAGGATGTTGTTGGACTGTGTACTTGCTCCCACCAACTTTTTGTGTTGAGAAATATGGAAGACATTGTGAAACGATGCCTCAGGAGGCAACTTCAATCTATAAGTCATTTCCCCAATTCGCTCAAGGATTTAGTACGGACCATAGAATTTGGGAGATAATTTATCACACCTCTTCTTAAGGACAAGGTTTCAGGAACACTTCATCTCCCACCACTTAAGAAGCACAAACATGGATGCGGATACAAAGCCAAAAATGTCAGAAGTCTTTTTGTGGGCGGTGGCTTTTAGGAGCCTTGTTACTAATTACAGATTACAGAGGAAGGTTGGGCTCTTTCCCCTCTATGCAGCTTGTGTATGAAAGGAGAAGAAAATACCAACCATATCTTCCTCCATTGTCCGTTTTGTTCAATATGGTTGGAAGTGGTTGTTGAATACTTTTGGGATCTTTGTTTATCTCCCTAAACAGATTGATAATTGGCTGATTGAAGTGCTTAGTGGCTGGAAGTTGAAGAGGAAAGCTGGAATTTTATGGGGCATGTAATAAGGACTTTCCTTTGGGCTTGTTGGCTCGAAAGGAATAAATGTCCTTTTTATGATAAGTTCACTGATTCTACTTCCTTTTGTGACAACTGTCAGCTTTTATCATCTTGGTGGTACCACAACTGCAACCAATTCTTTTGTACTTATCCAATTGCGTTAATTCCCAAATTGGAGAAGTTTTTGTAATAATCTCTCTTGAGAGGAGGTTCTCTCTTCCCCAAGCCCCTAAGTTGTACTTCTTTGACGGTTATATCATATGTTCATACCTTTTTTATCAAAAAAAAGAAAAAAAAAGGGAAAAAGAGAAAAAAGAAAAAATAAAAGCTAATAGCCAAACTAACTTTAAAAAAATGACAAGTGACAGAAAGCCCAAAACTAAAAGTTAGACTAAATCAGAGAGGCTGAAAGAAGAGTTGGTCCTTTGATTAGTCAAAGAAGGACGGCTACTCATTTGAAAATGAATGTAATGAATATATAGATGGGTAAAGAGGGAGGAAGAAATCAGGGAGCATGAGAAGCGGCAGAAGGATAGATATTTACACTGTCGTTATCCTAGAGTTCAAGAACATTTTTCAACTCAATCAAGCCCCAAACCATCATAGGTTGACCTAGTGGTAAAAAAGGAGATATAGTTTCAGTAAATGGCTAAGAGGTCATGAGTTCCATCCATGGTGGCCATCTACCTAGGATTTAATATCCTATGAGTTTTCTTGACAATGTTGTAGGGCCAAATGGGTTGTCCCGTGAGATTAGTCAAGGTACGTGTAAGCTGGCCCAGACACTCATGGATATTAAAAAAAAAGAAAAAAAACTCAATCAAGCTCCAAATTGATTCTTACAAATTACAATTCACTCTTGACTCTACTTGTATGAACGATAGTATTTGAAATAGACTTCCTACAGAAACAAACTAACACCCAAGTGACTGTTTTCGGCTCTTGCTGTATTGGTGTTCTTTATCTAAGCTTTTATTTGGTTTCTCTATTCATGGTTTATTTCCTATGTTCTATTCTTTTTTCTTTCTTTCCTAGTTTCCCCTTTAGAGTTTGTATCTTTTGAGCATTAGTTTCTTTTTAGAAAAGTTTCGCTTAATGCTAAAAAAAAAAAAAAAGAAAAAAAGAAAAAAAAAACTAAGACCAACAGCTGACTCTCGTAGCAAGCTAAGAGAGTTATTACTGCAAAACTGTTATTATTACAGTGCTATATACTGTTTGTCTGGGCAAGTAATTTTCTTTTCTTGTATTAGTTCATGTTAGTAACTAGCTTATTTCTTTATTTTCCAGGTTCTCTTTTGGGAACTTGCAACAGCCATGCATATTTTCGGCAATGTGATCCATGGTCTTCCTATTTCCTATTCCTCGTTATTTGGTCTTCCTCCGTTGGTTCATATACAATACCTTTCAATGTTTTGTTCCATTATCATTAAAATTAGATCTCATTCTTCAATTTCTCGTGCCATACCTGCTTCTCATTACATTTTTTCTTCTACACTAGATTTCCAGCTGTAGTTTCTCCCCTGCTGATGCTGGATGCATAATTGTCTCTACCTTAATATCTGTAGCTTTTCATGAATTTGGTCATGCTGCTGCTGCTGCAAGGTATGTTTCTTTCTGCATCATCAATTCTATTCCAAAAAAGTATGATTTTTGGAAAGCTTAATCCAAAAGTATGACTTTTGTCCAAAAAAGAAAGAATTTAGTTATCAGTAACCTAAGTGACTTTTTGTCTCCGTTCTCTCTTGTTGACCCCTGACCCCTGACTTTTCCTTTCCCCAGTTTGTCGGCTCCGACCACCTCAGGCTCTGCCGATTTCCACCGGTTCTGACCACCGTCGTTTAGTCACCACCCTCAAGCGCCTCTTTCCTGATTAAGGAAAACCTAAACCTCTGTTCTCTCTTCCTTGACCCTTCATCTCCGTTCTCTCTCCCCAATCCTAGATTTTTTCTTTCCCCATTTCGTCGGCTCCGACCATCTTAGACTTCGCCGACCACCGTCGTTTAGTCGCCACCCTGGGCCACCTCTTTTAGGTGGTCCTTTTTGCGTCCGTCACCCATGGTTTCTTCTTTGACCAATCCATCCACTGACCTCCTAAAATCCTATTCGCCTAGTGTCTCACCCTACCTTCGTCCGTCTTCCTTTCCTTTGCTATCCATCTTAGGCTTCTTCTCCTTTTGCCCTTTGTTTTTACCTCACCATTTTAGACTCTTCAGTCGTGTCTCCAGTCACTGGTCTGGTATCTGCAGTGTTTCTTCTTCATTTAAGATGTCTTCGAACACTTCAGAGGTGATGAGTTGTTGTATTCGGAGCACTTACTTTTGGATTAAGTGTGAGAGGAACATGTTTTCTATTGAGAATTCAAGTAATAATCAGAAGATTTTCCTTTCAATCTCTCATTTGCGTTGGTTTGAACTATCTTTGGTAGAGTTATTACAAGATCCGGTTCATACTTTCTTCTCCAAAAAAAGATCAGGATGGCTCTGGATTCATTTGTTTGGCTATATTCAGATCGTTCAATGAGCGATTTTTTGAATGTGCTATTTGGTGGCTGCCTACTGGTGGTAGGAAAAATCTACATGTTCCAGCTGGTTTCTTGAAGAAAGGTTGGATGGTTTTTTGGGAAATGATTAGAGATTTTTTGGGCAAAGCGGAATTTTTCTGGTTTTCTTCCTATCGATTTGGTGTTGGTGAGTCTTTAGATCCAGCTCTCTCAAATGATTCAAAGGTGGATAGAAAACTTTAGGTAGTTTCTTAGCCGTTTGACTCTGGTTCCTTTTAGTGGGTTAGAAAAGATAAGGAAGTGTTGACAATGAACTTTAACTATCTATTTGTTGTTTCTCGATTGTTTGCTTGCTATTTTTGGAAGGATATTCGTCTTGGTTTGGAAGACTATTTTCAGTGTAAAGTTCTAATCAACCCTTTCATGGCTGATAAGGCATTAATAAAACTTTATGATAGTTTTGCGGATATAGATTTCGATGGAAAATGGAAAAAATTGGATATGGATTTCAATGGAAAATGGCGAAAAATTGGATCCTTGCATTTGACGGTCTTACAAAGCACATTCTCATCCAAAGTTCATTGTGAGTTATGGTGGTTGGCTTTTGGAAAAACTTACCTTTGCCTTTTTGGGCTTGTCCTACTTTTGAAGCCATTGGAGAAAATCTTGAGGCCTGGTTAGTCTTTCTTCTCAAACTTTGAATTTCTTGGATTGTTCTGAAGCTTGCATTGAATTTAAAAGAAATTTTTGTGGTTTTTTCTCAGCTGAAATTGAAATTAAAGATAAGACTAATGGTAATTTTCACCTTAGATTTGGTGATATTTATCCTTTAGATATTCCTAGTATCCTCCAAGGTGAGTTATTTTTCAACGATTTTTCTAACTCTTTGGATATTGCAAGATTGGTTGAAGTTAAGCGGGATGAAGAAATGTTTGAGGTATTTGATTTTGTTGGAGAGGATGTTCCCATTCAGTTAGTCAAGGAATCTAGGTTTGATTTAGTTCCAAATGTGGCGGATCCGCTTCCTTTAGGGAGCCCTAAATGTCCCAAGGAAACCGCAGTTAGTAATCCATAGTCTGTCGACATAAAGAGCAGTAAAGAGGTTGTGAACATTGAAAGTCTCCATTTTCTTATTTTCCCATTTCTTACAGTTGGTAAAAAAAGAAGCAGTCATTTGCAAAGGAACATTTTGGGTTCAATTGAAATGTTTTGGAGGAGTCCTGTACACGTCTCTTATTAGCTGATACACAACCAGCAGACAATCCACATCTTTTGCCTCATAGTTTAAATTTCAGTCAGTCAACCATTAAAGTTCCGAGATCAAATATTAATTTTGTAAAAGGAGTATATAGTCATCCATCTCCTAAAGACACTCCTTATGGTTATGATTCTGATGGTGATTCAGATGTTAGTTTAAGCAGTGAGGATATTATGGATTCTCTAGCACAAGCTAATTTTGTTGAGGAGACTCTAGTGGTTTGTTTTGCAGATGGTATTGATAATTTGTTTGGAATGGTGAAAATGATCAAGGAAAATTGTCTCCCACTATTCCTGCCAAGTTCTTCGCTTTGAGTGAAAAATGTGGTTTTCATATGGAAGCAATTCCTCCTCAGTCTCCTTCAATTAAAGCATAGTTTGGTGGTTCTTGTCGAGGAATTGAACTATTTGTGTGGATTTCTCATGTTTTAAAATTTTACCTTTAGCTTACTGGAAAAGCTTGGCTTTTGATGCATTTGGGCAACATTTATTTTTTGGAAGGGTTTTTTTTTGGTTTGGGGATGCTCATACAGAGTTCGCTTGGAGTTTTTCTCTTGGCCAGTTTCAGCCGTGGGGTTTTTATAGTATTTTGGTTATGTGGTGCAAATTTTTAATGCTTTTTTTATTTTTTTTTTTATTTTCTTGGAAACTAGTGCTCTCCTCCATCTTGGATGGTGGTATTGGTAGTTTTTGTTTGGCTGACTAGGTTTCTGGGGTTGGCAGCTGACAAGTTAGTCTGGTTTTGAGCTCCTTTTCAAGTTTCCTTTGTTTTTAAGCTTTATGGCATTCAAGGCTCAAGTTTTGTACTTTGCTAGAAGATTGACAGTTTTGGAAATTTTTGTTTATTATTTATCTCCTTTATATCTTTCATTTTGTTCCTAATGCTCCATTTTCTATTTTGTATTTCCTCTTTTCTTTGTCTGTTTAATTGTAATTTGAGTATTAGACTCATGTCATTAATTCAATGAAAAAGTCTTGTTTTCATTTAAAAAAAAAAAATCTGATTTTTGTTCCTTTTCAATTGACCTTTGTTTAACTGTGTTATATTTACTAATTTGTATCAGATGTATATACCTGGAAAATCAGATGATCATTATTTTAGATCTTAGAATGTTGTGCACTAGATTGTTATTTCTCACTCTGGAAACTTTACGTGAATGTCAGTGAGGGCGTGAAATTGGAGTATGTTGCTGTTTTTATTGCACTCCTTTTCCCTGGTGCTTTGGTGGCTTTCAATCATGATGCACTGCAGGATTCATCATGCTTCAGTGCTCTTCGTATCTACTGTGCTGGTATTTGGCACAATACAGTTGTAAGCTCAAATTATGTTATTGGCTGACCATTGATTTTGTTTAATCTCACATTCTAGTAAGGGATCTGGGTCTGCTGACTGAAATTGCTTCAGTTTTTTTCTATCTTAGAAGAGATATATCTAAATGTACGACATCTCTCTAGCTTTCTGCAGCTTCTGGATTGATATTATTCTTTCTGCCGCTGATTTTGTTTCCTCTTTACATACATGGCCAAAGTCCCATGGTACATCAACATTTTATACTCGCATAAAGTTATTTCATAAATGCACATGAAAAGATTTTATTTTTCATTTTCCTCTCTTGACTGAATCCTGTAAACTGTGATAGGTTCTGGACGTTCCTTATACTTCACCATTGTCGAGTTATTTGTCCCATGGTCATGTAATTTTGTCATTGGATGACATGAACGTTCACAGCGTGGATGATTGGATCAATCTATCAACTCAAATAAGTGAATTAACATTTCAAAATGAAACCCCCTCCAGACTTGGTGAAAACAACCGAATGGCCGATGGCAGAAGGGGATATTGTGTTCCAAATTTCATGTTGAAAGAAAGCAATAAAGTTCAGTTCACACATGATCAGTCAGCTTGTTTTGGCGACCTTACTTCTTTTACATCTATTCCTTGTGTTAGTTCTACTGCTTTAATTGATGGTTATACGGAAGATAACAATTCTAACCTCAAAGAAGGAATATATTGCTTGAACGTCAGTGATGTCATTAAGCTTAATAAGTGCACTAGATGGGATAAAGCAGTAATCAGCGATAGCACTAGCTCCTGTATGTGTTCACAGGTAAGCACTTGATCTTTGGTGAACATATTCACTTTTGCAAGTTGTTGGCTTTAGCTTATCACATTTTTTTAAAAACTTCAATTCACAGGATGAAACTTGCTTAGGTCCAGTCCAAATGCCTGGTTTGGTATGGGTTGAGATTACATATTTGAACCCCCATTCCTCAGACTGTTCCTATTCCAGAGAATATCCACTCCCAAGTTCAAATTGTAGTGGAACTTTTATTTTTGTTGGTGATGTGGTCTCAATGGCACACTCTATTCAGCTGACCATGTACCGACCACGATTGGATTTTTGTTTCGCTGCATATCTTCCGGATGTACTGGAAAGGATTTTATTGTGCTTATTTCATACCTCTCTTGCGCTAGCCCTTCTCAACAGTTTGCCGGTAAGTTCTAATACACCATTTTGTAAACTATTTTCATATTTACTTCATGGGGATAATATCTAATAAATTATAAGGTTTAGTCTATTCTTCAAATCATGTGCTTCGCAGAATTTTTGTGAATATATCTGCAGTTCGGAATTTTGGAATTTTGTAAGTATTGTTGATGAAGTAATAATAGCTTCTCCTTTTATCTCTTCTTGGGTGAAGTGCCTTTCAATTTCTACATTCTGTGTCGTGATGAACACAATCATTTCCAATATTGATAGCGACTTGATTGTTGCAATAGGACTCTACTTTTGGAGTTTGAATTTCTAAGACATCCTTTTTCGCTAATTTTTTCCATATCCACTGAGCCATAGTATGAAATTTCGCTTCAGTACTTCTCCTGGACACCAAAGATTGCTTCTTGCTCCTAAACTAAGTGTCTTCGTAGGTCATGTGAAATCCCTCAAGAACAATTGTCTTTCAACTTCAAAATCAGCCTAAATGGCCGTTGTTACAACTTTATATAGGCTGGACAGTGGCTTAATATATTTGGAACTGATCTTGAAATGGTTAATATCATTTTAACCGTGGAATTCACTTACGCATTTTGAGTGATGCAGCCCAGTTTTTTCACTTTTTCTTACTTTTCTTAGTGGTTTGCCATTGGTTCAACACTCAAACTGAGCCAAGCCAAATTGATTACACTCCTATCACACAGCATCAATCTCCTTTGTGAAATGCTTGTCAACTTCAATATGCTATCATGTAGAAATAGATTCTGATAACAATGAAAACCTTGTTATCACAATTGATCTGTATAGGTCTCATTTGAGCACAAAAACTGACAATTGGTCAGTCAGAGAGCACAAAAAGGGCTCAAACTGGCTTGTTGACCAATCAATATTATATATACATGAAAAAAAATTGACTCTGCACACCAGCTAGTTTTAAAACCACTACCTACTTTTAGTAGGTTATCCCAATTTACTTTTTATCATTTTCCCTTTTCTTTCTTCACGTTCTCTCCTTCCAACGTTTCATTTTCCTTTTTTTCTTCTCTTTCTTCATTTTCTCTCCCTCCCAAGTTTCATCTTCATTTTTCTTTCTCTTTCTTCATTTCATCTTCTCTCTTCTTCTGATAATTTTTTTCTCTCTTCACCAGTTTCTACCGAACAACACTGCCATCTAGCTCCATGGTGACATGGCGCAACATTTTTTCTTTCTCCTTCTACATTTTCCTTCTCTCTTTCTCCTATTATCTTATCCCTCTGTTTTTTTTTTCCAACCAAAATTCTACTCACGGGAGCTGAGCAACATCATCCAACTAGTAACGCTCCAACAAAAATCCTCCCTCAAAACTAAATGAAACTGACTGACTGTTCATGTAACGGTCAATTGGTTTCAAAAACCCTCCCAAAATCAACAACAACCGTCAATCAAAACCGCCAAATTGGTGATTGGCGTCGGTTTGAACCCAAAACCAGATGATCACTCCTATAAGTATTCTTTTAATCCGTATACTTTCAGAATTTGTACAAGGGTTTTGAATTATGCTTCAACACTACTTTGACTACTATATTTTATTTTCTTTTACACGCCATAGCTAATTTTTTTTTCAGCAAAGTAACAATAACTATAAGTCTAGATGTAGATTTATTTATTTATATTTTGAAAGTAACAATAACTAGAAGTCTAGAAGTTGATTTTACTTCCAGCCCAATCTATTTCAATATACACTTCAACTTGAAGATGGTCGTGTTTAAAAAATAATATGCCTTTACTGAAGTTCCTTTATAAAATATTCAAGAGTTTATACACAATTTCGAGATGAGTGGTTCCAGGCGAGTGCATATAATTAACCAACAACAAATGTTCTATTTAGACATTTGTGAGATAAATAGATCAGTCTACCCAAATCTTTGGTATCTGTTTTTGACATTTACTTTAACTTTTGCAAATAGTAACTTTAGATTTGGTTTAATAGAAGTTTCTACTGTCTGTCAATTGAGTAAACATGCCTTTCGAAGTAAATCAATAACATATTTTCTTTTATCAACTAAAATACTCTTTTTAGATCTTATGAGCTCCATCCCTACAAAAGTACTTTAAGGCTACTGAGTTTTTTATTTGAAACTGACTTACAATTTTGTTGTCAAGATCGACAAAACCTTCATCGTTACTACATATAAGAATAATATAAAATTTAAAAATTAAAAATTAAGATAGTAGATTTTTGTGTTTCTAGAGTATTTATAGAAAATGGAATGGCCTATTTGACGTTGAAAGTATTCATAGCTATAGTGACTGTTTTTTCAAAGCATTTGAATCAAGTTATTAAACTTTCTTTAAGATTATTACTCATCGTGTATTTTTATAGGTATATTATTTGGATGGGGAATCTATTCTGGAGATAATTATTTTCCAACTCACCTCAATGAGCCCGTGGAACAAGGAAAAAGTTCTTCGATCTTTTCTAATGGGAGGGACACTCATCTCAATCTTTTTACTGCTTAGGATTTTCTTCCATTTATTTGTCAGCTGATGGTAATCTGTAACATTGATGTTTTTGTTTATCTTTGAGCAAGGTAGGTTGCTACTAATTTCTGACTGCATTGGTTGGTAAAGCCGTTTTGTTATTTGTATATATAACTTAGATCTGCTTCTACAGTAAAGTTCTTCCTCATCCAACAGCACACTGCTCTTTAAAGCCGTCAAAAGTTCCGGACCCCAATCCCTCCCCGAGGGACCCCCCCCATAAGAGCCCAAACCTCAGCCTCTTTCTCTTGATGGAGTTTTTGAGAGAGCCTTTAAGCGTTTACTTGTTCCAGCAAGAAAGCCTCTCTAAACAAAACTTCCTATCCTAGGAGAACACAGCACCTCTTCTAGAGCCACCCCAACACTCACTAGCAATCCTTTTGCAATCCTCAAAACAAGCCTAGCTTTCTTTAAACCTAAGCGATGTAGCAAACTTTCTTTTATAATTGAACTCCCGTAAAGGACCCAAATCAGCAACTATAGGCCGATGATTAGAGCAATGAAAATTCAGATGACTAATCTTCAAAAATTGGAATTTAGATACCATCGAGCAAGCGTTTCTTTGATGAGCAGACCATTTTGATATTTTCTTTTGCAGCTGAAAATATTTTCGCCGCATCCTGAATGGATAGTATTGTTGATATCAATTGGCTTGCAAAAGTGGTCCTTTTGGCTCCCCCTTCTCATCTGCCAAATCCAGGGAAGATCCATTGCCTCCTTAAGCCTGACAAGTAATTTCCAAGACTTAACCCTCTTCCCCTTTGTTGGATTTCTGTAAAGATCAGTAAATCTCCACCATCGAGAATCATCCCTCACTGTCGCATCAATGTGCCCCACTGAAAAGGATGAGACTAAGAGGTTGATATCTAGTTCCAAAGAAGAATCAGGCCCCCGCTTTTACCTCTATTCTGGACGCTGAAACAACAATCAAACCTCAACTTAATCTTAACCTTCGCATTAATCTCGTAACCACTACATCGTTTCAGACAGGAACACAATCAGTGGTTGTACACTTCTAACCTTGTGTTGAAGTGTCTGGAGCGTTTGCCAAACCTGAACATCCCAACATAAGAGGTTCATTGCATCTGATGAGGCTGAATACCAGCCTCTGCCGATATCCCTCCGTCAGATTCAAAATAGTGAAGTGAATTCGGTTTGTCTTCCCCATGCCCCCTTCACCAATGATTTCCAAATCATGTTTCAATATCATGATCCTTTATCCCACAAATTCCTTCTTCCTTTCCTCGTCCTTGCGTCTAGCAAGCTTTTTCCATCTCTTCAAATTTGCCCCCTTCGAGGCCTTTAAATCTGCATAAGGGATATCGACTAGCCCACTTTTACAGTATGGACTTAATTTAGGAAGCAACCTACCTTCTCAGAGCCATAAGGTTAAAAGAACCTCCCAGACTCT

mRNA sequence

AAACAACCCATATTTTCAATTTTATTCAATTTTATTATGTTTATACCGTTCGCTTCGAAATATCACGACTGAGGGAAGGAGGGGGACATGGTGGCGGCACAATGTCAGGAAGATGGAGGCGAAGTAAGAGACCCATGGCTCAGGCCGACGCTCACGCTCATATTCCTCCCTTACTTCCGCTTACCACAAGTCGCAAAGGTCTTTCAAATGCCATTTCCTGCTGGTATTGCGATTGTAAGATCACCTCCTTCAACGAACTAATTTTCCAATTTGGACGAAGACATGCCAGGGTTCTGAGAACGTGGTTCTCGATTGGGATTGGTTTTTCTCTTGCTGCTCTTGCTGTTGTTGTCACGGTTCTCTTTTGGGAACTTGCAACAGCCATGCATATTTTCGGCAATGTGATCCATGGTCTTCCTATTTCCTATTCCTCGTTATTTGGTCTTCCTCCGTTGATTTCCAGCTGTAGTTTCTCCCCTGCTGATGCTGGATGCATAATTGTCTCTACCTTAATATCTGTAGCTTTTCATGAATTTGGTCATGCTGCTGCTGCTGCAAGTGAGGGCGTGAAATTGGAGTATGTTGCTGTTTTTATTGCACTCCTTTTCCCTGGTGCTTTGGTGGCTTTCAATCATGATGCACTGCAGGATTCATCATGCTTCAGTGCTCTTCGTATCTACTGTGCTGGTATTTGGCACAATACAGTTCTTTCTGCAGCTTCTGGATTGATATTATTCTTTCTGCCGCTGATTTTGTTTCCTCTTTACATACATGGCCAAAGTCCCATGGTTCTGGACGTTCCTTATACTTCACCATTGTCGAGTTATTTGTCCCATGGTCATGTAATTTTGTCATTGGATGACATGAACGTTCACAGCGTGGATGATTGGATCAATCTATCAACTCAAATAAGTGAATTAACATTTCAAAATGAAACCCCCTCCAGACTTGGTGAAAACAACCGAATGGCCGATGGCAGAAGGGGATATTGTGTTCCAAATTTCATGTTGAAAGAAAGCAATAAAGTTCAGTTCACACATGATCAGTCAGCTTGTTTTGGCGACCTTACTTCTTTTACATCTATTCCTTGTGTTAGTTCTACTGCTTTAATTGATGGTTATACGGAAGATAACAATTCTAACCTCAAAGAAGGAATATATTGCTTGAACGTCAGTGATGTCATTAAGCTTAATAAGTGCACTAGATGGGATAAAGCAGTAATCAGCGATAGCACTAGCTCCTGTATGTGTTCACAGGATGAAACTTGCTTAGGTCCAGTCCAAATGCCTGGTTTGGTATGGGTTGAGATTACATATTTGAACCCCCATTCCTCAGACTGTTCCTATTCCAGAGAATATCCACTCCCAAGTTCAAATTGTAGTGGAACTTTTATTTTTGTTGGTGATGTGGTCTCAATGGCACACTCTATTCAGCTGACCATGTACCGACCACGATTGGATTTTTGTTTCGCTGCATATCTTCCGGATGTACTGGAAAGGATTTTATTGTGCTTATTTCATACCTCTCTTGCGCTAGCCCTTCTCAACAGTTTGCCGGTATATTATTTGGATGGGGAATCTATTCTGGAGATAATTATTTTCCAACTCACCTCAATGAGCCCGTGGAACAAGGAAAAAGTTCTTCGATCTTTTCTAATGGGAGGGACACTCATCTCAATCTTTTTACTGCTTAGGATTTTCTTCCATTTATTTGTCAGCTGATGGTAATCTGTAACATTGATGTTTTTGTTTATCTTTGAGCAAGGTAGGTTGCTACTAATTTCTGACTGCATTGGTTGGTAAAGCCGTTTTGTTATTTGTATATATAACTTAGATCTGCTTCTACAGTAAAGTTCTTCCTCATCCAACAGCACACTGCTCTTTAAAGCCGTCAAAAGTTCCGGACCCCAATCCCTCCCCGAGGGACCCCCCCCATAAGAGCCCAAACCTCAGCCTCTTTCTCTTGATGGAGTTTTTGAGAGAGCCTTTAAGCGTTTACTTGTTCCAGCAAGAAAGCCTCTCTAAACAAAACTTCCTATCCTAGGAGAACACAGCACCTCTTCTAGAGCCACCCCAACACTCACTAGCAATCCTTTTGCAATCCTCAAAACAAGCCTAGCTTTCTTTAAACCTAAGCGATGTAGCAAACTTTCTTTTATAATTGAACTCCCGTAAAGGACCCAAATCAGCAACTATAGGCCGATGATTAGAGCAATGAAAATTCAGATGACTAATCTTCAAAAATTGGAATTTAGATACCATCGAGCAAGCGTTTCTTTGATGAGCAGACCATTTTGATATTTTCTTTTGCAGCTGAAAATATTTTCGCCGCATCCTGAATGGATAGTATTGTTGATATCAATTGGCTTGCAAAAGTGGTCCTTTTGGCTCCCCCTTCTCATCTGCCAAATCCAGGGAAGATCCATTGCCTCCTTAAGCCTGACAAGTAATTTCCAAGACTTAACCCTCTTCCCCTTTGTTGGATTTCTGTAAAGATCAGTAAATCTCCACCATCGAGAATCATCCCTCACTGTCGCATCAATGTGCCCCACTGAAAAGGATGAGACTAAGAGGTTGATATCTAGTTCCAAAGAAGAATCAGGCCCCCGCTTTTACCTCTATTCTGGACGCTGAAACAACAATCAAACCTCAACTTAATCTTAACCTTCGCATTAATCTCGTAACCACTACATCGTTTCAGACAGGAACACAATCAGTGGTTGTACACTTCTAACCTTGTGTTGAAGTGTCTGGAGCGTTTGCCAAACCTGAACATCCCAACATAAGAGGTTCATTGCATCTGATGAGGCTGAATACCAGCCTCTGCCGATATCCCTCCGTCAGATTCAAAATAGTGAAGTGAATTCGGTTTGTCTTCCCCATGCCCCCTTCACCAATGATTTCCAAATCATGTTTCAATATCATGATCCTTTATCCCACAAATTCCTTCTTCCTTTCCTCGTCCTTGCGTCTAGCAAGCTTTTTCCATCTCTTCAAATTTGCCCCCTTCGAGGCCTTTAAATCTGCATAAGGGATATCGACTAGCCCACTTTTACAGTATGGACTTAATTTAGGAAGCAACCTACCTTCTCAGAGCCATAAGGTTAAAAGAACCTCCCAGACTCT

Coding sequence (CDS)

ATGTCAGGAAGATGGAGGCGAAGTAAGAGACCCATGGCTCAGGCCGACGCTCACGCTCATATTCCTCCCTTACTTCCGCTTACCACAAGTCGCAAAGGTCTTTCAAATGCCATTTCCTGCTGGTATTGCGATTGTAAGATCACCTCCTTCAACGAACTAATTTTCCAATTTGGACGAAGACATGCCAGGGTTCTGAGAACGTGGTTCTCGATTGGGATTGGTTTTTCTCTTGCTGCTCTTGCTGTTGTTGTCACGGTTCTCTTTTGGGAACTTGCAACAGCCATGCATATTTTCGGCAATGTGATCCATGGTCTTCCTATTTCCTATTCCTCGTTATTTGGTCTTCCTCCGTTGATTTCCAGCTGTAGTTTCTCCCCTGCTGATGCTGGATGCATAATTGTCTCTACCTTAATATCTGTAGCTTTTCATGAATTTGGTCATGCTGCTGCTGCTGCAAGTGAGGGCGTGAAATTGGAGTATGTTGCTGTTTTTATTGCACTCCTTTTCCCTGGTGCTTTGGTGGCTTTCAATCATGATGCACTGCAGGATTCATCATGCTTCAGTGCTCTTCGTATCTACTGTGCTGGTATTTGGCACAATACAGTTCTTTCTGCAGCTTCTGGATTGATATTATTCTTTCTGCCGCTGATTTTGTTTCCTCTTTACATACATGGCCAAAGTCCCATGGTTCTGGACGTTCCTTATACTTCACCATTGTCGAGTTATTTGTCCCATGGTCATGTAATTTTGTCATTGGATGACATGAACGTTCACAGCGTGGATGATTGGATCAATCTATCAACTCAAATAAGTGAATTAACATTTCAAAATGAAACCCCCTCCAGACTTGGTGAAAACAACCGAATGGCCGATGGCAGAAGGGGATATTGTGTTCCAAATTTCATGTTGAAAGAAAGCAATAAAGTTCAGTTCACACATGATCAGTCAGCTTGTTTTGGCGACCTTACTTCTTTTACATCTATTCCTTGTGTTAGTTCTACTGCTTTAATTGATGGTTATACGGAAGATAACAATTCTAACCTCAAAGAAGGAATATATTGCTTGAACGTCAGTGATGTCATTAAGCTTAATAAGTGCACTAGATGGGATAAAGCAGTAATCAGCGATAGCACTAGCTCCTGTATGTGTTCACAGGATGAAACTTGCTTAGGTCCAGTCCAAATGCCTGGTTTGGTATGGGTTGAGATTACATATTTGAACCCCCATTCCTCAGACTGTTCCTATTCCAGAGAATATCCACTCCCAAGTTCAAATTGTAGTGGAACTTTTATTTTTGTTGGTGATGTGGTCTCAATGGCACACTCTATTCAGCTGACCATGTACCGACCACGATTGGATTTTTGTTTCGCTGCATATCTTCCGGATGTACTGGAAAGGATTTTATTGTGCTTATTTCATACCTCTCTTGCGCTAGCCCTTCTCAACAGTTTGCCGGTATATTATTTGGATGGGGAATCTATTCTGGAGATAATTATTTTCCAACTCACCTCAATGAGCCCGTGGAACAAGGAAAAAGTTCTTCGATCTTTTCTAATGGGAGGGACACTCATCTCAATCTTTTTACTGCTTAGGATTTTCTTCCATTTATTTGTCAGCTGA

Protein sequence

MSGRWRRSKRPMAQADAHAHIPPLLPLTTSRKGLSNAISCWYCDCKITSFNELIFQFGRRHARVLRTWFSIGIGFSLAALAVVVTVLFWELATAMHIFGNVIHGLPISYSSLFGLPPLISSCSFSPADAGCIIVSTLISVAFHEFGHAAAAASEGVKLEYVAVFIALLFPGALVAFNHDALQDSSCFSALRIYCAGIWHNTVLSAASGLILFFLPLILFPLYIHGQSPMVLDVPYTSPLSSYLSHGHVILSLDDMNVHSVDDWINLSTQISELTFQNETPSRLGENNRMADGRRGYCVPNFMLKESNKVQFTHDQSACFGDLTSFTSIPCVSSTALIDGYTEDNNSNLKEGIYCLNVSDVIKLNKCTRWDKAVISDSTSSCMCSQDETCLGPVQMPGLVWVEITYLNPHSSDCSYSREYPLPSSNCSGTFIFVGDVVSMAHSIQLTMYRPRLDFCFAAYLPDVLERILLCLFHTSLALALLNSLPVYYLDGESILEIIIFQLTSMSPWNKEKVLRSFLMGGTLISIFLLLRIFFHLFVS
Homology
BLAST of ClCG01G000040 vs. NCBI nr
Match: XP_004145951.1 (membrane-bound transcription factor site-2 protease homolog isoform X1 [Cucumis sativus] >KGN49826.1 hypothetical protein Csa_000136 [Cucumis sativus])

HSP 1 Score: 949.9 bits (2454), Expect = 9.5e-273
Identity = 483/541 (89.28%), Postives = 501/541 (92.61%), Query Frame = 0

Query: 1   MSGRWRRSKRPMAQADAHAHIPPLLPLTTSRKGLSNAISCWYCDCKITSFNELIFQFGRR 60
           M GRWRRSKR MAQADAHAHIPPLLPLTTSRKGLSN+ISCWYCD KITSFNELIFQFGRR
Sbjct: 1   MPGRWRRSKRLMAQADAHAHIPPLLPLTTSRKGLSNSISCWYCDYKITSFNELIFQFGRR 60

Query: 61  HARVLRTWFSIGIGFSLAALAVVVTVLFWELATAMHIFG--NVIHGLPISYSSLFGLPPL 120
           HARVLRTWFSIGIGFSLAALAVV TVLF EL   MHIFG  NVI GLP+S SSLFGLP L
Sbjct: 61  HARVLRTWFSIGIGFSLAALAVVATVLFRELTIVMHIFGKSNVIRGLPVSCSSLFGLPSL 120

Query: 121 ISSCSFSPADAGCIIVSTLISVAFHEFGHAAAAASEGVKLEYVAVFIALLFPGALVAFNH 180
           ISSCS  PA AG II+STLISVAFHEFGHAAAAASEGVKLEY+AVFIALLFPGALVAFN+
Sbjct: 121 ISSCSLFPAGAGYIIISTLISVAFHEFGHAAAAASEGVKLEYIAVFIALLFPGALVAFNY 180

Query: 181 DALQDSSCFSALRIYCAGIWHNTVLSAASGLILFFLPLILFPLYIHGQSPMVLDVPYTSP 240
           DALQDSSCF+ALRIYCAGIWHNT LSAASGLILFFLPLILFPLYIHGQSP VLDVPYTSP
Sbjct: 181 DALQDSSCFNALRIYCAGIWHNTALSAASGLILFFLPLILFPLYIHGQSPTVLDVPYTSP 240

Query: 241 LSSYLSHGHVILSLDDMNVHSVDDWINLSTQISELTFQNETPSRLGENNRMADGRRGYCV 300
           LSSYLSHGHVILSLD M+VHSVDDWINLS QIS+LTFQNET SRL ENN+MA+GRRGYC 
Sbjct: 241 LSSYLSHGHVILSLDGMHVHSVDDWINLSAQISDLTFQNETHSRLVENNQMANGRRGYCF 300

Query: 301 PNFMLKESNKVQFTHDQSACFGDLTSFTSIPCVSSTALIDGYTEDNNSNLKEGIYCLNVS 360
           PNFMLKESNKVQFTHDQS CFGD TSFTSIPCVSS  LIDGYTEDN SN KEGIYCLNV+
Sbjct: 301 PNFMLKESNKVQFTHDQSTCFGDFTSFTSIPCVSSAGLIDGYTEDNYSNRKEGIYCLNVN 360

Query: 361 DVIKLNKCTRWDKAVISDSTSSCMCSQDETCLGPVQMPGLVWVEITYLNPHSSDCSYSRE 420
           DV+KLNKC+ WDKA I+D+TSSCMCSQDETCL PVQMPGLVWVEITYLNP+SSDCSYSRE
Sbjct: 361 DVMKLNKCSSWDKAAINDNTSSCMCSQDETCLSPVQMPGLVWVEITYLNPYSSDCSYSRE 420

Query: 421 YPLPSSNCSGTFIFVGDVVSMAHSIQLTMYRPRLDFCFAAYLPDVLERILLCLFHTSLAL 480
           YPLPSSNCSGTFIFVGDVVSMA SIQLTMYRPRLDF FA YLPDVLE+IL CLFHTSLAL
Sbjct: 421 YPLPSSNCSGTFIFVGDVVSMARSIQLTMYRPRLDFHFAIYLPDVLEKILSCLFHTSLAL 480

Query: 481 ALLNSLPVYYLDGESILEIIIFQLTSMSPWNKEKVLRSFLMGGTLISIFLLLRIFFHLFV 540
           ALLNSLPVY LDGESILEIIIFQLTS+SP NKEKVLRSFLMGGTLISIFLLLRIFFHL V
Sbjct: 481 ALLNSLPVYCLDGESILEIIIFQLTSLSPRNKEKVLRSFLMGGTLISIFLLLRIFFHLLV 540

BLAST of ClCG01G000040 vs. NCBI nr
Match: XP_008437652.1 (PREDICTED: membrane-bound transcription factor site-2 protease homolog isoform X1 [Cucumis melo] >XP_008437653.1 PREDICTED: membrane-bound transcription factor site-2 protease homolog isoform X1 [Cucumis melo])

HSP 1 Score: 944.9 bits (2441), Expect = 3.1e-271
Identity = 479/541 (88.54%), Postives = 497/541 (91.87%), Query Frame = 0

Query: 1   MSGRWRRSKRPMAQADAHAHIPPLLPLTTSRKGLSNAISCWYCDCKITSFNELIFQFGRR 60
           M GRWRRSKR MAQ D HAHIP LLPLTTSRKG+SNAISCWYCD KITSFNELIFQFGRR
Sbjct: 1   MPGRWRRSKRSMAQVDVHAHIPSLLPLTTSRKGISNAISCWYCDYKITSFNELIFQFGRR 60

Query: 61  HARVLRTWFSIGIGFSLAALAVVVTVLFWELATAMHIF--GNVIHGLPISYSSLFGLPPL 120
           HAR LRTWFSIGIGFSLAALAVV TVLF E    MHIF   NVI GLPIS SSLFGLP L
Sbjct: 61  HARALRTWFSIGIGFSLAALAVVATVLFRETKIIMHIFVNSNVIRGLPISCSSLFGLPSL 120

Query: 121 ISSCSFSPADAGCIIVSTLISVAFHEFGHAAAAASEGVKLEYVAVFIALLFPGALVAFNH 180
           ISSCSF PA AG II+STLISVAFHEFGHAAAAASEGVKLEYVAVFIALLFPGALVAFNH
Sbjct: 121 ISSCSFFPAGAGYIIISTLISVAFHEFGHAAAAASEGVKLEYVAVFIALLFPGALVAFNH 180

Query: 181 DALQDSSCFSALRIYCAGIWHNTVLSAASGLILFFLPLILFPLYIHGQSPMVLDVPYTSP 240
           DALQDSSCFSALRIYCAGIWHNT LSAASGLILFFLPLILFPLYIHGQSPMVLDVPYTSP
Sbjct: 181 DALQDSSCFSALRIYCAGIWHNTALSAASGLILFFLPLILFPLYIHGQSPMVLDVPYTSP 240

Query: 241 LSSYLSHGHVILSLDDMNVHSVDDWINLSTQISELTFQNETPSRLGENNRMADGRRGYCV 300
           LSSYLSHGHVILSLD M+VHSVDDWINLS +ISELTFQNET SRLGENN MA+GRRGYC 
Sbjct: 241 LSSYLSHGHVILSLDGMHVHSVDDWINLSAEISELTFQNETHSRLGENNLMANGRRGYCF 300

Query: 301 PNFMLKESNKVQFTHDQSACFGDLTSFTSIPCVSSTALIDGYTEDNNSNLKEGIYCLNVS 360
           PNFMLKESNKVQFTHDQ+ CFGD TSFTSIPCVSST LIDGYTEDNNSN KEGIYCLNV+
Sbjct: 301 PNFMLKESNKVQFTHDQATCFGDFTSFTSIPCVSSTGLIDGYTEDNNSNRKEGIYCLNVN 360

Query: 361 DVIKLNKCTRWDKAVISDSTSSCMCSQDETCLGPVQMPGLVWVEITYLNPHSSDCSYSRE 420
           DV+KLNKC+ WDKAVI+D+TSSCMCS+DETCL PVQMPGLVWVEITYLNP+SSDC YSRE
Sbjct: 361 DVMKLNKCSSWDKAVINDNTSSCMCSRDETCLSPVQMPGLVWVEITYLNPYSSDCFYSRE 420

Query: 421 YPLPSSNCSGTFIFVGDVVSMAHSIQLTMYRPRLDFCFAAYLPDVLERILLCLFHTSLAL 480
           YPLPSSNCSGTFIFVGDVVSMA SIQLTMYRPR DF FA YLPDVLE+IL CLFHTSLAL
Sbjct: 421 YPLPSSNCSGTFIFVGDVVSMARSIQLTMYRPRFDFHFATYLPDVLEKILSCLFHTSLAL 480

Query: 481 ALLNSLPVYYLDGESILEIIIFQLTSMSPWNKEKVLRSFLMGGTLISIFLLLRIFFHLFV 540
           ALLNSLPVY LDGESILEI+IFQLTS+SP NKEKVLRSFL+ GTLISIFLLLRIFFHL +
Sbjct: 481 ALLNSLPVYCLDGESILEILIFQLTSLSPRNKEKVLRSFLIAGTLISIFLLLRIFFHLLI 540

BLAST of ClCG01G000040 vs. NCBI nr
Match: XP_011654589.1 (membrane-bound transcription factor site-2 protease homolog isoform X2 [Cucumis sativus])

HSP 1 Score: 937.6 bits (2422), Expect = 4.9e-269
Identity = 480/541 (88.72%), Postives = 498/541 (92.05%), Query Frame = 0

Query: 1   MSGRWRRSKRPMAQADAHAHIPPLLPLTTSRKGLSNAISCWYCDCKITSFNELIFQFGRR 60
           M GRWRRSKR MAQADAHAHIPPLLPLTTSRKGLSN+ISCWYCD KITSFNELIFQFGR 
Sbjct: 1   MPGRWRRSKRLMAQADAHAHIPPLLPLTTSRKGLSNSISCWYCDYKITSFNELIFQFGR- 60

Query: 61  HARVLRTWFSIGIGFSLAALAVVVTVLFWELATAMHIFG--NVIHGLPISYSSLFGLPPL 120
             RVLRTWFSIGIGFSLAALAVV TVLF EL   MHIFG  NVI GLP+S SSLFGLP L
Sbjct: 61  --RVLRTWFSIGIGFSLAALAVVATVLFRELTIVMHIFGKSNVIRGLPVSCSSLFGLPSL 120

Query: 121 ISSCSFSPADAGCIIVSTLISVAFHEFGHAAAAASEGVKLEYVAVFIALLFPGALVAFNH 180
           ISSCS  PA AG II+STLISVAFHEFGHAAAAASEGVKLEY+AVFIALLFPGALVAFN+
Sbjct: 121 ISSCSLFPAGAGYIIISTLISVAFHEFGHAAAAASEGVKLEYIAVFIALLFPGALVAFNY 180

Query: 181 DALQDSSCFSALRIYCAGIWHNTVLSAASGLILFFLPLILFPLYIHGQSPMVLDVPYTSP 240
           DALQDSSCF+ALRIYCAGIWHNT LSAASGLILFFLPLILFPLYIHGQSP VLDVPYTSP
Sbjct: 181 DALQDSSCFNALRIYCAGIWHNTALSAASGLILFFLPLILFPLYIHGQSPTVLDVPYTSP 240

Query: 241 LSSYLSHGHVILSLDDMNVHSVDDWINLSTQISELTFQNETPSRLGENNRMADGRRGYCV 300
           LSSYLSHGHVILSLD M+VHSVDDWINLS QIS+LTFQNET SRL ENN+MA+GRRGYC 
Sbjct: 241 LSSYLSHGHVILSLDGMHVHSVDDWINLSAQISDLTFQNETHSRLVENNQMANGRRGYCF 300

Query: 301 PNFMLKESNKVQFTHDQSACFGDLTSFTSIPCVSSTALIDGYTEDNNSNLKEGIYCLNVS 360
           PNFMLKESNKVQFTHDQS CFGD TSFTSIPCVSS  LIDGYTEDN SN KEGIYCLNV+
Sbjct: 301 PNFMLKESNKVQFTHDQSTCFGDFTSFTSIPCVSSAGLIDGYTEDNYSNRKEGIYCLNVN 360

Query: 361 DVIKLNKCTRWDKAVISDSTSSCMCSQDETCLGPVQMPGLVWVEITYLNPHSSDCSYSRE 420
           DV+KLNKC+ WDKA I+D+TSSCMCSQDETCL PVQMPGLVWVEITYLNP+SSDCSYSRE
Sbjct: 361 DVMKLNKCSSWDKAAINDNTSSCMCSQDETCLSPVQMPGLVWVEITYLNPYSSDCSYSRE 420

Query: 421 YPLPSSNCSGTFIFVGDVVSMAHSIQLTMYRPRLDFCFAAYLPDVLERILLCLFHTSLAL 480
           YPLPSSNCSGTFIFVGDVVSMA SIQLTMYRPRLDF FA YLPDVLE+IL CLFHTSLAL
Sbjct: 421 YPLPSSNCSGTFIFVGDVVSMARSIQLTMYRPRLDFHFAIYLPDVLEKILSCLFHTSLAL 480

Query: 481 ALLNSLPVYYLDGESILEIIIFQLTSMSPWNKEKVLRSFLMGGTLISIFLLLRIFFHLFV 540
           ALLNSLPVY LDGESILEIIIFQLTS+SP NKEKVLRSFLMGGTLISIFLLLRIFFHL V
Sbjct: 481 ALLNSLPVYCLDGESILEIIIFQLTSLSPRNKEKVLRSFLMGGTLISIFLLLRIFFHLLV 538

BLAST of ClCG01G000040 vs. NCBI nr
Match: XP_008437654.1 (PREDICTED: membrane-bound transcription factor site-2 protease homolog isoform X2 [Cucumis melo])

HSP 1 Score: 932.6 bits (2409), Expect = 1.6e-267
Identity = 476/541 (87.99%), Postives = 494/541 (91.31%), Query Frame = 0

Query: 1   MSGRWRRSKRPMAQADAHAHIPPLLPLTTSRKGLSNAISCWYCDCKITSFNELIFQFGRR 60
           M GRWRRSKR MAQ D HAHIP LLPLTTSRKG+SNAISCWYCD KITSFNELIFQFGR 
Sbjct: 1   MPGRWRRSKRSMAQVDVHAHIPSLLPLTTSRKGISNAISCWYCDYKITSFNELIFQFGR- 60

Query: 61  HARVLRTWFSIGIGFSLAALAVVVTVLFWELATAMHIF--GNVIHGLPISYSSLFGLPPL 120
             R LRTWFSIGIGFSLAALAVV TVLF E    MHIF   NVI GLPIS SSLFGLP L
Sbjct: 61  --RALRTWFSIGIGFSLAALAVVATVLFRETKIIMHIFVNSNVIRGLPISCSSLFGLPSL 120

Query: 121 ISSCSFSPADAGCIIVSTLISVAFHEFGHAAAAASEGVKLEYVAVFIALLFPGALVAFNH 180
           ISSCSF PA AG II+STLISVAFHEFGHAAAAASEGVKLEYVAVFIALLFPGALVAFNH
Sbjct: 121 ISSCSFFPAGAGYIIISTLISVAFHEFGHAAAAASEGVKLEYVAVFIALLFPGALVAFNH 180

Query: 181 DALQDSSCFSALRIYCAGIWHNTVLSAASGLILFFLPLILFPLYIHGQSPMVLDVPYTSP 240
           DALQDSSCFSALRIYCAGIWHNT LSAASGLILFFLPLILFPLYIHGQSPMVLDVPYTSP
Sbjct: 181 DALQDSSCFSALRIYCAGIWHNTALSAASGLILFFLPLILFPLYIHGQSPMVLDVPYTSP 240

Query: 241 LSSYLSHGHVILSLDDMNVHSVDDWINLSTQISELTFQNETPSRLGENNRMADGRRGYCV 300
           LSSYLSHGHVILSLD M+VHSVDDWINLS +ISELTFQNET SRLGENN MA+GRRGYC 
Sbjct: 241 LSSYLSHGHVILSLDGMHVHSVDDWINLSAEISELTFQNETHSRLGENNLMANGRRGYCF 300

Query: 301 PNFMLKESNKVQFTHDQSACFGDLTSFTSIPCVSSTALIDGYTEDNNSNLKEGIYCLNVS 360
           PNFMLKESNKVQFTHDQ+ CFGD TSFTSIPCVSST LIDGYTEDNNSN KEGIYCLNV+
Sbjct: 301 PNFMLKESNKVQFTHDQATCFGDFTSFTSIPCVSSTGLIDGYTEDNNSNRKEGIYCLNVN 360

Query: 361 DVIKLNKCTRWDKAVISDSTSSCMCSQDETCLGPVQMPGLVWVEITYLNPHSSDCSYSRE 420
           DV+KLNKC+ WDKAVI+D+TSSCMCS+DETCL PVQMPGLVWVEITYLNP+SSDC YSRE
Sbjct: 361 DVMKLNKCSSWDKAVINDNTSSCMCSRDETCLSPVQMPGLVWVEITYLNPYSSDCFYSRE 420

Query: 421 YPLPSSNCSGTFIFVGDVVSMAHSIQLTMYRPRLDFCFAAYLPDVLERILLCLFHTSLAL 480
           YPLPSSNCSGTFIFVGDVVSMA SIQLTMYRPR DF FA YLPDVLE+IL CLFHTSLAL
Sbjct: 421 YPLPSSNCSGTFIFVGDVVSMARSIQLTMYRPRFDFHFATYLPDVLEKILSCLFHTSLAL 480

Query: 481 ALLNSLPVYYLDGESILEIIIFQLTSMSPWNKEKVLRSFLMGGTLISIFLLLRIFFHLFV 540
           ALLNSLPVY LDGESILEI+IFQLTS+SP NKEKVLRSFL+ GTLISIFLLLRIFFHL +
Sbjct: 481 ALLNSLPVYCLDGESILEILIFQLTSLSPRNKEKVLRSFLIAGTLISIFLLLRIFFHLLI 538

BLAST of ClCG01G000040 vs. NCBI nr
Match: XP_022957694.1 (membrane-bound transcription factor site-2 protease homolog [Cucurbita moschata])

HSP 1 Score: 926.8 bits (2394), Expect = 8.6e-266
Identity = 464/542 (85.61%), Postives = 499/542 (92.07%), Query Frame = 0

Query: 1   MSGRWRRSKRPMAQADAHAHIPPLLPLTTSRKGLSNAISCWYCDCKITSFNELIFQFGRR 60
           MSGRWR SKRP A+ADAHA +PPLLPL+TSRKGLSNA+SCWYCDCKITSFNE IF FGRR
Sbjct: 1   MSGRWRLSKRPRAEADAHARVPPLLPLSTSRKGLSNAVSCWYCDCKITSFNEPIFHFGRR 60

Query: 61  HARVLRTWFSIGIGFSLAALAVVVTVLFWELATAMHIFGN--VIHGLPISYSSLFGLPPL 120
           HARVLR WFSIGIGFSLAALAVV TVLF ELA AMHIFGN  V H LPISYSSLFGLPPL
Sbjct: 61  HARVLRAWFSIGIGFSLAALAVVTTVLFLELAIAMHIFGNSDVPHSLPISYSSLFGLPPL 120

Query: 121 ISSCSFSPADAGCIIVSTLISVAFHEFGHAAAAASEGVKLEYVAVFIALLFPGALVAFNH 180
           ISSC+FSPADAG II+S+LISVAFHEFGHAAA ASEG+KLEYVAVFIALLFPGALVAFNH
Sbjct: 121 ISSCNFSPADAGYIIISSLISVAFHEFGHAAAVASEGIKLEYVAVFIALLFPGALVAFNH 180

Query: 181 DALQDSSCFSALRIYCAGIWHNTVLSAASGLILFFLPLILFPLYIHGQSPMVLDVPYTSP 240
           D LQDSSCF+ALRIYCAGIWHN VLSAASGL+LF LPLILFPLYIHG+SPMVLDVP TSP
Sbjct: 181 DVLQDSSCFNALRIYCAGIWHNAVLSAASGLMLFCLPLILFPLYIHGESPMVLDVPSTSP 240

Query: 241 LSSYLSHGHVILSLDDMNVHSVDDWINLSTQISELTFQNETPSRLGENNRMADGRRGYCV 300
           LS YLSHGH+ILSLD M++ +VDDW+NLS QISE TFQN T SRLGEN+RMA+GR+GYCV
Sbjct: 241 LSGYLSHGHLILSLDGMHIQNVDDWVNLSAQISESTFQNGTLSRLGENDRMANGRKGYCV 300

Query: 301 PNFMLKESNKVQFTHDQSACFGDLTSFTSIPCVSSTALIDGYTEDNNSNLKEGIYCLNVS 360
           PNFMLKESNKVQF+HDQS CFGDLTSFTSIPCVSST L+DG  +D++ N KEGI+CLNV+
Sbjct: 301 PNFMLKESNKVQFSHDQSTCFGDLTSFTSIPCVSSTVLVDGDVDDSHYNRKEGIFCLNVN 360

Query: 361 DVIKLNKC-TRWDKAVISDSTSSCMCSQDETCLGPVQMPGLVWVEITYLNPHSSDCSYSR 420
           D+IKLNKC + WDKA+I+DSTS+CMCSQDETCL PVQMPG VWVEITYLNPHSSDC YSR
Sbjct: 361 DIIKLNKCISGWDKAIINDSTSTCMCSQDETCLSPVQMPGSVWVEITYLNPHSSDCFYSR 420

Query: 421 EYPLPSSNCSGTFIFVGDVVSMAHSIQLTMYRPRLDFCFAAYLPDVLERILLCLFHTSLA 480
           E PLPSSNCSGTFIFVGDVVSMAHSIQLTMYRPRLDF +A YLPDVLERI LCLFH SLA
Sbjct: 421 ENPLPSSNCSGTFIFVGDVVSMAHSIQLTMYRPRLDFHYARYLPDVLERIFLCLFHASLA 480

Query: 481 LALLNSLPVYYLDGESILEIIIFQLTSMSPWNKEKVLRSFLMGGTLISIFLLLRIFFHLF 540
           LALLNSLPVYYLDGESILEIIIFQLTSMSP NKEKVLR+FLMGGTL+SIFLLLRIFFH+F
Sbjct: 481 LALLNSLPVYYLDGESILEIIIFQLTSMSPRNKEKVLRAFLMGGTLMSIFLLLRIFFHVF 540

BLAST of ClCG01G000040 vs. ExPASy Swiss-Prot
Match: F4JUU5 (Membrane-bound transcription factor site-2 protease homolog OS=Arabidopsis thaliana OX=3702 GN=S2P PE=2 SV=2)

HSP 1 Score: 406.4 bits (1043), Expect = 5.1e-112
Identity = 230/508 (45.28%), Postives = 314/508 (61.81%), Query Frame = 0

Query: 29  TSRKGLSNAISCWYCDCKITSFNELIFQFGRRHARVLRTWFSIGIGFSLAALAVVVTVLF 88
           T  + + N  SC YCD KI++FNE IF+ GRR + VL+ WFSIG+GF +A+L ++VTV  
Sbjct: 21  TGGENIENEASCCYCDLKISNFNEPIFRLGRRFSGVLKVWFSIGLGFGVASL-ILVTVFL 80

Query: 89  WELATAMHIFGNVIHGLPISYSSLFGLPPLISSCSFSPADAGCIIVSTLISVAFHEFGHA 148
                +  +F N +       S++FG  P   S   S +    ++VST+I+V+ HE GHA
Sbjct: 81  LLQFHSNPLFSNRL------TSAVFGFSP---STRVSLSGIAYVLVSTVITVSVHELGHA 140

Query: 149 AAAASEGVKLEYVAVFIALLFPGALVAFNHDALQDSSCFSALRIYCAGIWHNTVLSAASG 208
            AAASEG+++EY+AVFIA +FPG LVAF++D LQ    F+ALRIYCAGIWHN V  A   
Sbjct: 141 LAAASEGIQMEYIAVFIAAIFPGGLVAFDNDVLQSLPSFNALRIYCAGIWHNAVFCALCV 200

Query: 209 LILFFLPLILFPLYIHGQSPMVLDVPYTSPLSSYLSHGHVILSLDDMNVHSVDDWINLST 268
             LF LP++L P Y HG+S  V+DVP  SPL  YLS G VI+SLD + VH   +W+ L+ 
Sbjct: 201 FALFLLPVMLSPFYKHGESLTVVDVPSVSPLFGYLSPGDVIVSLDGIQVHKPSEWLELAA 260

Query: 269 QISELTFQNETPS-RLGENNRMADGRRGYCVPNFMLKESNKVQFTHDQSACFGDLTSFTS 328
            + +   +    S  LG + R   G +GYCVP  +++E  K +   +Q  C GDLT+F +
Sbjct: 261 ILDKENSKTSNGSLYLGGSRRFHHG-KGYCVPISLIEEGYKGKMVENQFVCPGDLTAFRT 320

Query: 329 IPCVSSTALIDGYTEDNNSNLKEGIYCLNVSDVIKLNKC-TRWDKAVISDSTSSCMCSQD 388
           +PC             +N+ ++E   CL+  D++KL KC   W     +D+ S C+C Q 
Sbjct: 321 MPC-------------SNAAIREVSVCLDAKDIVKLQKCGDGWVTTSDTDNQSDCVCPQG 380

Query: 389 ETCLGPVQMPGLVWVEITYLNPHSSDCSYSREYPLPSSNCSGTFIFVGDVVSMAHSIQLT 448
           + CL  +Q PG++W EITY    S DCS        +SNC GTF+FVGD+++M+HS+ LT
Sbjct: 381 DLCLQAMQSPGVLWTEITYKRTSSQDCS-RLGLDFNTSNCLGTFVFVGDLIAMSHSVHLT 440

Query: 449 MYRPRLDF-CFAAYLPDVLERILLCLFHTSLALALLNSLPVYYLDGESILEIIIFQLTSM 508
            Y+PR  F  F    P++LER L C FH SLAL LLNSLPVYYLDGESILE  +   T +
Sbjct: 441 AYQPRWLFNFFGKSFPNILERSLTCTFHVSLALVLLNSLPVYYLDGESILESSLQSFTWL 500

Query: 509 SPWNKEKVLRSFLMGGTLISIFLLLRIF 534
           SP  K+K L+  L+GG+L+S     RIF
Sbjct: 501 SPRKKKKALQVCLVGGSLLSFLAFFRIF 503

BLAST of ClCG01G000040 vs. ExPASy Swiss-Prot
Match: O43462 (Membrane-bound transcription factor site-2 protease OS=Homo sapiens OX=9606 GN=MBTPS2 PE=1 SV=1)

HSP 1 Score: 119.4 bits (298), Expect = 1.3e-25
Identity = 138/518 (26.64%), Postives = 217/518 (41.89%), Query Frame = 0

Query: 37  AISCWYCDCKITSFNELIFQFGRRHARVLRTWFSIGIGFSLAALAVVVTVLFWELATAMH 96
           +IS ++   +   FN   + +GRR AR+L  WF+ G+ F + A+    +  F    T M 
Sbjct: 44  SISPFHIRWQTAVFNRAFYSWGRRKARMLYQWFNFGMVFGVIAM---FSSFFLLGKTLMQ 103

Query: 97  IFGNVIHGLPISYSSLFGLPPLISSCSFSPADAGC------------------------I 156
               ++   P SYSS        SS S S + +                           
Sbjct: 104 TLAQMMADSPSSYSSSSSSSSSSSSSSSSSSSSSSSLHNEQVLQVVVPGINLPVNQLTYF 163

Query: 157 IVSTLISVAFHEFGHAAAAASEGVKLEYVAVFIALLFPGALVAFNHDALQDSSCFSALRI 216
             + LIS   HE GH  AA  E V+     +F+ +++PGA V      LQ  S    LRI
Sbjct: 164 FTAVLISGVVHEIGHGIAAIREQVRFNGFGIFLFIIYPGAFVDLFTTHLQLISPVQQLRI 223

Query: 217 YCAGIWHNTVLSAASGLILFFLPLILFPLYIHGQSPMVLDVPYTSPL--SSYLSHGHVIL 276
           +CAGIWHN VL+    L L  LP+IL P Y  G   ++ +V   SP      L  G ++ 
Sbjct: 224 FCAGIWHNFVLALLGILALVLLPVILLPFYYTGVGVLITEVAEDSPAIGPRGLFVGDLVT 283

Query: 277 SLDDMNVHSVDDWINLSTQISELTFQNETPSRLGENNRMADGRRGYCVPNFMLKESNKVQ 336
            L D  V +V DW             NE    +    ++     GYC+    L++     
Sbjct: 284 HLQDCPVTNVQDW-------------NECLDTIAYEPQI-----GYCISASTLQQ----- 343

Query: 337 FTHDQSACFGDLTSFTSIPCVSSTALIDGYTE-DNNSNLKEGIYCLNVSDVIKLNKCTRW 396
                           S P V +   +DG TE  NN +L +  +    +   +L+ C   
Sbjct: 344 ---------------LSFP-VRAYKRLDGSTECCNNHSLTDVCFSYRNNFNKRLHTCLPA 403

Query: 397 DKAVISDSTSSCMCSQDETCLGPVQMPGLVWVEITYLNPHSSDCSYSREYPLPSSNCSGT 456
            KAV  ++T  C  ++D  C    +     +  I  L  H+          +        
Sbjct: 404 RKAV--EATQVCRTNKD--C---KKSSSSSFCIIPSLETHTRLIKVKHPPQI-------D 463

Query: 457 FIFVGDVVSMAHSIQLTMYRPRLDFCFAAYLPDVLERILLCLFHTSLALALLNSLPVYYL 516
            ++VG  + + +++ +T + PR +F  +  LP V+E  +  L   S ALA++N++P + L
Sbjct: 464 MLYVGHPLHLHYTVSITSFIPRFNF-LSIDLPVVVETFVKYLISLSGALAIVNAVPCFAL 504

Query: 517 DGESILEIII-FQLTSMSPWNKEKVLRSF--LMGGTLI 525
           DG+ IL   +   LTS+   N  K L  F  L+GG+++
Sbjct: 524 DGQWILNSFLDATLTSVIGDNDVKDLIGFFILLGGSVL 504

BLAST of ClCG01G000040 vs. ExPASy Swiss-Prot
Match: Q5RAC8 (Membrane-bound transcription factor site-2 protease OS=Pongo abelii OX=9601 GN=MBTPS2 PE=2 SV=1)

HSP 1 Score: 118.6 bits (296), Expect = 2.1e-25
Identity = 138/520 (26.54%), Postives = 217/520 (41.73%), Query Frame = 0

Query: 37  AISCWYCDCKITSFNELIFQFGRRHARVLRTWFSIGIGFSLAALAVVVTVLFWELATAMH 96
           +IS ++   +   FN   + +GRR AR+L  WF+ G+ F + A+    +  F    T M 
Sbjct: 44  SISPFHIRWQTAVFNRAFYSWGRRKARMLYQWFNFGMVFGVIAM---FSSFFLLGKTLMQ 103

Query: 97  IFGNVIHGLPISYSSLFGLPPLISSCSFSPADAGC------------------------- 156
               ++   P SYSS        SS S S + +                           
Sbjct: 104 TLAQMMADSPSSYSSSSSSSSSSSSSSSSSSSSSSSSSLHNEQVLQVVVPGINLPVNQLT 163

Query: 157 -IIVSTLISVAFHEFGHAAAAASEGVKLEYVAVFIALLFPGALVAFNHDALQDSSCFSAL 216
               + LIS   HE GH  AA  E V+     +F+ +++PGA V      LQ  S    L
Sbjct: 164 YFFAAVLISGVVHEIGHGIAAIREQVRFNGFGIFLFIIYPGAFVDLFTTHLQLISPVQQL 223

Query: 217 RIYCAGIWHNTVLSAASGLILFFLPLILFPLYIHGQSPMVLDVPYTSPL--SSYLSHGHV 276
           RI+CAGIWHN VL+    L L  LP+IL P Y  G   ++ +V   SP      L  G +
Sbjct: 224 RIFCAGIWHNFVLALLGILALVLLPVILLPFYYTGVGVLITEVAEDSPAIGPRGLFVGDL 283

Query: 277 ILSLDDMNVHSVDDWINLSTQISELTFQNETPSRLGENNRMADGRRGYCVPNFMLKESNK 336
           +  L D  V +V DW             NE    +    ++     GYC+    L++   
Sbjct: 284 VTHLQDCPVTNVQDW-------------NECLDTIAYEPQI-----GYCISASTLQQ--- 343

Query: 337 VQFTHDQSACFGDLTSFTSIPCVSSTALIDGYTE-DNNSNLKEGIYCLNVSDVIKLNKCT 396
                             S P V +   +DG TE  NN +L +  +    +   +L+ C 
Sbjct: 344 -----------------LSFP-VRAYKRLDGSTECCNNHSLTDVCFSYRNNFNKRLHTCL 403

Query: 397 RWDKAVISDSTSSCMCSQDETCLGPVQMPGLVWVEITYLNPHSSDCSYSREYPLPSSNCS 456
              KAV  ++T  C  ++D  C    +     +  I  L  H+          +      
Sbjct: 404 PARKAV--EATQVCRTNKD--C---KKSSSSSFCIIPSLETHTRLIKVKHPPQI------ 463

Query: 457 GTFIFVGDVVSMAHSIQLTMYRPRLDFCFAAYLPDVLERILLCLFHTSLALALLNSLPVY 516
              ++VG  + + +++ +T + PR +F  +  LP V+E  +  L   S ALA++N++P +
Sbjct: 464 -DMLYVGHPLHLHYTVSITSFIPRFNF-LSIDLPVVVETFVKYLISLSGALAIVNAVPCF 506

Query: 517 YLDGESILEIII-FQLTSMSPWNKEKVLRSF--LMGGTLI 525
            LDG+ IL   +   LTS+   N  K L  F  L+GG+++
Sbjct: 524 ALDGQWILNSFLDATLTSVIGDNDVKDLIGFFILLGGSVL 506

BLAST of ClCG01G000040 vs. ExPASy Swiss-Prot
Match: O54862 (Membrane-bound transcription factor site-2 protease OS=Cricetulus griseus OX=10029 GN=MBTPS2 PE=2 SV=1)

HSP 1 Score: 117.9 bits (294), Expect = 3.7e-25
Identity = 136/515 (26.41%), Postives = 217/515 (42.14%), Query Frame = 0

Query: 31  RKGLSNAISCWYCDCKITSFNELIFQFGRRHARVLRTWFSIGIGFSLAALAVVVTVLFWE 90
           + GLS  IS ++   + + FN   + +GRR AR+L  WF+ G+ F + A+    +  F  
Sbjct: 40  KNGLS--ISPFHIRWQTSVFNRAFYSWGRRKARMLYQWFNFGMVFGVIAM---FSSFFLL 99

Query: 91  LATAMHIFGNVIHGLPISYSSLFGLPPLISSCSFSPADAGCIIV---------------S 150
             T M     ++   P S SS        SS S        ++V               +
Sbjct: 100 GKTLMQTLAQMMADSPSSSSSSSSSSSSSSSSSIHNEQVLQVVVPGINLPVNQLTYFFAA 159

Query: 151 TLISVAFHEFGHAAAAASEGVKLEYVAVFIALLFPGALVAFNHDALQDSSCFSALRIYCA 210
            LIS   HE GH  AA  E V+     +F+ +++PGA V      LQ  S    LRI+CA
Sbjct: 160 VLISGVVHEIGHGIAAIREQVRFNGFGIFLFIIYPGAFVDLFTTHLQLISPVQQLRIFCA 219

Query: 211 GIWHNTVLSAASGLILFFLPLILFPLYIHGQSPMVLDVPYTSPL--SSYLSHGHVILSLD 270
           GIWHN VL+    L L  LP+IL P Y  G   ++ +V   SP      L  G ++  L 
Sbjct: 220 GIWHNFVLALLGILALVLLPVILLPFYYTGVGVLITEVAEDSPAIGPRGLFVGDLVTHLQ 279

Query: 271 DMNVHSVDDWINLSTQISELTFQNETPSRLGENNRMADGRRGYCVPNFMLKESNKVQFTH 330
           D  V +V DW             NE    +    ++     GYC+       ++ +Q   
Sbjct: 280 DCPVTNVQDW-------------NECLDTIAYEPQI-----GYCI------SASTLQQLS 339

Query: 331 DQSACFGDLTSFTSIPCVSSTALIDGYTEDNNSNLKEGIYCLNVSDVIKLNKCTRWDK-A 390
                +  L   T   C ++ +L D      N+  K    CL     ++  +  R +K  
Sbjct: 340 FPVRAYKRLDGSTE--CCNNHSLTDVCFSYRNNFNKRLHTCLPARKAVEATQVCRTNKDC 399

Query: 391 VISDSTSSCMCSQDETCLGPVQMPGLVWVEITYLNPHSSDCSYSREYPLPSSNCSGTFIF 450
             S S+S C+    ET    +++           +P   D                  ++
Sbjct: 400 KTSSSSSFCIVPSLETHTRLIKVK----------HPPQID-----------------MLY 459

Query: 451 VGDVVSMAHSIQLTMYRPRLDFCFAAYLPDVLERILLCLFHTSLALALLNSLPVYYLDGE 510
           VG  + + +++ +T + PR +F  +  LP ++E  +  L   S ALA++N++P + LDG+
Sbjct: 460 VGHPLHLHYTVSITSFIPRFNF-LSIDLPVIVETFVKYLISLSGALAIVNAVPCFALDGQ 495

Query: 511 SILEIII-FQLTSMSPWNKEKVLRSF--LMGGTLI 525
            IL   +   LTS+   N  K L  F  L+GG+++
Sbjct: 520 WILNSFLDATLTSVIGDNDVKDLIGFFILLGGSVL 495

BLAST of ClCG01G000040 vs. ExPASy Swiss-Prot
Match: Q8CHX6 (Membrane-bound transcription factor site-2 protease OS=Mus musculus OX=10090 GN=Mbtps2 PE=2 SV=1)

HSP 1 Score: 117.9 bits (294), Expect = 3.7e-25
Identity = 135/514 (26.26%), Postives = 215/514 (41.83%), Query Frame = 0

Query: 37  AISCWYCDCKITSFNELIFQFGRRHARVLRTWFSIGIGFSLAALAVVVTVLFWELATAMH 96
           +IS ++   + + FN   + +GRR AR+L  WF+ G+ F + A+    +  F    T M 
Sbjct: 44  SISPFHIRWQTSIFNRAFYSWGRRKARMLYQWFNFGMVFGVIAM---FSSFFLLGKTLMQ 103

Query: 97  IFGNVIHGLPISYSSLFGLPPLISSCSFSPA--------------------DAGCIIVST 156
               ++   P  YSS        SS S S +                           + 
Sbjct: 104 TLAQMMADSPSPYSSSSSSSSSSSSSSSSSSSLHNEQVLQVVVPGINLPVNQLTYFFAAV 163

Query: 157 LISVAFHEFGHAAAAASEGVKLEYVAVFIALLFPGALVAFNHDALQDSSCFSALRIYCAG 216
           LIS   HE GH  AA  E V+     +F+ +++PGA V      LQ  S    LRI+CAG
Sbjct: 164 LISGVVHEIGHGIAAIREQVRFNGFGIFLFIIYPGAFVDLFTTHLQLISPVQQLRIFCAG 223

Query: 217 IWHNTVLSAASGLILFFLPLILFPLYIHGQSPMVLDVPYTSPL--SSYLSHGHVILSLDD 276
           IWHN VL+    L L  LP+IL P Y  G   ++ +V   SP      L  G ++  L D
Sbjct: 224 IWHNFVLALLGILALVLLPVILLPFYYTGVGVLITEVAEDSPAIGPRGLFVGDLVTHLQD 283

Query: 277 MNVHSVDDWINLSTQISELTFQNETPSRLGENNRMADGRRGYCVPNFMLKESNKVQFTHD 336
             V +V DW             NE    +    ++     GYC+    L++         
Sbjct: 284 CPVTNVQDW-------------NECLDTIAYEPQI-----GYCISASTLQQ--------- 343

Query: 337 QSACFGDLTSFTSIPCVSSTALIDGYTE-DNNSNLKEGIYCLNVSDVIKLNKCTRWDKAV 396
                       S P V +   +DG TE  NN +L +  +    +   +L+ C    KAV
Sbjct: 344 -----------LSFP-VRAYKRLDGSTECCNNHSLTDVCFSYRNNFNKRLHTCLPARKAV 403

Query: 397 ISDSTSSCMCSQDETCLGPVQMPGLVWVEITYLNPHSSDCSYSREYPLPSSNCSGTFIFV 456
             ++T  C  ++D  C         +   +  L  H+          +         ++V
Sbjct: 404 --EATQVCRSNKD--CKSGASSSFCI---VPSLETHTRLIKVKHPPQI-------DMLYV 463

Query: 457 GDVVSMAHSIQLTMYRPRLDFCFAAYLPDVLERILLCLFHTSLALALLNSLPVYYLDGES 516
           G  + + +++ +T + PR +F  +  LP ++E  +  L   S ALA++N++P + LDG+ 
Sbjct: 464 GHPLHLHYTVSITSFIPRFNF-LSIDLPVIVETFVKYLISLSGALAIVNAVPCFALDGQW 500

Query: 517 ILEIII-FQLTSMSPWNKEKVLRSF--LMGGTLI 525
           IL   +   LTS+   N  K L  F  L+GG+++
Sbjct: 524 ILNSFLDATLTSVIGDNDVKDLIGFFILLGGSVL 500

BLAST of ClCG01G000040 vs. ExPASy TrEMBL
Match: A0A0A0KQ25 (Endopeptidase S2P OS=Cucumis sativus OX=3659 GN=Csa_5G139060 PE=4 SV=1)

HSP 1 Score: 949.9 bits (2454), Expect = 4.6e-273
Identity = 483/541 (89.28%), Postives = 501/541 (92.61%), Query Frame = 0

Query: 1   MSGRWRRSKRPMAQADAHAHIPPLLPLTTSRKGLSNAISCWYCDCKITSFNELIFQFGRR 60
           M GRWRRSKR MAQADAHAHIPPLLPLTTSRKGLSN+ISCWYCD KITSFNELIFQFGRR
Sbjct: 1   MPGRWRRSKRLMAQADAHAHIPPLLPLTTSRKGLSNSISCWYCDYKITSFNELIFQFGRR 60

Query: 61  HARVLRTWFSIGIGFSLAALAVVVTVLFWELATAMHIFG--NVIHGLPISYSSLFGLPPL 120
           HARVLRTWFSIGIGFSLAALAVV TVLF EL   MHIFG  NVI GLP+S SSLFGLP L
Sbjct: 61  HARVLRTWFSIGIGFSLAALAVVATVLFRELTIVMHIFGKSNVIRGLPVSCSSLFGLPSL 120

Query: 121 ISSCSFSPADAGCIIVSTLISVAFHEFGHAAAAASEGVKLEYVAVFIALLFPGALVAFNH 180
           ISSCS  PA AG II+STLISVAFHEFGHAAAAASEGVKLEY+AVFIALLFPGALVAFN+
Sbjct: 121 ISSCSLFPAGAGYIIISTLISVAFHEFGHAAAAASEGVKLEYIAVFIALLFPGALVAFNY 180

Query: 181 DALQDSSCFSALRIYCAGIWHNTVLSAASGLILFFLPLILFPLYIHGQSPMVLDVPYTSP 240
           DALQDSSCF+ALRIYCAGIWHNT LSAASGLILFFLPLILFPLYIHGQSP VLDVPYTSP
Sbjct: 181 DALQDSSCFNALRIYCAGIWHNTALSAASGLILFFLPLILFPLYIHGQSPTVLDVPYTSP 240

Query: 241 LSSYLSHGHVILSLDDMNVHSVDDWINLSTQISELTFQNETPSRLGENNRMADGRRGYCV 300
           LSSYLSHGHVILSLD M+VHSVDDWINLS QIS+LTFQNET SRL ENN+MA+GRRGYC 
Sbjct: 241 LSSYLSHGHVILSLDGMHVHSVDDWINLSAQISDLTFQNETHSRLVENNQMANGRRGYCF 300

Query: 301 PNFMLKESNKVQFTHDQSACFGDLTSFTSIPCVSSTALIDGYTEDNNSNLKEGIYCLNVS 360
           PNFMLKESNKVQFTHDQS CFGD TSFTSIPCVSS  LIDGYTEDN SN KEGIYCLNV+
Sbjct: 301 PNFMLKESNKVQFTHDQSTCFGDFTSFTSIPCVSSAGLIDGYTEDNYSNRKEGIYCLNVN 360

Query: 361 DVIKLNKCTRWDKAVISDSTSSCMCSQDETCLGPVQMPGLVWVEITYLNPHSSDCSYSRE 420
           DV+KLNKC+ WDKA I+D+TSSCMCSQDETCL PVQMPGLVWVEITYLNP+SSDCSYSRE
Sbjct: 361 DVMKLNKCSSWDKAAINDNTSSCMCSQDETCLSPVQMPGLVWVEITYLNPYSSDCSYSRE 420

Query: 421 YPLPSSNCSGTFIFVGDVVSMAHSIQLTMYRPRLDFCFAAYLPDVLERILLCLFHTSLAL 480
           YPLPSSNCSGTFIFVGDVVSMA SIQLTMYRPRLDF FA YLPDVLE+IL CLFHTSLAL
Sbjct: 421 YPLPSSNCSGTFIFVGDVVSMARSIQLTMYRPRLDFHFAIYLPDVLEKILSCLFHTSLAL 480

Query: 481 ALLNSLPVYYLDGESILEIIIFQLTSMSPWNKEKVLRSFLMGGTLISIFLLLRIFFHLFV 540
           ALLNSLPVY LDGESILEIIIFQLTS+SP NKEKVLRSFLMGGTLISIFLLLRIFFHL V
Sbjct: 481 ALLNSLPVYCLDGESILEIIIFQLTSLSPRNKEKVLRSFLMGGTLISIFLLLRIFFHLLV 540

BLAST of ClCG01G000040 vs. ExPASy TrEMBL
Match: A0A1S3AV46 (Endopeptidase S2P OS=Cucumis melo OX=3656 GN=LOC103482997 PE=4 SV=1)

HSP 1 Score: 944.9 bits (2441), Expect = 1.5e-271
Identity = 479/541 (88.54%), Postives = 497/541 (91.87%), Query Frame = 0

Query: 1   MSGRWRRSKRPMAQADAHAHIPPLLPLTTSRKGLSNAISCWYCDCKITSFNELIFQFGRR 60
           M GRWRRSKR MAQ D HAHIP LLPLTTSRKG+SNAISCWYCD KITSFNELIFQFGRR
Sbjct: 1   MPGRWRRSKRSMAQVDVHAHIPSLLPLTTSRKGISNAISCWYCDYKITSFNELIFQFGRR 60

Query: 61  HARVLRTWFSIGIGFSLAALAVVVTVLFWELATAMHIF--GNVIHGLPISYSSLFGLPPL 120
           HAR LRTWFSIGIGFSLAALAVV TVLF E    MHIF   NVI GLPIS SSLFGLP L
Sbjct: 61  HARALRTWFSIGIGFSLAALAVVATVLFRETKIIMHIFVNSNVIRGLPISCSSLFGLPSL 120

Query: 121 ISSCSFSPADAGCIIVSTLISVAFHEFGHAAAAASEGVKLEYVAVFIALLFPGALVAFNH 180
           ISSCSF PA AG II+STLISVAFHEFGHAAAAASEGVKLEYVAVFIALLFPGALVAFNH
Sbjct: 121 ISSCSFFPAGAGYIIISTLISVAFHEFGHAAAAASEGVKLEYVAVFIALLFPGALVAFNH 180

Query: 181 DALQDSSCFSALRIYCAGIWHNTVLSAASGLILFFLPLILFPLYIHGQSPMVLDVPYTSP 240
           DALQDSSCFSALRIYCAGIWHNT LSAASGLILFFLPLILFPLYIHGQSPMVLDVPYTSP
Sbjct: 181 DALQDSSCFSALRIYCAGIWHNTALSAASGLILFFLPLILFPLYIHGQSPMVLDVPYTSP 240

Query: 241 LSSYLSHGHVILSLDDMNVHSVDDWINLSTQISELTFQNETPSRLGENNRMADGRRGYCV 300
           LSSYLSHGHVILSLD M+VHSVDDWINLS +ISELTFQNET SRLGENN MA+GRRGYC 
Sbjct: 241 LSSYLSHGHVILSLDGMHVHSVDDWINLSAEISELTFQNETHSRLGENNLMANGRRGYCF 300

Query: 301 PNFMLKESNKVQFTHDQSACFGDLTSFTSIPCVSSTALIDGYTEDNNSNLKEGIYCLNVS 360
           PNFMLKESNKVQFTHDQ+ CFGD TSFTSIPCVSST LIDGYTEDNNSN KEGIYCLNV+
Sbjct: 301 PNFMLKESNKVQFTHDQATCFGDFTSFTSIPCVSSTGLIDGYTEDNNSNRKEGIYCLNVN 360

Query: 361 DVIKLNKCTRWDKAVISDSTSSCMCSQDETCLGPVQMPGLVWVEITYLNPHSSDCSYSRE 420
           DV+KLNKC+ WDKAVI+D+TSSCMCS+DETCL PVQMPGLVWVEITYLNP+SSDC YSRE
Sbjct: 361 DVMKLNKCSSWDKAVINDNTSSCMCSRDETCLSPVQMPGLVWVEITYLNPYSSDCFYSRE 420

Query: 421 YPLPSSNCSGTFIFVGDVVSMAHSIQLTMYRPRLDFCFAAYLPDVLERILLCLFHTSLAL 480
           YPLPSSNCSGTFIFVGDVVSMA SIQLTMYRPR DF FA YLPDVLE+IL CLFHTSLAL
Sbjct: 421 YPLPSSNCSGTFIFVGDVVSMARSIQLTMYRPRFDFHFATYLPDVLEKILSCLFHTSLAL 480

Query: 481 ALLNSLPVYYLDGESILEIIIFQLTSMSPWNKEKVLRSFLMGGTLISIFLLLRIFFHLFV 540
           ALLNSLPVY LDGESILEI+IFQLTS+SP NKEKVLRSFL+ GTLISIFLLLRIFFHL +
Sbjct: 481 ALLNSLPVYCLDGESILEILIFQLTSLSPRNKEKVLRSFLIAGTLISIFLLLRIFFHLLI 540

BLAST of ClCG01G000040 vs. ExPASy TrEMBL
Match: A0A1S3AUN7 (Endopeptidase S2P OS=Cucumis melo OX=3656 GN=LOC103482997 PE=4 SV=1)

HSP 1 Score: 932.6 bits (2409), Expect = 7.6e-268
Identity = 476/541 (87.99%), Postives = 494/541 (91.31%), Query Frame = 0

Query: 1   MSGRWRRSKRPMAQADAHAHIPPLLPLTTSRKGLSNAISCWYCDCKITSFNELIFQFGRR 60
           M GRWRRSKR MAQ D HAHIP LLPLTTSRKG+SNAISCWYCD KITSFNELIFQFGR 
Sbjct: 1   MPGRWRRSKRSMAQVDVHAHIPSLLPLTTSRKGISNAISCWYCDYKITSFNELIFQFGR- 60

Query: 61  HARVLRTWFSIGIGFSLAALAVVVTVLFWELATAMHIF--GNVIHGLPISYSSLFGLPPL 120
             R LRTWFSIGIGFSLAALAVV TVLF E    MHIF   NVI GLPIS SSLFGLP L
Sbjct: 61  --RALRTWFSIGIGFSLAALAVVATVLFRETKIIMHIFVNSNVIRGLPISCSSLFGLPSL 120

Query: 121 ISSCSFSPADAGCIIVSTLISVAFHEFGHAAAAASEGVKLEYVAVFIALLFPGALVAFNH 180
           ISSCSF PA AG II+STLISVAFHEFGHAAAAASEGVKLEYVAVFIALLFPGALVAFNH
Sbjct: 121 ISSCSFFPAGAGYIIISTLISVAFHEFGHAAAAASEGVKLEYVAVFIALLFPGALVAFNH 180

Query: 181 DALQDSSCFSALRIYCAGIWHNTVLSAASGLILFFLPLILFPLYIHGQSPMVLDVPYTSP 240
           DALQDSSCFSALRIYCAGIWHNT LSAASGLILFFLPLILFPLYIHGQSPMVLDVPYTSP
Sbjct: 181 DALQDSSCFSALRIYCAGIWHNTALSAASGLILFFLPLILFPLYIHGQSPMVLDVPYTSP 240

Query: 241 LSSYLSHGHVILSLDDMNVHSVDDWINLSTQISELTFQNETPSRLGENNRMADGRRGYCV 300
           LSSYLSHGHVILSLD M+VHSVDDWINLS +ISELTFQNET SRLGENN MA+GRRGYC 
Sbjct: 241 LSSYLSHGHVILSLDGMHVHSVDDWINLSAEISELTFQNETHSRLGENNLMANGRRGYCF 300

Query: 301 PNFMLKESNKVQFTHDQSACFGDLTSFTSIPCVSSTALIDGYTEDNNSNLKEGIYCLNVS 360
           PNFMLKESNKVQFTHDQ+ CFGD TSFTSIPCVSST LIDGYTEDNNSN KEGIYCLNV+
Sbjct: 301 PNFMLKESNKVQFTHDQATCFGDFTSFTSIPCVSSTGLIDGYTEDNNSNRKEGIYCLNVN 360

Query: 361 DVIKLNKCTRWDKAVISDSTSSCMCSQDETCLGPVQMPGLVWVEITYLNPHSSDCSYSRE 420
           DV+KLNKC+ WDKAVI+D+TSSCMCS+DETCL PVQMPGLVWVEITYLNP+SSDC YSRE
Sbjct: 361 DVMKLNKCSSWDKAVINDNTSSCMCSRDETCLSPVQMPGLVWVEITYLNPYSSDCFYSRE 420

Query: 421 YPLPSSNCSGTFIFVGDVVSMAHSIQLTMYRPRLDFCFAAYLPDVLERILLCLFHTSLAL 480
           YPLPSSNCSGTFIFVGDVVSMA SIQLTMYRPR DF FA YLPDVLE+IL CLFHTSLAL
Sbjct: 421 YPLPSSNCSGTFIFVGDVVSMARSIQLTMYRPRFDFHFATYLPDVLEKILSCLFHTSLAL 480

Query: 481 ALLNSLPVYYLDGESILEIIIFQLTSMSPWNKEKVLRSFLMGGTLISIFLLLRIFFHLFV 540
           ALLNSLPVY LDGESILEI+IFQLTS+SP NKEKVLRSFL+ GTLISIFLLLRIFFHL +
Sbjct: 481 ALLNSLPVYCLDGESILEILIFQLTSLSPRNKEKVLRSFLIAGTLISIFLLLRIFFHLLI 538

BLAST of ClCG01G000040 vs. ExPASy TrEMBL
Match: A0A6J1H2Q0 (Endopeptidase S2P OS=Cucurbita moschata OX=3662 GN=LOC111459160 PE=4 SV=1)

HSP 1 Score: 926.8 bits (2394), Expect = 4.2e-266
Identity = 464/542 (85.61%), Postives = 499/542 (92.07%), Query Frame = 0

Query: 1   MSGRWRRSKRPMAQADAHAHIPPLLPLTTSRKGLSNAISCWYCDCKITSFNELIFQFGRR 60
           MSGRWR SKRP A+ADAHA +PPLLPL+TSRKGLSNA+SCWYCDCKITSFNE IF FGRR
Sbjct: 1   MSGRWRLSKRPRAEADAHARVPPLLPLSTSRKGLSNAVSCWYCDCKITSFNEPIFHFGRR 60

Query: 61  HARVLRTWFSIGIGFSLAALAVVVTVLFWELATAMHIFGN--VIHGLPISYSSLFGLPPL 120
           HARVLR WFSIGIGFSLAALAVV TVLF ELA AMHIFGN  V H LPISYSSLFGLPPL
Sbjct: 61  HARVLRAWFSIGIGFSLAALAVVTTVLFLELAIAMHIFGNSDVPHSLPISYSSLFGLPPL 120

Query: 121 ISSCSFSPADAGCIIVSTLISVAFHEFGHAAAAASEGVKLEYVAVFIALLFPGALVAFNH 180
           ISSC+FSPADAG II+S+LISVAFHEFGHAAA ASEG+KLEYVAVFIALLFPGALVAFNH
Sbjct: 121 ISSCNFSPADAGYIIISSLISVAFHEFGHAAAVASEGIKLEYVAVFIALLFPGALVAFNH 180

Query: 181 DALQDSSCFSALRIYCAGIWHNTVLSAASGLILFFLPLILFPLYIHGQSPMVLDVPYTSP 240
           D LQDSSCF+ALRIYCAGIWHN VLSAASGL+LF LPLILFPLYIHG+SPMVLDVP TSP
Sbjct: 181 DVLQDSSCFNALRIYCAGIWHNAVLSAASGLMLFCLPLILFPLYIHGESPMVLDVPSTSP 240

Query: 241 LSSYLSHGHVILSLDDMNVHSVDDWINLSTQISELTFQNETPSRLGENNRMADGRRGYCV 300
           LS YLSHGH+ILSLD M++ +VDDW+NLS QISE TFQN T SRLGEN+RMA+GR+GYCV
Sbjct: 241 LSGYLSHGHLILSLDGMHIQNVDDWVNLSAQISESTFQNGTLSRLGENDRMANGRKGYCV 300

Query: 301 PNFMLKESNKVQFTHDQSACFGDLTSFTSIPCVSSTALIDGYTEDNNSNLKEGIYCLNVS 360
           PNFMLKESNKVQF+HDQS CFGDLTSFTSIPCVSST L+DG  +D++ N KEGI+CLNV+
Sbjct: 301 PNFMLKESNKVQFSHDQSTCFGDLTSFTSIPCVSSTVLVDGDVDDSHYNRKEGIFCLNVN 360

Query: 361 DVIKLNKC-TRWDKAVISDSTSSCMCSQDETCLGPVQMPGLVWVEITYLNPHSSDCSYSR 420
           D+IKLNKC + WDKA+I+DSTS+CMCSQDETCL PVQMPG VWVEITYLNPHSSDC YSR
Sbjct: 361 DIIKLNKCISGWDKAIINDSTSTCMCSQDETCLSPVQMPGSVWVEITYLNPHSSDCFYSR 420

Query: 421 EYPLPSSNCSGTFIFVGDVVSMAHSIQLTMYRPRLDFCFAAYLPDVLERILLCLFHTSLA 480
           E PLPSSNCSGTFIFVGDVVSMAHSIQLTMYRPRLDF +A YLPDVLERI LCLFH SLA
Sbjct: 421 ENPLPSSNCSGTFIFVGDVVSMAHSIQLTMYRPRLDFHYARYLPDVLERIFLCLFHASLA 480

Query: 481 LALLNSLPVYYLDGESILEIIIFQLTSMSPWNKEKVLRSFLMGGTLISIFLLLRIFFHLF 540
           LALLNSLPVYYLDGESILEIIIFQLTSMSP NKEKVLR+FLMGGTL+SIFLLLRIFFH+F
Sbjct: 481 LALLNSLPVYYLDGESILEIIIFQLTSMSPRNKEKVLRAFLMGGTLMSIFLLLRIFFHVF 540

BLAST of ClCG01G000040 vs. ExPASy TrEMBL
Match: A0A6J1JYM6 (Endopeptidase S2P OS=Cucurbita maxima OX=3661 GN=LOC111490932 PE=4 SV=1)

HSP 1 Score: 916.4 bits (2367), Expect = 5.6e-263
Identity = 458/542 (84.50%), Postives = 497/542 (91.70%), Query Frame = 0

Query: 1   MSGRWRRSKRPMAQADAHAHIPPLLPLTTSRKGLSNAISCWYCDCKITSFNELIFQFGRR 60
           M GRWR SKRP A+ADA+A +PPLLPL+TSRKGLSNA+SCWYCDCKITSFNE IF FGRR
Sbjct: 1   MPGRWRLSKRPRAEADANARVPPLLPLSTSRKGLSNAVSCWYCDCKITSFNEPIFHFGRR 60

Query: 61  HARVLRTWFSIGIGFSLAALAVVVTVLFWELATAMHIFGN--VIHGLPISYSSLFGLPPL 120
           HARVLR WFSIGIGFSLAALAVV TVLF ELA  MHIFGN  V H LPISYSSLFGLPPL
Sbjct: 61  HARVLRAWFSIGIGFSLAALAVVTTVLFLELAITMHIFGNSDVPHSLPISYSSLFGLPPL 120

Query: 121 ISSCSFSPADAGCIIVSTLISVAFHEFGHAAAAASEGVKLEYVAVFIALLFPGALVAFNH 180
           ISSC+F+PADAG II+S+LISVAFHEFGHAAA ASEG+KLEYVAVFIALLFPGALVAFNH
Sbjct: 121 ISSCNFAPADAGYIIISSLISVAFHEFGHAAAVASEGIKLEYVAVFIALLFPGALVAFNH 180

Query: 181 DALQDSSCFSALRIYCAGIWHNTVLSAASGLILFFLPLILFPLYIHGQSPMVLDVPYTSP 240
           D LQDSSCF+ALRIYCAGIWHN VLSAASGL+LF LPLILFPLYIHG+SPMVLDVP TSP
Sbjct: 181 DVLQDSSCFNALRIYCAGIWHNAVLSAASGLMLFCLPLILFPLYIHGESPMVLDVPSTSP 240

Query: 241 LSSYLSHGHVILSLDDMNVHSVDDWINLSTQISELTFQNETPSRLGENNRMADGRRGYCV 300
           LS YLSHGH+ILSLD M++ +VDDW+NLS QISE TFQN T SRLGEN++MA+GR+GYCV
Sbjct: 241 LSGYLSHGHLILSLDGMHIQNVDDWVNLSVQISESTFQNGTLSRLGENDQMANGRKGYCV 300

Query: 301 PNFMLKESNKVQFTHDQSACFGDLTSFTSIPCVSSTALIDGYTEDNNSNLKEGIYCLNVS 360
           PNFMLKESNKVQF+HDQS CFGDLTSFTSIPCVSST L+DG  +D++ N KEGI+CLNV+
Sbjct: 301 PNFMLKESNKVQFSHDQSTCFGDLTSFTSIPCVSSTVLVDGDVDDSHYNRKEGIFCLNVN 360

Query: 361 DVIKLNKC-TRWDKAVISDSTSSCMCSQDETCLGPVQMPGLVWVEITYLNPHSSDCSYSR 420
           DVIKLNKC + WDKA+I+DSTS+CMCSQDETCL PVQMPG +WVEITYLNPHSSDC YSR
Sbjct: 361 DVIKLNKCISGWDKAIINDSTSTCMCSQDETCLSPVQMPGSIWVEITYLNPHSSDCFYSR 420

Query: 421 EYPLPSSNCSGTFIFVGDVVSMAHSIQLTMYRPRLDFCFAAYLPDVLERILLCLFHTSLA 480
           E PLPSSNCSGTFIFVGDVVSMAHSIQLTMY+PRLDF +A YLPDVLERI LCLFH SLA
Sbjct: 421 ENPLPSSNCSGTFIFVGDVVSMAHSIQLTMYQPRLDFHYARYLPDVLERIFLCLFHASLA 480

Query: 481 LALLNSLPVYYLDGESILEIIIFQLTSMSPWNKEKVLRSFLMGGTLISIFLLLRIFFHLF 540
           LALLNSLPVYYLDGESILEIIIFQLTSMSP NKEKVLR+FLMGGTL+SIFLLLRIFFH+F
Sbjct: 481 LALLNSLPVYYLDGESILEIIIFQLTSMSPRNKEKVLRAFLMGGTLMSIFLLLRIFFHVF 540

BLAST of ClCG01G000040 vs. TAIR 10
Match: AT4G20310.2 (Peptidase M50 family protein )

HSP 1 Score: 367.1 bits (941), Expect = 2.5e-101
Identity = 217/508 (42.72%), Postives = 295/508 (58.07%), Query Frame = 0

Query: 29  TSRKGLSNAISCWYCDCKITSFNELIFQFGRRHARVLRTWFSIGIGFSLAALAVVVTVLF 88
           T  + + N  SC YCD KI++FNE IF+ GRR + VL+ WFSIG+GF +A+L ++VTV  
Sbjct: 21  TGGENIENEASCCYCDLKISNFNEPIFRLGRRFSGVLKVWFSIGLGFGVASL-ILVTVFL 80

Query: 89  WELATAMHIFGNVIHGLPISYSSLFGLPPLISSCSFSPADAGCIIVSTLISVAFHEFGHA 148
                +  +F N +       S++FG  P   S   S +    ++VST+I+V+ HE GHA
Sbjct: 81  LLQFHSNPLFSNRL------TSAVFGFSP---STRVSLSGIAYVLVSTVITVSVHELGHA 140

Query: 149 AAAASEGVKLEYVAVFIALLFPGALVAFNHDALQDSSCFSALRIYCAGIWHNTVLSAASG 208
            AAASEG+++EY+AVFIA +FPG LVAF++D LQ    F+ALRIYCAGIWHN V  A   
Sbjct: 141 LAAASEGIQMEYIAVFIAAIFPGGLVAFDNDVLQSLPSFNALRIYCAGIWHNAVFCALCV 200

Query: 209 LILFFLPLILFPLYIHGQSPMVLDVPYTSPLSSYLSHGHVILSLDDMNVHSVDDWINLST 268
             LF LP++L P Y HG+S  V+DVP  SPL  YLS G VI+SLD + VH   +W+ L+ 
Sbjct: 201 FALFLLPVMLSPFYKHGESLTVVDVPSVSPLFGYLSPGDVIVSLDGIQVHKPSEWLELAA 260

Query: 269 QISELTFQNETPS-RLGENNRMADGRRGYCVPNFMLKESNKVQFTHDQSACFGDLTSFTS 328
            + +   +    S  LG + R   G +GYCVP  +++E  K +   +Q  C GDLT+F +
Sbjct: 261 ILDKENSKTSNGSLYLGGSRRFHHG-KGYCVPISLIEEGYKGKMVENQFVCPGDLTAFRT 320

Query: 329 IPCVSSTALIDGYTEDNNSNLKEGIYCLNVSDVIKLNKC-TRWDKAVISDSTSSCMCSQD 388
           +PC             +N+ ++E   CL+  D++KL KC   W     +D+ S C+C Q 
Sbjct: 321 MPC-------------SNAAIREVSVCLDAKDIVKLQKCGDGWVTTSDTDNQSDCVCPQG 380

Query: 389 ETCLGPVQMPGLVWVEITYLNPHSSDCSYSREYPLPSSNCSGTFIFVGDVVSMAHSIQLT 448
           + CL  +Q PG++W EITY    S DCS                            + LT
Sbjct: 381 DLCLQAMQSPGVLWTEITYKRTSSQDCS--------------------------RLVHLT 440

Query: 449 MYRPRLDF-CFAAYLPDVLERILLCLFHTSLALALLNSLPVYYLDGESILEIIIFQLTSM 508
            Y+PR  F  F    P++LER L C FH SLAL LLNSLPVYYLDGESILE  +   T +
Sbjct: 441 AYQPRWLFNFFGKSFPNILERSLTCTFHVSLALVLLNSLPVYYLDGESILESSLQSFTWL 478

Query: 509 SPWNKEKVLRSFLMGGTLISIFLLLRIF 534
           SP  K+K L+  L+GG+L+S     RIF
Sbjct: 501 SPRKKKKALQVCLVGGSLLSFLAFFRIF 478

BLAST of ClCG01G000040 vs. TAIR 10
Match: AT4G20310.3 (Peptidase M50 family protein )

HSP 1 Score: 318.5 bits (815), Expect = 1.0e-86
Identity = 175/379 (46.17%), Postives = 234/379 (61.74%), Query Frame = 0

Query: 158 LEYVAVFIALLFPGALVAFNHDALQDSSCFSALRIYCAGIWHNTVLSAASGLILFFLPLI 217
           +EY+AVFIA +FPG LVAF++D LQ    F+ALRIYCAGIWHN V  A     LF LP++
Sbjct: 1   MEYIAVFIAAIFPGGLVAFDNDVLQSLPSFNALRIYCAGIWHNAVFCALCVFALFLLPVM 60

Query: 218 LFPLYIHGQSPMVLDVPYTSPLSSYLSHGHVILSLDDMNVHSVDDWINLSTQISELTFQN 277
           L P Y HG+S  V+DVP  SPL  YLS G VI+SLD + VH   +W+ L+  + +   + 
Sbjct: 61  LSPFYKHGESLTVVDVPSVSPLFGYLSPGDVIVSLDGIQVHKPSEWLELAAILDKENSKT 120

Query: 278 ETPS-RLGENNRMADGRRGYCVPNFMLKESNKVQFTHDQSACFGDLTSFTSIPCVSSTAL 337
              S  LG + R   G +GYCVP  +++E  K +   +Q  C GDLT+F ++PC      
Sbjct: 121 SNGSLYLGGSRRFHHG-KGYCVPISLIEEGYKGKMVENQFVCPGDLTAFRTMPC------ 180

Query: 338 IDGYTEDNNSNLKEGIYCLNVSDVIKLNKC-TRWDKAVISDSTSSCMCSQDETCLGPVQM 397
                  +N+ ++E   CL+  D++KL KC   W     +D+ S C+C Q + CL  +Q 
Sbjct: 181 -------SNAAIREVSVCLDAKDIVKLQKCGDGWVTTSDTDNQSDCVCPQGDLCLQAMQS 240

Query: 398 PGLVWVEITYLNPHSSDCSYSREYPLPSSNCSGTFIFVGDVVSMAHSIQLTMYRPRLDF- 457
           PG++W EITY    S DCS        +SNC GTF+FVGD+++M+HS+ LT Y+PR  F 
Sbjct: 241 PGVLWTEITYKRTSSQDCS-RLGLDFNTSNCLGTFVFVGDLIAMSHSVHLTAYQPRWLFN 300

Query: 458 CFAAYLPDVLERILLCLFHTSLALALLNSLPVYYLDGESILEIIIFQLTSMSPWNKEKVL 517
            F    P++LER L C FH SLAL LLNSLPVYYLDGESILE  +   T +SP  K+K L
Sbjct: 301 FFGKSFPNILERSLTCTFHVSLALVLLNSLPVYYLDGESILESSLQSFTWLSPRKKKKAL 360

Query: 518 RSFLMGGTLISIFLLLRIF 534
           +  L+GG+L+S     RIF
Sbjct: 361 QVCLVGGSLLSFLAFFRIF 364

BLAST of ClCG01G000040 vs. TAIR 10
Match: AT4G20310.1 (Peptidase M50 family protein )

HSP 1 Score: 266.9 bits (681), Expect = 3.5e-71
Identity = 153/359 (42.62%), Postives = 216/359 (60.17%), Query Frame = 0

Query: 29  TSRKGLSNAISCWYCDCKITSFNELIFQFGRRHARVLRTWFSIGIGFSLAALAVVVTVLF 88
           T  + + N  SC YCD KI++FNE IF+ GRR + VL+ WFSIG+GF +A+L ++VTV  
Sbjct: 21  TGGENIENEASCCYCDLKISNFNEPIFRLGRRFSGVLKVWFSIGLGFGVASL-ILVTVFL 80

Query: 89  WELATAMHIFGNVIHGLPISYSSLFGLPPLISSCSFSPADAGCIIVSTLISVAFHEFGHA 148
                +  +F N +       S++FG  P   S   S +    ++VST+I+V+ HE GHA
Sbjct: 81  LLQFHSNPLFSNRL------TSAVFGFSP---STRVSLSGIAYVLVSTVITVSVHELGHA 140

Query: 149 AAAASEGVKLEYVAVFIALLFPGALVAFNHDALQDSSCFSALRIYCAGIWHNTVLSAASG 208
            AAASEG+++EY+AVFIA +FPG LVAF++D LQ    F+ALRIYCAGIWHN V  A   
Sbjct: 141 LAAASEGIQMEYIAVFIAAIFPGGLVAFDNDVLQSLPSFNALRIYCAGIWHNAVFCALCV 200

Query: 209 LILFFLPLILFPLYIHGQSPMVLDVPYTSPLSSYLSHGHVILSLDDMNVHSVDDWINLST 268
             LF LP++L P Y HG+S  V+DVP  SPL  YLS G VI+SLD + VH   +W+ L+ 
Sbjct: 201 FALFLLPVMLSPFYKHGESLTVVDVPSVSPLFGYLSPGDVIVSLDGIQVHKPSEWLELAA 260

Query: 269 QISELTFQNETPS-RLGENNRMADGRRGYCVPNFMLKESNKVQFTHDQSACFGDLTSFTS 328
            + +   +    S  LG + R   G +GYCVP  +++E  K +   +Q  C GDLT+F +
Sbjct: 261 ILDKENSKTSNGSLYLGGSRRFHHG-KGYCVPISLIEEGYKGKMVENQFVCPGDLTAFRT 320

Query: 329 IPCVSSTALIDGYTEDNNSNLKEGIYCLNVSDVIKLNKC-TRWDKAVISDSTSSCMCSQ 386
           +PC             +N+ ++E   CL+  D++KL KC   W     +D+ S C+C Q
Sbjct: 321 MPC-------------SNAAIREVSVCLDAKDIVKLQKCGDGWVTTSDTDNQSDCVCPQ 355

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_004145951.19.5e-27389.28membrane-bound transcription factor site-2 protease homolog isoform X1 [Cucumis ... [more]
XP_008437652.13.1e-27188.54PREDICTED: membrane-bound transcription factor site-2 protease homolog isoform X... [more]
XP_011654589.14.9e-26988.72membrane-bound transcription factor site-2 protease homolog isoform X2 [Cucumis ... [more]
XP_008437654.11.6e-26787.99PREDICTED: membrane-bound transcription factor site-2 protease homolog isoform X... [more]
XP_022957694.18.6e-26685.61membrane-bound transcription factor site-2 protease homolog [Cucurbita moschata][more]
Match NameE-valueIdentityDescription
F4JUU55.1e-11245.28Membrane-bound transcription factor site-2 protease homolog OS=Arabidopsis thali... [more]
O434621.3e-2526.64Membrane-bound transcription factor site-2 protease OS=Homo sapiens OX=9606 GN=M... [more]
Q5RAC82.1e-2526.54Membrane-bound transcription factor site-2 protease OS=Pongo abelii OX=9601 GN=M... [more]
O548623.7e-2526.41Membrane-bound transcription factor site-2 protease OS=Cricetulus griseus OX=100... [more]
Q8CHX63.7e-2526.26Membrane-bound transcription factor site-2 protease OS=Mus musculus OX=10090 GN=... [more]
Match NameE-valueIdentityDescription
A0A0A0KQ254.6e-27389.28Endopeptidase S2P OS=Cucumis sativus OX=3659 GN=Csa_5G139060 PE=4 SV=1[more]
A0A1S3AV461.5e-27188.54Endopeptidase S2P OS=Cucumis melo OX=3656 GN=LOC103482997 PE=4 SV=1[more]
A0A1S3AUN77.6e-26887.99Endopeptidase S2P OS=Cucumis melo OX=3656 GN=LOC103482997 PE=4 SV=1[more]
A0A6J1H2Q04.2e-26685.61Endopeptidase S2P OS=Cucurbita moschata OX=3662 GN=LOC111459160 PE=4 SV=1[more]
A0A6J1JYM65.6e-26384.50Endopeptidase S2P OS=Cucurbita maxima OX=3661 GN=LOC111490932 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT4G20310.22.5e-10142.72Peptidase M50 family protein [more]
AT4G20310.31.0e-8646.17Peptidase M50 family protein [more]
AT4G20310.13.5e-7142.62Peptidase M50 family protein [more]
InterPro
Analysis Name: InterPro Annotations of Watermelon (Charleston Gray) v2.5
Date Performed: 2022-01-31
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR001193Membrane-bound transcription factor site-2 proteasePRINTSPR01000SREBPS2PTASEcoord: 140..154
score: 47.5
coord: 156..171
score: 30.47
IPR001193Membrane-bound transcription factor site-2 proteasePANTHERPTHR13325PROTEASE M50 MEMBRANE-BOUND TRANSCRIPTION FACTOR SITE 2 PROTEASEcoord: 32..537
IPR008915Peptidase M50PFAMPF02163Peptidase_M50coord: 132..513
e-value: 7.0E-13
score: 48.4

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
ClCG01G000040.2ClCG01G000040.2mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0071475 cellular hyperosmotic salinity response
biological_process GO:0031293 membrane protein intracellular domain proteolysis
biological_process GO:0051091 positive regulation of DNA-binding transcription factor activity
biological_process GO:1990440 positive regulation of transcription from RNA polymerase II promoter in response to endoplasmic reticulum stress
biological_process GO:1900457 regulation of brassinosteroid mediated signaling pathway
biological_process GO:1905897 regulation of response to endoplasmic reticulum stress
biological_process GO:0006508 proteolysis
cellular_component GO:0000139 Golgi membrane
cellular_component GO:0016021 integral component of membrane
cellular_component GO:0016020 membrane
molecular_function GO:0004222 metalloendopeptidase activity