Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonfive_prime_UTRCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
AAGAATTTGTTATAAATTTGGAAAAGATTATTAGCATATTAACTACCTTTTCTCTTAATCATTTTTTTCCCCAAATCATTTATAAAGCAGAGATAAACCCATGGATTATGACGACAATGACTTTCAAAGCCAGAATCTTCATTTAGCTGGTGAAGGAAGTGCCAAATTTCCTCCTGTTCTTCGACAATATGCTCTCCCGAAGTTTGATTTTGATGACACCCTTCAGGGGCATGTAAGATTTGATAGTTTGGTTGAACCTGAGGTGTTCTTAGGTATTGAGAATAGCGAAGATAATCAGTGGATTGAAGATTATTCCCGTGGAAGTAGTGGGATAGGATTTACTTCTTGTGCAGCAGAATCTTGTTCTATATTGAGGCGGAAGAATGTTTGGTCTGAGGCCACTTCCTCTGAATCTGTTGAAATGTTATTGAAATCTGTTGGGCAGGAAGATATCAATCTGGCACCAGCTGTCACCGGCGAGTCAAATGCTCGTGAAAAATTGGACTACTTAACAAACCCAATGGACCCTACTTTAAAAGATGACGGTAGTAGTTTCTGTGAAATGGGTGATTTACAGCCTACATTGCTATCAAATATAAGTCTTGAGGAATTGCATGTTGTTAATGAGGACATAAGAGGGGAGCAACAACCTCAAGGGGATGATCCTACTGAATTTCAAGAGATTTGTACTGTTGATAGAAGTTTGGGTGAGGTTGATCCTGATGTTGCCCATGAACTTGTAGATATGCCAGCAAGTGAGGGAAGTTCAGGTATTGATGAAAACAGTAAACAAACATGTGCAAGTACAATTAATACTTCAGTTTCCATATCAATGGAAGATAAAGGGCAAGATGATTTTTCAGCTTCAGGAAAGCATATAAATGATTTGGTTACCTGCACGCAAGAAGGTAGTGGAAAGTTGAGCAGTCAGAAGATTGAACAACAAATAAAAGATTTATCCGAGAATCCTGTTAATACATATATTGGGAATATTGAACAAGTGGTCAATTCACACGAGTTGAATAAAGAAGACCAAAACCGTCTTTTATCCCCCTCAGTTCCTGCAGACAGATTGGTCATTGAATCTAGTATTGCTACTTTGCAGTCCCATGCAAGTATGACCTTGAAGGGGGATTGTGTGTTCCATTCAGGTAGTGGAGAGGTTACACCCGAAGTTCCTTCTGAAACTGACAAGTTTGATGATAAGGTCTTGTGTTCAAATGTGGAGATTGGGAATCCGTCTAAAACAAACATGCACGAGGTATTACCTACAGTTGTTAAAGGTGATGCTAGAGCTGTGGTGTGTGCAGGCGAGGGGAAAAACATTAATGCAGAAGTTTGTGCCTTTCAAGGGCCTAAGATTGATTCAGTTGGGCAGATGGCTTGTGCACAAGAAATAATTAGTGTAGATCAGCAACGCTTTCCCTCGGGTATTGAGATACAAACTAGTAAGTCCGAGTCTTCTGCATCTGCTATGGAGAAAAGTAATGCCTCTAAGGTTGGTGAAAGCAGCAGTGGTCATATCAGAGATATTCCAGATAAATTTACAAAGGACGAACATGGTATGATCAGTTTGAGAGATGTTCGTGGTTGCACACTTCCCATAGAAAAAAATCTGTATTCCGAAGGCCATCTGCCACCTACTACTGTGGCTGAATCAACACAATTATGCGAGGAAAATAAATTGTGCCAGTCAGGTAATGACCATGTCACACATGCCAGTTGCAAGGAAGAAGTGAGGTTGTCTTCTGATTCTATTAGTGTGAATGGCAAGTTTGCCGAGTCTCCTGTCAGAGATAAGAGAATTGTATCCTTGTCTTTTCAAGAGAGTGATGTAGAAAGTGGGATGATAGATACGAAAGTAGAGTACAGTGCTAATGCTGGTGATGAATCAGGTATGTGCTGTTTGAATGTAATTTTTGGGTATGTTCTATGTCATGTAGTTGTGCTTCACTTTTGGCCAATTGGAGAAGTTTATTCTAACTTTGGATTGGTGTTTTCTTCTCCTCCTTTTGCAAATTCATACATCAACAAAATTGTTTCTTATATATAAAATCGTATATGCATATTGAGCCAGTTACTTTTCTTTCTCTTTTTTCCTTTCATTGCAGTGTCAGTTTCTACCTTTGGGGATGCCAATGTGAGAACATGTGACACATTACAGGGCGACTCCTTACCCGTAGTTGATGCTTTGACAGACAGAAAAGATGCTGATGAAAAAGAGGACCAGTTGCAACCTGGTGTGGTGGAGTTTACTCAATCAGATAGCAAGGAAGAAAGTGGTGTGATAATTCCTGCTGAAGGAAGTTTTCCTCTGTCGGATACTTCTCAACCTGTGGGGAAATTTCATCCCCTTTCTGAAGCTGAAAAATCTGCGTGTCTCCTTACTGGTCAGGGATTTGGTGAAAGTATTGATCAAACTATTTCAAAGAATTTGAATTCTGATGACTGCAACAGAGAAAGCCAATCTATACCCCAAGCTGACATTCCTAATAATGTTATCCAAGACTGCGGACAGGAAATGGACATTGATCCAGCCTTTTCAAAGTCATCTGCAAAAGCATGTGATAGTGGTGTTAAAAAGTCAGGTGTGCCAGCATGGTTTTTCAATTAGTTCTAACTTTTAAGTGTCTTTGTGAAATTAATTTTTTAGATTTCTTTATTACGAACTGCAGCGAATTTAATATTTTTCATTAATTTATTTCTGCCCAAGTCTTCCTACATTTTGTATTAAAGGAAAGAATGAGAATGGCTTGAAGTAAAAGTCATGTCTTGAGCATCTCAATTAAATCATAACTGGATATGCTTGACTTAATAAGTAAGAACCAATATGTTTCCTCGTTGAGTTAGATTTACTTTTTTTCTTTAAAGCTTGTTACTCATCTCATTGTTTAGCAGAGATATTTTTTGTCAAGGCTTGTGAACTCTAGTCTGAAAACCTGGATGCATTCCTATGGTGCTGTATTGCATTCTTCAGCTTTCTATGGATTCAAAATTGCACAACTAAATAATCAAAGACGAGAGAAGTGCATAATGTGTTATCGCGTGATTAGTATGTATCTTAGCCTTAGTGCGTGGCAGCCACATTCTTCTTCTGCTGTCAATTTTCATTATGGATAATTACAAACTAATGATGGCCTTGCCGATGCAACGAGGATTTCAAGTCACCTGTCGAACTCTGCTTATTGCAAAGGCTTTTTTTTCTTTGTTTTTGTTTTTGTTTTTTTTTTTTTATAAATGAAAATGTGTTGTGCTGCCAGTAGTTATCTGATTGTCCTTCTGTTTAGAATTCTTCATTCATGACATCTGCATTTATTAATAATGTTGGTTTGGATCGCTTAACTTGCATAATTATGTCAGTGGATCTTTTCAACGTTTCTTACAATCAAGTTGTTAAATGATTTTTTCTTTAGATGAAAAATCTTTTCCGCCCGATGCCACGTCTTTAACACCACTTCCAGGAGAAACACTTGATAATTATCAGAAAGATCAGGAAAGTACTAAAGTCGTTTCAGAATCTGTGGGAAATAATTGTCAGCAGGCCATTGCAGTGAACATTGACAGTGAGTAACCAAATTATACAACTTCTAAATGGATGTGAAATAATATGTAATGCTAAATTATTATTATTATTGTTTTTTGTTCTTGAAGTTTGAATGTTGAGTTTGGTGGCAAATTTCCTTTTTCAGCTAGGAGCTTCTGTTTGTTGCCTGTTTATTTCTTGGTTGATTTTGTGAAACTTCTTTAGCATTATAGTCCTCTGGTTCTGATATGTTATATTCGGCCATAAAAATTTGTCTTTCATACTTCCCACTAGGATTTCTATGCTGATATCTTGTAGCATGTTATATGTCTTCATTCTTTTTGTTTGTTTGTTTGTTTTTTTTCTTTTTTTTGAGTTCACAATACTGATACCCTTCTCTATCCATTTTATCTATTCTTCATTTCTTCTTCAAGTTCAACAATATGTGGGGATAAAGATTCATACTTTTGACTTGTTGAGGTTATTATTATTATTATTATTTTGATAAGAAACATAACTTTCAGTATCCAAAGAGAGAACTAAAAAAGGAGGGCAAGATGAGATATCTCCATGCACCCACGAGTTACAAAAAAGAAGCCTAACTAGTGTAAATCTACGCTGAGCTATAATTGCAAAAAGACTTTGAAGTAGAAGACCATGTAGAAGCAAAGAGTGATCGACTCGGGCTTTTTTAGGCTTTAATACTTGAATTGATTGATGATGGTGAGGTGGCTCACAGTTGGACTGAATGGCGACGTTGTCGTTGTTGTACTGTATGACAGTCGACAAATGAGAGAGGGGTTGTCAAAGAGAGTGAGAAAGGGAGAAAGATGGATTGAAGAGAGATGGAGAGGGAGAGAGAAGTTGTTGAAGAGAGAGGGAGGGGAAGAGAGAGAAGAAATGAGAGAAGAAAGTGGGTGGTGGTTGAAACATGAAAAGAAACAAACCCCAACTGATATATATGATAAACTAGGATTGTATTTTTTGAACTTGTGAACTGAATTAAACAGAAATTTTCAACCCAAACCAAATCAAGAATTTGGCTAACCCAACCCGACCTTTGTTGATTTTGGCCGGGTTGTTTAGGTTTCTTGGGTTAGATTTTTTTGAACACCCCTACTATTTTATCTATCTTAATCAGAGTTTTCTCCAAGATGTGTTGTTGTATCTTTACCTCTCTCTAATGCACCTTGGGAAGAGACTTTATAAAGAGAGGAGCATGTTAAAAAAATAGAGAGCTTGTTTAGAGAGGAGAGATGTTGAAATATATACACATATATATTGAAGAGAAAAGAATGCGGTAGAGAGATGTAGTCCCACTGAGGGAGGAATGTGATAGAGAGGCACACATTATTTTAAAGTAAGGGATACGTTTCCCTCATCCAGGGGAATTTTTACAATGGTCAATCAAAATACATAACTTTTACAATTATGCAAGATGGAGGGTTTAGTTGTCAAAATTAAGAGTTTGCAGTGCAACAGTAACAGTCATAGTTTAGGAGTTACCAATGCTATCCTTTTAAATATAACAATGACAAGCAAGCATTAAACTTTCTTACAATTTCCATTCCTTCCGAGATTGGGACAAGTGTGATTAACAATAACTCTACACCATCTATGATGTCAGACACCAATCTAGCTACAAGTATCAAACAAGATTTTCCTTAATCAATTAATTAAAAGCTTTTGATCATGGAATATTAAATAATCTACTTTTATAGTTAATATTATTGTTTTTACGACCTTACTTGAATAGGATTGTTCAATAGATTGTACAATTGTGCCCTTAAGTTCAATATTATAGTTTTTTTTTTTTTTTTTTTAATTATTGTTTTAAAAAATGATGAGATCCTAGCCAATGGATTACATAAAACTCTTCGAATTGGTCACCAAAGTAGTAAAGCTATGAACAAGGCCCTCGATTTACAACACAAAAGGCACAAGAAAAATAGAAGTGAGAAAAGATTGGGAATCTTGTTCTTTATCTATGAAAATACGAGAGTTGCGCTCCAGCTAGATAGACCAGAAAAACTCGAATGAAGTTTAGCCAAAGGATCTTCCTTCCTTTCTTGAATGGGTCCATTGAACACAAATTTATAGTTCTAATTTTTATTGTTTAATGTGATATAGTTTTAAATTTTCTTGAATTTAAAGTTTTGGGATAATTGCTAAAACAAGAACTTCGAGGGTTTAGCTTTAGTATCTCTTATTGTTTGAGGGTTTAATTTCAACAAGTATTATGGATCCTGGGTGACTTGTGCATCTTCAAAATTCTTCGGAATTATAACTTTTGGGCTTGTATTCCTTTCATTCTTTCTCAATGAAAGTTCAGTTTCTTACCAAAAAATAATTTGAGTTTGATTCAAAATGTGATCTGCCAGTACCCATGGAGCTCCTAGCAGGACTCTAGACGATGGATTATTTGATTATTCACAATAGTGGGTGTTTCATAGTAAGCTCTCTCACTCCTCATCTCAATGAAAGAAGAGCTACTGTGCATCTTCCGAGTCCCTGACTAAGTGATTTTGAAGGCCAAGGTTTCTAAAAGGTGAAAGTGTTCATCTAGATCTTTATTTAGGAAAATTTGAACACTAGTGATTACATCCAAACGAAGTTTCCCAATTGATTGGTGTCTTCAGGCTGGTGCATCATGTTGAAAGGCATCTTCACCATTTTATTTTATTTCTATTTCTATTTCTACTTCTATTTTTAGTTGGTCTAAGCTCCTTTTCTTCTTGGGCTTGGGTTGGACCTTCCCTAATTGGTGTGAAGCTATCCTTTCCCAGTTGTGAGAATGATCCTTAAGGTCAAAATTCATCTTGTGGTCCAATCAAAAGAGTTCTCAATGTGGTGGAGTTTAATTGCCAACATGAGCACAACTCAATAGTCAAGACACCACTTTCTTCTCTGAGGTTAGAAGTTCGAATTGCCACCCCCACACAACGGGTAGAGACTTGGAATATCCTGAGAATTTCTAGACGCTTGTAAATTTTGTGCTTCAACTTGGTGTGCTCCCTCACAAGGACATATGTGATCTACCAATGGGAAATCCTAACCCTTTGGTTTTGTGGGATATCTCATTCTCGAGTATTTTTGAGAAATTGCAAAATTTATTTTGAAACCGTATCTAATGTCATAGGTTATTATAGGGTAGCCTCCATATGGCTGCCTCATTCAATTTTTTCCCCCTAGTTCTCCTCCTTTTGTGTGGGCTTCTTTCTTCAATAAGAAATCAATTTCTTATCCAATGATGATGACGACGACGACGACGACGACAATGATGATAATAATAAAAACAAAATAGTTATTATTATTATAGGGATTGATTGCAATGATGACATAACTTTCAAAGTAATTACTCAAATTGAAAGTTGGAAAATTATTTTAAATGACAAAATTGTTGAAGATATTTACAAACACTAGCAAAATATGAGAGTCTATCTGTGATAGATCACGATAAACTACTATCTGTGTCTATCATAACATAGATAGACACAGATAACAATCCTGTATTTGAAAACAACCTTTAAAAGATTGAGATGTGATTGTAACACCCCCGATGATATCTATCTCTCTCCTTTTATTTATTTATTTTTTATCTTTTATATTCGTAAGTGTTCGAGCGAGCTTACACACACCTCGACTGATTTCACGGGACAACCCACTTGATTCTACAACATTTGGGTGTCAAAGAAAATCGTAGGATATTAATTCCTAGGTAGGTGGCCACCATGGATTGAACCCATGATCTTTTAACCATTTATTGAAACGATGTCTCCTTTTCTACCACTAGGCCAACCCATGATAGTTTCTATCTCTCTCCAACTACTAGTGTTCTTATACTGATTGCTTGTATTCTCATTTTTATCGAGAAAATCACCATTTTTGACCAACAAAAAATGTAACTAAGATCAAAATTATCAAATTGAACCTTATACTCTTGCATATTCAGATCATCATTTTTATCATTTTGTACAGAAATTGTGGATTGCTTCTGTTAAATGCTCTTTATGATATGAATTACAGTGTTCTGCATGGAAAAGAACTTGCGTTTAGCAGACTACGTTTTGAGTATATGTAGAAATATCAAAAAGGCATTTAGTCTCTCCTCAAGATGAAGTGAAATTAACTGATTCTTGTTTTTGTGGTGCGGTGTTGGGAATGTTATTGAATGTCTCATATATAATTTCCAAATTTTCTTAAAACTTGATTTTTCTCCTATTACTCAATTGCAGGTGATGCTGGCAAAAAGGAAGGTTCTTTATGCTCTGCTGATTTTCCGCAATCTCATGAACAGATGTCTGTCGTGGGGAATGGTAATTCTACTGCTGATAAGCCTCCTCCTAATTTACCTGATGTTGTCAAAACTACTGTGGTTGCTCATGATCCAGATGTAAAGGATTGTAATATGAGACCAGCAAGTAAGAATGTTGAAGCTGCAGAGGCCAAGGATAGGCTGGTTGGCAACACGTCATCTGGTTCCCAGTTGCCCAAAGGAAATGTTGCCTCTGAAAGTGAAACAGCTCTTACCTTTGAGTCCAGTTCATTGGTAGATCTGCCAAAAAATGATTCTGGCATTGCAGTTGCAACAGCAGCTACTGCTTCTTTGGTGATTCTTTCCTTTCATGTCATTGGTTCCTATTAGATTTTATATTATTTTTCCTTCATATTATCTTCCTGCACACGGGAGAACTTTTCTCTCTGTAAAAAAAATCTTTATTTATTTATTTTCTTTTCTTTTTTGATAAATAATTAAACTTTCATTGAGAAGAATACAACACATTGCACTCTTTATATCATGGATTATTTTAATTGAAGGTTGCGGAGGGACCCCAATCATCTTCTGGCCTATCCAAATTGGACATCAAGAGTGCTCAGGATATTTCACACAGTAGCCCTCATGTGTCTGAGGTAAAAGTTGCACGTGCTCGTTCCAAGGGCACTCCTGAGCGTAAACCTAGGCGTGCTTCTGCAAAAGGATTGGGGAAAGAGTCATCTACAAAGGGAAATCACACGAAGAAATCAGAGAAAGTTGAGAAATCGAACAGTACGCCCATAAGCAATCCTGGAATTTTCCAGCTTGCACAGTCTAATGAGATGCAACATCATGGACATGTGGAGTCCAGTGGTGCAAAACCATTTGTCTTTATTGGTGCTTCGACTTCTAGCATTCCAGACTTGAACAATTCTGCCTCCCCATCCCCAATGTTTCAACAGCCTTTCACAGACTTGCAACAAGTTCAATTGCGTGCTCAAATATTTGTTTATGGAGCTTTGATGTGAGTCCTTCATTTTATGTTTCTTGTCCTTAGAACAACTTAACGAAACTATAGATTTCTCTTCAGTTGCGTGTTGTTGTTCAGAGGTAGCAGTGTTAGAACTGAAAGTCATTGCAATTTTCCACCATTCATTACCGTAGAAAAATTTAAGAGAAAGTCTTGTCCCTTTTTTTTCTTTCTTTTTTTATATCCGTGAGTGTCCGGGCCAGCTTACGTGCACCTTGACTAATCTCACGGGACAACCCGCCTGACCCTACAACATTTGGGTGTTAAGGAAACTTATAGGGAATTAATTCCTAGGTAGGTGGTTACCATGGATTGAACCCTTGACCTCTTAGTCATTCTTGTCCCATTAAAAAAGGTAGAAAGCAAAAAGGCACTTCATCCTTTTGAATAAGGCACGGAAAATTTCTCTAGTAGTTTGTTTAGTTGAGCTCCTCTTTCCTTATTATTTTTTTTTAATATTTTTGGCCAAAGAAAACAATTGGGCTCCTTTCTCCCAAGTGATTTTTATGGATTTAATCATTGCTGCAGTCAAGGTACAGCACCAGATGAGGCGTACATGTTGTCTGCATTTGGAGGACCAGGTAAAATCTCCTCCCTATCATTCATTATGGTTCACCATATGGATGCCACTTTTTTGTTTTTGTTTTTTTTGTTCTTTTTTTTTTTGGTTTTTTATTTTATTTGTGTGATTCTTACATTCAGATGGTGGAACAAACCTTTGGGAGAATGCCTGGCGTATGTGTGTAGACAGGCTTAATGGAAAAAAATCTCAGCCTACTAATCCAGAGACACCTTCACAATCACAATCTGGTAATTTGAGGCATTTTATACATTTCTAAGCCTCTCTCTCTCTCTCTCTCTCTCTCATTTTCAATTTGAAAATTTGTTTTGCAGGTGGTAGAAGTACTGAGCAAGCAAATAAACAAAGTACACTGCAAAGTAAAATTACATCTCCTCCAGTTAGTCGAGTTAGCAGTAAGAGTACATCAACAGTGTTAAATCCTATGATCCCTCTTTCCTCACCACTCTGGAGTATTTCCACACCTTCTAATGCTCTGCAATCTAGTATTGTGCCTCGAAGTCCAGTTATAGACTACCAACAAGCACTTACTCCATTGCATCCGTATCAGACTCCTCCTGTCAGGAACTTTATTGGTCATAATCTTTCATGGTTTTCTCAGGCTCCATTCCATAGTACCTGGGTTGCTACTCAGACTTCGACACCCGACTCCAGTGCACGATTTTCTGGTTTGCCAATTACTGAGCCTGTTCATTTAACACCAGTAAAGGAATCATCTGTGTCACAATCCTCTGCCGTGAAGCCCTCTGGTTCCATGGTTCACGGTGGAACTCCTGGCAATGTATTTACTGGAGCCTCCCCCCTGCTTGAGTTAAAAAAAGTGTCAGTAACCACAGGGCAAAATTCCACTGAATCAAAAATGAGAAGAAGAAAGAAGAATACAGTTTCTGAGGATCCTGGCCTTATAACTATCCAGCAGGTTCAACCTCATTTAAAACCAGTGCCAGCTGTAGTTACAACTACAATTTCTACTCTTGCAACATCTCCATCTGTCCACCCTAAGGCAGCTTCTGAGAATTTGATTTTATCTCCACCTCCATTGTGTCCGACAACTCACGCAAAGAGTGCAGGTCAAGATTTGAGAGGGAGAGCCATGTTTTCAGATGAAACACTCGGTAAGGTTAGGGAGGCTAAGCAACTGGCTGAGGATGCTGCATTGTTTGCATCTGAAGCTGTTAAACACAGTGCAGAAGTGTGGAGTCAATTGGACAGGCAGAAGAATTCAGAATTAGTACCAGACGTTGAAGCTAAACTAGCTTCTGCAGCTATTGCAATAGCAGCAGCTGCTGCTGTCGCAAAGGCTGCGGCTGCTGCCGCCAATGTGGCATCAAATGCTGCATGTCAAGCAAAGTTGATGGCTGATGAGGCAGTTACTTCATCTAGCCATGATGTTCCCTGTCAAAGCAATGAATTTTCTATTCATGGTAGTGCTGTTGGTGTAGGAAAGGCTACTCCAGCCTCCATCTTAAGGGGTGAAGATGGTGGAAATGGTTCCAGCTCTATTATTATTGCCGCAAGGGAGGCAGCCAGAAAGAGGGTTGAAGCAGCATCTGCTGCTTCTAAGCATGCTGAAAATGTGGATGCCATTGTCAAAGCTGCCGAGCTGGCAGCAGCAGCTGTGTCACAAGCTGGAAAGTTAGTTGCTATGGGCGATCCTCTCCCCTTAGGCAAATTAGTGGAAGCTGGTCCAGATGGTTATTGGAAAACGCCTCAAGTATCTTCTGAGCTGGTTATGAGATCAGACGATGTTAATGGAGGACGTTCCAATTCAGCAATTAAGCGTCCTAGAGACGGATCATCAAGTAAGAATGAGATTCAGGTAAGTGGCGGTGCCAAGTCACCAATTCCTGGAGAAATATCTATGGGTTCTGTTGAGAATCATTCCAAATTGGTGGACGGCATCACAAGTTCTGTAGCACCTCGTGAGAAGGATCTGAGAGGACAGAAGGACCAAAATGCTTCTGATCTGACAAAAACCATTGGTGTAGTTCCAGAATCTGAAGTTGGAGAAAGATCCTCCCAGGACGAGTGTGAAAAGGCTAAGGATTTAAGACAAAGCAGCATCAAGGAAGGTTCCCATGTCGAGGTATGAGATTGTTATATATAATTTTACAATTTCTCTTCATTTTTGATCTGTATTCTTTTTGCTTCTTTTGATATGCCGAAGCTATGAGTAGGAGTACCAATGTGAAGTTTAAATGCTTATGTCTATGCACCTATGTTTGCTGATCATTTTTTTAAACAAGAGAAGACGTTTCATTGATTGATCTGAGCTTCTAGGGATATATGTGGTTGATTTGAAGATGGATTGGTTGTTGACTAAATTTGGTTGTATGAGGGGTTATTGGCCTACCACTTATCTTGGTCTTCCATTAGGGGTAACTCTGAAGCAGTCTCTTTTTGGCAGCCGGTTCTTGAAAAGATTCGGCATAAGCTTCATAACTAGAAGTATGAGTTTATTTCGAAAGGTGGTCGATATAGTTTGATTCGAGTTGTGGTATCAAGTATGTTGATATACTATTTGTCCTTTTTCTTTTGGTGGATGGAATCGACAACTTCGAGGAAAAACAAAAGAATATGAGGGCAGAAAAACAAGCCCACCAGAACACCCCTTCTAAAGGAAGGGATTCCAACTAAGTAATATGTTGCCAAGGGAATAATTACAAAAAAGCTTCACTATCGAAGCCCAAAATGAAGGATGAAACCTCACCAAGGACCAAACCTCACATGGGTCCCTACCCCTACCGTGAAACACTCTATTGTTCCCTTCCCCAAATATTCCCTAACACCGCAATCTTATCATCGCACTATTTATCCTTATTTCAAATGCTTACTAAAATCTCAAAGATCTTAGACAAGTTTATGCATGATTCCTTTTGGGAAGTATCTTCGGGGGATGGGGGTGCACATTATATTAATTTGAAGATTACTCAGCTTTCGAAACTTTTGGATGGCCTTGGTATTGGTACTTTTTAGCAGTGGAACTTAGTCTTATTGGCCAAATGGACTTGGCGTTTTCTTCAGGAGCCTCACAGTTTGTGGCAGAATATTATTGATGCTAAATACTACAGTGATGCTTGTGGGCCTGGGTGGCCTCGACCCATCCTATATTCCTATTTGGTTCCTATAAATCTCCTTGGAAATTTATTTGTTCTACAGTGGATTTGGTTGCTAGTCGTGCCAATTGCAGGCTTGGTGATCGTTGTTCTGCTTTGTTATGGAAGGATTCCTCGTTAAGTTGTGGCATTCTTAAGACATACTAATCTTCCATTTTCTGTCGTGGCTGATGTGTGGGTTGCTGATATAGCTGAGGTTTTACGCCTTCGCCGTAATTTGAGTGACTAGGAGTTGATTGAGGGGGCTTCACTTTCTCAATTTTTGTTCATTGTTCGGCTTCAGGATTCTCTTGATATTTGGATTTGGCCTCTTTTACCTTCAGGTTCTTTTACTAGTAAATCTTCATGGATGATTTGGTGGGCACTATTGATCCTTATGATTTTAAGGATAAATATCCAAAGAAGATTAAAAATTTTCTAGGGGAGCTTAGACTTGGAGCTATCAATATGGCTGATCGTTTGCAAAGACCCATGCCTTATATGGTGTTATCTCCTTCTTGGTGTATTATGTGTAAATCCAATGTGGAAACTCCGTGCCATCTTTTCATGCAAGTTCATTTGGTTTTCGTTTTTGGATTACCATGATGGAGGCTTTTGGTTGATCCTTTCCTTTCCTGGATGATTTTTTACAGCTCCTTTCCATTATTTTGGCTGGTCGCCCATTTCATGGTGAAGAGGTCCTATTGCTTGCTCTCAGTTGTGCTTTTCTTGGAACCTTCGGGTGCTAGGAATGGTGGTCTCTTAAGGGAGGTTTCCTTTAATTTTGATCATTTTTTGGATCTTGTTCTCTCTACTGCTGTCTTTCGGTGTAAGAATACACACCCATTTACTCTATAGTTCTTCTTAATCTCCAATTGGAGGCTATTTTTGTAATTCACCTATTGGAGCTTGGAGTTTTTCTCCATATTTCATTTATCAATGAAATTGTTTCTCTTTCAAAAAAAGAAAAAAAGAAAGAGACCTATTTTGCAAAGTATCTCATTACATTCAAGCCATAAGTGCTAAAAAGAAGAGCTTTAGTAGCACAACTGCCCAAAATCCTAGCTTTACCTAGCAAATTCAACTTCCCAAATTCCTTTTTGGCTGTATATCTTTTATTTTGGTGTTATCTTCTCTCTTGTTTGGGCTTCATCTATATTTTTGATGTGTCTCCTTATTGTACTCGTTGGATTGTTTCATTTATCGATGAAATGTTTCCTATCCAAAGAAAAAAACCTAACAAATTCCAGCCTTTAAGATCTTCAAAGAGCCAATCATCAAACCTTTGTGGAAGACGAGTCATAATCTCAAGCTTTTCAAAAATAAAATTCTAGGCCTTAGAGGCAAGTGAACGGTGAAGAAACAAGTGATCAATATCCTCCAACTCCCTCAGGCATAATCTACAACTAGATGGAGACAGCTCGGTTTCTATCTTCTTTGAAGCAAATTGTCAATTTTAAGGCGTCTAAAAGAAATTGGCCAAAGGAAGATTTTAACTTCCTTTAGGCATTTACTTTTTCGTACCAGGTAGTGAGGGAGTATTAGCCTTCGAGGGGTGTTAAAGAGGCTCAAGAAAGTTGACCTTGTAGATAGAGTCCTGAACTTTCAAGGGTCCAAGTTTACCGGTTCGATCACTCACTTAGGTAGACCAAATTCAGATTTTCAATGATCTCCACCCAACTTTCCAACTCTTTATTAAAAAGCCCTCGTAGAAGGCCCAATTTTCAAGATTCATCAGTTTCCCAACACTCCAAAAATGGAAACCCTTTTTTCTTGAAGATGACATAAAGATTAGGAAATAAGACTGAAGGAAAACTTGATTATGCCAAATATCCTCCCAAAAATGGATTTGATTCTCTATACTTGCATTAAATTTAGCGAACTGCACATGGAAAAACACATTTGGCAATGTCCACCCAAGGCTTGTTCCCTTTCTCTTCTTGATTTCTTTCGTCAACCACCCTCAAAAACTAGCACCATGAATGCTTGGATTACTTTCTTTGAAAGGGGATATATTTCTCATGTGAATCTCCAAATCCATTTAATAAGCAACTATTTTGGTGAATGTGAGAGCCAACACCTAGCCCCCCAAGGGTTGTGGTTAGAGAAATCCATTCCCACTTTACTAAGTTTCAACTCAGTTTATACGCCACCATTCCATGTGAAGTTTGTGATAATCTTTTCCCTGGACTTGATCACGTCTTCAGGGCCCTTAGAAGAGAAAAATGATAACTGGGTAGGCTATTAAGAATTGATTGAGACTTTCGCATTTTGAAAGGAGATTTCTCCATTTGCCCATTACCTCTAATTCTGAATATGTTGTATTTAAAGTGGTTGTGCTTTTTTCCCGATATGCTTTGATATAAGCATTCATAGTGTAATTACACCACCCACTTTTTCAACCTAGATTGGATGGTCTTCTTGTGGTTCTTTTGTCTTGGGGATGGGAGGTCTTGTCCCTTTGCCTTTCAGGTTCTCTGTTTTTTGAGGAACGAAGTCTTTTGATGTTTCTTATATATTGAAAATAATAATAAAATAACATTCTTAGTACCTTTAACACAATGAGCTCATCTTTTCATCTTTCCCTTTACTTTCGTGAACACAATTTTACATTTAAAATAGCCCATTGGAAAGGTTTTTTGTGATCCCTGGACAATGTTTTCCCTCTTTTATAGTTTCACTTTATCGATGAGATGAAGGATATATGTTTCTCATCAAAGAAAAACAAATATTCCTTATGACCAATGTTTTCAAAATCGTTTGAGGTGTAGCCTTGGAGCGCGCTTCGAGGTGAGGTGGCATTGTAATACTTCAAGGATTATGCCTAGGTCAGATAAAAAGAGGCTTACACCTTGAAGAATAGAGGCATGTGCCTCTATACTTATGCCTTTGAGGCATGAGAGACGCAAGCCTCAGTGAAAGGTAGGATATAGACACGTTAATTTCAGAAAATTTTAAGAGGTGGGAGGCTGGCGGGACGTGCTTTTGTTTAGGTGGATTTTCTGGGAAAAGATTAACCACTCTACTGATTTAAGCAATCTGACACATTATATACCCTAATAAACAATTTTAATTGCCTCAAATATAAAAAATAACAAGCAGAAAAAGAGGTTATTTTTTATTGTTTTCTCACTTTCGTCCACTCTTTTTTCTTCTTTTTCCGTATTTCTTTTATACATTCATTATTACATATGTTGGCGATCTTTTTTCCTTTTCCCTTTTTCCATCTTTTCTGAGGTCATTGGCATACAATTTTTTCTAATATTTTGTAGTTCAATAAATTTTCAACAATATTAGTCTGAAAGATATTTTTCAAATGCTTTAATTCATTCATTATTGCTATATTTTTTTCATATACTATTTTTTGCAAATTGTTTTCCTTTGCTTTTATACTTTTTATCCTTCCTGTATTCTAGTTCCCTAGTTTTTGAGTATTGGTTTAGTTCTAATTAAAAAATTGTTCAAGTTCAACTATCTAAATGTTGCAATATTCCTCTGGTATGGTTTTACATTTTCATTCTGACTCTTCTATATAAAACGAAGCTCACATCTCATGCTGCCTCAAGGCTTATGCCTCGCCTCTCCATGAGGAAAAAGGCTTGCCTTAAGCTATTTAAAACATTGCTTATGACAGAGGAGAAAGGAGAATTAATATTTTAAAAAATTCTAGCAGATTTTTCTGTTTCCTTTTGAATCCCTTGAGTATGTCATCAGATCAGACTGCTCGAAATTCTCATTAATTACTGATAATTTATTGGTGTCTTCTGACTCTTGATCTTGTGTTAACTTGATTGGTTATGAATTAGGTTTTCAAGGATGGGAATGGATTAAAAGCATCTTGGTTCACAGCCAGTGTGCTGAGTTTGAAGGAAGGGAAAGCTTACGTGTCCTACACTGAACTTCAACCTGAAGAAGGTAATGATAAGCAAAGCACACAGTGTAAGCTGGTGGCCTTATCCTTATGCTTGTGTAACTTAAGATTGACTTAATGAATACTATTTGGTTCTTGCTTGCAAGAGAAGCAGCAGTTAAAACTATTTTTATCAGTTTCTGGTCCTAAACTAGTTCATGAGAGTATACTGTGCTAAAAAAGAGAGTATATTGAGCTTTAAAAACATGTAAAGTTCTTGTCTTCCCTCTGCTGACTGTTTACGTTCTTGATTTGTGCAGGGTCAGGCCAACTAAAGGAGTGGGTTGCTCTTGATGGTCAAGGGGGCATGGCACCAAGAATTCGAATTTCTCGTCCTATGACTAACTTGCGGACTGAAGGAACAAGGAAGCGACGCCGAGCAGCTGCAGGGGACTACATCTGGTCAGTCGGAGACAAAGTTGATGCATGGATGCAGAATAGGTGTGGTTTTCTCTCTCTCCTTTTTTTATCCCTTCCATTGCAAACAATTTTCGTTCTTTGGATGAAGAGTCAATTTCATTGATAAATGAAATAGAGGAGTAGAACTTCAAACACCTTATTGGTGAATTACAAGAAAGATCTCCAACTGGAAATTAAATAAGATAAAGTACAAAGAGCAAAAGGGTGCTCATTTCAAGGAAATATACAGTGAGGAGGGGATGGATATACTCCTGTCCGAAGCTATGTTTTTGGAGCCAAGTTTCCTAAAGAATAGTGCAAATTGCGTTTAAAAGAACTATATCTTTATTGATGACCTCGCAATATTTTCCGGATGATCTTCAAAATCTTGTACGTAACCATGATTATTGAAAAGATTCCATTGCATTCTTAGAGAAAGAGTATCAAACGAAGAGATTTCAGCACTTGCCTCTTTTGCAAGCATCAGTTTTGAAGAAAGTAGTGGACATTTCTTTTTTTTTTTTTTTTTTTTTTTTTTTTTAAAAAAAGGGAAAAAAAAAAGAAAGAAAAAAGAAAAAAAGAAAAAAAAAGAGGAGTAGTGGAAGGGTTTCGAATTTTGGAGTTTATCATGAGCATTCACCTTCCCAAACATAGGGGAAACCTTGTATTCCTTAGGAGTTAGGATACTCAACTTTCGAAGTCATTTTGCTTGAGGGTTGGTCAAGAGCCATGAATCTAGTAGTTCTTTGTTATCTTGACATCATCCAGTACAAATGAGGCTGAGCCTTTAAATAGGTTAGAGACCAAATAGTTACACAAAAATTCACTATTCACTGATAAGGACTAGCCACTTACCCTAAACTGCGACATTGTTATAAGATAGAAAGATTGTGAGTGGGTGCTATCTAGATCACCCCAAGTTCAATCTGTAACCTGGTGCTTGTTTGTATGTTTCCCAGCTGGCATGAAGGAGTGGTTGTTGAGAAGAACGCAAAAGATGAAACAACATATATTGTCCGCTTTCCAGGCATGTCACTGTGATTCAGAGTTTAACTGACCTTTTTCTATGAAGATAGTGTCTTTTATATAAATTGATCTTCATCTGGTTCTTCCAGCTCAAGGCGAAACGTCCACTATCAAAGCTTGGAATCTCCGGCCTTCTCTCATATGGAAAGATGGGGAATGGTTTGAGTTGTCCAGCTCGTATATAAATGATTTTTCTCATGAGGTTCTTACTAAACTTTTGTTTGTTTGACTCTGTTCTCTGATTCTAAATTCATAATCTGTTGAAATATGCTTAGTTGCGATTGCATTCAGATTGTTGTGCCTCAGGAAAAGCGCATGAAGTTGGGCAGCCCAGCTACAGAAGTTAAAAGGAAGGATAAAATGCCAACGATTGTGGAGGATGTAGAATCAGCGAAGACAGAAGACCCGAGTTTGCTGTTGATATCAGCAAATGAGAAAGTATTTAATATTGGTAGGAATACACAAACTGAGAACAAGTCTAATCCATTAAAAACAAGTCGGACTGGTCTGCAGAAGGGGGCATCAAGAGTGATTATTGGTGTTCCCAGGCCTGGGAAAAAGAGAAAATTTATGGAAGTGAGCAAACATTATGATGCAGACACTCGAACTACTGAAGCAAATGATTCAACTAAGTTGGCAAAGTATTTGATGCCGCACGGATCTACTTCCAAAGTGTTAAAAAGAACTTCGAAATATGACACGAAGGAAAAATCTGCAAACGATGCCAGGCCCTTGGCTGTCAAGTCTGGAAAGCAGCCAAGTGTATCAGGTAAGTCTATTTCATAGAATGAGTCATTTTTGCCAAATGCTTTTGCTGATTGTTATTCTNTTTTTTTTTTTTTTTTTTTTTTTTTTTTTAAAAAAAGGGAAAAAAAAAAGAAAGAAAAAAGAAAAAAAGAAAAAAAAAGAGGAGTAGTGGAAGGGTTTCGAATTTTGGAGTTTATCATGAGCATTCACCTTCCCAAACATAGGGGAAACCTTGTATTCCTTAGGAGTTAGGATACTCAACTTTCGAAGTCATTTTGCTTGAGGGTTGGTCAAGAGCCATGAATCTAGTAGTTCTTTGTTATCTTGACATCATCCAGTACAAATGAGGCTGAGCCTTTAAATAGGTTAGAGACCAAATAGTTACACAAAAATTCACTATTCACTGATAAGGACTAGCCACTTACCCTAAACTGCGACATTGTTATAAGATAGAAAGATTGTGAGTGGGTGCTATCTAGATCACCCCAAGTTCAATCTGTAACCTGGTGCTTGTTTGTATGTTTCCCAGCTGGCATGAAGGAGTGGTTGTTGAGAAGAACGCAAAAGATGAAACAACATATATTGTCCGCTTTCCAGGCATGTCACTGTGATTCAGAGTTTAACTGACCTTTTTCTATGAAGATAGTGTCTTTTATATAAATTGATCTTCATCTGGTTCTTCCAGCTCAAGGCGAAACGTCCACTATCAAAGCTTGGAATCTCCGGCCTTCTCTCATATGGAAAGATGGGGAATGGTTTGAGTTGTCCAGCTCGTATATAAATGATTTTTCTCATGAGGTTCTTACTAAACTTTTGTTTGTTTGACTCTGTTCTCTGATTCTAAATTCATAATCTGTTGAAATATGCTTAGTTGCGATTGCATTCAGATTGTTGTGCCTCAGGAAAAGCGCATGAAGTTGGGCAGCCCAGCTACAGAAGTTAAAAGGAAGGATAAAATGCCAACGATTGTGGAGGATGTAGAATCAGCGAAGACAGAAGACCCGAGTTTGCTGTTGATATCAGCAAATGAGAAAGTATTTAATATTGGTAGGAATACACAAACTGAGAACAAGTCTAATCCATTAAAAACAAGTCGGACTGGTCTGCAGAAGGGGGCATCAAGAGTGATTATTGGTGTTCCCAGGCCTGGGAAAAAGAGAAAATTTATGGAAGTGAGCAAACATTATGATGCAGACACTCGAACTACTGAAGCAAATGATTCAACTAAGTTGGCAAAGTATTTGATGCCGCACGGATCTACTTCCAAAGTGTTAAAAAGAACTTCGAAATATGACACGAAGGAAAAATCTGCAAACGATGCCAGGCCCTTGGCTGTCAAGTCTGGAAAGCAGCCAAGTGTATCAGGTAAGTCTATTTCATAGAATGAGTCATTTTTGCCAAATGCTTTTGCTGATTGTTATTCTTTTGGCAACAGATCATGCAGTCATTACCAAGGATCCTGAAAGTCAGAATGAGAGTACTTCAGGGAAGAATGACCAAATGGACGTTCCTCCTTTCTGTAGCACTGAAGAAGCGCCAGAGGGCTCAGTTTTATTTCCTCCAGCACACGCTCCCAAGAAAGCATCCTCATTTCACACGAAGCCAGAACGAGCAAATAAAGGAAAACTTGCTCCTGCAGTTGGGAAATTGGCAAAAATTGAGGAAGAAAAGGTTTTCAATGGGAATCCTACCAAACCAAATTCCAATGTTATTGAGCCACGTAGATCCAACCGCCGTATTCAGCCTACCTCAAGAGTAAGTTCTTGAAACGTGAACTTTAATCAATTGATGTATGTTGATTCTGTTCTTTTGTCATCGAGACCAGTTTCGTTTGAGAATGGGAATCATTTATACTTTATAGAATATGGTAGATGATTTAGCGATCCATGTTACCGTTGAGTGCTCTGCAGTAGGTTTATGTTTGTTTGTATGTTAGCCATGAAGATATTCAGGTTATAAACCTTACTTTAAAATAGTAATATCTAAATCATTTATAAAACTCAAATAATGTAAAGGGTTGTTTGGGGAGCTAGAATGTAATAGGATTACTGGGAATCAGAGGAGTATTAGAATGCGTGAACACTTAAGGAATAAACAAAAATAAGATCACGAACTATGGAAACGGTGAGAAATAGGTTTGGGTTGAAAACTGTCGGAACGAAATGACTTTTGAATTATCGATCGAGCCCAAAACGCTCAATCCAAATGTTCATTTTGGATTACATTACAATACAAGTCTATCCTATTATGGTTCCCAAAACACCCCCTAAGATATTATGATATAATAATGCTCATTAAGATCAACTGTCATGGGTTGGCCTATATGGGTAGGCCTAGGAATAACAATGATCATTAAGATTATTCACTATGGGTTGGTCTAGTGGTCAATAAGGGGCATGTAAACAATAAAGAGTTTAAAGGAAATGAGTTATGATGACTACCTACCTAGATTTAATATTTTTTGAGTCACTTGGCAACTAAATGTAGTAGTGTTAGGCATCATGCGGTTGTCATAGCGAGAATAGTTGAAGTACACACAAGCTGACTTAGACACACACAAGTATAAGAAAAATAAAATAAAGAAATAATAATTATTGTTATTATTAAGATATATTTTATTCTTTTATAAAAACCGCATAGATCGTTCATATCATATCTATTCATAATAGAAGCCAACTGCTTGTTAGTCTTCCAACTGTTTTCCCCATGCATTCAGTTTCCCTTAAGCCCAACATTGAAAGGAGCTTCTCTCTCTCTCCACTTGTTTCATTCATTGGTTTTGTTTTTTATGTACGGTGTACCCTCGTATATGTTGTGTAATGTATGTGTGCATTCGGAATCTTGGTTCTGCAGTTATTAGAGGGACTGCAAAGCTCATTGGCAATTTCAAAGATTCCTTCTATTTCACACGATAAAGGTCAACGAAGCCAGAACAGGAATGCATCTAGAGGTAAGAACTGATGTGCCAAATTCACTTCAGAATTTTGCCTCAAATTAACATTACATCATGACAAAAATTTACTACTTGATTGTGACAGGGAACAAAATCTAAGGGGGGCTGTTGCAAACTTGGTCTTAGGTCCAGGAAACATCCTCTGCCGATAATTAATGTGACTTTAACAGTAGTTTATATTGTTGTATTTACTATAGATAAATGTAGGAAAGGAAACTCTTGCTTCCTTAGTTATACTCCTACAAGTTTAGCTTTAGTTGTTCATATTGATTGAAGTTCCCTCACATTTGTGTTCTCAATCTCCTCTACCATAACAGGATGTATATTGAAACTCTCCTTGTAGGTTGGTCTTGGAATTCAAAAGAAGGTAGTTTGGTTTATTCATTTATGTCTTCTCCCCCACCCCCATGATCTATATATAATATCACGTTAGTGTTGTGTGATTTCAGAATCGAAGCATCTACTCCGCTTCGTTCCCTTTTGGAGTTAAATAGTTACTGTTTACCCTCACCACTGCCACTGCCGAGTACACTTGAAGAAGGTGGTCAGAAGGTTCAGGTACTGCCCAAATTTGTTTTCATTGAACAATTGGGAACAAAGAGTGATGTTGCATTCTTATATTATGGCCGCCTTACCAGTAAAGTTATCTATGGTTTACTAGGATACTTCTTCTTCTTCTTGCTTTTTTGCTTCTTGATATTTTTTTGTATTTTCTTCCTCAGTGCTAGATGTATTAAAAATGTAAATGGTTCGGCTTTATGAAATATTACAGGGACTGGTACCACTGATTATTGGTAGCTTTGATAATGGGCCCAACGCATTGGAATGGGGCCATGCTAAAATCTGAGAAAAATCAATTGATCTATGAGATTGACAGGGAAAACTTTGGTCCTTTGAATTATCTAATCCAACCTTAATCAAATTTATTTCTTTTATTACTACTACTACAACTTTTTAGCTGCTGTCTTCATTTGAAAACTTCTCCCAGTTACTAATAATCTCTATTTGATTTAGTATGATTTTATTCGCAAAACTATCATCCAACAAGCATTTGCAGATGGAGTATTTCATAAGTGAATTCGTAGGAAATCTTGGACAAATTAAATACAATTGTTATCCGGTTGAAATTGAAGAAATTAGTCGAGGTTTAATTTTAGTACAATCAAACCAACACGTACACATGTTAAGGCACCACCCTCATGTTATATGTACATTTAATCAAACGAATATTATTAATTAAAACGGATGCTGTCTATATAAGGTGTACGGATGCTGTCTATTGACTTAAGGTTGGAGATACAATTTTTCCACCATATATAATGGCTATTAAAGTGATTTGTAGATAGAACCAAACAGTTTTTAAAATCTTCAAAATTACTGGTATCCCTCGCTATAAGATAATTGGACACAGTACACCTTATCATTGAAAGCATGCAGTGATAGAATATGCTATTATCGATAGTCTCCCATCAGTCTATCATTGATATACTTTTATAAGAGGGTAGGGATCAATAGGAGTAGCACTAATAGCTTATATATGTTATAGTTTCAATTGATCTTATGGTTGCATTTGACCACGAATTGTACAACTTTTTTGAATAGATAGAAAGATTATTTGGTTGACTGAAAATCTGATCTTAATATGGATGACATTCCTTTATATTCAATGTTATTAGTAGATTAATATCATTAATAAAATTATACACTTTATATAGTTATATTATAGCTAATCTTCAATTAACTTCTACAAAAATATTCAAGAGCTAAATACGAGGAACTCAATAATTTTAACATACTATCAATTAATGATGGACATTCATCACCATTAGAGTTCAATCTTTAATAGAAGTTTACCATTGATACTTCTAATAAAGGAAAAATTGAAATTAAGAAAGATTTAAATATGAAGAAAAACTGTCGCACTTTATAATATTTGCTATATTAGCAAATATTTTGGCTATTTTGTCCTTTGGTATATATATATAATTACCCTAAACCTAATGGGATTGGTAGTTATGAGTGAGATTTTTAAAATTAAGAAATTATCAACAAATCTTGCTTTTTTGAAGTTCAACGATCATGAGGTGGGAGGTTATAACTTCTAACCTATTTAAGAAGGTAAATAAATAATGCGGCTTTGAGGGTATGTTTGGAAGTAATTCTAAAATGGTTAAAATAATTTGTCATTTTAAAAATCACTATAAAACCTGTTTTTAATCCTTCAAAACTAATTTTTATGGTATGAAGATTGCAATTTAAAAGTGTAAAATTTAATAGTAAACTAATTTTGAGTGATTAAAAGCATATTTTAGAGTGAATTTAAAATTGACAAAAGTAATTTTGACAATTTTAAAATCATTCCCAAACATGCTCTAAATCATGCTCCAATCCAAAAATCATCCATAACTAATGCAAGAAATGCATACTCAATAGTCACCATTCCTACGATAGTTTTGTTGATGAATGTGTTGTGGCAATGAGCTCAATGACAAGTTACATCTACAATTGGAACAAAGCCACTCAAAAGCCAACCTAATTCATCTATGACAACGAAATGAATGAACTGCCAAGGACTCAAAATTATTATTTCTAATTTTATCGTTTTCAAAACATGTTAAATACGAGCATTTTTGTTATTTTCCTAAGATCAATGATGCACACATTAAGATTTTGGGTGACCATCTAAACTCTAGCCCTTAGGTTGTGCAAGTGAGTTTGGAATTTAGAATTTCAAAGACAATATTGTTCTTGTGTTTGATAACAAAGATTCTTTAGCATCTTTTTTGCAATTCTTGATTTAGAATTGCATGTTTGAACATTCATGAAAGGTTGATTAACTTGCTTGTGAAATATTTGAACATTGGATAGGAAAGTAATTCTCACCTCATGGTCTTCGATCACAAGGTAACCCTCTACAACTCTCCTTGAAGTTGTGCCAAATTGATTTTTATGGATTTTAGAATTCCTTATCAGGGATTCAAATAACCAAAAGATTCCTAATTCACAAATTGTTAGGAGTTTCATCAGGTTTACCAAAACACTACTTTATCAGGGTCCATCTCCACCACTCATTGGTCTTTGCACACTCCCAACCACATTATCATGAATGTTTTCAATTTCAACAAAATCTCCCACCAAACTACACAACAAATCTAATTGATCATCTTTTTTAACTATCATCCTACAATTCAAAAATCAAAGCTTGTCTTTTTTCAAATAACTCAAGACTTAAAGATTTCAAAGTTGTGTTTCAAATTCAGCTTGCAATTGACAATGCGTTGAAGAGCCAAGAGGCTACCAAACAAAATCTTTGAATCACAATTCTGAACAAAAAAGAAATCTGATTTAATTGAAACATGACGTCTGACAACCTAATGGATCATAGTATTTGGTTCCATAATTTCCCAAAATTCTCTCGTTTTCTAATTTAATCCACCAGCCAATACTTGGAAAATTATTATAAATAAAAAAAAATATCAAACTATTTATAAATACAGAAAAATTTTATTGTCTATGAGATATAAGTCTATCGTGGTCTATCGCTCAAGCGATAGAAGCCTATCATGATTTATCGTTAGTAAACATTGAAATTTTTCTATTTTGTAAAAAAGTATGGCTTATTTTGCTACATTTGAAAACGACCCTAAATAATCACATTTTTAATTTTAATACAGTAATGAAAATACTTGATCAAACCTATTGAACTCTCTTCTGGATGGATATAGTATAACTTCCACTTGATAACTTGGGGGGAAAACTAAAATGATCCATTATGTTTGCTTGTAAACAAAAATTTTCTAAGATTACAAGTTACATAGATGATAAATAAAGATAAAAATAACAATATTGCCTTAGGCTCACAAATACCTGAGAGGCTCAATCACATGAAGTCAGTAACGCTGGCCATAATTCTGTGTATCGAGCACGTCGTCGATTCGAAACTGTTGTATCCAGTCATTGCATCGTGCAGGTTCGGCTCATATATCACAATTCCACAAACTAATAAAACGAATCAGGAGGGGGGCAGGGATGTTAGCTCGCGCATAAAAGACTAA
mRNA sequence
AAGAATTTGTTATAAATTTGGAAAAGATTATTAGCATATTAACTACCTTTTCTCTTAATCATTTTTTTCCCCAAATCATTTATAAAGCAGAGATAAACCCATGGATTATGACGACAATGACTTTCAAAGCCAGAATCTTCATTTAGCTGGTGAAGGAAGTGCCAAATTTCCTCCTGTTCTTCGACAATATGCTCTCCCGAAGTTTGATTTTGATGACACCCTTCAGGGGCATGTAAGATTTGATAGTTTGGTTGAACCTGAGGTGTTCTTAGGTATTGAGAATAGCGAAGATAATCAGTGGATTGAAGATTATTCCCGTGGAAGTAGTGGGATAGGATTTACTTCTTGTGCAGCAGAATCTTGTTCTATATTGAGGCGGAAGAATGTTTGGTCTGAGGCCACTTCCTCTGAATCTGTTGAAATGTTATTGAAATCTGTTGGGCAGGAAGATATCAATCTGGCACCAGCTGTCACCGGCGAGTCAAATGCTCGTGAAAAATTGGACTACTTAACAAACCCAATGGACCCTACTTTAAAAGATGACGGTAGTAGTTTCTGTGAAATGGGTGATTTACAGCCTACATTGCTATCAAATATAAGTCTTGAGGAATTGCATGTTGTTAATGAGGACATAAGAGGGGAGCAACAACCTCAAGGGGATGATCCTACTGAATTTCAAGAGATTTGTACTGTTGATAGAAGTTTGGGTGAGGTTGATCCTGATGTTGCCCATGAACTTGTAGATATGCCAGCAAGTGAGGGAAGTTCAGGTATTGATGAAAACAGTAAACAAACATGTGCAAGTACAATTAATACTTCAGTTTCCATATCAATGGAAGATAAAGGGCAAGATGATTTTTCAGCTTCAGGAAAGCATATAAATGATTTGGTTACCTGCACGCAAGAAGGTAGTGGAAAGTTGAGCAGTCAGAAGATTGAACAACAAATAAAAGATTTATCCGAGAATCCTGTTAATACATATATTGGGAATATTGAACAAGTGGTCAATTCACACGAGTTGAATAAAGAAGACCAAAACCGTCTTTTATCCCCCTCAGTTCCTGCAGACAGATTGGTCATTGAATCTAGTATTGCTACTTTGCAGTCCCATGCAAGTATGACCTTGAAGGGGGATTGTGTGTTCCATTCAGGTAGTGGAGAGGTTACACCCGAAGTTCCTTCTGAAACTGACAAGTTTGATGATAAGGTCTTGTGTTCAAATGTGGAGATTGGGAATCCGTCTAAAACAAACATGCACGAGGTATTACCTACAGTTGTTAAAGGTGATGCTAGAGCTGTGGTGTGTGCAGGCGAGGGGAAAAACATTAATGCAGAAGTTTGTGCCTTTCAAGGGCCTAAGATTGATTCAGTTGGGCAGATGGCTTGTGCACAAGAAATAATTAGTGTAGATCAGCAACGCTTTCCCTCGGGTATTGAGATACAAACTAGTAAGTCCGAGTCTTCTGCATCTGCTATGGAGAAAAGTAATGCCTCTAAGGTTGGTGAAAGCAGCAGTGGTCATATCAGAGATATTCCAGATAAATTTACAAAGGACGAACATGGTATGATCAGTTTGAGAGATGTTCGTGGTTGCACACTTCCCATAGAAAAAAATCTGTATTCCGAAGGCCATCTGCCACCTACTACTGTGGCTGAATCAACACAATTATGCGAGGAAAATAAATTGTGCCAGTCAGGTAATGACCATGTCACACATGCCAGTTGCAAGGAAGAAGTGAGGTTGTCTTCTGATTCTATTAGTGTGAATGGCAAGTTTGCCGAGTCTCCTGTCAGAGATAAGAGAATTGTATCCTTGTCTTTTCAAGAGAGTGATGTAGAAAGTGGGATGATAGATACGAAAGTAGAGTACAGTGCTAATGCTGTGTCAGTTTCTACCTTTGGGGATGCCAATGTGAGAACATGTGACACATTACAGGGCGACTCCTTACCCGTAGTTGATGCTTTGACAGACAGAAAAGATGCTGATGAAAAAGAGGACCAGTTGCAACCTGGTGTGGTGGAGTTTACTCAATCAGATAGCAAGGAAGAAAGTGGTGTGATAATTCCTGCTGAAGGAAGTTTTCCTCTGTCGGATACTTCTCAACCTGTGGGGAAATTTCATCCCCTTTCTGAAGCTGAAAAATCTGCGTGTCTCCTTACTGGTCAGGGATTTGGTGAAAGTATTGATCAAACTATTTCAAAGAATTTGAATTCTGATGACTGCAACAGAGAAAGCCAATCTATACCCCAAGCTGACATTCCTAATAATGTTATCCAAGACTGCGGACAGGAAATGGACATTGATCCAGCCTTTTCAAAGTCATCTGCAAAAGCATGTGATAGTGGTGTTAAAAAGTCAGATGAAAAATCTTTTCCGCCCGATGCCACGTCTTTAACACCACTTCCAGGAGAAACACTTGATAATTATCAGAAAGATCAGGAAAGTACTAAAGTCGTTTCAGAATCTGTGGGAAATAATTGTCAGCAGGCCATTGCAGTGAACATTGACAGTGATGCTGGCAAAAAGGAAGGTTCTTTATGCTCTGCTGATTTTCCGCAATCTCATGAACAGATGTCTGTCGTGGGGAATGGTAATTCTACTGCTGATAAGCCTCCTCCTAATTTACCTGATGTTGTCAAAACTACTGTGGTTGCTCATGATCCAGATGTAAAGGATTGTAATATGAGACCAGCAAGTAAGAATGTTGAAGCTGCAGAGGCCAAGGATAGGCTGGTTGGCAACACGTCATCTGGTTCCCAGTTGCCCAAAGGAAATGTTGCCTCTGAAAGTGAAACAGCTCTTACCTTTGAGTCCAGTTCATTGGTAGATCTGCCAAAAAATGATTCTGGCATTGCAGTTGCAACAGCAGCTACTGCTTCTTTGGTTGCGGAGGGACCCCAATCATCTTCTGGCCTATCCAAATTGGACATCAAGAGTGCTCAGGATATTTCACACAGTAGCCCTCATGTGTCTGAGGTAAAAGTTGCACGTGCTCGTTCCAAGGGCACTCCTGAGCGTAAACCTAGGCGTGCTTCTGCAAAAGGATTGGGGAAAGAGTCATCTACAAAGGGAAATCACACGAAGAAATCAGAGAAAGTTGAGAAATCGAACAGTACGCCCATAAGCAATCCTGGAATTTTCCAGCTTGCACAGTCTAATGAGATGCAACATCATGGACATGTGGAGTCCAGTGGTGCAAAACCATTTGTCTTTATTGGTGCTTCGACTTCTAGCATTCCAGACTTGAACAATTCTGCCTCCCCATCCCCAATGTTTCAACAGCCTTTCACAGACTTGCAACAAGTTCAATTGCGTGCTCAAATATTTGTTTATGGAGCTTTGATTCAAGGTACAGCACCAGATGAGGCGTACATGTTGTCTGCATTTGGAGGACCAGATGGTGGAACAAACCTTTGGGAGAATGCCTGGCGTATGTGTGTAGACAGGCTTAATGGAAAAAAATCTCAGCCTACTAATCCAGAGACACCTTCACAATCACAATCTGGTGGTAGAAGTACTGAGCAAGCAAATAAACAAAGTACACTGCAAAGTAAAATTACATCTCCTCCAGTTAGTCGAGTTAGCAGTAAGAGTACATCAACAGTGTTAAATCCTATGATCCCTCTTTCCTCACCACTCTGGAGTATTTCCACACCTTCTAATGCTCTGCAATCTAGTATTGTGCCTCGAAGTCCAGTTATAGACTACCAACAAGCACTTACTCCATTGCATCCGTATCAGACTCCTCCTGTCAGGAACTTTATTGGTCATAATCTTTCATGGTTTTCTCAGGCTCCATTCCATAGTACCTGGGTTGCTACTCAGACTTCGACACCCGACTCCAGTGCACGATTTTCTGGTTTGCCAATTACTGAGCCTGTTCATTTAACACCAGTAAAGGAATCATCTGTGTCACAATCCTCTGCCGTGAAGCCCTCTGGTTCCATGGTTCACGGTGGAACTCCTGGCAATGTATTTACTGGAGCCTCCCCCCTGCTTGAGTTAAAAAAAGTGTCAGTAACCACAGGGCAAAATTCCACTGAATCAAAAATGAGAAGAAGAAAGAAGAATACAGTTTCTGAGGATCCTGGCCTTATAACTATCCAGCAGGTTCAACCTCATTTAAAACCAGTGCCAGCTGTAGTTACAACTACAATTTCTACTCTTGCAACATCTCCATCTGTCCACCCTAAGGCAGCTTCTGAGAATTTGATTTTATCTCCACCTCCATTGTGTCCGACAACTCACGCAAAGAGTGCAGGTCAAGATTTGAGAGGGAGAGCCATGTTTTCAGATGAAACACTCGGTAAGGTTAGGGAGGCTAAGCAACTGGCTGAGGATGCTGCATTGTTTGCATCTGAAGCTGTTAAACACAGTGCAGAAGTGTGGAGTCAATTGGACAGGCAGAAGAATTCAGAATTAGTACCAGACGTTGAAGCTAAACTAGCTTCTGCAGCTATTGCAATAGCAGCAGCTGCTGCTGTCGCAAAGGCTGCGGCTGCTGCCGCCAATGTGGCATCAAATGCTGCATGTCAAGCAAAGTTGATGGCTGATGAGGCAGTTACTTCATCTAGCCATGATGTTCCCTGTCAAAGCAATGAATTTTCTATTCATGGTAGTGCTGTTGGTGTAGGAAAGGCTACTCCAGCCTCCATCTTAAGGGGTGAAGATGGTGGAAATGGTTCCAGCTCTATTATTATTGCCGCAAGGGAGGCAGCCAGAAAGAGGGTTGAAGCAGCATCTGCTGCTTCTAAGCATGCTGAAAATGTGGATGCCATTGTCAAAGCTGCCGAGCTGGCAGCAGCAGCTGTGTCACAAGCTGGAAAGTTAGTTGCTATGGGCGATCCTCTCCCCTTAGGCAAATTAGTGGAAGCTGGTCCAGATGGTTATTGGAAAACGCCTCAAGTATCTTCTGAGCTGGTTATGAGATCAGACGATGTTAATGGAGGACGTTCCAATTCAGCAATTAAGCGTCCTAGAGACGGATCATCAAGTAAGAATGAGATTCAGGTAAGTGGCGGTGCCAAGTCACCAATTCCTGGAGAAATATCTATGGGTTCTGTTGAGAATCATTCCAAATTGGTGGACGGCATCACAAGTTCTGTAGCACCTCGTGAGAAGGATCTGAGAGGACAGAAGGACCAAAATGCTTCTGATCTGACAAAAACCATTGGTGTAGTTCCAGAATCTGAAGTTGGAGAAAGATCCTCCCAGGACGAGTGTGAAAAGGCTAAGGATTTAAGACAAAGCAGCATCAAGGAAGGTTCCCATGTCGAGGTTTTCAAGGATGGGAATGGATTAAAAGCATCTTGGTTCACAGCCAGTGTGCTGAGTTTGAAGGAAGGGAAAGCTTACGTGTCCTACACTGAACTTCAACCTGAAGAAGGGTCAGGCCAACTAAAGGAGTGGGTTGCTCTTGATGGTCAAGGGGGCATGGCACCAAGAATTCGAATTTCTCGTCCTATGACTAACTTGCGGACTGAAGGAACAAGGAAGCGACGCCGAGCAGCTGCAGGGGACTACATCTGTTGCGATTGCATTCAGATTGTTGTGCCTCAGGAAAAGCGCATGAAGTTGGGCAGCCCAGCTACAGAAGTTAAAAGGAAGGATAAAATGCCAACGATTGTGGAGGATGTAGAATCAGCGAAGACAGAAGACCCGAGTTTGCTGTTGATATCAGCAAATGAGAAAGTATTTAATATTGGTAGGAATACACAAACTGAGAACAAGTCTAATCCATTAAAAACAAGTCGGACTGGTCTGCAGAAGGGGGCATCAAGAGTGATTATTGGTGTTCCCAGGCCTGGGAAAAAGAGAAAATTTATGGAAGTGAGCAAACATTATGATGCAGACACTCGAACTACTGAAGCAAATGATTCAACTAAGTTGGCAAAGTATTTGATGCCGCACGGATCTACTTCCAAAGTGTTAAAAAGAACTTCGAAATATGACACGAAGGAAAAATCTGCAAACGATGCCAGGCCCTTGGCTGTCAAGTCTGGAAAGCAGCCAAGTGTATCAGATCATGCAGTCATTACCAAGGATCCTGAAAGTCAGAATGAGAGTACTTCAGGGAAGAATGACCAAATGGACGTTCCTCCTTTCTGTAGCACTGAAGAAGCGCCAGAGGGCTCAGTTTTATTTCCTCCAGCACACGCTCCCAAGAAAGCATCCTCATTTCACACGAAGCCAGAACGAGCAAATAAAGGAAAACTTGCTCCTGCAGTTGGGAAATTGGCAAAAATTGAGGAAGAAAAGGTTTTCAATGGGAATCCTACCAAACCAAATTCCAATGTTATTGAGCCACGTAGATCCAACCGCCGTATTCAGCCTACCTCAAGATTATTAGAGGGACTGCAAAGCTCATTGGCAATTTCAAAGATTCCTTCTATTTCACACGATAAAGGTCAACGAAGCCAGAACAGGAATGCATCTAGAGGATGTATATTGAAACTCTCCTTAATCGAAGCATCTACTCCGCTTCGTTCCCTTTTGGAGTTAAATAGTTACTGTTTACCCTCACCACTGCCACTGCCGAGTACACTTGAAGAAGGTGGTCAGAAGGTTCAGGTACTGCCCAAATTTGTTTTCATTGAACAATTGGGAACAAAGAGTGATGTTGCATTCTTATATTATGGCCGCCTTACCAGTAAAGTTATCTATGGTTTACTAGGATACTTCTTCTTCTTCTTGCTTTTTTGCTTCTTGATATTTTTTTGTATTTTCTTCCTCAGTGCTAGATGTATTAAAAATGTAAATGGCTCACAAATACCTGAGAGGCTCAATCACATGAAGTCAGTAACGCTGGCCATAATTCTGTGTATCGAGCACGTCGTCGATTCGAAACTGTTGTATCCAGTCATTGCATCGTGCAGGTTCGGCTCATATATCACAATTCCACAAACTAATAAAACGAATCAGGAGGGGGGCAGGGATGTTAGCTCGCGCATAAAAGACTAA
Coding sequence (CDS)
ATGGATTATGACGACAATGACTTTCAAAGCCAGAATCTTCATTTAGCTGGTGAAGGAAGTGCCAAATTTCCTCCTGTTCTTCGACAATATGCTCTCCCGAAGTTTGATTTTGATGACACCCTTCAGGGGCATGTAAGATTTGATAGTTTGGTTGAACCTGAGGTGTTCTTAGGTATTGAGAATAGCGAAGATAATCAGTGGATTGAAGATTATTCCCGTGGAAGTAGTGGGATAGGATTTACTTCTTGTGCAGCAGAATCTTGTTCTATATTGAGGCGGAAGAATGTTTGGTCTGAGGCCACTTCCTCTGAATCTGTTGAAATGTTATTGAAATCTGTTGGGCAGGAAGATATCAATCTGGCACCAGCTGTCACCGGCGAGTCAAATGCTCGTGAAAAATTGGACTACTTAACAAACCCAATGGACCCTACTTTAAAAGATGACGGTAGTAGTTTCTGTGAAATGGGTGATTTACAGCCTACATTGCTATCAAATATAAGTCTTGAGGAATTGCATGTTGTTAATGAGGACATAAGAGGGGAGCAACAACCTCAAGGGGATGATCCTACTGAATTTCAAGAGATTTGTACTGTTGATAGAAGTTTGGGTGAGGTTGATCCTGATGTTGCCCATGAACTTGTAGATATGCCAGCAAGTGAGGGAAGTTCAGGTATTGATGAAAACAGTAAACAAACATGTGCAAGTACAATTAATACTTCAGTTTCCATATCAATGGAAGATAAAGGGCAAGATGATTTTTCAGCTTCAGGAAAGCATATAAATGATTTGGTTACCTGCACGCAAGAAGGTAGTGGAAAGTTGAGCAGTCAGAAGATTGAACAACAAATAAAAGATTTATCCGAGAATCCTGTTAATACATATATTGGGAATATTGAACAAGTGGTCAATTCACACGAGTTGAATAAAGAAGACCAAAACCGTCTTTTATCCCCCTCAGTTCCTGCAGACAGATTGGTCATTGAATCTAGTATTGCTACTTTGCAGTCCCATGCAAGTATGACCTTGAAGGGGGATTGTGTGTTCCATTCAGGTAGTGGAGAGGTTACACCCGAAGTTCCTTCTGAAACTGACAAGTTTGATGATAAGGTCTTGTGTTCAAATGTGGAGATTGGGAATCCGTCTAAAACAAACATGCACGAGGTATTACCTACAGTTGTTAAAGGTGATGCTAGAGCTGTGGTGTGTGCAGGCGAGGGGAAAAACATTAATGCAGAAGTTTGTGCCTTTCAAGGGCCTAAGATTGATTCAGTTGGGCAGATGGCTTGTGCACAAGAAATAATTAGTGTAGATCAGCAACGCTTTCCCTCGGGTATTGAGATACAAACTAGTAAGTCCGAGTCTTCTGCATCTGCTATGGAGAAAAGTAATGCCTCTAAGGTTGGTGAAAGCAGCAGTGGTCATATCAGAGATATTCCAGATAAATTTACAAAGGACGAACATGGTATGATCAGTTTGAGAGATGTTCGTGGTTGCACACTTCCCATAGAAAAAAATCTGTATTCCGAAGGCCATCTGCCACCTACTACTGTGGCTGAATCAACACAATTATGCGAGGAAAATAAATTGTGCCAGTCAGGTAATGACCATGTCACACATGCCAGTTGCAAGGAAGAAGTGAGGTTGTCTTCTGATTCTATTAGTGTGAATGGCAAGTTTGCCGAGTCTCCTGTCAGAGATAAGAGAATTGTATCCTTGTCTTTTCAAGAGAGTGATGTAGAAAGTGGGATGATAGATACGAAAGTAGAGTACAGTGCTAATGCTGTGTCAGTTTCTACCTTTGGGGATGCCAATGTGAGAACATGTGACACATTACAGGGCGACTCCTTACCCGTAGTTGATGCTTTGACAGACAGAAAAGATGCTGATGAAAAAGAGGACCAGTTGCAACCTGGTGTGGTGGAGTTTACTCAATCAGATAGCAAGGAAGAAAGTGGTGTGATAATTCCTGCTGAAGGAAGTTTTCCTCTGTCGGATACTTCTCAACCTGTGGGGAAATTTCATCCCCTTTCTGAAGCTGAAAAATCTGCGTGTCTCCTTACTGGTCAGGGATTTGGTGAAAGTATTGATCAAACTATTTCAAAGAATTTGAATTCTGATGACTGCAACAGAGAAAGCCAATCTATACCCCAAGCTGACATTCCTAATAATGTTATCCAAGACTGCGGACAGGAAATGGACATTGATCCAGCCTTTTCAAAGTCATCTGCAAAAGCATGTGATAGTGGTGTTAAAAAGTCAGATGAAAAATCTTTTCCGCCCGATGCCACGTCTTTAACACCACTTCCAGGAGAAACACTTGATAATTATCAGAAAGATCAGGAAAGTACTAAAGTCGTTTCAGAATCTGTGGGAAATAATTGTCAGCAGGCCATTGCAGTGAACATTGACAGTGATGCTGGCAAAAAGGAAGGTTCTTTATGCTCTGCTGATTTTCCGCAATCTCATGAACAGATGTCTGTCGTGGGGAATGGTAATTCTACTGCTGATAAGCCTCCTCCTAATTTACCTGATGTTGTCAAAACTACTGTGGTTGCTCATGATCCAGATGTAAAGGATTGTAATATGAGACCAGCAAGTAAGAATGTTGAAGCTGCAGAGGCCAAGGATAGGCTGGTTGGCAACACGTCATCTGGTTCCCAGTTGCCCAAAGGAAATGTTGCCTCTGAAAGTGAAACAGCTCTTACCTTTGAGTCCAGTTCATTGGTAGATCTGCCAAAAAATGATTCTGGCATTGCAGTTGCAACAGCAGCTACTGCTTCTTTGGTTGCGGAGGGACCCCAATCATCTTCTGGCCTATCCAAATTGGACATCAAGAGTGCTCAGGATATTTCACACAGTAGCCCTCATGTGTCTGAGGTAAAAGTTGCACGTGCTCGTTCCAAGGGCACTCCTGAGCGTAAACCTAGGCGTGCTTCTGCAAAAGGATTGGGGAAAGAGTCATCTACAAAGGGAAATCACACGAAGAAATCAGAGAAAGTTGAGAAATCGAACAGTACGCCCATAAGCAATCCTGGAATTTTCCAGCTTGCACAGTCTAATGAGATGCAACATCATGGACATGTGGAGTCCAGTGGTGCAAAACCATTTGTCTTTATTGGTGCTTCGACTTCTAGCATTCCAGACTTGAACAATTCTGCCTCCCCATCCCCAATGTTTCAACAGCCTTTCACAGACTTGCAACAAGTTCAATTGCGTGCTCAAATATTTGTTTATGGAGCTTTGATTCAAGGTACAGCACCAGATGAGGCGTACATGTTGTCTGCATTTGGAGGACCAGATGGTGGAACAAACCTTTGGGAGAATGCCTGGCGTATGTGTGTAGACAGGCTTAATGGAAAAAAATCTCAGCCTACTAATCCAGAGACACCTTCACAATCACAATCTGGTGGTAGAAGTACTGAGCAAGCAAATAAACAAAGTACACTGCAAAGTAAAATTACATCTCCTCCAGTTAGTCGAGTTAGCAGTAAGAGTACATCAACAGTGTTAAATCCTATGATCCCTCTTTCCTCACCACTCTGGAGTATTTCCACACCTTCTAATGCTCTGCAATCTAGTATTGTGCCTCGAAGTCCAGTTATAGACTACCAACAAGCACTTACTCCATTGCATCCGTATCAGACTCCTCCTGTCAGGAACTTTATTGGTCATAATCTTTCATGGTTTTCTCAGGCTCCATTCCATAGTACCTGGGTTGCTACTCAGACTTCGACACCCGACTCCAGTGCACGATTTTCTGGTTTGCCAATTACTGAGCCTGTTCATTTAACACCAGTAAAGGAATCATCTGTGTCACAATCCTCTGCCGTGAAGCCCTCTGGTTCCATGGTTCACGGTGGAACTCCTGGCAATGTATTTACTGGAGCCTCCCCCCTGCTTGAGTTAAAAAAAGTGTCAGTAACCACAGGGCAAAATTCCACTGAATCAAAAATGAGAAGAAGAAAGAAGAATACAGTTTCTGAGGATCCTGGCCTTATAACTATCCAGCAGGTTCAACCTCATTTAAAACCAGTGCCAGCTGTAGTTACAACTACAATTTCTACTCTTGCAACATCTCCATCTGTCCACCCTAAGGCAGCTTCTGAGAATTTGATTTTATCTCCACCTCCATTGTGTCCGACAACTCACGCAAAGAGTGCAGGTCAAGATTTGAGAGGGAGAGCCATGTTTTCAGATGAAACACTCGGTAAGGTTAGGGAGGCTAAGCAACTGGCTGAGGATGCTGCATTGTTTGCATCTGAAGCTGTTAAACACAGTGCAGAAGTGTGGAGTCAATTGGACAGGCAGAAGAATTCAGAATTAGTACCAGACGTTGAAGCTAAACTAGCTTCTGCAGCTATTGCAATAGCAGCAGCTGCTGCTGTCGCAAAGGCTGCGGCTGCTGCCGCCAATGTGGCATCAAATGCTGCATGTCAAGCAAAGTTGATGGCTGATGAGGCAGTTACTTCATCTAGCCATGATGTTCCCTGTCAAAGCAATGAATTTTCTATTCATGGTAGTGCTGTTGGTGTAGGAAAGGCTACTCCAGCCTCCATCTTAAGGGGTGAAGATGGTGGAAATGGTTCCAGCTCTATTATTATTGCCGCAAGGGAGGCAGCCAGAAAGAGGGTTGAAGCAGCATCTGCTGCTTCTAAGCATGCTGAAAATGTGGATGCCATTGTCAAAGCTGCCGAGCTGGCAGCAGCAGCTGTGTCACAAGCTGGAAAGTTAGTTGCTATGGGCGATCCTCTCCCCTTAGGCAAATTAGTGGAAGCTGGTCCAGATGGTTATTGGAAAACGCCTCAAGTATCTTCTGAGCTGGTTATGAGATCAGACGATGTTAATGGAGGACGTTCCAATTCAGCAATTAAGCGTCCTAGAGACGGATCATCAAGTAAGAATGAGATTCAGGTAAGTGGCGGTGCCAAGTCACCAATTCCTGGAGAAATATCTATGGGTTCTGTTGAGAATCATTCCAAATTGGTGGACGGCATCACAAGTTCTGTAGCACCTCGTGAGAAGGATCTGAGAGGACAGAAGGACCAAAATGCTTCTGATCTGACAAAAACCATTGGTGTAGTTCCAGAATCTGAAGTTGGAGAAAGATCCTCCCAGGACGAGTGTGAAAAGGCTAAGGATTTAAGACAAAGCAGCATCAAGGAAGGTTCCCATGTCGAGGTTTTCAAGGATGGGAATGGATTAAAAGCATCTTGGTTCACAGCCAGTGTGCTGAGTTTGAAGGAAGGGAAAGCTTACGTGTCCTACACTGAACTTCAACCTGAAGAAGGGTCAGGCCAACTAAAGGAGTGGGTTGCTCTTGATGGTCAAGGGGGCATGGCACCAAGAATTCGAATTTCTCGTCCTATGACTAACTTGCGGACTGAAGGAACAAGGAAGCGACGCCGAGCAGCTGCAGGGGACTACATCTGTTGCGATTGCATTCAGATTGTTGTGCCTCAGGAAAAGCGCATGAAGTTGGGCAGCCCAGCTACAGAAGTTAAAAGGAAGGATAAAATGCCAACGATTGTGGAGGATGTAGAATCAGCGAAGACAGAAGACCCGAGTTTGCTGTTGATATCAGCAAATGAGAAAGTATTTAATATTGGTAGGAATACACAAACTGAGAACAAGTCTAATCCATTAAAAACAAGTCGGACTGGTCTGCAGAAGGGGGCATCAAGAGTGATTATTGGTGTTCCCAGGCCTGGGAAAAAGAGAAAATTTATGGAAGTGAGCAAACATTATGATGCAGACACTCGAACTACTGAAGCAAATGATTCAACTAAGTTGGCAAAGTATTTGATGCCGCACGGATCTACTTCCAAAGTGTTAAAAAGAACTTCGAAATATGACACGAAGGAAAAATCTGCAAACGATGCCAGGCCCTTGGCTGTCAAGTCTGGAAAGCAGCCAAGTGTATCAGATCATGCAGTCATTACCAAGGATCCTGAAAGTCAGAATGAGAGTACTTCAGGGAAGAATGACCAAATGGACGTTCCTCCTTTCTGTAGCACTGAAGAAGCGCCAGAGGGCTCAGTTTTATTTCCTCCAGCACACGCTCCCAAGAAAGCATCCTCATTTCACACGAAGCCAGAACGAGCAAATAAAGGAAAACTTGCTCCTGCAGTTGGGAAATTGGCAAAAATTGAGGAAGAAAAGGTTTTCAATGGGAATCCTACCAAACCAAATTCCAATGTTATTGAGCCACGTAGATCCAACCGCCGTATTCAGCCTACCTCAAGATTATTAGAGGGACTGCAAAGCTCATTGGCAATTTCAAAGATTCCTTCTATTTCACACGATAAAGGTCAACGAAGCCAGAACAGGAATGCATCTAGAGGATGTATATTGAAACTCTCCTTAATCGAAGCATCTACTCCGCTTCGTTCCCTTTTGGAGTTAAATAGTTACTGTTTACCCTCACCACTGCCACTGCCGAGTACACTTGAAGAAGGTGGTCAGAAGGTTCAGGTACTGCCCAAATTTGTTTTCATTGAACAATTGGGAACAAAGAGTGATGTTGCATTCTTATATTATGGCCGCCTTACCAGTAAAGTTATCTATGGTTTACTAGGATACTTCTTCTTCTTCTTGCTTTTTTGCTTCTTGATATTTTTTTGTATTTTCTTCCTCAGTGCTAGATGTATTAAAAATGTAAATGGCTCACAAATACCTGAGAGGCTCAATCACATGAAGTCAGTAACGCTGGCCATAATTCTGTGTATCGAGCACGTCGTCGATTCGAAACTGTTGTATCCAGTCATTGCATCGTGCAGGTTCGGCTCATATATCACAATTCCACAAACTAATAAAACGAATCAGGAGGGGGGCAGGGATGTTAGCTCGCGCATAAAAGACTAA
Protein sequence
MDYDDNDFQSQNLHLAGEGSAKFPPVLRQYALPKFDFDDTLQGHVRFDSLVEPEVFLGIENSEDNQWIEDYSRGSSGIGFTSCAAESCSILRRKNVWSEATSSESVEMLLKSVGQEDINLAPAVTGESNAREKLDYLTNPMDPTLKDDGSSFCEMGDLQPTLLSNISLEELHVVNEDIRGEQQPQGDDPTEFQEICTVDRSLGEVDPDVAHELVDMPASEGSSGIDENSKQTCASTINTSVSISMEDKGQDDFSASGKHINDLVTCTQEGSGKLSSQKIEQQIKDLSENPVNTYIGNIEQVVNSHELNKEDQNRLLSPSVPADRLVIESSIATLQSHASMTLKGDCVFHSGSGEVTPEVPSETDKFDDKVLCSNVEIGNPSKTNMHEVLPTVVKGDARAVVCAGEGKNINAEVCAFQGPKIDSVGQMACAQEIISVDQQRFPSGIEIQTSKSESSASAMEKSNASKVGESSSGHIRDIPDKFTKDEHGMISLRDVRGCTLPIEKNLYSEGHLPPTTVAESTQLCEENKLCQSGNDHVTHASCKEEVRLSSDSISVNGKFAESPVRDKRIVSLSFQESDVESGMIDTKVEYSANAVSVSTFGDANVRTCDTLQGDSLPVVDALTDRKDADEKEDQLQPGVVEFTQSDSKEESGVIIPAEGSFPLSDTSQPVGKFHPLSEAEKSACLLTGQGFGESIDQTISKNLNSDDCNRESQSIPQADIPNNVIQDCGQEMDIDPAFSKSSAKACDSGVKKSDEKSFPPDATSLTPLPGETLDNYQKDQESTKVVSESVGNNCQQAIAVNIDSDAGKKEGSLCSADFPQSHEQMSVVGNGNSTADKPPPNLPDVVKTTVVAHDPDVKDCNMRPASKNVEAAEAKDRLVGNTSSGSQLPKGNVASESETALTFESSSLVDLPKNDSGIAVATAATASLVAEGPQSSSGLSKLDIKSAQDISHSSPHVSEVKVARARSKGTPERKPRRASAKGLGKESSTKGNHTKKSEKVEKSNSTPISNPGIFQLAQSNEMQHHGHVESSGAKPFVFIGASTSSIPDLNNSASPSPMFQQPFTDLQQVQLRAQIFVYGALIQGTAPDEAYMLSAFGGPDGGTNLWENAWRMCVDRLNGKKSQPTNPETPSQSQSGGRSTEQANKQSTLQSKITSPPVSRVSSKSTSTVLNPMIPLSSPLWSISTPSNALQSSIVPRSPVIDYQQALTPLHPYQTPPVRNFIGHNLSWFSQAPFHSTWVATQTSTPDSSARFSGLPITEPVHLTPVKESSVSQSSAVKPSGSMVHGGTPGNVFTGASPLLELKKVSVTTGQNSTESKMRRRKKNTVSEDPGLITIQQVQPHLKPVPAVVTTTISTLATSPSVHPKAASENLILSPPPLCPTTHAKSAGQDLRGRAMFSDETLGKVREAKQLAEDAALFASEAVKHSAEVWSQLDRQKNSELVPDVEAKLASAAIAIAAAAAVAKAAAAAANVASNAACQAKLMADEAVTSSSHDVPCQSNEFSIHGSAVGVGKATPASILRGEDGGNGSSSIIIAAREAARKRVEAASAASKHAENVDAIVKAAELAAAAVSQAGKLVAMGDPLPLGKLVEAGPDGYWKTPQVSSELVMRSDDVNGGRSNSAIKRPRDGSSSKNEIQVSGGAKSPIPGEISMGSVENHSKLVDGITSSVAPREKDLRGQKDQNASDLTKTIGVVPESEVGERSSQDECEKAKDLRQSSIKEGSHVEVFKDGNGLKASWFTASVLSLKEGKAYVSYTELQPEEGSGQLKEWVALDGQGGMAPRIRISRPMTNLRTEGTRKRRRAAAGDYICCDCIQIVVPQEKRMKLGSPATEVKRKDKMPTIVEDVESAKTEDPSLLLISANEKVFNIGRNTQTENKSNPLKTSRTGLQKGASRVIIGVPRPGKKRKFMEVSKHYDADTRTTEANDSTKLAKYLMPHGSTSKVLKRTSKYDTKEKSANDARPLAVKSGKQPSVSDHAVITKDPESQNESTSGKNDQMDVPPFCSTEEAPEGSVLFPPAHAPKKASSFHTKPERANKGKLAPAVGKLAKIEEEKVFNGNPTKPNSNVIEPRRSNRRIQPTSRLLEGLQSSLAISKIPSISHDKGQRSQNRNASRGCILKLSLIEASTPLRSLLELNSYCLPSPLPLPSTLEEGGQKVQVLPKFVFIEQLGTKSDVAFLYYGRLTSKVIYGLLGYFFFFLLFCFLIFFCIFFLSARCIKNVNGSQIPERLNHMKSVTLAIILCIEHVVDSKLLYPVIASCRFGSYITIPQTNKTNQEGGRDVSSRIKD
Homology
BLAST of ClCG10G002580 vs. NCBI nr
Match:
XP_038903704.1 (uncharacterized protein LOC120090225 isoform X1 [Benincasa hispida])
HSP 1 Score: 3644.7 bits (9450), Expect = 0.0e+00
Identity = 1959/2188 (89.53%), Postives = 2014/2188 (92.05%), Query Frame = 0
Query: 1 MDYDDNDFQSQNLHLAGEGSAKFPPVLRQYALPKFDFDDTLQGHVRFDSLVEPEVFLGIE 60
MDYDDNDFQSQNLHLAGEGSAKFPPVLRQYALPKFDFDDTLQGHVRFDSLVEPEVFLGIE
Sbjct: 8 MDYDDNDFQSQNLHLAGEGSAKFPPVLRQYALPKFDFDDTLQGHVRFDSLVEPEVFLGIE 67
Query: 61 NSEDNQWIEDYSRGSSGIGFTSCAAESCSILRRKNVWSEATSSESVEMLLKSVGQEDINL 120
N+EDNQWIEDYSRGSSGIGFTSCAAESCSILRRKNVWSEATSSESVEMLLKSVGQEDINL
Sbjct: 68 NNEDNQWIEDYSRGSSGIGFTSCAAESCSILRRKNVWSEATSSESVEMLLKSVGQEDINL 127
Query: 121 APAVTGESNAREKLDYLTNPMDPTLKDDGSSFCEMGDLQPTLLSNISLEELHVVNEDIRG 180
AP VTGESNAREKLDYLTNPMDPTLKDDGSSF EMGDLQPTLLSNISLEELHVVNEDIRG
Sbjct: 128 APTVTGESNAREKLDYLTNPMDPTLKDDGSSFGEMGDLQPTLLSNISLEELHVVNEDIRG 187
Query: 181 E-QQPQGDDPTEFQEICTVDRSLGEVDPDVAHELVDMPASEGSSGIDENSKQTCASTINT 240
E QQPQ DDPTEFQEICTVDRSL EVDP VAHE+VDMPASEGSSGIDE+ ++ INT
Sbjct: 188 EQQQPQRDDPTEFQEICTVDRSLVEVDPGVAHEIVDMPASEGSSGIDESKQK-----INT 247
Query: 241 SVSISMEDKGQDDFSASGKHINDLVTCTQEGSGKLSSQKIEQQIKDLSENPVNTYIGNIE 300
SV+IS+EDKGQDDFSA GKHINDLVTCTQEGSGKLSSQKIEQQIKDLSENPVNTYIGNIE
Sbjct: 248 SVAISVEDKGQDDFSAYGKHINDLVTCTQEGSGKLSSQKIEQQIKDLSENPVNTYIGNIE 307
Query: 301 QVVNSHELNKEDQNRLLSPSVPADRLVIESSIATLQSHASMTLKGDCVFHSGSGEVTPEV 360
QVVNSH+ +KEDQN +LSPSVP DRLV+ESSIATLQS A+MTLKGDCVFHSGS EV PEV
Sbjct: 308 QVVNSHKSSKEDQNHVLSPSVPVDRLVVESSIATLQSDANMTLKGDCVFHSGSEEVMPEV 367
Query: 361 PSETDKFDDKVLCSNVEIGNPSKTNMHEVLPTVVKGDARAVVCAGEGKNINAEVCAFQGP 420
PS+TDKFDDKVLCSNVEIGNPSK NM EVLPTVVKGDAR V AGEGKNINAEVCAFQGP
Sbjct: 368 PSKTDKFDDKVLCSNVEIGNPSKENMCEVLPTVVKGDARTEVSAGEGKNINAEVCAFQGP 427
Query: 421 KIDSVGQMACAQEIISVDQQRFPSGIEIQTSKSESSASAMEKSNASKVGESSSGHIRDIP 480
KIDSVGQMACAQEIIS DQQ FPSG EIQTSKSE SASA+E+SNASKVGES+ GHIRDIP
Sbjct: 428 KIDSVGQMACAQEIISEDQQCFPSGTEIQTSKSEFSASAIEESNASKVGESNIGHIRDIP 487
Query: 481 DKFTKDEHGMISLRDVRGCTLPIEKNLYSEGHLPPTTVAESTQLCEENKLCQSGNDHVTH 540
DKFTKD HG IS RDVR CTLPIE NLYSEGHLPPTTVAESTQLCEE KLCQSGN HV H
Sbjct: 488 DKFTKDGHGFISSRDVRSCTLPIE-NLYSEGHLPPTTVAESTQLCEETKLCQSGNVHVEH 547
Query: 541 ASCKEEVRLSSDSISVNGKFAESPVRDKRIVSLSFQESDVESGMIDTKVEYSANA----V 600
ASCKEEVRLSSDSISVNGK AESPV+DKRIVSLSFQES VESG IDTK+EYSA A V
Sbjct: 548 ASCKEEVRLSSDSISVNGKIAESPVKDKRIVSLSFQESGVESGTIDTKLEYSAKAGDESV 607
Query: 601 SVSTFGDANVRTCDTLQGDSLPVVDALTDRKDADEKEDQLQPGVVEFTQSDSKEESGVII 660
SVSTF DANVRTCDT QGDSLPVVDALTD KDAD+KEDQLQP VVEFT SDSKEESGVII
Sbjct: 608 SVSTFEDANVRTCDTSQGDSLPVVDALTDIKDADDKEDQLQPAVVEFTPSDSKEESGVII 667
Query: 661 PAEGSFPLSDTSQPVGKFHPLSEAEKSACLLTGQGFGESIDQTISKNLNSDDCNRESQSI 720
PAEGSFPL DTSQP+GKFHPLSEAEKS C+LTGQGFGESIDQTISKN NSDDCNRESQSI
Sbjct: 668 PAEGSFPLLDTSQPMGKFHPLSEAEKSTCVLTGQGFGESIDQTISKNSNSDDCNRESQSI 727
Query: 721 PQADIPNNVIQDCGQEMDIDPAFSKSSAKACDSGVKKSDEKSFPPDATSLTPLPGETLDN 780
PQADIP+NVIQDC QEM IDPAFSKS+AKACDSGVKKSDEKS P DA SLTPLPGETLD+
Sbjct: 728 PQADIPSNVIQDCVQEMHIDPAFSKSTAKACDSGVKKSDEKSSPLDAKSLTPLPGETLDS 787
Query: 781 YQKDQESTKVVSESVGNNCQQAIAVNI-DSDAGKKEGSLCSADFPQSHEQMSVVGNGNST 840
YQKDQE+ +VVSESVGNNCQQAIAVNI DSDAGKKEGSLCSA FPQSHEQMSV+GNGNST
Sbjct: 788 YQKDQENIRVVSESVGNNCQQAIAVNIVDSDAGKKEGSLCSAAFPQSHEQMSVMGNGNST 847
Query: 841 ADKPPPNLPDVVKTTVVAHDPDVKDCNMRPASKNVEAAEAKDRLVGNTSSGSQLPKGNVA 900
ADKPPPNLPDVVKT VVAHDPDVKDCN PASKNVEAAEAKDRLVGN SSGS+LPKGN+A
Sbjct: 848 ADKPPPNLPDVVKTAVVAHDPDVKDCNKGPASKNVEAAEAKDRLVGNASSGSELPKGNIA 907
Query: 901 SESETALTFESSSLVDLPKNDSGIAVATAATASLVAEGPQSSSGLSKLDIKSAQDISHSS 960
SESETALTFES SL DLPKNDSGI VATAA+ASLV EGPQSSSGLSKLDIKSA++ISHSS
Sbjct: 908 SESETALTFESRSLEDLPKNDSGIVVATAASASLVVEGPQSSSGLSKLDIKSAEEISHSS 967
Query: 961 PHVSEVKVARARSKGTPERKPRRASAKGLGKESSTKGNHTKKSEKVEKSNSTPISNPGIF 1020
PHVSE KVARARSKGTPERKPRRA AKGLGKESSTKG+HTKKSEKVEKSNST I+NPGIF
Sbjct: 968 PHVSEAKVARARSKGTPERKPRRA-AKGLGKESSTKGSHTKKSEKVEKSNSTTINNPGIF 1027
Query: 1021 QLAQSNEMQHHGHVESSGAKPFVFIGASTSSIPDLNNSASPSPMFQQPFTDLQQVQLRAQ 1080
QLAQSNEMQHHGHVESSGAKPFVFIGAST+SIPDLNNSASPSPMFQQPFTDLQQVQLRAQ
Sbjct: 1028 QLAQSNEMQHHGHVESSGAKPFVFIGASTTSIPDLNNSASPSPMFQQPFTDLQQVQLRAQ 1087
Query: 1081 IFVYGALIQGTAPDEAYMLSAFGGPDGGTNLWENAWRMCVDRLNGKKSQPTNPETPSQSQ 1140
IFVYGALIQGTAPDEAYMLSAFGGPDGGTNLWENAWR CVDRLNGKKSQP NPETPSQSQ
Sbjct: 1088 IFVYGALIQGTAPDEAYMLSAFGGPDGGTNLWENAWRTCVDRLNGKKSQPINPETPSQSQ 1147
Query: 1141 SGGRSTEQANKQSTLQSKITSPPVSRVSSKSTSTVLNPMIPLSSPLWSISTPSNALQSSI 1200
SGGRSTEQA+KQSTLQSKI SPPVSRVSSKSTSTVL+PMIPLSSPLWSISTPSNALQSSI
Sbjct: 1148 SGGRSTEQASKQSTLQSKIISPPVSRVSSKSTSTVLSPMIPLSSPLWSISTPSNALQSSI 1207
Query: 1201 VPRSPVIDYQQALTPLHPYQTPPVRNFIGHNLSWFSQAPFHSTWVATQTSTPDSSARFSG 1260
VPRSPVIDYQQALTPLHPYQTPPVRNFIGHNLSWFSQAPFHSTWVATQTSTPDSSARFSG
Sbjct: 1208 VPRSPVIDYQQALTPLHPYQTPPVRNFIGHNLSWFSQAPFHSTWVATQTSTPDSSARFSG 1267
Query: 1261 LPITEPVHLTPVKESSVSQSSAVKPSGSMVHGGTPGNVFTGASPLLELKKVSVTTGQNST 1320
LPITEPVHLTPVKESSVSQSSA+KPSGSMVHGGTPGNV TGASPLLELKKVSVTTGQNST
Sbjct: 1268 LPITEPVHLTPVKESSVSQSSAMKPSGSMVHGGTPGNVLTGASPLLELKKVSVTTGQNST 1327
Query: 1321 ESKMRRRKKNTVSEDPGLITIQQVQPHLKPVPAVVTTTISTLATSPSVHPKAASENLILS 1380
+SKMRRRKKNTV+E+PGLIT+ QVQPHLKPVPAVVTTTISTLATSPSVHPK ASENLILS
Sbjct: 1328 DSKMRRRKKNTVAEEPGLITM-QVQPHLKPVPAVVTTTISTLATSPSVHPKGASENLILS 1387
Query: 1381 PPPLCPTTHAKSAGQDLRGRAMFSDETLGKVREAKQLAEDAALFASEAVKHSAEVWSQLD 1440
PPPLCPTTH KSAGQDLRGRAMFS+ETLGKVREAKQ+AEDAALFASEAVKHSAEVWSQLD
Sbjct: 1388 PPPLCPTTHPKSAGQDLRGRAMFSEETLGKVREAKQVAEDAALFASEAVKHSAEVWSQLD 1447
Query: 1441 RQKNSELVPDVEAKLASAAIAIAAAAAVAKAAAAAANVASNAACQAKLMADEAVTSSSHD 1500
RQKNSE V DVEAKLASAA+AIAAAAAVAKAAAAAANVASNAACQAKLMADEA TSSSHD
Sbjct: 1448 RQKNSEFVSDVEAKLASAAVAIAAAAAVAKAAAAAANVASNAACQAKLMADEAFTSSSHD 1507
Query: 1501 VPCQSNEFSIHGSAVGVGKATPASILRGEDGGNGSSSIIIAAREAARKRVEAASAASKHA 1560
VPCQSNEFS+HGSAVGVGKATPASILRGEDGGNGSSSII AAREAARKRVEAASAASKHA
Sbjct: 1508 VPCQSNEFSVHGSAVGVGKATPASILRGEDGGNGSSSIIFAAREAARKRVEAASAASKHA 1567
Query: 1561 ENVDAIVKAAELAAAAVSQAGKLVAMGDPLPLGKLVEAGPDGYWKTPQVSSELVMRSDDV 1620
ENVDAIVKAAELAAAAVSQAGKLVAM DPLPLGKLVEAGPDGYWKTPQVSSELVMRSDDV
Sbjct: 1568 ENVDAIVKAAELAAAAVSQAGKLVAMSDPLPLGKLVEAGPDGYWKTPQVSSELVMRSDDV 1627
Query: 1621 NGGRSNSAIKRPRDGSSSKNEIQVSGGAKSPIPGEISMGSVENHSKLVDGITSSVAPREK 1680
NGG SNSAIKRPRDGSSSKNEIQVS AKS IPGEIS+GSVENH KLVDGITS VAPREK
Sbjct: 1628 NGGCSNSAIKRPRDGSSSKNEIQVSVSAKSSIPGEISVGSVENHPKLVDGITSCVAPREK 1687
Query: 1681 DLRGQKDQNASDLTKTIGVVPESEVGERSSQDECEKAKDLRQSSIKEGSHVEVFKDGNGL 1740
DLRG KDQNASDLTKTIGVVPESEVGERSSQDECEKAKDL+QSSIKEGSHVEVFKDGNGL
Sbjct: 1688 DLRGLKDQNASDLTKTIGVVPESEVGERSSQDECEKAKDLKQSSIKEGSHVEVFKDGNGL 1747
Query: 1741 KASWFTASVLSLKEGKAYVSYTELQPEEGSGQLKEWVALDGQGGMAPRIRISRPMTNLRT 1800
KASWFTASVLSLKEGKAYVSYTELQPEEGSGQLKEWVALDGQGGMAPRIRISRPMT +RT
Sbjct: 1748 KASWFTASVLSLKEGKAYVSYTELQPEEGSGQLKEWVALDGQGGMAPRIRISRPMTTMRT 1807
Query: 1801 EGTRKRRRAAAGDYICC--DCI-------------------------------------- 1860
EGTRKRRRAAAGDYI D +
Sbjct: 1808 EGTRKRRRAAAGDYIWSVGDKVDAWMQNSWHEGVVVEKNAKDETTYIVRFPAQGETSTIK 1867
Query: 1861 ----------------------------QIVVPQEKRMKLGSPATEVKRKDKMPTIVEDV 1920
+IVVPQEKRMKLGSP EVKRKDKMPTIVEDV
Sbjct: 1868 AWNLRPSLIWKDGEWFEFSGSYVNDYSHEIVVPQEKRMKLGSPTAEVKRKDKMPTIVEDV 1927
Query: 1921 ESAKTEDPSLLLISANEKVFNIGRNTQTENKSNPLKTSRTGLQKGASRVIIGVPRPGKKR 1980
E AKT DPSLLLISANEKVFNIGRNTQ+ENKSNPLKTSRTGLQKGASRVIIGVPRPGKKR
Sbjct: 1928 ELAKTADPSLLLISANEKVFNIGRNTQSENKSNPLKTSRTGLQKGASRVIIGVPRPGKKR 1987
Query: 1981 KFMEVSKHYDADTRTTEANDSTKLAKYLMPHGSTSKVLKRTSKYDTKEKSANDARPLAVK 2040
KFMEVSKHYDADTRTTEANDSTKLAKYLMPHGSTSK LKRTSKY+TKEK+ NDA+PLAVK
Sbjct: 1988 KFMEVSKHYDADTRTTEANDSTKLAKYLMPHGSTSKGLKRTSKYETKEKTVNDAKPLAVK 2047
Query: 2041 SGKQPSVSDHAVITKDPESQNESTSGKNDQMDVPPFCSTEEAPEGSVLFPPAHAPKKASS 2100
SGKQPSVSDHAVITKD ESQNEST GKNDQMDVP FCSTEE PEGSVLFPPAHAPKKASS
Sbjct: 2048 SGKQPSVSDHAVITKDSESQNESTLGKNDQMDVPSFCSTEEVPEGSVLFPPAHAPKKASS 2107
Query: 2101 FHTKPERANKGKLAPAVGKLAKIEEEKVFNGNPTKPNSNVIEPRRSNRRIQPTSRLLEGL 2115
FHTKPERANKGKLAPAVGKL KIEEEKVFNGNPTKPNSNVIEPRRSNRRIQPTSRLLEGL
Sbjct: 2108 FHTKPERANKGKLAPAVGKLTKIEEEKVFNGNPTKPNSNVIEPRRSNRRIQPTSRLLEGL 2167
BLAST of ClCG10G002580 vs. NCBI nr
Match:
XP_038903706.1 (uncharacterized protein LOC120090225 isoform X3 [Benincasa hispida] >XP_038903707.1 uncharacterized protein LOC120090225 isoform X3 [Benincasa hispida] >XP_038903708.1 uncharacterized protein LOC120090225 isoform X3 [Benincasa hispida] >XP_038903709.1 uncharacterized protein LOC120090225 isoform X3 [Benincasa hispida])
HSP 1 Score: 3644.7 bits (9450), Expect = 0.0e+00
Identity = 1959/2188 (89.53%), Postives = 2014/2188 (92.05%), Query Frame = 0
Query: 1 MDYDDNDFQSQNLHLAGEGSAKFPPVLRQYALPKFDFDDTLQGHVRFDSLVEPEVFLGIE 60
MDYDDNDFQSQNLHLAGEGSAKFPPVLRQYALPKFDFDDTLQGHVRFDSLVEPEVFLGIE
Sbjct: 1 MDYDDNDFQSQNLHLAGEGSAKFPPVLRQYALPKFDFDDTLQGHVRFDSLVEPEVFLGIE 60
Query: 61 NSEDNQWIEDYSRGSSGIGFTSCAAESCSILRRKNVWSEATSSESVEMLLKSVGQEDINL 120
N+EDNQWIEDYSRGSSGIGFTSCAAESCSILRRKNVWSEATSSESVEMLLKSVGQEDINL
Sbjct: 61 NNEDNQWIEDYSRGSSGIGFTSCAAESCSILRRKNVWSEATSSESVEMLLKSVGQEDINL 120
Query: 121 APAVTGESNAREKLDYLTNPMDPTLKDDGSSFCEMGDLQPTLLSNISLEELHVVNEDIRG 180
AP VTGESNAREKLDYLTNPMDPTLKDDGSSF EMGDLQPTLLSNISLEELHVVNEDIRG
Sbjct: 121 APTVTGESNAREKLDYLTNPMDPTLKDDGSSFGEMGDLQPTLLSNISLEELHVVNEDIRG 180
Query: 181 E-QQPQGDDPTEFQEICTVDRSLGEVDPDVAHELVDMPASEGSSGIDENSKQTCASTINT 240
E QQPQ DDPTEFQEICTVDRSL EVDP VAHE+VDMPASEGSSGIDE+ ++ INT
Sbjct: 181 EQQQPQRDDPTEFQEICTVDRSLVEVDPGVAHEIVDMPASEGSSGIDESKQK-----INT 240
Query: 241 SVSISMEDKGQDDFSASGKHINDLVTCTQEGSGKLSSQKIEQQIKDLSENPVNTYIGNIE 300
SV+IS+EDKGQDDFSA GKHINDLVTCTQEGSGKLSSQKIEQQIKDLSENPVNTYIGNIE
Sbjct: 241 SVAISVEDKGQDDFSAYGKHINDLVTCTQEGSGKLSSQKIEQQIKDLSENPVNTYIGNIE 300
Query: 301 QVVNSHELNKEDQNRLLSPSVPADRLVIESSIATLQSHASMTLKGDCVFHSGSGEVTPEV 360
QVVNSH+ +KEDQN +LSPSVP DRLV+ESSIATLQS A+MTLKGDCVFHSGS EV PEV
Sbjct: 301 QVVNSHKSSKEDQNHVLSPSVPVDRLVVESSIATLQSDANMTLKGDCVFHSGSEEVMPEV 360
Query: 361 PSETDKFDDKVLCSNVEIGNPSKTNMHEVLPTVVKGDARAVVCAGEGKNINAEVCAFQGP 420
PS+TDKFDDKVLCSNVEIGNPSK NM EVLPTVVKGDAR V AGEGKNINAEVCAFQGP
Sbjct: 361 PSKTDKFDDKVLCSNVEIGNPSKENMCEVLPTVVKGDARTEVSAGEGKNINAEVCAFQGP 420
Query: 421 KIDSVGQMACAQEIISVDQQRFPSGIEIQTSKSESSASAMEKSNASKVGESSSGHIRDIP 480
KIDSVGQMACAQEIIS DQQ FPSG EIQTSKSE SASA+E+SNASKVGES+ GHIRDIP
Sbjct: 421 KIDSVGQMACAQEIISEDQQCFPSGTEIQTSKSEFSASAIEESNASKVGESNIGHIRDIP 480
Query: 481 DKFTKDEHGMISLRDVRGCTLPIEKNLYSEGHLPPTTVAESTQLCEENKLCQSGNDHVTH 540
DKFTKD HG IS RDVR CTLPIE NLYSEGHLPPTTVAESTQLCEE KLCQSGN HV H
Sbjct: 481 DKFTKDGHGFISSRDVRSCTLPIE-NLYSEGHLPPTTVAESTQLCEETKLCQSGNVHVEH 540
Query: 541 ASCKEEVRLSSDSISVNGKFAESPVRDKRIVSLSFQESDVESGMIDTKVEYSANA----V 600
ASCKEEVRLSSDSISVNGK AESPV+DKRIVSLSFQES VESG IDTK+EYSA A V
Sbjct: 541 ASCKEEVRLSSDSISVNGKIAESPVKDKRIVSLSFQESGVESGTIDTKLEYSAKAGDESV 600
Query: 601 SVSTFGDANVRTCDTLQGDSLPVVDALTDRKDADEKEDQLQPGVVEFTQSDSKEESGVII 660
SVSTF DANVRTCDT QGDSLPVVDALTD KDAD+KEDQLQP VVEFT SDSKEESGVII
Sbjct: 601 SVSTFEDANVRTCDTSQGDSLPVVDALTDIKDADDKEDQLQPAVVEFTPSDSKEESGVII 660
Query: 661 PAEGSFPLSDTSQPVGKFHPLSEAEKSACLLTGQGFGESIDQTISKNLNSDDCNRESQSI 720
PAEGSFPL DTSQP+GKFHPLSEAEKS C+LTGQGFGESIDQTISKN NSDDCNRESQSI
Sbjct: 661 PAEGSFPLLDTSQPMGKFHPLSEAEKSTCVLTGQGFGESIDQTISKNSNSDDCNRESQSI 720
Query: 721 PQADIPNNVIQDCGQEMDIDPAFSKSSAKACDSGVKKSDEKSFPPDATSLTPLPGETLDN 780
PQADIP+NVIQDC QEM IDPAFSKS+AKACDSGVKKSDEKS P DA SLTPLPGETLD+
Sbjct: 721 PQADIPSNVIQDCVQEMHIDPAFSKSTAKACDSGVKKSDEKSSPLDAKSLTPLPGETLDS 780
Query: 781 YQKDQESTKVVSESVGNNCQQAIAVNI-DSDAGKKEGSLCSADFPQSHEQMSVVGNGNST 840
YQKDQE+ +VVSESVGNNCQQAIAVNI DSDAGKKEGSLCSA FPQSHEQMSV+GNGNST
Sbjct: 781 YQKDQENIRVVSESVGNNCQQAIAVNIVDSDAGKKEGSLCSAAFPQSHEQMSVMGNGNST 840
Query: 841 ADKPPPNLPDVVKTTVVAHDPDVKDCNMRPASKNVEAAEAKDRLVGNTSSGSQLPKGNVA 900
ADKPPPNLPDVVKT VVAHDPDVKDCN PASKNVEAAEAKDRLVGN SSGS+LPKGN+A
Sbjct: 841 ADKPPPNLPDVVKTAVVAHDPDVKDCNKGPASKNVEAAEAKDRLVGNASSGSELPKGNIA 900
Query: 901 SESETALTFESSSLVDLPKNDSGIAVATAATASLVAEGPQSSSGLSKLDIKSAQDISHSS 960
SESETALTFES SL DLPKNDSGI VATAA+ASLV EGPQSSSGLSKLDIKSA++ISHSS
Sbjct: 901 SESETALTFESRSLEDLPKNDSGIVVATAASASLVVEGPQSSSGLSKLDIKSAEEISHSS 960
Query: 961 PHVSEVKVARARSKGTPERKPRRASAKGLGKESSTKGNHTKKSEKVEKSNSTPISNPGIF 1020
PHVSE KVARARSKGTPERKPRRA AKGLGKESSTKG+HTKKSEKVEKSNST I+NPGIF
Sbjct: 961 PHVSEAKVARARSKGTPERKPRRA-AKGLGKESSTKGSHTKKSEKVEKSNSTTINNPGIF 1020
Query: 1021 QLAQSNEMQHHGHVESSGAKPFVFIGASTSSIPDLNNSASPSPMFQQPFTDLQQVQLRAQ 1080
QLAQSNEMQHHGHVESSGAKPFVFIGAST+SIPDLNNSASPSPMFQQPFTDLQQVQLRAQ
Sbjct: 1021 QLAQSNEMQHHGHVESSGAKPFVFIGASTTSIPDLNNSASPSPMFQQPFTDLQQVQLRAQ 1080
Query: 1081 IFVYGALIQGTAPDEAYMLSAFGGPDGGTNLWENAWRMCVDRLNGKKSQPTNPETPSQSQ 1140
IFVYGALIQGTAPDEAYMLSAFGGPDGGTNLWENAWR CVDRLNGKKSQP NPETPSQSQ
Sbjct: 1081 IFVYGALIQGTAPDEAYMLSAFGGPDGGTNLWENAWRTCVDRLNGKKSQPINPETPSQSQ 1140
Query: 1141 SGGRSTEQANKQSTLQSKITSPPVSRVSSKSTSTVLNPMIPLSSPLWSISTPSNALQSSI 1200
SGGRSTEQA+KQSTLQSKI SPPVSRVSSKSTSTVL+PMIPLSSPLWSISTPSNALQSSI
Sbjct: 1141 SGGRSTEQASKQSTLQSKIISPPVSRVSSKSTSTVLSPMIPLSSPLWSISTPSNALQSSI 1200
Query: 1201 VPRSPVIDYQQALTPLHPYQTPPVRNFIGHNLSWFSQAPFHSTWVATQTSTPDSSARFSG 1260
VPRSPVIDYQQALTPLHPYQTPPVRNFIGHNLSWFSQAPFHSTWVATQTSTPDSSARFSG
Sbjct: 1201 VPRSPVIDYQQALTPLHPYQTPPVRNFIGHNLSWFSQAPFHSTWVATQTSTPDSSARFSG 1260
Query: 1261 LPITEPVHLTPVKESSVSQSSAVKPSGSMVHGGTPGNVFTGASPLLELKKVSVTTGQNST 1320
LPITEPVHLTPVKESSVSQSSA+KPSGSMVHGGTPGNV TGASPLLELKKVSVTTGQNST
Sbjct: 1261 LPITEPVHLTPVKESSVSQSSAMKPSGSMVHGGTPGNVLTGASPLLELKKVSVTTGQNST 1320
Query: 1321 ESKMRRRKKNTVSEDPGLITIQQVQPHLKPVPAVVTTTISTLATSPSVHPKAASENLILS 1380
+SKMRRRKKNTV+E+PGLIT+ QVQPHLKPVPAVVTTTISTLATSPSVHPK ASENLILS
Sbjct: 1321 DSKMRRRKKNTVAEEPGLITM-QVQPHLKPVPAVVTTTISTLATSPSVHPKGASENLILS 1380
Query: 1381 PPPLCPTTHAKSAGQDLRGRAMFSDETLGKVREAKQLAEDAALFASEAVKHSAEVWSQLD 1440
PPPLCPTTH KSAGQDLRGRAMFS+ETLGKVREAKQ+AEDAALFASEAVKHSAEVWSQLD
Sbjct: 1381 PPPLCPTTHPKSAGQDLRGRAMFSEETLGKVREAKQVAEDAALFASEAVKHSAEVWSQLD 1440
Query: 1441 RQKNSELVPDVEAKLASAAIAIAAAAAVAKAAAAAANVASNAACQAKLMADEAVTSSSHD 1500
RQKNSE V DVEAKLASAA+AIAAAAAVAKAAAAAANVASNAACQAKLMADEA TSSSHD
Sbjct: 1441 RQKNSEFVSDVEAKLASAAVAIAAAAAVAKAAAAAANVASNAACQAKLMADEAFTSSSHD 1500
Query: 1501 VPCQSNEFSIHGSAVGVGKATPASILRGEDGGNGSSSIIIAAREAARKRVEAASAASKHA 1560
VPCQSNEFS+HGSAVGVGKATPASILRGEDGGNGSSSII AAREAARKRVEAASAASKHA
Sbjct: 1501 VPCQSNEFSVHGSAVGVGKATPASILRGEDGGNGSSSIIFAAREAARKRVEAASAASKHA 1560
Query: 1561 ENVDAIVKAAELAAAAVSQAGKLVAMGDPLPLGKLVEAGPDGYWKTPQVSSELVMRSDDV 1620
ENVDAIVKAAELAAAAVSQAGKLVAM DPLPLGKLVEAGPDGYWKTPQVSSELVMRSDDV
Sbjct: 1561 ENVDAIVKAAELAAAAVSQAGKLVAMSDPLPLGKLVEAGPDGYWKTPQVSSELVMRSDDV 1620
Query: 1621 NGGRSNSAIKRPRDGSSSKNEIQVSGGAKSPIPGEISMGSVENHSKLVDGITSSVAPREK 1680
NGG SNSAIKRPRDGSSSKNEIQVS AKS IPGEIS+GSVENH KLVDGITS VAPREK
Sbjct: 1621 NGGCSNSAIKRPRDGSSSKNEIQVSVSAKSSIPGEISVGSVENHPKLVDGITSCVAPREK 1680
Query: 1681 DLRGQKDQNASDLTKTIGVVPESEVGERSSQDECEKAKDLRQSSIKEGSHVEVFKDGNGL 1740
DLRG KDQNASDLTKTIGVVPESEVGERSSQDECEKAKDL+QSSIKEGSHVEVFKDGNGL
Sbjct: 1681 DLRGLKDQNASDLTKTIGVVPESEVGERSSQDECEKAKDLKQSSIKEGSHVEVFKDGNGL 1740
Query: 1741 KASWFTASVLSLKEGKAYVSYTELQPEEGSGQLKEWVALDGQGGMAPRIRISRPMTNLRT 1800
KASWFTASVLSLKEGKAYVSYTELQPEEGSGQLKEWVALDGQGGMAPRIRISRPMT +RT
Sbjct: 1741 KASWFTASVLSLKEGKAYVSYTELQPEEGSGQLKEWVALDGQGGMAPRIRISRPMTTMRT 1800
Query: 1801 EGTRKRRRAAAGDYICC--DCI-------------------------------------- 1860
EGTRKRRRAAAGDYI D +
Sbjct: 1801 EGTRKRRRAAAGDYIWSVGDKVDAWMQNSWHEGVVVEKNAKDETTYIVRFPAQGETSTIK 1860
Query: 1861 ----------------------------QIVVPQEKRMKLGSPATEVKRKDKMPTIVEDV 1920
+IVVPQEKRMKLGSP EVKRKDKMPTIVEDV
Sbjct: 1861 AWNLRPSLIWKDGEWFEFSGSYVNDYSHEIVVPQEKRMKLGSPTAEVKRKDKMPTIVEDV 1920
Query: 1921 ESAKTEDPSLLLISANEKVFNIGRNTQTENKSNPLKTSRTGLQKGASRVIIGVPRPGKKR 1980
E AKT DPSLLLISANEKVFNIGRNTQ+ENKSNPLKTSRTGLQKGASRVIIGVPRPGKKR
Sbjct: 1921 ELAKTADPSLLLISANEKVFNIGRNTQSENKSNPLKTSRTGLQKGASRVIIGVPRPGKKR 1980
Query: 1981 KFMEVSKHYDADTRTTEANDSTKLAKYLMPHGSTSKVLKRTSKYDTKEKSANDARPLAVK 2040
KFMEVSKHYDADTRTTEANDSTKLAKYLMPHGSTSK LKRTSKY+TKEK+ NDA+PLAVK
Sbjct: 1981 KFMEVSKHYDADTRTTEANDSTKLAKYLMPHGSTSKGLKRTSKYETKEKTVNDAKPLAVK 2040
Query: 2041 SGKQPSVSDHAVITKDPESQNESTSGKNDQMDVPPFCSTEEAPEGSVLFPPAHAPKKASS 2100
SGKQPSVSDHAVITKD ESQNEST GKNDQMDVP FCSTEE PEGSVLFPPAHAPKKASS
Sbjct: 2041 SGKQPSVSDHAVITKDSESQNESTLGKNDQMDVPSFCSTEEVPEGSVLFPPAHAPKKASS 2100
Query: 2101 FHTKPERANKGKLAPAVGKLAKIEEEKVFNGNPTKPNSNVIEPRRSNRRIQPTSRLLEGL 2115
FHTKPERANKGKLAPAVGKL KIEEEKVFNGNPTKPNSNVIEPRRSNRRIQPTSRLLEGL
Sbjct: 2101 FHTKPERANKGKLAPAVGKLTKIEEEKVFNGNPTKPNSNVIEPRRSNRRIQPTSRLLEGL 2160
BLAST of ClCG10G002580 vs. NCBI nr
Match:
XP_038903705.1 (uncharacterized protein LOC120090225 isoform X2 [Benincasa hispida])
HSP 1 Score: 3612.0 bits (9365), Expect = 0.0e+00
Identity = 1946/2188 (88.94%), Postives = 2001/2188 (91.45%), Query Frame = 0
Query: 1 MDYDDNDFQSQNLHLAGEGSAKFPPVLRQYALPKFDFDDTLQGHVRFDSLVEPEVFLGIE 60
MDYDDNDFQSQNLHLAGEGSAKFPPVLRQYALPKFDFDDTLQGHVRFDSLVEPEVFLGIE
Sbjct: 8 MDYDDNDFQSQNLHLAGEGSAKFPPVLRQYALPKFDFDDTLQGHVRFDSLVEPEVFLGIE 67
Query: 61 NSEDNQWIEDYSRGSSGIGFTSCAAESCSILRRKNVWSEATSSESVEMLLKSVGQEDINL 120
N+EDNQWIEDYSRGSSGIGFTSCAAESCSILRRKNVWSEATSSESVEMLLKSVGQEDINL
Sbjct: 68 NNEDNQWIEDYSRGSSGIGFTSCAAESCSILRRKNVWSEATSSESVEMLLKSVGQEDINL 127
Query: 121 APAVTGESNAREKLDYLTNPMDPTLKDDGSSFCEMGDLQPTLLSNISLEELHVVNEDIRG 180
AP VTGESNAREKLDYLTNPMDPTLKDDGSSF EMGDLQPTLLSNISLEELHVVNEDIRG
Sbjct: 128 APTVTGESNAREKLDYLTNPMDPTLKDDGSSFGEMGDLQPTLLSNISLEELHVVNEDIRG 187
Query: 181 E-QQPQGDDPTEFQEICTVDRSLGEVDPDVAHELVDMPASEGSSGIDENSKQTCASTINT 240
E QQPQ DDPTEFQEICTVDRSL EVDP VAHE+VDMPASEGSSGIDE+ ++ INT
Sbjct: 188 EQQQPQRDDPTEFQEICTVDRSLVEVDPGVAHEIVDMPASEGSSGIDESKQK-----INT 247
Query: 241 SVSISMEDKGQDDFSASGKHINDLVTCTQEGSGKLSSQKIEQQIKDLSENPVNTYIGNIE 300
SV+IS+EDKGQDDFSA GKHINDLVTCTQEGSGKLSSQKIEQQIKDLSENPVNTYIGNIE
Sbjct: 248 SVAISVEDKGQDDFSAYGKHINDLVTCTQEGSGKLSSQKIEQQIKDLSENPVNTYIGNIE 307
Query: 301 QVVNSHELNKEDQNRLLSPSVPADRLVIESSIATLQSHASMTLKGDCVFHSGSGEVTPEV 360
QVVNSH+ +KEDQN +LSPSVP DRLV+ESSIATLQS A+MTLKGDCVFHSGS EV PEV
Sbjct: 308 QVVNSHKSSKEDQNHVLSPSVPVDRLVVESSIATLQSDANMTLKGDCVFHSGSEEVMPEV 367
Query: 361 PSETDKFDDKVLCSNVEIGNPSKTNMHEVLPTVVKGDARAVVCAGEGKNINAEVCAFQGP 420
PS+TDKFDDKVLCSNVEIGNPSK NM EVLPTVVKGDAR V AGEGKNINAEVCAFQGP
Sbjct: 368 PSKTDKFDDKVLCSNVEIGNPSKENMCEVLPTVVKGDARTEVSAGEGKNINAEVCAFQGP 427
Query: 421 KIDSVGQMACAQEIISVDQQRFPSGIEIQTSKSESSASAMEKSNASKVGESSSGHIRDIP 480
KIDSVGQMACAQEIIS DQQ FPSG EIQTSKSE SASA+E+SNASKVGES+ GHIRDIP
Sbjct: 428 KIDSVGQMACAQEIISEDQQCFPSGTEIQTSKSEFSASAIEESNASKVGESNIGHIRDIP 487
Query: 481 DKFTKDEHGMISLRDVRGCTLPIEKNLYSEGHLPPTTVAESTQLCEENKLCQSGNDHVTH 540
DKFTKD HG IS RDVR CTLPIE NLYSEGHLPPTTVAESTQLCEE KLCQSGN HV H
Sbjct: 488 DKFTKDGHGFISSRDVRSCTLPIE-NLYSEGHLPPTTVAESTQLCEETKLCQSGNVHVEH 547
Query: 541 ASCKEEVRLSSDSISVNGKFAESPVRDKRIVSLSFQESDVESGMIDTKVEYSANA----V 600
ASCKEEVRLSSDSISVNGK AESPV+DKRIVSLSFQES VESG IDTK+EYSA A V
Sbjct: 548 ASCKEEVRLSSDSISVNGKIAESPVKDKRIVSLSFQESGVESGTIDTKLEYSAKAGDESV 607
Query: 601 SVSTFGDANVRTCDTLQGDSLPVVDALTDRKDADEKEDQLQPGVVEFTQSDSKEESGVII 660
SVSTF DANVRTCDT QGDSLPVVDALTD KDAD+KEDQLQP VVEFT SDSKEESGVII
Sbjct: 608 SVSTFEDANVRTCDTSQGDSLPVVDALTDIKDADDKEDQLQPAVVEFTPSDSKEESGVII 667
Query: 661 PAEGSFPLSDTSQPVGKFHPLSEAEKSACLLTGQGFGESIDQTISKNLNSDDCNRESQSI 720
PAEGSFPL DTSQP+GKFHPLSEAEKS C+LTGQGFGESIDQTISKN NSDDCNRESQSI
Sbjct: 668 PAEGSFPLLDTSQPMGKFHPLSEAEKSTCVLTGQGFGESIDQTISKNSNSDDCNRESQSI 727
Query: 721 PQADIPNNVIQDCGQEMDIDPAFSKSSAKACDSGVKKSDEKSFPPDATSLTPLPGETLDN 780
PQADIP+NVIQDC QEM IDPAFSKS+AKACDSGVKKS GETLD+
Sbjct: 728 PQADIPSNVIQDCVQEMHIDPAFSKSTAKACDSGVKKS----------------GETLDS 787
Query: 781 YQKDQESTKVVSESVGNNCQQAIAVNI-DSDAGKKEGSLCSADFPQSHEQMSVVGNGNST 840
YQKDQE+ +VVSESVGNNCQQAIAVNI DSDAGKKEGSLCSA FPQSHEQMSV+GNGNST
Sbjct: 788 YQKDQENIRVVSESVGNNCQQAIAVNIVDSDAGKKEGSLCSAAFPQSHEQMSVMGNGNST 847
Query: 841 ADKPPPNLPDVVKTTVVAHDPDVKDCNMRPASKNVEAAEAKDRLVGNTSSGSQLPKGNVA 900
ADKPPPNLPDVVKT VVAHDPDVKDCN PASKNVEAAEAKDRLVGN SSGS+LPKGN+A
Sbjct: 848 ADKPPPNLPDVVKTAVVAHDPDVKDCNKGPASKNVEAAEAKDRLVGNASSGSELPKGNIA 907
Query: 901 SESETALTFESSSLVDLPKNDSGIAVATAATASLVAEGPQSSSGLSKLDIKSAQDISHSS 960
SESETALTFES SL DLPKNDSGI VATAA+ASLV EGPQSSSGLSKLDIKSA++ISHSS
Sbjct: 908 SESETALTFESRSLEDLPKNDSGIVVATAASASLVVEGPQSSSGLSKLDIKSAEEISHSS 967
Query: 961 PHVSEVKVARARSKGTPERKPRRASAKGLGKESSTKGNHTKKSEKVEKSNSTPISNPGIF 1020
PHVSE KVARARSKGTPERKPRRA AKGLGKESSTKG+HTKKSEKVEKSNST I+NPGIF
Sbjct: 968 PHVSEAKVARARSKGTPERKPRRA-AKGLGKESSTKGSHTKKSEKVEKSNSTTINNPGIF 1027
Query: 1021 QLAQSNEMQHHGHVESSGAKPFVFIGASTSSIPDLNNSASPSPMFQQPFTDLQQVQLRAQ 1080
QLAQSNEMQHHGHVESSGAKPFVFIGAST+SIPDLNNSASPSPMFQQPFTDLQQVQLRAQ
Sbjct: 1028 QLAQSNEMQHHGHVESSGAKPFVFIGASTTSIPDLNNSASPSPMFQQPFTDLQQVQLRAQ 1087
Query: 1081 IFVYGALIQGTAPDEAYMLSAFGGPDGGTNLWENAWRMCVDRLNGKKSQPTNPETPSQSQ 1140
IFVYGALIQGTAPDEAYMLSAFGGPDGGTNLWENAWR CVDRLNGKKSQP NPETPSQSQ
Sbjct: 1088 IFVYGALIQGTAPDEAYMLSAFGGPDGGTNLWENAWRTCVDRLNGKKSQPINPETPSQSQ 1147
Query: 1141 SGGRSTEQANKQSTLQSKITSPPVSRVSSKSTSTVLNPMIPLSSPLWSISTPSNALQSSI 1200
SGGRSTEQA+KQSTLQSKI SPPVSRVSSKSTSTVL+PMIPLSSPLWSISTPSNALQSSI
Sbjct: 1148 SGGRSTEQASKQSTLQSKIISPPVSRVSSKSTSTVLSPMIPLSSPLWSISTPSNALQSSI 1207
Query: 1201 VPRSPVIDYQQALTPLHPYQTPPVRNFIGHNLSWFSQAPFHSTWVATQTSTPDSSARFSG 1260
VPRSPVIDYQQALTPLHPYQTPPVRNFIGHNLSWFSQAPFHSTWVATQTSTPDSSARFSG
Sbjct: 1208 VPRSPVIDYQQALTPLHPYQTPPVRNFIGHNLSWFSQAPFHSTWVATQTSTPDSSARFSG 1267
Query: 1261 LPITEPVHLTPVKESSVSQSSAVKPSGSMVHGGTPGNVFTGASPLLELKKVSVTTGQNST 1320
LPITEPVHLTPVKESSVSQSSA+KPSGSMVHGGTPGNV TGASPLLELKKVSVTTGQNST
Sbjct: 1268 LPITEPVHLTPVKESSVSQSSAMKPSGSMVHGGTPGNVLTGASPLLELKKVSVTTGQNST 1327
Query: 1321 ESKMRRRKKNTVSEDPGLITIQQVQPHLKPVPAVVTTTISTLATSPSVHPKAASENLILS 1380
+SKMRRRKKNTV+E+PGLIT+ QVQPHLKPVPAVVTTTISTLATSPSVHPK ASENLILS
Sbjct: 1328 DSKMRRRKKNTVAEEPGLITM-QVQPHLKPVPAVVTTTISTLATSPSVHPKGASENLILS 1387
Query: 1381 PPPLCPTTHAKSAGQDLRGRAMFSDETLGKVREAKQLAEDAALFASEAVKHSAEVWSQLD 1440
PPPLCPTTH KSAGQDLRGRAMFS+ETLGKVREAKQ+AEDAALFASEAVKHSAEVWSQLD
Sbjct: 1388 PPPLCPTTHPKSAGQDLRGRAMFSEETLGKVREAKQVAEDAALFASEAVKHSAEVWSQLD 1447
Query: 1441 RQKNSELVPDVEAKLASAAIAIAAAAAVAKAAAAAANVASNAACQAKLMADEAVTSSSHD 1500
RQKNSE V DVEAKLASAA+AIAAAAAVAKAAAAAANVASNAACQAKLMADEA TSSSHD
Sbjct: 1448 RQKNSEFVSDVEAKLASAAVAIAAAAAVAKAAAAAANVASNAACQAKLMADEAFTSSSHD 1507
Query: 1501 VPCQSNEFSIHGSAVGVGKATPASILRGEDGGNGSSSIIIAAREAARKRVEAASAASKHA 1560
VPCQSNEFS+HGSAVGVGKATPASILRGEDGGNGSSSII AAREAARKRVEAASAASKHA
Sbjct: 1508 VPCQSNEFSVHGSAVGVGKATPASILRGEDGGNGSSSIIFAAREAARKRVEAASAASKHA 1567
Query: 1561 ENVDAIVKAAELAAAAVSQAGKLVAMGDPLPLGKLVEAGPDGYWKTPQVSSELVMRSDDV 1620
ENVDAIVKAAELAAAAVSQAGKLVAM DPLPLGKLVEAGPDGYWKTPQVSSELVMRSDDV
Sbjct: 1568 ENVDAIVKAAELAAAAVSQAGKLVAMSDPLPLGKLVEAGPDGYWKTPQVSSELVMRSDDV 1627
Query: 1621 NGGRSNSAIKRPRDGSSSKNEIQVSGGAKSPIPGEISMGSVENHSKLVDGITSSVAPREK 1680
NGG SNSAIKRPRDGSSSKNEIQVS AKS IPGEIS+GSVENH KLVDGITS VAPREK
Sbjct: 1628 NGGCSNSAIKRPRDGSSSKNEIQVSVSAKSSIPGEISVGSVENHPKLVDGITSCVAPREK 1687
Query: 1681 DLRGQKDQNASDLTKTIGVVPESEVGERSSQDECEKAKDLRQSSIKEGSHVEVFKDGNGL 1740
DLRG KDQNASDLTKTIGVVPESEVGERSSQDECEKAKDL+QSSIKEGSHVEVFKDGNGL
Sbjct: 1688 DLRGLKDQNASDLTKTIGVVPESEVGERSSQDECEKAKDLKQSSIKEGSHVEVFKDGNGL 1747
Query: 1741 KASWFTASVLSLKEGKAYVSYTELQPEEGSGQLKEWVALDGQGGMAPRIRISRPMTNLRT 1800
KASWFTASVLSLKEGKAYVSYTELQPEEGSGQLKEWVALDGQGGMAPRIRISRPMT +RT
Sbjct: 1748 KASWFTASVLSLKEGKAYVSYTELQPEEGSGQLKEWVALDGQGGMAPRIRISRPMTTMRT 1807
Query: 1801 EGTRKRRRAAAGDYICC--DCI-------------------------------------- 1860
EGTRKRRRAAAGDYI D +
Sbjct: 1808 EGTRKRRRAAAGDYIWSVGDKVDAWMQNSWHEGVVVEKNAKDETTYIVRFPAQGETSTIK 1867
Query: 1861 ----------------------------QIVVPQEKRMKLGSPATEVKRKDKMPTIVEDV 1920
+IVVPQEKRMKLGSP EVKRKDKMPTIVEDV
Sbjct: 1868 AWNLRPSLIWKDGEWFEFSGSYVNDYSHEIVVPQEKRMKLGSPTAEVKRKDKMPTIVEDV 1927
Query: 1921 ESAKTEDPSLLLISANEKVFNIGRNTQTENKSNPLKTSRTGLQKGASRVIIGVPRPGKKR 1980
E AKT DPSLLLISANEKVFNIGRNTQ+ENKSNPLKTSRTGLQKGASRVIIGVPRPGKKR
Sbjct: 1928 ELAKTADPSLLLISANEKVFNIGRNTQSENKSNPLKTSRTGLQKGASRVIIGVPRPGKKR 1987
Query: 1981 KFMEVSKHYDADTRTTEANDSTKLAKYLMPHGSTSKVLKRTSKYDTKEKSANDARPLAVK 2040
KFMEVSKHYDADTRTTEANDSTKLAKYLMPHGSTSK LKRTSKY+TKEK+ NDA+PLAVK
Sbjct: 1988 KFMEVSKHYDADTRTTEANDSTKLAKYLMPHGSTSKGLKRTSKYETKEKTVNDAKPLAVK 2047
Query: 2041 SGKQPSVSDHAVITKDPESQNESTSGKNDQMDVPPFCSTEEAPEGSVLFPPAHAPKKASS 2100
SGKQPSVSDHAVITKD ESQNEST GKNDQMDVP FCSTEE PEGSVLFPPAHAPKKASS
Sbjct: 2048 SGKQPSVSDHAVITKDSESQNESTLGKNDQMDVPSFCSTEEVPEGSVLFPPAHAPKKASS 2107
Query: 2101 FHTKPERANKGKLAPAVGKLAKIEEEKVFNGNPTKPNSNVIEPRRSNRRIQPTSRLLEGL 2115
FHTKPERANKGKLAPAVGKL KIEEEKVFNGNPTKPNSNVIEPRRSNRRIQPTSRLLEGL
Sbjct: 2108 FHTKPERANKGKLAPAVGKLTKIEEEKVFNGNPTKPNSNVIEPRRSNRRIQPTSRLLEGL 2167
BLAST of ClCG10G002580 vs. NCBI nr
Match:
KAA0041075.1 (Agenet domain-containing protein [Cucumis melo var. makuwa])
HSP 1 Score: 3463.3 bits (8979), Expect = 0.0e+00
Identity = 1897/2251 (84.27%), Postives = 1974/2251 (87.69%), Query Frame = 0
Query: 1 MDYDDNDFQSQNLHLAGEGSAKFPPVLRQYALPKFDFDDTLQGHVRFDSLVEPEVFLGIE 60
MDYDDNDFQSQNLHLAGEGSAKFPPVLRQYALPKFDFDDTLQGHVRFDSLVEPEVFLGIE
Sbjct: 1 MDYDDNDFQSQNLHLAGEGSAKFPPVLRQYALPKFDFDDTLQGHVRFDSLVEPEVFLGIE 60
Query: 61 NSEDNQWIEDYSRGSSGIGFTSCAAESCSILRRKNVWSEATSSESVEMLLKSVGQEDINL 120
N+ED QWIEDYSR SSGIGFTSCAAESCSILRRKNVWSEATSSESVEMLLKSVGQEDINL
Sbjct: 61 NNEDTQWIEDYSRVSSGIGFTSCAAESCSILRRKNVWSEATSSESVEMLLKSVGQEDINL 120
Query: 121 APAVTGESNAREKLDYLTNPMDPTLKDDGSSFCEMGDLQPTLLSNISLEELHVVNEDIRG 180
VTGESNAREKLDYLTNPMDPTLKDDGSSFCEMGDLQPTLLSNISLEELHVVNEDIRG
Sbjct: 121 TATVTGESNAREKLDYLTNPMDPTLKDDGSSFCEMGDLQPTLLSNISLEELHVVNEDIRG 180
Query: 181 E-QQPQGDDPTEFQEICTVDRSLGEVDPDVAHELVDMPASEGSSGIDENSKQTCASTINT 240
E QQPQ D+PTEFQEICTVDRSLGEVDP VAHELVDMP SEGSSGIDENSK+ CA+TINT
Sbjct: 181 EQQQPQRDNPTEFQEICTVDRSLGEVDPGVAHELVDMPTSEGSSGIDENSKKPCANTINT 240
Query: 241 SVSISMEDKGQDDFSASGKHINDLVTCTQEGSGKLSSQKIEQQIKDLSENPVNTYIGNIE 300
V++ +EDKGQDDFSASGKHI+DLVTC EGSGKL QKIEQQIKDLS+NPVNT +GNIE
Sbjct: 241 PVALLVEDKGQDDFSASGKHISDLVTCAHEGSGKLGGQKIEQQIKDLSKNPVNTSVGNIE 300
Query: 301 QVVNSHELNKEDQNRLLSPSVPADRLVIESSIATLQSHASMTLKGDCVFHSGSGEVTPEV 360
QVVNSHEL+KEDQN L+SPSVP++RLV+ESSI+ LQSHAS+TLKGDCVFHSGSGEV PEV
Sbjct: 301 QVVNSHELSKEDQNPLISPSVPSERLVVESSISPLQSHASVTLKGDCVFHSGSGEVMPEV 360
Query: 361 PSETDKFDDKVLCSNVEIGNPSKTNMHEVLPTVVKGDARAVVCAGEGKNINAEVCAFQGP 420
SETDK DDKVLCSN+E GNPSK ++ EVLP VV+GDAR C EGKNINAE+CA QGP
Sbjct: 361 SSETDKLDDKVLCSNMEFGNPSKESVCEVLPAVVEGDARTETCV-EGKNINAELCAVQGP 420
Query: 421 KIDSVGQMACAQEIISVDQQRFPSGIEIQTSKSESSASAMEKSNASKVGESSSGHIRDIP 480
KIDSVGQMAC QE+IS + P GIEIQTSKSESSASAME+S ASKVGES+SGHIRDIP
Sbjct: 421 KIDSVGQMACGQEMIS---EHLPLGIEIQTSKSESSASAMEESKASKVGESTSGHIRDIP 480
Query: 481 DKFTKDEHGMISLRDVRGCTLPIEKNLYSEGHLPPTTVAESTQLCEENKLCQSGNDHVTH 540
DKFT+ DVRGCTLPIE NLY EGHLPPTTVAESTQLCEENKL SGN +V H
Sbjct: 481 DKFTE---------DVRGCTLPIE-NLYFEGHLPPTTVAESTQLCEENKL--SGNVYVEH 540
Query: 541 ASCKEEVRLSSDSISVNGKFAESPVRDKRIVSLSFQESDVESGMIDTKVEYSANA----V 600
ASCKEEVRLSSDS ++NGKFA+SPV DKRIVSLSFQES VESG IDTK+EYSANA V
Sbjct: 541 ASCKEEVRLSSDSTNLNGKFADSPVTDKRIVSLSFQESGVESGTIDTKLEYSANAGDESV 600
Query: 601 SVSTFGDANVRTCDTLQGDSLPVVDALTDRKDADEKEDQLQPGVVEFTQSDSKEESGVII 660
SVSTF ANVRTC TLQ DSL +VDALTDRKDA++KEDQLQP VVE TQSDSKEESGVII
Sbjct: 601 SVSTFEGANVRTCGTLQDDSLLLVDALTDRKDANDKEDQLQPAVVELTQSDSKEESGVII 660
Query: 661 PAEGSFPLSDTSQPVGKFHPLSEAEKSACLLTGQGFGESIDQTISKNLNSDDCNRESQSI 720
PAEGS DTSQPVGK HPLSEAE S +LTGQG GESIDQ+I KNLNS DC+RESQSI
Sbjct: 661 PAEGSSVRLDTSQPVGKLHPLSEAENSTPVLTGQGSGESIDQSILKNLNSSDCSRESQSI 720
Query: 721 PQADIPNNVIQDCGQEMDIDPAFSKSSAKACDSGVKKSDEKSFPPDATSLTPLPGETLDN 780
PQADIPNNVIQDCGQEMD+DPA SKS+A ACDSG K+S DAT LT PGETLDN
Sbjct: 721 PQADIPNNVIQDCGQEMDVDPAISKSTAIACDSGGKQS-------DATPLTQRPGETLDN 780
Query: 781 YQKDQESTKVVSESVGNNCQQAIAVNIDSDAGKKEGSLCSADFPQSHEQMSVVGNGNSTA 840
YQKDQES KVVSE+VGNNCQQ IA NIDS AGKKEGSLCSA F QSHEQ SV GNGNSTA
Sbjct: 781 YQKDQESRKVVSETVGNNCQQVIAANIDSGAGKKEGSLCSAAFSQSHEQTSVTGNGNSTA 840
Query: 841 DKPPPNLPDVVKTTVVAHDPDVKDCNMRPASKNVEAAEAKDRLVGNTSSGSQLPKGNVAS 900
KPPPNLPDVVK TV AHDPDVKDCN P SKNVEAAE KDRLVGN SSGSQLPK NV S
Sbjct: 841 AKPPPNLPDVVKATVGAHDPDVKDCNKVPPSKNVEAAEVKDRLVGNASSGSQLPKENVVS 900
Query: 901 ESETALTFESSSLVDLPKNDSGIAVATAATASLVAEGP--QSSSGLSKLDIKSAQDISHS 960
ESETALTF+SSSLVDLPKNDSGIAVATAA+ASLV E P QSSSG SKLDIK+A+DISHS
Sbjct: 901 ESETALTFQSSSLVDLPKNDSGIAVATAASASLVVEAPQSQSSSGPSKLDIKTARDISHS 960
Query: 961 SPHVSEVKVARARSKGTPERKPRRASAKGLGKESSTKGNHTKKSEKVEKSNSTPISNPGI 1020
SPHVSEVKVAR+RSKGTPER+PRRASAKGLGKESSTKG+ TKKSEKVEKSNSTPISNPGI
Sbjct: 961 SPHVSEVKVARSRSKGTPERRPRRASAKGLGKESSTKGSQTKKSEKVEKSNSTPISNPGI 1020
Query: 1021 FQLAQSNEMQHHGHVESSGAKPFVFIGASTSSIPDLNNSASPSPMFQQPFTDLQQVQLRA 1080
FQLAQSNEMQHHGHVESSGAKPFVFIGASTSS+PDLNNSASPSPMFQQPFTDLQQVQLRA
Sbjct: 1021 FQLAQSNEMQHHGHVESSGAKPFVFIGASTSSVPDLNNSASPSPMFQQPFTDLQQVQLRA 1080
Query: 1081 QIFVYGALIQGTAPDEAYMLSAFGGPDGGTNLWENAWRMCVDRLNGKKSQPTNPETPSQS 1140
QIFVYGALIQGTAPDEAYMLSAFGGPDGGTNLWENAWRMCVDRLNGKKSQ NPETPSQS
Sbjct: 1081 QIFVYGALIQGTAPDEAYMLSAFGGPDGGTNLWENAWRMCVDRLNGKKSQTINPETPSQS 1140
Query: 1141 QSGGRSTEQANKQSTLQSKITSPPVSRVSSKSTSTVLNPMIPLSSPLWSISTPSNALQSS 1200
QS GRSTEQA+KQSTLQSKI SPPVSRVSSKSTSTVLNPMIPLSSPLWSISTPSNALQSS
Sbjct: 1141 QSVGRSTEQASKQSTLQSKIISPPVSRVSSKSTSTVLNPMIPLSSPLWSISTPSNALQSS 1200
Query: 1201 IVPRSPVIDYQQALTPLHPYQTPPVRNFIGHNLSWFSQAPFHSTWVATQTSTPDSSARFS 1260
IVPRSPVIDYQQALTPLHPYQTPPVRNFIGHNLSWFSQAPFHSTWVATQTSTPDSSARFS
Sbjct: 1201 IVPRSPVIDYQQALTPLHPYQTPPVRNFIGHNLSWFSQAPFHSTWVATQTSTPDSSARFS 1260
Query: 1261 GLPITEPVHLTPVKESSVSQSSAVKPSGSMVHGGTPGNVFTGASPLLELKKVSVTTGQNS 1320
GLPITEPV LTPVKESSVSQSSA+KPSGS+V GGTPGNVFTG+SPL ELKKVSVTTGQN
Sbjct: 1261 GLPITEPVQLTPVKESSVSQSSAMKPSGSLVQGGTPGNVFTGSSPLHELKKVSVTTGQNP 1320
Query: 1321 TESKMRRRKKNTVSEDPGLITIQQVQPHLKPVPAVVTTTISTLATSPSVHPKAASENLIL 1380
ESKMRRRKKNTVSEDPGLIT+ QVQPHLKPVPAVVTTTISTLATSPSVHPKA SEN+IL
Sbjct: 1321 AESKMRRRKKNTVSEDPGLITM-QVQPHLKPVPAVVTTTISTLATSPSVHPKATSENVIL 1380
Query: 1381 SPPPLCPTTHAKSAGQDLRGRAMFSDETLGKVREAKQLAEDAALFASEAVKHSAEVWSQL 1440
SPPPLCPT H K+AGQDLRG+ MFS+ETLGKVREAKQLAEDAALFASEAVKHSAEVWSQL
Sbjct: 1381 SPPPLCPTAHPKTAGQDLRGKPMFSEETLGKVREAKQLAEDAALFASEAVKHSAEVWSQL 1440
Query: 1441 DRQKNSELVPDVEAKLASAAIAIAAAAAVAKAAAAAANVASNAACQAKLMADEAVTSSSH 1500
RQKNSELV DVEAKLASAA AIAAAAAVAKAAAAAANVASNAACQAKLMADEA +SSS
Sbjct: 1441 GRQKNSELVSDVEAKLASAAAAIAAAAAVAKAAAAAANVASNAACQAKLMADEAFSSSSP 1500
Query: 1501 DVPCQSNEFSIHGSAVGVGKATPASILRGEDGGNGSSSIIIAAREAARKRVEAASAASKH 1560
++ CQSNEFS+HGSAVG GKATPASILRGEDGGNGSSSIIIAAREAARKRVEAASAASKH
Sbjct: 1501 EISCQSNEFSVHGSAVGAGKATPASILRGEDGGNGSSSIIIAAREAARKRVEAASAASKH 1560
Query: 1561 AENVDAIVKAAELAAAAVSQAGKLVAMGDPLPLGKLVEAGPDGYWKTPQVSSELVMRSDD 1620
AENVDAIV+AAELAAAAVSQAG LVAMGDPLPLGKLVEAGP+GYWKTPQVSSEL+MR DD
Sbjct: 1561 AENVDAIVRAAELAAAAVSQAGNLVAMGDPLPLGKLVEAGPEGYWKTPQVSSELIMRPDD 1620
Query: 1621 VNGGRSNSAIKRPRDGSSSKNEIQVSGGAKSPIPGEISMGSVENHSKLVDGITSSVAPRE 1680
VNGG SN AIKRPRDG SSKNEIQ S AK IP EISMGSVENH KLVDGITS VAPRE
Sbjct: 1621 VNGGSSNLAIKRPRDGLSSKNEIQPSVSAKPSIPEEISMGSVENHPKLVDGITSCVAPRE 1680
Query: 1681 KDLRGQKDQNASDLTKTIGVVPESEVGERSSQDECEKAKDLRQSSIKEGSHVEVFKDGNG 1740
K LRGQKDQNASDLTKTIGVVPESEVGERSSQDECEKAKDLRQSSIKEGSHVEVFKDGNG
Sbjct: 1681 KGLRGQKDQNASDLTKTIGVVPESEVGERSSQDECEKAKDLRQSSIKEGSHVEVFKDGNG 1740
Query: 1741 LKASWFTASVLSLKEGKAYVSYTELQPEEGSGQLKEWVALDGQGGMAPRIRISRPMTNLR 1800
LKASWFTASVLSLKEGKAYVSYTELQPEEGSGQLKEWVALDGQGGMAPRIRISRPMT R
Sbjct: 1741 LKASWFTASVLSLKEGKAYVSYTELQPEEGSGQLKEWVALDGQGGMAPRIRISRPMTTSR 1800
Query: 1801 TEGTRKRRRAAAGDYICC--DCI------------------------------------- 1860
TEGTRKRRRAAAGDYI D +
Sbjct: 1801 TEGTRKRRRAAAGDYIWSVGDKVDAWMQNSWHEGVVVEKNAKDETAYIVRFPGISRRNIH 1860
Query: 1861 ----------------QIVVPQEKRMKLGSPATEVKRKDKMPTIVEDVESAKTEDPSLLL 1920
I++P EKRMKLGSPA EVKRKDKMPTIVEDVESAK D SLL
Sbjct: 1861 YQSLESPPFSHMERWGMIIMPLEKRMKLGSPAAEVKRKDKMPTIVEDVESAKPADSSLLS 1920
Query: 1921 ISANEKVFNIGRNTQTENKSNPLKTSRTGLQKGASRVIIGVPRPGKKRKFMEVSKHYDAD 1980
IS NEKVFNIGRNTQTE K+NPLKTSRTGLQKG SRVIIGVPRPGKKRKFMEVSKHYD D
Sbjct: 1921 ISVNEKVFNIGRNTQTEKKTNPLKTSRTGLQKGTSRVIIGVPRPGKKRKFMEVSKHYDVD 1980
Query: 1981 TRTTEANDSTKLAKYLMPHGSTSKVLKRTSKYDTKEKSANDARPLAVKSGKQPSVSDHAV 2040
TRTTEANDSTKLA+YLMP GSTSK LKRTSKY+TKEKS ND +PLAVKSGKQPSVSDHAV
Sbjct: 1981 TRTTEANDSTKLARYLMPQGSTSKGLKRTSKYETKEKSTNDGKPLAVKSGKQPSVSDHAV 2040
Query: 2041 ITKDPESQNESTSGKNDQMDVPPFCSTEEAPEGSVLFPPAHAPKKASSFHTKPERANKGK 2100
IT+D ESQN ST GK+DQMDVP CSTEEAPEGS+LFPPAHAPKKASSFHTKPERANKG+
Sbjct: 2041 ITRDSESQNVSTEGKDDQMDVPSLCSTEEAPEGSLLFPPAHAPKKASSFHTKPERANKGR 2100
Query: 2101 LAPAVGKLAKIEEEKVFNGNPTKPNSNVIEPRRSNRRIQPTSRLLEGLQSSLAISKIPSI 2158
LAPAVGKLAKIEEEKVFNGN TKPNSNVIEPRRSNRRIQPTSRLLEGLQSSLAISKIPSI
Sbjct: 2101 LAPAVGKLAKIEEEKVFNGNTTKPNSNVIEPRRSNRRIQPTSRLLEGLQSSLAISKIPSI 2160
BLAST of ClCG10G002580 vs. NCBI nr
Match:
TYK12033.1 (Agenet domain-containing protein [Cucumis melo var. makuwa])
HSP 1 Score: 3459.1 bits (8968), Expect = 0.0e+00
Identity = 1898/2280 (83.25%), Postives = 1976/2280 (86.67%), Query Frame = 0
Query: 1 MDYDDNDFQSQNLHLAGEGSAKFPPVLRQYALPKFDFDDTLQGHVRFDSLVEPEVFLGIE 60
MDYDDNDFQSQNLHLAGEGSAKFPPVLRQYALPKFDFDDTLQGHVRFDSLVEPEVFLGIE
Sbjct: 1 MDYDDNDFQSQNLHLAGEGSAKFPPVLRQYALPKFDFDDTLQGHVRFDSLVEPEVFLGIE 60
Query: 61 NSEDNQWIEDYSRGSSGIGFTSCAAESCSILRRKNVWSEATSSESVEMLLKSVGQEDINL 120
N+ED QWIEDYSR SSGIGFTSCAAESCSILRRKNVWSEATSSESVEMLLKSVGQEDINL
Sbjct: 61 NNEDTQWIEDYSRVSSGIGFTSCAAESCSILRRKNVWSEATSSESVEMLLKSVGQEDINL 120
Query: 121 APAVTGESNAREKLDYLTNPMDPTLKDDGSSFCEMGDLQPTLLSNISLEELHVVNEDIRG 180
VTGESNAREKLDYLTNPMDPTLKDDGSSFCEMGDLQPTLLSNISLEELHVVNEDIRG
Sbjct: 121 TATVTGESNAREKLDYLTNPMDPTLKDDGSSFCEMGDLQPTLLSNISLEELHVVNEDIRG 180
Query: 181 E-QQPQGDDPTEFQEICTVDRSLGEVDPDVAHELVDMPASEGSSGIDENSKQTCASTINT 240
E QQPQ D+PTEFQEICTVDRSLGEVDP VAHELVDMP SEGSSGIDENSK+ CA+TINT
Sbjct: 181 EQQQPQRDNPTEFQEICTVDRSLGEVDPGVAHELVDMPTSEGSSGIDENSKKPCANTINT 240
Query: 241 SVSISMEDKGQDDFSASGKHINDLVTCTQEGSGKLSSQKIEQQIKDLSENPVNTYIGNIE 300
V++ +EDKGQDDFSASGKHI+DLVTC EGSGKL QKIEQQIKDLS+NPVNT +GNIE
Sbjct: 241 PVALLVEDKGQDDFSASGKHISDLVTCAHEGSGKLGGQKIEQQIKDLSKNPVNTSVGNIE 300
Query: 301 QVVNSHELNKEDQNRLLSPSVPADRLVIESSIATLQSHASMTLKGDCVFHSGSGEVTPEV 360
QVVNSHEL+KEDQN L+SPSVP++RLV+ESSI+ LQSHAS+TLKGDCVFHSGSGEV PEV
Sbjct: 301 QVVNSHELSKEDQNPLISPSVPSERLVVESSISPLQSHASVTLKGDCVFHSGSGEVMPEV 360
Query: 361 PSETDKFDDKVLCSNVEIGNPSKTNMHEVLPTVVKGDARAVVCAGEGKNINAEVCAFQGP 420
SETDK DDKVLCSN+E GNPSK ++ EVLP VV+GDAR C EGKNINAE+CA QGP
Sbjct: 361 SSETDKLDDKVLCSNMEFGNPSKESVCEVLPAVVEGDARTETCV-EGKNINAELCAVQGP 420
Query: 421 KIDSVGQMACAQEIISVDQQRFPSGIEIQTSKSESSASAMEKSNASKVGESSSGHIRDIP 480
KIDSVGQMAC QE+IS + P GIEIQTSKSESSASAME+S ASKVGES+SGHIRDIP
Sbjct: 421 KIDSVGQMACGQEMIS---EHLPLGIEIQTSKSESSASAMEESKASKVGESTSGHIRDIP 480
Query: 481 DKFTKDEHGMISLRDVRGCTLPIEKNLYSEGHLPPTTVAESTQLCEENKLCQSGNDHVTH 540
DKFT+ DVRGCTLPIE NLY EGHLPPTTVAESTQLCEENKL SGN +V H
Sbjct: 481 DKFTE---------DVRGCTLPIE-NLYFEGHLPPTTVAESTQLCEENKL--SGNVYVEH 540
Query: 541 ASCKEEVRLSSDSISVNGKFAESPVRDKRIVSLSFQESDVESGMIDTKVEYSANA----V 600
ASCKEEVRLSSDS ++NGKFA+SPV DKRIVSLSFQES VESG IDTK+EYSANA V
Sbjct: 541 ASCKEEVRLSSDSTNLNGKFADSPVTDKRIVSLSFQESGVESGTIDTKLEYSANAGDESV 600
Query: 601 SVSTFGDANVRTCDTLQGDSLPVVDALTDRKDADEKEDQLQPGVVEFTQSDSKEESGVII 660
SVSTF ANVRTC TLQ DSL +VDALTDRKDA++KEDQLQP VVE TQSDSKEESGVII
Sbjct: 601 SVSTFEGANVRTCGTLQDDSLLLVDALTDRKDANDKEDQLQPAVVELTQSDSKEESGVII 660
Query: 661 PAEGSFPLSDTSQPVGKFHPLSEAEKSACLLTGQGFGESIDQTISKNLNSDDCNRESQSI 720
PAEGS DTSQPVGK HPLSEAE S +LTGQG GESIDQ+I KNLNS DC+RESQSI
Sbjct: 661 PAEGSSVRLDTSQPVGKLHPLSEAENSTPVLTGQGSGESIDQSILKNLNSSDCSRESQSI 720
Query: 721 PQADIPNNVIQDCGQEMDIDPAFSKSSAKACDSGVKKSDEKSFPPDATSLTPLPGETLDN 780
PQADIPNNVIQDCGQEMD+DPA SKS+A ACDSG K+S DAT LT PGETLDN
Sbjct: 721 PQADIPNNVIQDCGQEMDVDPAISKSTAIACDSGGKQS-------DATPLTQRPGETLDN 780
Query: 781 YQKDQESTKVVSESVGNNCQQAIAVNIDSDAGKKEGSLCSADFPQSHEQMSVVGNGNSTA 840
YQKDQES KVVSE+VGNNCQQ IA NIDS AGKKEGSLCSA F QSHEQ SV GNGNSTA
Sbjct: 781 YQKDQESRKVVSETVGNNCQQVIAANIDSGAGKKEGSLCSAAFSQSHEQTSVTGNGNSTA 840
Query: 841 DKPPPNLPDVVKTTVVAHDPDVKDCNMRPASKNVEAAEAKDRLVGNTSSGSQLPKGNVAS 900
KPPPNLPDVVK TV AHDPDVKDCN P SKNVEAAE KDRLVGN SSGSQLPK NV S
Sbjct: 841 AKPPPNLPDVVKATVGAHDPDVKDCNKVPPSKNVEAAEVKDRLVGNASSGSQLPKENVVS 900
Query: 901 ESETALTFESSSLVDLPKNDSGIAVATAATASLVAEGP--QSSSGLSKLDIKSAQDISHS 960
ESETALTF+SSSLVDLPKNDSGIAVATAA+ASLV E P QSSSG SKLDIK+A+DISHS
Sbjct: 901 ESETALTFQSSSLVDLPKNDSGIAVATAASASLVVEAPQSQSSSGPSKLDIKTARDISHS 960
Query: 961 SPHVSEVKVARARSKGTPERKPRRASAKGLGKESSTKGNHTKKSEKVEKSNSTPISNPGI 1020
SPHVSEVKVAR+RSKGTPER+PRRASAKGLGKESSTKG+ TKKSEKVEKSNSTPISNPGI
Sbjct: 961 SPHVSEVKVARSRSKGTPERRPRRASAKGLGKESSTKGSQTKKSEKVEKSNSTPISNPGI 1020
Query: 1021 FQLAQSNEMQHHGHVESSGAKPFVFIGASTSSIPDLNNSASPSPMFQQPFTDLQQVQLRA 1080
FQLAQSNEMQHHGHVESSGAKPFVFIGASTSS+PDLNNSASPSPMFQQPFTDLQQVQLRA
Sbjct: 1021 FQLAQSNEMQHHGHVESSGAKPFVFIGASTSSVPDLNNSASPSPMFQQPFTDLQQVQLRA 1080
Query: 1081 QIFVYGALIQGTAPDEAYMLSAFGGPDGGTNLWENAWRMCVDRLNGKKSQPTNPETPSQS 1140
QIFVYGALIQGTAPDEAYMLSAFGGPDGGTNLWENAWRMCVDRLNGKKSQ NPETPSQS
Sbjct: 1081 QIFVYGALIQGTAPDEAYMLSAFGGPDGGTNLWENAWRMCVDRLNGKKSQTINPETPSQS 1140
Query: 1141 QSGGRSTEQANKQSTLQSKITSPPVSRVSSKSTSTVLNPMIPLSSPLWSISTPSNALQSS 1200
QS GRSTEQA+KQSTLQSKI SPPVSRVSSKSTSTVLNPMIPLSSPLWSISTPSNALQSS
Sbjct: 1141 QSVGRSTEQASKQSTLQSKIISPPVSRVSSKSTSTVLNPMIPLSSPLWSISTPSNALQSS 1200
Query: 1201 IVPRSPVIDYQQALTPLHPYQTPPVRNFIGHNLSWFSQAPFHSTWVATQTSTPDSSARFS 1260
IVPRSPVIDYQQALTPLHPYQTPPVRNFIGHNLSWFSQAPFHSTWVATQTSTPDSSARFS
Sbjct: 1201 IVPRSPVIDYQQALTPLHPYQTPPVRNFIGHNLSWFSQAPFHSTWVATQTSTPDSSARFS 1260
Query: 1261 GLPITEPVHLTPVKESSVSQSSAVKPSGSMVHGGTPGNVFTGASPLLELKKVSVTTGQNS 1320
GLPITEPV LTPVKESSVSQSSA+KPSGS+V GGTPGNVFTG+SPL ELKKVSVTTGQN
Sbjct: 1261 GLPITEPVQLTPVKESSVSQSSAMKPSGSLVQGGTPGNVFTGSSPLHELKKVSVTTGQNP 1320
Query: 1321 TESKMRRRKKNTVSEDPGLITIQQVQPHLKPVPAVVTTTISTLATSPSVHPKAASENLIL 1380
ESKMRRRKKNTVSEDPGLIT+ QVQPHLKPVPAVVTTTISTLATSPSVHPKA SEN+IL
Sbjct: 1321 AESKMRRRKKNTVSEDPGLITM-QVQPHLKPVPAVVTTTISTLATSPSVHPKATSENVIL 1380
Query: 1381 SPPPLCPTTHAKSAGQDLRGRAMFSDETLGKVREAKQLAEDAALFASEAVKHSAEVWSQL 1440
SPPPLCPT H K+AGQDLRG+ MFS+ETLGKVREAKQLAEDAALFASEAVKHSAEVWSQL
Sbjct: 1381 SPPPLCPTAHPKTAGQDLRGKPMFSEETLGKVREAKQLAEDAALFASEAVKHSAEVWSQL 1440
Query: 1441 DRQKNSELVPDVEAKLASAAIAIAAAAAVAKAAAAAANVASNAACQAKLMADEAVTSSSH 1500
RQKNSELV DVEAKLASAA AIAAAAAVAKAAAAAANVASNAACQAKLMADEA +SSS
Sbjct: 1441 GRQKNSELVSDVEAKLASAAAAIAAAAAVAKAAAAAANVASNAACQAKLMADEAFSSSSP 1500
Query: 1501 DVPCQSNEFSIHGSAVGVGKATPASILRGEDGGNGSSSIIIAAREAARKRVEAASAASKH 1560
++ CQSNEFS+HGSAVG GKATPASILRGEDGGNGSSSIIIAAREAARKRVEAASAASKH
Sbjct: 1501 EISCQSNEFSVHGSAVGAGKATPASILRGEDGGNGSSSIIIAAREAARKRVEAASAASKH 1560
Query: 1561 AENVDAIVKAAELAAAAVSQAGKLVAMGDPLPLGKLVEAGPDGYWKTPQVSSELVMRSDD 1620
AENVDAIV+AAELAAAAVSQAG LVAMGDPLPLGKLVEAGP+GYWKTPQVSSEL+MR DD
Sbjct: 1561 AENVDAIVRAAELAAAAVSQAGNLVAMGDPLPLGKLVEAGPEGYWKTPQVSSELIMRPDD 1620
Query: 1621 VNGGRSNSAIKRPRDGSSSKNEIQVSGGAKSPIPGEISMGSVENHSKLVDGITSSVAPRE 1680
VNGG SN AIKRPRDG SSKNEIQ S AK IP EISMGSVENH KLVDGITS VAPRE
Sbjct: 1621 VNGGSSNLAIKRPRDGLSSKNEIQPSVSAKPSIPEEISMGSVENHPKLVDGITSCVAPRE 1680
Query: 1681 KDLRGQKDQNASDLTKTIGVVPESEVGERSSQDECEKAKDLRQSSIKEGSHVEVFKDGNG 1740
K LRGQKDQNASDLTKTIGVVPESEVGERSSQDECEKAKDLRQSSIKEGSHVEVFKDGNG
Sbjct: 1681 KGLRGQKDQNASDLTKTIGVVPESEVGERSSQDECEKAKDLRQSSIKEGSHVEVFKDGNG 1740
Query: 1741 LKASWFTASVLSLKEGKAYVSYTELQPEEGSGQLKEWVALDGQGGMAPRIRISRPMTNLR 1800
LKASWFTASVLSLKEGKAYVSYTELQPEEGSGQLKEWVALDGQGGMAPRIRISRPMT R
Sbjct: 1741 LKASWFTASVLSLKEGKAYVSYTELQPEEGSGQLKEWVALDGQGGMAPRIRISRPMTTSR 1800
Query: 1801 TEGTRKRRRAAAGDYI--------------------CCDCI------------------- 1860
TEGTRKRRRAAAGDYI C C+
Sbjct: 1801 TEGTRKRRRAAAGDYIWSVGDKVDAWMQNSVDRREWICSCLKSATGMKEWLLRRTQKMKQ 1860
Query: 1861 ---------------------------------------------QIVVPQEKRMKLGSP 1920
+I++P EKRMKLGSP
Sbjct: 1861 HILSAFQASRGETSTIKAWNLRPSLIWKDGEWFELSGSYANDYSHEIIMPLEKRMKLGSP 1920
Query: 1921 ATEVKRKDKMPTIVEDVESAKTEDPSLLLISANEKVFNIGRNTQTENKSNPLKTSRTGLQ 1980
A EVKRKDKMPTIVEDVESAK D SLL IS NEKVFNIGRNTQTE K+NPLKTSRTGLQ
Sbjct: 1921 AAEVKRKDKMPTIVEDVESAKPADSSLLSISVNEKVFNIGRNTQTEKKTNPLKTSRTGLQ 1980
Query: 1981 KGASRVIIGVPRPGKKRKFMEVSKHYDADTRTTEANDSTKLAKYLMPHGSTSKVLKRTSK 2040
KG SRVIIGVPRPGKKRKFMEVSKHYD DTRTTEANDSTKLA+YLMP GSTSK LKRTSK
Sbjct: 1981 KGTSRVIIGVPRPGKKRKFMEVSKHYDVDTRTTEANDSTKLARYLMPQGSTSKGLKRTSK 2040
Query: 2041 YDTKEKSANDARPLAVKSGKQPSVSDHAVITKDPESQNESTSGKNDQMDVPPFCSTEEAP 2100
Y+TKEKS ND +PLAVKSGKQPSVSDHAVIT+D ESQN ST GK+DQMDVP CSTEEAP
Sbjct: 2041 YETKEKSTNDGKPLAVKSGKQPSVSDHAVITRDSESQNVSTEGKDDQMDVPSLCSTEEAP 2100
Query: 2101 EGSVLFPPAHAPKKASSFHTKPERANKGKLAPAVGKLAKIEEEKVFNGNPTKPNSNVIEP 2158
EGS+LFPPAHAPKKASSFHTKPERANKG+LAPAVGKLAKIEEEKVFNGN TKPNSNVIEP
Sbjct: 2101 EGSLLFPPAHAPKKASSFHTKPERANKGRLAPAVGKLAKIEEEKVFNGNTTKPNSNVIEP 2160
BLAST of ClCG10G002580 vs. ExPASy TrEMBL
Match:
A0A5A7TI44 (Agenet domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold477G00480 PE=4 SV=1)
HSP 1 Score: 3463.3 bits (8979), Expect = 0.0e+00
Identity = 1897/2251 (84.27%), Postives = 1974/2251 (87.69%), Query Frame = 0
Query: 1 MDYDDNDFQSQNLHLAGEGSAKFPPVLRQYALPKFDFDDTLQGHVRFDSLVEPEVFLGIE 60
MDYDDNDFQSQNLHLAGEGSAKFPPVLRQYALPKFDFDDTLQGHVRFDSLVEPEVFLGIE
Sbjct: 1 MDYDDNDFQSQNLHLAGEGSAKFPPVLRQYALPKFDFDDTLQGHVRFDSLVEPEVFLGIE 60
Query: 61 NSEDNQWIEDYSRGSSGIGFTSCAAESCSILRRKNVWSEATSSESVEMLLKSVGQEDINL 120
N+ED QWIEDYSR SSGIGFTSCAAESCSILRRKNVWSEATSSESVEMLLKSVGQEDINL
Sbjct: 61 NNEDTQWIEDYSRVSSGIGFTSCAAESCSILRRKNVWSEATSSESVEMLLKSVGQEDINL 120
Query: 121 APAVTGESNAREKLDYLTNPMDPTLKDDGSSFCEMGDLQPTLLSNISLEELHVVNEDIRG 180
VTGESNAREKLDYLTNPMDPTLKDDGSSFCEMGDLQPTLLSNISLEELHVVNEDIRG
Sbjct: 121 TATVTGESNAREKLDYLTNPMDPTLKDDGSSFCEMGDLQPTLLSNISLEELHVVNEDIRG 180
Query: 181 E-QQPQGDDPTEFQEICTVDRSLGEVDPDVAHELVDMPASEGSSGIDENSKQTCASTINT 240
E QQPQ D+PTEFQEICTVDRSLGEVDP VAHELVDMP SEGSSGIDENSK+ CA+TINT
Sbjct: 181 EQQQPQRDNPTEFQEICTVDRSLGEVDPGVAHELVDMPTSEGSSGIDENSKKPCANTINT 240
Query: 241 SVSISMEDKGQDDFSASGKHINDLVTCTQEGSGKLSSQKIEQQIKDLSENPVNTYIGNIE 300
V++ +EDKGQDDFSASGKHI+DLVTC EGSGKL QKIEQQIKDLS+NPVNT +GNIE
Sbjct: 241 PVALLVEDKGQDDFSASGKHISDLVTCAHEGSGKLGGQKIEQQIKDLSKNPVNTSVGNIE 300
Query: 301 QVVNSHELNKEDQNRLLSPSVPADRLVIESSIATLQSHASMTLKGDCVFHSGSGEVTPEV 360
QVVNSHEL+KEDQN L+SPSVP++RLV+ESSI+ LQSHAS+TLKGDCVFHSGSGEV PEV
Sbjct: 301 QVVNSHELSKEDQNPLISPSVPSERLVVESSISPLQSHASVTLKGDCVFHSGSGEVMPEV 360
Query: 361 PSETDKFDDKVLCSNVEIGNPSKTNMHEVLPTVVKGDARAVVCAGEGKNINAEVCAFQGP 420
SETDK DDKVLCSN+E GNPSK ++ EVLP VV+GDAR C EGKNINAE+CA QGP
Sbjct: 361 SSETDKLDDKVLCSNMEFGNPSKESVCEVLPAVVEGDARTETCV-EGKNINAELCAVQGP 420
Query: 421 KIDSVGQMACAQEIISVDQQRFPSGIEIQTSKSESSASAMEKSNASKVGESSSGHIRDIP 480
KIDSVGQMAC QE+IS + P GIEIQTSKSESSASAME+S ASKVGES+SGHIRDIP
Sbjct: 421 KIDSVGQMACGQEMIS---EHLPLGIEIQTSKSESSASAMEESKASKVGESTSGHIRDIP 480
Query: 481 DKFTKDEHGMISLRDVRGCTLPIEKNLYSEGHLPPTTVAESTQLCEENKLCQSGNDHVTH 540
DKFT+ DVRGCTLPIE NLY EGHLPPTTVAESTQLCEENKL SGN +V H
Sbjct: 481 DKFTE---------DVRGCTLPIE-NLYFEGHLPPTTVAESTQLCEENKL--SGNVYVEH 540
Query: 541 ASCKEEVRLSSDSISVNGKFAESPVRDKRIVSLSFQESDVESGMIDTKVEYSANA----V 600
ASCKEEVRLSSDS ++NGKFA+SPV DKRIVSLSFQES VESG IDTK+EYSANA V
Sbjct: 541 ASCKEEVRLSSDSTNLNGKFADSPVTDKRIVSLSFQESGVESGTIDTKLEYSANAGDESV 600
Query: 601 SVSTFGDANVRTCDTLQGDSLPVVDALTDRKDADEKEDQLQPGVVEFTQSDSKEESGVII 660
SVSTF ANVRTC TLQ DSL +VDALTDRKDA++KEDQLQP VVE TQSDSKEESGVII
Sbjct: 601 SVSTFEGANVRTCGTLQDDSLLLVDALTDRKDANDKEDQLQPAVVELTQSDSKEESGVII 660
Query: 661 PAEGSFPLSDTSQPVGKFHPLSEAEKSACLLTGQGFGESIDQTISKNLNSDDCNRESQSI 720
PAEGS DTSQPVGK HPLSEAE S +LTGQG GESIDQ+I KNLNS DC+RESQSI
Sbjct: 661 PAEGSSVRLDTSQPVGKLHPLSEAENSTPVLTGQGSGESIDQSILKNLNSSDCSRESQSI 720
Query: 721 PQADIPNNVIQDCGQEMDIDPAFSKSSAKACDSGVKKSDEKSFPPDATSLTPLPGETLDN 780
PQADIPNNVIQDCGQEMD+DPA SKS+A ACDSG K+S DAT LT PGETLDN
Sbjct: 721 PQADIPNNVIQDCGQEMDVDPAISKSTAIACDSGGKQS-------DATPLTQRPGETLDN 780
Query: 781 YQKDQESTKVVSESVGNNCQQAIAVNIDSDAGKKEGSLCSADFPQSHEQMSVVGNGNSTA 840
YQKDQES KVVSE+VGNNCQQ IA NIDS AGKKEGSLCSA F QSHEQ SV GNGNSTA
Sbjct: 781 YQKDQESRKVVSETVGNNCQQVIAANIDSGAGKKEGSLCSAAFSQSHEQTSVTGNGNSTA 840
Query: 841 DKPPPNLPDVVKTTVVAHDPDVKDCNMRPASKNVEAAEAKDRLVGNTSSGSQLPKGNVAS 900
KPPPNLPDVVK TV AHDPDVKDCN P SKNVEAAE KDRLVGN SSGSQLPK NV S
Sbjct: 841 AKPPPNLPDVVKATVGAHDPDVKDCNKVPPSKNVEAAEVKDRLVGNASSGSQLPKENVVS 900
Query: 901 ESETALTFESSSLVDLPKNDSGIAVATAATASLVAEGP--QSSSGLSKLDIKSAQDISHS 960
ESETALTF+SSSLVDLPKNDSGIAVATAA+ASLV E P QSSSG SKLDIK+A+DISHS
Sbjct: 901 ESETALTFQSSSLVDLPKNDSGIAVATAASASLVVEAPQSQSSSGPSKLDIKTARDISHS 960
Query: 961 SPHVSEVKVARARSKGTPERKPRRASAKGLGKESSTKGNHTKKSEKVEKSNSTPISNPGI 1020
SPHVSEVKVAR+RSKGTPER+PRRASAKGLGKESSTKG+ TKKSEKVEKSNSTPISNPGI
Sbjct: 961 SPHVSEVKVARSRSKGTPERRPRRASAKGLGKESSTKGSQTKKSEKVEKSNSTPISNPGI 1020
Query: 1021 FQLAQSNEMQHHGHVESSGAKPFVFIGASTSSIPDLNNSASPSPMFQQPFTDLQQVQLRA 1080
FQLAQSNEMQHHGHVESSGAKPFVFIGASTSS+PDLNNSASPSPMFQQPFTDLQQVQLRA
Sbjct: 1021 FQLAQSNEMQHHGHVESSGAKPFVFIGASTSSVPDLNNSASPSPMFQQPFTDLQQVQLRA 1080
Query: 1081 QIFVYGALIQGTAPDEAYMLSAFGGPDGGTNLWENAWRMCVDRLNGKKSQPTNPETPSQS 1140
QIFVYGALIQGTAPDEAYMLSAFGGPDGGTNLWENAWRMCVDRLNGKKSQ NPETPSQS
Sbjct: 1081 QIFVYGALIQGTAPDEAYMLSAFGGPDGGTNLWENAWRMCVDRLNGKKSQTINPETPSQS 1140
Query: 1141 QSGGRSTEQANKQSTLQSKITSPPVSRVSSKSTSTVLNPMIPLSSPLWSISTPSNALQSS 1200
QS GRSTEQA+KQSTLQSKI SPPVSRVSSKSTSTVLNPMIPLSSPLWSISTPSNALQSS
Sbjct: 1141 QSVGRSTEQASKQSTLQSKIISPPVSRVSSKSTSTVLNPMIPLSSPLWSISTPSNALQSS 1200
Query: 1201 IVPRSPVIDYQQALTPLHPYQTPPVRNFIGHNLSWFSQAPFHSTWVATQTSTPDSSARFS 1260
IVPRSPVIDYQQALTPLHPYQTPPVRNFIGHNLSWFSQAPFHSTWVATQTSTPDSSARFS
Sbjct: 1201 IVPRSPVIDYQQALTPLHPYQTPPVRNFIGHNLSWFSQAPFHSTWVATQTSTPDSSARFS 1260
Query: 1261 GLPITEPVHLTPVKESSVSQSSAVKPSGSMVHGGTPGNVFTGASPLLELKKVSVTTGQNS 1320
GLPITEPV LTPVKESSVSQSSA+KPSGS+V GGTPGNVFTG+SPL ELKKVSVTTGQN
Sbjct: 1261 GLPITEPVQLTPVKESSVSQSSAMKPSGSLVQGGTPGNVFTGSSPLHELKKVSVTTGQNP 1320
Query: 1321 TESKMRRRKKNTVSEDPGLITIQQVQPHLKPVPAVVTTTISTLATSPSVHPKAASENLIL 1380
ESKMRRRKKNTVSEDPGLIT+ QVQPHLKPVPAVVTTTISTLATSPSVHPKA SEN+IL
Sbjct: 1321 AESKMRRRKKNTVSEDPGLITM-QVQPHLKPVPAVVTTTISTLATSPSVHPKATSENVIL 1380
Query: 1381 SPPPLCPTTHAKSAGQDLRGRAMFSDETLGKVREAKQLAEDAALFASEAVKHSAEVWSQL 1440
SPPPLCPT H K+AGQDLRG+ MFS+ETLGKVREAKQLAEDAALFASEAVKHSAEVWSQL
Sbjct: 1381 SPPPLCPTAHPKTAGQDLRGKPMFSEETLGKVREAKQLAEDAALFASEAVKHSAEVWSQL 1440
Query: 1441 DRQKNSELVPDVEAKLASAAIAIAAAAAVAKAAAAAANVASNAACQAKLMADEAVTSSSH 1500
RQKNSELV DVEAKLASAA AIAAAAAVAKAAAAAANVASNAACQAKLMADEA +SSS
Sbjct: 1441 GRQKNSELVSDVEAKLASAAAAIAAAAAVAKAAAAAANVASNAACQAKLMADEAFSSSSP 1500
Query: 1501 DVPCQSNEFSIHGSAVGVGKATPASILRGEDGGNGSSSIIIAAREAARKRVEAASAASKH 1560
++ CQSNEFS+HGSAVG GKATPASILRGEDGGNGSSSIIIAAREAARKRVEAASAASKH
Sbjct: 1501 EISCQSNEFSVHGSAVGAGKATPASILRGEDGGNGSSSIIIAAREAARKRVEAASAASKH 1560
Query: 1561 AENVDAIVKAAELAAAAVSQAGKLVAMGDPLPLGKLVEAGPDGYWKTPQVSSELVMRSDD 1620
AENVDAIV+AAELAAAAVSQAG LVAMGDPLPLGKLVEAGP+GYWKTPQVSSEL+MR DD
Sbjct: 1561 AENVDAIVRAAELAAAAVSQAGNLVAMGDPLPLGKLVEAGPEGYWKTPQVSSELIMRPDD 1620
Query: 1621 VNGGRSNSAIKRPRDGSSSKNEIQVSGGAKSPIPGEISMGSVENHSKLVDGITSSVAPRE 1680
VNGG SN AIKRPRDG SSKNEIQ S AK IP EISMGSVENH KLVDGITS VAPRE
Sbjct: 1621 VNGGSSNLAIKRPRDGLSSKNEIQPSVSAKPSIPEEISMGSVENHPKLVDGITSCVAPRE 1680
Query: 1681 KDLRGQKDQNASDLTKTIGVVPESEVGERSSQDECEKAKDLRQSSIKEGSHVEVFKDGNG 1740
K LRGQKDQNASDLTKTIGVVPESEVGERSSQDECEKAKDLRQSSIKEGSHVEVFKDGNG
Sbjct: 1681 KGLRGQKDQNASDLTKTIGVVPESEVGERSSQDECEKAKDLRQSSIKEGSHVEVFKDGNG 1740
Query: 1741 LKASWFTASVLSLKEGKAYVSYTELQPEEGSGQLKEWVALDGQGGMAPRIRISRPMTNLR 1800
LKASWFTASVLSLKEGKAYVSYTELQPEEGSGQLKEWVALDGQGGMAPRIRISRPMT R
Sbjct: 1741 LKASWFTASVLSLKEGKAYVSYTELQPEEGSGQLKEWVALDGQGGMAPRIRISRPMTTSR 1800
Query: 1801 TEGTRKRRRAAAGDYICC--DCI------------------------------------- 1860
TEGTRKRRRAAAGDYI D +
Sbjct: 1801 TEGTRKRRRAAAGDYIWSVGDKVDAWMQNSWHEGVVVEKNAKDETAYIVRFPGISRRNIH 1860
Query: 1861 ----------------QIVVPQEKRMKLGSPATEVKRKDKMPTIVEDVESAKTEDPSLLL 1920
I++P EKRMKLGSPA EVKRKDKMPTIVEDVESAK D SLL
Sbjct: 1861 YQSLESPPFSHMERWGMIIMPLEKRMKLGSPAAEVKRKDKMPTIVEDVESAKPADSSLLS 1920
Query: 1921 ISANEKVFNIGRNTQTENKSNPLKTSRTGLQKGASRVIIGVPRPGKKRKFMEVSKHYDAD 1980
IS NEKVFNIGRNTQTE K+NPLKTSRTGLQKG SRVIIGVPRPGKKRKFMEVSKHYD D
Sbjct: 1921 ISVNEKVFNIGRNTQTEKKTNPLKTSRTGLQKGTSRVIIGVPRPGKKRKFMEVSKHYDVD 1980
Query: 1981 TRTTEANDSTKLAKYLMPHGSTSKVLKRTSKYDTKEKSANDARPLAVKSGKQPSVSDHAV 2040
TRTTEANDSTKLA+YLMP GSTSK LKRTSKY+TKEKS ND +PLAVKSGKQPSVSDHAV
Sbjct: 1981 TRTTEANDSTKLARYLMPQGSTSKGLKRTSKYETKEKSTNDGKPLAVKSGKQPSVSDHAV 2040
Query: 2041 ITKDPESQNESTSGKNDQMDVPPFCSTEEAPEGSVLFPPAHAPKKASSFHTKPERANKGK 2100
IT+D ESQN ST GK+DQMDVP CSTEEAPEGS+LFPPAHAPKKASSFHTKPERANKG+
Sbjct: 2041 ITRDSESQNVSTEGKDDQMDVPSLCSTEEAPEGSLLFPPAHAPKKASSFHTKPERANKGR 2100
Query: 2101 LAPAVGKLAKIEEEKVFNGNPTKPNSNVIEPRRSNRRIQPTSRLLEGLQSSLAISKIPSI 2158
LAPAVGKLAKIEEEKVFNGN TKPNSNVIEPRRSNRRIQPTSRLLEGLQSSLAISKIPSI
Sbjct: 2101 LAPAVGKLAKIEEEKVFNGNTTKPNSNVIEPRRSNRRIQPTSRLLEGLQSSLAISKIPSI 2160
BLAST of ClCG10G002580 vs. ExPASy TrEMBL
Match:
A0A5D3CJB8 (Agenet domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold1017G00450 PE=4 SV=1)
HSP 1 Score: 3459.1 bits (8968), Expect = 0.0e+00
Identity = 1898/2280 (83.25%), Postives = 1976/2280 (86.67%), Query Frame = 0
Query: 1 MDYDDNDFQSQNLHLAGEGSAKFPPVLRQYALPKFDFDDTLQGHVRFDSLVEPEVFLGIE 60
MDYDDNDFQSQNLHLAGEGSAKFPPVLRQYALPKFDFDDTLQGHVRFDSLVEPEVFLGIE
Sbjct: 1 MDYDDNDFQSQNLHLAGEGSAKFPPVLRQYALPKFDFDDTLQGHVRFDSLVEPEVFLGIE 60
Query: 61 NSEDNQWIEDYSRGSSGIGFTSCAAESCSILRRKNVWSEATSSESVEMLLKSVGQEDINL 120
N+ED QWIEDYSR SSGIGFTSCAAESCSILRRKNVWSEATSSESVEMLLKSVGQEDINL
Sbjct: 61 NNEDTQWIEDYSRVSSGIGFTSCAAESCSILRRKNVWSEATSSESVEMLLKSVGQEDINL 120
Query: 121 APAVTGESNAREKLDYLTNPMDPTLKDDGSSFCEMGDLQPTLLSNISLEELHVVNEDIRG 180
VTGESNAREKLDYLTNPMDPTLKDDGSSFCEMGDLQPTLLSNISLEELHVVNEDIRG
Sbjct: 121 TATVTGESNAREKLDYLTNPMDPTLKDDGSSFCEMGDLQPTLLSNISLEELHVVNEDIRG 180
Query: 181 E-QQPQGDDPTEFQEICTVDRSLGEVDPDVAHELVDMPASEGSSGIDENSKQTCASTINT 240
E QQPQ D+PTEFQEICTVDRSLGEVDP VAHELVDMP SEGSSGIDENSK+ CA+TINT
Sbjct: 181 EQQQPQRDNPTEFQEICTVDRSLGEVDPGVAHELVDMPTSEGSSGIDENSKKPCANTINT 240
Query: 241 SVSISMEDKGQDDFSASGKHINDLVTCTQEGSGKLSSQKIEQQIKDLSENPVNTYIGNIE 300
V++ +EDKGQDDFSASGKHI+DLVTC EGSGKL QKIEQQIKDLS+NPVNT +GNIE
Sbjct: 241 PVALLVEDKGQDDFSASGKHISDLVTCAHEGSGKLGGQKIEQQIKDLSKNPVNTSVGNIE 300
Query: 301 QVVNSHELNKEDQNRLLSPSVPADRLVIESSIATLQSHASMTLKGDCVFHSGSGEVTPEV 360
QVVNSHEL+KEDQN L+SPSVP++RLV+ESSI+ LQSHAS+TLKGDCVFHSGSGEV PEV
Sbjct: 301 QVVNSHELSKEDQNPLISPSVPSERLVVESSISPLQSHASVTLKGDCVFHSGSGEVMPEV 360
Query: 361 PSETDKFDDKVLCSNVEIGNPSKTNMHEVLPTVVKGDARAVVCAGEGKNINAEVCAFQGP 420
SETDK DDKVLCSN+E GNPSK ++ EVLP VV+GDAR C EGKNINAE+CA QGP
Sbjct: 361 SSETDKLDDKVLCSNMEFGNPSKESVCEVLPAVVEGDARTETCV-EGKNINAELCAVQGP 420
Query: 421 KIDSVGQMACAQEIISVDQQRFPSGIEIQTSKSESSASAMEKSNASKVGESSSGHIRDIP 480
KIDSVGQMAC QE+IS + P GIEIQTSKSESSASAME+S ASKVGES+SGHIRDIP
Sbjct: 421 KIDSVGQMACGQEMIS---EHLPLGIEIQTSKSESSASAMEESKASKVGESTSGHIRDIP 480
Query: 481 DKFTKDEHGMISLRDVRGCTLPIEKNLYSEGHLPPTTVAESTQLCEENKLCQSGNDHVTH 540
DKFT+ DVRGCTLPIE NLY EGHLPPTTVAESTQLCEENKL SGN +V H
Sbjct: 481 DKFTE---------DVRGCTLPIE-NLYFEGHLPPTTVAESTQLCEENKL--SGNVYVEH 540
Query: 541 ASCKEEVRLSSDSISVNGKFAESPVRDKRIVSLSFQESDVESGMIDTKVEYSANA----V 600
ASCKEEVRLSSDS ++NGKFA+SPV DKRIVSLSFQES VESG IDTK+EYSANA V
Sbjct: 541 ASCKEEVRLSSDSTNLNGKFADSPVTDKRIVSLSFQESGVESGTIDTKLEYSANAGDESV 600
Query: 601 SVSTFGDANVRTCDTLQGDSLPVVDALTDRKDADEKEDQLQPGVVEFTQSDSKEESGVII 660
SVSTF ANVRTC TLQ DSL +VDALTDRKDA++KEDQLQP VVE TQSDSKEESGVII
Sbjct: 601 SVSTFEGANVRTCGTLQDDSLLLVDALTDRKDANDKEDQLQPAVVELTQSDSKEESGVII 660
Query: 661 PAEGSFPLSDTSQPVGKFHPLSEAEKSACLLTGQGFGESIDQTISKNLNSDDCNRESQSI 720
PAEGS DTSQPVGK HPLSEAE S +LTGQG GESIDQ+I KNLNS DC+RESQSI
Sbjct: 661 PAEGSSVRLDTSQPVGKLHPLSEAENSTPVLTGQGSGESIDQSILKNLNSSDCSRESQSI 720
Query: 721 PQADIPNNVIQDCGQEMDIDPAFSKSSAKACDSGVKKSDEKSFPPDATSLTPLPGETLDN 780
PQADIPNNVIQDCGQEMD+DPA SKS+A ACDSG K+S DAT LT PGETLDN
Sbjct: 721 PQADIPNNVIQDCGQEMDVDPAISKSTAIACDSGGKQS-------DATPLTQRPGETLDN 780
Query: 781 YQKDQESTKVVSESVGNNCQQAIAVNIDSDAGKKEGSLCSADFPQSHEQMSVVGNGNSTA 840
YQKDQES KVVSE+VGNNCQQ IA NIDS AGKKEGSLCSA F QSHEQ SV GNGNSTA
Sbjct: 781 YQKDQESRKVVSETVGNNCQQVIAANIDSGAGKKEGSLCSAAFSQSHEQTSVTGNGNSTA 840
Query: 841 DKPPPNLPDVVKTTVVAHDPDVKDCNMRPASKNVEAAEAKDRLVGNTSSGSQLPKGNVAS 900
KPPPNLPDVVK TV AHDPDVKDCN P SKNVEAAE KDRLVGN SSGSQLPK NV S
Sbjct: 841 AKPPPNLPDVVKATVGAHDPDVKDCNKVPPSKNVEAAEVKDRLVGNASSGSQLPKENVVS 900
Query: 901 ESETALTFESSSLVDLPKNDSGIAVATAATASLVAEGP--QSSSGLSKLDIKSAQDISHS 960
ESETALTF+SSSLVDLPKNDSGIAVATAA+ASLV E P QSSSG SKLDIK+A+DISHS
Sbjct: 901 ESETALTFQSSSLVDLPKNDSGIAVATAASASLVVEAPQSQSSSGPSKLDIKTARDISHS 960
Query: 961 SPHVSEVKVARARSKGTPERKPRRASAKGLGKESSTKGNHTKKSEKVEKSNSTPISNPGI 1020
SPHVSEVKVAR+RSKGTPER+PRRASAKGLGKESSTKG+ TKKSEKVEKSNSTPISNPGI
Sbjct: 961 SPHVSEVKVARSRSKGTPERRPRRASAKGLGKESSTKGSQTKKSEKVEKSNSTPISNPGI 1020
Query: 1021 FQLAQSNEMQHHGHVESSGAKPFVFIGASTSSIPDLNNSASPSPMFQQPFTDLQQVQLRA 1080
FQLAQSNEMQHHGHVESSGAKPFVFIGASTSS+PDLNNSASPSPMFQQPFTDLQQVQLRA
Sbjct: 1021 FQLAQSNEMQHHGHVESSGAKPFVFIGASTSSVPDLNNSASPSPMFQQPFTDLQQVQLRA 1080
Query: 1081 QIFVYGALIQGTAPDEAYMLSAFGGPDGGTNLWENAWRMCVDRLNGKKSQPTNPETPSQS 1140
QIFVYGALIQGTAPDEAYMLSAFGGPDGGTNLWENAWRMCVDRLNGKKSQ NPETPSQS
Sbjct: 1081 QIFVYGALIQGTAPDEAYMLSAFGGPDGGTNLWENAWRMCVDRLNGKKSQTINPETPSQS 1140
Query: 1141 QSGGRSTEQANKQSTLQSKITSPPVSRVSSKSTSTVLNPMIPLSSPLWSISTPSNALQSS 1200
QS GRSTEQA+KQSTLQSKI SPPVSRVSSKSTSTVLNPMIPLSSPLWSISTPSNALQSS
Sbjct: 1141 QSVGRSTEQASKQSTLQSKIISPPVSRVSSKSTSTVLNPMIPLSSPLWSISTPSNALQSS 1200
Query: 1201 IVPRSPVIDYQQALTPLHPYQTPPVRNFIGHNLSWFSQAPFHSTWVATQTSTPDSSARFS 1260
IVPRSPVIDYQQALTPLHPYQTPPVRNFIGHNLSWFSQAPFHSTWVATQTSTPDSSARFS
Sbjct: 1201 IVPRSPVIDYQQALTPLHPYQTPPVRNFIGHNLSWFSQAPFHSTWVATQTSTPDSSARFS 1260
Query: 1261 GLPITEPVHLTPVKESSVSQSSAVKPSGSMVHGGTPGNVFTGASPLLELKKVSVTTGQNS 1320
GLPITEPV LTPVKESSVSQSSA+KPSGS+V GGTPGNVFTG+SPL ELKKVSVTTGQN
Sbjct: 1261 GLPITEPVQLTPVKESSVSQSSAMKPSGSLVQGGTPGNVFTGSSPLHELKKVSVTTGQNP 1320
Query: 1321 TESKMRRRKKNTVSEDPGLITIQQVQPHLKPVPAVVTTTISTLATSPSVHPKAASENLIL 1380
ESKMRRRKKNTVSEDPGLIT+ QVQPHLKPVPAVVTTTISTLATSPSVHPKA SEN+IL
Sbjct: 1321 AESKMRRRKKNTVSEDPGLITM-QVQPHLKPVPAVVTTTISTLATSPSVHPKATSENVIL 1380
Query: 1381 SPPPLCPTTHAKSAGQDLRGRAMFSDETLGKVREAKQLAEDAALFASEAVKHSAEVWSQL 1440
SPPPLCPT H K+AGQDLRG+ MFS+ETLGKVREAKQLAEDAALFASEAVKHSAEVWSQL
Sbjct: 1381 SPPPLCPTAHPKTAGQDLRGKPMFSEETLGKVREAKQLAEDAALFASEAVKHSAEVWSQL 1440
Query: 1441 DRQKNSELVPDVEAKLASAAIAIAAAAAVAKAAAAAANVASNAACQAKLMADEAVTSSSH 1500
RQKNSELV DVEAKLASAA AIAAAAAVAKAAAAAANVASNAACQAKLMADEA +SSS
Sbjct: 1441 GRQKNSELVSDVEAKLASAAAAIAAAAAVAKAAAAAANVASNAACQAKLMADEAFSSSSP 1500
Query: 1501 DVPCQSNEFSIHGSAVGVGKATPASILRGEDGGNGSSSIIIAAREAARKRVEAASAASKH 1560
++ CQSNEFS+HGSAVG GKATPASILRGEDGGNGSSSIIIAAREAARKRVEAASAASKH
Sbjct: 1501 EISCQSNEFSVHGSAVGAGKATPASILRGEDGGNGSSSIIIAAREAARKRVEAASAASKH 1560
Query: 1561 AENVDAIVKAAELAAAAVSQAGKLVAMGDPLPLGKLVEAGPDGYWKTPQVSSELVMRSDD 1620
AENVDAIV+AAELAAAAVSQAG LVAMGDPLPLGKLVEAGP+GYWKTPQVSSEL+MR DD
Sbjct: 1561 AENVDAIVRAAELAAAAVSQAGNLVAMGDPLPLGKLVEAGPEGYWKTPQVSSELIMRPDD 1620
Query: 1621 VNGGRSNSAIKRPRDGSSSKNEIQVSGGAKSPIPGEISMGSVENHSKLVDGITSSVAPRE 1680
VNGG SN AIKRPRDG SSKNEIQ S AK IP EISMGSVENH KLVDGITS VAPRE
Sbjct: 1621 VNGGSSNLAIKRPRDGLSSKNEIQPSVSAKPSIPEEISMGSVENHPKLVDGITSCVAPRE 1680
Query: 1681 KDLRGQKDQNASDLTKTIGVVPESEVGERSSQDECEKAKDLRQSSIKEGSHVEVFKDGNG 1740
K LRGQKDQNASDLTKTIGVVPESEVGERSSQDECEKAKDLRQSSIKEGSHVEVFKDGNG
Sbjct: 1681 KGLRGQKDQNASDLTKTIGVVPESEVGERSSQDECEKAKDLRQSSIKEGSHVEVFKDGNG 1740
Query: 1741 LKASWFTASVLSLKEGKAYVSYTELQPEEGSGQLKEWVALDGQGGMAPRIRISRPMTNLR 1800
LKASWFTASVLSLKEGKAYVSYTELQPEEGSGQLKEWVALDGQGGMAPRIRISRPMT R
Sbjct: 1741 LKASWFTASVLSLKEGKAYVSYTELQPEEGSGQLKEWVALDGQGGMAPRIRISRPMTTSR 1800
Query: 1801 TEGTRKRRRAAAGDYI--------------------CCDCI------------------- 1860
TEGTRKRRRAAAGDYI C C+
Sbjct: 1801 TEGTRKRRRAAAGDYIWSVGDKVDAWMQNSVDRREWICSCLKSATGMKEWLLRRTQKMKQ 1860
Query: 1861 ---------------------------------------------QIVVPQEKRMKLGSP 1920
+I++P EKRMKLGSP
Sbjct: 1861 HILSAFQASRGETSTIKAWNLRPSLIWKDGEWFELSGSYANDYSHEIIMPLEKRMKLGSP 1920
Query: 1921 ATEVKRKDKMPTIVEDVESAKTEDPSLLLISANEKVFNIGRNTQTENKSNPLKTSRTGLQ 1980
A EVKRKDKMPTIVEDVESAK D SLL IS NEKVFNIGRNTQTE K+NPLKTSRTGLQ
Sbjct: 1921 AAEVKRKDKMPTIVEDVESAKPADSSLLSISVNEKVFNIGRNTQTEKKTNPLKTSRTGLQ 1980
Query: 1981 KGASRVIIGVPRPGKKRKFMEVSKHYDADTRTTEANDSTKLAKYLMPHGSTSKVLKRTSK 2040
KG SRVIIGVPRPGKKRKFMEVSKHYD DTRTTEANDSTKLA+YLMP GSTSK LKRTSK
Sbjct: 1981 KGTSRVIIGVPRPGKKRKFMEVSKHYDVDTRTTEANDSTKLARYLMPQGSTSKGLKRTSK 2040
Query: 2041 YDTKEKSANDARPLAVKSGKQPSVSDHAVITKDPESQNESTSGKNDQMDVPPFCSTEEAP 2100
Y+TKEKS ND +PLAVKSGKQPSVSDHAVIT+D ESQN ST GK+DQMDVP CSTEEAP
Sbjct: 2041 YETKEKSTNDGKPLAVKSGKQPSVSDHAVITRDSESQNVSTEGKDDQMDVPSLCSTEEAP 2100
Query: 2101 EGSVLFPPAHAPKKASSFHTKPERANKGKLAPAVGKLAKIEEEKVFNGNPTKPNSNVIEP 2158
EGS+LFPPAHAPKKASSFHTKPERANKG+LAPAVGKLAKIEEEKVFNGN TKPNSNVIEP
Sbjct: 2101 EGSLLFPPAHAPKKASSFHTKPERANKGRLAPAVGKLAKIEEEKVFNGNTTKPNSNVIEP 2160
BLAST of ClCG10G002580 vs. ExPASy TrEMBL
Match:
A0A1S3BKL8 (uncharacterized protein LOC103490876 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103490876 PE=4 SV=1)
HSP 1 Score: 3436.7 bits (8910), Expect = 0.0e+00
Identity = 1866/2189 (85.24%), Postives = 1944/2189 (88.81%), Query Frame = 0
Query: 1 MDYDDNDFQSQNLHLAGEGSAKFPPVLRQYALPKFDFDDTLQGHVRFDSLVEPEVFLGIE 60
MDYDDNDFQSQNLHLAGEGSAKFPPVLRQYALPKFDFDDTLQGHVRFDSLVEPEVFLGIE
Sbjct: 1 MDYDDNDFQSQNLHLAGEGSAKFPPVLRQYALPKFDFDDTLQGHVRFDSLVEPEVFLGIE 60
Query: 61 NSEDNQWIEDYSRGSSGIGFTSCAAESCSILRRKNVWSEATSSESVEMLLKSVGQEDINL 120
N+ED QWIEDYSR SSGIGFTSCAAESCSILRRKNVWSEATSSESVEMLLKSVGQEDINL
Sbjct: 61 NNEDTQWIEDYSRVSSGIGFTSCAAESCSILRRKNVWSEATSSESVEMLLKSVGQEDINL 120
Query: 121 APAVTGESNAREKLDYLTNPMDPTLKDDGSSFCEMGDLQPTLLSNISLEELHVVNEDIRG 180
VTGESNAREKLDYLTNPMDPTLKDDGSSFCEMGDLQPTLLSNISLEELHVVNEDIRG
Sbjct: 121 TATVTGESNAREKLDYLTNPMDPTLKDDGSSFCEMGDLQPTLLSNISLEELHVVNEDIRG 180
Query: 181 E-QQPQGDDPTEFQEICTVDRSLGEVDPDVAHELVDMPASEGSSGIDENSKQTCASTINT 240
E QQPQ D+PTEFQEICTVDRSLGEVDP VAHELVDMPASEGSSGIDENSK+ CA+TINT
Sbjct: 181 EQQQPQRDNPTEFQEICTVDRSLGEVDPGVAHELVDMPASEGSSGIDENSKKPCANTINT 240
Query: 241 SVSISMEDKGQDDFSASGKHINDLVTCTQEGSGKLSSQKIEQQIKDLSENPVNTYIGNIE 300
V++ +EDKGQDDFSASGKHI+DLVTC EGSGKL QKIEQQIKDLS+NPVNT +GNIE
Sbjct: 241 PVALLVEDKGQDDFSASGKHISDLVTCAHEGSGKLGDQKIEQQIKDLSKNPVNTSVGNIE 300
Query: 301 QVVNSHELNKEDQNRLLSPSVPADRLVIESSIATLQSHASMTLKGDCVFHSGSGEVTPEV 360
QVVNSHEL+KEDQN L+SPSVP++RLV+ESSI+ LQSHAS+TLKGDCVFHSGSGEV PEV
Sbjct: 301 QVVNSHELSKEDQNPLISPSVPSERLVVESSISPLQSHASVTLKGDCVFHSGSGEVMPEV 360
Query: 361 PSETDKFDDKVLCSNVEIGNPSKTNMHEVLPTVVKGDARAVVCAGEGKNINAEVCAFQGP 420
SETDK DDKVLCSN+E GNPSK ++ EVLP VV+GDAR C EGKNINAE+CA QGP
Sbjct: 361 SSETDKLDDKVLCSNMEFGNPSKESVCEVLPAVVEGDARTETCV-EGKNINAELCAVQGP 420
Query: 421 KIDSVGQMACAQEIISVDQQRFPSGIEIQTSKSESSASAMEKSNASKVGESSSGHIRDIP 480
KIDSVGQMAC QE+IS + P GIEIQTSKSESSASAME+S ASKVGES+SG+IRDIP
Sbjct: 421 KIDSVGQMACGQEMIS---EHLPLGIEIQTSKSESSASAMEESKASKVGESTSGYIRDIP 480
Query: 481 DKFTKDEHGMISLRDVRGCTLPIEKNLYSEGHLPPTTVAESTQLCEENKLCQSGNDHVTH 540
DKFT+D H GCTLPIE NLY EGHLPPTTVAESTQLCEENKL SGN HV H
Sbjct: 481 DKFTEDVH---------GCTLPIE-NLYFEGHLPPTTVAESTQLCEENKL--SGNVHVEH 540
Query: 541 ASCKEEVRLSSDSISVNGKFAESPVRDKRIVSLSFQESDVESGMIDTKVEYSANA----V 600
ASCKEEVRLSSDS ++NGKFA+SPV DKRIVSLSFQES VESG IDTK+EYSANA V
Sbjct: 541 ASCKEEVRLSSDSTNLNGKFADSPVTDKRIVSLSFQESGVESGTIDTKLEYSANAGDESV 600
Query: 601 SVSTFGDANVRTCDTLQGDSLPVVDALTDRKDADEKEDQLQPGVVEFTQSDSKEESGVII 660
SVSTF ANVRTC TLQGDSL +VDALTDRKDA++KEDQLQP VVE TQSDSKEESGVII
Sbjct: 601 SVSTFEGANVRTCGTLQGDSLLLVDALTDRKDANDKEDQLQPAVVELTQSDSKEESGVII 660
Query: 661 PAEGSFPLSDTSQPVGKFHPLSEAEKSACLLTGQGFGESIDQTISKNLNSDDCNRESQSI 720
PAEGS DTSQPVGK HPLSEAE S +LTGQG GESIDQ+I KNLNS DC+RESQSI
Sbjct: 661 PAEGSSVRLDTSQPVGKLHPLSEAENSTPVLTGQGSGESIDQSILKNLNSSDCSRESQSI 720
Query: 721 PQADIPNNVIQDCGQEMDIDPAFSKSSAKACDSGVKKSDEKSFPPDATSLTPLPGETLDN 780
PQADIPNNVIQDCGQEMD+DPA SKS+A ACDSG K+S DAT LT PGETLDN
Sbjct: 721 PQADIPNNVIQDCGQEMDVDPAISKSTAIACDSGGKQS-------DATPLTQRPGETLDN 780
Query: 781 YQKDQESTKVVSESVGNNCQQAIAVNIDSDAGKKEGSLCSADFPQSHEQMSVVGNGNSTA 840
YQKDQES KVVSE+VGNNCQQ IA NIDS AGKKEGSLCSA F QSHEQ SV GNGNSTA
Sbjct: 781 YQKDQESRKVVSETVGNNCQQVIAANIDSGAGKKEGSLCSAVFSQSHEQTSVTGNGNSTA 840
Query: 841 DKPPPNLPDVVKTTVVAHDPDVKDCNMRPASKNVEAAEAKDRLVGNTSSGSQLPKGNVAS 900
KPPPNLPDVVK TV AHDPDVKDCN P SKNVEAAE KDRLVGN SSGSQLPK NV S
Sbjct: 841 AKPPPNLPDVVKATVGAHDPDVKDCNKVPPSKNVEAAELKDRLVGNASSGSQLPKENVVS 900
Query: 901 ESETALTFESSSLVDLPKNDSGIAVATAATASLVAEGP--QSSSGLSKLDIKSAQDISHS 960
ESETALTF+SSSLVDLPKNDSGIAVATAA+ASLV E P QSSSG SKLDIK+A+DISHS
Sbjct: 901 ESETALTFQSSSLVDLPKNDSGIAVATAASASLVVEAPQSQSSSGPSKLDIKTARDISHS 960
Query: 961 SPHVSEVKVARARSKGTPERKPRRASAKGLGKESSTKGNHTKKSEKVEKSNSTPISNPGI 1020
SPHVSEVKVAR+RSKGTPER+PRRASAKGLGK+SSTKG+ TKKSEKVEKSNSTPISNPGI
Sbjct: 961 SPHVSEVKVARSRSKGTPERRPRRASAKGLGKDSSTKGSQTKKSEKVEKSNSTPISNPGI 1020
Query: 1021 FQLAQSNEMQHHGHVESSGAKPFVFIGASTSSIPDLNNSASPSPMFQQPFTDLQQVQLRA 1080
FQLAQSNEMQHHGHVESSGAKPFVFIGASTSS+PDLNNSASPSPMFQQPFTDLQQVQLRA
Sbjct: 1021 FQLAQSNEMQHHGHVESSGAKPFVFIGASTSSLPDLNNSASPSPMFQQPFTDLQQVQLRA 1080
Query: 1081 QIFVYGALIQGTAPDEAYMLSAFGGPDGGTNLWENAWRMCVDRLNGKKSQPTNPETPSQS 1140
QIFVYGALIQGTAPDEAYMLSAFGGPDGGTNLWENAWRMCVDRLNGKKSQ NPETPSQS
Sbjct: 1081 QIFVYGALIQGTAPDEAYMLSAFGGPDGGTNLWENAWRMCVDRLNGKKSQTINPETPSQS 1140
Query: 1141 QSGGRSTEQANKQSTLQSKITSPPVSRVSSKSTSTVLNPMIPLSSPLWSISTPSNALQSS 1200
QS GRSTEQA+KQSTLQSKI SPPVSRVSSKSTSTVLNPMIPLSSPLWSISTPSNALQSS
Sbjct: 1141 QSVGRSTEQASKQSTLQSKIISPPVSRVSSKSTSTVLNPMIPLSSPLWSISTPSNALQSS 1200
Query: 1201 IVPRSPVIDYQQALTPLHPYQTPPVRNFIGHNLSWFSQAPFHSTWVATQTSTPDSSARFS 1260
IVPRSPVIDYQQALTPLHPYQTPPVRNFIGHNLSWFSQAPFHSTWVATQTSTPDSSARFS
Sbjct: 1201 IVPRSPVIDYQQALTPLHPYQTPPVRNFIGHNLSWFSQAPFHSTWVATQTSTPDSSARFS 1260
Query: 1261 GLPITEPVHLTPVKESSVSQSSAVKPSGSMVHGGTPGNVFTGASPLLELKKVSVTTGQNS 1320
GLPITEPVHLTPVKESSVSQSSA+KPSGS+V GGTPGNVFTG+SPL ELKKVSVTTGQN
Sbjct: 1261 GLPITEPVHLTPVKESSVSQSSAMKPSGSLVQGGTPGNVFTGSSPLHELKKVSVTTGQNP 1320
Query: 1321 TESKMRRRKKNTVSEDPGLITIQQVQPHLKPVPAVVTTTISTLATSPSVHPKAASENLIL 1380
ESKMRRRKKNTVSEDPGLIT+ QVQPHLKPVPAVVTTTISTLATSPSVHPKA SEN+IL
Sbjct: 1321 AESKMRRRKKNTVSEDPGLITM-QVQPHLKPVPAVVTTTISTLATSPSVHPKATSENVIL 1380
Query: 1381 SPPPLCPTTHAKSAGQDLRGRAMFSDETLGKVREAKQLAEDAALFASEAVKHSAEVWSQL 1440
SPPPLCPT H K+AGQDLRG+ MFS+ETLGKVREAKQLAEDAALFASEAVKHSAEVWSQL
Sbjct: 1381 SPPPLCPTAHPKTAGQDLRGKPMFSEETLGKVREAKQLAEDAALFASEAVKHSAEVWSQL 1440
Query: 1441 DRQKNSELVPDVEAKLASAAIAIAAAAAVAKAAAAAANVASNAACQAKLMADEAVTSSSH 1500
RQKNSELV DVEAKLASAA AIAAAAAVAKAAAAAANVASNAACQAKLMADEA +SSS
Sbjct: 1441 GRQKNSELVSDVEAKLASAAAAIAAAAAVAKAAAAAANVASNAACQAKLMADEAFSSSSP 1500
Query: 1501 DVPCQSNEFSIHGSAVGVGKATPASILRGEDGGNGSSSIIIAAREAARKRVEAASAASKH 1560
++ CQSNEFS+HGSAVG GKATPASILRGEDGGNGSSSIIIAAREAARKRVEAASAASKH
Sbjct: 1501 EISCQSNEFSVHGSAVGAGKATPASILRGEDGGNGSSSIIIAAREAARKRVEAASAASKH 1560
Query: 1561 AENVDAIVKAAELAAAAVSQAGKLVAMGDPLPLGKLVEAGPDGYWKTPQVSSELVMRSDD 1620
AENVDAIV+AAELAAAAVSQAG LVAMGDPLPLGKLVEAGP+GYWKTPQVSSEL+MR DD
Sbjct: 1561 AENVDAIVRAAELAAAAVSQAGNLVAMGDPLPLGKLVEAGPEGYWKTPQVSSELIMRPDD 1620
Query: 1621 VNGGRSNSAIKRPRDGSSSKNEIQVSGGAKSPIPGEISMGSVENHSKLVDGITSSVAPRE 1680
VNGG SN AIKRPRDG SSKNEIQ S AK IP EISMGSVENH KLVDGITS VAPRE
Sbjct: 1621 VNGGSSNLAIKRPRDGLSSKNEIQPSVSAKPSIPEEISMGSVENHPKLVDGITSCVAPRE 1680
Query: 1681 KDLRGQKDQNASDLTKTIGVVPESEVGERSSQDECEKAKDLRQSSIKEGSHVEVFKDGNG 1740
K LRGQKDQNASDLTKTIGVVPESEVGER SQDECEKAKDLRQSSIKEGSHVEVFKDGNG
Sbjct: 1681 KGLRGQKDQNASDLTKTIGVVPESEVGERLSQDECEKAKDLRQSSIKEGSHVEVFKDGNG 1740
Query: 1741 LKASWFTASVLSLKEGKAYVSYTELQPEEGSGQLKEWVALDGQGGMAPRIRISRPMTNLR 1800
LKASWFTASVLSLKEGKAYVSYTELQPEEGSGQLKEWVALDGQGGMAPRIRISRPMT R
Sbjct: 1741 LKASWFTASVLSLKEGKAYVSYTELQPEEGSGQLKEWVALDGQGGMAPRIRISRPMTTSR 1800
Query: 1801 TEGTRKRRRAAAGDYICC--DCI------------------------------------- 1860
TEGTRKRRRAAAGDYI D +
Sbjct: 1801 TEGTRKRRRAAAGDYIWSVGDKVDAWMQNSWHEGVVVEKNAKDETAYIVRFPARGETSTI 1860
Query: 1861 -----------------------------QIVVPQEKRMKLGSPATEVKRKDKMPTIVED 1920
+I++P EKRMKLGSPA EVKRKDKMPTIVED
Sbjct: 1861 KAWNLRPSLIWKDGEWFELSGSYANDYSHEIIMPLEKRMKLGSPAAEVKRKDKMPTIVED 1920
Query: 1921 VESAKTEDPSLLLISANEKVFNIGRNTQTENKSNPLKTSRTGLQKGASRVIIGVPRPGKK 1980
VESAK D SLL IS NEKVFNIGRNTQTE K+NPLKTSRTGLQKG SRVIIGVPRPGKK
Sbjct: 1921 VESAKPADSSLLSISVNEKVFNIGRNTQTEKKTNPLKTSRTGLQKGTSRVIIGVPRPGKK 1980
Query: 1981 RKFMEVSKHYDADTRTTEANDSTKLAKYLMPHGSTSKVLKRTSKYDTKEKSANDARPLAV 2040
RKFMEVSKHYD DTRTTEANDSTKLA+YLMP GSTSK LKRTSKY+TKEKS ND +PLAV
Sbjct: 1981 RKFMEVSKHYDVDTRTTEANDSTKLARYLMPQGSTSKGLKRTSKYETKEKSTNDGKPLAV 2040
Query: 2041 KSGKQPSVSDHAVITKDPESQNESTSGKNDQMDVPPFCSTEEAPEGSVLFPPAHAPKKAS 2100
KSGKQPSVSDHAVIT+D ESQN ST GKNDQMDVP CSTEEAPEGS+LFPPAHAPKKAS
Sbjct: 2041 KSGKQPSVSDHAVITRDSESQNVSTEGKNDQMDVPSLCSTEEAPEGSLLFPPAHAPKKAS 2100
Query: 2101 SFHTKPERANKGKLAPAVGKLAKIEEEKVFNGNPTKPNSNVIEPRRSNRRIQPTSRLLEG 2115
SFHTKPERANKG+LAPAVGKLAKIEEEKVFNGN TKPNSNVIEPRRSNRRIQPTSRLLEG
Sbjct: 2101 SFHTKPERANKGRLAPAVGKLAKIEEEKVFNGNTTKPNSNVIEPRRSNRRIQPTSRLLEG 2160
BLAST of ClCG10G002580 vs. ExPASy TrEMBL
Match:
A0A0A0L268 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G019930 PE=4 SV=1)
HSP 1 Score: 3416.7 bits (8858), Expect = 0.0e+00
Identity = 1854/2187 (84.77%), Postives = 1933/2187 (88.39%), Query Frame = 0
Query: 1 MDYDDNDFQSQNLHLAGEGSAKFPPVLRQYALPKFDFDDTLQGHVRFDSLVEPEVFLGIE 60
MDYDDNDFQSQNLHLAGEGSAKFPPVLRQYALPKFDFDDTLQG VRFD LVEPEVFLGIE
Sbjct: 1 MDYDDNDFQSQNLHLAGEGSAKFPPVLRQYALPKFDFDDTLQGPVRFDGLVEPEVFLGIE 60
Query: 61 NSEDNQWIEDYSRGSSGIGFTSCAAESCSILRRKNVWSEATSSESVEMLLKSVGQEDINL 120
N+ED QWIEDYSR SSGIGFTSCAAESCSILRRKNVWSEATSSESVEMLLKSVGQEDINL
Sbjct: 61 NNEDTQWIEDYSRVSSGIGFTSCAAESCSILRRKNVWSEATSSESVEMLLKSVGQEDINL 120
Query: 121 APAVTGESNAREKLDYLTNPMDPTLKDDGSSFCEMGDLQPTLLSNISLEELHVVNEDIRG 180
AP VTGESNAREKLDYLTNPMDPTLKDDGSSFCEMGDLQPTLLSNISLEELHVVNE+IRG
Sbjct: 121 APTVTGESNAREKLDYLTNPMDPTLKDDGSSFCEMGDLQPTLLSNISLEELHVVNEEIRG 180
Query: 181 E-QQPQGDDPTEFQEICTVDRSLGEVDPDVAHELVDMPASEGSSGIDENSKQTCASTINT 240
E QQPQ D+PTEFQEICTVDRSLGEVDP VAHELVDMPASEGSSGIDENSK+T ASTINT
Sbjct: 181 EQQQPQRDNPTEFQEICTVDRSLGEVDPGVAHELVDMPASEGSSGIDENSKKTFASTINT 240
Query: 241 SVSISMEDKGQDDFSASGKHINDLVTCTQEGSGKLSSQKIEQQIKDLSENPVNTYIGNIE 300
VS+ EDKGQDDFSASGKHI+DLVTC EGSGKL SQKIEQQIKDLS+NPVNTY+GNIE
Sbjct: 241 PVSLLAEDKGQDDFSASGKHIDDLVTCAHEGSGKLGSQKIEQQIKDLSKNPVNTYVGNIE 300
Query: 301 QVVNSHELNKEDQNRLLSPSVPADRLVIESSIATLQSHASMTLKGDCVFHSGSGEVTPEV 360
QVVNSHEL+KE+QN LLSPSVP++RLV+ESSI+ LQSHASMTLKGDCVFHSGSG+V PEV
Sbjct: 301 QVVNSHELSKENQNPLLSPSVPSERLVVESSISPLQSHASMTLKGDCVFHSGSGKVMPEV 360
Query: 361 PSETDKFDDKVLCSNVEIGNPSKTNMHEVLPTVVKGDARAVVCAGEGKNINAEVCAFQGP 420
PSETDK DDKVLCSN+E GNPSK ++ EVLP VV+GDAR C EGKNINAEVCA QGP
Sbjct: 361 PSETDKLDDKVLCSNMEFGNPSKESVCEVLPAVVEGDARTETCV-EGKNINAEVCAVQGP 420
Query: 421 KIDSVGQMACAQEIISVDQQRFPSGIEIQTSKSESSASAMEKSNASKVGESSSGHIRDIP 480
+IDSVGQMAC QE+IS + P GIEIQTSKSE SA AME+S AS GESSSGHIRDIP
Sbjct: 421 RIDSVGQMACGQEMIS---EHLPLGIEIQTSKSELSAFAMEESRAS--GESSSGHIRDIP 480
Query: 481 DKFTKDEHGMISLRDVRGCTLPIEKNLYSEGHLPPTTVAESTQLCEENKLCQSGNDHVTH 540
DKFT+ DVRGCT +NLY EGHLPPTTVAESTQLCEENKLCQSGN HV H
Sbjct: 481 DKFTE---------DVRGCTRHSIENLYFEGHLPPTTVAESTQLCEENKLCQSGNVHVEH 540
Query: 541 ASCKEEVRLSSDSISVNGKFAESPVRDKRIVSLSFQESDVESGMIDTKVEYSANA----V 600
ASCKEEVRLSSDS VNGKFA+SPV DKRI LSFQES +ESG IDTK+EYSANA V
Sbjct: 541 ASCKEEVRLSSDSTCVNGKFADSPVTDKRIAPLSFQESGIESGTIDTKLEYSANAGDESV 600
Query: 601 SVSTFGDANVRTCDTLQGDSLPVVDALTDRKDADEKEDQLQPGVVEFTQSDSKEESGVII 660
SVSTF NVRTCDTLQGDSLP+VDALTDRKDA++KEDQLQP VVE +QSDSKEESGVII
Sbjct: 601 SVSTFEGTNVRTCDTLQGDSLPLVDALTDRKDANDKEDQLQPAVVELSQSDSKEESGVII 660
Query: 661 PAEGSFPLSDTSQPVGKFHPLSEAEKSACLLTGQGFGESIDQTISKNLNSDDCNRESQSI 720
PAEGS P +T QPVGK H LSEAE S +LTG G ESIDQ+I KN NS DCNRESQS
Sbjct: 661 PAEGSSPRLNTYQPVGKLHLLSEAENSTPVLTGHGSCESIDQSIPKNFNSSDCNRESQSK 720
Query: 721 PQADIPNNVIQDCGQEMDIDPAFSKSSAKACDSGVKKSDEKSFPPDATSLTPLPGETLDN 780
P+ADIPNNVIQDCGQEMDIDPA SKS+A ACDSG K+S DATSLT PGETLDN
Sbjct: 721 PEADIPNNVIQDCGQEMDIDPAISKSTAIACDSGGKQS-------DATSLTQRPGETLDN 780
Query: 781 YQKDQESTKVVSESVGNNCQQAIAVNIDSDAGKKEGSLCSADFPQSHEQMSVVGNGNSTA 840
YQKDQES KV SE+VGNNCQQ IA+NIDS AGKKEGSLCSA F QSHEQ SV GNGNSTA
Sbjct: 781 YQKDQESRKVFSETVGNNCQQVIALNIDSSAGKKEGSLCSATFSQSHEQTSVTGNGNSTA 840
Query: 841 DKPPPNLPDVVKTTVVAHDPDVKDCNMRPASKNVEAAEAKDRLVGNTSSGSQLPKGNVAS 900
K PNL DVVK TV AHDPDVKDCN P SKNVEAAE KDRLVG+ SGSQLPK NV S
Sbjct: 841 AKSSPNLSDVVKATVGAHDPDVKDCNKVPPSKNVEAAEVKDRLVGDAPSGSQLPKENVVS 900
Query: 901 ESETALTFESSSLVDLPKNDSGIAVATAATASLVAEGPQSSSGLSKLDIKSAQDISHSSP 960
ESETALTF+SSSLVDLPKNDSGIAVATAA+ASLV E PQSSSG SKLDIKSA+DISHSSP
Sbjct: 901 ESETALTFQSSSLVDLPKNDSGIAVATAASASLVVEAPQSSSGPSKLDIKSARDISHSSP 960
Query: 961 HVSEVKVARARSKGTPERKPRRASAKGLGKESSTKGNHTKKSEKVEKSNSTPISNPGIFQ 1020
HVSEVKVAR+RSKGTPERKPRRASAKGLGKESSTKG+ TKKSEKVEKSNST ISNPGIFQ
Sbjct: 961 HVSEVKVARSRSKGTPERKPRRASAKGLGKESSTKGSQTKKSEKVEKSNSTAISNPGIFQ 1020
Query: 1021 LAQSNEMQHHGHVESSGAKPFVFIGASTSSIPDLNNSASPSPMFQQPFTDLQQVQLRAQI 1080
LAQSNEMQ HGHVESSGAKP VFIGASTSS+PDLNNSASPSPMFQQPFTDLQQVQLRAQI
Sbjct: 1021 LAQSNEMQQHGHVESSGAKPAVFIGASTSSLPDLNNSASPSPMFQQPFTDLQQVQLRAQI 1080
Query: 1081 FVYGALIQGTAPDEAYMLSAFGGPDGGTNLWENAWRMCVDRLNGKKSQPTNPETPSQSQS 1140
FVYGALIQGTAPDEAYMLSAFGGPDGGTNLWENAWRMCVDR NGKKSQ NPETPSQSQS
Sbjct: 1081 FVYGALIQGTAPDEAYMLSAFGGPDGGTNLWENAWRMCVDRFNGKKSQTINPETPSQSQS 1140
Query: 1141 GGRSTEQANKQSTLQSKITSPPVSRVSSKSTSTVLNPMIPLSSPLWSISTPSNALQSSIV 1200
GGRSTEQA+KQSTLQSKI SPPVSRVSSKSTSTVLNPMIPLSSPLWSISTPSNALQSSIV
Sbjct: 1141 GGRSTEQASKQSTLQSKIISPPVSRVSSKSTSTVLNPMIPLSSPLWSISTPSNALQSSIV 1200
Query: 1201 PRSPVIDYQQALTPLHPYQTPPVRNFIGHNLSWFSQAPFHSTWVATQTSTPDSSARFSGL 1260
PRSPVIDYQQALTPLHPYQTPPVRNFIGHNLSWFSQAPFHSTWVATQTSTPDSSARFSGL
Sbjct: 1201 PRSPVIDYQQALTPLHPYQTPPVRNFIGHNLSWFSQAPFHSTWVATQTSTPDSSARFSGL 1260
Query: 1261 PITEPVHLTPVKESSVSQSSAVKPSGSMVHGGTPGNVFTGASPLLELKKVSVTTGQNSTE 1320
PITEPVHLTPVKESSV QSSA+KPSGS+VH G PGNVFTGASPL ELK+VSVTTGQN TE
Sbjct: 1261 PITEPVHLTPVKESSVPQSSAMKPSGSLVHSGNPGNVFTGASPLHELKQVSVTTGQNPTE 1320
Query: 1321 SKMRRRKKNTVSEDPGLITIQQVQPHLKPVPAVVTTTISTLATSPSVHPKAASENLILSP 1380
SKMRRRKKN+VSEDPGLIT+ QVQPHLKPVPAVVTTTISTL TSPSVH KA SEN+ILSP
Sbjct: 1321 SKMRRRKKNSVSEDPGLITM-QVQPHLKPVPAVVTTTISTLVTSPSVHLKATSENVILSP 1380
Query: 1381 PPLCPTTHAKSAGQDLRGRAMFSDETLGKVREAKQLAEDAALFASEAVKHSAEVWSQLDR 1440
PPLCPT H K+AGQDLRG+ MFS+ETLGKVREAKQLAEDAALFASEAVKHSAEVWSQL R
Sbjct: 1381 PPLCPTAHPKAAGQDLRGKPMFSEETLGKVREAKQLAEDAALFASEAVKHSAEVWSQLGR 1440
Query: 1441 QKNSELVPDVEAKLASAAIAIAAAAAVAKAAAAAANVASNAACQAKLMADEAVTSSSHDV 1500
QKNSELV DVEAKLASAA+AIAAAAAVAKAAAAAANVASNAACQAKLMADEA +SSS ++
Sbjct: 1441 QKNSELVSDVEAKLASAAVAIAAAAAVAKAAAAAANVASNAACQAKLMADEAFSSSSPEL 1500
Query: 1501 PCQSNEFSIHGSAVGVGKATPASILRGEDGGNGSSSIIIAAREAARKRVEAASAASKHAE 1560
CQSNEFS+HGSAVGVGKATPASILRGEDGGNGSSSIIIAAREAARKRVEAASAASKHAE
Sbjct: 1501 SCQSNEFSVHGSAVGVGKATPASILRGEDGGNGSSSIIIAAREAARKRVEAASAASKHAE 1560
Query: 1561 NVDAIVKAAELAAAAVSQAGKLVAMGDPLPLGKLVEAGPDGYWKTPQVSSELVMRSDDVN 1620
NVDAIV+AAELAAAAVSQAGKLVAMGDPLPLGKLVEAGP+GYW+TPQVSSELVM+ DDVN
Sbjct: 1561 NVDAIVRAAELAAAAVSQAGKLVAMGDPLPLGKLVEAGPEGYWRTPQVSSELVMKPDDVN 1620
Query: 1621 GGRSNSAIKRPRDGSSSKNEIQVSGGAKSPIPGEISMGSVENHSKLVDGITSSVAPREKD 1680
GG SN AIKRPRDGSSSKNEIQ S AK IPGEISMGSVENH KLVDGITS VAPREKD
Sbjct: 1621 GGSSNLAIKRPRDGSSSKNEIQASVSAKPSIPGEISMGSVENHPKLVDGITSCVAPREKD 1680
Query: 1681 LRGQKDQNASDLTKTIGVVPESEVGERSSQDECEKAKDLRQSSIKEGSHVEVFKDGNGLK 1740
LRGQKDQNASDLTKTIGVVPESEVGERSSQDECEKAKDLRQSSIKEGSHVEVFKDGNGLK
Sbjct: 1681 LRGQKDQNASDLTKTIGVVPESEVGERSSQDECEKAKDLRQSSIKEGSHVEVFKDGNGLK 1740
Query: 1741 ASWFTASVLSLKEGKAYVSYTELQPEEGSGQLKEWVALDGQGGMAPRIRISRPMTNLRTE 1800
ASWFTASVLSLKEGKAYVSYTELQPEEGSGQLKEWVALDGQGGMAPRIR+SRPMT RTE
Sbjct: 1741 ASWFTASVLSLKEGKAYVSYTELQPEEGSGQLKEWVALDGQGGMAPRIRVSRPMTTSRTE 1800
Query: 1801 GTRKRRRAAAGDYICC--DCI--------------------------------------- 1860
GTRKRRRAAAGDYI D +
Sbjct: 1801 GTRKRRRAAAGDYIWSVGDKVDAWMQNSWHEGVVVEKNAKDETAYIVRFPARGETSTIKA 1860
Query: 1861 ---------------------------QIVVPQEKRMKLGSPATEVKRKDKMPTIVEDVE 1920
+I++PQEKRMKLGSPA EVKRKDKMPTIVEDVE
Sbjct: 1861 WNLRPSLIWKDGEWFELSGSHANDYSHEIIMPQEKRMKLGSPAAEVKRKDKMPTIVEDVE 1920
Query: 1921 SAKTEDPSLLLISANEKVFNIGRNTQTENKSNPLKTSRTGLQKGASRVIIGVPRPGKKRK 1980
S K +PSLL ISANEKVFNIGRNTQTE K+NPLKTSRTGLQKG SRVIIGVPRPGKKRK
Sbjct: 1921 STKPSNPSLLSISANEKVFNIGRNTQTEKKTNPLKTSRTGLQKGTSRVIIGVPRPGKKRK 1980
Query: 1981 FMEVSKHYDADTRTTEANDSTKLAKYLMPHGSTSKVLKRTSKYDTKEKSANDARPLAVKS 2040
FMEVSKHYD DTRTTEANDS+KLAKYLMP GSTSK LKRTSKY+TKEKS NDA+PLAVKS
Sbjct: 1981 FMEVSKHYDVDTRTTEANDSSKLAKYLMPQGSTSKGLKRTSKYETKEKSTNDAKPLAVKS 2040
Query: 2041 GKQPSVSDHAVITKDPESQNESTSGKNDQMDVPPFCSTEEAPEGSVLFPPAHAPKKASSF 2100
GKQPSVSDHAVI KD ESQN T GK+DQM+VP FCSTE APEGS+LFPPAHAPKKA SF
Sbjct: 2041 GKQPSVSDHAVIIKDSESQNVRTEGKDDQMEVPSFCSTEAAPEGSLLFPPAHAPKKAPSF 2100
Query: 2101 HTKPERANKGKLAPAVGKLAKIEEEKVFNGNPTKPNSNVIEPRRSNRRIQPTSRLLEGLQ 2115
HTKPERANKGKLAPAVGKLAKIEEEKVFNGN TKPNSNVIEPRRSNRRIQPTSRLLEGLQ
Sbjct: 2101 HTKPERANKGKLAPAVGKLAKIEEEKVFNGNTTKPNSNVIEPRRSNRRIQPTSRLLEGLQ 2160
BLAST of ClCG10G002580 vs. ExPASy TrEMBL
Match:
A0A1S3BK14 (uncharacterized protein LOC103490876 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103490876 PE=4 SV=1)
HSP 1 Score: 3387.0 bits (8781), Expect = 0.0e+00
Identity = 1844/2185 (84.39%), Postives = 1922/2185 (87.96%), Query Frame = 0
Query: 1 MDYDDNDFQSQNLHLAGEGSAKFPPVLRQYALPKFDFDDTLQGHVRFDSLVEPEVFLGIE 60
MDYDDNDFQSQNLHLAGEGSAKFPPVLRQYALPKFDFDDTLQGHVRFDSLVEPEVFLGIE
Sbjct: 1 MDYDDNDFQSQNLHLAGEGSAKFPPVLRQYALPKFDFDDTLQGHVRFDSLVEPEVFLGIE 60
Query: 61 NSEDNQWIEDYSRGSSGIGFTSCAAESCSILRRKNVWSEATSSESVEMLLKSVGQEDINL 120
N+ED QWIEDYSR SSGIGFTSCAAESCSILRRKNVWSEATSSESVEMLLKSVGQEDINL
Sbjct: 61 NNEDTQWIEDYSRVSSGIGFTSCAAESCSILRRKNVWSEATSSESVEMLLKSVGQEDINL 120
Query: 121 APAVTGESNAREKLDYLTNPMDPTLKDDGSSFCEMGDLQPTLLSNISLEELHVVNEDIRG 180
VTGESNAREKLDYLTNPMDPTLKDDGSSFCEMGDLQPTLLSNISLEELHVVNEDIRG
Sbjct: 121 TATVTGESNAREKLDYLTNPMDPTLKDDGSSFCEMGDLQPTLLSNISLEELHVVNEDIRG 180
Query: 181 E-QQPQGDDPTEFQEICTVDRSLGEVDPDVAHELVDMPASEGSSGIDENSKQTCASTINT 240
E QQPQ D+PTEFQEICTVDRSLGEVDP VAHELVDMPASEGSSGIDENSK+ CA+TINT
Sbjct: 181 EQQQPQRDNPTEFQEICTVDRSLGEVDPGVAHELVDMPASEGSSGIDENSKKPCANTINT 240
Query: 241 SVSISMEDKGQDDFSASGKHINDLVTCTQEGSGKLSSQKIEQQIKDLSENPVNTYIGNIE 300
V++ +EDKGQDDFSASGKHI+DLVTC EGSGKL QKIEQQIKDLS+NPVNT +GNIE
Sbjct: 241 PVALLVEDKGQDDFSASGKHISDLVTCAHEGSGKLGDQKIEQQIKDLSKNPVNTSVGNIE 300
Query: 301 QVVNSHELNKEDQNRLLSPSVPADRLVIESSIATLQSHASMTLKGDCVFHSGSGEVTPEV 360
QVVNSHEL+KEDQN L+SPSVP++RLV+ESSI+ LQSHAS+TLKGDCVFHSGSGEV PEV
Sbjct: 301 QVVNSHELSKEDQNPLISPSVPSERLVVESSISPLQSHASVTLKGDCVFHSGSGEVMPEV 360
Query: 361 PSETDKFDDKVLCSNVEIGNPSKTNMHEVLPTVVKGDARAVVCAGEGKNINAEVCAFQGP 420
SETDK DDKVLCSN+E GNPSK ++ EVLP VV+GDAR C EGKNINAE+CA QGP
Sbjct: 361 SSETDKLDDKVLCSNMEFGNPSKESVCEVLPAVVEGDARTETCV-EGKNINAELCAVQGP 420
Query: 421 KIDSVGQMACAQEIISVDQQRFPSGIEIQTSKSESSASAMEKSNASKVGESSSGHIRDIP 480
KIDSVGQMAC QE+IS + P GIEIQTSKSESSASAME+S ASKVGES+SG+IRDIP
Sbjct: 421 KIDSVGQMACGQEMIS---EHLPLGIEIQTSKSESSASAMEESKASKVGESTSGYIRDIP 480
Query: 481 DKFTKDEHGMISLRDVRGCTLPIEKNLYSEGHLPPTTVAESTQLCEENKLCQSGNDHVTH 540
DKFT+D H GCTLPIE NLY EGHLPPTTVAESTQLCEENKL SGN HV H
Sbjct: 481 DKFTEDVH---------GCTLPIE-NLYFEGHLPPTTVAESTQLCEENKL--SGNVHVEH 540
Query: 541 ASCKEEVRLSSDSISVNGKFAESPVRDKRIVSLSFQESDVESGMIDTKVEYSANAVSVST 600
ASCKEEVRLSSDS ++NGKFA+SPV DKRIVSLSFQES VESG IDTK+EYSANA
Sbjct: 541 ASCKEEVRLSSDSTNLNGKFADSPVTDKRIVSLSFQESGVESGTIDTKLEYSANA----- 600
Query: 601 FGDANVRTCDTLQGDSLPVVDALTDRKDADEKEDQLQPGVVEFTQSDSKEESGVIIPAEG 660
GD +DRKDA++KEDQLQP VVE TQSDSKEESGVIIPAEG
Sbjct: 601 -------------GDE-------SDRKDANDKEDQLQPAVVELTQSDSKEESGVIIPAEG 660
Query: 661 SFPLSDTSQPVGKFHPLSEAEKSACLLTGQGFGESIDQTISKNLNSDDCNRESQSIPQAD 720
S DTSQPVGK HPLSEAE S +LTGQG GESIDQ+I KNLNS DC+RESQSIPQAD
Sbjct: 661 SSVRLDTSQPVGKLHPLSEAENSTPVLTGQGSGESIDQSILKNLNSSDCSRESQSIPQAD 720
Query: 721 IPNNVIQDCGQEMDIDPAFSKSSAKACDSGVKKSDEKSFPPDATSLTPLPGETLDNYQKD 780
IPNNVIQDCGQEMD+DPA SKS+A ACDSG K+S DAT LT PGETLDNYQKD
Sbjct: 721 IPNNVIQDCGQEMDVDPAISKSTAIACDSGGKQS-------DATPLTQRPGETLDNYQKD 780
Query: 781 QESTKVVSESVGNNCQQAIAVNIDSDAGKKEGSLCSADFPQSHEQMSVVGNGNSTADKPP 840
QES KVVSE+VGNNCQQ IA NIDS AGKKEGSLCSA F QSHEQ SV GNGNSTA KPP
Sbjct: 781 QESRKVVSETVGNNCQQVIAANIDSGAGKKEGSLCSAVFSQSHEQTSVTGNGNSTAAKPP 840
Query: 841 PNLPDVVKTTVVAHDPDVKDCNMRPASKNVEAAEAKDRLVGNTSSGSQLPKGNVASESET 900
PNLPDVVK TV AHDPDVKDCN P SKNVEAAE KDRLVGN SSGSQLPK NV SESET
Sbjct: 841 PNLPDVVKATVGAHDPDVKDCNKVPPSKNVEAAELKDRLVGNASSGSQLPKENVVSESET 900
Query: 901 ALTFESSSLVDLPKNDSGIAVATAATASLVAEGP--QSSSGLSKLDIKSAQDISHSSPHV 960
ALTF+SSSLVDLPKNDSGIAVATAA+ASLV E P QSSSG SKLDIK+A+DISHSSPHV
Sbjct: 901 ALTFQSSSLVDLPKNDSGIAVATAASASLVVEAPQSQSSSGPSKLDIKTARDISHSSPHV 960
Query: 961 SEVKVARARSKGTPERKPRRASAKGLGKESSTKGNHTKKSEKVEKSNSTPISNPGIFQLA 1020
SEVKVAR+RSKGTPER+PRRASAKGLGK+SSTKG+ TKKSEKVEKSNSTPISNPGIFQLA
Sbjct: 961 SEVKVARSRSKGTPERRPRRASAKGLGKDSSTKGSQTKKSEKVEKSNSTPISNPGIFQLA 1020
Query: 1021 QSNEMQHHGHVESSGAKPFVFIGASTSSIPDLNNSASPSPMFQQPFTDLQQVQLRAQIFV 1080
QSNEMQHHGHVESSGAKPFVFIGASTSS+PDLNNSASPSPMFQQPFTDLQQVQLRAQIFV
Sbjct: 1021 QSNEMQHHGHVESSGAKPFVFIGASTSSLPDLNNSASPSPMFQQPFTDLQQVQLRAQIFV 1080
Query: 1081 YGALIQGTAPDEAYMLSAFGGPDGGTNLWENAWRMCVDRLNGKKSQPTNPETPSQSQSGG 1140
YGALIQGTAPDEAYMLSAFGGPDGGTNLWENAWRMCVDRLNGKKSQ NPETPSQSQS G
Sbjct: 1081 YGALIQGTAPDEAYMLSAFGGPDGGTNLWENAWRMCVDRLNGKKSQTINPETPSQSQSVG 1140
Query: 1141 RSTEQANKQSTLQSKITSPPVSRVSSKSTSTVLNPMIPLSSPLWSISTPSNALQSSIVPR 1200
RSTEQA+KQSTLQSKI SPPVSRVSSKSTSTVLNPMIPLSSPLWSISTPSNALQSSIVPR
Sbjct: 1141 RSTEQASKQSTLQSKIISPPVSRVSSKSTSTVLNPMIPLSSPLWSISTPSNALQSSIVPR 1200
Query: 1201 SPVIDYQQALTPLHPYQTPPVRNFIGHNLSWFSQAPFHSTWVATQTSTPDSSARFSGLPI 1260
SPVIDYQQALTPLHPYQTPPVRNFIGHNLSWFSQAPFHSTWVATQTSTPDSSARFSGLPI
Sbjct: 1201 SPVIDYQQALTPLHPYQTPPVRNFIGHNLSWFSQAPFHSTWVATQTSTPDSSARFSGLPI 1260
Query: 1261 TEPVHLTPVKESSVSQSSAVKPSGSMVHGGTPGNVFTGASPLLELKKVSVTTGQNSTESK 1320
TEPVHLTPVKESSVSQSSA+KPSGS+V GGTPGNVFTG+SPL ELKKVSVTTGQN ESK
Sbjct: 1261 TEPVHLTPVKESSVSQSSAMKPSGSLVQGGTPGNVFTGSSPLHELKKVSVTTGQNPAESK 1320
Query: 1321 MRRRKKNTVSEDPGLITIQQVQPHLKPVPAVVTTTISTLATSPSVHPKAASENLILSPPP 1380
MRRRKKNTVSEDPGLIT+ QVQPHLKPVPAVVTTTISTLATSPSVHPKA SEN+ILSPPP
Sbjct: 1321 MRRRKKNTVSEDPGLITM-QVQPHLKPVPAVVTTTISTLATSPSVHPKATSENVILSPPP 1380
Query: 1381 LCPTTHAKSAGQDLRGRAMFSDETLGKVREAKQLAEDAALFASEAVKHSAEVWSQLDRQK 1440
LCPT H K+AGQDLRG+ MFS+ETLGKVREAKQLAEDAALFASEAVKHSAEVWSQL RQK
Sbjct: 1381 LCPTAHPKTAGQDLRGKPMFSEETLGKVREAKQLAEDAALFASEAVKHSAEVWSQLGRQK 1440
Query: 1441 NSELVPDVEAKLASAAIAIAAAAAVAKAAAAAANVASNAACQAKLMADEAVTSSSHDVPC 1500
NSELV DVEAKLASAA AIAAAAAVAKAAAAAANVASNAACQAKLMADEA +SSS ++ C
Sbjct: 1441 NSELVSDVEAKLASAAAAIAAAAAVAKAAAAAANVASNAACQAKLMADEAFSSSSPEISC 1500
Query: 1501 QSNEFSIHGSAVGVGKATPASILRGEDGGNGSSSIIIAAREAARKRVEAASAASKHAENV 1560
QSNEFS+HGSAVG GKATPASILRGEDGGNGSSSIIIAAREAARKRVEAASAASKHAENV
Sbjct: 1501 QSNEFSVHGSAVGAGKATPASILRGEDGGNGSSSIIIAAREAARKRVEAASAASKHAENV 1560
Query: 1561 DAIVKAAELAAAAVSQAGKLVAMGDPLPLGKLVEAGPDGYWKTPQVSSELVMRSDDVNGG 1620
DAIV+AAELAAAAVSQAG LVAMGDPLPLGKLVEAGP+GYWKTPQVSSEL+MR DDVNGG
Sbjct: 1561 DAIVRAAELAAAAVSQAGNLVAMGDPLPLGKLVEAGPEGYWKTPQVSSELIMRPDDVNGG 1620
Query: 1621 RSNSAIKRPRDGSSSKNEIQVSGGAKSPIPGEISMGSVENHSKLVDGITSSVAPREKDLR 1680
SN AIKRPRDG SSKNEIQ S AK IP EISMGSVENH KLVDGITS VAPREK LR
Sbjct: 1621 SSNLAIKRPRDGLSSKNEIQPSVSAKPSIPEEISMGSVENHPKLVDGITSCVAPREKGLR 1680
Query: 1681 GQKDQNASDLTKTIGVVPESEVGERSSQDECEKAKDLRQSSIKEGSHVEVFKDGNGLKAS 1740
GQKDQNASDLTKTIGVVPESEVGER SQDECEKAKDLRQSSIKEGSHVEVFKDGNGLKAS
Sbjct: 1681 GQKDQNASDLTKTIGVVPESEVGERLSQDECEKAKDLRQSSIKEGSHVEVFKDGNGLKAS 1740
Query: 1741 WFTASVLSLKEGKAYVSYTELQPEEGSGQLKEWVALDGQGGMAPRIRISRPMTNLRTEGT 1800
WFTASVLSLKEGKAYVSYTELQPEEGSGQLKEWVALDGQGGMAPRIRISRPMT RTEGT
Sbjct: 1741 WFTASVLSLKEGKAYVSYTELQPEEGSGQLKEWVALDGQGGMAPRIRISRPMTTSRTEGT 1800
Query: 1801 RKRRRAAAGDYICC--DCI----------------------------------------- 1860
RKRRRAAAGDYI D +
Sbjct: 1801 RKRRRAAAGDYIWSVGDKVDAWMQNSWHEGVVVEKNAKDETAYIVRFPARGETSTIKAWN 1860
Query: 1861 -------------------------QIVVPQEKRMKLGSPATEVKRKDKMPTIVEDVESA 1920
+I++P EKRMKLGSPA EVKRKDKMPTIVEDVESA
Sbjct: 1861 LRPSLIWKDGEWFELSGSYANDYSHEIIMPLEKRMKLGSPAAEVKRKDKMPTIVEDVESA 1920
Query: 1921 KTEDPSLLLISANEKVFNIGRNTQTENKSNPLKTSRTGLQKGASRVIIGVPRPGKKRKFM 1980
K D SLL IS NEKVFNIGRNTQTE K+NPLKTSRTGLQKG SRVIIGVPRPGKKRKFM
Sbjct: 1921 KPADSSLLSISVNEKVFNIGRNTQTEKKTNPLKTSRTGLQKGTSRVIIGVPRPGKKRKFM 1980
Query: 1981 EVSKHYDADTRTTEANDSTKLAKYLMPHGSTSKVLKRTSKYDTKEKSANDARPLAVKSGK 2040
EVSKHYD DTRTTEANDSTKLA+YLMP GSTSK LKRTSKY+TKEKS ND +PLAVKSGK
Sbjct: 1981 EVSKHYDVDTRTTEANDSTKLARYLMPQGSTSKGLKRTSKYETKEKSTNDGKPLAVKSGK 2040
Query: 2041 QPSVSDHAVITKDPESQNESTSGKNDQMDVPPFCSTEEAPEGSVLFPPAHAPKKASSFHT 2100
QPSVSDHAVIT+D ESQN ST GKNDQMDVP CSTEEAPEGS+LFPPAHAPKKASSFHT
Sbjct: 2041 QPSVSDHAVITRDSESQNVSTEGKNDQMDVPSLCSTEEAPEGSLLFPPAHAPKKASSFHT 2100
Query: 2101 KPERANKGKLAPAVGKLAKIEEEKVFNGNPTKPNSNVIEPRRSNRRIQPTSRLLEGLQSS 2115
KPERANKG+LAPAVGKLAKIEEEKVFNGN TKPNSNVIEPRRSNRRIQPTSRLLEGLQSS
Sbjct: 2101 KPERANKGRLAPAVGKLAKIEEEKVFNGNTTKPNSNVIEPRRSNRRIQPTSRLLEGLQSS 2136
BLAST of ClCG10G002580 vs. TAIR 10
Match:
AT4G17330.1 (G2484-1 protein )
HSP 1 Score: 854.4 bits (2206), Expect = 2.1e-247
Identity = 769/2244 (34.27%), Postives = 1077/2244 (47.99%), Query Frame = 0
Query: 1 MDYDDNDFQSQNLHLAGEGSAKFPPVLRQYALPKFDFDDTLQGHVRFDSLVEPEVFLGIE 60
MDYDD+DFQ+QNLHLAGE + KFPPVL+ YALPKFDFDDTL H+RFDSL E E FLGIE
Sbjct: 1 MDYDDSDFQNQNLHLAGEANNKFPPVLQPYALPKFDFDDTLNTHLRFDSLGESEAFLGIE 60
Query: 61 NSEDNQWIEDYSRGSSGIGFTSCAAESCSILRRKNVWSEATSSESVEMLLKSVGQEDINL 120
+EDN WIED+SRGSSGI F+S A ESC+I R NVWSEATSSESV MLL SVGQ+++ +
Sbjct: 61 GNEDNNWIEDFSRGSSGIVFSSGATESCAISRHNNVWSEATSSESVAMLLNSVGQDEVIV 120
Query: 121 APAVTGESNAREKLDYLTNPMDPTLKDDGSSFCEMGDLQPTLLSNISLEELHVVNEDIRG 180
+S+ +L C M ++P S E E +
Sbjct: 121 REDTIKKSDTSHELG-----------------CTMETVEP---GQTSHERSLSKEETVNL 180
Query: 181 EQQPQGDD-PTEFQEICTVDRSLGEVDPDVAHELVDMPASEGSSGIDENSKQTCASTINT 240
+ P DD P E + T D + D D P + + ++E ++I T
Sbjct: 181 QPNPSVDDTPGESSVVKTDDGQEQVLVKD------DSPTAVEEASVEEK------NSILT 240
Query: 241 SVSISMEDKGQDDFSASGKHINDLVTCTQEGSGKLSSQKIEQQIKDLSENPVNTYIGNIE 300
S + ++E D G D + E + S ++E D GN++
Sbjct: 241 SNTATVEAIDTTDLGKIGTETTDNLLDQTEEKANVES-RMEDDCSD----------GNVQ 300
Query: 301 QVVN-SHELNKEDQNRLLSPSVPADRLVIESSIATLQSHASMTLKGDCVFHSGSGEVTPE 360
++ S ELN + L P D VI I + + +T + G + +
Sbjct: 301 TIITCSGELNNQS---TLLPETSNDENVISDHIQSSYNRNDLTADARSILVEGHSDSHID 360
Query: 361 VPSETDKFDDKVLCSNVEIGNPSKTNMHEVLPTVVKGDARAVVCAGEGKNINAEVCAFQG 420
SE +K + + IG +K ++ E+ E ++ Q
Sbjct: 361 SASEVEKVEAE------NIGKTAKPDLKEI----------------ELSDVTVLERGDQA 420
Query: 421 PKIDSVGQMACAQEIISVDQQRFPSGIEIQTSKSESSASAMEKSNASKVGESSSGHIRDI 480
P VG Q++ SG E Q + S + S A + +G + I
Sbjct: 421 PSTLEVG----GQDV---------SGTECQ----DLLVSTVHTSVAVEASLELAGELTTI 480
Query: 481 PDKFTKDEHGMIS------LRDVRGCTLPIEKNLYSEGHLPPTTVAESTQLCEENKLCQS 540
+ + ++ ++S + T IE Y + H+ T +ES + + + ++
Sbjct: 481 TNSVSIEKPELLSHQHMEVITSEHESTFQIETETYPQIHVFET--SESVYISTMDSMVEA 540
Query: 541 GNDHVTHASCKEEVRLSSD-----SISVNGKFAESPVRDKRIVSLSFQESD---VESGMI 600
V+ S E +S+ + VN + V++ +I+S S V G
Sbjct: 541 REGGVSKKSDNEGSARTSNLEQSMELPVNANDRDQDVKNSQILSESVVSGSVGYVSGGST 600
Query: 601 DTKVEYSANAVSVST-------FGDANVRTCDTLQGDSLPVVDALTDRKD----ADEKED 660
E + + S+ T N+ L D P V +LT D +D
Sbjct: 601 SELAESESQSDSIPTDKSETMIDSSLNLEELQPLSQDGAPAV-SLTSSIDLHMVKTSSDD 660
Query: 661 QLQPGVVEFTQSDSKEESG-VIIPAEGSFPLSDTSQPVGKFHPLSEAEKSACLLTGQGFG 720
Q E + + E+G + P + S S Q EA K A
Sbjct: 661 SDQGSYSETKKVYGEPENGQTVPPVDASCSGSQMDQ---------EARKRA--------- 720
Query: 721 ESIDQTISKNLNSDDCNRE--SQSIPQADIPNNVIQDCGQEMDIDPAFSKSSAKACDS-G 780
+ T + + C R S+ AD V+Q +E+ + + KA ++
Sbjct: 721 ---EGTKQSTYSVEGCPRSEGSKDAVDADGVGQVLQQQSEELIFEENVVTEAVKAPETLS 780
Query: 781 VKKSDEKSFPPDATSLTPLPGETLDNYQKDQESTKVVSESVGNNCQQAIAVNIDSDA--- 840
V D K+ P +SL L E KD + + S G + DA
Sbjct: 781 VLDKDNKNEMPITSSLPILGSEA----GKDGQEEDNTAASGGIMAAGTPVTHPKGDAIVL 840
Query: 841 GKKEGSLCSADFPQSHEQMSVVGNGNSTADKPPPNLPDVVKTTVVAHDPDVKDCNMRPAS 900
G S CS +S+ ++ + + + P + P VKT+ + + + +P
Sbjct: 841 GDSRASTCSESSVKSY--VTAIEDAATNLKTPLDSFP-TVKTSELQFNNTETNSVKKPED 900
Query: 901 KNVEAAEAKDRLVGNTSSGSQLPKGNVASESETALTFESSSLVDLPKNDSGIAVATAATA 960
+N+ G S+GS + N S SE LT + + K + + A
Sbjct: 901 QNIS---------GFMSAGSPVLNRNETSSSEMNLTPDQ---LKAGKISKAVIFSQATLV 960
Query: 961 SLVAEGPQSSSGLSKLDIKSAQDISHSSPHVSEVKVARARSKGTPERKPRRASAKGLGKE 1020
S + G S+S L K KS SK ERKPRR S K +GKE
Sbjct: 961 SPIVVGSPSTSSLDKTAAKS--------------------SKAKSERKPRRTS-KSVGKE 1020
Query: 1021 SSTKGNHTKKSEKVE------KSNSTPISNPGIFQLAQSNEMQHHGHVESSGAKPFVFIG 1080
+S KG K + +E K+N+ S Q+ QS E Q ++S K F +
Sbjct: 1021 TSRKGTSVKGATPIEQFQSGGKTNAVNQSLASHIQITQSTEKQR--SLQSPALKAFGSLS 1080
Query: 1081 ASTSSIPDLNNSASPSPMFQQPFTDLQQVQLRAQIFVYGALIQGTAPDEAYMLSAFGGPD 1140
T+S+PDLN+SA S + ++PFTDLQQVQLRAQIFVYGALIQGTAPDEAYM+SAFGG D
Sbjct: 1081 TPTASLPDLNSSAL-SSILRRPFTDLQQVQLRAQIFVYGALIQGTAPDEAYMISAFGGAD 1140
Query: 1141 GGTNLWENAWRMCVDRLNGKKSQPTNPETPSQSQSGGRSTEQANKQSTLQSKITSPPVSR 1200
GG WE +WR CV R +KS PETP QS+ G K +P
Sbjct: 1141 GGKGSWEKSWRTCVVR--AQKSLVATPETPLQSRPG---------------KTETPSAGH 1200
Query: 1201 VSSKSTSTVLNPMIPLSSPLWSISTPSNALQSSIVPRSPVIDYQQALTPLHPYQTPPVRN 1260
+SK +S NPMIPLSSPLWS+ST + LQSS V R +Q L+ H +QTPP +N
Sbjct: 1201 TNSKESSGT-NPMIPLSSPLWSLSTSVDTLQSSSVQRGSAATHQPLLSASHAHQTPPTQN 1260
Query: 1261 FIGHNLSWFSQAPFHSTWVAT-QTSTPDSSARFSGLPITEPVHLTPVKESSVSQSSAVKP 1320
+GHN W S PF + W+A+ QTS D +RF PIT+PV LTP+KESS++ S A
Sbjct: 1261 IVGHNTPWMSPLPFRNAWLASQQTSGFDVGSRFPVYPITDPVKLTPMKESSMTLSGA--- 1320
Query: 1321 SGSMVHGGTPGNVFTGASPLLELKKVSVTTGQNSTESKMRRRKKNTVSEDPG---LITIQ 1380
V GT NV + +P LE V Q+ST K R+RKK VS + G L +++
Sbjct: 1321 --KHVQSGTSSNV-SKVTPTLEPTSTVVAPAQHSTRVKSRKRKKMPVSVESGPNILNSLK 1380
Query: 1381 QVQPHLKPVPAVVTTTISTLATSPSVHPKAASENLILSPPPLCPTTHAK----------- 1440
Q + P+ T T + L + P S + P L T K
Sbjct: 1381 QTELAASPL-VPFTPTPANLGYNAGTLPSVVSMTAV--PMDLVSTFPGKKIKSSFPSPIF 1440
Query: 1441 --SAGQDLRGRAMFSDETLGKVREAKQLAEDAALFASEAVKHSAEVWSQLDRQKNSELVP 1500
+ ++++ R++ S++T+ K++EAK AEDA+ A+ AV HS VW Q+++Q ++ L P
Sbjct: 1441 GGNLVREVKQRSVLSEDTIEKLKEAKMHAEDASALATAAVSHSEYVWKQIEQQSHAGLQP 1500
Query: 1501 DVEAKLASAAIAIAAAAAVAKAAAAAANVASNAACQAKLMADEAVTSSSHDVPCQSNEFS 1560
+ + +LASAA+AIAAAAAVAKAAAAAANVA+NAA QAKLMA+EA ++ D + S
Sbjct: 1501 ETQDRLASAAVAIAAAAAVAKAAAAAANVAANAALQAKLMAEEASLPNASDQGLPKSYDS 1560
Query: 1561 IHGSAVGVGKATPASILRGEDGGNGSSSIIIAAREAARKRVEAASAASKHAENVDAIVKA 1620
I G+ TPASIL+GE SSS++IAAREA++KRVEAA+AA+K AEN+D+IVKA
Sbjct: 1561 IL-----PGQGTPASILKGEGAVVNSSSVLIAAREASKKRVEAATAATKRAENMDSIVKA 1620
Query: 1621 AELAAAAVSQAGKLVAMGDPLPLGKLVEAGPDGYWKTPQVSSELVMRSDDVNGGRSNSAI 1680
AELA+ AVSQAG LV+MG P L KLVEAGP YW+ Q S E +
Sbjct: 1621 AELASEAVSQAGILVSMGHPPLLNKLVEAGPSNYWRQAQESQE----------------V 1680
Query: 1681 KRPRDGSSSKNEIQVSGGA-KSPIPGEISMGSVENHSKLVDGITSSVAPREKDLRGQKDQ 1740
+ + K + S G P + G N DG++ V P L+GQ+
Sbjct: 1681 QPCKTVVLEKETVSTSEGTFAGPKIVQTEFGGSVN---TADGVSGPV-PATGKLKGQEGD 1740
Query: 1741 NASDLTKTIGVVPESEVGERSSQDECEKAKDLRQSSIKEGSHVEVFKDGNGLKASWFTAS 1800
+DL K VV E EVG + S D + K + IKEGS+VEVFK+ GL+ +W++A+
Sbjct: 1741 KYADLAKNNDVVFEPEVGSKFSIDAQQTIKATKNEDIKEGSNVEVFKEEPGLRTAWYSAN 1800
Query: 1801 VLSLKEGKAYVSYTELQPEEGSGQLKEWVALDGQGGMAPRIRISRPMTNLRTEGTRKRRR 1860
VLSL++ KAYV +++L E+G+ +LKEWVAL G+G AP+IR +R +T L EGTRKRRR
Sbjct: 1801 VLSLEDDKAYVLFSDLSVEQGTDKLKEWVALKGEGDQAPKIRPARSVTALPYEGTRKRRR 1860
Query: 1861 AAAGDYI----------------------------------------------------- 1920
AA GD+I
Sbjct: 1861 AALGDHIWKIGDRVDSWVHDSWLEGVITEKNKKDENTVTVHFPAEEETLTIKAWNLRPSL 1920
Query: 1921 ---------CCDCIQIV-------VPQEKRMKLGSPATEVKRKDKMPTIVEDVESAKTED 1980
C + + P+EKR +LG+PA + KD IV+D + K
Sbjct: 1921 VWKDGKWIECSSSGETISSSHEGDTPKEKRPRLGTPALVAEVKDTSMKIVDDPDLGKPPQ 1980
Query: 1981 PSLLLISANEKVFNIGRNTQTENKSNPLKTSRTGLQKGASRVIIGVPRPGKKRKFMEVSK 2040
+L + +E FNIG++T+ ENK +PL+ RTGLQK S+VI GVP+PGKKRKFM+VSK
Sbjct: 1981 TGVLNLGVSENTFNIGKSTREENKPDPLRMKRTGLQKQGSKVIFGVPKPGKKRKFMDVSK 2036
Query: 2041 HY--DADTRTTEANDSTKLAKYLMPHGSTSKVLKRTSKYDTKEKSANDARPLAVKSGKQP 2100
HY +A T+T E + K + ++P S K SK + EK +RP K +P
Sbjct: 2041 HYVSEASTKTQERKEPVKPVRSIVPQNSGIGSWKMPSKTISIEKQTTISRPKTFKPAPKP 2036
Query: 2101 SVSDHAVITKDPESQNESTSGKNDQMDVPPFCSTEEAPEGSVLFPPAHAPKKASSFHTKP 2110
A P + +T+ + + D + P V F + SS H
Sbjct: 2101 KEKPGATARIIPRKDSRNTTASDMESDE---SAENRGPGSGVSFKGTVEEQTTSSSHDTG 2036
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
XP_038903704.1 | 0.0e+00 | 89.53 | uncharacterized protein LOC120090225 isoform X1 [Benincasa hispida] | [more] |
XP_038903706.1 | 0.0e+00 | 89.53 | uncharacterized protein LOC120090225 isoform X3 [Benincasa hispida] >XP_03890370... | [more] |
XP_038903705.1 | 0.0e+00 | 88.94 | uncharacterized protein LOC120090225 isoform X2 [Benincasa hispida] | [more] |
KAA0041075.1 | 0.0e+00 | 84.27 | Agenet domain-containing protein [Cucumis melo var. makuwa] | [more] |
TYK12033.1 | 0.0e+00 | 83.25 | Agenet domain-containing protein [Cucumis melo var. makuwa] | [more] |
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A5A7TI44 | 0.0e+00 | 84.27 | Agenet domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27... | [more] |
A0A5D3CJB8 | 0.0e+00 | 83.25 | Agenet domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676... | [more] |
A0A1S3BKL8 | 0.0e+00 | 85.24 | uncharacterized protein LOC103490876 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10... | [more] |
A0A0A0L268 | 0.0e+00 | 84.77 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G019930 PE=4 SV=1 | [more] |
A0A1S3BK14 | 0.0e+00 | 84.39 | uncharacterized protein LOC103490876 isoform X2 OS=Cucumis melo OX=3656 GN=LOC10... | [more] |