Cla97C08G145310 (gene) Watermelon (97103) v2.5

Overview
Name: Cla97C08G145310
Type: gene
Organism: Citrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
Description: RNA polymerase II C-terminal domain phosphatase-like
Location: Cla97Chr08: 1788769 .. 1799223 (-)
RNA-Seq Expression: Cla97C08G145310
Synteny: Cla97C08G145310
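
The location value in the overview can be split into chromosome, start, end and strand, and the span of the gene follows from the two coordinates. The sketch below is illustrative only: the function names `parse_location` and `span_length` are hypothetical, and it assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re

# Pattern for a value such as "Cla97Chr08: 1788769 .. 1799223 (-)".
LOCATION = re.compile(r"(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)")


def parse_location(text):
    """Return (chromosome, start, end, strand) from a location value."""
    match = LOCATION.search(text)
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand


def span_length(start, end):
    """Length of the span, assuming 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end, strand = parse_location("Cla97Chr08: 1788769 .. 1799223 (-)")
print(chrom, strand, span_length(start, end))
```

Under the inclusive-coordinate assumption the gene spans 10455 bp on the minus strand of Cla97Chr08.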
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCGAAATCTTCTCCAAAACTCAATTCCCATTGCTCTTTCAAAACTCAGTTCCCACAAATTGGTCCTTGCCAAGAAAATAGCTGGGAAGACCCGATGGAGACGTCATGTTCACGATTCATGGTTGTTTGTTGAGGAAGAGGAATTTTGTGGATTCTTGATGAAAATGTTCTTCAAAAGTGAGTTTCCTTCTTCCATGTAATTTATTTAACATGCTTAACCTTTACTGCATTATGAATGTATCTATTTTTCTATTCTATGTTGAAGATTCTAAAACAAAATTGAAAGCACAAGGATCACTGTGGAAGATTGGATCCCTTCACAATCAATCATCTATTTGTCTTCCTCCCCATCTTCATAATCTCAAAACTTTAGGGTATTTCTTCCAATAAATTTATATAGGAGTAGTTGAGAAGATGCAAAAAGAAAGAGATAATTTAGAGGAATTGCAAAAATTAGGGATATAATTGATAAAAATGAATTTAATCTTCTTAATTTTAAATTCAATCTTACTTTTAAAAGCATATCAACCACTCATCTTCCACATTTTTTAAAACTACAATTTTAGAGCTATTATGTACTTAAATAAAATTTTTAAATTTTGTTTAAATTACCAAAATAGATTAACATTATAATTTTATTAACTTTTAGATGTTACTTGAGATTTATAAGATTTATAGATGGTATTAAAAAATTGTTTTTGTCCTCTAAAGTCATAATACAAGAAAAAATACTTGGAAGGAAGGATTTATCGAAAGTTTTAAAGTAACAAACAAACATCTTAGTAATAAGATCATCAAGAATATCCATTTTAAAAAAATAATTGAAAAATCAATAATTACATAAATATCAAAATTAGAATTTATTGTTTTTACTTATGAAAATTAAAACAAAACAATATAATATTTATTTGTCACATTGCTAGCATTGCAAGTGTTTATTCACTAGTATAGTATATAGACATGTGGGTAAGTTATATGAAAGGAAAAAATTCACTATCACCTTCAGTTTCACGTATACAGCCGGACGACCCTCCCTCTCTCCCTCGCGAATTCCCTCACCCACAGAGCCGGCGTTTTCATTCTTCCTCAGCCGCCCGACCGCCGCCATGCACCGTTCACCCACCGCCCGCCGCGCCTCCGACTCCGTTCCCGCCGCCGCATCCTGGTGAGAGCACCGTCTACCTCTCGCCTCTTCCGGTGTGTTTCCCCTTCTGCGACGAACGACAGCCGAAAACCTATCATCTCCAGCGTCTTCATCTCTACGACGAATGGTTTGTTCGACGCCTGTGCAACCACCCGACGTAACCCTACGGATTCGCGGCCAGATTTGCGTTCGTTTGAGTTTTCAGCAGCGGATTCCTCTATATCGGCAAGGTTAGGTAAAGACCCACTTCCTTTTCGGATTTAGTTTAGTGTTACCCGTTCGGTAACAGCTTCAAATATGCAGGATTAAGTTTATTTGAACACCAACCAGCAACCCTTTGAGGCTGTTGGCGGCGTCCAGTAGTTCGCAACTGATACGGAGCATTCGAATTAAGGTTTGAGGTTGTCTGGGGAGTTCTTCAAATAGTGTAGGAGTGCTTCTGAACGTTAAGTCCTGGTTCTATTTGTTGTGTGCCAACATGATGATTGTTCTAATTGTGTAATTATTTATAGATTAAATATTTGATGTCTTGAATCTATGATCTGGATGTTAAGCGATGTAGTGTTCTAGTGATATTATAATGACTAGCATGCGGAGGTGTTGTTATGCGATCTGTGTTTATGAGCATGATTATTTGTTGAAGCTATTTATTTTATATTTTTCTTTCCATTTCTAAATGTTACGTAAGACTACATGCATAATATTTGTTGTATGATCTTTTCCCTTCTTAATTTGGCTCTATTGTTTTTAGTGTTGGTATTGCATATTAATGCTACAATCTACTAAGGTCTTTATTTTCCCCTTTCCATTTGGTGGATATTTCAGATGAGCCTTGCAACTAATTCTCCAGCTCAC
TCATCAAGCAGTGACGATTTTGCTGCGTTTCTTGATGTAGCTCTAGATTCTCATTCCTCTGACTCATCACCCTACGAAAAGGCCGAGCATGACAATAATGCTGAAAGTGAGAGGTACGTTTAATTTTTAAAAGTAATGAACTTCCGTAGTTCTTTGAAGTTCATAATGCCTACAGACAGTGGTTTAGGGTTTTAATTTTAGGTTAATCGATATAAGATTCATTTGTGATTTCTGTGTGTGTAATTTGTATTAATTTTTCCTTTGAATGAATAACAGCATTGAGTTGTGGCAAGTGGTGACTATCATTTCCTTTATCCTCGTTTTATCAATTTAACATGTCAAATATTTTTTTATCCAATTTTTAGGATAAAGCGTCGTAAGGTGGAGAAACTGGAGAACTCAGAGGAGGATATTCTATATGGAGCTGAAGAGAAAAGTTTAGGTAAGTTAAACTGTTTGTACCAACCCCTTTTGTTCACCTTTTACTCTTGCTCTACCCCTTACTTTCAATGTAGCCCATTTGTGGCTGCCACCTGTATTATACAGTCTATACTTAACTACTTCCTTACTTCTTAGACAGTTACGATATGCTTGCTGTTCCTTTTTTTCCAGCCACTCTGCATATTCATAGAGCTTTTCACTTGCTATCTGTTACATATTGTGTTTTTTGTGGTTGCCTCCTCTTCCTTTCTTATATGCGAATCTGATTGGAGGTTTGTCACCCTTCTTTATCCACCTTTTATTTAAATTCTATGAGCTTCTTTAAGAACCATATCAAGTTGGCCCCAAAATGATCTATATTTAAACACCTCCCTATTAGTGTTGTTGAGTAAGAAAGTATTCCCTTGAAACTATGCAATCACAAAAGGAGGGGAAAGATAGTCACATTTCTGTATAACCAAAAGTGAAGTAACATATTCTCTGGCTTTATGTTCAAATTGATCTGAATACTCGTGGATAAAAAATAAAGAACATAGGATGGAAGTTGTTCTTATATATAATGCTTGTGTACGTTCTTTGATCTGAACGAAACTAGTCATTTTTAGTTCCTTCGAAAACTAAGAATTACAGGGATGTTAGATGGATATACACCTGAAATTTTGTGGTATCCGAATTCATGAACTATTTCAGTCCAGTTGTCCTCTGCTCACCATATATCCTTTGTTTTGGTTCCTTTCTTCCCCTACCAACGTAACCAATTCAGTCCCGTGTAATTGTAACTTATTAACATAGAAATATAAAAGCCTTCATGTTGAAGCAAGTATTTGACTCCTTGTATCTTTGCGTTGGTTTTTCTCTAATTGCTGAATTCTTCTATTTTGAAAATAATAATCAATTGCAGAAGTATTATCAAAGCAACAATTATGCAGTCATCCTGGTTCATTTGGAAATATGTGTATCATATGTGGGCAGAGGTTGGATGAGGAATCTGGCGTGACATTCGGGTATATACATCGGGTATGTTTTTCTTTATATGAATATCCAATTATCTTCTATTCTTTGTTACCAGGTATAGAATGTTTGGGTGGGGTGGGGGTGGGTTAGAAGGAACTAAGGAGGTCTGGGAGTTTACAGAATGGGCACCATTTAGAAGAGGGATGAGAAATGGTGTCAAGCCTTTTATTTTCTTCTTGGATTTACAAAATGGGCACCATCTACCTCTACTGCCTCATTTGCTATAGCCTATGAGGATGCATTAAGTTTTTGTCTATCTTTTATGAAAGGCTTGTTGATTTTCTGAACTCGTGCTTTTTTGGATCTTGTTTTAAAACCTTGGCAAAGATTTCATGTATTCCAGTTCAGTTACTAAGCCGATGGGTCATAGATGTGATGTTTACTGTTTAGTGCTGGACATTTTTGGAATCAAGCAAATAGTCTTCCTTTCATACATTAATGGGGGTCACATTCCCAAAATTCAACACTGCTTCTTCTCTCTCTCTCTCTCTCTCTCTAAAAATGGAAGAGGATAGAGGGGATAGGGATGATCTTTGTCAATAGGA
AGTCTTTTACAGTTGGAAGAAGTGGAAATGGAAGAAGATTAGTTTTAAAGGAGCATAGGGGACTGAAAGTCAGGAAAGTCGAGTTGGAGATAGGTATTGTGGTATTGGTTAGAGACTGATTGCCTTTGGTGAAGGATTCAAACAATCCTTTGGGTTTTTAGCTGAGAAGTAGATTAGAGGCAGCGATTATTTTATTCCATATCCTGTCAAACAAAAGAGGTCGCTTTGCATATTATCCTTTGAAACCTTTAAAGGTAGAAAATCATTTTTATCTCAAAAGGTCAAAGAAAGAAAGGTTGGAATGCTTTAGTTGATGAAATTTCTGGGTTTTTACACCTATTTGATGTAGTTTTTGTTTTTTGGAATCTGTGTTTTGGCTTTTCTTTAAGTTTTACTTAGGTTCTGTTTGGTTCTTTAAAAGATCTTTTATGTTCAGTTCTTTTCTAACTGTGTGGTAATTTTTGTTGGTTTAGTTGTTGTGGTTTCGTATTTAAGCTAAAACTTCAGCTCTTTAATAAAGCATTGAGGTATTCAGAAGTTAAAAACTTTCCCAACTTGTGGCACCACCTTGGCAGCAGGTATGGTGGGAAGGGTGAGTCCTATGGGTACAGAGGGTGGTCCCTAATCCTCTCTGTTTTTGAATCCTGCCCCTTTTCTGGTTGTGCCTTACCTTACACACCGGTTTAGTCCCTAGGAATTCTTTGCTGAATGCTGTTGGTAATGGAGAGGAGGGGGCTTCTTTTGTGTAGGTTAGTGGATTAGAGAGCCTTATTCCTTTTCTAACCCCTTGAGTTTTCCTCCTGCTAATAAGGGAGAATTGATGGGAAATTGGGCTTCCAGTCAGTCTGCTCTTGCCTTGGTAGGTATTCAGGCGATTTTAGCTTTGGTTCCTTCAATGGTCTTGGGATCCTAAAGAAAGGGTGTTTCTGGGAAAAAGAATGATGGGGTGGGAGGAAGGAAACTTGAAAGAGAGATCGTGGAATAATCCCATTTAGTTTGCCATGTTTATGTTTTTTTTTTTTAATATATATTCATTCACTGTTGTCAAACTATATACTAGCTATGGATTTTGACTTACCATGATGGGAGAAGTATCTTCATTTTTATATGCCTCTTTGATCAATATCTATAGTTTCTTTAAAAAAAAGGGAAATAGTGAAATAATCCCATGTAGTTTGAGATGTTTTGTTAATATAATATATATGTTTCCTTCAGTTATATTTTTTCACTGTTTACAAAATGGACACTGGCTATGAATTTTAGCTTAGCATGGTTGGAGAAGTACCAATTACTTGTAAGTTGCTAGTATATCAAACTATGTGCTAACTTATGACCCATCTGATAATCTTGTAGGGACTCAGACTTAATAATGATGAAATTAACCGGCTACGTAACATAGACATGAAGAACTTGTTGCAACATAAAAAGCTTATCCTGGTTCTTGATCTGGATCACACACTTTTAAACTCAACTCAGCTGGGGCATTTGACACCTGAAGAGGAATATTTAAGGAGTCAAACAGATTCTCTAGAAGGTACGTTGTCCTCTCATCTGTACATAGTTCTACTTTCTAAAGTAACTTTGATATGGTGTTTACTGTTCTTTGATATGGTGTTTACTGTTATGCCATTCCCATCGTGCTTTTGCTTTCACCATTGCTCTTTTGTGTTTGCTGTCTTCTCGTTTCTTTCTTTCTTTTATATGTATAATATGAAACAAATAAGTATATTCCCCAAACCCTAAGCTTTTGTATTTGCTGTTCTAAATTATTTGCACCTTGAGCAGCTACGGCAAACAATCAGATTTTATCTTAAATTAAATACTTCAAATTTTGGTGGAGCCGAAAAGCCTTATGGCGGTATTTTATTGATTAAAAAAGTGTTACAAAAAGGGGGCCTTAGCCTTTATATATAGGTTTGGCTCATGGGTTAACGGGCTTTTAAGTACAAGGGTTGTCAATCATATTAACCACAATAACTAACTAACAACCTAATAAGAAG
AGAAATAAAATAAAATACTTAAGCTAGATTACAAAGAGAGCCCTAGTAAAGCTAAATTACAAAGAGAACCCTAAATATTCTAACTAATCTACATCAATTCCCCCTTCATAAAAAGGAATTCATCCTCGAGTTAAACATGAAGTTCATCTAGAGGGAAATAGGTAGATAAATCTGAAATACTGAAGATGTTAGATATGTTGTAGCCTTCTGGAAGCTCCACCACATAAGCATTGCTGCCAATTTTTTTAAGGATTTGACACAGTCCAATTTTCCTCTTTTGCAACTCCTCTTGGAATCTCTCTTTCTTAAGGTATACAAGCACTAAATCACCTTCAGCAAACTCCTCATGCCGATTGCCGCCTTTTAGTGTTAGCAAATTGGGCGTGTTTATTATTAGCGTCTTGGAGATGTCTATGACTTCCTTATGTATTATATGAACCCAATCAGCCATTAAGTCAGCTTCTGAACAAATCTACAACACTAGGTAAACTTATAAGATCAAGAGTCAATTGAGGAGGGAGGAGGGTGAGTATATATGTTAGGCCTCTGGTTGTGTATCAATATCAAAGGAGGGGTAGAAAGGGATAGTTGAGTTGTAAAAGGGAGGGAAAGTCAATTAGGATTTTGGGAGGAAACTTCTTGAGAGTGAGTGGTCTAGGAAGAGGTCGATCGCTGTGATCGATCTTCTAGTCTGAGGACAATATCATTGTATGTTTGTCTTCCTTTGAATAATAAGAGGTGATACCTTTCAATATATTACCTCAAAGGGTGTTTTACTCGTAGATTGATTTTTCATATTGTTAAAAGCAAATTTAGATATTGACAAAGAAAGATCCCACTATTTTGGCCTATCTCCACTTGAAACACCAGATTAAATTACTCAAAGACCTTTTGTAACTATTCTATAGGCACTATTTTGCGTAGTTGGGCTCCCTTTTTTAGTGGGTGTTCCCTTTTTTGTGGGCTTGGCTTTTTGTATGCCGGAGTGTTTTTTCATTTTTTCTCAATGAAAGTTGTTTTCACTAGAAAAAACAGCTTCTTCTACTCTTATCATTCTTTCTGTTAAACCTTCCAAAGGAAGGATGCGGTGTTGGATATCGAGCACAATTGCATAAAGCTTGCCGGAATAGGTATCTTCCTTGCTCGTTTCCGAATTCTTGGAGTTTGTCATCCTTCTTGGCAGAATCTTTTTATTTGCTAAACTAGTAAAAGGGAGAAATTTAGATTTTGCCATATCTTTTCACTATATGAATGGAAGTATCTCTTTTCTTTTAGAAAAATGAAGGTGCTGTGAATTCAAACTTGGGTTGGTTTATTGTATTTTTTTGCTGGGGAGTAATTTTTGCGAGACCCACAATGTAAGGAGTTAAAAAGGCATGAAATTGATGTTTTTTAATTGTGGGGCCTACAAACTCCTTGGGTCATACAAGGAATTGAGTTGTTCACTCCTCCAACTCCAACTCCAACTCCTTGACCAAACACCCTTTAATCTGAAAGAGTTTTGAGTTTCAATGGCCTGTTTTGGGATGAGGGTCCTAACACAGTTTTCTTATATAAATACTGGAATATGCATCTCAGGCCTTTTTACTTTCAATTGATTATTTATTTCCTCGCTTTCTTGGACCATGAAGCAAATTGCACCTGTCTTTGGGTCTTAACTGTTCAGTAGTGTTTAATTTAATTTTATGCTTGCTTTCTTATATGGTGTCAAGTAACTGTTCTTAAAGATTTTGGCTTAATTATTTAAGCTTCCTACAAATTCCTGATGTTTGTTTTTTTGATGCTGTGGTAAATTTTCTGAATAGATGTCACGAAAGGTAGCCTTTTCCTGTTGAACTCCGTTCATACAATGACAAAGTTGAGGCCATCTGTCCATACGTTTTTGAAAGAAGCTAGTCAATTATTCGAGATGTATATATACACTATGGGGGAACGAGCATATGCTTTCGAAATGGCAAAGTTGTTGGACCCCAAGAGAGAGTATTTTAGTGCAAAAGTTA
TTTCTCGAGATGATGGCACTCAAAAGCATCAAAAAGGTCTTGATGTGGTGCTGGGTCAGGAAAGTGCTGTTCTGATACTCGATGATACTGAAAATGTATGTGCATCTAGAATTAGCCGATACTTTTATACACTTGATTTATTGTTGCTGCCTAGTTATCATAATTAAAATGAGTGTGTAACCGCGAGAATCATGCATTGAAAGAGGCTGATAAATTGTGTAATGCACTCAGTGGGAAATTTTTTTCTTTCACATCTGAGAGGAAGGTCTAAATAGATCTGATGAATAAGAAGATTATGTCTGATGCAAAGAAAGGATTGTAATTCTTAAATTTAATACAGCAGCATGTTGGCTCCCTAACTCCAAAAGTTATTTTATTATCCATTTCGCTTATTATTTATATTAGGTGGAGAGGCTTTCTGTAATCTCTAGTTTATCTGTCATTCTTTTTGTGTTTTATTTTGTTTTTTTTCCCTTGAGGAGTCTCCCTCCGTTACTCTTGTGGATTATTTTAGTGCAACGAGGTTATCTGATTAAACAATAGAATCTTGTAATGGGATTGGCTCTGAATGTTACTCAGTGTCTTGGTTCCTCAAGAAAAACCTATCAGAAGCTTGGTTTTCTATCTTTTTGATGCAGTATTAGATGTTGAATTGGGATACTGTTGTCCGAGTCTTTGAACACTAATTCTGTCATTTTTGGGTTGCTGAACTACCTATTCTTCATACATGTGGAGCTAAACATTATCGATTCTTATGATAGGCATGGACAAAACATAAAGAAAACTTGATATTGATGGAGAGATATCACTTTTTTGCCTCAAGTTGTCACCAATTTGGCTTCAACTGTAAATCTCTATCTGAGTTGAAGAGTGACGAGAGTGAAACTGATGGGGCACTGGCGACCATCCTGAAAGTTCTGAAGCAAGTCCATAGTATATTCTTTAACGTATTTCCCTTCCTCTCTGTTACATTTTTGTTTCCAATCTTTTTTACCTCAATCTCACTGTTAGAATTTCTTCTTCAGGAACTCTCGGATGATTTGGTTGACAGAGATGTGAGGCAGGTAAAAGTTTGTTCCCTCTTGGAACTTCCATGATCTAAAATGCTACTTGTGATGACTATATTTCTGGTTGATTTCTACATTTTAGTCTTTGACTGATCACATTCAAATTTTCCGCTCCAGACATTTCATTATGCAACTGTTCACATATAAACATCTGCTAAATTACAAGTTTTTTAATATTTGAACTAACAGGTATCGTGTCTTAACAAGTCCCTGAGTCCAAAAAAAAAAAAAAAAAAAATCTTATAAGTTTCGAAGATTTCAATTTTGACCAATAGTTTGAGTTCCATGTCTACTAGGTTCTTGAACTTTAATGACTCTCTACACTTTCAATTTTATATCTAGTAATTTCTTTAAACTTTTGTCTAACAAAACATTGATCTAGTCGACTTTTTAAAAATTTGTAGATCTATTTGACTAATTGAAAGTTTCAGCTCTTATCAAACAAAAGTTCATTTATGTCTATTAGATGAATTGATTTAAGAAATTTTCTGAATATATCAAAGTACTAATAGACATAAAATAAAATTGAAAATTTACTTGAGATGGAAATTCAATGTCATGATGGTATTGGTATTACATCATGGACTTATATTTTAATTGTCTTCAACTAAGCAGGTATTGAAGACAGTTCGTAGTAAAGTTCTTGAGGGATGCAAAGTCGTCTTCAGCCGGGTCTTCCCTACCAAATTTCAGGCTGACAACCATCATCTCTGGAAGATGGTAGAGCAGTTAGGGGGCACTTGCTCAACTGAACTCGAACGATCCGTGACACACGTGGTCTCAACAGATGCTGGAACGGAGAAGTCACGTTGGGCTTTGAAAGAGGAGAAGTTTCTGGTCCATCCACGGTGGATAGAGGCATCAAACTACTTCTGGAAACGGCAAGCGGAAGAAAACTTTCCTGTTGAACAAACCAAGAAACAATAACACAGT
TTCTCTCTTTACAGTAGTTCCACATTCCTCTTAAATAGAGCTTGCATTTGTTGGAGGGTTTACCTGCTGTGCCGTTGGTGTTTCCCCATATATATTACATTGCTTATGGACTCTCACGTTCTTGTAGGACAGCATCTCATTTTTGAAGTGTACCCCAAAAATCAGCTTCAAGGGTTGATTGAAAGTCAAGCTCTTTGGTTTCTGTAGATGTGTAGATGCATTGTGTTGTAATTTTGGGTGACCATTTTGAGTTGGGTTATAATCATAGGGGTGTTATGTTTGGCTTGTGATTCCTTTTTTAAAAAAATTCAAGCCCTAACAACTGTGGTTCTGTATTTGTAATTCCTCTAATGTATTTGTCTCAGACCAAAGATAGCATTAATTGAAAAGTAAGTTTCATGAAGGAATCTAGAAGAATTACATTATGGAAATAAAAGGTTTAGATTTTTATTTGT

mRNA sequence

ATGCGAAATCTTCTCCAAAACTCAATTCCCATTGCTCTTTCAAAACTCAGTTCCCACAAATTGGTCCTTGCCAAGAAAATAGCTGGGAAGACCCGATGGAGACGTCATGTTCACGATTCATGGTTGTTTGTTGAGGAAGAGGAATTTTGTGGATTCTTGATGAAAATGTTCTTCAAAAGTGAGTTTCCTTCTTCCATCCGGACGACCCTCCCTCTCTCCCTCGCGAATTCCCTCACCCACAGAGCCGGCGTTTTCATTCTTCCTCAGCCGCCCGACCGCCGCCATGCACCGTTCACCCACCGCCCGCCGCGCCTCCGACTCCGTTCCCGCCGCCGCATCCTGGTGAGAGCACCGTCTACCTCTCGCCTCTTCCGGTGTGTTTCCCCTTCTGCGACGAACGACAGCCGAAAACCTATCATCTCCAGCGTCTTCATCTCTACGACGAATGGTTTGTTCGACGCCTGTGCAACCACCCGACGTAACCCTACGGATTCGCGGCCAGATTTGCGTTCGTTTGAGTTTTCAGCAGCGGATTCCTCTATATCGGCAAGCAACCCTTTGAGGCTGTTGGCGGCGTCCAGTAGTTCGCAACTGATACGGAGCATTCGAATTAAGATGAGCCTTGCAACTAATTCTCCAGCTCACTCATCAAGCAGTGACGATTTTGCTGCGTTTCTTGATGTAGCTCTAGATTCTCATTCCTCTGACTCATCACCCTACGAAAAGGCCGAGCATGACAATAATGCTGAAAGTGAGAGGATAAAGCGTCGTAAGGTGGAGAAACTGGAGAACTCAGAGGAGGATATTCTATATGGAGCTGAAGAGAAAAGTTTAGAAGTATTATCAAAGCAACAATTATGCAGTCATCCTGGTTCATTTGGAAATATGTGTATCATATGTGGGCAGAGGTTGGATGAGGAATCTGGCGTGACATTCGGGTATATACATCGGGGACTCAGACTTAATAATGATGAAATTAACCGGCTACGTAACATAGACATGAAGAACTTGTTGCAACATAAAAAGCTTATCCTGGTTCTTGATCTGGATCACACACTTTTAAACTCAACTCAGCTGGGGCATTTGACACCTGAAGAGGAATATTTAAGGAGTCAAACAGATTCTCTAGAAGATGTCACGAAAGGTAGCCTTTTCCTGTTGAACTCCGTTCATACAATGACAAAGTTGAGGCCATCTGTCCATACGTTTTTGAAAGAAGCTAGTCAATTATTCGAGATGTATATATACACTATGGGGGAACGAGCATATGCTTTCGAAATGGCAAAGTTGTTGGACCCCAAGAGAGAGTATTTTAGTGCAAAAGTTATTTCTCGAGATGATGGCACTCAAAAGCATCAAAAAGGTCTTGATGTGGTGCTGGGTCAGGAAAGTGCTGTTCTGATACTCGATGATACTGAAAATGCATGGACAAAACATAAAGAAAACTTGATATTGATGGAGAGATATCACTTTTTTGCCTCAAGTTGTCACCAATTTGGCTTCAACTGTAAATCTCTATCTGAGTTGAAGAGTGACGAGAGTGAAACTGATGGGGCACTGGCGACCATCCTGAAAGTTCTGAAGCAAGTCCATAGTATATTCTTTAACGTATTTCCCTTCCTCTCTGTTACATTTTTGTTTCCAATCTTTTTTACCTCAATCTCACTGTTAGAATTTCTTCTTCAGGAACTCTCGGATGATTTGGTTGACAGAGATGTGAGGCAGGTATTGAAGACAGTTCGTAGTAAAGTTCTTGAGGGATGCAAAGTCGTCTTCAGCCGGGTCTTCCCTACCAAATTTCAGGCTGACAACCATCATCTCTGGAAGATGGTAGAGCAGTTAGGGGGCACTTGCTCAACTGAACTCGAACGATCCGTGACACACGTGGTCTCAACAGATGCTGGAACGGAGAAGTCACGTTGGGCTTTGAAAGAGGAGAAGTTTCTGGTCCATCCACGGTGGATAGAGGCATCAAACTACTTCTGGAAACGGCAAGCGGA
AGAAAACTTTCCTGTTGAACAAACCAAGAAACAATAACACAGTTTCTCTCTTTACAGTAGTTCCACATTCCTCTTAAATAGAGCTTGCATTTGTTGGAGGGTTTACCTGCTGTGCCGTTGGTGTTTCCCCATATATATTACATTGCTTATGGACTCTCACGTTCTTGTAGGACAGCATCTCATTTTTGAAGTGTACCCCAAAAATCAGCTTCAAGGGTTGATTGAAAGTCAAGCTCTTTGGTTTCTGTAGATGTGTAGATGCATTGTGTTGTAATTTTGGGTGACCATTTTGAGTTGGGTTATAATCATAGGGGTGTTATGTTTGGCTTGTGATTCCTTTTTTAAAAAAATTCAAGCCCTAACAACTGTGGTTCTGTATTTGTAATTCCTCTAATGTATTTGTCTCAGACCAAAGATAGCATTAATTGAAAAGTAAGTTTCATGAAGGAATCTAGAAGAATTACATTATGGAAATAAAAGGTTTAGATTTTTATTTGT

Coding sequence (CDS)

ATGCGAAATCTTCTCCAAAACTCAATTCCCATTGCTCTTTCAAAACTCAGTTCCCACAAATTGGTCCTTGCCAAGAAAATAGCTGGGAAGACCCGATGGAGACGTCATGTTCACGATTCATGGTTGTTTGTTGAGGAAGAGGAATTTTGTGGATTCTTGATGAAAATGTTCTTCAAAAGTGAGTTTCCTTCTTCCATCCGGACGACCCTCCCTCTCTCCCTCGCGAATTCCCTCACCCACAGAGCCGGCGTTTTCATTCTTCCTCAGCCGCCCGACCGCCGCCATGCACCGTTCACCCACCGCCCGCCGCGCCTCCGACTCCGTTCCCGCCGCCGCATCCTGGTGAGAGCACCGTCTACCTCTCGCCTCTTCCGGTGTGTTTCCCCTTCTGCGACGAACGACAGCCGAAAACCTATCATCTCCAGCGTCTTCATCTCTACGACGAATGGTTTGTTCGACGCCTGTGCAACCACCCGACGTAACCCTACGGATTCGCGGCCAGATTTGCGTTCGTTTGAGTTTTCAGCAGCGGATTCCTCTATATCGGCAAGCAACCCTTTGAGGCTGTTGGCGGCGTCCAGTAGTTCGCAACTGATACGGAGCATTCGAATTAAGATGAGCCTTGCAACTAATTCTCCAGCTCACTCATCAAGCAGTGACGATTTTGCTGCGTTTCTTGATGTAGCTCTAGATTCTCATTCCTCTGACTCATCACCCTACGAAAAGGCCGAGCATGACAATAATGCTGAAAGTGAGAGGATAAAGCGTCGTAAGGTGGAGAAACTGGAGAACTCAGAGGAGGATATTCTATATGGAGCTGAAGAGAAAAGTTTAGAAGTATTATCAAAGCAACAATTATGCAGTCATCCTGGTTCATTTGGAAATATGTGTATCATATGTGGGCAGAGGTTGGATGAGGAATCTGGCGTGACATTCGGGTATATACATCGGGGACTCAGACTTAATAATGATGAAATTAACCGGCTACGTAACATAGACATGAAGAACTTGTTGCAACATAAAAAGCTTATCCTGGTTCTTGATCTGGATCACACACTTTTAAACTCAACTCAGCTGGGGCATTTGACACCTGAAGAGGAATATTTAAGGAGTCAAACAGATTCTCTAGAAGATGTCACGAAAGGTAGCCTTTTCCTGTTGAACTCCGTTCATACAATGACAAAGTTGAGGCCATCTGTCCATACGTTTTTGAAAGAAGCTAGTCAATTATTCGAGATGTATATATACACTATGGGGGAACGAGCATATGCTTTCGAAATGGCAAAGTTGTTGGACCCCAAGAGAGAGTATTTTAGTGCAAAAGTTATTTCTCGAGATGATGGCACTCAAAAGCATCAAAAAGGTCTTGATGTGGTGCTGGGTCAGGAAAGTGCTGTTCTGATACTCGATGATACTGAAAATGCATGGACAAAACATAAAGAAAACTTGATATTGATGGAGAGATATCACTTTTTTGCCTCAAGTTGTCACCAATTTGGCTTCAACTGTAAATCTCTATCTGAGTTGAAGAGTGACGAGAGTGAAACTGATGGGGCACTGGCGACCATCCTGAAAGTTCTGAAGCAAGTCCATAGTATATTCTTTAACGTATTTCCCTTCCTCTCTGTTACATTTTTGTTTCCAATCTTTTTTACCTCAATCTCACTGTTAGAATTTCTTCTTCAGGAACTCTCGGATGATTTGGTTGACAGAGATGTGAGGCAGGTATTGAAGACAGTTCGTAGTAAAGTTCTTGAGGGATGCAAAGTCGTCTTCAGCCGGGTCTTCCCTACCAAATTTCAGGCTGACAACCATCATCTCTGGAAGATGGTAGAGCAGTTAGGGGGCACTTGCTCAACTGAACTCGAACGATCCGTGACACACGTGGTCTCAACAGATGCTGGAACGGAGAAGTCACGTTGGGCTTTGAAAGAGGAGAAGTTTCTGGTCCATCCACGGTGGATAGAGGCATCAAACTACTTCTGGAAACGGCAAGCGGA
AGAAAACTTTCCTGTTGAACAAACCAAGAAACAATAA

Protein sequence

MRNLLQNSIPIALSKLSSHKLVLAKKIAGKTRWRRHVHDSWLFVEEEEFCGFLMKMFFKSEFPSSIRTTLPLSLANSLTHRAGVFILPQPPDRRHAPFTHRPPRLRLRSRRRILVRAPSTSRLFRCVSPSATNDSRKPIISSVFISTTNGLFDACATTRRNPTDSRPDLRSFEFSAADSSISASNPLRLLAASSSSQLIRSIRIKMSLATNSPAHSSSSDDFAAFLDVALDSHSSDSSPYEKAEHDNNAESERIKRRKVEKLENSEEDILYGAEEKSLEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHRGLRLNNDEINRLRNIDMKNLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQTDSLEDVTKGSLFLLNSVHTMTKLRPSVHTFLKEASQLFEMYIYTMGERAYAFEMAKLLDPKREYFSAKVISRDDGTQKHQKGLDVVLGQESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGFNCKSLSELKSDESETDGALATILKVLKQVHSIFFNVFPFLSVTFLFPIFFTSISLLEFLLQELSDDLVDRDVRQVLKTVRSKVLEGCKVVFSRVFPTKFQADNHHLWKMVEQLGGTCSTELERSVTHVVSTDAGTEKSRWALKEEKFLVHPRWIEASNYFWKRQAEENFPVEQTKKQ
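
The relationship between the coding sequence and the protein sequence above can be checked with a short translation routine. This is a minimal sketch, assuming the standard genetic code (NCBI translation table 1); the name `translate` is hypothetical and the routine is not the method used to produce this page.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # "X" for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons of the CDS listed above.
print(translate("ATGCGAAATCTTCTCCAAAAC"))
```

The first seven codons of the CDS give `MRNLLQN`, and the last six codons (`CAAACCAAGAAACAATAA`) give `QTKKQ` followed by the stop codon, matching the start and end of the protein sequence.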
Homology
BLAST of Cla97C08G145310 vs. NCBI nr
Match: XP_038890381.1 (RNA polymerase II C-terminal domain phosphatase-like 4 isoform X1 [Benincasa hispida] >XP_038890382.1 RNA polymerase II C-terminal domain phosphatase-like 4 isoform X1 [Benincasa hispida])

HSP 1 Score: 827.8 bits (2137), Expect = 6.9e-236
Identity = 423/473 (89.43%), Positives = 438/473 (92.60%), Query Frame = 0

Query: 206 MSLATNSPAHSSSSDDFAAFLDVALDSHSSDSSPYEKAEHDNNAESERIKRRKVEKLENS 265
           MSLATNSPAHSSSSDDFAAFLDVALDSHSSDSSPYEKAE DNNAESERIKRRKVEKLENS
Sbjct: 1   MSLATNSPAHSSSSDDFAAFLDVALDSHSSDSSPYEKAEGDNNAESERIKRRKVEKLENS 60

Query: 266 EEDILYGAEEKSLEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHRGLRLNNDE 325
           EEDILYG EE+S E +SKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIH+GLRLNNDE
Sbjct: 61  EEDILYGVEEQSSEAISKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHKGLRLNNDE 120

Query: 326 INRLRNIDMKNLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQTDSLEDVTKGSLF 385
           INRLRNIDMK+LL HKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQTDSL+DVTKGSLF
Sbjct: 121 INRLRNIDMKSLLLHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQTDSLDDVTKGSLF 180

Query: 386 LLNSVHTMTKLRPSVHTFLKEASQLFEMYIYTMGERAYAFEMAKLLDPKREYFSAKVISR 445
           LLNSVHTMTKLRP VH+FLKEA+QLFEMYIYTMGERAYAFEMAKLLDPK+EYF+ KVISR
Sbjct: 181 LLNSVHTMTKLRPFVHSFLKEANQLFEMYIYTMGERAYAFEMAKLLDPKKEYFNGKVISR 240

Query: 446 DDGTQKHQKGLDVVLGQESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGFNCKS 505
           DDGTQKHQKGLDVVLGQESAVLILDDTENAW KHK+NLILMERYHFFASSCHQFGFNCKS
Sbjct: 241 DDGTQKHQKGLDVVLGQESAVLILDDTENAWPKHKKNLILMERYHFFASSCHQFGFNCKS 300

Query: 506 LSELKSDESETDGALATILKVLKQVHSIFFNVFPFLSVTFLFPIFFTSISLLEFLLQELS 565
           LSELKSDESETDGALATILKVLKQVHS+FFN                          ELS
Sbjct: 301 LSELKSDESETDGALATILKVLKQVHSVFFN--------------------------ELS 360

Query: 566 DDLVDRDVRQVLKTVRSKVLEGCKVVFSRVFPTKFQADNHHLWKMVEQLGGTCSTELERS 625
           DDLVDRDVRQ+LKTVRSKVLEGCKVVFSRVFPTKFQADNHHLWKMVEQLGGTCSTEL++S
Sbjct: 361 DDLVDRDVRQILKTVRSKVLEGCKVVFSRVFPTKFQADNHHLWKMVEQLGGTCSTELDQS 420

Query: 626 VTHVVSTDAGTEKSRWALKEEKFLVHPRWIEASNYFWKRQAEENFPVEQTKKQ 679
           VTHVVS DAGTEKSRWALKEEKFLVHPRWIEASNYFWKRQ+EENFPVEQTKKQ
Sbjct: 421 VTHVVSMDAGTEKSRWALKEEKFLVHPRWIEASNYFWKRQSEENFPVEQTKKQ 447
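
The percentages in each HSP header are the identical (or positive-scoring) positions divided by the alignment length, gap columns included. The sketch below recomputes them from the header line of the first hit; `parse_hsp_stats` is a hypothetical helper written for this illustration and is not part of any BLAST tool.

```python
import re

# Matches the statistics line of an HSP header. The optional "i" also
# accepts the "Postives" spelling that appears in some exports.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), "
    r"Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)


def parse_hsp_stats(line):
    """Return counts and recomputed percentages from an HSP header line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    identical, aligned, positives, aligned_pos = map(int, match.groups())
    return {
        "identical": identical,
        "positives": positives,
        "aligned": aligned,
        "identity_pct": round(100 * identical / aligned, 2),
        "positives_pct": round(100 * positives / aligned_pos, 2),
    }


stats = parse_hsp_stats(
    "Identity = 423/473 (89.43%), Positives = 438/473 (92.60%), Query Frame = 0"
)
print(stats["identity_pct"], stats["positives_pct"])
```

For the Benincasa hispida hit this reproduces the reported values: 423 of 473 aligned positions are identical (89.43%) and 438 of 473 are positive-scoring (92.60%).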

BLAST of Cla97C08G145310 vs. NCBI nr
Match: XP_022925487.1 (RNA polymerase II C-terminal domain phosphatase-like 4 isoform X1 [Cucurbita moschata] >XP_022925488.1 RNA polymerase II C-terminal domain phosphatase-like 4 isoform X1 [Cucurbita moschata])

HSP 1 Score: 812.0 bits (2096), Expect = 3.9e-231
Identity = 419/473 (88.58%), Positives = 433/473 (91.54%), Query Frame = 0

Query: 206 MSLATNSPAHSSSSDDFAAFLDVALDSHSSDSSPYEKAEHDNNAESERIKRRKVEKLENS 265
           MSLATNSPAHSSSSDDFAAFLDVAL+SHSSDSSP + AE  NN ESERIKRRKVEKL  S
Sbjct: 1   MSLATNSPAHSSSSDDFAAFLDVALESHSSDSSPNKNAEVGNNVESERIKRRKVEKLVCS 60

Query: 266 EEDILYGAEEKSLEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHRGLRLNNDE 325
           EED L G EE+SLEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIH+GLRLNNDE
Sbjct: 61  EEDTLCGVEEQSLEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHKGLRLNNDE 120

Query: 326 INRLRNIDMKNLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQTDSLEDVTKGSLF 385
           INRLRNIDMK+LLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQ DSLEDVTKGSLF
Sbjct: 121 INRLRNIDMKSLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQIDSLEDVTKGSLF 180

Query: 386 LLNSVHTMTKLRPSVHTFLKEASQLFEMYIYTMGERAYAFEMAKLLDPKREYFSAKVISR 445
           LLNSVHTMTKLRP VHTFLKEASQLFEMYIYTMGERAYA+EMAKLLDPKREYF++KVISR
Sbjct: 181 LLNSVHTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYAYEMAKLLDPKREYFNSKVISR 240

Query: 446 DDGTQKHQKGLDVVLGQESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGFNCKS 505
           DDGTQKHQKGLD+VLGQESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGFNCKS
Sbjct: 241 DDGTQKHQKGLDIVLGQESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGFNCKS 300

Query: 506 LSELKSDESETDGALATILKVLKQVHSIFFNVFPFLSVTFLFPIFFTSISLLEFLLQELS 565
           LSELKSDESETDGALATILKVLKQVH+IFFN                          ELS
Sbjct: 301 LSELKSDESETDGALATILKVLKQVHNIFFN--------------------------ELS 360

Query: 566 DDLVDRDVRQVLKTVRSKVLEGCKVVFSRVFPTKFQADNHHLWKMVEQLGGTCSTELERS 625
           DDLVDRDVRQVLKTVRSKVLEGCKVVFSRVFPTKFQA+NHHLWKMVEQLGGTCSTEL+ S
Sbjct: 361 DDLVDRDVRQVLKTVRSKVLEGCKVVFSRVFPTKFQAENHHLWKMVEQLGGTCSTELDSS 420

Query: 626 VTHVVSTDAGTEKSRWALKEEKFLVHPRWIEASNYFWKRQAEENFPVEQTKKQ 679
           VTHVVSTD GTEKSRWALKEEKFLVHPRWIEASNYFWKRQAE+NFPVEQ+KKQ
Sbjct: 421 VTHVVSTDPGTEKSRWALKEEKFLVHPRWIEASNYFWKRQAEDNFPVEQSKKQ 447

BLAST of Cla97C08G145310 vs. NCBI nr
Match: XP_023525838.1 (RNA polymerase II C-terminal domain phosphatase-like 4 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 810.8 bits (2093), Expect = 8.7e-231
Identity = 415/473 (87.74%), Positives = 432/473 (91.33%), Query Frame = 0

Query: 206 MSLATNSPAHSSSSDDFAAFLDVALDSHSSDSSPYEKAEHDNNAESERIKRRKVEKLENS 265
           MSL TNSPAHSSSSDDFAAFLDVALDSHSSDSSP EKAE  NN E+ERIKR KVEKLENS
Sbjct: 1   MSLVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPNEKAEGHNNVETERIKRHKVEKLENS 60

Query: 266 EEDILYGAEEKSLEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHRGLRLNNDE 325
            EDILYG EE S EVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIH+GLRLNNDE
Sbjct: 61  GEDILYGVEEHSSEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHKGLRLNNDE 120

Query: 326 INRLRNIDMKNLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQTDSLEDVTKGSLF 385
           INRLRNIDMKNLLQHKKLILVLDLDHTLLNSTQLGHLTPEE+YLR+QTDSLEDVTKGSLF
Sbjct: 121 INRLRNIDMKNLLQHKKLILVLDLDHTLLNSTQLGHLTPEEDYLRNQTDSLEDVTKGSLF 180

Query: 386 LLNSVHTMTKLRPSVHTFLKEASQLFEMYIYTMGERAYAFEMAKLLDPKREYFSAKVISR 445
           LL+SVHTMTKLRP VHTFLKEASQLFEMYIYTMGERAYA+EMAKLLDPKREYFS+KVISR
Sbjct: 181 LLHSVHTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYAYEMAKLLDPKREYFSSKVISR 240

Query: 446 DDGTQKHQKGLDVVLGQESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGFNCKS 505
           DDGTQKHQKGLDVVLG ESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGFNCKS
Sbjct: 241 DDGTQKHQKGLDVVLGHESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGFNCKS 300

Query: 506 LSELKSDESETDGALATILKVLKQVHSIFFNVFPFLSVTFLFPIFFTSISLLEFLLQELS 565
           LSELKSDESETDGALATILKVLKQVH+IFFN                          E+S
Sbjct: 301 LSELKSDESETDGALATILKVLKQVHNIFFN--------------------------EIS 360

Query: 566 DDLVDRDVRQVLKTVRSKVLEGCKVVFSRVFPTKFQADNHHLWKMVEQLGGTCSTELERS 625
           +DLVDRDVRQVLKTVRSKVLEGCKVVFSRVFPTKFQADNHHLWKMVE+LGGTCSTEL+ S
Sbjct: 361 EDLVDRDVRQVLKTVRSKVLEGCKVVFSRVFPTKFQADNHHLWKMVEKLGGTCSTELDAS 420

Query: 626 VTHVVSTDAGTEKSRWALKEEKFLVHPRWIEASNYFWKRQAEENFPVEQTKKQ 679
           VTH+VSTDAGTEKSRWA+KE+KFLVHP+WIEASNYFWKR+AEE FPVE TKKQ
Sbjct: 421 VTHIVSTDAGTEKSRWAVKEQKFLVHPQWIEASNYFWKREAEEKFPVEHTKKQ 447

BLAST of Cla97C08G145310 vs. NCBI nr
Match: KAG7025178.1 (RNA polymerase II C-terminal domain phosphatase-like 4 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 809.3 bits (2089), Expect = 2.5e-230
Identity = 418/473 (88.37%), Positives = 432/473 (91.33%), Query Frame = 0

Query: 206 MSLATNSPAHSSSSDDFAAFLDVALDSHSSDSSPYEKAEHDNNAESERIKRRKVEKLENS 265
           MSLATNSPAHSSSSDDFAAFLDVAL+SHSSDSSP + AE  NN ESERIKRRKVEKL  S
Sbjct: 1   MSLATNSPAHSSSSDDFAAFLDVALESHSSDSSPNKNAEVGNNVESERIKRRKVEKLVCS 60

Query: 266 EEDILYGAEEKSLEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHRGLRLNNDE 325
           EED L G EE+SLEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIH+GLRLNNDE
Sbjct: 61  EEDTLCGVEEQSLEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHKGLRLNNDE 120

Query: 326 INRLRNIDMKNLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQTDSLEDVTKGSLF 385
           INRLRNIDMK+LLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQ DSLEDVTKGSLF
Sbjct: 121 INRLRNIDMKSLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQIDSLEDVTKGSLF 180

Query: 386 LLNSVHTMTKLRPSVHTFLKEASQLFEMYIYTMGERAYAFEMAKLLDPKREYFSAKVISR 445
           LLNSVHTMTKLRP VHTFLKEASQLFEMYIYTMGERAYA+EMAKLLDPKREYF++KVISR
Sbjct: 181 LLNSVHTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYAYEMAKLLDPKREYFNSKVISR 240

Query: 446 DDGTQKHQKGLDVVLGQESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGFNCKS 505
           DDGTQKHQKGLD+VLGQESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGFNCKS
Sbjct: 241 DDGTQKHQKGLDIVLGQESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGFNCKS 300

Query: 506 LSELKSDESETDGALATILKVLKQVHSIFFNVFPFLSVTFLFPIFFTSISLLEFLLQELS 565
           LSELKSDESETDGALATILKVLKQVH+IFFN                          ELS
Sbjct: 301 LSELKSDESETDGALATILKVLKQVHNIFFN--------------------------ELS 360

Query: 566 DDLVDRDVRQVLKTVRSKVLEGCKVVFSRVFPTKFQADNHHLWKMVEQLGGTCSTELERS 625
           DDLVDRDVRQVLKTVRSKVLEGCKVVFSRVFPTKFQA+NHHLWKMVEQLGGTCSTEL+ S
Sbjct: 361 DDLVDRDVRQVLKTVRSKVLEGCKVVFSRVFPTKFQAENHHLWKMVEQLGGTCSTELDSS 420

Query: 626 VTHVVSTDAGTEKSRWALKEEKFLVHPRWIEASNYFWKRQAEENFPVEQTKKQ 679
           VTHVVSTD GTEKSRWALKE KFLVHPRWIEASNYFWKRQAE+NFPVEQ+KKQ
Sbjct: 421 VTHVVSTDPGTEKSRWALKEGKFLVHPRWIEASNYFWKRQAEDNFPVEQSKKQ 447

BLAST of Cla97C08G145310 vs. NCBI nr
Match: XP_022133134.1 (RNA polymerase II C-terminal domain phosphatase-like 4 isoform X1 [Momordica charantia])

HSP 1 Score: 809.3 bits (2089), Expect = 2.5e-230
Identity = 418/476 (87.82%), Positives = 433/476 (90.97%), Query Frame = 0

Query: 206 MSLATNSPAHSSSSDDFAAFLDVALDSHSSDSSPYEKAEHDNNAESERIKRRKVEKLENS 265
           MSL TNSPAHSSSSDDFAAFLDVALDSHSSDSSP EKAE DNN ESER+KRRKVE+LE S
Sbjct: 1   MSLVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPEEKAEGDNNVESERMKRRKVEELEGS 60

Query: 266 E---EDILYGAEEKSLEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHRGLRLN 325
           E   EDI YG EE+S EVLSKQQLCSHPGSFGNMCI+CGQRLDEESGVTFGYIH+GLRLN
Sbjct: 61  EEPQEDISYGVEEQSSEVLSKQQLCSHPGSFGNMCIMCGQRLDEESGVTFGYIHKGLRLN 120

Query: 326 NDEINRLRNIDMKNLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQTDSLEDVTKG 385
           NDEINRLRNIDMKNLLQHKKLILVLDLDHTLLNSTQLGH+TPEEEYLRSQTDSLEDVTKG
Sbjct: 121 NDEINRLRNIDMKNLLQHKKLILVLDLDHTLLNSTQLGHITPEEEYLRSQTDSLEDVTKG 180

Query: 386 SLFLLNSVHTMTKLRPSVHTFLKEASQLFEMYIYTMGERAYAFEMAKLLDPKREYFSAKV 445
           SLFLLNSVHTMTKLRP VHTFLKEASQLFEMYIYTMGERAYAFEMAKLLDPKREYFSAKV
Sbjct: 181 SLFLLNSVHTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYAFEMAKLLDPKREYFSAKV 240

Query: 446 ISRDDGTQKHQKGLDVVLGQESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGFN 505
           ISRDDGTQKH+KGLDVVLGQESAVLILDDTENAWTKHKENLILMERYHFFASSC QFG+N
Sbjct: 241 ISRDDGTQKHKKGLDVVLGQESAVLILDDTENAWTKHKENLILMERYHFFASSCRQFGYN 300

Query: 506 CKSLSELKSDESETDGALATILKVLKQVHSIFFNVFPFLSVTFLFPIFFTSISLLEFLLQ 565
           CKSLSELKSDESETDGALATILKVLKQVH+IFFN                          
Sbjct: 301 CKSLSELKSDESETDGALATILKVLKQVHTIFFN-------------------------- 360

Query: 566 ELSDDLVDRDVRQVLKTVRSKVLEGCKVVFSRVFPTKFQADNHHLWKMVEQLGGTCSTEL 625
           EL DDLVDRDVRQVLKTVRSKVLEGCKVVF+RVFPTKF ADNHHLWKMVEQLGG+CST+L
Sbjct: 361 ELLDDLVDRDVRQVLKTVRSKVLEGCKVVFTRVFPTKFPADNHHLWKMVEQLGGSCSTDL 420

Query: 626 ERSVTHVVSTDAGTEKSRWALKEEKFLVHPRWIEASNYFWKRQAEENFPVEQTKKQ 679
           + SVTHVVSTDAGTEKSRWA+KE+KFLVHPRWIEASNYFWKRQ EENFPVEQTKKQ
Sbjct: 421 DSSVTHVVSTDAGTEKSRWAVKEQKFLVHPRWIEASNYFWKRQVEENFPVEQTKKQ 450

BLAST of Cla97C08G145310 vs. ExPASy Swiss-Prot
Match: Q00IB6 (RNA polymerase II C-terminal domain phosphatase-like 4 OS=Arabidopsis thaliana OX=3702 GN=CPL4 PE=1 SV=1)

HSP 1 Score: 518.1 bits (1333), Expect = 1.5e-145
Identity = 281/478 (58.79%), Positives = 351/478 (73.43%), Query Frame = 0

Query: 206 MSLATNSPAH-SSSSDDFAAFLDVALDSHSSDSS-PYEKAEHDNNAESERIKRRKVEKLE 265
           MS+A++SP H SSSSDD AAFLD  LDS S  SS P E+ E +++ ES  +KR+K+E LE
Sbjct: 1   MSVASDSPVHSSSSSDDLAAFLDAELDSASDASSGPSEEEEAEDDVES-GLKRQKLEHLE 60

Query: 266 NSEEDILYGAEEKSLEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHRGLRLNN 325
                          E  S +  C HPGSFGNMC +CGQ+L EE+GV+F YIH+ +RLN 
Sbjct: 61  ---------------EASSSKGECEHPGSFGNMCFVCGQKL-EETGVSFRYIHKEMRLNE 120

Query: 326 DEINRLRNIDMKNLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQTDSLED---VT 385
           DEI+RLR+ D + L + +KL LVLDLDHTLLN+T L  L PEEEYL+S T SL+D   V+
Sbjct: 121 DEISRLRDSDSRFLQRQRKLYLVLDLDHTLLNTTILRDLKPEEEYLKSHTHSLQDGCNVS 180

Query: 386 KGSLFLLNSVHTMTKLRPSVHTFLKEASQLFEMYIYTMGERAYAFEMAKLLDPKREYFSA 445
            GSLFLL  +  MTKLRP VH+FLKEAS++F MYIYTMG+R YA +MAKLLDPK EYF  
Sbjct: 181 GGSLFLLEFMQMMTKLRPFVHSFLKEASEMFVMYIYTMGDRNYARQMAKLLDPKGEYFGD 240

Query: 446 KVISRDDGTQKHQKGLDVVLGQESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFG 505
           +VISRDDGT +H+K LDVVLGQESAVLILDDTENAW KHK+NLI++ERYHFF+SSC QF 
Sbjct: 241 RVISRDDGTVRHEKSLDVVLGQESAVLILDDTENAWPKHKDNLIVIERYHFFSSSCRQFD 300

Query: 506 FNCKSLSELKSDESETDGALATILKVLKQVHSIFFNVFPFLSVTFLFPIFFTSISLLEFL 565
              KSLSELKSDESE DGALAT+LKVLKQ H++FF                         
Sbjct: 301 HRYKSLSELKSDESEPDGALATVLKVLKQAHALFF------------------------- 360

Query: 566 LQELSDDLVDRDVRQVLKTVRSKVLEGCKVVFSRVFPTKFQADNHHLWKMVEQLGGTCST 625
            + + + + +RDVR +LK VR ++L+GCK+VFSRVFPTK + ++H LWKM E+LG TC+T
Sbjct: 361 -ENVDEGISNRDVRLMLKQVRKEILKGCKIVFSRVFPTKAKPEDHPLWKMAEELGATCAT 420

Query: 626 ELERSVTHVVSTDAGTEKSRWALKEEKFLVHPRWIEASNYFWKRQAEENFPVEQTKKQ 679
           E++ SVTHVV+ D GTEK+RWA++E+K++VH  WI+A+NY W +Q EENF +EQ KKQ
Sbjct: 421 EVDASVTHVVAMDVGTEKARWAVREKKYVVHRGWIDAANYLWMKQPEENFGLEQLKKQ 435

BLAST of Cla97C08G145310 vs. ExPASy Swiss-Prot
Match: Q8LL04 (RNA polymerase II C-terminal domain phosphatase-like 3 OS=Arabidopsis thaliana OX=3702 GN=CPL3 PE=1 SV=2)

HSP 1 Score: 234.2 bits (596), Expect = 4.4e-60
Identity = 136/356 (38.20%), Positives = 203/356 (57.02%), Query Frame = 0

Query: 328  RLRNIDMKN-LLQHKKLILVLDLDHTLLNSTQLGHL-TPEEEYLRSQTDSLEDVTKGSLF 387
            R+R ++ +N +   +KL LVLD+DHTLLNS +   + +  EE LR + +   +     LF
Sbjct: 912  RVRRLEEQNKMFASQKLSLVLDIDHTLLNSAKFNEVESRHEEILRKKEEQDREKPYRHLF 971

Query: 388  LLNSVHTMTKLRPSVHTFLKEASQLFEMYIYTMGERAYAFEMAKLLDPKREYFSAKVISR 447
                +   TKLRP +  FL++AS+L+E+++YTMG + YA EMAKLLDPK   F+ +VIS+
Sbjct: 972  RFLHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYATEMAKLLDPKGVLFNGRVISK 1031

Query: 448  -DDGTQ-------KHQKGLDVVLGQESAVLILDDTENAWTKHKENLILMERYHFFASSCH 507
             DDG            K L+ V+G ES+V+I+DD+   W +HK NLI +ERY +F  S  
Sbjct: 1032 GDDGDPLDGDERVPKSKDLEGVMGMESSVVIIDDSVRVWPQHKMNLIAVERYLYFPCSRR 1091

Query: 508  QFGFNCKSLSELKSDESETDGALATILKVLKQVHSIFFNVFPFLSVTFLFPIFFTSISLL 567
            QFG    SL EL  DE   +G LA+ L V++++H  FF+                     
Sbjct: 1092 QFGLLGPSLLELDRDEVPEEGTLASSLAVIEKIHQNFFS--------------------- 1151

Query: 568  EFLLQELSDDLVDRDVRQVLKTVRSKVLEGCKVVFSRVFPT-KFQADNHHLWKMVEQLGG 627
                      L + DVR +L + + K+L GC++VFSR+ P  + +   H LW+  EQ G 
Sbjct: 1152 -------HTSLDEVDVRNILASEQRKILAGCRIVFSRIIPVGEAKPHLHPLWQTAEQFGA 1211

Query: 628  TCSTELERSVTHVVSTDAGTEKSRWALKEEKFLVHPRWIEASNYFWKRQAEENFPV 673
             C+T+++  VTHVV+   GT+K  WAL   +F+VHP W+EAS + ++R  E  + +
Sbjct: 1212 VCTTQVDEHVTHVVTNSLGTDKVNWALTRGRFVVHPGWVEASAFLYQRANENLYAI 1239

BLAST of Cla97C08G145310 vs. ExPASy Swiss-Prot
Match: F4JCB2 (RNA polymerase II C-terminal domain phosphatase-like 5 OS=Arabidopsis thaliana OX=3702 GN=CPL5 PE=1 SV=2)

HSP 1 Score: 177.9 bits (450), Expect = 3.7e-43
Identity = 112/297 (37.71%), Positives = 165/297 (55.56%), Query Frame = 0

Query: 246 DNNAESERIKRRKVEKLENSEEDILYGAEEKSLEVLSKQQLCSHPGSFGNMCIICGQRLD 305
           +N +   + KRRK+E   N           +S   LS    C H      +CI C   + 
Sbjct: 299 ENFSSEPKAKRRKIEPTIN-----------ESSSSLSSSSSCGHWYICHGICIGCKSTVK 358

Query: 306 EESGVTFGYIHRGLRLNNDEINRLRNIDMK-NLLQHKKLILVLDLDHTLLNSTQLGHLTP 365
           +  G  F YI  GL+L+++ +   +    K + L  KKL LVLDLDHTLL++  +  L+ 
Sbjct: 359 KSQGRAFDYIFDGLQLSHEAVALTKCFTTKLSCLNEKKLHLVLDLDHTLLHTVMVPSLSQ 418

Query: 366 EEEYLRSQTDSLEDVTKGSLFLLNSV----HTMTKLRPSVHTFLKEASQLFEMYIYTMGE 425
            E+YL  +  S    T+  L+ + +V      +TKLRP +  FLKEA++ F MY+YT G 
Sbjct: 419 AEKYLIEEAGS---ATRDDLWKIKAVGDPMEFLTKLRPFLRDFLKEANEFFTMYVYTKGS 478

Query: 426 RAYAFEMAKLLDPKREYFSAKVISRDDGTQKHQKGLDVVLGQESAVLILDDTENAWTKHK 485
           R YA ++ +L+DPK+ YF  +VI++ +    H K LD VL +E  V+I+DDT N W  HK
Sbjct: 479 RVYAKQVLELIDPKKLYFGDRVITKTE--SPHMKTLDFVLAEERGVVIVDDTRNVWPDHK 538

Query: 486 ENLILMERYHFFASSCHQFGFNCKSLSELKSDESETDGALATILKVLKQVHSIFFNV 538
            NL+ + +Y +F       G +    SE K+DESE++G LA +LK+LK+VH  FF V
Sbjct: 539 SNLVDISKYSYFRLK----GQDSMPYSEEKTDESESEGGLANVLKLLKEVHQRFFRV 575

BLAST of Cla97C08G145310 vs. ExPASy Swiss-Prot
Match: Q95QG8 (RNA polymerase II subunit A C-terminal domain phosphatase OS=Caenorhabditis elegans OX=6239 GN=fcp-1 PE=1 SV=2)

HSP 1 Score: 139.4 bits (350), Expect = 1.5e-31
Identity = 111/408 (27.21%), Positives = 196/408 (48.04%), Query Frame = 0

Query: 279 EVLSKQQLCSHPGSFGNMCIICGQRLDEESG----------VTFGYIHR--GLRLNNDEI 338
           +V++    C+H     +MC  CG+ L E+ G               IH    L +++   
Sbjct: 68  QVIATVSECTHAIVIKDMCATCGKDLREKGGRAGQRKEQSTANVSMIHHVPELIVSDTLA 127

Query: 339 NRLRNIDMKNLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQTDSLEDVTKGSLFL 398
             + + D  NL+ ++KL+L++DLD T+++++        ++ +   T++ +D+TK +L  
Sbjct: 128 KEIGSADENNLITNRKLVLLVDLDQTIIHTS--------DKPMTVDTENHKDITKYNLH- 187

Query: 399 LNSVHTMTKLRPSVHTFLKEASQLFEMYIYTMGERAYAFEMAKLLDPKREYFSAKVISRD 458
            + V+T TKLRP    FL + S ++EM+I T G+R YA  +A++LDP    F  +++SRD
Sbjct: 188 -SRVYT-TKLRPHTTEFLNKMSNMYEMHIVTYGQRQYAHRIAQILDPDARLFEQRILSRD 247

Query: 459 D--GTQKHQKGLDVVLG-QESAVLILDDTENAWTKHKENLILMERYHFF--ASSCHQFGF 518
           +    Q     L  +    ++ V+I+DD  + W  + E LI ++ Y FF      +    
Sbjct: 248 ELFSAQHKTNNLKALFPCGDNLVVIIDDRSDVW-MYSEALIQIKPYRFFKEVGDINAPKN 307

Query: 519 NCKSLSELKSDESETDGALATILKVLKQVHSIFFNVFPFLSVTFLFPIFFTSISLLEFLL 578
           + + +     D++  D  L  I +VL  +H  ++                      E LL
Sbjct: 308 SKEQMPVQIEDDAHEDKVLEEIERVLTNIHDKYYEKHDLRGSE-------------EVLL 367

Query: 579 QELSDDLVDRDVRQVLKTVRSKVLEGCKVVFSRVFPTKFQADNHHLWKMVEQLGGTCSTE 638
                     DV++V+K  R KVL+GC +VFS + P   + +   ++++  Q G     +
Sbjct: 368 ----------DVKEVIKEERHKVLDGCVIVFSGIVPMGEKLERTDIYRLCTQFGAVIVPD 427

Query: 639 LERSVTHVVSTDAGTEKSRWALKEEKFLVHPRWIEASNYFWKRQAEEN 670
           +   VTHVV    GT+K   A +  KF+V  +W+ A    W + A+EN
Sbjct: 428 VTDDVTHVVGARYGTQKVYQANRLNKFVVTVQWVYACVEKWLK-ADEN 439

BLAST of Cla97C08G145310 vs. ExPASy Swiss-Prot
Match: Q8SV03 (RNA polymerase II subunit A C-terminal domain phosphatase OS=Encephalitozoon cuniculi (strain GB-M1) OX=284813 GN=FCP1 PE=1 SV=1)

HSP 1 Score: 107.8 bits (268), Expect = 4.8e-22
Identity = 67/208 (32.21%), Positives = 112/208 (53.85%), Query Frame = 0

Query: 287 CSHPGSFGNMCIICGQRLDEESGVTFG-YIHRGLRLNNDEINRLRNIDMKNLLQHKKLIL 346
           C+HP   G +C +CG  + EES +    Y    +++ ++E   +    M+ L    KLIL
Sbjct: 4   CNHPIRLGTLCGVCGMEIQEESHLFCALYNTDNVKITHEEAVAIHKEKMEALEMQMKLIL 63

Query: 347 VLDLDHTLLNSTQLGHLTPEEEYLRSQTDSLEDVTKGSLFLLNSVHTMTKLRPSVHTFLK 406
           VLDLD T+L++T               T SLE   K   F+++      KLRP++   L+
Sbjct: 64  VLDLDQTVLHTTY-------------GTSSLEGTVK---FVIDRCRYCVKLRPNLDYMLR 123

Query: 407 EASQLFEMYIYTMGERAYAFEMAKLLDPKREYFSAKVISRDDGTQKHQKGLDVVLGQESA 466
             S+L+E+++YTMG RAYA  + +++DP  +YF  ++I+RD+      K L  +   +  
Sbjct: 124 RISKLYEIHVYTMGTRAYAERIVEIIDPSGKYFDDRIITRDENQGVLVKRLSRLFPHDHR 183

Query: 467 -VLILDDTENAWTKHKENLILMERYHFF 493
            ++ILDD  + W  + ENL+L+  + +F
Sbjct: 184 NIVILDDRPDVW-DYCENLVLIRPFWYF 194

BLAST of Cla97C08G145310 vs. ExPASy TrEMBL
Match: A0A6J1EFC1 (RNA polymerase II C-terminal domain phosphatase-like OS=Cucurbita moschata OX=3662 GN=LOC111432775 PE=4 SV=1)

HSP 1 Score: 812.0 bits (2096), Expect = 1.9e-231
Identity = 419/473 (88.58%), Positives = 433/473 (91.54%), Query Frame = 0

Query: 206 MSLATNSPAHSSSSDDFAAFLDVALDSHSSDSSPYEKAEHDNNAESERIKRRKVEKLENS 265
           MSLATNSPAHSSSSDDFAAFLDVAL+SHSSDSSP + AE  NN ESERIKRRKVEKL  S
Sbjct: 1   MSLATNSPAHSSSSDDFAAFLDVALESHSSDSSPNKNAEVGNNVESERIKRRKVEKLVCS 60

Query: 266 EEDILYGAEEKSLEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHRGLRLNNDE 325
           EED L G EE+SLEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIH+GLRLNNDE
Sbjct: 61  EEDTLCGVEEQSLEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHKGLRLNNDE 120

Query: 326 INRLRNIDMKNLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQTDSLEDVTKGSLF 385
           INRLRNIDMK+LLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQ DSLEDVTKGSLF
Sbjct: 121 INRLRNIDMKSLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQIDSLEDVTKGSLF 180

Query: 386 LLNSVHTMTKLRPSVHTFLKEASQLFEMYIYTMGERAYAFEMAKLLDPKREYFSAKVISR 445
           LLNSVHTMTKLRP VHTFLKEASQLFEMYIYTMGERAYA+EMAKLLDPKREYF++KVISR
Sbjct: 181 LLNSVHTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYAYEMAKLLDPKREYFNSKVISR 240

Query: 446 DDGTQKHQKGLDVVLGQESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGFNCKS 505
           DDGTQKHQKGLD+VLGQESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGFNCKS
Sbjct: 241 DDGTQKHQKGLDIVLGQESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGFNCKS 300

Query: 506 LSELKSDESETDGALATILKVLKQVHSIFFNVFPFLSVTFLFPIFFTSISLLEFLLQELS 565
           LSELKSDESETDGALATILKVLKQVH+IFFN                          ELS
Sbjct: 301 LSELKSDESETDGALATILKVLKQVHNIFFN--------------------------ELS 360

Query: 566 DDLVDRDVRQVLKTVRSKVLEGCKVVFSRVFPTKFQADNHHLWKMVEQLGGTCSTELERS 625
           DDLVDRDVRQVLKTVRSKVLEGCKVVFSRVFPTKFQA+NHHLWKMVEQLGGTCSTEL+ S
Sbjct: 361 DDLVDRDVRQVLKTVRSKVLEGCKVVFSRVFPTKFQAENHHLWKMVEQLGGTCSTELDSS 420

Query: 626 VTHVVSTDAGTEKSRWALKEEKFLVHPRWIEASNYFWKRQAEENFPVEQTKKQ 679
           VTHVVSTD GTEKSRWALKEEKFLVHPRWIEASNYFWKRQAE+NFPVEQ+KKQ
Sbjct: 421 VTHVVSTDPGTEKSRWALKEEKFLVHPRWIEASNYFWKRQAEDNFPVEQSKKQ 447

BLAST of Cla97C08G145310 vs. ExPASy TrEMBL
Match: A0A6J1BV42 (RNA polymerase II C-terminal domain phosphatase-like OS=Momordica charantia OX=3673 GN=LOC111005808 PE=4 SV=1)

HSP 1 Score: 809.3 bits (2089), Expect = 1.2e-230
Identity = 418/476 (87.82%), Positives = 433/476 (90.97%), Query Frame = 0

Query: 206 MSLATNSPAHSSSSDDFAAFLDVALDSHSSDSSPYEKAEHDNNAESERIKRRKVEKLENS 265
           MSL TNSPAHSSSSDDFAAFLDVALDSHSSDSSP EKAE DNN ESER+KRRKVE+LE S
Sbjct: 1   MSLVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPEEKAEGDNNVESERMKRRKVEELEGS 60

Query: 266 E---EDILYGAEEKSLEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHRGLRLN 325
           E   EDI YG EE+S EVLSKQQLCSHPGSFGNMCI+CGQRLDEESGVTFGYIH+GLRLN
Sbjct: 61  EEPQEDISYGVEEQSSEVLSKQQLCSHPGSFGNMCIMCGQRLDEESGVTFGYIHKGLRLN 120

Query: 326 NDEINRLRNIDMKNLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQTDSLEDVTKG 385
           NDEINRLRNIDMKNLLQHKKLILVLDLDHTLLNSTQLGH+TPEEEYLRSQTDSLEDVTKG
Sbjct: 121 NDEINRLRNIDMKNLLQHKKLILVLDLDHTLLNSTQLGHITPEEEYLRSQTDSLEDVTKG 180

Query: 386 SLFLLNSVHTMTKLRPSVHTFLKEASQLFEMYIYTMGERAYAFEMAKLLDPKREYFSAKV 445
           SLFLLNSVHTMTKLRP VHTFLKEASQLFEMYIYTMGERAYAFEMAKLLDPKREYFSAKV
Sbjct: 181 SLFLLNSVHTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYAFEMAKLLDPKREYFSAKV 240

Query: 446 ISRDDGTQKHQKGLDVVLGQESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGFN 505
           ISRDDGTQKH+KGLDVVLGQESAVLILDDTENAWTKHKENLILMERYHFFASSC QFG+N
Sbjct: 241 ISRDDGTQKHKKGLDVVLGQESAVLILDDTENAWTKHKENLILMERYHFFASSCRQFGYN 300

Query: 506 CKSLSELKSDESETDGALATILKVLKQVHSIFFNVFPFLSVTFLFPIFFTSISLLEFLLQ 565
           CKSLSELKSDESETDGALATILKVLKQVH+IFFN                          
Sbjct: 301 CKSLSELKSDESETDGALATILKVLKQVHTIFFN-------------------------- 360

Query: 566 ELSDDLVDRDVRQVLKTVRSKVLEGCKVVFSRVFPTKFQADNHHLWKMVEQLGGTCSTEL 625
           EL DDLVDRDVRQVLKTVRSKVLEGCKVVF+RVFPTKF ADNHHLWKMVEQLGG+CST+L
Sbjct: 361 ELLDDLVDRDVRQVLKTVRSKVLEGCKVVFTRVFPTKFPADNHHLWKMVEQLGGSCSTDL 420

Query: 626 ERSVTHVVSTDAGTEKSRWALKEEKFLVHPRWIEASNYFWKRQAEENFPVEQTKKQ 679
           + SVTHVVSTDAGTEKSRWA+KE+KFLVHPRWIEASNYFWKRQ EENFPVEQTKKQ
Sbjct: 421 DSSVTHVVSTDAGTEKSRWAVKEQKFLVHPRWIEASNYFWKRQVEENFPVEQTKKQ 450

BLAST of Cla97C08G145310 vs. ExPASy TrEMBL
Match: A0A6J1ID30 (RNA polymerase II C-terminal domain phosphatase-like OS=Cucurbita maxima OX=3661 GN=LOC111471991 PE=4 SV=1)

HSP 1 Score: 805.8 bits (2080), Expect = 1.4e-229
Identity = 414/473 (87.53%), Positives = 429/473 (90.70%), Query Frame = 0

Query: 206 MSLATNSPAHSSSSDDFAAFLDVALDSHSSDSSPYEKAEHDNNAESERIKRRKVEKLENS 265
           MSL TNSPAHSSSSDDFAAFLDVALDSHSSDS P EKAE  NN E+ERIKR KVEKLENS
Sbjct: 1   MSLVTNSPAHSSSSDDFAAFLDVALDSHSSDSLPNEKAEGHNNVETERIKRHKVEKLENS 60

Query: 266 EEDILYGAEEKSLEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHRGLRLNNDE 325
            EDILYG EE S EVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIH+GLRLNNDE
Sbjct: 61  GEDILYGVEEHSSEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHKGLRLNNDE 120

Query: 326 INRLRNIDMKNLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQTDSLEDVTKGSLF 385
           INRLRNIDMK LLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLR+Q DSLEDVTKGSLF
Sbjct: 121 INRLRNIDMKKLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRNQMDSLEDVTKGSLF 180

Query: 386 LLNSVHTMTKLRPSVHTFLKEASQLFEMYIYTMGERAYAFEMAKLLDPKREYFSAKVISR 445
           LL+SVHTMTKLRP VHTFLKEASQLFEMYIYTMGERAYA+EMAKLLDPKREYFS+KVISR
Sbjct: 181 LLHSVHTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYAYEMAKLLDPKREYFSSKVISR 240

Query: 446 DDGTQKHQKGLDVVLGQESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGFNCKS 505
           DDGTQKHQKGLDVVLG ESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGFNCKS
Sbjct: 241 DDGTQKHQKGLDVVLGHESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGFNCKS 300

Query: 506 LSELKSDESETDGALATILKVLKQVHSIFFNVFPFLSVTFLFPIFFTSISLLEFLLQELS 565
           LSELKSDESE+DGALATILKVLKQVH+IFFN                          ELS
Sbjct: 301 LSELKSDESESDGALATILKVLKQVHNIFFN--------------------------ELS 360

Query: 566 DDLVDRDVRQVLKTVRSKVLEGCKVVFSRVFPTKFQADNHHLWKMVEQLGGTCSTELERS 625
           +DLVDRDVRQVLKTVRSKVLEGCKVVFSRVFPTKFQADNHHLWKMVEQLGGTCSTEL+ S
Sbjct: 361 EDLVDRDVRQVLKTVRSKVLEGCKVVFSRVFPTKFQADNHHLWKMVEQLGGTCSTELDPS 420

Query: 626 VTHVVSTDAGTEKSRWALKEEKFLVHPRWIEASNYFWKRQAEENFPVEQTKKQ 679
           VTH+VSTDAGTEKSRWA+KE+KFLVHP+WIEASNYFWKR+AEE FPVE TKKQ
Sbjct: 421 VTHIVSTDAGTEKSRWAIKEQKFLVHPQWIEASNYFWKREAEEKFPVEHTKKQ 447

BLAST of Cla97C08G145310 vs. ExPASy TrEMBL
Match: A0A6J1GC38 (RNA polymerase II C-terminal domain phosphatase-like OS=Cucurbita moschata OX=3662 GN=LOC111452801 PE=4 SV=1)

HSP 1 Score: 803.1 bits (2073), Expect = 8.8e-229
Identity = 414/473 (87.53%), Positives = 429/473 (90.70%), Query Frame = 0

Query: 206 MSLATNSPAHSSSSDDFAAFLDVALDSHSSDSSPYEKAEHDNNAESERIKRRKVEKLENS 265
           MSL TNS AHSSSSDDFAAFLDVALDSHSSDSSP EKAE  NN E+ERIKR KVEKLENS
Sbjct: 1   MSLVTNSLAHSSSSDDFAAFLDVALDSHSSDSSPNEKAEGHNNVETERIKRHKVEKLENS 60

Query: 266 EEDILYGAEEKSLEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHRGLRLNNDE 325
            EDILYG EE S EVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIH+GLRLNNDE
Sbjct: 61  GEDILYGVEEHSSEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHKGLRLNNDE 120

Query: 326 INRLRNIDMKNLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQTDSLEDVTKGSLF 385
           INRLRNIDMKNLLQHKKLILVLDLDHTLLNSTQLGHLTPEE+YLR+QTDSLEDVTKGSLF
Sbjct: 121 INRLRNIDMKNLLQHKKLILVLDLDHTLLNSTQLGHLTPEEDYLRNQTDSLEDVTKGSLF 180

Query: 386 LLNSVHTMTKLRPSVHTFLKEASQLFEMYIYTMGERAYAFEMAKLLDPKREYFSAKVISR 445
           LL+SVHTMTKLRP VHTFLKEASQLFEMYIYTMGERAYA+EMAKLLDPKREYFS+KVISR
Sbjct: 181 LLHSVHTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYAYEMAKLLDPKREYFSSKVISR 240

Query: 446 DDGTQKHQKGLDVVLGQESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGFNCKS 505
           DDGTQKHQKGLDVVLG ESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGFNCKS
Sbjct: 241 DDGTQKHQKGLDVVLGHESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGFNCKS 300

Query: 506 LSELKSDESETDGALATILKVLKQVHSIFFNVFPFLSVTFLFPIFFTSISLLEFLLQELS 565
           LSELKSDESETDGALATILKVLKQVH+IFFN                          ELS
Sbjct: 301 LSELKSDESETDGALATILKVLKQVHNIFFN--------------------------ELS 360

Query: 566 DDLVDRDVRQVLKTVRSKVLEGCKVVFSRVFPTKFQADNHHLWKMVEQLGGTCSTELERS 625
           +DLVDRDVRQVLKTVRSKVLEGCKVVFSRVFPTKFQADNHHLWKMVEQLGGTCSTEL+ S
Sbjct: 361 EDLVDRDVRQVLKTVRSKVLEGCKVVFSRVFPTKFQADNHHLWKMVEQLGGTCSTELDAS 420

Query: 626 VTHVVSTDAGTEKSRWALKEEKFLVHPRWIEASNYFWKRQAEENFPVEQTKKQ 679
           VTH+VSTDAG EKSRWA+KE+KFLVHP+WIEASNYFWKR+AEE F VE TKKQ
Sbjct: 421 VTHIVSTDAGMEKSRWAVKEQKFLVHPQWIEASNYFWKREAEEKFLVEHTKKQ 447

BLAST of Cla97C08G145310 vs. ExPASy TrEMBL
Match: A0A6J1CJQ5 (RNA polymerase II C-terminal domain phosphatase-like OS=Momordica charantia OX=3673 GN=LOC111012040 PE=4 SV=1)

HSP 1 Score: 800.8 bits (2067), Expect = 4.3e-228
Identity = 413/476 (86.76%), Positives = 431/476 (90.55%), Query Frame = 0

Query: 206 MSLATNSPAHSSSSDDFAAFLDVALDSHSSDSSPYEKAEHDNNAESERIKRRKVEKLENS 265
           MSL T+SPAHSSSSDDFAAFLDVALDSHSSDSSP EKAE DNN ESERIKRRKVEKLE S
Sbjct: 1   MSLVTDSPAHSSSSDDFAAFLDVALDSHSSDSSPEEKAEGDNNVESERIKRRKVEKLEGS 60

Query: 266 E---EDILYGAEEKSLEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHRGLRLN 325
           E   EDI+Y  EE+S EVLSKQQLC HPGSFGNMCIICGQRLD ESGVTFGYIH+GLRLN
Sbjct: 61  EEPQEDIMYRVEEQSSEVLSKQQLCGHPGSFGNMCIICGQRLDGESGVTFGYIHKGLRLN 120

Query: 326 NDEINRLRNIDMKNLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQTDSLEDVTKG 385
           NDEINRLRNIDMKNLLQHKKLILVLDLDHTLLNSTQLGH+TPEEEYLRSQTDSL+DVTKG
Sbjct: 121 NDEINRLRNIDMKNLLQHKKLILVLDLDHTLLNSTQLGHITPEEEYLRSQTDSLKDVTKG 180

Query: 386 SLFLLNSVHTMTKLRPSVHTFLKEASQLFEMYIYTMGERAYAFEMAKLLDPKREYFSAKV 445
           SLFLLNS+HTMTKLRP +HTFLKEASQLFEMYIYTMGERAYA EMAKLLDPKR YFSA+V
Sbjct: 181 SLFLLNSIHTMTKLRPFIHTFLKEASQLFEMYIYTMGERAYAVEMAKLLDPKRAYFSARV 240

Query: 446 ISRDDGTQKHQKGLDVVLGQESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGFN 505
           ISRDDGTQKHQKGLDVVLGQESAVLILDDTENAWTKHKENLILMERYHFFASSC QFG+N
Sbjct: 241 ISRDDGTQKHQKGLDVVLGQESAVLILDDTENAWTKHKENLILMERYHFFASSCRQFGYN 300

Query: 506 CKSLSELKSDESETDGALATILKVLKQVHSIFFNVFPFLSVTFLFPIFFTSISLLEFLLQ 565
           CKSLSELKSDESETDGALA+ILKVLKQVH+IFFN                          
Sbjct: 301 CKSLSELKSDESETDGALASILKVLKQVHTIFFN-------------------------- 360

Query: 566 ELSDDLVDRDVRQVLKTVRSKVLEGCKVVFSRVFPTKFQADNHHLWKMVEQLGGTCSTEL 625
           ELSDDLVDRDVRQVLKTVRSKVLEGCKVVF+RVFP KFQADNHHLWKMVEQLGG+CST+L
Sbjct: 361 ELSDDLVDRDVRQVLKTVRSKVLEGCKVVFTRVFPAKFQADNHHLWKMVEQLGGSCSTDL 420

Query: 626 ERSVTHVVSTDAGTEKSRWALKEEKFLVHPRWIEASNYFWKRQAEENFPVEQTKKQ 679
           + SVTHVVSTDAGTEKSRWA+KE+KFLVHPRWIEASNYFWKRQAEENFPVEQTKKQ
Sbjct: 421 DPSVTHVVSTDAGTEKSRWAVKEQKFLVHPRWIEASNYFWKRQAEENFPVEQTKKQ 450

BLAST of Cla97C08G145310 vs. TAIR 10
Match: AT5G58003.1 (C-terminal domain phosphatase-like 4 )

HSP 1 Score: 518.1 bits (1333), Expect = 1.1e-146
Identity = 281/478 (58.79%), Positives = 351/478 (73.43%), Query Frame = 0

Query: 206 MSLATNSPAH-SSSSDDFAAFLDVALDSHSSDSS-PYEKAEHDNNAESERIKRRKVEKLE 265
           MS+A++SP H SSSSDD AAFLD  LDS S  SS P E+ E +++ ES  +KR+K+E LE
Sbjct: 1   MSVASDSPVHSSSSSDDLAAFLDAELDSASDASSGPSEEEEAEDDVES-GLKRQKLEHLE 60

Query: 266 NSEEDILYGAEEKSLEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHRGLRLNN 325
                          E  S +  C HPGSFGNMC +CGQ+L EE+GV+F YIH+ +RLN 
Sbjct: 61  ---------------EASSSKGECEHPGSFGNMCFVCGQKL-EETGVSFRYIHKEMRLNE 120

Query: 326 DEINRLRNIDMKNLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQTDSLED---VT 385
           DEI+RLR+ D + L + +KL LVLDLDHTLLN+T L  L PEEEYL+S T SL+D   V+
Sbjct: 121 DEISRLRDSDSRFLQRQRKLYLVLDLDHTLLNTTILRDLKPEEEYLKSHTHSLQDGCNVS 180

Query: 386 KGSLFLLNSVHTMTKLRPSVHTFLKEASQLFEMYIYTMGERAYAFEMAKLLDPKREYFSA 445
            GSLFLL  +  MTKLRP VH+FLKEAS++F MYIYTMG+R YA +MAKLLDPK EYF  
Sbjct: 181 GGSLFLLEFMQMMTKLRPFVHSFLKEASEMFVMYIYTMGDRNYARQMAKLLDPKGEYFGD 240

Query: 446 KVISRDDGTQKHQKGLDVVLGQESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFG 505
           +VISRDDGT +H+K LDVVLGQESAVLILDDTENAW KHK+NLI++ERYHFF+SSC QF 
Sbjct: 241 RVISRDDGTVRHEKSLDVVLGQESAVLILDDTENAWPKHKDNLIVIERYHFFSSSCRQFD 300

Query: 506 FNCKSLSELKSDESETDGALATILKVLKQVHSIFFNVFPFLSVTFLFPIFFTSISLLEFL 565
              KSLSELKSDESE DGALAT+LKVLKQ H++FF                         
Sbjct: 301 HRYKSLSELKSDESEPDGALATVLKVLKQAHALFF------------------------- 360

Query: 566 LQELSDDLVDRDVRQVLKTVRSKVLEGCKVVFSRVFPTKFQADNHHLWKMVEQLGGTCST 625
            + + + + +RDVR +LK VR ++L+GCK+VFSRVFPTK + ++H LWKM E+LG TC+T
Sbjct: 361 -ENVDEGISNRDVRLMLKQVRKEILKGCKIVFSRVFPTKAKPEDHPLWKMAEELGATCAT 420

Query: 626 ELERSVTHVVSTDAGTEKSRWALKEEKFLVHPRWIEASNYFWKRQAEENFPVEQTKKQ 679
           E++ SVTHVV+ D GTEK+RWA++E+K++VH  WI+A+NY W +Q EENF +EQ KKQ
Sbjct: 421 EVDASVTHVVAMDVGTEKARWAVREKKYVVHRGWIDAANYLWMKQPEENFGLEQLKKQ 435

BLAST of Cla97C08G145310 vs. TAIR 10
Match: AT2G33540.1 (C-terminal domain phosphatase-like 3 )

HSP 1 Score: 234.2 bits (596), Expect = 3.1e-61
Identity = 136/356 (38.20%), Positives = 203/356 (57.02%), Query Frame = 0

Query: 328  RLRNIDMKN-LLQHKKLILVLDLDHTLLNSTQLGHL-TPEEEYLRSQTDSLEDVTKGSLF 387
            R+R ++ +N +   +KL LVLD+DHTLLNS +   + +  EE LR + +   +     LF
Sbjct: 912  RVRRLEEQNKMFASQKLSLVLDIDHTLLNSAKFNEVESRHEEILRKKEEQDREKPYRHLF 971

Query: 388  LLNSVHTMTKLRPSVHTFLKEASQLFEMYIYTMGERAYAFEMAKLLDPKREYFSAKVISR 447
                +   TKLRP +  FL++AS+L+E+++YTMG + YA EMAKLLDPK   F+ +VIS+
Sbjct: 972  RFLHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYATEMAKLLDPKGVLFNGRVISK 1031

Query: 448  -DDGTQ-------KHQKGLDVVLGQESAVLILDDTENAWTKHKENLILMERYHFFASSCH 507
             DDG            K L+ V+G ES+V+I+DD+   W +HK NLI +ERY +F  S  
Sbjct: 1032 GDDGDPLDGDERVPKSKDLEGVMGMESSVVIIDDSVRVWPQHKMNLIAVERYLYFPCSRR 1091

Query: 508  QFGFNCKSLSELKSDESETDGALATILKVLKQVHSIFFNVFPFLSVTFLFPIFFTSISLL 567
            QFG    SL EL  DE   +G LA+ L V++++H  FF+                     
Sbjct: 1092 QFGLLGPSLLELDRDEVPEEGTLASSLAVIEKIHQNFFS--------------------- 1151

Query: 568  EFLLQELSDDLVDRDVRQVLKTVRSKVLEGCKVVFSRVFPT-KFQADNHHLWKMVEQLGG 627
                      L + DVR +L + + K+L GC++VFSR+ P  + +   H LW+  EQ G 
Sbjct: 1152 -------HTSLDEVDVRNILASEQRKILAGCRIVFSRIIPVGEAKPHLHPLWQTAEQFGA 1211

Query: 628  TCSTELERSVTHVVSTDAGTEKSRWALKEEKFLVHPRWIEASNYFWKRQAEENFPV 673
             C+T+++  VTHVV+   GT+K  WAL   +F+VHP W+EAS + ++R  E  + +
Sbjct: 1212 VCTTQVDEHVTHVVTNSLGTDKVNWALTRGRFVVHPGWVEASAFLYQRANENLYAI 1239

BLAST of Cla97C08G145310 vs. TAIR 10
Match: AT3G17550.1 (Haloacid dehalogenase-like hydrolase (HAD) superfamily protein )

HSP 1 Score: 183.3 bits (464), Expect = 6.3e-46
Identity = 121/312 (38.78%), Positives = 176/312 (56.41%), Query Frame = 0

Query: 262 LENSEEDILYGAEEKSLEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHRGLRL 321
           +EN   +      E S  + S +  C H      +CI C   +++  G  F Y+ +GL+L
Sbjct: 4   VENISMEFEPAINESSSSLSSSRSSCGHWYVRYGVCIACKSTVNKRHGRAFDYLVQGLQL 63

Query: 322 NNDEINRLRNIDMK-NLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQTDSLEDVT 381
           +++     +    +   L  KKL LVLDLDHTLL+S ++  L+  E+ L  +  S    T
Sbjct: 64  SHEAAAFTKRFTTQFYCLNEKKLNLVLDLDHTLLHSIRVSLLSETEKCLIEEACS---TT 123

Query: 382 KGSLFLLNSVHTMTKLRPSVHTFLKEASQLFEMYIYTMGERAYAFEMAKLLDPKREYFSA 441
           +  L+ L+S + +TKLRP VH FLKEA++LF MY+YTMG R YA  + KL+DPKR YF  
Sbjct: 124 REDLWKLDSDY-LTKLRPFVHEFLKEANELFTMYVYTMGTRVYAESLLKLIDPKRIYFGD 183

Query: 442 KVISRDDGTQKHQKGLDVVLGQESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFG 501
           +VI+RD+    + K LD+VL +E  V+I+DDT + WT HK NL+ +  YHFF  +  +  
Sbjct: 184 RVITRDE--SPYVKTLDLVLAEERGVVIVDDTSDVWTHHKSNLVEINEYHFFRVNGPE-- 243

Query: 502 FNCKSLSELKSDESETDGALATILKVLKQVHSIFFNVFPFLSVTFLFPIFFTSISLLEFL 561
               S +E K DES+ +G LA +LK+LK+VH  FF V   L               + FL
Sbjct: 244 -ESNSYTEEKRDESKNNGGLANVLKLLKEVHYGFFRVKEEL-----------ESQDVRFL 295

Query: 562 LQELSDDLVDRD 573
           LQE+   L+ +D
Sbjct: 304 LQEIDFKLLTKD 295

BLAST of Cla97C08G145310 vs. TAIR 10
Match: AT2G04930.1 (Haloacid dehalogenase-like hydrolase (HAD) superfamily protein )

HSP 1 Score: 180.3 bits (456), Expect = 5.4e-45
Identity = 104/261 (39.85%), Positives = 158/261 (60.54%), Query Frame = 0

Query: 280 VLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHRGLRLNNDEINRLRNIDMK-NLL 339
           V +    C H   F  +CI C  ++ +     F YI +GL+L+N+ +   +++  K + L
Sbjct: 3   VTTSSSCCGHWYVFQGICIGCKSKVHKSQFRKFDYIFKGLQLSNEAVALTKSLTTKHSCL 62

Query: 340 QHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQTDS--LEDVTKGSLFLLNSVHTMTKL 399
             KKL LVLDLDHTLL+S  + +L+  E YL  +  S   ED+ K    + + +  + KL
Sbjct: 63  NEKKLHLVLDLDHTLLHSKLVSNLSQAERYLIQEASSRTREDLWKFRP-IGHPIDRLIKL 122

Query: 400 RPSVHTFLKEASQLFEMYIYTMGERAYAFEMAKLLDPKREYFSAKVISRDDGTQKHQKGL 459
           RP V  FLKEA+++F M++YTMG R YA  + +++DPK+ YF  +VI++D+  +   K L
Sbjct: 123 RPFVRDFLKEANEMFTMFVYTMGSRIYAKAILEMIDPKKLYFGNRVITKDESPR--MKTL 182

Query: 460 DVVLGQESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGFNCKSLSELKSDESET 519
           ++VL +E  V+I+DDT + W  HK NLI + +Y +F  S    G +  S SE K+DE E 
Sbjct: 183 NLVLAEERGVVIVDDTRDIWPHHKNNLIQIRKYKYFRRS----GLDSNSYSEKKTDEGEN 242

Query: 520 DGALATILKVLKQVHSIFFNV 538
           DG LA +LK+L++VH  FF V
Sbjct: 243 DGGLANVLKLLREVHRRFFIV 256

BLAST of Cla97C08G145310 vs. TAIR 10
Match: AT5G54210.1 (Haloacid dehalogenase-like hydrolase (HAD) superfamily protein )

HSP 1 Score: 179.1 bits (453), Expect = 1.2e-44
Identity = 102/254 (40.16%), Positives = 152/254 (59.84%), Query Frame = 0

Query: 287 CSHPGSFGNMCIICGQRLDEESGVTFGYIHRGLRLNNDEINRLRNIDMK-NLLQHKKLIL 346
           C H      +C  C   ++   G +F Y+  GL+L++  +   + +  +      KKL L
Sbjct: 32  CDHFFVRYGICCNCRSNVERHRGRSFDYLVDGLQLSDIAVTVTKRVTTQITCFNDKKLHL 91

Query: 347 VLDLDHTLLNSTQLGHLTPEEEYLRSQTDSLEDVTKGSLFLLN---SVHTMTKLRPSVHT 406
           VLDLDHTLL++  + +LT EE YL  + DS ED+ +     LN   S   + KLRP VH 
Sbjct: 92  VLDLDHTLLHTVMISNLTKEETYLIEEEDSREDLRR-----LNGGYSSEFLIKLRPFVHE 151

Query: 407 FLKEASQLFEMYIYTMGERAYAFEMAKLLDPKREYFSAKVISRDDGTQKHQKGLDVVLGQ 466
           FLKEA+++F MY+YTMG+R YA  +  L+DP++ YF  +VI+R++    + K LD+VL  
Sbjct: 152 FLKEANKMFSMYVYTMGDRDYAMNVLNLIDPEKVYFGDRVITRNE--SPYIKTLDLVLAD 211

Query: 467 ESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGFNCKSLSELKSDESETDGALAT 526
           E  V+I+DDT + W  HK NL+ + +Y++F+          KS +E K DES  DG+LA 
Sbjct: 212 ECGVVIVDDTPHVWPDHKRNLLEITKYNYFSDKTRHDVKYTKSYAEEKRDESRNDGSLAN 271

Query: 527 ILKVLKQVHSIFFN 537
           +LKV+KQV+  FF+
Sbjct: 272 VLKVIKQVYEGFFS 278

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
XP_038890381.1	6.9e-236	89.43	RNA polymerase II C-terminal domain phosphatase-like 4 isoform X1 [Benincasa his...
XP_022925487.1	3.9e-231	88.58	RNA polymerase II C-terminal domain phosphatase-like 4 isoform X1 [Cucurbita mos...
XP_023525838.1	8.7e-231	87.74	RNA polymerase II C-terminal domain phosphatase-like 4 [Cucurbita pepo subsp. pe...
KAG7025178.1	2.5e-230	88.37	RNA polymerase II C-terminal domain phosphatase-like 4 [Cucurbita argyrosperma s...
XP_022133134.1	2.5e-230	87.82	RNA polymerase II C-terminal domain phosphatase-like 4 isoform X1 [Momordica cha...

Match Name	E-value	Identity	Description
Q00IB6	1.5e-145	58.79	RNA polymerase II C-terminal domain phosphatase-like 4 OS=Arabidopsis thaliana O...
Q8LL04	4.4e-60	38.20	RNA polymerase II C-terminal domain phosphatase-like 3 OS=Arabidopsis thaliana O...
F4JCB2	3.7e-43	37.71	RNA polymerase II C-terminal domain phosphatase-like 5 OS=Arabidopsis thaliana O...
Q95QG8	1.5e-31	27.21	RNA polymerase II subunit A C-terminal domain phosphatase OS=Caenorhabditis eleg...
Q8SV03	4.8e-22	32.21	RNA polymerase II subunit A C-terminal domain phosphatase OS=Encephalitozoon cun...

Match Name	E-value	Identity	Description
A0A6J1EFC1	1.9e-231	88.58	RNA polymerase II C-terminal domain phosphatase-like OS=Cucurbita moschata OX=36...
A0A6J1BV42	1.2e-230	87.82	RNA polymerase II C-terminal domain phosphatase-like OS=Momordica charantia OX=3...
A0A6J1ID30	1.4e-229	87.53	RNA polymerase II C-terminal domain phosphatase-like OS=Cucurbita maxima OX=3661...
A0A6J1GC38	8.8e-229	87.53	RNA polymerase II C-terminal domain phosphatase-like OS=Cucurbita moschata OX=36...
A0A6J1CJQ5	4.3e-228	86.76	RNA polymerase II C-terminal domain phosphatase-like OS=Momordica charantia OX=3...

Match Name	E-value	Identity	Description
AT5G58003.1	1.1e-146	58.79	C-terminal domain phosphatase-like 4
AT2G33540.1	3.1e-61	38.20	C-terminal domain phosphatase-like 3
AT3G17550.1	6.3e-46	38.78	Haloacid dehalogenase-like hydrolase (HAD) superfamily protein
AT2G04930.1	5.4e-45	39.85	Haloacid dehalogenase-like hydrolase (HAD) superfamily protein
AT5G54210.1	1.2e-44	40.16	Haloacid dehalogenase-like hydrolase (HAD) superfamily protein
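
The Identity values in these summary tables are the percentage form of the match counts given in each HSP header (for example, 112/297 for F4JCB2). The following is a minimal Python sketch of that arithmetic, assuming the header format shown on this page. The function name `identity_percent` and the regular expression are illustrative assumptions, not part of any BLAST tool.

```python
import re

# Pattern assumed from the HSP header lines on this page, e.g.
# "Identity = 112/297 (37.71%)". This is not an official BLAST parser.
IDENTITY_RE = re.compile(r"Identity = (\d+)/(\d+) \((\d+\.\d+)%\)")


def identity_percent(header_line):
    """Return (recomputed, reported) percent identity for one HSP header line."""
    m = IDENTITY_RE.search(header_line)
    if m is None:
        raise ValueError("no Identity field found in: %r" % header_line)
    matches = int(m.group(1))      # identical positions in the alignment
    length = int(m.group(2))       # alignment length, including gaps
    reported = float(m.group(3))   # percentage as printed in the header
    return round(100.0 * matches / length, 2), reported


# Identity fields copied from three HSP headers on this page.
hsp_headers = {
    "F4JCB2": "Identity = 112/297 (37.71%)",
    "Q95QG8": "Identity = 111/408 (27.21%)",
    "AT5G58003.1": "Identity = 281/478 (58.79%)",
}

for name, line in hsp_headers.items():
    recomputed, reported = identity_percent(line)
    print(name, recomputed, reported)
```

The recomputed value should agree with the reported one to two decimal places. A mismatch would suggest that a header line was damaged during extraction.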
InterPro
Analysis Name: InterPro Annotations of Watermelon (97103) v2.5
Date Performed: 2022-01-31
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
None	No IPR available	COILS	Coil	Coil	coord: 248..268
None	No IPR available	PANTHER	PTHR23081:SF28	BNAC03G12630D PROTEIN	coord: 563..673; coord: 207..536
None	No IPR available	CDD	cd17729	BRCT_CTDP1	coord: 572..667; e-value: 1.38984E-35; score: 127.649
None	No IPR available	CDD	cd07521	HAD_FCP1-like	coord: 342..486; e-value: 4.75963E-34; score: 124.63
IPR004274	FCP1 homology domain	SMART	SM00577	forpap2	coord: 341..496; e-value: 1.5E-53; score: 193.9
IPR004274	FCP1 homology domain	PFAM	PF03031	NIF	coord: 344..491; e-value: 5.1E-26; score: 91.3
IPR004274	FCP1 homology domain	PROSITE	PS50969	FCP1	coord: 338..509; score: 30.000246
IPR001357	BRCT domain	SMART	SM00292	BRCT_7	coord: 582..662; e-value: 1.4E-6; score: 37.9
IPR001357	BRCT domain	PFAM	PF12738	PTCB-BRCT	coord: 603..654; e-value: 2.6E-8; score: 33.7
IPR001357	BRCT domain	PROSITE	PS50172	BRCT	coord: 580..672; score: 13.39524
IPR023214	HAD superfamily	GENE3D	3.40.50.1000		coord: 325..540; e-value: 3.6E-51; score: 175.8
IPR011947	FCP1-like phosphatase, phosphatase domain	TIGRFAM	TIGR02250	TIGR02250	coord: 337..492; e-value: 3.8E-53; score: 177.8
IPR036420	BRCT domain superfamily	GENE3D	3.40.50.10190	BRCT domain	coord: 584..672; e-value: 1.7E-23; score: 84.7
IPR036420	BRCT domain superfamily	SUPERFAMILY	52113	BRCT domain	coord: 584..672
IPR039189	CTD phosphatase Fcp1	PANTHER	PTHR23081	RNA POLYMERASE II CTD PHOSPHATASE	coord: 563..673
IPR039189	CTD phosphatase Fcp1	PANTHER	PTHR23081	RNA POLYMERASE II CTD PHOSPHATASE	coord: 207..536
IPR036412	HAD-like superfamily	SUPERFAMILY	56784	HAD-like	coord: 333..496

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
Cla97C08G145310.2	Cla97C08G145310.2	mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
biological_process	GO:0070940	dephosphorylation of RNA polymerase II C-terminal domain
cellular_component	GO:0005634	nucleus
molecular_function	GO:0008420	RNA polymerase II CTD heptapeptide repeat phosphatase activity
molecular_function	GO:0004721	phosphoprotein phosphatase activity