Cp4.1LG02g10800 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
NameCp4.1LG02g10800
Typegene
OrganismCucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
DescriptionRNA polymerase II C-terminal domain phosphatase-like
LocationCp4.1LG02: 9224146 .. 9237939 (+)
RNA-Seq ExpressionCp4.1LG02g10800
SyntenyCp4.1LG02g10800
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
ACAAGGAGAATTACACGTTCAATCGCTTGAGTTAATGTACGGTCTATCATGCATATGATCATATATGCTCAACAAACCATCCTTCAAGCACTGAGTAGTAGAAAAATAATCTAGAAGTAAATCAATTATGCCAATAACAAGAGTGATTATTCTAGCCTTCTTCTCCCACGACGTAGGCTATCGGAAAACAGCGAGCTGAGCGACAGTTTCAGTAGTGGCAGTGCGTTCTTGGGCGACTGGGCGACCGGGTGACAAACGACCCACGAGTTCTGCGAGCCACGCACAGCAACAAATCAGACCTTAAGCGGCAGTGCACTCGCGGCGGCGGCGCATGACCTTGAAGCGAATTCGGTAGCGCCGGTTTCGAGCGACGGCGAGGGTTTCCCCGGACTGTTCTAGCTCCTCCCTGGCGACTTGCGTTTTCCACTGCGGTGGCGATCTACTCCACGTTCAGCGGAAGCCACGACGCAATTGACTCCTGAAGGTGACTACCCAAGAAGCTGCGGCAATTTTTTTGCATCAACCCAACTCACGACGCCGTGAATAGCATCGAAGACCACCCTCTAAGGACTCCAAATCGCTAGTTTTGTGACCCATGCTCCGAATAGCCTCTAATCGGATTAACCGTAGTCAACAGAAGTATGATCTGTGAAGACGTACCCATTCGAACAAAAACCAGCAATGTGCCGAGCTTTTTCGGTAAGAATTAGTGGATATGACTCATTCTAAACCCTTAAGTGCTGTAAATCAGGCTTTAACTCTTGAAACTCTTCTGTATTTAGGTTTGAACTAATCAAGTTAAGGTTTAAAGTCAACATTGGGTGAAAACTAGTTGGTAAGCCTTCTTAAATACCTTTGCATTTTCAATTAAAGATTTCGAACCAAATTCTAACTCCAAACATTTTCCTTTATTGTTTCAGGAATTGATAGAACAACGAAGAAGCGCCGTAGGACGTGTTGTAAGTAATTTGAATTTGTGGAGTCCATTTGCTATATGCTTTGGTATCCTTGTATGTTTTGTATGGTGCATGCTTTGTTTGATATAAGTGATTGAATTGAATGCTGTATTGAATGGATTGTGTTTTGATGTGTCATGTTGAGTTACGCTCTTGTGATCTTAAACATGGTGTGATGTTGAAACTAAGCATGATGATTGTGTTTTAAAGCGTGAGCAGCAGGTCTAAAAAGCATGTATAGGATGTGCATATATGGGCAATTGTCAAGCTGGTAAGTGAAAAGGGTCAGAAGACCTTGCGTTTGTGATAAAGCAGAGTATGGAGAATATCATGACCTCAAGTATAAATGATTGATGGTTGGTATTGTTTTAAGGAAAGTAAGTGAATATTAAGGCCTTGGTGTTAATGATTGAGGGATGGTGTCACTACTGGGTGTTGAGGCCTCAAGTATAAATTGTCGACGGTTGATATGATGAGTCTTGAGGCAAGTATTTCTGGTCTTGGTATAAATAGTGAAAGTCCACTTGTCATGGAGACTTGAGTACAAATTATCAAGGGTCGATGCACAATTTGAGGCCTTGGGTAAAAATGATTAGGGAAGAGTGGTTCTAATATGAATTTTATGCAAGAATGTGGTTGATTCTTCTATGAATTTTATGTGGGGGGTGGTAATTGATTCTTCCTACGGATGAGACGTGTGTCTTGTTCCGAAAAGGAGACAGTCTCTATAGTCAAGGACTTCCGACCTAGTAGTCTTGTCATGAGTATCTATAAGAACATTGCCATTGTAGCTAACTGTTTCAGGAAAATCCTTCCTAGTACTATCCCTATTTCTCAAGGTGCTTTTGTAGCTGGGGCGCAAATCCTAGACCAAGTGCTCATAGCCAATGACGCAATTGAGGACCATAGAGAAGGGAAAAGAATGATTTTCAAAATTGATTTTGAGAAGGCATTTGATCATGTTGATTGAGAGTTCTTGGACACGACCCTTGGTAAGAAAGAATGACTTTGGCTTCAAAGGGAGAACTTAGATGTAGAGTTGTTCAAGAAGATTAATTTCTCCATCCTTCTTAATGGAAAACCTAGGGACTAGGGCTACTAGGTGACTTAGGCAAGGAGATTCCCTTTCTCCTTTCTCGTCATTCTAGCTGTGGATGTTCTCAAACAGGTTGGTGTCAACTAGTATGGAAAAGGGTTGCGTTGATCGTTTTCAGGTAGGGAGGGAGTCAGTTTTCCTTTTTCGTCTTCAATTTGCTAATGATTCCCTCTTGTTTTGTTTGGGGAAGGAACGCTCTTTTGTGAACTTGAATAGGTTGTTTTCCTTTGAAGTGATTTTGGACCTCAAGATTAATAGATGTAAGAGATGATCTTGGGTAATTTTTTATTTATTACAAGAACCAAAGAGGGAATATATTCCACCAAAAGAACAAGAACAACCTAAGGGTCGGGGGTAGAGAGACTCCCAAAATCCTATACAAAGAGAGCCTTCCAATAATTTATAATCATTGAAATGTAATTACAAATAAGTTTGCATGTAAAGAAATCCACCAGGAAGCTGTATTTTGTACCAAATTACAAAAGAAATCAAGACATAAAGGACTTATCTTCAAAAGCTCAAGCATTCCTTTCCTTCCACATTGCTGCAAAAGAGCTTTAAAGGCTCAACTTCCAACTACCTTGGTTTTACTTTTCAAATTCCAAGCAACTATAAAGCTCCAAGCCCAAGAAGCGTACTCACCGTGAACCAACCCACAAATGTTGAAAATCTCCAGCCTAATACAATACCTCTTGTAGCCAAAAACTTTCCTCTCGGTATACCATTAATCAAAATATAAGAGTTTGTGGTCTTCAGACTCCCTCTTATCCACTTCCTCCAAAGCCATCCAAAACCTTTCAACTCCAAAGTCACAAGGAATCTACCGTCCACCTTATCATACGCTTTCTCTAGATCAAGCTTCAGGTGGAAACCACTTTTCTTGGGCACATTCTAGGCTTCAACAATCTCGGAAGCCACTAAGATAGCATCAAGATTTGCCTTTCTTGAATAAGGGGCTACAATACTGATTGTGGAAAGCAACACTTTTTTAGCAAGCTTCTCAACCATAATCTTATACGAGGCGGAAATCAAACTAATTGGCCAAAAATAATTGTCTCTGAGAGTTTCTTCTTCTTGGGGATTAAGCAAACATAGGTTTCATTTGTGCACCTATTGATGACATGAGTATGAAAAAAATTCTTGGGTAGTATCGAAATGTTCCAAAACTTTTCATAAAACTCTCCAATCATCCCATGAGGGCCAGGGGATTTGAGGCTGCCCAAGTCTTTGACAGCTTTGAAAATTTATCCTCTTCAAAAGGTCTTTCTAGCCTTGAGCTTCTATGATCATCTAGCAGAGCCCAAGTATGCTGTCCATCATAAACCGAAGACCATTATCTGCTGTATATAACTTCTAGTTGAAGTCAATAAGCTCATCTTCTATTTCTCTATCCATAGCTAGAAGTCTTCCAAAGTCACTTTCGAGGGCAGAGGTAAAGGCTCTATTCCGCCAAGATGAAACCCATTTGTGATAAAAGGGAGTGTTTTCATGCCCCTCATTTAACCATCTAATTTTGCATCTTTTGGTTCTGGATATGTTCCTCATTTAAATTCGGCAGGGACCGCTTTCAGTCTAACTCTTATCATTTTTTTTTTTTTGCTTTGGGGACATCCTTCCTTGTTCCTCCTTATTATCAATTTGAGCAATCTAAGCAATGAAGTTCTATTTTTCATATCACATTCCCTAAGGTTTCTTTGTTCCATGCCACTAAACAAGACCTCAACCCTTTAAGCTTCTCCATAAACTTATAACCACCCAACTATTCAGATTTTGTTCAGCTTACCAACTCTCCACCTTTTCCTTGAAATTTGGAGCTTCAAGCCAAATATTCTAAAATTGTGGGGCCCATTTAAACGAGCACAAACTCAGAACAAGAGGGAAATGATCATACGTTGGCCTCTTGGAATAAGGCATTCTTTCCTATGGGGGTGACTTACTCTGATTCAATCTGTCATGAGTGGCATCCTTATCTACTTCCTTTCTGTTTTTAAGGTCCCTATTGCAGTGTGCAATAAGTTGGAGAAGACGATGAGGGGCTTTTTATGGGAGGACGTGGAGGAAGGAGGTGGGTTTCATTTGGGTAGGTGGGAGGTGGTGTCAAGGTCAGGGAGCTAGGGGGTTTAGGCATTGGTAAGCTGAGGTGTTGTAGTGGGTCTTTGTTAGCCAAATGGTTATGGTGCATTTTATGTTAAGCAATTCCTTCATCTTTTGATTTTGAATATGTTCTCTTTTCTTTCAGCTTGAAGATGACAGAGGATGAATTTCTGGTCTATTAAATTTCAAGTTAGCGCATGTGAAGAGTCTTCCCAACTCAAGAAAAAATCCAAAAAGGAAAAGAGTCTAAAATCTTCTTCCAAATTCATGTCCTTGTAACATAATATAAGTTTTGTGAGAATGCAAAATGTTTAATTTAACCCCGAATAAAACACTCGTTAGTAACTGTATTAATGCATGCCAGCAATAATAACAGTTCATGCGTGAAAGAGAAGACTGGATGGAGTGGTATGTTTCAGTTTCGTTTGAGGTTATAAGTTTATGGTAGAAATTTATGTAGAGTTTTATATATTTTATGATTTTTGAACCCCGAATGTTCTAGGCTTCGGTGGTTGCTTTGTGAAATTAGTCGAGGTATGCACAAATTTATTGGAAATGATTAAAACTAATAAGTACCGTAACCTACTAACTAAAGTCAGCTTATTTTGGGAGATGAAGATGATGTTATCGTAGAATTCTTCTGTCTTTGTCCATCTTCTTTACACATGAAGGAGCTATTTATTTTGTATTTTTCTTTCCACTTATAAATGATACATAAAACTACATGCATAATGTATTTTGTTTGGGTTGTTTCCTTTCTGAATTTTCCCTAATGTTTTCTATTCTTGGATTTGCATATTTATGCTACAATCTCCTAAGGGTTTTTTTTTTCTTGGTGGATATTTCAGATGAGCCTTGTGACTAATTCTCCGGCTCATTCATCGAGCAGTGACGATTTTGCTGCATTTCTTGATGTAGCTCTGGATTCTCATTCCTCTGACTCATCGCCGAATGAAAAGGCTGAGGGTCATAATAATGTTGAAACTGAGAGGTACTTCTAGTTTTTTTGAAATTAATAAATTTCCATTGATCTTTGAAGTTAATAATATCTGAAGACACTGGTTTAGTGTTTTTTTGGAATTATAGTTTTTGGTTAATCTAATGTGTTCAATACTTTAATGTTGAGTTCAAATGGGAAGTATTTATAGGTTGTCTGGTTCTCTTTTAAATCCTCATATGGTGAAAAATAGGAAAATAGGAGTTATTATGTTTTAAGAGTAATGGATTTCTGTTTTTAATTTCTGGACGGTCTTTCATAAGGATTGGTTTAACATTACAGATTTCCCATCTGGCTTTCTTGTGCTAGTCTATAACAATTTAAAGATTTTAAGCTTCGAATTTCAAACTTCGTTTTTTTAAGTAGAAATTTTAAGAAACCAAGTTACAAGAAGAAGAAGAATGTATTCTTTTCCTTTTGCATGAATACCAACATTGAGTTGTGGCAAGTGGCGACTATCATTTCTATACATAATAACGAGGGTTTGTAGGATTTGTTATAAGTTAGAACCATATATGATATAACTAAATTGCGGTAATTTATCTTCATTTTATCAAAAGTTAACATGTGATTTTTTCTTTCATTCAATTCATTTATTTTTTTAGGATAAAACGTCACAAGGTGGAGAAGCTGGAAAACTCAGGGGAGGATATTCTGTATGGAGTTGAAGAGCATAGTTCAGGTAAATTGAACTGTTCCTACTCCTTTTGTTGTGCCTCTTTTACTGTCTAACTCTTTACTTTCAATGTAGCTAATTTGTGCTGCCACCTGTAGTATACTTAACCACTTCCTCACTTCTTAGACAACTATGTATATGATGTACTTGTTGAATCAGTCCATTTTCCCCATCCTCTCTCTATATTTGTAGAGCTTTTCCCTTTGGACTCTTCCGTCGCATATCGTGTTTTTGGTGGTTGCCTTCTCTTCCCTTCTTGTATTTGAATCTGATTGGAAGTTTGTCACCTTTCTTTATCCCTCTTTTATTTGAATTGTATGAGCTTCTCTAAAAACCATATCATGTTGTCTCCAAATGATCTATATATCTATAAATACCAACTTATTAGTATTTTCGGGTAAGGAAGTATTCCATATAAAATATAAAATTGAAAGCGAGGGGAAAGAAACCTACATCCCTGTAACTAGAAGCGTAATCACATAAAAAAGTACTCAGCCCCAGTCTGGGCGCCACTAATCAGAAAAATCTAACTTGGAGGTGCCTGAGTTGGTCTATTTACTCTTTATCATGTACGGAATTTTGATAAAGTGATCAGCATCCGCTTTCATTCGATAAACCCCTCATTCCCAGGAGTTCAGCATATTGCCAACCACATTCTTAGAATACATAGATTCAGAGGCACACTGTCTTTTCTAGACACGGAAAAAGTTGACTTTGTTACCTTTAATAAAAATCCAATTTCATCAGACATCAAACTTCACTACACATTAAAGAGAAAATGGGACCTATTCAGGTGCCTCCAAGTTAGATTTTTCTGATTTTAGGAGTGGTGCCTGACTGGATTGTAAGAAAATGGGCCTGAGTACTTCCTCTGGCTTTATCTTAACATAGGATGGGAGTTATTCTTCCATATGGTAGCGGTGAATGATCTTTGGTCTAAATGAAAATAGTCATTTTAGATTCTTGGTTATTCGAGAACTAGGAATAACAGGGATGCTAAATGGTATGCATCAATTTCGCAGTATCTGAATTCATGAACTATTTCAGCCCATCAGTCCTCTGCTCACCATATATCCCTCGTTTTGGCTCCTTTCTTCCTCTACTATCGTGTCCCATTCAATTCCATGTAATTATAACTTATACTTGAGATATAAAAGCCTTCATGAGTATTTGTTCCCTTGTATCTCTGCATTGTTTTCTTTTCTCTCTATTTTCTGAATTCTGCTGTTTTGAAAACAACAATCAATGCAGAAGTATTATCAAAGCAGCAATTATGCAGTCATCCTGGTTCGTTTGGAAACATGTGTATCATCTGTGGGCAGAGGTTGGATGAGGAATCTGGCGTGACATTTGGGTATATACATAAGGTATAATTTCTTTATTTGGATGCATAATTATCTTCTATTCTTTGTTACCATGTATAGAATGGCTGGAAGGAACTCAGAAGGCTAGTTTACAAAATGGGATCCATTTAGAAGAGGAATGAGAAATGAGAAATGATGTCAAGCTTTTAATTTTATTTGTGGTTTTGCATTATGAACAGGAAAAAAGAGCTGTACTGATAAATATATAACTCAGCCATCTTCTCCAGGGCAAATTTTCCTTTTGGTTTTGCATTATGAACAGAAAAAAAAAGGGCTGTGCTGATAATACAAAACTCATCCATCTTCTCCATCATTGAAAAAAACCCTTTCTTAGAGAACACTATTCAAAATCTACAATCCATTGCATTGTAGGCTTTATCAAGGTCTATTTTGAAAATCCACCCTTGCTTACTTTTTGTTATCTCCTCCTCTACTGCCTCATTTGCTATAGCCTATTGGGATTGCATCAAGTTTTTGTCTACCTTTCATGAAGGGCTTGTTGATTTCCTTAACTTGTGCTATATAGCGTTTGTTTTGGAACCATGGCAATGATTTTATGTATTGCAGTCACTAAGCTGGTGGGTCATGGATGTATGTTCCATTGAGGGAAATATAGAACATGATGAGGGACCAAAAATCACTGGAGCTTAAGGATATCTTTTATGATTGTCCAATATTTTTATCTTGTTCGAGTTCATTGATTTTTACAACTCTTGAGAATCTGTCTGGTTACGCAGATTTACTATTCCTTTTTTGTTCAGCTATGGTTCTGGCATACATGTTTGTATGTTAGACATTTCCCTCCGTGGGACATAAATCAAGGAAATAATTTGTTTCATTTCCACCTTTCAACCATCCAACCTTATATTTGTGACTCCGCTTTCTAGTCTGTCTGAGACATTCAGATACTACATAAGTTTTGTATTTTCCTTTACACGATTTCCAGTTAATGGAACCCTCAATCCTGTTTGATCAACTTTCTGCTGGAAAATAGTGAATTGATCCCGTGACCCTTTGGCTTTGTGGGGATATCTCATTCCCCTTCGTTTGTATATGCCTCTTTGATCAATATATTGTCAGTTTCTTTTTTAAAAGAAGCAAGCGGTGAATTTTCCTATGTAGTTTGAGATGTTCTGTTGATATTATTTTAACTTCAGTTACATTCATTCACTGTTGAAAAAATTGAAACCTATGGAGTTTAATTTACTATGACTGGAGAAGTAATAATAACTTGTAATTGCCAGTGGATCAAACTATGTGCTAACTATATGACCCTTCTGAGAATCTTGTAGGGACTCAGGCTTAATAATGATGAAATTAACCGGCTTCGTAACATAGACATGAAGAACTTGTTGCAGCATAAAAAGCTTATCCTGGTTCTTGATCTAGATCACACACTGTTAAATTCAACTCAGCTGGGGCATTTGACACCTGAAGAGGATTATTTAAGGAATCAAACGGATTCGCTAGAAGGTACACTGTCCTCTCATCCGTGCATAGTTTTATTTTCTAAAGTAACTTCATATTGTTTTTACTTTTGTTTCTTTTCCATAGTGATTTTGCTGTCACCCCTGCTCTTTTCTTTTGTTTGTTTTAATAAGAAACATCTCAGTATATTCCCCAAGTCCCGAGCTTTTGTACTTTCTTTCTTAAATTATTAGCACCTTGAGCAGTTACAGTTGTAGGTTAGCAAACAGTCTTCTTTTATTTAAAATTAAAACTTTTAAGTTCGGCATGGACAAATTTGGAGCACATAGTTTAATAATAATTTTTTGTTTGTTTTGTTTTTTGTTGTTGTTCTTGTAAGAAATGCAGATACATATAAATACCAAAGAGAAGAAGTTACACCCAAAAGGAAGGGGTAGAGGTAACCTCCCCAAAAGAGAAGAAATGCCCTCTAATCTGTCATGATCAAAAGGTGGCTATAATTACACAATAATTTGGTATGGTCTGAACGCCACCAAGAAGCTAAATGATGTACATTATCGCAAAAGAACGAAAAGAAACAGACTTATCTTCAAATATTCATCGGTTTCATTCTTTCCAAATAAGTAAGCCAAACGAGAGTTGTAACTGCACAATTTCGAAGGACCTTGCCTTTGAGGTTCCAACTTGTCTGTCGTCCACTCATCCGCCTTACTAGGGAGGCAAATATCCAACCACATGCTATTTAAAACAAACACCGCCCTTTATATGCAAAAGGACAATTAAGAAAGAGGTGGTTCAAAATCTCCTCCCTATAGCATAGACAACAAACCGCACTAGCTGAAGAGTCTTTCTCATGTCACATAGATTTCAAAGAAGATGCTGAAAGTGTACTGTGTTTGTGTCCAAGTTTAAAACAAAATAAAGTTGCATTAAATGGACTCGGCTTCATTTATGTCAAACCGAAGTTAAACTTAAGTTATATTAAATGAAGTAAAGTCGTGTCAAAGTTGAAGTCAAGTTGCATTGGGAAAACTCTACGTGTCCATCAAAGTTCATTTAGTGCAAGTTAAACATTAATAATGAAACCATGGTTCAAACTACTGTAAGTAGAACTTTATTTTGTGACCATCCATGTCTAGGATTTGAAATTTAGATTTGGCATCTGATGGCCCCGACATTCCCCTACATCCTCTGCGACGTAGTCATATGGCCTATCCCCACAGAACAACACGAGTTCTAGCATGTTTTGTTCTCAGGTAGTGACTTCAATTATTTTAAGTCTTTCTTAGTTGTCCTATCCTTAGAATCGCTCTCATTCGGATGTGGTCTTGGTTCATTCATGTACCCGTCCTTTACTCAGGATCGCTCTCATTCGGAATCTCATAATTGCCCGGGGAAATTACAACCACCTTTCTAACTTCCTATTCAAATTTAAATACAAATAAAACGGTTTAACTGAATGTTCTAATCACCAAGGAAATGAATTCCCTTGTTTAACTGAATGTTCTAATCACCAAGGTAATTGGAGGTCTAACAAATTCAAACTTGGGTCAGTTTATTGCATTCTTGTGCTACTCATTTCTCTGTTTAATTAAAAACTAAAATTGATGCAAGGGGATCGAGTGATGTTTGGCCCCTTGCAAGTTCTATGATTCTCTTTGGACTTCATAATTACTGTGTCATATTTTACTTCATTGAATTCGCTTTTGTGGGCTTGTTTCTTTTATGCCCTTGTATTCTCTCCTTTTTTTTTTTCCTCAATAAAATTTGGTTTTTCATGAATAATATTAATAATAATAATAACAACAATAACAATAATAATAACAATAACTGATTGAATCAATCAGTATAAGCACTCTTAGGCATGACTCATGAGGCTATCTCTGCATCTTTCTACTCTGACTTAGAGAAGTTTTTGAGAGATCTGTTGAGAGTTGGGTTTCAGTGGCCTATTTTAAGATGAGGAGACTGACATAATTTTCATAATTTTGTAATGTGCATCTCAAGCCTTCTTACTTTCAATTAACTATATTTTCCCTTGGTCCCTATTACCATGAAAGCAAACAGCATAGTTTTGTTATTTATTGATTTAATTTTTTAAATCATTAATTGTCATAACTTTGAAATAGTGCTTATTTTATTTAATCTTGATTTCTTATATAATGTCTTATAATTAATTTTATAAAGTTGGCTTAATCGTCTAACATTCCTACAAATTTCTGATGTTGGTTACTCCATTTATCTTTTATGCTATGTGATAAAACTTCGGATAGATGTCACGAAAGGCAGCCTTTTCCTGTTGCATTCCGTGCATACCATGACAAAATTGAGGCCATTTGTCCATACGTTTCTGAAAGAAGCTAGTCAATTATTTGAGATGTATATATACACTATGGGGGAACGAGCATATGCATATGAAATGGCAAAGTTGTTGGACCCCAAGAGGGAGTATTTTAGTTCTAAAGTTATTTCTCGGGATGATGGCACTCAAAAACATCAAAAGGGTCTTGATGTGGTGCTGGGTCATGAAAGTGCTGTTCTGATCCTCGATGATACTGAAAATGTAAGTGTATCTAGAATTAGCTATATTTTTATACGGTTGATCTGTTGGTCCTGCCTATTTATCATCATTGAACGAGTATGTGACCGTGGGATTCCCTTCATACATTGAAAGGGCCCGATAAATTTAACATCCTAAAAAAAATACGAAAATACTCTACTTAAGGCAACATTTCGGCTCATTTTTATTGACCTGTTCGTTTATACTTATTTTAGTTAGATGTTGTGTGTAATCTCAAGTTCTCTTTCTTCTTTATGTTTTTGTCCCCTTTATTTTTTTCTTGAGGAATCTCCTTCCATTACTCCTGTGGATTTTCTTAGTGTAATGTAGTTCCATTAAACAATAGGATAGAATCTTGTAATGGGATTGCTTGTAAATGCTTACTTAGTGATATTGTCTTAGTTCCTCAAGCAGAAATCTCACAAGCCTGGTTTTCTGTTCTTTGTGATTCTGTCTTAGATATTGATGTTGAATTGGGGTACTTTTTTCGGAGCTTGGAACATTGATTTCGTCATCTTTGGGTTGCTGAACTACCTACTCATAAATATGGAGCTAAACATTGTCAATTGTTATGATAGGCCTGGACAAAGCATAAAGAAAACTTGATATTGATGGAGAGATATCATTTTTTTGCTTCAAGTTGTCACCAATTTGGGTTCAACTGTAAATCTCTATCTGAGTTGAAGAGTGACGAGAGTGAAACTGATGGGGCACTGGCGACCATTCTCAAAGTTCTCAAGCAAGTTCATAATATATTCTTTAATGTATTCCCTTTCCCTCTATTCTTACATTTTTGTTTTCGATCTTTTTTGCCTCAATCTCACTGCTAGGATTTCTTCTGCAGGAAATCTCTGAAGATTTGGTAGACAGAGATGTGAGGCAGGTAAAAGTTTGTTTCCTCTAAAAGTTTAGAGATCTGTTGTGCTTCTTGTGATGGCTGGAATTTCTCGTTCATCCCTACATTTCTATCTTTTACTCGATCAAATTGAAATATTCAATAAGCTTATGACATTTTTATTATTAGTTCACAAATAAACATGGTTGAAGTGTCAAGTTTTAGGTCTAATAGATCTTTAATGTGAGATCCCACGTCAATTGGAGAGGAGAACGAAGCATCCTTTATAAAGGTGTGGAAACCTCTCCCTAGCAGATGCGTTTTAAAACCTTGAGGGGAAACCCGAAGGGAAAAGCCCAAAGAGGACAATATCTGCTAGCGGTGGGCTTGAGCCGTTACAAATGGTATCAAAGCCAGACACCGGGCTATGTGCTAACGAGGAGGCTGAGCCCTAACGGGGTGTGGACACAAGGCTGTGTGCCAGCAAGAACGCTGGGCCCCAAAGGGGGTGGATTGGGGGATCCCACATTGATTGAAGAAGGGAACGACTGCCAACGCTGGGCCCGGAAGGGGGGTGGATTGTGATATCCCACATCGATTGGAGAGGAGAACGAAGCATCCTTTATTAGGGTGTGGAAACCCCTTCCTAGCTGACGCATTTTAAAAACCTTAAGGGAAGCCCGAAATTGGAAAGCCTAAAGCGGACAATATTTGCTAGTGGTAGGCTTAGGCCGTTGCACCTAAACTTTCAATTTTGTAACAAGTCTGTGAACTTTTAATTATGTATAGTAATTCCTAAACCATCTCCCAAGTGCTTAACCTATTTGACTATTTCAAAATTAGCACCTATTTGACTATTTCAAAATTTATTTATCTATTAGACTCAAAACTGGAAGTTTCAAGTTTTAGTAATAGTTGAATTTTATGTCTAATAGATTTGTAAATTTAGAAAAATATCAAATATATTCGTCAATTCACATGAGAAGGAAATGAGTATCGAGAATATTATTACATTTATGTCATGGACATAATATTGTCTTCAATTCAACAGGTATTAAAGACTGTTCGGAGCAAAGTCCTGGAGGGATGCAAGGTTGTCTTCAGCCGAGTGTTCCCTACCAAATTTCAGGCTGACAACCATCACCTCTGGAAAATGGTTGAGAAGTTGGGAGGCACATGCTCAACTGAACTCGACGCATCTGTGACACATATAGTCTCAACAGATGCTGGAACGGAGAAGTCACGTTGGGCGGTGAAGGAGCAGAAGTTTCTGGTTCATCCACAGTGGATAGAAGCATCAAACTACTTCTGGAAACGAGAAGCAGAAGAAAAGTTCCCGGTCGAGCACACCAAGAAACAATGACAGTTTCTCTCATTGCAGTAGTCCCATATTCCTCTTAAATGGAGCTTGCATTTGTTGGCTGTGCTGGTAGTGTTCCCTTGAATTCACCTTCTCAGGTCAGCGTCTCACTGGTTTAGATTGAAAGCCGAGCTCTGGTTTGTATAGATGTTTAGATACATGGTGCTGTAATTTTGGCCAACCATTTTGAGTTGGTTTATAATTATAGGATAAAATTTATGAGCTAAATTGTTTGTATTATGCAATTTTTTTCTGTTTTTTTATTATTAATTTAGTTTTTACGTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTNTTTTGGTCTCTTGTGGTTTTGCAACCAATGATACCTACGTATTGAAGCTAAATTAATTATTTGATCGATCTTGTTAGGCTATATTAATGAATTCAAAATGT

mRNA sequence

ACAAGGAGAATTACACGTTCAATCGCTTGAGTTAATGTACGGTCTATCATGCATATGATCATATATGCTCAACAAACCATCCTTCAAGCACTGAGTAGTAGAAAAATAATCTAGAAGTAAATCAATTATGCCAATAACAAGAGTGATTATTCTAGCCTTCTTCTCCCACGACGTAGGCTATCGGAAAACAGCGAGCTGAGCGACAGTTTCAGTAGTGGCAGTGCGTTCTTGGGCGACTGGGCGACCGGGTGACAAACGACCCACGAGTTCTGCGAGCCACGCACAGCAACAAATCAGACCTTAAGCGGCAGTGCACTCGCGGCGGCGGCGCATGACCTTGAAGCGAATTCGGTAGCGCCGGTTTCGAGCGACGGCGAGGGTTTCCCCGGACTGTTCTAGCTCCTCCCTGGCGACTTGCGTTTTCCACTGCGGTGGCGATCTACTCCACGTTCAGCGGAAGCCACGACGCAATTGACTCCTGAAGGTGACTACCCAAGAAGCTGCGGCAATTTTTTTGCATCAACCCAACTCACGACGCCGTGAATAGCATCGAAGACCACCCTCTAAGGACTCCAAATCGCTAGTTTTGTGACCCATGCTCCGAATAGCCTCTAATCGGATTAACCGTAGTCAACAGAAGTATGATCTGTGAAGACGTACCCATTCGAACAAAAACCAGCAATGTGCCGAGCTTTTTCGGTTTGAACTAATCAAGTTAAGGTTTAAAGTCAACATTGGGTGAAAACTAGTTGGAATTGATAGAACAACGAAGAAGCGCCGTAGGACGTGTTATGAGCCTTGTGACTAATTCTCCGGCTCATTCATCGAGCAGTGACGATTTTGCTGCATTTCTTGATGTAGCTCTGGATTCTCATTCCTCTGACTCATCGCCGAATGAAAAGGCTGAGGGTCATAATAATGTTGAAACTGAGAGGATAAAACGTCACAAGGTGGAGAAGCTGGAAAACTCAGGGGAGGATATTCTGTATGGAGTTGAAGAGCATAGTTCAGAAGTATTATCAAAGCAGCAATTATGCAGTCATCCTGGTTCGTTTGGAAACATGTGTATCATCTGTGGGCAGAGGTTGGATGAGGAATCTGGCGTGACATTTGGGTATATACATAAGGGACTCAGGCTTAATAATGATGAAATTAACCGGCTTCGTAACATAGACATGAAGAACTTGTTGCAGCATAAAAAGCTTATCCTGGTTCTTGATCTAGATCACACACTGTTAAATTCAACTCAGCTGGGGCATTTGACACCTGAAGAGGATTATTTAAGGAATCAAACGGATTCGCTAGAAGATGTCACGAAAGGCAGCCTTTTCCTGTTGCATTCCGTGCATACCATGACAAAATTGAGGCCATTTGTCCATACGTTTCTGAAAGAAGCTAGTCAATTATTTGAGATGTATATATACACTATGGGGGAACGAGCATATGCATATGAAATGGCAAAGTTGTTGGACCCCAAGAGGGAGTATTTTAGTTCTAAAGTTATTTCTCGGGATGATGGCACTCAAAAACATCAAAAGGGTCTTGATGTGGTGCTGGGTCATGAAAGTGCTGTTCTGATCCTCGATGATACTGAAAATGCCTGGACAAAGCATAAAGAAAACTTGATATTGATGGAGAGATATCATTTTTTTGCTTCAAGTTGTCACCAATTTGGGTTCAACTGTAAATCTCTATCTGAGTTGAAGAGTGACGAGAGTGAAACTGATGGGGCACTGGCGACCATTCTCAAAGTTCTCAAGCAAGTTCATAATATATTCTTTAATGAAATCTCTGAAGATTTGGTAGACAGAGATGTGAGGCAGGTATTAAAGACTGTTCGGAGCAAAGTCCTGGAGGGATGCAAGGTTGTCTTCAGCCGAGTGTTCCCTACCAAATTTCAGGCTGACAACCATCACCTCTGGAAAATGGTTGAGAAGTTGGGAGGCACATGCTCAACTGAACTCGACGCATCTGTGACACATATAGTCTCAACAGATGCTGGAACGGAGAAGTCACGTTGGGCGGTGAAGGAGCAGAAGTTTCTGGTTCATCCACAGTGGATAGAAGCATCAAACTACTTCTGGAAACGAGAAGCAGAAGAAAAGTTCCCGGTCGAGCACACCAAGAAACAATGACAGTTTCTCTCATTGCAGTAGTCCCATATTCCTCTTAAATGGAGCTTGCATTTGTTGGCTGTGCTGGTAGTGTTCCCTTGAATTCACCTTCTCAGGTCAGCGTCTCACTGGTTTAGATTGAAAGCCGAGCTCTGGTTTGTATAGATGTTTAGATACATGGTGCTGTAATTTTGGCCAACCATTTTGAGTTGGTTTATAATTATAGGATAAAATTTATGAGCTAAATTGTTTGTATTATGCAATTTTTTTCTGTTTTTTTATTATTAATTTAGTTTTTACGTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTNTTTTGGTCTCTTGTGGTTTTGCAACCAATGATACCTACGTATTGAAGCTAAATTAATTATTTGATCGATCTTGTTAGGCTATATTAATGAATTCAAAATGT

Coding sequence (CDS)

ATGAGCCTTGTGACTAATTCTCCGGCTCATTCATCGAGCAGTGACGATTTTGCTGCATTTCTTGATGTAGCTCTGGATTCTCATTCCTCTGACTCATCGCCGAATGAAAAGGCTGAGGGTCATAATAATGTTGAAACTGAGAGGATAAAACGTCACAAGGTGGAGAAGCTGGAAAACTCAGGGGAGGATATTCTGTATGGAGTTGAAGAGCATAGTTCAGAAGTATTATCAAAGCAGCAATTATGCAGTCATCCTGGTTCGTTTGGAAACATGTGTATCATCTGTGGGCAGAGGTTGGATGAGGAATCTGGCGTGACATTTGGGTATATACATAAGGGACTCAGGCTTAATAATGATGAAATTAACCGGCTTCGTAACATAGACATGAAGAACTTGTTGCAGCATAAAAAGCTTATCCTGGTTCTTGATCTAGATCACACACTGTTAAATTCAACTCAGCTGGGGCATTTGACACCTGAAGAGGATTATTTAAGGAATCAAACGGATTCGCTAGAAGATGTCACGAAAGGCAGCCTTTTCCTGTTGCATTCCGTGCATACCATGACAAAATTGAGGCCATTTGTCCATACGTTTCTGAAAGAAGCTAGTCAATTATTTGAGATGTATATATACACTATGGGGGAACGAGCATATGCATATGAAATGGCAAAGTTGTTGGACCCCAAGAGGGAGTATTTTAGTTCTAAAGTTATTTCTCGGGATGATGGCACTCAAAAACATCAAAAGGGTCTTGATGTGGTGCTGGGTCATGAAAGTGCTGTTCTGATCCTCGATGATACTGAAAATGCCTGGACAAAGCATAAAGAAAACTTGATATTGATGGAGAGATATCATTTTTTTGCTTCAAGTTGTCACCAATTTGGGTTCAACTGTAAATCTCTATCTGAGTTGAAGAGTGACGAGAGTGAAACTGATGGGGCACTGGCGACCATTCTCAAAGTTCTCAAGCAAGTTCATAATATATTCTTTAATGAAATCTCTGAAGATTTGGTAGACAGAGATGTGAGGCAGGTATTAAAGACTGTTCGGAGCAAAGTCCTGGAGGGATGCAAGGTTGTCTTCAGCCGAGTGTTCCCTACCAAATTTCAGGCTGACAACCATCACCTCTGGAAAATGGTTGAGAAGTTGGGAGGCACATGCTCAACTGAACTCGACGCATCTGTGACACATATAGTCTCAACAGATGCTGGAACGGAGAAGTCACGTTGGGCGGTGAAGGAGCAGAAGTTTCTGGTTCATCCACAGTGGATAGAAGCATCAAACTACTTCTGGAAACGAGAAGCAGAAGAAAAGTTCCCGGTCGAGCACACCAAGAAACAATGA

Protein sequence

MSLVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPNEKAEGHNNVETERIKRHKVEKLENSGEDILYGVEEHSSEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHKGLRLNNDEINRLRNIDMKNLLQHKKLILVLDLDHTLLNSTQLGHLTPEEDYLRNQTDSLEDVTKGSLFLLHSVHTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYAYEMAKLLDPKREYFSSKVISRDDGTQKHQKGLDVVLGHESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGFNCKSLSELKSDESETDGALATILKVLKQVHNIFFNEISEDLVDRDVRQVLKTVRSKVLEGCKVVFSRVFPTKFQADNHHLWKMVEKLGGTCSTELDASVTHIVSTDAGTEKSRWAVKEQKFLVHPQWIEASNYFWKREAEEKFPVEHTKKQ
Homology
BLAST of Cp4.1LG02g10800 vs. ExPASy Swiss-Prot
Match: Q00IB6 (RNA polymerase II C-terminal domain phosphatase-like 4 OS=Arabidopsis thaliana OX=3702 GN=CPL4 PE=1 SV=1)

HSP 1 Score: 534.3 bits (1375), Expect = 1.4e-150
Identity = 279/452 (61.73%), Postives = 347/452 (76.77%), Query Frame = 0

Query: 1   MSLVTNSPAH-SSSSDDFAAFLDVALDSHSSDSS-PNEKAEGHNNVETERIKRHKVEKLE 60
           MS+ ++SP H SSSSDD AAFLD  LDS S  SS P+E+ E  ++VE+  +KR K+E LE
Sbjct: 1   MSVASDSPVHSSSSSDDLAAFLDAELDSASDASSGPSEEEEAEDDVES-GLKRQKLEHLE 60

Query: 61  NSGEDILYGVEEHSSEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHKGLRLNN 120
                          E  S +  C HPGSFGNMC +CGQ+L EE+GV+F YIHK +RLN 
Sbjct: 61  ---------------EASSSKGECEHPGSFGNMCFVCGQKL-EETGVSFRYIHKEMRLNE 120

Query: 121 DEINRLRNIDMKNLLQHKKLILVLDLDHTLLNSTQLGHLTPEEDYLRNQTDSLED---VT 180
           DEI+RLR+ D + L + +KL LVLDLDHTLLN+T L  L PEE+YL++ T SL+D   V+
Sbjct: 121 DEISRLRDSDSRFLQRQRKLYLVLDLDHTLLNTTILRDLKPEEEYLKSHTHSLQDGCNVS 180

Query: 181 KGSLFLLHSVHTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYAYEMAKLLDPKREYFSS 240
            GSLFLL  +  MTKLRPFVH+FLKEAS++F MYIYTMG+R YA +MAKLLDPK EYF  
Sbjct: 181 GGSLFLLEFMQMMTKLRPFVHSFLKEASEMFVMYIYTMGDRNYARQMAKLLDPKGEYFGD 240

Query: 241 KVISRDDGTQKHQKGLDVVLGHESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFG 300
           +VISRDDGT +H+K LDVVLG ESAVLILDDTENAW KHK+NLI++ERYHFF+SSC QF 
Sbjct: 241 RVISRDDGTVRHEKSLDVVLGQESAVLILDDTENAWPKHKDNLIVIERYHFFSSSCRQFD 300

Query: 301 FNCKSLSELKSDESETDGALATILKVLKQVHNIFFNEISEDLVDRDVRQVLKTVRSKVLE 360
              KSLSELKSDESE DGALAT+LKVLKQ H +FF  + E + +RDVR +LK VR ++L+
Sbjct: 301 HRYKSLSELKSDESEPDGALATVLKVLKQAHALFFENVDEGISNRDVRLMLKQVRKEILK 360

Query: 361 GCKVVFSRVFPTKFQADNHHLWKMVEKLGGTCSTELDASVTHIVSTDAGTEKSRWAVKEQ 420
           GCK+VFSRVFPTK + ++H LWKM E+LG TC+TE+DASVTH+V+ D GTEK+RWAV+E+
Sbjct: 361 GCKIVFSRVFPTKAKPEDHPLWKMAEELGATCATEVDASVTHVVAMDVGTEKARWAVREK 420

Query: 421 KFLVHPQWIEASNYFWKREAEEKFPVEHTKKQ 448
           K++VH  WI+A+NY W ++ EE F +E  KKQ
Sbjct: 421 KYVVHRGWIDAANYLWMKQPEENFGLEQLKKQ 435

BLAST of Cp4.1LG02g10800 vs. ExPASy Swiss-Prot
Match: Q8LL04 (RNA polymerase II C-terminal domain phosphatase-like 3 OS=Arabidopsis thaliana OX=3702 GN=CPL3 PE=1 SV=2)

HSP 1 Score: 243.0 bits (619), Expect = 6.2e-63
Identity = 135/330 (40.91%), Postives = 204/330 (61.82%), Query Frame = 0

Query: 123  RLRNIDMKN-LLQHKKLILVLDLDHTLLNSTQLGHL-TPEEDYLRNQTDSLEDVTKGSLF 182
            R+R ++ +N +   +KL LVLD+DHTLLNS +   + +  E+ LR + +   +     LF
Sbjct: 912  RVRRLEEQNKMFASQKLSLVLDIDHTLLNSAKFNEVESRHEEILRKKEEQDREKPYRHLF 971

Query: 183  LLHSVHTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYAYEMAKLLDPKREYFSSKVISR 242
                +   TKLRP +  FL++AS+L+E+++YTMG + YA EMAKLLDPK   F+ +VIS+
Sbjct: 972  RFLHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYATEMAKLLDPKGVLFNGRVISK 1031

Query: 243  -DDGTQ-------KHQKGLDVVLGHESAVLILDDTENAWTKHKENLILMERYHFFASSCH 302
             DDG            K L+ V+G ES+V+I+DD+   W +HK NLI +ERY +F  S  
Sbjct: 1032 GDDGDPLDGDERVPKSKDLEGVMGMESSVVIIDDSVRVWPQHKMNLIAVERYLYFPCSRR 1091

Query: 303  QFGFNCKSLSELKSDESETDGALATILKVLKQVHNIFFNEISEDLVDRDVRQVLKTVRSK 362
            QFG    SL EL  DE   +G LA+ L V++++H  FF+  S D V  DVR +L + + K
Sbjct: 1092 QFGLLGPSLLELDRDEVPEEGTLASSLAVIEKIHQNFFSHTSLDEV--DVRNILASEQRK 1151

Query: 363  VLEGCKVVFSRVFPT-KFQADNHHLWKMVEKLGGTCSTELDASVTHIVSTDAGTEKSRWA 422
            +L GC++VFSR+ P  + +   H LW+  E+ G  C+T++D  VTH+V+   GT+K  WA
Sbjct: 1152 ILAGCRIVFSRIIPVGEAKPHLHPLWQTAEQFGAVCTTQVDEHVTHVVTNSLGTDKVNWA 1211

Query: 423  VKEQKFLVHPQWIEASNYFWKREAEEKFPV 442
            +   +F+VHP W+EAS + ++R  E  + +
Sbjct: 1212 LTRGRFVVHPGWVEASAFLYQRANENLYAI 1239

BLAST of Cp4.1LG02g10800 vs. ExPASy Swiss-Prot
Match: F4JCB2 (RNA polymerase II C-terminal domain phosphatase-like 5 OS=Arabidopsis thaliana OX=3702 GN=CPL5 PE=1 SV=2)

HSP 1 Score: 189.9 bits (481), Expect = 6.3e-47
Identity = 117/307 (38.11%), Postives = 171/307 (55.70%), Query Frame = 0

Query: 48  RIKRHKVEKLENSGEDILYGVEEHSSEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTF 107
           + KR K+E   N            SS  LS    C H      +CI C   + +  G  F
Sbjct: 306 KAKRRKIEPTIN-----------ESSSSLSSSSSCGHWYICHGICIGCKSTVKKSQGRAF 365

Query: 108 GYIHKGLRLNNDEINRLRNIDMK-NLLQHKKLILVLDLDHTLLNSTQLGHLTPEEDYLRN 167
            YI  GL+L+++ +   +    K + L  KKL LVLDLDHTLL++  +  L+  E YL  
Sbjct: 366 DYIFDGLQLSHEAVALTKCFTTKLSCLNEKKLHLVLDLDHTLLHTVMVPSLSQAEKYLIE 425

Query: 168 QTDSLEDVTKGSLFLLHSV----HTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYAYEM 227
           +  S    T+  L+ + +V      +TKLRPF+  FLKEA++ F MY+YT G R YA ++
Sbjct: 426 EAGS---ATRDDLWKIKAVGDPMEFLTKLRPFLRDFLKEANEFFTMYVYTKGSRVYAKQV 485

Query: 228 AKLLDPKREYFSSKVISRDDGTQKHQKGLDVVLGHESAVLILDDTENAWTKHKENLILME 287
            +L+DPK+ YF  +VI++ +    H K LD VL  E  V+I+DDT N W  HK NL+ + 
Sbjct: 486 LELIDPKKLYFGDRVITKTE--SPHMKTLDFVLAEERGVVIVDDTRNVWPDHKSNLVDIS 545

Query: 288 RYHFFASSCHQFGFNCKSLSELKSDESETDGALATILKVLKQVHNIFFNEISEDLVDRDV 347
           +Y +F       G +    SE K+DESE++G LA +LK+LK+VH  FF  + E+L  +DV
Sbjct: 546 KYSYFRLK----GQDSMPYSEEKTDESESEGGLANVLKLLKEVHQRFF-RVEEELESKDV 591

Query: 348 RQVLKTV 350
           R +L+ +
Sbjct: 606 RSLLQEI 591

BLAST of Cp4.1LG02g10800 vs. ExPASy Swiss-Prot
Match: Q95QG8 (RNA polymerase II subunit A C-terminal domain phosphatase OS=Caenorhabditis elegans OX=6239 GN=fcp-1 PE=1 SV=2)

HSP 1 Score: 156.0 bits (393), Expect = 1.0e-36
Identity = 110/400 (27.50%), Postives = 202/400 (50.50%), Query Frame = 0

Query: 61  GEDILYGVEEHSSEVLSKQQLCSHPGSFGNMCIICGQRLDEESG----------VTFGYI 120
           G+ +  G+  +  +V++    C+H     +MC  CG+ L E+ G               I
Sbjct: 55  GKGLKPGIVLNKGQVIATVSECTHAIVIKDMCATCGKDLREKGGRAGQRKEQSTANVSMI 114

Query: 121 H--KGLRLNNDEINRLRNIDMKNLLQHKKLILVLDLDHTLLNSTQLGHLTPEEDYLRNQT 180
           H    L +++     + + D  NL+ ++KL+L++DLD T+++++        +  +   T
Sbjct: 115 HHVPELIVSDTLAKEIGSADENNLITNRKLVLLVDLDQTIIHTS--------DKPMTVDT 174

Query: 181 DSLEDVTKGSLFLLHSVHTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYAYEMAKLLDP 240
           ++ +D+TK   + LHS    TKLRP    FL + S ++EM+I T G+R YA+ +A++LDP
Sbjct: 175 ENHKDITK---YNLHSRVYTTKLRPHTTEFLNKMSNMYEMHIVTYGQRQYAHRIAQILDP 234

Query: 241 KREYFSSKVISRDD--GTQKHQKGLDVVLG-HESAVLILDDTENAWTKHKENLILMERYH 300
               F  +++SRD+    Q     L  +    ++ V+I+DD  + W  + E LI ++ Y 
Sbjct: 235 DARLFEQRILSRDELFSAQHKTNNLKALFPCGDNLVVIIDDRSDVW-MYSEALIQIKPYR 294

Query: 301 FF--ASSCHQFGFNCKSLSELKSDESETDGALATILKVLKQVHNIFFNEI----SEDLVD 360
           FF      +    + + +     D++  D  L  I +VL  +H+ ++ +     SE+++ 
Sbjct: 295 FFKEVGDINAPKNSKEQMPVQIEDDAHEDKVLEEIERVLTNIHDKYYEKHDLRGSEEVL- 354

Query: 361 RDVRQVLKTVRSKVLEGCKVVFSRVFPTKFQADNHHLWKMVEKLGGTCSTELDASVTHIV 420
            DV++V+K  R KVL+GC +VFS + P   + +   ++++  + G     ++   VTH+V
Sbjct: 355 LDVKEVIKEERHKVLDGCVIVFSGIVPMGEKLERTDIYRLCTQFGAVIVPDVTDDVTHVV 414

Query: 421 STDAGTEKSRWAVKEQKFLVHPQWIEASNYFWKREAEEKF 440
               GT+K   A +  KF+V  QW+ A    W +  E  F
Sbjct: 415 GARYGTQKVYQANRLNKFVVTVQWVYACVEKWLKADENLF 441

BLAST of Cp4.1LG02g10800 vs. ExPASy Swiss-Prot
Match: Q9P376 (RNA polymerase II subunit A C-terminal domain phosphatase OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=fcp1 PE=1 SV=1)

HSP 1 Score: 114.8 bits (286), Expect = 2.6e-24
Identity = 120/497 (24.14%), Postives = 201/497 (40.44%), Query Frame = 0

Query: 68  VEEHSSEVLSKQQLCSHPGSFGNMCIICGQRLDEE----------SGVTFGYIHKGLRLN 127
           +E  S  V    + C+H  ++G +C ICG+ +  +          + ++  +    L ++
Sbjct: 85  IENFSKIVAKLHEPCTHEVNYGGLCAICGKNITSQDYMGYSDMARANISMTHNTGDLTVS 144

Query: 128 NDEINRLRNIDMKNLLQHKKLILVLDLDHTLLNST---QLGHLTPEEDYLRNQTDSLEDV 187
            +E +RL + ++K L Q K+L L++DLD T++++T    +G    +   +    D L DV
Sbjct: 145 LEEASRLESENVKRLRQEKRLSLIVDLDQTIIHATVDPTVGEWMSDPGNV--NYDVLRDV 204

Query: 188 TKGSLFLLHSVHT---MTKLRPFVHTFLKEASQLFEMYIYTMGERAYAYEMAKLLDPKRE 247
              +L    S +T     K RP +  FL++ S+L+E++IYTMG +AYA E+AK++DP  +
Sbjct: 205 RSFNLQEGPSGYTSCYYIKFRPGLAQFLQKISELYELHIYTMGTKAYAKEVAKIIDPTGK 264

Query: 248 YFSSKVISRDDGTQKHQKGLDVVLGHE-SAVLILDDTENAWTKHKENLILMERYHFFA-- 307
            F  +V+SRDD     QK L  +   + S V+++DD  + W     NLI +  Y FF   
Sbjct: 265 LFQDRVLSRDDSGSLAQKSLRRLFPCDTSMVVVIDDRGDVW-DWNPNLIKVVPYEFFVGI 324

Query: 308 --------------------------------------------------SSCHQ----- 367
                                                             SS  Q     
Sbjct: 325 GDINSNFLAKSTPLPEQEQLIPLEIPKDEPDSVDEINEENEETPEYDSSNSSYAQDSSTI 384

Query: 368 ---------FGFNCKSLSE------------------------LKSDESE---------- 427
                    F  N ++L E                        L  DE +          
Sbjct: 385 PEKTLLKDTFLQNREALEEQNKERVTALELQKSERPLAKQQNALLEDEGKPTPSHTLLHN 444

Query: 428 TDGALATILKVLKQVHNIFFNEISEDLVDR-------DVRQVLKTVRSKVLEGCKVVFSR 440
            D  L  + KVLK +H +++ E   D+  R       +V  ++  ++ KVL+GC+++FS 
Sbjct: 445 RDHELERLEKVLKDIHAVYYEE-ENDISSRSGNHKHANVGLIIPKMKQKVLKGCRLLFSG 504

BLAST of Cp4.1LG02g10800 vs. NCBI nr
Match: XP_023525838.1 (RNA polymerase II C-terminal domain phosphatase-like 4 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 884 bits (2284), Expect = 0.0
Identity = 447/447 (100.00%), Postives = 447/447 (100.00%), Query Frame = 0

Query: 1   MSLVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPNEKAEGHNNVETERIKRHKVEKLENS 60
           MSLVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPNEKAEGHNNVETERIKRHKVEKLENS
Sbjct: 1   MSLVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPNEKAEGHNNVETERIKRHKVEKLENS 60

Query: 61  GEDILYGVEEHSSEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHKGLRLNNDE 120
           GEDILYGVEEHSSEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHKGLRLNNDE
Sbjct: 61  GEDILYGVEEHSSEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHKGLRLNNDE 120

Query: 121 INRLRNIDMKNLLQHKKLILVLDLDHTLLNSTQLGHLTPEEDYLRNQTDSLEDVTKGSLF 180
           INRLRNIDMKNLLQHKKLILVLDLDHTLLNSTQLGHLTPEEDYLRNQTDSLEDVTKGSLF
Sbjct: 121 INRLRNIDMKNLLQHKKLILVLDLDHTLLNSTQLGHLTPEEDYLRNQTDSLEDVTKGSLF 180

Query: 181 LLHSVHTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYAYEMAKLLDPKREYFSSKVISR 240
           LLHSVHTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYAYEMAKLLDPKREYFSSKVISR
Sbjct: 181 LLHSVHTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYAYEMAKLLDPKREYFSSKVISR 240

Query: 241 DDGTQKHQKGLDVVLGHESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGFNCKS 300
           DDGTQKHQKGLDVVLGHESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGFNCKS
Sbjct: 241 DDGTQKHQKGLDVVLGHESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGFNCKS 300

Query: 301 LSELKSDESETDGALATILKVLKQVHNIFFNEISEDLVDRDVRQVLKTVRSKVLEGCKVV 360
           LSELKSDESETDGALATILKVLKQVHNIFFNEISEDLVDRDVRQVLKTVRSKVLEGCKVV
Sbjct: 301 LSELKSDESETDGALATILKVLKQVHNIFFNEISEDLVDRDVRQVLKTVRSKVLEGCKVV 360

Query: 361 FSRVFPTKFQADNHHLWKMVEKLGGTCSTELDASVTHIVSTDAGTEKSRWAVKEQKFLVH 420
           FSRVFPTKFQADNHHLWKMVEKLGGTCSTELDASVTHIVSTDAGTEKSRWAVKEQKFLVH
Sbjct: 361 FSRVFPTKFQADNHHLWKMVEKLGGTCSTELDASVTHIVSTDAGTEKSRWAVKEQKFLVH 420

Query: 421 PQWIEASNYFWKREAEEKFPVEHTKKQ 447
           PQWIEASNYFWKREAEEKFPVEHTKKQ
Sbjct: 421 PQWIEASNYFWKREAEEKFPVEHTKKQ 447

BLAST of Cp4.1LG02g10800 vs. NCBI nr
Match: XP_022949466.1 (RNA polymerase II C-terminal domain phosphatase-like 4 [Cucurbita moschata])

HSP 1 Score: 872 bits (2252), Expect = 0.0
Identity = 442/447 (98.88%), Postives = 444/447 (99.33%), Query Frame = 0

Query: 1   MSLVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPNEKAEGHNNVETERIKRHKVEKLENS 60
           MSLVTNS AHSSSSDDFAAFLDVALDSHSSDSSPNEKAEGHNNVETERIKRHKVEKLENS
Sbjct: 1   MSLVTNSLAHSSSSDDFAAFLDVALDSHSSDSSPNEKAEGHNNVETERIKRHKVEKLENS 60

Query: 61  GEDILYGVEEHSSEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHKGLRLNNDE 120
           GEDILYGVEEHSSEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHKGLRLNNDE
Sbjct: 61  GEDILYGVEEHSSEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHKGLRLNNDE 120

Query: 121 INRLRNIDMKNLLQHKKLILVLDLDHTLLNSTQLGHLTPEEDYLRNQTDSLEDVTKGSLF 180
           INRLRNIDMKNLLQHKKLILVLDLDHTLLNSTQLGHLTPEEDYLRNQTDSLEDVTKGSLF
Sbjct: 121 INRLRNIDMKNLLQHKKLILVLDLDHTLLNSTQLGHLTPEEDYLRNQTDSLEDVTKGSLF 180

Query: 181 LLHSVHTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYAYEMAKLLDPKREYFSSKVISR 240
           LLHSVHTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYAYEMAKLLDPKREYFSSKVISR
Sbjct: 181 LLHSVHTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYAYEMAKLLDPKREYFSSKVISR 240

Query: 241 DDGTQKHQKGLDVVLGHESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGFNCKS 300
           DDGTQKHQKGLDVVLGHESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGFNCKS
Sbjct: 241 DDGTQKHQKGLDVVLGHESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGFNCKS 300

Query: 301 LSELKSDESETDGALATILKVLKQVHNIFFNEISEDLVDRDVRQVLKTVRSKVLEGCKVV 360
           LSELKSDESETDGALATILKVLKQVHNIFFNE+SEDLVDRDVRQVLKTVRSKVLEGCKVV
Sbjct: 301 LSELKSDESETDGALATILKVLKQVHNIFFNELSEDLVDRDVRQVLKTVRSKVLEGCKVV 360

Query: 361 FSRVFPTKFQADNHHLWKMVEKLGGTCSTELDASVTHIVSTDAGTEKSRWAVKEQKFLVH 420
           FSRVFPTKFQADNHHLWKMVE+LGGTCSTELDASVTHIVSTDAG EKSRWAVKEQKFLVH
Sbjct: 361 FSRVFPTKFQADNHHLWKMVEQLGGTCSTELDASVTHIVSTDAGMEKSRWAVKEQKFLVH 420

Query: 421 PQWIEASNYFWKREAEEKFPVEHTKKQ 447
           PQWIEASNYFWKREAEEKF VEHTKKQ
Sbjct: 421 PQWIEASNYFWKREAEEKFLVEHTKKQ 447

BLAST of Cp4.1LG02g10800 vs. NCBI nr
Match: XP_022973448.1 (RNA polymerase II C-terminal domain phosphatase-like 4 [Cucurbita maxima])

HSP 1 Score: 869 bits (2246), Expect = 0.0
Identity = 438/447 (97.99%), Postives = 443/447 (99.11%), Query Frame = 0

Query: 1   MSLVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPNEKAEGHNNVETERIKRHKVEKLENS 60
           MSLVTNSPAHSSSSDDFAAFLDVALDSHSSDS PNEKAEGHNNVETERIKRHKVEKLENS
Sbjct: 1   MSLVTNSPAHSSSSDDFAAFLDVALDSHSSDSLPNEKAEGHNNVETERIKRHKVEKLENS 60

Query: 61  GEDILYGVEEHSSEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHKGLRLNNDE 120
           GEDILYGVEEHSSEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHKGLRLNNDE
Sbjct: 61  GEDILYGVEEHSSEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHKGLRLNNDE 120

Query: 121 INRLRNIDMKNLLQHKKLILVLDLDHTLLNSTQLGHLTPEEDYLRNQTDSLEDVTKGSLF 180
           INRLRNIDMK LLQHKKLILVLDLDHTLLNSTQLGHLTPEE+YLRNQ DSLEDVTKGSLF
Sbjct: 121 INRLRNIDMKKLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRNQMDSLEDVTKGSLF 180

Query: 181 LLHSVHTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYAYEMAKLLDPKREYFSSKVISR 240
           LLHSVHTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYAYEMAKLLDPKREYFSSKVISR
Sbjct: 181 LLHSVHTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYAYEMAKLLDPKREYFSSKVISR 240

Query: 241 DDGTQKHQKGLDVVLGHESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGFNCKS 300
           DDGTQKHQKGLDVVLGHESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGFNCKS
Sbjct: 241 DDGTQKHQKGLDVVLGHESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGFNCKS 300

Query: 301 LSELKSDESETDGALATILKVLKQVHNIFFNEISEDLVDRDVRQVLKTVRSKVLEGCKVV 360
           LSELKSDESE+DGALATILKVLKQVHNIFFNE+SEDLVDRDVRQVLKTVRSKVLEGCKVV
Sbjct: 301 LSELKSDESESDGALATILKVLKQVHNIFFNELSEDLVDRDVRQVLKTVRSKVLEGCKVV 360

Query: 361 FSRVFPTKFQADNHHLWKMVEKLGGTCSTELDASVTHIVSTDAGTEKSRWAVKEQKFLVH 420
           FSRVFPTKFQADNHHLWKMVE+LGGTCSTELD SVTHIVSTDAGTEKSRWA+KEQKFLVH
Sbjct: 361 FSRVFPTKFQADNHHLWKMVEQLGGTCSTELDPSVTHIVSTDAGTEKSRWAIKEQKFLVH 420

Query: 421 PQWIEASNYFWKREAEEKFPVEHTKKQ 447
           PQWIEASNYFWKREAEEKFPVEHTKKQ
Sbjct: 421 PQWIEASNYFWKREAEEKFPVEHTKKQ 447

BLAST of Cp4.1LG02g10800 vs. NCBI nr
Match: KAG6607512.1 (RNA polymerase II C-terminal domain phosphatase-like 4, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 870 bits (2247), Expect = 1.44e-313
Identity = 439/447 (98.21%), Postives = 443/447 (99.11%), Query Frame = 0

Query: 1   MSLVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPNEKAEGHNNVETERIKRHKVEKLENS 60
           MSLVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPNEKAEGHNNVETERIKRHKVEKLENS
Sbjct: 283 MSLVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPNEKAEGHNNVETERIKRHKVEKLENS 342

Query: 61  GEDILYGVEEHSSEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHKGLRLNNDE 120
           GEDILYGVEEHSSEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHKGLRLNNDE
Sbjct: 343 GEDILYGVEEHSSEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHKGLRLNNDE 402

Query: 121 INRLRNIDMKNLLQHKKLILVLDLDHTLLNSTQLGHLTPEEDYLRNQTDSLEDVTKGSLF 180
           INRLRNIDMKNLLQHKKLILVLDLDHTLLNSTQLGHL PEE+YLRNQ DSLEDVTKGSLF
Sbjct: 403 INRLRNIDMKNLLQHKKLILVLDLDHTLLNSTQLGHLAPEEEYLRNQMDSLEDVTKGSLF 462

Query: 181 LLHSVHTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYAYEMAKLLDPKREYFSSKVISR 240
           LLHSVHTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYAYEMAKLLDPKREYFSSKVISR
Sbjct: 463 LLHSVHTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYAYEMAKLLDPKREYFSSKVISR 522

Query: 241 DDGTQKHQKGLDVVLGHESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGFNCKS 300
           DDGTQKHQKGLDVVLGHESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGFNCKS
Sbjct: 523 DDGTQKHQKGLDVVLGHESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGFNCKS 582

Query: 301 LSELKSDESETDGALATILKVLKQVHNIFFNEISEDLVDRDVRQVLKTVRSKVLEGCKVV 360
           LSELKSDESETDGALATILKVLKQVHNIFFNE+SEDLVDRDVRQVLKTVRSKVLEGCKVV
Sbjct: 583 LSELKSDESETDGALATILKVLKQVHNIFFNELSEDLVDRDVRQVLKTVRSKVLEGCKVV 642

Query: 361 FSRVFPTKFQADNHHLWKMVEKLGGTCSTELDASVTHIVSTDAGTEKSRWAVKEQKFLVH 420
           FSRVFPTKFQADNHHLWKMVE+LGGTCSTELD SVTHIVSTDAGTEKSRWA+KEQKFLVH
Sbjct: 643 FSRVFPTKFQADNHHLWKMVEQLGGTCSTELDPSVTHIVSTDAGTEKSRWAIKEQKFLVH 702

Query: 421 PQWIEASNYFWKREAEEKFPVEHTKKQ 447
           PQWIEASNYFWKREAEEKF VEHTKKQ
Sbjct: 703 PQWIEASNYFWKREAEEKFLVEHTKKQ 729

BLAST of Cp4.1LG02g10800 vs. NCBI nr
Match: KAG7037160.1 (RNA polymerase II C-terminal domain phosphatase-like 4 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 830 bits (2144), Expect = 1.17e-301
Identity = 432/481 (89.81%), Postives = 436/481 (90.64%), Query Frame = 0

Query: 1   MSLVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPNEKAEGHNNVETERIKRHKVEKLENS 60
           MSLVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPNEKAEGHNNVETERIKRHKVEKLENS
Sbjct: 20  MSLVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPNEKAEGHNNVETERIKRHKVEKLENS 79

Query: 61  GEDILYGVEEHSSEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHKGLRLNNDE 120
           GEDILYGVEEHSSEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHKGLRLNNDE
Sbjct: 80  GEDILYGVEEHSSEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHKGLRLNNDE 139

Query: 121 INRLRNIDMKNLLQHKKLILVLDLDHTLLNSTQLGHLTPEEDYLRNQTDSLEDVTKGSLF 180
           INRLRNIDMKNLLQHKKLILVLDLDHTLLNSTQLGHL PEE+YLRNQ DSLEDVTKGSLF
Sbjct: 140 INRLRNIDMKNLLQHKKLILVLDLDHTLLNSTQLGHLAPEEEYLRNQMDSLEDVTKGSLF 199

Query: 181 LLHSVHTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYAYEMAKLLDPKREYFSSKVISR 240
           LLHSVHTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYAYEMAKLLDPKREYFSSKVISR
Sbjct: 200 LLHSVHTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYAYEMAKLLDPKREYFSSKVISR 259

Query: 241 DDGTQKHQKGLDVVLGHESAVLILDDTEN------------------------------- 300
           DDGTQKHQKGLDVVLGHESAVLILDDTEN                               
Sbjct: 260 DDGTQKHQKGLDVVLGHESAVLILDDTENILMLNWGTFYGAWNIDFVIFGLLNYLLINME 319

Query: 301 ---AWTKHKENLILMERYHFFASSCHQFGFNCKSLSELKSDESETDGALATILKVLKQVH 360
              AWTKHKENLILMERYHFFASSCHQFGFNCKSLSELKSDESETDGALATILKVLKQ  
Sbjct: 320 LNIAWTKHKENLILMERYHFFASSCHQFGFNCKSLSELKSDESETDGALATILKVLKQ-- 379

Query: 361 NIFFNEISEDLVDRDVRQVLKTVRSKVLEGCKVVFSRVFPTKFQADNHHLWKMVEKLGGT 420
                E+SEDLVDRDVRQVLKTVRSKVLEGCKVVFSRVFPTKFQADNHHLWKMVE+LGGT
Sbjct: 380 -----ELSEDLVDRDVRQVLKTVRSKVLEGCKVVFSRVFPTKFQADNHHLWKMVEQLGGT 439

Query: 421 CSTELDASVTHIVSTDAGTEKSRWAVKEQKFLVHPQWIEASNYFWKREAEEKFPVEHTKK 447
           CSTELD SVTHIVSTDAGTEKSRWA+KEQKFLVHPQWIEASNYFWKREAEEKF VEHTKK
Sbjct: 440 CSTELDPSVTHIVSTDAGTEKSRWAIKEQKFLVHPQWIEASNYFWKREAEEKFLVEHTKK 493

BLAST of Cp4.1LG02g10800 vs. ExPASy TrEMBL
Match: A0A6J1GC38 (RNA polymerase II C-terminal domain phosphatase-like OS=Cucurbita moschata OX=3662 GN=LOC111452801 PE=4 SV=1)

HSP 1 Score: 872 bits (2252), Expect = 0.0
Identity = 442/447 (98.88%), Postives = 444/447 (99.33%), Query Frame = 0

Query: 1   MSLVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPNEKAEGHNNVETERIKRHKVEKLENS 60
           MSLVTNS AHSSSSDDFAAFLDVALDSHSSDSSPNEKAEGHNNVETERIKRHKVEKLENS
Sbjct: 1   MSLVTNSLAHSSSSDDFAAFLDVALDSHSSDSSPNEKAEGHNNVETERIKRHKVEKLENS 60

Query: 61  GEDILYGVEEHSSEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHKGLRLNNDE 120
           GEDILYGVEEHSSEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHKGLRLNNDE
Sbjct: 61  GEDILYGVEEHSSEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHKGLRLNNDE 120

Query: 121 INRLRNIDMKNLLQHKKLILVLDLDHTLLNSTQLGHLTPEEDYLRNQTDSLEDVTKGSLF 180
           INRLRNIDMKNLLQHKKLILVLDLDHTLLNSTQLGHLTPEEDYLRNQTDSLEDVTKGSLF
Sbjct: 121 INRLRNIDMKNLLQHKKLILVLDLDHTLLNSTQLGHLTPEEDYLRNQTDSLEDVTKGSLF 180

Query: 181 LLHSVHTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYAYEMAKLLDPKREYFSSKVISR 240
           LLHSVHTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYAYEMAKLLDPKREYFSSKVISR
Sbjct: 181 LLHSVHTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYAYEMAKLLDPKREYFSSKVISR 240

Query: 241 DDGTQKHQKGLDVVLGHESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGFNCKS 300
           DDGTQKHQKGLDVVLGHESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGFNCKS
Sbjct: 241 DDGTQKHQKGLDVVLGHESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGFNCKS 300

Query: 301 LSELKSDESETDGALATILKVLKQVHNIFFNEISEDLVDRDVRQVLKTVRSKVLEGCKVV 360
           LSELKSDESETDGALATILKVLKQVHNIFFNE+SEDLVDRDVRQVLKTVRSKVLEGCKVV
Sbjct: 301 LSELKSDESETDGALATILKVLKQVHNIFFNELSEDLVDRDVRQVLKTVRSKVLEGCKVV 360

Query: 361 FSRVFPTKFQADNHHLWKMVEKLGGTCSTELDASVTHIVSTDAGTEKSRWAVKEQKFLVH 420
           FSRVFPTKFQADNHHLWKMVE+LGGTCSTELDASVTHIVSTDAG EKSRWAVKEQKFLVH
Sbjct: 361 FSRVFPTKFQADNHHLWKMVEQLGGTCSTELDASVTHIVSTDAGMEKSRWAVKEQKFLVH 420

Query: 421 PQWIEASNYFWKREAEEKFPVEHTKKQ 447
           PQWIEASNYFWKREAEEKF VEHTKKQ
Sbjct: 421 PQWIEASNYFWKREAEEKFLVEHTKKQ 447

BLAST of Cp4.1LG02g10800 vs. ExPASy TrEMBL
Match: A0A6J1ID30 (RNA polymerase II C-terminal domain phosphatase-like OS=Cucurbita maxima OX=3661 GN=LOC111471991 PE=4 SV=1)

HSP 1 Score: 869 bits (2246), Expect = 0.0
Identity = 438/447 (97.99%), Postives = 443/447 (99.11%), Query Frame = 0

Query: 1   MSLVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPNEKAEGHNNVETERIKRHKVEKLENS 60
           MSLVTNSPAHSSSSDDFAAFLDVALDSHSSDS PNEKAEGHNNVETERIKRHKVEKLENS
Sbjct: 1   MSLVTNSPAHSSSSDDFAAFLDVALDSHSSDSLPNEKAEGHNNVETERIKRHKVEKLENS 60

Query: 61  GEDILYGVEEHSSEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHKGLRLNNDE 120
           GEDILYGVEEHSSEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHKGLRLNNDE
Sbjct: 61  GEDILYGVEEHSSEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHKGLRLNNDE 120

Query: 121 INRLRNIDMKNLLQHKKLILVLDLDHTLLNSTQLGHLTPEEDYLRNQTDSLEDVTKGSLF 180
           INRLRNIDMK LLQHKKLILVLDLDHTLLNSTQLGHLTPEE+YLRNQ DSLEDVTKGSLF
Sbjct: 121 INRLRNIDMKKLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRNQMDSLEDVTKGSLF 180

Query: 181 LLHSVHTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYAYEMAKLLDPKREYFSSKVISR 240
           LLHSVHTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYAYEMAKLLDPKREYFSSKVISR
Sbjct: 181 LLHSVHTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYAYEMAKLLDPKREYFSSKVISR 240

Query: 241 DDGTQKHQKGLDVVLGHESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGFNCKS 300
           DDGTQKHQKGLDVVLGHESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGFNCKS
Sbjct: 241 DDGTQKHQKGLDVVLGHESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGFNCKS 300

Query: 301 LSELKSDESETDGALATILKVLKQVHNIFFNEISEDLVDRDVRQVLKTVRSKVLEGCKVV 360
           LSELKSDESE+DGALATILKVLKQVHNIFFNE+SEDLVDRDVRQVLKTVRSKVLEGCKVV
Sbjct: 301 LSELKSDESESDGALATILKVLKQVHNIFFNELSEDLVDRDVRQVLKTVRSKVLEGCKVV 360

Query: 361 FSRVFPTKFQADNHHLWKMVEKLGGTCSTELDASVTHIVSTDAGTEKSRWAVKEQKFLVH 420
           FSRVFPTKFQADNHHLWKMVE+LGGTCSTELD SVTHIVSTDAGTEKSRWA+KEQKFLVH
Sbjct: 361 FSRVFPTKFQADNHHLWKMVEQLGGTCSTELDPSVTHIVSTDAGTEKSRWAIKEQKFLVH 420

Query: 421 PQWIEASNYFWKREAEEKFPVEHTKKQ 447
           PQWIEASNYFWKREAEEKFPVEHTKKQ
Sbjct: 421 PQWIEASNYFWKREAEEKFPVEHTKKQ 447

BLAST of Cp4.1LG02g10800 vs. ExPASy TrEMBL
Match: A0A6J1BV42 (RNA polymerase II C-terminal domain phosphatase-like OS=Momordica charantia OX=3673 GN=LOC111005808 PE=4 SV=1)

HSP 1 Score: 810 bits (2092), Expect = 9.41e-295
Identity = 410/450 (91.11%), Postives = 432/450 (96.00%), Query Frame = 0

Query: 1   MSLVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPNEKAEGHNNVETERIKRHKVEKLENS 60
           MSLVTNSPAHSSSSDDFAAFLDVALDSHSSDSSP EKAEG NNVE+ER+KR KVE+LE S
Sbjct: 1   MSLVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPEEKAEGDNNVESERMKRRKVEELEGS 60

Query: 61  GE---DILYGVEEHSSEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHKGLRLN 120
            E   DI YGVEE SSEVLSKQQLCSHPGSFGNMCI+CGQRLDEESGVTFGYIHKGLRLN
Sbjct: 61  EEPQEDISYGVEEQSSEVLSKQQLCSHPGSFGNMCIMCGQRLDEESGVTFGYIHKGLRLN 120

Query: 121 NDEINRLRNIDMKNLLQHKKLILVLDLDHTLLNSTQLGHLTPEEDYLRNQTDSLEDVTKG 180
           NDEINRLRNIDMKNLLQHKKLILVLDLDHTLLNSTQLGH+TPEE+YLR+QTDSLEDVTKG
Sbjct: 121 NDEINRLRNIDMKNLLQHKKLILVLDLDHTLLNSTQLGHITPEEEYLRSQTDSLEDVTKG 180

Query: 181 SLFLLHSVHTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYAYEMAKLLDPKREYFSSKV 240
           SLFLL+SVHTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYA+EMAKLLDPKREYFS+KV
Sbjct: 181 SLFLLNSVHTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYAFEMAKLLDPKREYFSAKV 240

Query: 241 ISRDDGTQKHQKGLDVVLGHESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGFN 300
           ISRDDGTQKH+KGLDVVLG ESAVLILDDTENAWTKHKENLILMERYHFFASSC QFG+N
Sbjct: 241 ISRDDGTQKHKKGLDVVLGQESAVLILDDTENAWTKHKENLILMERYHFFASSCRQFGYN 300

Query: 301 CKSLSELKSDESETDGALATILKVLKQVHNIFFNEISEDLVDRDVRQVLKTVRSKVLEGC 360
           CKSLSELKSDESETDGALATILKVLKQVH IFFNE+ +DLVDRDVRQVLKTVRSKVLEGC
Sbjct: 301 CKSLSELKSDESETDGALATILKVLKQVHTIFFNELLDDLVDRDVRQVLKTVRSKVLEGC 360

Query: 361 KVVFSRVFPTKFQADNHHLWKMVEKLGGTCSTELDASVTHIVSTDAGTEKSRWAVKEQKF 420
           KVVF+RVFPTKF ADNHHLWKMVE+LGG+CST+LD+SVTH+VSTDAGTEKSRWAVKEQKF
Sbjct: 361 KVVFTRVFPTKFPADNHHLWKMVEQLGGSCSTDLDSSVTHVVSTDAGTEKSRWAVKEQKF 420

Query: 421 LVHPQWIEASNYFWKREAEEKFPVEHTKKQ 447
           LVHP+WIEASNYFWKR+ EE FPVE TKKQ
Sbjct: 421 LVHPRWIEASNYFWKRQVEENFPVEQTKKQ 450

BLAST of Cp4.1LG02g10800 vs. ExPASy TrEMBL
Match: A0A6J1EFC1 (RNA polymerase II C-terminal domain phosphatase-like OS=Cucurbita moschata OX=3662 GN=LOC111432775 PE=4 SV=1)

HSP 1 Score: 809 bits (2090), Expect = 1.70e-294
Identity = 409/447 (91.50%), Postives = 430/447 (96.20%), Query Frame = 0

Query: 1   MSLVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPNEKAEGHNNVETERIKRHKVEKLENS 60
           MSL TNSPAHSSSSDDFAAFLDVAL+SHSSDSSPN+ AE  NNVE+ERIKR KVEKL  S
Sbjct: 1   MSLATNSPAHSSSSDDFAAFLDVALESHSSDSSPNKNAEVGNNVESERIKRRKVEKLVCS 60

Query: 61  GEDILYGVEEHSSEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHKGLRLNNDE 120
            ED L GVEE S EVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHKGLRLNNDE
Sbjct: 61  EEDTLCGVEEQSLEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHKGLRLNNDE 120

Query: 121 INRLRNIDMKNLLQHKKLILVLDLDHTLLNSTQLGHLTPEEDYLRNQTDSLEDVTKGSLF 180
           INRLRNIDMK+LLQHKKLILVLDLDHTLLNSTQLGHLTPEE+YLR+Q DSLEDVTKGSLF
Sbjct: 121 INRLRNIDMKSLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQIDSLEDVTKGSLF 180

Query: 181 LLHSVHTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYAYEMAKLLDPKREYFSSKVISR 240
           LL+SVHTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYAYEMAKLLDPKREYF+SKVISR
Sbjct: 181 LLNSVHTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYAYEMAKLLDPKREYFNSKVISR 240

Query: 241 DDGTQKHQKGLDVVLGHESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGFNCKS 300
           DDGTQKHQKGLD+VLG ESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGFNCKS
Sbjct: 241 DDGTQKHQKGLDIVLGQESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGFNCKS 300

Query: 301 LSELKSDESETDGALATILKVLKQVHNIFFNEISEDLVDRDVRQVLKTVRSKVLEGCKVV 360
           LSELKSDESETDGALATILKVLKQVHNIFFNE+S+DLVDRDVRQVLKTVRSKVLEGCKVV
Sbjct: 301 LSELKSDESETDGALATILKVLKQVHNIFFNELSDDLVDRDVRQVLKTVRSKVLEGCKVV 360

Query: 361 FSRVFPTKFQADNHHLWKMVEKLGGTCSTELDASVTHIVSTDAGTEKSRWAVKEQKFLVH 420
           FSRVFPTKFQA+NHHLWKMVE+LGGTCSTELD+SVTH+VSTD GTEKSRWA+KE+KFLVH
Sbjct: 361 FSRVFPTKFQAENHHLWKMVEQLGGTCSTELDSSVTHVVSTDPGTEKSRWALKEEKFLVH 420

Query: 421 PQWIEASNYFWKREAEEKFPVEHTKKQ 447
           P+WIEASNYFWKR+AE+ FPVE +KKQ
Sbjct: 421 PRWIEASNYFWKRQAEDNFPVEQSKKQ 447

BLAST of Cp4.1LG02g10800 vs. ExPASy TrEMBL
Match: A0A6J1CJQ5 (RNA polymerase II C-terminal domain phosphatase-like OS=Momordica charantia OX=3673 GN=LOC111012040 PE=4 SV=1)

HSP 1 Score: 802 bits (2072), Expect = 1.05e-291
Identity = 406/450 (90.22%), Postives = 429/450 (95.33%), Query Frame = 0

Query: 1   MSLVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPNEKAEGHNNVETERIKRHKVEKLENS 60
           MSLVT+SPAHSSSSDDFAAFLDVALDSHSSDSSP EKAEG NNVE+ERIKR KVEKLE S
Sbjct: 1   MSLVTDSPAHSSSSDDFAAFLDVALDSHSSDSSPEEKAEGDNNVESERIKRRKVEKLEGS 60

Query: 61  GE---DILYGVEEHSSEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHKGLRLN 120
            E   DI+Y VEE SSEVLSKQQLC HPGSFGNMCIICGQRLD ESGVTFGYIHKGLRLN
Sbjct: 61  EEPQEDIMYRVEEQSSEVLSKQQLCGHPGSFGNMCIICGQRLDGESGVTFGYIHKGLRLN 120

Query: 121 NDEINRLRNIDMKNLLQHKKLILVLDLDHTLLNSTQLGHLTPEEDYLRNQTDSLEDVTKG 180
           NDEINRLRNIDMKNLLQHKKLILVLDLDHTLLNSTQLGH+TPEE+YLR+QTDSL+DVTKG
Sbjct: 121 NDEINRLRNIDMKNLLQHKKLILVLDLDHTLLNSTQLGHITPEEEYLRSQTDSLKDVTKG 180

Query: 181 SLFLLHSVHTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYAYEMAKLLDPKREYFSSKV 240
           SLFLL+S+HTMTKLRPF+HTFLKEASQLFEMYIYTMGERAYA EMAKLLDPKR YFS++V
Sbjct: 181 SLFLLNSIHTMTKLRPFIHTFLKEASQLFEMYIYTMGERAYAVEMAKLLDPKRAYFSARV 240

Query: 241 ISRDDGTQKHQKGLDVVLGHESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGFN 300
           ISRDDGTQKHQKGLDVVLG ESAVLILDDTENAWTKHKENLILMERYHFFASSC QFG+N
Sbjct: 241 ISRDDGTQKHQKGLDVVLGQESAVLILDDTENAWTKHKENLILMERYHFFASSCRQFGYN 300

Query: 301 CKSLSELKSDESETDGALATILKVLKQVHNIFFNEISEDLVDRDVRQVLKTVRSKVLEGC 360
           CKSLSELKSDESETDGALA+ILKVLKQVH IFFNE+S+DLVDRDVRQVLKTVRSKVLEGC
Sbjct: 301 CKSLSELKSDESETDGALASILKVLKQVHTIFFNELSDDLVDRDVRQVLKTVRSKVLEGC 360

Query: 361 KVVFSRVFPTKFQADNHHLWKMVEKLGGTCSTELDASVTHIVSTDAGTEKSRWAVKEQKF 420
           KVVF+RVFP KFQADNHHLWKMVE+LGG+CST+LD SVTH+VSTDAGTEKSRWAVKEQKF
Sbjct: 361 KVVFTRVFPAKFQADNHHLWKMVEQLGGSCSTDLDPSVTHVVSTDAGTEKSRWAVKEQKF 420

Query: 421 LVHPQWIEASNYFWKREAEEKFPVEHTKKQ 447
           LVHP+WIEASNYFWKR+AEE FPVE TKKQ
Sbjct: 421 LVHPRWIEASNYFWKRQAEENFPVEQTKKQ 450

BLAST of Cp4.1LG02g10800 vs. TAIR 10
Match: AT5G58003.1 (C-terminal domain phosphatase-like 4 )

HSP 1 Score: 534.3 bits (1375), Expect = 9.6e-152
Identity = 279/452 (61.73%), Postives = 347/452 (76.77%), Query Frame = 0

Query: 1   MSLVTNSPAH-SSSSDDFAAFLDVALDSHSSDSS-PNEKAEGHNNVETERIKRHKVEKLE 60
           MS+ ++SP H SSSSDD AAFLD  LDS S  SS P+E+ E  ++VE+  +KR K+E LE
Sbjct: 1   MSVASDSPVHSSSSSDDLAAFLDAELDSASDASSGPSEEEEAEDDVES-GLKRQKLEHLE 60

Query: 61  NSGEDILYGVEEHSSEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHKGLRLNN 120
                          E  S +  C HPGSFGNMC +CGQ+L EE+GV+F YIHK +RLN 
Sbjct: 61  ---------------EASSSKGECEHPGSFGNMCFVCGQKL-EETGVSFRYIHKEMRLNE 120

Query: 121 DEINRLRNIDMKNLLQHKKLILVLDLDHTLLNSTQLGHLTPEEDYLRNQTDSLED---VT 180
           DEI+RLR+ D + L + +KL LVLDLDHTLLN+T L  L PEE+YL++ T SL+D   V+
Sbjct: 121 DEISRLRDSDSRFLQRQRKLYLVLDLDHTLLNTTILRDLKPEEEYLKSHTHSLQDGCNVS 180

Query: 181 KGSLFLLHSVHTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYAYEMAKLLDPKREYFSS 240
            GSLFLL  +  MTKLRPFVH+FLKEAS++F MYIYTMG+R YA +MAKLLDPK EYF  
Sbjct: 181 GGSLFLLEFMQMMTKLRPFVHSFLKEASEMFVMYIYTMGDRNYARQMAKLLDPKGEYFGD 240

Query: 241 KVISRDDGTQKHQKGLDVVLGHESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFG 300
           +VISRDDGT +H+K LDVVLG ESAVLILDDTENAW KHK+NLI++ERYHFF+SSC QF 
Sbjct: 241 RVISRDDGTVRHEKSLDVVLGQESAVLILDDTENAWPKHKDNLIVIERYHFFSSSCRQFD 300

Query: 301 FNCKSLSELKSDESETDGALATILKVLKQVHNIFFNEISEDLVDRDVRQVLKTVRSKVLE 360
              KSLSELKSDESE DGALAT+LKVLKQ H +FF  + E + +RDVR +LK VR ++L+
Sbjct: 301 HRYKSLSELKSDESEPDGALATVLKVLKQAHALFFENVDEGISNRDVRLMLKQVRKEILK 360

Query: 361 GCKVVFSRVFPTKFQADNHHLWKMVEKLGGTCSTELDASVTHIVSTDAGTEKSRWAVKEQ 420
           GCK+VFSRVFPTK + ++H LWKM E+LG TC+TE+DASVTH+V+ D GTEK+RWAV+E+
Sbjct: 361 GCKIVFSRVFPTKAKPEDHPLWKMAEELGATCATEVDASVTHVVAMDVGTEKARWAVREK 420

Query: 421 KFLVHPQWIEASNYFWKREAEEKFPVEHTKKQ 448
           K++VH  WI+A+NY W ++ EE F +E  KKQ
Sbjct: 421 KYVVHRGWIDAANYLWMKQPEENFGLEQLKKQ 435

BLAST of Cp4.1LG02g10800 vs. TAIR 10
Match: AT2G33540.1 (C-terminal domain phosphatase-like 3 )

HSP 1 Score: 243.0 bits (619), Expect = 4.4e-64
Identity = 135/330 (40.91%), Postives = 204/330 (61.82%), Query Frame = 0

Query: 123  RLRNIDMKN-LLQHKKLILVLDLDHTLLNSTQLGHL-TPEEDYLRNQTDSLEDVTKGSLF 182
            R+R ++ +N +   +KL LVLD+DHTLLNS +   + +  E+ LR + +   +     LF
Sbjct: 912  RVRRLEEQNKMFASQKLSLVLDIDHTLLNSAKFNEVESRHEEILRKKEEQDREKPYRHLF 971

Query: 183  LLHSVHTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYAYEMAKLLDPKREYFSSKVISR 242
                +   TKLRP +  FL++AS+L+E+++YTMG + YA EMAKLLDPK   F+ +VIS+
Sbjct: 972  RFLHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYATEMAKLLDPKGVLFNGRVISK 1031

Query: 243  -DDGTQ-------KHQKGLDVVLGHESAVLILDDTENAWTKHKENLILMERYHFFASSCH 302
             DDG            K L+ V+G ES+V+I+DD+   W +HK NLI +ERY +F  S  
Sbjct: 1032 GDDGDPLDGDERVPKSKDLEGVMGMESSVVIIDDSVRVWPQHKMNLIAVERYLYFPCSRR 1091

Query: 303  QFGFNCKSLSELKSDESETDGALATILKVLKQVHNIFFNEISEDLVDRDVRQVLKTVRSK 362
            QFG    SL EL  DE   +G LA+ L V++++H  FF+  S D V  DVR +L + + K
Sbjct: 1092 QFGLLGPSLLELDRDEVPEEGTLASSLAVIEKIHQNFFSHTSLDEV--DVRNILASEQRK 1151

Query: 363  VLEGCKVVFSRVFPT-KFQADNHHLWKMVEKLGGTCSTELDASVTHIVSTDAGTEKSRWA 422
            +L GC++VFSR+ P  + +   H LW+  E+ G  C+T++D  VTH+V+   GT+K  WA
Sbjct: 1152 ILAGCRIVFSRIIPVGEAKPHLHPLWQTAEQFGAVCTTQVDEHVTHVVTNSLGTDKVNWA 1211

Query: 423  VKEQKFLVHPQWIEASNYFWKREAEEKFPV 442
            +   +F+VHP W+EAS + ++R  E  + +
Sbjct: 1212 LTRGRFVVHPGWVEASAFLYQRANENLYAI 1239

BLAST of Cp4.1LG02g10800 vs. TAIR 10
Match: AT3G17550.1 (Haloacid dehalogenase-like hydrolase (HAD) superfamily protein )

HSP 1 Score: 199.9 bits (507), Expect = 4.3e-51
Identity = 122/299 (40.80%), Postives = 178/299 (59.53%), Query Frame = 0

Query: 57  LENSGEDILYGVEEHSSEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHKGLRL 116
           +EN   +    + E SS + S +  C H      +CI C   +++  G  F Y+ +GL+L
Sbjct: 4   VENISMEFEPAINESSSSLSSSRSSCGHWYVRYGVCIACKSTVNKRHGRAFDYLVQGLQL 63

Query: 117 NNDEINRLRNIDMK-NLLQHKKLILVLDLDHTLLNSTQLGHLTPEEDYLRNQTDSLEDVT 176
           +++     +    +   L  KKL LVLDLDHTLL+S ++  L+  E  L  +  S    T
Sbjct: 64  SHEAAAFTKRFTTQFYCLNEKKLNLVLDLDHTLLHSIRVSLLSETEKCLIEEACS---TT 123

Query: 177 KGSLFLLHSVHTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYAYEMAKLLDPKREYFSS 236
           +  L+ L S + +TKLRPFVH FLKEA++LF MY+YTMG R YA  + KL+DPKR YF  
Sbjct: 124 REDLWKLDSDY-LTKLRPFVHEFLKEANELFTMYVYTMGTRVYAESLLKLIDPKRIYFGD 183

Query: 237 KVISRDDGTQKHQKGLDVVLGHESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFG 296
           +VI+RD+    + K LD+VL  E  V+I+DDT + WT HK NL+ +  YHFF  +  +  
Sbjct: 184 RVITRDE--SPYVKTLDLVLAEERGVVIVDDTSDVWTHHKSNLVEINEYHFFRVNGPE-- 243

Query: 297 FNCKSLSELKSDESETDGALATILKVLKQVHNIFFNEISEDLVDRDVRQVLKTVRSKVL 355
               S +E K DES+ +G LA +LK+LK+VH  FF  + E+L  +DVR +L+ +  K+L
Sbjct: 244 -ESNSYTEEKRDESKNNGGLANVLKLLKEVHYGFF-RVKEELESQDVRFLLQEIDFKLL 292

BLAST of Cp4.1LG02g10800 vs. TAIR 10
Match: AT2G04930.1 (Haloacid dehalogenase-like hydrolase (HAD) superfamily protein )

HSP 1 Score: 196.1 bits (497), Expect = 6.2e-50
Identity = 114/277 (41.16%), Postives = 168/277 (60.65%), Query Frame = 0

Query: 75  VLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHKGLRLNNDEINRLRNIDMK-NLL 134
           V +    C H   F  +CI C  ++ +     F YI KGL+L+N+ +   +++  K + L
Sbjct: 3   VTTSSSCCGHWYVFQGICIGCKSKVHKSQFRKFDYIFKGLQLSNEAVALTKSLTTKHSCL 62

Query: 135 QHKKLILVLDLDHTLLNSTQLGHLTPEEDYLRNQTDS--LEDVTKGSLFLLHSVHTMTKL 194
             KKL LVLDLDHTLL+S  + +L+  E YL  +  S   ED+ K    + H +  + KL
Sbjct: 63  NEKKLHLVLDLDHTLLHSKLVSNLSQAERYLIQEASSRTREDLWKFRP-IGHPIDRLIKL 122

Query: 195 RPFVHTFLKEASQLFEMYIYTMGERAYAYEMAKLLDPKREYFSSKVISRDDGTQKHQKGL 254
           RPFV  FLKEA+++F M++YTMG R YA  + +++DPK+ YF ++VI++D+  +   K L
Sbjct: 123 RPFVRDFLKEANEMFTMFVYTMGSRIYAKAILEMIDPKKLYFGNRVITKDESPR--MKTL 182

Query: 255 DVVLGHESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGFNCKSLSELKSDESET 314
           ++VL  E  V+I+DDT + W  HK NLI + +Y +F  S    G +  S SE K+DE E 
Sbjct: 183 NLVLAEERGVVIVDDTRDIWPHHKNNLIQIRKYKYFRRS----GLDSNSYSEKKTDEGEN 242

Query: 315 DGALATILKVLKQVHNIFF-NEISEDLVDRDVRQVLK 348
           DG LA +LK+L++VH  FF  E+ E L   DVR +LK
Sbjct: 243 DGGLANVLKLLREVHRRFFIVEVEEVLESMDVRSLLK 272

BLAST of Cp4.1LG02g10800 vs. TAIR 10
Match: AT3G19595.1 (Haloacid dehalogenase-like hydrolase (HAD) superfamily protein )

HSP 1 Score: 189.9 bits (481), Expect = 4.5e-48
Identity = 117/307 (38.11%), Postives = 171/307 (55.70%), Query Frame = 0

Query: 48  RIKRHKVEKLENSGEDILYGVEEHSSEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTF 107
           + KR K+E   N            SS  LS    C H      +CI C   + +  G  F
Sbjct: 12  KAKRRKIEPTIN-----------ESSSSLSSSSSCGHWYICHGICIGCKSTVKKSQGRAF 71

Query: 108 GYIHKGLRLNNDEINRLRNIDMK-NLLQHKKLILVLDLDHTLLNSTQLGHLTPEEDYLRN 167
            YI  GL+L+++ +   +    K + L  KKL LVLDLDHTLL++  +  L+  E YL  
Sbjct: 72  DYIFDGLQLSHEAVALTKCFTTKLSCLNEKKLHLVLDLDHTLLHTVMVPSLSQAEKYLIE 131

Query: 168 QTDSLEDVTKGSLFLLHSV----HTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYAYEM 227
           +  S    T+  L+ + +V      +TKLRPF+  FLKEA++ F MY+YT G R YA ++
Sbjct: 132 EAGS---ATRDDLWKIKAVGDPMEFLTKLRPFLRDFLKEANEFFTMYVYTKGSRVYAKQV 191

Query: 228 AKLLDPKREYFSSKVISRDDGTQKHQKGLDVVLGHESAVLILDDTENAWTKHKENLILME 287
            +L+DPK+ YF  +VI++ +    H K LD VL  E  V+I+DDT N W  HK NL+ + 
Sbjct: 192 LELIDPKKLYFGDRVITKTE--SPHMKTLDFVLAEERGVVIVDDTRNVWPDHKSNLVDIS 251

Query: 288 RYHFFASSCHQFGFNCKSLSELKSDESETDGALATILKVLKQVHNIFFNEISEDLVDRDV 347
           +Y +F       G +    SE K+DESE++G LA +LK+LK+VH  FF  + E+L  +DV
Sbjct: 252 KYSYFRLK----GQDSMPYSEEKTDESESEGGLANVLKLLKEVHQRFF-RVEEELESKDV 297

Query: 348 RQVLKTV 350
           R +L+ +
Sbjct: 312 RSLLQEI 297

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q00IB61.4e-15061.73RNA polymerase II C-terminal domain phosphatase-like 4 OS=Arabidopsis thaliana O... [more]
Q8LL046.2e-6340.91RNA polymerase II C-terminal domain phosphatase-like 3 OS=Arabidopsis thaliana O... [more]
F4JCB26.3e-4738.11RNA polymerase II C-terminal domain phosphatase-like 5 OS=Arabidopsis thaliana O... [more]
Q95QG81.0e-3627.50RNA polymerase II subunit A C-terminal domain phosphatase OS=Caenorhabditis eleg... [more]
Q9P3762.6e-2424.14RNA polymerase II subunit A C-terminal domain phosphatase OS=Schizosaccharomyces... [more]
Match NameE-valueIdentityDescription
XP_023525838.10.0100.00RNA polymerase II C-terminal domain phosphatase-like 4 [Cucurbita pepo subsp. pe... [more]
XP_022949466.10.098.88RNA polymerase II C-terminal domain phosphatase-like 4 [Cucurbita moschata][more]
XP_022973448.10.097.99RNA polymerase II C-terminal domain phosphatase-like 4 [Cucurbita maxima][more]
KAG6607512.11.44e-31398.21RNA polymerase II C-terminal domain phosphatase-like 4, partial [Cucurbita argyr... [more]
KAG7037160.11.17e-30189.81RNA polymerase II C-terminal domain phosphatase-like 4 [Cucurbita argyrosperma s... [more]
Match NameE-valueIdentityDescription
A0A6J1GC380.098.88RNA polymerase II C-terminal domain phosphatase-like OS=Cucurbita moschata OX=36... [more]
A0A6J1ID300.097.99RNA polymerase II C-terminal domain phosphatase-like OS=Cucurbita maxima OX=3661... [more]
A0A6J1BV429.41e-29591.11RNA polymerase II C-terminal domain phosphatase-like OS=Momordica charantia OX=3... [more]
A0A6J1EFC11.70e-29491.50RNA polymerase II C-terminal domain phosphatase-like OS=Cucurbita moschata OX=36... [more]
A0A6J1CJQ51.05e-29190.22RNA polymerase II C-terminal domain phosphatase-like OS=Momordica charantia OX=3... [more]
Match NameE-valueIdentityDescription
AT5G58003.19.6e-15261.73C-terminal domain phosphatase-like 4 [more]
AT2G33540.14.4e-6440.91C-terminal domain phosphatase-like 3 [more]
AT3G17550.14.3e-5140.80Haloacid dehalogenase-like hydrolase (HAD) superfamily protein [more]
AT2G04930.16.2e-5041.16Haloacid dehalogenase-like hydrolase (HAD) superfamily protein [more]
AT3G19595.14.5e-4838.11Haloacid dehalogenase-like hydrolase (HAD) superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR001357BRCT domainSMARTSM00292BRCT_7coord: 351..431
e-value: 6.9E-7
score: 38.9
IPR001357BRCT domainPFAMPF00533BRCTcoord: 350..426
e-value: 3.4E-8
score: 33.8
IPR001357BRCT domainPROSITEPS50172BRCTcoord: 349..441
score: 12.83884
IPR004274FCP1 homology domainSMARTSM00577forpap2coord: 136..291
e-value: 9.9E-55
score: 197.8
IPR004274FCP1 homology domainPFAMPF03031NIFcoord: 139..286
e-value: 4.0E-27
score: 94.9
IPR004274FCP1 homology domainPROSITEPS50969FCP1coord: 133..304
score: 30.657206
IPR023214HAD superfamilyGENE3D3.40.50.1000coord: 120..352
e-value: 7.7E-54
score: 184.5
IPR011947FCP1-like phosphatase, phosphatase domainTIGRFAMTIGR02250TIGR02250coord: 132..287
e-value: 7.7E-54
score: 180.0
IPR036420BRCT domain superfamilyGENE3D3.40.50.10190BRCT domaincoord: 353..441
e-value: 7.9E-23
score: 82.6
IPR036420BRCT domain superfamilySUPERFAMILY52113BRCT domaincoord: 353..441
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 32..49
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 27..49
NoneNo IPR availablePANTHERPTHR23081:SF28BNAC03G12630D PROTEINcoord: 2..442
NoneNo IPR availableCDDcd17729BRCT_CTDP1coord: 341..436
e-value: 1.72062E-34
score: 122.256
NoneNo IPR availableCDDcd07521HAD_FCP1-likecoord: 137..281
e-value: 6.32227E-36
score: 127.326
IPR039189CTD phosphatase Fcp1PANTHERPTHR23081RNA POLYMERASE II CTD PHOSPHATASEcoord: 2..442
IPR036412HAD-like superfamilySUPERFAMILY56784HAD-likecoord: 128..291

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG02g10800.1Cp4.1LG02g10800.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0070940 dephosphorylation of RNA polymerase II C-terminal domain
cellular_component GO:0005634 nucleus
molecular_function GO:0008420 RNA polymerase II CTD heptapeptide repeat phosphatase activity
molecular_function GO:0004721 phosphoprotein phosphatase activity