CSPI01G13400 (gene) Cucumber (PI 183967) v1

Overview
Name: CSPI01G13400
Type: gene
Organism: Cucumis sativus var. hardwickii cv. PI 183967 (Cucumber (PI 183967) v1)
Description: polyadenylation and cleavage factor homolog 4 isoform X1
Location: Chr1: 8867507 .. 8875625 (-)
RNA-Seq Expression: CSPI01G13400
Synteny: CSPI01G13400
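
As a quick sanity check on the Location field, the genomic span of the feature can be computed from its coordinates. The sketch below assumes the coordinates are 1-based and inclusive (the GFF3 convention), which this page does not state. `parse_location` and `span_length` are hypothetical helpers written for illustration, not part of any database API.

```python
import re

def parse_location(text):
    """Parse a string like 'Chr1: 8867507 .. 8875625 (-)' into its parts."""
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand

def span_length(start, end):
    # Assumption: 1-based, inclusive coordinates, so both endpoints count.
    return end - start + 1

chrom, start, end, strand = parse_location("Chr1: 8867507 .. 8875625 (-)")
print(chrom, strand, span_length(start, end))
```

Under that assumption the feature spans 8119 bp on the minus strand of Chr1. With 0-based, half-open coordinates the length would instead be `end - start`.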
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

GAAGAAAAGAGTTTGATTGAGTGACCCAAACCGAATGACTCTAATCTAATCGGAAATCGAAAATCGAAAATCGAAAATCGAAAAATACAGGATTTAGGGCTGTTCGAAATAAGGGGTGCGGCTTTCTATCTTCTTCGGGGTTTTCACAACGTGAATCCGAGCCGATCTGTGAACTGCGCGAGCGAGCGAACGTATTTTGGGCCTTTTGCAATTAGAGGCGGCGGCTGGTTTGGCGTCCGTCGACGGTGCCTGCAACGGAGCCAGCGCTGAGCAGCGGCGGTGTGGGCACGAGTCTGGGAGTGCTCTGCGAATGCTTGACTGGTAGTTCTGCGTGAAGACGGTGACAAAGTGCAGTGGACGGTAGGTAATTATCTGATCTGAGTTGTATGTCTGAATGATTGATTGACTGGGCTAGGTTGTCCGCCGATGAGAAAACTGAAAAACATGCCGGTGGAGAAGGGCTGTAAATTTCTTTTCTTTTAGGTCTCTGATGCCATGTGAATGTGATAATTTTCTGTTGAACAATCTTTGAATAGGTGAATAGGAGACTGAACTGCTAATTAGGAGAATTTCAAGCTCGTCCATATTTACGTTAGAGCTAATATTGGCAAGGAATTGATAGTGCGCACTGGCGAATAATTAAACTAAGCCACACCGTATCTCTCTGTAAAGCGCTCTCTCTTTGTTTTTCTCTCTCTAGCTTCTTCTATGGTGGGTAGGAACCCTAATTCAGTTCTCTTCCCTCCCTGGCGGGGTAAACTTCTGAACTCAATTACACTTTGCATTTCATGACCCGTTTCATGGAATCGGAAAAGCTCTTAATTTCACGAGGAAACCCTAGAAATTCTGTTTACCCATCTGACCGCCCAATCCCCACCACCAGCGGCAGGACTATGCCCAATGAGTTGCCACAAAAGCCTGCCCCTTCCATAGCTCACCGGTTTAGAGCTCAGTTAAAGCAGCGGGATGATGAATTTAGGGTTTCTGGCCATGATGTTGTGCCCCCTCCTACCGCTGAGGATATTGTGCAGTTGTATGACCTCATGTTGTCGGAGCTCACTTTTAATTCGAAGCCCATCATTACGGATCTCACTGTTCTTGCTGATGAGCAGAGAGAACATGGGAAGGGCATTGCCGACTTAATTTGTGCGCGGATTCTCGAGGTTTGTTTCTGTAACTATATTTTCCTCTAATTTGAGTTGAGAATACTCTTTCAAATGTTTCAGTGCGGTTACCTGTATATCACGTATTCCTTTTTTTTATTTGATGGGGATTTGAGCTATAAAAAAGGCTTTTATTACTGATTTGATCATTATATTTCCCGAGACCTTGCGGGGTTCCAGTTTACCACTTCAAAACTTCGTGCAGGTTCCAGTTGACCAAAAACTTCCTTCATTATATCTATTAGACAGCATTGTTAAGAATGTTGGGCACGAGTACATTAGTTATTTCGCGTCTCGTTTACCTGAGGTATGTAAATTTTGGTTTCTGGTACACACTCTTGTTTCTTATGTTTTCAATAATCATACCGGTATTAGTTTGAATTTCACGAAGTTTTCATCAATTCTTTGTTCTGTTTTTTGGTTCTAAGGTGTTTTGCGAGGCTTACAGGCAAGTTCATCCTAATTTGCACAATGCAATGCGCCACCTCTTTGGGACTTGGGCAACTGTGTTTCCACCATCCATCATTCGGAAGATTGAAGCTCAGCTTTCTCAGTTAACAGCCCAAGAGTCATCAGGTTTGACATCCTCAAGGGCTTCTGAATCTCCTCGGCCAACTCATGGTATTCATGTCAATCCAAAATACTTGCGACAGCTGGAACACTCAGTGGTGGATAAAGTGAGAATCTATCTCTTTTGCTACTTAGAATACAATATTGATTGAGTTTGCTTCTGTCTGTTTGAATATTTTTCATCCAGCATCCCTTTTCCCAAAAAAAATCCATCCTGATTCTTTTGTTGGCTTTTTTTCATTCCCCTCTCCCCCAACACTCATTTTT
TAATGCAACCAACTCCAGTCAATGAATAACTTTTATTTTTAATTATTGTTGTTTTTTAATGAATAAATTTGATTAGTCCATCCAAGCCATCTATTATTTATGTTGGTGCCCGGGGTTGCATTTGGCCTAGATTTGTACAGAAGGTGAAAAAGACGATATATGAGGATTTTTATCTGTACATTGGTTGTGCTTTTTCAGAGAGAGATTGAAGATCATGCTAGCCTTTGCTCAAAATAAGAACCTTTAGAGTTTAGCTCCTTTACGCATTACTTGTAAATTACAACTATGGAGACGTACATTCAAATTAAGTGCATAGCACATAAGGATGTAGAAAGAAAAGCAAAACCAATGTGAATATGTGATGTTGAGAGAACAAAGGAAAAATGTCCTTTTCTAGAGACGATGTAAGCATAGTCAATTTAGAAAAATTATCGTTGATGGGTATTAATACCTCTACTAGTGAAGATGTAGCTATGGGCAGAATTGACATTCAAGGTGTGGTTCTCTCCAGACTAGCTTGTTCAGATCTCAATCAATCTCATTGTGCAGTTGCCTGATCCACTATATTTGTTTCTTCTTTATTTGAAGATTTAGCTGAATAGGAAACTTTAGAGTTCCAATGACAGAAATTTTAGCCCTCCGTTTGGAGCCTAGAAATTTCAAGAAATGACTTCAAATTCTATAATGAGTGTTTTAAAAAGCCCTCCTAGGTGCTGGGCAGAAGTGTAGCTTCTTGCCTTGCACAAATAATGTAAGGATTAGGCATGTGCCGTCTGTGAACCCCTGAGCTGTTGGAATTTTCATTATTTTAAATTTAAAAAATTATAGTTATTAAGGTTTCATAAAATTTAGTAAGCCTAAATATATAATTCCTTGTGTTTAGTGCACCTCACAAATAAAAAGTTCATGCCTTTTCCCCTCTTACCTTAGTGCTTAACTCTAGAAGACTGTCGAGCTTTATTGTGCCTTCAGCTTTAGAAAACATTGCTTGATATGAACCGACCAAGCATCAAATTATCAATTAGATTTTATTTAATAATTGTGATGTAATTTTAATATTTTTCAGTTTTATCCTTTTTAATAACATTTAAATAAGGTTAATTTAGTCTTTTTATGATGGATTAGGGTTTTCGTTGCAATGGCTATTTAAAGTCTTGCTATTAGGTTGTAAAAAGAGTTTTCTTGCATATTGAATTTGGAATGAAATTTTTCTCTATGTTGATAACGGAGATTCTATCACCAACCTTTGCAATTCTTGATCTAGGGATGCATGCTTGAGCATTCAGGTAAGTCTGATCATCTTCTTGTAGAGTGTTCAAATCTTAAGTCTTATCTCTTGGTCTTCGATCAAAAGGTAATCCAACTCTAGTCATTCCCTTTGGATTTTTGTCAAATTGACCTCAGGGTTTTAGAATTTAATATTAGGGTTTCATCAACTGAGTTCCTAATTTTCACTTCCTAGGATTTTCATATCACTGCTTGTGTTTTGAAATTTGAGAAACGATTTTAAATTGTTCATTTTGTTTGGAATGATAAATATTTAAAATGAATGTATTTGAAATTGTGTTTGGATTATATATAGTTCATGAAAAAGTAGATTTAAAATCATTCTTTTTCTTGCTGTAATTGCATTCATTTGAGATCCCCCAAAATCATGAGGTTTCAAACAATTATAAAAAAATTATTAGTCTATTATCTTCATTTTTCATAATTTGGACTTTTTTTATAACTATTTTGTTGGTGGAAGGATACTAGTTGCTTTGCTCTTGCTATGATATATATTTCCAAAATAACAGGACTGATCACAAGTTCCAAAATGAAGATAATTAGAAGAAAATGAGGATCACAAAAAGAATAAAAGATAGCCCCAGCCAAATCAAGAGGGCTGGTACCTCCCAAAAGATTATCACAGCCCTTCTCCCAGTTATTTCCAGATAATAACCCTACACATGGTCCCAACAATATAACGACATTCTTTCTCTCACCCAACACTGCTAGAC
GGCTGGAGCACTGTTTGTCTGGGGTGAAATACCTTTCTTTCCCCTTCTAGAATAAACTTTTGTAACTGGAGGCCCGACTGTTCATTTCGTATTTAAGTGATTGTAGTTAATAGGAGAACCCTTTCATCATTCCTCTGCGCTTGTTTTGGGTGTTTGTCTTTTCATATTTTGTATACAATATAAATTTGTTTTATGTCAGAAAATAGGGGAAGATAGAACCATGTTCCTTAGATTTCGGGCGATTTTGTGCTGCCGATTGTGTTCAAATATGAGATTCGGATTCTTGTTCGATACAATGTCGTGTACATTGTATATATATTTAAATGTGCATGTGAAAGACATACACAACGCACACACACCTCTCAATGTGATTCCCCGACATTAATTTAATGCAATTAGGACTGAATTTTGTTTTATTTTTTTAAATCTCATGCATACATGGGGTAAAGAAATAACGTACAAATGTGTAACTAGCTTTTAGCTTTGCAGCATAGCCAAGATTCAAGAGGGACCTCAGCTATAAAAGTTCATGATAAAAAGCTTGCTTCTGGATATGAAGAGTATGACTACGATCATGCGGATGCTCTTGAACATGGTGGACCTCAAGGATTTCATTCAATGGGAAGCATGGGCCATGATTCTTTTTCCCTTGGAACAAATAAAGCAAATATAAAGCTAGCAAAATCATCTCTGTCTTCAAGAATTGGACCCCATAGACCTCTACAATCAGTTGGTGATGAACATGAAACGGTCAGAGCCTCACCCTCGCAGAATGTCTATGATTATGAAGGTTCGAAGATGATTGATAGAAATGAGGATACCAATAAATGGAGGAGAAAGCAATATCCTGACGATAATCTGAATGGACTTGAAAGTACTTCTTCATATAATATTAGAAATGGACATGCACTTGAGGGACCAAGAGCTTTAATCGAAGCATATGGAAGTGATAAAGGAAAAGGTTATTTAAATGACAATCCACCCCAAGCTGAACATTTTTCTATCAATGTTATAGACAACAAGGCAACTCCGGTAACATGGCAGAACACTGAAGAAGAAGAGTTTGATTGGGAAGATATGAGCCCCACATTAGCTGATAGAGGCCGAAATAATGATATGTTGAAACCACCTGTCCCACCTTCAAGATTTAGGACAAGATCAGGATTTGAAAGATCAAATGCTATGCCTATAGAGCCAGGAATGAGAAGCAATTGGTCTAGTCCGGTTCAGCTACCGGGTATTGATTCCTCCATAGTTATTGAAGATGTGGCCCATTCAACACCTGTAGGTTTCCTGAACTTATTTGCTCTGTCATCTCTTTATTGTCATTGTTATCATTTGCTTCTGGCATTTTATTATAGTTTTGGATCAAAGTAATTAGTATATTTTGTGTTTTCAATCCCATGATTTCACATATTACCTGACATTGGATTCTTATCATACACTAGTAGAATAAACCTATTAGGTTCCCAGAGCTTGTTAAAGCTTTGTTATTGGTGATATACCACCTTTTCTTCCACTAGTCCTTTCTCTTTTTCTTATTGGCATCACTTATTGAGAGGCGTGTAAGACCTTATGTGTTCAGATTCTATAGCATGCATTTAGTACTTATTTTCTGTCGAGTGTGTAGTAGGATCTCTGCATTCCACTAGGTAATTATAGTATGGTTTTTTCTTTTTTCCGATTTTGCAGGATAATTGGAATATGCACAATCACATTTCTCAGACATCTCAGAACCTCATGAACAATAAAGGACAGGGAAGAAATTTCCAGATGCCTATGTTGGGGAGAGGCATAACTTCCTCTGTTGGTGAGAAGATGTCTCCTTATGGTGACAAGCTTTTGACCAATGATGCTTTACATAGGCCCACTAACATTGCTTCGAGATTGGGTTCTTCTGGTCTCGACTCTAGCATGGAGTCGCAATCAATTGTACAATCTATGGGCCCAAGGCATCCTCTGAATCTTTCTAACTCTTGCCCACCCTCTAGACCTCCAAT
TTTTCCTGTACCAAGACACAATGCAAGTCAGTTTGAGTCTTTAAATGGTAGCAATTCTTTCATGAATTGTGCAAATAGGACTTTTTTGCCTGAGCAGCAGATGAATAACTTGAGAAATAAGGAGCTAAGTCTTACAACTAAGTCGCCACAAGTTGGCAACCAACATACTGGGCATATTCCTTTAACTCGGGGAAACCAATTGCAGGGCATGCCTTTAAAACCGCAATTTCTACCATCTCAGGACATGCAAGATAATTTTAGTGGATCAGCAGTACCTCCAGTGTTACCACATTTAATGGCACCATCTTTGAGTCAAGGATACATTTCACAAGGACATCGCCCTGCTATTAGTGAGGGTTTGTCAAGTTCTGCCCCTATTGGACAATGGAATTTGTCTGTTCATAATAGCTCCAGTAACCCTTTGCATTTACAAGGAGGGCCGCTGCCACCTCTTCCACCTGGGCCTCATCCTACTTCTGGTCCGACTATACCTATCTCTCAAAAGGTTCCTGGACAGCAACCGGGAACTGCAATTTCTGGGCTAATAAGTTCTCTCATGGCCCGGGGTTTAATCTCATTAAACAATCAAGCTTCTGTACAGGTATATACATCTGGGTAGTACCCCTCTTACTAACTTTAGTTTGGGCATTTAATTTTTTTCTACTGTTATATTTTATCCACTAAGAGAGTTAAATGTTAGGATTCTGTTGGGTTAGAATTCAATCCAGATGTACTCAAGGTGCGACATGAATCTGCAATCACTGCTCTATATGCTGATCTTCCTCGACAATGCATGACCTGTGGTCTTCGATTCAAGACCCAGGAAGAGCATAGTAATCATATGGATTGGCATGTCACTAAAAATCGTATGTCGAAAAGTAGGAAGCAAAAGCCTTCTCGCAAATGGTTTGTAAGTATTAGCATGTGGCTTAGCGGTGCAGAGGCTTTAGGAACGGAGGCAGTTCCAGGATTTTTGCCTGCTGAGGTTGTTGTAGAGAAAAAAGATGATGAAGAACTGGCTGTTCCTGCTGACGAGGATCAGAAGACATGTGCATTATGTGGAGAACCTTTCGAGGATTTTTACAGTGATGAAACAGAGGAGTGGATGTATCGGGGTGCTGTCTACATGAATGCACCTGATGGACAAACAGCCGGCATGGATATTTCTCAGTTAGGGCCCATAGTGCATGCTAAATGCAGGACCGAAACTAATGTGGTTCCCTCTGAAAGTTTTGACCAAGATGAAGGGGTATGCTTAATTTTTTTTGTCTCTCCCCTAGACCCATGTCATGTATACATTCCTTGTATTTCGCCCTCTAATTTGGTCGATTGTTGATGTATACCAGGTGGTCTGTTAGCTGCAACATTCTTGTGTTTATTTAGACTCACACTGATAAATTTTTTACTGCAGGGAGTTAGTGAAGAGGGTAATCGAAGAAAACGATTGCGGAGCTAGCCTAGATGGATTTCTTTACTCTGGTTGTAGCGTATACAAGTTTTTTTCACATGATCTCTTGTTCTCACAGTAGTGTTAGCTAGAATTTACTTGGATTATGCTTGAAGCATTGTATATTTAAAAATGGAAATTTTCATGCGCAATATATTGCAATCTAAAAGAGGCTAACCCTCAATCTCATCTTCTTTGCTGTGGAGTTTCAGTTTTACATCTAAAATCCTTTATCTTTAAGGGTTGGTATGTACGTTGGAATGAAATGTTTTAGTTTGCAAACGTCTTGTAGGCTTTCGTATAGATGAATAATGAGTCGGAATCGTAGTCGTAATTGTAGCATTGTAGCTGCTTGTTACATCCATGCTGTGTTTGACTTAACATCTGCAGGGGATTGAATGGAGCGTGTAGTTGCTGCTTATCAGATCTTGCTCGATGTTTATCCCATTCAAACTGAAGCCCCAAGTTACACGATGGACCATCCACAGCATCTTCTTTGTTTTTGTTTTTGTTTTTTATAATGAATTTGAGTTCTCTTTGTATTAGTACCGTT
TTTGTTGTGTCCTTTGTTTTTCTGGGTTCAAATTGCTATGAAGTTGGATCGAGCTTGTACTCTGACAATTCGTGAACAATACAGTATGTTGAATGTTCAGATTCGCTAAATTTGCAAGG

mRNA sequence

GAAGAAAAGAGTTTGATTGAGTGACCCAAACCGAATGACTCTAATCTAATCGGAAATCGAAAATCGAAAATCGAAAATCGAAAAATACAGGATTTAGGGCTGTTCGAAATAAGGGGTGCGGCTTTCTATCTTCTTCGGGGTTTTCACAACGTGAATCCGAGCCGATCTGTGAACTGCGCGAGCGAGCGAACGTATTTTGGGCCTTTTGCAATTAGAGGCGGCGGCTGGTTTGGCGTCCGTCGACGGTGCCTGCAACGGAGCCAGCGCTGAGCAGCGGCGGTGTGGGCACGAGTCTGGGAGTGCTCTGCGAATGCTTGACTGGTAGTTCTGCGTGAAGACGGTGACAAAGTGCAGTGGACGGTAGGTGAATAGGAGACTGAACTGCTAATTAGGAGAATTTCAAGCTCGTCCATATTTACGTTAGAGCTAATATTGGCAAGGAATTGATAGTGCGCACTGGCGAATAATTAAACTAAGCCACACCGTATCTCTCTGTAAAGCGCTCTCTCTTTGTTTTTCTCTCTCTAGCTTCTTCTATGGTGGGTAGGAACCCTAATTCAGTTCTCTTCCCTCCCTGGCGGGGTAAACTTCTGAACTCAATTACACTTTGCATTTCATGACCCGTTTCATGGAATCGGAAAAGCTCTTAATTTCACGAGGAAACCCTAGAAATTCTGTTTACCCATCTGACCGCCCAATCCCCACCACCAGCGGCAGGACTATGCCCAATGAGTTGCCACAAAAGCCTGCCCCTTCCATAGCTCACCGGTTTAGAGCTCAGTTAAAGCAGCGGGATGATGAATTTAGGGTTTCTGGCCATGATGTTGTGCCCCCTCCTACCGCTGAGGATATTGTGCAGTTGTATGACCTCATGTTGTCGGAGCTCACTTTTAATTCGAAGCCCATCATTACGGATCTCACTGTTCTTGCTGATGAGCAGAGAGAACATGGGAAGGGCATTGCCGACTTAATTTGTGCGCGGATTCTCGAGGTTCCAGTTGACCAAAAACTTCCTTCATTATATCTATTAGACAGCATTGTTAAGAATGTTGGGCACGAGTACATTAGTTATTTCGCGTCTCGTTTACCTGAGGTGTTTTGCGAGGCTTACAGGCAAGTTCATCCTAATTTGCACAATGCAATGCGCCACCTCTTTGGGACTTGGGCAACTGTGTTTCCACCATCCATCATTCGGAAGATTGAAGCTCAGCTTTCTCAGTTAACAGCCCAAGAGTCATCAGGTTTGACATCCTCAAGGGCTTCTGAATCTCCTCGGCCAACTCATGGTATTCATGTCAATCCAAAATACTTGCGACAGCTGGAACACTCAGTGGTGGATAAACATAGCCAAGATTCAAGAGGGACCTCAGCTATAAAAGTTCATGATAAAAAGCTTGCTTCTGGATATGAAGAGTATGACTACGATCATGCGGATGCTCTTGAACATGGTGGACCTCAAGGATTTCATTCAATGGGAAGCATGGGCCATGATTCTTTTTCCCTTGGAACAAATAAAGCAAATATAAAGCTAGCAAAATCATCTCTGTCTTCAAGAATTGGACCCCATAGACCTCTACAATCAGTTGGTGATGAACATGAAACGGTCAGAGCCTCACCCTCGCAGAATGTCTATGATTATGAAGGTTCGAAGATGATTGATAGAAATGAGGATACCAATAAATGGAGGAGAAAGCAATATCCTGACGATAATCTGAATGGACTTGAAAGTACTTCTTCATATAATATTAGAAATGGACATGCACTTGAGGGACCAAGAGCTTTAATCGAAGCATATGGAAGTGATAAAGGAAAAGGTTATTTAAATGACAATCCACCCCAAGCTGAACATTTTTCTATCAATGTTATAGACAACAAGGCAACTCCGGTAACATGGCAGAACACTGAAGAAGAAGAGTTTGATTGGGAAGATATGAGCCCCACATTAGCTGATAGAGGCCGAAATAATGATATGTTGAAACCACCTGTCCCACCTTCAAGAT
TTAGGACAAGATCAGGATTTGAAAGATCAAATGCTATGCCTATAGAGCCAGGAATGAGAAGCAATTGGTCTAGTCCGGTTCAGCTACCGGGTATTGATTCCTCCATAGTTATTGAAGATGTGGCCCATTCAACACCTGATAATTGGAATATGCACAATCACATTTCTCAGACATCTCAGAACCTCATGAACAATAAAGGACAGGGAAGAAATTTCCAGATGCCTATGTTGGGGAGAGGCATAACTTCCTCTGTTGGTGAGAAGATGTCTCCTTATGGTGACAAGCTTTTGACCAATGATGCTTTACATAGGCCCACTAACATTGCTTCGAGATTGGGTTCTTCTGGTCTCGACTCTAGCATGGAGTCGCAATCAATTGTACAATCTATGGGCCCAAGGCATCCTCTGAATCTTTCTAACTCTTGCCCACCCTCTAGACCTCCAATTTTTCCTGTACCAAGACACAATGCAAGTCAGTTTGAGTCTTTAAATGGTAGCAATTCTTTCATGAATTGTGCAAATAGGACTTTTTTGCCTGAGCAGCAGATGAATAACTTGAGAAATAAGGAGCTAAGTCTTACAACTAAGTCGCCACAAGTTGGCAACCAACATACTGGGCATATTCCTTTAACTCGGGGAAACCAATTGCAGGGCATGCCTTTAAAACCGCAATTTCTACCATCTCAGGACATGCAAGATAATTTTAGTGGATCAGCAGTACCTCCAGTGTTACCACATTTAATGGCACCATCTTTGAGTCAAGGATACATTTCACAAGGACATCGCCCTGCTATTAGTGAGGGTTTGTCAAGTTCTGCCCCTATTGGACAATGGAATTTGTCTGTTCATAATAGCTCCAGTAACCCTTTGCATTTACAAGGAGGGCCGCTGCCACCTCTTCCACCTGGGCCTCATCCTACTTCTGGTCCGACTATACCTATCTCTCAAAAGGTTCCTGGACAGCAACCGGGAACTGCAATTTCTGGGCTAATAAGTTCTCTCATGGCCCGGGGTTTAATCTCATTAAACAATCAAGCTTCTGTACAGGATTCTGTTGGGTTAGAATTCAATCCAGATGTACTCAAGGTGCGACATGAATCTGCAATCACTGCTCTATATGCTGATCTTCCTCGACAATGCATGACCTGTGGTCTTCGATTCAAGACCCAGGAAGAGCATAGTAATCATATGGATTGGCATGTCACTAAAAATCGTATGTCGAAAAGTAGGAAGCAAAAGCCTTCTCGCAAATGGTTTGTAAGTATTAGCATGTGGCTTAGCGGTGCAGAGGCTTTAGGAACGGAGGCAGTTCCAGGATTTTTGCCTGCTGAGGTTGTTGTAGAGAAAAAAGATGATGAAGAACTGGCTGTTCCTGCTGACGAGGATCAGAAGACATGTGCATTATGTGGAGAACCTTTCGAGGATTTTTACAGTGATGAAACAGAGGAGTGGATGTATCGGGGTGCTGTCTACATGAATGCACCTGATGGACAAACAGCCGGCATGGATATTTCTCAGTTAGGGCCCATAGTGCATGCTAAATGCAGGACCGAAACTAATGTGGTTCCCTCTGAAAGTTTTGACCAAGATGAAGGGGGAGTTAGTGAAGAGGGTAATCGAAGAAAACGATTGCGGAGCTAGCCTAGATGGATTTCTTTACTCTGGTTGTAGCGTATACAAGTTTTTTTCACATGATCTCTTGTTCTCACAGTAGTGTTAGCTAGAATTTACTTGGATTATGCTTGAAGCATTGTATATTTAAAAATGGAAATTTTCATGCGCAATATATTGCAATCTAAAAGAGGCTAACCCTCAATCTCATCTTCTTTGCTGTGGAGTTTCAGTTTTACATCTAAAATCCTTTATCTTTAAGGGTTGGTATGTACGTTGGAATGAAATGTTTTAGTTTGCAAACGTCTTGTAGGCTTTCGTATAGATGAATAATGAGTCGGAATCGTAGTCGTAATTGTAGCATTGTAGCTGCTTGTTACATCCATGCTG
TGTTTGACTTAACATCTGCAGGGGATTGAATGGAGCGTGTAGTTGCTGCTTATCAGATCTTGCTCGATGTTTATCCCATTCAAACTGAAGCCCCAAGTTACACGATGGACCATCCACAGCATCTTCTTTGTTTTTGTTTTTGTTTTTTATAATGAATTTGAGTTCTCTTTGTATTAGTACCGTTTTTGTTGTGTCCTTTGTTTTTCTGGGTTCAAATTGCTATGAAGTTGGATCGAGCTTGTACTCTGACAATTCGTGAACAATACAGTATGTTGAATGTTCAGATTCGCTAAATTTGCAAGG

Coding sequence (CDS)

ATGACCCGTTTCATGGAATCGGAAAAGCTCTTAATTTCACGAGGAAACCCTAGAAATTCTGTTTACCCATCTGACCGCCCAATCCCCACCACCAGCGGCAGGACTATGCCCAATGAGTTGCCACAAAAGCCTGCCCCTTCCATAGCTCACCGGTTTAGAGCTCAGTTAAAGCAGCGGGATGATGAATTTAGGGTTTCTGGCCATGATGTTGTGCCCCCTCCTACCGCTGAGGATATTGTGCAGTTGTATGACCTCATGTTGTCGGAGCTCACTTTTAATTCGAAGCCCATCATTACGGATCTCACTGTTCTTGCTGATGAGCAGAGAGAACATGGGAAGGGCATTGCCGACTTAATTTGTGCGCGGATTCTCGAGGTTCCAGTTGACCAAAAACTTCCTTCATTATATCTATTAGACAGCATTGTTAAGAATGTTGGGCACGAGTACATTAGTTATTTCGCGTCTCGTTTACCTGAGGTGTTTTGCGAGGCTTACAGGCAAGTTCATCCTAATTTGCACAATGCAATGCGCCACCTCTTTGGGACTTGGGCAACTGTGTTTCCACCATCCATCATTCGGAAGATTGAAGCTCAGCTTTCTCAGTTAACAGCCCAAGAGTCATCAGGTTTGACATCCTCAAGGGCTTCTGAATCTCCTCGGCCAACTCATGGTATTCATGTCAATCCAAAATACTTGCGACAGCTGGAACACTCAGTGGTGGATAAACATAGCCAAGATTCAAGAGGGACCTCAGCTATAAAAGTTCATGATAAAAAGCTTGCTTCTGGATATGAAGAGTATGACTACGATCATGCGGATGCTCTTGAACATGGTGGACCTCAAGGATTTCATTCAATGGGAAGCATGGGCCATGATTCTTTTTCCCTTGGAACAAATAAAGCAAATATAAAGCTAGCAAAATCATCTCTGTCTTCAAGAATTGGACCCCATAGACCTCTACAATCAGTTGGTGATGAACATGAAACGGTCAGAGCCTCACCCTCGCAGAATGTCTATGATTATGAAGGTTCGAAGATGATTGATAGAAATGAGGATACCAATAAATGGAGGAGAAAGCAATATCCTGACGATAATCTGAATGGACTTGAAAGTACTTCTTCATATAATATTAGAAATGGACATGCACTTGAGGGACCAAGAGCTTTAATCGAAGCATATGGAAGTGATAAAGGAAAAGGTTATTTAAATGACAATCCACCCCAAGCTGAACATTTTTCTATCAATGTTATAGACAACAAGGCAACTCCGGTAACATGGCAGAACACTGAAGAAGAAGAGTTTGATTGGGAAGATATGAGCCCCACATTAGCTGATAGAGGCCGAAATAATGATATGTTGAAACCACCTGTCCCACCTTCAAGATTTAGGACAAGATCAGGATTTGAAAGATCAAATGCTATGCCTATAGAGCCAGGAATGAGAAGCAATTGGTCTAGTCCGGTTCAGCTACCGGGTATTGATTCCTCCATAGTTATTGAAGATGTGGCCCATTCAACACCTGATAATTGGAATATGCACAATCACATTTCTCAGACATCTCAGAACCTCATGAACAATAAAGGACAGGGAAGAAATTTCCAGATGCCTATGTTGGGGAGAGGCATAACTTCCTCTGTTGGTGAGAAGATGTCTCCTTATGGTGACAAGCTTTTGACCAATGATGCTTTACATAGGCCCACTAACATTGCTTCGAGATTGGGTTCTTCTGGTCTCGACTCTAGCATGGAGTCGCAATCAATTGTACAATCTATGGGCCCAAGGCATCCTCTGAATCTTTCTAACTCTTGCCCACCCTCTAGACCTCCAATTTTTCCTGTACCAAGACACAATGCAAGTCAGTTTGAGTCTTTAAATGGTAGCAATTCTTTCATGAATTGTGCAAATAGGACTTTTTTGCCTGAGCAGCAGATGAATAACTTGAGAAATAAGGAGCTAAGTCTTACAACTAAGTCGCCACAAGTTGGCAACCAACATACTGG
GCATATTCCTTTAACTCGGGGAAACCAATTGCAGGGCATGCCTTTAAAACCGCAATTTCTACCATCTCAGGACATGCAAGATAATTTTAGTGGATCAGCAGTACCTCCAGTGTTACCACATTTAATGGCACCATCTTTGAGTCAAGGATACATTTCACAAGGACATCGCCCTGCTATTAGTGAGGGTTTGTCAAGTTCTGCCCCTATTGGACAATGGAATTTGTCTGTTCATAATAGCTCCAGTAACCCTTTGCATTTACAAGGAGGGCCGCTGCCACCTCTTCCACCTGGGCCTCATCCTACTTCTGGTCCGACTATACCTATCTCTCAAAAGGTTCCTGGACAGCAACCGGGAACTGCAATTTCTGGGCTAATAAGTTCTCTCATGGCCCGGGGTTTAATCTCATTAAACAATCAAGCTTCTGTACAGGATTCTGTTGGGTTAGAATTCAATCCAGATGTACTCAAGGTGCGACATGAATCTGCAATCACTGCTCTATATGCTGATCTTCCTCGACAATGCATGACCTGTGGTCTTCGATTCAAGACCCAGGAAGAGCATAGTAATCATATGGATTGGCATGTCACTAAAAATCGTATGTCGAAAAGTAGGAAGCAAAAGCCTTCTCGCAAATGGTTTGTAAGTATTAGCATGTGGCTTAGCGGTGCAGAGGCTTTAGGAACGGAGGCAGTTCCAGGATTTTTGCCTGCTGAGGTTGTTGTAGAGAAAAAAGATGATGAAGAACTGGCTGTTCCTGCTGACGAGGATCAGAAGACATGTGCATTATGTGGAGAACCTTTCGAGGATTTTTACAGTGATGAAACAGAGGAGTGGATGTATCGGGGTGCTGTCTACATGAATGCACCTGATGGACAAACAGCCGGCATGGATATTTCTCAGTTAGGGCCCATAGTGCATGCTAAATGCAGGACCGAAACTAATGTGGTTCCCTCTGAAAGTTTTGACCAAGATGAAGGGGGAGTTAGTGAAGAGGGTAATCGAAGAAAACGATTGCGGAGCTAG

Protein sequence

MTRFMESEKLLISRGNPRNSVYPSDRPIPTTSGRTMPNELPQKPAPSIAHRFRAQLKQRDDEFRVSGHDVVPPPTAEDIVQLYDLMLSELTFNSKPIITDLTVLADEQREHGKGIADLICARILEVPVDQKLPSLYLLDSIVKNVGHEYISYFASRLPEVFCEAYRQVHPNLHNAMRHLFGTWATVFPPSIIRKIEAQLSQLTAQESSGLTSSRASESPRPTHGIHVNPKYLRQLEHSVVDKHSQDSRGTSAIKVHDKKLASGYEEYDYDHADALEHGGPQGFHSMGSMGHDSFSLGTNKANIKLAKSSLSSRIGPHRPLQSVGDEHETVRASPSQNVYDYEGSKMIDRNEDTNKWRRKQYPDDNLNGLESTSSYNIRNGHALEGPRALIEAYGSDKGKGYLNDNPPQAEHFSINVIDNKATPVTWQNTEEEEFDWEDMSPTLADRGRNNDMLKPPVPPSRFRTRSGFERSNAMPIEPGMRSNWSSPVQLPGIDSSIVIEDVAHSTPDNWNMHNHISQTSQNLMNNKGQGRNFQMPMLGRGITSSVGEKMSPYGDKLLTNDALHRPTNIASRLGSSGLDSSMESQSIVQSMGPRHPLNLSNSCPPSRPPIFPVPRHNASQFESLNGSNSFMNCANRTFLPEQQMNNLRNKELSLTTKSPQVGNQHTGHIPLTRGNQLQGMPLKPQFLPSQDMQDNFSGSAVPPVLPHLMAPSLSQGYISQGHRPAISEGLSSSAPIGQWNLSVHNSSSNPLHLQGGPLPPLPPGPHPTSGPTIPISQKVPGQQPGTAISGLISSLMARGLISLNNQASVQDSVGLEFNPDVLKVRHESAITALYADLPRQCMTCGLRFKTQEEHSNHMDWHVTKNRMSKSRKQKPSRKWFVSISMWLSGAEALGTEAVPGFLPAEVVVEKKDDEELAVPADEDQKTCALCGEPFEDFYSDETEEWMYRGAVYMNAPDGQTAGMDISQLGPIVHAKCRTETNVVPSESFDQDEGGVSEEGNRRKRLRS*
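
The relationship between the coding sequence and the protein sequence above can be checked by translating the CDS codon by codon. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene. `translate` is an illustrative helper, and only the first 27 and last 15 bases of the CDS are used, to keep the example short.

```python
# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate an in-frame DNA coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

cds_start = "ATGACCCGTTTCATGGAATCGGAAAAG"  # first 9 codons of the CDS above
cds_end = "CGATTGCGGAGCTAG"                # last 5 codons, including the stop

print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to `MTRFMESEK` and `RLRS*`, matching the beginning and end of the protein sequence listed above.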
Homology
BLAST of CSPI01G13400 vs. ExPASy Swiss-Prot
Match: Q0WPF2 (Polyadenylation and cleavage factor homolog 4 OS=Arabidopsis thaliana OX=3702 GN=PCFS4 PE=1 SV=1)

HSP 1 Score: 630.6 bits (1625), Expect = 3.1e-179
Identity = 437/1010 (43.27%), Positives = 561/1010 (55.54%), Query Frame = 0

Query: 5    MESEKLLISRGNPRNSVYPSDRPIPTTSGRTMPNELPQK--PAPSIAHRFRAQLKQRDDE 64
            M+SEK+L    NPR         I +TS + M  ELPQK  P PS+  RF+A L QR+DE
Sbjct: 1    MDSEKIL----NPRLV------SINSTSRKGMSVELPQKPPPPPSLLDRFKALLNQREDE 60

Query: 65   FRVSGHDVVPPPTAEDIVQLYDLMLSELTFNSKPIITDLTVLADEQREHGKGIADLICAR 124
            F   G + V PP+ ++IVQLY+++L ELTFNSKPIITDLT++A EQREHG+GIA+ IC R
Sbjct: 61   F--GGGEEVLPPSMDEIVQLYEVVLGELTFNSKPIITDLTIIAGEQREHGEGIANAICTR 120

Query: 125  ILEVPVDQKLPSLYLLDSIVKNVGHEYISYFASRLPEVFCEAYRQVHPNLHNAMRHLFGT 184
            ILE PV+QKLPSLYLLDSIVKN+G +Y  YF+SRLPEVFC AYRQ HP+LH +MRHLFGT
Sbjct: 121  ILEAPVEQKLPSLYLLDSIVKNIGRDYGRYFSSRLPEVFCLAYRQAHPSLHPSMRHLFGT 180

Query: 185  WATVFPPSIIRKIEAQLSQLTAQESSGLTSSRASESPRPTHGIHVNPKYLRQLEHSVVDK 244
            W++VFPP ++RKI+ QL   +A   S   S  ASE  +PT GIHVNPKYLR+LE S  + 
Sbjct: 181  WSSVFPPPVLRKIDMQLQLSSAANQS---SVGASEPSQPTRGIHVNPKYLRRLEPSAAE- 240

Query: 245  HSQDSRG-TSAIKVHDKKLASGYEEYDYDHADALEHGGPQGFHSMGSMGHDSFSLGTNKA 304
               + RG  S+ +V+ +    GY +++    D LE                         
Sbjct: 241  --NNLRGINSSARVYGQNSLGGYNDFE----DQLE------------------------- 300

Query: 305  NIKLAKSSLSSRIGPHRPLQSVGDEHETVRASPSQNVYDYEGSKMIDRNEDTNKWRRKQY 364
                + SSLSS         + G       A+PS   ++Y   +   R+++  +WRRK+ 
Sbjct: 301  ----SPSSLSSTPDGFTRRSNDG-------ANPSNQAFNYGMGRATSRDDEHMEWRRKE- 360

Query: 365  PDDNLNGLESTSSYNIRNGHALEGPRALIEAYGSDKGKGYLNDNPPQAEHFSINVIDNK- 424
                          N+  G+  E PRALI+AYG D  K    + P +     +N + +K 
Sbjct: 361  --------------NLGQGNDHERPRALIDAYGVDTSKHVTINKPIR----DMNGMHSKM 420

Query: 425  ATPVTWQNTEEEEFDWEDMSPTLADRGRNNDMLKPPVPP-SRFRTRSGFERSNAMPIEPG 484
             TP  WQNTEEEEFDWEDMSPTL DR R  + L+  VP     R R     ++   ++  
Sbjct: 421  VTP--WQNTEEEEFDWEDMSPTL-DRSRAGEFLRSSVPALGSVRARPRVGNTSDFHLDSD 480

Query: 485  MRSNWSSPVQLPGIDSSIVIEDVAHSTPDNWNMHNHISQTSQNLMNNKGQGRNFQMPMLG 544
            +++                   V+H   +NW++  +   TS  +  +   G++ ++    
Sbjct: 481  IKNG------------------VSHQLRENWSLSQNYPHTSNRV--DTRAGKDLKVLASS 540

Query: 545  RGITSSVGEKMSPYGDKLLTNDALHRPTNIASRLGSSGLDSSMESQSIVQSMGPRHPLNL 604
             G+ SS  E  +P  D +          ++ SR G +  D +    S   + GP      
Sbjct: 541  VGLVSSNSEFGAPPFDSI---------QDVNSRFGRALPDGTWPHLS---ARGP------ 600

Query: 605  SNSCPPSRPPIFPVPRHNASQFESLNGSNSFMNCANRTFLPEQQMNNLRNKELSLTTKSP 664
             NS         PVP            S    + AN    P   M+N R +   L     
Sbjct: 601  -NS--------LPVP------------SAHLHHLAN----PGNAMSN-RLQGKPLYRPEN 660

Query: 665  QVGNQHTGHIPLTRGNQLQGMPLKPQFLPSQDMQDNFSGSAVPPVLPHLMAPSLSQGYIS 724
            QV   H     +T+ NQ     +   +LPS       S +  P  +  L+       ++S
Sbjct: 661  QVSQSHLN--DMTQQNQ-----MLVNYLPS-------SSAMAPRPMQSLLT------HVS 720

Query: 725  QGHRPAISEGLSSSAPIGQWNLSVHNSSSNP-LHLQGG-PLPPLPPGPHPTSGPTIPISQ 784
             G+ P                   H S+  P L +QGG  + PL  G     G +     
Sbjct: 721  HGYPP-------------------HGSTIRPSLSIQGGEAMHPLSSGVLSQIGAS----- 780

Query: 785  KVPGQQPGTAISGLISSLMARGLISLNNQASVQDSVGLEFNPDVLKVRHESAITALYADL 844
                Q PG A SGLI SLMA+GLISLNNQ + Q  +GLEF+ D+LK+R+ESAI+ALY DL
Sbjct: 781  ---NQPPGGAFSGLIGSLMAQGLISLNNQPAGQGPLGLEFDADMLKIRNESAISALYGDL 808

Query: 845  PRQCMTCGLRFKTQEEHSNHMDWHVTKNRMSKSRKQKPSRKWFVSISMWLSGAEALGTEA 904
            PRQC TCGLRFK QEEHS HMDWHVTKNRMSK+ KQ PSRKWFVS SMWLSGAEALG EA
Sbjct: 841  PRQCTTCGLRFKCQEEHSKHMDWHVTKNRMSKNHKQNPSRKWFVSASMWLSGAEALGAEA 808

Query: 905  VPGFLPAEVVVEKKDDEELAVPADEDQKTCALCGEPFEDFYSDETEEWMYRGAVYMNAPD 964
            VPGFLP E   EKKDDE++AVPADEDQ +CALCGEPFEDFYSDETEEWMY+GAVYMNAP+
Sbjct: 901  VPGFLPTEPTTEKKDDEDMAVPADEDQTSCALCGEPFEDFYSDETEEWMYKGAVYMNAPE 808

Query: 965  GQTAGMDISQLGPIVHAKCRTETNVVPSESFDQDEGGVSEEGNRRKRLRS 1008
              T  MD SQLGPIVHAKCR E+N           GG  EEG++RK++RS
Sbjct: 961  ESTTDMDKSQLGPIVHAKCRPESN-----------GGDMEEGSQRKKMRS 808
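
The percentages in an HSP summary line are ratios of matched columns to the alignment length. As a small sketch, the figures reported for the Q0WPF2 hit above (437 identical and 561 positive columns out of 1010 aligned columns) can be recomputed as follows. `percent` is a hypothetical helper, and rounding to two decimals is an assumption about how the page formats these values.

```python
def percent(matches, aligned_length):
    """Return matches as a percentage of the alignment length, to 2 decimals."""
    if aligned_length <= 0:
        raise ValueError("alignment length must be positive")
    return round(100.0 * matches / aligned_length, 2)

identity = percent(437, 1010)   # identical residues / aligned columns
positives = percent(561, 1010)  # identical or conservatively substituted / aligned columns

print(identity, positives)
```

Note that the denominator is the alignment length including gap columns, not the length of either sequence, which is why a 1007-residue query can yield an alignment length of 1010.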

BLAST of CSPI01G13400 vs. ExPASy Swiss-Prot
Match: Q9C710 (Polyadenylation and cleavage factor homolog 1 OS=Arabidopsis thaliana OX=3702 GN=PCFS1 PE=1 SV=1)

HSP 1 Score: 165.2 bits (417), Expect = 3.7e-39
Identity = 95/185 (51.35%), Positives = 114/185 (61.62%), Query Frame = 0

Query: 806 QASVQDS--VGLEF-NPDVLKVRHESAITALYADLPRQCMTCGLRFKTQEEHSNHMDWHV 865
           +AS  DS  VGL F NP  L VRHES I +LY+D+PRQC +CGLRFK QEEHS HMDWHV
Sbjct: 218 EASNSDSLPVGLSFDNPSSLNVRHESVIKSLYSDMPRQCSSCGLRFKCQEEHSKHMDWHV 277

Query: 866 TKNRMSKS-----RKQKPSRKWFVSISMWLSGAEALGTEAVPGFLPAEVVVEKKDDEE-- 925
            KNR  K+     ++ K SR W  S S+WL  A    T  V  F   E+  +K  DEE  
Sbjct: 278 RKNRSVKTTTRLGQQPKKSRGWLASASLWLCAATGGETVEVASF-GGEMQKKKGKDEEPK 337

Query: 926 -LAVPADEDQKTCALCGEPFEDFYSDETEEWMYRGAVYMNAPDGQTAGMDISQLGPIVHA 980
            L VPADEDQK CALC EPFE+F+S E ++WMY+ AVY            +++ G IVH 
Sbjct: 338 QLMVPADEDQKNCALCVEPFEEFFSHEDDDWMYKDAVY------------LTKNGRIVHV 389

BLAST of CSPI01G13400 vs. ExPASy Swiss-Prot
Match: Q9FIX8 (Polyadenylation and cleavage factor homolog 5 OS=Arabidopsis thaliana OX=3702 GN=PCFS5 PE=1 SV=1)

HSP 1 Score: 163.3 bits (412), Expect = 1.4e-38
Identity = 90/186 (48.39%), Positives = 114/186 (61.29%), Query Frame = 0

Query: 805 NQASVQDS--VGLEF-NPDVLKVRHESAITALYADLPRQCMTCGLRFKTQEEHSNHMDWH 864
           ++AS  DS  VGL F NP  L VRHES I +LY+D+PRQC +CG+RFK QEEHS HMDWH
Sbjct: 210 SEASNNDSLPVGLSFDNPSSLNVRHESVIKSLYSDMPRQCTSCGVRFKCQEEHSKHMDWH 269

Query: 865 VTKNRMSKS-----RKQKPSRKWFVSISMWLSGAEALGTEAVPGFLPAEVVVEKKDDE-- 924
           V KNR  K+     ++ K SR W  S S+WL      GT  V  F   E+  + + D+  
Sbjct: 270 VRKNRSVKTTTRLGQQPKKSRGWLASASLWLCAPTGGGTVEVASFGGGEMQKKNEKDQVQ 329

Query: 925 -ELAVPADEDQKTCALCGEPFEDFYSDETEEWMYRGAVYMNAPDGQTAGMDISQLGPIVH 980
            +  VPADEDQK CALC EPFE+F+S E ++WMY+ AVY            +++ G IVH
Sbjct: 330 KQHMVPADEDQKNCALCVEPFEEFFSHEADDWMYKDAVY------------LTKNGRIVH 383

BLAST of CSPI01G13400 vs. ExPASy Swiss-Prot
Match: O94913 (Pre-mRNA cleavage complex 2 protein Pcf11 OS=Homo sapiens OX=9606 GN=PCF11 PE=1 SV=3)

HSP 1 Score: 98.6 bits (244), Expect = 4.3e-19
Identity = 59/162 (36.42%), Positives = 83/162 (51.23%), Query Frame = 0

Query: 77  EDIVQLYDLMLSELTFNSKPIITDLTVLADEQREHGKGIADLICARILEVPVDQKLPSLY 136
           ED  + Y   L +LTFNSKP I  LT+LA+E     K I  LI A+  + P  +KLP +Y
Sbjct: 16  EDACRDYQSSLEDLTFNSKPHINMLTILAEENLPFAKEIVSLIEAQTAKAPSSEKLPVMY 75

Query: 137 LLDSIVKNVGHEYISYFASRLPEVFCEAYRQVHPNLHNAMRHLFGTWATVFPPSIIRKIE 196
           L+DSIVKNVG EY++ F   L   F   + +V  N   ++  L  TW  +FP   +  ++
Sbjct: 76  LMDSIVKNVGREYLTAFTKNLVATFICVFEKVDENTRKSLFKLRSTWDEIFPLKKLYALD 135

Query: 197 AQLSQLTAQESSGLTSSRASESPRP----THGIHVNPKYLRQ 235
            +++ L                P P    T  IHVNPK+L +
Sbjct: 136 VRVNSL---------DPAWPIKPLPPNVNTSSIHVNPKFLNK 168

BLAST of CSPI01G13400 vs. ExPASy Swiss-Prot
Match: Q10237 (Uncharacterized protein C4G9.04c OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=SPAC4G9.04c PE=4 SV=1)

HSP 1 Score: 85.1 bits (209), Expect = 4.9e-15
Identity = 54/133 (40.60%), Positives = 67/133 (50.38%), Query Frame = 0

Query: 78  DIVQL-YDLMLSELTFNSKPIITDLTVLADEQREHGKGIADLICARILEVPVDQKLPSLY 137
           D+V+L Y   L +LTFNSKPII  LT +A E   +   I + I   I + P + KLP+LY
Sbjct: 2   DLVELDYLSALEDLTFNSKPIIHTLTYIAQENEPYAISIVNAIEKHIQKCPPNCKLPALY 61

Query: 138 LLDSIVKNVGHEYISYFASRLPEVFCEAYRQVHPNLHNAMRHLFGTW----------ATV 197
           LLDSI KN+G  Y  +F   L   F  AY  V P L   +  L  TW            V
Sbjct: 62  LLDSISKNLGAPYTYFFGLHLFSTFMSAYTVVEPRLRLKLDQLLATWKQRPPNSSSLEPV 121

Query: 198 FPPSIIRKIEAQL 200
           F P +  KIE  L
Sbjct: 122 FSPIVTAKIENAL 134

BLAST of CSPI01G13400 vs. ExPASy TrEMBL
Match: A0A0A0LVG0 (CID domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_1G109350 PE=4 SV=1)

HSP 1 Score: 2021.9 bits (5237), Expect = 0.0e+00
Identity = 1004/1007 (99.70%), Positives = 1005/1007 (99.80%), Query Frame = 0

Query: 1    MTRFMESEKLLISRGNPRNSVYPSDRPIPTTSGRTMPNELPQKPAPSIAHRFRAQLKQRD 60
            MTRFMESEKLLISRGNPRNSVYPSDRPIPTTSGRTMPNELPQKPAPSIAHRFRAQLKQRD
Sbjct: 1    MTRFMESEKLLISRGNPRNSVYPSDRPIPTTSGRTMPNELPQKPAPSIAHRFRAQLKQRD 60

Query: 61   DEFRVSGHDVVPPPTAEDIVQLYDLMLSELTFNSKPIITDLTVLADEQREHGKGIADLIC 120
            DEFRVSGHDVVP PTAEDIVQLYDLMLSELTFNSKPIITDLTVLADEQREHGKGIADLIC
Sbjct: 61   DEFRVSGHDVVPLPTAEDIVQLYDLMLSELTFNSKPIITDLTVLADEQREHGKGIADLIC 120

Query: 121  ARILEVPVDQKLPSLYLLDSIVKNVGHEYISYFASRLPEVFCEAYRQVHPNLHNAMRHLF 180
            ARILEVPVDQKLPSLYLLDSIVKNVGHEYISYFASRLPEVFCEAYRQVHPNLHNAMRHLF
Sbjct: 121  ARILEVPVDQKLPSLYLLDSIVKNVGHEYISYFASRLPEVFCEAYRQVHPNLHNAMRHLF 180

Query: 181  GTWATVFPPSIIRKIEAQLSQLTAQESSGLTSSRASESPRPTHGIHVNPKYLRQLEHSVV 240
            GTWATVFPPSIIRKIEAQLSQLTAQESSGLTSSRASESPRPTHGIHVNPKYLRQLEHSVV
Sbjct: 181  GTWATVFPPSIIRKIEAQLSQLTAQESSGLTSSRASESPRPTHGIHVNPKYLRQLEHSVV 240

Query: 241  DKHSQDSRGTSAIKVHDKKLASGYEEYDYDHADALEHGGPQGFHSMGSMGHDSFSLGTNK 300
            DKHSQDSRGTSAIKVHDKKLASGYEEYDYDHADALEHGGPQGFHSMGSMGHDSFSLGTNK
Sbjct: 241  DKHSQDSRGTSAIKVHDKKLASGYEEYDYDHADALEHGGPQGFHSMGSMGHDSFSLGTNK 300

Query: 301  ANIKLAKSSLSSRIGPHRPLQSVGDEHETVRASPSQNVYDYEGSKMIDRNEDTNKWRRKQ 360
            ANIKLAKSSLSSRIGPHRPLQSVGDEHETVRASPSQNVYDYEGSKMIDRNEDTNKWRRKQ
Sbjct: 301  ANIKLAKSSLSSRIGPHRPLQSVGDEHETVRASPSQNVYDYEGSKMIDRNEDTNKWRRKQ 360

Query: 361  YPDDNLNGLESTSSYNIRNGHALEGPRALIEAYGSDKGKGYLNDNPPQAEHFSINVIDNK 420
            YPDDNLNGLESTSSYNIRNGHALEGPRALIEAYGSDKGKGYLNDNPPQAEHFSINVIDNK
Sbjct: 361  YPDDNLNGLESTSSYNIRNGHALEGPRALIEAYGSDKGKGYLNDNPPQAEHFSINVIDNK 420

Query: 421  ATPVTWQNTEEEEFDWEDMSPTLADRGRNNDMLKPPVPPSRFRTRSGFERSNAMPIEPGM 480
            ATPVTWQNTEEEEFDWEDMSPTLADRGRNNDMLKPPVPPSRFRTRSGFERSNAMPIEPGM
Sbjct: 421  ATPVTWQNTEEEEFDWEDMSPTLADRGRNNDMLKPPVPPSRFRTRSGFERSNAMPIEPGM 480

Query: 481  RSNWSSPVQLPGIDSSIVIEDVAHSTPDNWNMHNHISQTSQNLMNNKGQGRNFQMPMLGR 540
            RSNWSSPV+LPGIDSSIVIEDV HSTPDNWNMHNHISQTSQNLMNNKGQGRNFQMPMLGR
Sbjct: 481  RSNWSSPVRLPGIDSSIVIEDVVHSTPDNWNMHNHISQTSQNLMNNKGQGRNFQMPMLGR 540

Query: 541  GITSSVGEKMSPYGDKLLTNDALHRPTNIASRLGSSGLDSSMESQSIVQSMGPRHPLNLS 600
            GITSSVGEKMSPYGDKLLTNDALHRPTNIASRLGSSGLDSSMESQSIVQSMGPRHPLNLS
Sbjct: 541  GITSSVGEKMSPYGDKLLTNDALHRPTNIASRLGSSGLDSSMESQSIVQSMGPRHPLNLS 600

Query: 601  NSCPPSRPPIFPVPRHNASQFESLNGSNSFMNCANRTFLPEQQMNNLRNKELSLTTKSPQ 660
            NSCPPSRPPIFPVPRHNASQFESLNGSNSFMNCANRTFLPEQQMNNLRNKELSLTTKSPQ
Sbjct: 601  NSCPPSRPPIFPVPRHNASQFESLNGSNSFMNCANRTFLPEQQMNNLRNKELSLTTKSPQ 660

Query: 661  VGNQHTGHIPLTRGNQLQGMPLKPQFLPSQDMQDNFSGSAVPPVLPHLMAPSLSQGYISQ 720
            VGNQHTGHIPLTRGNQLQGMPLKPQFLPSQDMQDNFSGSAVPPVLPHLMAPSLSQGYISQ
Sbjct: 661  VGNQHTGHIPLTRGNQLQGMPLKPQFLPSQDMQDNFSGSAVPPVLPHLMAPSLSQGYISQ 720

Query: 721  GHRPAISEGLSSSAPIGQWNLSVHNSSSNPLHLQGGPLPPLPPGPHPTSGPTIPISQKVP 780
            GHRPAISEGLSSSAPIGQWNLSVHNSSSNPLHLQGGPLPPLPPGPHPTSGPTIPISQKVP
Sbjct: 721  GHRPAISEGLSSSAPIGQWNLSVHNSSSNPLHLQGGPLPPLPPGPHPTSGPTIPISQKVP 780

Query: 781  GQQPGTAISGLISSLMARGLISLNNQASVQDSVGLEFNPDVLKVRHESAITALYADLPRQ 840
            GQQPGTAISGLISSLMARGLISLNNQASVQDSVGLEFNPDVLKVRHESAITALYADLPRQ
Sbjct: 781  GQQPGTAISGLISSLMARGLISLNNQASVQDSVGLEFNPDVLKVRHESAITALYADLPRQ 840

Query: 841  CMTCGLRFKTQEEHSNHMDWHVTKNRMSKSRKQKPSRKWFVSISMWLSGAEALGTEAVPG 900
            CMTCGLRFKTQEEHSNHMDWHVTKNRMSKSRKQKPSRKWFVSISMWLSGAEALGTEAVPG
Sbjct: 841  CMTCGLRFKTQEEHSNHMDWHVTKNRMSKSRKQKPSRKWFVSISMWLSGAEALGTEAVPG 900

Query: 901  FLPAEVVVEKKDDEELAVPADEDQKTCALCGEPFEDFYSDETEEWMYRGAVYMNAPDGQT 960
            FLPAEVVVEKKDDEELAVPADEDQKTCALCGEPFEDFYSDETEEWMYRGAVYMNAPDGQT
Sbjct: 901  FLPAEVVVEKKDDEELAVPADEDQKTCALCGEPFEDFYSDETEEWMYRGAVYMNAPDGQT 960

Query: 961  AGMDISQLGPIVHAKCRTETNVVPSESFDQDEGGVSEEGNRRKRLRS 1008
            AGMDISQLGPIVHAKCRTETNVVPSESFDQDEGGVSEEGNRRKRLRS
Sbjct: 961  AGMDISQLGPIVHAKCRTETNVVPSESFDQDEGGVSEEGNRRKRLRS 1007

BLAST of CSPI01G13400 vs. ExPASy TrEMBL
Match: A0A1S3CI66 (polyadenylation and cleavage factor homolog 4 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103501218 PE=4 SV=1)

HSP 1 Score: 1961.4 bits (5080), Expect = 0.0e+00
Identity = 974/1007 (96.72%), Positives = 988/1007 (98.11%), Query Frame = 0

Query: 1    MTRFMESEKLLISRGNPRNSVYPSDRPIPTTSGRTMPNELPQKPAPSIAHRFRAQLKQRD 60
            MTRFMESEKLLISRGNPRNS YPSDRPIPTTSGRTMPNELPQKP PSIAHRFRAQLKQRD
Sbjct: 1    MTRFMESEKLLISRGNPRNSAYPSDRPIPTTSGRTMPNELPQKPPPSIAHRFRAQLKQRD 60

Query: 61   DEFRVSGHDVVPPPTAEDIVQLYDLMLSELTFNSKPIITDLTVLADEQREHGKGIADLIC 120
            DEFRVSGHDVVPPPTAEDIVQLYDLMLSELTFNSKPIITDLTVLADEQREHGKGIADLIC
Sbjct: 61   DEFRVSGHDVVPPPTAEDIVQLYDLMLSELTFNSKPIITDLTVLADEQREHGKGIADLIC 120

Query: 121  ARILEVPVDQKLPSLYLLDSIVKNVGHEYISYFASRLPEVFCEAYRQVHPNLHNAMRHLF 180
            ARILEVPVDQKLPSLYLLDSIVKNVGHEYISYF+SRLPEVFCEAYRQVHPNLHNAMRHLF
Sbjct: 121  ARILEVPVDQKLPSLYLLDSIVKNVGHEYISYFSSRLPEVFCEAYRQVHPNLHNAMRHLF 180

Query: 181  GTWATVFPPSIIRKIEAQLSQLTAQESSGLTSSRASESPRPTHGIHVNPKYLRQLEHSVV 240
            GTWATVFPPSIIRKIEAQLSQLTAQESSGLTSSRASESPRPTHGIHVNPKYLRQLEHSVV
Sbjct: 181  GTWATVFPPSIIRKIEAQLSQLTAQESSGLTSSRASESPRPTHGIHVNPKYLRQLEHSVV 240

Query: 241  DKHSQDSRGTSAIKVHDKKLASGYEEYDYDHADALEHGGPQGFHSMGSMGHDSFSLGTNK 300
            DKH+QDSRGTSAIKVHDKKLASGYEEYDYDHADALEHGG Q FHSMGSMGHDSFSLGTNK
Sbjct: 241  DKHTQDSRGTSAIKVHDKKLASGYEEYDYDHADALEHGGAQEFHSMGSMGHDSFSLGTNK 300

Query: 301  ANIKLAKSSLSSRIGPHRPLQSVGDEHETVRASPSQNVYDYEGSKMIDRNEDTNKWRRKQ 360
            AN+KLAKSSLSSRIG HRPLQS+GDE E+VRASPSQNVYDYEGSK++DRNEDTNKWRRKQ
Sbjct: 301  ANVKLAKSSLSSRIGHHRPLQSLGDELESVRASPSQNVYDYEGSKILDRNEDTNKWRRKQ 360

Query: 361  YPDDNLNGLESTSSYNIRNGHALEGPRALIEAYGSDKGKGYLNDNPPQAEHFSINVIDNK 420
            YPDDN+NGLE+TSSYNIRNGHALEGPRALIEAYGSDKGKGYLNDNPPQAEHFSI+ IDNK
Sbjct: 361  YPDDNMNGLENTSSYNIRNGHALEGPRALIEAYGSDKGKGYLNDNPPQAEHFSISGIDNK 420

Query: 421  ATPVTWQNTEEEEFDWEDMSPTLADRGRNNDMLKPPVPPSRFRTRSGFERSNAMPIEPGM 480
            ATPVTWQNTEEEEFDWEDMSPTLADRGRNNDMLKP VPPSRFRTRSGFERSNAMPIEPGM
Sbjct: 421  ATPVTWQNTEEEEFDWEDMSPTLADRGRNNDMLKPTVPPSRFRTRSGFERSNAMPIEPGM 480

Query: 481  RSNWSSPVQLPGIDSSIVIEDVAHSTPDNWNMHNHISQTSQNLMNNKGQGRNFQMPMLGR 540
            RSNWSS VQLPGIDSSIVIEDV HSTPD W MHNHISQTSQNLMNNKG GRNFQMPMLGR
Sbjct: 481  RSNWSSQVQLPGIDSSIVIEDVVHSTPDIWKMHNHISQTSQNLMNNKGPGRNFQMPMLGR 540

Query: 541  GITSSVGEKMSPYGDKLLTNDALHRPTNIASRLGSSGLDSSMESQSIVQSMGPRHPLNLS 600
            GITSS GEKMSPYGDKLLTNDALHRPTNIASRLGSSGLDS+MESQSIVQSMGPRHPLNLS
Sbjct: 541  GITSSGGEKMSPYGDKLLTNDALHRPTNIASRLGSSGLDSNMESQSIVQSMGPRHPLNLS 600

Query: 601  NSCPPSRPPIFPVPRHNASQFESLNGSNSFMNCANRTFLPEQQMNNLRNKELSLTTKSPQ 660
            NSCPPSRPP+FPVPRHN SQFESLNGSNSFMN ANRTFLPEQQMNNLRNKELSLTTKSPQ
Sbjct: 601  NSCPPSRPPVFPVPRHNTSQFESLNGSNSFMNSANRTFLPEQQMNNLRNKELSLTTKSPQ 660

Query: 661  VGNQHTGHIPLTRGNQLQGMPLKPQFLPSQDMQDNFSGSAVPPVLPHLMAPSLSQGYISQ 720
            VGNQHTGHIPLTRGNQLQ MPLKPQFLPSQDMQDNFSGSAVPPVLPHL+APSLSQGYISQ
Sbjct: 661  VGNQHTGHIPLTRGNQLQSMPLKPQFLPSQDMQDNFSGSAVPPVLPHLIAPSLSQGYISQ 720

Query: 721  GHRPAISEGLSSSAPIGQWNLSVHNSSSNPLHLQGGPLPPLPPGPHPTSGPTIPISQKVP 780
            GHRPA SEGLSSSAPIGQWNLSVHNSSSNPLHLQGGPLPPLPPGPHPTSGPTIPISQKVP
Sbjct: 721  GHRPANSEGLSSSAPIGQWNLSVHNSSSNPLHLQGGPLPPLPPGPHPTSGPTIPISQKVP 780

Query: 781  GQQPGTAISGLISSLMARGLISLNNQASVQDSVGLEFNPDVLKVRHESAITALYADLPRQ 840
            GQQPGTAISGLISSLMARGLISLNNQASVQDSVGLEFNPDVLKVRHESAITALYADLPRQ
Sbjct: 781  GQQPGTAISGLISSLMARGLISLNNQASVQDSVGLEFNPDVLKVRHESAITALYADLPRQ 840

Query: 841  CMTCGLRFKTQEEHSNHMDWHVTKNRMSKSRKQKPSRKWFVSISMWLSGAEALGTEAVPG 900
            CMTCGLRFKTQEEHSNHMDWHVTKNRMSKSRKQKPSRKWFVSISMWLSGAEALGTEAVPG
Sbjct: 841  CMTCGLRFKTQEEHSNHMDWHVTKNRMSKSRKQKPSRKWFVSISMWLSGAEALGTEAVPG 900

Query: 901  FLPAEVVVEKKDDEELAVPADEDQKTCALCGEPFEDFYSDETEEWMYRGAVYMNAPDGQT 960
            FLPAEVVVEKKDDEELAVPADEDQKTCALCGEPFEDFYSDETEEWMYRGAVYMNAPDGQT
Sbjct: 901  FLPAEVVVEKKDDEELAVPADEDQKTCALCGEPFEDFYSDETEEWMYRGAVYMNAPDGQT 960

Query: 961  AGMDISQLGPIVHAKCRTETNVVPSESFDQDEGGVSEEGNRRKRLRS 1008
            AGMD SQLGPIVHAKCRTETNVVPSESFDQDEGGVSE+GNRRKRLRS
Sbjct: 961  AGMDRSQLGPIVHAKCRTETNVVPSESFDQDEGGVSEDGNRRKRLRS 1007

BLAST of CSPI01G13400 vs. ExPASy TrEMBL
Match: A0A1S3CJP9 (polyadenylation and cleavage factor homolog 4 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103501218 PE=4 SV=1)

HSP 1 Score: 1955.3 bits (5064), Expect = 0.0e+00
Identity = 974/1012 (96.25%), Positives = 988/1012 (97.63%), Query Frame = 0

Query: 1    MTRFMESEKLLISRGNPRNSVYPSDRPIPTTSGRTMPNELPQKPAPSIAHRFRAQLKQRD 60
            MTRFMESEKLLISRGNPRNS YPSDRPIPTTSGRTMPNELPQKP PSIAHRFRAQLKQRD
Sbjct: 1    MTRFMESEKLLISRGNPRNSAYPSDRPIPTTSGRTMPNELPQKPPPSIAHRFRAQLKQRD 60

Query: 61   DEFRVSGHDVVPPPTAEDIVQLYDLMLSELTFNSKPIITDLTVLADEQREHGKGIADLIC 120
            DEFRVSGHDVVPPPTAEDIVQLYDLMLSELTFNSKPIITDLTVLADEQREHGKGIADLIC
Sbjct: 61   DEFRVSGHDVVPPPTAEDIVQLYDLMLSELTFNSKPIITDLTVLADEQREHGKGIADLIC 120

Query: 121  ARILEVPVDQKLPSLYLLDSIVKNVGHEYISYFASRLPEVFCEAYRQVHPNLHNAMRHLF 180
            ARILEVPVDQKLPSLYLLDSIVKNVGHEYISYF+SRLPEVFCEAYRQVHPNLHNAMRHLF
Sbjct: 121  ARILEVPVDQKLPSLYLLDSIVKNVGHEYISYFSSRLPEVFCEAYRQVHPNLHNAMRHLF 180

Query: 181  GTWATVFPPSIIRKIEAQLSQLTAQESSGLTSSRASESPRPTHGIHVNPKYLRQLEHSVV 240
            GTWATVFPPSIIRKIEAQLSQLTAQESSGLTSSRASESPRPTHGIHVNPKYLRQLEHSVV
Sbjct: 181  GTWATVFPPSIIRKIEAQLSQLTAQESSGLTSSRASESPRPTHGIHVNPKYLRQLEHSVV 240

Query: 241  DK-----HSQDSRGTSAIKVHDKKLASGYEEYDYDHADALEHGGPQGFHSMGSMGHDSFS 300
            DK     H+QDSRGTSAIKVHDKKLASGYEEYDYDHADALEHGG Q FHSMGSMGHDSFS
Sbjct: 241  DKLLALQHTQDSRGTSAIKVHDKKLASGYEEYDYDHADALEHGGAQEFHSMGSMGHDSFS 300

Query: 301  LGTNKANIKLAKSSLSSRIGPHRPLQSVGDEHETVRASPSQNVYDYEGSKMIDRNEDTNK 360
            LGTNKAN+KLAKSSLSSRIG HRPLQS+GDE E+VRASPSQNVYDYEGSK++DRNEDTNK
Sbjct: 301  LGTNKANVKLAKSSLSSRIGHHRPLQSLGDELESVRASPSQNVYDYEGSKILDRNEDTNK 360

Query: 361  WRRKQYPDDNLNGLESTSSYNIRNGHALEGPRALIEAYGSDKGKGYLNDNPPQAEHFSIN 420
            WRRKQYPDDN+NGLE+TSSYNIRNGHALEGPRALIEAYGSDKGKGYLNDNPPQAEHFSI+
Sbjct: 361  WRRKQYPDDNMNGLENTSSYNIRNGHALEGPRALIEAYGSDKGKGYLNDNPPQAEHFSIS 420

Query: 421  VIDNKATPVTWQNTEEEEFDWEDMSPTLADRGRNNDMLKPPVPPSRFRTRSGFERSNAMP 480
             IDNKATPVTWQNTEEEEFDWEDMSPTLADRGRNNDMLKP VPPSRFRTRSGFERSNAMP
Sbjct: 421  GIDNKATPVTWQNTEEEEFDWEDMSPTLADRGRNNDMLKPTVPPSRFRTRSGFERSNAMP 480

Query: 481  IEPGMRSNWSSPVQLPGIDSSIVIEDVAHSTPDNWNMHNHISQTSQNLMNNKGQGRNFQM 540
            IEPGMRSNWSS VQLPGIDSSIVIEDV HSTPD W MHNHISQTSQNLMNNKG GRNFQM
Sbjct: 481  IEPGMRSNWSSQVQLPGIDSSIVIEDVVHSTPDIWKMHNHISQTSQNLMNNKGPGRNFQM 540

Query: 541  PMLGRGITSSVGEKMSPYGDKLLTNDALHRPTNIASRLGSSGLDSSMESQSIVQSMGPRH 600
            PMLGRGITSS GEKMSPYGDKLLTNDALHRPTNIASRLGSSGLDS+MESQSIVQSMGPRH
Sbjct: 541  PMLGRGITSSGGEKMSPYGDKLLTNDALHRPTNIASRLGSSGLDSNMESQSIVQSMGPRH 600

Query: 601  PLNLSNSCPPSRPPIFPVPRHNASQFESLNGSNSFMNCANRTFLPEQQMNNLRNKELSLT 660
            PLNLSNSCPPSRPP+FPVPRHN SQFESLNGSNSFMN ANRTFLPEQQMNNLRNKELSLT
Sbjct: 601  PLNLSNSCPPSRPPVFPVPRHNTSQFESLNGSNSFMNSANRTFLPEQQMNNLRNKELSLT 660

Query: 661  TKSPQVGNQHTGHIPLTRGNQLQGMPLKPQFLPSQDMQDNFSGSAVPPVLPHLMAPSLSQ 720
            TKSPQVGNQHTGHIPLTRGNQLQ MPLKPQFLPSQDMQDNFSGSAVPPVLPHL+APSLSQ
Sbjct: 661  TKSPQVGNQHTGHIPLTRGNQLQSMPLKPQFLPSQDMQDNFSGSAVPPVLPHLIAPSLSQ 720

Query: 721  GYISQGHRPAISEGLSSSAPIGQWNLSVHNSSSNPLHLQGGPLPPLPPGPHPTSGPTIPI 780
            GYISQGHRPA SEGLSSSAPIGQWNLSVHNSSSNPLHLQGGPLPPLPPGPHPTSGPTIPI
Sbjct: 721  GYISQGHRPANSEGLSSSAPIGQWNLSVHNSSSNPLHLQGGPLPPLPPGPHPTSGPTIPI 780

Query: 781  SQKVPGQQPGTAISGLISSLMARGLISLNNQASVQDSVGLEFNPDVLKVRHESAITALYA 840
            SQKVPGQQPGTAISGLISSLMARGLISLNNQASVQDSVGLEFNPDVLKVRHESAITALYA
Sbjct: 781  SQKVPGQQPGTAISGLISSLMARGLISLNNQASVQDSVGLEFNPDVLKVRHESAITALYA 840

Query: 841  DLPRQCMTCGLRFKTQEEHSNHMDWHVTKNRMSKSRKQKPSRKWFVSISMWLSGAEALGT 900
            DLPRQCMTCGLRFKTQEEHSNHMDWHVTKNRMSKSRKQKPSRKWFVSISMWLSGAEALGT
Sbjct: 841  DLPRQCMTCGLRFKTQEEHSNHMDWHVTKNRMSKSRKQKPSRKWFVSISMWLSGAEALGT 900

Query: 901  EAVPGFLPAEVVVEKKDDEELAVPADEDQKTCALCGEPFEDFYSDETEEWMYRGAVYMNA 960
            EAVPGFLPAEVVVEKKDDEELAVPADEDQKTCALCGEPFEDFYSDETEEWMYRGAVYMNA
Sbjct: 901  EAVPGFLPAEVVVEKKDDEELAVPADEDQKTCALCGEPFEDFYSDETEEWMYRGAVYMNA 960

Query: 961  PDGQTAGMDISQLGPIVHAKCRTETNVVPSESFDQDEGGVSEEGNRRKRLRS 1008
            PDGQTAGMD SQLGPIVHAKCRTETNVVPSESFDQDEGGVSE+GNRRKRLRS
Sbjct: 961  PDGQTAGMDRSQLGPIVHAKCRTETNVVPSESFDQDEGGVSEDGNRRKRLRS 1012

BLAST of CSPI01G13400 vs. ExPASy TrEMBL
Match: A0A5A7UC46 (Polyadenylation and cleavage factor-like protein 4 isoform X2 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold609G001150 PE=4 SV=1)

HSP 1 Score: 1912.9 bits (4954), Expect = 0.0e+00
Identity = 948/980 (96.73%), Positives = 961/980 (98.06%), Query Frame = 0

Query: 14   RGNPRNSVYPSDRPIPTTSGRTMPNELPQKPAPSIAHRFRAQLKQRDDEFRVSGHDVVPP 73
            RGNPRNS YPSDRPIPTTSGRTMPNELPQKP PSIAHRFRAQLKQRDDEFRVSGHDVVPP
Sbjct: 160  RGNPRNSAYPSDRPIPTTSGRTMPNELPQKPPPSIAHRFRAQLKQRDDEFRVSGHDVVPP 219

Query: 74   PTAEDIVQLYDLMLSELTFNSKPIITDLTVLADEQREHGKGIADLICARILEVPVDQKLP 133
            PTAEDIVQLYDLMLSELTFNSKPIITDLTVLADEQREHGKGIADLICARILEVPVDQKLP
Sbjct: 220  PTAEDIVQLYDLMLSELTFNSKPIITDLTVLADEQREHGKGIADLICARILEVPVDQKLP 279

Query: 134  SLYLLDSIVKNVGHEYISYFASRLPEVFCEAYRQVHPNLHNAMRHLFGTWATVFPPSIIR 193
            SLYLLDSIVKNVGHEYISYF+SRLPEVFCEAYRQVHPNLHNAMRHLFGTWATVFPPSIIR
Sbjct: 280  SLYLLDSIVKNVGHEYISYFSSRLPEVFCEAYRQVHPNLHNAMRHLFGTWATVFPPSIIR 339

Query: 194  KIEAQLSQLTAQESSGLTSSRASESPRPTHGIHVNPKYLRQLEHSVVDKHSQDSRGTSAI 253
            KIEAQLSQLTAQESSGLTSSRASESPRPTHGIHVNPKYLRQLEHSVVDKH+QDSRGTSAI
Sbjct: 340  KIEAQLSQLTAQESSGLTSSRASESPRPTHGIHVNPKYLRQLEHSVVDKHTQDSRGTSAI 399

Query: 254  KVHDKKLASGYEEYDYDHADALEHGGPQGFHSMGSMGHDSFSLGTNKANIKLAKSSLSSR 313
            KVHDKKLASGYEEYDYDHADALEHGG Q FHSMGSMGHDSFSLGTNKAN+KLAKSSLSSR
Sbjct: 400  KVHDKKLASGYEEYDYDHADALEHGGAQEFHSMGSMGHDSFSLGTNKANVKLAKSSLSSR 459

Query: 314  IGPHRPLQSVGDEHETVRASPSQNVYDYEGSKMIDRNEDTNKWRRKQYPDDNLNGLESTS 373
            IG HRPLQS+GDE E+VRASPSQNVYDYEGSK++DRNEDTNKWRRKQYPDDN+NGLE+TS
Sbjct: 460  IGHHRPLQSLGDELESVRASPSQNVYDYEGSKILDRNEDTNKWRRKQYPDDNMNGLENTS 519

Query: 374  SYNIRNGHALEGPRALIEAYGSDKGKGYLNDNPPQAEHFSINVIDNKATPVTWQNTEEEE 433
            SYNIRNGHALEGPRALIEAYGSDKGKGYLNDNPPQAEHFSI+ IDNKATPVTWQNTEEEE
Sbjct: 520  SYNIRNGHALEGPRALIEAYGSDKGKGYLNDNPPQAEHFSISGIDNKATPVTWQNTEEEE 579

Query: 434  FDWEDMSPTLADRGRNNDMLKPPVPPSRFRTRSGFERSNAMPIEPGMRSNWSSPVQLPGI 493
            FDWEDMSPTLADRGRNNDMLKP VPPSRFRTRSGFERSNAMPIEPGMRSNWSS VQLPGI
Sbjct: 580  FDWEDMSPTLADRGRNNDMLKPTVPPSRFRTRSGFERSNAMPIEPGMRSNWSSQVQLPGI 639

Query: 494  DSSIVIEDVAHSTPDNWNMHNHISQTSQNLMNNKGQGRNFQMPMLGRGITSSVGEKMSPY 553
            DSSIVIEDV HSTPD W MHNHISQTSQNLMNNKG GRNFQMPMLGRGITSS GEKMSPY
Sbjct: 640  DSSIVIEDVVHSTPDIWKMHNHISQTSQNLMNNKGPGRNFQMPMLGRGITSSGGEKMSPY 699

Query: 554  GDKLLTNDALHRPTNIASRLGSSGLDSSMESQSIVQSMGPRHPLNLSNSCPPSRPPIFPV 613
            GDKLLTNDALHRPTNIASRLGSSGLDS+MESQSIVQSMGPRHPLNLSNSCPPSRPP+FPV
Sbjct: 700  GDKLLTNDALHRPTNIASRLGSSGLDSNMESQSIVQSMGPRHPLNLSNSCPPSRPPVFPV 759

Query: 614  PRHNASQFESLNGSNSFMNCANRTFLPEQQMNNLRNKELSLTTKSPQVGNQHTGHIPLTR 673
            PRHN SQFESLNGSNSFMN ANRTFLPEQQMNNLRNKELSLTTKSPQVGNQHTGHIPLTR
Sbjct: 760  PRHNTSQFESLNGSNSFMNSANRTFLPEQQMNNLRNKELSLTTKSPQVGNQHTGHIPLTR 819

Query: 674  GNQLQGMPLKPQFLPSQDMQDNFSGSAVPPVLPHLMAPSLSQGYISQGHRPAISEGLSSS 733
            GNQLQ MPLKPQFLPSQDMQDNFSGSAVPPVLPHL+APSLSQGYISQGHRPA SEGLSSS
Sbjct: 820  GNQLQSMPLKPQFLPSQDMQDNFSGSAVPPVLPHLIAPSLSQGYISQGHRPANSEGLSSS 879

Query: 734  APIGQWNLSVHNSSSNPLHLQGGPLPPLPPGPHPTSGPTIPISQKVPGQQPGTAISGLIS 793
            APIGQWNLSVHNSSSNPLHLQGGPLPPLPPGPHPTSGPTIPISQKVPGQQPGTAISGLIS
Sbjct: 880  APIGQWNLSVHNSSSNPLHLQGGPLPPLPPGPHPTSGPTIPISQKVPGQQPGTAISGLIS 939

Query: 794  SLMARGLISLNNQASVQDSVGLEFNPDVLKVRHESAITALYADLPRQCMTCGLRFKTQEE 853
            SLMARGLISLNNQASVQDSVGLEFNPDVLKVRHESAITALYADLPRQCMTCGLRFKTQEE
Sbjct: 940  SLMARGLISLNNQASVQDSVGLEFNPDVLKVRHESAITALYADLPRQCMTCGLRFKTQEE 999

Query: 854  HSNHMDWHVTKNRMSKSRKQKPSRKWFVSISMWLSGAEALGTEAVPGFLPAEVVVEKKDD 913
            HSNHMDWHVTKNRMSKSRKQKPSRKWFVSISMWLSGAEALGTEAVPGFLPAEVVVEKKDD
Sbjct: 1000 HSNHMDWHVTKNRMSKSRKQKPSRKWFVSISMWLSGAEALGTEAVPGFLPAEVVVEKKDD 1059

Query: 914  EELAVPADEDQKTCALCGEPFEDFYSDETEEWMYRGAVYMNAPDGQTAGMDISQLGPIVH 973
            EELAVPADEDQKTCALCGEPFEDFYSDETEEWMYRGAVYMNAPDGQTAGMD SQLGPIVH
Sbjct: 1060 EELAVPADEDQKTCALCGEPFEDFYSDETEEWMYRGAVYMNAPDGQTAGMDRSQLGPIVH 1119

Query: 974  AKCRTETNVVPSESFDQDEG 994
            AKCRTETNVVPSESFDQDEG
Sbjct: 1120 AKCRTETNVVPSESFDQDEG 1139

BLAST of CSPI01G13400 vs. ExPASy TrEMBL
Match: A0A6J1EZ18 (polyadenylation and cleavage factor homolog 4-like isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111440036 PE=4 SV=1)

HSP 1 Score: 1675.2 bits (4337), Expect = 0.0e+00
Identity = 855/1008 (84.82%), Positives = 905/1008 (89.78%), Query Frame = 0

Query: 1    MTRFMESEKLLISRGNPRNSVYPSDRPIPTTSGRTMPNELPQKPAPSIAHRFRAQLKQRD 60
            M  FMESEKLLISRGNPR   Y SDRP+PTT+GR MPNELPQKP+PSIAHRFRAQLKQRD
Sbjct: 1    MNPFMESEKLLISRGNPRTLAYTSDRPLPTTTGRAMPNELPQKPSPSIAHRFRAQLKQRD 60

Query: 61   DEFRVSGHDVVPPPTAEDIVQLYDLMLSELTFNSKPIITDLTVLADEQREHGKGIADLIC 120
            DEFRVSG DV P PT EDIVQLY+LMLSELTFNSKPIITDLTVLA+EQREHGKGIADLIC
Sbjct: 61   DEFRVSGLDVAPLPTTEDIVQLYELMLSELTFNSKPIITDLTVLAEEQREHGKGIADLIC 120

Query: 121  ARILEVPVDQKLPSLYLLDSIVKNVGHEYISYFASRLPEVFCEAYRQVHPNLHNAMRHLF 180
            +RILEVPVDQKLPSLYLLDSIVKNVGHEYI+YF+SRLPEVFCEAYRQVHPNLHNAMRHLF
Sbjct: 121  SRILEVPVDQKLPSLYLLDSIVKNVGHEYINYFSSRLPEVFCEAYRQVHPNLHNAMRHLF 180

Query: 181  GTWATVFPPSIIRKIEAQLSQLTAQESSGLTSSRASESPRPTHGIHVNPKYLRQLEHSVV 240
            GTW+TVFPPSI+RKIEA+LSQLT QE+S LTSSRASESPRPTHGIHVNPKYLRQLEHSV 
Sbjct: 181  GTWSTVFPPSILRKIEARLSQLTTQETSALTSSRASESPRPTHGIHVNPKYLRQLEHSVG 240

Query: 241  DKHSQDSRGTSAIKVHDKKLASGYEEYDYDHADALEHGGPQGFHSMGSMGHDSFSLGTNK 300
            DKH  D+RG S +KVHDKKLA GYEEYDYDHAD LEHGG Q F+SMGSM HDSFSLGTNK
Sbjct: 241  DKHIPDARGASTLKVHDKKLAPGYEEYDYDHADGLEHGGSQAFNSMGSMSHDSFSLGTNK 300

Query: 301  ANIKLAKSSLSSRIGPHRPLQSVGDEHETVRASPSQNVYDYEGSKMIDRNEDTNKWRRKQ 360
            ANIKLAKSSLSSRIG +RPLQSVGDE E VRASPSQNVYDYEG +MI+RNEDTNKWRRKQ
Sbjct: 301  ANIKLAKSSLSSRIGHNRPLQSVGDELEAVRASPSQNVYDYEGFRMINRNEDTNKWRRKQ 360

Query: 361  YPDDNLNGLESTSSYNIRNGHALEGPRALIEAYGSDKGKGYLNDNPPQAEHFSINVIDNK 420
            YPDDNLNGLEST S+NIRNG ALEGPRALIEAYGSDKGKGYLNDNPPQAEHFS+N IDNK
Sbjct: 361  YPDDNLNGLEST-SFNIRNGCALEGPRALIEAYGSDKGKGYLNDNPPQAEHFSMNGIDNK 420

Query: 421  ATPVTWQNTEEEEFDWEDMSPTLADRGRNNDMLKPPVPPSRFRTRSGFERSNAMPIEPGM 480
             TPVTWQNTEEEEFDWEDMSPTLADRGR+NDMLKPPVPPSRFRTR GF+RSNAM IEPGM
Sbjct: 421  MTPVTWQNTEEEEFDWEDMSPTLADRGRSNDMLKPPVPPSRFRTRLGFDRSNAMSIEPGM 480

Query: 481  RSNWSSPVQLPGIDSSIVIEDVAHSTPDNWNMHNHISQTSQNLMNNKGQGRNFQMPMLGR 540
            RSN                     S  D W+MH+H+SQTSQNLM+ KG G NFQ+P+LGR
Sbjct: 481  RSN--------------------SSHQDAWSMHSHLSQTSQNLMSTKGTGGNFQIPLLGR 540

Query: 541  GITSSVGEKMSPYGDKLLTNDALHRPTNIASRLGSSGLDSSMESQSIVQSMGPRHPLNLS 600
            GI SS GEKMSP+ DKL TNDALHRPT +ASRLGSS LDSSMESQS+VQSMG RHP+NLS
Sbjct: 541  GIASSGGEKMSPFVDKLPTNDALHRPT-VASRLGSSALDSSMESQSVVQSMGQRHPVNLS 600

Query: 601  NSCPPSRPPIFPVPRHNASQFESLNGSNSFMNCANRTFLPEQQMNNLRNKELSLTTKSPQ 660
            +SCPPSRPP F VP HN SQFESLNGSN+F+N ANR+FLPEQQMNN+RNKELS TTKSPQ
Sbjct: 601  DSCPPSRPP-FHVPGHNKSQFESLNGSNAFINRANRSFLPEQQMNNVRNKELSHTTKSPQ 660

Query: 661  VGNQHTGHIPLTRGNQLQGMPLKPQFLPSQDMQDNFSGSAVPPVLPHLMAPSLSQGYISQ 720
            VGNQH G I LT+GNQLQ +PLKPQFLPSQDM D+FS SAVPPVLPHLMAPSLSQGY SQ
Sbjct: 661  VGNQHGGRILLTQGNQLQTIPLKPQFLPSQDMHDSFSASAVPPVLPHLMAPSLSQGYSSQ 720

Query: 721  GHRPAISEGLSSSAPIGQWNLSVHNSSSNPLHLQGGPLPPLPPGPHPTSGPTIPISQKVP 780
            G RP ISE LSSS PIGQWNL VHNS SNPLHLQ GPLPPLP GPHPT          VP
Sbjct: 721  GLRPGISECLSSSVPIGQWNLPVHNSPSNPLHLQ-GPLPPLPAGPHPTISQN--AGSLVP 780

Query: 781  GQQPGTAISGLISSLMARGLISLNNQASVQDSVGLEFNPDVLKVRHESAITALYADLPRQ 840
            GQQPGTA SGLISSLMA+GLISLNN+ASVQDSVG+EFNPDVLKVRH+SAITALYADLPRQ
Sbjct: 781  GQQPGTAFSGLISSLMAQGLISLNNKASVQDSVGVEFNPDVLKVRHDSAITALYADLPRQ 840

Query: 841  CMTCGLRFKTQEEHSNHMDWHVTKNRMSKSRKQKPSRKWFVSISMWLSGAEALGTEAVPG 900
            CMTCGLRFKTQEEHSNHMDWHVT+NRMSKSRKQKPSRKWFVS SMWLSGAEALGTEAVPG
Sbjct: 841  CMTCGLRFKTQEEHSNHMDWHVTRNRMSKSRKQKPSRKWFVSTSMWLSGAEALGTEAVPG 900

Query: 901  FLPAEVVVEKKDDEELAVPADEDQKTCALCGEPFEDFYSDETEEWMYRGAVYMNAPDGQT 960
            FLPAEV+VEKKDDEELAVPADEDQKTCALCGEPF+DFYSDETEEWMYRGAVYMNAPDGQT
Sbjct: 901  FLPAEVIVEKKDDEELAVPADEDQKTCALCGEPFDDFYSDETEEWMYRGAVYMNAPDGQT 960

Query: 961  AGMDISQLGPIVHAKCRTETNVVPSESFDQDE-GGVSEEGNRRKRLRS 1008
            AGMD SQLGPIVHAKCRTE+NVVPSESFDQDE  GVSEEG++RKRLRS
Sbjct: 961  AGMDRSQLGPIVHAKCRTESNVVPSESFDQDEQRGVSEEGSQRKRLRS 982

BLAST of CSPI01G13400 vs. NCBI nr
Match: XP_011653866.1 (polyadenylation and cleavage factor homolog 4 [Cucumis sativus] >XP_031739723.1 polyadenylation and cleavage factor homolog 4 [Cucumis sativus] >KGN64812.1 hypothetical protein Csa_013375 [Cucumis sativus])

HSP 1 Score: 2021.9 bits (5237), Expect = 0.0e+00
Identity = 1004/1007 (99.70%), Positives = 1005/1007 (99.80%), Query Frame = 0

Query: 1    MTRFMESEKLLISRGNPRNSVYPSDRPIPTTSGRTMPNELPQKPAPSIAHRFRAQLKQRD 60
            MTRFMESEKLLISRGNPRNSVYPSDRPIPTTSGRTMPNELPQKPAPSIAHRFRAQLKQRD
Sbjct: 1    MTRFMESEKLLISRGNPRNSVYPSDRPIPTTSGRTMPNELPQKPAPSIAHRFRAQLKQRD 60

Query: 61   DEFRVSGHDVVPPPTAEDIVQLYDLMLSELTFNSKPIITDLTVLADEQREHGKGIADLIC 120
            DEFRVSGHDVVP PTAEDIVQLYDLMLSELTFNSKPIITDLTVLADEQREHGKGIADLIC
Sbjct: 61   DEFRVSGHDVVPLPTAEDIVQLYDLMLSELTFNSKPIITDLTVLADEQREHGKGIADLIC 120

Query: 121  ARILEVPVDQKLPSLYLLDSIVKNVGHEYISYFASRLPEVFCEAYRQVHPNLHNAMRHLF 180
            ARILEVPVDQKLPSLYLLDSIVKNVGHEYISYFASRLPEVFCEAYRQVHPNLHNAMRHLF
Sbjct: 121  ARILEVPVDQKLPSLYLLDSIVKNVGHEYISYFASRLPEVFCEAYRQVHPNLHNAMRHLF 180

Query: 181  GTWATVFPPSIIRKIEAQLSQLTAQESSGLTSSRASESPRPTHGIHVNPKYLRQLEHSVV 240
            GTWATVFPPSIIRKIEAQLSQLTAQESSGLTSSRASESPRPTHGIHVNPKYLRQLEHSVV
Sbjct: 181  GTWATVFPPSIIRKIEAQLSQLTAQESSGLTSSRASESPRPTHGIHVNPKYLRQLEHSVV 240

Query: 241  DKHSQDSRGTSAIKVHDKKLASGYEEYDYDHADALEHGGPQGFHSMGSMGHDSFSLGTNK 300
            DKHSQDSRGTSAIKVHDKKLASGYEEYDYDHADALEHGGPQGFHSMGSMGHDSFSLGTNK
Sbjct: 241  DKHSQDSRGTSAIKVHDKKLASGYEEYDYDHADALEHGGPQGFHSMGSMGHDSFSLGTNK 300

Query: 301  ANIKLAKSSLSSRIGPHRPLQSVGDEHETVRASPSQNVYDYEGSKMIDRNEDTNKWRRKQ 360
            ANIKLAKSSLSSRIGPHRPLQSVGDEHETVRASPSQNVYDYEGSKMIDRNEDTNKWRRKQ
Sbjct: 301  ANIKLAKSSLSSRIGPHRPLQSVGDEHETVRASPSQNVYDYEGSKMIDRNEDTNKWRRKQ 360

Query: 361  YPDDNLNGLESTSSYNIRNGHALEGPRALIEAYGSDKGKGYLNDNPPQAEHFSINVIDNK 420
            YPDDNLNGLESTSSYNIRNGHALEGPRALIEAYGSDKGKGYLNDNPPQAEHFSINVIDNK
Sbjct: 361  YPDDNLNGLESTSSYNIRNGHALEGPRALIEAYGSDKGKGYLNDNPPQAEHFSINVIDNK 420

Query: 421  ATPVTWQNTEEEEFDWEDMSPTLADRGRNNDMLKPPVPPSRFRTRSGFERSNAMPIEPGM 480
            ATPVTWQNTEEEEFDWEDMSPTLADRGRNNDMLKPPVPPSRFRTRSGFERSNAMPIEPGM
Sbjct: 421  ATPVTWQNTEEEEFDWEDMSPTLADRGRNNDMLKPPVPPSRFRTRSGFERSNAMPIEPGM 480

Query: 481  RSNWSSPVQLPGIDSSIVIEDVAHSTPDNWNMHNHISQTSQNLMNNKGQGRNFQMPMLGR 540
            RSNWSSPV+LPGIDSSIVIEDV HSTPDNWNMHNHISQTSQNLMNNKGQGRNFQMPMLGR
Sbjct: 481  RSNWSSPVRLPGIDSSIVIEDVVHSTPDNWNMHNHISQTSQNLMNNKGQGRNFQMPMLGR 540

Query: 541  GITSSVGEKMSPYGDKLLTNDALHRPTNIASRLGSSGLDSSMESQSIVQSMGPRHPLNLS 600
            GITSSVGEKMSPYGDKLLTNDALHRPTNIASRLGSSGLDSSMESQSIVQSMGPRHPLNLS
Sbjct: 541  GITSSVGEKMSPYGDKLLTNDALHRPTNIASRLGSSGLDSSMESQSIVQSMGPRHPLNLS 600

Query: 601  NSCPPSRPPIFPVPRHNASQFESLNGSNSFMNCANRTFLPEQQMNNLRNKELSLTTKSPQ 660
            NSCPPSRPPIFPVPRHNASQFESLNGSNSFMNCANRTFLPEQQMNNLRNKELSLTTKSPQ
Sbjct: 601  NSCPPSRPPIFPVPRHNASQFESLNGSNSFMNCANRTFLPEQQMNNLRNKELSLTTKSPQ 660

Query: 661  VGNQHTGHIPLTRGNQLQGMPLKPQFLPSQDMQDNFSGSAVPPVLPHLMAPSLSQGYISQ 720
            VGNQHTGHIPLTRGNQLQGMPLKPQFLPSQDMQDNFSGSAVPPVLPHLMAPSLSQGYISQ
Sbjct: 661  VGNQHTGHIPLTRGNQLQGMPLKPQFLPSQDMQDNFSGSAVPPVLPHLMAPSLSQGYISQ 720

Query: 721  GHRPAISEGLSSSAPIGQWNLSVHNSSSNPLHLQGGPLPPLPPGPHPTSGPTIPISQKVP 780
            GHRPAISEGLSSSAPIGQWNLSVHNSSSNPLHLQGGPLPPLPPGPHPTSGPTIPISQKVP
Sbjct: 721  GHRPAISEGLSSSAPIGQWNLSVHNSSSNPLHLQGGPLPPLPPGPHPTSGPTIPISQKVP 780

Query: 781  GQQPGTAISGLISSLMARGLISLNNQASVQDSVGLEFNPDVLKVRHESAITALYADLPRQ 840
            GQQPGTAISGLISSLMARGLISLNNQASVQDSVGLEFNPDVLKVRHESAITALYADLPRQ
Sbjct: 781  GQQPGTAISGLISSLMARGLISLNNQASVQDSVGLEFNPDVLKVRHESAITALYADLPRQ 840

Query: 841  CMTCGLRFKTQEEHSNHMDWHVTKNRMSKSRKQKPSRKWFVSISMWLSGAEALGTEAVPG 900
            CMTCGLRFKTQEEHSNHMDWHVTKNRMSKSRKQKPSRKWFVSISMWLSGAEALGTEAVPG
Sbjct: 841  CMTCGLRFKTQEEHSNHMDWHVTKNRMSKSRKQKPSRKWFVSISMWLSGAEALGTEAVPG 900

Query: 901  FLPAEVVVEKKDDEELAVPADEDQKTCALCGEPFEDFYSDETEEWMYRGAVYMNAPDGQT 960
            FLPAEVVVEKKDDEELAVPADEDQKTCALCGEPFEDFYSDETEEWMYRGAVYMNAPDGQT
Sbjct: 901  FLPAEVVVEKKDDEELAVPADEDQKTCALCGEPFEDFYSDETEEWMYRGAVYMNAPDGQT 960

Query: 961  AGMDISQLGPIVHAKCRTETNVVPSESFDQDEGGVSEEGNRRKRLRS 1008
            AGMDISQLGPIVHAKCRTETNVVPSESFDQDEGGVSEEGNRRKRLRS
Sbjct: 961  AGMDISQLGPIVHAKCRTETNVVPSESFDQDEGGVSEEGNRRKRLRS 1007

BLAST of CSPI01G13400 vs. NCBI nr
Match: XP_008462986.1 (PREDICTED: polyadenylation and cleavage factor homolog 4 isoform X2 [Cucumis melo])

HSP 1 Score: 1961.4 bits (5080), Expect = 0.0e+00
Identity = 974/1007 (96.72%), Positives = 988/1007 (98.11%), Query Frame = 0

Query: 1    MTRFMESEKLLISRGNPRNSVYPSDRPIPTTSGRTMPNELPQKPAPSIAHRFRAQLKQRD 60
            MTRFMESEKLLISRGNPRNS YPSDRPIPTTSGRTMPNELPQKP PSIAHRFRAQLKQRD
Sbjct: 1    MTRFMESEKLLISRGNPRNSAYPSDRPIPTTSGRTMPNELPQKPPPSIAHRFRAQLKQRD 60

Query: 61   DEFRVSGHDVVPPPTAEDIVQLYDLMLSELTFNSKPIITDLTVLADEQREHGKGIADLIC 120
            DEFRVSGHDVVPPPTAEDIVQLYDLMLSELTFNSKPIITDLTVLADEQREHGKGIADLIC
Sbjct: 61   DEFRVSGHDVVPPPTAEDIVQLYDLMLSELTFNSKPIITDLTVLADEQREHGKGIADLIC 120

Query: 121  ARILEVPVDQKLPSLYLLDSIVKNVGHEYISYFASRLPEVFCEAYRQVHPNLHNAMRHLF 180
            ARILEVPVDQKLPSLYLLDSIVKNVGHEYISYF+SRLPEVFCEAYRQVHPNLHNAMRHLF
Sbjct: 121  ARILEVPVDQKLPSLYLLDSIVKNVGHEYISYFSSRLPEVFCEAYRQVHPNLHNAMRHLF 180

Query: 181  GTWATVFPPSIIRKIEAQLSQLTAQESSGLTSSRASESPRPTHGIHVNPKYLRQLEHSVV 240
            GTWATVFPPSIIRKIEAQLSQLTAQESSGLTSSRASESPRPTHGIHVNPKYLRQLEHSVV
Sbjct: 181  GTWATVFPPSIIRKIEAQLSQLTAQESSGLTSSRASESPRPTHGIHVNPKYLRQLEHSVV 240

Query: 241  DKHSQDSRGTSAIKVHDKKLASGYEEYDYDHADALEHGGPQGFHSMGSMGHDSFSLGTNK 300
            DKH+QDSRGTSAIKVHDKKLASGYEEYDYDHADALEHGG Q FHSMGSMGHDSFSLGTNK
Sbjct: 241  DKHTQDSRGTSAIKVHDKKLASGYEEYDYDHADALEHGGAQEFHSMGSMGHDSFSLGTNK 300

Query: 301  ANIKLAKSSLSSRIGPHRPLQSVGDEHETVRASPSQNVYDYEGSKMIDRNEDTNKWRRKQ 360
            AN+KLAKSSLSSRIG HRPLQS+GDE E+VRASPSQNVYDYEGSK++DRNEDTNKWRRKQ
Sbjct: 301  ANVKLAKSSLSSRIGHHRPLQSLGDELESVRASPSQNVYDYEGSKILDRNEDTNKWRRKQ 360

Query: 361  YPDDNLNGLESTSSYNIRNGHALEGPRALIEAYGSDKGKGYLNDNPPQAEHFSINVIDNK 420
            YPDDN+NGLE+TSSYNIRNGHALEGPRALIEAYGSDKGKGYLNDNPPQAEHFSI+ IDNK
Sbjct: 361  YPDDNMNGLENTSSYNIRNGHALEGPRALIEAYGSDKGKGYLNDNPPQAEHFSISGIDNK 420

Query: 421  ATPVTWQNTEEEEFDWEDMSPTLADRGRNNDMLKPPVPPSRFRTRSGFERSNAMPIEPGM 480
            ATPVTWQNTEEEEFDWEDMSPTLADRGRNNDMLKP VPPSRFRTRSGFERSNAMPIEPGM
Sbjct: 421  ATPVTWQNTEEEEFDWEDMSPTLADRGRNNDMLKPTVPPSRFRTRSGFERSNAMPIEPGM 480

Query: 481  RSNWSSPVQLPGIDSSIVIEDVAHSTPDNWNMHNHISQTSQNLMNNKGQGRNFQMPMLGR 540
            RSNWSS VQLPGIDSSIVIEDV HSTPD W MHNHISQTSQNLMNNKG GRNFQMPMLGR
Sbjct: 481  RSNWSSQVQLPGIDSSIVIEDVVHSTPDIWKMHNHISQTSQNLMNNKGPGRNFQMPMLGR 540

Query: 541  GITSSVGEKMSPYGDKLLTNDALHRPTNIASRLGSSGLDSSMESQSIVQSMGPRHPLNLS 600
            GITSS GEKMSPYGDKLLTNDALHRPTNIASRLGSSGLDS+MESQSIVQSMGPRHPLNLS
Sbjct: 541  GITSSGGEKMSPYGDKLLTNDALHRPTNIASRLGSSGLDSNMESQSIVQSMGPRHPLNLS 600

Query: 601  NSCPPSRPPIFPVPRHNASQFESLNGSNSFMNCANRTFLPEQQMNNLRNKELSLTTKSPQ 660
            NSCPPSRPP+FPVPRHN SQFESLNGSNSFMN ANRTFLPEQQMNNLRNKELSLTTKSPQ
Sbjct: 601  NSCPPSRPPVFPVPRHNTSQFESLNGSNSFMNSANRTFLPEQQMNNLRNKELSLTTKSPQ 660

Query: 661  VGNQHTGHIPLTRGNQLQGMPLKPQFLPSQDMQDNFSGSAVPPVLPHLMAPSLSQGYISQ 720
            VGNQHTGHIPLTRGNQLQ MPLKPQFLPSQDMQDNFSGSAVPPVLPHL+APSLSQGYISQ
Sbjct: 661  VGNQHTGHIPLTRGNQLQSMPLKPQFLPSQDMQDNFSGSAVPPVLPHLIAPSLSQGYISQ 720

Query: 721  GHRPAISEGLSSSAPIGQWNLSVHNSSSNPLHLQGGPLPPLPPGPHPTSGPTIPISQKVP 780
            GHRPA SEGLSSSAPIGQWNLSVHNSSSNPLHLQGGPLPPLPPGPHPTSGPTIPISQKVP
Sbjct: 721  GHRPANSEGLSSSAPIGQWNLSVHNSSSNPLHLQGGPLPPLPPGPHPTSGPTIPISQKVP 780

Query: 781  GQQPGTAISGLISSLMARGLISLNNQASVQDSVGLEFNPDVLKVRHESAITALYADLPRQ 840
            GQQPGTAISGLISSLMARGLISLNNQASVQDSVGLEFNPDVLKVRHESAITALYADLPRQ
Sbjct: 781  GQQPGTAISGLISSLMARGLISLNNQASVQDSVGLEFNPDVLKVRHESAITALYADLPRQ 840

Query: 841  CMTCGLRFKTQEEHSNHMDWHVTKNRMSKSRKQKPSRKWFVSISMWLSGAEALGTEAVPG 900
            CMTCGLRFKTQEEHSNHMDWHVTKNRMSKSRKQKPSRKWFVSISMWLSGAEALGTEAVPG
Sbjct: 841  CMTCGLRFKTQEEHSNHMDWHVTKNRMSKSRKQKPSRKWFVSISMWLSGAEALGTEAVPG 900

Query: 901  FLPAEVVVEKKDDEELAVPADEDQKTCALCGEPFEDFYSDETEEWMYRGAVYMNAPDGQT 960
            FLPAEVVVEKKDDEELAVPADEDQKTCALCGEPFEDFYSDETEEWMYRGAVYMNAPDGQT
Sbjct: 901  FLPAEVVVEKKDDEELAVPADEDQKTCALCGEPFEDFYSDETEEWMYRGAVYMNAPDGQT 960

Query: 961  AGMDISQLGPIVHAKCRTETNVVPSESFDQDEGGVSEEGNRRKRLRS 1008
            AGMD SQLGPIVHAKCRTETNVVPSESFDQDEGGVSE+GNRRKRLRS
Sbjct: 961  AGMDRSQLGPIVHAKCRTETNVVPSESFDQDEGGVSEDGNRRKRLRS 1007

BLAST of CSPI01G13400 vs. NCBI nr
Match: XP_008462960.1 (PREDICTED: polyadenylation and cleavage factor homolog 4 isoform X1 [Cucumis melo] >XP_008462968.1 PREDICTED: polyadenylation and cleavage factor homolog 4 isoform X1 [Cucumis melo])

HSP 1 Score: 1955.3 bits (5064), Expect = 0.0e+00
Identity = 974/1012 (96.25%), Positives = 988/1012 (97.63%), Query Frame = 0

Query: 1    MTRFMESEKLLISRGNPRNSVYPSDRPIPTTSGRTMPNELPQKPAPSIAHRFRAQLKQRD 60
            MTRFMESEKLLISRGNPRNS YPSDRPIPTTSGRTMPNELPQKP PSIAHRFRAQLKQRD
Sbjct: 1    MTRFMESEKLLISRGNPRNSAYPSDRPIPTTSGRTMPNELPQKPPPSIAHRFRAQLKQRD 60

Query: 61   DEFRVSGHDVVPPPTAEDIVQLYDLMLSELTFNSKPIITDLTVLADEQREHGKGIADLIC 120
            DEFRVSGHDVVPPPTAEDIVQLYDLMLSELTFNSKPIITDLTVLADEQREHGKGIADLIC
Sbjct: 61   DEFRVSGHDVVPPPTAEDIVQLYDLMLSELTFNSKPIITDLTVLADEQREHGKGIADLIC 120

Query: 121  ARILEVPVDQKLPSLYLLDSIVKNVGHEYISYFASRLPEVFCEAYRQVHPNLHNAMRHLF 180
            ARILEVPVDQKLPSLYLLDSIVKNVGHEYISYF+SRLPEVFCEAYRQVHPNLHNAMRHLF
Sbjct: 121  ARILEVPVDQKLPSLYLLDSIVKNVGHEYISYFSSRLPEVFCEAYRQVHPNLHNAMRHLF 180

Query: 181  GTWATVFPPSIIRKIEAQLSQLTAQESSGLTSSRASESPRPTHGIHVNPKYLRQLEHSVV 240
            GTWATVFPPSIIRKIEAQLSQLTAQESSGLTSSRASESPRPTHGIHVNPKYLRQLEHSVV
Sbjct: 181  GTWATVFPPSIIRKIEAQLSQLTAQESSGLTSSRASESPRPTHGIHVNPKYLRQLEHSVV 240

Query: 241  DK-----HSQDSRGTSAIKVHDKKLASGYEEYDYDHADALEHGGPQGFHSMGSMGHDSFS 300
            DK     H+QDSRGTSAIKVHDKKLASGYEEYDYDHADALEHGG Q FHSMGSMGHDSFS
Sbjct: 241  DKLLALQHTQDSRGTSAIKVHDKKLASGYEEYDYDHADALEHGGAQEFHSMGSMGHDSFS 300

Query: 301  LGTNKANIKLAKSSLSSRIGPHRPLQSVGDEHETVRASPSQNVYDYEGSKMIDRNEDTNK 360
            LGTNKAN+KLAKSSLSSRIG HRPLQS+GDE E+VRASPSQNVYDYEGSK++DRNEDTNK
Sbjct: 301  LGTNKANVKLAKSSLSSRIGHHRPLQSLGDELESVRASPSQNVYDYEGSKILDRNEDTNK 360

Query: 361  WRRKQYPDDNLNGLESTSSYNIRNGHALEGPRALIEAYGSDKGKGYLNDNPPQAEHFSIN 420
            WRRKQYPDDN+NGLE+TSSYNIRNGHALEGPRALIEAYGSDKGKGYLNDNPPQAEHFSI+
Sbjct: 361  WRRKQYPDDNMNGLENTSSYNIRNGHALEGPRALIEAYGSDKGKGYLNDNPPQAEHFSIS 420

Query: 421  VIDNKATPVTWQNTEEEEFDWEDMSPTLADRGRNNDMLKPPVPPSRFRTRSGFERSNAMP 480
             IDNKATPVTWQNTEEEEFDWEDMSPTLADRGRNNDMLKP VPPSRFRTRSGFERSNAMP
Sbjct: 421  GIDNKATPVTWQNTEEEEFDWEDMSPTLADRGRNNDMLKPTVPPSRFRTRSGFERSNAMP 480

Query: 481  IEPGMRSNWSSPVQLPGIDSSIVIEDVAHSTPDNWNMHNHISQTSQNLMNNKGQGRNFQM 540
            IEPGMRSNWSS VQLPGIDSSIVIEDV HSTPD W MHNHISQTSQNLMNNKG GRNFQM
Sbjct: 481  IEPGMRSNWSSQVQLPGIDSSIVIEDVVHSTPDIWKMHNHISQTSQNLMNNKGPGRNFQM 540

Query: 541  PMLGRGITSSVGEKMSPYGDKLLTNDALHRPTNIASRLGSSGLDSSMESQSIVQSMGPRH 600
            PMLGRGITSS GEKMSPYGDKLLTNDALHRPTNIASRLGSSGLDS+MESQSIVQSMGPRH
Sbjct: 541  PMLGRGITSSGGEKMSPYGDKLLTNDALHRPTNIASRLGSSGLDSNMESQSIVQSMGPRH 600

Query: 601  PLNLSNSCPPSRPPIFPVPRHNASQFESLNGSNSFMNCANRTFLPEQQMNNLRNKELSLT 660
            PLNLSNSCPPSRPP+FPVPRHN SQFESLNGSNSFMN ANRTFLPEQQMNNLRNKELSLT
Sbjct: 601  PLNLSNSCPPSRPPVFPVPRHNTSQFESLNGSNSFMNSANRTFLPEQQMNNLRNKELSLT 660

Query: 661  TKSPQVGNQHTGHIPLTRGNQLQGMPLKPQFLPSQDMQDNFSGSAVPPVLPHLMAPSLSQ 720
            TKSPQVGNQHTGHIPLTRGNQLQ MPLKPQFLPSQDMQDNFSGSAVPPVLPHL+APSLSQ
Sbjct: 661  TKSPQVGNQHTGHIPLTRGNQLQSMPLKPQFLPSQDMQDNFSGSAVPPVLPHLIAPSLSQ 720

Query: 721  GYISQGHRPAISEGLSSSAPIGQWNLSVHNSSSNPLHLQGGPLPPLPPGPHPTSGPTIPI 780
            GYISQGHRPA SEGLSSSAPIGQWNLSVHNSSSNPLHLQGGPLPPLPPGPHPTSGPTIPI
Sbjct: 721  GYISQGHRPANSEGLSSSAPIGQWNLSVHNSSSNPLHLQGGPLPPLPPGPHPTSGPTIPI 780

Query: 781  SQKVPGQQPGTAISGLISSLMARGLISLNNQASVQDSVGLEFNPDVLKVRHESAITALYA 840
            SQKVPGQQPGTAISGLISSLMARGLISLNNQASVQDSVGLEFNPDVLKVRHESAITALYA
Sbjct: 781  SQKVPGQQPGTAISGLISSLMARGLISLNNQASVQDSVGLEFNPDVLKVRHESAITALYA 840

Query: 841  DLPRQCMTCGLRFKTQEEHSNHMDWHVTKNRMSKSRKQKPSRKWFVSISMWLSGAEALGT 900
            DLPRQCMTCGLRFKTQEEHSNHMDWHVTKNRMSKSRKQKPSRKWFVSISMWLSGAEALGT
Sbjct: 841  DLPRQCMTCGLRFKTQEEHSNHMDWHVTKNRMSKSRKQKPSRKWFVSISMWLSGAEALGT 900

Query: 901  EAVPGFLPAEVVVEKKDDEELAVPADEDQKTCALCGEPFEDFYSDETEEWMYRGAVYMNA 960
            EAVPGFLPAEVVVEKKDDEELAVPADEDQKTCALCGEPFEDFYSDETEEWMYRGAVYMNA
Sbjct: 901  EAVPGFLPAEVVVEKKDDEELAVPADEDQKTCALCGEPFEDFYSDETEEWMYRGAVYMNA 960

Query: 961  PDGQTAGMDISQLGPIVHAKCRTETNVVPSESFDQDEGGVSEEGNRRKRLRS 1008
            PDGQTAGMD SQLGPIVHAKCRTETNVVPSESFDQDEGGVSE+GNRRKRLRS
Sbjct: 961  PDGQTAGMDRSQLGPIVHAKCRTETNVVPSESFDQDEGGVSEDGNRRKRLRS 1012

BLAST of CSPI01G13400 vs. NCBI nr
Match: KAA0051796.1 (polyadenylation and cleavage factor-like protein 4 isoform X2 [Cucumis melo var. makuwa] >TYK21445.1 polyadenylation and cleavage factor-like protein 4 isoform X2 [Cucumis melo var. makuwa])

HSP 1 Score: 1912.9 bits (4954), Expect = 0.0e+00
Identity = 948/980 (96.73%), Positives = 961/980 (98.06%), Query Frame = 0

Query: 14   RGNPRNSVYPSDRPIPTTSGRTMPNELPQKPAPSIAHRFRAQLKQRDDEFRVSGHDVVPP 73
            RGNPRNS YPSDRPIPTTSGRTMPNELPQKP PSIAHRFRAQLKQRDDEFRVSGHDVVPP
Sbjct: 160  RGNPRNSAYPSDRPIPTTSGRTMPNELPQKPPPSIAHRFRAQLKQRDDEFRVSGHDVVPP 219

Query: 74   PTAEDIVQLYDLMLSELTFNSKPIITDLTVLADEQREHGKGIADLICARILEVPVDQKLP 133
            PTAEDIVQLYDLMLSELTFNSKPIITDLTVLADEQREHGKGIADLICARILEVPVDQKLP
Sbjct: 220  PTAEDIVQLYDLMLSELTFNSKPIITDLTVLADEQREHGKGIADLICARILEVPVDQKLP 279

Query: 134  SLYLLDSIVKNVGHEYISYFASRLPEVFCEAYRQVHPNLHNAMRHLFGTWATVFPPSIIR 193
            SLYLLDSIVKNVGHEYISYF+SRLPEVFCEAYRQVHPNLHNAMRHLFGTWATVFPPSIIR
Sbjct: 280  SLYLLDSIVKNVGHEYISYFSSRLPEVFCEAYRQVHPNLHNAMRHLFGTWATVFPPSIIR 339

Query: 194  KIEAQLSQLTAQESSGLTSSRASESPRPTHGIHVNPKYLRQLEHSVVDKHSQDSRGTSAI 253
            KIEAQLSQLTAQESSGLTSSRASESPRPTHGIHVNPKYLRQLEHSVVDKH+QDSRGTSAI
Sbjct: 340  KIEAQLSQLTAQESSGLTSSRASESPRPTHGIHVNPKYLRQLEHSVVDKHTQDSRGTSAI 399

Query: 254  KVHDKKLASGYEEYDYDHADALEHGGPQGFHSMGSMGHDSFSLGTNKANIKLAKSSLSSR 313
            KVHDKKLASGYEEYDYDHADALEHGG Q FHSMGSMGHDSFSLGTNKAN+KLAKSSLSSR
Sbjct: 400  KVHDKKLASGYEEYDYDHADALEHGGAQEFHSMGSMGHDSFSLGTNKANVKLAKSSLSSR 459

Query: 314  IGPHRPLQSVGDEHETVRASPSQNVYDYEGSKMIDRNEDTNKWRRKQYPDDNLNGLESTS 373
            IG HRPLQS+GDE E+VRASPSQNVYDYEGSK++DRNEDTNKWRRKQYPDDN+NGLE+TS
Sbjct: 460  IGHHRPLQSLGDELESVRASPSQNVYDYEGSKILDRNEDTNKWRRKQYPDDNMNGLENTS 519

Query: 374  SYNIRNGHALEGPRALIEAYGSDKGKGYLNDNPPQAEHFSINVIDNKATPVTWQNTEEEE 433
            SYNIRNGHALEGPRALIEAYGSDKGKGYLNDNPPQAEHFSI+ IDNKATPVTWQNTEEEE
Sbjct: 520  SYNIRNGHALEGPRALIEAYGSDKGKGYLNDNPPQAEHFSISGIDNKATPVTWQNTEEEE 579

Query: 434  FDWEDMSPTLADRGRNNDMLKPPVPPSRFRTRSGFERSNAMPIEPGMRSNWSSPVQLPGI 493
            FDWEDMSPTLADRGRNNDMLKP VPPSRFRTRSGFERSNAMPIEPGMRSNWSS VQLPGI
Sbjct: 580  FDWEDMSPTLADRGRNNDMLKPTVPPSRFRTRSGFERSNAMPIEPGMRSNWSSQVQLPGI 639

Query: 494  DSSIVIEDVAHSTPDNWNMHNHISQTSQNLMNNKGQGRNFQMPMLGRGITSSVGEKMSPY 553
            DSSIVIEDV HSTPD W MHNHISQTSQNLMNNKG GRNFQMPMLGRGITSS GEKMSPY
Sbjct: 640  DSSIVIEDVVHSTPDIWKMHNHISQTSQNLMNNKGPGRNFQMPMLGRGITSSGGEKMSPY 699

Query: 554  GDKLLTNDALHRPTNIASRLGSSGLDSSMESQSIVQSMGPRHPLNLSNSCPPSRPPIFPV 613
            GDKLLTNDALHRPTNIASRLGSSGLDS+MESQSIVQSMGPRHPLNLSNSCPPSRPP+FPV
Sbjct: 700  GDKLLTNDALHRPTNIASRLGSSGLDSNMESQSIVQSMGPRHPLNLSNSCPPSRPPVFPV 759

Query: 614  PRHNASQFESLNGSNSFMNCANRTFLPEQQMNNLRNKELSLTTKSPQVGNQHTGHIPLTR 673
            PRHN SQFESLNGSNSFMN ANRTFLPEQQMNNLRNKELSLTTKSPQVGNQHTGHIPLTR
Sbjct: 760  PRHNTSQFESLNGSNSFMNSANRTFLPEQQMNNLRNKELSLTTKSPQVGNQHTGHIPLTR 819

Query: 674  GNQLQGMPLKPQFLPSQDMQDNFSGSAVPPVLPHLMAPSLSQGYISQGHRPAISEGLSSS 733
            GNQLQ MPLKPQFLPSQDMQDNFSGSAVPPVLPHL+APSLSQGYISQGHRPA SEGLSSS
Sbjct: 820  GNQLQSMPLKPQFLPSQDMQDNFSGSAVPPVLPHLIAPSLSQGYISQGHRPANSEGLSSS 879

Query: 734  APIGQWNLSVHNSSSNPLHLQGGPLPPLPPGPHPTSGPTIPISQKVPGQQPGTAISGLIS 793
            APIGQWNLSVHNSSSNPLHLQGGPLPPLPPGPHPTSGPTIPISQKVPGQQPGTAISGLIS
Sbjct: 880  APIGQWNLSVHNSSSNPLHLQGGPLPPLPPGPHPTSGPTIPISQKVPGQQPGTAISGLIS 939

Query: 794  SLMARGLISLNNQASVQDSVGLEFNPDVLKVRHESAITALYADLPRQCMTCGLRFKTQEE 853
            SLMARGLISLNNQASVQDSVGLEFNPDVLKVRHESAITALYADLPRQCMTCGLRFKTQEE
Sbjct: 940  SLMARGLISLNNQASVQDSVGLEFNPDVLKVRHESAITALYADLPRQCMTCGLRFKTQEE 999

Query: 854  HSNHMDWHVTKNRMSKSRKQKPSRKWFVSISMWLSGAEALGTEAVPGFLPAEVVVEKKDD 913
            HSNHMDWHVTKNRMSKSRKQKPSRKWFVSISMWLSGAEALGTEAVPGFLPAEVVVEKKDD
Sbjct: 1000 HSNHMDWHVTKNRMSKSRKQKPSRKWFVSISMWLSGAEALGTEAVPGFLPAEVVVEKKDD 1059

Query: 914  EELAVPADEDQKTCALCGEPFEDFYSDETEEWMYRGAVYMNAPDGQTAGMDISQLGPIVH 973
            EELAVPADEDQKTCALCGEPFEDFYSDETEEWMYRGAVYMNAPDGQTAGMD SQLGPIVH
Sbjct: 1060 EELAVPADEDQKTCALCGEPFEDFYSDETEEWMYRGAVYMNAPDGQTAGMDRSQLGPIVH 1119

Query: 974  AKCRTETNVVPSESFDQDEG 994
            AKCRTETNVVPSESFDQDEG
Sbjct: 1120 AKCRTETNVVPSESFDQDEG 1139

BLAST of CSPI01G13400 vs. NCBI nr
Match: XP_038894060.1 (polyadenylation and cleavage factor homolog 4 isoform X3 [Benincasa hispida])

HSP 1 Score: 1824.3 bits (4724), Expect = 0.0e+00
Identity = 920/1012 (90.91%), Positives = 944/1012 (93.28%), Query Frame = 0

Query: 1    MTRFMESEKLLISRGNPRNSVYPSDRPIPTTSGRTMPNELPQKPAPSIAHRFRAQLKQRD 60
            MT FMESEKLLISRGNPRNS YPSDR +PTTSGRTMPNELPQKP PSIAHRFRAQLKQRD
Sbjct: 1    MTPFMESEKLLISRGNPRNSAYPSDRQLPTTSGRTMPNELPQKPPPSIAHRFRAQLKQRD 60

Query: 61   DEFRVSGHDVVPPPTAEDIVQLYDLMLSELTFNSKPIITDLTVLADEQREHGKGIADLIC 120
            DEFRVSGHDVVPPPTAEDIVQLYDLMLSELTFNSKPIITDLTVLADEQREHGKGIADLIC
Sbjct: 61   DEFRVSGHDVVPPPTAEDIVQLYDLMLSELTFNSKPIITDLTVLADEQREHGKGIADLIC 120

Query: 121  ARILEVPVDQKLPSLYLLDSIVKNVGHEYISYFASRLPEVFCEAYRQVHPNLHNAMRHLF 180
            ARILEVPV+QKLPSLYLLDSIVKNVG EYISYF+SRLPEVFCEAYRQVHPNLHNAMRHLF
Sbjct: 121  ARILEVPVEQKLPSLYLLDSIVKNVGQEYISYFSSRLPEVFCEAYRQVHPNLHNAMRHLF 180

Query: 181  GTWATVFPPSIIRKIEAQLSQLTAQESSGLTSSRASESPRPTHGIHVNPKYLRQLEHSVV 240
            GTWATVFPPSIIRKIEAQLSQLTAQESS LTSSRASESPRPTHGIHVNPKYLRQLEHSVV
Sbjct: 181  GTWATVFPPSIIRKIEAQLSQLTAQESSSLTSSRASESPRPTHGIHVNPKYLRQLEHSVV 240

Query: 241  DKHSQDSRGTSAIKVHDKKLASGYEEYDYDHADALEHGGPQGFHSMGSMGHDSFSLGTNK 300
            DK   D+RG SA+KVHDKKLASGYEEYDYDHA+ LEHGG Q FH + SM HDSF+LGTNK
Sbjct: 241  DKQIHDARGVSALKVHDKKLASGYEEYDYDHAEVLEHGGAQAFH-LRSMAHDSFALGTNK 300

Query: 301  ANIKLAKSSLSSRIGPHRPLQSVGDEHETVRASPSQNVYDYEGSKMIDRNEDTNKWRRKQ 360
            ANIKLAKSS SSRIG +RPLQS GDE E VRASPSQNVYDYEGS+MIDR EDTNKWRRKQ
Sbjct: 301  ANIKLAKSSPSSRIGHNRPLQSAGDELEAVRASPSQNVYDYEGSRMIDRIEDTNKWRRKQ 360

Query: 361  YPDDNLNGLESTSSYNIRNGHALEGPRALIEAYGSDKGKGYLNDNPPQAEHFSINVIDNK 420
            YPDDNLNGLEST SYNIRNGHALEGPRALIEAYGSDKGKGYLNDNPPQAEHFSIN IDNK
Sbjct: 361  YPDDNLNGLEST-SYNIRNGHALEGPRALIEAYGSDKGKGYLNDNPPQAEHFSINGIDNK 420

Query: 421  ATPVTWQNTEEEEFDWEDMSPTLADRGRNNDMLKPPVPPSRFRTRSGFERSNAMPIEPGM 480
             TPVTWQNTEEEEFDWEDMSPTLADRGRNNDMLKP VPPSRF TR+GFERSNAM IEPGM
Sbjct: 421  VTPVTWQNTEEEEFDWEDMSPTLADRGRNNDMLKPSVPPSRFVTRTGFERSNAMSIEPGM 480

Query: 481  RSNWSSPVQLPGIDSSIVIEDVAHSTPDNWNMHNHISQTSQNLMNNKGQGRNFQMPMLGR 540
            RSNWSS VQLP IDSS+VIEDV  STPD WNMHNHISQTSQNLMNNKG GRNFQ P+LGR
Sbjct: 481  RSNWSSQVQLPTIDSSMVIEDVVQSTPDIWNMHNHISQTSQNLMNNKGAGRNFQTPLLGR 540

Query: 541  GITSSVGEKMSPYGDKLLTNDALHRPTNIASRLGSSGLDSSMESQSIVQSMGPRHPLNLS 600
            GI  S GEKMSP+ DKLLTNDALHRPT IASRLGSSGLDSSME QSIVQSMGPRHPLNL 
Sbjct: 541  GIALSGGEKMSPFADKLLTNDALHRPTTIASRLGSSGLDSSMELQSIVQSMGPRHPLNLP 600

Query: 601  NSCPPSRPPIFPVPRHNASQFESLNGSNSFMNCANRTFLPEQQMNNLRNKELSLTTKSPQ 660
            NSCPPSRPPIFPVPRHN S FESLNG NSF+N ANR+FLPEQQMNN+RNKELSLTTK PQ
Sbjct: 601  NSCPPSRPPIFPVPRHNKSPFESLNGGNSFINRANRSFLPEQQMNNMRNKELSLTTKLPQ 660

Query: 661  VGNQHTGHIPLTRGNQLQGMPLKPQFLPSQDMQDNFSGSAVPPVLPHLMAPSLSQGYISQ 720
            VGNQHTGHIPLTRGNQLQ +PLKPQFLPSQDMQDN S S VPP LPHLMAPSLSQGYISQ
Sbjct: 661  VGNQHTGHIPLTRGNQLQAIPLKPQFLPSQDMQDNLSASVVPPALPHLMAPSLSQGYISQ 720

Query: 721  GHRPAISEGLSSSAPIGQWNLSVHNSSSNPLHLQGGPLPPLPPGPHPTSGPTIPISQK-- 780
            GHRPAISE LSSSAPIGQWNL VHNS SNPLHLQGGPLPPLPPGPHPTS PTIPI QK  
Sbjct: 721  GHRPAISECLSSSAPIGQWNLPVHNSPSNPLHLQGGPLPPLPPGPHPTSIPTIPIPQKAG 780

Query: 781  --VPGQQPGTAISGLISSLMARGLISLNNQASVQDSVGLEFNPDVLKVRHESAITALYAD 840
              VPGQ+PGT  SGLISSLMA+GLISLNNQ SVQDSVGLEFNPDVLKVRHESAITALYAD
Sbjct: 781  SLVPGQRPGTEFSGLISSLMAQGLISLNNQPSVQDSVGLEFNPDVLKVRHESAITALYAD 840

Query: 841  LPRQCMTCGLRFKTQEEHSNHMDWHVTKNRMSKSRKQKPSRKWFVSISMWLSGAEALGTE 900
            LPRQCMTCGLRFKTQEEHSNHMDWHVTKNRMSKSRKQKPSRKWFVSISMWLSGAEALGTE
Sbjct: 841  LPRQCMTCGLRFKTQEEHSNHMDWHVTKNRMSKSRKQKPSRKWFVSISMWLSGAEALGTE 900

Query: 901  AVPGFLPAEVVVEKKDDEELAVPADEDQKTCALCGEPFEDFYSDETEEWMYRGAVYMNAP 960
            AVPGFLP EV+VEKKDDEELAVPAD+DQKTCALCGEPFEDFYSDETEEWMYRGAVYMNAP
Sbjct: 901  AVPGFLPPEVIVEKKDDEELAVPADDDQKTCALCGEPFEDFYSDETEEWMYRGAVYMNAP 960

Query: 961  DGQTAGMDISQLGPIVHAKCRTETNVVPSESFDQDE-GGVSEEGNRRKRLRS 1008
            DGQTAGMD SQLGPIVHAKCRTETNVV SESF+Q+E GGVSEEGNRRKRLRS
Sbjct: 961  DGQTAGMDRSQLGPIVHAKCRTETNVVTSESFEQEEQGGVSEEGNRRKRLRS 1010
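
The Identity and Positives percentages in each HSP header are the match counts divided by the alignment length. The following Python sketch recomputes them from a header line. It assumes the header layout shown above; the helper name `parse_hsp_stats` and the regular expression are illustrative and not part of the database's own tooling.

```python
import re

# Hypothetical helper for header lines such as
# "Identity = 920/1012 (90.91%), Positives = 944/1012 (93.28%), Query Frame = 0".
# The pattern also accepts the "Postives" spelling that some reports emit.
_STAT = re.compile(r"(Identity|Posi?tives) = (\d+)/(\d+) \(([\d.]+)%\)")

def parse_hsp_stats(line):
    """Return {name: (matches, length, reported_pct, recomputed_pct)}."""
    stats = {}
    for name, hits, length, pct in _STAT.findall(line):
        hits, length = int(hits), int(length)
        key = "Identity" if name == "Identity" else "Positives"
        # Recompute the percentage from the raw counts, rounded as in the report.
        stats[key] = (hits, length, float(pct), round(100.0 * hits / length, 2))
    return stats

line = "Identity = 920/1012 (90.91%), Positives = 944/1012 (93.28%), Query Frame = 0"
for name, (hits, length, reported, recomputed) in parse_hsp_stats(line).items():
    print(name, hits, length, reported, recomputed)
```

Comparing the reported and recomputed values is a quick consistency check when HSP headers are copied by hand into other documents.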

BLAST of CSPI01G13400 vs. TAIR 10
Match: AT4G04885.1 (PCF11P-similar protein 4 )

HSP 1 Score: 630.6 bits (1625), Expect = 2.2e-180
Identity = 437/1010 (43.27%), Positives = 561/1010 (55.54%), Query Frame = 0

Query: 5    MESEKLLISRGNPRNSVYPSDRPIPTTSGRTMPNELPQK--PAPSIAHRFRAQLKQRDDE 64
            M+SEK+L    NPR         I +TS + M  ELPQK  P PS+  RF+A L QR+DE
Sbjct: 1    MDSEKIL----NPRLV------SINSTSRKGMSVELPQKPPPPPSLLDRFKALLNQREDE 60

Query: 65   FRVSGHDVVPPPTAEDIVQLYDLMLSELTFNSKPIITDLTVLADEQREHGKGIADLICAR 124
            F   G + V PP+ ++IVQLY+++L ELTFNSKPIITDLT++A EQREHG+GIA+ IC R
Sbjct: 61   F--GGGEEVLPPSMDEIVQLYEVVLGELTFNSKPIITDLTIIAGEQREHGEGIANAICTR 120

Query: 125  ILEVPVDQKLPSLYLLDSIVKNVGHEYISYFASRLPEVFCEAYRQVHPNLHNAMRHLFGT 184
            ILE PV+QKLPSLYLLDSIVKN+G +Y  YF+SRLPEVFC AYRQ HP+LH +MRHLFGT
Sbjct: 121  ILEAPVEQKLPSLYLLDSIVKNIGRDYGRYFSSRLPEVFCLAYRQAHPSLHPSMRHLFGT 180

Query: 185  WATVFPPSIIRKIEAQLSQLTAQESSGLTSSRASESPRPTHGIHVNPKYLRQLEHSVVDK 244
            W++VFPP ++RKI+ QL   +A   S   S  ASE  +PT GIHVNPKYLR+LE S  + 
Sbjct: 181  WSSVFPPPVLRKIDMQLQLSSAANQS---SVGASEPSQPTRGIHVNPKYLRRLEPSAAE- 240

Query: 245  HSQDSRG-TSAIKVHDKKLASGYEEYDYDHADALEHGGPQGFHSMGSMGHDSFSLGTNKA 304
               + RG  S+ +V+ +    GY +++    D LE                         
Sbjct: 241  --NNLRGINSSARVYGQNSLGGYNDFE----DQLE------------------------- 300

Query: 305  NIKLAKSSLSSRIGPHRPLQSVGDEHETVRASPSQNVYDYEGSKMIDRNEDTNKWRRKQY 364
                + SSLSS         + G       A+PS   ++Y   +   R+++  +WRRK+ 
Sbjct: 301  ----SPSSLSSTPDGFTRRSNDG-------ANPSNQAFNYGMGRATSRDDEHMEWRRKE- 360

Query: 365  PDDNLNGLESTSSYNIRNGHALEGPRALIEAYGSDKGKGYLNDNPPQAEHFSINVIDNK- 424
                          N+  G+  E PRALI+AYG D  K    + P +     +N + +K 
Sbjct: 361  --------------NLGQGNDHERPRALIDAYGVDTSKHVTINKPIR----DMNGMHSKM 420

Query: 425  ATPVTWQNTEEEEFDWEDMSPTLADRGRNNDMLKPPVPP-SRFRTRSGFERSNAMPIEPG 484
             TP  WQNTEEEEFDWEDMSPTL DR R  + L+  VP     R R     ++   ++  
Sbjct: 421  VTP--WQNTEEEEFDWEDMSPTL-DRSRAGEFLRSSVPALGSVRARPRVGNTSDFHLDSD 480

Query: 485  MRSNWSSPVQLPGIDSSIVIEDVAHSTPDNWNMHNHISQTSQNLMNNKGQGRNFQMPMLG 544
            +++                   V+H   +NW++  +   TS  +  +   G++ ++    
Sbjct: 481  IKNG------------------VSHQLRENWSLSQNYPHTSNRV--DTRAGKDLKVLASS 540

Query: 545  RGITSSVGEKMSPYGDKLLTNDALHRPTNIASRLGSSGLDSSMESQSIVQSMGPRHPLNL 604
             G+ SS  E  +P  D +          ++ SR G +  D +    S   + GP      
Sbjct: 541  VGLVSSNSEFGAPPFDSI---------QDVNSRFGRALPDGTWPHLS---ARGP------ 600

Query: 605  SNSCPPSRPPIFPVPRHNASQFESLNGSNSFMNCANRTFLPEQQMNNLRNKELSLTTKSP 664
             NS         PVP            S    + AN    P   M+N R +   L     
Sbjct: 601  -NS--------LPVP------------SAHLHHLAN----PGNAMSN-RLQGKPLYRPEN 660

Query: 665  QVGNQHTGHIPLTRGNQLQGMPLKPQFLPSQDMQDNFSGSAVPPVLPHLMAPSLSQGYIS 724
            QV   H     +T+ NQ     +   +LPS       S +  P  +  L+       ++S
Sbjct: 661  QVSQSHLN--DMTQQNQ-----MLVNYLPS-------SSAMAPRPMQSLLT------HVS 720

Query: 725  QGHRPAISEGLSSSAPIGQWNLSVHNSSSNP-LHLQGG-PLPPLPPGPHPTSGPTIPISQ 784
             G+ P                   H S+  P L +QGG  + PL  G     G +     
Sbjct: 721  HGYPP-------------------HGSTIRPSLSIQGGEAMHPLSSGVLSQIGAS----- 780

Query: 785  KVPGQQPGTAISGLISSLMARGLISLNNQASVQDSVGLEFNPDVLKVRHESAITALYADL 844
                Q PG A SGLI SLMA+GLISLNNQ + Q  +GLEF+ D+LK+R+ESAI+ALY DL
Sbjct: 781  ---NQPPGGAFSGLIGSLMAQGLISLNNQPAGQGPLGLEFDADMLKIRNESAISALYGDL 808

Query: 845  PRQCMTCGLRFKTQEEHSNHMDWHVTKNRMSKSRKQKPSRKWFVSISMWLSGAEALGTEA 904
            PRQC TCGLRFK QEEHS HMDWHVTKNRMSK+ KQ PSRKWFVS SMWLSGAEALG EA
Sbjct: 841  PRQCTTCGLRFKCQEEHSKHMDWHVTKNRMSKNHKQNPSRKWFVSASMWLSGAEALGAEA 808

Query: 905  VPGFLPAEVVVEKKDDEELAVPADEDQKTCALCGEPFEDFYSDETEEWMYRGAVYMNAPD 964
            VPGFLP E   EKKDDE++AVPADEDQ +CALCGEPFEDFYSDETEEWMY+GAVYMNAP+
Sbjct: 901  VPGFLPTEPTTEKKDDEDMAVPADEDQTSCALCGEPFEDFYSDETEEWMYKGAVYMNAPE 808

Query: 965  GQTAGMDISQLGPIVHAKCRTETNVVPSESFDQDEGGVSEEGNRRKRLRS 1008
              T  MD SQLGPIVHAKCR E+N           GG  EEG++RK++RS
Sbjct: 961  ESTTDMDKSQLGPIVHAKCRPESN-----------GGDMEEGSQRKKMRS 808

BLAST of CSPI01G13400 vs. TAIR 10
Match: AT2G36480.1 (ENTH/VHS family protein )

HSP 1 Score: 198.7 bits (504), Expect = 2.2e-50
Identity = 232/908 (25.55%), Positives = 358/908 (39.43%), Query Frame = 0

Query: 124 LEVPVDQKLPSLYLLDSIVKNVGHEYISYFASRLPEVFCEAYRQVHPNLHNAMRHLFGTW 183
           ++VP DQKLP+LYLLDSIVKN+G +YI YF +RLPEVF +AYRQV P +H+ MRHLFGTW
Sbjct: 1   MQVPSDQKLPTLYLLDSIVKNIGRDYIKYFGARLPEVFVKAYRQVDPPMHSNMRHLFGTW 60

Query: 184 ATVFPPSIIRKIEAQLSQLTAQESSGLTSSRASESP---RPTHGIHVNPKYLRQLEHSVV 243
             VF P  ++ IE +L      + S    S A   P   RP H IHVNPKYL +      
Sbjct: 61  KGVFHPQTLQLIEKELGFNAKSDGSAAVVSTARAEPQSQRPPHSIHVNPKYLER------ 120

Query: 244 DKHSQDSRGTSAIKVHDKKLASGYEEYDYDHADALEHGGPQGFHSMGSMGHDSFSLGTNK 303
            +  Q S  T  +     + A          +D LE         + S+      +G  K
Sbjct: 121 -QRLQQSGRTKGMVTDVPETAPNLTR----DSDRLER--------VSSIASGGSWVGPAK 180

Query: 304 A-NIKLAKSSLSSRIGPHRPLQSVGDEHETVRASP--SQNVYDYEGSKMIDRNEDTNKW- 363
             NI+  +  L S     + ++S+  E++     P  S++V    GS++ D   +  +W 
Sbjct: 181 VNNIRRPQRDLLSEPLYEKDIESIAGEYDYASDLPHNSRSVIKNVGSRITDDGCE-KQWY 240

Query: 364 ----RRKQYPDDNLNGLESTSSYNIRNGHALEGPRALIEAYGSDKGKGYLNDNPPQAEHF 423
               R      D  +GL S S    R  +        +E+ G  +  G   D        
Sbjct: 241 GATNRDPDLISDQRDGLHSKS----RTSNYATARVENLESSGPSRNIGVPYD-------- 300

Query: 424 SINVIDNKATPVTWQNTEEEEFDWEDMSPTLADRG----RNNDMLKPPVPPSRFRTRSG- 483
                       +W+N+EEEEF W DM   L++         + L  P    R  + +  
Sbjct: 301 ------------SWKNSEEEEFMW-DMHSRLSETDVATINPKNELHAPDESERLESENHL 360

Query: 484 FERSNAMPIEP-----GMRSNWSSPVQLP-GIDSSIVIEDVAHSTPDNWNMHNHISQTSQ 543
            +R     ++P        +++SS  + P  I         A ST     +       S 
Sbjct: 361 LKRPRFSALDPRFDPANSTNSYSSEQKDPSSIGHWAFSSTNATSTATRKGIQPQPRVASS 420

Query: 544 NLMNNKGQGRNFQMPMLGRGITSSVGEKMSPYGDKLLTNDALHRPTNIASRLGSSGLDSS 603
            ++ + G G + Q P+       +V ++       L   D        ASR  +      
Sbjct: 421 GILPSSGSGSDRQSPLHDSTSKQNVTKQDVRRAHSLPQRDPR------ASRFPA------ 480

Query: 604 MESQSIVQSMGPRHPLNLSNSCPPSRPPIFPVPRHNASQFESLNGSNSFMNCANRTFLPE 663
              Q++ +    R P + S          F          E  +  ++  N    T   E
Sbjct: 481 --KQNVPRDDSVRLPSSSSQ---------FKNTNMRELPVEIFDSKSAAENAPGLTLASE 540

Query: 664 QQMNNLRNKELSLTTKSPQVGNQHT-------GHIPLTRGNQLQGMPLKPQFLPSQDMQD 723
                  +  L    KS  + N  T        H  +  G        KP+ LP     D
Sbjct: 541 ATGQPNMSDLLEAVMKSGILSNNSTCGAIKEESHDEVNPGALTLPAASKPKTLPISLATD 600

Query: 724 NFSGSAVPPVLPHLMAPSLSQGYISQGHRPAISEGLSSSAPIGQWNLSVHNSSSNPL--- 783
           N                 L++  + Q   P +S   S +           + +S+PL   
Sbjct: 601 NL----------------LARLKVEQSSAPLVSCAASLTGITSVQTSKEKSKASDPLSCL 660

Query: 784 ---------------HLQGGPLPPLPPGPHPTSGPTIPISQKVPGQQPGTAISGLISSLM 843
                           L   P       P  ++  ++ +S      QP   + G  ++  
Sbjct: 661 LSSLVSKGLISASKTELPSAPSITQEHSPDHSTNSSMSVSVVPADAQPSVLVKGPSTAPK 720

Query: 844 ARGLI--SLNNQASVQDSVGLEFNPDVLKVRHESAITALYADLPRQCMTCGLRFKTQEEH 903
            +GL   S  +++  +D +GL+F  D ++  H S I++L+ DLP  C +C +R K +EE 
Sbjct: 721 VKGLAAPSETSKSEPKDLIGLKFRADKIRELHPSVISSLFDDLPHLCTSCSVRLKQKEEL 780

Query: 904 SNHMDWHVTKNRMSKSRKQKPSRKWFVSISMWLSGAEALGTEAVPGFLPAEVVVEKKDDE 963
             HM+ H  K ++  S      R WF  +  W++   A   E  P +       E   ++
Sbjct: 781 DRHMELH-DKKKLELSGTNSKCRVWFPKVDNWIA---AKAGELEPEYEEVLSEPESAIED 815

Query: 964 ELAVPADEDQKTCALCGEPFEDFYSDETEEWMYRGAVYMNAPDGQTAGMDISQLGPIVHA 983
             AV ADE Q  C LCGE FED++S E  +WM++GA Y+  P   +        GPIVH 
Sbjct: 841 CQAVAADETQCACILCGEVFEDYFSQEMAQWMFKGASYLTNPPANSEAS-----GPIVHT 815

BLAST of CSPI01G13400 vs. TAIR 10
Match: AT2G36480.2 (ENTH/VHS family protein )

HSP 1 Score: 198.7 bits (504), Expect = 2.2e-50
Identity = 232/908 (25.55%), Positives = 358/908 (39.43%), Query Frame = 0

Query: 124 LEVPVDQKLPSLYLLDSIVKNVGHEYISYFASRLPEVFCEAYRQVHPNLHNAMRHLFGTW 183
           ++VP DQKLP+LYLLDSIVKN+G +YI YF +RLPEVF +AYRQV P +H+ MRHLFGTW
Sbjct: 1   MQVPSDQKLPTLYLLDSIVKNIGRDYIKYFGARLPEVFVKAYRQVDPPMHSNMRHLFGTW 60

Query: 184 ATVFPPSIIRKIEAQLSQLTAQESSGLTSSRASESP---RPTHGIHVNPKYLRQLEHSVV 243
             VF P  ++ IE +L      + S    S A   P   RP H IHVNPKYL +      
Sbjct: 61  KGVFHPQTLQLIEKELGFNAKSDGSAAVVSTARAEPQSQRPPHSIHVNPKYLER------ 120

Query: 244 DKHSQDSRGTSAIKVHDKKLASGYEEYDYDHADALEHGGPQGFHSMGSMGHDSFSLGTNK 303
            +  Q S  T  +     + A          +D LE         + S+      +G  K
Sbjct: 121 -QRLQQSGRTKGMVTDVPETAPNLTR----DSDRLER--------VSSIASGGSWVGPAK 180

Query: 304 A-NIKLAKSSLSSRIGPHRPLQSVGDEHETVRASP--SQNVYDYEGSKMIDRNEDTNKW- 363
             NI+  +  L S     + ++S+  E++     P  S++V    GS++ D   +  +W 
Sbjct: 181 VNNIRRPQRDLLSEPLYEKDIESIAGEYDYASDLPHNSRSVIKNVGSRITDDGCE-KQWY 240

Query: 364 ----RRKQYPDDNLNGLESTSSYNIRNGHALEGPRALIEAYGSDKGKGYLNDNPPQAEHF 423
               R      D  +GL S S    R  +        +E+ G  +  G   D        
Sbjct: 241 GATNRDPDLISDQRDGLHSKS----RTSNYATARVENLESSGPSRNIGVPYD-------- 300

Query: 424 SINVIDNKATPVTWQNTEEEEFDWEDMSPTLADRG----RNNDMLKPPVPPSRFRTRSG- 483
                       +W+N+EEEEF W DM   L++         + L  P    R  + +  
Sbjct: 301 ------------SWKNSEEEEFMW-DMHSRLSETDVATINPKNELHAPDESERLESENHL 360

Query: 484 FERSNAMPIEP-----GMRSNWSSPVQLP-GIDSSIVIEDVAHSTPDNWNMHNHISQTSQ 543
            +R     ++P        +++SS  + P  I         A ST     +       S 
Sbjct: 361 LKRPRFSALDPRFDPANSTNSYSSEQKDPSSIGHWAFSSTNATSTATRKGIQPQPRVASS 420

Query: 544 NLMNNKGQGRNFQMPMLGRGITSSVGEKMSPYGDKLLTNDALHRPTNIASRLGSSGLDSS 603
            ++ + G G + Q P+       +V ++       L   D        ASR  +      
Sbjct: 421 GILPSSGSGSDRQSPLHDSTSKQNVTKQDVRRAHSLPQRDPR------ASRFPA------ 480

Query: 604 MESQSIVQSMGPRHPLNLSNSCPPSRPPIFPVPRHNASQFESLNGSNSFMNCANRTFLPE 663
              Q++ +    R P + S          F          E  +  ++  N    T   E
Sbjct: 481 --KQNVPRDDSVRLPSSSSQ---------FKNTNMRELPVEIFDSKSAAENAPGLTLASE 540

Query: 664 QQMNNLRNKELSLTTKSPQVGNQHT-------GHIPLTRGNQLQGMPLKPQFLPSQDMQD 723
                  +  L    KS  + N  T        H  +  G        KP+ LP     D
Sbjct: 541 ATGQPNMSDLLEAVMKSGILSNNSTCGAIKEESHDEVNPGALTLPAASKPKTLPISLATD 600

Query: 724 NFSGSAVPPVLPHLMAPSLSQGYISQGHRPAISEGLSSSAPIGQWNLSVHNSSSNPL--- 783
           N                 L++  + Q   P +S   S +           + +S+PL   
Sbjct: 601 NL----------------LARLKVEQSSAPLVSCAASLTGITSVQTSKEKSKASDPLSCL 660

Query: 784 ---------------HLQGGPLPPLPPGPHPTSGPTIPISQKVPGQQPGTAISGLISSLM 843
                           L   P       P  ++  ++ +S      QP   + G  ++  
Sbjct: 661 LSSLVSKGLISASKTELPSAPSITQEHSPDHSTNSSMSVSVVPADAQPSVLVKGPSTAPK 720

Query: 844 ARGLI--SLNNQASVQDSVGLEFNPDVLKVRHESAITALYADLPRQCMTCGLRFKTQEEH 903
            +GL   S  +++  +D +GL+F  D ++  H S I++L+ DLP  C +C +R K +EE 
Sbjct: 721 VKGLAAPSETSKSEPKDLIGLKFRADKIRELHPSVISSLFDDLPHLCTSCSVRLKQKEEL 780

Query: 904 SNHMDWHVTKNRMSKSRKQKPSRKWFVSISMWLSGAEALGTEAVPGFLPAEVVVEKKDDE 963
             HM+ H  K ++  S      R WF  +  W++   A   E  P +       E   ++
Sbjct: 781 DRHMELH-DKKKLELSGTNSKCRVWFPKVDNWIA---AKAGELEPEYEEVLSEPESAIED 815

Query: 964 ELAVPADEDQKTCALCGEPFEDFYSDETEEWMYRGAVYMNAPDGQTAGMDISQLGPIVHA 983
             AV ADE Q  C LCGE FED++S E  +WM++GA Y+  P   +        GPIVH 
Sbjct: 841 CQAVAADETQCACILCGEVFEDYFSQEMAQWMFKGASYLTNPPANSEAS-----GPIVHT 815

BLAST of CSPI01G13400 vs. TAIR 10
Match: AT2G36480.3 (ENTH/VHS family protein )

HSP 1 Score: 198.7 bits (504), Expect = 2.2e-50
Identity = 232/908 (25.55%), Positives = 358/908 (39.43%), Query Frame = 0

Query: 124 LEVPVDQKLPSLYLLDSIVKNVGHEYISYFASRLPEVFCEAYRQVHPNLHNAMRHLFGTW 183
           ++VP DQKLP+LYLLDSIVKN+G +YI YF +RLPEVF +AYRQV P +H+ MRHLFGTW
Sbjct: 1   MQVPSDQKLPTLYLLDSIVKNIGRDYIKYFGARLPEVFVKAYRQVDPPMHSNMRHLFGTW 60

Query: 184 ATVFPPSIIRKIEAQLSQLTAQESSGLTSSRASESP---RPTHGIHVNPKYLRQLEHSVV 243
             VF P  ++ IE +L      + S    S A   P   RP H IHVNPKYL +      
Sbjct: 61  KGVFHPQTLQLIEKELGFNAKSDGSAAVVSTARAEPQSQRPPHSIHVNPKYLER------ 120

Query: 244 DKHSQDSRGTSAIKVHDKKLASGYEEYDYDHADALEHGGPQGFHSMGSMGHDSFSLGTNK 303
            +  Q S  T  +     + A          +D LE         + S+      +G  K
Sbjct: 121 -QRLQQSGRTKGMVTDVPETAPNLTR----DSDRLER--------VSSIASGGSWVGPAK 180

Query: 304 A-NIKLAKSSLSSRIGPHRPLQSVGDEHETVRASP--SQNVYDYEGSKMIDRNEDTNKW- 363
             NI+  +  L S     + ++S+  E++     P  S++V    GS++ D   +  +W 
Sbjct: 181 VNNIRRPQRDLLSEPLYEKDIESIAGEYDYASDLPHNSRSVIKNVGSRITDDGCE-KQWY 240

Query: 364 ----RRKQYPDDNLNGLESTSSYNIRNGHALEGPRALIEAYGSDKGKGYLNDNPPQAEHF 423
               R      D  +GL S S    R  +        +E+ G  +  G   D        
Sbjct: 241 GATNRDPDLISDQRDGLHSKS----RTSNYATARVENLESSGPSRNIGVPYD-------- 300

Query: 424 SINVIDNKATPVTWQNTEEEEFDWEDMSPTLADRG----RNNDMLKPPVPPSRFRTRSG- 483
                       +W+N+EEEEF W DM   L++         + L  P    R  + +  
Sbjct: 301 ------------SWKNSEEEEFMW-DMHSRLSETDVATINPKNELHAPDESERLESENHL 360

Query: 484 FERSNAMPIEP-----GMRSNWSSPVQLP-GIDSSIVIEDVAHSTPDNWNMHNHISQTSQ 543
            +R     ++P        +++SS  + P  I         A ST     +       S 
Sbjct: 361 LKRPRFSALDPRFDPANSTNSYSSEQKDPSSIGHWAFSSTNATSTATRKGIQPQPRVASS 420

Query: 544 NLMNNKGQGRNFQMPMLGRGITSSVGEKMSPYGDKLLTNDALHRPTNIASRLGSSGLDSS 603
            ++ + G G + Q P+       +V ++       L   D        ASR  +      
Sbjct: 421 GILPSSGSGSDRQSPLHDSTSKQNVTKQDVRRAHSLPQRDPR------ASRFPA------ 480

Query: 604 MESQSIVQSMGPRHPLNLSNSCPPSRPPIFPVPRHNASQFESLNGSNSFMNCANRTFLPE 663
              Q++ +    R P + S          F          E  +  ++  N    T   E
Sbjct: 481 --KQNVPRDDSVRLPSSSSQ---------FKNTNMRELPVEIFDSKSAAENAPGLTLASE 540

Query: 664 QQMNNLRNKELSLTTKSPQVGNQHT-------GHIPLTRGNQLQGMPLKPQFLPSQDMQD 723
                  +  L    KS  + N  T        H  +  G        KP+ LP     D
Sbjct: 541 ATGQPNMSDLLEAVMKSGILSNNSTCGAIKEESHDEVNPGALTLPAASKPKTLPISLATD 600

Query: 724 NFSGSAVPPVLPHLMAPSLSQGYISQGHRPAISEGLSSSAPIGQWNLSVHNSSSNPL--- 783
           N                 L++  + Q   P +S   S +           + +S+PL   
Sbjct: 601 NL----------------LARLKVEQSSAPLVSCAASLTGITSVQTSKEKSKASDPLSCL 660

Query: 784 ---------------HLQGGPLPPLPPGPHPTSGPTIPISQKVPGQQPGTAISGLISSLM 843
                           L   P       P  ++  ++ +S      QP   + G  ++  
Sbjct: 661 LSSLVSKGLISASKTELPSAPSITQEHSPDHSTNSSMSVSVVPADAQPSVLVKGPSTAPK 720

Query: 844 ARGLI--SLNNQASVQDSVGLEFNPDVLKVRHESAITALYADLPRQCMTCGLRFKTQEEH 903
            +GL   S  +++  +D +GL+F  D ++  H S I++L+ DLP  C +C +R K +EE 
Sbjct: 721 VKGLAAPSETSKSEPKDLIGLKFRADKIRELHPSVISSLFDDLPHLCTSCSVRLKQKEEL 780

Query: 904 SNHMDWHVTKNRMSKSRKQKPSRKWFVSISMWLSGAEALGTEAVPGFLPAEVVVEKKDDE 963
             HM+ H  K ++  S      R WF  +  W++   A   E  P +       E   ++
Sbjct: 781 DRHMELH-DKKKLELSGTNSKCRVWFPKVDNWIA---AKAGELEPEYEEVLSEPESAIED 815

Query: 964 ELAVPADEDQKTCALCGEPFEDFYSDETEEWMYRGAVYMNAPDGQTAGMDISQLGPIVHA 983
             AV ADE Q  C LCGE FED++S E  +WM++GA Y+  P   +        GPIVH 
Sbjct: 841 CQAVAADETQCACILCGEVFEDYFSQEMAQWMFKGASYLTNPPANSEAS-----GPIVHT 815

BLAST of CSPI01G13400 vs. TAIR 10
Match: AT1G66500.1 (Pre-mRNA cleavage complex II )

HSP 1 Score: 165.2 bits (417), Expect = 2.6e-40
Identity = 95/185 (51.35%), Positives = 114/185 (61.62%), Query Frame = 0

Query: 806 QASVQDS--VGLEF-NPDVLKVRHESAITALYADLPRQCMTCGLRFKTQEEHSNHMDWHV 865
           +AS  DS  VGL F NP  L VRHES I +LY+D+PRQC +CGLRFK QEEHS HMDWHV
Sbjct: 218 EASNSDSLPVGLSFDNPSSLNVRHESVIKSLYSDMPRQCSSCGLRFKCQEEHSKHMDWHV 277

Query: 866 TKNRMSKS-----RKQKPSRKWFVSISMWLSGAEALGTEAVPGFLPAEVVVEKKDDEE-- 925
            KNR  K+     ++ K SR W  S S+WL  A    T  V  F   E+  +K  DEE  
Sbjct: 278 RKNRSVKTTTRLGQQPKKSRGWLASASLWLCAATGGETVEVASF-GGEMQKKKGKDEEPK 337

Query: 926 -LAVPADEDQKTCALCGEPFEDFYSDETEEWMYRGAVYMNAPDGQTAGMDISQLGPIVHA 980
            L VPADEDQK CALC EPFE+F+S E ++WMY+ AVY            +++ G IVH 
Sbjct: 338 QLMVPADEDQKNCALCVEPFEEFFSHEDDDWMYKDAVY------------LTKNGRIVHV 389

The following BLAST results are available for this feature:
Match Name  E-value   Identity  Description
Q0WPF2      3.1e-179  43.27     Polyadenylation and cleavage factor homolog 4 OS=Arabidopsis thaliana OX=3702 GN...
Q9C710      3.7e-39   51.35     Polyadenylation and cleavage factor homolog 1 OS=Arabidopsis thaliana OX=3702 GN...
Q9FIX8      1.4e-38   48.39     Polyadenylation and cleavage factor homolog 5 OS=Arabidopsis thaliana OX=3702 GN...
O94913      4.3e-19   36.42     Pre-mRNA cleavage complex 2 protein Pcf11 OS=Homo sapiens OX=9606 GN=PCF11 PE=1 ...
Q10237      4.9e-15   40.60     Uncharacterized protein C4G9.04c OS=Schizosaccharomyces pombe (strain 972 / ATCC...

Match Name  E-value   Identity  Description
A0A0A0LVG0  0.0e+00   99.70     CID domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_1G109350 PE=4 SV...
A0A1S3CI66  0.0e+00   96.72     polyadenylation and cleavage factor homolog 4 isoform X2 OS=Cucumis melo OX=3656...
A0A1S3CJP9  0.0e+00   96.25     polyadenylation and cleavage factor homolog 4 isoform X1 OS=Cucumis melo OX=3656...
A0A5A7UC46  0.0e+00   96.73     Polyadenylation and cleavage factor-like protein 4 isoform X2 OS=Cucumis melo va...
A0A6J1EZ18  0.0e+00   84.82     polyadenylation and cleavage factor homolog 4-like isoform X2 OS=Cucurbita mosch...

Match Name      E-value   Identity  Description
XP_011653866.1  0.0e+00   99.70     polyadenylation and cleavage factor homolog 4 [Cucumis sativus] >XP_031739723.1 ...
XP_008462986.1  0.0e+00   96.72     PREDICTED: polyadenylation and cleavage factor homolog 4 isoform X2 [Cucumis mel...
XP_008462960.1  0.0e+00   96.25     PREDICTED: polyadenylation and cleavage factor homolog 4 isoform X1 [Cucumis mel...
KAA0051796.1    0.0e+00   96.73     polyadenylation and cleavage factor-like protein 4 isoform X2 [Cucumis melo var....
XP_038894060.1  0.0e+00   90.91     polyadenylation and cleavage factor homolog 4 isoform X3 [Benincasa hispida]

Match Name   E-value   Identity  Description
AT4G04885.1  2.2e-180  43.27     PCF11P-similar protein 4
AT2G36480.1  2.2e-50   25.55     ENTH/VHS family protein
AT2G36480.2  2.2e-50   25.55     ENTH/VHS family protein
AT2G36480.3  2.2e-50   25.55     ENTH/VHS family protein
AT1G66500.1  2.6e-40   51.35     Pre-mRNA cleavage complex II
InterPro
Analysis Name: InterPro Annotations of Cucumber (PI 183967) v1
Date Performed: 2021-10-25
IPR Term   IPR Description        Source       Source Term     Source Description                             Alignment
IPR006569  CID domain             SMART        SM00582         558neu5                                        coord: 78..200; e-value: 7.9E-41; score: 151.6
IPR006569  CID domain             PFAM         PF04818         CID                                            coord: 87..194; e-value: 2.3E-11; score: 44.0
IPR006569  CID domain             PROSITE      PS51391         CID                                            coord: 75..203; score: 37.885735
IPR008942  ENTH/VHS               GENE3D       1.25.40.90      -                                              coord: 73..201; e-value: 5.7E-42; score: 144.9
IPR008942  ENTH/VHS               SUPERFAMILY  48464           ENTH/VHS domain                                coord: 75..203
None       No IPR available       MOBIDB_LITE  mobidb-lite     disorder_prediction                            coord: 441..465
None       No IPR available       MOBIDB_LITE  mobidb-lite     disorder_prediction                            coord: 1..44
None       No IPR available       MOBIDB_LITE  mobidb-lite     disorder_prediction                            coord: 747..769
None       No IPR available       MOBIDB_LITE  mobidb-lite     disorder_prediction                            coord: 312..333
None       No IPR available       MOBIDB_LITE  mobidb-lite     disorder_prediction                            coord: 755..769
None       No IPR available       MOBIDB_LITE  mobidb-lite     disorder_prediction                            coord: 13..38
None       No IPR available       MOBIDB_LITE  mobidb-lite     disorder_prediction                            coord: 981..1007
None       No IPR available       MOBIDB_LITE  mobidb-lite     disorder_prediction                            coord: 206..226
None       No IPR available       PANTHER      PTHR15921:SF12  POLYADENYLATION AND CLEAVAGE FACTOR HOMOLOG 4  coord: 42..982
None       No IPR available       CDD          cd16982         CID_Pcf11                                      coord: 80..199; e-value: 4.37634E-54; score: 181.995
IPR045154  Protein PCF11-like     PANTHER      PTHR15921       PRE-MRNA CLEAVAGE COMPLEX II                   coord: 42..982
IPR013087  Zinc finger C2H2-type  PROSITE      PS00028         ZINC_FINGER_C2H2_1                             coord: 841..861
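
Six of the signatures listed above (SMART, PFAM, PROSITE, GENE3D, SUPERFAMILY and CDD) place the CID / ENTH-VHS region at nearly the same residues. The following Python sketch computes the span shared by all of them and the overall envelope. The coordinates are copied from the InterPro entries above; the names `hits` and `consensus_region` are hypothetical and chosen only for this illustration.

```python
# Inclusive (start, end) residue coordinates copied from the InterPro entries.
# The dictionary keys are labels made up for this sketch.
hits = {
    "SMART SM00582": (78, 200),
    "PFAM PF04818": (87, 194),
    "PROSITE PS51391": (75, 203),
    "GENE3D 1.25.40.90": (73, 201),
    "SUPERFAMILY 48464": (75, 203),
    "CDD cd16982": (80, 199),
}

def consensus_region(ranges):
    """Return (core, envelope) for inclusive (start, end) ranges.

    core is the span common to every range, or None if they do not all overlap;
    envelope runs from the smallest start to the largest end.
    """
    ranges = list(ranges)
    starts = [start for start, _ in ranges]
    ends = [end for _, end in ranges]
    core = (max(starts), min(ends))
    envelope = (min(starts), max(ends))
    return (core if core[0] <= core[1] else None), envelope

core, envelope = consensus_region(hits.values())
print("shared by all signatures:", core)
print("envelope of all signatures:", envelope)
```

With these coordinates the shared core is residues 87..194 and the envelope is 73..203, so the narrowest prediction (PFAM) sets the core.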

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
CSPI01G13400.1  CSPI01G13400.1  mRNA
CSPI01G13400.2  CSPI01G13400.2  mRNA
CSPI01G13400.3  CSPI01G13400.3  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0006379      mRNA cleavage
biological_process  GO:0006378      mRNA polyadenylation
biological_process  GO:0009911      positive regulation of flower development
biological_process  GO:0006369      termination of RNA polymerase II transcription
cellular_component  GO:0005737      cytoplasm
cellular_component  GO:0005849      mRNA cleavage factor complex
molecular_function  GO:0003729      mRNA binding
molecular_function  GO:0000993      RNA polymerase II complex binding