Spg033175 (gene) Sponge gourd (cylindrica) v1

Overview
Name: Spg033175
Type: gene
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: polyadenylation and cleavage factor homolog 4-like isoform X2
Location: scaffold5: 3763270 .. 3771375 (+)
RNA-Seq Expression: Spg033175
Synteny: Spg033175
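
The location value above can be split into its parts to get the gene span. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual convention for this style of record, but not stated on the page); the `parse_location` helper and `Location` type are hypothetical names introduced only for illustration.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    seqid: str
    start: int
    end: int
    strand: str


# Matches values shaped like "scaffold5: 3763270 .. 3771375 (+)".
_LOCATION_RE = re.compile(r"^\s*(\S+?):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*$")


def parse_location(text: str) -> Location:
    """Split a location value into sequence id, start, end and strand."""
    match = _LOCATION_RE.match(text)
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = match.groups()
    return Location(seqid, int(start), int(end), strand)


def span_length(loc: Location) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return loc.end - loc.start + 1


loc = parse_location("scaffold5: 3763270 .. 3771375 (+)")
print(loc.seqid, loc.strand, span_length(loc))
```

Under the inclusive-coordinate assumption, the span works out to 3771375 - 3763270 + 1 = 8106 bp on the plus strand of scaffold5.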
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATCGGACAGCTCAAACTGGGGCTAAGCGTTTTTGCATTCTCTCTATATAACAAGCGAGTCTGAATCTCAATCCAAAGGCTCCCCTTCTTCCCGGTTTCAACCATTCTTCCCAATCTTTGCTCTTCTTTCTTATCACTGATCATTGTAAATCTTCTTCATTCTCTGTAATTTTGTAGGATTAGGGTTTTATTCTGCTTCTTTCTTGTACAATTTCAGTGTTCAAGTGGGGTTTCTTTTTTGTAATTTACATGTTGCTTTTGCCCGAACTACTCGCTGATAGGAACCAAGGTTATTATTGAGATCTTGTCTACTTGTTTTCTTTCATTGCTTTTCGTATCTGTATCGATGATGTAATTGTAGAGAATTCGAATTAGATTAGGTTTAGGATTAGGGTTTTGGTTTGTTTCGTTTGCTAATGGAAATGGAGAGCTCGCGGAGACCTTTCGATCGAACGAGGGAACCGGGTTTGAAGAAGCCCCGACTGGCCGATGAGGCTGAGCGCGGTGGGAACATCAATGGCCGGCCGTTTCCGCAGAGACCAGTTGTTTCCGGGACCAATATTGTGCAACCCAGATTTAGAGCAAGTGATAGAGATTCGGGAAGCAGTGACTCTGGTCGAGGGGGGTATCAGCCTCAGCCGCCGCAGCATCAGGAACTCGTCAGCCAGTATAGGACCGCCCTTGCTGAGCTGACTTTCAATTCGAAACCAATCATCACGAATTTGACCATAATCGCGGGTGAAAATCTCCAGGCTGCAAAAGCGATCTCTGCCACCGTTTGCGCCAACATTCTCGAGGTGAACCCAGTTTAAACTGTGTAGCAGCGTTTTGCATTCCGTTGATTGATATATATATATTTTCTACTTTTTGTGTGAATATAATTGTATGGGGAATTAGTGTAGATGGAAAAGAGGAAATACAGAAAATGTTTTGTTGTGAAGTGTAGAAGTAAAATTGTAAGTGTATTGGGATTAATGAAATCTTTTGCGGTTTTACAGCGATTTTAATTGTCTTCTGTAGAAGGTGCAACGTCTTGGTTGTGGAATATACAAATACCTCCTATAATATGATGCATACTGCGCTGTCATCTTAGTATTTGGAACTACATGATTTTTATTAGTAAAAGCTTTGTTTTCAATGGGAGGTTGTGATAATGCTGATTTTGGTGTGGCTATCATTCACGCTGCTGGTATCAAATGAAACTTTGCGAGAATAACTTAGTAACTCTCATAAGTTTGATTATTTATTGAGCTAATTGTGTATTTAATTTCTAAAACAAGTTAAACAACTCTGGAGATTGTCTTCTGTAATTGATTGACTCCCTTGTGTACTGGAATTGATGGTTATATTGTTGGTTGGTTAATTAAATCGTTGCTAAGTCCATCATTCAGGGACCCCCCCCCCCCCCCCAAAAAAAAAAAAAAAAAAAAAAAAAAGTACCCAATAAAACTACCAATTGAAATATAAGCACCAATCTGAGTTGACAGGTGATTGCATTTTGACAGGAGGCCTACGATCTTTCATCTCTATGAGAGATCATCGATCTGGGAAAATTTTTGCTGGTTTGGGCTAGGATAGTAGATAAATTTCTCGTGGTTTTGGTGAATCTGTGTATGTTGAAAGAAGCTCTCTATTCCACTAGAAAAAATGTTGCACAAATTTTCTCCACACGCTTGGAGTTGCAAGTTTGACTTGTATTGTAGTAGAAGTTTTGATACCTTAGGTGTTCCTTTCATGCTAATCCTGTTCTTTTTATGGGCGCCTTGTGGTGTTTCTTTTCTTCTCCTGATCCTAGTGAAGCTTGAAATATGAAATGCTACTAGAGAAGCATTAGAGAGTTTTGATGTGCTGTGCACCCCATATTATATTGTAGCTGTACGTGAAAATGGATAAAGTAGGCACTGGGTTTTAACATTTAGAATTTGATTCTCTAAGGTTACTTTGCTCCTAATTTCCTCATACTTTTGTATGCCTTTGAAGTCATGATTGGGCATTAGAACA
ATTGGTTTACAACTCATTTGGTATAATTAGAATGATGACCTCCAAAGACTGCTGGAGGTTTGATTCCCTTGATAGGTGTGGTGCAAGTTGAGGTTTGTGAATGCACAATGCACCTAACTAGTTTGGTGCTGGAAAAATCTTGAGCAGTTGTAGGAGATTATTTTAATATTTCAAGATAATGCAGTATTTGCTGCCTTTAAGCATGGTTGGGCATTGAATACAGCAATAATGCAGTATTTATCTACATTTATGTCTGACAGATGAAATGAGTTTCTTACTTGCTGCCATGGCTCTTAAGATTTAACCTTTCCCCCTCATTTAAAGATCTAAACCAATATTATATTGTGATATTTATCTCCGTTTAGTGTACATACACTTCGATGCACATCTTTACGTGTAAAAACTTACGCATTCTTATTTTAAAATGATATTCCTCCTAATACTGTACTATTTTTCTCTAAAATTGGGCATGAGTGTTATTGGTAGTTACATAACGTTATGTCTCTAAGTTGTTTTAAGCAGTCGGTAACTTGTTTCTTTGTGTGTATGCATGTACAATATGCTTGCTGATGGGAATTCAATTCCTTTTTATGGGGCATTTTTGTTTGGCCCTATGATTGTCAATTGTTCATCTGGATGCTATATTGATTTCTTATACATGTTCCAGGTTTCCAGTGAGCAGAAGCTACCATCACTTTATCTACTGGACAGTATTGTAAAGAATATTGGAAGAGATTACATAAAGTACTTTGCAGCAAGACTGCCCGAGGTTAGATTTTTGTTCTGTGAACCTCTCCCTACCAAGAGACACTCCCCCCACCCACCCTGTGGAACTGAACAAATTTTATGTCTGTGTCTACCATAAGTAATGCTTCAGAATGCTTCCACTCGATGCTCTTTCGGTATTGTTGTTCATTTTCTTCAACTGATATGAAGATATCAACGTGAATGTCATTTTCCAGCATTATTTGATCAGTAACCTGCGTCCTATCAAGTACAACAATTCTAGTAATAGCATTCATTACTATTCATTGTCTATCAATTATCCCAAATTTGTAATAGCTTTTTTTTCTATCTTTTTTTAGGTATTCTGCAAAGCTTATAGGCAAGTTGACCCTTCTGTACATCCAAGTATGAGACATCTCTTTGGCACCTGGAAAGGAGTGTTTCCTCCTCAAACTCTGCAGATTATAGAGAAAGAACTTGGCTTCATGCCCAGCAGTAGTTCTTCTTCTGGGACCATAACCTCAAAGCCAGATTTGCAGGCACAACGTCCAGCCCATAGTATCCATGTAAATCCCAAGTATATAGAGAGGCAACGGCTTCAGCAGTCAGGCAGGGTTAGTGAAGTTCTGCTACCACCCTCAATTTTGATCCTTACAATGGTCCATTTTTCCTTCTCAAAGATTGAAGTCATTCTTTGGCTTTTCCAAAGTTAGTATGTGCCCTTATAGATCGAGAATCTGTTGTCTATATTAAGTGGAAACTTAGTGATGTTTTCCTGTTTGCATGCAAAATATGGTGTCTTTATTCTTTCCAGGGACATAAATTGATAAACCATTCATCTTTGCTTGTATGTAGAAATAGATTTCTGAAAGTTTTCTAGGGTACTATCCATACTTTTTTTTTTTTTGAGACAGGTACTACCTATACTTTTGTAACCCTAAGAATCTTCTTAATTTGTGAAGTGGATTGGTTACTCTTTTTTCTTGTCCTTATTCTCAATTCATTCTTTTTACACGATGAAGGTGAAAGGAATGACTAGTGATGCTACTGGCGCAACTACAAATGTAACTCAGGATGTTGCCCAAGCCAAAATTAGTACGGGACGTCCATGGGCAGATGCTCCAATCAAAGTGCTTGTAAGTAATTTATTTCTTTTAGAAAAATCACATTAGTCATTAGTTTTTCATTCCCTTATGATAAACACCATGATATAAATTTTCCAGGACATTCATCGTCCACTTAGAGATGCACCAAATGATATGGCACAAGAGAAG
AACATCACGGCTGCATATGCAGACTATGAATATGGTTCTGATCTTCCAAGGACAACAGGTATCGGAAGAAGGGTTGTTGATGAAGGGCGAGACAAACCTTGGTCTTCAGCTGGTAGCAATTTGGCAGAGAAGTTATCTGGCCAAAGAAATGGGTTCAACATCAAGCTTGGATATGAAAATTACCCTGCACCCAAGTCCGCAAACACTGGTGCACGTCTACTGCCCATGCAAAATTTTTCAAGCAGCAGCAGCAACAGAGTATTGTCTACTAACTGGAAGAACTCTGAGGAAGAGGAGTTTATGTGGGGTGAAATGAACTCTATTATGACAGGTCATGGTGCACCTGCCATTGCCAGTAGCACTGGGAAAGATCAATGGACTCCTGAGGATTCGGATAATTCGGTAAGCTTGAATATAATTGATGGGGACTACTTTTTTCTGGTCAATGATATATTAAAATACCAATAATTTCACTCATTGGCAGAAAGCAAAAAAGTGTTTACTTAACTTTCATTTTTGTAGTACATCTTTCTGGCCTGTGCAATTATGGACCATTGACCATTTCATGTGTGATTGATTGTTGAGTTGGTTTCTATGTCAGGGTATTGAAAATAAGCCATTAAGTGTACGGGATACTGGGGCAAGTGTTGATAGAGAAGCTTCCAGTGATTCACAATCATCAGAGCAGAGAGAACTAGGGGATTCTGGACAGCAAAGGTCATCAATGTGGCAAGTGCAGGAGCCAATATCTCTGGACGGGCTGAGAGGTGGGGTTCCTAGAAAGAATTCAGCTGAGTCAGGAGGTTATGGTGCTAACTCTTCTGTGGATCAAATGGGAGGTCGACCACAGATCACATCATCTAATATTGGAGCTTCAGGACATGGGTTTCTGAATAAAGGAGGTTCAGGGTCCATTGGGACTGTAGGCCATCAAAGGTTTCCATCACGAAGTGTTGCATTCCCATCCGGACAGCCACCCTTGCACCAACGCCCCCCTTCATTATTGTTAGTGGATCACGTTCCTCATCAAATGCACGACCATAAAACTTCTTCATTATCTAATCTTGACCCACGTAAAAGGCATATGCAGGATGCTGCCCTTGGCCTGCATCCTAGCGTTCGGCCAGATAACCTTCAAAAATCACAGCCTCAGGACCTTCAAGCTTCAGCTTCATCCGTACCTGCTTCTCAACCCAAGCACCAGTTCTCTTTATCTGAGTCACTAAAACCTGACGTCACGCAGTCAGAACTTTCTAGTCAGCATGCAGTATCAATTCCGGGCACCAATTTTGGACCACCCTCATCAGCTGGGACAGTTCCAGATCTACCCGCAGAAATTTTGGGGGAGCCAAGCACTAGTAGTTTGTTGGCTGCTGTAATGAAGAGTGGAATTTTCTCCAACCATTCAATGACCAGTAGCATGCAGCAGAATATCAGCTTCCAAGATGTGGGAAATATGCCACCCCGTTCAAGCATCAAACCTCCTTTACCAAGCCGGTCTTCTCCCGCCAAGACTGTGGGAGAGTCTTCTTTAGGTCCTCAATCTCTTGAAAGCCCATCAGTTCTGGTTAAGCTATCTCAGACTAAGCTAGAAGAGACATCGTTGCCACCTGATCCACTTCCACCTTCATCTCCTTTGAATAGTGCATCCACTGAAACTTCAAATGTGGTAAATGATGCTTCTAGTCCGATTTCGAACCTTTTGAGCTCATTGGTTGCAAAGGGCCTCATATCTGCTTCAAAAGGAGAATTAACTAATAGCGCAACATCCCAGATGCCTTCACAGCCTGAAAGTAAGTTAGGTGATGCTGTGACTTGTTCTTTACCAGTTCCTTCCATCCCTGTTTCCTCTTCCAGTCTGTCATCTATGGGACTTGAATCACCTTCAAAAGCTGCTGCTAAAAGCTCCACTAGTCCACCTCCATCCTCCACAACTGAAATAAATAACCTTATAGGCTTTGAATTTAGTTCACATGTTATTCGCAAATTTCATCCATCTG
TGATCAGTGGACTCTTTGACGACATTCCATATCAATGTAAGATCTGTGGTCTTAGACTGAAACTTGAAGAACAGTTGGATACACACTCGCAGTGGCACACAATTAGTACTGAGGCAAACAATTCAAATAGGGCACCAAGAAGATGGTATCCAAGTTCAGATGATTGGGTTTCTGGAAATGCTAGACTTCTGCTTGACGCTGCCACTTCTCTGGACAAGTCCAACATGATGGAAGAAGATAGCGAGCCAATGGTTCCTGCAGATGAAGATCAATTTGCTTGTGTCTTATGTGGTGAACTTTTCGAAGATTTCTATAGTCAAGAGTTGGGTAAGTGGATGTTCAAAGGAGCAATGTATATCACCATCCCTTCAGCGGGTAGTAAGGTAGGAAGCACAAATGAACAAGTTGCTAGAGGACCCATTGTGCACACAGAGTGTTTAACAGAAAGTTCAGTATATGACTTGGGACTGGCAACTGATGTTAAGATGGTAATGTTCTTAGTTATTTATACTTTTGCAGGACTTGATTTGGTTTCACTATCATCCATTGATCTCCTCGATGATTTTATGTGCTGTTCCTAGACTTTTTTATTTTTTATTTATTTTTATTATTATTATTTTTAATGTGCTGTTACAAAATGATGTTTTTGTTTATCTGAGGCCAGTCCAAAGATAGTGCTGAATGTTATGGCATTGGACTGATAAAGTCAAGGAGAAGATAAAGCCCTGTTCTATGAATTTGGGTTGGTAGAATCAAGTGTTTGGTTCCCCATTCTTTGGGATGCTACTTGTCATTGCTAGGATCAATCATGAAACACCTGAATCATAATCGGTTTCTTCTTTTGGTTCTCCACAGTGAGCTCCTTTTACTGACATTTAGGACCTATAAAGCAGACACTTCAAATTACATGATATAGTGCCTACTTGCGTTAATATATCTTGTCGCACACTGAAAATATGCTCAAATCTGTATCCCTGCCTTGTATTGATGCCGAGCATGAGATTGCATGCATTCTCTTGATTTAATTTTTGCGTGTTATTAATCTATGCTTCTTTGCATTATGTCCAAACCTTCTACAGGAAATGGATGTATGATGCTTCCCCTGCAAAACGTCATATAGGAACTACTCTGGTGGACATCCTGCTTGTGCAATTGGCAGGGGGGATGCGAAAAGGGAGTACGAATATGAGTGCTTCACTCGTGGTAGTTCCGTTGAGTGCCAAAAGAGATAATAGGGAGTGGTAGCTCTATACTGGTATTTTCTAGCTTCCAAATTCGTTTTTGTTTTCTTTCTGTTTCTTTTTCTTCTAAAGATATATAGTTGAGAATACATCAATGCAATGATCATTTTTATTCAGATCTTCACCGTCTTGTTTCTCAGCGAATTTTCACTAAACATCAATATCTGTTGATATAAATATGAGGCAGTGCACTCCAAGTTCTGGTTGATTGGCATCTCTATCGACAAGCCTTTGTATTATCGAAATCGTACTAGCAAAGTGGTTTTGATTCACCATCACTTCTCTTTTCCTTGTAATTTTTCCTTCTCTTAGCTCTGCATATCATGTTAGACAGTAATGGAGAGTTGTTTAAGATTCTCTCAGCTTTTGAGAAGGGTTGTTCACTTGTCTTGAGGGTCTGGATATGACTATGATGATGTTGGATTTTGCTTCTTTTTGCTTCTATCATTTGCCCCCACCTTGTTTGGCTGATTGGACAAGTTTCGCATGGCAGATATTGTGATACTTATGGCAGATTTATTACAATAATTGTATTCAACAATCATGAACTGTGGAATTATGAGAACATTTCTGTTTGGATGGCCATTTGAGCGTTGAAAAGAAGCAAGTGATTTGACTCATATGTAAAAATTCCTTGGATTCATAGCCTTTGGACATAAGCTTGTGTACCAATTATAGCAGGTACAAAACATGATTGCTACCTAAATGCCACCCCACGCTGTGCGTGGAGAAAAGAAAAAAGGAACTTGCAACACTACT
TTGGATGGGTTTTTATCAATTTAAAATTCTTTTTTGCCCTTCTAGTAATAGTATAAATTAAGCAAAGGTAAAAGTTGATCTGTTCTGCCTCCAATTGATGTAATAT

mRNA sequence

ATGGAAATGGAGAGCTCGCGGAGACCTTTCGATCGAACGAGGGAACCGGGTTTGAAGAAGCCCCGACTGGCCGATGAGGCTGAGCGCGGTGGGAACATCAATGGCCGGCCGTTTCCGCAGAGACCAGTTGTTTCCGGGACCAATATTGTGCAACCCAGATTTAGAGCAAGTGATAGAGATTCGGGAAGCAGTGACTCTGGTCGAGGGGGGTATCAGCCTCAGCCGCCGCAGCATCAGGAACTCGTCAGCCAGTATAGGACCGCCCTTGCTGAGCTGACTTTCAATTCGAAACCAATCATCACGAATTTGACCATAATCGCGGGTGAAAATCTCCAGGCTGCAAAAGCGATCTCTGCCACCGTTTGCGCCAACATTCTCGAGGTTTCCAGTGAGCAGAAGCTACCATCACTTTATCTACTGGACAGTATTGTAAAGAATATTGGAAGAGATTACATAAAGTACTTTGCAGCAAGACTGCCCGAGGTATTCTGCAAAGCTTATAGGCAAGTTGACCCTTCTGTACATCCAAGTATGAGACATCTCTTTGGCACCTGGAAAGGAGTGTTTCCTCCTCAAACTCTGCAGATTATAGAGAAAGAACTTGGCTTCATGCCCAGCAGTAGTTCTTCTTCTGGGACCATAACCTCAAAGCCAGATTTGCAGGCACAACGTCCAGCCCATAGTATCCATGTAAATCCCAAGTATATAGAGAGGCAACGGCTTCAGCAGTCAGGCAGGGTGAAAGGAATGACTAGTGATGCTACTGGCGCAACTACAAATGTAACTCAGGATGTTGCCCAAGCCAAAATTAGTACGGGACGTCCATGGGCAGATGCTCCAATCAAAGTGCTTGACATTCATCGTCCACTTAGAGATGCACCAAATGATATGGCACAAGAGAAGAACATCACGGCTGCATATGCAGACTATGAATATGGTTCTGATCTTCCAAGGACAACAGGTATCGGAAGAAGGGTTGTTGATGAAGGGCGAGACAAACCTTGGTCTTCAGCTGGTAGCAATTTGGCAGAGAAGTTATCTGGCCAAAGAAATGGGTTCAACATCAAGCTTGGATATGAAAATTACCCTGCACCCAAGTCCGCAAACACTGGTGCACGTCTACTGCCCATGCAAAATTTTTCAAGCAGCAGCAGCAACAGAGTATTGTCTACTAACTGGAAGAACTCTGAGGAAGAGGAGTTTATGTGGGGTGAAATGAACTCTATTATGACAGGTCATGGTGCACCTGCCATTGCCAGTAGCACTGGGAAAGATCAATGGACTCCTGAGGATTCGGATAATTCGGGTATTGAAAATAAGCCATTAAGTGTACGGGATACTGGGGCAAGTGTTGATAGAGAAGCTTCCAGTGATTCACAATCATCAGAGCAGAGAGAACTAGGGGATTCTGGACAGCAAAGGTCATCAATGTGGCAAGTGCAGGAGCCAATATCTCTGGACGGGCTGAGAGGTGGGGTTCCTAGAAAGAATTCAGCTGAGTCAGGAGGTTATGGTGCTAACTCTTCTGTGGATCAAATGGGAGGTCGACCACAGATCACATCATCTAATATTGGAGCTTCAGGACATGGGTTTCTGAATAAAGGAGGTTCAGGGTCCATTGGGACTGTAGGCCATCAAAGGTTTCCATCACGAAGTGTTGCATTCCCATCCGGACAGCCACCCTTGCACCAACGCCCCCCTTCATTATTGTTAGTGGATCACGTTCCTCATCAAATGCACGACCATAAAACTTCTTCATTATCTAATCTTGACCCACGTAAAAGGCATATGCAGGATGCTGCCCTTGGCCTGCATCCTAGCGTTCGGCCAGATAACCTTCAAAAATCACAGCCTCAGGACCTTCAAGCTTCAGCTTCATCCGTACCTGCTTCTCAACCCAAGCACCAGTTCTCTTTATCTGAGTCACTAAAACCTGACGTCACGCAGTCAGAACTTTCTAGTCAGCATGCAGTATCAATTCCGGGCACCAATTTTGGACC
ACCCTCATCAGCTGGGACAGTTCCAGATCTACCCGCAGAAATTTTGGGGGAGCCAAGCACTAGTAGTTTGTTGGCTGCTGTAATGAAGAGTGGAATTTTCTCCAACCATTCAATGACCAGTAGCATGCAGCAGAATATCAGCTTCCAAGATGTGGGAAATATGCCACCCCGTTCAAGCATCAAACCTCCTTTACCAAGCCGGTCTTCTCCCGCCAAGACTGTGGGAGAGTCTTCTTTAGGTCCTCAATCTCTTGAAAGCCCATCAGTTCTGGTTAAGCTATCTCAGACTAAGCTAGAAGAGACATCGTTGCCACCTGATCCACTTCCACCTTCATCTCCTTTGAATAGTGCATCCACTGAAACTTCAAATGTGGTAAATGATGCTTCTAGTCCGATTTCGAACCTTTTGAGCTCATTGGTTGCAAAGGGCCTCATATCTGCTTCAAAAGGAGAATTAACTAATAGCGCAACATCCCAGATGCCTTCACAGCCTGAAAGTAAGTTAGGTGATGCTGTGACTTGTTCTTTACCAGTTCCTTCCATCCCTGTTTCCTCTTCCAGTCTGTCATCTATGGGACTTGAATCACCTTCAAAAGCTGCTGCTAAAAGCTCCACTAGTCCACCTCCATCCTCCACAACTGAAATAAATAACCTTATAGGCTTTGAATTTAGTTCACATGTTATTCGCAAATTTCATCCATCTGTGATCAGTGGACTCTTTGACGACATTCCATATCAATGTAAGATCTGTGGTCTTAGACTGAAACTTGAAGAACAGTTGGATACACACTCGCAGTGGCACACAATTAGTACTGAGGCAAACAATTCAAATAGGGCACCAAGAAGATGGTATCCAAGTTCAGATGATTGGGTTTCTGGAAATGCTAGACTTCTGCTTGACGCTGCCACTTCTCTGGACAAGTCCAACATGATGGAAGAAGATAGCGAGCCAATGGTTCCTGCAGATGAAGATCAATTTGCTTGTGTCTTATGTGGTGAACTTTTCGAAGATTTCTATAGTCAAGAGTTGGGTAAGTGGATGTTCAAAGGAGCAATGTATATCACCATCCCTTCAGCGGGTAGTAAGGTAGGAAGCACAAATGAACAAGTTGCTAGAGGACCCATTGTGCACACAGAGTGTTTAACAGAAAGTTCAGTATATGACTTGGGACTGGCAACTGATGTTAAGATGGTAATGTTCTTAGTTATTTATACTTTTGCAGGACTTGATTTGGTTTCACTATCATCCATTGATCTCCTCGATGATTTTATGTGCTGCCAGTCCAAAGATAGTGCTGAATGTTATGGCATTGGACTGATAAAGAAATGGATGTATGATGCTTCCCCTGCAAAACGTCATATAGGAACTACTCTGGTGGACATCCTGCTTGTGCAATTGGCAGGGGGGATGCGAAAAGGGAGTACGAATATGAGTGCTTCACTCGTGGTAGTTCCGTTGAGTGCCAAAAGAGATAATAGGGAGTGGTAG

Coding sequence (CDS)

ATGGAAATGGAGAGCTCGCGGAGACCTTTCGATCGAACGAGGGAACCGGGTTTGAAGAAGCCCCGACTGGCCGATGAGGCTGAGCGCGGTGGGAACATCAATGGCCGGCCGTTTCCGCAGAGACCAGTTGTTTCCGGGACCAATATTGTGCAACCCAGATTTAGAGCAAGTGATAGAGATTCGGGAAGCAGTGACTCTGGTCGAGGGGGGTATCAGCCTCAGCCGCCGCAGCATCAGGAACTCGTCAGCCAGTATAGGACCGCCCTTGCTGAGCTGACTTTCAATTCGAAACCAATCATCACGAATTTGACCATAATCGCGGGTGAAAATCTCCAGGCTGCAAAAGCGATCTCTGCCACCGTTTGCGCCAACATTCTCGAGGTTTCCAGTGAGCAGAAGCTACCATCACTTTATCTACTGGACAGTATTGTAAAGAATATTGGAAGAGATTACATAAAGTACTTTGCAGCAAGACTGCCCGAGGTATTCTGCAAAGCTTATAGGCAAGTTGACCCTTCTGTACATCCAAGTATGAGACATCTCTTTGGCACCTGGAAAGGAGTGTTTCCTCCTCAAACTCTGCAGATTATAGAGAAAGAACTTGGCTTCATGCCCAGCAGTAGTTCTTCTTCTGGGACCATAACCTCAAAGCCAGATTTGCAGGCACAACGTCCAGCCCATAGTATCCATGTAAATCCCAAGTATATAGAGAGGCAACGGCTTCAGCAGTCAGGCAGGGTGAAAGGAATGACTAGTGATGCTACTGGCGCAACTACAAATGTAACTCAGGATGTTGCCCAAGCCAAAATTAGTACGGGACGTCCATGGGCAGATGCTCCAATCAAAGTGCTTGACATTCATCGTCCACTTAGAGATGCACCAAATGATATGGCACAAGAGAAGAACATCACGGCTGCATATGCAGACTATGAATATGGTTCTGATCTTCCAAGGACAACAGGTATCGGAAGAAGGGTTGTTGATGAAGGGCGAGACAAACCTTGGTCTTCAGCTGGTAGCAATTTGGCAGAGAAGTTATCTGGCCAAAGAAATGGGTTCAACATCAAGCTTGGATATGAAAATTACCCTGCACCCAAGTCCGCAAACACTGGTGCACGTCTACTGCCCATGCAAAATTTTTCAAGCAGCAGCAGCAACAGAGTATTGTCTACTAACTGGAAGAACTCTGAGGAAGAGGAGTTTATGTGGGGTGAAATGAACTCTATTATGACAGGTCATGGTGCACCTGCCATTGCCAGTAGCACTGGGAAAGATCAATGGACTCCTGAGGATTCGGATAATTCGGGTATTGAAAATAAGCCATTAAGTGTACGGGATACTGGGGCAAGTGTTGATAGAGAAGCTTCCAGTGATTCACAATCATCAGAGCAGAGAGAACTAGGGGATTCTGGACAGCAAAGGTCATCAATGTGGCAAGTGCAGGAGCCAATATCTCTGGACGGGCTGAGAGGTGGGGTTCCTAGAAAGAATTCAGCTGAGTCAGGAGGTTATGGTGCTAACTCTTCTGTGGATCAAATGGGAGGTCGACCACAGATCACATCATCTAATATTGGAGCTTCAGGACATGGGTTTCTGAATAAAGGAGGTTCAGGGTCCATTGGGACTGTAGGCCATCAAAGGTTTCCATCACGAAGTGTTGCATTCCCATCCGGACAGCCACCCTTGCACCAACGCCCCCCTTCATTATTGTTAGTGGATCACGTTCCTCATCAAATGCACGACCATAAAACTTCTTCATTATCTAATCTTGACCCACGTAAAAGGCATATGCAGGATGCTGCCCTTGGCCTGCATCCTAGCGTTCGGCCAGATAACCTTCAAAAATCACAGCCTCAGGACCTTCAAGCTTCAGCTTCATCCGTACCTGCTTCTCAACCCAAGCACCAGTTCTCTTTATCTGAGTCACTAAAACCTGACGTCACGCAGTCAGAACTTTCTAGTCAGCATGCAGTATCAATTCCGGGCACCAATTTTGGACC
ACCCTCATCAGCTGGGACAGTTCCAGATCTACCCGCAGAAATTTTGGGGGAGCCAAGCACTAGTAGTTTGTTGGCTGCTGTAATGAAGAGTGGAATTTTCTCCAACCATTCAATGACCAGTAGCATGCAGCAGAATATCAGCTTCCAAGATGTGGGAAATATGCCACCCCGTTCAAGCATCAAACCTCCTTTACCAAGCCGGTCTTCTCCCGCCAAGACTGTGGGAGAGTCTTCTTTAGGTCCTCAATCTCTTGAAAGCCCATCAGTTCTGGTTAAGCTATCTCAGACTAAGCTAGAAGAGACATCGTTGCCACCTGATCCACTTCCACCTTCATCTCCTTTGAATAGTGCATCCACTGAAACTTCAAATGTGGTAAATGATGCTTCTAGTCCGATTTCGAACCTTTTGAGCTCATTGGTTGCAAAGGGCCTCATATCTGCTTCAAAAGGAGAATTAACTAATAGCGCAACATCCCAGATGCCTTCACAGCCTGAAAGTAAGTTAGGTGATGCTGTGACTTGTTCTTTACCAGTTCCTTCCATCCCTGTTTCCTCTTCCAGTCTGTCATCTATGGGACTTGAATCACCTTCAAAAGCTGCTGCTAAAAGCTCCACTAGTCCACCTCCATCCTCCACAACTGAAATAAATAACCTTATAGGCTTTGAATTTAGTTCACATGTTATTCGCAAATTTCATCCATCTGTGATCAGTGGACTCTTTGACGACATTCCATATCAATGTAAGATCTGTGGTCTTAGACTGAAACTTGAAGAACAGTTGGATACACACTCGCAGTGGCACACAATTAGTACTGAGGCAAACAATTCAAATAGGGCACCAAGAAGATGGTATCCAAGTTCAGATGATTGGGTTTCTGGAAATGCTAGACTTCTGCTTGACGCTGCCACTTCTCTGGACAAGTCCAACATGATGGAAGAAGATAGCGAGCCAATGGTTCCTGCAGATGAAGATCAATTTGCTTGTGTCTTATGTGGTGAACTTTTCGAAGATTTCTATAGTCAAGAGTTGGGTAAGTGGATGTTCAAAGGAGCAATGTATATCACCATCCCTTCAGCGGGTAGTAAGGTAGGAAGCACAAATGAACAAGTTGCTAGAGGACCCATTGTGCACACAGAGTGTTTAACAGAAAGTTCAGTATATGACTTGGGACTGGCAACTGATGTTAAGATGGTAATGTTCTTAGTTATTTATACTTTTGCAGGACTTGATTTGGTTTCACTATCATCCATTGATCTCCTCGATGATTTTATGTGCTGCCAGTCCAAAGATAGTGCTGAATGTTATGGCATTGGACTGATAAAGAAATGGATGTATGATGCTTCCCCTGCAAAACGTCATATAGGAACTACTCTGGTGGACATCCTGCTTGTGCAATTGGCAGGGGGGATGCGAAAAGGGAGTACGAATATGAGTGCTTCACTCGTGGTAGTTCCGTTGAGTGCCAAAAGAGATAATAGGGAGTGGTAG

Protein sequence

MEMESSRRPFDRTREPGLKKPRLADEAERGGNINGRPFPQRPVVSGTNIVQPRFRASDRDSGSSDSGRGGYQPQPPQHQELVSQYRTALAELTFNSKPIITNLTIIAGENLQAAKAISATVCANILEVSSEQKLPSLYLLDSIVKNIGRDYIKYFAARLPEVFCKAYRQVDPSVHPSMRHLFGTWKGVFPPQTLQIIEKELGFMPSSSSSSGTITSKPDLQAQRPAHSIHVNPKYIERQRLQQSGRVKGMTSDATGATTNVTQDVAQAKISTGRPWADAPIKVLDIHRPLRDAPNDMAQEKNITAAYADYEYGSDLPRTTGIGRRVVDEGRDKPWSSAGSNLAEKLSGQRNGFNIKLGYENYPAPKSANTGARLLPMQNFSSSSSNRVLSTNWKNSEEEEFMWGEMNSIMTGHGAPAIASSTGKDQWTPEDSDNSGIENKPLSVRDTGASVDREASSDSQSSEQRELGDSGQQRSSMWQVQEPISLDGLRGGVPRKNSAESGGYGANSSVDQMGGRPQITSSNIGASGHGFLNKGGSGSIGTVGHQRFPSRSVAFPSGQPPLHQRPPSLLLVDHVPHQMHDHKTSSLSNLDPRKRHMQDAALGLHPSVRPDNLQKSQPQDLQASASSVPASQPKHQFSLSESLKPDVTQSELSSQHAVSIPGTNFGPPSSAGTVPDLPAEILGEPSTSSLLAAVMKSGIFSNHSMTSSMQQNISFQDVGNMPPRSSIKPPLPSRSSPAKTVGESSLGPQSLESPSVLVKLSQTKLEETSLPPDPLPPSSPLNSASTETSNVVNDASSPISNLLSSLVAKGLISASKGELTNSATSQMPSQPESKLGDAVTCSLPVPSIPVSSSSLSSMGLESPSKAAAKSSTSPPPSSTTEINNLIGFEFSSHVIRKFHPSVISGLFDDIPYQCKICGLRLKLEEQLDTHSQWHTISTEANNSNRAPRRWYPSSDDWVSGNARLLLDAATSLDKSNMMEEDSEPMVPADEDQFACVLCGELFEDFYSQELGKWMFKGAMYITIPSAGSKVGSTNEQVARGPIVHTECLTESSVYDLGLATDVKMVMFLVIYTFAGLDLVSLSSIDLLDDFMCCQSKDSAECYGIGLIKKWMYDASPAKRHIGTTLVDILLVQLAGGMRKGSTNMSASLVVVPLSAKRDNREW
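
The relationship between the coding sequence and the protein sequence can be checked by translating the CDS codon by codon. The following is a minimal sketch, assuming the standard nuclear genetic code; the `translate` function is a hypothetical helper written for this illustration, not part of any database tooling.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order at each position.
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(_BASES, repeat=3), _AMINO)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper().replace("\n", "").replace(" ", "")
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # "X" marks ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First six codons of the CDS listed above, followed by a stop codon.
print(translate("ATGGAAATGGAGAGCTCGTAG"))  # MEMESS
```

The first six codons of the CDS (ATG GAA ATG GAG AGC TCG) translate to MEMESS, matching the start of the protein sequence. Passing the full CDS, with its line breaks joined, should reproduce the whole protein if the annotation is consistent.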
Homology
BLAST of Spg033175 vs. NCBI nr
Match: KAA0043917.1 (polyadenylation and cleavage factor-like protein 4-like isoform X1 [Cucumis melo var. makuwa])

HSP 1 Score: 1831.6 bits (4743), Expect = 0.0e+00
Identity = 963/1154 (83.45%), Positives = 1028/1154 (89.08%), Query Frame = 0

Query: 1    MEMESSRRPFDRTREPGLKKPRLADEAERGGNINGRPFPQRPVVSGTNIV-QPRFRASDR 60
            MEMESSRRPFDRTREPGLKKPRLADEA+RG NINGRPFPQRPVVSG NIV QPRFRASDR
Sbjct: 1    MEMESSRRPFDRTREPGLKKPRLADEADRGANINGRPFPQRPVVSGNNIVQQPRFRASDR 60

Query: 61   DSGSSDSGRGGYQPQPPQHQELVSQYRTALAELTFNSKPIITNLTIIAGENLQAAKAISA 120
            DSGSSDSGRGGYQPQPPQHQELVSQYRTALAELTFNSKPIITNLTIIAGENLQAAKAIS 
Sbjct: 61   DSGSSDSGRGGYQPQPPQHQELVSQYRTALAELTFNSKPIITNLTIIAGENLQAAKAIST 120

Query: 121  TVCANILEVSSEQKLPSLYLLDSIVKNIGRDYIKYFAARLPEVFCKAYRQVDPSVHPSMR 180
            T+ ANILEV SEQKLPSLYLLDSIVKNIGRDYIKYFAARLPEVFCKAYRQVDPSVHPSMR
Sbjct: 121  TIYANILEVPSEQKLPSLYLLDSIVKNIGRDYIKYFAARLPEVFCKAYRQVDPSVHPSMR 180

Query: 181  HLFGTWKGVFPPQTLQIIEKELGFMPSSSSSSGTITSKPDLQAQRPA-HSIHVNPKYIER 240
            HLFGTWKGVFP QTLQIIEKELGF+P+ SSSS  I SKPDLQAQRP  HSIHVNPKYIER
Sbjct: 181  HLFGTWKGVFPLQTLQIIEKELGFVPTGSSSSVAINSKPDLQAQRPTPHSIHVNPKYIER 240

Query: 241  QRLQQSGRVKGMTSDATGATTNVTQDVAQAKISTGRPWADAPIKVLDIHRPLRDAPNDMA 300
            QRLQQSGRVKGM +DATG +TNV+QDVAQAKISTGRPWADAPIKVLDI RPLRDAPNDMA
Sbjct: 241  QRLQQSGRVKGMPTDATGGSTNVSQDVAQAKISTGRPWADAPIKVLDIQRPLRDAPNDMA 300

Query: 301  QEKNITAAYADYEYGSDLPRTTGIGRRVVDEGRDKPWSSAGSNLAEKLSGQRNGFNIKLG 360
            QEKN+TA Y+DYEYGSDL RT+ +GRRVVDEGRDKPW SAGSNL+EKLSGQRNGFNIKLG
Sbjct: 301  QEKNVTAGYSDYEYGSDLSRTSSVGRRVVDEGRDKPWPSAGSNLSEKLSGQRNGFNIKLG 360

Query: 361  YENYPAPKSANTGARLLPMQNFSSSSSNRVLSTNWKNSEEEEFMWGEMNSIMTGHGAPAI 420
            YENY APKS NTGARLLP+QNFSSSS+NRVLSTNWKNSEEEEFMWG+M+S++TGHGAPAI
Sbjct: 361  YENYSAPKSTNTGARLLPVQNFSSSSNNRVLSTNWKNSEEEEFMWGDMSSMLTGHGAPAI 420

Query: 421  ASSTGKDQWTPEDSDNSGIENKPLSVRDTGASVDREASSDSQSSEQRELGDSGQQRSSMW 480
             SSTGKDQWTPEDSDNSGI+NK LSVRDTGASVDREASSDSQSSEQRELGDSGQQRSS W
Sbjct: 421  NSSTGKDQWTPEDSDNSGIDNKHLSVRDTGASVDREASSDSQSSEQRELGDSGQQRSSTW 480

Query: 481  QVQEPISLDGLRGGVPRKNSAESGGYGA--------NSSVDQMGGRPQITSSNIGASGHG 540
            Q+QE ISLDGLR GVPRKNS +SGGYGA        NSSVDQMGGRPQIT SNIGASGHG
Sbjct: 481  QLQESISLDGLRAGVPRKNSGQSGGYGATLTALSGTNSSVDQMGGRPQITPSNIGASGHG 540

Query: 541  FLNKGGSGSIGTVGHQRFPSRSVAFPSGQPPLHQRPPSLLLVDHVPHQMHDHKTSSLSNL 600
            FLNKGGSG +G VGHQRFPSRSVAFPSGQPPLHQR PS LLVDHVPHQ+HD KT+S SNL
Sbjct: 541  FLNKGGSGPLGNVGHQRFPSRSVAFPSGQPPLHQRSPSQLLVDHVPHQIHDQKTTSFSNL 600

Query: 601  DPRKRHMQDAALGLHPSVRPDNLQKSQPQDLQASASSVPASQPKHQFSLSESLKPDVTQS 660
            DPRKRHMQDAALGLHPSVRPDN QK Q  DL+A ASS+P SQP+HQFSLSESLKPDVTQS
Sbjct: 601  DPRKRHMQDAALGLHPSVRPDNHQKPQTHDLRALASSIPGSQPRHQFSLSESLKPDVTQS 660

Query: 661  ELSSQHAVSIPGTNFGPPSSAGTVPD-LPAEILGEPSTSSLLAAVMKSGIFSNHSMTSSM 720
            ELSSQ AVSIPGT+FGP SSAGTVPD LPAEILG PSTSSLLAAVMKSG+FSNHS+TS+M
Sbjct: 661  ELSSQLAVSIPGTDFGPSSSAGTVPDRLPAEILGNPSTSSLLAAVMKSGLFSNHSITSNM 720

Query: 721  QQNISFQDVGNMPPRSSIKPPLPSRSSPAKTV------GESSLGPQSLESPSVLVKLSQT 780
            QQN+SFQDVGNM PRSSIKPPLP+RSSPA T       GESS+GP S+ESPS +VKLS+T
Sbjct: 721  QQNLSFQDVGNMKPRSSIKPPLPNRSSPAHTFSEPKIQGESSVGPPSVESPSTMVKLSRT 780

Query: 781  KLEETSLPPDPLPPSSPLNSASTETSNVVNDASSPISNLLSSLVAKGLISASKGELTNSA 840
            K+EE SLP DPLPPSSP++SASTETS+VVNDASSPISNLLSSLVAKGLISASKGE TNS 
Sbjct: 781  KVEEPSLPSDPLPPSSPMDSASTETSHVVNDASSPISNLLSSLVAKGLISASKGESTNSV 840

Query: 841  TSQMPSQPES-KLGDAVTCSLPVPSIPVSSSSLSSMGLESPSKAAAKSSTSPPPSSTTEI 900
            TSQMPSQPE+ K GDAVT S+PVPSI VSSS  SS  LESP KAAAKSSTSPPPS+TTEI
Sbjct: 841  TSQMPSQPENLKSGDAVTSSVPVPSIAVSSSCHSSTKLESPLKAAAKSSTSPPPSATTEI 900

Query: 901  NNLIGFEFSSHVIRKFHPSVISGLFDDIPYQCKICGLRLKLEEQLDTHSQWHTISTEANN 960
            NNLIGFEFSSHVIRKFHPSVISGLFDDIPYQCKICGLRLK EEQLDTHS+WHT+ TEANN
Sbjct: 901  NNLIGFEFSSHVIRKFHPSVISGLFDDIPYQCKICGLRLKCEEQLDTHSRWHTLRTEANN 960

Query: 961  SNRAPRRWYPSSDDWVSGNARLLLDAATSLDKSNMMEEDSEPMVPADEDQFACVLCGELF 1020
            S+ APRRWYP SDDW+SGNAR LLDA TSLD+S++MEED+EPMVPADEDQFACV+CGELF
Sbjct: 961  SSTAPRRWYPCSDDWISGNARFLLDAETSLDESDLMEEDNEPMVPADEDQFACVICGELF 1020

Query: 1021 EDFYSQELGKWMFKGAMYITIPSAGSKVGSTNEQVARGPIVHTECLTESSVYDLGLATDV 1080
            EDFYSQELG WM+KGA YITIPS GS+VG TNEQVA+GPIVHT CLTESSVYD+GLATD+
Sbjct: 1021 EDFYSQELGNWMYKGATYITIPSVGSEVGGTNEQVAKGPIVHTTCLTESSVYDVGLATDI 1080

Query: 1081 KMVMFLVIYTFAGLDLVSLSSIDLLDDFMCCQSKDSAECYGIGLIKKWMYDASPAKRHIG 1137
            KM M + +       ++   +                        +KWMYDASPAKRH+G
Sbjct: 1081 KMAMIVRMLRHWAESIMKHQN------------------------RKWMYDASPAKRHVG 1130
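
The percentages in each HSP summary line follow directly from the counts: identities (or positives) divided by the alignment length. The following is a minimal sketch of pulling those numbers out of the summary line and recomputing them; `parse_hsp_stats` is a hypothetical helper, and the line format is assumed to match the one shown in this record.

```python
import re

# "Posi?tives" also accepts the "Postives" spelling emitted by some BLAST viewers.
_HSP_RE = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+)\s*\([\d.]+%\),\s*"
    r"Posi?tives\s*=\s*(\d+)/(\d+)\s*\([\d.]+%\)"
)


def parse_hsp_stats(line: str) -> dict:
    """Extract identity/positive counts and recompute the percentages."""
    match = _HSP_RE.search(line)
    if match is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    identities, ident_len, positives, pos_len = map(int, match.groups())
    if ident_len != pos_len:
        raise ValueError("identity and positive alignment lengths differ")
    return {
        "identities": identities,
        "positives": positives,
        "align_len": ident_len,
        "pct_identity": round(100 * identities / ident_len, 2),
        "pct_positives": round(100 * positives / ident_len, 2),
    }


stats = parse_hsp_stats(
    "Identity = 963/1154 (83.45%), Positives = 1028/1154 (89.08%), Query Frame = 0"
)
print(stats["pct_identity"], stats["pct_positives"])
```

For the first hit this gives 963/1154 = 83.45% identity and 1028/1154 = 89.08% positives, in agreement with the reported values.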

BLAST of Spg033175 vs. NCBI nr
Match: XP_038906013.1 (polyadenylation and cleavage factor homolog 4 [Benincasa hispida])

HSP 1 Score: 1822.8 bits (4720), Expect = 0.0e+00
Identity = 952/1084 (87.82%), Positives = 1001/1084 (92.34%), Query Frame = 0

Query: 1    MEMESSRRPFDRTREPGLKKPRLADEAERGGNINGRPFPQRPVVSGTNIV-QPRFRASDR 60
            MEMESSRRPFDRTREPGLKKPRLADEA+RGGNINGRPFPQRPVVSG NIV QPRFRASDR
Sbjct: 1    MEMESSRRPFDRTREPGLKKPRLADEADRGGNINGRPFPQRPVVSGNNIVPQPRFRASDR 60

Query: 61   DSGSSDSGRGGYQPQPPQHQELVSQYRTALAELTFNSKPIITNLTIIAGENLQAAKAISA 120
            DSGSSDSGRGGYQPQPPQHQELVSQYRTALAELTFNSKPIITNLTIIAGENLQAAKAISA
Sbjct: 61   DSGSSDSGRGGYQPQPPQHQELVSQYRTALAELTFNSKPIITNLTIIAGENLQAAKAISA 120

Query: 121  TVCANILEVSSEQKLPSLYLLDSIVKNIGRDYIKYFAARLPEVFCKAYRQVDPSVHPSMR 180
            TV ANILEV SEQKLPSLYLLDSIVKNIGRDYIKYFAARLPEVFCKAYRQVDPSVHPSMR
Sbjct: 121  TVYANILEVPSEQKLPSLYLLDSIVKNIGRDYIKYFAARLPEVFCKAYRQVDPSVHPSMR 180

Query: 181  HLFGTWKGVFPPQTLQIIEKELGFMPSSSSSSGTITSKPDLQAQRPA-HSIHVNPKYIER 240
            HLFGTWKGVFPPQ LQIIEKELGF+PS SSSSG ITSKPDLQAQRP  HSIHVNPKYIER
Sbjct: 181  HLFGTWKGVFPPQALQIIEKELGFVPSGSSSSGNITSKPDLQAQRPTPHSIHVNPKYIER 240

Query: 241  QRLQQSGRVKGMTSDATGATTNVTQDVAQAKISTGRPWADAPIKVLDIHRPLRDAPNDMA 300
            QRLQQSGRVKGMTSD TGATT  +QDVAQAKISTGRPW DAPIKVLDI RPLRDAPNDMA
Sbjct: 241  QRLQQSGRVKGMTSDGTGATTTASQDVAQAKISTGRPWVDAPIKVLDIQRPLRDAPNDMA 300

Query: 301  QEKNITAAYADYEYGSDLPRTTGIGRRVVDEGRDKPWSSAGSNLAEKLSGQRNGFNIKLG 360
            QEKNITA YADYEYGSDL RT+G+GRRVVDEGRDKPWSSAGSNLA+KLSGQRNGFN+KLG
Sbjct: 301  QEKNITAGYADYEYGSDLSRTSGVGRRVVDEGRDKPWSSAGSNLADKLSGQRNGFNVKLG 360

Query: 361  YENYPAPKSANTGARLLPMQNFSSSSSNRVLSTNWKNSEEEEFMWGEMNSIMTGHGAPAI 420
            YENYPAPKSANTGARLLPMQNFSS SSNRVLSTNWKNSEEEEFMWGEMNS++TGHGAPAI
Sbjct: 361  YENYPAPKSANTGARLLPMQNFSSGSSNRVLSTNWKNSEEEEFMWGEMNSMLTGHGAPAI 420

Query: 421  ASSTGKDQWTPEDSDNSGIENKPLSVRDTGASVDREASSDSQSSEQRELGDSGQQRSSMW 480
            A STGKDQWTPEDSDNSGI+NKPLS+RDTGASVDREASSDSQSSEQRELGDSGQQRSS W
Sbjct: 421  AGSTGKDQWTPEDSDNSGIDNKPLSLRDTGASVDREASSDSQSSEQRELGDSGQQRSSTW 480

Query: 481  QVQEPISLDGLRGGVPRKNSAESGGY--------GANSSVDQMGGRPQITSSNIGASGHG 540
            Q+QE ISLDGLR GVPRKNS +SGGY        GANSSVDQMGGRPQITSSNIGASGHG
Sbjct: 481  QLQESISLDGLRSGVPRKNSGQSGGYGATLTALSGANSSVDQMGGRPQITSSNIGASGHG 540

Query: 541  FLNKGGSGSIGTVGHQRFPSRSVAFPSGQPPLHQRPPSLLLVDHVPHQMHDHKTSSLSNL 600
            FL+KGGSG +GTVGHQRFPSRSVAFPSGQP LHQ PPS  LVDH+PHQ+HD K +S SNL
Sbjct: 541  FLSKGGSGPLGTVGHQRFPSRSVAFPSGQPSLHQCPPSPSLVDHIPHQIHDDKPTSFSNL 600

Query: 601  DPRKRHMQDAALGLHPSVRPDNLQKSQPQDLQASASSVPASQPKHQFSLSESLKPDVTQS 660
            DPRKRH+QDAALGLH SVRPDNLQK QP DLQASASS+PA QP+HQFSLSESLKP+VTQS
Sbjct: 601  DPRKRHIQDAALGLHSSVRPDNLQKPQPHDLQASASSIPAPQPRHQFSLSESLKPNVTQS 660

Query: 661  ELSSQHAVSIPGTNFGPPSSAGTVPD-LPAEILGEPSTSSLLAAVMKSGIFSNHSMTSSM 720
            ELSSQHAVSIPGT+FGP SSAGTVPD LPAEILGEPSTSSLLAAVMKSG+FSNHS+TSS+
Sbjct: 661  ELSSQHAVSIPGTDFGPSSSAGTVPDRLPAEILGEPSTSSLLAAVMKSGLFSNHSITSSI 720

Query: 721  QQNISFQDVGNMPPRSSIKPPLPSRSSPAKTV------GESSLGPQSLESPSVLVKLSQT 780
            QQNISFQDVGNM PRSSIKPPLPSRSSPA T       GESS+GP SLESPS +VKLS+T
Sbjct: 721  QQNISFQDVGNMKPRSSIKPPLPSRSSPAHTFSEPKIQGESSVGPPSLESPSTMVKLSRT 780

Query: 781  KLEETSLPPDPLPPSSPLNSASTETSNVVNDASSPISNLLSSLVAKGLISASKGELTNSA 840
            K+EE SL  +PLPPSSP+NSASTETSNVVNDASSPISNLLSSLVAKGLISASKGELT S 
Sbjct: 781  KVEEPSLLSNPLPPSSPMNSASTETSNVVNDASSPISNLLSSLVAKGLISASKGELTTSV 840

Query: 841  TSQMPSQPES-KLGDAVTCSLPVPSIPVSSSSLSSMGLESPSKAAAKSSTSPPPSSTTEI 900
            TSQM SQPE+ K  DAVT S+P PSIP SS+S SSM LESPSKAAAKSSTSPPP +TTEI
Sbjct: 841  TSQMLSQPENLKSSDAVTSSVPAPSIPASSASHSSMRLESPSKAAAKSSTSPPPPATTEI 900

Query: 901  NNLIGFEFSSHVIRKFHPSVISGLFDDIPYQCKICGLRLKLEEQLDTHSQWHTISTEANN 960
            NN IGF+FSSHVIRKFHPSVISGLFDDIPYQCKICGLRLK EEQL+THS WHT+  EANN
Sbjct: 901  NNFIGFDFSSHVIRKFHPSVISGLFDDIPYQCKICGLRLKCEEQLETHSHWHTLRAEANN 960

Query: 961  SNRAPRRWYPSSDDWVSGNARLLLDAATSLDKSNMMEEDSEPMVPADEDQFACVLCGELF 1020
            S+ APRRWYPSS DW+SGNARLLLDAATSLD+SNMMEED+EPMVPADEDQFACV+CGELF
Sbjct: 961  SSTAPRRWYPSSGDWISGNARLLLDAATSLDESNMMEEDNEPMVPADEDQFACVICGELF 1020

Query: 1021 EDFYSQELGKWMFKGAMYITIPSAGSKVGSTNEQVARGPIVHTECLTESSVYDLGLATDV 1067
            EDFYS+ELG WMFKGA YIT PS GS++GSTNEQVARGPIVHT CLTESSVYD+GLATD+
Sbjct: 1021 EDFYSRELGNWMFKGATYITSPSVGSEIGSTNEQVARGPIVHTNCLTESSVYDVGLATDI 1080

BLAST of Spg033175 vs. NCBI nr
Match: XP_008442798.1 (PREDICTED: polyadenylation and cleavage factor homolog 4-like isoform X1 [Cucumis melo])

HSP 1 Score: 1793.9 bits (4645), Expect = 0.0e+00
Identity = 937/1084 (86.44%), Positives = 995/1084 (91.79%), Query Frame = 0

Query: 1    MEMESSRRPFDRTREPGLKKPRLADEAERGGNINGRPFPQRPVVSGTNIV-QPRFRASDR 60
            MEMESSRRPFDRTREPGLKKPRLADEA+RG NINGRPFPQRPVVSG NIV QPRFRASDR
Sbjct: 1    MEMESSRRPFDRTREPGLKKPRLADEADRGANINGRPFPQRPVVSGNNIVQQPRFRASDR 60

Query: 61   DSGSSDSGRGGYQPQPPQHQELVSQYRTALAELTFNSKPIITNLTIIAGENLQAAKAISA 120
            DSGSSDSGRGGYQPQPPQHQELVSQYRTALAELTFNSKPIITNLTIIAGENLQAAKAIS 
Sbjct: 61   DSGSSDSGRGGYQPQPPQHQELVSQYRTALAELTFNSKPIITNLTIIAGENLQAAKAIST 120

Query: 121  TVCANILEVSSEQKLPSLYLLDSIVKNIGRDYIKYFAARLPEVFCKAYRQVDPSVHPSMR 180
            T+ ANILEV SEQKLPSLYLLDSIVKNIGRDYIKYFAARLPEVFCKAYRQVDPSVHPSMR
Sbjct: 121  TIYANILEVPSEQKLPSLYLLDSIVKNIGRDYIKYFAARLPEVFCKAYRQVDPSVHPSMR 180

Query: 181  HLFGTWKGVFPPQTLQIIEKELGFMPSSSSSSGTITSKPDLQAQRPA-HSIHVNPKYIER 240
            HLFGTWKGVFP QTLQIIEKELGF+P+ SSSS  I SKPDLQAQRP  HSIHVNPKYIER
Sbjct: 181  HLFGTWKGVFPLQTLQIIEKELGFVPTGSSSSVAINSKPDLQAQRPTPHSIHVNPKYIER 240

Query: 241  QRLQQSGRVKGMTSDATGATTNVTQDVAQAKISTGRPWADAPIKVLDIHRPLRDAPNDMA 300
            QRLQQSGRVKGM +DATG +TNV+QDVAQAKISTGRPWADAPIKVLDI RPLRDAPNDMA
Sbjct: 241  QRLQQSGRVKGMPTDATGGSTNVSQDVAQAKISTGRPWADAPIKVLDIQRPLRDAPNDMA 300

Query: 301  QEKNITAAYADYEYGSDLPRTTGIGRRVVDEGRDKPWSSAGSNLAEKLSGQRNGFNIKLG 360
            QEKN+TA Y+DYEYGSDL RT+ +GRRVVDEGRDKPW SAGSNL+EKLSGQRNGFNIKLG
Sbjct: 301  QEKNVTAGYSDYEYGSDLSRTSSVGRRVVDEGRDKPWPSAGSNLSEKLSGQRNGFNIKLG 360

Query: 361  YENYPAPKSANTGARLLPMQNFSSSSSNRVLSTNWKNSEEEEFMWGEMNSIMTGHGAPAI 420
            YENY APKS NTGARLLP+QNFSSSS+NRVLSTNWKNSEEEEFMWG+M+S++TGHGAPAI
Sbjct: 361  YENYSAPKSTNTGARLLPVQNFSSSSNNRVLSTNWKNSEEEEFMWGDMSSMLTGHGAPAI 420

Query: 421  ASSTGKDQWTPEDSDNSGIENKPLSVRDTGASVDREASSDSQSSEQRELGDSGQQRSSMW 480
             SSTGKDQWTPEDSDNSGI+NK LSVRDTGASVDREASSDSQSSEQRELGDSGQQRSS W
Sbjct: 421  NSSTGKDQWTPEDSDNSGIDNKHLSVRDTGASVDREASSDSQSSEQRELGDSGQQRSSTW 480

Query: 481  QVQEPISLDGLRGGVPRKNSAESGGYGA--------NSSVDQMGGRPQITSSNIGASGHG 540
            Q+QE ISLDGLR GVPRKNS +SGGYGA        NSSVDQMGGRPQIT SNIGASGHG
Sbjct: 481  QLQESISLDGLRAGVPRKNSGQSGGYGATLTALSGTNSSVDQMGGRPQITPSNIGASGHG 540

Query: 541  FLNKGGSGSIGTVGHQRFPSRSVAFPSGQPPLHQRPPSLLLVDHVPHQMHDHKTSSLSNL 600
            FLNKGGSG +G VGHQRFPSRSVAFPSGQPPLHQR PS LLVDHVPHQ+HD KT+S SNL
Sbjct: 541  FLNKGGSGPLGNVGHQRFPSRSVAFPSGQPPLHQRSPSQLLVDHVPHQIHDQKTTSFSNL 600

Query: 601  DPRKRHMQDAALGLHPSVRPDNLQKSQPQDLQASASSVPASQPKHQFSLSESLKPDVTQS 660
            DPRKRHMQDAALGLHPSVRPDN QK Q  DL+A ASS+P SQP+HQFSLSESLKPDVTQS
Sbjct: 601  DPRKRHMQDAALGLHPSVRPDNHQKPQTHDLRALASSIPGSQPRHQFSLSESLKPDVTQS 660

Query: 661  ELSSQHAVSIPGTNFGPPSSAGTVPD-LPAEILGEPSTSSLLAAVMKSGIFSNHSMTSSM 720
            ELSSQ AVSIPGT+FGP SSAGTVPD LPAEILG PSTSSLLAAVMKSG+FSNHS+TS+M
Sbjct: 661  ELSSQLAVSIPGTDFGPSSSAGTVPDRLPAEILGNPSTSSLLAAVMKSGLFSNHSITSNM 720

Query: 721  QQNISFQDVGNMPPRSSIKPPLPSRSSPAKTV------GESSLGPQSLESPSVLVKLSQT 780
            QQN+SFQDVGNM PRSSIKPPLP+RSSPA T       GESS+GP S+ESPS +VKLS+T
Sbjct: 721  QQNLSFQDVGNMKPRSSIKPPLPNRSSPAHTFSEPKIQGESSVGPPSVESPSTMVKLSRT 780

Query: 781  KLEETSLPPDPLPPSSPLNSASTETSNVVNDASSPISNLLSSLVAKGLISASKGELTNSA 840
            K+EE SLP DPLPPSSP++SASTETS+VVNDASSPISNLLSSLVAKGLISASKGE TNS 
Sbjct: 781  KVEEPSLPSDPLPPSSPMDSASTETSHVVNDASSPISNLLSSLVAKGLISASKGESTNSV 840

Query: 841  TSQMPSQPES-KLGDAVTCSLPVPSIPVSSSSLSSMGLESPSKAAAKSSTSPPPSSTTEI 900
            TSQMPSQPE+ K GDAVT S+PVPSI VSSS  SS  LESP KAAAKSSTSPPPS+TTEI
Sbjct: 841  TSQMPSQPENLKSGDAVTSSVPVPSIAVSSSCHSSTKLESPLKAAAKSSTSPPPSATTEI 900

Query: 901  NNLIGFEFSSHVIRKFHPSVISGLFDDIPYQCKICGLRLKLEEQLDTHSQWHTISTEANN 960
            NNLIGFEFSSHVIRKFHPSVISGLFDDIPYQCKICGLRLK EEQLDTHS+WHT+ TEANN
Sbjct: 901  NNLIGFEFSSHVIRKFHPSVISGLFDDIPYQCKICGLRLKCEEQLDTHSRWHTLRTEANN 960

Query: 961  SNRAPRRWYPSSDDWVSGNARLLLDAATSLDKSNMMEEDSEPMVPADEDQFACVLCGELF 1020
            S+ APRRWYP SDDW+SGNAR LLDA TSLD+S++MEED+EPMVPADEDQFACV+CGELF
Sbjct: 961  SSTAPRRWYPCSDDWISGNARFLLDAETSLDESDLMEEDNEPMVPADEDQFACVICGELF 1020

Query: 1021 EDFYSQELGKWMFKGAMYITIPSAGSKVGSTNEQVARGPIVHTECLTESSVYDLGLATDV 1067
            EDFYSQELG WM+KGA YITIPS GS+VG TNEQVA+GPIVHT CLTESSVYD+GLATD+
Sbjct: 1021 EDFYSQELGNWMYKGATYITIPSVGSEVGGTNEQVAKGPIVHTTCLTESSVYDVGLATDI 1080

BLAST of Spg033175 vs. NCBI nr
Match: XP_011651991.1 (polyadenylation and cleavage factor homolog 4 isoform X1 [Cucumis sativus])

HSP 1 Score: 1789.2 bits (4633), Expect = 0.0e+00
Identity = 933/1084 (86.07%), Positives = 991/1084 (91.42%), Query Frame = 0

Query: 1    MEMESSRRPFDRTREPGLKKPRLADEAERGGNINGRPFPQRPVVSGTNIV-QPRFRASDR 60
            MEMESSRRPFDRTREPGLKKPRLADEA+RG NINGRPFPQRPVVSG NIV QPRFR SDR
Sbjct: 1    MEMESSRRPFDRTREPGLKKPRLADEADRGANINGRPFPQRPVVSGNNIVQQPRFRPSDR 60

Query: 61   DSGSSDSGRGGYQPQPPQHQELVSQYRTALAELTFNSKPIITNLTIIAGENLQAAKAISA 120
            DSGSSDSGRGGYQPQPPQHQELVSQYRTALAELTFNSKPIITNLTIIAGENLQAAKAI++
Sbjct: 61   DSGSSDSGRGGYQPQPPQHQELVSQYRTALAELTFNSKPIITNLTIIAGENLQAAKAIAS 120

Query: 121  TVCANILEVSSEQKLPSLYLLDSIVKNIGRDYIKYFAARLPEVFCKAYRQVDPSVHPSMR 180
            T+ ANILEV SEQKLPSLYLLDSIVKNIGRDYIKYFAARLPEVFCKAYRQVDPSVHPSMR
Sbjct: 121  TIYANILEVPSEQKLPSLYLLDSIVKNIGRDYIKYFAARLPEVFCKAYRQVDPSVHPSMR 180

Query: 181  HLFGTWKGVFPPQTLQIIEKELGFMPSSSSSSGTITSKPDLQAQRPA-HSIHVNPKYIER 240
            HLFGTWKGVFP QTLQIIEKELGF+PS SSSS  ITSKPDLQAQRP  HSIHVNPKYIER
Sbjct: 181  HLFGTWKGVFPLQTLQIIEKELGFVPSGSSSSVAITSKPDLQAQRPTPHSIHVNPKYIER 240

Query: 241  QRLQQSGRVKGMTSDATGATTNVTQDVAQAKISTGRPWADAPIKVLDIHRPLRDAPNDMA 300
            QRLQQSGRVKG+ +DATG TTNV+QDVAQAK+STGRPWADAPIKVLDI RPLRDA NDMA
Sbjct: 241  QRLQQSGRVKGIATDATGGTTNVSQDVAQAKMSTGRPWADAPIKVLDIQRPLRDAQNDMA 300

Query: 301  QEKNITAAYADYEYGSDLPRTTGIGRRVVDEGRDKPWSSAGSNLAEKLSGQRNGFNIKLG 360
            QEKN+TA Y+DY+YGSDL RT+ +GRRVVDEGRDKPWSSAGSNL+EKLSGQRNGFN+KLG
Sbjct: 301  QEKNVTAGYSDYDYGSDLSRTSSVGRRVVDEGRDKPWSSAGSNLSEKLSGQRNGFNMKLG 360

Query: 361  YENYPAPKSANTGARLLPMQNFSSSSSNRVLSTNWKNSEEEEFMWGEMNSIMTGHGAPAI 420
            YENY APKS NTGARLLP+QNFSSSS+NRVLSTNWKNSEEEEFMWG+MNS++T HGAP I
Sbjct: 361  YENYSAPKSTNTGARLLPVQNFSSSSNNRVLSTNWKNSEEEEFMWGDMNSMLTSHGAPGI 420

Query: 421  ASSTGKDQWTPEDSDNSGIENKPLSVRDTGASVDREASSDSQSSEQRELGDSGQQRSSMW 480
             SS GKDQWTPEDSDNSGI+NK +SVRDTGASVDREASSDSQSSEQRELGDSGQQRSS W
Sbjct: 421  TSSAGKDQWTPEDSDNSGIDNKHVSVRDTGASVDREASSDSQSSEQRELGDSGQQRSSTW 480

Query: 481  QVQEPISLDGLRGGVPRKNSAESGGYGA--------NSSVDQMGGRPQITSSNIGASGHG 540
            Q+QE I LDGLRGGVPRKNS +SGGYGA        NSSVDQMGGRPQIT S+IGASGHG
Sbjct: 481  QLQESIPLDGLRGGVPRKNSGQSGGYGATLTSLSGTNSSVDQMGGRPQITPSSIGASGHG 540

Query: 541  FLNKGGSGSIGTVGHQRFPSRSVAFPSGQPPLHQRPPSLLLVDHVPHQMHDHKTSSLSNL 600
            FLNKG SG +G VGHQRFPSRSVAFPSGQPPLHQR  S LLVDHVPHQ+HDHKT+S SNL
Sbjct: 541  FLNKGSSGPLGNVGHQRFPSRSVAFPSGQPPLHQRSSSQLLVDHVPHQVHDHKTTSFSNL 600

Query: 601  DPRKRHMQDAALGLHPSVRPDNLQKSQPQDLQASASSVPASQPKHQFSLSESLKPDVTQS 660
            DPRKRHMQDAALGLHPSVRPD+LQK QP DLQA ASS+P SQP+HQFSLSESLKPD+TQS
Sbjct: 601  DPRKRHMQDAALGLHPSVRPDSLQKPQPHDLQALASSIPGSQPRHQFSLSESLKPDITQS 660

Query: 661  ELSSQHAVSIPGTNFGPPSSAGTVPD-LPAEILGEPSTSSLLAAVMKSGIFSNHSMTSSM 720
            ELSSQ A  IPGT+FGP SSAGTVPD LPAEILG PSTSSLLAAVMKSG+FSNHS+TS+M
Sbjct: 661  ELSSQLAAPIPGTDFGPSSSAGTVPDRLPAEILGNPSTSSLLAAVMKSGLFSNHSITSNM 720

Query: 721  QQNISFQDVGNMPPRSSIKPPLPSRSSPAKTV------GESSLGPQSLESPSVLVKLSQT 780
            QQNISFQDVGNM PRSSIKPPLPSRSSPA T       GESS+GP SLESPS +VKLSQT
Sbjct: 721  QQNISFQDVGNMKPRSSIKPPLPSRSSPAHTFSEPKIQGESSVGPPSLESPSTMVKLSQT 780

Query: 781  KLEETSLPPDPLPPSSPLNSASTETSNVVNDASSPISNLLSSLVAKGLISASKGELTNSA 840
            K+EE SLP DPLPPSSP++SASTETSNVVNDASSPISNLLSSLVAKGLISASKGE TNS 
Sbjct: 781  KVEEPSLPSDPLPPSSPMDSASTETSNVVNDASSPISNLLSSLVAKGLISASKGESTNSV 840

Query: 841  TSQMPSQPES-KLGDAVTCSLPVPSIPVSSSSLSSMGLESPSKAAAKSSTSPPPSSTTEI 900
            TSQMPSQPE  K GDAVT S+PVPSIP+SSS  S   LESPSKAAAK STSPPPS+TTEI
Sbjct: 841  TSQMPSQPEKLKSGDAVTSSVPVPSIPISSSCHSPTKLESPSKAAAKISTSPPPSATTEI 900

Query: 901  NNLIGFEFSSHVIRKFHPSVISGLFDDIPYQCKICGLRLKLEEQLDTHSQWHTISTEANN 960
            NNLIGFEFSSHVIRKFHPSVISGLF+DIPYQCKICGLRLK EE LD HS+WHT+ TEANN
Sbjct: 901  NNLIGFEFSSHVIRKFHPSVISGLFEDIPYQCKICGLRLKCEEHLDIHSRWHTLRTEANN 960

Query: 961  SNRAPRRWYPSSDDWVSGNARLLLDAATSLDKSNMMEEDSEPMVPADEDQFACVLCGELF 1020
            S+ APRRWYPSSDDW+SGNAR LLDA TSLD+S++MEED+EPMVPADEDQFACV+CGELF
Sbjct: 961  SSGAPRRWYPSSDDWISGNARFLLDAVTSLDESDLMEEDNEPMVPADEDQFACVICGELF 1020

Query: 1021 EDFYSQELGKWMFKGAMYITIPSAGSKVGSTNEQVARGPIVHTECLTESSVYDLGLATDV 1067
            ED YSQELG WMFKGAMYITIPS GS+VGSTNEQVARGPIVHT CLTESSVYD+GLATD+
Sbjct: 1021 EDSYSQELGDWMFKGAMYITIPSVGSEVGSTNEQVARGPIVHTACLTESSVYDVGLATDI 1080

BLAST of Spg033175 vs. NCBI nr
Match: XP_008442799.1 (PREDICTED: polyadenylation and cleavage factor homolog 4-like isoform X2 [Cucumis melo])

HSP 1 Score: 1785.8 bits (4624), Expect = 0.0e+00
Identity = 935/1084 (86.25%), Positives = 993/1084 (91.61%), Query Frame = 0

Query: 1    MEMESSRRPFDRTREPGLKKPRLADEAERGGNINGRPFPQRPVVSGTNIV-QPRFRASDR 60
            MEMESSRRPFDRTREPGLKKPRLADEA+RG NINGRPFPQRPVVSG NIV QPRFRASDR
Sbjct: 1    MEMESSRRPFDRTREPGLKKPRLADEADRGANINGRPFPQRPVVSGNNIVQQPRFRASDR 60

Query: 61   DSGSSDSGRGGYQPQPPQHQELVSQYRTALAELTFNSKPIITNLTIIAGENLQAAKAISA 120
            DSGSSDSGRGGYQPQPPQHQELVSQYRTALAELTFNSKPIITNLTIIAGENLQAAKAIS 
Sbjct: 61   DSGSSDSGRGGYQPQPPQHQELVSQYRTALAELTFNSKPIITNLTIIAGENLQAAKAIST 120

Query: 121  TVCANILEVSSEQKLPSLYLLDSIVKNIGRDYIKYFAARLPEVFCKAYRQVDPSVHPSMR 180
            T+ ANILEV SEQKLPSLYLLDSIVKNIGRDYIKYFAARLPEVFCKAYRQVDPSVHPSMR
Sbjct: 121  TIYANILEVPSEQKLPSLYLLDSIVKNIGRDYIKYFAARLPEVFCKAYRQVDPSVHPSMR 180

Query: 181  HLFGTWKGVFPPQTLQIIEKELGFMPSSSSSSGTITSKPDLQAQRPA-HSIHVNPKYIER 240
            HLFGTWKGVFP QTLQIIEKELGF+P+ SSSS  I SKPDLQAQRP  HSIHVNPKYIER
Sbjct: 181  HLFGTWKGVFPLQTLQIIEKELGFVPTGSSSSVAINSKPDLQAQRPTPHSIHVNPKYIER 240

Query: 241  QRLQQSGRVKGMTSDATGATTNVTQDVAQAKISTGRPWADAPIKVLDIHRPLRDAPNDMA 300
            QRLQQSGRVKGM +DATG +TNV+QDVAQAKISTGRPWADAPIK  DI RPLRDAPNDMA
Sbjct: 241  QRLQQSGRVKGMPTDATGGSTNVSQDVAQAKISTGRPWADAPIK--DIQRPLRDAPNDMA 300

Query: 301  QEKNITAAYADYEYGSDLPRTTGIGRRVVDEGRDKPWSSAGSNLAEKLSGQRNGFNIKLG 360
            QEKN+TA Y+DYEYGSDL RT+ +GRRVVDEGRDKPW SAGSNL+EKLSGQRNGFNIKLG
Sbjct: 301  QEKNVTAGYSDYEYGSDLSRTSSVGRRVVDEGRDKPWPSAGSNLSEKLSGQRNGFNIKLG 360

Query: 361  YENYPAPKSANTGARLLPMQNFSSSSSNRVLSTNWKNSEEEEFMWGEMNSIMTGHGAPAI 420
            YENY APKS NTGARLLP+QNFSSSS+NRVLSTNWKNSEEEEFMWG+M+S++TGHGAPAI
Sbjct: 361  YENYSAPKSTNTGARLLPVQNFSSSSNNRVLSTNWKNSEEEEFMWGDMSSMLTGHGAPAI 420

Query: 421  ASSTGKDQWTPEDSDNSGIENKPLSVRDTGASVDREASSDSQSSEQRELGDSGQQRSSMW 480
             SSTGKDQWTPEDSDNSGI+NK LSVRDTGASVDREASSDSQSSEQRELGDSGQQRSS W
Sbjct: 421  NSSTGKDQWTPEDSDNSGIDNKHLSVRDTGASVDREASSDSQSSEQRELGDSGQQRSSTW 480

Query: 481  QVQEPISLDGLRGGVPRKNSAESGGYGA--------NSSVDQMGGRPQITSSNIGASGHG 540
            Q+QE ISLDGLR GVPRKNS +SGGYGA        NSSVDQMGGRPQIT SNIGASGHG
Sbjct: 481  QLQESISLDGLRAGVPRKNSGQSGGYGATLTALSGTNSSVDQMGGRPQITPSNIGASGHG 540

Query: 541  FLNKGGSGSIGTVGHQRFPSRSVAFPSGQPPLHQRPPSLLLVDHVPHQMHDHKTSSLSNL 600
            FLNKGGSG +G VGHQRFPSRSVAFPSGQPPLHQR PS LLVDHVPHQ+HD KT+S SNL
Sbjct: 541  FLNKGGSGPLGNVGHQRFPSRSVAFPSGQPPLHQRSPSQLLVDHVPHQIHDQKTTSFSNL 600

Query: 601  DPRKRHMQDAALGLHPSVRPDNLQKSQPQDLQASASSVPASQPKHQFSLSESLKPDVTQS 660
            DPRKRHMQDAALGLHPSVRPDN QK Q  DL+A ASS+P SQP+HQFSLSESLKPDVTQS
Sbjct: 601  DPRKRHMQDAALGLHPSVRPDNHQKPQTHDLRALASSIPGSQPRHQFSLSESLKPDVTQS 660

Query: 661  ELSSQHAVSIPGTNFGPPSSAGTVPD-LPAEILGEPSTSSLLAAVMKSGIFSNHSMTSSM 720
            ELSSQ AVSIPGT+FGP SSAGTVPD LPAEILG PSTSSLLAAVMKSG+FSNHS+TS+M
Sbjct: 661  ELSSQLAVSIPGTDFGPSSSAGTVPDRLPAEILGNPSTSSLLAAVMKSGLFSNHSITSNM 720

Query: 721  QQNISFQDVGNMPPRSSIKPPLPSRSSPAKTV------GESSLGPQSLESPSVLVKLSQT 780
            QQN+SFQDVGNM PRSSIKPPLP+RSSPA T       GESS+GP S+ESPS +VKLS+T
Sbjct: 721  QQNLSFQDVGNMKPRSSIKPPLPNRSSPAHTFSEPKIQGESSVGPPSVESPSTMVKLSRT 780

Query: 781  KLEETSLPPDPLPPSSPLNSASTETSNVVNDASSPISNLLSSLVAKGLISASKGELTNSA 840
            K+EE SLP DPLPPSSP++SASTETS+VVNDASSPISNLLSSLVAKGLISASKGE TNS 
Sbjct: 781  KVEEPSLPSDPLPPSSPMDSASTETSHVVNDASSPISNLLSSLVAKGLISASKGESTNSV 840

Query: 841  TSQMPSQPES-KLGDAVTCSLPVPSIPVSSSSLSSMGLESPSKAAAKSSTSPPPSSTTEI 900
            TSQMPSQPE+ K GDAVT S+PVPSI VSSS  SS  LESP KAAAKSSTSPPPS+TTEI
Sbjct: 841  TSQMPSQPENLKSGDAVTSSVPVPSIAVSSSCHSSTKLESPLKAAAKSSTSPPPSATTEI 900

Query: 901  NNLIGFEFSSHVIRKFHPSVISGLFDDIPYQCKICGLRLKLEEQLDTHSQWHTISTEANN 960
            NNLIGFEFSSHVIRKFHPSVISGLFDDIPYQCKICGLRLK EEQLDTHS+WHT+ TEANN
Sbjct: 901  NNLIGFEFSSHVIRKFHPSVISGLFDDIPYQCKICGLRLKCEEQLDTHSRWHTLRTEANN 960

Query: 961  SNRAPRRWYPSSDDWVSGNARLLLDAATSLDKSNMMEEDSEPMVPADEDQFACVLCGELF 1020
            S+ APRRWYP SDDW+SGNAR LLDA TSLD+S++MEED+EPMVPADEDQFACV+CGELF
Sbjct: 961  SSTAPRRWYPCSDDWISGNARFLLDAETSLDESDLMEEDNEPMVPADEDQFACVICGELF 1020

Query: 1021 EDFYSQELGKWMFKGAMYITIPSAGSKVGSTNEQVARGPIVHTECLTESSVYDLGLATDV 1067
            EDFYSQELG WM+KGA YITIPS GS+VG TNEQVA+GPIVHT CLTESSVYD+GLATD+
Sbjct: 1021 EDFYSQELGNWMYKGATYITIPSVGSEVGGTNEQVAKGPIVHTTCLTESSVYDVGLATDI 1080

BLAST of Spg033175 vs. ExPASy Swiss-Prot
Match: Q0WPF2 (Polyadenylation and cleavage factor homolog 4 OS=Arabidopsis thaliana OX=3702 GN=PCFS4 PE=1 SV=1)

HSP 1 Score: 245.4 bits (625), Expect = 3.3e-63
Identity = 274/1002 (27.35%), Positives = 394/1002 (39.32%), Query Frame = 0

Query: 65   DSGRGGYQPQPPQHQELVSQYRTALAELTFNSKPIITNLTIIAGENLQAAKAISATVCAN 124
            D   GG +  PP   E+V  Y   L ELTFNSKPIIT+LTIIAGE  +  + I+  +C  
Sbjct: 49   DEFGGGEEVLPPSMDEIVQLYEVVLGELTFNSKPIITDLTIIAGEQREHGEGIANAICTR 108

Query: 125  ILEVSSEQKLPSLYLLDSIVKNIGRDYIKYFAARLPEVFCKAYRQVDPSVHPSMRHLFGT 184
            ILE   EQKLPSLYLLDSIVKNIGRDY +YF++RLPEVFC AYRQ  PS+HPSMRHLFGT
Sbjct: 109  ILEAPVEQKLPSLYLLDSIVKNIGRDYGRYFSSRLPEVFCLAYRQAHPSLHPSMRHLFGT 168

Query: 185  WKGVFPPQTLQIIEKELGFMPSSSSSSGTITSKPDLQAQRPAHSIHVNPKYIER-QRLQQ 244
            W  VFPP  L+ I+ +L  + S+++ S    S+P     +P   IHVNPKY+ R +    
Sbjct: 169  WSSVFPPPVLRKIDMQLQ-LSSAANQSSVGASEP----SQPTRGIHVNPKYLRRLEPSAA 228

Query: 245  SGRVKGMTSDATGATTNVTQDVAQAKISTGRPWADAPIKVLDIHRPLRDAPNDMAQEKNI 304
               ++G+ S A        +   Q  +     + D     L+    L   P+   +  N 
Sbjct: 229  ENNLRGINSSA--------RVYGQNSLGGYNDFEDQ----LESPSSLSSTPDGFTRRSN- 288

Query: 305  TAAYADYEYGSDLPRTTGIGRRVVDEGRDKPWSSAGSNLAEKLSGQRNGFNIKLGYENYP 364
                 D    S+     G+GR    +     W     NL +    +R    I    + Y 
Sbjct: 289  -----DGANPSNQAFNYGMGRATSRDDEHMEWRRK-ENLGQGNDHERPRALI----DAYG 348

Query: 365  APKSANTGARLLPMQNFSSSSSNRVLSTNWKNSEEEEFMWGEMNSIMTGHGAPAIASSTG 424
               S +      P+++ +   S  V  T W+N+EEEEF W +M        +P +  S  
Sbjct: 349  VDTSKHVTIN-KPIRDMNGMHSKMV--TPWQNTEEEEFDWEDM--------SPTLDRSRA 408

Query: 425  KDQWTPEDSDNSGIENKPLSVRDTGASVDREASSDSQSSEQRELGDSGQQRSSMWQVQEP 484
             +           +  +P      G + D    SD ++    +L                
Sbjct: 409  GEFLRSSVPALGSVRARP----RVGNTSDFHLDSDIKNGVSHQL---------------- 468

Query: 485  ISLDGLRGGVPRKNSAESGGYGANSS-VDQMGGRP-QITSSNIG-ASGHGFLNKGGSGSI 544
                       R+N + S  Y   S+ VD   G+  ++ +S++G  S +         SI
Sbjct: 469  -----------RENWSLSQNYPHTSNRVDTRAGKDLKVLASSVGLVSSNSEFGAPPFDSI 528

Query: 545  GTVGHQRFPSRSVAFPSGQ-PPLHQRPPSLLLVDHVPHQMHDHKTSSLSNLDPRKRHMQD 604
              V + RF     A P G  P L  R P+ L V                           
Sbjct: 529  QDV-NSRF---GRALPDGTWPHLSARGPNSLPV--------------------------- 588

Query: 605  AALGLHPSVRPDNLQKSQPQDLQASASSVPASQPKHQFSLSESLKPDVTQSELSSQHAVS 664
             +  LH    P N   ++ Q         P  +P++Q          V+QS L+     +
Sbjct: 589  PSAHLHHLANPGNAMSNRLQ-------GKPLYRPENQ----------VSQSHLNDMTQQN 648

Query: 665  IPGTNFGPPSSAGTVPDLPAEILGEPSTSSLLAAVMKSGIFSNHSMTSSMQQNISFQDVG 724
                N+ P SSA                                     MQ  ++    G
Sbjct: 649  QMLVNYLPSSSA--------------------------------MAPRPMQSLLTHVSHG 708

Query: 725  NMPPRSSIKPPLPSRSSPAKTVGESSLGPQSLESPSVLVKLSQTKLEETSLPPDPLPPSS 784
              P  S+I+P L                  S++    +  LS   L +      P     
Sbjct: 709  YPPHGSTIRPSL------------------SIQGGEAMHPLSSGVLSQIGASNQP----- 768

Query: 785  PLNSASTETSNVVNDASSPISNLLSSLVAKGLISASKGELTNSATSQMPSQPESKLGDAV 844
                                S L+ SL+A+GLIS     L N    Q P           
Sbjct: 769  ---------------PGGAFSGLIGSLMAQGLIS-----LNNQPAGQGP----------- 797

Query: 845  TCSLPVPSIPVSSSSLSSMGLESPSKAAAKSSTSPPPSSTTEINNLIGFEFSSHVIRKFH 904
                                                          +G EF + +++  +
Sbjct: 829  ----------------------------------------------LGLEFDADMLKIRN 797

Query: 905  PSVISGLFDDIPYQCKICGLRLKLEEQLDTHSQWHTIST--EANNSNRAPRRWYPSSDDW 964
             S IS L+ D+P QC  CGLR K +E+   H  WH        N+     R+W+ S+  W
Sbjct: 889  ESAISALYGDLPRQCTTCGLRFKCQEEHSKHMDWHVTKNRMSKNHKQNPSRKWFVSASMW 797

Query: 965  VSGNARLLLDAATSL---DKSNMMEEDSEPMVPADEDQFACVLCGELFEDFYSQELGKWM 1024
            +SG   L  +A       + +   ++D +  VPADEDQ +C LCGE FEDFYS E  +WM
Sbjct: 949  LSGAEALGAEAVPGFLPTEPTTEKKDDEDMAVPADEDQTSCALCGEPFEDFYSDETEEWM 797

Query: 1025 FKGAMYITIPSAGSKVGSTNEQVARGPIVHTECLTESSVYDL 1057
            +KGA+Y+  P   +   +  ++   GPIVH +C  ES+  D+
Sbjct: 1009 YKGAVYMNAPEEST---TDMDKSQLGPIVHAKCRPESNGGDM 797

BLAST of Spg033175 vs. ExPASy Swiss-Prot
Match: O94913 (Pre-mRNA cleavage complex 2 protein Pcf11 OS=Homo sapiens OX=9606 GN=PCF11 PE=1 SV=3)

HSP 1 Score: 100.9 bits (250), Expect = 1.0e-19
Identity = 59/160 (36.88%), Positives = 88/160 (55.00%), Query Frame = 0

Query: 79  QELVSQYRTALAELTFNSKPIITNLTIIAGENLQAAKAISATVCANILEVSSEQKLPSLY 138
           ++    Y+++L +LTFNSKP I  LTI+A ENL  AK I + + A   +  S +KLP +Y
Sbjct: 16  EDACRDYQSSLEDLTFNSKPHINMLTILAEENLPFAKEIVSLIEAQTAKAPSSEKLPVMY 75

Query: 139 LLDSIVKNIGRDYIKYFAARLPEVFCKAYRQVDPSVHPSMRHLFGTWKGVFPPQTLQIIE 198
           L+DSIVKN+GR+Y+  F   L   F   + +VD +   S+  L  TW  +FP + L  ++
Sbjct: 76  LMDSIVKNVGREYLTAFTKNLVATFICVFEKVDENTRKSLFKLRSTWDEIFPLKKLYALD 135

Query: 199 KELGFMPSSSSSSGTITSKPDLQAQRPAHSIHVNPKYIER 239
             +      +S       KP L       SIHVNPK++ +
Sbjct: 136 VRV------NSLDPAWPIKP-LPPNVNTSSIHVNPKFLNK 168

BLAST of Spg033175 vs. ExPASy Swiss-Prot
Match: Q9FIX8 (Polyadenylation and cleavage factor homolog 5 OS=Arabidopsis thaliana OX=3702 GN=PCFS5 PE=1 SV=1)

HSP 1 Score: 95.9 bits (237), Expect = 3.2e-18
Identity = 54/165 (32.73%), Positives = 75/165 (45.45%), Query Frame = 0

Query: 899  HPSVISGLFDDIPYQCKICGLRLKLEEQLDTHSQWHT-------ISTEANNSNRAPRRWY 958
            H SVI  L+ D+P QC  CG+R K +E+   H  WH         +T      +  R W 
Sbjct: 234  HESVIKSLYSDMPRQCTSCGVRFKCQEEHSKHMDWHVRKNRSVKTTTRLGQQPKKSRGWL 293

Query: 959  PSSDDWV-----SGNARLLLDAATSLDKSNMMEE-DSEPMVPADEDQFACVLCGELFEDF 1018
             S+  W+      G   +       + K N  ++   + MVPADEDQ  C LC E FE+F
Sbjct: 294  ASASLWLCAPTGGGTVEVASFGGGEMQKKNEKDQVQKQHMVPADEDQKNCALCVEPFEEF 353

Query: 1019 YSQELGKWMFKGAMYITIPSAGSKVGSTNEQVARGPIVHTECLTE 1051
            +S E   WM+K A+Y+T                 G IVH +C+ E
Sbjct: 354  FSHEADDWMYKDAVYLT---------------KNGRIVHVKCMPE 383

BLAST of Spg033175 vs. ExPASy Swiss-Prot
Match: Q9C710 (Polyadenylation and cleavage factor homolog 1 OS=Arabidopsis thaliana OX=3702 GN=PCFS1 PE=1 SV=1)

HSP 1 Score: 95.1 bits (235), Expect = 5.5e-18
Identity = 55/164 (33.54%), Positives = 72/164 (43.90%), Query Frame = 0

Query: 899  HPSVISGLFDDIPYQCKICGLRLKLEEQLDTHSQWHT-------ISTEANNSNRAPRRWY 958
            H SVI  L+ D+P QC  CGLR K +E+   H  WH         +T      +  R W 
Sbjct: 241  HESVIKSLYSDMPRQCSSCGLRFKCQEEHSKHMDWHVRKNRSVKTTTRLGQQPKKSRGWL 300

Query: 959  PSSDDWVSG-----NARLLLDAATSLDKSNMMEEDSEPMVPADEDQFACVLCGELFEDFY 1018
             S+  W+          +         K    EE  + MVPADEDQ  C LC E FE+F+
Sbjct: 301  ASASLWLCAATGGETVEVASFGGEMQKKKGKDEEPKQLMVPADEDQKNCALCVEPFEEFF 360

Query: 1019 SQELGKWMFKGAMYITIPSAGSKVGSTNEQVARGPIVHTECLTE 1051
            S E   WM+K A+Y+T                 G IVH +C+ E
Sbjct: 361  SHEDDDWMYKDAVYLT---------------KNGRIVHVKCMPE 389

BLAST of Spg033175 vs. ExPASy Swiss-Prot
Match: Q10237 (Uncharacterized protein C4G9.04c OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=SPAC4G9.04c PE=4 SV=1)

HSP 1 Score: 81.6 bits (200), Expect = 6.3e-14
Identity = 46/102 (45.10%), Positives = 58/102 (56.86%), Query Frame = 0

Query: 85  YRTALAELTFNSKPIITNLTIIAGENLQAAKAISATVCANILEVSSEQKLPSLYLLDSIV 144
           Y +AL +LTFNSKPII  LT IA EN   A +I   +  +I +     KLP+LYLLDSI 
Sbjct: 8   YLSALEDLTFNSKPIIHTLTYIAQENEPYAISIVNAIEKHIQKCPPNCKLPALYLLDSIS 67

Query: 145 KNIGRDYIKYFAARLPEVFCKAYRQVDPSVHPSMRHLFGTWK 187
           KN+G  Y  +F   L   F  AY  V+P +   +  L  TWK
Sbjct: 68  KNLGAPYTYFFGLHLFSTFMSAYTVVEPRLRLKLDQLLATWK 109

BLAST of Spg033175 vs. ExPASy TrEMBL
Match: A0A5A7TQ23 (Polyadenylation and cleavage factor-like protein 4-like isoform X1 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold236G002570 PE=4 SV=1)

HSP 1 Score: 1831.6 bits (4743), Expect = 0.0e+00
Identity = 963/1154 (83.45%), Positives = 1028/1154 (89.08%), Query Frame = 0

Query: 1    MEMESSRRPFDRTREPGLKKPRLADEAERGGNINGRPFPQRPVVSGTNIV-QPRFRASDR 60
            MEMESSRRPFDRTREPGLKKPRLADEA+RG NINGRPFPQRPVVSG NIV QPRFRASDR
Sbjct: 1    MEMESSRRPFDRTREPGLKKPRLADEADRGANINGRPFPQRPVVSGNNIVQQPRFRASDR 60

Query: 61   DSGSSDSGRGGYQPQPPQHQELVSQYRTALAELTFNSKPIITNLTIIAGENLQAAKAISA 120
            DSGSSDSGRGGYQPQPPQHQELVSQYRTALAELTFNSKPIITNLTIIAGENLQAAKAIS 
Sbjct: 61   DSGSSDSGRGGYQPQPPQHQELVSQYRTALAELTFNSKPIITNLTIIAGENLQAAKAIST 120

Query: 121  TVCANILEVSSEQKLPSLYLLDSIVKNIGRDYIKYFAARLPEVFCKAYRQVDPSVHPSMR 180
            T+ ANILEV SEQKLPSLYLLDSIVKNIGRDYIKYFAARLPEVFCKAYRQVDPSVHPSMR
Sbjct: 121  TIYANILEVPSEQKLPSLYLLDSIVKNIGRDYIKYFAARLPEVFCKAYRQVDPSVHPSMR 180

Query: 181  HLFGTWKGVFPPQTLQIIEKELGFMPSSSSSSGTITSKPDLQAQRPA-HSIHVNPKYIER 240
            HLFGTWKGVFP QTLQIIEKELGF+P+ SSSS  I SKPDLQAQRP  HSIHVNPKYIER
Sbjct: 181  HLFGTWKGVFPLQTLQIIEKELGFVPTGSSSSVAINSKPDLQAQRPTPHSIHVNPKYIER 240

Query: 241  QRLQQSGRVKGMTSDATGATTNVTQDVAQAKISTGRPWADAPIKVLDIHRPLRDAPNDMA 300
            QRLQQSGRVKGM +DATG +TNV+QDVAQAKISTGRPWADAPIKVLDI RPLRDAPNDMA
Sbjct: 241  QRLQQSGRVKGMPTDATGGSTNVSQDVAQAKISTGRPWADAPIKVLDIQRPLRDAPNDMA 300

Query: 301  QEKNITAAYADYEYGSDLPRTTGIGRRVVDEGRDKPWSSAGSNLAEKLSGQRNGFNIKLG 360
            QEKN+TA Y+DYEYGSDL RT+ +GRRVVDEGRDKPW SAGSNL+EKLSGQRNGFNIKLG
Sbjct: 301  QEKNVTAGYSDYEYGSDLSRTSSVGRRVVDEGRDKPWPSAGSNLSEKLSGQRNGFNIKLG 360

Query: 361  YENYPAPKSANTGARLLPMQNFSSSSSNRVLSTNWKNSEEEEFMWGEMNSIMTGHGAPAI 420
            YENY APKS NTGARLLP+QNFSSSS+NRVLSTNWKNSEEEEFMWG+M+S++TGHGAPAI
Sbjct: 361  YENYSAPKSTNTGARLLPVQNFSSSSNNRVLSTNWKNSEEEEFMWGDMSSMLTGHGAPAI 420

Query: 421  ASSTGKDQWTPEDSDNSGIENKPLSVRDTGASVDREASSDSQSSEQRELGDSGQQRSSMW 480
             SSTGKDQWTPEDSDNSGI+NK LSVRDTGASVDREASSDSQSSEQRELGDSGQQRSS W
Sbjct: 421  NSSTGKDQWTPEDSDNSGIDNKHLSVRDTGASVDREASSDSQSSEQRELGDSGQQRSSTW 480

Query: 481  QVQEPISLDGLRGGVPRKNSAESGGYGA--------NSSVDQMGGRPQITSSNIGASGHG 540
            Q+QE ISLDGLR GVPRKNS +SGGYGA        NSSVDQMGGRPQIT SNIGASGHG
Sbjct: 481  QLQESISLDGLRAGVPRKNSGQSGGYGATLTALSGTNSSVDQMGGRPQITPSNIGASGHG 540

Query: 541  FLNKGGSGSIGTVGHQRFPSRSVAFPSGQPPLHQRPPSLLLVDHVPHQMHDHKTSSLSNL 600
            FLNKGGSG +G VGHQRFPSRSVAFPSGQPPLHQR PS LLVDHVPHQ+HD KT+S SNL
Sbjct: 541  FLNKGGSGPLGNVGHQRFPSRSVAFPSGQPPLHQRSPSQLLVDHVPHQIHDQKTTSFSNL 600

Query: 601  DPRKRHMQDAALGLHPSVRPDNLQKSQPQDLQASASSVPASQPKHQFSLSESLKPDVTQS 660
            DPRKRHMQDAALGLHPSVRPDN QK Q  DL+A ASS+P SQP+HQFSLSESLKPDVTQS
Sbjct: 601  DPRKRHMQDAALGLHPSVRPDNHQKPQTHDLRALASSIPGSQPRHQFSLSESLKPDVTQS 660

Query: 661  ELSSQHAVSIPGTNFGPPSSAGTVPD-LPAEILGEPSTSSLLAAVMKSGIFSNHSMTSSM 720
            ELSSQ AVSIPGT+FGP SSAGTVPD LPAEILG PSTSSLLAAVMKSG+FSNHS+TS+M
Sbjct: 661  ELSSQLAVSIPGTDFGPSSSAGTVPDRLPAEILGNPSTSSLLAAVMKSGLFSNHSITSNM 720

Query: 721  QQNISFQDVGNMPPRSSIKPPLPSRSSPAKTV------GESSLGPQSLESPSVLVKLSQT 780
            QQN+SFQDVGNM PRSSIKPPLP+RSSPA T       GESS+GP S+ESPS +VKLS+T
Sbjct: 721  QQNLSFQDVGNMKPRSSIKPPLPNRSSPAHTFSEPKIQGESSVGPPSVESPSTMVKLSRT 780

Query: 781  KLEETSLPPDPLPPSSPLNSASTETSNVVNDASSPISNLLSSLVAKGLISASKGELTNSA 840
            K+EE SLP DPLPPSSP++SASTETS+VVNDASSPISNLLSSLVAKGLISASKGE TNS 
Sbjct: 781  KVEEPSLPSDPLPPSSPMDSASTETSHVVNDASSPISNLLSSLVAKGLISASKGESTNSV 840

Query: 841  TSQMPSQPES-KLGDAVTCSLPVPSIPVSSSSLSSMGLESPSKAAAKSSTSPPPSSTTEI 900
            TSQMPSQPE+ K GDAVT S+PVPSI VSSS  SS  LESP KAAAKSSTSPPPS+TTEI
Sbjct: 841  TSQMPSQPENLKSGDAVTSSVPVPSIAVSSSCHSSTKLESPLKAAAKSSTSPPPSATTEI 900

Query: 901  NNLIGFEFSSHVIRKFHPSVISGLFDDIPYQCKICGLRLKLEEQLDTHSQWHTISTEANN 960
            NNLIGFEFSSHVIRKFHPSVISGLFDDIPYQCKICGLRLK EEQLDTHS+WHT+ TEANN
Sbjct: 901  NNLIGFEFSSHVIRKFHPSVISGLFDDIPYQCKICGLRLKCEEQLDTHSRWHTLRTEANN 960

Query: 961  SNRAPRRWYPSSDDWVSGNARLLLDAATSLDKSNMMEEDSEPMVPADEDQFACVLCGELF 1020
            S+ APRRWYP SDDW+SGNAR LLDA TSLD+S++MEED+EPMVPADEDQFACV+CGELF
Sbjct: 961  SSTAPRRWYPCSDDWISGNARFLLDAETSLDESDLMEEDNEPMVPADEDQFACVICGELF 1020

Query: 1021 EDFYSQELGKWMFKGAMYITIPSAGSKVGSTNEQVARGPIVHTECLTESSVYDLGLATDV 1080
            EDFYSQELG WM+KGA YITIPS GS+VG TNEQVA+GPIVHT CLTESSVYD+GLATD+
Sbjct: 1021 EDFYSQELGNWMYKGATYITIPSVGSEVGGTNEQVAKGPIVHTTCLTESSVYDVGLATDI 1080

Query: 1081 KMVMFLVIYTFAGLDLVSLSSIDLLDDFMCCQSKDSAECYGIGLIKKWMYDASPAKRHIG 1137
            KM M + +       ++   +                        +KWMYDASPAKRH+G
Sbjct: 1081 KMAMIVRMLRHWAESIMKHQN------------------------RKWMYDASPAKRHVG 1130

BLAST of Spg033175 vs. ExPASy TrEMBL
Match: A0A1S3B6K6 (polyadenylation and cleavage factor homolog 4-like isoform X1 OS=Cucumis melo OX=3656 GN=LOC103486572 PE=4 SV=1)

HSP 1 Score: 1793.9 bits (4645), Expect = 0.0e+00
Identity = 937/1084 (86.44%), Positives = 995/1084 (91.79%), Query Frame = 0

Query: 1    MEMESSRRPFDRTREPGLKKPRLADEAERGGNINGRPFPQRPVVSGTNIV-QPRFRASDR 60
            MEMESSRRPFDRTREPGLKKPRLADEA+RG NINGRPFPQRPVVSG NIV QPRFRASDR
Sbjct: 1    MEMESSRRPFDRTREPGLKKPRLADEADRGANINGRPFPQRPVVSGNNIVQQPRFRASDR 60

Query: 61   DSGSSDSGRGGYQPQPPQHQELVSQYRTALAELTFNSKPIITNLTIIAGENLQAAKAISA 120
            DSGSSDSGRGGYQPQPPQHQELVSQYRTALAELTFNSKPIITNLTIIAGENLQAAKAIS 
Sbjct: 61   DSGSSDSGRGGYQPQPPQHQELVSQYRTALAELTFNSKPIITNLTIIAGENLQAAKAIST 120

Query: 121  TVCANILEVSSEQKLPSLYLLDSIVKNIGRDYIKYFAARLPEVFCKAYRQVDPSVHPSMR 180
            T+ ANILEV SEQKLPSLYLLDSIVKNIGRDYIKYFAARLPEVFCKAYRQVDPSVHPSMR
Sbjct: 121  TIYANILEVPSEQKLPSLYLLDSIVKNIGRDYIKYFAARLPEVFCKAYRQVDPSVHPSMR 180

Query: 181  HLFGTWKGVFPPQTLQIIEKELGFMPSSSSSSGTITSKPDLQAQRPA-HSIHVNPKYIER 240
            HLFGTWKGVFP QTLQIIEKELGF+P+ SSSS  I SKPDLQAQRP  HSIHVNPKYIER
Sbjct: 181  HLFGTWKGVFPLQTLQIIEKELGFVPTGSSSSVAINSKPDLQAQRPTPHSIHVNPKYIER 240

Query: 241  QRLQQSGRVKGMTSDATGATTNVTQDVAQAKISTGRPWADAPIKVLDIHRPLRDAPNDMA 300
            QRLQQSGRVKGM +DATG +TNV+QDVAQAKISTGRPWADAPIKVLDI RPLRDAPNDMA
Sbjct: 241  QRLQQSGRVKGMPTDATGGSTNVSQDVAQAKISTGRPWADAPIKVLDIQRPLRDAPNDMA 300

Query: 301  QEKNITAAYADYEYGSDLPRTTGIGRRVVDEGRDKPWSSAGSNLAEKLSGQRNGFNIKLG 360
            QEKN+TA Y+DYEYGSDL RT+ +GRRVVDEGRDKPW SAGSNL+EKLSGQRNGFNIKLG
Sbjct: 301  QEKNVTAGYSDYEYGSDLSRTSSVGRRVVDEGRDKPWPSAGSNLSEKLSGQRNGFNIKLG 360

Query: 361  YENYPAPKSANTGARLLPMQNFSSSSSNRVLSTNWKNSEEEEFMWGEMNSIMTGHGAPAI 420
            YENY APKS NTGARLLP+QNFSSSS+NRVLSTNWKNSEEEEFMWG+M+S++TGHGAPAI
Sbjct: 361  YENYSAPKSTNTGARLLPVQNFSSSSNNRVLSTNWKNSEEEEFMWGDMSSMLTGHGAPAI 420

Query: 421  ASSTGKDQWTPEDSDNSGIENKPLSVRDTGASVDREASSDSQSSEQRELGDSGQQRSSMW 480
             SSTGKDQWTPEDSDNSGI+NK LSVRDTGASVDREASSDSQSSEQRELGDSGQQRSS W
Sbjct: 421  NSSTGKDQWTPEDSDNSGIDNKHLSVRDTGASVDREASSDSQSSEQRELGDSGQQRSSTW 480

Query: 481  QVQEPISLDGLRGGVPRKNSAESGGYGA--------NSSVDQMGGRPQITSSNIGASGHG 540
            Q+QE ISLDGLR GVPRKNS +SGGYGA        NSSVDQMGGRPQIT SNIGASGHG
Sbjct: 481  QLQESISLDGLRAGVPRKNSGQSGGYGATLTALSGTNSSVDQMGGRPQITPSNIGASGHG 540

Query: 541  FLNKGGSGSIGTVGHQRFPSRSVAFPSGQPPLHQRPPSLLLVDHVPHQMHDHKTSSLSNL 600
            FLNKGGSG +G VGHQRFPSRSVAFPSGQPPLHQR PS LLVDHVPHQ+HD KT+S SNL
Sbjct: 541  FLNKGGSGPLGNVGHQRFPSRSVAFPSGQPPLHQRSPSQLLVDHVPHQIHDQKTTSFSNL 600

Query: 601  DPRKRHMQDAALGLHPSVRPDNLQKSQPQDLQASASSVPASQPKHQFSLSESLKPDVTQS 660
            DPRKRHMQDAALGLHPSVRPDN QK Q  DL+A ASS+P SQP+HQFSLSESLKPDVTQS
Sbjct: 601  DPRKRHMQDAALGLHPSVRPDNHQKPQTHDLRALASSIPGSQPRHQFSLSESLKPDVTQS 660

Query: 661  ELSSQHAVSIPGTNFGPPSSAGTVPD-LPAEILGEPSTSSLLAAVMKSGIFSNHSMTSSM 720
            ELSSQ AVSIPGT+FGP SSAGTVPD LPAEILG PSTSSLLAAVMKSG+FSNHS+TS+M
Sbjct: 661  ELSSQLAVSIPGTDFGPSSSAGTVPDRLPAEILGNPSTSSLLAAVMKSGLFSNHSITSNM 720

Query: 721  QQNISFQDVGNMPPRSSIKPPLPSRSSPAKTV------GESSLGPQSLESPSVLVKLSQT 780
            QQN+SFQDVGNM PRSSIKPPLP+RSSPA T       GESS+GP S+ESPS +VKLS+T
Sbjct: 721  QQNLSFQDVGNMKPRSSIKPPLPNRSSPAHTFSEPKIQGESSVGPPSVESPSTMVKLSRT 780

Query: 781  KLEETSLPPDPLPPSSPLNSASTETSNVVNDASSPISNLLSSLVAKGLISASKGELTNSA 840
            K+EE SLP DPLPPSSP++SASTETS+VVNDASSPISNLLSSLVAKGLISASKGE TNS 
Sbjct: 781  KVEEPSLPSDPLPPSSPMDSASTETSHVVNDASSPISNLLSSLVAKGLISASKGESTNSV 840

Query: 841  TSQMPSQPES-KLGDAVTCSLPVPSIPVSSSSLSSMGLESPSKAAAKSSTSPPPSSTTEI 900
            TSQMPSQPE+ K GDAVT S+PVPSI VSSS  SS  LESP KAAAKSSTSPPPS+TTEI
Sbjct: 841  TSQMPSQPENLKSGDAVTSSVPVPSIAVSSSCHSSTKLESPLKAAAKSSTSPPPSATTEI 900

Query: 901  NNLIGFEFSSHVIRKFHPSVISGLFDDIPYQCKICGLRLKLEEQLDTHSQWHTISTEANN 960
            NNLIGFEFSSHVIRKFHPSVISGLFDDIPYQCKICGLRLK EEQLDTHS+WHT+ TEANN
Sbjct: 901  NNLIGFEFSSHVIRKFHPSVISGLFDDIPYQCKICGLRLKCEEQLDTHSRWHTLRTEANN 960

Query: 961  SNRAPRRWYPSSDDWVSGNARLLLDAATSLDKSNMMEEDSEPMVPADEDQFACVLCGELF 1020
            S+ APRRWYP SDDW+SGNAR LLDA TSLD+S++MEED+EPMVPADEDQFACV+CGELF
Sbjct: 961  SSTAPRRWYPCSDDWISGNARFLLDAETSLDESDLMEEDNEPMVPADEDQFACVICGELF 1020

Query: 1021 EDFYSQELGKWMFKGAMYITIPSAGSKVGSTNEQVARGPIVHTECLTESSVYDLGLATDV 1067
            EDFYSQELG WM+KGA YITIPS GS+VG TNEQVA+GPIVHT CLTESSVYD+GLATD+
Sbjct: 1021 EDFYSQELGNWMYKGATYITIPSVGSEVGGTNEQVAKGPIVHTTCLTESSVYDVGLATDI 1080

BLAST of Spg033175 vs. ExPASy TrEMBL
Match: A0A1S3B794 (polyadenylation and cleavage factor homolog 4-like isoform X2 OS=Cucumis melo OX=3656 GN=LOC103486572 PE=4 SV=1)

HSP 1 Score: 1785.8 bits (4624), Expect = 0.0e+00
Identity = 935/1084 (86.25%), Positives = 993/1084 (91.61%), Query Frame = 0

Query: 1    MEMESSRRPFDRTREPGLKKPRLADEAERGGNINGRPFPQRPVVSGTNIV-QPRFRASDR 60
            MEMESSRRPFDRTREPGLKKPRLADEA+RG NINGRPFPQRPVVSG NIV QPRFRASDR
Sbjct: 1    MEMESSRRPFDRTREPGLKKPRLADEADRGANINGRPFPQRPVVSGNNIVQQPRFRASDR 60

Query: 61   DSGSSDSGRGGYQPQPPQHQELVSQYRTALAELTFNSKPIITNLTIIAGENLQAAKAISA 120
            DSGSSDSGRGGYQPQPPQHQELVSQYRTALAELTFNSKPIITNLTIIAGENLQAAKAIS 
Sbjct: 61   DSGSSDSGRGGYQPQPPQHQELVSQYRTALAELTFNSKPIITNLTIIAGENLQAAKAIST 120

Query: 121  TVCANILEVSSEQKLPSLYLLDSIVKNIGRDYIKYFAARLPEVFCKAYRQVDPSVHPSMR 180
            T+ ANILEV SEQKLPSLYLLDSIVKNIGRDYIKYFAARLPEVFCKAYRQVDPSVHPSMR
Sbjct: 121  TIYANILEVPSEQKLPSLYLLDSIVKNIGRDYIKYFAARLPEVFCKAYRQVDPSVHPSMR 180

Query: 181  HLFGTWKGVFPPQTLQIIEKELGFMPSSSSSSGTITSKPDLQAQRPA-HSIHVNPKYIER 240
            HLFGTWKGVFP QTLQIIEKELGF+P+ SSSS  I SKPDLQAQRP  HSIHVNPKYIER
Sbjct: 181  HLFGTWKGVFPLQTLQIIEKELGFVPTGSSSSVAINSKPDLQAQRPTPHSIHVNPKYIER 240

Query: 241  QRLQQSGRVKGMTSDATGATTNVTQDVAQAKISTGRPWADAPIKVLDIHRPLRDAPNDMA 300
            QRLQQSGRVKGM +DATG +TNV+QDVAQAKISTGRPWADAPIK  DI RPLRDAPNDMA
Sbjct: 241  QRLQQSGRVKGMPTDATGGSTNVSQDVAQAKISTGRPWADAPIK--DIQRPLRDAPNDMA 300

Query: 301  QEKNITAAYADYEYGSDLPRTTGIGRRVVDEGRDKPWSSAGSNLAEKLSGQRNGFNIKLG 360
            QEKN+TA Y+DYEYGSDL RT+ +GRRVVDEGRDKPW SAGSNL+EKLSGQRNGFNIKLG
Sbjct: 301  QEKNVTAGYSDYEYGSDLSRTSSVGRRVVDEGRDKPWPSAGSNLSEKLSGQRNGFNIKLG 360

Query: 361  YENYPAPKSANTGARLLPMQNFSSSSSNRVLSTNWKNSEEEEFMWGEMNSIMTGHGAPAI 420
            YENY APKS NTGARLLP+QNFSSSS+NRVLSTNWKNSEEEEFMWG+M+S++TGHGAPAI
Sbjct: 361  YENYSAPKSTNTGARLLPVQNFSSSSNNRVLSTNWKNSEEEEFMWGDMSSMLTGHGAPAI 420

Query: 421  ASSTGKDQWTPEDSDNSGIENKPLSVRDTGASVDREASSDSQSSEQRELGDSGQQRSSMW 480
             SSTGKDQWTPEDSDNSGI+NK LSVRDTGASVDREASSDSQSSEQRELGDSGQQRSS W
Sbjct: 421  NSSTGKDQWTPEDSDNSGIDNKHLSVRDTGASVDREASSDSQSSEQRELGDSGQQRSSTW 480

Query: 481  QVQEPISLDGLRGGVPRKNSAESGGYGA--------NSSVDQMGGRPQITSSNIGASGHG 540
            Q+QE ISLDGLR GVPRKNS +SGGYGA        NSSVDQMGGRPQIT SNIGASGHG
Sbjct: 481  QLQESISLDGLRAGVPRKNSGQSGGYGATLTALSGTNSSVDQMGGRPQITPSNIGASGHG 540

Query: 541  FLNKGGSGSIGTVGHQRFPSRSVAFPSGQPPLHQRPPSLLLVDHVPHQMHDHKTSSLSNL 600
            FLNKGGSG +G VGHQRFPSRSVAFPSGQPPLHQR PS LLVDHVPHQ+HD KT+S SNL
Sbjct: 541  FLNKGGSGPLGNVGHQRFPSRSVAFPSGQPPLHQRSPSQLLVDHVPHQIHDQKTTSFSNL 600

Query: 601  DPRKRHMQDAALGLHPSVRPDNLQKSQPQDLQASASSVPASQPKHQFSLSESLKPDVTQS 660
            DPRKRHMQDAALGLHPSVRPDN QK Q  DL+A ASS+P SQP+HQFSLSESLKPDVTQS
Sbjct: 601  DPRKRHMQDAALGLHPSVRPDNHQKPQTHDLRALASSIPGSQPRHQFSLSESLKPDVTQS 660

Query: 661  ELSSQHAVSIPGTNFGPPSSAGTVPD-LPAEILGEPSTSSLLAAVMKSGIFSNHSMTSSM 720
            ELSSQ AVSIPGT+FGP SSAGTVPD LPAEILG PSTSSLLAAVMKSG+FSNHS+TS+M
Sbjct: 661  ELSSQLAVSIPGTDFGPSSSAGTVPDRLPAEILGNPSTSSLLAAVMKSGLFSNHSITSNM 720

Query: 721  QQNISFQDVGNMPPRSSIKPPLPSRSSPAKTV------GESSLGPQSLESPSVLVKLSQT 780
            QQN+SFQDVGNM PRSSIKPPLP+RSSPA T       GESS+GP S+ESPS +VKLS+T
Sbjct: 721  QQNLSFQDVGNMKPRSSIKPPLPNRSSPAHTFSEPKIQGESSVGPPSVESPSTMVKLSRT 780

Query: 781  KLEETSLPPDPLPPSSPLNSASTETSNVVNDASSPISNLLSSLVAKGLISASKGELTNSA 840
            K+EE SLP DPLPPSSP++SASTETS+VVNDASSPISNLLSSLVAKGLISASKGE TNS 
Sbjct: 781  KVEEPSLPSDPLPPSSPMDSASTETSHVVNDASSPISNLLSSLVAKGLISASKGESTNSV 840

Query: 841  TSQMPSQPES-KLGDAVTCSLPVPSIPVSSSSLSSMGLESPSKAAAKSSTSPPPSSTTEI 900
            TSQMPSQPE+ K GDAVT S+PVPSI VSSS  SS  LESP KAAAKSSTSPPPS+TTEI
Sbjct: 841  TSQMPSQPENLKSGDAVTSSVPVPSIAVSSSCHSSTKLESPLKAAAKSSTSPPPSATTEI 900

Query: 901  NNLIGFEFSSHVIRKFHPSVISGLFDDIPYQCKICGLRLKLEEQLDTHSQWHTISTEANN 960
            NNLIGFEFSSHVIRKFHPSVISGLFDDIPYQCKICGLRLK EEQLDTHS+WHT+ TEANN
Sbjct: 901  NNLIGFEFSSHVIRKFHPSVISGLFDDIPYQCKICGLRLKCEEQLDTHSRWHTLRTEANN 960

Query: 961  SNRAPRRWYPSSDDWVSGNARLLLDAATSLDKSNMMEEDSEPMVPADEDQFACVLCGELF 1020
            S+ APRRWYP SDDW+SGNAR LLDA TSLD+S++MEED+EPMVPADEDQFACV+CGELF
Sbjct: 961  SSTAPRRWYPCSDDWISGNARFLLDAETSLDESDLMEEDNEPMVPADEDQFACVICGELF 1020

Query: 1021 EDFYSQELGKWMFKGAMYITIPSAGSKVGSTNEQVARGPIVHTECLTESSVYDLGLATDV 1067
            EDFYSQELG WM+KGA YITIPS GS+VG TNEQVA+GPIVHT CLTESSVYD+GLATD+
Sbjct: 1021 EDFYSQELGNWMYKGATYITIPSVGSEVGGTNEQVAKGPIVHTTCLTESSVYDVGLATDI 1080

BLAST of Spg033175 vs. ExPASy TrEMBL
Match: A0A0A0LGI0 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G750380 PE=4 SV=1)

HSP 1 Score: 1781.1 bits (4612), Expect = 0.0e+00
Identity = 931/1084 (85.89%), Positives = 989/1084 (91.24%), Query Frame = 0

Query: 1    MEMESSRRPFDRTREPGLKKPRLADEAERGGNINGRPFPQRPVVSGTNIV-QPRFRASDR 60
            MEMESSRRPFDRTREPGLKKPRLADEA+RG NINGRPFPQRPVVSG NIV QPRFR SDR
Sbjct: 1    MEMESSRRPFDRTREPGLKKPRLADEADRGANINGRPFPQRPVVSGNNIVQQPRFRPSDR 60

Query: 61   DSGSSDSGRGGYQPQPPQHQELVSQYRTALAELTFNSKPIITNLTIIAGENLQAAKAISA 120
            DSGSSDSGRGGYQPQPPQHQELVSQYRTALAELTFNSKPIITNLTIIAGENLQAAKAI++
Sbjct: 61   DSGSSDSGRGGYQPQPPQHQELVSQYRTALAELTFNSKPIITNLTIIAGENLQAAKAIAS 120

Query: 121  TVCANILEVSSEQKLPSLYLLDSIVKNIGRDYIKYFAARLPEVFCKAYRQVDPSVHPSMR 180
            T+ ANILEV SEQKLPSLYLLDSIVKNIGRDYIKYFAARLPEVFCKAYRQVDPSVHPSMR
Sbjct: 121  TIYANILEVPSEQKLPSLYLLDSIVKNIGRDYIKYFAARLPEVFCKAYRQVDPSVHPSMR 180

Query: 181  HLFGTWKGVFPPQTLQIIEKELGFMPSSSSSSGTITSKPDLQAQRPA-HSIHVNPKYIER 240
            HLFGTWKGVFP QTLQIIEKELGF+PS SSSS  ITSKPDLQAQRP  HSIHVNPKYIER
Sbjct: 181  HLFGTWKGVFPLQTLQIIEKELGFVPSGSSSSVAITSKPDLQAQRPTPHSIHVNPKYIER 240

Query: 241  QRLQQSGRVKGMTSDATGATTNVTQDVAQAKISTGRPWADAPIKVLDIHRPLRDAPNDMA 300
            QRLQQSGRVKG+ +DATG TTNV+QDVAQAK+STGRPWADAPIK  DI RPLRDA NDMA
Sbjct: 241  QRLQQSGRVKGIATDATGGTTNVSQDVAQAKMSTGRPWADAPIK--DIQRPLRDAQNDMA 300

Query: 301  QEKNITAAYADYEYGSDLPRTTGIGRRVVDEGRDKPWSSAGSNLAEKLSGQRNGFNIKLG 360
            QEKN+TA Y+DY+YGSDL RT+ +GRRVVDEGRDKPWSSAGSNL+EKLSGQRNGFN+KLG
Sbjct: 301  QEKNVTAGYSDYDYGSDLSRTSSVGRRVVDEGRDKPWSSAGSNLSEKLSGQRNGFNMKLG 360

Query: 361  YENYPAPKSANTGARLLPMQNFSSSSSNRVLSTNWKNSEEEEFMWGEMNSIMTGHGAPAI 420
            YENY APKS NTGARLLP+QNFSSSS+NRVLSTNWKNSEEEEFMWG+MNS++T HGAP I
Sbjct: 361  YENYSAPKSTNTGARLLPVQNFSSSSNNRVLSTNWKNSEEEEFMWGDMNSMLTSHGAPGI 420

Query: 421  ASSTGKDQWTPEDSDNSGIENKPLSVRDTGASVDREASSDSQSSEQRELGDSGQQRSSMW 480
             SS GKDQWTPEDSDNSGI+NK +SVRDTGASVDREASSDSQSSEQRELGDSGQQRSS W
Sbjct: 421  TSSAGKDQWTPEDSDNSGIDNKHVSVRDTGASVDREASSDSQSSEQRELGDSGQQRSSTW 480

Query: 481  QVQEPISLDGLRGGVPRKNSAESGGYGA--------NSSVDQMGGRPQITSSNIGASGHG 540
            Q+QE I LDGLRGGVPRKNS +SGGYGA        NSSVDQMGGRPQIT S+IGASGHG
Sbjct: 481  QLQESIPLDGLRGGVPRKNSGQSGGYGATLTSLSGTNSSVDQMGGRPQITPSSIGASGHG 540

Query: 541  FLNKGGSGSIGTVGHQRFPSRSVAFPSGQPPLHQRPPSLLLVDHVPHQMHDHKTSSLSNL 600
            FLNKG SG +G VGHQRFPSRSVAFPSGQPPLHQR  S LLVDHVPHQ+HDHKT+S SNL
Sbjct: 541  FLNKGSSGPLGNVGHQRFPSRSVAFPSGQPPLHQRSSSQLLVDHVPHQVHDHKTTSFSNL 600

Query: 601  DPRKRHMQDAALGLHPSVRPDNLQKSQPQDLQASASSVPASQPKHQFSLSESLKPDVTQS 660
            DPRKRHMQDAALGLHPSVRPD+LQK QP DLQA ASS+P SQP+HQFSLSESLKPD+TQS
Sbjct: 601  DPRKRHMQDAALGLHPSVRPDSLQKPQPHDLQALASSIPGSQPRHQFSLSESLKPDITQS 660

Query: 661  ELSSQHAVSIPGTNFGPPSSAGTVPD-LPAEILGEPSTSSLLAAVMKSGIFSNHSMTSSM 720
            ELSSQ A  IPGT+FGP SSAGTVPD LPAEILG PSTSSLLAAVMKSG+FSNHS+TS+M
Sbjct: 661  ELSSQLAAPIPGTDFGPSSSAGTVPDRLPAEILGNPSTSSLLAAVMKSGLFSNHSITSNM 720

Query: 721  QQNISFQDVGNMPPRSSIKPPLPSRSSPAKTV------GESSLGPQSLESPSVLVKLSQT 780
            QQNISFQDVGNM PRSSIKPPLPSRSSPA T       GESS+GP SLESPS +VKLSQT
Sbjct: 721  QQNISFQDVGNMKPRSSIKPPLPSRSSPAHTFSEPKIQGESSVGPPSLESPSTMVKLSQT 780

Query: 781  KLEETSLPPDPLPPSSPLNSASTETSNVVNDASSPISNLLSSLVAKGLISASKGELTNSA 840
            K+EE SLP DPLPPSSP++SASTETSNVVNDASSPISNLLSSLVAKGLISASKGE TNS 
Sbjct: 781  KVEEPSLPSDPLPPSSPMDSASTETSNVVNDASSPISNLLSSLVAKGLISASKGESTNSV 840

Query: 841  TSQMPSQPES-KLGDAVTCSLPVPSIPVSSSSLSSMGLESPSKAAAKSSTSPPPSSTTEI 900
            TSQMPSQPE  K GDAVT S+PVPSIP+SSS  S   LESPSKAAAK STSPPPS+TTEI
Sbjct: 841  TSQMPSQPEKLKSGDAVTSSVPVPSIPISSSCHSPTKLESPSKAAAKISTSPPPSATTEI 900

Query: 901  NNLIGFEFSSHVIRKFHPSVISGLFDDIPYQCKICGLRLKLEEQLDTHSQWHTISTEANN 960
            NNLIGFEFSSHVIRKFHPSVISGLF+DIPYQCKICGLRLK EE LD HS+WHT+ TEANN
Sbjct: 901  NNLIGFEFSSHVIRKFHPSVISGLFEDIPYQCKICGLRLKCEEHLDIHSRWHTLRTEANN 960

Query: 961  SNRAPRRWYPSSDDWVSGNARLLLDAATSLDKSNMMEEDSEPMVPADEDQFACVLCGELF 1020
            S+ APRRWYPSSDDW+SGNAR LLDA TSLD+S++MEED+EPMVPADEDQFACV+CGELF
Sbjct: 961  SSGAPRRWYPSSDDWISGNARFLLDAVTSLDESDLMEEDNEPMVPADEDQFACVICGELF 1020

Query: 1021 EDFYSQELGKWMFKGAMYITIPSAGSKVGSTNEQVARGPIVHTECLTESSVYDLGLATDV 1067
            ED YSQELG WMFKGAMYITIPS GS+VGSTNEQVARGPIVHT CLTESSVYD+GLATD+
Sbjct: 1021 EDSYSQELGDWMFKGAMYITIPSVGSEVGSTNEQVARGPIVHTACLTESSVYDVGLATDI 1080

BLAST of Spg033175 vs. ExPASy TrEMBL
Match: A0A5D3DPM4 (Polyadenylation and cleavage factor-like protein 4-like isoform X2 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold352G003500 PE=4 SV=1)

HSP 1 Score: 1776.5 bits (4600), Expect = 0.0e+00
Identity = 947/1154 (82.06%), Positives = 1008/1154 (87.35%), Query Frame = 0

Query: 1    MEMESSRRPFDRTREPGLKKPRLADEAERGGNINGRPFPQRPVVSGTNIV-QPRFRASDR 60
            MEMESSRRPFDRTREPGLKKPRLADEA+RG NINGRPFPQRPVVSG NIV QPRFRASDR
Sbjct: 1    MEMESSRRPFDRTREPGLKKPRLADEADRGANINGRPFPQRPVVSGNNIVQQPRFRASDR 60

Query: 61   DSGSSDSGRGGYQPQPPQHQELVSQYRTALAELTFNSKPIITNLTIIAGENLQAAKAISA 120
            DSGSSDSGRGGYQPQPPQHQELVSQYRTALAELTFNSKPIITNLTIIAGENLQAAKAIS 
Sbjct: 61   DSGSSDSGRGGYQPQPPQHQELVSQYRTALAELTFNSKPIITNLTIIAGENLQAAKAIST 120

Query: 121  TVCANILEVSSEQKLPSLYLLDSIVKNIGRDYIKYFAARLPEVFCKAYRQVDPSVHPSMR 180
            T+ ANILEV SEQKLPSLYLLDSIVKNIGRDYIKYFAARLPEVFCKAYRQVDPSVHPSMR
Sbjct: 121  TIYANILEVPSEQKLPSLYLLDSIVKNIGRDYIKYFAARLPEVFCKAYRQVDPSVHPSMR 180

Query: 181  HLFGTWKGVFPPQTLQIIEKELGFMPSSSSSSGTITSKPDLQAQRPA-HSIHVNPKYIER 240
            HLFGTWKGVFP QTLQIIEKELGF+P+ SSSS  I SKPDLQAQRP  HSIHVNPKYIER
Sbjct: 181  HLFGTWKGVFPLQTLQIIEKELGFVPTGSSSSVAINSKPDLQAQRPTPHSIHVNPKYIER 240

Query: 241  QRLQQSGRVKGMTSDATGATTNVTQDVAQAKISTGRPWADAPIKVLDIHRPLRDAPNDMA 300
            QRLQQSGRVKGM +DATG +TNV+QDVAQAKISTGRPWADAPIK  DI RPLRDAPNDMA
Sbjct: 241  QRLQQSGRVKGMPTDATGGSTNVSQDVAQAKISTGRPWADAPIK--DIQRPLRDAPNDMA 300

Query: 301  QEKNITAAYADYEYGSDLPRTTGIGRRVVDEGRDKPWSSAGSNLAEKLSGQRNGFNIKLG 360
            QEKN+TA Y+DYEYGSDL RT+ +GRRVVDEGRDKPW SAGSNL+EKLSGQRNGFNIKLG
Sbjct: 301  QEKNVTAGYSDYEYGSDLSRTSSVGRRVVDEGRDKPWPSAGSNLSEKLSGQRNGFNIKLG 360

Query: 361  YENYPAPKSANTGARLLPMQNFSSSSSNRVLSTNWKNSEEEEFMWGEMNSIMTGHGAPAI 420
            YENY APKS NTGARLLP+QNFSSSS+NRVLSTNWKNSEEEEFMWG+M+S++TGHGAPAI
Sbjct: 361  YENYSAPKSTNTGARLLPVQNFSSSSNNRVLSTNWKNSEEEEFMWGDMSSMLTGHGAPAI 420

Query: 421  ASSTGKDQWTPEDSDNSGIENKPLSVRDTGASVDREASSDSQSSEQRELGDSGQQRSSMW 480
             SSTGKDQWTPEDSDNSGI+NK LSVRDTGASVDREASSDSQSSEQRELGDSGQQRSS W
Sbjct: 421  NSSTGKDQWTPEDSDNSGIDNKHLSVRDTGASVDREASSDSQSSEQRELGDSGQQRSSTW 480

Query: 481  QVQEPISLDGLRGGVPRKNSAESGGYGA--------NSSVDQMGGRPQITSSNIGASGHG 540
            Q+QE ISLDGLR GVPRKNS +SGGYGA        NSSVDQMGGRPQIT SNIGASGHG
Sbjct: 481  QLQESISLDGLRAGVPRKNSGQSGGYGATLTALSGTNSSVDQMGGRPQITPSNIGASGHG 540

Query: 541  FLNKGGSGSIGTVGHQRFPSRSVAFPSGQPPLHQRPPSLLLVDHVPHQMHDHKTSSLSNL 600
            FLNKGGSG +G VGHQRFPSRSVAFPSGQPPLHQR PS LLVDHVPHQ+HD KT+S SNL
Sbjct: 541  FLNKGGSGPLGNVGHQRFPSRSVAFPSGQPPLHQRSPSQLLVDHVPHQIHDQKTTSFSNL 600

Query: 601  DPRKRHMQDAALGLHPSVRPDNLQKSQPQDLQASASSVPASQPKHQFSLSESLKPDVTQS 660
            DPRKRHMQDAALGLHPSVRPDN QK Q  DL+A ASS+P SQP+HQFSLSESLKPDVTQS
Sbjct: 601  DPRKRHMQDAALGLHPSVRPDNHQKPQTHDLRALASSIPGSQPRHQFSLSESLKPDVTQS 660

Query: 661  ELSSQHAVSIPGTNFGPPSSAGTVPD-LPAEILGEPSTSSLLAAVMKSGIFSNHSMTSSM 720
            ELSSQ AVSIPGT+FGP SSAGTVPD LPAEILG PSTSSLLAAVMKSG+FSNHS+TS+M
Sbjct: 661  ELSSQLAVSIPGTDFGPSSSAGTVPDRLPAEILGNPSTSSLLAAVMKSGLFSNHSITSNM 720

Query: 721  QQNISFQDVGNMPPRSSIKPPLPSRSSPAKTV------GESSLGPQSLESPSVLVKLSQT 780
            QQN+SFQDVGNM PRSSIKPPLP+RSSPA T       GESS+GP S+ESPS +VKLS+T
Sbjct: 721  QQNLSFQDVGNMKPRSSIKPPLPNRSSPAHTFSEPKIQGESSVGPPSVESPSTMVKLSRT 780

Query: 781  KLEETSLPPDPLPPSSPLNSASTETSNVVNDASSPISNLLSSLVAKGLISASKGELTNSA 840
            K+EE SLP DPLPPSSP++SASTETS+VVNDASSPISNLLSSLVAKGLISASKGE TNS 
Sbjct: 781  KVEEPSLPSDPLPPSSPMDSASTETSHVVNDASSPISNLLSSLVAKGLISASKGESTNSV 840

Query: 841  TSQMPSQPES-KLGDAVTCSLPVPSIPVSSSSLSSMGLESPSKAAAKSSTSPPPSSTTEI 900
            TSQMPSQPE+ K GDAVT S+PVPSI VSSS  SS  LESP KAAAKSSTSPPPS+TTEI
Sbjct: 841  TSQMPSQPENLKSGDAVTSSVPVPSIAVSSSCHSSTKLESPLKAAAKSSTSPPPSATTEI 900

Query: 901  NNLIGFEFSSHVIRKFHPSVISGLFDDIPYQCKICGLRLKLEEQLDTHSQWHTISTEANN 960
            NNLIGFEFSSHVIRKFHPSVISGLFDDIPYQCKICGLRLK EEQLDTHS+WHT+ TEANN
Sbjct: 901  NNLIGFEFSSHVIRKFHPSVISGLFDDIPYQCKICGLRLKCEEQLDTHSRWHTLRTEANN 960

Query: 961  SNRAPRRWYPSSDDWVSGNARLLLDAATSLDKSNMMEEDSEPMVPADEDQFACVLCGELF 1020
            S+ APRRWYP SDDW+SGNAR LLDA TSLD+S++MEED+EPMVPADEDQFACV+CGELF
Sbjct: 961  SSTAPRRWYPCSDDWISGNARFLLDAETSLDESDLMEEDNEPMVPADEDQFACVICGELF 1020

Query: 1021 EDFYSQELGKWMFKGAMYITIPSAGSKVGSTNEQVARGPIVHTECLTESSVYDLGLATDV 1080
            EDFYSQELG WM+KGA YITIPS GS+VG TNEQVA+GPI            DL      
Sbjct: 1021 EDFYSQELGNWMYKGATYITIPSVGSEVGGTNEQVAKGPI------------DLIWLRYH 1080

Query: 1081 KMVMFLVIYTFAGLDLVSLSSIDLLDDFMCCQSKDSAECYGIGLIKKWMYDASPAKRHIG 1137
             +    +IY F                 + C+              KWMYDASPAKRH+G
Sbjct: 1081 PLFSLNIIYLF----------------IIICK-------------WKWMYDASPAKRHVG 1111

BLAST of Spg033175 vs. TAIR 10
Match: AT2G36480.2 (ENTH/VHS family protein )

HSP 1 Score: 440.3 bits (1131), Expect = 4.9e-123
Identity = 349/974 (35.83%), Positives = 493/974 (50.62%), Query Frame = 0

Query: 126  LEVSSEQKLPSLYLLDSIVKNIGRDYIKYFAARLPEVFCKAYRQVDPSVHPSMRHLFGTW 185
            ++V S+QKLP+LYLLDSIVKNIGRDYIKYF ARLPEVF KAYRQVDP +H +MRHLFGTW
Sbjct: 1    MQVPSDQKLPTLYLLDSIVKNIGRDYIKYFGARLPEVFVKAYRQVDPPMHSNMRHLFGTW 60

Query: 186  KGVFPPQTLQIIEKELGFMPSSSSSSGTI-TSKPDLQAQRPAHSIHVNPKYIERQRLQQS 245
            KGVF PQTLQ+IEKELGF   S  S+  + T++ + Q+QRP HSIHVNPKY+ERQRLQQS
Sbjct: 61   KGVFHPQTLQLIEKELGFNAKSDGSAAVVSTARAEPQSQRPPHSIHVNPKYLERQRLQQS 120

Query: 246  GRVKGMTSDATGATTNVTQDVAQ----AKISTGRPWADAPIKVLDIHRPLRDAPNDMAQE 305
            GR KGM +D      N+T+D  +    + I++G  W   P KV +I RP RD  ++   E
Sbjct: 121  GRTKGMVTDVPETAPNLTRDSDRLERVSSIASGGSWV-GPAKVNNIRRPQRDLLSEPLYE 180

Query: 306  KNITAAYADYEYGSDLPRTT-----GIGRRVVDEGRDKPWSSAGSNLAEKLSGQRNGFNI 365
            K+I +   +Y+Y SDLP  +      +G R+ D+G +K W  A +   + +S QR+G + 
Sbjct: 181  KDIESIAGEYDYASDLPHNSRSVIKNVGSRITDDGCEKQWYGATNRDPDLISDQRDGLHS 240

Query: 366  KLGYENYPAPKSANTGARLLPMQNFSSSSSNRVLST---NWKNSEEEEFMWGEMNSIMTG 425
            K    NY   +          ++N  SS  +R +     +WKNSEEEEFMW +M+S ++ 
Sbjct: 241  KSRTSNYATAR----------VENLESSGPSRNIGVPYDSWKNSEEEEFMW-DMHSRLSE 300

Query: 426  HGAPAIASSTGKDQWTPEDSDNSGIENKPLSVRDTGA---SVDREASSDSQSSEQRELGD 485
                 I  +   +   P++S+    EN  L      A     D   S++S SSEQ++   
Sbjct: 301  TDVATI--NPKNELHAPDESERLESENHLLKRPRFSALDPRFDPANSTNSYSSEQKDPSS 360

Query: 486  SGQQRSSMWQVQEPISLDGLRGGVPRKNSAESGGYGANSSVDQMGGRPQITSSNIGASGH 545
             G    S        +  G++                         +P++ SS       
Sbjct: 361  IGHWAFSSTNATSTATRKGIQ------------------------PQPRVASS------- 420

Query: 546  GFLNKGGSGSIGTVGHQRFPSRSVAFPSGQPPLHQRPPSLLLVDHVPHQMHDHKTSSLSN 605
            G L   GSGS                   Q PLH       +      + H     SL  
Sbjct: 421  GILPSSGSGS-----------------DRQSPLHDSTSKQNVTKQDVRRAH-----SLPQ 480

Query: 606  LDPRKRHMQDAALGLHPSVRPDNLQKSQPQDLQASASSVPASQPKHQFSLSESLKPDVTQ 665
             DPR                               AS  PA   K      +S++   + 
Sbjct: 481  RDPR-------------------------------ASRFPA---KQNVPRDDSVRLPSSS 540

Query: 666  SELSSQHAVSIPGTNFGPPSSAGTVP--DLPAEILGEPSTSSLLAAVMKSGIFSNHSMTS 725
            S+  + +   +P   F   S+A   P   L +E  G+P+ S LL AVMKSGI SN+S   
Sbjct: 541  SQFKNTNMRELPVEIFDSKSAAENAPGLTLASEATGQPNMSDLLEAVMKSGILSNNSTCG 600

Query: 726  SMQQNISFQDVGNMPPRSSIKP---PLPSRSSPAKTVGESSLGPQSLESPSVLVKLSQTK 785
            ++++               + P    LP+ S P KT+      P SL + ++L +L   K
Sbjct: 601  AIKEE----------SHDEVNPGALTLPAASKP-KTL------PISLATDNLLARL---K 660

Query: 786  LEETSLPPDPLPPSSPLNSASTETSNVVNDASSPISNLLSSLVAKGLISASKGELTNSAT 845
            +E++S P      +S     S +TS   + AS P+S LLSSLV+KGLISASK EL ++ +
Sbjct: 661  VEQSSAPLVSC-AASLTGITSVQTSKEKSKASDPLSCLLSSLVSKGLISASKTELPSAPS 720

Query: 846  SQMPSQPESKLGDAVTCSLPVPSIPVSSSSLSSMGLESPSKA-AAKSSTSPPPSSTTEIN 905
                  P+     +++ S+      V + +  S+ ++ PS A   K   +P  +S +E  
Sbjct: 721  ITQEHSPDHSTNSSMSVSV------VPADAQPSVLVKGPSTAPKVKGLAAPSETSKSEPK 780

Query: 906  NLIGFEFSSHVIRKFHPSVISGLFDDIPYQCKICGLRLKLEEQLDTHSQWH-TISTEANN 965
            +LIG +F +  IR+ HPSVIS LFDD+P+ C  C +RLK +E+LD H + H     E + 
Sbjct: 781  DLIGLKFRADKIRELHPSVISSLFDDLPHLCTSCSVRLKQKEELDRHMELHDKKKLELSG 837

Query: 966  SNRAPRRWYPSSDDWVSGNA-RLLLDAATSLDKSNMMEEDSEPMVPADEDQFACVLCGEL 1025
            +N   R W+P  D+W++  A  L  +    L +     ED +  V ADE Q AC+LCGE+
Sbjct: 841  TNSKCRVWFPKVDNWIAAKAGELEPEYEEVLSEPESAIEDCQ-AVAADETQCACILCGEV 837

Query: 1026 FEDFYSQELGKWMFKGAMYITIPSAGSKVGSTNEQVARGPIVHTECLTESSVYDLGLATD 1076
            FED++SQE+ +WMFKGA Y+T P A S+        A GPIVHT CLT SS+  L +   
Sbjct: 901  FEDYFSQEMAQWMFKGASYLTNPPANSE--------ASGPIVHTGCLTTSSLQSLEVGIA 837

BLAST of Spg033175 vs. TAIR 10
Match: AT2G36480.1 (ENTH/VHS family protein )

HSP 1 Score: 439.1 bits (1128), Expect = 1.1e-122
Identity = 346/962 (35.97%), Positives = 489/962 (50.83%), Query Frame = 0

Query: 126  LEVSSEQKLPSLYLLDSIVKNIGRDYIKYFAARLPEVFCKAYRQVDPSVHPSMRHLFGTW 185
            ++V S+QKLP+LYLLDSIVKNIGRDYIKYF ARLPEVF KAYRQVDP +H +MRHLFGTW
Sbjct: 1    MQVPSDQKLPTLYLLDSIVKNIGRDYIKYFGARLPEVFVKAYRQVDPPMHSNMRHLFGTW 60

Query: 186  KGVFPPQTLQIIEKELGFMPSSSSSSGTI-TSKPDLQAQRPAHSIHVNPKYIERQRLQQS 245
            KGVF PQTLQ+IEKELGF   S  S+  + T++ + Q+QRP HSIHVNPKY+ERQRLQQS
Sbjct: 61   KGVFHPQTLQLIEKELGFNAKSDGSAAVVSTARAEPQSQRPPHSIHVNPKYLERQRLQQS 120

Query: 246  GRVKGMTSDATGATTNVTQDVAQ----AKISTGRPWADAPIKVLDIHRPLRDAPNDMAQE 305
            GR KGM +D      N+T+D  +    + I++G  W   P KV +I RP RD  ++   E
Sbjct: 121  GRTKGMVTDVPETAPNLTRDSDRLERVSSIASGGSWV-GPAKVNNIRRPQRDLLSEPLYE 180

Query: 306  KNITAAYADYEYGSDLPRTT-----GIGRRVVDEGRDKPWSSAGSNLAEKLSGQRNGFNI 365
            K+I +   +Y+Y SDLP  +      +G R+ D+G +K W  A +   + +S QR+G + 
Sbjct: 181  KDIESIAGEYDYASDLPHNSRSVIKNVGSRITDDGCEKQWYGATNRDPDLISDQRDGLHS 240

Query: 366  KLGYENYPAPKSANTGARLLPMQNFSSSSSNRVLST---NWKNSEEEEFMWGEMNSIMTG 425
            K    NY   +          ++N  SS  +R +     +WKNSEEEEFMW +M+S ++ 
Sbjct: 241  KSRTSNYATAR----------VENLESSGPSRNIGVPYDSWKNSEEEEFMW-DMHSRLSE 300

Query: 426  HGAPAIASSTGKDQWTPEDSDNSGIENKPLSVRDTGA---SVDREASSDSQSSEQRELGD 485
                 I  +   +   P++S+    EN  L      A     D   S++S SSEQ++   
Sbjct: 301  TDVATI--NPKNELHAPDESERLESENHLLKRPRFSALDPRFDPANSTNSYSSEQKDPSS 360

Query: 486  SGQQRSSMWQVQEPISLDGLRGGVPRKNSAESGGYGANSSVDQMGGRPQITSSNIGASGH 545
             G    S        +  G++                         +P++ SS       
Sbjct: 361  IGHWAFSSTNATSTATRKGIQ------------------------PQPRVASS------- 420

Query: 546  GFLNKGGSGSIGTVGHQRFPSRSVAFPSGQPPLHQRPPSLLLVDHVPHQMHDHKTSSLSN 605
            G L   GSGS                   Q PLH       +      + H     SL  
Sbjct: 421  GILPSSGSGS-----------------DRQSPLHDSTSKQNVTKQDVRRAH-----SLPQ 480

Query: 606  LDPRKRHMQDAALGLHPSVRPDNLQKSQPQDLQASASSVPASQPKHQFSLSESLKPDVTQ 665
             DPR                               AS  PA   K      +S++   + 
Sbjct: 481  RDPR-------------------------------ASRFPA---KQNVPRDDSVRLPSSS 540

Query: 666  SELSSQHAVSIPGTNFGPPSSAGTVP--DLPAEILGEPSTSSLLAAVMKSGIFSNHSMTS 725
            S+  + +   +P   F   S+A   P   L +E  G+P+ S LL AVMKSGI SN+S   
Sbjct: 541  SQFKNTNMRELPVEIFDSKSAAENAPGLTLASEATGQPNMSDLLEAVMKSGILSNNSTCG 600

Query: 726  SMQQNISFQDVGNMPPRSSIKP---PLPSRSSPAKTVGESSLGPQSLESPSVLVKLSQTK 785
            ++++               + P    LP+ S P KT+      P SL + ++L +L   K
Sbjct: 601  AIKEE----------SHDEVNPGALTLPAASKP-KTL------PISLATDNLLARL---K 660

Query: 786  LEETSLPPDPLPPSSPLNSASTETSNVVNDASSPISNLLSSLVAKGLISASKGELTNSAT 845
            +E++S P      +S     S +TS   + AS P+S LLSSLV+KGLISASK EL ++ +
Sbjct: 661  VEQSSAPLVSC-AASLTGITSVQTSKEKSKASDPLSCLLSSLVSKGLISASKTELPSAPS 720

Query: 846  SQMPSQPESKLGDAVTCSLPVPSIPVSSSSLSSMGLESPSKA-AAKSSTSPPPSSTTEIN 905
                  P+     +++ S+      V + +  S+ ++ PS A   K   +P  +S +E  
Sbjct: 721  ITQEHSPDHSTNSSMSVSV------VPADAQPSVLVKGPSTAPKVKGLAAPSETSKSEPK 780

Query: 906  NLIGFEFSSHVIRKFHPSVISGLFDDIPYQCKICGLRLKLEEQLDTHSQWH-TISTEANN 965
            +LIG +F +  IR+ HPSVIS LFDD+P+ C  C +RLK +E+LD H + H     E + 
Sbjct: 781  DLIGLKFRADKIRELHPSVISSLFDDLPHLCTSCSVRLKQKEELDRHMELHDKKKLELSG 825

Query: 966  SNRAPRRWYPSSDDWVSGNA-RLLLDAATSLDKSNMMEEDSEPMVPADEDQFACVLCGEL 1025
            +N   R W+P  D+W++  A  L  +    L +     ED +  V ADE Q AC+LCGE+
Sbjct: 841  TNSKCRVWFPKVDNWIAAKAGELEPEYEEVLSEPESAIEDCQ-AVAADETQCACILCGEV 825

Query: 1026 FEDFYSQELGKWMFKGAMYITIPSAGSKVGSTNEQVARGPIVHTECLTESSVYDLGLATD 1064
            FED++SQE+ +WMFKGA Y+T P A S+        A GPIVHT CLT SS+  L +   
Sbjct: 901  FEDYFSQEMAQWMFKGASYLTNPPANSE--------ASGPIVHTGCLTTSSLQSLEVGIA 825

BLAST of Spg033175 vs. TAIR 10
Match: AT2G36480.3 (ENTH/VHS family protein )

HSP 1 Score: 439.1 bits (1128), Expect = 1.1e-122
Identity = 346/962 (35.97%), Positives = 489/962 (50.83%), Query Frame = 0

Query: 126  LEVSSEQKLPSLYLLDSIVKNIGRDYIKYFAARLPEVFCKAYRQVDPSVHPSMRHLFGTW 185
            ++V S+QKLP+LYLLDSIVKNIGRDYIKYF ARLPEVF KAYRQVDP +H +MRHLFGTW
Sbjct: 1    MQVPSDQKLPTLYLLDSIVKNIGRDYIKYFGARLPEVFVKAYRQVDPPMHSNMRHLFGTW 60

Query: 186  KGVFPPQTLQIIEKELGFMPSSSSSSGTI-TSKPDLQAQRPAHSIHVNPKYIERQRLQQS 245
            KGVF PQTLQ+IEKELGF   S  S+  + T++ + Q+QRP HSIHVNPKY+ERQRLQQS
Sbjct: 61   KGVFHPQTLQLIEKELGFNAKSDGSAAVVSTARAEPQSQRPPHSIHVNPKYLERQRLQQS 120

Query: 246  GRVKGMTSDATGATTNVTQDVAQ----AKISTGRPWADAPIKVLDIHRPLRDAPNDMAQE 305
            GR KGM +D      N+T+D  +    + I++G  W   P KV +I RP RD  ++   E
Sbjct: 121  GRTKGMVTDVPETAPNLTRDSDRLERVSSIASGGSWV-GPAKVNNIRRPQRDLLSEPLYE 180

Query: 306  KNITAAYADYEYGSDLPRTT-----GIGRRVVDEGRDKPWSSAGSNLAEKLSGQRNGFNI 365
            K+I +   +Y+Y SDLP  +      +G R+ D+G +K W  A +   + +S QR+G + 
Sbjct: 181  KDIESIAGEYDYASDLPHNSRSVIKNVGSRITDDGCEKQWYGATNRDPDLISDQRDGLHS 240

Query: 366  KLGYENYPAPKSANTGARLLPMQNFSSSSSNRVLST---NWKNSEEEEFMWGEMNSIMTG 425
            K    NY   +          ++N  SS  +R +     +WKNSEEEEFMW +M+S ++ 
Sbjct: 241  KSRTSNYATAR----------VENLESSGPSRNIGVPYDSWKNSEEEEFMW-DMHSRLSE 300

Query: 426  HGAPAIASSTGKDQWTPEDSDNSGIENKPLSVRDTGA---SVDREASSDSQSSEQRELGD 485
                 I  +   +   P++S+    EN  L      A     D   S++S SSEQ++   
Sbjct: 301  TDVATI--NPKNELHAPDESERLESENHLLKRPRFSALDPRFDPANSTNSYSSEQKDPSS 360

Query: 486  SGQQRSSMWQVQEPISLDGLRGGVPRKNSAESGGYGANSSVDQMGGRPQITSSNIGASGH 545
             G    S        +  G++                         +P++ SS       
Sbjct: 361  IGHWAFSSTNATSTATRKGIQ------------------------PQPRVASS------- 420

Query: 546  GFLNKGGSGSIGTVGHQRFPSRSVAFPSGQPPLHQRPPSLLLVDHVPHQMHDHKTSSLSN 605
            G L   GSGS                   Q PLH       +      + H     SL  
Sbjct: 421  GILPSSGSGS-----------------DRQSPLHDSTSKQNVTKQDVRRAH-----SLPQ 480

Query: 606  LDPRKRHMQDAALGLHPSVRPDNLQKSQPQDLQASASSVPASQPKHQFSLSESLKPDVTQ 665
             DPR                               AS  PA   K      +S++   + 
Sbjct: 481  RDPR-------------------------------ASRFPA---KQNVPRDDSVRLPSSS 540

Query: 666  SELSSQHAVSIPGTNFGPPSSAGTVP--DLPAEILGEPSTSSLLAAVMKSGIFSNHSMTS 725
            S+  + +   +P   F   S+A   P   L +E  G+P+ S LL AVMKSGI SN+S   
Sbjct: 541  SQFKNTNMRELPVEIFDSKSAAENAPGLTLASEATGQPNMSDLLEAVMKSGILSNNSTCG 600

Query: 726  SMQQNISFQDVGNMPPRSSIKP---PLPSRSSPAKTVGESSLGPQSLESPSVLVKLSQTK 785
            ++++               + P    LP+ S P KT+      P SL + ++L +L   K
Sbjct: 601  AIKEE----------SHDEVNPGALTLPAASKP-KTL------PISLATDNLLARL---K 660

Query: 786  LEETSLPPDPLPPSSPLNSASTETSNVVNDASSPISNLLSSLVAKGLISASKGELTNSAT 845
            +E++S P      +S     S +TS   + AS P+S LLSSLV+KGLISASK EL ++ +
Sbjct: 661  VEQSSAPLVSC-AASLTGITSVQTSKEKSKASDPLSCLLSSLVSKGLISASKTELPSAPS 720

Query: 846  SQMPSQPESKLGDAVTCSLPVPSIPVSSSSLSSMGLESPSKA-AAKSSTSPPPSSTTEIN 905
                  P+     +++ S+      V + +  S+ ++ PS A   K   +P  +S +E  
Sbjct: 721  ITQEHSPDHSTNSSMSVSV------VPADAQPSVLVKGPSTAPKVKGLAAPSETSKSEPK 780

Query: 906  NLIGFEFSSHVIRKFHPSVISGLFDDIPYQCKICGLRLKLEEQLDTHSQWH-TISTEANN 965
            +LIG +F +  IR+ HPSVIS LFDD+P+ C  C +RLK +E+LD H + H     E + 
Sbjct: 781  DLIGLKFRADKIRELHPSVISSLFDDLPHLCTSCSVRLKQKEELDRHMELHDKKKLELSG 825

Query: 966  SNRAPRRWYPSSDDWVSGNA-RLLLDAATSLDKSNMMEEDSEPMVPADEDQFACVLCGEL 1025
            +N   R W+P  D+W++  A  L  +    L +     ED +  V ADE Q AC+LCGE+
Sbjct: 841  TNSKCRVWFPKVDNWIAAKAGELEPEYEEVLSEPESAIEDCQ-AVAADETQCACILCGEV 825

Query: 1026 FEDFYSQELGKWMFKGAMYITIPSAGSKVGSTNEQVARGPIVHTECLTESSVYDLGLATD 1064
            FED++SQE+ +WMFKGA Y+T P A S+        A GPIVHT CLT SS+  L +   
Sbjct: 901  FEDYFSQEMAQWMFKGASYLTNPPANSE--------ASGPIVHTGCLTTSSLQSLEVGIA 825

BLAST of Spg033175 vs. TAIR 10
Match: AT4G04885.1 (PCF11P-similar protein 4 )

HSP 1 Score: 245.4 bits (625), Expect = 2.3e-64
Identity = 274/1002 (27.35%), Positives = 394/1002 (39.32%), Query Frame = 0

Query: 65   DSGRGGYQPQPPQHQELVSQYRTALAELTFNSKPIITNLTIIAGENLQAAKAISATVCAN 124
            D   GG +  PP   E+V  Y   L ELTFNSKPIIT+LTIIAGE  +  + I+  +C  
Sbjct: 49   DEFGGGEEVLPPSMDEIVQLYEVVLGELTFNSKPIITDLTIIAGEQREHGEGIANAICTR 108

Query: 125  ILEVSSEQKLPSLYLLDSIVKNIGRDYIKYFAARLPEVFCKAYRQVDPSVHPSMRHLFGT 184
            ILE   EQKLPSLYLLDSIVKNIGRDY +YF++RLPEVFC AYRQ  PS+HPSMRHLFGT
Sbjct: 109  ILEAPVEQKLPSLYLLDSIVKNIGRDYGRYFSSRLPEVFCLAYRQAHPSLHPSMRHLFGT 168

Query: 185  WKGVFPPQTLQIIEKELGFMPSSSSSSGTITSKPDLQAQRPAHSIHVNPKYIER-QRLQQ 244
            W  VFPP  L+ I+ +L  + S+++ S    S+P     +P   IHVNPKY+ R +    
Sbjct: 169  WSSVFPPPVLRKIDMQLQ-LSSAANQSSVGASEP----SQPTRGIHVNPKYLRRLEPSAA 228

Query: 245  SGRVKGMTSDATGATTNVTQDVAQAKISTGRPWADAPIKVLDIHRPLRDAPNDMAQEKNI 304
               ++G+ S A        +   Q  +     + D     L+    L   P+   +  N 
Sbjct: 229  ENNLRGINSSA--------RVYGQNSLGGYNDFEDQ----LESPSSLSSTPDGFTRRSN- 288

Query: 305  TAAYADYEYGSDLPRTTGIGRRVVDEGRDKPWSSAGSNLAEKLSGQRNGFNIKLGYENYP 364
                 D    S+     G+GR    +     W     NL +    +R    I    + Y 
Sbjct: 289  -----DGANPSNQAFNYGMGRATSRDDEHMEWRRK-ENLGQGNDHERPRALI----DAYG 348

Query: 365  APKSANTGARLLPMQNFSSSSSNRVLSTNWKNSEEEEFMWGEMNSIMTGHGAPAIASSTG 424
               S +      P+++ +   S  V  T W+N+EEEEF W +M        +P +  S  
Sbjct: 349  VDTSKHVTIN-KPIRDMNGMHSKMV--TPWQNTEEEEFDWEDM--------SPTLDRSRA 408

Query: 425  KDQWTPEDSDNSGIENKPLSVRDTGASVDREASSDSQSSEQRELGDSGQQRSSMWQVQEP 484
             +           +  +P      G + D    SD ++    +L                
Sbjct: 409  GEFLRSSVPALGSVRARP----RVGNTSDFHLDSDIKNGVSHQL---------------- 468

Query: 485  ISLDGLRGGVPRKNSAESGGYGANSS-VDQMGGRP-QITSSNIG-ASGHGFLNKGGSGSI 544
                       R+N + S  Y   S+ VD   G+  ++ +S++G  S +         SI
Sbjct: 469  -----------RENWSLSQNYPHTSNRVDTRAGKDLKVLASSVGLVSSNSEFGAPPFDSI 528

Query: 545  GTVGHQRFPSRSVAFPSGQ-PPLHQRPPSLLLVDHVPHQMHDHKTSSLSNLDPRKRHMQD 604
              V + RF     A P G  P L  R P+ L V                           
Sbjct: 529  QDV-NSRF---GRALPDGTWPHLSARGPNSLPV--------------------------- 588

Query: 605  AALGLHPSVRPDNLQKSQPQDLQASASSVPASQPKHQFSLSESLKPDVTQSELSSQHAVS 664
             +  LH    P N   ++ Q         P  +P++Q          V+QS L+     +
Sbjct: 589  PSAHLHHLANPGNAMSNRLQ-------GKPLYRPENQ----------VSQSHLNDMTQQN 648

Query: 665  IPGTNFGPPSSAGTVPDLPAEILGEPSTSSLLAAVMKSGIFSNHSMTSSMQQNISFQDVG 724
                N+ P SSA                                     MQ  ++    G
Sbjct: 649  QMLVNYLPSSSA--------------------------------MAPRPMQSLLTHVSHG 708

Query: 725  NMPPRSSIKPPLPSRSSPAKTVGESSLGPQSLESPSVLVKLSQTKLEETSLPPDPLPPSS 784
              P  S+I+P L                  S++    +  LS   L +      P     
Sbjct: 709  YPPHGSTIRPSL------------------SIQGGEAMHPLSSGVLSQIGASNQP----- 768

Query: 785  PLNSASTETSNVVNDASSPISNLLSSLVAKGLISASKGELTNSATSQMPSQPESKLGDAV 844
                                S L+ SL+A+GLIS     L N    Q P           
Sbjct: 769  ---------------PGGAFSGLIGSLMAQGLIS-----LNNQPAGQGP----------- 797

Query: 845  TCSLPVPSIPVSSSSLSSMGLESPSKAAAKSSTSPPPSSTTEINNLIGFEFSSHVIRKFH 904
                                                          +G EF + +++  +
Sbjct: 829  ----------------------------------------------LGLEFDADMLKIRN 797

Query: 905  PSVISGLFDDIPYQCKICGLRLKLEEQLDTHSQWHTIST--EANNSNRAPRRWYPSSDDW 964
             S IS L+ D+P QC  CGLR K +E+   H  WH        N+     R+W+ S+  W
Sbjct: 889  ESAISALYGDLPRQCTTCGLRFKCQEEHSKHMDWHVTKNRMSKNHKQNPSRKWFVSASMW 797

Query: 965  VSGNARLLLDAATSL---DKSNMMEEDSEPMVPADEDQFACVLCGELFEDFYSQELGKWM 1024
            +SG   L  +A       + +   ++D +  VPADEDQ +C LCGE FEDFYS E  +WM
Sbjct: 949  LSGAEALGAEAVPGFLPTEPTTEKKDDEDMAVPADEDQTSCALCGEPFEDFYSDETEEWM 797

Query: 1025 FKGAMYITIPSAGSKVGSTNEQVARGPIVHTECLTESSVYDL 1057
            +KGA+Y+  P   +   +  ++   GPIVH +C  ES+  D+
Sbjct: 1009 YKGAVYMNAPEEST---TDMDKSQLGPIVHAKCRPESNGGDM 797

BLAST of Spg033175 vs. TAIR 10
Match: AT2G36485.1 (ENTH/VHS family protein )

HSP 1 Score: 132.9 bits (333), Expect = 1.7e-30
Identity = 80/143 (55.94%), Positives = 101/143 (70.63%), Query Frame = 0

Query: 3   MESSRRPFDRTREPG-LKKPRLADEAERGGNINGRPF-PQRPVVSGTNIVQP----RFRA 62
           ME+ RRPFDR+R+PG +KKPRL++E+ R  N N R F  QR + + T +  P    RFR 
Sbjct: 1   MENPRRPFDRSRDPGPMKKPRLSEESIRPVNSNARQFLSQRTLGTATAVTVPPASSRFRV 60

Query: 63  SDRDSGS---SDSGRGGYQPQPPQ-HQELVSQYRTALAELTFNSKPIITNLTIIAGENLQ 122
           S R++ S   SD  R  YQPQP   H ELV+QY++ALAELTFNSKPIITNLTIIAGEN+ 
Sbjct: 61  SGRETESSIVSDPSREAYQPQPVHPHYELVNQYKSALAELTFNSKPIITNLTIIAGENVH 120

Query: 123 AAKAISATVCANILEVSSEQKLP 136
           AAKA+   +C NILEV+++   P
Sbjct: 121 AAKAVVTAICNNILEVNTQFSCP 143

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
KAA0043917.1 | 0.0e+00 | 83.45 | polyadenylation and cleavage factor-like protein 4-like isoform X1 [Cucumis melo...
XP_038906013.1 | 0.0e+00 | 87.82 | polyadenylation and cleavage factor homolog 4 [Benincasa hispida]
XP_008442798.1 | 0.0e+00 | 86.44 | PREDICTED: polyadenylation and cleavage factor homolog 4-like isoform X1 [Cucumi...
XP_011651991.1 | 0.0e+00 | 86.07 | polyadenylation and cleavage factor homolog 4 isoform X1 [Cucumis sativus]
XP_008442799.1 | 0.0e+00 | 86.25 | PREDICTED: polyadenylation and cleavage factor homolog 4-like isoform X2 [Cucumi...

Match Name | E-value | Identity | Description
Q0WPF2 | 3.3e-63 | 27.35 | Polyadenylation and cleavage factor homolog 4 OS=Arabidopsis thaliana OX=3702 GN...
O94913 | 1.0e-19 | 36.88 | Pre-mRNA cleavage complex 2 protein Pcf11 OS=Homo sapiens OX=9606 GN=PCF11 PE=1 ...
Q9FIX8 | 3.2e-18 | 32.73 | Polyadenylation and cleavage factor homolog 5 OS=Arabidopsis thaliana OX=3702 GN...
Q9C710 | 5.5e-18 | 33.54 | Polyadenylation and cleavage factor homolog 1 OS=Arabidopsis thaliana OX=3702 GN...
Q10237 | 6.3e-14 | 45.10 | Uncharacterized protein C4G9.04c OS=Schizosaccharomyces pombe (strain 972 / ATCC...

Match Name | E-value | Identity | Description
A0A5A7TQ23 | 0.0e+00 | 83.45 | Polyadenylation and cleavage factor-like protein 4-like isoform X1 OS=Cucumis me...
A0A1S3B6K6 | 0.0e+00 | 86.44 | polyadenylation and cleavage factor homolog 4-like isoform X1 OS=Cucumis melo OX...
A0A1S3B794 | 0.0e+00 | 86.25 | polyadenylation and cleavage factor homolog 4-like isoform X2 OS=Cucumis melo OX...
A0A0A0LGI0 | 0.0e+00 | 85.89 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G750380 PE=4 SV=1
A0A5D3DPM4 | 0.0e+00 | 82.06 | Polyadenylation and cleavage factor-like protein 4-like isoform X2 OS=Cucumis me...

Match Name | E-value | Identity | Description
AT2G36480.2 | 4.9e-123 | 35.83 | ENTH/VHS family protein
AT2G36480.1 | 1.1e-122 | 35.97 | ENTH/VHS family protein
AT2G36480.3 | 1.1e-122 | 35.97 | ENTH/VHS family protein
AT4G04885.1 | 2.3e-64 | 27.35 | PCF11P-similar protein 4
AT2G36485.1 | 1.7e-30 | 55.94 | ENTH/VHS family protein
InterPro
Analysis Name: InterPro Annotations of Sponge gourd (cylindrica) v1
Date Performed: 2022-08-01
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR006569 | CID domain | SMART | SM00582 | 558neu5 | coord: 80..202; e-value: 9.8E-45; score: 164.6
IPR006569 | CID domain | PFAM | PF04818 | CID | coord: 87..195; e-value: 1.7E-13; score: 50.9
IPR006569 | CID domain | PROSITE | PS51391 | CID | coord: 77..205; score: 37.264122
IPR008942 | ENTH/VHS | GENE3D | 1.25.40.90 | - | coord: 72..202; e-value: 1.3E-42; score: 146.9
IPR008942 | ENTH/VHS | SUPERFAMILY | 48464 | ENTH/VHS domain | coord: 77..201
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 453..482
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 605..653
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 1..29
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 412..516
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 63..77
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 852..880
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 715..792
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 501..516
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 610..653
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 1..77
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 731..768
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 421..441
None | No IPR available | PANTHER | PTHR15921:SF14 | RNA POLYMERASE II-BINDING DOMAIN PROTEIN | coord: 3..1055
None | No IPR available | CDD | cd16982 | CID_Pcf11 | coord: 82..194; e-value: 6.51568E-55; score: 184.691
IPR045154 | Protein PCF11-like | PANTHER | PTHR15921 | PRE-MRNA CLEAVAGE COMPLEX II | coord: 3..1055
IPR013087 | Zinc finger C2H2-type | PROSITE | PS00028 | ZINC_FINGER_C2H2_1 | coord: 914..934
IPR013087 | Zinc finger C2H2-type | PROSITE | PS50157 | ZINC_FINGER_C2H2_2 | coord: 912..939; score: 8.745844
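
The InterPro rows above give each match as a `coord: start..end` range on the protein. Several member databases report the same CID domain (IPR006569) with slightly different boundaries. The sketch below parses those ranges, computes each span length, and finds the region on which all three CID matches agree. It assumes the coordinates are 1-based and inclusive, which is the usual InterPro convention but is not stated on this page; the function names are illustrative only.

```python
import re

COORD = re.compile(r"coord: (\d+)\.\.(\d+)")


def parse_coord(text):
    """Extract (start, end) from a string containing 'coord: start..end'."""
    match = COORD.search(text)
    if match is None:
        raise ValueError(f"no coordinate range in {text!r}")
    return int(match.group(1)), int(match.group(2))


def span_length(start, end):
    """Length of a range, assuming 1-based inclusive coordinates."""
    return end - start + 1


# CID domain (IPR006569) matches copied from the table above.
cid_hits = {
    "SMART": "coord: 80..202",
    "PFAM": "coord: 87..195",
    "PROSITE": "coord: 77..205",
}

spans = {source: parse_coord(text) for source, text in cid_hits.items()}
for source, (start, end) in spans.items():
    print(f"{source}: {start}..{end} ({span_length(start, end)} residues)")

# Region covered by every match: latest start to earliest end.
core = (max(s for s, _ in spans.values()), min(e for _, e in spans.values()))
print("shared core:", core, span_length(*core), "residues")
```

Taking the latest start and earliest end gives a valid shared region only when all ranges overlap, which holds for the three CID matches here.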

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Spg033175.1 | Spg033175.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0006379 | mRNA cleavage
biological_process | GO:0006378 | mRNA polyadenylation
biological_process | GO:0006369 | termination of RNA polymerase II transcription
cellular_component | GO:0005737 | cytoplasm
cellular_component | GO:0005849 | mRNA cleavage factor complex
molecular_function | GO:0003729 | mRNA binding
molecular_function | GO:0000993 | RNA polymerase II complex binding