Cp4.1LG20g05810 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
Name: Cp4.1LG20g05810
Type: gene
Organism: Cucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
Description: Pentatricopeptide repeat-containing protein
Location: Cp4.1LG20: 3548519 .. 3556890 (+)
RNA-Seq Expression: Cp4.1LG20g05810
Synteny: Cp4.1LG20g05810
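
The Location field packs the sequence ID, start, end and strand into one string. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual GFF convention, which this page does not state), the genomic span of the gene could be computed as below. `parse_location` is a hypothetical helper written for this illustration, not part of any database API.

```python
import re

def parse_location(text):
    """Split 'seqid: start .. end (strand)' into its parts (hypothetical helper)."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

seqid, start, end, strand = parse_location("Cp4.1LG20: 3548519 .. 3556890 (+)")

# Assumption: 1-based inclusive coordinates, so length = end - start + 1.
span = end - start + 1
print(seqid, strand, span)
```

Under that assumption the span covers exons and introns together, so it is expected to be longer than the mRNA and CDS listed under Sequences.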
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

TATTCCATAGAAGGAACTAGCGTTCGTCTCTCCGTCGAGCTCCGCCGGCGCCGATATCCGGAGAAGGTTTAGACAATTGCCGGACGAGGTGTAGACGACTGCCGGACAAGGTGTGAACGATTGCCGGACAAGGTGTAAACGATTGCCGGACAAGGTGTGAACGATTGCCGGACAAGGTGTGAACGATTGCCGGACAAGGTGGGAACGCTTCTATATAATTATAGAACGGAATTGCAGAGGTTAAGTATTATCTAAGTTTTCCACGGAGAATTAGGTCGTTGCAATATGATAAAATGCCCTTTGAGCCGCAGATAGAACAGGAACAGCCGGGAATAATGATGTAATCATATTGCAAGATTCAGGGAGAGATTGTTCCACCGTCATTCTGGAAAATACTCCAATTGCTGTCGTACAAAGAGCAGTTAGTTTTTTTCTTGCTTTTACCTTTGCGTTTTGATGTTTACTCCGTGTTTCTACTTGGTGCCTGAGAGTTCGATTGGAGTGTATTGGTATTATTCATGAACTAGGAGTAGTTCCAAAATGGTGGGAGTAATAATGGCGAATGCAAATTTGTGCATCCCTTGTTGTGAAGGAAATGGATTTCCGGCACTGCATTGTACCCAGAATTCCCATTATTTATTAGGGTTTTCGTTTTTTACTAGTTCGGTATCTGGAAGTGGCTTAAATTCTGGCAGTGCGAAGAGCAGAGTTTTAAGGCACAGGGGACATAAATGTGGAGCAATTAAGGCTTCATCAAAGGGAGAATCTGATATTCGATTGGCAAGTGGGAATCTCCTCGAAAACGATTTTCAGTTTAAGCCATCTTTCGATGAATATGTGAGGGTTATGGAGTCCGTTAGATCTAGAAGGTATAAGAGGCAGTCGGACGATCCTAATAAGATGAAGGAAAATGCGAGTGCAAAGAGCGCTGAAAGCACTTCCATTTCTAACATAGTGACTGATGTTCAAGGAAATATGGACGTAAAGAAAAAGGTTATATGTGTTGATCAGGAGGATTTGTTTGATAATTCAGAGAGAATTACACGTAAAATAGATTTGTCGGGAAATAAATTTGATAGCAAAAGGAAAGGGGTTACAAGATCAAAGGATGAGCTTAAAGGTAAGGTGACACCTTTTGACTCACAGGTAAATGATAAACAACATGTAGAGAAAAGGAATGGAAACTGGTCGAATTACATTGAGCCAAAAGTAACTAGGTCGAACCATGATAAACGACTTCATTTTAAGGCTAATACATTGGATGTGAAAAGTGAAAGCCACGGAGTACGTTATGGAAGTTCCATGAAAATATCGGAAAAGATTTGGGCTGATGATGACACTAAACGAACTAAGGATGTTCTGAAGGTTGGGAAGTATGGTGTTCAGCTCGAAGGAAACTATATTCCCGGTGACAAGGTTGGTAGAAAGAAAACTGAGCAGTCCTACAGAGGGTTATCCAAAAGTGGTAAGCAGTTTCATGAATTTACAGAAGAGAGTAGCTTAGAGGTCGAACATGCTGCCTTCAACAGTTGTGATGCAGAAGACATAATGGACAAACCAAGAGTTTCAAAGATGGAAATGGAAGAGAGAATCCAGATGCTTTCTAAGAGGTTTGCTGTCCCTTGCTCACTTCTCTTGCTGAAAGTTTAAAATTAACTTGATACAAGTTTGAGACCACAACTAATACCTTATTTGATATGCTTGCTTCGTTTTTTTGGTATCTATTATTTGTCCTCGTACTTGACTTATAAGCGAATTGTAATATTAAACTTTCAGTTCAATCCAATTAAACTTTGAACTTGAATAGTCTGTGAAATTAATGCTCTCAATTAGTTAAATCTCAATGGACGAGGTTGTTTTTGTCTGGTTTCTCTGGTTGTATTTTATAGTTTGGTCAGGATATTTTATTGTTAAGTTCAGGTGCTTTATTTCAGGGTTGACTTTAAGTATTTCTGTGGAATTTTTTGTGTTGTGCTGCCTGAATTTCAGGTAATTTATAG
GCAATAATCTAATTTAGTTTGAAGATTATAACTTCACCTCTTATTCAATTATGGATTCTTGTGTTCAATCTAAAAACTTAGAAGTCTACTAAGCTAAATATAGGAGAGGTAAAAATGAAAATTTCTCCTTTTATTCCTTCTCCTCCCCTTTTTCCATCCTTGTTTCAATAATATAATGGAGTATTCTCGTAGAACTTGCATTTGGTTATAACTCAATTCCATTGAGATAATATATGTTAATGCGTTCAAATGATGATCATGAACTGTTACTAAACTATTGTTTATATTTTTCTTTTCTCTCACACATTCACTGCTGGCTCTTGAGCTATTCATTATATGCAGACCTCTCTCAAAATCATCTCCAACATTTTTTATGATTTGAAATTAATTACTAATTCGCACTCGACTGTACTACGAGGGGATTCCATGAATACGAATAAAATATCAATTACTTGAATTTTGTGGATTATATCAATTGTAATCACATTTACTTTTCAATGATAATAACAAAGACTTAATGCTTCATTAGACTTTTCATATTAATTCATTTGAATTGATTTTTTTTTAACAGATTAAATGGTGCAGACATTGATATGCCTGAGTGGATGTTTGCTCAAATGATGAGGAGTGCAAAGATTAGATATTCAGATCATTCAATATTAAGGGTTATTCAAGTGTTGGGTAAGCTAGGAAATTGGAAACGAGTGCTTCAAGTCATCGAATGGCTTCAAATGCGTGAACGGTTCAAGTCACATAAGCTGAGGTGTTTCCCTATTTCTCACCTTTACTGCTTGATTTATGTAGTGAAAGTTCTTGCCATGATACTTCAAAAGAATAACAATGGCCTACAATGCCTTTCCTCTGTCTAGAACTGTAACTTAGATTTCTACTTGTTGGGCCTTCCATTAATTTTCAAGACCACAAGTGAGAGGGAGTGTTTGGGGCGGGCATGGGTGCTCTGAGTCTAGGAAAACAAAGCTCCAACTCCTGGTCAGAGATGCCCACAGGGCAGGGTGGGATGGGAAGCTTCCAATTTCAATCCCCATTCCAGTAAAATATGTCATTAATTTTTACGGGATTGGGTTCCCGTCATGAATTTTTTCGCATTATATATACATATATATACATACATATATGTATCTTAAAAAACATTTCATTTTAAATAATTTTTGGTTGATCAGTGATCCAACAAAAAGTCTTCATCCTAACCTTTCCTAAAGAGTAAGTAGTCCTGTACCCCTAAAAAAAAATATAATTAGAGTCTTAAAATGGAAAAGATATTCTTTCTTGCATTGCCCTGTCACCATGTATGTTCTCCAACTTCCACAGATTTATATACACCACTGCCCTTGATGTACTTGGAAAAGCGAGGAGACCTGTGGAGGCACTCAATGTATTCCATGCAATGCAGGTTGGCAGATAACTACCTTCATTAGGATTTGCACTCTTGGTCTGTGCATGTTAGTGTAAATAGACGTGGAAATTGATAGAATTAGACTAACCCTGGTTGGTTTATCTCCATTTCCCTTCAGGAACACTTTTCCTCATATCCTGACTTAGTAGCATACCATAGTATTGCTGTCACTCTTGGACAAGCAGGATACATGAGGGAACTCTTTGATGTGATTGATAGCATGCGGTCTCCTCCAAAGAAGAAGTTTAAAACAGGGGCACTTGAAAAGTGGGACCCACGGCTGCAACCTGATATAGTTATCTATAATGCGGTGAGTCATAATTGATAAGATTATTTTACCTTTACAGGTAATATTTGGTATGTAAAATATTGAGCGATTTGGTGTAAAAAATGTTTTAGGGCAATATAATCTTGTTATGAATATTTTCCTAGCATGACCAGTTGTTTTGCTCATGACATATTTACTTTTAGATAAGGAAATTCAATTTTGATAAGAATCTAGAGGATATTTACTTATCATGGTTCTATTATTGAGTAAAGATATAGATTTGTATACATGGTTGGGATTGACTTCCTTTTTCATTCT
TTTTTTTCCCTTTTGTATAAAAGCACCACTTTTGTTAAGATAAATGGAAGAAATATAAGAAGCGGCCATTGGAAAAAACTAGCCTCTACAACAAAAATGGGCAACTAAAAAAAGAGAAATATAGACTTAAGAACACCAATTTTTGACCGTCAAAAAAGGAGAAAAATCTCAGTAGTTGCAATCAGTTTTATGAGCATTCAGAGGTTTTCTCTTTTGGTAGGAAACTTCAACATTTTATGTTGTTAGTATGAAAATACAGAGTTAAATCTTCAAAGAATTGTTGATTTTCTTTATGTTTTCATTTTCCACCCAAAATATTCTTGCTATTTCCCTTTCCATCTATTTCATAGTATTTTAATTTACTTTGTCATTAAGGTAATATTCTCCATTGCACCCCCCTCCCTCCCAATTTGGTGCAATATATGTATTTCTTTTAGAAGTATATTCATAAGTGCTATGAAATGCTAATAAAAAACAAAAGGAATCGTAAATTCTGTGGAAATATAACTTGCATGTGTTTCTTTCGAGTTCTTCATTTATAGTTGTCTTTTTATTTCTTGCTTTCAGGTTCTAAATGCTTGTGTTAAGCGAAAAAATTGGGAAGGGGCATTTTGGGTCTTGCAGGAACTAAAGGAACAAGGTCTACAGCCTTCTACGACAACATATGGATTGGTCATGGAGGTGGTTGATTCTTTAGTTTCTTTCTATTGTTCATGTGCTTTGCAAGTCTACTTCAAAATTTATAATGATTTTTCTAATGCTTGAGATGCTGTGAATGAAAAATGCTTGATTTTGCGCACCCAATAAGGAATTTTCACTCATTAAGTTGCTTTGAATCCAGTCAGATGTTAGCATTAAAATTTTGTTCTTACCTTTTGATCAGTAGAAGTTAAATGGTTTAGGGTATTGCATGGACACAGAGTGGACATATTGGGTTGGGTTCTAGCTTTTGAAAGAGGATTTGTCCTGTTTGTAAAAGGTTACTGTGAAGTAAGTGCACCCTTGGACTTGGCCAAGGATGCATTTATGATTAGATTATGCCTTCACTTATCAGAAGCAACTTTTGAAGGATTAGTTGCGTCTATCCTTCATTTGGATATTAAGAAGCTATTTGGTTGAAGGTTGAGTTCGATGTGTACTTAAGTTCACAAGTTCATGATTTTTGAAAAAGTTTTTAGTTTATGCACCGTTAAGAATGAATTAAATTAACCCATTTCTAAAACCAGTTACCTTATTATTTTCACAAGTGTTTTTTTTTTCTTGATAAAACAAAAGGATTAAGTCGTCTTAAACTGCCCCTAGATTTTCAAATTAGTTAAAGACAAATAATTTGATCGGCCTGTACTTCCACTATTAAAATTTGAAAGTCATGCTACCACGACTTGAAGGATTTGGAAGCATTTGATGAAACTAGGCCACTACACTATCGTATGATTTTACATCTCATGACTTGCTTTTGATGGCTTATATGTATTTTGTGTTGTTTACTTCCAAACTTTTAACTTTGAAAATTGATATATTCTTGGTGATATTTTAGTATTTCCTGGTGTAATTTTGCTCGGCACTTCCGTCTGTATTCTGCTTCTTGTTGTAGTATCTTAACACATTTTAGTGGCTAAACTTAGTTTATTTGATTGTTCTGAATTTCCATTTAACATGCAAGTAAGCCTATAGTTGAAATATTCCATTTACTAAATGAAACATTTTCCAATGAATGATAGGTGATGCTTCAATGTGGCAAGTACAACTTAGTTCATGAGTTCTTCAGAAAAGTGCAGAAATCTTCAATTCCTAATGCTTTAACATATAAAGGTAGCCGCAGTGTGCTTATTTGTTTCCAGTTATATATTTGCTGAATGCTTTAGCTTGTCAAAATTCCAGTTCTTGTCAATACACTTTGGAAAGAAGGTAAAACAGATGAGGCTGTGCTGGCCATTCAGACCATGGAAAAACGAGGAATAGTTGGGTCTGCAGCTCTTTATTACGACTTTGCTCGTTGT
CTTTGCAGTGCTGGTAGGTGCAAAGAAGCCCTGATGCAGGTATTTCGTAGTAACATTTGTTGTTGCTTTCCGCCTTTTTTTTATAATTATTTTTTCTCTTCCATTTTTCAAAAAGGTTTCTTTTGGTTTTTCCTTATGTGGATTTTTTTGTATAATATTCTTATTGATTATTCCCATGAGTTGGTTCAAAATGATTTCCTTGGCGTTTGAGCTCTCATCTCCTAAGAATGGAGAAGCGGTCTATTATCTAAGGAAATTAGTTGACTTGACTCAATGGTGAGCTAAACGATGTAAGGATTCGGTTTTATGTTTATCAATCCCCTCTTTTTGTTCAAGGAATATTGAGGTTAGCTTATAACCGTATCTTAAAAATGTCAAAACCTTTGAGGCTGCTGAGGATCCCAAATCAAAAAAAATAAATAAATAAAAAGATTTCATAATTTTTGTTAGATACATGAGTTACACCTCTCATTGCCAATTGGTTTTGAGATGGAATCCCATGTTACTTAATTCAATATTAGTCATAAAACTTAAATGGGTATTTGGTCCAAAAAAAAGGAAAAAGGATCCGATCCAAGAATGGTGAACCCAAAGAGGCACCATCTTGAGGTAGTATGTTGAGGATCCTACATCGAAAAGATGACGAGGCCTCATAATCGTTATAAGACACATGGATTACGCCTCTAATTGCCAATTGGTTTTGAGATGGAACTCCCATGTTTATCTAATAGAGCCTGCCTTAATGCTGTCTACAATTTTATGATTGCTCTGTATTGTCAAATTGTAATGCTTTGCCTTCAGACCTCTCCTTCTCGTGAATTGCTGGTACCTTAATATTTTTGGCCTCTCCATGGGGAGGTCAATTTTACTTCTTTATAGCTCTAATTGTTTGACTACACTTCCTGATTGCAGTATAGTTAGACGAATTATAGTTCGTGTATAACTTAGGTACCTTACAATGACCTTGCTATGTTTACAGATGGAGAAGATATGTAAAGTTGCTAATAAGCCTCTTGTAGTGACTTACACCGGTTTGATTCAAGCTTGTTTGGACTCAAAAAACTTACAAAGTGCAGTCTATATATTCAACCACATGAAGGCCTTTTGCTCCCCCAATCTTGTTACTTGTAATATACTGTTGAAAGGTTACTTGGACCATGGGATGTTCGACGAAGCTAAAGAGCTGTTTCAGAATATGTCAGAAAATGGACGAAATATCAGCGCTGTATCTGACTATAGGGATCGAGTATTACCAGATATCTACACATTCAACACCATGCTAGATGCATCCTTTGCAGAAAAGAGATGGGATGATTTCAGCCATTTCTATAACCAGATGCTTCTTTATGGGTATCACTTCAACCCAAAACGTCATCTGCGGATGATAATGGAGGCTGCTAGGGGTGGAAAGGTGGAATGTTTAAATACAACTCGTGTTTTCCTTGTTTCTCCTTCCTTTATAAGATGGCTATAGGAAAGTTGCTCATAATTGATGATATTTTTTAAATATCTGCTGATTAAGTCGTATGTTGGCCGCTTCTACTGAAGACGTATGTATAACATCGGTTTCACATTTGTGTCTGGAAGTCGAGTTCTGCTGCTGTTCGTTAAACCAGTGATTATACTGAGAATTGAAACTATATTTCTCTATGATCTGTGGATAATTTTGTGTAATTTCCTGCACTACCAAGGGAGTAAACTAGTGCAAGTTGGTCGTTTTAATTTGTTACGTGTTTGCTCTCAATTAATCAAGGACTTTTGTGCAACAGGATGAGCTACTGGAAACAACATGGAAGCACTTAGCTCAGGCTGACCGGACACTGCCACCACCGCTCATCAAAGAAAGGTTTTGCATCATGCTGGCTAGAGGTGACTACTCTGAAGCTCTCTCTTGCATTTCTAAACACCATAGTAGCGATGAACATCATTTCTCTAAGTCTGCTTGGCTAAATTTACTGAAAGAGAAAAGGTTTCCCAAGGATAGTGTTATTGAGTTAATT
CATAAGGTTAGTATGCTTCTTGCTAGAAATGACTCACCAAATCCAGTGCTTCAGAATCTGTTATTGAGTGGTAAAGAATTTTGCAGAAGTAGAATTAGTGTAGCTGACCCTAGACTTGAAGAAGTTGTTTGTACAAATGAATTCCAATCTGCTGCTGTCATGCATGTTTAGCATAATTTGAGAGGAAATAATGTTCTTTGGTTCATTCCCTTGTTCTTAGGTTATGTATTATATAAAGGAACTAGAAAATGAAAATCATTATTCCTATAACTTCTTGATTAAGGAATAGAAATTTCGAAAGGATCGATCAACTTTAGTGTGATGTTGGTCGAAAAGACTTTAATCATCTGGTTGCGAGGTGATGGAAAAGAA

mRNA sequence

TATTCCATAGAAGGAACTAGCGTTCGTCTCTCCGTCGAGCTCCGCCGGCGCCGATATCCGGAGAAGGTTTAGACAATTGCCGGACGAGGTGTAGACGACTGCCGGACAAGGTGTGAACGATTGCCGGACAAGGTGTAAACGATTGCCGGACAAGGTGTGAACGATTGCCGGACAAGGTGTGAACGATTGCCGGACAAGGTGGGAACGCTTCTATATAATTATAGAACGGAATTGCAGAGGTTAAGTATTATCTAAGTTTTCCACGGAGAATTAGGTCGTTGCAATATGATAAAATGCCCTTTGAGCCGCAGATAGAACAGGAACAGCCGGGAATAATGATGTAATCATATTGCAAGATTCAGGGAGAGATTGTTCCACCGTCATTCTGGAAAATACTCCAATTGCTGTCGTACAAAGAGCAGTTAGTTTTTTTCTTGCTTTTACCTTTGCGTTTTGATGTTTACTCCGTGTTTCTACTTGGTGCCTGAGAGTTCGATTGGAGTGTATTGGTATTATTCATGAACTAGGAGTAGTTCCAAAATGGTGGGAGTAATAATGGCGAATGCAAATTTGTGCATCCCTTGTTGTGAAGGAAATGGATTTCCGGCACTGCATTGTACCCAGAATTCCCATTATTTATTAGGGTTTTCGTTTTTTACTAGTTCGGTATCTGGAAGTGGCTTAAATTCTGGCAGTGCGAAGAGCAGAGTTTTAAGGCACAGGGGACATAAATGTGGAGCAATTAAGGCTTCATCAAAGGGAGAATCTGATATTCGATTGGCAAGTGGGAATCTCCTCGAAAACGATTTTCAGTTTAAGCCATCTTTCGATGAATATGTGAGGGTTATGGAGTCCGTTAGATCTAGAAGGTATAAGAGGCAGTCGGACGATCCTAATAAGATGAAGGAAAATGCGAGTGCAAAGAGCGCTGAAAGCACTTCCATTTCTAACATAGTGACTGATGTTCAAGGAAATATGGACGTAAAGAAAAAGGTTATATGTGTTGATCAGGAGGATTTGTTTGATAATTCAGAGAGAATTACACGTAAAATAGATTTGTCGGGAAATAAATTTGATAGCAAAAGGAAAGGGGTTACAAGATCAAAGGATGAGCTTAAAGGTAAGGTGACACCTTTTGACTCACAGGTAAATGATAAACAACATGTAGAGAAAAGGAATGGAAACTGGTCGAATTACATTGAGCCAAAAGTAACTAGGTCGAACCATGATAAACGACTTCATTTTAAGGCTAATACATTGGATGTGAAAAGTGAAAGCCACGGAGTACGTTATGGAAGTTCCATGAAAATATCGGAAAAGATTTGGGCTGATGATGACACTAAACGAACTAAGGATGTTCTGAAGGTTGGGAAGTATGGTGTTCAGCTCGAAGGAAACTATATTCCCGGTGACAAGGTTGGTAGAAAGAAAACTGAGCAGTCCTACAGAGGGTTATCCAAAAGTGGTAAGCAGTTTCATGAATTTACAGAAGAGAGTAGCTTAGAGGTCGAACATGCTGCCTTCAACAGTTGTGATGCAGAAGACATAATGGACAAACCAAGAGTTTCAAAGATGGAAATGGAAGAGAGAATCCAGATGCTTTCTAAGAGATTAAATGGTGCAGACATTGATATGCCTGAGTGGATGTTTGCTCAAATGATGAGGAGTGCAAAGATTAGATATTCAGATCATTCAATATTAAGGGTTATTCAAGTGTTGGGTAAGCTAGGAAATTGGAAACGAGTGCTTCAAGTCATCGAATGGCTTCAAATGCGTGAACGGTTCAAGTCACATAAGCTGAGATTTATATACACCACTGCCCTTGATGTACTTGGAAAAGCGAGGAGACCTGTGGAGGCACTCAATGTATTCCATGCAATGCAGGAACACTTTTCCTCATATCCTGACTTAGTAGCATACCATAGTATTGCTGTCACTCTTGGACAAGCAGGATACATGAGGGAACTCTTTGATGTGATTGATAGCATGCGGTCTCCTCC
AAAGAAGAAGTTTAAAACAGGGGCACTTGAAAAGTGGGACCCACGGCTGCAACCTGATATAGTTATCTATAATGCGGTTCTAAATGCTTGTGTTAAGCGAAAAAATTGGGAAGGGGCATTTTGGGTCTTGCAGGAACTAAAGGAACAAGGTCTACAGCCTTCTACGACAACATATGGATTGGTCATGGAGGTGATGCTTCAATGTGGCAAGTACAACTTAGTTCATGAGTTCTTCAGAAAAGTGCAGAAATCTTCAATTCCTAATGCTTTAACATATAAAGTTCTTGTCAATACACTTTGGAAAGAAGGTAAAACAGATGAGGCTGTGCTGGCCATTCAGACCATGGAAAAACGAGGAATAGTTGGGTCTGCAGCTCTTTATTACGACTTTGCTCGTTGTCTTTGCAGTGCTGGTAGGTGCAAAGAAGCCCTGATGCAGATGGAGAAGATATGTAAAGTTGCTAATAAGCCTCTTGTAGTGACTTACACCGGTTTGATTCAAGCTTGTTTGGACTCAAAAAACTTACAAAGTGCAGTCTATATATTCAACCACATGAAGGCCTTTTGCTCCCCCAATCTTGTTACTTGTAATATACTGTTGAAAGGTTACTTGGACCATGGGATGTTCGACGAAGCTAAAGAGCTGTTTCAGAATATGTCAGAAAATGGACGAAATATCAGCGCTGTATCTGACTATAGGGATCGAGTATTACCAGATATCTACACATTCAACACCATGCTAGATGCATCCTTTGCAGAAAAGAGATGGGATGATTTCAGCCATTTCTATAACCAGATGCTTCTTTATGGGTATCACTTCAACCCAAAACGTCATCTGCGGATGATAATGGAGGCTGCTAGGGGTGGAAAGGATGAGCTACTGGAAACAACATGGAAGCACTTAGCTCAGGCTGACCGGACACTGCCACCACCGCTCATCAAAGAAAGGTTTTGCATCATGCTGGCTAGAGGTGACTACTCTGAAGCTCTCTCTTGCATTTCTAAACACCATAGTAGCGATGAACATCATTTCTCTAAGTCTGCTTGGCTAAATTTACTGAAAGAGAAAAGGTTTCCCAAGGATAGTGTTATTGAGTTAATTCATAAGGTTAGTATGCTTCTTGCTAGAAATGACTCACCAAATCCAGTGCTTCAGAATCTGTTATTGAGTGGTAAAGAATTTTGCAGAAGTAGAATTAGTGTAGCTGACCCTAGACTTGAAGAAGTTGTTTGTACAAATGAATTCCAATCTGCTGCTGTCATGCATGTTTAGCATAATTTGAGAGGAAATAATGTTCTTTGGTTCATTCCCTTGTTCTTAGGTTATGTATTATATAAAGGAACTAGAAAATGAAAATCATTATTCCTATAACTTCTTGATTAAGGAATAGAAATTTCGAAAGGATCGATCAACTTTAGTGTGATGTTGGTCGAAAAGACTTTAATCATCTGGTTGCGAGGTGATGGAAAAGAA

Coding sequence (CDS)

ATGGTGGGAGTAATAATGGCGAATGCAAATTTGTGCATCCCTTGTTGTGAAGGAAATGGATTTCCGGCACTGCATTGTACCCAGAATTCCCATTATTTATTAGGGTTTTCGTTTTTTACTAGTTCGGTATCTGGAAGTGGCTTAAATTCTGGCAGTGCGAAGAGCAGAGTTTTAAGGCACAGGGGACATAAATGTGGAGCAATTAAGGCTTCATCAAAGGGAGAATCTGATATTCGATTGGCAAGTGGGAATCTCCTCGAAAACGATTTTCAGTTTAAGCCATCTTTCGATGAATATGTGAGGGTTATGGAGTCCGTTAGATCTAGAAGGTATAAGAGGCAGTCGGACGATCCTAATAAGATGAAGGAAAATGCGAGTGCAAAGAGCGCTGAAAGCACTTCCATTTCTAACATAGTGACTGATGTTCAAGGAAATATGGACGTAAAGAAAAAGGTTATATGTGTTGATCAGGAGGATTTGTTTGATAATTCAGAGAGAATTACACGTAAAATAGATTTGTCGGGAAATAAATTTGATAGCAAAAGGAAAGGGGTTACAAGATCAAAGGATGAGCTTAAAGGTAAGGTGACACCTTTTGACTCACAGGTAAATGATAAACAACATGTAGAGAAAAGGAATGGAAACTGGTCGAATTACATTGAGCCAAAAGTAACTAGGTCGAACCATGATAAACGACTTCATTTTAAGGCTAATACATTGGATGTGAAAAGTGAAAGCCACGGAGTACGTTATGGAAGTTCCATGAAAATATCGGAAAAGATTTGGGCTGATGATGACACTAAACGAACTAAGGATGTTCTGAAGGTTGGGAAGTATGGTGTTCAGCTCGAAGGAAACTATATTCCCGGTGACAAGGTTGGTAGAAAGAAAACTGAGCAGTCCTACAGAGGGTTATCCAAAAGTGGTAAGCAGTTTCATGAATTTACAGAAGAGAGTAGCTTAGAGGTCGAACATGCTGCCTTCAACAGTTGTGATGCAGAAGACATAATGGACAAACCAAGAGTTTCAAAGATGGAAATGGAAGAGAGAATCCAGATGCTTTCTAAGAGATTAAATGGTGCAGACATTGATATGCCTGAGTGGATGTTTGCTCAAATGATGAGGAGTGCAAAGATTAGATATTCAGATCATTCAATATTAAGGGTTATTCAAGTGTTGGGTAAGCTAGGAAATTGGAAACGAGTGCTTCAAGTCATCGAATGGCTTCAAATGCGTGAACGGTTCAAGTCACATAAGCTGAGATTTATATACACCACTGCCCTTGATGTACTTGGAAAAGCGAGGAGACCTGTGGAGGCACTCAATGTATTCCATGCAATGCAGGAACACTTTTCCTCATATCCTGACTTAGTAGCATACCATAGTATTGCTGTCACTCTTGGACAAGCAGGATACATGAGGGAACTCTTTGATGTGATTGATAGCATGCGGTCTCCTCCAAAGAAGAAGTTTAAAACAGGGGCACTTGAAAAGTGGGACCCACGGCTGCAACCTGATATAGTTATCTATAATGCGGTTCTAAATGCTTGTGTTAAGCGAAAAAATTGGGAAGGGGCATTTTGGGTCTTGCAGGAACTAAAGGAACAAGGTCTACAGCCTTCTACGACAACATATGGATTGGTCATGGAGGTGATGCTTCAATGTGGCAAGTACAACTTAGTTCATGAGTTCTTCAGAAAAGTGCAGAAATCTTCAATTCCTAATGCTTTAACATATAAAGTTCTTGTCAATACACTTTGGAAAGAAGGTAAAACAGATGAGGCTGTGCTGGCCATTCAGACCATGGAAAAACGAGGAATAGTTGGGTCTGCAGCTCTTTATTACGACTTTGCTCGTTGTCTTTGCAGTGCTGGTAGGTGCAAAGAAGCCCTGATGCAGATGGAGAAGATATGTAAAGTTGCTAATAAGCCTCTTGTAGTGACTTACACCGGTTTGATTCAAGCTTGTTTGGACTCAAAAAACTTACAAAGTGCAGTCTA
TATATTCAACCACATGAAGGCCTTTTGCTCCCCCAATCTTGTTACTTGTAATATACTGTTGAAAGGTTACTTGGACCATGGGATGTTCGACGAAGCTAAAGAGCTGTTTCAGAATATGTCAGAAAATGGACGAAATATCAGCGCTGTATCTGACTATAGGGATCGAGTATTACCAGATATCTACACATTCAACACCATGCTAGATGCATCCTTTGCAGAAAAGAGATGGGATGATTTCAGCCATTTCTATAACCAGATGCTTCTTTATGGGTATCACTTCAACCCAAAACGTCATCTGCGGATGATAATGGAGGCTGCTAGGGGTGGAAAGGATGAGCTACTGGAAACAACATGGAAGCACTTAGCTCAGGCTGACCGGACACTGCCACCACCGCTCATCAAAGAAAGGTTTTGCATCATGCTGGCTAGAGGTGACTACTCTGAAGCTCTCTCTTGCATTTCTAAACACCATAGTAGCGATGAACATCATTTCTCTAAGTCTGCTTGGCTAAATTTACTGAAAGAGAAAAGGTTTCCCAAGGATAGTGTTATTGAGTTAATTCATAAGGTTAGTATGCTTCTTGCTAGAAATGACTCACCAAATCCAGTGCTTCAGAATCTGTTATTGAGTGGTAAAGAATTTTGCAGAAGTAGAATTAGTGTAGCTGACCCTAGACTTGAAGAAGTTGTTTGTACAAATGAATTCCAATCTGCTGCTGTCATGCATGTTTAG

Protein sequence

MVGVIMANANLCIPCCEGNGFPALHCTQNSHYLLGFSFFTSSVSGSGLNSGSAKSRVLRHRGHKCGAIKASSKGESDIRLASGNLLENDFQFKPSFDEYVRVMESVRSRRYKRQSDDPNKMKENASAKSAESTSISNIVTDVQGNMDVKKKVICVDQEDLFDNSERITRKIDLSGNKFDSKRKGVTRSKDELKGKVTPFDSQVNDKQHVEKRNGNWSNYIEPKVTRSNHDKRLHFKANTLDVKSESHGVRYGSSMKISEKIWADDDTKRTKDVLKVGKYGVQLEGNYIPGDKVGRKKTEQSYRGLSKSGKQFHEFTEESSLEVEHAAFNSCDAEDIMDKPRVSKMEMEERIQMLSKRLNGADIDMPEWMFAQMMRSAKIRYSDHSILRVIQVLGKLGNWKRVLQVIEWLQMRERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQEHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKNWEGAFWVLQELKEQGLQPSTTTYGLVMEVMLQCGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKEGKTDEAVLAIQTMEKRGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTYTGLIQACLDSKNLQSAVYIFNHMKAFCSPNLVTCNILLKGYLDHGMFDEAKELFQNMSENGRNISAVSDYRDRVLPDIYTFNTMLDASFAEKRWDDFSHFYNQMLLYGYHFNPKRHLRMIMEAARGGKDELLETTWKHLAQADRTLPPPLIKERFCIMLARGDYSEALSCISKHHSSDEHHFSKSAWLNLLKEKRFPKDSVIELIHKVSMLLARNDSPNPVLQNLLLSGKEFCRSRISVADPRLEEVVCTNEFQSAAVMHV
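
The protein sequence should follow from the CDS by codon-wise translation. The sketch below assumes the standard nuclear genetic code (the page does not name a translation table) and checks only the first and last few codons of the CDS above. `translate` is a hypothetical helper, not a library function.

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering (assumed here).
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGGTGGGAGTAATAATGGCGAATGCAAAT"   # first 10 codons of the CDS
cds_end = "GCTGCTGTCATGCATGTTTAG"              # last 7 codons, including the stop

print(translate(cds_start))  # compare with the start of the protein sequence
print(translate(cds_end))    # compare with the end of the protein sequence
```

Running the same function over the full CDS is a quick consistency check when sequences are copied out of a page like this one.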
Homology
BLAST of Cp4.1LG20g05810 vs. ExPASy Swiss-Prot
Match: Q9SA76 (Pentatricopeptide repeat-containing protein At1g30610, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=EMB2279 PE=3 SV=1)

HSP 1 Score: 728.4 bits (1879), Expect = 1.0e-208
Identity = 414/861 (48.08%), Positives = 561/861 (65.16%), Query Frame = 0

Query: 62   GHKCGAIKASSKGESDIRLASGNLLENDFQFKPSFDEYVRVMESVRSRRYKRQSDDPNKM 121
            G    A+K S  GES + +      +  F+ + S  EY R  ++ R      + D+ + +
Sbjct: 172  GESSVALKLSKSGESSVTVPE----DESFRKRYSKQEYHRSSDTSRGIERGSRGDELDLV 231

Query: 122  KENASAKSAESTSISNIVTDVQGNMDVKKKVICVDQEDLFDNSERITRKIDLSGNKFDSK 181
                     E   +  I  D + +   ++  + V   +  ++S  +T   D S  +  SK
Sbjct: 232  --------VEERRVQRIAKDARWSKS-RESSVAVKWSNSGESS--VTMPKDESFRRRYSK 291

Query: 182  RKGVTRSKDELKGKVTPFDSQVNDKQHVEKRNGNWSNYIEPKVTRSNHDKRLHFKANTLD 241
            ++   RS D  +G      S+ ++ + V +         E +V R   D R      +L 
Sbjct: 292  QEH-HRSSDTSRGIAR--GSKGDELELVVE---------ERRVQRIAKDVRWSKSDESLV 351

Query: 242  VKSESHGVRYGSSMKISEKIWADDDTKRTKDVLKVGKYGVQLEGNYIPGDKVGRKKTE-- 301
              SE    R G+  +   +     DT R  +    G  G+ L       +++  ++ E  
Sbjct: 352  PVSEDESFRRGNPKQEMVRYQRVSDTSRGIERGSKGD-GLDLLAEERRIERLANERHEIR 411

Query: 302  -QSYRGLSKSGKQFHEFTEESSLEVEHAAFN-SCDAEDIMDKPRVSKMEMEERIQMLSKR 361
                 G  + G + ++  ++S   +E  AF  S ++ DI+DKP  S++EME+RI+ L+K 
Sbjct: 412  SSKLSGTRRIGAKRNDDDDDSLFAMETPAFRFSDESSDIVDKPATSRVEMEDRIEKLAKV 471

Query: 362  LNGADIDMPEWMFAQMMRSAKIRYSDHSILRVIQVLGKLGNWKRVLQVIEWLQMRERFKS 421
            LNGADI+MPEW F++ +RSAKIRY+D++++R+I  LGKLGNW+RVLQVIEWLQ ++R+KS
Sbjct: 472  LNGADINMPEWQFSKAIRSAKIRYTDYTVMRLIHFLGKLGNWRRVLQVIEWLQRQDRYKS 531

Query: 422  HKLRFIYTTALDVLGKARRPVEALNVFHAMQEHFSSYPDLVAYHSIAVTLGQAGYMRELF 481
            +K+R IYTTAL+VLGK+RRPVEALNVFHAM    SSYPD+VAY SIAVTLGQAG+++ELF
Sbjct: 532  NKIRIIYTTALNVLGKSRRPVEALNVFHAMLLQISSYPDMVAYRSIAVTLGQAGHIKELF 591

Query: 482  DVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKNWEGAFWVLQELKEQG 541
             VID+MRSPPKKKFK   LEKWDPRL+PD+V+YNAVLNACV+RK WEGAFWVLQ+LK++G
Sbjct: 592  YVIDTMRSPPKKKFKPTTLEKWDPRLEPDVVVYNAVLNACVQRKQWEGAFWVLQQLKQRG 651

Query: 542  LQPSTTTYGLVMEVMLQCGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKEGKTDEAVL 601
             +PS  TYGL+MEVML C KYNLVHEFFRK+QKSSIPNAL Y+VLVNTLWKEGK+DEAV 
Sbjct: 652  QKPSPVTYGLIMEVMLACEKYNLVHEFFRKMQKSSIPNALAYRVLVNTLWKEGKSDEAVH 711

Query: 602  AIQTMEKRGIVGSAALYYDFARCLCSAGRCKEAL-------------------------- 661
             ++ ME RGIVGSAALYYD ARCLCSAGRC E L                          
Sbjct: 712  TVEDMESRGIVGSAALYYDLARCLCSAGRCNEGLNMVNFVNPVVLKLIENLIYKADLVHT 771

Query: 662  --MQMEKICKVANKPLVVTYTGLIQACLDSKNLQSAVYIFNHMKAFCSPNLVTCNILLKG 721
               Q++KIC+VANKPLVVTYTGLIQAC+DS N+++A YIF+ MK  CSPNLVTCNI+LK 
Sbjct: 772  IQFQLKKICRVANKPLVVTYTGLIQACVDSGNIKNAAYIFDQMKKVCSPNLVTCNIMLKA 831

Query: 722  YLDHGMFDEAKELFQNMSENGRNISAVSDYRDRVLPDIYTFNTMLDASFAEKRWDDFSHF 781
            YL  G+F+EA+ELFQ MSE+G +I   SD+  RVLPD YTFNTMLD    +++WDDF + 
Sbjct: 832  YLQGGLFEEARELFQKMSEDGNHIKNSSDFESRVLPDTYTFNTMLDTCAEQEKWDDFGYA 891

Query: 782  YNQMLLYGYHFNPKRHLRMIMEAARGGKDELLETTWKHLAQADRTLPPPLIKERFCIMLA 841
            Y +ML +GYHFN KRHLRM++EA+R GK+E++E TW+H+ +++R  P PLIKERF   L 
Sbjct: 892  YREMLRHGYHFNAKRHLRMVLEASRAGKEEVMEATWEHMRRSNRIPPSPLIKERFFRKLE 951

Query: 842  RGDYSEALSCIS----KHHSSDEHHFSKSAWLNLLKEKRFPKDSVIELIHKVSMLL-ARN 886
            +GD+  A+S ++    K   ++   FS SAW  +L   RF +DSV+ L+  V+  L +R+
Sbjct: 952  KGDHISAISSLADLNGKIEETELRAFSTSAWSRVL--SRFEQDSVLRLMDDVNRRLGSRS 1002
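
The percentages on each HSP summary line are simply the match counts divided by the alignment length. A minimal sketch, using the counts reported for the first Swiss-Prot hit (Q9SA76) and a hypothetical `hsp_stats` helper, shows how they could be recomputed from the text:

```python
import re

def hsp_stats(line):
    """Return {label: (matches, aligned_length)} from an HSP summary line."""
    return {label: (int(n), int(d))
            for label, n, d in re.findall(r"(\w+) = (\d+)/(\d+)", line)}

line = "Identity = 414/861 (48.08%), Positives = 561/861 (65.16%), Query Frame = 0"
stats = hsp_stats(line)

for label, (n, d) in stats.items():
    # Percentage = matches / alignment length, including gap columns.
    print(f"{label}: {100 * n / d:.2f}%")
```

The denominator is the alignment length (861 columns here), not the length of either protein, so the same pair of sequences can report different percentages for different HSPs.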

BLAST of Cp4.1LG20g05810 vs. ExPASy Swiss-Prot
Match: Q9FJW6 (Pentatricopeptide repeat-containing protein At5g67570, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=DG1 PE=1 SV=2)

HSP 1 Score: 410.2 bits (1053), Expect = 6.0e-113
Identity = 221/557 (39.68%), Positives = 339/557 (60.86%), Query Frame = 0

Query: 349 ERIQMLSKRLNGADIDMPEWMFAQMMRSAKIRYSDHSILRVIQVLGKLGNWKRVLQVIEW 408
           E +++L  RL+G +I+   W F +MM  + +++++  +L+++  LG+  +WK+   V+ W
Sbjct: 183 EAVRVLVDRLSGREINEKHWKFVRMMNQSGLQFTEDQMLKIVDRLGRKQSWKQASAVVHW 242

Query: 409 LQMRERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQEHFSSYPDLVAYHSIAVTLG 468
           +   ++ K  + RF+YT  L VLG ARRP EAL +F+ M      YPD+ AYH IAVTLG
Sbjct: 243 VYSDKKRKHLRSRFVYTKLLSVLGFARRPQEALQIFNQMLGDRQLYPDMAAYHCIAVTLG 302

Query: 469 QAGYMRELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKNWEGAFW 528
           QAG ++EL  VI+ MR  P K  K    + WDP L+PD+V+YNA+LNACV    W+   W
Sbjct: 303 QAGLLKELLKVIERMRQKPTKLTKNLRQKNWDPVLEPDLVVYNAILNACVPTLQWKAVSW 362

Query: 529 VLQELKEQGLQPSTTTYGLVMEVMLQCGKYNLVHEFFRKVQKS-SIPNALTYKVLVNTLW 588
           V  EL++ GL+P+  TYGL MEVML+ GK++ VH+FFRK++ S   P A+TYKVLV  LW
Sbjct: 363 VFVELRKNGLRPNGATYGLAMEVMLESGKFDRVHDFFRKMKSSGEAPKAITYKVLVRALW 422

Query: 589 KEGKTDEAVLAIQTMEKRGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVAN-KPLV 648
           +EGK +EAV A++ ME++G++G+ ++YY+ A CLC+ GR  +A++++ ++ ++ N +PL 
Sbjct: 423 REGKIEEAVEAVRDMEQKGVIGTGSVYYELACCLCNNGRWCDAMLEVGRMKRLENCRPLE 482

Query: 649 VTYTGLIQACLDSKNLQSAVYIFNHMKAFCSPNLVTCNILLKGYLDHGMFDEAKELFQNM 708
           +T+TGLI A L+  ++   + IF +MK  C PN+ T N++LK Y  + MF EAKELF+ +
Sbjct: 483 ITFTGLIAASLNGGHVDDCMAIFQYMKDKCDPNIGTANMMLKVYGRNDMFSEAKELFEEI 542

Query: 709 SENGRNISAVSDYRDRVLPDIYTFNTMLDASFAEKRWDDFSHFYNQMLLYGYHFNPKRHL 768
                    VS     ++P+ YT++ ML+AS    +W+ F H Y  M+L GY  +  +H 
Sbjct: 543 ---------VSRKETHLVPNEYTYSFMLEASARSLQWEYFEHVYQTMVLSGYQMDQTKHA 602

Query: 769 RMIMEAARGGKDELLETTWKHLAQADRTLPPPL-IKERFCIMLARGDYSEALSCISKHHS 828
            M++EA+R GK  LLE  +  + + D  +P PL   E  C   A+GD+  A++ I+   +
Sbjct: 603 SMLIEASRAGKWSLLEHAFDAVLE-DGEIPHPLFFTELLCHATAKGDFQRAITLINT-VA 662

Query: 829 SDEHHFSKSAWLNLLKEKR--FPKDSVIELIHKVSMLLARND-SPNPVLQNLLLSGKEFC 888
                 S+  W +L +E +    +D+    +HK+S  L   D    P + NL  S K  C
Sbjct: 663 LASFQISEEEWTDLFEEHQDWLTQDN----LHKLSDHLIECDYVSEPTVSNLSKSLKSRC 722

Query: 889 RSRISVADPRLEEVVCT 900
            S  S A P L   V T
Sbjct: 723 GSSSSSAQPLLAVDVTT 724

BLAST of Cp4.1LG20g05810 vs. ExPASy Swiss-Prot
Match: Q9LPX2 (Pentatricopeptide repeat-containing protein At1g12775, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At1g12775 PE=2 SV=1)

HSP 1 Score: 110.9 bits (276), Expect = 7.5e-23
Identity = 103/477 (21.59%), Positives = 198/477 (41.51%), Query Frame = 0

Query: 354 LSKRLNGADIDMPEWMFAQMMRSAKIRYSDHSILRVIQVLGKLGNWKRVLQVIEWLQMRE 413
           LS  L G   D    +F  M++S  +  +     R+   + K   ++ VL + +  QM  
Sbjct: 60  LSSGLVGIKADDAVDLFRDMIQSRPLP-TVIDFNRLFSAIAKTKQYELVLALCK--QMES 119

Query: 414 RFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQEHFSSYPDLVAYHSIAVTLGQAGYM 473
           +  +H + +  +  ++   + R+   A +    + +     PD V ++++   L     +
Sbjct: 120 KGIAHSI-YTLSIMINCFCRCRKLSYAFSTMGKIMK-LGYEPDTVIFNTLLNGLCLECRV 179

Query: 474 RELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKNWEGAFWVLQEL 533
            E  +++D M     K               P ++  N ++N          A  ++  +
Sbjct: 180 SEALELVDRMVEMGHK---------------PTLITLNTLVNGLCLNGKVSDAVVLIDRM 239

Query: 534 KEQGLQPSTTTYGLVMEVMLQCGKYNLVHEFFRKVQKSSIP-NALTYKVLVNTLWKEGKT 593
            E G QP+  TYG V+ VM + G+  L  E  RK+++ +I  +A+ Y ++++ L K+G  
Sbjct: 240 VETGFQPNEVTYGPVLNVMCKSGQTALAMELLRKMEERNIKLDAVKYSIIIDGLCKDGSL 299

Query: 594 DEAVLAIQTMEKRGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTYTGL 653
           D A      ME +G       Y       C+AGR  +    +  + K    P VVT++ L
Sbjct: 300 DNAFNLFNEMEIKGFKADIITYNTLIGGFCNAGRWDDGAKLLRDMIKRKISPNVVTFSVL 359

Query: 654 IQACLDSKNLQSAVYIFNH-MKAFCSPNLVTCNILLKGYLDHGMFDEAKELFQNMSENGR 713
           I + +    L+ A  +    M+   +PN +T N L+ G+      +EA ++   M   G 
Sbjct: 360 IDSFVKEGKLREADQLLKEMMQRGIAPNTITYNSLIDGFCKENRLEEAIQMVDLMISKGC 419

Query: 714 NISAVSDYRDRVLPDIYTFNTMLDASFAEKRWDDFSHFYNQMLLYGYHFNPKRHLRMIME 773
           +            PDI TFN +++      R DD    + +M L G   N   +  ++  
Sbjct: 420 D------------PDIMTFNILINGYCKANRIDDGLELFREMSLRGVIANTVTYNTLVQG 479

Query: 774 AARGGKDELLETTWKHLAQADRTLPPPLIKERFCI--MLARGDYSEALSCISKHHSS 827
             + GK E+ +  ++ +    R + P ++  +  +  +   G+  +AL    K   S
Sbjct: 480 FCQSGKLEVAKKLFQEM--VSRRVRPDIVSYKILLDGLCDNGELEKALEIFGKIEKS 502

BLAST of Cp4.1LG20g05810 vs. ExPASy Swiss-Prot
Match: Q0WPZ6 (Pentatricopeptide repeat-containing protein At2g17140 OS=Arabidopsis thaliana OX=3702 GN=At2g17140 PE=2 SV=1)

HSP 1 Score: 107.8 bits (268), Expect = 6.4e-22
Identity = 58/215 (26.98%), Positives = 107/215 (49.77%), Query Frame = 0

Query: 501 PRLQPDIVIYNAVLNACVKRKNWEGAFWVLQELKEQGLQPSTTTYGLVMEVMLQCGKYNL 560
           P  +P + +YN +L +C+K +  E   W+ +++   G+ P T T+ L++  +      + 
Sbjct: 106 PENKPSVYLYNLLLESCIKERRVEFVSWLYKDMVLCGIAPQTYTFNLLIRALCDSSCVDA 165

Query: 561 VHEFFRKV-QKSSIPNALTYKVLVNTLWKEGKTDEAVLAIQTMEKRGIVGSAALYYDFAR 620
             E F ++ +K   PN  T+ +LV    K G TD+ +  +  ME  G++ +  +Y     
Sbjct: 166 ARELFDEMPEKGCKPNEFTFGILVRGYCKAGLTDKGLELLNAMESFGVLPNKVIYNTIVS 225

Query: 621 CLCSAGRCKEALMQMEKICKVANKPLVVTYTGLIQACLDSKNLQSAVYIFNHMK-----A 680
             C  GR  ++   +EK+ +    P +VT+   I A      +  A  IF+ M+      
Sbjct: 226 SFCREGRNDDSEKMVEKMREEGLVPDIVTFNSRISALCKEGKVLDASRIFSDMELDEYLG 285

Query: 681 FCSPNLVTCNILLKGYLDHGMFDEAKELFQNMSEN 710
              PN +T N++LKG+   G+ ++AK LF+++ EN
Sbjct: 286 LPRPNSITYNLMLKGFCKVGLLEDAKTLFESIREN 320

BLAST of Cp4.1LG20g05810 vs. ExPASy Swiss-Prot
Match: Q76C99 (Protein Rf1, mitochondrial OS=Oryza sativa subsp. indica OX=39946 GN=Rf1 PE=2 SV=1)

HSP 1 Score: 107.1 bits (266), Expect = 1.1e-21
Identity = 79/321 (24.61%), Positives = 140/321 (43.61%), Query Frame = 0

Query: 424 YTTALDVLGKARRPVEALNVFHAMQEHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSM 483
           Y+  +D L K  R +EA  +F +M +     P++  Y ++       G + E+  ++D M
Sbjct: 304 YSLLMDYLCKNGRCMEARKIFDSMTKR-GLKPEITTYGTLLQGYATKGALVEMHGLLDLM 363

Query: 484 RSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKNWEGAFWVLQELKEQGLQPSTT 543
                   + G        + PD  +++ ++ A  K+   + A  V  ++++QGL P+  
Sbjct: 364 -------VRNG--------IHPDHYVFSILICAYAKQGKVDQAMLVFSKMRQQGLNPNAV 423

Query: 544 TYGLVMEVMLQCGKYNLVHEFFRK-VQKSSIPNALTYKVLVNTLWKEGKTDEAVLAIQTM 603
           TYG V+ ++ + G+      +F + + +   P  + Y  L++ L    K + A   I  M
Sbjct: 424 TYGAVIGILCKSGRVEDAMLYFEQMIDEGLSPGNIVYNSLIHGLCTCNKWERAEELILEM 483

Query: 604 EKRGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTYTGLIQA-CLDSKN 663
             RGI  +   +       C  GR  E+    E + ++  KP V+TY  LI   CL  K 
Sbjct: 484 LDRGICLNTIFFNSIIDSHCKEGRVIESEKLFELMVRIGVKPNVITYNTLINGYCLAGKM 543

Query: 664 LQSAVYIFNHMKAFCSPNLVTCNILLKGYLDHGMFDEAKELFQNMSENGRNISAVSDYRD 723
            ++   +   +     PN VT + L+ GY      ++A  LF+ M  +G           
Sbjct: 544 DEAMKLLSGMVSVGLKPNTVTYSTLINGYCKISRMEDALVLFKEMESSG----------- 596

Query: 724 RVLPDIYTFNTMLDASFAEKR 743
            V PDI T+N +L   F  +R
Sbjct: 604 -VSPDIITYNIILQGLFQTRR 596

BLAST of Cp4.1LG20g05810 vs. NCBI nr
Match: XP_023519692.1 (pentatricopeptide repeat-containing protein At1g30610, chloroplastic [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1822 bits (4719), Expect = 0.0
Identity = 910/910 (100.00%), Positives = 910/910 (100.00%), Query Frame = 0

Query: 1   MVGVIMANANLCIPCCEGNGFPALHCTQNSHYLLGFSFFTSSVSGSGLNSGSAKSRVLRH 60
           MVGVIMANANLCIPCCEGNGFPALHCTQNSHYLLGFSFFTSSVSGSGLNSGSAKSRVLRH
Sbjct: 1   MVGVIMANANLCIPCCEGNGFPALHCTQNSHYLLGFSFFTSSVSGSGLNSGSAKSRVLRH 60

Query: 61  RGHKCGAIKASSKGESDIRLASGNLLENDFQFKPSFDEYVRVMESVRSRRYKRQSDDPNK 120
           RGHKCGAIKASSKGESDIRLASGNLLENDFQFKPSFDEYVRVMESVRSRRYKRQSDDPNK
Sbjct: 61  RGHKCGAIKASSKGESDIRLASGNLLENDFQFKPSFDEYVRVMESVRSRRYKRQSDDPNK 120

Query: 121 MKENASAKSAESTSISNIVTDVQGNMDVKKKVICVDQEDLFDNSERITRKIDLSGNKFDS 180
           MKENASAKSAESTSISNIVTDVQGNMDVKKKVICVDQEDLFDNSERITRKIDLSGNKFDS
Sbjct: 121 MKENASAKSAESTSISNIVTDVQGNMDVKKKVICVDQEDLFDNSERITRKIDLSGNKFDS 180

Query: 181 KRKGVTRSKDELKGKVTPFDSQVNDKQHVEKRNGNWSNYIEPKVTRSNHDKRLHFKANTL 240
           KRKGVTRSKDELKGKVTPFDSQVNDKQHVEKRNGNWSNYIEPKVTRSNHDKRLHFKANTL
Sbjct: 181 KRKGVTRSKDELKGKVTPFDSQVNDKQHVEKRNGNWSNYIEPKVTRSNHDKRLHFKANTL 240

Query: 241 DVKSESHGVRYGSSMKISEKIWADDDTKRTKDVLKVGKYGVQLEGNYIPGDKVGRKKTEQ 300
           DVKSESHGVRYGSSMKISEKIWADDDTKRTKDVLKVGKYGVQLEGNYIPGDKVGRKKTEQ
Sbjct: 241 DVKSESHGVRYGSSMKISEKIWADDDTKRTKDVLKVGKYGVQLEGNYIPGDKVGRKKTEQ 300

Query: 301 SYRGLSKSGKQFHEFTEESSLEVEHAAFNSCDAEDIMDKPRVSKMEMEERIQMLSKRLNG 360
           SYRGLSKSGKQFHEFTEESSLEVEHAAFNSCDAEDIMDKPRVSKMEMEERIQMLSKRLNG
Sbjct: 301 SYRGLSKSGKQFHEFTEESSLEVEHAAFNSCDAEDIMDKPRVSKMEMEERIQMLSKRLNG 360

Query: 361 ADIDMPEWMFAQMMRSAKIRYSDHSILRVIQVLGKLGNWKRVLQVIEWLQMRERFKSHKL 420
           ADIDMPEWMFAQMMRSAKIRYSDHSILRVIQVLGKLGNWKRVLQVIEWLQMRERFKSHKL
Sbjct: 361 ADIDMPEWMFAQMMRSAKIRYSDHSILRVIQVLGKLGNWKRVLQVIEWLQMRERFKSHKL 420

Query: 421 RFIYTTALDVLGKARRPVEALNVFHAMQEHFSSYPDLVAYHSIAVTLGQAGYMRELFDVI 480
           RFIYTTALDVLGKARRPVEALNVFHAMQEHFSSYPDLVAYHSIAVTLGQAGYMRELFDVI
Sbjct: 421 RFIYTTALDVLGKARRPVEALNVFHAMQEHFSSYPDLVAYHSIAVTLGQAGYMRELFDVI 480

Query: 481 DSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKNWEGAFWVLQELKEQGLQP 540
           DSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKNWEGAFWVLQELKEQGLQP
Sbjct: 481 DSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKNWEGAFWVLQELKEQGLQP 540

Query: 541 STTTYGLVMEVMLQCGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKEGKTDEAVLAIQ 600
           STTTYGLVMEVMLQCGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKEGKTDEAVLAIQ
Sbjct: 541 STTTYGLVMEVMLQCGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKEGKTDEAVLAIQ 600

Query: 601 TMEKRGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTYTGLIQACLDSK 660
           TMEKRGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTYTGLIQACLDSK
Sbjct: 601 TMEKRGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTYTGLIQACLDSK 660

Query: 661 NLQSAVYIFNHMKAFCSPNLVTCNILLKGYLDHGMFDEAKELFQNMSENGRNISAVSDYR 720
           NLQSAVYIFNHMKAFCSPNLVTCNILLKGYLDHGMFDEAKELFQNMSENGRNISAVSDYR
Sbjct: 661 NLQSAVYIFNHMKAFCSPNLVTCNILLKGYLDHGMFDEAKELFQNMSENGRNISAVSDYR 720

Query: 721 DRVLPDIYTFNTMLDASFAEKRWDDFSHFYNQMLLYGYHFNPKRHLRMIMEAARGGKDEL 780
           DRVLPDIYTFNTMLDASFAEKRWDDFSHFYNQMLLYGYHFNPKRHLRMIMEAARGGKDEL
Sbjct: 721 DRVLPDIYTFNTMLDASFAEKRWDDFSHFYNQMLLYGYHFNPKRHLRMIMEAARGGKDEL 780

Query: 781 LETTWKHLAQADRTLPPPLIKERFCIMLARGDYSEALSCISKHHSSDEHHFSKSAWLNLL 840
           LETTWKHLAQADRTLPPPLIKERFCIMLARGDYSEALSCISKHHSSDEHHFSKSAWLNLL
Sbjct: 781 LETTWKHLAQADRTLPPPLIKERFCIMLARGDYSEALSCISKHHSSDEHHFSKSAWLNLL 840

Query: 841 KEKRFPKDSVIELIHKVSMLLARNDSPNPVLQNLLLSGKEFCRSRISVADPRLEEVVCTN 900
           KEKRFPKDSVIELIHKVSMLLARNDSPNPVLQNLLLSGKEFCRSRISVADPRLEEVVCTN
Sbjct: 841 KEKRFPKDSVIELIHKVSMLLARNDSPNPVLQNLLLSGKEFCRSRISVADPRLEEVVCTN 900

Query: 901 EFQSAAVMHV 910
           EFQSAAVMHV
Sbjct: 901 EFQSAAVMHV 910

BLAST of Cp4.1LG20g05810 vs. NCBI nr
Match: KAG7019446.1 (Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1779 bits (4609), Expect = 0.0
Identity = 888/910 (97.58%), Positives = 897/910 (98.57%), Query Frame = 0

Query: 1   MVGVIMANANLCIPCCEGNGFPALHCTQNSHYLLGFSFFTSSVSGSGLNSGSAKSRVLRH 60
           MVGVIMANANLCIPCCEGNGFPAL+CTQNSHYLLGFSFF SSVSGSGLN GSAKSRVLRH
Sbjct: 1   MVGVIMANANLCIPCCEGNGFPALYCTQNSHYLLGFSFFPSSVSGSGLNFGSAKSRVLRH 60

Query: 61  RGHKCGAIKASSKGESDIRLASGNLLENDFQFKPSFDEYVRVMESVRSRRYKRQSDDPNK 120
           RGHKCGAIKASSKGESDI+LASGNLLE DFQFKPSFDEYVRVMESVRSRRYKRQSDDPNK
Sbjct: 61  RGHKCGAIKASSKGESDIQLASGNLLEKDFQFKPSFDEYVRVMESVRSRRYKRQSDDPNK 120

Query: 121 MKENASAKSAESTSISNIVTDVQGNMDVKKKVICVDQEDLFDNSERITRKIDLSGNKFDS 180
           MKENASAKSAESTSISNIVTDVQGNMDVK KV+CVD EDLFDNSE+ITRK DLSGNKFDS
Sbjct: 121 MKENASAKSAESTSISNIVTDVQGNMDVKNKVVCVDGEDLFDNSEKITRKTDLSGNKFDS 180

Query: 181 KRKGVTRSKDELKGKVTPFDSQVNDKQHVEKRNGNWSNYIEPKVTRSNHDKRLHFKANTL 240
           KRKGVTRSKDELKGKVTPFDSQVNDKQH EKRNGNWSNYIEPK TRSNHDKRLHFKANTL
Sbjct: 181 KRKGVTRSKDELKGKVTPFDSQVNDKQHEEKRNGNWSNYIEPKATRSNHDKRLHFKANTL 240

Query: 241 DVKSESHGVRYGSSMKISEKIWADDDTKRTKDVLKVGKYGVQLEGNYIPGDKVGRKKTEQ 300
           DVKSESHGVRYGSSMKIS+KIWADDDTK TKDVLKVGKYGVQLEGNYIPGDKVGRKKTEQ
Sbjct: 241 DVKSESHGVRYGSSMKISDKIWADDDTKPTKDVLKVGKYGVQLEGNYIPGDKVGRKKTEQ 300

Query: 301 SYRGLSKSGKQFHEFTEESSLEVEHAAFNSCDAEDIMDKPRVSKMEMEERIQMLSKRLNG 360
           SYRGLSKSGK+FHEFTEESSLEVEHAAFNS DAEDIMDKPRVSKMEMEERIQMLSKRLNG
Sbjct: 301 SYRGLSKSGKRFHEFTEESSLEVEHAAFNSFDAEDIMDKPRVSKMEMEERIQMLSKRLNG 360

Query: 361 ADIDMPEWMFAQMMRSAKIRYSDHSILRVIQVLGKLGNWKRVLQVIEWLQMRERFKSHKL 420
           ADIDMPEWMFAQMMRSAKIRYSDHSILRVIQVLGKLGNWKRVLQVIEWLQMRERFKSHKL
Sbjct: 361 ADIDMPEWMFAQMMRSAKIRYSDHSILRVIQVLGKLGNWKRVLQVIEWLQMRERFKSHKL 420

Query: 421 RFIYTTALDVLGKARRPVEALNVFHAMQEHFSSYPDLVAYHSIAVTLGQAGYMRELFDVI 480
           RFIYTTALDVLGKARRPVEALNVFHAMQ+HFSSYPDLVAYHSIAVTLGQAGYMRELFDVI
Sbjct: 421 RFIYTTALDVLGKARRPVEALNVFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVI 480

Query: 481 DSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKNWEGAFWVLQELKEQGLQP 540
           DSMRSPPKKKFKTGA EKWDPRLQPDIVIYNAVLNACVKRKNWEGAFWVLQELKEQGLQP
Sbjct: 481 DSMRSPPKKKFKTGAFEKWDPRLQPDIVIYNAVLNACVKRKNWEGAFWVLQELKEQGLQP 540

Query: 541 STTTYGLVMEVMLQCGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKEGKTDEAVLAIQ 600
           STTTYGLVMEVMLQCGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKEGKTDEAVLAIQ
Sbjct: 541 STTTYGLVMEVMLQCGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKEGKTDEAVLAIQ 600

Query: 601 TMEKRGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTYTGLIQACLDSK 660
           TMEKRGIVGSAALYYDFARCLCSAGRC+EALMQMEKICKVANKPLVVTYTGLIQACLDSK
Sbjct: 601 TMEKRGIVGSAALYYDFARCLCSAGRCEEALMQMEKICKVANKPLVVTYTGLIQACLDSK 660

Query: 661 NLQSAVYIFNHMKAFCSPNLVTCNILLKGYLDHGMFDEAKELFQNMSENGRNISAVSDYR 720
           NLQSAVYIFNHMKAFCSPNLVTCNILLKGYLDHGMFDEAKELFQNMSENGRNISAVSDYR
Sbjct: 661 NLQSAVYIFNHMKAFCSPNLVTCNILLKGYLDHGMFDEAKELFQNMSENGRNISAVSDYR 720

Query: 721 DRVLPDIYTFNTMLDASFAEKRWDDFSHFYNQMLLYGYHFNPKRHLRMIMEAARGGKDEL 780
           DRVLPDIYTFNTMLDASFAEKRWDDFSHFYNQMLLYGYHFNPKRHLRMIMEAARGGKDEL
Sbjct: 721 DRVLPDIYTFNTMLDASFAEKRWDDFSHFYNQMLLYGYHFNPKRHLRMIMEAARGGKDEL 780

Query: 781 LETTWKHLAQADRTLPPPLIKERFCIMLARGDYSEALSCISKHHSSDEHHFSKSAWLNLL 840
           LETTWKHLAQADRTLPPPLIKERFCIMLARGDYSEALSCISKHHSSDEHHFSKSAWLNLL
Sbjct: 781 LETTWKHLAQADRTLPPPLIKERFCIMLARGDYSEALSCISKHHSSDEHHFSKSAWLNLL 840

Query: 841 KEKRFPKDSVIELIHKVSMLLARNDSPNPVLQNLLLSGKEFCRSRISVADPRLEEVVCTN 900
           KEKRFPKDSVIELIHKVSMLLARNDSPNPVLQNLLLSGKEFCRSRI+VADPRLEEVVCTN
Sbjct: 841 KEKRFPKDSVIELIHKVSMLLARNDSPNPVLQNLLLSGKEFCRSRITVADPRLEEVVCTN 900

Query: 901 EFQSAAVMHV 910
           E QSA VMHV
Sbjct: 901 ESQSATVMHV 910

BLAST of Cp4.1LG20g05810 vs. NCBI nr
Match: XP_022927392.1 (LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At1g30610, chloroplastic [Cucurbita moschata])

HSP 1 Score: 1775 bits (4597), Expect = 0.0
Identity = 885/910 (97.25%), Positives = 895/910 (98.35%), Query Frame = 0

Query: 1   MVGVIMANANLCIPCCEGNGFPALHCTQNSHYLLGFSFFTSSVSGSGLNSGSAKSRVLRH 60
           MVGVIMANANLCIPCCEGNGFPAL+CTQNSHYLLGFS F SSVSGSGLN GSAKSRVLRH
Sbjct: 1   MVGVIMANANLCIPCCEGNGFPALYCTQNSHYLLGFSVFPSSVSGSGLNFGSAKSRVLRH 60

Query: 61  RGHKCGAIKASSKGESDIRLASGNLLENDFQFKPSFDEYVRVMESVRSRRYKRQSDDPNK 120
           RGHKCGAIKASSKGESDI+LASGNLLE DFQFKPSFDEYVRVMESVRSRRYKRQSDDPNK
Sbjct: 61  RGHKCGAIKASSKGESDIQLASGNLLEKDFQFKPSFDEYVRVMESVRSRRYKRQSDDPNK 120

Query: 121 MKENASAKSAESTSISNIVTDVQGNMDVKKKVICVDQEDLFDNSERITRKIDLSGNKFDS 180
           MKENASAKSAEST ISNIVTDVQGNMDVK KV+CVD EDLFDNSE+ITRK DLSGNKFDS
Sbjct: 121 MKENASAKSAESTFISNIVTDVQGNMDVKNKVVCVDGEDLFDNSEKITRKTDLSGNKFDS 180

Query: 181 KRKGVTRSKDELKGKVTPFDSQVNDKQHVEKRNGNWSNYIEPKVTRSNHDKRLHFKANTL 240
           KRKGVTRSKDELKGKVTPF+SQVNDKQH EKRNGNWSNYIEPK TRSNHDKRLHFKANTL
Sbjct: 181 KRKGVTRSKDELKGKVTPFESQVNDKQHEEKRNGNWSNYIEPKATRSNHDKRLHFKANTL 240

Query: 241 DVKSESHGVRYGSSMKISEKIWADDDTKRTKDVLKVGKYGVQLEGNYIPGDKVGRKKTEQ 300
           DVKSESHGVRYGSSMKIS+KIWADDD+K TKDVLKVGKYGVQLEGNYIPGDKVGRKKTEQ
Sbjct: 241 DVKSESHGVRYGSSMKISDKIWADDDSKPTKDVLKVGKYGVQLEGNYIPGDKVGRKKTEQ 300

Query: 301 SYRGLSKSGKQFHEFTEESSLEVEHAAFNSCDAEDIMDKPRVSKMEMEERIQMLSKRLNG 360
           SYRGLSKSGK+FHEFTEESSLEVEHAAFNSCDAEDIMDKPRVSKMEMEERIQMLS RLNG
Sbjct: 301 SYRGLSKSGKRFHEFTEESSLEVEHAAFNSCDAEDIMDKPRVSKMEMEERIQMLSNRLNG 360

Query: 361 ADIDMPEWMFAQMMRSAKIRYSDHSILRVIQVLGKLGNWKRVLQVIEWLQMRERFKSHKL 420
           ADIDMPEWMFAQMMRSAKIRYSDHSILRVIQVLGKLGNWKRVLQVIEWLQMRERFKSHKL
Sbjct: 361 ADIDMPEWMFAQMMRSAKIRYSDHSILRVIQVLGKLGNWKRVLQVIEWLQMRERFKSHKL 420

Query: 421 RFIYTTALDVLGKARRPVEALNVFHAMQEHFSSYPDLVAYHSIAVTLGQAGYMRELFDVI 480
           RFIYTTALDVLGKARRPVEALNVFHAMQ+HFSSYPDLVAYHSIAVTLGQAGYMRELFDVI
Sbjct: 421 RFIYTTALDVLGKARRPVEALNVFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVI 480

Query: 481 DSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKNWEGAFWVLQELKEQGLQP 540
           DSMRSPPKKKFKTGA EKWDPRLQPDIVIYNAVLNACVKRKNWEGAFWVLQELKEQGLQP
Sbjct: 481 DSMRSPPKKKFKTGAFEKWDPRLQPDIVIYNAVLNACVKRKNWEGAFWVLQELKEQGLQP 540

Query: 541 STTTYGLVMEVMLQCGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKEGKTDEAVLAIQ 600
           STTTYGLVMEVMLQCGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKEGKTDEAVLAIQ
Sbjct: 541 STTTYGLVMEVMLQCGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKEGKTDEAVLAIQ 600

Query: 601 TMEKRGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTYTGLIQACLDSK 660
           TMEKRGIVGSAALYYDFARCLCSAGRC+EALMQMEKICKVANKPLVVTYTGLIQACLDSK
Sbjct: 601 TMEKRGIVGSAALYYDFARCLCSAGRCEEALMQMEKICKVANKPLVVTYTGLIQACLDSK 660

Query: 661 NLQSAVYIFNHMKAFCSPNLVTCNILLKGYLDHGMFDEAKELFQNMSENGRNISAVSDYR 720
           NLQSAVYIFNHMKAFCSPNLVTCNILLKGYLDHGMFDEAKELFQNMSENGRNISAVSDYR
Sbjct: 661 NLQSAVYIFNHMKAFCSPNLVTCNILLKGYLDHGMFDEAKELFQNMSENGRNISAVSDYR 720

Query: 721 DRVLPDIYTFNTMLDASFAEKRWDDFSHFYNQMLLYGYHFNPKRHLRMIMEAARGGKDEL 780
           DRVLPDIYTFNTMLDASFAEKRWDDFSHFYNQMLLYGYHFNPKRHLRMIMEAARGGKDEL
Sbjct: 721 DRVLPDIYTFNTMLDASFAEKRWDDFSHFYNQMLLYGYHFNPKRHLRMIMEAARGGKDEL 780

Query: 781 LETTWKHLAQADRTLPPPLIKERFCIMLARGDYSEALSCISKHHSSDEHHFSKSAWLNLL 840
           LETTWKHLAQADRTLPPPLIKERFCIMLARGDYSEALSCISKHHSSDEHHFSKSAWLNLL
Sbjct: 781 LETTWKHLAQADRTLPPPLIKERFCIMLARGDYSEALSCISKHHSSDEHHFSKSAWLNLL 840

Query: 841 KEKRFPKDSVIELIHKVSMLLARNDSPNPVLQNLLLSGKEFCRSRISVADPRLEEVVCTN 900
           KEKRFPKDSVIELIHKVSMLLARNDSPNPVLQNLLLSGKEFCRSRISVADPRLEEVVCTN
Sbjct: 841 KEKRFPKDSVIELIHKVSMLLARNDSPNPVLQNLLLSGKEFCRSRISVADPRLEEVVCTN 900

Query: 901 EFQSAAVMHV 910
           E QSA VMHV
Sbjct: 901 ESQSATVMHV 910

BLAST of Cp4.1LG20g05810 vs. NCBI nr
Match: XP_023000737.1 (pentatricopeptide repeat-containing protein At1g30610, chloroplastic [Cucurbita maxima])

HSP 1 Score: 1764 bits (4570), Expect = 0.0
Identity = 883/910 (97.03%), Positives = 893/910 (98.13%), Query Frame = 0

Query: 1   MVGVIMANANLCIPCCEGNGFPALHCTQNSHYLLGFSFFTSSVSGSGLNSGSAKSRVLRH 60
           MVGVIMANANLCIPCCEGNGF AL+CTQNSHYLLG SFF SSVSGSGLN GSAKSRVLRH
Sbjct: 1   MVGVIMANANLCIPCCEGNGFSALYCTQNSHYLLGLSFFPSSVSGSGLNFGSAKSRVLRH 60

Query: 61  RGHKCGAIKASSKGESDIRLASGNLLENDFQFKPSFDEYVRVMESVRSRRYKRQSDDPNK 120
           RGHKCGAIKASSKGESDI+LASGNLLE DFQFKPSFDEYVRVMESVRSRRYKRQSDDPNK
Sbjct: 61  RGHKCGAIKASSKGESDIQLASGNLLEKDFQFKPSFDEYVRVMESVRSRRYKRQSDDPNK 120

Query: 121 MKENASAKSAESTSISNIVTDVQGNMDVKKKVICVDQEDLFDNSERITRKIDLSGNKFDS 180
           MKENASAKSAESTSISNIVTDVQGNMDVK KV+ VD EDLFDNSERITRK DLSGNKFDS
Sbjct: 121 MKENASAKSAESTSISNIVTDVQGNMDVKNKVVYVDGEDLFDNSERITRKTDLSGNKFDS 180

Query: 181 KRKGVTRSKDELKGKVTPFDSQVNDKQHVEKRNGNWSNYIEPKVTRSNHDKRLHFKANTL 240
           KRKGVTRSKDELKGKVTPFDSQ+NDKQH EKRNGNWSNYIEPKVTRSNHDKRLHFKANTL
Sbjct: 181 KRKGVTRSKDELKGKVTPFDSQINDKQHEEKRNGNWSNYIEPKVTRSNHDKRLHFKANTL 240

Query: 241 DVKSESHGVRYGSSMKISEKIWADDDTKRTKDVLKVGKYGVQLEGNYIPGDKVGRKKTEQ 300
           DVKSESHGVRYGSSMKISEKIWADDD K TKDVLKVGKYGVQL+GNYIPGDKVGRKKTEQ
Sbjct: 241 DVKSESHGVRYGSSMKISEKIWADDDIKPTKDVLKVGKYGVQLKGNYIPGDKVGRKKTEQ 300

Query: 301 SYRGLSKSGKQFHEFTEESSLEVEHAAFNSCDAEDIMDKPRVSKMEMEERIQMLSKRLNG 360
           SYRGLSKSGK+FHEFTEESSLEVEHAAFNSCDA DIMDKPRVSKMEMEERIQMLSKRLNG
Sbjct: 301 SYRGLSKSGKRFHEFTEESSLEVEHAAFNSCDAADIMDKPRVSKMEMEERIQMLSKRLNG 360

Query: 361 ADIDMPEWMFAQMMRSAKIRYSDHSILRVIQVLGKLGNWKRVLQVIEWLQMRERFKSHKL 420
           ADIDMPEWMFAQMMRSAKIRYSDHSILRVIQVLGKLGNWKRVLQVIEWLQMRERFKSHKL
Sbjct: 361 ADIDMPEWMFAQMMRSAKIRYSDHSILRVIQVLGKLGNWKRVLQVIEWLQMRERFKSHKL 420

Query: 421 RFIYTTALDVLGKARRPVEALNVFHAMQEHFSSYPDLVAYHSIAVTLGQAGYMRELFDVI 480
           RFIYTTALDVLGKARRPVEALNVFHAMQ+HFSSYPDLVAYHSIAVTLGQAGYMRELFDVI
Sbjct: 421 RFIYTTALDVLGKARRPVEALNVFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVI 480

Query: 481 DSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKNWEGAFWVLQELKEQGLQP 540
           DSMRSPPKKKFKTGA EKWDPRLQPDIVIYNAVLNACVKRKNWEGAFWVLQELKEQGLQP
Sbjct: 481 DSMRSPPKKKFKTGAFEKWDPRLQPDIVIYNAVLNACVKRKNWEGAFWVLQELKEQGLQP 540

Query: 541 STTTYGLVMEVMLQCGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKEGKTDEAVLAIQ 600
           STTTYGLVMEVMLQCGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKEGKTDEAVLAIQ
Sbjct: 541 STTTYGLVMEVMLQCGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKEGKTDEAVLAIQ 600

Query: 601 TMEKRGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTYTGLIQACLDSK 660
           TMEKRGIVGSAALYYDFARCLCSAGR +EALMQMEKICKVANKPLVVTYTGLIQACLDSK
Sbjct: 601 TMEKRGIVGSAALYYDFARCLCSAGRWEEALMQMEKICKVANKPLVVTYTGLIQACLDSK 660

Query: 661 NLQSAVYIFNHMKAFCSPNLVTCNILLKGYLDHGMFDEAKELFQNMSENGRNISAVSDYR 720
           NLQSAVYIFNHMKAFCSPNLVTCNILLKGYLDHGMF+EAKELFQNMSENGRNISAVSDYR
Sbjct: 661 NLQSAVYIFNHMKAFCSPNLVTCNILLKGYLDHGMFNEAKELFQNMSENGRNISAVSDYR 720

Query: 721 DRVLPDIYTFNTMLDASFAEKRWDDFSHFYNQMLLYGYHFNPKRHLRMIMEAARGGKDEL 780
           DRVLPDIYTFNTMLDASFAEKRWDDFSHFYNQMLLYGYHFNPKRHLRMIMEAARGGKDEL
Sbjct: 721 DRVLPDIYTFNTMLDASFAEKRWDDFSHFYNQMLLYGYHFNPKRHLRMIMEAARGGKDEL 780

Query: 781 LETTWKHLAQADRTLPPPLIKERFCIMLARGDYSEALSCISKHHSSDEHHFSKSAWLNLL 840
           LETTWKHLAQADR LPPPLIKERFCIMLARGDYSEALSCISKHHSSDEHHFSKSAWLNLL
Sbjct: 781 LETTWKHLAQADRILPPPLIKERFCIMLARGDYSEALSCISKHHSSDEHHFSKSAWLNLL 840

Query: 841 KEKRFPKDSVIELIHKVSMLLARNDSPNPVLQNLLLSGKEFCRSRISVADPRLEEVVCTN 900
           KEKRFPKDSVI+LIHKVSMLLARNDSPNPVLQNLLLSGKEFCRSRISVADPRLEEVVCTN
Sbjct: 841 KEKRFPKDSVIQLIHKVSMLLARNDSPNPVLQNLLLSGKEFCRSRISVADPRLEEVVCTN 900

Query: 901 EFQSAAVMHV 910
           E QSAAVMHV
Sbjct: 901 ESQSAAVMHV 910

BLAST of Cp4.1LG20g05810 vs. NCBI nr
Match: KAG6583820.1 (Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1651 bits (4276), Expect = 0.0
Identity = 832/877 (94.87%), Positives = 839/877 (95.67%), Query Frame = 0

Query: 1   MVGVIMANANLCIPCCEGNGFPALHCTQNSHYLLGFSFFTSSVSGSGLNSGSAKSRVLRH 60
           MVGVIMANANLCIPCCEGNGFPAL+CTQNSHYLLGFSFF SSVSGSGLN GSAKSRVLRH
Sbjct: 1   MVGVIMANANLCIPCCEGNGFPALYCTQNSHYLLGFSFFPSSVSGSGLNFGSAKSRVLRH 60

Query: 61  RGHKCGAIKASSKGESDIRLASGNLLENDFQFKPSFDEYVRVMESVRSRRYKRQSDDPNK 120
           RGHKCGAIKASSKGESDI+LASGNLLE DFQFKPSFDEYVRVMESVRSRRYKRQSDDPNK
Sbjct: 61  RGHKCGAIKASSKGESDIQLASGNLLEKDFQFKPSFDEYVRVMESVRSRRYKRQSDDPNK 120

Query: 121 MKENASAKSAESTSISNIVTDVQGNMDVKKKVICVDQEDLFDNSERITRKIDLSGNKFDS 180
           MKENASAKSAESTSISNIVTDVQGNMDVK KV+CVD EDLFDNSE+I RK DLSGNKFDS
Sbjct: 121 MKENASAKSAESTSISNIVTDVQGNMDVKNKVVCVDGEDLFDNSEKIPRKTDLSGNKFDS 180

Query: 181 KRKGVTRSKDELKGKVTPFDSQVNDKQHVEKRNGNWSNYIEPKVTRSNHDKRLHFKANTL 240
           KRKGVTRSKDELKGKVTPFDSQVNDKQH EKRNGNWSNYIEPK TRSNHDKRLHFKANTL
Sbjct: 181 KRKGVTRSKDELKGKVTPFDSQVNDKQHEEKRNGNWSNYIEPKATRSNHDKRLHFKANTL 240

Query: 241 DVKSESHGVRYGSSMKISEKIWADDDTKRTKDVLKVGKYGVQLEGNYIPGDKVGRKKTEQ 300
           DVKSESHGVRYGSSMKIS+KIWADDDTK TKDVLKVGKYGVQLEGNYIPGDKVGRKKTEQ
Sbjct: 241 DVKSESHGVRYGSSMKISDKIWADDDTKPTKDVLKVGKYGVQLEGNYIPGDKVGRKKTEQ 300

Query: 301 SYRGLSKSGKQFHEFTEESSLEVEHAAFNSCDAEDIMDKPRVSKMEMEERIQMLSKRLNG 360
           SYRGLSKS                         EDIMDKPRVSKMEMEERIQMLSKRLNG
Sbjct: 301 SYRGLSKS-------------------------EDIMDKPRVSKMEMEERIQMLSKRLNG 360

Query: 361 ADIDMPEWMFAQMMRSAKIRYSDHSILRVIQVLGKLGNWKRVLQVIEWLQMRERFKSHKL 420
           ADIDMPEWMFAQMMRSAKIRYSDHSILRVIQVLGKLGNWKRVLQVIEWLQMRERFKSHKL
Sbjct: 361 ADIDMPEWMFAQMMRSAKIRYSDHSILRVIQVLGKLGNWKRVLQVIEWLQMRERFKSHKL 420

Query: 421 RFIYTTALDVLGKARRPVEALNVFHAMQEHFSSYPDLVAYHSIAVTLGQAGYMRELFDVI 480
           RFIYTTALDVLGKARRPVEALNVFHAMQ+ FSSYPDLVAYHSIAVTLGQAGYMRELFDVI
Sbjct: 421 RFIYTTALDVLGKARRPVEALNVFHAMQQQFSSYPDLVAYHSIAVTLGQAGYMRELFDVI 480

Query: 481 DSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKNWEGAFWVLQELKEQGLQP 540
           DSMRSPPKKKFKTGA EKWDPRLQPDIVIYNAVLNACVKRKNWEGAFWVLQELKEQGLQP
Sbjct: 481 DSMRSPPKKKFKTGAFEKWDPRLQPDIVIYNAVLNACVKRKNWEGAFWVLQELKEQGLQP 540

Query: 541 STTTYGLVMEVMLQCGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKEGKTDEAVLAIQ 600
           STTTYGLVMEVMLQCGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKEGKTDEAVLAIQ
Sbjct: 541 STTTYGLVMEVMLQCGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKEGKTDEAVLAIQ 600

Query: 601 TMEKRGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTYTGLIQACLDSK 660
           TMEKRGIVGSAALYYDFARCLCSAGRC+EALMQMEKICKVANKPLVVTYTGLIQACLDSK
Sbjct: 601 TMEKRGIVGSAALYYDFARCLCSAGRCEEALMQMEKICKVANKPLVVTYTGLIQACLDSK 660

Query: 661 NLQSAVYIFNHMKAFCSPNLVTCNILLKGYLDHGMFDEAKELFQNMSENGRNISAVSDYR 720
           NLQSAVYIFNHMKAFCSPNLVTCNILLKGYLDHGMFDEAKELFQNMSENGRNISAVSDYR
Sbjct: 661 NLQSAVYIFNHMKAFCSPNLVTCNILLKGYLDHGMFDEAKELFQNMSENGRNISAVSDYR 720

Query: 721 DRVLPDIYTFNTMLDASFAEKRWDDFSHFYNQMLLYGYHFNPKRHLRMIMEAARGGKDEL 780
           DRVLPDIYTFNTMLDASFAEKRWDDFSHFYNQMLLYGYHFNPKRHLRMIMEAARGGKDEL
Sbjct: 721 DRVLPDIYTFNTMLDASFAEKRWDDFSHFYNQMLLYGYHFNPKRHLRMIMEAARGGKDEL 780

Query: 781 LETTWKHLAQADRTLPPPLIKERFCIMLARGDYSEALSCISKHHSSDEHHFSKSAWLNLL 840
           LETTWKHLAQADRTLPPPLIKERFCIMLARGDYSEALSCISKHHSSDEHHFSKSA LNLL
Sbjct: 781 LETTWKHLAQADRTLPPPLIKERFCIMLARGDYSEALSCISKHHSSDEHHFSKSACLNLL 840

Query: 841 KEKRFPKDSVIELIHKVSMLLARNDSPNPVLQNLLLS 877
           KEKRFPKDSVIELIHKVSMLLARNDSPNPVLQNLLLS
Sbjct: 841 KEKRFPKDSVIELIHKVSMLLARNDSPNPVLQNLLLS 852

BLAST of Cp4.1LG20g05810 vs. ExPASy TrEMBL
Match: A0A6J1EH18 (LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At1g30610, chloroplastic OS=Cucurbita moschata OX=3662 GN=LOC111434226 PE=4 SV=1)

HSP 1 Score: 1775 bits (4597), Expect = 0.0
Identity = 885/910 (97.25%), Positives = 895/910 (98.35%), Query Frame = 0

Query: 1   MVGVIMANANLCIPCCEGNGFPALHCTQNSHYLLGFSFFTSSVSGSGLNSGSAKSRVLRH 60
           MVGVIMANANLCIPCCEGNGFPAL+CTQNSHYLLGFS F SSVSGSGLN GSAKSRVLRH
Sbjct: 1   MVGVIMANANLCIPCCEGNGFPALYCTQNSHYLLGFSVFPSSVSGSGLNFGSAKSRVLRH 60

Query: 61  RGHKCGAIKASSKGESDIRLASGNLLENDFQFKPSFDEYVRVMESVRSRRYKRQSDDPNK 120
           RGHKCGAIKASSKGESDI+LASGNLLE DFQFKPSFDEYVRVMESVRSRRYKRQSDDPNK
Sbjct: 61  RGHKCGAIKASSKGESDIQLASGNLLEKDFQFKPSFDEYVRVMESVRSRRYKRQSDDPNK 120

Query: 121 MKENASAKSAESTSISNIVTDVQGNMDVKKKVICVDQEDLFDNSERITRKIDLSGNKFDS 180
           MKENASAKSAEST ISNIVTDVQGNMDVK KV+CVD EDLFDNSE+ITRK DLSGNKFDS
Sbjct: 121 MKENASAKSAESTFISNIVTDVQGNMDVKNKVVCVDGEDLFDNSEKITRKTDLSGNKFDS 180

Query: 181 KRKGVTRSKDELKGKVTPFDSQVNDKQHVEKRNGNWSNYIEPKVTRSNHDKRLHFKANTL 240
           KRKGVTRSKDELKGKVTPF+SQVNDKQH EKRNGNWSNYIEPK TRSNHDKRLHFKANTL
Sbjct: 181 KRKGVTRSKDELKGKVTPFESQVNDKQHEEKRNGNWSNYIEPKATRSNHDKRLHFKANTL 240

Query: 241 DVKSESHGVRYGSSMKISEKIWADDDTKRTKDVLKVGKYGVQLEGNYIPGDKVGRKKTEQ 300
           DVKSESHGVRYGSSMKIS+KIWADDD+K TKDVLKVGKYGVQLEGNYIPGDKVGRKKTEQ
Sbjct: 241 DVKSESHGVRYGSSMKISDKIWADDDSKPTKDVLKVGKYGVQLEGNYIPGDKVGRKKTEQ 300

Query: 301 SYRGLSKSGKQFHEFTEESSLEVEHAAFNSCDAEDIMDKPRVSKMEMEERIQMLSKRLNG 360
           SYRGLSKSGK+FHEFTEESSLEVEHAAFNSCDAEDIMDKPRVSKMEMEERIQMLS RLNG
Sbjct: 301 SYRGLSKSGKRFHEFTEESSLEVEHAAFNSCDAEDIMDKPRVSKMEMEERIQMLSNRLNG 360

Query: 361 ADIDMPEWMFAQMMRSAKIRYSDHSILRVIQVLGKLGNWKRVLQVIEWLQMRERFKSHKL 420
           ADIDMPEWMFAQMMRSAKIRYSDHSILRVIQVLGKLGNWKRVLQVIEWLQMRERFKSHKL
Sbjct: 361 ADIDMPEWMFAQMMRSAKIRYSDHSILRVIQVLGKLGNWKRVLQVIEWLQMRERFKSHKL 420

Query: 421 RFIYTTALDVLGKARRPVEALNVFHAMQEHFSSYPDLVAYHSIAVTLGQAGYMRELFDVI 480
           RFIYTTALDVLGKARRPVEALNVFHAMQ+HFSSYPDLVAYHSIAVTLGQAGYMRELFDVI
Sbjct: 421 RFIYTTALDVLGKARRPVEALNVFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVI 480

Query: 481 DSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKNWEGAFWVLQELKEQGLQP 540
           DSMRSPPKKKFKTGA EKWDPRLQPDIVIYNAVLNACVKRKNWEGAFWVLQELKEQGLQP
Sbjct: 481 DSMRSPPKKKFKTGAFEKWDPRLQPDIVIYNAVLNACVKRKNWEGAFWVLQELKEQGLQP 540

Query: 541 STTTYGLVMEVMLQCGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKEGKTDEAVLAIQ 600
           STTTYGLVMEVMLQCGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKEGKTDEAVLAIQ
Sbjct: 541 STTTYGLVMEVMLQCGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKEGKTDEAVLAIQ 600

Query: 601 TMEKRGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTYTGLIQACLDSK 660
           TMEKRGIVGSAALYYDFARCLCSAGRC+EALMQMEKICKVANKPLVVTYTGLIQACLDSK
Sbjct: 601 TMEKRGIVGSAALYYDFARCLCSAGRCEEALMQMEKICKVANKPLVVTYTGLIQACLDSK 660

Query: 661 NLQSAVYIFNHMKAFCSPNLVTCNILLKGYLDHGMFDEAKELFQNMSENGRNISAVSDYR 720
           NLQSAVYIFNHMKAFCSPNLVTCNILLKGYLDHGMFDEAKELFQNMSENGRNISAVSDYR
Sbjct: 661 NLQSAVYIFNHMKAFCSPNLVTCNILLKGYLDHGMFDEAKELFQNMSENGRNISAVSDYR 720

Query: 721 DRVLPDIYTFNTMLDASFAEKRWDDFSHFYNQMLLYGYHFNPKRHLRMIMEAARGGKDEL 780
           DRVLPDIYTFNTMLDASFAEKRWDDFSHFYNQMLLYGYHFNPKRHLRMIMEAARGGKDEL
Sbjct: 721 DRVLPDIYTFNTMLDASFAEKRWDDFSHFYNQMLLYGYHFNPKRHLRMIMEAARGGKDEL 780

Query: 781 LETTWKHLAQADRTLPPPLIKERFCIMLARGDYSEALSCISKHHSSDEHHFSKSAWLNLL 840
           LETTWKHLAQADRTLPPPLIKERFCIMLARGDYSEALSCISKHHSSDEHHFSKSAWLNLL
Sbjct: 781 LETTWKHLAQADRTLPPPLIKERFCIMLARGDYSEALSCISKHHSSDEHHFSKSAWLNLL 840

Query: 841 KEKRFPKDSVIELIHKVSMLLARNDSPNPVLQNLLLSGKEFCRSRISVADPRLEEVVCTN 900
           KEKRFPKDSVIELIHKVSMLLARNDSPNPVLQNLLLSGKEFCRSRISVADPRLEEVVCTN
Sbjct: 841 KEKRFPKDSVIELIHKVSMLLARNDSPNPVLQNLLLSGKEFCRSRISVADPRLEEVVCTN 900

Query: 901 EFQSAAVMHV 910
           E QSA VMHV
Sbjct: 901 ESQSATVMHV 910

BLAST of Cp4.1LG20g05810 vs. ExPASy TrEMBL
Match: A0A6J1KEH7 (pentatricopeptide repeat-containing protein At1g30610, chloroplastic OS=Cucurbita maxima OX=3661 GN=LOC111495096 PE=4 SV=1)

HSP 1 Score: 1764 bits (4570), Expect = 0.0
Identity = 883/910 (97.03%), Positives = 893/910 (98.13%), Query Frame = 0

Query: 1   MVGVIMANANLCIPCCEGNGFPALHCTQNSHYLLGFSFFTSSVSGSGLNSGSAKSRVLRH 60
           MVGVIMANANLCIPCCEGNGF AL+CTQNSHYLLG SFF SSVSGSGLN GSAKSRVLRH
Sbjct: 1   MVGVIMANANLCIPCCEGNGFSALYCTQNSHYLLGLSFFPSSVSGSGLNFGSAKSRVLRH 60

Query: 61  RGHKCGAIKASSKGESDIRLASGNLLENDFQFKPSFDEYVRVMESVRSRRYKRQSDDPNK 120
           RGHKCGAIKASSKGESDI+LASGNLLE DFQFKPSFDEYVRVMESVRSRRYKRQSDDPNK
Sbjct: 61  RGHKCGAIKASSKGESDIQLASGNLLEKDFQFKPSFDEYVRVMESVRSRRYKRQSDDPNK 120

Query: 121 MKENASAKSAESTSISNIVTDVQGNMDVKKKVICVDQEDLFDNSERITRKIDLSGNKFDS 180
           MKENASAKSAESTSISNIVTDVQGNMDVK KV+ VD EDLFDNSERITRK DLSGNKFDS
Sbjct: 121 MKENASAKSAESTSISNIVTDVQGNMDVKNKVVYVDGEDLFDNSERITRKTDLSGNKFDS 180

Query: 181 KRKGVTRSKDELKGKVTPFDSQVNDKQHVEKRNGNWSNYIEPKVTRSNHDKRLHFKANTL 240
           KRKGVTRSKDELKGKVTPFDSQ+NDKQH EKRNGNWSNYIEPKVTRSNHDKRLHFKANTL
Sbjct: 181 KRKGVTRSKDELKGKVTPFDSQINDKQHEEKRNGNWSNYIEPKVTRSNHDKRLHFKANTL 240

Query: 241 DVKSESHGVRYGSSMKISEKIWADDDTKRTKDVLKVGKYGVQLEGNYIPGDKVGRKKTEQ 300
           DVKSESHGVRYGSSMKISEKIWADDD K TKDVLKVGKYGVQL+GNYIPGDKVGRKKTEQ
Sbjct: 241 DVKSESHGVRYGSSMKISEKIWADDDIKPTKDVLKVGKYGVQLKGNYIPGDKVGRKKTEQ 300

Query: 301 SYRGLSKSGKQFHEFTEESSLEVEHAAFNSCDAEDIMDKPRVSKMEMEERIQMLSKRLNG 360
           SYRGLSKSGK+FHEFTEESSLEVEHAAFNSCDA DIMDKPRVSKMEMEERIQMLSKRLNG
Sbjct: 301 SYRGLSKSGKRFHEFTEESSLEVEHAAFNSCDAADIMDKPRVSKMEMEERIQMLSKRLNG 360

Query: 361 ADIDMPEWMFAQMMRSAKIRYSDHSILRVIQVLGKLGNWKRVLQVIEWLQMRERFKSHKL 420
           ADIDMPEWMFAQMMRSAKIRYSDHSILRVIQVLGKLGNWKRVLQVIEWLQMRERFKSHKL
Sbjct: 361 ADIDMPEWMFAQMMRSAKIRYSDHSILRVIQVLGKLGNWKRVLQVIEWLQMRERFKSHKL 420

Query: 421 RFIYTTALDVLGKARRPVEALNVFHAMQEHFSSYPDLVAYHSIAVTLGQAGYMRELFDVI 480
           RFIYTTALDVLGKARRPVEALNVFHAMQ+HFSSYPDLVAYHSIAVTLGQAGYMRELFDVI
Sbjct: 421 RFIYTTALDVLGKARRPVEALNVFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVI 480

Query: 481 DSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKNWEGAFWVLQELKEQGLQP 540
           DSMRSPPKKKFKTGA EKWDPRLQPDIVIYNAVLNACVKRKNWEGAFWVLQELKEQGLQP
Sbjct: 481 DSMRSPPKKKFKTGAFEKWDPRLQPDIVIYNAVLNACVKRKNWEGAFWVLQELKEQGLQP 540

Query: 541 STTTYGLVMEVMLQCGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKEGKTDEAVLAIQ 600
           STTTYGLVMEVMLQCGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKEGKTDEAVLAIQ
Sbjct: 541 STTTYGLVMEVMLQCGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKEGKTDEAVLAIQ 600

Query: 601 TMEKRGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTYTGLIQACLDSK 660
           TMEKRGIVGSAALYYDFARCLCSAGR +EALMQMEKICKVANKPLVVTYTGLIQACLDSK
Sbjct: 601 TMEKRGIVGSAALYYDFARCLCSAGRWEEALMQMEKICKVANKPLVVTYTGLIQACLDSK 660

Query: 661 NLQSAVYIFNHMKAFCSPNLVTCNILLKGYLDHGMFDEAKELFQNMSENGRNISAVSDYR 720
           NLQSAVYIFNHMKAFCSPNLVTCNILLKGYLDHGMF+EAKELFQNMSENGRNISAVSDYR
Sbjct: 661 NLQSAVYIFNHMKAFCSPNLVTCNILLKGYLDHGMFNEAKELFQNMSENGRNISAVSDYR 720

Query: 721 DRVLPDIYTFNTMLDASFAEKRWDDFSHFYNQMLLYGYHFNPKRHLRMIMEAARGGKDEL 780
           DRVLPDIYTFNTMLDASFAEKRWDDFSHFYNQMLLYGYHFNPKRHLRMIMEAARGGKDEL
Sbjct: 721 DRVLPDIYTFNTMLDASFAEKRWDDFSHFYNQMLLYGYHFNPKRHLRMIMEAARGGKDEL 780

Query: 781 LETTWKHLAQADRTLPPPLIKERFCIMLARGDYSEALSCISKHHSSDEHHFSKSAWLNLL 840
           LETTWKHLAQADR LPPPLIKERFCIMLARGDYSEALSCISKHHSSDEHHFSKSAWLNLL
Sbjct: 781 LETTWKHLAQADRILPPPLIKERFCIMLARGDYSEALSCISKHHSSDEHHFSKSAWLNLL 840

Query: 841 KEKRFPKDSVIELIHKVSMLLARNDSPNPVLQNLLLSGKEFCRSRISVADPRLEEVVCTN 900
           KEKRFPKDSVI+LIHKVSMLLARNDSPNPVLQNLLLSGKEFCRSRISVADPRLEEVVCTN
Sbjct: 841 KEKRFPKDSVIQLIHKVSMLLARNDSPNPVLQNLLLSGKEFCRSRISVADPRLEEVVCTN 900

Query: 901 EFQSAAVMHV 910
           E QSAAVMHV
Sbjct: 901 ESQSAAVMHV 910

BLAST of Cp4.1LG20g05810 vs. ExPASy TrEMBL
Match: A0A0A0LVN7 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G553530 PE=4 SV=1)

HSP 1 Score: 1480 bits (3831), Expect = 0.0
Identity = 756/907 (83.35%), Positives = 810/907 (89.31%), Query Frame = 0

Query: 1   MVGVIMANANLCIPCCEGNGFPALHCTQNSHYLLGFSFFTSSVSGSGLNSGSAKSRVLRH 60
           MVGVIMAN NLCIP CE  GFP LHCT NSH     SFF SSVSG+  +   AK+RVLRH
Sbjct: 1   MVGVIMANLNLCIPNCERYGFPTLHCTHNSHNSFWVSFFPSSVSGTDSSLSDAKNRVLRH 60

Query: 61  RGHKCGAIKASSKGESDIRLASGNLLENDFQFKPSFDEYVRVMESVRSRRYKRQSDDPNK 120
           R HKCG+IKA S GESDI L SGNLLE+DFQFKPSFDEYV+VME+VR+RRYKRQ DDPNK
Sbjct: 61  RVHKCGSIKALSNGESDISLPSGNLLEHDFQFKPSFDEYVKVMETVRTRRYKRQLDDPNK 120

Query: 121 --MKENASAKSAESTSISNI------VTDVQGNMDVKKKVICVDQEDLFDNSERITRKID 180
             MKEN SAKSAESTSIS I      VTDVQ N+DVK     VD++DLF+N+ERI  + D
Sbjct: 121 LTMKENGSAKSAESTSISKIDNGKNKVTDVQHNVDVKNMFKRVDKKDLFNNTERIAPEKD 180

Query: 181 LSGNKFDSKRKGVTRSKDELKGKVTPFDSQVNDKQHVEKRNGNWSNYIEPKVTRSNHDKR 240
           LSGNKFD +RK VTRS D++KGK+TPF S VNDKQH EKRN NWS+YIEP+VTRSN  K 
Sbjct: 181 LSGNKFD-RRKVVTRSNDKVKGKMTPFGSLVNDKQHEEKRNENWSSYIEPRVTRSNSKKP 240

Query: 241 LHFKANTLDVKSESHGVRYGSSMKISEKIWA--DDDTKRTKDVLKVGKYGVQLEGNYIPG 300
           +HFKANTL+VK ES  V  G+SMK SEKIWA  DDD K  K VLK GKYG+QLE +Y PG
Sbjct: 241 IHFKANTLEVKKESSRVSDGNSMKTSEKIWAWGDDDAKPAKGVLKAGKYGIQLERSYNPG 300

Query: 301 DKVGRKKTEQSYRGLSKSGKQFHEFTEESSLEVEHAAFNSCDAEDIMDKPRVSKMEMEER 360
           DKVGRKKTEQSYRG S SGK+F EF E++SLEVEHAAFN+ DA DIMDKPRVSKMEMEER
Sbjct: 301 DKVGRKKTEQSYRGTSTSGKRFLEFNEKNSLEVEHAAFNNFDAFDIMDKPRVSKMEMEER 360

Query: 361 IQMLSKRLNGADIDMPEWMFAQMMRSAKIRYSDHSILRVIQVLGKLGNWKRVLQVIEWLQ 420
           IQMLSKRLNGADIDMPEWMF+QMMRSAKIRYSDHSILRVIQVLGKLGNW+RVLQ+IEWLQ
Sbjct: 361 IQMLSKRLNGADIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQIIEWLQ 420

Query: 421 MRERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQEHFSSYPDLVAYHSIAVTLGQA 480
           MRERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQEHFSSYPDLVAYHSIAVTLGQA
Sbjct: 421 MRERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQEHFSSYPDLVAYHSIAVTLGQA 480

Query: 481 GYMRELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKNWEGAFWVL 540
           GYMRELFDVIDSMRSPPKKKFKTG LEKWDPRLQPDIVIYNAVLNACVKRKN EGAFWVL
Sbjct: 481 GYMRELFDVIDSMRSPPKKKFKTGVLEKWDPRLQPDIVIYNAVLNACVKRKNLEGAFWVL 540

Query: 541 QELKEQGLQPSTTTYGLVMEVMLQCGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKEG 600
           QELK+Q LQPST+TYGLVMEVML+CGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKEG
Sbjct: 541 QELKKQSLQPSTSTYGLVMEVMLECGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKEG 600

Query: 601 KTDEAVLAIQTMEKRGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTYT 660
           KTDEAVLAI+ ME RGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTYT
Sbjct: 601 KTDEAVLAIENMEIRGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTYT 660

Query: 661 GLIQACLDSKNLQSAVYIFNHMKAFCSPNLVTCNILLKGYLDHGMFDEAKELFQNMSENG 720
           GLIQACLDSK+LQSAVYIFNHMKAFCSPNLVT NILLKGYL+HGMF+EA+ELFQN+SE  
Sbjct: 661 GLIQACLDSKDLQSAVYIFNHMKAFCSPNLVTYNILLKGYLEHGMFEEARELFQNLSEQR 720

Query: 721 RNISAVSDYRDRVLPDIYTFNTMLDASFAEKRWDDFSHFYNQMLLYGYHFNPKRHLRMIM 780
           RNIS VSDYRDRVLPDIY FNTMLDASFAEKRWDDFS+FYNQM LYGYHFNPKRHLRMI+
Sbjct: 721 RNISTVSDYRDRVLPDIYMFNTMLDASFAEKRWDDFSYFYNQMFLYGYHFNPKRHLRMIL 780

Query: 781 EAARGGKDELLETTWKHLAQADRTLPPPLIKERFCIMLARGDYSEALSCISKHHSSDEHH 840
           EAARGGKDELLETTWKHLAQADRT PPPL+KERFC+ LARGDYSEALS I  H+S D HH
Sbjct: 781 EAARGGKDELLETTWKHLAQADRTPPPPLLKERFCMKLARGDYSEALSSIWSHNSGDAHH 840

Query: 841 FSKSAWLNLLKEKRFPKDSVIELIHKVSMLLARNDSPNPVLQNLLLSGKEFCRSRISVAD 897
           FS+SAWLNLLKEKRFP+D+VIELIHKV M+L RN+SPNPV +NLLLS KEFCR+RIS+AD
Sbjct: 841 FSESAWLNLLKEKRFPRDTVIELIHKVGMVLTRNESPNPVFKNLLLSCKEFCRTRISLAD 900

BLAST of Cp4.1LG20g05810 vs. ExPASy TrEMBL
Match: A0A6J1CLQ9 (pentatricopeptide repeat-containing protein At1g30610, chloroplastic OS=Momordica charantia OX=3673 GN=LOC111012614 PE=4 SV=1)

HSP 1 Score: 1479 bits (3829), Expect = 0.0
Identity = 749/918 (81.59%), Positives = 814/918 (88.67%), Query Frame = 0

Query: 1   MVGVIMANANLCIPCCEGNGFPALHCTQNSHYLLGFSFFTSSVSGSGLNSGSAKSRVLRH 60
           MVGVIMANAN+CIPCCE NGF ALHCTQ+SH L GFS F S +SG GLN G  K+R+ R+
Sbjct: 1   MVGVIMANANMCIPCCERNGFRALHCTQSSHNLFGFSLFPSPISGIGLNVGYEKNRIFRY 60

Query: 61  RGHKCGAIKASSKGESDIRLASGNLLENDFQFKPSFDEYVRVMESVRSRRYKRQSDDPNK 120
           RG+KCGAI+ SSKGESDIRL +GN+LENDF FKPSFDEYVRVMESVR+ RYK+Q DDPNK
Sbjct: 61  RGNKCGAIRVSSKGESDIRLQNGNVLENDFLFKPSFDEYVRVMESVRTSRYKKQPDDPNK 120

Query: 121 --MKENASAKSAESTSISNI------VTDVQGNMDVKKKVICVDQEDLFDNSERITRKID 180
             MKENASAKSAES+S+S I      VTDVQGN+DVK     VDQ+ LF+N+ER+TRK D
Sbjct: 121 LKMKENASAKSAESSSVSEIDNEKTKVTDVQGNVDVKNMFKRVDQKKLFNNAERVTRKKD 180

Query: 181 LSGNKFDSKRKGVTRSKDELKGKVTPFDSQVNDKQHVEKRNGNWSNYIEPKVTRSNHDKR 240
           L  NKFD+KRKG+TR+KDE +GKVT FDSQVNDKQH E+R  N  + IEPKV R N++  
Sbjct: 181 LLENKFDNKRKGITRTKDEFRGKVTHFDSQVNDKQHEEQRKRNRLDCIEPKVRRLNNEAL 240

Query: 241 LHFKANTLDVKSESHGVRYGSSMKISEKIWADDDTKRTKDVLKVGKYGVQLEGNYIPGDK 300
           +  KANTLD+K +   V   SSMK  E+IWAD DTK  K  L+VGK GVQL  NY+PG+K
Sbjct: 241 VCSKANTLDIKRQRQRVCDESSMKTVERIWADGDTKLAKGDLEVGKSGVQLARNYVPGEK 300

Query: 301 VGRKKTEQSYRGLSKSGKQFHEFTEESSLEVEHAAFNSCDAEDIMDKPRVSKMEMEERIQ 360
           V  KKT QSY+GLSKSGK F E TEESSLEVE AA N+ DA DIMDKPRVSKMEMEERIQ
Sbjct: 301 VSGKKTGQSYQGLSKSGKPFIESTEESSLEVERAALNNFDALDIMDKPRVSKMEMEERIQ 360

Query: 361 MLSKRLNGADIDMPEWMFAQMMRSAKIRYSDHSILRVIQVLGKLGNWKRVLQVIEWLQMR 420
           MLSKRLNGADIDMPEWMFAQMMRSAKIRYSDHSILRVIQVLGKLGNWKRVLQVIEWLQMR
Sbjct: 361 MLSKRLNGADIDMPEWMFAQMMRSAKIRYSDHSILRVIQVLGKLGNWKRVLQVIEWLQMR 420

Query: 421 ERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQEHFSSYPDLVAYHSIAVTLGQAGY 480
           ERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQ+HFSSYPDLVAYHSIAVTLGQAGY
Sbjct: 421 ERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQQHFSSYPDLVAYHSIAVTLGQAGY 480

Query: 481 MRELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKNWEGAFWVLQE 540
           MRELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKNWEGAFWVLQE
Sbjct: 481 MRELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKNWEGAFWVLQE 540

Query: 541 LKEQGLQPSTTTYGLVMEVMLQCGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKEGKT 600
           LK+QGLQPST+TYGLVMEVML+CGKYNLVHEFFRKVQ+SSIPNALTYKVLVNTL KEGKT
Sbjct: 541 LKKQGLQPSTSTYGLVMEVMLECGKYNLVHEFFRKVQRSSIPNALTYKVLVNTLSKEGKT 600

Query: 601 DEAVLAIQTMEKRGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTYTGL 660
           DEAVLAIQ ME+RGIVGSAALYYDFARCLCSAGRCKEALMQ+EKICKVANKPLVVTYTGL
Sbjct: 601 DEAVLAIQNMERRGIVGSAALYYDFARCLCSAGRCKEALMQIEKICKVANKPLVVTYTGL 660

Query: 661 IQACLDSKNLQSAVYIFNHMKAFCSPNLVTCNILLKGYLDHGMFDEAKELFQNMSENGRN 720
           IQACLDSKNL SAVYIFNHMKAFCSPNLVT NILLKGYLDHGMF+EA+ELFQN+SE+G++
Sbjct: 661 IQACLDSKNLDSAVYIFNHMKAFCSPNLVTYNILLKGYLDHGMFEEARELFQNLSESGQS 720

Query: 721 ISAVSDYRDRVLPDIYTFNTMLDASFAEKRWDDFSHFYNQMLLYGYHFNPKRHLRMIMEA 780
           IS +SDY+DRVLPDIYTFN MLDA FA KRWDDF +FYNQM LYGYHFNPKRHLRMI+EA
Sbjct: 721 ISTISDYKDRVLPDIYTFNIMLDAFFAVKRWDDFGYFYNQMFLYGYHFNPKRHLRMILEA 780

Query: 781 ARGGKDELLETTWKHLAQADRTLPPPLIKERFCIMLARGDYSEALSCISKHHSSDEHHFS 840
            R GKDE+LETTWKHLAQ DRTLPPPL+KERFC+ LARGDYSEALSCIS HHSSD HHFS
Sbjct: 781 GRAGKDEILETTWKHLAQTDRTLPPPLVKERFCMKLARGDYSEALSCISNHHSSDAHHFS 840

Query: 841 KSAWLNLLKEKRFPKDSVIELIHKVSMLLARNDSPNPVLQNLLLSGKEFCRSRISVADPR 900
           +SAWLNLLKEK FPKD+VI LIHKVSMLL  N  PNPV QNLL S KEFCR+RI+VAD +
Sbjct: 841 ESAWLNLLKEKGFPKDTVILLIHKVSMLLTGNHPPNPVFQNLLSSCKEFCRTRITVADSK 900

Query: 901 LEEVVCTNEFQSAAVMHV 910
           LE++VC +E QSAAVMH+
Sbjct: 901 LEQIVCRDETQSAAVMHI 918

BLAST of Cp4.1LG20g05810 vs. ExPASy TrEMBL
Match: A0A1S3C8Z0 (pentatricopeptide repeat-containing protein At1g30610, chloroplastic OS=Cucumis melo OX=3656 GN=LOC103498323 PE=4 SV=1)

HSP 1 Score: 1476 bits (3822), Expect = 0.0
Identity = 754/913 (82.58%), Positives = 811/913 (88.83%), Query Frame = 0

Query: 1   MVGVIMANANLCIPCCEGNGFPALHCTQNSHYLLGFSFFTSSVSGSG--LNSGSAKSRVL 60
           MVGVIMAN NL IP CE  GFP LHCT NSH     SFF SSVSG G  LN   AK+RVL
Sbjct: 1   MVGVIMANVNLSIPNCERYGFPTLHCTHNSHTSFWVSFFPSSVSGGGTDLNFSDAKNRVL 60

Query: 61  RHRGHKCGAIKASSKGESDIRLASGNLLENDFQFKPSFDEYVRVMESVRSRRYKRQSDDP 120
           RHR HKCG+IKA S GESDI L +GNLLE+DFQFKPSFDEYV+VME+VR+RRYKRQ D P
Sbjct: 61  RHRIHKCGSIKALSNGESDISLPNGNLLEHDFQFKPSFDEYVKVMETVRTRRYKRQLDYP 120

Query: 121 NK--MKENASAKSAESTSISNI------VTDVQGNMDVKKKVICVDQEDLFDNSERITRK 180
           NK  MKEN SAKSAESTSIS I      VTDVQ N++VK     VD++DLF+N+ERI R+
Sbjct: 121 NKLTMKENCSAKSAESTSISKIDNGKNKVTDVQHNVEVKNMFKRVDKKDLFNNTERIARE 180

Query: 181 IDLSGNKFDSKRKGVTRSKDELKGKVTPFDSQVNDKQHVEKRNGNWSNYIEPKVTRSNHD 240
             LSGNKFD + KGVTRS D++KGK+TPF S VNDKQH EK+NGNWS+YIEPKVTRSN +
Sbjct: 181 KHLSGNKFD-RSKGVTRSNDKVKGKMTPFGSLVNDKQHEEKKNGNWSSYIEPKVTRSNCE 240

Query: 241 KRLHFKANTLDVKSESHGVRYGSSMKISEKIWA--DDDTKRTKDVLKVGKYGVQLEGNYI 300
           K +HFKAN L+ K E   V YG+SMK SEKIWA  +DD K  KDVLK GKYG+QLE +Y 
Sbjct: 241 KPIHFKANALEFKKEGSRVSYGNSMKTSEKIWAWGEDDAKPAKDVLKAGKYGIQLERSYS 300

Query: 301 PGDKVGRKKTEQSYRGLSKSGKQFHEFTEESSLEVEHAAFNSCDAEDIMDKPRVSKMEME 360
           PGDKVGRKKTEQSYRG S SGK+F EFTEE+SLEVEHAAFN+ DA DIMDKPRVSKMEME
Sbjct: 301 PGDKVGRKKTEQSYRGTSTSGKRFLEFTEENSLEVEHAAFNNFDALDIMDKPRVSKMEME 360

Query: 361 ERIQMLSKRLNGADIDMPEWMFAQMMRSAKIRYSDHSILRVIQVLGKLGNWKRVLQVIEW 420
           ERIQMLSKRLNGADIDMPEWMF+QMMR AKIRYSDHSILRVIQVLGKLGNW+RVLQVIEW
Sbjct: 361 ERIQMLSKRLNGADIDMPEWMFSQMMRGAKIRYSDHSILRVIQVLGKLGNWRRVLQVIEW 420

Query: 421 LQMRERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQEHFSSYPDLVAYHSIAVTLG 480
           LQMRERFKSHK RFIYTTALDVLGKARRPVEALNVFHAMQEHFSSYPDLVAYHSIAVTLG
Sbjct: 421 LQMRERFKSHKPRFIYTTALDVLGKARRPVEALNVFHAMQEHFSSYPDLVAYHSIAVTLG 480

Query: 481 QAGYMRELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKNWEGAFW 540
           QAGYMRELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKN EGAFW
Sbjct: 481 QAGYMRELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKNLEGAFW 540

Query: 541 VLQELKEQGLQPSTTTYGLVMEVMLQCGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWK 600
           VLQELK+QGLQPST+TYGLVMEVML+CGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWK
Sbjct: 541 VLQELKKQGLQPSTSTYGLVMEVMLECGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWK 600

Query: 601 EGKTDEAVLAIQTMEKRGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVT 660
           EGKTDEAVLAI+ ME RG+VGSAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVT
Sbjct: 601 EGKTDEAVLAIENMEMRGVVGSAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVT 660

Query: 661 YTGLIQACLDSKNLQSAVYIFNHMKAFCSPNLVTCNILLKGYLDHGMFDEAKELFQNMSE 720
           YTGLIQACLDSK+LQSAVY+FN MKAFCSPNLVT NILLKGYL+HGMF+EA+EL QN+SE
Sbjct: 661 YTGLIQACLDSKDLQSAVYVFNQMKAFCSPNLVTYNILLKGYLEHGMFEEARELLQNLSE 720

Query: 721 NGRNISAVSDYRDRVLPDIYTFNTMLDASFAEKRWDDFSHFYNQMLLYGYHFNPKRHLRM 780
             +NIS VSDYRDRVLPDIY FNTMLDASFAEKRWDDFS+FYNQM LYGYHFNPKRHLRM
Sbjct: 721 QRQNISTVSDYRDRVLPDIYMFNTMLDASFAEKRWDDFSYFYNQMFLYGYHFNPKRHLRM 780

Query: 781 IMEAARGGKDELLETTWKHLAQADRTLPPPLIKERFCIMLARGDYSEALSCISKHHSSDE 840
           I+EAAR GKDELLETTWKHLAQADRT PPPL+KERFC+ +ARGDY+EAL CIS H+S D 
Sbjct: 781 ILEAARVGKDELLETTWKHLAQADRTPPPPLLKERFCMKVARGDYTEALRCISNHNSGDA 840

Query: 841 HHFSKSAWLNLLKEKRFPKDSVIELIHKVSMLLARNDSPNPVLQNLLLSGKEFCRSRISV 900
           HHFS+SAWLNLLKEKRFPKD+VIELIHKV M+ A N+SPNPV +NLLLS KEFCR+RISV
Sbjct: 841 HHFSESAWLNLLKEKRFPKDTVIELIHKVGMVFATNESPNPVFKNLLLSCKEFCRTRISV 900

BLAST of Cp4.1LG20g05810 vs. TAIR 10
Match: AT1G30610.2 (pentatricopeptide (PPR) repeat-containing protein)

HSP 1 Score: 750.4 bits (1936), Expect = 1.7e-216
Identity = 383/674 (56.82%), Positives = 499/674 (74.04%), Query Frame = 0

Query: 221 EPKVTRSNHDKRLHFKANTLDVKSESHGVRYGSSMKISEKIWADDDTKRTKDVLKVGKYG 280
           E +V R   D R      +L   SE    R G+  +   +     DT R  +    G  G
Sbjct: 304 ERRVQRIAKDVRWSKSDESLVPVSEDESFRRGNPKQEMVRYQRVSDTSRGIERGSKGD-G 363

Query: 281 VQLEGNYIPGDKVGRKKTE---QSYRGLSKSGKQFHEFTEESSLEVEHAAFN-SCDAEDI 340
           + L       +++  ++ E       G  + G + ++  ++S   +E  AF  S ++ DI
Sbjct: 364 LDLLAEERRIERLANERHEIRSSKLSGTRRIGAKRNDDDDDSLFAMETPAFRFSDESSDI 423

Query: 341 MDKPRVSKMEMEERIQMLSKRLNGADIDMPEWMFAQMMRSAKIRYSDHSILRVIQVLGKL 400
           +DKP  S++EME+RI+ L+K LNGADI+MPEW F++ +RSAKIRY+D++++R+I  LGKL
Sbjct: 424 VDKPATSRVEMEDRIEKLAKVLNGADINMPEWQFSKAIRSAKIRYTDYTVMRLIHFLGKL 483

Query: 401 GNWKRVLQVIEWLQMRERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQEHFSSYPD 460
           GNW+RVLQVIEWLQ ++R+KS+K+R IYTTAL+VLGK+RRPVEALNVFHAM    SSYPD
Sbjct: 484 GNWRRVLQVIEWLQRQDRYKSNKIRIIYTTALNVLGKSRRPVEALNVFHAMLLQISSYPD 543

Query: 461 LVAYHSIAVTLGQAGYMRELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNA 520
           +VAY SIAVTLGQAG+++ELF VID+MRSPPKKKFK   LEKWDPRL+PD+V+YNAVLNA
Sbjct: 544 MVAYRSIAVTLGQAGHIKELFYVIDTMRSPPKKKFKPTTLEKWDPRLEPDVVVYNAVLNA 603

Query: 521 CVKRKNWEGAFWVLQELKEQGLQPSTTTYGLVMEVMLQCGKYNLVHEFFRKVQKSSIPNA 580
           CV+RK WEGAFWVLQ+LK++G +PS  TYGL+MEVML C KYNLVHEFFRK+QKSSIPNA
Sbjct: 604 CVQRKQWEGAFWVLQQLKQRGQKPSPVTYGLIMEVMLACEKYNLVHEFFRKMQKSSIPNA 663

Query: 581 LTYKVLVNTLWKEGKTDEAVLAIQTMEKRGIVGSAALYYDFARCLCSAGRCKEALMQMEK 640
           L Y+VLVNTLWKEGK+DEAV  ++ ME RGIVGSAALYYD ARCLCSAGRC E L  ++K
Sbjct: 664 LAYRVLVNTLWKEGKSDEAVHTVEDMESRGIVGSAALYYDLARCLCSAGRCNEGLNMLKK 723

Query: 641 ICKVANKPLVVTYTGLIQACLDSKNLQSAVYIFNHMKAFCSPNLVTCNILLKGYLDHGMF 700
           IC+VANKPLVVTYTGLIQAC+DS N+++A YIF+ MK  CSPNLVTCNI+LK YL  G+F
Sbjct: 724 ICRVANKPLVVTYTGLIQACVDSGNIKNAAYIFDQMKKVCSPNLVTCNIMLKAYLQGGLF 783

Query: 701 DEAKELFQNMSENGRNISAVSDYRDRVLPDIYTFNTMLDASFAEKRWDDFSHFYNQMLLY 760
           +EA+ELFQ MSE+G +I   SD+  RVLPD YTFNTMLD    +++WDDF + Y +ML +
Sbjct: 784 EEARELFQKMSEDGNHIKNSSDFESRVLPDTYTFNTMLDTCAEQEKWDDFGYAYREMLRH 843

Query: 761 GYHFNPKRHLRMIMEAARGGKDELLETTWKHLAQADRTLPPPLIKERFCIMLARGDYSEA 820
           GYHFN KRHLRM++EA+R GK+E++E TW+H+ +++R  P PLIKERF   L +GD+  A
Sbjct: 844 GYHFNAKRHLRMVLEASRAGKEEVMEATWEHMRRSNRIPPSPLIKERFFRKLEKGDHISA 903

Query: 821 LSCIS----KHHSSDEHHFSKSAWLNLLKEKRFPKDSVIELIHKVSMLL-ARNDSPNPVL 880
           +S ++    K   ++   FS SAW  +L   RF +DSV+ L+  V+  L +R++S + VL
Sbjct: 904 ISSLADLNGKIEETELRAFSTSAWSRVL--SRFEQDSVLRLMDDVNRRLGSRSESSDSVL 963

Query: 881 QNLLLSGKEFCRSR 886
            NLL S K++ ++R
Sbjct: 964 GNLLSSCKDYLKTR 974


HSP 2 Score: 38.1 bits (87), Expect = 4.4e-02
Identity = 15/30 (50.00%), Positives = 25/30 (83.33%), Query Frame = 0

Query: 87  ENDFQFKPSFDEYVRVMESVRSRRYKRQSD 117
           +  F+FKPSFD+Y+++MESV++ R K++ D
Sbjct: 68  DKGFEFKPSFDQYLQIMESVKTARKKKKFD 97

BLAST of Cp4.1LG20g05810 vs. TAIR 10
Match: AT1G30610.1 (pentatricopeptide (PPR) repeat-containing protein )

HSP 1 Score: 728.4 bits (1879), Expect = 7.1e-210
Identity = 414/861 (48.08%), Positives = 561/861 (65.16%), Query Frame = 0

Query: 62   GHKCGAIKASSKGESDIRLASGNLLENDFQFKPSFDEYVRVMESVRSRRYKRQSDDPNKM 121
            G    A+K S  GES + +      +  F+ + S  EY R  ++ R      + D+ + +
Sbjct: 172  GESSVALKLSKSGESSVTVPE----DESFRKRYSKQEYHRSSDTSRGIERGSRGDELDLV 231

Query: 122  KENASAKSAESTSISNIVTDVQGNMDVKKKVICVDQEDLFDNSERITRKIDLSGNKFDSK 181
                     E   +  I  D + +   ++  + V   +  ++S  +T   D S  +  SK
Sbjct: 232  --------VEERRVQRIAKDARWSKS-RESSVAVKWSNSGESS--VTMPKDESFRRRYSK 291

Query: 182  RKGVTRSKDELKGKVTPFDSQVNDKQHVEKRNGNWSNYIEPKVTRSNHDKRLHFKANTLD 241
            ++   RS D  +G      S+ ++ + V +         E +V R   D R      +L 
Sbjct: 292  QEH-HRSSDTSRGIAR--GSKGDELELVVE---------ERRVQRIAKDVRWSKSDESLV 351

Query: 242  VKSESHGVRYGSSMKISEKIWADDDTKRTKDVLKVGKYGVQLEGNYIPGDKVGRKKTE-- 301
              SE    R G+  +   +     DT R  +    G  G+ L       +++  ++ E  
Sbjct: 352  PVSEDESFRRGNPKQEMVRYQRVSDTSRGIERGSKGD-GLDLLAEERRIERLANERHEIR 411

Query: 302  -QSYRGLSKSGKQFHEFTEESSLEVEHAAFN-SCDAEDIMDKPRVSKMEMEERIQMLSKR 361
                 G  + G + ++  ++S   +E  AF  S ++ DI+DKP  S++EME+RI+ L+K 
Sbjct: 412  SSKLSGTRRIGAKRNDDDDDSLFAMETPAFRFSDESSDIVDKPATSRVEMEDRIEKLAKV 471

Query: 362  LNGADIDMPEWMFAQMMRSAKIRYSDHSILRVIQVLGKLGNWKRVLQVIEWLQMRERFKS 421
            LNGADI+MPEW F++ +RSAKIRY+D++++R+I  LGKLGNW+RVLQVIEWLQ ++R+KS
Sbjct: 472  LNGADINMPEWQFSKAIRSAKIRYTDYTVMRLIHFLGKLGNWRRVLQVIEWLQRQDRYKS 531

Query: 422  HKLRFIYTTALDVLGKARRPVEALNVFHAMQEHFSSYPDLVAYHSIAVTLGQAGYMRELF 481
            +K+R IYTTAL+VLGK+RRPVEALNVFHAM    SSYPD+VAY SIAVTLGQAG+++ELF
Sbjct: 532  NKIRIIYTTALNVLGKSRRPVEALNVFHAMLLQISSYPDMVAYRSIAVTLGQAGHIKELF 591

Query: 482  DVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKNWEGAFWVLQELKEQG 541
             VID+MRSPPKKKFK   LEKWDPRL+PD+V+YNAVLNACV+RK WEGAFWVLQ+LK++G
Sbjct: 592  YVIDTMRSPPKKKFKPTTLEKWDPRLEPDVVVYNAVLNACVQRKQWEGAFWVLQQLKQRG 651

Query: 542  LQPSTTTYGLVMEVMLQCGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKEGKTDEAVL 601
             +PS  TYGL+MEVML C KYNLVHEFFRK+QKSSIPNAL Y+VLVNTLWKEGK+DEAV 
Sbjct: 652  QKPSPVTYGLIMEVMLACEKYNLVHEFFRKMQKSSIPNALAYRVLVNTLWKEGKSDEAVH 711

Query: 602  AIQTMEKRGIVGSAALYYDFARCLCSAGRCKEAL-------------------------- 661
             ++ ME RGIVGSAALYYD ARCLCSAGRC E L                          
Sbjct: 712  TVEDMESRGIVGSAALYYDLARCLCSAGRCNEGLNMVNFVNPVVLKLIENLIYKADLVHT 771

Query: 662  --MQMEKICKVANKPLVVTYTGLIQACLDSKNLQSAVYIFNHMKAFCSPNLVTCNILLKG 721
               Q++KIC+VANKPLVVTYTGLIQAC+DS N+++A YIF+ MK  CSPNLVTCNI+LK 
Sbjct: 772  IQFQLKKICRVANKPLVVTYTGLIQACVDSGNIKNAAYIFDQMKKVCSPNLVTCNIMLKA 831

Query: 722  YLDHGMFDEAKELFQNMSENGRNISAVSDYRDRVLPDIYTFNTMLDASFAEKRWDDFSHF 781
            YL  G+F+EA+ELFQ MSE+G +I   SD+  RVLPD YTFNTMLD    +++WDDF + 
Sbjct: 832  YLQGGLFEEARELFQKMSEDGNHIKNSSDFESRVLPDTYTFNTMLDTCAEQEKWDDFGYA 891

Query: 782  YNQMLLYGYHFNPKRHLRMIMEAARGGKDELLETTWKHLAQADRTLPPPLIKERFCIMLA 841
            Y +ML +GYHFN KRHLRM++EA+R GK+E++E TW+H+ +++R  P PLIKERF   L 
Sbjct: 892  YREMLRHGYHFNAKRHLRMVLEASRAGKEEVMEATWEHMRRSNRIPPSPLIKERFFRKLE 951

Query: 842  RGDYSEALSCIS----KHHSSDEHHFSKSAWLNLLKEKRFPKDSVIELIHKVSMLL-ARN 886
            +GD+  A+S ++    K   ++   FS SAW  +L   RF +DSV+ L+  V+  L +R+
Sbjct: 952  KGDHISAISSLADLNGKIEETELRAFSTSAWSRVL--SRFEQDSVLRLMDDVNRRLGSRS 1002


HSP 2 Score: 38.1 bits (87), Expect = 4.4e-02
Identity = 15/30 (50.00%), Positives = 25/30 (83.33%), Query Frame = 0

Query: 87  ENDFQFKPSFDEYVRVMESVRSRRYKRQSD 117
           +  F+FKPSFD+Y+++MESV++ R K++ D
Sbjct: 68  DKGFEFKPSFDQYLQIMESVKTARKKKKFD 97

BLAST of Cp4.1LG20g05810 vs. TAIR 10
Match: AT5G67570.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 410.2 bits (1053), Expect = 4.3e-114
Identity = 221/557 (39.68%), Positives = 339/557 (60.86%), Query Frame = 0

Query: 349 ERIQMLSKRLNGADIDMPEWMFAQMMRSAKIRYSDHSILRVIQVLGKLGNWKRVLQVIEW 408
           E +++L  RL+G +I+   W F +MM  + +++++  +L+++  LG+  +WK+   V+ W
Sbjct: 183 EAVRVLVDRLSGREINEKHWKFVRMMNQSGLQFTEDQMLKIVDRLGRKQSWKQASAVVHW 242

Query: 409 LQMRERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQEHFSSYPDLVAYHSIAVTLG 468
           +   ++ K  + RF+YT  L VLG ARRP EAL +F+ M      YPD+ AYH IAVTLG
Sbjct: 243 VYSDKKRKHLRSRFVYTKLLSVLGFARRPQEALQIFNQMLGDRQLYPDMAAYHCIAVTLG 302

Query: 469 QAGYMRELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKNWEGAFW 528
           QAG ++EL  VI+ MR  P K  K    + WDP L+PD+V+YNA+LNACV    W+   W
Sbjct: 303 QAGLLKELLKVIERMRQKPTKLTKNLRQKNWDPVLEPDLVVYNAILNACVPTLQWKAVSW 362

Query: 529 VLQELKEQGLQPSTTTYGLVMEVMLQCGKYNLVHEFFRKVQKS-SIPNALTYKVLVNTLW 588
           V  EL++ GL+P+  TYGL MEVML+ GK++ VH+FFRK++ S   P A+TYKVLV  LW
Sbjct: 363 VFVELRKNGLRPNGATYGLAMEVMLESGKFDRVHDFFRKMKSSGEAPKAITYKVLVRALW 422

Query: 589 KEGKTDEAVLAIQTMEKRGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVAN-KPLV 648
           +EGK +EAV A++ ME++G++G+ ++YY+ A CLC+ GR  +A++++ ++ ++ N +PL 
Sbjct: 423 REGKIEEAVEAVRDMEQKGVIGTGSVYYELACCLCNNGRWCDAMLEVGRMKRLENCRPLE 482

Query: 649 VTYTGLIQACLDSKNLQSAVYIFNHMKAFCSPNLVTCNILLKGYLDHGMFDEAKELFQNM 708
           +T+TGLI A L+  ++   + IF +MK  C PN+ T N++LK Y  + MF EAKELF+ +
Sbjct: 483 ITFTGLIAASLNGGHVDDCMAIFQYMKDKCDPNIGTANMMLKVYGRNDMFSEAKELFEEI 542

Query: 709 SENGRNISAVSDYRDRVLPDIYTFNTMLDASFAEKRWDDFSHFYNQMLLYGYHFNPKRHL 768
                    VS     ++P+ YT++ ML+AS    +W+ F H Y  M+L GY  +  +H 
Sbjct: 543 ---------VSRKETHLVPNEYTYSFMLEASARSLQWEYFEHVYQTMVLSGYQMDQTKHA 602

Query: 769 RMIMEAARGGKDELLETTWKHLAQADRTLPPPL-IKERFCIMLARGDYSEALSCISKHHS 828
            M++EA+R GK  LLE  +  + + D  +P PL   E  C   A+GD+  A++ I+   +
Sbjct: 603 SMLIEASRAGKWSLLEHAFDAVLE-DGEIPHPLFFTELLCHATAKGDFQRAITLINT-VA 662

Query: 829 SDEHHFSKSAWLNLLKEKR--FPKDSVIELIHKVSMLLARND-SPNPVLQNLLLSGKEFC 888
                 S+  W +L +E +    +D+    +HK+S  L   D    P + NL  S K  C
Sbjct: 663 LASFQISEEEWTDLFEEHQDWLTQDN----LHKLSDHLIECDYVSEPTVSNLSKSLKSRC 722

Query: 889 RSRISVADPRLEEVVCT 900
            S  S A P L   V T
Sbjct: 723 GSSSSSAQPLLAVDVTT 724

BLAST of Cp4.1LG20g05810 vs. TAIR 10
Match: AT1G12775.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 110.9 bits (276), Expect = 5.4e-24
Identity = 103/477 (21.59%), Positives = 198/477 (41.51%), Query Frame = 0

Query: 354 LSKRLNGADIDMPEWMFAQMMRSAKIRYSDHSILRVIQVLGKLGNWKRVLQVIEWLQMRE 413
           LS  L G   D    +F  M++S  +  +     R+   + K   ++ VL + +  QM  
Sbjct: 60  LSSGLVGIKADDAVDLFRDMIQSRPLP-TVIDFNRLFSAIAKTKQYELVLALCK--QMES 119

Query: 414 RFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQEHFSSYPDLVAYHSIAVTLGQAGYM 473
           +  +H + +  +  ++   + R+   A +    + +     PD V ++++   L     +
Sbjct: 120 KGIAHSI-YTLSIMINCFCRCRKLSYAFSTMGKIMK-LGYEPDTVIFNTLLNGLCLECRV 179

Query: 474 RELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKNWEGAFWVLQEL 533
            E  +++D M     K               P ++  N ++N          A  ++  +
Sbjct: 180 SEALELVDRMVEMGHK---------------PTLITLNTLVNGLCLNGKVSDAVVLIDRM 239

Query: 534 KEQGLQPSTTTYGLVMEVMLQCGKYNLVHEFFRKVQKSSIP-NALTYKVLVNTLWKEGKT 593
            E G QP+  TYG V+ VM + G+  L  E  RK+++ +I  +A+ Y ++++ L K+G  
Sbjct: 240 VETGFQPNEVTYGPVLNVMCKSGQTALAMELLRKMEERNIKLDAVKYSIIIDGLCKDGSL 299

Query: 594 DEAVLAIQTMEKRGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTYTGL 653
           D A      ME +G       Y       C+AGR  +    +  + K    P VVT++ L
Sbjct: 300 DNAFNLFNEMEIKGFKADIITYNTLIGGFCNAGRWDDGAKLLRDMIKRKISPNVVTFSVL 359

Query: 654 IQACLDSKNLQSAVYIFNH-MKAFCSPNLVTCNILLKGYLDHGMFDEAKELFQNMSENGR 713
           I + +    L+ A  +    M+   +PN +T N L+ G+      +EA ++   M   G 
Sbjct: 360 IDSFVKEGKLREADQLLKEMMQRGIAPNTITYNSLIDGFCKENRLEEAIQMVDLMISKGC 419

Query: 714 NISAVSDYRDRVLPDIYTFNTMLDASFAEKRWDDFSHFYNQMLLYGYHFNPKRHLRMIME 773
           +            PDI TFN +++      R DD    + +M L G   N   +  ++  
Sbjct: 420 D------------PDIMTFNILINGYCKANRIDDGLELFREMSLRGVIANTVTYNTLVQG 479

Query: 774 AARGGKDELLETTWKHLAQADRTLPPPLIKERFCI--MLARGDYSEALSCISKHHSS 827
             + GK E+ +  ++ +    R + P ++  +  +  +   G+  +AL    K   S
Sbjct: 480 FCQSGKLEVAKKLFQEM--VSRRVRPDIVSYKILLDGLCDNGELEKALEIFGKIEKS 502

BLAST of Cp4.1LG20g05810 vs. TAIR 10
Match: AT2G17140.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 107.8 bits (268), Expect = 4.5e-23
Identity = 58/215 (26.98%), Positives = 107/215 (49.77%), Query Frame = 0

Query: 501 PRLQPDIVIYNAVLNACVKRKNWEGAFWVLQELKEQGLQPSTTTYGLVMEVMLQCGKYNL 560
           P  +P + +YN +L +C+K +  E   W+ +++   G+ P T T+ L++  +      + 
Sbjct: 106 PENKPSVYLYNLLLESCIKERRVEFVSWLYKDMVLCGIAPQTYTFNLLIRALCDSSCVDA 165

Query: 561 VHEFFRKV-QKSSIPNALTYKVLVNTLWKEGKTDEAVLAIQTMEKRGIVGSAALYYDFAR 620
             E F ++ +K   PN  T+ +LV    K G TD+ +  +  ME  G++ +  +Y     
Sbjct: 166 ARELFDEMPEKGCKPNEFTFGILVRGYCKAGLTDKGLELLNAMESFGVLPNKVIYNTIVS 225

Query: 621 CLCSAGRCKEALMQMEKICKVANKPLVVTYTGLIQACLDSKNLQSAVYIFNHMK-----A 680
             C  GR  ++   +EK+ +    P +VT+   I A      +  A  IF+ M+      
Sbjct: 226 SFCREGRNDDSEKMVEKMREEGLVPDIVTFNSRISALCKEGKVLDASRIFSDMELDEYLG 285

Query: 681 FCSPNLVTCNILLKGYLDHGMFDEAKELFQNMSEN 710
              PN +T N++LKG+   G+ ++AK LF+++ EN
Sbjct: 286 LPRPNSITYNLMLKGFCKVGLLEDAKTLFESIREN 320
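
The percentages in each HSP header follow directly from the residue counts printed beside them (matches divided by alignment length). The following is a minimal Python sketch of that arithmetic, assuming the header layout used in the alignments above. The names `HSP_FIELD` and `recompute` are illustrative and are not part of BLAST or of this database.

```python
import re

# Hypothetical helper, not part of any BLAST tool: extract the
# "name = matches/length (pct%)" fields from one HSP header line.
# The pattern also tolerates the "Postives" spelling seen in some exports.
HSP_FIELD = re.compile(r"(Identity|Posi?tives) = (\d+)/(\d+) \(([\d.]+)%\)")


def recompute(header: str) -> dict:
    """Map each field name to (reported_pct, recomputed_pct)."""
    out = {}
    for name, matches, length, pct in HSP_FIELD.findall(header):
        recomputed = round(100 * int(matches) / int(length), 2)
        out[name] = (float(pct), recomputed)
    return out


# Header of HSP 1 against AT1G30610.2, copied from the record above.
header = "Identity = 383/674 (56.82%), Positives = 499/674 (74.04%), Query Frame = 0"
result = recompute(header)
print(result)
```

For this header the recomputed values agree with the reported ones: 383/674 rounds to 56.82% and 499/674 to 74.04%.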

The following BLAST results are available for this feature:
Match Name  E-value   Identity  Description
Q9SA76      1.0e-208  48.08     Pentatricopeptide repeat-containing protein At1g30610, chloroplastic OS=Arabidop...
Q9FJW6      6.0e-113  39.68     Pentatricopeptide repeat-containing protein At5g67570, chloroplastic OS=Arabidop...
Q9LPX2      7.5e-23   21.59     Pentatricopeptide repeat-containing protein At1g12775, mitochondrial OS=Arabidop...
Q0WPZ6      6.4e-22   26.98     Pentatricopeptide repeat-containing protein At2g17140 OS=Arabidopsis thaliana OX...
Q76C99      1.1e-21   24.61     Protein Rf1, mitochondrial OS=Oryza sativa subsp. indica OX=39946 GN=Rf1 PE=2 SV...

Match Name      E-value  Identity  Description
XP_023519692.1  0.0      100.00    pentatricopeptide repeat-containing protein At1g30610, chloroplastic [Cucurbita ...
KAG7019446.1    0.0      97.58     Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita a...
XP_022927392.1  0.0      97.25     LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At1g30610, chlo...
XP_023000737.1  0.0      97.03     pentatricopeptide repeat-containing protein At1g30610, chloroplastic [Cucurbita ...
KAG6583820.1    0.0      94.87     Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita a...

Match Name  E-value  Identity  Description
A0A6J1EH18  0.0      97.25     LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At1g30610, chlo...
A0A6J1KEH7  0.0      97.03     pentatricopeptide repeat-containing protein At1g30610, chloroplastic OS=Cucurbit...
A0A0A0LVN7  0.0      83.35     Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G553530 PE=4 SV=1
A0A6J1CLQ9  0.0      81.59     pentatricopeptide repeat-containing protein At1g30610, chloroplastic OS=Momordic...
A0A1S3C8Z0  0.0      82.58     pentatricopeptide repeat-containing protein At1g30610, chloroplastic OS=Cucumis ...

Match Name   E-value   Identity  Description
AT1G30610.2  1.7e-216  56.82     pentatricopeptide (PPR) repeat-containing protein
AT1G30610.1  7.1e-210  48.08     pentatricopeptide (PPR) repeat-containing protein
AT5G67570.1  4.3e-114  39.68     Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G12775.1  5.4e-24   21.59     Pentatricopeptide repeat (PPR) superfamily protein
AT2G17140.1  4.5e-23   26.98     Pentatricopeptide repeat (PPR) superfamily protein

InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR Term  IPR Description  Source  Source Term  Source Description  Alignment
None  No IPR available  COILS  Coil  Coil  coord: 344..364
None  No IPR available  MOBIDB_LITE  mobidb-lite  disorder_prediction  coord: 107..124
None  No IPR available  MOBIDB_LITE  mobidb-lite  disorder_prediction  coord: 107..132
None  No IPR available  PANTHER  PTHR46935:SF1  OS01G0674700 PROTEIN  coord: 4..889
None  No IPR available  SUPERFAMILY  81901  HCP-like  coord: 516..713
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D  1.25.40.10  Tetratricopeptide repeat domain  coord: 349..493, e-value: 2.3E-14, score: 55.2
    coord: 494..596, e-value: 3.2E-20, score: 74.2
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D  1.25.40.10  Tetratricopeptide repeat domain  coord: 633..791, e-value: 7.7E-25, score: 89.9
IPR002885  Pentatricopeptide repeat  PFAM  PF13812  PPR_3  coord: 645..690, e-value: 5.4E-8, score: 32.8
IPR002885  Pentatricopeptide repeat  PFAM  PF13041  PPR_2  coord: 505..548, e-value: 2.6E-9, score: 37.1
IPR002885  Pentatricopeptide repeat  PFAM  PF01535  PPR  coord: 578..607, e-value: 0.31, score: 11.4
    coord: 423..449, e-value: 0.018, score: 15.3
IPR002885  Pentatricopeptide repeat  TIGRFAM  TIGR00756  TIGR00756  coord: 681..710, e-value: 1.4E-7, score: 29.3
    coord: 423..456, e-value: 2.6E-4, score: 19.0
    coord: 647..674, e-value: 0.0023, score: 16.0
    coord: 508..541, e-value: 6.8E-6, score: 23.9
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR  coord: 575..609, score: 9.152743
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR  coord: 679..713, score: 11.509422
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR  coord: 506..540, score: 12.024604
IPR044645  Pentatricopeptide repeat-containing protein DG1/EMB2279-like  PANTHER  PTHR46935  OS01G0674700 PROTEIN  coord: 4..889
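
The `coord:` fields above give the start and end residue of each signature match on the protein. As a rough illustration, the Python sketch below converts such a field into a match length, assuming the coordinates are 1-based and inclusive (an assumption about this export, not something the record states). The names `COORD` and `span_length` are hypothetical. Under that assumption, each of the three PROSITE PS51375 matches spans 35 residues, the canonical length of a pentatricopeptide repeat.

```python
import re

# Matches fields of the form "coord: 575..609".
COORD = re.compile(r"coord: (\d+)\.\.(\d+)")


def span_length(field: str) -> int:
    """Length of a match, assuming 1-based inclusive coordinates."""
    m = COORD.search(field)
    if m is None:
        raise ValueError(f"no coord field in {field!r}")
    start, end = (int(g) for g in m.groups())
    return end - start + 1


# The three PROSITE PS51375 (PPR) matches listed in the table above.
prosite_hits = ["coord: 575..609", "coord: 679..713", "coord: 506..540"]
lengths = [span_length(h) for h in prosite_hits]
print(lengths)
```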

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG20g05810.1  Cp4.1LG20g05810.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0009658      chloroplast organization
molecular_function  GO:0005515      protein binding