CaUC03G051180 (gene) Watermelon (USVL246-FR2) v1

Overview
NameCaUC03G051180
Typegene
OrganismCitrullus amarus (Watermelon (USVL246-FR2) v1)
DescriptionPentatricopeptide repeat-containing protein
LocationCiama_Chr03: 1279735 .. 1291451 (-)
RNA-Seq ExpressionCaUC03G051180
SyntenyCaUC03G051180
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGTGGGCGCTTCGGACTCCCCATTCTACCCAATACCCACCTTCGTCTCGCCGCCATTGCTCCGCTCATTCCACTTCAAAACCCTCCCTCTGCTCCGTCTCCTTAAACCCTTCAACCGCCGGAAACTCAAATAAGAACCAGTTGATTCAATCTCTATGCAAACAGGGCAATCTCAAACAAGCCCTTTTGCTCCTCTCCCATGAATCCAATCCTACCCAACAAACGTGGGAGCTTCTAATCCTTTCCGCCGCTCGCCGGAACTCTCTTTCCGATGGCCTTGATGTCCAACGGCACCTCGTCGATGGGGGTTTCGACCAAGATCCTTTTCTGGCAACCAAGCTTATCAATATGTTTTCCGAATTGGACTCTGTAGACAATGCGCGCAAGGTGTTTGATAAAACGCGTAAGAGAACTATATATGTTTGGAATGCGTTGTTTAGAGCTCTTGCGTTGGCGGGTCGTGGAAGCGACGTATTGGAATTGTATGCCCGGATGAATATGATGGGAGTTCCTTCCGATAGGTTTACTTATACTTATTTGCTCAAAGCTTGCGTTGCTTCAGAGTGTTTGGTTTCGTTTCTCCAGAAGGGTAAAGAGATTCATGCGCATATTTTGAGACATGGGTATGAAGCTCATGTTCATGTAATGACTACTCTGGTGGATATGTACGCAAGGTTTGGGTGTGTTTCTTATGCCAGTGCAGTGTTTGATGAAATGCCTGTGAGAAACGTGGTTTCTTGGAGTGCTATGATTGCATGCTATGCAAAGAATGGGAAGGCATACGAAGCTTTGGAACTCTTTCGGGAGATGATGCTTAACACCCATGATTCAGTGCCGAATCCCGTGACGATGGTCAGTGTACTCCAAGCTTGTGCTGCTCTTGCTGCCCTGGAGCAAGGGAAGCTTATCCACGCTTACATTCTTAGGAGGGGTCTTGATTCAATCTTGCCAGTTATAAGTGCTCTTGTGACCATGTATGCAAGATGTGGTAAGCTTGAGTTAGGCCAACTAGTTTTCGACCGTATGCATAAGAGAGATGTTGTCTTATGGAATTCCTTGATTTCAAGTTATGGAGTGCATGGATATGGAAGAAAAGCAATCGAAATTTTTGAGGAGATGATTGATCATGGAGTCTCACCTAGTTACATATCATTTGTGAGTGTTTTGGGTGCTTGCAGCCATGCTGGGCTTGTTGAAGAGGGGAAGGAGTTGTTTGAGTCCATGGTAAAAGAACATGGTATACAGCCTAGTGTAGAGCACTATGCTTGTATGGTTGATCTTCTTGGGCGTGCTAACCGGTTGGATGAAGCAGCCAAGATTGTGGAAGATCTGCGTATTGAACCAGGGCCAAAAGTATGGGGTTCTCTTCTTGGTGCCTGTAGGATTCATTGTAATGTTGAGCTTGCTGAACGAGCAAGCAAAAGACTTTTCGAGCTTGAGCCTACAAATGCCGGGAATTATGTACTTCTTGCTGATATTTATGCAGAAGCTGAAATGTGGGATGAGGTAAAGAGAGTGAGAAAACGTCTTGATTCTCGTGAATTACAAAAGGTCCCTGGTAGAAGCTGGATTGAAGTACGAAGGAAGATCTATTCATTTACATCTGTTGATGAGTTTAACCCACAAGGAGAGCAGCTCCATGCCCTGTTAGTGAATTTGTCAAATGAGATGAAGCAAAGAGGATATACCCCACAAACTAAAGTAGTGCTGTATGACCTTGATGAGGAAGAAAAGGAAAGGATTGTGTTGGGTCATAGCGAAAAGCTCGCAGTTGCTTTCGGACTCATCAATACAAGCAAGGGGGACACCATAAGGATAACTAAGAACTTGAGGCTATGTGAAGACTGCCATTCTGTCACAAAATTCATTTCCAAGTTTGCCGATCGAGAGATTATGGTTCGAGATGTTAATCGTTTCCACCATTTCAAGGATGGAGTTTGCTCCTGTGGAGACTATTGGTAGTGTATATGCAAATCCTTGTCTTATTCTTCTTTTTCTAATACCTTTATGATAATAACAGGTGATTCAACAGTTCTACTGCGCAGTTACATAGTTAAACACTTCCACACACCGTGAAACTTGGTTTCTTTAGGCCTATAATTTTATTGAGACGTCTTCTAGAGAAAATGCAAAAGAAGAGGCATCTAATCTATTGATGTCAAATTGAATATTCTCTTCTGCCCTTATATGTGGGAGAGATCAAACCGCAAACTCCATTTAGTAGGATGGCTTATTTAACGCGCCCTAGTAAATTAAAGCATGCATTTTTTCAACTAACTTAACATTAAGGTATGACTTATTTAAGATGTCCTGGTAATTTAAAGCATGCATTTCCCCAGAGAACTTTTTAGTTGAATATAGAATTAAACCAAAAAGTCAGTGTCCTAGGAATAGGGATAATCTTGTAGTTTTCTGTTCTGAGTTTCTTGGTGGTTAATTGTTAGTTTCTAGTTGGATTTATTTCTTAATTGTGTTTGTTTGCTGGATTGGTTTGTTCTGTCAGTGATCAGCTCTTAATGAGTAGGTTGGTATCTTTACAAATAGAGGATGGTCTCTTGTGTTTTTAACTCTTCTGTCCTGGATGAAATGTACATAAATGTTCATGGGTGGTATCAAGAGTAAGAGCTAAGCACTTAAAAAGTTATTAAAAAGTCCTTCCAAACGGACTTTAAAAGATGGATGGTTTTCTTTGTCATGATCTTTCAAGCTCAAGGTCCTTGTTAGCAAATCGAAAGCAAGATGCCAAAAGAGATTTAGAGAAGAAAGATGTTTGCATAAAATTTGATCAGATGGAGTGTACTTTTATTCTATTACCACTATCCTATGATCCTATCGTGTAGTTTTAATAGAAGTTTCATGTAGAAGGACACCTCAAATAGCGTTTTTTAAATTGTAAGTCAGGTTTGTGTTCTGTGAGTCCCTAAGCTTTTTGCTATGAGCTTTTAAGTTTGTATTGAAAAGATCTCTAAAATTTTAGTTCACCCATGTTTAGAGCTTTGGTGTGATAGTGTTAGGCCTCTATTTTTTGGTAGGTTCTTTATCTACCATGTTGTTGACCTATTTGATATTTTCAATTTATTGTCAAAATGAGTATAGTTCAATTGTATTAGAGAATACTAACAACTTCGGACACAGAGATTTGAATCCCAACCCATTTTGTAAAAGAAAAAAACTTGAATTTATTGTATACAAACTTGGAAGTTTGGGAGCTTGACACAATTTAATTTTAAATATTTATTCATGGAAAATCTTTTAGACACTATATATTTCATAAATATCCTAAATTTTGAATTATTTCGTTAGCTTTCATTTTGATCAAAATAACCTTTTCACTGTTATTTCAAAAATTTTGAAATATTTAATTCTTTAGAGACTAACTGGGCCCCCAATCTCTTCACTTCCCATACTCAATAACTCAAACTAAAATTGTGATTGTCTTTTTGTTACAGTGTTTACTATTATCTACCCATCACCTCCCATCTTCATTCACTCAAACACTACAACAAAAAGACAATCATGAAGTTATGGTTTATATTTTGTTGTGATAAAATTGGTACAAAAATTGGACAAATGACATAAAAATGAGAGGATATATCTATATGAAAAAAAATGATATTTATATAATTTGTTTTAAACTTGATTCTTTACAAAGTGATCACTACACTAACGATAGGTGCATTCTTAAACTTTTTATTTTTTCTTAAGGCTAAGAGTATTAATGCAACTCTTAAAAGTTCAAGGATATTTGAAATGAGTATTAACGGTTCATGTTCATATACTGACATGAAGGGTATCTTTGAAACTTTTGAAAGTTGTATTTTTGAAATAAAATATTAATGATTTTCATCTTGAACGAATGACAAGGGTATTTTTTTACTTATTTTTTTTTTTAAGATTAATGGTATTTTTGAAGTTTTTGAAAGTTCAAGGTTATTTTTGACACAAAGTACAAAGTTTATGGGTATTTTGTATAATTTATCCTATACATTACATATACCATATTTTTTGGGTGTATTTACAAATATACTATAAAATATAAATAGACATCCCATATATTGTTCAAGATATAAAATGATATATTATACAATGTGTCTTATTCATGCACCATTGAAATACACCCTATATGCCCCTTTTTATTTATAGAAAATTCAAATATTTCTTGAGTATCTTGTGTTGTATTATATCAAATATATATTGTTTTCTCCCCATTCTTGCTTTTGTTTATGAGAGTTACAATTTTTGGATTTTTGGATAAACTTATCTTACTTGTGTAGGAAATTAAAATTAATTCTTTTTTTTTTTTTTTAAAGTATAGAAATCTCATTGAACCAAGCTAAAAGTAAAAACTAAAAACTAAATATAACCGTTGGTTTCAAATAGCGAATATTGTTTCATAGATATGGATATCAAAAATTTTAAATGAGATATACAAGATCATACAATTATTATATAAATTGTGTACAAATAAAAATTTAAGAGTAATAAATACTACTTTGGTTCTTGTACTTTTGCTTTTGATTCATTTTGGTCCTTGTAATTTCAAAATATTTATTTTGGTTCCATTCAAAATCAAGAGTAAAAAACCCAACCATGAATTTTCTATTTAATTTTTCTTCTTCTTTTATATCTACCTTTCCACTTTTAACAATTAGCATTAAATTTTAATTAAAACAATTTAAAGCATCATCCAAATATCACTCAATATTTTTCCTTCAATTATCAACAGAAATTTATAATTATTATTTAAAGTTTGAAAAAAGGTCCCAAGATCACAAAATTTTAACAATAAGTAAAATTTTGTATTTTTATAAAATTAAAAAAAAAAAAAATTGCAATGCATCTTGAAATTGCGTTTGGAATTTCATGATTAATTCACAATGGGTTTGCCCCCACAACTGCAAATGCGCAGTATAGCAAAATTAAGTTGGATAGTAATCTGTGACATCAACAAATTGTTAGTCTTGATTGTTTTTTAGAATATGGGTCATTTCACATTAATCTAACCAAATTGGGGTCAATTACCAAANNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAATCTTTTTAACTCTCTCTCGATGAAGAAAAACCACACGTGTAGAATAATAAAAAAGTTTCTTGAATGAACATTTTTATTCCCTTCCCCATAAAGAAGCATTTAAATAATTATTTTAAATAGGTTAGAGTTTAATTGTATTATCATCTAAGTTAAAACTTACCTATGGTATAGGTTTTTGACAAGATGATTTTAACAATAACATGAGTAAAAGTAACAAACTTTCATCCGTGATGGATTATATCTCTCTCTGACCAATTCCTAGGTTTTATTTCTATATAAACACCTCGCACCGTTGTGTTGTGTTGTGTTTAATTAGGAATTATTTTTTATTTAATTAGCGCAGTAAAATAATAGGCCTAAAATAGTCACATGACTTCATAATATTAAATTTAAATAAAATGTAACATTAAAATTTAAAAACATAAAATAAAATCTGAATTGCCAACCGACTTTTAGAATTGTACAAGACCTATGAATTAAAATTATACCTAATAACATGACATGCTAAATACTAAATGAAAAGAAATTTTGATGTTATATAAACCTAAAAGTTTCTTTTAATACAACAACGAGGATAGAACTTTCAACCTCTAGTGAGAAAAGTCGTGTCAATTACTATTGAGCGAAAGCTCATTTTAACGATAAAAACCTAAAAGTTAATAGAACTATATTTATAAAGTTTGGCATTAAACAGATCTACGAAGATTTTTGTATGAATGATCCAATACTCACTGCATTTGAATAACTTAATTTTAAAAAATCAATGAAAATTAATCTTTTGCTTATAACAAGAGCGGTCCACCACTCAATATCACATTAGAGACGTCTTATCCATCGTGAAATAGTCGGAGCCGTAACACGAGGTTTATTTTTAATTACTAATAATTTCTAAAAGAAAAAAACTGTGATCGAAATGAATATCAGTTTTAATTATCTTTTTTAAGATTGGGTTATTTGTAAAAGAAAAAATCTCTTTTAACATGGAAGCGGTGGTTTGGGAACCTTCAACGTGCTCAATCATGGACGGGGAGAGATGGAATTTTGAGAGAGAGAGAGCGATCCACATGGAAATTCAAGAAAAAGGCAAAATGAGATTATTATAGGAATTGTAAACCATAAACCCACAGATTCCCTTGTTCTTCTGTTCTCTCTCTTGGCTGTATCGTCTCCTCTCCTTCTCCATTCTCCTCCTTTTCTTCCCACCATTCATTATTCATGGCTGCGCTTCTTGAATCTGCCTGGCAGGTACTACCCTTTTTTCCCTCTATCTCTGCTTAATCTTGTTCTTTCCGCCCTAGATTTGATCTTGGTTTTCAACCACTTCCATTAAATTCTAAACTTTTTGCACTACTGTACTCTGTGGTTTGTGTCTACTTCGTGCTAAGGATTGAGTTTTCAATTTCTTATGTTGGATGTCAAGTGGGTTTTCATGTGGGACTCTTACCCTTTCGAATTTTACTCTTATACTTGTTAAATTGGTTTGTTGATGTGAGAGTGACATTGGGGTTTTGATGTAATTTCGAGGTGAATTTCATTTTCTACAGTTGATTTGAAATAGGATCTTGATGTTGATAGCTAATTGTTACCACTTTTGAGGTGCTCTGATGGGAGCTATGGATGCCCCTTCTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTTTTTTTCTCCAGATTTTTGTGAATTTGGGTTTTCATTGACATTTGAGAGTTTCCTTTTTATTTGACAATATTGGCTTGCTGTAGTGTTGAATTACACTATGGTTCTTTATACATCTGGTACTTCTTTTTAAGGATTTTCTTTTTCCCACGTTATGCAGTATCTAATCACAAATTTCAGTGATTTTCAACTGGCTTGTATTGGAAGCTTCATAATCCATGAAAGCGTTTTCTTCTTATCTGGGCTTCCTTTTATACTTTTGGAAAGAGCAGGATGGCTGAGCAAGTACAAAATCCAGGTTTGCTTTTTTGCTTCACTAGAAATCTTTTGAGCTAAATCTTCACCACAAATTTCATAACTACACGTTCTTATGGATATTTAGTCAATATTTGATATATGAATTAGATTTACCTTTGTCAGTGATTCATGATTGATTATGAACTAGATTAGCACTTGGACTAGTTTTGATTATCCTTTTATTTCATCTCAAACGGAGAAAATAACAGGAAATAATGAGTGCTTTAAGCTTTATTTTTCATTGATTATTCTTTCTATAAAAAAGAATAAAAAGGAAAGAAAAAAAAATGCTGCTCATTGATTACCATGTTGACATTAGTGCTGATACTTATGGACACAGACAAAGTACGTATCAGCTGAGTTTTCCCCCTCAGTAATTACAGCTTAATTAACTCAATAAAGACCTAACTAACTCTTAACTATGGATGAGTTATTTTTAATTGTGATATAAATTTTTTCTGCTCTGTTCAGGCAAAGAATAATAGTCCTGCTGCTCAAGGAAAATGCATTTCACGCCTACTGCTGTATCATTTTGGTGTAAATCTGCCAGTTATGCTTGTTTCTTATCCCGTCTTCAAGCGTATGGGAATGAGAAGCACTCTTCCATTGCCATCCTGGTATCTTCAGGTTCTTTATGTCATAAAGCATATATTTTAGGAGTTTTATCTAGCTAGGTTTAGTTCACTTTCATTTTTTCCTTAGCTTGGACCATATCATTAGCTTATCTAGATTTAATTTTGTTTTGGGTTTCATTAATGTAGGAAAGTAGTTTTCGGCCAGATAATATTCTACTTTATTATTGAGGATTTTGTTTTCTACTGGGGGCATAGAATTTTGCACACCAAATGGCTGTACAAGAATGTCCACAGCGTGCATCATGAGTAAGTGAGATTTATGAATAAAAATTCTTGTATATTCTGCCTTTTCCCATGATAAAATTGTCTTAGGCTGTTTTATGGTTCTTATAAGTGCAGATATGCTACACCTTTTGGACTAACATCAGAATACGCTCACCCTGCTGAGATCCTGTTCCTTGGATTTGCTACCATCATTGGTCCTGCTCTTACTGGTCCCCATCTACTGACTCTGTGGTTATGGATGGTAGTTAGAGTGCTAGAGACAGTTGAGGCTCATTGTGGTTACCATTTTCCTTGGAGCCCTTCAAACTTCATACCTTTGTATGGGGGGTGAGTCATTAGGATTTGTAATGTTTTAAAAAGAATATGTGGTAATGTATCTGTTGTTCCTGAAATCAGCTTTGCTCTTTCTCTACAGTGCTTATTTTCATGATTATCATCACCGACTGCTCTATACGAAATCTGGCAACTACTCATCAACATTCACTTACATGGACTGGTAAGTTCTGGTCCAAAAATGGTCCAACCATTTATTTTATATATACGTTATTGGTTTGAACTTCATAGCATTTAAACCAGTTTTGTGGAACATTGATTTACTTCCCATTTAATAATTTCATGTCTATTTCCTCGGATTGTTGATTTATGTAGTAAACAAAGGGGGTGTTTGAATAATAGCAAGATGATGATGATCGAATTACCAGGGCACCCCAAATTCCTTGGTCCCTTTTCTCCTATCCTTTGAAACCAATGGTTTCATCGAATGGTTAACATTGATGGGCTAATTTAACATAACTAAGACATTTTGCCATGTAGATTATTGCTGGAAAATGTTTACTCCCTAAGCTGGATATATATATATTTTTTTTTTTAATTGTATTTTGGTGGAAAATGTGTTAATCAGTGACTTGATGTGATGCCCCCACTTTGGGTTGAGATTGTCTGGAAGTATGCTTGGACAGGGTATCTTTTTCTGAGTGATTGAGACATGCTGGAGGGAGTTGGAGAAAACTATAGGTATAATAATTTCCACAGCTTCAAGTCTTTGAGCCTAATATAATCTTAAGGTGGATTATTGAATGGTTAAGATGGAGGAGAAGAAAATTTTGATTGTAATGAGAGCTGGAGACCACCAGTGCATACATGTCCAAGTTCCCAATGGTATCTGAAGTTGGAACTTTTGAGACGTGATGGAGAACAAAAACAACATTGATTATGAAGATATTAATGTTGAGTTCTTAATGAAAAAGAAATCAAGTGCTTGGAGAAAAATAGTAGCCACTCCAGGTAGGGAGTTGAGCATTATTCTACCTGTAACATTTGTATTCTAGTTTTACCATGTTGGAAGATTCTTATCCCAAGTTTTCTCTTGATCGAATAAAAAAGGAAAATAGATTAAAAAAATGTTCATTTAGAAAATATGAGCAACTCCAAACTTGCTCTCGATTCAATCTTTATTTCGACTTTTGGGCAGTACACATGAATAGTTGTTGCTATCTTGGAGTTCTCAGCTAGCTTCATATGGAACTTGTAAGTTACCAAATTTGATCCTTTTCTGCAGGAAATATCATGACTATCCAGTGAATGGGGATAGGGGAACATGGAACTTTCGTATGAGAAAGTCGTACCTATATCATTTAACGACCTTACTTAAATAGGGGGATTTGTTCTATAAATTGTACAATTGTGTCCTTGAGTTCTAAAGGAAGTCATACTACCTATATGCATTTCATGGTTTTGCACGTGCTTTTACTTCATGTCCATCGATTATCGATAAATGCAATGAAACAGAAGACGTATACTGATGTGAATATTTCGGCTTCTTTGCTTGGCAGGATATTTGGAACTGACAAAGGTTTCAGAAATTTGGAAGCTATAAAAAAGGCAGAAAGTTGAGTACATTCGATATAGGTTTCTAATCTATAGCCACATTTGAGAAGGTTTAGTTCTTCTATATCACCTTTTCTTGTGTGGCATGATGTGTTTTGTGTCTTTTTTTTGTTGTTTAAATGCCAATATTCTTGTTCTGAAGCTAATGAAGAGCTGACTGTGGAGCTTGGCTGTCATTGTCTTTTGGTTTAGTTTAACATATTTTGTGTTCATGTACCAATTGGCTTATTTATGAAACTTAATTAGCATGAACTATAGAGCAGAACATGATTGTGTTTCCATCTCTCTTATGGATTTATGTTGGTTGTGGAAGACCTTTGATGCTCATGTTGTGTTGATGTACAAAATAAAAAAATAAAAAAATATATATATTGAGAGAACTTTTTGGCACTGAGGTAATAA

mRNA sequence

ATGTGGGCGCTTCGGACTCCCCATTCTACCCAATACCCACCTTCGTCTCGCCGCCATTGCTCCGCTCATTCCACTTCAAAACCCTCCCTCTGCTCCGTCTCCTTAAACCCTTCAACCGCCGGAAACTCAAATAAGAACCAGTTGATTCAATCTCTATGCAAACAGGGCAATCTCAAACAAGCCCTTTTGCTCCTCTCCCATGAATCCAATCCTACCCAACAAACGTGGGAGCTTCTAATCCTTTCCGCCGCTCGCCGGAACTCTCTTTCCGATGGCCTTGATGTCCAACGGCACCTCGTCGATGGGGGTTTCGACCAAGATCCTTTTCTGGCAACCAAGCTTATCAATATGTTTTCCGAATTGGACTCTGTAGACAATGCGCGCAAGGTGTTTGATAAAACGCGTAAGAGAACTATATATGTTTGGAATGCGTTGTTTAGAGCTCTTGCGTTGGCGGGTCGTGGAAGCGACGTATTGGAATTGTATGCCCGGATGAATATGATGGGAGTTCCTTCCGATAGGTTTACTTATACTTATTTGCTCAAAGCTTGCGTTGCTTCAGAGTGTTTGGTTTCGTTTCTCCAGAAGGGTAAAGAGATTCATGCGCATATTTTGAGACATGGGTATGAAGCTCATGTTCATGTAATGACTACTCTGGTGGATATGTACGCAAGGTTTGGGTGTGTTTCTTATGCCAGTGCAGTGTTTGATGAAATGCCTGTGAGAAACGTGGTTTCTTGGAGTGCTATGATTGCATGCTATGCAAAGAATGGGAAGGCATACGAAGCTTTGGAACTCTTTCGGGAGATGATGCTTAACACCCATGATTCAGTGCCGAATCCCGTGACGATGGTCAGTGTACTCCAAGCTTGTGCTGCTCTTGCTGCCCTGGAGCAAGGGAAGCTTATCCACGCTTACATTCTTAGGAGGGGTCTTGATTCAATCTTGCCAGTTATAAGTGCTCTTGTGACCATGTATGCAAGATGTGGTAAGCTTGAGTTAGGCCAACTAGTTTTCGACCGTATGCATAAGAGAGATGTTGTCTTATGGAATTCCTTGATTTCAAGTTATGGAGTGCATGGATATGGAAGAAAAGCAATCGAAATTTTTGAGGAGATGATTGATCATGGAGTCTCACCTAGTTACATATCATTTGTGAGTGTTTTGGGTGCTTGCAGCCATGCTGGGCTTGTTGAAGAGGGGAAGGAGTTGTTTGAGTCCATGGTAAAAGAACATGGTATACAGCCTAGTGTAGAGCACTATGCTTGTATGGTTGATCTTCTTGGGCGTGCTAACCGGTTGGATGAAGCAGCCAAGATTGTGGAAGATCTGCGTATTGAACCAGGGCCAAAAGTATGGGGTTCTCTTCTTGGTGCCTGTAGGATTCATTGTAATGTTGAGCTTGCTGAACGAGCAAGCAAAAGACTTTTCGAGCTTGAGCCTACAAATGCCGGGAATTATGTACTTCTTGCTGATATTTATGCAGAAGCTGAAATGTGGGATGAGGTAAAGAGAGTGAGAAAACGTCTTGATTCTCGTGAATTACAAAAGGTCCCTGGTAGAAGCTGGATTGAAGTACGAAGGAAGATCTATTCATTTACATCTGTTGATGAGTTTAACCCACAAGGAGAGCAGCTCCATGCCCTGTTAGTGAATTTGTCAAATGAGATGAAGCAAAGAGGATATACCCCACAAACTAAAGTAGTGCTGTATGACCTTGATGAGGAAGAAAAGGAAAGGATTGTGTTGGGTCATAGCGAAAAGCTCGCAGTTGCTTTCGGACTCATCAATACAAGCAAGGGGGACACCATAAGGATAACTAAGAACTTGAGGCTATGTGAAGACTGCCATTCTGTCACAAAATTCATTTCCAAGTTTGCCGATCGAGAGATTATGGTTCGAGATGTTAATCGTTTCCACCATTTCAAGGATGGAGTTTGCTCCTGTGGAGACTATTGTGATCAGCTCTTAATGAGTAGATTCCCTTGTTCTTCTGTTCTCTCTCTTGGCTGTATCGTCTCCTCTCCTTCTCCATTCTCCTCCTTTTCTTCCCACCATTCATTATTCATGGCTGCGCTTCTTGAATCTGCCTGGCAGTATCTAATCACAAATTTCAGTGATTTTCAACTGGCTTGTATTGGAAGCTTCATAATCCATGAAAGCGTTTTCTTCTTATCTGGGCTTCCTTTTATACTTTTGGAAAGAGCAGGATGGCTGAGCAAGTACAAAATCCAGGCAAAGAATAATAGTCCTGCTGCTCAAGGAAAATGCATTTCACGCCTACTGCTGTATCATTTTGGTGTAAATCTGCCAGTTATGCTTGTTTCTTATCCCGTCTTCAAGCGTATGGGAATGAGAAGCACTCTTCCATTGCCATCCTGGAAAGTAGTTTTCGGCCAGATAATATTCTACTTTATTATTGAGGATTTTGTTTTCTACTGGGGGCATAGAATTTTGCACACCAAATGGCTGTACAAGAATGTCCACAGCGTGCATCATGAATATGCTACACCTTTTGGACTAACATCAGAATACGCTCACCCTGCTGAGATCCTGTTCCTTGGATTTGCTACCATCATTGGTCCTGCTCTTACTGGTCCCCATCTACTGACTCTGTGGTTATGGATGGTAGTTAGAGTGCTAGAGACAGTTGAGGCTCATTGTGGTTACCATTTTCCTTGGAGCCCTTCAAACTTCATACCTTTGTATGGGGGGTGAGTCATTAGGATTTGTAATGTTTTAAAAAGAATATGTGGTAATGTATCTGTTGTTCCTGAAATCAGCTTTGCTCTTTCTCTACAGTGCTTATTTTCATGATTATCATCACCGACTGCTCTATACGAAATCTGGCAACTACTCATCAACATTCACTTACATGGACTGGATATTTGGAACTGACAAAGGTTTCAGAAATTTGGAAGCTATAAAAAAGGCAGAAAGTTGAGTACATTCGATATAGGTTTCTAATCTATAGCCACATTTGAGAAGGTTTAGTTCTTCTATATCACCTTTTCTTGTGTGGCATGATGTGTTTTGTGTCTTTTTTTTGTTGTTTAAATGCCAATATTCTTGTTCTGAAGCTAATGAAGAGCTGACTGTGGAGCTTGGCTGTCATTGTCTTTTGGTTTAGTTTAACATATTTTGTGTTCATGTACCAATTGGCTTATTTATGAAACTTAATTAGCATGAACTATAGAGCAGAACATGATTGTGTTTCCATCTCTCTTATGGATTTATGTTGGTTGTGGAAGACCTTTGATGCTCATGTTGTGTTGATGTACAAAATAAAAAAATAAAAAAATATATATATTGAGAGAACTTTTTGGCACTGAGGTAATAA

Coding sequence (CDS)

ATGTGGGCGCTTCGGACTCCCCATTCTACCCAATACCCACCTTCGTCTCGCCGCCATTGCTCCGCTCATTCCACTTCAAAACCCTCCCTCTGCTCCGTCTCCTTAAACCCTTCAACCGCCGGAAACTCAAATAAGAACCAGTTGATTCAATCTCTATGCAAACAGGGCAATCTCAAACAAGCCCTTTTGCTCCTCTCCCATGAATCCAATCCTACCCAACAAACGTGGGAGCTTCTAATCCTTTCCGCCGCTCGCCGGAACTCTCTTTCCGATGGCCTTGATGTCCAACGGCACCTCGTCGATGGGGGTTTCGACCAAGATCCTTTTCTGGCAACCAAGCTTATCAATATGTTTTCCGAATTGGACTCTGTAGACAATGCGCGCAAGGTGTTTGATAAAACGCGTAAGAGAACTATATATGTTTGGAATGCGTTGTTTAGAGCTCTTGCGTTGGCGGGTCGTGGAAGCGACGTATTGGAATTGTATGCCCGGATGAATATGATGGGAGTTCCTTCCGATAGGTTTACTTATACTTATTTGCTCAAAGCTTGCGTTGCTTCAGAGTGTTTGGTTTCGTTTCTCCAGAAGGGTAAAGAGATTCATGCGCATATTTTGAGACATGGGTATGAAGCTCATGTTCATGTAATGACTACTCTGGTGGATATGTACGCAAGGTTTGGGTGTGTTTCTTATGCCAGTGCAGTGTTTGATGAAATGCCTGTGAGAAACGTGGTTTCTTGGAGTGCTATGATTGCATGCTATGCAAAGAATGGGAAGGCATACGAAGCTTTGGAACTCTTTCGGGAGATGATGCTTAACACCCATGATTCAGTGCCGAATCCCGTGACGATGGTCAGTGTACTCCAAGCTTGTGCTGCTCTTGCTGCCCTGGAGCAAGGGAAGCTTATCCACGCTTACATTCTTAGGAGGGGTCTTGATTCAATCTTGCCAGTTATAAGTGCTCTTGTGACCATGTATGCAAGATGTGGTAAGCTTGAGTTAGGCCAACTAGTTTTCGACCGTATGCATAAGAGAGATGTTGTCTTATGGAATTCCTTGATTTCAAGTTATGGAGTGCATGGATATGGAAGAAAAGCAATCGAAATTTTTGAGGAGATGATTGATCATGGAGTCTCACCTAGTTACATATCATTTGTGAGTGTTTTGGGTGCTTGCAGCCATGCTGGGCTTGTTGAAGAGGGGAAGGAGTTGTTTGAGTCCATGGTAAAAGAACATGGTATACAGCCTAGTGTAGAGCACTATGCTTGTATGGTTGATCTTCTTGGGCGTGCTAACCGGTTGGATGAAGCAGCCAAGATTGTGGAAGATCTGCGTATTGAACCAGGGCCAAAAGTATGGGGTTCTCTTCTTGGTGCCTGTAGGATTCATTGTAATGTTGAGCTTGCTGAACGAGCAAGCAAAAGACTTTTCGAGCTTGAGCCTACAAATGCCGGGAATTATGTACTTCTTGCTGATATTTATGCAGAAGCTGAAATGTGGGATGAGGTAAAGAGAGTGAGAAAACGTCTTGATTCTCGTGAATTACAAAAGGTCCCTGGTAGAAGCTGGATTGAAGTACGAAGGAAGATCTATTCATTTACATCTGTTGATGAGTTTAACCCACAAGGAGAGCAGCTCCATGCCCTGTTAGTGAATTTGTCAAATGAGATGAAGCAAAGAGGATATACCCCACAAACTAAAGTAGTGCTGTATGACCTTGATGAGGAAGAAAAGGAAAGGATTGTGTTGGGTCATAGCGAAAAGCTCGCAGTTGCTTTCGGACTCATCAATACAAGCAAGGGGGACACCATAAGGATAACTAAGAACTTGAGGCTATGTGAAGACTGCCATTCTGTCACAAAATTCATTTCCAAGTTTGCCGATCGAGAGATTATGGTTCGAGATGTTAATCGTTTCCACCATTTCAAGGATGGAGTTTGCTCCTGTGGAGACTATTGTGATCAGCTCTTAATGAGTAGATTCCCTTGTTCTTCTGTTCTCTCTCTTGGCTGTATCGTCTCCTCTCCTTCTCCATTCTCCTCCTTTTCTTCCCACCATTCATTATTCATGGCTGCGCTTCTTGAATCTGCCTGGCAGTATCTAATCACAAATTTCAGTGATTTTCAACTGGCTTGTATTGGAAGCTTCATAATCCATGAAAGCGTTTTCTTCTTATCTGGGCTTCCTTTTATACTTTTGGAAAGAGCAGGATGGCTGAGCAAGTACAAAATCCAGGCAAAGAATAATAGTCCTGCTGCTCAAGGAAAATGCATTTCACGCCTACTGCTGTATCATTTTGGTGTAAATCTGCCAGTTATGCTTGTTTCTTATCCCGTCTTCAAGCGTATGGGAATGAGAAGCACTCTTCCATTGCCATCCTGGAAAGTAGTTTTCGGCCAGATAATATTCTACTTTATTATTGAGGATTTTGTTTTCTACTGGGGGCATAGAATTTTGCACACCAAATGGCTGTACAAGAATGTCCACAGCGTGCATCATGAATATGCTACACCTTTTGGACTAACATCAGAATACGCTCACCCTGCTGAGATCCTGTTCCTTGGATTTGCTACCATCATTGGTCCTGCTCTTACTGGTCCCCATCTACTGACTCTGTGGTTATGGATGGTAGTTAGAGTGCTAGAGACAGTTGAGGCTCATTGTGGTTACCATTTTCCTTGGAGCCCTTCAAACTTCATACCTTTGTATGGGGGGTGA

Protein sequence

MWALRTPHSTQYPPSSRRHCSAHSTSKPSLCSVSLNPSTAGNSNKNQLIQSLCKQGNLKQALLLLSHESNPTQQTWELLILSAARRNSLSDGLDVQRHLVDGGFDQDPFLATKLINMFSELDSVDNARKVFDKTRKRTIYVWNALFRALALAGRGSDVLELYARMNMMGVPSDRFTYTYLLKACVASECLVSFLQKGKEIHAHILRHGYEAHVHVMTTLVDMYARFGCVSYASAVFDEMPVRNVVSWSAMIACYAKNGKAYEALELFREMMLNTHDSVPNPVTMVSVLQACAALAALEQGKLIHAYILRRGLDSILPVISALVTMYARCGKLELGQLVFDRMHKRDVVLWNSLISSYGVHGYGRKAIEIFEEMIDHGVSPSYISFVSVLGACSHAGLVEEGKELFESMVKEHGIQPSVEHYACMVDLLGRANRLDEAAKIVEDLRIEPGPKVWGSLLGACRIHCNVELAERASKRLFELEPTNAGNYVLLADIYAEAEMWDEVKRVRKRLDSRELQKVPGRSWIEVRRKIYSFTSVDEFNPQGEQLHALLVNLSNEMKQRGYTPQTKVVLYDLDEEEKERIVLGHSEKLAVAFGLINTSKGDTIRITKNLRLCEDCHSVTKFISKFADREIMVRDVNRFHHFKDGVCSCGDYCDQLLMSRFPCSSVLSLGCIVSSPSPFSSFSSHHSLFMAALLESAWQYLITNFSDFQLACIGSFIIHESVFFLSGLPFILLERAGWLSKYKIQAKNNSPAAQGKCISRLLLYHFGVNLPVMLVSYPVFKRMGMRSTLPLPSWKVVFGQIIFYFIIEDFVFYWGHRILHTKWLYKNVHSVHHEYATPFGLTSEYAHPAEILFLGFATIIGPALTGPHLLTLWLWMVVRVLETVEAHCGYHFPWSPSNFIPLYGG
Homology
BLAST of CaUC03G051180 vs. NCBI nr
Match: XP_038895613.1 (pentatricopeptide repeat-containing protein At3g46790, chloroplastic [Benincasa hispida])

HSP 1 Score: 1272.3 bits (3291), Expect = 0.0e+00
Identity = 633/652 (97.09%), Postives = 640/652 (98.16%), Query Frame = 0

Query: 1   MWALRTPHSTQYPPSSRRHCSAHSTSKPSLCSVSLNPSTAGNSNKNQLIQSLCKQGNLKQ 60
           MWALRTP STQYPPSSRRHCSAHSTSKPS+CSVSLNPSTA NSNKNQLIQSLCKQGNLKQ
Sbjct: 1   MWALRTPQSTQYPPSSRRHCSAHSTSKPSVCSVSLNPSTAANSNKNQLIQSLCKQGNLKQ 60

Query: 61  ALLLLSHESNPTQQTWELLILSAARRNSLSDGLDVQRHLVDGGFDQDPFLATKLINMFSE 120
           ALLLLSHESNPTQQTWELLILSAARRNSLSDGLDV +HLVDGGFDQDPFLATKLINMFSE
Sbjct: 61  ALLLLSHESNPTQQTWELLILSAARRNSLSDGLDVHQHLVDGGFDQDPFLATKLINMFSE 120

Query: 121 LDSVDNARKVFDKTRKRTIYVWNALFRALALAGRGSDVLELYARMNMMGVPSDRFTYTYL 180
           LDSVD ARKVFDKTRKRTIYVWNALFRALALAG G+DVLELYARMN MG+PSDRFTYTYL
Sbjct: 121 LDSVDYARKVFDKTRKRTIYVWNALFRALALAGHGNDVLELYARMNTMGLPSDRFTYTYL 180

Query: 181 LKACVASECLVSFLQKGKEIHAHILRHGYEAHVHVMTTLVDMYARFGCVSYASAVFDEMP 240
           LKACVASECLVSFLQKGKEIHAHILRHGYE HVHVMTTLVDMYARFGCVSYASAVFDEMP
Sbjct: 181 LKACVASECLVSFLQKGKEIHAHILRHGYEGHVHVMTTLVDMYARFGCVSYASAVFDEMP 240

Query: 241 VRNVVSWSAMIACYAKNGKAYEALELFREMMLNTHDSVPNPVTMVSVLQACAALAALEQG 300
           VRNVVSWSAMIACYAKNGK YEALELFREMMLNTHDSVPN VTMVSVLQACAALAALEQG
Sbjct: 241 VRNVVSWSAMIACYAKNGKPYEALELFREMMLNTHDSVPNSVTMVSVLQACAALAALEQG 300

Query: 301 KLIHAYILRRGLDSILPVISALVTMYARCGKLELGQLVFDRMHKRDVVLWNSLISSYGVH 360
           KLIHAYILRRGLDSILPVISALVTMYARCGKLEL QLVFDRMHKRDVVLWNSLISSYGVH
Sbjct: 301 KLIHAYILRRGLDSILPVISALVTMYARCGKLELSQLVFDRMHKRDVVLWNSLISSYGVH 360

Query: 361 GYGRKAIEIFEEMIDHGVSPSYISFVSVLGACSHAGLVEEGKELFESMVKEHGIQPSVEH 420
           GYGRKAIEIFEEMID GVSPSYISFVSVLGACSHAGLVEEGK+LFESMVKEHGIQPSVEH
Sbjct: 361 GYGRKAIEIFEEMIDRGVSPSYISFVSVLGACSHAGLVEEGKKLFESMVKEHGIQPSVEH 420

Query: 421 YACMVDLLGRANRLDEAAKIVEDLRIEPGPKVWGSLLGACRIHCNVELAERASKRLFELE 480
           YACMVDLLGRANRLDEAAKIVEDLRIEPGPKVWGSLLGACRIHC+VELAERASKRLFELE
Sbjct: 421 YACMVDLLGRANRLDEAAKIVEDLRIEPGPKVWGSLLGACRIHCHVELAERASKRLFELE 480

Query: 481 PTNAGNYVLLADIYAEAEMWDEVKRVRKRLDSRELQKVPGRSWIEVRRKIYSFTSVDEFN 540
           PTNAGNYVLLADIYAEAEMWDEVKRVRK LDSRELQKVPGRSWIEVRRKIYSFTSVDEFN
Sbjct: 481 PTNAGNYVLLADIYAEAEMWDEVKRVRKLLDSRELQKVPGRSWIEVRRKIYSFTSVDEFN 540

Query: 541 PQGEQLHALLVNLSNEMKQRGYTPQTKVVLYDLDEEEKERIVLGHSEKLAVAFGLINTSK 600
           PQGEQLHALLVNLSNEMKQRGYTPQTKVVLYDLDEEEKERIVLGHSEK+AVAFGLINTSK
Sbjct: 541 PQGEQLHALLVNLSNEMKQRGYTPQTKVVLYDLDEEEKERIVLGHSEKIAVAFGLINTSK 600

Query: 601 GDTIRITKNLRLCEDCHSVTKFISKFADREIMVRDVNRFHHFKDGVCSCGDY 653
           GDTIRITKNLRLCEDCHSVTKFISKFADREIMVRDVNRFHHFKDGVCSCGDY
Sbjct: 601 GDTIRITKNLRLCEDCHSVTKFISKFADREIMVRDVNRFHHFKDGVCSCGDY 652

BLAST of CaUC03G051180 vs. NCBI nr
Match: XP_022141898.1 (pentatricopeptide repeat-containing protein At3g46790, chloroplastic [Momordica charantia])

HSP 1 Score: 1238.8 bits (3204), Expect = 0.0e+00
Identity = 616/652 (94.48%), Postives = 629/652 (96.47%), Query Frame = 0

Query: 1   MWALRTPHSTQYPPSSRRHCSAHSTSKPSLCSVSLNPSTAGNSNKNQLIQSLCKQGNLKQ 60
           MWALRTP ST YPPSSRRHCSAHSTS+PS+CS++LNPS A N NKNQLIQSLCKQGNLKQ
Sbjct: 1   MWALRTPQSTPYPPSSRRHCSAHSTSRPSVCSLALNPSIAANPNKNQLIQSLCKQGNLKQ 60

Query: 61  ALLLLSHESNPTQQTWELLILSAARRNSLSDGLDVQRHLVDGGFDQDPFLATKLINMFSE 120
           ALLLLSHESNPTQQT ELLILSAARRNSLSDGLDV RHLVDGGFDQDPFLATKLINMFSE
Sbjct: 61  ALLLLSHESNPTQQTCELLILSAARRNSLSDGLDVHRHLVDGGFDQDPFLATKLINMFSE 120

Query: 121 LDSVDNARKVFDKTRKRTIYVWNALFRALALAGRGSDVLELYARMNMMGVPSDRFTYTYL 180
           LDSVD+ARKVFDKTR RTIYVWNALFRALALAG G +VLELYARMNM GVPSDRFTYTYL
Sbjct: 121 LDSVDDARKVFDKTRNRTIYVWNALFRALALAGHGKEVLELYARMNMTGVPSDRFTYTYL 180

Query: 181 LKACVASECLVSFLQKGKEIHAHILRHGYEAHVHVMTTLVDMYARFGCVSYASAVFDEMP 240
           LKACVASECLVS L+KGKEIHAHILRHGYEAHVHVMTTLVDMYARFGCVSYASAVFDEMP
Sbjct: 181 LKACVASECLVSLLRKGKEIHAHILRHGYEAHVHVMTTLVDMYARFGCVSYASAVFDEMP 240

Query: 241 VRNVVSWSAMIACYAKNGKAYEALELFREMMLNTHDSVPNPVTMVSVLQACAALAALEQG 300
           VRNVVSWSA+IACYAKNGK YEALELF EMMLNTHDSVPN VTMVSVLQACAALAALEQG
Sbjct: 241 VRNVVSWSAIIACYAKNGKPYEALELFCEMMLNTHDSVPNSVTMVSVLQACAALAALEQG 300

Query: 301 KLIHAYILRRGLDSILPVISALVTMYARCGKLELGQLVFDRMHKRDVVLWNSLISSYGVH 360
           KLIH YILRRGLDSILPVISALVTMYARCGKLELGQLVFDRMHKRDVVLWNSLIS YGVH
Sbjct: 301 KLIHGYILRRGLDSILPVISALVTMYARCGKLELGQLVFDRMHKRDVVLWNSLISGYGVH 360

Query: 361 GYGRKAIEIFEEMIDHGVSPSYISFVSVLGACSHAGLVEEGKELFESMVKEHGIQPSVEH 420
           GYGRKAIEIFEEMIDHG SPSYISFVSVLGACSHAGLVEEGKELFESMVKEHGI PSVEH
Sbjct: 361 GYGRKAIEIFEEMIDHGFSPSYISFVSVLGACSHAGLVEEGKELFESMVKEHGIHPSVEH 420

Query: 421 YACMVDLLGRANRLDEAAKIVEDLRIEPGPKVWGSLLGACRIHCNVELAERASKRLFELE 480
           YACMVDLLGRANRLDEAAKIVED+R+EPGPKVWGSLLGACRIHC+VELAERASKRLFELE
Sbjct: 421 YACMVDLLGRANRLDEAAKIVEDMRLEPGPKVWGSLLGACRIHCHVELAERASKRLFELE 480

Query: 481 PTNAGNYVLLADIYAEAEMWDEVKRVRKRLDSRELQKVPGRSWIEVRRKIYSFTSVDEFN 540
           PTNAGNYVLLADIYAEAEMWD+VKRV+K LDSRELQKVPGRSWIEVRRKIYSFTSVDEFN
Sbjct: 481 PTNAGNYVLLADIYAEAEMWDKVKRVKKLLDSRELQKVPGRSWIEVRRKIYSFTSVDEFN 540

Query: 541 PQGEQLHALLVNLSNEMKQRGYTPQTKVVLYDLDEEEKERIVLGHSEKLAVAFGLINTSK 600
           PQ EQLHALLVNLS EMKQRGY PQTKVVLYDLDEEEKERIVLGHSEKLAVAFGLINTSK
Sbjct: 541 PQVEQLHALLVNLSKEMKQRGYIPQTKVVLYDLDEEEKERIVLGHSEKLAVAFGLINTSK 600

Query: 601 GDTIRITKNLRLCEDCHSVTKFISKFADREIMVRDVNRFHHFKDGVCSCGDY 653
           GDTIRITKNLRLCEDCHSVTKFISKFADREIMVRDVNRFHHFKDGVCSCGDY
Sbjct: 601 GDTIRITKNLRLCEDCHSVTKFISKFADREIMVRDVNRFHHFKDGVCSCGDY 652

BLAST of CaUC03G051180 vs. NCBI nr
Match: KAA0067772.1 (pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 1229.2 bits (3179), Expect = 0.0e+00
Identity = 606/652 (92.94%), Postives = 632/652 (96.93%), Query Frame = 0

Query: 1   MWALRTPHSTQYPPSSRRHCSAHSTSKPSLCSVSLNPSTAGNSNKNQLIQSLCKQGNLKQ 60
           MWALRTPHSTQYPPSSRRH SAHSTSK S+CS SLNPST+ NSNK+QLIQSLCK+GNLKQ
Sbjct: 1   MWALRTPHSTQYPPSSRRHSSAHSTSKLSVCSFSLNPSTSANSNKDQLIQSLCKEGNLKQ 60

Query: 61  ALLLLSHESNPTQQTWELLILSAARRNSLSDGLDVQRHLVDGGFDQDPFLATKLINMFSE 120
           AL+LLSHE NPTQQT ELLILSAARR SLSD LDV +HLVDGGFDQDPFLATKLINMFSE
Sbjct: 61  ALVLLSHEPNPTQQTCELLILSAARRKSLSDALDVHQHLVDGGFDQDPFLATKLINMFSE 120

Query: 121 LDSVDNARKVFDKTRKRTIYVWNALFRALALAGRGSDVLELYARMNMMGVPSDRFTYTYL 180
           LDSVDNARKVFDKTRKRTIYVWNALFRALALAG G+DVLELY RM+MMGVP DRFTYTYL
Sbjct: 121 LDSVDNARKVFDKTRKRTIYVWNALFRALALAGHGNDVLELYPRMDMMGVPCDRFTYTYL 180

Query: 181 LKACVASECLVSFLQKGKEIHAHILRHGYEAHVHVMTTLVDMYARFGCVSYASAVFDEMP 240
           LKACVAS+CLVSFLQKGKEIHAHILRHGY AHVHVMTTLVDMYARFGCVSYASAVFDEMP
Sbjct: 181 LKACVASDCLVSFLQKGKEIHAHILRHGYGAHVHVMTTLVDMYARFGCVSYASAVFDEMP 240

Query: 241 VRNVVSWSAMIACYAKNGKAYEALELFREMMLNTHDSVPNPVTMVSVLQACAALAALEQG 300
           V+NVVSWSAMIACYAKNGK YEALELFR+MMLNTHD VPN VTMVSVLQACAA AALEQG
Sbjct: 241 VKNVVSWSAMIACYAKNGKPYEALELFRDMMLNTHDLVPNSVTMVSVLQACAAFAALEQG 300

Query: 301 KLIHAYILRRGLDSILPVISALVTMYARCGKLELGQLVFDRMHKRDVVLWNSLISSYGVH 360
           KLIHAYILRRGLDSILPVISALVTMYARCGKLELGQ++FDR+HK+DV+LWNSL SSYG+H
Sbjct: 301 KLIHAYILRRGLDSILPVISALVTMYARCGKLELGQVIFDRIHKKDVILWNSLFSSYGLH 360

Query: 361 GYGRKAIEIFEEMIDHGVSPSYISFVSVLGACSHAGLVEEGKELFESMVKEHGIQPSVEH 420
           GYGRKAIEIFEEMID+G+SPSYISFVSVLGACSHAGLVEEGK+LFESMVKEHGIQPSVEH
Sbjct: 361 GYGRKAIEIFEEMIDNGISPSYISFVSVLGACSHAGLVEEGKKLFESMVKEHGIQPSVEH 420

Query: 421 YACMVDLLGRANRLDEAAKIVEDLRIEPGPKVWGSLLGACRIHCNVELAERASKRLFELE 480
           YACMVDLLGRANRLDEAAKI+EDLRIEPGPKVWGSLLGACRIHC+VELAERASKRLFELE
Sbjct: 421 YACMVDLLGRANRLDEAAKIIEDLRIEPGPKVWGSLLGACRIHCHVELAERASKRLFELE 480

Query: 481 PTNAGNYVLLADIYAEAEMWDEVKRVRKRLDSRELQKVPGRSWIEVRRKIYSFTSVDEFN 540
           PTNAGNYVLLADIYAEAEMWDEVKRVRK L+SRELQKVPGRSWIEVRRKIYSFTSVDEFN
Sbjct: 481 PTNAGNYVLLADIYAEAEMWDEVKRVRKLLNSRELQKVPGRSWIEVRRKIYSFTSVDEFN 540

Query: 541 PQGEQLHALLVNLSNEMKQRGYTPQTKVVLYDLDEEEKERIVLGHSEKLAVAFGLINTSK 600
           PQGEQLHALLVNLSNEMKQRGY PQTK+VLYDLD+EEKERIVLGHSEKLAVAFGLINTSK
Sbjct: 541 PQGEQLHALLVNLSNEMKQRGYVPQTKLVLYDLDQEEKERIVLGHSEKLAVAFGLINTSK 600

Query: 601 GDTIRITKNLRLCEDCHSVTKFISKFADREIMVRDVNRFHHFKDGVCSCGDY 653
           GDTIRITKNLRLCEDCHSVTKFISKFADREIMVRD+NRFHHFKDGVCSCGDY
Sbjct: 601 GDTIRITKNLRLCEDCHSVTKFISKFADREIMVRDLNRFHHFKDGVCSCGDY 652

BLAST of CaUC03G051180 vs. NCBI nr
Match: XP_008457445.1 (PREDICTED: pentatricopeptide repeat-containing protein At3g46790, chloroplastic [Cucumis melo] >TYJ97374.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 1228.0 bits (3176), Expect = 0.0e+00
Identity = 605/652 (92.79%), Postives = 632/652 (96.93%), Query Frame = 0

Query: 1   MWALRTPHSTQYPPSSRRHCSAHSTSKPSLCSVSLNPSTAGNSNKNQLIQSLCKQGNLKQ 60
           MWALRTPHSTQYPPSSRRH SAHSTSK S+CS SLNPST+ NSNK+QLIQSLCK+GNLKQ
Sbjct: 1   MWALRTPHSTQYPPSSRRHSSAHSTSKLSVCSFSLNPSTSANSNKDQLIQSLCKEGNLKQ 60

Query: 61  ALLLLSHESNPTQQTWELLILSAARRNSLSDGLDVQRHLVDGGFDQDPFLATKLINMFSE 120
           AL+LLSHE NPTQQT ELLILSA+RR SLSD LDV +HLVDGGFDQDPFLATKLINMFSE
Sbjct: 61  ALVLLSHEPNPTQQTCELLILSASRRKSLSDALDVHQHLVDGGFDQDPFLATKLINMFSE 120

Query: 121 LDSVDNARKVFDKTRKRTIYVWNALFRALALAGRGSDVLELYARMNMMGVPSDRFTYTYL 180
           LDSVDNARKVFDKTRKRTIYVWNALFRALALAG G+DVLELY RM+MMGVP DRFTYTYL
Sbjct: 121 LDSVDNARKVFDKTRKRTIYVWNALFRALALAGHGNDVLELYPRMDMMGVPCDRFTYTYL 180

Query: 181 LKACVASECLVSFLQKGKEIHAHILRHGYEAHVHVMTTLVDMYARFGCVSYASAVFDEMP 240
           LKACVAS+CLVSFLQKGKEIHAHILRHGY AHVHVMTTLVDMYARFGCVSYASAVFDEMP
Sbjct: 181 LKACVASDCLVSFLQKGKEIHAHILRHGYGAHVHVMTTLVDMYARFGCVSYASAVFDEMP 240

Query: 241 VRNVVSWSAMIACYAKNGKAYEALELFREMMLNTHDSVPNPVTMVSVLQACAALAALEQG 300
           V+NVVSWSAMIACYAKNGK YEALELFR+MMLNTHD VPN VTMVSVLQACAA AALEQG
Sbjct: 241 VKNVVSWSAMIACYAKNGKPYEALELFRDMMLNTHDLVPNSVTMVSVLQACAAFAALEQG 300

Query: 301 KLIHAYILRRGLDSILPVISALVTMYARCGKLELGQLVFDRMHKRDVVLWNSLISSYGVH 360
           KLIHAYILRRGLDSILPVISALVTMYARCGKLELGQ++FDR+HK+DV+LWNSL SSYG+H
Sbjct: 301 KLIHAYILRRGLDSILPVISALVTMYARCGKLELGQVIFDRIHKKDVILWNSLFSSYGLH 360

Query: 361 GYGRKAIEIFEEMIDHGVSPSYISFVSVLGACSHAGLVEEGKELFESMVKEHGIQPSVEH 420
           GYGRKAIEIFEEMID+G+SPSYISFVSVLGACSHAGLVEEGK+LFESMVKEHGIQPSVEH
Sbjct: 361 GYGRKAIEIFEEMIDNGISPSYISFVSVLGACSHAGLVEEGKKLFESMVKEHGIQPSVEH 420

Query: 421 YACMVDLLGRANRLDEAAKIVEDLRIEPGPKVWGSLLGACRIHCNVELAERASKRLFELE 480
           YACMVDLLGRANRLDEAAKI+EDLRIEPGPKVWGSLLGACRIHC+VELAERASKRLFELE
Sbjct: 421 YACMVDLLGRANRLDEAAKIIEDLRIEPGPKVWGSLLGACRIHCHVELAERASKRLFELE 480

Query: 481 PTNAGNYVLLADIYAEAEMWDEVKRVRKRLDSRELQKVPGRSWIEVRRKIYSFTSVDEFN 540
           PTNAGNYVLLADIYAEAEMWDEVKRVRK L+SRELQKVPGRSWIEVRRKIYSFTSVDEFN
Sbjct: 481 PTNAGNYVLLADIYAEAEMWDEVKRVRKLLNSRELQKVPGRSWIEVRRKIYSFTSVDEFN 540

Query: 541 PQGEQLHALLVNLSNEMKQRGYTPQTKVVLYDLDEEEKERIVLGHSEKLAVAFGLINTSK 600
           PQGEQLHALLVNLSNEMKQRGY PQTK+VLYDLD+EEKERIVLGHSEKLAVAFGLINTSK
Sbjct: 541 PQGEQLHALLVNLSNEMKQRGYVPQTKLVLYDLDQEEKERIVLGHSEKLAVAFGLINTSK 600

Query: 601 GDTIRITKNLRLCEDCHSVTKFISKFADREIMVRDVNRFHHFKDGVCSCGDY 653
           GDTIRITKNLRLCEDCHSVTKFISKFADREIMVRD+NRFHHFKDGVCSCGDY
Sbjct: 601 GDTIRITKNLRLCEDCHSVTKFISKFADREIMVRDLNRFHHFKDGVCSCGDY 652

BLAST of CaUC03G051180 vs. NCBI nr
Match: XP_022964690.1 (pentatricopeptide repeat-containing protein At3g46790, chloroplastic [Cucurbita moschata] >KAG7019172.1 Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1226.8 bits (3173), Expect = 0.0e+00
Identity = 608/652 (93.25%), Postives = 624/652 (95.71%), Query Frame = 0

Query: 1   MWALRTPHSTQYPPSSRRHCSAHSTSKPSLCSVSLNPSTAGNSNKNQLIQSLCKQGNLKQ 60
           MWALRTP  TQYPP SRRH SAHS SKPS+CSVSLN STA NSNKNQLIQSLCKQGNLKQ
Sbjct: 1   MWALRTPQYTQYPPLSRRHSSAHSPSKPSICSVSLNSSTAANSNKNQLIQSLCKQGNLKQ 60

Query: 61  ALLLLSHESNPTQQTWELLILSAARRNSLSDGLDVQRHLVDGGFDQDPFLATKLINMFSE 120
           ALLLLSHESNPTQ+TWELLILSAARRNSLSDGLDV R LVDGGFDQDPFLATKLINMFSE
Sbjct: 61  ALLLLSHESNPTQRTWELLILSAARRNSLSDGLDVHRRLVDGGFDQDPFLATKLINMFSE 120

Query: 121 LDSVDNARKVFDKTRKRTIYVWNALFRALALAGRGSDVLELYARMNMMGVPSDRFTYTYL 180
           L+SVDN RKVFDKTRKRTI+VWNALFRALALAG G DVLELYA+MNMMGVPSDRFTYTYL
Sbjct: 121 LESVDNVRKVFDKTRKRTIFVWNALFRALALAGHGKDVLELYAQMNMMGVPSDRFTYTYL 180

Query: 181 LKACVASECLVSFLQKGKEIHAHILRHGYEAHVHVMTTLVDMYARFGCVSYASAVFDEMP 240
           LKAC ASECLVSFLQKGKEIHAHILRHGYEAHVH MTTLVDMYARFGCVSYA AVFDEMP
Sbjct: 181 LKACAASECLVSFLQKGKEIHAHILRHGYEAHVHAMTTLVDMYARFGCVSYAGAVFDEMP 240

Query: 241 VRNVVSWSAMIACYAKNGKAYEALELFREMMLNTHDSVPNPVTMVSVLQACAALAALEQG 300
           VRNVVSWSAMIACYAKNGK YEALELFREMMLNTHD+VPN VTMVSVLQ+CAAL+ALEQG
Sbjct: 241 VRNVVSWSAMIACYAKNGKPYEALELFREMMLNTHDTVPNSVTMVSVLQSCAALSALEQG 300

Query: 301 KLIHAYILRRGLDSILPVISALVTMYARCGKLELGQLVFDRMHKRDVVLWNSLISSYGVH 360
           KLIHAYILRRGLDSILPVISALVTMYARCGKLELGQLVFD MHKRDVV+WNSLISSYGVH
Sbjct: 301 KLIHAYILRRGLDSILPVISALVTMYARCGKLELGQLVFDFMHKRDVVIWNSLISSYGVH 360

Query: 361 GYGRKAIEIFEEMIDHGVSPSYISFVSVLGACSHAGLVEEGKELFESMVKEHGIQPSVEH 420
           GYGRKAIEIFEEM+DHGVSPSYISFVSVLGACSHAGLVEEGKELFESM K+HGIQP  EH
Sbjct: 361 GYGRKAIEIFEEMVDHGVSPSYISFVSVLGACSHAGLVEEGKELFESMAKKHGIQPGEEH 420

Query: 421 YACMVDLLGRANRLDEAAKIVEDLRIEPGPKVWGSLLGACRIHCNVELAERASKRLFELE 480
           YACMVDLLGRANRLDEAAKIVEDLRIEPGPKVWGSLLGACRIHC+VELAERASKRLFELE
Sbjct: 421 YACMVDLLGRANRLDEAAKIVEDLRIEPGPKVWGSLLGACRIHCHVELAERASKRLFELE 480

Query: 481 PTNAGNYVLLADIYAEAEMWDEVKRVRKRLDSRELQKVPGRSWIEVRRKIYSFTSVDEFN 540
           PTNAGNYVLLAD+YAEA MWDEVKRVRK LDSREL+KVPGRSWIEVRRKIYSFTSVDEF 
Sbjct: 481 PTNAGNYVLLADVYAEANMWDEVKRVRKLLDSRELRKVPGRSWIEVRRKIYSFTSVDEFY 540

Query: 541 PQGEQLHALLVNLSNEMKQRGYTPQTKVVLYDLDEEEKERIVLGHSEKLAVAFGLINTSK 600
           PQ EQLHALL+NLSNEMKQRGYTP TKVVLYDLDEEEKERIVLGHSEKLAVAFGLINTSK
Sbjct: 541 PQAEQLHALLMNLSNEMKQRGYTPHTKVVLYDLDEEEKERIVLGHSEKLAVAFGLINTSK 600

Query: 601 GDTIRITKNLRLCEDCHSVTKFISKFADREIMVRDVNRFHHFKDGVCSCGDY 653
           GDTIRI+KNLRLCEDCHS TKFISKFAD EIMVRDVNRFHHFKDGVCSCGDY
Sbjct: 601 GDTIRISKNLRLCEDCHSFTKFISKFADIEIMVRDVNRFHHFKDGVCSCGDY 652

BLAST of CaUC03G051180 vs. ExPASy Swiss-Prot
Match: Q9STF3 (Pentatricopeptide repeat-containing protein At3g46790, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=CRR2 PE=2 SV=1)

HSP 1 Score: 981.5 bits (2536), Expect = 6.5e-285
Identity = 485/634 (76.50%), Postives = 548/634 (86.44%), Query Frame = 0

Query: 24  STSKPSLCSVSL-NPSTAGNS----NKNQLIQSLCKQGNLKQALLLLSHESNPTQQTWEL 83
           S  KP  CSV+L NPS +  +    + NQLIQSLCK+G LKQA+ +LS ES+P+QQT+EL
Sbjct: 23  SPLKPPSCSVALNNPSISSGAGAKISNNQLIQSLCKEGKLKQAIRVLSQESSPSQQTYEL 82

Query: 84  LILSAARRNSLSDGLDVQRHLVDGGFDQDPFLATKLINMFSELDSVDNARKVFDKTRKRT 143
           LIL    R+SLSD L V RH++D G DQDPFLATKLI M+S+L SVD ARKVFDKTRKRT
Sbjct: 83  LILCCGHRSSLSDALRVHRHILDNGSDQDPFLATKLIGMYSDLGSVDYARKVFDKTRKRT 142

Query: 144 IYVWNALFRALALAGRGSDVLELYARMNMMGVPSDRFTYTYLLKACVASECLVSFLQKGK 203
           IYVWNALFRAL LAG G +VL LY +MN +GV SDRFTYTY+LKACVASEC V+ L KGK
Sbjct: 143 IYVWNALFRALTLAGHGEEVLGLYWKMNRIGVESDRFTYTYVLKACVASECTVNHLMKGK 202

Query: 204 EIHAHILRHGYEAHVHVMTTLVDMYARFGCVSYASAVFDEMPVRNVVSWSAMIACYAKNG 263
           EIHAH+ R GY +HV++MTTLVDMYARFGCV YAS VF  MPVRNVVSWSAMIACYAKNG
Sbjct: 203 EIHAHLTRRGYSSHVYIMTTLVDMYARFGCVDYASYVFGGMPVRNVVSWSAMIACYAKNG 262

Query: 264 KAYEALELFREMMLNTHDSVPNPVTMVSVLQACAALAALEQGKLIHAYILRRGLDSILPV 323
           KA+EAL  FREMM  T DS PN VTMVSVLQACA+LAALEQGKLIH YILRRGLDSILPV
Sbjct: 263 KAFEALRTFREMMRETKDSSPNSVTMVSVLQACASLAALEQGKLIHGYILRRGLDSILPV 322

Query: 324 ISALVTMYARCGKLELGQLVFDRMHKRDVVLWNSLISSYGVHGYGRKAIEIFEEMIDHGV 383
           ISALVTMY RCGKLE+GQ VFDRMH RDVV WNSLISSYGVHGYG+KAI+IFEEM+ +G 
Sbjct: 323 ISALVTMYGRCGKLEVGQRVFDRMHDRDVVSWNSLISSYGVHGYGKKAIQIFEEMLANGA 382

Query: 384 SPSYISFVSVLGACSHAGLVEEGKELFESMVKEHGIQPSVEHYACMVDLLGRANRLDEAA 443
           SP+ ++FVSVLGACSH GLVEEGK LFE+M ++HGI+P +EHYACMVDLLGRANRLDEAA
Sbjct: 383 SPTPVTFVSVLGACSHEGLVEEGKRLFETMWRDHGIKPQIEHYACMVDLLGRANRLDEAA 442

Query: 444 KIVEDLRIEPGPKVWGSLLGACRIHCNVELAERASKRLFELEPTNAGNYVLLADIYAEAE 503
           K+V+D+R EPGPKVWGSLLG+CRIH NVELAERAS+RLF LEP NAGNYVLLADIYAEA+
Sbjct: 443 KMVQDMRTEPGPKVWGSLLGSCRIHGNVELAERASRRLFALEPKNAGNYVLLADIYAEAQ 502

Query: 504 MWDEVKRVRKRLDSRELQKVPGRSWIEVRRKIYSFTSVDEFNPQGEQLHALLVNLSNEMK 563
           MWDEVKRV+K L+ R LQK+PGR W+EVRRK+YSF SVDEFNP  EQ+HA LV L+ +MK
Sbjct: 503 MWDEVKRVKKLLEHRGLQKLPGRCWMEVRRKMYSFVSVDEFNPLMEQIHAFLVKLAEDMK 562

Query: 564 QRGYTPQTKVVLYDLDEEEKERIVLGHSEKLAVAFGLINTSKGDTIRITKNLRLCEDCHS 623
           ++GY PQTK VLY+L+ EEKERIVLGHSEKLA+AFGLINTSKG+ IRITKNLRLCEDCH 
Sbjct: 563 EKGYIPQTKGVLYELETEEKERIVLGHSEKLALAFGLINTSKGEPIRITKNLRLCEDCHL 622

Query: 624 VTKFISKFADREIMVRDVNRFHHFKDGVCSCGDY 653
            TKFISKF ++EI+VRDVNRFH FK+GVCSCGDY
Sbjct: 623 FTKFISKFMEKEILVRDVNRFHRFKNGVCSCGDY 656

BLAST of CaUC03G051180 vs. ExPASy Swiss-Prot
Match: Q9LIC3 (Putative pentatricopeptide repeat-containing protein At3g13770, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-H85 PE=3 SV=1)

HSP 1 Score: 474.6 bits (1220), Expect = 2.6e-132
Identity = 240/609 (39.41%), Postives = 375/609 (61.58%), Query Frame = 0

Query: 49  IQSLCKQGNLKQALL---LLSHESNPTQQTWELLILSAARRNSLSDGLDVQRHLVDGGFD 108
           I  LC  G L++ALL   +L  E       ++ L+ +   + +L DG  V  H++   + 
Sbjct: 27  ISQLCSNGRLQEALLEMAMLGPEMG--FHGYDALLNACLDKRALRDGQRVHAHMIKTRYL 86

Query: 109 QDPFLATKLINMFSELDSVDNARKVFDKTRKRTIYVWNALFRALALAGRGSDVLELYARM 168
              +L T+L+  + + D +++ARKV D+  ++ +  W A+    +  G  S+ L ++A M
Sbjct: 87  PATYLRTRLLIFYGKCDCLEDARKVLDEMPEKNVVSWTAMISRYSQTGHSSEALTVFAEM 146

Query: 169 NMMGVPSDRFTYTYLLKACVASECLVSFLQKGKEIHAHILRHGYEAHVHVMTTLVDMYAR 228
                  + FT+  +L +C+ +    S L  GK+IH  I++  Y++H+ V ++L+DMYA+
Sbjct: 147 MRSDGKPNEFTFATVLTSCIRA----SGLGLGKQIHGLIVKWNYDSHIFVGSSLLDMYAK 206

Query: 229 FGCVSYASAVFDEMPVRNVVSWSAMIACYAKNGKAYEALELFREMMLNTHDSVPNPVTMV 288
            G +  A  +F+ +P R+VVS +A+IA YA+ G   EALE+F    L++    PN VT  
Sbjct: 207 AGQIKEAREIFECLPERDVVSCTAIIAGYAQLGLDEEALEMFHR--LHSEGMSPNYVTYA 266

Query: 289 SVLQACAALAALEQGKLIHAYILRRGLDSILPVISALVTMYARCGKLELGQLVFDRMHKR 348
           S+L A + LA L+ GK  H ++LRR L     + ++L+ MY++CG L   + +FD M +R
Sbjct: 267 SLLTALSGLALLDHGKQAHCHVLRRELPFYAVLQNSLIDMYSKCGNLSYARRLFDNMPER 326

Query: 349 DVVLWNSLISSYGVHGYGRKAIEIFEEMIDH-GVSPSYISFVSVLGACSHAGLVEEGKEL 408
             + WN+++  Y  HG GR+ +E+F  M D   V P  ++ ++VL  CSH  + + G  +
Sbjct: 327 TAISWNAMLVGYSKHGLGREVLELFRLMRDEKRVKPDAVTLLAVLSGCSHGRMEDTGLNI 386

Query: 409 FESMVK-EHGIQPSVEHYACMVDLLGRANRLDEAAKIVEDLRIEPGPKVWGSLLGACRIH 468
           F+ MV  E+G +P  EHY C+VD+LGRA R+DEA + ++ +  +P   V GSLLGACR+H
Sbjct: 387 FDGMVAGEYGTKPGTEHYGCIVDMLGRAGRIDEAFEFIKRMPSKPTAGVLGSLLGACRVH 446

Query: 469 CNVELAERASKRLFELEPTNAGNYVLLADIYAEAEMWDEVKRVRKRLDSRELQKVPGRSW 528
            +V++ E   +RL E+EP NAGNYV+L+++YA A  W +V  VR  +  + + K PGRSW
Sbjct: 447 LSVDIGESVGRRLIEIEPENAGNYVILSNLYASAGRWADVNNVRAMMMQKAVTKEPGRSW 506

Query: 529 IEVRRKIYSFTSVDEFNPQGEQLHALLVNLSNEMKQRGYTPQTKVVLYDLDEEEKERIVL 588
           I+  + ++ F + D  +P+ E++ A +  +S +MKQ GY P    VLYD+DEE+KE+++L
Sbjct: 507 IQHEQTLHYFHANDRTHPRREEVLAKMKEISIKMKQAGYVPDLSCVLYDVDEEQKEKMLL 566

Query: 589 GHSEKLAVAFGLINTSKGDTIRITKNLRLCEDCHSVTKFISKFADREIMVRDVNRFHHFK 648
           GHSEKLA+ FGLI T +G  IR+ KNLR+C DCH+  K  SK  +RE+ +RD NRFH   
Sbjct: 567 GHSEKLALTFGLIATGEGIPIRVFKNLRICVDCHNFAKIFSKVFEREVSLRDKNRFHQIV 626

Query: 649 DGVCSCGDY 653
           DG+CSCGDY
Sbjct: 627 DGICSCGDY 627

BLAST of CaUC03G051180 vs. ExPASy Swiss-Prot
Match: Q9SN39 (Pentatricopeptide repeat-containing protein DOT4, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=DOT4 PE=2 SV=1)

HSP 1 Score: 474.6 bits (1220), Expect = 2.6e-132
Identity = 234/546 (42.86%), Postives = 347/546 (63.55%), Query Frame = 0

Query: 104 FDQDPFLATKLINMFSELDSVDNARKVFDKTRKRTIYVWNALFRALALAGRGSDVLELYA 163
           F ++      L++M+S+   +D+A+ VF +   R++  + ++    A  G   + ++L+ 
Sbjct: 327 FSREDRFCNTLLDMYSKCGDLDSAKAVFREMSDRSVVSYTSMIAGYAREGLAGEAVKLFE 386

Query: 164 RMNMMGVPSDRFTYTYLLKACVASECLVSFLQKGKEIHAHILRHGYEAHVHVMTTLVDMY 223
            M   G+  D +T T +L  C         L +GK +H  I  +     + V   L+DMY
Sbjct: 387 EMEEEGISPDVYTVTAVLNCCAR----YRLLDEGKRVHEWIKENDLGFDIFVSNALMDMY 446

Query: 224 ARFGCVSYASAVFDEMPVRNVVSWSAMIACYAKNGKAYEALELFREMMLNTHDSVPNPVT 283
           A+ G +  A  VF EM V++++SW+ +I  Y+KN  A EAL LF  ++L      P+  T
Sbjct: 447 AKCGSMQEAELVFSEMRVKDIISWNTIIGGYSKNCYANEALSLF-NLLLEEKRFSPDERT 506

Query: 284 MVSVLQACAALAALEQGKLIHAYILRRGLDSILPVISALVTMYARCGKLELGQLVFDRMH 343
           +  VL ACA+L+A ++G+ IH YI+R G  S   V ++LV MYA+CG L L  ++FD + 
Sbjct: 507 VACVLPACASLSAFDKGREIHGYIMRNGYFSDRHVANSLVDMYAKCGALLLAHMLFDDIA 566

Query: 344 KRDVVLWNSLISSYGVHGYGRKAIEIFEEMIDHGVSPSYISFVSVLGACSHAGLVEEGKE 403
            +D+V W  +I+ YG+HG+G++AI +F +M   G+    ISFVS+L ACSH+GLV+EG  
Sbjct: 567 SKDLVSWTVMIAGYGMHGFGKEAIALFNQMRQAGIEADEISFVSLLYACSHSGLVDEGWR 626

Query: 404 LFESMVKEHGIQPSVEHYACMVDLLGRANRLDEAAKIVEDLRIEPGPKVWGSLLGACRIH 463
            F  M  E  I+P+VEHYAC+VD+L R   L +A + +E++ I P   +WG+LL  CRIH
Sbjct: 627 FFNIMRHECKIEPTVEHYACIVDMLARTGDLIKAYRFIENMPIPPDATIWGALLCGCRIH 686

Query: 464 CNVELAERASKRLFELEPTNAGNYVLLADIYAEAEMWDEVKRVRKRLDSRELQKVPGRSW 523
            +V+LAE+ ++++FELEP N G YVL+A+IYAEAE W++VKR+RKR+  R L+K PG SW
Sbjct: 687 HDVKLAEKVAEKVFELEPENTGYYVLMANIYAEAEKWEQVKRLRKRIGQRGLRKNPGCSW 746

Query: 524 IEVRRKIYSFTSVDEFNPQGEQLHALLVNLSNEMKQRGYTPQTKVVLYDLDEEEKERIVL 583
           IE++ ++  F + D  NP+ E + A L  +   M + GY+P TK  L D +E EKE  + 
Sbjct: 747 IEIKGRVNIFVAGDSSNPETENIEAFLRKVRARMIEEGYSPLTKYALIDAEEMEKEEALC 806

Query: 584 GHSEKLAVAFGLINTSKGDTIRITKNLRLCEDCHSVTKFISKFADREIMVRDVNRFHHFK 643
           GHSEKLA+A G+I++  G  IR+TKNLR+C DCH + KF+SK   REI++RD NRFH FK
Sbjct: 807 GHSEKLAMALGIISSGHGKIIRVTKNLRVCGDCHEMAKFMSKLTRREIVLRDSNRFHQFK 866

Query: 644 DGVCSC 650
           DG CSC
Sbjct: 867 DGHCSC 867

BLAST of CaUC03G051180 vs. ExPASy Swiss-Prot
Match: Q9LTV8 (Pentatricopeptide repeat-containing protein At3g12770 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H43 PE=2 SV=1)

HSP 1 Score: 472.6 bits (1215), Expect = 9.8e-132
Identity = 242/613 (39.48%), Postives = 385/613 (62.81%), Query Frame = 0

Query: 46  NQLIQSLCKQGNLKQALLLLSH----ESNPTQQTWELLILSAARRNSLSDGLDVQRHLVD 105
           N +I+   +  + + ALL+ S+      +P   T+  L+ + +  + L  G  V   +  
Sbjct: 88  NAIIRGYSRNNHFQDALLMYSNMQLARVSPDSFTFPHLLKACSGLSHLQMGRFVHAQVFR 147

Query: 106 GGFDQDPFLATKLINMFSELDSVDNARKVFD--KTRKRTIYVWNALFRALALAGRGSDVL 165
            GFD D F+   LI ++++   + +AR VF+     +RTI  W A+  A A  G   + L
Sbjct: 148 LGFDADVFVQNGLIALYAKCRRLGSARTVFEGLPLPERTIVSWTAIVSAYAQNGEPMEAL 207

Query: 166 ELYARMNMMGVPSDRFTYTYLLKACVASECLVSFLQKGKEIHAHILRHGYEAHVHVMTTL 225
           E++++M  M V  D   +  L+    A  CL   L++G+ IHA +++ G E    ++ +L
Sbjct: 208 EIFSQMRKMDVKPD---WVALVSVLNAFTCLQD-LKQGRSIHASVVKMGLEIEPDLLISL 267

Query: 226 VDMYARFGCVSYASAVFDEMPVRNVVSWSAMIACYAKNGKAYEALELFREMMLNTHDSVP 285
             MYA+ G V+ A  +FD+M   N++ W+AMI+ YAKNG A EA+++F EM+    D  P
Sbjct: 268 NTMYAKCGQVATAKILFDKMKSPNLILWNAMISGYAKNGYAREAIDMFHEMI--NKDVRP 327

Query: 286 NPVTMVSVLQACAALAALEQGKLIHAYILRRGLDSILPVISALVTMYARCGKLELGQLVF 345
           + +++ S + ACA + +LEQ + ++ Y+ R      + + SAL+ M+A+CG +E  +LVF
Sbjct: 328 DTISITSAISACAQVGSLEQARSMYEYVGRSDYRDDVFISSALIDMFAKCGSVEGARLVF 387

Query: 346 DRMHKRDVVLWNSLISSYGVHGYGRKAIEIFEEMIDHGVSPSYISFVSVLGACSHAGLVE 405
           DR   RDVV+W+++I  YG+HG  R+AI ++  M   GV P+ ++F+ +L AC+H+G+V 
Sbjct: 388 DRTLDRDVVVWSAMIVGYGLHGRAREAISLYRAMERGGVHPNDVTFLGLLMACNHSGMVR 447

Query: 406 EGKELFESMVKEHGIQPSVEHYACMVDLLGRANRLDEAAKIVEDLRIEPGPKVWGSLLGA 465
           EG   F  M  +H I P  +HYAC++DLLGRA  LD+A ++++ + ++PG  VWG+LL A
Sbjct: 448 EGWWFFNRMA-DHKINPQQQHYACVIDLLGRAGHLDQAYEVIKCMPVQPGVTVWGALLSA 507

Query: 466 CRIHCNVELAERASKRLFELEPTNAGNYVLLADIYAEAEMWDEVKRVRKRLDSRELQKVP 525
           C+ H +VEL E A+++LF ++P+N G+YV L+++YA A +WD V  VR R+  + L K  
Sbjct: 508 CKKHRHVELGEYAAQQLFSIDPSNTGHYVQLSNLYAAARLWDRVAEVRVRMKEKGLNKDV 567

Query: 526 GRSWIEVRRKIYSFTSVDEFNPQGEQLHALLVNLSNEMKQRGYTPQTKVVLYDLDEEEKE 585
           G SW+EVR ++ +F   D+ +P+ E++   +  + + +K+ G+       L+DL++EE E
Sbjct: 568 GCSWVEVRGRLEAFRVGDKSHPRYEEIERQVEWIESRLKEGGFVANKDASLHDLNDEEAE 627

Query: 586 RIVLGHSEKLAVAFGLINTSKGDTIRITKNLRLCEDCHSVTKFISKFADREIMVRDVNRF 645
             +  HSE++A+A+GLI+T +G  +RITKNLR C +CH+ TK ISK  DREI+VRD NRF
Sbjct: 628 ETLCSHSERIAIAYGLISTPQGTPLRITKNLRACVNCHAATKLISKLVDREIVVRDTNRF 687

Query: 646 HHFKDGVCSCGDY 653
           HHFKDGVCSCGDY
Sbjct: 688 HHFKDGVCSCGDY 693

BLAST of CaUC03G051180 vs. ExPASy Swiss-Prot
Match: O81767 (Pentatricopeptide repeat-containing protein At4g33990 OS=Arabidopsis thaliana OX=3702 GN=EMB2758 PE=3 SV=2)

HSP 1 Score: 472.2 bits (1214), Expect = 1.3e-131
Identity = 240/608 (39.47%), Postives = 379/608 (62.34%), Query Frame = 0

Query: 46  NQLIQSLCKQGNLKQALLLLSHESNPTQQTWELLILSAARRNSLSDGLDVQRHLVDGGFD 105
           N +I   C+ GN K+AL L +        T   L+ +       + G+ +  + +  G +
Sbjct: 220 NAMISGYCQSGNAKEALTLSNGLRAMDSVTVVSLLSACTEAGDFNRGVTIHSYSIKHGLE 279

Query: 106 QDPFLATKLINMFSELDSVDNARKVFDKTRKRTIYVWNALFRALALAGRGSDVLELYARM 165
            + F++ KLI++++E   + + +KVFD+   R +  WN++ +A  L  +    + L+  M
Sbjct: 280 SELFVSNKLIDLYAEFGRLRDCQKVFDRMYVRDLISWNSIIKAYELNEQPLRAISLFQEM 339

Query: 166 NMMGVPSDRFTYTYLLKACVASECLVSFLQKGKEIHAHILRHG-YEAHVHVMTTLVDMYA 225
            +  +  D  T   L  A + S+  +  ++  + +    LR G +   + +   +V MYA
Sbjct: 340 RLSRIQPDCLTLISL--ASILSQ--LGDIRACRSVQGFTLRKGWFLEDITIGNAVVVMYA 399

Query: 226 RFGCVSYASAVFDEMPVRNVVSWSAMIACYAKNGKAYEALELFREMMLNTHDSVPNPVTM 285
           + G V  A AVF+ +P  +V+SW+ +I+ YA+NG A EA+E++  +M    +   N  T 
Sbjct: 400 KLGLVDSARAVFNWLPNTDVISWNTIISGYAQNGFASEAIEMY-NIMEEEGEIAANQGTW 459

Query: 286 VSVLQACAALAALEQGKLIHAYILRRGLDSILPVISALVTMYARCGKLELGQLVFDRMHK 345
           VSVL AC+   AL QG  +H  +L+ GL   + V+++L  MY +CG+LE    +F ++ +
Sbjct: 460 VSVLPACSQAGALRQGMKLHGRLLKNGLYLDVFVVTSLADMYGKCGRLEDALSLFYQIPR 519

Query: 346 RDVVLWNSLISSYGVHGYGRKAIEIFEEMIDHGVSPSYISFVSVLGACSHAGLVEEGKEL 405
            + V WN+LI+ +G HG+G KA+ +F+EM+D GV P +I+FV++L ACSH+GLV+EG+  
Sbjct: 520 VNSVPWNTLIACHGFHGHGEKAVMLFKEMLDEGVKPDHITFVTLLSACSHSGLVDEGQWC 579

Query: 406 FESMVKEHGIQPSVEHYACMVDLLGRANRLDEAAKIVEDLRIEPGPKVWGSLLGACRIHC 465
           FE M  ++GI PS++HY CMVD+ GRA +L+ A K ++ + ++P   +WG+LL ACR+H 
Sbjct: 580 FEMMQTDYGITPSLKHYGCMVDMYGRAGQLETALKFIKSMSLQPDASIWGALLSACRVHG 639

Query: 466 NVELAERASKRLFELEPTNAGNYVLLADIYAEAEMWDEVKRVRKRLDSRELQKVPGRSWI 525
           NV+L + AS+ LFE+EP + G +VLL+++YA A  W+ V  +R     + L+K PG S +
Sbjct: 640 NVDLGKIASEHLFEVEPEHVGYHVLLSNMYASAGKWEGVDEIRSIAHGKGLRKTPGWSSM 699

Query: 526 EVRRKIYSFTSVDEFNPQGEQLHALLVNLSNEMKQRGYTPQTKVVLYDLDEEEKERIVLG 585
           EV  K+  F + ++ +P  E+++  L  L  ++K  GY P  + VL D++++EKE I++ 
Sbjct: 700 EVDNKVEVFYTGNQTHPMYEEMYRELTALQAKLKMIGYVPDHRFVLQDVEDDEKEHILMS 759

Query: 586 HSEKLAVAFGLINTSKGDTIRITKNLRLCEDCHSVTKFISKFADREIMVRDVNRFHHFKD 645
           HSE+LA+AF LI T    TIRI KNLR+C DCHSVTKFISK  +REI+VRD NRFHHFK+
Sbjct: 760 HSERLAIAFALIATPAKTTIRIFKNLRVCGDCHSVTKFISKITEREIIVRDSNRFHHFKN 819

Query: 646 GVCSCGDY 653
           GVCSCGDY
Sbjct: 820 GVCSCGDY 822

BLAST of CaUC03G051180 vs. ExPASy TrEMBL
Match: A0A6J1CLW0 (pentatricopeptide repeat-containing protein At3g46790, chloroplastic OS=Momordica charantia OX=3673 GN=LOC111012148 PE=3 SV=1)

HSP 1 Score: 1238.8 bits (3204), Expect = 0.0e+00
Identity = 616/652 (94.48%), Postives = 629/652 (96.47%), Query Frame = 0

Query: 1   MWALRTPHSTQYPPSSRRHCSAHSTSKPSLCSVSLNPSTAGNSNKNQLIQSLCKQGNLKQ 60
           MWALRTP ST YPPSSRRHCSAHSTS+PS+CS++LNPS A N NKNQLIQSLCKQGNLKQ
Sbjct: 1   MWALRTPQSTPYPPSSRRHCSAHSTSRPSVCSLALNPSIAANPNKNQLIQSLCKQGNLKQ 60

Query: 61  ALLLLSHESNPTQQTWELLILSAARRNSLSDGLDVQRHLVDGGFDQDPFLATKLINMFSE 120
           ALLLLSHESNPTQQT ELLILSAARRNSLSDGLDV RHLVDGGFDQDPFLATKLINMFSE
Sbjct: 61  ALLLLSHESNPTQQTCELLILSAARRNSLSDGLDVHRHLVDGGFDQDPFLATKLINMFSE 120

Query: 121 LDSVDNARKVFDKTRKRTIYVWNALFRALALAGRGSDVLELYARMNMMGVPSDRFTYTYL 180
           LDSVD+ARKVFDKTR RTIYVWNALFRALALAG G +VLELYARMNM GVPSDRFTYTYL
Sbjct: 121 LDSVDDARKVFDKTRNRTIYVWNALFRALALAGHGKEVLELYARMNMTGVPSDRFTYTYL 180

Query: 181 LKACVASECLVSFLQKGKEIHAHILRHGYEAHVHVMTTLVDMYARFGCVSYASAVFDEMP 240
           LKACVASECLVS L+KGKEIHAHILRHGYEAHVHVMTTLVDMYARFGCVSYASAVFDEMP
Sbjct: 181 LKACVASECLVSLLRKGKEIHAHILRHGYEAHVHVMTTLVDMYARFGCVSYASAVFDEMP 240

Query: 241 VRNVVSWSAMIACYAKNGKAYEALELFREMMLNTHDSVPNPVTMVSVLQACAALAALEQG 300
           VRNVVSWSA+IACYAKNGK YEALELF EMMLNTHDSVPN VTMVSVLQACAALAALEQG
Sbjct: 241 VRNVVSWSAIIACYAKNGKPYEALELFCEMMLNTHDSVPNSVTMVSVLQACAALAALEQG 300

Query: 301 KLIHAYILRRGLDSILPVISALVTMYARCGKLELGQLVFDRMHKRDVVLWNSLISSYGVH 360
           KLIH YILRRGLDSILPVISALVTMYARCGKLELGQLVFDRMHKRDVVLWNSLIS YGVH
Sbjct: 301 KLIHGYILRRGLDSILPVISALVTMYARCGKLELGQLVFDRMHKRDVVLWNSLISGYGVH 360

Query: 361 GYGRKAIEIFEEMIDHGVSPSYISFVSVLGACSHAGLVEEGKELFESMVKEHGIQPSVEH 420
           GYGRKAIEIFEEMIDHG SPSYISFVSVLGACSHAGLVEEGKELFESMVKEHGI PSVEH
Sbjct: 361 GYGRKAIEIFEEMIDHGFSPSYISFVSVLGACSHAGLVEEGKELFESMVKEHGIHPSVEH 420

Query: 421 YACMVDLLGRANRLDEAAKIVEDLRIEPGPKVWGSLLGACRIHCNVELAERASKRLFELE 480
           YACMVDLLGRANRLDEAAKIVED+R+EPGPKVWGSLLGACRIHC+VELAERASKRLFELE
Sbjct: 421 YACMVDLLGRANRLDEAAKIVEDMRLEPGPKVWGSLLGACRIHCHVELAERASKRLFELE 480

Query: 481 PTNAGNYVLLADIYAEAEMWDEVKRVRKRLDSRELQKVPGRSWIEVRRKIYSFTSVDEFN 540
           PTNAGNYVLLADIYAEAEMWD+VKRV+K LDSRELQKVPGRSWIEVRRKIYSFTSVDEFN
Sbjct: 481 PTNAGNYVLLADIYAEAEMWDKVKRVKKLLDSRELQKVPGRSWIEVRRKIYSFTSVDEFN 540

Query: 541 PQGEQLHALLVNLSNEMKQRGYTPQTKVVLYDLDEEEKERIVLGHSEKLAVAFGLINTSK 600
           PQ EQLHALLVNLS EMKQRGY PQTKVVLYDLDEEEKERIVLGHSEKLAVAFGLINTSK
Sbjct: 541 PQVEQLHALLVNLSKEMKQRGYIPQTKVVLYDLDEEEKERIVLGHSEKLAVAFGLINTSK 600

Query: 601 GDTIRITKNLRLCEDCHSVTKFISKFADREIMVRDVNRFHHFKDGVCSCGDY 653
           GDTIRITKNLRLCEDCHSVTKFISKFADREIMVRDVNRFHHFKDGVCSCGDY
Sbjct: 601 GDTIRITKNLRLCEDCHSVTKFISKFADREIMVRDVNRFHHFKDGVCSCGDY 652

BLAST of CaUC03G051180 vs. ExPASy TrEMBL
Match: A0A5A7VHC3 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold352G00960 PE=3 SV=1)

HSP 1 Score: 1229.2 bits (3179), Expect = 0.0e+00
Identity = 606/652 (92.94%), Postives = 632/652 (96.93%), Query Frame = 0

Query: 1   MWALRTPHSTQYPPSSRRHCSAHSTSKPSLCSVSLNPSTAGNSNKNQLIQSLCKQGNLKQ 60
           MWALRTPHSTQYPPSSRRH SAHSTSK S+CS SLNPST+ NSNK+QLIQSLCK+GNLKQ
Sbjct: 1   MWALRTPHSTQYPPSSRRHSSAHSTSKLSVCSFSLNPSTSANSNKDQLIQSLCKEGNLKQ 60

Query: 61  ALLLLSHESNPTQQTWELLILSAARRNSLSDGLDVQRHLVDGGFDQDPFLATKLINMFSE 120
           AL+LLSHE NPTQQT ELLILSAARR SLSD LDV +HLVDGGFDQDPFLATKLINMFSE
Sbjct: 61  ALVLLSHEPNPTQQTCELLILSAARRKSLSDALDVHQHLVDGGFDQDPFLATKLINMFSE 120

Query: 121 LDSVDNARKVFDKTRKRTIYVWNALFRALALAGRGSDVLELYARMNMMGVPSDRFTYTYL 180
           LDSVDNARKVFDKTRKRTIYVWNALFRALALAG G+DVLELY RM+MMGVP DRFTYTYL
Sbjct: 121 LDSVDNARKVFDKTRKRTIYVWNALFRALALAGHGNDVLELYPRMDMMGVPCDRFTYTYL 180

Query: 181 LKACVASECLVSFLQKGKEIHAHILRHGYEAHVHVMTTLVDMYARFGCVSYASAVFDEMP 240
           LKACVAS+CLVSFLQKGKEIHAHILRHGY AHVHVMTTLVDMYARFGCVSYASAVFDEMP
Sbjct: 181 LKACVASDCLVSFLQKGKEIHAHILRHGYGAHVHVMTTLVDMYARFGCVSYASAVFDEMP 240

Query: 241 VRNVVSWSAMIACYAKNGKAYEALELFREMMLNTHDSVPNPVTMVSVLQACAALAALEQG 300
           V+NVVSWSAMIACYAKNGK YEALELFR+MMLNTHD VPN VTMVSVLQACAA AALEQG
Sbjct: 241 VKNVVSWSAMIACYAKNGKPYEALELFRDMMLNTHDLVPNSVTMVSVLQACAAFAALEQG 300

Query: 301 KLIHAYILRRGLDSILPVISALVTMYARCGKLELGQLVFDRMHKRDVVLWNSLISSYGVH 360
           KLIHAYILRRGLDSILPVISALVTMYARCGKLELGQ++FDR+HK+DV+LWNSL SSYG+H
Sbjct: 301 KLIHAYILRRGLDSILPVISALVTMYARCGKLELGQVIFDRIHKKDVILWNSLFSSYGLH 360

Query: 361 GYGRKAIEIFEEMIDHGVSPSYISFVSVLGACSHAGLVEEGKELFESMVKEHGIQPSVEH 420
           GYGRKAIEIFEEMID+G+SPSYISFVSVLGACSHAGLVEEGK+LFESMVKEHGIQPSVEH
Sbjct: 361 GYGRKAIEIFEEMIDNGISPSYISFVSVLGACSHAGLVEEGKKLFESMVKEHGIQPSVEH 420

Query: 421 YACMVDLLGRANRLDEAAKIVEDLRIEPGPKVWGSLLGACRIHCNVELAERASKRLFELE 480
           YACMVDLLGRANRLDEAAKI+EDLRIEPGPKVWGSLLGACRIHC+VELAERASKRLFELE
Sbjct: 421 YACMVDLLGRANRLDEAAKIIEDLRIEPGPKVWGSLLGACRIHCHVELAERASKRLFELE 480

Query: 481 PTNAGNYVLLADIYAEAEMWDEVKRVRKRLDSRELQKVPGRSWIEVRRKIYSFTSVDEFN 540
           PTNAGNYVLLADIYAEAEMWDEVKRVRK L+SRELQKVPGRSWIEVRRKIYSFTSVDEFN
Sbjct: 481 PTNAGNYVLLADIYAEAEMWDEVKRVRKLLNSRELQKVPGRSWIEVRRKIYSFTSVDEFN 540

Query: 541 PQGEQLHALLVNLSNEMKQRGYTPQTKVVLYDLDEEEKERIVLGHSEKLAVAFGLINTSK 600
           PQGEQLHALLVNLSNEMKQRGY PQTK+VLYDLD+EEKERIVLGHSEKLAVAFGLINTSK
Sbjct: 541 PQGEQLHALLVNLSNEMKQRGYVPQTKLVLYDLDQEEKERIVLGHSEKLAVAFGLINTSK 600

Query: 601 GDTIRITKNLRLCEDCHSVTKFISKFADREIMVRDVNRFHHFKDGVCSCGDY 653
           GDTIRITKNLRLCEDCHSVTKFISKFADREIMVRD+NRFHHFKDGVCSCGDY
Sbjct: 601 GDTIRITKNLRLCEDCHSVTKFISKFADREIMVRDLNRFHHFKDGVCSCGDY 652

BLAST of CaUC03G051180 vs. ExPASy TrEMBL
Match: A0A5D3BEB8 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold194G001880 PE=3 SV=1)

HSP 1 Score: 1228.0 bits (3176), Expect = 0.0e+00
Identity = 605/652 (92.79%), Postives = 632/652 (96.93%), Query Frame = 0

Query: 1   MWALRTPHSTQYPPSSRRHCSAHSTSKPSLCSVSLNPSTAGNSNKNQLIQSLCKQGNLKQ 60
           MWALRTPHSTQYPPSSRRH SAHSTSK S+CS SLNPST+ NSNK+QLIQSLCK+GNLKQ
Sbjct: 1   MWALRTPHSTQYPPSSRRHSSAHSTSKLSVCSFSLNPSTSANSNKDQLIQSLCKEGNLKQ 60

Query: 61  ALLLLSHESNPTQQTWELLILSAARRNSLSDGLDVQRHLVDGGFDQDPFLATKLINMFSE 120
           AL+LLSHE NPTQQT ELLILSA+RR SLSD LDV +HLVDGGFDQDPFLATKLINMFSE
Sbjct: 61  ALVLLSHEPNPTQQTCELLILSASRRKSLSDALDVHQHLVDGGFDQDPFLATKLINMFSE 120

Query: 121 LDSVDNARKVFDKTRKRTIYVWNALFRALALAGRGSDVLELYARMNMMGVPSDRFTYTYL 180
           LDSVDNARKVFDKTRKRTIYVWNALFRALALAG G+DVLELY RM+MMGVP DRFTYTYL
Sbjct: 121 LDSVDNARKVFDKTRKRTIYVWNALFRALALAGHGNDVLELYPRMDMMGVPCDRFTYTYL 180

Query: 181 LKACVASECLVSFLQKGKEIHAHILRHGYEAHVHVMTTLVDMYARFGCVSYASAVFDEMP 240
           LKACVAS+CLVSFLQKGKEIHAHILRHGY AHVHVMTTLVDMYARFGCVSYASAVFDEMP
Sbjct: 181 LKACVASDCLVSFLQKGKEIHAHILRHGYGAHVHVMTTLVDMYARFGCVSYASAVFDEMP 240

Query: 241 VRNVVSWSAMIACYAKNGKAYEALELFREMMLNTHDSVPNPVTMVSVLQACAALAALEQG 300
           V+NVVSWSAMIACYAKNGK YEALELFR+MMLNTHD VPN VTMVSVLQACAA AALEQG
Sbjct: 241 VKNVVSWSAMIACYAKNGKPYEALELFRDMMLNTHDLVPNSVTMVSVLQACAAFAALEQG 300

Query: 301 KLIHAYILRRGLDSILPVISALVTMYARCGKLELGQLVFDRMHKRDVVLWNSLISSYGVH 360
           KLIHAYILRRGLDSILPVISALVTMYARCGKLELGQ++FDR+HK+DV+LWNSL SSYG+H
Sbjct: 301 KLIHAYILRRGLDSILPVISALVTMYARCGKLELGQVIFDRIHKKDVILWNSLFSSYGLH 360

Query: 361 GYGRKAIEIFEEMIDHGVSPSYISFVSVLGACSHAGLVEEGKELFESMVKEHGIQPSVEH 420
           GYGRKAIEIFEEMID+G+SPSYISFVSVLGACSHAGLVEEGK+LFESMVKEHGIQPSVEH
Sbjct: 361 GYGRKAIEIFEEMIDNGISPSYISFVSVLGACSHAGLVEEGKKLFESMVKEHGIQPSVEH 420

Query: 421 YACMVDLLGRANRLDEAAKIVEDLRIEPGPKVWGSLLGACRIHCNVELAERASKRLFELE 480
           YACMVDLLGRANRLDEAAKI+EDLRIEPGPKVWGSLLGACRIHC+VELAERASKRLFELE
Sbjct: 421 YACMVDLLGRANRLDEAAKIIEDLRIEPGPKVWGSLLGACRIHCHVELAERASKRLFELE 480

Query: 481 PTNAGNYVLLADIYAEAEMWDEVKRVRKRLDSRELQKVPGRSWIEVRRKIYSFTSVDEFN 540
           PTNAGNYVLLADIYAEAEMWDEVKRVRK L+SRELQKVPGRSWIEVRRKIYSFTSVDEFN
Sbjct: 481 PTNAGNYVLLADIYAEAEMWDEVKRVRKLLNSRELQKVPGRSWIEVRRKIYSFTSVDEFN 540

Query: 541 PQGEQLHALLVNLSNEMKQRGYTPQTKVVLYDLDEEEKERIVLGHSEKLAVAFGLINTSK 600
           PQGEQLHALLVNLSNEMKQRGY PQTK+VLYDLD+EEKERIVLGHSEKLAVAFGLINTSK
Sbjct: 541 PQGEQLHALLVNLSNEMKQRGYVPQTKLVLYDLDQEEKERIVLGHSEKLAVAFGLINTSK 600

Query: 601 GDTIRITKNLRLCEDCHSVTKFISKFADREIMVRDVNRFHHFKDGVCSCGDY 653
           GDTIRITKNLRLCEDCHSVTKFISKFADREIMVRD+NRFHHFKDGVCSCGDY
Sbjct: 601 GDTIRITKNLRLCEDCHSVTKFISKFADREIMVRDLNRFHHFKDGVCSCGDY 652

BLAST of CaUC03G051180 vs. ExPASy TrEMBL
Match: A0A1S3C5N6 (pentatricopeptide repeat-containing protein At3g46790, chloroplastic OS=Cucumis melo OX=3656 GN=LOC103497132 PE=3 SV=1)

HSP 1 Score: 1228.0 bits (3176), Expect = 0.0e+00
Identity = 605/652 (92.79%), Postives = 632/652 (96.93%), Query Frame = 0

Query: 1   MWALRTPHSTQYPPSSRRHCSAHSTSKPSLCSVSLNPSTAGNSNKNQLIQSLCKQGNLKQ 60
           MWALRTPHSTQYPPSSRRH SAHSTSK S+CS SLNPST+ NSNK+QLIQSLCK+GNLKQ
Sbjct: 1   MWALRTPHSTQYPPSSRRHSSAHSTSKLSVCSFSLNPSTSANSNKDQLIQSLCKEGNLKQ 60

Query: 61  ALLLLSHESNPTQQTWELLILSAARRNSLSDGLDVQRHLVDGGFDQDPFLATKLINMFSE 120
           AL+LLSHE NPTQQT ELLILSA+RR SLSD LDV +HLVDGGFDQDPFLATKLINMFSE
Sbjct: 61  ALVLLSHEPNPTQQTCELLILSASRRKSLSDALDVHQHLVDGGFDQDPFLATKLINMFSE 120

Query: 121 LDSVDNARKVFDKTRKRTIYVWNALFRALALAGRGSDVLELYARMNMMGVPSDRFTYTYL 180
           LDSVDNARKVFDKTRKRTIYVWNALFRALALAG G+DVLELY RM+MMGVP DRFTYTYL
Sbjct: 121 LDSVDNARKVFDKTRKRTIYVWNALFRALALAGHGNDVLELYPRMDMMGVPCDRFTYTYL 180

Query: 181 LKACVASECLVSFLQKGKEIHAHILRHGYEAHVHVMTTLVDMYARFGCVSYASAVFDEMP 240
           LKACVAS+CLVSFLQKGKEIHAHILRHGY AHVHVMTTLVDMYARFGCVSYASAVFDEMP
Sbjct: 181 LKACVASDCLVSFLQKGKEIHAHILRHGYGAHVHVMTTLVDMYARFGCVSYASAVFDEMP 240

Query: 241 VRNVVSWSAMIACYAKNGKAYEALELFREMMLNTHDSVPNPVTMVSVLQACAALAALEQG 300
           V+NVVSWSAMIACYAKNGK YEALELFR+MMLNTHD VPN VTMVSVLQACAA AALEQG
Sbjct: 241 VKNVVSWSAMIACYAKNGKPYEALELFRDMMLNTHDLVPNSVTMVSVLQACAAFAALEQG 300

Query: 301 KLIHAYILRRGLDSILPVISALVTMYARCGKLELGQLVFDRMHKRDVVLWNSLISSYGVH 360
           KLIHAYILRRGLDSILPVISALVTMYARCGKLELGQ++FDR+HK+DV+LWNSL SSYG+H
Sbjct: 301 KLIHAYILRRGLDSILPVISALVTMYARCGKLELGQVIFDRIHKKDVILWNSLFSSYGLH 360

Query: 361 GYGRKAIEIFEEMIDHGVSPSYISFVSVLGACSHAGLVEEGKELFESMVKEHGIQPSVEH 420
           GYGRKAIEIFEEMID+G+SPSYISFVSVLGACSHAGLVEEGK+LFESMVKEHGIQPSVEH
Sbjct: 361 GYGRKAIEIFEEMIDNGISPSYISFVSVLGACSHAGLVEEGKKLFESMVKEHGIQPSVEH 420

Query: 421 YACMVDLLGRANRLDEAAKIVEDLRIEPGPKVWGSLLGACRIHCNVELAERASKRLFELE 480
           YACMVDLLGRANRLDEAAKI+EDLRIEPGPKVWGSLLGACRIHC+VELAERASKRLFELE
Sbjct: 421 YACMVDLLGRANRLDEAAKIIEDLRIEPGPKVWGSLLGACRIHCHVELAERASKRLFELE 480

Query: 481 PTNAGNYVLLADIYAEAEMWDEVKRVRKRLDSRELQKVPGRSWIEVRRKIYSFTSVDEFN 540
           PTNAGNYVLLADIYAEAEMWDEVKRVRK L+SRELQKVPGRSWIEVRRKIYSFTSVDEFN
Sbjct: 481 PTNAGNYVLLADIYAEAEMWDEVKRVRKLLNSRELQKVPGRSWIEVRRKIYSFTSVDEFN 540

Query: 541 PQGEQLHALLVNLSNEMKQRGYTPQTKVVLYDLDEEEKERIVLGHSEKLAVAFGLINTSK 600
           PQGEQLHALLVNLSNEMKQRGY PQTK+VLYDLD+EEKERIVLGHSEKLAVAFGLINTSK
Sbjct: 541 PQGEQLHALLVNLSNEMKQRGYVPQTKLVLYDLDQEEKERIVLGHSEKLAVAFGLINTSK 600

Query: 601 GDTIRITKNLRLCEDCHSVTKFISKFADREIMVRDVNRFHHFKDGVCSCGDY 653
           GDTIRITKNLRLCEDCHSVTKFISKFADREIMVRD+NRFHHFKDGVCSCGDY
Sbjct: 601 GDTIRITKNLRLCEDCHSVTKFISKFADREIMVRDLNRFHHFKDGVCSCGDY 652

BLAST of CaUC03G051180 vs. ExPASy TrEMBL
Match: A0A6J1HLJ1 (pentatricopeptide repeat-containing protein At3g46790, chloroplastic OS=Cucurbita moschata OX=3662 GN=LOC111464697 PE=3 SV=1)

HSP 1 Score: 1226.8 bits (3173), Expect = 0.0e+00
Identity = 608/652 (93.25%), Postives = 624/652 (95.71%), Query Frame = 0

Query: 1   MWALRTPHSTQYPPSSRRHCSAHSTSKPSLCSVSLNPSTAGNSNKNQLIQSLCKQGNLKQ 60
           MWALRTP  TQYPP SRRH SAHS SKPS+CSVSLN STA NSNKNQLIQSLCKQGNLKQ
Sbjct: 1   MWALRTPQYTQYPPLSRRHSSAHSPSKPSICSVSLNSSTAANSNKNQLIQSLCKQGNLKQ 60

Query: 61  ALLLLSHESNPTQQTWELLILSAARRNSLSDGLDVQRHLVDGGFDQDPFLATKLINMFSE 120
           ALLLLSHESNPTQ+TWELLILSAARRNSLSDGLDV R LVDGGFDQDPFLATKLINMFSE
Sbjct: 61  ALLLLSHESNPTQRTWELLILSAARRNSLSDGLDVHRRLVDGGFDQDPFLATKLINMFSE 120

Query: 121 LDSVDNARKVFDKTRKRTIYVWNALFRALALAGRGSDVLELYARMNMMGVPSDRFTYTYL 180
           L+SVDN RKVFDKTRKRTI+VWNALFRALALAG G DVLELYA+MNMMGVPSDRFTYTYL
Sbjct: 121 LESVDNVRKVFDKTRKRTIFVWNALFRALALAGHGKDVLELYAQMNMMGVPSDRFTYTYL 180

Query: 181 LKACVASECLVSFLQKGKEIHAHILRHGYEAHVHVMTTLVDMYARFGCVSYASAVFDEMP 240
           LKAC ASECLVSFLQKGKEIHAHILRHGYEAHVH MTTLVDMYARFGCVSYA AVFDEMP
Sbjct: 181 LKACAASECLVSFLQKGKEIHAHILRHGYEAHVHAMTTLVDMYARFGCVSYAGAVFDEMP 240

Query: 241 VRNVVSWSAMIACYAKNGKAYEALELFREMMLNTHDSVPNPVTMVSVLQACAALAALEQG 300
           VRNVVSWSAMIACYAKNGK YEALELFREMMLNTHD+VPN VTMVSVLQ+CAAL+ALEQG
Sbjct: 241 VRNVVSWSAMIACYAKNGKPYEALELFREMMLNTHDTVPNSVTMVSVLQSCAALSALEQG 300

Query: 301 KLIHAYILRRGLDSILPVISALVTMYARCGKLELGQLVFDRMHKRDVVLWNSLISSYGVH 360
           KLIHAYILRRGLDSILPVISALVTMYARCGKLELGQLVFD MHKRDVV+WNSLISSYGVH
Sbjct: 301 KLIHAYILRRGLDSILPVISALVTMYARCGKLELGQLVFDFMHKRDVVIWNSLISSYGVH 360

Query: 361 GYGRKAIEIFEEMIDHGVSPSYISFVSVLGACSHAGLVEEGKELFESMVKEHGIQPSVEH 420
           GYGRKAIEIFEEM+DHGVSPSYISFVSVLGACSHAGLVEEGKELFESM K+HGIQP  EH
Sbjct: 361 GYGRKAIEIFEEMVDHGVSPSYISFVSVLGACSHAGLVEEGKELFESMAKKHGIQPGEEH 420

Query: 421 YACMVDLLGRANRLDEAAKIVEDLRIEPGPKVWGSLLGACRIHCNVELAERASKRLFELE 480
           YACMVDLLGRANRLDEAAKIVEDLRIEPGPKVWGSLLGACRIHC+VELAERASKRLFELE
Sbjct: 421 YACMVDLLGRANRLDEAAKIVEDLRIEPGPKVWGSLLGACRIHCHVELAERASKRLFELE 480

Query: 481 PTNAGNYVLLADIYAEAEMWDEVKRVRKRLDSRELQKVPGRSWIEVRRKIYSFTSVDEFN 540
           PTNAGNYVLLAD+YAEA MWDEVKRVRK LDSREL+KVPGRSWIEVRRKIYSFTSVDEF 
Sbjct: 481 PTNAGNYVLLADVYAEANMWDEVKRVRKLLDSRELRKVPGRSWIEVRRKIYSFTSVDEFY 540

Query: 541 PQGEQLHALLVNLSNEMKQRGYTPQTKVVLYDLDEEEKERIVLGHSEKLAVAFGLINTSK 600
           PQ EQLHALL+NLSNEMKQRGYTP TKVVLYDLDEEEKERIVLGHSEKLAVAFGLINTSK
Sbjct: 541 PQAEQLHALLMNLSNEMKQRGYTPHTKVVLYDLDEEEKERIVLGHSEKLAVAFGLINTSK 600

Query: 601 GDTIRITKNLRLCEDCHSVTKFISKFADREIMVRDVNRFHHFKDGVCSCGDY 653
           GDTIRI+KNLRLCEDCHS TKFISKFAD EIMVRDVNRFHHFKDGVCSCGDY
Sbjct: 601 GDTIRISKNLRLCEDCHSFTKFISKFADIEIMVRDVNRFHHFKDGVCSCGDY 652

BLAST of CaUC03G051180 vs. TAIR 10
Match: AT3G46790.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 981.5 bits (2536), Expect = 4.6e-286
Identity = 485/634 (76.50%), Postives = 548/634 (86.44%), Query Frame = 0

Query: 24  STSKPSLCSVSL-NPSTAGNS----NKNQLIQSLCKQGNLKQALLLLSHESNPTQQTWEL 83
           S  KP  CSV+L NPS +  +    + NQLIQSLCK+G LKQA+ +LS ES+P+QQT+EL
Sbjct: 23  SPLKPPSCSVALNNPSISSGAGAKISNNQLIQSLCKEGKLKQAIRVLSQESSPSQQTYEL 82

Query: 84  LILSAARRNSLSDGLDVQRHLVDGGFDQDPFLATKLINMFSELDSVDNARKVFDKTRKRT 143
           LIL    R+SLSD L V RH++D G DQDPFLATKLI M+S+L SVD ARKVFDKTRKRT
Sbjct: 83  LILCCGHRSSLSDALRVHRHILDNGSDQDPFLATKLIGMYSDLGSVDYARKVFDKTRKRT 142

Query: 144 IYVWNALFRALALAGRGSDVLELYARMNMMGVPSDRFTYTYLLKACVASECLVSFLQKGK 203
           IYVWNALFRAL LAG G +VL LY +MN +GV SDRFTYTY+LKACVASEC V+ L KGK
Sbjct: 143 IYVWNALFRALTLAGHGEEVLGLYWKMNRIGVESDRFTYTYVLKACVASECTVNHLMKGK 202

Query: 204 EIHAHILRHGYEAHVHVMTTLVDMYARFGCVSYASAVFDEMPVRNVVSWSAMIACYAKNG 263
           EIHAH+ R GY +HV++MTTLVDMYARFGCV YAS VF  MPVRNVVSWSAMIACYAKNG
Sbjct: 203 EIHAHLTRRGYSSHVYIMTTLVDMYARFGCVDYASYVFGGMPVRNVVSWSAMIACYAKNG 262

Query: 264 KAYEALELFREMMLNTHDSVPNPVTMVSVLQACAALAALEQGKLIHAYILRRGLDSILPV 323
           KA+EAL  FREMM  T DS PN VTMVSVLQACA+LAALEQGKLIH YILRRGLDSILPV
Sbjct: 263 KAFEALRTFREMMRETKDSSPNSVTMVSVLQACASLAALEQGKLIHGYILRRGLDSILPV 322

Query: 324 ISALVTMYARCGKLELGQLVFDRMHKRDVVLWNSLISSYGVHGYGRKAIEIFEEMIDHGV 383
           ISALVTMY RCGKLE+GQ VFDRMH RDVV WNSLISSYGVHGYG+KAI+IFEEM+ +G 
Sbjct: 323 ISALVTMYGRCGKLEVGQRVFDRMHDRDVVSWNSLISSYGVHGYGKKAIQIFEEMLANGA 382

Query: 384 SPSYISFVSVLGACSHAGLVEEGKELFESMVKEHGIQPSVEHYACMVDLLGRANRLDEAA 443
           SP+ ++FVSVLGACSH GLVEEGK LFE+M ++HGI+P +EHYACMVDLLGRANRLDEAA
Sbjct: 383 SPTPVTFVSVLGACSHEGLVEEGKRLFETMWRDHGIKPQIEHYACMVDLLGRANRLDEAA 442

Query: 444 KIVEDLRIEPGPKVWGSLLGACRIHCNVELAERASKRLFELEPTNAGNYVLLADIYAEAE 503
           K+V+D+R EPGPKVWGSLLG+CRIH NVELAERAS+RLF LEP NAGNYVLLADIYAEA+
Sbjct: 443 KMVQDMRTEPGPKVWGSLLGSCRIHGNVELAERASRRLFALEPKNAGNYVLLADIYAEAQ 502

Query: 504 MWDEVKRVRKRLDSRELQKVPGRSWIEVRRKIYSFTSVDEFNPQGEQLHALLVNLSNEMK 563
           MWDEVKRV+K L+ R LQK+PGR W+EVRRK+YSF SVDEFNP  EQ+HA LV L+ +MK
Sbjct: 503 MWDEVKRVKKLLEHRGLQKLPGRCWMEVRRKMYSFVSVDEFNPLMEQIHAFLVKLAEDMK 562

Query: 564 QRGYTPQTKVVLYDLDEEEKERIVLGHSEKLAVAFGLINTSKGDTIRITKNLRLCEDCHS 623
           ++GY PQTK VLY+L+ EEKERIVLGHSEKLA+AFGLINTSKG+ IRITKNLRLCEDCH 
Sbjct: 563 EKGYIPQTKGVLYELETEEKERIVLGHSEKLALAFGLINTSKGEPIRITKNLRLCEDCHL 622

Query: 624 VTKFISKFADREIMVRDVNRFHHFKDGVCSCGDY 653
            TKFISKF ++EI+VRDVNRFH FK+GVCSCGDY
Sbjct: 623 FTKFISKFMEKEILVRDVNRFHRFKNGVCSCGDY 656

BLAST of CaUC03G051180 vs. TAIR 10
Match: AT3G13770.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 474.6 bits (1220), Expect = 1.8e-133
Identity = 240/609 (39.41%), Postives = 375/609 (61.58%), Query Frame = 0

Query: 49  IQSLCKQGNLKQALL---LLSHESNPTQQTWELLILSAARRNSLSDGLDVQRHLVDGGFD 108
           I  LC  G L++ALL   +L  E       ++ L+ +   + +L DG  V  H++   + 
Sbjct: 27  ISQLCSNGRLQEALLEMAMLGPEMG--FHGYDALLNACLDKRALRDGQRVHAHMIKTRYL 86

Query: 109 QDPFLATKLINMFSELDSVDNARKVFDKTRKRTIYVWNALFRALALAGRGSDVLELYARM 168
              +L T+L+  + + D +++ARKV D+  ++ +  W A+    +  G  S+ L ++A M
Sbjct: 87  PATYLRTRLLIFYGKCDCLEDARKVLDEMPEKNVVSWTAMISRYSQTGHSSEALTVFAEM 146

Query: 169 NMMGVPSDRFTYTYLLKACVASECLVSFLQKGKEIHAHILRHGYEAHVHVMTTLVDMYAR 228
                  + FT+  +L +C+ +    S L  GK+IH  I++  Y++H+ V ++L+DMYA+
Sbjct: 147 MRSDGKPNEFTFATVLTSCIRA----SGLGLGKQIHGLIVKWNYDSHIFVGSSLLDMYAK 206

Query: 229 FGCVSYASAVFDEMPVRNVVSWSAMIACYAKNGKAYEALELFREMMLNTHDSVPNPVTMV 288
            G +  A  +F+ +P R+VVS +A+IA YA+ G   EALE+F    L++    PN VT  
Sbjct: 207 AGQIKEAREIFECLPERDVVSCTAIIAGYAQLGLDEEALEMFHR--LHSEGMSPNYVTYA 266

Query: 289 SVLQACAALAALEQGKLIHAYILRRGLDSILPVISALVTMYARCGKLELGQLVFDRMHKR 348
           S+L A + LA L+ GK  H ++LRR L     + ++L+ MY++CG L   + +FD M +R
Sbjct: 267 SLLTALSGLALLDHGKQAHCHVLRRELPFYAVLQNSLIDMYSKCGNLSYARRLFDNMPER 326

Query: 349 DVVLWNSLISSYGVHGYGRKAIEIFEEMIDH-GVSPSYISFVSVLGACSHAGLVEEGKEL 408
             + WN+++  Y  HG GR+ +E+F  M D   V P  ++ ++VL  CSH  + + G  +
Sbjct: 327 TAISWNAMLVGYSKHGLGREVLELFRLMRDEKRVKPDAVTLLAVLSGCSHGRMEDTGLNI 386

Query: 409 FESMVK-EHGIQPSVEHYACMVDLLGRANRLDEAAKIVEDLRIEPGPKVWGSLLGACRIH 468
           F+ MV  E+G +P  EHY C+VD+LGRA R+DEA + ++ +  +P   V GSLLGACR+H
Sbjct: 387 FDGMVAGEYGTKPGTEHYGCIVDMLGRAGRIDEAFEFIKRMPSKPTAGVLGSLLGACRVH 446

Query: 469 CNVELAERASKRLFELEPTNAGNYVLLADIYAEAEMWDEVKRVRKRLDSRELQKVPGRSW 528
            +V++ E   +RL E+EP NAGNYV+L+++YA A  W +V  VR  +  + + K PGRSW
Sbjct: 447 LSVDIGESVGRRLIEIEPENAGNYVILSNLYASAGRWADVNNVRAMMMQKAVTKEPGRSW 506

Query: 529 IEVRRKIYSFTSVDEFNPQGEQLHALLVNLSNEMKQRGYTPQTKVVLYDLDEEEKERIVL 588
           I+  + ++ F + D  +P+ E++ A +  +S +MKQ GY P    VLYD+DEE+KE+++L
Sbjct: 507 IQHEQTLHYFHANDRTHPRREEVLAKMKEISIKMKQAGYVPDLSCVLYDVDEEQKEKMLL 566

Query: 589 GHSEKLAVAFGLINTSKGDTIRITKNLRLCEDCHSVTKFISKFADREIMVRDVNRFHHFK 648
           GHSEKLA+ FGLI T +G  IR+ KNLR+C DCH+  K  SK  +RE+ +RD NRFH   
Sbjct: 567 GHSEKLALTFGLIATGEGIPIRVFKNLRICVDCHNFAKIFSKVFEREVSLRDKNRFHQIV 626

Query: 649 DGVCSCGDY 653
           DG+CSCGDY
Sbjct: 627 DGICSCGDY 627

BLAST of CaUC03G051180 vs. TAIR 10
Match: AT4G18750.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 474.6 bits (1220), Expect = 1.8e-133
Identity = 234/546 (42.86%), Postives = 347/546 (63.55%), Query Frame = 0

Query: 104 FDQDPFLATKLINMFSELDSVDNARKVFDKTRKRTIYVWNALFRALALAGRGSDVLELYA 163
           F ++      L++M+S+   +D+A+ VF +   R++  + ++    A  G   + ++L+ 
Sbjct: 327 FSREDRFCNTLLDMYSKCGDLDSAKAVFREMSDRSVVSYTSMIAGYAREGLAGEAVKLFE 386

Query: 164 RMNMMGVPSDRFTYTYLLKACVASECLVSFLQKGKEIHAHILRHGYEAHVHVMTTLVDMY 223
            M   G+  D +T T +L  C         L +GK +H  I  +     + V   L+DMY
Sbjct: 387 EMEEEGISPDVYTVTAVLNCCAR----YRLLDEGKRVHEWIKENDLGFDIFVSNALMDMY 446

Query: 224 ARFGCVSYASAVFDEMPVRNVVSWSAMIACYAKNGKAYEALELFREMMLNTHDSVPNPVT 283
           A+ G +  A  VF EM V++++SW+ +I  Y+KN  A EAL LF  ++L      P+  T
Sbjct: 447 AKCGSMQEAELVFSEMRVKDIISWNTIIGGYSKNCYANEALSLF-NLLLEEKRFSPDERT 506

Query: 284 MVSVLQACAALAALEQGKLIHAYILRRGLDSILPVISALVTMYARCGKLELGQLVFDRMH 343
           +  VL ACA+L+A ++G+ IH YI+R G  S   V ++LV MYA+CG L L  ++FD + 
Sbjct: 507 VACVLPACASLSAFDKGREIHGYIMRNGYFSDRHVANSLVDMYAKCGALLLAHMLFDDIA 566

Query: 344 KRDVVLWNSLISSYGVHGYGRKAIEIFEEMIDHGVSPSYISFVSVLGACSHAGLVEEGKE 403
            +D+V W  +I+ YG+HG+G++AI +F +M   G+    ISFVS+L ACSH+GLV+EG  
Sbjct: 567 SKDLVSWTVMIAGYGMHGFGKEAIALFNQMRQAGIEADEISFVSLLYACSHSGLVDEGWR 626

Query: 404 LFESMVKEHGIQPSVEHYACMVDLLGRANRLDEAAKIVEDLRIEPGPKVWGSLLGACRIH 463
            F  M  E  I+P+VEHYAC+VD+L R   L +A + +E++ I P   +WG+LL  CRIH
Sbjct: 627 FFNIMRHECKIEPTVEHYACIVDMLARTGDLIKAYRFIENMPIPPDATIWGALLCGCRIH 686

Query: 464 CNVELAERASKRLFELEPTNAGNYVLLADIYAEAEMWDEVKRVRKRLDSRELQKVPGRSW 523
            +V+LAE+ ++++FELEP N G YVL+A+IYAEAE W++VKR+RKR+  R L+K PG SW
Sbjct: 687 HDVKLAEKVAEKVFELEPENTGYYVLMANIYAEAEKWEQVKRLRKRIGQRGLRKNPGCSW 746

Query: 524 IEVRRKIYSFTSVDEFNPQGEQLHALLVNLSNEMKQRGYTPQTKVVLYDLDEEEKERIVL 583
           IE++ ++  F + D  NP+ E + A L  +   M + GY+P TK  L D +E EKE  + 
Sbjct: 747 IEIKGRVNIFVAGDSSNPETENIEAFLRKVRARMIEEGYSPLTKYALIDAEEMEKEEALC 806

Query: 584 GHSEKLAVAFGLINTSKGDTIRITKNLRLCEDCHSVTKFISKFADREIMVRDVNRFHHFK 643
           GHSEKLA+A G+I++  G  IR+TKNLR+C DCH + KF+SK   REI++RD NRFH FK
Sbjct: 807 GHSEKLAMALGIISSGHGKIIRVTKNLRVCGDCHEMAKFMSKLTRREIVLRDSNRFHQFK 866

Query: 644 DGVCSC 650
           DG CSC
Sbjct: 867 DGHCSC 867

BLAST of CaUC03G051180 vs. TAIR 10
Match: AT3G12770.1 (mitochondrial editing factor 22 )

HSP 1 Score: 472.6 bits (1215), Expect = 7.0e-133
Identity = 242/613 (39.48%), Postives = 385/613 (62.81%), Query Frame = 0

Query: 46  NQLIQSLCKQGNLKQALLLLSH----ESNPTQQTWELLILSAARRNSLSDGLDVQRHLVD 105
           N +I+   +  + + ALL+ S+      +P   T+  L+ + +  + L  G  V   +  
Sbjct: 88  NAIIRGYSRNNHFQDALLMYSNMQLARVSPDSFTFPHLLKACSGLSHLQMGRFVHAQVFR 147

Query: 106 GGFDQDPFLATKLINMFSELDSVDNARKVFD--KTRKRTIYVWNALFRALALAGRGSDVL 165
            GFD D F+   LI ++++   + +AR VF+     +RTI  W A+  A A  G   + L
Sbjct: 148 LGFDADVFVQNGLIALYAKCRRLGSARTVFEGLPLPERTIVSWTAIVSAYAQNGEPMEAL 207

Query: 166 ELYARMNMMGVPSDRFTYTYLLKACVASECLVSFLQKGKEIHAHILRHGYEAHVHVMTTL 225
           E++++M  M V  D   +  L+    A  CL   L++G+ IHA +++ G E    ++ +L
Sbjct: 208 EIFSQMRKMDVKPD---WVALVSVLNAFTCLQD-LKQGRSIHASVVKMGLEIEPDLLISL 267

Query: 226 VDMYARFGCVSYASAVFDEMPVRNVVSWSAMIACYAKNGKAYEALELFREMMLNTHDSVP 285
             MYA+ G V+ A  +FD+M   N++ W+AMI+ YAKNG A EA+++F EM+    D  P
Sbjct: 268 NTMYAKCGQVATAKILFDKMKSPNLILWNAMISGYAKNGYAREAIDMFHEMI--NKDVRP 327

Query: 286 NPVTMVSVLQACAALAALEQGKLIHAYILRRGLDSILPVISALVTMYARCGKLELGQLVF 345
           + +++ S + ACA + +LEQ + ++ Y+ R      + + SAL+ M+A+CG +E  +LVF
Sbjct: 328 DTISITSAISACAQVGSLEQARSMYEYVGRSDYRDDVFISSALIDMFAKCGSVEGARLVF 387

Query: 346 DRMHKRDVVLWNSLISSYGVHGYGRKAIEIFEEMIDHGVSPSYISFVSVLGACSHAGLVE 405
           DR   RDVV+W+++I  YG+HG  R+AI ++  M   GV P+ ++F+ +L AC+H+G+V 
Sbjct: 388 DRTLDRDVVVWSAMIVGYGLHGRAREAISLYRAMERGGVHPNDVTFLGLLMACNHSGMVR 447

Query: 406 EGKELFESMVKEHGIQPSVEHYACMVDLLGRANRLDEAAKIVEDLRIEPGPKVWGSLLGA 465
           EG   F  M  +H I P  +HYAC++DLLGRA  LD+A ++++ + ++PG  VWG+LL A
Sbjct: 448 EGWWFFNRMA-DHKINPQQQHYACVIDLLGRAGHLDQAYEVIKCMPVQPGVTVWGALLSA 507

Query: 466 CRIHCNVELAERASKRLFELEPTNAGNYVLLADIYAEAEMWDEVKRVRKRLDSRELQKVP 525
           C+ H +VEL E A+++LF ++P+N G+YV L+++YA A +WD V  VR R+  + L K  
Sbjct: 508 CKKHRHVELGEYAAQQLFSIDPSNTGHYVQLSNLYAAARLWDRVAEVRVRMKEKGLNKDV 567

Query: 526 GRSWIEVRRKIYSFTSVDEFNPQGEQLHALLVNLSNEMKQRGYTPQTKVVLYDLDEEEKE 585
           G SW+EVR ++ +F   D+ +P+ E++   +  + + +K+ G+       L+DL++EE E
Sbjct: 568 GCSWVEVRGRLEAFRVGDKSHPRYEEIERQVEWIESRLKEGGFVANKDASLHDLNDEEAE 627

Query: 586 RIVLGHSEKLAVAFGLINTSKGDTIRITKNLRLCEDCHSVTKFISKFADREIMVRDVNRF 645
             +  HSE++A+A+GLI+T +G  +RITKNLR C +CH+ TK ISK  DREI+VRD NRF
Sbjct: 628 ETLCSHSERIAIAYGLISTPQGTPLRITKNLRACVNCHAATKLISKLVDREIVVRDTNRF 687

Query: 646 HHFKDGVCSCGDY 653
           HHFKDGVCSCGDY
Sbjct: 688 HHFKDGVCSCGDY 693

BLAST of CaUC03G051180 vs. TAIR 10
Match: AT4G33990.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 472.2 bits (1214), Expect = 9.1e-133
Identity = 240/608 (39.47%), Postives = 379/608 (62.34%), Query Frame = 0

Query: 46  NQLIQSLCKQGNLKQALLLLSHESNPTQQTWELLILSAARRNSLSDGLDVQRHLVDGGFD 105
           N +I   C+ GN K+AL L +        T   L+ +       + G+ +  + +  G +
Sbjct: 220 NAMISGYCQSGNAKEALTLSNGLRAMDSVTVVSLLSACTEAGDFNRGVTIHSYSIKHGLE 279

Query: 106 QDPFLATKLINMFSELDSVDNARKVFDKTRKRTIYVWNALFRALALAGRGSDVLELYARM 165
            + F++ KLI++++E   + + +KVFD+   R +  WN++ +A  L  +    + L+  M
Sbjct: 280 SELFVSNKLIDLYAEFGRLRDCQKVFDRMYVRDLISWNSIIKAYELNEQPLRAISLFQEM 339

Query: 166 NMMGVPSDRFTYTYLLKACVASECLVSFLQKGKEIHAHILRHG-YEAHVHVMTTLVDMYA 225
            +  +  D  T   L  A + S+  +  ++  + +    LR G +   + +   +V MYA
Sbjct: 340 RLSRIQPDCLTLISL--ASILSQ--LGDIRACRSVQGFTLRKGWFLEDITIGNAVVVMYA 399

Query: 226 RFGCVSYASAVFDEMPVRNVVSWSAMIACYAKNGKAYEALELFREMMLNTHDSVPNPVTM 285
           + G V  A AVF+ +P  +V+SW+ +I+ YA+NG A EA+E++  +M    +   N  T 
Sbjct: 400 KLGLVDSARAVFNWLPNTDVISWNTIISGYAQNGFASEAIEMY-NIMEEEGEIAANQGTW 459

Query: 286 VSVLQACAALAALEQGKLIHAYILRRGLDSILPVISALVTMYARCGKLELGQLVFDRMHK 345
           VSVL AC+   AL QG  +H  +L+ GL   + V+++L  MY +CG+LE    +F ++ +
Sbjct: 460 VSVLPACSQAGALRQGMKLHGRLLKNGLYLDVFVVTSLADMYGKCGRLEDALSLFYQIPR 519

Query: 346 RDVVLWNSLISSYGVHGYGRKAIEIFEEMIDHGVSPSYISFVSVLGACSHAGLVEEGKEL 405
            + V WN+LI+ +G HG+G KA+ +F+EM+D GV P +I+FV++L ACSH+GLV+EG+  
Sbjct: 520 VNSVPWNTLIACHGFHGHGEKAVMLFKEMLDEGVKPDHITFVTLLSACSHSGLVDEGQWC 579

Query: 406 FESMVKEHGIQPSVEHYACMVDLLGRANRLDEAAKIVEDLRIEPGPKVWGSLLGACRIHC 465
           FE M  ++GI PS++HY CMVD+ GRA +L+ A K ++ + ++P   +WG+LL ACR+H 
Sbjct: 580 FEMMQTDYGITPSLKHYGCMVDMYGRAGQLETALKFIKSMSLQPDASIWGALLSACRVHG 639

Query: 466 NVELAERASKRLFELEPTNAGNYVLLADIYAEAEMWDEVKRVRKRLDSRELQKVPGRSWI 525
           NV+L + AS+ LFE+EP + G +VLL+++YA A  W+ V  +R     + L+K PG S +
Sbjct: 640 NVDLGKIASEHLFEVEPEHVGYHVLLSNMYASAGKWEGVDEIRSIAHGKGLRKTPGWSSM 699

Query: 526 EVRRKIYSFTSVDEFNPQGEQLHALLVNLSNEMKQRGYTPQTKVVLYDLDEEEKERIVLG 585
           EV  K+  F + ++ +P  E+++  L  L  ++K  GY P  + VL D++++EKE I++ 
Sbjct: 700 EVDNKVEVFYTGNQTHPMYEEMYRELTALQAKLKMIGYVPDHRFVLQDVEDDEKEHILMS 759

Query: 586 HSEKLAVAFGLINTSKGDTIRITKNLRLCEDCHSVTKFISKFADREIMVRDVNRFHHFKD 645
           HSE+LA+AF LI T    TIRI KNLR+C DCHSVTKFISK  +REI+VRD NRFHHFK+
Sbjct: 760 HSERLAIAFALIATPAKTTIRIFKNLRVCGDCHSVTKFISKITEREIIVRDSNRFHHFKN 819

Query: 646 GVCSCGDY 653
           GVCSCGDY
Sbjct: 820 GVCSCGDY 822

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038895613.10.0e+0097.09pentatricopeptide repeat-containing protein At3g46790, chloroplastic [Benincasa ... [more]
XP_022141898.10.0e+0094.48pentatricopeptide repeat-containing protein At3g46790, chloroplastic [Momordica ... [more]
KAA0067772.10.0e+0092.94pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa][more]
XP_008457445.10.0e+0092.79PREDICTED: pentatricopeptide repeat-containing protein At3g46790, chloroplastic ... [more]
XP_022964690.10.0e+0093.25pentatricopeptide repeat-containing protein At3g46790, chloroplastic [Cucurbita ... [more]
Match NameE-valueIdentityDescription
Q9STF36.5e-28576.50Pentatricopeptide repeat-containing protein At3g46790, chloroplastic OS=Arabidop... [more]
Q9LIC32.6e-13239.41Putative pentatricopeptide repeat-containing protein At3g13770, mitochondrial OS... [more]
Q9SN392.6e-13242.86Pentatricopeptide repeat-containing protein DOT4, chloroplastic OS=Arabidopsis t... [more]
Q9LTV89.8e-13239.48Pentatricopeptide repeat-containing protein At3g12770 OS=Arabidopsis thaliana OX... [more]
O817671.3e-13139.47Pentatricopeptide repeat-containing protein At4g33990 OS=Arabidopsis thaliana OX... [more]
Match NameE-valueIdentityDescription
A0A6J1CLW00.0e+0094.48pentatricopeptide repeat-containing protein At3g46790, chloroplastic OS=Momordic... [more]
A0A5A7VHC30.0e+0092.94Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946... [more]
A0A5D3BEB80.0e+0092.79Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946... [more]
A0A1S3C5N60.0e+0092.79pentatricopeptide repeat-containing protein At3g46790, chloroplastic OS=Cucumis ... [more]
A0A6J1HLJ10.0e+0093.25pentatricopeptide repeat-containing protein At3g46790, chloroplastic OS=Cucurbit... [more]
Match NameE-valueIdentityDescription
AT3G46790.14.6e-28676.50Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT3G13770.11.8e-13339.41Pentatricopeptide repeat (PPR) superfamily protein [more]
AT4G18750.11.8e-13342.86Pentatricopeptide repeat (PPR) superfamily protein [more]
AT3G12770.17.0e-13339.48mitochondrial editing factor 22 [more]
AT4G33990.19.1e-13339.47Tetratricopeptide repeat (TPR)-like superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Watermelon (USVL246-FR2) v1
Date Performed: 2022-01-31
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR032867DYW domainPFAMPF14432DYW_deaminasecoord: 519..643
e-value: 8.1E-40
score: 135.6
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 348..381
e-value: 2.8E-7
score: 28.3
coord: 245..271
e-value: 3.5E-6
score: 24.8
coord: 384..417
e-value: 7.5E-5
score: 20.6
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 216..243
e-value: 0.02
score: 15.1
coord: 141..170
e-value: 0.014
score: 15.6
coord: 245..271
e-value: 8.9E-8
score: 31.9
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 345..392
e-value: 1.4E-8
score: 34.8
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 138..172
score: 9.13082
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 243..277
score: 10.796938
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 381..416
score: 9.097937
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 346..380
score: 12.495939
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 193..302
e-value: 3.9E-21
score: 77.2
coord: 303..399
e-value: 2.5E-20
score: 74.6
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 400..543
e-value: 2.3E-13
score: 52.3
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 42..192
e-value: 5.1E-21
score: 77.4
IPR011990Tetratricopeptide-like helical domain superfamilySUPERFAMILY48452TPR-likecoord: 228..505
IPR006694Fatty acid hydroxylasePFAMPF04116FA_hydroxylasecoord: 802..897
e-value: 9.3E-21
score: 74.6
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 8..23
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 1..23
NoneNo IPR availablePANTHERPTHR24015:SF96OS01G0848300 PROTEINcoord: 42..642
NoneNo IPR availablePANTHERPTHR24015OS07G0578800 PROTEIN-RELATEDcoord: 42..642

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CaUC03G051180.1CaUC03G051180.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008610 lipid biosynthetic process
biological_process GO:0009451 RNA modification
cellular_component GO:0043231 intracellular membrane-bounded organelle
molecular_function GO:0005506 iron ion binding
molecular_function GO:0016491 oxidoreductase activity
molecular_function GO:0005515 protein binding
molecular_function GO:0003723 RNA binding
molecular_function GO:0008270 zinc ion binding