CmUC03G066790.1 (mRNA) Watermelon (USVL531) v1

Overview
Name: CmUC03G066790.1
Type: mRNA
Organism: Citrullus mucosospermus (Watermelon (USVL531) v1)
Description: Pentatricopeptide repeat-containing protein
Location: CmU531Chr03: 30874746 .. 30889062 (+)
Sequence length: 3144
RNA-Seq Expression: CmUC03G066790.1
Synteny: CmUC03G066790.1
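
The Location field above can be split into its parts and turned into a genomic span with a few lines of Python. This is a minimal sketch, not part of the database itself: the function names `parse_location` and `span_length` are hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive (the usual GFF convention), which the page does not state.

```python
import re


def parse_location(text):
    """Split a 'seqid: start .. end (strand)' string into its four parts."""
    pattern = r"\s*([^\s:]+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand


def span_length(start, end):
    """Number of bases covered, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end, strand = parse_location("CmU531Chr03: 30874746 .. 30889062 (+)")
print(seqid, strand, span_length(start, end))
```

Under that assumption the feature spans 14317 bases on the plus strand of CmU531Chr03. The listed sequence length of 3144 is much shorter, which would be consistent with it referring to the spliced transcript rather than the genomic span.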
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCTTCACTTCTTGCCCTTCATCCCACTGCATCTCTTCCCAACTCCCCAAAATTTCACCCATCGCCCATCTTCCACGCCCTCAATTCATGCTCATCAATGGCTGAGCTCAAGCAATTTCAGTCCCAAATCATTCGTTTTGGTCTCTCTACTGACAATGATGCCATCGGCCGTCTCATCAAATTTTGTGCTGTTTCCAAGTATGGAGATCTTCACTATGCTCTTCTTTTATTCAACTCAATACCTTACCCAGATGCTTTCATTTACAATACTTTAATTAGAGCTTACTTACAGTTCCAATCCCCTAAATCTTCCTTACTTTTGTATTTGCAAATGCTTCATAACTCTGTCTTTCCCAATAAATTCACATTCCCTTCTTTAATTCGTGCTTGCTGTATTGATAATGCTGTCAAAGAAGGGAAACAAATTCATGCCCATGTTGTTAAATTTGGTTTCACAACTGATAGATTTTCGAACAACAATTTAATTCATATGTATGCTAATTTTCAATCCTTGGAAGAAGCTAGAAGGGTGTTTGATTCTATTGAGTTACCTGATCTTGTAGCTTGGACTACTTTGGTTACTGGGTATTCTCAATTGGGATTTGTAGATGAAGCTTTACGAGTTTTCGAGTCGATGCCTGAACATAACTCTGTTTCTTGGAATGCTATGATTTCTTGTTTTGTTCAAAACAATAGATTTCATGAGGCATTTGGTTTGTTTAATAGGATGAGATTAGAGAAGGTTGTTTTGGACAAATATGTGGCTGCTAGTATGTTATCAGCTTGTACAGGATTAGGAGCACTCGAACAAGGGAAATGGATACATAGATACATTAAGAAAAATGGGATTGAATTGGATTCAAAACTTGCAACTACTTTGATTGATATGTATTGTAAATGTGGTTGTTTGGACTGTGCTTTTCAAGTGTTTACTCATTTACCTGAAAAGGGGATTTCTTCATGGAATTGTATGATTGGAGGGATGGCTATGCATGGGAAAGGAGCGGCAGCTATTGAACTTTTTAAAGAGATGGAAACCAAAATGGTGAAACCAGACAACATAACTTTCCTTAATGTACTAAGTGCTTGTGCTCACTCTGGGTTAGTCGAAAATGGTCAATACTATTTCTGTCGCTTTACTCAAGTTTACGGTATTGAACCCGGAACCGAGCATTATGGATGCATGGTTGATTTGTACGGACGAGCCGGGATGTTGGAGGAAGCAATGAAGGTCATAGATGAGATGCCCATGAGTCCTGATGTAGGTGCGTTAGGTGCCTTTGTTGGAGCTTGTAAAATTCATGGGAACATAGAGTTGGGAGAGGAAATAGGGAAGAGAGTAATAGAATTAGAGCCTACGAATAGTGGGCGTTACGTTCTACTGGGAAATCTATACGCCGAGGCAGGTAGATGGGAAAGTGTTGCAGAAGTAAGAAAGTTAATGAATGATAGAGAAGTGAAGAAGGCTGCTGGAGTTTCCATTATTGAATTGGAAGGTGTTGTGTATGAATTTATTGCAGGAGGAAGAGTTCATCCTGAAGCAAAGGAAATTTATGATAAACTTAATGAGATGTTAGAATGTATAAGAATTGAAGGATATGTAGCAGAGAATGAAACTGAAGAGGAAAAGGATAATCCTGTTTATTACCATAGTGAGAAACTGGCAATTGCTTTTGGGTTGCTCAAAACTAAAGCAGGGGAAATGCTTAGAATTACTAAGAATCTGAGGGTTTGTAAGGACTGTCACCAAGCTTTGAAGCTTGTTTCCAAGGTTTTCCAAAGAAAAATCATTGTAAGAGATAGAAATCGTTTCCATCATTTTGCTAATGGAGAGTGTTCTTGTAATGATTATTGGTAAAGAAAATATCAACTCATCTCCCCTTATCAATTCTTTGTGATTCTTTTAAGATTGCTAAGCTGTAAATGTTACCTATATTCTGTCAAACGTTGGAGTATTACTATTGACCTTTTCATAAACATTCAAAACTATCATTGAGT
TGAAATTATTTAGAATTTGGAAATCAGGACTTGAAAGATGTGGTGGTGACCTAGAACGGTGTGGTGATTTAAATTTTCTTAGATCATCGCAATTTCTAAAAAATCTTAATAACACAAATCTTTTCGTAATTAGCTAAGTATGACAATGACAATACAATCCTTCGTCATTAATCTCTACTGCTCTCTTATCACTCTTGTGCATAATCGAACAGTAGTCTAACATGCGAGATCTTGCCTAAAATCCTCCCTTTTGAACAAGACAATAAATAATAAATCAATCACCATTTTAGATGTGGATTTAGAGACGATGGAATCTCTAAGATAAAAATTCAAAGAGAATTCAAAAACAACATATCTAATGACTTTTCAAACTCATGCGAATGAAACACTCTTTGAAAACTAAAAGTACTTCAATGATCAAACAACTTCAAATAAGCTAAATCAATATCGAGAACAAGCTTCAAAGCTCTCAAGACAAACAAGCTGAAAAACCCTTCGAGACAAGTTTACTTTGAGAGAGAGATTCAGAGGATAAATGTTAAAGATTGTACTACTCCTACATACAAAATCAATACAAATTCAAGTTCATTATACAAATTAAATTCCTTTGGTGAAATCTTGTGTGAGCAAACATGTTTGACATATTTAAAAAATTGATGGGCTTTGTAAACAAAAATTTGAAATTCAATAACTTTTGCGTCTAATATGTTAGTAAACTTTGTTTTTTAAAAAAAAGAAAAAAATTAGAGACTTTTTAGATTCAAAATCGAAAATTTGGAGACTTGACGCTTAGAAAGTGAGAGAATCAAATGTGTTTGGAATACATTTTCAAGTGTATAATTTTAAAAATAAGTCATTTTGGAAAAAAAAATGGAGTGTTTGGCAACCACCAAAAATGAATTTTGAAGTGTATTCTGAACGGTTTTTATCAAAAGAGTTTAAAAAAAAATGAGTTTTTTGAAAAATATATTTTTTTCTTAAATCAATCCAAACGAGCTTTATTTAACACCAACTATATATATATATATAGAACATTGATATTGATGAAATTTTTTCATTAAAAAAAAAATGATAATATATTGAATGTTTAACAAAAGTATACATTCAAAACTAACATCCACTTGCTTCAAAAAACAAAACAAAACATGCACCATAACTTTACATAAAAATAATAATATATTTTTAAAAATTTTAAAAAAAAAATTTAAAAAAAAAAAAAAAAAAACTTGCATAGTAACTATGCGTTTGACTTTTGAAATGAATTGAGAGCTGTTCTGGATATTCCCATGTTTAAGTGAATATTCTGTTAATTCAATGAATTTTTTCCTATGTTTGGATTAAAAAAAAAAAAAAAACATATATATTCTTTCTTGCGCATTATTTTCATTAATTCTTCGTTCTTGTCAAGTTTTTTTGGTTGATATTGTATGTCTTTTTGCTTGGATTAAAAACCATATCTTGGTTGTGTGTTAGGCATAATTTTAATAATAGATCTATGATATTAATGGTGTTTGACATGCATTCATTCTCTTAAATAATTGATTCCACATCTCCATAATTGTTGTAAATCGATATCAATTGTTAAACCAAAAACATATATAACTCAGCTTTCTTACACTCAAAATTATCACAAAAACATCTATATACCTTTTCAACTACTTTATTTTTTTTTAGATTTATTTTGTTATGGATGAAATTCTCGAATAATTACATGTAGGAATTTGACAATATTATAAGAGCAAGAGCACTCCCATATAAAAAAAAGGGTTTAAAAAAGAGATGTAATTCAAGAACTATACAAAAGGAGATTAGGGTTTGGTTTCATATCATTCCCTTAGAAATACTTTAAGTTTTGATATCTTAAGTTTAGTTATTTTTTCAATATTATTATTATTATTATTATTTGTATTCTCTAGTCTTAGAGTACATGAAATAGTGCAAGTGGGTACAAAAGTTTTGAGAGAATGTTCTTGTGTAAATTTTTGTTTCACATTTTTTTA
ATTTCGTGATTTTTGCCTATTCCTAGGTAGAAGTGAGATTTTCTCAACAAGTGGTATCAGAAGTTTAGGTTTATAGTTTCTTGAATTTCTAAGTATGTATTATGGTTGCAATATTATTTAAATTTTCCTAGACCACCCGCTATACGATATTGTCCGCTTTGGGTTACCCCTCACGGTTTTGTTCTTGGTAGGGTTGCGCCTACCAAAAGGTCTCGTACATAGAAAGGTATCCACCCCCTTTTATAAGGAAAGCTTCGTTCTCCTTCCTAACCAATGTGGGACAATGTATAGCCAGGGTCAACGCCATAGCCCTTACAATCTCCCCCCTAACGAGTCAAGCGTCCCCGCTTGACCTACCTGAGGTTTTAACATCGATTCGGGCACCGACTCTGATATCATTTTTAACAGCCCAAACCACCCACTATACGATATTGTCCACTTTGTTATCAAGCTTTATGAGTCCAGTATAGTTTGATTGGAAGAAGTTTGGTGAAATGTTCAACTTTGACTTGGACAACTGCAAATCAAGAATATGTTAATGCAATATGAAATACACAATGTCTAGAAGGAAAGATCTAGCAATGGTATTCTTGAAGAGTTAAGCGGTGATGATAGTCTAATAGAGTATAGGAGTGATTCCAGTACATGTTTTTAAACATAGACTAAATGTTACTTGTAAATGAAGATATGGAATTAAGCCCAATCCTCACAAAATACGATTTGGGTTCATCCAATCCTTCAAAACTAGAAAATGGCAAAGCTTACCTTTTCATCAAGACGATGAAGTTAAAATTTGACATTATTGTTCAACTCGATCACAAGGAAAAACTAAAAGGAAGGCTAAAAGATAGAGAACGTAGAAGAGAGAATCTTGGAGGGGGCTAGAAGGTCAAGGAGTCTGGACTAAATTCGAAAAAGAAGTTTTTACATCAGTACATGACCATCATGATGCTACTAAAAGCATTAATCCTTAGCACCCTGCAAAGTCGAGCAACATTGGCTTGTTTCTTCAATTCAAGAGTATCGTGACGCTACACTTCTATTTGTAGCATTGGTCTGAGTGTTGTGTAGCGTTGTTCTGATCTAAGAGAGCACTCCTAGACAATGTAGATTGTAACAAGGGTTGAAAAATAGTGGTTTTCCAAAATACTTATTTGATTTCTTTATTTCTTTTTATTGATTTTGCAAACTAATCTCTTAGGATATAATACTTGGAATAGTTCTCATATTATTGCTTATTGAGAATATGCACTTGCACTTGACCACCCACTATTCGTAATTAGTAGGCTTTAGTACCAATTCAACACACTTAACACGTTTTGCTCATCTCACAACCATTCTTTACTTTCATGCATATCAATAGAATGACTTTTTTGGTGCATTTTTTTTTTTTTTTTTTTGTATGTTTTAAGCTCGATACTATGTAAGAGATAATTCGTTAACCCGATTCAACTTCTTTTAACCTTATCCAAAGGGCAACTAAAACATCTAGGTGAATACTCATATAATTAGATTTTTTTAAGCTCATAAAGATAACTAATTATTTATTCACTAAACTTAAAGGCATATTACAACATCATTCTTTAATAGACAACTTAAACATCAAAGCATATGTGACAAGTATCACGCACCTATTGCATTGATTATTTATTTATAATAAATGATAGAAGGATATCACCTTGTTAAGTGACTATTCTCAAGTCTTAATTGCAATACTTTAGAGTGGCAATTTGGGAATACTATACAAGTATTATTATTATTATAAATACTACCAAAAAAGAAAAAGTATATATATAGTCTGTTTCTCCTTCCCACTGGATATCACGAACTATAATTTATTAAATATTTGTGGGCCAGATGCTTATGTTAATTACCCTCTAAATTAAGTATTGATTTATTATAAGATCGACCATGAATAATTTCCATTTTTCAACACATTATGTTTAATTCAATTGAATTTGTATTGTGGTCCATTAAACTAAAGTTTTTAATACTATTTT
AATTTCAGAATACATTCAATTTTATAGTGTATTAAATTATATTCTGTATGGTAAAATACTCTTTACAACATGAAAAATAAGATGATCTAAAAATAAAAACTTGACTCTGCCAATAATCACATATCCTTTTGGTAGCTATTTGATTTTTTGGTTTGTTATTTCCTTTTTTTTTTCCAATGATTTTAAAAAAACACATGCAAACTTTTAAAAACTAAAAGAATTATATTTATTTTTTTGAATTTGGCTAATAATGCAACTCTTTCACTTCATAAATTATGAAACATTTGTAATAATAAATTGAGAGAAAATATGCTTAAAATTATATATAACGTATTTGGTTTAATTTGAAGTGTAGATGGTTGACTTAAGATTTTTAGTTAAATATAATCCACAACAAACACTACTTAAATAAATTTTAAATTTAAAAACCAAAACATATGGAGTACAAAATTATGTATAAAAGTTGGAAAACAAACCTAAGATCTAAAGCAAAAGTTACCCAAATAATGAACACAATTATTGTTTGATATTGAAGTGACATATGATTAGTTTTAATTTCCCTAATTAAAAAGATAAACAAAATAAATTATTTAAACAAAGATTAGTGGCATTTCTTAGAATTCCCAATAAGATTAGGGGCAAAGCTTTGGAACTCATATTATAAATTCTCTAATGGCAGGCCAAAGAAAGATCCCTTTTCTTTTTTTCTTTCTTTCTTGTTTCCTTCTATAACTTCTAATTTCCAACTTCCATGAAAGCTTCTTCTCTTCACCAAAAATGGAGGGCAAAATCCTTCCTTTTCTCTTCCACTAAACCCTAATAGCCAAACCCCCTTTTTTTTTTTTTTTTTCAATTTCTTTAGAAAAGAAGAAATTGGAGAAAATTAGTATATTTTTAGGGGTTTTTTTTTTAATCTTTTTTTATTTCCAAAAAATATTTTTCAATGTCCAAAATCTCAGGGGGTTCTGATGATGGTGGGAGCTTTGTGTCTGATGGTACCCATGGAAGCTCTGAATCCCATCAACACCCTCAGCTTAAAGTTCCCAAGAAGAAAAGAAACTTGCCAGGAACACCAGGTAATTCTTTCTTCTTTTATTTCATTTAGGTTAAAATTACAACTTCGGTCCCTCCGATTTGAAGAAAATTAGATTTTATATTTATGATTTGATAAAACCTCATAAATAGTCTTTATGGTTTGAGAAAACCCTTATAAATACTCCCTACTATATAGAAATTAAATTATAACTTATTTCACGCTATGAGAACCAAATTTGCAATTTATCTTTATTTATCATGGTGTTGATTTGATTGATATATGACCTAAATTATGTACAATAAACAATTTTGTTTTTAGTTCTTAAAATGTTGGAGGGTAATTTTGATCCATATTTAGAGTTGTTTTTGTAAAGTTTTCTAATTTCTTTCATGTAATATGTGATACGTTATTTAAAGATATATTATTTTATTTCCTTTAACTTCACATTATTCTCAAGTGATTATTTTATGGCAAACAAAAAATAGAGAGTAATATTAGAAATTTCTTCTTCGTCACCATATTTATAAAAAGATTAGATCGATATGTTCAAAGATGTGATGTGAAATGAGATAGAAAATAGTATGAGAGAGACATGATATCTCATAGGCTATTTGAACAAAGAGACTTTATTATTGTTTTATTTTATTTTATTTTATTAGGTTATTTTTCAAATATAACAAAATAAGTCAACTTATTTAGAAATATAACAAAATGTTACGGTTATTAGTAATTACTAATAAATAATTGTGCTATATTTGAAAATATTTTCAATAATTTTATCATTTAGAACAATTAAACTATTTTATTTTTTTTTTTAAAAAAAATATTTTTGAATTTTGAAATTTGGATGGAATTTTAAAAATATTTTTGGAGGGCGGAATAGAAAATAAATAGGAAAATTTAAATATGAGAGTAGTTGGAAGGGCTTCAATTATTGGAGGGGGATTACAATGATTGCTATGTA
CTGTTTGGACTTTTCTCATGTAATGATTTTTTTTTTCTTTTTGTTTCAAAAACAATTATAATGACATTATTAAAGAAATTCTAAATTTGTTTAATTTAATTTTGGTTGTATGCCTTTCCATGAATTGATTGGCCTAATAAGTAATATTTACATTGTGCTATTTACATAAATTACCACAAATACTTGTCCTGAACTTTGAAATTGGATGGAAATGAAAATTATTTTTAAATATTTTTTTAACTTTAAAAACCCAAATTATATATGGATAATGAAGGATATTTTAAATTAAGCCCATTATTTGGGGGATTGTTATGATATTTTTAATGGTATATGATATTTATATGAACGATATATATGTTAATTAAAGAAAATAATTTAACAGGAATAATGGAACATACATTATATAAAATTTGAAATATAATATAGTTTAATAATATTAAAGGTAGGTCAGAAAACTTGGCCAACAAAATTTGACCAATCTAATTCAAACAGTGTATTCTCTGTTCATTGATAACTATTTAATTAATCTTAATTATAAATACTACTATTCTTGTATTATATTTTTTAAAACAAAAAAGAATAGTTTTTAAAACTTAAAATTTTATATCATAATTTGACTAAGAATTCTAGTATTTTTTTTAATATAAAAATAATAATTGAGAAATTTGAGAAAACAAACATAATTTGAATAACTATTTTGTTGTTGGTTTTAAGTTTTTCAAAATTCAAATTATTTCTTCTCGGTTCAATACAATGGTTTGTATTTTTCACAAGTATAAAGGCTGAATTCCTAACTAAATTCTAAAAAATAAAAACAAGTTTTTAATTTTTTTTTCTTTTTTAGTTTTCAAAAGTTGGTTTAGTTTTTAAAAACATTGGTAAAAAAGTAGATAACAAAGTAAGATGTTTCAACATGGAAAAGGTGTTTATAGGCTTATTTTTCAAAAACTAAAAACCAAAAACCAATAGTTACCAAATGGGATTAGAACTGTTTTACTTCCTATGAGGTCTCCGTCTAATAATAGTCACCCAATGAGAAGAGTAACTTATATATCTTTAATTTTAGGGATAGTTGCAAATATAACAATCATGTCTAAAGTATTAGTAGATATAGCGGAATGCAAATTAAAAACTTGTAGATATAACAAAATTTAGATTCAACTCTCGGAGTCTATTAGTGATAGACTATATCGCTAATAGAAATCTATCAGCGATAGAGTTTATCAATGATAAATTTTGCTATAGTTGCAATGCTTTAAAAATGTTGACATACACTTGGACCCCGCTTGGTAACAATTTCATTTTTTGTTTTTGTTTTTAAAAATTAAGCATGTAGACACTACTTCCACCTCCAAATTTATTCTTTTGTTGTTTAGTTTTTACCAATATTTAAGAAATCAAGCCAAATTTTGAAAACTAGAAAAAATAGCTTTTAAAATTTTGTTTTTTTTTTTTTAAATTAGACTAAGAATTCAACTATCATACTTAAGAAAGATGTCAATCATCGTAAAAAATGTGGAAGAAATAGACTTAATTTTCAAAAATAAAAACAAAAAATGAAATGATTACCAAATGGGGCCATAATTATTAGCCCCAAAAGTATTACTCATTGTAATTATCCTTAATTTTAAAGATATATGTTGAAGTTTCATATCTTTTTTATGTGAGATTTTCAATACACTCTCTCAAGACAGTGAAATTTTCATGAATATCAACATTTAAATTGTTGTATATATATGTAGATCCAGATGCGGATGTTGTATCGTTATCTCCAAAAACTCTTATGGCAACCAATCGATTCGTGTGTGAAATATGCAACAAAGGATTTCAAAGAGACCAAAACTTGCAACTGCACAGACGAGGTCACAACCTACCATGGAAGTTGAAGCAAAGGACAAGCGGGAGCGAGACAAAGAGGAGGGTGTATGTGTGCCCAGAACCGTCATGCGTGCACCACGACCCGGCGAGGGCGCTCGGAGACCTCACCGGAATCAAGAAGC
ATTTCAGTAGAAAGCATGGGGAGAAAAAGTGGAAATGTGAGAAATGCTCCAAAAGATATGCTGTTCAATCTGACTTGAAAGCTCATTCCAAAACCTGTGGCTCCAAGGAATACAAATGTGACTGTGGCACAATCTTTTCCAGGTTAATATATATTTTACCTTCACCAAAGCATATATTAATTCACCTATCCATATATAATATAAACTTAAATTTGAACTTTCAAGTTTGTGTTTATTTTATTCTTTTCAACTTTTAAATTTAGATTTTGGATTGGTTTTTCAACTCTAGTATCGTGTTTCAAAAAAGAAAATTAGTCCCTAAACTTTCATTCTCTTTAAAATTCAAATGATGTATGATTGGAAATAAAATTAAATGTTTAGGGTTTTGTCATTTATATCTCTTAAACTAATTAAGTTTTAAAACTTTCAAATTTGATAATAAATTGAAAATTTTAGGACTATTAGATATTTCTAAAACTTTAAGGATCAAATAGACACAAATTTCAAAATTAAGAGCCTACACTTATAATTTAACAACCATGTTTAAATACATAGTTCATAAGATAGGTATATATAATAAAAACAAACCATGTATTTAAAATTAAATGTAGAACGTCAGATGTGATGTTTGAAAACAAATAAAGATTAAAATAAAATAAAAGTTAATGATGAACTATTGATAGGAACTAGAATAATTTTTTTAATGCAAATTGAGAGAGTACAATATTTTAAAAGTTTATTAAGAATAAAATTGACTTATAAACTTATTGAGATTGGTTTAGTTTGAGAAAAGATTAAATCCAAGTTGATTGAATGGGAGTTTTCCTATCATTATTAAGTTAAAGTATATTTGTCTTTTCTTATTATATGTTTGTTTATATTGATATGCCAAAACTCTTTATTACATTTCATTTTATTCCTAAACTCTTTCAAATTTTGTATAAACTTTGTTCCCCATGCCTCAGTTTAATGCCAAAACCTATTTGTCTTTGGTTACTAGATGTCCATGAGCTATTAAATTAGGATGGTAAAGATTGAAACTTAAACCGGATATTTTAGGTTCCGTTTGGTAGTCATCTCACTTATTTATTTATTCTTTTTTTTTAGAAATTCAGTTACCACTTCCATCTATTAATTTTTATGTTTTGTTGTTTACATTATAGGATTGTTGTTAAAATCAAAGCCTGAAGAAAAAAAGTTTTCAAATGTTTAGATAGGAAATATGAAAACTATGGGTAAGAAATTGTGAGAAAACAAGCCCAATTTTAAAAAACTAAAAATAGAAAATCAAATTGATTATCAAATAAGGCCTTACCTATTGGACTATACATTAGGACGTTCATAGATGTAAGAATATTAAGAATAAATAAATCCATAAATTTAATCTTAAAAATATTCGCTCAAAAGTTCTTCAAATTGTATTTAGGCTATAAAATTGCAAATTTGCTCTTTACAGTTTGGAGAAAGTTAGAATGATCTAGTACATATGATTTATAATTAAAATTTAGTTTATGTAATTTGATAAATTCCTCATAAATAGCTCATACTAAATTATTTATAAGAATTTCATCAAACTATAGGGATTATTTATGAAATTTATCATACTACGAGGACTAAATTCTAAACTTTAAAAACCATAAAGATTAAGTTAACCATAGTTACGTACATTCATAACCTACTTGTTACCATATTTGACACAAAAATTTAAGATAAATAACTCAAACCAAACAAATTCAATCACCACTCCTAAACATAAAATTTGTCACCCCAAAGATTTAGTGGCATTTATATTATAAACATTAGTTAAATCCAATCCTCAAATATTCATATTTTATTGTTGGAGGTGGAGCTATATATCAATTAAACCTAATTCTTGAAAATCCCTATTTTGGAGCTTTCGAGATGATCAATTAATAGTGATTAACTAGGCAGGAATTTATTTTTTAATTTAAATGTTTGCAAAATTATTTCTTCTGATGTGTGCAAAATCCATCCACTGA
TTTTGTTTAGTTTTACAGAATAAAATTGAAAATTGTGGATTGGATTGGGAGGAACTAAAAAAATTATTGGTTTGGATAAGTTTTTTATTGAAAGAACCAATTAAAAATTGAACGGAATCGAAGATATATTTACACCCTTACAAATAAACTCTCTAATTTTATGGTTTGGTTATTTTGAAAAGTTGTAAAGAAAAGAATATGAAAAATACAATTTCAAAATTTGATCTATAAAAAACTAACATCGTCCAAACTTAGTAGTAAACATGAAGTATGATCTTGATAGAGGGCTAAGAGTTCATGAATTCAATCAAGGTGGGGGTTTCCTTGAAATTCATATATTGTAGGGTCAAGTGGATTGTGAGATGCGCATAAGCTGATTTGTACACTAATAAATAAATAAAATAATAATAATAAGCATAATTTTAATTTAAAAATAAAAAATAAAAAAAAGACTAATTCAAAACAAAAGTGAAAATAAAAATTTAAAAAAAATAGAAGAAAAATAAAACAAATGACAAACAGCAAGAACCAAACCAAAAATAAAGTGAAGGAAATAAAAACCAAATCCATTTTTATTCTAACATTGGTTCCGATTTTACTCCATGATAAGATAAATAATAACACTAACTATATTTTAAACAAAAACTTGTCACACTCATCTTTAGGTTGCATTATTATAACATCTCTTAATTTATTTTTTTTATTTTTTATTTTTTTGCAGAAGAGATAGCTTTATCACCCACAGAGCATTTTGTGATGCCTTGACAGAAGAAAACAACAAATTAGTATCTCATCAAGTAGCAACAACAATGGCTTCCACCGCCATTAATGCGCCTTCCTTTCAACCTCAACCACTTCAACATCTTCTTTCTCAAACCCCCGTCCTTTCGCCGCCATTAACGCTGCCCCACGACCTTATGCCTATTCCTCCAAAACCCTTAAATCTATCGGCTGGTCCCATGTTTTCATCTTCCATTTCTTCCGCCACCGATGGCCACCACTTTCCCTCCCCCTCTGCCCTCATGTCCGCCACCGCTCTGCTGCAGAAGGCGGCACAGATGGGAGCTGCCGTAAGTAGCCGTGGAAACTCCACTCCTTGTTTGAGTTCGCCCATGGTCCATGAGAAGGTGTTGAGTAATACTACTCTGGAACTCGAAAGTTTAAAGAGATAGATTAATTGCTTTAAGTAGGAGCTTTTAGTTAAATGAAGTATTTTTTTATTAAATCAATAACTTTAACCCCATTTCATAATGGTTTGGTTTTTAGTCTTAGCTTATAAACATTTCTTTTTTTCTAAAAACCTATTCCTTAACTATTTCGTTTTATATTTTTAGTTTTTTAAAATTAAACCTATTTACTCTATGCTTCTTACAATGATAATTTGCATAGTACAATAGTTAGATTCTTAGCCAAATTCTAAATTTTAGATTTGGATTGGTTTTTAAAGCATTGTTAAAAACTAAGAAGAAATTTGAAGGTGGAAGTAGTGTTCAAACGGGGCTTAAAAACTCAAGTCCAAATTTTGAAAACTAAAAAAAAAAAAATATTTTTAGGAAGGATAAATTTTTTGAATGGAAAAACTACTTGAAATATTTAAAACATAAAACATATAGCAAAATGTTACTATCTATACGAAAAACAATGATATTTTGCTATATTTGTAAATAATTCTTTTAAAAATTTATTTTTATTTAGAATTTAGCTAAGAATTTAATTTTTTTTAACCGGGTTTTGTGGTGATACAAACATATAATTGAGTGTTATGATACATATAAAATGTAGTATTGAACTAAAGTGTTTTGTTATGTCATTATATTCATATTGTGGACAGAAGGGTTTTGTTACAACCATGGCTCCTTCTTCTTTCTGTGGGATATTGGGTACCAACTGTTTGCAAAAATGTCAAGAAGAAAACATTTTGTCTCAACTACCTTCCAAGGGCAAAGCTGTGGACATGGAGATGATCGACAACATTCCGATGCTGAATGGAGTGCTCGATG
ATCAGCCGGTCGTGGAGGCGGCGAGGAAGACGATGACATTAGATTTGTTGGGGGCTGAGGGAGGAAAAGGGATGAAATTTCAAGCACAAGAAAATGTGGGGTTTGGAGGACTTGTGGAGAATATGACTTATTTTAGAAACCAAGGAGCCATGGGAAGGCAGATTTGGGAGATTTGAATATATATATATATATATATATATATATATATTTAATTCTATGCTCTAAAGAAAACCCAAAGTCATATATATAAGAATAATAATAATAAATATATATATATATAGAATATTATATGAATATAATTAATACACATAATTTAA

mRNA sequence

ATGGCTTCACTTCTTGCCCTTCATCCCACTGCATCTCTTCCCAACTCCCCAAAATTTCACCCATCGCCCATCTTCCACGCCCTCAATTCATGCTCATCAATGGCTGAGCTCAAGCAATTTCAGTCCCAAATCATTCGTTTTGGTCTCTCTACTGACAATGATGCCATCGGCCGTCTCATCAAATTTTGTGCTGTTTCCAAGTATGGAGATCTTCACTATGCTCTTCTTTTATTCAACTCAATACCTTACCCAGATGCTTTCATTTACAATACTTTAATTAGAGCTTACTTACAGTTCCAATCCCCTAAATCTTCCTTACTTTTGTATTTGCAAATGCTTCATAACTCTGTCTTTCCCAATAAATTCACATTCCCTTCTTTAATTCGTGCTTGCTGTATTGATAATGCTGTCAAAGAAGGGAAACAAATTCATGCCCATGTTGTTAAATTTGGTTTCACAACTGATAGATTTTCGAACAACAATTTAATTCATATGTATGCTAATTTTCAATCCTTGGAAGAAGCTAGAAGGGTGTTTGATTCTATTGAGTTACCTGATCTTGTAGCTTGGACTACTTTGGTTACTGGGTATTCTCAATTGGGATTTGTAGATGAAGCTTTACGAGTTTTCGAGTCGATGCCTGAACATAACTCTGTTTCTTGGAATGCTATGATTTCTTGTTTTGTTCAAAACAATAGATTTCATGAGGCATTTGGTTTGTTTAATAGGATGAGATTAGAGAAGGTTGTTTTGGACAAATATGTGGCTGCTAGTATGTTATCAGCTTGTACAGGATTAGGAGCACTCGAACAAGGGAAATGGATACATAGATACATTAAGAAAAATGGGATTGAATTGGATTCAAAACTTGCAACTACTTTGATTGATATGTATTGTAAATGTGGTTGTTTGGACTGTGCTTTTCAAGTGTTTACTCATTTACCTGAAAAGGGGATTTCTTCATGGAATTGTATGATTGGAGGGATGGCTATGCATGGGAAAGGAGCGGCAGCTATTGAACTTTTTAAAGAGATGGAAACCAAAATGGTGAAACCAGACAACATAACTTTCCTTAATGTACTAAGTGCTTGTGCTCACTCTGGGTTAGTCGAAAATGGTCAATACTATTTCTGTCGCTTTACTCAAGTTTACGGTATTGAACCCGGAACCGAGCATTATGGATGCATGGTTGATTTGTACGGACGAGCCGGGATGTTGGAGGAAGCAATGAAGGTCATAGATGAGATGCCCATGAGTCCTGATGTAGGTGCGTTAGGTGCCTTTGTTGGAGCTTGTAAAATTCATGGGAACATAGAGTTGGGAGAGGAAATAGGGAAGAGAGTAATAGAATTAGAGCCTACGAATAGTGGGCGTTACGTTCTACTGGGAAATCTATACGCCGAGGCAGGTAGATGGGAAAGTGTTGCAGAAGTAAGAAAGTTAATGAATGATAGAGAAGTGAAGAAGGCTGCTGGAGTTTCCATTATTGAATTGGAAGGTGTTGTGTATGAATTTATTGCAGGAGGAAGAGTTCATCCTGAAGCAAAGGAAATTTATGATAAACTTAATGAGATGTTAGAATGTATAAGAATTGAAGGATATGTAGCAGAGAATGAAACTGAAGAGGAAAAGGATAATCCTGTTTATTACCATAGTGAGAAACTGGCAATTGCTTTTGGGTTGCTCAAAACTAAAGCAGGGGAAATGCTTAGAATTACTAAGAATCTGAGGGTTTGGGGTTCTGATGATGGTGGGAGCTTTGTGTCTGATGGTACCCATGGAAGCTCTGAATCCCATCAACACCCTCAGCTTAAAGTTCCCAAGAAGAAAAGAAACTTGCCAGGAACACCAGATCCAGATGCGGATGTTGTATCGTTATCTCCAAAAACTCTTATGGCAACCAATCGATTCGTGTGTGAAATATGCAACAAAGGATTTCAAAGAGACCAAAACTTGCAACTGCACAGACGAGGTCACAACCTACCATGGAAGTTGAAGCA
AAGGACAAGCGGGAGCGAGACAAAGAGGAGGGTGTATGTGTGCCCAGAACCGTCATGCGTGCACCACGACCCGGCGAGGGCGCTCGGAGACCTCACCGGAATCAAGAAGCATTTCAGTAGAAAGCATGGGGAGAAAAAGTGGAAATGTGAGAAATGCTCCAAAAGATATGCTGTTCAATCTGACTTGAAAGCTCATTCCAAAACCTGTGGCTCCAAGGAATACAAATGTGACTGTGGCACAATCTTTTCCAGAAGAGATAGCTTTATCACCCACAGAGCATTTTGTGATGCCTTGACAGAAGAAAACAACAAATTAGTATCTCATCAAGTAGCAACAACAATGGCTTCCACCGCCATTAATGCGCCTTCCTTTCAACCTCAACCACTTCAACATCTTCTTTCTCAAACCCCCGTCCTTTCGCCGCCATTAACGCTGCCCCACGACCTTATGCCTATTCCTCCAAAACCCTTAAATCTATCGGCTGGTCCCATGTTTTCATCTTCCATTTCTTCCGCCACCGATGGCCACCACTTTCCCTCCCCCTCTGCCCTCATGTCCGCCACCGCTCTGCTGCAGAAGGCGGCACAGATGGGAGCTGCCGTAAGTAGCCGTGGAAACTCCACTCCTTGTTTGAGTTCGCCCATGGTCCATGAGAAGAAGGGTTTTGTTACAACCATGGCTCCTTCTTCTTTCTGTGGGATATTGGGTACCAACTGTTTGCAAAAATGTCAAGAAGAAAACATTTTGTCTCAACTACCTTCCAAGGGCAAAGCTGTGGACATGGAGATGATCGACAACATTCCGATGCTGAATGGAGTGCTCGATGATCAGCCGGTCGTGGAGGCGGCGAGGAAGACGATGACATTAGATTTGTTGGGGGCTGAGGGAGGAAAAGGGATGAAATTTCAAGCACAAGAAAATGTGGGGTTTGGAGGACTTGTGGAGAATATGACTTATTTTAGAAACCAAGGAGCCATGGGAAGGCAGATTTGGGAGATTTGAATATATATATATATATATATATATATATATATTTAATTCTATGCTCTAAAGAAAACCCAAAGTCATATATATAAGAATAATAATAATAAATATATATATATATAGAATATTATATGAATATAATTAATACACATAATTTAA

Coding sequence (CDS)

ATGGCTTCACTTCTTGCCCTTCATCCCACTGCATCTCTTCCCAACTCCCCAAAATTTCACCCATCGCCCATCTTCCACGCCCTCAATTCATGCTCATCAATGGCTGAGCTCAAGCAATTTCAGTCCCAAATCATTCGTTTTGGTCTCTCTACTGACAATGATGCCATCGGCCGTCTCATCAAATTTTGTGCTGTTTCCAAGTATGGAGATCTTCACTATGCTCTTCTTTTATTCAACTCAATACCTTACCCAGATGCTTTCATTTACAATACTTTAATTAGAGCTTACTTACAGTTCCAATCCCCTAAATCTTCCTTACTTTTGTATTTGCAAATGCTTCATAACTCTGTCTTTCCCAATAAATTCACATTCCCTTCTTTAATTCGTGCTTGCTGTATTGATAATGCTGTCAAAGAAGGGAAACAAATTCATGCCCATGTTGTTAAATTTGGTTTCACAACTGATAGATTTTCGAACAACAATTTAATTCATATGTATGCTAATTTTCAATCCTTGGAAGAAGCTAGAAGGGTGTTTGATTCTATTGAGTTACCTGATCTTGTAGCTTGGACTACTTTGGTTACTGGGTATTCTCAATTGGGATTTGTAGATGAAGCTTTACGAGTTTTCGAGTCGATGCCTGAACATAACTCTGTTTCTTGGAATGCTATGATTTCTTGTTTTGTTCAAAACAATAGATTTCATGAGGCATTTGGTTTGTTTAATAGGATGAGATTAGAGAAGGTTGTTTTGGACAAATATGTGGCTGCTAGTATGTTATCAGCTTGTACAGGATTAGGAGCACTCGAACAAGGGAAATGGATACATAGATACATTAAGAAAAATGGGATTGAATTGGATTCAAAACTTGCAACTACTTTGATTGATATGTATTGTAAATGTGGTTGTTTGGACTGTGCTTTTCAAGTGTTTACTCATTTACCTGAAAAGGGGATTTCTTCATGGAATTGTATGATTGGAGGGATGGCTATGCATGGGAAAGGAGCGGCAGCTATTGAACTTTTTAAAGAGATGGAAACCAAAATGGTGAAACCAGACAACATAACTTTCCTTAATGTACTAAGTGCTTGTGCTCACTCTGGGTTAGTCGAAAATGGTCAATACTATTTCTGTCGCTTTACTCAAGTTTACGGTATTGAACCCGGAACCGAGCATTATGGATGCATGGTTGATTTGTACGGACGAGCCGGGATGTTGGAGGAAGCAATGAAGGTCATAGATGAGATGCCCATGAGTCCTGATGTAGGTGCGTTAGGTGCCTTTGTTGGAGCTTGTAAAATTCATGGGAACATAGAGTTGGGAGAGGAAATAGGGAAGAGAGTAATAGAATTAGAGCCTACGAATAGTGGGCGTTACGTTCTACTGGGAAATCTATACGCCGAGGCAGGTAGATGGGAAAGTGTTGCAGAAGTAAGAAAGTTAATGAATGATAGAGAAGTGAAGAAGGCTGCTGGAGTTTCCATTATTGAATTGGAAGGTGTTGTGTATGAATTTATTGCAGGAGGAAGAGTTCATCCTGAAGCAAAGGAAATTTATGATAAACTTAATGAGATGTTAGAATGTATAAGAATTGAAGGATATGTAGCAGAGAATGAAACTGAAGAGGAAAAGGATAATCCTGTTTATTACCATAGTGAGAAACTGGCAATTGCTTTTGGGTTGCTCAAAACTAAAGCAGGGGAAATGCTTAGAATTACTAAGAATCTGAGGGTTTGGGGTTCTGATGATGGTGGGAGCTTTGTGTCTGATGGTACCCATGGAAGCTCTGAATCCCATCAACACCCTCAGCTTAAAGTTCCCAAGAAGAAAAGAAACTTGCCAGGAACACCAGATCCAGATGCGGATGTTGTATCGTTATCTCCAAAAACTCTTATGGCAACCAATCGATTCGTGTGTGAAATATGCAACAAAGGATTTCAAAGAGACCAAAACTTGCAACTGCACAGACGAGGTCACAACCTACCATGGAAGTTGAAGCA
AAGGACAAGCGGGAGCGAGACAAAGAGGAGGGTGTATGTGTGCCCAGAACCGTCATGCGTGCACCACGACCCGGCGAGGGCGCTCGGAGACCTCACCGGAATCAAGAAGCATTTCAGTAGAAAGCATGGGGAGAAAAAGTGGAAATGTGAGAAATGCTCCAAAAGATATGCTGTTCAATCTGACTTGAAAGCTCATTCCAAAACCTGTGGCTCCAAGGAATACAAATGTGACTGTGGCACAATCTTTTCCAGAAGAGATAGCTTTATCACCCACAGAGCATTTTGTGATGCCTTGACAGAAGAAAACAACAAATTAGTATCTCATCAAGTAGCAACAACAATGGCTTCCACCGCCATTAATGCGCCTTCCTTTCAACCTCAACCACTTCAACATCTTCTTTCTCAAACCCCCGTCCTTTCGCCGCCATTAACGCTGCCCCACGACCTTATGCCTATTCCTCCAAAACCCTTAAATCTATCGGCTGGTCCCATGTTTTCATCTTCCATTTCTTCCGCCACCGATGGCCACCACTTTCCCTCCCCCTCTGCCCTCATGTCCGCCACCGCTCTGCTGCAGAAGGCGGCACAGATGGGAGCTGCCGTAAGTAGCCGTGGAAACTCCACTCCTTGTTTGAGTTCGCCCATGGTCCATGAGAAGAAGGGTTTTGTTACAACCATGGCTCCTTCTTCTTTCTGTGGGATATTGGGTACCAACTGTTTGCAAAAATGTCAAGAAGAAAACATTTTGTCTCAACTACCTTCCAAGGGCAAAGCTGTGGACATGGAGATGATCGACAACATTCCGATGCTGAATGGAGTGCTCGATGATCAGCCGGTCGTGGAGGCGGCGAGGAAGACGATGACATTAGATTTGTTGGGGGCTGAGGGAGGAAAAGGGATGAAATTTCAAGCACAAGAAAATGTGGGGTTTGGAGGACTTGTGGAGAATATGACTTATTTTAGAAACCAAGGAGCCATGGGAAGGCAGATTTGGGAGATTTGA

Protein sequence

MASLLALHPTASLPNSPKFHPSPIFHALNSCSSMAELKQFQSQIIRFGLSTDNDAIGRLIKFCAVSKYGDLHYALLLFNSIPYPDAFIYNTLIRAYLQFQSPKSSLLLYLQMLHNSVFPNKFTFPSLIRACCIDNAVKEGKQIHAHVVKFGFTTDRFSNNNLIHMYANFQSLEEARRVFDSIELPDLVAWTTLVTGYSQLGFVDEALRVFESMPEHNSVSWNAMISCFVQNNRFHEAFGLFNRMRLEKVVLDKYVAASMLSACTGLGALEQGKWIHRYIKKNGIELDSKLATTLIDMYCKCGCLDCAFQVFTHLPEKGISSWNCMIGGMAMHGKGAAAIELFKEMETKMVKPDNITFLNVLSACAHSGLVENGQYYFCRFTQVYGIEPGTEHYGCMVDLYGRAGMLEEAMKVIDEMPMSPDVGALGAFVGACKIHGNIELGEEIGKRVIELEPTNSGRYVLLGNLYAEAGRWESVAEVRKLMNDREVKKAAGVSIIELEGVVYEFIAGGRVHPEAKEIYDKLNEMLECIRIEGYVAENETEEEKDNPVYYHSEKLAIAFGLLKTKAGEMLRITKNLRVWGSDDGGSFVSDGTHGSSESHQHPQLKVPKKKRNLPGTPDPDADVVSLSPKTLMATNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLKQRTSGSETKRRVYVCPEPSCVHHDPARALGDLTGIKKHFSRKHGEKKWKCEKCSKRYAVQSDLKAHSKTCGSKEYKCDCGTIFSRRDSFITHRAFCDALTEENNKLVSHQVATTMASTAINAPSFQPQPLQHLLSQTPVLSPPLTLPHDLMPIPPKPLNLSAGPMFSSSISSATDGHHFPSPSALMSATALLQKAAQMGAAVSSRGNSTPCLSSPMVHEKKGFVTTMAPSSFCGILGTNCLQKCQEENILSQLPSKGKAVDMEMIDNIPMLNGVLDDQPVVEAARKTMTLDLLGAEGGKGMKFQAQENVGFGGLVENMTYFRNQGAMGRQIWEI
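
The relationship between the coding sequence and the protein sequence above can be checked by translating codons with the standard genetic code. The sketch below is illustrative only: `translate` and `CODON_TABLE` are hypothetical names, the standard nuclear code is an assumption, and only the first and last few codons of the CDS are used rather than the full sequence.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (TTT, TTC, TTA, TTG, TCT, ...). '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon.

    Trailing bases that do not fill a codon are ignored. Ambiguous
    bases such as N are not handled and raise KeyError.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons of the CDS listed above.
print(translate("ATGGCTTCACTTCTTGCCCTTCATCCCACT"))
# Last five codons of the CDS, ending in the TGA stop.
print(translate("ATTTGGGAGATTTGA"))
```

The two calls yield `MASLLALHPT` and `IWEI`, matching the start and the end of the protein sequence shown above.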
Homology
BLAST of CmUC03G066790.1 vs. NCBI nr
Match: XP_038880689.1 (pentatricopeptide repeat-containing protein At5g66520-like isoform X1 [Benincasa hispida])

HSP 1 Score: 1099.7 bits (2843), Expect = 0.0e+00
Identity = 535/578 (92.56%), Positives = 553/578 (95.67%), Query Frame = 0

Query: 1   MASLLALHPTASLPNSPKFHPSPIFHALNSCSSMAELKQFQSQIIRFGLSTDNDAIGRLI 60
           MASLL LHPTASLPNSPKF+PSP+FHALNSCSSM+ELKQF SQIIRFGLSTDN+AIGRLI
Sbjct: 1   MASLLPLHPTASLPNSPKFNPSPLFHALNSCSSMSELKQFHSQIIRFGLSTDNNAIGRLI 60

Query: 61  KFCAVSKYGDLHYALLLFNSIPYPDAFIYNTLIRAYLQFQSPKSSLLLYLQMLHNSVFPN 120
           KFCAVSKYGDLHYALLLFNSIPYPDAFIYNTLIR YLQ +SPKSSLL YLQMLHNSV PN
Sbjct: 61  KFCAVSKYGDLHYALLLFNSIPYPDAFIYNTLIRGYLQSESPKSSLLSYLQMLHNSVLPN 120

Query: 121 KFTFPSLIRACCIDNAVKEGKQIHAHVVKFGFTTDRFSNNNLIHMYANFQSLEEARRVFD 180
           KFTFPSLIRACCIDNAVKEGKQIHAHVVKFGFTTDRF+NNNLIHMYANFQSLEEARRVFD
Sbjct: 121 KFTFPSLIRACCIDNAVKEGKQIHAHVVKFGFTTDRFTNNNLIHMYANFQSLEEARRVFD 180

Query: 181 SIELPDLVAWTTLVTGYSQLGFVDEALRVFESMPEHNSVSWNAMISCFVQNNRFHEAFGL 240
            IELPD+VAWTTL+TGY+QLGFVDE LRVF+SMPEHNS SWNAMISCFVQNNRFHEAF L
Sbjct: 181 CIELPDVVAWTTLLTGYAQLGFVDEGLRVFQSMPEHNSASWNAMISCFVQNNRFHEAFSL 240

Query: 241 FNRMRLEKVVLDKYVAASMLSACTGLGALEQGKWIHRYIKKNGIELDSKLATTLIDMYCK 300
           FNRMRLEKVVL+KYVAASMLSACTGLGALEQGKWIHRYIK+NGIELDSKLATTLIDMYCK
Sbjct: 241 FNRMRLEKVVLEKYVAASMLSACTGLGALEQGKWIHRYIKRNGIELDSKLATTLIDMYCK 300

Query: 301 CGCLDCAFQVFTHLPEKGISSWNCMIGGMAMHGKGAAAIELFKEMETKMVKPDNITFLNV 360
           CGCLDCAF+VF HLPEKGISSWNCMIGGMAMHGKG AAIELFKEMETK VKPDNITFLNV
Sbjct: 301 CGCLDCAFEVFVHLPEKGISSWNCMIGGMAMHGKGEAAIELFKEMETKTVKPDNITFLNV 360

Query: 361 LSACAHSGLVENGQYYFCRFTQVYGIEPGTEHYGCMVDLYGRAGMLEEAMKVIDEMPMSP 420
           LSACAHSGLVE G+YYFC FTQVYG+EP  EHYGCMVDLYGRAGMLEEAMKVI+EMPMSP
Sbjct: 361 LSACAHSGLVEKGRYYFCHFTQVYGLEPRAEHYGCMVDLYGRAGMLEEAMKVINEMPMSP 420

Query: 421 DVGALGAFVGACKIHGNIELGEEIGKRVIELEPTNSGRYVLLGNLYAEAGRWESVAEVRK 480
           DVG LGAFVGACKIHGNIELGEEIGKRVIELEPTNSGRYVLLGNLYAEAGRWE VAEVRK
Sbjct: 421 DVGVLGAFVGACKIHGNIELGEEIGKRVIELEPTNSGRYVLLGNLYAEAGRWEGVAEVRK 480

Query: 481 LMNDREVKKAAGVSIIELEGVVYEFIAGGRVHPEAKEIYDKLNEMLECIRIEGYVAENET 540
           LMNDREVKKAAGVS+IELEGVVYEFIAGGR HPEAKEIY KLN+MLE IR EGYVAEN  
Sbjct: 481 LMNDREVKKAAGVSMIELEGVVYEFIAGGRNHPEAKEIYAKLNDMLEYIRTEGYVAENGI 540

Query: 541 EEEKDNPVYYHSEKLAIAFGLLKTKAGEMLRITKNLRV 579
           EEEKDNPVYYHSEKLAIAFGLLKTKAGEMLRITKNLRV
Sbjct: 541 EEEKDNPVYYHSEKLAIAFGLLKTKAGEMLRITKNLRV 578
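
The identity and positives percentages in the HSP header can be reproduced from the raw counts, and identities can be recounted from any Query/Sbjct row pair. The following is a minimal sketch with hypothetical helper names (`percent`, `count_identities`); it does not reimplement BLAST scoring, and counting positives would additionally need the substitution matrix used for the search, which the page does not name.

```python
def percent(count, length):
    """Percentage of aligned positions, rounded to two decimals."""
    return round(100.0 * count / length, 2)


def count_identities(query, sbjct):
    """Count positions where two equal-length aligned rows carry the same residue."""
    if len(query) != len(sbjct):
        raise ValueError("aligned rows must have the same length")
    # Gap characters are never counted as identities.
    return sum(q == s and q != "-" for q, s in zip(query, sbjct))


# Header counts of the first hit (XP_038880689.1).
print(percent(535, 578), percent(553, 578))

# First alignment row of the same hit (query and subject positions 1-60).
query = "MASLLALHPTASLPNSPKFHPSPIFHALNSCSSMAELKQFQSQIIRFGLSTDNDAIGRLI"
sbjct = "MASLLPLHPTASLPNSPKFNPSPLFHALNSCSSMSELKQFHSQIIRFGLSTDNNAIGRLI"
print(count_identities(query, sbjct), "of", len(query))
```

Summing `count_identities` over every row of an HSP should give the numerator of the Identity field, provided the rows are copied without the position numbers.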

BLAST of CmUC03G066790.1 vs. NCBI nr
Match: TYK10244.1 (pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 1089.7 bits (2817), Expect = 0.0e+00
Identity = 528/578 (91.35%), Positives = 551/578 (95.33%), Query Frame = 0

Query: 1   MASLLALHPTASLPNSPKFHPSPIFHALNSCSSMAELKQFQSQIIRFGLSTDNDAIGRLI 60
           M SLL LHP  SLPNSPKF+PSPIF ALNSCSSM+ELKQF SQIIR GLSTDN+AIGRLI
Sbjct: 1   MGSLLPLHPIPSLPNSPKFNPSPIFQALNSCSSMSELKQFHSQIIRLGLSTDNNAIGRLI 60

Query: 61  KFCAVSKYGDLHYALLLFNSIPYPDAFIYNTLIRAYLQFQSPKSSLLLYLQMLHNSVFPN 120
           KFCAVSKYGDLHYALLLFNSIPYPDAFIYNTLIRAYL F SPKSSLLLYLQMLHNSVFPN
Sbjct: 61  KFCAVSKYGDLHYALLLFNSIPYPDAFIYNTLIRAYLHFNSPKSSLLLYLQMLHNSVFPN 120

Query: 121 KFTFPSLIRACCIDNAVKEGKQIHAHVVKFGFTTDRFSNNNLIHMYANFQSLEEARRVFD 180
           KFTFPS+IRACCIDN+V+EGKQIH HVVKFGFT DRF  NNLIHMYANFQSLEEARRVFD
Sbjct: 121 KFTFPSVIRACCIDNSVEEGKQIHTHVVKFGFTKDRFCQNNLIHMYANFQSLEEARRVFD 180

Query: 181 SIELPDLVAWTTLVTGYSQLGFVDEALRVFESMPEHNSVSWNAMISCFVQNNRFHEAFGL 240
            IELPD+VAWTTL+TGY+QLG+VDE+LRVFESMPE NS SWNAMISCFVQNNRFHEAFGL
Sbjct: 181 CIELPDVVAWTTLLTGYAQLGYVDESLRVFESMPERNSASWNAMISCFVQNNRFHEAFGL 240

Query: 241 FNRMRLEKVVLDKYVAASMLSACTGLGALEQGKWIHRYIKKNGIELDSKLATTLIDMYCK 300
           FNRMRLEKVVL+K+VAASMLSACTGLGAL+QGKWIHRYI+KNGIE DSKLATTLIDMYCK
Sbjct: 241 FNRMRLEKVVLEKFVAASMLSACTGLGALDQGKWIHRYIEKNGIEFDSKLATTLIDMYCK 300

Query: 301 CGCLDCAFQVFTHLPEKGISSWNCMIGGMAMHGKGAAAIELFKEMETKMVKPDNITFLNV 360
           CGCLDCA++VF HLPEKGISSWNCMIGGMAMHGKG AAIELFKEMETKMVKPDNITFLNV
Sbjct: 301 CGCLDCAYEVFVHLPEKGISSWNCMIGGMAMHGKGEAAIELFKEMETKMVKPDNITFLNV 360

Query: 361 LSACAHSGLVENGQYYFCRFTQVYGIEPGTEHYGCMVDLYGRAGMLEEAMKVIDEMPMSP 420
           LSACAHSGLVE GQ+YF RFTQVYGIEP TEHYGCMVDLYGRAG+LEEAMKVIDEMPMSP
Sbjct: 361 LSACAHSGLVEKGQHYFNRFTQVYGIEPRTEHYGCMVDLYGRAGLLEEAMKVIDEMPMSP 420

Query: 421 DVGALGAFVGACKIHGNIELGEEIGKRVIELEPTNSGRYVLLGNLYAEAGRWESVAEVRK 480
           DVG LGAFVGACKIHGNIELGEE+GKRVIELEPTNSGRYVLLGNLYAEAGRWE VAEVRK
Sbjct: 421 DVGVLGAFVGACKIHGNIELGEEVGKRVIELEPTNSGRYVLLGNLYAEAGRWEGVAEVRK 480

Query: 481 LMNDREVKKAAGVSIIELEGVVYEFIAGGRVHPEAKEIYDKLNEMLECIRIEGYVAENET 540
           LMNDREVKKAAGVS+IELEGVVYEFIAGGR HPEAKEIYDKLNEMLECIR EGY+AENE 
Sbjct: 481 LMNDREVKKAAGVSMIELEGVVYEFIAGGRNHPEAKEIYDKLNEMLECIRNEGYIAENEI 540

Query: 541 EEEKDNPVYYHSEKLAIAFGLLKTKAGEMLRITKNLRV 579
           EEEKDNPVYYHSEKLAIAFGLLKTKAGE+LRITKNLRV
Sbjct: 541 EEEKDNPVYYHSEKLAIAFGLLKTKAGEILRITKNLRV 578

BLAST of CmUC03G066790.1 vs. NCBI nr
Match: KAE8653366.1 (hypothetical protein Csa_007505 [Cucumis sativus])

HSP 1 Score: 1086.2 bits (2808), Expect = 0.0e+00
Identity = 525/578 (90.83%), Positives = 552/578 (95.50%), Query Frame = 0

Query: 1   MASLLALHPTASLPNSPKFHPSPIFHALNSCSSMAELKQFQSQIIRFGLSTDNDAIGRLI 60
           MASLL LHP  SLPNS KF+PSPIFH+L+SCSSM+ELKQF SQIIR GLSTDN+AIGRLI
Sbjct: 1   MASLLPLHPIPSLPNSTKFNPSPIFHSLSSCSSMSELKQFHSQIIRLGLSTDNNAIGRLI 60

Query: 61  KFCAVSKYGDLHYALLLFNSIPYPDAFIYNTLIRAYLQFQSPKSSLLLYLQMLHNSVFPN 120
           KFCAVSKYGDLHYALLLFNSIPYPDAFIYNTLIRAYL F SPKSSLLLYLQMLHNSVFPN
Sbjct: 61  KFCAVSKYGDLHYALLLFNSIPYPDAFIYNTLIRAYLHFNSPKSSLLLYLQMLHNSVFPN 120

Query: 121 KFTFPSLIRACCIDNAVKEGKQIHAHVVKFGFTTDRFSNNNLIHMYANFQSLEEARRVFD 180
           KFTFPS+IRACCIDN+V+EGKQIH HVVKFGF+ DRF  NNLIHMYANFQSLE+ARRVFD
Sbjct: 121 KFTFPSVIRACCIDNSVEEGKQIHTHVVKFGFSKDRFCQNNLIHMYANFQSLEDARRVFD 180

Query: 181 SIELPDLVAWTTLVTGYSQLGFVDEALRVFESMPEHNSVSWNAMISCFVQNNRFHEAFGL 240
            IELPD+VAWTTL+TGY+QLG+VDE+LRVFESMPE NS SWNAMISCFVQNNRFHEAFGL
Sbjct: 181 CIELPDVVAWTTLLTGYAQLGYVDESLRVFESMPERNSASWNAMISCFVQNNRFHEAFGL 240

Query: 241 FNRMRLEKVVLDKYVAASMLSACTGLGALEQGKWIHRYIKKNGIELDSKLATTLIDMYCK 300
           FNRMR+EKVVL+KYVAASMLSACTGLGALEQGKWIHRYI++NGIE DSKLATTLIDMYCK
Sbjct: 241 FNRMRIEKVVLEKYVAASMLSACTGLGALEQGKWIHRYIERNGIEFDSKLATTLIDMYCK 300

Query: 301 CGCLDCAFQVFTHLPEKGISSWNCMIGGMAMHGKGAAAIELFKEMETKMVKPDNITFLNV 360
           CGCLDCA++VF HLPEKGISSWNCMIGGMAMHGKG AAIELFK+METKMVKPDNITFLNV
Sbjct: 301 CGCLDCAYEVFVHLPEKGISSWNCMIGGMAMHGKGEAAIELFKDMETKMVKPDNITFLNV 360

Query: 361 LSACAHSGLVENGQYYFCRFTQVYGIEPGTEHYGCMVDLYGRAGMLEEAMKVIDEMPMSP 420
           LSACAHSGLVE GQ+YF RFTQVYGIEP TEHYGCMVDLYGRAG+LEEAMKVIDEMPMSP
Sbjct: 361 LSACAHSGLVEKGQHYFYRFTQVYGIEPRTEHYGCMVDLYGRAGLLEEAMKVIDEMPMSP 420

Query: 421 DVGALGAFVGACKIHGNIELGEEIGKRVIELEPTNSGRYVLLGNLYAEAGRWESVAEVRK 480
           DVG LGAFVGACKIHGNIELGEE+GKRVIELEPTNSGRYVLLGNLYAEAGRWE VAEVRK
Sbjct: 421 DVGVLGAFVGACKIHGNIELGEEVGKRVIELEPTNSGRYVLLGNLYAEAGRWEGVAEVRK 480

Query: 481 LMNDREVKKAAGVSIIELEGVVYEFIAGGRVHPEAKEIYDKLNEMLECIRIEGYVAENET 540
           LMNDREVKKAAGVS+IELEGVVYEFIAGGR HPEAKEIYDKLNEMLECIR EGYVAENE 
Sbjct: 481 LMNDREVKKAAGVSMIELEGVVYEFIAGGRNHPEAKEIYDKLNEMLECIRSEGYVAENEI 540

Query: 541 EEEKDNPVYYHSEKLAIAFGLLKTKAGEMLRITKNLRV 579
           EEEKDNPVYYHSEKLAIAFGLLKTKAGE+LRITKNLRV
Sbjct: 541 EEEKDNPVYYHSEKLAIAFGLLKTKAGEILRITKNLRV 578

BLAST of CmUC03G066790.1 vs. NCBI nr
Match: KAA0050892.1 (pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 1084.3 bits (2803), Expect = 0.0e+00
Identity = 526/578 (91.00%), Positives = 549/578 (94.98%), Query Frame = 0

Query: 1   MASLLALHPTASLPNSPKFHPSPIFHALNSCSSMAELKQFQSQIIRFGLSTDNDAIGRLI 60
           M SLL LHP   LPNSPKF+PSPIF ALNSCSSM+ELKQF SQIIR GLSTDN+AIGRLI
Sbjct: 1   MGSLLPLHPIPFLPNSPKFNPSPIFQALNSCSSMSELKQFHSQIIRLGLSTDNNAIGRLI 60

Query: 61  KFCAVSKYGDLHYALLLFNSIPYPDAFIYNTLIRAYLQFQSPKSSLLLYLQMLHNSVFPN 120
           KFCAVSKYGDLHYALLLFNSIPYPDAFIYNTLIRAYL F SPKSSLLLYLQMLHNSVFPN
Sbjct: 61  KFCAVSKYGDLHYALLLFNSIPYPDAFIYNTLIRAYLHFNSPKSSLLLYLQMLHNSVFPN 120

Query: 121 KFTFPSLIRACCIDNAVKEGKQIHAHVVKFGFTTDRFSNNNLIHMYANFQSLEEARRVFD 180
           KFTFPS+IRACCIDN+V+EGKQIH HVVKFGFT DRF  NNLIHMYANFQSLEEARRVFD
Sbjct: 121 KFTFPSVIRACCIDNSVEEGKQIHTHVVKFGFTKDRFCQNNLIHMYANFQSLEEARRVFD 180

Query: 181 SIELPDLVAWTTLVTGYSQLGFVDEALRVFESMPEHNSVSWNAMISCFVQNNRFHEAFGL 240
            IELPD+VAWTTL+TGY+QLG+VDE+LRVFESMPE NS SWNAMISCFVQNNRFHEAFGL
Sbjct: 181 CIELPDVVAWTTLLTGYAQLGYVDESLRVFESMPERNSASWNAMISCFVQNNRFHEAFGL 240

Query: 241 FNRMRLEKVVLDKYVAASMLSACTGLGALEQGKWIHRYIKKNGIELDSKLATTLIDMYCK 300
           FNRMRLEKVVL+K+VAASMLSACTGLGALEQGKWIHRYI+KNGIE DSKLATTLIDMYCK
Sbjct: 241 FNRMRLEKVVLEKFVAASMLSACTGLGALEQGKWIHRYIEKNGIEFDSKLATTLIDMYCK 300

Query: 301 CGCLDCAFQVFTHLPEKGISSWNCMIGGMAMHGKGAAAIELFKEMETKMVKPDNITFLNV 360
           CGCLDCA+ VF HLPEKGISSWNCMIGGMAMHGKG AAIELFKEMETKMVKPDNITFLNV
Sbjct: 301 CGCLDCAYGVFVHLPEKGISSWNCMIGGMAMHGKGEAAIELFKEMETKMVKPDNITFLNV 360

Query: 361 LSACAHSGLVENGQYYFCRFTQVYGIEPGTEHYGCMVDLYGRAGMLEEAMKVIDEMPMSP 420
           LSACAHSGLVE GQ+YF RFTQVYGIEP TEHYGCMVDLYGRAG+LEEAMKVIDEMPMSP
Sbjct: 361 LSACAHSGLVEKGQHYFNRFTQVYGIEPRTEHYGCMVDLYGRAGLLEEAMKVIDEMPMSP 420

Query: 421 DVGALGAFVGACKIHGNIELGEEIGKRVIELEPTNSGRYVLLGNLYAEAGRWESVAEVRK 480
           DVG LGAFVGACKIHGNIELGEE+GKRVIELEPTNSGRYVLLGNLYAEAGRWE VAEVRK
Sbjct: 421 DVGVLGAFVGACKIHGNIELGEEVGKRVIELEPTNSGRYVLLGNLYAEAGRWEGVAEVRK 480

Query: 481 LMNDREVKKAAGVSIIELEGVVYEFIAGGRVHPEAKEIYDKLNEMLECIRIEGYVAENET 540
           LMNDREVKKAAGVS+IELEGVVYEFIAGGR HPEAKEIYDKLNEMLECI+ EGY+AENE 
Sbjct: 481 LMNDREVKKAAGVSMIELEGVVYEFIAGGRNHPEAKEIYDKLNEMLECIKNEGYIAENEI 540

Query: 541 EEEKDNPVYYHSEKLAIAFGLLKTKAGEMLRITKNLRV 579
           EEEKDNPVYYHSEKLAIAFGLLKTKAG++LRITKNLRV
Sbjct: 541 EEEKDNPVYYHSEKLAIAFGLLKTKAGKILRITKNLRV 578

BLAST of CmUC03G066790.1 vs. NCBI nr
Match: XP_022138850.1 (uncharacterized protein LOC111009921 isoform X1 [Momordica charantia])

HSP 1 Score: 984.9 bits (2545), Expect = 5.0e-283
Identity = 478/578 (82.70%), Positives = 516/578 (89.27%), Query Frame = 0

Query: 1   MASLLALHPTASLPNSPKFHPSPIFHALNSCSSMAELKQFQSQIIRFGLSTDNDAIGRLI 60
           M  LLAL PTAS+ NS + H SPI H L SCSSM+ELKQF SQIIR GLS DNDA+GRLI
Sbjct: 1   MPPLLALQPTASVINSSRVHTSPI-HGLQSCSSMSELKQFHSQIIRLGLSIDNDAMGRLI 60

Query: 61  KFCAVSKYGDLHYALLLFNSIPYPDAFIYNTLIRAYLQFQSPKSSLLLYLQMLHNSVFPN 120
           KFCAVSK GDL YALLLF +IPYPDAFIYNTLIR YLQ QS ++ +LLYLQMLH  V PN
Sbjct: 61  KFCAVSKNGDLDYALLLFKTIPYPDAFIYNTLIRGYLQQQSSRACILLYLQMLHKVVLPN 120

Query: 121 KFTFPSLIRACCIDNAVKEGKQIHAHVVKFGFTTDRFSNNNLIHMYANFQSLEEARRVFD 180
           KFTFPSLIRACCIDNA++EGKQ+HAHV+KFGF TD FS NNLIHMYANFQSLE+ARRVFD
Sbjct: 121 KFTFPSLIRACCIDNAIEEGKQVHAHVLKFGFRTDIFSQNNLIHMYANFQSLEDARRVFD 180

Query: 181 SIELPDLVAWTTLVTGYSQLGFVDEALRVFESMPEHNSVSWNAMISCFVQNNRFHEAFGL 240
            IELPD V WTTL+TGY+Q G VDEA +VFESMPEHNS SWNAMIS FVQNNRFHEAF L
Sbjct: 181 GIELPDAVTWTTLLTGYAQCGLVDEAFQVFESMPEHNSASWNAMISSFVQNNRFHEAFXL 240

Query: 241 FNRMRLEKVVLDKYVAASMLSACTGLGALEQGKWIHRYIKKNGIELDSKLATTLIDMYCK 300
           FNRMR EK+VLDKYVAASMLSACTGLGALEQGKWIHRYIKK+GIELDSKLATTLIDMYCK
Sbjct: 241 FNRMRSEKIVLDKYVAASMLSACTGLGALEQGKWIHRYIKKSGIELDSKLATTLIDMYCK 300

Query: 301 CGCLDCAFQVFTHLPEKGISSWNCMIGGMAMHGKGAAAIELFKEMETKMVKPDNITFLNV 360
           CGCLDCAF VFTHLPEKGISSWNCMIGGMAMHG+G AAIELFKEME KMV PDNITFLNV
Sbjct: 301 CGCLDCAFSVFTHLPEKGISSWNCMIGGMAMHGRGEAAIELFKEMEMKMVTPDNITFLNV 360

Query: 361 LSACAHSGLVENGQYYFCRFTQVYGIEPGTEHYGCMVDLYGRAGMLEEAMKVIDEMPMSP 420
           LSACAHSGLVE G++YF  F ++YGIEP TEH+GCMVDLYGRAGMLEEAMK+I EMPM+P
Sbjct: 361 LSACAHSGLVEEGRHYFRHFIELYGIEPRTEHFGCMVDLYGRAGMLEEAMKLISEMPMNP 420

Query: 421 DVGALGAFVGACKIHGNIELGEEIGKRVIELEPTNSGRYVLLGNLYAEAGRWESVAEVRK 480
           D G LGA VGACKIHG+++LGEEIG RVIELEPTNSGRYVLLGNLYA+AGRWE VAEVRK
Sbjct: 421 DAGVLGALVGACKIHGDVDLGEEIGLRVIELEPTNSGRYVLLGNLYAKAGRWEDVAEVRK 480

Query: 481 LMNDREVKKAAGVSIIELEGVVYEFIAGGRVHPEAKEIYDKLNEMLECIRIEGYVAENET 540
           LMNDREVKKA G S+IELEGVVYEFIAGGR HPEA +IY K+NEMLECIR  GYV ENE 
Sbjct: 481 LMNDREVKKAPGFSMIELEGVVYEFIAGGRAHPEADKIYVKVNEMLECIRYVGYVPENEI 540

Query: 541 EEEKDNPVYYHSEKLAIAFGLLKTKAGEMLRITKNLRV 579
           + +KDNP+YYHSEKLA+AFGLLKTKAGE LRITKNLR+
Sbjct: 541 DVDKDNPIYYHSEKLAVAFGLLKTKAGETLRITKNLRI 577

BLAST of CmUC03G066790.1 vs. ExPASy Swiss-Prot
Match: Q9FJY7 (Pentatricopeptide repeat-containing protein At5g66520 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H61 PE=2 SV=1)

HSP 1 Score: 480.3 bits (1235), Expect = 5.2e-134
Identity = 247/559 (44.19%), Positives = 349/559 (62.43%), Query Frame = 0

Query: 28  LNSCSSMAELKQFQSQIIRFGLSTDNDAIGRLIKFCAVSKYGD-LHYALLLFNSIPYPDA 87
           L  CS   ELKQ  +++++ GL  D+ AI + + FC  S   D L YA ++F+    PD 
Sbjct: 21  LQRCSKQEELKQIHARMLKTGLMQDSYAITKFLSFCISSTSSDFLPYAQIVFDGFDRPDT 80

Query: 88  FIYNTLIRAYLQFQSPKSSLLLYLQMLHNSVFPNKFTFPSLIRACCIDNAVKEGKQIHAH 147
           F++N +IR +     P+ SLLLY +ML +S   N +TFPSL++AC   +A +E  QIHA 
Sbjct: 81  FLWNLMIRGFSCSDEPERSLLLYQRMLCSSAPHNAYTFPSLLKACSNLSAFEETTQIHAQ 140

Query: 148 VVKFGFTTDRFSNNNLIHMYANFQSLEEARRVFDSIELPDLVAWTTLVTGYSQLGFVDEA 207
           + K G+  D ++ N+LI+ YA   + + A  +FD I  PD V+W +++ GY + G +D A
Sbjct: 141 ITKLGYENDVYAVNSLINSYAVTGNFKLAHLLFDRIPEPDDVSWNSVIKGYVKAGKMDIA 200

Query: 208 LRVFESMPEHNSVSWNAMISCFVQNNRFHEAFGLFNRMRLEKVVLDKYVAASMLSACTGL 267
           L +F  M E N++SW  MIS +VQ +   EA  LF+ M+   V  D    A+ LSAC  L
Sbjct: 201 LTLFRKMAEKNAISWTTMISGYVQADMNKEALQLFHEMQNSDVEPDNVSLANALSACAQL 260

Query: 268 GALEQGKWIHRYIKKNGIELDSKLATTLIDMYCKCGCLDCAFQVFTHLPEKGISSWNCMI 327
           GALEQGKWIH Y+ K  I +DS L   LIDMY KCG ++ A +VF ++ +K + +W  +I
Sbjct: 261 GALEQGKWIHSYLNKTRIRMDSVLGCVLIDMYAKCGEMEEALEVFKNIKKKSVQAWTALI 320

Query: 328 GGMAMHGKGAAAIELFKEMETKMVKPDNITFLNVLSACAHSGLVENGQYYFCRFTQVYGI 387
            G A HG G  AI  F EM+   +KP+ ITF  VL+AC+++GLVE G+  F    + Y +
Sbjct: 321 SGYAYHGHGREAISKFMEMQKMGIKPNVITFTAVLTACSYTGLVEEGKLIFYSMERDYNL 380

Query: 388 EPGTEHYGCMVDLYGRAGMLEEAMKVIDEMPMSPDVGALGAFVGACKIHGNIELGEEIGK 447
           +P  EHYGC+VDL GRAG+L+EA + I EMP+ P+    GA + AC+IH NIELGEEIG+
Sbjct: 381 KPTIEHYGCIVDLLGRAGLLDEAKRFIQEMPLKPNAVIWGALLKACRIHKNIELGEEIGE 440

Query: 448 RVIELEPTNSGRYVLLGNLYAEAGRWESVAEVRKLMNDREVKKAAGVSIIELEGVVYEFI 507
            +I ++P + GRYV   N++A   +W+  AE R+LM ++ V K  G S I LEG  +EF+
Sbjct: 441 ILIAIDPYHGGRYVHKANIHAMDKKWDKAAETRRLMKEQGVAKVPGCSTISLEGTTHEFL 500

Query: 508 AGGRVHPEAKEIYDKLNEMLECIRIEGYVAENE-------TEEEKDNPVYYHSEKLAIAF 567
           AG R HPE ++I  K   M   +   GYV E E        ++E++  V+ HSEKLAI +
Sbjct: 501 AGDRSHPEIEKIQSKWRIMRRKLEENGYVPELEEMLLDLVDDDEREAIVHQHSEKLAITY 560

Query: 568 GLLKTKAGEMLRITKNLRV 579
           GL+KTK G ++RI KNLRV
Sbjct: 561 GLIKTKPGTIIRIMKNLRV 579

BLAST of CmUC03G066790.1 vs. ExPASy Swiss-Prot
Match: Q9FI80 (Pentatricopeptide repeat-containing protein At5g48910 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H38 PE=2 SV=1)

HSP 1 Score: 460.3 bits (1183), Expect = 5.6e-128
Identity = 228/589 (38.71%), Positives = 356/589 (60.44%), Query Frame = 0

Query: 15  NSPKFHPSPIFHALNSCSSMAELKQFQSQIIRFGLSTDNDAIGRLIKFCAVS--KYGDLH 74
           +SP  HPS +F  +N+C ++ +L Q  +  I+ G   D  A   +++FCA S   + DL 
Sbjct: 17  SSPASHPSSLFPQINNCRTIRDLSQIHAVFIKSGQMRDTLAAAEILRFCATSDLHHRDLD 76

Query: 75  YALLLFNSIPYPDAFIYNTLIRAYLQFQSPKSSL---LLYLQMLHNSVFPNKFTFPSLIR 134
           YA  +FN +P  + F +NT+IR + +    K+ +   L Y  M    V PN+FTFPS+++
Sbjct: 77  YAHKIFNQMPQRNCFSWNTIIRGFSESDEDKALIAITLFYEMMSDEFVEPNRFTFPSVLK 136

Query: 135 ACCIDNAVKEGKQIHAHVVKFGFTTDRFSNNNLIHMYANFQSLEEARRVF-------DSI 194
           AC     ++EGKQIH   +K+GF  D F  +NL+ MY     +++AR +F       D +
Sbjct: 137 ACAKTGKIQEGKQIHGLALKYGFGGDEFVMSNLVRMYVMCGFMKDARVLFYKNIIEKDMV 196

Query: 195 ELPD-------LVAWTTLVTGYSQLGFVDEALRVFESMPEHNSVSWNAMISCFVQNNRFH 254
            + D       +V W  ++ GY +LG    A  +F+ M + + VSWN MIS +  N  F 
Sbjct: 197 VMTDRRKRDGEIVLWNVMIDGYMRLGDCKAARMLFDKMRQRSVVSWNTMISGYSLNGFFK 256

Query: 255 EAFGLFNRMRLEKVVLDKYVAASMLSACTGLGALEQGKWIHRYIKKNGIELDSKLATTLI 314
           +A  +F  M+   +  +     S+L A + LG+LE G+W+H Y + +GI +D  L + LI
Sbjct: 257 DAVEVFREMKKGDIRPNYVTLVSVLPAISRLGSLELGEWLHLYAEDSGIRIDDVLGSALI 316

Query: 315 DMYCKCGCLDCAFQVFTHLPEKGISSWNCMIGGMAMHGKGAAAIELFKEMETKMVKPDNI 374
           DMY KCG ++ A  VF  LP + + +W+ MI G A+HG+   AI+ F +M    V+P ++
Sbjct: 317 DMYSKCGIIEKAIHVFERLPRENVITWSAMINGFAIHGQAGDAIDCFCKMRQAGVRPSDV 376

Query: 375 TFLNVLSACAHSGLVENGQYYFCRFTQVYGIEPGTEHYGCMVDLYGRAGMLEEAMKVIDE 434
            ++N+L+AC+H GLVE G+ YF +   V G+EP  EHYGCMVDL GR+G+L+EA + I  
Sbjct: 377 AYINLLTACSHGGLVEEGRRYFSQMVSVDGLEPRIEHYGCMVDLLGRSGLLDEAEEFILN 436

Query: 435 MPMSPDVGALGAFVGACKIHGNIELGEEIGKRVIELEPTNSGRYVLLGNLYAEAGRWESV 494
           MP+ PD     A +GAC++ GN+E+G+ +   ++++ P +SG YV L N+YA  G W  V
Sbjct: 437 MPIKPDDVIWKALLGACRMQGNVEMGKRVANILMDMVPHDSGAYVALSNMYASQGNWSEV 496

Query: 495 AEVRKLMNDREVKKAAGVSIIELEGVVYEFIAGGRVHPEAKEIYDKLNEMLECIRIEGY- 554
           +E+R  M +++++K  G S+I+++GV++EF+     HP+AKEI   L E+ + +R+ GY 
Sbjct: 497 SEMRLRMKEKDIRKDPGCSLIDIDGVLHEFVVEDDSHPKAKEINSMLVEISDKLRLAGYR 556

Query: 555 -----VAENETEEEKDNPVYYHSEKLAIAFGLLKTKAGEMLRITKNLRV 579
                V  N  EE+K+N ++YHSEK+A AFGL+ T  G+ +RI KNLR+
Sbjct: 557 PITTQVLLNLEEEDKENVLHYHSEKIATAFGLISTSPGKPIRIVKNLRI 605

BLAST of CmUC03G066790.1 vs. ExPASy Swiss-Prot
Match: Q9LN01 (Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H12 PE=2 SV=1)

HSP 1 Score: 449.9 bits (1156), Expect = 7.5e-125
Identity = 247/680 (36.32%), Positives = 365/680 (53.68%), Query Frame = 0

Query: 9   PTASLPNSPKFHPSPIFHALNSCSSMAELKQFQSQIIRFGLSTDNDAIGRLIKFCAVSKY 68
           P++S P        P    L++C ++  L+   +Q+I+ GL   N A+ +LI+FC +S +
Sbjct: 21  PSSSDPPYDSIRNHPSLSLLHNCKTLQSLRIIHAQMIKIGLHNTNYALSKLIEFCILSPH 80

Query: 69  GD-LHYALLLFNSIPYPDAFIYNTLIRAYLQFQSPKSSLLLYLQMLHNSVFPNKFTFPSL 128
            + L YA+ +F +I  P+  I+NT+ R +     P S+L LY+ M+   + PN +TFP +
Sbjct: 81  FEGLPYAISVFKTIQEPNLLIWNTMFRGHALSSDPVSALKLYVCMISLGLLPNSYTFPFV 140

Query: 129 IRACCIDNAVKEGKQIHAHVVKFGFTTDRFSNNNLIHMYANFQSLEEARRVFDSIELPDL 188
           +++C    A KEG+QIH HV+K G   D + + +LI MY     LE+A +VFD     D+
Sbjct: 141 LKSCAKSKAFKEGQQIHGHVLKLGCDLDLYVHTSLISMYVQNGRLEDAHKVFDKSPHRDV 200

Query: 189 VAWTTLVTGYSQLGFVDEALRVFESMPEHNSVSWNAMISCFVQNNRFHEAFGLF------ 248
           V++T L+ GY+  G+++ A ++F+ +P  + VSWNAMIS + +   + EA  LF      
Sbjct: 201 VSYTALIKGYASRGYIENAQKLFDEIPVKDVVSWNAMISGYAETGNYKEALELFKDMMKT 260

Query: 249 --------------------------------------NRMRLEKVVLDKY--------- 308
                                                 + +++   ++D Y         
Sbjct: 261 NVRPDESTMVTVVSACAQSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSKCGELETA 320

Query: 309 ------------------------------------------------VAASMLSACTGL 368
                                                              S+L AC  L
Sbjct: 321 CGLFERLPYKDVISWNTLIGGYTHMNLYKEALLLFQEMLRSGETPNDVTMLSILPACAHL 380

Query: 369 GALEQGKWIHRYIKK--NGIELDSKLATTLIDMYCKCGCLDCAFQVFTHLPEKGISSWNC 428
           GA++ G+WIH YI K   G+   S L T+LIDMY KCG ++ A QVF  +  K +SSWN 
Sbjct: 381 GAIDIGRWIHVYIDKRLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSILHKSLSSWNA 440

Query: 429 MIGGMAMHGKGAAAIELFKEMETKMVKPDNITFLNVLSACAHSGLVENGQYYFCRFTQVY 488
           MI G AMHG+  A+ +LF  M    ++PD+ITF+ +LSAC+HSG+++ G++ F   TQ Y
Sbjct: 441 MIFGFAMHGRADASFDLFSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRHIFRTMTQDY 500

Query: 489 GIEPGTEHYGCMVDLYGRAGMLEEAMKVIDEMPMSPDVGALGAFVGACKIHGNIELGEEI 548
            + P  EHYGCM+DL G +G+ +EA ++I+ M M PD     + + ACK+HGN+ELGE  
Sbjct: 501 KMTPKLEHYGCMIDLLGHSGLFKEAEEMINMMEMEPDGVIWCSLLKACKMHGNVELGESF 560

Query: 549 GKRVIELEPTNSGRYVLLGNLYAEAGRWESVAEVRKLMNDREVKKAAGVSIIELEGVVYE 579
            + +I++EP N G YVLL N+YA AGRW  VA+ R L+ND+ +KK  G S IE++ VV+E
Sbjct: 561 AENLIKIEPENPGSYVLLSNIYASAGRWNEVAKTRALLNDKGMKKVPGCSSIEIDSVVHE 620

BLAST of CmUC03G066790.1 vs. ExPASy Swiss-Prot
Match: O82380 (Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H33 PE=2 SV=1)

HSP 1 Score: 440.7 bits (1132), Expect = 4.6e-122
Identity = 244/688 (35.47%), Positives = 370/688 (53.78%), Query Frame = 0

Query: 5   LALHPTASLPNSPKFHPSPIFH--ALNSCSSMAELKQFQSQIIRFGLSTDNDAIGRLIKF 64
           L  HP  S PN P  +     H   +  C S+ +LKQ    +IR G  +D  +  +L   
Sbjct: 12  LPRHPNFSNPNQPTTNNERSRHISLIERCVSLRQLKQTHGHMIRTGTFSDPYSASKLFAM 71

Query: 65  CAVSKYGDLHYALLLFNSIPYPDAFIYNTLIRAYLQFQSPKSSLLLYLQMLHNS-VFPNK 124
            A+S +  L YA  +F+ IP P++F +NTLIRAY     P  S+  +L M+  S  +PNK
Sbjct: 72  AALSSFASLEYARKVFDEIPKPNSFAWNTLIRAYASGPDPVLSIWAFLDMVSESQCYPNK 131

Query: 125 FTFPSLIRACCIDNAVKEGKQIHAHVVKFGFTTDRFSNNNLIH----------------- 184
           +TFP LI+A    +++  G+ +H   VK    +D F  N+LIH                 
Sbjct: 132 YTFPFLIKAAAEVSSLSLGQSLHGMAVKSAVGSDVFVANSLIHCYFSCGDLDSACKVFTT 191

Query: 185 ------------------------------------------------------------ 244
                                                                       
Sbjct: 192 IKEKDVVSWNSMINGFVQKGSPDKALELFKKMESEDVKASHVTMVGVLSACAKIRNLEFG 251

Query: 245 ------------------------MYANFQSLEEARRVFDSIELPDLVAWTTLVTGYSQL 304
                                   MY    S+E+A+R+FD++E  D V WTT++ GY+  
Sbjct: 252 RQVCSYIEENRVNVNLTLANAMLDMYTKCGSIEDAKRLFDAMEEKDNVTWTTMLDGYAIS 311

Query: 305 GFVDEALRVFESMPEHNSVSWNAMISCFVQNNRFHEAFGLFNRMRLEK-VVLDKYVAASM 364
              + A  V  SMP+ + V+WNA+IS + QN + +EA  +F+ ++L+K + L++    S 
Sbjct: 312 EDYEAAREVLNSMPQKDIVAWNALISAYEQNGKPNEALIVFHELQLQKNMKLNQITLVST 371

Query: 365 LSACTGLGALEQGKWIHRYIKKNGIELDSKLATTLIDMYCKCGCLDCAFQVFTHLPEKGI 424
           LSAC  +GALE G+WIH YIKK+GI ++  + + LI MY KCG L+ + +VF  + ++ +
Sbjct: 372 LSACAQVGALELGRWIHSYIKKHGIRMNFHVTSALIHMYSKCGDLEKSREVFNSVEKRDV 431

Query: 425 SSWNCMIGGMAMHGKGAAAIELFKEMETKMVKPDNITFLNVLSACAHSGLVENGQYYFCR 484
             W+ MIGG+AMHG G  A+++F +M+   VKP+ +TF NV  AC+H+GLV+  +  F +
Sbjct: 432 FVWSAMIGGLAMHGCGNEAVDMFYKMQEANVKPNGVTFTNVFCACSHTGLVDEAESLFHQ 491

Query: 485 FTQVYGIEPGTEHYGCMVDLYGRAGMLEEAMKVIDEMPMSPDVGALGAFVGACKIHGNIE 544
               YGI P  +HY C+VD+ GR+G LE+A+K I+ MP+ P     GA +GACKIH N+ 
Sbjct: 492 MESNYGIVPEEKHYACIVDVLGRSGYLEKAVKFIEAMPIPPSTSVWGALLGACKIHANLN 551

Query: 545 LGEEIGKRVIELEPTNSGRYVLLGNLYAEAGRWESVAEVRKLMNDREVKKAAGVSIIELE 581
           L E    R++ELEP N G +VLL N+YA+ G+WE+V+E+RK M    +KK  G S IE++
Sbjct: 552 LAEMACTRLLELEPRNDGAHVLLSNIYAKLGKWENVSELRKHMRVTGLKKEPGCSSIEID 611

BLAST of CmUC03G066790.1 vs. ExPASy Swiss-Prot
Match: Q9FG16 (Pentatricopeptide repeat-containing protein At5g06540 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H88 PE=3 SV=1)

HSP 1 Score: 432.2 bits (1110), Expect = 1.6e-119
Identity = 226/568 (39.79%), Positives = 340/568 (59.86%), Query Frame = 0

Query: 23  PIFHALNSCSSMAELKQFQSQIIRFGLSTDNDAIGRLIKFCAVSKYGD-----LHYALLL 82
           P    L SCSS ++LK     ++R  L +D     RL+  C      +     L YA  +
Sbjct: 14  PKLALLQSCSSFSDLKIIHGFLLRTHLISDVFVASRLLALCVDDSTFNKPTNLLGYAYGI 73

Query: 83  FNSIPYPDAFIYNTLIRAYLQFQSPKSSLLLYLQMLHNSVFPNKFTFPSLIRACCIDNAV 142
           F+ I  P+ F++N LIR +     P  +   Y QML + ++P+  TFP LI+A      V
Sbjct: 74  FSQIQNPNLFVFNLLIRCFSTGAEPSKAFGFYTQMLKSRIWPDNITFPFLIKASSEMECV 133

Query: 143 KEGKQIHAHVVKFGFTTDRFSNNNLIHMYANFQSLEEARRVFDSIELPDLVAWTTLVTGY 202
             G+Q H+ +V+FGF  D +  N+L+HMYAN   +  A R+F  +   D+V+WT++V GY
Sbjct: 134 LVGEQTHSQIVRFGFQNDVYVENSLVHMYANCGFIAAAGRIFGQMGFRDVVSWTSMVAGY 193

Query: 203 SQLGFVDEALRVFESMPEHNSVSWNAMISCFVQNNRFHEAFGLFNRMRLEKVVLDKYVAA 262
            + G V+ A  +F+ MP  N  +W+ MI+ + +NN F +A  LF  M+ E VV ++ V  
Sbjct: 194 CKCGMVENAREMFDEMPHRNLFTWSIMINGYAKNNCFEKAIDLFEFMKREGVVANETVMV 253

Query: 263 SMLSACTGLGALEQGKWIHRYIKKNGIELDSKLATTLIDMYCKCGCLDCAFQVFTHLPEK 322
           S++S+C  LGALE G+  + Y+ K+ + ++  L T L+DM+ +CG ++ A  VF  LPE 
Sbjct: 254 SVISSCAHLGALEFGERAYEYVVKSHMTVNLILGTALVDMFWRCGDIEKAIHVFEGLPET 313

Query: 323 GISSWNCMIGGMAMHGKGAAAIELFKEMETKMVKPDNITFLNVLSACAHSGLVENGQYYF 382
              SW+ +I G+A+HG    A+  F +M +    P ++TF  VLSAC+H GLVE G   +
Sbjct: 314 DSLSWSSIIKGLAVHGHAHKAMHYFSQMISLGFIPRDVTFTAVLSACSHGGLVEKGLEIY 373

Query: 383 CRFTQVYGIEPGTEHYGCMVDLYGRAGMLEEAMKVIDEMPMSPDVGALGAFVGACKIHGN 442
               + +GIEP  EHYGC+VD+ GRAG L EA   I +M + P+   LGA +GACKI+ N
Sbjct: 374 ENMKKDHGIEPRLEHYGCIVDMLGRAGKLAEAENFILKMHVKPNAPILGALLGACKIYKN 433

Query: 443 IELGEEIGKRVIELEPTNSGRYVLLGNLYAEAGRWESVAEVRKLMNDREVKKAAGVSIIE 502
            E+ E +G  +I+++P +SG YVLL N+YA AG+W+ +  +R +M ++ VKK  G S+IE
Sbjct: 434 TEVAERVGNMLIKVKPEHSGYYVLLSNIYACAGQWDKIESLRDMMKEKLVKKPPGWSLIE 493

Query: 503 LEGVVYEFIAG-GRVHPEAKEIYDKLNEMLECIRIEGYVAE------NETEEEKDNPVYY 562
           ++G + +F  G  + HPE  +I  K  E+L  IR+ GY         +  EEEK++ ++ 
Sbjct: 494 IDGKINKFTMGDDQKHPEMGKIRRKWEEILGKIRLIGYKGNTGDAFFDVDEEEKESSIHM 553

Query: 563 HSEKLAIAFGLLKTKAGEMLRITKNLRV 579
           HSEKLAIA+G++KTK G  +RI KNLRV
Sbjct: 554 HSEKLAIAYGMMKTKPGTTIRIVKNLRV 581

BLAST of CmUC03G066790.1 vs. ExPASy TrEMBL
Match: A0A5D3CGC5 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold16G003910 PE=3 SV=1)

HSP 1 Score: 1089.7 bits (2817), Expect = 0.0e+00
Identity = 528/578 (91.35%), Positives = 551/578 (95.33%), Query Frame = 0

Query: 1   MASLLALHPTASLPNSPKFHPSPIFHALNSCSSMAELKQFQSQIIRFGLSTDNDAIGRLI 60
           M SLL LHP  SLPNSPKF+PSPIF ALNSCSSM+ELKQF SQIIR GLSTDN+AIGRLI
Sbjct: 1   MGSLLPLHPIPSLPNSPKFNPSPIFQALNSCSSMSELKQFHSQIIRLGLSTDNNAIGRLI 60

Query: 61  KFCAVSKYGDLHYALLLFNSIPYPDAFIYNTLIRAYLQFQSPKSSLLLYLQMLHNSVFPN 120
           KFCAVSKYGDLHYALLLFNSIPYPDAFIYNTLIRAYL F SPKSSLLLYLQMLHNSVFPN
Sbjct: 61  KFCAVSKYGDLHYALLLFNSIPYPDAFIYNTLIRAYLHFNSPKSSLLLYLQMLHNSVFPN 120

Query: 121 KFTFPSLIRACCIDNAVKEGKQIHAHVVKFGFTTDRFSNNNLIHMYANFQSLEEARRVFD 180
           KFTFPS+IRACCIDN+V+EGKQIH HVVKFGFT DRF  NNLIHMYANFQSLEEARRVFD
Sbjct: 121 KFTFPSVIRACCIDNSVEEGKQIHTHVVKFGFTKDRFCQNNLIHMYANFQSLEEARRVFD 180

Query: 181 SIELPDLVAWTTLVTGYSQLGFVDEALRVFESMPEHNSVSWNAMISCFVQNNRFHEAFGL 240
            IELPD+VAWTTL+TGY+QLG+VDE+LRVFESMPE NS SWNAMISCFVQNNRFHEAFGL
Sbjct: 181 CIELPDVVAWTTLLTGYAQLGYVDESLRVFESMPERNSASWNAMISCFVQNNRFHEAFGL 240

Query: 241 FNRMRLEKVVLDKYVAASMLSACTGLGALEQGKWIHRYIKKNGIELDSKLATTLIDMYCK 300
           FNRMRLEKVVL+K+VAASMLSACTGLGAL+QGKWIHRYI+KNGIE DSKLATTLIDMYCK
Sbjct: 241 FNRMRLEKVVLEKFVAASMLSACTGLGALDQGKWIHRYIEKNGIEFDSKLATTLIDMYCK 300

Query: 301 CGCLDCAFQVFTHLPEKGISSWNCMIGGMAMHGKGAAAIELFKEMETKMVKPDNITFLNV 360
           CGCLDCA++VF HLPEKGISSWNCMIGGMAMHGKG AAIELFKEMETKMVKPDNITFLNV
Sbjct: 301 CGCLDCAYEVFVHLPEKGISSWNCMIGGMAMHGKGEAAIELFKEMETKMVKPDNITFLNV 360

Query: 361 LSACAHSGLVENGQYYFCRFTQVYGIEPGTEHYGCMVDLYGRAGMLEEAMKVIDEMPMSP 420
           LSACAHSGLVE GQ+YF RFTQVYGIEP TEHYGCMVDLYGRAG+LEEAMKVIDEMPMSP
Sbjct: 361 LSACAHSGLVEKGQHYFNRFTQVYGIEPRTEHYGCMVDLYGRAGLLEEAMKVIDEMPMSP 420

Query: 421 DVGALGAFVGACKIHGNIELGEEIGKRVIELEPTNSGRYVLLGNLYAEAGRWESVAEVRK 480
           DVG LGAFVGACKIHGNIELGEE+GKRVIELEPTNSGRYVLLGNLYAEAGRWE VAEVRK
Sbjct: 421 DVGVLGAFVGACKIHGNIELGEEVGKRVIELEPTNSGRYVLLGNLYAEAGRWEGVAEVRK 480

Query: 481 LMNDREVKKAAGVSIIELEGVVYEFIAGGRVHPEAKEIYDKLNEMLECIRIEGYVAENET 540
           LMNDREVKKAAGVS+IELEGVVYEFIAGGR HPEAKEIYDKLNEMLECIR EGY+AENE 
Sbjct: 481 LMNDREVKKAAGVSMIELEGVVYEFIAGGRNHPEAKEIYDKLNEMLECIRNEGYIAENEI 540

Query: 541 EEEKDNPVYYHSEKLAIAFGLLKTKAGEMLRITKNLRV 579
           EEEKDNPVYYHSEKLAIAFGLLKTKAGE+LRITKNLRV
Sbjct: 541 EEEKDNPVYYHSEKLAIAFGLLKTKAGEILRITKNLRV 578

BLAST of CmUC03G066790.1 vs. ExPASy TrEMBL
Match: A0A0A0LWF1 (DYW_deaminase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_1G569490 PE=3 SV=1)

HSP 1 Score: 1084.7 bits (2804), Expect = 0.0e+00
Identity = 524/577 (90.81%), Positives = 551/577 (95.49%), Query Frame = 0

Query: 1   MASLLALHPTASLPNSPKFHPSPIFHALNSCSSMAELKQFQSQIIRFGLSTDNDAIGRLI 60
           MASLL LHP  SLPNS KF+PSPIFH+L+SCSSM+ELKQF SQIIR GLSTDN+AIGRLI
Sbjct: 1   MASLLPLHPIPSLPNSTKFNPSPIFHSLSSCSSMSELKQFHSQIIRLGLSTDNNAIGRLI 60

Query: 61  KFCAVSKYGDLHYALLLFNSIPYPDAFIYNTLIRAYLQFQSPKSSLLLYLQMLHNSVFPN 120
           KFCAVSKYGDLHYALLLFNSIPYPDAFIYNTLIRAYL F SPKSSLLLYLQMLHNSVFPN
Sbjct: 61  KFCAVSKYGDLHYALLLFNSIPYPDAFIYNTLIRAYLHFNSPKSSLLLYLQMLHNSVFPN 120

Query: 121 KFTFPSLIRACCIDNAVKEGKQIHAHVVKFGFTTDRFSNNNLIHMYANFQSLEEARRVFD 180
           KFTFPS+IRACCIDN+V+EGKQIH HVVKFGF+ DRF  NNLIHMYANFQSLE+ARRVFD
Sbjct: 121 KFTFPSVIRACCIDNSVEEGKQIHTHVVKFGFSKDRFCQNNLIHMYANFQSLEDARRVFD 180

Query: 181 SIELPDLVAWTTLVTGYSQLGFVDEALRVFESMPEHNSVSWNAMISCFVQNNRFHEAFGL 240
            IELPD+VAWTTL+TGY+QLG+VDE+LRVFESMPE NS SWNAMISCFVQNNRFHEAFGL
Sbjct: 181 CIELPDVVAWTTLLTGYAQLGYVDESLRVFESMPERNSASWNAMISCFVQNNRFHEAFGL 240

Query: 241 FNRMRLEKVVLDKYVAASMLSACTGLGALEQGKWIHRYIKKNGIELDSKLATTLIDMYCK 300
           FNRMR+EKVVL+KYVAASMLSACTGLGALEQGKWIHRYI++NGIE DSKLATTLIDMYCK
Sbjct: 241 FNRMRIEKVVLEKYVAASMLSACTGLGALEQGKWIHRYIERNGIEFDSKLATTLIDMYCK 300

Query: 301 CGCLDCAFQVFTHLPEKGISSWNCMIGGMAMHGKGAAAIELFKEMETKMVKPDNITFLNV 360
           CGCLDCA++VF HLPEKGISSWNCMIGGMAMHGKG AAIELFK+METKMVKPDNITFLNV
Sbjct: 301 CGCLDCAYEVFVHLPEKGISSWNCMIGGMAMHGKGEAAIELFKDMETKMVKPDNITFLNV 360

Query: 361 LSACAHSGLVENGQYYFCRFTQVYGIEPGTEHYGCMVDLYGRAGMLEEAMKVIDEMPMSP 420
           LSACAHSGLVE GQ+YF RFTQVYGIEP TEHYGCMVDLYGRAG+LEEAMKVIDEMPMSP
Sbjct: 361 LSACAHSGLVEKGQHYFYRFTQVYGIEPRTEHYGCMVDLYGRAGLLEEAMKVIDEMPMSP 420

Query: 421 DVGALGAFVGACKIHGNIELGEEIGKRVIELEPTNSGRYVLLGNLYAEAGRWESVAEVRK 480
           DVG LGAFVGACKIHGNIELGEE+GKRVIELEPTNSGRYVLLGNLYAEAGRWE VAEVRK
Sbjct: 421 DVGVLGAFVGACKIHGNIELGEEVGKRVIELEPTNSGRYVLLGNLYAEAGRWEGVAEVRK 480

Query: 481 LMNDREVKKAAGVSIIELEGVVYEFIAGGRVHPEAKEIYDKLNEMLECIRIEGYVAENET 540
           LMNDREVKKAAGVS+IELEGVVYEFIAGGR HPEAKEIYDKLNEMLECIR EGYVAENE 
Sbjct: 481 LMNDREVKKAAGVSMIELEGVVYEFIAGGRNHPEAKEIYDKLNEMLECIRSEGYVAENEI 540

Query: 541 EEEKDNPVYYHSEKLAIAFGLLKTKAGEMLRITKNLR 578
           EEEKDNPVYYHSEKLAIAFGLLKTKAGE+LRITKNLR
Sbjct: 541 EEEKDNPVYYHSEKLAIAFGLLKTKAGEILRITKNLR 577

BLAST of CmUC03G066790.1 vs. ExPASy TrEMBL
Match: A0A5A7U4W9 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold761G00170 PE=3 SV=1)

HSP 1 Score: 1084.3 bits (2803), Expect = 0.0e+00
Identity = 526/578 (91.00%), Positives = 549/578 (94.98%), Query Frame = 0

Query: 1   MASLLALHPTASLPNSPKFHPSPIFHALNSCSSMAELKQFQSQIIRFGLSTDNDAIGRLI 60
           M SLL LHP   LPNSPKF+PSPIF ALNSCSSM+ELKQF SQIIR GLSTDN+AIGRLI
Sbjct: 1   MGSLLPLHPIPFLPNSPKFNPSPIFQALNSCSSMSELKQFHSQIIRLGLSTDNNAIGRLI 60

Query: 61  KFCAVSKYGDLHYALLLFNSIPYPDAFIYNTLIRAYLQFQSPKSSLLLYLQMLHNSVFPN 120
           KFCAVSKYGDLHYALLLFNSIPYPDAFIYNTLIRAYL F SPKSSLLLYLQMLHNSVFPN
Sbjct: 61  KFCAVSKYGDLHYALLLFNSIPYPDAFIYNTLIRAYLHFNSPKSSLLLYLQMLHNSVFPN 120

Query: 121 KFTFPSLIRACCIDNAVKEGKQIHAHVVKFGFTTDRFSNNNLIHMYANFQSLEEARRVFD 180
           KFTFPS+IRACCIDN+V+EGKQIH HVVKFGFT DRF  NNLIHMYANFQSLEEARRVFD
Sbjct: 121 KFTFPSVIRACCIDNSVEEGKQIHTHVVKFGFTKDRFCQNNLIHMYANFQSLEEARRVFD 180

Query: 181 SIELPDLVAWTTLVTGYSQLGFVDEALRVFESMPEHNSVSWNAMISCFVQNNRFHEAFGL 240
            IELPD+VAWTTL+TGY+QLG+VDE+LRVFESMPE NS SWNAMISCFVQNNRFHEAFGL
Sbjct: 181 CIELPDVVAWTTLLTGYAQLGYVDESLRVFESMPERNSASWNAMISCFVQNNRFHEAFGL 240

Query: 241 FNRMRLEKVVLDKYVAASMLSACTGLGALEQGKWIHRYIKKNGIELDSKLATTLIDMYCK 300
           FNRMRLEKVVL+K+VAASMLSACTGLGALEQGKWIHRYI+KNGIE DSKLATTLIDMYCK
Sbjct: 241 FNRMRLEKVVLEKFVAASMLSACTGLGALEQGKWIHRYIEKNGIEFDSKLATTLIDMYCK 300

Query: 301 CGCLDCAFQVFTHLPEKGISSWNCMIGGMAMHGKGAAAIELFKEMETKMVKPDNITFLNV 360
           CGCLDCA+ VF HLPEKGISSWNCMIGGMAMHGKG AAIELFKEMETKMVKPDNITFLNV
Sbjct: 301 CGCLDCAYGVFVHLPEKGISSWNCMIGGMAMHGKGEAAIELFKEMETKMVKPDNITFLNV 360

Query: 361 LSACAHSGLVENGQYYFCRFTQVYGIEPGTEHYGCMVDLYGRAGMLEEAMKVIDEMPMSP 420
           LSACAHSGLVE GQ+YF RFTQVYGIEP TEHYGCMVDLYGRAG+LEEAMKVIDEMPMSP
Sbjct: 361 LSACAHSGLVEKGQHYFNRFTQVYGIEPRTEHYGCMVDLYGRAGLLEEAMKVIDEMPMSP 420

Query: 421 DVGALGAFVGACKIHGNIELGEEIGKRVIELEPTNSGRYVLLGNLYAEAGRWESVAEVRK 480
           DVG LGAFVGACKIHGNIELGEE+GKRVIELEPTNSGRYVLLGNLYAEAGRWE VAEVRK
Sbjct: 421 DVGVLGAFVGACKIHGNIELGEEVGKRVIELEPTNSGRYVLLGNLYAEAGRWEGVAEVRK 480

Query: 481 LMNDREVKKAAGVSIIELEGVVYEFIAGGRVHPEAKEIYDKLNEMLECIRIEGYVAENET 540
           LMNDREVKKAAGVS+IELEGVVYEFIAGGR HPEAKEIYDKLNEMLECI+ EGY+AENE 
Sbjct: 481 LMNDREVKKAAGVSMIELEGVVYEFIAGGRNHPEAKEIYDKLNEMLECIKNEGYIAENEI 540

Query: 541 EEEKDNPVYYHSEKLAIAFGLLKTKAGEMLRITKNLRV 579
           EEEKDNPVYYHSEKLAIAFGLLKTKAG++LRITKNLRV
Sbjct: 541 EEEKDNPVYYHSEKLAIAFGLLKTKAGKILRITKNLRV 578

BLAST of CmUC03G066790.1 vs. ExPASy TrEMBL
Match: A0A6J1CAN4 (uncharacterized protein LOC111009921 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111009921 PE=3 SV=1)

HSP 1 Score: 984.9 bits (2545), Expect = 2.4e-283
Identity = 478/578 (82.70%), Positives = 516/578 (89.27%), Query Frame = 0

Query: 1   MASLLALHPTASLPNSPKFHPSPIFHALNSCSSMAELKQFQSQIIRFGLSTDNDAIGRLI 60
           M  LLAL PTAS+ NS + H SPI H L SCSSM+ELKQF SQIIR GLS DNDA+GRLI
Sbjct: 1   MPPLLALQPTASVINSSRVHTSPI-HGLQSCSSMSELKQFHSQIIRLGLSIDNDAMGRLI 60

Query: 61  KFCAVSKYGDLHYALLLFNSIPYPDAFIYNTLIRAYLQFQSPKSSLLLYLQMLHNSVFPN 120
           KFCAVSK GDL YALLLF +IPYPDAFIYNTLIR YLQ QS ++ +LLYLQMLH  V PN
Sbjct: 61  KFCAVSKNGDLDYALLLFKTIPYPDAFIYNTLIRGYLQQQSSRACILLYLQMLHKVVLPN 120

Query: 121 KFTFPSLIRACCIDNAVKEGKQIHAHVVKFGFTTDRFSNNNLIHMYANFQSLEEARRVFD 180
           KFTFPSLIRACCIDNA++EGKQ+HAHV+KFGF TD FS NNLIHMYANFQSLE+ARRVFD
Sbjct: 121 KFTFPSLIRACCIDNAIEEGKQVHAHVLKFGFRTDIFSQNNLIHMYANFQSLEDARRVFD 180

Query: 181 SIELPDLVAWTTLVTGYSQLGFVDEALRVFESMPEHNSVSWNAMISCFVQNNRFHEAFGL 240
            IELPD V WTTL+TGY+Q G VDEA +VFESMPEHNS SWNAMIS FVQNNRFHEAF L
Sbjct: 181 GIELPDAVTWTTLLTGYAQCGLVDEAFQVFESMPEHNSASWNAMISSFVQNNRFHEAFXL 240

Query: 241 FNRMRLEKVVLDKYVAASMLSACTGLGALEQGKWIHRYIKKNGIELDSKLATTLIDMYCK 300
           FNRMR EK+VLDKYVAASMLSACTGLGALEQGKWIHRYIKK+GIELDSKLATTLIDMYCK
Sbjct: 241 FNRMRSEKIVLDKYVAASMLSACTGLGALEQGKWIHRYIKKSGIELDSKLATTLIDMYCK 300

Query: 301 CGCLDCAFQVFTHLPEKGISSWNCMIGGMAMHGKGAAAIELFKEMETKMVKPDNITFLNV 360
           CGCLDCAF VFTHLPEKGISSWNCMIGGMAMHG+G AAIELFKEME KMV PDNITFLNV
Sbjct: 301 CGCLDCAFSVFTHLPEKGISSWNCMIGGMAMHGRGEAAIELFKEMEMKMVTPDNITFLNV 360

Query: 361 LSACAHSGLVENGQYYFCRFTQVYGIEPGTEHYGCMVDLYGRAGMLEEAMKVIDEMPMSP 420
           LSACAHSGLVE G++YF  F ++YGIEP TEH+GCMVDLYGRAGMLEEAMK+I EMPM+P
Sbjct: 361 LSACAHSGLVEEGRHYFRHFIELYGIEPRTEHFGCMVDLYGRAGMLEEAMKLISEMPMNP 420

Query: 421 DVGALGAFVGACKIHGNIELGEEIGKRVIELEPTNSGRYVLLGNLYAEAGRWESVAEVRK 480
           D G LGA VGACKIHG+++LGEEIG RVIELEPTNSGRYVLLGNLYA+AGRWE VAEVRK
Sbjct: 421 DAGVLGALVGACKIHGDVDLGEEIGLRVIELEPTNSGRYVLLGNLYAKAGRWEDVAEVRK 480

Query: 481 LMNDREVKKAAGVSIIELEGVVYEFIAGGRVHPEAKEIYDKLNEMLECIRIEGYVAENET 540
           LMNDREVKKA G S+IELEGVVYEFIAGGR HPEA +IY K+NEMLECIR  GYV ENE 
Sbjct: 481 LMNDREVKKAPGFSMIELEGVVYEFIAGGRAHPEADKIYVKVNEMLECIRYVGYVPENEI 540

Query: 541 EEEKDNPVYYHSEKLAIAFGLLKTKAGEMLRITKNLRV 579
           + +KDNP+YYHSEKLA+AFGLLKTKAGE LRITKNLR+
Sbjct: 541 DVDKDNPIYYHSEKLAVAFGLLKTKAGETLRITKNLRI 577

BLAST of CmUC03G066790.1 vs. ExPASy TrEMBL
Match: A0A6J1CAX3 (uncharacterized protein LOC111009921 isoform X2 OS=Momordica charantia OX=3673 GN=LOC111009921 PE=3 SV=1)

HSP 1 Score: 984.9 bits (2545), Expect = 2.4e-283
Identity = 478/578 (82.70%), Positives = 516/578 (89.27%), Query Frame = 0

Query: 1   MASLLALHPTASLPNSPKFHPSPIFHALNSCSSMAELKQFQSQIIRFGLSTDNDAIGRLI 60
           M  LLAL PTAS+ NS + H SPI H L SCSSM+ELKQF SQIIR GLS DNDA+GRLI
Sbjct: 1   MPPLLALQPTASVINSSRVHTSPI-HGLQSCSSMSELKQFHSQIIRLGLSIDNDAMGRLI 60

Query: 61  KFCAVSKYGDLHYALLLFNSIPYPDAFIYNTLIRAYLQFQSPKSSLLLYLQMLHNSVFPN 120
           KFCAVSK GDL YALLLF +IPYPDAFIYNTLIR YLQ QS ++ +LLYLQMLH  V PN
Sbjct: 61  KFCAVSKNGDLDYALLLFKTIPYPDAFIYNTLIRGYLQQQSSRACILLYLQMLHKVVLPN 120

Query: 121 KFTFPSLIRACCIDNAVKEGKQIHAHVVKFGFTTDRFSNNNLIHMYANFQSLEEARRVFD 180
           KFTFPSLIRACCIDNA++EGKQ+HAHV+KFGF TD FS NNLIHMYANFQSLE+ARRVFD
Sbjct: 121 KFTFPSLIRACCIDNAIEEGKQVHAHVLKFGFRTDIFSQNNLIHMYANFQSLEDARRVFD 180

Query: 181 SIELPDLVAWTTLVTGYSQLGFVDEALRVFESMPEHNSVSWNAMISCFVQNNRFHEAFGL 240
            IELPD V WTTL+TGY+Q G VDEA +VFESMPEHNS SWNAMIS FVQNNRFHEAF L
Sbjct: 181 GIELPDAVTWTTLLTGYAQCGLVDEAFQVFESMPEHNSASWNAMISSFVQNNRFHEAFXL 240

Query: 241 FNRMRLEKVVLDKYVAASMLSACTGLGALEQGKWIHRYIKKNGIELDSKLATTLIDMYCK 300
           FNRMR EK+VLDKYVAASMLSACTGLGALEQGKWIHRYIKK+GIELDSKLATTLIDMYCK
Sbjct: 241 FNRMRSEKIVLDKYVAASMLSACTGLGALEQGKWIHRYIKKSGIELDSKLATTLIDMYCK 300

Query: 301 CGCLDCAFQVFTHLPEKGISSWNCMIGGMAMHGKGAAAIELFKEMETKMVKPDNITFLNV 360
           CGCLDCAF VFTHLPEKGISSWNCMIGGMAMHG+G AAIELFKEME KMV PDNITFLNV
Sbjct: 301 CGCLDCAFSVFTHLPEKGISSWNCMIGGMAMHGRGEAAIELFKEMEMKMVTPDNITFLNV 360

Query: 361 LSACAHSGLVENGQYYFCRFTQVYGIEPGTEHYGCMVDLYGRAGMLEEAMKVIDEMPMSP 420
           LSACAHSGLVE G++YF  F ++YGIEP TEH+GCMVDLYGRAGMLEEAMK+I EMPM+P
Sbjct: 361 LSACAHSGLVEEGRHYFRHFIELYGIEPRTEHFGCMVDLYGRAGMLEEAMKLISEMPMNP 420

Query: 421 DVGALGAFVGACKIHGNIELGEEIGKRVIELEPTNSGRYVLLGNLYAEAGRWESVAEVRK 480
           D G LGA VGACKIHG+++LGEEIG RVIELEPTNSGRYVLLGNLYA+AGRWE VAEVRK
Sbjct: 421 DAGVLGALVGACKIHGDVDLGEEIGLRVIELEPTNSGRYVLLGNLYAKAGRWEDVAEVRK 480

Query: 481 LMNDREVKKAAGVSIIELEGVVYEFIAGGRVHPEAKEIYDKLNEMLECIRIEGYVAENET 540
           LMNDREVKKA G S+IELEGVVYEFIAGGR HPEA +IY K+NEMLECIR  GYV ENE 
Sbjct: 481 LMNDREVKKAPGFSMIELEGVVYEFIAGGRAHPEADKIYVKVNEMLECIRYVGYVPENEI 540

Query: 541 EEEKDNPVYYHSEKLAIAFGLLKTKAGEMLRITKNLRV 579
           + +KDNP+YYHSEKLA+AFGLLKTKAGE LRITKNLR+
Sbjct: 541 DVDKDNPIYYHSEKLAVAFGLLKTKAGETLRITKNLRI 577

BLAST of CmUC03G066790.1 vs. TAIR 10
Match: AT5G66520.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 480.3 bits (1235), Expect = 3.7e-135
Identity = 247/559 (44.19%), Positives = 349/559 (62.43%), Query Frame = 0

Query: 28  LNSCSSMAELKQFQSQIIRFGLSTDNDAIGRLIKFCAVSKYGD-LHYALLLFNSIPYPDA 87
           L  CS   ELKQ  +++++ GL  D+ AI + + FC  S   D L YA ++F+    PD 
Sbjct: 21  LQRCSKQEELKQIHARMLKTGLMQDSYAITKFLSFCISSTSSDFLPYAQIVFDGFDRPDT 80

Query: 88  FIYNTLIRAYLQFQSPKSSLLLYLQMLHNSVFPNKFTFPSLIRACCIDNAVKEGKQIHAH 147
           F++N +IR +     P+ SLLLY +ML +S   N +TFPSL++AC   +A +E  QIHA 
Sbjct: 81  FLWNLMIRGFSCSDEPERSLLLYQRMLCSSAPHNAYTFPSLLKACSNLSAFEETTQIHAQ 140

Query: 148 VVKFGFTTDRFSNNNLIHMYANFQSLEEARRVFDSIELPDLVAWTTLVTGYSQLGFVDEA 207
           + K G+  D ++ N+LI+ YA   + + A  +FD I  PD V+W +++ GY + G +D A
Sbjct: 141 ITKLGYENDVYAVNSLINSYAVTGNFKLAHLLFDRIPEPDDVSWNSVIKGYVKAGKMDIA 200

Query: 208 LRVFESMPEHNSVSWNAMISCFVQNNRFHEAFGLFNRMRLEKVVLDKYVAASMLSACTGL 267
           L +F  M E N++SW  MIS +VQ +   EA  LF+ M+   V  D    A+ LSAC  L
Sbjct: 201 LTLFRKMAEKNAISWTTMISGYVQADMNKEALQLFHEMQNSDVEPDNVSLANALSACAQL 260

Query: 268 GALEQGKWIHRYIKKNGIELDSKLATTLIDMYCKCGCLDCAFQVFTHLPEKGISSWNCMI 327
           GALEQGKWIH Y+ K  I +DS L   LIDMY KCG ++ A +VF ++ +K + +W  +I
Sbjct: 261 GALEQGKWIHSYLNKTRIRMDSVLGCVLIDMYAKCGEMEEALEVFKNIKKKSVQAWTALI 320

Query: 328 GGMAMHGKGAAAIELFKEMETKMVKPDNITFLNVLSACAHSGLVENGQYYFCRFTQVYGI 387
            G A HG G  AI  F EM+   +KP+ ITF  VL+AC+++GLVE G+  F    + Y +
Sbjct: 321 SGYAYHGHGREAISKFMEMQKMGIKPNVITFTAVLTACSYTGLVEEGKLIFYSMERDYNL 380

Query: 388 EPGTEHYGCMVDLYGRAGMLEEAMKVIDEMPMSPDVGALGAFVGACKIHGNIELGEEIGK 447
           +P  EHYGC+VDL GRAG+L+EA + I EMP+ P+    GA + AC+IH NIELGEEIG+
Sbjct: 381 KPTIEHYGCIVDLLGRAGLLDEAKRFIQEMPLKPNAVIWGALLKACRIHKNIELGEEIGE 440

Query: 448 RVIELEPTNSGRYVLLGNLYAEAGRWESVAEVRKLMNDREVKKAAGVSIIELEGVVYEFI 507
            +I ++P + GRYV   N++A   +W+  AE R+LM ++ V K  G S I LEG  +EF+
Sbjct: 441 ILIAIDPYHGGRYVHKANIHAMDKKWDKAAETRRLMKEQGVAKVPGCSTISLEGTTHEFL 500

Query: 508 AGGRVHPEAKEIYDKLNEMLECIRIEGYVAENE-------TEEEKDNPVYYHSEKLAIAF 567
           AG R HPE ++I  K   M   +   GYV E E        ++E++  V+ HSEKLAI +
Sbjct: 501 AGDRSHPEIEKIQSKWRIMRRKLEENGYVPELEEMLLDLVDDDEREAIVHQHSEKLAITY 560

Query: 568 GLLKTKAGEMLRITKNLRV 579
           GL+KTK G ++RI KNLRV
Sbjct: 561 GLIKTKPGTIIRIMKNLRV 579

BLAST of CmUC03G066790.1 vs. TAIR 10
Match: AT5G48910.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 460.3 bits (1183), Expect = 4.0e-129
Identity = 228/589 (38.71%), Positives = 356/589 (60.44%), Query Frame = 0

Query: 15  NSPKFHPSPIFHALNSCSSMAELKQFQSQIIRFGLSTDNDAIGRLIKFCAVS--KYGDLH 74
           +SP  HPS +F  +N+C ++ +L Q  +  I+ G   D  A   +++FCA S   + DL 
Sbjct: 17  SSPASHPSSLFPQINNCRTIRDLSQIHAVFIKSGQMRDTLAAAEILRFCATSDLHHRDLD 76

Query: 75  YALLLFNSIPYPDAFIYNTLIRAYLQFQSPKSSL---LLYLQMLHNSVFPNKFTFPSLIR 134
           YA  +FN +P  + F +NT+IR + +    K+ +   L Y  M    V PN+FTFPS+++
Sbjct: 77  YAHKIFNQMPQRNCFSWNTIIRGFSESDEDKALIAITLFYEMMSDEFVEPNRFTFPSVLK 136

Query: 135 ACCIDNAVKEGKQIHAHVVKFGFTTDRFSNNNLIHMYANFQSLEEARRVF-------DSI 194
           AC     ++EGKQIH   +K+GF  D F  +NL+ MY     +++AR +F       D +
Sbjct: 137 ACAKTGKIQEGKQIHGLALKYGFGGDEFVMSNLVRMYVMCGFMKDARVLFYKNIIEKDMV 196

Query: 195 ELPD-------LVAWTTLVTGYSQLGFVDEALRVFESMPEHNSVSWNAMISCFVQNNRFH 254
            + D       +V W  ++ GY +LG    A  +F+ M + + VSWN MIS +  N  F 
Sbjct: 197 VMTDRRKRDGEIVLWNVMIDGYMRLGDCKAARMLFDKMRQRSVVSWNTMISGYSLNGFFK 256

Query: 255 EAFGLFNRMRLEKVVLDKYVAASMLSACTGLGALEQGKWIHRYIKKNGIELDSKLATTLI 314
           +A  +F  M+   +  +     S+L A + LG+LE G+W+H Y + +GI +D  L + LI
Sbjct: 257 DAVEVFREMKKGDIRPNYVTLVSVLPAISRLGSLELGEWLHLYAEDSGIRIDDVLGSALI 316

Query: 315 DMYCKCGCLDCAFQVFTHLPEKGISSWNCMIGGMAMHGKGAAAIELFKEMETKMVKPDNI 374
           DMY KCG ++ A  VF  LP + + +W+ MI G A+HG+   AI+ F +M    V+P ++
Sbjct: 317 DMYSKCGIIEKAIHVFERLPRENVITWSAMINGFAIHGQAGDAIDCFCKMRQAGVRPSDV 376

Query: 375 TFLNVLSACAHSGLVENGQYYFCRFTQVYGIEPGTEHYGCMVDLYGRAGMLEEAMKVIDE 434
            ++N+L+AC+H GLVE G+ YF +   V G+EP  EHYGCMVDL GR+G+L+EA + I  
Sbjct: 377 AYINLLTACSHGGLVEEGRRYFSQMVSVDGLEPRIEHYGCMVDLLGRSGLLDEAEEFILN 436

Query: 435 MPMSPDVGALGAFVGACKIHGNIELGEEIGKRVIELEPTNSGRYVLLGNLYAEAGRWESV 494
           MP+ PD     A +GAC++ GN+E+G+ +   ++++ P +SG YV L N+YA  G W  V
Sbjct: 437 MPIKPDDVIWKALLGACRMQGNVEMGKRVANILMDMVPHDSGAYVALSNMYASQGNWSEV 496

Query: 495 AEVRKLMNDREVKKAAGVSIIELEGVVYEFIAGGRVHPEAKEIYDKLNEMLECIRIEGY- 554
           +E+R  M +++++K  G S+I+++GV++EF+     HP+AKEI   L E+ + +R+ GY 
Sbjct: 497 SEMRLRMKEKDIRKDPGCSLIDIDGVLHEFVVEDDSHPKAKEINSMLVEISDKLRLAGYR 556

Query: 555 -----VAENETEEEKDNPVYYHSEKLAIAFGLLKTKAGEMLRITKNLRV 579
                V  N  EE+K+N ++YHSEK+A AFGL+ T  G+ +RI KNLR+
Sbjct: 557 PITTQVLLNLEEEDKENVLHYHSEKIATAFGLISTSPGKPIRIVKNLRI 605

BLAST of CmUC03G066790.1 vs. TAIR 10
Match: AT1G08070.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 449.9 bits (1156), Expect = 5.3e-126
Identity = 247/680 (36.32%), Positives = 365/680 (53.68%), Query Frame = 0

Query: 9   PTASLPNSPKFHPSPIFHALNSCSSMAELKQFQSQIIRFGLSTDNDAIGRLIKFCAVSKY 68
           P++S P        P    L++C ++  L+   +Q+I+ GL   N A+ +LI+FC +S +
Sbjct: 21  PSSSDPPYDSIRNHPSLSLLHNCKTLQSLRIIHAQMIKIGLHNTNYALSKLIEFCILSPH 80

Query: 69  GD-LHYALLLFNSIPYPDAFIYNTLIRAYLQFQSPKSSLLLYLQMLHNSVFPNKFTFPSL 128
            + L YA+ +F +I  P+  I+NT+ R +     P S+L LY+ M+   + PN +TFP +
Sbjct: 81  FEGLPYAISVFKTIQEPNLLIWNTMFRGHALSSDPVSALKLYVCMISLGLLPNSYTFPFV 140

Query: 129 IRACCIDNAVKEGKQIHAHVVKFGFTTDRFSNNNLIHMYANFQSLEEARRVFDSIELPDL 188
           +++C    A KEG+QIH HV+K G   D + + +LI MY     LE+A +VFD     D+
Sbjct: 141 LKSCAKSKAFKEGQQIHGHVLKLGCDLDLYVHTSLISMYVQNGRLEDAHKVFDKSPHRDV 200

Query: 189 VAWTTLVTGYSQLGFVDEALRVFESMPEHNSVSWNAMISCFVQNNRFHEAFGLF------ 248
           V++T L+ GY+  G+++ A ++F+ +P  + VSWNAMIS + +   + EA  LF      
Sbjct: 201 VSYTALIKGYASRGYIENAQKLFDEIPVKDVVSWNAMISGYAETGNYKEALELFKDMMKT 260

Query: 249 --------------------------------------NRMRLEKVVLDKY--------- 308
                                                 + +++   ++D Y         
Sbjct: 261 NVRPDESTMVTVVSACAQSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSKCGELETA 320

Query: 309 ------------------------------------------------VAASMLSACTGL 368
                                                              S+L AC  L
Sbjct: 321 CGLFERLPYKDVISWNTLIGGYTHMNLYKEALLLFQEMLRSGETPNDVTMLSILPACAHL 380

Query: 369 GALEQGKWIHRYIKK--NGIELDSKLATTLIDMYCKCGCLDCAFQVFTHLPEKGISSWNC 428
           GA++ G+WIH YI K   G+   S L T+LIDMY KCG ++ A QVF  +  K +SSWN 
Sbjct: 381 GAIDIGRWIHVYIDKRLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSILHKSLSSWNA 440

Query: 429 MIGGMAMHGKGAAAIELFKEMETKMVKPDNITFLNVLSACAHSGLVENGQYYFCRFTQVY 488
           MI G AMHG+  A+ +LF  M    ++PD+ITF+ +LSAC+HSG+++ G++ F   TQ Y
Sbjct: 441 MIFGFAMHGRADASFDLFSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRHIFRTMTQDY 500

Query: 489 GIEPGTEHYGCMVDLYGRAGMLEEAMKVIDEMPMSPDVGALGAFVGACKIHGNIELGEEI 548
            + P  EHYGCM+DL G +G+ +EA ++I+ M M PD     + + ACK+HGN+ELGE  
Sbjct: 501 KMTPKLEHYGCMIDLLGHSGLFKEAEEMINMMEMEPDGVIWCSLLKACKMHGNVELGESF 560

Query: 549 GKRVIELEPTNSGRYVLLGNLYAEAGRWESVAEVRKLMNDREVKKAAGVSIIELEGVVYE 579
            + +I++EP N G YVLL N+YA AGRW  VA+ R L+ND+ +KK  G S IE++ VV+E
Sbjct: 561 AENLIKIEPENPGSYVLLSNIYASAGRWNEVAKTRALLNDKGMKKVPGCSSIEIDSVVHE 620

BLAST of CmUC03G066790.1 vs. TAIR 10
Match: AT2G29760.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 440.7 bits (1132), Expect = 3.2e-123
Identity = 244/688 (35.47%), Positives = 370/688 (53.78%), Query Frame = 0

Query: 5   LALHPTASLPNSPKFHPSPIFH--ALNSCSSMAELKQFQSQIIRFGLSTDNDAIGRLIKF 64
           L  HP  S PN P  +     H   +  C S+ +LKQ    +IR G  +D  +  +L   
Sbjct: 12  LPRHPNFSNPNQPTTNNERSRHISLIERCVSLRQLKQTHGHMIRTGTFSDPYSASKLFAM 71

Query: 65  CAVSKYGDLHYALLLFNSIPYPDAFIYNTLIRAYLQFQSPKSSLLLYLQMLHNS-VFPNK 124
            A+S +  L YA  +F+ IP P++F +NTLIRAY     P  S+  +L M+  S  +PNK
Sbjct: 72  AALSSFASLEYARKVFDEIPKPNSFAWNTLIRAYASGPDPVLSIWAFLDMVSESQCYPNK 131

Query: 125 FTFPSLIRACCIDNAVKEGKQIHAHVVKFGFTTDRFSNNNLIH----------------- 184
           +TFP LI+A    +++  G+ +H   VK    +D F  N+LIH                 
Sbjct: 132 YTFPFLIKAAAEVSSLSLGQSLHGMAVKSAVGSDVFVANSLIHCYFSCGDLDSACKVFTT 191

Query: 185 ------------------------------------------------------------ 244
                                                                       
Sbjct: 192 IKEKDVVSWNSMINGFVQKGSPDKALELFKKMESEDVKASHVTMVGVLSACAKIRNLEFG 251

Query: 245 ------------------------MYANFQSLEEARRVFDSIELPDLVAWTTLVTGYSQL 304
                                   MY    S+E+A+R+FD++E  D V WTT++ GY+  
Sbjct: 252 RQVCSYIEENRVNVNLTLANAMLDMYTKCGSIEDAKRLFDAMEEKDNVTWTTMLDGYAIS 311

Query: 305 GFVDEALRVFESMPEHNSVSWNAMISCFVQNNRFHEAFGLFNRMRLEK-VVLDKYVAASM 364
              + A  V  SMP+ + V+WNA+IS + QN + +EA  +F+ ++L+K + L++    S 
Sbjct: 312 EDYEAAREVLNSMPQKDIVAWNALISAYEQNGKPNEALIVFHELQLQKNMKLNQITLVST 371

Query: 365 LSACTGLGALEQGKWIHRYIKKNGIELDSKLATTLIDMYCKCGCLDCAFQVFTHLPEKGI 424
           LSAC  +GALE G+WIH YIKK+GI ++  + + LI MY KCG L+ + +VF  + ++ +
Sbjct: 372 LSACAQVGALELGRWIHSYIKKHGIRMNFHVTSALIHMYSKCGDLEKSREVFNSVEKRDV 431

Query: 425 SSWNCMIGGMAMHGKGAAAIELFKEMETKMVKPDNITFLNVLSACAHSGLVENGQYYFCR 484
             W+ MIGG+AMHG G  A+++F +M+   VKP+ +TF NV  AC+H+GLV+  +  F +
Sbjct: 432 FVWSAMIGGLAMHGCGNEAVDMFYKMQEANVKPNGVTFTNVFCACSHTGLVDEAESLFHQ 491

Query: 485 FTQVYGIEPGTEHYGCMVDLYGRAGMLEEAMKVIDEMPMSPDVGALGAFVGACKIHGNIE 544
               YGI P  +HY C+VD+ GR+G LE+A+K I+ MP+ P     GA +GACKIH N+ 
Sbjct: 492 MESNYGIVPEEKHYACIVDVLGRSGYLEKAVKFIEAMPIPPSTSVWGALLGACKIHANLN 551

Query: 545 LGEEIGKRVIELEPTNSGRYVLLGNLYAEAGRWESVAEVRKLMNDREVKKAAGVSIIELE 581
           L E    R++ELEP N G +VLL N+YA+ G+WE+V+E+RK M    +KK  G S IE++
Sbjct: 552 LAEMACTRLLELEPRNDGAHVLLSNIYAKLGKWENVSELRKHMRVTGLKKEPGCSSIEID 611

BLAST of CmUC03G066790.1 vs. TAIR 10
Match: AT5G06540.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 432.2 bits (1110), Expect = 1.2e-120
Identity = 226/568 (39.79%), Positives = 340/568 (59.86%), Query Frame = 0

Query: 23  PIFHALNSCSSMAELKQFQSQIIRFGLSTDNDAIGRLIKFCAVSKYGD-----LHYALLL 82
           P    L SCSS ++LK     ++R  L +D     RL+  C      +     L YA  +
Sbjct: 14  PKLALLQSCSSFSDLKIIHGFLLRTHLISDVFVASRLLALCVDDSTFNKPTNLLGYAYGI 73

Query: 83  FNSIPYPDAFIYNTLIRAYLQFQSPKSSLLLYLQMLHNSVFPNKFTFPSLIRACCIDNAV 142
           F+ I  P+ F++N LIR +     P  +   Y QML + ++P+  TFP LI+A      V
Sbjct: 74  FSQIQNPNLFVFNLLIRCFSTGAEPSKAFGFYTQMLKSRIWPDNITFPFLIKASSEMECV 133

Query: 143 KEGKQIHAHVVKFGFTTDRFSNNNLIHMYANFQSLEEARRVFDSIELPDLVAWTTLVTGY 202
             G+Q H+ +V+FGF  D +  N+L+HMYAN   +  A R+F  +   D+V+WT++V GY
Sbjct: 134 LVGEQTHSQIVRFGFQNDVYVENSLVHMYANCGFIAAAGRIFGQMGFRDVVSWTSMVAGY 193

Query: 203 SQLGFVDEALRVFESMPEHNSVSWNAMISCFVQNNRFHEAFGLFNRMRLEKVVLDKYVAA 262
            + G V+ A  +F+ MP  N  +W+ MI+ + +NN F +A  LF  M+ E VV ++ V  
Sbjct: 194 CKCGMVENAREMFDEMPHRNLFTWSIMINGYAKNNCFEKAIDLFEFMKREGVVANETVMV 253

Query: 263 SMLSACTGLGALEQGKWIHRYIKKNGIELDSKLATTLIDMYCKCGCLDCAFQVFTHLPEK 322
           S++S+C  LGALE G+  + Y+ K+ + ++  L T L+DM+ +CG ++ A  VF  LPE 
Sbjct: 254 SVISSCAHLGALEFGERAYEYVVKSHMTVNLILGTALVDMFWRCGDIEKAIHVFEGLPET 313

Query: 323 GISSWNCMIGGMAMHGKGAAAIELFKEMETKMVKPDNITFLNVLSACAHSGLVENGQYYF 382
              SW+ +I G+A+HG    A+  F +M +    P ++TF  VLSAC+H GLVE G   +
Sbjct: 314 DSLSWSSIIKGLAVHGHAHKAMHYFSQMISLGFIPRDVTFTAVLSACSHGGLVEKGLEIY 373

Query: 383 CRFTQVYGIEPGTEHYGCMVDLYGRAGMLEEAMKVIDEMPMSPDVGALGAFVGACKIHGN 442
               + +GIEP  EHYGC+VD+ GRAG L EA   I +M + P+   LGA +GACKI+ N
Sbjct: 374 ENMKKDHGIEPRLEHYGCIVDMLGRAGKLAEAENFILKMHVKPNAPILGALLGACKIYKN 433

Query: 443 IELGEEIGKRVIELEPTNSGRYVLLGNLYAEAGRWESVAEVRKLMNDREVKKAAGVSIIE 502
            E+ E +G  +I+++P +SG YVLL N+YA AG+W+ +  +R +M ++ VKK  G S+IE
Sbjct: 434 TEVAERVGNMLIKVKPEHSGYYVLLSNIYACAGQWDKIESLRDMMKEKLVKKPPGWSLIE 493

Query: 503 LEGVVYEFIAG-GRVHPEAKEIYDKLNEMLECIRIEGYVAE------NETEEEKDNPVYY 562
           ++G + +F  G  + HPE  +I  K  E+L  IR+ GY         +  EEEK++ ++ 
Sbjct: 494 IDGKINKFTMGDDQKHPEMGKIRRKWEEILGKIRLIGYKGNTGDAFFDVDEEEKESSIHM 553

Query: 563 HSEKLAIAFGLLKTKAGEMLRITKNLRV 579
           HSEKLAIA+G++KTK G  +RI KNLRV
Sbjct: 554 HSEKLAIAYGMMKTKPGTTIRIVKNLRV 581
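
The HSP header lines that precede each alignment above carry the bit score, raw score, E-value, and identity/positive counts. A minimal Python sketch for pulling those fields out and re-deriving the percentages is given here; the function name `parse_hsp_header` and the returned dictionary keys are illustrative assumptions, not part of any BLAST tool, and the pattern tolerates either spelling of the "Positives" label.

```python
import re

# Header text for the TAIR 10 hit AT5G06540.1, as shown on this page.
hsp_header = (
    "HSP 1 Score: 432.2 bits (1110), Expect = 1.2e-120\n"
    "Identity = 226/568 (39.79%), Positives = 340/568 (59.86%), Query Frame = 0"
)


def parse_hsp_header(text):
    """Extract scores and match counts from a two-line HSP header (hypothetical helper)."""
    score = re.search(r"Score:\s*([\d.]+) bits \((\d+)\)", text)
    expect = re.search(r"Expect\s*=\s*(\S+)", text)
    ident = re.search(r"Identity\s*=\s*(\d+)/(\d+)", text)
    # "Pos\w+" matches the label whether it is spelled "Positives" or "Postives".
    posit = re.search(r"Pos\w+\s*=\s*(\d+)/(\d+)", text)

    aligned_length = int(ident.group(2))
    return {
        "bit_score": float(score.group(1)),
        "raw_score": int(score.group(2)),
        "expect": float(expect.group(1)),
        "aligned_length": aligned_length,
        # Percentages are recomputed from the counts rather than read from the text.
        "identity_pct": round(100 * int(ident.group(1)) / aligned_length, 2),
        "positive_pct": round(100 * int(posit.group(1)) / int(posit.group(2)), 2),
    }


hsp = parse_hsp_header(hsp_header)
print(hsp)
```

Recomputing the percentages from the counts gives a quick consistency check against the values printed in the header.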

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
XP_038880689.1	0.0e+00	92.56	pentatricopeptide repeat-containing protein At5g66520-like isoform X1 [Benincasa...
TYK10244.1	0.0e+00	91.35	pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa]
KAE8653366.1	0.0e+00	90.83	hypothetical protein Csa_007505 [Cucumis sativus]
KAA0050892.1	0.0e+00	91.00	pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa]
XP_022138850.1	5.0e-283	82.70	uncharacterized protein LOC111009921 isoform X1 [Momordica charantia]

Match Name	E-value	Identity	Description
Q9FJY7	5.2e-134	44.19	Pentatricopeptide repeat-containing protein At5g66520 OS=Arabidopsis thaliana OX...
Q9FI80	5.6e-128	38.71	Pentatricopeptide repeat-containing protein At5g48910 OS=Arabidopsis thaliana OX...
Q9LN01	7.5e-125	36.32	Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidop...
O82380	4.6e-122	35.47	Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidop...
Q9FG16	1.6e-119	39.79	Pentatricopeptide repeat-containing protein At5g06540 OS=Arabidopsis thaliana OX...

Match Name	E-value	Identity	Description
A0A5D3CGC5	0.0e+00	91.35	Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946...
A0A0A0LWF1	0.0e+00	90.81	DYW_deaminase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_1G5694...
A0A5A7U4W9	0.0e+00	91.00	Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946...
A0A6J1CAN4	2.4e-283	82.70	uncharacterized protein LOC111009921 isoform X1 OS=Momordica charantia OX=3673 G...
A0A6J1CAX3	2.4e-283	82.70	uncharacterized protein LOC111009921 isoform X2 OS=Momordica charantia OX=3673 G...

Match Name	E-value	Identity	Description
AT5G66520.1	3.7e-135	44.19	Tetratricopeptide repeat (TPR)-like superfamily protein
AT5G48910.1	4.0e-129	38.71	Pentatricopeptide repeat (PPR) superfamily protein
AT1G08070.1	5.3e-126	36.32	Tetratricopeptide repeat (TPR)-like superfamily protein
AT2G29760.1	3.2e-123	35.47	Tetratricopeptide repeat (TPR)-like superfamily protein
AT5G06540.1	1.2e-120	39.79	Pentatricopeptide repeat (PPR) superfamily protein
InterPro
Analysis Name: InterPro Annotations of Watermelon (USVL531) v1
Date Performed: 2022-01-31
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR013087	Zinc finger C2H2-type	SMART	SM00355	c2h2final6	coord: 679..709, e-value: 96.0, score: 4.6; coord: 637..659, e-value: 0.017, score: 24.3; coord: 714..734, e-value: 58.0, score: 6.5
IPR013087	Zinc finger C2H2-type	PFAM	PF00096	zf-C2H2	coord: 637..659, e-value: 0.011, score: 16.1
IPR013087	Zinc finger C2H2-type	PROSITE	PS00028	ZINC_FINGER_C2H2_1	coord: 639..659
IPR013087	Zinc finger C2H2-type	PROSITE	PS50157	ZINC_FINGER_C2H2_2	coord: 714..742, score: 8.600363
IPR013087	Zinc finger C2H2-type	PROSITE	PS50157	ZINC_FINGER_C2H2_2	coord: 637..659, score: 10.948853
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	1.25.40.10	Tetratricopeptide repeat domain	coord: 172..278, e-value: 5.2E-24, score: 87.2; coord: 381..542, e-value: 2.4E-11, score: 45.7
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	1.25.40.10	Tetratricopeptide repeat domain	coord: 27..171, e-value: 1.9E-17, score: 65.2
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	1.25.40.10	Tetratricopeptide repeat domain	coord: 279..380, e-value: 7.1E-12, score: 47.2
IPR011990	Tetratricopeptide-like helical domain superfamily	SUPERFAMILY	48452	TPR-like	coord: 197..472
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR	coord: 392..417, e-value: 3.1E-5, score: 23.9
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 188..217, e-value: 4.2E-5, score: 21.4; coord: 292..320, e-value: 0.0016, score: 16.5; coord: 321..353, e-value: 7.6E-5, score: 20.6; coord: 393..416, e-value: 9.2E-5, score: 20.4; coord: 219..252, e-value: 1.3E-8, score: 32.5
IPR002885	Pentatricopeptide repeat	PFAM	PF13041	PPR_2	coord: 217..263, e-value: 1.4E-8, score: 34.8; coord: 84..132, e-value: 3.1E-8, score: 33.7; coord: 318..365, e-value: 9.2E-10, score: 38.6
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 186..220, score: 11.027125
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 318..352, score: 9.996763
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 389..419, score: 8.736214
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 85..119, score: 9.185627
None	No IPR available	GENE3D	3.30.160.60	Classic Zinc Finger	coord: 693..773, e-value: 4.7E-8, score: 34.9
None	No IPR available	GENE3D	3.30.160.60	Classic Zinc Finger	coord: 615..683, e-value: 2.2E-5, score: 26.1
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 587..623
None	No IPR available	PANTHER	PTHR47928:SF91	OS06G0231400 PROTEIN	coord: 26..593
None	No IPR available	PANTHER	PTHR47928	REPEAT-CONTAINING PROTEIN, PUTATIVE-RELATED	coord: 26..593
IPR032867	DYW domain	PFAM	PF14432	DYW_deaminase	coord: 492..578, e-value: 1.4E-13, score: 50.8
IPR036236	Zinc finger C2H2 superfamily	SUPERFAMILY	57667	beta-beta-alpha zinc fingers	coord: 636..734
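
Each InterPro hit above is located by a `coord: start..end` pair on the protein. As a rough illustration, the Python sketch below extracts those pairs and reports the length of each hit, using the four PROSITE PS51375 (PPR) hits as input. It assumes the coordinates are 1-based and inclusive, which this page does not state, and the variable names are illustrative.

```python
import re

# Coordinates of the four PROSITE PS51375 (PPR) hits listed in the InterPro table.
ppr_alignment = "coord: 186..220; coord: 318..352; coord: 389..419; coord: 85..119"

# Pull out every (start, end) pair and order the hits along the protein.
ppr_hits = sorted(
    (int(start), int(end))
    for start, end in re.findall(r"coord:\s*(\d+)\.\.(\d+)", ppr_alignment)
)

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
ppr_lengths = [end - start + 1 for start, end in ppr_hits]

for (start, end), length in zip(ppr_hits, ppr_lengths):
    print(f"PPR hit {start}..{end}: {length} residues")
```

Under that assumption, three of the four hits span 35 residues, the nominal length of a pentatricopeptide repeat.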

Relationships

This mRNA is a part of the following gene feature(s):

Feature Name	Unique Name	Type
CmUC03G066790	CmUC03G066790	gene


The following exon feature(s) are a part of this mRNA:

Feature Name	Unique Name	Type
CmUC03G066790.1-exon	CmUC03G066790.1-exon-CmU531Chr03:30874746..30876480	exon
CmUC03G066790.1-exon	CmUC03G066790.1-exon-CmU531Chr03:30881705..30881821	exon
CmUC03G066790.1-exon	CmUC03G066790.1-exon-CmU531Chr03:30884488..30884887	exon
CmUC03G066790.1-exon	CmUC03G066790.1-exon-CmU531Chr03:30887467..30887872	exon
CmUC03G066790.1-exon	CmUC03G066790.1-exon-CmU531Chr03:30888577..30889062	exon
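
The exon coordinates above can be checked against the sequence length of 3144 reported in the Overview. The Python sketch below sums the exon spans, assuming the `start..end` coordinates are 1-based and inclusive (the page does not say so explicitly); the helper name `span_length` is illustrative.

```python
# Exon coordinates on CmU531Chr03, copied from the exon feature table.
exons = [
    (30874746, 30876480),
    (30881705, 30881821),
    (30884488, 30884887),
    (30887467, 30887872),
    (30888577, 30889062),
]


def span_length(start, end):
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1


exon_lengths = [span_length(start, end) for start, end in exons]
mrna_length = sum(exon_lengths)

print(exon_lengths)
print(mrna_length)
```

Under the inclusive-coordinate assumption the five exons total 3144 bases, matching the Overview value.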


The following CDS feature(s) are a part of this mRNA:

Feature Name	Unique Name	Type
CmUC03G066790.1-cds	CmUC03G066790.1-cds-CmU531Chr03:30874746..30876480	CDS
CmUC03G066790.1-cds	CmUC03G066790.1-cds-CmU531Chr03:30881705..30881821	CDS
CmUC03G066790.1-cds	CmUC03G066790.1-cds-CmU531Chr03:30884488..30884887	CDS
CmUC03G066790.1-cds	CmUC03G066790.1-cds-CmU531Chr03:30887467..30887872	CDS
CmUC03G066790.1-cds	CmUC03G066790.1-cds-CmU531Chr03:30888577..30888921	CDS


The following three_prime_UTR feature(s) are a part of this mRNA:

Feature Name	Unique Name	Type
CmUC03G066790.1-three_prime_utr	CmUC03G066790.1-three_prime_utr-CmU531Chr03:30888942..30889062	three_prime_UTR


The following polypeptide feature(s) derives from this mRNA:

Feature Name	Unique Name	Type
CmUC03G066790.1	CmUC03G066790.1-protein	polypeptide


GO Annotation
GO Assignments
This mRNA is annotated with the following GO terms.
Category	Term Accession	Term Name
molecular_function	GO:0005515	protein binding
molecular_function	GO:0008270	zinc ion binding