CsGy3G021830 (gene) Cucumber (Gy14) v2.1

Overview
NameCsGy3G021830
Typegene
OrganismCucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
DescriptionPentatricopeptide repeat-containing protein
LocationGy14Chr3: 20100774 .. 20112571 (+)
RNA-Seq ExpressionCsGy3G021830
SyntenyCsGy3G021830
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRexonCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
CGGGAGATGAGAGAAATGAAGAACAACCAAAATGGGTGCATTTTGATTATTTTTAAGTGAATAACTCGACTTCTCAACTTCTATATTCTCCAGAATGTTTCAATTGAATGCGAAAGAGTAGCTGCGCCCTGCAGGGATGTTCTCATTTGTGACTACCAACGCTCTTAAACAGTTAACAAGAAGCATTGGCAACTTTGTAAGTCCTCCTTCAATCTCAATGCCTCTTCAACCACCATCTCGTCCTTCTTTCAAGCAAACTCTGCTTAATCGAATCAAAAACTGTTCCACCATAAACGAACTGCATGGTTTATGTGCTTCCATGATCAAAACTAATGCAATCCAAGATTGTTTTCTGGTGCATCACTTTATTAGCGCGTCTTTTGCTCTTAACTCTGTACATTACCCAGTTTTCGCCTTTACCCAGATGGAAAATCCTAATGTTTTTGTGTATAATGCGATGATTAAGGGATTTGTATACTGTGGGTACCCATTTCGTGCTCTACAATGTTATGTACATATGTTGGAAGAATCGAACGTCTTGCCAACTAGTTATACGTTTTCTTCGTTGGTTAAAGCTTGCACCTTTATGTGTGCTGTTGAGTTGGGACAGATGGTGCATTGTCACATTTGGAAGAAGGGGTTTGAATCCCATTTGTTTGTTCAAACTGCTTTGGTTGATTTTTACTCAAAGTTGGAGATACTTAGTGAGGCAAGAAAGGTGTTTGATGAAATGTGTGAAAGAGATGCTTTTGCATGGACTGCTATGGTTTCTGCTCTAGCTCGTGTTGGAGATATGGATTCCGCTAGGAAGTTGTTTGAGGAGATGCCTGAAAGGAATACTGCAACTTGGAATACCATGATTGACGGCTATGCAAGATTGGGAAATGTGGAGTCTGCAGAGCTTCTGTTCAATCAGATGCCAACCAAGGATATAATCTCCTGGACAACCATGATCACTTGTTATTCTCAGAACAAACAATATCAAGATGCATTGGCGATTTATAGTGAGATGAGATTGAATGGGATTATTCCCGATGAGGTAACAATGTCAACTGTTGCTTCAGCTTGCGCCCACATTGGAGCTCTTGAACTAGGAAAAGAGATACATCATTATGTAATGTCTCAGGGGCTTAATCTTGACGTTTATATTGGTTCTGCATTAGTTGATATGTATGCTAAGTGTGGGAGTTTAGATTTGTCTCTTTTGATTTTCTTCAAATTGACAGATAAAAATTTATATTGCTGGAATGCAGTAATTGAAGGACTTGCTGTTCATGGTTATGCGGAGAAGGCTTTGAGGATGTTCGCTATCATGGAGAGGGAGAAGATCATGCCCAATGGTGTTACCTTTATTAGTATATTAAGTGCTTGCACACATGCTGGGTTAGTTGACGAAGGCAGGAGTAGATTTTTAAGCATGACTCGTGATTACGACATTCGTCCTGATATCAGACACTATGGTTGCATGGTTGATATGTTAAGTAAATCAGGATATCTCAACGAAGCGTTAGAATTGATTAAAAGTATGGAATTTGAACCAAACTCTATTATTTGGGGAGCCTTGTTGAATGGGTGCAAACTTCATGGAAACTGTGAGATCGCTGAAGATGCTGTTGAACAGTTGATGATTTTGGAACCCATGAATAGTGGGCATTACAATCTTTTGGTCAGCATGTATGCTGAAGAAAAGGATTGGATGGAGGTTGTGCATATTCGATCAATGATGAAAGAAAAAGGAGTAGAAAAGAAATATCCTGGCTCAAGTTGGATTGAATTGGAAGGGACAATTCATCAGTTTTCAGCTTCAGCTGATTCTCACCCTGATTCTGACAAAATATACTTCATACTGACAGAATTAGATGGACAACTGAAGCTAGCTGGTTACATACTCGAGCCTTCAGTATGCAGTACTGGTTTGCTTTTTTCAGAGGAAATTTGATCAACATTAATTGAGGCCATACCGTCATAGTGAGATCGAATGTTATTTGCATATCAATCATTTCAGCTTCATTGAATATGGTATATTGAAGTGAAAATTCTCGAGGTCAAGTGCTAAATGACAAAGCTGGGCTACTATAGGAGTTCATAATTATTCAGATCAAGGCTCAAGTTAGCCTCATCAAGAGCCATGGTAATCTTAAGTTTAATCTTACGTTTTGCTAGAATGTCTTCTCATTTTAAGTAAATTCTCAATATTTGTTATAATGAAGTGAATGAGTACTTCTTAGTCTTTTACATGCACAATTGAGCATCTTGATGTTGGCGTTCACCGGAACATAATTCACTTGAATCCTTCTTTCATGTGCGCTCTTCGTACTCCCCATTCTACATGCACAATTGAGAATCTTGATGATGCATTTCTTTTCAATGTTTGGCTTGTATCATTGAGGCTACCTATTCATATTAAGAATATATTTGATGATTAAATTTAATCTACAGTCGAAAAAATGAAGTGAATGGTTACTTCTTAGTTTTTTACGTGCACGATTGAGCATCTTGATGCATTTTCTTTCTCTGTTTAACTTGTCTAATTGAGGCTATCTATTCATATTAAGAATATATTGACCGATTATATCTAACCAACTCTTTTAGTGCTTAAGTTTTTAGATTAGTGATGGTTGAAACTCTATGATATGGTTAAAGTTGTTACCAATCTCAACATCTAAACTCAAACTCGTGCAATAGTTTGACTAATGAGGCCCAGTAACTATGTTATTGATATTGAAATAAAAGAGGAACAACTCTAAAAATCTTTCAACTTAGGCATGTAAAGTTCGAATTCCCCACTCTTGTTGATCGAAGTAGATTCAATCATTTTCTTTATGCTTTCAGTGTACTCGCATTCATTTTGTGATTTTTGCTCCAGCTATCAAAGAAAGGGCAGGGACTAAATTTATAACGAAATAACGTACCACTATTGTACAACCGGAGCTGACTCAGTGAAGTGATTTGATCGAAGAACCCAAGCTAACAGGATATCCTGTAACAGGTCGGTCCTTTTTCCTTCTTTGTACAATACTTTCATATGTTTCAAACACACATGACCATATTGTCTGCATTATGCACTTCTTTAGAAACAAGCTAGGTTTTGAGGAAAATGGAAACCAAGACTTATCAGAATCTTCACGTATAATCTCATTCATGACAATAAATTGCCAATACTGTCTGTACCGTGTGATAATATTTCCAAAACATCCAACTTGTCCCACAGATGGTTTCTTAGGTTGGAACGGCGTGGAGAAAAGAAAACTTCTAGTTTCTTCTTATGTTGTTTTAATTTTCATTTTCTAAACGAAGAACATGCCTAAGTCTTCTCTTTCCTACTTCAATGGTGTTCTTAATCCATCAATATGGCATTGGAGAATGCTAAACACTGGCTAAATTAAATGGAAGTGCGACTCTCTTCTTCAAGCACTAGCCATTTTTATTATTATGGAAGTAGCTTGCTTTGTTCTGACAAGATGCATGGTTTAAGGGATAGAGGAAATTCTTCAAACCTTGTAGGCTATGCATCTGCTGTAAGATATGGTTCTTAGGGAACATTATACACGTTTCTATCTTAGTCTCTTTTCATGTACTTAATGAAAAATTTCGTATCATTTCAAGGTTCAACGATAGTGATTGCTATTATTCCACTATTTTGACTTTCCCTTACCAAGTTGGTGGGGGGAAATCCTACCCAAATCTCGCATTAGTCTTTCTCATACTCTGCCAAAGTTCACTATTCTGAAACTTCATCTCAAACAGAAGTATGCTTTCTGACCTTGCTATAAACTTACATCCACAAAGGTCATAAAAATTTGTAACATAGAGGTGGATTATTATTCAGGTTGATTATAGAGATAGAAGTGGCCTCTGTTTACCAACAAGTTGGTATTGTATTTTCTTGAGCATTTTATCACAATAAGAAAGGAAAAAAAAAATAATCAAAAGAAACTTATCTAGAGACGCCCAAATTCATAGTTTCTTCTAAGAGGACTTCATGCACTTGGCAATGGACCAGGATGGGGTTAATATTTAATCACAAAATATTTTTACGGTCCAAAATTTTTTTAAAAATTGATCTGAAGTGCTTACTACTTCCAGTTCAATTCACTAGACCCTTTTATAGTTCCCTTGTCTCTGAGAGTAACAATAAAGTCTGTTGTTGTTCTTTAGCGGTGTAAACCTTTTCTCCATCCAACAATTGTAGTCTAAGTAGTCTTTTCCTTTTAGTTTAGAACTTGACGCCCTCGTTATTCTCATATCTCATACCATCTATGAAATTGTTTCTTATTTTCAAAGAAGGTCCATGCTCGAGATGAAAACTTCTTGAAAAAGTCAGAACAAAAAGCCTTGATTTGTCTAATAGTTTAACTGAGGTTAAATTTATACAAGTATAAACTTATCTAAGCTTATAATCGATTTGGATGAAACTAATGGACGAATATAGAGCTAACGATTTAAACGTGGCTGCACAAACTAACAATTAGAAGACGATATAAAGCTTGAGACTTTAAGATTAAAAGTATACACCTCCCCTCCCCAAAGCTTAGAGATAAAAATTGATCCCCACACCGAAAGCCCTCTCGTCCTTCTGAATTTACTTCTTATTGACCATCCGTTTAAGAATCATCATCAAAACCTACTCTTGACTTATATAGAATGAGCGGAACCATCGTATATTTAAAGATAAAGAATCAGATTTCACACTTTTTTTTATCACTTTTTGTATACATCTCTATCATGGTGTAAATTCACTTCTAACTTTTGTGACTACAATCTCACTTCTCTTAAATCTCAATGGAATATCATGATGAAATCTTTCTTGGCACTTGATGCCCTTTTGTAATTTCATTTTATCAATGAAATTTGATTGACTGTTTCTAAAGTCCCCCACACCAAAAGAGGAAAACTATTGCCAGTACAAGGGAAACAAAACAGAGGAAAATGAACCAAGAGACTCGAATTTTGTAAAGTGACAACTGACCAGGATAGGAAAAAGAAGAAAGTTTCATAAAACTGTTAAACAAATGATGATTAAAAAAATAGGCTCTACAACCATTAGGGAGGGGCTTGACCGATCAAAGCTCGAGTTCAACATCAACTGTATTCATGAAAAGTCCAAAAAAAAAAAAGAAGAAAAATCTAGAAGCCGAATAAGTTGATAAGGATGTCGCTATGACAACAATAGGTTTACCGACATAGAGGAGACATTACTCAAACTGGAAGCCATTATTACTTTTTCTTTAAAACATTAAAAACCCCATCATCTGACCAAAAATTTAGACTTTTTTTTGGAGGGGGTGAAGAAAGAAACACCAAACAAAAACAAACAAGACTGAAAAACCTGAAGCCAAACTCAGTTGAAAGATGGAACTTCTTTTAAAAGGTTAAATCTTTTCTCCTTGTGTCTTTTTAAATGGGATACAACCAACTTTTGATGGCCTGCAGTGTACAGACTTTCTCAGCCTCATTCATTATGGAACATCTAATTAATTTTCTAAGTTGATTGGGATGAGTAGTTTGTGTGATTCAAAAGTAAAAGTACTGATAAGAAATTTAGAATAATGCTTCCAATTGTGTCAGTTTTTGTGCTGAAGCCTGAGAATAGAAAGAGTTTTTGTAGTGTTGTGCGTTTGTGGGGCTTTCTTGCATCCTCGAAAACCTAGAAAGGAATTCCATCTTCTTAAAATTATAAAACAATAGTTCAATTATATATTTATTAAACAAATAACTGTACCTTGCAATATTACCCTTTTTAACTCTTCACGAACATTTCAAAACTATTGGAGTGAAATAGTTTTGAAAACTACGGGAGTAGTTGTACTTCAGAATTTTTCAGAATTATGAAAGTTGAGACATGAAATTTTGTGACAGTTCTATTGTAACACTCTTGTTTTAGGGTCCAAACTAGGATTCAAAATCCGAACTTGGATTCAAAATCGAACTTGGCACCTCATGATCTCATATTCTTCACAAACCAATATGAGTCCTTTTAGCATGGTTTGTTCTCACTCACACGCTTCGTAGAAAATTTATAGGAAGACACCCAATGTAGAATTGCTTAAAGTTAAGCACGTTTAATTTTGAGATTCCTATATTTGAGTCTTTCTTAACTATAGGTATAGGTTAGTAGCTTTCAATTCTTTTAACTATACTTTTATAGCCTAAAGATCCCTCTCATTCAAATGTGATCTCCGTTAATTTATCATGTATTCCTTCTAAACTCGAATGCTATAGTGATATTGCAACTTTTGAAAGAATGGTAGTACTTTTTTAAAGAAAAAGTGAAAAGTGTGTAATATTTAGAAGTTAGCCAAAAACAAAACAGCCTCTAAACCAAACCAAATTTGACTAGAGTTCACACGCTAAGACAAAAGCTAGGGAGGGAACATATTATTATTGCTACACAACAAAGTTAATGTTTTGGTCAAACCTTCTTCCCAATAAGGTTAAGTTGTGTGAGGTTTGGTCCCTTAGTTTCTCCAAGGTTGTCTCCACCTGTGATTTATCTTTCTTCTCTTTCAAGCTCAAACCACTCCCAAATATTTCACTAAATGATTGCCATAATCAATCCCTTGGTGGGGTTCGACTTACTTTGTCTTCAACTACTCTTTTCTCCACATATGAAGATACAAAAAAGTGTCATTTTAGATGTCATTTCAAATACTTGAACTCACAAACTATTGTTTTAATCATCAACTAATTTCTATATGATGGGGTTTGAGATACATCAAATAAATGTAATTTAATTTGATGTATTCAGAGTCCATTCCTTGGACGTATTACTACAGTAGGATTTTAGAGAAATTCAAATAGGATAAAATTTGTAAGTTACTTCTTTCCAAACTTGCAGGGGACCTTTTTTACTTTTTATAAATGTTTCAAAACTATCATTTTAAAAAAATTAAAATGAATATTGTCAGAACGTTGGAAGAAAATTAAGATCTAGAAGGGTGGGGAGGCAACCTAGAATAATATGACAGACTCAATTTTTATTTTGCATCGCTAGGTTGTCACAAGTTTTAACTATCATTTATCATTCAAGAAACTGACATGTTTCTACTGTAAATGTTCAATTTTGAAATAGTTATGGAAGGTCAAGGAAATAATGTAACTTTTTAGAGTAAGAGTATTTTTTAAAAAATGAATGACAAATAAGTACACAAATGGTTAAAACAATAATTTATATTTGTCATACAATAAAAACACCCCTGGGGAACAAAAACTACACAATGATTGCAAGATAATAGAAACATGTGAAAGGGGAATTGATGAGGATAATTTTAAAATTTTCTACTTAACTTGCTCCTCACTTTTTCAAAATTAGGTTTTATTTGCCTTTTCGAGTAGCTAATGAAAATGAGGAAATGATAAGGTCTAAGTTTTCTTATAATTTATGAATATTATTTTATTTTATCTTTGTAATATTAGAAACTATTTTTATATACTTCTTTGGTAATGTACTTTAGTTGATTTTGATTTATTTTGGTTCTGAATGCTTTTAAAATGTTTATCTTAATCTTTTTTTTTTAACTTTTCTTCATTTTAGTTCTTGTACTTTCAATATTTGTTCATTTCATTGTGCTTTCAAAATGTCGATTTTTATACTTTTAAAATGTTTATCTTAGTTTTTGTTTTTTTTAATTTTTGTTCATTTTAGTTCTTGTATTTTCAATATTTGTTCATTTTATCGTGCTTTCAAAATGTTAATTTTAGTTTTTGTAATTTTAAGAAAATGACCATTTTTGTCTCTCATTTTCATTTGCATTTGAATTTTTCGGTTGAAACTTTGCATTACATCTTATTCACTAAGGTAGTAATTTAGTATGAACTTTGATTTGTAATGATTTTGTCTCAAACTTTGCATTACATCTTATTCACTAAAGTAGTAATTTAGTATGAACTTTGATTTGTAATGATTTTGTCTCTGTAGTTTTTTATTTTGCAATAATTTAATCCTTGGACATAAGTATGTAACAATTTTAGTTATTGTAAATTCAAACATAACAATTGAGTCTAACGCAAACAAAAAAAATCAAAATTAGAGGTTAATTTTTCTTATTTTATATTATAAGCTTTGTATTTTTGTAAAAATATTGAGTTTATAGTTACTTTGTTTATTCATGAAGTTTTTATACTAATTTTTACTATATGAACTAAATTGTTACGAATGACAAGGTATAAGGTCAAAATTGTTACATTGTAAAATTCATGAACTTAGTTGTTACAAATATGAAGGATTAAATTGTTACAAATCAAAGTTTAAGTACAAAGTTGCTTTTACAAAAGTTTAGGTTTTTTAACCGGTTTGTATTAAAGATTTTTCGTTATACATATAATGTATGTTACAATTTTATAAATAGAGATCAAAATGGTCACCTTTTAAATTATGGAAACTCAAAATAGACATTTCAAAAGTAAATGGAGCAAAGTGAATTGATGGTGAAAGATCAAAGTAGTATTTAAATTTTAAAGTAGTCCTATTTGCTTTATTTCCTTTTAATAAAAAATATTGTAGATTAATTTTTAAATATAGCTGTGATTGAAGATCCACATGCGTTTACATATTGTATTATGTAATATTATAGAATATAACTGATTTGATTTTTCTTTAAAGAAAACACTGAACTAAATATATTTCATAACATCTCCAATGTACACTATTACAAAATTGAAATTAAAATTTCTTTTCAATGTAACAATCTTGAAATAATAATAATTGAAGTGACAGCCTCAAAGAAATGATCACATATCAAACTCCCTCATATCCTGGTTCAAACTTTGAAACATTGCCTCCTATTCAAACTACATCCAATATTTTTAAATTAACTTCTTAAGTCATATTTAATATTCAATATTCTTCAATTAAAATCTACATTTCTGATTATATTTATCAATCATTGAACAATATCATTTTCATTCTTCTCTTTTTCCCTTGACCAATCTAAACCAATGTCATTATTGTCTTTACTCCATAGATTTTTTTTTTTTATTATATTGCATACATCAATTTGAACTTATTAATTTTTAAAATAAAAGTTTATCATATACTTCTTTTACAGTGAAACTGAATTTCATCTTGAGTATGTTGATTAAAAATATGAATTTTGGATAAGATTTTATTTTTTAATTTTAAAGATGGAGTTACGTAAAACTTTTAAAAGTGTAAGTACCAACTTAAACTAATAGTAAAGCTTGTAGTCTATAAACCTTATTAATATTTATTTTAAAATGATAGGTTGGCTTTTGAGAATTTGCAATTTTTTTATATTCGATTGGGTCAAAGAACTAAGAACTCATAAACGGACAATTTGTAATGTTGTTAACAACTTTCAAATTAAATAAAATGATAAATATATTTCAAATAATTGCTTCAATACCCTCTTAATTTTGTAGGCCCCAATAGATAATTATTTGGTTTTGTTTTTTTCTTTTCTAATTAAACCTATTTTATACGTATTTCTTAAATATAATTATTAATTTTTTATTTAAATTTCAAACACAAAATTAACTTTTGAAATCTGTTTTTTTTTTAATTCTTGAAATCTGTCATTCACAGTTATTTTTCTAAAAAATATTGCTACATGCTTAATAATTATTATAAAAATTGATACCTAAAAAAATGAAAAACAGAAATTTTACTAAATTGATCTTAATGTTTTTAAATTTAGAATTTGATGTTGAATTTTAACTTAGGGATATTTTTAAAAATAACAAAATAAACTAAAATAATTAAAATTTTGGATTCACTATAGGTGATAGAATTTGTAAAATTGATTTTTTAAAAACCTTAAATAAGAAAAGTGAGAAAAGTAGGATTTGGGTGTTAGATACTTAAAGTTTGAATGATGGAAATGATTCAAATTTAGTATGGTAAATGAAAGGAATGAGAAAAGAGAGAGAGGAAAATGAAATTTGAACCAAAATGTGATTTTGTGAAGGTATATAAATTAGTATAAGAAAAAGAAGTGGATGCATTTAATAATGTAGATTATATAAAGAAAGAGAGAAGAAAAGAAAGGGTAGCAATCATCCAAACACGTGCAACGGCCCCCATGTGCAACATTGGTGTCGGCTTGACCATTCTAATGTTGTTTTACTTACCATAAATTTATAATATCAGCATCAACTTTCCAACCATTTTTCTTTTTCTCTTTACTCTTATTGTTAGACCTAATGCTATTTTATATTTCAATCTCTCTTTAATTATCAAACCTTTAACCTATTTACTATTTCTAAAATCTAAACCACTTTTTCTTTAACAATGTTGCCTTTGACTAATAATGTCCCCAATCAATTCTGGAAAATCATTAACTAATTAATCTCTTTTCTTTTTTCTTTTTTCTTTACTTCCTTATGATTGGTCAAACAAAAGCCTTTGGGTTCTTTTGGACCATCTCCTCTTACCTTTCCTTATCTATTATTTTAATTCTAGGAAGTAGATTACATGTTAGTTTTTTTCATTTTGCTTTCAAATTTTTATTCATTCTTCTTCATATTTAACAACTCACTAATTCTAACTTTTTCAACTTACTTTGCCTCTTAAATCTAATTTTTTTTTAATTTTAGAAAATATAAAATATTAATAGTTTAAGACTTAGTAAATATAATATTAAAAATTTAATAGTTTTAGTATGATACTTTTAGGGAGCTATAAATGAATTTATATTCAGATGATATGGATATAAAAGAATTTTAGAAACTATAGTACATATCAACCATGGAAAAAATTATACTAAAATATTACAATTATGTAAAGTATTTATGGAAATAAGAATAATGCAAATTTGCTCTCTCTCCTTGGCGAGTCTCATTTCATTTTCTTATTCTCTCTCATTTCTTACTTTTTTTTTTTTTCATTTTTCATATCAATTTTAATTTCTTTATTCTATTTGAATTTTATTTTTCTTTCTTATTTTAATAGTAGGTAAATTTTAATTTTTTATTTCTTAGTTTAATACTATTTTTTCTTAATAAAGAAAAACCTATTTTTTATGGAAAAGTTTGTTTAATTTTATTGTGATTTAAACTTACTAAAAATGTGTGTTCTTTATCTTTTTAAATGGATGAAATTTCCTTCAATTTTTTTTTTCAATTCATGTTTACATTAATTTTTTAATTTATTTTTACGTTTGTTACATGACATGATAGATTTGTAAAGATCAATCATTAATCGAAATTTCATTACTTGAGTGTGCAATTTTATGTTAGTAATTCGTTCGGTTTATATTAATGTTATTCTCTATGCATGTTTACCTTTGTAATCTATTTGTTTTGTATTAATTAAAGACGGTAAATTTCTAGTAATTGCTTGTGATTGAAAAATAATTAAATATAAAGTAAAATATGTAATACAATAATTTAACAAATTCCATTTAATGTAAATATAAAGTAAAATATGTAATACAATAATTTAACAATTTTCATTCAATGGAAGTAAAATCAATGTGTTCCACAACTTGGACATCGTCAATTTTTAGTTGGCTGTCCTCATCGACTTTGCACTCTTTGTTGCTTTAGTCATTCCTGTTTTGAAGTATGACAATTGACCGATTCCTAAATTATAATATCTCGCTTCAAATATTGGCATCATCATCATCGAACTAATGTAACAAATCTCGCTCTGTGTTTGTGTCAGATGAGGCATTTTTCTTCCACATGCTGGTTAACTTCTTCCAATTCATCAGGCTCTTGCACTCGAGTACGTCTTCTTTGAACTGGTGTAGCTACAATAGGTACATCATACATATAGTGTATGTTCTTCATGTACAACTTGTTATCTTATTGTTGCATATATCTAGGACCTGTTGTGG

mRNA sequence

CGGGAGATGAGAGAAATGAAGAACAACCAAAATGGGTGCATTTTGATTATTTTTAAGTGAATAACTCGACTTCTCAACTTCTATATTCTCCAGAATGTTTCAATTGAATGCGAAAGAGTAGCTGCGCCCTGCAGGGATGTTCTCATTTGTGACTACCAACGCTCTTAAACAGTTAACAAGAAGCATTGGCAACTTTGTAAGTCCTCCTTCAATCTCAATGCCTCTTCAACCACCATCTCGTCCTTCTTTCAAGCAAACTCTGCTTAATCGAATCAAAAACTGTTCCACCATAAACGAACTGCATGGTTTATGTGCTTCCATGATCAAAACTAATGCAATCCAAGATTGTTTTCTGGTGCATCACTTTATTAGCGCGTCTTTTGCTCTTAACTCTGTACATTACCCAGTTTTCGCCTTTACCCAGATGGAAAATCCTAATGTTTTTGTGTATAATGCGATGATTAAGGGATTTGTATACTGTGGGTACCCATTTCGTGCTCTACAATGTTATGTACATATGTTGGAAGAATCGAACGTCTTGCCAACTAGTTATACGTTTTCTTCGTTGGTTAAAGCTTGCACCTTTATGTGTGCTGTTGAGTTGGGACAGATGGTGCATTGTCACATTTGGAAGAAGGGGTTTGAATCCCATTTGTTTGTTCAAACTGCTTTGGTTGATTTTTACTCAAAGTTGGAGATACTTAGTGAGGCAAGAAAGGTGTTTGATGAAATGTGTGAAAGAGATGCTTTTGCATGGACTGCTATGGTTTCTGCTCTAGCTCGTGTTGGAGATATGGATTCCGCTAGGAAGTTGTTTGAGGAGATGCCTGAAAGGAATACTGCAACTTGGAATACCATGATTGACGGCTATGCAAGATTGGGAAATGTGGAGTCTGCAGAGCTTCTGTTCAATCAGATGCCAACCAAGGATATAATCTCCTGGACAACCATGATCACTTGTTATTCTCAGAACAAACAATATCAAGATGCATTGGCGATTTATAGTGAGATGAGATTGAATGGGATTATTCCCGATGAGGTAACAATGTCAACTGTTGCTTCAGCTTGCGCCCACATTGGAGCTCTTGAACTAGGAAAAGAGATACATCATTATGTAATGTCTCAGGGGCTTAATCTTGACGTTTATATTGGTTCTGCATTAGTTGATATGTATGCTAAGTGTGGGAGTTTAGATTTGTCTCTTTTGATTTTCTTCAAATTGACAGATAAAAATTTATATTGCTGGAATGCAGTAATTGAAGGACTTGCTGTTCATGGTTATGCGGAGAAGGCTTTGAGGATGTTCGCTATCATGGAGAGGGAGAAGATCATGCCCAATGGTGTTACCTTTATTAGTATATTAAGTGCTTGCACACATGCTGGGTTAGTTGACGAAGGCAGGAGTAGATTTTTAAGCATGACTCGTGATTACGACATTCGTCCTGATATCAGACACTATGGTTGCATGGTTGATATGTTAAGTAAATCAGGATATCTCAACGAAGCGTTAGAATTGATTAAAAGTATGGAATTTGAACCAAACTCTATTATTTGGGGAGCCTTGTTGAATGGGTGCAAACTTCATGGAAACTGTGAGATCGCTGAAGATGCTGTTGAACAGTTGATGATTTTGGAACCCATGAATAGTGGGCATTACAATCTTTTGGTCAGCATGTATGCTGAAGAAAAGGATTGGATGGAGGTTGTGCATATTCGATCAATGATGAAAGAAAAAGGAGTAGAAAAGAAATATCCTGGCTCAAGTTGGATTGAATTGGAAGGGACAATTCATCAGTTTTCAGCTTCAGCTGATTCTCACCCTGATTCTGACAAAATATACTTCATACTGACAGAATTAGATGGACAACTGAAGCTAGCTGGTTACATACTCGAGCCTTCAGTATGCAGTACTGGTTTGCTTTTTTCAGAGGAAATTTGATCAACATTAATTGAGGCCATACCGTCATAGTGAGATCGAATGTTATTTGCATATCAATCATTTCAGCTTCATTGAATATGGTATATTGAAGTGAAAATTCTCGAGGTCAAGTGCTAAATGACAAAGCTGGGCTACTATAGGAGTTCATAATTATTCAGATCAAGGCTCAAGTTAGCCTCATCAAGAGCCATGCTATCAAAGAAAGGGCAGGGACTAAATTTATAACGAAATAACGTACCACTATTGTACAACCGGAGCTGACTCAGTGAAGTGATTTGATCGAAGAACCCAAGCTAACAGGATATCCTGTAACAGATGAGGCATTTTTCTTCCACATGCTGGTTAACTTCTTCCAATTCATCAGGCTCTTGCACTCGAGTACGTCTTCTTTGAACTGGTGTAGCTACAATAGGTACATCATACATATAGTGTATGTTCTTCATGTACAACTTGTTATCTTATTGTTGCATATATCTAGGACCTGTTGTGG

Coding sequence (CDS)

ATGTTCTCATTTGTGACTACCAACGCTCTTAAACAGTTAACAAGAAGCATTGGCAACTTTGTAAGTCCTCCTTCAATCTCAATGCCTCTTCAACCACCATCTCGTCCTTCTTTCAAGCAAACTCTGCTTAATCGAATCAAAAACTGTTCCACCATAAACGAACTGCATGGTTTATGTGCTTCCATGATCAAAACTAATGCAATCCAAGATTGTTTTCTGGTGCATCACTTTATTAGCGCGTCTTTTGCTCTTAACTCTGTACATTACCCAGTTTTCGCCTTTACCCAGATGGAAAATCCTAATGTTTTTGTGTATAATGCGATGATTAAGGGATTTGTATACTGTGGGTACCCATTTCGTGCTCTACAATGTTATGTACATATGTTGGAAGAATCGAACGTCTTGCCAACTAGTTATACGTTTTCTTCGTTGGTTAAAGCTTGCACCTTTATGTGTGCTGTTGAGTTGGGACAGATGGTGCATTGTCACATTTGGAAGAAGGGGTTTGAATCCCATTTGTTTGTTCAAACTGCTTTGGTTGATTTTTACTCAAAGTTGGAGATACTTAGTGAGGCAAGAAAGGTGTTTGATGAAATGTGTGAAAGAGATGCTTTTGCATGGACTGCTATGGTTTCTGCTCTAGCTCGTGTTGGAGATATGGATTCCGCTAGGAAGTTGTTTGAGGAGATGCCTGAAAGGAATACTGCAACTTGGAATACCATGATTGACGGCTATGCAAGATTGGGAAATGTGGAGTCTGCAGAGCTTCTGTTCAATCAGATGCCAACCAAGGATATAATCTCCTGGACAACCATGATCACTTGTTATTCTCAGAACAAACAATATCAAGATGCATTGGCGATTTATAGTGAGATGAGATTGAATGGGATTATTCCCGATGAGGTAACAATGTCAACTGTTGCTTCAGCTTGCGCCCACATTGGAGCTCTTGAACTAGGAAAAGAGATACATCATTATGTAATGTCTCAGGGGCTTAATCTTGACGTTTATATTGGTTCTGCATTAGTTGATATGTATGCTAAGTGTGGGAGTTTAGATTTGTCTCTTTTGATTTTCTTCAAATTGACAGATAAAAATTTATATTGCTGGAATGCAGTAATTGAAGGACTTGCTGTTCATGGTTATGCGGAGAAGGCTTTGAGGATGTTCGCTATCATGGAGAGGGAGAAGATCATGCCCAATGGTGTTACCTTTATTAGTATATTAAGTGCTTGCACACATGCTGGGTTAGTTGACGAAGGCAGGAGTAGATTTTTAAGCATGACTCGTGATTACGACATTCGTCCTGATATCAGACACTATGGTTGCATGGTTGATATGTTAAGTAAATCAGGATATCTCAACGAAGCGTTAGAATTGATTAAAAGTATGGAATTTGAACCAAACTCTATTATTTGGGGAGCCTTGTTGAATGGGTGCAAACTTCATGGAAACTGTGAGATCGCTGAAGATGCTGTTGAACAGTTGATGATTTTGGAACCCATGAATAGTGGGCATTACAATCTTTTGGTCAGCATGTATGCTGAAGAAAAGGATTGGATGGAGGTTGTGCATATTCGATCAATGATGAAAGAAAAAGGAGTAGAAAAGAAATATCCTGGCTCAAGTTGGATTGAATTGGAAGGGACAATTCATCAGTTTTCAGCTTCAGCTGATTCTCACCCTGATTCTGACAAAATATACTTCATACTGACAGAATTAGATGGACAACTGAAGCTAGCTGGTTACATACTCGAGCCTTCAGTATGCAGTACTGGTTTGCTTTTTTCAGAGGAAATTTGA

Protein sequence

MFSFVTTNALKQLTRSIGNFVSPPSISMPLQPPSRPSFKQTLLNRIKNCSTINELHGLCASMIKTNAIQDCFLVHHFISASFALNSVHYPVFAFTQMENPNVFVYNAMIKGFVYCGYPFRALQCYVHMLEESNVLPTSYTFSSLVKACTFMCAVELGQMVHCHIWKKGFESHLFVQTALVDFYSKLEILSEARKVFDEMCERDAFAWTAMVSALARVGDMDSARKLFEEMPERNTATWNTMIDGYARLGNVESAELLFNQMPTKDIISWTTMITCYSQNKQYQDALAIYSEMRLNGIIPDEVTMSTVASACAHIGALELGKEIHHYVMSQGLNLDVYIGSALVDMYAKCGSLDLSLLIFFKLTDKNLYCWNAVIEGLAVHGYAEKALRMFAIMEREKIMPNGVTFISILSACTHAGLVDEGRSRFLSMTRDYDIRPDIRHYGCMVDMLSKSGYLNEALELIKSMEFEPNSIIWGALLNGCKLHGNCEIAEDAVEQLMILEPMNSGHYNLLVSMYAEEKDWMEVVHIRSMMKEKGVEKKYPGSSWIELEGTIHQFSASADSHPDSDKIYFILTELDGQLKLAGYILEPSVCSTGLLFSEEI*
Homology
BLAST of CsGy3G021830 vs. ExPASy Swiss-Prot
Match: Q56X05 (Pentatricopeptide repeat-containing protein At1g06143 OS=Arabidopsis thaliana OX=3702 GN=EMB1444 PE=2 SV=2)

HSP 1 Score: 632.5 bits (1630), Expect = 4.9e-180
Identity = 322/586 (54.95%), Postives = 414/586 (70.65%), Query Frame = 0

Query: 1   MFSFVTTNALKQLTRSIGNFVSPPSISMPLQPPSRPSFKQTLLNRIKNCSTINELHGLCA 60
           M +F   ++L+  +  + +F    S S+ L PP+       L   IK CST   L    A
Sbjct: 1   MNAFANVHSLRVPSHHLRDF----SASLSLAPPN-------LKKIIKQCSTPKLLESALA 60

Query: 61  SMIKTNAIQDCFLVHHFISASFALNSVHYPVFAFTQMENPNVFVYNAMIKGFVYCGYPFR 120
           +MIKT+  QDC L++ FI+A  +   +   V   TQM+ PNVFVYNA+ KGFV C +P R
Sbjct: 61  AMIKTSLNQDCRLMNQFITACTSFKRLDLAVSTMTQMQEPNVFVYNALFKGFVTCSHPIR 120

Query: 121 ALQCYVHMLEESNVLPTSYTFSSLVKACTFMCAVELGQMVHCHIWKKGFESHLFVQTALV 180
           +L+ YV ML +S V P+SYT+SSLVKA +F  A   G+ +  HIWK GF  H+ +QT L+
Sbjct: 121 SLELYVRMLRDS-VSPSSYTYSSLVKASSF--ASRFGESLQAHIWKFGFGFHVKIQTTLI 180

Query: 181 DFYSKLEILSEARKVFDEMCERDAFAWTAMVSALARVGDMDSARKLFEEMPERNTATWNT 240
           DFYS    + EARKVFDEM ERD  AWT MVSA  RV DMDSA  L  +M E+N AT N 
Sbjct: 181 DFYSATGRIREARKVFDEMPERDDIAWTTMVSAYRRVLDMDSANSLANQMSEKNEATSNC 240

Query: 241 MIDGYARLGNVESAELLFNQMPTKDIISWTTMITCYSQNKQYQDALAIYSEMRLNGIIPD 300
           +I+GY  LGN+E AE LFNQMP KDIISWTTMI  YSQNK+Y++A+A++ +M   GIIPD
Sbjct: 241 LINGYMGLGNLEQAESLFNQMPVKDIISWTTMIKGYSQNKRYREAIAVFYKMMEEGIIPD 300

Query: 301 EVTMSTVASACAHIGALELGKEIHHYVMSQGLNLDVYIGSALVDMYAKCGSLDLSLLIFF 360
           EVTMSTV SACAH+G LE+GKE+H Y +  G  LDVYIGSALVDMY+KCGSL+ +LL+FF
Sbjct: 301 EVTMSTVISACAHLGVLEIGKEVHMYTLQNGFVLDVYIGSALVDMYSKCGSLERALLVFF 360

Query: 361 KLTDKNLYCWNAVIEGLAVHGYAEKALRMFAIMEREKIMPNGVTFISILSACTHAGLVDE 420
            L  KNL+CWN++IEGLA HG+A++AL+MFA ME E + PN VTF+S+ +ACTHAGLVDE
Sbjct: 361 NLPKKNLFCWNSIIEGLAAHGFAQEALKMFAKMEMESVKPNAVTFVSVFTACTHAGLVDE 420

Query: 421 GRSRFLSMTRDYDIRPDIRHYGCMVDMLSKSGYLNEALELIKSMEFEPNSIIWGALLNGC 480
           GR  + SM  DY I  ++ HYG MV + SK+G + EALELI +MEFEPN++IWGALL+GC
Sbjct: 421 GRRIYRSMIDDYSIVSNVEHYGGMVHLFSKAGLIYEALELIGNMEFEPNAVIWGALLDGC 480

Query: 481 KLHGNCEIAEDAVEQLMILEPMNSGHYNLLVSMYAEEKDWMEVVHIRSMMKEKGVEKKYP 540
           ++H N  IAE A  +LM+LEPMNSG+Y LLVSMYAE+  W +V  IR  M+E G+EK  P
Sbjct: 481 RIHKNLVIAEIAFNKLMVLEPMNSGYYFLLVSMYAEQNRWRDVAEIRGRMRELGIEKICP 540

Query: 541 GSSWIELEGTIHQFSASADSHPDSDKIYFILTELDGQLKLAGYILE 587
           G+S I ++   H F+A+  SH  SD++  +L E+  Q+ LAGY+ E
Sbjct: 541 GTSSIRIDKRDHLFAAADKSHSASDEVCLLLDEIYDQMGLAGYVQE 572

BLAST of CsGy3G021830 vs. ExPASy Swiss-Prot
Match: Q9LS72 (Pentatricopeptide repeat-containing protein At3g29230 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E27 PE=2 SV=1)

HSP 1 Score: 406.4 bits (1043), Expect = 5.7e-112
Identity = 209/586 (35.67%), Postives = 343/586 (58.53%), Query Frame = 0

Query: 27  SMPLQPPSRPSFKQTLLNRIKN---CSTINELHGLCASMIKTNAIQDCFLVHHFISASFA 86
           S+P++ PS  S ++    R+++   C+ +N++  L A +I+ N  +D  +    ISA   
Sbjct: 4   SLPVRAPSWVSSRRIFEERLQDLPKCANLNQVKQLHAQIIRRNLHEDLHIAPKLISALSL 63

Query: 87  LNSVHYPVFAFTQMENPNVFVYNAMIKGFVYCGYPFRALQCYVHMLEESNVLPTSYTFSS 146
               +  V  F Q++ PNV + N++I+       P++A   +  M +   +   ++T+  
Sbjct: 64  CRQTNLAVRVFNQVQEPNVHLCNSLIRAHAQNSQPYQAFFVFSEM-QRFGLFADNFTYPF 123

Query: 147 LVKACTFMCAVELGQMVHCHIWKKGFESHLFVQTALVDFYS------------------- 206
           L+KAC+    + + +M+H HI K G  S ++V  AL+D YS                   
Sbjct: 124 LLKACSGQSWLPVVKMMHNHIEKLGLSSDIYVPNALIDCYSRCGGLGVRDAMKLFEKMSE 183

Query: 207 --------------KLEILSEARKVFDEMCERDAFAWTAMVSALARVGDMDSARKLFEEM 266
                         K   L +AR++FDEM +RD  +W  M+   AR  +M  A +LFE+M
Sbjct: 184 RDTVSWNSMLGGLVKAGELRDARRLFDEMPQRDLISWNTMLDGYARCREMSKAFELFEKM 243

Query: 267 PERNTATWNTMIDGYARLGNVESAELLFNQM--PTKDIISWTTMITCYSQNKQYQDALAI 326
           PERNT +W+TM+ GY++ G++E A ++F++M  P K++++WT +I  Y++    ++A  +
Sbjct: 244 PERNTVSWSTMVMGYSKAGDMEMARVMFDKMPLPAKNVVTWTIIIAGYAEKGLLKEADRL 303

Query: 327 YSEMRLNGIIPDEVTMSTVASACAHIGALELGKEIHHYVMSQGLNLDVYIGSALVDMYAK 386
             +M  +G+  D   + ++ +AC   G L LG  IH  +    L  + Y+ +AL+DMYAK
Sbjct: 304 VDQMVASGLKFDAAAVISILAACTESGLLSLGMRIHSILKRSNLGSNAYVLNALLDMYAK 363

Query: 387 CGSLDLSLLIFFKLTDKNLYCWNAVIEGLAVHGYAEKALRMFAIMEREKIMPNGVTFISI 446
           CG+L  +  +F  +  K+L  WN ++ GL VHG+ ++A+ +F+ M RE I P+ VTFI++
Sbjct: 364 CGNLKKAFDVFNDIPKKDLVSWNTMLHGLGVHGHGKEAIELFSRMRREGIRPDKVTFIAV 423

Query: 447 LSACTHAGLVDEGRSRFLSMTRDYDIRPDIRHYGCMVDMLSKSGYLNEALELIKSMEFEP 506
           L +C HAGL+DEG   F SM + YD+ P + HYGC+VD+L + G L EA++++++M  EP
Sbjct: 424 LCSCNHAGLIDEGIDYFYSMEKVYDLVPQVEHYGCLVDLLGRVGRLKEAIKVVQTMPMEP 483

Query: 507 NSIIWGALLNGCKLHGNCEIAEDAVEQLMILEPMNSGHYNLLVSMYAEEKDWMEVVHIRS 566
           N +IWGALL  C++H   +IA++ ++ L+ L+P + G+Y+LL ++YA  +DW  V  IRS
Sbjct: 484 NVVIWGALLGACRMHNEVDIAKEVLDNLVKLDPCDPGNYSLLSNIYAAAEDWEGVADIRS 543

Query: 567 MMKEKGVEKKYPGSSWIELEGTIHQFSASADSHPDSDKIYFILTEL 575
            MK  GVEK   G+S +ELE  IH+F+    SHP SD+IY +L  L
Sbjct: 544 KMKSMGVEKP-SGASSVELEDGIHEFTVFDKSHPKSDQIYQMLGSL 587

BLAST of CsGy3G021830 vs. ExPASy Swiss-Prot
Match: Q9LN01 (Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H12 PE=2 SV=1)

HSP 1 Score: 396.4 bits (1017), Expect = 5.9e-109
Identity = 219/654 (33.49%), Postives = 345/654 (52.75%), Query Frame = 0

Query: 24  PSISMPL-------QPPSRPSFKQTLLNRIKNCSTINELHGLCASMIK----------TN 83
           PS S P         PP         L+ + NC T+  L  + A MIK          + 
Sbjct: 11  PSSSYPFHFLPSSSDPPYDSIRNHPSLSLLHNCKTLQSLRIIHAQMIKIGLHNTNYALSK 70

Query: 84  AIQDCFLVHHFISASFALNSVHYPVFAFTQMENPNVFVYNAMIKGFVYCGYPFRALQCYV 143
            I+ C L  HF         + Y +  F  ++ PN+ ++N M +G      P  AL+ YV
Sbjct: 71  LIEFCILSPHF-------EGLPYAISVFKTIQEPNLLIWNTMFRGHALSSDPVSALKLYV 130

Query: 144 HMLEESNVLPTSYTFSSLVKACTFMCAVELGQMVHCHIWKKGFESHLFVQTALVDFYSKL 203
            M+    +LP SYTF  ++K+C    A + GQ +H H+ K G +  L+V T+L+  Y + 
Sbjct: 131 CMI-SLGLLPNSYTFPFVLKSCAKSKAFKEGQQIHGHVLKLGCDLDLYVHTSLISMYVQN 190

Query: 204 EILSEARKVFDEMCERDAFAWTAMVSALARVGDMDSARKLFEEMPERNTATWNTMIDGYA 263
             L +A KVFD+   RD  ++TA++   A  G +++A+KLF+E+P ++  +WN MI GYA
Sbjct: 191 GRLEDAHKVFDKSPHRDVVSYTALIKGYASRGYIENAQKLFDEIPVKDVVSWNAMISGYA 250

Query: 264 RLGN-------------------------------------------------------- 323
             GN                                                        
Sbjct: 251 ETGNYKEALELFKDMMKTNVRPDESTMVTVVSACAQSGSIELGRQVHLWIDDHGFGSNLK 310

Query: 324 --------------VESAELLFNQMPTKDIISWTTMITCYSQNKQYQDALAIYSEMRLNG 383
                         +E+A  LF ++P KD+ISW T+I  Y+    Y++AL ++ EM  +G
Sbjct: 311 IVNALIDLYSKCGELETACGLFERLPYKDVISWNTLIGGYTHMNLYKEALLLFQEMLRSG 370

Query: 384 IIPDEVTMSTVASACAHIGALELGKEIHHYV--MSQGLNLDVYIGSALVDMYAKCGSLDL 443
             P++VTM ++  ACAH+GA+++G+ IH Y+    +G+     + ++L+DMYAKCG ++ 
Sbjct: 371 ETPNDVTMLSILPACAHLGAIDIGRWIHVYIDKRLKGVTNASSLRTSLIDMYAKCGDIEA 430

Query: 444 SLLIFFKLTDKNLYCWNAVIEGLAVHGYAEKALRMFAIMEREKIMPNGVTFISILSACTH 503
           +  +F  +  K+L  WNA+I G A+HG A+ +  +F+ M +  I P+ +TF+ +LSAC+H
Sbjct: 431 AHQVFNSILHKSLSSWNAMIFGFAMHGRADASFDLFSRMRKIGIQPDDITFVGLLSACSH 490

Query: 504 AGLVDEGRSRFLSMTRDYDIRPDIRHYGCMVDMLSKSGYLNEALELIKSMEFEPNSIIWG 563
           +G++D GR  F +MT+DY + P + HYGCM+D+L  SG   EA E+I  ME EP+ +IW 
Sbjct: 491 SGMLDLGRHIFRTMTQDYKMTPKLEHYGCMIDLLGHSGLFKEAEEMINMMEMEPDGVIWC 550

Query: 564 ALLNGCKLHGNCEIAEDAVEQLMILEPMNSGHYNLLVSMYAEEKDWMEVVHIRSMMKEKG 589
           +LL  CK+HGN E+ E   E L+ +EP N G Y LL ++YA    W EV   R+++ +KG
Sbjct: 551 SLLKACKMHGNVELGESFAENLIKIEPENPGSYVLLSNIYASAGRWNEVAKTRALLNDKG 610

BLAST of CsGy3G021830 vs. ExPASy Swiss-Prot
Match: O82380 (Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H33 PE=2 SV=1)

HSP 1 Score: 379.8 bits (974), Expect = 5.7e-104
Identity = 210/649 (32.36%), Postives = 343/649 (52.85%), Query Frame = 0

Query: 24  PSISMPLQPPSRPSFKQTLLNRIKNCSTINELHGLCASMIKTNAIQDCFLVHHF--ISAS 83
           P+ S P QP +  + +   ++ I+ C ++ +L      MI+T    D +       ++A 
Sbjct: 16  PNFSNPNQPTTN-NERSRHISLIERCVSLRQLKQTHGHMIRTGTFSDPYSASKLFAMAAL 75

Query: 84  FALNSVHYPVFAFTQMENPNVFVYNAMIKGFVYCGYPFRALQCYVHMLEESNVLPTSYTF 143
            +  S+ Y    F ++  PN F +N +I+ +     P  ++  ++ M+ ES   P  YTF
Sbjct: 76  SSFASLEYARKVFDEIPKPNSFAWNTLIRAYASGPDPVLSIWAFLDMVSESQCYPNKYTF 135

Query: 144 SSLVKACTFMCAVELGQMVHCHIWKKGFESHLFVQTALVDFYSKLEILSEARKVFDEMCE 203
             L+KA   + ++ LGQ +H    K    S +FV  +L+  Y     L  A KVF  + E
Sbjct: 136 PFLIKAAAEVSSLSLGQSLHGMAVKSAVGSDVFVANSLIHCYFSCGDLDSACKVFTTIKE 195

Query: 204 RDAFAWTAMV-----------------------------------SALARV--------- 263
           +D  +W +M+                                   SA A++         
Sbjct: 196 KDVVSWNSMINGFVQKGSPDKALELFKKMESEDVKASHVTMVGVLSACAKIRNLEFGRQV 255

Query: 264 --------------------------GDMDSARKLFEEMPERNTATWNTMIDGYARLGNV 323
                                     G ++ A++LF+ M E++  TW TM+DGYA   + 
Sbjct: 256 CSYIEENRVNVNLTLANAMLDMYTKCGSIEDAKRLFDAMEEKDNVTWTTMLDGYAISEDY 315

Query: 324 ESAELLFNQMPTKDIISWTTMITCYSQNKQYQDALAIYSEMRL-NGIIPDEVTMSTVASA 383
           E+A  + N MP KDI++W  +I+ Y QN +  +AL ++ E++L   +  +++T+ +  SA
Sbjct: 316 EAAREVLNSMPQKDIVAWNALISAYEQNGKPNEALIVFHELQLQKNMKLNQITLVSTLSA 375

Query: 384 CAHIGALELGKEIHHYVMSQGLNLDVYIGSALVDMYAKCGSLDLSLLIFFKLTDKNLYCW 443
           CA +GALELG+ IH Y+   G+ ++ ++ SAL+ MY+KCG L+ S  +F  +  ++++ W
Sbjct: 376 CAQVGALELGRWIHSYIKKHGIRMNFHVTSALIHMYSKCGDLEKSREVFNSVEKRDVFVW 435

Query: 444 NAVIEGLAVHGYAEKALRMFAIMEREKIMPNGVTFISILSACTHAGLVDEGRSRFLSMTR 503
           +A+I GLA+HG   +A+ MF  M+   + PNGVTF ++  AC+H GLVDE  S F  M  
Sbjct: 436 SAMIGGLAMHGCGNEAVDMFYKMQEANVKPNGVTFTNVFCACSHTGLVDEAESLFHQMES 495

Query: 504 DYDIRPDIRHYGCMVDMLSKSGYLNEALELIKSMEFEPNSIIWGALLNGCKLHGNCEIAE 563
           +Y I P+ +HY C+VD+L +SGYL +A++ I++M   P++ +WGALL  CK+H N  +AE
Sbjct: 496 NYGIVPEEKHYACIVDVLGRSGYLEKAVKFIEAMPIPPSTSVWGALLGACKIHANLNLAE 555

Query: 564 DAVEQLMILEPMNSGHYNLLVSMYAEEKDWMEVVHIRSMMKEKGVEKKYPGSSWIELEGT 600
            A  +L+ LEP N G + LL ++YA+   W  V  +R  M+  G+ KK PG S IE++G 
Sbjct: 556 MACTRLLELEPRNDGAHVLLSNIYAKLGKWENVSELRKHMRVTGL-KKEPGCSSIEIDGM 615

BLAST of CsGy3G021830 vs. ExPASy Swiss-Prot
Match: Q9FJY7 (Pentatricopeptide repeat-containing protein At5g66520 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H61 PE=2 SV=1)

HSP 1 Score: 379.4 bits (973), Expect = 7.5e-104
Identity = 198/544 (36.40%), Postives = 314/544 (57.72%), Query Frame = 0

Query: 46  IKNCSTINELHGLCASMIKTNAIQDCFLVHHFIS---ASFALNSVHYPVFAFTQMENPNV 105
           ++ CS   EL  + A M+KT  +QD + +  F+S   +S + + + Y    F   + P+ 
Sbjct: 21  LQRCSKQEELKQIHARMLKTGLMQDSYAITKFLSFCISSTSSDFLPYAQIVFDGFDRPDT 80

Query: 106 FVYNAMIKGFVYCGYPFRALQCYVHMLEESNVLPTSYTFSSLVKACTFMCAVELGQMVHC 165
           F++N MI+GF     P R+L  Y  ML  S+    +YTF SL+KAC+ + A E    +H 
Sbjct: 81  FLWNLMIRGFSCSDEPERSLLLYQRML-CSSAPHNAYTFPSLLKACSNLSAFEETTQIHA 140

Query: 166 HIWKKGFESHLFVQTALVDFYSKLEILSEARKVFDEMCERDAFAWTAMVSALARVGDMDS 225
            I K G+E+                               D +A  +++++ A  G+   
Sbjct: 141 QITKLGYEN-------------------------------DVYAVNSLINSYAVTGNFKL 200

Query: 226 ARKLFEEMPERNTATWNTMIDGYARLGNVESAELLFNQMPTKDIISWTTMITCYSQNKQY 285
           A  LF+ +PE +  +WN++I GY + G ++ A  LF +M  K+ ISWTTMI+ Y Q    
Sbjct: 201 AHLLFDRIPEPDDVSWNSVIKGYVKAGKMDIALTLFRKMAEKNAISWTTMISGYVQADMN 260

Query: 286 QDALAIYSEMRLNGIIPDEVTMSTVASACAHIGALELGKEIHHYVMSQGLNLDVYIGSAL 345
           ++AL ++ EM+ + + PD V+++   SACA +GALE GK IH Y+    + +D  +G  L
Sbjct: 261 KEALQLFHEMQNSDVEPDNVSLANALSACAQLGALEQGKWIHSYLNKTRIRMDSVLGCVL 320

Query: 346 VDMYAKCGSLDLSLLIFFKLTDKNLYCWNAVIEGLAVHGYAEKALRMFAIMEREKIMPNG 405
           +DMYAKCG ++ +L +F  +  K++  W A+I G A HG+  +A+  F  M++  I PN 
Sbjct: 321 IDMYAKCGEMEEALEVFKNIKKKSVQAWTALISGYAYHGHGREAISKFMEMQKMGIKPNV 380

Query: 406 VTFISILSACTHAGLVDEGRSRFLSMTRDYDIRPDIRHYGCMVDMLSKSGYLNEALELIK 465
           +TF ++L+AC++ GLV+EG+  F SM RDY+++P I HYGC+VD+L ++G L+EA   I+
Sbjct: 381 ITFTAVLTACSYTGLVEEGKLIFYSMERDYNLKPTIEHYGCIVDLLGRAGLLDEAKRFIQ 440

Query: 466 SMEFEPNSIIWGALLNGCKLHGNCEIAEDAVEQLMILEPMNSGHYNLLVSMYAEEKDWME 525
            M  +PN++IWGALL  C++H N E+ E+  E L+ ++P + G Y    +++A +K W +
Sbjct: 441 EMPLKPNAVIWGALLKACRIHKNIELGEEIGEILIAIDPYHGGRYVHKANIHAMDKKWDK 500

Query: 526 VVHIRSMMKEKGVEKKYPGSSWIELEGTIHQFSASADSHPDSDKIYFILTELDGQLKLAG 585
               R +MKE+GV  K PG S I LEGT H+F A   SHP+ +KI      +  +L+  G
Sbjct: 501 AAETRRLMKEQGV-AKVPGCSTISLEGTTHEFLAGDRSHPEIEKIQSKWRIMRRKLEENG 531

Query: 586 YILE 587
           Y+ E
Sbjct: 561 YVPE 531

BLAST of CsGy3G021830 vs. NCBI nr
Match: XP_011651448.1 (pentatricopeptide repeat-containing protein At1g06143 [Cucumis sativus] >XP_011651449.1 pentatricopeptide repeat-containing protein At1g06143 [Cucumis sativus] >XP_011651450.1 pentatricopeptide repeat-containing protein At1g06143 [Cucumis sativus] >KGN57932.1 hypothetical protein Csa_011399 [Cucumis sativus])

HSP 1 Score: 1210 bits (3131), Expect = 0.0
Identity = 595/600 (99.17%), Postives = 596/600 (99.33%), Query Frame = 0

Query: 1   MFSFVTTNALKQLTRSIGNFVSPPSISMPLQPPSRPSFKQTLLNRIKNCSTINELHGLCA 60
           MFSFVTTNALKQLTRSIGNFVSPPSISMPLQPPS PSFKQTLLNRIKNCSTINELHGLCA
Sbjct: 1   MFSFVTTNALKQLTRSIGNFVSPPSISMPLQPPSCPSFKQTLLNRIKNCSTINELHGLCA 60

Query: 61  SMIKTNAIQDCFLVHHFISASFALNSVHYPVFAFTQMENPNVFVYNAMIKGFVYCGYPFR 120
           SMIKTNAIQDCFLVH FISASFALNSVHYPVFAFTQMENPNVFVYNAMIKGFVYCGYPFR
Sbjct: 61  SMIKTNAIQDCFLVHQFISASFALNSVHYPVFAFTQMENPNVFVYNAMIKGFVYCGYPFR 120

Query: 121 ALQCYVHMLEESNVLPTSYTFSSLVKACTFMCAVELGQMVHCHIWKKGFESHLFVQTALV 180
           ALQCYVHMLEESNVLPTSYTFSSLVKACTFMCAVELGQMVHCHIWKKGFESHLFVQTALV
Sbjct: 121 ALQCYVHMLEESNVLPTSYTFSSLVKACTFMCAVELGQMVHCHIWKKGFESHLFVQTALV 180

Query: 181 DFYSKLEILSEARKVFDEMCERDAFAWTAMVSALARVGDMDSARKLFEEMPERNTATWNT 240
           DFYSKLEILSEARKVFDEMCERDAFAWTAMVSALARVGDMDSARKLFEEMPERNTATWNT
Sbjct: 181 DFYSKLEILSEARKVFDEMCERDAFAWTAMVSALARVGDMDSARKLFEEMPERNTATWNT 240

Query: 241 MIDGYARLGNVESAELLFNQMPTKDIISWTTMITCYSQNKQYQDALAIYSEMRLNGIIPD 300
           MIDGYARLGNVESAELLFNQMPTKDIISWTTMITCYSQNKQYQDALAIYSEMRLNGIIPD
Sbjct: 241 MIDGYARLGNVESAELLFNQMPTKDIISWTTMITCYSQNKQYQDALAIYSEMRLNGIIPD 300

Query: 301 EVTMSTVASACAHIGALELGKEIHHYVMSQGLNLDVYIGSALVDMYAKCGSLDLSLLIFF 360
           EVTMSTVASACAHIGALELGKEIHHYVMSQGLNLDVYIGSALVDMYAKCGSLDLSLLIFF
Sbjct: 301 EVTMSTVASACAHIGALELGKEIHHYVMSQGLNLDVYIGSALVDMYAKCGSLDLSLLIFF 360

Query: 361 KLTDKNLYCWNAVIEGLAVHGYAEKALRMFAIMEREKIMPNGVTFISILSACTHAGLVDE 420
           KLTDKNLYCWNAVIEGLAVHGYAEKALRMFAIMEREKIMPNGVTFISILSACTHAGLVDE
Sbjct: 361 KLTDKNLYCWNAVIEGLAVHGYAEKALRMFAIMEREKIMPNGVTFISILSACTHAGLVDE 420

Query: 421 GRSRFLSMTRDYDIRPDIRHYGCMVDMLSKSGYLNEALELIKSMEFEPNSIIWGALLNGC 480
           GRSRFLSMTRDYDIRPDIRHYGCMVDMLSKSGYLNEALELIKSMEFEPNSIIWGALLNGC
Sbjct: 421 GRSRFLSMTRDYDIRPDIRHYGCMVDMLSKSGYLNEALELIKSMEFEPNSIIWGALLNGC 480

Query: 481 KLHGNCEIAEDAVEQLMILEPMNSGHYNLLVSMYAEEKDWMEVVHIRSMMKEKGVEKKYP 540
           KLHGNCEIAEDAVEQLMILEPMNSGHYNLLVSMYAEEKDWMEV HIRSMMKEKGVEKKYP
Sbjct: 481 KLHGNCEIAEDAVEQLMILEPMNSGHYNLLVSMYAEEKDWMEVAHIRSMMKEKGVEKKYP 540

Query: 541 GSSWIELEGTIHQFSASADSHPDSDKIYFILTELDGQLKLAGYILEPSVCSTGLLFSEEI 600
           GSSWIELEGTIHQFSASADSHPDSDKIYF+LTELDGQLKLAGYILEPSVCST LLFSEEI
Sbjct: 541 GSSWIELEGTIHQFSASADSHPDSDKIYFVLTELDGQLKLAGYILEPSVCSTALLFSEEI 600

BLAST of CsGy3G021830 vs. NCBI nr
Match: XP_008447444.1 (PREDICTED: pentatricopeptide repeat-containing protein At1g06145-like [Cucumis melo] >KAA0038095.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] >TYK20512.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 1120 bits (2896), Expect = 0.0
Identity = 556/600 (92.67%), Postives = 569/600 (94.83%), Query Frame = 0

Query: 1   MFSFVTTNALKQLTRSIGNFVSPPSISMPLQPPSRPSFKQTLLNRIKNCSTINELHGLCA 60
           MFSFVTT ALKQLTRSIGNFVSP SISMPLQ PSRPSFKQTLLNRIKNCS INELH + A
Sbjct: 1   MFSFVTTIALKQLTRSIGNFVSP-SISMPLQSPSRPSFKQTLLNRIKNCSNINELHVVYA 60

Query: 61  SMIKTNAIQDCFLVHHFISASFALNSVHYPVFAFTQMENPNVFVYNAMIKGFVYCGYPFR 120
           SMIK+NAIQDCFLVH FISASFA NSVHYPVFAFTQMENPNVFVYNAMIKGFVY GYPFR
Sbjct: 61  SMIKSNAIQDCFLVHQFISASFAFNSVHYPVFAFTQMENPNVFVYNAMIKGFVYRGYPFR 120

Query: 121 ALQCYVHMLEESNVLPTSYTFSSLVKACTFMCAVELGQMVHCHIWKKGFESHLFVQTALV 180
            LQCYVHMLE SNVLP SYTFSSLVKACTFMCAVELGQMVHCHIWKKGFESHLFVQTALV
Sbjct: 121 GLQCYVHMLEGSNVLPNSYTFSSLVKACTFMCAVELGQMVHCHIWKKGFESHLFVQTALV 180

Query: 181 DFYSKLEILSEARKVFDEMCERDAFAWTAMVSALARVGDMDSARKLFEEMPERNTATWNT 240
           DFYSKLE LSEARKVFDEMCERDAFAWT MVSALARVGDMD+ARKLFEEMPERNTATWNT
Sbjct: 181 DFYSKLEKLSEARKVFDEMCERDAFAWTTMVSALARVGDMDTARKLFEEMPERNTATWNT 240

Query: 241 MIDGYARLGNVESAELLFNQMPTKDIISWTTMITCYSQNKQYQDALAIYSEMRLNGIIPD 300
           MIDGYARLGNVESAELLFNQMPTKDIISWTTMITCYSQNKQYQDALAIYSE RLNGIIPD
Sbjct: 241 MIDGYARLGNVESAELLFNQMPTKDIISWTTMITCYSQNKQYQDALAIYSETRLNGIIPD 300

Query: 301 EVTMSTVASACAHIGALELGKEIHHYVMSQGLNLDVYIGSALVDMYAKCGSLDLSLLIFF 360
           +VTMSTV SACAH+GALELGKEIH YVMSQGLN DVYIGSALVDMYAKCGSLD SLLIFF
Sbjct: 301 QVTMSTVVSACAHVGALELGKEIHQYVMSQGLNHDVYIGSALVDMYAKCGSLDWSLLIFF 360

Query: 361 KLTDKNLYCWNAVIEGLAVHGYAEKALRMFAIMEREKIMPNGVTFISILSACTHAGLVDE 420
           KL DKNLYCWNAVIEGLAVHGYAEKALRMFAIMEREKI+PNGVTFISILSACTHAGLV+E
Sbjct: 361 KLKDKNLYCWNAVIEGLAVHGYAEKALRMFAIMEREKILPNGVTFISILSACTHAGLVEE 420

Query: 421 GRSRFLSMTRDYDIRPDIRHYGCMVDMLSKSGYLNEALELIKSMEFEPNSIIWGALLNGC 480
           GRSRFLSMTRDY I P+IRHYGCMVDMLSK+G L EALELIKSMEFEPNSIIWGALLNGC
Sbjct: 421 GRSRFLSMTRDYGISPEIRHYGCMVDMLSKAGLLKEALELIKSMEFEPNSIIWGALLNGC 480

Query: 481 KLHGNCEIAEDAVEQLMILEPMNSGHYNLLVSMYAEEKDWMEVVHIRSMMKEKGVEKKYP 540
           KLHGN  IA+DAVEQLMILEPMNSGHYNLLVSM AEEKDWMEV HIR MMKE+GVEKKYP
Sbjct: 481 KLHGNSVIAKDAVEQLMILEPMNSGHYNLLVSMCAEEKDWMEVAHIRLMMKEQGVEKKYP 540

Query: 541 GSSWIELEGTIHQFSASADSHPDSDKIYFILTELDGQLKLAGYILEPSVCSTGLLFSEEI 600
           GSSWIELEGTIHQFSASADSHPDSDKIYF+LTELDGQLKLAGYILEPSVCST L+F EEI
Sbjct: 541 GSSWIELEGTIHQFSASADSHPDSDKIYFVLTELDGQLKLAGYILEPSVCSTALVFPEEI 599

BLAST of CsGy3G021830 vs. NCBI nr
Match: XP_038888390.1 (pentatricopeptide repeat-containing protein At1g06143 [Benincasa hispida])

HSP 1 Score: 1066 bits (2757), Expect = 0.0
Identity = 523/595 (87.90%), Postives = 551/595 (92.61%), Query Frame = 0

Query: 1   MFSFVTTNALKQLTRSIGNFVSPPSISMPLQPPSRPSFKQTLLNRIKNCSTINELHGLCA 60
           MFSFV TNALKQLTRSI NFVS  SISMP QPPS PSFKQTLLNRIKNCSTINEL G+ A
Sbjct: 1   MFSFVITNALKQLTRSISNFVSS-SISMPPQPPSIPSFKQTLLNRIKNCSTINELDGIYA 60

Query: 61  SMIKTNAIQDCFLVHHFISASFALNSVHYPVFAFTQMENPNVFVYNAMIKGFVYCGYPFR 120
           SMIKTNA QDCFLV+ FIS S A NSV YPV AFTQMENPNVFVYNAMI+GFVYCGYPF 
Sbjct: 61  SMIKTNATQDCFLVNQFISTSLAFNSVDYPVIAFTQMENPNVFVYNAMIRGFVYCGYPFG 120

Query: 121 ALQCYVHMLEESNVLPTSYTFSSLVKACTFMCAVELGQMVHCHIWKKGFESHLFVQTALV 180
           ALQCYVHMLEE+ V PTSYTFSSLVKACTFMCAVELG+M+HCHIWK GFESHLFVQTAL+
Sbjct: 121 ALQCYVHMLEEAKVFPTSYTFSSLVKACTFMCAVELGRMIHCHIWKSGFESHLFVQTALI 180

Query: 181 DFYSKLEILSEARKVFDEMCERDAFAWTAMVSALARVGDMDSARKLFEEMPERNTATWNT 240
           DFYS LE LSEARKVFDEM ERD+FAWT MVSALAR GDMDSARKLFEEMPE NTATWNT
Sbjct: 181 DFYSNLERLSEARKVFDEMRERDSFAWTTMVSALARAGDMDSARKLFEEMPESNTATWNT 240

Query: 241 MIDGYARLGNVESAELLFNQMPTKDIISWTTMITCYSQNKQYQDALAIYSEMRLNGIIPD 300
           MIDGYARLGNVESAE LFNQMP +DIISWTTMITCYSQNKQYQ+AL IY +MRLNGIIPD
Sbjct: 241 MIDGYARLGNVESAEFLFNQMPVRDIISWTTMITCYSQNKQYQEALMIYIKMRLNGIIPD 300

Query: 301 EVTMSTVASACAHIGALELGKEIHHYVMSQGLNLDVYIGSALVDMYAKCGSLDLSLLIFF 360
           EVT+STV SACAH+GALELGK IHHYVMSQGLNLDVYIGSALVDMYAKCGSLD SLL+FF
Sbjct: 301 EVTLSTVVSACAHVGALELGKTIHHYVMSQGLNLDVYIGSALVDMYAKCGSLDRSLLVFF 360

Query: 361 KLTDKNLYCWNAVIEGLAVHGYAEKALRMFAIMEREKIMPNGVTFISILSACTHAGLVDE 420
           KL DKNLYCWNAVIEGLAVHGYAEKALRMF IMEREKI PNGVTFISILSACTHAGLV+E
Sbjct: 361 KLMDKNLYCWNAVIEGLAVHGYAEKALRMFVIMEREKIGPNGVTFISILSACTHAGLVEE 420

Query: 421 GRSRFLSMTRDYDIRPDIRHYGCMVDMLSKSGYLNEALELIKSMEFEPNSIIWGALLNGC 480
           GRSRFLSMTRDY IRP+I HYGCMVDMLSK+G+L+EALELIKSMEFEPNSIIWGALLNGC
Sbjct: 421 GRSRFLSMTRDYGIRPEIGHYGCMVDMLSKAGFLDEALELIKSMEFEPNSIIWGALLNGC 480

Query: 481 KLHGNCEIAEDAVEQLMILEPMNSGHYNLLVSMYAEEKDWMEVVHIRSMMKEKGVEKKYP 540
           KLHGN EIA+DAV+QLMILEPM+SGHYNLLVSMYAEEKDWMEV HIR+MMKE+GVEKKYP
Sbjct: 481 KLHGNSEIAKDAVQQLMILEPMSSGHYNLLVSMYAEEKDWMEVAHIRAMMKEQGVEKKYP 540

Query: 541 GSSWIELEGTIHQFSASADSHPDSDKIYFILTELDGQLKLAGYILEPSVCSTGLL 595
           GSSWIEL+G IHQFSASADSHPDSD+IYF+LTELDGQLKLAGYILEP VC   L+
Sbjct: 541 GSSWIELDGRIHQFSASADSHPDSDEIYFLLTELDGQLKLAGYILEPPVCCNALV 594

BLAST of CsGy3G021830 vs. NCBI nr
Match: XP_022967388.1 (pentatricopeptide repeat-containing protein At1g06143 [Cucurbita maxima])

HSP 1 Score: 991 bits (2562), Expect = 0.0
Identity = 489/589 (83.02%), Postives = 521/589 (88.46%), Query Frame = 0

Query: 1   MFSFVTTNALKQLTRSIGNFVSPPSISMPLQPPSRPSFKQTLLNRIKNCSTINELHGLCA 60
           MFS   TNALKQ+TRSI NFVS  S S  LQ P  P+FKQTLL+RIKNCSTINEL G+ A
Sbjct: 1   MFSITPTNALKQITRSISNFVSS-STSRTLQGPYVPTFKQTLLDRIKNCSTINELDGIYA 60

Query: 61  SMIKTNAIQDCFLVHHFISASFALNSVHYPVFAFTQMENPNVFVYNAMIKGFVYCGYPFR 120
           SMIK NA QDCFLV+ FISAS   NSV YPV AFTQMENPNVFVYNAMI+GFVYCGYPFR
Sbjct: 61  SMIKANATQDCFLVNQFISASLTFNSVDYPVLAFTQMENPNVFVYNAMIRGFVYCGYPFR 120

Query: 121 ALQCYVHMLEESNVLPTSYTFSSLVKACTFMCAVELGQMVHCHIWKKGFESHLFVQTALV 180
           A+QCYVHMLE S VLP+SYTFSSLVKACT MCA++LG+M+HC IW  G E  +FVQT+L+
Sbjct: 121 AIQCYVHMLE-SQVLPSSYTFSSLVKACTCMCALDLGRMIHCQIWTHGLELDVFVQTSLI 180

Query: 181 DFYSKLEILSEARKVFDEMCERDAFAWTAMVSALARVGDMDSARKLFEEMPERNTATWNT 240
           D YS LE   +ARKVFDEM ERD FAWT MVSALAR GDMDSARKLFEEMPE NTATWNT
Sbjct: 181 DLYSNLERFGDARKVFDEMRERDTFAWTTMVSALARAGDMDSARKLFEEMPESNTATWNT 240

Query: 241 MIDGYARLGNVESAELLFNQMPTKDIISWTTMITCYSQNKQYQDALAIYSEMRLNGIIPD 300
           MIDGYARLGNVESAE LFNQMP +DIISWTTMITCYSQNKQY++AL IY +MRLNGIIPD
Sbjct: 241 MIDGYARLGNVESAEFLFNQMPARDIISWTTMITCYSQNKQYEEALMIYGDMRLNGIIPD 300

Query: 301 EVTMSTVASACAHIGALELGKEIHHYVMSQGLNLDVYIGSALVDMYAKCGSLDLSLLIFF 360
           EVTMSTV SACAH+GALELGKEIHHY MS+GLNLDVYIGSALVDMYAKCGSLD SLL+FF
Sbjct: 301 EVTMSTVVSACAHVGALELGKEIHHYAMSRGLNLDVYIGSALVDMYAKCGSLDRSLLVFF 360

Query: 361 KLTDKNLYCWNAVIEGLAVHGYAEKALRMFAIMEREKIMPNGVTFISILSACTHAGLVDE 420
           KL DKNLYCWNAVIEGLAVHGYAEKALRMF IMEREKIMPNGVTFISILSACTHAGLV E
Sbjct: 361 KLKDKNLYCWNAVIEGLAVHGYAEKALRMFVIMEREKIMPNGVTFISILSACTHAGLVVE 420

Query: 421 GRSRFLSMTRDYDIRPDIRHYGCMVDMLSKSGYLNEALELIKSMEFEPNSIIWGALLNGC 480
           GRSRFLSM RDY I P++ HYGCMVDMLSK+G L+EALELI  MEFEPNSIIWGALLNGC
Sbjct: 421 GRSRFLSMIRDYGIHPEVEHYGCMVDMLSKAGLLDEALELINGMEFEPNSIIWGALLNGC 480

Query: 481 KLHGNCEIAEDAVEQLMILEPMNSGHYNLLVSMYAEEKDWMEVVHIRSMMKEKGVEKKYP 540
           KLHGN EIA+DAV +L ILEP NSGHYNLLVSMYAEEK W+EV HIR+MMKE GVEKKYP
Sbjct: 481 KLHGNSEIAKDAVRRLNILEPKNSGHYNLLVSMYAEEKHWIEVAHIRAMMKENGVEKKYP 540

Query: 541 GSSWIELEGTIHQFSASADSHPDSDKIYFILTELDGQLKLAGYILEPSV 589
           GSSWIELEG IHQFSASAD HPDSDKIYFILTELDGQLKLAG +LEPSV
Sbjct: 541 GSSWIELEGRIHQFSASADCHPDSDKIYFILTELDGQLKLAGNVLEPSV 587

BLAST of CsGy3G021830 vs. NCBI nr
Match: KAG7011261.1 (Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 991 bits (2561), Expect = 0.0
Identity = 489/589 (83.02%), Postives = 520/589 (88.29%), Query Frame = 0

Query: 1   MFSFVTTNALKQLTRSIGNFVSPPSISMPLQPPSRPSFKQTLLNRIKNCSTINELHGLCA 60
           MFS   TNALKQ+TRSI NFVS  S    LQ     +FKQTLL+RIKNCSTINEL G+ A
Sbjct: 1   MFSITPTNALKQITRSISNFVSS-STPRTLQGSYVSTFKQTLLDRIKNCSTINELDGIYA 60

Query: 61  SMIKTNAIQDCFLVHHFISASFALNSVHYPVFAFTQMENPNVFVYNAMIKGFVYCGYPFR 120
           SMIKTNA QDCFLV+ FISAS   NSV YPV AFTQMENPNVFVYNAMI+GFVYCGYPFR
Sbjct: 61  SMIKTNATQDCFLVNQFISASLTFNSVDYPVLAFTQMENPNVFVYNAMIRGFVYCGYPFR 120

Query: 121 ALQCYVHMLEESNVLPTSYTFSSLVKACTFMCAVELGQMVHCHIWKKGFESHLFVQTALV 180
           A+QCYVHMLE S VLP+SYTFSSLVKACT MCA++LG+M+HCHIWK G E  +FVQT+L+
Sbjct: 121 AIQCYVHMLE-SKVLPSSYTFSSLVKACTCMCALDLGRMIHCHIWKNGLELDVFVQTSLI 180

Query: 181 DFYSKLEILSEARKVFDEMCERDAFAWTAMVSALARVGDMDSARKLFEEMPERNTATWNT 240
           D YS LE   +ARKVFDEM ERD FAWT MVSALAR GDMDSARKLFEEMPE NTATWNT
Sbjct: 181 DLYSNLERFGDARKVFDEMRERDTFAWTTMVSALARAGDMDSARKLFEEMPESNTATWNT 240

Query: 241 MIDGYARLGNVESAELLFNQMPTKDIISWTTMITCYSQNKQYQDALAIYSEMRLNGIIPD 300
           MIDGYARLGNVESAE LFNQMP +DIISWTTMITCYSQNKQY++AL IY  MRLNGIIPD
Sbjct: 241 MIDGYARLGNVESAEFLFNQMPARDIISWTTMITCYSQNKQYEEALTIYGNMRLNGIIPD 300

Query: 301 EVTMSTVASACAHIGALELGKEIHHYVMSQGLNLDVYIGSALVDMYAKCGSLDLSLLIFF 360
           EVTMSTV SACAH+GALELGKEIHHY +S+GLNLDVYIGSALVDMYAKCGSLD SLL+FF
Sbjct: 301 EVTMSTVVSACAHVGALELGKEIHHYAISRGLNLDVYIGSALVDMYAKCGSLDRSLLVFF 360

Query: 361 KLTDKNLYCWNAVIEGLAVHGYAEKALRMFAIMEREKIMPNGVTFISILSACTHAGLVDE 420
           KL DKNLYCWNAVIEGLAVHGYAEKALRMF IMEREKIMPNGVTFISILSACTHAGLV E
Sbjct: 361 KLKDKNLYCWNAVIEGLAVHGYAEKALRMFVIMEREKIMPNGVTFISILSACTHAGLVIE 420

Query: 421 GRSRFLSMTRDYDIRPDIRHYGCMVDMLSKSGYLNEALELIKSMEFEPNSIIWGALLNGC 480
           GRSRF SM RDY IRP++ HYGCMVDMLSK+G LNEALELI  MEFEPNSIIWGALLNGC
Sbjct: 421 GRSRFSSMIRDYGIRPEVEHYGCMVDMLSKAGLLNEALELINGMEFEPNSIIWGALLNGC 480

Query: 481 KLHGNCEIAEDAVEQLMILEPMNSGHYNLLVSMYAEEKDWMEVVHIRSMMKEKGVEKKYP 540
           KLHGN EIA+DAV QL ILEP NSGHYNLLVSMYAEEK WM+V HIR+MMKE GVEKKYP
Sbjct: 481 KLHGNSEIAKDAVRQLTILEPKNSGHYNLLVSMYAEEKHWMKVAHIRAMMKENGVEKKYP 540

Query: 541 GSSWIELEGTIHQFSASADSHPDSDKIYFILTELDGQLKLAGYILEPSV 589
           GSSWIELEG IHQFSASAD HPDSDKIYFILTEL+GQLKLAG +LEPSV
Sbjct: 541 GSSWIELEGRIHQFSASADCHPDSDKIYFILTELEGQLKLAGNVLEPSV 587

BLAST of CsGy3G021830 vs. ExPASy TrEMBL
Match: A0A0A0LB99 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G395920 PE=4 SV=1)

HSP 1 Score: 1210 bits (3131), Expect = 0.0
Identity = 595/600 (99.17%), Postives = 596/600 (99.33%), Query Frame = 0

Query: 1   MFSFVTTNALKQLTRSIGNFVSPPSISMPLQPPSRPSFKQTLLNRIKNCSTINELHGLCA 60
           MFSFVTTNALKQLTRSIGNFVSPPSISMPLQPPS PSFKQTLLNRIKNCSTINELHGLCA
Sbjct: 1   MFSFVTTNALKQLTRSIGNFVSPPSISMPLQPPSCPSFKQTLLNRIKNCSTINELHGLCA 60

Query: 61  SMIKTNAIQDCFLVHHFISASFALNSVHYPVFAFTQMENPNVFVYNAMIKGFVYCGYPFR 120
           SMIKTNAIQDCFLVH FISASFALNSVHYPVFAFTQMENPNVFVYNAMIKGFVYCGYPFR
Sbjct: 61  SMIKTNAIQDCFLVHQFISASFALNSVHYPVFAFTQMENPNVFVYNAMIKGFVYCGYPFR 120

Query: 121 ALQCYVHMLEESNVLPTSYTFSSLVKACTFMCAVELGQMVHCHIWKKGFESHLFVQTALV 180
           ALQCYVHMLEESNVLPTSYTFSSLVKACTFMCAVELGQMVHCHIWKKGFESHLFVQTALV
Sbjct: 121 ALQCYVHMLEESNVLPTSYTFSSLVKACTFMCAVELGQMVHCHIWKKGFESHLFVQTALV 180

Query: 181 DFYSKLEILSEARKVFDEMCERDAFAWTAMVSALARVGDMDSARKLFEEMPERNTATWNT 240
           DFYSKLEILSEARKVFDEMCERDAFAWTAMVSALARVGDMDSARKLFEEMPERNTATWNT
Sbjct: 181 DFYSKLEILSEARKVFDEMCERDAFAWTAMVSALARVGDMDSARKLFEEMPERNTATWNT 240

Query: 241 MIDGYARLGNVESAELLFNQMPTKDIISWTTMITCYSQNKQYQDALAIYSEMRLNGIIPD 300
           MIDGYARLGNVESAELLFNQMPTKDIISWTTMITCYSQNKQYQDALAIYSEMRLNGIIPD
Sbjct: 241 MIDGYARLGNVESAELLFNQMPTKDIISWTTMITCYSQNKQYQDALAIYSEMRLNGIIPD 300

Query: 301 EVTMSTVASACAHIGALELGKEIHHYVMSQGLNLDVYIGSALVDMYAKCGSLDLSLLIFF 360
           EVTMSTVASACAHIGALELGKEIHHYVMSQGLNLDVYIGSALVDMYAKCGSLDLSLLIFF
Sbjct: 301 EVTMSTVASACAHIGALELGKEIHHYVMSQGLNLDVYIGSALVDMYAKCGSLDLSLLIFF 360

Query: 361 KLTDKNLYCWNAVIEGLAVHGYAEKALRMFAIMEREKIMPNGVTFISILSACTHAGLVDE 420
           KLTDKNLYCWNAVIEGLAVHGYAEKALRMFAIMEREKIMPNGVTFISILSACTHAGLVDE
Sbjct: 361 KLTDKNLYCWNAVIEGLAVHGYAEKALRMFAIMEREKIMPNGVTFISILSACTHAGLVDE 420

Query: 421 GRSRFLSMTRDYDIRPDIRHYGCMVDMLSKSGYLNEALELIKSMEFEPNSIIWGALLNGC 480
           GRSRFLSMTRDYDIRPDIRHYGCMVDMLSKSGYLNEALELIKSMEFEPNSIIWGALLNGC
Sbjct: 421 GRSRFLSMTRDYDIRPDIRHYGCMVDMLSKSGYLNEALELIKSMEFEPNSIIWGALLNGC 480

Query: 481 KLHGNCEIAEDAVEQLMILEPMNSGHYNLLVSMYAEEKDWMEVVHIRSMMKEKGVEKKYP 540
           KLHGNCEIAEDAVEQLMILEPMNSGHYNLLVSMYAEEKDWMEV HIRSMMKEKGVEKKYP
Sbjct: 481 KLHGNCEIAEDAVEQLMILEPMNSGHYNLLVSMYAEEKDWMEVAHIRSMMKEKGVEKKYP 540

Query: 541 GSSWIELEGTIHQFSASADSHPDSDKIYFILTELDGQLKLAGYILEPSVCSTGLLFSEEI 600
           GSSWIELEGTIHQFSASADSHPDSDKIYF+LTELDGQLKLAGYILEPSVCST LLFSEEI
Sbjct: 541 GSSWIELEGTIHQFSASADSHPDSDKIYFVLTELDGQLKLAGYILEPSVCSTALLFSEEI 600

BLAST of CsGy3G021830 vs. ExPASy TrEMBL
Match: A0A5A7T9J0 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold237G00860 PE=4 SV=1)

HSP 1 Score: 1120 bits (2896), Expect = 0.0
Identity = 556/600 (92.67%), Postives = 569/600 (94.83%), Query Frame = 0

Query: 1   MFSFVTTNALKQLTRSIGNFVSPPSISMPLQPPSRPSFKQTLLNRIKNCSTINELHGLCA 60
           MFSFVTT ALKQLTRSIGNFVSP SISMPLQ PSRPSFKQTLLNRIKNCS INELH + A
Sbjct: 1   MFSFVTTIALKQLTRSIGNFVSP-SISMPLQSPSRPSFKQTLLNRIKNCSNINELHVVYA 60

Query: 61  SMIKTNAIQDCFLVHHFISASFALNSVHYPVFAFTQMENPNVFVYNAMIKGFVYCGYPFR 120
           SMIK+NAIQDCFLVH FISASFA NSVHYPVFAFTQMENPNVFVYNAMIKGFVY GYPFR
Sbjct: 61  SMIKSNAIQDCFLVHQFISASFAFNSVHYPVFAFTQMENPNVFVYNAMIKGFVYRGYPFR 120

Query: 121 ALQCYVHMLEESNVLPTSYTFSSLVKACTFMCAVELGQMVHCHIWKKGFESHLFVQTALV 180
            LQCYVHMLE SNVLP SYTFSSLVKACTFMCAVELGQMVHCHIWKKGFESHLFVQTALV
Sbjct: 121 GLQCYVHMLEGSNVLPNSYTFSSLVKACTFMCAVELGQMVHCHIWKKGFESHLFVQTALV 180

Query: 181 DFYSKLEILSEARKVFDEMCERDAFAWTAMVSALARVGDMDSARKLFEEMPERNTATWNT 240
           DFYSKLE LSEARKVFDEMCERDAFAWT MVSALARVGDMD+ARKLFEEMPERNTATWNT
Sbjct: 181 DFYSKLEKLSEARKVFDEMCERDAFAWTTMVSALARVGDMDTARKLFEEMPERNTATWNT 240

Query: 241 MIDGYARLGNVESAELLFNQMPTKDIISWTTMITCYSQNKQYQDALAIYSEMRLNGIIPD 300
           MIDGYARLGNVESAELLFNQMPTKDIISWTTMITCYSQNKQYQDALAIYSE RLNGIIPD
Sbjct: 241 MIDGYARLGNVESAELLFNQMPTKDIISWTTMITCYSQNKQYQDALAIYSETRLNGIIPD 300

Query: 301 EVTMSTVASACAHIGALELGKEIHHYVMSQGLNLDVYIGSALVDMYAKCGSLDLSLLIFF 360
           +VTMSTV SACAH+GALELGKEIH YVMSQGLN DVYIGSALVDMYAKCGSLD SLLIFF
Sbjct: 301 QVTMSTVVSACAHVGALELGKEIHQYVMSQGLNHDVYIGSALVDMYAKCGSLDWSLLIFF 360

Query: 361 KLTDKNLYCWNAVIEGLAVHGYAEKALRMFAIMEREKIMPNGVTFISILSACTHAGLVDE 420
           KL DKNLYCWNAVIEGLAVHGYAEKALRMFAIMEREKI+PNGVTFISILSACTHAGLV+E
Sbjct: 361 KLKDKNLYCWNAVIEGLAVHGYAEKALRMFAIMEREKILPNGVTFISILSACTHAGLVEE 420

Query: 421 GRSRFLSMTRDYDIRPDIRHYGCMVDMLSKSGYLNEALELIKSMEFEPNSIIWGALLNGC 480
           GRSRFLSMTRDY I P+IRHYGCMVDMLSK+G L EALELIKSMEFEPNSIIWGALLNGC
Sbjct: 421 GRSRFLSMTRDYGISPEIRHYGCMVDMLSKAGLLKEALELIKSMEFEPNSIIWGALLNGC 480

Query: 481 KLHGNCEIAEDAVEQLMILEPMNSGHYNLLVSMYAEEKDWMEVVHIRSMMKEKGVEKKYP 540
           KLHGN  IA+DAVEQLMILEPMNSGHYNLLVSM AEEKDWMEV HIR MMKE+GVEKKYP
Sbjct: 481 KLHGNSVIAKDAVEQLMILEPMNSGHYNLLVSMCAEEKDWMEVAHIRLMMKEQGVEKKYP 540

Query: 541 GSSWIELEGTIHQFSASADSHPDSDKIYFILTELDGQLKLAGYILEPSVCSTGLLFSEEI 600
           GSSWIELEGTIHQFSASADSHPDSDKIYF+LTELDGQLKLAGYILEPSVCST L+F EEI
Sbjct: 541 GSSWIELEGTIHQFSASADSHPDSDKIYFVLTELDGQLKLAGYILEPSVCSTALVFPEEI 599

BLAST of CsGy3G021830 vs. ExPASy TrEMBL
Match: A0A1S3BHH1 (pentatricopeptide repeat-containing protein At1g06145-like OS=Cucumis melo OX=3656 GN=LOC103489889 PE=4 SV=1)

HSP 1 Score: 1120 bits (2896), Expect = 0.0
Identity = 556/600 (92.67%), Postives = 569/600 (94.83%), Query Frame = 0

Query: 1   MFSFVTTNALKQLTRSIGNFVSPPSISMPLQPPSRPSFKQTLLNRIKNCSTINELHGLCA 60
           MFSFVTT ALKQLTRSIGNFVSP SISMPLQ PSRPSFKQTLLNRIKNCS INELH + A
Sbjct: 1   MFSFVTTIALKQLTRSIGNFVSP-SISMPLQSPSRPSFKQTLLNRIKNCSNINELHVVYA 60

Query: 61  SMIKTNAIQDCFLVHHFISASFALNSVHYPVFAFTQMENPNVFVYNAMIKGFVYCGYPFR 120
           SMIK+NAIQDCFLVH FISASFA NSVHYPVFAFTQMENPNVFVYNAMIKGFVY GYPFR
Sbjct: 61  SMIKSNAIQDCFLVHQFISASFAFNSVHYPVFAFTQMENPNVFVYNAMIKGFVYRGYPFR 120

Query: 121 ALQCYVHMLEESNVLPTSYTFSSLVKACTFMCAVELGQMVHCHIWKKGFESHLFVQTALV 180
            LQCYVHMLE SNVLP SYTFSSLVKACTFMCAVELGQMVHCHIWKKGFESHLFVQTALV
Sbjct: 121 GLQCYVHMLEGSNVLPNSYTFSSLVKACTFMCAVELGQMVHCHIWKKGFESHLFVQTALV 180

Query: 181 DFYSKLEILSEARKVFDEMCERDAFAWTAMVSALARVGDMDSARKLFEEMPERNTATWNT 240
           DFYSKLE LSEARKVFDEMCERDAFAWT MVSALARVGDMD+ARKLFEEMPERNTATWNT
Sbjct: 181 DFYSKLEKLSEARKVFDEMCERDAFAWTTMVSALARVGDMDTARKLFEEMPERNTATWNT 240

Query: 241 MIDGYARLGNVESAELLFNQMPTKDIISWTTMITCYSQNKQYQDALAIYSEMRLNGIIPD 300
           MIDGYARLGNVESAELLFNQMPTKDIISWTTMITCYSQNKQYQDALAIYSE RLNGIIPD
Sbjct: 241 MIDGYARLGNVESAELLFNQMPTKDIISWTTMITCYSQNKQYQDALAIYSETRLNGIIPD 300

Query: 301 EVTMSTVASACAHIGALELGKEIHHYVMSQGLNLDVYIGSALVDMYAKCGSLDLSLLIFF 360
           +VTMSTV SACAH+GALELGKEIH YVMSQGLN DVYIGSALVDMYAKCGSLD SLLIFF
Sbjct: 301 QVTMSTVVSACAHVGALELGKEIHQYVMSQGLNHDVYIGSALVDMYAKCGSLDWSLLIFF 360

Query: 361 KLTDKNLYCWNAVIEGLAVHGYAEKALRMFAIMEREKIMPNGVTFISILSACTHAGLVDE 420
           KL DKNLYCWNAVIEGLAVHGYAEKALRMFAIMEREKI+PNGVTFISILSACTHAGLV+E
Sbjct: 361 KLKDKNLYCWNAVIEGLAVHGYAEKALRMFAIMEREKILPNGVTFISILSACTHAGLVEE 420

Query: 421 GRSRFLSMTRDYDIRPDIRHYGCMVDMLSKSGYLNEALELIKSMEFEPNSIIWGALLNGC 480
           GRSRFLSMTRDY I P+IRHYGCMVDMLSK+G L EALELIKSMEFEPNSIIWGALLNGC
Sbjct: 421 GRSRFLSMTRDYGISPEIRHYGCMVDMLSKAGLLKEALELIKSMEFEPNSIIWGALLNGC 480

Query: 481 KLHGNCEIAEDAVEQLMILEPMNSGHYNLLVSMYAEEKDWMEVVHIRSMMKEKGVEKKYP 540
           KLHGN  IA+DAVEQLMILEPMNSGHYNLLVSM AEEKDWMEV HIR MMKE+GVEKKYP
Sbjct: 481 KLHGNSVIAKDAVEQLMILEPMNSGHYNLLVSMCAEEKDWMEVAHIRLMMKEQGVEKKYP 540

Query: 541 GSSWIELEGTIHQFSASADSHPDSDKIYFILTELDGQLKLAGYILEPSVCSTGLLFSEEI 600
           GSSWIELEGTIHQFSASADSHPDSDKIYF+LTELDGQLKLAGYILEPSVCST L+F EEI
Sbjct: 541 GSSWIELEGTIHQFSASADSHPDSDKIYFVLTELDGQLKLAGYILEPSVCSTALVFPEEI 599

BLAST of CsGy3G021830 vs. ExPASy TrEMBL
Match: A0A6J1HUY6 (pentatricopeptide repeat-containing protein At1g06143 OS=Cucurbita maxima OX=3661 GN=LOC111466933 PE=4 SV=1)

HSP 1 Score: 991 bits (2562), Expect = 0.0
Identity = 489/589 (83.02%), Postives = 521/589 (88.46%), Query Frame = 0

Query: 1   MFSFVTTNALKQLTRSIGNFVSPPSISMPLQPPSRPSFKQTLLNRIKNCSTINELHGLCA 60
           MFS   TNALKQ+TRSI NFVS  S S  LQ P  P+FKQTLL+RIKNCSTINEL G+ A
Sbjct: 1   MFSITPTNALKQITRSISNFVSS-STSRTLQGPYVPTFKQTLLDRIKNCSTINELDGIYA 60

Query: 61  SMIKTNAIQDCFLVHHFISASFALNSVHYPVFAFTQMENPNVFVYNAMIKGFVYCGYPFR 120
           SMIK NA QDCFLV+ FISAS   NSV YPV AFTQMENPNVFVYNAMI+GFVYCGYPFR
Sbjct: 61  SMIKANATQDCFLVNQFISASLTFNSVDYPVLAFTQMENPNVFVYNAMIRGFVYCGYPFR 120

Query: 121 ALQCYVHMLEESNVLPTSYTFSSLVKACTFMCAVELGQMVHCHIWKKGFESHLFVQTALV 180
           A+QCYVHMLE S VLP+SYTFSSLVKACT MCA++LG+M+HC IW  G E  +FVQT+L+
Sbjct: 121 AIQCYVHMLE-SQVLPSSYTFSSLVKACTCMCALDLGRMIHCQIWTHGLELDVFVQTSLI 180

Query: 181 DFYSKLEILSEARKVFDEMCERDAFAWTAMVSALARVGDMDSARKLFEEMPERNTATWNT 240
           D YS LE   +ARKVFDEM ERD FAWT MVSALAR GDMDSARKLFEEMPE NTATWNT
Sbjct: 181 DLYSNLERFGDARKVFDEMRERDTFAWTTMVSALARAGDMDSARKLFEEMPESNTATWNT 240

Query: 241 MIDGYARLGNVESAELLFNQMPTKDIISWTTMITCYSQNKQYQDALAIYSEMRLNGIIPD 300
           MIDGYARLGNVESAE LFNQMP +DIISWTTMITCYSQNKQY++AL IY +MRLNGIIPD
Sbjct: 241 MIDGYARLGNVESAEFLFNQMPARDIISWTTMITCYSQNKQYEEALMIYGDMRLNGIIPD 300

Query: 301 EVTMSTVASACAHIGALELGKEIHHYVMSQGLNLDVYIGSALVDMYAKCGSLDLSLLIFF 360
           EVTMSTV SACAH+GALELGKEIHHY MS+GLNLDVYIGSALVDMYAKCGSLD SLL+FF
Sbjct: 301 EVTMSTVVSACAHVGALELGKEIHHYAMSRGLNLDVYIGSALVDMYAKCGSLDRSLLVFF 360

Query: 361 KLTDKNLYCWNAVIEGLAVHGYAEKALRMFAIMEREKIMPNGVTFISILSACTHAGLVDE 420
           KL DKNLYCWNAVIEGLAVHGYAEKALRMF IMEREKIMPNGVTFISILSACTHAGLV E
Sbjct: 361 KLKDKNLYCWNAVIEGLAVHGYAEKALRMFVIMEREKIMPNGVTFISILSACTHAGLVVE 420

Query: 421 GRSRFLSMTRDYDIRPDIRHYGCMVDMLSKSGYLNEALELIKSMEFEPNSIIWGALLNGC 480
           GRSRFLSM RDY I P++ HYGCMVDMLSK+G L+EALELI  MEFEPNSIIWGALLNGC
Sbjct: 421 GRSRFLSMIRDYGIHPEVEHYGCMVDMLSKAGLLDEALELINGMEFEPNSIIWGALLNGC 480

Query: 481 KLHGNCEIAEDAVEQLMILEPMNSGHYNLLVSMYAEEKDWMEVVHIRSMMKEKGVEKKYP 540
           KLHGN EIA+DAV +L ILEP NSGHYNLLVSMYAEEK W+EV HIR+MMKE GVEKKYP
Sbjct: 481 KLHGNSEIAKDAVRRLNILEPKNSGHYNLLVSMYAEEKHWIEVAHIRAMMKENGVEKKYP 540

Query: 541 GSSWIELEGTIHQFSASADSHPDSDKIYFILTELDGQLKLAGYILEPSV 589
           GSSWIELEG IHQFSASAD HPDSDKIYFILTELDGQLKLAG +LEPSV
Sbjct: 541 GSSWIELEGRIHQFSASADCHPDSDKIYFILTELDGQLKLAGNVLEPSV 587

BLAST of CsGy3G021830 vs. ExPASy TrEMBL
Match: A0A6J1HIB3 (pentatricopeptide repeat-containing protein At1g06143 OS=Cucurbita moschata OX=3662 GN=LOC111463844 PE=4 SV=1)

HSP 1 Score: 990 bits (2560), Expect = 0.0
Identity = 488/589 (82.85%), Postives = 521/589 (88.46%), Query Frame = 0

Query: 1   MFSFVTTNALKQLTRSIGNFVSPPSISMPLQPPSRPSFKQTLLNRIKNCSTINELHGLCA 60
           MFS   TNALKQ+TRSI NFVS  S    LQ     +FKQTLL+RIKNCSTINEL G+ A
Sbjct: 1   MFSITPTNALKQITRSISNFVSS-STPRTLQGSYVSTFKQTLLDRIKNCSTINELDGIYA 60

Query: 61  SMIKTNAIQDCFLVHHFISASFALNSVHYPVFAFTQMENPNVFVYNAMIKGFVYCGYPFR 120
           SMIKTNA QDCFLV+ FISAS   NSV YPV AFTQMENPNVFVYNAMI+GFVYCGYPFR
Sbjct: 61  SMIKTNATQDCFLVNQFISASLTFNSVDYPVLAFTQMENPNVFVYNAMIRGFVYCGYPFR 120

Query: 121 ALQCYVHMLEESNVLPTSYTFSSLVKACTFMCAVELGQMVHCHIWKKGFESHLFVQTALV 180
           A+QCYVHMLE S VLP+SYTFSSLVKACT MCA++LG+M+HCHIWK G E  +FVQT+L+
Sbjct: 121 AIQCYVHMLE-SKVLPSSYTFSSLVKACTCMCALDLGRMIHCHIWKNGLELDVFVQTSLI 180

Query: 181 DFYSKLEILSEARKVFDEMCERDAFAWTAMVSALARVGDMDSARKLFEEMPERNTATWNT 240
           D YS LE   +ARKVFDEM ERD FAWT MVSALAR GDMDSARKLFEEMPE NTATWNT
Sbjct: 181 DLYSNLERFGDARKVFDEMRERDTFAWTTMVSALARAGDMDSARKLFEEMPESNTATWNT 240

Query: 241 MIDGYARLGNVESAELLFNQMPTKDIISWTTMITCYSQNKQYQDALAIYSEMRLNGIIPD 300
           MIDGYARLGNVESAE LFNQMP +DIISWTTMITCYSQNKQY++AL IY  MRLNGIIPD
Sbjct: 241 MIDGYARLGNVESAEFLFNQMPARDIISWTTMITCYSQNKQYEEALTIYGNMRLNGIIPD 300

Query: 301 EVTMSTVASACAHIGALELGKEIHHYVMSQGLNLDVYIGSALVDMYAKCGSLDLSLLIFF 360
           EVTMSTV SACAH+GALELGKEIHHY MS+GLNLDVYIGSALVDMYAKCGSLD SLL+FF
Sbjct: 301 EVTMSTVVSACAHVGALELGKEIHHYAMSRGLNLDVYIGSALVDMYAKCGSLDRSLLVFF 360

Query: 361 KLTDKNLYCWNAVIEGLAVHGYAEKALRMFAIMEREKIMPNGVTFISILSACTHAGLVDE 420
           KL DKNLYCWNAVIEGLAVHGYAEKALRMF IMEREKIMPNGVTFISILSACTHAGLV E
Sbjct: 361 KLKDKNLYCWNAVIEGLAVHGYAEKALRMFVIMEREKIMPNGVTFISILSACTHAGLVIE 420

Query: 421 GRSRFLSMTRDYDIRPDIRHYGCMVDMLSKSGYLNEALELIKSMEFEPNSIIWGALLNGC 480
           GRSRF SM RDY IRP++ HYGCMVDMLSK+G L+EALELI  MEFEPNSIIWGALLNGC
Sbjct: 421 GRSRFSSMIRDYGIRPEVEHYGCMVDMLSKAGLLDEALELINGMEFEPNSIIWGALLNGC 480

Query: 481 KLHGNCEIAEDAVEQLMILEPMNSGHYNLLVSMYAEEKDWMEVVHIRSMMKEKGVEKKYP 540
           KLHGN EIA+DAV+QL +LEP NSGHYNLLVSMYAEEK WM+V HIR+MMKE GVEKKYP
Sbjct: 481 KLHGNSEIAKDAVQQLTVLEPKNSGHYNLLVSMYAEEKHWMKVAHIRAMMKENGVEKKYP 540

Query: 541 GSSWIELEGTIHQFSASADSHPDSDKIYFILTELDGQLKLAGYILEPSV 589
           GSSWIELEG IHQFSASA+ HPDSDKIYFILTELDGQLKLAG +LEPSV
Sbjct: 541 GSSWIELEGRIHQFSASANCHPDSDKIYFILTELDGQLKLAGNVLEPSV 587

BLAST of CsGy3G021830 vs. TAIR 10
Match: AT1G06150.1 (basic helix-loop-helix (bHLH) DNA-binding superfamily protein )

HSP 1 Score: 632.5 bits (1630), Expect = 3.5e-181
Identity = 322/586 (54.95%), Postives = 414/586 (70.65%), Query Frame = 0

Query: 1    MFSFVTTNALKQLTRSIGNFVSPPSISMPLQPPSRPSFKQTLLNRIKNCSTINELHGLCA 60
            M +F   ++L+  +  + +F    S S+ L PP+       L   IK CST   L    A
Sbjct: 746  MNAFANVHSLRVPSHHLRDF----SASLSLAPPN-------LKKIIKQCSTPKLLESALA 805

Query: 61   SMIKTNAIQDCFLVHHFISASFALNSVHYPVFAFTQMENPNVFVYNAMIKGFVYCGYPFR 120
            +MIKT+  QDC L++ FI+A  +   +   V   TQM+ PNVFVYNA+ KGFV C +P R
Sbjct: 806  AMIKTSLNQDCRLMNQFITACTSFKRLDLAVSTMTQMQEPNVFVYNALFKGFVTCSHPIR 865

Query: 121  ALQCYVHMLEESNVLPTSYTFSSLVKACTFMCAVELGQMVHCHIWKKGFESHLFVQTALV 180
            +L+ YV ML +S V P+SYT+SSLVKA +F  A   G+ +  HIWK GF  H+ +QT L+
Sbjct: 866  SLELYVRMLRDS-VSPSSYTYSSLVKASSF--ASRFGESLQAHIWKFGFGFHVKIQTTLI 925

Query: 181  DFYSKLEILSEARKVFDEMCERDAFAWTAMVSALARVGDMDSARKLFEEMPERNTATWNT 240
            DFYS    + EARKVFDEM ERD  AWT MVSA  RV DMDSA  L  +M E+N AT N 
Sbjct: 926  DFYSATGRIREARKVFDEMPERDDIAWTTMVSAYRRVLDMDSANSLANQMSEKNEATSNC 985

Query: 241  MIDGYARLGNVESAELLFNQMPTKDIISWTTMITCYSQNKQYQDALAIYSEMRLNGIIPD 300
            +I+GY  LGN+E AE LFNQMP KDIISWTTMI  YSQNK+Y++A+A++ +M   GIIPD
Sbjct: 986  LINGYMGLGNLEQAESLFNQMPVKDIISWTTMIKGYSQNKRYREAIAVFYKMMEEGIIPD 1045

Query: 301  EVTMSTVASACAHIGALELGKEIHHYVMSQGLNLDVYIGSALVDMYAKCGSLDLSLLIFF 360
            EVTMSTV SACAH+G LE+GKE+H Y +  G  LDVYIGSALVDMY+KCGSL+ +LL+FF
Sbjct: 1046 EVTMSTVISACAHLGVLEIGKEVHMYTLQNGFVLDVYIGSALVDMYSKCGSLERALLVFF 1105

Query: 361  KLTDKNLYCWNAVIEGLAVHGYAEKALRMFAIMEREKIMPNGVTFISILSACTHAGLVDE 420
             L  KNL+CWN++IEGLA HG+A++AL+MFA ME E + PN VTF+S+ +ACTHAGLVDE
Sbjct: 1106 NLPKKNLFCWNSIIEGLAAHGFAQEALKMFAKMEMESVKPNAVTFVSVFTACTHAGLVDE 1165

Query: 421  GRSRFLSMTRDYDIRPDIRHYGCMVDMLSKSGYLNEALELIKSMEFEPNSIIWGALLNGC 480
            GR  + SM  DY I  ++ HYG MV + SK+G + EALELI +MEFEPN++IWGALL+GC
Sbjct: 1166 GRRIYRSMIDDYSIVSNVEHYGGMVHLFSKAGLIYEALELIGNMEFEPNAVIWGALLDGC 1225

Query: 481  KLHGNCEIAEDAVEQLMILEPMNSGHYNLLVSMYAEEKDWMEVVHIRSMMKEKGVEKKYP 540
            ++H N  IAE A  +LM+LEPMNSG+Y LLVSMYAE+  W +V  IR  M+E G+EK  P
Sbjct: 1226 RIHKNLVIAEIAFNKLMVLEPMNSGYYFLLVSMYAEQNRWRDVAEIRGRMRELGIEKICP 1285

Query: 541  GSSWIELEGTIHQFSASADSHPDSDKIYFILTELDGQLKLAGYILE 587
            G+S I ++   H F+A+  SH  SD++  +L E+  Q+ LAGY+ E
Sbjct: 1286 GTSSIRIDKRDHLFAAADKSHSASDEVCLLLDEIYDQMGLAGYVQE 1317

BLAST of CsGy3G021830 vs. TAIR 10
Match: AT3G29230.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 406.4 bits (1043), Expect = 4.1e-113
Identity = 209/586 (35.67%), Postives = 343/586 (58.53%), Query Frame = 0

Query: 27  SMPLQPPSRPSFKQTLLNRIKN---CSTINELHGLCASMIKTNAIQDCFLVHHFISASFA 86
           S+P++ PS  S ++    R+++   C+ +N++  L A +I+ N  +D  +    ISA   
Sbjct: 4   SLPVRAPSWVSSRRIFEERLQDLPKCANLNQVKQLHAQIIRRNLHEDLHIAPKLISALSL 63

Query: 87  LNSVHYPVFAFTQMENPNVFVYNAMIKGFVYCGYPFRALQCYVHMLEESNVLPTSYTFSS 146
               +  V  F Q++ PNV + N++I+       P++A   +  M +   +   ++T+  
Sbjct: 64  CRQTNLAVRVFNQVQEPNVHLCNSLIRAHAQNSQPYQAFFVFSEM-QRFGLFADNFTYPF 123

Query: 147 LVKACTFMCAVELGQMVHCHIWKKGFESHLFVQTALVDFYS------------------- 206
           L+KAC+    + + +M+H HI K G  S ++V  AL+D YS                   
Sbjct: 124 LLKACSGQSWLPVVKMMHNHIEKLGLSSDIYVPNALIDCYSRCGGLGVRDAMKLFEKMSE 183

Query: 207 --------------KLEILSEARKVFDEMCERDAFAWTAMVSALARVGDMDSARKLFEEM 266
                         K   L +AR++FDEM +RD  +W  M+   AR  +M  A +LFE+M
Sbjct: 184 RDTVSWNSMLGGLVKAGELRDARRLFDEMPQRDLISWNTMLDGYARCREMSKAFELFEKM 243

Query: 267 PERNTATWNTMIDGYARLGNVESAELLFNQM--PTKDIISWTTMITCYSQNKQYQDALAI 326
           PERNT +W+TM+ GY++ G++E A ++F++M  P K++++WT +I  Y++    ++A  +
Sbjct: 244 PERNTVSWSTMVMGYSKAGDMEMARVMFDKMPLPAKNVVTWTIIIAGYAEKGLLKEADRL 303

Query: 327 YSEMRLNGIIPDEVTMSTVASACAHIGALELGKEIHHYVMSQGLNLDVYIGSALVDMYAK 386
             +M  +G+  D   + ++ +AC   G L LG  IH  +    L  + Y+ +AL+DMYAK
Sbjct: 304 VDQMVASGLKFDAAAVISILAACTESGLLSLGMRIHSILKRSNLGSNAYVLNALLDMYAK 363

Query: 387 CGSLDLSLLIFFKLTDKNLYCWNAVIEGLAVHGYAEKALRMFAIMEREKIMPNGVTFISI 446
           CG+L  +  +F  +  K+L  WN ++ GL VHG+ ++A+ +F+ M RE I P+ VTFI++
Sbjct: 364 CGNLKKAFDVFNDIPKKDLVSWNTMLHGLGVHGHGKEAIELFSRMRREGIRPDKVTFIAV 423

Query: 447 LSACTHAGLVDEGRSRFLSMTRDYDIRPDIRHYGCMVDMLSKSGYLNEALELIKSMEFEP 506
           L +C HAGL+DEG   F SM + YD+ P + HYGC+VD+L + G L EA++++++M  EP
Sbjct: 424 LCSCNHAGLIDEGIDYFYSMEKVYDLVPQVEHYGCLVDLLGRVGRLKEAIKVVQTMPMEP 483

Query: 507 NSIIWGALLNGCKLHGNCEIAEDAVEQLMILEPMNSGHYNLLVSMYAEEKDWMEVVHIRS 566
           N +IWGALL  C++H   +IA++ ++ L+ L+P + G+Y+LL ++YA  +DW  V  IRS
Sbjct: 484 NVVIWGALLGACRMHNEVDIAKEVLDNLVKLDPCDPGNYSLLSNIYAAAEDWEGVADIRS 543

Query: 567 MMKEKGVEKKYPGSSWIELEGTIHQFSASADSHPDSDKIYFILTEL 575
            MK  GVEK   G+S +ELE  IH+F+    SHP SD+IY +L  L
Sbjct: 544 KMKSMGVEKP-SGASSVELEDGIHEFTVFDKSHPKSDQIYQMLGSL 587

BLAST of CsGy3G021830 vs. TAIR 10
Match: AT1G08070.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 396.4 bits (1017), Expect = 4.2e-110
Identity = 219/654 (33.49%), Postives = 345/654 (52.75%), Query Frame = 0

Query: 24  PSISMPL-------QPPSRPSFKQTLLNRIKNCSTINELHGLCASMIK----------TN 83
           PS S P         PP         L+ + NC T+  L  + A MIK          + 
Sbjct: 11  PSSSYPFHFLPSSSDPPYDSIRNHPSLSLLHNCKTLQSLRIIHAQMIKIGLHNTNYALSK 70

Query: 84  AIQDCFLVHHFISASFALNSVHYPVFAFTQMENPNVFVYNAMIKGFVYCGYPFRALQCYV 143
            I+ C L  HF         + Y +  F  ++ PN+ ++N M +G      P  AL+ YV
Sbjct: 71  LIEFCILSPHF-------EGLPYAISVFKTIQEPNLLIWNTMFRGHALSSDPVSALKLYV 130

Query: 144 HMLEESNVLPTSYTFSSLVKACTFMCAVELGQMVHCHIWKKGFESHLFVQTALVDFYSKL 203
            M+    +LP SYTF  ++K+C    A + GQ +H H+ K G +  L+V T+L+  Y + 
Sbjct: 131 CMI-SLGLLPNSYTFPFVLKSCAKSKAFKEGQQIHGHVLKLGCDLDLYVHTSLISMYVQN 190

Query: 204 EILSEARKVFDEMCERDAFAWTAMVSALARVGDMDSARKLFEEMPERNTATWNTMIDGYA 263
             L +A KVFD+   RD  ++TA++   A  G +++A+KLF+E+P ++  +WN MI GYA
Sbjct: 191 GRLEDAHKVFDKSPHRDVVSYTALIKGYASRGYIENAQKLFDEIPVKDVVSWNAMISGYA 250

Query: 264 RLGN-------------------------------------------------------- 323
             GN                                                        
Sbjct: 251 ETGNYKEALELFKDMMKTNVRPDESTMVTVVSACAQSGSIELGRQVHLWIDDHGFGSNLK 310

Query: 324 --------------VESAELLFNQMPTKDIISWTTMITCYSQNKQYQDALAIYSEMRLNG 383
                         +E+A  LF ++P KD+ISW T+I  Y+    Y++AL ++ EM  +G
Sbjct: 311 IVNALIDLYSKCGELETACGLFERLPYKDVISWNTLIGGYTHMNLYKEALLLFQEMLRSG 370

Query: 384 IIPDEVTMSTVASACAHIGALELGKEIHHYV--MSQGLNLDVYIGSALVDMYAKCGSLDL 443
             P++VTM ++  ACAH+GA+++G+ IH Y+    +G+     + ++L+DMYAKCG ++ 
Sbjct: 371 ETPNDVTMLSILPACAHLGAIDIGRWIHVYIDKRLKGVTNASSLRTSLIDMYAKCGDIEA 430

Query: 444 SLLIFFKLTDKNLYCWNAVIEGLAVHGYAEKALRMFAIMEREKIMPNGVTFISILSACTH 503
           +  +F  +  K+L  WNA+I G A+HG A+ +  +F+ M +  I P+ +TF+ +LSAC+H
Sbjct: 431 AHQVFNSILHKSLSSWNAMIFGFAMHGRADASFDLFSRMRKIGIQPDDITFVGLLSACSH 490

Query: 504 AGLVDEGRSRFLSMTRDYDIRPDIRHYGCMVDMLSKSGYLNEALELIKSMEFEPNSIIWG 563
           +G++D GR  F +MT+DY + P + HYGCM+D+L  SG   EA E+I  ME EP+ +IW 
Sbjct: 491 SGMLDLGRHIFRTMTQDYKMTPKLEHYGCMIDLLGHSGLFKEAEEMINMMEMEPDGVIWC 550

Query: 564 ALLNGCKLHGNCEIAEDAVEQLMILEPMNSGHYNLLVSMYAEEKDWMEVVHIRSMMKEKG 589
           +LL  CK+HGN E+ E   E L+ +EP N G Y LL ++YA    W EV   R+++ +KG
Sbjct: 551 SLLKACKMHGNVELGESFAENLIKIEPENPGSYVLLSNIYASAGRWNEVAKTRALLNDKG 610

BLAST of CsGy3G021830 vs. TAIR 10
Match: AT2G29760.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 379.8 bits (974), Expect = 4.1e-105
Identity = 210/649 (32.36%), Postives = 343/649 (52.85%), Query Frame = 0

Query: 24  PSISMPLQPPSRPSFKQTLLNRIKNCSTINELHGLCASMIKTNAIQDCFLVHHF--ISAS 83
           P+ S P QP +  + +   ++ I+ C ++ +L      MI+T    D +       ++A 
Sbjct: 16  PNFSNPNQPTTN-NERSRHISLIERCVSLRQLKQTHGHMIRTGTFSDPYSASKLFAMAAL 75

Query: 84  FALNSVHYPVFAFTQMENPNVFVYNAMIKGFVYCGYPFRALQCYVHMLEESNVLPTSYTF 143
            +  S+ Y    F ++  PN F +N +I+ +     P  ++  ++ M+ ES   P  YTF
Sbjct: 76  SSFASLEYARKVFDEIPKPNSFAWNTLIRAYASGPDPVLSIWAFLDMVSESQCYPNKYTF 135

Query: 144 SSLVKACTFMCAVELGQMVHCHIWKKGFESHLFVQTALVDFYSKLEILSEARKVFDEMCE 203
             L+KA   + ++ LGQ +H    K    S +FV  +L+  Y     L  A KVF  + E
Sbjct: 136 PFLIKAAAEVSSLSLGQSLHGMAVKSAVGSDVFVANSLIHCYFSCGDLDSACKVFTTIKE 195

Query: 204 RDAFAWTAMV-----------------------------------SALARV--------- 263
           +D  +W +M+                                   SA A++         
Sbjct: 196 KDVVSWNSMINGFVQKGSPDKALELFKKMESEDVKASHVTMVGVLSACAKIRNLEFGRQV 255

Query: 264 --------------------------GDMDSARKLFEEMPERNTATWNTMIDGYARLGNV 323
                                     G ++ A++LF+ M E++  TW TM+DGYA   + 
Sbjct: 256 CSYIEENRVNVNLTLANAMLDMYTKCGSIEDAKRLFDAMEEKDNVTWTTMLDGYAISEDY 315

Query: 324 ESAELLFNQMPTKDIISWTTMITCYSQNKQYQDALAIYSEMRL-NGIIPDEVTMSTVASA 383
           E+A  + N MP KDI++W  +I+ Y QN +  +AL ++ E++L   +  +++T+ +  SA
Sbjct: 316 EAAREVLNSMPQKDIVAWNALISAYEQNGKPNEALIVFHELQLQKNMKLNQITLVSTLSA 375

Query: 384 CAHIGALELGKEIHHYVMSQGLNLDVYIGSALVDMYAKCGSLDLSLLIFFKLTDKNLYCW 443
           CA +GALELG+ IH Y+   G+ ++ ++ SAL+ MY+KCG L+ S  +F  +  ++++ W
Sbjct: 376 CAQVGALELGRWIHSYIKKHGIRMNFHVTSALIHMYSKCGDLEKSREVFNSVEKRDVFVW 435

Query: 444 NAVIEGLAVHGYAEKALRMFAIMEREKIMPNGVTFISILSACTHAGLVDEGRSRFLSMTR 503
           +A+I GLA+HG   +A+ MF  M+   + PNGVTF ++  AC+H GLVDE  S F  M  
Sbjct: 436 SAMIGGLAMHGCGNEAVDMFYKMQEANVKPNGVTFTNVFCACSHTGLVDEAESLFHQMES 495

Query: 504 DYDIRPDIRHYGCMVDMLSKSGYLNEALELIKSMEFEPNSIIWGALLNGCKLHGNCEIAE 563
           +Y I P+ +HY C+VD+L +SGYL +A++ I++M   P++ +WGALL  CK+H N  +AE
Sbjct: 496 NYGIVPEEKHYACIVDVLGRSGYLEKAVKFIEAMPIPPSTSVWGALLGACKIHANLNLAE 555

Query: 564 DAVEQLMILEPMNSGHYNLLVSMYAEEKDWMEVVHIRSMMKEKGVEKKYPGSSWIELEGT 600
            A  +L+ LEP N G + LL ++YA+   W  V  +R  M+  G+ KK PG S IE++G 
Sbjct: 556 MACTRLLELEPRNDGAHVLLSNIYAKLGKWENVSELRKHMRVTGL-KKEPGCSSIEIDGM 615

BLAST of CsGy3G021830 vs. TAIR 10
Match: AT5G66520.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 379.4 bits (973), Expect = 5.3e-105
Identity = 198/544 (36.40%), Postives = 314/544 (57.72%), Query Frame = 0

Query: 46  IKNCSTINELHGLCASMIKTNAIQDCFLVHHFIS---ASFALNSVHYPVFAFTQMENPNV 105
           ++ CS   EL  + A M+KT  +QD + +  F+S   +S + + + Y    F   + P+ 
Sbjct: 21  LQRCSKQEELKQIHARMLKTGLMQDSYAITKFLSFCISSTSSDFLPYAQIVFDGFDRPDT 80

Query: 106 FVYNAMIKGFVYCGYPFRALQCYVHMLEESNVLPTSYTFSSLVKACTFMCAVELGQMVHC 165
           F++N MI+GF     P R+L  Y  ML  S+    +YTF SL+KAC+ + A E    +H 
Sbjct: 81  FLWNLMIRGFSCSDEPERSLLLYQRML-CSSAPHNAYTFPSLLKACSNLSAFEETTQIHA 140

Query: 166 HIWKKGFESHLFVQTALVDFYSKLEILSEARKVFDEMCERDAFAWTAMVSALARVGDMDS 225
            I K G+E+                               D +A  +++++ A  G+   
Sbjct: 141 QITKLGYEN-------------------------------DVYAVNSLINSYAVTGNFKL 200

Query: 226 ARKLFEEMPERNTATWNTMIDGYARLGNVESAELLFNQMPTKDIISWTTMITCYSQNKQY 285
           A  LF+ +PE +  +WN++I GY + G ++ A  LF +M  K+ ISWTTMI+ Y Q    
Sbjct: 201 AHLLFDRIPEPDDVSWNSVIKGYVKAGKMDIALTLFRKMAEKNAISWTTMISGYVQADMN 260

Query: 286 QDALAIYSEMRLNGIIPDEVTMSTVASACAHIGALELGKEIHHYVMSQGLNLDVYIGSAL 345
           ++AL ++ EM+ + + PD V+++   SACA +GALE GK IH Y+    + +D  +G  L
Sbjct: 261 KEALQLFHEMQNSDVEPDNVSLANALSACAQLGALEQGKWIHSYLNKTRIRMDSVLGCVL 320

Query: 346 VDMYAKCGSLDLSLLIFFKLTDKNLYCWNAVIEGLAVHGYAEKALRMFAIMEREKIMPNG 405
           +DMYAKCG ++ +L +F  +  K++  W A+I G A HG+  +A+  F  M++  I PN 
Sbjct: 321 IDMYAKCGEMEEALEVFKNIKKKSVQAWTALISGYAYHGHGREAISKFMEMQKMGIKPNV 380

Query: 406 VTFISILSACTHAGLVDEGRSRFLSMTRDYDIRPDIRHYGCMVDMLSKSGYLNEALELIK 465
           +TF ++L+AC++ GLV+EG+  F SM RDY+++P I HYGC+VD+L ++G L+EA   I+
Sbjct: 381 ITFTAVLTACSYTGLVEEGKLIFYSMERDYNLKPTIEHYGCIVDLLGRAGLLDEAKRFIQ 440

Query: 466 SMEFEPNSIIWGALLNGCKLHGNCEIAEDAVEQLMILEPMNSGHYNLLVSMYAEEKDWME 525
            M  +PN++IWGALL  C++H N E+ E+  E L+ ++P + G Y    +++A +K W +
Sbjct: 441 EMPLKPNAVIWGALLKACRIHKNIELGEEIGEILIAIDPYHGGRYVHKANIHAMDKKWDK 500

Query: 526 VVHIRSMMKEKGVEKKYPGSSWIELEGTIHQFSASADSHPDSDKIYFILTELDGQLKLAG 585
               R +MKE+GV  K PG S I LEGT H+F A   SHP+ +KI      +  +L+  G
Sbjct: 501 AAETRRLMKEQGV-AKVPGCSTISLEGTTHEFLAGDRSHPEIEKIQSKWRIMRRKLEENG 531

Query: 586 YILE 587
           Y+ E
Sbjct: 561 YVPE 531

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q56X054.9e-18054.95Pentatricopeptide repeat-containing protein At1g06143 OS=Arabidopsis thaliana OX... [more]
Q9LS725.7e-11235.67Pentatricopeptide repeat-containing protein At3g29230 OS=Arabidopsis thaliana OX... [more]
Q9LN015.9e-10933.49Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidop... [more]
O823805.7e-10432.36Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidop... [more]
Q9FJY77.5e-10436.40Pentatricopeptide repeat-containing protein At5g66520 OS=Arabidopsis thaliana OX... [more]
Match NameE-valueIdentityDescription
XP_011651448.10.099.17pentatricopeptide repeat-containing protein At1g06143 [Cucumis sativus] >XP_0116... [more]
XP_008447444.10.092.67PREDICTED: pentatricopeptide repeat-containing protein At1g06145-like [Cucumis m... [more]
XP_038888390.10.087.90pentatricopeptide repeat-containing protein At1g06143 [Benincasa hispida][more]
XP_022967388.10.083.02pentatricopeptide repeat-containing protein At1g06143 [Cucurbita maxima][more]
KAG7011261.10.083.02Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma sub... [more]
Match NameE-valueIdentityDescription
A0A0A0LB990.099.17Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G395920 PE=4 SV=1[more]
A0A5A7T9J00.092.67Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946... [more]
A0A1S3BHH10.092.67pentatricopeptide repeat-containing protein At1g06145-like OS=Cucumis melo OX=36... [more]
A0A6J1HUY60.083.02pentatricopeptide repeat-containing protein At1g06143 OS=Cucurbita maxima OX=366... [more]
A0A6J1HIB30.082.85pentatricopeptide repeat-containing protein At1g06143 OS=Cucurbita moschata OX=3... [more]
Match NameE-valueIdentityDescription
AT1G06150.13.5e-18154.95basic helix-loop-helix (bHLH) DNA-binding superfamily protein [more]
AT3G29230.14.1e-11335.67Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT1G08070.14.2e-11033.49Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT2G29760.14.1e-10532.36Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT5G66520.15.3e-10536.40Tetratricopeptide repeat (TPR)-like superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Cucumber (Gy14) v2.1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 206..234
e-value: 4.6E-6
score: 24.5
coord: 104..137
e-value: 0.0018
score: 16.3
coord: 368..401
e-value: 4.4E-5
score: 21.4
coord: 267..301
e-value: 5.8E-9
score: 33.6
coord: 237..266
e-value: 3.9E-8
score: 31.0
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 506..535
e-value: 0.032
score: 14.5
coord: 440..465
e-value: 0.003
score: 17.7
IPR002885Pentatricopeptide repeatPFAMPF12854PPR_1coord: 233..261
e-value: 1.9E-7
score: 30.7
coord: 199..230
e-value: 1.8E-5
score: 24.3
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 100..148
e-value: 3.7E-9
score: 36.6
coord: 365..412
e-value: 2.6E-8
score: 33.9
coord: 264..312
e-value: 3.8E-11
score: 43.0
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 101..136
score: 9.086975
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 203..237
score: 11.432693
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 503..537
score: 9.218511
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 265..299
score: 12.013642
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 366..400
score: 10.490022
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 234..330
e-value: 7.6E-26
score: 93.3
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 157..233
e-value: 5.4E-15
score: 57.2
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 37..156
e-value: 2.3E-10
score: 42.4
coord: 340..545
e-value: 8.4E-38
score: 132.5
IPR011990Tetratricopeptide-like helical domain superfamilySUPERFAMILY48452TPR-likecoord: 172..297
NoneNo IPR availablePANTHERPTHR47928REPEAT-CONTAINING PROTEIN, PUTATIVE-RELATEDcoord: 34..582
NoneNo IPR availablePANTHERPTHR47928:SF32HELIX LOOP HELIX PROTEIN, PUTATIVE-RELATEDcoord: 34..582

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CsGy3G021830.2CsGy3G021830.2mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0005515 protein binding