Tan0004354 (gene) Snake gourd v1

Overview
Name: Tan0004354
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Pentatricopeptide repeat-containing protein
Location: LG03: 50627014 .. 50640673 (+)
RNA-Seq Expression: Tan0004354
Synteny: Tan0004354
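
The Location field above can be turned into a linkage group, a start, an end, a strand and a span length. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (as in GFF3); the function name `parse_location` is hypothetical and not part of the database.

```python
import re

def parse_location(text):
    """Parse a location string such as 'LG03: 50627014 .. 50640673 (+)'.

    Assumes 1-based, inclusive coordinates, so length = end - start + 1.
    """
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "seqid": seqid,
        "start": start,
        "end": end,
        "strand": strand,
        "length": end - start + 1,
    }

loc = parse_location("LG03: 50627014 .. 50640673 (+)")
print(loc["seqid"], loc["strand"], loc["length"])
```

Under the inclusive-coordinate assumption, the gene spans 13,660 bp on the plus strand of LG03.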
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ACCTCCTTTCACTTTTCATTTTATGGTTTCTTTGGTCGGAAAAGAAGGAACTTCGAAGGGGAAGTTGCGCGCCTCCTCCCTCCCCGTGTGTGTTTCTTCTTCGCGCAGCCAACCACCACCGTTGGCGTTCTTCTCGCGCACAAGCCATCTCTATTTGCGAGGAGTTCGCCGCCGTCAACAAACCTTCCTCTCCGTCAGATCTGCTTCCCTCTTCGTCGTCGTCTCCATCTTCCTCCTCGCGACCATCGTCTCCCTCTACCTCTCCCTCGACTGACGTCATCTACTTCGCTGGTTTGCCCTCCTCGACAGAACTCTCTCCCCCTCTCAATCCCTCAGTTCCTTCTCCCTCTCAGATCTCCATCCCTCGAGCCCTCTCCCTCTCCGACGCCTTTTCCTCTCTCCCTTTCAGAATGCAACCACTCGGCTCCCCCTCCCAATTCCTCAGTTTCACTATTTCCCTTTCAGATTTAACGGATCTCTCTTCCCCTCTCTACCTCTCAGATTTATCTCTCGCTAACTAGCTTGTTTGTGTTGCATAGGACGTTGAATTTTCAATTAGGAGGTTGCCTAGCGTTCGTTGTGGTCCGGGCTTGGTGTTGGAAGCTTCAGTTCGATGATACAAGTAAGTCTAGATCCCTTCCTTGCGTATTCAAATGTTAAAACTAGGCCTAAGTGTAGGTTTCATACCAAAGGTCGCCTAAGAAGGCAACTTGTTAGCTGACGAGTCCCTAAATCTATTCAAATGCAATATTTGTTGCTTGTAGGACCCTTGGTTGTCGCAGGCGAATTGGTGTTGACCTGGGAAAGGAAGAGCTCATCGTGTGTTCTACAACCAAGGTAAGTAACCATTTGGTTTTAAGCATCGAGCCATCAGTGAAAGAGCTTGGTGAGTGTGTGGACTTTACTGAATCTGGCATGTGTCTAAGTTGTGGATAGCGACTTAGTAGTATAGCGAAATTTGCATCTTTGAAAATCTCTGGCCTGATTGGAAGTGTTAAGCTTGTTTTATATGTTCATTGCACGTGTATGGGACTTCTGGTTTGTTAAAATTGTTGAGCATGTTACGTATGTTGTTTTACTTGTGTTTGGACTCTTTGGCTTATCGTGGTTATGGTTAGTGAAACTGTTGGTTGTTAGGGACCATGTATACAGGTGGTGGAAACCTGTGAACTTAAATCCCGAGTCTAACTTCCATAAGGGTCGCGAGACCTAGTTGAGGTGTTTTGGTTGGTCGGGGTGGTTTGAACGAGCAAGTAGTTGGGTGGAAGGACCCCTTTAGGTTAGGCACTGTTCAGGGTAACCTAGGGGCTTAGGAGGAGCCTCAAAAGTCGTGAGAGAACCCGTACTAACTAGCCTAGCTTCCCCAACCTTCGAGTCATTCTGAGCGAGTCCAGACCTTTAGGTTGACGGTATTGTATAGCATTTGGGTCGAGCACTCATTTGCAGGACTAGCATTTGAGTCGAGCACTCATTTGTAGGATTAGTATTTGGGTCGGGCATCCATTTGCAAGATTAGCGTTTGAGTCGAACACTCATTTGCAGGATTGACATTTGGGTTGAGCACTCGATTTGTAGGATTAGCATTTGGGTCGAGCATCCATTTGCAGGATAAGGCATTAAGGTCGAGCATTCTTTTTCGGGTTGTGGCATTTTGGTCGAGCGTCCTCTTGCGTTTCTGTGCAGGCATTAGAGCCTCGCTCTTTCGCGTCGGGGTGCACCCCGTTCTAGACACCTAGGTTTTCTGGATCGTCGGTAGGTCTCGAGAAGGGGTTTGCATAACGGGGTGCCTAATTGTTCAATAGTCTTGGTTGTTGATGTTTGTTTTGGATGTTTCTGTTGTGAAGTAGTGGCTTTGACGTGAGTGGGCTATTTACTGAGTACTCTTATACTCACCCGTCTATTTTCCTATTTTTGGTTGCACGTAAGGGTAAACAGTGAGTCGAAAGACGACGACGGCGACGTGGCGAGGCTATAGGACCCCTCTATATTTTGTTGAGACT
ACTTTTTTATGTTTAGTATCCCTGTTTGGGTTTAGGTTCATACTTTTCTTGTGTTAAGTTATGTGTCCGTTGGATTATATCGATTCAATTTGGATTAGTGGTTGCATATCCTTCTTTTGTTTCGTATTTGGTAGCTTGTTGGTTTGTTTAAGCATGTAACTACCTCGTTATCTCCTTCTTGTGTTCTATGGAGCTCAACCGTTGTTGTATTATAGTTTGGAGGAGGTTGGACATTGGGTGGAGTCTTTTGAAGTGTTTCTTATGGTTAAAACAGGTTAAGGAAATTTTTGGATTTGTTCGGGTAGTGTCGTTATGTTGCCGAAATTTCGGCACAGAGGGTGATGCTCAGTGACGTCCCTAGTTCGAGTTGGTTTAAATACCAGGGGGGTTAGATCTTATTAAGATTAAATTTATAAAAATCGGACCTCTAAGTATTTTGCAATAGTAGTGGCAATGATCATGCATAGGGATTGAAGTTCTTAGATTTAATTAAGTACGATAACTAATCATGCACAATTTAATTTAGAGAAATGCAACTTTAATTTGAGTTTTAAAACAATAGATAAACCTAAATTATTATTTACAAGGGTGGAGTCCAATTCTGAAAGGAATGAACTCTTCACTCCGTAACAGAAACATTAGATCATCTTTGATTTATTTAGAATTAAACATAAAAACAAGGGTTAAGCAAAAACTTACATTGAAATATATGGAGAATTATCTGGCAGTTTTTTACTTCTTGTGTTCTAAGACTTCCACAAAGTCTTCCTTGCTATGACTCGAGCATGGGGCCAAGGCTTGTGGGAGCTTCTTGAAAGAAAATCCACAAGGGATTGAAGAAGAATAAATTTACAGAGACTTCTCTAATTCTCTATTGAGGAGATGTGTAGAAAAGGTTCACACAATGACTCACCCTTAGGGTCCTATTTATCATGGCTTAGGGAGAATAATTAAATAATTTGAATCATATTCAAATCTTATTCAAATCTAATATATTAAAATACATTTATCTAACTAATTTGAATCATATTCAAATCTTTCAAAATTCTCCTATGACATTTATTCTTAATTTGAATCTAATTAAAATTAATAAATATCATTTATTTAATGTATCGTAATATATTAATGATTTTATTATTTAATGTATCAAAATACAATAAATATATTTTTCTCTTTGATACTTAATATATCTAATCTATTAAATATTAATTTGAACATTTCAAAATTTTCTTTCAACTAAATTAATCCTTATTTGTTCATATGAACTAGACAGGGGACCTTTGTTGGGATTTTATGCCTTAAACTCGTGTAGTTTGTAAATATCTGATGAAATTAATATAGAGTTATTTTTTCATCATTTATCGTGTCCTTGCATTTAAAGCTTAAAATCCAATAAACTAAGCTCCTGGGTTATGGTTATGAGAACTTGAACAGTGTGTAGTAGACACATAGGTAGAATCGTGTTCAAGTCTGTAACCCAAATGATCTATAGTATATGGATAAGTTTAGGAACCTCATCTTGGTAACACTATGGATGCGATTCATTTTGTATTTGATATAAACGAGGTAATCCAACTCGTTCATGTAGTTGACATGCGAGTGGGGGCATCCTCTGTAATGAGTTTGTACAAGATCGGACCGCGAATTATTAGCCAACGGATATAACACCGTTAACAGATTAGGTATAATAATTTATAGATGCCCATTAGTGACTCGACCTGACTCCTGAGCGGGTTACGAACTCTTGTCTATGAGGATTTGTCCTTAGACTAGTATGGGTGAGAGTGGCCAAAGTTGTCGAGTCAATATGTCTACCTTCTCAGAGACGAAACCTCGTGTGGAGCTAGAAACTTAACTCTGCGATATAGAATTCACTCGTTCCCGAGATAGGGAAAATAGATTAGTTGTTCATGATAACGCTACACACAAGTGTACATGATCGAAGTAATAACACTGGAATACTCAGTGGTATCGCATCACAGGAACAGGTTCACAAGCTT
AACTGAAATAACAATATTTGTAATGGAAGATGCAAGTGAGGTAACCAAGGTTGTAAAATTTGTTGATGAGAGGGTGTTCTAAATTATTGAAGACAAAAAACGAAATCAATGATAAAAAGAAGGGAATTCAAGTTGCTCTTGCCCCTTCGGAGAAAATGCTGTTGAAAATGCAACTGCAATTGTGTCAACAATCTTAAACCTTGAGTAAATTGGGTTCTTCTACTAATCGCTTTTGCTCTCAAAACCTTTAGGAGTCTAAGCAATTAAACTCATATGTCTATGGCTAATTAAGCTTAGATAGTTTTGTACACAACCCTCTTGGTTTCGAAGTCTCGCCCTATGTCTAGGGTCGAAGTTTCATATCTTATGTCTAAGTTCGTTCCTCCGTTAGTTTTATGAGATTCGCTTTTACTTGACTCTTTCAATATCAAGTTCAAGCTTGTTGAACAAGTCCTAGTGATTGGCTAGGGTTGCCCAAATTTATTCAATAAAATCAAACTTAGGGTTAAGTGCCGACACAATTTCCTTGCTACTTTAAGCTTACCATCTCTCTTCTAGGGCATTCGCAACTTGAACCCCTAAAAAGAGATTTAGCTCATGGTTAAAACTAAAGAAAACATGAGATTTACATGTTGAATTTTAGAGACTTTGAAATCAGTAAACACAAAAACAGTAACTACAAGAGTCCAACTTATTTGGAAATCAACAACTAGAAGGAGCTTATGCGGAAATCAACAACAGAAATTTAATTGCATAACTTGGATAAAGAATACATAATATTGTTGAACTGAAATGAAATGAAAACTCAGTAGTCTTTAAATCTAAAGGCAGTAAAGTAAAATACAAAGAAAAACCTAGAAACTCCCTTGTTGGTCGATTACAAACCCCAAGCACAAGTTTTGAATATTCACTCTAATATTGTGTAAAAGAGGGACTAACCGAGAAGTTATTACAAGCTAAACAAAGAAGAAAAAGTTGTTGTAAAAATAGAACTAAAACATGAAAAGAAATGAGGGATTTTTTTTAAAGGAGGGATTTTTTTTTTTTTTTTTCGTTTTGCTTCTTCGCGCAACATCGACATTCATTTAATCAAGCTTTGGTTTCCATCTTTCTTCATTTTTGTTTCTCTTTCGATAATCAAGCAAGAACACCTTCGGCTTGGAAATTTTTCTCCTCTTCGAAGCAACCGGCAACCGATTGTCAGGATGAGTTTTTTTTAATTTTTTTTTTCATTTCCAGGTAAGTCATTCGATTTCAAGGTTTATTAAATTTCCAGTTGAACCCCCTGATAGCCATAAGCCCTCTTATGTTTTGATTTCTTTGATTTCCCCCTCTTTCCCCCTTAATTTTTTCTCTAACTTTTTTTCTTGTCTCCTTCAATTTCAAGGTAACATCTGGTTTTCTTGGGGGTTGATTTGCGGAAAAGAATATGAGGTTTTGTTTTCCTTAGTCAAAAATTTCAGCCTAGGTCGACTGTTATATATCAGTGAAAATGCCCAATCGAAAAACTAAAAAGAATCATTTTCTAAAAGGAAGGAGCACAAAGAATAAAAATAAAAAGAGAAAGTAAAAGCCCAACATTAATACAAGTTGTATGGAAGTAGTTTTCAAAGGAAAACATGAATTTTTGGATTCTTCTAAAGAATGATAACAGTCTAAAAGTTCCAAAAATAGCCACGTGGGGGTTACCGTATGCTAAGTTGTTTGTGGGTAATTCAGTTGCTGAGATAGTATTTGTGAATTGATTTCTGATTTAGTCCCATTCGTACACAAGCTTATTTTCTTTCATGTTTGTTGTGATTTCTTGTTTGATTTCATCTACCTTTGTCCAAGTTAACCTCCTCTGTGATTCCATACTCTTATTTTCTTGCTTTTCTTGTCGATTTCCTATATATTGATTAAACATATTGCAAATGAAGACATGTCTATATGAACCCATGCTTGAAATTACATCTTGAATTTCTTTTTGCTGGCATTTGTGATTGACACCAACTAGTTGC
TAACTTGGTAGATTTGATGTGTTGGCTATTGATGTGGTAGAGTTTAAGAACTTTTGATCTCTATTATAACTATCTGTCCATATATAGGGGAAATTAAGTAACCATTCTTTTAAGAATTATGGACACATTTGAAATTATAGTTGGATTGTAGAAGATTTTATACTATATTTTAATACTTTATCTTCGGTACATAGTTTAATAGAAGGCATATTGGATGCTAGTTCTTTATATTCCAACTTAAGATACACCTCATTAACTTTTAAATGTAAATTCTATAATTTTAGTATCAGGATTTACTAAATGTAATTCTAGCCTAGCTATATTGGGTTCATTTAGTTAATTTTTCAAAAGAATAATTGAATAATTAAACAAAAACCTATGTATACTGAAAATTACTTTGATTTCTTGCATTATTGGCTGTTGGTAAGGTTTTATCCTACTTCATTAGGATTTAGTTTAGAACCCACCCTGTACCATAATAGTACTCTCACAGCATATTGTCTTGTAATATTTCCAAGAATTTTTGTACGGACCCCCCAACCAACAGTTTTGCAGGTGCTTTATGTCTTCAGAGCTCAATGTGAGGAGGAGACATTAAAGAAGGACCATCAGCGAGCTAGTTTTTATTCCTTTAATTGTTACTTCTTTTAGTAAATGATCTGTTGGGTTTAGTTTACTTTCTAACTTATGTATCTCATTCTTTAACTCATTAAGTTTGCTGGAGTTTGGTGATAGTTCTAGTTCGATTGACAGGGCTTAGTAGACTTAGTTGTTATTGGAAGTTAATGAAGATGATTGGTTTCCTTTCTTATGATGTTCTAACTTCTGGGCTTTATTAGGTGATTGCTTGATACGAGGTATGACTCATTCTTTTGTCTGAGTTTTGTGAGTTATTATAGGCCTCAAGAGCGTGAAAATGAGACTTATCCCTTTCTCCTTTTCACTTTATCTTTCTTTATAGCTAATTGGAGATCTTTCTTGTAATCCACCTTTGGTGCTTGGAGTTTTTCTCCATTTCATTTATCAATGAAATTATTTCTATTATAAAAAAAATCAATTATTAGTTTTTGATTACACATGCCTTAAATATCTCTGATTTTCTGTTTTAGGCTTGCTCGATGTCGACCAACCACTTTCGAGGTTGTTTGGTTGTGAGGTTTGATTCTACAACTTTAAACTCTGTCGATTGTTGTCTAGCGTGTTTTTGAGGTTCCTTAGGTGTTGGTTGTTAATCATTAGAGTTTTAATTGTTTCAATTACTCTTTAAATTTTGTAAATTTAGTTCTAACGTGTAGGGACCTTAATGTTGCAGTTTCGAATCGATTCAAACACCTTTGATTAGCAAACAACAAGCGTAATGGCTCTTTCGGTAAGTTTTAATGTGTTTGGGACCTTTCGCAAATCACTTTAGGTGTTATTGTTTTGGTTCTAACGCATATGAGGCTTTTGTTTTGGAGTCCTTTTGAATACCTCCAAAAGCATCATTCGACAAGGACTAAAGTCTATTTTGGTAAGTTTTGGTTGAATACATTTGGACTCTAATTCTTGATCACTCTTTGTTTATAGTTTCAAAACACTTATAGCAAGAACCGAAGCCATTTTAGGTGAGGTTTAGGTAGTAATGATTCATTAAATCCTTTTAAAACTTCTCTGTACCGTTGTAAGTTTATTTCTAATCTTGAAAATTTTGGTTTTGGTTGCAACCTTGGTTTGGCTTTTGGGTGCTCAAAGCTAGTTTAAGGAAGCGTTGGAAGTTTCTAAGTTTGGTGTGCGGTTACTAGGGGCAAAGTAGAAACTTTTATTTTATTTTGAGTTAGTTTTTGGGATCGTTACATAGTTAGAGTTTGTGTTGATGCTCAAAGAAACCTCATGCATATTTATGGAGGCCCACATCAGAAATGACATGTATTGAAGAAGTTGTGGGTAGTACATGTCTTTGGCCTCAAAGTGTACTTAAGAGCTCGTAAGACACTTTTGTAGCATCAGTGATTGGTTGTTTT
TTTAAGCCTTTGTTTACGAACTTTCTTAGTTTCTTAAGAGGCAAAATGACACTTAAACTCTTCCTGCACTTTATGGCTACTTTGTAGGTTTCAATGGGTGAAGTTGACACTTTTAGGTCATTTTAATTAAGTTTTAGGCACCAAAGAATTCTTAAGCACCTTTAATATACTTTTGTAGCATCAGGAGGTTGGTTCTGTGCTACAGTTTCCCTTTAGGCACTTTCTTAGTTTCTTTAAAAGCACAATGTCACTTAAACTCCTTTTACACATTTTTGTTCAACTCTGTAGGTTTCATAGGTAGTTTTAGATCGTTAGTGAGTTATTTTAGGCCTCAAAATATGCTTAAGAGCCTTTAAGACGCTTTTGAAGATTCGAGGGGCCGATTGTGTGCATAAGTTTCAGTTTAGGCACTTTTTTTACTTTCTTTGAGCGTGAAAATGAGACTTAAACTCCTTTTATACACTTTTTTTGTAACTTTGTAATTTTCCTAGGTACTTTTAGGTCGTTTTGAGTTATTTTTGGACTCAAAGTGTGCTTAAGAGCCCTTAAGATACTTTTGAACATTTGAGGGGTTGGTTGTTTGTCTAAATTTTAGTTTAGGCATTTTTTTAGTTTCTTTGAGTGTGAAAACGAGACTTAAACTCTTTGCATCAACTTTTTGGCTACTTTGTATGTTCCAAGGGGTGAAGTAGACATTATTAGGTTGTTTTGAGTTACTTTAAGATACTTGATTACTCTGGTTTTTCTAATTGATGTAATGCATGCTAGGAGCGTAGAGACTTTACCTTGTGGATTGTGTTTGTTGGAAATTAATGAATTGGTCTAGTACTTTACATATTTAGAATATTGGTTCATTTGTGAATTTTCCATGATTTTGTGAGTTAGAGTTGAAAGTTGTGGTTGGTTAATCTGGATATAGATTATCTTTTTTTACCAACAGGTGCTAAGTAAAATTTTTGGTTTTGTTATAGTTACGTCGTTATGTTGTCGAAATTTTCAAAGCATATAGGTTGTTAGCTTAGTGGTGTTCCTTAGATTAGGTTAACATTTGGGGGAGTTAGCTTAGTGCTATTGGTAGTCCAGTTGCATGACCTTTTGACAAAGTTTTAATCACTATGTACATTTATGTATTAATATTTAAAATTATTGATTTCTTTAATAATTATTTTATTTGATTAAATATTCTTCTTTATTTCTTTCTCGATAATTTTTTTGTAGGTTGATGACAGAATATCATGGGAAGAATGGCACCTCCTCCTAGCGTTCCCTGATTATGAAAATTGCACCACTTTCTCCTTCTCTGAAGAATTTAAGATGTTTCTTCCCCGCTGCAAACTACGGCACAGGTTCAGCTCCTTGTGCCTTCTCAGAGTCGGATACGGCGGTAGGCCGAGACTGGAATGCTGTCATCGCCGCCGCCCCGTTCACTGGTGTCCTCCAAGATGAAGACCTTCTTCGCAAGACCCACATTTCTTCCTCTAAAACTTCCACTAATTCTACTGGGATCTATGTTCTAGACCTTATTAACCATGGCTCTTTAGAACCAGACCGAACCCTTTATGGTAAAATGCTAAACAAATGCACCAATTTGCGCAAACTCAAGCAGGGCAGAGCCATCCATGCACACATCCAGGGTTCTATGTTTGAGAATGATCTGGTTCTTCAGAACTTTATCCTAAACATGTACGCCAAATGTGGTAGTCTCGAGGAGGCACAAAACATGTTTGATAAAATGCCTACAAGAGACATGGTTAGTTGGACTGTGCTTATCAGTGGGTATTCTCAGAGTGATCGAGCAACTGAGGCTCTTGCTTTGTTCCCGCAGATGCTCCACCAGGGCTTTCAACCTAATGAGTTTACTTTGTCTAGTCTGTTGAAGGCTTCTGGAGCTGGCTGTAGTGATGACCTTGGAAGGCAACTTCATGCATTTTCCCTCAAATATGGCTATGATATGAATGTTCATGTGGGAAGTTCATTGCTTGATATGTATGCTAGGTGGG
GGCATATGCGAGAAGCCGAAGCGATTTTTAATGGACTGGCTGCAAAAAATGTGGTGTCTTGGAATGCTCTGATTGCTGGTCATGCTCGGAAGGGTGAAGGGGACCATGCGATAAGGCTGTTTTGGCAAATGTTGAGATGCGATTTCGAACCTACACATTTTACATACTCTAGTGTTTTTACTGCTTGTGCCAGCTGTGGATCTTTGGAGCAAGGCAAATGGGTTCATGCCCATGTAATAAAATCTGGGGGACAACCCATTGCTTATATTGGAAACACTCTCATTGACATGTATGCTAAATCAGGCAGCATCAAGGATGCAAAGAAGGTTTTCCAGCGGTTGGTTAAACAGGATGTGGTTTCGTGGAACTCGATTATATCTGGCTATGCGCAACACGGATTGGGAGTTGAAGCTTTAGAGCTATTTGAAGAGATGCTGAAGGCCAAAGTTCAACCTAATGAAATTACATTCCTCTCTGTTCTTACCGCTTGTAGCCATTCCGGGCTTCTGGATGAAGGACAATATTATTTTGAACTGATGAAGAAATACGGGATAGAACCACAGGTTGCACACTATGTGACAGTTGTTGATCTTTTAGGCCGAGCAGGACGACTAAATGAAGCCAACAAGTTCTTAATAGAAATGCCTATTGAACCTACCGCAGCTGTCTGGGGAGCCTTGCTTGGTGCTTGTAGGATGCATAAGAATATGGATTTAGGTGTTTATGCTGCTGAACGGATTTTTGAGCTTGACCCTCATGACTCAGGCCCTCATGTATTATTGTCTAATATTTATGCTTCTGCTGGTAGATTGAATGATGCAGCAAATGTGAGGAAGATGATGAAACAGAGTGGAGTTAAGAAAGAACCTGCTTGCAGCTGGATTGAAATTGAGAATGAAGTCCATACGTTTGTGGCAAATGATGATTCACATCCAATGAGAGAGGAAATCCAGAAGATGTGGGAGAAAATAAGTGGGAAAATTAAAGAGATTGGGTATGTGCCAGACACAAGCCATGTGCTTTTCTTCATGGACCAGCAGGACAGAGAAGTAAAGCTACAATACCACAGCGAGAAGCTAGCATTAGCATTTTCAGTGTTGAAAACTCCTCCTGGATTAACCATTCGGATTAAGAAGAACATTAGAATATGTGGTGACTGCCATTCTGCATTCAAGTTTGCTTCAAAAGTTTTGGGAAGAGAAATCATTGTAAGAGATACCAATAGATTTCACCATTTCCTTGATGGCTTGTGTTCTTGCAGGGACTATTGGTAG

mRNA sequence

ACCTCCTTTCACTTTTCATTTTATGGTTTCTTTGGTCGGAAAAGAAGGAACTTCGAAGGGGAAGTTGCGCGCCTCCTCCCTCCCCGTGTGTGTTTCTTCTTCGCGCAGCCAACCACCACCGTTGGCGTTCTTCTCGCGCACAAGCCATCTCTATTTGCGAGGAGTTCGCCGCCGTCAACAAACCTTCCTCTCCGTCAGATCTGCTTCCCTCTTCGTCGTCGTCTCCATCTTCCTCCTCGCGACCATCGTCTCCCTCTACCTCTCCCTCGACTGACGTCATCTACTTCGCTGGTTTGCCCTCCTCGACAGAACTCTCTCCCCCTCTCAATCCCTCAGTTCCTTCTCCCTCTCAGATCTCCATCCCTCGAGCCCTCTCCCTCTCCGACGCCTTTTCCTCTCTCCCTTTCAGAATGCAACCACTCGGCTCCCCCTCCCAATTCCTCAGTTTCACTATTTCCCTTTCAGATTTAACGGATCTCTCTTCCCCTCTCTACCTCTCAGATTTATCTCTCGCTAACTAGCTTGTTTGTGTTGCATAGGACGTTGAATTTTCAATTAGGAGGTTGCCTAGCGTTCGTTGTGGTCCGGGCTTGGTGTTGGAAGCTTCAGTTCGATGATACAAGACCCTTGGTTGTCGCAGGCGAATTGGTGTTGACCTGGGAAAGGAAGAGCTCATCGTGTGTTCTACAACCAAGGTTGATGACAGAATATCATGGGAAGAATGGCACCTCCTCCTAGCGTTCCCTGATTATGAAAATTGCACCACTTTCTCCTTCTCTGAAGAATTTAAGATGTTTCTTCCCCGCTGCAAACTACGGCACAGGTTCAGCTCCTTGTGCCTTCTCAGAGTCGGATACGGCGGTAGGCCGAGACTGGAATGCTGTCATCGCCGCCGCCCCGTTCACTGGTGTCCTCCAAGATGAAGACCTTCTTCGCAAGACCCACATTTCTTCCTCTAAAACTTCCACTAATTCTACTGGGATCTATGTTCTAGACCTTATTAACCATGGCTCTTTAGAACCAGACCGAACCCTTTATGGTAAAATGCTAAACAAATGCACCAATTTGCGCAAACTCAAGCAGGGCAGAGCCATCCATGCACACATCCAGGGTTCTATGTTTGAGAATGATCTGGTTCTTCAGAACTTTATCCTAAACATGTACGCCAAATGTGGTAGTCTCGAGGAGGCACAAAACATGTTTGATAAAATGCCTACAAGAGACATGGTTAGTTGGACTGTGCTTATCAGTGGGTATTCTCAGAGTGATCGAGCAACTGAGGCTCTTGCTTTGTTCCCGCAGATGCTCCACCAGGGCTTTCAACCTAATGAGTTTACTTTGTCTAGTCTGTTGAAGGCTTCTGGAGCTGGCTGTAGTGATGACCTTGGAAGGCAACTTCATGCATTTTCCCTCAAATATGGCTATGATATGAATGTTCATGTGGGAAGTTCATTGCTTGATATGTATGCTAGGTGGGGGCATATGCGAGAAGCCGAAGCGATTTTTAATGGACTGGCTGCAAAAAATGTGGTGTCTTGGAATGCTCTGATTGCTGGTCATGCTCGGAAGGGTGAAGGGGACCATGCGATAAGGCTGTTTTGGCAAATGTTGAGATGCGATTTCGAACCTACACATTTTACATACTCTAGTGTTTTTACTGCTTGTGCCAGCTGTGGATCTTTGGAGCAAGGCAAATGGGTTCATGCCCATGTAATAAAATCTGGGGGACAACCCATTGCTTATATTGGAAACACTCTCATTGACATGTATGCTAAATCAGGCAGCATCAAGGATGCAAAGAAGGTTTTCCAGCGGTTGGTTAAACAGGATGTGGTTTCGTGGAACTCGATTATATCTGGCTATGCGCAACACGGATTGGGAGTTGAAGCTTTAGAGCTATTTGAAGAGATGCTGAAGGCCAAAGTTCAACCTAATGAAATTACATTCCTCTCTGTTCTTACCGCTTGTAGCCATTCCGGGCTTCTGGATGAAGGACAATA
TTATTTTGAACTGATGAAGAAATACGGGATAGAACCACAGGTTGCACACTATGTGACAGTTGTTGATCTTTTAGGCCGAGCAGGACGACTAAATGAAGCCAACAAGTTCTTAATAGAAATGCCTATTGAACCTACCGCAGCTGTCTGGGGAGCCTTGCTTGGTGCTTGTAGGATGCATAAGAATATGGATTTAGGTGTTTATGCTGCTGAACGGATTTTTGAGCTTGACCCTCATGACTCAGGCCCTCATGTATTATTGTCTAATATTTATGCTTCTGCTGGTAGATTGAATGATGCAGCAAATGTGAGGAAGATGATGAAACAGAGTGGAGTTAAGAAAGAACCTGCTTGCAGCTGGATTGAAATTGAGAATGAAGTCCATACGTTTGTGGCAAATGATGATTCACATCCAATGAGAGAGGAAATCCAGAAGATGTGGGAGAAAATAAGTGGGAAAATTAAAGAGATTGGGTATGTGCCAGACACAAGCCATGTGCTTTTCTTCATGGACCAGCAGGACAGAGAAGTAAAGCTACAATACCACAGCGAGAAGCTAGCATTAGCATTTTCAGTGTTGAAAACTCCTCCTGGATTAACCATTCGGATTAAGAAGAACATTAGAATATGTGGTGACTGCCATTCTGCATTCAAGTTTGCTTCAAAAGTTTTGGGAAGAGAAATCATTGTAAGAGATACCAATAGATTTCACCATTTCCTTGATGGCTTGTGTTCTTGCAGGGACTATTGGTAG

Coding sequence (CDS)

ATGAAAATTGCACCACTTTCTCCTTCTCTGAAGAATTTAAGATGTTTCTTCCCCGCTGCAAACTACGGCACAGGTTCAGCTCCTTGTGCCTTCTCAGAGTCGGATACGGCGGTAGGCCGAGACTGGAATGCTGTCATCGCCGCCGCCCCGTTCACTGGTGTCCTCCAAGATGAAGACCTTCTTCGCAAGACCCACATTTCTTCCTCTAAAACTTCCACTAATTCTACTGGGATCTATGTTCTAGACCTTATTAACCATGGCTCTTTAGAACCAGACCGAACCCTTTATGGTAAAATGCTAAACAAATGCACCAATTTGCGCAAACTCAAGCAGGGCAGAGCCATCCATGCACACATCCAGGGTTCTATGTTTGAGAATGATCTGGTTCTTCAGAACTTTATCCTAAACATGTACGCCAAATGTGGTAGTCTCGAGGAGGCACAAAACATGTTTGATAAAATGCCTACAAGAGACATGGTTAGTTGGACTGTGCTTATCAGTGGGTATTCTCAGAGTGATCGAGCAACTGAGGCTCTTGCTTTGTTCCCGCAGATGCTCCACCAGGGCTTTCAACCTAATGAGTTTACTTTGTCTAGTCTGTTGAAGGCTTCTGGAGCTGGCTGTAGTGATGACCTTGGAAGGCAACTTCATGCATTTTCCCTCAAATATGGCTATGATATGAATGTTCATGTGGGAAGTTCATTGCTTGATATGTATGCTAGGTGGGGGCATATGCGAGAAGCCGAAGCGATTTTTAATGGACTGGCTGCAAAAAATGTGGTGTCTTGGAATGCTCTGATTGCTGGTCATGCTCGGAAGGGTGAAGGGGACCATGCGATAAGGCTGTTTTGGCAAATGTTGAGATGCGATTTCGAACCTACACATTTTACATACTCTAGTGTTTTTACTGCTTGTGCCAGCTGTGGATCTTTGGAGCAAGGCAAATGGGTTCATGCCCATGTAATAAAATCTGGGGGACAACCCATTGCTTATATTGGAAACACTCTCATTGACATGTATGCTAAATCAGGCAGCATCAAGGATGCAAAGAAGGTTTTCCAGCGGTTGGTTAAACAGGATGTGGTTTCGTGGAACTCGATTATATCTGGCTATGCGCAACACGGATTGGGAGTTGAAGCTTTAGAGCTATTTGAAGAGATGCTGAAGGCCAAAGTTCAACCTAATGAAATTACATTCCTCTCTGTTCTTACCGCTTGTAGCCATTCCGGGCTTCTGGATGAAGGACAATATTATTTTGAACTGATGAAGAAATACGGGATAGAACCACAGGTTGCACACTATGTGACAGTTGTTGATCTTTTAGGCCGAGCAGGACGACTAAATGAAGCCAACAAGTTCTTAATAGAAATGCCTATTGAACCTACCGCAGCTGTCTGGGGAGCCTTGCTTGGTGCTTGTAGGATGCATAAGAATATGGATTTAGGTGTTTATGCTGCTGAACGGATTTTTGAGCTTGACCCTCATGACTCAGGCCCTCATGTATTATTGTCTAATATTTATGCTTCTGCTGGTAGATTGAATGATGCAGCAAATGTGAGGAAGATGATGAAACAGAGTGGAGTTAAGAAAGAACCTGCTTGCAGCTGGATTGAAATTGAGAATGAAGTCCATACGTTTGTGGCAAATGATGATTCACATCCAATGAGAGAGGAAATCCAGAAGATGTGGGAGAAAATAAGTGGGAAAATTAAAGAGATTGGGTATGTGCCAGACACAAGCCATGTGCTTTTCTTCATGGACCAGCAGGACAGAGAAGTAAAGCTACAATACCACAGCGAGAAGCTAGCATTAGCATTTTCAGTGTTGAAAACTCCTCCTGGATTAACCATTCGGATTAAGAAGAACATTAGAATATGTGGTGACTGCCATTCTGCATTCAAGTTTGCTTCAAAAGTTTTGGGAAGAGAAATCATTGTAAGAGATACCAATAGATTTCACCATTTCCTTGATGGCTTGTGTTCTTGCAGGGACTATTGGTA
G

Protein sequence

MKIAPLSPSLKNLRCFFPAANYGTGSAPCAFSESDTAVGRDWNAVIAAAPFTGVLQDEDLLRKTHISSSKTSTNSTGIYVLDLINHGSLEPDRTLYGKMLNKCTNLRKLKQGRAIHAHIQGSMFENDLVLQNFILNMYAKCGSLEEAQNMFDKMPTRDMVSWTVLISGYSQSDRATEALALFPQMLHQGFQPNEFTLSSLLKASGAGCSDDLGRQLHAFSLKYGYDMNVHVGSSLLDMYARWGHMREAEAIFNGLAAKNVVSWNALIAGHARKGEGDHAIRLFWQMLRCDFEPTHFTYSSVFTACASCGSLEQGKWVHAHVIKSGGQPIAYIGNTLIDMYAKSGSIKDAKKVFQRLVKQDVVSWNSIISGYAQHGLGVEALELFEEMLKAKVQPNEITFLSVLTACSHSGLLDEGQYYFELMKKYGIEPQVAHYVTVVDLLGRAGRLNEANKFLIEMPIEPTAAVWGALLGACRMHKNMDLGVYAAERIFELDPHDSGPHVLLSNIYASAGRLNDAANVRKMMKQSGVKKEPACSWIEIENEVHTFVANDDSHPMREEIQKMWEKISGKIKEIGYVPDTSHVLFFMDQQDREVKLQYHSEKLALAFSVLKTPPGLTIRIKKNIRICGDCHSAFKFASKVLGREIIVRDTNRFHHFLDGLCSCRDYW
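
The protein sequence corresponds to a translation of the coding sequence. The following is a minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1); the helper `translate` is hypothetical and only illustrates the relationship using the first and last codons of the CDS listed on this page.

```python
# Standard genetic code (NCBI table 1), with bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a coding sequence in frame 0, stopping at the first stop codon.

    Codons containing characters other than A, C, G, T are reported as 'X'.
    """
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE.get(cds[pos:pos + 3], "X")
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

# First nine codons of the CDS, then the last four (ending in the TAG stop).
print(translate("ATGAAAATTGCACCACTTTCTCCTTCT"))  # MKIAPLSPS
print(translate("GACTATTGGTAG"))                 # DYW
```

Both fragments agree with the start (MKIAPLSPS) and the end (DYW) of the protein sequence above.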
Homology
BLAST of Tan0004354 vs. ExPASy Swiss-Prot
Match: Q9LIQ7 (Pentatricopeptide repeat-containing protein At3g24000, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-H87 PE=3 SV=1)

HSP 1 Score: 821.6 bits (2121), Expect = 6.3e-237
Identity = 395/619 (63.81%), Positives = 481/619 (77.71%), Query Frame = 0

Query: 49  APFTGVLQDEDLLRKTHISSSKTSTNSTGIYVLDLINHGSLEP-DRTLYGKMLNKCTNLR 108
           AP +   +DE L   ++    +TS+N       DL   GS  P DR  Y  +L KCT  +
Sbjct: 24  APVSEDSEDESLKFPSNDLLLRTSSN-------DL--EGSYIPADRRFYNTLLKKCTVFK 83

Query: 109 KLKQGRAIHAHIQGSMFENDLVLQNFILNMYAKCGSLEEAQNMFDKMPTRDMVSWTVLIS 168
            L QGR +HAHI  S+F +D+V+ N +LNMYAKCGSLEEA+ +F+KMP RD V+WT LIS
Sbjct: 84  LLIQGRIVHAHILQSIFRHDIVMGNTLLNMYAKCGSLEEARKVFEKMPQRDFVTWTTLIS 143

Query: 169 GYSQSDRATEALALFPQMLHQGFQPNEFTLSSLLKASGAGCSDDLGRQLHAFSLKYGYDM 228
           GYSQ DR  +AL  F QML  G+ PNEFTLSS++KA+ A      G QLH F +K G+D 
Sbjct: 144 GYSQHDRPCDALLFFNQMLRFGYSPNEFTLSSVIKAAAAERRGCCGHQLHGFCVKCGFDS 203

Query: 229 NVHVGSSLLDMYARWGHMREAEAIFNGLAAKNVVSWNALIAGHARKGEGDHAIRLFWQML 288
           NVHVGS+LLD+Y R+G M +A+ +F+ L ++N VSWNALIAGHAR+   + A+ LF  ML
Sbjct: 204 NVHVGSALLDLYTRYGLMDDAQLVFDALESRNDVSWNALIAGHARRSGTEKALELFQGML 263

Query: 289 RCDFEPTHFTYSSVFTACASCGSLEQGKWVHAHVIKSGGQPIAYIGNTLIDMYAKSGSIK 348
           R  F P+HF+Y+S+F AC+S G LEQGKWVHA++IKSG + +A+ GNTL+DMYAKSGSI 
Sbjct: 264 RDGFRPSHFSYASLFGACSSTGFLEQGKWVHAYMIKSGEKLVAFAGNTLLDMYAKSGSIH 323

Query: 349 DAKKVFQRLVKQDVVSWNSIISGYAQHGLGVEALELFEEMLKAKVQPNEITFLSVLTACS 408
           DA+K+F RL K+DVVSWNS+++ YAQHG G EA+  FEEM +  ++PNEI+FLSVLTACS
Sbjct: 324 DARKIFDRLAKRDVVSWNSLLTAYAQHGFGKEAVWWFEEMRRVGIRPNEISFLSVLTACS 383

Query: 409 HSGLLDEGQYYFELMKKYGIEPQVAHYVTVVDLLGRAGRLNEANKFLIEMPIEPTAAVWG 468
           HSGLLDEG +Y+ELMKK GI P+  HYVTVVDLLGRAG LN A +F+ EMPIEPTAA+W 
Sbjct: 384 HSGLLDEGWHYYELMKKDGIVPEAWHYVTVVDLLGRAGDLNRALRFIEEMPIEPTAAIWK 443

Query: 469 ALLGACRMHKNMDLGVYAAERIFELDPHDSGPHVLLSNIYASAGRLNDAANVRKMMKQSG 528
           ALL ACRMHKN +LG YAAE +FELDP D GPHV+L NIYAS GR NDAA VRK MK+SG
Sbjct: 444 ALLNACRMHKNTELGAYAAEHVFELDPDDPGPHVILYNIYASGGRWNDAARVRKKMKESG 503

Query: 529 VKKEPACSWIEIENEVHTFVANDDSHPMREEIQKMWEKISGKIKEIGYVPDTSHVLFFMD 588
           VKKEPACSW+EIEN +H FVAND+ HP REEI + WE++  KIKE+GYVPDTSHV+  +D
Sbjct: 504 VKKEPACSWVEIENAIHMFVANDERHPQREEIARKWEEVLAKIKELGYVPDTSHVIVHVD 563

Query: 589 QQDREVKLQYHSEKLALAFSVLKTPPGLTIRIKKNIRICGDCHSAFKFASKVLGREIIVR 648
           QQ+REV LQYHSEK+ALAF++L TPPG TI IKKNIR+CGDCH+A K ASKV+GREIIVR
Sbjct: 564 QQEREVNLQYHSEKIALAFALLNTPPGSTIHIKKNIRVCGDCHTAIKLASKVVGREIIVR 623

Query: 649 DTNRFHHFLDGLCSCRDYW 667
           DTNRFHHF DG CSC+DYW
Sbjct: 624 DTNRFHHFKDGNCSCKDYW 633
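
The percentages in each HSP header are the ratio of identical (or positively scoring) positions to the alignment length. The following is a minimal sketch that parses such a header line and recomputes the percentages; the function name `parse_hsp_stats` is hypothetical, and the pattern accepts both the "Positives" and "Postives" spellings.

```python
import re

HSP_PATTERN = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)

def parse_hsp_stats(line):
    """Extract identity/positive counts from an HSP header line and
    recompute the percentages from the raw counts."""
    match = HSP_PATTERN.search(line)
    if match is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    identical, length, _, positives, length_pos, _ = match.groups()
    identical, positives = int(identical), int(positives)
    length, length_pos = int(length), int(length_pos)
    return {
        "identical": identical,
        "positives": positives,
        "alignment_length": length,
        "identity_pct": round(100 * identical / length, 2),
        "positives_pct": round(100 * positives / length_pos, 2),
    }

stats = parse_hsp_stats(
    "Identity = 395/619 (63.81%), Postives = 481/619 (77.71%), Query Frame = 0"
)
print(stats["identity_pct"], stats["positives_pct"])
```

For the Q9LIQ7 hit this reproduces the reported values: 395 of 619 aligned positions are identical (63.81%) and 481 of 619 score positively (77.71%).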

BLAST of Tan0004354 vs. ExPASy Swiss-Prot
Match: Q9LIC3 (Putative pentatricopeptide repeat-containing protein At3g13770, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-H85 PE=3 SV=1)

HSP 1 Score: 528.5 bits (1360), Expect = 1.1e-148
Identity = 270/630 (42.86%), Positives = 403/630 (63.97%), Query Frame = 0

Query: 59  DLLRKTHISSSKTSTN---STGIYVLDLINHGSLE----------PDRTLYG--KMLNKC 118
           +L+R  H S S + TN    T + +  L ++G L+          P+   +G   +LN C
Sbjct: 3   NLMRLIHRSFSSSPTNYVLQTILPISQLCSNGRLQEALLEMAMLGPEMGFHGYDALLNAC 62

Query: 119 TNLRKLKQGRAIHAHIQGSMFENDLVLQNFILNMYAKCGSLEEAQNMFDKMPTRDMVSWT 178
            + R L+ G+ +HAH+  + +     L+  +L  Y KC  LE+A+ + D+MP +++VSWT
Sbjct: 63  LDKRALRDGQRVHAHMIKTRYLPATYLRTRLLIFYGKCDCLEDARKVLDEMPEKNVVSWT 122

Query: 179 VLISGYSQSDRATEALALFPQMLHQGFQPNEFT----LSSLLKASGAGCSDDLGRQLHAF 238
            +IS YSQ+  ++EAL +F +M+    +PNEFT    L+S ++ASG G    LG+Q+H  
Sbjct: 123 AMISRYSQTGHSSEALTVFAEMMRSDGKPNEFTFATVLTSCIRASGLG----LGKQIHGL 182

Query: 239 SLKYGYDMNVHVGSSLLDMYARWGHMREAEAIFNGLAAKNVVSWNALIAGHARKGEGDHA 298
            +K+ YD ++ VGSSLLDMYA+ G ++EA  IF  L  ++VVS  A+IAG+A+ G  + A
Sbjct: 183 IVKWNYDSHIFVGSSLLDMYAKAGQIKEAREIFECLPERDVVSCTAIIAGYAQLGLDEEA 242

Query: 299 IRLFWQMLRCDFEPTHFTYSSVFTACASCGSLEQGKWVHAHVIKSGGQPIAYIGNTLIDM 358
           + +F ++      P + TY+S+ TA +    L+ GK  H HV++      A + N+LIDM
Sbjct: 243 LEMFHRLHSEGMSPNYVTYASLLTALSGLALLDHGKQAHCHVLRRELPFYAVLQNSLIDM 302

Query: 359 YAKSGSIKDAKKVFQRLVKQDVVSWNSIISGYAQHGLGVEALELFEEMLKAK-VQPNEIT 418
           Y+K G++  A+++F  + ++  +SWN+++ GY++HGLG E LELF  M   K V+P+ +T
Sbjct: 303 YSKCGNLSYARRLFDNMPERTAISWNAMLVGYSKHGLGREVLELFRLMRDEKRVKPDAVT 362

Query: 419 FLSVLTACSHSGLLDEGQYYFELM--KKYGIEPQVAHYVTVVDLLGRAGRLNEANKFLIE 478
            L+VL+ CSH  + D G   F+ M   +YG +P   HY  +VD+LGRAGR++EA +F+  
Sbjct: 363 LLAVLSGCSHGRMEDTGLNIFDGMVAGEYGTKPGTEHYGCIVDMLGRAGRIDEAFEFIKR 422

Query: 479 MPIEPTAAVWGALLGACRMHKNMDLGVYAAERIFELDPHDSGPHVLLSNIYASAGRLNDA 538
           MP +PTA V G+LLGACR+H ++D+G     R+ E++P ++G +V+LSN+YASAGR  D 
Sbjct: 423 MPSKPTAGVLGSLLGACRVHLSVDIGESVGRRLIEIEPENAGNYVILSNLYASAGRWADV 482

Query: 539 ANVRKMMKQSGVKKEPACSWIEIENEVHTFVANDDSHPMREEIQKMWEKISGKIKEIGYV 598
            NVR MM Q  V KEP  SWI+ E  +H F AND +HP REE+    ++IS K+K+ GYV
Sbjct: 483 NNVRAMMMQKAVTKEPGRSWIQHEQTLHYFHANDRTHPRREEVLAKMKEISIKMKQAGYV 542

Query: 599 PDTSHVLFFMDQQDREVKLQYHSEKLALAFSVLKTPPGLTIRIKKNIRICGDCHSAFKFA 658
           PD S VL+ +D++ +E  L  HSEKLAL F ++ T  G+ IR+ KN+RIC DCH+  K  
Sbjct: 543 PDLSCVLYDVDEEQKEKMLLGHSEKLALTFGLIATGEGIPIRVFKNLRICVDCHNFAKIF 602

Query: 659 SKVLGREIIVRDTNRFHHFLDGLCSCRDYW 667
           SKV  RE+ +RD NRFH  +DG+CSC DYW
Sbjct: 603 SKVFEREVSLRDKNRFHQIVDGICSCGDYW 628

BLAST of Tan0004354 vs. ExPASy Swiss-Prot
Match: Q9LW63 (Putative pentatricopeptide repeat-containing protein At3g23330 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H32 PE=3 SV=1)

HSP 1 Score: 515.8 bits (1327), Expect = 7.4e-145
Identity = 255/613 (41.60%), Positives = 378/613 (61.66%), Query Frame = 0

Query: 91  PDRTLYGKMLNKCTNLRKLKQGRAIHAHIQGSMFENDLVLQNFILNMYAK---CGSLEEA 150
           PD  ++  +L  CT +  L+ G ++H  I     + DL   N ++NMYAK    GS    
Sbjct: 103 PDHNVFPSVLKSCTMMMDLRFGESVHGFIVRLGMDCDLYTGNALMNMYAKLLGMGSKISV 162

Query: 151 QNMFDKMPTR---------------------------------DMVSWTVLISGYSQSDR 210
            N+FD+MP R                                 D+VS+  +I+GY+QS  
Sbjct: 163 GNVFDEMPQRTSNSGDEDVKAETCIMPFGIDSVRRVFEVMPRKDVVSYNTIIAGYAQSGM 222

Query: 211 ATEALALFPQMLHQGFQPNEFTLSSLLKASGAGCSDDLGRQLHAFSLKYGYDMNVHVGSS 270
             +AL +  +M     +P+ FTLSS+L           G+++H + ++ G D +V++GSS
Sbjct: 223 YEDALRMVREMGTTDLKPDSFTLSSVLPIFSEYVDVIKGKEIHGYVIRKGIDSDVYIGSS 282

Query: 271 LLDMYARWGHMREAEAIFNGLAAKNVVSWNALIAGHARKGEGDHAIRLFWQMLRCDFEPT 330
           L+DMYA+   + ++E +F+ L  ++ +SWN+L+AG+ + G  + A+RLF QM+    +P 
Sbjct: 283 LVDMYAKSARIEDSERVFSRLYCRDGISWNSLVAGYVQNGRYNEALRLFRQMVTAKVKPG 342

Query: 331 HFTYSSVFTACASCGSLEQGKWVHAHVIKSGGQPIAYIGNTLIDMYAKSGSIKDAKKVFQ 390
              +SSV  ACA   +L  GK +H +V++ G     +I + L+DMY+K G+IK A+K+F 
Sbjct: 343 AVAFSSVIPACAHLATLHLGKQLHGYVLRGGFGSNIFIASALVDMYSKCGNIKAARKIFD 402

Query: 391 RLVKQDVVSWNSIISGYAQHGLGVEALELFEEMLKAKVQPNEITFLSVLTACSHSGLLDE 450
           R+   D VSW +II G+A HG G EA+ LFEEM +  V+PN++ F++VLTACSH GL+DE
Sbjct: 403 RMNVLDEVSWTAIIMGHALHGHGHEAVSLFEEMKRQGVKPNQVAFVAVLTACSHVGLVDE 462

Query: 451 GQYYFELMKK-YGIEPQVAHYVTVVDLLGRAGRLNEANKFLIEMPIEPTAAVWGALLGAC 510
              YF  M K YG+  ++ HY  V DLLGRAG+L EA  F+ +M +EPT +VW  LL +C
Sbjct: 463 AWGYFNSMTKVYGLNQELEHYAAVADLLGRAGKLEEAYNFISKMCVEPTGSVWSTLLSSC 522

Query: 511 RMHKNMDLGVYAAERIFELDPHDSGPHVLLSNIYASAGRLNDAANVRKMMKQSGVKKEPA 570
            +HKN++L    AE+IF +D  + G +VL+ N+YAS GR  + A +R  M++ G++K+PA
Sbjct: 523 SVHKNLELAEKVAEKIFTVDSENMGAYVLMCNMYASNGRWKEMAKLRLRMRKKGLRKKPA 582

Query: 571 CSWIEIENEVHTFVANDDSHPMREEIQKMWEKISGKIKEIGYVPDTSHVLFFMDQQDREV 630
           CSWIE++N+ H FV+ D SHP  ++I +  + +  ++++ GYV DTS VL  +D++ +  
Sbjct: 583 CSWIEMKNKTHGFVSGDRSHPSMDKINEFLKAVMEQMEKEGYVADTSGVLHDVDEEHKRE 642

Query: 631 KLQYHSEKLALAFSVLKTPPGLTIRIKKNIRICGDCHSAFKFASKVLGREIIVRDTNRFH 667
            L  HSE+LA+AF ++ T PG TIR+ KNIRIC DCH A KF SK+  REIIVRD +RFH
Sbjct: 643 LLFGHSERLAVAFGIINTEPGTTIRVTKNIRICTDCHVAIKFISKITEREIIVRDNSRFH 702

BLAST of Tan0004354 vs. ExPASy Swiss-Prot
Match: Q3E6Q1 (Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H40 PE=2 SV=1)

HSP 1 Score: 511.9 bits (1317), Expect = 1.1e-143
Identity = 253/580 (43.62%), Positives = 366/580 (63.10%), Query Frame = 0

Query: 88  SLEPDRTLYGKMLNKCTNLRKLKQGRAIHAHIQGSMFENDLVLQNFILNMYAKCGSLEEA 147
           +L+P       +L   + LR +  G+ IH +   S F++ + +   +++MYAKCGSLE A
Sbjct: 231 NLKPSFITIVSVLPAVSALRLISVGKEIHGYAMRSGFDSLVNISTALVDMYAKCGSLETA 290

Query: 148 QNMFDKMPTRDMVSWTVLISGYSQSDRATEALALFPQMLHQGFQPNEFTLSSLLKASGAG 207
           + +FD M  R++VSW  +I  Y Q++   EA+ +F +ML +G +P + ++   L A    
Sbjct: 291 RQLFDGMLERNVVSWNSMIDAYVQNENPKEAMLIFQKMLDEGVKPTDVSVMGALHACADL 350

Query: 208 CSDDLGRQLHAFSLKYGYDMNVHVGSSLLDMYARWGHMREAEAIFNGLAAKNVVSWNALI 267
              + GR +H  S++ G D NV V +SL+ MY +   +  A ++F  L ++ +VSWNA+I
Sbjct: 351 GDLERGRFIHKLSVELGLDRNVSVVNSLISMYCKCKEVDTAASMFGKLQSRTLVSWNAMI 410

Query: 268 AGHARKGEGDHAIRLFWQMLRCDFEPTHFTYSSVFTACASCGSLEQGKWVHAHVIKSGGQ 327
            G A+ G    A+  F QM     +P  FTY SV TA A        KW+H  V++S   
Sbjct: 411 LGFAQNGRPIDALNYFSQMRSRTVKPDTFTYVSVITAIAELSITHHAKWIHGVVMRSCLD 470

Query: 328 PIAYIGNTLIDMYAKSGSIKDAKKVFQRLVKQDVVSWNSIISGYAQHGLGVEALELFEEM 387
              ++   L+DMYAK G+I  A+ +F  + ++ V +WN++I GY  HG G  ALELFEEM
Sbjct: 471 KNVFVTTALVDMYAKCGAIMIARLIFDMMSERHVTTWNAMIDGYGTHGFGKAALELFEEM 530

Query: 388 LKAKVQPNEITFLSVLTACSHSGLLDEGQYYFELMKK-YGIEPQVAHYVTVVDLLGRAGR 447
            K  ++PN +TFLSV++ACSHSGL++ G   F +MK+ Y IE  + HY  +VDLLGRAGR
Sbjct: 531 QKGTIKPNGVTFLSVISACSHSGLVEAGLKCFYMMKENYSIELSMDHYGAMVDLLGRAGR 590

Query: 448 LNEANKFLIEMPIEPTAAVWGALLGACRMHKNMDLGVYAAERIFELDPHDSGPHVLLSNI 507
           LNEA  F+++MP++P   V+GA+LGAC++HKN++    AAER+FEL+P D G HVLL+NI
Sbjct: 591 LNEAWDFIMQMPVKPAVNVYGAMLGACQIHKNVNFAEKAAERLFELNPDDGGYHVLLANI 650

Query: 508 YASAGRLNDAANVRKMMKQSGVKKEPACSWIEIENEVHTFVANDDSHPMREEIQKMWEKI 567
           Y +A        VR  M + G++K P CS +EI+NEVH+F +   +HP  ++I    EK+
Sbjct: 651 YRAASMWEKVGQVRVSMLRQGLRKTPGCSMVEIKNEVHSFFSGSTAHPDSKKIYAFLEKL 710

Query: 568 SGKIKEIGYVPDTSHVLFFMDQQDREVKLQYHSEKLALAFSVLKTPPGLTIRIKKNIRIC 627
              IKE GYVPDT+ VL  ++   +E  L  HSEKLA++F +L T  G TI ++KN+R+C
Sbjct: 711 ICHIKEAGYVPDTNLVL-GVENDVKEQLLSTHSEKLAISFGLLNTTAGTTIHVRKNLRVC 770

Query: 628 GDCHSAFKFASKVLGREIIVRDTNRFHHFLDGLCSCRDYW 667
            DCH+A K+ S V GREI+VRD  RFHHF +G CSC DYW
Sbjct: 771 ADCHNATKYISLVTGREIVVRDMQRFHHFKNGACSCGDYW 809

BLAST of Tan0004354 vs. ExPASy Swiss-Prot
Match: Q9SI53 (Pentatricopeptide repeat-containing protein At2g03880, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-H44 PE=2 SV=1)

HSP 1 Score: 504.6 bits (1298), Expect = 1.7e-141
Identity = 247/587 (42.08%), Positives = 365/587 (62.18%), Query Frame = 0

Query: 81  LDLINHGSLEPDRTLYGKMLNKCTNLRKLKQGRAIHAHIQGSMFENDLVLQNFILNMYAK 140
           +D +    L  D   Y +++  C + R + +G  I  H+  +     + L N ++NMY K
Sbjct: 49  MDSLQSHGLWADSATYSELIKCCISNRAVHEGNLICRHLYFNGHRPMMFLVNVLINMYVK 108

Query: 141 CGSLEEAQNMFDKMPTRDMVSWTVLISGYSQSDRATEALALFPQMLHQGFQPNEFTLSSL 200
              L +A  +FD+MP R+++SWT +IS YS+     +AL L   ML    +PN +T SS+
Sbjct: 109 FNLLNDAHQLFDQMPQRNVISWTTMISAYSKCKIHQKALELLVLMLRDNVRPNVYTYSSV 168

Query: 201 LKASGAGCSDDLGRQLHAFSLKYGYDMNVHVGSSLLDMYARWGHMREAEAIFNGLAAKNV 260
           L++   G SD   R LH   +K G + +V V S+L+D++A+ G   +A ++F+ +   + 
Sbjct: 169 LRSCN-GMSD--VRMLHCGIIKEGLESDVFVRSALIDVFAKLGEPEDALSVFDEMVTGDA 228

Query: 261 VSWNALIAGHARKGEGDHAIRLFWQMLRCDFEPTHFTYSSVFTACASCGSLEQGKWVHAH 320
           + WN++I G A+    D A+ LF +M R  F     T +SV  AC     LE G   H H
Sbjct: 229 IVWNSIIGGFAQNSRSDVALELFKRMKRAGFIAEQATLTSVLRACTGLALLELGMQAHVH 288

Query: 321 VIKSGGQPIAYIGNTLIDMYAKSGSIKDAKKVFQRLVKQDVVSWNSIISGYAQHGLGVEA 380
           ++K     I  + N L+DMY K GS++DA +VF ++ ++DV++W+++ISG AQ+G   EA
Sbjct: 289 IVKYDQDLI--LNNALVDMYCKCGSLEDALRVFNQMKERDVITWSTMISGLAQNGYSQEA 348

Query: 381 LELFEEMLKAKVQPNEITFLSVLTACSHSGLLDEGQYYFELMKK-YGIEPQVAHYVTVVD 440
           L+LFE M  +  +PN IT + VL ACSH+GLL++G YYF  MKK YGI+P   HY  ++D
Sbjct: 349 LKLFERMKSSGTKPNYITIVGVLFACSHAGLLEDGWYYFRSMKKLYGIDPVREHYGCMID 408

Query: 441 LLGRAGRLNEANKFLIEMPIEPTAAVWGALLGACRMHKNMDLGVYAAERIFELDPHDSGP 500
           LLG+AG+L++A K L EM  EP A  W  LLGACR+ +NM L  YAA+++  LDP D+G 
Sbjct: 409 LLGKAGKLDDAVKLLNEMECEPDAVTWRTLLGACRVQRNMVLAEYAAKKVIALDPEDAGT 468

Query: 501 HVLLSNIYASAGRLNDAANVRKMMKQSGVKKEPACSWIEIENEVHTFVANDDSHPMREEI 560
           + LLSNIYA++ + +    +R  M+  G+KKEP CSWIE+  ++H F+  D+SHP   E+
Sbjct: 469 YTLLSNIYANSQKWDSVEEIRTRMRDRGIKKEPGCSWIEVNKQIHAFIIGDNSHPQIVEV 528

Query: 561 QKMWEKISGKIKEIGYVPDTSHVLFFMDQQDREVKLQYHSEKLALAFSVLKTPPGLTIRI 620
            K   ++  ++  IGYVP+T+ VL  ++ +  E  L++HSEKLALAF ++  P    IRI
Sbjct: 529 SKKLNQLIHRLTGIGYVPETNFVLQDLEGEQMEDSLRHHSEKLALAFGLMTLPIEKVIRI 588

Query: 621 KKNIRICGDCHSAFKFASKVLGREIIVRDTNRFHHFLDGLCSCRDYW 667
           +KN+RICGDCH   K ASK+  R I++RD  R+HHF DG CSC DYW
Sbjct: 589 RKNLRICGDCHVFCKLASKLEIRSIVIRDPIRYHHFQDGKCSCGDYW 630

BLAST of Tan0004354 vs. NCBI nr
Match: XP_022973115.1 (pentatricopeptide repeat-containing protein At3g24000, mitochondrial [Cucurbita maxima])

HSP 1 Score: 1238.4 bits (3203), Expect = 0.0e+00
Identity = 601/678 (88.64%), Positives = 631/678 (93.07%), Query Frame = 0

Query: 1   MKIAPLSPS------------LKNLRCFFPAANYGTGSAPCAFSESDTAVGRDWNAVIAA 60
           MKIAP+SPS            LK  +CFF AANYGTGS PC+ +ESD+A GRDWNA  AA
Sbjct: 1   MKIAPISPSSLQNLVLSDLPKLKPFKCFFSAANYGTGSPPCSLTESDSAEGRDWNAAAAA 60

Query: 61  APFTGVLQDEDLLRKTHISSSKTSTNSTGIYVLDLINHGSLEPDRTLYGKMLNKCTNLRK 120
            PFTGVLQDEDLLRKTHISSS+TST+STG+YVLDLINHG LEP+RTLY KMLNKCT+LRK
Sbjct: 61  VPFTGVLQDEDLLRKTHISSSETSTSSTGLYVLDLINHGKLEPERTLYSKMLNKCTHLRK 120

Query: 121 LKQGRAIHAHIQGSMFENDLVLQNFILNMYAKCGSLEEAQNMFDKMPTRDMVSWTVLISG 180
           LK GR IH+HIQGS FENDLV+QN ILNMYAKCGSLEEA N+FDKMPTRDMVSWTVLISG
Sbjct: 121 LKLGRVIHSHIQGSTFENDLVIQNSILNMYAKCGSLEEAHNLFDKMPTRDMVSWTVLISG 180

Query: 181 YSQSDRATEALALFPQMLHQGFQPNEFTLSSLLKASGAGCSDDLGRQLHAFSLKYGYDMN 240
           YSQS RA EAL LFPQM HQGFQPNEFTLSSLLKASGA  SD+ GRQLHAFSLKYG++MN
Sbjct: 181 YSQSGRAFEALGLFPQMFHQGFQPNEFTLSSLLKASGASPSDEHGRQLHAFSLKYGFNMN 240

Query: 241 VHVGSSLLDMYARWGHMREAEAIFNGLAAKNVVSWNALIAGHARKGEGDHAIRLFWQMLR 300
           VHVGSSLLDMYARWGHM+EAEAIFNGLAAKNVVSWNALIAGHARKGEG+H ++LF QMLR
Sbjct: 241 VHVGSSLLDMYARWGHMQEAEAIFNGLAAKNVVSWNALIAGHARKGEGEHVMKLFRQMLR 300

Query: 301 CDFEPTHFTYSSVFTACASCGSLEQGKWVHAHVIKSGGQPIAYIGNTLIDMYAKSGSIKD 360
            +FEPTHFTYSSVFTACAS GS EQGKWVHAHVIKSGGQP+AYIGNTLIDMYAKSGSIKD
Sbjct: 301 QNFEPTHFTYSSVFTACASSGSFEQGKWVHAHVIKSGGQPVAYIGNTLIDMYAKSGSIKD 360

Query: 361 AKKVFQRLVKQDVVSWNSIISGYAQHGLGVEALELFEEMLKAKVQPNEITFLSVLTACSH 420
           AKKVFQRLVKQDVVSWNSIISGYAQHGLG EAL+LFEEMLKAKVQPNEITFLSVLTACSH
Sbjct: 361 AKKVFQRLVKQDVVSWNSIISGYAQHGLGAEALQLFEEMLKAKVQPNEITFLSVLTACSH 420

Query: 421 SGLLDEGQYYFELMKKYGIEPQVAHYVTVVDLLGRAGRLNEANKFLIEMPIEPTAAVWGA 480
           SGLLDEGQYYFELMKKY IEPQV+H+VTVVDLLGRAGRL+EANKF+ EMPIEPTAAVWGA
Sbjct: 421 SGLLDEGQYYFELMKKYEIEPQVSHHVTVVDLLGRAGRLDEANKFIKEMPIEPTAAVWGA 480

Query: 481 LLGACRMHKNMDLGVYAAERIFELDPHDSGPHVLLSNIYASAGRLNDAANVRKMMKQSGV 540
           LLGACRMHKNMDLG YAAERIFELDPHDSGPHVLLSNIYASAGRLNDAANVRKMMK+SGV
Sbjct: 481 LLGACRMHKNMDLGAYAAERIFELDPHDSGPHVLLSNIYASAGRLNDAANVRKMMKESGV 540

Query: 541 KKEPACSWIEIENEVHTFVANDDSHPMREEIQKMWEKISGKIKEIGYVPDTSHVLFFMDQ 600
           KKEPACSW+EIEN VH FVAND+SHPMREEIQKMWEKISGKIKEIGYVPDTSHVLFFMDQ
Sbjct: 541 KKEPACSWVEIENGVHMFVANDESHPMREEIQKMWEKISGKIKEIGYVPDTSHVLFFMDQ 600

Query: 601 QDREVKLQYHSEKLALAFSVLKTPPGLTIRIKKNIRICGDCHSAFKFASKVLGREIIVRD 660
           QDREVKLQYHSEKLALAFSVLKTPPG TIRIKKNIRICGDCHSAFKFASKVLGREIIVRD
Sbjct: 601 QDREVKLQYHSEKLALAFSVLKTPPGFTIRIKKNIRICGDCHSAFKFASKVLGREIIVRD 660

Query: 661 TNRFHHFLDGLCSCRDYW 667
           TNRFHHFLDGLCSCRDYW
Sbjct: 661 TNRFHHFLDGLCSCRDYW 678
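
The percentages in each HSP header are the match count divided by the alignment length (gaps included), so 601/678 gives 88.64%. The following is a minimal Python sketch, not part of the database output, that parses such a header line and recomputes the percentages as a sanity check. The function name `parse_hsp_stats` and the regular expression are illustrative assumptions; the parser keys on whatever label text the line contains.

```python
import re

# Matches "Label = matches/length (xx.xx%)"; "Query Frame = 0" has no
# fraction and is therefore skipped. The pattern is an assumption based on
# the header lines shown in this record.
_FRACTION = re.compile(r"(\w+) = (\d+)/(\d+) \((\d+(?:\.\d+)?)%\)")


def parse_hsp_stats(line):
    """Return {label: (matches, length, reported_pct, recomputed_pct)}."""
    stats = {}
    for label, matches, length, pct in _FRACTION.findall(line):
        m, n = int(matches), int(length)
        # Recompute the percentage from the raw counts, rounded to 2 places.
        stats[label] = (m, n, float(pct), round(100.0 * m / n, 2))
    return stats


line = "Identity = 601/678 (88.64%), Positives = 631/678 (93.07%), Query Frame = 0"
for label, (m, n, reported, recomputed) in parse_hsp_stats(line).items():
    print(label, m, n, reported, recomputed)
```

A mismatch between the reported and recomputed values would suggest the header line was damaged during extraction.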

BLAST of Tan0004354 vs. NCBI nr
Match: XP_022142695.1 (pentatricopeptide repeat-containing protein At3g24000, mitochondrial [Momordica charantia] >XP_022142696.1 pentatricopeptide repeat-containing protein At3g24000, mitochondrial [Momordica charantia] >XP_022142697.1 pentatricopeptide repeat-containing protein At3g24000, mitochondrial [Momordica charantia] >XP_022142698.1 pentatricopeptide repeat-containing protein At3g24000, mitochondrial [Momordica charantia] >XP_022142700.1 pentatricopeptide repeat-containing protein At3g24000, mitochondrial [Momordica charantia] >XP_022142701.1 pentatricopeptide repeat-containing protein At3g24000, mitochondrial [Momordica charantia] >XP_022142702.1 pentatricopeptide repeat-containing protein At3g24000, mitochondrial [Momordica charantia])

HSP 1 Score: 1226.8 bits (3173), Expect = 0.0e+00
Identity = 600/678 (88.50%), Positives = 631/678 (93.07%), Query Frame = 0

Query: 1   MKIAPLS-----------PSLKNLRCFFPAANYGTG-SAPCAFSESDTAVGRDWNAVIAA 60
           MKIAPLS           P LK L+CFF AANYG G +APCAFSESDTA GR+WNA +AA
Sbjct: 1   MKIAPLSLSSLKNLVFSDPKLKPLKCFFFAANYGAGPAAPCAFSESDTAEGREWNAAVAA 60

Query: 61  APFTGVLQDEDLLRKTHISSSKTSTNSTGIYVLDLINHGSLEPDRTLYGKMLNKCTNLRK 120
            PF GVLQDEDLLRKTHI S +TST STG+YVLDL+NHGSLEPDRTLY KMLNKCT+LRK
Sbjct: 61  TPFNGVLQDEDLLRKTHI-SPQTSTPSTGLYVLDLLNHGSLEPDRTLYSKMLNKCTHLRK 120

Query: 121 LKQGRAIHAHIQGSMFENDLVLQNFILNMYAKCGSLEEAQNMFDKMPTRDMVSWTVLISG 180
           LKQGR IHAHIQGS FE+DLVLQNFILNMYAKCGS+EEA+N+FDKMPTRDMVSWTV+ISG
Sbjct: 121 LKQGRVIHAHIQGSSFESDLVLQNFILNMYAKCGSVEEARNVFDKMPTRDMVSWTVMISG 180

Query: 181 YSQSDRATEALALFPQMLHQGFQPNEFTLSSLLKASGAGCSDDLGRQLHAFSLKYGYDMN 240
           +SQS  A+EALALFPQMLHQGFQPNEFTLSSLLKASG G SDD GRQLHAFSLKYGYD+N
Sbjct: 181 FSQSGLASEALALFPQMLHQGFQPNEFTLSSLLKASGTGPSDDHGRQLHAFSLKYGYDVN 240

Query: 241 VHVGSSLLDMYARWGHMREAEAIFNGLAAKNVVSWNALIAGHARKGEGDHAIRLFWQMLR 300
           VHVGSSLLDMYAR GHMREA+AIF+GLA KNVVSWNALIAGHARKGEG+H +RLFWQMLR
Sbjct: 241 VHVGSSLLDMYARCGHMREAKAIFDGLAGKNVVSWNALIAGHARKGEGEHVMRLFWQMLR 300

Query: 301 CDFEPTHFTYSSVFTACASCGSLEQGKWVHAHVIKSGGQPIAYIGNTLIDMYAKSGSIKD 360
            D EPTHFTYSSVF+ACAS GSLEQGKWVHAHVIKSGGQPIAYIGNTLIDMYAKSGSIKD
Sbjct: 301 QDLEPTHFTYSSVFSACASSGSLEQGKWVHAHVIKSGGQPIAYIGNTLIDMYAKSGSIKD 360

Query: 361 AKKVFQRLVKQDVVSWNSIISGYAQHGLGVEALELFEEMLKAKVQPNEITFLSVLTACSH 420
           AKKVFQRLVKQDVVSWNSIISGYAQHGLGVEAL+LFEEMLK KVQPN+ITFLSVLTACSH
Sbjct: 361 AKKVFQRLVKQDVVSWNSIISGYAQHGLGVEALQLFEEMLKVKVQPNQITFLSVLTACSH 420

Query: 421 SGLLDEGQYYFELMKKYGIEPQVAHYVTVVDLLGRAGRLNEANKFLIEMPIEPTAAVWGA 480
           SGLLDEGQYYFELMK Y IEPQ+AHYVTVVDLLGRAGRLNEAN F+ EMP++PTAAVWGA
Sbjct: 421 SGLLDEGQYYFELMKNYEIEPQIAHYVTVVDLLGRAGRLNEANNFIKEMPVKPTAAVWGA 480

Query: 481 LLGACRMHKNMDLGVYAAERIFELDPHDSGPHVLLSNIYASAGRLNDAANVRKMMKQSGV 540
           LLGA RMHKNMDLGVYAAERIFELDPHDSGPHVLLSNIYASAGRLNDAANVRKMMK+SGV
Sbjct: 481 LLGASRMHKNMDLGVYAAERIFELDPHDSGPHVLLSNIYASAGRLNDAANVRKMMKESGV 540

Query: 541 KKEPACSWIEIENEVHTFVANDDSHPMREEIQKMWEKISGKIKEIGYVPDTSHVLFFMDQ 600
           KKEPACSW+EIENEVH FVANDDSHPMREEIQ+MWEKISGKI+EIGYVPDTSHVLFFMDQ
Sbjct: 541 KKEPACSWVEIENEVHMFVANDDSHPMREEIQRMWEKISGKIREIGYVPDTSHVLFFMDQ 600

Query: 601 QDREVKLQYHSEKLALAFSVLKTPPGLTIRIKKNIRICGDCHSAFKFASKVLGREIIVRD 660
           QDREVKLQYHSEKLALAFSVLKTPPG TIRIKKNIRICGDCHSAFKFASKVL REIIVRD
Sbjct: 601 QDREVKLQYHSEKLALAFSVLKTPPGFTIRIKKNIRICGDCHSAFKFASKVLRREIIVRD 660

Query: 661 TNRFHHFLDGLCSCRDYW 667
           TNRFHHFL GLCSCRDYW
Sbjct: 661 TNRFHHFLVGLCSCRDYW 677

BLAST of Tan0004354 vs. NCBI nr
Match: XP_038893938.1 (pentatricopeptide repeat-containing protein At3g24000, mitochondrial isoform X1 [Benincasa hispida])

HSP 1 Score: 1216.1 bits (3145), Expect = 0.0e+00
Identity = 596/668 (89.22%), Positives = 620/668 (92.81%), Query Frame = 0

Query: 2   KIAPLS---PSLKNLRCFFPAANYGTGSAPCAFSESDTAVGRDWNAVIAAAPFTGVLQDE 61
           K  PL    P LK LR F  AA YGTG  PCAF ES TA  +DWN  +  APF G+LQDE
Sbjct: 15  KYGPLQLFHPKLKPLRFFPIAAKYGTGLTPCAFMESGTAESQDWNPTV--APFNGILQDE 74

Query: 62  DLLRKTHISSSKTSTNSTGIYVLDLINHGSLEPDRTLYGKMLNKCTNLRKLKQGRAIHAH 121
           DLLRKTHISSS TSTNSTG+YVLDLIN GSLEP+RTLY KMLNKCT LRKLKQGRAIHAH
Sbjct: 75  DLLRKTHISSSFTSTNSTGLYVLDLINRGSLEPERTLYCKMLNKCTYLRKLKQGRAIHAH 134

Query: 122 IQGSMFENDLVLQNFILNMYAKCGSLEEAQNMFDKMPTRDMVSWTVLISGYSQSDRATEA 181
           IQGS FENDLVL N ILNMYAKCGSLEEAQN+FDKMP RDMVSWTVLISGYSQS RA+EA
Sbjct: 135 IQGSTFENDLVLLNCILNMYAKCGSLEEAQNLFDKMPIRDMVSWTVLISGYSQSGRASEA 194

Query: 182 LALFPQMLHQGFQPNEFTLSSLLKASGAGCSDDLGRQLHAFSLKYGYDMNVHVGSSLLDM 241
           LA FP+MLH GFQPNEFTLSSLLKASGAG SDD GRQLHAFSLKYGYDMNVHVGSSLLDM
Sbjct: 195 LAWFPKMLHLGFQPNEFTLSSLLKASGAGPSDDNGRQLHAFSLKYGYDMNVHVGSSLLDM 254

Query: 242 YARWGHMREAEAIFNGLAAKNVVSWNALIAGHARKGEGDHAIRLFWQMLRCDFEPTHFTY 301
           YARWGHMREA  IFN LAAKNVVSWNALIAG+ARKGEG+H +RLFWQMLR DFEPTHFTY
Sbjct: 255 YARWGHMREATVIFNSLAAKNVVSWNALIAGYARKGEGEHVMRLFWQMLRQDFEPTHFTY 314

Query: 302 SSVFTACASCGSLEQGKWVHAHVIKSGGQPIAYIGNTLIDMYAKSGSIKDAKKVFQRLVK 361
           SSVF ACAS GSLEQGKWVHAHVIKSGGQPIAYIGNTLIDMYAKSGSIKDAKKVFQRLVK
Sbjct: 315 SSVFIACASSGSLEQGKWVHAHVIKSGGQPIAYIGNTLIDMYAKSGSIKDAKKVFQRLVK 374

Query: 362 QDVVSWNSIISGYAQHGLGVEALELFEEMLKAKVQPNEITFLSVLTACSHSGLLDEGQYY 421
           QD+VSWNSIISGYA HGLGVEAL+LFEEML+AKVQPNEITFLSVLTACSHSGLLD+G+YY
Sbjct: 375 QDIVSWNSIISGYAHHGLGVEALQLFEEMLRAKVQPNEITFLSVLTACSHSGLLDDGRYY 434

Query: 422 FELMKKYGIEPQVAHYVTVVDLLGRAGRLNEANKFLIEMPIEPTAAVWGALLGACRMHKN 481
           FELMKKY IEPQVAH+VTVVDLLGRAGRL+EANKF+ EMPIEPTAAVWGALLGACRMHKN
Sbjct: 435 FELMKKYEIEPQVAHHVTVVDLLGRAGRLHEANKFIEEMPIEPTAAVWGALLGACRMHKN 494

Query: 482 MDLGVYAAERIFELDPHDSGPHVLLSNIYASAGRLNDAANVRKMMKQSGVKKEPACSWIE 541
           MDLGVYAAER+FELDPHDSGPHVLLSNIYASAGRLNDAANVRKMMK+SGVKKEPACSW+E
Sbjct: 495 MDLGVYAAERVFELDPHDSGPHVLLSNIYASAGRLNDAANVRKMMKESGVKKEPACSWVE 554

Query: 542 IENEVHTFVANDDSHPMREEIQKMWEKISGKIKEIGYVPDTSHVLFFMDQQDREVKLQYH 601
           IENEVH FVANDDSHPMREEI++MWEKISGKIKEIGYVPDTSHVLFFMDQQDREVKLQYH
Sbjct: 555 IENEVHMFVANDDSHPMREEIRRMWEKISGKIKEIGYVPDTSHVLFFMDQQDREVKLQYH 614

Query: 602 SEKLALAFSVLKTPPGLTIRIKKNIRICGDCHSAFKFASKVLGREIIVRDTNRFHHFLDG 661
           SEKLALAFSVLKTPPGLTIRIKKNIRICGDCHSAFKFASKVLGREIIVRDTNRFHHFL G
Sbjct: 615 SEKLALAFSVLKTPPGLTIRIKKNIRICGDCHSAFKFASKVLGREIIVRDTNRFHHFLHG 674

Query: 662 LCSCRDYW 667
           LCSCRDYW
Sbjct: 675 LCSCRDYW 680

BLAST of Tan0004354 vs. NCBI nr
Match: XP_023520189.1 (pentatricopeptide repeat-containing protein At3g24000, mitochondrial [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1216.1 bits (3145), Expect = 0.0e+00
Identity = 599/687 (87.19%), Positives = 626/687 (91.12%), Query Frame = 0

Query: 1   MKIAPL----------SPSLKNL-----------RCFFPAANYGTGSAPCAFSESDTAVG 60
           MKIAP+          S SLKNL           +CFF AANYGTGS PC+ +ESD+A  
Sbjct: 1   MKIAPISSSSSSSSSSSSSLKNLVLSNLPKLNPFKCFFSAANYGTGSPPCSLTESDSAAA 60

Query: 61  RDWNAVIAAAPFTGVLQDEDLLRKTHISSSKTSTNSTGIYVLDLINHGSLEPDRTLYGKM 120
                  AA PFTGVLQDEDLLRKTHISSS+TST+STG+YVLDLINHG LEP+RTLY KM
Sbjct: 61  -------AAVPFTGVLQDEDLLRKTHISSSETSTSSTGLYVLDLINHGKLEPERTLYSKM 120

Query: 121 LNKCTNLRKLKQGRAIHAHIQGSMFENDLVLQNFILNMYAKCGSLEEAQNMFDKMPTRDM 180
           LNKCT LRKLK GR IH+HIQGS FENDLV+QN ILNMYAKCGSLEEA N+FDKMPTRDM
Sbjct: 121 LNKCTLLRKLKLGRVIHSHIQGSTFENDLVIQNSILNMYAKCGSLEEAHNLFDKMPTRDM 180

Query: 181 VSWTVLISGYSQSDRATEALALFPQMLHQGFQPNEFTLSSLLKASGAGCSDDLGRQLHAF 240
           VSWTVLISGYSQS RA EAL LFPQM HQGFQPNEFTLSSLLKASGA  SDD GRQLHAF
Sbjct: 181 VSWTVLISGYSQSGRAFEALGLFPQMFHQGFQPNEFTLSSLLKASGASPSDDHGRQLHAF 240

Query: 241 SLKYGYDMNVHVGSSLLDMYARWGHMREAEAIFNGLAAKNVVSWNALIAGHARKGEGDHA 300
           SLKYG+DMNVHVGSSLLDMYARWGHM+EAEAIFNGLAAKNVVSWNALIAGHARKGEG+H 
Sbjct: 241 SLKYGFDMNVHVGSSLLDMYARWGHMQEAEAIFNGLAAKNVVSWNALIAGHARKGEGEHV 300

Query: 301 IRLFWQMLRCDFEPTHFTYSSVFTACASCGSLEQGKWVHAHVIKSGGQPIAYIGNTLIDM 360
           ++LF QMLR +FEPTHFTYSSVFTACAS GS EQGKWVHAHVIKSGGQP+AYIGNTLIDM
Sbjct: 301 MKLFRQMLRHNFEPTHFTYSSVFTACASSGSFEQGKWVHAHVIKSGGQPVAYIGNTLIDM 360

Query: 361 YAKSGSIKDAKKVFQRLVKQDVVSWNSIISGYAQHGLGVEALELFEEMLKAKVQPNEITF 420
           YAKSGSIKDAKKVFQRLVKQDVVSWNSIISGYAQHGLG EAL+LFEEMLKAKVQPNEITF
Sbjct: 361 YAKSGSIKDAKKVFQRLVKQDVVSWNSIISGYAQHGLGAEALQLFEEMLKAKVQPNEITF 420

Query: 421 LSVLTACSHSGLLDEGQYYFELMKKYGIEPQVAHYVTVVDLLGRAGRLNEANKFLIEMPI 480
           LSVLTACSHSGLLDEGQYYFELMKKY IEPQV+H+VTVVDLLGRAGRL+EANKF+ EMPI
Sbjct: 421 LSVLTACSHSGLLDEGQYYFELMKKYEIEPQVSHHVTVVDLLGRAGRLDEANKFIKEMPI 480

Query: 481 EPTAAVWGALLGACRMHKNMDLGVYAAERIFELDPHDSGPHVLLSNIYASAGRLNDAANV 540
           EPTAAVWGALLGACRMHKNMDLG YAAERIFELDPHDSGPHVLLSNIYASAGRLNDAANV
Sbjct: 481 EPTAAVWGALLGACRMHKNMDLGAYAAERIFELDPHDSGPHVLLSNIYASAGRLNDAANV 540

Query: 541 RKMMKQSGVKKEPACSWIEIENEVHTFVANDDSHPMREEIQKMWEKISGKIKEIGYVPDT 600
           RKMMK+SGVKKEPACSW+EIENEVH FVAND+SHPMREEIQKMWEKISGKIKEIGYVPDT
Sbjct: 541 RKMMKESGVKKEPACSWVEIENEVHMFVANDESHPMREEIQKMWEKISGKIKEIGYVPDT 600

Query: 601 SHVLFFMDQQDREVKLQYHSEKLALAFSVLKTPPGLTIRIKKNIRICGDCHSAFKFASKV 660
           SHVLFFMDQQDREVKLQYHSEKLALAFSVLKTPPG TIRIKKNIRICGDCHSAFKFASKV
Sbjct: 601 SHVLFFMDQQDREVKLQYHSEKLALAFSVLKTPPGFTIRIKKNIRICGDCHSAFKFASKV 660

Query: 661 LGREIIVRDTNRFHHFLDGLCSCRDYW 667
           LGREIIVRDTNRFHHFLDGLCSCRDYW
Sbjct: 661 LGREIIVRDTNRFHHFLDGLCSCRDYW 680

BLAST of Tan0004354 vs. NCBI nr
Match: KAG6583676.1 (Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita argyrosperma subsp. sororia] >KAG7019337.1 Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1214.9 bits (3142), Expect = 0.0e+00
Identity = 596/678 (87.91%), Positives = 622/678 (91.74%), Query Frame = 0

Query: 1   MKIAPLS------------PSLKNLRCFFPAANYGTGSAPCAFSESDTAVGRDWNAVIAA 60
           MKIAP+S            P LK  + FF AANYGTG  PC+F+ESD+A         AA
Sbjct: 1   MKIAPISSSSLKNLVLSDLPKLKPFKWFFSAANYGTGPPPCSFTESDSAAA-------AA 60

Query: 61  APFTGVLQDEDLLRKTHISSSKTSTNSTGIYVLDLINHGSLEPDRTLYGKMLNKCTNLRK 120
            PFTGVLQDEDLLRKTH+SSS+TSTNSTG+YVLDLINHG LEP+RTLY KMLNKCT LRK
Sbjct: 61  VPFTGVLQDEDLLRKTHMSSSETSTNSTGLYVLDLINHGKLEPERTLYSKMLNKCTLLRK 120

Query: 121 LKQGRAIHAHIQGSMFENDLVLQNFILNMYAKCGSLEEAQNMFDKMPTRDMVSWTVLISG 180
           LK GR IH+HIQGS FENDLV+QN ILNMYAKCGSLEEA N+FDKMPTRDMVSWTVLISG
Sbjct: 121 LKLGRVIHSHIQGSTFENDLVIQNSILNMYAKCGSLEEAHNLFDKMPTRDMVSWTVLISG 180

Query: 181 YSQSDRATEALALFPQMLHQGFQPNEFTLSSLLKASGAGCSDDLGRQLHAFSLKYGYDMN 240
           YSQS RA EAL LFPQM HQGFQPNEFTLSSLLKASGA  SDD GRQLHAFSLKYG+DMN
Sbjct: 181 YSQSGRAFEALGLFPQMFHQGFQPNEFTLSSLLKASGASPSDDHGRQLHAFSLKYGFDMN 240

Query: 241 VHVGSSLLDMYARWGHMREAEAIFNGLAAKNVVSWNALIAGHARKGEGDHAIRLFWQMLR 300
           VHVGSSLLDMYARWGHM+EAEAIFNGLAAKNVVSWNALIAGHARKGEG+H ++LF QMLR
Sbjct: 241 VHVGSSLLDMYARWGHMQEAEAIFNGLAAKNVVSWNALIAGHARKGEGEHVMKLFRQMLR 300

Query: 301 CDFEPTHFTYSSVFTACASCGSLEQGKWVHAHVIKSGGQPIAYIGNTLIDMYAKSGSIKD 360
            +FEPTHFTYSSVFTACAS GS EQGKWVHAHVIKSGGQPIAYIGNTLIDMYAKSGSIKD
Sbjct: 301 QNFEPTHFTYSSVFTACASSGSFEQGKWVHAHVIKSGGQPIAYIGNTLIDMYAKSGSIKD 360

Query: 361 AKKVFQRLVKQDVVSWNSIISGYAQHGLGVEALELFEEMLKAKVQPNEITFLSVLTACSH 420
           AKKVFQRLVKQDVVSWNSIISGYAQHGLG EAL+LFEEMLKAKVQPNEITFLSVLTACSH
Sbjct: 361 AKKVFQRLVKQDVVSWNSIISGYAQHGLGAEALQLFEEMLKAKVQPNEITFLSVLTACSH 420

Query: 421 SGLLDEGQYYFELMKKYGIEPQVAHYVTVVDLLGRAGRLNEANKFLIEMPIEPTAAVWGA 480
           SGLLDEGQYYFELMKKY IEPQV+H+VTVVDLLGRAGRL+EANKF+ EMPIEPTAAVWGA
Sbjct: 421 SGLLDEGQYYFELMKKYEIEPQVSHHVTVVDLLGRAGRLDEANKFIKEMPIEPTAAVWGA 480

Query: 481 LLGACRMHKNMDLGVYAAERIFELDPHDSGPHVLLSNIYASAGRLNDAANVRKMMKQSGV 540
           LLGACRMHKNMDLG YAAERIFELDPHDSGPHVLLSNIYASAGRLNDAANVRKMMK+SGV
Sbjct: 481 LLGACRMHKNMDLGAYAAERIFELDPHDSGPHVLLSNIYASAGRLNDAANVRKMMKESGV 540

Query: 541 KKEPACSWIEIENEVHTFVANDDSHPMREEIQKMWEKISGKIKEIGYVPDTSHVLFFMDQ 600
           KKEPACSW+EIENEVH FVAND+SHPMREEIQKMWEKISGKIKEIGYVPDTSHVLFFMDQ
Sbjct: 541 KKEPACSWVEIENEVHMFVANDESHPMREEIQKMWEKISGKIKEIGYVPDTSHVLFFMDQ 600

Query: 601 QDREVKLQYHSEKLALAFSVLKTPPGLTIRIKKNIRICGDCHSAFKFASKVLGREIIVRD 660
           QDREVKLQYHSEKLALAFSVLKTPPG TIRIKKNIRICGDCHSAFKFASKVL REIIVRD
Sbjct: 601 QDREVKLQYHSEKLALAFSVLKTPPGFTIRIKKNIRICGDCHSAFKFASKVLRREIIVRD 660

Query: 661 TNRFHHFLDGLCSCRDYW 667
           TNRFHHFLDGLCSCRDYW
Sbjct: 661 TNRFHHFLDGLCSCRDYW 671

BLAST of Tan0004354 vs. ExPASy TrEMBL
Match: A0A6J1IAI9 (pentatricopeptide repeat-containing protein At3g24000, mitochondrial OS=Cucurbita maxima OX=3661 GN=LOC111471639 PE=3 SV=1)

HSP 1 Score: 1238.4 bits (3203), Expect = 0.0e+00
Identity = 601/678 (88.64%), Positives = 631/678 (93.07%), Query Frame = 0

Query: 1   MKIAPLSPS------------LKNLRCFFPAANYGTGSAPCAFSESDTAVGRDWNAVIAA 60
           MKIAP+SPS            LK  +CFF AANYGTGS PC+ +ESD+A GRDWNA  AA
Sbjct: 1   MKIAPISPSSLQNLVLSDLPKLKPFKCFFSAANYGTGSPPCSLTESDSAEGRDWNAAAAA 60

Query: 61  APFTGVLQDEDLLRKTHISSSKTSTNSTGIYVLDLINHGSLEPDRTLYGKMLNKCTNLRK 120
            PFTGVLQDEDLLRKTHISSS+TST+STG+YVLDLINHG LEP+RTLY KMLNKCT+LRK
Sbjct: 61  VPFTGVLQDEDLLRKTHISSSETSTSSTGLYVLDLINHGKLEPERTLYSKMLNKCTHLRK 120

Query: 121 LKQGRAIHAHIQGSMFENDLVLQNFILNMYAKCGSLEEAQNMFDKMPTRDMVSWTVLISG 180
           LK GR IH+HIQGS FENDLV+QN ILNMYAKCGSLEEA N+FDKMPTRDMVSWTVLISG
Sbjct: 121 LKLGRVIHSHIQGSTFENDLVIQNSILNMYAKCGSLEEAHNLFDKMPTRDMVSWTVLISG 180

Query: 181 YSQSDRATEALALFPQMLHQGFQPNEFTLSSLLKASGAGCSDDLGRQLHAFSLKYGYDMN 240
           YSQS RA EAL LFPQM HQGFQPNEFTLSSLLKASGA  SD+ GRQLHAFSLKYG++MN
Sbjct: 181 YSQSGRAFEALGLFPQMFHQGFQPNEFTLSSLLKASGASPSDEHGRQLHAFSLKYGFNMN 240

Query: 241 VHVGSSLLDMYARWGHMREAEAIFNGLAAKNVVSWNALIAGHARKGEGDHAIRLFWQMLR 300
           VHVGSSLLDMYARWGHM+EAEAIFNGLAAKNVVSWNALIAGHARKGEG+H ++LF QMLR
Sbjct: 241 VHVGSSLLDMYARWGHMQEAEAIFNGLAAKNVVSWNALIAGHARKGEGEHVMKLFRQMLR 300

Query: 301 CDFEPTHFTYSSVFTACASCGSLEQGKWVHAHVIKSGGQPIAYIGNTLIDMYAKSGSIKD 360
            +FEPTHFTYSSVFTACAS GS EQGKWVHAHVIKSGGQP+AYIGNTLIDMYAKSGSIKD
Sbjct: 301 QNFEPTHFTYSSVFTACASSGSFEQGKWVHAHVIKSGGQPVAYIGNTLIDMYAKSGSIKD 360

Query: 361 AKKVFQRLVKQDVVSWNSIISGYAQHGLGVEALELFEEMLKAKVQPNEITFLSVLTACSH 420
           AKKVFQRLVKQDVVSWNSIISGYAQHGLG EAL+LFEEMLKAKVQPNEITFLSVLTACSH
Sbjct: 361 AKKVFQRLVKQDVVSWNSIISGYAQHGLGAEALQLFEEMLKAKVQPNEITFLSVLTACSH 420

Query: 421 SGLLDEGQYYFELMKKYGIEPQVAHYVTVVDLLGRAGRLNEANKFLIEMPIEPTAAVWGA 480
           SGLLDEGQYYFELMKKY IEPQV+H+VTVVDLLGRAGRL+EANKF+ EMPIEPTAAVWGA
Sbjct: 421 SGLLDEGQYYFELMKKYEIEPQVSHHVTVVDLLGRAGRLDEANKFIKEMPIEPTAAVWGA 480

Query: 481 LLGACRMHKNMDLGVYAAERIFELDPHDSGPHVLLSNIYASAGRLNDAANVRKMMKQSGV 540
           LLGACRMHKNMDLG YAAERIFELDPHDSGPHVLLSNIYASAGRLNDAANVRKMMK+SGV
Sbjct: 481 LLGACRMHKNMDLGAYAAERIFELDPHDSGPHVLLSNIYASAGRLNDAANVRKMMKESGV 540

Query: 541 KKEPACSWIEIENEVHTFVANDDSHPMREEIQKMWEKISGKIKEIGYVPDTSHVLFFMDQ 600
           KKEPACSW+EIEN VH FVAND+SHPMREEIQKMWEKISGKIKEIGYVPDTSHVLFFMDQ
Sbjct: 541 KKEPACSWVEIENGVHMFVANDESHPMREEIQKMWEKISGKIKEIGYVPDTSHVLFFMDQ 600

Query: 601 QDREVKLQYHSEKLALAFSVLKTPPGLTIRIKKNIRICGDCHSAFKFASKVLGREIIVRD 660
           QDREVKLQYHSEKLALAFSVLKTPPG TIRIKKNIRICGDCHSAFKFASKVLGREIIVRD
Sbjct: 601 QDREVKLQYHSEKLALAFSVLKTPPGFTIRIKKNIRICGDCHSAFKFASKVLGREIIVRD 660

Query: 661 TNRFHHFLDGLCSCRDYW 667
           TNRFHHFLDGLCSCRDYW
Sbjct: 661 TNRFHHFLDGLCSCRDYW 678

BLAST of Tan0004354 vs. ExPASy TrEMBL
Match: A0A6J1CNX3 (pentatricopeptide repeat-containing protein At3g24000, mitochondrial OS=Momordica charantia OX=3673 GN=LOC111012750 PE=3 SV=1)

HSP 1 Score: 1226.8 bits (3173), Expect = 0.0e+00
Identity = 600/678 (88.50%), Positives = 631/678 (93.07%), Query Frame = 0

Query: 1   MKIAPLS-----------PSLKNLRCFFPAANYGTG-SAPCAFSESDTAVGRDWNAVIAA 60
           MKIAPLS           P LK L+CFF AANYG G +APCAFSESDTA GR+WNA +AA
Sbjct: 1   MKIAPLSLSSLKNLVFSDPKLKPLKCFFFAANYGAGPAAPCAFSESDTAEGREWNAAVAA 60

Query: 61  APFTGVLQDEDLLRKTHISSSKTSTNSTGIYVLDLINHGSLEPDRTLYGKMLNKCTNLRK 120
            PF GVLQDEDLLRKTHI S +TST STG+YVLDL+NHGSLEPDRTLY KMLNKCT+LRK
Sbjct: 61  TPFNGVLQDEDLLRKTHI-SPQTSTPSTGLYVLDLLNHGSLEPDRTLYSKMLNKCTHLRK 120

Query: 121 LKQGRAIHAHIQGSMFENDLVLQNFILNMYAKCGSLEEAQNMFDKMPTRDMVSWTVLISG 180
           LKQGR IHAHIQGS FE+DLVLQNFILNMYAKCGS+EEA+N+FDKMPTRDMVSWTV+ISG
Sbjct: 121 LKQGRVIHAHIQGSSFESDLVLQNFILNMYAKCGSVEEARNVFDKMPTRDMVSWTVMISG 180

Query: 181 YSQSDRATEALALFPQMLHQGFQPNEFTLSSLLKASGAGCSDDLGRQLHAFSLKYGYDMN 240
           +SQS  A+EALALFPQMLHQGFQPNEFTLSSLLKASG G SDD GRQLHAFSLKYGYD+N
Sbjct: 181 FSQSGLASEALALFPQMLHQGFQPNEFTLSSLLKASGTGPSDDHGRQLHAFSLKYGYDVN 240

Query: 241 VHVGSSLLDMYARWGHMREAEAIFNGLAAKNVVSWNALIAGHARKGEGDHAIRLFWQMLR 300
           VHVGSSLLDMYAR GHMREA+AIF+GLA KNVVSWNALIAGHARKGEG+H +RLFWQMLR
Sbjct: 241 VHVGSSLLDMYARCGHMREAKAIFDGLAGKNVVSWNALIAGHARKGEGEHVMRLFWQMLR 300

Query: 301 CDFEPTHFTYSSVFTACASCGSLEQGKWVHAHVIKSGGQPIAYIGNTLIDMYAKSGSIKD 360
            D EPTHFTYSSVF+ACAS GSLEQGKWVHAHVIKSGGQPIAYIGNTLIDMYAKSGSIKD
Sbjct: 301 QDLEPTHFTYSSVFSACASSGSLEQGKWVHAHVIKSGGQPIAYIGNTLIDMYAKSGSIKD 360

Query: 361 AKKVFQRLVKQDVVSWNSIISGYAQHGLGVEALELFEEMLKAKVQPNEITFLSVLTACSH 420
           AKKVFQRLVKQDVVSWNSIISGYAQHGLGVEAL+LFEEMLK KVQPN+ITFLSVLTACSH
Sbjct: 361 AKKVFQRLVKQDVVSWNSIISGYAQHGLGVEALQLFEEMLKVKVQPNQITFLSVLTACSH 420

Query: 421 SGLLDEGQYYFELMKKYGIEPQVAHYVTVVDLLGRAGRLNEANKFLIEMPIEPTAAVWGA 480
           SGLLDEGQYYFELMK Y IEPQ+AHYVTVVDLLGRAGRLNEAN F+ EMP++PTAAVWGA
Sbjct: 421 SGLLDEGQYYFELMKNYEIEPQIAHYVTVVDLLGRAGRLNEANNFIKEMPVKPTAAVWGA 480

Query: 481 LLGACRMHKNMDLGVYAAERIFELDPHDSGPHVLLSNIYASAGRLNDAANVRKMMKQSGV 540
           LLGA RMHKNMDLGVYAAERIFELDPHDSGPHVLLSNIYASAGRLNDAANVRKMMK+SGV
Sbjct: 481 LLGASRMHKNMDLGVYAAERIFELDPHDSGPHVLLSNIYASAGRLNDAANVRKMMKESGV 540

Query: 541 KKEPACSWIEIENEVHTFVANDDSHPMREEIQKMWEKISGKIKEIGYVPDTSHVLFFMDQ 600
           KKEPACSW+EIENEVH FVANDDSHPMREEIQ+MWEKISGKI+EIGYVPDTSHVLFFMDQ
Sbjct: 541 KKEPACSWVEIENEVHMFVANDDSHPMREEIQRMWEKISGKIREIGYVPDTSHVLFFMDQ 600

Query: 601 QDREVKLQYHSEKLALAFSVLKTPPGLTIRIKKNIRICGDCHSAFKFASKVLGREIIVRD 660
           QDREVKLQYHSEKLALAFSVLKTPPG TIRIKKNIRICGDCHSAFKFASKVL REIIVRD
Sbjct: 601 QDREVKLQYHSEKLALAFSVLKTPPGFTIRIKKNIRICGDCHSAFKFASKVLRREIIVRD 660

Query: 661 TNRFHHFLDGLCSCRDYW 667
           TNRFHHFL GLCSCRDYW
Sbjct: 661 TNRFHHFLVGLCSCRDYW 677

BLAST of Tan0004354 vs. ExPASy TrEMBL
Match: A0A6J1EHG0 (pentatricopeptide repeat-containing protein At3g24000, mitochondrial OS=Cucurbita moschata OX=3662 GN=LOC111434330 PE=3 SV=1)

HSP 1 Score: 1212.2 bits (3135), Expect = 0.0e+00
Identity = 595/678 (87.76%), Positives = 621/678 (91.59%), Query Frame = 0

Query: 1   MKIAPLS------------PSLKNLRCFFPAANYGTGSAPCAFSESDTAVGRDWNAVIAA 60
           MKIAP+S            P LK  + FF AANYGTG  PC+F+ESD+A         AA
Sbjct: 1   MKIAPISSSSLKNLVLSDLPKLKPFKWFFSAANYGTGPPPCSFTESDSAAA-------AA 60

Query: 61  APFTGVLQDEDLLRKTHISSSKTSTNSTGIYVLDLINHGSLEPDRTLYGKMLNKCTNLRK 120
            PFTGVLQDEDLLRKTH+SSS+TSTNSTG+YVLDLINHG LEP+RTLY KMLNKCT LRK
Sbjct: 61  VPFTGVLQDEDLLRKTHMSSSETSTNSTGLYVLDLINHGKLEPERTLYSKMLNKCTLLRK 120

Query: 121 LKQGRAIHAHIQGSMFENDLVLQNFILNMYAKCGSLEEAQNMFDKMPTRDMVSWTVLISG 180
           LK GR IH+HIQGS FENDLV+QN ILNMYAKCGSLEEA N+FDKMPTRDMVSWTVLISG
Sbjct: 121 LKLGRVIHSHIQGSTFENDLVIQNSILNMYAKCGSLEEAHNLFDKMPTRDMVSWTVLISG 180

Query: 181 YSQSDRATEALALFPQMLHQGFQPNEFTLSSLLKASGAGCSDDLGRQLHAFSLKYGYDMN 240
           YSQS RA EAL LFPQM HQGFQPNEFTLSSLLKASGA  SDD GRQLHAFSLKYG+DMN
Sbjct: 181 YSQSGRAFEALGLFPQMFHQGFQPNEFTLSSLLKASGASPSDDHGRQLHAFSLKYGFDMN 240

Query: 241 VHVGSSLLDMYARWGHMREAEAIFNGLAAKNVVSWNALIAGHARKGEGDHAIRLFWQMLR 300
           VHVGSSLLDMYARWGHM+EAEAIFNGLAAKNVVSWNALIAGHARKGEG+H ++LF QMLR
Sbjct: 241 VHVGSSLLDMYARWGHMQEAEAIFNGLAAKNVVSWNALIAGHARKGEGEHVMKLFRQMLR 300

Query: 301 CDFEPTHFTYSSVFTACASCGSLEQGKWVHAHVIKSGGQPIAYIGNTLIDMYAKSGSIKD 360
            +FEPTHFTYSSVFTACAS GS EQGKWVHAHVIKSGGQPIAYIGNTLIDMYAKSGSIKD
Sbjct: 301 QNFEPTHFTYSSVFTACASSGSFEQGKWVHAHVIKSGGQPIAYIGNTLIDMYAKSGSIKD 360

Query: 361 AKKVFQRLVKQDVVSWNSIISGYAQHGLGVEALELFEEMLKAKVQPNEITFLSVLTACSH 420
           AKKVFQRLVKQDVVSWNSIISGYAQHGLG EAL+LFEEMLKAKVQPNEITFLSVLTACSH
Sbjct: 361 AKKVFQRLVKQDVVSWNSIISGYAQHGLGAEALQLFEEMLKAKVQPNEITFLSVLTACSH 420

Query: 421 SGLLDEGQYYFELMKKYGIEPQVAHYVTVVDLLGRAGRLNEANKFLIEMPIEPTAAVWGA 480
           SGLLDEGQYYFELMKKY IEPQV+H+VTVVDLLGRAGRL+EANKF+ EMPIEPTAAVWGA
Sbjct: 421 SGLLDEGQYYFELMKKYEIEPQVSHHVTVVDLLGRAGRLDEANKFIKEMPIEPTAAVWGA 480

Query: 481 LLGACRMHKNMDLGVYAAERIFELDPHDSGPHVLLSNIYASAGRLNDAANVRKMMKQSGV 540
           LLGACRMHKNMDLG YAAERIFELDPHDSGPHVLLSNIYASAGRLNDAANVRKMMK+SGV
Sbjct: 481 LLGACRMHKNMDLGAYAAERIFELDPHDSGPHVLLSNIYASAGRLNDAANVRKMMKESGV 540

Query: 541 KKEPACSWIEIENEVHTFVANDDSHPMREEIQKMWEKISGKIKEIGYVPDTSHVLFFMDQ 600
           KKEPACSW+EIENEVH FVAND+SHPMREEIQKMWEKISGKIKEIGYVPDTSHVLFFMDQ
Sbjct: 541 KKEPACSWVEIENEVHMFVANDESHPMREEIQKMWEKISGKIKEIGYVPDTSHVLFFMDQ 600

Query: 601 QDREVKLQYHSEKLALAFSVLKTPPGLTIRIKKNIRICGDCHSAFKFASKVLGREIIVRD 660
           QDREVKLQYHSEKLALAFSVLKTPPG TIRIKKNIRICGDCHSAFKFASKVL REIIVRD
Sbjct: 601 QDREVKLQYHSEKLALAFSVLKTPPGFTIRIKKNIRICGDCHSAFKFASKVLRREIIVRD 660

Query: 661 TNRFHHFLDGLCSCRDYW 667
           TNRFHHF DGLCSCRDYW
Sbjct: 661 TNRFHHFHDGLCSCRDYW 671

BLAST of Tan0004354 vs. ExPASy TrEMBL
Match: A0A5D3CU56 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold477G00540 PE=3 SV=1)

HSP 1 Score: 1203.0 bits (3111), Expect = 0.0e+00
Identity = 585/657 (89.04%), Positives = 616/657 (93.76%), Query Frame = 0

Query: 10  LKNLRCFFPAANYGTGSAPCAFSESDTAVGRDWNAVIAAAPFTGVLQDEDLLRKTHISSS 69
           LK LRCF  AA YGTG APCAF+ES+ A  +DWN   A APFTGVLQDEDLLR THISSS
Sbjct: 24  LKTLRCFLFAAKYGTGLAPCAFTESNMAESQDWNP--ATAPFTGVLQDEDLLRTTHISSS 83

Query: 70  KTSTNSTGIYVLDLINHGSLEPDRTLYGKMLNKCTNLRKLKQGRAIHAHIQGSMFENDLV 129
             S++STG+YVLDLIN GSLEP+RTLY KMLNKCT LRKLKQGRAIHAHIQ S FEND V
Sbjct: 84  DVSSSSTGLYVLDLINCGSLEPERTLYSKMLNKCTYLRKLKQGRAIHAHIQSSAFENDPV 143

Query: 130 LQNFILNMYAKCGSLEEAQNMFDKMPTRDMVSWTVLISGYSQSDRATEALALFPQMLHQG 189
           L NFILNMYAKCGSLEEAQ++FDKMPT+D VSWTVLISGYSQS RA+EALALFP+MLH G
Sbjct: 144 LLNFILNMYAKCGSLEEAQDLFDKMPTKDRVSWTVLISGYSQSRRASEALALFPKMLHLG 203

Query: 190 FQPNEFTLSSLLKASGAGCSDDLGRQLHAFSLKYGYDMNVHVGSSLLDMYARWGHMREAE 249
           FQPNEFTLSSLLKASGAG SDD GRQLHAFSLKYGYDMNVHVGSSLLDMYARWGHMREA+
Sbjct: 204 FQPNEFTLSSLLKASGAGPSDDHGRQLHAFSLKYGYDMNVHVGSSLLDMYARWGHMREAK 263

Query: 250 AIFNGLAAKNVVSWNALIAGHARKGEGDHAIRLFWQMLRCDFEPTHFTYSSVFTACASCG 309
            IF  LAAKNVVSWNALIAGHARKGEG+H +RLF QMLR  FEPTHFTYSSVFTACAS G
Sbjct: 264 VIFKSLAAKNVVSWNALIAGHARKGEGEHVMRLFSQMLRQGFEPTHFTYSSVFTACASSG 323

Query: 310 SLEQGKWVHAHVIKSGGQPIAYIGNTLIDMYAKSGSIKDAKKVFQRLVKQDVVSWNSIIS 369
           SLEQGKWVHAHVIKSGGQPIAYIGNTLIDMYAKSGSIKDAKKVFQRLVK+D+VSWNSIIS
Sbjct: 324 SLEQGKWVHAHVIKSGGQPIAYIGNTLIDMYAKSGSIKDAKKVFQRLVKRDIVSWNSIIS 383

Query: 370 GYAQHGLGVEALELFEEMLKAKVQPNEITFLSVLTACSHSGLLDEGQYYFELMKKYGIEP 429
           GYAQHGLG EAL+LFE++LKAKVQPNEITFLSVLTACSHSGLLDEG+YYFELMKK+GIEP
Sbjct: 384 GYAQHGLGAEALQLFEQVLKAKVQPNEITFLSVLTACSHSGLLDEGKYYFELMKKHGIEP 443

Query: 430 QVAHYVTVVDLLGRAGRLNEANKFLIEMPIEPTAAVWGALLGACRMHKNMDLGVYAAERI 489
           QVAH+VTVVDLLGRAGRLNEANKF+ EMP+EPTAAVWGALLGACRMHKNMDLGVYAAE+I
Sbjct: 444 QVAHHVTVVDLLGRAGRLNEANKFIEEMPMEPTAAVWGALLGACRMHKNMDLGVYAAEKI 503

Query: 490 FELDPHDSGPHVLLSNIYASAGRLNDAANVRKMMKQSGVKKEPACSWIEIENEVHTFVAN 549
           FELDPHDSGPHVLLSNIYASAGRL DA NVRKMMK+SGVKKEPACSW+EIENEVH FVAN
Sbjct: 504 FELDPHDSGPHVLLSNIYASAGRLRDAGNVRKMMKESGVKKEPACSWVEIENEVHMFVAN 563

Query: 550 DDSHPMREEIQKMWEKISGKIKEIGYVPDTSHVLFFMDQQDREVKLQYHSEKLALAFSVL 609
           DDSHPMREEIQ+MWEKISGKIKEIGYVPDTSHVLFFMDQQDREVKLQYHSEKLALAF+VL
Sbjct: 564 DDSHPMREEIQRMWEKISGKIKEIGYVPDTSHVLFFMDQQDREVKLQYHSEKLALAFAVL 623

Query: 610 KTPPGLTIRIKKNIRICGDCHSAFKFASKVLGREIIVRDTNRFHHFLDGLCSCRDYW 667
           KTPPGLTIRIKKNIRICGDCHSAFKFASKVLGREIIVRDTNRFHHFL G+CSCRDYW
Sbjct: 624 KTPPGLTIRIKKNIRICGDCHSAFKFASKVLGREIIVRDTNRFHHFLHGMCSCRDYW 678

BLAST of Tan0004354 vs. ExPASy TrEMBL
Match: A0A1S3CMN0 (pentatricopeptide repeat-containing protein At3g24000, mitochondrial isoform X1 OS=Cucumis melo OX=3656 GN=LOC103502212 PE=3 SV=1)

HSP 1 Score: 1203.0 bits (3111), Expect = 0.0e+00
Identity = 585/657 (89.04%), Positives = 616/657 (93.76%), Query Frame = 0

Query: 10  LKNLRCFFPAANYGTGSAPCAFSESDTAVGRDWNAVIAAAPFTGVLQDEDLLRKTHISSS 69
           LK LRCF  AA YGTG APCAF+ES+ A  +DWN   A APFTGVLQDEDLLR THISSS
Sbjct: 26  LKTLRCFLFAAKYGTGLAPCAFTESNMAESQDWNP--ATAPFTGVLQDEDLLRTTHISSS 85

Query: 70  KTSTNSTGIYVLDLINHGSLEPDRTLYGKMLNKCTNLRKLKQGRAIHAHIQGSMFENDLV 129
             S++STG+YVLDLIN GSLEP+RTLY KMLNKCT LRKLKQGRAIHAHIQ S FEND V
Sbjct: 86  DVSSSSTGLYVLDLINCGSLEPERTLYSKMLNKCTYLRKLKQGRAIHAHIQSSAFENDPV 145

Query: 130 LQNFILNMYAKCGSLEEAQNMFDKMPTRDMVSWTVLISGYSQSDRATEALALFPQMLHQG 189
           L NFILNMYAKCGSLEEAQ++FDKMPT+D VSWTVLISGYSQS RA+EALALFP+MLH G
Sbjct: 146 LLNFILNMYAKCGSLEEAQDLFDKMPTKDRVSWTVLISGYSQSRRASEALALFPKMLHLG 205

Query: 190 FQPNEFTLSSLLKASGAGCSDDLGRQLHAFSLKYGYDMNVHVGSSLLDMYARWGHMREAE 249
           FQPNEFTLSSLLKASGAG SDD GRQLHAFSLKYGYDMNVHVGSSLLDMYARWGHMREA+
Sbjct: 206 FQPNEFTLSSLLKASGAGPSDDHGRQLHAFSLKYGYDMNVHVGSSLLDMYARWGHMREAK 265

Query: 250 AIFNGLAAKNVVSWNALIAGHARKGEGDHAIRLFWQMLRCDFEPTHFTYSSVFTACASCG 309
            IF  LAAKNVVSWNALIAGHARKGEG+H +RLF QMLR  FEPTHFTYSSVFTACAS G
Sbjct: 266 VIFKSLAAKNVVSWNALIAGHARKGEGEHVMRLFSQMLRQGFEPTHFTYSSVFTACASSG 325

Query: 310 SLEQGKWVHAHVIKSGGQPIAYIGNTLIDMYAKSGSIKDAKKVFQRLVKQDVVSWNSIIS 369
           SLEQGKWVHAHVIKSGGQPIAYIGNTLIDMYAKSGSIKDAKKVFQRLVK+D+VSWNSIIS
Sbjct: 326 SLEQGKWVHAHVIKSGGQPIAYIGNTLIDMYAKSGSIKDAKKVFQRLVKRDIVSWNSIIS 385

Query: 370 GYAQHGLGVEALELFEEMLKAKVQPNEITFLSVLTACSHSGLLDEGQYYFELMKKYGIEP 429
           GYAQHGLG EAL+LFE++LKAKVQPNEITFLSVLTACSHSGLLDEG+YYFELMKK+GIEP
Sbjct: 386 GYAQHGLGAEALQLFEQVLKAKVQPNEITFLSVLTACSHSGLLDEGKYYFELMKKHGIEP 445

Query: 430 QVAHYVTVVDLLGRAGRLNEANKFLIEMPIEPTAAVWGALLGACRMHKNMDLGVYAAERI 489
           QVAH+VTVVDLLGRAGRLNEANKF+ EMP+EPTAAVWGALLGACRMHKNMDLGVYAAE+I
Sbjct: 446 QVAHHVTVVDLLGRAGRLNEANKFIEEMPMEPTAAVWGALLGACRMHKNMDLGVYAAEKI 505

Query: 490 FELDPHDSGPHVLLSNIYASAGRLNDAANVRKMMKQSGVKKEPACSWIEIENEVHTFVAN 549
           FELDPHDSGPHVLLSNIYASAGRL DA NVRKMMK+SGVKKEPACSW+EIENEVH FVAN
Sbjct: 506 FELDPHDSGPHVLLSNIYASAGRLRDAGNVRKMMKESGVKKEPACSWVEIENEVHMFVAN 565

Query: 550 DDSHPMREEIQKMWEKISGKIKEIGYVPDTSHVLFFMDQQDREVKLQYHSEKLALAFSVL 609
           DDSHPMREEIQ+MWEKISGKIKEIGYVPDTSHVLFFMDQQDREVKLQYHSEKLALAF+VL
Sbjct: 566 DDSHPMREEIQRMWEKISGKIKEIGYVPDTSHVLFFMDQQDREVKLQYHSEKLALAFAVL 625

Query: 610 KTPPGLTIRIKKNIRICGDCHSAFKFASKVLGREIIVRDTNRFHHFLDGLCSCRDYW 667
           KTPPGLTIRIKKNIRICGDCHSAFKFASKVLGREIIVRDTNRFHHFL G+CSCRDYW
Sbjct: 626 KTPPGLTIRIKKNIRICGDCHSAFKFASKVLGREIIVRDTNRFHHFLHGMCSCRDYW 680

BLAST of Tan0004354 vs. TAIR 10
Match: AT3G24000.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 801.6 bits (2069), Expect = 4.8e-232
Identity = 388/610 (63.61%), Positives = 473/610 (77.54%), Query Frame = 0

Query: 49  APFTGVLQDEDLLRKTHISSSKTSTNSTGIYVLDLINHGSLEP-DRTLYGKMLNKCTNLR 108
           AP +   +DE L   ++    +TS+N       DL   GS  P DR  Y  +L KCT  +
Sbjct: 24  APVSEDSEDESLKFPSNDLLLRTSSN-------DL--EGSYIPADRRFYNTLLKKCTVFK 83

Query: 109 KLKQGRAIHAHIQGSMFENDLVLQNFILNMYAKCGSLEEAQNMFDKMPTRDMVSWTVLIS 168
            L QGR +HAHI  S+F +D+V+ N +LNMYAKCGSLEEA+ +F+KMP RD V+WT LIS
Sbjct: 84  LLIQGRIVHAHILQSIFRHDIVMGNTLLNMYAKCGSLEEARKVFEKMPQRDFVTWTTLIS 143

Query: 169 GYSQSDRATEALALFPQMLHQGFQPNEFTLSSLLKASGAGCSDDLGRQLHAFSLKYGYDM 228
           GYSQ DR  +AL  F QML  G+ PNEFTLSS++KA+ A      G QLH F +K G+D 
Sbjct: 144 GYSQHDRPCDALLFFNQMLRFGYSPNEFTLSSVIKAAAAERRGCCGHQLHGFCVKCGFDS 203

Query: 229 NVHVGSSLLDMYARWGHMREAEAIFNGLAAKNVVSWNALIAGHARKGEGDHAIRLFWQML 288
           NVHVGS+LLD+Y R+G M +A+ +F+ L ++N VSWNALIAGHAR+   + A+ LF  ML
Sbjct: 204 NVHVGSALLDLYTRYGLMDDAQLVFDALESRNDVSWNALIAGHARRSGTEKALELFQGML 263

Query: 289 RCDFEPTHFTYSSVFTACASCGSLEQGKWVHAHVIKSGGQPIAYIGNTLIDMYAKSGSIK 348
           R  F P+HF+Y+S+F AC+S G LEQGKWVHA++IKSG + +A+ GNTL+DMYAKSGSI 
Sbjct: 264 RDGFRPSHFSYASLFGACSSTGFLEQGKWVHAYMIKSGEKLVAFAGNTLLDMYAKSGSIH 323

Query: 349 DAKKVFQRLVKQDVVSWNSIISGYAQHGLGVEALELFEEMLKAKVQPNEITFLSVLTACS 408
           DA+K+F RL K+DVVSWNS+++ YAQHG G EA+  FEEM +  ++PNEI+FLSVLTACS
Sbjct: 324 DARKIFDRLAKRDVVSWNSLLTAYAQHGFGKEAVWWFEEMRRVGIRPNEISFLSVLTACS 383

Query: 409 HSGLLDEGQYYFELMKKYGIEPQVAHYVTVVDLLGRAGRLNEANKFLIEMPIEPTAAVWG 468
           HSGLLDEG +Y+ELMKK GI P+  HYVTVVDLLGRAG LN A +F+ EMPIEPTAA+W 
Sbjct: 384 HSGLLDEGWHYYELMKKDGIVPEAWHYVTVVDLLGRAGDLNRALRFIEEMPIEPTAAIWK 443

Query: 469 ALLGACRMHKNMDLGVYAAERIFELDPHDSGPHVLLSNIYASAGRLNDAANVRKMMKQSG 528
           ALL ACRMHKN +LG YAAE +FELDP D GPHV+L NIYAS GR NDAA VRK MK+SG
Sbjct: 444 ALLNACRMHKNTELGAYAAEHVFELDPDDPGPHVILYNIYASGGRWNDAARVRKKMKESG 503

Query: 529 VKKEPACSWIEIENEVHTFVANDDSHPMREEIQKMWEKISGKIKEIGYVPDTSHVLFFMD 588
           VKKEPACSW+EIEN +H FVAND+ HP REEI + WE++  KIKE+GYVPDTSHV+  +D
Sbjct: 504 VKKEPACSWVEIENAIHMFVANDERHPQREEIARKWEEVLAKIKELGYVPDTSHVIVHVD 563

Query: 589 QQDREVKLQYHSEKLALAFSVLKTPPGLTIRIKKNIRICGDCHSAFKFASKVLGREIIVR 648
           QQ+REV LQYHSEK+ALAF++L TPPG TI IKKNIR+CGDCH+A K ASKV+GREIIVR
Sbjct: 564 QQEREVNLQYHSEKIALAFALLNTPPGSTIHIKKNIRVCGDCHTAIKLASKVVGREIIVR 623

Query: 649 DTNRFHHFLD 658
           DTNRFHHF D
Sbjct: 624 DTNRFHHFKD 624

BLAST of Tan0004354 vs. TAIR 10
Match: AT3G13770.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 528.5 bits (1360), Expect = 7.9e-150
Identity = 270/630 (42.86%), Positives = 403/630 (63.97%), Query Frame = 0

Query: 59  DLLRKTHISSSKTSTN---STGIYVLDLINHGSLE----------PDRTLYG--KMLNKC 118
           +L+R  H S S + TN    T + +  L ++G L+          P+   +G   +LN C
Sbjct: 3   NLMRLIHRSFSSSPTNYVLQTILPISQLCSNGRLQEALLEMAMLGPEMGFHGYDALLNAC 62

Query: 119 TNLRKLKQGRAIHAHIQGSMFENDLVLQNFILNMYAKCGSLEEAQNMFDKMPTRDMVSWT 178
            + R L+ G+ +HAH+  + +     L+  +L  Y KC  LE+A+ + D+MP +++VSWT
Sbjct: 63  LDKRALRDGQRVHAHMIKTRYLPATYLRTRLLIFYGKCDCLEDARKVLDEMPEKNVVSWT 122

Query: 179 VLISGYSQSDRATEALALFPQMLHQGFQPNEFT----LSSLLKASGAGCSDDLGRQLHAF 238
            +IS YSQ+  ++EAL +F +M+    +PNEFT    L+S ++ASG G    LG+Q+H  
Sbjct: 123 AMISRYSQTGHSSEALTVFAEMMRSDGKPNEFTFATVLTSCIRASGLG----LGKQIHGL 182

Query: 239 SLKYGYDMNVHVGSSLLDMYARWGHMREAEAIFNGLAAKNVVSWNALIAGHARKGEGDHA 298
            +K+ YD ++ VGSSLLDMYA+ G ++EA  IF  L  ++VVS  A+IAG+A+ G  + A
Sbjct: 183 IVKWNYDSHIFVGSSLLDMYAKAGQIKEAREIFECLPERDVVSCTAIIAGYAQLGLDEEA 242

Query: 299 IRLFWQMLRCDFEPTHFTYSSVFTACASCGSLEQGKWVHAHVIKSGGQPIAYIGNTLIDM 358
           + +F ++      P + TY+S+ TA +    L+ GK  H HV++      A + N+LIDM
Sbjct: 243 LEMFHRLHSEGMSPNYVTYASLLTALSGLALLDHGKQAHCHVLRRELPFYAVLQNSLIDM 302

Query: 359 YAKSGSIKDAKKVFQRLVKQDVVSWNSIISGYAQHGLGVEALELFEEMLKAK-VQPNEIT 418
           Y+K G++  A+++F  + ++  +SWN+++ GY++HGLG E LELF  M   K V+P+ +T
Sbjct: 303 YSKCGNLSYARRLFDNMPERTAISWNAMLVGYSKHGLGREVLELFRLMRDEKRVKPDAVT 362

Query: 419 FLSVLTACSHSGLLDEGQYYFELM--KKYGIEPQVAHYVTVVDLLGRAGRLNEANKFLIE 478
            L+VL+ CSH  + D G   F+ M   +YG +P   HY  +VD+LGRAGR++EA +F+  
Sbjct: 363 LLAVLSGCSHGRMEDTGLNIFDGMVAGEYGTKPGTEHYGCIVDMLGRAGRIDEAFEFIKR 422

Query: 479 MPIEPTAAVWGALLGACRMHKNMDLGVYAAERIFELDPHDSGPHVLLSNIYASAGRLNDA 538
           MP +PTA V G+LLGACR+H ++D+G     R+ E++P ++G +V+LSN+YASAGR  D 
Sbjct: 423 MPSKPTAGVLGSLLGACRVHLSVDIGESVGRRLIEIEPENAGNYVILSNLYASAGRWADV 482

Query: 539 ANVRKMMKQSGVKKEPACSWIEIENEVHTFVANDDSHPMREEIQKMWEKISGKIKEIGYV 598
            NVR MM Q  V KEP  SWI+ E  +H F AND +HP REE+    ++IS K+K+ GYV
Sbjct: 483 NNVRAMMMQKAVTKEPGRSWIQHEQTLHYFHANDRTHPRREEVLAKMKEISIKMKQAGYV 542

Query: 599 PDTSHVLFFMDQQDREVKLQYHSEKLALAFSVLKTPPGLTIRIKKNIRICGDCHSAFKFA 658
           PD S VL+ +D++ +E  L  HSEKLAL F ++ T  G+ IR+ KN+RIC DCH+  K  
Sbjct: 543 PDLSCVLYDVDEEQKEKMLLGHSEKLALTFGLIATGEGIPIRVFKNLRICVDCHNFAKIF 602

Query: 659 SKVLGREIIVRDTNRFHHFLDGLCSCRDYW 667
           SKV  RE+ +RD NRFH  +DG+CSC DYW
Sbjct: 603 SKVFEREVSLRDKNRFHQIVDGICSCGDYW 628
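
The percentages in an HSP summary follow directly from the counts: identities (or positives) divided by the alignment length. As a minimal sketch, assuming the two-line summary layout shown above, the following Python reads one summary and recomputes both percentages. The function name `parse_hsp` and the regular expression are illustrative and are not part of any BLAST tooling.

```python
import re

# Regex for the two-line HSP summary printed above each alignment.
# It also tolerates the "Postives" misspelling seen in some report renderings.
HSP_RE = re.compile(
    r"Score:\s*(?P<bits>[\d.]+) bits \((?P<raw>\d+)\), Expect = (?P<evalue>[\d.eE+-]+)\s+"
    r"Identity = (?P<ident>\d+)/(?P<length>\d+) \([\d.]+%\), "
    r"Pos(?:i)?tives = (?P<pos>\d+)/\d+"
)

def parse_hsp(text):
    """Return scores and recomputed percentages for one HSP summary (hypothetical helper)."""
    m = HSP_RE.search(text)
    if m is None:
        raise ValueError("no HSP summary found")
    length = int(m["length"])  # alignment length, gaps included
    return {
        "bits": float(m["bits"]),
        "raw_score": int(m["raw"]),
        "evalue": float(m["evalue"]),
        "identity_pct": round(100 * int(m["ident"]) / length, 2),
        "positives_pct": round(100 * int(m["pos"]) / length, 2),
    }

summary = (
    "HSP 1 Score: 528.5 bits (1360), Expect = 7.9e-150\n"
    "Identity = 270/630 (42.86%), Positives = 403/630 (63.97%), Query Frame = 0"
)
hsp = parse_hsp(summary)
print(hsp["identity_pct"], hsp["positives_pct"])  # 42.86 63.97
```

The recomputed values agree with the percentages reported for the AT3G13770.1 hit, which is a quick consistency check when such summaries are copied between reports.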

BLAST of Tan0004354 vs. TAIR 10
Match: AT3G23330.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 515.8 bits (1327), Expect = 5.3e-146
Identity = 255/613 (41.60%), Positives = 378/613 (61.66%), Query Frame = 0

Query: 91  PDRTLYGKMLNKCTNLRKLKQGRAIHAHIQGSMFENDLVLQNFILNMYAK---CGSLEEA 150
           PD  ++  +L  CT +  L+ G ++H  I     + DL   N ++NMYAK    GS    
Sbjct: 103 PDHNVFPSVLKSCTMMMDLRFGESVHGFIVRLGMDCDLYTGNALMNMYAKLLGMGSKISV 162

Query: 151 QNMFDKMPTR---------------------------------DMVSWTVLISGYSQSDR 210
            N+FD+MP R                                 D+VS+  +I+GY+QS  
Sbjct: 163 GNVFDEMPQRTSNSGDEDVKAETCIMPFGIDSVRRVFEVMPRKDVVSYNTIIAGYAQSGM 222

Query: 211 ATEALALFPQMLHQGFQPNEFTLSSLLKASGAGCSDDLGRQLHAFSLKYGYDMNVHVGSS 270
             +AL +  +M     +P+ FTLSS+L           G+++H + ++ G D +V++GSS
Sbjct: 223 YEDALRMVREMGTTDLKPDSFTLSSVLPIFSEYVDVIKGKEIHGYVIRKGIDSDVYIGSS 282

Query: 271 LLDMYARWGHMREAEAIFNGLAAKNVVSWNALIAGHARKGEGDHAIRLFWQMLRCDFEPT 330
           L+DMYA+   + ++E +F+ L  ++ +SWN+L+AG+ + G  + A+RLF QM+    +P 
Sbjct: 283 LVDMYAKSARIEDSERVFSRLYCRDGISWNSLVAGYVQNGRYNEALRLFRQMVTAKVKPG 342

Query: 331 HFTYSSVFTACASCGSLEQGKWVHAHVIKSGGQPIAYIGNTLIDMYAKSGSIKDAKKVFQ 390
              +SSV  ACA   +L  GK +H +V++ G     +I + L+DMY+K G+IK A+K+F 
Sbjct: 343 AVAFSSVIPACAHLATLHLGKQLHGYVLRGGFGSNIFIASALVDMYSKCGNIKAARKIFD 402

Query: 391 RLVKQDVVSWNSIISGYAQHGLGVEALELFEEMLKAKVQPNEITFLSVLTACSHSGLLDE 450
           R+   D VSW +II G+A HG G EA+ LFEEM +  V+PN++ F++VLTACSH GL+DE
Sbjct: 403 RMNVLDEVSWTAIIMGHALHGHGHEAVSLFEEMKRQGVKPNQVAFVAVLTACSHVGLVDE 462

Query: 451 GQYYFELMKK-YGIEPQVAHYVTVVDLLGRAGRLNEANKFLIEMPIEPTAAVWGALLGAC 510
              YF  M K YG+  ++ HY  V DLLGRAG+L EA  F+ +M +EPT +VW  LL +C
Sbjct: 463 AWGYFNSMTKVYGLNQELEHYAAVADLLGRAGKLEEAYNFISKMCVEPTGSVWSTLLSSC 522

Query: 511 RMHKNMDLGVYAAERIFELDPHDSGPHVLLSNIYASAGRLNDAANVRKMMKQSGVKKEPA 570
            +HKN++L    AE+IF +D  + G +VL+ N+YAS GR  + A +R  M++ G++K+PA
Sbjct: 523 SVHKNLELAEKVAEKIFTVDSENMGAYVLMCNMYASNGRWKEMAKLRLRMRKKGLRKKPA 582

Query: 571 CSWIEIENEVHTFVANDDSHPMREEIQKMWEKISGKIKEIGYVPDTSHVLFFMDQQDREV 630
           CSWIE++N+ H FV+ D SHP  ++I +  + +  ++++ GYV DTS VL  +D++ +  
Sbjct: 583 CSWIEMKNKTHGFVSGDRSHPSMDKINEFLKAVMEQMEKEGYVADTSGVLHDVDEEHKRE 642

Query: 631 KLQYHSEKLALAFSVLKTPPGLTIRIKKNIRICGDCHSAFKFASKVLGREIIVRDTNRFH 667
            L  HSE+LA+AF ++ T PG TIR+ KNIRIC DCH A KF SK+  REIIVRD +RFH
Sbjct: 643 LLFGHSERLAVAFGIINTEPGTTIRVTKNIRICTDCHVAIKFISKITEREIIVRDNSRFH 702

BLAST of Tan0004354 vs. TAIR 10
Match: AT1G11290.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 511.9 bits (1317), Expect = 7.6e-145
Identity = 253/580 (43.62%), Positives = 366/580 (63.10%), Query Frame = 0

Query: 88  SLEPDRTLYGKMLNKCTNLRKLKQGRAIHAHIQGSMFENDLVLQNFILNMYAKCGSLEEA 147
           +L+P       +L   + LR +  G+ IH +   S F++ + +   +++MYAKCGSLE A
Sbjct: 231 NLKPSFITIVSVLPAVSALRLISVGKEIHGYAMRSGFDSLVNISTALVDMYAKCGSLETA 290

Query: 148 QNMFDKMPTRDMVSWTVLISGYSQSDRATEALALFPQMLHQGFQPNEFTLSSLLKASGAG 207
           + +FD M  R++VSW  +I  Y Q++   EA+ +F +ML +G +P + ++   L A    
Sbjct: 291 RQLFDGMLERNVVSWNSMIDAYVQNENPKEAMLIFQKMLDEGVKPTDVSVMGALHACADL 350

Query: 208 CSDDLGRQLHAFSLKYGYDMNVHVGSSLLDMYARWGHMREAEAIFNGLAAKNVVSWNALI 267
              + GR +H  S++ G D NV V +SL+ MY +   +  A ++F  L ++ +VSWNA+I
Sbjct: 351 GDLERGRFIHKLSVELGLDRNVSVVNSLISMYCKCKEVDTAASMFGKLQSRTLVSWNAMI 410

Query: 268 AGHARKGEGDHAIRLFWQMLRCDFEPTHFTYSSVFTACASCGSLEQGKWVHAHVIKSGGQ 327
            G A+ G    A+  F QM     +P  FTY SV TA A        KW+H  V++S   
Sbjct: 411 LGFAQNGRPIDALNYFSQMRSRTVKPDTFTYVSVITAIAELSITHHAKWIHGVVMRSCLD 470

Query: 328 PIAYIGNTLIDMYAKSGSIKDAKKVFQRLVKQDVVSWNSIISGYAQHGLGVEALELFEEM 387
              ++   L+DMYAK G+I  A+ +F  + ++ V +WN++I GY  HG G  ALELFEEM
Sbjct: 471 KNVFVTTALVDMYAKCGAIMIARLIFDMMSERHVTTWNAMIDGYGTHGFGKAALELFEEM 530

Query: 388 LKAKVQPNEITFLSVLTACSHSGLLDEGQYYFELMKK-YGIEPQVAHYVTVVDLLGRAGR 447
            K  ++PN +TFLSV++ACSHSGL++ G   F +MK+ Y IE  + HY  +VDLLGRAGR
Sbjct: 531 QKGTIKPNGVTFLSVISACSHSGLVEAGLKCFYMMKENYSIELSMDHYGAMVDLLGRAGR 590

Query: 448 LNEANKFLIEMPIEPTAAVWGALLGACRMHKNMDLGVYAAERIFELDPHDSGPHVLLSNI 507
           LNEA  F+++MP++P   V+GA+LGAC++HKN++    AAER+FEL+P D G HVLL+NI
Sbjct: 591 LNEAWDFIMQMPVKPAVNVYGAMLGACQIHKNVNFAEKAAERLFELNPDDGGYHVLLANI 650

Query: 508 YASAGRLNDAANVRKMMKQSGVKKEPACSWIEIENEVHTFVANDDSHPMREEIQKMWEKI 567
           Y +A        VR  M + G++K P CS +EI+NEVH+F +   +HP  ++I    EK+
Sbjct: 651 YRAASMWEKVGQVRVSMLRQGLRKTPGCSMVEIKNEVHSFFSGSTAHPDSKKIYAFLEKL 710

Query: 568 SGKIKEIGYVPDTSHVLFFMDQQDREVKLQYHSEKLALAFSVLKTPPGLTIRIKKNIRIC 627
              IKE GYVPDT+ VL  ++   +E  L  HSEKLA++F +L T  G TI ++KN+R+C
Sbjct: 711 ICHIKEAGYVPDTNLVL-GVENDVKEQLLSTHSEKLAISFGLLNTTAGTTIHVRKNLRVC 770

Query: 628 GDCHSAFKFASKVLGREIIVRDTNRFHHFLDGLCSCRDYW 667
            DCH+A K+ S V GREI+VRD  RFHHF +G CSC DYW
Sbjct: 771 ADCHNATKYISLVTGREIVVRDMQRFHHFKNGACSCGDYW 809

BLAST of Tan0004354 vs. TAIR 10
Match: AT2G03880.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 504.6 bits (1298), Expect = 1.2e-142
Identity = 247/587 (42.08%), Positives = 365/587 (62.18%), Query Frame = 0

Query: 81  LDLINHGSLEPDRTLYGKMLNKCTNLRKLKQGRAIHAHIQGSMFENDLVLQNFILNMYAK 140
           +D +    L  D   Y +++  C + R + +G  I  H+  +     + L N ++NMY K
Sbjct: 49  MDSLQSHGLWADSATYSELIKCCISNRAVHEGNLICRHLYFNGHRPMMFLVNVLINMYVK 108

Query: 141 CGSLEEAQNMFDKMPTRDMVSWTVLISGYSQSDRATEALALFPQMLHQGFQPNEFTLSSL 200
              L +A  +FD+MP R+++SWT +IS YS+     +AL L   ML    +PN +T SS+
Sbjct: 109 FNLLNDAHQLFDQMPQRNVISWTTMISAYSKCKIHQKALELLVLMLRDNVRPNVYTYSSV 168

Query: 201 LKASGAGCSDDLGRQLHAFSLKYGYDMNVHVGSSLLDMYARWGHMREAEAIFNGLAAKNV 260
           L++   G SD   R LH   +K G + +V V S+L+D++A+ G   +A ++F+ +   + 
Sbjct: 169 LRSCN-GMSD--VRMLHCGIIKEGLESDVFVRSALIDVFAKLGEPEDALSVFDEMVTGDA 228

Query: 261 VSWNALIAGHARKGEGDHAIRLFWQMLRCDFEPTHFTYSSVFTACASCGSLEQGKWVHAH 320
           + WN++I G A+    D A+ LF +M R  F     T +SV  AC     LE G   H H
Sbjct: 229 IVWNSIIGGFAQNSRSDVALELFKRMKRAGFIAEQATLTSVLRACTGLALLELGMQAHVH 288

Query: 321 VIKSGGQPIAYIGNTLIDMYAKSGSIKDAKKVFQRLVKQDVVSWNSIISGYAQHGLGVEA 380
           ++K     I  + N L+DMY K GS++DA +VF ++ ++DV++W+++ISG AQ+G   EA
Sbjct: 289 IVKYDQDLI--LNNALVDMYCKCGSLEDALRVFNQMKERDVITWSTMISGLAQNGYSQEA 348

Query: 381 LELFEEMLKAKVQPNEITFLSVLTACSHSGLLDEGQYYFELMKK-YGIEPQVAHYVTVVD 440
           L+LFE M  +  +PN IT + VL ACSH+GLL++G YYF  MKK YGI+P   HY  ++D
Sbjct: 349 LKLFERMKSSGTKPNYITIVGVLFACSHAGLLEDGWYYFRSMKKLYGIDPVREHYGCMID 408

Query: 441 LLGRAGRLNEANKFLIEMPIEPTAAVWGALLGACRMHKNMDLGVYAAERIFELDPHDSGP 500
           LLG+AG+L++A K L EM  EP A  W  LLGACR+ +NM L  YAA+++  LDP D+G 
Sbjct: 409 LLGKAGKLDDAVKLLNEMECEPDAVTWRTLLGACRVQRNMVLAEYAAKKVIALDPEDAGT 468

Query: 501 HVLLSNIYASAGRLNDAANVRKMMKQSGVKKEPACSWIEIENEVHTFVANDDSHPMREEI 560
           + LLSNIYA++ + +    +R  M+  G+KKEP CSWIE+  ++H F+  D+SHP   E+
Sbjct: 469 YTLLSNIYANSQKWDSVEEIRTRMRDRGIKKEPGCSWIEVNKQIHAFIIGDNSHPQIVEV 528

Query: 561 QKMWEKISGKIKEIGYVPDTSHVLFFMDQQDREVKLQYHSEKLALAFSVLKTPPGLTIRI 620
            K   ++  ++  IGYVP+T+ VL  ++ +  E  L++HSEKLALAF ++  P    IRI
Sbjct: 529 SKKLNQLIHRLTGIGYVPETNFVLQDLEGEQMEDSLRHHSEKLALAFGLMTLPIEKVIRI 588

Query: 621 KKNIRICGDCHSAFKFASKVLGREIIVRDTNRFHHFLDGLCSCRDYW 667
           +KN+RICGDCH   K ASK+  R I++RD  R+HHF DG CSC DYW
Sbjct: 589 RKNLRICGDCHVFCKLASKLEIRSIVIRDPIRYHHFQDGKCSCGDYW 630

The following BLAST results are available for this feature:
Match Name  E-value   Identity  Description
Q9LIQ7      6.3e-237  63.81     Pentatricopeptide repeat-containing protein At3g24000, mitochondrial OS=Arabidop...
Q9LIC3      1.1e-148  42.86     Putative pentatricopeptide repeat-containing protein At3g13770, mitochondrial OS...
Q9LW63      7.4e-145  41.60     Putative pentatricopeptide repeat-containing protein At3g23330 OS=Arabidopsis th...
Q3E6Q1      1.1e-143  43.62     Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidop...
Q9SI53      1.7e-141  42.08     Pentatricopeptide repeat-containing protein At2g03880, mitochondrial OS=Arabidop...

Match Name      E-value  Identity  Description
XP_022973115.1  0.0e+00  88.64     pentatricopeptide repeat-containing protein At3g24000, mitochondrial [Cucurbita ...
XP_022142695.1  0.0e+00  88.50     pentatricopeptide repeat-containing protein At3g24000, mitochondrial [Momordica ...
XP_038893938.1  0.0e+00  89.22     pentatricopeptide repeat-containing protein At3g24000, mitochondrial isoform X1 ...
XP_023520189.1  0.0e+00  87.19     pentatricopeptide repeat-containing protein At3g24000, mitochondrial [Cucurbita ...
KAG6583676.1    0.0e+00  87.91     Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita a...

Match Name  E-value  Identity  Description
A0A6J1IAI9  0.0e+00  88.64     pentatricopeptide repeat-containing protein At3g24000, mitochondrial OS=Cucurbit...
A0A6J1CNX3  0.0e+00  88.50     pentatricopeptide repeat-containing protein At3g24000, mitochondrial OS=Momordic...
A0A6J1EHG0  0.0e+00  87.76     pentatricopeptide repeat-containing protein At3g24000, mitochondrial OS=Cucurbit...
A0A5D3CU56  0.0e+00  89.04     Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946...
A0A1S3CMN0  0.0e+00  89.04     pentatricopeptide repeat-containing protein At3g24000, mitochondrial isoform X1 ...

Match Name   E-value   Identity  Description
AT3G24000.1  4.8e-232  63.61     Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G13770.1  7.9e-150  42.86     Pentatricopeptide repeat (PPR) superfamily protein
AT3G23330.1  5.3e-146  41.60     Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G11290.1  7.6e-145  43.62     Pentatricopeptide repeat (PPR) superfamily protein
AT2G03880.1  1.2e-142  42.08     Pentatricopeptide repeat (PPR) superfamily protein

InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR Term  IPR Description  Source  Source Term  Source Description  Alignment
IPR002885  Pentatricopeptide repeat  TIGRFAM  TIGR00756  TIGR00756
    coord: 362..396  e-value: 6.3E-9  score: 33.5
    coord: 160..194  e-value: 1.4E-5  score: 23.0
    coord: 334..362  e-value: 1.3E-4  score: 20.0
    coord: 261..294  e-value: 9.5E-6  score: 23.5
    coord: 397..430  e-value: 0.0018  score: 16.3
IPR002885  Pentatricopeptide repeat  PFAM  PF01535  PPR
    coord: 132..157  e-value: 0.0035  score: 17.5
    coord: 334..359  e-value: 0.0011  score: 19.0
    coord: 233..254  e-value: 0.11  score: 12.8
    coord: 505..528  e-value: 1.4  score: 9.3
    coord: 433..457  e-value: 0.23  score: 11.8
IPR002885  Pentatricopeptide repeat  PFAM  PF13041  PPR_2
    coord: 258..306  e-value: 5.2E-13  score: 49.0
    coord: 360..407  e-value: 1.2E-13  score: 51.0
    coord: 158..203  e-value: 1.4E-9  score: 38.0
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 158..192  score: 11.816339
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 127..157  score: 8.681407
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 360..394  score: 13.011121
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 294..328  score: 8.560833
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 395..429  score: 10.490022
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 259..293  score: 11.695765
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D  1.25.40.10  Tetratricopeptide repeat domain
    coord: 80..207  e-value: 5.6E-25  score: 89.7
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D  1.25.40.10  Tetratricopeptide repeat domain
    coord: 334..572  e-value: 9.7E-45  score: 155.2
    coord: 208..328  e-value: 8.1E-23  score: 83.3
IPR011990  Tetratricopeptide-like helical domain superfamily  SUPERFAMILY  48452  TPR-like
    coord: 268..519
IPR032867  DYW domain  PFAM  PF14432  DYW_deaminase
    coord: 532..655  e-value: 1.1E-39  score: 135.1
None  No IPR available  PANTHER  PTHR47926:SF184  BNACNNG64210D PROTEIN
    coord: 70..644
None  No IPR available  PANTHER  PTHR47926  PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN
    coord: 70..644
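
The PROSITE PS51375 hits above are listed out of order and several of them abut one another, so it can help to collapse them into contiguous blocks before reading the repeat architecture. The sketch below does this in Python with the six coordinate pairs copied from the table. Treating directly adjacent hits (for example 127..157 followed by 158..192) as one block is an illustrative choice, not something the InterPro report itself asserts, and `merge_intervals` is a hypothetical helper name.

```python
def merge_intervals(intervals):
    """Merge 1-based inclusive intervals that overlap or directly abut."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1] + 1:
            # Extends the previous block (overlapping or adjacent).
            merged[-1][1] = max(merged[-1][1], end)
        else:
            merged.append([start, end])
    return [tuple(block) for block in merged]

# PROSITE PS51375 (PPR) coordinates as reported in the table above.
prosite_ppr = [(158, 192), (127, 157), (360, 394), (294, 328), (395, 429), (259, 293)]

blocks = merge_intervals(prosite_ppr)
covered = sum(end - start + 1 for start, end in blocks)

print(blocks)   # [(127, 192), (259, 328), (360, 429)]
print(covered)  # 206
```

Under that adjacency assumption the six hits form three blocks of two repeats each, covering 206 residues in total.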

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
Tan0004354.2  Tan0004354.2  mRNA
Tan0004354.1  Tan0004354.1  mRNA
Tan0004354.3  Tan0004354.3  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
molecular_function  GO:0005515      protein binding
molecular_function  GO:0008270      zinc ion binding