HG10001548 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
NameHG10001548
Typegene
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Descriptionpentatricopeptide repeat-containing protein At2g30100, chloroplastic
LocationChr09: 18036758 .. 18049493 (+)
RNA-Seq ExpressionHG10001548
SyntenyHG10001548
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGAGTTTAACGGCTTTATTATTCGAAGCTATGAAGAGAGTCGATTATCAGATAAAGTTCAAGTTATGGATCTTGAACGACGATGTGAAATTGGTCAATCAAAACGTGTGTTTCTCTTTACTGACACTTTAGATGACCCTATTTGTAGGATACGTAACAGTCCCATGTATAAAATGTTGGTAATTTGATTTGATTTGATTTAATTAATTGTGTTTTTTTATGTTGCTTAATTAATTATTAATTTGTGAATTAATTTTTTTTTTTTTTTTGTATTATTTAGGTTGCTGAGTGGAACAAGGAAGTGGTTGGTGTTATTCAAGGCTCTATAAAAGCGGTTTTTTTTACTGCTCATAAACCACCGACCGGTTTGGTGGTTAAAATGGGCTACATTCTTGGCCTGAGAGTGGCGCCGCGGTATCGCCGTCGTGGGATTGGCTCCGGCCTCGTCCGCCGTTTGGAAGATTGGTTCGTTTCTAATGATGTTGATTACTGTTGCATGGCCACTGAGAAAGATAATCACGCCTCTCTTAATCTCTTCATCAATAATTTGAGGTATTTTTCATTTTTAATTATTTTTATTTTCTAATTCTCATTTTTTAAGTTAATAATTTCGAGTTCGGAGAGTCGAAGATCGAACCATAATCTTTAAAATGTTAATTAGTATCATTTACCTAATGTGTTATATATACTCGGATTAGCTATTAATTCTATCTTAAAATGTAGTTTTTGTCGAGTATATATTTTGAGAACTATCAAATTTGTATGTAATTAGTTTTTTTTTTTTTTTAATTACAAACTAAACACATGATAATATTATTATTTGGTGGAATCTTTTTGTTTTTTTTAGCGAGATTTATTAGATATAAAATTAAAAATTTTAAGGGTCAAATCAAACTATTTCGAGAATGATTCTTGAAATGGTTACAATCATTTTTTAAAATAAATTCATTTAAAAAAGTAGTTAATTAATGTTTAACTTTACACTTTTAAACATGATTTACGTATGATTTTGAATGATCAAATTCATGTTTAACTCAAAGTGGTTTGAAATATGAAAAAAAAGACTCTTATACTCGTAACTTGAGTTAATTAGCAAGATCTTCATATGCAACTCAACTGTAAAATTCGAAAGTTTAGGGGCATATTAGAAATTTATTGAAACATTTTTTCTCCCAACAGGTACATAAAGTTTAGAACAGGAAGAATCCTAGTAAACCCAGTAAGAAATCATCCATACAATATCAATTCATCAGAAATCAACATTCAAAAGCTAAAAATAGAAGAAGCAGAAGCAATATACAAAAAACACATGGCTTCAACAGAGTTCTTCCCCAAAGACATAAAAAACATATTGAAAAACAAGTTGAGTTTAGGGACATGGGTGGCAAATTTCAAACAACGACGGTTATCGTCGACAGTCGCCGGAGGAAACGAGCAGATTACGGCGAGTAGTTGGGCCATTGTAAGTCTATGGAACAGTGGGGAAGTTTTCAAGCTAAGGCTAGGAAAAGCACCATTTCCATGGCTTATTTACACAAAGAGTTTAAAAATGATGGATAAAATTTTGCCTTGCTTTAAAGTGATTTTGGTGCCTAATTATTTCAAGCCATTTGGGTTCTATTTTGTTTATGGATTGCACCATGAAGGCCCTTTTTCTGAGAGATTGGTTGGAGCTTTGTGCAAATTTGTGCACAATATGGCTTTGAACAATTCAAAGGATCATAATTGTAAAGCTATTGTTACTGAGATTAGTGGTGATGAAGATGATGATGAGCTGAAAATGAAGATTCCCCATTGGAAATTGCTATCATGTTATGAAGATTTTTGGTGCATAAAGTCCTTGAAAAGTAAGAGAAATAATAATAATATTATTATTAGTAATGATAATGATAATGAGCATCATATATTGGAATGGAAAAATACCCCACCTATTAGAACTCTCTTTGTAGACCCAAGAGAGGTATAAAAGAATAAAAGAAAAAATAGGTTTAACCCTTTCTCAACACGTCTCAATCTAATTATTGTTCGTACTTGAGAGCAGAATGAAGAAAGTAGCTAGAAGAAAAGTTATTATAAAGAAGAAGAAGAAGATGACGACGACTTCTGAACTGAAATAGTTATGTTGTAAAATTTTGTCTTTTTCTGTACGTGGGTGTTTCTGAATTTAACTAGAGTCTATTCGTACTATTGTATTCATGACTAAATATGAATGAAAAGAAATATCAACGTCGTTTTCTATTAATAGTATGTACTATTGCTTTCGAAACCCTAATAATTAATAATCAAACGTTAATGTTAATTCTTTTAATGTTGATTGGGAATGTCTAGATTTTGCATCTTATGATTCCCTTTTTGTTTACGTGTCAGTGAAGAGAAAAATCTTTAGGCATTCTTAATTTAGATGTCGCGCGCCCATTCTTTAGCTTTATAAAGATTCAAAGTTAAAAAGCAATGAGAGGAGAGACATTATTTAGGGAATATTCTCATCACATGTGCTATATGCGAAGGCAAATAATAAGATTGAAAAGTAGGAAGATATATATATAAATAGATATTAGGTTAAATATTAAGTGTTGAGCTTAAAAAAGGTGAGAATTTGGAATTATTAGGGTTTAAAGGAAGGGCAGATAGGGGAAGGGCAAAGGGCAAAGAGGGGAGAAATTTGAGAACAAAATAAAAAGAGAGGAGATTTTTGGAATATATTATTATTAGGGCTTTTAAAAAACAAAGCCATGAATGAGTTAGGCCAAAATTTTATTCCTCCTAAAATGGAAATATGTCTTTGCAATTTATGGAGAATGTGAAGTTGATTTGAAGTGTTTATTTGTTTATGAAATTTTGGTGGTTAATTAATTATTCACATATATAAGTTTTCCATTTTTTGTGTTAATTTTTCAAATGGCTGCTGTTTCACTTCCAACTTTGTGTTTTTCACATTCTTTGAAAACAAACTTTTCTTCCCTCTTGGACCACTTGAAAGAGAATTCATTCTTGTGAGAAAAATGTATTTGGATTATATTAACGATGTTCACGAGACGAAGCGGGAATGAGAACGGGGACCTCTTTTTCATTTCTCATCCCATCCTTTTAAATTTTACAGATGTGTTAGGGGTGATATTAGATCGTATCTGAGTTATTCCATCTCATGTATCTTATTATGAAGTGATCTCTCGAATTTTTTCAAAGTAGGATTCATCAACATGCCCCGCGAGATATGGGTGTACATGGGTCAGGTTGGGTTGGATAGGGTTAAATGTGTTTTTTTGGATTAAGCTGAAAATTCAGATGGGTTGGGTTGACAACCAAAAAGACTCAAATAAAGTCTCTAACCCAACCCAACCCTACCCTTAAAATTCAAGTTGGATTGGGTTGTCAAGTTATAACTTTTTTTTTTTTTTTAATTTAAAACGTAAATATATCAACAACATATAATGATTAATAACTAAAATCTCATAAAATTCAAATATTAAATATCAAATATCTATGCTATTTATTAATTATTACAAACTTTAAACAAAGACAAATATCATGATAATTCTAAAAGAAAAATATTAAAGTAATTCACAAATTCATCCATTCAGAGTAATCAAACAATTATAAAAATAATTATGCAAAATATCAGCCAAAATAATCAATAAATTTTAAGTCTAATAAAAGAAAATACTACATTATTATATTGAAAACAAGATTTAAACAAAGATTTGCATGTTTTGAAACAACTCCAACATCATACCCAAAGTGTGAAGCACTACTTAATTTCCATTGTTTCTTCCATATTGTAATTCTTAACGACGTTATTTTGTGTAGACAATACATTTGTTAACAATGTAGCAACGTCATTATACGTAGAGAATATATTTGCTTTATTTACACTGCCAATCTTATAATGGTCAGCGTTTATGAATATATTGTTGGTTGTACTTTAGAAGTGAAGAATAAAAATCACAAATTATTCTTATTATTGCTAAGCCTTATTATAATTCCTGTACATATTCAAACTATTAATTAATACTTACTTATTATTGCAAAATTTAAGTTGACCCCTAATACAACCAGTCAAATGTTGTAGTGTTGATTTCGAGATTTTACTTGAGTCATCTCAAGTTAAGATATTGATCATGTTTTTCTTCACCTCCCATCGTATACTATAAAACGGATTAGAACAACGATTTTACCTTTAAAAGTAAGAAAAACGCTTATGTTGATGAAAATGCATGGATGATCAAATAAAAAAAAGGAGACTAACCAAAAATAATAGAAGATAATATTTTTTTTAAAAAAAAAAAAAAAAAACATGTCACATACCTTTGTATTGCATGTTAGGGTAATTTGTAGCTTTGTTTTGTGTGGACCAAAACAGTAAAAATTAAGATTTGAAGATTAAACAATGGTTTGAACAAACAAAAAATTATTTTATAAATTTTATGTCGATAATAATTGAATTTTAAAATTGATTTTTATGTCCGTAAATCGAGGAGAGAAAACATGGAAGTTAACAAAATCTACGGTTTAAAAATAAAACATCTAATTGGAAATATAACTTAAAAATAAAAAGTCAAAAATCACTTTTAAAATTTTAAAAATTTTAAAAATAACCTTATAACTTTAAAAAAAAAAAAGAGTTCTAATAGTTTTACATAAAAATTATTAGTGTTTTGGGTCAAAAATGCTTTAAAATTTTTAAAAGTTTCAACATTTTCTTTAAACTTAAAAAAGAATTAAATTTTAAAAAAATACCTATAAGTATACAAACAGAAGTAGTAAATATTCTCTTACTTCTCTATTTTTCTCTTTTTTTTCTCTTTTTTTTTTCTTCCATCTCTTCTTCAATACTCTACTACTTCCCTTTTTGTTAATTATTTGACATTCGTTTATATCAAATAAATAAATAAAATGTTCACTTCAATTGAAGGTATACTGTAGACTATAAAGAGAAGAAAAAGAATAATGGAAATACTCAGAGATCTTAAGACTTAAATATTTTGACAAGATTGTACATTTTTATTTTATTTTTGTATTTGTATATTTATAGTTTTTGTATATTTTAAGGGGTTGTTTTCAAATTTCGAAAAATAAGTCAAACTATTTACACATATAGAAAAATTTTATTTTCTATCAGTGATGGATTGCGATAGAATTCTATCGCTTGAGCGATAGATAACAATAGAAATCTATTGCGTTCTATCGCTGATAAACAATGAAATTTTTCTATATTCGTAAATAATTTGATCTTTTTTCTGTTCATAGTAATTTCTCTATTTTAAGGGGAGATTTTGATTTTGGAATTTTCTGAGATTTGATGTTTTAACTGTTTCTATTTTGGTTGTTGAAAAAATTGGTGTAAGAATTGGACAAATAGGAGAAAAATGAGAGTATTCATATTAAAAATATGAGAATATATGTGTGTGTATATATAATTTGTTTTAAATTTAATATTTTTGGACTCAAGAAAACTTTAAAACAAATTGATCACTATACTAACTGCATGAGCATTTTTAAACTTTTTTTAAAAAAATATTAAAGTTATTTTTGCAACGAGGTATTAACATTTTCTATTTATTAACTTAAAGTAAGGGTGTTTTTTAATTTTTTTTTTTTTTTTTTTTTAAGTTTAATGGTATTTTTCATACTTTTGAAACATCATGGGCATTTTTGTAACAAAACAGTAACAATTTTCATTCAAAATTAATGGTGACGTAATTTTGAAATCTCTTTAAAAGTTTGATGATATTTTTGAAACTTTTAAAATTTCAATGATATTTTTTTCACAAAGGGAAAATTACCTTTTTAGTCCCCAAGTTTTGAACAATATGTGCGTTTGGTCCCTAAGTTTTAAAAATGTACCTTTTTAGTCTCTTAGTTTATAAAAATAGGTCCATTTGGTCCCTGAGTTTTCAAAATATACCTTTTCAGTCCCTAAGTTTTTAAAAATAGGTCTAAGAGATCCCTCAAATAATTTTTTATTATTAATTTAAATAATTATATGACATTTTACATTATAAGAATAATTTTTAAAAACCATTTCGTCTCTAAAACTATTTTTTCTAATTTTAAATATCATATAATTATTTTTAAAAAATTAAAGAAAAAAAATACTCGGATGACCTTTTAGACCTATTTTTAAAAACTCAGGGACTGAAAAGGTATATTTTGAAAACTCAGGGACCAAATGGACCTATTCTTATAAACTGAGGGACTAAAATGGTATATTTTTAAAACTTAGGGACCAAACGCACATATTGTTCAAAACTTGAGGACTAAAAAGGTAATTTTCCCTTTCACAAATTATAAAGTTTAAAAATATTTTTTATAATTTAGGCTAAAGATTACTATTTAAAAGTTGGAAACATCAAAACTAGCCACCTTAAATTAGATTTTAAAAGTTAAAACAAAATAAAATTAGGAAATGACATAATAGCAGGAAGTGTGTCTTGCAACAGAAGAGTAAAAAATGAAAATATTTCATTGGTTTTTATTTTTATTTATTTTTCAAAATCAAGGTATATTCACTCATTCCATTCAATTAATAGTGTTGGATTTTTAAGGAGAGTTCGGGATGTTAGGTTGAGTTATGAAGTCTCGAGTTAATATATTAGTATATAGAAAGTCTGTGTTTAGGATGCAAAGTTATAAAATAAAATTTTTTGATAAATATGTAAAATAGAAAAATAAAAAATAAAAAAAGATAAAATTGCTCGATGAATATTCAAAGTTGAGTTATTTGAAGTTGGTGGACCAAAAGAGCGGCCCAGACCACTTCAGCAGTTGCATCTACCATCCAGCCCACATAGGGGCAGCCCGGCCCAAGTTTGGTAAAGGCATAGGCCTCGGCCCAAATTGTCAGTTGAATGGGCTGCGATTTACAATTTACAAGCCCAAGTGAAATTGCCATATAAGGAAATTAATTACTATTTTACTTTATTGTGGTTACTAGGGTCAGAATTCATTAAAAAAAAAAAAAATACTAATTTTACCTTTAAACTTTCATTTTCTTTTGTTAATTTTGGTACATGTAACTCCAAAAGTTTTATACTGCTCTTCATACTTTTAGGGCATGTTTGAAAGGTTGTCATTGTCAAAATCATCGTGAAATATGTTTTTATTCATTCAAAACTAATTTTAATTATTGAAATTTGCATTTAAGAGTATAAAATTAAAAATTAAATTAATTTTGAGTAATTAAAGTGTATTTTTGAGTAATTTTAGAAATGACAAAAGTGTTAAAAATCACTCCTAAATATATCCTTATATATGGTAGTTCTAGCACTTTTTTAAAAGGTTTGTATGATCCTATAATCTGAAAAATAATTATTTTTGTCCTCTATTTGCAAATTTTTAATACATGAGTTGTACATCATCAAAGCCCTTGGTTATAATTACATTTCAAATTTAATCGGTTGTAGTATTATGCTGAAATTAAAACCAAAAACATGAGTTTAAAAAAATAAATAAAAAAAAAACAAAACGTTTCACTTTTTTTCTTTTTTTCTTTTAAAATGAGGTTAAAAAGAATGTTTAGTGTAAGATTAAAATATACTAAACATAAAATAAAATGGATTAAAGTAGACGTTTTTAAAATATGAGAACTAAAATGAACCTAACTTGAAAGTAGTGAATCTTCTACATTCCACTATAATTTTAATTTTTTTTTAAAAAAAATTGTTCTAGAATAAATCTGTAATACATTCGAACGTCTTGTTTCTTGGTCAAGAATATACCTTAACCAATTGAGTTGTGTTCAGATTATTCATTAAAATGAAAAGAGAAAAAAAAGGAAAAAGAAGAAGGATAGTATCCTCATAATAATGCACATAATTTAAACTTTAAAAAATATCAAATTTTTTGGTTTAGAAGTTTTTTTTAGGCCAATTTGTGACCATTAGATCTTTCACATTATTCAAGATCAATTTGGTGTCATTGGATTTGCTTGATGGCTAAGTTTCTTTTAGTTGTAACTTATAAGTAGAGAAATTTTATCTTATAAGAGAGTTTGTTCTTCCTCTATAAGAAGGAAAAATTGTAACACCTATTTTATATAGTGAGATTCTTCTATCATTATCCGTGACTTTTACCTTAATTTGTTTAAAGGATTTCCACGTAAAATTGTTGAGTGTGTGTGTGTTTTTTTTTAAAGAAAATCATTGTGTTATTGTTTCTCGAACTACTTCTTCATTATTATTTTCTCAATTTATTATAATAACAATCCTACTCTATTCGATCTGAATCTCTTTTCAACAAAGTCGAGGTGGTCAACTAACCTACTTTGTATTAAATAAATAAATAAATAAAGTAATAGGTGGATTACTTCAATCCAAATAAATAATAATGACAGAATGAAATATGTTATGTATTCTGATTTTTCATTTTTCAATCTAAATAAATTAATGAGAGTTGTGATTCTCATTTTTCATTTTCAATTGGTCAAATATGTTATGTATTTCACGTTCCAACTCATGCAAAGCTGAATGTTTTTAGCATAATTAAAGATAGCTTATAATAATTGGCCTAACAAACTTTATATTAAAATAATAATAATAAAAAATAAAATAAAAACATTTCTTTGCCCACGTGGTAATAAAATAAATATATTTGGTACTTTGTCTTGATTCATCCTACTCCAACCATTTTTTGGGAATAGTAATTTTTTAATAAAATTTAAAATTTAAAATTTAAGAAGGGCTGAAAATAATTAAAATAAAATAAACTAATTGACAAACGTTCAACTTATATGAAGTTTTTTTTTTAATTGAGAGAGAGAGTAATATGGTCCATAATGAGTTGTCAATATAGATATTTTTAGTAAATTAAGGATTAATTAAATATTTTAAGCTTTTAATTTAATTGTAGTTTACTTAATCAACAACTAGTTTGGTTGGGGCTAATTCTAAGTATAATTGAGGCAAAGTCAGCCTATAATATATGAATGTGTTAACGATCAAGAGAATTGTGGTTCAAATTCTTCTAAGCAAAAGAAAAAAATAAGTATGGATGAAGACCAATTACTCTGGTCGAAATTTCAACCTTCCACTTTACGATTATGAAACTAAAAAAAATGAAAAAAAAAATTATTTGCATGGTTCTGATTAGAGAAAAGATTGATAATCATTTTGTTTTTTGTTTTATGTTTTTGAAATATTCAATGTTGGTTCTCTCCAAATTTACTTACTATGATTTTCACTTTTTTGTTTTTTTTTTTAAAATATTTGAACCAAAAAAAGAAAAAAGCTTTTAAAAACTATTTTTTTTTTAGTTTTCAAAATTTCGCTTGATTTTGAAAATATTTGTAAAAATTAAACAACATAACATATGCATTTAGAAGTGAAAATAGAGTTTAGAAGTTTCTTTTTTTAAAAAACTAAAGCCCCGTTTGGTAATGATTTTGTTTAGTTTTCTAATTTTAACTGTATGCTTATTTCTTCTCTAAATTTAACACTATAGTTTTCACATCTCTTGAAGAAACGTTTAGCTTGGATTTTGGAAACATTAGTGAGAAATTGATGATAAAACAAAAAAATTTATAAGTGAAATGAAAGTAGTGTTTATAAAATTAAGGTAAAAATATTTTTTTTTGTTTCTAGGTTTGGGGTCTAGTTTCCATTTCGTCTCTATGTTTTCGAATGTTACACTTTTAGTCTTTAAGTTTTGTGTTTGATTTCAATTTAGTCCCTATATTTTAAAATGTTATAATTTTACCCTTGACATTTGAGTTTCATTTCAATTTGATCTCTATGTTTCAAGATTTACACTTTTAACATCAATTTTTCACTAAATTATATACTTCATTAAGTTTTAATAACTTTTAATCACTATTAAAATTAATTTTAAATTTTCACTTCATAATTATTTTAAATTATTTTTTTACACTAGTTTTTTTTCCCTCAAAACCTATGACTAAAAAATTTTTTTTTACCTAAATTTAATTTTGAAAAAATTATCAAAAGGCCTATCTTAGGGGTAATTGTGTAATTTCGTTACATTCATTTTGCGTTAATTCGTTTTGGAAGTCTTCACTTCAAGAACACGCATTGAAAAGAAAAATTAAATTTTGGTATAAAACTCCAAAAGCTTTGTCTTGCAGTGGCGGCAAAAACGCTGCCATTGCGACCACTCACTCCGACTTCCACCGTCACCATCGCTCAATCCTCCGCCAGAAGATTAGGGTTTCTTTCTTCTGTTTTCCGAACATCAATTCACTTCAATTCAATTCAATTCACCGCTTCTTTAAAGGGGTTTCTTGCTGCCTCTCTGTATTTTTCTCTCTATCTCTTCCCGTCTTTCGTTTGTTTCATTTTCTGTTTCTTTGTTACTGTTATAACGAGTTCTGGTTGTGCTCAAAACTGATGCCGTGTAAACCCTAATCTTCATTTCTTGCTTTACTTCATTCGTTTATTCGAATTTTTTAGTTTGAAGTTAGTGATTTGGATTAGAATTACGAAATGATTTGTGCTCAGGGCTTTACTCCGTTAACCCAATTTGGGTTTTCATTTTCTTTATCTTCTGCTCTGAAAACTGAGAAACATGGGTTTTCTACTCCCCAATTGCATAGTCCTTCGCCGGTAAAGTTTTGCTTTATGGTTTCTCGTATTTCTTGCAACTATCAGGATTCTACTTTCTCTGTCTCGCGAGCTAGTAAGTTTCGGGACTTAAGGTTGTTCAAATCGGTTGAGTTGGATCAGTTCATCACGAGTGATGACGAAGATGAAATGGGAGATGGGTTTTTTGAGGCAATTGAGGAATTGGAAAGAATGACGAGGGAACCATCGGATGTTCTTGAAGAAATGAACGATCGCCTTTCGGCGAGGGAATTTCAGCTCGTGTTGGTGTATTTCTCTCAAGAAGGGAGAGATTCATGGTGTGCTCTTGAGGTTTTTGAGTGGCTCCAAAAGGAAAATCGGGTTGACAAGGAGACCATGGAGTTGATGGTGTCTATAATGTGTAGTTGGATCAAGAAGTTGGTCGAGGGACGACATAACGTCAGTGATGTTGTTGACCTTCTTGTGGATATGGATTGTGTAGGTTTGAAGCCTCATTTTAGCATGATAGAAAAGGTTATCTCTTTGTATTGGGAAATGGGTGAGAAGGAAAAGGCAATTTCGTTTGTGAAAGAGGTCTTGGGACGCAATCTTGCTTTTATGAAGGACGATTGGGAGGGACATAAAGGGGGACCAAGCGGTTATCTCGCATGGAAGATGATGGTAAGCCTTCAGCTAATCAGGATTTTTCGTATCTTTAGCCATTAACTTGCTTAAGAGTCAACAGTGGTATTCTTTTGCTGTGTCTATCTATTACATTGAAACTTCATATATGAAATGTAAATATCTTCAATCATTGTTTACTAGCTTTGAATGTCATATTTTGAATCCTGAAATTCATGATTGTAGAGATTCCTTGCTCTACCTTCTTTATTTTACTCCTCCTCTACTTTAATCATTTAGGTTGGTATCTCAATCTTATTGCACTTTTATTATTTGAGCATCAGGTGTTTGTTGTGTTGACTTTGGTCAATGTAGTAAAGGATATTTCTTTGGGGTTGTGGTTGTCTAATCACAATTGAAGGGGTTCTGGAGGAACTACGTAGGTTTTTATCAGAGAGCTAGACGAGCGTTCAGCACACTATGGATTGGTAGTACTTTTGGGATATTTGGTTAAGCTGTGTTTCAATGTGGTTAAGCATGTTTGAATCTATTGTTATGTTGGTGTTGTACATACTGAAGTTTATTTCTGCAGATGTAGGATTTAAACATGAATATCTTACGGGTTTTTTTCTCTAGAACTTTATTGTCATTAGTTTCATTTATTGGAAACTTTCGGGCAACTTTATTCATTGTCAATTTCTTTGGAGCTTACTCTCTCATTAGTGAAAGTATACATCTTTATGGTCTTTTCCAATTGTTATGTTTGTTCGGCATGACCCCTAGTGAAAATTCAATTGCTGATGATCTTGAAAATCAATTCTCTTAGGTTGATGGTGACTATAGGGGTGCAGTGAAAATGGTGCTGCATCTTAGAGAATCTGGATTAAAGCCAGAGGTTTACTGCTATCTTATCGCGATGACTGCTGTGGTTAAAGAGCTGAATGAATTTGCAAAAGCTCTACGGAAACTCAAAAGTTATGCAAGAGATGGGATAGTGGCTGAACTCGATAAAAACAACGTTGAACTTGTTGAGAAGTATCAGACAGAGCTTCTAGCTGATGGAGTACGGTTATCCAACTGGGTGCTTGAAGAGGGAAGCTTTTCGATTCATGGGGTGGTTCATGAGAGACTCCTTGCTATGTACATTTGTGCTGGGCAAGGACTCGAGGCAGAGAGACAGCTTTGGGAAATGAAGCTTGTAGGCAAGGAGGCCGATGCTGATCTCTACGATATCGTGCTAGCGATTTGTGCTTCACAGAAGGAGACAAAAGCAATGAAACGGTTGCTTACCAGGATTGAGATTACGAGTCCCATGCGTAAGAAGAAGAGTTTGACATGGCTACTAAGGGGTTACATAAAAGGAGGGCATTTCCGTGATGCTGCAGAAACATTAGTAAAAATGGTCGATTTGGGTTTTCTCCCAGAGTACTTGGACAGAGTAGCTGTGCTGCAAGGACTAAGAAAACAGATTCGGGAACCTGAAAATGTCGATACTTATCTCGATCTCCTCAAATGTCTCTCTGATGCCAATCTAATTGGACCTAGTCTTGTATATTTGCACTTACAGAAACACAAGCTTTGGGTCATTAAAATGCTTTGA

mRNA sequence

ATGGAGTTTAACGGCTTTATTATTCGAAGCTATGAAGAGAGTCGATTATCAGATAAAGTTCAAGTTATGGATCTTGAACGACGATGTGAAATTGGTCAATCAAAACGTGTGTTTCTCTTTACTGACACTTTAGATGACCCTATTTGTAGGATACGTAACAGTCCCATGTATAAAATGTTGGTTGCTGAGTGGAACAAGGAAGTGGTTGGTGTTATTCAAGGCTCTATAAAAGCGGTTTTTTTTACTGCTCATAAACCACCGACCGGTTTGGTGGTTAAAATGGGCTACATTCTTGGCCTGAGAGTGGCGCCGCGGTATCGCCGTCGTGGGATTGGCTCCGGCCTCGTCCGCCGTTTGGAAGATTGGTTCGTTTCTAATGATGTTGATTACTGTTGCATGGCCACTGAGAAAGATAATCACGCCTCTCTTAATCTCTTCATCAATAATTTGAGGTACATAAAGTTTAGAACAGGAAGAATCCTAGTAAACCCAGTAAGAAATCATCCATACAATATCAATTCATCAGAAATCAACATTCAAAAGCTAAAAATAGAAGAAGCAGAAGCAATATACAAAAAACACATGGCTTCAACAGAGTTCTTCCCCAAAGACATAAAAAACATATTGAAAAACAAGTTGAGTTTAGGGACATGGGTGGCAAATTTCAAACAACGACGGTTATCGTCGACAGTCGCCGGAGGAAACGAGCAGATTACGGCGAGTAGTTGGGCCATTGTAAGTCTATGGAACAGTGGGGAAGTTTTCAAGCTAAGGCTAGGAAAAGCACCATTTCCATGGCTTATTTACACAAAGAGTTTAAAAATGATGGATAAAATTTTGCCTTGCTTTAAAGTGATTTTGGTGCCTAATTATTTCAAGCCATTTGGGTTCTATTTTGTTTATGGATTGCACCATGAAGGCCCTTTTTCTGAGAGATTGGTTGGAGCTTTGTGCAAATTTGTGCACAATATGGCTTTGAACAATTCAAAGGATCATAATTGTAAAGCTATTGTTACTGAGATTAGTGGTGATGAAGATGATGATGAGCTGAAAATGAAGATTCCCCATTGGAAATTGCTATCATGTTATGAAGATTTTTGGTGCATAAAGTCCTTGAAAAGTAAGAGAAATAATAATAATATTATTATTAGTAATGATAATGATAATGAGCATCATATATTGGAATGGAAAAATACCCCACCTATTAGAACTCTCTTTGTAGACCCAAGAGAGGGCTTTACTCCGTTAACCCAATTTGGGTTTTCATTTTCTTTATCTTCTGCTCTGAAAACTGAGAAACATGGGTTTTCTACTCCCCAATTGCATAGTCCTTCGCCGGTAAAGTTTTGCTTTATGGTTTCTCGTATTTCTTGCAACTATCAGGATTCTACTTTCTCTGTCTCGCGAGCTAGTAAGTTTCGGGACTTAAGGTTGTTCAAATCGGTTGAGTTGGATCAGTTCATCACGAGTGATGACGAAGATGAAATGGGAGATGGGTTTTTTGAGGCAATTGAGGAATTGGAAAGAATGACGAGGGAACCATCGGATGTTCTTGAAGAAATGAACGATCGCCTTTCGGCGAGGGAATTTCAGCTCGTGTTGGTGTATTTCTCTCAAGAAGGGAGAGATTCATGGTGTGCTCTTGAGGTTTTTGAGTGGCTCCAAAAGGAAAATCGGGTTGACAAGGAGACCATGGAGTTGATGGTGTCTATAATGTGTAGTTGGATCAAGAAGTTGGTCGAGGGACGACATAACGTCAGTGATGTTGTTGACCTTCTTGTGGATATGGATTGTGTAGGTTTGAAGCCTCATTTTAGCATGATAGAAAAGGTTATCTCTTTGTATTGGGAAATGGGTGAGAAGGAAAAGGCAATTTCGTTTGTGAAAGAGGTCTTGGGACGCAATCTTGCTTTTATGAAGGACGATTGGGAGGGACATAAAGGGGGACCAAGCGGTTATCTCGCATGGAAGATGATGGTTGATGGTGACTATAGGGGTGCAGTGAAAATGGTGCTGCATCTTAGAGAATCTGGATTAAAGCCAGAGGTTTACTGCTATCTTATCGCGATGACTGCTGTGGTTAAAGAGCTGAATGAATTTGCAAAAGCTCTACGGAAACTCAAAAGTTATGCAAGAGATGGGATAGTGGCTGAACTCGATAAAAACAACGTTGAACTTGTTGAGAAGTATCAGACAGAGCTTCTAGCTGATGGAGTACGGTTATCCAACTGGGTGCTTGAAGAGGGAAGCTTTTCGATTCATGGGGTGGTTCATGAGAGACTCCTTGCTATGTACATTTGTGCTGGGCAAGGACTCGAGGCAGAGAGACAGCTTTGGGAAATGAAGCTTGTAGGCAAGGAGGCCGATGCTGATCTCTACGATATCGTGCTAGCGATTTGTGCTTCACAGAAGGAGACAAAAGCAATGAAACGGTTGCTTACCAGGATTGAGATTACGAGTCCCATGCGTAAGAAGAAGAGTTTGACATGGCTACTAAGGGGTTACATAAAAGGAGGGCATTTCCGTGATGCTGCAGAAACATTAGTAAAAATGGTCGATTTGGGTTTTCTCCCAGAGTACTTGGACAGAGTAGCTGTGCTGCAAGGACTAAGAAAACAGATTCGGGAACCTGAAAATGTCGATACTTATCTCGATCTCCTCAAATGTCTCTCTGATGCCAATCTAATTGGACCTAGTCTTGTATATTTGCACTTACAGAAACACAAGCTTTGGGTCATTAAAATGCTTTGA

Coding sequence (CDS)

ATGGAGTTTAACGGCTTTATTATTCGAAGCTATGAAGAGAGTCGATTATCAGATAAAGTTCAAGTTATGGATCTTGAACGACGATGTGAAATTGGTCAATCAAAACGTGTGTTTCTCTTTACTGACACTTTAGATGACCCTATTTGTAGGATACGTAACAGTCCCATGTATAAAATGTTGGTTGCTGAGTGGAACAAGGAAGTGGTTGGTGTTATTCAAGGCTCTATAAAAGCGGTTTTTTTTACTGCTCATAAACCACCGACCGGTTTGGTGGTTAAAATGGGCTACATTCTTGGCCTGAGAGTGGCGCCGCGGTATCGCCGTCGTGGGATTGGCTCCGGCCTCGTCCGCCGTTTGGAAGATTGGTTCGTTTCTAATGATGTTGATTACTGTTGCATGGCCACTGAGAAAGATAATCACGCCTCTCTTAATCTCTTCATCAATAATTTGAGGTACATAAAGTTTAGAACAGGAAGAATCCTAGTAAACCCAGTAAGAAATCATCCATACAATATCAATTCATCAGAAATCAACATTCAAAAGCTAAAAATAGAAGAAGCAGAAGCAATATACAAAAAACACATGGCTTCAACAGAGTTCTTCCCCAAAGACATAAAAAACATATTGAAAAACAAGTTGAGTTTAGGGACATGGGTGGCAAATTTCAAACAACGACGGTTATCGTCGACAGTCGCCGGAGGAAACGAGCAGATTACGGCGAGTAGTTGGGCCATTGTAAGTCTATGGAACAGTGGGGAAGTTTTCAAGCTAAGGCTAGGAAAAGCACCATTTCCATGGCTTATTTACACAAAGAGTTTAAAAATGATGGATAAAATTTTGCCTTGCTTTAAAGTGATTTTGGTGCCTAATTATTTCAAGCCATTTGGGTTCTATTTTGTTTATGGATTGCACCATGAAGGCCCTTTTTCTGAGAGATTGGTTGGAGCTTTGTGCAAATTTGTGCACAATATGGCTTTGAACAATTCAAAGGATCATAATTGTAAAGCTATTGTTACTGAGATTAGTGGTGATGAAGATGATGATGAGCTGAAAATGAAGATTCCCCATTGGAAATTGCTATCATGTTATGAAGATTTTTGGTGCATAAAGTCCTTGAAAAGTAAGAGAAATAATAATAATATTATTATTAGTAATGATAATGATAATGAGCATCATATATTGGAATGGAAAAATACCCCACCTATTAGAACTCTCTTTGTAGACCCAAGAGAGGGCTTTACTCCGTTAACCCAATTTGGGTTTTCATTTTCTTTATCTTCTGCTCTGAAAACTGAGAAACATGGGTTTTCTACTCCCCAATTGCATAGTCCTTCGCCGGTAAAGTTTTGCTTTATGGTTTCTCGTATTTCTTGCAACTATCAGGATTCTACTTTCTCTGTCTCGCGAGCTAGTAAGTTTCGGGACTTAAGGTTGTTCAAATCGGTTGAGTTGGATCAGTTCATCACGAGTGATGACGAAGATGAAATGGGAGATGGGTTTTTTGAGGCAATTGAGGAATTGGAAAGAATGACGAGGGAACCATCGGATGTTCTTGAAGAAATGAACGATCGCCTTTCGGCGAGGGAATTTCAGCTCGTGTTGGTGTATTTCTCTCAAGAAGGGAGAGATTCATGGTGTGCTCTTGAGGTTTTTGAGTGGCTCCAAAAGGAAAATCGGGTTGACAAGGAGACCATGGAGTTGATGGTGTCTATAATGTGTAGTTGGATCAAGAAGTTGGTCGAGGGACGACATAACGTCAGTGATGTTGTTGACCTTCTTGTGGATATGGATTGTGTAGGTTTGAAGCCTCATTTTAGCATGATAGAAAAGGTTATCTCTTTGTATTGGGAAATGGGTGAGAAGGAAAAGGCAATTTCGTTTGTGAAAGAGGTCTTGGGACGCAATCTTGCTTTTATGAAGGACGATTGGGAGGGACATAAAGGGGGACCAAGCGGTTATCTCGCATGGAAGATGATGGTTGATGGTGACTATAGGGGTGCAGTGAAAATGGTGCTGCATCTTAGAGAATCTGGATTAAAGCCAGAGGTTTACTGCTATCTTATCGCGATGACTGCTGTGGTTAAAGAGCTGAATGAATTTGCAAAAGCTCTACGGAAACTCAAAAGTTATGCAAGAGATGGGATAGTGGCTGAACTCGATAAAAACAACGTTGAACTTGTTGAGAAGTATCAGACAGAGCTTCTAGCTGATGGAGTACGGTTATCCAACTGGGTGCTTGAAGAGGGAAGCTTTTCGATTCATGGGGTGGTTCATGAGAGACTCCTTGCTATGTACATTTGTGCTGGGCAAGGACTCGAGGCAGAGAGACAGCTTTGGGAAATGAAGCTTGTAGGCAAGGAGGCCGATGCTGATCTCTACGATATCGTGCTAGCGATTTGTGCTTCACAGAAGGAGACAAAAGCAATGAAACGGTTGCTTACCAGGATTGAGATTACGAGTCCCATGCGTAAGAAGAAGAGTTTGACATGGCTACTAAGGGGTTACATAAAAGGAGGGCATTTCCGTGATGCTGCAGAAACATTAGTAAAAATGGTCGATTTGGGTTTTCTCCCAGAGTACTTGGACAGAGTAGCTGTGCTGCAAGGACTAAGAAAACAGATTCGGGAACCTGAAAATGTCGATACTTATCTCGATCTCCTCAAATGTCTCTCTGATGCCAATCTAATTGGACCTAGTCTTGTATATTTGCACTTACAGAAACACAAGCTTTGGGTCATTAAAATGCTTTGA

Protein sequence

MEFNGFIIRSYEESRLSDKVQVMDLERRCEIGQSKRVFLFTDTLDDPICRIRNSPMYKMLVAEWNKEVVGVIQGSIKAVFFTAHKPPTGLVVKMGYILGLRVAPRYRRRGIGSGLVRRLEDWFVSNDVDYCCMATEKDNHASLNLFINNLRYIKFRTGRILVNPVRNHPYNINSSEINIQKLKIEEAEAIYKKHMASTEFFPKDIKNILKNKLSLGTWVANFKQRRLSSTVAGGNEQITASSWAIVSLWNSGEVFKLRLGKAPFPWLIYTKSLKMMDKILPCFKVILVPNYFKPFGFYFVYGLHHEGPFSERLVGALCKFVHNMALNNSKDHNCKAIVTEISGDEDDDELKMKIPHWKLLSCYEDFWCIKSLKSKRNNNNIIISNDNDNEHHILEWKNTPPIRTLFVDPREGFTPLTQFGFSFSLSSALKTEKHGFSTPQLHSPSPVKFCFMVSRISCNYQDSTFSVSRASKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEELERMTREPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGRHNVSDVVDLLVDMDCVGLKPHFSMIEKVISLYWEMGEKEKAISFVKEVLGRNLAFMKDDWEGHKGGPSGYLAWKMMVDGDYRGAVKMVLHLRESGLKPEVYCYLIAMTAVVKELNEFAKALRKLKSYARDGIVAELDKNNVELVEKYQTELLADGVRLSNWVLEEGSFSIHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDIVLAICASQKETKAMKRLLTRIEITSPMRKKKSLTWLLRGYIKGGHFRDAAETLVKMVDLGFLPEYLDRVAVLQGLRKQIREPENVDTYLDLLKCLSDANLIGPSLVYLHLQKHKLWVIKML
Homology
BLAST of HG10001548 vs. NCBI nr
Match: XP_038901728.1 (pentatricopeptide repeat-containing protein At2g30100, chloroplastic [Benincasa hispida])

HSP 1 Score: 994.2 bits (2569), Expect = 7.5e-286
Identity = 494/506 (97.63%), Postives = 501/506 (99.01%), Query Frame = 0

Query: 411 EGFTPLTQFGFSFSLSSALKTEKHGFSTPQLHSPSPVKFCFMVSRISCNYQDSTFSVSRA 470
           +GFTPLTQFGFSFSLSSALKT++HGFSTPQL+SP PVKFCFMVSRISCNYQDSTFSVSRA
Sbjct: 5   QGFTPLTQFGFSFSLSSALKTDRHGFSTPQLYSPPPVKFCFMVSRISCNYQDSTFSVSRA 64

Query: 471 SKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEELERMTREPSDVLEEMNDRLSAREF 530
            KFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEELERMTREPSDVLEEMNDRLSAREF
Sbjct: 65  GKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEELERMTREPSDVLEEMNDRLSAREF 124

Query: 531 QLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGRHNVSDVV 590
           QLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEG+HNV DVV
Sbjct: 125 QLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGQHNVGDVV 184

Query: 591 DLLVDMDCVGLKPHFSMIEKVISLYWEMGEKEKAISFVKEVLGRNLAFMKDDWEGHKGGP 650
           DLLVDMDCVGLKPHFSMIEKVISLYWEMGEKEKAISFVKEVLGRNLAFMKDDWEGHKGGP
Sbjct: 185 DLLVDMDCVGLKPHFSMIEKVISLYWEMGEKEKAISFVKEVLGRNLAFMKDDWEGHKGGP 244

Query: 651 SGYLAWKMMVDGDYRGAVKMVLHLRESGLKPEVYCYLIAMTAVVKELNEFAKALRKLKSY 710
           SGYLAWKMMVDGDYRGAVKMVLHLRESGLKPEVYCYLIAMTAVVKELNEFAKALRKLKSY
Sbjct: 245 SGYLAWKMMVDGDYRGAVKMVLHLRESGLKPEVYCYLIAMTAVVKELNEFAKALRKLKSY 304

Query: 711 ARDGIVAELDKNNVELVEKYQTELLADGVRLSNWVLEEGSFSIHGVVHERLLAMYICAGQ 770
           ARDGIVAELDKNNVELVEKYQTELLADGVRLSNWVLEEGSFSIHGVVHERLLAMYICAGQ
Sbjct: 305 ARDGIVAELDKNNVELVEKYQTELLADGVRLSNWVLEEGSFSIHGVVHERLLAMYICAGQ 364

Query: 771 GLEAERQLWEMKLVGKEADADLYDIVLAICASQKETKAMKRLLTRIEITSPMRKKKSLTW 830
           GLEAERQLWEMKLVGKEADADLYDIVLAICASQKETKAMKRLLTRIEITSPMRKKKSLTW
Sbjct: 365 GLEAERQLWEMKLVGKEADADLYDIVLAICASQKETKAMKRLLTRIEITSPMRKKKSLTW 424

Query: 831 LLRGYIKGGHFRDAAETLVKMVDLGFLPEYLDRVAVLQGLRKQIREPENVDTYLDLLKCL 890
           LLRGYIKGGHF DAAETLVKM+DLGFLPEYLDRVAVLQGLRKQIREPENVDTYLDL KCL
Sbjct: 425 LLRGYIKGGHFHDAAETLVKMIDLGFLPEYLDRVAVLQGLRKQIREPENVDTYLDLCKCL 484

Query: 891 SDANLIGPSLVYLHLQKHKLWVIKML 917
           SDANLIGPSLVYLHLQKHKLWV+KML
Sbjct: 485 SDANLIGPSLVYLHLQKHKLWVVKML 510

BLAST of HG10001548 vs. NCBI nr
Match: XP_022944005.1 (pentatricopeptide repeat-containing protein At2g30100, chloroplastic [Cucurbita moschata])

HSP 1 Score: 935.6 bits (2417), Expect = 3.2e-268
Identity = 467/506 (92.29%), Postives = 489/506 (96.64%), Query Frame = 0

Query: 411 EGFTPLTQFGFSFSLSSALKTEKHGFSTPQLHSPSPVKFCFMVSRISCNYQDSTFSVSRA 470
           +GFTPLTQFGFSFSLSS LK+E+ GFS PQL S SPV FCFMVSRI+CN+Q+STFSVSRA
Sbjct: 5   QGFTPLTQFGFSFSLSSGLKSERLGFSAPQLCSRSPVNFCFMVSRITCNHQNSTFSVSRA 64

Query: 471 SKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEELERMTREPSDVLEEMNDRLSAREF 530
            KFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEELERMTR+PSDVLEEMNDRLSAREF
Sbjct: 65  GKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEELERMTRDPSDVLEEMNDRLSAREF 124

Query: 531 QLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGRHNVSDVV 590
           QLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEG+HNV DVV
Sbjct: 125 QLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGQHNVRDVV 184

Query: 591 DLLVDMDCVGLKPHFSMIEKVISLYWEMGEKEKAISFVKEVLGRNLAFMKDDWEGHKGGP 650
           DLLVDMDCVGLKPHFSMIEKVISLYW+MGEKEKAISFVKEVLGR L FMKD+WEGHKGGP
Sbjct: 185 DLLVDMDCVGLKPHFSMIEKVISLYWDMGEKEKAISFVKEVLGRKLDFMKDNWEGHKGGP 244

Query: 651 SGYLAWKMMVDGDYRGAVKMVLHLRESGLKPEVYCYLIAMTAVVKELNEFAKALRKLKSY 710
           SGYLAWKMMVDGDYRGAVKMVL+LRESGLKPEVYCYLIAMTAVVKELNEFAKALRKLKSY
Sbjct: 245 SGYLAWKMMVDGDYRGAVKMVLNLRESGLKPEVYCYLIAMTAVVKELNEFAKALRKLKSY 304

Query: 711 ARDGIVAELDKNNVELVEKYQTELLADGVRLSNWVLEEGSFSIHGVVHERLLAMYICAGQ 770
           ARDG+VAELDK+NVELV++YQ+ELLADGVRLSNWVL+EG  S HGVVHERLLAMYICAGQ
Sbjct: 305 ARDGMVAELDKDNVELVKRYQSELLADGVRLSNWVLDEGGSSSHGVVHERLLAMYICAGQ 364

Query: 771 GLEAERQLWEMKLVGKEADADLYDIVLAICASQKETKAMKRLLTRIEITSPMRKKKSLTW 830
           GLEAERQLWEMKLVGKEADADLYDIVLAICASQKET+AM RLLTRIEITSP  KKKSLTW
Sbjct: 365 GLEAERQLWEMKLVGKEADADLYDIVLAICASQKETRAMNRLLTRIEITSPRLKKKSLTW 424

Query: 831 LLRGYIKGGHFRDAAETLVKMVDLGFLPEYLDRVAVLQGLRKQIREPENVDTYLDLLKCL 890
           LLRGYIKGGHFRDAAETLVKMV+LGFLPEYLDRVAVLQGLRK+IREPENV+TYLDL KCL
Sbjct: 425 LLRGYIKGGHFRDAAETLVKMVNLGFLPEYLDRVAVLQGLRKRIREPENVETYLDLCKCL 484

Query: 891 SDANLIGPSLVYLHLQKHKLWVIKML 917
           SDANLIGPSLVYLHLQK+KLWVIKML
Sbjct: 485 SDANLIGPSLVYLHLQKYKLWVIKML 510

BLAST of HG10001548 vs. NCBI nr
Match: KAG7010495.1 (Pentatricopeptide repeat-containing protein, chloroplastic [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 934.5 bits (2414), Expect = 7.0e-268
Identity = 466/506 (92.09%), Postives = 489/506 (96.64%), Query Frame = 0

Query: 411 EGFTPLTQFGFSFSLSSALKTEKHGFSTPQLHSPSPVKFCFMVSRISCNYQDSTFSVSRA 470
           +GF+PLTQFGFSFSLSS LK+E+ GFS PQL S SPV FCFMVSRI+CN+Q+STFSVSRA
Sbjct: 5   QGFSPLTQFGFSFSLSSGLKSERLGFSAPQLCSRSPVNFCFMVSRITCNHQNSTFSVSRA 64

Query: 471 SKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEELERMTREPSDVLEEMNDRLSAREF 530
            KFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEELERMTR+PSDVLEEMNDRLSAREF
Sbjct: 65  GKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEELERMTRDPSDVLEEMNDRLSAREF 124

Query: 531 QLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGRHNVSDVV 590
           QLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEG+HNV DVV
Sbjct: 125 QLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGQHNVGDVV 184

Query: 591 DLLVDMDCVGLKPHFSMIEKVISLYWEMGEKEKAISFVKEVLGRNLAFMKDDWEGHKGGP 650
           DLLVDMDCVGLKPHFSMIEKVISLYW+MGEKEKAISFVKEVLGR L FMKD+WEGHKGGP
Sbjct: 185 DLLVDMDCVGLKPHFSMIEKVISLYWDMGEKEKAISFVKEVLGRKLDFMKDNWEGHKGGP 244

Query: 651 SGYLAWKMMVDGDYRGAVKMVLHLRESGLKPEVYCYLIAMTAVVKELNEFAKALRKLKSY 710
           SGYLAWKMMVDGDYRGAVKMVL+LRESGLKPEVYCYLIAMTAVVKELNEFAKALRKLKSY
Sbjct: 245 SGYLAWKMMVDGDYRGAVKMVLNLRESGLKPEVYCYLIAMTAVVKELNEFAKALRKLKSY 304

Query: 711 ARDGIVAELDKNNVELVEKYQTELLADGVRLSNWVLEEGSFSIHGVVHERLLAMYICAGQ 770
           ARDG+VAELDK+NVELV++YQ+ELLADGVRLSNWVL+EG  S HGVVHERLLAMYICAGQ
Sbjct: 305 ARDGMVAELDKDNVELVKRYQSELLADGVRLSNWVLDEGGSSSHGVVHERLLAMYICAGQ 364

Query: 771 GLEAERQLWEMKLVGKEADADLYDIVLAICASQKETKAMKRLLTRIEITSPMRKKKSLTW 830
           GLEAERQLWEMKLVGKEADADLYDIVLAICASQKET+AM RLLTRIEITSP  KKKSLTW
Sbjct: 365 GLEAERQLWEMKLVGKEADADLYDIVLAICASQKETRAMNRLLTRIEITSPRLKKKSLTW 424

Query: 831 LLRGYIKGGHFRDAAETLVKMVDLGFLPEYLDRVAVLQGLRKQIREPENVDTYLDLLKCL 890
           LLRGYIKGGHFRDAAETLVKMV+LGFLPEYLDRVAVLQGLRK+IREPENV+TYLDL KCL
Sbjct: 425 LLRGYIKGGHFRDAAETLVKMVNLGFLPEYLDRVAVLQGLRKRIREPENVETYLDLCKCL 484

Query: 891 SDANLIGPSLVYLHLQKHKLWVIKML 917
           SDANLIGPSLVYLHLQK+KLWVIKML
Sbjct: 485 SDANLIGPSLVYLHLQKYKLWVIKML 510

BLAST of HG10001548 vs. NCBI nr
Match: XP_023512972.1 (pentatricopeptide repeat-containing protein At2g30100, chloroplastic [Cucurbita pepo subsp. pepo])

HSP 1 Score: 932.9 bits (2410), Expect = 2.0e-267
Identity = 466/506 (92.09%), Postives = 488/506 (96.44%), Query Frame = 0

Query: 411 EGFTPLTQFGFSFSLSSALKTEKHGFSTPQLHSPSPVKFCFMVSRISCNYQDSTFSVSRA 470
           +GFTPLTQFGFSFSLSS LK+E+ GFS PQL S SPV FCFMVSRI+CN+Q+STFSVSRA
Sbjct: 5   QGFTPLTQFGFSFSLSSGLKSERLGFSAPQLCSRSPVNFCFMVSRITCNHQNSTFSVSRA 64

Query: 471 SKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEELERMTREPSDVLEEMNDRLSAREF 530
            KFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEELERMTR+PSDVLEEMNDRLSAREF
Sbjct: 65  GKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEELERMTRDPSDVLEEMNDRLSAREF 124

Query: 531 QLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGRHNVSDVV 590
           QLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEG+HNV DVV
Sbjct: 125 QLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGQHNVGDVV 184

Query: 591 DLLVDMDCVGLKPHFSMIEKVISLYWEMGEKEKAISFVKEVLGRNLAFMKDDWEGHKGGP 650
           DLLVDMDCVGLKPHFSMIEKVISLYW+MGEKEKAISFVKEVLGR L FMKD+WEGHKGGP
Sbjct: 185 DLLVDMDCVGLKPHFSMIEKVISLYWDMGEKEKAISFVKEVLGRKLDFMKDNWEGHKGGP 244

Query: 651 SGYLAWKMMVDGDYRGAVKMVLHLRESGLKPEVYCYLIAMTAVVKELNEFAKALRKLKSY 710
           SGYLAWKMMVDGDYRGAVKMVL+LRESGLKPEVYCYLIAMTAVVKELNEFAKALRKLKSY
Sbjct: 245 SGYLAWKMMVDGDYRGAVKMVLNLRESGLKPEVYCYLIAMTAVVKELNEFAKALRKLKSY 304

Query: 711 ARDGIVAELDKNNVELVEKYQTELLADGVRLSNWVLEEGSFSIHGVVHERLLAMYICAGQ 770
           ARDG+VAELDK+NVELV++YQ+ELLADGVRLSNWVL+EG  S H VVHERLLAMYICAGQ
Sbjct: 305 ARDGMVAELDKDNVELVKRYQSELLADGVRLSNWVLDEGGSSSHRVVHERLLAMYICAGQ 364

Query: 771 GLEAERQLWEMKLVGKEADADLYDIVLAICASQKETKAMKRLLTRIEITSPMRKKKSLTW 830
           GLEAERQLWEMKLVGKEADADLYDIVLAICASQKET+AM RLLTRIEITSP  KKKSLTW
Sbjct: 365 GLEAERQLWEMKLVGKEADADLYDIVLAICASQKETRAMNRLLTRIEITSPRLKKKSLTW 424

Query: 831 LLRGYIKGGHFRDAAETLVKMVDLGFLPEYLDRVAVLQGLRKQIREPENVDTYLDLLKCL 890
           LLRGYIKGGHFRDAAETLVKMV+LGFLPEYLDRVAVLQGLRK+IREPENV+TYLDL KCL
Sbjct: 425 LLRGYIKGGHFRDAAETLVKMVNLGFLPEYLDRVAVLQGLRKRIREPENVETYLDLCKCL 484

Query: 891 SDANLIGPSLVYLHLQKHKLWVIKML 917
           SDANLIGPSLVYLHLQK+KLWVIKML
Sbjct: 485 SDANLIGPSLVYLHLQKYKLWVIKML 510

BLAST of HG10001548 vs. NCBI nr
Match: KAG6570645.1 (Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 932.6 bits (2409), Expect = 2.7e-267
Identity = 465/506 (91.90%), Postives = 488/506 (96.44%), Query Frame = 0

Query: 411 EGFTPLTQFGFSFSLSSALKTEKHGFSTPQLHSPSPVKFCFMVSRISCNYQDSTFSVSRA 470
           +GF+PLTQFGFSFSLSS LK+E+ GFS PQL S SPV FCFMVSRI+CN+Q+STFSVSRA
Sbjct: 5   QGFSPLTQFGFSFSLSSGLKSERLGFSAPQLCSRSPVNFCFMVSRITCNHQNSTFSVSRA 64

Query: 471 SKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEELERMTREPSDVLEEMNDRLSAREF 530
            KFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEELERM R+PSDVLEEMNDRLSAREF
Sbjct: 65  GKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEELERMARDPSDVLEEMNDRLSAREF 124

Query: 531 QLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGRHNVSDVV 590
           QLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEG+HNV DVV
Sbjct: 125 QLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGQHNVGDVV 184

Query: 591 DLLVDMDCVGLKPHFSMIEKVISLYWEMGEKEKAISFVKEVLGRNLAFMKDDWEGHKGGP 650
           DLLVDMDCVGLKPHFSMIEKVISLYW+MGEKEKAISFVKEVLGR L FMKD+WEGHKGGP
Sbjct: 185 DLLVDMDCVGLKPHFSMIEKVISLYWDMGEKEKAISFVKEVLGRKLDFMKDNWEGHKGGP 244

Query: 651 SGYLAWKMMVDGDYRGAVKMVLHLRESGLKPEVYCYLIAMTAVVKELNEFAKALRKLKSY 710
           SGYLAWKMMVDGDYRGAVKMVL+LRESGLKPEVYCYLIAMTAVVKELNEFAKALRKLKSY
Sbjct: 245 SGYLAWKMMVDGDYRGAVKMVLNLRESGLKPEVYCYLIAMTAVVKELNEFAKALRKLKSY 304

Query: 711 ARDGIVAELDKNNVELVEKYQTELLADGVRLSNWVLEEGSFSIHGVVHERLLAMYICAGQ 770
           ARDG+VAELDK+NVELV++YQ+ELLADGVRLSNWVL+EG  S HGVVHERLLAMYICAGQ
Sbjct: 305 ARDGMVAELDKDNVELVKRYQSELLADGVRLSNWVLDEGGSSSHGVVHERLLAMYICAGQ 364

Query: 771 GLEAERQLWEMKLVGKEADADLYDIVLAICASQKETKAMKRLLTRIEITSPMRKKKSLTW 830
           GLEAERQLWEMKLVGKEADADLYDIVLAICASQKET+AM RLLTRIEITSP  KKKSLTW
Sbjct: 365 GLEAERQLWEMKLVGKEADADLYDIVLAICASQKETRAMNRLLTRIEITSPRLKKKSLTW 424

Query: 831 LLRGYIKGGHFRDAAETLVKMVDLGFLPEYLDRVAVLQGLRKQIREPENVDTYLDLLKCL 890
           LLRGYIKGGHFRDAAETLVKMV+LGFLPEYLDRVAVLQGLRK+IREPENV+TYLDL KCL
Sbjct: 425 LLRGYIKGGHFRDAAETLVKMVNLGFLPEYLDRVAVLQGLRKRIREPENVETYLDLCKCL 484

Query: 891 SDANLIGPSLVYLHLQKHKLWVIKML 917
           SDANLIGPSLVYLHLQK+KLWVIKML
Sbjct: 485 SDANLIGPSLVYLHLQKYKLWVIKML 510

BLAST of HG10001548 vs. ExPASy Swiss-Prot
Match: Q0WNN7 (Pentatricopeptide repeat-containing protein At2g30100, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=At2g30100 PE=2 SV=2)

HSP 1 Score: 603.6 bits (1555), Expect = 3.7e-171
Identity = 308/489 (62.99%), Postives = 380/489 (77.71%), Query Frame = 0

Query: 439 PQLHSPSPVKFCFMVSRISCNYQDSTFSVSRASKFRDLRLFKSVELDQFITSDDE----D 498
           P+LH    VK     SRI CN + +      A KFR++ L +SVELDQFITS++E    +
Sbjct: 22  PRLHRNHSVK---PNSRIICNLKLN----YSAGKFREMGLSRSVELDQFITSEEEEGEAE 81

Query: 499 EMGDGFFEAIEELERMTREPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWL 558
           E+G+GFFEAIEELERMTREPSD+LEEMN RLS+RE QL+LVYF+QEGRDSWC LEVFEWL
Sbjct: 82  EIGEGFFEAIEELERMTREPSDILEEMNHRLSSRELQLMLVYFAQEGRDSWCTLEVFEWL 141

Query: 559 QKENRVDKETMELMVSIMCSWIKKLVEGRHNVSDVVDLLVDMDCVGLKPHFSMIEKVISL 618
           +KENRVD+E MELMVSIMC W+KKL+E   N   V DLL++MDCVGLKP FSM++KVI+L
Sbjct: 142 KKENRVDEEIMELMVSIMCGWVKKLIEDECNAHQVFDLLIEMDCVGLKPGFSMMDKVIAL 201

Query: 619 YWEMGEKEKAISFVKEVLGRNLAFMKD-----DWEGHKGGPSGYLAWKMMVDGDYRGAVK 678
           Y EMG+KE A+ FVKEVL R   F          EG KGGP GYLAWK MVDGDYR AV 
Sbjct: 202 YCEMGKKESAVLFVKEVLRRRDGFGYSVVGGGGSEGRKGGPVGYLAWKFMVDGDYRKAVD 261

Query: 679 MVLHLRESGLKPEVYCYLIAMTAVVKELNEFAKALRKLKSYARDGIVAELDKNNVELVEK 738
           MV+ LR SGLKPE Y YLIAMTA+VKELN   K LR+LK +AR G VAE+D ++  L+EK
Sbjct: 262 MVMELRLSGLKPEAYSYLIAMTAIVKELNSLGKTLRELKRFARAGFVAEIDDHDRVLIEK 321

Query: 739 YQTELLADGVRLSNWVLEEG--SFSIHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKE 798
           YQ+E L+ G++L+ W +EEG  + SI GVVHERLLAMYICAG+G EAE+QLW+MKL G+E
Sbjct: 322 YQSETLSRGLQLATWAVEEGQENDSIIGVVHERLLAMYICAGRGPEAEKQLWKMKLAGRE 381

Query: 799 ADADLYDIVLAICASQKETKAMKRLLTRIEITSPMRKKKSLTWLLRGYIKGGHFRDAAET 858
            +ADL+DIV+AICASQKE  A+ RLLTR+E     RKKK+L+WLLRGY+KGGHF +AAET
Sbjct: 382 PEADLHDIVMAICASQKEVNAVSRLLTRVEFMGSQRKKKTLSWLLRGYVKGGHFEEAAET 441

Query: 859 LVKMVDLGFLPEYLDRVAVLQGLRKQIREPENVDTYLDLLKCLSDANLIGPSLVYLHLQK 917
           LV M+D G  PEY+DRVAV+QG+ ++I+ P +V+ Y+ L K L DA L+GP LVY+++ K
Sbjct: 442 LVSMIDSGLHPEYIDRVAVMQGMTRKIQRPRDVEAYMSLCKRLFDAGLVGPCLVYMYIDK 501

BLAST of HG10001548 vs. ExPASy Swiss-Prot
Match: O64815 (Probable N-acetyltransferase HLS1-like OS=Arabidopsis thaliana OX=3702 GN=At2g23060 PE=2 SV=1)

HSP 1 Score: 319.7 bits (818), Expect = 1.1e-85
Identity = 180/432 (41.67%), Postives = 256/432 (59.26%), Query Frame = 0

Query: 8   IRSYEESRLSDKVQVMDLERRCEIGQSKRVFLFTDTLDDPICRIRNSPMYKMLVAE---- 67
           +R Y+ S+  D   V D+ERRCE+G + ++ LFTD L DPICR+R+SP Y MLVAE    
Sbjct: 7   VREYDPSK--DLATVEDVERRCEVGPAGKLSLFTDLLGDPICRVRHSPSYLMLVAEIGPK 66

Query: 68  WNKEVVGVIQGSIKAVF---------FTAHKPPTGLVV------KMGYILGLRVAPRYRR 127
             KE+VG+I+G IK V           T +K    +V+      K+ YILGLRV+P +RR
Sbjct: 67  EKKELVGMIRGCIKTVTCGITTKRLDLTHNKSQNDVVITKPLYTKLAYILGLRVSPTHRR 126

Query: 128 RGIGSGLVRRLEDWFVSNDVDYCCMATEKDNHASLNLFINNLRYIKFRTGRILVNPVRNH 187
           +GIG  LV+ +EDWF  N  +Y   ATE DNHAS+NLF     Y +FRT  ILVNPV  H
Sbjct: 127 QGIGFKLVKAMEDWFSQNGAEYSYFATENDNHASVNLFTGKCGYAEFRTPSILVNPVYAH 186

Query: 188 PYNINSSEINIQKLKIEEAEAIYKKHMASTEFFPKDIKNILKNKLSLGTWVANFKQRRLS 247
             NI S  + + KL+  +AE +Y+   ++TEFFP+DI ++L NKLSLGT+VA  +     
Sbjct: 187 RVNI-SRRVTVIKLEPSDAELLYRLRFSTTEFFPRDIDSVLNNKLSLGTFVAVPR----G 246

Query: 248 STVAGGNEQITAS---------SWAIVSLWNSGEVFKLRLGKAPFPWLIYTKSLKMMDKI 307
           S    G+     S         SWA++S+WN  + F+L +  A     + +K+ +M+DK 
Sbjct: 247 SCYGSGSRSWPGSAKFLEYPPDSWAVLSVWNCKDSFRLEVRGASRLRRVVSKATRMVDKT 306

Query: 308 LPCFKVILVPNYFKPFGFYFVYGLHHEGPFSERLVGALCKFVHNMALNNSKDHNCKAIVT 367
           LP  K+  +P  F+PFG +F+YG+  EGP +E++V ALC   HN+A    K+  C  +  
Sbjct: 307 LPFLKIPSIPAVFRPFGLHFMYGIGGEGPRAEKMVKALCDHAHNLA----KEGGCGVVAA 366

Query: 368 EISGDEDDDELKMKIPHWKLLSCYEDFWCIKSLKSKRNNNNIIISNDNDNEHHILEWKNT 412
           E++G+E    L+  IPHWK+LSC ED WCIK L             ++ ++  + +W  +
Sbjct: 367 EVAGEE---PLRRGIPHWKVLSCAEDLWCIKRL------------GEDYSDGSVGDWTKS 412

BLAST of HG10001548 vs. ExPASy Swiss-Prot
Match: Q42381 (Probable N-acetyltransferase HLS1 OS=Arabidopsis thaliana OX=3702 GN=HLS1 PE=1 SV=1)

HSP 1 Score: 310.5 bits (794), Expect = 6.5e-83
Identity = 177/425 (41.65%), Postives = 253/425 (59.53%), Query Frame = 0

Query: 7   IIRSYEESRLSDKVQVMDLERRCEIGQSKRVFLFTDTLDDPICRIRNSPMYKMLVAEW-- 66
           ++R Y+ +R  D V V D+ERRCE+G S ++ LFTD L DPICRIR+SP Y MLVAE   
Sbjct: 3   VVREYDPTR--DLVGVEDVERRCEVGPSGKLSLFTDLLGDPICRIRHSPSYLMLVAEMGT 62

Query: 67  -NKEVVGVIQGSIKAV-----FFTAHKPPTGLV----VKMGYILGLRVAPRYRRRGIGSG 126
             KE+VG+I+G IK V         HK    +V     K+ Y+LGLRV+P +RR+GIG  
Sbjct: 63  EKKEIVGMIRGCIKTVTCGQKLDLNHKSQNDVVKPLYTKLAYVLGLRVSPFHRRQGIGFK 122

Query: 127 LVRRLEDWFVSNDVDYCCMATEKDNHASLNLFINNLRYIKFRTGRILVNPVRNHPYNINS 186
           LV+ +E+WF  N  +Y  +ATE DN AS+NLF     Y +FRT  ILVNPV  H  N+ S
Sbjct: 123 LVKMMEEWFRQNGAEYSYIATENDNQASVNLFTGKCGYSEFRTPSILVNPVYAHRVNV-S 182

Query: 187 SEINIQKLKIEEAEAIYKKHMASTEFFPKDIKNILKNKLSLGTWVANFKQRRLSSTVAGG 246
             + + KL+  +AE +Y+   ++TEFFP+DI ++L NKLSLGT+VA     R S   +G 
Sbjct: 183 RRVTVIKLEPVDAETLYRIRFSTTEFFPRDIDSVLNNKLSLGTFVA---VPRGSCYGSGS 242

Query: 247 NE--------QITASSWAIVSLWNSGEVFKLRLGKAPFPWLIYTKSLKMMDKILPCFKVI 306
                     +    SWA++S+WN  + F L +  A     +  K+ +++DK LP  K+ 
Sbjct: 243 GSWPGSAKFLEYPPESWAVLSVWNCKDSFLLEVRGASRLRRVVAKTTRVVDKTLPFLKLP 302

Query: 307 LVPNYFKPFGFYFVYGLHHEGPFSERLVGALCKFVHNMALNNSKDHNCKAIVTEISGDED 366
            +P+ F+PFG +F+YG+  EGP + ++V +LC   HN+A    K   C  +  E++G   
Sbjct: 303 SIPSVFEPFGLHFMYGIGGEGPRAVKMVKSLCAHAHNLA----KAGGCGVVAAEVAG--- 362

Query: 367 DDELKMKIPHWKLLSCYEDFWCIKSLKSKRNNNNIIISNDNDNEHHILEWKNTPPIRTLF 412
           +D L+  IPHWK+LSC ED WCIK L             D+ ++  + +W  +PP  ++F
Sbjct: 363 EDPLRRGIPHWKVLSCDEDLWCIKRL------------GDDYSDGVVGDWTKSPPGVSIF 402

BLAST of HG10001548 vs. ExPASy Swiss-Prot
Match: Q0WVV0 (Pentatricopeptide repeat-containing protein At1g10910, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=At1g10910 PE=2 SV=1)

HSP 1 Score: 56.2 bits (134), Expect = 2.2e-06
Identity = 46/215 (21.40%), Postives = 89/215 (41.40%), Query Frame = 0

Query: 658 MMVDGDYRGAVKMVLHLRESGLKPEVYCYLIAMTAVVKELNEFAKALRKLKSYARDGIVA 717
           ++ +G     +K+   ++  GLKP+V  Y   +   +K  N + KA+          ++ 
Sbjct: 176 LVKNGKLDSCIKLFDQMKRDGLKPDVVTYNTLLAGCIKVKNGYPKAIE---------LIG 235

Query: 718 ELDKNNVELVEKYQTELLADGVRLSNWVLEEGSFSIHGVVHERLLAMYICAGQGLEAERQ 777
           EL  N +++                             V++  +LA+    G+  EAE  
Sbjct: 236 ELPHNGIQM---------------------------DSVMYGTVLAICASNGRSEEAENF 295

Query: 778 LWEMKLVGKEADADLYDIVLAICASQKETKAMKRLLTRIEITSPMRKKKSLTWLLRGYIK 837
           + +MK+ G   +   Y  +L   + + + K    L+T ++    +  K  +T LL+ YIK
Sbjct: 296 IQQMKVEGHSPNIYHYSSLLNSYSWKGDYKKADELMTEMKSIGLVPNKVMMTTLLKVYIK 354

Query: 838 GGHFRDAAETLVKMVDLGFLPEYLDRVAVLQGLRK 873
           GG F  + E L ++   G+    +    ++ GL K
Sbjct: 356 GGLFDRSRELLSELESAGYAENEMPYCMLMDGLSK 354

BLAST of HG10001548 vs. ExPASy TrEMBL
Match: A0A6J1FYE9 (pentatricopeptide repeat-containing protein At2g30100, chloroplastic OS=Cucurbita moschata OX=3662 GN=LOC111448564 PE=4 SV=1)

HSP 1 Score: 935.6 bits (2417), Expect = 1.5e-268
Identity = 467/506 (92.29%), Postives = 489/506 (96.64%), Query Frame = 0

Query: 411 EGFTPLTQFGFSFSLSSALKTEKHGFSTPQLHSPSPVKFCFMVSRISCNYQDSTFSVSRA 470
           +GFTPLTQFGFSFSLSS LK+E+ GFS PQL S SPV FCFMVSRI+CN+Q+STFSVSRA
Sbjct: 5   QGFTPLTQFGFSFSLSSGLKSERLGFSAPQLCSRSPVNFCFMVSRITCNHQNSTFSVSRA 64

Query: 471 SKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEELERMTREPSDVLEEMNDRLSAREF 530
            KFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEELERMTR+PSDVLEEMNDRLSAREF
Sbjct: 65  GKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEELERMTRDPSDVLEEMNDRLSAREF 124

Query: 531 QLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGRHNVSDVV 590
           QLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEG+HNV DVV
Sbjct: 125 QLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGQHNVRDVV 184

Query: 591 DLLVDMDCVGLKPHFSMIEKVISLYWEMGEKEKAISFVKEVLGRNLAFMKDDWEGHKGGP 650
           DLLVDMDCVGLKPHFSMIEKVISLYW+MGEKEKAISFVKEVLGR L FMKD+WEGHKGGP
Sbjct: 185 DLLVDMDCVGLKPHFSMIEKVISLYWDMGEKEKAISFVKEVLGRKLDFMKDNWEGHKGGP 244

Query: 651 SGYLAWKMMVDGDYRGAVKMVLHLRESGLKPEVYCYLIAMTAVVKELNEFAKALRKLKSY 710
           SGYLAWKMMVDGDYRGAVKMVL+LRESGLKPEVYCYLIAMTAVVKELNEFAKALRKLKSY
Sbjct: 245 SGYLAWKMMVDGDYRGAVKMVLNLRESGLKPEVYCYLIAMTAVVKELNEFAKALRKLKSY 304

Query: 711 ARDGIVAELDKNNVELVEKYQTELLADGVRLSNWVLEEGSFSIHGVVHERLLAMYICAGQ 770
           ARDG+VAELDK+NVELV++YQ+ELLADGVRLSNWVL+EG  S HGVVHERLLAMYICAGQ
Sbjct: 305 ARDGMVAELDKDNVELVKRYQSELLADGVRLSNWVLDEGGSSSHGVVHERLLAMYICAGQ 364

Query: 771 GLEAERQLWEMKLVGKEADADLYDIVLAICASQKETKAMKRLLTRIEITSPMRKKKSLTW 830
           GLEAERQLWEMKLVGKEADADLYDIVLAICASQKET+AM RLLTRIEITSP  KKKSLTW
Sbjct: 365 GLEAERQLWEMKLVGKEADADLYDIVLAICASQKETRAMNRLLTRIEITSPRLKKKSLTW 424

Query: 831 LLRGYIKGGHFRDAAETLVKMVDLGFLPEYLDRVAVLQGLRKQIREPENVDTYLDLLKCL 890
           LLRGYIKGGHFRDAAETLVKMV+LGFLPEYLDRVAVLQGLRK+IREPENV+TYLDL KCL
Sbjct: 425 LLRGYIKGGHFRDAAETLVKMVNLGFLPEYLDRVAVLQGLRKRIREPENVETYLDLCKCL 484

Query: 891 SDANLIGPSLVYLHLQKHKLWVIKML 917
           SDANLIGPSLVYLHLQK+KLWVIKML
Sbjct: 485 SDANLIGPSLVYLHLQKYKLWVIKML 510

BLAST of HG10001548 vs. ExPASy TrEMBL
Match: A0A6J1JH85 (pentatricopeptide repeat-containing protein At2g30100, chloroplastic OS=Cucurbita maxima OX=3661 GN=LOC111484464 PE=4 SV=1)

HSP 1 Score: 928.7 bits (2399), Expect = 1.9e-266
Identity = 463/505 (91.68%), Postives = 489/505 (96.83%), Query Frame = 0

Query: 412 GFTPLTQFGFSFSLSSALKTEKHGFSTPQLHSPSPVKFCFMVSRISCNYQDSTFSVSRAS 471
           GFTPLT+FGFSFSLSS LK+++ GFS PQL S SPV FCF+VSRI+CN+Q+STFSVSRA 
Sbjct: 6   GFTPLTKFGFSFSLSSGLKSKRLGFSAPQLCSRSPVNFCFIVSRITCNHQNSTFSVSRAG 65

Query: 472 KFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEELERMTREPSDVLEEMNDRLSAREFQ 531
           KFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEELERMTR+PSDVLEEMNDRLSAREFQ
Sbjct: 66  KFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEELERMTRDPSDVLEEMNDRLSAREFQ 125

Query: 532 LVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGRHNVSDVVD 591
           LVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEG+HNV DVVD
Sbjct: 126 LVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGQHNVGDVVD 185

Query: 592 LLVDMDCVGLKPHFSMIEKVISLYWEMGEKEKAISFVKEVLGRNLAFMKDDWEGHKGGPS 651
           LLVDMDCVGLKPHFSMIEKVISLYW+MGEKEKAISFVKEVLGR L FMKD+WEGHKGGPS
Sbjct: 186 LLVDMDCVGLKPHFSMIEKVISLYWDMGEKEKAISFVKEVLGRKLDFMKDNWEGHKGGPS 245

Query: 652 GYLAWKMMVDGDYRGAVKMVLHLRESGLKPEVYCYLIAMTAVVKELNEFAKALRKLKSYA 711
           GYLAWKMMVDGDYRGAVKMVL+LRESGLKPEVYC+LIAMTAVVKELNEFAKALRKLKSYA
Sbjct: 246 GYLAWKMMVDGDYRGAVKMVLNLRESGLKPEVYCFLIAMTAVVKELNEFAKALRKLKSYA 305

Query: 712 RDGIVAELDKNNVELVEKYQTELLADGVRLSNWVLEEGSFSIHGVVHERLLAMYICAGQG 771
           RDG+VAELDK+NVELV++YQ+ELLADGVRLSNWVL+EGS S HGVVHERLLAMYICAGQG
Sbjct: 306 RDGMVAELDKDNVELVKRYQSELLADGVRLSNWVLDEGSSSSHGVVHERLLAMYICAGQG 365

Query: 772 LEAERQLWEMKLVGKEADADLYDIVLAICASQKETKAMKRLLTRIEITSPMRKKKSLTWL 831
           LEAERQLWEMKLVGKEADADLYDIVLAICASQKET+AM RLL+RIEITSP  KKKSLTWL
Sbjct: 366 LEAERQLWEMKLVGKEADADLYDIVLAICASQKETRAMNRLLSRIEITSPRLKKKSLTWL 425

Query: 832 LRGYIKGGHFRDAAETLVKMVDLGFLPEYLDRVAVLQGLRKQIREPENVDTYLDLLKCLS 891
           LRGYIKGGHFRDAAETLVKMV+LGFLPEYLDRVAVLQGLRK+IREPENV+TYLDL KCLS
Sbjct: 426 LRGYIKGGHFRDAAETLVKMVNLGFLPEYLDRVAVLQGLRKRIREPENVETYLDLCKCLS 485

Query: 892 DANLIGPSLVYLHLQKHKLWVIKML 917
           DANLIGPSLVYLHLQK+KLWVIKML
Sbjct: 486 DANLIGPSLVYLHLQKYKLWVIKML 510

BLAST of HG10001548 vs. ExPASy TrEMBL
Match: A0A0A0KC35 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G182120 PE=4 SV=1)

HSP 1 Score: 921.8 bits (2381), Expect = 2.3e-264
Identity = 464/506 (91.70%), Postives = 481/506 (95.06%), Query Frame = 0

Query: 411 EGFTPLTQFGFSFSLSSALKTEKHGFSTPQLHSPSPVKFCFMVSRISCNYQDSTFSVSRA 470
           +GFTPLTQFGFSFSLSS L++++ GFSTP+L         +MVS ISCNYQDSTFSVSRA
Sbjct: 5   QGFTPLTQFGFSFSLSSPLESQRCGFSTPRL---------YMVSPISCNYQDSTFSVSRA 64

Query: 471 SKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEELERMTREPSDVLEEMNDRLSAREF 530
           +KFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEELERMTREPSDVLEEMNDRLSARE 
Sbjct: 65  AKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEELERMTREPSDVLEEMNDRLSAREI 124

Query: 531 QLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGRHNVSDVV 590
           QLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGRHNV DVV
Sbjct: 125 QLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGRHNVGDVV 184

Query: 591 DLLVDMDCVGLKPHFSMIEKVISLYWEMGEKEKAISFVKEVLGRNLAFMKDDWEGHKGGP 650
           DLLVDMDCVGLKPHFSMIEKVISLYWEMGEKEKA+ FVKEVLGRNLAFMKDDWEGHKGGP
Sbjct: 185 DLLVDMDCVGLKPHFSMIEKVISLYWEMGEKEKAVFFVKEVLGRNLAFMKDDWEGHKGGP 244

Query: 651 SGYLAWKMMVDGDYRGAVKMVLHLRESGLKPEVYCYLIAMTAVVKELNEFAKALRKLKSY 710
           SGYLAWKMMVDGDYRGAVKMVLHLRESGL+PEVY YLIAMTAVVKELNEFAKALRKLK Y
Sbjct: 245 SGYLAWKMMVDGDYRGAVKMVLHLRESGLRPEVYSYLIAMTAVVKELNEFAKALRKLKGY 304

Query: 711 ARDGIVAELDKNNVELVEKYQTELLADGVRLSNWVLEEGSFSIHGVVHERLLAMYICAGQ 770
           ARDG VAELDKNNVELV KYQTELLADGV+LSNWVLEEGS SI GVVHERLLAMYICAGQ
Sbjct: 305 ARDGFVAELDKNNVELVAKYQTELLADGVQLSNWVLEEGSSSIRGVVHERLLAMYICAGQ 364

Query: 771 GLEAERQLWEMKLVGKEADADLYDIVLAICASQKETKAMKRLLTRIEITSPMRKKKSLTW 830
           G+EAERQLWEMKLVGKEADADLYDIVLAICASQKETKAMKRLLTRIEITSPM KKKSLTW
Sbjct: 365 GVEAERQLWEMKLVGKEADADLYDIVLAICASQKETKAMKRLLTRIEITSPMIKKKSLTW 424

Query: 831 LLRGYIKGGHFRDAAETLVKMVDLGFLPEYLDRVAVLQGLRKQIREPENVDTYLDLLKCL 890
           LLRGYIKGGHFRDAA TLVKM++LGFLPEYLDRVAVLQGLRK+IREPE+V TYLDL KCL
Sbjct: 425 LLRGYIKGGHFRDAAGTLVKMINLGFLPEYLDRVAVLQGLRKEIREPESVHTYLDLCKCL 484

Query: 891 SDANLIGPSLVYLHLQKHKLWVIKML 917
           SDANLIGPSLVYLHLQKHKLW+IKML
Sbjct: 485 SDANLIGPSLVYLHLQKHKLWIIKML 501

BLAST of HG10001548 vs. ExPASy TrEMBL
Match: A0A1S3CNE0 (pentatricopeptide repeat-containing protein At2g30100, chloroplastic OS=Cucumis melo OX=3656 GN=LOC103502924 PE=4 SV=1)

HSP 1 Score: 921.8 bits (2381), Expect = 2.3e-264
Identity = 465/506 (91.90%), Postives = 480/506 (94.86%), Query Frame = 0

Query: 411 EGFTPLTQFGFSFSLSSALKTEKHGFSTPQLHSPSPVKFCFMVSRISCNYQDSTFSVSRA 470
           +GFTPLTQFGFSFSLSS L+T+++GFSTP+L         +MVS ISCNYQDSTFSVSRA
Sbjct: 5   QGFTPLTQFGFSFSLSSPLETQRYGFSTPRL---------YMVSPISCNYQDSTFSVSRA 64

Query: 471 SKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEELERMTREPSDVLEEMNDRLSAREF 530
           +KFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEELERMTREPSDVLEEMNDRLSARE 
Sbjct: 65  AKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEELERMTREPSDVLEEMNDRLSAREI 124

Query: 531 QLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGRHNVSDVV 590
           QLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWI KLVEGRHNV DVV
Sbjct: 125 QLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWINKLVEGRHNVGDVV 184

Query: 591 DLLVDMDCVGLKPHFSMIEKVISLYWEMGEKEKAISFVKEVLGRNLAFMKDDWEGHKGGP 650
           DLLVDMDCVGLKPHFSMIEKVISLYWEMGEKEKAI FVKEVLGRNLAFMKDDWEGHKGGP
Sbjct: 185 DLLVDMDCVGLKPHFSMIEKVISLYWEMGEKEKAIFFVKEVLGRNLAFMKDDWEGHKGGP 244

Query: 651 SGYLAWKMMVDGDYRGAVKMVLHLRESGLKPEVYCYLIAMTAVVKELNEFAKALRKLKSY 710
           SGYLAWKMMVDGDYRGAVKMVLHLRESGL+PEVY YLIAMTAVVKELNEFAKALRKLKSY
Sbjct: 245 SGYLAWKMMVDGDYRGAVKMVLHLRESGLRPEVYSYLIAMTAVVKELNEFAKALRKLKSY 304

Query: 711 ARDGIVAELDKNNVELVEKYQTELLADGVRLSNWVLEEGSFSIHGVVHERLLAMYICAGQ 770
           ARDG VAELDKNNVELV KYQTELLADGVRLSNWVLEEGS SIHGVVHERLLAMYICAGQ
Sbjct: 305 ARDGYVAELDKNNVELVAKYQTELLADGVRLSNWVLEEGSSSIHGVVHERLLAMYICAGQ 364

Query: 771 GLEAERQLWEMKLVGKEADADLYDIVLAICASQKETKAMKRLLTRIEITSPMRKKKSLTW 830
           G+EAERQLWEMKL+GKEADADLYDIVLAICASQKE KAMKRLLTRIEITSPM KKKSLTW
Sbjct: 365 GVEAERQLWEMKLLGKEADADLYDIVLAICASQKEIKAMKRLLTRIEITSPMIKKKSLTW 424

Query: 831 LLRGYIKGGHFRDAAETLVKMVDLGFLPEYLDRVAVLQGLRKQIREPENVDTYLDLLKCL 890
           LLRGYIKGGHFRDAA T+VKM++LGFLPEYLDRVAVLQGLRK IREPE V TYLDL KCL
Sbjct: 425 LLRGYIKGGHFRDAAGTVVKMINLGFLPEYLDRVAVLQGLRKGIREPEIVHTYLDLCKCL 484

Query: 891 SDANLIGPSLVYLHLQKHKLWVIKML 917
           SDANLIGPSLVYLHLQKHKLW+IKML
Sbjct: 485 SDANLIGPSLVYLHLQKHKLWIIKML 501

BLAST of HG10001548 vs. ExPASy TrEMBL
Match: A0A6J1D3T2 (pentatricopeptide repeat-containing protein At2g30100, chloroplastic OS=Momordica charantia OX=3673 GN=LOC111017036 PE=4 SV=1)

HSP 1 Score: 904.8 bits (2337), Expect = 2.9e-259
Identity = 455/507 (89.74%), Postives = 478/507 (94.28%), Query Frame = 0

Query: 411 EGFTPLTQFGFSFSLSSALKTEKHG-FSTPQLHSPSPVKFCFMVSRISCNYQDSTFSVSR 470
           +GFTP+TQFGFSFSLSSALKT++   FSTPQL+  SPV FCFM+S I+CN+++STFSV +
Sbjct: 5   QGFTPVTQFGFSFSLSSALKTQRQRFFSTPQLYCHSPVNFCFMISCITCNHRNSTFSVLK 64

Query: 471 ASKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEELERMTREPSDVLEEMNDRLSARE 530
           A KFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEELERMTREPSDVLEEMNDRLSARE
Sbjct: 65  AGKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEELERMTREPSDVLEEMNDRLSARE 124

Query: 531 FQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGRHNVSDV 590
           FQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEG HNV DV
Sbjct: 125 FQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGDHNVGDV 184

Query: 591 VDLLVDMDCVGLKPHFSMIEKVISLYWEMGEKEKAISFVKEVLGRNLAFMKDDWEGHKGG 650
           VDLLVDMDCVGLKPHFSMIEKVISLYWEMGEKE+AISFVKEVLGR +AFMKDD EGHKGG
Sbjct: 185 VDLLVDMDCVGLKPHFSMIEKVISLYWEMGEKEQAISFVKEVLGRKIAFMKDDREGHKGG 244

Query: 651 PSGYLAWKMMVDGDYRGAVKMVLHLRESGLKPEVYCYLIAMTAVVKELNEFAKALRKLKS 710
           PSGYLAWKMMVDGDYRGAVK+VLHLRESGL PEVY YLIAMTAVVKELNEFAKALRKLKS
Sbjct: 245 PSGYLAWKMMVDGDYRGAVKLVLHLRESGLNPEVYSYLIAMTAVVKELNEFAKALRKLKS 304

Query: 711 YARDGIVAELDKNNVELVEKYQTELLADGVRLSNWVLEEGSFSIHGVVHERLLAMYICAG 770
           Y RDGIVAELDK+NV LVE YQTELLADGVRLSNWVLEEGS SIHGV HERLLAMYICAG
Sbjct: 305 YTRDGIVAELDKDNVGLVEIYQTELLADGVRLSNWVLEEGSSSIHGVAHERLLAMYICAG 364

Query: 771 QGLEAERQLWEMKLVGKEADADLYDIVLAICASQKETKAMKRLLTRIEITSPMRKKKSLT 830
           +GLEAERQLWEMKLVGKEAD+DLYDIVLAICASQKET+AM RLLTRIEI SP+ KKKSL+
Sbjct: 365 RGLEAERQLWEMKLVGKEADSDLYDIVLAICASQKETRAMNRLLTRIEIISPLLKKKSLS 424

Query: 831 WLLRGYIKGGHFRDAAETLVKMVDLGFLPEYLDRVAVLQGLRKQIREPENVDTYLDLLKC 890
           WLLRGYIKGGHF DAAETLVKMVDLGFLPEYLDRVAVLQGLRK+IREP +V+TY  L KC
Sbjct: 425 WLLRGYIKGGHFSDAAETLVKMVDLGFLPEYLDRVAVLQGLRKRIREPGSVETYFKLCKC 484

Query: 891 LSDANLIGPSLVYLHLQKHKLWVIKML 917
           LSDANLIGP LVYLHLQKHKLWVIKML
Sbjct: 485 LSDANLIGPGLVYLHLQKHKLWVIKML 511

BLAST of HG10001548 vs. TAIR 10
Match: AT2G30100.1 (pentatricopeptide (PPR) repeat-containing protein )

HSP 1 Score: 603.6 bits (1555), Expect = 2.6e-172
Identity = 308/489 (62.99%), Postives = 380/489 (77.71%), Query Frame = 0

Query: 439 PQLHSPSPVKFCFMVSRISCNYQDSTFSVSRASKFRDLRLFKSVELDQFITSDDE----D 498
           P+LH    VK     SRI CN + +      A KFR++ L +SVELDQFITS++E    +
Sbjct: 22  PRLHRNHSVK---PNSRIICNLKLN----YSAGKFREMGLSRSVELDQFITSEEEEGEAE 81

Query: 499 EMGDGFFEAIEELERMTREPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWL 558
           E+G+GFFEAIEELERMTREPSD+LEEMN RLS+RE QL+LVYF+QEGRDSWC LEVFEWL
Sbjct: 82  EIGEGFFEAIEELERMTREPSDILEEMNHRLSSRELQLMLVYFAQEGRDSWCTLEVFEWL 141

Query: 559 QKENRVDKETMELMVSIMCSWIKKLVEGRHNVSDVVDLLVDMDCVGLKPHFSMIEKVISL 618
           +KENRVD+E MELMVSIMC W+KKL+E   N   V DLL++MDCVGLKP FSM++KVI+L
Sbjct: 142 KKENRVDEEIMELMVSIMCGWVKKLIEDECNAHQVFDLLIEMDCVGLKPGFSMMDKVIAL 201

Query: 619 YWEMGEKEKAISFVKEVLGRNLAFMKD-----DWEGHKGGPSGYLAWKMMVDGDYRGAVK 678
           Y EMG+KE A+ FVKEVL R   F          EG KGGP GYLAWK MVDGDYR AV 
Sbjct: 202 YCEMGKKESAVLFVKEVLRRRDGFGYSVVGGGGSEGRKGGPVGYLAWKFMVDGDYRKAVD 261

Query: 679 MVLHLRESGLKPEVYCYLIAMTAVVKELNEFAKALRKLKSYARDGIVAELDKNNVELVEK 738
           MV+ LR SGLKPE Y YLIAMTA+VKELN   K LR+LK +AR G VAE+D ++  L+EK
Sbjct: 262 MVMELRLSGLKPEAYSYLIAMTAIVKELNSLGKTLRELKRFARAGFVAEIDDHDRVLIEK 321

Query: 739 YQTELLADGVRLSNWVLEEG--SFSIHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKE 798
           YQ+E L+ G++L+ W +EEG  + SI GVVHERLLAMYICAG+G EAE+QLW+MKL G+E
Sbjct: 322 YQSETLSRGLQLATWAVEEGQENDSIIGVVHERLLAMYICAGRGPEAEKQLWKMKLAGRE 381

Query: 799 ADADLYDIVLAICASQKETKAMKRLLTRIEITSPMRKKKSLTWLLRGYIKGGHFRDAAET 858
            +ADL+DIV+AICASQKE  A+ RLLTR+E     RKKK+L+WLLRGY+KGGHF +AAET
Sbjct: 382 PEADLHDIVMAICASQKEVNAVSRLLTRVEFMGSQRKKKTLSWLLRGYVKGGHFEEAAET 441

Query: 859 LVKMVDLGFLPEYLDRVAVLQGLRKQIREPENVDTYLDLLKCLSDANLIGPSLVYLHLQK 917
           LV M+D G  PEY+DRVAV+QG+ ++I+ P +V+ Y+ L K L DA L+GP LVY+++ K
Sbjct: 442 LVSMIDSGLHPEYIDRVAVMQGMTRKIQRPRDVEAYMSLCKRLFDAGLVGPCLVYMYIDK 501

BLAST of HG10001548 vs. TAIR 10
Match: AT2G30090.1 (Acyl-CoA N-acyltransferases (NAT) superfamily protein )

HSP 1 Score: 355.9 bits (912), Expect = 9.6e-98
Identity = 193/407 (47.42%), Postives = 258/407 (63.39%), Query Frame = 0

Query: 7   IIRSYEESRLSDKVQVMDLERRCEIGQSKRVFLFTDTLDDPICRIRNSPMYKMLVAEWNK 66
           +IR Y++ R  D++Q+  +E+ CEIG   +  LFTDTL DPICRIRNSP + MLVA    
Sbjct: 14  VIRCYDDRR--DRIQMGRMEKSCEIGHDHQTLLFTDTLGDPICRIRNSPFFIMLVAGVGN 73

Query: 67  EVVGVIQGSIKAVFFTAHKPPTGLVVKMGYILGLRVAPRYRRRGIGSGLVRRLEDWFVSN 126
           ++VG IQGS+K V F          V++GY+LGLRV P YRRRGIGS LVR+LE+WF S+
Sbjct: 74  KLVGSIQGSVKPVEFHDKS------VRVGYVLGLRVVPSYRRRGIGSILVRKLEEWFESH 133

Query: 127 DVDYCCMATEKDNHASLNLFINNLRYIKFRTGRILVNPVRNHPYNINSSEINIQKLKIEE 186
           + DY  MATEKDN AS  LFI  L Y+ FR   ILVNPV         S+I I+KLK++E
Sbjct: 134 NADYAYMATEKDNEASHGLFIGRLGYVVFRNPAILVNPVNPGRGLKLPSDIGIRKLKVKE 193

Query: 187 AEAIYKKHM-ASTEFFPKDIKNILKNKLSLGTWVANFKQRRLSSTVAGGNEQITASSWAI 246
           AE++Y++++ A+TEFFP DI  IL+NKLS+GTWVA +            N      SWA+
Sbjct: 194 AESLYRRNVAATTEFFPDDINKILRNKLSIGTWVAYY------------NNVDNTRSWAM 253

Query: 247 VSLWNSGEVFKLRLGKAPFPWLIYTKSLKMMDKILPCFKVILVPNYFKPFGFYFVYGLHH 306
           +S+W+S +VFKLR+ +AP  +L+ TK  K+    L    + ++P+ F PFGFYF+YG+H 
Sbjct: 254 LSVWDSSKVFKLRIERAPLSYLLLTKVSKLFGNFLSLLGLTVLPDLFTPFGFYFLYGVHS 313

Query: 307 EGPFSERLVGALCKFVHNMALNNSKDHNCKAIVTEI-SGDEDDDELKMKIPHWKLLSCYE 366
           EGP   +LV ALC+ VHNMA  N     CK +V E+  G   DD L+  IPHWK+LSC +
Sbjct: 314 EGPHCGKLVRALCEHVHNMAALND-GCACKVVVVEVDKGSNGDDSLQRCIPHWKMLSCDD 373

Query: 367 DFWCIKSLKSKRNNNNIIISNDNDNEHHILEWKNTPPIRTLFVDPRE 412
           D WCIK LK ++N  ++               + +    +LFVDPRE
Sbjct: 374 DMWCIKPLKCEKNKFDLS--------------ERSKSRSSLFVDPRE 385

BLAST of HG10001548 vs. TAIR 10
Match: AT2G23060.1 (Acyl-CoA N-acyltransferases (NAT) superfamily protein )

HSP 1 Score: 319.7 bits (818), Expect = 7.6e-87
Identity = 180/432 (41.67%), Postives = 256/432 (59.26%), Query Frame = 0

Query: 8   IRSYEESRLSDKVQVMDLERRCEIGQSKRVFLFTDTLDDPICRIRNSPMYKMLVAE---- 67
           +R Y+ S+  D   V D+ERRCE+G + ++ LFTD L DPICR+R+SP Y MLVAE    
Sbjct: 7   VREYDPSK--DLATVEDVERRCEVGPAGKLSLFTDLLGDPICRVRHSPSYLMLVAEIGPK 66

Query: 68  WNKEVVGVIQGSIKAVF---------FTAHKPPTGLVV------KMGYILGLRVAPRYRR 127
             KE+VG+I+G IK V           T +K    +V+      K+ YILGLRV+P +RR
Sbjct: 67  EKKELVGMIRGCIKTVTCGITTKRLDLTHNKSQNDVVITKPLYTKLAYILGLRVSPTHRR 126

Query: 128 RGIGSGLVRRLEDWFVSNDVDYCCMATEKDNHASLNLFINNLRYIKFRTGRILVNPVRNH 187
           +GIG  LV+ +EDWF  N  +Y   ATE DNHAS+NLF     Y +FRT  ILVNPV  H
Sbjct: 127 QGIGFKLVKAMEDWFSQNGAEYSYFATENDNHASVNLFTGKCGYAEFRTPSILVNPVYAH 186

Query: 188 PYNINSSEINIQKLKIEEAEAIYKKHMASTEFFPKDIKNILKNKLSLGTWVANFKQRRLS 247
             NI S  + + KL+  +AE +Y+   ++TEFFP+DI ++L NKLSLGT+VA  +     
Sbjct: 187 RVNI-SRRVTVIKLEPSDAELLYRLRFSTTEFFPRDIDSVLNNKLSLGTFVAVPR----G 246

Query: 248 STVAGGNEQITAS---------SWAIVSLWNSGEVFKLRLGKAPFPWLIYTKSLKMMDKI 307
           S    G+     S         SWA++S+WN  + F+L +  A     + +K+ +M+DK 
Sbjct: 247 SCYGSGSRSWPGSAKFLEYPPDSWAVLSVWNCKDSFRLEVRGASRLRRVVSKATRMVDKT 306

Query: 308 LPCFKVILVPNYFKPFGFYFVYGLHHEGPFSERLVGALCKFVHNMALNNSKDHNCKAIVT 367
           LP  K+  +P  F+PFG +F+YG+  EGP +E++V ALC   HN+A    K+  C  +  
Sbjct: 307 LPFLKIPSIPAVFRPFGLHFMYGIGGEGPRAEKMVKALCDHAHNLA----KEGGCGVVAA 366

Query: 368 EISGDEDDDELKMKIPHWKLLSCYEDFWCIKSLKSKRNNNNIIISNDNDNEHHILEWKNT 412
           E++G+E    L+  IPHWK+LSC ED WCIK L             ++ ++  + +W  +
Sbjct: 367 EVAGEE---PLRRGIPHWKVLSCAEDLWCIKRL------------GEDYSDGSVGDWTKS 412

BLAST of HG10001548 vs. TAIR 10
Match: AT4G37580.1 (Acyl-CoA N-acyltransferases (NAT) superfamily protein )

HSP 1 Score: 310.5 bits (794), Expect = 4.6e-84
Identity = 177/425 (41.65%), Postives = 253/425 (59.53%), Query Frame = 0

Query: 7   IIRSYEESRLSDKVQVMDLERRCEIGQSKRVFLFTDTLDDPICRIRNSPMYKMLVAEW-- 66
           ++R Y+ +R  D V V D+ERRCE+G S ++ LFTD L DPICRIR+SP Y MLVAE   
Sbjct: 3   VVREYDPTR--DLVGVEDVERRCEVGPSGKLSLFTDLLGDPICRIRHSPSYLMLVAEMGT 62

Query: 67  -NKEVVGVIQGSIKAV-----FFTAHKPPTGLV----VKMGYILGLRVAPRYRRRGIGSG 126
             KE+VG+I+G IK V         HK    +V     K+ Y+LGLRV+P +RR+GIG  
Sbjct: 63  EKKEIVGMIRGCIKTVTCGQKLDLNHKSQNDVVKPLYTKLAYVLGLRVSPFHRRQGIGFK 122

Query: 127 LVRRLEDWFVSNDVDYCCMATEKDNHASLNLFINNLRYIKFRTGRILVNPVRNHPYNINS 186
           LV+ +E+WF  N  +Y  +ATE DN AS+NLF     Y +FRT  ILVNPV  H  N+ S
Sbjct: 123 LVKMMEEWFRQNGAEYSYIATENDNQASVNLFTGKCGYSEFRTPSILVNPVYAHRVNV-S 182

Query: 187 SEINIQKLKIEEAEAIYKKHMASTEFFPKDIKNILKNKLSLGTWVANFKQRRLSSTVAGG 246
             + + KL+  +AE +Y+   ++TEFFP+DI ++L NKLSLGT+VA     R S   +G 
Sbjct: 183 RRVTVIKLEPVDAETLYRIRFSTTEFFPRDIDSVLNNKLSLGTFVA---VPRGSCYGSGS 242

Query: 247 NE--------QITASSWAIVSLWNSGEVFKLRLGKAPFPWLIYTKSLKMMDKILPCFKVI 306
                     +    SWA++S+WN  + F L +  A     +  K+ +++DK LP  K+ 
Sbjct: 243 GSWPGSAKFLEYPPESWAVLSVWNCKDSFLLEVRGASRLRRVVAKTTRVVDKTLPFLKLP 302

Query: 307 LVPNYFKPFGFYFVYGLHHEGPFSERLVGALCKFVHNMALNNSKDHNCKAIVTEISGDED 366
            +P+ F+PFG +F+YG+  EGP + ++V +LC   HN+A    K   C  +  E++G   
Sbjct: 303 SIPSVFEPFGLHFMYGIGGEGPRAVKMVKSLCAHAHNLA----KAGGCGVVAAEVAG--- 362

Query: 367 DDELKMKIPHWKLLSCYEDFWCIKSLKSKRNNNNIIISNDNDNEHHILEWKNTPPIRTLF 412
           +D L+  IPHWK+LSC ED WCIK L             D+ ++  + +W  +PP  ++F
Sbjct: 363 EDPLRRGIPHWKVLSCDEDLWCIKRL------------GDDYSDGVVGDWTKSPPGVSIF 402

BLAST of HG10001548 vs. TAIR 10
Match: AT5G67430.1 (Acyl-CoA N-acyltransferases (NAT) superfamily protein )

HSP 1 Score: 280.8 bits (717), Expect = 3.9e-75
Identity = 166/416 (39.90%), Postives = 233/416 (56.01%), Query Frame = 0

Query: 3   FNGFIIRSYEESRLSDKVQVMDLERRCEIGQSKRVFLFTDTLDDPICRIRNSPMYKMLVA 62
           FN  ++R Y+  R  D   V +LE  CE+G      L  D + DP+ RIR SP + MLVA
Sbjct: 5   FNVVVVREYDPKR--DLTSVEELEESCEVGS-----LLVDLMGDPLARIRQSPSFHMLVA 64

Query: 63  EWNKEVVGVIQGSIKAVFFTAHK-------PPTGLVVKMGYILGLRVAPRYRRRGIGSGL 122
           E   E+VG+I+G+IK V    +         P     K+ ++ GLRV+P YRR GIG  L
Sbjct: 65  EIGNEIVGMIRGTIKMVTRGVNALRQADDVSPEINTTKLAFVSGLRVSPFYRRMGIGLKL 124

Query: 123 VRRLEDWFVSNDVDYCCMATEKDNHASLNLFINNLRYIKFRTGRILVNPVRNHPYNINSS 182
           V+RLE+WF+ ND  Y  + TE DN AS+ LF     Y KFRT   LVNPV NH   + S 
Sbjct: 125 VQRLEEWFLRNDAVYSYVQTENDNIASVKLFTEKSGYSKFRTPTFLVNPVFNHRVTV-SR 184

Query: 183 EINIQKLKIEEAEAIYKKHMASTEFFPKDIKNILKNKLSLGTWVANFKQRRLSSTVAGGN 242
            + I KL   +AE++Y+   ++TEFFP DI +IL NKLSLGT++A     R    V+G  
Sbjct: 185 RVKIIKLAPSDAESLYRNRFSTTEFFPSDINSILTNKLSLGTYLA---VPRGGDNVSGSL 244

Query: 243 EQITASSWAIVSLWNSGEVFKLRLGKAPFPWLIYTKSLKMMDKILPCFKVILVPNYFKPF 302
              T  SWA++S+WNS +V++L++  A     +  KS ++ D   P  K+   PN FK F
Sbjct: 245 PDQT-GSWAVISIWNSKDVYRLQVKGASRLKRMLAKSTRVFDGAFPFLKIPSFPNLFKSF 304

Query: 303 GFYFVYGLHHEGPFSERLVGALCKFVHNMALNNSKDHNCKAIVTEISGDEDDDELKMKIP 362
             +F+YG+  EGP +  +V ALC   HN+A    +   C  +  E++  E    L++ IP
Sbjct: 305 AMHFMYGIGGEGPRAAEMVEALCSHAHNLA----RKSGCAVVAAEVASCE---PLRVGIP 364

Query: 363 HWKLLSCYEDFWCIKSLKSKRNNNNIIISNDNDNEHHILEWKNTPPIRTLFVDPRE 412
           HWK+LS  ED WC+K L+            D+D     ++W  +PP  ++FVDPRE
Sbjct: 365 HWKVLS-PEDLWCLKRLR-----------YDDDG----VDWTKSPPGLSIFVDPRE 385

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038901728.17.5e-28697.63pentatricopeptide repeat-containing protein At2g30100, chloroplastic [Benincasa ... [more]
XP_022944005.13.2e-26892.29pentatricopeptide repeat-containing protein At2g30100, chloroplastic [Cucurbita ... [more]
KAG7010495.17.0e-26892.09Pentatricopeptide repeat-containing protein, chloroplastic [Cucurbita argyrosper... [more]
XP_023512972.12.0e-26792.09pentatricopeptide repeat-containing protein At2g30100, chloroplastic [Cucurbita ... [more]
KAG6570645.12.7e-26791.90Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita a... [more]
Match NameE-valueIdentityDescription
Q0WNN73.7e-17162.99Pentatricopeptide repeat-containing protein At2g30100, chloroplastic OS=Arabidop... [more]
O648151.1e-8541.67Probable N-acetyltransferase HLS1-like OS=Arabidopsis thaliana OX=3702 GN=At2g23... [more]
Q423816.5e-8341.65Probable N-acetyltransferase HLS1 OS=Arabidopsis thaliana OX=3702 GN=HLS1 PE=1 S... [more]
Q0WVV02.2e-0621.40Pentatricopeptide repeat-containing protein At1g10910, chloroplastic OS=Arabidop... [more]
Match NameE-valueIdentityDescription
A0A6J1FYE91.5e-26892.29pentatricopeptide repeat-containing protein At2g30100, chloroplastic OS=Cucurbit... [more]
A0A6J1JH851.9e-26691.68pentatricopeptide repeat-containing protein At2g30100, chloroplastic OS=Cucurbit... [more]
A0A0A0KC352.3e-26491.70Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G182120 PE=4 SV=1[more]
A0A1S3CNE02.3e-26491.90pentatricopeptide repeat-containing protein At2g30100, chloroplastic OS=Cucumis ... [more]
A0A6J1D3T22.9e-25989.74pentatricopeptide repeat-containing protein At2g30100, chloroplastic OS=Momordic... [more]
Match NameE-valueIdentityDescription
AT2G30100.12.6e-17262.99pentatricopeptide (PPR) repeat-containing protein [more]
AT2G30090.19.6e-9847.42Acyl-CoA N-acyltransferases (NAT) superfamily protein [more]
AT2G23060.17.6e-8741.67Acyl-CoA N-acyltransferases (NAT) superfamily protein [more]
AT4G37580.14.6e-8441.65Acyl-CoA N-acyltransferases (NAT) superfamily protein [more]
AT5G67430.13.9e-7539.90Acyl-CoA N-acyltransferases (NAT) superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 660..896
e-value: 8.9E-11
score: 43.7
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 500..638
e-value: 3.0E-6
score: 28.9
NoneNo IPR availableGENE3D3.40.630.30coord: 4..149
e-value: 9.9E-16
score: 60.0
NoneNo IPR availablePANTHERPTHR47880OS05G0353300 PROTEINcoord: 423..916
NoneNo IPR availableCDDcd04301NAT_SFcoord: 59..131
e-value: 1.96548E-7
score: 46.8853
IPR000182GNAT domainPFAMPF00583Acetyltransf_1coord: 51..146
e-value: 7.1E-13
score: 48.8
IPR000182GNAT domainPROSITEPS51186GNATcoord: 6..197
score: 14.513806
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 824..858
score: 8.769097
IPR016181Acyl-CoA N-acyltransferaseSUPERFAMILY55729Acyl-CoA N-acyltransferases (Nat)coord: 16..146

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
HG10001548.1HG10001548.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0008080 N-acetyltransferase activity
molecular_function GO:0005515 protein binding