Bhi01G000722 (gene) Wax gourd (B227) v1

Overview
Name: Bhi01G000722
Type: gene
Organism: Benincasa hispida (Wax gourd (B227) v1)
Description: Pentatricopeptide repeat-containing protein
Location: chr1: 18533676 .. 18540097 (-)
RNA-Seq Expression: Bhi01G000722
Synteny: Bhi01G000722
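The Location field can be read programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention for genome browsers, but not stated on this page); the variable names are illustrative only.

```python
import re

# Location string as shown in the Overview (chromosome, start, end, strand).
location = "chr1: 18533676 .. 18540097 (-)"

# Pattern for "<chrom>: <start> .. <end> (<strand>)".
match = re.fullmatch(r"(\w+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)", location)
chrom = match.group(1)
start = int(match.group(2))
end = int(match.group(3))
strand = match.group(4)

# Assumption: 1-based inclusive coordinates, so both endpoints are counted.
span_length = end - start + 1

print(chrom, start, end, strand, span_length)
```

Under that assumption the gene spans 6422 bp on the minus strand of chr1.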
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

CTCTCCCTCTGCGCCTTCATTTGACATTTCTCTTCCATGGTTGAGCTTCTCGTATTTGGGCTGATTCGATCACTATAAATCCTCAAGGCTACACTCTCTTATACCGTTCGTTTGACTATTCAACTTGCGCGGTTCATCTGCCGCACTTGTTCATCGGTAATATTCACTTTCACTATCACATTTGGAGTCTGAAATTTCGAAATTTGTGAAACGAAGAACTTTCTTAGGCTTTTAATGATTAACCCAGACGATTTTTTATGGTGGGCAATTTGAATTTTATATGTTCTTGCCATATTCAGTTTTGGGTTTATATGTTATGGAAGTATGTTGTCTGAATGTAGGATTGGAGTCAGCTGCCTGAAGAGGTTCTTCGTTGTTTTCATGAATTCAGCTTTTCTGTGTAAGAAATTTATCTTTCTTACTTCTGAATTCAGTTTGTCTTGATATGAAACATTTCTTTGGTTCTTTTTTTGTCATCAGTACTTGCATATGAGTAAGGGTGATCATGTACTTGTCATTCTACATTCCTTTGTTTAATGTGATCATAATTTGCTGTTATTTCATTTATCTATTTCGTCAGTATCCATTATCTTGTAAGGGGCTTCTAACGCTGCTTTGGAATTTCAGGTGAATAGCTGAGAGTTTGAGACATTGTCAACTATTGACATGCGTACATGAAGATGCTTCTAAGGAACATAGGTAAATCAGAAGCTCTTGATTTTTTTAAAATACAAATGAAAAGAACTAAACTAATTGGAATTGTTCACGGTAAAAATCTATTTGAGAACCACAATCTGGTTCCCCCACCCAGTTATCAATTTGCATCTAGCTCATAAACTTATCAGATTGTGACCAAAGGAGTAAGTTTATTGCTCCTGAAGGTATGGGGTTCATATTGACCTCTGTTTTAGAAGTACTTCACGACTAGAGGAGGGATAACCAGTGCCTTATGAGTCATATTTGCGAGATAACCATTTTAGTGATTGACTATTGATTGACTGATCAGAAAATTGCGTCCCTTGCCTACTACAAGAATTGTTCTTTCAAACCGTTGGGCTTACTTGATGAGTTAAAGGCTTTAATTAATATTTCCCTCTTGGTCGTTGCTATCTCAAAGGGAAGCTTTGGGTTATCTGGATTGAGTTCATGGGCCCATATGGGCCAGGGTTGGCTGTTGATTGATAACTTCTGCATTTTGCCTGAAAGATTAATAGGAACAGATGTTTACTTTTTGGGTTGTAAAGCTTTAATTTTTATCTCCTTAGGTGGTTGCTGTTTCAAGGAAAGCATTTGGCCATTTGGATTTCCTGGTCCATGCCTGTAATTTTGAGTATTTGGTTAGAAGGAAATAGGCATATGTTTGAAAGATGGAATTGTGAAATTCTGTGTTGTAATAAAAAGAATTTTGTTCTTTAGTTTCCTCAAAGAATTAGCAATACTTGATCATAGAGGCCAAGAATTCAGGAACGAACATGAAACATTGGTGTTGAAAGAGGAAGCCACATCAGACTGTAATTTGTTTAACGTACATGGATTAGACTATTCAAAGGGAAACTCTATGCCTTGAGGTCATAAGTTCAAGTATTAACAATTACCTGCCTGATATGTTAATTTCTTTTAAGAAGAAACACTTGCTTTTTATCCAGCCTTAACAGTTGGCATACTAATGAAAGATTTGTGGTTCTGCTCCTCTTTTCGGTTTCCAATTATCATGGGTATACCTGTAAAGTATTAAGTTCCTTGTTGGGTTCTTGTTTTTGCATTGAATTTGGGTTGAATTTCCAAGTCAACTATTCTTTTTAATTCATTTCTTTGAATGAAAGAACTTTATTTCATTCATAACATTGATTACAGATTATGTTTATTTCAGGTGCAGGACAAATCAATTGTCTTGATTTGAAGTACAGAAACCCTATTAAATTTTCTTTTAAATTTTTTTCCTCGTATGCTGGGGATTCTTCTCAAACAACAAATAGAAATGGGGCCCCTGTTTCTGGTGG
GGGTGGTCTGGTGCCGGCAACAAAGTATGAGGACAAGAGACAAGTTTTAGATGGTGTGTGCCAAATTTTGGAGACTGGTCCTTGGGGATCTTTGGTTGAGAATAAGTTAGCGGAGCTTGACGCAAAACCAAATACAGAATTGGTAATTGGAGTCTTAAGGAGGCTGAAGGACGTAAACAATGCAGTAAATTACTTTCGATGGGCTGAGAGAGTAACAGACCTAGCACATTGCCCTGAAGCATACAATTCACTTCTCATGGTTATGGCTAGAACTAGAAAGTTTAATTGCTTGGAACAAATATTGGAAGAAATGAGTATTGCAGGTTTTGGCCCGTCAAATAACACATGTATTGAAATTGTACTAAGCCTTGTCAAATCTCGCAAGCTTAGAGAAGCTTTTACATTTATGCAAACTATGAGAAAGTTAAAATTCCGCCCAGCCTTTTCAGCATACACAACTTTGATTGGTGCACTATCTGCATCTCATGATTCTGATTGCATGCTCACCTTATTTCAGCAAATGCAGGAGCTTGGCTATGAAGTTAATGTTCATTTATTCACTACTCTCATTCGTGTATTTGCTAGAGAGGGTCGAGTTGATGCTGCACTCTCTCTTTTGGATGAGATGAAGAGCAATTCTTTAGAACCAGATGTTGTTCTTTATAATGTCTGTATAGATTGCTTTGGGAAGGCTGGGAAGGTGGATATGGCTTGGAAATTTTTTCATGAAATGAAAGCTAATGGTTTGGTTCTTGATGATGTAACTCATACTAGCATGATAGGAGTTCTCTGTAAAGCTGACAGGATGAATGAAGCAGTTGAGCTATTTGAACATATGGATCAAAACAAGCAAGTGCCTTGTGCATATGCATATAATACTATGATCATGGGTTATGGTATGGCTGGAAAGTTTGATGAGGCATACAGCTTACTTGAGAGACAGAGAAGAAAAGGATGCATTCCAAGTGTCGTCGCATATAATTGCATTCTTACTTGTCTTGGGAGGAAGGGGCGGGTAGACGAGGCATTAAAACTTTTTGAAGAGATGAAGAAAGATGCCATTCCCAATCTTTCAACATATAATATTATGATTGACATGCTTTGTAAGGCTGGACAACTCGAGACAGCGTTGGTTGTCCGGGATGCCATGAAAGATGCTGGGTTGTTTCCTAATGTTATTACAGTAAACATAATGGTTGACAGATTGTGTAAAGCCCAAAGACTTGATGATGCTTGTTCTATTTTCGAAGGGTTGGATCATAAAACTTGCACACCTGATACAGTAACATATTGTTCTCTTATAGAAGGATTGGGTAAGCATGGGAGAGTAGATGATGCCTACAAGCTATATGAACAGATGTTGGATTCTGACCAGATCCCAAATGCTGTTGTGTATACATCTCTCATAAGGAACTTTTTCAAGTGTGGAAGGAAGGAGGATGGCCACAAGATATATAACGAAATGATACGTCTAGGTTGTTCTCCTGACCTGCTGCTTCTTAATACCTACATGGATTGCGTTTTTAAAGCTGGAGAAATTGAGAAGGGCCGGGCCTTGTTTCAGGAGATTAAGGCCCTAGGATTTATTCCAGATGTGAGGAGTTATACAATCCTAATTCATGGCTTGGTGAAAGCAGGTTTTGCGCATGAATCTTATGAGTTGTTCTACACAATGAAGGAACAAGGTTGTGTTCTGGATACTCGTGCATATAACACCGTTATCAATGGATTCTGCAAGTCTGGCAAGGTAAATAAAGCTTATCAATTGCTAGAGGAGATGAAGACAAAGGGTCATGAACCCACTGTTGTTACTTATGGTTCTGTTATCGATGGGCTTGCAAAGATTGACCGGCTTGATGAGGCATATATGCTCTTTGAAGAAGCAAAGTCGAAAGGAGTAGAACTAAATGTTGTTATATATAGCAGTCTAATCGATGGATTTGGGAAAGTGGGTAGAATCGATGAAGCATACTTGATCATGGAAGAGTTGATGCAAAAAG
GTTTGACACCTAGTGTATACACATGGAATTGCTTGCTTGATGCATTGGTGAAAGCAGAAGAAATTAGCGAAGCCCTTGTTTGCTTTCAGTCTATGAAAGACTTGAAATGTACTCCTAATTATATAACTTATAGCATTCTAATTCATGGTCTTTGTAAGATTAGAAAATTCAATAAGGCATTCGTGTTCTGGCAAGAGATGCAGAAGCAAGGCTTAAAGCCTAATGTATTCACCTACACCACCATGATCTCAGGACTCGCTAAGGCTGGAAACATTGCGGAGGCAAATGCTCTTTTCGAGAAGTTTAAGGAAAAGGGCGGTGTGCCTGATTCTGCTATTTACAATGCTATAATAGAAGGGTTAAGTAATGCAAACAGGGCATTGGATGCTTATAGAATTTTTGAGGAAACTCGATCGAAAGGTTGTAGTATTCACACGAAAACTTGTGTTGTTCTATTAGATTCACTGCATAAGGCTGAATGCATCGAGCAGGCTGCAATCGTGGGTGCTGTATTACGGGAAACTGCTAAGGCTCAGCATGCTGCAAGATCCTGGACATAACGTGGTATCTATGAGAGCAACTAGATGGAGCTGTTTTAATCAAAAGTAGTGATTGCCTTTGCAGGTAATAGGAAATACTTGAAGTATGAGTGAAAATAACCTCGAGGGCGATGAGTAATGTTGTTGAAGTCATACTAAGTGAAGTATGCAGATACAGGTAACATTACATAACCTTACTGGGTAGAATGAGAACGGCTAGACATGGAAGCTGAAGATCGTTAACAATTCAGTACTTACATTTTGCTGGTCGGTTTCAAGATTTTGTAGAATCTGTGAGTCTGTGACCTCTTCTTAGTGCATCTAAAGTGTCCAAAGCGAGGACAGTTTTCAATGCTGAACAACGATGGCACAGCAGGTCACTTTGGTTATGCTTGTTCTTATGCAATTCCCTTGGTAAGTCGAGAACAAAGAGCTTTTAGGTGCTCTGAAGTCCAGTTGCAAGTCTTCATTCATGGCTTTCATGGCTTTCCAGGATGCCATATCCCTCAGTCTCATTAGCGGTGGTTTCGTTCTGGACATCATGCTGACAATGGTAAACTATCTCCCGGCTCGTGATATTGATACGCAACTTAAAACTAACCTGTGCCAAAGCTACCATTTTCTTGATAAATTCATCCTTGATTTTTTCCCAGTTATATTTAAAGGTCATGTTAAAGTTACCATTTTGATTCTGTAGTTCCATGGACTTTTTTCCCCACATGTTAACATGTTATGCATTTAATTGCATACAGTGAGTTATTCAAGTTTTGCAGTTTTTTATTTGGTTGAGAAAAATGGGTACAAATTTTGGTGTTAAGTAAAACAATACACAGGTCAATTTCAAGCTATGGTTCTGTTTGGGAAAAAGAAAAATCTTATGTACCATCTAACCTCGTTAAGATATATATATTTGTGCTTCTTCATTCACTAATATAAGAAAAGTTTAAATACAAATTTAATTTATGAATATTAAAATTTATTTCTATTTGGTCAATGAACTTTTTAAAATTTCTAATAGATCTCTAAACTTTAGGTTTTCTTCTCGTAGGTCATTGAATTTTAAAAGTGTCTAGAGAATTCTTAAACTATCAATTTTGTGTATAATAGATTCTTGAATTTTCAATTTTGATTATAATAGGTTTTTGACCTATTCCATCTTTTTTAAAAACCCATAGATCTACTAGATACAAAATTAAAAGTTTAAGTTTCATTACACATTAAATCTAATAGATGAATTAGTAATTACATTTTTTAATATGTTAGGGATAAATTAGACACACATATTAGAAATCTTTTAATTCAAATATTCTATTAGACACAAAATTAAAAGTTTAAAGATTTATTAGATACTTTTGAAAGTTAAAGAACTAAATATATACAAGCTTCGAAGTTTAGGAACAAAACTTGTACCTTAACCTTTATATAAAAAAAGGAAATTTGTATCAAATAATACAAACAGGG
AAAAAAAACTTATCTTATATTCCATGTTTGAAATATTTGCTAATATAGAAAATGTTGGCTGGTCAAGGTATTGGTTTTTTGGAAATTCCAAATTTACCCTATCTTTTCCTTCATTCACTTTCCTCTTTTACATTTTCGATTTCATTCTCGTGTCTTTGTTTCAGGTGTGCAAATGATTGGTGAATTCGAAATCGTGAAGCTCTTCGATTCAATCTCTCCGTGATACTACGACCAATTCCAATTCGTGCATTCGTTTTCCTTTTCCCTATTGCATCTTCGACTGTGAGTTATTTTTCTTTGAGCATTCTTTTTTTCGATTTTGTTTTTCGTTTTGTTTTTCGATTTCTTCAGCTCATTTACTCTTTTCTTTCTCTTTTTCTCAAGAACCTAATGATTACTTCAGCTCACTGAGAACATAGTGG

mRNA sequence

CTCTCCCTCTGCGCCTTCATTTGACATTTCTCTTCCATGGTTGAGCTTCTCGTATTTGGGCTGATTCGATCACTATAAATCCTCAAGGCTACACTCTCTTATACCGTTCGTTTGACTATTCAACTTGCGCGGTTCATCTGCCGCACTTGTTCATCGGATTGGAGTCAGCTGCCTGAAGAGGTTCTTCGTTGTTTTCATGAATTCAGCTTTTCTGTGTGAATAGCTGAGAGTTTGAGACATTGTCAACTATTGACATGCGTACATGAAGATGCTTCTAAGGAACATAGGTGCAGGACAAATCAATTGTCTTGATTTGAAGTACAGAAACCCTATTAAATTTTCTTTTAAATTTTTTTCCTCGTATGCTGGGGATTCTTCTCAAACAACAAATAGAAATGGGGCCCCTGTTTCTGGTGGGGGTGGTCTGGTGCCGGCAACAAAGTATGAGGACAAGAGACAAGTTTTAGATGGTGTGTGCCAAATTTTGGAGACTGGTCCTTGGGGATCTTTGGTTGAGAATAAGTTAGCGGAGCTTGACGCAAAACCAAATACAGAATTGGTAATTGGAGTCTTAAGGAGGCTGAAGGACGTAAACAATGCAGTAAATTACTTTCGATGGGCTGAGAGAGTAACAGACCTAGCACATTGCCCTGAAGCATACAATTCACTTCTCATGGTTATGGCTAGAACTAGAAAGTTTAATTGCTTGGAACAAATATTGGAAGAAATGAGTATTGCAGGTTTTGGCCCGTCAAATAACACATGTATTGAAATTGTACTAAGCCTTGTCAAATCTCGCAAGCTTAGAGAAGCTTTTACATTTATGCAAACTATGAGAAAGTTAAAATTCCGCCCAGCCTTTTCAGCATACACAACTTTGATTGGTGCACTATCTGCATCTCATGATTCTGATTGCATGCTCACCTTATTTCAGCAAATGCAGGAGCTTGGCTATGAAGTTAATGTTCATTTATTCACTACTCTCATTCGTGTATTTGCTAGAGAGGGTCGAGTTGATGCTGCACTCTCTCTTTTGGATGAGATGAAGAGCAATTCTTTAGAACCAGATGTTGTTCTTTATAATGTCTGTATAGATTGCTTTGGGAAGGCTGGGAAGGTGGATATGGCTTGGAAATTTTTTCATGAAATGAAAGCTAATGGTTTGGTTCTTGATGATGTAACTCATACTAGCATGATAGGAGTTCTCTGTAAAGCTGACAGGATGAATGAAGCAGTTGAGCTATTTGAACATATGGATCAAAACAAGCAAGTGCCTTGTGCATATGCATATAATACTATGATCATGGGTTATGGTATGGCTGGAAAGTTTGATGAGGCATACAGCTTACTTGAGAGACAGAGAAGAAAAGGATGCATTCCAAGTGTCGTCGCATATAATTGCATTCTTACTTGTCTTGGGAGGAAGGGGCGGGTAGACGAGGCATTAAAACTTTTTGAAGAGATGAAGAAAGATGCCATTCCCAATCTTTCAACATATAATATTATGATTGACATGCTTTGTAAGGCTGGACAACTCGAGACAGCGTTGGTTGTCCGGGATGCCATGAAAGATGCTGGGTTGTTTCCTAATGTTATTACAGTAAACATAATGGTTGACAGATTGTGTAAAGCCCAAAGACTTGATGATGCTTGTTCTATTTTCGAAGGGTTGGATCATAAAACTTGCACACCTGATACAGTAACATATTGTTCTCTTATAGAAGGATTGGGTAAGCATGGGAGAGTAGATGATGCCTACAAGCTATATGAACAGATGTTGGATTCTGACCAGATCCCAAATGCTGTTGTGTATACATCTCTCATAAGGAACTTTTTCAAGTGTGGAAGGAAGGAGGATGGCCACAAGATATATAACGAAATGATACGTCTAGGTTGTTCTCCTGACCTGCTGCTTCTTAATACCTACATGGATTGCGTTTTTAAAGCTGGAGAAATTGAGAAGGGCCGGGCCTTGTTTCAGGAGATTAAGGCCCTAGGAT
TTATTCCAGATGTGAGGAGTTATACAATCCTAATTCATGGCTTGGTGAAAGCAGGTTTTGCGCATGAATCTTATGAGTTGTTCTACACAATGAAGGAACAAGGTTGTGTTCTGGATACTCGTGCATATAACACCGTTATCAATGGATTCTGCAAGTCTGGCAAGGTAAATAAAGCTTATCAATTGCTAGAGGAGATGAAGACAAAGGGTCATGAACCCACTGTTGTTACTTATGGTTCTGTTATCGATGGGCTTGCAAAGATTGACCGGCTTGATGAGGCATATATGCTCTTTGAAGAAGCAAAGTCGAAAGGAGTAGAACTAAATGTTGTTATATATAGCAGTCTAATCGATGGATTTGGGAAAGTGGGTAGAATCGATGAAGCATACTTGATCATGGAAGAGTTGATGCAAAAAGGTTTGACACCTAGTGTATACACATGGAATTGCTTGCTTGATGCATTGGTGAAAGCAGAAGAAATTAGCGAAGCCCTTGTTTGCTTTCAGTCTATGAAAGACTTGAAATGTACTCCTAATTATATAACTTATAGCATTCTAATTCATGGTCTTTGTAAGATTAGAAAATTCAATAAGGCATTCGTGTTCTGGCAAGAGATGCAGAAGCAAGGCTTAAAGCCTAATGTATTCACCTACACCACCATGATCTCAGGACTCGCTAAGGCTGGAAACATTGCGGAGGCAAATGCTCTTTTCGAGAAGTTTAAGGAAAAGGGCGGTGTGCCTGATTCTGCTATTTACAATGCTATAATAGAAGGGTTAAGTAATGCAAACAGGGCATTGGATGCTTATAGAATTTTTGAGGAAACTCGATCGAAAGGTTGTAGTATTCACACGAAAACTTGTGTTGTTCTATTAGATTCACTGCATAAGGCTGAATGCATCGAGCAGGCTGCAATCGTGGGTGCTGTATTACGGGAAACTGCTAAGGCTCAGCATGCTGCAAGATCCTGGACATAACGTGGTATCTATGAGAGCAACTAGATGGAGCTGTTTTAATCAAAAGTAGTGATTGCCTTTGCAGGTAATAGGAAATACTTGAAGTATGAGTGAAAATAACCTCGAGGGCGATGAGTAATGTTGTTGAAGTCATACTAAGTGAAGTATGCAGATACAGGTAACATTACATAACCTTACTGGGTAGAATGAGAACGGCTAGACATGGAAGCTGAAGATCGTTAACAATTCAGTACTTACATTTTGCTGGTCGGTTTCAAGATTTTGTAGAATCTGTGAGTCTGTGACCTCTTCTTAGTGCATCTAAAGTGTCCAAAGCGAGGACAGTTTTCAATGCTGAACAACGATGGCACAGCAGGTCACTTTGGTTATGCTTGTTCTTATGCAATTCCCTTGGTAAGTCGAGAACAAAGAGCTTTTAGGTGCTCTGAAGTCCAGTTGCAAGTCTTCATTCATGGCTTTCATGGCTTTCCAGGATGCCATATCCCTCAGTCTCATTAGCGGTGGTTTCGTTCTGGACATCATGCTGACAATGGTGTGCAAATGATTGGTGAATTCGAAATCGTGAAGCTCTTCGATTCAATCTCTCCGTGATACTACGACCAATTCCAATTCGTGCATTCGTTTTCCTTTTCCCTATTGCATCTTCGACTGTGAGTTATTTTTCTTTGAGCATTCTTTTTTTCGATTTTGTTTTTCGTTTTGTTTTTCGATTTCTTCAGCTCATTTACTCTTTTCTTTCTCTTTTTCTCAAGAACCTAATGATTACTTCAGCTCACTGAGAACATAGTGG

Coding sequence (CDS)

ATGAAGATGCTTCTAAGGAACATAGGTGCAGGACAAATCAATTGTCTTGATTTGAAGTACAGAAACCCTATTAAATTTTCTTTTAAATTTTTTTCCTCGTATGCTGGGGATTCTTCTCAAACAACAAATAGAAATGGGGCCCCTGTTTCTGGTGGGGGTGGTCTGGTGCCGGCAACAAAGTATGAGGACAAGAGACAAGTTTTAGATGGTGTGTGCCAAATTTTGGAGACTGGTCCTTGGGGATCTTTGGTTGAGAATAAGTTAGCGGAGCTTGACGCAAAACCAAATACAGAATTGGTAATTGGAGTCTTAAGGAGGCTGAAGGACGTAAACAATGCAGTAAATTACTTTCGATGGGCTGAGAGAGTAACAGACCTAGCACATTGCCCTGAAGCATACAATTCACTTCTCATGGTTATGGCTAGAACTAGAAAGTTTAATTGCTTGGAACAAATATTGGAAGAAATGAGTATTGCAGGTTTTGGCCCGTCAAATAACACATGTATTGAAATTGTACTAAGCCTTGTCAAATCTCGCAAGCTTAGAGAAGCTTTTACATTTATGCAAACTATGAGAAAGTTAAAATTCCGCCCAGCCTTTTCAGCATACACAACTTTGATTGGTGCACTATCTGCATCTCATGATTCTGATTGCATGCTCACCTTATTTCAGCAAATGCAGGAGCTTGGCTATGAAGTTAATGTTCATTTATTCACTACTCTCATTCGTGTATTTGCTAGAGAGGGTCGAGTTGATGCTGCACTCTCTCTTTTGGATGAGATGAAGAGCAATTCTTTAGAACCAGATGTTGTTCTTTATAATGTCTGTATAGATTGCTTTGGGAAGGCTGGGAAGGTGGATATGGCTTGGAAATTTTTTCATGAAATGAAAGCTAATGGTTTGGTTCTTGATGATGTAACTCATACTAGCATGATAGGAGTTCTCTGTAAAGCTGACAGGATGAATGAAGCAGTTGAGCTATTTGAACATATGGATCAAAACAAGCAAGTGCCTTGTGCATATGCATATAATACTATGATCATGGGTTATGGTATGGCTGGAAAGTTTGATGAGGCATACAGCTTACTTGAGAGACAGAGAAGAAAAGGATGCATTCCAAGTGTCGTCGCATATAATTGCATTCTTACTTGTCTTGGGAGGAAGGGGCGGGTAGACGAGGCATTAAAACTTTTTGAAGAGATGAAGAAAGATGCCATTCCCAATCTTTCAACATATAATATTATGATTGACATGCTTTGTAAGGCTGGACAACTCGAGACAGCGTTGGTTGTCCGGGATGCCATGAAAGATGCTGGGTTGTTTCCTAATGTTATTACAGTAAACATAATGGTTGACAGATTGTGTAAAGCCCAAAGACTTGATGATGCTTGTTCTATTTTCGAAGGGTTGGATCATAAAACTTGCACACCTGATACAGTAACATATTGTTCTCTTATAGAAGGATTGGGTAAGCATGGGAGAGTAGATGATGCCTACAAGCTATATGAACAGATGTTGGATTCTGACCAGATCCCAAATGCTGTTGTGTATACATCTCTCATAAGGAACTTTTTCAAGTGTGGAAGGAAGGAGGATGGCCACAAGATATATAACGAAATGATACGTCTAGGTTGTTCTCCTGACCTGCTGCTTCTTAATACCTACATGGATTGCGTTTTTAAAGCTGGAGAAATTGAGAAGGGCCGGGCCTTGTTTCAGGAGATTAAGGCCCTAGGATTTATTCCAGATGTGAGGAGTTATACAATCCTAATTCATGGCTTGGTGAAAGCAGGTTTTGCGCATGAATCTTATGAGTTGTTCTACACAATGAAGGAACAAGGTTGTGTTCTGGATACTCGTGCATATAACACCGTTATCAATGGATTCTGCAAGTCTGGCAAGGTAAATAAAGCTTATCAATTGCTAGAGGAGATGAAGACAAAGGGTCATGAACCCACTGTTGTTACTTATGGTTCTGTTATCGATGGGCTTGCAAAGAT
TGACCGGCTTGATGAGGCATATATGCTCTTTGAAGAAGCAAAGTCGAAAGGAGTAGAACTAAATGTTGTTATATATAGCAGTCTAATCGATGGATTTGGGAAAGTGGGTAGAATCGATGAAGCATACTTGATCATGGAAGAGTTGATGCAAAAAGGTTTGACACCTAGTGTATACACATGGAATTGCTTGCTTGATGCATTGGTGAAAGCAGAAGAAATTAGCGAAGCCCTTGTTTGCTTTCAGTCTATGAAAGACTTGAAATGTACTCCTAATTATATAACTTATAGCATTCTAATTCATGGTCTTTGTAAGATTAGAAAATTCAATAAGGCATTCGTGTTCTGGCAAGAGATGCAGAAGCAAGGCTTAAAGCCTAATGTATTCACCTACACCACCATGATCTCAGGACTCGCTAAGGCTGGAAACATTGCGGAGGCAAATGCTCTTTTCGAGAAGTTTAAGGAAAAGGGCGGTGTGCCTGATTCTGCTATTTACAATGCTATAATAGAAGGGTTAAGTAATGCAAACAGGGCATTGGATGCTTATAGAATTTTTGAGGAAACTCGATCGAAAGGTTGTAGTATTCACACGAAAACTTGTGTTGTTCTATTAGATTCACTGCATAAGGCTGAATGCATCGAGCAGGCTGCAATCGTGGGTGCTGTATTACGGGAAACTGCTAAGGCTCAGCATGCTGCAAGATCCTGGACATAA

Protein sequence

MKMLLRNIGAGQINCLDLKYRNPIKFSFKFFSSYAGDSSQTTNRNGAPVSGGGGLVPATKYEDKRQVLDGVCQILETGPWGSLVENKLAELDAKPNTELVIGVLRRLKDVNNAVNYFRWAERVTDLAHCPEAYNSLLMVMARTRKFNCLEQILEEMSIAGFGPSNNTCIEIVLSLVKSRKLREAFTFMQTMRKLKFRPAFSAYTTLIGALSASHDSDCMLTLFQQMQELGYEVNVHLFTTLIRVFAREGRVDAALSLLDEMKSNSLEPDVVLYNVCIDCFGKAGKVDMAWKFFHEMKANGLVLDDVTHTSMIGVLCKADRMNEAVELFEHMDQNKQVPCAYAYNTMIMGYGMAGKFDEAYSLLERQRRKGCIPSVVAYNCILTCLGRKGRVDEALKLFEEMKKDAIPNLSTYNIMIDMLCKAGQLETALVVRDAMKDAGLFPNVITVNIMVDRLCKAQRLDDACSIFEGLDHKTCTPDTVTYCSLIEGLGKHGRVDDAYKLYEQMLDSDQIPNAVVYTSLIRNFFKCGRKEDGHKIYNEMIRLGCSPDLLLLNTYMDCVFKAGEIEKGRALFQEIKALGFIPDVRSYTILIHGLVKAGFAHESYELFYTMKEQGCVLDTRAYNTVINGFCKSGKVNKAYQLLEEMKTKGHEPTVVTYGSVIDGLAKIDRLDEAYMLFEEAKSKGVELNVVIYSSLIDGFGKVGRIDEAYLIMEELMQKGLTPSVYTWNCLLDALVKAEEISEALVCFQSMKDLKCTPNYITYSILIHGLCKIRKFNKAFVFWQEMQKQGLKPNVFTYTTMISGLAKAGNIAEANALFEKFKEKGGVPDSAIYNAIIEGLSNANRALDAYRIFEETRSKGCSIHTKTCVVLLDSLHKAECIEQAAIVGAVLRETAKAQHAARSWT
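
The relationship between the coding sequence and the protein sequence above can be checked with a short translation routine. This is a sketch only, assuming the standard nuclear genetic code (NCBI translation table 1); the function name `translate` is hypothetical and not part of any database tool.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First nine codons and last six codons of the CDS listed above.
print(translate("ATGAAGATGCTTCTAAGGAACATAGGT"))
print(translate("GCAAGATCCTGGACATAA"))
```

The two fragments correspond to the start (MKMLLRNIG) and the end (ARSWT, followed by the TAA stop codon) of the protein sequence listed above.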
Homology
BLAST of Bhi01G000722 vs. TAIR 10
Match: AT3G06920.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 1360.9 bits (3521), Expect = 0.0e+00
Identity = 652/843 (77.34%), Positives = 748/843 (88.73%), Query Frame = 0

Query: 61  YEDKRQVLDGVCQILETGPWGSLVENKLAELDAKPNTELVIGVLRRLKDVNNAVNYFRWA 120
           +E  RQ ++ +C +LETGPWG   EN L+ L  KP  E VIGVLRRLKDVN A+ YFRW 
Sbjct: 29  FEGNRQTVNDICNVLETGPWGPSAENTLSALSFKPQPEFVIGVLRRLKDVNRAIEYFRWY 88

Query: 121 ERVTDLAHCPEAYNSLLMVMARTRKFNCLEQILEEMSIAGFGPSNNTCIEIVLSLVKSRK 180
           ER T+L HCPE+YNSLL+VMAR R F+ L+QIL EMS+AGFGPS NTCIE+VL  VK+ K
Sbjct: 89  ERRTELPHCPESYNSLLLVMARCRNFDALDQILGEMSVAGFGPSVNTCIEMVLGCVKANK 148

Query: 181 LREAFTFMQTMRKLKFRPAFSAYTTLIGALSASHDSDCMLTLFQQMQELGYEVNVHLFTT 240
           LRE +  +Q MRK KFRPAFSAYTTLIGA SA + SD MLTLFQQMQELGYE  VHLFTT
Sbjct: 149 LREGYDVVQMMRKFKFRPAFSAYTTLIGAFSAVNHSDMMLTLFQQMQELGYEPTVHLFTT 208

Query: 241 LIRVFAREGRVDAALSLLDEMKSNSLEPDVVLYNVCIDCFGKAGKVDMAWKFFHEMKANG 300
           LIR FA+EGRVD+ALSLLDEMKS+SL+ D+VLYNVCID FGK GKVDMAWKFFHE++ANG
Sbjct: 209 LIRGFAKEGRVDSALSLLDEMKSSSLDADIVLYNVCIDSFGKVGKVDMAWKFFHEIEANG 268

Query: 301 LVLDDVTHTSMIGVLCKADRMNEAVELFEHMDQNKQVPCAYAYNTMIMGYGMAGKFDEAY 360
           L  D+VT+TSMIGVLCKA+R++EAVE+FEH+++N++VPC YAYNTMIMGYG AGKFDEAY
Sbjct: 269 LKPDEVTYTSMIGVLCKANRLDEAVEMFEHLEKNRRVPCTYAYNTMIMGYGSAGKFDEAY 328

Query: 361 SLLERQRRKGCIPSVVAYNCILTCLGRKGRVDEALKLFEEMKKDAIPNLSTYNIMIDMLC 420
           SLLERQR KG IPSV+AYNCILTCL + G+VDEALK+FEEMKKDA PNLSTYNI+IDMLC
Sbjct: 329 SLLERQRAKGSIPSVIAYNCILTCLRKMGKVDEALKVFEEMKKDAAPNLSTYNILIDMLC 388

Query: 421 KAGQLETALVVRDAMKDAGLFPNVITVNIMVDRLCKAQRLDDACSIFEGLDHKTCTPDTV 480
           +AG+L+TA  +RD+M+ AGLFPNV TVNIMVDRLCK+Q+LD+AC++FE +D+K CTPD +
Sbjct: 389 RAGKLDTAFELRDSMQKAGLFPNVRTVNIMVDRLCKSQKLDEACAMFEEMDYKVCTPDEI 448

Query: 481 TYCSLIEGLGKHGRVDDAYKLYEQMLDSDQIPNAVVYTSLIRNFFKCGRKEDGHKIYNEM 540
           T+CSLI+GLGK GRVDDAYK+YE+MLDSD   N++VYTSLI+NFF  GRKEDGHKIY +M
Sbjct: 449 TFCSLIDGLGKVGRVDDAYKVYEKMLDSDCRTNSIVYTSLIKNFFNHGRKEDGHKIYKDM 508

Query: 541 IRLGCSPDLLLLNTYMDCVFKAGEIEKGRALFQEIKALGFIPDVRSYTILIHGLVKAGFA 600
           I   CSPDL LLNTYMDC+FKAGE EKGRA+F+EIKA  F+PD RSY+ILIHGL+KAGFA
Sbjct: 509 INQNCSPDLQLLNTYMDCMFKAGEPEKGRAMFEEIKARRFVPDARSYSILIHGLIKAGFA 568

Query: 601 HESYELFYTMKEQGCVLDTRAYNTVINGFCKSGKVNKAYQLLEEMKTKGHEPTVVTYGSV 660
           +E+YELFY+MKEQGCVLDTRAYN VI+GFCK GKVNKAYQLLEEMKTKG EPTVVTYGSV
Sbjct: 569 NETYELFYSMKEQGCVLDTRAYNIVIDGFCKCGKVNKAYQLLEEMKTKGFEPTVVTYGSV 628

Query: 661 IDGLAKIDRLDEAYMLFEEAKSKGVELNVVIYSSLIDGFGKVGRIDEAYLIMEELMQKGL 720
           IDGLAKIDRLDEAYMLFEEAKSK +ELNVVIYSSLIDGFGKVGRIDEAYLI+EELMQKGL
Sbjct: 629 IDGLAKIDRLDEAYMLFEEAKSKRIELNVVIYSSLIDGFGKVGRIDEAYLILEELMQKGL 688

Query: 721 TPSVYTWNCLLDALVKAEEISEALVCFQSMKDLKCTPNYITYSILIHGLCKIRKFNKAFV 780
           TP++YTWN LLDALVKAEEI+EALVCFQSMK+LKCTPN +TY ILI+GLCK+RKFNKAFV
Sbjct: 689 TPNLYTWNSLLDALVKAEEINEALVCFQSMKELKCTPNQVTYGILINGLCKVRKFNKAFV 748

Query: 781 FWQEMQKQGLKPNVFTYTTMISGLAKAGNIAEANALFEKFKEKGGVPDSAIYNAIIEGLS 840
           FWQEMQKQG+KP+  +YTTMISGLAKAGNIAEA ALF++FK  GGVPDSA YNA+IEGLS
Sbjct: 749 FWQEMQKQGMKPSTISYTTMISGLAKAGNIAEAGALFDRFKANGGVPDSACYNAMIEGLS 808

Query: 841 NANRALDAYRIFEETRSKGCSIHTKTCVVLLDSLHKAECIEQAAIVGAVLRETAKAQHAA 900
           N NRA+DA+ +FEETR +G  IH KTCVVLLD+LHK +C+EQAAIVGAVLRET KA+HAA
Sbjct: 809 NGNRAMDAFSLFEETRRRGLPIHNKTCVVLLDTLHKNDCLEQAAIVGAVLRETGKARHAA 868

Query: 901 RSW 904
           RSW
Sbjct: 869 RSW 871
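
The percentages reported for this hit follow directly from the counts on its statistics line. A minimal sketch of that arithmetic, with a hypothetical helper named `percentages` that is agnostic to the label spelling:

```python
import re

# Statistics line of the best TAIR 10 hit (AT3G06920.1).
stats_line = "Identity = 652/843 (77.34%), Positives = 748/843 (88.73%)"


def percentages(line: str) -> dict:
    """Recompute each '<label> = <matched>/<aligned>' ratio as a percentage."""
    result = {}
    for label, matched, total in re.findall(r"(\w+) = (\d+)/(\d+)", line):
        result[label] = round(100 * int(matched) / int(total), 2)
    return result


print(percentages(stats_line))
```

Identity counts exact residue matches over the aligned length (652 of 843), while positives also count the conservative substitutions marked with `+` in the middle line of the alignment (748 of 843).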

BLAST of Bhi01G000722 vs. TAIR 10
Match: AT4G31850.1 (proton gradient regulation 3)

HSP 1 Score: 373.2 bits (957), Expect = 5.7e-103
Identity = 244/793 (30.77%), Positives = 389/793 (49.05%), Query Frame = 0

Query: 121  ERVTDLAHCPE--AYNSLLMVMARTRKFNCLEQILEEMSIAGFGPSNNTCIEIVLSLVKS 180
            E++    H P+   Y +LL   +  R  + ++Q   EM   G  P   T   +V +L K+
Sbjct: 317  EKMKTGRHKPDRVTYITLLDRFSDNRDLDSVKQFWSEMEKDGHVPDVVTFTILVDALCKA 376

Query: 181  RKLREAFTFMQTMRKLKFRPAFSAYTTLIGALSASHDSDCMLTLFQQMQELGYEVNVHLF 240
                EAF  +  MR     P    Y TLI  L   H  D  L LF  M+ LG +   + +
Sbjct: 377  GNFGEAFDTLDVMRDQGILPNLHTYNTLICGLLRVHRLDDALELFGNMESLGVKPTAYTY 436

Query: 241  TTLIRVFAREGRVDAALSLLDEMKSNSLEPDVVLYNVCIDCFGKAGKVDMAWKFFHEMKA 300
               I  + + G   +AL   ++MK+  + P++V  N  +    KAG+   A + F+ +K 
Sbjct: 437  IVFIDYYGKSGDSVSALETFEKMKTKGIAPNIVACNASLYSLAKAGRDREAKQIFYGLKD 496

Query: 301  NGLVLDDVTHTSMIGVLCKADRMNEAVELFEHMDQNKQVPCAYAYNTMIMGYGMAGKFDE 360
             GLV D VT+  M+    K   ++EA++L   M +N   P     N++I     A + DE
Sbjct: 497  IGLVPDSVTYNMMMKCYSKVGEIDEAIKLLSEMMENGCEPDVIVVNSLINTLYKADRVDE 556

Query: 361  AYSLLERQRRKGCIPSVVAYNCILTCLGRKGRVDEALKLFEEM-KKDAIPNLSTYNIMID 420
            A+ +  R +     P+VV YN +L  LG+ G++ EA++LFE M +K   PN  T+N + D
Sbjct: 557  AWKMFMRMKEMKLKPTVVTYNTLLAGLGKNGKIQEAIELFEGMVQKGCPPNTITFNTLFD 616

Query: 421  MLCKAGQLETALVVRDAMKDAGLFPNVITVNIMVDRLCKAQRLDDACSIFEGLDHKTCTP 480
             LCK  ++  AL +   M D G  P+V T N ++  L K  ++ +A   F  +  K   P
Sbjct: 617  CLCKNDEVTLALKMLFKMMDMGCVPDVFTYNTIIFGLVKNGQVKEAMCFFHQM-KKLVYP 676

Query: 481  DTVTYCSLIEGLGKHGRVDDAYKLYEQML--DSDQ------------------IPNAVVY 540
            D VT C+L+ G+ K   ++DAYK+    L   +DQ                  I NAV +
Sbjct: 677  DFVTLCTLLPGVVKASLIEDAYKIITNFLYNCADQPANLFWEDLIGSILAEAGIDNAVSF 736

Query: 541  TS-LIRNFFKCGRKEDGHKIYNEMIR---------------------LGCSPDLLLLNTY 600
            +  L+ N    G   DG  I   +IR                     LG  P L   N  
Sbjct: 737  SERLVAN----GICRDGDSILVPIIRYSCKHNNVSGARTLFEKFTKDLGVQPKLPTYNLL 796

Query: 601  MDCVFKAGEIEKGRALFQEIKALGFIPDVRSYTILIHGLVKAGFAHESYELFYTMKEQGC 660
            +  + +A  IE  + +F ++K+ G IPDV +Y  L+    K+G   E +EL+  M    C
Sbjct: 797  IGGLLEADMIEIAQDVFLQVKSTGCIPDVATYNFLLDAYGKSGKIDELFELYKEMSTHEC 856

Query: 661  VLDTRAYNTVINGFCKSGKVNKAYQLLEE-MKTKGHEPTVVTYGSVIDGLAKIDRLDEAY 720
              +T  +N VI+G  K+G V+ A  L  + M  +   PT  TYG +IDGL+K  RL EA 
Sbjct: 857  EANTITHNIVISGLVKAGNVDDALDLYYDLMSDRDFSPTACTYGPLIDGLSKSGRLYEAK 916

Query: 721  MLFEEAKSKGVELNVVIYSSLIDGFGKVGRIDEAYLIMEELMQKGLTPSVYTWNCLLDAL 780
             LFE     G   N  IY+ LI+GFGK G  D A  + + ++++G+ P + T++ L+D L
Sbjct: 917  QLFEGMLDYGCRPNCAIYNILINGFGKAGEADAACALFKRMVKEGVRPDLKTYSVLVDCL 976

Query: 781  VKAEEISEALVCFQSMKDLKCTPNYITYSILIHGLCKIRKFNKAFVFWQEMQ-KQGLKPN 840
                 + E L  F+ +K+    P+ + Y+++I+GL K  +  +A V + EM+  +G+ P+
Sbjct: 977  CMVGRVDEGLHYFKELKESGLNPDVVCYNLIINGLGKSHRLEEALVLFNEMKTSRGITPD 1036

Query: 841  VFTYTTMISGLAKAGNIAEANALFEKFKEKGGVPDSAIYNAIIEGLSNANRALDAYRIFE 867
            ++TY ++I  L  AG + EA  ++ + +  G  P+   +NA+I G S + +   AY +++
Sbjct: 1037 LYTYNSLILNLGIAGMVEEAGKIYNEIQRAGLEPNVFTFNALIRGYSLSGKPEHAYAVYQ 1096

BLAST of Bhi01G000722 vs. TAIR 10
Match: AT1G06710.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 326.2 bits (835), Expect = 8.1e-89
Identity = 210/729 (28.81%), Positives = 329/729 (45.13%), Query Frame = 0

Query: 187 FMQTMRKLKFRPAFSAYTTLIGALSASHDSDCMLTLFQQMQELGYEVNVHLFTTLIRVFA 246
           F+   R++ ++     Y  L+  +    D        QQ+++   EV       L+R   
Sbjct: 152 FVWAGRQIGYKHTAPVYNALVDLIVRDDDEKVPEEFLQQIRDDDKEVFGEFLNVLVRKHC 211

Query: 247 REGRVDAALSLLDEMKSNSLEPDVVLYNVCIDCFGKAGKVDMAWKFFHEMKANGLVLDDV 306
           R G    AL  L  +K     P    YN  I  F KA ++D A     EM    L +D  
Sbjct: 212 RNGSFSIALEELGRLKDFRFRPSRSTYNCLIQAFLKADRLDSASLIHREMSLANLRMDGF 271

Query: 307 THTSMIGVLCKADRMNEAVELFEHMDQNKQVPCAYAYNTMIMGYGMAGKFDEAYSLLERQ 366
           T       LCK  +  EA+ L E       VP    Y  +I G   A  F+EA   L R 
Sbjct: 272 TLRCFAYSLCKVGKWREALTLVE---TENFVPDTVFYTKLISGLCEASLFEEAMDFLNRM 331

Query: 367 RRKGCIPSVVAYNCILT-CLGRK--GRVDEALKLFEEMKKDAIPNLSTYNIMIDMLCKAG 426
           R   C+P+VV Y+ +L  CL +K  GR    L +   M +   P+   +N ++   C +G
Sbjct: 332 RATSCLPNVVTYSTLLCGCLNKKQLGRCKRVLNMM--MMEGCYPSPKIFNSLVHAYCTSG 391

Query: 427 QLETALVVRDAMKDAGLFPNVITVNIMVDRLC------KAQRLDDACSIFEGLDHKTCTP 486
               A  +   M   G  P  +  NI++  +C          LD A   +  +       
Sbjct: 392 DHSYAYKLLKKMVKCGHMPGYVVYNILIGSICGDKDSLNCDLLDLAEKAYSEMLAAGVVL 451

Query: 487 DTVTYCSLIEGLGKHGRVDDAYKLYEQMLDSDQIPNAVVYTSLIRNFFKCGRKEDGHKIY 546
           + +   S    L   G+ + A+ +  +M+    IP+   Y+ ++       + E    ++
Sbjct: 452 NKINVSSFTRCLCSAGKYEKAFSVIREMIGQGFIPDTSTYSKVLNYLCNASKMELAFLLF 511

Query: 547 NEMIRLGCSPDLLLLNTYMDCVFKAGEIEKGRALFQEIKALGFIPDVRSYTILIHGLVKA 606
            EM R G   D+      +D   KAG IE+ R  F E++ +G  P+V +YT LIH  +KA
Sbjct: 512 EEMKRGGLVADVYTYTIMVDSFCKAGLIEQARKWFNEMREVGCTPNVVTYTALIHAYLKA 571

Query: 607 GFAHESYELFYTMKEQGCVLDTRAYNTVINGFCKSGKVNKAYQLLEEM------------ 666
                + ELF TM  +GC+ +   Y+ +I+G CK+G+V KA Q+ E M            
Sbjct: 572 KKVSYANELFETMLSEGCLPNIVTYSALIDGHCKAGQVEKACQIFERMCGSKDVPDVDMY 631

Query: 667 ----KTKGHEPTVVTYGSVIDGLAKIDRLDEAYMLFEEAKSKGVELNVVIYSSLIDGFGK 726
                     P VVTYG+++DG  K  R++EA  L +    +G E N ++Y +LIDG  K
Sbjct: 632 FKQYDDNSERPNVVTYGALLDGFCKSHRVEEARKLLDAMSMEGCEPNQIVYDALIDGLCK 691

Query: 727 VGRIDEAYLIMEELMQKGLTPSVYTWNCLLDALVKAEEISEALVCFQSMKDLKCTPNYIT 786
           VG++DEA  +  E+ + G   ++YT++ L+D   K +    A      M +  C PN + 
Sbjct: 692 VGKLDEAQEVKTEMSEHGFPATLYTYSSLIDRYFKVKRQDLASKVLSKMLENSCAPNVVI 751

Query: 787 YSILIHGLCKIRKFNKAFVFWQEMQKQGLKPNVFTYTTMISGLAKAGNIAEANALFEKFK 846
           Y+ +I GLCK+ K ++A+   Q M+++G +PNV TYT MI G    G I     L E+  
Sbjct: 752 YTEMIDGLCKVGKTDEAYKLMQMMEEKGCQPNVVTYTAMIDGFGMIGKIETCLELLERMG 811

Query: 847 EKGGVPDSAIYNAIIEGLSNANRALD-AYRIFEETRSKGCSIHTKTCVVLLDSLHKAECI 890
            KG  P+   Y  +I+     N ALD A+ + EE +      HT     +++  +K E I
Sbjct: 812 SKGVAPNYVTYRVLIDHCCK-NGALDVAHNLLEEMKQTHWPTHTAGYRKVIEGFNK-EFI 871

BLAST of Bhi01G000722 vs. TAIR 10
Match: AT5G55840.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 304.7 bits (779), Expect = 2.5e-82
Identity = 194/759 (25.56%), Positives = 356/759 (46.90%), Query Frame = 0

Query: 130 PEAYNSLLMVMARTRKFNCLEQILEEMSIAGFGPSNNTCIEIVLSLVKSRKLREAFTFMQ 189
           P  Y+ L+ V  R        +I   M + GF PS  TC  I+ S+VKS +    ++F++
Sbjct: 163 PSVYDILIRVYLREGMIQDSLEIFRLMGLYGFNPSVYTCNAILGSVVKSGEDVSVWSFLK 222

Query: 190 TMRKLKFRPAFSAYTTLIGALSASHDSDCMLTLFQQMQELGYEVNVHLFTTLIRVFAREG 249
            M K K  P  + +  LI  L A    +    L Q+M++ GY   +  + T++  + ++G
Sbjct: 223 EMLKRKICPDVATFNILINVLCAEGSFEKSSYLMQKMEKSGYAPTIVTYNTVLHWYCKKG 282

Query: 250 RVDAALSLLDEMKSNSLEPDVVLYNVCIDCFGKAGKVDMAWKFFHEMKANGLVLDDVTHT 309
           R  AA+ LLD MKS  ++ DV  YN+ I    ++ ++   +    +M+   +  ++VT+ 
Sbjct: 283 RFKAAIELLDHMKSKGVDADVCTYNMLIHDLCRSNRIAKGYLLLRDMRKRMIHPNEVTYN 342

Query: 310 SMIGVLCKADRMNEAVELFEHMDQNKQVPCAYAYNTMIMGYGMAGKFDEAYSLLERQRRK 369
           ++I       ++  A +L   M      P    +N +I G+   G F EA  +      K
Sbjct: 343 TLINGFSNEGKVLIASQLLNEMLSFGLSPNHVTFNALIDGHISEGNFKEALKMFYMMEAK 402

Query: 370 GCIPSVVAYNCILTCLGRKGRVDEALKLFEEMKKDAI-PNLSTYNIMIDMLCKAGQLETA 429
           G  PS V+Y  +L  L +    D A   +  MK++ +     TY  MID LCK G L+ A
Sbjct: 403 GLTPSEVSYGVLLDGLCKNAEFDLARGFYMRMKRNGVCVGRITYTGMIDGLCKNGFLDEA 462

Query: 430 LVVRDAMKDAGLFPNVITVNIMVDRLCKAQRLDDACSIFEGLDHKTCTPDTVTYCSLIEG 489
           +V+ + M   G+ P+++T + +++  CK  R   A  I   +     +P+ + Y +LI  
Sbjct: 463 VVLLNEMSKDGIDPDIVTYSALINGFCKVGRFKTAKEIVCRIYRVGLSPNGIIYSTLIYN 522

Query: 490 LGKHGRVDDAYKLYEQMLDSDQIPNAVVYTSLIRNFFKCGRKEDGHKIYNEMIRLGCSPD 549
             + G + +A ++YE M+      +   +  L+ +  K G+  +  +    M   G  P+
Sbjct: 523 CCRMGCLKEAIRIYEAMILEGHTRDHFTFNVLVTSLCKAGKVAEAEEFMRCMTSDGILPN 582

Query: 550 LLLLNTYMDCVFKAGEIEKGRALFQEIKALGFIPDVRSYTILIHGLVKAGFAHESYELFY 609
            +  +  ++    +GE  K  ++F E+  +G  P   +Y  L+ GL K G   E+ +   
Sbjct: 583 TVSFDCLINGYGNSGEGLKAFSVFDEMTKVGHHPTFFTYGSLLKGLCKGGHLREAEKFLK 642

Query: 610 TMKEQGCVLDTRAYNTVINGFCKSGKVNKAYQLLEEMKTKGHEPTVVTYGSVIDGLAKID 669
           ++      +DT  YNT++   CKSG + KA  L  EM  +   P   TY S+I GL +  
Sbjct: 643 SLHAVPAAVDTVMYNTLLTAMCKSGNLAKAVSLFGEMVQRSILPDSYTYTSLISGLCRKG 702

Query: 670 RLDEAYMLFEEAKSKGVEL-NVVIYSSLIDGFGKVGRIDEAYLIMEELMQKGLTPSVYTW 729
           +   A +  +EA+++G  L N V+Y+  +DG  K G+        E++   G TP + T 
Sbjct: 703 KTVIAILFAKEAEARGNVLPNKVMYTCFVDGMFKAGQWKAGIYFREQMDNLGHTPDIVTT 762

Query: 730 NCLLDALVKAEEISEALVCFQSMKDLKCTPNYITYSILIHGLCKIRKFNKAFVFWQEMQK 789
           N ++D   +  +I +       M +    PN  TY+IL+HG  K +  + +F+ ++ +  
Sbjct: 763 NAMIDGYSRMGKIEKTNDLLPEMGNQNGGPNLTTYNILLHGYSKRKDVSTSFLLYRSIIL 822

Query: 790 QGLKPNVFTYTTMISGLAKAGNIAEANALFEKFKEKGGVPDSAIYNAIIEGLSNANRALD 849
            G+ P+  T  +++ G+ ++  +     + + F  +G   D   +N +I           
Sbjct: 823 NGILPDKLTCHSLVLGICESNMLEIGLKILKAFICRGVEVDRYTFNMLISKCCANGEINW 882

Query: 850 AYRIFEETRSKGCSIHTKTCVVLLDSLHKAECIEQAAIV 887
           A+ + +   S G S+   TC  ++  L++    +++ +V
Sbjct: 883 AFDLVKVMTSLGISLDKDTCDAMVSVLNRNHRFQESRMV 921

BLAST of Bhi01G000722 vs. TAIR 10
Match: AT5G61990.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 304.3 bits (778), Expect = 3.3e-82
Identity = 196/695 (28.20%), Positives = 319/695 (45.90%), Query Frame = 0

Query: 198 PAFSAYTTLIGALSASHDSDCMLTLFQQMQELGYEVNVHLFTTLIRVFAREGRVDAALSL 257
           P    Y  LI  L      +   +L  +M  LG  ++ H ++ LI    +    DAA  L
Sbjct: 275 PLKYTYDVLIDGLCKIKRLEDAKSLLVEMDSLGVSLDNHTYSLLIDGLLKGRNADAAKGL 334

Query: 258 LDEMKSNSLEPDVVLYNVCIDCFGKAGKVDMAWKFFHEMKANGLVLDDVTHTSMIGVLCK 317
           + EM S+ +     +Y+ CI    K G ++ A   F  M A+GL+     + S+I   C+
Sbjct: 335 VHEMVSHGINIKPYMYDCCICVMSKEGVMEKAKALFDGMIASGLIPQAQAYASLIEGYCR 394

Query: 318 ADRMNEAVELFEHMDQNKQVPCAYAYNTMIMGYGMAGKFDEAYSLLERQRRKGCIPSVVA 377
              + +  EL   M +   V   Y Y T++ G   +G  D AY++++     GC P+VV 
Sbjct: 395 EKNVRQGYELLVEMKKRNIVISPYTYGTVVKGMCSSGDLDGAYNIVKEMIASGCRPNVVI 454

Query: 378 YNCILTCLGRKGRVDEALKLFEEMKKDAIPNLSTYNIMIDMLCKAGQLETALVVRDAMKD 437
           Y  ++    +  R  +A+++ +E                                  MK+
Sbjct: 455 YTTLIKTFLQNSRFGDAMRVLKE----------------------------------MKE 514

Query: 438 AGLFPNVITVNIMVDRLCKAQRLDDACSIFEGLDHKTCTPDTVTYCSLIEGLGKHGRVDD 497
            G+ P++   N ++  L KA+R+D+A S    +      P+  TY + I G  +      
Sbjct: 515 QGIAPDIFCYNSLIIGLSKAKRMDEARSFLVEMVENGLKPNAFTYGAFISGYIEASEFAS 574

Query: 498 AYKLYEQMLDSDQIPNAVVYTSLIRNFFKCGRKEDGHKIYNEMIRLGCSPDLLLLNTYMD 557
           A K  ++M +   +PN V+ T LI  + K G+  +    Y  M+  G   D       M+
Sbjct: 575 ADKYVKEMRECGVLPNKVLCTGLINEYCKKGKVIEACSAYRSMVDQGILGDAKTYTVLMN 634

Query: 558 CVFKAGEIEKGRALFQEIKALGFIPDVRSYTILIHGLVKAGFAHESYELFYTMKEQGCVL 617
            +FK  +++    +F+E++  G  PDV SY +LI+G  K G   ++  +F  M E+G   
Sbjct: 635 GLFKNDKVDDAEEIFREMRGKGIAPDVFSYGVLINGFSKLGNMQKASSIFDEMVEEGLTP 694

Query: 618 DTRAYNTVINGFCKSGKVNKAYQLLEEMKTKGHEPTVVTYGSVIDGLAKIDRLDEAYMLF 677
           +   YN ++ GFC+SG++ KA +LL+EM  KG  P  VTY ++IDG  K   L EA+ LF
Sbjct: 695 NVIIYNMLLGGFCRSGEIEKAKELLDEMSVKGLHPNAVTYCTIIDGYCKSGDLAEAFRLF 754

Query: 678 EEAKSKGVELNVVIYSSLIDGFGKVGRIDEAYLIMEELMQKGLTPSVYTWNCLLDALV-- 737
           +E K KG+  +  +Y++L+DG  ++  ++ A  I     +KG   S   +N L++ +   
Sbjct: 755 DEMKLKGLVPDSFVYTTLVDGCCRLNDVERAITIF-GTNKKGCASSTAPFNALINWVFKF 814

Query: 738 -KAEEISEAL-VCFQSMKDLKCTPNYITYSILIHGLCKIRKFNKAFVFWQEMQKQGLKPN 797
            K E  +E L        D    PN +TY+I+I  LCK      A   + +MQ   L P 
Sbjct: 815 GKTELKTEVLNRLMDGSFDRFGKPNDVTYNIMIDYLCKEGNLEAAKELFHQMQNANLMPT 874

Query: 798 VFTYTTMISGLAKAGNIAEANALFEKFKEKGGVPDSAIYNAII-----EGLSNANRALDA 857
           V TYT++++G  K G  AE   +F++    G  PD  +Y+ II     EG++     L  
Sbjct: 875 VITYTSLLNGYDKMGRRAEMFPVFDEAIAAGIEPDHIMYSVIINAFLKEGMTTKALVLVD 934

Query: 858 YRIFEETRSKGCSIHTKTCVVLLDSLHKAECIEQA 884
               +     GC +   TC  LL    K   +E A
Sbjct: 935 QMFAKNAVDDGCKLSISTCRALLSGFAKVGEMEVA 934

BLAST of Bhi01G000722 vs. ExPASy Swiss-Prot
Match: Q9M907 (Pentatricopeptide repeat-containing protein At3g06920 OS=Arabidopsis thaliana OX=3702 GN=At3g06920 PE=2 SV=1)

HSP 1 Score: 1360.9 bits (3521), Expect = 0.0e+00
Identity = 652/843 (77.34%), Positives = 748/843 (88.73%), Query Frame = 0

Query: 61  YEDKRQVLDGVCQILETGPWGSLVENKLAELDAKPNTELVIGVLRRLKDVNNAVNYFRWA 120
           +E  RQ ++ +C +LETGPWG   EN L+ L  KP  E VIGVLRRLKDVN A+ YFRW 
Sbjct: 29  FEGNRQTVNDICNVLETGPWGPSAENTLSALSFKPQPEFVIGVLRRLKDVNRAIEYFRWY 88

Query: 121 ERVTDLAHCPEAYNSLLMVMARTRKFNCLEQILEEMSIAGFGPSNNTCIEIVLSLVKSRK 180
           ER T+L HCPE+YNSLL+VMAR R F+ L+QIL EMS+AGFGPS NTCIE+VL  VK+ K
Sbjct: 89  ERRTELPHCPESYNSLLLVMARCRNFDALDQILGEMSVAGFGPSVNTCIEMVLGCVKANK 148

Query: 181 LREAFTFMQTMRKLKFRPAFSAYTTLIGALSASHDSDCMLTLFQQMQELGYEVNVHLFTT 240
           LRE +  +Q MRK KFRPAFSAYTTLIGA SA + SD MLTLFQQMQELGYE  VHLFTT
Sbjct: 149 LREGYDVVQMMRKFKFRPAFSAYTTLIGAFSAVNHSDMMLTLFQQMQELGYEPTVHLFTT 208

Query: 241 LIRVFAREGRVDAALSLLDEMKSNSLEPDVVLYNVCIDCFGKAGKVDMAWKFFHEMKANG 300
           LIR FA+EGRVD+ALSLLDEMKS+SL+ D+VLYNVCID FGK GKVDMAWKFFHE++ANG
Sbjct: 209 LIRGFAKEGRVDSALSLLDEMKSSSLDADIVLYNVCIDSFGKVGKVDMAWKFFHEIEANG 268

Query: 301 LVLDDVTHTSMIGVLCKADRMNEAVELFEHMDQNKQVPCAYAYNTMIMGYGMAGKFDEAY 360
           L  D+VT+TSMIGVLCKA+R++EAVE+FEH+++N++VPC YAYNTMIMGYG AGKFDEAY
Sbjct: 269 LKPDEVTYTSMIGVLCKANRLDEAVEMFEHLEKNRRVPCTYAYNTMIMGYGSAGKFDEAY 328

Query: 361 SLLERQRRKGCIPSVVAYNCILTCLGRKGRVDEALKLFEEMKKDAIPNLSTYNIMIDMLC 420
           SLLERQR KG IPSV+AYNCILTCL + G+VDEALK+FEEMKKDA PNLSTYNI+IDMLC
Sbjct: 329 SLLERQRAKGSIPSVIAYNCILTCLRKMGKVDEALKVFEEMKKDAAPNLSTYNILIDMLC 388

Query: 421 KAGQLETALVVRDAMKDAGLFPNVITVNIMVDRLCKAQRLDDACSIFEGLDHKTCTPDTV 480
           +AG+L+TA  +RD+M+ AGLFPNV TVNIMVDRLCK+Q+LD+AC++FE +D+K CTPD +
Sbjct: 389 RAGKLDTAFELRDSMQKAGLFPNVRTVNIMVDRLCKSQKLDEACAMFEEMDYKVCTPDEI 448

Query: 481 TYCSLIEGLGKHGRVDDAYKLYEQMLDSDQIPNAVVYTSLIRNFFKCGRKEDGHKIYNEM 540
           T+CSLI+GLGK GRVDDAYK+YE+MLDSD   N++VYTSLI+NFF  GRKEDGHKIY +M
Sbjct: 449 TFCSLIDGLGKVGRVDDAYKVYEKMLDSDCRTNSIVYTSLIKNFFNHGRKEDGHKIYKDM 508

Query: 541 IRLGCSPDLLLLNTYMDCVFKAGEIEKGRALFQEIKALGFIPDVRSYTILIHGLVKAGFA 600
           I   CSPDL LLNTYMDC+FKAGE EKGRA+F+EIKA  F+PD RSY+ILIHGL+KAGFA
Sbjct: 509 INQNCSPDLQLLNTYMDCMFKAGEPEKGRAMFEEIKARRFVPDARSYSILIHGLIKAGFA 568

Query: 601 HESYELFYTMKEQGCVLDTRAYNTVINGFCKSGKVNKAYQLLEEMKTKGHEPTVVTYGSV 660
           +E+YELFY+MKEQGCVLDTRAYN VI+GFCK GKVNKAYQLLEEMKTKG EPTVVTYGSV
Sbjct: 569 NETYELFYSMKEQGCVLDTRAYNIVIDGFCKCGKVNKAYQLLEEMKTKGFEPTVVTYGSV 628

Query: 661 IDGLAKIDRLDEAYMLFEEAKSKGVELNVVIYSSLIDGFGKVGRIDEAYLIMEELMQKGL 720
           IDGLAKIDRLDEAYMLFEEAKSK +ELNVVIYSSLIDGFGKVGRIDEAYLI+EELMQKGL
Sbjct: 629 IDGLAKIDRLDEAYMLFEEAKSKRIELNVVIYSSLIDGFGKVGRIDEAYLILEELMQKGL 688

Query: 721 TPSVYTWNCLLDALVKAEEISEALVCFQSMKDLKCTPNYITYSILIHGLCKIRKFNKAFV 780
           TP++YTWN LLDALVKAEEI+EALVCFQSMK+LKCTPN +TY ILI+GLCK+RKFNKAFV
Sbjct: 689 TPNLYTWNSLLDALVKAEEINEALVCFQSMKELKCTPNQVTYGILINGLCKVRKFNKAFV 748

Query: 781 FWQEMQKQGLKPNVFTYTTMISGLAKAGNIAEANALFEKFKEKGGVPDSAIYNAIIEGLS 840
           FWQEMQKQG+KP+  +YTTMISGLAKAGNIAEA ALF++FK  GGVPDSA YNA+IEGLS
Sbjct: 749 FWQEMQKQGMKPSTISYTTMISGLAKAGNIAEAGALFDRFKANGGVPDSACYNAMIEGLS 808

Query: 841 NANRALDAYRIFEETRSKGCSIHTKTCVVLLDSLHKAECIEQAAIVGAVLRETAKAQHAA 900
           N NRA+DA+ +FEETR +G  IH KTCVVLLD+LHK +C+EQAAIVGAVLRET KA+HAA
Sbjct: 809 NGNRAMDAFSLFEETRRRGLPIHNKTCVVLLDTLHKNDCLEQAAIVGAVLRETGKARHAA 868

Query: 901 RSW 904
           RSW
Sbjct: 869 RSW 871
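
The percentages on an HSP statistics line are the match counts divided by the alignment length (for the hit above, 652 identities over 843 aligned columns). As a minimal sketch, assuming the line layout used in this report, those values could be read and re-derived as follows. The function name `parse_hsp_stats` and the dictionary keys are illustrative and not part of any BLAST tool.

```python
import re

# Pattern for the HSP statistics line as printed in this report.
# It accepts both the "Positives" and "Postives" spellings of the label.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return counts and recomputed percentages, or None if the line does not match."""
    m = HSP_STATS.search(line)
    if m is None:
        return None
    identities = int(m.group(1))
    length = int(m.group(2))
    positives = int(m.group(4))
    return {
        "identities": identities,
        "positives": positives,
        "alignment_length": length,
        # Recompute the percentages from the raw counts.
        "identity_pct": round(100 * identities / length, 2),
        "positive_pct": round(100 * positives / length, 2),
        # Percentages as printed in the report, for comparison.
        "reported_identity_pct": float(m.group(3)),
        "reported_positive_pct": float(m.group(6)),
    }


stats = parse_hsp_stats(
    "Identity = 652/843 (77.34%), Positives = 748/843 (88.73%), Query Frame = 0"
)
print(stats)
```

Comparing the recomputed and reported percentages is a cheap consistency check when collecting hits from several database sections into one table.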

BLAST of Bhi01G000722 vs. ExPASy Swiss-Prot
Match: Q9SZ52 (Pentatricopeptide repeat-containing protein At4g31850, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PGR3 PE=1 SV=1)

HSP 1 Score: 373.2 bits (957), Expect = 8.1e-102
Identity = 244/793 (30.77%), Positives = 389/793 (49.05%), Query Frame = 0

Query: 121  ERVTDLAHCPE--AYNSLLMVMARTRKFNCLEQILEEMSIAGFGPSNNTCIEIVLSLVKS 180
            E++    H P+   Y +LL   +  R  + ++Q   EM   G  P   T   +V +L K+
Sbjct: 317  EKMKTGRHKPDRVTYITLLDRFSDNRDLDSVKQFWSEMEKDGHVPDVVTFTILVDALCKA 376

Query: 181  RKLREAFTFMQTMRKLKFRPAFSAYTTLIGALSASHDSDCMLTLFQQMQELGYEVNVHLF 240
                EAF  +  MR     P    Y TLI  L   H  D  L LF  M+ LG +   + +
Sbjct: 377  GNFGEAFDTLDVMRDQGILPNLHTYNTLICGLLRVHRLDDALELFGNMESLGVKPTAYTY 436

Query: 241  TTLIRVFAREGRVDAALSLLDEMKSNSLEPDVVLYNVCIDCFGKAGKVDMAWKFFHEMKA 300
               I  + + G   +AL   ++MK+  + P++V  N  +    KAG+   A + F+ +K 
Sbjct: 437  IVFIDYYGKSGDSVSALETFEKMKTKGIAPNIVACNASLYSLAKAGRDREAKQIFYGLKD 496

Query: 301  NGLVLDDVTHTSMIGVLCKADRMNEAVELFEHMDQNKQVPCAYAYNTMIMGYGMAGKFDE 360
             GLV D VT+  M+    K   ++EA++L   M +N   P     N++I     A + DE
Sbjct: 497  IGLVPDSVTYNMMMKCYSKVGEIDEAIKLLSEMMENGCEPDVIVVNSLINTLYKADRVDE 556

Query: 361  AYSLLERQRRKGCIPSVVAYNCILTCLGRKGRVDEALKLFEEM-KKDAIPNLSTYNIMID 420
            A+ +  R +     P+VV YN +L  LG+ G++ EA++LFE M +K   PN  T+N + D
Sbjct: 557  AWKMFMRMKEMKLKPTVVTYNTLLAGLGKNGKIQEAIELFEGMVQKGCPPNTITFNTLFD 616

Query: 421  MLCKAGQLETALVVRDAMKDAGLFPNVITVNIMVDRLCKAQRLDDACSIFEGLDHKTCTP 480
             LCK  ++  AL +   M D G  P+V T N ++  L K  ++ +A   F  +  K   P
Sbjct: 617  CLCKNDEVTLALKMLFKMMDMGCVPDVFTYNTIIFGLVKNGQVKEAMCFFHQM-KKLVYP 676

Query: 481  DTVTYCSLIEGLGKHGRVDDAYKLYEQML--DSDQ------------------IPNAVVY 540
            D VT C+L+ G+ K   ++DAYK+    L   +DQ                  I NAV +
Sbjct: 677  DFVTLCTLLPGVVKASLIEDAYKIITNFLYNCADQPANLFWEDLIGSILAEAGIDNAVSF 736

Query: 541  TS-LIRNFFKCGRKEDGHKIYNEMIR---------------------LGCSPDLLLLNTY 600
            +  L+ N    G   DG  I   +IR                     LG  P L   N  
Sbjct: 737  SERLVAN----GICRDGDSILVPIIRYSCKHNNVSGARTLFEKFTKDLGVQPKLPTYNLL 796

Query: 601  MDCVFKAGEIEKGRALFQEIKALGFIPDVRSYTILIHGLVKAGFAHESYELFYTMKEQGC 660
            +  + +A  IE  + +F ++K+ G IPDV +Y  L+    K+G   E +EL+  M    C
Sbjct: 797  IGGLLEADMIEIAQDVFLQVKSTGCIPDVATYNFLLDAYGKSGKIDELFELYKEMSTHEC 856

Query: 661  VLDTRAYNTVINGFCKSGKVNKAYQLLEE-MKTKGHEPTVVTYGSVIDGLAKIDRLDEAY 720
              +T  +N VI+G  K+G V+ A  L  + M  +   PT  TYG +IDGL+K  RL EA 
Sbjct: 857  EANTITHNIVISGLVKAGNVDDALDLYYDLMSDRDFSPTACTYGPLIDGLSKSGRLYEAK 916

Query: 721  MLFEEAKSKGVELNVVIYSSLIDGFGKVGRIDEAYLIMEELMQKGLTPSVYTWNCLLDAL 780
             LFE     G   N  IY+ LI+GFGK G  D A  + + ++++G+ P + T++ L+D L
Sbjct: 917  QLFEGMLDYGCRPNCAIYNILINGFGKAGEADAACALFKRMVKEGVRPDLKTYSVLVDCL 976

Query: 781  VKAEEISEALVCFQSMKDLKCTPNYITYSILIHGLCKIRKFNKAFVFWQEMQ-KQGLKPN 840
                 + E L  F+ +K+    P+ + Y+++I+GL K  +  +A V + EM+  +G+ P+
Sbjct: 977  CMVGRVDEGLHYFKELKESGLNPDVVCYNLIINGLGKSHRLEEALVLFNEMKTSRGITPD 1036

Query: 841  VFTYTTMISGLAKAGNIAEANALFEKFKEKGGVPDSAIYNAIIEGLSNANRALDAYRIFE 867
            ++TY ++I  L  AG + EA  ++ + +  G  P+   +NA+I G S + +   AY +++
Sbjct: 1037 LYTYNSLILNLGIAGMVEEAGKIYNEIQRAGLEPNVFTFNALIRGYSLSGKPEHAYAVYQ 1096

BLAST of Bhi01G000722 vs. ExPASy Swiss-Prot
Match: Q9M9X9 (Pentatricopeptide repeat-containing protein At1g06710, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At1g06710 PE=3 SV=1)

HSP 1 Score: 326.2 bits (835), Expect = 1.1e-87
Identity = 210/729 (28.81%), Positives = 329/729 (45.13%), Query Frame = 0

Query: 187 FMQTMRKLKFRPAFSAYTTLIGALSASHDSDCMLTLFQQMQELGYEVNVHLFTTLIRVFA 246
           F+   R++ ++     Y  L+  +    D        QQ+++   EV       L+R   
Sbjct: 152 FVWAGRQIGYKHTAPVYNALVDLIVRDDDEKVPEEFLQQIRDDDKEVFGEFLNVLVRKHC 211

Query: 247 REGRVDAALSLLDEMKSNSLEPDVVLYNVCIDCFGKAGKVDMAWKFFHEMKANGLVLDDV 306
           R G    AL  L  +K     P    YN  I  F KA ++D A     EM    L +D  
Sbjct: 212 RNGSFSIALEELGRLKDFRFRPSRSTYNCLIQAFLKADRLDSASLIHREMSLANLRMDGF 271

Query: 307 THTSMIGVLCKADRMNEAVELFEHMDQNKQVPCAYAYNTMIMGYGMAGKFDEAYSLLERQ 366
           T       LCK  +  EA+ L E       VP    Y  +I G   A  F+EA   L R 
Sbjct: 272 TLRCFAYSLCKVGKWREALTLVE---TENFVPDTVFYTKLISGLCEASLFEEAMDFLNRM 331

Query: 367 RRKGCIPSVVAYNCILT-CLGRK--GRVDEALKLFEEMKKDAIPNLSTYNIMIDMLCKAG 426
           R   C+P+VV Y+ +L  CL +K  GR    L +   M +   P+   +N ++   C +G
Sbjct: 332 RATSCLPNVVTYSTLLCGCLNKKQLGRCKRVLNMM--MMEGCYPSPKIFNSLVHAYCTSG 391

Query: 427 QLETALVVRDAMKDAGLFPNVITVNIMVDRLC------KAQRLDDACSIFEGLDHKTCTP 486
               A  +   M   G  P  +  NI++  +C          LD A   +  +       
Sbjct: 392 DHSYAYKLLKKMVKCGHMPGYVVYNILIGSICGDKDSLNCDLLDLAEKAYSEMLAAGVVL 451

Query: 487 DTVTYCSLIEGLGKHGRVDDAYKLYEQMLDSDQIPNAVVYTSLIRNFFKCGRKEDGHKIY 546
           + +   S    L   G+ + A+ +  +M+    IP+   Y+ ++       + E    ++
Sbjct: 452 NKINVSSFTRCLCSAGKYEKAFSVIREMIGQGFIPDTSTYSKVLNYLCNASKMELAFLLF 511

Query: 547 NEMIRLGCSPDLLLLNTYMDCVFKAGEIEKGRALFQEIKALGFIPDVRSYTILIHGLVKA 606
            EM R G   D+      +D   KAG IE+ R  F E++ +G  P+V +YT LIH  +KA
Sbjct: 512 EEMKRGGLVADVYTYTIMVDSFCKAGLIEQARKWFNEMREVGCTPNVVTYTALIHAYLKA 571

Query: 607 GFAHESYELFYTMKEQGCVLDTRAYNTVINGFCKSGKVNKAYQLLEEM------------ 666
                + ELF TM  +GC+ +   Y+ +I+G CK+G+V KA Q+ E M            
Sbjct: 572 KKVSYANELFETMLSEGCLPNIVTYSALIDGHCKAGQVEKACQIFERMCGSKDVPDVDMY 631

Query: 667 ----KTKGHEPTVVTYGSVIDGLAKIDRLDEAYMLFEEAKSKGVELNVVIYSSLIDGFGK 726
                     P VVTYG+++DG  K  R++EA  L +    +G E N ++Y +LIDG  K
Sbjct: 632 FKQYDDNSERPNVVTYGALLDGFCKSHRVEEARKLLDAMSMEGCEPNQIVYDALIDGLCK 691

Query: 727 VGRIDEAYLIMEELMQKGLTPSVYTWNCLLDALVKAEEISEALVCFQSMKDLKCTPNYIT 786
           VG++DEA  +  E+ + G   ++YT++ L+D   K +    A      M +  C PN + 
Sbjct: 692 VGKLDEAQEVKTEMSEHGFPATLYTYSSLIDRYFKVKRQDLASKVLSKMLENSCAPNVVI 751

Query: 787 YSILIHGLCKIRKFNKAFVFWQEMQKQGLKPNVFTYTTMISGLAKAGNIAEANALFEKFK 846
           Y+ +I GLCK+ K ++A+   Q M+++G +PNV TYT MI G    G I     L E+  
Sbjct: 752 YTEMIDGLCKVGKTDEAYKLMQMMEEKGCQPNVVTYTAMIDGFGMIGKIETCLELLERMG 811

Query: 847 EKGGVPDSAIYNAIIEGLSNANRALD-AYRIFEETRSKGCSIHTKTCVVLLDSLHKAECI 890
            KG  P+   Y  +I+     N ALD A+ + EE +      HT     +++  +K E I
Sbjct: 812 SKGVAPNYVTYRVLIDHCCK-NGALDVAHNLLEEMKQTHWPTHTAGYRKVIEGFNK-EFI 871

BLAST of Bhi01G000722 vs. ExPASy Swiss-Prot
Match: Q76C99 (Protein Rf1, mitochondrial OS=Oryza sativa subsp. indica OX=39946 GN=Rf1 PE=2 SV=1)

HSP 1 Score: 316.6 bits (810), Expect = 9.0e-85
Identity = 206/705 (29.22%), Positives = 338/705 (47.94%), Query Frame = 0

Query: 177 KSRKLREAFTFMQTMRKLKFRPAFSAYTTLIGALSA-SHDSDCMLTLFQQMQELGYEVNV 236
           ++ +L   F  +  + K  FR    A+T L+  L A    SD M  + ++M ELG   NV
Sbjct: 99  RAGRLDLGFAALGNVIKKGFRVDAIAFTPLLKGLCADKRTSDAMDIVLRRMTELGCIPNV 158

Query: 237 HLFTTLIRVFAREGRVDAALSLLDEM---KSNSLEPDVVLYNVCIDCFGKAGKVDMAWKF 296
             +  L++    E R   AL LL  M   +     PDVV Y   I+ F K G  D A+  
Sbjct: 159 FSYNILLKGLCDENRSQEALELLHMMADDRGGGSPPDVVSYTTVINGFFKEGDSDKAYST 218

Query: 297 FHEMKANGLVLDDVTHTSMIGVLCKADRMNEAVELFEHMDQNKQVPCAYAYNTMIMGYGM 356
           +HEM   G++ D VT+ S+I  LCKA  M++A+E+   M +N  +P    YN+++ GY  
Sbjct: 219 YHEMLDRGILPDVVTYNSIIAALCKAQAMDKAMEVLNTMVKNGVMPDCMTYNSILHGYCS 278

Query: 357 AGKFDEAYSLLERQRRKGCIPSVVAYNCILTCLGRKGRVDEALKLFEEMKKDAI-PNLST 416
           +G+  EA   L++ R  G  P VV Y+ ++  L + GR  EA K+F+ M K  + P ++T
Sbjct: 279 SGQPKEAIGFLKKMRSDGVEPDVVTYSLLMDYLCKNGRCMEARKIFDSMTKRGLKPEITT 338

Query: 417 YNIMIDMLCKAGQLETALVVRDAMKDAGLFPNVITVNIMVDRLCKAQRLDDACSIFEGLD 476
           Y  ++      G L     + D M   G+ P+    +I++    K  ++D A  +F  + 
Sbjct: 339 YGTLLQGYATKGALVEMHGLLDLMVRNGIHPDHYVFSILICAYAKQGKVDQAMLVFSKMR 398

Query: 477 HKTCTPDTVTYCSLIEGLGKHGRVDDAYKLYEQMLDSDQIPNAVVYTSLIRNFFKCGRKE 536
            +   P+ VTY ++I  L K GRV+DA   +EQM+D    P  +VY SLI     C + E
Sbjct: 399 QQGLNPNAVTYGAVIGILCKSGRVEDAMLYFEQMIDEGLSPGNIVYNSLIHGLCTCNKWE 458

Query: 537 DGHKIYNEMIRLGCSPDLLLLNTYMDCVFKAGEIEKGRALFQEIKALGFIPDVRSYTILI 596
              ++  EM+  G   + +  N+ +D   K G + +   LF+ +  +G  P+V +     
Sbjct: 459 RAEELILEMLDRGICLNTIFFNSIIDSHCKEGRVIESEKLFELMVRIGVKPNVIT----- 518

Query: 597 HGLVKAGFAHESYELFYTMKEQGCVLDTRAYNTVINGFCKSGKVNKAYQLLEEMKTKGHE 656
                                         YNT+ING+C +GK+++A +LL  M + G +
Sbjct: 519 ------------------------------YNTLINGYCLAGKMDEAMKLLSGMVSVGLK 578

Query: 657 PTVVTYGSVIDGLAKIDRLDEAYMLFEEAKSKGVELNVVIYSSLIDGFGKVGRIDEAYLI 716
           P  VTY ++I+G  KI R+++A +LF+E +S GV  +++ Y+ ++ G  +  R   A  +
Sbjct: 579 PNTVTYSTLINGYCKISRMEDALVLFKEMESSGVSPDIITYNIILQGLFQTRRTAAAKEL 638

Query: 717 MEELMQKGLTPSVYTWNCLLDALVKAEEISEALVCFQS--MKDLKCTPNYITYSILIHGL 776
              + + G    + T+N +L  L K +   +AL  FQ+  + DLK      T++I+I  L
Sbjct: 639 YVRITESGTQIELSTYNIILHGLCKNKLTDDALQMFQNLCLMDLKLEAR--TFNIMIDAL 698

Query: 777 CKIRKFNKAFVFWQEMQKQGLKPNVFTYTTMISGLAKAGNIAEANALFEKFKEKGGVPDS 836
            K+ + ++A   +      GL PN +TY  M   +   G + E + LF   ++ G   DS
Sbjct: 699 LKVGRNDEAKDLFVAFSSNGLVPNYWTYRLMAENIIGQGLLEELDQLFLSMEDNGCTVDS 758

Query: 837 AIYNAIIEGLSNANRALDAYRIFEETRSKGCSIHTKTCVVLLDSL 875
            + N I+  L        A         K  S+   T  + +D L
Sbjct: 759 GMLNFIVRELLQRGEITRAGTYLSMIDEKHFSLEASTASLFIDLL 766

BLAST of Bhi01G000722 vs. ExPASy Swiss-Prot
Match: Q9LVQ5 (Pentatricopeptide repeat-containing protein At5g55840 OS=Arabidopsis thaliana OX=3702 GN=At5g55840 PE=3 SV=2)

HSP 1 Score: 304.7 bits (779), Expect = 3.5e-81
Identity = 194/759 (25.56%), Positives = 356/759 (46.90%), Query Frame = 0

Query: 130 PEAYNSLLMVMARTRKFNCLEQILEEMSIAGFGPSNNTCIEIVLSLVKSRKLREAFTFMQ 189
           P  Y+ L+ V  R        +I   M + GF PS  TC  I+ S+VKS +    ++F++
Sbjct: 123 PSVYDILIRVYLREGMIQDSLEIFRLMGLYGFNPSVYTCNAILGSVVKSGEDVSVWSFLK 182

Query: 190 TMRKLKFRPAFSAYTTLIGALSASHDSDCMLTLFQQMQELGYEVNVHLFTTLIRVFAREG 249
            M K K  P  + +  LI  L A    +    L Q+M++ GY   +  + T++  + ++G
Sbjct: 183 EMLKRKICPDVATFNILINVLCAEGSFEKSSYLMQKMEKSGYAPTIVTYNTVLHWYCKKG 242

Query: 250 RVDAALSLLDEMKSNSLEPDVVLYNVCIDCFGKAGKVDMAWKFFHEMKANGLVLDDVTHT 309
           R  AA+ LLD MKS  ++ DV  YN+ I    ++ ++   +    +M+   +  ++VT+ 
Sbjct: 243 RFKAAIELLDHMKSKGVDADVCTYNMLIHDLCRSNRIAKGYLLLRDMRKRMIHPNEVTYN 302

Query: 310 SMIGVLCKADRMNEAVELFEHMDQNKQVPCAYAYNTMIMGYGMAGKFDEAYSLLERQRRK 369
           ++I       ++  A +L   M      P    +N +I G+   G F EA  +      K
Sbjct: 303 TLINGFSNEGKVLIASQLLNEMLSFGLSPNHVTFNALIDGHISEGNFKEALKMFYMMEAK 362

Query: 370 GCIPSVVAYNCILTCLGRKGRVDEALKLFEEMKKDAI-PNLSTYNIMIDMLCKAGQLETA 429
           G  PS V+Y  +L  L +    D A   +  MK++ +     TY  MID LCK G L+ A
Sbjct: 363 GLTPSEVSYGVLLDGLCKNAEFDLARGFYMRMKRNGVCVGRITYTGMIDGLCKNGFLDEA 422

Query: 430 LVVRDAMKDAGLFPNVITVNIMVDRLCKAQRLDDACSIFEGLDHKTCTPDTVTYCSLIEG 489
           +V+ + M   G+ P+++T + +++  CK  R   A  I   +     +P+ + Y +LI  
Sbjct: 423 VVLLNEMSKDGIDPDIVTYSALINGFCKVGRFKTAKEIVCRIYRVGLSPNGIIYSTLIYN 482

Query: 490 LGKHGRVDDAYKLYEQMLDSDQIPNAVVYTSLIRNFFKCGRKEDGHKIYNEMIRLGCSPD 549
             + G + +A ++YE M+      +   +  L+ +  K G+  +  +    M   G  P+
Sbjct: 483 CCRMGCLKEAIRIYEAMILEGHTRDHFTFNVLVTSLCKAGKVAEAEEFMRCMTSDGILPN 542

Query: 550 LLLLNTYMDCVFKAGEIEKGRALFQEIKALGFIPDVRSYTILIHGLVKAGFAHESYELFY 609
            +  +  ++    +GE  K  ++F E+  +G  P   +Y  L+ GL K G   E+ +   
Sbjct: 543 TVSFDCLINGYGNSGEGLKAFSVFDEMTKVGHHPTFFTYGSLLKGLCKGGHLREAEKFLK 602

Query: 610 TMKEQGCVLDTRAYNTVINGFCKSGKVNKAYQLLEEMKTKGHEPTVVTYGSVIDGLAKID 669
           ++      +DT  YNT++   CKSG + KA  L  EM  +   P   TY S+I GL +  
Sbjct: 603 SLHAVPAAVDTVMYNTLLTAMCKSGNLAKAVSLFGEMVQRSILPDSYTYTSLISGLCRKG 662

Query: 670 RLDEAYMLFEEAKSKGVEL-NVVIYSSLIDGFGKVGRIDEAYLIMEELMQKGLTPSVYTW 729
           +   A +  +EA+++G  L N V+Y+  +DG  K G+        E++   G TP + T 
Sbjct: 663 KTVIAILFAKEAEARGNVLPNKVMYTCFVDGMFKAGQWKAGIYFREQMDNLGHTPDIVTT 722

Query: 730 NCLLDALVKAEEISEALVCFQSMKDLKCTPNYITYSILIHGLCKIRKFNKAFVFWQEMQK 789
           N ++D   +  +I +       M +    PN  TY+IL+HG  K +  + +F+ ++ +  
Sbjct: 723 NAMIDGYSRMGKIEKTNDLLPEMGNQNGGPNLTTYNILLHGYSKRKDVSTSFLLYRSIIL 782

Query: 790 QGLKPNVFTYTTMISGLAKAGNIAEANALFEKFKEKGGVPDSAIYNAIIEGLSNANRALD 849
            G+ P+  T  +++ G+ ++  +     + + F  +G   D   +N +I           
Sbjct: 783 NGILPDKLTCHSLVLGICESNMLEIGLKILKAFICRGVEVDRYTFNMLISKCCANGEINW 842

Query: 850 AYRIFEETRSKGCSIHTKTCVVLLDSLHKAECIEQAAIV 887
           A+ + +   S G S+   TC  ++  L++    +++ +V
Sbjct: 843 AFDLVKVMTSLGISLDKDTCDAMVSVLNRNHRFQESRMV 881

BLAST of Bhi01G000722 vs. NCBI nr
Match: XP_038896865.1 (pentatricopeptide repeat-containing protein At3g06920 isoform X1 [Benincasa hispida] >XP_038896883.1 pentatricopeptide repeat-containing protein At3g06920 isoform X1 [Benincasa hispida] >XP_038896892.1 pentatricopeptide repeat-containing protein At3g06920 isoform X1 [Benincasa hispida])

HSP 1 Score: 1828.1 bits (4734), Expect = 0.0e+00
Identity = 904/904 (100.00%), Positives = 904/904 (100.00%), Query Frame = 0

Query: 1   MKMLLRNIGAGQINCLDLKYRNPIKFSFKFFSSYAGDSSQTTNRNGAPVSGGGGLVPATK 60
           MKMLLRNIGAGQINCLDLKYRNPIKFSFKFFSSYAGDSSQTTNRNGAPVSGGGGLVPATK
Sbjct: 1   MKMLLRNIGAGQINCLDLKYRNPIKFSFKFFSSYAGDSSQTTNRNGAPVSGGGGLVPATK 60

Query: 61  YEDKRQVLDGVCQILETGPWGSLVENKLAELDAKPNTELVIGVLRRLKDVNNAVNYFRWA 120
           YEDKRQVLDGVCQILETGPWGSLVENKLAELDAKPNTELVIGVLRRLKDVNNAVNYFRWA
Sbjct: 61  YEDKRQVLDGVCQILETGPWGSLVENKLAELDAKPNTELVIGVLRRLKDVNNAVNYFRWA 120

Query: 121 ERVTDLAHCPEAYNSLLMVMARTRKFNCLEQILEEMSIAGFGPSNNTCIEIVLSLVKSRK 180
           ERVTDLAHCPEAYNSLLMVMARTRKFNCLEQILEEMSIAGFGPSNNTCIEIVLSLVKSRK
Sbjct: 121 ERVTDLAHCPEAYNSLLMVMARTRKFNCLEQILEEMSIAGFGPSNNTCIEIVLSLVKSRK 180

Query: 181 LREAFTFMQTMRKLKFRPAFSAYTTLIGALSASHDSDCMLTLFQQMQELGYEVNVHLFTT 240
           LREAFTFMQTMRKLKFRPAFSAYTTLIGALSASHDSDCMLTLFQQMQELGYEVNVHLFTT
Sbjct: 181 LREAFTFMQTMRKLKFRPAFSAYTTLIGALSASHDSDCMLTLFQQMQELGYEVNVHLFTT 240

Query: 241 LIRVFAREGRVDAALSLLDEMKSNSLEPDVVLYNVCIDCFGKAGKVDMAWKFFHEMKANG 300
           LIRVFAREGRVDAALSLLDEMKSNSLEPDVVLYNVCIDCFGKAGKVDMAWKFFHEMKANG
Sbjct: 241 LIRVFAREGRVDAALSLLDEMKSNSLEPDVVLYNVCIDCFGKAGKVDMAWKFFHEMKANG 300

Query: 301 LVLDDVTHTSMIGVLCKADRMNEAVELFEHMDQNKQVPCAYAYNTMIMGYGMAGKFDEAY 360
           LVLDDVTHTSMIGVLCKADRMNEAVELFEHMDQNKQVPCAYAYNTMIMGYGMAGKFDEAY
Sbjct: 301 LVLDDVTHTSMIGVLCKADRMNEAVELFEHMDQNKQVPCAYAYNTMIMGYGMAGKFDEAY 360

Query: 361 SLLERQRRKGCIPSVVAYNCILTCLGRKGRVDEALKLFEEMKKDAIPNLSTYNIMIDMLC 420
           SLLERQRRKGCIPSVVAYNCILTCLGRKGRVDEALKLFEEMKKDAIPNLSTYNIMIDMLC
Sbjct: 361 SLLERQRRKGCIPSVVAYNCILTCLGRKGRVDEALKLFEEMKKDAIPNLSTYNIMIDMLC 420

Query: 421 KAGQLETALVVRDAMKDAGLFPNVITVNIMVDRLCKAQRLDDACSIFEGLDHKTCTPDTV 480
           KAGQLETALVVRDAMKDAGLFPNVITVNIMVDRLCKAQRLDDACSIFEGLDHKTCTPDTV
Sbjct: 421 KAGQLETALVVRDAMKDAGLFPNVITVNIMVDRLCKAQRLDDACSIFEGLDHKTCTPDTV 480

Query: 481 TYCSLIEGLGKHGRVDDAYKLYEQMLDSDQIPNAVVYTSLIRNFFKCGRKEDGHKIYNEM 540
           TYCSLIEGLGKHGRVDDAYKLYEQMLDSDQIPNAVVYTSLIRNFFKCGRKEDGHKIYNEM
Sbjct: 481 TYCSLIEGLGKHGRVDDAYKLYEQMLDSDQIPNAVVYTSLIRNFFKCGRKEDGHKIYNEM 540

Query: 541 IRLGCSPDLLLLNTYMDCVFKAGEIEKGRALFQEIKALGFIPDVRSYTILIHGLVKAGFA 600
           IRLGCSPDLLLLNTYMDCVFKAGEIEKGRALFQEIKALGFIPDVRSYTILIHGLVKAGFA
Sbjct: 541 IRLGCSPDLLLLNTYMDCVFKAGEIEKGRALFQEIKALGFIPDVRSYTILIHGLVKAGFA 600

Query: 601 HESYELFYTMKEQGCVLDTRAYNTVINGFCKSGKVNKAYQLLEEMKTKGHEPTVVTYGSV 660
           HESYELFYTMKEQGCVLDTRAYNTVINGFCKSGKVNKAYQLLEEMKTKGHEPTVVTYGSV
Sbjct: 601 HESYELFYTMKEQGCVLDTRAYNTVINGFCKSGKVNKAYQLLEEMKTKGHEPTVVTYGSV 660

Query: 661 IDGLAKIDRLDEAYMLFEEAKSKGVELNVVIYSSLIDGFGKVGRIDEAYLIMEELMQKGL 720
           IDGLAKIDRLDEAYMLFEEAKSKGVELNVVIYSSLIDGFGKVGRIDEAYLIMEELMQKGL
Sbjct: 661 IDGLAKIDRLDEAYMLFEEAKSKGVELNVVIYSSLIDGFGKVGRIDEAYLIMEELMQKGL 720

Query: 721 TPSVYTWNCLLDALVKAEEISEALVCFQSMKDLKCTPNYITYSILIHGLCKIRKFNKAFV 780
           TPSVYTWNCLLDALVKAEEISEALVCFQSMKDLKCTPNYITYSILIHGLCKIRKFNKAFV
Sbjct: 721 TPSVYTWNCLLDALVKAEEISEALVCFQSMKDLKCTPNYITYSILIHGLCKIRKFNKAFV 780

Query: 781 FWQEMQKQGLKPNVFTYTTMISGLAKAGNIAEANALFEKFKEKGGVPDSAIYNAIIEGLS 840
           FWQEMQKQGLKPNVFTYTTMISGLAKAGNIAEANALFEKFKEKGGVPDSAIYNAIIEGLS
Sbjct: 781 FWQEMQKQGLKPNVFTYTTMISGLAKAGNIAEANALFEKFKEKGGVPDSAIYNAIIEGLS 840

Query: 841 NANRALDAYRIFEETRSKGCSIHTKTCVVLLDSLHKAECIEQAAIVGAVLRETAKAQHAA 900
           NANRALDAYRIFEETRSKGCSIHTKTCVVLLDSLHKAECIEQAAIVGAVLRETAKAQHAA
Sbjct: 841 NANRALDAYRIFEETRSKGCSIHTKTCVVLLDSLHKAECIEQAAIVGAVLRETAKAQHAA 900

Query: 901 RSWT 905
           RSWT
Sbjct: 901 RSWT 904

BLAST of Bhi01G000722 vs. NCBI nr
Match: XP_038896901.1 (pentatricopeptide repeat-containing protein At3g06920 isoform X2 [Benincasa hispida])

HSP 1 Score: 1813.9 bits (4697), Expect = 0.0e+00
Identity = 896/896 (100.00%), Positives = 896/896 (100.00%), Query Frame = 0

Query: 9   GAGQINCLDLKYRNPIKFSFKFFSSYAGDSSQTTNRNGAPVSGGGGLVPATKYEDKRQVL 68
           GAGQINCLDLKYRNPIKFSFKFFSSYAGDSSQTTNRNGAPVSGGGGLVPATKYEDKRQVL
Sbjct: 5   GAGQINCLDLKYRNPIKFSFKFFSSYAGDSSQTTNRNGAPVSGGGGLVPATKYEDKRQVL 64

Query: 69  DGVCQILETGPWGSLVENKLAELDAKPNTELVIGVLRRLKDVNNAVNYFRWAERVTDLAH 128
           DGVCQILETGPWGSLVENKLAELDAKPNTELVIGVLRRLKDVNNAVNYFRWAERVTDLAH
Sbjct: 65  DGVCQILETGPWGSLVENKLAELDAKPNTELVIGVLRRLKDVNNAVNYFRWAERVTDLAH 124

Query: 129 CPEAYNSLLMVMARTRKFNCLEQILEEMSIAGFGPSNNTCIEIVLSLVKSRKLREAFTFM 188
           CPEAYNSLLMVMARTRKFNCLEQILEEMSIAGFGPSNNTCIEIVLSLVKSRKLREAFTFM
Sbjct: 125 CPEAYNSLLMVMARTRKFNCLEQILEEMSIAGFGPSNNTCIEIVLSLVKSRKLREAFTFM 184

Query: 189 QTMRKLKFRPAFSAYTTLIGALSASHDSDCMLTLFQQMQELGYEVNVHLFTTLIRVFARE 248
           QTMRKLKFRPAFSAYTTLIGALSASHDSDCMLTLFQQMQELGYEVNVHLFTTLIRVFARE
Sbjct: 185 QTMRKLKFRPAFSAYTTLIGALSASHDSDCMLTLFQQMQELGYEVNVHLFTTLIRVFARE 244

Query: 249 GRVDAALSLLDEMKSNSLEPDVVLYNVCIDCFGKAGKVDMAWKFFHEMKANGLVLDDVTH 308
           GRVDAALSLLDEMKSNSLEPDVVLYNVCIDCFGKAGKVDMAWKFFHEMKANGLVLDDVTH
Sbjct: 245 GRVDAALSLLDEMKSNSLEPDVVLYNVCIDCFGKAGKVDMAWKFFHEMKANGLVLDDVTH 304

Query: 309 TSMIGVLCKADRMNEAVELFEHMDQNKQVPCAYAYNTMIMGYGMAGKFDEAYSLLERQRR 368
           TSMIGVLCKADRMNEAVELFEHMDQNKQVPCAYAYNTMIMGYGMAGKFDEAYSLLERQRR
Sbjct: 305 TSMIGVLCKADRMNEAVELFEHMDQNKQVPCAYAYNTMIMGYGMAGKFDEAYSLLERQRR 364

Query: 369 KGCIPSVVAYNCILTCLGRKGRVDEALKLFEEMKKDAIPNLSTYNIMIDMLCKAGQLETA 428
           KGCIPSVVAYNCILTCLGRKGRVDEALKLFEEMKKDAIPNLSTYNIMIDMLCKAGQLETA
Sbjct: 365 KGCIPSVVAYNCILTCLGRKGRVDEALKLFEEMKKDAIPNLSTYNIMIDMLCKAGQLETA 424

Query: 429 LVVRDAMKDAGLFPNVITVNIMVDRLCKAQRLDDACSIFEGLDHKTCTPDTVTYCSLIEG 488
           LVVRDAMKDAGLFPNVITVNIMVDRLCKAQRLDDACSIFEGLDHKTCTPDTVTYCSLIEG
Sbjct: 425 LVVRDAMKDAGLFPNVITVNIMVDRLCKAQRLDDACSIFEGLDHKTCTPDTVTYCSLIEG 484

Query: 489 LGKHGRVDDAYKLYEQMLDSDQIPNAVVYTSLIRNFFKCGRKEDGHKIYNEMIRLGCSPD 548
           LGKHGRVDDAYKLYEQMLDSDQIPNAVVYTSLIRNFFKCGRKEDGHKIYNEMIRLGCSPD
Sbjct: 485 LGKHGRVDDAYKLYEQMLDSDQIPNAVVYTSLIRNFFKCGRKEDGHKIYNEMIRLGCSPD 544

Query: 549 LLLLNTYMDCVFKAGEIEKGRALFQEIKALGFIPDVRSYTILIHGLVKAGFAHESYELFY 608
           LLLLNTYMDCVFKAGEIEKGRALFQEIKALGFIPDVRSYTILIHGLVKAGFAHESYELFY
Sbjct: 545 LLLLNTYMDCVFKAGEIEKGRALFQEIKALGFIPDVRSYTILIHGLVKAGFAHESYELFY 604

Query: 609 TMKEQGCVLDTRAYNTVINGFCKSGKVNKAYQLLEEMKTKGHEPTVVTYGSVIDGLAKID 668
           TMKEQGCVLDTRAYNTVINGFCKSGKVNKAYQLLEEMKTKGHEPTVVTYGSVIDGLAKID
Sbjct: 605 TMKEQGCVLDTRAYNTVINGFCKSGKVNKAYQLLEEMKTKGHEPTVVTYGSVIDGLAKID 664

Query: 669 RLDEAYMLFEEAKSKGVELNVVIYSSLIDGFGKVGRIDEAYLIMEELMQKGLTPSVYTWN 728
           RLDEAYMLFEEAKSKGVELNVVIYSSLIDGFGKVGRIDEAYLIMEELMQKGLTPSVYTWN
Sbjct: 665 RLDEAYMLFEEAKSKGVELNVVIYSSLIDGFGKVGRIDEAYLIMEELMQKGLTPSVYTWN 724

Query: 729 CLLDALVKAEEISEALVCFQSMKDLKCTPNYITYSILIHGLCKIRKFNKAFVFWQEMQKQ 788
           CLLDALVKAEEISEALVCFQSMKDLKCTPNYITYSILIHGLCKIRKFNKAFVFWQEMQKQ
Sbjct: 725 CLLDALVKAEEISEALVCFQSMKDLKCTPNYITYSILIHGLCKIRKFNKAFVFWQEMQKQ 784

Query: 789 GLKPNVFTYTTMISGLAKAGNIAEANALFEKFKEKGGVPDSAIYNAIIEGLSNANRALDA 848
           GLKPNVFTYTTMISGLAKAGNIAEANALFEKFKEKGGVPDSAIYNAIIEGLSNANRALDA
Sbjct: 785 GLKPNVFTYTTMISGLAKAGNIAEANALFEKFKEKGGVPDSAIYNAIIEGLSNANRALDA 844

Query: 849 YRIFEETRSKGCSIHTKTCVVLLDSLHKAECIEQAAIVGAVLRETAKAQHAARSWT 905
           YRIFEETRSKGCSIHTKTCVVLLDSLHKAECIEQAAIVGAVLRETAKAQHAARSWT
Sbjct: 845 YRIFEETRSKGCSIHTKTCVVLLDSLHKAECIEQAAIVGAVLRETAKAQHAARSWT 900

BLAST of Bhi01G000722 vs. NCBI nr
Match: XP_016898965.1 (PREDICTED: pentatricopeptide repeat-containing protein At3g06920 isoform X2 [Cucumis melo])

HSP 1 Score: 1710.7 bits (4429), Expect = 0.0e+00
Identity = 841/904 (93.03%), Positives = 871/904 (96.35%), Query Frame = 0

Query: 1   MKMLLRNIGAGQINCLDLKYRNPIKFSFKFFSSYAGDSSQTTNRNGAPVSGGGGLVPATK 60
           MKMLLRN GAGQINCLDLKY NPIKFS +FFSS+ GDSSQTTN NG PVSGGG L+P+ K
Sbjct: 1   MKMLLRNKGAGQINCLDLKYGNPIKFSVRFFSSWIGDSSQTTNGNGGPVSGGGDLLPSAK 60

Query: 61  YEDKRQVLDGVCQILETGPWGSLVENKLAELDAKPNTELVIGVLRRLKDVNNAVNYFRWA 120
            E+KRQV+DGVCQILETGPWGS VEN+LAEL   PN ELVIGVLRRLKDVNNAVNYFRWA
Sbjct: 61  NENKRQVVDGVCQILETGPWGSSVENRLAELHINPNPELVIGVLRRLKDVNNAVNYFRWA 120

Query: 121 ERVTDLAHCPEAYNSLLMVMARTRKFNCLEQILEEMSIAGFGPSNNTCIEIVLSLVKSRK 180
           ERVTD AH  EAYNSLLMVMARTRKFNCLEQILEEMSIAGFGPSNNTCIEIVLS +KSRK
Sbjct: 121 ERVTDQAHSHEAYNSLLMVMARTRKFNCLEQILEEMSIAGFGPSNNTCIEIVLSFIKSRK 180

Query: 181 LREAFTFMQTMRKLKFRPAFSAYTTLIGALSASHDSDCMLTLFQQMQELGYEVNVHLFTT 240
           LREAFTF+QTMR+LKFRPAFSAYT LIGALS S DSDCMLTLFQQMQELGY VNVHLFTT
Sbjct: 181 LREAFTFIQTMRRLKFRPAFSAYTNLIGALSTSRDSDCMLTLFQQMQELGYAVNVHLFTT 240

Query: 241 LIRVFAREGRVDAALSLLDEMKSNSLEPDVVLYNVCIDCFGKAGKVDMAWKFFHEMKANG 300
           LIRVFAREGRVDAALSLLDEMKSNSLEPDVVLYNVCIDCFGKAGKVDMAWKFFHEMKANG
Sbjct: 241 LIRVFAREGRVDAALSLLDEMKSNSLEPDVVLYNVCIDCFGKAGKVDMAWKFFHEMKANG 300

Query: 301 LVLDDVTHTSMIGVLCKADRMNEAVELFEHMDQNKQVPCAYAYNTMIMGYGMAGKFDEAY 360
           LVLDDVT+TSMIGVLCKADR+NEAVELFEHMDQNKQVPCAYAYNTMIMGYGMAGKFD+AY
Sbjct: 301 LVLDDVTYTSMIGVLCKADRLNEAVELFEHMDQNKQVPCAYAYNTMIMGYGMAGKFDDAY 360

Query: 361 SLLERQRRKGCIPSVVAYNCILTCLGRKGRVDEALKLFEEMKKDAIPNLSTYNIMIDMLC 420
           SLLERQRRKG IPSVV+YNCIL+CLGRKG+VDEALK FEEMKKDA+PN+STYNIMIDMLC
Sbjct: 361 SLLERQRRKGSIPSVVSYNCILSCLGRKGQVDEALKKFEEMKKDAMPNISTYNIMIDMLC 420

Query: 421 KAGQLETALVVRDAMKDAGLFPNVITVNIMVDRLCKAQRLDDACSIFEGLDHKTCTPDTV 480
           KAG+LETALVVRDAMKDAGLFPNVITVNIMVDRLCKAQRLDDACSIFEGLD+KTCTPD V
Sbjct: 421 KAGKLETALVVRDAMKDAGLFPNVITVNIMVDRLCKAQRLDDACSIFEGLDYKTCTPDAV 480

Query: 481 TYCSLIEGLGKHGRVDDAYKLYEQMLDSDQIPNAVVYTSLIRNFFKCGRKEDGHKIYNEM 540
           TYCSLIEGLGKHGRVD+AYKLYEQMLD++QIPNAVVYTSLIRNFFKCGRKEDGHKIYNEM
Sbjct: 481 TYCSLIEGLGKHGRVDEAYKLYEQMLDANQIPNAVVYTSLIRNFFKCGRKEDGHKIYNEM 540

Query: 541 IRLGCSPDLLLLNTYMDCVFKAGEIEKGRALFQEIKALGFIPDVRSYTILIHGLVKAGFA 600
           IRLGCSPDLLLLNTYMDCVFKAGEIEKGRALFQ+IK LGFIPD RSYTILIHGLVKAGFA
Sbjct: 541 IRLGCSPDLLLLNTYMDCVFKAGEIEKGRALFQDIKTLGFIPDARSYTILIHGLVKAGFA 600

Query: 601 HESYELFYTMKEQGCVLDTRAYNTVINGFCKSGKVNKAYQLLEEMKTKGHEPTVVTYGSV 660
           HE+YELFYTMKEQGCVLDTRAYNTVI+GFCKSGKVNKAYQLLEEMKTKGHEPTVVTYGSV
Sbjct: 601 HEAYELFYTMKEQGCVLDTRAYNTVIDGFCKSGKVNKAYQLLEEMKTKGHEPTVVTYGSV 660

Query: 661 IDGLAKIDRLDEAYMLFEEAKSKGVELNVVIYSSLIDGFGKVGRIDEAYLIMEELMQKGL 720
           IDGLAKIDRLDEAYMLFEEAKSKG+ELNVVIYSSLIDGFGKVGRIDEAYLIMEELMQKGL
Sbjct: 661 IDGLAKIDRLDEAYMLFEEAKSKGIELNVVIYSSLIDGFGKVGRIDEAYLIMEELMQKGL 720

Query: 721 TPSVYTWNCLLDALVKAEEISEALVCFQSMKDLKCTPNYITYSILIHGLCKIRKFNKAFV 780
           TP+VYTWNCLLDALVKAEEISEALVCFQSMKDLKCTPNYITYSILIHGLCKIRKFNKAFV
Sbjct: 721 TPNVYTWNCLLDALVKAEEISEALVCFQSMKDLKCTPNYITYSILIHGLCKIRKFNKAFV 780

Query: 781 FWQEMQKQGLKPNVFTYTTMISGLAKAGNIAEANALFEKFKEKGGVPDSAIYNAIIEGLS 840
           FWQEMQKQG KPNVFTYTTMISGLAKAGNI EAN LFEKFKEKGGV DSAIYNAIIEGLS
Sbjct: 781 FWQEMQKQGFKPNVFTYTTMISGLAKAGNIVEANTLFEKFKEKGGVADSAIYNAIIEGLS 840

Query: 841 NANRALDAYRIFEETRSKGCSIHTKTCVVLLDSLHKAECIEQAAIVGAVLRETAKAQHAA 900
           NANRALDAYR+FEE R KGCSI+TKTCVVLLDSLHKAECIEQAAIVGAVLRETAKAQHAA
Sbjct: 841 NANRALDAYRLFEEARLKGCSIYTKTCVVLLDSLHKAECIEQAAIVGAVLRETAKAQHAA 900

Query: 901 RSWT 905
           RSWT
Sbjct: 901 RSWT 904

BLAST of Bhi01G000722 vs. NCBI nr
Match: XP_004134213.1 (pentatricopeptide repeat-containing protein At3g06920 [Cucumis sativus] >KGN57112.1 hypothetical protein Csa_010703 [Cucumis sativus])

HSP 1 Score: 1706.8 bits (4419), Expect = 0.0e+00
Identity = 838/904 (92.70%), Positives = 869/904 (96.13%), Query Frame = 0

Query: 1   MKMLLRNIGAGQINCLDLKYRNPIKFSFKFFSSYAGDSSQTTNRNGAPVSGGGGLVPATK 60
           MK+LLRN GAGQINCLDLK  NPIKFS +FFSS+ GDSSQTTN NG PV GGG L+P+ K
Sbjct: 1   MKILLRNKGAGQINCLDLKCGNPIKFSVRFFSSWIGDSSQTTNGNGGPVPGGGDLLPSAK 60

Query: 61  YEDKRQVLDGVCQILETGPWGSLVENKLAELDAKPNTELVIGVLRRLKDVNNAVNYFRWA 120
            E+KRQV+D VCQILETGPWGS VEN+LAELD  PN ELVIGVLRRLKDVNNAVNYFRWA
Sbjct: 61  NENKRQVIDSVCQILETGPWGSSVENRLAELDLNPNPELVIGVLRRLKDVNNAVNYFRWA 120

Query: 121 ERVTDLAHCPEAYNSLLMVMARTRKFNCLEQILEEMSIAGFGPSNNTCIEIVLSLVKSRK 180
           ER+TD AHC EAYNSLLMVMARTRKFNCLEQILEEMSIAGFGPSNNTCIEIVLS +KSRK
Sbjct: 121 ERLTDRAHCREAYNSLLMVMARTRKFNCLEQILEEMSIAGFGPSNNTCIEIVLSFIKSRK 180

Query: 181 LREAFTFMQTMRKLKFRPAFSAYTTLIGALSASHDSDCMLTLFQQMQELGYEVNVHLFTT 240
           LREAFTF+QTMRKLKFRPAFSAYT LIGALS S DSDCMLTLFQQMQELGY VNVHLFTT
Sbjct: 181 LREAFTFIQTMRKLKFRPAFSAYTNLIGALSTSRDSDCMLTLFQQMQELGYAVNVHLFTT 240

Query: 241 LIRVFAREGRVDAALSLLDEMKSNSLEPDVVLYNVCIDCFGKAGKVDMAWKFFHEMKANG 300
           LIRVFAREGRVDAALSLLDEMKSNSLEPDVVLYNVCIDCFGKAGKVDMAWKFFHEMKANG
Sbjct: 241 LIRVFAREGRVDAALSLLDEMKSNSLEPDVVLYNVCIDCFGKAGKVDMAWKFFHEMKANG 300

Query: 301 LVLDDVTHTSMIGVLCKADRMNEAVELFEHMDQNKQVPCAYAYNTMIMGYGMAGKFDEAY 360
           LVLDDVT+TSMIGVLCKADR+NEAVELFEHMDQNKQVPCAYAYNTMIMGYGMAGKF++AY
Sbjct: 301 LVLDDVTYTSMIGVLCKADRLNEAVELFEHMDQNKQVPCAYAYNTMIMGYGMAGKFEDAY 360

Query: 361 SLLERQRRKGCIPSVVAYNCILTCLGRKGRVDEALKLFEEMKKDAIPNLSTYNIMIDMLC 420
           SLLERQRRKGCIPSVV+YNCIL+CLGRKG+VDEALK FEEMKKDAIPNLSTYNIMIDMLC
Sbjct: 361 SLLERQRRKGCIPSVVSYNCILSCLGRKGQVDEALKKFEEMKKDAIPNLSTYNIMIDMLC 420

Query: 421 KAGQLETALVVRDAMKDAGLFPNVITVNIMVDRLCKAQRLDDACSIFEGLDHKTCTPDTV 480
           KAG+LETALVVRDAMKDAGLFPNVITVNIMVDRLCKAQRLDDACSIFEGLDHKTC PD V
Sbjct: 421 KAGKLETALVVRDAMKDAGLFPNVITVNIMVDRLCKAQRLDDACSIFEGLDHKTCRPDAV 480

Query: 481 TYCSLIEGLGKHGRVDDAYKLYEQMLDSDQIPNAVVYTSLIRNFFKCGRKEDGHKIYNEM 540
           TYCSLIEGLG+HGRVD+AYKLYEQMLD++QIPNAVVYTSLIRNFFKCGRKEDGHKIYNEM
Sbjct: 481 TYCSLIEGLGRHGRVDEAYKLYEQMLDANQIPNAVVYTSLIRNFFKCGRKEDGHKIYNEM 540

Query: 541 IRLGCSPDLLLLNTYMDCVFKAGEIEKGRALFQEIKALGFIPDVRSYTILIHGLVKAGFA 600
           +RLGCSPDLLLLNTYMDCVFKAGEIEKGRALFQEIK LGFIPD RSYTILIHGLVKAGFA
Sbjct: 541 LRLGCSPDLLLLNTYMDCVFKAGEIEKGRALFQEIKNLGFIPDARSYTILIHGLVKAGFA 600

Query: 601 HESYELFYTMKEQGCVLDTRAYNTVINGFCKSGKVNKAYQLLEEMKTKGHEPTVVTYGSV 660
           HE+YELFYTMKEQGCVLDTRAYNTVI+GFCKSGKVNKAYQLLEEMKTKGHEPTVVTYGSV
Sbjct: 601 HEAYELFYTMKEQGCVLDTRAYNTVIDGFCKSGKVNKAYQLLEEMKTKGHEPTVVTYGSV 660

Query: 661 IDGLAKIDRLDEAYMLFEEAKSKGVELNVVIYSSLIDGFGKVGRIDEAYLIMEELMQKGL 720
           IDGLAKIDRLDEAYMLFEEAKSKG+ELNVVIYSSLIDGFGKVGRIDEAYLIMEELMQKGL
Sbjct: 661 IDGLAKIDRLDEAYMLFEEAKSKGIELNVVIYSSLIDGFGKVGRIDEAYLIMEELMQKGL 720

Query: 721 TPSVYTWNCLLDALVKAEEISEALVCFQSMKDLKCTPNYITYSILIHGLCKIRKFNKAFV 780
           TP+VYTWNCLLDALVKAEEISEALVCFQSMKDLKCTPNYITYSILIHGLCKIRKFNKAFV
Sbjct: 721 TPNVYTWNCLLDALVKAEEISEALVCFQSMKDLKCTPNYITYSILIHGLCKIRKFNKAFV 780

Query: 781 FWQEMQKQGLKPNVFTYTTMISGLAKAGNIAEANALFEKFKEKGGVPDSAIYNAIIEGLS 840
           FWQEMQKQG KPNVFTYTTMISGLAKAGNI EA+ LFEKFKEKGGV DSAIYNAIIEGLS
Sbjct: 781 FWQEMQKQGFKPNVFTYTTMISGLAKAGNIVEADTLFEKFKEKGGVADSAIYNAIIEGLS 840

Query: 841 NANRALDAYRIFEETRSKGCSIHTKTCVVLLDSLHKAECIEQAAIVGAVLRETAKAQHAA 900
           NANRA DAYR+FEE R KGCSI+TKTCVVLLDSLHKAECIEQAAIVGAVLRETAKAQHAA
Sbjct: 841 NANRASDAYRLFEEARLKGCSIYTKTCVVLLDSLHKAECIEQAAIVGAVLRETAKAQHAA 900

Query: 901 RSWT 905
           RSWT
Sbjct: 901 RSWT 904

BLAST of Bhi01G000722 vs. NCBI nr
Match: XP_016898964.1 (PREDICTED: pentatricopeptide repeat-containing protein At3g06920 isoform X1 [Cucumis melo])

HSP 1 Score: 1704.9 bits (4414), Expect = 0.0e+00
Identity = 841/909 (92.52%), Positives = 871/909 (95.82%), Query Frame = 0

Query: 1   MKMLLRN-----IGAGQINCLDLKYRNPIKFSFKFFSSYAGDSSQTTNRNGAPVSGGGGL 60
           MKMLLRN      GAGQINCLDLKY NPIKFS +FFSS+ GDSSQTTN NG PVSGGG L
Sbjct: 1   MKMLLRNKGYLISGAGQINCLDLKYGNPIKFSVRFFSSWIGDSSQTTNGNGGPVSGGGDL 60

Query: 61  VPATKYEDKRQVLDGVCQILETGPWGSLVENKLAELDAKPNTELVIGVLRRLKDVNNAVN 120
           +P+ K E+KRQV+DGVCQILETGPWGS VEN+LAEL   PN ELVIGVLRRLKDVNNAVN
Sbjct: 61  LPSAKNENKRQVVDGVCQILETGPWGSSVENRLAELHINPNPELVIGVLRRLKDVNNAVN 120

Query: 121 YFRWAERVTDLAHCPEAYNSLLMVMARTRKFNCLEQILEEMSIAGFGPSNNTCIEIVLSL 180
           YFRWAERVTD AH  EAYNSLLMVMARTRKFNCLEQILEEMSIAGFGPSNNTCIEIVLS 
Sbjct: 121 YFRWAERVTDQAHSHEAYNSLLMVMARTRKFNCLEQILEEMSIAGFGPSNNTCIEIVLSF 180

Query: 181 VKSRKLREAFTFMQTMRKLKFRPAFSAYTTLIGALSASHDSDCMLTLFQQMQELGYEVNV 240
           +KSRKLREAFTF+QTMR+LKFRPAFSAYT LIGALS S DSDCMLTLFQQMQELGY VNV
Sbjct: 181 IKSRKLREAFTFIQTMRRLKFRPAFSAYTNLIGALSTSRDSDCMLTLFQQMQELGYAVNV 240

Query: 241 HLFTTLIRVFAREGRVDAALSLLDEMKSNSLEPDVVLYNVCIDCFGKAGKVDMAWKFFHE 300
           HLFTTLIRVFAREGRVDAALSLLDEMKSNSLEPDVVLYNVCIDCFGKAGKVDMAWKFFHE
Sbjct: 241 HLFTTLIRVFAREGRVDAALSLLDEMKSNSLEPDVVLYNVCIDCFGKAGKVDMAWKFFHE 300

Query: 301 MKANGLVLDDVTHTSMIGVLCKADRMNEAVELFEHMDQNKQVPCAYAYNTMIMGYGMAGK 360
           MKANGLVLDDVT+TSMIGVLCKADR+NEAVELFEHMDQNKQVPCAYAYNTMIMGYGMAGK
Sbjct: 301 MKANGLVLDDVTYTSMIGVLCKADRLNEAVELFEHMDQNKQVPCAYAYNTMIMGYGMAGK 360

Query: 361 FDEAYSLLERQRRKGCIPSVVAYNCILTCLGRKGRVDEALKLFEEMKKDAIPNLSTYNIM 420
           FD+AYSLLERQRRKG IPSVV+YNCIL+CLGRKG+VDEALK FEEMKKDA+PN+STYNIM
Sbjct: 361 FDDAYSLLERQRRKGSIPSVVSYNCILSCLGRKGQVDEALKKFEEMKKDAMPNISTYNIM 420

Query: 421 IDMLCKAGQLETALVVRDAMKDAGLFPNVITVNIMVDRLCKAQRLDDACSIFEGLDHKTC 480
           IDMLCKAG+LETALVVRDAMKDAGLFPNVITVNIMVDRLCKAQRLDDACSIFEGLD+KTC
Sbjct: 421 IDMLCKAGKLETALVVRDAMKDAGLFPNVITVNIMVDRLCKAQRLDDACSIFEGLDYKTC 480

Query: 481 TPDTVTYCSLIEGLGKHGRVDDAYKLYEQMLDSDQIPNAVVYTSLIRNFFKCGRKEDGHK 540
           TPD VTYCSLIEGLGKHGRVD+AYKLYEQMLD++QIPNAVVYTSLIRNFFKCGRKEDGHK
Sbjct: 481 TPDAVTYCSLIEGLGKHGRVDEAYKLYEQMLDANQIPNAVVYTSLIRNFFKCGRKEDGHK 540

Query: 541 IYNEMIRLGCSPDLLLLNTYMDCVFKAGEIEKGRALFQEIKALGFIPDVRSYTILIHGLV 600
           IYNEMIRLGCSPDLLLLNTYMDCVFKAGEIEKGRALFQ+IK LGFIPD RSYTILIHGLV
Sbjct: 541 IYNEMIRLGCSPDLLLLNTYMDCVFKAGEIEKGRALFQDIKTLGFIPDARSYTILIHGLV 600

Query: 601 KAGFAHESYELFYTMKEQGCVLDTRAYNTVINGFCKSGKVNKAYQLLEEMKTKGHEPTVV 660
           KAGFAHE+YELFYTMKEQGCVLDTRAYNTVI+GFCKSGKVNKAYQLLEEMKTKGHEPTVV
Sbjct: 601 KAGFAHEAYELFYTMKEQGCVLDTRAYNTVIDGFCKSGKVNKAYQLLEEMKTKGHEPTVV 660

Query: 661 TYGSVIDGLAKIDRLDEAYMLFEEAKSKGVELNVVIYSSLIDGFGKVGRIDEAYLIMEEL 720
           TYGSVIDGLAKIDRLDEAYMLFEEAKSKG+ELNVVIYSSLIDGFGKVGRIDEAYLIMEEL
Sbjct: 661 TYGSVIDGLAKIDRLDEAYMLFEEAKSKGIELNVVIYSSLIDGFGKVGRIDEAYLIMEEL 720

Query: 721 MQKGLTPSVYTWNCLLDALVKAEEISEALVCFQSMKDLKCTPNYITYSILIHGLCKIRKF 780
           MQKGLTP+VYTWNCLLDALVKAEEISEALVCFQSMKDLKCTPNYITYSILIHGLCKIRKF
Sbjct: 721 MQKGLTPNVYTWNCLLDALVKAEEISEALVCFQSMKDLKCTPNYITYSILIHGLCKIRKF 780

Query: 781 NKAFVFWQEMQKQGLKPNVFTYTTMISGLAKAGNIAEANALFEKFKEKGGVPDSAIYNAI 840
           NKAFVFWQEMQKQG KPNVFTYTTMISGLAKAGNI EAN LFEKFKEKGGV DSAIYNAI
Sbjct: 781 NKAFVFWQEMQKQGFKPNVFTYTTMISGLAKAGNIVEANTLFEKFKEKGGVADSAIYNAI 840

Query: 841 IEGLSNANRALDAYRIFEETRSKGCSIHTKTCVVLLDSLHKAECIEQAAIVGAVLRETAK 900
           IEGLSNANRALDAYR+FEE R KGCSI+TKTCVVLLDSLHKAECIEQAAIVGAVLRETAK
Sbjct: 841 IEGLSNANRALDAYRLFEEARLKGCSIYTKTCVVLLDSLHKAECIEQAAIVGAVLRETAK 900

Query: 901 AQHAARSWT 905
           AQHAARSWT
Sbjct: 901 AQHAARSWT 909

BLAST of Bhi01G000722 vs. ExPASy TrEMBL
Match: A0A1S4DTD7 (pentatricopeptide repeat-containing protein At3g06920 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103483846 PE=4 SV=1)

HSP 1 Score: 1710.7 bits (4429), Expect = 0.0e+00
Identity = 841/904 (93.03%), Positives = 871/904 (96.35%), Query Frame = 0

Query: 1   MKMLLRNIGAGQINCLDLKYRNPIKFSFKFFSSYAGDSSQTTNRNGAPVSGGGGLVPATK 60
           MKMLLRN GAGQINCLDLKY NPIKFS +FFSS+ GDSSQTTN NG PVSGGG L+P+ K
Sbjct: 1   MKMLLRNKGAGQINCLDLKYGNPIKFSVRFFSSWIGDSSQTTNGNGGPVSGGGDLLPSAK 60

Query: 61  YEDKRQVLDGVCQILETGPWGSLVENKLAELDAKPNTELVIGVLRRLKDVNNAVNYFRWA 120
            E+KRQV+DGVCQILETGPWGS VEN+LAEL   PN ELVIGVLRRLKDVNNAVNYFRWA
Sbjct: 61  NENKRQVVDGVCQILETGPWGSSVENRLAELHINPNPELVIGVLRRLKDVNNAVNYFRWA 120

Query: 121 ERVTDLAHCPEAYNSLLMVMARTRKFNCLEQILEEMSIAGFGPSNNTCIEIVLSLVKSRK 180
           ERVTD AH  EAYNSLLMVMARTRKFNCLEQILEEMSIAGFGPSNNTCIEIVLS +KSRK
Sbjct: 121 ERVTDQAHSHEAYNSLLMVMARTRKFNCLEQILEEMSIAGFGPSNNTCIEIVLSFIKSRK 180

Query: 181 LREAFTFMQTMRKLKFRPAFSAYTTLIGALSASHDSDCMLTLFQQMQELGYEVNVHLFTT 240
           LREAFTF+QTMR+LKFRPAFSAYT LIGALS S DSDCMLTLFQQMQELGY VNVHLFTT
Sbjct: 181 LREAFTFIQTMRRLKFRPAFSAYTNLIGALSTSRDSDCMLTLFQQMQELGYAVNVHLFTT 240

Query: 241 LIRVFAREGRVDAALSLLDEMKSNSLEPDVVLYNVCIDCFGKAGKVDMAWKFFHEMKANG 300
           LIRVFAREGRVDAALSLLDEMKSNSLEPDVVLYNVCIDCFGKAGKVDMAWKFFHEMKANG
Sbjct: 241 LIRVFAREGRVDAALSLLDEMKSNSLEPDVVLYNVCIDCFGKAGKVDMAWKFFHEMKANG 300

Query: 301 LVLDDVTHTSMIGVLCKADRMNEAVELFEHMDQNKQVPCAYAYNTMIMGYGMAGKFDEAY 360
           LVLDDVT+TSMIGVLCKADR+NEAVELFEHMDQNKQVPCAYAYNTMIMGYGMAGKFD+AY
Sbjct: 301 LVLDDVTYTSMIGVLCKADRLNEAVELFEHMDQNKQVPCAYAYNTMIMGYGMAGKFDDAY 360

Query: 361 SLLERQRRKGCIPSVVAYNCILTCLGRKGRVDEALKLFEEMKKDAIPNLSTYNIMIDMLC 420
           SLLERQRRKG IPSVV+YNCIL+CLGRKG+VDEALK FEEMKKDA+PN+STYNIMIDMLC
Sbjct: 361 SLLERQRRKGSIPSVVSYNCILSCLGRKGQVDEALKKFEEMKKDAMPNISTYNIMIDMLC 420

Query: 421 KAGQLETALVVRDAMKDAGLFPNVITVNIMVDRLCKAQRLDDACSIFEGLDHKTCTPDTV 480
           KAG+LETALVVRDAMKDAGLFPNVITVNIMVDRLCKAQRLDDACSIFEGLD+KTCTPD V
Sbjct: 421 KAGKLETALVVRDAMKDAGLFPNVITVNIMVDRLCKAQRLDDACSIFEGLDYKTCTPDAV 480

Query: 481 TYCSLIEGLGKHGRVDDAYKLYEQMLDSDQIPNAVVYTSLIRNFFKCGRKEDGHKIYNEM 540
           TYCSLIEGLGKHGRVD+AYKLYEQMLD++QIPNAVVYTSLIRNFFKCGRKEDGHKIYNEM
Sbjct: 481 TYCSLIEGLGKHGRVDEAYKLYEQMLDANQIPNAVVYTSLIRNFFKCGRKEDGHKIYNEM 540

Query: 541 IRLGCSPDLLLLNTYMDCVFKAGEIEKGRALFQEIKALGFIPDVRSYTILIHGLVKAGFA 600
           IRLGCSPDLLLLNTYMDCVFKAGEIEKGRALFQ+IK LGFIPD RSYTILIHGLVKAGFA
Sbjct: 541 IRLGCSPDLLLLNTYMDCVFKAGEIEKGRALFQDIKTLGFIPDARSYTILIHGLVKAGFA 600

Query: 601 HESYELFYTMKEQGCVLDTRAYNTVINGFCKSGKVNKAYQLLEEMKTKGHEPTVVTYGSV 660
           HE+YELFYTMKEQGCVLDTRAYNTVI+GFCKSGKVNKAYQLLEEMKTKGHEPTVVTYGSV
Sbjct: 601 HEAYELFYTMKEQGCVLDTRAYNTVIDGFCKSGKVNKAYQLLEEMKTKGHEPTVVTYGSV 660

Query: 661 IDGLAKIDRLDEAYMLFEEAKSKGVELNVVIYSSLIDGFGKVGRIDEAYLIMEELMQKGL 720
           IDGLAKIDRLDEAYMLFEEAKSKG+ELNVVIYSSLIDGFGKVGRIDEAYLIMEELMQKGL
Sbjct: 661 IDGLAKIDRLDEAYMLFEEAKSKGIELNVVIYSSLIDGFGKVGRIDEAYLIMEELMQKGL 720

Query: 721 TPSVYTWNCLLDALVKAEEISEALVCFQSMKDLKCTPNYITYSILIHGLCKIRKFNKAFV 780
           TP+VYTWNCLLDALVKAEEISEALVCFQSMKDLKCTPNYITYSILIHGLCKIRKFNKAFV
Sbjct: 721 TPNVYTWNCLLDALVKAEEISEALVCFQSMKDLKCTPNYITYSILIHGLCKIRKFNKAFV 780

Query: 781 FWQEMQKQGLKPNVFTYTTMISGLAKAGNIAEANALFEKFKEKGGVPDSAIYNAIIEGLS 840
           FWQEMQKQG KPNVFTYTTMISGLAKAGNI EAN LFEKFKEKGGV DSAIYNAIIEGLS
Sbjct: 781 FWQEMQKQGFKPNVFTYTTMISGLAKAGNIVEANTLFEKFKEKGGVADSAIYNAIIEGLS 840

Query: 841 NANRALDAYRIFEETRSKGCSIHTKTCVVLLDSLHKAECIEQAAIVGAVLRETAKAQHAA 900
           NANRALDAYR+FEE R KGCSI+TKTCVVLLDSLHKAECIEQAAIVGAVLRETAKAQHAA
Sbjct: 841 NANRALDAYRLFEEARLKGCSIYTKTCVVLLDSLHKAECIEQAAIVGAVLRETAKAQHAA 900

Query: 901 RSWT 905
           RSWT
Sbjct: 901 RSWT 904

BLAST of Bhi01G000722 vs. ExPASy TrEMBL
Match: A0A0A0L914 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G154350 PE=4 SV=1)

HSP 1 Score: 1706.8 bits (4419), Expect = 0.0e+00
Identity = 838/904 (92.70%), Positives = 869/904 (96.13%), Query Frame = 0

Query: 1   MKMLLRNIGAGQINCLDLKYRNPIKFSFKFFSSYAGDSSQTTNRNGAPVSGGGGLVPATK 60
           MK+LLRN GAGQINCLDLK  NPIKFS +FFSS+ GDSSQTTN NG PV GGG L+P+ K
Sbjct: 1   MKILLRNKGAGQINCLDLKCGNPIKFSVRFFSSWIGDSSQTTNGNGGPVPGGGDLLPSAK 60

Query: 61  YEDKRQVLDGVCQILETGPWGSLVENKLAELDAKPNTELVIGVLRRLKDVNNAVNYFRWA 120
            E+KRQV+D VCQILETGPWGS VEN+LAELD  PN ELVIGVLRRLKDVNNAVNYFRWA
Sbjct: 61  NENKRQVIDSVCQILETGPWGSSVENRLAELDLNPNPELVIGVLRRLKDVNNAVNYFRWA 120

Query: 121 ERVTDLAHCPEAYNSLLMVMARTRKFNCLEQILEEMSIAGFGPSNNTCIEIVLSLVKSRK 180
           ER+TD AHC EAYNSLLMVMARTRKFNCLEQILEEMSIAGFGPSNNTCIEIVLS +KSRK
Sbjct: 121 ERLTDRAHCREAYNSLLMVMARTRKFNCLEQILEEMSIAGFGPSNNTCIEIVLSFIKSRK 180

Query: 181 LREAFTFMQTMRKLKFRPAFSAYTTLIGALSASHDSDCMLTLFQQMQELGYEVNVHLFTT 240
           LREAFTF+QTMRKLKFRPAFSAYT LIGALS S DSDCMLTLFQQMQELGY VNVHLFTT
Sbjct: 181 LREAFTFIQTMRKLKFRPAFSAYTNLIGALSTSRDSDCMLTLFQQMQELGYAVNVHLFTT 240

Query: 241 LIRVFAREGRVDAALSLLDEMKSNSLEPDVVLYNVCIDCFGKAGKVDMAWKFFHEMKANG 300
           LIRVFAREGRVDAALSLLDEMKSNSLEPDVVLYNVCIDCFGKAGKVDMAWKFFHEMKANG
Sbjct: 241 LIRVFAREGRVDAALSLLDEMKSNSLEPDVVLYNVCIDCFGKAGKVDMAWKFFHEMKANG 300

Query: 301 LVLDDVTHTSMIGVLCKADRMNEAVELFEHMDQNKQVPCAYAYNTMIMGYGMAGKFDEAY 360
           LVLDDVT+TSMIGVLCKADR+NEAVELFEHMDQNKQVPCAYAYNTMIMGYGMAGKF++AY
Sbjct: 301 LVLDDVTYTSMIGVLCKADRLNEAVELFEHMDQNKQVPCAYAYNTMIMGYGMAGKFEDAY 360

Query: 361 SLLERQRRKGCIPSVVAYNCILTCLGRKGRVDEALKLFEEMKKDAIPNLSTYNIMIDMLC 420
           SLLERQRRKGCIPSVV+YNCIL+CLGRKG+VDEALK FEEMKKDAIPNLSTYNIMIDMLC
Sbjct: 361 SLLERQRRKGCIPSVVSYNCILSCLGRKGQVDEALKKFEEMKKDAIPNLSTYNIMIDMLC 420

Query: 421 KAGQLETALVVRDAMKDAGLFPNVITVNIMVDRLCKAQRLDDACSIFEGLDHKTCTPDTV 480
           KAG+LETALVVRDAMKDAGLFPNVITVNIMVDRLCKAQRLDDACSIFEGLDHKTC PD V
Sbjct: 421 KAGKLETALVVRDAMKDAGLFPNVITVNIMVDRLCKAQRLDDACSIFEGLDHKTCRPDAV 480

Query: 481 TYCSLIEGLGKHGRVDDAYKLYEQMLDSDQIPNAVVYTSLIRNFFKCGRKEDGHKIYNEM 540
           TYCSLIEGLG+HGRVD+AYKLYEQMLD++QIPNAVVYTSLIRNFFKCGRKEDGHKIYNEM
Sbjct: 481 TYCSLIEGLGRHGRVDEAYKLYEQMLDANQIPNAVVYTSLIRNFFKCGRKEDGHKIYNEM 540

Query: 541 IRLGCSPDLLLLNTYMDCVFKAGEIEKGRALFQEIKALGFIPDVRSYTILIHGLVKAGFA 600
           +RLGCSPDLLLLNTYMDCVFKAGEIEKGRALFQEIK LGFIPD RSYTILIHGLVKAGFA
Sbjct: 541 LRLGCSPDLLLLNTYMDCVFKAGEIEKGRALFQEIKNLGFIPDARSYTILIHGLVKAGFA 600

Query: 601 HESYELFYTMKEQGCVLDTRAYNTVINGFCKSGKVNKAYQLLEEMKTKGHEPTVVTYGSV 660
           HE+YELFYTMKEQGCVLDTRAYNTVI+GFCKSGKVNKAYQLLEEMKTKGHEPTVVTYGSV
Sbjct: 601 HEAYELFYTMKEQGCVLDTRAYNTVIDGFCKSGKVNKAYQLLEEMKTKGHEPTVVTYGSV 660

Query: 661 IDGLAKIDRLDEAYMLFEEAKSKGVELNVVIYSSLIDGFGKVGRIDEAYLIMEELMQKGL 720
           IDGLAKIDRLDEAYMLFEEAKSKG+ELNVVIYSSLIDGFGKVGRIDEAYLIMEELMQKGL
Sbjct: 661 IDGLAKIDRLDEAYMLFEEAKSKGIELNVVIYSSLIDGFGKVGRIDEAYLIMEELMQKGL 720

Query: 721 TPSVYTWNCLLDALVKAEEISEALVCFQSMKDLKCTPNYITYSILIHGLCKIRKFNKAFV 780
           TP+VYTWNCLLDALVKAEEISEALVCFQSMKDLKCTPNYITYSILIHGLCKIRKFNKAFV
Sbjct: 721 TPNVYTWNCLLDALVKAEEISEALVCFQSMKDLKCTPNYITYSILIHGLCKIRKFNKAFV 780

Query: 781 FWQEMQKQGLKPNVFTYTTMISGLAKAGNIAEANALFEKFKEKGGVPDSAIYNAIIEGLS 840
           FWQEMQKQG KPNVFTYTTMISGLAKAGNI EA+ LFEKFKEKGGV DSAIYNAIIEGLS
Sbjct: 781 FWQEMQKQGFKPNVFTYTTMISGLAKAGNIVEADTLFEKFKEKGGVADSAIYNAIIEGLS 840

Query: 841 NANRALDAYRIFEETRSKGCSIHTKTCVVLLDSLHKAECIEQAAIVGAVLRETAKAQHAA 900
           NANRA DAYR+FEE R KGCSI+TKTCVVLLDSLHKAECIEQAAIVGAVLRETAKAQHAA
Sbjct: 841 NANRASDAYRLFEEARLKGCSIYTKTCVVLLDSLHKAECIEQAAIVGAVLRETAKAQHAA 900

Query: 901 RSWT 905
           RSWT
Sbjct: 901 RSWT 904

BLAST of Bhi01G000722 vs. ExPASy TrEMBL
Match: A0A1S4DSK3 (pentatricopeptide repeat-containing protein At3g06920 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103483846 PE=4 SV=1)

HSP 1 Score: 1704.9 bits (4414), Expect = 0.0e+00
Identity = 841/909 (92.52%), Positives = 871/909 (95.82%), Query Frame = 0

Query: 1   MKMLLRN-----IGAGQINCLDLKYRNPIKFSFKFFSSYAGDSSQTTNRNGAPVSGGGGL 60
           MKMLLRN      GAGQINCLDLKY NPIKFS +FFSS+ GDSSQTTN NG PVSGGG L
Sbjct: 1   MKMLLRNKGYLISGAGQINCLDLKYGNPIKFSVRFFSSWIGDSSQTTNGNGGPVSGGGDL 60

Query: 61  VPATKYEDKRQVLDGVCQILETGPWGSLVENKLAELDAKPNTELVIGVLRRLKDVNNAVN 120
           +P+ K E+KRQV+DGVCQILETGPWGS VEN+LAEL   PN ELVIGVLRRLKDVNNAVN
Sbjct: 61  LPSAKNENKRQVVDGVCQILETGPWGSSVENRLAELHINPNPELVIGVLRRLKDVNNAVN 120

Query: 121 YFRWAERVTDLAHCPEAYNSLLMVMARTRKFNCLEQILEEMSIAGFGPSNNTCIEIVLSL 180
           YFRWAERVTD AH  EAYNSLLMVMARTRKFNCLEQILEEMSIAGFGPSNNTCIEIVLS 
Sbjct: 121 YFRWAERVTDQAHSHEAYNSLLMVMARTRKFNCLEQILEEMSIAGFGPSNNTCIEIVLSF 180

Query: 181 VKSRKLREAFTFMQTMRKLKFRPAFSAYTTLIGALSASHDSDCMLTLFQQMQELGYEVNV 240
           +KSRKLREAFTF+QTMR+LKFRPAFSAYT LIGALS S DSDCMLTLFQQMQELGY VNV
Sbjct: 181 IKSRKLREAFTFIQTMRRLKFRPAFSAYTNLIGALSTSRDSDCMLTLFQQMQELGYAVNV 240

Query: 241 HLFTTLIRVFAREGRVDAALSLLDEMKSNSLEPDVVLYNVCIDCFGKAGKVDMAWKFFHE 300
           HLFTTLIRVFAREGRVDAALSLLDEMKSNSLEPDVVLYNVCIDCFGKAGKVDMAWKFFHE
Sbjct: 241 HLFTTLIRVFAREGRVDAALSLLDEMKSNSLEPDVVLYNVCIDCFGKAGKVDMAWKFFHE 300

Query: 301 MKANGLVLDDVTHTSMIGVLCKADRMNEAVELFEHMDQNKQVPCAYAYNTMIMGYGMAGK 360
           MKANGLVLDDVT+TSMIGVLCKADR+NEAVELFEHMDQNKQVPCAYAYNTMIMGYGMAGK
Sbjct: 301 MKANGLVLDDVTYTSMIGVLCKADRLNEAVELFEHMDQNKQVPCAYAYNTMIMGYGMAGK 360

Query: 361 FDEAYSLLERQRRKGCIPSVVAYNCILTCLGRKGRVDEALKLFEEMKKDAIPNLSTYNIM 420
           FD+AYSLLERQRRKG IPSVV+YNCIL+CLGRKG+VDEALK FEEMKKDA+PN+STYNIM
Sbjct: 361 FDDAYSLLERQRRKGSIPSVVSYNCILSCLGRKGQVDEALKKFEEMKKDAMPNISTYNIM 420

Query: 421 IDMLCKAGQLETALVVRDAMKDAGLFPNVITVNIMVDRLCKAQRLDDACSIFEGLDHKTC 480
           IDMLCKAG+LETALVVRDAMKDAGLFPNVITVNIMVDRLCKAQRLDDACSIFEGLD+KTC
Sbjct: 421 IDMLCKAGKLETALVVRDAMKDAGLFPNVITVNIMVDRLCKAQRLDDACSIFEGLDYKTC 480

Query: 481 TPDTVTYCSLIEGLGKHGRVDDAYKLYEQMLDSDQIPNAVVYTSLIRNFFKCGRKEDGHK 540
           TPD VTYCSLIEGLGKHGRVD+AYKLYEQMLD++QIPNAVVYTSLIRNFFKCGRKEDGHK
Sbjct: 481 TPDAVTYCSLIEGLGKHGRVDEAYKLYEQMLDANQIPNAVVYTSLIRNFFKCGRKEDGHK 540

Query: 541 IYNEMIRLGCSPDLLLLNTYMDCVFKAGEIEKGRALFQEIKALGFIPDVRSYTILIHGLV 600
           IYNEMIRLGCSPDLLLLNTYMDCVFKAGEIEKGRALFQ+IK LGFIPD RSYTILIHGLV
Sbjct: 541 IYNEMIRLGCSPDLLLLNTYMDCVFKAGEIEKGRALFQDIKTLGFIPDARSYTILIHGLV 600

Query: 601 KAGFAHESYELFYTMKEQGCVLDTRAYNTVINGFCKSGKVNKAYQLLEEMKTKGHEPTVV 660
           KAGFAHE+YELFYTMKEQGCVLDTRAYNTVI+GFCKSGKVNKAYQLLEEMKTKGHEPTVV
Sbjct: 601 KAGFAHEAYELFYTMKEQGCVLDTRAYNTVIDGFCKSGKVNKAYQLLEEMKTKGHEPTVV 660

Query: 661 TYGSVIDGLAKIDRLDEAYMLFEEAKSKGVELNVVIYSSLIDGFGKVGRIDEAYLIMEEL 720
           TYGSVIDGLAKIDRLDEAYMLFEEAKSKG+ELNVVIYSSLIDGFGKVGRIDEAYLIMEEL
Sbjct: 661 TYGSVIDGLAKIDRLDEAYMLFEEAKSKGIELNVVIYSSLIDGFGKVGRIDEAYLIMEEL 720

Query: 721 MQKGLTPSVYTWNCLLDALVKAEEISEALVCFQSMKDLKCTPNYITYSILIHGLCKIRKF 780
           MQKGLTP+VYTWNCLLDALVKAEEISEALVCFQSMKDLKCTPNYITYSILIHGLCKIRKF
Sbjct: 721 MQKGLTPNVYTWNCLLDALVKAEEISEALVCFQSMKDLKCTPNYITYSILIHGLCKIRKF 780

Query: 781 NKAFVFWQEMQKQGLKPNVFTYTTMISGLAKAGNIAEANALFEKFKEKGGVPDSAIYNAI 840
           NKAFVFWQEMQKQG KPNVFTYTTMISGLAKAGNI EAN LFEKFKEKGGV DSAIYNAI
Sbjct: 781 NKAFVFWQEMQKQGFKPNVFTYTTMISGLAKAGNIVEANTLFEKFKEKGGVADSAIYNAI 840

Query: 841 IEGLSNANRALDAYRIFEETRSKGCSIHTKTCVVLLDSLHKAECIEQAAIVGAVLRETAK 900
           IEGLSNANRALDAYR+FEE R KGCSI+TKTCVVLLDSLHKAECIEQAAIVGAVLRETAK
Sbjct: 841 IEGLSNANRALDAYRLFEEARLKGCSIYTKTCVVLLDSLHKAECIEQAAIVGAVLRETAK 900

Query: 901 AQHAARSWT 905
           AQHAARSWT
Sbjct: 901 AQHAARSWT 909

BLAST of Bhi01G000722 vs. ExPASy TrEMBL
Match: A0A5D3CY28 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold209G001060 PE=4 SV=1)

HSP 1 Score: 1699.9 bits (4401), Expect = 0.0e+00
Identity = 834/896 (93.08%), Positives = 863/896 (96.32%), Query Frame = 0

Query: 9   GAGQINCLDLKYRNPIKFSFKFFSSYAGDSSQTTNRNGAPVSGGGGLVPATKYEDKRQVL 68
           GAGQINCLDLKY NPIK S +FFSS+ GDSSQTTN NG PVSGGG L+P+ K E+KRQV+
Sbjct: 72  GAGQINCLDLKYGNPIKISVRFFSSWIGDSSQTTNENGGPVSGGGDLLPSAKNENKRQVV 131

Query: 69  DGVCQILETGPWGSLVENKLAELDAKPNTELVIGVLRRLKDVNNAVNYFRWAERVTDLAH 128
           DGVCQILETGPWGS VEN+LAEL   PN ELVIGVLRRLKDVNNAVNYFRWAERVTD AH
Sbjct: 132 DGVCQILETGPWGSSVENRLAELHINPNPELVIGVLRRLKDVNNAVNYFRWAERVTDQAH 191

Query: 129 CPEAYNSLLMVMARTRKFNCLEQILEEMSIAGFGPSNNTCIEIVLSLVKSRKLREAFTFM 188
             EAYNSLLMVMARTRKFNCLEQILEEMSIAGFGPSNNTCIEIVLS +KSRKLREAFTF+
Sbjct: 192 SHEAYNSLLMVMARTRKFNCLEQILEEMSIAGFGPSNNTCIEIVLSFIKSRKLREAFTFI 251

Query: 189 QTMRKLKFRPAFSAYTTLIGALSASHDSDCMLTLFQQMQELGYEVNVHLFTTLIRVFARE 248
           QTMR+LKFRPAFSAYT LIGALS S DSDCMLTLFQQMQELGY VNVHLFTTLIRVFARE
Sbjct: 252 QTMRRLKFRPAFSAYTNLIGALSTSRDSDCMLTLFQQMQELGYAVNVHLFTTLIRVFARE 311

Query: 249 GRVDAALSLLDEMKSNSLEPDVVLYNVCIDCFGKAGKVDMAWKFFHEMKANGLVLDDVTH 308
           GRVDAALSLLDEMKSNSLEPDVVLYNVCIDCFGKAGKVDMAWKFFHEMKANGLVLDDVT+
Sbjct: 312 GRVDAALSLLDEMKSNSLEPDVVLYNVCIDCFGKAGKVDMAWKFFHEMKANGLVLDDVTY 371

Query: 309 TSMIGVLCKADRMNEAVELFEHMDQNKQVPCAYAYNTMIMGYGMAGKFDEAYSLLERQRR 368
           TSMIGVLCKADR+NEAVELFEHMDQNKQVPCAYAYNTMIMGYGMAGKFD+AYSLLERQRR
Sbjct: 372 TSMIGVLCKADRLNEAVELFEHMDQNKQVPCAYAYNTMIMGYGMAGKFDDAYSLLERQRR 431

Query: 369 KGCIPSVVAYNCILTCLGRKGRVDEALKLFEEMKKDAIPNLSTYNIMIDMLCKAGQLETA 428
           KG IPSVV+YNCIL+CLGRKG+VDEALK FEEMKKDA+PN+STYNIMIDMLCKAG+LETA
Sbjct: 432 KGSIPSVVSYNCILSCLGRKGQVDEALKKFEEMKKDAMPNISTYNIMIDMLCKAGKLETA 491

Query: 429 LVVRDAMKDAGLFPNVITVNIMVDRLCKAQRLDDACSIFEGLDHKTCTPDTVTYCSLIEG 488
           LVVRDAMKDAGLFPNVITVNIMVDRLCKAQRLDDACSIFEGLDHKTCTPD VTYCSLIEG
Sbjct: 492 LVVRDAMKDAGLFPNVITVNIMVDRLCKAQRLDDACSIFEGLDHKTCTPDAVTYCSLIEG 551

Query: 489 LGKHGRVDDAYKLYEQMLDSDQIPNAVVYTSLIRNFFKCGRKEDGHKIYNEMIRLGCSPD 548
           LGKHGRVD+AYKLYEQMLD++QIPNAVVYTSLIRNFFKCGRKEDGHKIYNEMIRLGCSPD
Sbjct: 552 LGKHGRVDEAYKLYEQMLDANQIPNAVVYTSLIRNFFKCGRKEDGHKIYNEMIRLGCSPD 611

Query: 549 LLLLNTYMDCVFKAGEIEKGRALFQEIKALGFIPDVRSYTILIHGLVKAGFAHESYELFY 608
           LLLLNTYMDCVFKAGEIEKGRALFQ+IK LGFIPD RSYTILIHGLVKAGFAHE+YELFY
Sbjct: 612 LLLLNTYMDCVFKAGEIEKGRALFQDIKTLGFIPDARSYTILIHGLVKAGFAHEAYELFY 671

Query: 609 TMKEQGCVLDTRAYNTVINGFCKSGKVNKAYQLLEEMKTKGHEPTVVTYGSVIDGLAKID 668
           TMKEQGCVLDTRAYNTVI+GFCKSGKVNKAYQLLEEMKTKGHEPTVVTYGSVIDGLAKID
Sbjct: 672 TMKEQGCVLDTRAYNTVIDGFCKSGKVNKAYQLLEEMKTKGHEPTVVTYGSVIDGLAKID 731

Query: 669 RLDEAYMLFEEAKSKGVELNVVIYSSLIDGFGKVGRIDEAYLIMEELMQKGLTPSVYTWN 728
           RLDEAYMLFEEAKSKG+ELNVVIYSSLIDGFGKVGRIDEAYLIMEELMQKGLTP+VYTWN
Sbjct: 732 RLDEAYMLFEEAKSKGIELNVVIYSSLIDGFGKVGRIDEAYLIMEELMQKGLTPNVYTWN 791

Query: 729 CLLDALVKAEEISEALVCFQSMKDLKCTPNYITYSILIHGLCKIRKFNKAFVFWQEMQKQ 788
           CLLDALVKAEEISEALVCFQSMKDLKCTPNYITYSILIHGLCKIRKFNKAFVFWQEMQKQ
Sbjct: 792 CLLDALVKAEEISEALVCFQSMKDLKCTPNYITYSILIHGLCKIRKFNKAFVFWQEMQKQ 851

Query: 789 GLKPNVFTYTTMISGLAKAGNIAEANALFEKFKEKGGVPDSAIYNAIIEGLSNANRALDA 848
           G KPNVFTYTTMISGLAKAGNI EAN LFEKFKEKGGV DSAIYNAIIEGLSNANRALDA
Sbjct: 852 GFKPNVFTYTTMISGLAKAGNIVEANTLFEKFKEKGGVADSAIYNAIIEGLSNANRALDA 911

Query: 849 YRIFEETRSKGCSIHTKTCVVLLDSLHKAECIEQAAIVGAVLRETAKAQHAARSWT 905
           YR+FEE R KGCSI+TKTCVVLLDSLHKAECIEQAAIVGAVLRETAKAQHAARSWT
Sbjct: 912 YRLFEEARLKGCSIYTKTCVVLLDSLHKAECIEQAAIVGAVLRETAKAQHAARSWT 967

BLAST of Bhi01G000722 vs. ExPASy TrEMBL
Match: A0A6J1ICA8 (pentatricopeptide repeat-containing protein At3g06920-like OS=Cucurbita maxima OX=3661 GN=LOC111474199 PE=4 SV=1)

HSP 1 Score: 1659.0 bits (4295), Expect = 0.0e+00
Identity = 815/904 (90.15%), Positives = 857/904 (94.80%), Query Frame = 0

Query: 1   MKMLLRNIGAGQINCLDLKYRNPIKFSFKFFSSYAGDSSQTTNRNGAPVSGGGGLVPATK 60
           MKMLLR+ GAGQI CL LK++NP  FS K  SS   +SS+ TN NGAPVS G  LV + K
Sbjct: 1   MKMLLRSKGAGQIYCLALKFKNPFSFSVKLLSSCIENSSR-TNGNGAPVSDGCNLVSSAK 60

Query: 61  YEDKRQVLDGVCQILETGPWGSLVENKLAELDAKPNTELVIGVLRRLKDVNNAVNYFRWA 120
            EDKR ++D VCQILE GPW   VEN LAELD KPN ELVIGVLRRLKDVN AVNYFRWA
Sbjct: 61  NEDKRLIVDSVCQILEAGPWRPSVENALAELDVKPNPELVIGVLRRLKDVNVAVNYFRWA 120

Query: 121 ERVTDLAHCPEAYNSLLMVMARTRKFNCLEQILEEMSIAGFGPSNNTCIEIVLSLVKSRK 180
           ERVTD A CPEAYNSLLMVMARTRKFNCLEQILEEMSIAGFGPSNNTCIEI+LSL+KS K
Sbjct: 121 ERVTDQASCPEAYNSLLMVMARTRKFNCLEQILEEMSIAGFGPSNNTCIEIILSLIKSHK 180

Query: 181 LREAFTFMQTMRKLKFRPAFSAYTTLIGALSASHDSDCMLTLFQQMQELGYEVNVHLFTT 240
           LREAFTFMQTMRK KFRPAFSAYTTLIGALSAS+DSD MLTLF QMQELGYEVNVHLFTT
Sbjct: 181 LREAFTFMQTMRKFKFRPAFSAYTTLIGALSASNDSDSMLTLFHQMQELGYEVNVHLFTT 240

Query: 241 LIRVFAREGRVDAALSLLDEMKSNSLEPDVVLYNVCIDCFGKAGKVDMAWKFFHEMKANG 300
           LIRVFAREGRVDAALSLLDEMK N+ EPDVVLYNVCIDCFGKAGKVDMAWKFFHEMKANG
Sbjct: 241 LIRVFAREGRVDAALSLLDEMKMNAFEPDVVLYNVCIDCFGKAGKVDMAWKFFHEMKANG 300

Query: 301 LVLDDVTHTSMIGVLCKADRMNEAVELFEHMDQNKQVPCAYAYNTMIMGYGMAGKFDEAY 360
           L+LDDVT+TSMIGVLCKADR++EA+ELFEHMDQNKQVPCAYAYNTMIMGYGMAGKFDEAY
Sbjct: 301 LILDDVTYTSMIGVLCKADRLDEAIELFEHMDQNKQVPCAYAYNTMIMGYGMAGKFDEAY 360

Query: 361 SLLERQRRKGCIPSVVAYNCILTCLGRKGRVDEALKLFEEMKKDAIPNLSTYNIMIDMLC 420
           SLLERQRRKGCIPSVVAYNCILTCLGRKGRV EALK+FEEMKKDAIPNLSTYNI+IDMLC
Sbjct: 361 SLLERQRRKGCIPSVVAYNCILTCLGRKGRVAEALKVFEEMKKDAIPNLSTYNIVIDMLC 420

Query: 421 KAGQLETALVVRDAMKDAGLFPNVITVNIMVDRLCKAQRLDDACSIFEGLDHKTCTPDTV 480
           K+G+LETALV+RDAMK+AGLFPNV+TVNIMVDRLCKAQRLDDACSIFEGLDHK CTP+TV
Sbjct: 421 KSGKLETALVIRDAMKEAGLFPNVMTVNIMVDRLCKAQRLDDACSIFEGLDHKACTPNTV 480

Query: 481 TYCSLIEGLGKHGRVDDAYKLYEQMLDSDQIPNAVVYTSLIRNFFKCGRKEDGHKIYNEM 540
           TYCSLI+GLGKHGRVD+AYKLYE+MLDSDQIPNAVV+TSLIRNFF+CGRKEDGHKIYNEM
Sbjct: 481 TYCSLIDGLGKHGRVDEAYKLYEKMLDSDQIPNAVVFTSLIRNFFRCGRKEDGHKIYNEM 540

Query: 541 IRLGCSPDLLLLNTYMDCVFKAGEIEKGRALFQEIKALGFIPDVRSYTILIHGLVKAGFA 600
           IRLGCSPDL+LLNTYMDCVFKAGE +KGRALFQEIKA GFIPD RSY++LIHGLVKAGFA
Sbjct: 541 IRLGCSPDLMLLNTYMDCVFKAGETKKGRALFQEIKAQGFIPDARSYSVLIHGLVKAGFA 600

Query: 601 HESYELFYTMKEQGCVLDTRAYNTVINGFCKSGKVNKAYQLLEEMKTKGHEPTVVTYGSV 660
           HE+YELFYTMKEQGCVLDTRAYNTVI+GFCKSGKVNKAYQLLEEMKTKGHEPTVVTYGSV
Sbjct: 601 HETYELFYTMKEQGCVLDTRAYNTVIDGFCKSGKVNKAYQLLEEMKTKGHEPTVVTYGSV 660

Query: 661 IDGLAKIDRLDEAYMLFEEAKSKGVELNVVIYSSLIDGFGKVGRIDEAYLIMEELMQKGL 720
           IDGLAKIDRLDEAYMLFEEAKSKGVELNV+IYSSLIDGFGKVGRIDEAYLIMEELMQKGL
Sbjct: 661 IDGLAKIDRLDEAYMLFEEAKSKGVELNVIIYSSLIDGFGKVGRIDEAYLIMEELMQKGL 720

Query: 721 TPSVYTWNCLLDALVKAEEISEALVCFQSMKDLKCTPNYITYSILIHGLCKIRKFNKAFV 780
           TP+VYTWNCLLDALVKAEEISEALVCFQSMKDLKCTPNYITYSILIHGLCKIRKFNKAFV
Sbjct: 721 TPNVYTWNCLLDALVKAEEISEALVCFQSMKDLKCTPNYITYSILIHGLCKIRKFNKAFV 780

Query: 781 FWQEMQKQGLKPNVFTYTTMISGLAKAGNIAEANALFEKFKEKGGVPDSAIYNAIIEGLS 840
           FWQEMQKQGLKPNVFTYTTMISGLAKAGN+ EANALFEKFK KGGVPDSA YNAII GLS
Sbjct: 781 FWQEMQKQGLKPNVFTYTTMISGLAKAGNVVEANALFEKFKAKGGVPDSATYNAIIVGLS 840

Query: 841 NANRALDAYRIFEETRSKGCSIHTKTCVVLLDSLHKAECIEQAAIVGAVLRETAKAQHAA 900
           NANRALDAYR+FEETRSKGCS++TKTCVVLLDSLHKAECIEQAAIVG VLRETAKAQHAA
Sbjct: 841 NANRALDAYRLFEETRSKGCSVYTKTCVVLLDSLHKAECIEQAAIVGTVLRETAKAQHAA 900

Query: 901 RSWT 905
           RSWT
Sbjct: 901 RSWT 903
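
The percentages in each HSP summary line are the matched-residue count divided by the alignment length. The sketch below is illustrative only: it recomputes them for the five TrEMBL hits in this section. The header strings are copied from the summaries; the helper names `recompute` and `mismatches` are hypothetical, and the pattern accepts either spelling of the "Positives" label.

```python
import re

# HSP summary lines for the five TrEMBL hits in this section.
HSP_HEADERS = [
    "Identity = 841/904 (93.03%), Positives = 871/904 (96.35%), Query Frame = 0",
    "Identity = 838/904 (92.70%), Positives = 869/904 (96.13%), Query Frame = 0",
    "Identity = 841/909 (92.52%), Positives = 871/909 (95.82%), Query Frame = 0",
    "Identity = 834/896 (93.08%), Positives = 863/896 (96.32%), Query Frame = 0",
    "Identity = 815/904 (90.15%), Positives = 857/904 (94.80%), Query Frame = 0",
]

# Matches "Identity = 841/904 (93.03%)" and the Positives field (either spelling).
FIELD = re.compile(r"(Identity|Posi?tives) = (\d+)/(\d+) \((\d+\.\d+)%\)")


def recompute(header):
    """Map each label to (reported percentage, recomputed percentage)."""
    result = {}
    for label, matched, length, reported in FIELD.findall(header):
        result[label] = (float(reported), 100.0 * int(matched) / int(length))
    return result


def mismatches(headers):
    """List fields whose reported value differs from the rounded recomputation."""
    bad = []
    for header in headers:
        for label, (reported, actual) in recompute(header).items():
            if abs(round(actual, 2) - reported) > 1e-9:
                bad.append((header, label, reported, actual))
    return bad


if __name__ == "__main__":
    for header in HSP_HEADERS:
        print(recompute(header))
    print("mismatches:", mismatches(HSP_HEADERS))
```

Note that the denominator is the alignment length including gap columns, which is why the isoform X1 hit (841/909) shows a lower identity than isoform X2 (841/904) despite the same number of matched residues.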

The following BLAST results are available for this feature:
Match Name      E-value   Identity (%)  Description
AT3G06920.1     0.0e+00   77.34         Tetratricopeptide repeat (TPR)-like superfamily protein
AT4G31850.1     5.7e-103  30.77         proton gradient regulation 3
AT1G06710.1     8.1e-89   28.81         Tetratricopeptide repeat (TPR)-like superfamily protein
AT5G55840.1     2.5e-82   25.56         Pentatricopeptide repeat (PPR) superfamily protein
AT5G61990.1     3.3e-82   28.20         Pentatricopeptide repeat (PPR) superfamily protein
Match Name      E-value   Identity (%)  Description
Q9M907          0.0e+00   77.34         Pentatricopeptide repeat-containing protein At3g06920 OS=Arabidopsis thaliana OX...
Q9SZ52          8.1e-102  30.77         Pentatricopeptide repeat-containing protein At4g31850, chloroplastic OS=Arabidop...
Q9M9X9          1.1e-87   28.81         Pentatricopeptide repeat-containing protein At1g06710, mitochondrial OS=Arabidop...
Q76C99          9.0e-85   29.22         Protein Rf1, mitochondrial OS=Oryza sativa subsp. indica OX=39946 GN=Rf1 PE=2 SV...
Q9LVQ5          3.5e-81   25.56         Pentatricopeptide repeat-containing protein At5g55840 OS=Arabidopsis thaliana OX...
Match Name      E-value   Identity (%)  Description
XP_038896865.1  0.0e+00   100.00        pentatricopeptide repeat-containing protein At3g06920 isoform X1 [Benincasa hisp...
XP_038896901.1  0.0e+00   100.00        pentatricopeptide repeat-containing protein At3g06920 isoform X2 [Benincasa hisp...
XP_016898965.1  0.0e+00   93.03         PREDICTED: pentatricopeptide repeat-containing protein At3g06920 isoform X2 [Cuc...
XP_004134213.1  0.0e+00   92.70         pentatricopeptide repeat-containing protein At3g06920 [Cucumis sativus] >KGN5711...
XP_016898964.1  0.0e+00   92.52         PREDICTED: pentatricopeptide repeat-containing protein At3g06920 isoform X1 [Cuc...
Match Name      E-value   Identity (%)  Description
A0A1S4DTD7      0.0e+00   93.03         pentatricopeptide repeat-containing protein At3g06920 isoform X2 OS=Cucumis melo...
A0A0A0L914      0.0e+00   92.70         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G154350 PE=4 SV=1
A0A1S4DSK3      0.0e+00   92.52         pentatricopeptide repeat-containing protein At3g06920 isoform X1 OS=Cucumis melo...
A0A5D3CY28      0.0e+00   93.08         Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946...
A0A6J1ICA8      0.0e+00   90.15         pentatricopeptide repeat-containing protein At3g06920-like OS=Cucurbita maxima O...
InterPro
Analysis Name: InterPro Annotations of Wax gourd (B227) v1
Date Performed: 2021-10-22
IPR Term  IPR Description  Source  Source Term  Source Description  Alignment
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D  1.25.40.10  Tetratricopeptide repeat domain
    coord: 80..212    e-value: 4.7E-15    score: 57.4
    coord: 213..318   e-value: 1.4E-31    score: 111.3
    coord: 319..434   e-value: 7.0E-33    score: 116.3
    coord: 713..904   e-value: 1.1E-45    score: 158.3
    coord: 435..508   e-value: 1.2E-19    score: 72.6
    coord: 647..712   e-value: 8.1E-19    score: 69.9
    coord: 578..646   e-value: 2.9E-21    score: 77.9
    coord: 509..577   e-value: 3.4E-13    score: 51.5
IPR011990  Tetratricopeptide-like helical domain superfamily  SUPERFAMILY  48452  TPR-like
    coord: 365..769
IPR002885  Pentatricopeptide repeat  TIGRFAM  TIGR00756  TIGR00756
    coord: 411..444   e-value: 2.3E-8     score: 31.7
    coord: 237..270   e-value: 9.0E-9     score: 33.0
    coord: 760..794   e-value: 2.3E-10    score: 38.0
    coord: 271..304   e-value: 1.2E-9     score: 35.7
    coord: 655..689   e-value: 3.0E-6     score: 25.0
    coord: 376..403   e-value: 3.7E-8     score: 31.1
    coord: 725..758   e-value: 6.3E-8     score: 30.3
    coord: 341..374   e-value: 9.9E-8     score: 29.7
    coord: 553..584   e-value: 2.3E-4     score: 19.1
    coord: 306..337   e-value: 6.1E-6     score: 24.1
    coord: 586..619   e-value: 4.6E-7     score: 27.6
    coord: 795..828   e-value: 5.0E-8     score: 30.6
    coord: 445..479   e-value: 1.3E-4     score: 19.9
    coord: 690..723   e-value: 1.3E-6     score: 26.2
    coord: 480..513   e-value: 1.4E-8     score: 32.4
    coord: 621..654   e-value: 7.8E-11    score: 39.5
    coord: 831..861   e-value: 5.3E-4     score: 18.0
    coord: 515..548   e-value: 7.3E-8     score: 30.1
    coord: 202..235   e-value: 4.0E-4     score: 18.4
IPR002885  Pentatricopeptide repeat  PFAM  PF13041  PPR_2
    coord: 618..666   e-value: 4.8E-17    score: 61.9
    coord: 553..596   e-value: 2.6E-7     score: 30.7
    coord: 722..771   e-value: 9.7E-16    score: 57.7
    coord: 268..317   e-value: 3.1E-15    score: 56.1
    coord: 373..421   e-value: 8.7E-18    score: 64.3
IPR002885  Pentatricopeptide repeat  PFAM  PF12854  PPR_1
    coord: 230..262   e-value: 4.0E-7     score: 29.7
    coord: 683..714   e-value: 6.0E-8     score: 32.3
    coord: 439..468   e-value: 1.2E-7     score: 31.3
    coord: 474..505   e-value: 2.2E-11    score: 43.3
IPR002885  Pentatricopeptide repeat  PFAM  PF01535  PPR
    coord: 515..545   e-value: 8.1E-5     score: 22.6
    coord: 342..371   e-value: 4.7E-6     score: 26.5
IPR002885  Pentatricopeptide repeat  PFAM  PF13812  PPR_3
    coord: 152..210   e-value: 2.1E-4     score: 21.3
    coord: 783..837   e-value: 9.1E-11    score: 41.7
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 269..303   score: 11.99172
    coord: 443..477   score: 9.843305
    coord: 304..338   score: 10.994242
    coord: 234..268   score: 12.101333
    coord: 374..404   score: 11.41077
    coord: 339..373   score: 11.783455
    coord: 758..792   score: 12.572669
    coord: 653..687   score: 11.060009
    coord: 583..617   score: 11.038087
    coord: 828..862   score: 10.161182
    coord: 478..512   score: 12.693243
    coord: 793..827   score: 12.33152
    coord: 688..722   score: 12.408249
    coord: 513..547   score: 11.99172
    coord: 618..652   score: 14.304554
    coord: 548..582   score: 9.185627
    coord: 723..757   score: 10.785976
    coord: 408..442   score: 12.298636
None  No IPR available  PANTHER  PTHR47938  RESPIRATORY COMPLEX I CHAPERONE (CIA84), PUTATIVE (AFU_ORTHOLOGUE AFUA_2G06020)-RELATED
    coord: 72..894
None  No IPR available  PANTHER  PTHR47938:SF28  OS07G0249100 PROTEIN
    coord: 72..894
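
The PROSITE PS51375 hits are listed out of order, which makes it hard to see how the PPR motifs are arranged along the protein. The following is a minimal sketch, not part of the original analysis: it sorts the listed coordinates, then reports the repeat count, the overall span, and any residues left uncovered between neighbouring repeats. The coordinates are copied from the table; the function name `summarize` is hypothetical.

```python
# (start, end) residue coordinates of the PROSITE PS51375 (PPR) hits,
# in the order they appear in the InterPro table.
PS51375_HITS = [
    (269, 303), (443, 477), (304, 338), (234, 268), (374, 404), (339, 373),
    (758, 792), (653, 687), (583, 617), (828, 862), (478, 512), (793, 827),
    (688, 722), (513, 547), (618, 652), (548, 582), (723, 757), (408, 442),
]


def summarize(hits):
    """Return repeat count, overall span, and uncovered gaps between repeats."""
    ordered = sorted(hits)
    gaps = []
    for (_, prev_end), (start, _) in zip(ordered, ordered[1:]):
        if start > prev_end + 1:
            # Residues strictly between two consecutive repeats.
            gaps.append((prev_end + 1, start - 1))
    return {
        "count": len(ordered),
        "span": (ordered[0][0], ordered[-1][1]),
        "gaps": gaps,
    }


if __name__ == "__main__":
    print(summarize(PS51375_HITS))
```

On these coordinates the sketch finds 18 repeats spanning residues 234 to 862, with a single three-residue gap (405 to 407) after the shorter 374..404 hit.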

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
Bhi01M000722  Bhi01M000722  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0032544      plastid translation
biological_process  GO:0043489      RNA stabilization
cellular_component  GO:0009536      plastid
molecular_function  GO:0003729      mRNA binding
molecular_function  GO:0005515      protein binding