Bhi04G001830 (gene) Wax gourd (B227) v1

Overview
Name: Bhi04G001830
Type: gene
Organism: Benincasa hispida (Wax gourd (B227) v1)
Description: Pentatricopeptide repeat-containing protein
Location: chr4: 61390423 .. 61392994 (+)
RNA-Seq Expression: Bhi04G001830
Synteny: Bhi04G001830
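The Location field can be read programmatically. The sketch below is illustrative only: the function names `parse_location` and `span_length` are hypothetical, and it assumes the coordinates are 1-based and inclusive (the usual convention for GFF-style annotation, but not stated on this page).

```python
import re

def parse_location(text: str) -> tuple[str, int, int, str]:
    """Extract (chromosome, start, end, strand) from a Location string.

    Assumes the layout 'chrN: START .. END (STRAND)' shown in the Overview.
    """
    m = re.search(r"(chr\w+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand

def span_length(start: int, end: int) -> int:
    # Assumption: 1-based inclusive coordinates, so both endpoints count.
    return end - start + 1

chrom, start, end, strand = parse_location("chr4: 61390423 .. 61392994 (+)")
print(chrom, strand, span_length(start, end))
```

Under that assumption the annotated gene span covers 2572 bases on the plus strand of chr4.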
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

GAAAAATACGTTTGTAATCTGCCTGAACTGTCATCTCTCTTGGTACTCATTGTGAGCTTTGTTATAATCTTCACAAAATTGAAATCCCTAACATCACAAACTCAAAAACTCAAAAACTCAATCAGTTCGAGGTCTGGAAGAACCCATGGAAGCGATCAGGTGAGCTCGGTAGTTCCCATCTCTCTTTGTATGGTTAGGGATTGAATCGTTTCTAATCGATCGCAACTCCCTGATTTCCATTTCCTTCTGCAATAGGTTGAAGGCTTGAAGCAACTCCGTTCCCTACGGCCAAATTCCCACTTTTTCCTACATTCTTCGCAGCTGTAAGTAACACTTCACCCTTAGAATTTCCAAAACCTAAGTTGTGAAAATCTGCCAAGAGACTAGCATCAGTATTAATCTTATGAAAATTTTCATTGTAGAATTCTTGGTATGGTTGTGATTTTGCTTGCTTGGGAAATCTCAGCCTGGACTGGACTTCCTTTGGTTGATTAATTGCCTTCTTATTCCTATTATTCTTCTAAACTATGCCATTGAATCTTGTCTTTCCAATGGAGTAGTTGAATTTGGAGGTTGCAGGAAGAAGAAAAAAAAAATTGAGCCTGGTGTTGTTAAAATTCTTCTAAAACATGCTCGATTCATTTGGAGCAATTTCTTGTTTGTGCCTTGTGTACTGTCTTTCTCTGCCCGTTGTGAAGTGACAACGCCTAGTTTCTGAAATTATGCGGCGACACCTTCTTCGTCCCTGTAACTATAACACTATTGAAACTATTGCTGCTCATGTTGTCCCCAAAACGCCATTGCTTCACAAATCAATCTCTTCCTCATCTTCTCTTTACCAACGAGACTTAAATGTTCACGACGAATCAAAAACTCTGATAACCATCAACATAAATCATAAACAGTGTGGAGATCAACCACATTTCTCAATTGGGTCTCCATGTAGGGTCCAGAAACTCATTGCATCCCAATCTGATCCTCTTCTTGCCAAGGAAATTTTTTACTATGCTTGTCGTCAACCCCATTTTCGCCCATCATCTTCCTCTTTACCCATTCTTATTCTGAAGCTAGGGCGCTCCAAATACTTCTCTCTGATTGATGATCTTCTCCTTAGCTTCAAGTCTAGAGGCTACCCTGTCACTCCAACTGTCTTCTCCTACATGATCAAAATCTATGGTGAAGCTGATTTACCTGATAAAGCTCTCAAGGCCTTTTATACCATGATTGAGTTTGGGTGTACACCTTCTTCCAAACAGTTGAACAGGATACTAGAAATTTTGGTTTCTCATCGTAACTTCATTCGACCAGCTTTTGATCTTTTCAAGAACGCCCGTCATCATGGAGTGCTGCCCAACACAAAGTCTTACAACATTCTTATGCGTGCATTCTGTTGGAATGGAAATCTTAGCATTGCCTACACACTGTTCAACAAAATGTTCAAACGAGATGTCATTCCAGATGTCGAATCATACCGGATATTAATGCAGGGCCTGTGCAGGAAGAATCAAGTGAATGGTGCTGTTGATTTGCTAGAAGATATGTTAAATAAAGGATACATTCCAGACTCGTTGAGCTATGCCACTTTGTTAAATAGTTTATGTAGGAAGAAGAAGCTAAGGGAAGCTTATAAACTTCTCTGTAGAATGAAGGTTAAAGGTTGCAATCCTGATGTTGCTCATTACAATACAGCTATAATTGGATTTTGCAGAGAAGGGCGTGCCCTTGATGCTTGTAAGATTCTGGAGGATATGCAGTCGAATGGTTGTTTGCCTAATTTAGTATCTTACCAGAGTTTGACTAATGGATTATGTGATCAAGGAATGTTTGAATTGGCAAAGGATTATGTTGAGGAGATGACGTTAAAGGGTTTTTGCCCACATTTCTCTATCATTCATGCTTTGGTTAAGGGTTTCCGTAATGTTGGCAGGATCGACGAATCGTGTAGTATTCTTGAAGATATGCTAAATCATGGGAAAGCCCCTCATTCTGATACTTGGGAG
ATTATTATATCTGGAATTTGTGAAGTTGAGGACACTGTCAAATTATGTGAAATTTTAGGGAAGATTTTGAAGAAAGATGTAAGAAGAGACACTAGAATAGTTGAAGCAGGCTCTGGTTTGGGTGAGTATTTAATTAGGAAGTTACAAGCTTCCAAATCACGAAGGGTATGAATATTTCTTAGTTCAAATAGTAGGGAGAGGACTCAAATTTTTAATCTCTTGGTTGAGATATATGCCATAATCAATTGAACTATAGCTCAAGTTGAACAAACAGCTTGAATTTTTGGTGTA

mRNA sequence

GAAAAATACGTTTGTAATCTGCCTGAACTGTCATCTCTCTTGGTACTCATTGTGAGCTTTGTTATAATCTTCACAAAATTGAAATCCCTAACATCACAAACTCAAAAACTCAAAAACTCAATCAGTTCGAGGTCTGGAAGAACCCATGGAAGCGATCAGGTGAGCTCGGTTGAAGGCTTGAAGCAACTCCGTTCCCTACGGCCAAATTCCCACTTTTTCCTACATTCTTCGCAGCTAATTCTTGGTATGGTTGTGATTTTGCTTGCTTGGGAAATCTCAGCCTGGACTGGACTTCCTTTGGTTGATTAATTGCCTTCTTATTCCTATTATTCTTCTAAACTATGCCATTGAATCTTGTCTTTCCAATGGAGTAGTTGAATTTGGAGGTTGCAGGAAGAAGAAAAAAAAAATTGAGCCTGGTGTTGTTAAAATTCTTCTAAAACATGCTCGATTCATTTGGAGCAATTTCTTGTTTGTGCCTTGTGTACTGTCTTTCTCTGCCCGTTGTGAAGTGACAACGCCTAGTTTCTGAAATTATGCGGCGACACCTTCTTCGTCCCTGTAACTATAACACTATTGAAACTATTGCTGCTCATGTTGTCCCCAAAACGCCATTGCTTCACAAATCAATCTCTTCCTCATCTTCTCTTTACCAACGAGACTTAAATGTTCACGACGAATCAAAAACTCTGATAACCATCAACATAAATCATAAACAGTGTGGAGATCAACCACATTTCTCAATTGGGTCTCCATGTAGGGTCCAGAAACTCATTGCATCCCAATCTGATCCTCTTCTTGCCAAGGAAATTTTTTACTATGCTTGTCGTCAACCCCATTTTCGCCCATCATCTTCCTCTTTACCCATTCTTATTCTGAAGCTAGGGCGCTCCAAATACTTCTCTCTGATTGATGATCTTCTCCTTAGCTTCAAGTCTAGAGGCTACCCTGTCACTCCAACTGTCTTCTCCTACATGATCAAAATCTATGGTGAAGCTGATTTACCTGATAAAGCTCTCAAGGCCTTTTATACCATGATTGAGTTTGGGTGTACACCTTCTTCCAAACAGTTGAACAGGATACTAGAAATTTTGGTTTCTCATCGTAACTTCATTCGACCAGCTTTTGATCTTTTCAAGAACGCCCGTCATCATGGAGTGCTGCCCAACACAAAGTCTTACAACATTCTTATGCGTGCATTCTGTTGGAATGGAAATCTTAGCATTGCCTACACACTGTTCAACAAAATGTTCAAACGAGATGTCATTCCAGATGTCGAATCATACCGGATATTAATGCAGGGCCTGTGCAGGAAGAATCAAGTGAATGGTGCTGTTGATTTGCTAGAAGATATGTTAAATAAAGGATACATTCCAGACTCGTTGAGCTATGCCACTTTGTTAAATAGTTTATGTAGGAAGAAGAAGCTAAGGGAAGCTTATAAACTTCTCTGTAGAATGAAGGTTAAAGGTTGCAATCCTGATGTTGCTCATTACAATACAGCTATAATTGGATTTTGCAGAGAAGGGCGTGCCCTTGATGCTTGTAAGATTCTGGAGGATATGCAGTCGAATGGTTGTTTGCCTAATTTAGTATCTTACCAGAGTTTGACTAATGGATTATGTGATCAAGGAATGTTTGAATTGGCAAAGGATTATGTTGAGGAGATGACGTTAAAGGGTTTTTGCCCACATTTCTCTATCATTCATGCTTTGGTTAAGGGTTTCCGTAATGTTGGCAGGATCGACGAATCGTGTAGTATTCTTGAAGATATGCTAAATCATGGGAAAGCCCCTCATTCTGATACTTGGGAGATTATTATATCTGGAATTTGTGAAGTTGAGGACACTGTCAAATTATGTGAAATTTTAGGGAAGATTTTGAAGAAAGATGTAAGAAGAGACACTAGAATAGTTGAAGCAGGCTCTGGTTTGGGTGAGTATTTAATTAGGAAGTTACAAGCTTCCAAATCACGAAGGGTATGAATATTTCTTAGTTCA
AATAGTAGGGAGAGGACTCAAATTTTTAATCTCTTGGTTGAGATATATGCCATAATCAATTGAACTATAGCTCAAGTTGAACAAACAGCTTGAATTTTTGGTGTA

Coding sequence (CDS)

ATGCGGCGACACCTTCTTCGTCCCTGTAACTATAACACTATTGAAACTATTGCTGCTCATGTTGTCCCCAAAACGCCATTGCTTCACAAATCAATCTCTTCCTCATCTTCTCTTTACCAACGAGACTTAAATGTTCACGACGAATCAAAAACTCTGATAACCATCAACATAAATCATAAACAGTGTGGAGATCAACCACATTTCTCAATTGGGTCTCCATGTAGGGTCCAGAAACTCATTGCATCCCAATCTGATCCTCTTCTTGCCAAGGAAATTTTTTACTATGCTTGTCGTCAACCCCATTTTCGCCCATCATCTTCCTCTTTACCCATTCTTATTCTGAAGCTAGGGCGCTCCAAATACTTCTCTCTGATTGATGATCTTCTCCTTAGCTTCAAGTCTAGAGGCTACCCTGTCACTCCAACTGTCTTCTCCTACATGATCAAAATCTATGGTGAAGCTGATTTACCTGATAAAGCTCTCAAGGCCTTTTATACCATGATTGAGTTTGGGTGTACACCTTCTTCCAAACAGTTGAACAGGATACTAGAAATTTTGGTTTCTCATCGTAACTTCATTCGACCAGCTTTTGATCTTTTCAAGAACGCCCGTCATCATGGAGTGCTGCCCAACACAAAGTCTTACAACATTCTTATGCGTGCATTCTGTTGGAATGGAAATCTTAGCATTGCCTACACACTGTTCAACAAAATGTTCAAACGAGATGTCATTCCAGATGTCGAATCATACCGGATATTAATGCAGGGCCTGTGCAGGAAGAATCAAGTGAATGGTGCTGTTGATTTGCTAGAAGATATGTTAAATAAAGGATACATTCCAGACTCGTTGAGCTATGCCACTTTGTTAAATAGTTTATGTAGGAAGAAGAAGCTAAGGGAAGCTTATAAACTTCTCTGTAGAATGAAGGTTAAAGGTTGCAATCCTGATGTTGCTCATTACAATACAGCTATAATTGGATTTTGCAGAGAAGGGCGTGCCCTTGATGCTTGTAAGATTCTGGAGGATATGCAGTCGAATGGTTGTTTGCCTAATTTAGTATCTTACCAGAGTTTGACTAATGGATTATGTGATCAAGGAATGTTTGAATTGGCAAAGGATTATGTTGAGGAGATGACGTTAAAGGGTTTTTGCCCACATTTCTCTATCATTCATGCTTTGGTTAAGGGTTTCCGTAATGTTGGCAGGATCGACGAATCGTGTAGTATTCTTGAAGATATGCTAAATCATGGGAAAGCCCCTCATTCTGATACTTGGGAGATTATTATATCTGGAATTTGTGAAGTTGAGGACACTGTCAAATTATGTGAAATTTTAGGGAAGATTTTGAAGAAAGATGTAAGAAGAGACACTAGAATAGTTGAAGCAGGCTCTGGTTTGGGTGAGTATTTAATTAGGAAGTTACAAGCTTCCAAATCACGAAGGGTATGA

Protein sequence

MRRHLLRPCNYNTIETIAAHVVPKTPLLHKSISSSSSLYQRDLNVHDESKTLITININHKQCGDQPHFSIGSPCRVQKLIASQSDPLLAKEIFYYACRQPHFRPSSSSLPILILKLGRSKYFSLIDDLLLSFKSRGYPVTPTVFSYMIKIYGEADLPDKALKAFYTMIEFGCTPSSKQLNRILEILVSHRNFIRPAFDLFKNARHHGVLPNTKSYNILMRAFCWNGNLSIAYTLFNKMFKRDVIPDVESYRILMQGLCRKNQVNGAVDLLEDMLNKGYIPDSLSYATLLNSLCRKKKLREAYKLLCRMKVKGCNPDVAHYNTAIIGFCREGRALDACKILEDMQSNGCLPNLVSYQSLTNGLCDQGMFELAKDYVEEMTLKGFCPHFSIIHALVKGFRNVGRIDESCSILEDMLNHGKAPHSDTWEIIISGICEVEDTVKLCEILGKILKKDVRRDTRIVEAGSGLGEYLIRKLQASKSRRV
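The relationship between the coding sequence and the protein sequence above can be checked with a short translation routine. This is a minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1); the function name `translate` is hypothetical and the page does not state which tool produced the protein record.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the polypeptide
            break
        protein.append(aa)
    return "".join(protein)

# First seven codons and last four codons of the CDS listed above.
print(translate("ATGCGGCGACACCTTCTTCGT"))
print(translate("CGAAGGGTATGA"))
```

The two fragments correspond to the start (MRRHLLR) and the end (RRV, followed by the TGA stop codon) of the protein sequence shown above.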
Homology
BLAST of Bhi04G001830 vs. TAIR 10
Match: AT4G01400.3 (FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 15 growth stages; CONTAINS InterPro DOMAIN/s: Pentatricopeptide repeat (InterPro:IPR002885); BEST Arabidopsis thaliana protein match is: Pentatricopeptide repeat (PPR) superfamily protein (TAIR:AT5G46100.1); Has 40053 Blast hits to 12380 proteins in 263 species: Archae - 4; Bacteria - 27; Metazoa - 366; Fungi - 374; Plants - 38347; Viruses - 0; Other Eukaryotes - 935 (source: NCBI BLink). )

HSP 1 Score: 552.0 bits (1421), Expect = 4.8e-157
Identity = 262/451 (58.09%), Positives = 341/451 (75.61%), Query Frame = 0

Query: 28  LHKSISSSSSLYQRDLNVHDESKTLITININHKQCGDQPHFSIGSPCRVQKLIASQSDPL 87
           L   +S+SS       + H+  K +++           P   IGSP RVQKLIASQSDPL
Sbjct: 16  LTSPLSTSSRFLFYSSSEHEARKPIVS----------NPKSPIGSPTRVQKLIASQSDPL 75

Query: 88  LAKEIFYYACRQPHFRPSSSSLPILILKLGRSKYFSLIDDLLLSFKSRGYPVTPTVFSYM 147
           LAKEIF YA +QP+FR S SS  ILILKLGR +YF+LIDD+L   +S GYP+T  +F+Y+
Sbjct: 76  LAKEIFDYASQQPNFRHSRSSHLILILKLGRGRYFNLIDDVLAKHRSSGYPLTGEIFTYL 135

Query: 148 IKIYGEADLPDKALKAFYTMIEFGCTPSSKQLNRILEILVSHRNFIRPAFDLFKNARHHG 207
           IK+Y EA LP+K L  FY M+EF  TP  K LNRIL++LVSHR +++ AF+LFK++R HG
Sbjct: 136 IKVYAEAKLPEKVLSTFYKMLEFNFTPQPKHLNRILDVLVSHRGYLQKAFELFKSSRLHG 195

Query: 208 VLPNTKSYNILMRAFCWNGNLSIAYTLFNKMFKRDVIPDVESYRILMQGLCRKNQVNGAV 267
           V+PNT+SYN+LM+AFC N +LSIAY LF KM +RDV+PDV+SY+IL+QG CRK QVNGA+
Sbjct: 196 VMPNTRSYNLLMQAFCLNDDLSIAYQLFGKMLERDVVPDVDSYKILIQGFCRKGQVNGAM 255

Query: 268 DLLEDMLNKGYIPDSLSYATLLNSLCRKKKLREAYKLLCRMKVKGCNPDVAHYNTAIIGF 327
           +LL+DMLNKG++PD LSY TLLNSLCRK +LREAYKLLCRMK+KGCNPD+ HYNT I+GF
Sbjct: 256 ELLDDMLNKGFVPDRLSYTTLLNSLCRKTQLREAYKLLCRMKLKGCNPDLVHYNTMILGF 315

Query: 328 CREGRALDACKILEDMQSNGCLPNLVSYQSLTNGLCDQGMFELAKDYVEEMTLKGFCPHF 387
           CRE RA+DA K+L+DM SNGC PN VSY++L  GLCDQGMF+  K Y+EEM  KGF PHF
Sbjct: 316 CREDRAMDARKVLDDMLSNGCSPNSVSYRTLIGGLCDQGMFDEGKKYLEEMISKGFSPHF 375

Query: 388 SIIHALVKGFRNVGRIDESCSILEDMLNHGKAPHSDTWEIIISGICEVEDTVKLCEILGK 447
           S+ + LVKGF + G+++E+C ++E ++ +G+  HSDTWE++I  IC  +++ K+   L  
Sbjct: 376 SVSNCLVKGFCSFGKVEEACDVVEVVMKNGETLHSDTWEMVIPLICNEDESEKIKLFLED 435

Query: 448 ILKKDVRRDTRIVEAGSGLGEYLIRKLQASK 479
            +K+++  DTRIV+ G GLG YL  KLQ  +
Sbjct: 436 AVKEEITGDTRIVDVGIGLGSYLSSKLQMKR 456
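The percentages in each HSP summary are the reported match counts divided by the reported alignment length. A minimal sketch, using a hypothetical helper named `percent`, reproduces the figures for the AT4G01400.3 hit above:

```python
def percent(matches: int, alignment_length: int) -> float:
    """Return matches as a percentage of the alignment length, to 2 decimals."""
    return round(100.0 * matches / alignment_length, 2)

# Counts taken from the HSP summary for AT4G01400.3.
identity = percent(262, 451)   # identical residues
positives = percent(341, 451)  # identical or conservatively substituted residues
print(identity, positives)
```

Both values agree with the 58.09% identity and 75.61% positives reported in the summary line.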

BLAST of Bhi04G001830 vs. TAIR 10
Match: AT4G01400.1 (FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 15 growth stages; CONTAINS InterPro DOMAIN/s: COG4 transport (InterPro:IPR013167), Pentatricopeptide repeat (InterPro:IPR002885); BEST Arabidopsis thaliana protein match is: Pentatricopeptide repeat (PPR) superfamily protein (TAIR:AT5G46100.1); Has 26268 Blast hits to 8959 proteins in 289 species: Archae - 0; Bacteria - 3; Metazoa - 247; Fungi - 222; Plants - 25350; Viruses - 0; Other Eukaryotes - 446 (source: NCBI BLink). )

HSP 1 Score: 382.9 bits (982), Expect = 3.9e-106
Identity = 197/436 (45.18%), Positives = 268/436 (61.47%), Query Frame = 0

Query: 28  LHKSISSSSSLYQRDLNVHDESKTLITININHKQCGDQPHFSIGSPCRVQKLIASQSDPL 87
           L   +S+SS       + H+  K +++           P   IGSP RVQKLIASQSDPL
Sbjct: 16  LTSPLSTSSRFLFYSSSEHEARKPIVS----------NPKSPIGSPTRVQKLIASQSDPL 75

Query: 88  LAKEIFYYACRQPHFRPSSSSLPILILKLGRSKYFSLIDDLLLSFKSRGYPVTPTVFSYM 147
           LAKEIF YA +QP+FR S SS  ILILKLGR +YF+LIDD+L   +S GYP+T  +F+Y+
Sbjct: 76  LAKEIFDYASQQPNFRHSRSSHLILILKLGRGRYFNLIDDVLAKHRSSGYPLTGEIFTYL 135

Query: 148 IKIYGEADLPDKALKAFYTMIEFGCTPSSKQLNRILEILVSHRNFIRPAFDLFKNARHHG 207
           IK+Y EA LP+K L  FY M+EF  TP  K LNRIL++LVSHR +++ AF+LFK++R HG
Sbjct: 136 IKVYAEAKLPEKVLSTFYKMLEFNFTPQPKHLNRILDVLVSHRGYLQKAFELFKSSRLHG 195

Query: 208 VLPNTKSYNILMRAFCWNGNLSIAYTLFNKMFKRDVIPDVESYRILMQGLCRKNQVNGAV 267
           V+PNT+SYN+LM+AFC N +LSIAY LF KM +RDV+PDV+SY+IL+QG CRK QVNGA+
Sbjct: 196 VMPNTRSYNLLMQAFCLNDDLSIAYQLFGKMLERDVVPDVDSYKILIQGFCRKGQVNGAM 255

Query: 268 DLLEDMLNKGYIPDSLSYATLLNSLCRKKKLREAYKLLCRMKVKGCNPDVAHYNTAIIGF 327
           +LL+DMLNKG++PD                                              
Sbjct: 256 ELLDDMLNKGFVPD---------------------------------------------- 315

Query: 328 CREGRALDACKILEDMQSNGCLPNLVSYQSLTNGLCDQGMFELAKDYVEEMTLKGFCPHF 387
                                       ++L  GLCDQGMF+  K Y+EEM  KGF PHF
Sbjct: 316 ----------------------------RTLIGGLCDQGMFDEGKKYLEEMISKGFSPHF 367

Query: 388 SIIHALVKGFRNVGRIDESCSILEDMLNHGKAPHSDTWEIIISGICEVEDTVKLCEILGK 447
           S+ + LVKGF + G+++E+C ++E ++ +G+  HSDTWE++I  IC  +++ K+   L  
Sbjct: 376 SVSNCLVKGFCSFGKVEEACDVVEVVMKNGETLHSDTWEMVIPLICNEDESEKIKLFLED 367

Query: 448 ILKKDVRRDTRIVEAG 464
            +K+++  DTRIV+ G
Sbjct: 436 AVKEEITGDTRIVDVG 367

BLAST of Bhi04G001830 vs. TAIR 10
Match: AT5G46100.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 207.2 bits (526), Expect = 2.9e-53
Identity = 114/372 (30.65%), Positives = 194/372 (52.15%), Query Frame = 0

Query: 72  SPCRVQKLIASQSDPLLAKEIFYYACRQ--PHFRPSSSSLPILILKLGRSKYFSLIDDLL 131
           +P +V KL+ ++ D   +  +F  A  +    +    SS   ++L+L  +  F   +DL+
Sbjct: 15  TPSQVIKLMRAEKDVEKSMAVFDSATAEYANGYVHDQSSFGYMVLRLVSANKFKAAEDLI 74

Query: 132 LSFKSRGYPVTPTVFSYMIKIYGEADLPDKALKAFYTMIEFGCTPSSKQLNRILEILVSH 191
           +  K     V+  +   + + YG    P  +L+ F+ M +F C PS K    +L ILV  
Sbjct: 75  VRMKIENCVVSEDILLSICRGYGRVHRPFDSLRVFHKMKDFDCDPSQKAYVTVLAILV-E 134

Query: 192 RNFIRPAFDLFKNARHHGVLPNTKSYNILMRAFCWN-GNLSIAYTLFNKMFKRDVIPDVE 251
            N +  AF  +KN R  G+ P   S N+L++A C N G +     +F +M KR   PD  
Sbjct: 135 ENQLNLAFKFYKNMREIGLPPTVASLNVLIKALCRNDGTVDAGLKIFLEMPKRGCDPDSY 194

Query: 252 SYRILMQGLCRKNQVNGAVDLLEDMLNKGYIPDSLSYATLLNSLCRKKKLREAYKLLCRM 311
           +Y  L+ GLCR  +++ A  L  +M+ K   P  ++Y +L+N LC  K + EA + L  M
Sbjct: 195 TYGTLISGLCRFGRIDEAKKLFTEMVEKDCAPTVVTYTSLINGLCGSKNVDEAMRYLEEM 254

Query: 312 KVKGCNPDVAHYNTAIIGFCREGRALDACKILEDMQSNGCLPNLVSYQSLTNGLCDQGMF 371
           K KG  P+V  Y++ + G C++GR+L A ++ E M + GC PN+V+Y +L  GLC +   
Sbjct: 255 KSKGIEPNVFTYSSLMDGLCKDGRSLQAMELFEMMMARGCRPNMVTYTTLITGLCKEQKI 314

Query: 372 ELAKDYVEEMTLKGFCPHFSIIHALVKGFRNVGRIDESCSILEDMLNHGKAPHSDTWEI- 431
           + A + ++ M L+G  P   +   ++ GF  + +  E+ + L++M+  G  P+  TW I 
Sbjct: 315 QEAVELLDRMNLQGLKPDAGLYGKVISGFCAISKFREAANFLDEMILGGITPNRLTWNIH 374

Query: 432 ------IISGIC 434
                 ++ G+C
Sbjct: 375 VKTSNEVVRGLC 385

BLAST of Bhi04G001830 vs. TAIR 10
Match: AT5G39710.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 196.1 bits (497), Expect = 6.7e-50
Identity = 98/307 (31.92%), Positives = 168/307 (54.72%), Query Frame = 0

Query: 140 TPTVFSYMIKIYGEADLPDKALKAFYTMIEFGCTPSSKQLNRILEILVSHRNFIRPAFDL 199
           T +VF  ++K Y    L DKAL   +     G  P     N +L+  +  +  I  A ++
Sbjct: 133 TSSVFDLVVKSYSRLSLIDKALSIVHLAQAHGFMPGVLSYNAVLDATIRSKRNISFAENV 192

Query: 200 FKNARHHGVLPNTKSYNILMRAFCWNGNLSIAYTLFNKMFKRDVIPDVESYRILMQGLCR 259
           FK      V PN  +YNIL+R FC+ GN+ +A TLF+KM  +  +P+V +Y  L+ G C+
Sbjct: 193 FKEMLESQVSPNVFTYNILIRGFCFAGNIDVALTLFDKMETKGCLPNVVTYNTLIDGYCK 252

Query: 260 KNQVNGAVDLLEDMLNKGYIPDSLSYATLLNSLCRKKKLREAYKLLCRMKVKGCNPDVAH 319
             +++    LL  M  KG  P+ +SY  ++N LCR+ +++E   +L  M  +G + D   
Sbjct: 253 LRKIDDGFKLLRSMALKGLEPNLISYNVVINGLCREGRMKEVSFVLTEMNRRGYSLDEVT 312

Query: 320 YNTAIIGFCREGRALDACKILEDMQSNGCLPNLVSYQSLTNGLCDQGMFELAKDYVEEMT 379
           YNT I G+C+EG    A  +  +M  +G  P++++Y SL + +C  G    A +++++M 
Sbjct: 313 YNTLIKGYCKEGNFHQALVMHAEMLRHGLTPSVITYTSLIHSMCKAGNMNRAMEFLDQMR 372

Query: 380 LKGFCPHFSIIHALVKGFRNVGRIDESCSILEDMLNHGKAPHSDTWEIIISGIC---EVE 439
           ++G CP+      LV GF   G ++E+  +L +M ++G +P   T+  +I+G C   ++E
Sbjct: 373 VRGLCPNERTYTTLVDGFSQKGYMNEAYRVLREMNDNGFSPSVVTYNALINGHCVTGKME 432

Query: 440 DTVKLCE 444
           D + + E
Sbjct: 433 DAIAVLE 439

BLAST of Bhi04G001830 vs. TAIR 10
Match: AT3G48810.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 182.6 bits (462), Expect = 7.7e-46
Identity = 114/446 (25.56%), Positives = 192/446 (43.05%), Query Frame = 0

Query: 55  ININHKQCGDQPHFSIGSPCRVQKLIASQSDPLLAKEIFYYACRQPHFRPSSSSLPILIL 114
           +N+NH       H  I     V K +  +S   LA   F        F+ +  +  ++I 
Sbjct: 26  LNVNHLLTESPNHAEI-KELDVVKRLRQESCVPLALHFFKSIANSNLFKHTPLTFEVMIR 85

Query: 115 KLGRSKYFSLIDDLLLSFKSRGYPVTPTVFSYMIKIYGEADLPDKALKAFYTMIEFGCTP 174
           KL        +  LL   K +G+  +  +F  +I +Y +  L ++A++ FY + EFGC P
Sbjct: 86  KLAMDGQVDSVQYLLQQMKLQGFHCSEDLFISVISVYRQVGLAERAVEMFYRIKEFGCDP 145

Query: 175 SSKQLNRILEILVSHRNFIRPAFDLFKNARHHGVLPNTKSYNILMRAFCWNGNLSIAYTL 234
           S K  N +L+ L+   N I+  + ++++ +  G  PN  +YN+L++A C N  +  A  L
Sbjct: 146 SVKIYNHVLDTLLG-ENRIQMIYMVYRDMKRDGFEPNVFTYNVLLKALCKNNKVDGAKKL 205

Query: 235 FNKMFKRDVIPD------------------------------VESYRILMQGLCRKNQVN 294
             +M  +   PD                              V  Y  L+ GLC+++   
Sbjct: 206 LVEMSNKGCCPDAVSYTTVISSMCEVGLVKEGRELAERFEPVVSVYNALINGLCKEHDYK 265

Query: 295 GAVDLLEDMLNKGYIPDSLSYATLLNSLCRKKKLREAYKLLCRMKVKGCN---------- 354
           GA +L+ +M+ KG  P+ +SY+TL+N LC   ++  A+  L +M  +GC+          
Sbjct: 266 GAFELMREMVEKGISPNVISYSTLINVLCNSGQIELAFSFLTQMLKRGCHPNIYTLSSLV 325

Query: 355 --------------------------PDVAHYNTAIIGFCREGRALDACKILEDMQSNGC 414
                                     P+V  YNT + GFC  G  + A  +   M+  GC
Sbjct: 326 KGCFLRGTTFDALDLWNQMIRGFGLQPNVVAYNTLVQGFCSHGNIVKAVSVFSHMEEIGC 385

Query: 415 LPNLVSYQSLTNGLCDQGMFELAKDYVEEMTLKGFCPHFSIIHALVKGFRNVGRIDESCS 435
            PN+ +Y SL NG   +G  + A     +M   G CP+  +   +V+      +  E+ S
Sbjct: 386 SPNIRTYGSLINGFAKRGSLDGAVYIWNKMLTSGCCPNVVVYTNMVEALCRHSKFKEAES 445

BLAST of Bhi04G001830 vs. ExPASy Swiss-Prot
Match: Q8LDU5 (Pentatricopeptide repeat-containing protein At4g01400, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At4g01400 PE=2 SV=2)

HSP 1 Score: 552.0 bits (1421), Expect = 6.8e-156
Identity = 262/451 (58.09%), Positives = 341/451 (75.61%), Query Frame = 0

Query: 28  LHKSISSSSSLYQRDLNVHDESKTLITININHKQCGDQPHFSIGSPCRVQKLIASQSDPL 87
           L   +S+SS       + H+  K +++           P   IGSP RVQKLIASQSDPL
Sbjct: 16  LTSPLSTSSRFLFYSSSEHEARKPIVS----------NPKSPIGSPTRVQKLIASQSDPL 75

Query: 88  LAKEIFYYACRQPHFRPSSSSLPILILKLGRSKYFSLIDDLLLSFKSRGYPVTPTVFSYM 147
           LAKEIF YA +QP+FR S SS  ILILKLGR +YF+LIDD+L   +S GYP+T  +F+Y+
Sbjct: 76  LAKEIFDYASQQPNFRHSRSSHLILILKLGRGRYFNLIDDVLAKHRSSGYPLTGEIFTYL 135

Query: 148 IKIYGEADLPDKALKAFYTMIEFGCTPSSKQLNRILEILVSHRNFIRPAFDLFKNARHHG 207
           IK+Y EA LP+K L  FY M+EF  TP  K LNRIL++LVSHR +++ AF+LFK++R HG
Sbjct: 136 IKVYAEAKLPEKVLSTFYKMLEFNFTPQPKHLNRILDVLVSHRGYLQKAFELFKSSRLHG 195

Query: 208 VLPNTKSYNILMRAFCWNGNLSIAYTLFNKMFKRDVIPDVESYRILMQGLCRKNQVNGAV 267
           V+PNT+SYN+LM+AFC N +LSIAY LF KM +RDV+PDV+SY+IL+QG CRK QVNGA+
Sbjct: 196 VMPNTRSYNLLMQAFCLNDDLSIAYQLFGKMLERDVVPDVDSYKILIQGFCRKGQVNGAM 255

Query: 268 DLLEDMLNKGYIPDSLSYATLLNSLCRKKKLREAYKLLCRMKVKGCNPDVAHYNTAIIGF 327
           +LL+DMLNKG++PD LSY TLLNSLCRK +LREAYKLLCRMK+KGCNPD+ HYNT I+GF
Sbjct: 256 ELLDDMLNKGFVPDRLSYTTLLNSLCRKTQLREAYKLLCRMKLKGCNPDLVHYNTMILGF 315

Query: 328 CREGRALDACKILEDMQSNGCLPNLVSYQSLTNGLCDQGMFELAKDYVEEMTLKGFCPHF 387
           CRE RA+DA K+L+DM SNGC PN VSY++L  GLCDQGMF+  K Y+EEM  KGF PHF
Sbjct: 316 CREDRAMDARKVLDDMLSNGCSPNSVSYRTLIGGLCDQGMFDEGKKYLEEMISKGFSPHF 375

Query: 388 SIIHALVKGFRNVGRIDESCSILEDMLNHGKAPHSDTWEIIISGICEVEDTVKLCEILGK 447
           S+ + LVKGF + G+++E+C ++E ++ +G+  HSDTWE++I  IC  +++ K+   L  
Sbjct: 376 SVSNCLVKGFCSFGKVEEACDVVEVVMKNGETLHSDTWEMVIPLICNEDESEKIKLFLED 435

Query: 448 ILKKDVRRDTRIVEAGSGLGEYLIRKLQASK 479
            +K+++  DTRIV+ G GLG YL  KLQ  +
Sbjct: 436 AVKEEITGDTRIVDVGIGLGSYLSSKLQMKR 456

BLAST of Bhi04G001830 vs. ExPASy Swiss-Prot
Match: Q9FNL2 (Pentatricopeptide repeat-containing protein At5g46100 OS=Arabidopsis thaliana OX=3702 GN=At5g46100 PE=2 SV=1)

HSP 1 Score: 207.2 bits (526), Expect = 4.1e-52
Identity = 114/372 (30.65%), Positives = 194/372 (52.15%), Query Frame = 0

Query: 72  SPCRVQKLIASQSDPLLAKEIFYYACRQ--PHFRPSSSSLPILILKLGRSKYFSLIDDLL 131
           +P +V KL+ ++ D   +  +F  A  +    +    SS   ++L+L  +  F   +DL+
Sbjct: 15  TPSQVIKLMRAEKDVEKSMAVFDSATAEYANGYVHDQSSFGYMVLRLVSANKFKAAEDLI 74

Query: 132 LSFKSRGYPVTPTVFSYMIKIYGEADLPDKALKAFYTMIEFGCTPSSKQLNRILEILVSH 191
           +  K     V+  +   + + YG    P  +L+ F+ M +F C PS K    +L ILV  
Sbjct: 75  VRMKIENCVVSEDILLSICRGYGRVHRPFDSLRVFHKMKDFDCDPSQKAYVTVLAILV-E 134

Query: 192 RNFIRPAFDLFKNARHHGVLPNTKSYNILMRAFCWN-GNLSIAYTLFNKMFKRDVIPDVE 251
            N +  AF  +KN R  G+ P   S N+L++A C N G +     +F +M KR   PD  
Sbjct: 135 ENQLNLAFKFYKNMREIGLPPTVASLNVLIKALCRNDGTVDAGLKIFLEMPKRGCDPDSY 194

Query: 252 SYRILMQGLCRKNQVNGAVDLLEDMLNKGYIPDSLSYATLLNSLCRKKKLREAYKLLCRM 311
           +Y  L+ GLCR  +++ A  L  +M+ K   P  ++Y +L+N LC  K + EA + L  M
Sbjct: 195 TYGTLISGLCRFGRIDEAKKLFTEMVEKDCAPTVVTYTSLINGLCGSKNVDEAMRYLEEM 254

Query: 312 KVKGCNPDVAHYNTAIIGFCREGRALDACKILEDMQSNGCLPNLVSYQSLTNGLCDQGMF 371
           K KG  P+V  Y++ + G C++GR+L A ++ E M + GC PN+V+Y +L  GLC +   
Sbjct: 255 KSKGIEPNVFTYSSLMDGLCKDGRSLQAMELFEMMMARGCRPNMVTYTTLITGLCKEQKI 314

Query: 372 ELAKDYVEEMTLKGFCPHFSIIHALVKGFRNVGRIDESCSILEDMLNHGKAPHSDTWEI- 431
           + A + ++ M L+G  P   +   ++ GF  + +  E+ + L++M+  G  P+  TW I 
Sbjct: 315 QEAVELLDRMNLQGLKPDAGLYGKVISGFCAISKFREAANFLDEMILGGITPNRLTWNIH 374

Query: 432 ------IISGIC 434
                 ++ G+C
Sbjct: 375 VKTSNEVVRGLC 385

BLAST of Bhi04G001830 vs. ExPASy Swiss-Prot
Match: Q9FIX3 (Pentatricopeptide repeat-containing protein At5g39710 OS=Arabidopsis thaliana OX=3702 GN=EMB2745 PE=2 SV=1)

HSP 1 Score: 196.1 bits (497), Expect = 9.4e-49
Identity = 98/307 (31.92%), Positives = 168/307 (54.72%), Query Frame = 0

Query: 140 TPTVFSYMIKIYGEADLPDKALKAFYTMIEFGCTPSSKQLNRILEILVSHRNFIRPAFDL 199
           T +VF  ++K Y    L DKAL   +     G  P     N +L+  +  +  I  A ++
Sbjct: 133 TSSVFDLVVKSYSRLSLIDKALSIVHLAQAHGFMPGVLSYNAVLDATIRSKRNISFAENV 192

Query: 200 FKNARHHGVLPNTKSYNILMRAFCWNGNLSIAYTLFNKMFKRDVIPDVESYRILMQGLCR 259
           FK      V PN  +YNIL+R FC+ GN+ +A TLF+KM  +  +P+V +Y  L+ G C+
Sbjct: 193 FKEMLESQVSPNVFTYNILIRGFCFAGNIDVALTLFDKMETKGCLPNVVTYNTLIDGYCK 252

Query: 260 KNQVNGAVDLLEDMLNKGYIPDSLSYATLLNSLCRKKKLREAYKLLCRMKVKGCNPDVAH 319
             +++    LL  M  KG  P+ +SY  ++N LCR+ +++E   +L  M  +G + D   
Sbjct: 253 LRKIDDGFKLLRSMALKGLEPNLISYNVVINGLCREGRMKEVSFVLTEMNRRGYSLDEVT 312

Query: 320 YNTAIIGFCREGRALDACKILEDMQSNGCLPNLVSYQSLTNGLCDQGMFELAKDYVEEMT 379
           YNT I G+C+EG    A  +  +M  +G  P++++Y SL + +C  G    A +++++M 
Sbjct: 313 YNTLIKGYCKEGNFHQALVMHAEMLRHGLTPSVITYTSLIHSMCKAGNMNRAMEFLDQMR 372

Query: 380 LKGFCPHFSIIHALVKGFRNVGRIDESCSILEDMLNHGKAPHSDTWEIIISGIC---EVE 439
           ++G CP+      LV GF   G ++E+  +L +M ++G +P   T+  +I+G C   ++E
Sbjct: 373 VRGLCPNERTYTTLVDGFSQKGYMNEAYRVLREMNDNGFSPSVVTYNALINGHCVTGKME 432

Query: 440 DTVKLCE 444
           D + + E
Sbjct: 433 DAIAVLE 439

BLAST of Bhi04G001830 vs. ExPASy Swiss-Prot
Match: Q9M302 (Pentatricopeptide repeat-containing protein At3g48810 OS=Arabidopsis thaliana OX=3702 GN=At3g48810 PE=2 SV=1)

HSP 1 Score: 182.6 bits (462), Expect = 1.1e-44
Identity = 114/446 (25.56%), Positives = 192/446 (43.05%), Query Frame = 0

Query: 55  ININHKQCGDQPHFSIGSPCRVQKLIASQSDPLLAKEIFYYACRQPHFRPSSSSLPILIL 114
           +N+NH       H  I     V K +  +S   LA   F        F+ +  +  ++I 
Sbjct: 26  LNVNHLLTESPNHAEI-KELDVVKRLRQESCVPLALHFFKSIANSNLFKHTPLTFEVMIR 85

Query: 115 KLGRSKYFSLIDDLLLSFKSRGYPVTPTVFSYMIKIYGEADLPDKALKAFYTMIEFGCTP 174
           KL        +  LL   K +G+  +  +F  +I +Y +  L ++A++ FY + EFGC P
Sbjct: 86  KLAMDGQVDSVQYLLQQMKLQGFHCSEDLFISVISVYRQVGLAERAVEMFYRIKEFGCDP 145

Query: 175 SSKQLNRILEILVSHRNFIRPAFDLFKNARHHGVLPNTKSYNILMRAFCWNGNLSIAYTL 234
           S K  N +L+ L+   N I+  + ++++ +  G  PN  +YN+L++A C N  +  A  L
Sbjct: 146 SVKIYNHVLDTLLG-ENRIQMIYMVYRDMKRDGFEPNVFTYNVLLKALCKNNKVDGAKKL 205

Query: 235 FNKMFKRDVIPD------------------------------VESYRILMQGLCRKNQVN 294
             +M  +   PD                              V  Y  L+ GLC+++   
Sbjct: 206 LVEMSNKGCCPDAVSYTTVISSMCEVGLVKEGRELAERFEPVVSVYNALINGLCKEHDYK 265

Query: 295 GAVDLLEDMLNKGYIPDSLSYATLLNSLCRKKKLREAYKLLCRMKVKGCN---------- 354
           GA +L+ +M+ KG  P+ +SY+TL+N LC   ++  A+  L +M  +GC+          
Sbjct: 266 GAFELMREMVEKGISPNVISYSTLINVLCNSGQIELAFSFLTQMLKRGCHPNIYTLSSLV 325

Query: 355 --------------------------PDVAHYNTAIIGFCREGRALDACKILEDMQSNGC 414
                                     P+V  YNT + GFC  G  + A  +   M+  GC
Sbjct: 326 KGCFLRGTTFDALDLWNQMIRGFGLQPNVVAYNTLVQGFCSHGNIVKAVSVFSHMEEIGC 385

Query: 415 LPNLVSYQSLTNGLCDQGMFELAKDYVEEMTLKGFCPHFSIIHALVKGFRNVGRIDESCS 435
            PN+ +Y SL NG   +G  + A     +M   G CP+  +   +V+      +  E+ S
Sbjct: 386 SPNIRTYGSLINGFAKRGSLDGAVYIWNKMLTSGCCPNVVVYTNMVEALCRHSKFKEAES 445

BLAST of Bhi04G001830 vs. ExPASy Swiss-Prot
Match: Q9FMF6 (Pentatricopeptide repeat-containing protein At5g64320, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At5g64320 PE=2 SV=1)

HSP 1 Score: 179.9 bits (455), Expect = 7.0e-44
Identity = 117/421 (27.79%), Positives = 200/421 (47.51%), Query Frame = 0

Query: 72  SPCRVQKLIASQSDPLLAKEIFYYACRQPHFRPSSSSLPILILKLGRSKYFSLIDDLLLS 131
           +P ++ KL+    +   + E+F +   Q  +R S     +LI KLG +  F  ID LL+ 
Sbjct: 77  TPFQLYKLLELPLNVSTSMELFSWTGSQNGYRHSFDVYQVLIGKLGANGEFKTIDRLLIQ 136

Query: 132 FKSRGYPVTPTVFSYMIKIYGEADLPDKALKAFYTMIE-FGCTPSSKQLNRILEILVS-- 191
            K  G     ++F  +++ Y +A  P +  +    M   + C P+ K  N +LEILVS  
Sbjct: 137 MKDEGIVFKESLFISIMRDYDKAGFPGQTTRLMLEMRNVYSCEPTFKSYNVVLEILVSGN 196

Query: 192 -HR-------------------------------NFIRPAFDLFKNARHHGVLPNTKSYN 251
            H+                               N I  A  L ++   HG +PN+  Y 
Sbjct: 197 CHKVAANVFYDMLSRKIPPTLFTFGVVMKAFCAVNEIDSALSLLRDMTKHGCVPNSVIYQ 256

Query: 252 ILMRAFCWNGNLSIAYTLFNKMFKRDVIPDVESYRILMQGLCRKNQVNGAVDLLEDMLNK 311
            L+ +      ++ A  L  +MF    +PD E++  ++ GLC+ +++N A  ++  ML +
Sbjct: 257 TLIHSLSKCNRVNEALQLLEEMFLMGCVPDAETFNDVILGLCKFDRINEAAKMVNRMLIR 316

Query: 312 GYIPDSLSYATLLNSLCRKKKLREAYKLLCRMKVKGCNPDVAHYNTAIIGFCREGRALDA 371
           G+ PD ++Y  L+N LC+  ++  A  L  R+      P++  +NT I GF   GR  DA
Sbjct: 317 GFAPDDITYGYLMNGLCKIGRVDAAKDLFYRIP----KPEIVIFNTLIHGFVTHGRLDDA 376

Query: 372 CKILEDM-QSNGCLPNLVSYQSLTNGLCDQGMFELAKDYVEEMTLKGFCPHFSIIHALVK 431
             +L DM  S G +P++ +Y SL  G   +G+  LA + + +M  KG  P+      LV 
Sbjct: 377 KAVLSDMVTSYGIVPDVCTYNSLIYGYWKEGLVGLALEVLHDMRNKGCKPNVYSYTILVD 436

Query: 432 GFRNVGRIDESCSILEDMLNHGKAPHSDTWEIIISGICEVEDTVKLCEILGKILKKDVRR 457
           GF  +G+IDE+ ++L +M   G  P++  +  +IS  C+     +  EI  ++ +K  + 
Sbjct: 437 GFCKLGKIDEAYNVLNEMSADGLKPNTVGFNCLISAFCKEHRIPEAVEIFREMPRKGCKP 493

BLAST of Bhi04G001830 vs. ExPASy TrEMBL
Match: A0A6J1EXV3 (pentatricopeptide repeat-containing protein At4g01400, mitochondrial-like OS=Cucurbita moschata OX=3662 GN=LOC111439286 PE=4 SV=1)

HSP 1 Score: 902.1 bits (2330), Expect = 9.9e-259
Identity = 435/481 (90.44%), Positives = 454/481 (94.39%), Query Frame = 0

Query: 1   MRRHLLRPCNYNTIETIAAHVVPKTPLLHKSISSSSSLYQRDLNVHDESKTLITININHK 60
           MR+HLLRPCNY T+ET+A H+ PKTPLLH SISSSSSLYQ DLNVH+E KTL   NINHK
Sbjct: 1   MRQHLLRPCNYKTLETVAVHLAPKTPLLHNSISSSSSLYQPDLNVHNELKTLSATNINHK 60

Query: 61  QCGDQPHFSIGSPCRVQKLIASQSDPLLAKEIFYYACRQPHFRPSSSSLPILILKLGRSK 120
               QP FSIGSPCRVQKLIASQSDPLLAKEIF YACRQPHFRPSSSS  +LILKLGRSK
Sbjct: 61  HLEQQPDFSIGSPCRVQKLIASQSDPLLAKEIFDYACRQPHFRPSSSSFLVLILKLGRSK 120

Query: 121 YFSLIDDLLLSFKSRGYPVTPTVFSYMIKIYGEADLPDKALKAFYTMIEFGCTPSSKQLN 180
           YFSLIDDLLLSFKSRGYP++PTVFSY+IKIYGEADLPDKALK FYTMIEFGCTPSSKQLN
Sbjct: 121 YFSLIDDLLLSFKSRGYPLSPTVFSYIIKIYGEADLPDKALKTFYTMIEFGCTPSSKQLN 180

Query: 181 RILEILVSHRNFIRPAFDLFKNARHHGVLPNTKSYNILMRAFCWNGNLSIAYTLFNKMFK 240
           RILEILVSHRNFIRPAFDLFKNARHHGVLPNTKSYNILMR FCWNG+LSIAYTLFNKMFK
Sbjct: 181 RILEILVSHRNFIRPAFDLFKNARHHGVLPNTKSYNILMRVFCWNGDLSIAYTLFNKMFK 240

Query: 241 RDVIPDVESYRILMQGLCRKNQVNGAVDLLEDMLNKGYIPDSLSYATLLNSLCRKKKLRE 300
           RDV+PDVESYRILMQGLCRKNQV GAVDLLEDMLNKGY+PD+LSYATLLNSLCRKKKLRE
Sbjct: 241 RDVVPDVESYRILMQGLCRKNQVIGAVDLLEDMLNKGYVPDTLSYATLLNSLCRKKKLRE 300

Query: 301 AYKLLCRMKVKGCNPDVAHYNTAIIGFCREGRALDACKILEDMQSNGCLPNLVSYQSLTN 360
           AYKLLCRMKVKGCNPDVAHYNT I GFCREGRALDACKILEDMQSN CLPNLVSYQSLTN
Sbjct: 301 AYKLLCRMKVKGCNPDVAHYNTVITGFCREGRALDACKILEDMQSNRCLPNLVSYQSLTN 360

Query: 361 GLCDQGMFELAKDYVEEMTLKGFCPHFSIIHALVKGFRNVGRIDESCSILEDMLNHGKAP 420
           GLCDQGMFELAKDYVEEMTLKGFCPHFS+IH LVKGF NVGRID+SCS+LEDML HGKAP
Sbjct: 361 GLCDQGMFELAKDYVEEMTLKGFCPHFSVIHTLVKGFINVGRIDDSCSVLEDMLKHGKAP 420

Query: 421 HSDTWEIIISGICEVEDTVKLCEILGKILKKDVRRDTRIVEAGSGLGEYLIRKLQASKSR 480
           HS+TWE+IISG+CEVEDTVKLCEIL KILKKDVRRDTRIVEAGSGLGEYLIRKLQASKSR
Sbjct: 421 HSETWEMIISGVCEVEDTVKLCEILEKILKKDVRRDTRIVEAGSGLGEYLIRKLQASKSR 480

Query: 481 R 482
           R
Sbjct: 481 R 481

BLAST of Bhi04G001830 vs. ExPASy TrEMBL
Match: A0A6J1ICW5 (pentatricopeptide repeat-containing protein At4g01400, mitochondrial-like OS=Cucurbita maxima OX=3661 GN=LOC111472648 PE=4 SV=1)

HSP 1 Score: 901.7 bits (2329), Expect = 1.3e-258
Identity = 438/481 (91.06%), Positives = 455/481 (94.59%), Query Frame = 0

Query: 1   MRRHLLRPCNYNTIETIAAHVVPKTPLLHKSISSSSSLYQRDLNVHDESKTLITININHK 60
           MR+HLLRPCNY TIET+A H+ PKTPLLH SISSSSSLYQ DLNVH+E KTL   NINHK
Sbjct: 1   MRQHLLRPCNYKTIETVAVHLAPKTPLLHNSISSSSSLYQPDLNVHNELKTLNDTNINHK 60

Query: 61  QCGDQPHFSIGSPCRVQKLIASQSDPLLAKEIFYYACRQPHFRPSSSSLPILILKLGRSK 120
              +QP FSIGSPCRVQKLIASQSDPLLAKEIF YACRQPHFRPSSSS  +LILKLGRSK
Sbjct: 61  HLEEQPDFSIGSPCRVQKLIASQSDPLLAKEIFDYACRQPHFRPSSSSFLVLILKLGRSK 120

Query: 121 YFSLIDDLLLSFKSRGYPVTPTVFSYMIKIYGEADLPDKALKAFYTMIEFGCTPSSKQLN 180
           YFSLI+DLLLSFKSRGYP++PTVFSY+IKIYGEADLPDKALK FYTMIEFGCTPSSKQLN
Sbjct: 121 YFSLINDLLLSFKSRGYPLSPTVFSYIIKIYGEADLPDKALKTFYTMIEFGCTPSSKQLN 180

Query: 181 RILEILVSHRNFIRPAFDLFKNARHHGVLPNTKSYNILMRAFCWNGNLSIAYTLFNKMFK 240
           RILEILVSHR+FIRPAFDLFKNARHHGVLPNTKSYNILMRAFCWNG+LSIAYTLFNKMFK
Sbjct: 181 RILEILVSHRDFIRPAFDLFKNARHHGVLPNTKSYNILMRAFCWNGDLSIAYTLFNKMFK 240

Query: 241 RDVIPDVESYRILMQGLCRKNQVNGAVDLLEDMLNKGYIPDSLSYATLLNSLCRKKKLRE 300
           RDVIPDVESYRILMQGLCRKNQV GAVDLLEDMLNKGYIPD+LSYATLLNSLCRKKKLRE
Sbjct: 241 RDVIPDVESYRILMQGLCRKNQVIGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKLRE 300

Query: 301 AYKLLCRMKVKGCNPDVAHYNTAIIGFCREGRALDACKILEDMQSNGCLPNLVSYQSLTN 360
           AYKLLCRMKVKGCNPDVAHYNT I GFCREGRALDACKILEDMQ NGCLPNLVSYQSLTN
Sbjct: 301 AYKLLCRMKVKGCNPDVAHYNTVITGFCREGRALDACKILEDMQLNGCLPNLVSYQSLTN 360

Query: 361 GLCDQGMFELAKDYVEEMTLKGFCPHFSIIHALVKGFRNVGRIDESCSILEDMLNHGKAP 420
           GLCDQGMFELAKDYVEEMTL GFCPHFS+IH LVKGF NVGRID+SCS+LEDML HGKAP
Sbjct: 361 GLCDQGMFELAKDYVEEMTLNGFCPHFSVIHTLVKGFINVGRIDDSCSVLEDMLKHGKAP 420

Query: 421 HSDTWEIIISGICEVEDTVKLCEILGKILKKDVRRDTRIVEAGSGLGEYLIRKLQASKSR 480
           HS+TWEIIISGICEVEDTVKLCEIL KILKKDVRRDTRIVEAGSGLGEYLIRKLQASKSR
Sbjct: 421 HSETWEIIISGICEVEDTVKLCEILEKILKKDVRRDTRIVEAGSGLGEYLIRKLQASKSR 480

Query: 481 R 482
           R
Sbjct: 481 R 481

BLAST of Bhi04G001830 vs. ExPASy TrEMBL
Match: A0A5A7SWW3 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold155G00670 PE=4 SV=1)

HSP 1 Score: 889.0 bits (2296), Expect = 8.6e-255
Identity = 433/479 (90.40%), Positives = 451/479 (94.15%), Query Frame = 0

Query: 4   HLLRPCNYNTIETIAAHVVPKTPLLHKSISSSSSLYQRDLNVHDESKTLITININHKQCG 63
           HLLRP NY TIET+AAHV    PLLH  ISSSSSLYQ  LNVH+ESKTLIT NINHKQC 
Sbjct: 86  HLLRPGNYRTIETVAAHVARNAPLLHNLISSSSSLYQPHLNVHNESKTLIT-NINHKQCE 145

Query: 64  DQPHFSIGSPCRVQKLIASQSDPLLAKEIFYYACRQPHFRPSSSSLPILILKLGRSKYFS 123
           DQP FSIGSPCRVQKLIASQSDPLLAKEIF YACRQPHFRPSSSSL +LILKLGRSKYFS
Sbjct: 146 DQPDFSIGSPCRVQKLIASQSDPLLAKEIFDYACRQPHFRPSSSSLLVLILKLGRSKYFS 205

Query: 124 LIDDLLLSFKSRGYPVTPTVFSYMIKIYGEADLPDKALKAFYTMIEFGCTPSSKQLNRIL 183
           LIDDLLLSFKSRGYPVTPT FSY+IKIYGEADLPDKALK FYTMIEFGCTPSSKQLNRIL
Sbjct: 206 LIDDLLLSFKSRGYPVTPTAFSYIIKIYGEADLPDKALKVFYTMIEFGCTPSSKQLNRIL 265

Query: 184 EILVSHRNFIRPAFDLFKNARHHGVLPNTKSYNILMRAFCWNGNLSIAYTLFNKMFKRDV 243
           EILVSHRNFIRPAFDLFKNARHHGVLPNTKSYNIL+RAFCWNGN+SIAY LFNKMF+ DV
Sbjct: 266 EILVSHRNFIRPAFDLFKNARHHGVLPNTKSYNILIRAFCWNGNISIAYKLFNKMFEGDV 325

Query: 244 IPDVESYRILMQGLCRKNQVNGAVDLLEDMLNKGYIPDSLSYATLLNSLCRKKKLREAYK 303
           IPDVE+YR LMQGLCRKNQVNGAVDLLEDMLNKGYIPD+LSYATLLNSLCRKKKL+EAYK
Sbjct: 326 IPDVETYRTLMQGLCRKNQVNGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKLKEAYK 385

Query: 304 LLCRMKVKGCNPDVAHYNTAIIGFCREGRALDACKILEDMQSNGCLPNLVSYQSLTNGLC 363
           LLCRMKVKGCNPD+AHYNT I+GFCREGRALDACKILEDMQSNGCLPNLVSY+SLTNGLC
Sbjct: 386 LLCRMKVKGCNPDIAHYNTVIMGFCREGRALDACKILEDMQSNGCLPNLVSYESLTNGLC 445

Query: 364 DQGMFELAKDYVEEMTLKGFCPHFSIIHALVKGFRNVGRIDESCSILEDMLNHGKAPHSD 423
           DQGMFELAK YVEEMTLKGF PHFS+IHALVKGF NVGR+DESCS+LE ML HGKAPHSD
Sbjct: 446 DQGMFELAKGYVEEMTLKGFYPHFSVIHALVKGFHNVGRMDESCSVLEGMLKHGKAPHSD 505

Query: 424 TWEIIISGICEVEDTVKLCEILGKILKKDVRRDTRIVEAGSGLGEYLIRKLQASKSRRV 483
           TWEIIISGICEVEDTVK CEIL KILKKDVRRDTRIVEAG+GLGEYLIRKLQASKSRR+
Sbjct: 506 TWEIIISGICEVEDTVKFCEILEKILKKDVRRDTRIVEAGTGLGEYLIRKLQASKSRRI 563

BLAST of Bhi04G001830 vs. ExPASy TrEMBL
Match: A0A1S3CIG1 (pentatricopeptide repeat-containing protein At4g01400, mitochondrial-like OS=Cucumis melo OX=3656 GN=LOC103501324 PE=4 SV=1)

HSP 1 Score: 889.0 bits (2296), Expect = 8.6e-255
Identity = 433/479 (90.40%), Positives = 451/479 (94.15%), Query Frame = 0

Query: 4   HLLRPCNYNTIETIAAHVVPKTPLLHKSISSSSSLYQRDLNVHDESKTLITININHKQCG 63
           HLLRP NY TIET+AAHV    PLLH  ISSSSSLYQ  LNVH+ESKTLIT NINHKQC 
Sbjct: 4   HLLRPGNYRTIETVAAHVARNAPLLHNLISSSSSLYQPHLNVHNESKTLIT-NINHKQCE 63

Query: 64  DQPHFSIGSPCRVQKLIASQSDPLLAKEIFYYACRQPHFRPSSSSLPILILKLGRSKYFS 123
           DQP FSIGSPCRVQKLIASQSDPLLAKEIF YACRQPHFRPSSSSL +LILKLGRSKYFS
Sbjct: 64  DQPDFSIGSPCRVQKLIASQSDPLLAKEIFDYACRQPHFRPSSSSLLVLILKLGRSKYFS 123

Query: 124 LIDDLLLSFKSRGYPVTPTVFSYMIKIYGEADLPDKALKAFYTMIEFGCTPSSKQLNRIL 183
           LIDDLLLSFKSRGYPVTPT FSY+IKIYGEADLPDKALK FYTMIEFGCTPSSKQLNRIL
Sbjct: 124 LIDDLLLSFKSRGYPVTPTAFSYIIKIYGEADLPDKALKVFYTMIEFGCTPSSKQLNRIL 183

Query: 184 EILVSHRNFIRPAFDLFKNARHHGVLPNTKSYNILMRAFCWNGNLSIAYTLFNKMFKRDV 243
           EILVSHRNFIRPAFDLFKNARHHGVLPNTKSYNIL+RAFCWNGN+SIAY LFNKMF+ DV
Sbjct: 184 EILVSHRNFIRPAFDLFKNARHHGVLPNTKSYNILIRAFCWNGNISIAYKLFNKMFEGDV 243

Query: 244 IPDVESYRILMQGLCRKNQVNGAVDLLEDMLNKGYIPDSLSYATLLNSLCRKKKLREAYK 303
           IPDVE+YR LMQGLCRKNQVNGAVDLLEDMLNKGYIPD+LSYATLLNSLCRKKKL+EAYK
Sbjct: 244 IPDVETYRTLMQGLCRKNQVNGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKLKEAYK 303

Query: 304 LLCRMKVKGCNPDVAHYNTAIIGFCREGRALDACKILEDMQSNGCLPNLVSYQSLTNGLC 363
           LLCRMKVKGCNPD+AHYNT I+GFCREGRALDACKILEDMQSNGCLPNLVSY+SLTNGLC
Sbjct: 304 LLCRMKVKGCNPDIAHYNTVIMGFCREGRALDACKILEDMQSNGCLPNLVSYESLTNGLC 363

Query: 364 DQGMFELAKDYVEEMTLKGFCPHFSIIHALVKGFRNVGRIDESCSILEDMLNHGKAPHSD 423
           DQGMFELAK YVEEMTLKGF PHFS+IHALVKGF NVGR+DESCS+LE ML HGKAPHSD
Sbjct: 364 DQGMFELAKGYVEEMTLKGFYPHFSVIHALVKGFHNVGRMDESCSVLEGMLKHGKAPHSD 423

Query: 424 TWEIIISGICEVEDTVKLCEILGKILKKDVRRDTRIVEAGSGLGEYLIRKLQASKSRRV 483
           TWEIIISGICEVEDTVK CEIL KILKKDVRRDTRIVEAG+GLGEYLIRKLQASKSRR+
Sbjct: 424 TWEIIISGICEVEDTVKFCEILEKILKKDVRRDTRIVEAGTGLGEYLIRKLQASKSRRI 481

BLAST of Bhi04G001830 vs. ExPASy TrEMBL
Match: A0A0A0K8U0 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G014780 PE=4 SV=1)

HSP 1 Score: 870.9 bits (2249), Expect = 2.4e-249
Identity = 425/483 (87.99%), Positives = 450/483 (93.17%), Query Frame = 0

Query: 1   MRRHLLRPCNYNTIETI-AAHVVPKTPLLHKSISSSSSLYQRDLNVHDESKTLITININH 60
           M +HLLRPCNY TIET+ AAHV  K+PLL   ISSSSSLYQ  LNVH+ESK LIT N+ H
Sbjct: 1   MWQHLLRPCNYRTIETVAAAHVARKSPLLRNLISSSSSLYQPHLNVHNESKFLIT-NVKH 60

Query: 61  KQCGDQPHFSIGSPCRVQKLIASQSDPLLAKEIFYYACRQPHFRPSSSSLPILILKLGRS 120
           +QC DQP FSIGSPCRVQKLIASQSDPLLAKEIF YACRQPHFRPSSSSL +LILKLGRS
Sbjct: 61  EQCEDQPDFSIGSPCRVQKLIASQSDPLLAKEIFDYACRQPHFRPSSSSLLVLILKLGRS 120

Query: 121 KYFSLIDDLLLSFKSRGYPVTPTVFSYMIKIYGEADLPDKALKAFYTMIEFGCTPSSKQL 180
           KYFSLIDDLLLSFKSR YPVTPT FSY+IKIYGEADLPDKALK FYTMI+FGCTPSSKQL
Sbjct: 121 KYFSLIDDLLLSFKSRRYPVTPTAFSYIIKIYGEADLPDKALKVFYTMIDFGCTPSSKQL 180

Query: 181 NRILEILVSHRNFIRPAFDLFKNARHHGVLPNTKSYNILMRAFCWNGNLSIAYTLFNKMF 240
           NRILEILVSHRNFIRPAFDLFKNARHHGVLPNTKSYNIL+RAFCWNGN+SIAYTLFNKMF
Sbjct: 181 NRILEILVSHRNFIRPAFDLFKNARHHGVLPNTKSYNILIRAFCWNGNISIAYTLFNKMF 240

Query: 241 KRDVIPDVESYRILMQGLCRKNQVNGAVDLLEDMLNKGYIPDSLSYATLLNSLCRKKKLR 300
           +R+VIPDVE+YR LMQGLCRKNQVNGAVDLLEDMLNKGYIPD+LSYATLLNSLCRKKKLR
Sbjct: 241 ERNVIPDVETYRTLMQGLCRKNQVNGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKLR 300

Query: 301 EAYKLLCRMKVKGCNPDVAHYNTAIIGFCREGRALDACKILEDMQSNGCLPNLVSYQSLT 360
           EAYKLLCRMKVKGCNPD+AHYNT I+GFCREGRALDACKILEDMQSNGCLPNLVSY+SLT
Sbjct: 301 EAYKLLCRMKVKGCNPDIAHYNTVIMGFCREGRALDACKILEDMQSNGCLPNLVSYESLT 360

Query: 361 NGLCDQGMFELAKDYVEEMTLKGFCPHFSIIHALVKGFRNVGRIDESCSILEDMLNHGKA 420
           NGLCDQGMFELAK YVEEMTLKGF PHFS+IHALVKGF ++GRI ESCS+LEDML  GKA
Sbjct: 361 NGLCDQGMFELAKGYVEEMTLKGFYPHFSVIHALVKGFHSIGRIHESCSVLEDMLKRGKA 420

Query: 421 PHSDTWEIIISGICEVEDTVKLCEILGKILKKDVRRDTRIVEAGSGLGEYLIRKLQASKS 480
           PHSDTWEIIISGICEVEDT K CE+  KILKKDVRRDTRIVEAG+GLGEYLIRKLQAS S
Sbjct: 421 PHSDTWEIIISGICEVEDTAKFCEVWEKILKKDVRRDTRIVEAGTGLGEYLIRKLQASIS 480

Query: 481 RRV 483
           RR+
Sbjct: 481 RRI 482
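
The percentages in the HSP headers above follow directly from the reported counts: matches divided by the alignment length given as the denominator. A minimal sketch of that arithmetic follows; the helper name `percent` and the `hsps` dictionary are illustrative assumptions, not part of any BLAST tool.

```python
def percent(matches: int, aligned_length: int) -> float:
    """Return matches as a percentage of the alignment length, rounded to 2 places."""
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    return round(100.0 * matches / aligned_length, 2)


# (identical, positive, alignment length) copied from the HSP headers above.
hsps = {
    "A0A5A7SWW3": (433, 451, 479),
    "A0A1S3CIG1": (433, 451, 479),
    "A0A0A0K8U0": (425, 450, 483),
}

for accession, (identical, positive, length) in hsps.items():
    # Prints the accession, then identity and positives as percentages.
    print(accession, percent(identical, length), percent(positive, length))
```

The two Cucumis melo entries report identical counts, so they yield the same percentages; the Cucumis sativus hit spans a slightly longer alignment with fewer identical positions.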

The following BLAST results are available for this feature:
Match Name	E-value	Identity (%)	Description
AT4G01400.3	4.8e-157	58.09	FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknow...
AT4G01400.1	3.9e-106	45.18	FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknow...
AT5G46100.1	2.9e-53	30.65	Pentatricopeptide repeat (PPR) superfamily protein
AT5G39710.1	6.7e-50	31.92	Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G48810.1	7.7e-46	25.56	Pentatricopeptide repeat (PPR) superfamily protein

Match Name	E-value	Identity (%)	Description
Q8LDU5	6.8e-156	58.09	Pentatricopeptide repeat-containing protein At4g01400, mitochondrial OS=Arabidop...
Q9FNL2	4.1e-52	30.65	Pentatricopeptide repeat-containing protein At5g46100 OS=Arabidopsis thaliana OX...
Q9FIX3	9.4e-49	31.92	Pentatricopeptide repeat-containing protein At5g39710 OS=Arabidopsis thaliana OX...
Q9M302	1.1e-44	25.56	Pentatricopeptide repeat-containing protein At3g48810 OS=Arabidopsis thaliana OX...
Q9FMF6	7.0e-44	27.79	Pentatricopeptide repeat-containing protein At5g64320, mitochondrial OS=Arabidop...

Match Name	E-value	Identity (%)	Description
A0A6J1EXV3	9.9e-259	90.44	pentatricopeptide repeat-containing protein At4g01400, mitochondrial-like OS=Cuc...
A0A6J1ICW5	1.3e-258	91.06	pentatricopeptide repeat-containing protein At4g01400, mitochondrial-like OS=Cuc...
A0A5A7SWW3	8.6e-255	90.40	Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946...
A0A1S3CIG1	8.6e-255	90.40	pentatricopeptide repeat-containing protein At4g01400, mitochondrial-like OS=Cuc...
A0A0A0K8U0	2.4e-249	87.99	Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G014780 PE=4 SV=1

InterPro
Analysis Name: InterPro Annotations of Wax gourd (B227) v1
Date Performed: 2021-10-22
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR	coord: 143..172; e-value: 0.0028; score: 17.8
IPR002885	Pentatricopeptide repeat	PFAM	PF12854	PPR_1	coord: 276..309; e-value: 4.1E-8; score: 32.8
IPR002885	Pentatricopeptide repeat	PFAM	PF13041	PPR_2	coord: 210..259; e-value: 1.0E-14; score: 54.4
IPR002885	Pentatricopeptide repeat	PFAM	PF13041	PPR_2	coord: 315..364; e-value: 1.2E-12; score: 47.8
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 249..281; e-value: 6.0E-7; score: 27.3
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 284..317; e-value: 1.7E-8; score: 32.1
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 353..385; e-value: 1.8E-5; score: 22.6
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 143..175; e-value: 6.1E-5; score: 20.9
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 214..247; e-value: 1.1E-5; score: 23.3
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 320..351; e-value: 3.3E-8; score: 31.2
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 246..280; score: 11.794416
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 316..350; score: 11.783455
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 281..315; score: 12.24383
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 140..174; score: 9.470621
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 351..385; score: 10.742131
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 211..245; score: 11.366925
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	1.25.40.10	Tetratricopeptide repeat domain	coord: 192..297; e-value: 2.0E-29; score: 104.3
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	1.25.40.10	Tetratricopeptide repeat domain	coord: 72..191; e-value: 2.7E-11; score: 45.1
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	1.25.40.10	Tetratricopeptide repeat domain	coord: 298..474; e-value: 1.6E-33; score: 118.4
None	No IPR available	PANTHER	PTHR47942	TETRATRICOPEPTIDE REPEAT (TPR)-LIKE SUPERFAMILY PROTEIN-RELATED	coord: 56..473
None	No IPR available	PANTHER	PTHR47942:SF2	OS09G0532800 PROTEIN	coord: 56..473
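
The PROSITE PS51375 (PPR) coordinates in the InterPro annotations above can be checked for tandem arrangement, that is, whether each hit begins at the residue immediately after the previous one ends. The sketch below does this under the assumption that the listed coordinates are 1-based and inclusive; the function name `tandem_runs` is hypothetical and only serves the illustration.

```python
from typing import List, Tuple

Interval = Tuple[int, int]

# PROSITE PS51375 (PPR) hit coordinates copied from the InterPro annotations above.
prosite_ppr: List[Interval] = [
    (246, 280), (316, 350), (281, 315), (140, 174), (351, 385), (211, 245),
]


def tandem_runs(hits: List[Interval]) -> List[List[Interval]]:
    """Group hits into runs in which each hit starts right after the previous one ends."""
    runs: List[List[Interval]] = []
    for start, end in sorted(hits):
        if runs and start == runs[-1][-1][1] + 1:
            runs[-1].append((start, end))  # directly adjacent: extend the current run
        else:
            runs.append([(start, end)])    # gap or first hit: start a new run
    return runs


runs = tandem_runs(prosite_ppr)
# Length of each hit, assuming inclusive coordinates.
lengths = [end - start + 1 for start, end in prosite_ppr]

for run in runs:
    print(f"{run[0][0]}..{run[-1][1]}: {len(run)} repeat(s)")
```

With these coordinates, every PROSITE hit spans 35 residues, and the hits from 211 to 385 form one uninterrupted run of five repeats, with a separate single repeat at 140..174.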

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
Bhi04M001830	Bhi04M001830	mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0000373 Group II intron splicing
biological_process GO:0032981 mitochondrial respiratory chain complex I assembly
biological_process GO:0000963 mitochondrial RNA processing
biological_process GO:0015031 protein transport
biological_process GO:0060628 regulation of ER to Golgi vesicle-mediated transport
biological_process GO:0006890 retrograde vesicle-mediated transport, Golgi to endoplasmic reticulum
cellular_component GO:0070939 Dsl1/NZR complex
cellular_component GO:0016020 membrane
cellular_component GO:0005739 mitochondrion
molecular_function GO:0005515 protein binding