Bhi10G000717 (gene) Wax gourd (B227) v1

Overview
Name: Bhi10G000717
Type: gene
Organism: Benincasa hispida (Wax gourd (B227) v1)
Description: Eukaryotic aspartyl protease family protein
Location: chr10: 18798768 .. 18801078 (+)
RNA-Seq Expression: Bhi10G000717
Synteny: Bhi10G000717
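
The Location field above can be turned into a chromosome, strand, and span length. The sketch below is only illustrative: the helper names `parse_location` and `span_length` are hypothetical, and it assumes the coordinates are 1-based and inclusive, which this page does not state explicitly.

```python
import re

def parse_location(text):
    """Parse a location string such as 'chr10: 18798768 .. 18801078 (+)'."""
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    # Return (chromosome, start, end, strand) with integer coordinates.
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

def span_length(start, end):
    # Assumption: 1-based, inclusive coordinates, so both endpoints count.
    return end - start + 1

chrom, start, end, strand = parse_location("chr10: 18798768 .. 18801078 (+)")
print(chrom, strand, span_length(start, end))
```

Under the inclusive-coordinate assumption the gene spans 2311 bp on the plus strand of chr10.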
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

AGGTTGATATAATTGAGTGGAAAATGTAATAAATTATAAAGTGTGTGTTAGTAGTTAAGATAATTAATTATTTTATTATTAGAAGGTAGCGAGAAAACAGAACATTAGGTGTATGAATTTGAATTAGTTAGTTATATAAAACAGGGGAAAGAAAAGAAAAGGAAAAGTCCCCATTAATAGCCACACTATATTTAACCCTTTTCAGTTAAGCTTCGTCTTCCTTCTTCCCTTTCCCCCCTTTTTAAGTCCGCCATTATCATCTTTCTTCTTGCTCCATCTCTCTAACATTACATTCTCTGTTTTGTATGTTAGGTTACAGGAAGCCAATGTCGCCCATTTCACATTTTTGTTTGTTCTTCCTCTTTTTCTTCCTCTCTGTTCCCATCGCATTCGGCGACGGCAGCCATGATCAAGAAAATGTAAAACTGGATCTACTTCACCGCCACCATCCACAAGTCTCCGAGAAGCTTCACGGCGATATAAAACTTGAAAATATGAACGATCGAATCAAAGATATTCTCGAGCACGATCAAAAGCGTTACCAAACGATCTCCTCGTCGTTGAATCGGAATGAACTTGATGAGCAATTAAGGAAGGAGGCGGCGGAGTTGGCGGAGAAGGATCTAAAACTTCCACCGATATCGTCAACACCAATAGGGTTGAAAATGATATCGGGTTCTGACTATGGAAGTAGTGAGTATTTTGTTCAATTGAAAGTCGGAACGCCGCCGCAGACGTTTATGTTGATCGCCGATACTGGAAGTGATTTGACGTGGATGAAATGTAGATATCGGCGGTGTATCGGAAATTGTAGCAGTAATCCGAATCATAAAACCCGAAATGAACGGAAAGTGAGATTTAGAAATGCGTTTTTGGCGAATTATTCGTCATCTTTTAAGACGATTGATTGCAGCTCGAAGATGTGTACGAATGATCTTGCTGATTTGTTCTCAATTGGGGAATGCCAAACCCCAACTAGCCCTTGTCTCTATGATTACAGGTAGGTACGTACATTTATTCAACGAAAACTAATTTTGTTACCTCTAAACTAAAATAAATCCTAATGTAAATCTCTATTAAAAAAACAAAACCATTCTAAAATATTTTTAAGAGCCTAATCCTATGTTCTCCTTCTTAATTATTATTATTATTATTAGAGTGGATGGATACACCTATCTAGAGTATATGTATATATGTTGAGAAGTAGCTAATGAATATTATTATTATTATTAATTTTTATTTGGAATGGAAATAGCTACTCAGGTGGAGCAAGTGCAAAGGGATTATTCGCAATCGAGACTCTAACCGTAGGCTTAACAAACGGAAAAGAAAAACAACTTCACAATTCTATAATCGGCTGCACCGAATCAGTTCAAGGCAGAATCTTCGGCGGAGCCGACGGCGTCATTGGCTTAGGCACTAGCAGCTATTCTTTCACTTACAAAGCCGCCGAAAACGCCAACGGCGGCGGCTTCGCCTACTGCCTTGTCGACCATCTCAGCGACCGTACCGCCACCAGCTACTTCATCCTCGGCAACCCTATCTCTTCCACTGACTCTGCCGCCGCCGCCTCCTCCGTCGCCCCTACCGGCAACATGTCCTTCACTAAACTCTTCCTCGGCGACCCTTACAGCAGCTTCTACGGCGTCGATCTCGTCGGAATCTCCGCAGACGGCGTCATGCTCAACATCCCTCCCCGTGTTTGGGACATCAATTCCGGCGGCGGAACCATCGTCGACTCCGGAACTAGTCTCACCATGCTGGCGGCGCCGGCGTTTGACATGGTCATGGAAGCTCTGGTTCCGAAGCTGAAGCATTTCGAGAATATTGAAATTGAACCTTTCGATTTTTGCTTCAATAATAGCCGATATACCCATGAAATGGCTCCGAAGCTCCGATTCCATTTCGGCGACGGCACGGTGTTCCAGCCGCCGCCGAAAAGCTACATTGTGTCGGTGGGTGAATATATCAGCTGTATTGGTTTCGTTTCTATGCCTTTT
CCGGCGACCAATATCATTGGGAATATTCTTCAGCAAAATCACCTTTGGAAATTTGATTTCCATGCAGGAACAGTCGGTTTTGCCCCTTCTGAATGCGTCTAAGGAACTTCATCATCTTCTTTCTTCCTTTCGTTTCTTGATTTTCAATTTTATATAATTATTATTATTGTTTTTTCTTTTCTCTTTCTGTAATACCTGTTATTATTAATTATATTATACAATGTGAGCAATCTCTTTTGTCCTTTTTATTTTATTTTCTCTTTATTTTTTCTCAATTGGAATAAAATTGGAGAGTGGAGATTCCTTTTAAA

mRNA sequence

AGGTTGATATAATTGAGTGGAAAATGTAATAAATTATAAAGTGTGTGTTAGTAGTTAAGATAATTAATTATTTTATTATTAGAAGGTAGCGAGAAAACAGAACATTAGGTGTATGAATTTGAATTAGTTAGTTATATAAAACAGGGGAAAGAAAAGAAAAGGAAAAGTCCCCATTAATAGCCACACTATATTTAACCCTTTTCAGTTAAGCTTCGTCTTCCTTCTTCCCTTTCCCCCCTTTTTAAGTCCGCCATTATCATCTTTCTTCTTGCTCCATCTCTCTAACATTACATTCTCTGTTTTGTATGTTAGGTTACAGGAAGCCAATGTCGCCCATTTCACATTTTTGTTTGTTCTTCCTCTTTTTCTTCCTCTCTGTTCCCATCGCATTCGGCGACGGCAGCCATGATCAAGAAAATGTAAAACTGGATCTACTTCACCGCCACCATCCACAAGTCTCCGAGAAGCTTCACGGCGATATAAAACTTGAAAATATGAACGATCGAATCAAAGATATTCTCGAGCACGATCAAAAGCGTTACCAAACGATCTCCTCGTCGTTGAATCGGAATGAACTTGATGAGCAATTAAGGAAGGAGGCGGCGGAGTTGGCGGAGAAGGATCTAAAACTTCCACCGATATCGTCAACACCAATAGGGTTGAAAATGATATCGGGTTCTGACTATGGAAGTAGTGAGTATTTTGTTCAATTGAAAGTCGGAACGCCGCCGCAGACGTTTATGTTGATCGCCGATACTGGAAGTGATTTGACGTGGATGAAATGTAGATATCGGCGGTGTATCGGAAATTGTAGCAGTAATCCGAATCATAAAACCCGAAATGAACGGAAAGTGAGATTTAGAAATGCGTTTTTGGCGAATTATTCGTCATCTTTTAAGACGATTGATTGCAGCTCGAAGATGTGTACGAATGATCTTGCTGATTTGTTCTCAATTGGGGAATGCCAAACCCCAACTAGCCCTTGTCTCTATGATTACAGCTACTCAGGTGGAGCAAGTGCAAAGGGATTATTCGCAATCGAGACTCTAACCGTAGGCTTAACAAACGGAAAAGAAAAACAACTTCACAATTCTATAATCGGCTGCACCGAATCAGTTCAAGGCAGAATCTTCGGCGGAGCCGACGGCGTCATTGGCTTAGGCACTAGCAGCTATTCTTTCACTTACAAAGCCGCCGAAAACGCCAACGGCGGCGGCTTCGCCTACTGCCTTGTCGACCATCTCAGCGACCGTACCGCCACCAGCTACTTCATCCTCGGCAACCCTATCTCTTCCACTGACTCTGCCGCCGCCGCCTCCTCCGTCGCCCCTACCGGCAACATGTCCTTCACTAAACTCTTCCTCGGCGACCCTTACAGCAGCTTCTACGGCGTCGATCTCGTCGGAATCTCCGCAGACGGCGTCATGCTCAACATCCCTCCCCGTGTTTGGGACATCAATTCCGGCGGCGGAACCATCGTCGACTCCGGAACTAGTCTCACCATGCTGGCGGCGCCGGCGTTTGACATGGTCATGGAAGCTCTGGTTCCGAAGCTGAAGCATTTCGAGAATATTGAAATTGAACCTTTCGATTTTTGCTTCAATAATAGCCGATATACCCATGAAATGGCTCCGAAGCTCCGATTCCATTTCGGCGACGGCACGGTGTTCCAGCCGCCGCCGAAAAGCTACATTGTGTCGGTGGGTGAATATATCAGCTGTATTGGTTTCGTTTCTATGCCTTTTCCGGCGACCAATATCATTGGGAATATTCTTCAGCAAAATCACCTTTGGAAATTTGATTTCCATGCAGGAACAGTCGGTTTTGCCCCTTCTGAATGCGTCTAAGGAACTTCATCATCTTCTTTCTTCCTTTCGTTTCTTGATTTTCAATTTTATATAATTATTATTATTGTTTTTTCTTTTCTCTTTCTGTAATACCTGTTATTATTAATTATATTATACAATGTGAGCAATCTCTTTTGTCCTTTTTATTTTATT
TTCTCTTTATTTTTTCTCAATTGGAATAAAATTGGAGAGTGGAGATTCCTTTTAAA

Coding sequence (CDS)

ATGTTAGGTTACAGGAAGCCAATGTCGCCCATTTCACATTTTTGTTTGTTCTTCCTCTTTTTCTTCCTCTCTGTTCCCATCGCATTCGGCGACGGCAGCCATGATCAAGAAAATGTAAAACTGGATCTACTTCACCGCCACCATCCACAAGTCTCCGAGAAGCTTCACGGCGATATAAAACTTGAAAATATGAACGATCGAATCAAAGATATTCTCGAGCACGATCAAAAGCGTTACCAAACGATCTCCTCGTCGTTGAATCGGAATGAACTTGATGAGCAATTAAGGAAGGAGGCGGCGGAGTTGGCGGAGAAGGATCTAAAACTTCCACCGATATCGTCAACACCAATAGGGTTGAAAATGATATCGGGTTCTGACTATGGAAGTAGTGAGTATTTTGTTCAATTGAAAGTCGGAACGCCGCCGCAGACGTTTATGTTGATCGCCGATACTGGAAGTGATTTGACGTGGATGAAATGTAGATATCGGCGGTGTATCGGAAATTGTAGCAGTAATCCGAATCATAAAACCCGAAATGAACGGAAAGTGAGATTTAGAAATGCGTTTTTGGCGAATTATTCGTCATCTTTTAAGACGATTGATTGCAGCTCGAAGATGTGTACGAATGATCTTGCTGATTTGTTCTCAATTGGGGAATGCCAAACCCCAACTAGCCCTTGTCTCTATGATTACAGCTACTCAGGTGGAGCAAGTGCAAAGGGATTATTCGCAATCGAGACTCTAACCGTAGGCTTAACAAACGGAAAAGAAAAACAACTTCACAATTCTATAATCGGCTGCACCGAATCAGTTCAAGGCAGAATCTTCGGCGGAGCCGACGGCGTCATTGGCTTAGGCACTAGCAGCTATTCTTTCACTTACAAAGCCGCCGAAAACGCCAACGGCGGCGGCTTCGCCTACTGCCTTGTCGACCATCTCAGCGACCGTACCGCCACCAGCTACTTCATCCTCGGCAACCCTATCTCTTCCACTGACTCTGCCGCCGCCGCCTCCTCCGTCGCCCCTACCGGCAACATGTCCTTCACTAAACTCTTCCTCGGCGACCCTTACAGCAGCTTCTACGGCGTCGATCTCGTCGGAATCTCCGCAGACGGCGTCATGCTCAACATCCCTCCCCGTGTTTGGGACATCAATTCCGGCGGCGGAACCATCGTCGACTCCGGAACTAGTCTCACCATGCTGGCGGCGCCGGCGTTTGACATGGTCATGGAAGCTCTGGTTCCGAAGCTGAAGCATTTCGAGAATATTGAAATTGAACCTTTCGATTTTTGCTTCAATAATAGCCGATATACCCATGAAATGGCTCCGAAGCTCCGATTCCATTTCGGCGACGGCACGGTGTTCCAGCCGCCGCCGAAAAGCTACATTGTGTCGGTGGGTGAATATATCAGCTGTATTGGTTTCGTTTCTATGCCTTTTCCGGCGACCAATATCATTGGGAATATTCTTCAGCAAAATCACCTTTGGAAATTTGATTTCCATGCAGGAACAGTCGGTTTTGCCCCTTCTGAATGCGTCTAA

Protein sequence

MLGYRKPMSPISHFCLFFLFFFLSVPIAFGDGSHDQENVKLDLLHRHHPQVSEKLHGDIKLENMNDRIKDILEHDQKRYQTISSSLNRNELDEQLRKEAAELAEKDLKLPPISSTPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIADTGSDLTWMKCRYRRCIGNCSSNPNHKTRNERKVRFRNAFLANYSSSFKTIDCSSKMCTNDLADLFSIGECQTPTSPCLYDYSYSGGASAKGLFAIETLTVGLTNGKEKQLHNSIIGCTESVQGRIFGGADGVIGLGTSSYSFTYKAAENANGGGFAYCLVDHLSDRTATSYFILGNPISSTDSAAAASSVAPTGNMSFTKLFLGDPYSSFYGVDLVGISADGVMLNIPPRVWDINSGGGTIVDSGTSLTMLAAPAFDMVMEALVPKLKHFENIEIEPFDFCFNNSRYTHEMAPKLRFHFGDGTVFQPPPKSYIVSVGEYISCIGFVSMPFPATNIIGNILQQNHLWKFDFHAGTVGFAPSECV
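
The protein sequence above corresponds to a translation of the CDS. As a minimal sketch of that relationship, assuming the standard nuclear genetic code (the page does not name the translation table), the hypothetical helper `translate` below converts the first and last few codons of the CDS into amino acids.

```python
# Standard genetic code, with codons ordered by base T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[pos:pos + 3].upper()]
        if residue == "*":  # stop codon: end of the polypeptide
            break
        protein.append(residue)
    return "".join(protein)

# First ten codons and last four codons of the CDS listed above.
print(translate("ATGTTAGGTTACAGGAAGCCAATGTCGCCC"))
print(translate("GAATGCGTCTAA"))
```

The two fragments translate to `MLGYRKPMSP` and `ECV`, matching the start and end of the listed protein sequence.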
Homology
BLAST of Bhi10G000717 vs. TAIR 10
Match: AT3G12700.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 339.0 bits (868), Expect = 6.8e-93
Identity = 194/501 (38.72%), Positives = 282/501 (56.29%), Query Frame = 0

Query: 15  CLFFLFFFLSVPIAFGDGSHDQENVKLDLLHRHHPQVSEKLHGDIKLENMNDRIKDILEH 74
           CL      ++V  +  D S     V+L L HR           D  L     RI+D++  
Sbjct: 30  CLITTLLLITVADSMKDTS-----VRLKLAHR-----------DTLLPKPLSRIEDVIGA 89

Query: 75  DQKRYQTISSSLNRNELDEQLRKEAAELAEKDLKLPPISSTPIGLKMISGSDYGSSEYFV 134
           DQKR+  IS   N                         S+  + + + SG DYG+++YF 
Sbjct: 90  DQKRHSLISRKRN-------------------------STVGVKMDLGSGIDYGTAQYFT 149

Query: 135 QLKVGTPPQTFMLIADTGSDLTWMKCRYRRCIGNCSSNPNHKTRNERKVRFRNAFLANYS 194
           +++VGTP + F ++ DTGS+LTW+ CRYR            + ++ R+V     F A+ S
Sbjct: 150 EIRVGTPAKKFRVVVDTGSELTWVNCRYRA-----------RGKDNRRV-----FRADES 209

Query: 195 SSFKTIDCSSKMCTNDLADLFSIGECQTPTSPCLYDYSYSGGASAKGLFAIETLTVGLTN 254
            SFKT+ C ++ C  DL +LFS+  C TP++PC YDY Y+ G++A+G+FA ET+TVGLTN
Sbjct: 210 KSFKTVGCLTQTCKVDLMNLFSLTTCPTPSTPCSYDYRYADGSAAQGVFAKETITVGLTN 269

Query: 255 GKEKQLHNSIIGCTESVQGRIFGGADGVIGLGTSSYSFTYKAAENANGGGFAYCLVDHLS 314
           G+  +L   +IGC+ S  G+ F GADGV+GL  S +SFT   A +  G  F+YCLVDHLS
Sbjct: 270 GRMARLPGHLIGCSSSFTGQSFQGADGVLGLAFSDFSFT-STATSLYGAKFSYCLVDHLS 329

Query: 315 DRTATSYFILGNPISSTDSAAAASSVAPTGNMSFTKLFLGDPYSSFYGVDLVGISADGVM 374
           ++  ++Y I G+      S +  ++   T  +  T++        FY ++++GIS    M
Sbjct: 330 NKNVSNYLIFGS------SRSTKTAFRRTTPLDLTRI------PPFYAINVIGISLGYDM 389

Query: 375 LNIPPRVWDINSGGGTIVDSGTSLTMLAAPAFDMVMEALVPKLKHFENIEIE--PFDFCF 434
           L+IP +VWD  SGGGTI+DSGTSLT+LA  A+  V+  L   L   + ++ E  P ++CF
Sbjct: 390 LDIPSQVWDATSGGGTILDSGTSLTLLADAAYKQVVTGLARYLVELKRVKPEGVPIEYCF 449

Query: 435 N-NSRYTHEMAPKLRFHFGDGTVFQPPPKSYIVSVGEYISCIGFVSMPFPATNIIGNILQ 494
           +  S +     P+L FH   G  F+P  KSY+V     + C+GFVS   PATN+IGNI+Q
Sbjct: 450 SFTSGFNVSKLPQLTFHLKGGARFEPHRKSYLVDAAPGVKCLGFVSAGTPATNVIGNIMQ 460

Query: 495 QNHLWKFDFHAGTVGFAPSEC 513
           QN+LW+FD  A T+ FAPS C
Sbjct: 510 QNYLWEFDLMASTLSFAPSAC 460
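
The percentages in the HSP header of the alignment above are the identical and positive-scoring column counts divided by the alignment length. The sketch below recomputes them from the header text; the function name `hsp_percentages` is hypothetical, and the parsing is a simplification that assumes the two fractions appear in the order identity, then positives.

```python
import re

def hsp_percentages(header):
    """Return (identity %, positives %) recomputed from 'n/len' fractions."""
    fractions = re.findall(r"(\d+)/(\d+)", header)
    if len(fractions) < 2:
        raise ValueError("expected identity and positives fractions")
    # Each fraction is (matching columns, alignment length including gaps).
    return tuple(round(100.0 * int(n) / int(d), 2) for n, d in fractions[:2])

header = "Identity = 194/501 (38.72%), Positives = 282/501 (56.29%)"
print(hsp_percentages(header))
```

For the AT3G12700.1 hit this gives 38.72% identity and 56.29% positives over 501 aligned columns, in agreement with the reported values.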

BLAST of Bhi10G000717 vs. TAIR 10
Match: AT3G25700.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 225.7 bits (574), Expect = 8.4e-59
Identity = 147/409 (35.94%), Positives = 210/409 (51.34%), Query Frame = 0

Query: 121 MISGSDYGSSEYFVQLKVGTPPQTFMLIADTGSDLTWMKCRYRRCIGNCSSNPNHKTRNE 180
           ++SG+  GS +YFV L++G PPQ+ +LIADTGSDL W+KC   R   NCS   +H     
Sbjct: 73  VVSGAASGSGQYFVDLRIGQPPQSLLLIADTGSDLVWVKCSACR---NCS---HHSP--- 132

Query: 181 RKVRFRNAFLANYSSSFKTIDCSSKMC-TNDLADLFSIGECQTPTSPCLYDYSYSGGASA 240
                   F   +SS+F    C   +C      D   I       S C Y+Y Y+ G+  
Sbjct: 133 -----ATVFFPRHSSTFSPAHCYDPVCRLVPKPDRAPICNHTRIHSTCHYEYGYADGSLT 192

Query: 241 KGLFAIETLTVGLTNGKEKQLHNSIIGC-----TESVQGRIFGGADGVIGLGTSSYSFTY 300
            GLFA ET ++  ++GKE +L +   GC      +SV G  F GA+GV+GLG    SF  
Sbjct: 193 SGLFARETTSLKTSSGKEARLKSVAFGCGFRISGQSVSGTSFNGANGVMGLGRGPISFAS 252

Query: 301 KAAENANGGGFAYCLVDHLSDRTATSYFILGNPISSTDSAAAASSVAPTGNMSFTKLF-- 360
           +      G  F+YCL+D+      TSY I+GN                 G    +KLF  
Sbjct: 253 QLGRRF-GNKFSYCLMDYTLSPPPTSYLIIGN-----------------GGDGISKLFFT 312

Query: 361 --LGDPYS-SFYGVDLVGISADGVMLNIPPRVWDI--NSGGGTIVDSGTSLTMLAAPAFD 420
             L +P S +FY V L  +  +G  L I P +W+I  +  GGT+VDSGT+L  LA PA+ 
Sbjct: 313 PLLTNPLSPTFYYVKLKSVFVNGAKLRIDPSIWEIDDSGNGGTVVDSGTTLAFLAEPAYR 372

Query: 421 MVMEALVPKLKHFENIEIEP-FDFCFNNSRYT--HEMAPKLRFHFGDGTVFQPPPKSYIV 480
            V+ A+  ++K      + P FD C N S  T   ++ P+L+F F  G VF PPP++Y +
Sbjct: 373 SVIAAVRRRVKLPIADALTPGFDLCVNVSGVTKPEKILPRLKFEFSGGAVFVPPPRNYFI 432

Query: 481 SVGEYISCIGFVSM-PFPATNIIGNILQQNHLWKFDFHAGTVGFAPSEC 513
              E I C+   S+ P    ++IGN++QQ  L++FD     +GF+   C
Sbjct: 433 ETEEQIQCLAIQSVDPKVGFSVIGNLMQQGFLFEFDRDRSRLGFSRRGC 449

BLAST of Bhi10G000717 vs. TAIR 10
Match: AT3G59080.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 191.8 bits (486), Expect = 1.3e-48
Identity = 149/486 (30.66%), Positives = 231/486 (47.53%), Query Frame = 0

Query: 44  LHRHHPQVSEKLHGDIKLENMNDRIKDILEHDQKRYQTISSSLNRNELDEQLRKEAAELA 103
           L R     +EK   +  LE    +I+D L   Q  ++ +    N+N + ++ +K   E+ 
Sbjct: 84  LKRRETTTTEKATTNSVLEL---QIRD-LTRIQTLHKRVLEKNNQNTVSQKQKKNDKEV- 143

Query: 104 EKDLKLPPISST------PIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIADTGSDLTW 163
              +   P++S+       +   + SG   GS EYF+ + VG+PP+ F LI DTGSDL W
Sbjct: 144 ---VTTTPVASSVEEQAGQLVATLESGMTLGSGEYFMDVLVGSPPKHFSLILDTGSDLNW 203

Query: 164 MKCRYRRCIGNCSSNPNHKTRNERKVRFRNAFL-ANYSSSFKTIDCSSKMCTNDLADLFS 223
           ++C    C      N               AF     S+S+K I C+ + C N ++    
Sbjct: 204 IQC--LPCYDCFQQN--------------GAFYDPKASASYKNITCNDQRC-NLVSSPDP 263

Query: 224 IGECQTPTSPCLYDYSYSGGASAKGLFAIETLTVGL-TNGKEKQLH---NSIIGCTESVQ 283
              C++    C Y Y Y   ++  G FA+ET TV L TNG   +L+   N + GC    +
Sbjct: 264 PMPCKSDNQSCPYYYWYGDSSNTTGDFAVETFTVNLTTNGGSSELYNVENMMFGCGHWNR 323

Query: 284 GRIFGGADGVIGLGTSSYSFTYKAAENANGGGFAYCLVDHLSDRTATSYFILGNPISSTD 343
           G +F GA G++GLG    SF+    ++  G  F+YCLVD  SD   +S  I G       
Sbjct: 324 G-LFHGAAGLLGLGRGPLSFS-SQLQSLYGHSFSYCLVDRNSDTNVSSKLIFGED----- 383

Query: 344 SAAAASSVAPTGNMSFTKLFLG--DPYSSFYGVDLVGISADGVMLNIPPRVWDINS--GG 403
                  +    N++FT    G  +   +FY V +  I   G +LNIP   W+I+S   G
Sbjct: 384 -----KDLLSHPNLNFTSFVAGKENLVDTFYYVQIKSILVAGEVLNIPEETWNISSDGAG 443

Query: 404 GTIVDSGTSLTMLAAPAFDMVMEALVPKLKHFENI--EIEPFDFCFNNSRYTHEMAPKLR 463
           GTI+DSGT+L+  A PA++ +   +  K K    +  +    D CFN S   +   P+L 
Sbjct: 444 GTIIDSGTTLSYFAEPAYEFIKNKIAEKAKGKYPVYRDFPILDPCFNVSGIHNVQLPELG 503

Query: 464 FHFGDGTVFQPPPKSYIVSVGEYISCIGFVSMPFPATNIIGNILQQNHLWKFDFHAGTVG 513
             F DG V+  P ++  + + E + C+  +  P  A +IIGN  QQN    +D     +G
Sbjct: 504 IAFADGAVWNFPTENSFIWLNEDLVCLAMLGTPKSAFSIIGNYQQQNFHILYDTKRSRLG 532

BLAST of Bhi10G000717 vs. TAIR 10
Match: AT2G42980.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 191.4 bits (485), Expect = 1.8e-48
Identity = 144/484 (29.75%), Positives = 229/484 (47.31%), Query Frame = 0

Query: 46  RHHPQVSEKLHGDIKLE--NMNDRIKDILEHDQKRYQTISSSLNRNELDEQLRKEAAELA 105
           + H + S K    IK E       + D+   D  R +T+ +  N+++  +Q  ++  +  
Sbjct: 71  KEHTRESVKPQSRIKQETKRTTHSVVDLQIQDLTRIKTLHARFNKSK--KQKNEKVRKKI 130

Query: 106 EKDLKL---PPISSTPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIADTGSDLTWMKC 165
             D+ L   P +S   +   + SG   GS EYF+ + VGTPP+ F LI DTGSDL W++C
Sbjct: 131 TSDISLVGAPEVSPGKLIATLESGMTLGSGEYFMDVLVGTPPKHFSLILDTGSDLNWLQC 190

Query: 166 RYRRCIGNCSSNPNHKTRNERKVRFRNAFLANYSSSFKTIDCSSKMCTNDLADLFSIGEC 225
                       P +   ++  + +        S+SFK I C+   C+  ++      +C
Sbjct: 191 L-----------PCYDCFHQNGMFYD----PKTSASFKNITCNDPRCSL-ISSPDPPVQC 250

Query: 226 QTPTSPCLYDYSYSGGASAKGLFAIETLTVGLT----NGKEKQLHNSIIGCTESVQGRIF 285
           ++    C Y Y Y   ++  G FA+ET TV LT       E ++ N + GC    +G +F
Sbjct: 251 ESDNQSCPYFYWYGDRSNTTGDFAVETFTVNLTTTEGGSSEYKVGNMMFGCGHWNRG-LF 310

Query: 286 GGADGVIGLGTSSYSFTYKAAENANGGGFAYCLVDHLSDRTATSYFILGNPISSTDSAAA 345
            GA G++GLG    SF+    ++  G  F+YCLVD  S+   +S  I G           
Sbjct: 311 SGASGLLGLGRGPLSFS-SQLQSLYGHSFSYCLVDRNSNTNVSSKLIFGED--------- 370

Query: 346 ASSVAPTGNMSFTKLFLGDPYS--SFYGVDLVGISADGVMLNIPPRVWDINS--GGGTIV 405
              +    N++FT    G   S  +FY + +  I   G  L+IP   W+I+S   GGTI+
Sbjct: 371 -KDLLNHTNLNFTSFVNGKENSVETFYYIQIKSILVGGKALDIPEETWNISSDGDGGTII 430

Query: 406 DSGTSLTMLAAPAFDMVMEALVPKLKHFENI--EIEPFDFCFNNS--RYTHEMAPKLRFH 465
           DSGT+L+  A PA++++      K+K    I  +    D CFN S     +   P+L   
Sbjct: 431 DSGTTLSYFAEPAYEIIKNKFAEKMKENYPIFRDFPVLDPCFNVSGIEENNIHLPELGIA 490

Query: 466 FGDGTVFQPPPKSYIVSVGEYISCIGFVSMPFPATNIIGNILQQNHLWKFDFHAGTVGFA 513
           F DGTV+  P ++  + + E + C+  +  P    +IIGN  QQN    +D     +GF 
Sbjct: 491 FVDGTVWNFPAENSFIWLSEDLVCLAILGTPKSTFSIIGNYQQQNFHILYDTKRSRLGFT 524

BLAST of Bhi10G000717 vs. TAIR 10
Match: AT3G59080.2 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 174.9 bits (442), Expect = 1.7e-43
Identity = 140/485 (28.87%), Positives = 214/485 (44.12%), Query Frame = 0

Query: 44  LHRHHPQVSEKLHGDIKLENMNDRIKDILEHDQKRYQTISSSLNRNELDEQLRKEAAELA 103
           L R     +EK   +  LE    +I+D L   Q  ++ +    N+N + ++ +K   E+ 
Sbjct: 84  LKRRETTTTEKATTNSVLEL---QIRD-LTRIQTLHKRVLEKNNQNTVSQKQKKNDKEV- 143

Query: 104 EKDLKLPPISST------PIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIADTGSDLTW 163
              +   P++S+       +   + SG   GS EYF+ + VG+PP+ F LI DTGSDL W
Sbjct: 144 ---VTTTPVASSVEEQAGQLVATLESGMTLGSGEYFMDVLVGSPPKHFSLILDTGSDLNW 203

Query: 164 MKCRYRRCIGNCSSNPNHKTRNERKVRFRNAFLANYSSSFKTIDCSSKMCTNDLADLFSI 223
           ++C                                        DC               
Sbjct: 204 IQC------------------------------------LPCYDCFQ------------- 263

Query: 224 GECQTPTSPCLYDYSYSGGASAKGLFAIETLTVGL-TNGKEKQLH---NSIIGCTESVQG 283
              Q     C Y Y Y   ++  G FA+ET TV L TNG   +L+   N + GC    +G
Sbjct: 264 ---QNDNQSCPYYYWYGDSSNTTGDFAVETFTVNLTTNGGSSELYNVENMMFGCGHWNRG 323

Query: 284 RIFGGADGVIGLGTSSYSFTYKAAENANGGGFAYCLVDHLSDRTATSYFILGNPISSTDS 343
            +F GA G++GLG    SF+    ++  G  F+YCLVD  SD   +S  I G        
Sbjct: 324 -LFHGAAGLLGLGRGPLSFS-SQLQSLYGHSFSYCLVDRNSDTNVSSKLIFGED------ 383

Query: 344 AAAASSVAPTGNMSFTKLFLG--DPYSSFYGVDLVGISADGVMLNIPPRVWDINS--GGG 403
                 +    N++FT    G  +   +FY V +  I   G +LNIP   W+I+S   GG
Sbjct: 384 ----KDLLSHPNLNFTSFVAGKENLVDTFYYVQIKSILVAGEVLNIPEETWNISSDGAGG 443

Query: 404 TIVDSGTSLTMLAAPAFDMVMEALVPKLKHFENI--EIEPFDFCFNNSRYTHEMAPKLRF 463
           TI+DSGT+L+  A PA++ +   +  K K    +  +    D CFN S   +   P+L  
Sbjct: 444 TIIDSGTTLSYFAEPAYEFIKNKIAEKAKGKYPVYRDFPILDPCFNVSGIHNVQLPELGI 496

Query: 464 HFGDGTVFQPPPKSYIVSVGEYISCIGFVSMPFPATNIIGNILQQNHLWKFDFHAGTVGF 513
            F DG V+  P ++  + + E + C+  +  P  A +IIGN  QQN    +D     +G+
Sbjct: 504 AFADGAVWNFPTENSFIWLNEDLVCLAMLGTPKSAFSIIGNYQQQNFHILYDTKRSRLGY 496

BLAST of Bhi10G000717 vs. ExPASy Swiss-Prot
Match: Q9LTW4 (Aspartic proteinase NANA, chloroplast OS=Arabidopsis thaliana OX=3702 GN=NANA PE=1 SV=1)

HSP 1 Score: 339.0 bits (868), Expect = 9.6e-92
Identity = 194/501 (38.72%), Positives = 282/501 (56.29%), Query Frame = 0

Query: 15  CLFFLFFFLSVPIAFGDGSHDQENVKLDLLHRHHPQVSEKLHGDIKLENMNDRIKDILEH 74
           CL      ++V  +  D S     V+L L HR           D  L     RI+D++  
Sbjct: 30  CLITTLLLITVADSMKDTS-----VRLKLAHR-----------DTLLPKPLSRIEDVIGA 89

Query: 75  DQKRYQTISSSLNRNELDEQLRKEAAELAEKDLKLPPISSTPIGLKMISGSDYGSSEYFV 134
           DQKR+  IS   N                         S+  + + + SG DYG+++YF 
Sbjct: 90  DQKRHSLISRKRN-------------------------STVGVKMDLGSGIDYGTAQYFT 149

Query: 135 QLKVGTPPQTFMLIADTGSDLTWMKCRYRRCIGNCSSNPNHKTRNERKVRFRNAFLANYS 194
           +++VGTP + F ++ DTGS+LTW+ CRYR            + ++ R+V     F A+ S
Sbjct: 150 EIRVGTPAKKFRVVVDTGSELTWVNCRYRA-----------RGKDNRRV-----FRADES 209

Query: 195 SSFKTIDCSSKMCTNDLADLFSIGECQTPTSPCLYDYSYSGGASAKGLFAIETLTVGLTN 254
            SFKT+ C ++ C  DL +LFS+  C TP++PC YDY Y+ G++A+G+FA ET+TVGLTN
Sbjct: 210 KSFKTVGCLTQTCKVDLMNLFSLTTCPTPSTPCSYDYRYADGSAAQGVFAKETITVGLTN 269

Query: 255 GKEKQLHNSIIGCTESVQGRIFGGADGVIGLGTSSYSFTYKAAENANGGGFAYCLVDHLS 314
           G+  +L   +IGC+ S  G+ F GADGV+GL  S +SFT   A +  G  F+YCLVDHLS
Sbjct: 270 GRMARLPGHLIGCSSSFTGQSFQGADGVLGLAFSDFSFT-STATSLYGAKFSYCLVDHLS 329

Query: 315 DRTATSYFILGNPISSTDSAAAASSVAPTGNMSFTKLFLGDPYSSFYGVDLVGISADGVM 374
           ++  ++Y I G+      S +  ++   T  +  T++        FY ++++GIS    M
Sbjct: 330 NKNVSNYLIFGS------SRSTKTAFRRTTPLDLTRI------PPFYAINVIGISLGYDM 389

Query: 375 LNIPPRVWDINSGGGTIVDSGTSLTMLAAPAFDMVMEALVPKLKHFENIEIE--PFDFCF 434
           L+IP +VWD  SGGGTI+DSGTSLT+LA  A+  V+  L   L   + ++ E  P ++CF
Sbjct: 390 LDIPSQVWDATSGGGTILDSGTSLTLLADAAYKQVVTGLARYLVELKRVKPEGVPIEYCF 449

Query: 435 N-NSRYTHEMAPKLRFHFGDGTVFQPPPKSYIVSVGEYISCIGFVSMPFPATNIIGNILQ 494
           +  S +     P+L FH   G  F+P  KSY+V     + C+GFVS   PATN+IGNI+Q
Sbjct: 450 SFTSGFNVSKLPQLTFHLKGGARFEPHRKSYLVDAAPGVKCLGFVSAGTPATNVIGNIMQ 460

Query: 495 QNHLWKFDFHAGTVGFAPSEC 513
           QN+LW+FD  A T+ FAPS C
Sbjct: 510 QNYLWEFDLMASTLSFAPSAC 460

BLAST of Bhi10G000717 vs. ExPASy Swiss-Prot
Match: Q9LNJ3 (Aspartyl protease family protein 2 OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 SV=1)

HSP 1 Score: 158.3 bits (399), Expect = 2.3e-37
Identity = 131/400 (32.75%), Positives = 183/400 (45.75%), Query Frame = 0

Query: 121 MISGSDYGSSEYFVQLKVGTPPQTFMLIADTGSDLTWMKCR-YRRCIGNCSSNPNHKTRN 180
           ++SG   GS EYF +L VGTP +   ++ DTGSD+ W++C   RRC     S+P    R 
Sbjct: 131 VVSGLSQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYS--QSDPIFDPRK 190

Query: 181 ERKVRFRNAFLANYSSSFKTIDCSSKMCTNDLADLFSIGECQTPTSPCLYDYSYSGGASA 240
                         S ++ TI CSS  C      L S G C T    CLY  SY  G+  
Sbjct: 191 --------------SKTYATIPCSSPHCRR----LDSAG-CNTRRKTCLYQVSYGDGSFT 250

Query: 241 KGLFAIETLTVGLTNGKEKQLHNSIIGCTESVQGRIFGGADGVIGLGTSSYSFTYKAAEN 300
            G F+ ETLT      +  ++    +GC    +G +F GA G++GLG    SF  +    
Sbjct: 251 VGDFSTETLTF-----RRNRVKGVALGCGHDNEG-LFVGAAGLLGLGKGKLSFPGQTGHR 310

Query: 301 ANGGGFAYCLVDHLSDRTATSYFILGNPISSTDSAAAASSVAPTGNMSFTKLFLGDPYSS 360
            N   F+YCLV    DR+A+S      P S     AA S +A      FT L       +
Sbjct: 311 FN-QKFSYCLV----DRSASS-----KPSSVVFGNAAVSRIA-----RFTPLLSNPKLDT 370

Query: 361 FYGVDLVGISADGVML-NIPPRVWDIN--SGGGTIVDSGTSLTMLAAPAFDMVMEALVPK 420
           FY V L+GIS  G  +  +   ++ ++    GG I+DSGTS+T L  PA+  + +A    
Sbjct: 371 FYYVGLLGISVGGTRVPGVTASLFKLDQIGNGGVIIDSGTSVTRLIRPAYIAMRDAFRVG 430

Query: 421 LKHFENI-EIEPFDFCFNNSRYTHEMAPKLRFHFGDGTVFQPPPKSYIVSV---GEYISC 480
            K  +   +   FD CF+ S       P +  HF  G     P  +Y++ V   G++  C
Sbjct: 431 AKTLKRAPDFSLFDTCFDLSNMNEVKVPTVVLHF-RGADVSLPATNYLIPVDTNGKF--C 484

Query: 481 IGFVSMPFPATNIIGNILQQNHLWKFDFHAGTVGFAPSEC 513
             F        +IIGNI QQ     +D  +  VGFAP  C
Sbjct: 491 FAFAG-TMGGLSIIGNIQQQGFRVVYDLASSRVGFAPGGC 484

BLAST of Bhi10G000717 vs. ExPASy Swiss-Prot
Match: Q9LHE3 (Protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Arabidopsis thaliana OX=3702 GN=ASPG2 PE=2 SV=1)

HSP 1 Score: 158.3 bits (399), Expect = 2.3e-37
Identity = 127/450 (28.22%), Positives = 203/450 (45.11%), Query Frame = 0

Query: 78  RYQTISSSLNRNELDEQLRKE----AAELAEKDLKLPPISST-----PIGLKMISGSDYG 137
           R+ +++   + + L  ++R++    +A L     K+ P S +       G  ++SG D G
Sbjct: 68  RFPSVTYRNHHHRLHARMRRDTDRVSAILRRISGKVIPSSDSRYEVNDFGSDIVSGMDQG 127

Query: 138 SSEYFVQLKVGTPPQTFMLIADTGSDLTWMKCRYRRCIGNCSSNPNHKTRNERKVRFRNA 197
           S EYFV++ VG+PP+   ++ D+GSD+ W++C+  +      S+P               
Sbjct: 128 SGEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQPCKLCYK-QSDP--------------V 187

Query: 198 FLANYSSSFKTIDCSSKMCTNDLADLFSIGECQTPTSPCLYDYSYSGGASAKGLFAIETL 257
           F    S S+  + C S +C     D      C   +  C Y+  Y  G+  KG  A+ETL
Sbjct: 188 FDPAKSGSYTGVSCGSSVC-----DRIENSGCH--SGGCRYEVMYGDGSYTKGTLALETL 247

Query: 258 TVGLTNGKEKQLHNSIIGCTESVQGRIFGGADGVIGLGTSSYSFTYKAAENANGGGFAYC 317
           T   T      + N  +GC    +G +F GA G++G+G  S SF  + +    GG F YC
Sbjct: 248 TFAKT-----VVRNVAMGCGHRNRG-MFIGAAGLLGIGGGSMSFVGQLS-GQTGGAFGYC 307

Query: 318 LVDHLSDRTATSYFILGNPISSTDSAAAASSVAPTGNMSFTKLFLGDPYSSFYGVDLVGI 377
           LV   +D              ST S        P G  S+  L       SFY V L G+
Sbjct: 308 LVSRGTD--------------STGSLVFGREALPVG-ASWVPLVRNPRAPSFYYVGLKGL 367

Query: 378 SADGVMLNIPPRVWDI--NSGGGTIVDSGTSLTMLAAPAFDMVMEALVPKLKHFENIE-I 437
              GV + +P  V+D+     GG ++D+GT++T L   A+    +    +  +      +
Sbjct: 368 GVGGVRIPLPDGVFDLTETGDGGVVMDTGTAVTRLPTAAYVAFRDGFKSQTANLPRASGV 427

Query: 438 EPFDFCFNNSRYTHEMAPKLRFHFGDGTVFQPPPKSYIVSV---GEYISCIGFVSMPFPA 497
             FD C++ S +     P + F+F +G V   P +++++ V   G Y  C  F + P   
Sbjct: 428 SIFDTCYDLSGFVSVRVPTVSFYFTEGPVLTLPARNFLMPVDDSGTY--CFAFAASP-TG 470

Query: 498 TNIIGNILQQNHLWKFDFHAGTVGFAPSEC 513
            +IIGNI Q+     FD   G VGF P+ C
Sbjct: 488 LSIIGNIQQEGIQVSFDGANGFVGFGPNVC 470

BLAST of Bhi10G000717 vs. ExPASy Swiss-Prot
Match: Q9LS40 (Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana OX=3702 GN=ASPG1 PE=1 SV=1)

HSP 1 Score: 154.8 bits (390), Expect = 2.6e-36
Identity = 113/399 (28.32%), Positives = 187/399 (46.87%), Query Frame = 0

Query: 121 MISGSDYGSSEYFVQLKVGTPPQTFMLIADTGSDLTWMKCRYRRCIGNC--SSNPNHKTR 180
           ++SG+  GS EYF ++ VGTP +   L+ DTGSD+ W++C       +C   S+P     
Sbjct: 151 VVSGASQGSGEYFSRIGVGTPAKEMYLVLDTGSDVNWIQC---EPCADCYQQSDP----- 210

Query: 181 NERKVRFRNAFLANYSSSFKTIDCSSKMCTNDLADLFSIGECQTPTSPCLYDYSYSGGAS 240
                     F    SS++K++ CS+  C+     L     C+  ++ CLY  SY  G+ 
Sbjct: 211 ---------VFNPTSSSTYKSLTCSAPQCS-----LLETSACR--SNKCLYQVSYGDGSF 270

Query: 241 AKGLFAIETLTVGLTNGKEKQLHNSIIGCTESVQGRIFGGADGVIGLGTSSYSFTYKAAE 300
             G  A +T+T G  +GK   ++N  +GC    +G +F GA G++GLG    S T     
Sbjct: 271 TVGELATDTVTFG-NSGK---INNVALGCGHDNEG-LFTGAAGLLGLGGGVLSIT----N 330

Query: 301 NANGGGFAYCLVDHLSDRTATSYFILGNPISSTDSAAAASSVAPTGNMSFTKLFLGDPYS 360
                 F+YCLVD  S ++++  F               +SV   G  +   L       
Sbjct: 331 QMKATSFSYCLVDRDSGKSSSLDF---------------NSVQLGGGDATAPLLRNKKID 390

Query: 361 SFYGVDLVGISADGVMLNIPPRVWDINS--GGGTIVDSGTSLTMLAAPAFDMVMEALVPK 420
           +FY V L G S  G  + +P  ++D+++   GG I+D GT++T L   A++ + +A +  
Sbjct: 391 TFYYVGLSGFSVGGEKVVLPDAIFDVDASGSGGVILDCGTAVTRLQTQAYNSLRDAFLKL 450

Query: 421 LKHFE--NIEIEPFDFCFNNSRYTHEMAPKLRFHFGDGTVFQPPPKSYIVSVGEY-ISCI 480
             + +  +  I  FD C++ S  +    P + FHF  G     P K+Y++ V +    C 
Sbjct: 451 TVNLKKGSSSISLFDTCYDFSSLSTVKVPTVAFHFTGGKSLDLPAKNYLIPVDDSGTFCF 500

Query: 481 GFVSMPFPATNIIGNILQQNHLWKFDFHAGTVGFAPSEC 513
            F      + +IIGN+ QQ     +D     +G + ++C
Sbjct: 511 AFAPTS-SSLSIIGNVQQQGTRITYDLSKNVIGLSGNKC 500

BLAST of Bhi10G000717 vs. ExPASy Swiss-Prot
Match: Q6XBF8 (Aspartic proteinase CDR1 OS=Arabidopsis thaliana OX=3702 GN=CDR1 PE=1 SV=1)

HSP 1 Score: 142.1 bits (357), Expect = 1.7e-32
Identity = 111/385 (28.83%), Positives = 170/385 (44.16%), Query Frame = 0

Query: 129 SSEYFVQLKVGTPPQTFMLIADTGSDLTWMKCRYRRCIGNCSSNPNHKTRNERKVRFRNA 188
           S EY + + +GTPP   M IADTGSDL W +C        C         ++   +    
Sbjct: 87  SGEYLMNVSIGTPPFPIMAIADTGSDLLWTQC------APC---------DDCYTQVDPL 146

Query: 189 FLANYSSSFKTIDCSSKMCTNDLADLFSIGECQTPTSPCLYDYSYSGGASAKGLFAIETL 248
           F    SS++K + CSS  CT     L +   C T  + C Y  SY   +  KG  A++TL
Sbjct: 147 FDPKTSSTYKDVSCSSSQCT----ALENQASCSTNDNTCSYSLSYGDNSYTKGNIAVDTL 206

Query: 249 TVGLTNGKEKQLHNSIIGCTESVQGRIFGGADGVIGLGTSSYSFTYKAAENANGGGFAYC 308
           T+G ++ +  QL N IIGC  +  G       G++GLG    S   +  ++ + G F+YC
Sbjct: 207 TLGSSDTRPMQLKNIIIGCGHNNAGTFNKKGSGIVGLGGGPVSLIKQLGDSID-GKFSYC 266

Query: 309 LVDHLSDRTATSYFILGNPISSTDSAAAASSVAPTGNMSFTKLFLGDPYSSFYGVDLVGI 368
           LV   S +  TS    G     T++  + S V        T L       +FY + L  I
Sbjct: 267 LVPLTSKKDQTSKINFG-----TNAIVSGSGVVS------TPLIAKASQETFYYLTLKSI 326

Query: 369 SADGVMLNIPPRVWDINSGGGTIVDSGTSLTMLAAPAFDMVMEALVPKLKHFENIEIEP- 428
           S     +       + +S G  I+DSGT+LT+L    +  + +A+   +   +  + +  
Sbjct: 327 SVGSKQIQYSGSDSE-SSEGNIIIDSGTTLTLLPTEFYSELEDAVASSIDAEKKQDPQSG 386

Query: 429 FDFCFNNSRYTHEMAPKLRFHFGDGTVFQPPPKSYIVSVGEYISCIGFVSMPFPATNIIG 488
              C+  S       P +  HF DG   +    +  V V E + C  F     P+ +I G
Sbjct: 387 LSLCY--SATGDLKVPVITMHF-DGADVKLDSSNAFVQVSEDLVCFAFRGS--PSFSIYG 434

Query: 489 NILQQNHLWKFDFHAGTVGFAPSEC 513
           N+ Q N L  +D  + TV F P++C
Sbjct: 447 NVAQMNFLVGYDTVSKTVSFKPTDC 434

BLAST of Bhi10G000717 vs. NCBI nr
Match: XP_038901983.1 (aspartic proteinase NANA, chloroplast [Benincasa hispida])

HSP 1 Score: 1048.1 bits (2709), Expect = 2.4e-302
Identity = 513/513 (100.00%), Positives = 513/513 (100.00%), Query Frame = 0

Query: 1   MLGYRKPMSPISHFCLFFLFFFLSVPIAFGDGSHDQENVKLDLLHRHHPQVSEKLHGDIK 60
           MLGYRKPMSPISHFCLFFLFFFLSVPIAFGDGSHDQENVKLDLLHRHHPQVSEKLHGDIK
Sbjct: 1   MLGYRKPMSPISHFCLFFLFFFLSVPIAFGDGSHDQENVKLDLLHRHHPQVSEKLHGDIK 60

Query: 61  LENMNDRIKDILEHDQKRYQTISSSLNRNELDEQLRKEAAELAEKDLKLPPISSTPIGLK 120
           LENMNDRIKDILEHDQKRYQTISSSLNRNELDEQLRKEAAELAEKDLKLPPISSTPIGLK
Sbjct: 61  LENMNDRIKDILEHDQKRYQTISSSLNRNELDEQLRKEAAELAEKDLKLPPISSTPIGLK 120

Query: 121 MISGSDYGSSEYFVQLKVGTPPQTFMLIADTGSDLTWMKCRYRRCIGNCSSNPNHKTRNE 180
           MISGSDYGSSEYFVQLKVGTPPQTFMLIADTGSDLTWMKCRYRRCIGNCSSNPNHKTRNE
Sbjct: 121 MISGSDYGSSEYFVQLKVGTPPQTFMLIADTGSDLTWMKCRYRRCIGNCSSNPNHKTRNE 180

Query: 181 RKVRFRNAFLANYSSSFKTIDCSSKMCTNDLADLFSIGECQTPTSPCLYDYSYSGGASAK 240
           RKVRFRNAFLANYSSSFKTIDCSSKMCTNDLADLFSIGECQTPTSPCLYDYSYSGGASAK
Sbjct: 181 RKVRFRNAFLANYSSSFKTIDCSSKMCTNDLADLFSIGECQTPTSPCLYDYSYSGGASAK 240

Query: 241 GLFAIETLTVGLTNGKEKQLHNSIIGCTESVQGRIFGGADGVIGLGTSSYSFTYKAAENA 300
           GLFAIETLTVGLTNGKEKQLHNSIIGCTESVQGRIFGGADGVIGLGTSSYSFTYKAAENA
Sbjct: 241 GLFAIETLTVGLTNGKEKQLHNSIIGCTESVQGRIFGGADGVIGLGTSSYSFTYKAAENA 300

Query: 301 NGGGFAYCLVDHLSDRTATSYFILGNPISSTDSAAAASSVAPTGNMSFTKLFLGDPYSSF 360
           NGGGFAYCLVDHLSDRTATSYFILGNPISSTDSAAAASSVAPTGNMSFTKLFLGDPYSSF
Sbjct: 301 NGGGFAYCLVDHLSDRTATSYFILGNPISSTDSAAAASSVAPTGNMSFTKLFLGDPYSSF 360

Query: 361 YGVDLVGISADGVMLNIPPRVWDINSGGGTIVDSGTSLTMLAAPAFDMVMEALVPKLKHF 420
           YGVDLVGISADGVMLNIPPRVWDINSGGGTIVDSGTSLTMLAAPAFDMVMEALVPKLKHF
Sbjct: 361 YGVDLVGISADGVMLNIPPRVWDINSGGGTIVDSGTSLTMLAAPAFDMVMEALVPKLKHF 420

Query: 421 ENIEIEPFDFCFNNSRYTHEMAPKLRFHFGDGTVFQPPPKSYIVSVGEYISCIGFVSMPF 480
           ENIEIEPFDFCFNNSRYTHEMAPKLRFHFGDGTVFQPPPKSYIVSVGEYISCIGFVSMPF
Sbjct: 421 ENIEIEPFDFCFNNSRYTHEMAPKLRFHFGDGTVFQPPPKSYIVSVGEYISCIGFVSMPF 480

Query: 481 PATNIIGNILQQNHLWKFDFHAGTVGFAPSECV 514
           PATNIIGNILQQNHLWKFDFHAGTVGFAPSECV
Sbjct: 481 PATNIIGNILQQNHLWKFDFHAGTVGFAPSECV 513

BLAST of Bhi10G000717 vs. NCBI nr
Match: XP_004140022.2 (aspartic proteinase NANA, chloroplast [Cucumis sativus] >KGN46781.1 hypothetical protein Csa_021058 [Cucumis sativus])

HSP 1 Score: 797.0 bits (2057), Expect = 9.8e-227
Identity = 386/539 (71.61%), Positives = 454/539 (84.23%), Query Frame = 0

Query: 1   MLGYRKPMSPISHFCLFF----LFFFLSVPIAF-----------------GDGSHDQENV 60
           MLGYRKPMSPIS+FC FF    LFFFLS   +F                  D   +QE +
Sbjct: 1   MLGYRKPMSPISNFCFFFFFFLLFFFLSFSSSFLFALGDEDNNFNNNNNINDDEDEQEII 60

Query: 61  KLDLLHRHHPQVSEKLHGDIKLENMNDRIKDILEHDQKRYQTISSSLNRNEL-DEQLRKE 120
           K DLLHRHHPQV+EK+HGD+K++++++R+KDI EHD  R+++IS S+N+ ++ D +LR E
Sbjct: 61  KFDLLHRHHPQVAEKIHGDMKIQDVSERMKDIHEHDHNRHRSISKSMNQKQVEDARLRAE 120

Query: 121 AAELAEKDLK----LPPISSTPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIADTGSD 180
           A    E+++     LPP +STPIG++MISG+D+GSSEYFV+LKVGTP QTFMLIADTGSD
Sbjct: 121 AEAATEEEVAKSAILPPATSTPIGMRMISGADFGSSEYFVELKVGTPAQTFMLIADTGSD 180

Query: 181 LTWMKCRYRRCIGNCSSNPNHKTRNERKVRFRNAFLANYSSSFKTIDCSSKMCTNDLADL 240
           LTWMKCRYRRC GNCSSN NHK++NE+K RFR+AFLAN+SSSFKT+ CSS MCTNDLADL
Sbjct: 181 LTWMKCRYRRCFGNCSSNVNHKSKNEKKQRFRHAFLANHSSSFKTVSCSSTMCTNDLADL 240

Query: 241 FSIGECQTPTSPCLYDYSYSGGASAKGLFAIETLTVGLTNGKEKQLHNSIIGCTESVQGR 300
           F++ EC  PTSPC+YDYSY+GGASAKG+FA ETLTVGLTNGKEKQLHNSIIGCTESVQG 
Sbjct: 241 FAVRECHNPTSPCVYDYSYTGGASAKGIFAWETLTVGLTNGKEKQLHNSIIGCTESVQGS 300

Query: 301 IFGGADGVIGLGTSSYSFTYKAAENANGGGFAYCLVDHLSDRTATSYFILGNPISSTDSA 360
           +FGGADGV+GLGTSSYS TYKAAENANGGGF+YCLVDHL+D+ A SYF+LG P  ST  +
Sbjct: 301 VFGGADGVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAISYFVLGIPTPST--S 360

Query: 361 AAASSVAPTGNMSFTKLFLGDPYSSFYGVDLVGISADGVMLNIPPRVWDINSGGGTIVDS 420
           A+ SS      M++TKL++GDPYSSFYGVDL+GISA+G+MLNIP RVWDINSGGGTI+DS
Sbjct: 361 ASTSSAKLPAKMTYTKLYVGDPYSSFYGVDLIGISANGIMLNIPSRVWDINSGGGTIIDS 420

Query: 421 GTSLTMLAAPAFDMVMEALVPKLKHFENIEIEPFDFCFNNSRYTHEMAPKLRFHFGDGTV 480
           GTSLT+LAAPAFDMVMEAL P+LK F+ +EIEPFDFCFNNS+YTHEMAPKLRFHFGDGTV
Sbjct: 421 GTSLTILAAPAFDMVMEALTPRLKKFQQLEIEPFDFCFNNSQYTHEMAPKLRFHFGDGTV 480

Query: 481 FQPPPKSYIVSVGEYISCIGFVSMPFPATNIIGNILQQNHLWKFDFHAGTVGFAPSECV 514
           F+PP KSYIVSVG++ISCIGFVSMPFPA NIIGNILQQNHLW+FDF    VGFAPSEC+
Sbjct: 481 FEPPTKSYIVSVGKFISCIGFVSMPFPANNIIGNILQQNHLWQFDFQKRRVGFAPSECI 537

BLAST of Bhi10G000717 vs. NCBI nr
Match: XP_008456273.1 (PREDICTED: aspartic proteinase CDR1 [Cucumis melo])

HSP 1 Score: 764.2 bits (1972), Expect = 7.0e-217
Identity = 371/532 (69.74%), Positives = 439/532 (82.52%), Query Frame = 0

Query: 1   MLGYRKPMSPISHFCLFF-LFFFLSVPIAF-------------GDGSHDQENVKLDLLHR 60
           MLGYRKPMSPIS+FC FF L FFLS   +F              D   +Q+ ++ DLLHR
Sbjct: 1   MLGYRKPMSPISNFCFFFLLLFFLSFSSSFLFALGDEANNYNNNDDEDEQQTIRFDLLHR 60

Query: 61  HHPQVSEKLHGDIKLENMNDRIKDILEHDQKRYQTISSSLNRNEL-DEQLRKEAAELAE- 120
           HHPQVSEKL+GD+K++++++R+KDI EHD+ R+++IS S+N+ ++ D +LR EA    + 
Sbjct: 61  HHPQVSEKLNGDMKIQDLHERMKDIHEHDRNRHRSISKSMNQKQIEDARLRAEAEAATQV 120

Query: 121 ---KDLKLPPISSTPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIADTGSDLTWMKCR 180
              K   LPP +STPIG+KMISG+D+GSSEYFVQLKVGTP QTFMLIADTGSDLTW+KCR
Sbjct: 121 EVAKSAILPPATSTPIGMKMISGADFGSSEYFVQLKVGTPAQTFMLIADTGSDLTWLKCR 180

Query: 181 YRRCIGNCSSNPNHKTRNERKVRFRNAFLANYSSSFKTIDCSSKMCTNDLADLFSIGECQ 240
           YRRC GNCS N NHK++NE+K RFR+A LAN SS+FKT+ CSS MCTN+LA+LF++ EC 
Sbjct: 181 YRRCFGNCSGNVNHKSKNEKKQRFRHALLANQSSTFKTVSCSSTMCTNNLAELFAVAECD 240

Query: 241 TPTSPCLYDYSYSGGASAKGLFAIETLTVGLTNGKEKQLHNSIIGCTESVQGRIFGGADG 300
           TPTSPC+YDYSY+GGASAKG+FA ETLTVGLTNGKEKQL NSIIGCTE VQG +F GADG
Sbjct: 241 TPTSPCVYDYSYAGGASAKGIFAWETLTVGLTNGKEKQLRNSIIGCTEIVQGNVFDGADG 300

Query: 301 VIGLGTSSYSFTYKAAENANGGGFAYCLVDHLSDRTATSYFILGNPISSTDSAAAASSVA 360
           V+GLGTSSYS TYKAAENANGGGF+YCLVDHL+D+ A SYF+LG P  ST  +A+ SS  
Sbjct: 301 VMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAVSYFVLGVPTPST--SASTSSAK 360

Query: 361 PTGNMSFTKLFLGDPYSSFYGVDLVGISADGVMLNIPPRVWDINSGGGTIVDSGTSLTML 420
           P   MS+TKL++GDPYSSFYGVDL+GISADG MLNIPPRVWD   G GTI+DSGTSLT+L
Sbjct: 361 PPAKMSYTKLYVGDPYSSFYGVDLIGISADGQMLNIPPRVWDSYKGCGTIIDSGTSLTVL 420

Query: 421 AAPAFDMVMEALVPKLKHFENIEIEPFDFCFNNSRYTHEMAPKLRFHFGDGTVFQPPPKS 480
           A PAFD+VME L  +LK F+ IEIEPF+FCFNNS+YTH+MAPKLRFHFGDGTVF+PP KS
Sbjct: 421 ATPAFDVVMEVLTSRLKQFQQIEIEPFNFCFNNSQYTHDMAPKLRFHFGDGTVFEPPTKS 480

Query: 481 YIVSVGEYISCIGFVSMPFPATNIIGNILQQNHLWKFDFHAGTVGFAPSECV 514
           YIVSVGE+ISCIG VSMPFP+ NIIGNILQQNHLW+FDF    VGFA SEC+
Sbjct: 481 YIVSVGEFISCIGIVSMPFPSLNIIGNILQQNHLWQFDFQKRRVGFAASECI 530
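
The Identity and Positives figures in the HSP header above are plain ratios over the alignment length (371/532 and 439/532 for this match). A minimal Python sketch of that arithmetic follows; the function name `percent` is illustrative and is not part of BLAST or of this database.

```python
def percent(matches: int, alignment_length: int) -> float:
    """Return matches as a percentage of the alignment length, rounded to 2 places."""
    if alignment_length <= 0:
        raise ValueError("alignment_length must be positive")
    return round(100.0 * matches / alignment_length, 2)


# Values copied from the HSP header of the XP_008456273.1 match.
identity = percent(371, 532)   # identical residues / aligned columns
positives = percent(439, 532)  # identical residues plus positively scoring substitutions
print(f"Identity {identity}%, Positives {positives}%")
```

The same call applies to any other HSP header on this page, for example `percent(331, 460)` for the KAA0033565.1 match.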

BLAST of Bhi10G000717 vs. NCBI nr
Match: KAA0033565.1 (aspartic proteinase CDR1 [Cucumis melo var. makuwa] >TYJ95622.1 aspartic proteinase CDR1 [Cucumis melo var. makuwa])

HSP 1 Score: 691.8 bits (1784), Expect = 4.4e-195
Identity = 331/460 (71.96%), Positives = 392/460 (85.22%), Query Frame = 0

Query: 59  IKLENMNDRIKDILEHDQKRYQTISSSLNRNEL-DEQLRKEAAELAE----KDLKLPPIS 118
           +K++++++R+KDI EHD+ R+++IS S+N+ ++ D +LR EA    +    K   LPP +
Sbjct: 1   MKIQDLHERMKDIHEHDRNRHRSISKSMNQKQIEDARLRAEAEAATQVEVAKSAILPPAT 60

Query: 119 STPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIADTGSDLTWMKCRYRRCIGNCSSNP 178
           STPIG+KMISG+D+GSSEYFVQLKVGTP QTFMLIADTGSDLTW+KCRYRRC GNCS N 
Sbjct: 61  STPIGMKMISGADFGSSEYFVQLKVGTPAQTFMLIADTGSDLTWLKCRYRRCFGNCSGNV 120

Query: 179 NHKTRNERKVRFRNAFLANYSSSFKTIDCSSKMCTNDLADLFSIGECQTPTSPCLYDYSY 238
           NHK++NE+K RFR+A LAN SS+FKT+ CSS MCTN+LA+LF++ EC TPTSPC+YDYSY
Sbjct: 121 NHKSKNEKKQRFRHALLANQSSTFKTVSCSSTMCTNNLAELFAVAECDTPTSPCVYDYSY 180

Query: 239 SGGASAKGLFAIETLTVGLTNGKEKQLHNSIIGCTESVQGRIFGGADGVIGLGTSSYSFT 298
           +GGASAKG+FA ETLTVGLTNGKEKQL NSIIGCTE VQG +F GADGV+GLGTSSYS T
Sbjct: 181 AGGASAKGIFAWETLTVGLTNGKEKQLRNSIIGCTEIVQGNVFDGADGVMGLGTSSYSLT 240

Query: 299 YKAAENANGGGFAYCLVDHLSDRTATSYFILGNPISSTDSAAAASSVAPTGNMSFTKLFL 358
           YKAAENANGGGF+YCLVDHL+D+ A SYF+LG P  ST  +A+ SS  P   MS+TKL++
Sbjct: 241 YKAAENANGGGFSYCLVDHLTDQRAVSYFVLGVPTPST--SASTSSAKPPAKMSYTKLYV 300

Query: 359 GDPYSSFYGVDLVGISADGVMLNIPPRVWDINSGGGTIVDSGTSLTMLAAPAFDMVMEAL 418
           GDPYSSFYGVDL+GISADG MLNIPPRVWD   G GTI+DSGTSLT+LA PAFD+VME L
Sbjct: 301 GDPYSSFYGVDLIGISADGQMLNIPPRVWDSYKGCGTIIDSGTSLTVLATPAFDVVMEVL 360

Query: 419 VPKLKHFENIEIEPFDFCFNNSRYTHEMAPKLRFHFGDGTVFQPPPKSYIVSVGEYISCI 478
             +LK F+ IEIEPF+FCFNNS+YTH+MAPKLRFHFGDGTVF+PP KSYIVSVGE+ISCI
Sbjct: 361 TSRLKQFQQIEIEPFNFCFNNSQYTHDMAPKLRFHFGDGTVFEPPTKSYIVSVGEFISCI 420

Query: 479 GFVSMPFPATNIIGNILQQNHLWKFDFHAGTVGFAPSECV 514
           G VSMPFP+ NIIGNILQQNHLW+FDF    VGFA SEC+
Sbjct: 421 GIVSMPFPSLNIIGNILQQNHLWQFDFQKRRVGFAASECI 458
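
In the alignment blocks above, the unlabeled middle line annotates each column: a letter is an identical residue, `+` is a substitution with a positive score, and a blank is a mismatch or gap. The sketch below, assuming that convention, tallies both counts for one midline segment; `count_midline` is a hypothetical helper name, not a BLAST function.

```python
def count_midline(midline: str) -> tuple[int, int]:
    """Count (identities, positives) in one BLAST midline segment.

    Letters mark identical residues; '+' marks a positively scoring
    substitution. Positives include the identities, as in the HSP header.
    """
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    return identities, positives


# Midline of the final block of the KAA0033565.1 alignment (40 columns).
segment = "G VSMPFP+ NIIGNILQQNHLW+FDF    VGFA SEC+"
identities, positives = count_midline(segment)
print(identities, positives, len(segment))
```

Summing these tallies over every block of an HSP should reproduce the numerators reported in its Identity and Positives fields.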

BLAST of Bhi10G000717 vs. NCBI nr
Match: XP_022943788.1 (aspartic proteinase NANA, chloroplast-like isoform X1 [Cucurbita moschata])

HSP 1 Score: 621.7 bits (1602), Expect = 5.6e-174
Identity = 309/521 (59.31%), Positives = 381/521 (73.13%), Query Frame = 0

Query: 1   MLGYRKPMSPISHFCLFFLF-FFLSVPIAF-GDGSHDQEN------VKLDLLHRHHPQVS 60
           MLGY  PMSPIS   +FF F FFLSV +AF GD    Q+       VKLD++HRHHP V 
Sbjct: 1   MLGYTNPMSPISPLLIFFFFNFFLSVHVAFDGDEQQQQQKPSEMPMVKLDVMHRHHPHVQ 60

Query: 61  EKLHGDIKLENMNDRIKDILEHDQKRYQTISSSLNRNELDEQLRKEAAELAEKDLKLPPI 120
           EKL+G+ +     DR +DI EHD  R ++IS+S+  ++ D Q              LP  
Sbjct: 61  EKLYGERRSLGSTDRFRDIHEHDHNRQRSISTSMKMSKTDRQ--------------LPMP 120

Query: 121 SSTPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIADTGSDLTWMKCRYRRCIGNCSSN 180
           SS PI LK+ SG D+G++EYFVQ +VGTPPQ F+LI DTGSDLTW+KCRYRRC+GNC+++
Sbjct: 121 SSAPIQLKISSGFDFGTNEYFVQFRVGTPPQKFLLIVDTGSDLTWLKCRYRRCLGNCTAH 180

Query: 181 PNHKTRNERKVRFRNAFLANYSSSFKTIDCSSKMCTNDLADLFSIGECQTPTSPCLYDYS 240
            +HK+R E KV+F + FLAN+SSSFK I C S  C  DL  LF+I +CQ P++PC+YDYS
Sbjct: 181 AHHKSRVEHKVKFDHPFLANHSSSFKLITCGSDFCLGDLQLLFAIPDCQVPSNPCVYDYS 240

Query: 241 YSGGASAKGLFAIETLTVGLTNGKEKQLHNSIIGCTESVQGRIFGGADGVIGLGTSSYSF 300
           Y GG +A GLFA ET+TVGLTNGKEKQLH+++IGCTE        G DG++GLGT ++SF
Sbjct: 241 YIGGGAATGLFANETVTVGLTNGKEKQLHDTLIGCTELFNAMQLKGVDGILGLGTGAHSF 300

Query: 301 TYKAAENANGGGFAYCLVDHLSDRTATSYFILGNPISSTDSAAAASSVAPTGNMSFTKLF 360
            ++AA + NGGGF+YCL+DHLS  +ATSYFILG P       A   SVAP GNM+F  L 
Sbjct: 301 AHRAALDKNGGGFSYCLIDHLSHHSATSYFILGYP------PAEPLSVAPVGNMTFINLH 360

Query: 361 LGDPYSSFYGVDLVGISADGVMLNIPPRVWDINSGGGTIVDSGTSLTMLAAPAFDMVMEA 420
           LG P++S+YGV L+GIS DGV LNIPPRVWDI  GGGTI+DSGTSL+ML APAFD+ MEA
Sbjct: 361 LGGPFNSYYGVGLIGISIDGVTLNIPPRVWDIQKGGGTILDSGTSLSMLTAPAFDVFMEA 420

Query: 421 LVPKLKHFENIEIEPFDFCFNNSRYTHEMAPKLRFHFGDGTVFQPPPKSYIVSVGEYISC 480
           +V KLK F+ I  +PF +CFN + Y+HEMAPKLRFHF  G VF+PPPKSYIV V + I C
Sbjct: 421 MVQKLKKFQQILADPFAYCFNKTHYSHEMAPKLRFHFEKGVVFEPPPKSYIVKVDD-ILC 480

Query: 481 IGFVSMPFPATNIIGNILQQNHLWKFDFHAGTVGFAPSECV 514
           +GF S+PFP TNIIGNILQQN LW+FDF    VGFAPS+C+
Sbjct: 481 LGFTSIPFPDTNIIGNILQQNFLWQFDFFNKKVGFAPSQCI 500

BLAST of Bhi10G000717 vs. ExPASy TrEMBL
Match: A0A0A0KG92 (Peptidase A1 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G134390 PE=3 SV=1)

HSP 1 Score: 797.0 bits (2057), Expect = 4.8e-227
Identity = 386/539 (71.61%), Positives = 454/539 (84.23%), Query Frame = 0

Query: 1   MLGYRKPMSPISHFCLFF----LFFFLSVPIAF-----------------GDGSHDQENV 60
           MLGYRKPMSPIS+FC FF    LFFFLS   +F                  D   +QE +
Sbjct: 1   MLGYRKPMSPISNFCFFFFFFLLFFFLSFSSSFLFALGDEDNNFNNNNNINDDEDEQEII 60

Query: 61  KLDLLHRHHPQVSEKLHGDIKLENMNDRIKDILEHDQKRYQTISSSLNRNEL-DEQLRKE 120
           K DLLHRHHPQV+EK+HGD+K++++++R+KDI EHD  R+++IS S+N+ ++ D +LR E
Sbjct: 61  KFDLLHRHHPQVAEKIHGDMKIQDVSERMKDIHEHDHNRHRSISKSMNQKQVEDARLRAE 120

Query: 121 AAELAEKDLK----LPPISSTPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIADTGSD 180
           A    E+++     LPP +STPIG++MISG+D+GSSEYFV+LKVGTP QTFMLIADTGSD
Sbjct: 121 AEAATEEEVAKSAILPPATSTPIGMRMISGADFGSSEYFVELKVGTPAQTFMLIADTGSD 180

Query: 181 LTWMKCRYRRCIGNCSSNPNHKTRNERKVRFRNAFLANYSSSFKTIDCSSKMCTNDLADL 240
           LTWMKCRYRRC GNCSSN NHK++NE+K RFR+AFLAN+SSSFKT+ CSS MCTNDLADL
Sbjct: 181 LTWMKCRYRRCFGNCSSNVNHKSKNEKKQRFRHAFLANHSSSFKTVSCSSTMCTNDLADL 240

Query: 241 FSIGECQTPTSPCLYDYSYSGGASAKGLFAIETLTVGLTNGKEKQLHNSIIGCTESVQGR 300
           F++ EC  PTSPC+YDYSY+GGASAKG+FA ETLTVGLTNGKEKQLHNSIIGCTESVQG 
Sbjct: 241 FAVRECHNPTSPCVYDYSYTGGASAKGIFAWETLTVGLTNGKEKQLHNSIIGCTESVQGS 300

Query: 301 IFGGADGVIGLGTSSYSFTYKAAENANGGGFAYCLVDHLSDRTATSYFILGNPISSTDSA 360
           +FGGADGV+GLGTSSYS TYKAAENANGGGF+YCLVDHL+D+ A SYF+LG P  ST  +
Sbjct: 301 VFGGADGVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAISYFVLGIPTPST--S 360

Query: 361 AAASSVAPTGNMSFTKLFLGDPYSSFYGVDLVGISADGVMLNIPPRVWDINSGGGTIVDS 420
           A+ SS      M++TKL++GDPYSSFYGVDL+GISA+G+MLNIP RVWDINSGGGTI+DS
Sbjct: 361 ASTSSAKLPAKMTYTKLYVGDPYSSFYGVDLIGISANGIMLNIPSRVWDINSGGGTIIDS 420

Query: 421 GTSLTMLAAPAFDMVMEALVPKLKHFENIEIEPFDFCFNNSRYTHEMAPKLRFHFGDGTV 480
           GTSLT+LAAPAFDMVMEAL P+LK F+ +EIEPFDFCFNNS+YTHEMAPKLRFHFGDGTV
Sbjct: 421 GTSLTILAAPAFDMVMEALTPRLKKFQQLEIEPFDFCFNNSQYTHEMAPKLRFHFGDGTV 480

Query: 481 FQPPPKSYIVSVGEYISCIGFVSMPFPATNIIGNILQQNHLWKFDFHAGTVGFAPSECV 514
           F+PP KSYIVSVG++ISCIGFVSMPFPA NIIGNILQQNHLW+FDF    VGFAPSEC+
Sbjct: 481 FEPPTKSYIVSVGKFISCIGFVSMPFPANNIIGNILQQNHLWQFDFQKRRVGFAPSECI 537

BLAST of Bhi10G000717 vs. ExPASy TrEMBL
Match: A0A1S3C2F3 (aspartic proteinase CDR1 OS=Cucumis melo OX=3656 GN=LOC103496268 PE=3 SV=1)

HSP 1 Score: 764.2 bits (1972), Expect = 3.4e-217
Identity = 371/532 (69.74%), Positives = 439/532 (82.52%), Query Frame = 0

Query: 1   MLGYRKPMSPISHFCLFF-LFFFLSVPIAF-------------GDGSHDQENVKLDLLHR 60
           MLGYRKPMSPIS+FC FF L FFLS   +F              D   +Q+ ++ DLLHR
Sbjct: 1   MLGYRKPMSPISNFCFFFLLLFFLSFSSSFLFALGDEANNYNNNDDEDEQQTIRFDLLHR 60

Query: 61  HHPQVSEKLHGDIKLENMNDRIKDILEHDQKRYQTISSSLNRNEL-DEQLRKEAAELAE- 120
           HHPQVSEKL+GD+K++++++R+KDI EHD+ R+++IS S+N+ ++ D +LR EA    + 
Sbjct: 61  HHPQVSEKLNGDMKIQDLHERMKDIHEHDRNRHRSISKSMNQKQIEDARLRAEAEAATQV 120

Query: 121 ---KDLKLPPISSTPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIADTGSDLTWMKCR 180
              K   LPP +STPIG+KMISG+D+GSSEYFVQLKVGTP QTFMLIADTGSDLTW+KCR
Sbjct: 121 EVAKSAILPPATSTPIGMKMISGADFGSSEYFVQLKVGTPAQTFMLIADTGSDLTWLKCR 180

Query: 181 YRRCIGNCSSNPNHKTRNERKVRFRNAFLANYSSSFKTIDCSSKMCTNDLADLFSIGECQ 240
           YRRC GNCS N NHK++NE+K RFR+A LAN SS+FKT+ CSS MCTN+LA+LF++ EC 
Sbjct: 181 YRRCFGNCSGNVNHKSKNEKKQRFRHALLANQSSTFKTVSCSSTMCTNNLAELFAVAECD 240

Query: 241 TPTSPCLYDYSYSGGASAKGLFAIETLTVGLTNGKEKQLHNSIIGCTESVQGRIFGGADG 300
           TPTSPC+YDYSY+GGASAKG+FA ETLTVGLTNGKEKQL NSIIGCTE VQG +F GADG
Sbjct: 241 TPTSPCVYDYSYAGGASAKGIFAWETLTVGLTNGKEKQLRNSIIGCTEIVQGNVFDGADG 300

Query: 301 VIGLGTSSYSFTYKAAENANGGGFAYCLVDHLSDRTATSYFILGNPISSTDSAAAASSVA 360
           V+GLGTSSYS TYKAAENANGGGF+YCLVDHL+D+ A SYF+LG P  ST  +A+ SS  
Sbjct: 301 VMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAVSYFVLGVPTPST--SASTSSAK 360

Query: 361 PTGNMSFTKLFLGDPYSSFYGVDLVGISADGVMLNIPPRVWDINSGGGTIVDSGTSLTML 420
           P   MS+TKL++GDPYSSFYGVDL+GISADG MLNIPPRVWD   G GTI+DSGTSLT+L
Sbjct: 361 PPAKMSYTKLYVGDPYSSFYGVDLIGISADGQMLNIPPRVWDSYKGCGTIIDSGTSLTVL 420

Query: 421 AAPAFDMVMEALVPKLKHFENIEIEPFDFCFNNSRYTHEMAPKLRFHFGDGTVFQPPPKS 480
           A PAFD+VME L  +LK F+ IEIEPF+FCFNNS+YTH+MAPKLRFHFGDGTVF+PP KS
Sbjct: 421 ATPAFDVVMEVLTSRLKQFQQIEIEPFNFCFNNSQYTHDMAPKLRFHFGDGTVFEPPTKS 480

Query: 481 YIVSVGEYISCIGFVSMPFPATNIIGNILQQNHLWKFDFHAGTVGFAPSECV 514
           YIVSVGE+ISCIG VSMPFP+ NIIGNILQQNHLW+FDF    VGFA SEC+
Sbjct: 481 YIVSVGEFISCIGIVSMPFPSLNIIGNILQQNHLWQFDFQKRRVGFAASECI 530

BLAST of Bhi10G000717 vs. ExPASy TrEMBL
Match: A0A5D3B701 (Aspartic proteinase CDR1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold104G00400 PE=3 SV=1)

HSP 1 Score: 691.8 bits (1784), Expect = 2.2e-195
Identity = 331/460 (71.96%), Positives = 392/460 (85.22%), Query Frame = 0

Query: 59  IKLENMNDRIKDILEHDQKRYQTISSSLNRNEL-DEQLRKEAAELAE----KDLKLPPIS 118
           +K++++++R+KDI EHD+ R+++IS S+N+ ++ D +LR EA    +    K   LPP +
Sbjct: 1   MKIQDLHERMKDIHEHDRNRHRSISKSMNQKQIEDARLRAEAEAATQVEVAKSAILPPAT 60

Query: 119 STPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIADTGSDLTWMKCRYRRCIGNCSSNP 178
           STPIG+KMISG+D+GSSEYFVQLKVGTP QTFMLIADTGSDLTW+KCRYRRC GNCS N 
Sbjct: 61  STPIGMKMISGADFGSSEYFVQLKVGTPAQTFMLIADTGSDLTWLKCRYRRCFGNCSGNV 120

Query: 179 NHKTRNERKVRFRNAFLANYSSSFKTIDCSSKMCTNDLADLFSIGECQTPTSPCLYDYSY 238
           NHK++NE+K RFR+A LAN SS+FKT+ CSS MCTN+LA+LF++ EC TPTSPC+YDYSY
Sbjct: 121 NHKSKNEKKQRFRHALLANQSSTFKTVSCSSTMCTNNLAELFAVAECDTPTSPCVYDYSY 180

Query: 239 SGGASAKGLFAIETLTVGLTNGKEKQLHNSIIGCTESVQGRIFGGADGVIGLGTSSYSFT 298
           +GGASAKG+FA ETLTVGLTNGKEKQL NSIIGCTE VQG +F GADGV+GLGTSSYS T
Sbjct: 181 AGGASAKGIFAWETLTVGLTNGKEKQLRNSIIGCTEIVQGNVFDGADGVMGLGTSSYSLT 240

Query: 299 YKAAENANGGGFAYCLVDHLSDRTATSYFILGNPISSTDSAAAASSVAPTGNMSFTKLFL 358
           YKAAENANGGGF+YCLVDHL+D+ A SYF+LG P  ST  +A+ SS  P   MS+TKL++
Sbjct: 241 YKAAENANGGGFSYCLVDHLTDQRAVSYFVLGVPTPST--SASTSSAKPPAKMSYTKLYV 300

Query: 359 GDPYSSFYGVDLVGISADGVMLNIPPRVWDINSGGGTIVDSGTSLTMLAAPAFDMVMEAL 418
           GDPYSSFYGVDL+GISADG MLNIPPRVWD   G GTI+DSGTSLT+LA PAFD+VME L
Sbjct: 301 GDPYSSFYGVDLIGISADGQMLNIPPRVWDSYKGCGTIIDSGTSLTVLATPAFDVVMEVL 360

Query: 419 VPKLKHFENIEIEPFDFCFNNSRYTHEMAPKLRFHFGDGTVFQPPPKSYIVSVGEYISCI 478
             +LK F+ IEIEPF+FCFNNS+YTH+MAPKLRFHFGDGTVF+PP KSYIVSVGE+ISCI
Sbjct: 361 TSRLKQFQQIEIEPFNFCFNNSQYTHDMAPKLRFHFGDGTVFEPPTKSYIVSVGEFISCI 420

Query: 479 GFVSMPFPATNIIGNILQQNHLWKFDFHAGTVGFAPSECV 514
           G VSMPFP+ NIIGNILQQNHLW+FDF    VGFA SEC+
Sbjct: 421 GIVSMPFPSLNIIGNILQQNHLWQFDFQKRRVGFAASECI 458

BLAST of Bhi10G000717 vs. ExPASy TrEMBL
Match: A0A6J1FXD5 (aspartic proteinase NANA, chloroplast-like isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111448433 PE=3 SV=1)

HSP 1 Score: 621.7 bits (1602), Expect = 2.7e-174
Identity = 309/521 (59.31%), Positives = 381/521 (73.13%), Query Frame = 0

Query: 1   MLGYRKPMSPISHFCLFFLF-FFLSVPIAF-GDGSHDQEN------VKLDLLHRHHPQVS 60
           MLGY  PMSPIS   +FF F FFLSV +AF GD    Q+       VKLD++HRHHP V 
Sbjct: 1   MLGYTNPMSPISPLLIFFFFNFFLSVHVAFDGDEQQQQQKPSEMPMVKLDVMHRHHPHVQ 60

Query: 61  EKLHGDIKLENMNDRIKDILEHDQKRYQTISSSLNRNELDEQLRKEAAELAEKDLKLPPI 120
           EKL+G+ +     DR +DI EHD  R ++IS+S+  ++ D Q              LP  
Sbjct: 61  EKLYGERRSLGSTDRFRDIHEHDHNRQRSISTSMKMSKTDRQ--------------LPMP 120

Query: 121 SSTPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIADTGSDLTWMKCRYRRCIGNCSSN 180
           SS PI LK+ SG D+G++EYFVQ +VGTPPQ F+LI DTGSDLTW+KCRYRRC+GNC+++
Sbjct: 121 SSAPIQLKISSGFDFGTNEYFVQFRVGTPPQKFLLIVDTGSDLTWLKCRYRRCLGNCTAH 180

Query: 181 PNHKTRNERKVRFRNAFLANYSSSFKTIDCSSKMCTNDLADLFSIGECQTPTSPCLYDYS 240
            +HK+R E KV+F + FLAN+SSSFK I C S  C  DL  LF+I +CQ P++PC+YDYS
Sbjct: 181 AHHKSRVEHKVKFDHPFLANHSSSFKLITCGSDFCLGDLQLLFAIPDCQVPSNPCVYDYS 240

Query: 241 YSGGASAKGLFAIETLTVGLTNGKEKQLHNSIIGCTESVQGRIFGGADGVIGLGTSSYSF 300
           Y GG +A GLFA ET+TVGLTNGKEKQLH+++IGCTE        G DG++GLGT ++SF
Sbjct: 241 YIGGGAATGLFANETVTVGLTNGKEKQLHDTLIGCTELFNAMQLKGVDGILGLGTGAHSF 300

Query: 301 TYKAAENANGGGFAYCLVDHLSDRTATSYFILGNPISSTDSAAAASSVAPTGNMSFTKLF 360
            ++AA + NGGGF+YCL+DHLS  +ATSYFILG P       A   SVAP GNM+F  L 
Sbjct: 301 AHRAALDKNGGGFSYCLIDHLSHHSATSYFILGYP------PAEPLSVAPVGNMTFINLH 360

Query: 361 LGDPYSSFYGVDLVGISADGVMLNIPPRVWDINSGGGTIVDSGTSLTMLAAPAFDMVMEA 420
           LG P++S+YGV L+GIS DGV LNIPPRVWDI  GGGTI+DSGTSL+ML APAFD+ MEA
Sbjct: 361 LGGPFNSYYGVGLIGISIDGVTLNIPPRVWDIQKGGGTILDSGTSLSMLTAPAFDVFMEA 420

Query: 421 LVPKLKHFENIEIEPFDFCFNNSRYTHEMAPKLRFHFGDGTVFQPPPKSYIVSVGEYISC 480
           +V KLK F+ I  +PF +CFN + Y+HEMAPKLRFHF  G VF+PPPKSYIV V + I C
Sbjct: 421 MVQKLKKFQQILADPFAYCFNKTHYSHEMAPKLRFHFEKGVVFEPPPKSYIVKVDD-ILC 480

Query: 481 IGFVSMPFPATNIIGNILQQNHLWKFDFHAGTVGFAPSECV 514
           +GF S+PFP TNIIGNILQQN LW+FDF    VGFAPS+C+
Sbjct: 481 LGFTSIPFPDTNIIGNILQQNFLWQFDFFNKKVGFAPSQCI 500

BLAST of Bhi10G000717 vs. ExPASy TrEMBL
Match: A0A6J1FVB3 (aspartic proteinase NANA, chloroplast-like isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111448433 PE=3 SV=1)

HSP 1 Score: 610.5 bits (1573), Expect = 6.3e-171
Identity = 304/514 (59.14%), Positives = 376/514 (73.15%), Query Frame = 0

Query: 8   MSPISHFCLFFLF-FFLSVPIAF-GDGSHDQEN------VKLDLLHRHHPQVSEKLHGDI 67
           MSPIS   +FF F FFLSV +AF GD    Q+       VKLD++HRHHP V EKL+G+ 
Sbjct: 1   MSPISPLLIFFFFNFFLSVHVAFDGDEQQQQQKPSEMPMVKLDVMHRHHPHVQEKLYGER 60

Query: 68  KLENMNDRIKDILEHDQKRYQTISSSLNRNELDEQLRKEAAELAEKDLKLPPISSTPIGL 127
           +     DR +DI EHD  R ++IS+S+  ++ D Q              LP  SS PI L
Sbjct: 61  RSLGSTDRFRDIHEHDHNRQRSISTSMKMSKTDRQ--------------LPMPSSAPIQL 120

Query: 128 KMISGSDYGSSEYFVQLKVGTPPQTFMLIADTGSDLTWMKCRYRRCIGNCSSNPNHKTRN 187
           K+ SG D+G++EYFVQ +VGTPPQ F+LI DTGSDLTW+KCRYRRC+GNC+++ +HK+R 
Sbjct: 121 KISSGFDFGTNEYFVQFRVGTPPQKFLLIVDTGSDLTWLKCRYRRCLGNCTAHAHHKSRV 180

Query: 188 ERKVRFRNAFLANYSSSFKTIDCSSKMCTNDLADLFSIGECQTPTSPCLYDYSYSGGASA 247
           E KV+F + FLAN+SSSFK I C S  C  DL  LF+I +CQ P++PC+YDYSY GG +A
Sbjct: 181 EHKVKFDHPFLANHSSSFKLITCGSDFCLGDLQLLFAIPDCQVPSNPCVYDYSYIGGGAA 240

Query: 248 KGLFAIETLTVGLTNGKEKQLHNSIIGCTESVQGRIFGGADGVIGLGTSSYSFTYKAAEN 307
            GLFA ET+TVGLTNGKEKQLH+++IGCTE        G DG++GLGT ++SF ++AA +
Sbjct: 241 TGLFANETVTVGLTNGKEKQLHDTLIGCTELFNAMQLKGVDGILGLGTGAHSFAHRAALD 300

Query: 308 ANGGGFAYCLVDHLSDRTATSYFILGNPISSTDSAAAASSVAPTGNMSFTKLFLGDPYSS 367
            NGGGF+YCL+DHLS  +ATSYFILG P       A   SVAP GNM+F  L LG P++S
Sbjct: 301 KNGGGFSYCLIDHLSHHSATSYFILGYP------PAEPLSVAPVGNMTFINLHLGGPFNS 360

Query: 368 FYGVDLVGISADGVMLNIPPRVWDINSGGGTIVDSGTSLTMLAAPAFDMVMEALVPKLKH 427
           +YGV L+GIS DGV LNIPPRVWDI  GGGTI+DSGTSL+ML APAFD+ MEA+V KLK 
Sbjct: 361 YYGVGLIGISIDGVTLNIPPRVWDIQKGGGTILDSGTSLSMLTAPAFDVFMEAMVQKLKK 420

Query: 428 FENIEIEPFDFCFNNSRYTHEMAPKLRFHFGDGTVFQPPPKSYIVSVGEYISCIGFVSMP 487
           F+ I  +PF +CFN + Y+HEMAPKLRFHF  G VF+PPPKSYIV V + I C+GF S+P
Sbjct: 421 FQQILADPFAYCFNKTHYSHEMAPKLRFHFEKGVVFEPPPKSYIVKVDD-ILCLGFTSIP 480

Query: 488 FPATNIIGNILQQNHLWKFDFHAGTVGFAPSECV 514
           FP TNIIGNILQQN LW+FDF    VGFAPS+C+
Sbjct: 481 FPDTNIIGNILQQNFLWQFDFFNKKVGFAPSQCI 493

The following BLAST results are available for this feature:
Match Name      E-value   Identity  Description
AT3G12700.1     6.8e-93   38.72     Eukaryotic aspartyl protease family protein
AT3G25700.1     8.4e-59   35.94     Eukaryotic aspartyl protease family protein
AT3G59080.1     1.3e-48   30.66     Eukaryotic aspartyl protease family protein
AT2G42980.1     1.8e-48   29.75     Eukaryotic aspartyl protease family protein
AT3G59080.2     1.7e-43   28.87     Eukaryotic aspartyl protease family protein

Match Name      E-value   Identity  Description
Q9LTW4          9.6e-92   38.72     Aspartic proteinase NANA, chloroplast OS=Arabidopsis thaliana OX=3702 GN=NANA PE...
Q9LNJ3          2.3e-37   32.75     Aspartyl protease family protein 2 OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 ...
Q9LHE3          2.3e-37   28.22     Protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Arabidopsis thaliana OX=3702 GN=ASP...
Q9LS40          2.6e-36   28.32     Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana OX=3702 GN=ASP...
Q6XBF8          1.7e-32   28.83     Aspartic proteinase CDR1 OS=Arabidopsis thaliana OX=3702 GN=CDR1 PE=1 SV=1

Match Name      E-value   Identity  Description
XP_038901983.1  2.4e-302  100.00    aspartic proteinase NANA, chloroplast [Benincasa hispida]
XP_004140022.2  9.8e-227  71.61     aspartic proteinase NANA, chloroplast [Cucumis sativus] >KGN46781.1 hypothetical...
XP_008456273.1  7.0e-217  69.74     PREDICTED: aspartic proteinase CDR1 [Cucumis melo]
KAA0033565.1    4.4e-195  71.96     aspartic proteinase CDR1 [Cucumis melo var. makuwa] >TYJ95622.1 aspartic protein...
XP_022943788.1  5.6e-174  59.31     aspartic proteinase NANA, chloroplast-like isoform X1 [Cucurbita moschata]

Match Name      E-value   Identity  Description
A0A0A0KG92      4.8e-227  71.61     Peptidase A1 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G13439...
A0A1S3C2F3      3.4e-217  69.74     aspartic proteinase CDR1 OS=Cucumis melo OX=3656 GN=LOC103496268 PE=3 SV=1
A0A5D3B701      2.2e-195  71.96     Aspartic proteinase CDR1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffol...
A0A6J1FXD5      2.7e-174  59.31     aspartic proteinase NANA, chloroplast-like isoform X1 OS=Cucurbita moschata OX=3...
A0A6J1FVB3      6.3e-171  59.14     aspartic proteinase NANA, chloroplast-like isoform X2 OS=Cucurbita moschata OX=3...

InterPro
Analysis Name: InterPro Annotations of Wax gourd (B227) v1
Date Performed: 2021-10-22
IPR Term   IPR Description                        Source       Source Term     Source Description                     Alignment
None       No IPR available                       COILS        Coil            Coil                                   coord: 88..108
None       No IPR available                       PANTHER      PTHR47967:SF69  ASPARTIC PROTEINASE NANA, CHLOROPLAST  coord: 28..512
None       No IPR available                       PANTHER      PTHR47967       OS07G0603500 PROTEIN-RELATED           coord: 28..512
IPR001461  Aspartic peptidase A1 family           PRINTS       PR00792         PEPSIN                                 coord: 138..158; score: 49.14
IPR001461  Aspartic peptidase A1 family           PRINTS       PR00792         PEPSIN                                 coord: 484..499; score: 23.01
IPR001461  Aspartic peptidase A1 family           PRINTS       PR00792         PEPSIN                                 coord: 390..401; score: 48.62
IPR032861  Xylanase inhibitor, N-terminal         PFAM         PF14543         TAXi_N                                 coord: 132..326; e-value: 6.1E-44; score: 150.4
IPR032799  Xylanase inhibitor, C-terminal         PFAM         PF14541         TAXi_C                                 coord: 361..508; e-value: 2.1E-29; score: 102.4
IPR021109  Aspartic peptidase domain superfamily  GENE3D       2.40.70.10      Acid Proteases                         coord: 117..326; e-value: 1.2E-36; score: 128.6
IPR021109  Aspartic peptidase domain superfamily  GENE3D       2.40.70.10      Acid Proteases                         coord: 338..513; e-value: 5.4E-38; score: 132.5
IPR021109  Aspartic peptidase domain superfamily  SUPERFAMILY  50630           Acid proteases                         coord: 124..512
IPR001969  Aspartic peptidase, active site        PROSITE      PS00141         ASP_PROTEASE                           coord: 147..158
IPR033121  Peptidase family A1 domain             PROSITE      PS51767         PEPTIDASE_A1                           coord: 132..508; score: 36.5858
IPR034161  Pepsin-like domain, plant              CDD          cd05476         pepsin_A_like_plant                    coord: 131..512; e-value: 9.52291E-77; score: 240.243

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
Bhi10M000717  Bhi10M000717  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0006508      proteolysis
cellular_component  GO:0005576      extracellular region
molecular_function  GO:0004190      aspartic-type endopeptidase activity