HG10010824 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
NameHG10010824
Typegene
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Descriptionaspartic proteinase CDR1-like
LocationChr06: 26264250 .. 26265630 (-)
RNA-Seq ExpressionHG10010824
SyntenyHG10010824
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCACCCATTCTCTCCCTTATTTTCTTAATCTCCTCCACCGCCGTCTTAGTCGCCGTCACTGGCCGTGACTATGACTTCACTGTCGAACTCATCCACCGTGACTCCCCCAAGTCCCCTATGTACAACCCATCGGAGACTCACTACCACCGCCTTGCCAACACCCTCCGTCGTTCCATCAGCCGTAACACAGCGGCGCTGACAGACACAGCAGAGGCTCCTATTTACAACTACGGAGGCCAATACCTCATGAAAATATCCCTTGGAACGCCACCGTTTTCAATTATAGCCGTTGCTGACACAGGAAGTGACATCATTTGGACCCAGTGTGAACCATGCCCAAATTGCTATGAACAAAACGCGCCAATGTTTGACCCGAGTAAATCCACGACTTACAAAAATGTGCCGTGTTCCTCGCCAATTTGCTCGTTTGCAGGCCAAGAACGTTCCTGTTCCACTCAGTCTGAGTGTTTGTACTCGATTACGTACGGTGATAGGTCGCATAGCCAAGGAGATCTTGCCGTTGATACCGTTACAATGGGGTCTACCTCTGGCCGTCCCGTGACGTTTCCTCATATTGCGATTGGTTGTGGTCATGACAATGCTGGCACTTTCGATGCTAATGTCTCTGGCATTGTTGGGCTCGGGCATGGTCCAGCTTCACTTGTCAATCAAATGGGACCAGCTACTGGCGGAAAATTCTCTTATTGTTTAGCTTCGATTGGAAACAACAGTATTGAGTCTAGCAAACTTAACTTTGGCTCTAATGCTATCGTCTCGGGCTATAAAACTGTATCAACTCCGATTTATACTAGTGGTAAATAAATTAATTGATATGTATATATATATTTAGGTTGTATATAATTAATCTTTTGTTTAATTTTTTTAATGAATAGATACATACAAAACTTTCTACTCGCTCAAGCTAGAAGCTATAAGCGTTGGGGAGAACACATTTGATTTTCCAGTGGTCTCTTCAAGATTAGGCGGAGAGGCAAACATCATCATTGACTCTGGCACAACGCTTACAATCCTCCCAGTGGATTTATACAATAACTTCGCAACAGCAATTTCTGACTCGATCAACCTTCAGCATGCTAATGACCCGAATCAATTCTTGGATTATTGCTATGCAACCACCACTAATGACTATAAAGTGCCGCCCGTTACAATGCACTTTGAAGGAGCTGATGTGCTCCTCCAGCGAGAAAATGTGTTCATTAGGGTGTCGGACGACGTCGTTTGCTTGGCTTTTACTGCTAGCCAGGACGAAGACAATATTTCCATATATGGAAATATTTCACAGAACAACTTCTTGGTTGGTTACGATATTAAGAACATGTCTGTTTCTTTTAAGCCGGCGGATTGTGTTGCCATGTGA

mRNA sequence

ATGGCACCCATTCTCTCCCTTATTTTCTTAATCTCCTCCACCGCCGTCTTAGTCGCCGTCACTGGCCGTGACTATGACTTCACTGTCGAACTCATCCACCGTGACTCCCCCAAGTCCCCTATGTACAACCCATCGGAGACTCACTACCACCGCCTTGCCAACACCCTCCGTCGTTCCATCAGCCGTAACACAGCGGCGCTGACAGACACAGCAGAGGCTCCTATTTACAACTACGGAGGCCAATACCTCATGAAAATATCCCTTGGAACGCCACCGTTTTCAATTATAGCCGTTGCTGACACAGGAAGTGACATCATTTGGACCCAGTGTGAACCATGCCCAAATTGCTATGAACAAAACGCGCCAATGTTTGACCCGAGTAAATCCACGACTTACAAAAATGTGCCGTGTTCCTCGCCAATTTGCTCGTTTGCAGGCCAAGAACGTTCCTGTTCCACTCAGTCTGAGTGTTTGTACTCGATTACGTACGGTGATAGGTCGCATAGCCAAGGAGATCTTGCCGTTGATACCGTTACAATGGGGTCTACCTCTGGCCGTCCCGTGACGTTTCCTCATATTGCGATTGGTTGTGGTCATGACAATGCTGGCACTTTCGATGCTAATGTCTCTGGCATTGTTGGGCTCGGGCATGGTCCAGCTTCACTTGTCAATCAAATGGGACCAGCTACTGGCGGAAAATTCTCTTATTGTTTAGCTTCGATTGGAAACAACAGTATTGAGTCTAGCAAACTTAACTTTGGCTCTAATGCTATCGTCTCGGGCTATAAAACTGTATCAACTCCGATTTATACTAGTGATACATACAAAACTTTCTACTCGCTCAAGCTAGAAGCTATAAGCGTTGGGGAGAACACATTTGATTTTCCAGTGGTCTCTTCAAGATTAGGCGGAGAGGCAAACATCATCATTGACTCTGGCACAACGCTTACAATCCTCCCAGTGGATTTATACAATAACTTCGCAACAGCAATTTCTGACTCGATCAACCTTCAGCATGCTAATGACCCGAATCAATTCTTGGATTATTGCTATGCAACCACCACTAATGACTATAAAGTGCCGCCCGTTACAATGCACTTTGAAGGAGCTGATGTGCTCCTCCAGCGAGAAAATGTGTTCATTAGGGTGTCGGACGACGTCGTTTGCTTGGCTTTTACTGCTAGCCAGGACGAAGACAATATTTCCATATATGGAAATATTTCACAGAACAACTTCTTGGTTGGTTACGATATTAAGAACATGTCTGTTTCTTTTAAGCCGGCGGATTGTGTTGCCATGTGA

Coding sequence (CDS)

ATGGCACCCATTCTCTCCCTTATTTTCTTAATCTCCTCCACCGCCGTCTTAGTCGCCGTCACTGGCCGTGACTATGACTTCACTGTCGAACTCATCCACCGTGACTCCCCCAAGTCCCCTATGTACAACCCATCGGAGACTCACTACCACCGCCTTGCCAACACCCTCCGTCGTTCCATCAGCCGTAACACAGCGGCGCTGACAGACACAGCAGAGGCTCCTATTTACAACTACGGAGGCCAATACCTCATGAAAATATCCCTTGGAACGCCACCGTTTTCAATTATAGCCGTTGCTGACACAGGAAGTGACATCATTTGGACCCAGTGTGAACCATGCCCAAATTGCTATGAACAAAACGCGCCAATGTTTGACCCGAGTAAATCCACGACTTACAAAAATGTGCCGTGTTCCTCGCCAATTTGCTCGTTTGCAGGCCAAGAACGTTCCTGTTCCACTCAGTCTGAGTGTTTGTACTCGATTACGTACGGTGATAGGTCGCATAGCCAAGGAGATCTTGCCGTTGATACCGTTACAATGGGGTCTACCTCTGGCCGTCCCGTGACGTTTCCTCATATTGCGATTGGTTGTGGTCATGACAATGCTGGCACTTTCGATGCTAATGTCTCTGGCATTGTTGGGCTCGGGCATGGTCCAGCTTCACTTGTCAATCAAATGGGACCAGCTACTGGCGGAAAATTCTCTTATTGTTTAGCTTCGATTGGAAACAACAGTATTGAGTCTAGCAAACTTAACTTTGGCTCTAATGCTATCGTCTCGGGCTATAAAACTGTATCAACTCCGATTTATACTAGTGATACATACAAAACTTTCTACTCGCTCAAGCTAGAAGCTATAAGCGTTGGGGAGAACACATTTGATTTTCCAGTGGTCTCTTCAAGATTAGGCGGAGAGGCAAACATCATCATTGACTCTGGCACAACGCTTACAATCCTCCCAGTGGATTTATACAATAACTTCGCAACAGCAATTTCTGACTCGATCAACCTTCAGCATGCTAATGACCCGAATCAATTCTTGGATTATTGCTATGCAACCACCACTAATGACTATAAAGTGCCGCCCGTTACAATGCACTTTGAAGGAGCTGATGTGCTCCTCCAGCGAGAAAATGTGTTCATTAGGGTGTCGGACGACGTCGTTTGCTTGGCTTTTACTGCTAGCCAGGACGAAGACAATATTTCCATATATGGAAATATTTCACAGAACAACTTCTTGGTTGGTTACGATATTAAGAACATGTCTGTTTCTTTTAAGCCGGCGGATTGTGTTGCCATGTGA

Protein sequence

MAPILSLIFLISSTAVLVAVTGRDYDFTVELIHRDSPKSPMYNPSETHYHRLANTLRRSISRNTAALTDTAEAPIYNYGGQYLMKISLGTPPFSIIAVADTGSDIIWTQCEPCPNCYEQNAPMFDPSKSTTYKNVPCSSPICSFAGQERSCSTQSECLYSITYGDRSHSQGDLAVDTVTMGSTSGRPVTFPHIAIGCGHDNAGTFDANVSGIVGLGHGPASLVNQMGPATGGKFSYCLASIGNNSIESSKLNFGSNAIVSGYKTVSTPIYTSDTYKTFYSLKLEAISVGENTFDFPVVSSRLGGEANIIIDSGTTLTILPVDLYNNFATAISDSINLQHANDPNQFLDYCYATTTNDYKVPPVTMHFEGADVLLQRENVFIRVSDDVVCLAFTASQDEDNISIYGNISQNNFLVGYDIKNMSVSFKPADCVAM
Homology
BLAST of HG10010824 vs. NCBI nr
Match: XP_038876324.1 (aspartic proteinase CDR1-like [Benincasa hispida])

HSP 1 Score: 740.0 bits (1909), Expect = 1.2e-209
Identity = 369/435 (84.83%), Postives = 395/435 (90.80%), Query Frame = 0

Query: 1   MAPILSL-IFLISSTAVLVAVTGRDYDFTVELIHRDSPKSPMYNPSETHYHRLANTLRRS 60
           MA + SL IFLISS AVL A TGR++ FTVELIHRDSPKSPMYNPSETHYHRLAN LRRS
Sbjct: 1   MASVFSLIIFLISSAAVLAAATGREFGFTVELIHRDSPKSPMYNPSETHYHRLANALRRS 60

Query: 61  ISRNTAALTDTAEAPIYNYGGQYLMKISLGTPPFSIIAVADTGSDIIWTQCEPCPNCYEQ 120
           ISRNTAA+TDTA APIYNY GQYLMKISLGTPPFSIIAVADTGSD+IWTQCEPCPNCYEQ
Sbjct: 61  ISRNTAAVTDTAVAPIYNYRGQYLMKISLGTPPFSIIAVADTGSDVIWTQCEPCPNCYEQ 120

Query: 121 NAPMFDPSKSTTYKNVPCSSPICSFAGQERSCSTQSECLYSITYGDRSHSQGDLAVDTVT 180
           +APMF+PSKSTTYKNVPCSSPICS+AG++ SCS  SECLYSI+YGDRSHSQGD AVDTVT
Sbjct: 121 SAPMFNPSKSTTYKNVPCSSPICSYAGEDSSCSAHSECLYSISYGDRSHSQGDFAVDTVT 180

Query: 181 MGSTSGRPVTFPHIAIGCGHDNAGTFDANVSGIVGLGHGPASLVNQMGPATGGKFSYCLA 240
           MGSTSG PVTFPH+AIGCGHDNAGTFDA+VSGIVGLG G ASLV+QMGPATGGKFSYCLA
Sbjct: 181 MGSTSGSPVTFPHMAIGCGHDNAGTFDASVSGIVGLGQGSASLVSQMGPATGGKFSYCLA 240

Query: 241 SIGNNSIESSKLNFGSNAIVSGYKTVSTPIYTSDTYKTFYSLKLEAISVGENTFDFPVVS 300
            IGN+S ESSKLNFGSNA VSG + VSTPIYTS  YKTFYSLKLEA+SVGEN FDFP+VS
Sbjct: 241 PIGNSSAESSKLNFGSNADVSGSEAVSTPIYTSVKYKTFYSLKLEAVSVGENKFDFPIVS 300

Query: 301 SRLGGEANIIIDSGTTLTILPVDLYNNFATAISDSINLQHANDPNQFLDYCYATTTNDYK 360
           SRLGGE NIIIDSGTTLT LPVDLYNNFAT ISDSINLQ  +DPNQFLDYC+ATTT+DY+
Sbjct: 301 SRLGGEGNIIIDSGTTLTFLPVDLYNNFATTISDSINLQRTDDPNQFLDYCFATTTDDYE 360

Query: 361 VPPVTMHFEGADVLLQRENVFIRVSDDVVCLAFTASQ-DEDNISIYGNISQNNFLVGYDI 420
            P VTMHFEGADV L RENVFIR+SDD+VCLAF ASQ D++ I IYGNISQNNFLVGYDI
Sbjct: 361 APSVTMHFEGADVPLNRENVFIRISDDIVCLAFKASQDDQEMIFIYGNISQNNFLVGYDI 420

Query: 421 KNMSVSFKPADCVAM 434
           KNM VSFK ADCVAM
Sbjct: 421 KNMVVSFKQADCVAM 435

BLAST of HG10010824 vs. NCBI nr
Match: XP_016902483.1 (PREDICTED: aspartic proteinase CDR1-like [Cucumis melo])

HSP 1 Score: 669.5 bits (1726), Expect = 2.0e-188
Identity = 336/435 (77.24%), Postives = 377/435 (86.67%), Query Frame = 0

Query: 1   MAPILSLIFLISSTAVLVAVTGRDYDFTVELIHRDSPKSPMYNPSETHYHRLANTLRRSI 60
           MAPI S++FLI STAV  A T RDY FTVELIHRDS KSPMYN SETHY R+AN LRRSI
Sbjct: 1   MAPIFSILFLI-STAVFSATTARDYGFTVELIHRDSTKSPMYNSSETHYDRIANALRRSI 60

Query: 61  SRNTAALT-DTAEAPIYNYGGQYLMKISLGTPPFSIIAVADTGSDIIWTQCEPCPNCYEQ 120
           +RN A LT DTAEAPIYN GG+YL++IS+GTPPFSI+AVADTGSD+IWTQCEPC NCY+Q
Sbjct: 61  NRNKAVLTSDTAEAPIYNNGGEYLVEISIGTPPFSILAVADTGSDVIWTQCEPCSNCYQQ 120

Query: 121 NAPMFDPSKSTTYKNVPCSSPICSFAGQERSCSTQSECLYSITYGDRSHSQGDLAVDTVT 180
           +APMFDPSKS TYKNVPCSSP+CS++G   SCS  SECLYSI YGD+SHS G+LAVDTVT
Sbjct: 121 SAPMFDPSKSATYKNVPCSSPVCSYSGDGSSCSDDSECLYSIAYGDKSHSDGNLAVDTVT 180

Query: 181 MGSTSGRPVTFPHIAIGCGHDNAGTFDANVSGIVGLGHGPASLVNQMGPATGGKFSYCLA 240
           M STSGRPV FP   IGCGHDNAGTF+ANVSGIVGLG GPASLV Q+GPATGGKFSYCL 
Sbjct: 181 MQSTSGRPVAFPRTVIGCGHDNAGTFNANVSGIVGLGRGPASLVTQLGPATGGKFSYCLM 240

Query: 241 SIGNNSIE-SSKLNFGSNAIVSGYKTVSTPIYTSDTYKTFYSLKLEAISVGENTFDFPVV 300
            IGN S+E S+KLNFGSNA VSG   VSTPIYTSD YKTFYSLKLEA+SVG+N FDFP V
Sbjct: 241 PIGNASMEDSTKLNFGSNADVSGSGAVSTPIYTSDQYKTFYSLKLEAVSVGDNKFDFPEV 300

Query: 301 SSRLGGEANIIIDSGTTLTILPVDLYNNFATAISDSINLQHANDPNQFLDYCYATTTNDY 360
           SS+LGGEANIIIDSGTTLT LP DL +NF +AI+DSINL  A DP+QFLDYC++TTT+DY
Sbjct: 301 SSKLGGEANIIIDSGTTLTYLPSDLMSNFGSAIADSINLPRAEDPSQFLDYCFSTTTDDY 360

Query: 361 KVPPVTMHFEGADVLLQRENVFIRVSDDVVCLAFTASQDEDNISIYGNISQNNFLVGYDI 420
           +VP VTMHFEGADV LQREN+FIR+S+D +CLAF A  D DNI IYGNI+Q+NFLVGYDI
Sbjct: 361 EVPSVTMHFEGADVPLQRENMFIRLSEDTICLAFGAFSD-DNIFIYGNIAQSNFLVGYDI 420

Query: 421 KNMSVSFKPADCVAM 434
           KN++VSF+PADC AM
Sbjct: 421 KNLAVSFQPADCNAM 433

BLAST of HG10010824 vs. NCBI nr
Match: KAA0039977.1 (aspartic proteinase CDR1-like [Cucumis melo var. makuwa] >TYK24525.1 aspartic proteinase CDR1-like [Cucumis melo var. makuwa])

HSP 1 Score: 668.3 bits (1723), Expect = 4.4e-188
Identity = 335/435 (77.01%), Postives = 378/435 (86.90%), Query Frame = 0

Query: 1   MAPILSLIFLISSTAVLVAVTGRDYDFTVELIHRDSPKSPMYNPSETHYHRLANTLRRSI 60
           MAPI S++FLI STAV  A T RDY FTVELIHRDS KSPMYN SETHY R+AN LRRSI
Sbjct: 1   MAPIFSILFLI-STAVFSATTARDYGFTVELIHRDSTKSPMYNSSETHYDRIANALRRSI 60

Query: 61  SRNTAALT-DTAEAPIYNYGGQYLMKISLGTPPFSIIAVADTGSDIIWTQCEPCPNCYEQ 120
           +RN A LT DTAEAPIYN GG+YL++IS+GTPPFSI+AVADTGSD+IWTQCEPC NCY+Q
Sbjct: 61  NRNKAVLTSDTAEAPIYNNGGEYLVEISIGTPPFSILAVADTGSDVIWTQCEPCSNCYQQ 120

Query: 121 NAPMFDPSKSTTYKNVPCSSPICSFAGQERSCSTQSECLYSITYGDRSHSQGDLAVDTVT 180
           +APMFDPSKS TYKNVPCSSP+CS++G   SCS  SECLYSI YGD+SHS G+LAVDTVT
Sbjct: 121 SAPMFDPSKSATYKNVPCSSPVCSYSGDGSSCSDDSECLYSIAYGDKSHSDGNLAVDTVT 180

Query: 181 MGSTSGRPVTFPHIAIGCGHDNAGTFDANVSGIVGLGHGPASLVNQMGPATGGKFSYCLA 240
           M STSGRPV+FP   IGCGHDNAGTF+ANVSGIVGLG GPASLV Q+GPATGGKFSYCL 
Sbjct: 181 MQSTSGRPVSFPRTVIGCGHDNAGTFNANVSGIVGLGRGPASLVTQLGPATGGKFSYCLM 240

Query: 241 SIGNNSIE-SSKLNFGSNAIVSGYKTVSTPIYTSDTYKTFYSLKLEAISVGENTFDFPVV 300
            IGN S+E S+KLNFGSNA VSG   VSTPIYTSD YKTFYSLKLEA+SVG+N FDFP V
Sbjct: 241 PIGNASMEDSTKLNFGSNADVSGSGAVSTPIYTSDQYKTFYSLKLEAVSVGDNKFDFPEV 300

Query: 301 SSRLGGEANIIIDSGTTLTILPVDLYNNFATAISDSINLQHANDPNQFLDYCYATTTNDY 360
           SS+LGGEANIIIDSGTTLT LP DL +NF +AI+DSINL  A DP+QFLDYC++TTT+DY
Sbjct: 301 SSKLGGEANIIIDSGTTLTYLPSDLMSNFGSAIADSINLPRAEDPSQFLDYCFSTTTDDY 360

Query: 361 KVPPVTMHFEGADVLLQRENVFIRVSDDVVCLAFTASQDEDNISIYGNISQNNFLVGYDI 420
           +VP VTMHFEGADV LQREN+FIR+S+D +CLAF A  D DNI IYGNI+Q+NFLVGYDI
Sbjct: 361 EVPSVTMHFEGADVPLQRENMFIRLSEDTICLAFGAFSD-DNIFIYGNIAQSNFLVGYDI 420

Query: 421 KNMSVSFKPADCVAM 434
           KN++VSF+PA+C AM
Sbjct: 421 KNLAVSFQPAECNAM 433

BLAST of HG10010824 vs. NCBI nr
Match: XP_038907012.1 (aspartic proteinase CDR1-like [Benincasa hispida])

HSP 1 Score: 654.1 bits (1686), Expect = 8.7e-184
Identity = 333/436 (76.38%), Postives = 371/436 (85.09%), Query Frame = 0

Query: 1   MAPILSLIFLISSTAVLVAVTGRDYDFTVELIHRDSPKSPMYNPSETHYHRLANTLRRSI 60
           MAPI SLI  ISS AV+   TGRDY F+VELIHRDSPKSPMYNPS+THYHRLA++LRRSI
Sbjct: 1   MAPIFSLILFISS-AVVSTATGRDYGFSVELIHRDSPKSPMYNPSKTHYHRLADSLRRSI 60

Query: 61  SRNTAALTDTAEAPIYNYGGQYLMKISLGTPPFSIIAVADTGSDIIWTQCEPCPNCYEQN 120
           SRNTAALTD AEAPIYN  G+YLMKIS+GTPPFSIIAVADTGSDI WTQC PC NC+ QN
Sbjct: 61  SRNTAALTDIAEAPIYNNQGEYLMKISVGTPPFSIIAVADTGSDITWTQCHPCKNCFHQN 120

Query: 121 APMFDPSKSTTYKNVPCSSPICSFAGQERSCSTQ---SECLYSITYGDRSHSQGDLAVDT 180
           APMF+PSKSTTYK V CSSPIC FAG   SCSTQ   S+C+YSITYGDRSHS+GD AVDT
Sbjct: 121 APMFNPSKSTTYKKVACSSPICLFAGDSGSCSTQSLESDCVYSITYGDRSHSKGDFAVDT 180

Query: 181 VTMGSTSGRPVTFPHIAIGCGHDNAGTFDANVSGIVGLGHGPASLVNQMGPATGGKFSYC 240
           VTMGSTSGR V FP +AIGCGHDNAGTF+ N+SGIVGLG GPASLV QMGPATGGKFSYC
Sbjct: 181 VTMGSTSGRGVAFPRMAIGCGHDNAGTFNVNISGIVGLGLGPASLVTQMGPATGGKFSYC 240

Query: 241 LASIGNNSIESSKLNFGSNAIVSGYKTVSTPIYTSDTYKTFYSLKLEAISVGENTFDFPV 300
           L  +G+++I  SKLNFGSNA VSG K VSTPIYTSD +K FYS+KLEA+SVG+N FDFPV
Sbjct: 241 LTPVGSSNIVPSKLNFGSNADVSGSKAVSTPIYTSDKFK-FYSVKLEAVSVGKNKFDFPV 300

Query: 301 VSSRLGGEANIIIDSGTTLTILPVDLYNNFATAISDSINLQHANDPNQFLDYCYATTTND 360
            S  LGG+ANII+DSG+TLT LPV+LYNNF+ AIS S+NL+  NDP Q LD+C+ TTT+D
Sbjct: 301 YSV-LGGKANIILDSGSTLTFLPVNLYNNFSKAISKSLNLKRTNDPEQLLDHCFVTTTDD 360

Query: 361 YKVPPVTMHFEGADVLLQRENVFIRVSDDVVCLAFTASQDEDNISIYGNISQNNFLVGYD 420
           YK+P +TMHFEGADV L RENVFIRVSD VVCLAF   QD+D   I+GNI+Q NFLVGYD
Sbjct: 361 YKLPLITMHFEGADVPLLRENVFIRVSDSVVCLAFGTHQDDD-FMIFGNIAQTNFLVGYD 420

Query: 421 IKNMSVSFKPADCVAM 434
           IKNM VSFKP DCVAM
Sbjct: 421 IKNMLVSFKPMDCVAM 432

BLAST of HG10010824 vs. NCBI nr
Match: XP_031744104.1 (aspartic proteinase CDR1 [Cucumis sativus] >KGN46270.1 hypothetical protein Csa_005679 [Cucumis sativus])

HSP 1 Score: 645.6 bits (1664), Expect = 3.1e-181
Identity = 319/435 (73.33%), Postives = 371/435 (85.29%), Query Frame = 0

Query: 1   MAPILSLIFLISSTAVLVAVTGRDYDFTVELIHRDSPKSPMYNPSETHYHRLANTLRRSI 60
           MAP+ SL+FLIS+ +V  AVT RDY FTVELIHRDSPKSPMYN SETH+ R+ N LRRS 
Sbjct: 1   MAPVFSLLFLISTASVFSAVTARDYGFTVELIHRDSPKSPMYNSSETHFDRIVNALRRSS 60

Query: 61  SRNTAAL-TDTAEAPIYNYGGQYLMKISLGTPPFSIIAVADTGSDIIWTQCEPCPNCYEQ 120
            RNT  L +DTAEAPI+N GG+YL++IS+GTPPFSI+AVADTGSD+IWTQC+PC NCY+Q
Sbjct: 61  HRNTVVLESDTAEAPIFNNGGEYLVEISVGTPPFSIVAVADTGSDVIWTQCKPCSNCYQQ 120

Query: 121 NAPMFDPSKSTTYKNVPCSSPICSFAGQERSCSTQSECLYSITYGDRSHSQGDLAVDTVT 180
           NAPMFDPSKSTTYKNV CSSP+CS++G   SCS  SECLYSI YGD SHSQG+LAVDTVT
Sbjct: 121 NAPMFDPSKSTTYKNVACSSPVCSYSGDGSSCSDDSECLYSIAYGDDSHSQGNLAVDTVT 180

Query: 181 MGSTSGRPVTFPHIAIGCGHDNAGTFDANVSGIVGLGHGPASLVNQMGPATGGKFSYCLA 240
           M STSGRPV FP   IGCGHDNAGTF+ANVSGIVGLG GPASLV Q+GPATGGKFSYCL 
Sbjct: 181 MQSTSGRPVAFPRTVIGCGHDNAGTFNANVSGIVGLGRGPASLVTQLGPATGGKFSYCLI 240

Query: 241 SIGNNSI-ESSKLNFGSNAIVSGYKTVSTPIYTSDTYKTFYSLKLEAISVGENTFDFPVV 300
            IG  S  +S+KLNFGSNA VSG  TVSTPIY+S  YKTFYSLKLEA+SVG+  F+FP  
Sbjct: 241 PIGTGSTNDSTKLNFGSNANVSGSGTVSTPIYSSAQYKTFYSLKLEAVSVGDTKFNFPEG 300

Query: 301 SSRLGGEANIIIDSGTTLTILPVDLYNNFATAISDSINLQHANDPNQFLDYCYATTTNDY 360
           +S+LGGE+NIIIDSGTTLT LP  L N+F +AIS S++L HA DP++FLDYC+ATTT+DY
Sbjct: 301 ASKLGGESNIIIDSGTTLTYLPSALLNSFGSAISQSMSLPHAQDPSEFLDYCFATTTDDY 360

Query: 361 KVPPVTMHFEGADVLLQRENVFIRVSDDVVCLAFTASQDEDNISIYGNISQNNFLVGYDI 420
           ++PPVTMHFEGADV LQREN+F+R+SDD +CLAF  S  +DNI IYGNI+Q+NFLVGYDI
Sbjct: 361 EMPPVTMHFEGADVPLQRENLFVRLSDDTICLAF-GSFPDDNIFIYGNIAQSNFLVGYDI 420

Query: 421 KNMSVSFKPADCVAM 434
           KN++VSF+PA C A+
Sbjct: 421 KNLAVSFQPAHCGAV 434

BLAST of HG10010824 vs. ExPASy Swiss-Prot
Match: Q6XBF8 (Aspartic proteinase CDR1 OS=Arabidopsis thaliana OX=3702 GN=CDR1 PE=1 SV=1)

HSP 1 Score: 414.5 bits (1064), Expect = 1.5e-114
Identity = 214/435 (49.20%), Postives = 287/435 (65.98%), Query Frame = 0

Query: 4   ILSLIFLISSTAVLVAVTGRDYDFTVELIHRDSPKSPMYNPSETHYHRLANTLRRSISR- 63
           +L  + L+SS  +  A       FT +LIHRDSPKSP YNP ET   RL N + RS++R 
Sbjct: 8   VLLSLCLLSSLFLSNANAKPKLGFTADLIHRDSPKSPFYNPMETSSQRLRNAIHRSVNRV 67

Query: 64  ---NTAALTDTAEAPIYNYGGQYLMKISLGTPPFSIIAVADTGSDIIWTQCEPCPNCYEQ 123
                   T   +  + +  G+YLM +S+GTPPF I+A+ADTGSD++WTQC PC +CY Q
Sbjct: 68  FHFTEKDNTPQPQIDLTSNSGEYLMNVSIGTPPFPIMAIADTGSDLLWTQCAPCDDCYTQ 127

Query: 124 NAPMFDPSKSTTYKNVPCSSPICSFAGQERSCST-QSECLYSITYGDRSHSQGDLAVDTV 183
             P+FDP  S+TYK+V CSS  C+    + SCST  + C YS++YGD S+++G++AVDT+
Sbjct: 128 VDPLFDPKTSSTYKDVSCSSSQCTALENQASCSTNDNTCSYSLSYGDNSYTKGNIAVDTL 187

Query: 184 TMGSTSGRPVTFPHIAIGCGHDNAGTFDANVSGIVGLGHGPASLVNQMGPATGGKFSYCL 243
           T+GS+  RP+   +I IGCGH+NAGTF+   SGIVGLG GP SL+ Q+G +  GKFSYCL
Sbjct: 188 TLGSSDTRPMQLKNIIIGCGHNNAGTFNKKGSGIVGLGGGPVSLIKQLGDSIDGKFSYCL 247

Query: 244 ASIGNNSIESSKLNFGSNAIVSGYKTVSTPIYTSDTYKTFYSLKLEAISVGENTFDFPVV 303
             + +   ++SK+NFG+NAIVSG   VSTP+    + +TFY L L++ISVG     +   
Sbjct: 248 VPLTSKKDQTSKINFGTNAIVSGSGVVSTPLIAKASQETFYYLTLKSISVGSKQIQYSGS 307

Query: 304 SSRLGGEANIIIDSGTTLTILPVDLYNNFATAISDSINLQHANDPNQFLDYCYATTTNDY 363
            S    E NIIIDSGTTLT+LP + Y+    A++ SI+ +   DP   L  CY + T D 
Sbjct: 308 DSE-SSEGNIIIDSGTTLTLLPTEFYSELEDAVASSIDAEKKQDPQSGLSLCY-SATGDL 367

Query: 364 KVPPVTMHFEGADVLLQRENVFIRVSDDVVCLAFTASQDEDNISIYGNISQNNFLVGYDI 423
           KVP +TMHF+GADV L   N F++VS+D+VC AF  S    + SIYGN++Q NFLVGYD 
Sbjct: 368 KVPVITMHFDGADVKLDSSNAFVQVSEDLVCFAFRGS---PSFSIYGNVAQMNFLVGYDT 427

Query: 424 KNMSVSFKPADCVAM 434
            + +VSFKP DC  M
Sbjct: 428 VSKTVSFKPTDCAKM 437

BLAST of HG10010824 vs. ExPASy Swiss-Prot
Match: Q3EBM5 (Probable aspartic protease At2g35615 OS=Arabidopsis thaliana OX=3702 GN=At2g35615 PE=3 SV=1)

HSP 1 Score: 334.3 bits (856), Expect = 2.0e-90
Identity = 187/450 (41.56%), Postives = 274/450 (60.89%), Query Frame = 0

Query: 4   ILSLIFLISSTAVLVAVTGRDYDFTVELIHRDSPKSPMYNPSETHYHRLANTLRRSISR- 63
           IL   FL  S  V ++ +G   +F+VELIHRDSP SP+YNP  T   RL     RS+SR 
Sbjct: 5   ILLCFFLFFS--VTLSSSGHPKNFSVELIHRDSPLSPIYNPQITVTDRLNAAFLRSVSRS 64

Query: 64  ---NTAALTDTAEAPIYNYGGQYLMKISLGTPPFSIIAVADTGSDIIWTQCEPCPNCYEQ 123
              N        ++ +    G++ M I++GTPP  + A+ADTGSD+ W QC+PC  CY++
Sbjct: 65  RRFNHQLSQTDLQSGLIGADGEFFMSITIGTPPIKVFAIADTGSDLTWVQCKPCQQCYKE 124

Query: 124 NAPMFDPSKSTTYKNVPCSSPIC-SFAGQERSCSTQSE-CLYSITYGDRSHSQGDLAVDT 183
           N P+FD  KS+TYK+ PC S  C + +  ER C   +  C Y  +YGD+S S+GD+A +T
Sbjct: 125 NGPIFDKKKSSTYKSEPCDSRNCQALSSTERGCDESNNICKYRYSYGDQSFSKGDVATET 184

Query: 184 VTMGSTSGRPVTFPHIAIGCGHDNAGTFDANVSGIVGLGHGPASLVNQMGPATGGKFSYC 243
           V++ S SG PV+FP    GCG++N GTFD   SGI+GLG G  SL++Q+G +   KFSYC
Sbjct: 185 VSIDSASGSPVSFPGTVFGCGYNNGGTFDETGSGIIGLGGGHLSLISQLGSSISKKFSYC 244

Query: 244 LASIGNNSIESSKLNFGSNAIVSGYK----TVSTPIYTSDTYKTFYSLKLEAISVGE--- 303
           L+     +  +S +N G+N+I S        VSTP+   +   T+Y L LEAISVG+   
Sbjct: 245 LSHKSATTNGTSVINLGTNSIPSSLSKDSGVVSTPLVDKEPL-TYYYLTLEAISVGKKKI 304

Query: 304 -------NTFDFPVVSSRLGGEANIIIDSGTTLTILPVDLYNNFATAISDSI-NLQHAND 363
                  N  D  ++S   G   NIIIDSGTTLT+L    ++ F++A+ +S+   +  +D
Sbjct: 305 PYTGSSYNPNDDGILSETSG---NIIIDSGTTLTLLEAGFFDKFSSAVEESVTGAKRVSD 364

Query: 364 PNQFLDYCYATTTNDYKVPPVTMHFEGADVLLQRENVFIRVSDDVVCLAFTASQDEDNIS 423
           P   L +C+ + + +  +P +T+HF GADV L   N F+++S+D+VCL+   + +   ++
Sbjct: 365 PQGLLSHCFKSGSAEIGLPEITVHFTGADVRLSPINAFVKLSEDMVCLSMVPTTE---VA 424

Query: 424 IYGNISQNNFLVGYDIKNMSVSFKPADCVA 433
           IYGN +Q +FLVGYD++  +VSF+  DC A
Sbjct: 425 IYGNFAQMDFLVGYDLETRTVSFQHMDCSA 445

BLAST of HG10010824 vs. ExPASy Swiss-Prot
Match: Q766C2 (Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 SV=1)

HSP 1 Score: 256.5 bits (654), Expect = 5.3e-67
Identity = 143/365 (39.18%), Postives = 214/365 (58.63%), Query Frame = 0

Query: 72  EAPIYNYGGQYLMKISLGTPPFSIIAVADTGSDIIWTQCEPCPNCYEQNAPMFDPSKSTT 131
           E P+Y   G+YLM +++GTP  S  A+ DTGSD+IWTQCEPC  C+ Q  P+F+P  S++
Sbjct: 86  ETPVYAGDGEYLMNVAIGTPDSSFSAIMDTGSDLIWTQCEPCTQCFSQPTPIFNPQDSSS 145

Query: 132 YKNVPCSSPICSFAGQERSCSTQSECLYSITYGDRSHSQGDLAVDTVTMGSTSGRPVTFP 191
           +  +PC S  C     E +C+  +EC Y+  YGD S +QG +A +T T  ++S      P
Sbjct: 146 FSTLPCESQYCQDLPSE-TCN-NNECQYTYGYGDGSTTQGYMATETFTFETSS-----VP 205

Query: 192 HIAIGCGHDNAGTFDANVSGIVGLGHGPASLVNQMGPATGGKFSYCLASIGNNSIESSKL 251
           +IA GCG DN G    N +G++G+G GP SL +Q+G    G+FSYC+ S G++S  +  L
Sbjct: 206 NIAFGCGEDNQGFGQGNGAGLIGMGWGPLSLPSQLGV---GQFSYCMTSYGSSSPSTLAL 265

Query: 252 NFGSNAIVSGYKTVSTPIYTSDTYKTFYSLKLEAISVGENTFDFPVVSSRL--GGEANII 311
              ++ +  G  + ST +  S    T+Y + L+ I+VG +    P  + +L   G   +I
Sbjct: 266 GSAASGVPEG--SPSTTLIHSSLNPTYYYITLQGITVGGDNLGIPSSTFQLQDDGTGGMI 325

Query: 312 IDSGTTLTILPVDLYNNFATAISDSINLQHANDPNQFLDYCYATTT--NDYKVPPVTMHF 371
           IDSGTTLT LP D YN  A A +D INL   ++ +  L  C+   +  +  +VP ++M F
Sbjct: 326 IDSGTTLTYLPQDAYNAVAQAFTDQINLPTVDESSSGLSTCFQQPSDGSTVQVPEISMQF 385

Query: 372 EGADVLLQRENVFIRVSDDVVCLAFTASQDEDNISIYGNISQNNFLVGYDIKNMSVSFKP 431
           +G  + L  +N+ I  ++ V+CLA   S  +  ISI+GNI Q    V YD++N++VSF P
Sbjct: 386 DGGVLNLGEQNILISPAEGVICLAM-GSSSQLGISIFGNIQQQETQVLYDLQNLAVSFVP 437

Query: 432 ADCVA 433
             C A
Sbjct: 446 TQCGA 437

BLAST of HG10010824 vs. ExPASy Swiss-Prot
Match: Q766C3 (Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 SV=1)

HSP 1 Score: 243.8 bits (621), Expect = 3.5e-63
Identity = 152/416 (36.54%), Postives = 221/416 (53.12%), Query Frame = 0

Query: 27  FTVELIHRDSPKSPMYNPSETHYHRLANTLRRS---ISRNTAALTDTA--EAPIYNYGGQ 86
           F + L H DS K      + T +  L   + R    + R  A L   +  E  +Y   G+
Sbjct: 41  FQIMLEHVDSGK------NLTKFQLLERAIERGSRRLQRLEAMLNGPSGVETSVYAGDGE 100

Query: 87  YLMKISLGTPPFSIIAVADTGSDIIWTQCEPCPNCYEQNAPMFDPSKSTTYKNVPCSSPI 146
           YLM +S+GTP     A+ DTGSD+IWTQC+PC  C+ Q+ P+F+P  S+++  +PCSS +
Sbjct: 101 YLMNLSIGTPAQPFSAIMDTGSDLIWTQCQPCTQCFNQSTPIFNPQGSSSFSTLPCSSQL 160

Query: 147 CSFAGQERSCSTQSECLYSITYGDRSHSQGDLAVDTVTMGSTSGRPVTFPHIAIGCGHDN 206
           C  A    +CS  + C Y+  YGD S +QG +  +T+T GS     V+ P+I  GCG +N
Sbjct: 161 CQ-ALSSPTCS-NNFCQYTYGYGDGSETQGSMGTETLTFGS-----VSIPNITFGCGENN 220

Query: 207 AGTFDANVSGIVGLGHGPASLVNQMGPATGGKFSYCLASIGNNSIESSKLNFGSNAIVSG 266
            G    N +G+VG+G GP SL +Q+      KFSYC+  IG+++  +  L   +N++ +G
Sbjct: 221 QGFGQGNGAGLVGMGRGPLSLPSQLDVT---KFSYCMTPIGSSTPSNLLLGSLANSVTAG 280

Query: 267 YKTVSTPIYTSDTYKTFYSLKLEAISVGENTFDFPVVSSRL---GGEANIIIDSGTTLTI 326
             + +T +  S    TFY + L  +SVG         +  L    G   IIIDSGTTLT 
Sbjct: 281 --SPNTTLIQSSQIPTFYYITLNGLSVGSTRLPIDPSAFALNSNNGTGGIIIDSGTTLTY 340

Query: 327 LPVDLYNNFATAISDSINLQHANDPNQFLDYCYATTT--NDYKVPPVTMHFEGADVLLQR 386
              + Y +        INL   N  +   D C+ T +  ++ ++P   MHF+G D+ L  
Sbjct: 341 FVNNAYQSVRQEFISQINLPVVNGSSSGFDLCFQTPSDPSNLQIPTFVMHFDGGDLELPS 400

Query: 387 ENVFIRVSDDVVCLAFTASQDEDNISIYGNISQNNFLVGYDIKNMSVSFKPADCVA 433
           EN FI  S+ ++CLA  +S     +SI+GNI Q N LV YD  N  VSF  A C A
Sbjct: 401 ENYFISPSNGLICLAMGSS--SQGMSIFGNIQQQNMLVVYDTGNSVVSFASAQCGA 436

BLAST of HG10010824 vs. ExPASy Swiss-Prot
Match: Q9LS40 (Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana OX=3702 GN=ASPG1 PE=1 SV=1)

HSP 1 Score: 213.4 bits (542), Expect = 5.1e-54
Identity = 143/410 (34.88%), Postives = 209/410 (50.98%), Query Frame = 0

Query: 27  FTVELIHRDSPKSPMYNPSETHYHRLANTLRRSISRNTAALTDTAEAPIYNYGGQYLMKI 86
           F VE + R   K P+YN  +T Y              T  LT    +      G+Y  +I
Sbjct: 122 FAVEGVDRSDLK-PVYN-EDTRY-------------QTEDLTTPVVSGASQGSGEYFSRI 181

Query: 87  SLGTPPFSIIAVADTGSDIIWTQCEPCPNCYEQNAPMFDPSKSTTYKNVPCSSPICSFAG 146
            +GTP   +  V DTGSD+ W QCEPC +CY+Q+ P+F+P+ S+TYK++ CS+P CS   
Sbjct: 182 GVGTPAKEMYLVLDTGSDVNWIQCEPCADCYQQSDPVFNPTSSSTYKSLTCSAPQCSLL- 241

Query: 147 QERSCSTQSECLYSITYGDRSHSQGDLAVDTVTMGSTSGRPVTFPHIAIGCGHDNAGTFD 206
            E S    ++CLY ++YGD S + G+LA DTVT G+ SG+     ++A+GCGHDN G F 
Sbjct: 242 -ETSACRSNKCLYQVSYGDGSFTVGELATDTVTFGN-SGK---INNVALGCGHDNEGLF- 301

Query: 207 ANVSGIVGLGHGPASLVNQMGPATGGKFSYCLASIGNNSIESSKLNFGSNAIVSGYKTVS 266
              +G++GLG G  S+ NQM   +   FSYCL  +  +S +SS L+F  N++  G    +
Sbjct: 302 TGAAGLLGLGGGVLSITNQMKATS---FSYCL--VDRDSGKSSSLDF--NSVQLGGGDAT 361

Query: 267 TPIYTSDTYKTFYSLKLEAISVGENTFDFP--VVSSRLGGEANIIIDSGTTLTILPVDLY 326
            P+  +    TFY + L   SVG      P  +      G   +I+D GT +T L    Y
Sbjct: 362 APLLRNKKIDTFYYVGLSGFSVGGEKVVLPDAIFDVDASGSGGVILDCGTAVTRLQTQAY 421

Query: 327 NNFATA-ISDSINLQHANDPNQFLDYCY-ATTTNDYKVPPVTMHFEGADVL-LQRENVFI 386
           N+   A +  ++NL+  +      D CY  ++ +  KVP V  HF G   L L  +N  I
Sbjct: 422 NSLRDAFLKLTVNLKKGSSSISLFDTCYDFSSLSTVKVPTVAFHFTGGKSLDLPAKNYLI 481

Query: 387 RVSDD-VVCLAFTASQDEDNISIYGNISQNNFLVGYDIKNMSVSFKPADC 431
            V D    C AF  +    ++SI GN+ Q    + YD+    +      C
Sbjct: 482 PVDDSGTFCFAFAPT--SSSLSIIGNVQQQGTRITYDLSKNVIGLSGNKC 500

BLAST of HG10010824 vs. ExPASy TrEMBL
Match: A0A1S4E2N4 (aspartic proteinase CDR1-like OS=Cucumis melo OX=3656 GN=LOC107991689 PE=3 SV=1)

HSP 1 Score: 669.5 bits (1726), Expect = 9.7e-189
Identity = 336/435 (77.24%), Postives = 377/435 (86.67%), Query Frame = 0

Query: 1   MAPILSLIFLISSTAVLVAVTGRDYDFTVELIHRDSPKSPMYNPSETHYHRLANTLRRSI 60
           MAPI S++FLI STAV  A T RDY FTVELIHRDS KSPMYN SETHY R+AN LRRSI
Sbjct: 1   MAPIFSILFLI-STAVFSATTARDYGFTVELIHRDSTKSPMYNSSETHYDRIANALRRSI 60

Query: 61  SRNTAALT-DTAEAPIYNYGGQYLMKISLGTPPFSIIAVADTGSDIIWTQCEPCPNCYEQ 120
           +RN A LT DTAEAPIYN GG+YL++IS+GTPPFSI+AVADTGSD+IWTQCEPC NCY+Q
Sbjct: 61  NRNKAVLTSDTAEAPIYNNGGEYLVEISIGTPPFSILAVADTGSDVIWTQCEPCSNCYQQ 120

Query: 121 NAPMFDPSKSTTYKNVPCSSPICSFAGQERSCSTQSECLYSITYGDRSHSQGDLAVDTVT 180
           +APMFDPSKS TYKNVPCSSP+CS++G   SCS  SECLYSI YGD+SHS G+LAVDTVT
Sbjct: 121 SAPMFDPSKSATYKNVPCSSPVCSYSGDGSSCSDDSECLYSIAYGDKSHSDGNLAVDTVT 180

Query: 181 MGSTSGRPVTFPHIAIGCGHDNAGTFDANVSGIVGLGHGPASLVNQMGPATGGKFSYCLA 240
           M STSGRPV FP   IGCGHDNAGTF+ANVSGIVGLG GPASLV Q+GPATGGKFSYCL 
Sbjct: 181 MQSTSGRPVAFPRTVIGCGHDNAGTFNANVSGIVGLGRGPASLVTQLGPATGGKFSYCLM 240

Query: 241 SIGNNSIE-SSKLNFGSNAIVSGYKTVSTPIYTSDTYKTFYSLKLEAISVGENTFDFPVV 300
            IGN S+E S+KLNFGSNA VSG   VSTPIYTSD YKTFYSLKLEA+SVG+N FDFP V
Sbjct: 241 PIGNASMEDSTKLNFGSNADVSGSGAVSTPIYTSDQYKTFYSLKLEAVSVGDNKFDFPEV 300

Query: 301 SSRLGGEANIIIDSGTTLTILPVDLYNNFATAISDSINLQHANDPNQFLDYCYATTTNDY 360
           SS+LGGEANIIIDSGTTLT LP DL +NF +AI+DSINL  A DP+QFLDYC++TTT+DY
Sbjct: 301 SSKLGGEANIIIDSGTTLTYLPSDLMSNFGSAIADSINLPRAEDPSQFLDYCFSTTTDDY 360

Query: 361 KVPPVTMHFEGADVLLQRENVFIRVSDDVVCLAFTASQDEDNISIYGNISQNNFLVGYDI 420
           +VP VTMHFEGADV LQREN+FIR+S+D +CLAF A  D DNI IYGNI+Q+NFLVGYDI
Sbjct: 361 EVPSVTMHFEGADVPLQRENMFIRLSEDTICLAFGAFSD-DNIFIYGNIAQSNFLVGYDI 420

Query: 421 KNMSVSFKPADCVAM 434
           KN++VSF+PADC AM
Sbjct: 421 KNLAVSFQPADCNAM 433

BLAST of HG10010824 vs. ExPASy TrEMBL
Match: A0A5D3DLM9 (Aspartic proteinase CDR1-like OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold266G00670 PE=3 SV=1)

HSP 1 Score: 668.3 bits (1723), Expect = 2.2e-188
Identity = 335/435 (77.01%), Postives = 378/435 (86.90%), Query Frame = 0

Query: 1   MAPILSLIFLISSTAVLVAVTGRDYDFTVELIHRDSPKSPMYNPSETHYHRLANTLRRSI 60
           MAPI S++FLI STAV  A T RDY FTVELIHRDS KSPMYN SETHY R+AN LRRSI
Sbjct: 1   MAPIFSILFLI-STAVFSATTARDYGFTVELIHRDSTKSPMYNSSETHYDRIANALRRSI 60

Query: 61  SRNTAALT-DTAEAPIYNYGGQYLMKISLGTPPFSIIAVADTGSDIIWTQCEPCPNCYEQ 120
           +RN A LT DTAEAPIYN GG+YL++IS+GTPPFSI+AVADTGSD+IWTQCEPC NCY+Q
Sbjct: 61  NRNKAVLTSDTAEAPIYNNGGEYLVEISIGTPPFSILAVADTGSDVIWTQCEPCSNCYQQ 120

Query: 121 NAPMFDPSKSTTYKNVPCSSPICSFAGQERSCSTQSECLYSITYGDRSHSQGDLAVDTVT 180
           +APMFDPSKS TYKNVPCSSP+CS++G   SCS  SECLYSI YGD+SHS G+LAVDTVT
Sbjct: 121 SAPMFDPSKSATYKNVPCSSPVCSYSGDGSSCSDDSECLYSIAYGDKSHSDGNLAVDTVT 180

Query: 181 MGSTSGRPVTFPHIAIGCGHDNAGTFDANVSGIVGLGHGPASLVNQMGPATGGKFSYCLA 240
           M STSGRPV+FP   IGCGHDNAGTF+ANVSGIVGLG GPASLV Q+GPATGGKFSYCL 
Sbjct: 181 MQSTSGRPVSFPRTVIGCGHDNAGTFNANVSGIVGLGRGPASLVTQLGPATGGKFSYCLM 240

Query: 241 SIGNNSIE-SSKLNFGSNAIVSGYKTVSTPIYTSDTYKTFYSLKLEAISVGENTFDFPVV 300
            IGN S+E S+KLNFGSNA VSG   VSTPIYTSD YKTFYSLKLEA+SVG+N FDFP V
Sbjct: 241 PIGNASMEDSTKLNFGSNADVSGSGAVSTPIYTSDQYKTFYSLKLEAVSVGDNKFDFPEV 300

Query: 301 SSRLGGEANIIIDSGTTLTILPVDLYNNFATAISDSINLQHANDPNQFLDYCYATTTNDY 360
           SS+LGGEANIIIDSGTTLT LP DL +NF +AI+DSINL  A DP+QFLDYC++TTT+DY
Sbjct: 301 SSKLGGEANIIIDSGTTLTYLPSDLMSNFGSAIADSINLPRAEDPSQFLDYCFSTTTDDY 360

Query: 361 KVPPVTMHFEGADVLLQRENVFIRVSDDVVCLAFTASQDEDNISIYGNISQNNFLVGYDI 420
           +VP VTMHFEGADV LQREN+FIR+S+D +CLAF A  D DNI IYGNI+Q+NFLVGYDI
Sbjct: 361 EVPSVTMHFEGADVPLQRENMFIRLSEDTICLAFGAFSD-DNIFIYGNIAQSNFLVGYDI 420

Query: 421 KNMSVSFKPADCVAM 434
           KN++VSF+PA+C AM
Sbjct: 421 KNLAVSFQPAECNAM 433

BLAST of HG10010824 vs. ExPASy TrEMBL
Match: A0A0A0K928 (Peptidase A1 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G078650 PE=3 SV=1)

HSP 1 Score: 645.6 bits (1664), Expect = 1.5e-181
Identity = 319/435 (73.33%), Postives = 371/435 (85.29%), Query Frame = 0

Query: 1   MAPILSLIFLISSTAVLVAVTGRDYDFTVELIHRDSPKSPMYNPSETHYHRLANTLRRSI 60
           MAP+ SL+FLIS+ +V  AVT RDY FTVELIHRDSPKSPMYN SETH+ R+ N LRRS 
Sbjct: 1   MAPVFSLLFLISTASVFSAVTARDYGFTVELIHRDSPKSPMYNSSETHFDRIVNALRRSS 60

Query: 61  SRNTAAL-TDTAEAPIYNYGGQYLMKISLGTPPFSIIAVADTGSDIIWTQCEPCPNCYEQ 120
            RNT  L +DTAEAPI+N GG+YL++IS+GTPPFSI+AVADTGSD+IWTQC+PC NCY+Q
Sbjct: 61  HRNTVVLESDTAEAPIFNNGGEYLVEISVGTPPFSIVAVADTGSDVIWTQCKPCSNCYQQ 120

Query: 121 NAPMFDPSKSTTYKNVPCSSPICSFAGQERSCSTQSECLYSITYGDRSHSQGDLAVDTVT 180
           NAPMFDPSKSTTYKNV CSSP+CS++G   SCS  SECLYSI YGD SHSQG+LAVDTVT
Sbjct: 121 NAPMFDPSKSTTYKNVACSSPVCSYSGDGSSCSDDSECLYSIAYGDDSHSQGNLAVDTVT 180

Query: 181 MGSTSGRPVTFPHIAIGCGHDNAGTFDANVSGIVGLGHGPASLVNQMGPATGGKFSYCLA 240
           M STSGRPV FP   IGCGHDNAGTF+ANVSGIVGLG GPASLV Q+GPATGGKFSYCL 
Sbjct: 181 MQSTSGRPVAFPRTVIGCGHDNAGTFNANVSGIVGLGRGPASLVTQLGPATGGKFSYCLI 240

Query: 241 SIGNNSI-ESSKLNFGSNAIVSGYKTVSTPIYTSDTYKTFYSLKLEAISVGENTFDFPVV 300
            IG  S  +S+KLNFGSNA VSG  TVSTPIY+S  YKTFYSLKLEA+SVG+  F+FP  
Sbjct: 241 PIGTGSTNDSTKLNFGSNANVSGSGTVSTPIYSSAQYKTFYSLKLEAVSVGDTKFNFPEG 300

Query: 301 SSRLGGEANIIIDSGTTLTILPVDLYNNFATAISDSINLQHANDPNQFLDYCYATTTNDY 360
           +S+LGGE+NIIIDSGTTLT LP  L N+F +AIS S++L HA DP++FLDYC+ATTT+DY
Sbjct: 301 ASKLGGESNIIIDSGTTLTYLPSALLNSFGSAISQSMSLPHAQDPSEFLDYCFATTTDDY 360

Query: 361 KVPPVTMHFEGADVLLQRENVFIRVSDDVVCLAFTASQDEDNISIYGNISQNNFLVGYDI 420
           ++PPVTMHFEGADV LQREN+F+R+SDD +CLAF  S  +DNI IYGNI+Q+NFLVGYDI
Sbjct: 361 EMPPVTMHFEGADVPLQRENLFVRLSDDTICLAF-GSFPDDNIFIYGNIAQSNFLVGYDI 420

Query: 421 KNMSVSFKPADCVAM 434
           KN++VSF+PA C A+
Sbjct: 421 KNLAVSFQPAHCGAV 434

BLAST of HG10010824 vs. ExPASy TrEMBL
Match: A0A0A0K9V4 (Peptidase A1 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G078630 PE=3 SV=1)

HSP 1 Score: 635.6 bits (1638), Expect = 1.5e-178
Identity = 315/436 (72.25%), Postives = 364/436 (83.49%), Query Frame = 0

Query: 1   MAPILSLIFLI---SSTAVLVAVTGRDYDFTVELIHRDSPKSPMYNPSETHYHRLANTLR 60
           MAPI SL+ +I    STAV+ A TG DY FTVELIHRDSPKSPMYNP E HYHR+A+TLR
Sbjct: 1   MAPIFSLVIVIIFLISTAVVSAATGPDYGFTVELIHRDSPKSPMYNPLENHYHRVADTLR 60

Query: 61  RSISRNTAALTDTAEAPIYNYGGQYLMKISLGTPPFSIIAVADTGSDIIWTQCEPCPNCY 120
           RSIS NT  +T+T EAPIYN  G+YLMK+S+GTPPF IIAVADTGSDIIWTQCEPC NCY
Sbjct: 61  RSISHNTGLVTNTVEAPIYNNRGEYLMKLSVGTPPFPIIAVADTGSDIIWTQCEPCTNCY 120

Query: 121 EQNAPMFDPSKSTTYKNVPCSSPICSFAGQERSCSTQSECLYSITYGDRSHSQGDLAVDT 180
           +Q+ PMF+PSKSTTY+ V CSSP+CSF G++ SCS + +C YSI+YGD SHSQGD AVDT
Sbjct: 121 QQDLPMFNPSKSTTYRKVSCSSPVCSFTGEDNSCSFKPDCTYSISYGDNSHSQGDFAVDT 180

Query: 181 VTMGSTSGRPVTFPHIAIGCGHDNAGTFDANVSGIVGLGHGPASLVNQMGPATGGKFSYC 240
           +TMGSTSGR V FP  AIGCGHDNAG+FDANVSGIVGLG GPASL+ QMG A GGKFSYC
Sbjct: 181 LTMGSTSGRVVAFPRTAIGCGHDNAGSFDANVSGIVGLGLGPASLIKQMGSAVGGKFSYC 240

Query: 241 LASIGNNSIESSKLNFGSNAIVSGYKTVSTPIYTSDTYKTFYSLKLEAISVGENTFDFPV 300
           L  IGN+   S+KLNFGSNA VSG   VSTPIY SD +K+FYSLKL+A+SVG N   +  
Sbjct: 241 LTPIGNDDGGSNKLNFGSNANVSGSGAVSTPIYISDKFKSFYSLKLKAVSVGRNNTFYST 300

Query: 301 VSSRLGGEANIIIDSGTTLTILPVDLYNNFATAISDSINLQHANDPNQFLDYCYATTTND 360
            +S LGG+ANIIIDSGTTLT+LPVDLY+NFA AIS+SINLQ  +DPNQFL+YC+ TTT+D
Sbjct: 301 ANSILGGKANIIIDSGTTLTLLPVDLYHNFAKAISNSINLQRTDDPNQFLEYCFETTTDD 360

Query: 361 YKVPPVTMHFEGADVLLQRENVFIRVSDDVVCLAFTASQDEDNISIYGNISQNNFLVGYD 420
           YKVP + MHFEGA++ LQRENV IRVSD+V+CLAF  +QD D ISIYGNI+Q NFLVGYD
Sbjct: 361 YKVPFIAMHFEGANLRLQRENVLIRVSDNVICLAFAGAQDND-ISIYGNIAQINFLVGYD 420

Query: 421 IKNMSVSFKPADCVAM 434
           + NMS+SFKP +CVAM
Sbjct: 421 VTNMSLSFKPMNCVAM 435

BLAST of HG10010824 vs. ExPASy TrEMBL
Match: A0A6J1HGT9 (aspartic proteinase CDR1-like OS=Cucurbita moschata OX=3662 GN=LOC111464204 PE=3 SV=1)

HSP 1 Score: 606.3 bits (1562), Expect = 1.0e-169
Identity = 310/434 (71.43%), Postives = 357/434 (82.26%), Query Frame = 0

Query: 1   MAPILSLIFLISSTAVLVAVTGRDYDFTVELIHRDSPKSPMYNPSETHYHRLANTLRRSI 60
           MA I SLIFLISS AV  AV G +Y F+VE+IHRDSPKSPMYNPSETHYHRLANTLRRSI
Sbjct: 1   MALIFSLIFLISS-AVFAAVNG-EYGFSVEMIHRDSPKSPMYNPSETHYHRLANTLRRSI 60

Query: 61  SRNTA-ALTDTAEAPIYNYGGQYLMKISLGTPPFSIIAVADTGSDIIWTQCEPCPNCYEQ 120
             N A AL DTAEAP++N  G+YL+++SLGTPPF I+A+ADTGSDI+WTQC+PCP CYEQ
Sbjct: 61  LLNKAVALLDTAEAPMFNDRGEYLVEVSLGTPPFPILAIADTGSDIVWTQCQPCPKCYEQ 120

Query: 121 NAPMFDPSKSTTYKNVPCSSPICSFAGQERSCSTQSECLYSITYGDRSHSQGDLAVDTVT 180
            APMFDPSKS+TYK +PCSSP C+ AGQERSCS +S C YSI+YGD SHS GD AVDTVT
Sbjct: 121 TAPMFDPSKSSTYKIIPCSSPSCALAGQERSCSDRSGCQYSISYGDGSHSNGDFAVDTVT 180

Query: 181 MGSTSGRPVTFPHIAIGCGHDNAGTFDANVSGIVGLGHGPASLVNQMGPATGGKFSYCLA 240
           MGSTSGRPV FP   +GCGHD+AGTF  NVSGIVGLG GPASLV QMG A+GGKFSYCL 
Sbjct: 181 MGSTSGRPVAFPRTVVGCGHDSAGTFSTNVSGIVGLGRGPASLVPQMGAASGGKFSYCLT 240

Query: 241 SIGNNSIESSKLNFGSNAIVSGYKTVSTPIYTSDTYKTFYSLKLEAISVGENTFDFPVVS 300
            IG +S ESSKLNFGSNA V+G  TVSTPI TSD + +FYSL +EA+SVG   F+FP  S
Sbjct: 241 PIG-DSAESSKLNFGSNAQVAGSGTVSTPIKTSDRFNSFYSLNIEAMSVGGKRFEFPAAS 300

Query: 301 SRLGGEANIIIDSGTTLTILPVDLYNNFATAISDSINLQHANDPNQFLDYCYATTTNDYK 360
           + LG  AN+IIDSGTTLTILP + Y+ FATAISDSI+L+   DPNQFLD+C+ TT  D++
Sbjct: 301 A-LGDGANVIIDSGTTLTILPTEFYSTFATAISDSISLERTEDPNQFLDFCFKTTNLDFE 360

Query: 361 VPPVTMHFEGADVLLQRENVFIRVSDDVVCLAFTASQDEDNISIYGNISQNNFLVGYDIK 420
           VP VT+HFEGADV L+RENVF+ V+++VVCLAF    D  +ISIYGNI+QNNFLVGYD+ 
Sbjct: 361 VPSVTVHFEGADVPLRRENVFVMVAENVVCLAFRGG-DGQSISIYGNIAQNNFLVGYDVT 420

Query: 421 NMSVSFKPADCVAM 434
             SVSFKPADC AM
Sbjct: 421 RNSVSFKPADCSAM 429

BLAST of HG10010824 vs. TAIR 10
Match: AT5G33340.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 414.5 bits (1064), Expect = 1.1e-115
Identity = 214/435 (49.20%), Postives = 287/435 (65.98%), Query Frame = 0

Query: 4   ILSLIFLISSTAVLVAVTGRDYDFTVELIHRDSPKSPMYNPSETHYHRLANTLRRSISR- 63
           +L  + L+SS  +  A       FT +LIHRDSPKSP YNP ET   RL N + RS++R 
Sbjct: 8   VLLSLCLLSSLFLSNANAKPKLGFTADLIHRDSPKSPFYNPMETSSQRLRNAIHRSVNRV 67

Query: 64  ---NTAALTDTAEAPIYNYGGQYLMKISLGTPPFSIIAVADTGSDIIWTQCEPCPNCYEQ 123
                   T   +  + +  G+YLM +S+GTPPF I+A+ADTGSD++WTQC PC +CY Q
Sbjct: 68  FHFTEKDNTPQPQIDLTSNSGEYLMNVSIGTPPFPIMAIADTGSDLLWTQCAPCDDCYTQ 127

Query: 124 NAPMFDPSKSTTYKNVPCSSPICSFAGQERSCST-QSECLYSITYGDRSHSQGDLAVDTV 183
             P+FDP  S+TYK+V CSS  C+    + SCST  + C YS++YGD S+++G++AVDT+
Sbjct: 128 VDPLFDPKTSSTYKDVSCSSSQCTALENQASCSTNDNTCSYSLSYGDNSYTKGNIAVDTL 187

Query: 184 TMGSTSGRPVTFPHIAIGCGHDNAGTFDANVSGIVGLGHGPASLVNQMGPATGGKFSYCL 243
           T+GS+  RP+   +I IGCGH+NAGTF+   SGIVGLG GP SL+ Q+G +  GKFSYCL
Sbjct: 188 TLGSSDTRPMQLKNIIIGCGHNNAGTFNKKGSGIVGLGGGPVSLIKQLGDSIDGKFSYCL 247

Query: 244 ASIGNNSIESSKLNFGSNAIVSGYKTVSTPIYTSDTYKTFYSLKLEAISVGENTFDFPVV 303
             + +   ++SK+NFG+NAIVSG   VSTP+    + +TFY L L++ISVG     +   
Sbjct: 248 VPLTSKKDQTSKINFGTNAIVSGSGVVSTPLIAKASQETFYYLTLKSISVGSKQIQYSGS 307

Query: 304 SSRLGGEANIIIDSGTTLTILPVDLYNNFATAISDSINLQHANDPNQFLDYCYATTTNDY 363
            S    E NIIIDSGTTLT+LP + Y+    A++ SI+ +   DP   L  CY + T D 
Sbjct: 308 DSE-SSEGNIIIDSGTTLTLLPTEFYSELEDAVASSIDAEKKQDPQSGLSLCY-SATGDL 367

Query: 364 KVPPVTMHFEGADVLLQRENVFIRVSDDVVCLAFTASQDEDNISIYGNISQNNFLVGYDI 423
           KVP +TMHF+GADV L   N F++VS+D+VC AF  S    + SIYGN++Q NFLVGYD 
Sbjct: 368 KVPVITMHFDGADVKLDSSNAFVQVSEDLVCFAFRGS---PSFSIYGNVAQMNFLVGYDT 427

Query: 424 KNMSVSFKPADCVAM 434
            + +VSFKP DC  M
Sbjct: 428 VSKTVSFKPTDCAKM 437

BLAST of HG10010824 vs. TAIR 10
Match: AT1G64830.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 388.3 bits (996), Expect = 8.3e-108
Identity = 214/438 (48.86%), Postives = 292/438 (66.67%), Query Frame = 0

Query: 6   SLIF-LISSTAVLVAVTGRDYD-FTVELIHRDSPKSPMYNPSETHYHRLANTLRRSISRN 65
           SLIF  + S  +L  V     D FT++LIHRDSPKSP YN +ET   R+ N +RRS +R+
Sbjct: 3   SLIFATLLSLLLLSNVNAYPKDGFTIDLIHRDSPKSPFYNSAETSSQRMRNAIRRS-ARS 62

Query: 66  TAALTDTAEAP------IYNYGGQYLMKISLGTPPFSIIAVADTGSDIIWTQCEPCPNCY 125
           T   ++   +P      I +  G+YLM IS+GTPP  I+A+ADTGSD+IWTQC PC +CY
Sbjct: 63  TLQFSNDDASPNSPQSFITSNRGEYLMNISIGTPPVPILAIADTGSDLIWTQCNPCEDCY 122

Query: 126 EQNAPMFDPSKSTTYKNVPCSSPICSFAGQERSCST-QSECLYSITYGDRSHSQGDLAVD 185
           +Q +P+FDP +S+TY+ V CSS  C  A ++ SCST ++ C Y+ITYGD S+++GD+AVD
Sbjct: 123 QQTSPLFDPKESSTYRKVSCSSSQCR-ALEDASCSTDENTCSYTITYGDNSYTKGDVAVD 182

Query: 186 TVTMGSTSGRPVTFPHIAIGCGHDNAGTFDANVSGIVGLGHGPASLVNQMGPATGGKFSY 245
           TVTMGS+  RPV+  ++ IGCGH+N GTFD   SGI+GLG G  SLV+Q+  +  GKFSY
Sbjct: 183 TVTMGSSGRRPVSLRNMIIGCGHENTGTFDPAGSGIIGLGGGSTSLVSQLRKSINGKFSY 242

Query: 246 CLASIGNNSIESSKLNFGSNAIVSGYKTVSTPIYTSDTYKTFYSLKLEAISVGENTFDFP 305
           CL    + +  +SK+NFG+N IVSG   VST +   D   T+Y L LEAISVG     F 
Sbjct: 243 CLVPFTSETGLTSKINFGTNGIVSGDGVVSTSMVKKDP-ATYYFLNLEAISVGSKKIQF- 302

Query: 306 VVSSRLG-GEANIIIDSGTTLTILPVDLYNNFATAISDSINLQHANDPNQFLDYCYATTT 365
             S+  G GE NI+IDSGTTLT+LP + Y    + ++ +I  +   DP+  L  CY  ++
Sbjct: 303 -TSTIFGTGEGNIVIDSGTTLTLLPSNFYYELESVVASTIKAERVQDPDGILSLCYRDSS 362

Query: 366 NDYKVPPVTMHFEGADVLLQRENVFIRVSDDVVCLAFTASQDEDNISIYGNISQNNFLVG 425
           + +KVP +T+HF+G DV L   N F+ VS+DV C AF A+   + ++I+GN++Q NFLVG
Sbjct: 363 S-FKVPDITVHFKGGDVKLGNLNTFVAVSEDVSCFAFAAN---EQLTIFGNLAQMNFLVG 422

Query: 426 YDIKNMSVSFKPADCVAM 434
           YD  + +VSFK  DC  M
Sbjct: 423 YDTVSGTVSFKKTDCSQM 431

BLAST of HG10010824 vs. TAIR 10
Match: AT1G31450.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 342.0 bits (876), Expect = 6.8e-94
Identity = 179/418 (42.82%), Postives = 263/418 (62.92%), Query Frame = 0

Query: 26  DFTVELIHRDSPKSPMYNPSETHYHRLANTLRRSISRNTAALTDT-AEAPIYNYGGQYLM 85
           + TVELIHRDSP SP+YNP  T   RL     RSISR+    T T  ++ + + GG+Y M
Sbjct: 28  NLTVELIHRDSPHSPLYNPHHTVSDRLNAAFLRSISRSRRFTTKTDLQSGLISNGGEYFM 87

Query: 86  KISLGTPPFSIIAVADTGSDIIWTQCEPCPNCYEQNAPMFDPSKSTTYKNVPCSSPIC-S 145
            IS+GTPP  + A+ADTGSD+ W QC+PC  CY+QN+P+FD  KS+TYK   C S  C +
Sbjct: 88  SISIGTPPSKVFAIADTGSDLTWVQCKPCQQCYKQNSPLFDKKKSSTYKTESCDSKTCQA 147

Query: 146 FAGQERSCSTQSE-CLYSITYGDRSHSQGDLAVDTVTMGSTSGRPVTFPHIAIGCGHDNA 205
            +  E  C    + C Y  +YGD S ++GD+A +T+++ S+SG  V+FP    GCG++N 
Sbjct: 148 LSEHEEGCDESKDICKYRYSYGDNSFTKGDVATETISIDSSSGSSVSFPGTVFGCGYNNG 207

Query: 206 GTFDANVSGIVGLGHGPASLVNQMGPATGGKFSYCLASIGNNSIESSKLNFGSNAIVSG- 265
           GTF+   SGI+GLG GP SLV+Q+G + G KFSYCL+     +  +S +N G+N+I S  
Sbjct: 208 GTFEETGSGIIGLGGGPLSLVSQLGSSIGKKFSYCLSHTAATTNGTSVINLGTNSIPSNP 267

Query: 266 ---YKTVSTPIYTSDTYKTFYSLKLEAISVGENTFDFPVVSSRLGGEA-----NIIIDSG 325
                T++TP+   D  +T+Y L LEA++VG+    +      L G++     NIIIDSG
Sbjct: 268 SKDSATLTTPLIQKDP-ETYYFLTLEAVTVGKTKLPYTGGGYGLNGKSSKRTGNIIIDSG 327

Query: 326 TTLTILPVDLYNNFATAISDSI-NLQHANDPNQFLDYCYATTTNDYKVPPVTMHFEGADV 385
           TTLT+L    Y++F TA+ +S+   +  +DP   L +C+ +   +  +P +TMHF  ADV
Sbjct: 328 TTLTLLDSGFYDDFGTAVEESVTGAKRVSDPQGLLTHCFKSGDKEIGLPAITMHFTNADV 387

Query: 386 LLQRENVFIRVSDDVVCLAFTASQDEDNISIYGNISQNNFLVGYDIKNMSVSFKPADC 431
            L   N F+++++D VCL+   + +   ++IYGN+ Q +FLVGYD++  +VSF+  DC
Sbjct: 388 KLSPINAFVKLNEDTVCLSMIPTTE---VAIYGNMVQMDFLVGYDLETKTVSFQRMDC 441

BLAST of HG10010824 vs. TAIR 10
Match: AT2G35615.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 334.3 bits (856), Expect = 1.4e-91
Identity = 187/450 (41.56%), Postives = 274/450 (60.89%), Query Frame = 0

Query: 4   ILSLIFLISSTAVLVAVTGRDYDFTVELIHRDSPKSPMYNPSETHYHRLANTLRRSISR- 63
           IL   FL  S  V ++ +G   +F+VELIHRDSP SP+YNP  T   RL     RS+SR 
Sbjct: 5   ILLCFFLFFS--VTLSSSGHPKNFSVELIHRDSPLSPIYNPQITVTDRLNAAFLRSVSRS 64

Query: 64  ---NTAALTDTAEAPIYNYGGQYLMKISLGTPPFSIIAVADTGSDIIWTQCEPCPNCYEQ 123
              N        ++ +    G++ M I++GTPP  + A+ADTGSD+ W QC+PC  CY++
Sbjct: 65  RRFNHQLSQTDLQSGLIGADGEFFMSITIGTPPIKVFAIADTGSDLTWVQCKPCQQCYKE 124

Query: 124 NAPMFDPSKSTTYKNVPCSSPIC-SFAGQERSCSTQSE-CLYSITYGDRSHSQGDLAVDT 183
           N P+FD  KS+TYK+ PC S  C + +  ER C   +  C Y  +YGD+S S+GD+A +T
Sbjct: 125 NGPIFDKKKSSTYKSEPCDSRNCQALSSTERGCDESNNICKYRYSYGDQSFSKGDVATET 184

Query: 184 VTMGSTSGRPVTFPHIAIGCGHDNAGTFDANVSGIVGLGHGPASLVNQMGPATGGKFSYC 243
           V++ S SG PV+FP    GCG++N GTFD   SGI+GLG G  SL++Q+G +   KFSYC
Sbjct: 185 VSIDSASGSPVSFPGTVFGCGYNNGGTFDETGSGIIGLGGGHLSLISQLGSSISKKFSYC 244

Query: 244 LASIGNNSIESSKLNFGSNAIVSGYK----TVSTPIYTSDTYKTFYSLKLEAISVGE--- 303
           L+     +  +S +N G+N+I S        VSTP+   +   T+Y L LEAISVG+   
Sbjct: 245 LSHKSATTNGTSVINLGTNSIPSSLSKDSGVVSTPLVDKEPL-TYYYLTLEAISVGKKKI 304

Query: 304 -------NTFDFPVVSSRLGGEANIIIDSGTTLTILPVDLYNNFATAISDSI-NLQHAND 363
                  N  D  ++S   G   NIIIDSGTTLT+L    ++ F++A+ +S+   +  +D
Sbjct: 305 PYTGSSYNPNDDGILSETSG---NIIIDSGTTLTLLEAGFFDKFSSAVEESVTGAKRVSD 364

Query: 364 PNQFLDYCYATTTNDYKVPPVTMHFEGADVLLQRENVFIRVSDDVVCLAFTASQDEDNIS 423
           P   L +C+ + + +  +P +T+HF GADV L   N F+++S+D+VCL+   + +   ++
Sbjct: 365 PQGLLSHCFKSGSAEIGLPEITVHFTGADVRLSPINAFVKLSEDMVCLSMVPTTE---VA 424

Query: 424 IYGNISQNNFLVGYDIKNMSVSFKPADCVA 433
           IYGN +Q +FLVGYD++  +VSF+  DC A
Sbjct: 425 IYGNFAQMDFLVGYDLETRTVSFQHMDCSA 445

BLAST of HG10010824 vs. TAIR 10
Match: AT2G28010.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 271.9 bits (694), Expect = 8.6e-73
Identity = 172/431 (39.91%), Postives = 240/431 (55.68%), Query Frame = 0

Query: 5   LSLIFLISSTAVLVAVTGRDYDFTVELIHRDSPKSPMYNPSETHYHRLANTLRRSISRNT 64
           +SL FL ++TA         + FT++LIHR S  S   + +++     ANT+  +     
Sbjct: 14  ISLCFLFTTTA------SPPHGFTMDLIHRRSNASSRVSNTQSGSSPYANTVFDN----- 73

Query: 65  AALTDTAEAPIYNYGGQYLMKISLGTPPFSIIAVADTGSDIIWTQCEPCPNCYEQNAPMF 124
                            YLMK+ +GTPPF I A+ DTGS+I WTQC PC +CYEQNAP+F
Sbjct: 74  ---------------SVYLMKLQVGTPPFEIQAIIDTGSEITWTQCLPCVHCYEQNAPIF 133

Query: 125 DPSKSTTYKNVPCSSPICSFAGQERSCSTQSECLYSITYGDRSHSQGDLAVDTVTMGSTS 184
           DPSKS+T+K              E+ C   S C Y + Y D +++ G LA +T+T+ STS
Sbjct: 134 DPSKSSTFK--------------EKRCDGHS-CPYEVDYFDHTYTMGTLATETITLHSTS 193

Query: 185 GRPVTFPHIAIGCGHDNAGTFDANVSGIVGLGHGPASLVNQMGPATGGKFSYCLASIGNN 244
           G P   P   IGCGH+N+  F  + SG+VGL  GP+SL+ QMG    G  SYC +  G  
Sbjct: 194 GEPFVMPETIIGCGHNNS-WFKPSFSGMVGLNWGPSSLITQMGGEYPGLMSYCFSGQG-- 253

Query: 245 SIESSKLNFGSNAIVSGYKTVSTPIYTSDTYKTFYSLKLEAISVGENTFDFPVVSSRLGG 304
              +SK+NFG+NAIV+G   VST ++ +     FY L L+A+SVG NT    + ++    
Sbjct: 254 ---TSKINFGANAIVAGDGVVSTTMFMTTAKPGFYYLNLDAVSVG-NTRIETMGTTFHAL 313

Query: 305 EANIIIDSGTTLTILPVDLYNNFATAISDSINLQHANDPNQFLDYCYATTTNDYKVPPVT 364
           E NI+IDSGTTLT  PV   N    A+   +    A DP      CY + T D   P +T
Sbjct: 314 EGNIVIDSGTTLTYFPVSYCNLVRQAVEHVVTAVRAADPTGNDMLCYNSDTIDI-FPVIT 373

Query: 365 MHFE-GADVLLQRENVFIRVSD-DVVCLAFTASQDEDNISIYGNISQNNFLVGYDIKNMS 424
           MHF  G D++L + N+++  ++  V CLA   +      +I+GN +QNNFLVGYD  ++ 
Sbjct: 374 MHFSGGVDLVLDKYNMYMESNNGGVFCLAIICNSPTQE-AIFGNRAQNNFLVGYDSSSLL 394

Query: 425 VSFKPADCVAM 434
           VSF P +C A+
Sbjct: 434 VSFSPTNCSAL 394

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038876324.11.2e-20984.83aspartic proteinase CDR1-like [Benincasa hispida][more]
XP_016902483.12.0e-18877.24PREDICTED: aspartic proteinase CDR1-like [Cucumis melo][more]
KAA0039977.14.4e-18877.01aspartic proteinase CDR1-like [Cucumis melo var. makuwa] >TYK24525.1 aspartic pr... [more]
XP_038907012.18.7e-18476.38aspartic proteinase CDR1-like [Benincasa hispida][more]
XP_031744104.13.1e-18173.33aspartic proteinase CDR1 [Cucumis sativus] >KGN46270.1 hypothetical protein Csa_... [more]
Match NameE-valueIdentityDescription
Q6XBF81.5e-11449.20Aspartic proteinase CDR1 OS=Arabidopsis thaliana OX=3702 GN=CDR1 PE=1 SV=1[more]
Q3EBM52.0e-9041.56Probable aspartic protease At2g35615 OS=Arabidopsis thaliana OX=3702 GN=At2g3561... [more]
Q766C25.3e-6739.18Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 S... [more]
Q766C33.5e-6336.54Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 S... [more]
Q9LS405.1e-5434.88Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana OX=3702 GN=ASP... [more]
Match NameE-valueIdentityDescription
A0A1S4E2N49.7e-18977.24aspartic proteinase CDR1-like OS=Cucumis melo OX=3656 GN=LOC107991689 PE=3 SV=1[more]
A0A5D3DLM92.2e-18877.01Aspartic proteinase CDR1-like OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_sc... [more]
A0A0A0K9281.5e-18173.33Peptidase A1 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G07865... [more]
A0A0A0K9V41.5e-17872.25Peptidase A1 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G07863... [more]
A0A6J1HGT91.0e-16971.43aspartic proteinase CDR1-like OS=Cucurbita moschata OX=3662 GN=LOC111464204 PE=3... [more]
Match NameE-valueIdentityDescription
AT5G33340.11.1e-11549.20Eukaryotic aspartyl protease family protein [more]
AT1G64830.18.3e-10848.86Eukaryotic aspartyl protease family protein [more]
AT1G31450.16.8e-9442.82Eukaryotic aspartyl protease family protein [more]
AT2G35615.11.4e-9141.56Eukaryotic aspartyl protease family protein [more]
AT2G28010.18.6e-7339.91Eukaryotic aspartyl protease family protein [more]
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR021109Aspartic peptidase domain superfamilyGENE3D2.40.70.10Acid Proteasescoord: 253..433
e-value: 6.0E-44
score: 151.9
IPR021109Aspartic peptidase domain superfamilyGENE3D2.40.70.10Acid Proteasescoord: 58..252
e-value: 1.4E-54
score: 187.1
IPR021109Aspartic peptidase domain superfamilySUPERFAMILY50630Acid proteasescoord: 76..430
IPR032861Xylanase inhibitor, N-terminalPFAMPF14543TAXi_Ncoord: 82..255
e-value: 3.2E-56
score: 190.4
IPR032799Xylanase inhibitor, C-terminalPFAMPF14541TAXi_Ccoord: 278..426
e-value: 7.5E-28
score: 97.3
NoneNo IPR availablePANTHERPTHR47967:SF66ASPARTIC PROTEINASE CDR1-RELATEDcoord: 7..431
NoneNo IPR availablePANTHERPTHR47967OS07G0603500 PROTEIN-RELATEDcoord: 7..431
IPR001969Aspartic peptidase, active sitePROSITEPS00141ASP_PROTEASEcoord: 308..319
IPR033121Peptidase family A1 domainPROSITEPS51767PEPTIDASE_A1coord: 82..426
score: 44.421753
IPR034161Pepsin-like domain, plantCDDcd05476pepsin_A_like_plantcoord: 81..430
e-value: 7.15107E-92
score: 276.837

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
HG10010824.1HG10010824.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006508 proteolysis
molecular_function GO:0004190 aspartic-type endopeptidase activity