HG10016937 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
NameHG10016937
Typegene
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionEukaryotic aspartyl protease family protein
LocationChr03: 9493812 .. 9495203 (-)
RNA-Seq ExpressionHG10016937
SyntenyHG10016937
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGAGTTCTTCCCCATTCCCTTTCTCCTTTCCATCTTTCTCCTTCTTCCCCCTTCATCTTCTTCCCCTATCTCCACTATTACACTCCCCCTCACTGCCTTCCCTTCCAATTCACTTACAGATGATCCATGGAAAACCATCGATTATCTTCTCTCTGCTTCACTCAACAGAGCTCAACATCTCAAGAAGCCACAAACAAAATCAAACAGTTCCATCCAGAATGTCTCTCTGTTCCCTCGTAGCTATGGAGCTTATTCAATCTCACTTGCCTTCGGAACTCCACCGCAGAATTTATCGTTCATTTTCGATACTGGAAGTAGTGTCGTCTGGTTCCCCTGCACTGCTAGTTATCTTTGTTCTAATTGTTCGTTTCCTAATGTGGATGCTGCAACGATTCCGAAATTTGTTCCCAAATTATCTTCCTCTGCGAAGATTATTGGTTGTCGAAATCCGAAATGTGCTTGGATTTTTGGCCCTAATTTGAAATCTAGGTGTAGAAGTTGTAGCCCTAAATCTCGAAATTGTTTCGATTCTTGTCCTGGCTATGGAATTCAGTATGGCTCTGGTGCAACCGCTGGATTTCTCCTCTCTGAAACGCTTGATTTCCCGAAGAAACGAGTGCCGGATTTTCTCGTCGGTTGTTCCGTCTTGTCCGTTCATCAACCAGCCGGCATTGCCGGATTCGGCCGCGGTCCTGAATCGTTGCCCTCGCAAATGCGGCTGAAACGATTCTCCTATTGCCTCGTTTCTCGTCGGTTCGACGACTCACCCGTGAGTAGTCCTCTAGTACTGAACTCCGGCTCGGAATCTAACGAATCGAAGAGTAAGAGTCTCATTTACGCACCGTTTCGAAAGAATCCATCAGGATCCAACGCCGCATTTCGAGAGTACTATTACCTTAGTCTTCGGAGAATCATCATCGGTGGAAAGCCGGTGAAGTTCCCGTACAAGTATCTTGTGCCGGATTCCGCCGGGAACGGCGGCGCGATCATTGATTCCGGTTCGACGTTTACGTTTCTGGATAAGCCGATTTTCGAAGCCATATCGGAAGAGTTGGAGAAGCAGCTGGTGAAATATCCTCGAGATAAGGGCGTTGAAGTGCAGTCCGGTTTAAGGCCGTGCTTCGATATTTCCAAGGAGAAATTGGCGGAGTTTCCGGAACTGGTTTTGAAGTTTAAAGGCGGAGCGAAGCTGAGTTTGCCGCCGGCGAATTACTTGGCATTGGTGACGGATGACGGCGTGGTGTGCTTGACGATGATGACGGATGTAGCCGCCGTCGGCGGTGGCGGGGGGCCAGCGATTATATTCGGGGCGTTTCAGCAGCAGAATGTTTTGGTGGAGTATGATTTGGCGAGGGACAGAATCGGATTTCGGAAGCAGAGATGCACGTGA

mRNA sequence

ATGGAGTTCTTCCCCATTCCCTTTCTCCTTTCCATCTTTCTCCTTCTTCCCCCTTCATCTTCTTCCCCTATCTCCACTATTACACTCCCCCTCACTGCCTTCCCTTCCAATTCACTTACAGATGATCCATGGAAAACCATCGATTATCTTCTCTCTGCTTCACTCAACAGAGCTCAACATCTCAAGAAGCCACAAACAAAATCAAACAGTTCCATCCAGAATGTCTCTCTGTTCCCTCGTAGCTATGGAGCTTATTCAATCTCACTTGCCTTCGGAACTCCACCGCAGAATTTATCGTTCATTTTCGATACTGGAAGTAGTGTCGTCTGGTTCCCCTGCACTGCTAGTTATCTTTGTTCTAATTGTTCGTTTCCTAATGTGGATGCTGCAACGATTCCGAAATTTGTTCCCAAATTATCTTCCTCTGCGAAGATTATTGGTTGTCGAAATCCGAAATGTGCTTGGATTTTTGGCCCTAATTTGAAATCTAGGTGTAGAAGTTGTAGCCCTAAATCTCGAAATTGTTTCGATTCTTGTCCTGGCTATGGAATTCAGTATGGCTCTGGTGCAACCGCTGGATTTCTCCTCTCTGAAACGCTTGATTTCCCGAAGAAACGAGTGCCGGATTTTCTCGTCGGTTGTTCCGTCTTGTCCGTTCATCAACCAGCCGGCATTGCCGGATTCGGCCGCGGTCCTGAATCGTTGCCCTCGCAAATGCGGCTGAAACGATTCTCCTATTGCCTCGTTTCTCGTCGGTTCGACGACTCACCCGTGAGTAGTCCTCTAGTACTGAACTCCGGCTCGGAATCTAACGAATCGAAGAGTAAGAGTCTCATTTACGCACCGTTTCGAAAGAATCCATCAGGATCCAACGCCGCATTTCGAGAGTACTATTACCTTAGTCTTCGGAGAATCATCATCGGTGGAAAGCCGGTGAAGTTCCCGTACAAGTATCTTGTGCCGGATTCCGCCGGGAACGGCGGCGCGATCATTGATTCCGGTTCGACGTTTACGTTTCTGGATAAGCCGATTTTCGAAGCCATATCGGAAGAGTTGGAGAAGCAGCTGGTGAAATATCCTCGAGATAAGGGCGTTGAAGTGCAGTCCGGTTTAAGGCCGTGCTTCGATATTTCCAAGGAGAAATTGGCGGAGTTTCCGGAACTGGTTTTGAAGTTTAAAGGCGGAGCGAAGCTGAGTTTGCCGCCGGCGAATTACTTGGCATTGGTGACGGATGACGGCGTGGTGTGCTTGACGATGATGACGGATGTAGCCGCCGTCGGCGGTGGCGGGGGGCCAGCGATTATATTCGGGGCGTTTCAGCAGCAGAATGTTTTGGTGGAGTATGATTTGGCGAGGGACAGAATCGGATTTCGGAAGCAGAGATGCACGTGA

Coding sequence (CDS)

ATGGAGTTCTTCCCCATTCCCTTTCTCCTTTCCATCTTTCTCCTTCTTCCCCCTTCATCTTCTTCCCCTATCTCCACTATTACACTCCCCCTCACTGCCTTCCCTTCCAATTCACTTACAGATGATCCATGGAAAACCATCGATTATCTTCTCTCTGCTTCACTCAACAGAGCTCAACATCTCAAGAAGCCACAAACAAAATCAAACAGTTCCATCCAGAATGTCTCTCTGTTCCCTCGTAGCTATGGAGCTTATTCAATCTCACTTGCCTTCGGAACTCCACCGCAGAATTTATCGTTCATTTTCGATACTGGAAGTAGTGTCGTCTGGTTCCCCTGCACTGCTAGTTATCTTTGTTCTAATTGTTCGTTTCCTAATGTGGATGCTGCAACGATTCCGAAATTTGTTCCCAAATTATCTTCCTCTGCGAAGATTATTGGTTGTCGAAATCCGAAATGTGCTTGGATTTTTGGCCCTAATTTGAAATCTAGGTGTAGAAGTTGTAGCCCTAAATCTCGAAATTGTTTCGATTCTTGTCCTGGCTATGGAATTCAGTATGGCTCTGGTGCAACCGCTGGATTTCTCCTCTCTGAAACGCTTGATTTCCCGAAGAAACGAGTGCCGGATTTTCTCGTCGGTTGTTCCGTCTTGTCCGTTCATCAACCAGCCGGCATTGCCGGATTCGGCCGCGGTCCTGAATCGTTGCCCTCGCAAATGCGGCTGAAACGATTCTCCTATTGCCTCGTTTCTCGTCGGTTCGACGACTCACCCGTGAGTAGTCCTCTAGTACTGAACTCCGGCTCGGAATCTAACGAATCGAAGAGTAAGAGTCTCATTTACGCACCGTTTCGAAAGAATCCATCAGGATCCAACGCCGCATTTCGAGAGTACTATTACCTTAGTCTTCGGAGAATCATCATCGGTGGAAAGCCGGTGAAGTTCCCGTACAAGTATCTTGTGCCGGATTCCGCCGGGAACGGCGGCGCGATCATTGATTCCGGTTCGACGTTTACGTTTCTGGATAAGCCGATTTTCGAAGCCATATCGGAAGAGTTGGAGAAGCAGCTGGTGAAATATCCTCGAGATAAGGGCGTTGAAGTGCAGTCCGGTTTAAGGCCGTGCTTCGATATTTCCAAGGAGAAATTGGCGGAGTTTCCGGAACTGGTTTTGAAGTTTAAAGGCGGAGCGAAGCTGAGTTTGCCGCCGGCGAATTACTTGGCATTGGTGACGGATGACGGCGTGGTGTGCTTGACGATGATGACGGATGTAGCCGCCGTCGGCGGTGGCGGGGGGCCAGCGATTATATTCGGGGCGTTTCAGCAGCAGAATGTTTTGGTGGAGTATGATTTGGCGAGGGACAGAATCGGATTTCGGAAGCAGAGATGCACGTGA

Protein sequence

MEFFPIPFLLSIFLLLPPSSSSPISTITLPLTAFPSNSLTDDPWKTIDYLLSASLNRAQHLKKPQTKSNSSIQNVSLFPRSYGAYSISLAFGTPPQNLSFIFDTGSSVVWFPCTASYLCSNCSFPNVDAATIPKFVPKLSSSAKIIGCRNPKCAWIFGPNLKSRCRSCSPKSRNCFDSCPGYGIQYGSGATAGFLLSETLDFPKKRVPDFLVGCSVLSVHQPAGIAGFGRGPESLPSQMRLKRFSYCLVSRRFDDSPVSSPLVLNSGSESNESKSKSLIYAPFRKNPSGSNAAFREYYYLSLRRIIIGGKPVKFPYKYLVPDSAGNGGAIIDSGSTFTFLDKPIFEAISEELEKQLVKYPRDKGVEVQSGLRPCFDISKEKLAEFPELVLKFKGGAKLSLPPANYLALVTDDGVVCLTMMTDVAAVGGGGGPAIIFGAFQQQNVLVEYDLARDRIGFRKQRCT
Homology
BLAST of HG10016937 vs. NCBI nr
Match: XP_038881211.1 (probable aspartyl protease At4g16563 [Benincasa hispida])

HSP 1 Score: 820.8 bits (2119), Expect = 5.7e-234
Identity = 410/463 (88.55%), Postives = 427/463 (92.22%), Query Frame = 0

Query: 1   MEFFPIPFLLSIFLLLPPSSSSPISTITLPLTAFPSNSLTDDPWKTIDYLLSASLNRAQH 60
           MEFFPIPFL SIFLLLP SSSS ISTITLPLTAFPS  LTDDP K I+YLLSASLNRAQH
Sbjct: 1   MEFFPIPFLFSIFLLLPTSSSSSISTITLPLTAFPSIPLTDDPLKIINYLLSASLNRAQH 60

Query: 61  LKKPQTKSNSSIQNVSLFPRSYGAYSISLAFGTPPQNLSFIFDTGSSVVWFPCTASYLCS 120
           LK PQTK   SIQNVSLF RSYGAYSI+LAFGTPPQNLSF+FDTGSS+VWFPCTA Y CS
Sbjct: 61  LKNPQTK---SIQNVSLFSRSYGAYSITLAFGTPPQNLSFVFDTGSSLVWFPCTAGYRCS 120

Query: 121 NCSFPNVDAATIPKFVPKLSSSAKIIGCRNPKCAWIFGPNLKSRCRSCSPKSRNCFDSCP 180
           NCSFPNVDAATIPKFVPKLSSSAKIIGCRNPKCAWIFGPNL SRCR+C+PKSRNC  SCP
Sbjct: 121 NCSFPNVDAATIPKFVPKLSSSAKIIGCRNPKCAWIFGPNLNSRCRNCNPKSRNCSGSCP 180

Query: 181 GYGIQYGSGATAGFLLSETLDFPKKRVPDFLVGCSVLSVHQPAGIAGFGRGPESLPSQMR 240
           GYGI YGSGATAGFLLSETLDFPKK VPDFLVGCSV SVHQPAGIAGFGR PESLPSQMR
Sbjct: 181 GYGILYGSGATAGFLLSETLDFPKKGVPDFLVGCSVSSVHQPAGIAGFGRAPESLPSQMR 240

Query: 241 LKRFSYCLVSRRFDDSPVSSPLVLNSGSESNESKSKSLIYAPFRKNPSGSNAAFREYYYL 300
           LKRFSYCLVSR FDDSPVSSPLVL+SGSES++SK++S IYAPFR+NPSGSNAAFREYYYL
Sbjct: 241 LKRFSYCLVSRGFDDSPVSSPLVLDSGSESDDSKTESFIYAPFRENPSGSNAAFREYYYL 300

Query: 301 SLRRIIIGGKPVKFPYKYLVPDSAGNGGAIIDSGSTFTFLDKPIFEAISEELEKQLVKYP 360
           SLRRI+IGGKPVK PYKYL+PDSAG GGAIIDSGSTFTFLDKPIFEA++ ELEKQLVKYP
Sbjct: 301 SLRRILIGGKPVKIPYKYLMPDSAGKGGAIIDSGSTFTFLDKPIFEAVAGELEKQLVKYP 360

Query: 361 RDKGVEVQSGLRPCFDISKEKLAEFPELVLKFKGGAKLSLPPANYLALVTDDGVVCLTMM 420
           R K VEVQSGLRPCFDISKE   EFPELVLKFKGGAKLSLPP NYLALVTD GVVCLTMM
Sbjct: 361 RTKSVEVQSGLRPCFDISKEVSVEFPELVLKFKGGAKLSLPPVNYLALVTDAGVVCLTMM 420

Query: 421 TDVAAVGGGGGPAIIFGAFQQQNVLVEYDLARDRIGFRKQRCT 464
           TDV  VGGG GPAIIFGAFQQQNVLVEYDLAR+RIGFRKQRCT
Sbjct: 421 TDVGVVGGGAGPAIIFGAFQQQNVLVEYDLARNRIGFRKQRCT 460

BLAST of HG10016937 vs. NCBI nr
Match: XP_023543736.1 (probable aspartyl protease At4g16563 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 798.5 bits (2061), Expect = 3.0e-227
Identity = 392/463 (84.67%), Postives = 426/463 (92.01%), Query Frame = 0

Query: 1   MEFFPIPFLLSIFLLLPPSSSSPISTITLPLTAFPSNSLTDDPWKTIDYLLSASLNRAQH 60
           MEFFPIPFLLSI LLL  SSSS  +T+TLPLT FPS   T  PWK I +L+SASL RAQH
Sbjct: 1   MEFFPIPFLLSIVLLLSASSSSSSTTVTLPLTVFPSLPFT-HPWKNIKHLVSASLTRAQH 60

Query: 61  LKKPQTKSNSSIQNVSLFPRSYGAYSISLAFGTPPQNLSFIFDTGSSVVWFPCTASYLCS 120
           LK P+ KSN+SIQNV+LFPRSYGAYSISLAFGTPPQ+LS +FDTGSS+VWFPCTA Y CS
Sbjct: 61  LKTPRIKSNTSIQNVALFPRSYGAYSISLAFGTPPQSLSLVFDTGSSLVWFPCTAGYRCS 120

Query: 121 NCSFPNVDAATIPKFVPKLSSSAKIIGCRNPKCAWIFGPNLKSRCRSCSPKSRNCFDSCP 180
           NCSFPNVDAATIPKF+PKLSSSAKIIGCRN KC+WIFGPNLKS CRSCSP+SR C D+CP
Sbjct: 121 NCSFPNVDAATIPKFIPKLSSSAKIIGCRNRKCSWIFGPNLKSLCRSCSPRSRKCSDTCP 180

Query: 181 GYGIQYGSGATAGFLLSETLDFPKKRVPDFLVGCSVLSVHQPAGIAGFGRGPESLPSQMR 240
           GYGIQYGSGATAGFLLSETLDFP+KRVPDFLVGCSV+SVHQPAGIAGFGRGPESLPSQM 
Sbjct: 181 GYGIQYGSGATAGFLLSETLDFPEKRVPDFLVGCSVVSVHQPAGIAGFGRGPESLPSQMG 240

Query: 241 LKRFSYCLVSRRFDDSPVSSPLVLNSGSESNESKSKSLIYAPFRKNPSGSNAAFREYYYL 300
           LKRFS+CLV R+FDDSPVSSPLVL+S SES ESK+ SLIYAPFR+NPSGSNAAFREYYYL
Sbjct: 241 LKRFSHCLVPRQFDDSPVSSPLVLDSSSESGESKNNSLIYAPFRENPSGSNAAFREYYYL 300

Query: 301 SLRRIIIGGKPVKFPYKYLVPDSAGNGGAIIDSGSTFTFLDKPIFEAISEELEKQLVKYP 360
           +LRRI+IG KPVKFPYKYLVP+SAGNGGAIIDSGSTFTFLDKPIFEA++EELEKQLVKYP
Sbjct: 301 TLRRILIGRKPVKFPYKYLVPNSAGNGGAIIDSGSTFTFLDKPIFEAVAEELEKQLVKYP 360

Query: 361 RDKGVEVQSGLRPCFDISKEKLAEFPELVLKFKGGAKLSLPPANYLALVTDDGVVCLTMM 420
           R KGVE +SGLRPCFDISKE+  EFPEL+LKFKGGA L+LPPANYLALVTD GVVCLTM+
Sbjct: 361 RAKGVEAESGLRPCFDISKEESVEFPELILKFKGGATLALPPANYLALVTDTGVVCLTMI 420

Query: 421 TDVAAVGGGGGPAIIFGAFQQQNVLVEYDLARDRIGFRKQRCT 464
           TDV  +GGGGGPAIIFGAFQQQNVLV+YDLA+DRIGFRKQRCT
Sbjct: 421 TDVTFLGGGGGPAIIFGAFQQQNVLVQYDLAKDRIGFRKQRCT 462

BLAST of HG10016937 vs. NCBI nr
Match: XP_022979057.1 (probable aspartyl protease At4g16563 [Cucurbita maxima])

HSP 1 Score: 793.9 bits (2049), Expect = 7.5e-226
Identity = 391/463 (84.45%), Postives = 426/463 (92.01%), Query Frame = 0

Query: 1   MEFFPIPFLLSIFLLLPPSSSSPISTITLPLTAFPSNSLTDDPWKTIDYLLSASLNRAQH 60
           MEFFPI FLLSI LLL  SSSS   T+TLPLTAFPS  LT  PWK I +L+SASL RAQH
Sbjct: 1   MEFFPIQFLLSIVLLLSASSSSSSITVTLPLTAFPSLPLT-HPWKNIKHLVSASLARAQH 60

Query: 61  LKKPQTKSNSSIQNVSLFPRSYGAYSISLAFGTPPQNLSFIFDTGSSVVWFPCTASYLCS 120
           LK P+TKSN+SIQNV+LFPRSYGAYSISLAFGTPPQ+LS +FDTGSS+VWFPCTA Y CS
Sbjct: 61  LKTPKTKSNTSIQNVALFPRSYGAYSISLAFGTPPQSLSLVFDTGSSLVWFPCTAGYRCS 120

Query: 121 NCSFPNVDAATIPKFVPKLSSSAKIIGCRNPKCAWIFGPNLKSRCRSCSPKSRNCFDSCP 180
           NCSFPNVDAATIPKF+PKLSSSA+IIGCRN KC+WIFGPNLKS CRSCSP+SR C D+CP
Sbjct: 121 NCSFPNVDAATIPKFIPKLSSSARIIGCRNRKCSWIFGPNLKSSCRSCSPRSRKCSDTCP 180

Query: 181 GYGIQYGSGATAGFLLSETLDFPKKRVPDFLVGCSVLSVHQPAGIAGFGRGPESLPSQMR 240
           GYGIQYGSGATAGFLLSETLDFP+KRVPDFLVGCSVLSVHQPAGIAGFGRGPESLPSQM 
Sbjct: 181 GYGIQYGSGATAGFLLSETLDFPEKRVPDFLVGCSVLSVHQPAGIAGFGRGPESLPSQMG 240

Query: 241 LKRFSYCLVSRRFDDSPVSSPLVLNSGSESNESKSKSLIYAPFRKNPSGSNAAFREYYYL 300
           LKRFS+CLV R+FDDSPVSSPLVL+S  ES +SK+ SLIYAPFR+NPSGSNAAFREYYYL
Sbjct: 241 LKRFSHCLVPRQFDDSPVSSPLVLDSSPESGDSKTNSLIYAPFRENPSGSNAAFREYYYL 300

Query: 301 SLRRIIIGGKPVKFPYKYLVPDSAGNGGAIIDSGSTFTFLDKPIFEAISEELEKQLVKYP 360
           +LRRI+IG KPVKFPYKYLVP+SAGNGGAIIDSGSTFTFLDKPIFEA++EELEKQLVKYP
Sbjct: 301 TLRRILIGRKPVKFPYKYLVPNSAGNGGAIIDSGSTFTFLDKPIFEAVAEELEKQLVKYP 360

Query: 361 RDKGVEVQSGLRPCFDISKEKLAEFPELVLKFKGGAKLSLPPANYLALVTDDGVVCLTMM 420
           R KGVE +SGLRPCFDISKE+  EFPEL+LKFKGGA L+LPPANYLALVTD GVVCLTM+
Sbjct: 361 RAKGVEAESGLRPCFDISKEESVEFPELILKFKGGATLALPPANYLALVTDTGVVCLTMI 420

Query: 421 TDVAAVGGGGGPAIIFGAFQQQNVLVEYDLARDRIGFRKQRCT 464
           TDV  +GGGGGPAIIFGAFQQQNVLV+YDLA++RIGFRKQRCT
Sbjct: 421 TDVNFLGGGGGPAIIFGAFQQQNVLVQYDLAKERIGFRKQRCT 462

BLAST of HG10016937 vs. NCBI nr
Match: XP_022925946.1 (probable aspartyl protease At4g16563 [Cucurbita moschata] >KAG6604319.1 Aspartic proteinase nepenthesin-1, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 786.9 bits (2031), Expect = 9.2e-224
Identity = 386/463 (83.37%), Postives = 423/463 (91.36%), Query Frame = 0

Query: 1   MEFFPIPFLLSIFLLLPPSSSSPISTITLPLTAFPSNSLTDDPWKTIDYLLSASLNRAQH 60
           MEFF IPFLLSI LLL  SSSS  +T+TLPLT FPS      PWK I +L+SASL RAQH
Sbjct: 1   MEFFLIPFLLSIVLLLSASSSSSSTTVTLPLTVFPSLPFA-HPWKNIKHLVSASLTRAQH 60

Query: 61  LKKPQTKSNSSIQNVSLFPRSYGAYSISLAFGTPPQNLSFIFDTGSSVVWFPCTASYLCS 120
           LK P+TKSN+SIQNV+LFPRSYGAYSISLAFGTPPQ+LS +FDTGSS+VWFPCTA Y CS
Sbjct: 61  LKTPRTKSNTSIQNVALFPRSYGAYSISLAFGTPPQSLSLVFDTGSSLVWFPCTAGYRCS 120

Query: 121 NCSFPNVDAATIPKFVPKLSSSAKIIGCRNPKCAWIFGPNLKSRCRSCSPKSRNCFDSCP 180
           NCSFPNVDAATIPKF+PKLSSSAKIIGCRN KC+WIFGPNLK+ CRSCSP+SR C D+CP
Sbjct: 121 NCSFPNVDAATIPKFIPKLSSSAKIIGCRNRKCSWIFGPNLKTLCRSCSPRSRKCSDTCP 180

Query: 181 GYGIQYGSGATAGFLLSETLDFPKKRVPDFLVGCSVLSVHQPAGIAGFGRGPESLPSQMR 240
           GYGIQYGSGATAGFLLSETLDFP+KRVPDFLVGCSV+SVHQPAGIAGFGRGPESLPSQM 
Sbjct: 181 GYGIQYGSGATAGFLLSETLDFPEKRVPDFLVGCSVVSVHQPAGIAGFGRGPESLPSQMG 240

Query: 241 LKRFSYCLVSRRFDDSPVSSPLVLNSGSESNESKSKSLIYAPFRKNPSGSNAAFREYYYL 300
           LKRFS+CLV R+FDDSPVSSPLVL+S SES ESK+ SLIYAPFR+NPSGSNAAFREYYYL
Sbjct: 241 LKRFSHCLVPRQFDDSPVSSPLVLDSSSESGESKNNSLIYAPFRENPSGSNAAFREYYYL 300

Query: 301 SLRRIIIGGKPVKFPYKYLVPDSAGNGGAIIDSGSTFTFLDKPIFEAISEELEKQLVKYP 360
           +LRRI+IG KPVKFPYKYLVP+SAGNGGAIIDSGSTFTFLDKPIFEA++EELEKQLVKYP
Sbjct: 301 TLRRILIGRKPVKFPYKYLVPNSAGNGGAIIDSGSTFTFLDKPIFEAVAEELEKQLVKYP 360

Query: 361 RDKGVEVQSGLRPCFDISKEKLAEFPELVLKFKGGAKLSLPPANYLALVTDDGVVCLTMM 420
           R KGVE +SGLRPCFDISKE+  EFPEL+LKFKGGA L+LPP+NYLALV D  VVCLTM+
Sbjct: 361 RAKGVEAESGLRPCFDISKEESVEFPELILKFKGGATLALPPSNYLALVADTSVVCLTMI 420

Query: 421 TDVAAVGGGGGPAIIFGAFQQQNVLVEYDLARDRIGFRKQRCT 464
           TDV  +GGGGGPAIIFGAFQQQNVLV+YDLA++RIGFRKQRCT
Sbjct: 421 TDVTFLGGGGGPAIIFGAFQQQNVLVQYDLAKERIGFRKQRCT 462

BLAST of HG10016937 vs. NCBI nr
Match: KAG7034471.1 (Aspartic proteinase nepenthesin-1, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 786.2 bits (2029), Expect = 1.6e-223
Identity = 386/463 (83.37%), Postives = 423/463 (91.36%), Query Frame = 0

Query: 1   MEFFPIPFLLSIFLLLPPSSSSPISTITLPLTAFPSNSLTDDPWKTIDYLLSASLNRAQH 60
           MEFF IPFLLSI LLL  SSSS  +T+TLPLT FPS      PWK I +L+SASL RAQH
Sbjct: 1   MEFFLIPFLLSIVLLLSASSSSSSTTVTLPLTVFPSLPFA-HPWKNIKHLVSASLTRAQH 60

Query: 61  LKKPQTKSNSSIQNVSLFPRSYGAYSISLAFGTPPQNLSFIFDTGSSVVWFPCTASYLCS 120
           LK P+TKSN+SIQNV+LFPRSYGAYSISLAFGTPPQ+LS +FDTGSS+VWFPCTA Y CS
Sbjct: 61  LKIPRTKSNTSIQNVALFPRSYGAYSISLAFGTPPQSLSLVFDTGSSLVWFPCTAGYRCS 120

Query: 121 NCSFPNVDAATIPKFVPKLSSSAKIIGCRNPKCAWIFGPNLKSRCRSCSPKSRNCFDSCP 180
           NCSFPNVDAATIPKF+PKLSSSAKIIGCRN KC+WIFGPNLK+ CRSCSP+SR C D+CP
Sbjct: 121 NCSFPNVDAATIPKFIPKLSSSAKIIGCRNRKCSWIFGPNLKTLCRSCSPRSRKCSDTCP 180

Query: 181 GYGIQYGSGATAGFLLSETLDFPKKRVPDFLVGCSVLSVHQPAGIAGFGRGPESLPSQMR 240
           GYGIQYGSGATAGFLLSETLDFP+KRVPDFLVGCSV+SVHQPAGIAGFGRGPESLPSQM 
Sbjct: 181 GYGIQYGSGATAGFLLSETLDFPEKRVPDFLVGCSVVSVHQPAGIAGFGRGPESLPSQMG 240

Query: 241 LKRFSYCLVSRRFDDSPVSSPLVLNSGSESNESKSKSLIYAPFRKNPSGSNAAFREYYYL 300
           LKRFS+CLV R+FDDSPVSSPLVL+S SES ESK+ SLIYAPFR+NPSGSNAAFREYYYL
Sbjct: 241 LKRFSHCLVPRQFDDSPVSSPLVLDSSSESGESKNNSLIYAPFRENPSGSNAAFREYYYL 300

Query: 301 SLRRIIIGGKPVKFPYKYLVPDSAGNGGAIIDSGSTFTFLDKPIFEAISEELEKQLVKYP 360
           +LRRI+IG KPVKFPYKYLVP+SAGNGGAIIDSGSTFTFLDKPIFEA++EELEKQLVKYP
Sbjct: 301 TLRRILIGRKPVKFPYKYLVPNSAGNGGAIIDSGSTFTFLDKPIFEAVAEELEKQLVKYP 360

Query: 361 RDKGVEVQSGLRPCFDISKEKLAEFPELVLKFKGGAKLSLPPANYLALVTDDGVVCLTMM 420
           R KGVE +SGLRPCFDISKE+  EFPEL+LKFKGGA L+LPP+NYLALV D  VVCLTM+
Sbjct: 361 RAKGVEAESGLRPCFDISKEESVEFPELILKFKGGATLALPPSNYLALVADTSVVCLTMI 420

Query: 421 TDVAAVGGGGGPAIIFGAFQQQNVLVEYDLARDRIGFRKQRCT 464
           TDV  +GGGGGPAIIFGAFQQQNVLV+YDLA++RIGFRKQRCT
Sbjct: 421 TDVTFLGGGGGPAIIFGAFQQQNVLVQYDLAKERIGFRKQRCT 462

BLAST of HG10016937 vs. ExPASy Swiss-Prot
Match: Q940R4 (Probable aspartyl protease At4g16563 OS=Arabidopsis thaliana OX=3702 GN=At4g16563 PE=2 SV=1)

HSP 1 Score: 188.3 bits (477), Expect = 1.9e-46
Identity = 146/485 (30.10%), Postives = 225/485 (46.39%), Query Frame = 0

Query: 24  ISTITLPLTAFPSNSLTDDPWKT--IDYLLSASLNRAQHLKKPQTKSNSSIQNVSLFPRS 83
           +S+++ PL    S+SL+     +  +  L S+S   +   ++   K     Q +SL   S
Sbjct: 22  VSSLSTPLLLHLSHSLSTSKHSSSPLHLLKSSSSRSSARFRRHHHKQQQ--QQLSLPISS 81

Query: 84  YGAYSISLAFGTPPQNLSFIFDTGSSVVWFPCTASYLCSNCSFPNVDAATIPKFVP-KLS 143
              Y ISL+ G+    +S   DTGS +VWFPC   + C  C     ++  +P   P  LS
Sbjct: 82  GSDYLISLSVGSSSSAVSLYLDTGSDLVWFPC-RPFTCILC-----ESKPLPPSPPSSLS 141

Query: 144 SSAKIIGCRNPKCAWIFG--PNLK----SRCRSCSPKSRNCFDS---CPGYGIQYGSGAT 203
           SSA  + C +P C+      P+      S C     ++ +C  S   CP +   YG G+ 
Sbjct: 142 SSATTVSCSSPSCSAAHSSLPSSDLCAISNCPLDFIETGDCNTSSYPCPPFYYAYGDGSL 201

Query: 204 AGFLLSETLDFPKKRVPDFLVGCSVLSVHQPAGIAGFGRGPESLPSQMRL------KRFS 263
              L S++L  P   V +F  GC+  ++ +P G+AGFGRG  SLP+Q+ +        FS
Sbjct: 202 VAKLYSDSLSLPSVSVSNFTFGCAHTTLAEPIGVAGFGRGRLSLPAQLAVHSPHLGNSFS 261

Query: 264 YCLVSRRFDDSPV--SSPLVL----------------NSGSESNESKSKSLIYAPFRKNP 323
           YCLVS  FD   V   SPL+L                +   +  + K    ++    +NP
Sbjct: 262 YCLVSHSFDSDRVRRPSPLILGRFVDKKEKRVGTTDDHDDGDDEKKKKNEFVFTEMLENP 321

Query: 324 SGSNAAFREYYYLSLRRIIIGGKPVKFPYKYLVPDSAGNGGAIIDSGSTFTFLDKPIFEA 383
                    +Y +SL+ I IG + +  P      D  G GG ++DSG+TFT L    + +
Sbjct: 322 K-----HPYFYSVSLQGISIGKRNIPAPAMLRRIDKNGGGGVVVDSGTTFTMLPAKFYNS 381

Query: 384 ISEELEKQLVK-YPRDKGVEVQSGLRPCFDISKEKLAEFPELVLKFKGG-AKLSLPPANY 443
           + EE + ++ + + R   VE  SG+ PC+ ++  +  + P LVL F G  + ++LP  NY
Sbjct: 382 VVEEFDSRVGRVHERADRVEPSSGMSPCYYLN--QTVKVPALVLHFAGNRSSVTLPRRNY 441

Query: 444 LALVTDDG--------VVCLTMMTDVAAVGGGGGPAIIFGAFQQQNVLVEYDLARDRIGF 463
                D G        + CL +M         GG   I G +QQQ   V YDL   R+GF
Sbjct: 442 FYEFMDGGDGKEEKRKIGCLMLMNGGDESELRGGTGAILGNYQQQGFEVVYDLLNRRVGF 491

BLAST of HG10016937 vs. ExPASy Swiss-Prot
Match: Q9LNJ3 (Aspartyl protease family protein 2 OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 SV=1)

HSP 1 Score: 150.2 bits (378), Expect = 5.7e-35
Identity = 146/478 (30.54%), Postives = 196/478 (41.00%), Query Frame = 0

Query: 19  SSSSPISTITLPLTAFPSNSLTDDPWKTIDYLLSASLNR-------------------AQ 78
           S S   S+ITL L    + S      KT D L S+ L R                     
Sbjct: 64  SDSESSSSITLNLDHIDALSSN----KTPDELFSSRLQRDSRRVKSIATLAAQIPGRNVT 123

Query: 79  HLKKPQTKSNSSIQNVSLFPRSYGAYSISLAFGTPPQNLSFIFDTGSSVVWFPCTASYLC 138
           H  +P   S+S +  +S   +  G Y   L  GTP + +  + DTGS +VW  C     C
Sbjct: 124 HAPRPGGFSSSVVSGLS---QGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRC 183

Query: 139 SNCSFPNVDAATIPKFVPKLSSSAKIIGCRNPKCAWIFGPNLKSRCRSCSPKSRNCFDSC 198
            + S P  D        P+ S +   I C +P C  +      +R ++C           
Sbjct: 184 YSQSDPIFD--------PRKSKTYATIPCSSPHCRRLDSAGCNTRRKTCL---------- 243

Query: 199 PGYGIQYGSGA-TAGFLLSETLDFPKKRVPDFLVGCSVLSVHQ-------PAGIAGFGRG 258
             Y + YG G+ T G   +ETL F + RV    +GC     H         AG+ G G+G
Sbjct: 244 --YQVSYGDGSFTVGDFSTETLTFRRNRVKGVALGCG----HDNEGLFVGAAGLLGLGKG 303

Query: 259 PESLPSQMRLK---RFSYCLVSRRFDDSPVSSPLVLNSGSESNESKSKSLIYAPFRKNPS 318
             S P Q   +   +FSYCLV R     P S           N + S+   + P   NP 
Sbjct: 304 KLSFPGQTGHRFNQKFSYCLVDRSASSKPSSVVF-------GNAAVSRIARFTPLLSNPK 363

Query: 319 GSNAAFREYYYLSLRRIIIGGKPVKFPYKYLVP-DSAGNGGAIIDSGSTFTFLDKPIFEA 378
                   +YY+ L  I +GG  V      L   D  GNGG IIDSG++ T L +P + A
Sbjct: 364 -----LDTFYYVGLLGISVGGTRVPGVTASLFKLDQIGNGGVIIDSGTSVTRLIRPAYIA 423

Query: 379 ISEELE---KQLVKYPRDKGVEVQSGLRPCFDISKEKLAEFPELVLKFKGGAKLSLPPAN 438
           + +      K L + P        S    CFD+S     + P +VL F+ GA +SLP  N
Sbjct: 424 MRDAFRVGAKTLKRAPD------FSLFDTCFDLSNMNEVKVPTVVLHFR-GADVSLPATN 483

Query: 439 YLALVTDDGVVCLTMMTDVAAVGGGGGPAIIFGAFQQQNVLVEYDLARDRIGFRKQRC 463
           YL  V  +G  C       A  G  GG +II G  QQQ   V YDLA  R+GF    C
Sbjct: 484 YLIPVDTNGKFCF------AFAGTMGGLSII-GNIQQQGFRVVYDLASSRVGFAPGGC 484

BLAST of HG10016937 vs. ExPASy Swiss-Prot
Match: Q766C2 (Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 SV=1)

HSP 1 Score: 149.4 bits (376), Expect = 9.7e-35
Identity = 121/387 (31.27%), Postives = 171/387 (44.19%), Query Frame = 0

Query: 83  GAYSISLAFGTPPQNLSFIFDTGSSVVWFPCTASYLCSNCSFPNVDAATIPKFVPKLSSS 142
           G Y +++A GTP  + S I DTGS ++W  C     C+ C      +   P F P+ SSS
Sbjct: 94  GEYLMNVAIGTPDSSFSAIMDTGSDLIWTQCEP---CTQCF-----SQPTPIFNPQDSSS 153

Query: 143 AKIIGCRNPKCAWIFGPNLKSRCRSCSPKSRNCFDSCPGYGIQYGSGATA-GFLLSETLD 202
              + C +  C  +               S  C ++   Y   YG G+T  G++ +ET  
Sbjct: 154 FSTLPCESQYCQDL--------------PSETCNNNECQYTYGYGDGSTTQGYMATETFT 213

Query: 203 FPKKRVPDFLVGCSV----LSVHQPAGIAGFGRGPESLPSQMRLKRFSYCLVSRRFDDSP 262
           F    VP+   GC            AG+ G G GP SLPSQ+ + +FSYC+ S     S 
Sbjct: 214 FETSSVPNIAFGCGEDNQGFGQGNGAGLIGMGWGPLSLPSQLGVGQFSYCMTSY---GSS 273

Query: 263 VSSPLVLNSGSESNESKSKSLIYAPFRKNPSGSNAAFREYYYLSLRRIIIGGKPVKFPYK 322
             S L L S +      S S        NP+        YYY++L+ I +GG  +  P  
Sbjct: 274 SPSTLALGSAASGVPEGSPSTTLIHSSLNPT--------YYYITLQGITVGGDNLGIPSS 333

Query: 323 YLVPDSAGNGGAIIDSGSTFTFLDKPIFEAISEELEKQLVKYPRDKGVEVQSGLRPCF-D 382
                  G GG IIDSG+T T+L +  + A+++    Q+     D   E  SGL  CF  
Sbjct: 334 TFQLQDDGTGGMIIDSGTTLTYLPQDAYNAVAQAFTDQINLPTVD---ESSSGLSTCFQQ 393

Query: 383 ISKEKLAEFPELVLKFKGGAKLSLPPANYLALVTDDGVVCLTMMTDVAAVGGGGGPAI-I 442
            S     + PE+ ++F GG  L+L   N L +   +GV+CL       A+G      I I
Sbjct: 394 PSDGSTVQVPEISMQFDGGV-LNLGEQNIL-ISPAEGVICL-------AMGSSSQLGISI 435

Query: 443 FGAFQQQNVLVEYDLARDRIGFRKQRC 463
           FG  QQQ   V YDL    + F   +C
Sbjct: 454 FGNIQQQETQVLYDLQNLAVSFVPTQC 435

BLAST of HG10016937 vs. ExPASy Swiss-Prot
Match: Q766C3 (Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 SV=1)

HSP 1 Score: 148.7 bits (374), Expect = 1.7e-34
Identity = 117/387 (30.23%), Postives = 171/387 (44.19%), Query Frame = 0

Query: 83  GAYSISLAFGTPPQNLSFIFDTGSSVVWFPCTASYLCSNCSFPNVDAATIPKFVPKLSSS 142
           G Y ++L+ GTP Q  S I DTGS ++W  C     C N S         P F P+ SSS
Sbjct: 93  GEYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQPCTQCFNQS--------TPIFNPQGSSS 152

Query: 143 AKIIGCRNPKCAWIFGPNLKSRCRSCSPKSRNCFDSCPGYGIQYGSGA-TAGFLLSETLD 202
              + C +  C  +  P               C ++   Y   YG G+ T G + +ETL 
Sbjct: 153 FSTLPCSSQLCQALSSP--------------TCSNNFCQYTYGYGDGSETQGSMGTETLT 212

Query: 203 FPKKRVPDFLVGCSV----LSVHQPAGIAGFGRGPESLPSQMRLKRFSYCLVSRRFDDSP 262
           F    +P+   GC            AG+ G GRGP SLPSQ+ + +FSYC+       S 
Sbjct: 213 FGSVSIPNITFGCGENNQGFGQGNGAGLVGMGRGPLSLPSQLDVTKFSYCMTP---IGSS 272

Query: 263 VSSPLVLNSGSESNESKSKSLIYAPFRKNPSGSNAAFREYYYLSLRRIIIGGKPVKF-PY 322
             S L+L S + S  + S +       + P+        +YY++L  + +G   +   P 
Sbjct: 273 TPSNLLLGSLANSVTAGSPNTTLIQSSQIPT--------FYYITLNGLSVGSTRLPIDPS 332

Query: 323 KYLVPDSAGNGGAIIDSGSTFTFLDKPIFEAISEELEKQLVKYPRDKGVEVQSGLRPCFD 382
            + +  + G GG IIDSG+T T+     ++++ +E   Q +  P   G    SG   CF 
Sbjct: 333 AFALNSNNGTGGIIIDSGTTLTYFVNNAYQSVRQEFISQ-INLPVVNG--SSSGFDLCFQ 392

Query: 383 I-SKEKLAEFPELVLKFKGGAKLSLPPANYLALVTDDGVVCLTMMTDVAAVGGGGGPAII 442
             S     + P  V+ F GG  L LP  NY  +   +G++CL       A+G       I
Sbjct: 393 TPSDPSNLQIPTFVMHFDGG-DLELPSENYF-ISPSNGLICL-------AMGSSSQGMSI 434

Query: 443 FGAFQQQNVLVEYDLARDRIGFRKQRC 463
           FG  QQQN+LV YD     + F   +C
Sbjct: 453 FGNIQQQNMLVVYDTGNSVVSFASAQC 434

BLAST of HG10016937 vs. ExPASy Swiss-Prot
Match: Q9LHE3 (Protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Arabidopsis thaliana OX=3702 GN=ASPG2 PE=2 SV=1)

HSP 1 Score: 128.6 bits (322), Expect = 1.8e-28
Identity = 111/424 (26.18%), Postives = 166/424 (39.15%), Query Frame = 0

Query: 51  LSASLNRAQHLKKPQTKSNSSIQN-----VSLFPRSYGAYSISLAFGTPPQNLSFIFDTG 110
           +SA L R      P + S   + +     VS   +  G Y + +  G+PP++   + D+G
Sbjct: 92  VSAILRRISGKVIPSSDSRYEVNDFGSDIVSGMDQGSGEYFVRIGVGSPPRDQYMVIDSG 151

Query: 111 SSVVWFPCTASYLCSNCSFPNVDAATIPKFVPKLSSSAKIIGCRNPKCAWIFGPNLKSRC 170
           S +VW  C    LC   S P  D        P  S S   + C +  C  I         
Sbjct: 152 SDMVWVQCQPCKLCYKQSDPVFD--------PAKSGSYTGVSCGSSVCDRI--------- 211

Query: 171 RSCSPKSRNCFDSCPGYGIQYGSGA-TAGFLLSETLDFPKKRVPDFLVGCSVLSVHQ--- 230
                ++  C      Y + YG G+ T G L  ETL F K  V +  +GC   +      
Sbjct: 212 -----ENSGCHSGGCRYEVMYGDGSYTKGTLALETLTFAKTVVRNVAMGCGHRNRGMFIG 271

Query: 231 PAGIAGFGRGPESLPSQMRLK---RFSYCLVSRRFDDSPVSSPLVLNSGSESNESKSKSL 290
            AG+ G G G  S   Q+  +    F YCLVSR  D    +  LV        E+     
Sbjct: 272 AAGLLGIGGGSMSFVGQLSGQTGGAFGYCLVSRGTDS---TGSLVF-----GREALPVGA 331

Query: 291 IYAPFRKNPSGSNAAFREYYYLSLRRIIIGGKPVKFPYKYLVPDSAGNGGAIIDSGSTFT 350
            + P  +NP   +     +YY+ L+ + +GG  +  P         G+GG ++D+G+  T
Sbjct: 332 SWVPLVRNPRAPS-----FYYVGLKGLGVGGVRIPLPDGVFDLTETGDGGVVMDTGTAVT 391

Query: 351 FLDKPIFEAISEELEKQLVKYPRDKGVEVQSGLRPCFDISKEKLAEFPELVLKFKGGAKL 410
            L    + A  +  + Q    PR  GV +      C+D+S       P +   F  G  L
Sbjct: 392 RLPTAAYVAFRDGFKSQTANLPRASGVSI---FDTCYDLSGFVSVRVPTVSFYFTEGPVL 451

Query: 411 SLPPANYLALVTDDGVVCLTMMTDVAAVGGGGGPAIIFGAFQQQNVLVEYDLARDRIGFR 463
           +LP  N+L  V D G  C         +        I G  QQ+ + V +D A   +GF 
Sbjct: 452 TLPARNFLMPVDDSGTYCFAFAASPTGLS-------IIGNIQQEGIQVSFDGANGFVGFG 470

BLAST of HG10016937 vs. ExPASy TrEMBL
Match: A0A6J1IMR7 (probable aspartyl protease At4g16563 OS=Cucurbita maxima OX=3661 GN=LOC111478813 PE=3 SV=1)

HSP 1 Score: 793.9 bits (2049), Expect = 3.6e-226
Identity = 391/463 (84.45%), Postives = 426/463 (92.01%), Query Frame = 0

Query: 1   MEFFPIPFLLSIFLLLPPSSSSPISTITLPLTAFPSNSLTDDPWKTIDYLLSASLNRAQH 60
           MEFFPI FLLSI LLL  SSSS   T+TLPLTAFPS  LT  PWK I +L+SASL RAQH
Sbjct: 1   MEFFPIQFLLSIVLLLSASSSSSSITVTLPLTAFPSLPLT-HPWKNIKHLVSASLARAQH 60

Query: 61  LKKPQTKSNSSIQNVSLFPRSYGAYSISLAFGTPPQNLSFIFDTGSSVVWFPCTASYLCS 120
           LK P+TKSN+SIQNV+LFPRSYGAYSISLAFGTPPQ+LS +FDTGSS+VWFPCTA Y CS
Sbjct: 61  LKTPKTKSNTSIQNVALFPRSYGAYSISLAFGTPPQSLSLVFDTGSSLVWFPCTAGYRCS 120

Query: 121 NCSFPNVDAATIPKFVPKLSSSAKIIGCRNPKCAWIFGPNLKSRCRSCSPKSRNCFDSCP 180
           NCSFPNVDAATIPKF+PKLSSSA+IIGCRN KC+WIFGPNLKS CRSCSP+SR C D+CP
Sbjct: 121 NCSFPNVDAATIPKFIPKLSSSARIIGCRNRKCSWIFGPNLKSSCRSCSPRSRKCSDTCP 180

Query: 181 GYGIQYGSGATAGFLLSETLDFPKKRVPDFLVGCSVLSVHQPAGIAGFGRGPESLPSQMR 240
           GYGIQYGSGATAGFLLSETLDFP+KRVPDFLVGCSVLSVHQPAGIAGFGRGPESLPSQM 
Sbjct: 181 GYGIQYGSGATAGFLLSETLDFPEKRVPDFLVGCSVLSVHQPAGIAGFGRGPESLPSQMG 240

Query: 241 LKRFSYCLVSRRFDDSPVSSPLVLNSGSESNESKSKSLIYAPFRKNPSGSNAAFREYYYL 300
           LKRFS+CLV R+FDDSPVSSPLVL+S  ES +SK+ SLIYAPFR+NPSGSNAAFREYYYL
Sbjct: 241 LKRFSHCLVPRQFDDSPVSSPLVLDSSPESGDSKTNSLIYAPFRENPSGSNAAFREYYYL 300

Query: 301 SLRRIIIGGKPVKFPYKYLVPDSAGNGGAIIDSGSTFTFLDKPIFEAISEELEKQLVKYP 360
           +LRRI+IG KPVKFPYKYLVP+SAGNGGAIIDSGSTFTFLDKPIFEA++EELEKQLVKYP
Sbjct: 301 TLRRILIGRKPVKFPYKYLVPNSAGNGGAIIDSGSTFTFLDKPIFEAVAEELEKQLVKYP 360

Query: 361 RDKGVEVQSGLRPCFDISKEKLAEFPELVLKFKGGAKLSLPPANYLALVTDDGVVCLTMM 420
           R KGVE +SGLRPCFDISKE+  EFPEL+LKFKGGA L+LPPANYLALVTD GVVCLTM+
Sbjct: 361 RAKGVEAESGLRPCFDISKEESVEFPELILKFKGGATLALPPANYLALVTDTGVVCLTMI 420

Query: 421 TDVAAVGGGGGPAIIFGAFQQQNVLVEYDLARDRIGFRKQRCT 464
           TDV  +GGGGGPAIIFGAFQQQNVLV+YDLA++RIGFRKQRCT
Sbjct: 421 TDVNFLGGGGGPAIIFGAFQQQNVLVQYDLAKERIGFRKQRCT 462

BLAST of HG10016937 vs. ExPASy TrEMBL
Match: A0A6J1EDJ0 (probable aspartyl protease At4g16563 OS=Cucurbita moschata OX=3662 GN=LOC111433208 PE=3 SV=1)

HSP 1 Score: 786.9 bits (2031), Expect = 4.4e-224
Identity = 386/463 (83.37%), Postives = 423/463 (91.36%), Query Frame = 0

Query: 1   MEFFPIPFLLSIFLLLPPSSSSPISTITLPLTAFPSNSLTDDPWKTIDYLLSASLNRAQH 60
           MEFF IPFLLSI LLL  SSSS  +T+TLPLT FPS      PWK I +L+SASL RAQH
Sbjct: 1   MEFFLIPFLLSIVLLLSASSSSSSTTVTLPLTVFPSLPFA-HPWKNIKHLVSASLTRAQH 60

Query: 61  LKKPQTKSNSSIQNVSLFPRSYGAYSISLAFGTPPQNLSFIFDTGSSVVWFPCTASYLCS 120
           LK P+TKSN+SIQNV+LFPRSYGAYSISLAFGTPPQ+LS +FDTGSS+VWFPCTA Y CS
Sbjct: 61  LKTPRTKSNTSIQNVALFPRSYGAYSISLAFGTPPQSLSLVFDTGSSLVWFPCTAGYRCS 120

Query: 121 NCSFPNVDAATIPKFVPKLSSSAKIIGCRNPKCAWIFGPNLKSRCRSCSPKSRNCFDSCP 180
           NCSFPNVDAATIPKF+PKLSSSAKIIGCRN KC+WIFGPNLK+ CRSCSP+SR C D+CP
Sbjct: 121 NCSFPNVDAATIPKFIPKLSSSAKIIGCRNRKCSWIFGPNLKTLCRSCSPRSRKCSDTCP 180

Query: 181 GYGIQYGSGATAGFLLSETLDFPKKRVPDFLVGCSVLSVHQPAGIAGFGRGPESLPSQMR 240
           GYGIQYGSGATAGFLLSETLDFP+KRVPDFLVGCSV+SVHQPAGIAGFGRGPESLPSQM 
Sbjct: 181 GYGIQYGSGATAGFLLSETLDFPEKRVPDFLVGCSVVSVHQPAGIAGFGRGPESLPSQMG 240

Query: 241 LKRFSYCLVSRRFDDSPVSSPLVLNSGSESNESKSKSLIYAPFRKNPSGSNAAFREYYYL 300
           LKRFS+CLV R+FDDSPVSSPLVL+S SES ESK+ SLIYAPFR+NPSGSNAAFREYYYL
Sbjct: 241 LKRFSHCLVPRQFDDSPVSSPLVLDSSSESGESKNNSLIYAPFRENPSGSNAAFREYYYL 300

Query: 301 SLRRIIIGGKPVKFPYKYLVPDSAGNGGAIIDSGSTFTFLDKPIFEAISEELEKQLVKYP 360
           +LRRI+IG KPVKFPYKYLVP+SAGNGGAIIDSGSTFTFLDKPIFEA++EELEKQLVKYP
Sbjct: 301 TLRRILIGRKPVKFPYKYLVPNSAGNGGAIIDSGSTFTFLDKPIFEAVAEELEKQLVKYP 360

Query: 361 RDKGVEVQSGLRPCFDISKEKLAEFPELVLKFKGGAKLSLPPANYLALVTDDGVVCLTMM 420
           R KGVE +SGLRPCFDISKE+  EFPEL+LKFKGGA L+LPP+NYLALV D  VVCLTM+
Sbjct: 361 RAKGVEAESGLRPCFDISKEESVEFPELILKFKGGATLALPPSNYLALVADTSVVCLTMI 420

Query: 421 TDVAAVGGGGGPAIIFGAFQQQNVLVEYDLARDRIGFRKQRCT 464
           TDV  +GGGGGPAIIFGAFQQQNVLV+YDLA++RIGFRKQRCT
Sbjct: 421 TDVTFLGGGGGPAIIFGAFQQQNVLVQYDLAKERIGFRKQRCT 462

BLAST of HG10016937 vs. ExPASy TrEMBL
Match: A0A0A0KHK2 (Peptidase A1 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G454470 PE=3 SV=1)

HSP 1 Score: 780.4 bits (2014), Expect = 4.2e-222
Identity = 391/464 (84.27%), Postives = 419/464 (90.30%), Query Frame = 0

Query: 1   MEFFPIPFLLSIFLLLPPSSSSPISTITLPLTAFPSNSLTDDPWKTIDYLLSASLNRAQH 60
           MEF PIPFL SIFLLLP SSSS  ST  LPLT FPS S T DP+KTI+ LLSASLNRAQH
Sbjct: 1   MEFLPIPFLFSIFLLLPTSSSS--STTVLPLTTFPSVSFT-DPFKTINLLLSASLNRAQH 60

Query: 61  LKKPQTKSNSSIQNVSLFPRSYGAYSISLAFGTPPQNLSFIFDTGSSVVWFPCTASYLCS 120
           LK PQ+KSN+SIQNVSLFPRSYGAYS+SLAFGTPPQNLSFIFDTGSS+VWFPCTA Y CS
Sbjct: 61  LKTPQSKSNTSIQNVSLFPRSYGAYSVSLAFGTPPQNLSFIFDTGSSLVWFPCTAGYRCS 120

Query: 121 NCSFPNVDAATIPKFVPKLSSSAKIIGCRNPKCAWIFGPNLKSRCRSCSPKSRNCFDSCP 180
            CSFP VD ATI KFVPKLSSS K++GCRNPKCAWIFGPNLKSRCR+C+ KSR C DSCP
Sbjct: 121 RCSFPYVDPATISKFVPKLSSSVKVVGCRNPKCAWIFGPNLKSRCRNCNSKSRKCSDSCP 180

Query: 181 GYGIQYGSGATAGFLLSETLDFPKKRVPDFLVGCSVLSVHQPAGIAGFGRGPESLPSQMR 240
           GYG+QYGSGATAG LLSETLD   KRVPDFLVGCSV+SVHQPAGIAGFGRGPESLPSQMR
Sbjct: 181 GYGLQYGSGATAGILLSETLDLENKRVPDFLVGCSVMSVHQPAGIAGFGRGPESLPSQMR 240

Query: 241 LKRFSYCLVSRRFDDSPVSSPLVLNSGSESNESKSKSLIYAPFRKNPSGSNAAFREYYYL 300
           LKRFS+CLVSR FDDSPVSSPLVL+SGSES+ESK+KS IYAPFR+NPS SNAAFREYYYL
Sbjct: 241 LKRFSHCLVSRGFDDSPVSSPLVLDSGSESDESKTKSFIYAPFRENPSVSNAAFREYYYL 300

Query: 301 SLRRIIIGGKPVKFPYKYLVPDSAGNGGAIIDSGSTFTFLDKPIFEAISEELEKQLVKYP 360
           SLRRI+IGGKPVKFPYKYLVPDS GNGGAIIDSGSTFTFLDKPIFEAI++ELEKQLVKYP
Sbjct: 301 SLRRILIGGKPVKFPYKYLVPDSTGNGGAIIDSGSTFTFLDKPIFEAIADELEKQLVKYP 360

Query: 361 RDKGVEVQSGLRPCFDISK-EKLAEFPELVLKFKGGAKLSLPPANYLALVTDDGVVCLTM 420
           R K VE QSGLRPCF+I K E+ AEFP++VLKFKGG KLSL   NYLA+VTD+GVVCLTM
Sbjct: 361 RAKDVEAQSGLRPCFNIPKEEESAEFPDVVLKFKGGGKLSLAAENYLAMVTDEGVVCLTM 420

Query: 421 MTDVAAVGGGGGPAIIFGAFQQQNVLVEYDLARDRIGFRKQRCT 464
           MTD A VGGGGGPAII GAFQQQNVLVEYDLA+ RIGFRKQ+CT
Sbjct: 421 MTDEAVVGGGGGPAIILGAFQQQNVLVEYDLAKQRIGFRKQKCT 461

BLAST of HG10016937 vs. ExPASy TrEMBL
Match: A0A5A7SGF9 (Aspartic proteinase nepenthesin-2-like OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold541G00670 PE=3 SV=1)

HSP 1 Score: 759.2 bits (1959), Expect = 9.9e-216
Identity = 377/464 (81.25%), Postives = 413/464 (89.01%), Query Frame = 0

Query: 1   MEFFPIPFLLSIFLLLPPSSSSPISTITLPLTAFPSNSLTDDPWKTIDYLLSASLNRAQH 60
           MEF PIPFL SIFLLLP SSS   S+ITLPL  FPS   T DP KTI++LLSASL+RAQH
Sbjct: 1   MEFLPIPFLFSIFLLLPTSSS---SSITLPLATFPSIPFT-DPLKTINHLLSASLSRAQH 60

Query: 61  LKKPQTKSNSSIQNVSLFPRSYGAYSISLAFGTPPQNLSFIFDTGSSVVWFPCTASYLCS 120
           LK PQ+KSN+S +NVSLFPRSYGAY++SLAFGTPPQNLSFIFDTGSS+VWFPCTA Y C+
Sbjct: 61  LKSPQSKSNTSTENVSLFPRSYGAYAVSLAFGTPPQNLSFIFDTGSSLVWFPCTAGYRCA 120

Query: 121 NCSFPNVDAATIPKFVPKLSSSAKIIGCRNPKCAWIFGPNLKSRCRSCSPKSRNCFDSCP 180
           +CSFP+VD ATI KFVPKLSSS KI+GCRNPKCAWIFGPNLKSRCR+C+PKSR C DSCP
Sbjct: 121 HCSFPHVDPATISKFVPKLSSSVKIVGCRNPKCAWIFGPNLKSRCRNCNPKSRKCSDSCP 180

Query: 181 GYGIQYGSGATAGFLLSETLDFPKKRVPDFLVGCSVLSVHQPAGIAGFGRGPESLPSQMR 240
           GYGIQYGSGATAG LLSETLD   KRVPDFLVGCSV+SVHQPAGIAGFGRGPESLPSQMR
Sbjct: 181 GYGIQYGSGATAGILLSETLDLQNKRVPDFLVGCSVMSVHQPAGIAGFGRGPESLPSQMR 240

Query: 241 LKRFSYCLVSRRFDDSPVSSPLVLNSGSESNESKSKSLIYAPFRKNPSGSNAAFREYYYL 300
           LKRFS+CL+ R FDDSPVSSPLVL+SG ES+ESK+KS IYAPF++NPS SN AFREYYYL
Sbjct: 241 LKRFSHCLLPRGFDDSPVSSPLVLDSGPESDESKTKSFIYAPFQENPSRSNTAFREYYYL 300

Query: 301 SLRRIIIGGKPVKFPYKYLVPDSAGNGGAIIDSGSTFTFLDKPIFEAISEELEKQLVKYP 360
           SLRRI+IGGKPVKFPYKYLVPDS G GGAIIDSGSTFTFLDKPIFEAI+ ELEKQLVKYP
Sbjct: 301 SLRRILIGGKPVKFPYKYLVPDSTGKGGAIIDSGSTFTFLDKPIFEAIAGELEKQLVKYP 360

Query: 361 RDKGVEVQSGLRPCFDISK-EKLAEFPELVLKFKGGAKLSLPPANYLALVTDDGVVCLTM 420
           R K +E ++GLRPCF+ISK E+ AEFPE+ LKFKGG KLSLPP NYL +VTD  VVCLTM
Sbjct: 361 RAKDIEAKTGLRPCFNISKEEESAEFPEVALKFKGGGKLSLPPENYLVMVTDANVVCLTM 420

Query: 421 MTDVAAVGGGGGPAIIFGAFQQQNVLVEYDLARDRIGFRKQRCT 464
           MT+   VG GGGPAIIFGAFQQQNVLVEYDLA+ RIGFRKQ+CT
Sbjct: 421 MTNAEVVGVGGGPAIIFGAFQQQNVLVEYDLAKQRIGFRKQKCT 460

BLAST of HG10016937 vs. ExPASy TrEMBL
Match: A0A1S3CHV2 (aspartic proteinase nepenthesin-2-like OS=Cucumis melo OX=3656 GN=LOC103500932 PE=3 SV=1)

HSP 1 Score: 759.2 bits (1959), Expect = 9.9e-216
Identity = 377/464 (81.25%), Postives = 413/464 (89.01%), Query Frame = 0

Query: 1   MEFFPIPFLLSIFLLLPPSSSSPISTITLPLTAFPSNSLTDDPWKTIDYLLSASLNRAQH 60
           MEF PIPFL SIFLLLP SSS   S+ITLPL  FPS   T DP KTI++LLSASL+RAQH
Sbjct: 1   MEFLPIPFLFSIFLLLPTSSS---SSITLPLATFPSIPFT-DPLKTINHLLSASLSRAQH 60

Query: 61  LKKPQTKSNSSIQNVSLFPRSYGAYSISLAFGTPPQNLSFIFDTGSSVVWFPCTASYLCS 120
           LK PQ+KSN+S +NVSLFPRSYGAY++SLAFGTPPQNLSFIFDTGSS+VWFPCTA Y C+
Sbjct: 61  LKSPQSKSNTSTENVSLFPRSYGAYAVSLAFGTPPQNLSFIFDTGSSLVWFPCTAGYRCA 120

Query: 121 NCSFPNVDAATIPKFVPKLSSSAKIIGCRNPKCAWIFGPNLKSRCRSCSPKSRNCFDSCP 180
           +CSFP+VD ATI KFVPKLSSS KI+GCRNPKCAWIFGPNLKSRCR+C+PKSR C DSCP
Sbjct: 121 HCSFPHVDPATISKFVPKLSSSVKIVGCRNPKCAWIFGPNLKSRCRNCNPKSRKCSDSCP 180

Query: 181 GYGIQYGSGATAGFLLSETLDFPKKRVPDFLVGCSVLSVHQPAGIAGFGRGPESLPSQMR 240
           GYGIQYGSGATAG LLSETLD   KRVPDFLVGCSV+SVHQPAGIAGFGRGPESLPSQMR
Sbjct: 181 GYGIQYGSGATAGILLSETLDLQNKRVPDFLVGCSVMSVHQPAGIAGFGRGPESLPSQMR 240

Query: 241 LKRFSYCLVSRRFDDSPVSSPLVLNSGSESNESKSKSLIYAPFRKNPSGSNAAFREYYYL 300
           LKRFS+CL+ R FDDSPVSSPLVL+SG ES+ESK+KS IYAPF++NPS SN AFREYYYL
Sbjct: 241 LKRFSHCLLPRGFDDSPVSSPLVLDSGPESDESKTKSFIYAPFQENPSRSNTAFREYYYL 300

Query: 301 SLRRIIIGGKPVKFPYKYLVPDSAGNGGAIIDSGSTFTFLDKPIFEAISEELEKQLVKYP 360
           SLRRI+IGGKPVKFPYKYLVPDS G GGAIIDSGSTFTFLDKPIFEAI+ ELEKQLVKYP
Sbjct: 301 SLRRILIGGKPVKFPYKYLVPDSTGKGGAIIDSGSTFTFLDKPIFEAIAGELEKQLVKYP 360

Query: 361 RDKGVEVQSGLRPCFDISK-EKLAEFPELVLKFKGGAKLSLPPANYLALVTDDGVVCLTM 420
           R K +E ++GLRPCF+ISK E+ AEFPE+ LKFKGG KLSLPP NYL +VTD  VVCLTM
Sbjct: 361 RAKDIEAKTGLRPCFNISKEEESAEFPEVALKFKGGGKLSLPPENYLVMVTDANVVCLTM 420

Query: 421 MTDVAAVGGGGGPAIIFGAFQQQNVLVEYDLARDRIGFRKQRCT 464
           MT+   VG GGGPAIIFGAFQQQNVLVEYDLA+ RIGFRKQ+CT
Sbjct: 421 MTNAEVVGVGGGPAIIFGAFQQQNVLVEYDLAKQRIGFRKQKCT 460

BLAST of HG10016937 vs. TAIR 10
Match: AT3G52500.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 495.7 bits (1275), Expect = 3.9e-140
Identity = 246/457 (53.83%), Postives = 324/457 (70.90%), Query Frame = 0

Query: 22  SPISTITLPLTAFP-SNSLTDDPWKTIDYLLSASLNRAQHLK-----KPQ-------TKS 81
           S +S + LPL+ F  S+    DP+ ++  L  +S+ RA  LK     KP        T +
Sbjct: 14  SVVSAVKLPLSPFSHSDQSPKDPYLSLRRLAESSIARAHKLKHGTSIKPDEDALSSTTTA 73

Query: 82  NSSIQNVSLFPRSYGAYSISLAFGTPPQNLSFIFDTGSSVVWFPCTASYLCSNCSFPNVD 141
           ++++    L  +SYG YS+SL+FGTP Q + F+FDTGSS+VW PCT+ YLCS C F  +D
Sbjct: 74  SATVVKSPLSAKSYGGYSVSLSFGTPSQTIPFVFDTGSSLVWLPCTSRYLCSGCDFSGLD 133

Query: 142 AATIPKFVPKLSSSAKIIGCRNPKCAWIFGPNLKSRCRSCSPKSRNCFDSCPGYGIQYGS 201
              IP+F+PK SSS+KIIGC++PKC +++GPN+  +CR C P +RNC   CP Y +QYG 
Sbjct: 134 PTLIPRFIPKNSSSSKIIGCQSPKCQFLYGPNV--QCRGCDPNTRNCTVGCPPYILQYGL 193

Query: 202 GATAGFLLSETLDFPKKRVPDFLVGCSVLSVHQPAGIAGFGRGPESLPSQMRLKRFSYCL 261
           G+TAG L++E LDFP   VPDF+VGCS++S  QPAGIAGFGRGP SLPSQM LKRFS+CL
Sbjct: 194 GSTAGVLITEKLDFPDLTVPDFVVGCSIISTRQPAGIAGFGRGPVSLPSQMNLKRFSHCL 253

Query: 262 VSRRFDDSPVSSPLVLNSGSESNE-SKSKSLIYAPFRKNPSGSNAAFREYYYLSLRRIII 321
           VSRRFDD+ V++ L L++GS  N  SK+  L Y PFRKNP+ SN AF EYYYL+LRRI +
Sbjct: 254 VSRRFDDTNVTTDLDLDTGSGHNSGSKTPGLTYTPFRKNPNVSNKAFLEYYYLNLRRIYV 313

Query: 322 GGKPVKFPYKYLVPDSAGNGGAIIDSGSTFTFLDKPIFEAISEELEKQLVKYPRDKGVEV 381
           G K VK PYKYL P + G+GG+I+DSGSTFTF+++P+FE ++EE   Q+  Y R+K +E 
Sbjct: 314 GRKHVKIPYKYLAPGTNGDGGSIVDSGSTFTFMERPVFELVAEEFASQMSNYTREKDLEK 373

Query: 382 QSGLRPCFDISKEKLAEFPELVLKFKGGAKLSLPPANYLALVTDDGVVCLTMMTD-VAAV 441
           ++GL PCF+IS +     PEL+ +FKGGAKL LP +NY   V +   VCLT+++D     
Sbjct: 374 ETGLGPCFNISGKGDVTVPELIFEFKGGAKLELPLSNYFTFVGNTDTVCLTVVSDKTVNP 433

Query: 442 GGGGGPAIIFGAFQQQNVLVEYDLARDRIGFRKQRCT 464
            GG GPAII G+FQQQN LVEYDL  DR GF K++C+
Sbjct: 434 SGGTGPAIILGSFQQQNYLVEYDLENDRFGFAKKKCS 468

BLAST of HG10016937 vs. TAIR 10
Match: AT5G45120.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 193.4 bits (490), Expect = 4.2e-49
Identity = 138/406 (33.99%), Postives = 190/406 (46.80%), Query Frame = 0

Query: 85  YSISLAFGTPPQNLSFIFDTGSSVVWFPC-TASYLCSNC-SFPNVDAATIPKFVPKLSSS 144
           Y I+L  GTPPQ +    DTGS + W PC   S+ C  C    N D  +   F P  SS+
Sbjct: 83  YLITLNIGTPPQAVQVYLDTGSDLTWVPCGNLSFDCIECYDLKNNDLKSPSVFSPLHSST 142

Query: 145 AKIIGCRNPKCAWI------FGPNLKSRCRSCSPKSRNCFDSCPGYGIQYGSGA-TAGFL 204
           +    C +  C  I      F P   + C         C   CP +   YG G   +G L
Sbjct: 143 SFRDSCASSFCVEIHSSDNPFDPCAVAGCSVSMLLKSTCVRPCPSFAYTYGEGGLISGIL 202

Query: 205 LSETLDFPKKRVPDFLVGCSVLSVHQPAGIAGFGRGPESLPSQMRL--KRFSYCLVSRRF 264
             + L    + VP F  GC   +  +P GIAGFGRG  SLPSQ+    K FS+C +  +F
Sbjct: 203 TRDILKARTRDVPRFSFGCVTSTYREPIGIAGFGRGLLSLPSQLGFLEKGFSHCFLPFKF 262

Query: 265 DDSP-VSSPLVLNSGSESNESKSKSLIYAPFRKNPSGSNAAFREYYYLSLRRIIIGGK-- 324
            ++P +SSPL+L + S  + + + SL + P    P   N+     YY+ L  I IG    
Sbjct: 263 VNNPNISSPLILGA-SALSINLTDSLQFTPMLNTPMYPNS-----YYIGLESITIGTNIT 322

Query: 325 PVKFPYKYLVPDSAGNGGAIIDSGSTFTFLDKPIFEAISEELEKQLVKYPRDKGVEVQSG 384
           P + P      DS GNGG ++DSG+T+T L +P +  +   L+   + YPR    E ++G
Sbjct: 323 PTQVPLTLRQFDSQGNGGMLVDSGTTYTHLPEPFYSQLLTTLQ-STITYPRATETESRTG 382

Query: 385 LRPCF----------DISKEKLAEFPELVLKFKGGAKLSLPPAN--YLALVTDDG--VVC 444
              C+           +  + +  FP +   F   A L LP  N  Y      DG  V C
Sbjct: 383 FDLCYKVPCPNNNLTSLENDVMMIFPSITFHFLNNATLLLPQGNSFYAMSAPSDGSVVQC 442

Query: 445 LTMMTDVAAVGGGGGPAIIFGAFQQQNVLVEYDLARDRIGFRKQRC 463
           L          G  GPA +FG+FQQQNV V YDL ++RIGF+   C
Sbjct: 443 LLFQN---MEDGDYGPAGVFGSFQQQNVKVVYDLEKERIGFQAMDC 478

BLAST of HG10016937 vs. TAIR 10
Match: AT4G16563.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 188.3 bits (477), Expect = 1.3e-47
Identity = 146/485 (30.10%), Postives = 225/485 (46.39%), Query Frame = 0

Query: 24  ISTITLPLTAFPSNSLTDDPWKT--IDYLLSASLNRAQHLKKPQTKSNSSIQNVSLFPRS 83
           +S+++ PL    S+SL+     +  +  L S+S   +   ++   K     Q +SL   S
Sbjct: 22  VSSLSTPLLLHLSHSLSTSKHSSSPLHLLKSSSSRSSARFRRHHHKQQQ--QQLSLPISS 81

Query: 84  YGAYSISLAFGTPPQNLSFIFDTGSSVVWFPCTASYLCSNCSFPNVDAATIPKFVP-KLS 143
              Y ISL+ G+    +S   DTGS +VWFPC   + C  C     ++  +P   P  LS
Sbjct: 82  GSDYLISLSVGSSSSAVSLYLDTGSDLVWFPC-RPFTCILC-----ESKPLPPSPPSSLS 141

Query: 144 SSAKIIGCRNPKCAWIFG--PNLK----SRCRSCSPKSRNCFDS---CPGYGIQYGSGAT 203
           SSA  + C +P C+      P+      S C     ++ +C  S   CP +   YG G+ 
Sbjct: 142 SSATTVSCSSPSCSAAHSSLPSSDLCAISNCPLDFIETGDCNTSSYPCPPFYYAYGDGSL 201

Query: 204 AGFLLSETLDFPKKRVPDFLVGCSVLSVHQPAGIAGFGRGPESLPSQMRL------KRFS 263
              L S++L  P   V +F  GC+  ++ +P G+AGFGRG  SLP+Q+ +        FS
Sbjct: 202 VAKLYSDSLSLPSVSVSNFTFGCAHTTLAEPIGVAGFGRGRLSLPAQLAVHSPHLGNSFS 261

Query: 264 YCLVSRRFDDSPV--SSPLVL----------------NSGSESNESKSKSLIYAPFRKNP 323
           YCLVS  FD   V   SPL+L                +   +  + K    ++    +NP
Sbjct: 262 YCLVSHSFDSDRVRRPSPLILGRFVDKKEKRVGTTDDHDDGDDEKKKKNEFVFTEMLENP 321

Query: 324 SGSNAAFREYYYLSLRRIIIGGKPVKFPYKYLVPDSAGNGGAIIDSGSTFTFLDKPIFEA 383
                    +Y +SL+ I IG + +  P      D  G GG ++DSG+TFT L    + +
Sbjct: 322 K-----HPYFYSVSLQGISIGKRNIPAPAMLRRIDKNGGGGVVVDSGTTFTMLPAKFYNS 381

Query: 384 ISEELEKQLVK-YPRDKGVEVQSGLRPCFDISKEKLAEFPELVLKFKGG-AKLSLPPANY 443
           + EE + ++ + + R   VE  SG+ PC+ ++  +  + P LVL F G  + ++LP  NY
Sbjct: 382 VVEEFDSRVGRVHERADRVEPSSGMSPCYYLN--QTVKVPALVLHFAGNRSSVTLPRRNY 441

Query: 444 LALVTDDG--------VVCLTMMTDVAAVGGGGGPAIIFGAFQQQNVLVEYDLARDRIGF 463
                D G        + CL +M         GG   I G +QQQ   V YDL   R+GF
Sbjct: 442 FYEFMDGGDGKEEKRKIGCLMLMNGGDESELRGGTGAILGNYQQQGFEVVYDLLNRRVGF 491

BLAST of HG10016937 vs. TAIR 10
Match: AT3G61820.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 166.8 bits (421), Expect = 4.2e-41
Identity = 135/430 (31.40%), Postives = 183/430 (42.56%), Query Frame = 0

Query: 45  KTIDYLLSASLNRAQHLKKPQTKSNSSIQNVSLFPRSYGAYSISLAFGTPPQNLSFIFDT 104
           K+I  L + S  R    + P+T    S   +S   +  G Y + L  GTP  N+  + DT
Sbjct: 95  KSITSLAAVSTGRNATKRTPRTAGGFSGAVISGLSQGSGEYFMRLGVGTPATNVYMVLDT 154

Query: 105 GSSVVWFPCTASYLCSNCSFPNVDAATIPKFVPKLSSSAKIIGCRNPKCAWIFGPNLKSR 164
           GS VVW  C+    C N      DA     F PK S +   + C +  C       L   
Sbjct: 155 GSDVVWLQCSPCKACYN----QTDAI----FDPKKSKTFATVPCGSRLCR-----RLDDS 214

Query: 165 CRSCSPKSRNCFDSCPGYGIQYGSGA-TAGFLLSETLDFPKKRVPDFLVGCSVLSVHQ-- 224
               + +S+ C      Y + YG G+ T G   +ETL F   RV    +GC     H   
Sbjct: 215 SECVTRRSKTCL-----YQVSYGDGSFTEGDFSTETLTFHGARVDHVPLGCG----HDNE 274

Query: 225 -----PAGIAGFGRGPESLPSQMRLK---RFSYCLVSRRFDDSPVSSPLVLNSGSESNES 284
                 AG+ G GRG  S PSQ + +   +FSYCLV R    S    P  +  G   N +
Sbjct: 275 GLFVGAAGLLGLGRGGLSFPSQTKNRYNGKFSYCLVDRTSSGSSSKPPSTIVFG---NAA 334

Query: 285 KSKSLIYAPFRKNPSGSNAAFREYYYLSLRRIIIGGKPVK-FPYKYLVPDSAGNGGAIID 344
             K+ ++ P   NP         +YYL L  I +GG  V          D+ GNGG IID
Sbjct: 335 VPKTSVFTPLLTNPK-----LDTFYYLQLLGISVGGSRVPGVSESQFKLDATGNGGVIID 394

Query: 345 SGSTFTFLDKPIFEAISEELEKQLVKYPRDKGVEVQSGLRPCFDISKEKLAEFPELVLKF 404
           SG++ T L +P + A+ +       K  R     +      CFD+S     + P +V  F
Sbjct: 395 SGTSVTRLTQPAYVALRDAFRLGATKLKRAPSYSL---FDTCFDLSGMTTVKVPTVVFHF 454

Query: 405 KGGAKLSLPPANYLALVTDDGVVCLTMMTDVAAVGGGGGPAIIFGAFQQQNVLVEYDLAR 463
            GG ++SLP +NYL  V  +G  C        A  G  G   I G  QQQ   V YDL  
Sbjct: 455 -GGGEVSLPASNYLIPVNTEGRFCF-------AFAGTMGSLSIIGNIQQQGFRVAYDLVG 483

BLAST of HG10016937 vs. TAIR 10
Match: AT2G42980.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 159.8 bits (403), Expect = 5.1e-39
Identity = 124/399 (31.08%), Postives = 187/399 (46.87%), Query Frame = 0

Query: 83  GAYSISLAFGTPPQNLSFIFDTGSSVVWFPCTASYLCSNCSFPNVDAATIPKFVPKLSSS 142
           G Y + +  GTPP++ S I DTGS + W  C   Y C + +    D        PK S+S
Sbjct: 158 GEYFMDVLVGTPPKHFSLILDTGSDLNWLQCLPCYDCFHQNGMFYD--------PKTSAS 217

Query: 143 AKIIGCRNPKCAWIFGPNLKSRCRSCSPKSRNCFDSCPGYGIQYGSGA-TAGFLLSETLD 202
            K I C +P+C+ I  P+   +C S +        SCP Y   YG  + T G    ET  
Sbjct: 218 FKNITCNDPRCSLISSPDPPVQCESDN-------QSCP-YFYWYGDRSNTTGDFAVETFT 277

Query: 203 F---------PKKRVPDFLVGCSVLS---VHQPAGIAGFGRGPESLPSQMRL---KRFSY 262
                      + +V + + GC   +       +G+ G GRGP S  SQ++      FSY
Sbjct: 278 VNLTTTEGGSSEYKVGNMMFGCGHWNRGLFSGASGLLGLGRGPLSFSSQLQSLYGHSFSY 337

Query: 263 CLVSRRFDDSPVSSPLVLNSGSESNESKSKSLIYAPFRKNPSGSNAAFREYYYLSLRRII 322
           CLV R   ++ VSS L+   G + +     +L +  F    +G   +   +YY+ ++ I+
Sbjct: 338 CLVDRN-SNTNVSSKLIF--GEDKDLLNHTNLNFTSF---VNGKENSVETFYYIQIKSIL 397

Query: 323 IGGKPVKFPYKYLVPDSAGNGGAIIDSGSTFTFLDKPIFEAISEEL-EKQLVKYPRDKGV 382
           +GGK +  P +     S G+GG IIDSG+T ++  +P +E I  +  EK    YP  +  
Sbjct: 398 VGGKALDIPEETWNISSDGDGGTIIDSGTTLSYFAEPAYEIIKNKFAEKMKENYPIFRDF 457

Query: 383 EVQSGLRPCFDIS--KEKLAEFPELVLKFKGGAKLSLPPANYLALVTDDGVVCLTMMTDV 442
            V   L PCF++S  +E     PEL + F  G   + P  N    +++D +VCL      
Sbjct: 458 PV---LDPCFNVSGIEENNIHLPELGIAFVDGTVWNFPAENSFIWLSED-LVCL------ 517

Query: 443 AAVGGGGGPAIIFGAFQQQNVLVEYDLARDRIGFRKQRC 463
           A +G       I G +QQQN  + YD  R R+GF   +C
Sbjct: 518 AILGTPKSTFSIIGNYQQQNFHILYDTKRSRLGFTPTKC 524

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038881211.15.7e-23488.55probable aspartyl protease At4g16563 [Benincasa hispida][more]
XP_023543736.13.0e-22784.67probable aspartyl protease At4g16563 [Cucurbita pepo subsp. pepo][more]
XP_022979057.17.5e-22684.45probable aspartyl protease At4g16563 [Cucurbita maxima][more]
XP_022925946.19.2e-22483.37probable aspartyl protease At4g16563 [Cucurbita moschata] >KAG6604319.1 Aspartic... [more]
KAG7034471.11.6e-22383.37Aspartic proteinase nepenthesin-1, partial [Cucurbita argyrosperma subsp. argyro... [more]
Match NameE-valueIdentityDescription
Q940R41.9e-4630.10Probable aspartyl protease At4g16563 OS=Arabidopsis thaliana OX=3702 GN=At4g1656... [more]
Q9LNJ35.7e-3530.54Aspartyl protease family protein 2 OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 ... [more]
Q766C29.7e-3531.27Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 S... [more]
Q766C31.7e-3430.23Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 S... [more]
Q9LHE31.8e-2826.18Protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Arabidopsis thaliana OX=3702 GN=ASP... [more]
Match NameE-valueIdentityDescription
A0A6J1IMR73.6e-22684.45probable aspartyl protease At4g16563 OS=Cucurbita maxima OX=3661 GN=LOC111478813... [more]
A0A6J1EDJ04.4e-22483.37probable aspartyl protease At4g16563 OS=Cucurbita moschata OX=3662 GN=LOC1114332... [more]
A0A0A0KHK24.2e-22284.27Peptidase A1 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G45447... [more]
A0A5A7SGF99.9e-21681.25Aspartic proteinase nepenthesin-2-like OS=Cucumis melo var. makuwa OX=1194695 GN... [more]
A0A1S3CHV29.9e-21681.25aspartic proteinase nepenthesin-2-like OS=Cucumis melo OX=3656 GN=LOC103500932 P... [more]
Match NameE-valueIdentityDescription
AT3G52500.13.9e-14053.83Eukaryotic aspartyl protease family protein [more]
AT5G45120.14.2e-4933.99Eukaryotic aspartyl protease family protein [more]
AT4G16563.11.3e-4730.10Eukaryotic aspartyl protease family protein [more]
AT3G61820.14.2e-4131.40Eukaryotic aspartyl protease family protein [more]
AT2G42980.15.1e-3931.08Eukaryotic aspartyl protease family protein [more]
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR001461Aspartic peptidase A1 familyPRINTSPR00792PEPSINcoord: 329..340
score: 36.31
coord: 91..111
score: 51.9
IPR021109Aspartic peptidase domain superfamilyGENE3D2.40.70.10Acid Proteasescoord: 266..463
e-value: 8.8E-49
score: 167.6
IPR021109Aspartic peptidase domain superfamilyGENE3D2.40.70.10Acid Proteasescoord: 66..259
e-value: 1.2E-31
score: 112.2
IPR021109Aspartic peptidase domain superfamilySUPERFAMILY50630Acid proteasescoord: 80..462
IPR032861Xylanase inhibitor, N-terminalPFAMPF14543TAXi_Ncoord: 85..255
e-value: 3.5E-28
score: 99.0
IPR032799Xylanase inhibitor, C-terminalPFAMPF14541TAXi_Ccoord: 298..458
e-value: 1.6E-33
score: 115.8
NoneNo IPR availablePANTHERPTHR47967:SF36BNACNNG47670D PROTEINcoord: 8..462
NoneNo IPR availablePANTHERPTHR47967OS07G0603500 PROTEIN-RELATEDcoord: 8..462
IPR001969Aspartic peptidase, active sitePROSITEPS00141ASP_PROTEASEcoord: 100..111
IPR033121Peptidase family A1 domainPROSITEPS51767PEPTIDASE_A1coord: 85..458
score: 34.975674
IPR034161Pepsin-like domain, plantCDDcd05476pepsin_A_like_plantcoord: 84..462
e-value: 2.27813E-79
score: 245.635

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
HG10016937.1HG10016937.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006508 proteolysis
molecular_function GO:0004190 aspartic-type endopeptidase activity