HG10012598 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10012598
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: aspartic proteinase CDR1-like
Location: Chr01: 22677847 .. 22679163 (+)
RNA-Seq Expression: HG10012598
Synteny: HG10012598
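The location string in the overview can be turned into a feature length with a few lines of code. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual GFF-style convention; the page itself does not state this). The function names `parse_location` and `feature_length` are hypothetical helpers, not part of any database API.

```python
import re

def parse_location(text):
    """Split a string like 'Chr01: 22677847 .. 22679163 (+)' into its parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

def feature_length(start, end):
    """Length of a feature, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1

seqid, start, end, strand = parse_location("Chr01: 22677847 .. 22679163 (+)")
print(seqid, strand, feature_length(start, end))  # Chr01 + 1317
```

Under that assumption the gene spans 1317 bp, which is a multiple of three, as expected if the whole span is coding sequence.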
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGATTCTCTTCCTTTTGTTTACCATCCTTGAATCAACGGCATGGTATATGGTACCAACTGAAGTTGGCTTCACTGCACGTTTGATTCACCGTGATTCACCTTTATCACCATTTTACGATCACGCCCTGACAGACACTGCACGGATAGAGGCGACCGTTCATCGTTCTAGGTCCCGGCTGAATTATCTGTATTACAACATGTTATCAGAAAATACATTAGACAATGATGTGTCACTATCACCAACATTAGTTAATGAAGGTGGTGAGTACCTTATGAGTTTCAACATTGGAAATCCTCCAAGTCAAGTGATGGGGTTTGCAGACACATCAAATGGTCTCATTTGGGTGCAATGTTCAGACTGCAATAGCCAATGTGAGCCAGAGAAAGGAGGCTCCACCACCAAGTTCCTCTCTTCCAAGTCCTTCACCTATGAGATGGAGCCATGTGGCTCTAACTTCTGCAACTCCTTAACTGGCTTCCAGACCTGCAATTCATCTGACAAATGGTGCAAATATAGATTAGTGTATGAAGATAATAAAGCAACAAGTGGAATTCTTTCATCTGATAGTTTTAGTTTTGATACCTCAGATGGGAAACTTGTGGATGTTGGCTATTTGAACTTTGGCTGTTCAGAAGCTCCTTTCACAGGAGATATACAGAGTTATACAGGGAGTGTGGGCTTGAACCAAACACCCTTGTCATTAATTTCTCAATTGGGTATCAAAAAGTTCTCCTACTGCTTGGTTCCTTTCAATAATCTAGGATCAGCAAGTAAAATGTATTTTGGATCATTACCTGTGACTTCTGGGGGTCAAACTCCTCTGTTATATCCCAATTCAGATGCTTATTATGTGAAGGTTCTAGGAATCAGCCTCGGCAATGATGAGGCCCACTTAGATAGAGTTTTTGATGTATATGATGTCAGAGATGGGTGGATTATAGATTCAGGAATAACATACTCAAGTCTTGAAACAGATGCATTTGATAGTTTGCTAGCTAAATTCCTTACACTACCAGATTTACCACAGAAAAAAATTGACCCTAGAGACAGATTCGAGTTGTGCTTTGCAGCAAATGCAAATGATTTGGAGTCATTTCCAGATGTTACAGTTCATTTCGATGGTGCAGATTTAATTCTTAATGTAGAAAGTACCTTTGTGAAGATAGAGGATGATGGAATTTTCTGCCTTGCCCTTCTGCGTTCTGGATCTCCAGTTTCTATATTAGGGAACTTTCAGCTGCAAAACTACCATGTTGGGTATGACCTTGAAGCTCAAGTTATTTCCTTTGCTCCTGTTGACTGTGCTGATTCCTAA

mRNA sequence

ATGATTCTCTTCCTTTTGTTTACCATCCTTGAATCAACGGCATGGTATATGGTACCAACTGAAGTTGGCTTCACTGCACGTTTGATTCACCGTGATTCACCTTTATCACCATTTTACGATCACGCCCTGACAGACACTGCACGGATAGAGGCGACCGTTCATCGTTCTAGGTCCCGGCTGAATTATCTGTATTACAACATGTTATCAGAAAATACATTAGACAATGATGTGTCACTATCACCAACATTAGTTAATGAAGGTGGTGAGTACCTTATGAGTTTCAACATTGGAAATCCTCCAAGTCAAGTGATGGGGTTTGCAGACACATCAAATGGTCTCATTTGGGTGCAATGTTCAGACTGCAATAGCCAATGTGAGCCAGAGAAAGGAGGCTCCACCACCAAGTTCCTCTCTTCCAAGTCCTTCACCTATGAGATGGAGCCATGTGGCTCTAACTTCTGCAACTCCTTAACTGGCTTCCAGACCTGCAATTCATCTGACAAATGGTGCAAATATAGATTAGTGTATGAAGATAATAAAGCAACAAGTGGAATTCTTTCATCTGATAGTTTTAGTTTTGATACCTCAGATGGGAAACTTGTGGATGTTGGCTATTTGAACTTTGGCTGTTCAGAAGCTCCTTTCACAGGAGATATACAGAGTTATACAGGGAGTGTGGGCTTGAACCAAACACCCTTGTCATTAATTTCTCAATTGGGTATCAAAAAGTTCTCCTACTGCTTGGTTCCTTTCAATAATCTAGGATCAGCAAGTAAAATGTATTTTGGATCATTACCTGTGACTTCTGGGGGTCAAACTCCTCTGTTATATCCCAATTCAGATGCTTATTATGTGAAGGTTCTAGGAATCAGCCTCGGCAATGATGAGGCCCACTTAGATAGAGTTTTTGATGTATATGATGTCAGAGATGGGTGGATTATAGATTCAGGAATAACATACTCAAGTCTTGAAACAGATGCATTTGATAGTTTGCTAGCTAAATTCCTTACACTACCAGATTTACCACAGAAAAAAATTGACCCTAGAGACAGATTCGAGTTGTGCTTTGCAGCAAATGCAAATGATTTGGAGTCATTTCCAGATGTTACAGTTCATTTCGATGGTGCAGATTTAATTCTTAATGTAGAAAGTACCTTTGTGAAGATAGAGGATGATGGAATTTTCTGCCTTGCCCTTCTGCGTTCTGGATCTCCAGTTTCTATATTAGGGAACTTTCAGCTGCAAAACTACCATGTTGGGTATGACCTTGAAGCTCAAGTTATTTCCTTTGCTCCTGTTGACTGTGCTGATTCCTAA

Coding sequence (CDS)

ATGATTCTCTTCCTTTTGTTTACCATCCTTGAATCAACGGCATGGTATATGGTACCAACTGAAGTTGGCTTCACTGCACGTTTGATTCACCGTGATTCACCTTTATCACCATTTTACGATCACGCCCTGACAGACACTGCACGGATAGAGGCGACCGTTCATCGTTCTAGGTCCCGGCTGAATTATCTGTATTACAACATGTTATCAGAAAATACATTAGACAATGATGTGTCACTATCACCAACATTAGTTAATGAAGGTGGTGAGTACCTTATGAGTTTCAACATTGGAAATCCTCCAAGTCAAGTGATGGGGTTTGCAGACACATCAAATGGTCTCATTTGGGTGCAATGTTCAGACTGCAATAGCCAATGTGAGCCAGAGAAAGGAGGCTCCACCACCAAGTTCCTCTCTTCCAAGTCCTTCACCTATGAGATGGAGCCATGTGGCTCTAACTTCTGCAACTCCTTAACTGGCTTCCAGACCTGCAATTCATCTGACAAATGGTGCAAATATAGATTAGTGTATGAAGATAATAAAGCAACAAGTGGAATTCTTTCATCTGATAGTTTTAGTTTTGATACCTCAGATGGGAAACTTGTGGATGTTGGCTATTTGAACTTTGGCTGTTCAGAAGCTCCTTTCACAGGAGATATACAGAGTTATACAGGGAGTGTGGGCTTGAACCAAACACCCTTGTCATTAATTTCTCAATTGGGTATCAAAAAGTTCTCCTACTGCTTGGTTCCTTTCAATAATCTAGGATCAGCAAGTAAAATGTATTTTGGATCATTACCTGTGACTTCTGGGGGTCAAACTCCTCTGTTATATCCCAATTCAGATGCTTATTATGTGAAGGTTCTAGGAATCAGCCTCGGCAATGATGAGGCCCACTTAGATAGAGTTTTTGATGTATATGATGTCAGAGATGGGTGGATTATAGATTCAGGAATAACATACTCAAGTCTTGAAACAGATGCATTTGATAGTTTGCTAGCTAAATTCCTTACACTACCAGATTTACCACAGAAAAAAATTGACCCTAGAGACAGATTCGAGTTGTGCTTTGCAGCAAATGCAAATGATTTGGAGTCATTTCCAGATGTTACAGTTCATTTCGATGGTGCAGATTTAATTCTTAATGTAGAAAGTACCTTTGTGAAGATAGAGGATGATGGAATTTTCTGCCTTGCCCTTCTGCGTTCTGGATCTCCAGTTTCTATATTAGGGAACTTTCAGCTGCAAAACTACCATGTTGGGTATGACCTTGAAGCTCAAGTTATTTCCTTTGCTCCTGTTGACTGTGCTGATTCCTAA

Protein sequence

MILFLLFTILESTAWYMVPTEVGFTARLIHRDSPLSPFYDHALTDTARIEATVHRSRSRLNYLYYNMLSENTLDNDVSLSPTLVNEGGEYLMSFNIGNPPSQVMGFADTSNGLIWVQCSDCNSQCEPEKGGSTTKFLSSKSFTYEMEPCGSNFCNSLTGFQTCNSSDKWCKYRLVYEDNKATSGILSSDSFSFDTSDGKLVDVGYLNFGCSEAPFTGDIQSYTGSVGLNQTPLSLISQLGIKKFSYCLVPFNNLGSASKMYFGSLPVTSGGQTPLLYPNSDAYYVKVLGISLGNDEAHLDRVFDVYDVRDGWIIDSGITYSSLETDAFDSLLAKFLTLPDLPQKKIDPRDRFELCFAANANDLESFPDVTVHFDGADLILNVESTFVKIEDDGIFCLALLRSGSPVSILGNFQLQNYHVGYDLEAQVISFAPVDCADS
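The relationship between the coding sequence and the protein sequence above can be checked by translating the CDS. This is a minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1); the page does not say which table was used. `translate` is a hypothetical helper, and unknown codons are reported as `X`.

```python
# Standard genetic code (NCBI table 1), codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length must be a multiple of 3")
    protein = []
    for pos in range(0, len(cds), 3):
        residue = CODON_TABLE.get(cds[pos:pos + 3], "X")  # X = unknown codon
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)

# First 30 nt and last 18 nt of the CDS listed above.
print(translate("ATGATTCTCTTCCTTTTGTTTACCATCCTT"))  # MILFLLFTIL
print(translate("GACTGTGCTGATTCCTAA"))              # DCADS
```

Both fragments agree with the start (`MILFLLFTIL`) and end (`DCADS`) of the listed protein sequence.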
Homology
BLAST of HG10012598 vs. NCBI nr
Match: XP_008439375.1 (PREDICTED: aspartic proteinase CDR1-like [Cucumis melo])

HSP 1 Score: 807.4 bits (2084), Expect = 6.2e-230
Identity = 398/440 (90.45%), Positives = 412/440 (93.64%), Query Frame = 0

Query: 1   MILFLLFTILESTAWYMVPTEVGFTARLIHRDSPLSPFYDHALTDTARIEATVHRSRSRL 60
           MIL+LLFTILES   +MV  EVGFTARLIH DSPLSPFY+HA+T TARIEATVHRSRSRL
Sbjct: 1   MILYLLFTILESKGMHMVSNEVGFTARLIHHDSPLSPFYNHAMTGTARIEATVHRSRSRL 60

Query: 61  NYLYY-NMLSENTLDNDVSLSPTLVNEGGEYLMSFNIGNPPSQVMGFADTSNGLIWVQCS 120
           +YLYY N LSENTLDNDVSLSPTLVNEGGEYLMSFNIGNPPSQVMGF DTSNGLIWVQCS
Sbjct: 61  SYLYYINKLSENTLDNDVSLSPTLVNEGGEYLMSFNIGNPPSQVMGFLDTSNGLIWVQCS 120

Query: 121 DCNSQCEPEKGGSTTKFLSSKSFTYEMEPCGSNFCNSLTGFQTCNSSDKWCKYRLVYEDN 180
           +CNSQCEPEK G TTKFLSSKSFTYEMEPCGSNFCNSLTGF+TCNSSDKWCKYRLVY DN
Sbjct: 121 NCNSQCEPEKRGPTTKFLSSKSFTYEMEPCGSNFCNSLTGFKTCNSSDKWCKYRLVYGDN 180

Query: 181 KATSGILSSDSFSFDTSDGKLVDVGYLNFGCSEAPFTGDIQSYTGSVGLNQTPLSLISQL 240
           KATSGILSSDSF FDTSDGKLVDVG+LNFGCSEAP TGD QSYTG VGLNQTPLSLISQL
Sbjct: 181 KATSGILSSDSFGFDTSDGKLVDVGFLNFGCSEAPLTGDEQSYTGRVGLNQTPLSLISQL 240

Query: 241 GIKKFSYCLVPFNNLGSASKMYFGSLPVTSGGQTPLLYPNSDAYYVKVLGISLGNDEAHL 300
           GIKKFSYCLVPFN+LGS SKMYFGSLPVTSGGQTPLLYPNSDAYYVKVLGIS+GNDE H 
Sbjct: 241 GIKKFSYCLVPFNSLGSTSKMYFGSLPVTSGGQTPLLYPNSDAYYVKVLGISIGNDEPHF 300

Query: 301 DRVFDVYDVRDGWIIDSGITYSSLETDAFDSLLAKFLTLPDLPQKKIDPRDRFELCF-AA 360
           D VFDVYDVRDGWIID+GITYSSLETDAFDSLLAKFL L + PQ+K DP+DRFELCF  A
Sbjct: 301 DGVFDVYDVRDGWIIDTGITYSSLETDAFDSLLAKFLALKNFPQRKNDPKDRFELCFELA 360

Query: 361 NANDLESFPDVTVHFDGADLILNVESTFVKIEDDGIFCLALLRSGSPVSILGNFQLQNYH 420
           NANDLESFPD TVHFDGADLILNVESTFVKIEDDGIFCLALLRSGSPVSILGNFQLQNYH
Sbjct: 361 NANDLESFPDATVHFDGADLILNVESTFVKIEDDGIFCLALLRSGSPVSILGNFQLQNYH 420

Query: 421 VGYDLEAQVISFAPVDCADS 439
           VGYDLEAQVISFAPVDCADS
Sbjct: 421 VGYDLEAQVISFAPVDCADS 440
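The percentages in an HSP summary line are simply the counts divided by the alignment length. The sketch below parses such a line and recomputes them; it assumes the line layout shown on this page, and `parse_hsp_stats` and `percent` are hypothetical helper names rather than functions of any BLAST library.

```python
import re

# Accepts "Positives" as well as the "Postives" spelling that some exports use.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)

def parse_hsp_stats(line):
    """Extract identity/positive counts and reported percentages from one line."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, length, ident_pct, pos, _pos_length, pos_pct = m.groups()
    return {
        "identities": int(ident),
        "positives": int(pos),
        "length": int(length),
        "identity_pct": float(ident_pct),
        "positive_pct": float(pos_pct),
    }

def percent(count, length):
    """Percentage rounded to two decimals, as displayed in the summary line."""
    return round(100.0 * count / length, 2)

stats = parse_hsp_stats("Identity = 398/440 (90.45%), Positives = 412/440 (93.64%), Query Frame = 0")
print(percent(stats["identities"], stats["length"]))  # 90.45
print(percent(stats["positives"], stats["length"]))   # 93.64
```

For the Cucumis melo hit above, the recomputed values match the reported 90.45% identity and 93.64% positives.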

BLAST of HG10012598 vs. NCBI nr
Match: XP_004134471.3 (aspartic proteinase CDR1 [Cucumis sativus] >KAE8650377.1 hypothetical protein Csa_011170 [Cucumis sativus])

HSP 1 Score: 805.4 bits (2079), Expect = 2.4e-229
Identity = 397/439 (90.43%), Positives = 409/439 (93.17%), Query Frame = 0

Query: 2   ILFLLFTILESTAWYMVPTEVGFTARLIHRDSPLSPFYDHALTDTARIEATVHRSRSRLN 61
           ILFLLFTI ES   +MV  EVGFTARLIH DSPLSPFY+H +TDTARIEATVHRSRSRLN
Sbjct: 11  ILFLLFTIFESKGMHMVSNEVGFTARLIHHDSPLSPFYNHTMTDTARIEATVHRSRSRLN 70

Query: 62  YLYY-NMLSENTLDNDVSLSPTLVNEGGEYLMSFNIGNPPSQVMGFADTSNGLIWVQCSD 121
           YLYY N LSEN LDNDVSLSPTLVNEGGEYLMSFNIGNP SQVMGF DTSNGLIWVQCS+
Sbjct: 71  YLYYINKLSENALDNDVSLSPTLVNEGGEYLMSFNIGNPSSQVMGFLDTSNGLIWVQCSN 130

Query: 122 CNSQCEPEKGGSTTKFLSSKSFTYEMEPCGSNFCNSLTGFQTCNSSDKWCKYRLVYEDNK 181
           CNSQCEPEK G TTKFLSSKSFTYEMEPCGSNFCNSLTGFQTCNSSDKWCKYRLVY DNK
Sbjct: 131 CNSQCEPEKRGLTTKFLSSKSFTYEMEPCGSNFCNSLTGFQTCNSSDKWCKYRLVYGDNK 190

Query: 182 ATSGILSSDSFSFDTSDGKLVDVGYLNFGCSEAPFTGDIQSYTGSVGLNQTPLSLISQLG 241
           ATSGILSSDSF FDTSDG LVDVG+LNFGCSEAP TGD QSYTG+VGLNQTPLSLISQLG
Sbjct: 191 ATSGILSSDSFGFDTSDGMLVDVGFLNFGCSEAPLTGDEQSYTGNVGLNQTPLSLISQLG 250

Query: 242 IKKFSYCLVPFNNLGSASKMYFGSLPVTSGGQTPLLYPNSDAYYVKVLGISLGNDEAHLD 301
           IKKFSYCLVPFNNLGS SKMYFGSLPVTSGGQTPLLYPNSDAYYVKVLGIS+GNDE H D
Sbjct: 251 IKKFSYCLVPFNNLGSTSKMYFGSLPVTSGGQTPLLYPNSDAYYVKVLGISIGNDEPHFD 310

Query: 302 RVFDVYDVRDGWIIDSGITYSSLETDAFDSLLAKFLTLPDLPQKKIDPRDRFELCF-AAN 361
            VFDVY+VRDGWIID+GITYSSLETDAFDSLLAKFLTL D PQ+K DP++RFELCF   N
Sbjct: 311 GVFDVYEVRDGWIIDTGITYSSLETDAFDSLLAKFLTLKDFPQRKDDPKERFELCFELQN 370

Query: 362 ANDLESFPDVTVHFDGADLILNVESTFVKIEDDGIFCLALLRSGSPVSILGNFQLQNYHV 421
           ANDLESFPDVTVHFDGADLILNVESTFVKIEDDGIFCLALLRSGSPVSILGNFQLQNYHV
Sbjct: 371 ANDLESFPDVTVHFDGADLILNVESTFVKIEDDGIFCLALLRSGSPVSILGNFQLQNYHV 430

Query: 422 GYDLEAQVISFAPVDCADS 439
           GYDLEAQVISFAPVDCADS
Sbjct: 431 GYDLEAQVISFAPVDCADS 449
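The identity count in an HSP header can be cross-checked against the alignment rows themselves by comparing the Query and Sbjct strings column by column. A minimal sketch, assuming `-` marks a gap and that both rows have already been stripped of their labels and coordinates; `alignment_counts` is a hypothetical helper.

```python
def alignment_counts(query_row, sbjct_row, gap="-"):
    """Return (identities, gap_columns, aligned_length) for two alignment rows."""
    if len(query_row) != len(sbjct_row):
        raise ValueError("alignment rows must have equal length")
    identities = 0
    gaps = 0
    for q, s in zip(query_row, sbjct_row):
        if q == gap or s == gap:
            gaps += 1          # a gap in either row is never an identity
        elif q == s:
            identities += 1
    return identities, gaps, len(query_row)

# First 15 columns of the second alignment block of the Cucumis sativus hit.
print(alignment_counts("YLYY-NMLSENTLDN", "YLYYINKLSENALDN"))  # (12, 1, 15)
```

Summing these counts over all blocks of one HSP should reproduce the numerator and denominator of its "Identity" figure.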

BLAST of HG10012598 vs. NCBI nr
Match: KAA0049446.1 (aspartic proteinase CDR1-like [Cucumis melo var. makuwa] >TYK16125.1 aspartic proteinase CDR1-like [Cucumis melo var. makuwa])

HSP 1 Score: 791.2 bits (2042), Expect = 4.6e-225
Identity = 387/425 (91.06%), Positives = 400/425 (94.12%), Query Frame = 0

Query: 16  YMVPTEVGFTARLIHRDSPLSPFYDHALTDTARIEATVHRSRSRLNYLYY-NMLSENTLD 75
           +MV  EVGFTARLIH DSPLSPFY+HA+T TARIEATVHRSRSRL+YLYY N LSENTLD
Sbjct: 2   HMVSNEVGFTARLIHHDSPLSPFYNHAMTGTARIEATVHRSRSRLSYLYYINKLSENTLD 61

Query: 76  NDVSLSPTLVNEGGEYLMSFNIGNPPSQVMGFADTSNGLIWVQCSDCNSQCEPEKGGSTT 135
           NDVSLSPTLVNEGGEYLMSFNIGNPPSQVMGF DTSNGLIWVQCS+CNSQCEPEK G TT
Sbjct: 62  NDVSLSPTLVNEGGEYLMSFNIGNPPSQVMGFLDTSNGLIWVQCSNCNSQCEPEKRGPTT 121

Query: 136 KFLSSKSFTYEMEPCGSNFCNSLTGFQTCNSSDKWCKYRLVYEDNKATSGILSSDSFSFD 195
           KFLSSKSFTYEMEPCGSNFCNSLTGF+TCNSSDKWCKYRLVY DNKATSGILSSDSF FD
Sbjct: 122 KFLSSKSFTYEMEPCGSNFCNSLTGFKTCNSSDKWCKYRLVYGDNKATSGILSSDSFGFD 181

Query: 196 TSDGKLVDVGYLNFGCSEAPFTGDIQSYTGSVGLNQTPLSLISQLGIKKFSYCLVPFNNL 255
           TSDGKLVDVG+LNFGCSEAP TGD QSYTG VGLNQTPLSLISQLGIKKFSYCLVPFN+L
Sbjct: 182 TSDGKLVDVGFLNFGCSEAPLTGDEQSYTGRVGLNQTPLSLISQLGIKKFSYCLVPFNSL 241

Query: 256 GSASKMYFGSLPVTSGGQTPLLYPNSDAYYVKVLGISLGNDEAHLDRVFDVYDVRDGWII 315
           GS SKMYFGSLPVTSGGQTPLLYPNSDAYYVKVLGIS+GNDE H D VFDVYDVRDGWII
Sbjct: 242 GSTSKMYFGSLPVTSGGQTPLLYPNSDAYYVKVLGISIGNDEPHFDGVFDVYDVRDGWII 301

Query: 316 DSGITYSSLETDAFDSLLAKFLTLPDLPQKKIDPRDRFELCF-AANANDLESFPDVTVHF 375
           D+GITYSSLETDAFDSLLAKFL L + PQ+K DP+DRFELCF  ANANDLESFPD TVHF
Sbjct: 302 DTGITYSSLETDAFDSLLAKFLALKNFPQRKNDPKDRFELCFELANANDLESFPDATVHF 361

Query: 376 DGADLILNVESTFVKIEDDGIFCLALLRSGSPVSILGNFQLQNYHVGYDLEAQVISFAPV 435
           DGADLILNVESTFVKIEDDGIFCLALLRSGSPVSILGNFQLQNYHVGYDLEAQVISFAPV
Sbjct: 362 DGADLILNVESTFVKIEDDGIFCLALLRSGSPVSILGNFQLQNYHVGYDLEAQVISFAPV 421

Query: 436 DCADS 439
           DCADS
Sbjct: 422 DCADS 426

BLAST of HG10012598 vs. NCBI nr
Match: XP_023528351.1 (aspartic proteinase CDR1-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 790.0 bits (2039), Expect = 1.0e-224
Identity = 388/438 (88.58%), Positives = 410/438 (93.61%), Query Frame = 0

Query: 1   MILFLLFTILESTAWYMVPTEVGFTARLIHRDSPLSPFYDHALTDTARIEATVHRSRSRL 60
           M+ FL  TILESTA +MVPTEVGFTARLIHRDSP+SPFYDH +T+TA+IEATVHRSRSRL
Sbjct: 10  MMFFLSLTILESTARHMVPTEVGFTARLIHRDSPVSPFYDHVMTNTAQIEATVHRSRSRL 69

Query: 61  NYLYYNMLSENTLDNDVSLSPTLVNEGGEYLMSFNIGNPPSQVMGFADTSNGLIWVQCSD 120
           NYLYYNMLS+NTLDND+SLSPTLV+EGGEYLMSFNIGNPPSQVMGFADTSNGLIWVQCSD
Sbjct: 70  NYLYYNMLSKNTLDNDLSLSPTLVHEGGEYLMSFNIGNPPSQVMGFADTSNGLIWVQCSD 129

Query: 121 CNSQCEPEKGGSTTKFLSSKSFTYEMEPCGSNFCNSLTGFQTCNSSDKWCKYRLVYEDNK 180
           CNSQCEPEK G  TKFL SKSFTYEMEPCGSN CNSLTGFQTCNSSD+WCKYRLVYEDN 
Sbjct: 130 CNSQCEPEK-GPFTKFLPSKSFTYEMEPCGSNVCNSLTGFQTCNSSDRWCKYRLVYEDNS 189

Query: 181 ATSGILSSDSFSFDTSDGKLVDVGYLNFGCSEAPFTGDIQSYTGSVGLNQTPLSLISQLG 240
            TSG LSSDSFSFDT+DGK VDVGYLNFGCSEAP TG +QSY GSVGLNQTPLSLISQLG
Sbjct: 190 ETSGNLSSDSFSFDTTDGKHVDVGYLNFGCSEAPLTGGMQSYMGSVGLNQTPLSLISQLG 249

Query: 241 IKKFSYCLVPFNNLGSASKMYFGSLPVTSGGQTPLLYPNSDAYYVKVLGISLGNDEAHLD 300
           IKKFSYCLVPF NLGS SKMYFGSLPVTSGGQTPLLYPNSDAYYVKVLGIS+G D+ +LD
Sbjct: 250 IKKFSYCLVPF-NLGSTSKMYFGSLPVTSGGQTPLLYPNSDAYYVKVLGISVGADDPNLD 309

Query: 301 RVFDVYDVRDGWIIDSGITYSSLETDAFDSLLAKFLTLPDLPQKKIDPRDRFELCFAANA 360
            VFDVYDVRDGWIIDSG TYSSLETDAFD LLAKF+TLPDL QKK DPR+RFELCFAANA
Sbjct: 310 GVFDVYDVRDGWIIDSGTTYSSLETDAFDRLLAKFITLPDLQQKKEDPRNRFELCFAANA 369

Query: 361 NDLESFPDVTVHFDGADLILNVESTFVKIEDDGIFCLALLRSGSPVSILGNFQLQNYHVG 420
           ND+E+FPDVTVH DGA+LILNVESTFVKIEDDGI CLALLRSGSPVSILGNFQLQNYHVG
Sbjct: 370 NDMETFPDVTVHLDGAELILNVESTFVKIEDDGIICLALLRSGSPVSILGNFQLQNYHVG 429

Query: 421 YDLEAQVISFAPVDCADS 439
           YDLEAQV+SFAPV+CADS
Sbjct: 430 YDLEAQVVSFAPVNCADS 445

BLAST of HG10012598 vs. NCBI nr
Match: KAG6582237.1 (Aspartic proteinase CDR1, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 782.7 bits (2020), Expect = 1.6e-222
Identity = 386/438 (88.13%), Positives = 404/438 (92.24%), Query Frame = 0

Query: 1   MILFLLFTILESTAWYMVPTEVGFTARLIHRDSPLSPFYDHALTDTARIEATVHRSRSRL 60
           M+ FL  TILESTA +MVPTEVGFTARLIHRDSPLSPFYDH +T+TARIEATVHRSRSRL
Sbjct: 1   MMFFLSLTILESTARHMVPTEVGFTARLIHRDSPLSPFYDHVMTNTARIEATVHRSRSRL 60

Query: 61  NYLYYNMLSENTLDNDVSLSPTLVNEGGEYLMSFNIGNPPSQVMGFADTSNGLIWVQCSD 120
           NYLYYNMLS  TLDND+SLSPTLV+EGGEYLMSFNIGNP SQVMGFADTSNGLIWVQCSD
Sbjct: 61  NYLYYNMLSRKTLDNDLSLSPTLVHEGGEYLMSFNIGNPSSQVMGFADTSNGLIWVQCSD 120

Query: 121 CNSQCEPEKGGSTTKFLSSKSFTYEMEPCGSNFCNSLTGFQTCNSSDKWCKYRLVYEDNK 180
           CNS CEPEK G  TKFL SKSFTYEMEPCGSN CNSLTGFQTCNSSD+WCKYRLVYEDN 
Sbjct: 121 CNSHCEPEK-GPFTKFLPSKSFTYEMEPCGSNVCNSLTGFQTCNSSDRWCKYRLVYEDNS 180

Query: 181 ATSGILSSDSFSFDTSDGKLVDVGYLNFGCSEAPFTGDIQSYTGSVGLNQTPLSLISQLG 240
            TSG LSSDSFSFDT+DGK VDVGYLNFGCSEAP  G +QSY GSVGLNQTPLSLISQLG
Sbjct: 181 ETSGTLSSDSFSFDTTDGKHVDVGYLNFGCSEAPLIGGMQSYMGSVGLNQTPLSLISQLG 240

Query: 241 IKKFSYCLVPFNNLGSASKMYFGSLPVTSGGQTPLLYPNSDAYYVKVLGISLGNDEAHLD 300
           IKKFSYCLVPF NLGS SKMYFGSLPVTSGGQTPLLYPNSDAYYVKVLGIS+G D+ +LD
Sbjct: 241 IKKFSYCLVPF-NLGSTSKMYFGSLPVTSGGQTPLLYPNSDAYYVKVLGISVGTDDPNLD 300

Query: 301 RVFDVYDVRDGWIIDSGITYSSLETDAFDSLLAKFLTLPDLPQKKIDPRDRFELCFAANA 360
            VFDVYDVRDGWIIDSG TYSSLETDAFD LLAKF TLP+L QKK DPR+RFELCFAANA
Sbjct: 301 GVFDVYDVRDGWIIDSGTTYSSLETDAFDRLLAKFNTLPELQQKKEDPRNRFELCFAANA 360

Query: 361 NDLESFPDVTVHFDGADLILNVESTFVKIEDDGIFCLALLRSGSPVSILGNFQLQNYHVG 420
           ND+E+FPDVTVH DGA+LILNVESTFVKIEDDGI CLALLRSGSPVSILGNFQLQNYHVG
Sbjct: 361 NDMETFPDVTVHLDGAELILNVESTFVKIEDDGIICLALLRSGSPVSILGNFQLQNYHVG 420

Query: 421 YDLEAQVISFAPVDCADS 439
           YDLEAQV+SFAPVDCADS
Sbjct: 421 YDLEAQVVSFAPVDCADS 436

BLAST of HG10012598 vs. ExPASy Swiss-Prot
Match: Q6XBF8 (Aspartic proteinase CDR1 OS=Arabidopsis thaliana OX=3702 GN=CDR1 PE=1 SV=1)

HSP 1 Score: 232.3 bits (591), Expect = 1.1e-59
Identity = 152/428 (35.51%), Positives = 225/428 (52.57%), Query Frame = 0

Query: 21  EVGFTARLIHRDSPLSPFYDHALTDTARIEATVHRSRSRLNYLYYNMLSENTLDNDVSLS 80
           ++GFTA LIHRDSP SPFY+   T + R+   +HRS   +N +++    +NT    + L+
Sbjct: 28  KLGFTADLIHRDSPKSPFYNPMETSSQRLRNAIHRS---VNRVFHFTEKDNTPQPQIDLT 87

Query: 81  PTLVNEGGEYLMSFNIGNPPSQVMGFADTSNGLIWVQCS---DCNSQCEPEKGGSTTKFL 140
               +  GEYLM+ +IG PP  +M  ADT + L+W QC+   DC +Q +P        F 
Sbjct: 88  ----SNSGEYLMNVSIGTPPFPIMAIADTGSDLLWTQCAPCDDCYTQVDP-------LFD 147

Query: 141 SSKSFTYEMEPCGSNFCNSLTGFQTCNSSDKWCKYRLVYEDNKATSGILSSDSFSFDTSD 200
              S TY+   C S+ C +L    +C+++D  C Y L Y DN  T G ++ D+ +  +SD
Sbjct: 148 PKTSSTYKDVSCSSSQCTALENQASCSTNDNTCSYSLSYGDNSYTKGNIAVDTLTLGSSD 207

Query: 201 GKLVDVGYLNFGCSEAPFTGDIQSYTGSVGLNQTPLSLISQLGIK---KFSYCLVPF-NN 260
            + + +  +  GC         +  +G VGL   P+SLI QLG     KFSYCLVP  + 
Sbjct: 208 TRPMQLKNIIIGCGHNNAGTFNKKGSGIVGLGGGPVSLIKQLGDSIDGKFSYCLVPLTSK 267

Query: 261 LGSASKMYFGSLPVTSGG---QTPLLYPNSDA--YYVKVLGISLGNDEAHLDRVFDVYDV 320
               SK+ FG+  + SG     TPL+   S    YY+ +  IS+G+ +       D    
Sbjct: 268 KDQTSKINFGTNAIVSGSGVVSTPLIAKASQETFYYLTLKSISVGSKQIQYSG-SDSESS 327

Query: 321 RDGWIIDSGITYSSLETDAFDSLLAKFLTLPDLPQKKIDPRDRFELCFAANANDLESFPD 380
               IIDSG T + L T+ +  L     +  D  +KK DP+    LC++A   DL+  P 
Sbjct: 328 EGNIIIDSGTTLTLLPTEFYSELEDAVASSID-AEKKQDPQSGLSLCYSA-TGDLK-VPV 387

Query: 381 VTVHFDGADLILNVESTFVKIEDDGIFCLALLRSGSPVSILGNFQLQNYHVGYDLEAQVI 437
           +T+HFDGAD+ L+  + FV++ +D + C A  R     SI GN    N+ VGYD  ++ +
Sbjct: 388 ITMHFDGADVKLDSSNAFVQVSED-LVCFA-FRGSPSFSIYGNVAQMNFLVGYDTVSKTV 435

BLAST of HG10012598 vs. ExPASy Swiss-Prot
Match: Q3EBM5 (Probable aspartic protease At2g35615 OS=Arabidopsis thaliana OX=3702 GN=At2g35615 PE=3 SV=1)

HSP 1 Score: 207.6 bits (527), Expect = 2.8e-52
Identity = 153/467 (32.76%), Positives = 230/467 (49.25%), Query Frame = 0

Query: 1   MILFLLFTILESTAWYMVPTEVGFTARLIHRDSPLSPFYDHALTDTARIEATVHRSRSRL 60
           +  FL F++  S++ +       F+  LIHRDSPLSP Y+  +T T R+ A   RS SR 
Sbjct: 7   LCFFLFFSVTLSSSGH----PKNFSVELIHRDSPLSPIYNPQITVTDRLNAAFLRSVSR- 66

Query: 61  NYLYYNMLSENTLDNDVSLSPTLVNEGGEYLMSFNIGNPPSQVMGFADTSNGLIWVQCSD 120
           +  + + LS+  L +       L+   GE+ MS  IG PP +V   ADT + L WVQC  
Sbjct: 67  SRRFNHQLSQTDLQSG------LIGADGEFFMSITIGTPPIKVFAIADTGSDLTWVQCKP 126

Query: 121 CNSQCEPEKGGSTTKFLSSKSFTYEMEPCGSNFCNSLTGFQT-CNSSDKWCKYRLVYEDN 180
           C  QC  E G     F   KS TY+ EPC S  C +L+  +  C+ S+  CKYR  Y D 
Sbjct: 127 C-QQCYKENG---PIFDKKKSSTYKSEPCDSRNCQALSSTERGCDESNNICKYRYSYGDQ 186

Query: 181 KATSGILSSDSFSFDTSDGKLVDVGYLNFGCSEAPFTGDIQSYTGSVGLNQTPLSLISQL 240
             + G +++++ S D++ G  V      FGC         ++ +G +GL    LSLISQL
Sbjct: 187 SFSKGDVATETVSIDSASGSPVSFPGTVFGCGYNNGGTFDETGSGIIGLGGGHLSLISQL 246

Query: 241 G---IKKFSYCL--------------VPFNNLGSASKMYFGSLPVTSGGQTPLLYPNSDA 300
           G    KKFSYCL              +  N++ S+     G +      + PL Y     
Sbjct: 247 GSSISKKFSYCLSHKSATTNGTSVINLGTNSIPSSLSKDSGVVSTPLVDKEPLTY----- 306

Query: 301 YYVKVLGISLG-------------NDEAHLDRVFDVYDVRDGWIIDSGITYSSLETDAFD 360
           YY+ +  IS+G             ND+  L       +     IIDSG T + LE   FD
Sbjct: 307 YYLTLEAISVGKKKIPYTGSSYNPNDDGILS------ETSGNIIIDSGTTLTLLEAGFFD 366

Query: 361 SLLAKFLTLPDLPQKKIDPRDRFELCFAANANDLESFPDVTVHFDGADLILNVESTFVKI 420
              +         ++  DP+     CF + + ++   P++TVHF GAD+ L+  + FVK+
Sbjct: 367 KFSSAVEESVTGAKRVSDPQGLLSHCFKSGSAEI-GLPEITVHFTGADVRLSPINAFVKL 426

Query: 421 EDDGIFCLALLRSGSPVSILGNFQLQNYHVGYDLEAQVISFAPVDCA 437
            +D + CL+++ + + V+I GNF   ++ VGYDLE + +SF  +DC+
Sbjct: 427 SED-MVCLSMVPT-TEVAIYGNFAQMDFLVGYDLETRTVSFQHMDCS 444

BLAST of HG10012598 vs. ExPASy Swiss-Prot
Match: Q766C3 (Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 SV=1)

HSP 1 Score: 181.4 bits (459), Expect = 2.2e-44
Identity = 135/425 (31.76%), Positives = 199/425 (46.82%), Query Frame = 0

Query: 23  GFTARLIHRDSPLSPFYDHALTDTARIEATVHRSRSRLNYLYYNMLSENTLDNDVSLSPT 82
           GF   L H DS  +      LT    +E  + R   RL  L      E  L+    +  +
Sbjct: 40  GFQIMLEHVDSGKN------LTKFQLLERAIERGSRRLQRL------EAMLNGPSGVETS 99

Query: 83  LVNEGGEYLMSFNIGNPPSQVMGFADTSNGLIWVQCSDCNSQCEPEKGGSTTKFLSSKSF 142
           +    GEYLM+ +IG P        DT + LIW QC  C +QC  +   ST  F    S 
Sbjct: 100 VYAGDGEYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQPC-TQCFNQ---STPIFNPQGSS 159

Query: 143 TYEMEPCGSNFCNSLTGFQTCNSSDKWCKYRLVYEDNKATSGILSSDSFSFDTSDGKLVD 202
           ++   PC S  C +L+   TC  S+ +C+Y   Y D   T G + +++ +F +     V 
Sbjct: 160 SFSTLPCSSQLCQALSS-PTC--SNNFCQYTYGYGDGSETQGSMGTETLTFGS-----VS 219

Query: 203 VGYLNFGCSEAPFTGDIQSYTGSVGLNQTPLSLISQLGIKKFSYCLVPFNNLGSASKMYF 262
           +  + FGC E        +  G VG+ + PLSL SQL + KFSYC+ P  +  + S +  
Sbjct: 220 IPNITFGCGENNQGFGQGNGAGLVGMGRGPLSLPSQLDVTKFSYCMTPIGS-STPSNLLL 279

Query: 263 GSL--PVTSGGQTPLLYPNSDA---YYVKVLGISLGNDEAHLDR---VFDVYDVRDGWII 322
           GSL   VT+G     L  +S     YY+ + G+S+G+    +D      +  +   G II
Sbjct: 280 GSLANSVTAGSPNTTLIQSSQIPTFYYITLNGLSVGSTRLPIDPSAFALNSNNGTGGIII 339

Query: 323 DSGITYSSLETDAFDSLLAKFLTLPDLPQKKIDPRDRFELCFAANANDLE-SFPDVTVHF 382
           DSG T +    +A+ S+  +F++  +LP         F+LCF   ++      P   +HF
Sbjct: 340 DSGTTLTYFVNNAYQSVRQEFISQINLPVVN-GSSSGFDLCFQTPSDPSNLQIPTFVMHF 399

Query: 383 DGADLILNVESTFVKIEDDGIFCLALLRSGSPVSILGNFQLQNYHVGYDLEAQVISFAPV 439
           DG DL L  E+ F+    +G+ CLA+  S   +SI GN Q QN  V YD    V+SFA  
Sbjct: 400 DGGDLELPSENYFIS-PSNGLICLAMGSSSQGMSIFGNIQQQNMLVVYDTGNSVVSFASA 437

BLAST of HG10012598 vs. ExPASy Swiss-Prot
Match: Q766C2 (Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 SV=1)

HSP 1 Score: 164.9 bits (416), Expect = 2.1e-39
Identity = 124/405 (30.62%), Positives = 191/405 (47.16%), Query Frame = 0

Query: 43  LTDTARIEATVHRSRSRLNYLYYNMLSENTLDNDVSLSPTLVNEGGEYLMSFNIGNPPSQ 102
           LT    I+  + R   R+  +   + S + ++  V          GEYLM+  IG P S 
Sbjct: 55  LTKYELIKRAIKRGERRMRSINAMLQSSSGIETPVYAG------DGEYLMNVAIGTPDSS 114

Query: 103 VMGFADTSNGLIWVQCSDCNSQCEPEKGGSTTKFLSSKSFTYEMEPCGSNFCNSLTGFQT 162
                DT + LIW QC  C +QC  +    T  F    S ++   PC S +C  L   +T
Sbjct: 115 FSAIMDTGSDLIWTQCEPC-TQCFSQ---PTPIFNPQDSSSFSTLPCESQYCQDLPS-ET 174

Query: 163 CNSSDKWCKYRLVYEDNKATSGILSSDSFSFDTSDGKLVDVGYLNFGCSEAPFTGDIQSY 222
           CN+++  C+Y   Y D   T G +++++F+F+TS      V  + FGC E        + 
Sbjct: 175 CNNNE--CQYTYGYGDGSTTQGYMATETFTFETS-----SVPNIAFGCGEDNQGFGQGNG 234

Query: 223 TGSVGLNQTPLSLISQLGIKKFSYCLVPFNNLGSASKMYFGSLP--VTSGG-QTPLLYP- 282
            G +G+   PLSL SQLG+ +FSYC+  + +  S S +  GS    V  G   T L++  
Sbjct: 235 AGLIGMGWGPLSLPSQLGVGQFSYCMTSYGS-SSPSTLALGSAASGVPEGSPSTTLIHSS 294

Query: 283 -NSDAYYVKVLGISLGNDEAHL-DRVFDVY-DVRDGWIIDSGITYSSLETDAFDSLLAKF 342
            N   YY+ + GI++G D   +    F +  D   G IIDSG T + L  DA++++   F
Sbjct: 295 LNPTYYYITLQGITVGGDNLGIPSSTFQLQDDGTGGMIIDSGTTLTYLPQDAYNAVAQAF 354

Query: 343 LTLPDLPQKKIDPRDRFELCFAANAN-DLESFPDVTVHFDGADLILNVESTFVKIEDDGI 402
               +LP    +       CF   ++      P++++ FDG  L L  ++  +    +G+
Sbjct: 355 TDQINLPTVD-ESSSGLSTCFQQPSDGSTVQVPEISMQFDGGVLNLGEQNILIS-PAEGV 414

Query: 403 FCLALLRSGS-PVSILGNFQLQNYHVGYDLEAQVISFAPVDCADS 439
            CLA+  S    +SI GN Q Q   V YDL+   +SF P  C  S
Sbjct: 415 ICLAMGSSSQLGISIFGNIQQQETQVLYDLQNLAVSFVPTQCGAS 438

BLAST of HG10012598 vs. ExPASy Swiss-Prot
Match: Q9LNJ3 (Aspartyl protease family protein 2 OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 SV=1)

HSP 1 Score: 146.7 bits (369), Expect = 6.0e-34
Identity = 116/359 (32.31%), Positives = 174/359 (48.47%), Query Frame = 0

Query: 88  GEYLMSFNIGNPPSQVMGFADTSNGLIWVQCSDCNSQCEPEKGGSTTKFLSSKSFTYEME 147
           GEY     +G P   V    DT + ++W+QC+ C  +C  +   S   F   KS TY   
Sbjct: 140 GEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCR-RCYSQ---SDPIFDPRKSKTYATI 199

Query: 148 PCGSNFCNSLTGFQTCNSSDKWCKYRLVYEDNKATSGILSSDSFSFDTSDGKLVDVGYLN 207
           PC S  C  L     CN+  K C Y++ Y D   T G  S+++ +F  +  K V +G  +
Sbjct: 200 PCSSPHCRRLDS-AGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFRRNRVKGVALGCGH 259

Query: 208 FGCSEAPFTGDIQSYTGSVGLNQTPLSLISQLGIK---KFSYCLVPFNNLGSASKMYFGS 267
              +E  F G      G +GL +  LS   Q G +   KFSYCLV  +     S + FG+
Sbjct: 260 --DNEGLFVG----AAGLLGLGKGKLSFPGQTGHRFNQKFSYCLVDRSASSKPSSVVFGN 319

Query: 268 LPVTSGGQ-TPLL-YPNSDA-YYVKVLGISLGNDEAH--LDRVFDVYDVRDGW-IIDSGI 327
             V+   + TPLL  P  D  YYV +LGIS+G          +F +  + +G  IIDSG 
Sbjct: 320 AAVSRIARFTPLLSNPKLDTFYYVGLLGISVGGTRVPGVTASLFKLDQIGNGGVIIDSGT 379

Query: 328 TYSSLETDAFDSLLAKFLTLPDLPQKKIDPRDRFELCF-AANANDLESFPDVTVHFDGAD 387
           + + L   A+ ++   F  +     K+      F+ CF  +N N+++  P V +HF GAD
Sbjct: 380 SVTRLIRPAYIAMRDAF-RVGAKTLKRAPDFSLFDTCFDLSNMNEVK-VPTVVLHFRGAD 439

Query: 388 LILNVESTFVKIEDDGIFCLALLRSGSPVSILGNFQLQNYHVGYDLEAQVISFAPVDCA 437
           + L   +  + ++ +G FC A   +   +SI+GN Q Q + V YDL +  + FAP  CA
Sbjct: 440 VSLPATNYLIPVDTNGKFCFAFAGTMGGLSIIGNIQQQGFRVVYDLASSRVGFAPGGCA 485

BLAST of HG10012598 vs. ExPASy TrEMBL
Match: A0A1S3AZA6 (aspartic proteinase CDR1-like OS=Cucumis melo OX=3656 GN=LOC103484191 PE=3 SV=1)

HSP 1 Score: 807.4 bits (2084), Expect = 3.0e-230
Identity = 398/440 (90.45%), Positives = 412/440 (93.64%), Query Frame = 0

Query: 1   MILFLLFTILESTAWYMVPTEVGFTARLIHRDSPLSPFYDHALTDTARIEATVHRSRSRL 60
           MIL+LLFTILES   +MV  EVGFTARLIH DSPLSPFY+HA+T TARIEATVHRSRSRL
Sbjct: 1   MILYLLFTILESKGMHMVSNEVGFTARLIHHDSPLSPFYNHAMTGTARIEATVHRSRSRL 60

Query: 61  NYLYY-NMLSENTLDNDVSLSPTLVNEGGEYLMSFNIGNPPSQVMGFADTSNGLIWVQCS 120
           +YLYY N LSENTLDNDVSLSPTLVNEGGEYLMSFNIGNPPSQVMGF DTSNGLIWVQCS
Sbjct: 61  SYLYYINKLSENTLDNDVSLSPTLVNEGGEYLMSFNIGNPPSQVMGFLDTSNGLIWVQCS 120

Query: 121 DCNSQCEPEKGGSTTKFLSSKSFTYEMEPCGSNFCNSLTGFQTCNSSDKWCKYRLVYEDN 180
           +CNSQCEPEK G TTKFLSSKSFTYEMEPCGSNFCNSLTGF+TCNSSDKWCKYRLVY DN
Sbjct: 121 NCNSQCEPEKRGPTTKFLSSKSFTYEMEPCGSNFCNSLTGFKTCNSSDKWCKYRLVYGDN 180

Query: 181 KATSGILSSDSFSFDTSDGKLVDVGYLNFGCSEAPFTGDIQSYTGSVGLNQTPLSLISQL 240
           KATSGILSSDSF FDTSDGKLVDVG+LNFGCSEAP TGD QSYTG VGLNQTPLSLISQL
Sbjct: 181 KATSGILSSDSFGFDTSDGKLVDVGFLNFGCSEAPLTGDEQSYTGRVGLNQTPLSLISQL 240

Query: 241 GIKKFSYCLVPFNNLGSASKMYFGSLPVTSGGQTPLLYPNSDAYYVKVLGISLGNDEAHL 300
           GIKKFSYCLVPFN+LGS SKMYFGSLPVTSGGQTPLLYPNSDAYYVKVLGIS+GNDE H 
Sbjct: 241 GIKKFSYCLVPFNSLGSTSKMYFGSLPVTSGGQTPLLYPNSDAYYVKVLGISIGNDEPHF 300

Query: 301 DRVFDVYDVRDGWIIDSGITYSSLETDAFDSLLAKFLTLPDLPQKKIDPRDRFELCF-AA 360
           D VFDVYDVRDGWIID+GITYSSLETDAFDSLLAKFL L + PQ+K DP+DRFELCF  A
Sbjct: 301 DGVFDVYDVRDGWIIDTGITYSSLETDAFDSLLAKFLALKNFPQRKNDPKDRFELCFELA 360

Query: 361 NANDLESFPDVTVHFDGADLILNVESTFVKIEDDGIFCLALLRSGSPVSILGNFQLQNYH 420
           NANDLESFPD TVHFDGADLILNVESTFVKIEDDGIFCLALLRSGSPVSILGNFQLQNYH
Sbjct: 361 NANDLESFPDATVHFDGADLILNVESTFVKIEDDGIFCLALLRSGSPVSILGNFQLQNYH 420

Query: 421 VGYDLEAQVISFAPVDCADS 439
           VGYDLEAQVISFAPVDCADS
Sbjct: 421 VGYDLEAQVISFAPVDCADS 440

BLAST of HG10012598 vs. ExPASy TrEMBL
Match: A0A5D3CXD4 (Aspartic proteinase CDR1-like OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold209G00490 PE=3 SV=1)

HSP 1 Score: 791.2 bits (2042), Expect = 2.2e-225
Identity = 387/425 (91.06%), Positives = 400/425 (94.12%), Query Frame = 0

Query: 16  YMVPTEVGFTARLIHRDSPLSPFYDHALTDTARIEATVHRSRSRLNYLYY-NMLSENTLD 75
           +MV  EVGFTARLIH DSPLSPFY+HA+T TARIEATVHRSRSRL+YLYY N LSENTLD
Sbjct: 2   HMVSNEVGFTARLIHHDSPLSPFYNHAMTGTARIEATVHRSRSRLSYLYYINKLSENTLD 61

Query: 76  NDVSLSPTLVNEGGEYLMSFNIGNPPSQVMGFADTSNGLIWVQCSDCNSQCEPEKGGSTT 135
           NDVSLSPTLVNEGGEYLMSFNIGNPPSQVMGF DTSNGLIWVQCS+CNSQCEPEK G TT
Sbjct: 62  NDVSLSPTLVNEGGEYLMSFNIGNPPSQVMGFLDTSNGLIWVQCSNCNSQCEPEKRGPTT 121

Query: 136 KFLSSKSFTYEMEPCGSNFCNSLTGFQTCNSSDKWCKYRLVYEDNKATSGILSSDSFSFD 195
           KFLSSKSFTYEMEPCGSNFCNSLTGF+TCNSSDKWCKYRLVY DNKATSGILSSDSF FD
Sbjct: 122 KFLSSKSFTYEMEPCGSNFCNSLTGFKTCNSSDKWCKYRLVYGDNKATSGILSSDSFGFD 181

Query: 196 TSDGKLVDVGYLNFGCSEAPFTGDIQSYTGSVGLNQTPLSLISQLGIKKFSYCLVPFNNL 255
           TSDGKLVDVG+LNFGCSEAP TGD QSYTG VGLNQTPLSLISQLGIKKFSYCLVPFN+L
Sbjct: 182 TSDGKLVDVGFLNFGCSEAPLTGDEQSYTGRVGLNQTPLSLISQLGIKKFSYCLVPFNSL 241

Query: 256 GSASKMYFGSLPVTSGGQTPLLYPNSDAYYVKVLGISLGNDEAHLDRVFDVYDVRDGWII 315
           GS SKMYFGSLPVTSGGQTPLLYPNSDAYYVKVLGIS+GNDE H D VFDVYDVRDGWII
Sbjct: 242 GSTSKMYFGSLPVTSGGQTPLLYPNSDAYYVKVLGISIGNDEPHFDGVFDVYDVRDGWII 301

Query: 316 DSGITYSSLETDAFDSLLAKFLTLPDLPQKKIDPRDRFELCF-AANANDLESFPDVTVHF 375
           D+GITYSSLETDAFDSLLAKFL L + PQ+K DP+DRFELCF  ANANDLESFPD TVHF
Sbjct: 302 DTGITYSSLETDAFDSLLAKFLALKNFPQRKNDPKDRFELCFELANANDLESFPDATVHF 361

Query: 376 DGADLILNVESTFVKIEDDGIFCLALLRSGSPVSILGNFQLQNYHVGYDLEAQVISFAPV 435
           DGADLILNVESTFVKIEDDGIFCLALLRSGSPVSILGNFQLQNYHVGYDLEAQVISFAPV
Sbjct: 362 DGADLILNVESTFVKIEDDGIFCLALLRSGSPVSILGNFQLQNYHVGYDLEAQVISFAPV 421

Query: 436 DCADS 439
           DCADS
Sbjct: 422 DCADS 426

BLAST of HG10012598 vs. ExPASy TrEMBL
Match: A0A0A0L7U3 (Peptidase A1 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_3G151520 PE=3 SV=1)

HSP 1 Score: 791.2 bits (2042), Expect = 2.2e-225
Identity = 387/425 (91.06%), Positives = 399/425 (93.88%), Query Frame = 0

Query: 16  YMVPTEVGFTARLIHRDSPLSPFYDHALTDTARIEATVHRSRSRLNYLYY-NMLSENTLD 75
           +MV  EVGFTARLIH DSPLSPFY+H +TDTARIEATVHRSRSRLNYLYY N LSEN LD
Sbjct: 2   HMVSNEVGFTARLIHHDSPLSPFYNHTMTDTARIEATVHRSRSRLNYLYYINKLSENALD 61

Query: 76  NDVSLSPTLVNEGGEYLMSFNIGNPPSQVMGFADTSNGLIWVQCSDCNSQCEPEKGGSTT 135
           NDVSLSPTLVNEGGEYLMSFNIGNP SQVMGF DTSNGLIWVQCS+CNSQCEPEK G TT
Sbjct: 62  NDVSLSPTLVNEGGEYLMSFNIGNPSSQVMGFLDTSNGLIWVQCSNCNSQCEPEKRGLTT 121

Query: 136 KFLSSKSFTYEMEPCGSNFCNSLTGFQTCNSSDKWCKYRLVYEDNKATSGILSSDSFSFD 195
           KFLSSKSFTYEMEPCGSNFCNSLTGFQTCNSSDKWCKYRLVY DNKATSGILSSDSF FD
Sbjct: 122 KFLSSKSFTYEMEPCGSNFCNSLTGFQTCNSSDKWCKYRLVYGDNKATSGILSSDSFGFD 181

Query: 196 TSDGKLVDVGYLNFGCSEAPFTGDIQSYTGSVGLNQTPLSLISQLGIKKFSYCLVPFNNL 255
           TSDG LVDVG+LNFGCSEAP TGD QSYTG+VGLNQTPLSLISQLGIKKFSYCLVPFNNL
Sbjct: 182 TSDGMLVDVGFLNFGCSEAPLTGDEQSYTGNVGLNQTPLSLISQLGIKKFSYCLVPFNNL 241

Query: 256 GSASKMYFGSLPVTSGGQTPLLYPNSDAYYVKVLGISLGNDEAHLDRVFDVYDVRDGWII 315
           GS SKMYFGSLPVTSGGQTPLLYPNSDAYYVKVLGIS+GNDE H D VFDVY+VRDGWII
Sbjct: 242 GSTSKMYFGSLPVTSGGQTPLLYPNSDAYYVKVLGISIGNDEPHFDGVFDVYEVRDGWII 301

Query: 316 DSGITYSSLETDAFDSLLAKFLTLPDLPQKKIDPRDRFELCF-AANANDLESFPDVTVHF 375
           D+GITYSSLETDAFDSLLAKFLTL D PQ+K DP++RFELCF   NANDLESFPDVTVHF
Sbjct: 302 DTGITYSSLETDAFDSLLAKFLTLKDFPQRKDDPKERFELCFELQNANDLESFPDVTVHF 361

Query: 376 DGADLILNVESTFVKIEDDGIFCLALLRSGSPVSILGNFQLQNYHVGYDLEAQVISFAPV 435
           DGADLILNVESTFVKIEDDGIFCLALLRSGSPVSILGNFQLQNYHVGYDLEAQVISFAPV
Sbjct: 362 DGADLILNVESTFVKIEDDGIFCLALLRSGSPVSILGNFQLQNYHVGYDLEAQVISFAPV 421

Query: 436 DCADS 439
           DCADS
Sbjct: 422 DCADS 426

BLAST of HG10012598 vs. ExPASy TrEMBL
Match: A0A6J1IXB1 (aspartic proteinase CDR1-like OS=Cucurbita maxima OX=3661 GN=LOC111479389 PE=3 SV=1)

HSP 1 Score: 777.7 bits (2007), Expect = 2.5e-221
Identity = 385/438 (87.90%), Positives = 407/438 (92.92%), Query Frame = 0

Query: 1   MILFLLFTILESTAWYMVPTEVGFTARLIHRDSPLSPFYDHALTDTARIEATVHRSRSRL 60
           M+ FL FTILESTA +MVPTEVGFTARLIHRDSPLSPFYDH ++ TA IEAT+HRSRSRL
Sbjct: 10  MMFFLSFTILESTARHMVPTEVGFTARLIHRDSPLSPFYDHVMSYTAWIEATIHRSRSRL 69

Query: 61  NYLYYNMLSENTLDNDVSLSPTLVNEGGEYLMSFNIGNPPSQVMGFADTSNGLIWVQCSD 120
           NYLYYNMLS++TLDND+SLSPTLV+EGGEYLMSFNIGNPPSQVMGFADTSNGLIWVQCSD
Sbjct: 70  NYLYYNMLSKDTLDNDLSLSPTLVHEGGEYLMSFNIGNPPSQVMGFADTSNGLIWVQCSD 129

Query: 121 CNSQCEPEKGGSTTKFLSSKSFTYEMEPCGSNFCNSLTGFQTCNSSDKWCKYRLVYEDNK 180
           CNSQCEPEK G  TKFL SKSFTYEMEPCGSN CNSLTGFQTCNSSD+ CKYRLVYEDN 
Sbjct: 130 CNSQCEPEK-GPFTKFLPSKSFTYEMEPCGSNVCNSLTGFQTCNSSDRSCKYRLVYEDNS 189

Query: 181 ATSGILSSDSFSFDTSDGKLVDVGYLNFGCSEAPFTGDIQSYTGSVGLNQTPLSLISQLG 240
            TSG LSSDSFSFDT+DGK VDVGYLNFGCSEAP TG +QSY GSVGLNQTPLSLISQLG
Sbjct: 190 ETSGNLSSDSFSFDTTDGKHVDVGYLNFGCSEAPLTGGMQSYMGSVGLNQTPLSLISQLG 249

Query: 241 IKKFSYCLVPFNNLGSASKMYFGSLPVTSGGQTPLLYPNSDAYYVKVLGISLGNDEAHLD 300
           IKKFSYCLVPF NLGS SKMYFGSLPVTSGGQTPLLYPNSDAYYVKVLGIS+G D+ +L+
Sbjct: 250 IKKFSYCLVPF-NLGSTSKMYFGSLPVTSGGQTPLLYPNSDAYYVKVLGISVGADDPNLE 309

Query: 301 RVFDVYDVRDGWIIDSGITYSSLETDAFDSLLAKFLTLPDLPQKKIDPRDRFELCFAANA 360
            VFDVYDVRDGWIIDSG TYSSLETDAFD LLAKF+TLPDL QKK DPR+RFELCFAANA
Sbjct: 310 GVFDVYDVRDGWIIDSGTTYSSLETDAFDRLLAKFITLPDLQQKKEDPRNRFELCFAANA 369

Query: 361 NDLESFPDVTVHFDGADLILNVESTFVKIEDDGIFCLALLRSGSPVSILGNFQLQNYHVG 420
           ND+E+FP VTVHFDGA+LILNVESTFVKIEDDGI CLALLRSGSPVSILGNFQLQN HVG
Sbjct: 370 NDMETFPGVTVHFDGAELILNVESTFVKIEDDGIICLALLRSGSPVSILGNFQLQNCHVG 429

Query: 421 YDLEAQVISFAPVDCADS 439
           YDLEAQV+SFAPVDCADS
Sbjct: 430 YDLEAQVVSFAPVDCADS 445

BLAST of HG10012598 vs. ExPASy TrEMBL
Match: A0A6J1GWK9 (aspartic proteinase CDR1-like OS=Cucurbita moschata OX=3662 GN=LOC111457819 PE=3 SV=1)

HSP 1 Score: 774.6 bits (1999), Expect = 2.2e-220
Identity = 382/438 (87.21%), Positives = 403/438 (92.01%), Query Frame = 0

Query: 1   MILFLLFTILESTAWYMVPTEVGFTARLIHRDSPLSPFYDHALTDTARIEATVHRSRSRL 60
           M+ FL  TILESTA +MVPT+VGFTARLIHRDSPLSPFY+H +T+TARIEATVHRSRSRL
Sbjct: 10  MMFFLSLTILESTARHMVPTDVGFTARLIHRDSPLSPFYNHVMTNTARIEATVHRSRSRL 69

Query: 61  NYLYYNMLSENTLDNDVSLSPTLVNEGGEYLMSFNIGNPPSQVMGFADTSNGLIWVQCSD 120
           NYLYYNMLS  TLDND+SLSPTLV+EGGEYLMSFNIGNP SQVMGFADTSNGLIWVQCSD
Sbjct: 70  NYLYYNMLSRKTLDNDLSLSPTLVHEGGEYLMSFNIGNPSSQVMGFADTSNGLIWVQCSD 129

Query: 121 CNSQCEPEKGGSTTKFLSSKSFTYEMEPCGSNFCNSLTGFQTCNSSDKWCKYRLVYEDNK 180
           CNS C+ EK G  TK L SKSFTYEMEPCGSN CNSLTGFQTCNSSD+WCKYRLVYEDN 
Sbjct: 130 CNSHCDAEK-GPFTKLLPSKSFTYEMEPCGSNVCNSLTGFQTCNSSDRWCKYRLVYEDNS 189

Query: 181 ATSGILSSDSFSFDTSDGKLVDVGYLNFGCSEAPFTGDIQSYTGSVGLNQTPLSLISQLG 240
            TSG LSSDSFSFDT+DGK VDVGYLNFGCSEAP TG +QSY GSVGLNQTPLSLISQLG
Sbjct: 190 ETSGTLSSDSFSFDTTDGKHVDVGYLNFGCSEAPLTGGMQSYMGSVGLNQTPLSLISQLG 249

Query: 241 IKKFSYCLVPFNNLGSASKMYFGSLPVTSGGQTPLLYPNSDAYYVKVLGISLGNDEAHLD 300
           IKKFSYCLVPF NLGS SKMYFGSLPVTSGGQTPLLYPNSDAYYVKVLGIS+G D+ +LD
Sbjct: 250 IKKFSYCLVPF-NLGSTSKMYFGSLPVTSGGQTPLLYPNSDAYYVKVLGISVGADDPNLD 309

Query: 301 RVFDVYDVRDGWIIDSGITYSSLETDAFDSLLAKFLTLPDLPQKKIDPRDRFELCFAANA 360
            VFDVYDVRDGWIIDSG TYSSLETDAFD LLAKF TLP+L QKK DPR+RFELCFAANA
Sbjct: 310 GVFDVYDVRDGWIIDSGTTYSSLETDAFDRLLAKFNTLPELQQKKEDPRNRFELCFAANA 369

Query: 361 NDLESFPDVTVHFDGADLILNVESTFVKIEDDGIFCLALLRSGSPVSILGNFQLQNYHVG 420
           ND+E+FPDVTVH DGA+LILNVESTFVKIEDDGI CLALLRSGSPVSILGNFQLQNYHVG
Sbjct: 370 NDMETFPDVTVHLDGAELILNVESTFVKIEDDGIICLALLRSGSPVSILGNFQLQNYHVG 429

Query: 421 YDLEAQVISFAPVDCADS 439
           YDLEAQV+SFAPVDCADS
Sbjct: 430 YDLEAQVVSFAPVDCADS 445
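
The percentages in an HSP summary line are the raw counts divided by the alignment length. As a minimal sketch (the function name `parse_hsp_stats` and the returned keys are illustrative assumptions, not part of any BLAST tool), the summary line of the A0A6J1GWK9 hit can be parsed and its percentages recomputed in Python:

```python
import re

# Pattern for the identity/positives part of an HSP summary line.
# It accepts "Positives" as well as the "Postives" spelling that some
# report pages emit.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Pos(?:i)?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return counts and recomputed percentages for one HSP summary line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, ident_len, _, pos, pos_len, _ = match.groups()
    ident, ident_len, pos, pos_len = map(int, (ident, ident_len, pos, pos_len))
    return {
        "identities": ident,
        "positives": pos,
        "alignment_length": ident_len,
        # Percentages are recomputed from the counts, not copied from the text.
        "identity_pct": round(100 * ident / ident_len, 2),
        "positives_pct": round(100 * pos / pos_len, 2),
    }


stats = parse_hsp_stats(
    "Identity = 382/438 (87.21%), Postives = 403/438 (92.01%), Query Frame = 0"
)
print(stats["identity_pct"], stats["positives_pct"])
```

For this hit the recomputed values are 87.21 and 92.01, in agreement with the percentages reported in the summary line.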

BLAST of HG10012598 vs. TAIR 10
Match: AT5G33340.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 232.3 bits (591), Expect = 7.7e-61
Identity = 152/428 (35.51%), Positives = 225/428 (52.57%), Query Frame = 0

Query: 21  EVGFTARLIHRDSPLSPFYDHALTDTARIEATVHRSRSRLNYLYYNMLSENTLDNDVSLS 80
           ++GFTA LIHRDSP SPFY+   T + R+   +HRS   +N +++    +NT    + L+
Sbjct: 28  KLGFTADLIHRDSPKSPFYNPMETSSQRLRNAIHRS---VNRVFHFTEKDNTPQPQIDLT 87

Query: 81  PTLVNEGGEYLMSFNIGNPPSQVMGFADTSNGLIWVQCS---DCNSQCEPEKGGSTTKFL 140
               +  GEYLM+ +IG PP  +M  ADT + L+W QC+   DC +Q +P        F 
Sbjct: 88  ----SNSGEYLMNVSIGTPPFPIMAIADTGSDLLWTQCAPCDDCYTQVDP-------LFD 147

Query: 141 SSKSFTYEMEPCGSNFCNSLTGFQTCNSSDKWCKYRLVYEDNKATSGILSSDSFSFDTSD 200
              S TY+   C S+ C +L    +C+++D  C Y L Y DN  T G ++ D+ +  +SD
Sbjct: 148 PKTSSTYKDVSCSSSQCTALENQASCSTNDNTCSYSLSYGDNSYTKGNIAVDTLTLGSSD 207

Query: 201 GKLVDVGYLNFGCSEAPFTGDIQSYTGSVGLNQTPLSLISQLGIK---KFSYCLVPF-NN 260
            + + +  +  GC         +  +G VGL   P+SLI QLG     KFSYCLVP  + 
Sbjct: 208 TRPMQLKNIIIGCGHNNAGTFNKKGSGIVGLGGGPVSLIKQLGDSIDGKFSYCLVPLTSK 267

Query: 261 LGSASKMYFGSLPVTSGG---QTPLLYPNSDA--YYVKVLGISLGNDEAHLDRVFDVYDV 320
               SK+ FG+  + SG     TPL+   S    YY+ +  IS+G+ +       D    
Sbjct: 268 KDQTSKINFGTNAIVSGSGVVSTPLIAKASQETFYYLTLKSISVGSKQIQYSG-SDSESS 327

Query: 321 RDGWIIDSGITYSSLETDAFDSLLAKFLTLPDLPQKKIDPRDRFELCFAANANDLESFPD 380
               IIDSG T + L T+ +  L     +  D  +KK DP+    LC++A   DL+  P 
Sbjct: 328 EGNIIIDSGTTLTLLPTEFYSELEDAVASSID-AEKKQDPQSGLSLCYSA-TGDLK-VPV 387

Query: 381 VTVHFDGADLILNVESTFVKIEDDGIFCLALLRSGSPVSILGNFQLQNYHVGYDLEAQVI 437
           +T+HFDGAD+ L+  + FV++ +D + C A  R     SI GN    N+ VGYD  ++ +
Sbjct: 388 ITMHFDGADVKLDSSNAFVQVSED-LVCFA-FRGSPSFSIYGNVAQMNFLVGYDTVSKTV 435

BLAST of HG10012598 vs. TAIR 10
Match: AT2G35615.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 207.6 bits (527), Expect = 2.0e-53
Identity = 153/467 (32.76%), Positives = 230/467 (49.25%), Query Frame = 0

Query: 1   MILFLLFTILESTAWYMVPTEVGFTARLIHRDSPLSPFYDHALTDTARIEATVHRSRSRL 60
           +  FL F++  S++ +       F+  LIHRDSPLSP Y+  +T T R+ A   RS SR 
Sbjct: 7   LCFFLFFSVTLSSSGH----PKNFSVELIHRDSPLSPIYNPQITVTDRLNAAFLRSVSR- 66

Query: 61  NYLYYNMLSENTLDNDVSLSPTLVNEGGEYLMSFNIGNPPSQVMGFADTSNGLIWVQCSD 120
           +  + + LS+  L +       L+   GE+ MS  IG PP +V   ADT + L WVQC  
Sbjct: 67  SRRFNHQLSQTDLQSG------LIGADGEFFMSITIGTPPIKVFAIADTGSDLTWVQCKP 126

Query: 121 CNSQCEPEKGGSTTKFLSSKSFTYEMEPCGSNFCNSLTGFQT-CNSSDKWCKYRLVYEDN 180
           C  QC  E G     F   KS TY+ EPC S  C +L+  +  C+ S+  CKYR  Y D 
Sbjct: 127 C-QQCYKENG---PIFDKKKSSTYKSEPCDSRNCQALSSTERGCDESNNICKYRYSYGDQ 186

Query: 181 KATSGILSSDSFSFDTSDGKLVDVGYLNFGCSEAPFTGDIQSYTGSVGLNQTPLSLISQL 240
             + G +++++ S D++ G  V      FGC         ++ +G +GL    LSLISQL
Sbjct: 187 SFSKGDVATETVSIDSASGSPVSFPGTVFGCGYNNGGTFDETGSGIIGLGGGHLSLISQL 246

Query: 241 G---IKKFSYCL--------------VPFNNLGSASKMYFGSLPVTSGGQTPLLYPNSDA 300
           G    KKFSYCL              +  N++ S+     G +      + PL Y     
Sbjct: 247 GSSISKKFSYCLSHKSATTNGTSVINLGTNSIPSSLSKDSGVVSTPLVDKEPLTY----- 306

Query: 301 YYVKVLGISLG-------------NDEAHLDRVFDVYDVRDGWIIDSGITYSSLETDAFD 360
           YY+ +  IS+G             ND+  L       +     IIDSG T + LE   FD
Sbjct: 307 YYLTLEAISVGKKKIPYTGSSYNPNDDGILS------ETSGNIIIDSGTTLTLLEAGFFD 366

Query: 361 SLLAKFLTLPDLPQKKIDPRDRFELCFAANANDLESFPDVTVHFDGADLILNVESTFVKI 420
              +         ++  DP+     CF + + ++   P++TVHF GAD+ L+  + FVK+
Sbjct: 367 KFSSAVEESVTGAKRVSDPQGLLSHCFKSGSAEI-GLPEITVHFTGADVRLSPINAFVKL 426

Query: 421 EDDGIFCLALLRSGSPVSILGNFQLQNYHVGYDLEAQVISFAPVDCA 437
            +D + CL+++ + + V+I GNF   ++ VGYDLE + +SF  +DC+
Sbjct: 427 SED-MVCLSMVPT-TEVAIYGNFAQMDFLVGYDLETRTVSFQHMDCS 444

BLAST of HG10012598 vs. TAIR 10
Match: AT1G31450.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 199.1 bits (505), Expect = 7.2e-51
Identity = 142/432 (32.87%), Positives = 209/432 (48.38%), Query Frame = 0

Query: 25  TARLIHRDSPLSPFYDHALTDTARIEATVHRSRSRLNYLYYNMLSENTLDNDVSLSPTLV 84
           T  LIHRDSP SP Y+   T + R+ A   RS SR                   L   L+
Sbjct: 30  TVELIHRDSPHSPLYNPHHTVSDRLNAAFLRSISR----------SRRFTTKTDLQSGLI 89

Query: 85  NEGGEYLMSFNIGNPPSQVMGFADTSNGLIWVQCSDCNSQCEPEKGGSTTKFLSSKSFTY 144
           + GGEY MS +IG PPS+V   ADT + L WVQC  C  QC  +   ++  F   KS TY
Sbjct: 90  SNGGEYFMSISIGTPPSKVFAIADTGSDLTWVQCKPC-QQCYKQ---NSPLFDKKKSSTY 149

Query: 145 EMEPCGSNFCNSLTGFQT-CNSSDKWCKYRLVYEDNKATSGILSSDSFSFDTSDGKLVDV 204
           + E C S  C +L+  +  C+ S   CKYR  Y DN  T G +++++ S D+S G  V  
Sbjct: 150 KTESCDSKTCQALSEHEEGCDESKDICKYRYSYGDNSFTKGDVATETISIDSSSGSSVSF 209

Query: 205 GYLNFGCSEAPFTGDIQSYTGSVGLNQTPLSLISQLGI---KKFSYCL---VPFNNLGSA 264
               FGC         ++ +G +GL   PLSL+SQLG    KKFSYCL       N  S 
Sbjct: 210 PGTVFGCGYNNGGTFEETGSGIIGLGGGPLSLVSQLGSSIGKKFSYCLSHTAATTNGTSV 269

Query: 265 SKMYFGSLPVTSGGQ-----TPLLYPNSDAYYVKVL-GISLGNDEAHLDRVFDVYDVR-- 324
             +   S+P           TPL+  + + YY   L  +++G  +  L      Y +   
Sbjct: 270 INLGTNSIPSNPSKDSATLTTPLIQKDPETYYFLTLEAVTVG--KTKLPYTGGGYGLNGK 329

Query: 325 -----DGWIIDSGITYSSLETDAFDSLLAKFLTLPDLPQKKIDPRDRFELCFAANANDLE 384
                   IIDSG T + L++  +D             ++  DP+     CF +   ++ 
Sbjct: 330 SSKRTGNIIIDSGTTLTLLDSGFYDDFGTAVEESVTGAKRVSDPQGLLTHCFKSGDKEI- 389

Query: 385 SFPDVTVHFDGADLILNVESTFVKIEDDGIFCLALLRSGSPVSILGNFQLQNYHVGYDLE 437
             P +T+HF  AD+ L+  + FVK+ +D + CL+++ + + V+I GN    ++ VGYDLE
Sbjct: 390 GLPAITMHFTNADVKLSPINAFVKLNEDTV-CLSMIPT-TEVAIYGNMVQMDFLVGYDLE 442

BLAST of HG10012598 vs. TAIR 10
Match: AT1G64830.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 190.3 bits (482), Expect = 3.3e-48
Identity = 139/432 (32.18%), Positives = 213/432 (49.31%), Query Frame = 0

Query: 23  GFTARLIHRDSPLSPFYDHALTDTARIEATVHRS-RSRLNYLYYNMLSENTLDNDVSLSP 82
           GFT  LIHRDSP SPFY+ A T + R+   + RS RS L +        N   +  S   
Sbjct: 25  GFTIDLIHRDSPKSPFYNSAETSSQRMRNAIRRSARSTLQF-------SNDDASPNSPQS 84

Query: 83  TLVNEGGEYLMSFNIGNPPSQVMGFADTSNGLIWVQCS---DCNSQCEPEKGGSTTKFLS 142
            + +  GEYLM+ +IG PP  ++  ADT + LIW QC+   DC  Q  P        F  
Sbjct: 85  FITSNRGEYLMNISIGTPPVPILAIADTGSDLIWTQCNPCEDCYQQTSP-------LFDP 144

Query: 143 SKSFTYEMEPCGSNFCNSLTGFQTCNSSDKWCKYRLVYEDNKATSGILSSDSFSFDTSDG 202
            +S TY    C S+ C +L    +C++ +  C Y + Y DN  T G ++ D+ +  +S  
Sbjct: 145 KESSTYRKVSCSSSQCRALED-ASCSTDENTCSYTITYGDNSYTKGDVAVDTVTMGSSGR 204

Query: 203 KLVDVGYLNFGCSEAPFTGDIQSYTGSVGLNQTPLSLISQLGIK---KFSYCLVPF-NNL 262
           + V +  +  GC          + +G +GL     SL+SQL      KFSYCLVPF +  
Sbjct: 205 RPVSLRNMIIGCGHENTGTFDPAGSGIIGLGGGSTSLVSQLRKSINGKFSYCLVPFTSET 264

Query: 263 GSASKMYFGSLPVTSGG---QTPLLYPNSDAYY-VKVLGISLGNDEAHLDRVFDVYDVRD 322
           G  SK+ FG+  + SG     T ++  +   YY + +  IS+G+ +        ++   +
Sbjct: 265 GLTSKINFGTNGIVSGDGVVSTSMVKKDPATYYFLNLEAISVGSKKIQFTST--IFGTGE 324

Query: 323 G-WIIDSGITYSSLETDAF---DSLLAKFLTLPDLPQKKIDPRDRFELCFAANANDLESF 382
           G  +IDSG T + L ++ +   +S++A  +      ++  DP     LC+     D  SF
Sbjct: 325 GNIVIDSGTTLTLLPSNFYYELESVVASTIK----AERVQDPDGILSLCY----RDSSSF 384

Query: 383 --PDVTVHFDGADLILNVESTFVKIEDDGIFCLALLRSGSPVSILGNFQLQNYHVGYDLE 437
             PD+TVHF G D+ L   +TFV + +D + C A   +   ++I GN    N+ VGYD  
Sbjct: 385 KVPDITVHFKGGDVKLGNLNTFVAVSED-VSCFA-FAANEQLTIFGNLAQMNFLVGYDTV 429

BLAST of HG10012598 vs. TAIR 10
Match: AT2G03200.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 185.7 bits (470), Expect = 8.2e-47
Identity = 142/432 (32.87%), Positives = 210/432 (48.61%), Query Frame = 0

Query: 23  GFTARLIHRDSPLSPFYDHALTDTARIEATVHRSRSRLNYL-YYNMLSENTLDNDVSLSP 82
           GF   L H DS  +      LT   +I+  ++R   RLN L    +L+  +  +D +   
Sbjct: 44  GFRLSLRHVDSGKN------LTKIQKIQRGINRGFHRLNRLGAVAVLAVASKPDDTNNIK 103

Query: 83  TLVNEG-GEYLMSFNIGNPPSQVMGFADTSNGLIWVQCSDCNSQCEPEKGGSTTKFLSSK 142
              + G GE+LM  +IGNP  +     DT + LIW QC  C ++C  +    T  F   K
Sbjct: 104 APTHGGSGEFLMELSIGNPAVKYSAIVDTGSDLIWTQCKPC-TECFDQ---PTPIFDPEK 163

Query: 143 SFTYEMEPCGSNFCNSLTGFQTCNSSDKWCKYRLVYEDNKATSGILSSDSFSFDTSDGKL 202
           S +Y    C S  CN+L     CN     C+Y   Y D  +T G+L++++F+F+  +   
Sbjct: 164 SSSYSKVGCSSGLCNALPR-SNCNEDKDACEYLYTYGDYSSTRGLLATETFTFEDEN--- 223

Query: 203 VDVGYLNFGCSEAPFTGDIQSYTGSVGLNQTPLSLISQLGIKKFSYCLVPFNNLGSASKM 262
             +  + FGC            +G VGL + PLSLISQL   KFSYCL    +  ++S +
Sbjct: 224 -SISGIGFGCGVENEGDGFSQGSGLVGLGRGPLSLISQLKETKFSYCLTSIEDSEASSSL 283

Query: 263 YFGSLP---VTSGG--------QTPLLYPNSDA---YYVKVLGISLGNDEAHLDR-VFDV 322
           + GSL    V   G        +T  L  N D    YY+++ GI++G     +++  F++
Sbjct: 284 FIGSLASGIVNKTGASLDGEVTKTMSLLRNPDQPSFYYLELQGITVGAKRLSVEKSTFEL 343

Query: 323 -YDVRDGWIIDSGITYSSLETDAFDSLLAKFLTLPDLPQKKIDPRDRFELCF-AANANDL 382
             D   G IIDSG T + LE  AF  L  +F +   LP          +LCF   +A   
Sbjct: 344 AEDGTGGMIIDSGTTITYLEETAFKVLKEEFTSRMSLPVDD-SGSTGLDLCFKLPDAAKN 403

Query: 383 ESFPDVTVHFDGADLILNVESTFVKIEDDGIFCLALLRSGSPVSILGNFQLQNYHVGYDL 436
            + P +  HF GADL L  E+  V     G+ CLA + S + +SI GN Q QN++V +DL
Sbjct: 404 IAVPKMIFHFKGADLELPGENYMVADSSTGVLCLA-MGSSNGMSIFGNVQQQNFNVLHDL 458

The following BLAST results are available for this feature:
Match Name      E-value   Identity  Description
XP_008439375.1  6.2e-230  90.45     PREDICTED: aspartic proteinase CDR1-like [Cucumis melo]
XP_004134471.3  2.4e-229  90.43     aspartic proteinase CDR1 [Cucumis sativus] >KAE8650377.1 hypothetical protein Cs...
KAA0049446.1    4.6e-225  91.06     aspartic proteinase CDR1-like [Cucumis melo var. makuwa] >TYK16125.1 aspartic pr...
XP_023528351.1  1.0e-224  88.58     aspartic proteinase CDR1-like [Cucurbita pepo subsp. pepo]
KAG6582237.1    1.6e-222  88.13     Aspartic proteinase CDR1, partial [Cucurbita argyrosperma subsp. sororia]
Match Name      E-value   Identity  Description
Q6XBF8          1.1e-59   35.51     Aspartic proteinase CDR1 OS=Arabidopsis thaliana OX=3702 GN=CDR1 PE=1 SV=1
Q3EBM5          2.8e-52   32.76     Probable aspartic protease At2g35615 OS=Arabidopsis thaliana OX=3702 GN=At2g3561...
Q766C3          2.2e-44   31.76     Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 S...
Q766C2          2.1e-39   30.62     Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 S...
Q9LNJ3          6.0e-34   32.31     Aspartyl protease family protein 2 OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 ...
Match Name      E-value   Identity  Description
A0A1S3AZA6      3.0e-230  90.45     aspartic proteinase CDR1-like OS=Cucumis melo OX=3656 GN=LOC103484191 PE=3 SV=1
A0A5D3CXD4      2.2e-225  91.06     Aspartic proteinase CDR1-like OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_sc...
A0A0A0L7U3      2.2e-225  91.06     Peptidase A1 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_3G15152...
A0A6J1IXB1      2.5e-221  87.90     aspartic proteinase CDR1-like OS=Cucurbita maxima OX=3661 GN=LOC111479389 PE=3 S...
A0A6J1GWK9      2.2e-220  87.21     aspartic proteinase CDR1-like OS=Cucurbita moschata OX=3662 GN=LOC111457819 PE=3...
Match Name      E-value   Identity  Description
AT5G33340.1     7.7e-61   35.51     Eukaryotic aspartyl protease family protein
AT2G35615.1     2.0e-53   32.76     Eukaryotic aspartyl protease family protein
AT1G31450.1     7.2e-51   32.87     Eukaryotic aspartyl protease family protein
AT1G64830.1     3.3e-48   32.18     Eukaryotic aspartyl protease family protein
AT2G03200.1     8.2e-47   32.87     Eukaryotic aspartyl protease family protein
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term   IPR Description                        Source       Source Term     Source Description                Alignment
IPR021109  Aspartic peptidase domain superfamily  GENE3D       2.40.70.10      Acid Proteases                    coord: 65..263; e-value: 3.9E-33; score: 117.0
IPR021109  Aspartic peptidase domain superfamily  GENE3D       2.40.70.10      Acid Proteases                    coord: 268..438; e-value: 1.7E-38; score: 133.8
IPR021109  Aspartic peptidase domain superfamily  SUPERFAMILY  50630           Acid proteases                    coord: 83..435
IPR032799  Xylanase inhibitor, C-terminal         PFAM         PF14541         TAXi_C                            coord: 282..431; e-value: 2.3E-25; score: 89.2
IPR032861  Xylanase inhibitor, N-terminal         PFAM         PF14543         TAXi_N                            coord: 90..264; e-value: 2.1E-33; score: 116.0
None       No IPR available                       PANTHER      PTHR47967       OS07G0603500 PROTEIN-RELATED      coord: 12..436
None       No IPR available                       PANTHER      PTHR47967:SF66  ASPARTIC PROTEINASE CDR1-RELATED  coord: 12..436
IPR033121  Peptidase family A1 domain             PROSITE      PS51767         PEPTIDASE_A1                      coord: 90..431; score: 27.354406
IPR034161  Pepsin-like domain, plant              CDD          cd05476         pepsin_A_like_plant               coord: 89..435; e-value: 1.95385E-63; score: 203.649
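
Each InterPro match carries its alignment details as `coord: start..end`, optionally with `e-value:` and `score:` fields. The following Python sketch shows one way to turn those fields into numbers and to derive the span covered by a match, taken here as end minus start plus one on the assumption that coordinates are inclusive. The function name `parse_alignment_details` and the dictionary keys are hypothetical and not part of InterProScan or this database.

```python
import re


def parse_alignment_details(text):
    """Parse 'coord', 'e-value' and 'score' fields from an alignment cell.

    The fields may be separated by newlines or semicolons; 'e-value' and
    'score' are optional and are returned as None when absent.
    """
    coord = re.search(r"coord:\s*(\d+)\.\.(\d+)", text)
    if coord is None:
        raise ValueError(f"no coord field in: {text!r}")
    start, end = int(coord.group(1)), int(coord.group(2))

    evalue = re.search(r"e-value:\s*([0-9.]+(?:[eE][-+]?\d+)?)", text)
    score = re.search(r"score:\s*([0-9.]+)", text)

    return {
        "start": start,
        "end": end,
        # Inclusive coordinates are assumed: 90..264 covers 175 residues.
        "span": end - start + 1,
        "evalue": float(evalue.group(1)) if evalue else None,
        "score": float(score.group(1)) if score else None,
    }


# Alignment details of the PF14543 (TAXi_N) match listed above.
taxi_n = parse_alignment_details("coord: 90..264\ne-value: 2.1E-33\nscore: 116.0")
print(taxi_n["span"], taxi_n["evalue"], taxi_n["score"])
```

Under the inclusive-coordinate assumption, the TAXi_N match (90..264) spans 175 residues and the TAXi_C match (282..431) spans 150.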

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
HG10012598.1  HG10012598.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
cellular_component  GO:0005576      extracellular region
molecular_function  GO:0016787      hydrolase activity