Lsi10G007850 (gene) Bottle gourd (USVL1VR-Ls) v1

Overview
NameLsi10G007850
Typegene
OrganismLagenaria siceraria (Bottle gourd (USVL1VR-Ls) v1)
Descriptionaspartic proteinase Asp1-like
Locationchr10: 10689408 .. 10692479 (-)
RNA-Seq ExpressionLsi10G007850
SyntenyLsi10G007850
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGGGAAAGGGGTATTGATGATATTGGTTCTGATGGTGGCCTCAATGAGCTGTTTGGCTCTGTGTTCAGCTTCTTCGTTCTTTAAGGATAAGCCATGGGAGAGAAGAAGGCCAATTCTGTCGGTTCCGACTGCATCTTCTTCGTTTGCTTCATCCTCCATTGTGTTGCCTCTTCAAGGGAACGTCTTTCCAAATGGGTAAGCCATAACTCTGCTCAAAGATAACGACGAAGGTTCAGCTTTTGTTTTTTCCCATTTCGATTTTACTCCACTGTTACAGAGTACGAAGTGGCTGAAGAGGAAGTTCAATGTGAATGTGTTGACTCCAAATATACTTTTCACATTGATAGTATTGTAATAATATCAGTAGGCCTTTCCCATTCTTAATCTTATTGCATCTTCTCTCACGTGTTAAATTGGAGAAAAAATGATTACGATGATTACGAGACTCTGTACTCTTATGTTTGCTTCTCTCGCGTATTATGCTATTCCTGTGCAATAGCTTTGGTGGAGTGTATTTTTGAGTGCTTTTCAGAATAGGGTTTGTAACTTTGTAGCAAAAGAACACATCCCCAAATTAAGTACTTCTAAAATCTTCAAACCCATTTTCCTGAATATTTTTAATTAGTTACAAACAGTTTGAAAATAAAATATATATTTTTTTGTTCTTGTTTGAGGCTAACTTCTTTACTATCAGATCCTAAATAATGATTTTCATCAATATGTTTTGAGTCATTTAATCCGTGTCTTTTTTCAGGTTCTATAACGTTACTCTTTATGTAGGGCAGCCTCCAAAGCCTTATTTTCTAGATCCAGACACCGGTAGTGATCTCACTTGGCTTCAATGTGACGCTCCATGTCAGCAGTGCACCGAGGTAAATTTTTATTTGATGGTAATTTGGAGATTCTTGTAAATTTGATGACCCAAATGATTGAAGATAGTTTACCAAAGTACAGCTTTGGCTTCATAAGTTAATTTTGATCATAAGTAATTCAAATTTTCATTTTTGAATATTCTATGTTGTTCTATGTTCCCTTATATTTCTTCTTGATGTGCTTAGTGCTTCTTCTTTCGCTCGGTCTCTCACTACCTTAAAAAATTATTGCATTTCTTTTGGAGTTGAGAGTTAAGGTTGATTGCTTCAAATTGATGGCCTTGTTTCTTTCCAGACACTTCATCCGCTCTATCAACCAAGCAACGATCTTGTGCCATGTAAGGACCCTTTGTGTATGTCCTTGCACTCATCTATGGACCACAAATGTGAGAACCCAGGTCAATGTGACTACGAGGTTGAGTATGCAGATGGCGGTTCTTCTCTTGGAGTCCTTGTCAGGGATGTATTTCCTCTCAACTTAACCAATGGAGATCCAATTAGACCGCGTTTGGCCCTCGGGTAAACTGCTCAACTGCCTTCAGTAATTCATCTTAATTCAGTGTACTGCTGAGTTCCTAGCTCTCTAAATCAACCTTGCTACTTCCAATTACAACCGCACATGAAGGTTTTGAGAAAGAAACATCATATCATGCTGTCGTATCTTATGTTTTGCACTTTCCTGACATTGCAAGATAATGAATAGCATGAATGTAAAGGTAGAGTTTATGTTGAATAACAAACGTGTGCATTCATGCTGATGGGTTACTTCAAGAAGACAATTAAGATAGCACCAGGCACTAGATTTTGTGACGGACATAATGATTTGGGTTTGTATTTTTCATAGTTATCAATAAATATTTGTACAAGAATTTCATGCGTATCTAGTCCTTACGAACCTTAGTTGTCAGCTCATGTGGTTGAATAGCATGCATGTAAAGTCGGTATGTAAGTTAGCTATTTGAATCTATAGTACTGAGTTTGGTTGGTTCCGTTTGATTACTCAGATGTGGTTATGATCAAGATCCTGGATCATCATCTTATCACCCCATGGATGGAATACTTGGCCTTGGACGGGGAGCAGTAAGCATGGTCTCACAACTGCATAATCAAGGCATTGTCCGTAATGTCGTTGGTCACTGTTTCAGCAGCAAAGGAGGAGGATATCTTTTCTTTGGGGATGGCATTTATGATCCTTATCGCTTAGTTTGGACGCCCATGTCACGGGACTACCCGTAAACGACCTGCCTTGATCATATATATATATATATATACCATGTTATAATTCATGTTAGCTTCTGTTTTTATAAAACCTTCCTCATGTGACTGTATCAGGAAGCACTACTCCCCTGGGTTTGGAGAACTAATCTTCAATGGAAGAAGTACTGGACTCAGAAACCTGTTTGTAGTTTTTGACAGTGGGAGCTCTTACACATACTTCAATGCTCAGGCTTATCAAGTTTTAACATCTTTGGTAAATCATCTTTAAAAATAACTATTGACTGTGTCTGGGAAAAATAGATCTTTCTGGCTCTTGAAAACTGATGGTTAATACACAATGATTCTTCAATTCATGTTGGCACTGTTCTTGTAGTTAAATAGAGAACTAACTGGAAAACCGCTAAGAGAAGCCATGGACGACGATACGCTTCCGCTCTGTTGGAGAGGGCGGAAGCCATTCAAAAGCTTACGTGATGTGAGAAAATATTTCAAGCCATTGGCCTTGAGCTTTTCCAGTGGTGGAAGAACCAAAGCAGTGTTTGAAATACCAATGGAAGGTTATCTGATAATATCGGTAAAAGCTCCAACTGCTATTCACTACACATTGTCTCTTTGGATTTTCAAGTTGAATAATTACACTTATAAGTTAATTTTGGAATGGCAGTCCATGGGAAATGTTTGCTTAGGAATTCTGAACGGCACCGACGTTGGGCTTGAAAACTCGAATATCATTGGTGGTACGTTATTGCATGCATTGCATGTTATTATTTTCTCGGAATTGCAAATACAAATGCCTTCGAAACCATGCAAAATTATTCTTTTATTAGAGAGAAAAAAAGAGCCCCATTTTCTGTATGCTTTTGTTGACAGATATATCAATGCAAGATAAAATGGTAGTATACAACAACGAGAAGCAAGCAATTGGATGGGCTACTGCTAACTGTGATCGGGTTCCCAAGTCTCGAGTTGGTAGCATGTAA

mRNA sequence

ATGGGGAAAGGGGTATTGATGATATTGGTTCTGATGGTGGCCTCAATGAGCTGTTTGGCTCTGTGTTCAGCTTCTTCGTTCTTTAAGGATAAGCCATGGGAGAGAAGAAGGCCAATTCTGTCGGTTCCGACTGCATCTTCTTCGTTTGCTTCATCCTCCATTGTGTTGCCTCTTCAAGGGAACGTCTTTCCAAATGGGTTCTATAACGTTACTCTTTATGTAGGGCAGCCTCCAAAGCCTTATTTTCTAGATCCAGACACCGGTAGTGATCTCACTTGGCTTCAATGTGACGCTCCATGTCAGCAGTGCACCGAGACACTTCATCCGCTCTATCAACCAAGCAACGATCTTGTGCCATGTAAGGACCCTTTGTGTATGTCCTTGCACTCATCTATGGACCACAAATGTGAGAACCCAGGTCAATGTGACTACGAGGTTGAGTATGCAGATGGCGGTTCTTCTCTTGGAGTCCTTGTCAGGGATGTATTTCCTCTCAACTTAACCAATGGAGATCCAATTAGACCGCGTTTGGCCCTCGGATGTGGTTATGATCAAGATCCTGGATCATCATCTTATCACCCCATGGATGGAATACTTGGCCTTGGACGGGGAGCAGTAAGCATGGTCTCACAACTGCATAATCAAGGCATTGTCCGTAATGTCGTTGGTCACTGTTTCAGCAGCAAAGGAGGAGGATATCTTTTCTTTGGGGATGGCATTTATGATCCTTATCGCTTAGTTTGGACGCCCATGTCACGGGACTACCCGAAGCACTACTCCCCTGGGTTTGGAGAACTAATCTTCAATGGAAGAAGTACTGGACTCAGAAACCTGTTTGTAGTTTTTGACAGTGGGAGCTCTTACACATACTTCAATGCTCAGGCTTATCAAGTTTTAACATCTTTGTTAAATAGAGAACTAACTGGAAAACCGCTAAGAGAAGCCATGGACGACGATACGCTTCCGCTCTGTTGGAGAGGGCGGAAGCCATTCAAAAGCTTACGTGATGTGAGAAAATATTTCAAGCCATTGGCCTTGAGCTTTTCCAGTGGTGGAAGAACCAAAGCAGTGTTTGAAATACCAATGGAAGGTTATCTGATAATATCGTCCATGGGAAATGTTTGCTTAGGAATTCTGAACGGCACCGACGTTGGGCTTGAAAACTCGAATATCATTGGTGATATATCAATGCAAGATAAAATGGTAGTATACAACAACGAGAAGCAAGCAATTGGATGGGCTACTGCTAACTGTGATCGGGTTCCCAAGTCTCGAGTTGGTAGCATGTAA

Coding sequence (CDS)

ATGGGGAAAGGGGTATTGATGATATTGGTTCTGATGGTGGCCTCAATGAGCTGTTTGGCTCTGTGTTCAGCTTCTTCGTTCTTTAAGGATAAGCCATGGGAGAGAAGAAGGCCAATTCTGTCGGTTCCGACTGCATCTTCTTCGTTTGCTTCATCCTCCATTGTGTTGCCTCTTCAAGGGAACGTCTTTCCAAATGGGTTCTATAACGTTACTCTTTATGTAGGGCAGCCTCCAAAGCCTTATTTTCTAGATCCAGACACCGGTAGTGATCTCACTTGGCTTCAATGTGACGCTCCATGTCAGCAGTGCACCGAGACACTTCATCCGCTCTATCAACCAAGCAACGATCTTGTGCCATGTAAGGACCCTTTGTGTATGTCCTTGCACTCATCTATGGACCACAAATGTGAGAACCCAGGTCAATGTGACTACGAGGTTGAGTATGCAGATGGCGGTTCTTCTCTTGGAGTCCTTGTCAGGGATGTATTTCCTCTCAACTTAACCAATGGAGATCCAATTAGACCGCGTTTGGCCCTCGGATGTGGTTATGATCAAGATCCTGGATCATCATCTTATCACCCCATGGATGGAATACTTGGCCTTGGACGGGGAGCAGTAAGCATGGTCTCACAACTGCATAATCAAGGCATTGTCCGTAATGTCGTTGGTCACTGTTTCAGCAGCAAAGGAGGAGGATATCTTTTCTTTGGGGATGGCATTTATGATCCTTATCGCTTAGTTTGGACGCCCATGTCACGGGACTACCCGAAGCACTACTCCCCTGGGTTTGGAGAACTAATCTTCAATGGAAGAAGTACTGGACTCAGAAACCTGTTTGTAGTTTTTGACAGTGGGAGCTCTTACACATACTTCAATGCTCAGGCTTATCAAGTTTTAACATCTTTGTTAAATAGAGAACTAACTGGAAAACCGCTAAGAGAAGCCATGGACGACGATACGCTTCCGCTCTGTTGGAGAGGGCGGAAGCCATTCAAAAGCTTACGTGATGTGAGAAAATATTTCAAGCCATTGGCCTTGAGCTTTTCCAGTGGTGGAAGAACCAAAGCAGTGTTTGAAATACCAATGGAAGGTTATCTGATAATATCGTCCATGGGAAATGTTTGCTTAGGAATTCTGAACGGCACCGACGTTGGGCTTGAAAACTCGAATATCATTGGTGATATATCAATGCAAGATAAAATGGTAGTATACAACAACGAGAAGCAAGCAATTGGATGGGCTACTGCTAACTGTGATCGGGTTCCCAAGTCTCGAGTTGGTAGCATGTAA

Protein sequence

MGKGVLMILVLMVASMSCLALCSASSFFKDKPWERRRPILSVPTASSSFASSSIVLPLQGNVFPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPCKDPLCMSLHSSMDHKCENPGQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALGCGYDQDPGSSSYHPMDGILGLGRGAVSMVSQLHNQGIVRNVVGHCFSSKGGGYLFFGDGIYDPYRLVWTPMSRDYPKHYSPGFGELIFNGRSTGLRNLFVVFDSGSSYTYFNAQAYQVLTSLLNRELTGKPLREAMDDDTLPLCWRGRKPFKSLRDVRKYFKPLALSFSSGGRTKAVFEIPMEGYLIISSMGNVCLGILNGTDVGLENSNIIGDISMQDKMVVYNNEKQAIGWATANCDRVPKSRVGSM
Homology
BLAST of Lsi10G007850 vs. ExPASy Swiss-Prot
Match: Q0IU52 (Aspartic proteinase Asp1 OS=Oryza sativa subsp. japonica OX=39947 GN=ASP1 PE=2 SV=1)

HSP 1 Score: 339.7 bits (870), Expect = 4.7e-92
Identity = 179/386 (46.37%), Postives = 252/386 (65.28%), Query Frame = 0

Query: 51  SSSIVLPLQGNVFPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPL 110
           SS++VL L GNV+P G + +T+ +G P K YFLD DTGS LTWLQCDAPC  C    H L
Sbjct: 21  SSAVVLELHGNVYPIGHFFITMNIGDPAKSYFLDIDTGSTLTWLQCDAPCTNCNIVPHVL 80

Query: 111 YQPS-NDLVPCKDPLCMSLHSSM--DHKCENPGQCDYEVEYADGGSSLGVLVRDVFPLNL 170
           Y+P+   LV C D LC  L++ +    +C +  QCDY ++Y D  SS+GVLV D F L+ 
Sbjct: 81  YKPTPKKLVTCADSLCTDLYTDLGKPKRCGSQKQCDYVIQYVD-SSSMGVLVIDRFSLSA 140

Query: 171 TNGDPIRPRLALGCGYDQDPGSSSYH-PMDGILGLGRGAVSMVSQLHNQGIV-RNVVGHC 230
           +NG      +A GCGYDQ   + +   P+D ILGL RG V+++SQL +QG++ ++V+GHC
Sbjct: 141 SNGTN-PTTIAFGCGYDQGKKNRNVPIPVDSILGLSRGKVTLLSQLKSQGVITKHVLGHC 200

Query: 231 FSSKGGGYLFFGDGIYDPYRLVWTPMSRDYPKHYSPGFGELIF--NGRSTGLRNLFVVFD 290
            SSKGGG+LFFGD       + WTPM+R++ K+YSPG G L F  N ++     + V+FD
Sbjct: 201 ISSKGGGFLFFGDAQVPTSGVTWTPMNREH-KYYSPGHGTLHFDSNSKAISAAPMAVIFD 260

Query: 291 SGSSYTYFNAQAYQVLTSLLNRELTG--KPLREAMDDD-TLPLCWRGRKPFKSLRDVRKY 350
           SG++YTYF AQ YQ   S++   L    K L E  + D  L +CW+G+    ++ +V+K 
Sbjct: 261 SGATYTYFAAQPYQATLSVVKSTLNSECKFLTEVTEKDRALTVCWKGKDKIVTIDEVKKC 320

Query: 351 FKPLALSFSSGGRTKAVFEIPMEGYLIISSMGNVCLGILNGT--DVGLENSNIIGDISMQ 410
           F+ L+L F+ G + KA  EIP E YLIIS  G+VCLGIL+G+   + L  +N+IG I+M 
Sbjct: 321 FRSLSLEFADGDK-KATLEIPPEHYLIISQEGHVCLGILDGSKEHLSLAGTNLIGGITML 380

Query: 411 DKMVVYNNEKQAIGWATANCDRVPKS 425
           D+MV+Y++E+  +GW    CDR+P+S
Sbjct: 381 DQMVIYDSERSLLGWVNYQCDRIPRS 402

BLAST of Lsi10G007850 vs. ExPASy Swiss-Prot
Match: A2ZC67 (Aspartic proteinase Asp1 OS=Oryza sativa subsp. indica OX=39946 GN=ASP1 PE=2 SV=2)

HSP 1 Score: 325.5 bits (833), Expect = 9.2e-88
Identity = 174/386 (45.08%), Postives = 247/386 (63.99%), Query Frame = 0

Query: 51  SSSIVLPLQGNVFPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPL 110
           SS++VL L GNV+P G + VT+ +G P KPYFLD DTGS LTWLQCD PC  C +  H L
Sbjct: 21  SSAVVLELHGNVYPIGHFFVTMNIGDPAKPYFLDIDTGSTLTWLQCDYPCINCNKVPHGL 80

Query: 111 YQPS-NDLVPCKDPLCMSLHSSM--DHKCENPGQCDYEVEYADGGSSLGVLVRDVFPLNL 170
           Y+P     V C +  C  L++ +    KC    QC Y ++Y  GGSS+GVL+ D F L  
Sbjct: 81  YKPELKYAVKCTEQRCADLYADLRKPMKCGPKNQCHYGIQYV-GGSSIGVLIVDSFSLPA 140

Query: 171 TNGDPIRPRLALGCGYDQDPGSSSY-HPMDGILGLGRGAVSMVSQLHNQGIV-RNVVGHC 230
           +NG      +A GCGY+Q   + +   P++GILGLGRG V+++SQL +QG++ ++V+GHC
Sbjct: 141 SNGTN-PTSIAFGCGYNQGKNNHNVPTPVNGILGLGRGKVTLLSQLKSQGVITKHVLGHC 200

Query: 231 FSSKGGGYLFFGDGIYDPYRLVWTPMSRDYPKHYSPGFGELIFNGRSTGL--RNLFVVFD 290
            SSKG G+LFFGD       + W+PM+R++ KHYSP  G L FN  S  +    + V+FD
Sbjct: 201 ISSKGKGFLFFGDAKVPTSGVTWSPMNREH-KHYSPRQGTLQFNSNSKPISAAPMEVIFD 260

Query: 291 SGSSYTYFNAQAYQVLTSLLNRELTG--KPLREAMDDD-TLPLCWRGRKPFKSLRDVRKY 350
           SG++YTYF  Q Y    S++   L+   K L E  + D  L +CW+G+   +++ +V+K 
Sbjct: 261 SGATYTYFALQPYHATLSVVKSTLSKECKFLTEVKEKDRALTVCWKGKDKIRTIDEVKKC 320

Query: 351 FKPLALSFSSGGRTKAVFEIPMEGYLIISSMGNVCLGILNGT--DVGLENSNIIGDISMQ 410
           F+ L+L F+ G + KA  EIP E YLIIS  G+VCLGIL+G+     L  +N+IG I+M 
Sbjct: 321 FRSLSLKFADGDK-KATLEIPPEHYLIISQEGHVCLGILDGSKEHPSLAGTNLIGGITML 380

Query: 411 DKMVVYNNEKQAIGWATANCDRVPKS 425
           D+MV+Y++E+  +GW    CDR+P+S
Sbjct: 381 DQMVIYDSERSLLGWVNYQCDRIPRS 402

BLAST of Lsi10G007850 vs. ExPASy Swiss-Prot
Match: Q9M9A8 (Aspartyl protease APCB1 OS=Arabidopsis thaliana OX=3702 GN=APCB1 PE=1 SV=1)

HSP 1 Score: 301.6 bits (771), Expect = 1.4e-80
Identity = 170/396 (42.93%), Postives = 236/396 (59.60%), Query Frame = 0

Query: 44  TASSSFASSSIVLPLQGNVFPNGFYNVTLYVGQPP--KPYFLDPDTGSDLTWLQCDAPCQ 103
           T++ S  SS+ + P+ GNV+P+G Y   + VG+P   + Y LD DTGS+LTW+QCDAPC 
Sbjct: 179 TSAGSIDSSTTIFPVGGNVYPDGLYYTRILVGKPEDGQYYHLDIDTGSELTWIQCDAPCT 238

Query: 104 QCTETLHPLYQPSND-LVPCKDPLCMSL-HSSMDHKCENPGQCDYEVEYADGGSSLGVLV 163
            C +  + LY+P  D LV   +  C+ +  + +   CEN  QCDYE+EYAD   S+GVL 
Sbjct: 239 SCAKGANQLYKPRKDNLVRSSEAFCVEVQRNQLTEHCENCHQCDYEIEYADHSYSMGVLT 298

Query: 164 RDVFPLNLTNGDPIRPRLALGCGYDQDP-GSSSYHPMDGILGLGRGAVSMVSQLHNQGIV 223
           +D F L L NG      +  GCGYDQ     ++    DGILGL R  +S+ SQL ++GI+
Sbjct: 299 KDKFHLKLHNGSLAESDIVFGCGYDQQGLLLNTLLKTDGILGLSRAKISLPSQLASRGII 358

Query: 224 RNVVGHCFSS--KGGGYLFFGDGIYDPYRLVWTPMSRD--------YPKHYSPGFGELIF 283
            NVVGHC +S   G GY+F G  +   + + W PM  D             S G G L  
Sbjct: 359 SNVVGHCLASDLNGEGYIFMGSDLVPSHGMTWVPMLHDSRLDAYQMQVTKMSYGQGMLSL 418

Query: 284 NGRSTGLRNLFVVFDSGSSYTYFNAQAY-QVLTSLLNRELTGKPLREAMDDDTLPLCWRG 343
           +G +  +    V+FD+GSSYTYF  QAY Q++TSL  +E++G  L     D+TLP+CWR 
Sbjct: 419 DGENGRVGK--VLFDTGSSYTYFPNQAYSQLVTSL--QEVSGLELTRDDSDETLPICWRA 478

Query: 344 RK--PFKSLRDVRKYFKPLALSFSSGGR-TKAVFEIPMEGYLIISSMGNVCLGILNGTDV 403
           +   PF SL DV+K+F+P+ L   S          I  E YLIIS+ GNVCLGIL+G+ V
Sbjct: 479 KTNFPFSSLSDVKKFFRPITLQIGSKWLIISRKLLIQPEDYLIISNKGNVCLGILDGSSV 538

Query: 404 GLENSNIIGDISMQDKMVVYNNEKQAIGWATANCDR 421
              ++ I+GDISM+  ++VY+N K+ IGW  ++C R
Sbjct: 539 HDGSTIILGDISMRGHLIVYDNVKRRIGWMKSDCVR 570

BLAST of Lsi10G007850 vs. ExPASy Swiss-Prot
Match: Q9S9K4 (Aspartic proteinase 39 OS=Arabidopsis thaliana OX=3702 GN=A39 PE=1 SV=2)

HSP 1 Score: 145.6 bits (366), Expect = 1.3e-33
Identity = 126/448 (28.12%), Postives = 195/448 (43.53%), Query Frame = 0

Query: 5   VLMILVLMVASMSCLALCSASSFFKDKPWERRRPILSVPTASSSFASSSIVLPLQGN--V 64
           V+ + V+++   S   +  A   F  K  +      S  T   S   +SI LPL G+  V
Sbjct: 10  VVAVFVIVIEFASANFVFKAQHKFAGKK-KNLEHFKSHDTRRHSRMLASIDLPLGGDSRV 69

Query: 65  FPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPS-------- 124
              G Y   + +G PPK Y +  DTGSD+ W+ C  PC +C    +  ++ S        
Sbjct: 70  DSVGLYFTKIKLGSPPKEYHVQVDTGSDILWINC-KPCPKCPTKTNLNFRLSLFDMNASS 129

Query: 125 -NDLVPCKDPLCMSLHSSMDHKCENPGQCDYEVEYADGGSSLGVLVRDVFPLNLTNGD-- 184
            +  V C D  C  +  S    C+    C Y + YAD  +S G  +RD+  L    GD  
Sbjct: 130 TSKKVGCDDDFCSFI--SQSDSCQPALGCSYHIVYADESTSDGKFIRDMLTLEQVTGDLK 189

Query: 185 --PIRPRLALGCGYDQDPG-SSSYHPMDGILGLGRGAVSMVSQLHNQGIVRNVVGHCFSS 244
             P+   +  GCG DQ     +    +DG++G G+   S++SQL   G  + V  HC  +
Sbjct: 190 TGPLGQEVVFGCGSDQSGQLGNGDSAVDGVMGFGQSNTSVLSQLAATGDAKRVFSHCLDN 249

Query: 245 -KGGGYLFFGDGIYDPYRLVWTPMSRDYPKHYSPGFGELIFNGRSTGL-----RNLFVVF 304
            KGGG   F  G+ D  ++  TPM  +   HY+     +  +G S  L     RN   + 
Sbjct: 250 VKGGG--IFAVGVVDSPKVKTTPMVPN-QMHYNVMLMGMDVDGTSLDLPRSIVRNGGTIV 309

Query: 305 DSGSSYTYFNAQAYQVLTSLLNRELTGKPLREAMDDDTLPLCWRGRKPFKSLRDVRKYFK 364
           DSG++  YF    Y    SL+   L  +P++  + ++T        + F    +V + F 
Sbjct: 310 DSGTTLAYFPKVLYD---SLIETILARQPVKLHIVEETF-------QCFSFSTNVDEAFP 369

Query: 365 PLALSFSSGGRTKAVFEIPMEGYLIISSMGNVCLGILNGTDVGLENSNII--GDISMQDK 424
           P++  F    +      +    YL        C G   G     E S +I  GD+ + +K
Sbjct: 370 PVSFEFEDSVK----LTVYPHDYLFTLEEELYCFGWQAGGLTTDERSEVILLGDLVLSNK 429

Query: 425 MVVYNNEKQAIGWATANCDRVPKSRVGS 429
           +VVY+ + + IGWA  NC    K + GS
Sbjct: 430 LVVYDLDNEVIGWADHNCSSSIKIKDGS 436

BLAST of Lsi10G007850 vs. ExPASy Swiss-Prot
Match: Q4V3D2 (Aspartic proteinase 36 OS=Arabidopsis thaliana OX=3702 GN=A36 PE=1 SV=1)

HSP 1 Score: 127.1 bits (318), Expect = 4.8e-28
Identity = 106/403 (26.30%), Postives = 169/403 (41.94%), Query Frame = 0

Query: 52  SSIVLPLQGNVFPN--GFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQC---TET 111
           ++I LPL G+   +  G Y   + +G PPK Y++  DTGSD+ W+ C APC +C   T+ 
Sbjct: 60  ANIDLPLGGDSRADSIGLYFTKIKLGSPPKEYYVQVDTGSDILWVNC-APCPKCPVKTDL 119

Query: 112 LHPL------YQPSNDLVPCKDPLCMSLHSSMDHKCENPGQCDYEVEYADGGSSLGVLVR 171
             PL         ++  V C+D  C  +  S    C     C Y V Y DG +S G  ++
Sbjct: 120 GIPLSLYDSKTSSTSKNVGCEDDFCSFIMQS--ETCGAKKPCSYHVVYGDGSTSDGDFIK 179

Query: 172 DVFPLNLTNGD----PIRPRLALGCGYDQDPG-SSSYHPMDGILGLGRGAVSMVSQLHNQ 231
           D   L    G+    P+   +  GCG +Q      +   +DGI+G G+   S++SQL   
Sbjct: 180 DNITLEQVTGNLRTAPLAQEVVFGCGKNQSGQLGQTDSAVDGIMGFGQSNTSIISQLAAG 239

Query: 232 GIVRNVVGHCFSSKGGGYLFFGDGIYDPYRLVWTPMSRDYPKHYSPGFGELIFNGRSTGL 291
           G  + +  HC  +  GG +F    +  P  +V T        HY+     +  +G    L
Sbjct: 240 GSTKRIFSHCLDNMNGGGIFAVGEVESP--VVKTTPIVPNQVHYNVILKGMDVDGDPIDL 299

Query: 292 RNLF--------VVFDSGSSYTYFNAQAYQVLTSLLNRELTGKPLREAMDDDTLPLCWRG 351
                        + DSG++  Y     Y    SL+ +    + ++  M  +T       
Sbjct: 300 PPSLASTNGDGGTIIDSGTTLAYLPQNLY---NSLIEKITAKQQVKLHMVQETFAC---- 359

Query: 352 RKPFKSLRDVRKYFKPLALSFSSGGRTKAVFEIPMEGYLIISSMGNVCLGILNGTDVGLE 411
              F    +  K F  + L F    +      +    YL        C G  +G     +
Sbjct: 360 ---FSFTSNTDKAFPVVNLHFEDSLK----LSVYPHDYLFSLREDMYCFGWQSGGMTTQD 419

Query: 412 NSNII--GDISMQDKMVVYNNEKQAIGWATANCDRVPKSRVGS 429
            +++I  GD+ + +K+VVY+ E + IGWA  NC    K + GS
Sbjct: 420 GADVILLGDLVLSNKLVVYDLENEVIGWADHNCSSSIKVKDGS 443

BLAST of Lsi10G007850 vs. ExPASy TrEMBL
Match: A0A5D3BS69 (Aspartic proteinase Asp1 isoform X2 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold680G00070 PE=4 SV=1)

HSP 1 Score: 859.4 bits (2219), Expect = 6.5e-246
Identity = 408/428 (95.33%), Postives = 420/428 (98.13%), Query Frame = 0

Query: 1   MGKGVLMILVLMVASMSCLALCSASSFFKDKPWERRRPILSVPTASSSFASSSIVLPLQG 60
           MGK VL++L LMVASMSCLA CSASSFFKDKPWER+RPILSVPTASSSFASSSIVLPLQG
Sbjct: 42  MGKWVLVVLALMVASMSCLAPCSASSFFKDKPWERKRPILSVPTASSSFASSSIVLPLQG 101

Query: 61  NVFPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPC 120
           NV+PNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPC
Sbjct: 102 NVYPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPC 161

Query: 121 KDPLCMSLHSSMDHKCENPGQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALG 180
           KDPLCMSLHSSMDH+CENP QCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALG
Sbjct: 162 KDPLCMSLHSSMDHRCENPDQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALG 221

Query: 181 CGYDQDPGSSSYHPMDGILGLGRGAVSMVSQLHNQGIVRNVVGHCFSSKGGGYLFFGDGI 240
           CGYDQDPGSSSYHPMDGILGLGRGAVS+VSQLHNQGIVRNVVGHCF+SKGGGYLFFGDGI
Sbjct: 222 CGYDQDPGSSSYHPMDGILGLGRGAVSIVSQLHNQGIVRNVVGHCFNSKGGGYLFFGDGI 281

Query: 241 YDPYRLVWTPMSRDYPKHYSPGFGELIFNGRSTGLRNLFVVFDSGSSYTYFNAQAYQVLT 300
           YDPYRLVWTPMSRDYPKHYSPGFGEL+FNGRSTGLRNLFVVFDSGSSYTYFNAQAYQVLT
Sbjct: 282 YDPYRLVWTPMSRDYPKHYSPGFGELMFNGRSTGLRNLFVVFDSGSSYTYFNAQAYQVLT 341

Query: 301 SLLNRELTGKPLREAMDDDTLPLCWRGRKPFKSLRDVRKYFKPLALSFSSGGRTKAVFEI 360
           SLLNREL GKPLREAMDDDTLPLCWR RKP KSLRDVRKYFKPLALSFSSGGR+KAVFEI
Sbjct: 342 SLLNRELAGKPLREAMDDDTLPLCWRERKPIKSLRDVRKYFKPLALSFSSGGRSKAVFEI 401

Query: 361 PMEGYLIISSMGNVCLGILNGTDVGLENSNIIGDISMQDKMVVYNNEKQAIGWATANCDR 420
           P+EGY+IISSMGNVCLGILNGTDVGLENSNIIGDISMQDKMVVYNNEKQAIGWATANCDR
Sbjct: 402 PIEGYMIISSMGNVCLGILNGTDVGLENSNIIGDISMQDKMVVYNNEKQAIGWATANCDR 461

Query: 421 VPKSRVGS 429
           VPKS+V S
Sbjct: 462 VPKSQVSS 469

BLAST of Lsi10G007850 vs. ExPASy TrEMBL
Match: A0A1S3CDB2 (aspartic proteinase Asp1 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103499584 PE=4 SV=1)

HSP 1 Score: 859.4 bits (2219), Expect = 6.5e-246
Identity = 408/428 (95.33%), Postives = 420/428 (98.13%), Query Frame = 0

Query: 1   MGKGVLMILVLMVASMSCLALCSASSFFKDKPWERRRPILSVPTASSSFASSSIVLPLQG 60
           MGK VL++L LMVASMSCLA CSASSFFKDKPWER+RPILSVPTASSSFASSSIVLPLQG
Sbjct: 1   MGKWVLVVLALMVASMSCLAPCSASSFFKDKPWERKRPILSVPTASSSFASSSIVLPLQG 60

Query: 61  NVFPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPC 120
           NV+PNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPC
Sbjct: 61  NVYPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPC 120

Query: 121 KDPLCMSLHSSMDHKCENPGQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALG 180
           KDPLCMSLHSSMDH+CENP QCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALG
Sbjct: 121 KDPLCMSLHSSMDHRCENPDQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALG 180

Query: 181 CGYDQDPGSSSYHPMDGILGLGRGAVSMVSQLHNQGIVRNVVGHCFSSKGGGYLFFGDGI 240
           CGYDQDPGSSSYHPMDGILGLGRGAVS+VSQLHNQGIVRNVVGHCF+SKGGGYLFFGDGI
Sbjct: 181 CGYDQDPGSSSYHPMDGILGLGRGAVSIVSQLHNQGIVRNVVGHCFNSKGGGYLFFGDGI 240

Query: 241 YDPYRLVWTPMSRDYPKHYSPGFGELIFNGRSTGLRNLFVVFDSGSSYTYFNAQAYQVLT 300
           YDPYRLVWTPMSRDYPKHYSPGFGEL+FNGRSTGLRNLFVVFDSGSSYTYFNAQAYQVLT
Sbjct: 241 YDPYRLVWTPMSRDYPKHYSPGFGELMFNGRSTGLRNLFVVFDSGSSYTYFNAQAYQVLT 300

Query: 301 SLLNRELTGKPLREAMDDDTLPLCWRGRKPFKSLRDVRKYFKPLALSFSSGGRTKAVFEI 360
           SLLNREL GKPLREAMDDDTLPLCWR RKP KSLRDVRKYFKPLALSFSSGGR+KAVFEI
Sbjct: 301 SLLNRELAGKPLREAMDDDTLPLCWRERKPIKSLRDVRKYFKPLALSFSSGGRSKAVFEI 360

Query: 361 PMEGYLIISSMGNVCLGILNGTDVGLENSNIIGDISMQDKMVVYNNEKQAIGWATANCDR 420
           P+EGY+IISSMGNVCLGILNGTDVGLENSNIIGDISMQDKMVVYNNEKQAIGWATANCDR
Sbjct: 361 PIEGYMIISSMGNVCLGILNGTDVGLENSNIIGDISMQDKMVVYNNEKQAIGWATANCDR 420

Query: 421 VPKSRVGS 429
           VPKS+V S
Sbjct: 421 VPKSQVSS 428

BLAST of Lsi10G007850 vs. ExPASy TrEMBL
Match: A0A1S3CDB4 (aspartic proteinase Asp1 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103499584 PE=4 SV=1)

HSP 1 Score: 853.6 bits (2204), Expect = 3.6e-244
Identity = 408/432 (94.44%), Postives = 420/432 (97.22%), Query Frame = 0

Query: 1   MGKGVLMILVLMVASMSCLALCSASSFFKDKPWERRRPILSVPTASSSFASSSIVLPLQG 60
           MGK VL++L LMVASMSCLA CSASSFFKDKPWER+RPILSVPTASSSFASSSIVLPLQG
Sbjct: 1   MGKWVLVVLALMVASMSCLAPCSASSFFKDKPWERKRPILSVPTASSSFASSSIVLPLQG 60

Query: 61  NVFPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPC 120
           NV+PNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPC
Sbjct: 61  NVYPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPC 120

Query: 121 KDPLCMSLHSSMDHKCENPGQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALG 180
           KDPLCMSLHSSMDH+CENP QCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALG
Sbjct: 121 KDPLCMSLHSSMDHRCENPDQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALG 180

Query: 181 ----CGYDQDPGSSSYHPMDGILGLGRGAVSMVSQLHNQGIVRNVVGHCFSSKGGGYLFF 240
               CGYDQDPGSSSYHPMDGILGLGRGAVS+VSQLHNQGIVRNVVGHCF+SKGGGYLFF
Sbjct: 181 CQLICGYDQDPGSSSYHPMDGILGLGRGAVSIVSQLHNQGIVRNVVGHCFNSKGGGYLFF 240

Query: 241 GDGIYDPYRLVWTPMSRDYPKHYSPGFGELIFNGRSTGLRNLFVVFDSGSSYTYFNAQAY 300
           GDGIYDPYRLVWTPMSRDYPKHYSPGFGEL+FNGRSTGLRNLFVVFDSGSSYTYFNAQAY
Sbjct: 241 GDGIYDPYRLVWTPMSRDYPKHYSPGFGELMFNGRSTGLRNLFVVFDSGSSYTYFNAQAY 300

Query: 301 QVLTSLLNRELTGKPLREAMDDDTLPLCWRGRKPFKSLRDVRKYFKPLALSFSSGGRTKA 360
           QVLTSLLNREL GKPLREAMDDDTLPLCWR RKP KSLRDVRKYFKPLALSFSSGGR+KA
Sbjct: 301 QVLTSLLNRELAGKPLREAMDDDTLPLCWRERKPIKSLRDVRKYFKPLALSFSSGGRSKA 360

Query: 361 VFEIPMEGYLIISSMGNVCLGILNGTDVGLENSNIIGDISMQDKMVVYNNEKQAIGWATA 420
           VFEIP+EGY+IISSMGNVCLGILNGTDVGLENSNIIGDISMQDKMVVYNNEKQAIGWATA
Sbjct: 361 VFEIPIEGYMIISSMGNVCLGILNGTDVGLENSNIIGDISMQDKMVVYNNEKQAIGWATA 420

Query: 421 NCDRVPKSRVGS 429
           NCDRVPKS+V S
Sbjct: 421 NCDRVPKSQVSS 432

BLAST of Lsi10G007850 vs. ExPASy TrEMBL
Match: A0A6J1I590 (aspartic proteinase Asp1-like OS=Cucurbita maxima OX=3661 GN=LOC111469733 PE=4 SV=1)

HSP 1 Score: 825.5 bits (2131), Expect = 1.0e-235
Identity = 391/431 (90.72%), Postives = 411/431 (95.36%), Query Frame = 0

Query: 1   MGKGVLMILVLMVASMSCLALCSASSFFKDKPWERRRPILSVP--TASSSFASSSIVLPL 60
           MGKGVLMILVLMV+S+SCLA CSASSFFKDK WERRRP LSVP  +ASSS AS SIVLPL
Sbjct: 1   MGKGVLMILVLMVSSISCLAPCSASSFFKDKLWERRRPTLSVPIASASSSIASPSIVLPL 60

Query: 61  QGNVFPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLV 120
           QGNVFPNGFYNVTL++GQPPKPYFLDPDTGSDLTWLQCDAPCQQCTET HPLYQPSNDLV
Sbjct: 61  QGNVFPNGFYNVTLFIGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETPHPLYQPSNDLV 120

Query: 121 PCKDPLCMSLHSSMDHKCENPGQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLA 180
           PCKDPLCMSLHSS+DH+CENP QCDYEVEYADGGSSLGVLVRD+FPLNLTNGDPIRPRL 
Sbjct: 121 PCKDPLCMSLHSSIDHRCENPDQCDYEVEYADGGSSLGVLVRDIFPLNLTNGDPIRPRLT 180

Query: 181 LGCGYDQDPGSSSYHPMDGILGLGRGAVSMVSQLHNQGIVRNVVGHCFSSKGGGYLFFGD 240
           LGCGYDQ+PGSSSYHPMDG+LGLG+GAVS+VSQLHNQGIVRNVVGHCFSSKGGGYLFFGD
Sbjct: 181 LGCGYDQNPGSSSYHPMDGVLGLGKGAVSIVSQLHNQGIVRNVVGHCFSSKGGGYLFFGD 240

Query: 241 GIYDPYRLVWTPMSRDYPKHYSPGFGELIFNGRSTGLRNLFVVFDSGSSYTYFNAQAYQV 300
            IYDPYRL WTPMSRDYPKHYSPGFG+L FNGRSTGLRNLFVVFDSGSSYTYFNAQAYQ+
Sbjct: 241 DIYDPYRLAWTPMSRDYPKHYSPGFGDLFFNGRSTGLRNLFVVFDSGSSYTYFNAQAYQI 300

Query: 301 LTSLLNRELTGKPLREAMDDDTLPLCWRGRKPFKSLRDVRKYFKPLALSFSSGGRTKAVF 360
           +TSLLNRELTGKPLREA DDDTLPLCWRGR PFKSLRDVRKYFKPLALSFSSG R+KAVF
Sbjct: 301 ITSLLNRELTGKPLREAKDDDTLPLCWRGRNPFKSLRDVRKYFKPLALSFSSGRRSKAVF 360

Query: 361 EIPMEGYLIISSMGNVCLGILNGTDVGLENSNIIGDISMQDKMVVYNNEKQAIGWATANC 420
           E+P E YLIISS GNVCLGILNG++VGLENSNIIGDISMQDKMVVYNNEKQAIGWATANC
Sbjct: 361 EMPTESYLIISSKGNVCLGILNGSEVGLENSNIIGDISMQDKMVVYNNEKQAIGWATANC 420

Query: 421 DRVPKSRVGSM 430
           DRVPKS VGS+
Sbjct: 421 DRVPKSSVGSL 431

BLAST of Lsi10G007850 vs. ExPASy TrEMBL
Match: A0A6J1G7F2 (aspartic proteinase Asp1-like OS=Cucurbita moschata OX=3662 GN=LOC111451545 PE=4 SV=1)

HSP 1 Score: 822.4 bits (2123), Expect = 8.8e-235
Identity = 391/431 (90.72%), Postives = 409/431 (94.90%), Query Frame = 0

Query: 1   MGKGVLMILVLMVASMSCLALCSASSFFKDKPWERRRPILSVP--TASSSFASSSIVLPL 60
           MGKGVLMILV MV S+SCLA CSASSFFKDK WERRRP LSVP  +ASSS  SSSIVLPL
Sbjct: 8   MGKGVLMILVPMVFSISCLAPCSASSFFKDKLWERRRPTLSVPIASASSSIPSSSIVLPL 67

Query: 61  QGNVFPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLV 120
           QGNVFPNGFYNVTL++GQPPKPYFLDPDTGSDLTWLQCDAPCQQCTET HPLYQPSNDLV
Sbjct: 68  QGNVFPNGFYNVTLFIGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETPHPLYQPSNDLV 127

Query: 121 PCKDPLCMSLHSSMDHKCENPGQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLA 180
           PCKDPLCMSLHSS+DH+CENP QCDYEVEYADGGSSLGVLVRD+FPLNLTNGDPIRPRL 
Sbjct: 128 PCKDPLCMSLHSSIDHRCENPDQCDYEVEYADGGSSLGVLVRDIFPLNLTNGDPIRPRLT 187

Query: 181 LGCGYDQDPGSSSYHPMDGILGLGRGAVSMVSQLHNQGIVRNVVGHCFSSKGGGYLFFGD 240
           LGCGYDQDPGSSSYHPMDG+LGLG+GAVS+VSQLHNQGIVRNVVGHCFSSKGGGYLFFGD
Sbjct: 188 LGCGYDQDPGSSSYHPMDGVLGLGKGAVSIVSQLHNQGIVRNVVGHCFSSKGGGYLFFGD 247

Query: 241 GIYDPYRLVWTPMSRDYPKHYSPGFGELIFNGRSTGLRNLFVVFDSGSSYTYFNAQAYQV 300
            IYDPYRL WTPMSRDYPKHYSPGFG+L FNGRSTGLRNLFVVFDSGSSYTYFNAQAYQ+
Sbjct: 248 DIYDPYRLAWTPMSRDYPKHYSPGFGDLFFNGRSTGLRNLFVVFDSGSSYTYFNAQAYQI 307

Query: 301 LTSLLNRELTGKPLREAMDDDTLPLCWRGRKPFKSLRDVRKYFKPLALSFSSGGRTKAVF 360
           +TSLLNRELTGKPLREA DDDTLPLCWRGR PFKSLRDVRKYFKPLALSFSSG R+KAVF
Sbjct: 308 ITSLLNRELTGKPLREAKDDDTLPLCWRGRNPFKSLRDVRKYFKPLALSFSSGRRSKAVF 367

Query: 361 EIPMEGYLIISSMGNVCLGILNGTDVGLENSNIIGDISMQDKMVVYNNEKQAIGWATANC 420
           E+PME YLIISS GNVCLGILNG++VGLENSNIIGDISMQDKMVVYNNEKQAIGWATA C
Sbjct: 368 EMPMESYLIISSKGNVCLGILNGSEVGLENSNIIGDISMQDKMVVYNNEKQAIGWATAIC 427

Query: 421 DRVPKSRVGSM 430
           DRVPKS VGS+
Sbjct: 428 DRVPKSSVGSL 438

BLAST of Lsi10G007850 vs. NCBI nr
Match: XP_038900559.1 (aspartic proteinase Asp1 isoform X2 [Benincasa hispida])

HSP 1 Score: 872.8 bits (2254), Expect = 1.2e-249
Identity = 416/429 (96.97%), Postives = 422/429 (98.37%), Query Frame = 0

Query: 1   MGKGVLMILVLMVASMSCLALCSASSFFKDKPWERRRPILSVPTASSSFASSSIVLPLQG 60
           M K VLMILVLMVASMSCLA CSASSFFKDKPWERRRPILSVP ASSSFASSSIV+PLQG
Sbjct: 1   MEKRVLMILVLMVASMSCLAPCSASSFFKDKPWERRRPILSVPIASSSFASSSIVMPLQG 60

Query: 61  NVFPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPC 120
           NV+PNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPC
Sbjct: 61  NVYPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPC 120

Query: 121 KDPLCMSLHSSMDHKCENPGQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALG 180
           KDPLCMSLHSSMDH+CENP QCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALG
Sbjct: 121 KDPLCMSLHSSMDHRCENPDQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALG 180

Query: 181 CGYDQDPGSSSYHPMDGILGLGRGAVSMVSQLHNQGIVRNVVGHCFSSKGGGYLFFGDGI 240
           CGYDQDPGSSSYHPMDG+LGLGRGAVSMVSQLHNQGIVRNVVGHCFSSKGGGYLFFGDGI
Sbjct: 181 CGYDQDPGSSSYHPMDGVLGLGRGAVSMVSQLHNQGIVRNVVGHCFSSKGGGYLFFGDGI 240

Query: 241 YDPYRLVWTPMSRDYPKHYSPGFGELIFNGRSTGLRNLFVVFDSGSSYTYFNAQAYQVLT 300
           YDPYR+VWTPMSRDYPKHYSPGFGELIFNGRSTGLRNLFVVFDSGSSYTYFNAQAYQVLT
Sbjct: 241 YDPYRIVWTPMSRDYPKHYSPGFGELIFNGRSTGLRNLFVVFDSGSSYTYFNAQAYQVLT 300

Query: 301 SLLNRELTGKPLREAMDDDTLPLCWRGRKPFKSLRDVRKYFKPLALSFSSGGRTKAVFEI 360
           SLLNREL GKPLREAMDDDTLPLCWRGRKPFKSLRDVRKYFKPLALSFSSGGR+KAVFEI
Sbjct: 301 SLLNRELAGKPLREAMDDDTLPLCWRGRKPFKSLRDVRKYFKPLALSFSSGGRSKAVFEI 360

Query: 361 PMEGYLIISSMGNVCLGILNGTDVGLENSNIIGDISMQDKMVVYNNEKQAIGWATANCDR 420
           PMEGYLIISSMGN CLGILNGTDVGLENSNIIGDISMQDKMVVYNNEKQAIGWATANCDR
Sbjct: 361 PMEGYLIISSMGNACLGILNGTDVGLENSNIIGDISMQDKMVVYNNEKQAIGWATANCDR 420

Query: 421 VPKSRVGSM 430
           VPKSRVGSM
Sbjct: 421 VPKSRVGSM 429

BLAST of Lsi10G007850 vs. NCBI nr
Match: XP_004147327.2 (aspartic proteinase Asp1 isoform X1 [Cucumis sativus] >KAE8651999.1 hypothetical protein Csa_016941 [Cucumis sativus])

HSP 1 Score: 864.4 bits (2232), Expect = 4.2e-247
Identity = 411/428 (96.03%), Postives = 421/428 (98.36%), Query Frame = 0

Query: 1   MGKGVLMILVLMVASMSCLALCSASSFFKDKPWERRRPILSVPTASSSFASSSIVLPLQG 60
           MGK VL++LVLMVASMSCLA CSASSFFKDKPWER+RPILSVPTASSSFASSSIVLPLQG
Sbjct: 1   MGKRVLVVLVLMVASMSCLAPCSASSFFKDKPWERKRPILSVPTASSSFASSSIVLPLQG 60

Query: 61  NVFPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPC 120
           NV+PNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPC
Sbjct: 61  NVYPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPC 120

Query: 121 KDPLCMSLHSSMDHKCENPGQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALG 180
           KDPLCMSLHSSMDH+CENP QCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALG
Sbjct: 121 KDPLCMSLHSSMDHRCENPDQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALG 180

Query: 181 CGYDQDPGSSSYHPMDGILGLGRGAVSMVSQLHNQGIVRNVVGHCFSSKGGGYLFFGDGI 240
           CGYDQDPGSSSYHPMDGILGLGRGAVS+VSQLHNQGIVRNVVGHCF+SKGGGYLFFGDGI
Sbjct: 181 CGYDQDPGSSSYHPMDGILGLGRGAVSIVSQLHNQGIVRNVVGHCFNSKGGGYLFFGDGI 240

Query: 241 YDPYRLVWTPMSRDYPKHYSPGFGELIFNGRSTGLRNLFVVFDSGSSYTYFNAQAYQVLT 300
           YDPYRLVWTPMSRDYPKHYSPGFGELIFNGRSTGLRNLFVVFDSGSSYTYFNAQAYQVLT
Sbjct: 241 YDPYRLVWTPMSRDYPKHYSPGFGELIFNGRSTGLRNLFVVFDSGSSYTYFNAQAYQVLT 300

Query: 301 SLLNRELTGKPLREAMDDDTLPLCWRGRKPFKSLRDVRKYFKPLALSFSSGGRTKAVFEI 360
           SLLNREL GKPLREAMDDDTLPLCWRGRKP KSLRDVRKYFKPLALSFSSGGR+KAVFEI
Sbjct: 301 SLLNRELAGKPLREAMDDDTLPLCWRGRKPIKSLRDVRKYFKPLALSFSSGGRSKAVFEI 360

Query: 361 PMEGYLIISSMGNVCLGILNGTDVGLENSNIIGDISMQDKMVVYNNEKQAIGWATANCDR 420
           P EGY+IISSMGNVCLGILNGTDVGLENSNIIGDISMQDKMVVYNNEKQAIGWATANCDR
Sbjct: 361 PTEGYMIISSMGNVCLGILNGTDVGLENSNIIGDISMQDKMVVYNNEKQAIGWATANCDR 420

Query: 421 VPKSRVGS 429
           VPKS+V S
Sbjct: 421 VPKSQVSS 428

BLAST of Lsi10G007850 vs. NCBI nr
Match: XP_008460823.1 (PREDICTED: aspartic proteinase Asp1 isoform X2 [Cucumis melo])

HSP 1 Score: 859.4 bits (2219), Expect = 1.3e-245
Identity = 408/428 (95.33%), Postives = 420/428 (98.13%), Query Frame = 0

Query: 1   MGKGVLMILVLMVASMSCLALCSASSFFKDKPWERRRPILSVPTASSSFASSSIVLPLQG 60
           MGK VL++L LMVASMSCLA CSASSFFKDKPWER+RPILSVPTASSSFASSSIVLPLQG
Sbjct: 1   MGKWVLVVLALMVASMSCLAPCSASSFFKDKPWERKRPILSVPTASSSFASSSIVLPLQG 60

Query: 61  NVFPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPC 120
           NV+PNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPC
Sbjct: 61  NVYPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPC 120

Query: 121 KDPLCMSLHSSMDHKCENPGQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALG 180
           KDPLCMSLHSSMDH+CENP QCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALG
Sbjct: 121 KDPLCMSLHSSMDHRCENPDQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALG 180

Query: 181 CGYDQDPGSSSYHPMDGILGLGRGAVSMVSQLHNQGIVRNVVGHCFSSKGGGYLFFGDGI 240
           CGYDQDPGSSSYHPMDGILGLGRGAVS+VSQLHNQGIVRNVVGHCF+SKGGGYLFFGDGI
Sbjct: 181 CGYDQDPGSSSYHPMDGILGLGRGAVSIVSQLHNQGIVRNVVGHCFNSKGGGYLFFGDGI 240

Query: 241 YDPYRLVWTPMSRDYPKHYSPGFGELIFNGRSTGLRNLFVVFDSGSSYTYFNAQAYQVLT 300
           YDPYRLVWTPMSRDYPKHYSPGFGEL+FNGRSTGLRNLFVVFDSGSSYTYFNAQAYQVLT
Sbjct: 241 YDPYRLVWTPMSRDYPKHYSPGFGELMFNGRSTGLRNLFVVFDSGSSYTYFNAQAYQVLT 300

Query: 301 SLLNRELTGKPLREAMDDDTLPLCWRGRKPFKSLRDVRKYFKPLALSFSSGGRTKAVFEI 360
           SLLNREL GKPLREAMDDDTLPLCWR RKP KSLRDVRKYFKPLALSFSSGGR+KAVFEI
Sbjct: 301 SLLNRELAGKPLREAMDDDTLPLCWRERKPIKSLRDVRKYFKPLALSFSSGGRSKAVFEI 360

Query: 361 PMEGYLIISSMGNVCLGILNGTDVGLENSNIIGDISMQDKMVVYNNEKQAIGWATANCDR 420
           P+EGY+IISSMGNVCLGILNGTDVGLENSNIIGDISMQDKMVVYNNEKQAIGWATANCDR
Sbjct: 361 PIEGYMIISSMGNVCLGILNGTDVGLENSNIIGDISMQDKMVVYNNEKQAIGWATANCDR 420

Query: 421 VPKSRVGS 429
           VPKS+V S
Sbjct: 421 VPKSQVSS 428

BLAST of Lsi10G007850 vs. NCBI nr
Match: TYK02025.1 (aspartic proteinase Asp1 isoform X2 [Cucumis melo var. makuwa])

HSP 1 Score: 859.4 bits (2219), Expect = 1.3e-245
Identity = 408/428 (95.33%), Postives = 420/428 (98.13%), Query Frame = 0

Query: 1   MGKGVLMILVLMVASMSCLALCSASSFFKDKPWERRRPILSVPTASSSFASSSIVLPLQG 60
           MGK VL++L LMVASMSCLA CSASSFFKDKPWER+RPILSVPTASSSFASSSIVLPLQG
Sbjct: 42  MGKWVLVVLALMVASMSCLAPCSASSFFKDKPWERKRPILSVPTASSSFASSSIVLPLQG 101

Query: 61  NVFPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPC 120
           NV+PNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPC
Sbjct: 102 NVYPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPC 161

Query: 121 KDPLCMSLHSSMDHKCENPGQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALG 180
           KDPLCMSLHSSMDH+CENP QCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALG
Sbjct: 162 KDPLCMSLHSSMDHRCENPDQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALG 221

Query: 181 CGYDQDPGSSSYHPMDGILGLGRGAVSMVSQLHNQGIVRNVVGHCFSSKGGGYLFFGDGI 240
           CGYDQDPGSSSYHPMDGILGLGRGAVS+VSQLHNQGIVRNVVGHCF+SKGGGYLFFGDGI
Sbjct: 222 CGYDQDPGSSSYHPMDGILGLGRGAVSIVSQLHNQGIVRNVVGHCFNSKGGGYLFFGDGI 281

Query: 241 YDPYRLVWTPMSRDYPKHYSPGFGELIFNGRSTGLRNLFVVFDSGSSYTYFNAQAYQVLT 300
           YDPYRLVWTPMSRDYPKHYSPGFGEL+FNGRSTGLRNLFVVFDSGSSYTYFNAQAYQVLT
Sbjct: 282 YDPYRLVWTPMSRDYPKHYSPGFGELMFNGRSTGLRNLFVVFDSGSSYTYFNAQAYQVLT 341

Query: 301 SLLNRELTGKPLREAMDDDTLPLCWRGRKPFKSLRDVRKYFKPLALSFSSGGRTKAVFEI 360
           SLLNREL GKPLREAMDDDTLPLCWR RKP KSLRDVRKYFKPLALSFSSGGR+KAVFEI
Sbjct: 342 SLLNRELAGKPLREAMDDDTLPLCWRERKPIKSLRDVRKYFKPLALSFSSGGRSKAVFEI 401

Query: 361 PMEGYLIISSMGNVCLGILNGTDVGLENSNIIGDISMQDKMVVYNNEKQAIGWATANCDR 420
           P+EGY+IISSMGNVCLGILNGTDVGLENSNIIGDISMQDKMVVYNNEKQAIGWATANCDR
Sbjct: 402 PIEGYMIISSMGNVCLGILNGTDVGLENSNIIGDISMQDKMVVYNNEKQAIGWATANCDR 461

Query: 421 VPKSRVGS 429
           VPKS+V S
Sbjct: 462 VPKSQVSS 469

BLAST of Lsi10G007850 vs. NCBI nr
Match: XP_038900558.1 (aspartic proteinase Asp1 isoform X1 [Benincasa hispida])

HSP 1 Score: 855.1 bits (2208), Expect = 2.5e-244
Identity = 416/464 (89.66%), Postives = 422/464 (90.95%), Query Frame = 0

Query: 1   MGKGVLMILVLMVASMSCLALCSASSFFKDKPWERRRPILSVPTASSSFASSSIVLPLQG 60
           M K VLMILVLMVASMSCLA CSASSFFKDKPWERRRPILSVP ASSSFASSSIV+PLQG
Sbjct: 1   MEKRVLMILVLMVASMSCLAPCSASSFFKDKPWERRRPILSVPIASSSFASSSIVMPLQG 60

Query: 61  NVFPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPC 120
           NV+PNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPC
Sbjct: 61  NVYPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPC 120

Query: 121 KDPLCMSLHSSMDHKCENPGQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALG 180
           KDPLCMSLHSSMDH+CENP QCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALG
Sbjct: 121 KDPLCMSLHSSMDHRCENPDQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALG 180

Query: 181 CGYDQDPGSSSYHPMDGILGLGRGAVSMVSQLHNQGIVRNVVGHCFSSKGGGYLFFGDGI 240
           CGYDQDPGSSSYHPMDG+LGLGRGAVSMVSQLHNQGIVRNVVGHCFSSKGGGYLFFGDGI
Sbjct: 181 CGYDQDPGSSSYHPMDGVLGLGRGAVSMVSQLHNQGIVRNVVGHCFSSKGGGYLFFGDGI 240

Query: 241 YDPYRLVWTPMSRDYPKHYSPGFGELIFNGRSTGLRNLFVVFDSGSSYTYFNAQAYQVLT 300
           YDPYR+VWTPMSRDYPKHYSPGFGELIFNGRSTGLRNLFVVFDSGSSYTYFNAQAYQVLT
Sbjct: 241 YDPYRIVWTPMSRDYPKHYSPGFGELIFNGRSTGLRNLFVVFDSGSSYTYFNAQAYQVLT 300

Query: 301 SLLNRELTGKPLREAMDDDTLPLCWRGRKPFKSLRDVRKYFKPLALSFSSGGRTKAVFEI 360
           SLLNREL GKPLREAMDDDTLPLCWRGRKPFKSLRDVRKYFKPLALSFSSGGR+KAVFEI
Sbjct: 301 SLLNRELAGKPLREAMDDDTLPLCWRGRKPFKSLRDVRKYFKPLALSFSSGGRSKAVFEI 360

Query: 361 PMEGYLIIS-----------------------------------SMGNVCLGILNGTDVG 420
           PMEGYLIIS                                   SMGN CLGILNGTDVG
Sbjct: 361 PMEGYLIISVKAPCLQFHLLFTEHFFLWILNLNDFTYKLISEWQSMGNACLGILNGTDVG 420

Query: 421 LENSNIIGDISMQDKMVVYNNEKQAIGWATANCDRVPKSRVGSM 430
           LENSNIIGDISMQDKMVVYNNEKQAIGWATANCDRVPKSRVGSM
Sbjct: 421 LENSNIIGDISMQDKMVVYNNEKQAIGWATANCDRVPKSRVGSM 464

BLAST of Lsi10G007850 vs. TAIR 10
Match: AT4G33490.2 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 573.9 bits (1478), Expect = 1.1e-163
Identity = 270/418 (64.59%), Postives = 330/418 (78.95%), Query Frame = 0

Query: 5   VLMILVLMVASMSCLALCSASSFFKDKPWERRRPILSVPTASSSFASSSIVLPLQGNVFP 64
           V  ++VLMV S+  L   SA  F     W +        T     A SS+V P+ GNV+P
Sbjct: 6   VRFMIVLMVMSL-VLGFSSAVDF----RWRKTAGFSDRFTR----AVSSVVFPVHGNVYP 65

Query: 65  NGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPCKDPL 124
            G+YNVT+ +GQPP+PY+LD DTGSDLTWLQCDAPC +C E  HPLYQPS+DL+PC DPL
Sbjct: 66  LGYYNVTINIGQPPRPYYLDLDTGSDLTWLQCDAPCVRCLEAPHPLYQPSSDLIPCNDPL 125

Query: 125 CMSLHSSMDHKCENPGQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALGCGYD 184
           C +LH + + +CE P QCDYEVEYADGGSSLGVLVRDVF +N T G  + PRLALGCGYD
Sbjct: 126 CKALHLNSNQRCETPEQCDYEVEYADGGSSLGVLVRDVFSMNYTQGLRLTPRLALGCGYD 185

Query: 185 QDPGSSSYHPMDGILGLGRGAVSMVSQLHNQGIVRNVVGHCFSSKGGGYLFFGDGIYDPY 244
           Q PG+SS+HP+DG+LGLGRG VS++SQLH+QG V+NV+GHC SS GGG LFFGD +YD  
Sbjct: 186 QIPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNVIGHCLSSLGGGILFFGDDLYDSS 245

Query: 245 RLVWTPMSRDYPKHYSPGF-GELIFNGRSTGLRNLFVVFDSGSSYTYFNAQAYQVLTSLL 304
           R+ WTPMSR+Y KHYSP   GEL+F GR+TGL+NL  VFDSGSSYTYFN++AYQ +T LL
Sbjct: 246 RVSWTPMSREYSKHYSPAMGGELLFGGRTTGLKNLLTVFDSGSSYTYFNSKAYQAVTYLL 305

Query: 305 NRELTGKPLREAMDDDTLPLCWRGRKPFKSLRDVRKYFKPLALSFSSGGRTKAVFEIPME 364
            REL+GKPL+EA DD TLPLCW+GR+PF S+ +V+KYFKPLALSF +G R+K +FEIP E
Sbjct: 306 KRELSGKPLKEARDDHTLPLCWQGRRPFMSIEEVKKYFKPLALSFKTGWRSKTLFEIPPE 365

Query: 365 GYLIISSMGNVCLGILNGTDVGLENSNIIGDISMQDKMVVYNNEKQAIGWATANCDRV 422
            YLIIS  GNVCLGILNGT++GL+N N+IGDISMQD+M++Y+NEKQ+IGW   +CD +
Sbjct: 366 AYLIISMKGNVCLGILNGTEIGLQNLNLIGDISMQDQMIIYDNEKQSIGWMPVDCDEL 414

BLAST of Lsi10G007850 vs. TAIR 10
Match: AT4G33490.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 531.9 bits (1369), Expect = 4.6e-151
Identity = 253/390 (64.87%), Postives = 306/390 (78.46%), Query Frame = 0

Query: 5   VLMILVLMVASMSCLALCSASSFFKDKPWERRRPILSVPTASSSFASSSIVLPLQGNVFP 64
           V  ++VLMV S+  L   SA  F     W +        T     A SS+V P+ GNV+P
Sbjct: 3   VRFMIVLMVMSL-VLGFSSAVDF----RWRKTAGFSDRFTR----AVSSVVFPVHGNVYP 62

Query: 65  NGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPCKDPL 124
            G+YNVT+ +GQPP+PY+LD DTGSDLTWLQCDAPC +C E  HPLYQPS+DL+PC DPL
Sbjct: 63  LGYYNVTINIGQPPRPYYLDLDTGSDLTWLQCDAPCVRCLEAPHPLYQPSSDLIPCNDPL 122

Query: 125 CMSLHSSMDHKCENPGQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALGCGYD 184
           C +LH + + +CE P QCDYEVEYADGGSSLGVLVRDVF +N T G  + PRLALGCGYD
Sbjct: 123 CKALHLNSNQRCETPEQCDYEVEYADGGSSLGVLVRDVFSMNYTQGLRLTPRLALGCGYD 182

Query: 185 QDPGSSSYHPMDGILGLGRGAVSMVSQLHNQGIVRNVVGHCFSSKGGGYLFFGDGIYDPY 244
           Q PG+SS+HP+DG+LGLGRG VS++SQLH+QG V+NV+GHC SS GGG LFFGD +YD  
Sbjct: 183 QIPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNVIGHCLSSLGGGILFFGDDLYDSS 242

Query: 245 RLVWTPMSRDYPKHYSPGF-GELIFNGRSTGLRNLFVVFDSGSSYTYFNAQAYQVLTSLL 304
           R+ WTPMSR+Y KHYSP   GEL+F GR+TGL+NL  VFDSGSSYTYFN++AYQ +T LL
Sbjct: 243 RVSWTPMSREYSKHYSPAMGGELLFGGRTTGLKNLLTVFDSGSSYTYFNSKAYQAVTYLL 302

Query: 305 NRELTGKPLREAMDDDTLPLCWRGRKPFKSLRDVRKYFKPLALSFSSGGRTKAVFEIPME 364
            REL+GKPL+EA DD TLPLCW+GR+PF S+ +V+KYFKPLALSF +G R+K +FEIP E
Sbjct: 303 KRELSGKPLKEARDDHTLPLCWQGRRPFMSIEEVKKYFKPLALSFKTGWRSKTLFEIPPE 362

Query: 365 GYLIISSMGNVCLGILNGTDVGLENSNIIG 394
            YLIIS  GNVCLGILNGT++GL+N N+IG
Sbjct: 363 AYLIISMKGNVCLGILNGTEIGLQNLNLIG 383

BLAST of Lsi10G007850 vs. TAIR 10
Match: AT1G44130.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 430.6 bits (1106), Expect = 1.4e-120
Identity = 200/396 (50.51%), Postives = 283/396 (71.46%), Query Frame = 0

Query: 39  ILSVPTASSSF-------ASSSIVLPLQGNVFPNGFYNVTLYVGQPPKPYFLDPDTGSDL 98
           ++ VP + SS        + SS+V PL GNVFP G+Y+V + +G PPK +  D DTGSDL
Sbjct: 13  LVIVPLSKSSIFKTFIKSSPSSVVFPLSGNVFPLGYYSVLMQIGSPPKAFQFDIDTGSDL 72

Query: 99  TWLQCDAPCQQCTETLHPLYQPSNDLVPCKDPLCMSLHSSMDHKCENP-GQCDYEVEYAD 158
           TW+QCDAPC  CT   +  Y+P  +++PC +P+C +LH      C NP  QCDYEV+YAD
Sbjct: 73  TWVQCDAPCSGCTLPPNLQYKPKGNIIPCSNPICTALHWPNKPHCPNPQEQCDYEVKYAD 132

Query: 159 GGSSLGVLVRDVFPLNLTNGDPIRPRLALGCGYDQDPGSSSYHPMD-GILGLGRGAVSMV 218
            GSS+G LV D FPL L NG  ++P +A GCGYDQ   S+   P   G+LGLGRG + ++
Sbjct: 133 QGSSMGALVTDQFPLKLVNGSFMQPPVAFGCGYDQSYPSAHPPPATAGVLGLGRGKIGLL 192

Query: 219 SQLHNQGIVRNVVGHCFSSKGGGYLFFGDGIYDPYRLVWTP-MSRDYPKHYSPGFGELIF 278
           +QL + G+ RNVVGHC SSKGGG+LFFGD +     + WTP +S+D   HY+ G  +L+F
Sbjct: 193 TQLVSAGLTRNVVGHCLSSKGGGFLFFGDNLVPSIGVAWTPLLSQD--NHYTTGPADLLF 252

Query: 279 NGRSTGLRNLFVVFDSGSSYTYFNAQAYQVLTSLLNRELTGKPLREAMDDDTLPLCWRGR 338
           NG+ TGL+ L ++FD+GSSYTYFN++AYQ + +L+  +L   PL+ A +D TLP+CW+G 
Sbjct: 253 NGKPTGLKGLKLIFDTGSSYTYFNSKAYQTIINLIGNDLKVSPLKVAKEDKTLPICWKGA 312

Query: 339 KPFKSLRDVRKYFKPLALSFSSGGRTKAVFEIPMEGYLIISSMGNVCLGILNGTDVGLEN 398
           KPFKS+ +V+ +FK + ++F++G R   ++  P E YLI+S  GNVCLG+LNG++VGL+N
Sbjct: 313 KPFKSVLEVKNFFKTITINFTNGRRNTQLYLAP-ELYLIVSKTGNVCLGLLNGSEVGLQN 372

Query: 399 SNIIGDISMQDKMVVYNNEKQAIGWATANCDRVPKS 425
           SN+IGDISMQ  M++Y+NEKQ +GW +++C+++PK+
Sbjct: 373 SNVIGDISMQGLMMIYDNEKQQLGWVSSDCNKLPKT 405

BLAST of Lsi10G007850 vs. TAIR 10
Match: AT1G77480.2 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 407.9 bits (1047), Expect = 1.0e-113
Identity = 193/377 (51.19%), Postives = 259/377 (68.70%), Query Frame = 0

Query: 51  SSSIVLPLQGNVFPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPL 110
           SS++V P+ GNV+P G+Y V L +G PPK + LD DTGSDLTW+QCDAPC  CT+     
Sbjct: 50  SSTVVFPVSGNVYPLGYYYVLLNIGNPPKLFDLDIDTGSDLTWVQCDAPCNGCTKPRAKQ 109

Query: 111 YQPSNDLVPCKDPLCMSLHSSMDHKCENP-GQCDYEVEYADGGSSLGVLVRDVFPLNLTN 170
           Y+P+++ +PC   LC  L    D  C +P  QCDYE+ Y+D  SS+G LV D  PL L N
Sbjct: 110 YKPNHNTLPCSHILCSGLDLPQDRPCADPEDQCDYEIGYSDHASSIGALVTDEVPLKLAN 169

Query: 171 GDPIRPRLALGCGYD-QDPGSSSYHPMDGILGLGRGAVSMVSQLHNQGIVRNVVGHCFSS 230
           G  +  RL  GCGYD Q+PG     P  GILGLGRG V + +QL + GI +NV+ HC S 
Sbjct: 170 GSIMNLRLTFGCGYDQQNPGPHPPPPTAGILGLGRGKVGLSTQLKSLGITKNVIVHCLSH 229

Query: 231 KGGGYLFFGDGIYDPYRLVWTPMSRDYP-KHYSPGFGELIFNGRSTGLRNLFVVFDSGSS 290
            G G+L  GD +     + WT ++ + P K+Y  G  EL+FN ++TG++ + VVFDSGSS
Sbjct: 230 TGKGFLSIGDELVPSSGVTWTSLATNSPSKNYMAGPAELLFNDKTTGVKGINVVFDSGSS 289

Query: 291 YTYFNAQAYQVLTSLLNRELTGKPLREAMDDDTLPLCWRGRKPFKSLRDVRKYFKPLALS 350
           YTYFNA+AYQ +  L+ ++L GKPL +  DD +LP+CW+G+KP KSL +V+KYFK + L 
Sbjct: 290 YTYFNAEAYQAILDLIRKDLNGKPLTDTKDDKSLPVCWKGKKPLKSLDEVKKYFKTITLR 349

Query: 351 FSSGGRTKAVFEIPMEGYLIISSMGNVCLGILNGTDVGLENSNIIGDISMQDKMVVYNNE 410
           F +  +   +F++P E YLII+  G VCLGILNGT++GLE  NIIGDIS Q  MV+Y+NE
Sbjct: 350 FGN-QKNGQLFQVPPESYLIITEKGRVCLGILNGTEIGLEGYNIIGDISFQGIMVIYDNE 409

Query: 411 KQAIGWATANCDRVPKS 425
           KQ IGW +++CD++PKS
Sbjct: 410 KQRIGWISSDCDKLPKS 425

BLAST of Lsi10G007850 vs. TAIR 10
Match: AT1G77480.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 404.4 bits (1038), Expect = 1.1e-112
Identity = 191/375 (50.93%), Postives = 257/375 (68.53%), Query Frame = 0

Query: 51  SSSIVLPLQGNVFPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPL 110
           SS++V P+ GNV+P G+Y V L +G PPK + LD DTGSDLTW+QCDAPC  CT+     
Sbjct: 50  SSTVVFPVSGNVYPLGYYYVLLNIGNPPKLFDLDIDTGSDLTWVQCDAPCNGCTKPRAKQ 109

Query: 111 YQPSNDLVPCKDPLCMSLHSSMDHKCENP-GQCDYEVEYADGGSSLGVLVRDVFPLNLTN 170
           Y+P+++ +PC   LC  L    D  C +P  QCDYE+ Y+D  SS+G LV D  PL L N
Sbjct: 110 YKPNHNTLPCSHILCSGLDLPQDRPCADPEDQCDYEIGYSDHASSIGALVTDEVPLKLAN 169

Query: 171 GDPIRPRLALGCGYD-QDPGSSSYHPMDGILGLGRGAVSMVSQLHNQGIVRNVVGHCFSS 230
           G  +  RL  GCGYD Q+PG     P  GILGLGRG V + +QL + GI +NV+ HC S 
Sbjct: 170 GSIMNLRLTFGCGYDQQNPGPHPPPPTAGILGLGRGKVGLSTQLKSLGITKNVIVHCLSH 229

Query: 231 KGGGYLFFGDGIYDPYRLVWTPMSRDYP-KHYSPGFGELIFNGRSTGLRNLFVVFDSGSS 290
            G G+L  GD +     + WT ++ + P K+Y  G  EL+FN ++TG++ + VVFDSGSS
Sbjct: 230 TGKGFLSIGDELVPSSGVTWTSLATNSPSKNYMAGPAELLFNDKTTGVKGINVVFDSGSS 289

Query: 291 YTYFNAQAYQVLTSLLNRELTGKPLREAMDDDTLPLCWRGRKPFKSLRDVRKYFKPLALS 350
           YTYFNA+AYQ +  L+ ++L GKPL +  DD +LP+CW+G+KP KSL +V+KYFK + L 
Sbjct: 290 YTYFNAEAYQAILDLIRKDLNGKPLTDTKDDKSLPVCWKGKKPLKSLDEVKKYFKTITLR 349

Query: 351 FSSGGRTKAVFEIPMEGYLIISSMGNVCLGILNGTDVGLENSNIIGDISMQDKMVVYNNE 410
           F +  +   +F++P E YLII+  G VCLGILNGT++GLE  NIIGDIS Q  MV+Y+NE
Sbjct: 350 FGN-QKNGQLFQVPPESYLIITEKGRVCLGILNGTEIGLEGYNIIGDISFQGIMVIYDNE 409

Query: 411 KQAIGWATANCDRVP 423
           KQ IGW +++CD++P
Sbjct: 410 KQRIGWISSDCDKLP 423

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q0IU524.7e-9246.37Aspartic proteinase Asp1 OS=Oryza sativa subsp. japonica OX=39947 GN=ASP1 PE=2 S... [more]
A2ZC679.2e-8845.08Aspartic proteinase Asp1 OS=Oryza sativa subsp. indica OX=39946 GN=ASP1 PE=2 SV=... [more]
Q9M9A81.4e-8042.93Aspartyl protease APCB1 OS=Arabidopsis thaliana OX=3702 GN=APCB1 PE=1 SV=1[more]
Q9S9K41.3e-3328.13Aspartic proteinase 39 OS=Arabidopsis thaliana OX=3702 GN=A39 PE=1 SV=2[more]
Q4V3D24.8e-2826.30Aspartic proteinase 36 OS=Arabidopsis thaliana OX=3702 GN=A36 PE=1 SV=1[more]
Match NameE-valueIdentityDescription
A0A5D3BS696.5e-24695.33Aspartic proteinase Asp1 isoform X2 OS=Cucumis melo var. makuwa OX=1194695 GN=E5... [more]
A0A1S3CDB26.5e-24695.33aspartic proteinase Asp1 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103499584 PE=4... [more]
A0A1S3CDB43.6e-24494.44aspartic proteinase Asp1 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103499584 PE=4... [more]
A0A6J1I5901.0e-23590.72aspartic proteinase Asp1-like OS=Cucurbita maxima OX=3661 GN=LOC111469733 PE=4 S... [more]
A0A6J1G7F28.8e-23590.72aspartic proteinase Asp1-like OS=Cucurbita moschata OX=3662 GN=LOC111451545 PE=4... [more]
Match NameE-valueIdentityDescription
XP_038900559.11.2e-24996.97aspartic proteinase Asp1 isoform X2 [Benincasa hispida][more]
XP_004147327.24.2e-24796.03aspartic proteinase Asp1 isoform X1 [Cucumis sativus] >KAE8651999.1 hypothetical... [more]
XP_008460823.11.3e-24595.33PREDICTED: aspartic proteinase Asp1 isoform X2 [Cucumis melo][more]
TYK02025.11.3e-24595.33aspartic proteinase Asp1 isoform X2 [Cucumis melo var. makuwa][more]
XP_038900558.12.5e-24489.66aspartic proteinase Asp1 isoform X1 [Benincasa hispida][more]
Match NameE-valueIdentityDescription
AT4G33490.21.1e-16364.59Eukaryotic aspartyl protease family protein [more]
AT4G33490.14.6e-15164.87Eukaryotic aspartyl protease family protein [more]
AT1G44130.11.4e-12050.51Eukaryotic aspartyl protease family protein [more]
AT1G77480.21.0e-11351.19Eukaryotic aspartyl protease family protein [more]
AT1G77480.11.1e-11250.93Eukaryotic aspartyl protease family protein [more]
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (USVL1VR-Ls) v1
Date Performed: 2021-10-18
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR021109Aspartic peptidase domain superfamilyGENE3D2.40.70.10Acid Proteasescoord: 47..237
e-value: 6.0E-45
score: 155.5
IPR021109Aspartic peptidase domain superfamilyGENE3D2.40.70.10Acid Proteasescoord: 242..422
e-value: 2.7E-27
score: 97.3
IPR021109Aspartic peptidase domain superfamilySUPERFAMILY50630Acid proteasescoord: 60..421
IPR032799Xylanase inhibitor, C-terminalPFAMPF14541TAXi_Ccoord: 280..414
e-value: 7.5E-15
score: 55.1
IPR032861Xylanase inhibitor, N-terminalPFAMPF14543TAXi_Ncoord: 68..238
e-value: 5.1E-48
score: 163.6
NoneNo IPR availablePANTHERPTHR13683:SF800EUKARYOTIC ASPARTYL PROTEASE FAMILY PROTEINcoord: 8..423
NoneNo IPR availablePROSITEPS51257PROKAR_LIPOPROTEINcoord: 1..18
score: 5.0
IPR001461Aspartic peptidase A1 familyPANTHERPTHR13683ASPARTYL PROTEASEScoord: 8..423
IPR033121Peptidase family A1 domainPROSITEPS51767PEPTIDASE_A1coord: 68..414
score: 33.848583

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Lsi10G007850.1Lsi10G007850.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006508 proteolysis
biological_process GO:0006355 regulation of transcription, DNA-templated
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0004190 aspartic-type endopeptidase activity
molecular_function GO:0003677 DNA binding