CSPI03G04660 (gene) Cucumber (PI 183967) v1

Overview
Name: CSPI03G04660
Type: gene
Organism: Cucumis sativus var. hardwickii cv. PI 183967 (Cucumber (PI 183967) v1)
Description: Aspartic proteinase
Location: Chr3: 3786067 .. 3791642 (-)
RNA-Seq Expression: CSPI03G04660
Synteny: CSPI03G04660
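
The Location field combines a chromosome, a start and end coordinate, and a strand sign, so the span of the gene can be derived from it. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive (the usual convention for GFF-derived records, but not stated on this page). `parse_location` is a hypothetical helper written for illustration, not part of any database API.

```python
import re

def parse_location(text):
    """Parse a 'Chr3: 3786067 .. 3791642 (-)' style location string.

    Assumption: coordinates are 1-based and inclusive, so the span
    length is end - start + 1.
    """
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "chrom": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        "length": end - start + 1,
    }

loc = parse_location("Chr3: 3786067 .. 3791642 (-)")
print(loc["chrom"], loc["strand"], loc["length"])  # Chr3 - 5576
```

Under that assumption the gene spans 5576 bp on the minus strand of Chr3.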
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

TGCTCTTTCTCATTTCTCTTAAATTTCCCCTTTTTCTTTTGCTTCAATCGTCTCTTTGGTCCGCGATCGAGTTCGGTTTTGCTTTCTTTTTTTATATACATCTGCTGTAGGATTTTTTACTCAGGTTCGAGCTTGGGAAATTGCTTAATTTATTAAGTTATCTTAATGTTCTTTGTTGTCTGATTTCTGATAAGTTGTGGGAAAGTTAGATAGTTGAAGCATTTGAATTTTAGATTCGAATATCTTGGGACTAGAGGAAGCTTGAATCTGTTAATTGTTGAAGTATTGGGAGAGCACTTTATTCTAACTTTGATTTATTTCGATTTTTCTATTTTTTTTCTTGTTGAAATGCATTAGTTGTGGCTAACTGTTGCTTTGCTCCACCACTAATTAAATTTACTTTTTCCAAACCTTAACTTTTAGGTGGATTTTATTTTGTAGAATCGTGCTGTATTTTATTGTCAAATCTTTGGACAGCATATTTAGCTTTGGAATTCACTTGTTTACAATTAATGGGAATCCTGAGGAGTGTGTGTCTATGTTACTTGTATTTGTGTGTGCAAGTCAGTTTGTATGTGAATATTCTGTGCTTTGACCTGAGGCTGCTGAAGGGAAGCATGTAACTTTGGATTAAATATGATTTCCTCTTGGCACTGTTGTTCTTAGAAGTGAAGCAGGTGTCCTCCGTAAACAATATCTCAAAATGAAAAACAAAATAAAACAAATAGAACTAACTTTTATGGGTTTGCTTATGTCTTTAAACAGTCTCGTTTAAGCTTCCAATTCCTTTAGTTTTTCATACAACCAAATTGATTTTTAAACAATTAAAACACATGTTTTCAAGCGATTTTAAAAACAGAATTTTAACTATTTAAAAATTACTCCCAAACATGCTCATTTTCTCACGCTAAAACTGTTTTATCAATATAAAGTACTTCATTTATTTATATTGTGTCTCACAAACAAAGCTTGCAGATTTTTTGTTGCCCCGGCTTGCCTTAACAGACTGTTGCACTAGAAAGTGAGACTGTTGCATTACAGTAAATGAATACTGCTCTGACCTATTGATGATAAAAGAATTACAATGTTGTTTAGAGAGTTCTTGATTGAAATGTGATTTGTGAAACTTTGGTTTGTACATATTAATTACTGTACTCATCTTTTGAATTGCTTAATGAAGTGTTGCCCTTATGACTACCTTTTGATTTTAATTAAGCGTGTTTTCAATTCACAGTGAGCAACATGGGTACGAGACTCAAACTTTTTATAGCCGTTCTCTTCATTTGCTTCTTTATGTTTCCAATGGTCTTTTGTGCATCCAATGATGGGAAGGTTAGAATTGGACTTAAAAGGAGGAAGTTTGGCCAGAACAACCGAGTTGCTAGCAAGATTGCAACCAAGGAAGGAATTTCTTTGAAAAATTCTGTTGAGAAATATCAACCTAGTGCAAATTTGGGAGATTCGGATGATTTTGATATTGTTGGATTGAAGAATTACTTGAATGCTCAGTATTTTGGTGAGATTGGTATTGGTACTCCCCCTCAAAAGTTTACTGTTATTTTTGACACTGGTAGTTCAAATTTGTGGGTTCCATCATCTAAGTGCTTCTCTGTAAGTCCCCATCATCTTAGTAATGAAATGATTTATTTGTCCATTATATATTATAGCATGAAATGATTTTGAGTTGTCTGCAGGTGGCTTGTCTACTCCATTCCAAGTACAAGTCAAAGCGATCGAGTACTTATAAAAAGAATGGTCAGTACTCTTAGTAGCTAATGTTATGGTTGTGAAGTTATCTTCTTATAGATACTTAATCTACTAATTGGTTTGAGCCACTTGTACTTCTTTTTTATATCACCAACGTTAATTTGAGTGGTTTTGGCTATGGGCACTGATCTTATGTTCCCCTAATGTGCAGGTAAATCTGCTTCAATCAAATATGGGACTGGTGCTATTTCTGGCTACTTTAGTGAAGACAATGTTAAAGTTGGCGACCTAAT
AGTTAAAAAACAGGTCTTTGTTTGTGTTTATGGCTGTTCTTTTCATCTATTTTTTTATTGGTCGACCTTTACTATAATATTTTTCATTCCAGGATTTTATCGAGGCAACTAGGGAACCCAGCCTCACGTTTGTACTCGCCCAATTTGATGGCATATTAGGACTTGGCTTTAAAGAGATCTCAGTTGGAGATGCAGTGCCTGTATGGTATGTAGTAGAATCAATACTGCTTTTTGCATGCGTATTTTAACGGAGAGTTAATGTTGATGGTATAGATTACTAAATGTTTAGTAGTACAGGTACAACATGGTTGATCAAAACCTTGTAAAGGAACCTGTTTTCTCTTTTTGGTTTAATCGGAATGCAGATGAGGAACAAGGTGGTGAAATCGTATTTGGTGGTGTGGATCCTGATCACTACAAGGGGGAACATACATATGTTCCAGTTACTAAAAAGGGATATTGGCAGGTTCGTTGTGGTGCTGTTTCGTAAACAAGCTTGCATGTAATATATAGTTGTCTTTGTTAGCAGTTTATTATTAGGCCTTAAACTTTTATATCATTTTCTATGTTGGCAGTTTGATATGGGTGACGTCCTGATCAATGGCAGTACAACTGGTATGTTCAAAATCATCAACTTTGATCAATCAAATATAATTTTCATAACTGGCTTCATTAATTTCGACTCAAAAATTCATATTGCAGGATTTTGTTCTGGTGGTTGCTCAGCGATTGCGGATTCAGGAACTTCGTTATTGGCTGGTCCAACGGTGTGTGTTGCCCTTCATGTGCAATAATGGGGTTAAGCATGTTTATTTTTTTTAGTATATGATTTTACCTATGTTGTAGGATGCGTTTATTTCAGCTCAATTCATAATAAAAATCATTAGGGATGGTTGAAGTGTTGAGTTGAATTTTGAAGCCTAGAGTTAAATATGTTGGAGTTATGGAGTTACAGAGTTTGGAGTTTCTTATTTAATGTGCAAAAAAAGAGAAATAGGGAAAAATGAAATCTCCATAATTGTGCAAGCTTTTAACTGTGAACATCATTAAAATGGTGGAGTTGAGTTATTTAACACTGACTTATGAATTTGATGGGCTAAATCCCCCATTTAGTGTCTCTCTCACATTGATTCACCTTCCAACATCTTAGGTTGCATCTTCATTTTTCCTCTTTGCACATATGTGCAAGAAAGACCATGGCCGGGTGTTCCTTCATTTTCATAGTAAGGCTTTGAGGTTTCTTGTTTGCATTAATTTTATCATTTTCTACGTCATTTCAGACTATTATTACTCAAGTTAATCATGCCATTGGGGCCTCCGGGGTTGTAAGTGAGGAATGCAAGGCTGTCGTTGCAGAATATGGAGAAACCATCATTAAAATGCTGTTAGCAAAGGTATTTGATGCCTTTCTTCACATTATCCAAACTTGAACTTAAGACTCTCAGGTCTTCATTTTGCTTTACTATTTTTCTTTTCTGGCAGGATCAACCAAAGAAAATCTGCTCTACGCTTGGCCTGTGTGCTTTTGATGGGGAGCGAGGTGTAAGGTTTGCAATTTCTTCTCCTTGACAGAAGCTGATGCTTACATTTCATGATCTTATTTCTGTTCTATTTTTTTTTACCGGGGAGTAATATTTTTTACATTCCTGTTCATATTTTGGTGAAATTGGTTTCAGTATGGGCATTGAGAGCGTTGTCGATAACACCACCCAGAAATCTTCAAATGGTTTACGTGATGTTATGTGCAACGCATGTGAGATGGCAGTTGTATGGGCACAAAGCCAGCTGAAGGATGAGAAAACCCAAGACCAGATTCTCAATTATATAGATGGGGTAAAATAACACTTCATCCTCAAGTTCCTTCAATACTCAGCTTCACTTGTGCTCATGTAGTCTTCTATTGTGATTGCAGCTCTGCGAGAAGTTGCCTAGTCCAATGGGAGAATCAGTAATTGATTGTGATTCCTTGTCTACTTTGCCTAGCATTTCATTCACTATTG
GCGGAAAGGTTTTTGAGCTCAAGCCGGAGCAGGTAAGATATGGTCGATTATCTATTTTCGTGTGAACATTTCTTTTAGCTCTACTAACAAGCAACAATAGCAATCTTGTTATCTTTTACAAAACAAGCCTATAATGAATGAATCTATATGGGTTTCTTTTAATTCGACCAGTACGTGCTCAAAGTCACCGAGGGACCTGTAACTGAATGCATCAGTGGATTTGCCGCTTTGGACGTTCCCCCTCCACGAGGACCCCTCTGGTTAGTCTTAATTTATCTTAGTTTCTCCATGCATGCTTCAAAATCACTTTTTTTTCATTTTCTAAACTATTCTAGAACACTTTTTTAATCCCTCGTTACTAATGGAATTTTAAACTATACACTTTCAAATTCTATTTACAATTTTAATTGCAAATAAGTAACACTTTTAGGACTAATAATTAAAGAGTATAACTACATTTTAAGAATTACAAATATAACAATATCTATCGGTGATAGATTTTATGGTTAATACTACTAGTGATGGCTATAGTTTATCATTGATAAAGTCCAAATAATGAATCTCAAGTTTGTTATACTTGCAAATTCTTTTGCATTCTACTATATCTACTAACACTCTTGTCTTATTGCTATATGCATTACTGCTCCTACTTCGAAAGGATTTTAAACATGTTTCATAGTCATTCTGAAAAATCACTCCAAACATGCCTAGGAATTCTTCTTCGCTTTCAGATTCCTGCTTTAGCTATAACATCGTCCTGTCTATTCCACAGGATCTTGGGGGATGTTTTCATGGGTTCCTACCATACAGTATTTGACTATGGAAACTCGAGAGTTGGGTTTGCAGAAGCTGCTTAAAATTTCAACTTGCTTTGTTTCTTCAAGTTGGTCAATTTCCACGACTCTCTATTGTTTCTTGAAGTTGTTTATGTAAATATGTATCTATATTTAGCATCGACACTTTCTGTATACTAAAAAAAAAGGGACCGTTAGGATATTTGGGACTCATTAAAAGGTTGGGATTTACATGCCGCCATCTATTTTAGGGTAAACTGGACTTACAGTACGCTTATTGATAATGTCTCAAGTCTTGCGCTCTTCTTTGAGTTTACAAACACCTTGTGGCCAGACTGCAAAAGCGAATCAGGAAGCTGAAGGTTTAATGTGGCCCAAGACCAAGAATAAATTTCCCAAAAATCCAAATACAATTCGGATTTGTTTATCTAATTTGACTGCAAAATAGCATAGGTGTTAGTGTTGGGTGTTTCCAAATTAGTAACCAAACGTTTAGGGGAGTTTAAAAGGTACAATTCGGCATTGTCTTCAGTGCTGAAACTGTTGCTTATGCACTTGTTGTGCCGGTGTGAGGAATTGGGTGGAGTAGAAGATTAGATTGGCGGGTTCATGAGCGGTGAACTGAATTTGTTTGATTTGCAAATCAAAAGGAAGGTTTGCATTTTTTCCTACAATGCATAGATAAAAATATACATGAAAATGCAAAAATAAAAGGTTAATTTAGATAAATGTGATGTAACTAAAGAAACTTTTTTAGAATTTCAGATAAATAGAAATAGACC

mRNA sequence

TGCTCTTTCTCATTTCTCTTAAATTTCCCCTTTTTCTTTTGCTTCAATCGTCTCTTTGGTCCGCGATCGAGTTCGGTTTTGCTTTCTTTTTTTATATACATCTGCTGTAGGATTTTTTACTCAGTGAGCAACATGGGTACGAGACTCAAACTTTTTATAGCCGTTCTCTTCATTTGCTTCTTTATGTTTCCAATGGTCTTTTGTGCATCCAATGATGGGAAGGTTAGAATTGGACTTAAAAGGAGGAAGTTTGGCCAGAACAACCGAGTTGCTAGCAAGATTGCAACCAAGGAAGGAATTTCTTTGAAAAATTCTGTTGAGAAATATCAACCTAGTGCAAATTTGGGAGATTCGGATGATTTTGATATTGTTGGATTGAAGAATTACTTGAATGCTCAGTATTTTGGTGAGATTGGTATTGGTACTCCCCCTCAAAAGTTTACTGTTATTTTTGACACTGGTAGTTCAAATTTGTGGGTTCCATCATCTAAGTGCTTCTCTGTGGCTTGTCTACTCCATTCCAAGTACAAGTCAAAGCGATCGAGTACTTATAAAAAGAATGGTAAATCTGCTTCAATCAAATATGGGACTGGTGCTATTTCTGGCTACTTTAGTGAAGACAATGTTAAAGTTGGCGACCTAATAGTTAAAAAACAGGATTTTATCGAGGCAACTAGGGAACCCAGCCTCACGTTTGTACTCGCCCAATTTGATGGCATATTAGGACTTGGCTTTAAAGAGATCTCAGTTGGAGATGCAGTGCCTGTATGGTACAACATGGTTGATCAAAACCTTGTAAAGGAACCTGTTTTCTCTTTTTGGTTTAATCGGAATGCAGATGAGGAACAAGGTGGTGAAATCGTATTTGGTGGTGTGGATCCTGATCACTACAAGGGGGAACATACATATGTTCCAGTTACTAAAAAGGGATATTGGCAGTTTGATATGGGTGACGTCCTGATCAATGGCAGTACAACTGGATTTTGTTCTGGTGGTTGCTCAGCGATTGCGGATTCAGGAACTTCGTTATTGGCTGGTCCAACGACTATTATTACTCAAGTTAATCATGCCATTGGGGCCTCCGGGGTTGTAAGTGAGGAATGCAAGGCTGTCGTTGCAGAATATGGAGAAACCATCATTAAAATGCTGTTAGCAAAGGATCAACCAAAGAAAATCTGCTCTACGCTTGGCCTGTGTGCTTTTGATGGGGAGCGAGGTGTAAGTATGGGCATTGAGAGCGTTGTCGATAACACCACCCAGAAATCTTCAAATGGTTTACGTGATGTTATGTGCAACGCATGTGAGATGGCAGTTGTATGGGCACAAAGCCAGCTGAAGGATGAGAAAACCCAAGACCAGATTCTCAATTATATAGATGGGCTCTGCGAGAAGTTGCCTAGTCCAATGGGAGAATCAGTAATTGATTGTGATTCCTTGTCTACTTTGCCTAGCATTTCATTCACTATTGGCGGAAAGGTTTTTGAGCTCAAGCCGGAGCAGTACGTGCTCAAAGTCACCGAGGGACCTGTAACTGAATGCATCAGTGGATTTGCCGCTTTGGACGTTCCCCCTCCACGAGGACCCCTCTGGATCTTGGGGGATGTTTTCATGGGTTCCTACCATACAGTATTTGACTATGGAAACTCGAGAGTTGGGTTTGCAGAAGCTGCTTAAAATTTCAACTTGCTTTGTTTCTTCAAGTTGGTCAATTTCCACGACTCTCTATTGTTTCTTGAAGTTGTTTATGTAAATATGTATCTATATTTAGCATCGACACTTTCTGTATACTAAAAAAAAAGGGACCGTTAGGATATTTGGGACTCATTAAAAGGTTGGGATTTACATGCCGCCATCTATTTTAGGGTAAACTGGACTTACAGTACGCTTATTGATAATGTCTCAAGTCTTGCGCTCTTCTTTGAGTTTACAAACACCTTGTGGCCAGACTGCAAAAGCGAATCAGGAAGCTGAAGGTTTAATGTGGCCCAAGACCAAGAATA
AATTTCCCAAAAATCCAAATACAATTCGGATTTGTTTATCTAATTTGACTGCAAAATAGCATAGGTGTTAGTGTTGGGTGTTTCCAAATTAGTAACCAAACGTTTAGGGGAGTTTAAAAGGTACAATTCGGCATTGTCTTCAGTGCTGAAACTGTTGCTTATGCACTTGTTGTGCCGGTGTGAGGAATTGGGTGGAGTAGAAGATTAGATTGGCGGGTTCATGAGCGGTGAACTGAATTTGTTTGATTTGCAAATCAAAAGGAAGGTTTGCATTTTTTCCTACAATGCATAGATAAAAATATACATGAAAATGCAAAAATAAAAGGTTAATTTAGATAAATGTGATGTAACTAAAGAAACTTTTTTAGAATTTCAGATAAATAGAAATAGACC

Coding sequence (CDS)

ATGGGTACGAGACTCAAACTTTTTATAGCCGTTCTCTTCATTTGCTTCTTTATGTTTCCAATGGTCTTTTGTGCATCCAATGATGGGAAGGTTAGAATTGGACTTAAAAGGAGGAAGTTTGGCCAGAACAACCGAGTTGCTAGCAAGATTGCAACCAAGGAAGGAATTTCTTTGAAAAATTCTGTTGAGAAATATCAACCTAGTGCAAATTTGGGAGATTCGGATGATTTTGATATTGTTGGATTGAAGAATTACTTGAATGCTCAGTATTTTGGTGAGATTGGTATTGGTACTCCCCCTCAAAAGTTTACTGTTATTTTTGACACTGGTAGTTCAAATTTGTGGGTTCCATCATCTAAGTGCTTCTCTGTGGCTTGTCTACTCCATTCCAAGTACAAGTCAAAGCGATCGAGTACTTATAAAAAGAATGGTAAATCTGCTTCAATCAAATATGGGACTGGTGCTATTTCTGGCTACTTTAGTGAAGACAATGTTAAAGTTGGCGACCTAATAGTTAAAAAACAGGATTTTATCGAGGCAACTAGGGAACCCAGCCTCACGTTTGTACTCGCCCAATTTGATGGCATATTAGGACTTGGCTTTAAAGAGATCTCAGTTGGAGATGCAGTGCCTGTATGGTACAACATGGTTGATCAAAACCTTGTAAAGGAACCTGTTTTCTCTTTTTGGTTTAATCGGAATGCAGATGAGGAACAAGGTGGTGAAATCGTATTTGGTGGTGTGGATCCTGATCACTACAAGGGGGAACATACATATGTTCCAGTTACTAAAAAGGGATATTGGCAGTTTGATATGGGTGACGTCCTGATCAATGGCAGTACAACTGGATTTTGTTCTGGTGGTTGCTCAGCGATTGCGGATTCAGGAACTTCGTTATTGGCTGGTCCAACGACTATTATTACTCAAGTTAATCATGCCATTGGGGCCTCCGGGGTTGTAAGTGAGGAATGCAAGGCTGTCGTTGCAGAATATGGAGAAACCATCATTAAAATGCTGTTAGCAAAGGATCAACCAAAGAAAATCTGCTCTACGCTTGGCCTGTGTGCTTTTGATGGGGAGCGAGGTGTAAGTATGGGCATTGAGAGCGTTGTCGATAACACCACCCAGAAATCTTCAAATGGTTTACGTGATGTTATGTGCAACGCATGTGAGATGGCAGTTGTATGGGCACAAAGCCAGCTGAAGGATGAGAAAACCCAAGACCAGATTCTCAATTATATAGATGGGCTCTGCGAGAAGTTGCCTAGTCCAATGGGAGAATCAGTAATTGATTGTGATTCCTTGTCTACTTTGCCTAGCATTTCATTCACTATTGGCGGAAAGGTTTTTGAGCTCAAGCCGGAGCAGTACGTGCTCAAAGTCACCGAGGGACCTGTAACTGAATGCATCAGTGGATTTGCCGCTTTGGACGTTCCCCCTCCACGAGGACCCCTCTGGATCTTGGGGGATGTTTTCATGGGTTCCTACCATACAGTATTTGACTATGGAAACTCGAGAGTTGGGTTTGCAGAAGCTGCTTAA

Protein sequence

MGTRLKLFIAVLFICFFMFPMVFCASNDGKVRIGLKRRKFGQNNRVASKIATKEGISLKNSVEKYQPSANLGDSDDFDIVGLKNYLNAQYFGEIGIGTPPQKFTVIFDTGSSNLWVPSSKCFSVACLLHSKYKSKRSSTYKKNGKSASIKYGTGAISGYFSEDNVKVGDLIVKKQDFIEATREPSLTFVLAQFDGILGLGFKEISVGDAVPVWYNMVDQNLVKEPVFSFWFNRNADEEQGGEIVFGGVDPDHYKGEHTYVPVTKKGYWQFDMGDVLINGSTTGFCSGGCSAIADSGTSLLAGPTTIITQVNHAIGASGVVSEECKAVVAEYGETIIKMLLAKDQPKKICSTLGLCAFDGERGVSMGIESVVDNTTQKSSNGLRDVMCNACEMAVVWAQSQLKDEKTQDQILNYIDGLCEKLPSPMGESVIDCDSLSTLPSISFTIGGKVFELKPEQYVLKVTEGPVTECISGFAALDVPPPRGPLWILGDVFMGSYHTVFDYGNSRVGFAEAA*
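
The protein sequence corresponds to a codon-by-codon translation of the CDS, with the trailing `*` marking the stop codon. The sketch below illustrates this in Python using the standard genetic code. The `translate` function and the short test fragments (the first 18 and last 12 nucleotides of the CDS above) are chosen for illustration only; this is not necessarily how the database generated its protein record.

```python
# Standard genetic code, built from the conventional TCAG codon ordering.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a DNA coding sequence in frame 0; stop codons become '*'."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

print(translate("ATGGGTACGAGACTCAAA"))  # MGTRLK
print(translate("GAAGCTGCTTAA"))        # EAA*
```

Both fragments agree with the start (`MGTRLK`) and end (`EAA*`) of the protein sequence listed above.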
Homology
BLAST of CSPI03G04660 vs. ExPASy Swiss-Prot
Match: O04057 (Aspartic proteinase OS=Cucurbita pepo OX=3663 PE=2 SV=1)

HSP 1 Score: 749.2 bits (1933), Expect = 3.1e-215
Identity = 349/504 (69.25%), Positives = 424/504 (84.13%), Query Frame = 0

Query: 13  FICFFM---FPMVFCASNDGKVRIGLKRRKFGQNNRVASKIATKEGISLKNSVEKYQPSA 72
           F+C F+   F +V  ASNDG +R+GLK+ K    NR+A+++ +K+   LK +  KY P  
Sbjct: 10  FLCLFLLVSFNIVSSASNDGLLRVGLKKIKLDPENRLAARVESKDAEILKAAFRKYNPKG 69

Query: 73  NLGDSDDFDIVGLKNYLNAQYFGEIGIGTPPQKFTVIFDTGSSNLWVPSSKCFSVACLLH 132
           NLG+S D DIV LKNYL+AQY+GEI IGTPPQKFTVIFDTGSSNLWV     FSVAC  H
Sbjct: 70  NLGESSDTDIVALKNYLDAQYYGEIAIGTPPQKFTVIFDTGSSNLWVLCECLFSVACHFH 129

Query: 133 SKYKSKRSSTYKKNGKSASIKYGTGAISGYFSEDNVKVGDLIVKKQDFIEATREPSLTFV 192
           ++YKS RSS+YKKNG SASI+YGTGA+SG+FS DNVKVGDL+VK+Q FIEATREPSLTF+
Sbjct: 130 ARYKSSRSSSYKKNGTSASIRYGTGAVSGFFSYDNVKVGDLVVKEQVFIEATREPSLTFL 189

Query: 193 LAQFDGILGLGFKEISVGDAVPVWYNMVDQNLVKEPVFSFWFNRNADEEQGGEIVFGGVD 252
           +A+FDG+LGLGF+EI+VG+AVPVWYNMV+Q LVKEPVFSFW NRN +EE+GGEIVFGGVD
Sbjct: 190 VAKFDGLLGLGFQEIAVGNAVPVWYNMVEQGLVKEPVFSFWLNRNVEEEEGGEIVFGGVD 249

Query: 253 PDHYKGEHTYVPVTKKGYWQFDMGDVLINGSTTGFCSGGCSAIADSGTSLLAGPTTIITQ 312
           P HY+G+HTYVPVT+KGYWQFDMGDVLI+G  TGFC GGCSAIADSGTSLLAGPT +IT 
Sbjct: 250 PKHYRGKHTYVPVTQKGYWQFDMGDVLIDGEPTGFCDGGCSAIADSGTSLLAGPTPVITM 309

Query: 313 VNHAIGASGVVSEECKAVVAEYGETIIKMLLAKDQPKKICSTLGLCAFDGERGVSMGIES 372
           +NHAIGA GVVS++CKAVVA+YG+TI+ +LL++  PKKICS + LC FDG RGVSMGIES
Sbjct: 310 INHAIGAKGVVSQQCKAVVAQYGQTIMDLLLSEADPKKICSQINLCTFDGTRGVSMGIES 369

Query: 373 VVDNTTQKSSNGLRDVMCNACEMAVVWAQSQLKDEKTQDQILNYIDGLCEKLPSPMGESV 432
           VVD    KSS+ L D MC+ CEM VVW Q+QL+  +T+++I+NYI+ LC+++PSPMG+S 
Sbjct: 370 VVDENAGKSSDSLHDGMCSVCEMTVVWMQNQLRQNQTKERIINYINELCDRMPSPMGQSA 429

Query: 433 IDCDSLSTLPSISFTIGGKVFELKPEQYVLKVTEGPVTECISGFAALDVPPPRGPLWILG 492
           +DC  LS++P++SFTIGGK+F+L PE+Y+LKV EGPV +CISGF A D+PPPRGPLWILG
Sbjct: 430 VDCGQLSSMPTVSFTIGGKIFDLAPEEYILKVGEGPVAQCISGFTAFDIPPPRGPLWILG 489

Query: 493 DVFMGSYHTVFDYGNSRVGFAEAA 514
           DVFMG YHTVFD+G  RVG AEAA
Sbjct: 490 DVFMGRYHTVFDFGKLRVGSAEAA 513
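
The percentages in each HSP header are the matched-column count divided by the alignment length. A small Python sketch that recomputes them from the fractions follows; `hsp_percentages` is a hypothetical helper, and the input line is the header of the O04057 hit above.

```python
import re

def hsp_percentages(header):
    """Recompute percentages from 'Label = matched/length' pairs in an HSP header."""
    results = []
    for label, num, den in re.findall(r"(\w+) = (\d+)/(\d+)", header):
        results.append((label, round(100 * int(num) / int(den), 2)))
    return results

line = "Identity = 349/504 (69.25%), Positives = 424/504 (84.13%), Query Frame = 0"
print(hsp_percentages(line))  # [('Identity', 69.25), ('Positives', 84.13)]
```

The recomputed values match those printed in the header, which shows that the denominator (504) is the alignment length including gap columns rather than the query length.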

BLAST of CSPI03G04660 vs. ExPASy Swiss-Prot
Match: O65390 (Aspartic proteinase A1 OS=Arabidopsis thaliana OX=3702 GN=APA1 PE=1 SV=1)

HSP 1 Score: 743.4 bits (1918), Expect = 1.7e-213
Identity = 353/503 (70.18%), Positives = 418/503 (83.10%), Query Frame = 0

Query: 12  LFICFFMFPMVFCASNDGKVRIGLKRRKFGQNNRVASKIATKEGISLKNSVEKYQPSANL 71
           L + F +    F   NDG  R+GLK+ K    NR+A+++ +K+        EK   +  L
Sbjct: 12  LIVSFLLCFSAFAERNDGTFRVGLKKLKLDSKNRLAARVESKQ--------EKPLRAYRL 71

Query: 72  GDSDDFDIVGLKNYLNAQYFGEIGIGTPPQKFTVIFDTGSSNLWVPSSKC-FSVACLLHS 131
           GDS D D+V LKNYL+AQY+GEI IGTPPQKFTV+FDTGSSNLWVPSSKC FS+ACLLH 
Sbjct: 72  GDSGDADVVVLKNYLDAQYYGEIAIGTPPQKFTVVFDTGSSNLWVPSSKCYFSLACLLHP 131

Query: 132 KYKSKRSSTYKKNGKSASIKYGTGAISGYFSEDNVKVGDLIVKKQDFIEATREPSLTFVL 191
           KYKS RSSTY+KNGK+A+I YGTGAI+G+FS D V VGDL+VK Q+FIEAT+EP +TFV+
Sbjct: 132 KYKSSRSSTYEKNGKAAAIHYGTGAIAGFFSNDAVTVGDLVVKDQEFIEATKEPGITFVV 191

Query: 192 AQFDGILGLGFKEISVGDAVPVWYNMVDQNLVKEPVFSFWFNRNADEEQGGEIVFGGVDP 251
           A+FDGILGLGF+EISVG A PVWYNM+ Q L+KEPVFSFW NRNADEE+GGE+VFGGVDP
Sbjct: 192 AKFDGILGLGFQEISVGKAAPVWYNMLKQGLIKEPVFSFWLNRNADEEEGGELVFGGVDP 251

Query: 252 DHYKGEHTYVPVTKKGYWQFDMGDVLINGSTTGFCSGGCSAIADSGTSLLAGPTTIITQV 311
           +H+KG+HTYVPVT+KGYWQFDMGDVLI G+ TGFC  GCSAIADSGTSLLAGPTTIIT +
Sbjct: 252 NHFKGKHTYVPVTQKGYWQFDMGDVLIGGAPTGFCESGCSAIADSGTSLLAGPTTIITMI 311

Query: 312 NHAIGASGVVSEECKAVVAEYGETIIKMLLAKDQPKKICSTLGLCAFDGERGVSMGIESV 371
           NHAIGA+GVVS++CK VV +YG+TI+ +LL++ QPKKICS +GLC FDG RGVSMGIESV
Sbjct: 312 NHAIGAAGVVSQQCKTVVDQYGQTILDLLLSETQPKKICSQIGLCTFDGTRGVSMGIESV 371

Query: 372 VDNTTQKSSNGLRDVMCNACEMAVVWAQSQLKDEKTQDQILNYIDGLCEKLPSPMGESVI 431
           VD    K SNG+ D  C+ACEMAVVW QSQL+   TQ++ILNY++ LCE+LPSPMGES +
Sbjct: 372 VDKENAKLSNGVGDAACSACEMAVVWIQSQLRQNMTQERILNYVNELCERLPSPMGESAV 431

Query: 432 DCDSLSTLPSISFTIGGKVFELKPEQYVLKVTEGPVTECISGFAALDVPPPRGPLWILGD 491
           DC  LST+P++S TIGGKVF+L PE+YVLKV EGPV +CISGF ALDV PPRGPLWILGD
Sbjct: 432 DCAQLSTMPTVSLTIGGKVFDLAPEEYVLKVGEGPVAQCISGFIALDVAPPRGPLWILGD 491

Query: 492 VFMGSYHTVFDYGNSRVGFAEAA 514
           VFMG YHTVFD+GN +VGFAEAA
Sbjct: 492 VFMGKYHTVFDFGNEQVGFAEAA 506

BLAST of CSPI03G04660 vs. ExPASy Swiss-Prot
Match: Q8VYL3 (Aspartic proteinase A2 OS=Arabidopsis thaliana OX=3702 GN=APA2 PE=2 SV=1)

HSP 1 Score: 715.7 bits (1846), Expect = 3.8e-205
Identity = 334/513 (65.11%), Positives = 415/513 (80.90%), Query Frame = 0

Query: 1   MGTRLKLFIAVLFICFFMFPMVFCASNDGKVRIGLKRRKFGQNNRVASKIATKEGISLKN 60
           MG   +     +F+ F +F   +   NDG  R+GLK+ K   NNR+A++  +K+  +L++
Sbjct: 1   MGVYSRAVAFSVFVSFLLFFTAYSKRNDGTFRVGLKKLKLDPNNRLATRFGSKQEEALRS 60

Query: 61  SVEKYQPSANLGDSDDFDIVGLKNYLNAQYFGEIGIGTPPQKFTVIFDTGSSNLWVPSSK 120
           S+  Y  +   GDS D DIV LKNYL+AQY+GEI IGTPPQKFTVIFDTGSSNLWVPS K
Sbjct: 61  SLRSYNNNLG-GDSGDADIVPLKNYLDAQYYGEIAIGTPPQKFTVIFDTGSSNLWVPSGK 120

Query: 121 C-FSVACLLHSKYKSKRSSTYKKNGKSASIKYGTGAISGYFSEDNVKVGDLIVKKQDFIE 180
           C FS++C  H+KYKS RSSTYKK+GK A+I YG+G+ISG+FS D V VGDL+VK Q+FIE
Sbjct: 121 CFFSLSCYFHAKYKSSRSSTYKKSGKRAAIHYGSGSISGFFSYDAVTVGDLVVKDQEFIE 180

Query: 181 ATREPSLTFVLAQFDGILGLGFKEISVGDAVPVWYNMVDQNLVKEPVFSFWFNRNADEEQ 240
            T EP LTF++A+FDG+LGLGF+EI+VG+A PVWYNM+ Q L+K PVFSFW NR+   E+
Sbjct: 181 TTSEPGLTFLVAKFDGLLGLGFQEIAVGNATPVWYNMLKQGLIKRPVFSFWLNRDPKSEE 240

Query: 241 GGEIVFGGVDPDHYKGEHTYVPVTKKGYWQFDMGDVLINGSTTGFCSGGCSAIADSGTSL 300
           GGEIVFGGVDP H++GEHT+VPVT++GYWQFDMG+VLI G +TG+C  GCSAIADSGTSL
Sbjct: 241 GGEIVFGGVDPKHFRGEHTFVPVTQRGYWQFDMGEVLIAGESTGYCGSGCSAIADSGTSL 300

Query: 301 LAGPTTIITQVNHAIGASGVVSEECKAVVAEYGETIIKMLLAKDQPKKICSTLGLCAFDG 360
           LAGPT ++  +N AIGASGVVS++CK VV +YG+TI+ +LLA+ QPKKICS +GLCA+DG
Sbjct: 301 LAGPTAVVAMINKAIGASGVVSQQCKTVVDQYGQTILDLLLAETQPKKICSQIGLCAYDG 360

Query: 361 ERGVSMGIESVVDNTTQKSSNGLRDVMCNACEMAVVWAQSQLKDEKTQDQILNYIDGLCE 420
             GVSMGIESVVD    +SS+GLRD  C ACEMAVVW QSQL+   TQ++I+NYI+ +CE
Sbjct: 361 THGVSMGIESVVDKENTRSSSGLRDAGCPACEMAVVWIQSQLRQNMTQERIVNYINEICE 420

Query: 421 KLPSPMGESVIDCDSLSTLPSISFTIGGKVFELKPEQYVLKVTEGPVTECISGFAALDVP 480
           ++PSP GES +DC  LS +P++SFTIGGKVF+L PE+YVLK+ EGPV +CISGF ALD+P
Sbjct: 421 RMPSPNGESAVDCSQLSKMPTVSFTIGGKVFDLAPEEYVLKIGEGPVAQCISGFTALDIP 480

Query: 481 PPRGPLWILGDVFMGSYHTVFDYGNSRVGFAEA 513
           PPRGPLWILGDVFMG YHTVFD+GN +VGFAEA
Sbjct: 481 PPRGPLWILGDVFMGKYHTVFDFGNEQVGFAEA 512

BLAST of CSPI03G04660 vs. ExPASy Swiss-Prot
Match: Q42456 (Aspartic proteinase oryzasin-1 OS=Oryza sativa subsp. japonica OX=39947 GN=Os05g0567100 PE=2 SV=2)

HSP 1 Score: 707.2 bits (1824), Expect = 1.3e-202
Identity = 341/517 (65.96%), Positives = 421/517 (81.43%), Query Frame = 0

Query: 1   MGTRLKLFIAVLFICFFMFPMVFCASNDGKVRIGLKRRKFGQNNRVASKIATKEG---IS 60
           MGTR      VL     +  ++  ++ +G VRI LK+R   +N+RVA++++ +EG   + 
Sbjct: 1   MGTR--SVALVLLAAVLLQALLPASAAEGLVRIALKKRPIDENSRVAARLSGEEGARRLG 60

Query: 61  LKNSVEKYQPSANLGDSDDFDIVGLKNYLNAQYFGEIGIGTPPQKFTVIFDTGSSNLWVP 120
           L+ +      ++  G   + DIV LKNY+NAQYFGEIG+GTPPQKFTVIFDTGSSNLWVP
Sbjct: 61  LRGA------NSLGGGGGEGDIVALKNYMNAQYFGEIGVGTPPQKFTVIFDTGSSNLWVP 120

Query: 121 SSKC-FSVACLLHSKYKSKRSSTYKKNGKSASIKYGTGAISGYFSEDNVKVGDLIVKKQD 180
           S+KC FS+AC  HS+YKS +SSTY+KNGK A+I+YGTG+I+G+FSED+V VGDL+VK Q+
Sbjct: 121 SAKCYFSIACFFHSRYKSGQSSTYQKNGKPAAIQYGTGSIAGFFSEDSVTVGDLVVKDQE 180

Query: 181 FIEATREPSLTFVLAQFDGILGLGFKEISVGDAVPVWYNMVDQNLVKEPVFSFWFNRNAD 240
           FIEAT+EP LTF++A+FDGILGLGF+EISVGDAVPVWY MV+Q LV EPVFSFWFNR++D
Sbjct: 181 FIEATKEPGLTFMVAKFDGILGLGFQEISVGDAVPVWYKMVEQGLVSEPVFSFWFNRHSD 240

Query: 241 EEQGGEIVFGGVDPDHYKGEHTYVPVTKKGYWQFDMGDVLINGSTTGFCSGGCSAIADSG 300
           E +GGEIVFGG+DP HYKG HTYVPV++KGYWQF+MGDVLI G TTGFC+ GCSAIADSG
Sbjct: 241 EGEGGEIVFGGMDPSHYKGNHTYVPVSQKGYWQFEMGDVLIGGKTTGFCASGCSAIADSG 300

Query: 301 TSLLAGPTTIITQVNHAIGASGVVSEECKAVVAEYGETIIKMLLAKDQPKKICSTLGLCA 360
           TSLLAGPT IIT++N  IGA+GVVS+ECK VV++YG+ I+ +LLA+ QP KICS +GLC 
Sbjct: 301 TSLLAGPTAIITEINEKIGATGVVSQECKTVVSQYGQQILDLLLAETQPSKICSQVGLCT 360

Query: 361 FDGERGVSMGIESVVDNTTQKSSNGLRDVMCNACEMAVVWAQSQLKDEKTQDQILNYIDG 420
           FDG+ GVS GI+SVVD+   +S+      MCNACEMAVVW Q+QL   KTQD ILNYI+ 
Sbjct: 361 FDGKHGVSAGIKSVVDDEAGESNGLQSGPMCNACEMAVVWMQNQLAQNKTQDLILNYINQ 420

Query: 421 LCEKLPSPMGESVIDCDSLSTLPSISFTIGGKVFELKPEQYVLKVTEGPVTECISGFAAL 480
           LC+KLPSPMGES +DC SL+++P ISFTIGGK F LKPE+Y+LKV EG   +CISGF A+
Sbjct: 421 LCDKLPSPMGESSVDCGSLASMPEISFTIGGKKFALKPEEYILKVGEGAAAQCISGFTAM 480

Query: 481 DVPPPRGPLWILGDVFMGSYHTVFDYGNSRVGFAEAA 514
           D+PPPRGPLWILGDVFMG+YHTVFDYG  RVGFA++A
Sbjct: 481 DIPPPRGPLWILGDVFMGAYHTVFDYGKMRVGFAKSA 509

BLAST of CSPI03G04660 vs. ExPASy Swiss-Prot
Match: P42210 (Phytepsin OS=Hordeum vulgare OX=4513 PE=1 SV=1)

HSP 1 Score: 702.2 bits (1811), Expect = 4.3e-201
Identity = 342/516 (66.28%), Positives = 416/516 (80.62%), Query Frame = 0

Query: 1   MGTRLKLFIAVLFICFFMFPMVFCASN-DGKVRIGLKRRKFGQNNRVASKIATKEGISLK 60
           MGTR  L +A+L     +  ++  AS  +G VRI LK+R   +N+RVA+ ++  E   L 
Sbjct: 1   MGTR-GLALALLAAVLLLQTVLPAASEAEGLVRIALKKRPIDRNSRVATGLSGGEEQPLL 60

Query: 61  NSVEKYQPSANLGDSDDFDIVGLKNYLNAQYFGEIGIGTPPQKFTVIFDTGSSNLWVPSS 120
           +          L   ++ DIV LKNY+NAQYFGEIG+GTPPQKFTVIFDTGSSNLWVPS+
Sbjct: 61  SGANP------LRSEEEGDIVALKNYMNAQYFGEIGVGTPPQKFTVIFDTGSSNLWVPSA 120

Query: 121 KC-FSVACLLHSKYKSKRSSTYKKNGKSASIKYGTGAISGYFSEDNVKVGDLIVKKQDFI 180
           KC FS+AC LHS+YK+  SSTYKKNGK A+I+YGTG+I+GYFSED+V VGDL+VK Q+FI
Sbjct: 121 KCYFSIACYLHSRYKAGASSTYKKNGKPAAIQYGTGSIAGYFSEDSVTVGDLVVKDQEFI 180

Query: 181 EATREPSLTFVLAQFDGILGLGFKEISVGDAVPVWYNMVDQNLVKEPVFSFWFNRNADEE 240
           EAT+EP +TF++A+FDGILGLGFKEISVG AVPVWY M++Q LV +PVFSFW NR+ DE 
Sbjct: 181 EATKEPGITFLVAKFDGILGLGFKEISVGKAVPVWYKMIEQGLVSDPVFSFWLNRHVDEG 240

Query: 241 QGGEIVFGGVDPDHYKGEHTYVPVTKKGYWQFDMGDVLINGSTTGFCSGGCSAIADSGTS 300
           +GGEI+FGG+DP HY GEHTYVPVT+KGYWQFDMGDVL+ G +TGFC+GGC+AIADSGTS
Sbjct: 241 EGGEIIFGGMDPKHYVGEHTYVPVTQKGYWQFDMGDVLVGGKSTGFCAGGCAAIADSGTS 300

Query: 301 LLAGPTTIITQVNHAIGASGVVSEECKAVVAEYGETIIKMLLAKDQPKKICSTLGLCAFD 360
           LLAGPT IIT++N  IGA+GVVS+ECK +V++YG+ I+ +LLA+ QPKKICS +GLC FD
Sbjct: 301 LLAGPTAIITEINEKIGAAGVVSQECKTIVSQYGQQILDLLLAETQPKKICSQVGLCTFD 360

Query: 361 GERGVSMGIESVVDNTTQKSSNGLR-DVMCNACEMAVVWAQSQLKDEKTQDQILNYIDGL 420
           G RGVS GI SVVD+   K SNGLR D MC+ACEMAVVW Q+QL   KTQD IL+Y++ L
Sbjct: 361 GTRGVSAGIRSVVDDEPVK-SNGLRADPMCSACEMAVVWMQNQLAQNKTQDLILDYVNQL 420

Query: 421 CEKLPSPMGESVIDCDSLSTLPSISFTIGGKVFELKPEQYVLKVTEGPVTECISGFAALD 480
           C +LPSPMGES +DC SL ++P I FTIGGK F LKPE+Y+LKV EG   +CISGF A+D
Sbjct: 421 CNRLPSPMGESAVDCGSLGSMPDIEFTIGGKKFALKPEEYILKVGEGAAAQCISGFTAMD 480

Query: 481 VPPPRGPLWILGDVFMGSYHTVFDYGNSRVGFAEAA 514
           +PPPRGPLWILGDVFMG YHTVFDYG  R+GFA+AA
Sbjct: 481 IPPPRGPLWILGDVFMGPYHTVFDYGKLRIGFAKAA 508

BLAST of CSPI03G04660 vs. ExPASy TrEMBL
Match: A0A5D3DKC0 (Aspartic proteinase A1-like OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold943G00070 PE=3 SV=1)

HSP 1 Score: 1020.0 bits (2636), Expect = 3.5e-294
Identity = 504/516 (97.67%), Positives = 509/516 (98.64%), Query Frame = 0

Query: 1   MGTRLKLFIAVLFICFFMFPMVFCASNDGKVRIGLKRRKFGQNNRVASKIATKEGISLKN 60
           M TRLKLFIAVLFICF MFPMVFCASNDGKVRIGLKRRKFGQN+RVASKIATKEGISLKN
Sbjct: 1   MSTRLKLFIAVLFICFLMFPMVFCASNDGKVRIGLKRRKFGQNHRVASKIATKEGISLKN 60

Query: 61  SVEKYQPSANLGDSDDFDIVGLKNYLNAQYFGEIGIGTPPQKFTVIFDTGSSNLWVPSSK 120
            VEKYQPSANLGDSDDFDIVGLKNYLNAQYFGEIGIGTPPQKFTV+FDTGSSNLWVPSSK
Sbjct: 61  YVEKYQPSANLGDSDDFDIVGLKNYLNAQYFGEIGIGTPPQKFTVVFDTGSSNLWVPSSK 120

Query: 121 CFSVACLLHSKYKSKRSSTYKKNGKSASIKYGTGAISGYFSEDNVKVGDLIVKKQDFIEA 180
           CFSVACLLHSKYKSKRSSTYKKNGK ASIKYGTGAISGYFSEDNVKVGDLIVKKQDFIEA
Sbjct: 121 CFSVACLLHSKYKSKRSSTYKKNGKPASIKYGTGAISGYFSEDNVKVGDLIVKKQDFIEA 180

Query: 181 TREPSLTFVLAQFDGILGLGFKEISVGDAVPVW---YNMVDQNLVKEPVFSFWFNRNADE 240
           TREPSLTFVLAQFDGILGLGFKEISVGDAVPVW   Y+MVDQNLVKEPVFSFWFNRNADE
Sbjct: 181 TREPSLTFVLAQFDGILGLGFKEISVGDAVPVWYYRYSMVDQNLVKEPVFSFWFNRNADE 240

Query: 241 EQGGEIVFGGVDPDHYKGEHTYVPVTKKGYWQFDMGDVLINGSTTGFCSGGCSAIADSGT 300
           EQGGEIVFGGVDPDHYKGEHTYVPVTKKGYWQFDMGDVLINGSTTGFCSGGCSAIADSGT
Sbjct: 241 EQGGEIVFGGVDPDHYKGEHTYVPVTKKGYWQFDMGDVLINGSTTGFCSGGCSAIADSGT 300

Query: 301 SLLAGPTTIITQVNHAIGASGVVSEECKAVVAEYGETIIKMLLAKDQPKKICSTLGLCAF 360
           SLLAGPTTIITQVNHAIGASGVVSEECKAVVAEYGETIIKMLLAKDQPKKICSTLGLCAF
Sbjct: 301 SLLAGPTTIITQVNHAIGASGVVSEECKAVVAEYGETIIKMLLAKDQPKKICSTLGLCAF 360

Query: 361 DGERGVSMGIESVVDNTTQKSSNGLRDVMCNACEMAVVWAQSQLKDEKTQDQILNYIDGL 420
           DGERGVSMGIESVVDNTTQKSSNGLRDVMCNACEMAVVWAQSQLKDEKTQDQILNYIDGL
Sbjct: 361 DGERGVSMGIESVVDNTTQKSSNGLRDVMCNACEMAVVWAQSQLKDEKTQDQILNYIDGL 420

Query: 421 CEKLPSPMGESVIDCDSLSTLPSISFTIGGKVFELKPEQYVLKVTEGPVTECISGFAALD 480
           CEKLPSPMGESVIDCDSLSTLP+I+FTIGGKVFELKPEQYVLKVTEGPVTECISGFAALD
Sbjct: 421 CEKLPSPMGESVIDCDSLSTLPNIAFTIGGKVFELKPEQYVLKVTEGPVTECISGFAALD 480

Query: 481 VPPPRGPLWILGDVFMGSYHTVFDYGNSRVGFAEAA 514
           VPPPRGPLWILGDVFMGSYHTVFDYGNSRVGFAEAA
Sbjct: 481 VPPPRGPLWILGDVFMGSYHTVFDYGNSRVGFAEAA 516

BLAST of CSPI03G04660 vs. ExPASy TrEMBL
Match: A0A6J1L5Y5 (aspartic proteinase A1-like OS=Cucurbita maxima OX=3661 GN=LOC111500293 PE=3 SV=1)

HSP 1 Score: 968.8 bits (2503), Expect = 9.1e-279
Identity = 479/513 (93.37%), Positives = 493/513 (96.10%), Query Frame = 0

Query: 1   MGTRLKLFIAVLFICFFMFPMVFCASNDGKVRIGLKRRKFGQNNRVASKIATKEGISLKN 60
           MGTR+KLF A+LFICF MFPMVF ASNDGKVRIGLKRRKFGQN RVASKIATKEG+SLKN
Sbjct: 1   MGTRIKLFTALLFICFLMFPMVFSASNDGKVRIGLKRRKFGQNIRVASKIATKEGVSLKN 60

Query: 61  SVEKYQPSANLGDSDDFDIVGLKNYLNAQYFGEIGIGTPPQKFTVIFDTGSSNLWVPSSK 120
           SVEK      LGDS D DIV LKNYLNAQYFGEIGIGTPPQKFTVIFDTGSSNLWVPSSK
Sbjct: 61  SVEK------LGDSADTDIVALKNYLNAQYFGEIGIGTPPQKFTVIFDTGSSNLWVPSSK 120

Query: 121 CFSVACLLHSKYKSKRSSTYKKNGKSASIKYGTGAISGYFSEDNVKVGDLIVKKQDFIEA 180
           CFSVACLLHSKYKSKRSSTY+KNGK ASI+YGTGAISGYFSED+VKVGDLIVKKQ+FIEA
Sbjct: 121 CFSVACLLHSKYKSKRSSTYEKNGKHASIQYGTGAISGYFSEDHVKVGDLIVKKQEFIEA 180

Query: 181 TREPSLTFVLAQFDGILGLGFKEISVGDAVPVWYNMVDQNLVKEPVFSFWFNRNADEEQG 240
           T+EPSLTFVLAQFDGILGLGFKEISVGDAVPVWYNMVDQNLVKEPVFSFWFNRN DEEQG
Sbjct: 181 TKEPSLTFVLAQFDGILGLGFKEISVGDAVPVWYNMVDQNLVKEPVFSFWFNRNEDEEQG 240

Query: 241 GEIVFGGVDPDHYKGEHTYVPVTKKGYWQFDMGDVLINGSTTGFCSGGCSAIADSGTSLL 300
           GEIVFGGVDP+HYKGEHTYVPVTKKGYWQFDMGDVLINGSTT FCSGGCSAIADSGTSLL
Sbjct: 241 GEIVFGGVDPNHYKGEHTYVPVTKKGYWQFDMGDVLINGSTTEFCSGGCSAIADSGTSLL 300

Query: 301 AGPTTIITQVNHAIGASGVVSEECKAVVAEYGETIIKMLLAKDQPKKICSTLGLCAFDGE 360
           AGPT+IITQVNHAIGASGVVSEECK VVAEYGETIIKMLLAKDQPKKICSTLGLCAFDG 
Sbjct: 301 AGPTSIITQVNHAIGASGVVSEECKTVVAEYGETIIKMLLAKDQPKKICSTLGLCAFDGT 360

Query: 361 RGVSMGIESVVDNTTQKSSNGLRDVMCNACEMAVVWAQSQLKDEKTQDQILNYIDGLCEK 420
           RGVSMGIESVVDNTTQKSSNGL DVMCNACEMAVVWAQSQLKDEKTQDQILNYIDGLCEK
Sbjct: 361 RGVSMGIESVVDNTTQKSSNGLHDVMCNACEMAVVWAQSQLKDEKTQDQILNYIDGLCEK 420

Query: 421 LPSPMGESVIDCDSLSTLPSISFTIGGKVFELKPEQYVLKVTEGPVTECISGFAALDVPP 480
           LPSPMGESVIDCDSLSTLPS+SFTIGGKVF+LKPEQYVLKVT+GPVTECISGFAALDVPP
Sbjct: 421 LPSPMGESVIDCDSLSTLPSVSFTIGGKVFDLKPEQYVLKVTQGPVTECISGFAALDVPP 480

Query: 481 PRGPLWILGDVFMGSYHTVFDYGNSRVGFAEAA 514
           PRGPLWILGDVFMG+YHTVFDYGN RVGFAEAA
Sbjct: 481 PRGPLWILGDVFMGTYHTVFDYGNLRVGFAEAA 507

BLAST of CSPI03G04660 vs. ExPASy TrEMBL
Match: A0A6J1F137 (aspartic proteinase A1-like OS=Cucurbita moschata OX=3662 GN=LOC111438595 PE=3 SV=1)

HSP 1 Score: 963.0 bits (2488), Expect = 5.0e-277
Identity = 477/513 (92.98%), Positives = 490/513 (95.52%), Query Frame = 0

Query: 1   MGTRLKLFIAVLFICFFMFPMVFCASNDGKVRIGLKRRKFGQNNRVASKIATKEGISLKN 60
           MGTRLKLF A+LFICF MFPMVF A NDGKVRIGLKRRKFGQN RVASKIATKE +SLKN
Sbjct: 1   MGTRLKLFTALLFICFLMFPMVFSAPNDGKVRIGLKRRKFGQNVRVASKIATKEEVSLKN 60

Query: 61  SVEKYQPSANLGDSDDFDIVGLKNYLNAQYFGEIGIGTPPQKFTVIFDTGSSNLWVPSSK 120
           SVEK      LGDS D DIV LKNYLNAQYFGEIGIGTPPQKFTVIFDTGSSNLWVPSSK
Sbjct: 61  SVEK------LGDSADTDIVALKNYLNAQYFGEIGIGTPPQKFTVIFDTGSSNLWVPSSK 120

Query: 121 CFSVACLLHSKYKSKRSSTYKKNGKSASIKYGTGAISGYFSEDNVKVGDLIVKKQDFIEA 180
           CFSVACLLHSKYKSKRSSTY+KNGK ASI+YGTGAISGYFSED+VKVGDLIVKKQ+FIEA
Sbjct: 121 CFSVACLLHSKYKSKRSSTYEKNGKHASIQYGTGAISGYFSEDHVKVGDLIVKKQEFIEA 180

Query: 181 TREPSLTFVLAQFDGILGLGFKEISVGDAVPVWYNMVDQNLVKEPVFSFWFNRNADEEQG 240
           T+EPSLTFVLAQFDGILGLGFKEISVGDAVPVWYNMVDQNLVKEPVFSFWFNRN DEEQG
Sbjct: 181 TKEPSLTFVLAQFDGILGLGFKEISVGDAVPVWYNMVDQNLVKEPVFSFWFNRNEDEEQG 240

Query: 241 GEIVFGGVDPDHYKGEHTYVPVTKKGYWQFDMGDVLINGSTTGFCSGGCSAIADSGTSLL 300
           GEIVFGGVDP+HYKGEHTYVPVTKKGYWQFDMGDVLINGSTT FCSGGCSAIADSGTSLL
Sbjct: 241 GEIVFGGVDPNHYKGEHTYVPVTKKGYWQFDMGDVLINGSTTEFCSGGCSAIADSGTSLL 300

Query: 301 AGPTTIITQVNHAIGASGVVSEECKAVVAEYGETIIKMLLAKDQPKKICSTLGLCAFDGE 360
           AGPT+IITQVNHAIGASGVVSEECK VVAEYGETIIKMLLAKDQPKKICSTLGLC FDG 
Sbjct: 301 AGPTSIITQVNHAIGASGVVSEECKTVVAEYGETIIKMLLAKDQPKKICSTLGLCTFDGT 360

Query: 361 RGVSMGIESVVDNTTQKSSNGLRDVMCNACEMAVVWAQSQLKDEKTQDQILNYIDGLCEK 420
           RGVSMGIESVVDNTTQKSSNGL DVMCNACEMAVVWAQSQLKDEKTQDQILNYIDGLCEK
Sbjct: 361 RGVSMGIESVVDNTTQKSSNGLHDVMCNACEMAVVWAQSQLKDEKTQDQILNYIDGLCEK 420

Query: 421 LPSPMGESVIDCDSLSTLPSISFTIGGKVFELKPEQYVLKVTEGPVTECISGFAALDVPP 480
           LPSPMGESVIDCDSLSTLPS+SFTIGGKVF+LKPEQYVLKVT+GPVTECISGFAALDVPP
Sbjct: 421 LPSPMGESVIDCDSLSTLPSVSFTIGGKVFDLKPEQYVLKVTQGPVTECISGFAALDVPP 480

Query: 481 PRGPLWILGDVFMGSYHTVFDYGNSRVGFAEAA 514
           PRGPLWILGDVFMG+YHTVFDYGN RVGFAEAA
Sbjct: 481 PRGPLWILGDVFMGTYHTVFDYGNLRVGFAEAA 507

BLAST of CSPI03G04660 vs. ExPASy TrEMBL
Match: A0A0A0L2W7 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G066750 PE=3 SV=1)

HSP 1 Score: 948.7 bits (2451), Expect = 9.8e-273
Identity = 473/513 (92.20%), Positives = 474/513 (92.40%), Query Frame = 0

Query: 1   MGTRLKLFIAVLFICFFMFPMVFCASNDGKVRIGLKRRKFGQNNRVASKIATKEGISLKN 60
           MGTRLKLFIAVLFICFFMFPMVFCASNDGKVRIGLKRRKFGQNNRVASKIATKEGISLKN
Sbjct: 61  MGTRLKLFIAVLFICFFMFPMVFCASNDGKVRIGLKRRKFGQNNRVASKIATKEGISLKN 120

Query: 61  SVEKYQPSANLGDSDDFDIVGLKNYLNAQYFGEIGIGTPPQKFTVIFDTGSSNLWVPSSK 120
           SVEKYQPSANLGDSDDFDIVGLKNYLNAQYFGEIGIGTPPQKF VIFDTGSSNLWVPSSK
Sbjct: 121 SVEKYQPSANLGDSDDFDIVGLKNYLNAQYFGEIGIGTPPQKFAVIFDTGSSNLWVPSSK 180

Query: 121 CFSVACLLHSKYKSKRSSTYKKNGKSASIKYGTGAISGYFSEDNVKVGDLIVKKQDFIEA 180
           CFSVACLLHSKYKSKRSSTYKKNGKSASIKYGTGAISGYFSEDNVKVGDLIVKKQDFIEA
Sbjct: 181 CFSVACLLHSKYKSKRSSTYKKNGKSASIKYGTGAISGYFSEDNVKVGDLIVKKQDFIEA 240

Query: 181 TREPSLTFVLAQFDGILGLGFKEISVGDAVPVWYNMVDQNLVKEPVFSFWFNRNADEEQG 240
           TREPSLTFVLAQFDGILGLGFKEISVGDAVPVWYNMVDQNLVKEPVFSFWFNRNADEEQG
Sbjct: 241 TREPSLTFVLAQFDGILGLGFKEISVGDAVPVWYNMVDQNLVKEPVFSFWFNRNADEEQG 300

Query: 241 GEIVFGGVDPDHYKGEHTYVPVTKKGYWQFDMGDVLINGSTTGFCSGGCSAIADSGTSLL 300
           GEIVFGGVDPDHYKGEHTYVPVTKKGYWQFDMGDVLINGSTTGFCSGGCSAIADSGTSLL
Sbjct: 301 GEIVFGGVDPDHYKGEHTYVPVTKKGYWQFDMGDVLINGSTTGFCSGGCSAIADSGTSLL 360

Query: 301 AGPTTIITQVNHAIGASGVVSEECKAVVAEYGETIIKMLLAKDQPKKICSTLGLCAFDGE 360
           AGPT                                      DQPKKICSTLGLCAFDGE
Sbjct: 361 AGPT--------------------------------------DQPKKICSTLGLCAFDGE 420

Query: 361 RGVSMGIESVVDNTTQKSSNGLRDVMCNACEMAVVWAQSQLKDEKTQDQILNYIDGLCEK 420
           RGVSMGIESVVDNTTQKSSNGLRDVMCNACEMAVVWAQSQLK+EKTQDQILNYIDGLCEK
Sbjct: 421 RGVSMGIESVVDNTTQKSSNGLRDVMCNACEMAVVWAQSQLKEEKTQDQILNYIDGLCEK 480

Query: 421 LPSPMGESVIDCDSLSTLPSISFTIGGKVFELKPEQYVLKVTEGPVTECISGFAALDVPP 480
           LPSPMGESVIDCDSLSTLPSISFTIGGKVFELKPEQYVLKVTEGPVTECISGFAALDVPP
Sbjct: 481 LPSPMGESVIDCDSLSTLPSISFTIGGKVFELKPEQYVLKVTEGPVTECISGFAALDVPP 535

Query: 481 PRGPLWILGDVFMGSYHTVFDYGNSRVGFAEAA 514
           PRGPLWILGDVFMGSYHTVFDYGNSRVGFAEAA
Sbjct: 541 PRGPLWILGDVFMGSYHTVFDYGNSRVGFAEAA 535

BLAST of CSPI03G04660 vs. ExPASy TrEMBL
Match: A0A6J1DCK4 (aspartic proteinase A1-like OS=Momordica charantia OX=3673 GN=LOC111019253 PE=3 SV=1)

HSP 1 Score: 912.5 bits (2357), Expect = 7.8e-262
Identity = 443/514 (86.19%), Positives = 475/514 (92.41%), Query Frame = 0

Query: 1   MGTRLKLFIAVLFICFFMFPMVFCASNDGKVRIGLKRRKFGQNNRVASKIATKEGISLKN 60
           MGTRLKL  AVLFICF M PMV  ASND KVRIGLKRR F Q+NRVASKIATKEG+SL++
Sbjct: 1   MGTRLKLLTAVLFICFLMLPMVISASNDDKVRIGLKRRNFDQHNRVASKIATKEGVSLRD 60

Query: 61  SVEKYQPSANLGDSDDFDIVGLKNYLNAQYFGEIGIGTPPQKFTVIFDTGSSNLWVPSSK 120
           SVEKYQPS NLGDS D DIV LKNY+NAQYFGEIGIGTPPQKFTVIFDTGSSNLWVPSSK
Sbjct: 61  SVEKYQPSENLGDSADTDIVALKNYMNAQYFGEIGIGTPPQKFTVIFDTGSSNLWVPSSK 120

Query: 121 CFSVACLLHSKYKSKRSSTYKKNGKSASIKYGTGAISGYFSEDNVKVGDLIVKKQDFIEA 180
           C SVACLLHSKYKS+RSSTYKKNGK ASIKYGTGAISGYFS D+VKVGDL+VK QDFIEA
Sbjct: 121 CLSVACLLHSKYKSRRSSTYKKNGKRASIKYGTGAISGYFSRDHVKVGDLVVKNQDFIEA 180

Query: 181 TREPSLTFVLAQFDGILGLGFKEISVGDAVPVWYNMVDQNLVKEPVFSFWFNRNADEEQG 240
           TREPSLTF+LA FDGILGLGFKEISVGDAVPVWYNMVDQ LVKEPVFSFWFNRN DEE+G
Sbjct: 181 TREPSLTFLLAHFDGILGLGFKEISVGDAVPVWYNMVDQGLVKEPVFSFWFNRNGDEEEG 240

Query: 241 GEIVFGGVDPDHYKGEHTYVPVTKKGYWQFDMGDVLINGSTTGFCSGGCSAIADSGTSLL 300
           GEIVFGGVDP+HYKGEHTYVPVTKKGYWQFDMGDV ING TTGFCS GCSAIADSGTSLL
Sbjct: 241 GEIVFGGVDPNHYKGEHTYVPVTKKGYWQFDMGDVRINGKTTGFCSNGCSAIADSGTSLL 300

Query: 301 AGPTTIITQVNHAIGASGVVSEECKAVVAEYGETIIKMLLAKDQPKKICSTLGLCAFDGE 360
           AGPT+IITQVNHAIGASG+VSEECK +V+EYGETIIK+LL K++PKKICSTLGLC FDG 
Sbjct: 301 AGPTSIITQVNHAIGASGIVSEECKTLVSEYGETIIKLLLEKEEPKKICSTLGLCTFDGT 360

Query: 361 RGVSMGIESVVDNTTQKSSNGLRD-VMCNACEMAVVWAQSQLKDEKTQDQILNYIDGLCE 420
           RGVS GI SV+DNTTQ +SNGL D +MC ACEM VVW QSQL DE+TQD+IL+YI+ LCE
Sbjct: 361 RGVSTGIASVLDNTTQTTSNGLHDSLMCTACEMVVVWIQSQLADEQTQDRILDYINRLCE 420

Query: 421 KLPSPMGESVIDCDSLSTLPSISFTIGGKVFELKPEQYVLKVTEGPVTECISGFAALDVP 480
           KLPSPMGESVIDCDSLSTLP+ISFTIGGKVF+LKPE+YV+KVT+GP+T+CISGFAALDVP
Sbjct: 421 KLPSPMGESVIDCDSLSTLPNISFTIGGKVFDLKPEEYVIKVTQGPITQCISGFAALDVP 480

Query: 481 PPRGPLWILGDVFMGSYHTVFDYGNSRVGFAEAA 514
           PPRGPLWILGDVFMG YHTVFDYGN RVGFAEAA
Sbjct: 481 PPRGPLWILGDVFMGPYHTVFDYGNLRVGFAEAA 514

BLAST of CSPI03G04660 vs. NCBI nr
Match: XP_004151126.1 (aspartic proteinase A1 [Cucumis sativus] >XP_011650512.1 aspartic proteinase A1 [Cucumis sativus] >KAE8650121.1 hypothetical protein Csa_009507 [Cucumis sativus])

HSP 1 Score: 1037.3 bits (2681), Expect = 4.3e-299
Identity = 511/513 (99.61%), Positives = 512/513 (99.81%), Query Frame = 0

Query: 1   MGTRLKLFIAVLFICFFMFPMVFCASNDGKVRIGLKRRKFGQNNRVASKIATKEGISLKN 60
           MGTRLKLFIAVLFICFFMFPMVFCASNDGKVRIGLKRRKFGQNNRVASKIATKEGISLKN
Sbjct: 1   MGTRLKLFIAVLFICFFMFPMVFCASNDGKVRIGLKRRKFGQNNRVASKIATKEGISLKN 60

Query: 61  SVEKYQPSANLGDSDDFDIVGLKNYLNAQYFGEIGIGTPPQKFTVIFDTGSSNLWVPSSK 120
           SVEKYQPSANLGDSDDFDIVGLKNYLNAQYFGEIGIGTPPQKF VIFDTGSSNLWVPSSK
Sbjct: 61  SVEKYQPSANLGDSDDFDIVGLKNYLNAQYFGEIGIGTPPQKFAVIFDTGSSNLWVPSSK 120

Query: 121 CFSVACLLHSKYKSKRSSTYKKNGKSASIKYGTGAISGYFSEDNVKVGDLIVKKQDFIEA 180
           CFSVACLLHSKYKSKRSSTYKKNGKSASIKYGTGAISGYFSEDNVKVGDLIVKKQDFIEA
Sbjct: 121 CFSVACLLHSKYKSKRSSTYKKNGKSASIKYGTGAISGYFSEDNVKVGDLIVKKQDFIEA 180

Query: 181 TREPSLTFVLAQFDGILGLGFKEISVGDAVPVWYNMVDQNLVKEPVFSFWFNRNADEEQG 240
           TREPSLTFVLAQFDGILGLGFKEISVGDAVPVWYNMVDQNLVKEPVFSFWFNRNADEEQG
Sbjct: 181 TREPSLTFVLAQFDGILGLGFKEISVGDAVPVWYNMVDQNLVKEPVFSFWFNRNADEEQG 240

Query: 241 GEIVFGGVDPDHYKGEHTYVPVTKKGYWQFDMGDVLINGSTTGFCSGGCSAIADSGTSLL 300
           GEIVFGGVDPDHYKGEHTYVPVTKKGYWQFDMGDVLINGSTTGFCSGGCSAIADSGTSLL
Sbjct: 241 GEIVFGGVDPDHYKGEHTYVPVTKKGYWQFDMGDVLINGSTTGFCSGGCSAIADSGTSLL 300

Query: 301 AGPTTIITQVNHAIGASGVVSEECKAVVAEYGETIIKMLLAKDQPKKICSTLGLCAFDGE 360
           AGPTTIITQVNHAIGASGVVSEECKAVVAEYGETIIKMLLAKDQPKKICSTLGLCAFDGE
Sbjct: 301 AGPTTIITQVNHAIGASGVVSEECKAVVAEYGETIIKMLLAKDQPKKICSTLGLCAFDGE 360

Query: 361 RGVSMGIESVVDNTTQKSSNGLRDVMCNACEMAVVWAQSQLKDEKTQDQILNYIDGLCEK 420
           RGVSMGIESVVDNTTQKSSNGLRDVMCNACEMAVVWAQSQLK+EKTQDQILNYIDGLCEK
Sbjct: 361 RGVSMGIESVVDNTTQKSSNGLRDVMCNACEMAVVWAQSQLKEEKTQDQILNYIDGLCEK 420

Query: 421 LPSPMGESVIDCDSLSTLPSISFTIGGKVFELKPEQYVLKVTEGPVTECISGFAALDVPP 480
           LPSPMGESVIDCDSLSTLPSISFTIGGKVFELKPEQYVLKVTEGPVTECISGFAALDVPP
Sbjct: 421 LPSPMGESVIDCDSLSTLPSISFTIGGKVFELKPEQYVLKVTEGPVTECISGFAALDVPP 480

Query: 481 PRGPLWILGDVFMGSYHTVFDYGNSRVGFAEAA 514
           PRGPLWILGDVFMGSYHTVFDYGNSRVGFAEAA
Sbjct: 481 PRGPLWILGDVFMGSYHTVFDYGNSRVGFAEAA 513

BLAST of CSPI03G04660 vs. NCBI nr
Match: KAA0065836.1 (aspartic proteinase A1-like [Cucumis melo var. makuwa] >TYK24045.1 aspartic proteinase A1-like [Cucumis melo var. makuwa])

HSP 1 Score: 1020.0 bits (2636), Expect = 7.1e-294
Identity = 504/516 (97.67%), Positives = 509/516 (98.64%), Query Frame = 0

Query: 1   MGTRLKLFIAVLFICFFMFPMVFCASNDGKVRIGLKRRKFGQNNRVASKIATKEGISLKN 60
           M TRLKLFIAVLFICF MFPMVFCASNDGKVRIGLKRRKFGQN+RVASKIATKEGISLKN
Sbjct: 1   MSTRLKLFIAVLFICFLMFPMVFCASNDGKVRIGLKRRKFGQNHRVASKIATKEGISLKN 60

Query: 61  SVEKYQPSANLGDSDDFDIVGLKNYLNAQYFGEIGIGTPPQKFTVIFDTGSSNLWVPSSK 120
            VEKYQPSANLGDSDDFDIVGLKNYLNAQYFGEIGIGTPPQKFTV+FDTGSSNLWVPSSK
Sbjct: 61  YVEKYQPSANLGDSDDFDIVGLKNYLNAQYFGEIGIGTPPQKFTVVFDTGSSNLWVPSSK 120

Query: 121 CFSVACLLHSKYKSKRSSTYKKNGKSASIKYGTGAISGYFSEDNVKVGDLIVKKQDFIEA 180
           CFSVACLLHSKYKSKRSSTYKKNGK ASIKYGTGAISGYFSEDNVKVGDLIVKKQDFIEA
Sbjct: 121 CFSVACLLHSKYKSKRSSTYKKNGKPASIKYGTGAISGYFSEDNVKVGDLIVKKQDFIEA 180

Query: 181 TREPSLTFVLAQFDGILGLGFKEISVGDAVPVW---YNMVDQNLVKEPVFSFWFNRNADE 240
           TREPSLTFVLAQFDGILGLGFKEISVGDAVPVW   Y+MVDQNLVKEPVFSFWFNRNADE
Sbjct: 181 TREPSLTFVLAQFDGILGLGFKEISVGDAVPVWYYRYSMVDQNLVKEPVFSFWFNRNADE 240

Query: 241 EQGGEIVFGGVDPDHYKGEHTYVPVTKKGYWQFDMGDVLINGSTTGFCSGGCSAIADSGT 300
           EQGGEIVFGGVDPDHYKGEHTYVPVTKKGYWQFDMGDVLINGSTTGFCSGGCSAIADSGT
Sbjct: 241 EQGGEIVFGGVDPDHYKGEHTYVPVTKKGYWQFDMGDVLINGSTTGFCSGGCSAIADSGT 300

Query: 301 SLLAGPTTIITQVNHAIGASGVVSEECKAVVAEYGETIIKMLLAKDQPKKICSTLGLCAF 360
           SLLAGPTTIITQVNHAIGASGVVSEECKAVVAEYGETIIKMLLAKDQPKKICSTLGLCAF
Sbjct: 301 SLLAGPTTIITQVNHAIGASGVVSEECKAVVAEYGETIIKMLLAKDQPKKICSTLGLCAF 360

Query: 361 DGERGVSMGIESVVDNTTQKSSNGLRDVMCNACEMAVVWAQSQLKDEKTQDQILNYIDGL 420
           DGERGVSMGIESVVDNTTQKSSNGLRDVMCNACEMAVVWAQSQLKDEKTQDQILNYIDGL
Sbjct: 361 DGERGVSMGIESVVDNTTQKSSNGLRDVMCNACEMAVVWAQSQLKDEKTQDQILNYIDGL 420

Query: 421 CEKLPSPMGESVIDCDSLSTLPSISFTIGGKVFELKPEQYVLKVTEGPVTECISGFAALD 480
           CEKLPSPMGESVIDCDSLSTLP+I+FTIGGKVFELKPEQYVLKVTEGPVTECISGFAALD
Sbjct: 421 CEKLPSPMGESVIDCDSLSTLPNIAFTIGGKVFELKPEQYVLKVTEGPVTECISGFAALD 480

Query: 481 VPPPRGPLWILGDVFMGSYHTVFDYGNSRVGFAEAA 514
           VPPPRGPLWILGDVFMGSYHTVFDYGNSRVGFAEAA
Sbjct: 481 VPPPRGPLWILGDVFMGSYHTVFDYGNSRVGFAEAA 516

BLAST of CSPI03G04660 vs. NCBI nr
Match: XP_038905979.1 (aspartic proteinase A1-like [Benincasa hispida])

HSP 1 Score: 975.3 bits (2520), Expect = 2.0e-280
Identity = 486/513 (94.74%), Positives = 493/513 (96.10%), Query Frame = 0

Query: 1   MGTRLKLFIAVLFICFFMFPMVFCASNDGKVRIGLKRRKFGQNNRVASKIATKEGISLKN 60
           MGTRLKL  AVLFICF MFPMVF ASNDGKVRIGLKRRKFG  NRVASKIA KEG+SLK+
Sbjct: 1   MGTRLKLCAAVLFICFLMFPMVFSASNDGKVRIGLKRRKFGL-NRVASKIANKEGVSLKD 60

Query: 61  SVEKYQPSANLGDSDDFDIVGLKNYLNAQYFGEIGIGTPPQKFTVIFDTGSSNLWVPSSK 120
           SVE    SANLG SDD DIVGLKNYLNAQYFGEIGIGTPPQKFTV+FDTGSSNLWVPSSK
Sbjct: 61  SVE----SANLGGSDDIDIVGLKNYLNAQYFGEIGIGTPPQKFTVVFDTGSSNLWVPSSK 120

Query: 121 CFSVACLLHSKYKSKRSSTYKKNGKSASIKYGTGAISGYFSEDNVKVGDLIVKKQDFIEA 180
           CFSVACLLHSKYKSKRSSTYKKNGK ASIKYGTGAISGYFSEDNVKVGDLI+KKQDFIEA
Sbjct: 121 CFSVACLLHSKYKSKRSSTYKKNGKPASIKYGTGAISGYFSEDNVKVGDLIIKKQDFIEA 180

Query: 181 TREPSLTFVLAQFDGILGLGFKEISVGDAVPVWYNMVDQNLVKEPVFSFWFNRNADEEQG 240
           TREPSLTFVLAQFDGILGLGFKEISVGDAVPVWYNMVDQNLVKEPVFSFWFNRN DEEQG
Sbjct: 181 TREPSLTFVLAQFDGILGLGFKEISVGDAVPVWYNMVDQNLVKEPVFSFWFNRNVDEEQG 240

Query: 241 GEIVFGGVDPDHYKGEHTYVPVTKKGYWQFDMGDVLINGSTTGFCSGGCSAIADSGTSLL 300
           GEIVFGGVDP+HYKGEHTYVPVTKKGYWQFDMGDVLINGSTT FCSGGCSAIADSGTSLL
Sbjct: 241 GEIVFGGVDPNHYKGEHTYVPVTKKGYWQFDMGDVLINGSTTEFCSGGCSAIADSGTSLL 300

Query: 301 AGPTTIITQVNHAIGASGVVSEECKAVVAEYGETIIKMLLAKDQPKKICSTLGLCAFDGE 360
           AGPT IITQVNHAIGASGVVSEECKAVVAEYGETIIKMLLAKDQPKKICSTLGLC FDGE
Sbjct: 301 AGPTAIITQVNHAIGASGVVSEECKAVVAEYGETIIKMLLAKDQPKKICSTLGLCTFDGE 360

Query: 361 RGVSMGIESVVDNTTQKSSNGLRDVMCNACEMAVVWAQSQLKDEKTQDQILNYIDGLCEK 420
           RGVSMGIESVVDNTT KSSNGLRDVMCNACEMAVVWAQSQLK+E+TQDQILNYIDGLCEK
Sbjct: 361 RGVSMGIESVVDNTTSKSSNGLRDVMCNACEMAVVWAQSQLKNEQTQDQILNYIDGLCEK 420

Query: 421 LPSPMGESVIDCDSLSTLPSISFTIGGKVFELKPEQYVLKVTEGPVTECISGFAALDVPP 480
           LPSPMGESVIDCDSLSTLPSISFTIGGKVFELKPEQYVLKVTEGPVTECISGFAALDVPP
Sbjct: 421 LPSPMGESVIDCDSLSTLPSISFTIGGKVFELKPEQYVLKVTEGPVTECISGFAALDVPP 480

Query: 481 PRGPLWILGDVFMGSYHTVFDYGNSRVGFAEAA 514
           PRGPLWILGDVFMGSYHTVFDYGN RVGFAEAA
Sbjct: 481 PRGPLWILGDVFMGSYHTVFDYGNLRVGFAEAA 508

BLAST of CSPI03G04660 vs. NCBI nr
Match: XP_023007789.1 (aspartic proteinase A1-like [Cucurbita maxima])

HSP 1 Score: 968.8 bits (2503), Expect = 1.9e-278
Identity = 479/513 (93.37%), Positives = 493/513 (96.10%), Query Frame = 0

Query: 1   MGTRLKLFIAVLFICFFMFPMVFCASNDGKVRIGLKRRKFGQNNRVASKIATKEGISLKN 60
           MGTR+KLF A+LFICF MFPMVF ASNDGKVRIGLKRRKFGQN RVASKIATKEG+SLKN
Sbjct: 1   MGTRIKLFTALLFICFLMFPMVFSASNDGKVRIGLKRRKFGQNIRVASKIATKEGVSLKN 60

Query: 61  SVEKYQPSANLGDSDDFDIVGLKNYLNAQYFGEIGIGTPPQKFTVIFDTGSSNLWVPSSK 120
           SVEK      LGDS D DIV LKNYLNAQYFGEIGIGTPPQKFTVIFDTGSSNLWVPSSK
Sbjct: 61  SVEK------LGDSADTDIVALKNYLNAQYFGEIGIGTPPQKFTVIFDTGSSNLWVPSSK 120

Query: 121 CFSVACLLHSKYKSKRSSTYKKNGKSASIKYGTGAISGYFSEDNVKVGDLIVKKQDFIEA 180
           CFSVACLLHSKYKSKRSSTY+KNGK ASI+YGTGAISGYFSED+VKVGDLIVKKQ+FIEA
Sbjct: 121 CFSVACLLHSKYKSKRSSTYEKNGKHASIQYGTGAISGYFSEDHVKVGDLIVKKQEFIEA 180

Query: 181 TREPSLTFVLAQFDGILGLGFKEISVGDAVPVWYNMVDQNLVKEPVFSFWFNRNADEEQG 240
           T+EPSLTFVLAQFDGILGLGFKEISVGDAVPVWYNMVDQNLVKEPVFSFWFNRN DEEQG
Sbjct: 181 TKEPSLTFVLAQFDGILGLGFKEISVGDAVPVWYNMVDQNLVKEPVFSFWFNRNEDEEQG 240

Query: 241 GEIVFGGVDPDHYKGEHTYVPVTKKGYWQFDMGDVLINGSTTGFCSGGCSAIADSGTSLL 300
           GEIVFGGVDP+HYKGEHTYVPVTKKGYWQFDMGDVLINGSTT FCSGGCSAIADSGTSLL
Sbjct: 241 GEIVFGGVDPNHYKGEHTYVPVTKKGYWQFDMGDVLINGSTTEFCSGGCSAIADSGTSLL 300

Query: 301 AGPTTIITQVNHAIGASGVVSEECKAVVAEYGETIIKMLLAKDQPKKICSTLGLCAFDGE 360
           AGPT+IITQVNHAIGASGVVSEECK VVAEYGETIIKMLLAKDQPKKICSTLGLCAFDG 
Sbjct: 301 AGPTSIITQVNHAIGASGVVSEECKTVVAEYGETIIKMLLAKDQPKKICSTLGLCAFDGT 360

Query: 361 RGVSMGIESVVDNTTQKSSNGLRDVMCNACEMAVVWAQSQLKDEKTQDQILNYIDGLCEK 420
           RGVSMGIESVVDNTTQKSSNGL DVMCNACEMAVVWAQSQLKDEKTQDQILNYIDGLCEK
Sbjct: 361 RGVSMGIESVVDNTTQKSSNGLHDVMCNACEMAVVWAQSQLKDEKTQDQILNYIDGLCEK 420

Query: 421 LPSPMGESVIDCDSLSTLPSISFTIGGKVFELKPEQYVLKVTEGPVTECISGFAALDVPP 480
           LPSPMGESVIDCDSLSTLPS+SFTIGGKVF+LKPEQYVLKVT+GPVTECISGFAALDVPP
Sbjct: 421 LPSPMGESVIDCDSLSTLPSVSFTIGGKVFDLKPEQYVLKVTQGPVTECISGFAALDVPP 480

Query: 481 PRGPLWILGDVFMGSYHTVFDYGNSRVGFAEAA 514
           PRGPLWILGDVFMG+YHTVFDYGN RVGFAEAA
Sbjct: 481 PRGPLWILGDVFMGTYHTVFDYGNLRVGFAEAA 507

BLAST of CSPI03G04660 vs. NCBI nr
Match: XP_023553350.1 (aspartic proteinase A1-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 968.0 bits (2501), Expect = 3.2e-278
Identity = 479/513 (93.37%), Positives = 492/513 (95.91%), Query Frame = 0

Query: 1   MGTRLKLFIAVLFICFFMFPMVFCASNDGKVRIGLKRRKFGQNNRVASKIATKEGISLKN 60
           MGTRLKLF A+LFICF MFPMVF ASNDGKVRIGLKRRKFGQN RVASKIATKEG+SLKN
Sbjct: 1   MGTRLKLFTALLFICFLMFPMVFSASNDGKVRIGLKRRKFGQNIRVASKIATKEGVSLKN 60

Query: 61  SVEKYQPSANLGDSDDFDIVGLKNYLNAQYFGEIGIGTPPQKFTVIFDTGSSNLWVPSSK 120
           SVEK      LGDS D DIV LKNYLNAQYFGEIGIGTPPQKFTVIFDTGSSNLWVPSSK
Sbjct: 61  SVEK------LGDSADTDIVALKNYLNAQYFGEIGIGTPPQKFTVIFDTGSSNLWVPSSK 120

Query: 121 CFSVACLLHSKYKSKRSSTYKKNGKSASIKYGTGAISGYFSEDNVKVGDLIVKKQDFIEA 180
           CFSVACLLHSKYKSKRSSTY+KNGK ASI+YGTGAISGYFSED+VKVGDLIVKKQ+FIEA
Sbjct: 121 CFSVACLLHSKYKSKRSSTYEKNGKHASIQYGTGAISGYFSEDHVKVGDLIVKKQEFIEA 180

Query: 181 TREPSLTFVLAQFDGILGLGFKEISVGDAVPVWYNMVDQNLVKEPVFSFWFNRNADEEQG 240
           T+EPSLTFVLAQFDGILGLGFKEISVGDAVPVWYNMVDQNLVKEPVFSFWFNRN DEEQG
Sbjct: 181 TKEPSLTFVLAQFDGILGLGFKEISVGDAVPVWYNMVDQNLVKEPVFSFWFNRNEDEEQG 240

Query: 241 GEIVFGGVDPDHYKGEHTYVPVTKKGYWQFDMGDVLINGSTTGFCSGGCSAIADSGTSLL 300
           GEIVFGGVDP+HYKGEHTYVPVTKKGYWQFDMGDVLINGSTT FCSGGCSAIADSGTSLL
Sbjct: 241 GEIVFGGVDPNHYKGEHTYVPVTKKGYWQFDMGDVLINGSTTEFCSGGCSAIADSGTSLL 300

Query: 301 AGPTTIITQVNHAIGASGVVSEECKAVVAEYGETIIKMLLAKDQPKKICSTLGLCAFDGE 360
           AGPT+IITQVNHAIGASGVVSEECK VVAEYGETIIKMLLAKDQPKKICSTLGLC FDG 
Sbjct: 301 AGPTSIITQVNHAIGASGVVSEECKTVVAEYGETIIKMLLAKDQPKKICSTLGLCTFDGT 360

Query: 361 RGVSMGIESVVDNTTQKSSNGLRDVMCNACEMAVVWAQSQLKDEKTQDQILNYIDGLCEK 420
           RGVSMGIESVVDNTTQKSSNGL DVMCNACEMAVVWAQSQLKDEKTQDQILNYIDGLCEK
Sbjct: 361 RGVSMGIESVVDNTTQKSSNGLHDVMCNACEMAVVWAQSQLKDEKTQDQILNYIDGLCEK 420

Query: 421 LPSPMGESVIDCDSLSTLPSISFTIGGKVFELKPEQYVLKVTEGPVTECISGFAALDVPP 480
           LPSPMGESVIDCDSLSTLPS+SFTIGGKVF+LKPEQYVLKVT+GPVTECISGFAALDVPP
Sbjct: 421 LPSPMGESVIDCDSLSTLPSVSFTIGGKVFDLKPEQYVLKVTQGPVTECISGFAALDVPP 480

Query: 481 PRGPLWILGDVFMGSYHTVFDYGNSRVGFAEAA 514
           PRGPLWILGDVFMG+YHTVFDYGN RVGFAEAA
Sbjct: 481 PRGPLWILGDVFMGTYHTVFDYGNLRVGFAEAA 507

BLAST of CSPI03G04660 vs. TAIR 10
Match: AT1G11910.1 (aspartic proteinase A1 )

HSP 1 Score: 743.4 bits (1918), Expect = 1.2e-214
Identity = 353/503 (70.18%), Positives = 418/503 (83.10%), Query Frame = 0

Query: 12  LFICFFMFPMVFCASNDGKVRIGLKRRKFGQNNRVASKIATKEGISLKNSVEKYQPSANL 71
           L + F +    F   NDG  R+GLK+ K    NR+A+++ +K+        EK   +  L
Sbjct: 12  LIVSFLLCFSAFAERNDGTFRVGLKKLKLDSKNRLAARVESKQ--------EKPLRAYRL 71

Query: 72  GDSDDFDIVGLKNYLNAQYFGEIGIGTPPQKFTVIFDTGSSNLWVPSSKC-FSVACLLHS 131
           GDS D D+V LKNYL+AQY+GEI IGTPPQKFTV+FDTGSSNLWVPSSKC FS+ACLLH 
Sbjct: 72  GDSGDADVVVLKNYLDAQYYGEIAIGTPPQKFTVVFDTGSSNLWVPSSKCYFSLACLLHP 131

Query: 132 KYKSKRSSTYKKNGKSASIKYGTGAISGYFSEDNVKVGDLIVKKQDFIEATREPSLTFVL 191
           KYKS RSSTY+KNGK+A+I YGTGAI+G+FS D V VGDL+VK Q+FIEAT+EP +TFV+
Sbjct: 132 KYKSSRSSTYEKNGKAAAIHYGTGAIAGFFSNDAVTVGDLVVKDQEFIEATKEPGITFVV 191

Query: 192 AQFDGILGLGFKEISVGDAVPVWYNMVDQNLVKEPVFSFWFNRNADEEQGGEIVFGGVDP 251
           A+FDGILGLGF+EISVG A PVWYNM+ Q L+KEPVFSFW NRNADEE+GGE+VFGGVDP
Sbjct: 192 AKFDGILGLGFQEISVGKAAPVWYNMLKQGLIKEPVFSFWLNRNADEEEGGELVFGGVDP 251

Query: 252 DHYKGEHTYVPVTKKGYWQFDMGDVLINGSTTGFCSGGCSAIADSGTSLLAGPTTIITQV 311
           +H+KG+HTYVPVT+KGYWQFDMGDVLI G+ TGFC  GCSAIADSGTSLLAGPTTIIT +
Sbjct: 252 NHFKGKHTYVPVTQKGYWQFDMGDVLIGGAPTGFCESGCSAIADSGTSLLAGPTTIITMI 311

Query: 312 NHAIGASGVVSEECKAVVAEYGETIIKMLLAKDQPKKICSTLGLCAFDGERGVSMGIESV 371
           NHAIGA+GVVS++CK VV +YG+TI+ +LL++ QPKKICS +GLC FDG RGVSMGIESV
Sbjct: 312 NHAIGAAGVVSQQCKTVVDQYGQTILDLLLSETQPKKICSQIGLCTFDGTRGVSMGIESV 371

Query: 372 VDNTTQKSSNGLRDVMCNACEMAVVWAQSQLKDEKTQDQILNYIDGLCEKLPSPMGESVI 431
           VD    K SNG+ D  C+ACEMAVVW QSQL+   TQ++ILNY++ LCE+LPSPMGES +
Sbjct: 372 VDKENAKLSNGVGDAACSACEMAVVWIQSQLRQNMTQERILNYVNELCERLPSPMGESAV 431

Query: 432 DCDSLSTLPSISFTIGGKVFELKPEQYVLKVTEGPVTECISGFAALDVPPPRGPLWILGD 491
           DC  LST+P++S TIGGKVF+L PE+YVLKV EGPV +CISGF ALDV PPRGPLWILGD
Sbjct: 432 DCAQLSTMPTVSLTIGGKVFDLAPEEYVLKVGEGPVAQCISGFIALDVAPPRGPLWILGD 491

Query: 492 VFMGSYHTVFDYGNSRVGFAEAA 514
           VFMG YHTVFD+GN +VGFAEAA
Sbjct: 492 VFMGKYHTVFDFGNEQVGFAEAA 506

BLAST of CSPI03G04660 vs. TAIR 10
Match: AT1G62290.1 (Saposin-like aspartyl protease family protein )

HSP 1 Score: 715.7 bits (1846), Expect = 2.7e-206
Identity = 334/513 (65.11%), Positives = 415/513 (80.90%), Query Frame = 0

Query: 1   MGTRLKLFIAVLFICFFMFPMVFCASNDGKVRIGLKRRKFGQNNRVASKIATKEGISLKN 60
           MG   +     +F+ F +F   +   NDG  R+GLK+ K   NNR+A++  +K+  +L++
Sbjct: 1   MGVYSRAVAFSVFVSFLLFFTAYSKRNDGTFRVGLKKLKLDPNNRLATRFGSKQEEALRS 60

Query: 61  SVEKYQPSANLGDSDDFDIVGLKNYLNAQYFGEIGIGTPPQKFTVIFDTGSSNLWVPSSK 120
           S+  Y  +   GDS D DIV LKNYL+AQY+GEI IGTPPQKFTVIFDTGSSNLWVPS K
Sbjct: 61  SLRSYNNNLG-GDSGDADIVPLKNYLDAQYYGEIAIGTPPQKFTVIFDTGSSNLWVPSGK 120

Query: 121 C-FSVACLLHSKYKSKRSSTYKKNGKSASIKYGTGAISGYFSEDNVKVGDLIVKKQDFIE 180
           C FS++C  H+KYKS RSSTYKK+GK A+I YG+G+ISG+FS D V VGDL+VK Q+FIE
Sbjct: 121 CFFSLSCYFHAKYKSSRSSTYKKSGKRAAIHYGSGSISGFFSYDAVTVGDLVVKDQEFIE 180

Query: 181 ATREPSLTFVLAQFDGILGLGFKEISVGDAVPVWYNMVDQNLVKEPVFSFWFNRNADEEQ 240
            T EP LTF++A+FDG+LGLGF+EI+VG+A PVWYNM+ Q L+K PVFSFW NR+   E+
Sbjct: 181 TTSEPGLTFLVAKFDGLLGLGFQEIAVGNATPVWYNMLKQGLIKRPVFSFWLNRDPKSEE 240

Query: 241 GGEIVFGGVDPDHYKGEHTYVPVTKKGYWQFDMGDVLINGSTTGFCSGGCSAIADSGTSL 300
           GGEIVFGGVDP H++GEHT+VPVT++GYWQFDMG+VLI G +TG+C  GCSAIADSGTSL
Sbjct: 241 GGEIVFGGVDPKHFRGEHTFVPVTQRGYWQFDMGEVLIAGESTGYCGSGCSAIADSGTSL 300

Query: 301 LAGPTTIITQVNHAIGASGVVSEECKAVVAEYGETIIKMLLAKDQPKKICSTLGLCAFDG 360
           LAGPT ++  +N AIGASGVVS++CK VV +YG+TI+ +LLA+ QPKKICS +GLCA+DG
Sbjct: 301 LAGPTAVVAMINKAIGASGVVSQQCKTVVDQYGQTILDLLLAETQPKKICSQIGLCAYDG 360

Query: 361 ERGVSMGIESVVDNTTQKSSNGLRDVMCNACEMAVVWAQSQLKDEKTQDQILNYIDGLCE 420
             GVSMGIESVVD    +SS+GLRD  C ACEMAVVW QSQL+   TQ++I+NYI+ +CE
Sbjct: 361 THGVSMGIESVVDKENTRSSSGLRDAGCPACEMAVVWIQSQLRQNMTQERIVNYINEICE 420

Query: 421 KLPSPMGESVIDCDSLSTLPSISFTIGGKVFELKPEQYVLKVTEGPVTECISGFAALDVP 480
           ++PSP GES +DC  LS +P++SFTIGGKVF+L PE+YVLK+ EGPV +CISGF ALD+P
Sbjct: 421 RMPSPNGESAVDCSQLSKMPTVSFTIGGKVFDLAPEEYVLKIGEGPVAQCISGFTALDIP 480

Query: 481 PPRGPLWILGDVFMGSYHTVFDYGNSRVGFAEA 513
           PPRGPLWILGDVFMG YHTVFD+GN +VGFAEA
Sbjct: 481 PPRGPLWILGDVFMGKYHTVFDFGNEQVGFAEA 512

BLAST of CSPI03G04660 vs. TAIR 10
Match: AT1G62290.2 (Saposin-like aspartyl protease family protein )

HSP 1 Score: 715.7 bits (1846), Expect = 2.7e-206
Identity = 334/513 (65.11%), Positives = 415/513 (80.90%), Query Frame = 0

Query: 1   MGTRLKLFIAVLFICFFMFPMVFCASNDGKVRIGLKRRKFGQNNRVASKIATKEGISLKN 60
           MG   +     +F+ F +F   +   NDG  R+GLK+ K   NNR+A++  +K+  +L++
Sbjct: 1   MGVYSRAVAFSVFVSFLLFFTAYSKRNDGTFRVGLKKLKLDPNNRLATRFGSKQEEALRS 60

Query: 61  SVEKYQPSANLGDSDDFDIVGLKNYLNAQYFGEIGIGTPPQKFTVIFDTGSSNLWVPSSK 120
           S+  Y  +   GDS D DIV LKNYL+AQY+GEI IGTPPQKFTVIFDTGSSNLWVPS K
Sbjct: 61  SLRSYNNNLG-GDSGDADIVPLKNYLDAQYYGEIAIGTPPQKFTVIFDTGSSNLWVPSGK 120

Query: 121 C-FSVACLLHSKYKSKRSSTYKKNGKSASIKYGTGAISGYFSEDNVKVGDLIVKKQDFIE 180
           C FS++C  H+KYKS RSSTYKK+GK A+I YG+G+ISG+FS D V VGDL+VK Q+FIE
Sbjct: 121 CFFSLSCYFHAKYKSSRSSTYKKSGKRAAIHYGSGSISGFFSYDAVTVGDLVVKDQEFIE 180

Query: 181 ATREPSLTFVLAQFDGILGLGFKEISVGDAVPVWYNMVDQNLVKEPVFSFWFNRNADEEQ 240
            T EP LTF++A+FDG+LGLGF+EI+VG+A PVWYNM+ Q L+K PVFSFW NR+   E+
Sbjct: 181 TTSEPGLTFLVAKFDGLLGLGFQEIAVGNATPVWYNMLKQGLIKRPVFSFWLNRDPKSEE 240

Query: 241 GGEIVFGGVDPDHYKGEHTYVPVTKKGYWQFDMGDVLINGSTTGFCSGGCSAIADSGTSL 300
           GGEIVFGGVDP H++GEHT+VPVT++GYWQFDMG+VLI G +TG+C  GCSAIADSGTSL
Sbjct: 241 GGEIVFGGVDPKHFRGEHTFVPVTQRGYWQFDMGEVLIAGESTGYCGSGCSAIADSGTSL 300

Query: 301 LAGPTTIITQVNHAIGASGVVSEECKAVVAEYGETIIKMLLAKDQPKKICSTLGLCAFDG 360
           LAGPT ++  +N AIGASGVVS++CK VV +YG+TI+ +LLA+ QPKKICS +GLCA+DG
Sbjct: 301 LAGPTAVVAMINKAIGASGVVSQQCKTVVDQYGQTILDLLLAETQPKKICSQIGLCAYDG 360

Query: 361 ERGVSMGIESVVDNTTQKSSNGLRDVMCNACEMAVVWAQSQLKDEKTQDQILNYIDGLCE 420
             GVSMGIESVVD    +SS+GLRD  C ACEMAVVW QSQL+   TQ++I+NYI+ +CE
Sbjct: 361 THGVSMGIESVVDKENTRSSSGLRDAGCPACEMAVVWIQSQLRQNMTQERIVNYINEICE 420

Query: 421 KLPSPMGESVIDCDSLSTLPSISFTIGGKVFELKPEQYVLKVTEGPVTECISGFAALDVP 480
           ++PSP GES +DC  LS +P++SFTIGGKVF+L PE+YVLK+ EGPV +CISGF ALD+P
Sbjct: 421 RMPSPNGESAVDCSQLSKMPTVSFTIGGKVFDLAPEEYVLKIGEGPVAQCISGFTALDIP 480

Query: 481 PPRGPLWILGDVFMGSYHTVFDYGNSRVGFAEA 513
           PPRGPLWILGDVFMG YHTVFD+GN +VGFAEA
Sbjct: 481 PPRGPLWILGDVFMGKYHTVFDFGNEQVGFAEA 512

BLAST of CSPI03G04660 vs. TAIR 10
Match: AT4G04460.1 (Saposin-like aspartyl protease family protein )

HSP 1 Score: 662.5 bits (1708), Expect = 2.7e-190
Identity = 315/517 (60.93%), Positives = 405/517 (78.34%), Query Frame = 0

Query: 1   MGTRLKLFIAV-LFICFFMFPMVFCASN-DGKVRIGLKRRKFGQNNRVASKIATKEGISL 60
           MGTR + F+ V L  C  +     C  N DG +RIGLK+RK  ++NR+AS+      + L
Sbjct: 1   MGTRFQSFLLVFLLSCLILISTASCERNGDGTIRIGLKKRKLDRSNRLASQ------LFL 60

Query: 61  KNSVEKYQPSANLGDSDD-FDIVGLKNYLNAQYFGEIGIGTPPQKFTVIFDTGSSNLWVP 120
           KN    + P      +D+  D+V LKNYL+AQY+G+I IGTPPQKFTVIFDTGSSNLW+P
Sbjct: 61  KNRGSHWSPKHYFRLNDENADMVPLKNYLDAQYYGDITIGTPPQKFTVIFDTGSSNLWIP 120

Query: 121 SSKCF-SVACLLHSKYKSKRSSTYKKNGKSASIKYGTGAISGYFSEDNVKVGDLIVKKQD 180
           S+KC+ SVAC  HSKYK+ +SS+Y+KNGK ASI+YGTGAISGYFS D+VKVGD++VK+Q+
Sbjct: 121 STKCYLSVACYFHSKYKASQSSSYRKNGKPASIRYGTGAISGYFSNDDVKVGDIVVKEQE 180

Query: 181 FIEATREPSLTFVLAQFDGILGLGFKEISVGDAVPVWYNMVDQNLVKEPVFSFWFNRNAD 240
           FIEAT EP +TF+LA+FDGILGLGFKEISVG++ PVWYNMV++ LVKEP+FSFW NRN  
Sbjct: 181 FIEATSEPGITFLLAKFDGILGLGFKEISVGNSTPVWYNMVEKGLVKEPIFSFWLNRNPK 240

Query: 241 EEQGGEIVFGGVDPDHYKGEHTYVPVTKKGYWQFDMGDVLINGSTTGFCSGGCSAIADSG 300
           + +GGEIVFGGVDP H+KGEHT+VPVT KGYWQFDMGD+ I G  TG+C+ GCSAIADSG
Sbjct: 241 DPEGGEIVFGGVDPKHFKGEHTFVPVTHKGYWQFDMGDLQIAGKPTGYCAKGCSAIADSG 300

Query: 301 TSLLAGPTTIITQVNHAIGASGVVSEECKAVVAEYGETIIKMLLAKDQPKKICSTLGLCA 360
           TSLL GP+T+IT +NHAIGA G+VS ECKAVV +YG+T++  LLA++ PKK+CS +G+CA
Sbjct: 301 TSLLTGPSTVITMINHAIGAQGIVSRECKAVVDQYGKTMLNSLLAQEDPKKVCSQIGVCA 360

Query: 361 FDGERGVSMGIESVVDNTTQKSSNGLRDVMCNACEMAVVWAQSQLKDEKTQDQILNYIDG 420
           +DG + VSMGI+SVVD+ T   S  L   MC+ACEMA VW +S+L   +TQ++IL Y   
Sbjct: 361 YDGTQSVSMGIQSVVDDGT---SGLLNQAMCSACEMAAVWMESELTQNQTQERILAYAAE 420

Query: 421 LCEKLPSPMGESVIDCDSLSTLPSISFTIGGKVFELKPEQYVLKVTEGPVTECISGFAAL 480
           LC+ +P+   +S +DC  +S++P ++F+IGG+ F+L P+ Y+ K+ EG  ++C SGF A+
Sbjct: 421 LCDHIPTQNQQSAVDCGRVSSMPIVTFSIGGRSFDLTPQDYIFKIGEGVESQCTSGFTAM 480

Query: 481 DVPPPRGPLWILGDVFMGSYHTVFDYGNSRVGFAEAA 514
           D+ PPRGPLWILGD+FMG YHTVFDYG  RVGFA+AA
Sbjct: 481 DIAPPRGPLWILGDIFMGPYHTVFDYGKGRVGFAKAA 508

BLAST of CSPI03G04660 vs. TAIR 10
Match: AT4G04460.2 (Saposin-like aspartyl protease family protein )

HSP 1 Score: 651.0 bits (1678), Expect = 8.1e-187
Identity = 313/517 (60.54%), Positives = 402/517 (77.76%), Query Frame = 0

Query: 1   MGTRLKLFIAV-LFICFFMFPMVFCASN-DGKVRIGLKRRKFGQNNRVASKIATKEGISL 60
           MGTR + F+ V L  C  +     C  N DG +RIGLK+RK  ++NR+AS+      + L
Sbjct: 1   MGTRFQSFLLVFLLSCLILISTASCERNGDGTIRIGLKKRKLDRSNRLASQ------LFL 60

Query: 61  KNSVEKYQPSANLGDSDD-FDIVGLKNYLNAQYFGEIGIGTPPQKFTVIFDTGSSNLWVP 120
           KN    + P      +D+  D+V LKNYL+AQY+G+I IGTPPQKFTVIFDTGSSNLW+P
Sbjct: 61  KNRGSHWSPKHYFRLNDENADMVPLKNYLDAQYYGDITIGTPPQKFTVIFDTGSSNLWIP 120

Query: 121 SSKCF-SVACLLHSKYKSKRSSTYKKNGKSASIKYGTGAISGYFSEDNVKVGDLIVKKQD 180
           S+KC+ SVAC  HSKYK+ +SS+Y+KNGK ASI+YGTGAISGYFS D+VKVGD++VK+Q+
Sbjct: 121 STKCYLSVACYFHSKYKASQSSSYRKNGKPASIRYGTGAISGYFSNDDVKVGDIVVKEQE 180

Query: 181 FIEATREPSLTFVLAQFDGILGLGFKEISVGDAVPVWYNMVDQNLVKEPVFSFWFNRNAD 240
           FIEAT EP +TF+LA+FDGILGLGFKEISVG++ PVWYNMV++ LVKEP+FSFW NRN  
Sbjct: 181 FIEATSEPGITFLLAKFDGILGLGFKEISVGNSTPVWYNMVEKGLVKEPIFSFWLNRNPK 240

Query: 241 EEQGGEIVFGGVDPDHYKGEHTYVPVTKKGYWQFDMGDVLINGSTTGFCSGGCSAIADSG 300
           + +GGEIVFGGVDP H+KGEHT+VPVT KGYWQFDMGD+ I G  TG+C+ GCSAIADSG
Sbjct: 241 DPEGGEIVFGGVDPKHFKGEHTFVPVTHKGYWQFDMGDLQIAGKPTGYCAKGCSAIADSG 300

Query: 301 TSLLAGPTTIITQVNHAIGASGVVSEECKAVVAEYGETIIKMLLAKDQPKKICSTLGLCA 360
           TSLL GP+T+IT +NHAIGA G+VS ECKAVV +YG+T++  LLA    +K+CS +G+CA
Sbjct: 301 TSLLTGPSTVITMINHAIGAQGIVSRECKAVVDQYGKTMLNSLLA----QKVCSQIGVCA 360

Query: 361 FDGERGVSMGIESVVDNTTQKSSNGLRDVMCNACEMAVVWAQSQLKDEKTQDQILNYIDG 420
           +DG + VSMGI+SVVD+ T   S  L   MC+ACEMA VW +S+L   +TQ++IL Y   
Sbjct: 361 YDGTQSVSMGIQSVVDDGT---SGLLNQAMCSACEMAAVWMESELTQNQTQERILAYAAE 420

Query: 421 LCEKLPSPMGESVIDCDSLSTLPSISFTIGGKVFELKPEQYVLKVTEGPVTECISGFAAL 480
           LC+ +P+   +S +DC  +S++P ++F+IGG+ F+L P+ Y+ K+ EG  ++C SGF A+
Sbjct: 421 LCDHIPTQNQQSAVDCGRVSSMPIVTFSIGGRSFDLTPQDYIFKIGEGVESQCTSGFTAM 480

Query: 481 DVPPPRGPLWILGDVFMGSYHTVFDYGNSRVGFAEAA 514
           D+ PPRGPLWILGD+FMG YHTVFDYG  RVGFA+AA
Sbjct: 481 DIAPPRGPLWILGDIFMGPYHTVFDYGKGRVGFAKAA 504

The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
O04057 | 3.1e-215 | 69.25 | Aspartic proteinase OS=Cucurbita pepo OX=3663 PE=2 SV=1
O65390 | 1.7e-213 | 70.18 | Aspartic proteinase A1 OS=Arabidopsis thaliana OX=3702 GN=APA1 PE=1 SV=1
Q8VYL3 | 3.8e-205 | 65.11 | Aspartic proteinase A2 OS=Arabidopsis thaliana OX=3702 GN=APA2 PE=2 SV=1
Q42456 | 1.3e-202 | 65.96 | Aspartic proteinase oryzasin-1 OS=Oryza sativa subsp. japonica OX=39947 GN=Os05g...
P42210 | 4.3e-201 | 66.28 | Phytepsin OS=Hordeum vulgare OX=4513 PE=1 SV=1

Match Name | E-value | Identity (%) | Description
A0A5D3DKC0 | 3.5e-294 | 97.67 | Aspartic proteinase A1-like OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaf...
A0A6J1L5Y5 | 9.1e-279 | 93.37 | aspartic proteinase A1-like OS=Cucurbita maxima OX=3661 GN=LOC111500293 PE=3 SV=...
A0A6J1F137 | 5.0e-277 | 92.98 | aspartic proteinase A1-like OS=Cucurbita moschata OX=3662 GN=LOC111438595 PE=3 S...
A0A0A0L2W7 | 9.8e-273 | 92.20 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G066750 PE=3 SV=1
A0A6J1DCK4 | 7.8e-262 | 86.19 | aspartic proteinase A1-like OS=Momordica charantia OX=3673 GN=LOC111019253 PE=3 ...

Match Name | E-value | Identity (%) | Description
XP_004151126.1 | 4.3e-299 | 99.61 | aspartic proteinase A1 [Cucumis sativus] >XP_011650512.1 aspartic proteinase A1 ...
KAA0065836.1 | 7.1e-294 | 97.67 | aspartic proteinase A1-like [Cucumis melo var. makuwa] >TYK24045.1 aspartic prot...
XP_038905979.1 | 2.0e-280 | 94.74 | aspartic proteinase A1-like [Benincasa hispida]
XP_023007789.1 | 1.9e-278 | 93.37 | aspartic proteinase A1-like [Cucurbita maxima]
XP_023553350.1 | 3.2e-278 | 93.37 | aspartic proteinase A1-like [Cucurbita pepo subsp. pepo]

Match Name | E-value | Identity (%) | Description
AT1G11910.1 | 1.2e-214 | 70.18 | aspartic proteinase A1
AT1G62290.1 | 2.7e-206 | 65.11 | Saposin-like aspartyl protease family protein
AT1G62290.2 | 2.7e-206 | 65.11 | Saposin-like aspartyl protease family protein
AT4G04460.1 | 2.7e-190 | 60.93 | Saposin-like aspartyl protease family protein
AT4G04460.2 | 8.1e-187 | 60.54 | Saposin-like aspartyl protease family protein

InterPro
Analysis Name: InterPro Annotations of Cucumber (PI 183967) v1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR001461 | Aspartic peptidase A1 family | PRINTS | PR00792 | PEPSIN | coord: 96..116, score: 73.01; coord: 486..501, score: 59.32; coord: 241..254, score: 46.54; coord: 291..302, score: 58.07
IPR001461 | Aspartic peptidase A1 family | PANTHER | PTHR47966 | BETA-SITE APP-CLEAVING ENZYME, ISOFORM A-RELATED | coord: 9..513
IPR021109 | Aspartic peptidase domain superfamily | GENE3D | 2.40.70.10 | Acid Proteases | coord: 31..247, e-value: 5.8E-77, score: 260.2
IPR021109 | Aspartic peptidase domain superfamily | GENE3D | 2.40.70.10 | Acid Proteases | coord: 248..513, e-value: 1.4E-120, score: 403.0
IPR021109 | Aspartic peptidase domain superfamily | SUPERFAMILY | 50630 | Acid proteases | coord: 11..512
None | No IPR available | GENE3D | 1.10.225.10 | | coord: 319..422, e-value: 1.4E-120, score: 403.0
None | No IPR available | PANTHER | PTHR47966:SF31 | ASPARTIC PROTEINASE-LIKE | coord: 9..513
IPR033121 | Peptidase family A1 domain | PFAM | PF00026 | Asp | coord: 89..512, e-value: 9.0E-136, score: 452.7
IPR033121 | Peptidase family A1 domain | PROSITE | PS51767 | PEPTIDASE_A1 | coord: 90..510, score: 75.121506
IPR007856 | Saposin-like type B, region 1 | PFAM | PF05184 | SapB_1 | coord: 386..422, e-value: 2.3E-13, score: 49.9
IPR008138 | Saposin B type, region 2 | PFAM | PF03489 | SapB_2 | coord: 323..355, e-value: 7.8E-12, score: 45.2
IPR001969 | Aspartic peptidase, active site | PROSITE | PS00141 | ASP_PROTEASE | coord: 105..116
IPR001969 | Aspartic peptidase, active site | PROSITE | PS00141 | ASP_PROTEASE | coord: 291..302
IPR008139 | Saposin B type domain | PROSITE | PS50015 | SAP_B | coord: 319..359, score: 12.6064
IPR008139 | Saposin B type domain | PROSITE | PS50015 | SAP_B | coord: 383..424, score: 11.467599
IPR033869 | Phytepsin | CDD | cd06098 | phytepsin | coord: 80..511, e-value: 0.0, score: 630.559
IPR011001 | Saposin-like | SUPERFAMILY | 47862 | Saposin | coord: 320..422
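
The `coord: start..end` entries above give the span of each InterPro match on the protein. The sketch below shows one way to turn those spans into lengths and to check whether one match lies inside another. It assumes the coordinates are 1-based and inclusive, which this page does not state explicitly; the helper names `parse_coord`, `span_length` and `contains` are hypothetical.

```python
import re

COORD_RE = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def parse_coord(text):
    """Return (start, end) from a 'coord: start..end' fragment."""
    m = COORD_RE.search(text)
    if m is None:
        raise ValueError("no coord field in %r" % text)
    start, end = int(m.group(1)), int(m.group(2))
    return start, end


def span_length(text):
    """Length in residues, assuming 1-based inclusive coordinates."""
    start, end = parse_coord(text)
    return end - start + 1


def contains(outer, inner):
    """True if the inner span lies entirely within the outer span."""
    o_start, o_end = parse_coord(outer)
    i_start, i_end = parse_coord(inner)
    return o_start <= i_start and i_end <= o_end


if __name__ == "__main__":
    asp = "coord: 89..512"      # PF00026 (Asp)
    sapb1 = "coord: 386..422"   # PF05184 (SapB_1)
    print(span_length(asp), span_length(sapb1), contains(asp, sapb1))
```

Under that assumption, the Pfam Asp match spans 424 residues and the SapB_1 match (37 residues) falls entirely inside it, which fits the saposin-like insert reported for this aspartic proteinase.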

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
CSPI03G04660.1 | CSPI03G04660.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0006629 | lipid metabolic process
biological_process | GO:0006508 | proteolysis
molecular_function | GO:0004190 | aspartic-type endopeptidase activity