CSPI01G20490 (gene) Cucumber (PI 183967) v1

Overview
Name: CSPI01G20490
Type: gene
Organism: Cucumis sativus var. hardwickii cv. PI 183967 (Cucumber (PI 183967) v1)
Description: Eukaryotic aspartyl protease family protein
Location: Chr1: 16170484 .. 16172193 (+)
RNA-Seq Expression: CSPI01G20490
Synteny: CSPI01G20490
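
The location field can be turned into a feature length with a small parser. The sketch below is only illustrative: the function name `parse_location` is hypothetical, and it assumes the coordinates are 1-based and inclusive (the usual GFF-style convention), which this page does not state explicitly.

```python
import re

def parse_location(text):
    """Parse a string such as 'Chr1: 16170484 .. 16172193 (+)'.

    Assumption: coordinates are 1-based and inclusive, so the
    length is end - start + 1.
    """
    match = re.fullmatch(
        r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text
    )
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "chrom": chrom,
        "start": start,
        "end": end,
        "strand": strand,
        "length": end - start + 1,
    }

location = parse_location("Chr1: 16170484 .. 16172193 (+)")
print(location["chrom"], location["strand"], location["length"])
```

Under that assumption the gene spans 1710 bp on the forward strand of Chr1.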
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

TGGTTCTACATGTTATTAAATATGGTTTCATTCTGCTTTGTGAGATCACCACGGTTTCGTTCTCCATTGTATAACACATCAGAAGCAACATAATGATTCTGTTAGCTTCTCTCCATCATCTTTTACCTTCCCTGATCTTGGCATTTTATTTGTCAACAGCCATAATCTCGTCCACATTGATCACGACAAAGCCTTCACGTTTGGCAACCAAGCTGATCCACCGCAACTCATATCTACATCCACTCTATGACCAAAATGAGACAGTTGAGGATCGATCGAAGAGAGAGCAGACAAGCTCAATCGAACGCTTTGGTTTTCTCGAGTCAAAGATTAAAGAACTGAAGTCTGTTGGTAATGAAGCTCGATCAAGTCTCATTCCTTTCAATCGAGGTAGTGGGTTTCTTGTTAATTTGTCAATCGGTTCGCCACCCGTGACACAGCTCGTAGTGGTCGACACTGGTAGCTCCCTCCTGTGGGTGCAATGTTTGCCTTGTATCAACTGTTTTCAACAATCGACCTCATGGTTCGATCCTTTGAAATCAGTAAGTTTCAAAACATTGGGTTGTGGGTTTCCTGGGTATAACTACATTAATGGTTACAAATGCAATCGGTTTAATCAAGCTGAGTACAAGTTGAGGTACCTTGGTGGGGATTCCTCACAAGGAATTCTTGCCAAGGAATCACTTCTCTTTGAGACACTTGATGAAGGTAGAGTTTTTCAATACAATGCTATTTCAACACAAATAAGTTGATAAATAAATTTGACTTTCTTAATTGTTTTAGTAACATATTTTACCGTGTTTTCTATAATTATTATTTTGTGATATCATATGATTTGTAATACACTATATGATTTAACAGAAAAGAATTGAAATTATAACATAGAAAAACTATATGAACTTTAATTTGTAATTACAACATTTGTCAAACTCACAATGGTATTCGATTGTTGTAACAGGCAAAATAAAAAAGTCGAACGTAACATTTGGGTGTGGTCATATGAACATCAAAACCAACAATGACGATGCCTACAATGGCGTATTTGGATTAGGAGCATACCCCCACATAACAATGGCCACTCAATTAGGCAACAAATTTTCCTATTGCATTGGCGATATCAACAATCCTCTCTACACTCACAATCATCTTGTCTTAGGACAAGGATCTTACATTGAAGGCGATTCCACTCCTCTTCAAATCCACTTTGGCCATTATTATGTCACTTTACAATCCATCAGTGTTGGCTCTAAGACCCTCAAAATCGACCCAAATGCCTTCAAAATCTCATCGGACGGCAGCGGTGGAGTTCTGATCGACTCTGGAATGACCTACACTAAGCTTGCGAATGGTGGGTTTGAATTACTTTATGATGAGATTGTTGATTTGATGAAGGGTTTGCTGGAACGAATCCCAACTCAGAGGAAATTCGAAGGGTTGTGTTTTAAAGGTGTCGTCAGTCGAGACCTTGTTGGGTTTCCGGCGGTGACGTTTCATTTCGCCGGCGGTGCTGATTTGGTGTTGGAATCGGGGAGTTTGTTCCGGCAACATGGTGGGGATCGGTTTTGTTTAGCTATTCTGCCAAGTAATTCTGAGTTGTTGAATCTGTCTGTGATTGGGATTTTGGCTCAACAGAATTATAATGTTGGTTTTGATCTTGAACAAATGAAAGTGTTCTTTCGTAGGATTGATTGTCAACTTCTTGACGAGTAA

mRNA sequence

TGGTTCTACATGTTATTAAATATGGTTTCATTCTGCTTTGTGAGATCACCACGGTTTCGTTCTCCATTGTATAACACATCAGAAGCAACATAATGATTCTGTTAGCTTCTCTCCATCATCTTTTACCTTCCCTGATCTTGGCATTTTATTTGTCAACAGCCATAATCTCGTCCACATTGATCACGACAAAGCCTTCACGTTTGGCAACCAAGCTGATCCACCGCAACTCATATCTACATCCACTCTATGACCAAAATGAGACAGTTGAGGATCGATCGAAGAGAGAGCAGACAAGCTCAATCGAACGCTTTGGTTTTCTCGAGTCAAAGATTAAAGAACTGAAGTCTGTTGGTAATGAAGCTCGATCAAGTCTCATTCCTTTCAATCGAGGTAGTGGGTTTCTTGTTAATTTGTCAATCGGTTCGCCACCCGTGACACAGCTCGTAGTGGTCGACACTGGTAGCTCCCTCCTGTGGGTGCAATGTTTGCCTTGTATCAACTGTTTTCAACAATCGACCTCATGGTTCGATCCTTTGAAATCAGTAAGTTTCAAAACATTGGGTTGTGGGTTTCCTGGGTATAACTACATTAATGGTTACAAATGCAATCGGTTTAATCAAGCTGAGTACAAGTTGAGGTACCTTGGTGGGGATTCCTCACAAGGAATTCTTGCCAAGGAATCACTTCTCTTTGAGACACTTGATGAAGGCAAAATAAAAAAGTCGAACGTAACATTTGGGTGTGGTCATATGAACATCAAAACCAACAATGACGATGCCTACAATGGCGTATTTGGATTAGGAGCATACCCCCACATAACAATGGCCACTCAATTAGGCAACAAATTTTCCTATTGCATTGGCGATATCAACAATCCTCTCTACACTCACAATCATCTTGTCTTAGGACAAGGATCTTACATTGAAGGCGATTCCACTCCTCTTCAAATCCACTTTGGCCATTATTATGTCACTTTACAATCCATCAGTGTTGGCTCTAAGACCCTCAAAATCGACCCAAATGCCTTCAAAATCTCATCGGACGGCAGCGGTGGAGTTCTGATCGACTCTGGAATGACCTACACTAAGCTTGCGAATGGTGGGTTTGAATTACTTTATGATGAGATTGTTGATTTGATGAAGGGTTTGCTGGAACGAATCCCAACTCAGAGGAAATTCGAAGGGTTGTGTTTTAAAGGTGTCGTCAGTCGAGACCTTGTTGGGTTTCCGGCGGTGACGTTTCATTTCGCCGGCGGTGCTGATTTGGTGTTGGAATCGGGGAGTTTGTTCCGGCAACATGGTGGGGATCGGTTTTGTTTAGCTATTCTGCCAAGTAATTCTGAGTTGTTGAATCTGTCTGTGATTGGGATTTTGGCTCAACAGAATTATAATGTTGGTTTTGATCTTGAACAAATGAAAGTGTTCTTTCGTAGGATTGATTGTCAACTTCTTGACGAGTAA

Coding sequence (CDS)

ATGATTCTGTTAGCTTCTCTCCATCATCTTTTACCTTCCCTGATCTTGGCATTTTATTTGTCAACAGCCATAATCTCGTCCACATTGATCACGACAAAGCCTTCACGTTTGGCAACCAAGCTGATCCACCGCAACTCATATCTACATCCACTCTATGACCAAAATGAGACAGTTGAGGATCGATCGAAGAGAGAGCAGACAAGCTCAATCGAACGCTTTGGTTTTCTCGAGTCAAAGATTAAAGAACTGAAGTCTGTTGGTAATGAAGCTCGATCAAGTCTCATTCCTTTCAATCGAGGTAGTGGGTTTCTTGTTAATTTGTCAATCGGTTCGCCACCCGTGACACAGCTCGTAGTGGTCGACACTGGTAGCTCCCTCCTGTGGGTGCAATGTTTGCCTTGTATCAACTGTTTTCAACAATCGACCTCATGGTTCGATCCTTTGAAATCAGTAAGTTTCAAAACATTGGGTTGTGGGTTTCCTGGGTATAACTACATTAATGGTTACAAATGCAATCGGTTTAATCAAGCTGAGTACAAGTTGAGGTACCTTGGTGGGGATTCCTCACAAGGAATTCTTGCCAAGGAATCACTTCTCTTTGAGACACTTGATGAAGGCAAAATAAAAAAGTCGAACGTAACATTTGGGTGTGGTCATATGAACATCAAAACCAACAATGACGATGCCTACAATGGCGTATTTGGATTAGGAGCATACCCCCACATAACAATGGCCACTCAATTAGGCAACAAATTTTCCTATTGCATTGGCGATATCAACAATCCTCTCTACACTCACAATCATCTTGTCTTAGGACAAGGATCTTACATTGAAGGCGATTCCACTCCTCTTCAAATCCACTTTGGCCATTATTATGTCACTTTACAATCCATCAGTGTTGGCTCTAAGACCCTCAAAATCGACCCAAATGCCTTCAAAATCTCATCGGACGGCAGCGGTGGAGTTCTGATCGACTCTGGAATGACCTACACTAAGCTTGCGAATGGTGGGTTTGAATTACTTTATGATGAGATTGTTGATTTGATGAAGGGTTTGCTGGAACGAATCCCAACTCAGAGGAAATTCGAAGGGTTGTGTTTTAAAGGTGTCGTCAGTCGAGACCTTGTTGGGTTTCCGGCGGTGACGTTTCATTTCGCCGGCGGTGCTGATTTGGTGTTGGAATCGGGGAGTTTGTTCCGGCAACATGGTGGGGATCGGTTTTGTTTAGCTATTCTGCCAAGTAATTCTGAGTTGTTGAATCTGTCTGTGATTGGGATTTTGGCTCAACAGAATTATAATGTTGGTTTTGATCTTGAACAAATGAAAGTGTTCTTTCGTAGGATTGATTGTCAACTTCTTGACGAGTAA
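
Because the mRNA sequence begins upstream of the coding sequence, the length of the 5' UTR can be estimated by locating the start of the CDS inside the mRNA. The following is a minimal sketch using only the leading portions of the two sequences listed above; the variable names are hypothetical, and in practice the full sequences would be used. Searching for the bare start codon `ATG` would not be enough here, since the UTR itself contains earlier `ATG` triplets.

```python
# Leading portion of the mRNA sequence from this page (truncated for brevity).
mrna_prefix = (
    "TGGTTCTACATGTTATTAAATATGGTTTCATTCTGCTTTGTGAGATCACC"
    "ACGGTTTCGTTCTCCATTGTATAACACATCAGAAGCAACATA"
    "ATGATTCTGTTAGCTTCTCTCC"
)

# Leading portion of the CDS from this page (truncated for brevity).
cds_prefix = "ATGATTCTGTTAGCTTCTCTCC"

# str.find returns the 0-based offset of the first match, or -1 if absent.
# That offset equals the number of bases that precede the CDS.
utr_length = mrna_prefix.find(cds_prefix)
if utr_length < 0:
    raise ValueError("CDS start not found in mRNA")

five_prime_utr = mrna_prefix[:utr_length]
print(f"5' UTR length: {utr_length} nt")
print(f"5' UTR ends with: ...{five_prime_utr[-10:]}")
```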

Protein sequence

MILLASLHHLLPSLILAFYLSTAIISSTLITTKPSRLATKLIHRNSYLHPLYDQNETVEDRSKREQTSSIERFGFLESKIKELKSVGNEARSSLIPFNRGSGFLVNLSIGSPPVTQLVVVDTGSSLLWVQCLPCINCFQQSTSWFDPLKSVSFKTLGCGFPGYNYINGYKCNRFNQAEYKLRYLGGDSSQGILAKESLLFETLDEGKIKKSNVTFGCGHMNIKTNNDDAYNGVFGLGAYPHITMATQLGNKFSYCIGDINNPLYTHNHLVLGQGSYIEGDSTPLQIHFGHYYVTLQSISVGSKTLKIDPNAFKISSDGSGGVLIDSGMTYTKLANGGFELLYDEIVDLMKGLLERIPTQRKFEGLCFKGVVSRDLVGFPAVTFHFAGGADLVLESGSLFRQHGGDRFCLAILPSNSELLNLSVIGILAQQNYNVGFDLEQMKVFFRRIDCQLLDE*
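
The protein sequence corresponds to a translation of the CDS. The sketch below assumes the standard genetic code (NCBI translation table 1) and translates only the first and last few codons of the CDS shown above; the helper `translate` is a hypothetical name written for this illustration, not part of any library.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna):
    """Translate a DNA string in frame 0; '*' marks a stop codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

cds_start = "ATGATTCTGTTAGCTTCTCTCCATCATCTT"  # first 10 codons of the CDS
cds_end = "CTTCTTGACGAGTAA"                   # last 5 codons of the CDS

print(translate(cds_start))
print(translate(cds_end))
```

The two fragments should reproduce the beginning (`MILLASLHHL`) and the end (`LLDE*`) of the protein sequence listed above.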
Homology
BLAST of CSPI01G20490 vs. ExPASy Swiss-Prot
Match: Q3EBM5 (Probable aspartic protease At2g35615 OS=Arabidopsis thaliana OX=3702 GN=At2g35615 PE=3 SV=1)

HSP 1 Score: 186.0 bits (471), Expect = 9.2e-46
Identity = 140/461 (30.37%), Positives = 232/461 (50.33%), Query Frame = 0

Query: 14  LILAFYLSTAIISSTLITTKPSRLATKLIHRNSYLHPLYDQNETVEDRSKREQTSSIERF 73
           ++L F+L  ++  S+  +  P   + +LIHR+S L P+Y+   TV DR       S+ R 
Sbjct: 5   ILLCFFLFFSVTLSS--SGHPKNFSVELIHRDSPLSPIYNPQITVTDRLNAAFLRSVSRS 64

Query: 74  GFLESKIKELKSVGNEARSSLIPFNRGSGFLVNLSIGSPPVTQLVVVDTGSSLLWVQCLP 133
                ++ +      + +S LI       F ++++IG+PP+    + DTGS L WVQC P
Sbjct: 65  RRFNHQLSQ-----TDLQSGLI--GADGEFFMSITIGTPPIKVFAIADTGSDLTWVQCKP 124

Query: 134 CINCFQQSTSWFDPLKSVSFKTLGCGFPGYNYINGYK--CNRFNQAEYKLRYLGGDS--S 193
           C  C++++   FD  KS ++K+  C       ++  +  C+  N    K RY  GD   S
Sbjct: 125 CQQCYKENGPIFDKKKSSTYKSEPCDSRNCQALSSTERGCDESNNI-CKYRYSYGDQSFS 184

Query: 194 QGILAKESLLFETLDEGKIKKSNVTFGCGHMNIKTNNDDAYNGVFGLGAYPHITMATQLG 253
           +G +A E++  ++     +      FGCG+ N  T  D+  +G+ GLG   H+++ +QLG
Sbjct: 185 KGDVATETVSIDSASGSPVSFPGTVFGCGYNNGGT-FDETGSGIIGLGG-GHLSLISQLG 244

Query: 254 N----KFSYCIGDINNPLYTHNHLVLGQG-----SYIEGD----STPL--QIHFGHYYVT 313
           +    KFSYC+   +    T+   V+  G     S +  D    STPL  +    +YY+T
Sbjct: 245 SSISKKFSYCLS--HKSATTNGTSVINLGTNSIPSSLSKDSGVVSTPLVDKEPLTYYYLT 304

Query: 314 LQSISVGSKTL-----KIDPNAFKISSDGSGGVLIDSGMTYTKLANGGFELLYDEIVDLM 373
           L++ISVG K +       +PN   I S+ SG ++IDSG T T L  G F+     + + +
Sbjct: 305 LEAISVGKKKIPYTGSSYNPNDDGILSETSGNIIIDSGTTLTLLEAGFFDKFSSAVEESV 364

Query: 374 KGLLERIPTQRKFEGLCFKGVVSRDLVGFPAVTFHFAGGADLVLESGSLFRQHGGDRFCL 433
            G  +R+   +     CFK   +   +G P +T HF  GAD+ L   + F +   D  CL
Sbjct: 365 TG-AKRVSDPQGLLSHCFKSGSAE--IGLPEITVHFT-GADVRLSPINAFVKLSEDMVCL 424

Query: 434 AILPSNSELLNLSVIGILAQQNYNVGFDLEQMKVFFRRIDC 451
           +++P+      +++ G  AQ ++ VG+DLE   V F+ +DC
Sbjct: 425 SMVPTT----EVAIYGNFAQMDFLVGYDLETRTVSFQHMDC 443
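
The percentages in each HSP header are the match counts divided by the alignment length (gaps included). As a quick check, the sketch below recomputes them for the Q3EBM5 hit from the counts reported in its header; the function name `percent` is hypothetical.

```python
def percent(count, alignment_length):
    """Return count / alignment_length as a percentage, rounded to 2 places."""
    if alignment_length <= 0:
        raise ValueError("alignment length must be positive")
    return round(100.0 * count / alignment_length, 2)

# Counts taken from the HSP header of the Q3EBM5 alignment above.
alignment_length = 461
identical = 140
positive = 232  # identical residues plus conservative substitutions

identity_pct = percent(identical, alignment_length)
positive_pct = percent(positive, alignment_length)
print(f"Identity:  {identity_pct}%")
print(f"Positives: {positive_pct}%")
```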

BLAST of CSPI01G20490 vs. ExPASy Swiss-Prot
Match: Q6XBF8 (Aspartic proteinase CDR1 OS=Arabidopsis thaliana OX=3702 GN=CDR1 PE=1 SV=1)

HSP 1 Score: 183.3 bits (464), Expect = 6.0e-45
Identity = 141/457 (30.85%), Positives = 212/457 (46.39%), Query Frame = 0

Query: 11  LPSLILAFYLSTAIISSTLITTKPSR----LATKLIHRNSYLHPLYDQNETVEDRSKREQ 70
           + SL  +  LS  ++SS  ++   ++        LIHR+S   P Y+  ET   R +   
Sbjct: 1   MASLFSSVLLSLCLLSSLFLSNANAKPKLGFTADLIHRDSPKSPFYNPMETSSQRLRNAI 60

Query: 71  TSSIER-FGFLESKIKELKSVGNEARSSLIPFNRGSGFLVNLSIGSPPVTQLVVVDTGSS 130
             S+ R F F E          N  +  +   +    +L+N+SIG+PP   + + DTGS 
Sbjct: 61  HRSVNRVFHFTEK--------DNTPQPQIDLTSNSGEYLMNVSIGTPPFPIMAIADTGSD 120

Query: 131 LLWVQCLPCINCFQQSTSWFDPLKSVSFKTLGCGFPGYNYI-NGYKCN-RFNQAEYKLRY 190
           LLW QC PC +C+ Q    FDP  S ++K + C       + N   C+   N   Y L Y
Sbjct: 121 LLWTQCAPCDDCYTQVDPLFDPKTSSTYKDVSCSSSQCTALENQASCSTNDNTCSYSLSY 180

Query: 191 LGGDSSQGILAKESLLFETLDEGKIKKSNVTFGCGHMNIKTNNDDAYNGVFGLGAYPHIT 250
                ++G +A ++L   + D   ++  N+  GCGH N  T N    +G+ GLG  P ++
Sbjct: 181 GDNSYTKGNIAVDTLTLGSSDTRPMQLKNIIIGCGHNNAGTFNKKG-SGIVGLGGGP-VS 240

Query: 251 MATQLGN----KFSYCIGDINNPLYTHNHLVLGQGSYIEGD---STPLQIHFGH---YYV 310
           +  QLG+    KFSYC+  + +     + +  G  + + G    STPL         YY+
Sbjct: 241 LIKQLGDSIDGKFSYCLVPLTSKKDQTSKINFGTNAIVSGSGVVSTPLIAKASQETFYYL 300

Query: 311 TLQSISVGSKTLKIDPNAFKISSDGSGGVLIDSGMTYTKLANGGFELLYDEIVDLMKGLL 370
           TL+SISVGSK ++   +    S    G ++IDSG T T L        Y E+ D +   +
Sbjct: 301 TLKSISVGSKQIQYSGSD---SESSEGNIIIDSGTTLTLLPTE----FYSELEDAVASSI 360

Query: 371 ERIPTQRKFEGLCFKGVVSRDLVGFPAVTFHFAGGADLVLESGSLFRQHGGDRFCLAILP 430
           +    Q    GL      + DL   P +T HF  GAD+ L+S + F Q   D  C A   
Sbjct: 361 DAEKKQDPQSGLSLCYSATGDL-KVPVITMHF-DGADVKLDSSNAFVQVSEDLVCFAFRG 420

Query: 431 SNSELLNLSVIGILAQQNYNVGFDLEQMKVFFRRIDC 451
           S S     S+ G +AQ N+ VG+D     V F+  DC
Sbjct: 421 SPS----FSIYGNVAQMNFLVGYDTVSKTVSFKPTDC 434

BLAST of CSPI01G20490 vs. ExPASy Swiss-Prot
Match: Q766C3 (Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 SV=1)

HSP 1 Score: 176.4 bits (446), Expect = 7.3e-43
Identity = 121/356 (33.99%), Positives = 172/356 (48.31%), Query Frame = 0

Query: 103 FLVNLSIGSPPVTQLVVVDTGSSLLWVQCLPCINCFQQSTSWFDPLKSVSFKTLGCGFPG 162
           +L+NLSIG+P      ++DTGS L+W QC PC  CF QST  F+P  S SF TL C    
Sbjct: 95  YLMNLSIGTPAQPFSAIMDTGSDLIWTQCQPCTQCFNQSTPIFNPQGSSSFSTLPCSSQL 154

Query: 163 YNYINGYKCNRFNQAEYKLRYLGGDSSQGILAKESLLFETLDEGKIKKSNVTFGCGHMNI 222
              ++   C+  N  +Y   Y  G  +QG +  E+L F     G +   N+TFGCG  N 
Sbjct: 155 CQALSSPTCSN-NFCQYTYGYGDGSETQGSMGTETLTF-----GSVSIPNITFGCGENNQ 214

Query: 223 KTNNDDAYNGVFGLGAYPHITMATQLG-NKFSYCIGDINNPLYTHNHLVLGQ--GSYIEG 282
                +   G+ G+G  P +++ +QL   KFSYC+  I +   T ++L+LG    S   G
Sbjct: 215 GFGQGNG-AGLVGMGRGP-LSLPSQLDVTKFSYCMTPIGSS--TPSNLLLGSLANSVTAG 274

Query: 283 DSTPLQIHFGH----YYVTLQSISVGSKTLKIDPNAFKI-SSDGSGGVLIDSGMTYTKLA 342
                 I        YY+TL  +SVGS  L IDP+AF + S++G+GG++IDSG T T   
Sbjct: 275 SPNTTLIQSSQIPTFYYITLNGLSVGSTRLPIDPSAFALNSNNGTGGIIIDSGTTLTYFV 334

Query: 343 NGGFELLYDEIVDLMKGLLERIPTQRKFEGLCFKGVVSRDLVGFPAVTFHFAGGADLVLE 402
           N  ++ +  E +  +   L  +        LCF+       +  P    HF GG DL L 
Sbjct: 335 NNAYQSVRQEFISQIN--LPVVNGSSSGFDLCFQTPSDPSNLQIPTFVMHFDGG-DLELP 394

Query: 403 SGSLFRQHGGDRFCLAILPSNSELLNLSVIGILAQQNYNVGFDLEQMKVFFRRIDC 451
           S + F        CLA+    S    +S+ G + QQN  V +D     V F    C
Sbjct: 395 SENYFISPSNGLICLAM---GSSSQGMSIFGNIQQQNMLVVYDTGNSVVSFASAQC 434

BLAST of CSPI01G20490 vs. ExPASy Swiss-Prot
Match: Q766C2 (Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 SV=1)

HSP 1 Score: 167.2 bits (422), Expect = 4.4e-40
Identity = 128/446 (28.70%), Positives = 206/446 (46.19%), Query Frame = 0

Query: 11  LPSLILAFYLSTAIISSTLITTKPSRLATKLIHRNSYLHPLYDQNETVEDRSKREQTSSI 70
           L S++L   + +AI++ T  T++ + L          L    +Q ++ ++ +K E     
Sbjct: 5   LYSVVLGLAIVSAIVAPTSSTSRGTLLHHGQKRPQPGLRVDLEQVDSGKNLTKYELIKRA 64

Query: 71  ERFGFLESKIKELKSVGNEARSSLIPFNRGSG-FLVNLSIGSPPVTQLVVVDTGSSLLWV 130
            + G  E +++ + ++   +     P   G G +L+N++IG+P  +   ++DTGS L+W 
Sbjct: 65  IKRG--ERRMRSINAMLQSSSGIETPVYAGDGEYLMNVAIGTPDSSFSAIMDTGSDLIWT 124

Query: 131 QCLPCINCFQQSTSWFDPLKSVSFKTLGCGFPGYNYINGYKCNRFNQAEYKLRYLGGDSS 190
           QC PC  CF Q T  F+P  S SF TL C       +    CN  N+ +Y   Y  G ++
Sbjct: 125 QCEPCTQCFSQPTPIFNPQDSSSFSTLPCESQYCQDLPSETCNN-NECQYTYGYGDGSTT 184

Query: 191 QGILAKESLLFETLDEGKIKKSNVTFGCGHMNIKTNNDDAYNGVFGLGAYPHITMATQLG 250
           QG +A E+  FET         N+ FGCG  N      +   G+ G+G  P +++ +QLG
Sbjct: 185 QGYMATETFTFET-----SSVPNIAFGCGEDNQGFGQGNG-AGLIGMGWGP-LSLPSQLG 244

Query: 251 -NKFSYCIGDINNPLYTHNHLVLGQGSYIEGDSTPLQIHFG----HYYVTLQSISVGSKT 310
             +FSYC+    +   +   L        EG  +   IH      +YY+TLQ I+VG   
Sbjct: 245 VGQFSYCMTSYGSSSPSTLALGSAASGVPEGSPSTTLIHSSLNPTYYYITLQGITVGGDN 304

Query: 311 LKIDPNAFKISSDGSGGVLIDSGMTYTKLANGGFELLYDEIVDLMKGLLERIPTQRKFEG 370
           L I  + F++  DG+GG++IDSG T T L    +  +     D +   L  +        
Sbjct: 305 LGIPSSTFQLQDDGTGGMIIDSGTTLTYLPQDAYNAVAQAFTDQIN--LPTVDESSSGLS 364

Query: 371 LCFKGVVSRDLVGFPAVTFHFAGGADLVLESGSLFRQHGGDRFCLAILPSNSELLNLSVI 430
            CF+       V  P ++  F GG   + E   L     G   CLA+   +S  L +S+ 
Sbjct: 365 TCFQQPSDGSTVQVPEISMQFDGGVLNLGEQNILISPAEG-VICLAM--GSSSQLGISIF 424

Query: 431 GILAQQNYNVGFDLEQMKVFFRRIDC 451
           G + QQ   V +DL+ + V F    C
Sbjct: 425 GNIQQQETQVLYDLQNLAVSFVPTQC 435

BLAST of CSPI01G20490 vs. ExPASy Swiss-Prot
Match: Q9SV77 (Aspartyl protease UND OS=Arabidopsis thaliana OX=3702 GN=UND PE=2 SV=1)

HSP 1 Score: 160.6 bits (405), Expect = 4.2e-38
Identity = 112/360 (31.11%), Positives = 175/360 (48.61%), Query Frame = 0

Query: 99  RGSGFLVNLSIGSPPVTQLVVVDTGSSLLWVQCLPCINCFQQST-SWFDPLKSVSFKTLG 158
           RG  F+  +  GSP   Q + +DTGSSL W QC PC +C+ Q     + P  S++++   
Sbjct: 54  RGLAFMAEIHFGSPQKKQFLHMDTGSSLTWTQCFPCSDCYAQKIYPKYRPAASITYRDAM 113

Query: 159 CGFPGYNYINGYKCNRFNQ-AEYKLRYLGGDSSQGILAKESLLFETLDEGKIKKSNVTFG 218
           C          +  +   +   Y+  YL   + +G LA+E +  +T D G  +   V FG
Sbjct: 114 CEDSHPKSNPHFAFDPLTRICTYQQHYLDETNIKGTLAQEMITVDTHDGGFKRVHGVYFG 173

Query: 219 CGHMNIKTNNDDAY---NGVFGLGAYPHITMATQLGNKFSYCIGDINNPLYTHNHLVLGQ 278
           C      T +D +Y    G+ GLG   + ++  + G+KFS+C+G+I+ P  +HN L+LG 
Sbjct: 174 C-----NTLSDGSYFTGTGILGLGVGKY-SIIGEFGSKFSFCLGEISEPKASHN-LILGD 233

Query: 279 GSYIEGDSTPLQIHFGHYYVTLQSISVGSKTLKIDPNAFKISSDGSGGVLIDSGMTYTKL 338
           G+ ++G  T + I  GH    L+SI VG           +I+ D    V +D+G T + L
Sbjct: 234 GANVQGHPTVINITEGHTIFQLESIIVGE----------EITLDDPVQVFVDTGSTLSHL 293

Query: 339 ANGGFELLYDEIVDLMKGLLERIPTQRKFEGLCFKGVVSRDLVGFPAVTFHFAGGADLVL 398
           +      LY + VD    L+   P   +   LC+K      L     V F F  GA+L +
Sbjct: 294 STN----LYYKFVDAFDDLIGSRPLSYE-PTLCYKADTIERLEKMD-VGFKFDVGAELSV 353

Query: 399 ESGSLFRQHGGDRF-CLAILPSNSELLNLSVIGILAQQNYNVGFDLEQMKVFFRRIDCQL 453
              ++F Q G     CLAI  +N E  +  +IG++A Q YNVG+DL     +  + DC +
Sbjct: 354 NIHNIFIQQGPPEIRCLAI-QNNKESFSHVIIGVIAMQGYNVGYDLSAKTAYINKQDCDM 389

BLAST of CSPI01G20490 vs. ExPASy TrEMBL
Match: A0A0A0LUP5 (Peptidase A1 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_1G424350 PE=3 SV=1)

HSP 1 Score: 912.5 bits (2357), Expect = 6.9e-262
Identity = 452/455 (99.34%), Positives = 453/455 (99.56%), Query Frame = 0

Query: 1   MILLASLHHLLPSLILAFYLSTAIISSTLITTKPSRLATKLIHRNSYLHPLYDQNETVED 60
           MILLASLHHLLPSL LAFYLSTAIISSTLITTKPSRLATKLIHRNSYLHPLYDQNETVED
Sbjct: 2   MILLASLHHLLPSLTLAFYLSTAIISSTLITTKPSRLATKLIHRNSYLHPLYDQNETVED 61

Query: 61  RSKREQTSSIERFGFLESKIKELKSVGNEARSSLIPFNRGSGFLVNLSIGSPPVTQLVVV 120
           RSKREQTSSIERF FLESKIKELKSVGNEARSSLIPFNRGSGFLVNLSIGSPPVTQLVVV
Sbjct: 62  RSKREQTSSIERFDFLESKIKELKSVGNEARSSLIPFNRGSGFLVNLSIGSPPVTQLVVV 121

Query: 121 DTGSSLLWVQCLPCINCFQQSTSWFDPLKSVSFKTLGCGFPGYNYINGYKCNRFNQAEYK 180
           DTGSSLLWVQCLPCINCFQQSTSWFDPLKSVSFKTLGCGFPGYNYINGYKCNRFNQAEYK
Sbjct: 122 DTGSSLLWVQCLPCINCFQQSTSWFDPLKSVSFKTLGCGFPGYNYINGYKCNRFNQAEYK 181

Query: 181 LRYLGGDSSQGILAKESLLFETLDEGKIKKSNVTFGCGHMNIKTNNDDAYNGVFGLGAYP 240
           LRYLGGDSSQGILAKESLLFETLDEGKIKKSN+TFGCGHMNIKTNNDDAYNGVFGLGAYP
Sbjct: 182 LRYLGGDSSQGILAKESLLFETLDEGKIKKSNITFGCGHMNIKTNNDDAYNGVFGLGAYP 241

Query: 241 HITMATQLGNKFSYCIGDINNPLYTHNHLVLGQGSYIEGDSTPLQIHFGHYYVTLQSISV 300
           HITMATQLGNKFSYCIGDINNPLYTHNHLVLGQGSYIEGDSTPLQIHFGHYYVTLQSISV
Sbjct: 242 HITMATQLGNKFSYCIGDINNPLYTHNHLVLGQGSYIEGDSTPLQIHFGHYYVTLQSISV 301

Query: 301 GSKTLKIDPNAFKISSDGSGGVLIDSGMTYTKLANGGFELLYDEIVDLMKGLLERIPTQR 360
           GSKTLKIDPNAFKISSDGSGGVLIDSGMTYTKLANGGFELLYDEIVDLMKGLLERIPTQR
Sbjct: 302 GSKTLKIDPNAFKISSDGSGGVLIDSGMTYTKLANGGFELLYDEIVDLMKGLLERIPTQR 361

Query: 361 KFEGLCFKGVVSRDLVGFPAVTFHFAGGADLVLESGSLFRQHGGDRFCLAILPSNSELLN 420
           KFEGLCFKGVVSRDLVGFPAVTFHFAGGADLVLESGSLFRQHGGDRFCLAILPSNSELLN
Sbjct: 362 KFEGLCFKGVVSRDLVGFPAVTFHFAGGADLVLESGSLFRQHGGDRFCLAILPSNSELLN 421

Query: 421 LSVIGILAQQNYNVGFDLEQMKVFFRRIDCQLLDE 456
           LSVIGILAQQNYNVGFDLEQMKVFFRRIDCQLLDE
Sbjct: 422 LSVIGILAQQNYNVGFDLEQMKVFFRRIDCQLLDE 456

BLAST of CSPI01G20490 vs. ExPASy TrEMBL
Match: A0A5D3DZ20 (Peptidase A1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold629G002350 PE=3 SV=1)

HSP 1 Score: 884.4 bits (2284), Expect = 2.0e-253
Identity = 440/455 (96.70%), Positives = 443/455 (97.36%), Query Frame = 0

Query: 1   MILLASLHHLLPSLILAFYLSTAIISSTLITTKPSRLATKLIHRNSYLHPLYDQNETVED 60
           MILLASLHHLLPSL LAFYLSTAII ST I TKPSRLATKLIHRNSYLHPLYD NETVED
Sbjct: 391 MILLASLHHLLPSLTLAFYLSTAIILSTSIMTKPSRLATKLIHRNSYLHPLYDPNETVED 450

Query: 61  RSKREQTSSIERFGFLESKIKELKSVGNEARSSLIPFNRGSGFLVNLSIGSPPVTQLVVV 120
           RSKREQ SSIERF FLESKIKELKSVGNEARSSLIPFNRGSGFLVNLSIGSPPVTQLVVV
Sbjct: 451 RSKREQASSIERFAFLESKIKELKSVGNEARSSLIPFNRGSGFLVNLSIGSPPVTQLVVV 510

Query: 121 DTGSSLLWVQCLPCINCFQQSTSWFDPLKSVSFKTLGCGFPGYNYINGYKCNRFNQAEYK 180
           DTGSSLLWVQCLPCINCFQQSTSWFDPLKS SFKTLGCGFPGYNYINGYKCN  NQAEYK
Sbjct: 511 DTGSSLLWVQCLPCINCFQQSTSWFDPLKSASFKTLGCGFPGYNYINGYKCNG-NQAEYK 570

Query: 181 LRYLGGDSSQGILAKESLLFETLDEGKIKKSNVTFGCGHMNIKTNNDDAYNGVFGLGAYP 240
           LRYLGGDSSQGILAKESLLFETLDEGKIKK+N+TFGCGHMN KTNNDD YNGVFGLGAYP
Sbjct: 571 LRYLGGDSSQGILAKESLLFETLDEGKIKKTNLTFGCGHMNFKTNNDDTYNGVFGLGAYP 630

Query: 241 HITMATQLGNKFSYCIGDINNPLYTHNHLVLGQGSYIEGDSTPLQIHFGHYYVTLQSISV 300
           HITMATQLGNKFSYCIGDINNPLYTHNHLVLGQGSYIEGDSTPLQIHFGHYYVTLQ+ISV
Sbjct: 631 HITMATQLGNKFSYCIGDINNPLYTHNHLVLGQGSYIEGDSTPLQIHFGHYYVTLQTISV 690

Query: 301 GSKTLKIDPNAFKISSDGSGGVLIDSGMTYTKLANGGFELLYDEIVDLMKGLLERIPTQR 360
           GSKTLKIDPNAFKISSDGSGGVLIDSGMTYTKLANGGFELLYDEIVDLMKGLLERIPTQR
Sbjct: 691 GSKTLKIDPNAFKISSDGSGGVLIDSGMTYTKLANGGFELLYDEIVDLMKGLLERIPTQR 750

Query: 361 KFEGLCFKGVVSRDLVGFPAVTFHFAGGADLVLESGSLFRQHGGDRFCLAILPSNSELLN 420
           KFEGLCFKGVVSRDLVGFPAVTFHFAGGADLVLESGSLFRQHGGDRFCLAILPSNSELLN
Sbjct: 751 KFEGLCFKGVVSRDLVGFPAVTFHFAGGADLVLESGSLFRQHGGDRFCLAILPSNSELLN 810

Query: 421 LSVIGILAQQNYNVGFDLEQMKVFFRRIDCQLLDE 456
           LSVIGILAQQNYNVGFDLEQMKVFFRRIDCQLLDE
Sbjct: 811 LSVIGILAQQNYNVGFDLEQMKVFFRRIDCQLLDE 844

BLAST of CSPI01G20490 vs. ExPASy TrEMBL
Match: A0A5A7UU11 (Aspartic proteinase CDR1-like OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold274G005510 PE=3 SV=1)

HSP 1 Score: 884.4 bits (2284), Expect = 2.0e-253
Identity = 440/455 (96.70%), Positives = 444/455 (97.58%), Query Frame = 0

Query: 1   MILLASLHHLLPSLILAFYLSTAIISSTLITTKPSRLATKLIHRNSYLHPLYDQNETVED 60
           MILLASLHHLLPSL LAFYLSTAII ST ITTKPSRLATKLIHRNSYLHPLYD NETVED
Sbjct: 391 MILLASLHHLLPSLTLAFYLSTAIILSTSITTKPSRLATKLIHRNSYLHPLYDPNETVED 450

Query: 61  RSKREQTSSIERFGFLESKIKELKSVGNEARSSLIPFNRGSGFLVNLSIGSPPVTQLVVV 120
           RSKREQ SSIERF FLESKIKELKSVGNEARSSLIPFNRGSGFLVNLSIGSPPVTQLVVV
Sbjct: 451 RSKREQASSIERFAFLESKIKELKSVGNEARSSLIPFNRGSGFLVNLSIGSPPVTQLVVV 510

Query: 121 DTGSSLLWVQCLPCINCFQQSTSWFDPLKSVSFKTLGCGFPGYNYINGYKCNRFNQAEYK 180
           DTGSSLLWVQCLPCINCFQQSTSWFDPLKS SFKTLGCGFPGYNYINGYKCN  NQAEYK
Sbjct: 511 DTGSSLLWVQCLPCINCFQQSTSWFDPLKSASFKTLGCGFPGYNYINGYKCNG-NQAEYK 570

Query: 181 LRYLGGDSSQGILAKESLLFETLDEGKIKKSNVTFGCGHMNIKTNNDDAYNGVFGLGAYP 240
           LRYLGGDSSQGILAKESLLFETLDEGKIKK+N+TFGCGHMN KTNNDD YNGVFGLGAYP
Sbjct: 571 LRYLGGDSSQGILAKESLLFETLDEGKIKKTNLTFGCGHMNFKTNNDDTYNGVFGLGAYP 630

Query: 241 HITMATQLGNKFSYCIGDINNPLYTHNHLVLGQGSYIEGDSTPLQIHFGHYYVTLQSISV 300
           +ITMATQLGNKFSYCIGDINNPLYTHNHLVLGQGSYIEGDSTPLQIHFGHYYVTLQ+ISV
Sbjct: 631 YITMATQLGNKFSYCIGDINNPLYTHNHLVLGQGSYIEGDSTPLQIHFGHYYVTLQTISV 690

Query: 301 GSKTLKIDPNAFKISSDGSGGVLIDSGMTYTKLANGGFELLYDEIVDLMKGLLERIPTQR 360
           GSKTLKIDPNAFKISSDGSGGVLIDSGMTYTKLANGGFELLYDEIVDLMKGLLERIPTQR
Sbjct: 691 GSKTLKIDPNAFKISSDGSGGVLIDSGMTYTKLANGGFELLYDEIVDLMKGLLERIPTQR 750

Query: 361 KFEGLCFKGVVSRDLVGFPAVTFHFAGGADLVLESGSLFRQHGGDRFCLAILPSNSELLN 420
           KFEGLCFKGVVSRDLVGFPAVTFHFAGGADLVLESGSLFRQHGGDRFCLAILPSNSELLN
Sbjct: 751 KFEGLCFKGVVSRDLVGFPAVTFHFAGGADLVLESGSLFRQHGGDRFCLAILPSNSELLN 810

Query: 421 LSVIGILAQQNYNVGFDLEQMKVFFRRIDCQLLDE 456
           LSVIGILAQQNYNVGFDLEQMKVFFRRIDCQLLDE
Sbjct: 811 LSVIGILAQQNYNVGFDLEQMKVFFRRIDCQLLDE 844

BLAST of CSPI01G20490 vs. ExPASy TrEMBL
Match: A0A1S3BY28 (aspartic proteinase CDR1-like OS=Cucumis melo OX=3656 GN=LOC103494349 PE=3 SV=1)

HSP 1 Score: 870.9 bits (2249), Expect = 2.3e-249
Identity = 438/471 (92.99%), Positives = 443/471 (94.06%), Query Frame = 0

Query: 1   MILLASLHHLLPSLILAFYLSTAIISSTLITTKPSRLATKLIHRNSYLHPLYDQNETVED 60
           MILLASLHHLLPSL LAFYLSTAII ST ITTKPSRLATKLIHRNSYLHPLYD NETVED
Sbjct: 15  MILLASLHHLLPSLTLAFYLSTAIILSTSITTKPSRLATKLIHRNSYLHPLYDPNETVED 74

Query: 61  RSKREQTSSIERFGFLESKIKELKSVGNEARSSLIPFNRGSGFLVNLSIGSPPVTQLVVV 120
           RSKREQ SSIERF FLESKIKELKSVGNEARSSLIPFNRGSGFLVNLSIGSPPVTQLVVV
Sbjct: 75  RSKREQASSIERFAFLESKIKELKSVGNEARSSLIPFNRGSGFLVNLSIGSPPVTQLVVV 134

Query: 121 DTGSSLLWVQCLPCINCFQQSTSWFDPLKSVSFKTLGCGFPGYNYINGYKCNRFNQAEYK 180
           DTGSSLLWVQCLPCINCFQQSTSWFDPLKS SFKTLGCGFPGYNYINGYKCN  NQAEYK
Sbjct: 135 DTGSSLLWVQCLPCINCFQQSTSWFDPLKSASFKTLGCGFPGYNYINGYKCNG-NQAEYK 194

Query: 181 LRYLGGDSSQGILAKESLLFETLDEGKI----------------KKSNVTFGCGHMNIKT 240
           LRYLGGDSSQGILAKESLLFETLDEG +                KK+N+TFGCGHMN KT
Sbjct: 195 LRYLGGDSSQGILAKESLLFETLDEGGVFQYNALFQHKQVNKQNKKTNLTFGCGHMNFKT 254

Query: 241 NNDDAYNGVFGLGAYPHITMATQLGNKFSYCIGDINNPLYTHNHLVLGQGSYIEGDSTPL 300
           NNDD YNGVFGLGAYP+ITMATQLGNKFSYCIGDINNPLYTHNHLVLGQGSYIEGDSTPL
Sbjct: 255 NNDDTYNGVFGLGAYPYITMATQLGNKFSYCIGDINNPLYTHNHLVLGQGSYIEGDSTPL 314

Query: 301 QIHFGHYYVTLQSISVGSKTLKIDPNAFKISSDGSGGVLIDSGMTYTKLANGGFELLYDE 360
           QIHFGHYYVTLQ+ISVGSKTLKIDPNAFKISSDGSGGVLIDSGMTYTKLANGGFELLYDE
Sbjct: 315 QIHFGHYYVTLQTISVGSKTLKIDPNAFKISSDGSGGVLIDSGMTYTKLANGGFELLYDE 374

Query: 361 IVDLMKGLLERIPTQRKFEGLCFKGVVSRDLVGFPAVTFHFAGGADLVLESGSLFRQHGG 420
           IVDLMKGLLERIPTQRKFEGLCFKGVVSRDLVGFPAVTFHFAGGADLVLESGSLFRQHGG
Sbjct: 375 IVDLMKGLLERIPTQRKFEGLCFKGVVSRDLVGFPAVTFHFAGGADLVLESGSLFRQHGG 434

Query: 421 DRFCLAILPSNSELLNLSVIGILAQQNYNVGFDLEQMKVFFRRIDCQLLDE 456
           DRFCLAILPSNSELLNLSVIGILAQQNYNVGFDLEQMKVFFRRIDCQLLDE
Sbjct: 435 DRFCLAILPSNSELLNLSVIGILAQQNYNVGFDLEQMKVFFRRIDCQLLDE 484

BLAST of CSPI01G20490 vs. ExPASy TrEMBL
Match: A0A6J1C0L1 (probable aspartic protease At2g35615 OS=Momordica charantia OX=3673 GN=LOC111006977 PE=3 SV=1)

HSP 1 Score: 787.7 bits (2033), Expect = 2.6e-224
Identity = 380/454 (83.70%), Positives = 414/454 (91.19%), Query Frame = 0

Query: 3   LLASLHHLLPSLILAFYLSTAIISSTLITTKPSRLATKLIHRNSYLHPLYDQNETVEDRS 62
           LL SLHHLLP L  AFYLSTA+ISST +TTKPSRL TKLIHRNSYLHPLYD NETVEDRS
Sbjct: 4   LLLSLHHLLPFLTSAFYLSTAVISSTDVTTKPSRLVTKLIHRNSYLHPLYDPNETVEDRS 63

Query: 63  KREQTSSIERFGFLESKIKELKSV-GNEARSSLIPFNRGSGFLVNLSIGSPPVTQLVVVD 122
           KRE+TSS ERF +LESKIKEL SV GNEAR++LIPFN+GSGFLVN SIG PPVTQL VVD
Sbjct: 64  KREETSSTERFAYLESKIKELNSVGGNEARANLIPFNQGSGFLVNFSIGQPPVTQLAVVD 123

Query: 123 TGSSLLWVQCLPCINCFQQSTSWFDPLKSVSFKTLGCGFPGYNYINGYKCNRFNQAEYKL 182
           TGSSLLWVQCLPC+NCF+QS SWFDPLKS SFK L C FPG+NY+ GYKCN F+QAEYKL
Sbjct: 124 TGSSLLWVQCLPCVNCFRQSGSWFDPLKSWSFKILDCDFPGHNYVRGYKCNDFHQAEYKL 183

Query: 183 RYLGGDSSQGILAKESLLFETLDEGKIKKSNVTFGCGHMNIKTNNDDAYNGVFGLGAYPH 242
           RYLGGD+S+GILAKESLLFET DEGKI+K+N+TFGCGHMN KTN DD YNGVFGLG YPH
Sbjct: 184 RYLGGDTSEGILAKESLLFETPDEGKIRKANLTFGCGHMNTKTNRDDTYNGVFGLGGYPH 243

Query: 243 ITMATQLGNKFSYCIGDINNPLYTHNHLVLGQGSYIEGDSTPLQIHFGHYYVTLQSISVG 302
           ITMATQLGNKFSYCIGDI +PLY HNHL LG G++IEGDSTPLQI FGHYYV+L+ ISVG
Sbjct: 244 ITMATQLGNKFSYCIGDITDPLYAHNHLFLGHGAFIEGDSTPLQIIFGHYYVSLEGISVG 303

Query: 303 SKTLKIDPNAFKISSDGSGGVLIDSGMTYTKLANGGFELLYDEIVDLMKGLLERIPTQRK 362
           SK LKIDPNAF+++SDG GGVLIDSGMTYTKL NGGFELL+DEI DLMKG+LERIPT+RK
Sbjct: 304 SKRLKIDPNAFQMTSDGRGGVLIDSGMTYTKLTNGGFELLFDEIADLMKGVLERIPTRRK 363

Query: 363 FEGLCFKGVVSRDLVGFPAVTFHFAGGADLVLESGSLFRQHGGDRFCLAILPSNSELLNL 422
           FEGLC+KGVV RDLVG P VTFHFAGGADLVLESGSLFRQHGGDRFCLA+LPSNSE++NL
Sbjct: 364 FEGLCYKGVVGRDLVGLPPVTFHFAGGADLVLESGSLFRQHGGDRFCLAVLPSNSEMMNL 423

Query: 423 SVIGILAQQNYNVGFDLEQMKVFFRRIDCQLLDE 456
           SVIG+LAQQNYNVGFDLEQMKVFFRRIDCQLL +
Sbjct: 424 SVIGVLAQQNYNVGFDLEQMKVFFRRIDCQLLGD 457

BLAST of CSPI01G20490 vs. NCBI nr
Match: XP_031742409.1 (aspartic proteinase CDR1 [Cucumis sativus] >KGN65478.1 hypothetical protein Csa_020065 [Cucumis sativus])

HSP 1 Score: 912.5 bits (2357), Expect = 1.4e-261
Identity = 452/455 (99.34%), Positives = 453/455 (99.56%), Query Frame = 0

Query: 1   MILLASLHHLLPSLILAFYLSTAIISSTLITTKPSRLATKLIHRNSYLHPLYDQNETVED 60
           MILLASLHHLLPSL LAFYLSTAIISSTLITTKPSRLATKLIHRNSYLHPLYDQNETVED
Sbjct: 2   MILLASLHHLLPSLTLAFYLSTAIISSTLITTKPSRLATKLIHRNSYLHPLYDQNETVED 61

Query: 61  RSKREQTSSIERFGFLESKIKELKSVGNEARSSLIPFNRGSGFLVNLSIGSPPVTQLVVV 120
           RSKREQTSSIERF FLESKIKELKSVGNEARSSLIPFNRGSGFLVNLSIGSPPVTQLVVV
Sbjct: 62  RSKREQTSSIERFDFLESKIKELKSVGNEARSSLIPFNRGSGFLVNLSIGSPPVTQLVVV 121

Query: 121 DTGSSLLWVQCLPCINCFQQSTSWFDPLKSVSFKTLGCGFPGYNYINGYKCNRFNQAEYK 180
           DTGSSLLWVQCLPCINCFQQSTSWFDPLKSVSFKTLGCGFPGYNYINGYKCNRFNQAEYK
Sbjct: 122 DTGSSLLWVQCLPCINCFQQSTSWFDPLKSVSFKTLGCGFPGYNYINGYKCNRFNQAEYK 181

Query: 181 LRYLGGDSSQGILAKESLLFETLDEGKIKKSNVTFGCGHMNIKTNNDDAYNGVFGLGAYP 240
           LRYLGGDSSQGILAKESLLFETLDEGKIKKSN+TFGCGHMNIKTNNDDAYNGVFGLGAYP
Sbjct: 182 LRYLGGDSSQGILAKESLLFETLDEGKIKKSNITFGCGHMNIKTNNDDAYNGVFGLGAYP 241

Query: 241 HITMATQLGNKFSYCIGDINNPLYTHNHLVLGQGSYIEGDSTPLQIHFGHYYVTLQSISV 300
           HITMATQLGNKFSYCIGDINNPLYTHNHLVLGQGSYIEGDSTPLQIHFGHYYVTLQSISV
Sbjct: 242 HITMATQLGNKFSYCIGDINNPLYTHNHLVLGQGSYIEGDSTPLQIHFGHYYVTLQSISV 301

Query: 301 GSKTLKIDPNAFKISSDGSGGVLIDSGMTYTKLANGGFELLYDEIVDLMKGLLERIPTQR 360
           GSKTLKIDPNAFKISSDGSGGVLIDSGMTYTKLANGGFELLYDEIVDLMKGLLERIPTQR
Sbjct: 302 GSKTLKIDPNAFKISSDGSGGVLIDSGMTYTKLANGGFELLYDEIVDLMKGLLERIPTQR 361

Query: 361 KFEGLCFKGVVSRDLVGFPAVTFHFAGGADLVLESGSLFRQHGGDRFCLAILPSNSELLN 420
           KFEGLCFKGVVSRDLVGFPAVTFHFAGGADLVLESGSLFRQHGGDRFCLAILPSNSELLN
Sbjct: 362 KFEGLCFKGVVSRDLVGFPAVTFHFAGGADLVLESGSLFRQHGGDRFCLAILPSNSELLN 421

Query: 421 LSVIGILAQQNYNVGFDLEQMKVFFRRIDCQLLDE 456
           LSVIGILAQQNYNVGFDLEQMKVFFRRIDCQLLDE
Sbjct: 422 LSVIGILAQQNYNVGFDLEQMKVFFRRIDCQLLDE 456

BLAST of CSPI01G20490 vs. NCBI nr
Match: KAA0058227.1 (aspartic proteinase CDR1-like [Cucumis melo var. makuwa])

HSP 1 Score: 884.4 bits (2284), Expect = 4.2e-253
Identity = 440/455 (96.70%), Positives = 444/455 (97.58%), Query Frame = 0

Query: 1   MILLASLHHLLPSLILAFYLSTAIISSTLITTKPSRLATKLIHRNSYLHPLYDQNETVED 60
           MILLASLHHLLPSL LAFYLSTAII ST ITTKPSRLATKLIHRNSYLHPLYD NETVED
Sbjct: 391 MILLASLHHLLPSLTLAFYLSTAIILSTSITTKPSRLATKLIHRNSYLHPLYDPNETVED 450

Query: 61  RSKREQTSSIERFGFLESKIKELKSVGNEARSSLIPFNRGSGFLVNLSIGSPPVTQLVVV 120
           RSKREQ SSIERF FLESKIKELKSVGNEARSSLIPFNRGSGFLVNLSIGSPPVTQLVVV
Sbjct: 451 RSKREQASSIERFAFLESKIKELKSVGNEARSSLIPFNRGSGFLVNLSIGSPPVTQLVVV 510

Query: 121 DTGSSLLWVQCLPCINCFQQSTSWFDPLKSVSFKTLGCGFPGYNYINGYKCNRFNQAEYK 180
           DTGSSLLWVQCLPCINCFQQSTSWFDPLKS SFKTLGCGFPGYNYINGYKCN  NQAEYK
Sbjct: 511 DTGSSLLWVQCLPCINCFQQSTSWFDPLKSASFKTLGCGFPGYNYINGYKCNG-NQAEYK 570

Query: 181 LRYLGGDSSQGILAKESLLFETLDEGKIKKSNVTFGCGHMNIKTNNDDAYNGVFGLGAYP 240
           LRYLGGDSSQGILAKESLLFETLDEGKIKK+N+TFGCGHMN KTNNDD YNGVFGLGAYP
Sbjct: 571 LRYLGGDSSQGILAKESLLFETLDEGKIKKTNLTFGCGHMNFKTNNDDTYNGVFGLGAYP 630

Query: 241 HITMATQLGNKFSYCIGDINNPLYTHNHLVLGQGSYIEGDSTPLQIHFGHYYVTLQSISV 300
           +ITMATQLGNKFSYCIGDINNPLYTHNHLVLGQGSYIEGDSTPLQIHFGHYYVTLQ+ISV
Sbjct: 631 YITMATQLGNKFSYCIGDINNPLYTHNHLVLGQGSYIEGDSTPLQIHFGHYYVTLQTISV 690

Query: 301 GSKTLKIDPNAFKISSDGSGGVLIDSGMTYTKLANGGFELLYDEIVDLMKGLLERIPTQR 360
           GSKTLKIDPNAFKISSDGSGGVLIDSGMTYTKLANGGFELLYDEIVDLMKGLLERIPTQR
Sbjct: 691 GSKTLKIDPNAFKISSDGSGGVLIDSGMTYTKLANGGFELLYDEIVDLMKGLLERIPTQR 750

Query: 361 KFEGLCFKGVVSRDLVGFPAVTFHFAGGADLVLESGSLFRQHGGDRFCLAILPSNSELLN 420
           KFEGLCFKGVVSRDLVGFPAVTFHFAGGADLVLESGSLFRQHGGDRFCLAILPSNSELLN
Sbjct: 751 KFEGLCFKGVVSRDLVGFPAVTFHFAGGADLVLESGSLFRQHGGDRFCLAILPSNSELLN 810

Query: 421 LSVIGILAQQNYNVGFDLEQMKVFFRRIDCQLLDE 456
           LSVIGILAQQNYNVGFDLEQMKVFFRRIDCQLLDE
Sbjct: 811 LSVIGILAQQNYNVGFDLEQMKVFFRRIDCQLLDE 844

BLAST of CSPI01G20490 vs. NCBI nr
Match: TYK28585.1 (Peptidase A1 [Cucumis melo var. makuwa])

HSP 1 Score: 884.4 bits (2284), Expect = 4.2e-253
Identity = 440/455 (96.70%), Positives = 443/455 (97.36%), Query Frame = 0

Query: 1   MILLASLHHLLPSLILAFYLSTAIISSTLITTKPSRLATKLIHRNSYLHPLYDQNETVED 60
           MILLASLHHLLPSL LAFYLSTAII ST I TKPSRLATKLIHRNSYLHPLYD NETVED
Sbjct: 391 MILLASLHHLLPSLTLAFYLSTAIILSTSIMTKPSRLATKLIHRNSYLHPLYDPNETVED 450

Query: 61  RSKREQTSSIERFGFLESKIKELKSVGNEARSSLIPFNRGSGFLVNLSIGSPPVTQLVVV 120
           RSKREQ SSIERF FLESKIKELKSVGNEARSSLIPFNRGSGFLVNLSIGSPPVTQLVVV
Sbjct: 451 RSKREQASSIERFAFLESKIKELKSVGNEARSSLIPFNRGSGFLVNLSIGSPPVTQLVVV 510

Query: 121 DTGSSLLWVQCLPCINCFQQSTSWFDPLKSVSFKTLGCGFPGYNYINGYKCNRFNQAEYK 180
           DTGSSLLWVQCLPCINCFQQSTSWFDPLKS SFKTLGCGFPGYNYINGYKCN  NQAEYK
Sbjct: 511 DTGSSLLWVQCLPCINCFQQSTSWFDPLKSASFKTLGCGFPGYNYINGYKCNG-NQAEYK 570

Query: 181 LRYLGGDSSQGILAKESLLFETLDEGKIKKSNVTFGCGHMNIKTNNDDAYNGVFGLGAYP 240
           LRYLGGDSSQGILAKESLLFETLDEGKIKK+N+TFGCGHMN KTNNDD YNGVFGLGAYP
Sbjct: 571 LRYLGGDSSQGILAKESLLFETLDEGKIKKTNLTFGCGHMNFKTNNDDTYNGVFGLGAYP 630

Query: 241 HITMATQLGNKFSYCIGDINNPLYTHNHLVLGQGSYIEGDSTPLQIHFGHYYVTLQSISV 300
           HITMATQLGNKFSYCIGDINNPLYTHNHLVLGQGSYIEGDSTPLQIHFGHYYVTLQ+ISV
Sbjct: 631 HITMATQLGNKFSYCIGDINNPLYTHNHLVLGQGSYIEGDSTPLQIHFGHYYVTLQTISV 690

Query: 301 GSKTLKIDPNAFKISSDGSGGVLIDSGMTYTKLANGGFELLYDEIVDLMKGLLERIPTQR 360
           GSKTLKIDPNAFKISSDGSGGVLIDSGMTYTKLANGGFELLYDEIVDLMKGLLERIPTQR
Sbjct: 691 GSKTLKIDPNAFKISSDGSGGVLIDSGMTYTKLANGGFELLYDEIVDLMKGLLERIPTQR 750

Query: 361 KFEGLCFKGVVSRDLVGFPAVTFHFAGGADLVLESGSLFRQHGGDRFCLAILPSNSELLN 420
           KFEGLCFKGVVSRDLVGFPAVTFHFAGGADLVLESGSLFRQHGGDRFCLAILPSNSELLN
Sbjct: 751 KFEGLCFKGVVSRDLVGFPAVTFHFAGGADLVLESGSLFRQHGGDRFCLAILPSNSELLN 810

Query: 421 LSVIGILAQQNYNVGFDLEQMKVFFRRIDCQLLDE 456
           LSVIGILAQQNYNVGFDLEQMKVFFRRIDCQLLDE
Sbjct: 811 LSVIGILAQQNYNVGFDLEQMKVFFRRIDCQLLDE 844

BLAST of CSPI01G20490 vs. NCBI nr
Match: XP_008453701.1 (PREDICTED: aspartic proteinase CDR1-like [Cucumis melo])

HSP 1 Score: 870.9 bits (2249), Expect = 4.8e-249
Identity = 438/471 (92.99%), Positives = 443/471 (94.06%), Query Frame = 0

Query: 1   MILLASLHHLLPSLILAFYLSTAIISSTLITTKPSRLATKLIHRNSYLHPLYDQNETVED 60
           MILLASLHHLLPSL LAFYLSTAII ST ITTKPSRLATKLIHRNSYLHPLYD NETVED
Sbjct: 15  MILLASLHHLLPSLTLAFYLSTAIILSTSITTKPSRLATKLIHRNSYLHPLYDPNETVED 74

Query: 61  RSKREQTSSIERFGFLESKIKELKSVGNEARSSLIPFNRGSGFLVNLSIGSPPVTQLVVV 120
           RSKREQ SSIERF FLESKIKELKSVGNEARSSLIPFNRGSGFLVNLSIGSPPVTQLVVV
Sbjct: 75  RSKREQASSIERFAFLESKIKELKSVGNEARSSLIPFNRGSGFLVNLSIGSPPVTQLVVV 134

Query: 121 DTGSSLLWVQCLPCINCFQQSTSWFDPLKSVSFKTLGCGFPGYNYINGYKCNRFNQAEYK 180
           DTGSSLLWVQCLPCINCFQQSTSWFDPLKS SFKTLGCGFPGYNYINGYKCN  NQAEYK
Sbjct: 135 DTGSSLLWVQCLPCINCFQQSTSWFDPLKSASFKTLGCGFPGYNYINGYKCNG-NQAEYK 194

Query: 181 LRYLGGDSSQGILAKESLLFETLDEGKI----------------KKSNVTFGCGHMNIKT 240
           LRYLGGDSSQGILAKESLLFETLDEG +                KK+N+TFGCGHMN KT
Sbjct: 195 LRYLGGDSSQGILAKESLLFETLDEGGVFQYNALFQHKQVNKQNKKTNLTFGCGHMNFKT 254

Query: 241 NNDDAYNGVFGLGAYPHITMATQLGNKFSYCIGDINNPLYTHNHLVLGQGSYIEGDSTPL 300
           NNDD YNGVFGLGAYP+ITMATQLGNKFSYCIGDINNPLYTHNHLVLGQGSYIEGDSTPL
Sbjct: 255 NNDDTYNGVFGLGAYPYITMATQLGNKFSYCIGDINNPLYTHNHLVLGQGSYIEGDSTPL 314

Query: 301 QIHFGHYYVTLQSISVGSKTLKIDPNAFKISSDGSGGVLIDSGMTYTKLANGGFELLYDE 360
           QIHFGHYYVTLQ+ISVGSKTLKIDPNAFKISSDGSGGVLIDSGMTYTKLANGGFELLYDE
Sbjct: 315 QIHFGHYYVTLQTISVGSKTLKIDPNAFKISSDGSGGVLIDSGMTYTKLANGGFELLYDE 374

Query: 361 IVDLMKGLLERIPTQRKFEGLCFKGVVSRDLVGFPAVTFHFAGGADLVLESGSLFRQHGG 420
           IVDLMKGLLERIPTQRKFEGLCFKGVVSRDLVGFPAVTFHFAGGADLVLESGSLFRQHGG
Sbjct: 375 IVDLMKGLLERIPTQRKFEGLCFKGVVSRDLVGFPAVTFHFAGGADLVLESGSLFRQHGG 434

Query: 421 DRFCLAILPSNSELLNLSVIGILAQQNYNVGFDLEQMKVFFRRIDCQLLDE 456
           DRFCLAILPSNSELLNLSVIGILAQQNYNVGFDLEQMKVFFRRIDCQLLDE
Sbjct: 435 DRFCLAILPSNSELLNLSVIGILAQQNYNVGFDLEQMKVFFRRIDCQLLDE 484
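
The Identity and Positives percentages in an HSP header are the matching-column count divided by the alignment length (which includes gap columns), so 438/471 gives 92.99%. The following is a minimal Python sketch of that arithmetic, not part of the database's own tooling: the function names `parse_hsp_stats` and `percent` are hypothetical, and the regular expression assumes the header layout shown on this page (it accepts either spelling of the "Positives" label).

```python
import re

# Pattern for the HSP statistics line as printed on this page.
# "Pos(?:i)?tives" tolerates both "Positives" and the "Postives" spelling.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Pos(?:i)?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def percent(count: int, length: int) -> float:
    """Return count/length as a percentage rounded to two decimals."""
    return round(100.0 * count / length, 2)


def parse_hsp_stats(line: str) -> dict:
    """Extract the identity/positive counts and recompute their percentages."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    ident, ident_len, _, pos, pos_len, _ = match.groups()
    return {
        "identical": int(ident),
        "positives": int(pos),
        "alignment_length": int(ident_len),
        "identity_pct": percent(int(ident), int(ident_len)),
        "positives_pct": percent(int(pos), int(pos_len)),
    }


if __name__ == "__main__":
    header = "Identity = 438/471 (92.99%), Positives = 443/471 (94.06%), Query Frame = 0"
    print(parse_hsp_stats(header))
```

Recomputing the percentages from the raw counts, rather than trusting the printed values, is a cheap consistency check when HSP headers are collected from many pages.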

BLAST of CSPI01G20490 vs. NCBI nr
Match: XP_038878960.1 (aspartic proteinase CDR1-like isoform X1 [Benincasa hispida])

HSP 1 Score: 834.7 bits (2155), Expect = 3.8e-238
Identity = 406/455 (89.23%), Positives = 430/455 (94.51%), Query Frame = 0

Query: 1   MILLASLHHLLPSLILAFYLSTAIISSTLITTKPSRLATKLIHRNSYLHPLYDQNETVED 60
           +ILL S +HLLPSL LAFYLSTAIISS   TTKPSRLATKLIHRNSYLHPLYD  ET+ED
Sbjct: 2   VILLVSQYHLLPSLTLAFYLSTAIISSMAFTTKPSRLATKLIHRNSYLHPLYDPTETIED 61

Query: 61  RSKREQTSSIERFGFLESKIKELKSVGNEARSSLIPFNRGSGFLVNLSIGSPPVTQLVVV 120
           RSKRE+TSSIERF +LESKIKELKSVGNEARSSLIPFN+GSGFLVNLSIGSPPVTQLVV 
Sbjct: 62  RSKREETSSIERFAYLESKIKELKSVGNEARSSLIPFNQGSGFLVNLSIGSPPVTQLVVA 121

Query: 121 DTGSSLLWVQCLPCINCFQQSTSWFDPLKSVSFKTLGCGFPGYNYINGYKCNRFNQAEYK 180
           DTGSSLLWVQCLPCI+CF+QS SWFDPLKS SFK LGCGF GYNYI+GY+CN FNQAEYK
Sbjct: 122 DTGSSLLWVQCLPCIDCFRQSNSWFDPLKSTSFKILGCGFAGYNYISGYRCNGFNQAEYK 181

Query: 181 LRYLGGDSSQGILAKESLLFETLDEGKIKKSNVTFGCGHMNIKTNNDDAYNGVFGLGAYP 240
           LRYLGGD+SQG+LAKESLLFETLDEGKIKK+N+TFGCGHMN KTNNDD YNGVFGLGAYP
Sbjct: 182 LRYLGGDTSQGVLAKESLLFETLDEGKIKKTNLTFGCGHMNSKTNNDDTYNGVFGLGAYP 241

Query: 241 HITMATQLGNKFSYCIGDINNPLYTHNHLVLGQGSYIEGDSTPLQIHFGHYYVTLQSISV 300
           +ITMATQLGNKFSYCIGDINNP+YTHNHLVLG+GSYIEGDSTPLQIHFGHYYVTLQ ISV
Sbjct: 242 YITMATQLGNKFSYCIGDINNPVYTHNHLVLGEGSYIEGDSTPLQIHFGHYYVTLQGISV 301

Query: 301 GSKTLKIDPNAFKISSDGSGGVLIDSGMTYTKLANGGFELLYDEIVDLMKGLLERIPTQR 360
           GSK LKIDP AF+++ DG GGVLIDSGMTYTKLANGGFELLYDEI+DLM GLLERIPT+R
Sbjct: 302 GSKRLKIDPKAFQMTWDGRGGVLIDSGMTYTKLANGGFELLYDEILDLMTGLLERIPTER 361

Query: 361 KFEGLCFKGVVSRDLVGFPAVTFHFAGGADLVLESGSLFRQHGGDRFCLAILPSNSELLN 420
           KFEGLCFKGVVSRDL+GFP VTFHFAGGADLVLESGSLFRQHGGDRFCLAILPSNSE+LN
Sbjct: 362 KFEGLCFKGVVSRDLIGFPTVTFHFAGGADLVLESGSLFRQHGGDRFCLAILPSNSEMLN 421

Query: 421 LSVIGILAQQNYNVGFDLEQMKVFFRRIDCQLLDE 456
           LSVIGILAQQNYNV FDLEQMKVFFRRIDCQLLDE
Sbjct: 422 LSVIGILAQQNYNVAFDLEQMKVFFRRIDCQLLDE 456

BLAST of CSPI01G20490 vs. TAIR 10
Match: AT4G30030.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 315.1 bits (806), Expect = 9.4e-86
Identity = 167/396 (42.17%), Positives = 239/396 (60.35%), Query Frame = 0

Query: 61  RSKREQTSSIERFGFLESKIKELKSVGN-EARSSLIPFNRGSGFLVNLSIGSPPVTQLVV 120
           R+K +++S I + G+L SK      + N    S + P    + FL N+SIG+PPV QL++
Sbjct: 36  RTKTQESSKI-KIGYLHSKSTPASRLDNLWTVSHVTPIPNPAAFLANISIGNPPVPQLLL 95

Query: 121 VDTGSSLLWVQCLPCINCFQQSTSWFDPLKSVSFKTLGCGFPGYNYINGYKCNRFNQAEY 180
           +DTGS L W+ CLPC  C+ Q+  +F P +S +++   C    +     ++  +    +Y
Sbjct: 96  IDTGSDLTWIHCLPC-KCYPQTIPFFHPSRSSTYRNASCVSAPHAMPQIFRDEKTGNCQY 155

Query: 181 KLRYLGGDSSQGILAKESLLFETLDEGKIKKSNVTFGCGHMNIKTNNDDAYNGVFGLGAY 240
            LRY    +++GILA+E L FET D+G I K N+ FGCG  N   +    Y+GV GLG  
Sbjct: 156 HLRYRDFSNTRGILAEEKLTFETSDDGLISKQNIVFGCGQDN---SGFTKYSGVLGLGPG 215

Query: 241 PHITMATQLGNKFSYCIGDINNPLYTHNHLVLGQGSYIEGDSTPLQIHFGHYYVTLQSIS 300
               +    G+KFSYC G + NP Y HN L+LG G+ IEGD TPLQI    YY+ LQ+IS
Sbjct: 216 TFSIVTRNFGSKFSYCFGSLTNPTYPHNILILGNGAKIEGDPTPLQIFQDRYYLDLQAIS 275

Query: 301 VGSKTLKIDPNAFKISSDGSGGVLIDSGMTYTKLANGGFELLYDEIVDLMKGLLERIPTQ 360
            G K L I+P  F+      GG +ID+G + T LA   +E L +EI  L+  +L R+   
Sbjct: 276 FGEKLLDIEPGTFQ-RYRSQGGTVIDTGCSPTILAREAYETLSEEIDFLLGEVLRRVKDW 335

Query: 361 RKFEGLCFKGVVSRDLVGFPAVTFHFAGGADLVLESGSLF-RQHGGDRFCLAILPSNSEL 420
            ++   C++G +  DL GFP VTFHFAGGA+L L+  SLF     GD FCLA+  +  + 
Sbjct: 336 DQYTTPCYEGNLKLDLYGFPVVTFHFAGGAELALDVESLFVSSESGDSFCLAMTMNTFD- 395

Query: 421 LNLSVIGILAQQNYNVGFDLEQMKVFFRRIDCQLLD 455
            ++SVIG +AQQNYNVG++L  MKV+F+R DC+++D
Sbjct: 396 -DMSVIGAMAQQNYNVGYNLRTMKVYFQRTDCEIID 423

BLAST of CSPI01G20490 vs. TAIR 10
Match: AT2G23945.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 291.2 bits (744), Expect = 1.4e-78
Identity = 179/457 (39.17%), Positives = 262/457 (57.33%), Query Frame = 0

Query: 13  SLILAFYLSTAIISSTLITTKPSRLATKLIHRNSY--LHPLYDQNETVEDRSKREQTSSI 72
           SL+L   +S  +++ ++   KP+R+A KLIHR S   L+P      T ED  K     S 
Sbjct: 8   SLLLFITVSYFVVTESI---KPNRMAMKLIHRESVARLNPNARVPITPEDHIKHLTDISS 67

Query: 73  ERFGFLESKI-KELKSVGNEARSSLIPFNRGSGFLVNLSIGSPPVTQLVVVDTGSSLLWV 132
            RF +L++ I KEL S  +  +  +    + S FLVN S+G PPV QL ++DTGSSLLW+
Sbjct: 68  ARFKYLQNSIDKELGS--SNFQVDVEQAIKTSLFLVNFSVGQPPVPQLTIMDTGSSLLWI 127

Query: 133 QCLPCINCFQQST--SWFDPLKSVSFKTLGCGFPGYNYINGYKCNRFNQAEYKLRYLGGD 192
           QC PC +C         F+P  S +F    C      Y     C   N+  Y+  Y+ G 
Sbjct: 128 QCQPCKHCSSDHMIHPVFNPALSSTFVECSCDDRFCRYAPNGHCGSSNKCVYEQVYISGT 187

Query: 193 SSQGILAKESLLFETLDEGKIKKSNVTFGCGHMNIKTNNDDAYNGVFGLGAYPHITMATQ 252
            S+G+LAKE L F T +   +    + FGCG+ N     +  + G+ GLGA P  ++A Q
Sbjct: 188 GSKGVLAKERLTFTTPNGNTVVTQPIAFGCGYEN-GEQLESHFTGILGLGAKP-TSLAVQ 247

Query: 253 LGNKFSYCIGDINNPLYTHNHLVLGQGSYIEGDSTPLQIHFGH--YYVTLQSISVGSKTL 312
           LG+KFSYCIGD+ N  Y +N LVLG+ + I GD TP++    +  YY+ L+ ISVG   L
Sbjct: 248 LGSKFSYCIGDLANKNYGYNQLVLGEDADILGDPTPIEFETENSIYYMNLEGISVGDTQL 307

Query: 313 KIDPNAFKISSDGSGGVLIDSGMTYTKLANGGFELLYDEIVDLMKGLLERIPTQRKFEGL 372
            I+P  FK     + GV++DSG  YT LA+  +  LY+EI  ++   LER   +   + L
Sbjct: 308 NIEPVVFKRRGPRT-GVILDSGTLYTWLADIAYRELYNEIKSILDPKLERFWFR---DFL 367

Query: 373 CFKGVVSRDLVGFPAVTFHFAGGADLVLESGSLF----RQHGGDRFCLAILPS---NSEL 432
           C+ G VS +L+GFP VTFHFAGGA+L +E+ S+F      +  + FC+++ P+     E 
Sbjct: 368 CYHGRVSEELIGFPVVTFHFAGGAELAMEATSMFYPLSEPNTFNVFCMSVKPTKEHGGEY 427

Query: 433 LNLSVIGILAQQNYNVGFDLEQMKVFFRRIDCQLLDE 456
              + IG++AQQ YN+G+DL++  ++ +RIDC  LD+
Sbjct: 428 KEFTAIGLMAQQYYNIGYDLKEKNIYLQRIDCVQLDD 453

BLAST of CSPI01G20490 vs. TAIR 10
Match: AT4G30040.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 272.3 bits (695), Expect = 7.0e-73
Identity = 161/388 (41.49%), Positives = 220/388 (56.70%), Query Frame = 0

Query: 68  SSIERFGFLESKIKELKSVGNEARSSLIPFNRGSGFLVNLSIGSPPVTQLVVVDTGSSLL 127
           +S+ER  +L++K              +IP      FLVN+SIGSPP+TQL+ +DT S LL
Sbjct: 54  ASVERLEYLKAKTTGDIIAHLSPNVPIIP----QAFLVNISIGSPPITQLLHMDTASDLL 113

Query: 128 WVQCLPCINCFQQSTSWFDPLKSVSFKTLGCGFPGYNYINGYKCNRFNQAEYKLRYLGGD 187
           W+QCLPCINC+ QS   FDP +S + +   C    Y+  +          EY +RY+   
Sbjct: 114 WIQCLPCINCYAQSLPIFDPSRSYTHRNETCRTSQYSMPSLKFNANTRSCEYSMRYVDDT 173

Query: 188 SSQGILAKESLLFETL--DEGKIKKSNVTFGCGHMNIKTNNDDAYNGVFGLGAYPHITMA 247
            S+GILA+E LLF T+  +       +V FGCGH N          G+ GLG Y   ++ 
Sbjct: 174 GSKGILAREMLLFNTIYDESSSAALHDVVFGCGHDNY--GEPLVGTGILGLG-YGEFSLV 233

Query: 248 TQLGNKFSYCIGDINNPLYTHNHLVLG-QGSYIEGDSTPLQIHFGHYYVTLQSISVGSKT 307
            + G KFSYC G +++P Y HN LVLG  G+ I GD+TPL+IH G YYVT+++ISV    
Sbjct: 234 HRFGKKFSYCFGSLDDPSYPHNVLVLGDDGANILGDTTPLEIHNGFYYVTIEAISVDGII 293

Query: 308 LKIDPNAF-KISSDGSGGVLIDSGMTYTKLANGGFELLYDEIVDLMKGLLERIPTQR--- 367
           L IDP  F +    G GG +ID+G + T L    ++ L + I D+ +G        +   
Sbjct: 294 LPIDPRVFNRNHQTGLGGTIIDTGNSLTSLVEEAYKPLKNRIEDIFEGRFTAADVSQDDM 353

Query: 368 -KFEGLCFKGVVSRDLV--GFPAVTFHFAGGADLVLESGSLFRQHGGDRFCLAILPSNSE 427
            K E  C+ G   RDLV  GFP VTFHF+ GA+L L+  SLF +   + FCLA+ P    
Sbjct: 354 IKME--CYNGNFERDLVESGFPIVTFHFSEGAELSLDVKSLFMKLSPNVFCLAVTPG--- 413

Query: 428 LLNLSVIGILAQQNYNVGFDLEQMKVFF 446
             NL+ IG  AQQ+YN+G+DLE M+V F
Sbjct: 414 --NLNSIGATAQQSYNIGYDLEAMEVSF 427

BLAST of CSPI01G20490 vs. TAIR 10
Match: AT2G35615.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 186.0 bits (471), Expect = 6.6e-47
Identity = 140/461 (30.37%), Positives = 232/461 (50.33%), Query Frame = 0

Query: 14  LILAFYLSTAIISSTLITTKPSRLATKLIHRNSYLHPLYDQNETVEDRSKREQTSSIERF 73
           ++L F+L  ++  S+  +  P   + +LIHR+S L P+Y+   TV DR       S+ R 
Sbjct: 5   ILLCFFLFFSVTLSS--SGHPKNFSVELIHRDSPLSPIYNPQITVTDRLNAAFLRSVSRS 64

Query: 74  GFLESKIKELKSVGNEARSSLIPFNRGSGFLVNLSIGSPPVTQLVVVDTGSSLLWVQCLP 133
                ++ +      + +S LI       F ++++IG+PP+    + DTGS L WVQC P
Sbjct: 65  RRFNHQLSQ-----TDLQSGLI--GADGEFFMSITIGTPPIKVFAIADTGSDLTWVQCKP 124

Query: 134 CINCFQQSTSWFDPLKSVSFKTLGCGFPGYNYINGYK--CNRFNQAEYKLRYLGGDS--S 193
           C  C++++   FD  KS ++K+  C       ++  +  C+  N    K RY  GD   S
Sbjct: 125 CQQCYKENGPIFDKKKSSTYKSEPCDSRNCQALSSTERGCDESNNI-CKYRYSYGDQSFS 184

Query: 194 QGILAKESLLFETLDEGKIKKSNVTFGCGHMNIKTNNDDAYNGVFGLGAYPHITMATQLG 253
           +G +A E++  ++     +      FGCG+ N  T  D+  +G+ GLG   H+++ +QLG
Sbjct: 185 KGDVATETVSIDSASGSPVSFPGTVFGCGYNNGGT-FDETGSGIIGLGG-GHLSLISQLG 244

Query: 254 N----KFSYCIGDINNPLYTHNHLVLGQG-----SYIEGD----STPL--QIHFGHYYVT 313
           +    KFSYC+   +    T+   V+  G     S +  D    STPL  +    +YY+T
Sbjct: 245 SSISKKFSYCLS--HKSATTNGTSVINLGTNSIPSSLSKDSGVVSTPLVDKEPLTYYYLT 304

Query: 314 LQSISVGSKTL-----KIDPNAFKISSDGSGGVLIDSGMTYTKLANGGFELLYDEIVDLM 373
           L++ISVG K +       +PN   I S+ SG ++IDSG T T L  G F+     + + +
Sbjct: 305 LEAISVGKKKIPYTGSSYNPNDDGILSETSGNIIIDSGTTLTLLEAGFFDKFSSAVEESV 364

Query: 374 KGLLERIPTQRKFEGLCFKGVVSRDLVGFPAVTFHFAGGADLVLESGSLFRQHGGDRFCL 433
            G  +R+   +     CFK   +   +G P +T HF  GAD+ L   + F +   D  CL
Sbjct: 365 TG-AKRVSDPQGLLSHCFKSGSAE--IGLPEITVHFT-GADVRLSPINAFVKLSEDMVCL 424

Query: 434 AILPSNSELLNLSVIGILAQQNYNVGFDLEQMKVFFRRIDC 451
           +++P+      +++ G  AQ ++ VG+DLE   V F+ +DC
Sbjct: 425 SMVPTT----EVAIYGNFAQMDFLVGYDLETRTVSFQHMDC 443

BLAST of CSPI01G20490 vs. TAIR 10
Match: AT5G33340.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 183.3 bits (464), Expect = 4.3e-46
Identity = 141/457 (30.85%), Positives = 212/457 (46.39%), Query Frame = 0

Query: 11  LPSLILAFYLSTAIISSTLITTKPSR----LATKLIHRNSYLHPLYDQNETVEDRSKREQ 70
           + SL  +  LS  ++SS  ++   ++        LIHR+S   P Y+  ET   R +   
Sbjct: 1   MASLFSSVLLSLCLLSSLFLSNANAKPKLGFTADLIHRDSPKSPFYNPMETSSQRLRNAI 60

Query: 71  TSSIER-FGFLESKIKELKSVGNEARSSLIPFNRGSGFLVNLSIGSPPVTQLVVVDTGSS 130
             S+ R F F E          N  +  +   +    +L+N+SIG+PP   + + DTGS 
Sbjct: 61  HRSVNRVFHFTEK--------DNTPQPQIDLTSNSGEYLMNVSIGTPPFPIMAIADTGSD 120

Query: 131 LLWVQCLPCINCFQQSTSWFDPLKSVSFKTLGCGFPGYNYI-NGYKCN-RFNQAEYKLRY 190
           LLW QC PC +C+ Q    FDP  S ++K + C       + N   C+   N   Y L Y
Sbjct: 121 LLWTQCAPCDDCYTQVDPLFDPKTSSTYKDVSCSSSQCTALENQASCSTNDNTCSYSLSY 180

Query: 191 LGGDSSQGILAKESLLFETLDEGKIKKSNVTFGCGHMNIKTNNDDAYNGVFGLGAYPHIT 250
                ++G +A ++L   + D   ++  N+  GCGH N  T N    +G+ GLG  P ++
Sbjct: 181 GDNSYTKGNIAVDTLTLGSSDTRPMQLKNIIIGCGHNNAGTFNKKG-SGIVGLGGGP-VS 240

Query: 251 MATQLGN----KFSYCIGDINNPLYTHNHLVLGQGSYIEGD---STPLQIHFGH---YYV 310
           +  QLG+    KFSYC+  + +     + +  G  + + G    STPL         YY+
Sbjct: 241 LIKQLGDSIDGKFSYCLVPLTSKKDQTSKINFGTNAIVSGSGVVSTPLIAKASQETFYYL 300

Query: 311 TLQSISVGSKTLKIDPNAFKISSDGSGGVLIDSGMTYTKLANGGFELLYDEIVDLMKGLL 370
           TL+SISVGSK ++   +    S    G ++IDSG T T L        Y E+ D +   +
Sbjct: 301 TLKSISVGSKQIQYSGSD---SESSEGNIIIDSGTTLTLLPTE----FYSELEDAVASSI 360

Query: 371 ERIPTQRKFEGLCFKGVVSRDLVGFPAVTFHFAGGADLVLESGSLFRQHGGDRFCLAILP 430
           +    Q    GL      + DL   P +T HF  GAD+ L+S + F Q   D  C A   
Sbjct: 361 DAEKKQDPQSGLSLCYSATGDL-KVPVITMHF-DGADVKLDSSNAFVQVSEDLVCFAFRG 420

Query: 431 SNSELLNLSVIGILAQQNYNVGFDLEQMKVFFRRIDC 451
           S S     S+ G +AQ N+ VG+D     V F+  DC
Sbjct: 421 SPS----FSIYGNVAQMNFLVGYDTVSKTVSFKPTDC 434

The following BLAST results are available for this feature:
Match Name  E-value  Identity (%)  Description
Q3EBM5      9.2e-46  30.37         Probable aspartic protease At2g35615 OS=Arabidopsis thaliana OX=3702 GN=At2g3561...
Q6XBF8      6.0e-45  30.85         Aspartic proteinase CDR1 OS=Arabidopsis thaliana OX=3702 GN=CDR1 PE=1 SV=1
Q766C3      7.3e-43  33.99         Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 S...
Q766C2      4.4e-40  28.70         Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 S...
Q9SV77      4.2e-38  31.11         Aspartyl protease UND OS=Arabidopsis thaliana OX=3702 GN=UND PE=2 SV=1

Match Name  E-value   Identity (%)  Description
A0A0A0LUP5  6.9e-262  99.34         Peptidase A1 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_1G42435...
A0A5D3DZ20  2.0e-253  96.70         Peptidase A1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold629G002350 ...
A0A5A7UU11  2.0e-253  96.70         Aspartic proteinase CDR1-like OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_sc...
A0A1S3BY28  2.3e-249  92.99         aspartic proteinase CDR1-like OS=Cucumis melo OX=3656 GN=LOC103494349 PE=3 SV=1
A0A6J1C0L1  2.6e-224  83.70         probable aspartic protease At2g35615 OS=Momordica charantia OX=3673 GN=LOC111006...

Match Name      E-value   Identity (%)  Description
XP_031742409.1  1.4e-261  99.34         aspartic proteinase CDR1 [Cucumis sativus] >KGN65478.1 hypothetical protein Csa_...
KAA0058227.1    4.2e-253  96.70         aspartic proteinase CDR1-like [Cucumis melo var. makuwa]
TYK28585.1      4.2e-253  96.70         Peptidase A1 [Cucumis melo var. makuwa]
XP_008453701.1  4.8e-249  92.99         PREDICTED: aspartic proteinase CDR1-like [Cucumis melo]
XP_038878960.1  3.8e-238  89.23         aspartic proteinase CDR1-like isoform X1 [Benincasa hispida]

Match Name   E-value  Identity (%)  Description
AT4G30030.1  9.4e-86  42.17         Eukaryotic aspartyl protease family protein
AT2G23945.1  1.4e-78  39.17         Eukaryotic aspartyl protease family protein
AT4G30040.1  7.0e-73  41.49         Eukaryotic aspartyl protease family protein
AT2G35615.1  6.6e-47  30.37         Eukaryotic aspartyl protease family protein
AT5G33340.1  4.3e-46  30.85         Eukaryotic aspartyl protease family protein

InterPro
Analysis Name: InterPro Annotations of Cucumber (PI 183967) v1
Date Performed: 2021-10-25
IPR Term   IPR Description                        Source       Source Term     Source Description                           Alignment
IPR032799  Xylanase inhibitor, C-terminal         PFAM         PF14541         TAXi_C                                       coord: 290..446; e-value: 6.1E-30; score: 104.1
IPR032861  Xylanase inhibitor, N-terminal         PFAM         PF14543         TAXi_N                                       coord: 103..271; e-value: 8.5E-33; score: 114.1
IPR021109  Aspartic peptidase domain superfamily  GENE3D       2.40.70.10      Acid Proteases                               coord: 85..272; e-value: 7.5E-36; score: 125.9
IPR021109  Aspartic peptidase domain superfamily  GENE3D       2.40.70.10      Acid Proteases                               coord: 277..453; e-value: 2.1E-41; score: 143.5
IPR021109  Aspartic peptidase domain superfamily  SUPERFAMILY  50630           Acid proteases                               coord: 101..451
None       No IPR available                       PANTHER      PTHR47967:SF14  EUKARYOTIC ASPARTYL PROTEASE FAMILY PROTEIN  coord: 26..452
None       No IPR available                       PANTHER      PTHR47967       OS07G0603500 PROTEIN-RELATED                 coord: 26..452
IPR001969  Aspartic peptidase, active site        PROSITE      PS00141         ASP_PROTEASE                                 coord: 118..129
IPR033121  Peptidase family A1 domain             PROSITE      PS51767         PEPTIDASE_A1                                 coord: 103..446; score: 31.791201
IPR034161  Pepsin-like domain, plant              CDD          cd05476         pepsin_A_like_plant                          coord: 102..450; e-value: 3.98848E-71; score: 224.064
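
The `coord` values in the InterPro table give the start and end residue of each hit on the protein. As a rough illustration of how to read them, the Python sketch below computes the span of a hit and checks whether one hit lies inside another. It assumes the coordinates are 1-based and inclusive, which this page does not state explicitly; the helper names `parse_coord`, `span_length`, and `contains` are hypothetical, and the values are copied from the table.

```python
import re

# "start..end" ranges copied from the InterPro table, keyed by source term.
COORDS = {
    "PF14543": "103..271",  # TAXi_N
    "PF14541": "290..446",  # TAXi_C
    "PS00141": "118..129",  # ASP_PROTEASE active-site pattern
    "PS51767": "103..446",  # PEPTIDASE_A1 domain profile
}


def parse_coord(coord: str) -> tuple:
    """Split a 'start..end' string into an integer (start, end) pair."""
    match = re.fullmatch(r"(\d+)\.\.(\d+)", coord.strip())
    if match is None:
        raise ValueError("unexpected coordinate format: %r" % coord)
    return int(match.group(1)), int(match.group(2))


def span_length(coord: str) -> int:
    """Number of residues covered, ASSUMING 1-based inclusive coordinates."""
    start, end = parse_coord(coord)
    return end - start + 1


def contains(outer: str, inner: str) -> bool:
    """True if the inner range lies entirely within the outer range."""
    outer_start, outer_end = parse_coord(outer)
    inner_start, inner_end = parse_coord(inner)
    return outer_start <= inner_start and inner_end <= outer_end


if __name__ == "__main__":
    for term, coord in COORDS.items():
        print(term, coord, span_length(coord))
    # Both Pfam halves and the active-site pattern fall inside the A1 profile.
    print(all(contains(COORDS["PS51767"], COORDS[t])
              for t in ("PF14543", "PF14541", "PS00141")))
```

Under that assumption the two Pfam hits cover 169 and 157 residues and, together with the 12-residue active-site pattern, sit within the Peptidase A1 profile at 103..446.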

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
CSPI01G20490.1  CSPI01G20490.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006508 proteolysis
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0004190 aspartic-type endopeptidase activity
molecular_function GO:0016757 glycosyltransferase activity