Csor.00g119300 (gene) Silver-seed gourd (wild; sororia) v1

Overview
Name: Csor.00g119300
Type: gene
Organism: Cucurbita argyrosperma subsp. sororia (Silver-seed gourd (wild; sororia) v1)
Description: aspartic proteinase PCS1-like
Location: Csor_Chr04: 20761867 .. 20764560 (+)
RNA-Seq Expression: Csor.00g119300
Synteny: Csor.00g119300
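
The Location field above can be split into chromosome, coordinates, and strand. The following is a minimal sketch, not part of the database record. The function name `parse_location` is hypothetical, and the span length assumes 1-based, inclusive coordinates (as in GFF3), which this page does not state explicitly.

```python
import re

def parse_location(text):
    """Split a location string such as 'Csor_Chr04: 20761867 .. 20764560 (+)'."""
    m = re.fullmatch(
        r"\s*([^\s:]+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text
    )
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, strand = m.group(1), m.group(4)
    start, end = int(m.group(2)), int(m.group(3))
    # Assumption: coordinates are 1-based and inclusive, so the span
    # covers end - start + 1 bases.
    return seqid, start, end, strand, end - start + 1

loc = parse_location("Csor_Chr04: 20761867 .. 20764560 (+)")
print(loc)  # ('Csor_Chr04', 20761867, 20764560, '+', 2694)
```

Under that coordinate assumption the gene spans 2694 bases on the forward strand of Csor_Chr04.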
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAACTTCTTCCTCCACCTTCTGCAACTCCTCGTCTTCCTCCTCTGTTTCAAGCAGAGTCTCTGTTTTTCTGCAACTCAGGCGATGGTTTTGCCACTAAAAACAGAGACGGGATTGAATTCACAGCCGTCGAACAAACTCAGTTTCCACCATAATGTCACGTTGACCGTTGAGTTGACTGTCGGATCGCCGCCTCAAGCTGTCACGATGGTGCTGGATACAGGGAGTGAACTCCCCTGGCTTAACTGCAAAAAAAAAACCCAAACTTTGACCTCTGTTTTTAACCCACTTTCTTCCTCTTCTTACTCGCGAATCCCCTGCATTTCCCCCATTTGTCAGAAGCAAACCCGAGACTTACCCAACCCGGTTGTATGCGACGAAGAAAAACTATGCAATGTCTTTGTCTCATACGCTGACGGCTCGTCTCTCGAGGGTAATCTTGCATCCGACACGTTTCGAATCGGGTCGTTGAATAAACCCGGAACTTTATTCGGGTGTATGGGTTTGGGTTCGAGTTCGAATCCGCAGGAGGATGCGAAGACCACTGGGCTAATGGGGATGAACCGGGGGTCGCTTTCGTTTGTGACTCAATTGGGTCTGTCGAAATTCTCTTACTGCATATCGGGTCGTGATTCCTCCGGCGTTCTGATTTTAGGCGAAGGTAATCATTCGTGGGTTGGGAATATGGCGTACACGCCGTTGGTTCAAATGTCGACGCCATTGCCGTCGTACGATCGGTTCGCGTACACAGTCAAACTGGAGGGAATTAAAGTGGGGAACAAAATCCTGGCGCTGGAGAAATCAATTCTGGTTCCGGACCACACCGGCGCCGGGCAGACCATGGTGGATTCCGGGACCCAGTTCACGTTTCTTCTGGGGCCGGTGTACACGGCTTTAAAGAACGAGTTTCTGGTACAAACGAAGGGCGTTTTGGTTCCACTGGGTGATCCGAATTTCGTGTTCCAAGGAGCAATGGACTCGTGCTTCCGAGTACCGGCGAATAAGGGGAAGCTTCCGCCGCTGCCGACGGTGGGTCTAATGTTGAATGGGGCGGAGATGGTTGTGGGCGAGGAGTTGCTGCTGTACCGAGTGCCGGGAATGGTGAAGGGTGGTGATTGGGTGTATTGCATGACGTTTGGGAACTCGGATTTGTTGGGAATAGCGGCGTTTGTGGTTGGGAATCATCATCAACAGAACTTGTGGATGGAATATGATCTGGCGAAATCAAGGCTGGGATTTGTCGAGACCAGGTGTGATTCAGCGGGTCAACGCCTCGGATTGGACCTTTAAAATTTTCAATTCAATTTTGGTCAACGGCTCCACAAATTTTAATTTACATTAAAGCCACAATAATAACCTAAGACTTTGTCTAAATTATGTTTGAAAAATAACATTATAAAAATTAATTTTTGGTTCTACATTCTATTAAAAAACTGGCTTAACTTTTTAATTTAATTTAATTTAATTTAATTGCAGATTAAAAAAATAATTAAATTGTTATATATTAAATTATTAGTGGGAAAATAAAATCGTAAAATTGTTATGTTGGGGACAGACAGGGTCCCACAGAATGGAGGGGTAATTTCGGTTTGAGCAAATCTAATACGACGTCGTTTAATGAATTCAACTGAACTCCTTTTATCCAAAGCCTTCCTTAGAAGAGACGGGTCACAACAAGCAACTCAACACGTCCGAGTCACCTCCCGCCATCGACTCGCTCCCCCATCTTCTCCGTCGCTACCGTCGGGTTATCATTTTGCACGTAATTTTAGCATGCCCCATCATCGTAACTTTTTGATTCCAAGCCATCCTTGTTGTTTTTGGACCAACAAAATCAACCCATTTACTCGATTCTGTTCTGTAGGTCTCGGATTTCTCTCATCAAGGATTAATCTGGTTCGATTATTTGATGATCGCTCGTTTCTTATGCCTTTTTTGTTACTCGGGATTCTTTAATTAAGCGATTTTTCTTTTCTTATTGTTTTAATTTCACTTGTTTA
ACTGTCATCGAAGCTTACGGGGCTGTAGAATTGTCGTGTCTGTCTGATTCTGTTTATAGATTCGTGTTATCGCCTCATTTAATTATCTCTGGGAGTTTAATGGAAATGTGGAGAATTGAGAATTTGGATGAAAAAAGTTCAAAACTGATCTGGTTCGATCGACCGGTAGTTCTGATTTCTTACTCGAGCATTAGTAGAGACCTCAATCTTTTGATATATGATGGATGGATGTATGTTTGAGTTTGAATGCAGTGATGTATGTTTGAGTTTGAATGCAGTGATCTTACAGTCTTTTTGGTGTTAATGGGATGCCCAGGTGATCAGCTGATGAATGATTGTGGCTGATGGATTGAATTTGATAACAATTAATTGGGATTGCTATGCTCTTCCTGTTGAACTGCATATGTATTGGAAGAACTTCATACTCTTTTAAATTACTCATGATTTTGTTTGTCGTATGAGTGGTTTATTGGTTAGCCATTGTCATTACATCTTTACATTGACAAAGCTTAGGCTATTATGATTCTTTGATTCTTCAAGCTTCAGTTCTCATGTCATTTGTTCAAGGAGTTGCGGAAATCATTATGCTTCTTTCTATTAAGATACTGGAGCATTTTTCTTGCTGTGTTCTGCAGATTATTGTCAGGAGTTCATTAGATTTTTCATTATGTAGATTCACATTTCTCTGACTTGA

mRNA sequence

ATGAACTTCTTCCTCCACCTTCTGCAACTCCTCGTCTTCCTCCTCTGTTTCAAGCAGAGTCTCTGTTTTTCTGCAACTCAGGCGATGGTTTTGCCACTAAAAACAGAGACGGGATTGAATTCACAGCCGTCGAACAAACTCAGTTTCCACCATAATGTCACGTTGACCGTTGAGTTGACTGTCGGATCGCCGCCTCAAGCTGTCACGATGGTGCTGGATACAGGGAGTGAACTCCCCTGGCTTAACTGCAAAAAAAAAACCCAAACTTTGACCTCTGTTTTTAACCCACTTTCTTCCTCTTCTTACTCGCGAATCCCCTGCATTTCCCCCATTTGTCAGAAGCAAACCCGAGACTTACCCAACCCGGTTGTATGCGACGAAGAAAAACTATGCAATGTCTTTGTCTCATACGCTGACGGCTCGTCTCTCGAGGGTAATCTTGCATCCGACACGTTTCGAATCGGGTCGTTGAATAAACCCGGAACTTTATTCGGGTGTATGGGTTTGGGTTCGAGTTCGAATCCGCAGGAGGATGCGAAGACCACTGGGCTAATGGGGATGAACCGGGGGTCGCTTTCGTTTGTGACTCAATTGGGTCTGTCGAAATTCTCTTACTGCATATCGGGTCGTGATTCCTCCGGCGTTCTGATTTTAGGCGAAGGTAATCATTCGTGGGTTGGGAATATGGCGTACACGCCGTTGGTTCAAATGTCGACGCCATTGCCGTCGTACGATCGGTTCGCGTACACAGTCAAACTGGAGGGAATTAAAGTGGGGAACAAAATCCTGGCGCTGGAGAAATCAATTCTGGTTCCGGACCACACCGGCGCCGGGCAGACCATGGTGGATTCCGGGACCCAGTTCACGTTTCTTCTGGGGCCGGTGTACACGGCTTTAAAGAACGAGTTTCTGGTACAAACGAAGGGCGTTTTGGTTCCACTGGGTGATCCGAATTTCGTGTTCCAAGGAGCAATGGACTCGTGCTTCCGAGTACCGGCGAATAAGGGGAAGCTTCCGCCGCTGCCGACGGTGGGTCTAATGTTGAATGGGGCGGAGATGGTTGTGGGCGAGGAGTTGCTGCTGTACCGAGTGCCGGGAATGGTGAAGGGTGGTGATTGGGTGTATTGCATGACGTTTGGGAACTCGGATTTGTTGGGAATAGCGGCGTTTGTGGTTGGGAATCATCATCAACAGAACTTGTGGATGGAATATGATCTGGCGAAATCAAGGCTGGGATTTGTCGAGACCAGAAGAGACGGGTCACAACAAGCAACTCAACACGTCCGAGTCACCTCCCGCCATCGACTCGCTCCCCCATCTTCTCCGTCGCTACCGTCGGGTTATCATTTTGCACGTCTCGGATTTCTCTCATCAAGGATTAATCTGATTCACATTTCTCTGACTTGA

Coding sequence (CDS)

ATGAACTTCTTCCTCCACCTTCTGCAACTCCTCGTCTTCCTCCTCTGTTTCAAGCAGAGTCTCTGTTTTTCTGCAACTCAGGCGATGGTTTTGCCACTAAAAACAGAGACGGGATTGAATTCACAGCCGTCGAACAAACTCAGTTTCCACCATAATGTCACGTTGACCGTTGAGTTGACTGTCGGATCGCCGCCTCAAGCTGTCACGATGGTGCTGGATACAGGGAGTGAACTCCCCTGGCTTAACTGCAAAAAAAAAACCCAAACTTTGACCTCTGTTTTTAACCCACTTTCTTCCTCTTCTTACTCGCGAATCCCCTGCATTTCCCCCATTTGTCAGAAGCAAACCCGAGACTTACCCAACCCGGTTGTATGCGACGAAGAAAAACTATGCAATGTCTTTGTCTCATACGCTGACGGCTCGTCTCTCGAGGGTAATCTTGCATCCGACACGTTTCGAATCGGGTCGTTGAATAAACCCGGAACTTTATTCGGGTGTATGGGTTTGGGTTCGAGTTCGAATCCGCAGGAGGATGCGAAGACCACTGGGCTAATGGGGATGAACCGGGGGTCGCTTTCGTTTGTGACTCAATTGGGTCTGTCGAAATTCTCTTACTGCATATCGGGTCGTGATTCCTCCGGCGTTCTGATTTTAGGCGAAGGTAATCATTCGTGGGTTGGGAATATGGCGTACACGCCGTTGGTTCAAATGTCGACGCCATTGCCGTCGTACGATCGGTTCGCGTACACAGTCAAACTGGAGGGAATTAAAGTGGGGAACAAAATCCTGGCGCTGGAGAAATCAATTCTGGTTCCGGACCACACCGGCGCCGGGCAGACCATGGTGGATTCCGGGACCCAGTTCACGTTTCTTCTGGGGCCGGTGTACACGGCTTTAAAGAACGAGTTTCTGGTACAAACGAAGGGCGTTTTGGTTCCACTGGGTGATCCGAATTTCGTGTTCCAAGGAGCAATGGACTCGTGCTTCCGAGTACCGGCGAATAAGGGGAAGCTTCCGCCGCTGCCGACGGTGGGTCTAATGTTGAATGGGGCGGAGATGGTTGTGGGCGAGGAGTTGCTGCTGTACCGAGTGCCGGGAATGGTGAAGGGTGGTGATTGGGTGTATTGCATGACGTTTGGGAACTCGGATTTGTTGGGAATAGCGGCGTTTGTGGTTGGGAATCATCATCAACAGAACTTGTGGATGGAATATGATCTGGCGAAATCAAGGCTGGGATTTGTCGAGACCAGAAGAGACGGGTCACAACAAGCAACTCAACACGTCCGAGTCACCTCCCGCCATCGACTCGCTCCCCCATCTTCTCCGTCGCTACCGTCGGGTTATCATTTTGCACGTCTCGGATTTCTCTCATCAAGGATTAATCTGATTCACATTTCTCTGACTTGA

Protein sequence

MNFFLHLLQLLVFLLCFKQSLCFSATQAMVLPLKTETGLNSQPSNKLSFHHNVTLTVELTVGSPPQAVTMVLDTGSELPWLNCKKKTQTLTSVFNPLSSSSYSRIPCISPICQKQTRDLPNPVVCDEEKLCNVFVSYADGSSLEGNLASDTFRIGSLNKPGTLFGCMGLGSSSNPQEDAKTTGLMGMNRGSLSFVTQLGLSKFSYCISGRDSSGVLILGEGNHSWVGNMAYTPLVQMSTPLPSYDRFAYTVKLEGIKVGNKILALEKSILVPDHTGAGQTMVDSGTQFTFLLGPVYTALKNEFLVQTKGVLVPLGDPNFVFQGAMDSCFRVPANKGKLPPLPTVGLMLNGAEMVVGEELLLYRVPGMVKGGDWVYCMTFGNSDLLGIAAFVVGNHHQQNLWMEYDLAKSRLGFVETRRDGSQQATQHVRVTSRHRLAPPSSPSLPSGYHFARLGFLSSRINLIHISLT
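
The protein sequence above corresponds to the CDS read codon by codon. The sketch below illustrates this on the first and last few codons of the CDS only. It assumes the standard nuclear genetic code, and the names `CODON_TABLE` and `translate` are hypothetical helpers rather than anything provided by the database.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def translate(cds):
    """Translate a DNA coding sequence in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

head = "ATGAACTTCTTCCTCCACCTT"        # first 7 codons of the CDS
tail = "AATCTGATTCACATTTCTCTGACTTGA"  # last 9 codons, including the stop
print(translate(head))  # MNFFLHL
print(translate(tail))  # NLIHISLT
```

Both fragments agree with the start (MNFFLHL...) and end (...NLIHISLT) of the listed protein sequence.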
Homology
BLAST of Csor.00g119300 vs. ExPASy Swiss-Prot
Match: Q9LZL3 (Aspartic proteinase PCS1 OS=Arabidopsis thaliana OX=3702 GN=PCS1 PE=2 SV=1)

HSP 1 Score: 477.2 bits (1227), Expect = 2.1e-133
Identity = 236/407 (57.99%), Positives = 298/407 (73.22%), Query Frame = 0

Query: 24  SATQAMVLPLKTE-TGLNSQPSNKLSFHHNVTLTVELTVGSPPQAVTMVLDTGSELPWLN 83
           S++Q +VLPLKT  T  + +P++KL FHHNVTLTV LTVG+PPQ ++MV+DTGSEL WL 
Sbjct: 41  SSSQTLVLPLKTRITPTDHRPTDKLHFHHNVTLTVTLTVGTPPQNISMVIDTGSELSWLR 100

Query: 84  CKKKTQ-TLTSVFNPLSSSSYSRIPCISPICQKQTRDLPNPVVCDEEKLCNVFVSYADGS 143
           C + +     + F+P  SSSYS IPC SP C+ +TRD   P  CD +KLC+  +SYAD S
Sbjct: 101 CNRSSNPNPVNNFDPTRSSSYSPIPCSSPTCRTRTRDFLIPASCDSDKLCHATLSYADAS 160

Query: 144 SLEGNLASDTFRIG-SLNKPGTLFGCMGLGSSSNPQEDAKTTGLMGMNRGSLSFVTQLGL 203
           S EGNLA++ F  G S N    +FGCMG  S S+P+ED KTTGL+GMNRGSLSF++Q+G 
Sbjct: 161 SSEGNLAAEIFHFGNSTNDSNLIFGCMGSVSGSDPEEDTKTTGLLGMNRGSLSFISQMGF 220

Query: 204 SKFSYCISGRDS-SGVLILGEGNHSWVGNMAYTPLVQMSTPLPSYDRFAYTVKLEGIKVG 263
            KFSYCISG D   G L+LG+ N +W+  + YTPL+++STPLP +DR AYTV+L GIKV 
Sbjct: 221 PKFSYCISGTDDFPGFLLLGDSNFTWLTPLNYTPLIRISTPLPYFDRVAYTVQLTGIKVN 280

Query: 264 NKILALEKSILVPDHTGAGQTMVDSGTQFTFLLGPVYTALKNEFLVQTKGVLVPLGDPNF 323
            K+L + KS+LVPDHTGAGQTMVDSGTQFTFLLGPVYTAL++ FL +T G+L    DP+F
Sbjct: 281 GKLLPIPKSVLVPDHTGAGQTMVDSGTQFTFLLGPVYTALRSHFLNRTNGILTVYEDPDF 340

Query: 324 VFQGAMDSCFR---VPANKGKLPPLPTVGLMLNGAEMVVGEELLLYRVPGMVKGGDWVYC 383
           VFQG MD C+R   V    G L  LPTV L+  GAE+ V  + LLYRVP +  G D VYC
Sbjct: 341 VFQGTMDLCYRISPVRIRSGILHRLPTVSLVFEGAEIAVSGQPLLYRVPHLTVGNDSVYC 400

Query: 384 MTFGNSDLLGIAAFVVGNHHQQNLWMEYDLAKSRLGFVETRRDGSQQ 424
            TFGNSDL+G+ A+V+G+HHQQN+W+E+DL +SR+G      D S Q
Sbjct: 401 FTFGNSDLMGMEAYVIGHHHQQNMWIEFDLQRSRIGLAPVECDVSGQ 447
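
The percentages in each HSP header are the matching and similar positions divided by the alignment length. The sketch below recomputes them for the Q9LZL3 hit. The regular expression and the helper name `hsp_percentages` are assumptions made for illustration, and the pattern accepts any spelling of the second label that starts with "Pos".

```python
import re

# Matches the statistics line of an HSP header as it appears on this page.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), Pos\w+ = (\d+)/(\d+) \(([\d.]+)%\)"
)

def hsp_percentages(line):
    """Return (identity %, positives %) recomputed from the raw counts."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError(f"no HSP statistics found in: {line!r}")
    ident, ident_len, pos, pos_len = (int(m.group(i)) for i in (1, 2, 4, 5))
    return round(100 * ident / ident_len, 2), round(100 * pos / pos_len, 2)

line = "Identity = 236/407 (57.99%), Positives = 298/407 (73.22%), Query Frame = 0"
print(hsp_percentages(line))  # (57.99, 73.22)
```

The recomputed values match the reported 57.99% identity and 73.22% positives over the 407-column alignment.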

BLAST of Csor.00g119300 vs. ExPASy Swiss-Prot
Match: Q766C2 (Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 SV=1)

HSP 1 Score: 171.4 bits (433), Expect = 2.4e-41
Identity = 115/366 (31.42%), Positives = 177/366 (48.36%), Query Frame = 0

Query: 57  VELTVGSPPQAVTMVLDTGSELPWLNCKKKTQTL---TSVFNPLSSSSYSRIPCISPICQ 116
           + + +G+P  + + ++DTGS+L W  C+  TQ     T +FNP  SSS+S +PC S  CQ
Sbjct: 98  MNVAIGTPDSSFSAIMDTGSDLIWTQCEPCTQCFSQPTPIFNPQDSSSFSTLPCESQYCQ 157

Query: 117 KQTRDLPNPVVCDEEKLCNVFVSYADGSSLEGNLASDTFRIGSLNKPGTLFGCMGLGSSS 176
               DLP+    + E  C     Y DGS+ +G +A++TF   + + P   FGC   G  +
Sbjct: 158 ----DLPSETCNNNE--CQYTYGYGDGSTTQGYMATETFTFETSSVPNIAFGC---GEDN 217

Query: 177 NPQEDAKTTGLMGMNRGSLSFVTQLGLSKFSYCIS--GRDSSGVLILGEGNHSWVGNMAY 236
                    GL+GM  G LS  +QLG+ +FSYC++  G  S   L LG            
Sbjct: 218 QGFGQGNGAGLIGMGWGPLSLPSQLGVGQFSYCMTSYGSSSPSTLALGSAASGVPEGSPS 277

Query: 237 TPLVQMSTPLPSYDRFAYTVKLEGIKVGNKILALEKSILVPDHTGAGQTMVDSGTQFTFL 296
           T L+  S   P+Y    Y + L+GI VG   L +  S       G G  ++DSGT  T+L
Sbjct: 278 TTLIHSSLN-PTY----YYITLQGITVGGDNLGIPSSTFQLQDDGTGGMIIDSGTTLTYL 337

Query: 297 LGPVYTALKNEFLVQTKGVLVPLGDPNFVFQGAMDSCFRVPANKGKLPPLPTVGLMLNGA 356
               Y A+   F   T  + +P  D +      + +CF+ P++ G    +P + +  +G 
Sbjct: 338 PQDAYNAVAQAF---TDQINLPTVDES---SSGLSTCFQQPSD-GSTVQVPEISMQFDGG 397

Query: 357 EMVVGEELLLYRVPGMVKGGDWVYCMTFGNSDLLGIAAFVVGNHHQQNLWMEYDLAKSRL 416
            + +GE+ +L      +   + V C+  G+S  LGI+ F  GN  QQ   + YDL    +
Sbjct: 398 VLNLGEQNIL------ISPAEGVICLAMGSSSQLGISIF--GNIQQQETQVLYDLQNLAV 434

Query: 417 GFVETR 418
            FV T+
Sbjct: 458 SFVPTQ 434

BLAST of Csor.00g119300 vs. ExPASy Swiss-Prot
Match: Q766C3 (Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 SV=1)

HSP 1 Score: 168.7 bits (426), Expect = 1.6e-40
Identity = 110/367 (29.97%), Positives = 183/367 (49.86%), Query Frame = 0

Query: 57  VELTVGSPPQAVTMVLDTGSELPWLNCKKKTQTL---TSVFNPLSSSSYSRIPCISPICQ 116
           + L++G+P Q  + ++DTGS+L W  C+  TQ     T +FNP  SSS+S +PC S +CQ
Sbjct: 97  MNLSIGTPAQPFSAIMDTGSDLIWTQCQPCTQCFNQSTPIFNPQGSSSFSTLPCSSQLCQ 156

Query: 117 KQTRDLPNPVVCDEEKLCNVFVSYADGSSLEGNLASDTFRIGSLNKPGTLFGCMGLGSSS 176
                L +P        C     Y DGS  +G++ ++T   GS++ P   FGC   G ++
Sbjct: 157 A----LSSPTC--SNNFCQYTYGYGDGSETQGSMGTETLTFGSVSIPNITFGC---GENN 216

Query: 177 NPQEDAKTTGLMGMNRGSLSFVTQLGLSKFSYCIS--GRDSSGVLILGEGNHSWVGNMAY 236
                    GL+GM RG LS  +QL ++KFSYC++  G  +   L+LG   +S       
Sbjct: 217 QGFGQGNGAGLVGMGRGPLSLPSQLDVTKFSYCMTPIGSSTPSNLLLGSLANSVTAGSPN 276

Query: 237 TPLVQMSTPLPSYDRFAYTVKLEGIKVGNKILALEKS-ILVPDHTGAGQTMVDSGTQFTF 296
           T L+Q S+ +P++    Y + L G+ VG+  L ++ S   +  + G G  ++DSGT  T+
Sbjct: 277 TTLIQ-SSQIPTF----YYITLNGLSVGSTRLPIDPSAFALNSNNGTGGIIIDSGTTLTY 336

Query: 297 LLGPVYTALKNEFLVQTKGVLVPLGDPNFVFQGAMDSCFRVPANKGKLPPLPTVGLMLNG 356
            +   Y +++ EF+ Q    +V      F      D CF+ P++   L  +PT  +  +G
Sbjct: 337 FVNNAYQSVRQEFISQINLPVVNGSSSGF------DLCFQTPSDPSNL-QIPTFVMHFDG 396

Query: 357 AEMVVGEELLLYRVPGMVKGGDWVYCMTFGNSDLLGIAAFVVGNHHQQNLWMEYDLAKSR 416
            ++ +  E         +   + + C+  G+S   G++ F  GN  QQN+ + YD   S 
Sbjct: 397 GDLELPSENY------FISPSNGLICLAMGSSS-QGMSIF--GNIQQQNMLVVYDTGNSV 433

Query: 417 LGFVETR 418
           + F   +
Sbjct: 457 VSFASAQ 433

BLAST of Csor.00g119300 vs. ExPASy Swiss-Prot
Match: Q9LNJ3 (Aspartyl protease family protein 2 OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 SV=1)

HSP 1 Score: 144.8 bits (364), Expect = 2.4e-33
Identity = 120/365 (32.88%), Positives = 174/365 (47.67%), Query Frame = 0

Query: 59  LTVGSPPQAVTMVLDTGSELPWLN---CKKKTQTLTSVFNPLSSSSYSRIPCISPICQKQ 118
           L VG+P + V MVLDTGS++ WL    C++       +F+P  S +Y+ IPC SP C   
Sbjct: 146 LGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRKSKTYATIPCSSPHC--- 205

Query: 119 TRDLPNPVVCDEEKLCNVFVSYADGSSLEGNLASDTFRIGSLNKPGTLFGCMGLGSSSNP 178
            R L +       K C   VSY DGS   G+ +++T         G   GC       N 
Sbjct: 206 -RRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFRRNRVKGVALGC----GHDNE 265

Query: 179 QEDAKTTGLMGMNRGSLSFVTQLG---LSKFSYCISGRDSSGVLILGEGNHSWVGNMAYT 238
                  GL+G+ +G LSF  Q G     KFSYC+  R +S      + +    GN A +
Sbjct: 266 GLFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCLVDRSAS-----SKPSSVVFGNAAVS 325

Query: 239 PLVQMSTPL---PSYDRFAYTVKLEGIKV-GNKILALEKSILVPDHTGAGQTMVDSGTQF 298
            + +  TPL   P  D F Y V L GI V G ++  +  S+   D  G G  ++DSGT  
Sbjct: 326 RIARF-TPLLSNPKLDTF-YYVGLLGISVGGTRVPGVTASLFKLDQIGNGGVIIDSGTSV 385

Query: 299 TFLLGPVYTALKNEFLVQTKGVLVPLGDPNFVFQGAMDSCFRVPANKGKLPPLPTVGLML 358
           T L+ P Y A+++ F V  K +      P+F      D+CF + +N  ++  +PTV L  
Sbjct: 386 TRLIRPAYIAMRDAFRVGAKTL---KRAPDF---SLFDTCFDL-SNMNEV-KVPTVVLHF 445

Query: 359 NGAEMVVGEELLLYRVPGMVKGGDWVYCMTFGNSDLLGIAAFVVGNHHQQNLWMEYDLAK 414
            GA+  V      Y +P    G    +C  F  + + G++  ++GN  QQ   + YDLA 
Sbjct: 446 RGAD--VSLPATNYLIPVDTNG---KFCFAFAGT-MGGLS--IIGNIQQQGFRVVYDLAS 479

BLAST of Csor.00g119300 vs. ExPASy Swiss-Prot
Match: O04496 (Aspartyl protease AED3 OS=Arabidopsis thaliana OX=3702 GN=AED3 PE=1 SV=1)

HSP 1 Score: 127.5 bits (319), Expect = 4.0e-28
Identity = 114/373 (30.56%), Positives = 170/373 (45.58%), Query Frame = 0

Query: 51  HNVTLTVELTVGSPPQAVTMVLDTGSELPWLNCK--KKTQTLTSVFNPLSSSSYSRIPCI 110
           H     V   +G+PPQ + MVLDT ++  WL C         ++ FN  SSS+YS + C 
Sbjct: 100 HIGNYVVRAKLGTPPQLMFMVLDTSNDAVWLPCSGCSGCSNASTSFNTNSSSTYSTVSCS 159

Query: 111 SPICQKQTRDLPNPVVCDEEKLCNVFVSYADGSSLEGNLASDTFRIGSLNKPGTLFGCM- 170
           +  C  Q R L  P    +  +C+   SY   SS   +L  DT  +     P   FGC+ 
Sbjct: 160 TAQC-TQARGLTCPSSSPQPSVCSFNQSYGGDSSFSASLVQDTLTLAPDVIPNFSFGCIN 219

Query: 171 -GLGSSSNPQEDAKTTGLMGMNRGSLSFVTQ---LGLSKFSYCISGRDS---SGVLILGE 230
              G+S  PQ      GLMG+ RG +S V+Q   L    FSYC+    S   SG L LG 
Sbjct: 220 SASGNSLPPQ------GLMGLGRGPMSLVSQTTSLYSGVFSYCLPSFRSFYFSGSLKLGL 279

Query: 231 GNHSWVGNMAYTPLVQMSTPLPSYDRFAYTVKLEGIKVGNKILALEKSILVPDHTGAGQT 290
                  ++ YTPL++ +   PS     Y V L G+ VG+  + ++   L  D      T
Sbjct: 280 LGQP--KSIRYTPLLR-NPRRPS----LYYVNLTGVSVGSVQVPVDPVYLTFDANSGAGT 339

Query: 291 MVDSGTQFTFLLGPVYTALKNEFLVQTKGVLVPLGDPNFVFQGAMDSCFRVPANKGKLPP 350
           ++DSGT  T    PVY A+++EF  Q       +   +F   GA D+CF   A+   + P
Sbjct: 340 IIDSGTVITRFAQPVYEAIRDEFRKQ-------VNVSSFSTLGAFDTCF--SADNENVAP 399

Query: 351 LPTVGLMLNGAEMVVGEELLLYRVPGMVKGGDWVYCMTF-GNSDLLGIAAFVVGNHHQQN 410
             T+ +     ++ + E  L++   G +       C++  G          V+ N  QQN
Sbjct: 400 KITLHMTSLDLKLPM-ENTLIHSSAGTLT------CLSMAGIRQNANAVLNVIANLQQQN 442

Query: 411 LWMEYDLAKSRLG 413
           L + +D+  SR+G
Sbjct: 460 LRILFDVPNSRIG 442

BLAST of Csor.00g119300 vs. NCBI nr
Match: KAG6602656.1 (Aspartic proteinase PCS1, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 942 bits (2435), Expect = 0.0
Identity = 468/468 (100.00%), Positives = 468/468 (100.00%), Query Frame = 0

Query: 1   MNFFLHLLQLLVFLLCFKQSLCFSATQAMVLPLKTETGLNSQPSNKLSFHHNVTLTVELT 60
           MNFFLHLLQLLVFLLCFKQSLCFSATQAMVLPLKTETGLNSQPSNKLSFHHNVTLTVELT
Sbjct: 1   MNFFLHLLQLLVFLLCFKQSLCFSATQAMVLPLKTETGLNSQPSNKLSFHHNVTLTVELT 60

Query: 61  VGSPPQAVTMVLDTGSELPWLNCKKKTQTLTSVFNPLSSSSYSRIPCISPICQKQTRDLP 120
           VGSPPQAVTMVLDTGSELPWLNCKKKTQTLTSVFNPLSSSSYSRIPCISPICQKQTRDLP
Sbjct: 61  VGSPPQAVTMVLDTGSELPWLNCKKKTQTLTSVFNPLSSSSYSRIPCISPICQKQTRDLP 120

Query: 121 NPVVCDEEKLCNVFVSYADGSSLEGNLASDTFRIGSLNKPGTLFGCMGLGSSSNPQEDAK 180
           NPVVCDEEKLCNVFVSYADGSSLEGNLASDTFRIGSLNKPGTLFGCMGLGSSSNPQEDAK
Sbjct: 121 NPVVCDEEKLCNVFVSYADGSSLEGNLASDTFRIGSLNKPGTLFGCMGLGSSSNPQEDAK 180

Query: 181 TTGLMGMNRGSLSFVTQLGLSKFSYCISGRDSSGVLILGEGNHSWVGNMAYTPLVQMSTP 240
           TTGLMGMNRGSLSFVTQLGLSKFSYCISGRDSSGVLILGEGNHSWVGNMAYTPLVQMSTP
Sbjct: 181 TTGLMGMNRGSLSFVTQLGLSKFSYCISGRDSSGVLILGEGNHSWVGNMAYTPLVQMSTP 240

Query: 241 LPSYDRFAYTVKLEGIKVGNKILALEKSILVPDHTGAGQTMVDSGTQFTFLLGPVYTALK 300
           LPSYDRFAYTVKLEGIKVGNKILALEKSILVPDHTGAGQTMVDSGTQFTFLLGPVYTALK
Sbjct: 241 LPSYDRFAYTVKLEGIKVGNKILALEKSILVPDHTGAGQTMVDSGTQFTFLLGPVYTALK 300

Query: 301 NEFLVQTKGVLVPLGDPNFVFQGAMDSCFRVPANKGKLPPLPTVGLMLNGAEMVVGEELL 360
           NEFLVQTKGVLVPLGDPNFVFQGAMDSCFRVPANKGKLPPLPTVGLMLNGAEMVVGEELL
Sbjct: 301 NEFLVQTKGVLVPLGDPNFVFQGAMDSCFRVPANKGKLPPLPTVGLMLNGAEMVVGEELL 360

Query: 361 LYRVPGMVKGGDWVYCMTFGNSDLLGIAAFVVGNHHQQNLWMEYDLAKSRLGFVETRRDG 420
           LYRVPGMVKGGDWVYCMTFGNSDLLGIAAFVVGNHHQQNLWMEYDLAKSRLGFVETRRDG
Sbjct: 361 LYRVPGMVKGGDWVYCMTFGNSDLLGIAAFVVGNHHQQNLWMEYDLAKSRLGFVETRRDG 420

Query: 421 SQQATQHVRVTSRHRLAPPSSPSLPSGYHFARLGFLSSRINLIHISLT 468
           SQQATQHVRVTSRHRLAPPSSPSLPSGYHFARLGFLSSRINLIHISLT
Sbjct: 421 SQQATQHVRVTSRHRLAPPSSPSLPSGYHFARLGFLSSRINLIHISLT 468

BLAST of Csor.00g119300 vs. NCBI nr
Match: KAG7033342.1 (Aspartic proteinase PCS1, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 839 bits (2167), Expect = 7.58e-306
Identity = 415/423 (98.11%), Positives = 419/423 (99.05%), Query Frame = 0

Query: 1   MNFFLHLLQLLVFLLCFKQSLCFSATQAMVLPLKTETGLNSQPSNKLSFHHNVTLTVELT 60
           MNFFLHLLQLLVFLLCFKQSLCFSATQAMVLPLKTETG+NSQPSNKLSFHHNVTLTVELT
Sbjct: 1   MNFFLHLLQLLVFLLCFKQSLCFSATQAMVLPLKTETGVNSQPSNKLSFHHNVTLTVELT 60

Query: 61  VGSPPQAVTMVLDTGSELPWLNCKKKTQTLTSVFNPLSSSSYSRIPCISPICQKQTRDLP 120
           VGSPPQAVTMVLDTGSELPWLNCKKKTQTLTSVFNPLSSSSYSRIPCISPICQKQTRDLP
Sbjct: 61  VGSPPQAVTMVLDTGSELPWLNCKKKTQTLTSVFNPLSSSSYSRIPCISPICQKQTRDLP 120

Query: 121 NPVVCDEEKLCNVFVSYADGSSLEGNLASDTFRIGSLNKPGTLFGCMGLGSSSNPQEDAK 180
           NPVVCDEEKLCNVFVSYADGSSLEGNLASDTFRIGSLNKPGTLFGCMGLGSSSNPQEDAK
Sbjct: 121 NPVVCDEEKLCNVFVSYADGSSLEGNLASDTFRIGSLNKPGTLFGCMGLGSSSNPQEDAK 180

Query: 181 TTGLMGMNRGSLSFVTQLGLSKFSYCISGRDSSGVLILGEGNHSWVGNMAYTPLVQMSTP 240
           TTGLMGMNRGSLSFVTQLGLSKFSYCISGRDSSGVLILGEGNHSWVGNMAYTPLVQMSTP
Sbjct: 181 TTGLMGMNRGSLSFVTQLGLSKFSYCISGRDSSGVLILGEGNHSWVGNMAYTPLVQMSTP 240

Query: 241 LPSYDRFAYTVKLEGIKVGNKILALEKSILVPDHTGAGQTMVDSGTQFTFLLGPVYTALK 300
           LPSYDRFAYTVKLEGIKVGNKILALEKSILVPDHTGAGQTMVDSGTQFTFLLGPVYTALK
Sbjct: 241 LPSYDRFAYTVKLEGIKVGNKILALEKSILVPDHTGAGQTMVDSGTQFTFLLGPVYTALK 300

Query: 301 NEFLVQTKGVLVPLGDPNFVFQGAMDSCFRVPANKGKLPPLPTVGLMLNGAEMVVGEELL 360
           NEFLVQTKGVLVPLGDPNFVFQGA+DSCFRVPANKGKLPPLPTV LMLNGAEMVVGEELL
Sbjct: 301 NEFLVQTKGVLVPLGDPNFVFQGALDSCFRVPANKGKLPPLPTVSLMLNGAEMVVGEELL 360

Query: 361 LYRVPGMVKGGDWVYCMTFGNSDLLGIAAFVVGNHHQQNLWMEYDLAKSRLGFVETRRDG 420
           LYRVPGMVKGGDWVYCMTFGNSDLLGIAAFV+GNHHQQNLWMEYDLAKSRLGFVETR D 
Sbjct: 361 LYRVPGMVKGGDWVYCMTFGNSDLLGIAAFVIGNHHQQNLWMEYDLAKSRLGFVETRCDS 420

Query: 421 SQQ 423
           + Q
Sbjct: 421 AGQ 423

BLAST of Csor.00g119300 vs. NCBI nr
Match: XP_022955370.1 (aspartic proteinase PCS1-like [Cucurbita moschata])

HSP 1 Score: 835 bits (2157), Expect = 2.53e-304
Identity = 414/423 (97.87%), Positives = 417/423 (98.58%), Query Frame = 0

Query: 1   MNFFLHLLQLLVFLLCFKQSLCFSATQAMVLPLKTETGLNSQPSNKLSFHHNVTLTVELT 60
           MNFFLHLLQLLVFLLCFKQSLCF ATQAMVLPLKTETGLNSQPSNKLSFHHNVTLTVELT
Sbjct: 1   MNFFLHLLQLLVFLLCFKQSLCFCATQAMVLPLKTETGLNSQPSNKLSFHHNVTLTVELT 60

Query: 61  VGSPPQAVTMVLDTGSELPWLNCKKKTQTLTSVFNPLSSSSYSRIPCISPICQKQTRDLP 120
           VGSPPQAVTMVLDTGSELPWLNCKKKTQTLTSVFNPLSSSSYSRIPCISPICQKQTRDLP
Sbjct: 61  VGSPPQAVTMVLDTGSELPWLNCKKKTQTLTSVFNPLSSSSYSRIPCISPICQKQTRDLP 120

Query: 121 NPVVCDEEKLCNVFVSYADGSSLEGNLASDTFRIGSLNKPGTLFGCMGLGSSSNPQEDAK 180
           NPVVCDEEKLCNVFVSYADGSSLEGNLASDTFRIGSLNKPGTLFGCMGLGSSSNPQEDAK
Sbjct: 121 NPVVCDEEKLCNVFVSYADGSSLEGNLASDTFRIGSLNKPGTLFGCMGLGSSSNPQEDAK 180

Query: 181 TTGLMGMNRGSLSFVTQLGLSKFSYCISGRDSSGVLILGEGNHSWVGNMAYTPLVQMSTP 240
           TTGLMGMNRGSLSFVTQLGLSKFSYCISGRDSSGVLILGEGNHSWVGNMAYTPLVQMSTP
Sbjct: 181 TTGLMGMNRGSLSFVTQLGLSKFSYCISGRDSSGVLILGEGNHSWVGNMAYTPLVQMSTP 240

Query: 241 LPSYDRFAYTVKLEGIKVGNKILALEKSILVPDHTGAGQTMVDSGTQFTFLLGPVYTALK 300
           LPSYDRFAYTVKLEGIKVGNKILALEKSILVPDHTGAGQTMVDSGTQFTFLLGPVYTALK
Sbjct: 241 LPSYDRFAYTVKLEGIKVGNKILALEKSILVPDHTGAGQTMVDSGTQFTFLLGPVYTALK 300

Query: 301 NEFLVQTKGVLVPLGDPNFVFQGAMDSCFRVPANKGKLPPLPTVGLMLNGAEMVVGEELL 360
           NEFLVQTKGVLVPLGDPNFVFQGAMDSCFRVPAN+GKLPPLPTV LMLNGAEMVVG ELL
Sbjct: 301 NEFLVQTKGVLVPLGDPNFVFQGAMDSCFRVPANQGKLPPLPTVSLMLNGAEMVVGGELL 360

Query: 361 LYRVPGMVKGGDWVYCMTFGNSDLLGIAAFVVGNHHQQNLWMEYDLAKSRLGFVETRRDG 420
           LYRVPGMVKGGDWVYCMTFGNSDLLGIAAFV+GNHHQQNLWMEYDLAKSRLGFVETR D 
Sbjct: 361 LYRVPGMVKGGDWVYCMTFGNSDLLGIAAFVIGNHHQQNLWMEYDLAKSRLGFVETRCDS 420

Query: 421 SQQ 423
           + Q
Sbjct: 421 AGQ 423

BLAST of Csor.00g119300 vs. NCBI nr
Match: XP_023540416.1 (aspartic proteinase PCS1-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 826 bits (2133), Expect = 1.15e-300
Identity = 408/423 (96.45%), Positives = 414/423 (97.87%), Query Frame = 0

Query: 1   MNFFLHLLQLLVFLLCFKQSLCFSATQAMVLPLKTETGLNSQPSNKLSFHHNVTLTVELT 60
           MNFF HLLQLLVF LCFKQ LCFSATQAMVLPLKTETGLNSQPSNK+SFHHNVTLTVELT
Sbjct: 1   MNFFFHLLQLLVFFLCFKQRLCFSATQAMVLPLKTETGLNSQPSNKISFHHNVTLTVELT 60

Query: 61  VGSPPQAVTMVLDTGSELPWLNCKKKTQTLTSVFNPLSSSSYSRIPCISPICQKQTRDLP 120
           VGSPPQAVTMVLDTGSELPWLNCKKKTQTLTSVFNPLSSSSYSRIPCISPICQKQTRDLP
Sbjct: 61  VGSPPQAVTMVLDTGSELPWLNCKKKTQTLTSVFNPLSSSSYSRIPCISPICQKQTRDLP 120

Query: 121 NPVVCDEEKLCNVFVSYADGSSLEGNLASDTFRIGSLNKPGTLFGCMGLGSSSNPQEDAK 180
           NPVVCDEEKLC+VFVSYADGSSLEGNLASDTFRIGSLNKPGTLFGCMGLGSSSNPQEDAK
Sbjct: 121 NPVVCDEEKLCDVFVSYADGSSLEGNLASDTFRIGSLNKPGTLFGCMGLGSSSNPQEDAK 180

Query: 181 TTGLMGMNRGSLSFVTQLGLSKFSYCISGRDSSGVLILGEGNHSWVGNMAYTPLVQMSTP 240
           TTGLMGMNRGSLSFVTQLGLSKFSYCISGRDSSGVLILGEGNHSWVGNMAYTPLVQMSTP
Sbjct: 181 TTGLMGMNRGSLSFVTQLGLSKFSYCISGRDSSGVLILGEGNHSWVGNMAYTPLVQMSTP 240

Query: 241 LPSYDRFAYTVKLEGIKVGNKILALEKSILVPDHTGAGQTMVDSGTQFTFLLGPVYTALK 300
           LPSYDRFAYTVKLEGIKVGNKILALEKSILVPDH+GAGQTMVDSGTQFTFLLGPVYTALK
Sbjct: 241 LPSYDRFAYTVKLEGIKVGNKILALEKSILVPDHSGAGQTMVDSGTQFTFLLGPVYTALK 300

Query: 301 NEFLVQTKGVLVPLGDPNFVFQGAMDSCFRVPANKGKLPPLPTVGLMLNGAEMVVGEELL 360
           NEFLVQTKGVLVPLGDPNFVFQGAMDSCFRVPAN+GKLPPLP V LMLNGAEMVVG ELL
Sbjct: 301 NEFLVQTKGVLVPLGDPNFVFQGAMDSCFRVPANQGKLPPLPAVSLMLNGAEMVVGGELL 360

Query: 361 LYRVPGMVKGGDWVYCMTFGNSDLLGIAAFVVGNHHQQNLWMEYDLAKSRLGFVETRRDG 420
           LYRVPGMVKGGDWVYCMTFGNSDLLGIAAFV+GNHHQQNLWMEYDLAKSRLGFVETR D 
Sbjct: 361 LYRVPGMVKGGDWVYCMTFGNSDLLGIAAFVIGNHHQQNLWMEYDLAKSRLGFVETRCDS 420

Query: 421 SQQ 423
           + Q
Sbjct: 421 AGQ 423

BLAST of Csor.00g119300 vs. NCBI nr
Match: XP_022991088.1 (aspartic proteinase PCS1-like [Cucurbita maxima])

HSP 1 Score: 811 bits (2095), Expect = 6.82e-295
Identity = 405/423 (95.74%), Positives = 409/423 (96.69%), Query Frame = 0

Query: 1   MNFFLHLLQLLVFLLCFKQSLCFSATQAMVLPLKTETGLNSQPSNKLSFHHNVTLTVELT 60
           M+FFLHLLQLLVFLLCFKQSLCFSATQAMVLPLKTETGLNSQPSNKLSFHHNVTLTVELT
Sbjct: 1   MDFFLHLLQLLVFLLCFKQSLCFSATQAMVLPLKTETGLNSQPSNKLSFHHNVTLTVELT 60

Query: 61  VGSPPQAVTMVLDTGSELPWLNCKKKTQTLTSVFNPLSSSSYSRIPCISPICQKQTRDLP 120
           VGSPPQAVTMVLDTGSELPWLNCKK TQTLTSVFNPLSSSSYSRIPC SPICQKQTRDLP
Sbjct: 61  VGSPPQAVTMVLDTGSELPWLNCKK-TQTLTSVFNPLSSSSYSRIPCSSPICQKQTRDLP 120

Query: 121 NPVVCDEEKLCNVFVSYADGSSLEGNLASDTFRIGSLNKPGTLFGCMGLGSSSNPQEDAK 180
           NPVVCD EKLCNVFVSYADGSSLEGNLASDTFRIGS NKPGTLFGCMG  SSSNPQEDAK
Sbjct: 121 NPVVCDSEKLCNVFVSYADGSSLEGNLASDTFRIGSFNKPGTLFGCMGSDSSSNPQEDAK 180

Query: 181 TTGLMGMNRGSLSFVTQLGLSKFSYCISGRDSSGVLILGEGNHSWVGNMAYTPLVQMSTP 240
           TTGLMGMNRGSLSFVTQLGLSKFSYCISGRDSSGVLILGEGNHSWVGNMAYTPLVQMSTP
Sbjct: 181 TTGLMGMNRGSLSFVTQLGLSKFSYCISGRDSSGVLILGEGNHSWVGNMAYTPLVQMSTP 240

Query: 241 LPSYDRFAYTVKLEGIKVGNKILALEKSILVPDHTGAGQTMVDSGTQFTFLLGPVYTALK 300
           LPSYDRFAYTVKLEGIKVGNKILALEKSILVPDHTGAGQTMVDSGTQFTFLLGPVYTALK
Sbjct: 241 LPSYDRFAYTVKLEGIKVGNKILALEKSILVPDHTGAGQTMVDSGTQFTFLLGPVYTALK 300

Query: 301 NEFLVQTKGVLVPLGDPNFVFQGAMDSCFRVPANKGKLPPLPTVGLMLNGAEMVVGEELL 360
           NEFLVQTKGVLVPL DPNFVFQGAMDSCFRVPAN+GKLPPLP V LMLNGAEMVVG ELL
Sbjct: 301 NEFLVQTKGVLVPLDDPNFVFQGAMDSCFRVPANQGKLPPLPAVSLMLNGAEMVVGGELL 360

Query: 361 LYRVPGMVKGGDWVYCMTFGNSDLLGIAAFVVGNHHQQNLWMEYDLAKSRLGFVETRRDG 420
           LYRVPGMVKGGDWVYCMTFGNSDLLG AAFV+GNHHQQNLWMEYDLAKSRLGFVETR D 
Sbjct: 361 LYRVPGMVKGGDWVYCMTFGNSDLLGTAAFVIGNHHQQNLWMEYDLAKSRLGFVETRCDS 420

Query: 421 SQQ 423
           + Q
Sbjct: 421 AGQ 422

BLAST of Csor.00g119300 vs. ExPASy TrEMBL
Match: A0A6J1GUY9 (aspartic proteinase PCS1-like OS=Cucurbita moschata OX=3662 GN=LOC111457415 PE=3 SV=1)

HSP 1 Score: 835 bits (2157), Expect = 1.23e-304
Identity = 414/423 (97.87%), Positives = 417/423 (98.58%), Query Frame = 0

Query: 1   MNFFLHLLQLLVFLLCFKQSLCFSATQAMVLPLKTETGLNSQPSNKLSFHHNVTLTVELT 60
           MNFFLHLLQLLVFLLCFKQSLCF ATQAMVLPLKTETGLNSQPSNKLSFHHNVTLTVELT
Sbjct: 1   MNFFLHLLQLLVFLLCFKQSLCFCATQAMVLPLKTETGLNSQPSNKLSFHHNVTLTVELT 60

Query: 61  VGSPPQAVTMVLDTGSELPWLNCKKKTQTLTSVFNPLSSSSYSRIPCISPICQKQTRDLP 120
           VGSPPQAVTMVLDTGSELPWLNCKKKTQTLTSVFNPLSSSSYSRIPCISPICQKQTRDLP
Sbjct: 61  VGSPPQAVTMVLDTGSELPWLNCKKKTQTLTSVFNPLSSSSYSRIPCISPICQKQTRDLP 120

Query: 121 NPVVCDEEKLCNVFVSYADGSSLEGNLASDTFRIGSLNKPGTLFGCMGLGSSSNPQEDAK 180
           NPVVCDEEKLCNVFVSYADGSSLEGNLASDTFRIGSLNKPGTLFGCMGLGSSSNPQEDAK
Sbjct: 121 NPVVCDEEKLCNVFVSYADGSSLEGNLASDTFRIGSLNKPGTLFGCMGLGSSSNPQEDAK 180

Query: 181 TTGLMGMNRGSLSFVTQLGLSKFSYCISGRDSSGVLILGEGNHSWVGNMAYTPLVQMSTP 240
           TTGLMGMNRGSLSFVTQLGLSKFSYCISGRDSSGVLILGEGNHSWVGNMAYTPLVQMSTP
Sbjct: 181 TTGLMGMNRGSLSFVTQLGLSKFSYCISGRDSSGVLILGEGNHSWVGNMAYTPLVQMSTP 240

Query: 241 LPSYDRFAYTVKLEGIKVGNKILALEKSILVPDHTGAGQTMVDSGTQFTFLLGPVYTALK 300
           LPSYDRFAYTVKLEGIKVGNKILALEKSILVPDHTGAGQTMVDSGTQFTFLLGPVYTALK
Sbjct: 241 LPSYDRFAYTVKLEGIKVGNKILALEKSILVPDHTGAGQTMVDSGTQFTFLLGPVYTALK 300

Query: 301 NEFLVQTKGVLVPLGDPNFVFQGAMDSCFRVPANKGKLPPLPTVGLMLNGAEMVVGEELL 360
           NEFLVQTKGVLVPLGDPNFVFQGAMDSCFRVPAN+GKLPPLPTV LMLNGAEMVVG ELL
Sbjct: 301 NEFLVQTKGVLVPLGDPNFVFQGAMDSCFRVPANQGKLPPLPTVSLMLNGAEMVVGGELL 360

Query: 361 LYRVPGMVKGGDWVYCMTFGNSDLLGIAAFVVGNHHQQNLWMEYDLAKSRLGFVETRRDG 420
           LYRVPGMVKGGDWVYCMTFGNSDLLGIAAFV+GNHHQQNLWMEYDLAKSRLGFVETR D 
Sbjct: 361 LYRVPGMVKGGDWVYCMTFGNSDLLGIAAFVIGNHHQQNLWMEYDLAKSRLGFVETRCDS 420

Query: 421 SQQ 423
           + Q
Sbjct: 421 AGQ 423

BLAST of Csor.00g119300 vs. ExPASy TrEMBL
Match: A0A6J1JV61 (aspartic proteinase PCS1-like OS=Cucurbita maxima OX=3661 GN=LOC111487788 PE=3 SV=1)

HSP 1 Score: 811 bits (2095), Expect = 3.30e-295
Identity = 405/423 (95.74%), Positives = 409/423 (96.69%), Query Frame = 0

Query: 1   MNFFLHLLQLLVFLLCFKQSLCFSATQAMVLPLKTETGLNSQPSNKLSFHHNVTLTVELT 60
           M+FFLHLLQLLVFLLCFKQSLCFSATQAMVLPLKTETGLNSQPSNKLSFHHNVTLTVELT
Sbjct: 1   MDFFLHLLQLLVFLLCFKQSLCFSATQAMVLPLKTETGLNSQPSNKLSFHHNVTLTVELT 60

Query: 61  VGSPPQAVTMVLDTGSELPWLNCKKKTQTLTSVFNPLSSSSYSRIPCISPICQKQTRDLP 120
           VGSPPQAVTMVLDTGSELPWLNCKK TQTLTSVFNPLSSSSYSRIPC SPICQKQTRDLP
Sbjct: 61  VGSPPQAVTMVLDTGSELPWLNCKK-TQTLTSVFNPLSSSSYSRIPCSSPICQKQTRDLP 120

Query: 121 NPVVCDEEKLCNVFVSYADGSSLEGNLASDTFRIGSLNKPGTLFGCMGLGSSSNPQEDAK 180
           NPVVCD EKLCNVFVSYADGSSLEGNLASDTFRIGS NKPGTLFGCMG  SSSNPQEDAK
Sbjct: 121 NPVVCDSEKLCNVFVSYADGSSLEGNLASDTFRIGSFNKPGTLFGCMGSDSSSNPQEDAK 180

Query: 181 TTGLMGMNRGSLSFVTQLGLSKFSYCISGRDSSGVLILGEGNHSWVGNMAYTPLVQMSTP 240
           TTGLMGMNRGSLSFVTQLGLSKFSYCISGRDSSGVLILGEGNHSWVGNMAYTPLVQMSTP
Sbjct: 181 TTGLMGMNRGSLSFVTQLGLSKFSYCISGRDSSGVLILGEGNHSWVGNMAYTPLVQMSTP 240

Query: 241 LPSYDRFAYTVKLEGIKVGNKILALEKSILVPDHTGAGQTMVDSGTQFTFLLGPVYTALK 300
           LPSYDRFAYTVKLEGIKVGNKILALEKSILVPDHTGAGQTMVDSGTQFTFLLGPVYTALK
Sbjct: 241 LPSYDRFAYTVKLEGIKVGNKILALEKSILVPDHTGAGQTMVDSGTQFTFLLGPVYTALK 300

Query: 301 NEFLVQTKGVLVPLGDPNFVFQGAMDSCFRVPANKGKLPPLPTVGLMLNGAEMVVGEELL 360
           NEFLVQTKGVLVPL DPNFVFQGAMDSCFRVPAN+GKLPPLP V LMLNGAEMVVG ELL
Sbjct: 301 NEFLVQTKGVLVPLDDPNFVFQGAMDSCFRVPANQGKLPPLPAVSLMLNGAEMVVGGELL 360

Query: 361 LYRVPGMVKGGDWVYCMTFGNSDLLGIAAFVVGNHHQQNLWMEYDLAKSRLGFVETRRDG 420
           LYRVPGMVKGGDWVYCMTFGNSDLLG AAFV+GNHHQQNLWMEYDLAKSRLGFVETR D 
Sbjct: 361 LYRVPGMVKGGDWVYCMTFGNSDLLGTAAFVIGNHHQQNLWMEYDLAKSRLGFVETRCDS 420

Query: 421 SQQ 423
           + Q
Sbjct: 421 AGQ 422

BLAST of Csor.00g119300 vs. ExPASy TrEMBL
Match: A0A6J1FDS6 (aspartic proteinase PCS1-like OS=Cucurbita moschata OX=3662 GN=LOC111444820 PE=3 SV=1)

HSP 1 Score: 675 bits (1742), Expect = 1.88e-241
Identity = 329/419 (78.52%), Positives = 368/419 (87.83%), Query Frame = 0

Query: 1   MNFFLHLLQLLVFLLCFKQSLCFSATQAMVLPLKTETGLNSQPSNKLSFHHNVTLTVELT 60
           M FFL LLQLL+  + FKQ LCFSATQ MVLPLKT+ G+ S+PSNKLSFHHNVTLTV LT
Sbjct: 1   MAFFLRLLQLLICCVSFKQGLCFSATQTMVLPLKTQMGVTSRPSNKLSFHHNVTLTVSLT 60

Query: 61  VGSPPQAVTMVLDTGSELPWLNCKKKTQTLTSVFNPLSSSSYSRIPCISPICQKQTRDLP 120
           +GSPPQ VTMVLDTGSEL WL+CKK T  L SVFNPLSSSSYS +PC SP+C+ +TRDLP
Sbjct: 61  LGSPPQPVTMVLDTGSELSWLHCKK-TPNLNSVFNPLSSSSYSPVPCASPVCRTRTRDLP 120

Query: 121 NPVVCDEEKLCNVFVSYADGSSLEGNLASDTFRIGSLNKPGTLFGCMGLGSSSNPQEDAK 180
           NPV CD +KLC+VFVSYAD SSLEGNLASDTFR+GS  +PGT FGCM  G SSN +EDAK
Sbjct: 121 NPVTCDPKKLCHVFVSYADASSLEGNLASDTFRVGSSAQPGTFFGCMDSGFSSNSEEDAK 180

Query: 181 TTGLMGMNRGSLSFVTQLGLSKFSYCISGRDSSGVLILGEGNHSWVGNMAYTPLVQMSTP 240
           TTGLMGMNRGSLSFVTQLGL KFSYCISGRDSSGVL+ G+ + SW+GN+ YTPLVQMSTP
Sbjct: 181 TTGLMGMNRGSLSFVTQLGLPKFSYCISGRDSSGVLLFGDASLSWLGNLTYTPLVQMSTP 240

Query: 241 LPSYDRFAYTVKLEGIKVGNKILALEKSILVPDHTGAGQTMVDSGTQFTFLLGPVYTALK 300
           LP YDR AYTV+L+GI+VGNKILAL KSI  PDHTGAGQTMVDSGTQFTFLLGPVYTALK
Sbjct: 241 LPYYDRVAYTVQLDGIRVGNKILALPKSIFAPDHTGAGQTMVDSGTQFTFLLGPVYTALK 300

Query: 301 NEFLVQTKGVLVPLGDPNFVFQGAMDSCFRVPANKGKLPPLPTVGLMLNGAEMVVGEELL 360
           NEF+VQTKG+LVPLGDPNFVFQGAMD C+RVP  +GKLPPLP V LM  GAEMVVG E+L
Sbjct: 301 NEFVVQTKGILVPLGDPNFVFQGAMDLCYRVPEKQGKLPPLPVVSLMFRGAEMVVGGEVL 360

Query: 361 LYRVPGMVKGGDWVYCMTFGNSDLLGIAAFVVGNHHQQNLWMEYDLAKSRLGFVETRRD 419
           +Y+VPGMV+GGD V+C+TFGNSDLLGI AFV+G+HHQQN+WME+DL KSR+GFVETR D
Sbjct: 361 MYKVPGMVRGGDQVHCLTFGNSDLLGIEAFVIGHHHQQNVWMEFDLVKSRVGFVETRCD 418

BLAST of Csor.00g119300 vs. ExPASy TrEMBL
Match: A0A6J1JZR1 (aspartic proteinase PCS1-like OS=Cucurbita maxima OX=3661 GN=LOC111488906 PE=3 SV=1)

HSP 1 Score: 675 bits (1741), Expect = 2.66e-241
Identity = 330/419 (78.76%), Positives = 367/419 (87.59%), Query Frame = 0

Query: 1   MNFFLHLLQLLVFLLCFKQSLCFSATQAMVLPLKTETGLNSQPSNKLSFHHNVTLTVELT 60
           M FFL LL LL+  + FKQSLCFSA Q MVLPLKT+ G+ SQPSNKLSFHHNVTLTV LT
Sbjct: 1   MAFFLRLLHLLICCVSFKQSLCFSAIQTMVLPLKTQMGVTSQPSNKLSFHHNVTLTVSLT 60

Query: 61  VGSPPQAVTMVLDTGSELPWLNCKKKTQTLTSVFNPLSSSSYSRIPCISPICQKQTRDLP 120
           +GSPPQ VTMVLDTGSEL WL+CKK T  L SVFNPLSSSSYS +PC SP+C+ +TRDLP
Sbjct: 61  LGSPPQPVTMVLDTGSELSWLHCKK-TPNLNSVFNPLSSSSYSPVPCASPVCRTRTRDLP 120

Query: 121 NPVVCDEEKLCNVFVSYADGSSLEGNLASDTFRIGSLNKPGTLFGCMGLGSSSNPQEDAK 180
           NPV CD +KLC+VFVSYAD SSLEGNLASDTFR+GS  +PGT FGCM  G SSN +EDAK
Sbjct: 121 NPVTCDPKKLCHVFVSYADASSLEGNLASDTFRVGSSAQPGTFFGCMDSGFSSNSEEDAK 180

Query: 181 TTGLMGMNRGSLSFVTQLGLSKFSYCISGRDSSGVLILGEGNHSWVGNMAYTPLVQMSTP 240
           TTGLMGMNRGSLSFVTQLGL KFSYCISGRDSSGVL+ G+ + SW+GN+ YTPLVQMSTP
Sbjct: 181 TTGLMGMNRGSLSFVTQLGLPKFSYCISGRDSSGVLLFGDASLSWLGNLTYTPLVQMSTP 240

Query: 241 LPSYDRFAYTVKLEGIKVGNKILALEKSILVPDHTGAGQTMVDSGTQFTFLLGPVYTALK 300
           LP YDR AYTV+L+GI+VGNKILAL KSI  PDHTGAGQTMVDSGTQFTFLLGPVYTALK
Sbjct: 241 LPYYDRVAYTVQLDGIRVGNKILALPKSIFAPDHTGAGQTMVDSGTQFTFLLGPVYTALK 300

Query: 301 NEFLVQTKGVLVPLGDPNFVFQGAMDSCFRVPANKGKLPPLPTVGLMLNGAEMVVGEELL 360
           NEF+VQTKG+LVPLGDPNFVFQGAMD C+RVP  +GKLPPLP V LM  GAEMVVG E+L
Sbjct: 301 NEFVVQTKGILVPLGDPNFVFQGAMDLCYRVPEKQGKLPPLPVVSLMFRGAEMVVGGEVL 360

Query: 361 LYRVPGMVKGGDWVYCMTFGNSDLLGIAAFVVGNHHQQNLWMEYDLAKSRLGFVETRRD 419
           +YRVPGMV+GGD V+C+TFGNSDLLGI AFV+G+HHQQN+WME+DL KSR+GFVETR D
Sbjct: 361 MYRVPGMVRGGDQVHCVTFGNSDLLGIEAFVIGHHHQQNVWMEFDLVKSRVGFVETRCD 418
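
The identity and positives percentages in an HSP header are the matched (or conservatively substituted) columns divided by the alignment length. The following is a minimal Python sketch that recomputes them from a header line, assuming the header layout shown in these reports. The function name `parse_hsp_stats` is hypothetical and is not part of any BLAST tool.

```python
import re

def parse_hsp_stats(header: str) -> dict:
    """Extract identity/positives counts from an HSP header and recompute the percentages."""
    stats = {}
    # The pattern accepts both "Positives" and the "Postives" spelling that some exports use.
    for label, key in (("Identity", "identity"), (r"Posi?tives", "positives")):
        m = re.search(label + r"\s*=\s*(\d+)/(\d+)", header)
        if m:
            matched, length = int(m.group(1)), int(m.group(2))
            stats[key] = round(100.0 * matched / length, 2)
    return stats

header = "Identity = 330/419 (78.76%), Positives = 367/419 (87.59%), Query Frame = 0"
print(parse_hsp_stats(header))  # {'identity': 78.76, 'positives': 87.59}
```

For the A0A6J1JZR1 hit, 330 identical columns out of 419 gives 78.76%, which agrees with the value reported in the header.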

BLAST of Csor.00g119300 vs. ExPASy TrEMBL
Match: A0A5D3DLY6 (Aspartic proteinase PCS1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold205G00670 PE=3 SV=1)

HSP 1 Score: 642 bits (1657), Expect = 8.93e-229
Identity = 318/403 (78.91%), Positives = 350/403 (86.85%), Query Frame = 0

Query: 18  KQSLCFSATQA-MVLPLKTETGLNSQPSNKLSFHHNVTLTVELTVGSPPQAVTMVLDTGS 77
           KQSLCFSAT   MVLPL+T+ GL SQPSNKLSFHHNVTLTV LTVGSPPQ VTMVLDTGS
Sbjct: 2   KQSLCFSATPTTMVLPLQTQMGLISQPSNKLSFHHNVTLTVSLTVGSPPQQVTMVLDTGS 61

Query: 78  ELPWLNCKKKTQTLTSVFNPLSSSSYSRIPCISPICQKQTRDLPNPVVCDEEKLCNVFVS 137
           EL WL+CKK +  LTSVFNPLSSSSYS IPC SP+C+ +TRDLPNPV CD +KLC+  VS
Sbjct: 62  ELSWLHCKK-SPNLTSVFNPLSSSSYSPIPCSSPVCRTRTRDLPNPVTCDPKKLCHAIVS 121

Query: 138 YADGSSLEGNLASDTFRIGSLNKPGTLFGCMGLGSSSNPQEDAKTTGLMGMNRGSLSFVT 197
           YAD SSLEGNLASD FRIGS   PGTLFGCM  G SSN +EDAKTTGLMGMNRGSLSFVT
Sbjct: 122 YADASSLEGNLASDNFRIGSSALPGTLFGCMDSGFSSNSEEDAKTTGLMGMNRGSLSFVT 181

Query: 198 QLGLSKFSYCISGRDSSGVLILGEGNHSWVGNMAYTPLVQMSTPLPSYDRFAYTVKLEGI 257
           QLGL KFSYCISGRDSSGVL+ G+ + SW+GN+ YTPLVQ+STPLP +DR AYTV+L+GI
Sbjct: 182 QLGLPKFSYCISGRDSSGVLLFGDSHLSWLGNLTYTPLVQISTPLPYFDRVAYTVQLDGI 241

Query: 258 KVGNKILALEKSILVPDHTGAGQTMVDSGTQFTFLLGPVYTALKNEFLVQTKGVLVPLGD 317
           +VGNKIL L KSI  PDHTGAGQTMVDSGTQFTFLLGPVYTAL+NEFL QTKGVL PLGD
Sbjct: 242 RVGNKILPLPKSIFAPDHTGAGQTMVDSGTQFTFLLGPVYTALRNEFLEQTKGVLAPLGD 301

Query: 318 PNFVFQGAMDSCFRVPANKGKLPPLPTVGLMLNGAEMVVGEELLLYRVPGMVKGGDWVYC 377
           PNFVFQGAMD C+RVPA  GKLP LP V LM  GAEMVVG E+LLY+VPGM+KG +WVYC
Sbjct: 302 PNFVFQGAMDLCYRVPAG-GKLPELPAVSLMFRGAEMVVGGEVLLYKVPGMMKGKEWVYC 361

Query: 378 MTFGNSDLLGIAAFVVGNHHQQNLWMEYDLAKSRLGFVETRRD 419
           +TFGNSDLLGI AFV+G+HHQQN+WME+DL KSR+GFVETR D
Sbjct: 362 LTFGNSDLLGIEAFVIGHHHQQNVWMEFDLVKSRVGFVETRCD 402

BLAST of Csor.00g119300 vs. TAIR 10
Match: AT2G39710.1 (Eukaryotic aspartyl protease family protein)

HSP 1 Score: 517.7 bits (1332), Expect = 9.8e-147
Identity = 265/413 (64.16%), Positives = 319/413 (77.24%), Query Frame = 0

Query: 6   HLLQLLVFLLCFKQSLC--FSATQAMVLPLKTETGLNSQPSNKLSFHHNVTLTVELTVGS 65
           + L++ V LL F  + C   S  Q ++  LKT+  L    S+KLSF HNVTLTV L VG 
Sbjct: 15  NFLRISVLLLIFPLTFCKTSSTNQTLLFSLKTQK-LPQSSSDKLSFRHNVTLTVTLAVGD 74

Query: 66  PPQAVTMVLDTGSELPWLNCKKKTQTLTSVFNPLSSSSYSRIPCISPICQKQTRDLPNPV 125
           PPQ ++MVLDTGSEL WL+C KK+  L SVFNP+SSS+YS +PC SPIC+ +TRDLP P 
Sbjct: 75  PPQNISMVLDTGSELSWLHC-KKSPNLGSVFNPVSSSTYSPVPCSSPICRTRTRDLPIPA 134

Query: 126 VCD-EEKLCNVFVSYADGSSLEGNLASDTFRIGSLNKPGTLFGCMGLGSSSNPQEDAKTT 185
            CD +  LC+V +SYAD +S+EGNLA +TF IGS+ +PGTLFGCM  G SSN +EDAK+T
Sbjct: 135 SCDPKTHLCHVAISYADATSIEGNLAHETFVIGSVTRPGTLFGCMDSGLSSNSEEDAKST 194

Query: 186 GLMGMNRGSLSFVTQLGLSKFSYCISGRDSSGVLILGEGNHSWVGNMAYTPLVQMSTPLP 245
           GLMGMNRGSLSFV QLG SKFSYCISG DSSG L+LG+ ++SW+G + YTPLV  STPLP
Sbjct: 195 GLMGMNRGSLSFVNQLGFSKFSYCISGSDSSGFLLLGDASYSWLGPIQYTPLVLQSTPLP 254

Query: 246 SYDRFAYTVKLEGIKVGNKILALEKSILVPDHTGAGQTMVDSGTQFTFLLGPVYTALKNE 305
            +DR AYTV+LEGI+VG+KIL+L KS+ VPDHTGAGQTMVDSGTQFTFL+GPVYTALKNE
Sbjct: 255 YFDRVAYTVQLEGIRVGSKILSLPKSVFVPDHTGAGQTMVDSGTQFTFLMGPVYTALKNE 314

Query: 306 FLVQTKGVLVPLGDPNFVFQGAMDSCFRV-PANKGKLPPLPTVGLMLNGAEMVVGEELLL 365
           F+ QTK VL  + DP+FVFQG MD C++V    +     LP V LM  GAEM V  + LL
Sbjct: 315 FITQTKSVLRLVDDPDFVFQGTMDLCYKVGSTTRPNFSGLPMVSLMFRGAEMSVSGQKLL 374

Query: 366 YRVPGM-VKGGDWVYCMTFGNSDLLGIAAFVVGNHHQQNLWMEYDLAKSRLGF 414
           YRV G   +G + VYC TFGNSDLLGI AFV+G+HHQQN+WME+DLAKSR+GF
Sbjct: 375 YRVNGAGSEGKEEVYCFTFGNSDLLGIEAFVIGHHHQQNVWMEFDLAKSRVGF 425

BLAST of Csor.00g119300 vs. TAIR 10
Match: AT5G02190.1 (Eukaryotic aspartyl protease family protein)

HSP 1 Score: 477.2 bits (1227), Expect = 1.5e-134
Identity = 236/407 (57.99%), Positives = 298/407 (73.22%), Query Frame = 0

Query: 24  SATQAMVLPLKTE-TGLNSQPSNKLSFHHNVTLTVELTVGSPPQAVTMVLDTGSELPWLN 83
           S++Q +VLPLKT  T  + +P++KL FHHNVTLTV LTVG+PPQ ++MV+DTGSEL WL 
Sbjct: 41  SSSQTLVLPLKTRITPTDHRPTDKLHFHHNVTLTVTLTVGTPPQNISMVIDTGSELSWLR 100

Query: 84  CKKKTQ-TLTSVFNPLSSSSYSRIPCISPICQKQTRDLPNPVVCDEEKLCNVFVSYADGS 143
           C + +     + F+P  SSSYS IPC SP C+ +TRD   P  CD +KLC+  +SYAD S
Sbjct: 101 CNRSSNPNPVNNFDPTRSSSYSPIPCSSPTCRTRTRDFLIPASCDSDKLCHATLSYADAS 160

Query: 144 SLEGNLASDTFRIG-SLNKPGTLFGCMGLGSSSNPQEDAKTTGLMGMNRGSLSFVTQLGL 203
           S EGNLA++ F  G S N    +FGCMG  S S+P+ED KTTGL+GMNRGSLSF++Q+G 
Sbjct: 161 SSEGNLAAEIFHFGNSTNDSNLIFGCMGSVSGSDPEEDTKTTGLLGMNRGSLSFISQMGF 220

Query: 204 SKFSYCISGRDS-SGVLILGEGNHSWVGNMAYTPLVQMSTPLPSYDRFAYTVKLEGIKVG 263
            KFSYCISG D   G L+LG+ N +W+  + YTPL+++STPLP +DR AYTV+L GIKV 
Sbjct: 221 PKFSYCISGTDDFPGFLLLGDSNFTWLTPLNYTPLIRISTPLPYFDRVAYTVQLTGIKVN 280

Query: 264 NKILALEKSILVPDHTGAGQTMVDSGTQFTFLLGPVYTALKNEFLVQTKGVLVPLGDPNF 323
            K+L + KS+LVPDHTGAGQTMVDSGTQFTFLLGPVYTAL++ FL +T G+L    DP+F
Sbjct: 281 GKLLPIPKSVLVPDHTGAGQTMVDSGTQFTFLLGPVYTALRSHFLNRTNGILTVYEDPDF 340

Query: 324 VFQGAMDSCFR---VPANKGKLPPLPTVGLMLNGAEMVVGEELLLYRVPGMVKGGDWVYC 383
           VFQG MD C+R   V    G L  LPTV L+  GAE+ V  + LLYRVP +  G D VYC
Sbjct: 341 VFQGTMDLCYRISPVRIRSGILHRLPTVSLVFEGAEIAVSGQPLLYRVPHLTVGNDSVYC 400

Query: 384 MTFGNSDLLGIAAFVVGNHHQQNLWMEYDLAKSRLGFVETRRDGSQQ 424
            TFGNSDL+G+ A+V+G+HHQQN+W+E+DL +SR+G      D S Q
Sbjct: 401 FTFGNSDLMGMEAYVIGHHHQQNMWIEFDLQRSRIGLAPVECDVSGQ 447

BLAST of Csor.00g119300 vs. TAIR 10
Match: AT1G66180.1 (Eukaryotic aspartyl protease family protein)

HSP 1 Score: 233.8 bits (595), Expect = 2.8e-61
Identity = 141/390 (36.15%), Positives = 207/390 (53.08%), Query Frame = 0

Query: 40  NSQPSN-KLSFHHNVTLTVELTVGSPPQAVTMVLDTGSELPWLNC--KKKTQTLTSVFNP 99
           +S P N +  F +++ L + L +G+PPQA  MVLDTGS+L W+ C  KK      + F+P
Sbjct: 56  SSPPYNFRSRFKYSMALIISLPIGTPPQAQQMVLDTGSQLSWIQCHRKKLPPKPKTSFDP 115

Query: 100 LSSSSYSRIPCISPICQKQTRDLPNPVVCDEEKLCNVFVSYADGSSLEGNLASDTFRIGS 159
             SSS+S +PC  P+C+ +  D   P  CD  +LC+    YADG+  EGNL  +     +
Sbjct: 116 SLSSSFSTLPCSHPLCKPRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKITFSN 175

Query: 160 LN-KPGTLFGCMGLGSSSNPQEDAKTTGLMGMNRGSLSFVTQLGLSKFSYCI------SG 219
               P  + GC          E +   G++GMNRG LSFV+Q  +SKFSYCI       G
Sbjct: 176 TEITPPLILGC--------ATESSDDRGILGMNRGRLSFVSQAKISKFSYCIPPKSNRPG 235

Query: 220 RDSSGVLILGEGNHSWVGNMAYTPLVQM--STPLPSYDRFAYTVKLEGIKVGNKILALEK 279
              +G   LG+  +S      Y  L+    S  +P+ D  AYTV + GI+ G K L +  
Sbjct: 236 FTPTGSFYLGDNPNS--HGFKYVSLLTFPESQRMPNLDPLAYTVPMIGIRFGLKKLNISG 295

Query: 280 SILVPDHTGAGQTMVDSGTQFTFLLGPVYTALKNEFLVQTKGVLVPLGDPNFVFQGAMDS 339
           S+  PD  G+GQTMVDSG++FT L+   Y  ++ E + +    L       +V+ G  D 
Sbjct: 296 SVFRPDAGGSGQTMVDSGSEFTHLVDAAYDKVRAEIMTRVGRRL----KKGYVYGGTADM 355

Query: 340 CFRVPANKGKLPPL--PTVGLMLNGAEMVVGEELLLYRVPGMVKGGDWVYCMTFGNSDLL 399
           CF    N   +P L    V +   G E++V +E +L  V      G  ++C+  G S +L
Sbjct: 356 CF--DGNVAMIPRLIGDLVFVFTRGVEILVPKERVLVNV------GGGIHCVGIGRSSML 415

Query: 400 GIAAFVVGNHHQQNLWMEYDLAKSRLGFVE 416
           G A+ ++GN HQQNLW+E+D+   R+GF +
Sbjct: 416 GAASNIIGNVHQQNLWVEFDVTNRRVGFAK 423

BLAST of Csor.00g119300 vs. TAIR 10
Match: AT5G37540.1 (Eukaryotic aspartyl protease family protein)

HSP 1 Score: 230.7 bits (587), Expect = 2.4e-60
Identity = 152/445 (34.16%), Positives = 228/445 (51.24%), Query Frame = 0

Query: 4   FLHLLQLLVFLLCFKQSLCFSATQAMVLPLKT------------ETGLNSQ-----PSNK 63
           FL LL +  F  C+  SL +S++ ++  PL +            +T L S+     PS+ 
Sbjct: 9   FLKLLYIF-FFFCYSVSLSWSSSLSLHFPLTSLRLTPTTNSSSFKTSLLSRRNPSPPSSP 68

Query: 64  LSFHHNV----TLTVELTVGSPPQAVTMVLDTGSELPWLNC-----KKKTQTLTSVFNPL 123
            +F  N+     L + L +G+P Q+  +VLDTGS+L W+ C     KK     T+ F+P 
Sbjct: 69  YTFRSNIKYSMALILSLPIGTPSQSQELVLDTGSQLSWIQCHPKKIKKPLPPPTTSFDPS 128

Query: 124 SSSSYSRIPCISPICQKQTRDLPNPVVCDEEKLCNVFVSYADGSSLEGNLASDTFRI-GS 183
            SSS+S +PC  P+C+ +  D   P  CD  +LC+    YADG+  EGNL  + F    S
Sbjct: 129 LSSSFSDLPCSHPLCKPRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKFTFSNS 188

Query: 184 LNKPGTLFGCMGLGSSSNPQEDAKTTGLMGMNRGSLSFVTQLGLSKFSYCISGRD----- 243
              P  + GC         +E     G++GMN G LSF++Q  +SKFSYCI  R      
Sbjct: 189 QTTPPLILGC--------AKESTDEKGILGMNLGRLSFISQAKISKFSYCIPTRSNRPGL 248

Query: 244 -SSGVLILGEGNHSWVGNMAYTPLVQMSTPLPSYDRFAYTVKLEGIKVGNKILALEKSIL 303
            S+G   LG+  +S             S  +P+ D  AYTV L+GI++G K L +  S+ 
Sbjct: 249 ASTGSFYLGDNPNSRGFKYVSLLTFPQSQRMPNLDPLAYTVPLQGIRIGQKRLNIPGSVF 308

Query: 304 VPDHTGAGQTMVDSGTQFTFLLGPVYTALKNEFLVQTKGVLVPLGDPNFVFQGAMDSCFR 363
            PD  G+GQTMVDSG++FT L+   Y  +K E +V+  G  +  G   +V+    D CF 
Sbjct: 309 RPDAGGSGQTMVDSGSEFTHLVDVAYDKVKEE-IVRLVGSRLKKG---YVYGSTADMCF- 368

Query: 364 VPANKGKLPPLPTVGLMLNGA--EMVVGEELLLYRVPGMVKGGDWVYCMTFGNSDLLGIA 414
              N         +G ++     E   G E+L+ +   +V  G  ++C+  G S +LG A
Sbjct: 369 -DGNHSM-----EIGRLIGDLVFEFGRGVEILVEKQSLLVNVGGGIHCVGIGRSSMLGAA 428

BLAST of Csor.00g119300 vs. TAIR 10
Match: AT3G54400.1 (Eukaryotic aspartyl protease family protein)

HSP 1 Score: 136.0 bits (341), Expect = 8.0e-32
Identity = 119/371 (32.08%), Positives = 174/371 (46.90%), Query Frame = 0

Query: 54  TLTVELTVGSPPQAVTMVLDTGSELPWLNCKKKTQTLTSV-FNPLSSSSYSRIPCISPIC 113
           T  V   +G+P Q + + LDT ++  W+ C       +SV F+P  SSS   + C +P C
Sbjct: 87  TYIVRANIGTPAQPMLVALDTSNDAAWIPCSGCVGCSSSVLFDPSKSSSSRTLQCEAPQC 146

Query: 114 QKQTRDLPNPVVCDEEKLCNVFVSYADGSSLEGNLASDTFRIGSLNKPGTLFGCMGLGSS 173
               +  PNP  C   K C   ++Y  GS++E  L  DT  + S   P   FGC+   S 
Sbjct: 147 ----KQAPNP-SCTVSKSCGFNMTYG-GSTIEAYLTQDTLTLASDVIPNYTFGCINKASG 206

Query: 174 SNPQEDAKTTGLMGMNRGSLSFVTQ---LGLSKFSYCISGRDS---SGVLILGEGNHSWV 233
           ++        GLMG+ RG LS ++Q   L  S FSYC+    S   SG L LG  N    
Sbjct: 207 TS----LPAQGLMGLGRGPLSLISQSQNLYQSTFSYCLPNSKSSNFSGSLRLGPKNQ--- 266

Query: 234 GNMAYTPLVQMSTPLPSYDRFA--YTVKLEGIKVGNKILALEKSILVPD-HTGAGQTMVD 293
                 P+   +TPL    R +  Y V L GI+VGNKI+ +  S L  D  TGAG T+ D
Sbjct: 267 ------PIRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDPATGAG-TIFD 326

Query: 294 SGTQFTFLLGPVYTALKNEFLVQTKGVLVPLGDPNFVFQGAMDSCFRVPANKGKLPPLPT 353
           SGT +T L+ P Y A++NEF  + K       + N    G  D+C+      G +   P+
Sbjct: 327 SGTVYTRLVEPAYVAVRNEFRRRVK-------NANATSLGGFDTCY-----SGSV-VFPS 386

Query: 354 VGLMLNGAEMVV-GEELLLYRVPGMVKGGDWVYCMTFGNSDL-LGIAAFVVGNHHQQNLW 413
           V  M  G  + +  + LL++   G +       C+    + + +     V+ +  QQN  
Sbjct: 387 VTFMFAGMNVTLPPDNLLIHSSAGNLS------CLAMAAAPVNVNSVLNVIASMQQQNHR 418

The following BLAST results are available for this feature:
Match Name  E-value   Identity  Description
Q9LZL3      2.1e-133  57.99     Aspartic proteinase PCS1 OS=Arabidopsis thaliana OX=3702 GN=PCS1 PE=2 SV=1
Q766C2      2.4e-41   31.42     Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 S...
Q766C3      1.6e-40   29.97     Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 S...
Q9LNJ3      2.4e-33   32.88     Aspartyl protease family protein 2 OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 ...
O04496      4.0e-28   30.56     Aspartyl protease AED3 OS=Arabidopsis thaliana OX=3702 GN=AED3 PE=1 SV=1

Match Name      E-value    Identity  Description
KAG6602656.1    0.0        100.00    Aspartic proteinase PCS1, partial [Cucurbita argyrosperma subsp. sororia]
KAG7033342.1    7.58e-306  98.11     Aspartic proteinase PCS1, partial [Cucurbita argyrosperma subsp. argyrosperma]
XP_022955370.1  2.53e-304  97.87     aspartic proteinase PCS1-like [Cucurbita moschata]
XP_023540416.1  1.15e-300  96.45     aspartic proteinase PCS1-like [Cucurbita pepo subsp. pepo]
XP_022991088.1  6.82e-295  95.74     aspartic proteinase PCS1-like [Cucurbita maxima]

Match Name  E-value    Identity  Description
A0A6J1GUY9  1.23e-304  97.87     aspartic proteinase PCS1-like OS=Cucurbita moschata OX=3662 GN=LOC111457415 PE=3...
A0A6J1JV61  3.30e-295  95.74     aspartic proteinase PCS1-like OS=Cucurbita maxima OX=3661 GN=LOC111487788 PE=3 S...
A0A6J1FDS6  1.88e-241  78.52     aspartic proteinase PCS1-like OS=Cucurbita moschata OX=3662 GN=LOC111444820 PE=3...
A0A6J1JZR1  2.66e-241  78.76     aspartic proteinase PCS1-like OS=Cucurbita maxima OX=3661 GN=LOC111488906 PE=3 S...
A0A5D3DLY6  8.93e-229  78.91     Aspartic proteinase PCS1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffol...

Match Name   E-value   Identity  Description
AT2G39710.1  9.8e-147  64.16     Eukaryotic aspartyl protease family protein
AT5G02190.1  1.5e-134  57.99     Eukaryotic aspartyl protease family protein
AT1G66180.1  2.8e-61   36.15     Eukaryotic aspartyl protease family protein
AT5G37540.1  2.4e-60   34.16     Eukaryotic aspartyl protease family protein
AT3G54400.1  8.0e-32   32.08     Eukaryotic aspartyl protease family protein
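
Summary rows like these are often screened by E-value and percent identity before a closer look at the alignments. The following is a minimal Python sketch using the TAIR 10 values listed above. The function name `filter_hits` and both thresholds are illustrative assumptions, not cutoffs used by this database.

```python
# Rows copied from the TAIR 10 summary: (match name, E-value as text, percent identity).
tair_hits = [
    ("AT2G39710.1", "9.8e-147", 64.16),
    ("AT5G02190.1", "1.5e-134", 57.99),
    ("AT1G66180.1", "2.8e-61", 36.15),
    ("AT5G37540.1", "2.4e-60", 34.16),
    ("AT3G54400.1", "8.0e-32", 32.08),
]

def filter_hits(hits, max_evalue=1e-50, min_identity=50.0):
    """Return names of hits with E-value <= max_evalue and identity >= min_identity.

    Both thresholds are hypothetical defaults chosen only for this example.
    """
    return [
        name
        for name, evalue, identity in hits
        if float(evalue) <= max_evalue and identity >= min_identity
    ]

print(filter_hits(tair_hits))  # ['AT2G39710.1', 'AT5G02190.1']
```

With these example thresholds only the two closest Arabidopsis matches remain. AT1G66180.1 and AT5G37540.1 pass the E-value cutoff but fall below 50% identity.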
InterPro
Analysis Name: InterPro Annotations of Silver-seed gourd (sororia) v1
Date Performed: 2021-10-25
IPR Term   IPR Description                        Source       Source Term     Source Description                           Alignment
IPR021109  Aspartic peptidase domain superfamily  GENE3D       2.40.70.10      Acid Proteases                               coord: 36..220; e-value: 2.6E-34; score: 120.8
IPR021109  Aspartic peptidase domain superfamily  GENE3D       2.40.70.10      Acid Proteases                               coord: 221..428; e-value: 1.3E-37; score: 131.1
IPR021109  Aspartic peptidase domain superfamily  SUPERFAMILY  50630           Acid proteases                               coord: 55..417
IPR032799  Xylanase inhibitor, C-terminal         PFAM         PF14541         TAXi_C                                       coord: 248..414; e-value: 9.0E-32; score: 110.1
IPR032861  Xylanase inhibitor, N-terminal         PFAM         PF14543         TAXi_N                                       coord: 57..219; e-value: 1.5E-37; score: 129.5
None       No IPR available                       MOBIDB_LITE  mobidb-lite     disorder_prediction                          coord: 422..441
None       No IPR available                       PANTHER      PTHR47965:SF49  EUKARYOTIC ASPARTYL PROTEASE FAMILY PROTEIN  coord: 6..424
IPR001461  Aspartic peptidase A1 family           PANTHER      PTHR47965       ASPARTYL PROTEASE-RELATED                    coord: 6..424
IPR033121  Peptidase family A1 domain             PROSITE      PS51767         PEPTIDASE_A1                                 coord: 55..414; score: 34.528416
IPR034161  Pepsin-like domain, plant              CDD          cd05476         pepsin_A_like_plant                          coord: 54..416; e-value: 7.08749E-67; score: 213.664
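
The `coord` ranges can be used to measure each predicted domain and to ask which domains cover a given residue. The following is a minimal Python sketch using the two Pfam rows above. It assumes the coordinates are 1-based and inclusive, which this page does not state, and the helper names `domain_length` and `domains_at` are hypothetical.

```python
# (source description, start, end) copied from the Pfam rows.
# Assumption: coordinates are 1-based and inclusive on the protein sequence.
pfam_domains = [("TAXi_N", 57, 219), ("TAXi_C", 248, 414)]

def domain_length(start: int, end: int) -> int:
    """Number of residues in an inclusive coordinate range."""
    return end - start + 1

def domains_at(position: int, domains) -> list:
    """Names of all domains whose range contains the given residue position."""
    return [name for name, start, end in domains if start <= position <= end]

for name, start, end in pfam_domains:
    print(name, domain_length(start, end))  # TAXi_N 163, then TAXi_C 167

print(domains_at(100, pfam_domains))  # ['TAXi_N']
print(domains_at(230, pfam_domains))  # [] (falls between the two Pfam domains)
```

Under the inclusive-coordinate assumption, the two Pfam domains are separated by residues 220 to 247, a stretch of 28 residues.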

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name        Unique Name         Type
Csor.00g119300.m02  Csor.00g119300.m02  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0006508      proteolysis
molecular_function  GO:0004190      aspartic-type endopeptidase activity