CmaCh05G005530 (gene) Cucurbita maxima (Rimu) v1.1

Overview
Name: CmaCh05G005530
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu) v1.1)
Description: Eukaryotic aspartyl protease family protein
Location: Cma_Chr05: 2709136 .. 2711274 (+)
RNA-Seq Expression: CmaCh05G005530
Synteny: CmaCh05G005530
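
The Location field packs a chromosome name, a start, an end, and a strand into one string. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual GFF convention; this page does not state it), the genomic span of the gene can be derived as shown. The helper name `parse_location` is hypothetical and not part of any database API.

```python
import re

def parse_location(text):
    """Split 'Chrom: start .. end (strand)' into its parts (hypothetical helper)."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand

chrom, start, end, strand = parse_location("Cma_Chr05: 2709136 .. 2711274 (+)")
# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end - start + 1
print(chrom, strand, span)
```

Under that assumption the gene spans 2139 bp on the plus strand of Cma_Chr05.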
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

CAGACTAAACAATTATATGCCCATTTTCGTAATTTCACAACACAATTTTGTTATAAATCTACCATACCCCCATGCAAAAACTCCGGTAACCGGTAACCTTCAAATTTTCGTTTTCACCTCTTCCACCGCCGTCTCTCTCAACTCCGGCCACCGAGTTGAGTCCGAGTCATGGCCGTCAAATTTTTCTTCTTCGTCTTTTTAGCCCTTCTGGCACTAAACTCCAACGCTTCCGATCTCTGCGCCGCCGGATCGGACGGCTCCGGTGATCTCTCCGTCATCCCCATCTACGGAAAATGCTCGCCGTTCACGGCGCCGAAGTCAGAATCTTGGGTGAACACGGTGATTGATATGGCTTCGAAAGACCCAGCCCGAATTAAGTACTTGTCGAGCCTCGCCGCCCAGAAGACGGTGGCGGCGCCTATCGCCTCCGGGCAACATGCTCTCAATATTGGGAATTATGTGGTGAGAGTTCAATTGGGTACTCCGGGTCAAGCTATGTATATGGTTCTTGACACCAGTAGTGACGCCGCCTGGGCACCGTGCTCCGGCTGCTCCGGCTGCTCCGCCACCACTTTCTTGTCTAAGAATTCCTCCACTTTTGCCACTTTGGATTGCTCCAAACCGCAGTGTAGCCAGGTTTGTTGAATCGCTACTCAAATTATTTAATTTACATTTATTATAATTATTATTTATGTTTAAAAAAAAAAAAAATCTGTCGTTTAATTTTCAAAATAGCAATTTTAGTGTTTAGAATTTATAAATTAATTAATTGAATCTCAGTTTAAATAATTGTTTAAAAAACATAAAATTAGATCTAATTACTTATTAATATTAACTCGAATTTAAATTATTTTGGTATATTTTATAAAATGATCCAAAATTTTCTATTTTTTATCATCAAATAAATTACAAATATTTAAATATTTATTTGTAATTGAGGACTAAATATCTTGAACATGTGAATTAAATTTATTTGTTAATTTTCTTCAGGCTCGGGGTCTTTCCTGTCCGACAACCGGTAGCGTCGATTGCTTATTCAACCAAACATACGGCGGCGACTCATCGTTTTCCGCCACCCTAGTTCAAGACACTCTCCACTTGGGAACCGATGTCATTCCAAATTTCTCATTCGGTTGCATCAGCTCCGCCTCCGGCAGCTCCATTCCGCCGCAAGGCTTATTGGGTCTCGGCCGTGGTCCCCTCTCACTCATCTCACAATCCACTTCACTCTACTCCGGTTTATTCTCGTACTGTCTCCCGAGTTTCAAATCCTACTACTTCTCCGGTTCACTAAAACTCGGCCCGGTCGGACAACCTAAGTCAATCCGAACCACCCCACTCCTTAAGAACCCGCACAGACCGTCTCTCTACTACGTCAACCTAACAGGCATCAGTGTCGGCCGGGTTCTCGTCCCAATTCCACCAGAGACTCTCGCATTCGACCCGAACACTGGTGCGGGAACCATCATCGACTCGGGGACGGTAATAACCCGGTTCGTATACCCTGTCTACACAGCGGTTCGAGACGAATTTAGAAAGCAAGTGGGCGGTTCGTTTTCGCCATTGGGAGCTTTCGACACGTGTTTTACAACGAGCAATGAAATGGCGGCGCCCGGCATTACGTTCCATTTGAGTGGATTGGACTTGAAATTGCCGATGGAGAACAGTTTGATTCACAGCAGCGCCGGTTCGTTGGCTTGTTTGGCGATGGCGGCGGCGCCCAACAATGTGAACTCTGTGTTGAATGTTATCGCCAATTTGCAGCAACAAAATCATCGGATTTTGTTTGATATTGCAAATTCTAAGCTGGGGATTGCTCGCGAGCTCTGTAATTAGCTGAATCACTGACAATGGAAGCTATGAATAATTTGGAAAATGTATCGTTTCTTTTAATTTAATGTCCAATCCAATGTATGGTTAAAGTGGGTTTTAAAGATGGTGGCTTTTAGTATCCATTTTAATGGGCTTGGCCCAATATCATTTGGGCCCAAGGTCCACAT
TTGGATGAAAGTGGTTTTTAAAGATAGTGGCTTTTTAGTATCTACTTTAATGGGCTTGGCCCAATGTCATCTTGGGCCCAAGGTCCACAATTGGATTTGTCTTCTCTTGGTGTTATCATTTCGTTTAACAAAATTGGAG

mRNA sequence

CAGACTAAACAATTATATGCCCATTTTCGTAATTTCACAACACAATTTTGTTATAAATCTACCATACCCCCATGCAAAAACTCCGGTAACCGGTAACCTTCAAATTTTCGTTTTCACCTCTTCCACCGCCGTCTCTCTCAACTCCGGCCACCGAGTTGAGTCCGAGTCATGGCCGTCAAATTTTTCTTCTTCGTCTTTTTAGCCCTTCTGGCACTAAACTCCAACGCTTCCGATCTCTGCGCCGCCGGATCGGACGGCTCCGGTGATCTCTCCGTCATCCCCATCTACGGAAAATGCTCGCCGTTCACGGCGCCGAAGTCAGAATCTTGGGTGAACACGGTGATTGATATGGCTTCGAAAGACCCAGCCCGAATTAAGTACTTGTCGAGCCTCGCCGCCCAGAAGACGGTGGCGGCGCCTATCGCCTCCGGGCAACATGCTCTCAATATTGGGAATTATGTGGTGAGAGTTCAATTGGGTACTCCGGGTCAAGCTATGTATATGGTTCTTGACACCAGTAGTGACGCCGCCTGGGCACCGTGCTCCGGCTGCTCCGGCTGCTCCGCCACCACTTTCTTGTCTAAGAATTCCTCCACTTTTGCCACTTTGGATTGCTCCAAACCGCAGTGTAGCCAGGCTCGGGGTCTTTCCTGTCCGACAACCGGTAGCGTCGATTGCTTATTCAACCAAACATACGGCGGCGACTCATCGTTTTCCGCCACCCTAGTTCAAGACACTCTCCACTTGGGAACCGATGTCATTCCAAATTTCTCATTCGGTTGCATCAGCTCCGCCTCCGGCAGCTCCATTCCGCCGCAAGGCTTATTGGGTCTCGGCCGTGGTCCCCTCTCACTCATCTCACAATCCACTTCACTCTACTCCGGTTTATTCTCGTACTGTCTCCCGAGTTTCAAATCCTACTACTTCTCCGGTTCACTAAAACTCGGCCCGGTCGGACAACCTAAGTCAATCCGAACCACCCCACTCCTTAAGAACCCGCACAGACCGTCTCTCTACTACGTCAACCTAACAGGCATCAGTGTCGGCCGGGTTCTCGTCCCAATTCCACCAGAGACTCTCGCATTCGACCCGAACACTGGTGCGGGAACCATCATCGACTCGGGGACGGTAATAACCCGGTTCGTATACCCTGTCTACACAGCGGTTCGAGACGAATTTAGAAAGCAAGTGGGCGGTTCGTTTTCGCCATTGGGAGCTTTCGACACGTGTTTTACAACGAGCAATGAAATGGCGGCGCCCGGCATTACGTTCCATTTGAGTGGATTGGACTTGAAATTGCCGATGGAGAACAGTTTGATTCACAGCAGCGCCGGTTCGTTGGCTTGTTTGGCGATGGCGGCGGCGCCCAACAATGTGAACTCTGTGTTGAATGTTATCGCCAATTTGCAGCAACAAAATCATCGGATTTTGTTTGATATTGCAAATTCTAAGCTGGGGATTGCTCGCGAGCTCTGTAATTAGCTGAATCACTGACAATGGAAGCTATGAATAATTTGGAAAATGTATCGTTTCTTTTAATTTAATGTCCAATCCAATGTATGGTTAAAGTGGGTTTTAAAGATGGTGGCTTTTAGTATCCATTTTAATGGGCTTGGCCCAATATCATTTGGGCCCAAGGTCCACATTTGGATGAAAGTGGTTTTTAAAGATAGTGGCTTTTTAGTATCTACTTTAATGGGCTTGGCCCAATGTCATCTTGGGCCCAAGGTCCACAATTGGATTTGTCTTCTCTTGGTGTTATCATTTCGTTTAACAAAATTGGAG

Coding sequence (CDS)

ATGGCCGTCAAATTTTTCTTCTTCGTCTTTTTAGCCCTTCTGGCACTAAACTCCAACGCTTCCGATCTCTGCGCCGCCGGATCGGACGGCTCCGGTGATCTCTCCGTCATCCCCATCTACGGAAAATGCTCGCCGTTCACGGCGCCGAAGTCAGAATCTTGGGTGAACACGGTGATTGATATGGCTTCGAAAGACCCAGCCCGAATTAAGTACTTGTCGAGCCTCGCCGCCCAGAAGACGGTGGCGGCGCCTATCGCCTCCGGGCAACATGCTCTCAATATTGGGAATTATGTGGTGAGAGTTCAATTGGGTACTCCGGGTCAAGCTATGTATATGGTTCTTGACACCAGTAGTGACGCCGCCTGGGCACCGTGCTCCGGCTGCTCCGGCTGCTCCGCCACCACTTTCTTGTCTAAGAATTCCTCCACTTTTGCCACTTTGGATTGCTCCAAACCGCAGTGTAGCCAGGCTCGGGGTCTTTCCTGTCCGACAACCGGTAGCGTCGATTGCTTATTCAACCAAACATACGGCGGCGACTCATCGTTTTCCGCCACCCTAGTTCAAGACACTCTCCACTTGGGAACCGATGTCATTCCAAATTTCTCATTCGGTTGCATCAGCTCCGCCTCCGGCAGCTCCATTCCGCCGCAAGGCTTATTGGGTCTCGGCCGTGGTCCCCTCTCACTCATCTCACAATCCACTTCACTCTACTCCGGTTTATTCTCGTACTGTCTCCCGAGTTTCAAATCCTACTACTTCTCCGGTTCACTAAAACTCGGCCCGGTCGGACAACCTAAGTCAATCCGAACCACCCCACTCCTTAAGAACCCGCACAGACCGTCTCTCTACTACGTCAACCTAACAGGCATCAGTGTCGGCCGGGTTCTCGTCCCAATTCCACCAGAGACTCTCGCATTCGACCCGAACACTGGTGCGGGAACCATCATCGACTCGGGGACGGTAATAACCCGGTTCGTATACCCTGTCTACACAGCGGTTCGAGACGAATTTAGAAAGCAAGTGGGCGGTTCGTTTTCGCCATTGGGAGCTTTCGACACGTGTTTTACAACGAGCAATGAAATGGCGGCGCCCGGCATTACGTTCCATTTGAGTGGATTGGACTTGAAATTGCCGATGGAGAACAGTTTGATTCACAGCAGCGCCGGTTCGTTGGCTTGTTTGGCGATGGCGGCGGCGCCCAACAATGTGAACTCTGTGTTGAATGTTATCGCCAATTTGCAGCAACAAAATCATCGGATTTTGTTTGATATTGCAAATTCTAAGCTGGGGATTGCTCGCGAGCTCTGTAATTAG

Protein sequence

MAVKFFFFVFLALLALNSNASDLCAAGSDGSGDLSVIPIYGKCSPFTAPKSESWVNTVIDMASKDPARIKYLSSLAAQKTVAAPIASGQHALNIGNYVVRVQLGTPGQAMYMVLDTSSDAAWAPCSGCSGCSATTFLSKNSSTFATLDCSKPQCSQARGLSCPTTGSVDCLFNQTYGGDSSFSATLVQDTLHLGTDVIPNFSFGCISSASGSSIPPQGLLGLGRGPLSLISQSTSLYSGLFSYCLPSFKSYYFSGSLKLGPVGQPKSIRTTPLLKNPHRPSLYYVNLTGISVGRVLVPIPPETLAFDPNTGAGTIIDSGTVITRFVYPVYTAVRDEFRKQVGGSFSPLGAFDTCFTTSNEMAAPGITFHLSGLDLKLPMENSLIHSSAGSLACLAMAAAPNNVNSVLNVIANLQQQNHRILFDIANSKLGIARELCN
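
The protein sequence should follow from the coding sequence by codon-wise translation. The sketch below assumes the standard nuclear genetic code (NCBI translation table 1), which this page does not state explicitly, and checks only the first eight and last four codons of the CDS listed above. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

head = translate("ATGGCCGTCAAATTTTTCTTCTTC")  # first eight codons of the CDS
tail = translate("CTCTGTAATTAG")              # last four codons, ending in TAG
print(head, tail)
```

The two fragments translate to MAVKFFFF and LCN, which agree with the start and end of the protein sequence shown above.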
Homology
BLAST of CmaCh05G005530 vs. ExPASy Swiss-Prot
Match: O04496 (Aspartyl protease AED3 OS=Arabidopsis thaliana OX=3702 GN=AED3 PE=1 SV=1)

HSP 1 Score: 560.8 bits (1444), Expect = 1.3e-158
Identity = 296/446 (66.37%), Positives = 351/446 (78.70%), Query Frame = 0

Query: 2   AVKFFFFVFLAL-LALNSNASDLCA-AGSDGSGDLSVIPIYGKCSPFTAPK-SESWVNTV 61
           ++ FFFF+ L L     +   D CA A  DGS DLS+IPI  KCSPF     S S ++TV
Sbjct: 5   SLHFFFFLTLLLPFTFTTATRDTCATAAPDGSDDLSIIPINAKCSPFAPTHVSASVIDTV 64

Query: 62  IDMASKDPARIKYLSSLAA--QKTVAAPIASGQHALNIGNYVVRVQLGTPGQAMYMVLDT 121
           + MAS D  R+ YLSSL A   K  + P+ASG + L+IGNYVVR +LGTP Q M+MVLDT
Sbjct: 65  LHMASSDSHRLTYLSSLVAGKPKPTSVPVASG-NQLHIGNYVVRAKLGTPPQLMFMVLDT 124

Query: 122 SSDAAWAPCSGCSGCS--ATTFLSKNSSTFATLDCSKPQCSQARGLSCPTTGSVD--CLF 181
           S+DA W PCSGCSGCS  +T+F + +SST++T+ CS  QC+QARGL+CP++      C F
Sbjct: 125 SNDAVWLPCSGCSGCSNASTSFNTNSSSTYSTVSCSTAQCTQARGLTCPSSSPQPSVCSF 184

Query: 182 NQTYGGDSSFSATLVQDTLHLGTDVIPNFSFGCISSASGSSIPPQGLLGLGRGPLSLISQ 241
           NQ+YGGDSSFSA+LVQDTL L  DVIPNFSFGCI+SASG+S+PPQGL+GLGRGP+SL+SQ
Sbjct: 185 NQSYGGDSSFSASLVQDTLTLAPDVIPNFSFGCINSASGNSLPPQGLMGLGRGPMSLVSQ 244

Query: 242 STSLYSGLFSYCLPSFKSYYFSGSLKLGPVGQPKSIRTTPLLKNPHRPSLYYVNLTGISV 301
           +TSLYSG+FSYCLPSF+S+YFSGSLKLG +GQPKSIR TPLL+NP RPSLYYVNLTG+SV
Sbjct: 245 TTSLYSGVFSYCLPSFRSFYFSGSLKLGLLGQPKSIRYTPLLRNPRRPSLYYVNLTGVSV 304

Query: 302 GRVLVPIPPETLAFDPNTGAGTIIDSGTVITRFVYPVYTAVRDEFRKQVG-GSFSPLGAF 361
           G V VP+ P  L FD N+GAGTIIDSGTVITRF  PVY A+RDEFRKQV   SFS LGAF
Sbjct: 305 GSVQVPVDPVYLTFDANSGAGTIIDSGTVITRFAQPVYEAIRDEFRKQVNVSSFSTLGAF 364

Query: 362 DTCFTTSNEMAAPGITFHLSGLDLKLPMENSLIHSSAGSLACLAMAAAPNNVNSVLNVIA 421
           DTCF+  NE  AP IT H++ LDLKLPMEN+LIHSSAG+L CL+MA    N N+VLNVIA
Sbjct: 365 DTCFSADNENVAPKITLHMTSLDLKLPMENTLIHSSAGTLTCLSMAGIRQNANAVLNVIA 424

Query: 422 NLQQQNHRILFDIANSKLGIARELCN 438
           NLQQQN RILFD+ NS++GIA E CN
Sbjct: 425 NLQQQNLRILFDVPNSRIGIAPEPCN 449
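
The percentages in the HSP header are the match counts divided by the alignment length, which includes gap columns. A small sketch that reproduces the two figures for the O04496 hit from the counts reported above; the function name `percent` is illustrative.

```python
def percent(matches, aligned_length):
    """Return matches as a percentage of the alignment length, to two decimals."""
    return round(100.0 * matches / aligned_length, 2)

# Counts taken from the HSP header of the O04496 hit: 296 identical and
# 351 conservative-or-identical positions over 446 alignment columns.
identity = percent(296, 446)
positives = percent(351, 446)
print(f"{identity:.2f}% identity, {positives:.2f}% positives")
```

This gives 66.37% and 78.70%, matching the header values.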

BLAST of CmaCh05G005530 vs. ExPASy Swiss-Prot
Match: Q6F4N5 (Aspartyl protease 25 OS=Oryza sativa subsp. japonica OX=39947 GN=AP25 PE=2 SV=1)

HSP 1 Score: 414.1 bits (1063), Expect = 2.0e-114
Identity = 226/410 (55.12%), Positives = 293/410 (71.46%), Query Frame = 0

Query: 49  PKSESWVNTVIDMASKDPARIKYLSSLAAQKTV-AAPIASGQHALNIGNYVVRVQLGTPG 108
           P S S + ++I +A  D AR+ +LSS AA   V +AP+ASGQ      +YVVR  LG+P 
Sbjct: 33  PSSPSPLESIIALARDDDARLLFLSSKAATAGVSSAPVASGQAP---PSYVVRAGLGSPS 92

Query: 109 QAMYMVLDTSSDAAWAPCSGCSGC-SATTFLSKNSSTFATLDCSKPQCSQARGLSCPT-T 168
           Q + + LDTS+DA WA CS C  C S++ F   NSS++A+L CS   C   +G +CP   
Sbjct: 93  QQLLLALDTSADATWAHCSPCGTCPSSSLFAPANSSSYASLPCSSSWCPLFQGQACPAPQ 152

Query: 169 GSVD----------CLFNQTYGGDSSFSATLVQDTLHLGTDVIPNFSFGCISSASG--SS 228
           G  D          C F++ +  D+SF A L  DTL LG D IPN++FGC+SS +G  ++
Sbjct: 153 GGGDAAPPPATLPTCAFSKPF-ADASFQAALASDTLRLGKDAIPNYTFGCVSSVTGPTTN 212

Query: 229 IPPQGLLGLGRGPLSLISQSTSLYSGLFSYCLPSFKSYYFSGSLKLGP-VGQPKSIRTTP 288
           +P QGLLGLGRGP++L+SQ+ SLY+G+FSYCLPS++SYYFSGSL+LG   GQP+S+R TP
Sbjct: 213 MPRQGLLGLGRGPMALLSQAGSLYNGVFSYCLPSYRSYYFSGSLRLGAGGGQPRSVRYTP 272

Query: 289 LLKNPHRPSLYYVNLTGISVGRVLVPIPPETLAFDPNTGAGTIIDSGTVITRFVYPVYTA 348
           +L+NPHR SLYYVN+TG+SVG   V +P  + AFD  TGAGT++DSGTVITR+  PVY A
Sbjct: 273 MLRNPHRSSLYYVNVTGLSVGHAWVKVPAGSFAFDAATGAGTVVDSGTVITRWTAPVYAA 332

Query: 349 VRDEFRKQVG--GSFSPLGAFDTCFTTSNEMA--APGITFHL-SGLDLKLPMENSLIHSS 408
           +R+EFR+QV     ++ LGAFDTCF T    A  AP +T H+  G+DL LPMEN+LIHSS
Sbjct: 333 LREEFRRQVAAPSGYTSLGAFDTCFNTDEVAAGGAPAVTVHMDGGVDLALPMENTLIHSS 392

Query: 409 AGSLACLAMAAAPNNVNSVLNVIANLQQQNHRILFDIANSKLGIARELCN 438
           A  LACLAMA AP NVNSV+NVIANLQQQN R++FD+ANS++G A+E CN
Sbjct: 393 ATPLACLAMAEAPQNVNSVVNVIANLQQQNIRVVFDVANSRVGFAKESCN 438

BLAST of CmaCh05G005530 vs. ExPASy Swiss-Prot
Match: Q9LNJ3 (Aspartyl protease family protein 2 OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 SV=1)

HSP 1 Score: 202.2 bits (513), Expect = 1.2e-50
Identity = 138/394 (35.03%), Positives = 185/394 (46.95%), Query Frame = 0

Query: 64  KDPARIKYLSSLAAQ----KTVAAPIASGQHALNI-------GNYVVRVQLGTPGQAMYM 123
           +D  R+K +++LAAQ        AP   G  +  +       G Y  R+ +GTP + +YM
Sbjct: 98  RDSRRVKSIATLAAQIPGRNVTHAPRPGGFSSSVVSGLSQGSGEYFTRLGVGTPARYVYM 157

Query: 124 VLDTSSDAAW---APCSGCSGCSATTFLSKNSSTFATLDCSKPQCSQARGLSCPTTGSVD 183
           VLDT SD  W   APC  C   S   F  + S T+AT+ CS P C +     C T     
Sbjct: 158 VLDTGSDIVWLQCAPCRRCYSQSDPIFDPRKSKTYATIPCSSPHCRRLDSAGCNTRRKT- 217

Query: 184 CLFNQTYGGDSSFSATLVQDTLHLGTDVIPNFSFGCISSASGSSIPPQGLLGLGRGPLSL 243
           CL+  +YG  S        +TL    + +   + GC     G  +   GLLGLG+G LS 
Sbjct: 218 CLYQVSYGDGSFTVGDFSTETLTFRRNRVKGVALGCGHDNEGLFVGAAGLLGLGKGKLSF 277

Query: 244 ISQSTSLYSGLFSYCLPSFKSYYFSGSLKLGPVGQPKSIRTTPLLKNPHRPSLYYVNLTG 303
             Q+   ++  FSYCL    +     S+  G     +  R TPLL NP   + YYV L G
Sbjct: 278 PGQTGHRFNQKFSYCLVDRSASSKPSSVVFGNAAVSRIARFTPLLSNPKLDTFYYVGLLG 337

Query: 304 ISVGRVLVP-IPPETLAFDPNTGAGTIIDSGTVITRFVYPVYTAVRDEFRKQVGGS---- 363
           ISVG   VP +       D     G IIDSGT +TR + P Y A+RD FR  VG      
Sbjct: 338 ISVGGTRVPGVTASLFKLDQIGNGGVIIDSGTSVTRLIRPAYIAMRDAFR--VGAKTLKR 397

Query: 364 FSPLGAFDTCFTTS--NEMAAPGITFHLSGLDLKLPMENSLIHSSAGSLACLAMAAAPNN 423
                 FDTCF  S  NE+  P +  H  G D+ LP  N LI        C A A     
Sbjct: 398 APDFSLFDTCFDLSNMNEVKVPTVVLHFRGADVSLPATNYLIPVDTNGKFCFAFAGTMGG 457

Query: 424 VNSVLNVIANLQQQNHRILFDIANSKLGIARELC 437
               L++I N+QQQ  R+++D+A+S++G A   C
Sbjct: 458 ----LSIIGNIQQQGFRVVYDLASSRVGFAPGGC 484

BLAST of CmaCh05G005530 vs. ExPASy Swiss-Prot
Match: Q766C3 (Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 SV=1)

HSP 1 Score: 190.7 bits (483), Expect = 3.6e-47
Identity = 132/357 (36.97%), Positives = 182/357 (50.98%), Query Frame = 0

Query: 95  GNYVVRVQLGTPGQAMYMVLDTSSDAAWAPCSGCSGC---SATTFLSKNSSTFATLDCSK 154
           G Y++ + +GTP Q    ++DT SD  W  C  C+ C   S   F  + SS+F+TL CS 
Sbjct: 93  GEYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQPCTQCFNQSTPIFNPQGSSSFSTLPCSS 152

Query: 155 PQCSQARGLSCPTTGSVDCLFNQTYGGDSSFSATLVQDTLHLGTDVIPNFSFGCISSASG 214
             C   + LS PT  +  C +   YG  S    ++  +TL  G+  IPN +FGC  +  G
Sbjct: 153 QLC---QALSSPTCSNNFCQYTYGYGDGSETQGSMGTETLTFGSVSIPNITFGCGENNQG 212

Query: 215 -SSIPPQGLLGLGRGPLSLISQSTSLYSGLFSYCLPSFKSYYFSGSLKLGPVGQPKSIRT 274
                  GL+G+GRGPLSL SQ   L    FSYC+    S   S +L LG +    +  +
Sbjct: 213 FGQGNGAGLVGMGRGPLSLPSQ---LDVTKFSYCMTPIGSSTPS-NLLLGSLANSVTAGS 272

Query: 275 --TPLLKNPHRPSLYYVNLTGISVGRVLVPIPPETLAFDPNTG-AGTIIDSGTVITRFVY 334
             T L+++   P+ YY+ L G+SVG   +PI P   A + N G  G IIDSGT +T FV 
Sbjct: 273 PNTTLIQSSQIPTFYYITLNGLSVGSTRLPIDPSAFALNSNNGTGGIIIDSGTTLTYFVN 332

Query: 335 PVYTAVRDEFRKQ-----VGGSFSPLGAFDTCFTTSNE---MAAPGITFHLSGLDLKLPM 394
             Y +VR EF  Q     V GS S    FD CF T ++   +  P    H  G DL+LP 
Sbjct: 333 NAYQSVRQEFISQINLPVVNGSSS---GFDLCFQTPSDPSNLQIPTFVMHFDGGDLELPS 392

Query: 395 ENSLIHSSAGSLACLAMAAAPNNVNSVLNVIANLQQQNHRILFDIANSKLGIARELC 437
           EN  I  S G L CLAM ++       +++  N+QQQN  +++D  NS +  A   C
Sbjct: 393 ENYFISPSNG-LICLAMGSSSQG----MSIFGNIQQQNMLVVYDTGNSVVSFASAQC 434

BLAST of CmaCh05G005530 vs. ExPASy Swiss-Prot
Match: Q9LHE3 (Protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Arabidopsis thaliana OX=3702 GN=ASPG2 PE=2 SV=1)

HSP 1 Score: 182.2 bits (461), Expect = 1.3e-44
Identity = 114/350 (32.57%), Positives = 162/350 (46.29%), Query Frame = 0

Query: 95  GNYVVRVQLGTPGQAMYMVLDTSSDAAWAPCSGCSGC---SATTFLSKNSSTFATLDCSK 154
           G Y VR+ +G+P +  YMV+D+ SD  W  C  C  C   S   F    S ++  + C  
Sbjct: 129 GEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQPCKLCYKQSDPVFDPAKSGSYTGVSCGS 188

Query: 155 PQCSQARGLSCPTTGSVDCLFNQTYGGDSSFSATLVQDTLHLGTDVIPNFSFGCISSASG 214
             C +     C + G   C +   YG  S    TL  +TL     V+ N + GC     G
Sbjct: 189 SVCDRIENSGCHSGG---CRYEVMYGDGSYTKGTLALETLTFAKTVVRNVAMGCGHRNRG 248

Query: 215 SSIPPQGLLGLGRGPLSLISQSTSLYSGLFSYCLPSFKSYYFSGSLKLGPVGQPKSIRTT 274
             I   GLLG+G G +S + Q +    G F YCL S +    +GSL  G    P      
Sbjct: 249 MFIGAAGLLGIGGGSMSFVGQLSGQTGGAFGYCLVS-RGTDSTGSLVFGREALPVGASWV 308

Query: 275 PLLKNPHRPSLYYVNLTGISVGRVLVPIPPETLAFDPNTGAGTIIDSGTVITRFVYPVYT 334
           PL++NP  PS YYV L G+ VG V +P+P            G ++D+GT +TR     Y 
Sbjct: 309 PLVRNPRAPSFYYVGLKGLGVGGVRIPLPDGVFDLTETGDGGVVMDTGTAVTRLPTAAYV 368

Query: 335 AVRDEFRKQVGG--SFSPLGAFDTCFTTSN--EMAAPGITFHLS-GLDLKLPMENSLIHS 394
           A RD F+ Q       S +  FDTC+  S    +  P ++F+ + G  L LP  N L+  
Sbjct: 369 AFRDGFKSQTANLPRASGVSIFDTCYDLSGFVSVRVPTVSFYFTEGPVLTLPARNFLMPV 428

Query: 395 SAGSLACLAMAAAPNNVNSVLNVIANLQQQNHRILFDIANSKLGIARELC 437
                 C A AA+P      L++I N+QQ+  ++ FD AN  +G    +C
Sbjct: 429 DDSGTYCFAFAASPTG----LSIIGNIQQEGIQVSFDGANGFVGFGPNVC 470

BLAST of CmaCh05G005530 vs. TAIR 10
Match: AT1G09750.1 (Eukaryotic aspartyl protease family protein)

HSP 1 Score: 560.8 bits (1444), Expect = 9.4e-160
Identity = 296/446 (66.37%), Positives = 351/446 (78.70%), Query Frame = 0

Query: 2   AVKFFFFVFLAL-LALNSNASDLCA-AGSDGSGDLSVIPIYGKCSPFTAPK-SESWVNTV 61
           ++ FFFF+ L L     +   D CA A  DGS DLS+IPI  KCSPF     S S ++TV
Sbjct: 5   SLHFFFFLTLLLPFTFTTATRDTCATAAPDGSDDLSIIPINAKCSPFAPTHVSASVIDTV 64

Query: 62  IDMASKDPARIKYLSSLAA--QKTVAAPIASGQHALNIGNYVVRVQLGTPGQAMYMVLDT 121
           + MAS D  R+ YLSSL A   K  + P+ASG + L+IGNYVVR +LGTP Q M+MVLDT
Sbjct: 65  LHMASSDSHRLTYLSSLVAGKPKPTSVPVASG-NQLHIGNYVVRAKLGTPPQLMFMVLDT 124

Query: 122 SSDAAWAPCSGCSGCS--ATTFLSKNSSTFATLDCSKPQCSQARGLSCPTTGSVD--CLF 181
           S+DA W PCSGCSGCS  +T+F + +SST++T+ CS  QC+QARGL+CP++      C F
Sbjct: 125 SNDAVWLPCSGCSGCSNASTSFNTNSSSTYSTVSCSTAQCTQARGLTCPSSSPQPSVCSF 184

Query: 182 NQTYGGDSSFSATLVQDTLHLGTDVIPNFSFGCISSASGSSIPPQGLLGLGRGPLSLISQ 241
           NQ+YGGDSSFSA+LVQDTL L  DVIPNFSFGCI+SASG+S+PPQGL+GLGRGP+SL+SQ
Sbjct: 185 NQSYGGDSSFSASLVQDTLTLAPDVIPNFSFGCINSASGNSLPPQGLMGLGRGPMSLVSQ 244

Query: 242 STSLYSGLFSYCLPSFKSYYFSGSLKLGPVGQPKSIRTTPLLKNPHRPSLYYVNLTGISV 301
           +TSLYSG+FSYCLPSF+S+YFSGSLKLG +GQPKSIR TPLL+NP RPSLYYVNLTG+SV
Sbjct: 245 TTSLYSGVFSYCLPSFRSFYFSGSLKLGLLGQPKSIRYTPLLRNPRRPSLYYVNLTGVSV 304

Query: 302 GRVLVPIPPETLAFDPNTGAGTIIDSGTVITRFVYPVYTAVRDEFRKQVG-GSFSPLGAF 361
           G V VP+ P  L FD N+GAGTIIDSGTVITRF  PVY A+RDEFRKQV   SFS LGAF
Sbjct: 305 GSVQVPVDPVYLTFDANSGAGTIIDSGTVITRFAQPVYEAIRDEFRKQVNVSSFSTLGAF 364

Query: 362 DTCFTTSNEMAAPGITFHLSGLDLKLPMENSLIHSSAGSLACLAMAAAPNNVNSVLNVIA 421
           DTCF+  NE  AP IT H++ LDLKLPMEN+LIHSSAG+L CL+MA    N N+VLNVIA
Sbjct: 365 DTCFSADNENVAPKITLHMTSLDLKLPMENTLIHSSAGTLTCLSMAGIRQNANAVLNVIA 424

Query: 422 NLQQQNHRILFDIANSKLGIARELCN 438
           NLQQQN RILFD+ NS++GIA E CN
Sbjct: 425 NLQQQNLRILFDVPNSRIGIAPEPCN 449

BLAST of CmaCh05G005530 vs. TAIR 10
Match: AT3G54400.1 (Eukaryotic aspartyl protease family protein)

HSP 1 Score: 441.4 bits (1134), Expect = 8.3e-124
Identity = 245/430 (56.98%), Positives = 299/430 (69.53%), Query Frame = 0

Query: 9   VFLALLALNSNASDLCAAGSDGSGDLSVIPIYGKCSPFTAPKSESWVNTVIDMASKDPAR 68
           + ++LL L S + + C   S  S DL V  I   CSPF    S SW +T++    +D AR
Sbjct: 8   LLISLLILKSESIN-CNEKSH-SSDLRVFHINSLCSPFKT--SVSWADTLL----QDKAR 67

Query: 69  IKYLSSLAAQKTVAAPIASGQHALNIGNYVVRVQLGTPGQAMYMVLDTSSDAAWAPCSGC 128
             YLSSLA  +  + PIASG+  +    Y+VR  +GTP Q M + LDTS+DAAW PCSGC
Sbjct: 68  FLYLSSLAGVRKSSVPIASGRAIVQSPTYIVRANIGTPAQPMLVALDTSNDAAWIPCSGC 127

Query: 129 SGCSATT-FLSKNSSTFATLDCSKPQCSQARGLSCPTTGSVDCLFNQTYGGDSSFSATLV 188
            GCS++  F    SS+  TL C  PQC QA   SC  T S  C FN TYGG S+  A L 
Sbjct: 128 VGCSSSVLFDPSKSSSSRTLQCEAPQCKQAPNPSC--TVSKSCGFNMTYGG-STIEAYLT 187

Query: 189 QDTLHLGTDVIPNFSFGCISSASGSSIPPQGLLGLGRGPLSLISQSTSLYSGLFSYCLPS 248
           QDTL L +DVIPN++FGCI+ ASG+S+P QGL+GLGRGPLSLISQS +LY   FSYCLP+
Sbjct: 188 QDTLTLASDVIPNYTFGCINKASGTSLPAQGLMGLGRGPLSLISQSQNLYQSTFSYCLPN 247

Query: 249 FKSYYFSGSLKLGPVGQPKSIRTTPLLKNPHRPSLYYVNLTGISVGRVLVPIPPETLAFD 308
            KS  FSGSL+LGP  QP  I+TTPLLKNP R SLYYVNL GI VG  +V IP   LAFD
Sbjct: 248 SKSSNFSGSLRLGPKNQPIRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFD 307

Query: 309 PNTGAGTIIDSGTVITRFVYPVYTAVRDEFRKQV-GGSFSPLGAFDTCFTTSNEMAAPGI 368
           P TGAGTI DSGTV TR V P Y AVR+EFR++V   + + LG FDTC+  S  +  P +
Sbjct: 308 PATGAGTIFDSGTVYTRLVEPAYVAVRNEFRRRVKNANATSLGGFDTCY--SGSVVFPSV 367

Query: 369 TFHLSGLDLKLPMENSLIHSSAGSLACLAMAAAPNNVNSVLNVIANLQQQNHRILFDIAN 428
           TF  +G+++ LP +N LIHSSAG+L+CLAMAAAP NVNSVLNVIA++QQQNHR+L D+ N
Sbjct: 368 TFMFAGMNVTLPPDNLLIHSSAGNLSCLAMAAAPVNVNSVLNVIASMQQQNHRVLIDVPN 424

Query: 429 SKLGIARELC 437
           S+LGI+RE C
Sbjct: 428 SRLGISRETC 424

BLAST of CmaCh05G005530 vs. TAIR 10
Match: AT5G07030.1 (Eukaryotic aspartyl protease family protein)

HSP 1 Score: 417.5 bits (1072), Expect = 1.3e-116
Identity = 224/437 (51.26%), Positives = 289/437 (66.13%), Query Frame = 0

Query: 7   FFVFLALLALNSNASDLCAAGSDGSGDLSVIPIYGKCSPFTAPKSESWVNTVIDMASKDP 66
           F +    L LN    DL      GS  L +  I   CSPF +    SW   V+   ++D 
Sbjct: 27  FSILPLALGLNHPNCDLTKTQDQGS-TLRIFHIDSPCSPFKSSSPLSWEARVLQTLAQDQ 86

Query: 67  ARIKYLSSLAAQKTVAAPIASGQHALNIGNYVVRVQLGTPGQAMYMVLDTSSDAAWAPCS 126
           AR++YLSSL A ++V  PIASG+  L    Y+V+  +GTP Q + + +DTSSD AW PCS
Sbjct: 87  ARLQYLSSLVAGRSV-VPIASGRQMLQSTTYIVKALIGTPAQPLLLAMDTSSDVAWIPCS 146

Query: 127 GCSGC-SATTFLSKNSSTFATLDCSKPQCSQARGLSCPTTGSVDCLFNQTYGGDSSFSAT 186
           GC GC S T F    S++F  + CS PQC Q      PT G+  C FN TY G SS +A 
Sbjct: 147 GCVGCPSNTAFSPAKSTSFKNVSCSAPQCKQVPN---PTCGARACSFNLTY-GSSSIAAN 206

Query: 187 LVQDTLHLGTDVIPNFSFGCISSASGSSI--PPQGLLGLGRGPLSLISQSTSLYSGLFSY 246
           L QDT+ L  D I  F+FGC++  +G     PPQGLLGLGRGPLSL+SQ+ S+Y   FSY
Sbjct: 207 LSQDTIRLAADPIKAFTFGCVNKVAGGGTIPPPQGLLGLGRGPLSLMSQAQSIYKSTFSY 266

Query: 247 CLPSFKSYYFSGSLKLGPVGQPKSIRTTPLLKNPHRPSLYYVNLTGISVGRVLVPIPPET 306
           CLPSF+S  FSGSL+LGP  QP+ ++ T LL+NP R SLYYVNL  I VGR +V +PP  
Sbjct: 267 CLPSFRSLTFSGSLRLGPTSQPQRVKYTQLLRNPRRSSLYYVNLVAIRVGRKVVDLPPAA 326

Query: 307 LAFDPNTGAGTIIDSGTVITRFVYPVYTAVRDEFRKQVGGS---FSPLGAFDTCFTTSNE 366
           +AF+P+TGAGTI DSGTV TR   PVY AVR+EFRK+V  +    + LG FDTC+  S +
Sbjct: 327 IAFNPSTGAGTIFDSGTVYTRLAKPVYEAVRNEFRKRVKPTTAVVTSLGGFDTCY--SGQ 386

Query: 367 MAAPGITFHLSGLDLKLPMENSLIHSSAGSLACLAMAAAPNNVNSVLNVIANLQQQNHRI 426
           +  P ITF   G+++ +P +N ++HS+AGS +CLAMAAAP NVNSV+NVIA++QQQNHR+
Sbjct: 387 VKVPTITFMFKGVNMTMPADNLMLHSTAGSTSCLAMAAAPENVNSVVNVIASMQQQNHRV 446

Query: 427 LFDIANSKLGIARELCN 438
           L D+ N +LG+ARE C+
Sbjct: 447 LIDVPNGRLGLARERCS 455

BLAST of CmaCh05G005530 vs. TAIR 10
Match: AT1G01300.1 (Eukaryotic aspartyl protease family protein)

HSP 1 Score: 202.2 bits (513), Expect = 8.5e-52
Identity = 138/394 (35.03%), Positives = 185/394 (46.95%), Query Frame = 0

Query: 64  KDPARIKYLSSLAAQ----KTVAAPIASGQHALNI-------GNYVVRVQLGTPGQAMYM 123
           +D  R+K +++LAAQ        AP   G  +  +       G Y  R+ +GTP + +YM
Sbjct: 98  RDSRRVKSIATLAAQIPGRNVTHAPRPGGFSSSVVSGLSQGSGEYFTRLGVGTPARYVYM 157

Query: 124 VLDTSSDAAW---APCSGCSGCSATTFLSKNSSTFATLDCSKPQCSQARGLSCPTTGSVD 183
           VLDT SD  W   APC  C   S   F  + S T+AT+ CS P C +     C T     
Sbjct: 158 VLDTGSDIVWLQCAPCRRCYSQSDPIFDPRKSKTYATIPCSSPHCRRLDSAGCNTRRKT- 217

Query: 184 CLFNQTYGGDSSFSATLVQDTLHLGTDVIPNFSFGCISSASGSSIPPQGLLGLGRGPLSL 243
           CL+  +YG  S        +TL    + +   + GC     G  +   GLLGLG+G LS 
Sbjct: 218 CLYQVSYGDGSFTVGDFSTETLTFRRNRVKGVALGCGHDNEGLFVGAAGLLGLGKGKLSF 277

Query: 244 ISQSTSLYSGLFSYCLPSFKSYYFSGSLKLGPVGQPKSIRTTPLLKNPHRPSLYYVNLTG 303
             Q+   ++  FSYCL    +     S+  G     +  R TPLL NP   + YYV L G
Sbjct: 278 PGQTGHRFNQKFSYCLVDRSASSKPSSVVFGNAAVSRIARFTPLLSNPKLDTFYYVGLLG 337

Query: 304 ISVGRVLVP-IPPETLAFDPNTGAGTIIDSGTVITRFVYPVYTAVRDEFRKQVGGS---- 363
           ISVG   VP +       D     G IIDSGT +TR + P Y A+RD FR  VG      
Sbjct: 338 ISVGGTRVPGVTASLFKLDQIGNGGVIIDSGTSVTRLIRPAYIAMRDAFR--VGAKTLKR 397

Query: 364 FSPLGAFDTCFTTS--NEMAAPGITFHLSGLDLKLPMENSLIHSSAGSLACLAMAAAPNN 423
                 FDTCF  S  NE+  P +  H  G D+ LP  N LI        C A A     
Sbjct: 398 APDFSLFDTCFDLSNMNEVKVPTVVLHFRGADVSLPATNYLIPVDTNGKFCFAFAGTMGG 457

Query: 424 VNSVLNVIANLQQQNHRILFDIANSKLGIARELC 437
               L++I N+QQQ  R+++D+A+S++G A   C
Sbjct: 458 ----LSIIGNIQQQGFRVVYDLASSRVGFAPGGC 484

BLAST of CmaCh05G005530 vs. TAIR 10
Match: AT3G61820.1 (Eukaryotic aspartyl protease family protein)

HSP 1 Score: 196.4 bits (498), Expect = 4.7e-50
Identity = 140/402 (34.83%), Positives = 186/402 (46.27%), Query Frame = 0

Query: 64  KDPARIKYLSSLAAQKT--------------VAAPIASGQHALNIGNYVVRVQLGTPGQA 123
           +D  R+K ++SLAA  T               +  + SG  +   G Y +R+ +GTP   
Sbjct: 89  RDSLRVKSITSLAAVSTGRNATKRTPRTAGGFSGAVISGL-SQGSGEYFMRLGVGTPATN 148

Query: 124 MYMVLDTSSDAAWAPCSGCSGCSATT---FLSKNSSTFATLDCSKPQCSQARGLS-CPTT 183
           +YMVLDT SD  W  CS C  C   T   F  K S TFAT+ C    C +    S C T 
Sbjct: 149 VYMVLDTGSDVVWLQCSPCKACYNQTDAIFDPKKSKTFATVPCGSRLCRRLDDSSECVTR 208

Query: 184 GSVDCLFNQTYGGDSSFSATLVQDTLHLGTDVIPNFSFGCISSASGSSIPPQGLLGLGRG 243
            S  CL+  +YG  S        +TL      + +   GC     G  +   GLLGLGRG
Sbjct: 209 RSKTCLYQVSYGDGSFTEGDFSTETLTFHGARVDHVPLGCGHDNEGLFVGAAGLLGLGRG 268

Query: 244 PLSLISQSTSLYSGLFSYCL----PSFKSYYFSGSLKLGPVGQPKSIRTTPLLKNPHRPS 303
            LS  SQ+ + Y+G FSYCL     S  S     ++  G    PK+   TPLL NP   +
Sbjct: 269 GLSFPSQTKNRYNGKFSYCLVDRTSSGSSSKPPSTIVFGNAAVPKTSVFTPLLTNPKLDT 328

Query: 304 LYYVNLTGISVGRVLVP-IPPETLAFDPNTGAGTIIDSGTVITRFVYPVYTAVRDEFRKQ 363
            YY+ L GISVG   VP +       D     G IIDSGT +TR   P Y A+RD FR  
Sbjct: 329 FYYLQLLGISVGGSRVPGVSESQFKLDATGNGGVIIDSGTSVTRLTQPAYVALRDAFR-- 388

Query: 364 VGGS----FSPLGAFDTCFTTS--NEMAAPGITFHLSGLDLKLPMENSLIHSSAGSLACL 423
           +G +          FDTCF  S    +  P + FH  G ++ LP  N LI  +     C 
Sbjct: 389 LGATKLKRAPSYSLFDTCFDLSGMTTVKVPTVVFHFGGGEVSLPASNYLIPVNTEGRFCF 448

Query: 424 AMAAAPNNVNSVLNVIANLQQQNHRILFDIANSKLGIARELC 437
           A A    +    L++I N+QQQ  R+ +D+  S++G     C
Sbjct: 449 AFAGTMGS----LSIIGNIQQQGFRVAYDLVGSRVGFLSRAC 483

The following BLAST results are available for this feature:
ExPASy Swiss-Prot
Match Name | E-value | Identity (%) | Description
O04496 | 1.3e-158 | 66.37 | Aspartyl protease AED3 OS=Arabidopsis thaliana OX=3702 GN=AED3 PE=1 SV=1
Q6F4N5 | 2.0e-114 | 55.12 | Aspartyl protease 25 OS=Oryza sativa subsp. japonica OX=39947 GN=AP25 PE=2 SV=1
Q9LNJ3 | 1.2e-50 | 35.03 | Aspartyl protease family protein 2 OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 ...
Q766C3 | 3.6e-47 | 36.97 | Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 S...
Q9LHE3 | 1.3e-44 | 32.57 | Protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Arabidopsis thaliana OX=3702 GN=ASP...

TAIR 10
Match Name | E-value | Identity (%) | Description
AT1G09750.1 | 9.4e-160 | 66.37 | Eukaryotic aspartyl protease family protein
AT3G54400.1 | 8.3e-124 | 56.98 | Eukaryotic aspartyl protease family protein
AT5G07030.1 | 1.3e-116 | 51.26 | Eukaryotic aspartyl protease family protein
AT1G01300.1 | 8.5e-52 | 35.03 | Eukaryotic aspartyl protease family protein
AT3G61820.1 | 4.7e-50 | 34.83 | Eukaryotic aspartyl protease family protein
InterPro
Analysis Name: InterPro Annotations of Cucurbita maxima (Rimu) v1.1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR021109 | Aspartic peptidase domain superfamily | GENE3D | 2.40.70.10 | Acid Proteases | coord: 261..437; e-value: 1.2E-45; score: 157.4
IPR021109 | Aspartic peptidase domain superfamily | GENE3D | 2.40.70.10 | Acid Proteases | coord: 72..260; e-value: 8.3E-36; score: 125.7
IPR021109 | Aspartic peptidase domain superfamily | SUPERFAMILY | 50630 | Acid proteases | coord: 93..436
IPR032799 | Xylanase inhibitor, C-terminal | PFAM | PF14541 | TAXi_C | coord: 283..432; e-value: 2.2E-31; score: 108.8
IPR032861 | Xylanase inhibitor, N-terminal | PFAM | PF14543 | TAXi_N | coord: 97..260; e-value: 4.0E-34; score: 118.4
None | No IPR available | PANTHER | PTHR13683:SF839 | ASPARTYL PROTEASE AED3-LIKE | coord: 8..437
IPR001461 | Aspartic peptidase A1 family | PANTHER | PTHR13683 | ASPARTYL PROTEASES | coord: 8..437
IPR033121 | Peptidase family A1 domain | PROSITE | PS51767 | PEPTIDASE_A1 | coord: 97..432; score: 36.335335

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
CmaCh05G005530.1 | CmaCh05G005530.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0006508 | proteolysis
biological_process | GO:0043067 | regulation of programmed cell death
molecular_function | GO:0004190 | aspartic-type endopeptidase activity