MC03g0574 (gene) Bitter gourd (Dali-11) v1

Overview
NameMC03g0574
Typegene
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionEukaryotic aspartyl protease family protein
LocationMC03: 12495774 .. 12497333 (-)
RNA-Seq ExpressionMC03g0574
SyntenyMC03g0574
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
TCATCAGCCACCACTATTTCATCTAAAGTTTTGACCTTTTTCCTTCTTCTAATTCTACTTTTGAGTGTCTCAGAAACAGCTGCAAGGCCTCACAGAAACAATTTCCCCAACACAGATTCTTTAGTTCTTGGCCTTGTTCATTCAAGAACTTCCCTCCTCACTCCCAAAAGGGGCTACTTTTCAAGGAAAGGATCATCATCATCAATCAACAAGCCAATGGAGGAAATTGGCAGTGATAATGTGATAGAGCCATTGAGGGAAATTAGGGATGGGTACCTAATTTCTCTCACATTAGGGACACCCCCACAAGTGATTCAAGTGTATATGGACACTGGAAGTGACCTCACATGGGTTCCTTGTGGGAACCTCTCTTTTGATTGCCAAGATTGTGAGGAGTATCAAAACAATGTCTCTGGCCCAAAGTTGGCTCCTTTTTCCCCAACCCATTCTTCAACCTCCATTAGGGATACTTGTGGCAGCTCCTTTTGCATGGATATCCACAGCTCTGATAACCCTTTTGACCCATGCACAATTGCAGGCTGTTCCCTTGCTACCCTTGTGAAGGGCACTTGCCCTAGGCCATGCCCTTCATTTGCCTACACTTATGGGGCAAGTGGGGTTGTAACTGGAACCCTAACAAAAGATGTCATTTTGATGCATGGAGTGTCCCCAAATTCCACTACCCAAATCCCTAGGTTTTGCTTTGGTTGTGTTGGTGCAACTTATAGAGAGCCAATTGGAATTGCTGGCTTTGGGAGAGGATTGCTCTCTCTCCCTTCACAATTAGGGTTTTCCCATAAGGGTTTCTCCCATTGCTTCTTGCCCTTTAAGTTCTCAAATAACCCTAATTTCTCAAGCCCTTTGATCCTTGGGAGCCTTGCCATTTCTTCCAAAGATCATAGCTTGCAATTCACCCCTTTGTTGAAAAGTCCTCTTTACCCTAACTACTACTACATTGGTCTTGAGTCAATTACCATTGGGGATGGTATTGGCAATAATAATTCTAGATTTGGGGTTTCTTTGAAGTTGAGGGAAATTGACACAAAGGGGAATGGGGGAATGTTGATTGATTCTGGTACAACATATACTCACTTGCCAGAACCATTGTATTCACAACTTATCTCTAATCTCGAGTCGGTTGTGACGTATCCTAGAGCTAAACAAGTAGAAATCAATACTGGGTTTGATCTTTGTTACAAAATCCCTTGTAAAAACAACACTAATTTTTCTTCTTCTATGGATGATTATTGTTCCAATCTTCTGCCTTCTATAACATTCCATTTTTTGAACAACGTGAGTGTGGTTCTGCCCCAAGGGAACAATTTCTATGCCATGGCTGCTCCTACCAACTCCACTGTGGTGAAATGCTTGCTTTTCCAAAGCATGGATGGCGGGGGCGGGGACGGAGACGGGAATGGGGACGACGACGGGCCGGCAGGCATTTTTGGGAGCTTCCAACAGCAAAACATGGAGGTTGTTTATGACTTGCAGAAGGAGAGAATAGGGTTTCAGCCAATGGACTGTGCTTCTTCTGCTGCCTCTCAAGGACTCCACAAGAAT

mRNA sequence

TCATCAGCCACCACTATTTCATCTAAAGTTTTGACCTTTTTCCTTCTTCTAATTCTACTTTTGAGTGTCTCAGAAACAGCTGCAAGGCCTCACAGAAACAATTTCCCCAACACAGATTCTTTAGTTCTTGGCCTTGTTCATTCAAGAACTTCCCTCCTCACTCCCAAAAGGGGCTACTTTTCAAGGAAAGGATCATCATCATCAATCAACAAGCCAATGGAGGAAATTGGCAGTGATAATGTGATAGAGCCATTGAGGGAAATTAGGGATGGGTACCTAATTTCTCTCACATTAGGGACACCCCCACAAGTGATTCAAGTGTATATGGACACTGGAAGTGACCTCACATGGGTTCCTTGTGGGAACCTCTCTTTTGATTGCCAAGATTGTGAGGAGTATCAAAACAATGTCTCTGGCCCAAAGTTGGCTCCTTTTTCCCCAACCCATTCTTCAACCTCCATTAGGGATACTTGTGGCAGCTCCTTTTGCATGGATATCCACAGCTCTGATAACCCTTTTGACCCATGCACAATTGCAGGCTGTTCCCTTGCTACCCTTGTGAAGGGCACTTGCCCTAGGCCATGCCCTTCATTTGCCTACACTTATGGGGCAAGTGGGGTTGTAACTGGAACCCTAACAAAAGATGTCATTTTGATGCATGGAGTGTCCCCAAATTCCACTACCCAAATCCCTAGGTTTTGCTTTGGTTGTGTTGGTGCAACTTATAGAGAGCCAATTGGAATTGCTGGCTTTGGGAGAGGATTGCTCTCTCTCCCTTCACAATTAGGGTTTTCCCATAAGGGTTTCTCCCATTGCTTCTTGCCCTTTAAGTTCTCAAATAACCCTAATTTCTCAAGCCCTTTGATCCTTGGGAGCCTTGCCATTTCTTCCAAAGATCATAGCTTGCAATTCACCCCTTTGTTGAAAAGTCCTCTTTACCCTAACTACTACTACATTGGTCTTGAGTCAATTACCATTGGGGATGGTATTGGCAATAATAATTCTAGATTTGGGGTTTCTTTGAAGTTGAGGGAAATTGACACAAAGGGGAATGGGGGAATGTTGATTGATTCTGGTACAACATATACTCACTTGCCAGAACCATTGTATTCACAACTTATCTCTAATCTCGAGTCGGTTGTGACGTATCCTAGAGCTAAACAAGTAGAAATCAATACTGGGTTTGATCTTTGTTACAAAATCCCTTGTAAAAACAACACTAATTTTTCTTCTTCTATGGATGATTATTGTTCCAATCTTCTGCCTTCTATAACATTCCATTTTTTGAACAACGTGAGTGTGGTTCTGCCCCAAGGGAACAATTTCTATGCCATGGCTGCTCCTACCAACTCCACTGTGGTGAAATGCTTGCTTTTCCAAAGCATGGATGGCGGGGGCGGGGACGGAGACGGGAATGGGGACGACGACGGGCCGGCAGGCATTTTTGGGAGCTTCCAACAGCAAAACATGGAGGTTGTTTATGACTTGCAGAAGGAGAGAATAGGGTTTCAGCCAATGGACTGTGCTTCTTCTGCTGCCTCTCAAGGACTCCACAAGAAT

Coding sequence (CDS)

TCATCAGCCACCACTATTTCATCTAAAGTTTTGACCTTTTTCCTTCTTCTAATTCTACTTTTGAGTGTCTCAGAAACAGCTGCAAGGCCTCACAGAAACAATTTCCCCAACACAGATTCTTTAGTTCTTGGCCTTGTTCATTCAAGAACTTCCCTCCTCACTCCCAAAAGGGGCTACTTTTCAAGGAAAGGATCATCATCATCAATCAACAAGCCAATGGAGGAAATTGGCAGTGATAATGTGATAGAGCCATTGAGGGAAATTAGGGATGGGTACCTAATTTCTCTCACATTAGGGACACCCCCACAAGTGATTCAAGTGTATATGGACACTGGAAGTGACCTCACATGGGTTCCTTGTGGGAACCTCTCTTTTGATTGCCAAGATTGTGAGGAGTATCAAAACAATGTCTCTGGCCCAAAGTTGGCTCCTTTTTCCCCAACCCATTCTTCAACCTCCATTAGGGATACTTGTGGCAGCTCCTTTTGCATGGATATCCACAGCTCTGATAACCCTTTTGACCCATGCACAATTGCAGGCTGTTCCCTTGCTACCCTTGTGAAGGGCACTTGCCCTAGGCCATGCCCTTCATTTGCCTACACTTATGGGGCAAGTGGGGTTGTAACTGGAACCCTAACAAAAGATGTCATTTTGATGCATGGAGTGTCCCCAAATTCCACTACCCAAATCCCTAGGTTTTGCTTTGGTTGTGTTGGTGCAACTTATAGAGAGCCAATTGGAATTGCTGGCTTTGGGAGAGGATTGCTCTCTCTCCCTTCACAATTAGGGTTTTCCCATAAGGGTTTCTCCCATTGCTTCTTGCCCTTTAAGTTCTCAAATAACCCTAATTTCTCAAGCCCTTTGATCCTTGGGAGCCTTGCCATTTCTTCCAAAGATCATAGCTTGCAATTCACCCCTTTGTTGAAAAGTCCTCTTTACCCTAACTACTACTACATTGGTCTTGAGTCAATTACCATTGGGGATGGTATTGGCAATAATAATTCTAGATTTGGGGTTTCTTTGAAGTTGAGGGAAATTGACACAAAGGGGAATGGGGGAATGTTGATTGATTCTGGTACAACATATACTCACTTGCCAGAACCATTGTATTCACAACTTATCTCTAATCTCGAGTCGGTTGTGACGTATCCTAGAGCTAAACAAGTAGAAATCAATACTGGGTTTGATCTTTGTTACAAAATCCCTTGTAAAAACAACACTAATTTTTCTTCTTCTATGGATGATTATTGTTCCAATCTTCTGCCTTCTATAACATTCCATTTTTTGAACAACGTGAGTGTGGTTCTGCCCCAAGGGAACAATTTCTATGCCATGGCTGCTCCTACCAACTCCACTGTGGTGAAATGCTTGCTTTTCCAAAGCATGGATGGCGGGGGCGGGGACGGAGACGGGAATGGGGACGACGACGGGCCGGCAGGCATTTTTGGGAGCTTCCAACAGCAAAACATGGAGGTTGTTTATGACTTGCAGAAGGAGAGAATAGGGTTTCAGCCAATGGACTGTGCTTCTTCTGCTGCCTCTCAAGGACTCCACAAGAAT

Protein sequence

SSATTISSKVLTFFLLLILLLSVSETAARPHRNNFPNTDSLVLGLVHSRTSLLTPKRGYFSRKGSSSSINKPMEEIGSDNVIEPLREIRDGYLISLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAPFSPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAYTYGASGVVTGTLTKDVILMHGVSPNSTTQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGSLAISSKDHSLQFTPLLKSPLYPNYYYIGLESITIGDGIGNNNSRFGVSLKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVVTYPRAKQVEINTGFDLCYKIPCKNNTNFSSSMDDYCSNLLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGGGGDGDGNGDDDGPAGIFGSFQQQNMEVVYDLQKERIGFQPMDCASSAASQGLHKN
Homology
BLAST of MC03g0574 vs. ExPASy Swiss-Prot
Match: Q940R4 (Probable aspartyl protease At4g16563 OS=Arabidopsis thaliana OX=3702 GN=At4g16563 PE=2 SV=1)

HSP 1 Score: 212.6 bits (540), Expect = 1.1e-53
Identity = 156/452 (34.51%), Postives = 213/452 (47.12%), Query Frame = 0

Query: 92  YLISLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAPFSPTHSS 151
           YLISL++G+    + +Y+DTGSDL W PC    F C  CE    +   P   P S + S+
Sbjct: 83  YLISLSVGSSSSAVSLYLDTGSDLVWFPC--RPFTCILCE----SKPLPPSPPSSLSSSA 142

Query: 152 TSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTC---PRPCPSFAYTYGASGVV 211
           T++  +C S  C   HSS    D C I+ C L  +  G C     PCP F Y YG  G +
Sbjct: 143 TTV--SCSSPSCSAAHSSLPSSDLCAISNCPLDFIETGDCNTSSYPCPPFYYAYG-DGSL 202

Query: 212 TGTLTKDVILMHGVSPNSTTQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGF--SH 271
              L  D + +  VS      +  F FGC   T  EPIG+AGFGRG LSLP+QL     H
Sbjct: 203 VAKLYSDSLSLPSVS------VSNFTFGCAHTTLAEPIGVAGFGRGRLSLPAQLAVHSPH 262

Query: 272 KG--FSHCFLPFKF-SNNPNFSSPLILGSLA------------------ISSKDHSLQFT 331
            G  FS+C +   F S+     SPLILG                        K +   FT
Sbjct: 263 LGNSFSYCLVSHSFDSDRVRRPSPLILGRFVDKKEKRVGTTDDHDDGDDEKKKKNEFVFT 322

Query: 332 PLLKSPLYPNYYYIGLESITIGDGIGNNNSRFGVSLKLREIDTKGNGGMLIDSGTTYTHL 391
            +L++P +P +Y + L+ I+IG               LR ID  G GG+++DSGTT+T L
Sbjct: 323 EMLENPKHPYFYSVSLQGISIG------KRNIPAPAMLRRIDKNGGGGVVVDSGTTFTML 382

Query: 392 PEPLYSQLISNLESVV--TYPRAKQVEINTGFDLCYKIPCKNNTNFSSSMDDYCSNLLPS 451
           P   Y+ ++   +S V   + RA +VE ++G   CY +   N T             +P+
Sbjct: 383 PAKFYNSVVEEFDSRVGRVHERADRVEPSSGMSPCYYL---NQT-----------VKVPA 442

Query: 452 ITFHFL-NNVSVVLPQGNNFYAMA----APTNSTVVKCLLFQSMDGGGGDGDGNGDDDGP 511
           +  HF  N  SV LP+ N FY              + CL+  +   GG + +  G   G 
Sbjct: 443 LVLHFAGNRSSVTLPRRNYFYEFMDGGDGKEEKRKIGCLMLMN---GGDESELRG---GT 493

BLAST of MC03g0574 vs. ExPASy Swiss-Prot
Match: Q766C2 (Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 SV=1)

HSP 1 Score: 142.9 bits (359), Expect = 1.0e-32
Identity = 127/449 (28.29%), Postives = 179/449 (39.87%), Query Frame = 0

Query: 68  SINKPMEEIGSDNVIEPLREIRDGYLISLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDC 127
           SIN  ++   S  +  P+      YL+++ +GTP       MDTGSDL W  C      C
Sbjct: 74  SINAMLQ--SSSGIETPVYAGDGEYLMNVAIGTPDSSFSAIMDTGSDLIWTQCE----PC 133

Query: 128 QDCEEYQNNVSGPKLAPFSPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLV 187
             C      +       F+P  SS+     C S +C D+     P + C    C      
Sbjct: 134 TQCFSQPTPI-------FNPQDSSSFSTLPCESQYCQDL-----PSETCNNNEC------ 193

Query: 188 KGTCPRPCPSFAYTYGASGVVTGTLTKDVILMHGVSPNSTTQIPRFCFGC----VGATYR 247
                     + Y YG      G +  +           T+ +P   FGC     G    
Sbjct: 194 ---------QYTYGYGDGSTTQGYMATETFTF------ETSSVPNIAFGCGEDNQGFGQG 253

Query: 248 EPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGSLAISSKDHSLQ 307
              G+ G G G LSLPSQLG     FS+C   +  S+     S L LGS A S       
Sbjct: 254 NGAGLIGMGWGPLSLPSQLGVGQ--FSYCMTSYGSSS----PSTLALGS-AASGVPEGSP 313

Query: 308 FTPLLKSPLYPNYYYIGLESITI-GDGIGNNNSRFGVSLKLREIDTKGNGGMLIDSGTTY 367
            T L+ S L P YYYI L+ IT+ GD +G  +S F       ++   G GGM+IDSGTT 
Sbjct: 314 STTLIHSSLNPTYYYITLQGITVGGDNLGIPSSTF-------QLQDDGTGGMIIDSGTTL 373

Query: 368 THLPEPLYSQLISNLESVVTYPRAKQVEINTGFDLCYKIPCKNNTNFSSSMDDYCSNLLP 427
           T+LP+  Y+ +       +  P     E ++G   C++ P   +T             +P
Sbjct: 374 TYLPQDAYNAVAQAFTDQINLPTVD--ESSSGLSTCFQQPSDGST-----------VQVP 433

Query: 428 SITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGGGGDGDGNGDDDGPAGIF 487
            I+  F   V  +  Q      + +P    +  CL   S    G              IF
Sbjct: 434 EISMQFDGGVLNLGEQN----ILISPAEGVI--CLAMGSSSQLG------------ISIF 438

Query: 488 GSFQQQNMEVVYDLQKERIGFQPMDCASS 512
           G+ QQQ  +V+YDLQ   + F P  C +S
Sbjct: 494 GNIQQQETQVLYDLQNLAVSFVPTQCGAS 438

BLAST of MC03g0574 vs. ExPASy Swiss-Prot
Match: Q766C3 (Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 SV=1)

HSP 1 Score: 139.8 bits (351), Expect = 8.7e-32
Identity = 126/425 (29.65%), Postives = 173/425 (40.71%), Query Frame = 0

Query: 92  YLISLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAPFSPTHSS 151
           YL++L++GTP Q     MDTGSDL W         CQ C +  N         F+P  SS
Sbjct: 95  YLMNLSIGTPAQPFSAIMDTGSDLIWT-------QCQPCTQCFNQ----STPIFNPQGSS 154

Query: 152 TSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAYTYGASGVVTGT 211
           +     C S  C  + S                     TC      + Y YG      G+
Sbjct: 155 SFSTLPCSSQLCQALSSP--------------------TCSNNFCQYTYGYGDGSETQGS 214

Query: 212 LTKDVILMHGVSPNSTTQIPRFCFGC----VGATYREPIGIAGFGRGLLSLPSQLGFSHK 271
           +  + +    VS      IP   FGC     G       G+ G GRG LSLPSQL  +  
Sbjct: 215 MGTETLTFGSVS------IPNITFGCGENNQGFGQGNGAGLVGMGRGPLSLPSQLDVTK- 274

Query: 272 GFSHCFLPFKFSNNPNFSSPLILGSLAISSKDHSLQFTPLLKSPLYPNYYYIGLESITIG 331
            FS+C  P   S   N    L+LGSLA S    S   T L++S   P +YYI L  +++G
Sbjct: 275 -FSYCMTPIGSSTPSN----LLLGSLANSVTAGSPN-TTLIQSSQIPTFYYITLNGLSVG 334

Query: 332 D-GIGNNNSRFGVSLKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVVTYPRA 391
              +  + S F ++         G GG++IDSGTT T+     Y  +     S +  P  
Sbjct: 335 STRLPIDPSAFALN------SNNGTGGIIIDSGTTLTYFVNNAYQSVRQEFISQINLPVV 394

Query: 392 KQVEINTGFDLCYKIPCKNNTNFSSSMDDYCSNLLPSITFHFLNNVSVVLPQGNNFYAMA 451
                ++GFDLC++ P            D  +  +P+   HF +   + LP  N F    
Sbjct: 395 N--GSSSGFDLCFQTP-----------SDPSNLQIPTFVMHF-DGGDLELPSENYF---I 437

Query: 452 APTNSTVVKCLLFQSMDGGGGDGDGNGDDDGPAGIFGSFQQQNMEVVYDLQKERIGFQPM 511
           +P+N  +  CL   S   G               IFG+ QQQNM VVYD     + F   
Sbjct: 455 SPSNGLI--CLAMGSSSQG-------------MSIFGNIQQQNMLVVYDTGNSVVSFASA 437

BLAST of MC03g0574 vs. ExPASy Swiss-Prot
Match: Q9LNJ3 (Aspartyl protease family protein 2 OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 SV=1)

HSP 1 Score: 138.7 bits (348), Expect = 1.9e-31
Identity = 124/443 (27.99%), Postives = 179/443 (40.41%), Query Frame = 0

Query: 78  SDNVIEPLREIRDGYLISLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNV 137
           S +V+  L +    Y   L +GTP + + + +DTGSD+ W+ C      C+ C    + +
Sbjct: 128 SSSVVSGLSQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCA----PCRRCYSQSDPI 187

Query: 138 SGPKLAPFSPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPS 197
                  F P  S T     C S  C  + S          AGC+     + TC      
Sbjct: 188 -------FDPRKSKTYATIPCSSPHCRRLDS----------AGCNTR---RKTC-----L 247

Query: 198 FAYTYGASGVVTGTLTKDVILMHGVSPNSTTQIPRFCFGC--------VGATYREPIGIA 257
           +  +YG      G  + + +           ++     GC        VGA      G+ 
Sbjct: 248 YQVSYGDGSFTVGDFSTETLTFR------RNRVKGVALGCGHDNEGLFVGAA-----GLL 307

Query: 258 GFGRGLLSLPSQLG--FSHKGFSHCFLPFKFSNNPNFSSPLILGSLAISSKDHSLQFTPL 317
           G G+G LS P Q G  F+ K FS+C +    S+ P   S ++ G+ A+S      +FTPL
Sbjct: 308 GLGKGKLSFPGQTGHRFNQK-FSYCLVDRSASSKP---SSVVFGNAAVS---RIARFTPL 367

Query: 318 LKSPLYPNYYYIGLESITIGDGIGNNNSRFGVSLKLREIDTKGNGGMLIDSGTTYTHLPE 377
           L +P    +YY+GL  I++G          GV+  L ++D  GNGG++IDSGT+ T L  
Sbjct: 368 LSNPKLDTFYYVGLLGISVG-----GTRVPGVTASLFKLDQIGNGGVIIDSGTSVTRLIR 427

Query: 378 PLYSQLISNLE-SVVTYPRAKQVEINTGFDLCYKIPCKNNTNFSSSMDDYCSNLLPSITF 437
           P Y  +         T  RA    +   FD C+ +   N               +P++  
Sbjct: 428 PAYIAMRDAFRVGAKTLKRAPDFSL---FDTCFDLSNMNEVK------------VPTVVL 485

Query: 438 HFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGGGGDGDGNGDDDGPAGIFGSFQ 497
           HF     V LP  N  Y +   TN     C  F    GG               I G+ Q
Sbjct: 488 HF-RGADVSLPATN--YLIPVDTNGKF--CFAFAGTMGG-------------LSIIGNIQ 485

Query: 498 QQNMEVVYDLQKERIGFQPMDCA 510
           QQ   VVYDL   R+GF P  CA
Sbjct: 548 QQGFRVVYDLASSRVGFAPGGCA 485

BLAST of MC03g0574 vs. ExPASy Swiss-Prot
Match: O04496 (Aspartyl protease AED3 OS=Arabidopsis thaliana OX=3702 GN=AED3 PE=1 SV=1)

HSP 1 Score: 129.4 bits (324), Expect = 1.2e-28
Identity = 116/434 (26.73%), Postives = 183/434 (42.17%), Query Frame = 0

Query: 92  YLISLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAPFSPTHSS 151
           Y++   LGTPPQ++ + +DT +D  W+PC      C  C    +N S      F+   SS
Sbjct: 104 YVVRAKLGTPPQLMFMVLDTSNDAVWLPCSG----CSGC----SNAS----TSFNTNSSS 163

Query: 152 TSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKG-TCPRPCP-----SFAYTYGAS 211
           T    +C ++ C                     T  +G TCP   P     SF  +YG  
Sbjct: 164 TYSTVSCSTAQC---------------------TQARGLTCPSSSPQPSVCSFNQSYGGD 223

Query: 212 GVVTGTLTKDVILMHGVSPNSTTQIPRFCFGCVGATYRE---PIGIAGFGRGLLSLPSQL 271
              + +L +D + +      +   IP F FGC+ +       P G+ G GRG +SL SQ 
Sbjct: 224 SSFSASLVQDTLTL------APDVIPNFSFGCINSASGNSLPPQGLMGLGRGPMSLVSQT 283

Query: 272 GFSHKG-FSHCFLPFKFSNNPNFSSPLILGSLAIS--SKDHSLQFTPLLKSPLYPNYYYI 331
              + G FS+C         P+F S    GSL +    +  S+++TPLL++P  P+ YY+
Sbjct: 284 TSLYSGVFSYCL--------PSFRSFYFSGSLKLGLLGQPKSIRYTPLLRNPRRPSLYYV 343

Query: 332 GLESITIGDGIGNNNSRFGVSLKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLES 391
            L  +++G      + +  V       D     G +IDSGT  T   +P+Y  +      
Sbjct: 344 NLTGVSVG------SVQVPVDPVYLTFDANSGAGTIIDSGTVITRFAQPVYEAIRDEFR- 403

Query: 392 VVTYPRAKQVEINT-----GFDLCYKIPCKNNTNFSSSMDDYCSNLLPSITFHFLNNVSV 451
                  KQV +++      FD C+            S D+   N+ P IT H + ++ +
Sbjct: 404 -------KQVNVSSFSTLGAFDTCF------------SADN--ENVAPKITLH-MTSLDL 448

Query: 452 VLPQGNNFYAMAAPTNSTVVKCLLFQSMDGGGGDGDGNGDDDGPAGIFGSFQQQNMEVVY 509
            LP  N     +A T    + CL   SM G         + +    +  + QQQN+ +++
Sbjct: 464 KLPMENTLIHSSAGT----LTCL---SMAG------IRQNANAVLNVIANLQQQNLRILF 448

BLAST of MC03g0574 vs. NCBI nr
Match: XP_022142611.1 (probable aspartyl protease At4g16563 [Momordica charantia])

HSP 1 Score: 1057 bits (2733), Expect = 0.0
Identity = 518/520 (99.62%), Postives = 519/520 (99.81%), Query Frame = 0

Query: 1   SSATTISSKVLTFFLLLILLLSVSETAARPHRNNFPNTDSLVLGLVHSRTSLLTPKRGYF 60
           SSATTISSKVLTFFLLLILLLSVSETAARPHRNNFPNTDSLVLGLVHSRTSLLTPKRGYF
Sbjct: 6   SSATTISSKVLTFFLLLILLLSVSETAARPHRNNFPNTDSLVLGLVHSRTSLLTPKRGYF 65

Query: 61  SRKGSSSSINKPMEEIGSDNVIEPLREIRDGYLISLTLGTPPQVIQVYMDTGSDLTWVPC 120
           SRKGSSSSINKPMEEIGSDNVIEPLREIRDGYLISLTLGTPPQVIQVYMDTGSDLTWVPC
Sbjct: 66  SRKGSSSSINKPMEEIGSDNVIEPLREIRDGYLISLTLGTPPQVIQVYMDTGSDLTWVPC 125

Query: 121 GNLSFDCQDCEEYQNNVSGPKLAPFSPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAG 180
           GNLSFDCQDCEEYQNNVSGPKLAPFSPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAG
Sbjct: 126 GNLSFDCQDCEEYQNNVSGPKLAPFSPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAG 185

Query: 181 CSLATLVKGTCPRPCPSFAYTYGASGVVTGTLTKDVILMHGVSPNSTTQIPRFCFGCVGA 240
           CSLATLVKGTCPRPCPSFAYTYGASGVVTGTLTKDVILMHGVSPNSTTQIPRFCFGCVGA
Sbjct: 186 CSLATLVKGTCPRPCPSFAYTYGASGVVTGTLTKDVILMHGVSPNSTTQIPRFCFGCVGA 245

Query: 241 TYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGSLAISSKDH 300
           TYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGSLAISSKDH
Sbjct: 246 TYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGSLAISSKDH 305

Query: 301 SLQFTPLLKSPLYPNYYYIGLESITIGDGIGNNNSRFGVSLKLREIDTKGNGGMLIDSGT 360
           SLQFTPLLKSPLYPNYYYIGLES+TIGDGIGNNNSRFGVSLKLREIDTKGNGGMLIDSGT
Sbjct: 306 SLQFTPLLKSPLYPNYYYIGLESVTIGDGIGNNNSRFGVSLKLREIDTKGNGGMLIDSGT 365

Query: 361 TYTHLPEPLYSQLISNLESVVTYPRAKQVEINTGFDLCYKIPCKNNTNFSSSMDDYCSNL 420
           TYTHLPEPLYSQLISNLESVVTYPRAKQVEINTGFDLCYKIPCKNNTNFSSSMDDYCSNL
Sbjct: 366 TYTHLPEPLYSQLISNLESVVTYPRAKQVEINTGFDLCYKIPCKNNTNFSSSMDDYCSNL 425

Query: 421 LPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGGGGDGDGNGDDDGPAG 480
           LPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGGGGDGDGNGDDDGPAG
Sbjct: 426 LPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGGGGDGDGNGDDDGPAG 485

Query: 481 IFGSFQQQNMEVVYDLQKERIGFQPMDCASSAASQGLHKN 520
           IFGSFQQQNMEVVYDLQKERIGFQ MDCASSAASQGLHKN
Sbjct: 486 IFGSFQQQNMEVVYDLQKERIGFQTMDCASSAASQGLHKN 525

BLAST of MC03g0574 vs. NCBI nr
Match: XP_008459091.1 (PREDICTED: aspartic proteinase nepenthesin-2 [Cucumis melo] >KAA0043307.1 aspartic proteinase nepenthesin-2 [Cucumis melo var. makuwa] >TYK29371.1 aspartic proteinase nepenthesin-2 [Cucumis melo var. makuwa])

HSP 1 Score: 831 bits (2146), Expect = 3.63e-300
Identity = 420/527 (79.70%), Postives = 460/527 (87.29%), Query Frame = 0

Query: 2   SATTISSKVLTFFLLLILLLSVSETAARPHRNNFPNTDSLVLGLVHSRTSLLTPKRGYFS 61
           S+T+I++K L+ FLLL+   +  +T A   + NFP  DSLVLGLVHSRTSLLTPK+GY  
Sbjct: 5   SSTSIATKFLSLFLLLVH--ASKQTLATNPKTNFPK-DSLVLGLVHSRTSLLTPKKGY-- 64

Query: 62  RKGSSSSINKPMEEI-GSDNVIEPLREIRDGYLISLTLGTPPQVIQVYMDTGSDLTWVPC 121
               S    K M+++ G DNVIEPLREIRDGYL+SL++GTPPQV+QVYMDTGSDLTWVPC
Sbjct: 65  -NFISKKRMKAMDQMDGDDNVIEPLREIRDGYLMSLSIGTPPQVVQVYMDTGSDLTWVPC 124

Query: 122 GNLSFDCQDCEEYQNNVSGPKLAPFSPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAG 181
           GNLSFDCQDCEEYQNN+SGPKLA F PTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAG
Sbjct: 125 GNLSFDCQDCEEYQNNISGPKLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAG 184

Query: 182 CSLATLVKGTCPRPCPSFAYTYGASGVVTGTLTKDVILMHGV-------SPNSTTQIPRF 241
           CSLATLVKGTCPRPCPSFAYTYGASGVVTG+LT+DV+ MHG        + N+  Q+PRF
Sbjct: 185 CSLATLVKGTCPRPCPSFAYTYGASGVVTGSLTRDVLFMHGNYHNNNNNNSNNNKQVPRF 244

Query: 242 CFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGSL 301
           CFGCVGATYREPIGIAGFGRGLLSLP QLGFS KGFSHCFLPFKFSNNPNFSSPLILG L
Sbjct: 245 CFGCVGATYREPIGIAGFGRGLLSLPFQLGFSQKGFSHCFLPFKFSNNPNFSSPLILGHL 304

Query: 302 AISSKDHSLQFTPLLKSPLYPNYYYIGLESITIGDGIGNNNSRFGVSLKLREIDTKGNGG 361
           AISSKD +LQFTPLLKSP+YPNYYYIGLESITIG+G  NNN RFGVS KLREIDTKGNGG
Sbjct: 305 AISSKDENLQFTPLLKSPIYPNYYYIGLESITIGNG--NNNFRFGVSFKLREIDTKGNGG 364

Query: 362 MLIDSGTTYTHLPEPLYSQLISNLESVVTYPRAKQVEINTGFDLCYKIPCKNNTNFSSSM 421
           MLIDSGTTYTHLPEPLYSQLISNLESV++YPRAKQVE+NTGFDLCYK+PCKNN   SS +
Sbjct: 365 MLIDSGTTYTHLPEPLYSQLISNLESVISYPRAKQVELNTGFDLCYKVPCKNNN--SSFV 424

Query: 422 DDYCSNLLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGGGGDGDGNG 481
           DD   + LPSITFHFLNNVSVVLPQGNNFYAMAAP NSTVVKCLL+QSMDG G D D   
Sbjct: 425 DD---SQLPSITFHFLNNVSVVLPQGNNFYAMAAPINSTVVKCLLYQSMDGVGDDNDS-- 484

Query: 482 DDDGPAGIFGSFQQQNMEVVYDLQKERIGFQPMDCASSAASQGLHKN 520
           DD+GPAGIFGSFQQQN++VVYDL+KER+GFQ MDC S AA+QGLHKN
Sbjct: 485 DDNGPAGIFGSFQQQNLQVVYDLEKERLGFQAMDCVSVAANQGLHKN 516

BLAST of MC03g0574 vs. NCBI nr
Match: XP_004145478.2 (probable aspartyl protease At4g16563 [Cucumis sativus] >KGN66888.1 hypothetical protein Csa_007266 [Cucumis sativus])

HSP 1 Score: 823 bits (2127), Expect = 2.10e-297
Identity = 417/523 (79.73%), Postives = 456/523 (87.19%), Query Frame = 0

Query: 2   SATTISSKVLTFFLLLILLLSVSETAARPHRNNFPNTDSLVLGLVHSRTSLLTPKRGYFS 61
           S+ + ++K L+ FLLL+ +   ++T A   + NFP  DSLVLGLVHSRTSLLTPK+GY  
Sbjct: 5   SSISTATKFLSLFLLLVHV--STQTLATNPKTNFPK-DSLVLGLVHSRTSLLTPKKGY-- 64

Query: 62  RKGSSSSINKPMEEI-GSDNVIEPLREIRDGYLISLTLGTPPQVIQVYMDTGSDLTWVPC 121
               S    K M++  G DNVIEPLREIRDGYL+SL++GTPPQV+QVYMDTGSDLTWVPC
Sbjct: 65  -NFISKKRMKAMDQTDGDDNVIEPLREIRDGYLMSLSIGTPPQVVQVYMDTGSDLTWVPC 124

Query: 122 GNLSFDCQDCEEYQNNVSGPKLAPFSPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAG 181
           GNLSFDCQDCEEYQNN+SGP+LA F PTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAG
Sbjct: 125 GNLSFDCQDCEEYQNNISGPRLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAG 184

Query: 182 CSLATLVKGTCPRPCPSFAYTYGASGVVTGTLTKDVILMHGV---SPNSTTQIPRFCFGC 241
           CSLA+LVKGTCPRPCPSFAYTYGASGVVTG+LT+DV+  HG    + N+  QIPRFCFGC
Sbjct: 185 CSLASLVKGTCPRPCPSFAYTYGASGVVTGSLTRDVLFTHGNYNNNNNNNKQIPRFCFGC 244

Query: 242 VGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGSLAISS 301
           VGATYREPIGIAGFGRGLLSLP QLGFSHKGFSHCFLPFKFSNNPNFSSPLILG+LAISS
Sbjct: 245 VGATYREPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISS 304

Query: 302 KDHSLQFTPLLKSPLYPNYYYIGLESITIGDGIGNNNSRFGVSLKLREIDTKGNGGMLID 361
           KD +LQFTPLLKSP+YPNYYYIGLESITIG+G  +NN RFGVS KLREIDTKGNGGMLID
Sbjct: 305 KDENLQFTPLLKSPMYPNYYYIGLESITIGNG--DNNFRFGVSFKLREIDTKGNGGMLID 364

Query: 362 SGTTYTHLPEPLYSQLISNLESVVTYPRAKQVEINTGFDLCYKIPCKNNTNFSSSMDDYC 421
           SGTTYTHLPEPLYSQLISNLE V+ YPRAKQVE+NTGFDLCYK+PCKNN   SS +DD  
Sbjct: 365 SGTTYTHLPEPLYSQLISNLELVIGYPRAKQVELNTGFDLCYKVPCKNNN--SSFVDDA- 424

Query: 422 SNLLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGGGGDGDGNGDDDG 481
              LPSITFHFLNNVSVVLPQGNNFYAMAAP NSTVVKCLL+QSMDG G D D   DD+G
Sbjct: 425 --QLPSITFHFLNNVSVVLPQGNNFYAMAAPINSTVVKCLLYQSMDGVGDDNDS--DDNG 484

Query: 482 PAGIFGSFQQQNMEVVYDLQKERIGFQPMDCASSAASQGLHKN 520
           PAGIFGSFQQQN+EVVYDL+KER+GFQPMDC S AA QGLHKN
Sbjct: 485 PAGIFGSFQQQNIEVVYDLEKERLGFQPMDCVSVAAKQGLHKN 512

BLAST of MC03g0574 vs. NCBI nr
Match: XP_038893627.1 (probable aspartyl protease At4g16563 [Benincasa hispida])

HSP 1 Score: 820 bits (2119), Expect = 2.57e-296
Identity = 425/526 (80.80%), Postives = 459/526 (87.26%), Query Frame = 0

Query: 1   SSATTISSKVLTFFLLLILLLSVSETAARPHRNNFPNTDSLVLGLVHSRTSLLTPKRGY- 60
           S +T+ + K+L++FLLL+ +    +T A   + N P  DSLV+GLVHSRT+LLTPK+GY 
Sbjct: 3   SISTSFAKKILSYFLLLVYV--SRKTLATNPKTNGPK-DSLVIGLVHSRTTLLTPKKGYN 62

Query: 61  -FSRKGSSSSINKPMEEIGSDNVIEPLREIRDGYLISLTLGTPPQVIQVYMDTGSDLTWV 120
             SRK       K ME    DNVIEPLREIRDGYL+SLTLGTPPQVIQVYMDTGSDLTWV
Sbjct: 63  FISRKRM-----KAMEM--DDNVIEPLREIRDGYLMSLTLGTPPQVIQVYMDTGSDLTWV 122

Query: 121 PCGNLSFDCQDCEEYQNNVSGPKLAPFSPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTI 180
           PCGNLSFDCQDCEEYQNNVSGPKLA F PTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTI
Sbjct: 123 PCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTI 182

Query: 181 AGCSLATLVKGTCPRPCPSFAYTYGASGVVTGTLTKDVILMHGV---SPNSTTQ-IPRFC 240
           AGCSLATLVK TCPRPCPSFAYTYGASGVV GTLT+DV+LMH     SPNS+T+  PRFC
Sbjct: 183 AGCSLATLVKATCPRPCPSFAYTYGASGVVIGTLTRDVLLMHINNINSPNSSTKKTPRFC 242

Query: 241 FGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGSLA 300
           FGCVGA+YREPIGIAGFGRGLLSLP QLGFSHKGFSHCFLPFKFSNNPNFSSPLILG+LA
Sbjct: 243 FGCVGASYREPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLA 302

Query: 301 ISSKDHSLQFTPLLKSPLYPNYYYIGLESITIGDGIGNNNSRFGVSLKLREIDTKGNGGM 360
           +SSKD  LQFTPLLKSP+YPNYYYIGLESITIG+G  N+N RFGVS  LREIDTKGNGGM
Sbjct: 303 VSSKDEHLQFTPLLKSPIYPNYYYIGLESITIGNG--NSNFRFGVSFNLREIDTKGNGGM 362

Query: 361 LIDSGTTYTHLPEPLYSQLISNLESVVTYPRAKQVEINTGFDLCYKIPCKNNTNFSSSMD 420
           LIDSGTTYTHLPEPLYSQLISNLESV++YPRAKQVE+NTGFDLCYK+PCKNN NFS  +D
Sbjct: 363 LIDSGTTYTHLPEPLYSQLISNLESVISYPRAKQVELNTGFDLCYKVPCKNN-NFSF-ID 422

Query: 421 DYCSNLLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGGGGDGDGNGD 480
           D   + LPSITFHFLNNVSVVLPQ NNFYAMAAP NSTVVKCLLFQSMDG GGD D   D
Sbjct: 423 D---SQLPSITFHFLNNVSVVLPQENNFYAMAAPINSTVVKCLLFQSMDGVGGDTDD--D 482

Query: 481 DDGPAGIFGSFQQQNMEVVYDLQKERIGFQPMDCASSAASQGLHKN 520
            DGPAGIFGSFQQQN+EVVYDL+KER+GFQPMDCA  AA+QGLHKN
Sbjct: 483 RDGPAGIFGSFQQQNLEVVYDLEKERLGFQPMDCAYVAATQGLHKN 509

BLAST of MC03g0574 vs. NCBI nr
Match: XP_023520027.1 (probable aspartyl protease At4g16563 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 810 bits (2091), Expect = 3.01e-292
Identity = 411/512 (80.27%), Postives = 449/512 (87.70%), Query Frame = 0

Query: 13  FFLLLILLLSVS-----ETAARPHRNNFPNTDSLVLGLVHSRTSLLTPKRGYFSRKGSSS 72
           FF+L+++L+ VS     +T A P +  F   DSLVLGLVHSRTSLLTPKRGY S    S 
Sbjct: 9   FFVLVLVLVLVSGEAMGQTLANP-KTKFLK-DSLVLGLVHSRTSLLTPKRGYNSL---SR 68

Query: 73  SINKPMEEIGSDNVIEPLREIRDGYLISLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDC 132
              KPME +G+D+VIEPLREIRDGYL+SLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDC
Sbjct: 69  KRIKPME-MGNDDVIEPLREIRDGYLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDC 128

Query: 133 QDCEEYQNNVSGPKLAPFSPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLV 192
           QDC+EYQNNV GPKLA F PTHSSTSIRDTCGSSFC+DIHSSDNPFDPCTIAGCSLATLV
Sbjct: 129 QDCDEYQNNVLGPKLAAFLPTHSSTSIRDTCGSSFCIDIHSSDNPFDPCTIAGCSLATLV 188

Query: 193 KGTCPRPCPSFAYTYGASGVVTGTLTKDVILMHGVSPNSTTQIPRFCFGCVGATYREPIG 252
           KGTCPRPCPSF+YTYGASG+V GTLTKDVI +HG SPNS+ +IP+FCFGCVGATYREPIG
Sbjct: 189 KGTCPRPCPSFSYTYGASGLVIGTLTKDVIFIHGNSPNSSRKIPKFCFGCVGATYREPIG 248

Query: 253 IAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGSLAISSKDHSLQFTPL 312
           IAGFGRGLLSLP QLGFSHKGFSHCFLPFKFSNNPNFSSPLILG+LAISSK+H L+FTPL
Sbjct: 249 IAGFGRGLLSLPYQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKEH-LKFTPL 308

Query: 313 LKSPLYPNYYYIGLESITIGDGIGNNNSRFGVSLKLREIDTKGNGGMLIDSGTTYTHLPE 372
           LKSP YPNYYYIGLESITIG+G   N SRFGVSL+LREIDTKGNGG+LIDSGTTYTHLPE
Sbjct: 309 LKSPFYPNYYYIGLESITIGNG--ENYSRFGVSLQLREIDTKGNGGILIDSGTTYTHLPE 368

Query: 373 PLYSQLISNLESVVTYPRAKQVEINTGFDLCYKIPCKNNTNFSSSMDDYCSNLLPSITFH 432
           PLYSQLISNLES+++YPRAK+ E+NTGFDLCYK+P KNNT FS   +      LPSITFH
Sbjct: 369 PLYSQLISNLESLISYPRAKEHELNTGFDLCYKVPYKNNTFFSDEFE------LPSITFH 428

Query: 433 FLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGGGGDGDGNGDDDGPAGIFGSFQQ 492
           FLNNVSVVLPQGN+FYAMAAP+NSTVVKCLLFQSMD         GD DGPAGIFGSFQQ
Sbjct: 429 FLNNVSVVLPQGNSFYAMAAPSNSTVVKCLLFQSMD---------GDGDGPAGIFGSFQQ 488

Query: 493 QNMEVVYDLQKERIGFQPMDCASSAASQGLHK 519
           QN+EVVYDL+KER+GF+ MDCAS A SQGLHK
Sbjct: 489 QNLEVVYDLEKERLGFEGMDCASVAVSQGLHK 496

BLAST of MC03g0574 vs. ExPASy TrEMBL
Match: A0A6J1CMP8 (probable aspartyl protease At4g16563 OS=Momordica charantia OX=3673 GN=LOC111012684 PE=3 SV=1)

HSP 1 Score: 1057 bits (2733), Expect = 0.0
Identity = 518/520 (99.62%), Postives = 519/520 (99.81%), Query Frame = 0

Query: 1   SSATTISSKVLTFFLLLILLLSVSETAARPHRNNFPNTDSLVLGLVHSRTSLLTPKRGYF 60
           SSATTISSKVLTFFLLLILLLSVSETAARPHRNNFPNTDSLVLGLVHSRTSLLTPKRGYF
Sbjct: 6   SSATTISSKVLTFFLLLILLLSVSETAARPHRNNFPNTDSLVLGLVHSRTSLLTPKRGYF 65

Query: 61  SRKGSSSSINKPMEEIGSDNVIEPLREIRDGYLISLTLGTPPQVIQVYMDTGSDLTWVPC 120
           SRKGSSSSINKPMEEIGSDNVIEPLREIRDGYLISLTLGTPPQVIQVYMDTGSDLTWVPC
Sbjct: 66  SRKGSSSSINKPMEEIGSDNVIEPLREIRDGYLISLTLGTPPQVIQVYMDTGSDLTWVPC 125

Query: 121 GNLSFDCQDCEEYQNNVSGPKLAPFSPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAG 180
           GNLSFDCQDCEEYQNNVSGPKLAPFSPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAG
Sbjct: 126 GNLSFDCQDCEEYQNNVSGPKLAPFSPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAG 185

Query: 181 CSLATLVKGTCPRPCPSFAYTYGASGVVTGTLTKDVILMHGVSPNSTTQIPRFCFGCVGA 240
           CSLATLVKGTCPRPCPSFAYTYGASGVVTGTLTKDVILMHGVSPNSTTQIPRFCFGCVGA
Sbjct: 186 CSLATLVKGTCPRPCPSFAYTYGASGVVTGTLTKDVILMHGVSPNSTTQIPRFCFGCVGA 245

Query: 241 TYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGSLAISSKDH 300
           TYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGSLAISSKDH
Sbjct: 246 TYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGSLAISSKDH 305

Query: 301 SLQFTPLLKSPLYPNYYYIGLESITIGDGIGNNNSRFGVSLKLREIDTKGNGGMLIDSGT 360
           SLQFTPLLKSPLYPNYYYIGLES+TIGDGIGNNNSRFGVSLKLREIDTKGNGGMLIDSGT
Sbjct: 306 SLQFTPLLKSPLYPNYYYIGLESVTIGDGIGNNNSRFGVSLKLREIDTKGNGGMLIDSGT 365

Query: 361 TYTHLPEPLYSQLISNLESVVTYPRAKQVEINTGFDLCYKIPCKNNTNFSSSMDDYCSNL 420
           TYTHLPEPLYSQLISNLESVVTYPRAKQVEINTGFDLCYKIPCKNNTNFSSSMDDYCSNL
Sbjct: 366 TYTHLPEPLYSQLISNLESVVTYPRAKQVEINTGFDLCYKIPCKNNTNFSSSMDDYCSNL 425

Query: 421 LPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGGGGDGDGNGDDDGPAG 480
           LPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGGGGDGDGNGDDDGPAG
Sbjct: 426 LPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGGGGDGDGNGDDDGPAG 485

Query: 481 IFGSFQQQNMEVVYDLQKERIGFQPMDCASSAASQGLHKN 520
           IFGSFQQQNMEVVYDLQKERIGFQ MDCASSAASQGLHKN
Sbjct: 486 IFGSFQQQNMEVVYDLQKERIGFQTMDCASSAASQGLHKN 525

BLAST of MC03g0574 vs. ExPASy TrEMBL
Match: A0A5A7TNC9 (Aspartic proteinase nepenthesin-2 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold129G00970 PE=3 SV=1)

HSP 1 Score: 831 bits (2146), Expect = 1.76e-300
Identity = 420/527 (79.70%), Postives = 460/527 (87.29%), Query Frame = 0

Query: 2   SATTISSKVLTFFLLLILLLSVSETAARPHRNNFPNTDSLVLGLVHSRTSLLTPKRGYFS 61
           S+T+I++K L+ FLLL+   +  +T A   + NFP  DSLVLGLVHSRTSLLTPK+GY  
Sbjct: 5   SSTSIATKFLSLFLLLVH--ASKQTLATNPKTNFPK-DSLVLGLVHSRTSLLTPKKGY-- 64

Query: 62  RKGSSSSINKPMEEI-GSDNVIEPLREIRDGYLISLTLGTPPQVIQVYMDTGSDLTWVPC 121
               S    K M+++ G DNVIEPLREIRDGYL+SL++GTPPQV+QVYMDTGSDLTWVPC
Sbjct: 65  -NFISKKRMKAMDQMDGDDNVIEPLREIRDGYLMSLSIGTPPQVVQVYMDTGSDLTWVPC 124

Query: 122 GNLSFDCQDCEEYQNNVSGPKLAPFSPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAG 181
           GNLSFDCQDCEEYQNN+SGPKLA F PTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAG
Sbjct: 125 GNLSFDCQDCEEYQNNISGPKLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAG 184

Query: 182 CSLATLVKGTCPRPCPSFAYTYGASGVVTGTLTKDVILMHGV-------SPNSTTQIPRF 241
           CSLATLVKGTCPRPCPSFAYTYGASGVVTG+LT+DV+ MHG        + N+  Q+PRF
Sbjct: 185 CSLATLVKGTCPRPCPSFAYTYGASGVVTGSLTRDVLFMHGNYHNNNNNNSNNNKQVPRF 244

Query: 242 CFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGSL 301
           CFGCVGATYREPIGIAGFGRGLLSLP QLGFS KGFSHCFLPFKFSNNPNFSSPLILG L
Sbjct: 245 CFGCVGATYREPIGIAGFGRGLLSLPFQLGFSQKGFSHCFLPFKFSNNPNFSSPLILGHL 304

Query: 302 AISSKDHSLQFTPLLKSPLYPNYYYIGLESITIGDGIGNNNSRFGVSLKLREIDTKGNGG 361
           AISSKD +LQFTPLLKSP+YPNYYYIGLESITIG+G  NNN RFGVS KLREIDTKGNGG
Sbjct: 305 AISSKDENLQFTPLLKSPIYPNYYYIGLESITIGNG--NNNFRFGVSFKLREIDTKGNGG 364

Query: 362 MLIDSGTTYTHLPEPLYSQLISNLESVVTYPRAKQVEINTGFDLCYKIPCKNNTNFSSSM 421
           MLIDSGTTYTHLPEPLYSQLISNLESV++YPRAKQVE+NTGFDLCYK+PCKNN   SS +
Sbjct: 365 MLIDSGTTYTHLPEPLYSQLISNLESVISYPRAKQVELNTGFDLCYKVPCKNNN--SSFV 424

Query: 422 DDYCSNLLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGGGGDGDGNG 481
           DD   + LPSITFHFLNNVSVVLPQGNNFYAMAAP NSTVVKCLL+QSMDG G D D   
Sbjct: 425 DD---SQLPSITFHFLNNVSVVLPQGNNFYAMAAPINSTVVKCLLYQSMDGVGDDNDS-- 484

Query: 482 DDDGPAGIFGSFQQQNMEVVYDLQKERIGFQPMDCASSAASQGLHKN 520
           DD+GPAGIFGSFQQQN++VVYDL+KER+GFQ MDC S AA+QGLHKN
Sbjct: 485 DDNGPAGIFGSFQQQNLQVVYDLEKERLGFQAMDCVSVAANQGLHKN 516

BLAST of MC03g0574 vs. ExPASy TrEMBL
Match: A0A1S3CAK9 (aspartic proteinase nepenthesin-2 OS=Cucumis melo OX=3656 GN=LOC103498305 PE=3 SV=1)

HSP 1 Score: 831 bits (2146), Expect = 1.76e-300
Identity = 420/527 (79.70%), Postives = 460/527 (87.29%), Query Frame = 0

Query: 2   SATTISSKVLTFFLLLILLLSVSETAARPHRNNFPNTDSLVLGLVHSRTSLLTPKRGYFS 61
           S+T+I++K L+ FLLL+   +  +T A   + NFP  DSLVLGLVHSRTSLLTPK+GY  
Sbjct: 5   SSTSIATKFLSLFLLLVH--ASKQTLATNPKTNFPK-DSLVLGLVHSRTSLLTPKKGY-- 64

Query: 62  RKGSSSSINKPMEEI-GSDNVIEPLREIRDGYLISLTLGTPPQVIQVYMDTGSDLTWVPC 121
               S    K M+++ G DNVIEPLREIRDGYL+SL++GTPPQV+QVYMDTGSDLTWVPC
Sbjct: 65  -NFISKKRMKAMDQMDGDDNVIEPLREIRDGYLMSLSIGTPPQVVQVYMDTGSDLTWVPC 124

Query: 122 GNLSFDCQDCEEYQNNVSGPKLAPFSPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAG 181
           GNLSFDCQDCEEYQNN+SGPKLA F PTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAG
Sbjct: 125 GNLSFDCQDCEEYQNNISGPKLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAG 184

Query: 182 CSLATLVKGTCPRPCPSFAYTYGASGVVTGTLTKDVILMHGV-------SPNSTTQIPRF 241
           CSLATLVKGTCPRPCPSFAYTYGASGVVTG+LT+DV+ MHG        + N+  Q+PRF
Sbjct: 185 CSLATLVKGTCPRPCPSFAYTYGASGVVTGSLTRDVLFMHGNYHNNNNNNSNNNKQVPRF 244

Query: 242 CFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGSL 301
           CFGCVGATYREPIGIAGFGRGLLSLP QLGFS KGFSHCFLPFKFSNNPNFSSPLILG L
Sbjct: 245 CFGCVGATYREPIGIAGFGRGLLSLPFQLGFSQKGFSHCFLPFKFSNNPNFSSPLILGHL 304

Query: 302 AISSKDHSLQFTPLLKSPLYPNYYYIGLESITIGDGIGNNNSRFGVSLKLREIDTKGNGG 361
           AISSKD +LQFTPLLKSP+YPNYYYIGLESITIG+G  NNN RFGVS KLREIDTKGNGG
Sbjct: 305 AISSKDENLQFTPLLKSPIYPNYYYIGLESITIGNG--NNNFRFGVSFKLREIDTKGNGG 364

Query: 362 MLIDSGTTYTHLPEPLYSQLISNLESVVTYPRAKQVEINTGFDLCYKIPCKNNTNFSSSM 421
           MLIDSGTTYTHLPEPLYSQLISNLESV++YPRAKQVE+NTGFDLCYK+PCKNN   SS +
Sbjct: 365 MLIDSGTTYTHLPEPLYSQLISNLESVISYPRAKQVELNTGFDLCYKVPCKNNN--SSFV 424

Query: 422 DDYCSNLLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGGGGDGDGNG 481
           DD   + LPSITFHFLNNVSVVLPQGNNFYAMAAP NSTVVKCLL+QSMDG G D D   
Sbjct: 425 DD---SQLPSITFHFLNNVSVVLPQGNNFYAMAAPINSTVVKCLLYQSMDGVGDDNDS-- 484

Query: 482 DDDGPAGIFGSFQQQNMEVVYDLQKERIGFQPMDCASSAASQGLHKN 520
           DD+GPAGIFGSFQQQN++VVYDL+KER+GFQ MDC S AA+QGLHKN
Sbjct: 485 DDNGPAGIFGSFQQQNLQVVYDLEKERLGFQAMDCVSVAANQGLHKN 516

BLAST of MC03g0574 vs. ExPASy TrEMBL
Match: A0A0A0LYP0 (Peptidase A1 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_1G704590 PE=3 SV=1)

HSP 1 Score: 823 bits (2127), Expect = 1.02e-297
Identity = 417/523 (79.73%), Postives = 456/523 (87.19%), Query Frame = 0

Query: 2   SATTISSKVLTFFLLLILLLSVSETAARPHRNNFPNTDSLVLGLVHSRTSLLTPKRGYFS 61
           S+ + ++K L+ FLLL+ +   ++T A   + NFP  DSLVLGLVHSRTSLLTPK+GY  
Sbjct: 5   SSISTATKFLSLFLLLVHV--STQTLATNPKTNFPK-DSLVLGLVHSRTSLLTPKKGY-- 64

Query: 62  RKGSSSSINKPMEEI-GSDNVIEPLREIRDGYLISLTLGTPPQVIQVYMDTGSDLTWVPC 121
               S    K M++  G DNVIEPLREIRDGYL+SL++GTPPQV+QVYMDTGSDLTWVPC
Sbjct: 65  -NFISKKRMKAMDQTDGDDNVIEPLREIRDGYLMSLSIGTPPQVVQVYMDTGSDLTWVPC 124

Query: 122 GNLSFDCQDCEEYQNNVSGPKLAPFSPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAG 181
           GNLSFDCQDCEEYQNN+SGP+LA F PTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAG
Sbjct: 125 GNLSFDCQDCEEYQNNISGPRLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAG 184

Query: 182 CSLATLVKGTCPRPCPSFAYTYGASGVVTGTLTKDVILMHGV---SPNSTTQIPRFCFGC 241
           CSLA+LVKGTCPRPCPSFAYTYGASGVVTG+LT+DV+  HG    + N+  QIPRFCFGC
Sbjct: 185 CSLASLVKGTCPRPCPSFAYTYGASGVVTGSLTRDVLFTHGNYNNNNNNNKQIPRFCFGC 244

Query: 242 VGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGSLAISS 301
           VGATYREPIGIAGFGRGLLSLP QLGFSHKGFSHCFLPFKFSNNPNFSSPLILG+LAISS
Sbjct: 245 VGATYREPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISS 304

Query: 302 KDHSLQFTPLLKSPLYPNYYYIGLESITIGDGIGNNNSRFGVSLKLREIDTKGNGGMLID 361
           KD +LQFTPLLKSP+YPNYYYIGLESITIG+G  +NN RFGVS KLREIDTKGNGGMLID
Sbjct: 305 KDENLQFTPLLKSPMYPNYYYIGLESITIGNG--DNNFRFGVSFKLREIDTKGNGGMLID 364

Query: 362 SGTTYTHLPEPLYSQLISNLESVVTYPRAKQVEINTGFDLCYKIPCKNNTNFSSSMDDYC 421
           SGTTYTHLPEPLYSQLISNLE V+ YPRAKQVE+NTGFDLCYK+PCKNN   SS +DD  
Sbjct: 365 SGTTYTHLPEPLYSQLISNLELVIGYPRAKQVELNTGFDLCYKVPCKNNN--SSFVDDA- 424

Query: 422 SNLLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGGGGDGDGNGDDDG 481
              LPSITFHFLNNVSVVLPQGNNFYAMAAP NSTVVKCLL+QSMDG G D D   DD+G
Sbjct: 425 --QLPSITFHFLNNVSVVLPQGNNFYAMAAPINSTVVKCLLYQSMDGVGDDNDS--DDNG 484

Query: 482 PAGIFGSFQQQNMEVVYDLQKERIGFQPMDCASSAASQGLHKN 520
           PAGIFGSFQQQN+EVVYDL+KER+GFQPMDC S AA QGLHKN
Sbjct: 485 PAGIFGSFQQQNIEVVYDLEKERLGFQPMDCVSVAAKQGLHKN 512

BLAST of MC03g0574 vs. ExPASy TrEMBL
Match: A0A6J1EHM1 (probable aspartyl protease At4g16563 OS=Cucurbita moschata OX=3662 GN=LOC111434252 PE=3 SV=1)

HSP 1 Score: 808 bits (2088), Expect = 4.16e-292
Identity = 409/511 (80.04%), Postives = 444/511 (86.89%), Query Frame = 0

Query: 13  FFLLLILLLSVSETAARPHRNNFPNT----DSLVLGLVHSRTSLLTPKRGYFSRKGSSSS 72
           F L+L+L+L   E   +   N  P T    DSLVLGLVHSRTSLLTPKRGY S       
Sbjct: 10  FVLVLVLVLVSGEAMGQTLAN--PKTKFLKDSLVLGLVHSRTSLLTPKRGYNSLLTKRI- 69

Query: 73  INKPMEEIGSDNVIEPLREIRDGYLISLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQ 132
             KPME +G+D+VIEPLREIRDGYL+SLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQ
Sbjct: 70  --KPME-MGNDDVIEPLREIRDGYLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQ 129

Query: 133 DCEEYQNNVSGPKLAPFSPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVK 192
           DC+EYQNNV GPKLA F PTHSSTSIRDTCGSSFC+DIHSSDNPFDPCTIAGCSLATLVK
Sbjct: 130 DCDEYQNNVLGPKLAAFLPTHSSTSIRDTCGSSFCIDIHSSDNPFDPCTIAGCSLATLVK 189

Query: 193 GTCPRPCPSFAYTYGASGVVTGTLTKDVILMHGVSPNSTTQIPRFCFGCVGATYREPIGI 252
           GTCPRPCPSF+YTYGASG+V GTLTKDVI +HG SPNS+ +IP+FCFGCVGATYREPIGI
Sbjct: 190 GTCPRPCPSFSYTYGASGLVIGTLTKDVIFIHGNSPNSSRKIPKFCFGCVGATYREPIGI 249

Query: 253 AGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGSLAISSKDHSLQFTPLL 312
           AGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILG+LAISSK+H L+FTP L
Sbjct: 250 AGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKEH-LKFTPFL 309

Query: 313 KSPLYPNYYYIGLESITIGDGIGNNNSRFGVSLKLREIDTKGNGGMLIDSGTTYTHLPEP 372
           KSP YPNYYYIGLESITIG+G   N SRFGVSL+LREIDTKGNGG+LIDSGTTYTHLPEP
Sbjct: 310 KSPFYPNYYYIGLESITIGNG--ENYSRFGVSLQLREIDTKGNGGILIDSGTTYTHLPEP 369

Query: 373 LYSQLISNLESVVTYPRAKQVEINTGFDLCYKIPCKNNTNFSSSMDDYCSNLLPSITFHF 432
           LYSQLISNLES+++YPRAK+ E+NTGFDLCYK+P KNNT FS   +      LPSITFHF
Sbjct: 370 LYSQLISNLESLISYPRAKEHELNTGFDLCYKVPYKNNTFFSDEFE------LPSITFHF 429

Query: 433 LNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGGGGDGDGNGDDDGPAGIFGSFQQQ 492
           LNNVSVVLPQGN+FYAMAAP+NSTVVKCLLFQSMD         GD DGPAGIFGSFQQQ
Sbjct: 430 LNNVSVVLPQGNSFYAMAAPSNSTVVKCLLFQSMD---------GDGDGPAGIFGSFQQQ 489

Query: 493 NMEVVYDLQKERIGFQPMDCASSAASQGLHK 519
           N+EVVYDL+KER+GF+ MDCAS A SQGLHK
Sbjct: 490 NLEVVYDLEKERLGFEAMDCASVAVSQGLHK 496

BLAST of MC03g0574 vs. TAIR 10
Match: AT5G45120.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 615.1 bits (1585), Expect = 5.0e-176
Identity = 312/514 (60.70%), Postives = 386/514 (75.10%), Query Frame = 0

Query: 10  VLTFFLLLILLL-SVSETAARPHRNNFPNTDS-LVLGLVHSRTSLLTPKRGYFSRKGSSS 69
           VL  FLL+ LLL + ++T AR H+N   ++ S LVL L  S  SL TPK        +  
Sbjct: 7   VLFLFLLITLLLNTTNKTQARQHKNPSSSSSSFLVLTLTKSSVSLPTPK------SQTQE 66

Query: 70  SINKPMEEIGSDNVIEPLREIRDGYLISLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDC 129
            I KP+  +  D V+EPLRE+RDGYLI+L +GTPPQ +QVY+DTGSDLTWVPCGNLSFDC
Sbjct: 67  RIKKPLSSV--DVVMEPLREVRDGYLITLNIGTPPQAVQVYLDTGSDLTWVPCGNLSFDC 126

Query: 130 QDCEEYQNN-VSGPKLAPFSPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATL 189
            +C + +NN +  P +  FSP HSSTS RD+C SSFC++IHSSDNPFDPC +AGCS++ L
Sbjct: 127 IECYDLKNNDLKSPSV--FSPLHSSTSFRDSCASSFCVEIHSSDNPFDPCAVAGCSVSML 186

Query: 190 VKGTCPRPCPSFAYTYGASGVVTGTLTKDVILMHGVSPNSTTQIPRFCFGCVGATYREPI 249
           +K TC RPCPSFAYTYG  G+++G LT+D++         T  +PRF FGCV +TYREPI
Sbjct: 187 LKSTCVRPCPSFAYTYGEGGLISGILTRDILKAR------TRDVPRFSFGCVTSTYREPI 246

Query: 250 GIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGSLAIS-SKDHSLQFT 309
           GIAGFGRGLLSLPSQLGF  KGFSHCFLPFKF NNPN SSPLILG+ A+S +   SLQFT
Sbjct: 247 GIAGFGRGLLSLPSQLGFLEKGFSHCFLPFKFVNNPNISSPLILGASALSINLTDSLQFT 306

Query: 310 PLLKSPLYPNYYYIGLESITIGDGIGNNNSRFGVSLKLREIDTKGNGGMLIDSGTTYTHL 369
           P+L +P+YPN YYIGLESITIG  I        V L LR+ D++GNGGML+DSGTTYTHL
Sbjct: 307 PMLNTPMYPNSYYIGLESITIGTNITPTQ----VPLTLRQFDSQGNGGMLVDSGTTYTHL 366

Query: 370 PEPLYSQLISNLESVVTYPRAKQVEINTGFDLCYKIPCKNNTNFSSSMDDYCSNLLPSIT 429
           PEP YSQL++ L+S +TYPRA + E  TGFDLCYK+PC NN    +S+++    + PSIT
Sbjct: 367 PEPFYSQLLTTLQSTITYPRATETESRTGFDLCYKVPCPNNN--LTSLENDVMMIFPSIT 426

Query: 430 FHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGGGGDGDGNGDDDGPAGIFGSF 489
           FHFLNN +++LPQGN+FYAM+AP++ +VV+CLLFQ+M+ G         D GPAG+FGSF
Sbjct: 427 FHFLNNATLLLPQGNSFYAMSAPSDGSVVQCLLFQNMEDG---------DYGPAGVFGSF 486

Query: 490 QQQNMEVVYDLQKERIGFQPMDCASSAASQGLHK 520
           QQQN++VVYDL+KERIGFQ MDC   AAS GL++
Sbjct: 487 QQQNVKVVYDLEKERIGFQAMDCVLEAASHGLNQ 489

BLAST of MC03g0574 vs. TAIR 10
Match: AT4G16563.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 212.6 bits (540), Expect = 7.5e-55
Identity = 156/452 (34.51%), Postives = 213/452 (47.12%), Query Frame = 0

Query: 92  YLISLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAPFSPTHSS 151
           YLISL++G+    + +Y+DTGSDL W PC    F C  CE    +   P   P S + S+
Sbjct: 83  YLISLSVGSSSSAVSLYLDTGSDLVWFPC--RPFTCILCE----SKPLPPSPPSSLSSSA 142

Query: 152 TSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTC---PRPCPSFAYTYGASGVV 211
           T++  +C S  C   HSS    D C I+ C L  +  G C     PCP F Y YG  G +
Sbjct: 143 TTV--SCSSPSCSAAHSSLPSSDLCAISNCPLDFIETGDCNTSSYPCPPFYYAYG-DGSL 202

Query: 212 TGTLTKDVILMHGVSPNSTTQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGF--SH 271
              L  D + +  VS      +  F FGC   T  EPIG+AGFGRG LSLP+QL     H
Sbjct: 203 VAKLYSDSLSLPSVS------VSNFTFGCAHTTLAEPIGVAGFGRGRLSLPAQLAVHSPH 262

Query: 272 KG--FSHCFLPFKF-SNNPNFSSPLILGSLA------------------ISSKDHSLQFT 331
            G  FS+C +   F S+     SPLILG                        K +   FT
Sbjct: 263 LGNSFSYCLVSHSFDSDRVRRPSPLILGRFVDKKEKRVGTTDDHDDGDDEKKKKNEFVFT 322

Query: 332 PLLKSPLYPNYYYIGLESITIGDGIGNNNSRFGVSLKLREIDTKGNGGMLIDSGTTYTHL 391
            +L++P +P +Y + L+ I+IG               LR ID  G GG+++DSGTT+T L
Sbjct: 323 EMLENPKHPYFYSVSLQGISIG------KRNIPAPAMLRRIDKNGGGGVVVDSGTTFTML 382

Query: 392 PEPLYSQLISNLESVV--TYPRAKQVEINTGFDLCYKIPCKNNTNFSSSMDDYCSNLLPS 451
           P   Y+ ++   +S V   + RA +VE ++G   CY +   N T             +P+
Sbjct: 383 PAKFYNSVVEEFDSRVGRVHERADRVEPSSGMSPCYYL---NQT-----------VKVPA 442

Query: 452 ITFHFL-NNVSVVLPQGNNFYAMA----APTNSTVVKCLLFQSMDGGGGDGDGNGDDDGP 511
           +  HF  N  SV LP+ N FY              + CL+  +   GG + +  G   G 
Sbjct: 443 LVLHFAGNRSSVTLPRRNYFYEFMDGGDGKEEKRKIGCLMLMN---GGDESELRG---GT 493

BLAST of MC03g0574 vs. TAIR 10
Match: AT3G52500.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 173.3 bits (438), Expect = 5.0e-43
Identity = 156/526 (29.66%), Postives = 236/526 (44.87%), Query Frame = 0

Query: 6   ISSKVLTFFLLLILLLSVSETAARP--HRNNFPNTDSLVLGLVHSRTSLLTPKRGYFSRK 65
           ++S +  FFL+ + ++S  +    P  H +  P    L L     R +  +  R +  + 
Sbjct: 1   MASSIFFFFLIFLSVVSAVKLPLSPFSHSDQSPKDPYLSL----RRLAESSIARAHKLKH 60

Query: 66  GSSSSINKPMEEIGSDNVIEPLREIRD--------GYLISLTLGTPPQVIQVYMDTGSDL 125
           G+S    KP E+  S         ++         GY +SL+ GTP Q I    DTGS L
Sbjct: 61  GTSI---KPDEDALSSTTTASATVVKSPLSAKSYGGYSVSLSFGTPSQTIPFVFDTGSSL 120

Query: 126 TWVPCGNLSFDCQDCEEYQNNVSG--PKLAP-FSPTHSSTSIRDTCGSSFCMDIHSSDNP 185
            W+PC +  + C  C+      SG  P L P F P +SS+S    C S  C  ++    P
Sbjct: 121 VWLPCTS-RYLCSGCD-----FSGLDPTLIPRFIPKNSSSSKIIGCQSPKCQFLY---GP 180

Query: 186 FDPCTIAGCSLATLVKGTCPRPCPSFAYTYGASGVVTGTLTKDVILMHGVSPNSTTQIPR 245
              C   GC   T     C   CP +   YG  G   G L  + +      P+ T  +P 
Sbjct: 181 NVQC--RGCDPNT---RNCTVGCPPYILQYGL-GSTAGVLITEKLDF----PDLT--VPD 240

Query: 246 FCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGS 305
           F  GC   + R+P GIAGFGRG +SLPSQ+    K FSHC +  +F ++ N ++ L L +
Sbjct: 241 FVVGCSIISTRQPAGIAGFGRGPVSLPSQMNL--KRFSHCLVSRRF-DDTNVTTDLDLDT 300

Query: 306 LA---ISSKDHSLQFTPLLKSPLYPN-----YYYIGLESITIGDGIGNNNSRFGVSLKLR 365
            +     SK   L +TP  K+P   N     YYY+ L  I +G           +  K  
Sbjct: 301 GSGHNSGSKTPGLTYTPFRKNPNVSNKAFLEYYYLNLRRIYVG------RKHVKIPYKYL 360

Query: 366 EIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVVT-YPRAKQVEINTGFDLCYKIPC 425
              T G+GG ++DSG+T+T +  P++  +     S ++ Y R K +E  TG   C+ I  
Sbjct: 361 APGTNGDGGSIVDSGSTFTFMERPVFELVAEEFASQMSNYTREKDLEKETGLGPCFNISG 420

Query: 426 KNNTNFSSSMDDYCSNLLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMD 485
           K +              +P + F F     + LP  +N++     T++  +  +  ++++
Sbjct: 421 KGDVT------------VPELIFEFKGGAKLELPL-SNYFTFVGNTDTVCLTVVSDKTVN 468

Query: 486 GGGGDGDGNGDDDGPAGIFGSFQQQNMEVVYDLQKERIGFQPMDCA 510
             GG         GPA I GSFQQQN  V YDL+ +R GF    C+
Sbjct: 481 PSGG--------TGPAIILGSFQQQNYLVEYDLENDRFGFAKKKCS 468

BLAST of MC03g0574 vs. TAIR 10
Match: AT3G25700.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 149.8 bits (377), Expect = 5.9e-36
Identity = 143/515 (27.77%), Postives = 206/515 (40.00%), Query Frame = 0

Query: 11  LTFFL---LLILLLSVSETAARPHRNNFPNTDSLVLGLVHSRTSLLT--PKRGYFSRKGS 70
           L FFL   L + LL  S  AA  + N +     L      S T  L    +R +F     
Sbjct: 4   LIFFLCSFLSLFLLPPSNIAAVSNHNKYLKLPLLRKSPFPSPTQALALDTRRLHF----- 63

Query: 71  SSSINKPMEEIGSDNVIEPLREIRDGYLISLTLGTPPQVIQVYMDTGSDLTWVPCGNLSF 130
            S   KP+  + S  V+         Y + L +G PPQ + +  DTGSDL WV C     
Sbjct: 64  LSLRRKPIPFVKSP-VVSGAASGSGQYFVDLRIGQPPQSLLLIADTGSDLVWVKCS---- 123

Query: 131 DCQDCEEYQNNVSGPKLAPFSPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLAT 190
            C++C  +           F P HSST     C    C  +   D        A     T
Sbjct: 124 ACRNCSHHS------PATVFFPRHSSTFSPAHCYDPVCRLVPKPDR-------APICNHT 183

Query: 191 LVKGTCPRPCPSFAYTYGASGVVTGTLTKDVILMHGVSPNSTTQIPRFCFGC-------- 250
            +  TC      + Y Y    + +G   ++   +   S     ++    FGC        
Sbjct: 184 RIHSTC-----HYEYGYADGSLTSGLFARETTSLK-TSSGKEARLKSVAFGCGFRISGQS 243

Query: 251 -VGATYREPIGIAGFGRGLLSLPSQLG--FSHKGFSHCFLPFKFSNNPNFSSPLILGSLA 310
             G ++    G+ G GRG +S  SQLG  F +K FS+C + +  S  P  +S LI+G+  
Sbjct: 244 VSGTSFNGANGVMGLGRGPISFASQLGRRFGNK-FSYCLMDYTLSPPP--TSYLIIGNGG 303

Query: 311 ISSKDHSLQFTPLLKSPLYPNYYYIGLESITIGDGIGNNNSRFGVSLKLREIDTKGNGGM 370
                  L FTPLL +PL P +YY+ L+S+ +      N ++  +   + EID  GNGG 
Sbjct: 304 DGIS--KLFFTPLLTNPLSPTFYYVKLKSVFV------NGAKLRIDPSIWEIDDSGNGGT 363

Query: 371 LIDSGTTYTHLPEPLYSQLISNLESVVTYPRAKQVEINTGFDLCYKIPCKNNTNFSSSMD 430
           ++DSGTT   L EP Y  +I+ +   V  P A    +  GFDLC  +             
Sbjct: 364 VVDSGTTLAFLAEPAYRSVIAAVRRRVKLPIAD--ALTPGFDLCVNVSGVTKP------- 423

Query: 431 DYCSNLLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGGGGDGDGNGD 490
                +LP + F F      V P  N F           ++CL  QS+D   G       
Sbjct: 424 ---EKILPRLKFEFSGGAVFVPPPRNYFI-----ETEEQIQCLAIQSVDPKVG------- 450

Query: 491 DDGPAGIFGSFQQQNMEVVYDLQKERIGFQPMDCA 510
                 + G+  QQ     +D  + R+GF    CA
Sbjct: 484 ----FSVIGNLMQQGFLFEFDRDRSRLGFSRRGCA 450

BLAST of MC03g0574 vs. TAIR 10
Match: AT2G03200.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 144.4 bits (363), Expect = 2.5e-34
Identity = 154/536 (28.73%), Postives = 225/536 (41.98%), Query Frame = 0

Query: 2   SATTISSKVLTFFLLLI-LLLSVSETAA----RPHRNNFPNTDSLVLGLVH-----SRTS 61
           ++++ SS +  FFL+L   L+SVS +      R    N P +    L L H     + T 
Sbjct: 2   ASSSSSSLLFPFFLILFSCLISVSSSRRSLIDRTLPKNLPRS-GFRLSLRHVDSGKNLTK 61

Query: 62  LLTPKRGY------FSRKGSSSSINKPMEEIGSDNVIEPLREIRDGYLISLTLGTPPQVI 121
           +   +RG        +R G+ + +    +   ++N+  P       +L+ L++G P    
Sbjct: 62  IQKIQRGINRGFHRLNRLGAVAVLAVASKPDDTNNIKAPTHGGSGEFLMELSIGNPAVKY 121

Query: 122 QVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAPFSPTHSSTSIRDTCGSSFCMD 181
              +DTGSDL W  C      C +C +    +       F P  SS+  +  C S  C  
Sbjct: 122 SAIVDTGSDLIWTQCK----PCTECFDQPTPI-------FDPEKSSSYSKVGCSSGLCNA 181

Query: 182 IHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAYTYGASGVVTGTLTKDVILMHGVSPN 241
           +  S+   D             K  C      + YTYG      G L  +         N
Sbjct: 182 LPRSNCNED-------------KDAC-----EYLYTYGDYSSTRGLLATETFTFE--DEN 241

Query: 242 STTQIPRFCFGC----VGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNN 301
           S + I    FGC     G  + +  G+ G GRG LSL SQL      FS+C    + S  
Sbjct: 242 SISGIG---FGCGVENEGDGFSQGSGLVGLGRGPLSLISQL--KETKFSYCLTSIEDS-- 301

Query: 302 PNFSSPLILGSLAI-------SSKDHSLQFT-PLLKSPLYPNYYYIGLESITIGDGIGNN 361
              SS L +GSLA        +S D  +  T  LL++P  P++YY+ L+ IT+G      
Sbjct: 302 -EASSSLFIGSLASGIVNKTGASLDGEVTKTMSLLRNPDQPSFYYLELQGITVG------ 361

Query: 362 NSRFGVSLKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVVTYPRAKQVEINT 421
             R  V     E+   G GGM+IDSGTT T+L E  +  L     S ++ P       +T
Sbjct: 362 AKRLSVEKSTFELAEDGTGGMIIDSGTTITYLEETAFKVLKEEFTSRMSLP--VDDSGST 421

Query: 422 GFDLCYKIPCKNNTNFSSSMDDYCSNL-LPSITFHFLNNVSVVLPQGNNFYAMAAPTNST 481
           G DLC+K+P            D   N+ +P + FHF     + LP  N   A     +ST
Sbjct: 422 GLDLCFKLP------------DAAKNIAVPKMIFHF-KGADLELPGENYMVA----DSST 458

Query: 482 VVKCLLFQSMDGGGGDGDGNGDDDGPAGIFGSFQQQNMEVVYDLQKERIGFQPMDC 509
            V CL   S +G                IFG+ QQQN  V++DL+KE + F P +C
Sbjct: 482 GVLCLAMGSSNG--------------MSIFGNVQQQNFNVLHDLEKETVSFVPTEC 458

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q940R41.1e-5334.51Probable aspartyl protease At4g16563 OS=Arabidopsis thaliana OX=3702 GN=At4g1656... [more]
Q766C21.0e-3228.29Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 S... [more]
Q766C38.7e-3229.65Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 S... [more]
Q9LNJ31.9e-3127.99Aspartyl protease family protein 2 OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 ... [more]
O044961.2e-2826.73Aspartyl protease AED3 OS=Arabidopsis thaliana OX=3702 GN=AED3 PE=1 SV=1[more]
Match NameE-valueIdentityDescription
XP_022142611.10.099.62probable aspartyl protease At4g16563 [Momordica charantia][more]
XP_008459091.13.63e-30079.70PREDICTED: aspartic proteinase nepenthesin-2 [Cucumis melo] >KAA0043307.1 aspart... [more]
XP_004145478.22.10e-29779.73probable aspartyl protease At4g16563 [Cucumis sativus] >KGN66888.1 hypothetical ... [more]
XP_038893627.12.57e-29680.80probable aspartyl protease At4g16563 [Benincasa hispida][more]
XP_023520027.13.01e-29280.27probable aspartyl protease At4g16563 [Cucurbita pepo subsp. pepo][more]
Match NameE-valueIdentityDescription
A0A6J1CMP80.099.62probable aspartyl protease At4g16563 OS=Momordica charantia OX=3673 GN=LOC111012... [more]
A0A5A7TNC91.76e-30079.70Aspartic proteinase nepenthesin-2 OS=Cucumis melo var. makuwa OX=1194695 GN=E567... [more]
A0A1S3CAK91.76e-30079.70aspartic proteinase nepenthesin-2 OS=Cucumis melo OX=3656 GN=LOC103498305 PE=3 S... [more]
A0A0A0LYP01.02e-29779.73Peptidase A1 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_1G70459... [more]
A0A6J1EHM14.16e-29280.04probable aspartyl protease At4g16563 OS=Cucurbita moschata OX=3662 GN=LOC1114342... [more]
Match NameE-valueIdentityDescription
AT5G45120.15.0e-17660.70Eukaryotic aspartyl protease family protein [more]
AT4G16563.17.5e-5534.51Eukaryotic aspartyl protease family protein [more]
AT3G52500.15.0e-4329.66Eukaryotic aspartyl protease family protein [more]
AT3G25700.15.9e-3627.77Eukaryotic aspartyl protease family protein [more]
AT2G03200.12.5e-3428.73Eukaryotic aspartyl protease family protein [more]
InterPro
Analysis Name: InterPro Annotations of Bitter gourd (Dali-11) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR032861Xylanase inhibitor, N-terminalPFAMPF14543TAXi_Ncoord: 92..292
e-value: 2.3E-29
score: 102.9
IPR032799Xylanase inhibitor, C-terminalPFAMPF14541TAXi_Ccoord: 317..504
e-value: 7.9E-26
score: 90.8
IPR021109Aspartic peptidase domain superfamilyGENE3D2.40.70.10Acid Proteasescoord: 292..514
e-value: 5.0E-45
score: 155.4
IPR021109Aspartic peptidase domain superfamilyGENE3D2.40.70.10Acid Proteasescoord: 86..291
e-value: 2.3E-30
score: 108.0
IPR021109Aspartic peptidase domain superfamilySUPERFAMILY50630Acid proteasescoord: 91..510
NoneNo IPR availablePANTHERPTHR47967:SF47CHLOROPLAST NUCLEOID DNA-BINDING PROTEIN-LIKEcoord: 53..513
NoneNo IPR availablePANTHERPTHR47967OS07G0603500 PROTEIN-RELATEDcoord: 53..513
IPR001969Aspartic peptidase, active sitePROSITEPS00141ASP_PROTEASEcoord: 354..365
IPR033121Peptidase family A1 domainPROSITEPS51767PEPTIDASE_A1coord: 92..504
score: 32.954071
IPR034161Pepsin-like domain, plantCDDcd05476pepsin_A_like_plantcoord: 91..508
e-value: 8.90108E-72
score: 227.531

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
MC03g0574.1MC03g0574.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006508 proteolysis
cellular_component GO:0005576 extracellular region
molecular_function GO:0004190 aspartic-type endopeptidase activity