CmoCh06G002870 (gene) Cucurbita moschata (Rifu) v1

Overview
NameCmoCh06G002870
Typegene
OrganismCucurbita moschata (Cucurbita moschata (Rifu) v1)
Descriptionaspartic proteinase PCS1-like
LocationCmo_Chr06: 1430454 .. 1434047 (+)
RNA-Seq ExpressionCmoCh06G002870
SyntenyCmoCh06G002870
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
TGTTTGGCATAAAAGAAGTAAGTTCATATGTCTTAGAGAACTTTTTCGAAATTTGTGCTGATTCCTGTCTAGTCATGCCATCAGCTGTTGTTGTAAGCCAAGCTATGATTGTTGGAATTTTGTACTGATATCGTAACGTAAAGGACAGAAGGCTCATCCAGCCAGATCAATATGAACCGAGGAGCTCTGTTTTGTTATTGGAAATGGAGGTTCTTTGTATGATGCAAGATGAAGGCTTTTCATGTCTTCTACCATTTGGATTTTGGATTTTGTCTGTGAGAAGAGACAAGAGACTGGACAATGCACTCCACTCGGAAGTGCAAAGGATGTTTGGATTTACTTACCCATGTTTCTTTGTTTGGTTCTCTAAGTTGGAGTTCTTGACACGTTTTTCCACTCTGCTTTTGTCACTATGAAATGGATTCGGGTTTTTTTTTTTTATCGAAGTCCACCCCTAACTGATATTGTGATTCTCCTCAAGATTTCAAAACGTGTCTACTAGTGAAAGATTCTCACACCCTTAGTTTTCCTCTCCAATCAATGTGAGATCTCACACCCTTGATTTTCTTTTTCAACTAACGTGGGATCTCACAATCCACTCATTTGGCACATTGGGCTACCGTTCGATAACTGGTTTCAATACCATTTATAATAACTAGCCTCACAGTTTTGTTAGGGAGAGTTTTTATATCTTTGTAATCTTTAATTAATGCTAGATTTATTTTTCGATTGGTAGAGCTTTTTTTTTATTTAGCTATAAAATGTATTAACTTTCCAAAAAAATGACACTATAAAATCCAGAAGGTGGGCCATCCTTATCCACACAAAACATCAGAGCCTGAAGTCGGACAAAGCCTTATCCATAGTTGATCTCCAAAACATCTCTTTATCCATATTCAATCCCATTCTGACCATCAAAATATAAGCGCAAAACCTCGATCCATTCAGTCATTGCCTTCGATCATGGCGTCCTTATGTTCATTTCCACGCATTTCTTCTGCAGATCCCATCAAGCATCTCGCCGCCGCCCCCTTTCCGCCGTCCAATCACCCGATAAGACCATCGGCGTTGTCTCTCCGTCAAAGCAGCCGCAACCAGAAAAGAACTTCTACGATTGTCGCCGCCATCGGAGACGTCTCCGCTGACGGCACCACGTATTTGATCGCCGGCGCCGTCGCTGTGGCTCTCGTTGGTACCGCCTTCCCTATACTCTTCTCTCGCAAAGACCTGTAAGCAAAATCTCAATTTCTCCTATAAACCATTTGTTCATATTGAACTCATTTTAAAAACGCCGTTTAATACCGCTGTTTGAATGGTCTGGTCCTCCGTAGGTGCCCGGTATGCGACGGCGCAGGGTTTGTCCGGAAGTCGGGGGCGGCGCTGAGGGCGAATGCCGCTCGTAAAGACCAAGCTCAGATCGTTTGTTCTCGTTGCAATGGCCTGGGCAAGCTCAATCAAGTGGACAAATAAATTACTTTATTTTATTTTATATTTTCCGCCCTTTCCTAATGTGACTGTAATATAATATTAAAATATATTATAAGTCTATGTACCTTGAGCTTATAATATATTTTAAAACCATACATTGTACTAAAATCTCATGATTTTAATTCAGAAACCTTCCAGTTATTAATTTAATGATTTTGCTGATCTGGCAGGCAATATCCACGTGTCAACTCAACGTTGACTCTGGTTCGGCGACCCATTTACTGACTATTTCTTTTCCACGTGTTATTTTCATCAAAATATATATTTATAAAATAAAAAATTATAAATTCAATTAATATTGCCAAAAATAATAGAGTTAAATATGTTGCATTCTTTCACTATAATTATTATTTTATTATTCAATGAATTGAAACACCTAGGTGTCCTTTTCCAATAATTTCCCCAAATAGGGTAGCCAAATTGTACATTTAATTTTAATTTTAAAACAATTTATAGCCTTCTTAATTACAAATTTATTATTTTTAAATTTATTTAACTTTTGATTTTGAATTTAATAGATCTATTTAATTATTATTTTTTGGAATTTATTGATTTTCTTTTAATAATTGGTCTTTTTTATTGTCGGATCTTCGCAGGGAAGAAGTTTCTCAAAGGATTTCAAAGATTTTATGAACAATCTGTTCTAATAAACCAAAACAGAGCACACGAATTTCCTCAGATTCATTTCACTCTGCTTCTTCCGACAGCCCCAAAAAACTTGGAATTCCAATTTTCTAGGAAGCAAAGAAGAACAAAATGAGGGATTATTGTATAAGTTTAAATCCCTTGAACAACCCATTTCTCAAATATTGTTTCTCTTTCTTCTTAATTTCTCTGGTTTTTGTCTTTCAAAATCCATTACTCTGTTTCTCATTGAACCCAGCCCTTATTTTGCCCCTCAAAACCCAGGTGATTCCGCCTGAATCCGTCCCGCGATCTCCCGACAAGCTCCCTTTCCGGCATAACGTCAGCCTTACTGTCTCTCTGACTGTCGGAACGCCGCCGCAGAATGTCACGATGGTCATCGACACTGGCAGCGAACTCTCATGGCTCCACTGTAACAGATCACAAAACTCCTCTTCTTCATCTTCAACATTCAACCCAGTCCAGTCCTCTTCGTACACTCCTATCCCTTGTTCCTCCTCCACGTGTACCGACCAAACAAGGGATTTTCCGATTCCGGCATCGTGTGATTCAAATCAGCTCTGTCACGCTACTCTGTCTTACGCCGACGCCTCTTCTTCAGAAGGAAATCTCGCCGCCGATACGTTTTACATCGGAAATTCCGGTATTCCAAATGTCGTTTTCGGCTGTATGGATTCAATTTTCAGTTCCAACAATGAAGAAGACTCCAAAAACACCGGATTAATGGGTATGAATCGTGGATCTTTGTCTTTTGTTTCTCAAATGGGTTTCCCCAAATTTTCATATTGCATATCGGAATATGATTTTCCCGGTTTGTTATTACTCGGAGATGCCAATTTTTCATGGTTGGCTCCGTTGAATTACACTCCAATGATCCAAATATCCACGCCATTGCCGTATTTCGATCGAGTCGCTTACACGGTTCAATTGCAAGGAATCAAAGTTTCCAACAAGTTACTTCCAATACCGGAATCGGCCTTCGAACCGGACCACACCGGCGCCGGTCAGACCATGGTCGATTCAGGCACACAGTTCACTTTCCTTCTGGGGCAAGCTTACACCGTTCTCCGAGACGAGTTCCTGAACCAAACCGCCGGTTCGCTTCGTGTTTCCGAGGATCCGAATTTCGTTTTCCAAGGAGCCATGGATCTTTGCTACCGAGTTCCAACGAATCAAACCCGTCTCCCGCCGCTGCCGGCGGTGACGCTGGTGTTTCGCGGCGCCGAGATGACAGTTACCGGCGACCGTATTCTGTACAGAGTGGCCGGAGAAATAAGAGGGAACGATACGATTTATTGCTTTACATTCGGGAATTCAGATCTTCTGGGCGTTGAAGCGTACGTGATTGGGCATCTCCATCAACAAAACGTGTGGATGGAATTCGATCTGAAAAAATCGCGGATCGGGTTGGCGGAGATCCGGTGCGATTTAGCGGGTCAGAAATTGGGCATGGGCCTGTAA

mRNA sequence

TGTTTGGCATAAAAGAAGTAAGTTCATATGTCTTAGAGAACTTTTTCGAAATTTGTGCTGATTCCTGTCTAGTCATGCCATCAGCTGTTGTTGTAAGCCAAGCTATGATTGTTGGAATTTTGTACTGATATCGTAACGTAAAGGACAGAAGGCTCATCCAGCCAGATCAATATGAACCGAGGAGCTCTGTTTTGTTATTGGAAATGGAGGTTCTTTGTATGATGCAAGATGAAGGCTTTTCATGTCTTCTACCATTTGGATTTTGGATTTTGTCTGTGAGAAGAGACAAGAGACTGGACAATGCACTCCACTCGGAAGTGCAAAGGATGTTTGGATTTACTTACCCATGTTTCTTTGTTTGGTTCTCTAAGTTGGAGTTCTTGACACGTTTTTCCACTCTGCTTTTGTCACTATGAAATGGATTCGGGTTTTTTTTTTTTATCGAAGTCCACCCCTAACTGATATTGTGATTCTCCTCAAGATTTCAAAACGTGTCTACTAGTGAAAGATTCTCACACCCTTAGTTTTCCTCTCCAATCAATGTGAGATCTCACACCCTTGATTTTCTTTTTCAACTAACGTGGGATCTCACAATCCACTCATTTGGCACATTGGGCTACCGTTCGATAACTGGTTTCAATACCATTTATAATAACTAGCCTCACAGTTTTGTTAGGGAGAGTTTTTATATCTTTGTAATCTTTAATTAATGCTAGATTTATTTTTCGATTGGTAGAGCTTTTTTTTTATTTAGCTATAAAATGTATTAACTTTCCAAAAAAATGACACTATAAAATCCAGAAGGTGGGCCATCCTTATCCACACAAAACATCAGAGCCTGAAGTCGGACAAAGCCTTATCCATAGTTGATCTCCAAAACATCTCTTTATCCATATTCAATCCCATTCTGACCATCAAAATATAAGCGCAAAACCTCGATCCATTCAGTCATTGCCTTCGATCATGGCGTCCTTATGTTCATTTCCACGCATTTCTTCTGCAGATCCCATCAAGCATCTCGCCGCCGCCCCCTTTCCGCCGTCCAATCACCCGATAAGACCATCGGCGTTGTCTCTCCGTCAAAGCAGCCGCAACCAGAAAAGAACTTCTACGATTGTCGCCGCCATCGGAGACGTCTCCGCTGACGGCACCACGTATTTGATCGCCGGCGCCGTCGCTGTGGCTCTCGTTGGTACCGCCTTCCCTATACTCTTCTCTCGCAAAGACCTGTGCCCGGTATGCGACGGCGCAGGGTTTGTCCGGAAGTCGGGGGCGGCGCTGAGGGCGAATGCCGCTCGTAAAGACCAAGCTCAGATCGTGATTCCGCCTGAATCCGTCCCGCGATCTCCCGACAAGCTCCCTTTCCGGCATAACGTCAGCCTTACTGTCTCTCTGACTGTCGGAACGCCGCCGCAGAATGTCACGATGGTCATCGACACTGGCAGCGAACTCTCATGGCTCCACTGTAACAGATCACAAAACTCCTCTTCTTCATCTTCAACATTCAACCCAGTCCAGTCCTCTTCGTACACTCCTATCCCTTGTTCCTCCTCCACGTGTACCGACCAAACAAGGGATTTTCCGATTCCGGCATCGTGTGATTCAAATCAGCTCTGTCACGCTACTCTGTCTTACGCCGACGCCTCTTCTTCAGAAGGAAATCTCGCCGCCGATACGTTTTACATCGGAAATTCCGGTATTCCAAATGTCGTTTTCGGCTGTATGGATTCAATTTTCAGTTCCAACAATGAAGAAGACTCCAAAAACACCGGATTAATGGGTATGAATCGTGGATCTTTGTCTTTTGTTTCTCAAATGGGTTTCCCCAAATTTTCATATTGCATATCGGAATATGATTTTCCCGGTTTGTTATTACTCGGAGATGCCAATTTTTCATGGTTGGCTCCGTTGAATTACACTCCAATGATCCAAATATCCACGCCATTGCCGTATTTCGATCGAGTCGCTTACACGGTTCAATTGCAAGGAATCAAAGTTTCCAACAAGTTACTTCCAATACCGGAATCGGCCTTCGAACCGGACCACACCGGCGCCGGTCAGACCATGGTCGATTCAGGCACACAGTTCACTTTCCTTCTGGGGCAAGCTTACACCGTTCTCCGAGACGAGTTCCTGAACCAAACCGCCGGTTCGCTTCGTGTTTCCGAGGATCCGAATTTCGTTTTCCAAGGAGCCATGGATCTTTGCTACCGAGTTCCAACGAATCAAACCCGTCTCCCGCCGCTGCCGGCGGTGACGCTGGTGTTTCGCGGCGCCGAGATGACAGTTACCGGCGACCGTATTCTGTACAGAGTGGCCGGAGAAATAAGAGGGAACGATACGATTTATTGCTTTACATTCGGGAATTCAGATCTTCTGGGCGTTGAAGCGTACGTGATTGGGCATCTCCATCAACAAAACGTGTGGATGGAATTCGATCTGAAAAAATCGCGGATCGGGTTGGCGGAGATCCGGTGCGATTTAGCGGGTCAGAAATTGGGCATGGGCCTGTAA

Coding sequence (CDS)

ATGGCGTCCTTATGTTCATTTCCACGCATTTCTTCTGCAGATCCCATCAAGCATCTCGCCGCCGCCCCCTTTCCGCCGTCCAATCACCCGATAAGACCATCGGCGTTGTCTCTCCGTCAAAGCAGCCGCAACCAGAAAAGAACTTCTACGATTGTCGCCGCCATCGGAGACGTCTCCGCTGACGGCACCACGTATTTGATCGCCGGCGCCGTCGCTGTGGCTCTCGTTGGTACCGCCTTCCCTATACTCTTCTCTCGCAAAGACCTGTGCCCGGTATGCGACGGCGCAGGGTTTGTCCGGAAGTCGGGGGCGGCGCTGAGGGCGAATGCCGCTCGTAAAGACCAAGCTCAGATCGTGATTCCGCCTGAATCCGTCCCGCGATCTCCCGACAAGCTCCCTTTCCGGCATAACGTCAGCCTTACTGTCTCTCTGACTGTCGGAACGCCGCCGCAGAATGTCACGATGGTCATCGACACTGGCAGCGAACTCTCATGGCTCCACTGTAACAGATCACAAAACTCCTCTTCTTCATCTTCAACATTCAACCCAGTCCAGTCCTCTTCGTACACTCCTATCCCTTGTTCCTCCTCCACGTGTACCGACCAAACAAGGGATTTTCCGATTCCGGCATCGTGTGATTCAAATCAGCTCTGTCACGCTACTCTGTCTTACGCCGACGCCTCTTCTTCAGAAGGAAATCTCGCCGCCGATACGTTTTACATCGGAAATTCCGGTATTCCAAATGTCGTTTTCGGCTGTATGGATTCAATTTTCAGTTCCAACAATGAAGAAGACTCCAAAAACACCGGATTAATGGGTATGAATCGTGGATCTTTGTCTTTTGTTTCTCAAATGGGTTTCCCCAAATTTTCATATTGCATATCGGAATATGATTTTCCCGGTTTGTTATTACTCGGAGATGCCAATTTTTCATGGTTGGCTCCGTTGAATTACACTCCAATGATCCAAATATCCACGCCATTGCCGTATTTCGATCGAGTCGCTTACACGGTTCAATTGCAAGGAATCAAAGTTTCCAACAAGTTACTTCCAATACCGGAATCGGCCTTCGAACCGGACCACACCGGCGCCGGTCAGACCATGGTCGATTCAGGCACACAGTTCACTTTCCTTCTGGGGCAAGCTTACACCGTTCTCCGAGACGAGTTCCTGAACCAAACCGCCGGTTCGCTTCGTGTTTCCGAGGATCCGAATTTCGTTTTCCAAGGAGCCATGGATCTTTGCTACCGAGTTCCAACGAATCAAACCCGTCTCCCGCCGCTGCCGGCGGTGACGCTGGTGTTTCGCGGCGCCGAGATGACAGTTACCGGCGACCGTATTCTGTACAGAGTGGCCGGAGAAATAAGAGGGAACGATACGATTTATTGCTTTACATTCGGGAATTCAGATCTTCTGGGCGTTGAAGCGTACGTGATTGGGCATCTCCATCAACAAAACGTGTGGATGGAATTCGATCTGAAAAAATCGCGGATCGGGTTGGCGGAGATCCGGTGCGATTTAGCGGGTCAGAAATTGGGCATGGGCCTGTAA

Protein sequence

MASLCSFPRISSADPIKHLAAAPFPPSNHPIRPSALSLRQSSRNQKRTSTIVAAIGDVSADGTTYLIAGAVAVALVGTAFPILFSRKDLCPVCDGAGFVRKSGAALRANAARKDQAQIVIPPESVPRSPDKLPFRHNVSLTVSLTVGTPPQNVTMVIDTGSELSWLHCNRSQNSSSSSSTFNPVQSSSYTPIPCSSSTCTDQTRDFPIPASCDSNQLCHATLSYADASSSEGNLAADTFYIGNSGIPNVVFGCMDSIFSSNNEEDSKNTGLMGMNRGSLSFVSQMGFPKFSYCISEYDFPGLLLLGDANFSWLAPLNYTPMIQISTPLPYFDRVAYTVQLQGIKVSNKLLPIPESAFEPDHTGAGQTMVDSGTQFTFLLGQAYTVLRDEFLNQTAGSLRVSEDPNFVFQGAMDLCYRVPTNQTRLPPLPAVTLVFRGAEMTVTGDRILYRVAGEIRGNDTIYCFTFGNSDLLGVEAYVIGHLHQQNVWMEFDLKKSRIGLAEIRCDLAGQKLGMGL
Homology
BLAST of CmoCh06G002870 vs. ExPASy Swiss-Prot
Match: Q9LZL3 (Aspartic proteinase PCS1 OS=Arabidopsis thaliana OX=3702 GN=PCS1 PE=2 SV=1)

HSP 1 Score: 559.7 bits (1441), Expect = 3.5e-158
Identity = 277/394 (70.30%), Postives = 326/394 (82.74%), Query Frame = 0

Query: 127 RSPDKLPFRHNVSLTVSLTVGTPPQNVTMVIDTGSELSWLHCNRSQNSSSSSSTFNPVQS 186
           R  DKL F HNV+LTV+LTVGTPPQN++MVIDTGSELSWL CNRS N +  ++ F+P +S
Sbjct: 60  RPTDKLHFHHNVTLTVTLTVGTPPQNISMVIDTGSELSWLRCNRSSNPNPVNN-FDPTRS 119

Query: 187 SSYTPIPCSSSTCTDQTRDFPIPASCDSNQLCHATLSYADASSSEGNLAADTFYIGNS-G 246
           SSY+PIPCSS TC  +TRDF IPASCDS++LCHATLSYADASSSEGNLAA+ F+ GNS  
Sbjct: 120 SSYSPIPCSSPTCRTRTRDFLIPASCDSDKLCHATLSYADASSSEGNLAAEIFHFGNSTN 179

Query: 247 IPNVVFGCMDSIFSSNNEEDSKNTGLMGMNRGSLSFVSQMGFPKFSYCIS-EYDFPGLLL 306
             N++FGCM S+  S+ EED+K TGL+GMNRGSLSF+SQMGFPKFSYCIS   DFPG LL
Sbjct: 180 DSNLIFGCMGSVSGSDPEEDTKTTGLLGMNRGSLSFISQMGFPKFSYCISGTDDFPGFLL 239

Query: 307 LGDANFSWLAPLNYTPMIQISTPLPYFDRVAYTVQLQGIKVSNKLLPIPESAFEPDHTGA 366
           LGD+NF+WL PLNYTP+I+ISTPLPYFDRVAYTVQL GIKV+ KLLPIP+S   PDHTGA
Sbjct: 240 LGDSNFTWLTPLNYTPLIRISTPLPYFDRVAYTVQLTGIKVNGKLLPIPKSVLVPDHTGA 299

Query: 367 GQTMVDSGTQFTFLLGQAYTVLRDEFLNQTAGSLRVSEDPNFVFQGAMDLCYRVPTNQTR 426
           GQTMVDSGTQFTFLLG  YT LR  FLN+T G L V EDP+FVFQG MDLCYR+   + R
Sbjct: 300 GQTMVDSGTQFTFLLGPVYTALRSHFLNRTNGILTVYEDPDFVFQGTMDLCYRISPVRIR 359

Query: 427 ---LPPLPAVTLVFRGAEMTVTGDRILYRVAGEIRGNDTIYCFTFGNSDLLGVEAYVIGH 486
              L  LP V+LVF GAE+ V+G  +LYRV     GND++YCFTFGNSDL+G+EAYVIGH
Sbjct: 360 SGILHRLPTVSLVFEGAEIAVSGQPLLYRVPHLTVGNDSVYCFTFGNSDLMGMEAYVIGH 419

Query: 487 LHQQNVWMEFDLKKSRIGLAEIRCDLAGQKLGMG 516
            HQQN+W+EFDL++SRIGLA + CD++GQ+LG+G
Sbjct: 420 HHQQNMWIEFDLQRSRIGLAPVECDVSGQRLGIG 452

BLAST of CmoCh06G002870 vs. ExPASy Swiss-Prot
Match: Q766C3 (Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 SV=1)

HSP 1 Score: 167.2 bits (422), Expect = 5.0e-40
Identity = 121/370 (32.70%), Postives = 188/370 (50.81%), Query Frame = 0

Query: 142 VSLTVGTPPQNVTMVIDTGSELSWLHCNR-SQNSSSSSSTFNPVQSSSYTPIPCSSSTCT 201
           ++L++GTP Q  + ++DTGS+L W  C   +Q  + S+  FNP  SSS++ +PCSS  C 
Sbjct: 97  MNLSIGTPAQPFSAIMDTGSDLIWTQCQPCTQCFNQSTPIFNPQGSSSFSTLPCSSQLC- 156

Query: 202 DQTRDFPIPASCDSNQLCHATLSYADASSSEGNLAADTFYIGNSGIPNVVFGCMDSIFSS 261
            Q    P   +C SN  C  T  Y D S ++G++  +T   G+  IPN+ FGC       
Sbjct: 157 -QALSSP---TC-SNNFCQYTYGYGDGSETQGSMGTETLTFGSVSIPNITFGC-----GE 216

Query: 262 NNE--EDSKNTGLMGMNRGSLSFVSQMGFPKFSYCISEY--DFPGLLLLGDANFSWLAPL 321
           NN+        GL+GM RG LS  SQ+   KFSYC++      P  LLLG    S  A  
Sbjct: 217 NNQGFGQGNGAGLVGMGRGPLSLPSQLDVTKFSYCMTPIGSSTPSNLLLGSLANSVTAGS 276

Query: 322 NYTPMIQISTPLPYFDRVAYTVQLQGIKVSNKLLPIPESAFE-PDHTGAGQTMVDSGTQF 381
             T +IQ S+ +P F    Y + L G+ V +  LPI  SAF    + G G  ++DSGT  
Sbjct: 277 PNTTLIQ-SSQIPTF----YYITLNGLSVGSTRLPIDPSAFALNSNNGTGGIIIDSGTTL 336

Query: 382 TFLLGQAYTVLRDEFLNQTAGSLRVSEDPNFVFQGAMDLCYRVPTNQTRLPPLPAVTLVF 441
           T+ +  AY  +R EF++Q   +L V    +  F    DLC++ P++ + L  +P   + F
Sbjct: 337 TYFVNNAYQSVRQEFISQI--NLPVVNGSSSGF----DLCFQTPSDPSNL-QIPTFVMHF 396

Query: 442 RGAEMTVTGDRILYRVAGEIRGNDTIYCFTFGNSDLLGVEAYVIGHLHQQNVWMEFDLKK 501
            G ++ +  +         I  ++ + C   G+S   G+   + G++ QQN+ + +D   
Sbjct: 397 DGGDLELPSENYF------ISPSNGLICLAMGSSS-QGMS--IFGNIQQQNMLVVYDTGN 434

Query: 502 SRIGLAEIRC 506
           S +  A  +C
Sbjct: 457 SVVSFASAQC 434

BLAST of CmoCh06G002870 vs. ExPASy Swiss-Prot
Match: Q766C2 (Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 SV=1)

HSP 1 Score: 159.5 bits (402), Expect = 1.0e-37
Identity = 107/367 (29.16%), Postives = 175/367 (47.68%), Query Frame = 0

Query: 142 VSLTVGTPPQNVTMVIDTGSELSWLHCNR-SQNSSSSSSTFNPVQSSSYTPIPCSSSTCT 201
           +++ +GTP  + + ++DTGS+L W  C   +Q  S  +  FNP  SSS++ +PC S  C 
Sbjct: 98  MNVAIGTPDSSFSAIMDTGSDLIWTQCEPCTQCFSQPTPIFNPQDSSSFSTLPCESQYCQ 157

Query: 202 DQTRDFPIPASCDSNQLCHATLSYADASSSEGNLAADTFYIGNSGIPNVVFGCMDSIFSS 261
           D      +P+   +N  C  T  Y D S+++G +A +TF    S +PN+ FGC +    +
Sbjct: 158 D------LPSETCNNNECQYTYGYGDGSTTQGYMATETFTFETSSVPNIAFGCGE---DN 217

Query: 262 NNEEDSKNTGLMGMNRGSLSFVSQMGFPKFSYCISEY--DFPGLLLLGDANFSWLAPLNY 321
                    GL+GM  G LS  SQ+G  +FSYC++ Y    P  L LG A          
Sbjct: 218 QGFGQGNGAGLIGMGWGPLSLPSQLGVGQFSYCMTSYGSSSPSTLALGSAASGVPEGSPS 277

Query: 322 TPMIQISTPLPYFDRVAYTVQLQGIKVSNKLLPIPESAFEPDHTGAGQTMVDSGTQFTFL 381
           T +I  S    Y     Y + LQGI V    L IP S F+    G G  ++DSGT  T+L
Sbjct: 278 TTLIHSSLNPTY-----YYITLQGITVGGDNLGIPSSTFQLQDDGTGGMIIDSGTTLTYL 337

Query: 382 LGQAYTVLRDEFLNQTAGSLRVSEDPNFVFQGAMDLCYRVPTNQTRLPPLPAVTLVFRGA 441
              AY  +   F +Q   +L   ++ +      +  C++ P++ + +  +P +++ F G 
Sbjct: 338 PQDAYNAVAQAFTDQI--NLPTVDESS----SGLSTCFQQPSDGSTV-QVPEISMQFDGG 397

Query: 442 EMTVTGDRILYRVAGEIRGNDTIYCFTFGNSDLLGVEAYVIGHLHQQNVWMEFDLKKSRI 501
            + +    IL   A      + + C   G+S  LG+   + G++ QQ   + +DL+   +
Sbjct: 398 VLNLGEQNILISPA------EGVICLAMGSSSQLGIS--IFGNIQQQETQVLYDLQNLAV 435

Query: 502 GLAEIRC 506
                +C
Sbjct: 458 SFVPTQC 435

BLAST of CmoCh06G002870 vs. ExPASy Swiss-Prot
Match: O04496 (Aspartyl protease AED3 OS=Arabidopsis thaliana OX=3702 GN=AED3 PE=1 SV=1)

HSP 1 Score: 141.0 bits (354), Expect = 3.9e-32
Identity = 121/393 (30.79%), Postives = 180/393 (45.80%), Query Frame = 0

Query: 122 PESVP-RSPDKLPFRHNVSLTVSLTVGTPPQNVTMVIDTGSELSWLHCNRSQNSSSSSST 181
           P SVP  S ++L   H  +  V   +GTPPQ + MV+DT ++  WL C+     S++S++
Sbjct: 88  PTSVPVASGNQL---HIGNYVVRAKLGTPPQLMFMVLDTSNDAVWLPCSGCSGCSNASTS 147

Query: 182 FNPVQSSSYTPIPCSSSTCTDQTRDFPIPASCDSNQLCHATLSYADASSSEGNLAADTFY 241
           FN   SS+Y+ + CS++ CT Q R    P+S     +C    SY   SS   +L  DT  
Sbjct: 148 FNTNSSSTYSTVSCSTAQCT-QARGLTCPSSSPQPSVCSFNQSYGGDSSFSASLVQDTLT 207

Query: 242 IGNSGIPNVVFGCMDSIFSSNNEEDSKNTGLMGMNRGSLSFVSQ---MGFPKFSYCISEY 301
           +    IPN  FGC++S  +S N    +  GLMG+ RG +S VSQ   +    FSYC+  +
Sbjct: 208 LAPDVIPNFSFGCINS--ASGNSLPPQ--GLMGLGRGPMSLVSQTTSLYSGVFSYCLPSF 267

Query: 302 D---FPGLLLLGDANFSWLAPLNYTPMIQISTPLPYFDRVAYTVQLQGIKVSNKLLPIPE 361
               F G L LG         + YTP+++ +   P      Y V L G+ V +  +P+  
Sbjct: 268 RSFYFSGSLKLG--LLGQPKSIRYTPLLR-NPRRPSL----YYVNLTGVSVGSVQVPVDP 327

Query: 362 SAFEPDHTGAGQTMVDSGTQFTFLLGQAYTVLRDEFLNQTAGSLRVSEDPNFVFQGAMDL 421
                D      T++DSGT  T      Y  +RDEF  Q   S       +F   GA D 
Sbjct: 328 VYLTFDANSGAGTIIDSGTVITRFAQPVYEAIRDEFRKQVNVS-------SFSTLGAFDT 387

Query: 422 CYRVPTNQTRLPPLPAVTLVFRGAEMTVTGDRILYRVAGEIRGNDTIYCFTF-GNSDLLG 481
           C+            P +TL     ++ +  +  L   +       T+ C +  G      
Sbjct: 388 CFSADNENV----APKITLHMTSLDLKLPMENTLIHSSA-----GTLTCLSMAGIRQNAN 447

Query: 482 VEAYVIGHLHQQNVWMEFDLKKSRIGLAEIRCD 507
               VI +L QQN+ + FD+  SRIG+A   C+
Sbjct: 448 AVLNVIANLQQQNLRILFDVPNSRIGIAPEPCN 449

BLAST of CmoCh06G002870 vs. ExPASy Swiss-Prot
Match: Q6F4N5 (Aspartyl protease 25 OS=Oryza sativa subsp. japonica OX=39947 GN=AP25 PE=2 SV=1)

HSP 1 Score: 138.7 bits (348), Expect = 1.9e-31
Identity = 121/385 (31.43%), Postives = 182/385 (47.27%), Query Frame = 0

Query: 139 SLTVSLTVGTPPQNVTMVIDTGSELSWLHCNRSQNSSSSSSTFNPVQSSSYTPIPCSSST 198
           S  V   +G+P Q + + +DT ++ +W HC+    +  SSS F P  SSSY  +PCSSS 
Sbjct: 78  SYVVRAGLGSPSQQLLLALDTSADATWAHCSPC-GTCPSSSLFAPANSSSYASLPCSSSW 137

Query: 199 C-TDQTRDFPIPASCDSNQLCHATL-------SYADASSSEGNLAADTFYIGNSGIPNVV 258
           C   Q +  P P          ATL        +ADA S +  LA+DT  +G   IPN  
Sbjct: 138 CPLFQGQACPAPQGGGDAAPPPATLPTCAFSKPFADA-SFQAALASDTLRLGKDAIPNYT 197

Query: 259 FGCMDSIFSSNNEEDSKNTGLMGMNRGSLSFVSQMGF---PKFSYCISEYD---FPGLLL 318
           FGC+ S+  +    +    GL+G+ RG ++ +SQ G      FSYC+  Y    F G L 
Sbjct: 198 FGCVSSV--TGPTTNMPRQGLLGLGRGPMALLSQAGSLYNGVFSYCLPSYRSYYFSGSLR 257

Query: 319 LGDANFSWLAPLNYTPMIQISTPLPYFDRVAYTVQLQGIKVSNKLLPIPESAFEPD-HTG 378
           LG A       + YTPM++     P+   + Y V + G+ V +  + +P  +F  D  TG
Sbjct: 258 LG-AGGGQPRSVRYTPMLR----NPHRSSL-YYVNVTGLSVGHAWVKVPAGSFAFDAATG 317

Query: 379 AGQTMVDSGTQFTFLLGQAYTVLRDEFLNQTAGSLRVSEDPNFVFQGAMDLCYRVPTNQT 438
           AG T+VDSGT  T      Y  LR+EF  Q      V+    +   GA D C+   T++ 
Sbjct: 318 AG-TVVDSGTVITRWTAPVYAALREEFRRQ------VAAPSGYTSLGAFDTCFN--TDEV 377

Query: 439 RLPPLPAVTL-VFRGAEMTVTGDRILYRVAGEIRGNDTIYCFTFGNS-DLLGVEAYVIGH 498
                PAVT+ +  G ++ +  +  L   +        + C     +   +     VI +
Sbjct: 378 AAGGAPAVTVHMDGGVDLALPMENTLIHSSA-----TPLACLAMAEAPQNVNSVVNVIAN 437

Query: 499 LHQQNVWMEFDLKKSRIGLAEIRCD 507
           L QQN+ + FD+  SR+G A+  C+
Sbjct: 438 LQQQNIRVVFDVANSRVGFAKESCN 438

BLAST of CmoCh06G002870 vs. ExPASy TrEMBL
Match: A0A6J1F3C7 (Laccase OS=Cucurbita moschata OX=3662 GN=LOC111441767 PE=3 SV=1)

HSP 1 Score: 846.7 bits (2186), Expect = 5.3e-242
Identity = 452/600 (75.33%), Postives = 476/600 (79.33%), Query Frame = 0

Query: 1   MASLCSFPRISSADPIKHL-AAAPFPPSNHPIRPSALSLRQSSRNQKRTSTIVAAIGDVS 60
           MASLC+FPRISS +PIK   AAAPFPPSN P+RPSALSLRQSS   +R ST+VAAIGDVS
Sbjct: 36  MASLCTFPRISSTEPIKQTPAAAPFPPSNQPMRPSALSLRQSSSKHRRISTVVAAIGDVS 95

Query: 61  ADGTTYLIAGAVAVALVGTAFPILFSRKDL-CPVCDGAGFVRKSGAALRANAARKDQ--- 120
           ADGTTYLIAGAVAVALVGTAFPILFSRKDL   +  G G + ++ A +      + Q   
Sbjct: 96  ADGTTYLIAGAVAVALVGTAFPILFSRKDLWIFIFAGKGEIYRTVAKILGTLCSRKQKKD 155

Query: 121 ---AQI------------------------------------------------------ 180
              A+I                                                      
Sbjct: 156 NEVARISSLCGASNCPQLFSDSNIPGKKRKMRDYCIAFNSSNHKFLKSLFPFFLCTLFSV 215

Query: 181 ---------------------VIPPESVPRSPDKLPFRHNVSLTVSLTVGTPPQNVTMVI 240
                                VIPPES+ RSPDKLPFRHNVSLTVSLTVGTPPQNVTMVI
Sbjct: 216 FQNLILCSSQNPALLLPLKTQVIPPESIRRSPDKLPFRHNVSLTVSLTVGTPPQNVTMVI 275

Query: 241 DTGSELSWLHCNRSQN-SSSSSSTFNPVQSSSYTPIPCSSSTCTDQTRDFPIPASCDSNQ 300
           DTGSELSWLHCNRSQN SSSSSSTFNP  SSSYTPIPCSSSTCTD+TRDFPIPASCDSN 
Sbjct: 276 DTGSELSWLHCNRSQNSSSSSSSTFNPAGSSSYTPIPCSSSTCTDRTRDFPIPASCDSNH 335

Query: 301 LCHATLSYADASSSEGNLAADTFYIGNSGIPNVVFGCMDSIFSSNNEEDSKNTGLMGMNR 360
           LCHATLSYADASSSEG LA DTFYIGNSGI NVVFGCMDSIFSSNNEEDSKNTGLMGMNR
Sbjct: 336 LCHATLSYADASSSEGTLATDTFYIGNSGISNVVFGCMDSIFSSNNEEDSKNTGLMGMNR 395

Query: 361 GSLSFVSQMGFPKFSYCISEYDFPGLLLLGDANFSWLAPLNYTPMIQISTPLPYFDRVAY 420
           GSLSFVSQMGFPKFSYCISEYDF GLLLLGDANFSWLAPLNYTP+I++STPLPYFDRVAY
Sbjct: 396 GSLSFVSQMGFPKFSYCISEYDFSGLLLLGDANFSWLAPLNYTPLIEMSTPLPYFDRVAY 455

Query: 421 TVQLQGIKVSNKLLPIPESAFEPDHTGAGQTMVDSGTQFTFLLGQAYTVLRDEFLNQTAG 480
           TVQL+GIKVS+KLLPIPES FEPDHTGAGQTMVDSGTQFTFLLG AYT LRDEFLN+TAG
Sbjct: 456 TVQLEGIKVSHKLLPIPESVFEPDHTGAGQTMVDSGTQFTFLLGPAYTALRDEFLNRTAG 515

Query: 481 SLRVSEDPNFVFQGAMDLCYRVPTNQTRLPPLPAVTLVFRGAEMTVTGDRILYRVAGEIR 517
           S RV ED NFVFQGAMDLCYRVP NQTRLPPLP+VTLVFRGAEMTVTGDRILYRV GEIR
Sbjct: 516 SFRVFEDSNFVFQGAMDLCYRVPMNQTRLPPLPSVTLVFRGAEMTVTGDRILYRVPGEIR 575

BLAST of CmoCh06G002870 vs. ExPASy TrEMBL
Match: A0A6J1G928 (aspartic proteinase PCS1-like OS=Cucurbita moschata OX=3662 GN=LOC111452070 PE=3 SV=1)

HSP 1 Score: 806.6 bits (2082), Expect = 6.0e-230
Identity = 398/398 (100.00%), Postives = 398/398 (100.00%), Query Frame = 0

Query: 119 VIPPESVPRSPDKLPFRHNVSLTVSLTVGTPPQNVTMVIDTGSELSWLHCNRSQNSSSSS 178
           VIPPESVPRSPDKLPFRHNVSLTVSLTVGTPPQNVTMVIDTGSELSWLHCNRSQNSSSSS
Sbjct: 52  VIPPESVPRSPDKLPFRHNVSLTVSLTVGTPPQNVTMVIDTGSELSWLHCNRSQNSSSSS 111

Query: 179 STFNPVQSSSYTPIPCSSSTCTDQTRDFPIPASCDSNQLCHATLSYADASSSEGNLAADT 238
           STFNPVQSSSYTPIPCSSSTCTDQTRDFPIPASCDSNQLCHATLSYADASSSEGNLAADT
Sbjct: 112 STFNPVQSSSYTPIPCSSSTCTDQTRDFPIPASCDSNQLCHATLSYADASSSEGNLAADT 171

Query: 239 FYIGNSGIPNVVFGCMDSIFSSNNEEDSKNTGLMGMNRGSLSFVSQMGFPKFSYCISEYD 298
           FYIGNSGIPNVVFGCMDSIFSSNNEEDSKNTGLMGMNRGSLSFVSQMGFPKFSYCISEYD
Sbjct: 172 FYIGNSGIPNVVFGCMDSIFSSNNEEDSKNTGLMGMNRGSLSFVSQMGFPKFSYCISEYD 231

Query: 299 FPGLLLLGDANFSWLAPLNYTPMIQISTPLPYFDRVAYTVQLQGIKVSNKLLPIPESAFE 358
           FPGLLLLGDANFSWLAPLNYTPMIQISTPLPYFDRVAYTVQLQGIKVSNKLLPIPESAFE
Sbjct: 232 FPGLLLLGDANFSWLAPLNYTPMIQISTPLPYFDRVAYTVQLQGIKVSNKLLPIPESAFE 291

Query: 359 PDHTGAGQTMVDSGTQFTFLLGQAYTVLRDEFLNQTAGSLRVSEDPNFVFQGAMDLCYRV 418
           PDHTGAGQTMVDSGTQFTFLLGQAYTVLRDEFLNQTAGSLRVSEDPNFVFQGAMDLCYRV
Sbjct: 292 PDHTGAGQTMVDSGTQFTFLLGQAYTVLRDEFLNQTAGSLRVSEDPNFVFQGAMDLCYRV 351

Query: 419 PTNQTRLPPLPAVTLVFRGAEMTVTGDRILYRVAGEIRGNDTIYCFTFGNSDLLGVEAYV 478
           PTNQTRLPPLPAVTLVFRGAEMTVTGDRILYRVAGEIRGNDTIYCFTFGNSDLLGVEAYV
Sbjct: 352 PTNQTRLPPLPAVTLVFRGAEMTVTGDRILYRVAGEIRGNDTIYCFTFGNSDLLGVEAYV 411

Query: 479 IGHLHQQNVWMEFDLKKSRIGLAEIRCDLAGQKLGMGL 517
           IGHLHQQNVWMEFDLKKSRIGLAEIRCDLAGQKLGMGL
Sbjct: 412 IGHLHQQNVWMEFDLKKSRIGLAEIRCDLAGQKLGMGL 449

BLAST of CmoCh06G002870 vs. ExPASy TrEMBL
Match: A0A6J1I7C0 (aspartic proteinase PCS1-like OS=Cucurbita maxima OX=3661 GN=LOC111470272 PE=3 SV=1)

HSP 1 Score: 789.6 bits (2038), Expect = 7.6e-225
Identity = 390/398 (97.99%), Postives = 392/398 (98.49%), Query Frame = 0

Query: 119 VIPPESVPRSPDKLPFRHNVSLTVSLTVGTPPQNVTMVIDTGSELSWLHCNRSQNSSSSS 178
           VIPPESV RSPDKLPFRHNVSLTVSLTVGTPPQNVTMVIDTGSELSWLHCNRSQNSSSSS
Sbjct: 52  VIPPESVWRSPDKLPFRHNVSLTVSLTVGTPPQNVTMVIDTGSELSWLHCNRSQNSSSSS 111

Query: 179 STFNPVQSSSYTPIPCSSSTCTDQTRDFPIPASCDSNQLCHATLSYADASSSEGNLAADT 238
           STFNPVQSSSYTPIPCSSSTCTDQTRDFPIPASCDSNQLCHATLSYADASSSEGNLAADT
Sbjct: 112 STFNPVQSSSYTPIPCSSSTCTDQTRDFPIPASCDSNQLCHATLSYADASSSEGNLAADT 171

Query: 239 FYIGNSGIPNVVFGCMDSIFSSNNEEDSKNTGLMGMNRGSLSFVSQMGFPKFSYCISEYD 298
           FYIGNS IPNVVFGCMDSIFSSNN EDSKNTGLMGMNRGSLSFVSQMGFPKFSYCISEYD
Sbjct: 172 FYIGNSSIPNVVFGCMDSIFSSNNVEDSKNTGLMGMNRGSLSFVSQMGFPKFSYCISEYD 231

Query: 299 FPGLLLLGDANFSWLAPLNYTPMIQISTPLPYFDRVAYTVQLQGIKVSNKLLPIPESAFE 358
           FPGLLLLGDANFSWLAPLNYTPMIQISTPLPYFDRVAYTVQLQGIKVSNKLLPIPES FE
Sbjct: 232 FPGLLLLGDANFSWLAPLNYTPMIQISTPLPYFDRVAYTVQLQGIKVSNKLLPIPESVFE 291

Query: 359 PDHTGAGQTMVDSGTQFTFLLGQAYTVLRDEFLNQTAGSLRVSEDPNFVFQGAMDLCYRV 418
           PDHTGAGQTMVDSGTQFTFLLGQAYTVLRDEFLNQTAGSLRV EDPNFVFQGAMDLCYRV
Sbjct: 292 PDHTGAGQTMVDSGTQFTFLLGQAYTVLRDEFLNQTAGSLRVFEDPNFVFQGAMDLCYRV 351

Query: 419 PTNQTRLPPLPAVTLVFRGAEMTVTGDRILYRVAGEIRGNDTIYCFTFGNSDLLGVEAYV 478
           PTNQTRLPPLPAVTLVFRGAEMTVTGDRILY+VAGEIRGNDTIYCFTFGN+DLLGVEAYV
Sbjct: 352 PTNQTRLPPLPAVTLVFRGAEMTVTGDRILYKVAGEIRGNDTIYCFTFGNADLLGVEAYV 411

Query: 479 IGHLHQQNVWMEFDLKKSRIGLAEIRCDLAGQKLGMGL 517
           IGHLHQQNVWMEFDLKKSRIGL EIRCDLAGQKLGMGL
Sbjct: 412 IGHLHQQNVWMEFDLKKSRIGLVEIRCDLAGQKLGMGL 449

BLAST of CmoCh06G002870 vs. ExPASy TrEMBL
Match: A0A6J1J2F5 (aspartic proteinase PCS1 OS=Cucurbita maxima OX=3661 GN=LOC111482077 PE=3 SV=1)

HSP 1 Score: 756.5 bits (1952), Expect = 7.2e-215
Identity = 371/398 (93.22%), Postives = 383/398 (96.23%), Query Frame = 0

Query: 119 VIPPESVPRSPDKLPFRHNVSLTVSLTVGTPPQNVTMVIDTGSELSWLHCNRSQNSSSSS 178
           VIPPES+ RSPDKLPFRHNVSLTVSLTVGTPPQNVTMVIDTGSELSWLHCNRSQNSSSSS
Sbjct: 52  VIPPESIRRSPDKLPFRHNVSLTVSLTVGTPPQNVTMVIDTGSELSWLHCNRSQNSSSSS 111

Query: 179 STFNPVQSSSYTPIPCSSSTCTDQTRDFPIPASCDSNQLCHATLSYADASSSEGNLAADT 238
           STFNP  SSSYTPIPCSSSTCTDQTRDFPIPASCDSN LCHATLSYADASSSEG LA DT
Sbjct: 112 STFNPAGSSSYTPIPCSSSTCTDQTRDFPIPASCDSNHLCHATLSYADASSSEGTLATDT 171

Query: 239 FYIGNSGIPNVVFGCMDSIFSSNNEEDSKNTGLMGMNRGSLSFVSQMGFPKFSYCISEYD 298
           FYIGNSGI NVVFGCMDSIFSSNNEEDSKNTGLMGMNRGSLSFVSQMGFPKFSYCISEYD
Sbjct: 172 FYIGNSGISNVVFGCMDSIFSSNNEEDSKNTGLMGMNRGSLSFVSQMGFPKFSYCISEYD 231

Query: 299 FPGLLLLGDANFSWLAPLNYTPMIQISTPLPYFDRVAYTVQLQGIKVSNKLLPIPESAFE 358
           F GLLLLGDANFSWLAPLNYTP+I++STPLPYFDRVAYTVQL+GIKVS+KLLPIPES FE
Sbjct: 232 FSGLLLLGDANFSWLAPLNYTPLIEMSTPLPYFDRVAYTVQLEGIKVSHKLLPIPESVFE 291

Query: 359 PDHTGAGQTMVDSGTQFTFLLGQAYTVLRDEFLNQTAGSLRVSEDPNFVFQGAMDLCYRV 418
           PDHTGAGQTMVDSGTQFTFLLG AYT LRDEFLN+TAGS+RV ED NFVFQGAMDLCYRV
Sbjct: 292 PDHTGAGQTMVDSGTQFTFLLGPAYTALRDEFLNRTAGSIRVFEDSNFVFQGAMDLCYRV 351

Query: 419 PTNQTRLPPLPAVTLVFRGAEMTVTGDRILYRVAGEIRGNDTIYCFTFGNSDLLGVEAYV 478
           P NQTRLPPLP+VTLVFRGAEMTVTGDRILYRV GEIRGND+I+CFTFGNSDLLGVEA+V
Sbjct: 352 PMNQTRLPPLPSVTLVFRGAEMTVTGDRILYRVPGEIRGNDSIHCFTFGNSDLLGVEAFV 411

Query: 479 IGHLHQQNVWMEFDLKKSRIGLAEIRCDLAGQKLGMGL 517
           IGHLHQQNVWMEFDLKKSRIGLAEIRCDLAGQKLGMGL
Sbjct: 412 IGHLHQQNVWMEFDLKKSRIGLAEIRCDLAGQKLGMGL 449

BLAST of CmoCh06G002870 vs. ExPASy TrEMBL
Match: A0A6J1CW88 (aspartic proteinase PCS1 OS=Momordica charantia OX=3673 GN=LOC111014974 PE=3 SV=1)

HSP 1 Score: 753.1 bits (1943), Expect = 7.9e-214
Identity = 372/401 (92.77%), Postives = 388/401 (96.76%), Query Frame = 0

Query: 119 VIPPESVPRSPDKLPFRHNVSLTVSLTVGTPPQNVTMVIDTGSELSWLHCNRSQNSSS-S 178
           VIPPESV RSPDKLPFRHNVSLTVSLTVGTPPQNVTMVIDTGSELSWLHCN+SQNSSS S
Sbjct: 51  VIPPESVRRSPDKLPFRHNVSLTVSLTVGTPPQNVTMVIDTGSELSWLHCNKSQNSSSNS 110

Query: 179 SSTFNPVQSSSYTPIPCSSSTCTDQTRDFPIPASCDSNQLCHATLSYADASSSEGNLAAD 238
           SSTFNP++SSSY+PIPCSSSTCTDQTRDFPIPASCDSNQLCHATLSYADASSSEGNLAAD
Sbjct: 111 SSTFNPIRSSSYSPIPCSSSTCTDQTRDFPIPASCDSNQLCHATLSYADASSSEGNLAAD 170

Query: 239 TFYIGNSGIPNVVFGCMDSIFSSNNEEDSKNTGLMGMNRGSLSFVSQMGFPKFSYCISEY 298
           TFYIGNSGI NVVFGCMDSIFSSNNEED+KNTGLMGMNRGSLSFVSQMGFPKFSYCISEY
Sbjct: 171 TFYIGNSGISNVVFGCMDSIFSSNNEEDAKNTGLMGMNRGSLSFVSQMGFPKFSYCISEY 230

Query: 299 DFPGLLLLGDANFSW--LAPLNYTPMIQISTPLPYFDRVAYTVQLQGIKVSNKLLPIPES 358
           DF GLLLLGDAN+SW  LAPLNYTP+IQ+STPLPYFDRVAYTV L+GIKVSNKLLPIPES
Sbjct: 231 DFSGLLLLGDANYSWLGLAPLNYTPLIQMSTPLPYFDRVAYTVHLEGIKVSNKLLPIPES 290

Query: 359 AFEPDHTGAGQTMVDSGTQFTFLLGQAYTVLRDEFLNQTAGSLRVSEDPNFVFQGAMDLC 418
            FEPDHTGAGQTMVDSGTQFTFLLG AYT LRDEFLNQTAGS+RV +DPNFVFQGAMDLC
Sbjct: 291 VFEPDHTGAGQTMVDSGTQFTFLLGPAYTALRDEFLNQTAGSIRVLDDPNFVFQGAMDLC 350

Query: 419 YRVPTNQTRLPPLPAVTLVFRGAEMTVTGDRILYRVAGEIRGNDTIYCFTFGNSDLLGVE 478
           YRVP NQ+RLPPLPAVTLVFRGAEMTV+GDRILYRV GEIRGND+I+CFTFGNSDLLGVE
Sbjct: 351 YRVPLNQSRLPPLPAVTLVFRGAEMTVSGDRILYRVPGEIRGNDSIHCFTFGNSDLLGVE 410

Query: 479 AYVIGHLHQQNVWMEFDLKKSRIGLAEIRCDLAGQKLGMGL 517
           AYVIGHLHQQNVWMEFDL+KSRIGLAEIRCDLAGQKLGMGL
Sbjct: 411 AYVIGHLHQQNVWMEFDLQKSRIGLAEIRCDLAGQKLGMGL 451

BLAST of CmoCh06G002870 vs. TAIR 10
Match: AT5G02190.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 559.7 bits (1441), Expect = 2.5e-159
Identity = 277/394 (70.30%), Postives = 326/394 (82.74%), Query Frame = 0

Query: 127 RSPDKLPFRHNVSLTVSLTVGTPPQNVTMVIDTGSELSWLHCNRSQNSSSSSSTFNPVQS 186
           R  DKL F HNV+LTV+LTVGTPPQN++MVIDTGSELSWL CNRS N +  ++ F+P +S
Sbjct: 60  RPTDKLHFHHNVTLTVTLTVGTPPQNISMVIDTGSELSWLRCNRSSNPNPVNN-FDPTRS 119

Query: 187 SSYTPIPCSSSTCTDQTRDFPIPASCDSNQLCHATLSYADASSSEGNLAADTFYIGNS-G 246
           SSY+PIPCSS TC  +TRDF IPASCDS++LCHATLSYADASSSEGNLAA+ F+ GNS  
Sbjct: 120 SSYSPIPCSSPTCRTRTRDFLIPASCDSDKLCHATLSYADASSSEGNLAAEIFHFGNSTN 179

Query: 247 IPNVVFGCMDSIFSSNNEEDSKNTGLMGMNRGSLSFVSQMGFPKFSYCIS-EYDFPGLLL 306
             N++FGCM S+  S+ EED+K TGL+GMNRGSLSF+SQMGFPKFSYCIS   DFPG LL
Sbjct: 180 DSNLIFGCMGSVSGSDPEEDTKTTGLLGMNRGSLSFISQMGFPKFSYCISGTDDFPGFLL 239

Query: 307 LGDANFSWLAPLNYTPMIQISTPLPYFDRVAYTVQLQGIKVSNKLLPIPESAFEPDHTGA 366
           LGD+NF+WL PLNYTP+I+ISTPLPYFDRVAYTVQL GIKV+ KLLPIP+S   PDHTGA
Sbjct: 240 LGDSNFTWLTPLNYTPLIRISTPLPYFDRVAYTVQLTGIKVNGKLLPIPKSVLVPDHTGA 299

Query: 367 GQTMVDSGTQFTFLLGQAYTVLRDEFLNQTAGSLRVSEDPNFVFQGAMDLCYRVPTNQTR 426
           GQTMVDSGTQFTFLLG  YT LR  FLN+T G L V EDP+FVFQG MDLCYR+   + R
Sbjct: 300 GQTMVDSGTQFTFLLGPVYTALRSHFLNRTNGILTVYEDPDFVFQGTMDLCYRISPVRIR 359

Query: 427 ---LPPLPAVTLVFRGAEMTVTGDRILYRVAGEIRGNDTIYCFTFGNSDLLGVEAYVIGH 486
              L  LP V+LVF GAE+ V+G  +LYRV     GND++YCFTFGNSDL+G+EAYVIGH
Sbjct: 360 SGILHRLPTVSLVFEGAEIAVSGQPLLYRVPHLTVGNDSVYCFTFGNSDLMGMEAYVIGH 419

Query: 487 LHQQNVWMEFDLKKSRIGLAEIRCDLAGQKLGMG 516
            HQQN+W+EFDL++SRIGLA + CD++GQ+LG+G
Sbjct: 420 HHQQNMWIEFDLQRSRIGLAPVECDVSGQRLGIG 452

BLAST of CmoCh06G002870 vs. TAIR 10
Match: AT2G39710.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 526.6 bits (1355), Expect = 2.3e-149
Identity = 257/391 (65.73%), Postives = 311/391 (79.54%), Query Frame = 0

Query: 128 SPDKLPFRHNVSLTVSLTVGTPPQNVTMVIDTGSELSWLHCNRSQNSSSSSSTFNPVQSS 187
           S DKL FRHNV+LTV+L VG PPQN++MV+DTGSELSWLHC +S N     S FNPV SS
Sbjct: 53  SSDKLSFRHNVTLTVTLAVGDPPQNISMVLDTGSELSWLHCKKSPN---LGSVFNPVSSS 112

Query: 188 SYTPIPCSSSTCTDQTRDFPIPASCD-SNQLCHATLSYADASSSEGNLAADTFYIGNSGI 247
           +Y+P+PCSS  C  +TRD PIPASCD    LCH  +SYADA+S EGNLA +TF IG+   
Sbjct: 113 TYSPVPCSSPICRTRTRDLPIPASCDPKTHLCHVAISYADATSIEGNLAHETFVIGSVTR 172

Query: 248 PNVVFGCMDSIFSSNNEEDSKNTGLMGMNRGSLSFVSQMGFPKFSYCISEYDFPGLLLLG 307
           P  +FGCMDS  SSN+EED+K+TGLMGMNRGSLSFV+Q+GF KFSYCIS  D  G LLLG
Sbjct: 173 PGTLFGCMDSGLSSNSEEDAKSTGLMGMNRGSLSFVNQLGFSKFSYCISGSDSSGFLLLG 232

Query: 308 DANFSWLAPLNYTPMIQISTPLPYFDRVAYTVQLQGIKVSNKLLPIPESAFEPDHTGAGQ 367
           DA++SWL P+ YTP++  STPLPYFDRVAYTVQL+GI+V +K+L +P+S F PDHTGAGQ
Sbjct: 233 DASYSWLGPIQYTPLVLQSTPLPYFDRVAYTVQLEGIRVGSKILSLPKSVFVPDHTGAGQ 292

Query: 368 TMVDSGTQFTFLLGQAYTVLRDEFLNQTAGSLRVSEDPNFVFQGAMDLCYRV-PTNQTRL 427
           TMVDSGTQFTFL+G  YT L++EF+ QT   LR+ +DP+FVFQG MDLCY+V  T +   
Sbjct: 293 TMVDSGTQFTFLMGPVYTALKNEFITQTKSVLRLVDDPDFVFQGTMDLCYKVGSTTRPNF 352

Query: 428 PPLPAVTLVFRGAEMTVTGDRILYRVAGE-IRGNDTIYCFTFGNSDLLGVEAYVIGHLHQ 487
             LP V+L+FRGAEM+V+G ++LYRV G    G + +YCFTFGNSDLLG+EA+VIGH HQ
Sbjct: 353 SGLPMVSLMFRGAEMSVSGQKLLYRVNGAGSEGKEEVYCFTFGNSDLLGIEAFVIGHHHQ 412

Query: 488 QNVWMEFDLKKSRIGLA-EIRCDLAGQKLGM 515
           QNVWMEFDL KSR+G A  +RCDLA Q+LG+
Sbjct: 413 QNVWMEFDLAKSRVGFAGNVRCDLASQRLGL 440

BLAST of CmoCh06G002870 vs. TAIR 10
Match: AT1G66180.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 251.1 bits (640), Expect = 1.9e-66
Identity = 150/398 (37.69%), Postives = 219/398 (55.03%), Query Frame = 0

Query: 121 PPESVPRSPDKLPFRHNVSLTVSLTVGTPPQNVTMVIDTGSELSWLHCNRSQNSSSSSST 180
           P  S P    +  F+++++L +SL +GTPPQ   MV+DTGS+LSW+ C+R +      ++
Sbjct: 53  PSPSSPPYNFRSRFKYSMALIISLPIGTPPQAQQMVLDTGSQLSWIQCHRKKLPPKPKTS 112

Query: 181 FNPVQSSSYTPIPCSSSTCTDQTRDFPIPASCDSNQLCHATLSYADASSSEGNLAADTFY 240
           F+P  SSS++ +PCS   C  +  DF +P SCDSN+LCH +  YAD + +EGNL  +   
Sbjct: 113 FDPSLSSSFSTLPCSHPLCKPRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKIT 172

Query: 241 IGNSGI-PNVVFGCMDSIFSSNNEEDSKNTGLMGMNRGSLSFVSQMGFPKFSYCISE--- 300
             N+ I P ++ GC          E S + G++GMNRG LSFVSQ    KFSYCI     
Sbjct: 173 FSNTEITPPLILGCA--------TESSDDRGILGMNRGRLSFVSQAKISKFSYCIPPKSN 232

Query: 301 ---YDFPGLLLLGD----ANFSWLAPLNYTPMIQISTPLPYFDRVAYTVQLQGIKVSNKL 360
              +   G   LGD      F +++ L +      S  +P  D +AYTV + GI+   K 
Sbjct: 233 RPGFTPTGSFYLGDNPNSHGFKYVSLLTFPE----SQRMPNLDPLAYTVPMIGIRFGLKK 292

Query: 361 LPIPESAFEPDHTGAGQTMVDSGTQFTFLLGQAYTVLRDEFLNQTAGSLRVSEDPNFVFQ 420
           L I  S F PD  G+GQTMVDSG++FT L+  AY  +R E + +    L+      +V+ 
Sbjct: 293 LNISGSVFRPDAGGSGQTMVDSGSEFTHLVDAAYDKVRAEIMTRVGRRLK----KGYVYG 352

Query: 421 GAMDLCYRVPTNQTRLPPL--PAVTLVFRGAEMTVTGDRILYRVAGEIRGNDTIYCFTFG 480
           G  D+C+    N   +P L    V +  RG E+ V  +R+L  V G       I+C   G
Sbjct: 353 GTADMCF--DGNVAMIPRLIGDLVFVFTRGVEILVPKERVLVNVGG------GIHCVGIG 412

Query: 481 NSDLLGVEAYVIGHLHQQNVWMEFDLKKSRIGLAEIRC 506
            S +LG  + +IG++HQQN+W+EFD+   R+G A+  C
Sbjct: 413 RSSMLGAASNIIGNVHQQNLWVEFDVTNRRVGFAKADC 426

BLAST of CmoCh06G002870 vs. TAIR 10
Match: AT5G37540.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 236.5 bits (602), Expect = 4.8e-62
Identity = 150/405 (37.04%), Postives = 220/405 (54.32%), Query Frame = 0

Query: 124 SVPRSPDKLPFRHNV----SLTVSLTVGTPPQNVTMVIDTGSELSWLHCNRSQNSS---S 183
           S P SP    FR N+    +L +SL +GTP Q+  +V+DTGS+LSW+ C+  +       
Sbjct: 62  SPPSSP--YTFRSNIKYSMALILSLPIGTPSQSQELVLDTGSQLSWIQCHPKKIKKPLPP 121

Query: 184 SSSTFNPVQSSSYTPIPCSSSTCTDQTRDFPIPASCDSNQLCHATLSYADASSSEGNLAA 243
            +++F+P  SSS++ +PCS   C  +  DF +P SCDSN+LCH +  YAD + +EGNL  
Sbjct: 122 PTTSFDPSLSSSFSDLPCSHPLCKPRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVK 181

Query: 244 DTFYIGNS-GIPNVVFGCMDSIFSSNNEEDSKNTGLMGMNRGSLSFVSQMGFPKFSYCI- 303
           + F   NS   P ++ GC         +E +   G++GMN G LSF+SQ    KFSYCI 
Sbjct: 182 EKFTFSNSQTTPPLILGCA--------KESTDEKGILGMNLGRLSFISQAKISKFSYCIP 241

Query: 304 SEYDFPGL-----LLLGD----ANFSWLAPLNYTPMIQISTPLPYFDRVAYTVQLQGIKV 363
           +  + PGL       LGD      F +++ L +      S  +P  D +AYTV LQGI++
Sbjct: 242 TRSNRPGLASTGSFYLGDNPNSRGFKYVSLLTFPQ----SQRMPNLDPLAYTVPLQGIRI 301

Query: 364 SNKLLPIPESAFEPDHTGAGQTMVDSGTQFTFLLGQAYTVLRDEFLNQTAGSLRVSEDPN 423
             K L IP S F PD  G+GQTMVDSG++FT L+  AY  +++E +      L+      
Sbjct: 302 GQKRLNIPGSVFRPDAGGSGQTMVDSGSEFTHLVDVAYDKVKEEIVRLVGSRLK----KG 361

Query: 424 FVFQGAMDLCYRVPTNQTRLPPLPAVTLVF---RGAEMTVTGDRILYRVAGEIRGNDTIY 483
           +V+    D+C+    N +         LVF   RG E+ V    +L  V G       I+
Sbjct: 362 YVYGSTADMCF--DGNHSMEIGRLIGDLVFEFGRGVEILVEKQSLLVNVGG------GIH 421

Query: 484 CFTFGNSDLLGVEAYVIGHLHQQNVWMEFDLKKSRIGLAEIRCDL 508
           C   G S +LG  + +IG++HQQN+W+EFD+   R+G ++  C L
Sbjct: 422 CVGIGRSSMLGAASNIIGNVHQQNLWVEFDVTNRRVGFSKAECRL 440

BLAST of CmoCh06G002870 vs. TAIR 10
Match: AT2G03200.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 144.8 bits (364), Expect = 1.9e-34
Identity = 114/380 (30.00%), Postives = 187/380 (49.21%), Query Frame = 0

Query: 142 VSLTVGTPPQNVTMVIDTGSELSWLHCNR-SQNSSSSSSTFNPVQSSSYTPIPCSSSTCT 201
           + L++G P    + ++DTGS+L W  C   ++     +  F+P +SSSY+ + CSS  C 
Sbjct: 109 MELSIGNPAVKYSAIVDTGSDLIWTQCKPCTECFDQPTPIFDPEKSSSYSKVGCSSGLCN 168

Query: 202 DQTRDFPIPASCDSNQ-LCHATLSYADASSSEGNLAADTF-YIGNSGIPNVVFGCMDSIF 261
              R     ++C+ ++  C    +Y D SS+ G LA +TF +   + I  + FGC     
Sbjct: 169 ALPR-----SNCNEDKDACEYLYTYGDYSSTRGLLATETFTFEDENSISGIGFGC----- 228

Query: 262 SSNNEED--SKNTGLMGMNRGSLSFVSQMGFPKFSYCISEYD--------FPGLLLLGDA 321
              NE D  S+ +GL+G+ RG LS +SQ+   KFSYC++  +        F G L  G  
Sbjct: 229 GVENEGDGFSQGSGLVGLGRGPLSLISQLKETKFSYCLTSIEDSEASSSLFIGSLASGIV 288

Query: 322 N---FSWLAPLNYTPMIQISTPLPYFDRVAYTVQLQGIKVSNKLLPIPESAFEPDHTGAG 381
           N    S    +  T  +  +   P F    Y ++LQGI V  K L + +S FE    G G
Sbjct: 289 NKTGASLDGEVTKTMSLLRNPDQPSF----YYLELQGITVGAKRLSVEKSTFELAEDGTG 348

Query: 382 QTMVDSGTQFTFLLGQAYTVLRDEFLNQTAGSLRVSEDPNFVFQGAMDLCYRVPTNQTRL 441
             ++DSGT  T+L   A+ VL++EF ++   SL V +  +      +DLC+++P     +
Sbjct: 349 GMIIDSGTTITYLEETAFKVLKEEFTSRM--SLPVDDSGS----TGLDLCFKLPDAAKNI 408

Query: 442 PPLPAVTLVFRGAEMTVTGDRILYRVAGEIRGNDTIYCFTFGNSDLLGVEAYVIGHLHQQ 501
             +P +   F+GA++ + G+   Y VA    G   + C   G+S+ +     + G++ QQ
Sbjct: 409 -AVPKMIFHFKGADLELPGEN--YMVADSSTG---VLCLAMGSSNGMS----IFGNVQQQ 458

Query: 502 NVWMEFDLKKSRIGLAEIRC 506
           N  +  DL+K  +      C
Sbjct: 469 NFNVLHDLEKETVSFVPTEC 458

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q9LZL33.5e-15870.30Aspartic proteinase PCS1 OS=Arabidopsis thaliana OX=3702 GN=PCS1 PE=2 SV=1[more]
Q766C35.0e-4032.70Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 S... [more]
Q766C21.0e-3729.16Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 S... [more]
O044963.9e-3230.79Aspartyl protease AED3 OS=Arabidopsis thaliana OX=3702 GN=AED3 PE=1 SV=1[more]
Q6F4N51.9e-3131.43Aspartyl protease 25 OS=Oryza sativa subsp. japonica OX=39947 GN=AP25 PE=2 SV=1[more]
Match NameE-valueIdentityDescription
A0A6J1F3C75.3e-24275.33Laccase OS=Cucurbita moschata OX=3662 GN=LOC111441767 PE=3 SV=1[more]
A0A6J1G9286.0e-230100.00aspartic proteinase PCS1-like OS=Cucurbita moschata OX=3662 GN=LOC111452070 PE=3... [more]
A0A6J1I7C07.6e-22597.99aspartic proteinase PCS1-like OS=Cucurbita maxima OX=3661 GN=LOC111470272 PE=3 S... [more]
A0A6J1J2F57.2e-21593.22aspartic proteinase PCS1 OS=Cucurbita maxima OX=3661 GN=LOC111482077 PE=3 SV=1[more]
A0A6J1CW887.9e-21492.77aspartic proteinase PCS1 OS=Momordica charantia OX=3673 GN=LOC111014974 PE=3 SV=... [more]
Match NameE-valueIdentityDescription
AT5G02190.12.5e-15970.30Eukaryotic aspartyl protease family protein [more]
AT2G39710.12.3e-14965.73Eukaryotic aspartyl protease family protein [more]
AT1G66180.11.9e-6637.69Eukaryotic aspartyl protease family protein [more]
AT5G37540.14.8e-6237.04Eukaryotic aspartyl protease family protein [more]
AT2G03200.11.9e-3430.00Eukaryotic aspartyl protease family protein [more]
InterPro
Analysis Name: InterPro Annotations of Cucurbita moschata (Rifu) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR001461Aspartic peptidase A1 familyPRINTSPR00792PEPSINcoord: 367..378
score: 32.91
coord: 477..492
score: 18.07
coord: 146..166
score: 46.19
IPR001461Aspartic peptidase A1 familyPANTHERPTHR47965ASPARTYL PROTEASE-RELATEDcoord: 124..514
IPR032861Xylanase inhibitor, N-terminalPFAMPF14543TAXi_Ncoord: 142..307
e-value: 1.9E-39
score: 135.7
IPR021109Aspartic peptidase domain superfamilyGENE3D2.40.70.10Acid Proteasescoord: 132..307
e-value: 7.1E-37
score: 129.2
IPR021109Aspartic peptidase domain superfamilyGENE3D2.40.70.10Acid Proteasescoord: 311..508
e-value: 5.8E-44
score: 151.9
IPR021109Aspartic peptidase domain superfamilySUPERFAMILY50630Acid proteasescoord: 141..505
IPR032799Xylanase inhibitor, C-terminalPFAMPF14541TAXi_Ccoord: 335..500
e-value: 1.5E-36
score: 125.6
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 174..194
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 23..43
NoneNo IPR availablePANTHERPTHR47965:SF64ASPARTIC PROTEINASE PCS1coord: 124..514
IPR001969Aspartic peptidase, active sitePROSITEPS00141ASP_PROTEASEcoord: 155..166
IPR033121Peptidase family A1 domainPROSITEPS51767PEPTIDASE_A1coord: 140..501
score: 35.977531
IPR034161Pepsin-like domain, plantCDDcd05476pepsin_A_like_plantcoord: 141..505
e-value: 6.19016E-71
score: 225.605

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CmoCh06G002870.1CmoCh06G002870.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0046274 lignin catabolic process
biological_process GO:0012501 programmed cell death
biological_process GO:0006508 proteolysis
cellular_component GO:0048046 apoplast
cellular_component GO:0005783 endoplasmic reticulum
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0004190 aspartic-type endopeptidase activity
molecular_function GO:0005507 copper ion binding
molecular_function GO:0052716 hydroquinone:oxygen oxidoreductase activity