Cp4.1LG10g10370 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
NameCp4.1LG10g10370
Typegene
OrganismCucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
Descriptionaspartyl protease family protein 1-like
LocationCp4.1LG10: 3960422 .. 3964237 (-)
RNA-Seq ExpressionCp4.1LG10g10370
SyntenyCp4.1LG10g10370
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCGTGGGCGTTCAGTTTCGGCGCCCAGATGTTACTGGTTTTTTCTGTTTTCTTTCTCTCCGGCGGGCTGAGAAGTGGTGATGCATCTTCGTTTAAGTTCAGTATTCACCATCGATTTTCGGATGCGGTTAAGGGGATTATCGACTCTGAAGGCTTGCCGGAGAAACATTCTCCTGAATATTATGCGACTTTGGTCCATCGTGATCGGTTAGTACATGGCCGGCGATTGGCCGCTAGTAATGGTAGTAAGGAGCTGACGTTCGCTGGTGGTAACGCTACCTACTTAATGGTCAATGCTGGATTGTATGCTCCTTATTCTCTCCGTTCATTTCCCTAATCTCTTCTCTCTATTTCTGTTTTTGAGTTATTTGAATCCTCTGTTGTTCTGATTTTTCTCTTTGAGATTATCTTTGCTTAAATTGCTGTCCTGTTCCCTTCCGTTTGGGGCTTTTCAGTTTGTTGTACGCCAATATTTCGATCGGAACGCCTGCGCTAGATTTCTTTGTGGCGTTGGACACCGGAAGCAATTTGTTCTGGTTACCGTGTGAATGCCGCAGTTGTCCTACTTCAACGCCTAGTGGGAAGGTATTTTCTCGCGATGCAAATTCTTGAAGTTGATGCTCTGATGATATCGATTGGTTGATTATTGTTATTGTTTGTTTTTTTTTTATGTTTCTCTACAGGTTCCGTTCAATCATTATAGTCCAAATGCTTCAAAAACGAGCTCAACTGTTCCCTGCAGCAACGCATTGTGCGAGCTTTCAAACAAATGCACTTCAAACCAAAATACTTGTCCTTATAAAGTTAAATACGGGGCTGGCAATGAATCGTCGACTGGGTACTTGGTAGAGGACGTAGTGTCCTTGATCACTGATGATTCACAACTTAAACCCGTTAAGGCGAAGATTACTTTTGGGTGAGTTCTGCTGCATCTCTCTCTCCCTTTATCTTAATCCTCTGAGTTATATATTTTCTTGTGCTGAATTTGTTCTGTTCTTAATAGATTTCTTTTCTCTTAACTGGAAAAATGTATCTACTACATCAGGACTATGTAGATTTCTGAATCCAATCGCACACTAGTTAGAGTCAGCTTTTAGTTTGATGTAGTGGGCAATGTTAGGAATCACGACTCTCCACAATGATATGATATTGTCCACTTTGAACATTAGCTCTCATGACTTTGCTTTTGGTTTCTCCGGAAGGCCTAATACCAATGAAGATGTATTACTTAATTAACAGATGTGGGAGTCTTCTCCCGACAATCCTCAACCAGCACTAACCAATTATATTCTACTTTGTAACTCAACCACTGTTCCAAGAAGATATTGGGGTTCTTCTTCGTGGGTGTTTAATGATTTTCGAGCAGTTTGCAGTGATATTGAATGTTGACTTTACCATCTTATAGAAAGCACATACAATTTGTTGTGTTGTTGCGCTTAAATGTGATTCTTCATGGCAGGTGCGGTAAGGTCCAGACTGGTAGATTTGCAAGACATGCAGCTCCCAATGGTCTTATTGGGCTTGGCATGGAAAGGATATCAGTTCCAAGTTTCTTAGCTAACCAAGGTCTTATCTCTGATTCATTCTCCATGTGTTTCGGACATGATGAACTTGGGAGAATCGATTTTGGGGACACAGGCACACCAGACCAAAAAGAAACACCCATCAATTCAAATCCCAGCTTGTTAGAACTTTTCACTTCTAAACCATATTCAGAATGTTAAAATGTCCGGTTTTTCTAACTGGATTGTCTTGTTCATTTTGGCAGTCCACACTATAATGTCACCATCACTCAGATAATTGTGGGAGAAAAAACCAACGATGTTCAATTTACTGCAATTTTCGACGGTGGTGCCTCGTTTACACACCTAGCTGAACCAGTTTACTCTCTTATTTCCGAGCAAGTAAGCTATCTGCAGTTTCAACTCTTGGAAATTCCATATAATTGTCAGGAATCACGACATATTATCCACTTTCAGCCTAAGCTCTTATGACTTTGATTTGGGCTTCCTCAAAAGGCCTCGTACCAATTTTGGATGTATTTTCTTACTTATGAATTCATGATTATTCCCTAAATTTGTCCACTATTATACAGATAGATTCAGGGATCCAGTTAAAGCGCTTTTCATTTGGTCCGGATTTCGCATTTGAGTACTGCTACGAATCTCCGTCGTAAGTAGCTAATGAGATATATAATGAAACCAGTTATTCTCTTTTAAAGCATTCCTGGATTCTTATTCATACTCGGTTTATATTCAGTTTATATGCAGAAACCATCGATTGTCCGGTGGTTAATTTTACGATGAAGGGTGGAGATGATTTTATCCCTCTGGGTCAGTTCCTTACCCTTTCGATTGATGTGAGTACTCTGAACATTTGAAATAATTTTCAGCTTTTATAGCTCTCTCACTCATCTCTGCAATCTTTTGAGCTTTTGCTGATCTGGGTTTTTCTTTTTTGGGCTTCAGGATGCGGATACTAGACGTGCTTTTTGTCTAACGATTGTCAAAAGCACTAGTATTAATATAATTGGAAGTGAGACCTCTATCTCCCTCTCTATGCATCTAATTCACGAAGCTATTCTATCTTTACTTGTTTTAGTAGCTGTGGTTTTTGTTAGGAATCACGAATCTACACAATGATATGATATTGTCTACTTTGAGCATAAGCTGACTTTGATTTGGACTTCCCAAAAGACTTTGTATCAATAGAGATAGTATTTCTCACTTATTATAAACAAGCAAAGCTACCAGAGCTTATGCTCAAAGTGGACAATATCTGTTGAGGATTATTGGAAGTGAGTCGCACATAGGTTAATTTCATGGAAGATCATGAGTTTGTAAATGAGGAATACTATCTCCATTGGTTGAGGCCTTTGGTGAAGTTCAAAGCAAAGCCATGAGAGCTTATGTTCAAAGTGGACAATATCATACCATTGTGGAGAGTCATGATTACTAACATGTTATCAGAGTCATGACCCTAAACTTAGCCCGGAAATAAAATCCTCAAATATCAAACAAAGAATTGTAAGTCTCGAAGGTGTAGTCAATAGTGACTAAAGTGTCAAAGAAATGGTGTACTTTGTTCGAGGGCTCCAAAGAAAAGAGTCGAGCCTCGATTAAGGGGAGGCTATTCGAGAGTCCCATAAGCCTCAGGGGAGGCTTATAGTGTACTTTGTTTGAGAGGAGGATTGTTGAGGATTATTGGGAGTGAATCTCACCTTGGTTAATTTCATGGAAGATCATGAGTTTATAAGTAGGAATACTATCTCCATTGGTTGAGGTCTGCTGGTAAACCCAAAACAAAGCCATGAAAGCTTATGAAAACTTAGGCTCAAAGTGGACAATATCATACCATTATGTAGATTCGTGATTTCTAATAGTTTTCACGGTCGAAGAAGCGCTTGTGATCTCGGAGACTTTTCTTTTTTGCAGTGGACTTCATGGCTGGTTATCGTATTGTCTTCAATCGTGAAAAGATGGTGTTGGGCTGGAGTCCTTCAGACTGTGAGTGTCACTTCACCTCAAAACTCTTCGTTTTTATGGTTTCTTTTGTTTGCTGTGTCAGTGAAGCTTGTTTTCTTAAACCAGGTTACGACAATGGCGCTGGCACTCCCTACGGCTATACCACTCCATTTGACTCCCCTCCGACCGACAAATCTCCACCGTCCGATGATTCTCCTCCGGCACCTGTTACCCCAGGAGGAAGCACCGGCTTGCCGAATATTGAGGTGGGTGTTGCAACGCGGTTGAACCCACTGACCTCGGTCGTCGTCGCCGTTCTTGCAATCTTGGCTGTTGTTGGACTATCATAA

mRNA sequence

ATGGCTTTCGGCGCCCAGATGTTACTGGTTTTTTCTGTTTTCTTTCTCTCCGGCGGGCTGAGAAGTGGTGATGCATCTTCGTTTAAGTTCAGTATTCACCATCGATTTTCGGATGCGGTTAAGGGGATTATCGACTCTGAAGGCTTGCCGGAGAAACATTCTCCTGAATATTATGCGACTTTGGTCCATCGTGATCGGTTAGTACATGGCCGGCGATTGGCCGCTAGTAATGGTAGTAAGGAGCTGACGTTCGCTGGTGGTAACGCTACCTACTTAATGGTCAATGCTGGATTTTTGTTGTACGCCAATATTTCGATCGGAACGCCTGCGCTAGATTTCTTTGTGGCGTTGGACACCGGAAGCAATTTGTTCTGGTTACCGTGTGAATGCCGCAGTTGTCCTACTTCAACGCCTAGTGGGAAGGTTCCGTTCAATCATTATAGTCCAAATGCTTCAAAAACGAGCTCAACTGTTCCCTGCAGCAACGCATTGTGCGAGCTTTCAAACAAATGCACTTCAAACCAAAATACTTGTCCTTATAAAGTTAAATACGGGGCTGGCAATGAATCGTCGACTGGGTACTTGGTAGAGGACGTAGTGTCCTTGATCACTGATGATTCACAACTTAAACCCGTTAAGGCGAAGATTACTTTTGGGTGCGGTAAGGTCCAGACTGGTAGATTTGCAAGACATGCAGCTCCCAATGGTCTTATTGGGCTTGGCATGGAAAGGATATCAGTTCCAAGTTTCTTAGCTAACCAAGGTCTTATCTCTGATTCATTCTCCATGTGTTTCGGACATGATGAACTTGGGAGAATCGATTTTGGGGACACAGGCACACCAGACCAAAAAGAAACACCCATCAATTCAAATCCCAGCTTTCCACACTATAATGTCACCATCACTCAGATAATTGTGGGAGAAAAAACCAACGATGTTCAATTTACTGCAATTTTCGACGGTGGTGCCTCGTTTACACACCTAGCTGAACCACATTCCTGGATTCTTATTCATACTCGGTTTATATTCAGTTTATATGCAGAAACCATCGATTGTCCGGTGGTTAATTTTACGATGAAGGGTGGAGATGATTTTATCCCTCTGGGTTACGACAATGGCGCTGGCACTCCCTACGGCTATACCACTCCATTTGACTCCCCTCCGACCGACAAATCTCCACCGTCCGATGATTCTCCTCCGGCACCTGTTACCCCAGGAGGAAGCACCGGCTTGCCGAATATTGAGGTGGGTGTTGCAACGCGGTTGAACCCACTGACCTCGGTCGTCGTCGCCGTTCTTGCAATCTTGGCTGTTGTTGGACTATCATAA

Coding sequence (CDS)

ATGGCTTTCGGCGCCCAGATGTTACTGGTTTTTTCTGTTTTCTTTCTCTCCGGCGGGCTGAGAAGTGGTGATGCATCTTCGTTTAAGTTCAGTATTCACCATCGATTTTCGGATGCGGTTAAGGGGATTATCGACTCTGAAGGCTTGCCGGAGAAACATTCTCCTGAATATTATGCGACTTTGGTCCATCGTGATCGGTTAGTACATGGCCGGCGATTGGCCGCTAGTAATGGTAGTAAGGAGCTGACGTTCGCTGGTGGTAACGCTACCTACTTAATGGTCAATGCTGGATTTTTGTTGTACGCCAATATTTCGATCGGAACGCCTGCGCTAGATTTCTTTGTGGCGTTGGACACCGGAAGCAATTTGTTCTGGTTACCGTGTGAATGCCGCAGTTGTCCTACTTCAACGCCTAGTGGGAAGGTTCCGTTCAATCATTATAGTCCAAATGCTTCAAAAACGAGCTCAACTGTTCCCTGCAGCAACGCATTGTGCGAGCTTTCAAACAAATGCACTTCAAACCAAAATACTTGTCCTTATAAAGTTAAATACGGGGCTGGCAATGAATCGTCGACTGGGTACTTGGTAGAGGACGTAGTGTCCTTGATCACTGATGATTCACAACTTAAACCCGTTAAGGCGAAGATTACTTTTGGGTGCGGTAAGGTCCAGACTGGTAGATTTGCAAGACATGCAGCTCCCAATGGTCTTATTGGGCTTGGCATGGAAAGGATATCAGTTCCAAGTTTCTTAGCTAACCAAGGTCTTATCTCTGATTCATTCTCCATGTGTTTCGGACATGATGAACTTGGGAGAATCGATTTTGGGGACACAGGCACACCAGACCAAAAAGAAACACCCATCAATTCAAATCCCAGCTTTCCACACTATAATGTCACCATCACTCAGATAATTGTGGGAGAAAAAACCAACGATGTTCAATTTACTGCAATTTTCGACGGTGGTGCCTCGTTTACACACCTAGCTGAACCACATTCCTGGATTCTTATTCATACTCGGTTTATATTCAGTTTATATGCAGAAACCATCGATTGTCCGGTGGTTAATTTTACGATGAAGGGTGGAGATGATTTTATCCCTCTGGGTTACGACAATGGCGCTGGCACTCCCTACGGCTATACCACTCCATTTGACTCCCCTCCGACCGACAAATCTCCACCGTCCGATGATTCTCCTCCGGCACCTGTTACCCCAGGAGGAAGCACCGGCTTGCCGAATATTGAGGTGGGTGTTGCAACGCGGTTGAACCCACTGACCTCGGTCGTCGTCGCCGTTCTTGCAATCTTGGCTGTTGTTGGACTATCATAA

Protein sequence

MAFGAQMLLVFSVFFLSGGLRSGDASSFKFSIHHRFSDAVKGIIDSEGLPEKHSPEYYATLVHRDRLVHGRRLAASNGSKELTFAGGNATYLMVNAGFLLYANISIGTPALDFFVALDTGSNLFWLPCECRSCPTSTPSGKVPFNHYSPNASKTSSTVPCSNALCELSNKCTSNQNTCPYKVKYGAGNESSTGYLVEDVVSLITDDSQLKPVKAKITFGCGKVQTGRFARHAAPNGLIGLGMERISVPSFLANQGLISDSFSMCFGHDELGRIDFGDTGTPDQKETPINSNPSFPHYNVTITQIIVGEKTNDVQFTAIFDGGASFTHLAEPHSWILIHTRFIFSLYAETIDCPVVNFTMKGGDDFIPLGYDNGAGTPYGYTTPFDSPPTDKSPPSDDSPPAPVTPGGSTGLPNIEVGVATRLNPLTSVVVAVLAILAVVGLS
Homology
BLAST of Cp4.1LG10g10370 vs. ExPASy Swiss-Prot
Match: Q8VYV9 (Aspartyl protease family protein 1 OS=Arabidopsis thaliana OX=3702 GN=APF1 PE=2 SV=1)

HSP 1 Score: 336.7 bits (862), Expect = 4.1e-91
Identity = 172/363 (47.38%), Postives = 232/363 (63.91%), Query Frame = 0

Query: 28  FKFSIHHRFSDAVKGIIDSEGLPEKHSPEYYATLVHRDRLVHGRRLAASNGSKELTFAGG 87
           F F  HHRFSD V G++  +GLP + S +YY  + HRDRL+ GRRLA  + S  +TF+ G
Sbjct: 33  FGFEFHHRFSDQVVGVLPGDGLPNRDSSKYYRVMAHRDRLIRGRRLANEDQSL-VTFSDG 92

Query: 88  NATYLMVNAGFLLYANISIGTPALDFFVALDTGSNLFWLPCECRSC--PTSTPSG-KVPF 147
           N T  +   GFL YAN+++GTP+  F VALDTGS+LFWLPC+C +C      P G  +  
Sbjct: 93  NETVRVDALGFLHYANVTVGTPSDWFMVALDTGSDLFWLPCDCTNCVRELKAPGGSSLDL 152

Query: 148 NHYSPNASKTSSTVPCSNALCELSNKCTSNQNTCPYKVKYGAGNESSTGYLVEDVVSLIT 207
           N YSPNAS TS+ VPC++ LC   ++C S ++ CPY+++Y +   SSTG LVEDV+ L++
Sbjct: 153 NIYSPNASSTSTKVPCNSTLCTRGDRCASPESDCPYQIRYLSNGTSSTGVLVEDVLHLVS 212

Query: 208 DDSQLKPVKAKITFGCGKVQTGRFARHAAPNGLIGLGMERISVPSFLANQGLISDSFSMC 267
           +D   K + A++TFGCG+VQTG F   AAPNGL GLG+E ISVPS LA +G+ ++SFSMC
Sbjct: 213 NDKSSKAIPARVTFGCGQVQTGVFHDGAAPNGLFGLGLEDISVPSVLAKEGIAANSFSMC 272

Query: 268 FGHDELGRIDFGDTGTPDQKETPINSNPSFPHYNVTITQIIVGEKTNDVQFTAIFDGGAS 327
           FG+D  GRI FGD G+ DQ+ETP+N     P YN+T+T+I VG  T D++F A+FD G S
Sbjct: 273 FGNDGAGRISFGDKGSVDQRETPLNIRQPHPTYNITVTKISVGGNTGDLEFDAVFDSGTS 332

Query: 328 FTHLAEPHSWILIHTRF----------------------IFSLYAETIDCPVVNFTMKGG 366
           FT+L +  ++ LI   F                        S   ++   P VN TMKGG
Sbjct: 333 FTYLTDA-AYTLISESFNSLALDKRYQTTDSELPFEYCYALSPNKDSFQYPAVNLTMKGG 392

BLAST of Cp4.1LG10g10370 vs. ExPASy Swiss-Prot
Match: Q9LX20 (Aspartic proteinase-like protein 1 OS=Arabidopsis thaliana OX=3702 GN=At5g10080 PE=2 SV=1)

HSP 1 Score: 213.4 bits (542), Expect = 5.2e-54
Identity = 138/346 (39.88%), Postives = 184/346 (53.18%), Query Frame = 0

Query: 9   LVFSVFFLSGGLRSGDASSFKFSIHHRFSD----AVKGIIDSEGLPEKHSPEYYATLV-- 68
           L+F V FL+       AS F   + HRFSD    ++K    S+ LP K S EYY  L   
Sbjct: 8   LLFCVLFLA--TEETLASLFSSRLIHRFSDEGRASIKTPSSSDSLPNKQSLEYYRLLAES 67

Query: 69  --HRDRLVHGRR---LAASNGSKELTFAGGNATYLMVNAGFLLYANISIGTPALDFFVAL 128
              R R+  G +   L  S GSK  T + GN      + G+L Y  I IGTP++ F VAL
Sbjct: 68  DFRRQRMNLGAKVQSLVPSEGSK--TISSGN------DFGWLHYTWIDIGTPSVSFLVAL 127

Query: 129 DTGSNLFWLPCECRSCPTSTPS-----GKVPFNHYSPNASKTSSTVPCSNALCELSNKCT 188
           DTGSNL W+PC C  C   T +          N Y+P++S TS    CS+ LC+ ++ C 
Sbjct: 128 DTGSNLLWIPCNCVQCAPLTSTYYSSLATKDLNEYNPSSSSTSKVFLCSHKLCDSASDCE 187

Query: 189 SNQNTCPYKVKYGAGNESSTGYLVEDVVSLITDDSQ-----LKPVKAKITFGCGKVQTGR 248
           S +  CPY V Y +GN SS+G LVED++ L  + +         VKA++  GCGK Q+G 
Sbjct: 188 SPKEQCPYTVNYLSGNTSSSGLLVEDILHLTYNTNNRLMNGSSSVKARVVIGCGKKQSGD 247

Query: 249 FARHAAPNGLIGLGMERISVPSFLANQGLISDSFSMCFGHDELGRIDFGDTGTPDQKETP 308
           +    AP+GL+GLG   ISVPSFL+  GL+ +SFS+CF  ++ GRI FGD G   Q+ TP
Sbjct: 248 YLDGVAPDGLMGLGPAEISVPSFLSKAGLMRNSFSLCFDEEDSGRIYFGDMGPSIQQSTP 307

Query: 309 I--NSNPSFPHYNVTITQIIVGEK-TNDVQFTAIFDGGASFTHLAE 331
                N  +  Y V +    +G        FT   D G SFT+L E
Sbjct: 308 FLQLDNNKYSGYIVGVEACCIGNSCLKQTSFTTFIDSGQSFTYLPE 343

BLAST of Cp4.1LG10g10370 vs. ExPASy Swiss-Prot
Match: Q4V3D2 (Aspartic proteinase 36 OS=Arabidopsis thaliana OX=3702 GN=A36 PE=1 SV=1)

HSP 1 Score: 104.8 bits (260), Expect = 2.6e-21
Identity = 68/210 (32.38%), Postives = 110/210 (52.38%), Query Frame = 0

Query: 99  LLYANISIGTPALDFFVALDTGSNLFWLPC-ECRSCPTSTPSGKVPFNHYSPNASKTSST 158
           L +  I +G+P  +++V +DTGS++ W+ C  C  CP  T  G +P + Y    S TS  
Sbjct: 77  LYFTKIKLGSPPKEYYVQVDTGSDILWVNCAPCPKCPVKTDLG-IPLSLYDSKTSSTSKN 136

Query: 159 VPCSNALCE--LSNKCTSNQNTCPYKVKYGAGNESSTGYLVEDVVSL--ITDDSQLKPVK 218
           V C +  C   + ++    +  C Y V YG G+ +S G  ++D ++L  +T + +  P+ 
Sbjct: 137 VGCEDDFCSFIMQSETCGAKKPCSYHVVYGDGS-TSDGDFIKDNITLEQVTGNLRTAPLA 196

Query: 219 AKITFGCGKVQTGRFAR-HAAPNGLIGLGMERISVPSFLANQGLISDSFSMCFGHDELGR 278
            ++ FGCGK Q+G+  +  +A +G++G G    S+ S LA  G     FS C  +   G 
Sbjct: 197 QEVVFGCGKNQSGQLGQTDSAVDGIMGFGQSNTSIISQLAAGGSTKRIFSHCLDNMNGGG 256

Query: 279 I-DFGDTGTPDQKETPINSNPSFPHYNVTI 302
           I   G+  +P  K TPI   P+  HYNV +
Sbjct: 257 IFAVGEVESPVVKTTPI--VPNQVHYNVIL 282

BLAST of Cp4.1LG10g10370 vs. ExPASy Swiss-Prot
Match: Q9LEW3 (Aspartyl protease AED1 OS=Arabidopsis thaliana OX=3702 GN=AED1 PE=2 SV=1)

HSP 1 Score: 99.4 bits (246), Expect = 1.1e-19
Identity = 87/269 (32.34%), Postives = 129/269 (47.96%), Query Frame = 0

Query: 75  ASNGSKELTFAGGNATYLMVNAGFLL-----YANISIGTPALDFFVALDTGSNLFWLPCE 134
           + N + E++ A   +T L   +G  L        I IGTP  D  +  DTGS+L W  CE
Sbjct: 104 SKNSANEVSEA--KSTELPAKSGITLGSGNYIVTIGIGTPKHDLSLVFDTGSDLTWTQCE 163

Query: 135 CRSCPTSTPSGKVPFNHYSPNASKTSSTVPCSNALCELSNKCTSNQNTCPYKVKYGAGNE 194
              C  S  S K P   ++P++S T   V CS+ +CE +  C++  + C Y + YG    
Sbjct: 164 --PCLGSCYSQKEP--KFNPSSSSTYQNVSCSSPMCEDAESCSA--SNCVYSIVYG-DKS 223

Query: 195 SSTGYLVEDVVSLITDDSQLKPVKAKITFGCGKVQTGRFARHAAPNGLIGLGMERISVPS 254
            + G+L ++  +L   D     V   + FGCG+   G F   A   GL+GLG  ++S+P+
Sbjct: 224 FTQGFLAKEKFTLTNSD-----VLEDVYFGCGENNQGLFDGVA---GLLGLGPGKLSLPA 283

Query: 255 FLANQGLISDSFSMC---FGHDELGRIDFGDTGTPDQ-KETPINSNPSFPHYNVTITQII 314
                   ++ FS C   F  +  G + FG  G  +  K TPI+S PS  +Y + I  I 
Sbjct: 284 --QTTTTYNNIFSYCLPSFTSNSTGHLTFGSAGISESVKFTPISSFPSAFNYGIDIIGIS 343

Query: 315 VGEKTNDV---QFT---AIFDGGASFTHL 329
           VG+K   +    F+   AI D G  FT L
Sbjct: 344 VGDKELAITPNSFSTEGAIIDSGTVFTRL 353

BLAST of Cp4.1LG10g10370 vs. ExPASy Swiss-Prot
Match: Q9S9K4 (Aspartic proteinase 39 OS=Arabidopsis thaliana OX=3702 GN=A39 PE=1 SV=2)

HSP 1 Score: 99.4 bits (246), Expect = 1.1e-19
Identity = 80/261 (30.65%), Postives = 129/261 (49.43%), Query Frame = 0

Query: 69  HGRRLAASNGSKELTFAGGNATYLMVNAGFLLYANISIGTPALDFFVALDTGSNLFWLPC 128
           H R LA    S +L   G +     V++  L +  I +G+P  ++ V +DTGS++ W+ C
Sbjct: 51  HSRMLA----SIDLPLGGDS----RVDSVGLYFTKIKLGSPPKEYHVQVDTGSDILWINC 110

Query: 129 E-CRSCPTSTPSGKVPFNHYSPNASKTSSTVPCSNALCELSNKCTSNQNT--CPYKVKYG 188
           + C  CPT T +     + +  NAS TS  V C +  C   ++  S Q    C Y + Y 
Sbjct: 111 KPCPKCPTKT-NLNFRLSLFDMNASSTSKKVGCDDDFCSFISQSDSCQPALGCSYHIVY- 170

Query: 189 AGNESSTGYLVEDVVSL--ITDDSQLKPVKAKITFGCGKVQTGRFAR-HAAPNGLIGLGM 248
           A   +S G  + D+++L  +T D +  P+  ++ FGCG  Q+G+     +A +G++G G 
Sbjct: 171 ADESTSDGKFIRDMLTLEQVTGDLKTGPLGQEVVFGCGSDQSGQLGNGDSAVDGVMGFGQ 230

Query: 249 ERISVPSFLANQGLISDSFSMCFGHDELGRI-DFGDTGTPDQKETPINSNPSFPHYNVTI 308
              SV S LA  G     FS C  + + G I   G   +P  K TP+   P+  HYNV +
Sbjct: 231 SNTSVLSQLAATGDAKRVFSHCLDNVKGGGIFAVGVVDSPKVKTTPM--VPNQMHYNVML 290

Query: 309 TQIIVGEKTNDVQFTAIFDGG 323
             + V   + D+  + + +GG
Sbjct: 291 MGMDVDGTSLDLPRSIVRNGG 299

BLAST of Cp4.1LG10g10370 vs. NCBI nr
Match: XP_023544679.1 (aspartyl protease family protein 1-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 818 bits (2113), Expect = 1.50e-296
Identity = 430/517 (83.17%), Postives = 433/517 (83.75%), Query Frame = 0

Query: 2   AFGAQMLLVFSVFFLSGGLRSGDASSFKFSIHHRFSDAVKGIIDSEGLPEKHSPEYYATL 61
           +FGAQMLLVFSVFFLSGGLRSGDASSFKFSIHHRFSDAVKGIIDSEGLPEKHSPEYYATL
Sbjct: 6   SFGAQMLLVFSVFFLSGGLRSGDASSFKFSIHHRFSDAVKGIIDSEGLPEKHSPEYYATL 65

Query: 62  VHRDRLVHGRRLAASNGSKELTFAGGNATYLMVNAGFLLYANISIGTPALDFFVALDTGS 121
           VHRDRLVHGRRLAASNGSKELTFAGGNATYLMVNAGFLLYANISIGTPALDFFVALDTGS
Sbjct: 66  VHRDRLVHGRRLAASNGSKELTFAGGNATYLMVNAGFLLYANISIGTPALDFFVALDTGS 125

Query: 122 NLFWLPCECRSCPTSTPSGKVPFNHYSPNASKTSSTVPCSNALCELSNKCTSNQNTCPYK 181
           NLFWLPCECRSCPTSTPSGKVPFNHYSPNASKTSSTVPCSNALCELSNKCTSNQNTCPYK
Sbjct: 126 NLFWLPCECRSCPTSTPSGKVPFNHYSPNASKTSSTVPCSNALCELSNKCTSNQNTCPYK 185

Query: 182 VKYGAGNESSTGYLVEDVVSLITDDSQLKPVKAKITFGCGKVQTGRFARHAAPNGLIGLG 241
           VKYGAGNESSTGYLVEDVVSLITDDSQLKPVKAKITFGCGKVQTGRFARHAAPNGLIGLG
Sbjct: 186 VKYGAGNESSTGYLVEDVVSLITDDSQLKPVKAKITFGCGKVQTGRFARHAAPNGLIGLG 245

Query: 242 MERISVPSFLANQGLISDSFSMCFGHDELGRIDFGDTGTPDQKETPINSNPSFPHYNVTI 301
           MERISVPSFLANQGLISDSFSMCFGHDELGRIDFGDTGTPDQKETPINSNPSFPHYNVTI
Sbjct: 246 MERISVPSFLANQGLISDSFSMCFGHDELGRIDFGDTGTPDQKETPINSNPSFPHYNVTI 305

Query: 302 TQIIVGEKTNDVQFTAIFDGGASFTHLAEPHSWILIHT--------RFIF---------- 361
           TQIIVGEKTNDVQFTAIFDGGASFTHLAEP   ++           RF F          
Sbjct: 306 TQIIVGEKTNDVQFTAIFDGGASFTHLAEPVYSLISEQIDSGIQLKRFSFGPDFAFEYCY 365

Query: 362 ---SLYAETIDCPVVNFTMKGGDDFIPLG------------------------------- 421
              SLYAETIDCPVVNFTMKGGDDFIPLG                               
Sbjct: 366 ESPSLYAETIDCPVVNFTMKGGDDFIPLGQFLTLSIDDADTRRAFCLTIVKSTSINIIGM 425

Query: 422 ------------------------YDNGAGTPYGYTTPFDSPPTDKSPPSDDSPPAPVTP 442
                                   YDNGAGTPYGYTTPFDSPPTDKSPPSDDSPPAPVTP
Sbjct: 426 DFMAGYRIVFNREKMVLGWSPSDCYDNGAGTPYGYTTPFDSPPTDKSPPSDDSPPAPVTP 485

BLAST of Cp4.1LG10g10370 vs. NCBI nr
Match: KAG6603889.1 (Aspartyl protease family protein 1, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 806 bits (2083), Expect = 4.44e-292
Identity = 422/511 (82.58%), Postives = 429/511 (83.95%), Query Frame = 0

Query: 2   AFGAQMLLVFSVFFLSGGLRSGDASSFKFSIHHRFSDAVKGIIDSEGLPEKHSPEYYATL 61
           +FGAQMLLVFSVFFLSGGLRSGDASSFKFSIHHRFSDAVKGIIDSEG PEKHSPEYYATL
Sbjct: 6   SFGAQMLLVFSVFFLSGGLRSGDASSFKFSIHHRFSDAVKGIIDSEGFPEKHSPEYYATL 65

Query: 62  VHRDRLVHGRRLAASNGSKELTFAGGNATYLMVNAGFLLYANISIGTPALDFFVALDTGS 121
           VHRDRLVHGRRLAASNGSKELTFAGGNATYLMVN GFLLYANISIGTPALDFFVALDTGS
Sbjct: 66  VHRDRLVHGRRLAASNGSKELTFAGGNATYLMVNFGFLLYANISIGTPALDFFVALDTGS 125

Query: 122 NLFWLPCECRSCPTSTPSGKVPFNHYSPNASKTSSTVPCSNALCELSNKCTSNQNTCPYK 181
           NLFWLPCECRSCPTSTPSGK+PF+HYSPNASKTSSTVPCSNALCELSNKCTSNQNTCPYK
Sbjct: 126 NLFWLPCECRSCPTSTPSGKIPFSHYSPNASKTSSTVPCSNALCELSNKCTSNQNTCPYK 185

Query: 182 VKYGAGNESSTGYLVEDVVSLITDDSQLKPVKAKITFGCGKVQTGRFARHAAPNGLIGLG 241
           VKYGAGNESSTGYLVEDVVSLITDDSQLKPVKAKITFGCGKVQTGRFARHAAPNGLIGLG
Sbjct: 186 VKYGAGNESSTGYLVEDVVSLITDDSQLKPVKAKITFGCGKVQTGRFARHAAPNGLIGLG 245

Query: 242 MERISVPSFLANQGLISDSFSMCFGHDELGRIDFGDTGTPDQKETPINSNPSFPHYNVTI 301
           MERISVPSFLANQGLISDSFSMCFGHDE+GRIDFGDTGTPDQKETPINSNPSFPHYNVTI
Sbjct: 246 MERISVPSFLANQGLISDSFSMCFGHDEIGRIDFGDTGTPDQKETPINSNPSFPHYNVTI 305

Query: 302 TQIIVGEKTNDVQFTAIFDGGASFTHLAEPHSWILIHT--------RFIF---------- 361
           TQIIVG KTNDVQFTAIFDGGASFTHLAEP   ++           RF F          
Sbjct: 306 TQIIVGGKTNDVQFTAIFDGGASFTHLAEPVYSLISEQIDSGIQLKRFSFGPDFAFEYCY 365

Query: 362 ---SLYAETIDCPVVNFTMKGGDDFIPLG------------------------------- 421
              SLYAETIDCPVVNFTMKGGDDFIPLG                               
Sbjct: 366 ESPSLYAETIDCPVVNFTMKGGDDFIPLGQFLTLSIDDADTRRAVCLTIVKSTVDFMAGY 425

Query: 422 ------------------YDNGAGTPYGYTTPFDSPPTDKSPPSDDSPPAPVTPGGSTGL 442
                             Y+NGA TPYGYTTPFDSPPTDKSPPSDDSPPAPVTPGGSTGL
Sbjct: 426 RIVFNREKMVLGWSPSDCYNNGAATPYGYTTPFDSPPTDKSPPSDDSPPAPVTPGGSTGL 485

BLAST of Cp4.1LG10g10370 vs. NCBI nr
Match: XP_022949713.1 (aspartyl protease family protein 1-like isoform X1 [Cucurbita moschata])

HSP 1 Score: 803 bits (2073), Expect = 1.85e-290
Identity = 421/517 (81.43%), Postives = 428/517 (82.79%), Query Frame = 0

Query: 2   AFGAQMLLVFSVFFLSGGLRSGDASSFKFSIHHRFSDAVKGIIDSEGLPEKHSPEYYATL 61
           +FGAQMLLVFSVFFLSGGLRSGDASSFKFSIHHRFSDAVKGIIDSEG PEKHSPEYYATL
Sbjct: 6   SFGAQMLLVFSVFFLSGGLRSGDASSFKFSIHHRFSDAVKGIIDSEGFPEKHSPEYYATL 65

Query: 62  VHRDRLVHGRRLAASNGSKELTFAGGNATYLMVNAGFLLYANISIGTPALDFFVALDTGS 121
           VHRDRLVHGRRLAASNGSKELTFAGGNATYLMVN GFLLYANISIGTPALDFFVALDTGS
Sbjct: 66  VHRDRLVHGRRLAASNGSKELTFAGGNATYLMVNFGFLLYANISIGTPALDFFVALDTGS 125

Query: 122 NLFWLPCECRSCPTSTPSGKVPFNHYSPNASKTSSTVPCSNALCELSNKCTSNQNTCPYK 181
           NLFWLPCECRSCPTSTPSGK+PF+HYSPNASKTSSTVPCSNALCELSNKCTSNQNTCPYK
Sbjct: 126 NLFWLPCECRSCPTSTPSGKIPFSHYSPNASKTSSTVPCSNALCELSNKCTSNQNTCPYK 185

Query: 182 VKYGAGNESSTGYLVEDVVSLITDDSQLKPVKAKITFGCGKVQTGRFARHAAPNGLIGLG 241
           VKYGAGNESSTGYLVEDVVSLITDDSQLKPVKAKITFGCGKVQTGRFARHAAPNGLIGLG
Sbjct: 186 VKYGAGNESSTGYLVEDVVSLITDDSQLKPVKAKITFGCGKVQTGRFARHAAPNGLIGLG 245

Query: 242 MERISVPSFLANQGLISDSFSMCFGHDELGRIDFGDTGTPDQKETPINSNPSFPHYNVTI 301
           MERISVPSFLANQGLISDSFSMCFGHDE+GRIDFGDTGTPDQKETPINSNPSFPHYNVTI
Sbjct: 246 MERISVPSFLANQGLISDSFSMCFGHDEIGRIDFGDTGTPDQKETPINSNPSFPHYNVTI 305

Query: 302 TQIIVGEKTNDVQFTAIFDGGASFTHLAEPHSWILIHT--------RFIF---------- 361
           TQIIVG KTNDVQFTAIFDGGASFTHLAEP   ++           RF F          
Sbjct: 306 TQIIVGGKTNDVQFTAIFDGGASFTHLAEPVYSLISEQIDSGIQLKRFSFGPDFAFEYCY 365

Query: 362 ---SLYAETIDCPVVNFTMKGGDDFIPLG------------------------------- 421
              SLYAETIDCPVVNFTMKGGDDFIPLG                               
Sbjct: 366 ESPSLYAETIDCPVVNFTMKGGDDFIPLGQFLTLSIDDADTRRAVCLTIVKSTGINLIGM 425

Query: 422 ------------------------YDNGAGTPYGYTTPFDSPPTDKSPPSDDSPPAPVTP 442
                                   Y+NGA TPYGYTTPFDSPPTDKSPPSDDSPPAPVTP
Sbjct: 426 DFMAGYRIVFNREKMVLGWSPSDCYNNGAATPYGYTTPFDSPPTDKSPPSDDSPPAPVTP 485

BLAST of Cp4.1LG10g10370 vs. NCBI nr
Match: XP_022949714.1 (aspartyl protease family protein 1-like isoform X2 [Cucurbita moschata])

HSP 1 Score: 801 bits (2070), Expect = 5.10e-290
Identity = 420/516 (81.40%), Postives = 427/516 (82.75%), Query Frame = 0

Query: 2   AFGAQMLLVFSVFFLSGGLRSGDASSFKFSIHHRFSDAVKGIIDSEGLPEKHSPEYYATL 61
           +FGAQMLLVFSVFFLSGGLRSGDASSFKFSIHHRFSDAVKGIIDSEG PEKHSPEYYATL
Sbjct: 6   SFGAQMLLVFSVFFLSGGLRSGDASSFKFSIHHRFSDAVKGIIDSEGFPEKHSPEYYATL 65

Query: 62  VHRDRLVHGRRLAASNGSKELTFAGGNATYLMVNAGFLLYANISIGTPALDFFVALDTGS 121
           VHRDRLVHGRRLAASNGSKELTFAGGNATYLMVN GFLLYANISIGTPALDFFVALDTGS
Sbjct: 66  VHRDRLVHGRRLAASNGSKELTFAGGNATYLMVNFGFLLYANISIGTPALDFFVALDTGS 125

Query: 122 NLFWLPCECRSCPTSTPSGKVPFNHYSPNASKTSSTVPCSNALCELSNKCTSNQNTCPYK 181
           NLFWLPCECRSCPTSTPSGK+PF+HYSPNASKTSSTVPCSNALCELSNKCTSNQNTCPYK
Sbjct: 126 NLFWLPCECRSCPTSTPSGKIPFSHYSPNASKTSSTVPCSNALCELSNKCTSNQNTCPYK 185

Query: 182 VKYGAGNESSTGYLVEDVVSLITDDSQLKPVKAKITFGCGKVQTGRFARHAAPNGLIGLG 241
           VKYGAGNESSTGYLVEDVVSLITDDSQLKPVKAKITFGCGKVQTGRFARHAAPNGLIGLG
Sbjct: 186 VKYGAGNESSTGYLVEDVVSLITDDSQLKPVKAKITFGCGKVQTGRFARHAAPNGLIGLG 245

Query: 242 MERISVPSFLANQGLISDSFSMCFGHDELGRIDFGDTGTPDQKETPINSNPSFPHYNVTI 301
           MERISVPSFLANQGLISDSFSMCFGHDE+GRIDFGDTGTPDQKETPINSNPSFPHYNVTI
Sbjct: 246 MERISVPSFLANQGLISDSFSMCFGHDEIGRIDFGDTGTPDQKETPINSNPSFPHYNVTI 305

Query: 302 TQIIVGEKTNDVQFTAIFDGGASFTHLAEPHSWILIHT--------RFIFS--------- 361
           TQIIVG KTNDVQFTAIFDGGASFTHLAEP   ++           RF F          
Sbjct: 306 TQIIVGGKTNDVQFTAIFDGGASFTHLAEPVYSLISEQIDSGIQLKRFSFGPDFAFEYCY 365

Query: 362 ---LYAETIDCPVVNFTMKGGDDFIPLG-------------------------------- 421
              LYAETIDCPVVNFTMKGGDDFIPLG                                
Sbjct: 366 ESPLYAETIDCPVVNFTMKGGDDFIPLGQFLTLSIDDADTRRAVCLTIVKSTGINLIGMD 425

Query: 422 -----------------------YDNGAGTPYGYTTPFDSPPTDKSPPSDDSPPAPVTPG 442
                                  Y+NGA TPYGYTTPFDSPPTDKSPPSDDSPPAPVTPG
Sbjct: 426 FMAGYRIVFNREKMVLGWSPSDCYNNGAATPYGYTTPFDSPPTDKSPPSDDSPPAPVTPG 485

BLAST of Cp4.1LG10g10370 vs. NCBI nr
Match: KAG7034068.1 (Aspartyl protease family protein 1 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 799 bits (2064), Expect = 1.32e-283
Identity = 408/452 (90.27%), Postives = 413/452 (91.37%), Query Frame = 0

Query: 2   AFGAQMLLVFSVFFLSGGLRSGDASSFKFSIHHRFSDAVKGIIDSEGLPEKHSPEYYATL 61
           +FGAQMLLVFSVFFLSGGLRSGDASSFKFSIHHRFSDAVKGIIDSEG PEKHSPEYYATL
Sbjct: 6   SFGAQMLLVFSVFFLSGGLRSGDASSFKFSIHHRFSDAVKGIIDSEGFPEKHSPEYYATL 65

Query: 62  VHRDRLVHGRRLAASNGSKELTFAGGNATYLMVNAGFLLYANISIGTPALDFFVALDTGS 121
           VHRDRLVHGRRLAASNGSKELTFAGGNATYLMVN GFLLYANISIGTPALDFFVALDTGS
Sbjct: 66  VHRDRLVHGRRLAASNGSKELTFAGGNATYLMVNFGFLLYANISIGTPALDFFVALDTGS 125

Query: 122 NLFWLPCECRSCPTSTPSGKVPFNHYSPNASKTSSTVPCSNALCELSNKCTSNQNTCPYK 181
           NLFWLPCECRSCPTSTPSGK+PF+HYSPNASKTSSTVPCSNALCELSNKCTSNQNTCPYK
Sbjct: 126 NLFWLPCECRSCPTSTPSGKIPFSHYSPNASKTSSTVPCSNALCELSNKCTSNQNTCPYK 185

Query: 182 VKYGAGNESSTGYLVEDVVSLITDDSQLKPVKAKITFGCGKVQTGRFARHAAPNGLIGLG 241
           VKYGAGNESSTGYLVEDVVSLITDDSQLKPVKAKITFGCGKVQTGRFARHAAPNGLIGLG
Sbjct: 186 VKYGAGNESSTGYLVEDVVSLITDDSQLKPVKAKITFGCGKVQTGRFARHAAPNGLIGLG 245

Query: 242 MERISVPSFLANQGLISDSFSMCFGHDELGRIDFGDTGTPDQKETPINSNPSFPHYNVTI 301
           MERISVPSFLANQGLISDSFSMCFGHDE+GRIDFGDTGTPDQKETPINSNPSFPHYNVTI
Sbjct: 246 MERISVPSFLANQGLISDSFSMCFGHDEIGRIDFGDTGTPDQKETPINSNPSFPHYNVTI 305

Query: 302 TQIIVGEKTNDVQFTAIFDGGASFTHLAEPHSWILIHTRFIFSLYAETIDCPVVNFTMKG 361
           TQIIVG KTNDVQFTAIFDGGAS                    LYAETIDCPVVNFTMKG
Sbjct: 306 TQIIVGGKTNDVQFTAIFDGGAS--------------------LYAETIDCPVVNFTMKG 365

Query: 362 GDDFIPLG--------------YDNGAGTPYGYTTPFDSPPTDKSPPSDDSPPAPVTPGG 421
           GDDFIPLG              Y+NGA TPYGYTTPFDSPPTDKSPPSDDSPPAPVTPGG
Sbjct: 366 GDDFIPLGQFLTLSIDDADTRRYNNGAATPYGYTTPFDSPPTDKSPPSDDSPPAPVTPGG 425

Query: 422 STGLPNIEVGVATRLNPLTSVVVAVLAILAVV 439
           STGLPNIEVGVATRLNPLTSV VAVLAILAVV
Sbjct: 426 STGLPNIEVGVATRLNPLTSVFVAVLAILAVV 437

BLAST of Cp4.1LG10g10370 vs. ExPASy TrEMBL
Match: A0A6J1GCU4 (aspartyl protease family protein 1-like isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111453025 PE=4 SV=1)

HSP 1 Score: 803 bits (2073), Expect = 8.95e-291
Identity = 421/517 (81.43%), Postives = 428/517 (82.79%), Query Frame = 0

Query: 2   AFGAQMLLVFSVFFLSGGLRSGDASSFKFSIHHRFSDAVKGIIDSEGLPEKHSPEYYATL 61
           +FGAQMLLVFSVFFLSGGLRSGDASSFKFSIHHRFSDAVKGIIDSEG PEKHSPEYYATL
Sbjct: 6   SFGAQMLLVFSVFFLSGGLRSGDASSFKFSIHHRFSDAVKGIIDSEGFPEKHSPEYYATL 65

Query: 62  VHRDRLVHGRRLAASNGSKELTFAGGNATYLMVNAGFLLYANISIGTPALDFFVALDTGS 121
           VHRDRLVHGRRLAASNGSKELTFAGGNATYLMVN GFLLYANISIGTPALDFFVALDTGS
Sbjct: 66  VHRDRLVHGRRLAASNGSKELTFAGGNATYLMVNFGFLLYANISIGTPALDFFVALDTGS 125

Query: 122 NLFWLPCECRSCPTSTPSGKVPFNHYSPNASKTSSTVPCSNALCELSNKCTSNQNTCPYK 181
           NLFWLPCECRSCPTSTPSGK+PF+HYSPNASKTSSTVPCSNALCELSNKCTSNQNTCPYK
Sbjct: 126 NLFWLPCECRSCPTSTPSGKIPFSHYSPNASKTSSTVPCSNALCELSNKCTSNQNTCPYK 185

Query: 182 VKYGAGNESSTGYLVEDVVSLITDDSQLKPVKAKITFGCGKVQTGRFARHAAPNGLIGLG 241
           VKYGAGNESSTGYLVEDVVSLITDDSQLKPVKAKITFGCGKVQTGRFARHAAPNGLIGLG
Sbjct: 186 VKYGAGNESSTGYLVEDVVSLITDDSQLKPVKAKITFGCGKVQTGRFARHAAPNGLIGLG 245

Query: 242 MERISVPSFLANQGLISDSFSMCFGHDELGRIDFGDTGTPDQKETPINSNPSFPHYNVTI 301
           MERISVPSFLANQGLISDSFSMCFGHDE+GRIDFGDTGTPDQKETPINSNPSFPHYNVTI
Sbjct: 246 MERISVPSFLANQGLISDSFSMCFGHDEIGRIDFGDTGTPDQKETPINSNPSFPHYNVTI 305

Query: 302 TQIIVGEKTNDVQFTAIFDGGASFTHLAEPHSWILIHT--------RFIF---------- 361
           TQIIVG KTNDVQFTAIFDGGASFTHLAEP   ++           RF F          
Sbjct: 306 TQIIVGGKTNDVQFTAIFDGGASFTHLAEPVYSLISEQIDSGIQLKRFSFGPDFAFEYCY 365

Query: 362 ---SLYAETIDCPVVNFTMKGGDDFIPLG------------------------------- 421
              SLYAETIDCPVVNFTMKGGDDFIPLG                               
Sbjct: 366 ESPSLYAETIDCPVVNFTMKGGDDFIPLGQFLTLSIDDADTRRAVCLTIVKSTGINLIGM 425

Query: 422 ------------------------YDNGAGTPYGYTTPFDSPPTDKSPPSDDSPPAPVTP 442
                                   Y+NGA TPYGYTTPFDSPPTDKSPPSDDSPPAPVTP
Sbjct: 426 DFMAGYRIVFNREKMVLGWSPSDCYNNGAATPYGYTTPFDSPPTDKSPPSDDSPPAPVTP 485

BLAST of Cp4.1LG10g10370 vs. ExPASy TrEMBL
Match: A0A6J1GDN3 (aspartyl protease family protein 1-like isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111453025 PE=4 SV=1)

HSP 1 Score: 801 bits (2070), Expect = 2.47e-290
Identity = 420/516 (81.40%), Postives = 427/516 (82.75%), Query Frame = 0

Query: 2   AFGAQMLLVFSVFFLSGGLRSGDASSFKFSIHHRFSDAVKGIIDSEGLPEKHSPEYYATL 61
           +FGAQMLLVFSVFFLSGGLRSGDASSFKFSIHHRFSDAVKGIIDSEG PEKHSPEYYATL
Sbjct: 6   SFGAQMLLVFSVFFLSGGLRSGDASSFKFSIHHRFSDAVKGIIDSEGFPEKHSPEYYATL 65

Query: 62  VHRDRLVHGRRLAASNGSKELTFAGGNATYLMVNAGFLLYANISIGTPALDFFVALDTGS 121
           VHRDRLVHGRRLAASNGSKELTFAGGNATYLMVN GFLLYANISIGTPALDFFVALDTGS
Sbjct: 66  VHRDRLVHGRRLAASNGSKELTFAGGNATYLMVNFGFLLYANISIGTPALDFFVALDTGS 125

Query: 122 NLFWLPCECRSCPTSTPSGKVPFNHYSPNASKTSSTVPCSNALCELSNKCTSNQNTCPYK 181
           NLFWLPCECRSCPTSTPSGK+PF+HYSPNASKTSSTVPCSNALCELSNKCTSNQNTCPYK
Sbjct: 126 NLFWLPCECRSCPTSTPSGKIPFSHYSPNASKTSSTVPCSNALCELSNKCTSNQNTCPYK 185

Query: 182 VKYGAGNESSTGYLVEDVVSLITDDSQLKPVKAKITFGCGKVQTGRFARHAAPNGLIGLG 241
           VKYGAGNESSTGYLVEDVVSLITDDSQLKPVKAKITFGCGKVQTGRFARHAAPNGLIGLG
Sbjct: 186 VKYGAGNESSTGYLVEDVVSLITDDSQLKPVKAKITFGCGKVQTGRFARHAAPNGLIGLG 245

Query: 242 MERISVPSFLANQGLISDSFSMCFGHDELGRIDFGDTGTPDQKETPINSNPSFPHYNVTI 301
           MERISVPSFLANQGLISDSFSMCFGHDE+GRIDFGDTGTPDQKETPINSNPSFPHYNVTI
Sbjct: 246 MERISVPSFLANQGLISDSFSMCFGHDEIGRIDFGDTGTPDQKETPINSNPSFPHYNVTI 305

Query: 302 TQIIVGEKTNDVQFTAIFDGGASFTHLAEPHSWILIHT--------RFIFS--------- 361
           TQIIVG KTNDVQFTAIFDGGASFTHLAEP   ++           RF F          
Sbjct: 306 TQIIVGGKTNDVQFTAIFDGGASFTHLAEPVYSLISEQIDSGIQLKRFSFGPDFAFEYCY 365

Query: 362 ---LYAETIDCPVVNFTMKGGDDFIPLG-------------------------------- 421
              LYAETIDCPVVNFTMKGGDDFIPLG                                
Sbjct: 366 ESPLYAETIDCPVVNFTMKGGDDFIPLGQFLTLSIDDADTRRAVCLTIVKSTGINLIGMD 425

Query: 422 -----------------------YDNGAGTPYGYTTPFDSPPTDKSPPSDDSPPAPVTPG 442
                                  Y+NGA TPYGYTTPFDSPPTDKSPPSDDSPPAPVTPG
Sbjct: 426 FMAGYRIVFNREKMVLGWSPSDCYNNGAATPYGYTTPFDSPPTDKSPPSDDSPPAPVTPG 485

BLAST of Cp4.1LG10g10370 vs. ExPASy TrEMBL
Match: A0A6J1ISF2 (aspartyl protease family protein 1-like OS=Cucurbita maxima OX=3661 GN=LOC111477998 PE=4 SV=1)

HSP 1 Score: 771 bits (1991), Expect = 2.36e-278
Identity = 408/513 (79.53%), Postives = 421/513 (82.07%), Query Frame = 0

Query: 2   AFGAQMLLVFSVFFLSGGLRSGDASSFKFSIHHRFSDAVKGIIDSEGLPEKHSPEYYATL 61
           +FGAQMLLVFSVFFL GGLRSGDASSFKFSIHHRFSD+V  IIDSEGLPEKHSPEYYATL
Sbjct: 6   SFGAQMLLVFSVFFLFGGLRSGDASSFKFSIHHRFSDSVMEIIDSEGLPEKHSPEYYATL 65

Query: 62  VHRDRLVHGRRLAASNGSKELTFAGGNATYLMVNAGFLLYANISIGTPALDFFVALDTGS 121
           VHRDRLVHGRRLAASNGSKELTFAGGNATYLMVN GFLLYANISIGTPALDFFVALDTGS
Sbjct: 66  VHRDRLVHGRRLAASNGSKELTFAGGNATYLMVNFGFLLYANISIGTPALDFFVALDTGS 125

Query: 122 NLFWLPCECRSCPTSTPSGKVPFNHYSPNASKTSSTVPCSNALCELSNKCTSNQNTCPYK 181
           NLFWLPCECRSCPTSTPSGKVPFNHY PNASKTSSTVPCSNALCELSNKCTSNQNTCPY+
Sbjct: 126 NLFWLPCECRSCPTSTPSGKVPFNHYRPNASKTSSTVPCSNALCELSNKCTSNQNTCPYE 185

Query: 182 VKYGAGNESSTGYLVEDVVSLITDDSQLKPVKAKITFGCGKVQTGRFARHAAPNGLIGLG 241
           VKY AGN+SSTGYLVEDVVSLITDDSQLKPVKAKITFGCGKVQTGRFA+HAAPNGLIGLG
Sbjct: 186 VKYRAGNDSSTGYLVEDVVSLITDDSQLKPVKAKITFGCGKVQTGRFAKHAAPNGLIGLG 245

Query: 242 MERISVPSFLANQGLISDSFSMCFGHDELGRIDFGDTGTPDQKETPINSNPSFPHYNVTI 301
           MERISVPSFLANQGLI+DSFSMCFG DELGRIDFGDTGTPDQKETPINSNP+FPHYNVTI
Sbjct: 246 MERISVPSFLANQGLIADSFSMCFGRDELGRIDFGDTGTPDQKETPINSNPNFPHYNVTI 305

Query: 302 TQIIVGEKTNDVQFTAIFDGGASFTHLAEPHSWIL-------IHT-RFIFS--------- 361
           TQIIVGEKTNDVQFTAIFDGGASFT+LAEP   ++       IH  RF F          
Sbjct: 306 TQIIVGEKTNDVQFTAIFDGGASFTYLAEPVYSLISEQIDSGIHLKRFSFGPDFPFEYCY 365

Query: 362 ---LYAETIDCPVVNFTMKGGDDFIPLG-------------------------------- 421
              LYAETID PV+NFTMKGGDDF PLG                                
Sbjct: 366 ETPLYAETIDGPVLNFTMKGGDDFTPLGQFLTLSIDDADTRRAVCLTIVKSTSINIIGID 425

Query: 422 -----------------------YDNGAGTPYGYTTPFDSPPTDKSPPSDDSPPAPVTPG 439
                                  Y+NGAGTPYGYT+PFDSPPTDKSPPSDDSPPAPVTPG
Sbjct: 426 FMAGYRIVFNREKMVLGWSPSDCYENGAGTPYGYTSPFDSPPTDKSPPSDDSPPAPVTPG 485

BLAST of Cp4.1LG10g10370 vs. ExPASy TrEMBL
Match: A0A6J1EEK3 (aspartyl protease family protein 1-like OS=Cucurbita moschata OX=3662 GN=LOC111433625 PE=3 SV=1)

HSP 1 Score: 533 bits (1374), Expect = 2.70e-184
Identity = 295/530 (55.66%), Postives = 355/530 (66.98%), Query Frame = 0

Query: 4   GAQMLLVFSVFFLSGGLRSGDASSFKFSIHHRFSDAVKGIIDSEGLPEKHSPEYYATLVH 63
           G QMLLV SV+ L+ GLRSG+A+SFKF+IHHRFS+++KGI+ SEGLPEKH+P YYAT+VH
Sbjct: 8   GVQMLLVLSVYLLACGLRSGEAASFKFNIHHRFSESIKGILTSEGLPEKHTPAYYATMVH 67

Query: 64  RDRLVHGRRLAASNGSKELTFAGGNATYLMVNAGFLLYANISIGTPALDFFVALDTGSNL 123
           RD LV GRRLA+SNG  +LTFA GN T+ + N G+L YANIS+G+P+LDF VALDTGS+L
Sbjct: 68  RDMLVRGRRLASSNGDTKLTFAYGNETFYIENLGYLYYANISVGSPSLDFLVALDTGSDL 127

Query: 124 FWLPCECRSCPT---STPSGKVPFNHYSPNASKTSSTVPCSNALCELSNKCTSNQNTCPY 183
            WLPCECRSC T   +T  GK   NHYSP+ S TS+ VPCSN+LCELSN+CTSN NTCPY
Sbjct: 128 LWLPCECRSCLTYLNTTDGGKFALNHYSPDDSTTSAPVPCSNSLCELSNQCTSNTNTCPY 187

Query: 184 KVKYGAGNESSTGYLVEDVVSLITDDSQLKPVKAKITFGCGKVQTGRFARHAAPNGLIGL 243
           ++ Y + N SSTGYLV+DV+ L TDD +L PV++KITFGCG VQTG F R AAPNGLIGL
Sbjct: 188 EINYLSANTSSTGYLVQDVLHLATDDLKLDPVESKITFGCGTVQTGVFQRGAAPNGLIGL 247

Query: 244 GMERISVPSFLANQGLISDSFSMCFGHDELGRIDFGDTGTPDQKETPINSNPSFPHYNVT 303
           GM+RISVPS LANQGL +DSFSMCFG D +GRIDFGD+GTP Q+ETP N+  ++P YNVT
Sbjct: 248 GMDRISVPSLLANQGLTTDSFSMCFGIDGIGRIDFGDSGTPGQRETPFNTMANYPSYNVT 307

Query: 304 ITQIIVGEKTNDVQFTAIFDGGASFTHLAEPHSWILIHT------------------RFI 363
           +T+IIVG K N+V+FTAIFD G SFT+L EP   I+                      + 
Sbjct: 308 VTEIIVGGKANNVEFTAIFDSGTSFTYLNEPAYSIISEQMNAGMKLKRFTTDPDFPFEYC 367

Query: 364 FSLYA-ETIDCPVVNFTMKGGDDFIPLG-------------------------------- 423
           + L A + ++ P++NFTM GGDDF+P+                                 
Sbjct: 368 YELPANDKVERPILNFTMMGGDDFVPMDLFIGFPIDDTTHAVCLTLIKSTDINLIGQNFM 427

Query: 424 ---------------------YDNGAGTPYGYTTPF-DSPPTDKSPPSDDSPPA------ 439
                                YD+ AGTP G T P  DSPP D SPP++DSPPA      
Sbjct: 428 TGYRIIFDREKMALGWSPSDCYDSDAGTPSGDTPPAKDSPPADDSPPAEDSPPAEDSPPA 487

BLAST of Cp4.1LG10g10370 vs. ExPASy TrEMBL
Match: A0A6J1BTF7 (aspartyl protease family protein 1-like OS=Momordica charantia OX=3673 GN=LOC111005407 PE=3 SV=1)

HSP 1 Score: 513 bits (1322), Expect = 2.98e-176
Identity = 297/550 (54.00%), Postives = 346/550 (62.91%), Query Frame = 0

Query: 4   GAQMLLVFSVFFLSGGLRSGDASSFKFSIHHRFSDAVKGIIDSEGLPEKHSPEYYATLVH 63
           GAQMLL  SVF L+G LRSG+A SFKFSIHHRFSD++KGI+DSEGLPEK SP YYAT+VH
Sbjct: 8   GAQMLLFLSVFLLAGELRSGEAGSFKFSIHHRFSDSIKGILDSEGLPEKQSPGYYATMVH 67

Query: 64  RDRLVHGRRLAASNGSKELTFAGGNATYLMVNAGFLLYANISIGTPALDFFVALDTGSNL 123
           RDRLVHGRRLA +NG   LTF  GN T+L+ N GFL YANIS+GTP L F VALDTGS+L
Sbjct: 68  RDRLVHGRRLATTNGDPPLTFVYGNDTFLIGNLGFLYYANISVGTPELSFLVALDTGSDL 127

Query: 124 FWLPCECRSCPT---STPSGKVPFNHYSPNASKTSSTVPCSNALCELSNKCTSNQNTCPY 183
           FWLPCEC SC T   +T  GK   NHYSP  S TS++VPCSN+LCEL+N+C+S  +TCPY
Sbjct: 128 FWLPCECSSCLTYLNTTNGGKFGLNHYSPKDSTTSTSVPCSNSLCELANQCSSTTSTCPY 187

Query: 184 KVKYGAGNESSTGYLVEDVVSLITDDSQLKPVKAKITFGCGKVQTGRFARHAAPNGLIGL 243
           ++ Y + N SS+GYLV+DV+ L TDD QLKPV AKITFGCGK+QTG FA  AAPNGLIGL
Sbjct: 188 EINYLSANTSSSGYLVQDVLHLATDDKQLKPVDAKITFGCGKIQTGIFADSAAPNGLIGL 247

Query: 244 GMERISVPSFLANQGLISDSFSMCFGHDELGRIDFGDTGTPDQKETPINSNPSFPHYNVT 303
           GME+ISVPSFLA+QGL SDSFSMCFG+D  GRIDFGD GT  Q+ETP N+  +FP YNVT
Sbjct: 248 GMEKISVPSFLASQGLTSDSFSMCFGYDGQGRIDFGDIGTSGQRETPFNTMVNFPSYNVT 307

Query: 304 ITQIIVGEKTNDVQFTAIFDGGASFTHLAEPHSWILIHTRFIFSLYAETIDC-------- 363
            TQIIVG K+N++QF+AIFD G SF+++ +P          ++SL AE +D         
Sbjct: 308 FTQIIVGGKSNNLQFSAIFDSGTSFSYITDP----------VYSLIAEQMDAGMKLERVK 367

Query: 364 ----------------------PVVNFTMKGGDDF--------IP--------------- 423
                                 P +NFTMKGGD++        +P               
Sbjct: 368 FDPDFPFEYCYQLPPNANSFSHPRLNFTMKGGDNYGTLDPFVVVPTDNSGGLAACLGIVK 427

Query: 424 --------------------------LG------YDNGAGTPYGYTTPFD---------- 439
                                     LG      YDNGA TP   + P D          
Sbjct: 428 STDPIDLIGQNFMTGYRIIFNREKMVLGWTESDCYDNGAATPSDNSPPADNSPPSDSPPT 487

BLAST of Cp4.1LG10g10370 vs. TAIR 10
Match: AT2G17760.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 336.7 bits (862), Expect = 2.9e-92
Identity = 172/363 (47.38%), Postives = 232/363 (63.91%), Query Frame = 0

Query: 28  FKFSIHHRFSDAVKGIIDSEGLPEKHSPEYYATLVHRDRLVHGRRLAASNGSKELTFAGG 87
           F F  HHRFSD V G++  +GLP + S +YY  + HRDRL+ GRRLA  + S  +TF+ G
Sbjct: 33  FGFEFHHRFSDQVVGVLPGDGLPNRDSSKYYRVMAHRDRLIRGRRLANEDQSL-VTFSDG 92

Query: 88  NATYLMVNAGFLLYANISIGTPALDFFVALDTGSNLFWLPCECRSC--PTSTPSG-KVPF 147
           N T  +   GFL YAN+++GTP+  F VALDTGS+LFWLPC+C +C      P G  +  
Sbjct: 93  NETVRVDALGFLHYANVTVGTPSDWFMVALDTGSDLFWLPCDCTNCVRELKAPGGSSLDL 152

Query: 148 NHYSPNASKTSSTVPCSNALCELSNKCTSNQNTCPYKVKYGAGNESSTGYLVEDVVSLIT 207
           N YSPNAS TS+ VPC++ LC   ++C S ++ CPY+++Y +   SSTG LVEDV+ L++
Sbjct: 153 NIYSPNASSTSTKVPCNSTLCTRGDRCASPESDCPYQIRYLSNGTSSTGVLVEDVLHLVS 212

Query: 208 DDSQLKPVKAKITFGCGKVQTGRFARHAAPNGLIGLGMERISVPSFLANQGLISDSFSMC 267
           +D   K + A++TFGCG+VQTG F   AAPNGL GLG+E ISVPS LA +G+ ++SFSMC
Sbjct: 213 NDKSSKAIPARVTFGCGQVQTGVFHDGAAPNGLFGLGLEDISVPSVLAKEGIAANSFSMC 272

Query: 268 FGHDELGRIDFGDTGTPDQKETPINSNPSFPHYNVTITQIIVGEKTNDVQFTAIFDGGAS 327
           FG+D  GRI FGD G+ DQ+ETP+N     P YN+T+T+I VG  T D++F A+FD G S
Sbjct: 273 FGNDGAGRISFGDKGSVDQRETPLNIRQPHPTYNITVTKISVGGNTGDLEFDAVFDSGTS 332

Query: 328 FTHLAEPHSWILIHTRF----------------------IFSLYAETIDCPVVNFTMKGG 366
           FT+L +  ++ LI   F                        S   ++   P VN TMKGG
Sbjct: 333 FTYLTDA-AYTLISESFNSLALDKRYQTTDSELPFEYCYALSPNKDSFQYPAVNLTMKGG 392

BLAST of Cp4.1LG10g10370 vs. TAIR 10
Match: AT4G35880.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 319.3 bits (817), Expect = 4.8e-87
Identity = 168/338 (49.70%), Postives = 223/338 (65.98%), Query Frame = 0

Query: 3   FGAQMLLVFSVFFLSGGLRSGDASSFKFSIHHRFSDAVKGIIDSEG----LPEKHSPEYY 62
           F   + L+  +  LS G  S +   F F +HHRFSD VK   DS G     P K S EY+
Sbjct: 6   FKTTLFLIPILMLLSFG--SCNGRIFTFEMHHRFSDEVKQWSDSTGRFAKFPPKGSFEYF 65

Query: 63  ATLVHRDRLVHGRRLAASNGSKE--LTFAGGNATYLMVNAGFLLYANISIGTPALDFFVA 122
             LV RD L+ GRRL+ S    E  LTF+ GN+T  + + GFL Y  + +GTP + F VA
Sbjct: 66  NALVLRDWLIRGRRLSESESESESSLTFSDGNSTSRISSLGFLHYTTVKLGTPGMRFMVA 125

Query: 123 LDTGSNLFWLPCECRSC-PT--STPSGKVPFNHYSPNASKTSSTVPCSNALCELSNKCTS 182
           LDTGS+LFW+PC+C  C PT  +T + +   + Y+P  S T+  V C+N+LC   N+C  
Sbjct: 126 LDTGSDLFWVPCDCGKCAPTEGATYASEFELSIYNPKVSTTNKKVTCNNSLCAQRNQCLG 185

Query: 183 NQNTCPYKVKYGAGNESSTGYLVEDVVSLITDDSQLKPVKAKITFGCGKVQTGRFARHAA 242
             +TCPY V Y +   S++G L+EDV+ L T+D   + V+A +TFGCG+VQ+G F   AA
Sbjct: 186 TFSTCPYMVSYVSAQTSTSGILMEDVMHLTTEDKNPERVEAYVTFGCGQVQSGSFLDIAA 245

Query: 243 PNGLIGLGMERISVPSFLANQGLISDSFSMCFGHDELGRIDFGDTGTPDQKETPINSNPS 302
           PNGL GLGME+ISVPS LA +GL++DSFSMCFGHD +GRI FGD G+ DQ+ETP N NPS
Sbjct: 246 PNGLFGLGMEKISVPSVLAREGLVADSFSMCFGHDGVGRISFGDKGSSDQEETPFNLNPS 305

Query: 303 FPHYNVTITQIIVGEKTNDVQFTAIFDGGASFTHLAEP 332
            P+YN+T+T++ VG    D +FTA+FD G SFT+L +P
Sbjct: 306 HPNYNITVTRVRVGTTLIDDEFTALFDTGTSFTYLVDP 341

BLAST of Cp4.1LG10g10370 vs. TAIR 10
Match: AT3G51330.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 291.6 bits (745), Expect = 1.1e-78
Identity = 162/340 (47.65%), Postives = 212/340 (62.35%), Query Frame = 0

Query: 1   MAFGAQMLLVFSVFFLSGGLRSGDAS-SFKFSIHHRFSDAVKGIIDSEGL-PEKHSPEYY 60
           M    Q+ ++ S+  +  GL   +AS  F F +HH FSD VK  +  + L PEK S EY+
Sbjct: 1   MVVARQVFVLLSLLVVCWGLERCEASGKFSFEVHHMFSDRVKQSLGLDDLVPEKGSLEYF 60

Query: 61  ATLVHRDRLVHGRRLAASNGSKELTFAGGNATYLMVNAGFLLYANISIGTPALDFFVALD 120
             L  RDRL+ GR LA++N    +TF  GN T  +   GFL YAN+S+GTPA  F VALD
Sbjct: 61  KVLAQRDRLIRGRGLASNNEETPITFMRGNRTISIDLLGFLHYANVSVGTPATWFLVALD 120

Query: 121 TGSNLFWLPCEC-----RSCPTSTPSGKVPFNHYSPNASKTSSTVPCSNALCELSNKCTS 180
           TGS+LFWLPC C     R       S   P N YSPN S TSS++ CS+  C  S++C+S
Sbjct: 121 TGSDLFWLPCNCGSTCIRDLKEVGLSQSRPLNLYSPNTSSTSSSIRCSDDRCFGSSRCSS 180

Query: 181 NQNTCPYKVKYGAGNESSTGYLVEDVVSLITDDSQLKPVKAKITFGCGKVQTGRFARHAA 240
             ++CPY+++Y + +  +TG L EDV+ L+T+D  L+PVKA IT GCGK QTG     AA
Sbjct: 181 PASSCPYQIQYLSKDTFTTGTLFEDVLHLVTEDEGLEPVKANITLGCGKNQTGFLQSSAA 240

Query: 241 PNGLIGLGMERISVPSFLANQGLISDSFSMCFGH--DELGRIDFGDTGTPDQKETPINSN 300
            NGL+GLG++  SVPS LA   + ++SFSMCFG+  D +GRI FGD G  DQ ETP+   
Sbjct: 241 VNGLLGLGLKDYSVPSILAKAKITANSFSMCFGNIIDVVGRISFGDKGYTDQMETPLLPT 300

Query: 301 PSFPHYNVTITQIIVGEKTNDVQFTAIFDGGASFTHLAEP 332
              P Y V++T++ VG     VQ  A+FD G SFTHL EP
Sbjct: 301 EPSPTYAVSVTEVSVGGDAVGVQLLALFDTGTSFTHLLEP 340

BLAST of Cp4.1LG10g10370 vs. TAIR 10
Match: AT3G51350.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 277.3 bits (708), Expect = 2.1e-74
Identity = 164/395 (41.52%), Postives = 222/395 (56.20%), Query Frame = 0

Query: 1   MAFGAQMLLVFSVFFLSGGLRSGDAS-SFKFSIHHRFSDAVKGIID-SEGLPEKHSPEYY 60
           M    Q+ ++ SV  +  G    +A+  F F +HH FSD+VK  +   + +PE+ S EY+
Sbjct: 1   MDVARQVFVLLSVLVVCWGFERCEATGKFGFEVHHIFSDSVKQSLGLGDLVPEQGSLEYF 60

Query: 61  ATLVHRDRLVHGRRLAASNGSKELTFAGGNATYLMVNAGFLLYANISIGTPALDFFVALD 120
             L HRDRL+ GR LA++N    +TF GGN T  +   G L YAN+S+GTP   F VALD
Sbjct: 61  KVLAHRDRLIRGRGLASNNDETPITFDGGNLTVSVKLLGSLYYANVSVGTPPSSFLVALD 120

Query: 121 TGSNLFWLPCEC-----RSCPTSTPSGKVPFNHYSPNASKTSSTVPCSNALCELSNKCTS 180
           TGS+LFWLPC C     R          VP N Y+PNAS TSS++ CS+  C  S KC+S
Sbjct: 121 TGSDLFWLPCNCGTTCIRDLEDIGVPQSVPLNLYTPNASTTSSSIRCSDKRCFGSKKCSS 180

Query: 181 NQNTCPYKVKYGAGNESSTGYLVEDVVSLITDDSQLKPVKAKITFGCGKVQTGRFARHAA 240
             + CPY++ Y + +  + G L++DV+ L T+D  L PVKA +T GCG+ QTG F R+ +
Sbjct: 181 PSSICPYQISY-SNSTGTKGTLLQDVLHLATEDENLTPVKANVTLGCGQKQTGLFQRNNS 240

Query: 241 PNGLIGLGMERISVPSFLANQGLISDSFSMCFGH--DELGRIDFGDTGTPDQKETPINSN 300
            NG++GLG++  SVPS LA   + ++SFSMCFG     +GRI FGD G  DQ+ETP  S 
Sbjct: 241 VNGVLGLGIKGYSVPSLLAKANITANSFSMCFGRVIGNVGRISFGDRGYTDQEETPFISV 300

Query: 301 PSFPHYNVTITQIIVGEKTNDVQFTAIFDGGASFTHLAEPHSWILIHT------------ 360
                Y V I+ + V     D++  A FD G+SFTHL EP   +L  +            
Sbjct: 301 APSTAYGVNISGVSVAGDPVDIRLFAKFDTGSSFTHLREPAYGVLTKSFDELVEDRRRPV 360

Query: 361 ------RFIFSL--YAETIDCPVVNFTMKGGDDFI 367
                  F + L   A TI  P+V  T  GG   I
Sbjct: 361 DPELPFEFCYDLSPNATTIQFPLVEMTFIGGSKII 394

BLAST of Cp4.1LG10g10370 vs. TAIR 10
Match: AT3G51360.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 277.3 bits (708), Expect = 2.1e-74
Identity = 169/391 (43.22%), Postives = 235/391 (60.10%), Query Frame = 0

Query: 16  LSGGLRSGDASSFKFSIHHRFSDAVKGIIDSEGLPEKHSPEYYATLVHRDRLVHGRRLAA 75
           +S GL S  + S  F IHHRFS+ VK ++   GLPE  S +YY  LVHRDR   GR+L +
Sbjct: 10  MSLGLASSVSGSLSFEIHHRFSEQVKTVLGGHGLPEMGSLDYYKALVHRDR---GRQLTS 69

Query: 76  SNGSK-ELTFAGGNATYLMVNAGFLLYANISIGTPALDFFVALDTGSNLFWLPCECRS-C 135
           +N ++  ++FA GN+T       FL YAN++IGTPA  F VALDTGS+LFWLPC C S C
Sbjct: 70  NNNNQTTISFAQGNST---EEISFLHYANVTIGTPAQWFLVALDTGSDLFWLPCNCNSTC 129

Query: 136 PTS--TPSG-KVPFNHYSPNASKTSSTVPCSNALCELSNKCTSNQNTCPYKVKYGAGNES 195
             S  T  G ++  N Y+P+ SK+SS V C++ LC L N+C S  + CPY+++Y +    
Sbjct: 130 VRSMETDQGERIKLNIYNPSKSKSSSKVTCNSTLCALRNRCISPVSDCPYRIRYLSPGSK 189

Query: 196 STGYLVEDVVSLITDDSQLKPVKAKITFGCGKVQTGRFARHAAPNGLIGLGMERISVPSF 255
           STG LVEDV+ + T++ + +   A+ITFGC + Q G F +  A NG++GL +  I+VP+ 
Sbjct: 190 STGVLVEDVIHMSTEEGEAR--DARITFGCSESQLGLF-KEVAVNGIMGLAIADIAVPNM 249

Query: 256 LANQGLISDSFSMCFGHDELGRIDFGDTGTPDQKETPINSNPSFPHYNVTITQIIVGEKT 315
           L   G+ SDSFSMCFG +  G I FGD G+ DQ ETP++   S   Y+V+IT+  VG+ T
Sbjct: 250 LVKAGVASDSFSMCFGPNGKGTISFGDKGSSDQLETPLSGTISPMFYDVSITKFKVGKVT 309

Query: 316 NDVQFTAIFDGGASFTHLAEPHSWILIHTRFIFSL----YAETIDCPV------------ 375
            D +FTA FD G + T L EP+ +  + T F  S+     ++++D P             
Sbjct: 310 VDTEFTATFDSGTAVTWLIEPY-YTALTTNFHLSVPDRRLSKSVDSPFEFCYIITSTSDE 369

Query: 376 -----VNFTMKGG---DDFIP-LGYDNGAGT 377
                V+F MKGG   D F P L +D   G+
Sbjct: 370 DKLPSVSFEMKGGAAYDVFSPILVFDTSDGS 390

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q8VYV94.1e-9147.38Aspartyl protease family protein 1 OS=Arabidopsis thaliana OX=3702 GN=APF1 PE=2 ... [more]
Q9LX205.2e-5439.88Aspartic proteinase-like protein 1 OS=Arabidopsis thaliana OX=3702 GN=At5g10080 ... [more]
Q4V3D22.6e-2132.38Aspartic proteinase 36 OS=Arabidopsis thaliana OX=3702 GN=A36 PE=1 SV=1[more]
Q9LEW31.1e-1932.34Aspartyl protease AED1 OS=Arabidopsis thaliana OX=3702 GN=AED1 PE=2 SV=1[more]
Q9S9K41.1e-1930.65Aspartic proteinase 39 OS=Arabidopsis thaliana OX=3702 GN=A39 PE=1 SV=2[more]
Match NameE-valueIdentityDescription
XP_023544679.11.50e-29683.17aspartyl protease family protein 1-like [Cucurbita pepo subsp. pepo][more]
KAG6603889.14.44e-29282.58Aspartyl protease family protein 1, partial [Cucurbita argyrosperma subsp. soror... [more]
XP_022949713.11.85e-29081.43aspartyl protease family protein 1-like isoform X1 [Cucurbita moschata][more]
XP_022949714.15.10e-29081.40aspartyl protease family protein 1-like isoform X2 [Cucurbita moschata][more]
KAG7034068.11.32e-28390.27Aspartyl protease family protein 1 [Cucurbita argyrosperma subsp. argyrosperma][more]
Match NameE-valueIdentityDescription
A0A6J1GCU48.95e-29181.43aspartyl protease family protein 1-like isoform X1 OS=Cucurbita moschata OX=3662... [more]
A0A6J1GDN32.47e-29081.40aspartyl protease family protein 1-like isoform X2 OS=Cucurbita moschata OX=3662... [more]
A0A6J1ISF22.36e-27879.53aspartyl protease family protein 1-like OS=Cucurbita maxima OX=3661 GN=LOC111477... [more]
A0A6J1EEK32.70e-18455.66aspartyl protease family protein 1-like OS=Cucurbita moschata OX=3662 GN=LOC1114... [more]
A0A6J1BTF72.98e-17654.00aspartyl protease family protein 1-like OS=Momordica charantia OX=3673 GN=LOC111... [more]
Match NameE-valueIdentityDescription
AT2G17760.12.9e-9247.38Eukaryotic aspartyl protease family protein [more]
AT4G35880.14.8e-8749.70Eukaryotic aspartyl protease family protein [more]
AT3G51330.11.1e-7847.65Eukaryotic aspartyl protease family protein [more]
AT3G51350.12.1e-7441.52Eukaryotic aspartyl protease family protein [more]
AT3G51360.12.1e-7443.22Eukaryotic aspartyl protease family protein [more]
InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR021109Aspartic peptidase domain superfamilyGENE3D2.40.70.10Acid Proteasescoord: 93..277
e-value: 3.5E-43
score: 149.8
IPR021109Aspartic peptidase domain superfamilySUPERFAMILY50630Acid proteasescoord: 94..366
IPR032861Xylanase inhibitor, N-terminalPFAMPF14543TAXi_Ncoord: 101..277
e-value: 2.8E-38
score: 131.9
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 387..401
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 377..409
NoneNo IPR availablePANTHERPTHR13683:SF826ASPARTYL PROTEASE FAMILY PROTEIN 1coord: 5..365
IPR001461Aspartic peptidase A1 familyPANTHERPTHR13683ASPARTYL PROTEASEScoord: 5..365
IPR033121Peptidase family A1 domainPROSITEPS51767PEPTIDASE_A1coord: 100..442
score: 20.842337
IPR034164Pepsin-like domainCDDcd05471pepsin_likecoord: 101..328
e-value: 4.26451E-29
score: 113.289

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG10g10370.1Cp4.1LG10g10370.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006508 proteolysis
molecular_function GO:0004190 aspartic-type endopeptidase activity