Sgr018207 (gene) Monk fruit (Qingpiguo) v1

Overview
Name: Sgr018207
Type: gene
Organism: Siraitia grosvenorii (Monk fruit (Qingpiguo) v1)
Description: Eukaryotic aspartyl protease family protein
Location: tig00153145: 187051 .. 188412 (+)
RNA-Seq Expression: Sgr018207
Synteny: Sgr018207
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCGGTTTCTTCCGCTCTCTGTTTCTTCTACATTCTCCTATTCTCTGCTGTTTCCGCCATTGACAGCAGCAACGTCATTACCCTCCCTCTCTCGGCCTTTCCTCACCATTCATCTTCAGACCCATTGGAGACTCTGAACTTCCTCGCTTCTGCTTCTCTAAGCAGAGCCCATCAAATCAAGAGCCCGAAATCCAACTCCTCCATCTCCAAATCTCCTCTCTCCCCCCATAGCTATGGAGCTTACTCTGCTCCACTCAGCTTCGGGACGCCCCCACAGACCCTGCATTTGATCTTCGATACAGGTAGCAGCCTCGTTTGGTTTCCTTGTACTTCCCGATATCTCTGCTCCGAGTGTTCTTTCCCCAAGGTAGATCCCGCCGGAATCCCCAGATTTGTTCCGAAATTATCTACCTCTTCGAAGCTTCTCGGTTGCCAGAATCCGAAATGTGCCTGGATATTTGGTCCCGACGTGAAATCTCAGTGCCGGAGCTGTAATCCTACGACGGAGAACTGTACCCAAACTTGCCCTGCTTATGTTGTTCAGTATGGTTCTGGCTCGACGGCCGGGCTTTTGCTCTCGGAGACGCTTGATTTTCCGGAAAAGAAAATACCCAATTTTGTTGTTGGGTGTTCGTTCTTGTCGATCCACCAACCCTCTGGGATTGCCGGATTCGGCCGAGGATCCGAATCGTTGCCGTCGCAGATGGGTCTCAAGAAATTCTCCTACTGCCTCGCGTCCCGTCGATTCGATGACACGCCGCATTCCGGCGAGCTTGTTCTAGATTCCGGCGGCGAGAAGACCGGTGACCTCAGCTACACACCGTTCCGGAAGAACCCCTCTGTATCTAACCACGCTTACAAAGAATACTATTACTTAACCATACGCAAAATCATCGTCGGCAACCAGGCCGTGAAGGTGCCGTACAAGTTTCTGGCGCCGGGACCCGACGGAAGCGGCGGATCTATAATCGACTCCGGCTCCACCTTCACATTTCTGGAAAAACCGGTGTTCGAGGCGGTGGCCCAAGAGTTTGAGAAGCAGTTGGCGAATCATACGAGAGCCACCGACGTAGAATCTCTGACCGGATTACGGCCGTGCTTCGATATTTCGAAAGAGAAATCGGTGGAGTTCCCGGAGTTGATCTTCCAGTTCAAAGGCGGAGCGAAACTGGCACTGCCGTTGAACAACTATTTCGCTCTGGTCAGTAGCTCCGGCGTGGCGTGTTTGACGATCGTGACGCAGAAAGTGGCGGCCGGTGGGCCGTCTGTGATATTGGGTGCGTTTCAGCAGCAGAATTTCTATGTAGAGTATGATTTGGTAAACGATAGACTAGGATTTCGACAACAGACTTGCAGTTAG

mRNA sequence

ATGGCGGTTTCTTCCGCTCTCTGTTTCTTCTACATTCTCCTATTCTCTGCTGTTTCCGCCATTGACAGCAGCAACGTCATTACCCTCCCTCTCTCGGCCTTTCCTCACCATTCATCTTCAGACCCATTGGAGACTCTGAACTTCCTCGCTTCTGCTTCTCTAAGCAGAGCCCATCAAATCAAGAGCCCGAAATCCAACTCCTCCATCTCCAAATCTCCTCTCTCCCCCCATAGCTATGGAGCTTACTCTGCTCCACTCAGCTTCGGGACGCCCCCACAGACCCTGCATTTGATCTTCGATACAGGTAGCAGCCTCGTTTGGTTTCCTTGTACTTCCCGATATCTCTGCTCCGAGTGTTCTTTCCCCAAGGTAGATCCCGCCGGAATCCCCAGATTTGTTCCGAAATTATCTACCTCTTCGAAGCTTCTCGGTTGCCAGAATCCGAAATGTGCCTGGATATTTGGTCCCGACGTGAAATCTCAGTGCCGGAGCTGTAATCCTACGACGGAGAACTGTACCCAAACTTGCCCTGCTTATGTTGTTCAGTATGGTTCTGGCTCGACGGCCGGGCTTTTGCTCTCGGAGACGCTTGATTTTCCGGAAAAGAAAATACCCAATTTTGTTGTTGGGTGTTCGTTCTTGTCGATCCACCAACCCTCTGGGATTGCCGGATTCGGCCGAGGATCCGAATCGTTGCCGTCGCAGATGGGTCTCAAGAAATTCTCCTACTGCCTCGCGTCCCGTCGATTCGATGACACGCCGCATTCCGGCGAGCTTGTTCTAGATTCCGGCGGCGAGAAGACCGGTGACCTCAGCTACACACCGTTCCGGAAGAACCCCTCTGTATCTAACCACGCTTACAAAGAATACTATTACTTAACCATACGCAAAATCATCGTCGGCAACCAGGCCGTGAAGGTGCCGTACAAGTTTCTGGCGCCGGGACCCGACGGAAGCGGCGGATCTATAATCGACTCCGGCTCCACCTTCACATTTCTGGAAAAACCGGTGTTCGAGGCGGTGGCCCAAGAGTTTGAGAAGCAGTTGGCGAATCATACGAGAGCCACCGACGTAGAATCTCTGACCGGATTACGGCCGTGCTTCGATATTTCGAAAGAGAAATCGGTGGAGTTCCCGGAGTTGATCTTCCAGTTCAAAGGCGGAGCGAAACTGGCACTGCCGTTGAACAACTATTTCGCTCTGGTCAGTAGCTCCGGCGTGGCGTGTTTGACGATCGTGACGCAGAAAGTGGCGGCCGGTGGGCCGTCTGTGATATTGGGTGCGTTTCAGCAGCAGAATTTCTATGTAGAGTATGATTTGGTAAACGATAGACTAGGATTTCGACAACAGACTTGCAGTTAG

Coding sequence (CDS)

ATGGCGGTTTCTTCCGCTCTCTGTTTCTTCTACATTCTCCTATTCTCTGCTGTTTCCGCCATTGACAGCAGCAACGTCATTACCCTCCCTCTCTCGGCCTTTCCTCACCATTCATCTTCAGACCCATTGGAGACTCTGAACTTCCTCGCTTCTGCTTCTCTAAGCAGAGCCCATCAAATCAAGAGCCCGAAATCCAACTCCTCCATCTCCAAATCTCCTCTCTCCCCCCATAGCTATGGAGCTTACTCTGCTCCACTCAGCTTCGGGACGCCCCCACAGACCCTGCATTTGATCTTCGATACAGGTAGCAGCCTCGTTTGGTTTCCTTGTACTTCCCGATATCTCTGCTCCGAGTGTTCTTTCCCCAAGGTAGATCCCGCCGGAATCCCCAGATTTGTTCCGAAATTATCTACCTCTTCGAAGCTTCTCGGTTGCCAGAATCCGAAATGTGCCTGGATATTTGGTCCCGACGTGAAATCTCAGTGCCGGAGCTGTAATCCTACGACGGAGAACTGTACCCAAACTTGCCCTGCTTATGTTGTTCAGTATGGTTCTGGCTCGACGGCCGGGCTTTTGCTCTCGGAGACGCTTGATTTTCCGGAAAAGAAAATACCCAATTTTGTTGTTGGGTGTTCGTTCTTGTCGATCCACCAACCCTCTGGGATTGCCGGATTCGGCCGAGGATCCGAATCGTTGCCGTCGCAGATGGGTCTCAAGAAATTCTCCTACTGCCTCGCGTCCCGTCGATTCGATGACACGCCGCATTCCGGCGAGCTTGTTCTAGATTCCGGCGGCGAGAAGACCGGTGACCTCAGCTACACACCGTTCCGGAAGAACCCCTCTGTATCTAACCACGCTTACAAAGAATACTATTACTTAACCATACGCAAAATCATCGTCGGCAACCAGGCCGTGAAGGTGCCGTACAAGTTTCTGGCGCCGGGACCCGACGGAAGCGGCGGATCTATAATCGACTCCGGCTCCACCTTCACATTTCTGGAAAAACCGGTGTTCGAGGCGGTGGCCCAAGAGTTTGAGAAGCAGTTGGCGAATCATACGAGAGCCACCGACGTAGAATCTCTGACCGGATTACGGCCGTGCTTCGATATTTCGAAAGAGAAATCGGTGGAGTTCCCGGAGTTGATCTTCCAGTTCAAAGGCGGAGCGAAACTGGCACTGCCGTTGAACAACTATTTCGCTCTGGTCAGTAGCTCCGGCGTGGCGTGTTTGACGATCGTGACGCAGAAAGTGGCGGCCGGTGGGCCGTCTGTGATATTGGGTGCGTTTCAGCAGCAGAATTTCTATGTAGAGTATGATTTGGTAAACGATAGACTAGGATTTCGACAACAGACTTGCAGTTAG

Protein sequence

MAVSSALCFFYILLFSAVSAIDSSNVITLPLSAFPHHSSSDPLETLNFLASASLSRAHQIKSPKSNSSISKSPLSPHSYGAYSAPLSFGTPPQTLHLIFDTGSSLVWFPCTSRYLCSECSFPKVDPAGIPRFVPKLSTSSKLLGCQNPKCAWIFGPDVKSQCRSCNPTTENCTQTCPAYVVQYGSGSTAGLLLSETLDFPEKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKFSYCLASRRFDDTPHSGELVLDSGGEKTGDLSYTPFRKNPSVSNHAYKEYYYLTIRKIIVGNQAVKVPYKFLAPGPDGSGGSIIDSGSTFTFLEKPVFEAVAQEFEKQLANHTRATDVESLTGLRPCFDISKEKSVEFPELIFQFKGGAKLALPLNNYFALVSSSGVACLTIVTQKVAAGGPSVILGAFQQQNFYVEYDLVNDRLGFRQQTCS
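The protein sequence above can be cross-checked against the coding sequence by translating it. The following is a minimal sketch, assuming the standard nuclear genetic code applies; the `translate` helper and the table construction are illustrative and not part of the database. It translates the first ten and last six codons of the CDS, copied from the record above, and the results agree with the start (MAVSSALCFF) and end (QQTCS) of the listed protein.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumed to apply here).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # "X" marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 18 nucleotides of the CDS shown above.
cds_start = "ATGGCGGTTTCTTCCGCTCTCTGTTTCTTC"
cds_end = "CAACAGACTTGCAGTTAG"

print(translate(cds_start))  # MAVSSALCFF
print(translate(cds_end))    # QQTCS
```

Passing the full CDS string to `translate` would give the complete polypeptide for comparison with the record. The annotated location (187051 .. 188412) spans 188412 - 187051 + 1 = 1362 bp.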
Homology
BLAST of Sgr018207 vs. NCBI nr
Match: XP_038905730.1 (probable aspartyl protease At4g16563 [Benincasa hispida])

HSP 1 Score: 810.1 bits (2091), Expect = 9.9e-231
Identity = 398/456 (87.28%), Positives = 426/456 (93.42%), Query Frame = 0

Query: 1   MAVSSALCFFYILLFSAVSAIDSSNVITLPLSAFPHHSSSDPLETLNFLASASLSRAHQI 60
           MA  S+L FFYILLFS+VSAI ++N ITLPL+AFPH SSSDPL+TL FLASAS +RAHQI
Sbjct: 1   MAPPSSLSFFYILLFSSVSAIANTNPITLPLNAFPHLSSSDPLQTLTFLASASQNRAHQI 60

Query: 61  KSPKSNSSISKSPLSPHSYGAYSAPLSFGTPPQTLHLIFDTGSSLVWFPCTSRYLCSECS 120
           K+PKSN S+SKSPL PHSYGAYS PLSFGTP QTLHLIFDTGSSLVWFPCTSRYLCSECS
Sbjct: 61  KTPKSN-SVSKSPLFPHSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECS 120

Query: 121 FPKVDPAGIPRFVPKLSTSSKLLGCQNPKCAWIFGPDVKSQCRSCNPTTENCTQTCPAYV 180
           FPK+DP GIPRFVPKLS+SSKL+GCQNPKCAWIFGP+VKSQCRSCNP TENCTQTCPAYV
Sbjct: 121 FPKIDPTGIPRFVPKLSSSSKLVGCQNPKCAWIFGPEVKSQCRSCNPKTENCTQTCPAYV 180

Query: 181 VQYGSGSTAGLLLSETLDFPEKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKK 240
           VQYGSGSTAGLLLSETLDFP+KKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKK
Sbjct: 181 VQYGSGSTAGLLLSETLDFPDKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKK 240

Query: 241 FSYCLASRRFDDTPHSGELVLDSGGEKTGDLSYTPFRKNPSVSNHAYKEYYYLTIRKIIV 300
           F+YCLASR+FDD+PHSGEL+LDS G KT  LSYTPFR+NPSVSNHAYKEYYYL IRKI V
Sbjct: 241 FAYCLASRKFDDSPHSGELILDSTGVKTSGLSYTPFRQNPSVSNHAYKEYYYLNIRKIFV 300

Query: 301 GNQAVKVPYKFLAPGPDGSGGSIIDSGSTFTFLEKPVFEAVAQEFEKQLANHTRATDVES 360
           GNQAVKVPYKFL PGPDG+GGSIIDSGSTFTF++KPVFEAVAQEFEKQLAN TRATDVES
Sbjct: 301 GNQAVKVPYKFLVPGPDGNGGSIIDSGSTFTFMDKPVFEAVAQEFEKQLANRTRATDVES 360

Query: 361 LTGLRPCFDISKEKSVEFPELIFQFKGGAKLALPLNNYFALVSSSGVACLTIVTQKVAA- 420
           LTGLRPCFDISK+KSVEFPELIFQFKGGAK ALPL+NYFALVSSSGVACLT+VT K  A 
Sbjct: 361 LTGLRPCFDISKDKSVEFPELIFQFKGGAKWALPLSNYFALVSSSGVACLTVVTHKTEAG 420

Query: 421 --GGPSVILGAFQQQNFYVEYDLVNDRLGFRQQTCS 454
             GGPSVI GAFQQQNFYVEYDLVN++LGFRQQTC+
Sbjct: 421 GGGGPSVIFGAFQQQNFYVEYDLVNEKLGFRQQTCT 455
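The percentages on each HSP statistics line are the identical or positive-scoring column counts divided by the alignment length. The sketch below recomputes them from such a line; the `hsp_percentages` helper and its regular expression are illustrative assumptions, not part of any BLAST tool, and the pattern accepts the label with or without the second "i" so that it also matches unedited exports.

```python
import re

# Matches "Identity = 398/456" and "Positives = 426/456" (second "i" optional).
HSP_STATS = re.compile(r"(Identity|Posi?tives)\s*=\s*(\d+)/(\d+)")


def hsp_percentages(line: str) -> dict:
    """Return percent identity and percent positives recomputed from the counts."""
    result = {}
    for label, matched, aligned in HSP_STATS.findall(line):
        key = "identity" if label == "Identity" else "positives"
        result[key] = round(100 * int(matched) / int(aligned), 2)
    return result


# Statistics line of the first hit (XP_038905730.1) shown above.
line = "Identity = 398/456 (87.28%), Positives = 426/456 (93.42%), Query Frame = 0"
print(hsp_percentages(line))  # {'identity': 87.28, 'positives': 93.42}
```

The denominator is the alignment length including gap columns, which is why it can exceed the length of either sequence.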

BLAST of Sgr018207 vs. NCBI nr
Match: KAG7028143.1 (Aspartic proteinase nepenthesin-2, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 802.4 bits (2071), Expect = 2.1e-228
Identity = 394/458 (86.03%), Positives = 426/458 (93.01%), Query Frame = 0

Query: 1   MAVSSALCFFYILLFSAVSAI--DSSNVITLPLSAFPHHSSSDPLETLNFLASASLSRAH 60
           MA    LCF YILL  +VSAI   ++N ITLPLSAFPH SSSDPL+ LNFLASAS +RAH
Sbjct: 1   MAAPPPLCFSYILLLFSVSAIVDANANSITLPLSAFPHPSSSDPLQNLNFLASASQNRAH 60

Query: 61  QIKSPKSNSSISKSPLSPHSYGAYSAPLSFGTPPQTLHLIFDTGSSLVWFPCTSRYLCSE 120
           QIK+PKSN S+SKSPLSPHSYGAYSAPLSFGTPPQTLHLIFDTGSSLVWFPCTS+YLCS+
Sbjct: 61  QIKTPKSN-SVSKSPLSPHSYGAYSAPLSFGTPPQTLHLIFDTGSSLVWFPCTSKYLCSQ 120

Query: 121 CSFPKVDPAGIPRFVPKLSTSSKLLGCQNPKCAWIFGPDVKSQCRSCNPTTENCTQTCPA 180
           CSFPK+DP  IPRFVPKLS+SSKL+GCQNPKCAW+FGPDVKSQCR+CNP TENCTQTCPA
Sbjct: 121 CSFPKIDPLRIPRFVPKLSSSSKLVGCQNPKCAWVFGPDVKSQCRNCNPKTENCTQTCPA 180

Query: 181 YVVQYGSGSTAGLLLSETLDFPEKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGL 240
           Y VQYGSGSTAGLLLSETLDFP++KIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGL
Sbjct: 181 YAVQYGSGSTAGLLLSETLDFPDQKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGL 240

Query: 241 KKFSYCLASRRFDDTPHSGELVLDSGGEKTGDLSYTPFRKNPSVSNHAYKEYYYLTIRKI 300
           KKF+YCLASR+FDD+PHSGEL+LDSGG KTGDL+YTPFR+NPSVSNHAYKEYYYL+IRKI
Sbjct: 241 KKFAYCLASRKFDDSPHSGELILDSGGAKTGDLTYTPFRQNPSVSNHAYKEYYYLSIRKI 300

Query: 301 IVGNQAVKVPYKFLAPGPDGSGGSIIDSGSTFTFLEKPVFEAVAQEFEKQLANHTRATDV 360
           +VGNQ VKVPYK+L PG DGSGGSIIDSGSTFTF++KPVFEAVA+ FEKQLAN TRATDV
Sbjct: 301 LVGNQDVKVPYKYLVPGSDGSGGSIIDSGSTFTFMDKPVFEAVAEAFEKQLANRTRATDV 360

Query: 361 ESLTGLRPCFDISKEKSVEFPELIFQFKGGAKLALPLNNYFALVSSSGVACLTIVTQKVA 420
           ES TGLRPCFDISKEKSVEFPELIFQFKGGAK ALPLNNYFALVSSSGVACLT+VT K A
Sbjct: 361 ESATGLRPCFDISKEKSVEFPELIFQFKGGAKWALPLNNYFALVSSSGVACLTVVTHKEA 420

Query: 421 AG---GPSVILGAFQQQNFYVEYDLVNDRLGFRQQTCS 454
           +G   GPSVILGAFQQQNFYVEYDLVN+RLGFRQQ+CS
Sbjct: 421 SGGGSGPSVILGAFQQQNFYVEYDLVNERLGFRQQSCS 457

BLAST of Sgr018207 vs. NCBI nr
Match: XP_022982947.1 (probable aspartyl protease At4g16563 [Cucurbita maxima])

HSP 1 Score: 802.0 bits (2070), Expect = 2.7e-228
Identity = 389/456 (85.31%), Positives = 421/456 (92.32%), Query Frame = 0

Query: 1   MAVSSALCFFYILLFSAVSAIDSSNVITLPLSAFPHHSSSDPLETLNFLASASLSRAHQI 60
           MA    LCFFYILL S+VSAI  +N IT+PLS+FPHHSSSDPL+TLNFLASAS +RAHQI
Sbjct: 1   MAPPPPLCFFYILLVSSVSAIADTNPITIPLSSFPHHSSSDPLQTLNFLASASQNRAHQI 60

Query: 61  KSPKSNS-SISKSPLSPHSYGAYSAPLSFGTPPQTLHLIFDTGSSLVWFPCTSRYLCSEC 120
           K+PKS S S+SKSPLSPHSYGAYS PLSFGTPPQTLHLIFDTGSSLVW PCTS+YLCSEC
Sbjct: 61  KAPKSESNSVSKSPLSPHSYGAYSTPLSFGTPPQTLHLIFDTGSSLVWLPCTSKYLCSEC 120

Query: 121 SFPKVDPAGIPRFVPKLSTSSKLLGCQNPKCAWIFGPDVKSQCRSCNPTTENCTQTCPAY 180
           SFPK+DPAGIPRF+PKLS++SKL+GCQNPKCAWIFGPDVKSQCRSCNP TENCTQTCPAY
Sbjct: 121 SFPKIDPAGIPRFIPKLSSTSKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENCTQTCPAY 180

Query: 181 VVQYGSGSTAGLLLSETLDFPEKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLK 240
           VVQYGSGSTAGLLLSETLDFP+KK  NFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLK
Sbjct: 181 VVQYGSGSTAGLLLSETLDFPDKKFTNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLK 240

Query: 241 KFSYCLASRRFDDTPHSGELVLDSGGEKTGDLSYTPFRKNPSVSNHAYKEYYYLTIRKII 300
           KF+YCLASR+FDD+PH+GEL+LDS G KT  LSYTPFR+NPSVSNHAYKEYYYLTIRKI 
Sbjct: 241 KFAYCLASRKFDDSPHAGELILDSSGAKTSGLSYTPFRQNPSVSNHAYKEYYYLTIRKIF 300

Query: 301 VGNQAVKVPYKFLAPGPDGSGGSIIDSGSTFTFLEKPVFEAVAQEFEKQLANHTRATDVE 360
           VG +AVKVPYK+L PGPDG+GGSIIDSGSTFTF++KPVFEAVAQE EKQLAN TRATDVE
Sbjct: 301 VGKKAVKVPYKYLVPGPDGNGGSIIDSGSTFTFMDKPVFEAVAQEIEKQLANRTRATDVE 360

Query: 361 SLTGLRPCFDISKEKSVEFPELIFQFKGGAKLALPLNNYFALVSSSGVACLTIVTQKVA- 420
           SLTGLRPCFDISK+KSVEFPEL FQ KGGAK  LPL+NYFALVSSSGVACLT+VT K A 
Sbjct: 361 SLTGLRPCFDISKDKSVEFPELTFQLKGGAKWGLPLSNYFALVSSSGVACLTVVTHKTAD 420

Query: 421 -AGGPSVILGAFQQQNFYVEYDLVNDRLGFRQQTCS 454
             GGPS+ILGAFQQQNFYVEYDLVN ++GFRQQTCS
Sbjct: 421 SGGGPSIILGAFQQQNFYVEYDLVNQKIGFRQQTCS 456

BLAST of Sgr018207 vs. NCBI nr
Match: XP_023528159.1 (probable aspartyl protease At4g16563 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 801.6 bits (2069), Expect = 3.5e-228
Identity = 391/456 (85.75%), Positives = 421/456 (92.32%), Query Frame = 0

Query: 1   MAVSSALCFFYILLFSAVSAIDSSNVITLPLSAFPHHSSSDPLETLNFLASASLSRAHQI 60
           MA    LCFFYILL S+VSAI  +N ITLPLS+FPHHSSSDPL+TLNFLASAS +RAHQI
Sbjct: 1   MAPPPLLCFFYILLLSSVSAIADTNPITLPLSSFPHHSSSDPLQTLNFLASASQNRAHQI 60

Query: 61  KSPKSNS-SISKSPLSPHSYGAYSAPLSFGTPPQTLHLIFDTGSSLVWFPCTSRYLCSEC 120
           K+PKS S S+SKSPLSPHSYGAYS PLSFGTP QTLHLIFDTGSSLVW PCTS+YLCSEC
Sbjct: 61  KAPKSKSNSVSKSPLSPHSYGAYSTPLSFGTPSQTLHLIFDTGSSLVWLPCTSKYLCSEC 120

Query: 121 SFPKVDPAGIPRFVPKLSTSSKLLGCQNPKCAWIFGPDVKSQCRSCNPTTENCTQTCPAY 180
           SFPK+DPAGIPRF+PKLS+SSKL+GCQNPKCAWIFGPDVKSQCRSCNP TENCTQTCPAY
Sbjct: 121 SFPKIDPAGIPRFIPKLSSSSKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENCTQTCPAY 180

Query: 181 VVQYGSGSTAGLLLSETLDFPEKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLK 240
           VVQYGSGSTAGLLLSETLDF  KKI NFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLK
Sbjct: 181 VVQYGSGSTAGLLLSETLDFANKKITNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLK 240

Query: 241 KFSYCLASRRFDDTPHSGELVLDSGGEKTGDLSYTPFRKNPSVSNHAYKEYYYLTIRKII 300
           KF+YCLASR+FDD+PH+GEL+LDS G KT  L+YTPFR+NPSVSNHAYKEYYYLTIRKI 
Sbjct: 241 KFAYCLASRKFDDSPHAGELILDSSGAKTSGLTYTPFRQNPSVSNHAYKEYYYLTIRKIF 300

Query: 301 VGNQAVKVPYKFLAPGPDGSGGSIIDSGSTFTFLEKPVFEAVAQEFEKQLANHTRATDVE 360
           VGN+AVKVPYK+L PGPDG+GGSIIDSGSTFTF++KPVFEAVAQE EKQLAN TRATDVE
Sbjct: 301 VGNKAVKVPYKYLVPGPDGNGGSIIDSGSTFTFMDKPVFEAVAQEIEKQLANRTRATDVE 360

Query: 361 SLTGLRPCFDISKEKSVEFPELIFQFKGGAKLALPLNNYFALVSSSGVACLTIVTQKVA- 420
           SLTGLRPCFDISK+KSVEFPEL FQ KGGAK ALPL+NYFALVSSSGVACLT+VT K A 
Sbjct: 361 SLTGLRPCFDISKDKSVEFPELTFQLKGGAKWALPLSNYFALVSSSGVACLTVVTHKAAD 420

Query: 421 -AGGPSVILGAFQQQNFYVEYDLVNDRLGFRQQTCS 454
             GGPS+ILGAFQQQNFYVEYDLVN ++GFRQQTCS
Sbjct: 421 SGGGPSIILGAFQQQNFYVEYDLVNQKIGFRQQTCS 456

BLAST of Sgr018207 vs. NCBI nr
Match: XP_022940517.1 (probable aspartyl protease At4g16563 [Cucurbita moschata])

HSP 1 Score: 801.2 bits (2068), Expect = 4.6e-228
Identity = 394/458 (86.03%), Positives = 425/458 (92.79%), Query Frame = 0

Query: 1   MAVSSALCFFYILLFSAVSAI--DSSNVITLPLSAFPHHSSSDPLETLNFLASASLSRAH 60
           MA    LCF YILL  +VSAI   ++N ITLPLSA PH SSSDPL+ LNFLASAS +RAH
Sbjct: 1   MAAPPPLCFSYILLLFSVSAIVDANANSITLPLSALPHPSSSDPLQNLNFLASASQNRAH 60

Query: 61  QIKSPKSNSSISKSPLSPHSYGAYSAPLSFGTPPQTLHLIFDTGSSLVWFPCTSRYLCSE 120
           QIK+PKSN S+SKSPLSPHSYGAYSAPLSFGTPPQTLHLIFDTGSSLVWFPCTS+YLCS+
Sbjct: 61  QIKTPKSN-SVSKSPLSPHSYGAYSAPLSFGTPPQTLHLIFDTGSSLVWFPCTSKYLCSQ 120

Query: 121 CSFPKVDPAGIPRFVPKLSTSSKLLGCQNPKCAWIFGPDVKSQCRSCNPTTENCTQTCPA 180
           CSFPK+DP  IPRFVPKLS+SSKL+GCQNPKCAW+FGPDVKSQCR+CNP TENCTQTCPA
Sbjct: 121 CSFPKIDPLRIPRFVPKLSSSSKLVGCQNPKCAWVFGPDVKSQCRNCNPKTENCTQTCPA 180

Query: 181 YVVQYGSGSTAGLLLSETLDFPEKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGL 240
           Y VQYGSGSTAGLLLSETLDFP++KIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGL
Sbjct: 181 YAVQYGSGSTAGLLLSETLDFPDQKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGL 240

Query: 241 KKFSYCLASRRFDDTPHSGELVLDSGGEKTGDLSYTPFRKNPSVSNHAYKEYYYLTIRKI 300
           KKF+YCLASR+FDD+PHSGEL+LDSGG KTGDL+YTPFR+NPSVSNHAYKEYYYL+IRKI
Sbjct: 241 KKFAYCLASRKFDDSPHSGELILDSGGAKTGDLTYTPFRQNPSVSNHAYKEYYYLSIRKI 300

Query: 301 IVGNQAVKVPYKFLAPGPDGSGGSIIDSGSTFTFLEKPVFEAVAQEFEKQLANHTRATDV 360
           +VGNQ VKVPYK+L PG DGSGGSIIDSGSTFTF++KPVFEAVA+ FEKQLAN TRATDV
Sbjct: 301 LVGNQDVKVPYKYLVPGSDGSGGSIIDSGSTFTFMDKPVFEAVAEAFEKQLANRTRATDV 360

Query: 361 ESLTGLRPCFDISKEKSVEFPELIFQFKGGAKLALPLNNYFALVSSSGVACLTIVTQKVA 420
           ES TGLRPCFDISKEKSVEFPELIFQFKGGAK ALPLNNYFALVSSSGVACLT+VT K A
Sbjct: 361 ESATGLRPCFDISKEKSVEFPELIFQFKGGAKWALPLNNYFALVSSSGVACLTVVTHKEA 420

Query: 421 AG---GPSVILGAFQQQNFYVEYDLVNDRLGFRQQTCS 454
           AG   GPSVILGAFQQQNFYVEYDLVN+RLGFRQQ+CS
Sbjct: 421 AGGGSGPSVILGAFQQQNFYVEYDLVNERLGFRQQSCS 457

BLAST of Sgr018207 vs. ExPASy Swiss-Prot
Match: Q940R4 (Probable aspartyl protease At4g16563 OS=Arabidopsis thaliana OX=3702 GN=At4g16563 PE=2 SV=1)

HSP 1 Score: 195.3 bits (495), Expect = 1.5e-48
Identity = 144/487 (29.57%), Positives = 224/487 (46.00%), Query Frame = 0

Query: 17  AVSAIDSSNVITLPLSAFPHHSSSDPLETLNFLASASLSRAHQIKSPKSNSSISKSPLSP 76
           +VS++ +  ++ L  S      SS PL  L   +S S +R  +    +    +S  P+S 
Sbjct: 21  SVSSLSTPLLLHLSHSLSTSKHSSSPLHLLKSSSSRSSARFRRHHHKQQQQQLS-LPIS- 80

Query: 77  HSYGAYSAPLSFGTPPQTLHLIFDTGSSLVWFPCTSRYLCSECSFPKVDPAGIPRFVPKL 136
            S   Y   LS G+    + L  DTGS LVWFPC   + C  C    + P+        L
Sbjct: 81  -SGSDYLISLSVGSSSSAVSLYLDTGSDLVWFPCRP-FTCILCESKPLPPSP----PSSL 140

Query: 137 STSSKLLGCQNPKCAWIFGPDVKSQ-CRSCN-----PTTENCTQT---CPAYVVQYGSGS 196
           S+S+  + C +P C+        S  C   N       T +C  +   CP +   YG GS
Sbjct: 141 SSSATTVSCSSPSCSAAHSSLPSSDLCAISNCPLDFIETGDCNTSSYPCPPFYYAYGDGS 200

Query: 197 TAGLLLSETLDFPEKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGL------KKF 256
               L S++L  P   + NF  GC+  ++ +P G+AGFGRG  SLP+Q+ +        F
Sbjct: 201 LVAKLYSDSLSLPSVSVSNFTFGCAHTTLAEPIGVAGFGRGRLSLPAQLAVHSPHLGNSF 260

Query: 257 SYCLASRRFDD--TPHSGELVL--------------------DSGGEKTGDLSYTPFRKN 316
           SYCL S  FD         L+L                    D   +K  +  +T   +N
Sbjct: 261 SYCLVSHSFDSDRVRRPSPLILGRFVDKKEKRVGTTDDHDDGDDEKKKKNEFVFTEMLEN 320

Query: 317 PSVSNHAYKEYYYLTIRKIIVGNQAVKVPYKFLAPGPDGSGGSIIDSGSTFTFLEKPVFE 376
           P    H Y  +Y ++++ I +G + +  P        +G GG ++DSG+TFT L    + 
Sbjct: 321 P---KHPY--FYSVSLQGISIGKRNIPAPAMLRRIDKNGGGGVVVDSGTTFTMLPAKFYN 380

Query: 377 AVAQEFEKQLAN-HTRATDVESLTGLRPCFDISKEKSVEFPELIFQFKGG-AKLALPLNN 436
           +V +EF+ ++   H RA  VE  +G+ PC+ ++  ++V+ P L+  F G  + + LP  N
Sbjct: 381 SVVEEFDSRVGRVHERADRVEPSSGMSPCYYLN--QTVKVPALVLHFAGNRSSVTLPRRN 440

Query: 437 YFALVSSSG--------VACLTIVT---QKVAAGGPSVILGAFQQQNFYVEYDLVNDRLG 454
           YF      G        + CL ++    +    GG   ILG +QQQ F V YDL+N R+G
Sbjct: 441 YFYEFMDGGDGKEEKRKIGCLMLMNGGDESELRGGTGAILGNYQQQGFEVVYDLLNRRVG 492

BLAST of Sgr018207 vs. ExPASy Swiss-Prot
Match: Q6F4N5 (Aspartyl protease 25 OS=Oryza sativa subsp. japonica OX=39947 GN=AP25 PE=2 SV=1)

HSP 1 Score: 154.1 bits (388), Expect = 3.9e-36
Identity = 131/463 (28.29%), Positives = 203/463 (43.84%), Query Frame = 0

Query: 1   MAVSSALCFFYILLFSAVSAIDSSNVITLPLSAFPHHSSSDPLETLNFLASASLSRAHQI 60
           MA ++ +    +LL + V+A  +     L +    H SS  PLE++  LA    +R   +
Sbjct: 1   MAATTTIPLLLLLLAATVAAAAAE----LSVYHNVHPSSPSPLESIIALARDDDARLLFL 60

Query: 61  KSPKSNSSISKSPL-SPHSYGAYSAPLSFGTPPQTLHLIFDTGSSLVWFPCTSRYLCSEC 120
            S  + + +S +P+ S  +  +Y      G+P Q L L  DT +   W  C+    C   
Sbjct: 61  SSKAATAGVSSAPVASGQAPPSYVVRAGLGSPSQQLLLALDTSADATWAHCSPCGTCPSS 120

Query: 121 SFPKVDPAGIPRFVPKLSTSSKLLGCQNPKCAWIFGPDVKSQCRSCNPTTENCTQTCPAY 180
           S           F P  S+S   L C +  C    G    +     +      T    A+
Sbjct: 121 SL----------FAPANSSSYASLPCSSSWCPLFQGQACPAPQGGGDAAPPPATLPTCAF 180

Query: 181 VVQYGSGSTAGLLLSETLDFPEKKIPNFVVGCSFLSIHQPS------GIAGFGRGSESLP 240
              +   S    L S+TL   +  IPN+  GC   S+  P+      G+ G GRG  +L 
Sbjct: 181 SKPFADASFQAALASDTLRLGKDAIPNYTFGC-VSSVTGPTTNMPRQGLLGLGRGPMALL 240

Query: 241 SQMGL---KKFSYCLASRRFDDTPHSGELVLDSGGEKTGDLSYTPFRKNPSVSNHAYKEY 300
           SQ G      FSYCL S R      SG L L +GG +   + YTP  +NP  S+      
Sbjct: 241 SQAGSLYNGVFSYCLPSYR--SYYFSGSLRLGAGGGQPRSVRYTPMLRNPHRSS-----L 300

Query: 301 YYLTIRKIIVGNQAVKVPYKFLAPGPDGSGGSIIDSGSTFTFLEKPVFEAVAQEFEKQLA 360
           YY+ +  + VG+  VKVP    A       G+++DSG+  T    PV+ A+ +EF +Q+A
Sbjct: 301 YYVNVTGLSVGHAWVKVPAGSFAFDAATGAGTVVDSGTVITRWTAPVYAALREEFRRQVA 360

Query: 361 NHTRATDVESLTGLRPCFDISKEKSVEFPELIFQFKGGAKLALPLNNYFALVSSSGVACL 420
             +  T   SL     CF+  +  +   P +     GG  LALP+ N     S++ +ACL
Sbjct: 361 APSGYT---SLGAFDTCFNTDEVAAGGAPAVTVHMDGGVDLALPMENTLIHSSATPLACL 420

Query: 421 TIVTQKVAAGGPSVILGAFQQQNFYVEYDLVNDRLGFRQQTCS 454
            +            ++   QQQN  V +D+ N R+GF +++C+
Sbjct: 421 AMAEAPQNVNSVVNVIANLQQQNIRVVFDVANSRVGFAKESCN 438

BLAST of Sgr018207 vs. ExPASy Swiss-Prot
Match: Q766C2 (Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 SV=1)

HSP 1 Score: 152.9 bits (385), Expect = 8.6e-36
Identity = 132/407 (32.43%), Positives = 181/407 (44.47%), Query Frame = 0

Query: 56  RAHQIKSPKSNSSISKSPLSPHSYGAYSAPLSFGTPPQTLHLIFDTGSSLVWFPCTSRYL 115
           R   I +   +SS  ++P+     G Y   ++ GTP  +   I DTGS L+W  C     
Sbjct: 71  RMRSINAMLQSSSGIETPVYAGD-GEYLMNVAIGTPDSSFSAIMDTGSDLIWTQCEP--- 130

Query: 116 CSEC-SFPKVDPAGIPRFVPKLSTSSKLLGCQNPKCAWIFGPDVKSQCRSCNPTTENCTQ 175
           C++C S P       P F P+ S+S   L C++  C      D+ S         E C  
Sbjct: 131 CTQCFSQP------TPIFNPQDSSSFSTLPCESQYC-----QDLPS---------ETCNN 190

Query: 176 TCPAYVVQYGSGSTA-GLLLSETLDFPEKKIPNFVVGC----SFLSIHQPSGIAGFGRGS 235
               Y   YG GST  G + +ET  F    +PN   GC            +G+ G G G 
Sbjct: 191 NECQYTYGYGDGSTTQGYMATETFTFETSSVPNIAFGCGEDNQGFGQGNGAGLIGMGWGP 250

Query: 236 ESLPSQMGLKKFSYCLASRRFDDTPHSGELVLDSGGEKTGDLSYTPFRK--NPSVSNHAY 295
            SLPSQ+G+ +FSYC+ S     +P +  L   + G   G  S T      NP+      
Sbjct: 251 LSLPSQLGVGQFSYCMTSYG-SSSPSTLALGSAASGVPEGSPSTTLIHSSLNPT------ 310

Query: 296 KEYYYLTIRKIIVGNQAVKVPYKFLAPGPDGSGGSIIDSGSTFTFLEKPVFEAVAQEFEK 355
             YYY+T++ I VG   + +P        DG+GG IIDSG+T T+L +  + AVAQ F  
Sbjct: 311 --YYYITLQGITVGGDNLGIPSSTFQLQDDGTGGMIIDSGTTLTYLPQDAYNAVAQAFTD 370

Query: 356 QLANHTRATDVESLTGLRPCFDISKEKS-VEFPELIFQFKGGAKLALPLNNYFALVS-SS 415
           Q+      T  ES +GL  CF    + S V+ PE+  QF GG    L L     L+S + 
Sbjct: 371 QI---NLPTVDESSSGLSTCFQQPSDGSTVQVPEISMQFDGG---VLNLGEQNILISPAE 430

Query: 416 GVACLTIVTQKVAAGGPSVILGAFQQQNFYVEYDLVNDRLGFRQQTC 453
           GV CL + +          I G  QQQ   V YDL N  + F    C
Sbjct: 431 GVICLAMGSSSQLG---ISIFGNIQQQETQVLYDLQNLAVSFVPTQC 435

BLAST of Sgr018207 vs. ExPASy Swiss-Prot
Match: Q9LNJ3 (Aspartyl protease family protein 2 OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 SV=1)

HSP 1 Score: 152.1 bits (383), Expect = 1.5e-35
Identity = 142/466 (30.47%), Positives = 201/466 (43.13%), Query Frame = 0

Query: 15  FSAVSAIDSSNVITLPLSAFPHHSSSDPLETLNFLASASLSR-AHQIKS----------- 74
           F + S  +SS+ ITL L      SS+   +T + L S+ L R + ++KS           
Sbjct: 60  FESGSDSESSSSITLNLDHIDALSSN---KTPDELFSSRLQRDSRRVKSIATLAAQIPGR 119

Query: 75  -------PKSNSSISKSPLSPHSYGAYSAPLSFGTPPQTLHLIFDTGSSLVWFPCTSRYL 134
                  P   SS   S LS  S G Y   L  GTP + ++++ DTGS +VW  C     
Sbjct: 120 NVTHAPRPGGFSSSVVSGLSQGS-GEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAP--- 179

Query: 135 CSECSFPKVDPAGIPRFVPKLSTSSKLLGCQNPKCAWIFGPDVKSQCRSCNPTTENCTQT 194
           C  C + + DP     F P+ S +   + C +P C        +     CN   + C   
Sbjct: 180 CRRC-YSQSDPI----FDPRKSKTYATIPCSSPHCR-------RLDSAGCNTRRKTC--- 239

Query: 195 CPAYVVQYGSGS-TAGLLLSETLDFPEKKIPNFVVGCSFLS---IHQPSGIAGFGRGSES 254
              Y V YG GS T G   +ETL F   ++    +GC   +       +G+ G G+G  S
Sbjct: 240 --LYQVSYGDGSFTVGDFSTETLTFRRNRVKGVALGCGHDNEGLFVGAAGLLGLGKGKLS 299

Query: 255 LPSQMGLK---KFSYCLASRRFDDTPHSGELVLDSGGEKTGDLSYTPFRKNPSVSNHAYK 314
            P Q G +   KFSYCL  R     P S   V+      +    +TP   NP +      
Sbjct: 300 FPGQTGHRFNQKFSYCLVDRSASSKPSS---VVFGNAAVSRIARFTPLLSNPKLDT---- 359

Query: 315 EYYYLTIRKIIVGNQAVK-VPYKFLAPGPDGSGGSIIDSGSTFTFLEKPVFEAVAQEFEK 374
            +YY+ +  I VG   V  V          G+GG IIDSG++ T L +P + A+   F  
Sbjct: 360 -FYYVGLLGISVGGTRVPGVTASLFKLDQIGNGGVIIDSGTSVTRLIRPAYIAMRDAFRV 419

Query: 375 QLANHTRATDVESLTGLRPCFDISKEKSVEFPELIFQFKGGAKLALPLNNYFALVSSSGV 434
                 RA D         CFD+S    V+ P ++  F+ GA ++LP  NY   V ++G 
Sbjct: 420 GAKTLKRAPDFSLFD---TCFDLSNMNEVKVPTVVLHFR-GADVSLPATNYLIPVDTNGK 479

Query: 435 ACLTIVTQKVAAGGPSVILGAFQQQNFYVEYDLVNDRLGFRQQTCS 454
            C          GG S+I G  QQQ F V YDL + R+GF    C+
Sbjct: 480 FCFAFAG---TMGGLSII-GNIQQQGFRVVYDLASSRVGFAPGGCA 485

BLAST of Sgr018207 vs. ExPASy Swiss-Prot
Match: Q766C3 (Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 SV=1)

HSP 1 Score: 151.0 bits (380), Expect = 3.3e-35
Identity = 121/383 (31.59%), Positives = 171/383 (44.65%), Query Frame = 0

Query: 80  GAYSAPLSFGTPPQTLHLIFDTGSSLVWFPCTSRYLCSECSFPKVDPAGIPRFVPKLSTS 139
           G Y   LS GTP Q    I DTGS L+W  C     C++C          P F P+ S+S
Sbjct: 93  GEYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQP---CTQCF-----NQSTPIFNPQGSSS 152

Query: 140 SKLLGCQNPKCAWIFGPDVKSQCRSCNPTTENCTQTCPAYVVQYGSGS-TAGLLLSETLD 199
              L C +  C  +  P               C+     Y   YG GS T G + +ETL 
Sbjct: 153 FSTLPCSSQLCQALSSP--------------TCSNNFCQYTYGYGDGSETQGSMGTETLT 212

Query: 200 FPEKKIPNFVVGC----SFLSIHQPSGIAGFGRGSESLPSQMGLKKFSYCLASRRFDDTP 259
           F    IPN   GC            +G+ G GRG  SLPSQ+ + KFSYC+       TP
Sbjct: 213 FGSVSIPNITFGCGENNQGFGQGNGAGLVGMGRGPLSLPSQLDVTKFSYCMTPIG-SSTP 272

Query: 260 HSGELVLDSGGEKTGDLSYTPFRKNPS---VSNHAYKEYYYLTIRKIIVGNQAVKV-PYK 319
               L+L       G L+ +    +P+   + +     +YY+T+  + VG+  + + P  
Sbjct: 273 --SNLLL-------GSLANSVTAGSPNTTLIQSSQIPTFYYITLNGLSVGSTRLPIDPSA 332

Query: 320 FLAPGPDGSGGSIIDSGSTFTFLEKPVFEAVAQEFEKQLANHTRATDVESLTGLRPCFDI 379
           F     +G+GG IIDSG+T T+     +++V QEF  Q+          S +G   CF  
Sbjct: 333 FALNSNNGTGGIIIDSGTTLTYFVNNAYQSVRQEFISQI---NLPVVNGSSSGFDLCFQT 392

Query: 380 SKEKS-VEFPELIFQFKGGAKLALPLNNYFALVSSSGVACLTIVTQKVAAGGPSVILGAF 439
             + S ++ P  +  F GG  L LP  NYF +  S+G+ CL + +   ++ G S I G  
Sbjct: 393 PSDPSNLQIPTFVMHFDGG-DLELPSENYF-ISPSNGLICLAMGS---SSQGMS-IFGNI 434

Query: 440 QQQNFYVEYDLVNDRLGFRQQTC 453
           QQQN  V YD  N  + F    C
Sbjct: 453 QQQNMLVVYDTGNSVVSFASAQC 434

BLAST of Sgr018207 vs. ExPASy TrEMBL
Match: A0A6J1IXY3 (probable aspartyl protease At4g16563 OS=Cucurbita maxima OX=3661 GN=LOC111481640 PE=3 SV=1)

HSP 1 Score: 802.0 bits (2070), Expect = 1.3e-228
Identity = 389/456 (85.31%), Positives = 421/456 (92.32%), Query Frame = 0

Query: 1   MAVSSALCFFYILLFSAVSAIDSSNVITLPLSAFPHHSSSDPLETLNFLASASLSRAHQI 60
           MA    LCFFYILL S+VSAI  +N IT+PLS+FPHHSSSDPL+TLNFLASAS +RAHQI
Sbjct: 1   MAPPPPLCFFYILLVSSVSAIADTNPITIPLSSFPHHSSSDPLQTLNFLASASQNRAHQI 60

Query: 61  KSPKSNS-SISKSPLSPHSYGAYSAPLSFGTPPQTLHLIFDTGSSLVWFPCTSRYLCSEC 120
           K+PKS S S+SKSPLSPHSYGAYS PLSFGTPPQTLHLIFDTGSSLVW PCTS+YLCSEC
Sbjct: 61  KAPKSESNSVSKSPLSPHSYGAYSTPLSFGTPPQTLHLIFDTGSSLVWLPCTSKYLCSEC 120

Query: 121 SFPKVDPAGIPRFVPKLSTSSKLLGCQNPKCAWIFGPDVKSQCRSCNPTTENCTQTCPAY 180
           SFPK+DPAGIPRF+PKLS++SKL+GCQNPKCAWIFGPDVKSQCRSCNP TENCTQTCPAY
Sbjct: 121 SFPKIDPAGIPRFIPKLSSTSKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENCTQTCPAY 180

Query: 181 VVQYGSGSTAGLLLSETLDFPEKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLK 240
           VVQYGSGSTAGLLLSETLDFP+KK  NFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLK
Sbjct: 181 VVQYGSGSTAGLLLSETLDFPDKKFTNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLK 240

Query: 241 KFSYCLASRRFDDTPHSGELVLDSGGEKTGDLSYTPFRKNPSVSNHAYKEYYYLTIRKII 300
           KF+YCLASR+FDD+PH+GEL+LDS G KT  LSYTPFR+NPSVSNHAYKEYYYLTIRKI 
Sbjct: 241 KFAYCLASRKFDDSPHAGELILDSSGAKTSGLSYTPFRQNPSVSNHAYKEYYYLTIRKIF 300

Query: 301 VGNQAVKVPYKFLAPGPDGSGGSIIDSGSTFTFLEKPVFEAVAQEFEKQLANHTRATDVE 360
           VG +AVKVPYK+L PGPDG+GGSIIDSGSTFTF++KPVFEAVAQE EKQLAN TRATDVE
Sbjct: 301 VGKKAVKVPYKYLVPGPDGNGGSIIDSGSTFTFMDKPVFEAVAQEIEKQLANRTRATDVE 360

Query: 361 SLTGLRPCFDISKEKSVEFPELIFQFKGGAKLALPLNNYFALVSSSGVACLTIVTQKVA- 420
           SLTGLRPCFDISK+KSVEFPEL FQ KGGAK  LPL+NYFALVSSSGVACLT+VT K A 
Sbjct: 361 SLTGLRPCFDISKDKSVEFPELTFQLKGGAKWGLPLSNYFALVSSSGVACLTVVTHKTAD 420

Query: 421 -AGGPSVILGAFQQQNFYVEYDLVNDRLGFRQQTCS 454
             GGPS+ILGAFQQQNFYVEYDLVN ++GFRQQTCS
Sbjct: 421 SGGGPSIILGAFQQQNFYVEYDLVNQKIGFRQQTCS 456

BLAST of Sgr018207 vs. ExPASy TrEMBL
Match: A0A6J1FJU4 (probable aspartyl protease At4g16563 OS=Cucurbita moschata OX=3662 GN=LOC111446087 PE=3 SV=1)

HSP 1 Score: 801.2 bits (2068), Expect = 2.2e-228
Identity = 394/458 (86.03%), Positives = 425/458 (92.79%), Query Frame = 0

Query: 1   MAVSSALCFFYILLFSAVSAI--DSSNVITLPLSAFPHHSSSDPLETLNFLASASLSRAH 60
           MA    LCF YILL  +VSAI   ++N ITLPLSA PH SSSDPL+ LNFLASAS +RAH
Sbjct: 1   MAAPPPLCFSYILLLFSVSAIVDANANSITLPLSALPHPSSSDPLQNLNFLASASQNRAH 60

Query: 61  QIKSPKSNSSISKSPLSPHSYGAYSAPLSFGTPPQTLHLIFDTGSSLVWFPCTSRYLCSE 120
           QIK+PKSN S+SKSPLSPHSYGAYSAPLSFGTPPQTLHLIFDTGSSLVWFPCTS+YLCS+
Sbjct: 61  QIKTPKSN-SVSKSPLSPHSYGAYSAPLSFGTPPQTLHLIFDTGSSLVWFPCTSKYLCSQ 120

Query: 121 CSFPKVDPAGIPRFVPKLSTSSKLLGCQNPKCAWIFGPDVKSQCRSCNPTTENCTQTCPA 180
           CSFPK+DP  IPRFVPKLS+SSKL+GCQNPKCAW+FGPDVKSQCR+CNP TENCTQTCPA
Sbjct: 121 CSFPKIDPLRIPRFVPKLSSSSKLVGCQNPKCAWVFGPDVKSQCRNCNPKTENCTQTCPA 180

Query: 181 YVVQYGSGSTAGLLLSETLDFPEKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGL 240
           Y VQYGSGSTAGLLLSETLDFP++KIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGL
Sbjct: 181 YAVQYGSGSTAGLLLSETLDFPDQKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGL 240

Query: 241 KKFSYCLASRRFDDTPHSGELVLDSGGEKTGDLSYTPFRKNPSVSNHAYKEYYYLTIRKI 300
           KKF+YCLASR+FDD+PHSGEL+LDSGG KTGDL+YTPFR+NPSVSNHAYKEYYYL+IRKI
Sbjct: 241 KKFAYCLASRKFDDSPHSGELILDSGGAKTGDLTYTPFRQNPSVSNHAYKEYYYLSIRKI 300

Query: 301 IVGNQAVKVPYKFLAPGPDGSGGSIIDSGSTFTFLEKPVFEAVAQEFEKQLANHTRATDV 360
           +VGNQ VKVPYK+L PG DGSGGSIIDSGSTFTF++KPVFEAVA+ FEKQLAN TRATDV
Sbjct: 301 LVGNQDVKVPYKYLVPGSDGSGGSIIDSGSTFTFMDKPVFEAVAEAFEKQLANRTRATDV 360

Query: 361 ESLTGLRPCFDISKEKSVEFPELIFQFKGGAKLALPLNNYFALVSSSGVACLTIVTQKVA 420
           ES TGLRPCFDISKEKSVEFPELIFQFKGGAK ALPLNNYFALVSSSGVACLT+VT K A
Sbjct: 361 ESATGLRPCFDISKEKSVEFPELIFQFKGGAKWALPLNNYFALVSSSGVACLTVVTHKEA 420

Query: 421 AG---GPSVILGAFQQQNFYVEYDLVNDRLGFRQQTCS 454
           AG   GPSVILGAFQQQNFYVEYDLVN+RLGFRQQ+CS
Sbjct: 421 AGGGSGPSVILGAFQQQNFYVEYDLVNERLGFRQQSCS 457

BLAST of Sgr018207 vs. ExPASy TrEMBL
Match: A0A6J1KTP0 (probable aspartyl protease At4g16563 OS=Cucurbita maxima OX=3661 GN=LOC111498572 PE=3 SV=1)

HSP 1 Score: 799.3 bits (2063), Expect = 8.5e-228
Identity = 394/458 (86.03%), Positives = 424/458 (92.58%), Query Frame = 0

Query: 1   MAVSSALCFFYILLFSAVSAI--DSSNVITLPLSAFPHHSSSDPLETLNFLASASLSRAH 60
           MA    LCF YILL  +VSAI   ++N ITLPLSAFPHHSSSDPL+ LNFLASAS +RAH
Sbjct: 1   MAAPPPLCFSYILLLFSVSAIVDVNANSITLPLSAFPHHSSSDPLQNLNFLASASQNRAH 60

Query: 61  QIKSPKSNSSISKSPLSPHSYGAYSAPLSFGTPPQTLHLIFDTGSSLVWFPCTSRYLCSE 120
           QIK+PKSN S+SKS LSPHSYGAYSAPLSFGTPPQTLHLIFDTGSSLVWFPCTS+YLCS+
Sbjct: 61  QIKTPKSN-SVSKSSLSPHSYGAYSAPLSFGTPPQTLHLIFDTGSSLVWFPCTSKYLCSQ 120

Query: 121 CSFPKVDPAGIPRFVPKLSTSSKLLGCQNPKCAWIFGPDVKSQCRSCNPTTENCTQTCPA 180
           CSFPK+DP  IPRFVPKLS+SSKL+GCQNPKC W+FGPDVKSQCR+CN  TENCTQTCPA
Sbjct: 121 CSFPKIDPLRIPRFVPKLSSSSKLVGCQNPKCDWVFGPDVKSQCRNCNQKTENCTQTCPA 180

Query: 181 YVVQYGSGSTAGLLLSETLDFPEKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGL 240
           YVVQYGSGSTAGLLLSETLDFP++KIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGL
Sbjct: 181 YVVQYGSGSTAGLLLSETLDFPDQKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGL 240

Query: 241 KKFSYCLASRRFDDTPHSGELVLDSGGEKTGDLSYTPFRKNPSVSNHAYKEYYYLTIRKI 300
           KKF+YCLASR+FDD+PHSGEL+LDSGGEKTGDL+YTPFR+NPSVSNHAYKEYYYL+IRKI
Sbjct: 241 KKFAYCLASRKFDDSPHSGELILDSGGEKTGDLTYTPFRQNPSVSNHAYKEYYYLSIRKI 300

Query: 301 IVGNQAVKVPYKFLAPGPDGSGGSIIDSGSTFTFLEKPVFEAVAQEFEKQLANHTRATDV 360
           +VGNQ VKVPYK+L PG DGSGGSIIDSGSTFTF++KPVFEAVA+ FEKQLAN TRATDV
Sbjct: 301 LVGNQDVKVPYKYLVPGSDGSGGSIIDSGSTFTFMDKPVFEAVAEAFEKQLANRTRATDV 360

Query: 361 ESLTGLRPCFDISKEKSVEFPELIFQFKGGAKLALPLNNYFALVSSSGVACLTIVTQKVA 420
           ES TGLRPCFDISKEKSVEFPELIFQFKGGAK  L LNNYFALVSSSGVACLT+VT K A
Sbjct: 361 ESATGLRPCFDISKEKSVEFPELIFQFKGGAKWTLALNNYFALVSSSGVACLTVVTHKEA 420

Query: 421 AG---GPSVILGAFQQQNFYVEYDLVNDRLGFRQQTCS 454
           AG   GPSVILGAFQQQNFYVEYDLVN+RLGFRQQTCS
Sbjct: 421 AGGGSGPSVILGAFQQQNFYVEYDLVNERLGFRQQTCS 457

BLAST of Sgr018207 vs. ExPASy TrEMBL
Match: A0A6J1F3G5 (probable aspartyl protease At4g16563 OS=Cucurbita moschata OX=3662 GN=LOC111441834 PE=3 SV=1)

HSP 1 Score: 797.0 bits (2057), Expect = 4.2e-227
Identity = 389/456 (85.31%), Positives = 419/456 (91.89%), Query Frame = 0

Query: 1   MAVSSALCFFYILLFSAVSAIDSSNVITLPLSAFPHHSSSDPLETLNFLASASLSRAHQI 60
           MA    LCFFYILL S+VSAI  +N ITLPLS+FPHHSSSDPL+TLNFLASAS +RAHQI
Sbjct: 1   MAPPPPLCFFYILLLSSVSAIADTNPITLPLSSFPHHSSSDPLQTLNFLASASQNRAHQI 60

Query: 61  KSPKSNS-SISKSPLSPHSYGAYSAPLSFGTPPQTLHLIFDTGSSLVWFPCTSRYLCSEC 120
           K+PKS S S+SKSPLSPHSYGAYS PLSFGTP QTLHLIFDTGSSLVW PCTS+YLCSEC
Sbjct: 61  KAPKSKSNSVSKSPLSPHSYGAYSTPLSFGTPSQTLHLIFDTGSSLVWLPCTSKYLCSEC 120

Query: 121 SFPKVDPAGIPRFVPKLSTSSKLLGCQNPKCAWIFGPDVKSQCRSCNPTTENCTQTCPAY 180
           SFPK+DPA IPRF+PKLS+SSKL+GCQNPKCAWIFGPDVKSQCRSCNP TENCTQTCPAY
Sbjct: 121 SFPKIDPARIPRFIPKLSSSSKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENCTQTCPAY 180

Query: 181 VVQYGSGSTAGLLLSETLDFPEKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLK 240
           VVQYGSGSTAGLLLSETLDFP KKI NFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLK
Sbjct: 181 VVQYGSGSTAGLLLSETLDFPNKKITNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLK 240

Query: 241 KFSYCLASRRFDDTPHSGELVLDSGGEKTGDLSYTPFRKNPSVSNHAYKEYYYLTIRKII 300
           KF+YCLASR+FDD+PH+GEL+LDS G KT  L+YTPFR+NPSVSNHAYKEYYYLTIRKI 
Sbjct: 241 KFAYCLASRKFDDSPHAGELILDSSGAKTSGLTYTPFRQNPSVSNHAYKEYYYLTIRKIF 300

Query: 301 VGNQAVKVPYKFLAPGPDGSGGSIIDSGSTFTFLEKPVFEAVAQEFEKQLANHTRATDVE 360
           VGN+AVKVPYK+L PGPDG+GGSIIDSGSTFTF++KPVFEAVAQE EKQLAN TRATDVE
Sbjct: 301 VGNKAVKVPYKYLVPGPDGNGGSIIDSGSTFTFMDKPVFEAVAQEIEKQLANRTRATDVE 360

Query: 361 SLTGLRPCFDISKEKSVEFPELIFQFKGGAKLALPLNNYFALVSSSGVACLTIVTQKVA- 420
           SLTGLRPCFDISK+KSVEFPEL F  KGGAK A PL+NYFALVSSSGVACLT+VT K A 
Sbjct: 361 SLTGLRPCFDISKDKSVEFPELTFHLKGGAKWAPPLSNYFALVSSSGVACLTVVTHKAAE 420

Query: 421 -AGGPSVILGAFQQQNFYVEYDLVNDRLGFRQQTCS 454
             GGPS+ILGAFQQQNFYVEYDLVN ++GFRQQTCS
Sbjct: 421 SGGGPSIILGAFQQQNFYVEYDLVNQKIGFRQQTCS 456

BLAST of Sgr018207 vs. ExPASy TrEMBL
Match: A0A5A7TRK2 (Aspartic proteinase nepenthesin-2 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold352G004410 PE=3 SV=1)

HSP 1 Score: 791.6 bits (2043), Expect = 1.8e-225
Identity = 387/456 (84.87%), Positives = 419/456 (91.89%), Query Frame = 0

Query: 1   MAVSSALCFFYILLFSAVSAIDSSNVITLPLSAFPHHSSSDPLETLNFLASASLSRAHQI 60
           MA  S L FFYILLFS++SAI +SN ITLPL++ PH SSSDPL+ L FLASAS +RAH+I
Sbjct: 1   MASPSPLSFFYILLFSSLSAISNSNPITLPLNSSPHLSSSDPLQALTFLASASKNRAHRI 60

Query: 61  KSPKSNSSISKSPLSPHSYGAYSAPLSFGTPPQTLHLIFDTGSSLVWFPCTSRYLCSECS 120
           K+PKSN S+SKSPLSPHSYGAYS PLSFGTP QTLHLIFDTGSSLVWFPCTSRYLC+ECS
Sbjct: 61  KTPKSN-SVSKSPLSPHSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCTECS 120

Query: 121 FPKVDPAGIPRFVPKLSTSSKLLGCQNPKCAWIFGPDVKSQCRSCNPTTENCTQTCPAYV 180
           FPK+DP GIPRFVPKLS+SSKL+GCQNPKCAWIFGPDVKSQCRSCNP TENCTQTCPAYV
Sbjct: 121 FPKIDPTGIPRFVPKLSSSSKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENCTQTCPAYV 180

Query: 181 VQYGSGSTAGLLLSETLDFPEKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKK 240
           VQYGSGSTAGLLLSETLDFP KKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKK
Sbjct: 181 VQYGSGSTAGLLLSETLDFPNKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKK 240

Query: 241 FSYCLASRRFDDTPHSGELVLDSGGEKTGDLSYTPFRKNPSVSNHAYKEYYYLTIRKIIV 300
           F+YCLASR+FDD+ HSG+L+LDS G KT  L+YT FR+NPSVSNHAYKEYYYL IRKIIV
Sbjct: 241 FAYCLASRKFDDSAHSGQLILDSSGVKTSGLTYTSFRQNPSVSNHAYKEYYYLNIRKIIV 300

Query: 301 GNQAVKVPYKFLAPGPDGSGGSIIDSGSTFTFLEKPVFEAVAQEFEKQLANHTRATDVES 360
           GNQAVKVPYK+L PGPDG+GGSIIDSGSTFTF++KPV + VAQEFEKQLAN TRATDVE+
Sbjct: 301 GNQAVKVPYKYLVPGPDGNGGSIIDSGSTFTFMDKPVLDVVAQEFEKQLANRTRATDVET 360

Query: 361 LTGLRPCFDISKEKSVEFPELIFQFKGGAKLALPLNNYFALVSSSGVACLTIVTQKV--- 420
           LTGLRPCFD+SKEKSVEFPELIFQFKGGAK ALPLNNYFALVSSSGVACLT+VT      
Sbjct: 361 LTGLRPCFDVSKEKSVEFPELIFQFKGGAKWALPLNNYFALVSSSGVACLTVVTHNTEDG 420

Query: 421 AAGGPSVILGAFQQQNFYVEYDLVNDRLGFRQQTCS 454
             GGPSVILGAFQQQNFYVEYDLVN+RLGFR+QTC+
Sbjct: 421 GGGGPSVILGAFQQQNFYVEYDLVNERLGFRKQTCT 455

BLAST of Sgr018207 vs. TAIR 10
Match: AT3G52500.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 537.3 bits (1383), Expect = 1.2e-152
Identity = 267/476 (56.09%), Positives = 346/476 (72.69%), Query Frame = 0

Query: 3   VSSALCFFYILLFSAVSAIDSSNVITLPLSAFPH--HSSSDPLETLNFLASASLSRAHQI 62
           ++S++ FF+++  S VSA      + LPLS F H   S  DP  +L  LA +S++RAH++
Sbjct: 1   MASSIFFFFLIFLSVVSA------VKLPLSPFSHSDQSPKDPYLSLRRLAESSIARAHKL 60

Query: 63  K--------------SPKSNSSISKSPLSPHSYGAYSAPLSFGTPPQTLHLIFDTGSSLV 122
           K              +  +++++ KSPLS  SYG YS  LSFGTP QT+  +FDTGSSLV
Sbjct: 61  KHGTSIKPDEDALSSTTTASATVVKSPLSAKSYGGYSVSLSFGTPSQTIPFVFDTGSSLV 120

Query: 123 WFPCTSRYLCSECSFPKVDPAGIPRFVPKLSTSSKLLGCQNPKCAWIFGPDVKSQCRSCN 182
           W PCTSRYLCS C F  +DP  IPRF+PK S+SSK++GCQ+PKC +++GP+V  QCR C+
Sbjct: 121 WLPCTSRYLCSGCDFSGLDPTLIPRFIPKNSSSSKIIGCQSPKCQFLYGPNV--QCRGCD 180

Query: 183 PTTENCTQTCPAYVVQYGSGSTAGLLLSETLDFPEKKIPNFVVGCSFLSIHQPSGIAGFG 242
           P T NCT  CP Y++QYG GSTAG+L++E LDFP+  +P+FVVGCS +S  QP+GIAGFG
Sbjct: 181 PNTRNCTVGCPPYILQYGLGSTAGVLITEKLDFPDLTVPDFVVGCSIISTRQPAGIAGFG 240

Query: 243 RGSESLPSQMGLKKFSYCLASRRFDDTPHSGELVLDSG-----GEKTGDLSYTPFRKNPS 302
           RG  SLPSQM LK+FS+CL SRRFDDT  + +L LD+G     G KT  L+YTPFRKNP+
Sbjct: 241 RGPVSLPSQMNLKRFSHCLVSRRFDDTNVTTDLDLDTGSGHNSGSKTPGLTYTPFRKNPN 300

Query: 303 VSNHAYKEYYYLTIRKIIVGNQAVKVPYKFLAPGPDGSGGSIIDSGSTFTFLEKPVFEAV 362
           VSN A+ EYYYL +R+I VG + VK+PYK+LAPG +G GGSI+DSGSTFTF+E+PVFE V
Sbjct: 301 VSNKAFLEYYYLNLRRIYVGRKHVKIPYKYLAPGTNGDGGSIVDSGSTFTFMERPVFELV 360

Query: 363 AQEFEKQLANHTRATDVESLTGLRPCFDISKEKSVEFPELIFQFKGGAKLALPLNNYFAL 422
           A+EF  Q++N+TR  D+E  TGL PCF+IS +  V  PELIF+FKGGAKL LPL+NYF  
Sbjct: 361 AEEFASQMSNYTREKDLEKETGLGPCFNISGKGDVTVPELIFEFKGGAKLELPLSNYFTF 420

Query: 423 VSSSGVACLTIVTQKV----AAGGPSVILGAFQQQNFYVEYDLVNDRLGFRQQTCS 454
           V ++   CLT+V+ K        GP++ILG+FQQQN+ VEYDL NDR GF ++ CS
Sbjct: 421 VGNTDTVCLTVVSDKTVNPSGGTGPAIILGSFQQQNYLVEYDLENDRFGFAKKKCS 468

BLAST of Sgr018207 vs. TAIR 10
Match: AT4G16563.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 195.3 bits (495), Expect = 1.1e-49
Identity = 144/487 (29.57%), Positives = 224/487 (46.00%), Query Frame = 0

Query: 17  AVSAIDSSNVITLPLSAFPHHSSSDPLETLNFLASASLSRAHQIKSPKSNSSISKSPLSP 76
           +VS++ +  ++ L  S      SS PL  L   +S S +R  +    +    +S  P+S 
Sbjct: 21  SVSSLSTPLLLHLSHSLSTSKHSSSPLHLLKSSSSRSSARFRRHHHKQQQQQLS-LPIS- 80

Query: 77  HSYGAYSAPLSFGTPPQTLHLIFDTGSSLVWFPCTSRYLCSECSFPKVDPAGIPRFVPKL 136
            S   Y   LS G+    + L  DTGS LVWFPC   + C  C    + P+        L
Sbjct: 81  -SGSDYLISLSVGSSSSAVSLYLDTGSDLVWFPCRP-FTCILCESKPLPPSP----PSSL 140

Query: 137 STSSKLLGCQNPKCAWIFGPDVKSQ-CRSCN-----PTTENCTQT---CPAYVVQYGSGS 196
           S+S+  + C +P C+        S  C   N       T +C  +   CP +   YG GS
Sbjct: 141 SSSATTVSCSSPSCSAAHSSLPSSDLCAISNCPLDFIETGDCNTSSYPCPPFYYAYGDGS 200

Query: 197 TAGLLLSETLDFPEKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGL------KKF 256
               L S++L  P   + NF  GC+  ++ +P G+AGFGRG  SLP+Q+ +        F
Sbjct: 201 LVAKLYSDSLSLPSVSVSNFTFGCAHTTLAEPIGVAGFGRGRLSLPAQLAVHSPHLGNSF 260

Query: 257 SYCLASRRFDD--TPHSGELVL--------------------DSGGEKTGDLSYTPFRKN 316
           SYCL S  FD         L+L                    D   +K  +  +T   +N
Sbjct: 261 SYCLVSHSFDSDRVRRPSPLILGRFVDKKEKRVGTTDDHDDGDDEKKKKNEFVFTEMLEN 320

Query: 317 PSVSNHAYKEYYYLTIRKIIVGNQAVKVPYKFLAPGPDGSGGSIIDSGSTFTFLEKPVFE 376
           P    H Y  +Y ++++ I +G + +  P        +G GG ++DSG+TFT L    + 
Sbjct: 321 P---KHPY--FYSVSLQGISIGKRNIPAPAMLRRIDKNGGGGVVVDSGTTFTMLPAKFYN 380

Query: 377 AVAQEFEKQLAN-HTRATDVESLTGLRPCFDISKEKSVEFPELIFQFKGG-AKLALPLNN 436
           +V +EF+ ++   H RA  VE  +G+ PC+ ++  ++V+ P L+  F G  + + LP  N
Sbjct: 381 SVVEEFDSRVGRVHERADRVEPSSGMSPCYYLN--QTVKVPALVLHFAGNRSSVTLPRRN 440

Query: 437 YFALVSSSG--------VACLTIVT---QKVAAGGPSVILGAFQQQNFYVEYDLVNDRLG 454
           YF      G        + CL ++    +    GG   ILG +QQQ F V YDL+N R+G
Sbjct: 441 YFYEFMDGGDGKEEKRKIGCLMLMNGGDESELRGGTGAILGNYQQQGFEVVYDLLNRRVG 492

BLAST of Sgr018207 vs. TAIR 10
Match: AT5G45120.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 193.7 bits (491), Expect = 3.1e-49
Identity = 146/454 (32.16%), Positives = 206/454 (45.37%), Query Frame = 0

Query: 35  PHHSSSDPLETLNFLASASL-----SRAHQIKSPKSNSSISKSPLSPHSYGAYSAPLSFG 94
           P  SSS  L      +S SL         +IK P S+  +   PL     G Y   L+ G
Sbjct: 32  PSSSSSSFLVLTLTKSSVSLPTPKSQTQERIKKPLSSVDVVMEPLREVRDG-YLITLNIG 91

Query: 95  TPPQTLHLIFDTGSSLVWFPCTS-RYLCSECSFPKVDPAGIPR-FVPKLSTSSKLLGCQN 154
           TPPQ + +  DTGS L W PC +  + C EC   K +    P  F P  S++S    C +
Sbjct: 92  TPPQAVQVYLDTGSDLTWVPCGNLSFDCIECYDLKNNDLKSPSVFSPLHSSTSFRDSCAS 151

Query: 155 PKCAWI------FGPDVKSQCRSCNPTTENCTQTCPAYVVQYGSGS-TAGLLLSETLDFP 214
             C  I      F P   + C         C + CP++   YG G   +G+L  + L   
Sbjct: 152 SFCVEIHSSDNPFDPCAVAGCSVSMLLKSTCVRPCPSFAYTYGEGGLISGILTRDILKAR 211

Query: 215 EKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGL--KKFSYCLASRRFDDTPH-SG 274
            + +P F  GC   +  +P GIAGFGRG  SLPSQ+G   K FS+C    +F + P+ S 
Sbjct: 212 TRDVPRFSFGCVTSTYREPIGIAGFGRGLLSLPSQLGFLEKGFSHCFLPFKFVNNPNISS 271

Query: 275 ELVLDSGG---EKTGDLSYTPFRKNPSVSNHAYKEYYYLTIRKIIVGNQ--AVKVPYKFL 334
            L+L +       T  L +TP    P      Y   YY+ +  I +G      +VP    
Sbjct: 272 PLILGASALSINLTDSLQFTPMLNTP-----MYPNSYYIGLESITIGTNITPTQVPLTLR 331

Query: 335 APGPDGSGGSIIDSGSTFTFLEKPVFEAVAQEFEKQLANHTRATDVESLTGLRPCFDI-- 394
                G+GG ++DSG+T+T L +P +  +    +  +  + RAT+ ES TG   C+ +  
Sbjct: 332 QFDSQGNGGMLVDSGTTYTHLPEPFYSQLLTTLQSTI-TYPRATETESRTGFDLCYKVPC 391

Query: 395 ------SKEKSVE--FPELIFQFKGGAKLALPLNNYFALVSS----SGVACLTIVTQKVA 453
                 S E  V   FP + F F   A L LP  N F  +S+    S V CL     +  
Sbjct: 392 PNNNLTSLENDVMMIFPSITFHFLNNATLLLPQGNSFYAMSAPSDGSVVQCLLFQNMEDG 451

BLAST of Sgr018207 vs. TAIR 10
Match: AT2G42980.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 171.0 bits (432), Expect = 2.2e-42
Identity = 129/395 (32.66%), Positives = 180/395 (45.57%), Query Frame = 0

Query: 80  GAYSAPLSFGTPPQTLHLIFDTGSSLVWFPCTSRYLCSECSFPKVDPAGIPRFVPKLSTS 139
           G Y   +  GTPP+   LI DTGS L W  C   Y C   +    D        PK S S
Sbjct: 158 GEYFMDVLVGTPPKHFSLILDTGSDLNWLQCLPCYDCFHQNGMFYD--------PKTSAS 217

Query: 140 SKLLGCQNPKCAWIFGPDVKSQCRSCNPTTENCTQTCPAYVVQYGSGS-TAGLLLSETLD 199
            K + C +P+C+ I  PD   QC S N       Q+CP Y   YG  S T G    ET  
Sbjct: 218 FKNITCNDPRCSLISSPDPPVQCESDN-------QSCP-YFYWYGDRSNTTGDFAVETFT 277

Query: 200 F---------PEKKIPNFVVGCSFLS---IHQPSGIAGFGRGSESLPSQMGL---KKFSY 259
                      E K+ N + GC   +       SG+ G GRG  S  SQ+       FSY
Sbjct: 278 VNLTTTEGGSSEYKVGNMMFGCGHWNRGLFSGASGLLGLGRGPLSFSSQLQSLYGHSFSY 337

Query: 260 CLASRRFDDTPHSGELVLDSGGEKTGDLSYTPFRKNPSVS--NHAYKEYYYLTIRKIIVG 319
           CL  R   +T  S +L+    GE    L++T       V+   ++ + +YY+ I+ I+VG
Sbjct: 338 CLVDRN-SNTNVSSKLIF---GEDKDLLNHTNLNFTSFVNGKENSVETFYYIQIKSILVG 397

Query: 320 NQAVKVPYKFLAPGPDGSGGSIIDSGSTFTFLEKPVFEAVAQEF-EKQLANHTRATDVES 379
            +A+ +P +      DG GG+IIDSG+T ++  +P +E +  +F EK   N+    D   
Sbjct: 398 GKALDIPEETWNISSDGDGGTIIDSGTTLSYFAEPAYEIIKNKFAEKMKENYPIFRDFPV 457

Query: 380 LTGLRPCFDIS--KEKSVEFPELIFQFKGGAKLALPLNNYFALVSSSGVACLTIVTQKVA 439
           L    PCF++S  +E ++  PEL   F  G     P  N F  +S   V    + T K  
Sbjct: 458 LD---PCFNVSGIEENNIHLPELGIAFVDGTVWNFPAENSFIWLSEDLVCLAILGTPKST 517

Query: 440 AGGPSVILGAFQQQNFYVEYDLVNDRLGFRQQTCS 454
                 I+G +QQQNF++ YD    RLGF    C+
Sbjct: 518 FS----IIGNYQQQNFHILYDTKRSRLGFTPTKCA 525

BLAST of Sgr018207 vs. TAIR 10
Match: AT1G25510.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 170.6 bits (431), Expect = 2.8e-42
Identity = 117/377 (31.03%), Positives = 168/377 (44.56%), Query Frame = 0

Query: 80  GAYSAPLSFGTPPQTLHLIFDTGSSLVWFPCTSRYLCSECSFPKVDPAGIPRFVPKLSTS 139
           G Y   +  G P + ++++ DTGS + W  CT    C++C + + +P     F P  S+S
Sbjct: 146 GEYFTRVGIGKPAREVYMVLDTGSDVNWLQCTP---CADC-YHQTEPI----FEPSSSSS 205

Query: 140 SKLLGCQNPKCAWIFGPDVKSQCRSCNPTTENCTQTCPAYVVQYGSGS-TAGLLLSETLD 199
            + L C  P+C  +      S+CR+          TC  Y V YG GS T G   +ETL 
Sbjct: 206 YEPLSCDTPQCNAL----EVSECRNA---------TC-LYEVSYGDGSYTVGDFATETLT 265

Query: 200 FPEKKIPNFVVGCSFLS---IHQPSGIAGFGRGSESLPSQMGLKKFSYCLASRRFDDTPH 259
                + N  VGC   +       +G+ G G G  +LPSQ+    FSYCL  R  D    
Sbjct: 266 IGSTLVQNVAVGCGHSNEGLFVGAAGLLGLGGGLLALPSQLNTTSFSYCLVDRDSDSAS- 325

Query: 260 SGELVLDSGGEKTGDLSYTPFRKNPSVSNHAYKEYYYLTIRKIIVGNQAVKVPYKFLAPG 319
                +D G   + D    P      + NH    +YYL +  I VG + +++P       
Sbjct: 326 ----TVDFGTSLSPDAVVAPL-----LRNHQLDTFYYLGLTGISVGGELLQIPQSSFEMD 385

Query: 320 PDGSGGSIIDSGSTFTFLEKPVFEAVAQEFEKQLANHTRATDVESLTGLRPCFDISKEKS 379
             GSGG IIDSG+  T L+  ++ ++   F K   +  +A  V        C+++S + +
Sbjct: 386 ESGSGGIIIDSGTAVTRLQTEIYNSLRDSFVKGTLDLEKAAGVAMFD---TCYNLSAKTT 445

Query: 380 VEFPELIFQFKGGAKLALPLNNYFALVSSSGVACLTIVTQKVAAGGPSVILGAFQQQNFY 439
           VE P + F F GG  LALP  NY   V S G  CL              I+G  QQQ   
Sbjct: 446 VEVPTVAFHFPGGKMLALPAKNYMIPVDSVGTFCLAF----APTASSLAIIGNVQQQGTR 483

Query: 440 VEYDLVNDRLGFRQQTC 453
           V +DL N  +GF    C
Sbjct: 506 VTFDLANSLIGFSSNKC 483

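The Identity and Positives percentages on each HSP line above are the matched-position counts divided by the alignment length (gaps included). The sketch below recomputes them from the raw counts as a sanity check. The helper name `hsp_percentages` and the regular expression are illustrative assumptions, not part of the database or of any BLAST tool.

```python
import re

# Hypothetical helper: parse one HSP statistics line as printed on this page.
# "Pos\w+" is deliberately loose so it tolerates spelling variants of "Positives".
HSP_PATTERN = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Pos\w+ = (\d+)/(\d+) \(([\d.]+)%\)"
)


def hsp_percentages(line):
    """Return recomputed and reported identity/positives percentages."""
    match = HSP_PATTERN.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    ident, ident_len, ident_rep, pos, pos_len, pos_rep = match.groups()
    return {
        # count / alignment length, expressed as a percentage
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positives_pct": round(100 * int(pos) / int(pos_len), 2),
        "reported_identity_pct": float(ident_rep),
        "reported_positives_pct": float(pos_rep),
    }


if __name__ == "__main__":
    # Statistics line of the A0A6J1F3G5 hit, copied from the report above.
    line = "Identity = 389/456 (85.31%), Positives = 419/456 (91.89%), Query Frame = 0"
    print(hsp_percentages(line))
```

The recomputed values can be compared with the reported ones to catch copy or extraction errors in the statistics lines.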
The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
XP_038905730.1 | 9.9e-231 | 87.28 | probable aspartyl protease At4g16563 [Benincasa hispida]
KAG7028143.1 | 2.1e-228 | 86.03 | Aspartic proteinase nepenthesin-2, partial [Cucurbita argyrosperma subsp. argyro...
XP_022982947.1 | 2.7e-228 | 85.31 | probable aspartyl protease At4g16563 [Cucurbita maxima]
XP_023528159.1 | 3.5e-228 | 85.75 | probable aspartyl protease At4g16563 [Cucurbita pepo subsp. pepo]
XP_022940517.1 | 4.6e-228 | 86.03 | probable aspartyl protease At4g16563 [Cucurbita moschata]

Match Name | E-value | Identity (%) | Description
Q940R4 | 1.5e-48 | 29.57 | Probable aspartyl protease At4g16563 OS=Arabidopsis thaliana OX=3702 GN=At4g1656...
Q6F4N5 | 3.9e-36 | 28.29 | Aspartyl protease 25 OS=Oryza sativa subsp. japonica OX=39947 GN=AP25 PE=2 SV=1
Q766C2 | 8.6e-36 | 32.43 | Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 S...
Q9LNJ3 | 1.5e-35 | 30.47 | Aspartyl protease family protein 2 OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 ...
Q766C3 | 3.3e-35 | 31.59 | Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 S...

Match Name | E-value | Identity (%) | Description
A0A6J1IXY3 | 1.3e-228 | 85.31 | probable aspartyl protease At4g16563 OS=Cucurbita maxima OX=3661 GN=LOC111481640...
A0A6J1FJU4 | 2.2e-228 | 86.03 | probable aspartyl protease At4g16563 OS=Cucurbita moschata OX=3662 GN=LOC1114460...
A0A6J1KTP0 | 8.5e-228 | 86.03 | probable aspartyl protease At4g16563 OS=Cucurbita maxima OX=3661 GN=LOC111498572...
A0A6J1F3G5 | 4.2e-227 | 85.31 | probable aspartyl protease At4g16563 OS=Cucurbita moschata OX=3662 GN=LOC1114418...
A0A5A7TRK2 | 1.8e-225 | 84.87 | Aspartic proteinase nepenthesin-2 OS=Cucumis melo var. makuwa OX=1194695 GN=E567...

Match Name | E-value | Identity (%) | Description
AT3G52500.1 | 1.2e-152 | 56.09 | Eukaryotic aspartyl protease family protein
AT4G16563.1 | 1.1e-49 | 29.57 | Eukaryotic aspartyl protease family protein
AT5G45120.1 | 3.1e-49 | 32.16 | Eukaryotic aspartyl protease family protein
AT2G42980.1 | 2.2e-42 | 32.66 | Eukaryotic aspartyl protease family protein
AT1G25510.1 | 2.8e-42 | 31.03 | Eukaryotic aspartyl protease family protein

InterPro
Analysis Name: InterPro Annotations of Monk fruit (Qingpiguo) v1
Date Performed: 2022-08-01
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR001461 | Aspartic peptidase A1 family | PRINTS | PR00792 | PEPSIN | coord: 424..439, score: 30.89; coord: 322..333, score: 30.89; coord: 88..108, score: 52.78
IPR021109 | Aspartic peptidase domain superfamily | GENE3D | 2.40.70.10 | Acid Proteases | coord: 63..260, e-value: 1.8E-34, score: 121.3
IPR021109 | Aspartic peptidase domain superfamily | GENE3D | 2.40.70.10 | Acid Proteases | coord: 261..453, e-value: 1.7E-50, score: 173.2
IPR021109 | Aspartic peptidase domain superfamily | SUPERFAMILY | 50630 | Acid proteases | coord: 76..452
IPR032799 | Xylanase inhibitor, C-terminal | PFAM | PF14541 | TAXi_C | coord: 291..448, e-value: 7.7E-35, score: 120.1
IPR032861 | Xylanase inhibitor, N-terminal | PFAM | PF14543 | TAXi_N | coord: 82..259, e-value: 2.8E-29, score: 102.6
None | No IPR available | PANTHER | PTHR47967:SF36 | BNACNNG47670D PROTEIN | coord: 8..452
None | No IPR available | PANTHER | PTHR47967 | OS07G0603500 PROTEIN-RELATED | coord: 8..452
IPR001969 | Aspartic peptidase, active site | PROSITE | PS00141 | ASP_PROTEASE | coord: 97..108
IPR033121 | Peptidase family A1 domain | PROSITE | PS51767 | PEPTIDASE_A1 | coord: 82..448, score: 36.746815
IPR034161 | Pepsin-like domain, plant | CDD | cd05476 | pepsin_A_like_plant | coord: 81..452, e-value: 2.92656E-83, score: 255.265
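
Each InterPro hit is located on the protein by a `coord: start..end` range. As a minimal sketch, assuming these coordinates are 1-based and inclusive, the length of a matched region is `end - start + 1`. The function name `span_length` is a hypothetical helper introduced here for illustration only.

```python
import re

# Matches the "coord: start..end" notation used in the annotation table.
COORD_PATTERN = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def span_length(alignment):
    """Length in residues of a 'coord: start..end' range.

    Assumption: coordinates are 1-based and inclusive at both ends.
    """
    match = COORD_PATTERN.search(alignment)
    if match is None:
        raise ValueError("no coord range found in %r" % alignment)
    start, end = (int(value) for value in match.groups())
    if end < start:
        raise ValueError("end precedes start in %r" % alignment)
    return end - start + 1


if __name__ == "__main__":
    # Coordinates copied from the Pfam rows of the table above.
    for name, alignment in [("TAXi_N", "coord: 82..259"), ("TAXi_C", "coord: 291..448")]:
        print(name, span_length(alignment))
```

Under that assumption, the two Pfam xylanase-inhibitor regions (TAXi_N and TAXi_C) span 178 and 158 residues respectively.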

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Sgr018207.1 | Sgr018207.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0006508 | proteolysis
molecular_function | GO:0004190 | aspartic-type endopeptidase activity