Sgr029688 (gene) Monk fruit (Qingpiguo) v1

Overview
Name: Sgr029688
Type: gene
Organism: Siraitia grosvenorii (Monk fruit (Qingpiguo) v1)
Description: ATP-dependent Clp protease proteolytic subunit
Location: tig00153449: 1813374 .. 1818283 (-)
RNA-Seq Expression: Sgr029688
Synteny: Sgr029688
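
The Location field packs the contig name, start, end, and strand into one string. A minimal sketch of how it could be split into parts follows. The helper name `parse_location` and the `Location` type are illustrative, not part of the database, and the span calculation assumes the coordinates are 1-based and inclusive, which the page does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    contig: str
    start: int
    end: int
    strand: str


def parse_location(text: str) -> Location:
    """Parse a string such as 'tig00153449: 1813374 .. 1818283 (-)'."""
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    contig, start, end, strand = m.groups()
    return Location(contig, int(start), int(end), strand)


loc = parse_location("tig00153449: 1813374 .. 1818283 (-)")
# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = loc.end - loc.start + 1
print(loc, span)
```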
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGAGCTGCTTTCACGCTGTTCTACTCTTATTCCTCATGCTTCCTTCCTTATCCTTCCGAATTCTCATCACAAACCATCAAATTTCTATCCCAAGCTCAAAACCAATCTCCCTTCTCCATCGGCTTCCTCAGTCCTCAAAGCAACCGCTCTTAAACCATCGAGAGCTCTAGCTTCCCCGTGTTCCATCCTCATGGCGGCTCCACAGACGCCTGATGCTGCCCGGAGAGGCGCTGAAACTGACGCCATGGGACTGCTTCTCAGGGAGAGGATCGTCTTCTTAGGGAACAGCATTGATGACTTTGTGGCCGATGCCATTATCAGCCAGCTCTTGCTCTTGGATGCTCAAGACTCCACCAAAGATATCAGACTATTCATTAATTCCCCCGGCGGCTCTCTCAGGTATGCACGGGCTGATTTTTTGTACATAATGTAGTTGGTGAAGATATGGAACCTGAGATTCAGGCTTATCTTTGTTCTTTCCAAATGAGTTGGATAATTGGATTTTTAGGAATATGCATGATGTGCCTAAGTTTCATGCGGAATGAGTTCATATGCTTTCCTTAGCGCGCTGAGTGCGAGAGAGGGGATAGTTGATCTCTGCTTCTCATAATTCTCCATAAAGGTCTAGCTATCTTCACCTGCATCACAATCAGTTAGCCCACGATTGGGTGCATTTATTCCAGTGACGAGTCAGGTCACTGTAAATTTAGGATTTGTTACAGTTTGCTGAAATTCAAAATTATTTACAAGATATAGCAATATTCATGTTGTGCATAGTTAGTTATCAACAAACCGCCATGGGTTGACTTAGTTCAAACCCCATGGTGCCCCTACCTAGTTTTCTTGGTAACTAAATGCTGTAGGGTTAGGCGGTTGTTTGTGAGATTAGTCGTGATGCACTTATGCTAGTCAAGACACTTACCGATATGAAAATAAAGTTATCGATAGAAATGCTTCTGAATGATGAAAGCTATAGAAACATAACATTAAAATTATCTAAAATTAGAAAGAACCGATTGCATGAGAGTTAGTCTACCTTGTTAGATGGAACTGGGCTGGTCCGTTTATTTCCTTGGGTAGCTTGGGAGACGGTAATGTGAGGAATAGAGACACTGCCCTCCTCTCAAAGTGGATGTGCATATTCTTGCGTGAAAAGACTGCCCTTTGAAGAAGATTAAATATGGAATTGGCAACCTTTGGATAAACTTAATTCATTGATCCAAGAATTGACACTGCTAAAGGAAGAGATTTCTTTGATCACAGTGTCTTCATAAAGATTGGCAATGGTTGGAGGATATCATCCTTCTTGTCAAAGCCAGATTTTCCAAGTATTTTCTCTATTTCAAATGCTAAAGACTTTTTTTTAGAGATTTTTGGAATTAGGCAGTCCAAACTTTGAATTTTCAGCTCAGAAGAAACCTTTTTGATAGAGAGCCCGAAGAATGGTTGATCTTTATCCAAAAGATGGAAAGCGATCAGCTAAATCATAGTGAAGAAAAATTTGTTGGAAGGTTGGAAGCTTTGGCCTGTTTCCTTGCAAATCTGCCAATCCCCAGCTGAATAGTAGAGAAGGCAGGTTGGAGAAACCCTTTCTTCTCTGATTTGGGGAGGAAGAATCGTTAAAAGACTTTGTTCTTCCTTTGGACCATGGCTTACAAGAGTCTAAACACTTGTGATTGGACTCAAAAAATGAACCCTTGCATTGCTTTATCTCCTTCTGTTTGCCCTGTCTACTTATCTAATGAAGAAACCACGGATCATCTTTTTATTCATTGTGACTGGATTACAAAGGGATGGAACTTACTCTCAGGGTTTTGGACTTTTTTGGTGCTGTCCTAAAGATATAAACACTTGGTTGCATGAAATTCTCTCTTCTTGGTGATTTCACGAAAAGCCAGAATTTTGCGGGCAAATGCTTCAAGAGCTTTGCTGTGTTATGTATACTTGGAAGGAAAGAAACAACAGAATTTTTAAAGATAGGTACAGGCTTTTGATGAT
TTTTTTGAAACTGTACAGTTTAATGCCTCTACATGGAGTACTGTCCTGAAAGAATTTTGTGATTACAAGTCTTCTTGGATAAACCTAGATTGGGGGTGCTTTGGCTAAGAAAGTTGTACGACAAGATGGGTATCCCAGCCCTACAAAAGGACAATGATGGATTTGGAAGAAACCTCTCTAGTTTGTCTTGATTTTTTAAAAAAGAGAAACCTATAAGACTCTTTAGCATACAATTATAGAAAAGTTACAAAGATTATTCTAGAGGTATTTCTTTCCTTTTATATCTCCGAAATTAAAATATCATTGAGGCTAATGATTTTAGATTTTAGCAGTATGGCTTCTAATTAGATGCTCCAAAAAGTCTTTTATTCTCCTCGGAAAAGTCCATTCAAACTCAAAAATCTCCAATGATTAATCACCCTAGCTCAAGAAAATGAACAATCTCTTCTAATATTTCCCATGAGATGCCTTTGTGAGTTATAATACAATCATGGATGTCTTCTTGCACACAATCACTGGTATTGATCTTCCTAAGAGAGACCAACCTTAAAAATAGTTTCAATCTTCTGAATCTTGTCAACTAAATTTATAGACGGAAATCTATCTATATTAGCAATAGATTCCACAAAGGATTTACATGTGAAGAACCCCAATCTCTATAATGACCAATTTCCTAAATCAGTACTCTTTAGGGAAAGGGCTAACCCTCCTAAGAGAGTAACCACGTTGGCCAAATCAGAACTTCTTTCAAAAATTAAACATCCAGGTTTCTTGTTCTTTCCCTATAACATTCTCAGTGACAGATGATAAATCATAGAGGTGGGGGGAAGTGTGAAGATACCGAAGGTTCCTTAATTATTAAAGTATGGTAGTGATGCATCTGTTTGTCCGTCTCTCCCTTTTCCTTTCTGGTATCTTTTGAATGTCTTGCTAGGAGTTATGAAAAAAAATTGAGAAGTAGGAGAGTCTTAAGTGTCCTAGCACACTGATAGTGTCACTACATTCAGAATGTACGTTTCAAATCATCTCCATACTAATAGAGAACCTTTTTTTTATTTGTATCAAGTTTTACTTTGCCCTGGATCAAACTTGGTGCTACACTAGTACAGATTTGTCTTGAAGTCATGCAATCTATATGTAATGATGCAACCGATCACTTACAACCAGTACATGCATTGGAATCAAAACTTAATATCTCTTTTCAAACTTGAAAAGAGAACCGCTGCCTCTTTGAGATTCTTCCTTTCTGTGGAGAGGGGATATGCTTCATCGACCACCACAGGAGAGATTTAATTTGCTTTGGTCTATATGTCTAGAGATGAATGTATATGTCTACTCTACATCTCCACATGAATGGATGATTAGCAAACTGTGGGAACACTTACATATCTTCCCTGATTGATTATTTAATCTCTTTTATTCTTGTTTCCATGGTTTGCTGGCCTAAAGTGCATTAGGAAATCTTCTAAAGCAACTTAATTATGATTTCTGTGGTTTACGGATGACCTTTTTTCCCATCTTTGATGCACAATTACTTTAAGGTAAATTCCTTCTCTTAGAACCATACTTGAGAGTTTCCCACCTCACACGGGTCTAATGGTAAACCTATCGTTTACCACCAAACCAGCATTTTTTGAAATGTATATTGATATTTAACTACATAAGGATGCTTGTTAGAGGTGTAGGCAGAAGAAATTATGTACTCTGCCTAGCCCAGTTCATTTGTCAGAGAAGGTTATGTTTTTAACTGCCACGTCAATTTATCCTCATTTTAATTAAGGAACACACCTTCCAACTTTATAGTAAACCGAGTCCGGTCTTCCAATTCTCTATCAAACAGATTCCTTCTCAGCCCAACGTCCCAAGTTTGGGTGTCTTCTTTCCAGCTATTACTTACTACCTCTGCTTTTGTACTAGCAATGGAGTATCTACTGTGAAGATCTGACTCAAAGGATGGAATTAAACCAGTCAATCTTCCTAGAATCTAATTATAAGCCCAT
TTCCCAGCTTAGTAAAGCAAACCGTTTTTTACAACAAAACCAATACATTTGGCTATAGCAGGCCACGGGCGGAGGGCAGACTAGATTACAAGGTTCTCTTTGTTTTCTTTGACTCCAAGGACTAAAATTTAGAAGTGGCAGTGTCACCATTATGAGTATACGAATTCTCTTGTCATTATTTATGTGAAGCTGGAAATACTGATTGAACAGGCTTTTTTAACATGTATTGCTGTTCATCCTTCTTCTGTCCTGGGATTTAAGGGTAAGACGTATCGCTTTCAACGAGTGACCACCTGATCTCATGCATTTTTGCATCAAATTCATTAGCATTTTCAAAGTGGACGGTGGTGTCAGTTAATATTTGGCGATTGGTTGCAGTACATTTAACCCTGACTGATTTCCTTTTATTAATGTTGTTCTTATTAGTGCTACAATGGCTATCTACGATGTCGTACAGCTCGTGAGGGCTGATGTCTCTACTATTGCACTAGGCATTTCAGCCTCAACAGCTTCCATAATCCTCGGTGGTGGCACTAAAGGGAAGCGTCTCGCAATGCCTAATGCACGTATTATGATGCATCAACCTCTTGGAGGGGCGAGTGGCCAAGCAATAGATGTGGAAATTCAAGCACGCGAAATCATGCATAACAAGAACAATGTAACGAGAATTATTTCCGAATCCACTGGCCATCCATTTGAAAAGGTCCAAAAGGATATCGATAGGGATCGTTATATGTCGCCAATAGAGGCTGTTGAATATGGATTGATTGATGGAGTTATCGACAAAGATAGCATTATACCTCTCGTACCAGTGCCAGAAAGAGTGAAGGCAAGTTTAAATTATGAAGAAATTAGTAAAGATCCCAGAAAATTCTTGACACCAGATGTCCCTGATGACGAGATTTACTAA

mRNA sequence

ATGGAGCTGCTTTCACGCTGTTCTACTCTTATTCCTCATGCTTCCTTCCTTATCCTTCCGAATTCTCATCACAAACCATCAAATTTCTATCCCAAGCTCAAAACCAATCTCCCTTCTCCATCGGCTTCCTCAGTCCTCAAAGCAACCGCTCTTAAACCATCGAGAGCTCTAGCTTCCCCGTGTTCCATCCTCATGGCGGCTCCACAGACGCCTGATGCTGCCCGGAGAGGCGCTGAAACTGACGCCATGGGACTGCTTCTCAGGGAGAGGATCGTCTTCTTAGGGAACAGCATTGATGACTTTGTGGCCGATGCCATTATCAGCCAGCTCTTGCTCTTGGATGCTCAAGACTCCACCAAAGATATCAGACTATTCATTAATTCCCCCGGCGGCTCTCTCAGTGCTACAATGGCTATCTACGATGTCGTACAGCTCGTGAGGGCTGATGTCTCTACTATTGCACTAGGCATTTCAGCCTCAACAGCTTCCATAATCCTCGGTGGTGGCACTAAAGGGAAGCGTCTCGCAATGCCTAATGCACGTATTATGATGCATCAACCTCTTGGAGGGGCGAGTGGCCAAGCAATAGATGTGGAAATTCAAGCACGCGAAATCATGCATAACAAGAACAATGTAACGAGAATTATTTCCGAATCCACTGGCCATCCATTTGAAAAGGTCCAAAAGGATATCGATAGGGATCGTTATATGTCGCCAATAGAGGCTGTTGAATATGGATTGATTGATGGAGTTATCGACAAAGATAGCATTATACCTCTCGTACCAGTGCCAGAAAGAGTGAAGGCAAGTTTAAATTATGAAGAAATTAGTAAAGATCCCAGAAAATTCTTGACACCAGATGTCCCTGATGACGAGATTTACTAA
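
Read side by side, the mRNA listing looks like the gene listing with one internal segment (an intron) removed. That is an observation from the listings, not something the page states. The sketch below shows one way to check this and recover the removed segment. The function name `infer_single_intron` is hypothetical, and the demonstration uses a short made-up sequence rather than the record's data.

```python
from typing import Optional, Tuple


def infer_single_intron(gene: str, mrna: str) -> Optional[Tuple[int, str]]:
    """Return (exon1_length, intron) if mrna equals gene minus one contiguous
    internal segment, otherwise None.

    When the intron starts with the same bases as the downstream exon, the
    boundary is ambiguous by those bases; this returns the rightmost split.
    """
    prefix = 0
    limit = min(len(gene), len(mrna))
    # Walk the shared 5' prefix of both sequences.
    while prefix < limit and gene[prefix] == mrna[prefix]:
        prefix += 1
    intron_len = len(gene) - len(mrna)
    # The rest of the mRNA must match the tail of the gene after the gap.
    if intron_len < 0 or gene[prefix + intron_len:] != mrna[prefix:]:
        return None
    return prefix, gene[prefix:prefix + intron_len]


# Toy sequences (made up for illustration): exon 1 + intron + exon 2.
toy_gene = "ATGGCAG" + "GTAAGTTTTCAG" + "TGCTAA"
toy_mrna = "ATGGCAG" + "TGCTAA"
result = infer_single_intron(toy_gene, toy_mrna)
print(result)
```

To apply this to the record, the line-wrapped gene sequence would first need to be joined into a single string.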

Coding sequence (CDS)

ATGGAGCTGCTTTCACGCTGTTCTACTCTTATTCCTCATGCTTCCTTCCTTATCCTTCCGAATTCTCATCACAAACCATCAAATTTCTATCCCAAGCTCAAAACCAATCTCCCTTCTCCATCGGCTTCCTCAGTCCTCAAAGCAACCGCTCTTAAACCATCGAGAGCTCTAGCTTCCCCGTGTTCCATCCTCATGGCGGCTCCACAGACGCCTGATGCTGCCCGGAGAGGCGCTGAAACTGACGCCATGGGACTGCTTCTCAGGGAGAGGATCGTCTTCTTAGGGAACAGCATTGATGACTTTGTGGCCGATGCCATTATCAGCCAGCTCTTGCTCTTGGATGCTCAAGACTCCACCAAAGATATCAGACTATTCATTAATTCCCCCGGCGGCTCTCTCAGTGCTACAATGGCTATCTACGATGTCGTACAGCTCGTGAGGGCTGATGTCTCTACTATTGCACTAGGCATTTCAGCCTCAACAGCTTCCATAATCCTCGGTGGTGGCACTAAAGGGAAGCGTCTCGCAATGCCTAATGCACGTATTATGATGCATCAACCTCTTGGAGGGGCGAGTGGCCAAGCAATAGATGTGGAAATTCAAGCACGCGAAATCATGCATAACAAGAACAATGTAACGAGAATTATTTCCGAATCCACTGGCCATCCATTTGAAAAGGTCCAAAAGGATATCGATAGGGATCGTTATATGTCGCCAATAGAGGCTGTTGAATATGGATTGATTGATGGAGTTATCGACAAAGATAGCATTATACCTCTCGTACCAGTGCCAGAAAGAGTGAAGGCAAGTTTAAATTATGAAGAAATTAGTAAAGATCCCAGAAAATTCTTGACACCAGATGTCCCTGATGACGAGATTTACTAA

Protein sequence

MELLSRCSTLIPHASFLILPNSHHKPSNFYPKLKTNLPSPSASSVLKATALKPSRALASPCSILMAAPQTPDAARRGAETDAMGLLLRERIVFLGNSIDDFVADAIISQLLLLDAQDSTKDIRLFINSPGGSLSATMAIYDVVQLVRADVSTIALGISASTASIILGGGTKGKRLAMPNARIMMHQPLGGASGQAIDVEIQAREIMHNKNNVTRIISESTGHPFEKVQKDIDRDRYMSPIEAVEYGLIDGVIDKDSIIPLVPVPERVKASLNYEEISKDPRKFLTPDVPDDEIY
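
The protein sequence corresponds to the CDS read three bases at a time. As a minimal sketch, assuming the standard genetic code applies to this nuclear gene, the translation can be reproduced with a small codon table. The names `CODON_TABLE` and `translate` are illustrative. The two inputs are the first 18 and last 15 bases of the CDS listed above.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


start_peptide = translate("ATGGAGCTGCTTTCACGC")  # first 18 bases of the CDS
end_peptide = translate("GACGAGATTTACTAA")       # last 15 bases of the CDS
print(start_peptide, end_peptide)
```

The two fragments translate to MELLSR and DEIY, matching the start and end of the protein sequence above.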
Homology
BLAST of Sgr029688 vs. NCBI nr
Match: XP_022145352.1 (ATP-dependent Clp protease proteolytic subunit 4, chloroplastic [Momordica charantia])

HSP 1 Score: 527.7 bits (1358), Expect = 6.4e-146
Identity = 276/294 (93.88%), Positives = 283/294 (96.26%), Query Frame = 0

Query: 1   MELLSRCSTLIPHASFLILPNSHHKPSNFYPKLKTNLPSPSASSVLKATALKPSRALASP 60
           MELLSRCSTLIPHAS L LPNSH KPSNFYPKLK++L  PSA SVLKATALKPSR LASP
Sbjct: 1   MELLSRCSTLIPHASSLSLPNSHPKPSNFYPKLKSSLSFPSACSVLKATALKPSRTLASP 60

Query: 61  CSILMAAPQTPDAARRGAETDAMGLLLRERIVFLGNSIDDFVADAIISQLLLLDAQDSTK 120
           CS+LM APQTPD ARRGAETDAMGLLLRERIVFLGNSIDDFVADAIISQLLLLDAQDSTK
Sbjct: 61  CSVLMTAPQTPDVARRGAETDAMGLLLRERIVFLGNSIDDFVADAIISQLLLLDAQDSTK 120

Query: 121 DIRLFINSPGGSLSATMAIYDVVQLVRADVSTIALGISASTASIILGGGTKGKRLAMPNA 180
           DIRLFINSPGGSLSATMAIYDVVQLVRADVSTI LGI+ASTASIILGGGTKGKRLAMPNA
Sbjct: 121 DIRLFINSPGGSLSATMAIYDVVQLVRADVSTIGLGIAASTASIILGGGTKGKRLAMPNA 180

Query: 181 RIMMHQPLGGASGQAIDVEIQAREIMHNKNNVTRIISESTGHPFEKVQKDIDRDRYMSPI 240
           RIM+HQPLGGASGQAIDVEIQAREIMHNKNNVTRIISE TGHPF+KVQKDIDRDRYMSPI
Sbjct: 181 RIMVHQPLGGASGQAIDVEIQAREIMHNKNNVTRIISEFTGHPFDKVQKDIDRDRYMSPI 240

Query: 241 EAVEYGLIDGVIDKDSIIPLVPVPERVKASLNYEEISKDPRKFLTPDVPDDEIY 295
           EAVEYGLIDGVIDKDSIIPL+PVPERVKASLNYEEISKDPRKFLTPDVPDDEIY
Sbjct: 241 EAVEYGLIDGVIDKDSIIPLMPVPERVKASLNYEEISKDPRKFLTPDVPDDEIY 294
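
The percentages on each HSP summary line are the matched-column counts divided by the alignment length. The sketch below recomputes them from the text of such a line. The function name `hsp_percentages` is illustrative, and the regular expression accepts both "Positives" and the "Postives" spelling that appears in this page's output.

```python
import re
from typing import Dict

# Matches e.g. "Identity = 276/294" and "Positives = 283/294" (or "Postives").
HSP_RE = re.compile(r"(Identity|Posi?tives) = (\d+)/(\d+)")


def hsp_percentages(line: str) -> Dict[str, float]:
    """Recompute identity and positives percentages from an HSP summary line."""
    result = {}
    for label, matched, length in HSP_RE.findall(line):
        key = "identity" if label == "Identity" else "positives"
        result[key] = round(100 * int(matched) / int(length), 2)
    return result


stats = hsp_percentages(
    "Identity = 276/294 (93.88%), Postives = 283/294 (96.26%), Query Frame = 0"
)
print(stats)
```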

BLAST of Sgr029688 vs. NCBI nr
Match: XP_008448589.1 (PREDICTED: ATP-dependent Clp protease proteolytic subunit 4, chloroplastic [Cucumis melo])

HSP 1 Score: 510.0 bits (1312), Expect = 1.4e-140
Identity = 270/294 (91.84%), Positives = 279/294 (94.90%), Query Frame = 0

Query: 1   MELLSRCSTLIPHASFLILPNSHHKPSNFYPKLKTNLPSPSASSVLKATALKPSRALASP 60
           MELLSRCSTLIP AS L LPNSH KPSNF+PKLK+ L  PSASSVLK TALKPSR L  P
Sbjct: 1   MELLSRCSTLIPQASSLGLPNSHGKPSNFFPKLKSTLSFPSASSVLKTTALKPSRTLPPP 60

Query: 61  CSILMAAPQTPDAARRGAETDAMGLLLRERIVFLGNSIDDFVADAIISQLLLLDAQDSTK 120
           CS+ M APQTPDA+RRGAETDAMGLLLRERIVFLGNSIDDFVADAIISQLLLLDAQDSTK
Sbjct: 61  CSV-MTAPQTPDASRRGAETDAMGLLLRERIVFLGNSIDDFVADAIISQLLLLDAQDSTK 120

Query: 121 DIRLFINSPGGSLSATMAIYDVVQLVRADVSTIALGISASTASIILGGGTKGKRLAMPNA 180
           DIRLFINS GGSLSATMAIYDVVQLVRADVSTIALGI+ASTASIILGGGTKGKRLAMPNA
Sbjct: 121 DIRLFINSAGGSLSATMAIYDVVQLVRADVSTIALGIAASTASIILGGGTKGKRLAMPNA 180

Query: 181 RIMMHQPLGGASGQAIDVEIQAREIMHNKNNVTRIISESTGHPFEKVQKDIDRDRYMSPI 240
           RIM+HQPLGGASGQAIDVEIQAREIMHNKNNVTRIISE TGHPFEKVQKDIDRDRYMSPI
Sbjct: 181 RIMVHQPLGGASGQAIDVEIQAREIMHNKNNVTRIISEFTGHPFEKVQKDIDRDRYMSPI 240

Query: 241 EAVEYGLIDGVIDKDSIIPLVPVPERVKASLNYEEISKDPRKFLTPDVPDDEIY 295
           EAVEYGLIDGVID+DSIIPLVPVPERVKA+LNYEE+SKDPRKFLTPDVPDDEIY
Sbjct: 241 EAVEYGLIDGVIDRDSIIPLVPVPERVKATLNYEEMSKDPRKFLTPDVPDDEIY 293

BLAST of Sgr029688 vs. NCBI nr
Match: XP_038876611.1 (ATP-dependent Clp protease proteolytic subunit 4, chloroplastic [Benincasa hispida])

HSP 1 Score: 509.6 bits (1311), Expect = 1.8e-140
Identity = 269/294 (91.50%), Positives = 276/294 (93.88%), Query Frame = 0

Query: 1   MELLSRCSTLIPHASFLILPNSHHKPSNFYPKLKTNLPSPSASSVLKATALKPSRALASP 60
           MELLSRCSTL+PHAS + LP SH KPSNF PKLK+ L  PSASSVLK TA KPSR  A P
Sbjct: 1   MELLSRCSTLLPHASSIGLPTSHGKPSNFLPKLKSTLSFPSASSVLKTTAPKPSRTPAPP 60

Query: 61  CSILMAAPQTPDAARRGAETDAMGLLLRERIVFLGNSIDDFVADAIISQLLLLDAQDSTK 120
           CS+LM APQTPDAARRGAETDAMGLLLRERIVFLGNSIDDFVADAIISQLLLLDAQDSTK
Sbjct: 61  CSVLMTAPQTPDAARRGAETDAMGLLLRERIVFLGNSIDDFVADAIISQLLLLDAQDSTK 120

Query: 121 DIRLFINSPGGSLSATMAIYDVVQLVRADVSTIALGISASTASIILGGGTKGKRLAMPNA 180
           DIRLFINS GGSLSATMAIYDVVQLVRADVSTIALGI+ASTASIILGGGTKGKRLAMPNA
Sbjct: 121 DIRLFINSAGGSLSATMAIYDVVQLVRADVSTIALGIAASTASIILGGGTKGKRLAMPNA 180

Query: 181 RIMMHQPLGGASGQAIDVEIQAREIMHNKNNVTRIISESTGHPFEKVQKDIDRDRYMSPI 240
           RIM+HQPLGGASGQAIDVEIQAREIMHNKNNVTRIISE TGHPFEKVQKDIDRDRYMSPI
Sbjct: 181 RIMVHQPLGGASGQAIDVEIQAREIMHNKNNVTRIISEFTGHPFEKVQKDIDRDRYMSPI 240

Query: 241 EAVEYGLIDGVIDKDSIIPLVPVPERVKASLNYEEISKDPRKFLTPDVPDDEIY 295
           EAVEYGLIDGVIDKDSIIPL+PVPERVKASLNYEEISKDP KFL PDVPDDEIY
Sbjct: 241 EAVEYGLIDGVIDKDSIIPLMPVPERVKASLNYEEISKDPTKFLNPDVPDDEIY 294

BLAST of Sgr029688 vs. NCBI nr
Match: XP_022923515.1 (ATP-dependent Clp protease proteolytic subunit 4, chloroplastic-like [Cucurbita moschata])

HSP 1 Score: 507.3 bits (1305), Expect = 8.9e-140
Identity = 268/294 (91.16%), Positives = 278/294 (94.56%), Query Frame = 0

Query: 1   MELLSRCSTLIPHASFLILPNSHHKPSNFYPKLKTNLPSPSASSVLKATALKPSRALASP 60
           MELLSRCSTLIPHAS L L NSH KPSNFYPKLK++L  PSASSVLK TA KPSR LA+ 
Sbjct: 48  MELLSRCSTLIPHASSLGLLNSHGKPSNFYPKLKSSLSFPSASSVLKTTARKPSRTLATS 107

Query: 61  CSILMAAPQTPDAARRGAETDAMGLLLRERIVFLGNSIDDFVADAIISQLLLLDAQDSTK 120
           CS+ M  PQTPDAA+RGAETDAMGLLLRERIVFLGNSIDDFVADAIISQLLLLDAQDSTK
Sbjct: 108 CSVHMTGPQTPDAAQRGAETDAMGLLLRERIVFLGNSIDDFVADAIISQLLLLDAQDSTK 167

Query: 121 DIRLFINSPGGSLSATMAIYDVVQLVRADVSTIALGISASTASIILGGGTKGKRLAMPNA 180
           DIRLFINS GGSLSATMAIYDVVQLVRADVSTIALGI+ASTASIILGGGTKGKRLAMPNA
Sbjct: 168 DIRLFINSSGGSLSATMAIYDVVQLVRADVSTIALGIAASTASIILGGGTKGKRLAMPNA 227

Query: 181 RIMMHQPLGGASGQAIDVEIQAREIMHNKNNVTRIISESTGHPFEKVQKDIDRDRYMSPI 240
           RIM+HQPLGGASGQAIDVEIQAREIMHNKNN+TRIISE TGHPFEKVQKDIDRDRYMSPI
Sbjct: 228 RIMVHQPLGGASGQAIDVEIQAREIMHNKNNITRIISEFTGHPFEKVQKDIDRDRYMSPI 287

Query: 241 EAVEYGLIDGVIDKDSIIPLVPVPERVKASLNYEEISKDPRKFLTPDVPDDEIY 295
           EAVEYGLIDGVIDKDSIIPLVPVPERVKASLNY EISKDP+KFL+PDVPDDEIY
Sbjct: 288 EAVEYGLIDGVIDKDSIIPLVPVPERVKASLNYIEISKDPKKFLSPDVPDDEIY 341

BLAST of Sgr029688 vs. NCBI nr
Match: KAG7015794.1 (ATP-dependent Clp protease proteolytic subunit 4, chloroplastic [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 506.5 bits (1303), Expect = 1.5e-139
Identity = 268/294 (91.16%), Positives = 277/294 (94.22%), Query Frame = 0

Query: 1   MELLSRCSTLIPHASFLILPNSHHKPSNFYPKLKTNLPSPSASSVLKATALKPSRALASP 60
           MELLSRCSTLIPHAS L L NSH KPSNFYPKLK++L  PSASSVLK TA KPSR LA  
Sbjct: 1   MELLSRCSTLIPHASSLGLLNSHGKPSNFYPKLKSSLSFPSASSVLKTTARKPSRTLAPS 60

Query: 61  CSILMAAPQTPDAARRGAETDAMGLLLRERIVFLGNSIDDFVADAIISQLLLLDAQDSTK 120
           CS+ M  PQTPDAA+RGAETDAMGLLLRERIVFLGNSIDDFVADAIISQLLLLDAQDSTK
Sbjct: 61  CSVHMTGPQTPDAAQRGAETDAMGLLLRERIVFLGNSIDDFVADAIISQLLLLDAQDSTK 120

Query: 121 DIRLFINSPGGSLSATMAIYDVVQLVRADVSTIALGISASTASIILGGGTKGKRLAMPNA 180
           DIRLFINS GGSLSATMAIYDVVQLVRADVSTIALGI+ASTASIILGGGTKGKRLAMPNA
Sbjct: 121 DIRLFINSSGGSLSATMAIYDVVQLVRADVSTIALGIAASTASIILGGGTKGKRLAMPNA 180

Query: 181 RIMMHQPLGGASGQAIDVEIQAREIMHNKNNVTRIISESTGHPFEKVQKDIDRDRYMSPI 240
           RIM+HQPLGGASGQAIDVEIQAREIMHNKNN+TRIISE TGHPFEKVQKDIDRDRYMSPI
Sbjct: 181 RIMVHQPLGGASGQAIDVEIQAREIMHNKNNITRIISEFTGHPFEKVQKDIDRDRYMSPI 240

Query: 241 EAVEYGLIDGVIDKDSIIPLVPVPERVKASLNYEEISKDPRKFLTPDVPDDEIY 295
           EAVEYGLIDGVIDKDSIIPLVPVPERVKASLNY EISKDP+KFL+PDVPDDEIY
Sbjct: 241 EAVEYGLIDGVIDKDSIIPLVPVPERVKASLNYIEISKDPKKFLSPDVPDDEIY 294

BLAST of Sgr029688 vs. ExPASy Swiss-Prot
Match: Q94B60 (ATP-dependent Clp protease proteolytic subunit 4, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=CLPP4 PE=1 SV=1)

HSP 1 Score: 369.4 bits (947), Expect = 3.8e-101
Identity = 204/274 (74.45%), Positives = 228/274 (83.21%), Query Frame = 0

Query: 21  NSHHKPSNFYPKLKTNLPSPSASSVLKATALKPSRALASPCSILMAAPQTPDAARRGAET 80
           +S  KP+N Y K     P+   S  L+ T+  P R   +  SI M+  QT ++A RGAE+
Sbjct: 28  SSFPKPNNLYLK-----PTKLISPPLRTTSPSPLR--FANASIEMS--QTQESAIRGAES 87

Query: 81  DAMGLLLRERIVFLGNSIDDFVADAIISQLLLLDAQDSTKDIRLFINSPGGSLSATMAIY 140
           D MGLLLRERIVFLG+SIDDFVADAI+SQLLLLDA+D  KDI+LFINSPGGSLSATMAIY
Sbjct: 88  DVMGLLLRERIVFLGSSIDDFVADAIMSQLLLLDAKDPKKDIKLFINSPGGSLSATMAIY 147

Query: 141 DVVQLVRADVSTIALGISASTASIILGGGTKGKRLAMPNARIMMHQPLGGASGQAIDVEI 200
           DVVQLVRADVSTIALGI+ASTASIILG GTKGKR AMPN RIM+HQPLGGASGQAIDVEI
Sbjct: 148 DVVQLVRADVSTIALGIAASTASIILGAGTKGKRFAMPNTRIMIHQPLGGASGQAIDVEI 207

Query: 201 QAREIMHNKNNVTRIISESTGHPFEKVQKDIDRDRYMSPIEAVEYGLIDGVIDKDSIIPL 260
           QA+E+MHNKNNVT II+  T   FE+V KDIDRDRYMSPIEAVEYGLIDGVID DSIIPL
Sbjct: 208 QAKEVMHNKNNVTSIIAGCTSRSFEQVLKDIDRDRYMSPIEAVEYGLIDGVIDGDSIIPL 267

Query: 261 VPVPERVKASLNYEEISKDPRKFLTPDVPDDEIY 295
            PVP+RVK  +NYEEISKDP KFLTP++PDDEIY
Sbjct: 268 EPVPDRVKPRVNYEEISKDPMKFLTPEIPDDEIY 292

BLAST of Sgr029688 vs. ExPASy Swiss-Prot
Match: Q3ANI8 (ATP-dependent Clp protease proteolytic subunit 1 OS=Synechococcus sp. (strain CC9605) OX=110662 GN=clpP1 PE=3 SV=2)

HSP 1 Score: 213.0 bits (541), Expect = 4.5e-54
Identity = 106/195 (54.36%), Positives = 144/195 (73.85%), Query Frame = 0

Query: 60  PCSILMAAPQTPDAARRGAET-DAMGLLLRERIVFLGNSIDDFVADAIISQLLLLDAQDS 119
           P S     P   + + RG    D    LLRERI+FLG  +DD VADA+++Q+L L+A+D 
Sbjct: 2   PVSAPGPLPTVVEQSGRGDRAFDIYSRLLRERIIFLGTGVDDAVADALVAQMLFLEAEDP 61

Query: 120 TKDIRLFINSPGGSLSATMAIYDVVQLVRADVSTIALGISASTASIILGGGTKGKRLAMP 179
            KDI+++INSPGGS++A +AIYD +Q V  DV TI  G++AS  + +L GGTKGKRLA+P
Sbjct: 62  EKDIQIYINSPGGSVTAGLAIYDTMQQVAPDVVTICYGLAASMGAFLLSGGTKGKRLALP 121

Query: 180 NARIMMHQPLGGASGQAIDVEIQAREIMHNKNNVTRIISESTGHPFEKVQKDIDRDRYMS 239
           NARIM+HQPLGGA GQA+D+EIQA+EI++ K  +  +++E TG P +K+ +D DRD ++S
Sbjct: 122 NARIMIHQPLGGAQGQAVDIEIQAKEILYLKETLNGLMAEHTGQPLDKISEDTDRDYFLS 181

Query: 240 PIEAVEYGLIDGVID 254
           P EAVEYGLID V+D
Sbjct: 182 PAEAVEYGLIDRVVD 196

BLAST of Sgr029688 vs. ExPASy Swiss-Prot
Match: Q3B0U1 (ATP-dependent Clp protease proteolytic subunit 2 OS=Synechococcus sp. (strain CC9902) OX=316279 GN=clpP2 PE=3 SV=1)

HSP 1 Score: 211.8 bits (538), Expect = 1.0e-53
Identity = 105/195 (53.85%), Positives = 143/195 (73.33%), Query Frame = 0

Query: 60  PCSILMAAPQTPDAARRGAET-DAMGLLLRERIVFLGNSIDDFVADAIISQLLLLDAQDS 119
           P S     P   + + RG    D    LLRERI+FLG  +DD VADA+++Q+L L+A+D 
Sbjct: 2   PISAPGPLPTVVEQSGRGDRAFDIYSRLLRERIIFLGTGVDDAVADALVAQMLFLEAEDP 61

Query: 120 TKDIRLFINSPGGSLSATMAIYDVVQLVRADVSTIALGISASTASIILGGGTKGKRLAMP 179
            KDI++++NSPGGS++A +AIYD +Q V  DV TI  G++AS  + +L GGTKGKRLA+P
Sbjct: 62  EKDIQIYVNSPGGSVTAGLAIYDTMQQVAPDVVTICYGLAASMGAFLLSGGTKGKRLALP 121

Query: 180 NARIMMHQPLGGASGQAIDVEIQAREIMHNKNNVTRIISESTGHPFEKVQKDIDRDRYMS 239
           NARIM+HQPLGGA GQA+D+EIQA+EI+  K  +  +++E TG P +K+ +D DRD ++S
Sbjct: 122 NARIMIHQPLGGAQGQAVDIEIQAKEILFLKETLNGLLAEHTGQPLDKISEDTDRDYFLS 181

Query: 240 PIEAVEYGLIDGVID 254
           P EAVEYGLID V+D
Sbjct: 182 PAEAVEYGLIDRVVD 196

BLAST of Sgr029688 vs. ExPASy Swiss-Prot
Match: Q7V992 (ATP-dependent Clp protease proteolytic subunit 1 OS=Prochlorococcus marinus (strain MIT 9313) OX=74547 GN=clpP1 PE=3 SV=1)

HSP 1 Score: 209.9 bits (533), Expect = 3.9e-53
Identity = 103/187 (55.08%), Positives = 140/187 (74.87%), Query Frame = 0

Query: 68  PQTPDAARRGAET-DAMGLLLRERIVFLGNSIDDFVADAIISQLLLLDAQDSTKDIRLFI 127
           P   + + RG    D    LLRERI+FLG  +DD VADA+++Q+L L+A+D  KDI+++I
Sbjct: 27  PTVVEQSGRGERAFDIYSRLLRERIIFLGTGVDDQVADALVAQMLFLEAEDPEKDIQIYI 86

Query: 128 NSPGGSLSATMAIYDVVQLVRADVSTIALGISASTASIILGGGTKGKRLAMPNARIMMHQ 187
           NSPGGS++A +AIYD +Q V  DV TI  G++AS  + +L GGTKGKRLA+PNARIM+HQ
Sbjct: 87  NSPGGSVTAGLAIYDTMQQVAPDVVTICYGLAASMGAFLLCGGTKGKRLALPNARIMIHQ 146

Query: 188 PLGGASGQAIDVEIQAREIMHNKNNVTRIISESTGHPFEKVQKDIDRDRYMSPIEAVEYG 247
           PLGGA GQA+D+EIQA+EI+  K  +  +++E TG P  K+ +D DRD ++SP +AVEYG
Sbjct: 147 PLGGAQGQAVDIEIQAKEILFLKETLNGLLAEHTGQPLNKIAEDTDRDHFLSPAKAVEYG 206

Query: 248 LIDGVID 254
           LID V+D
Sbjct: 207 LIDRVVD 213

BLAST of Sgr029688 vs. ExPASy Swiss-Prot
Match: O34125 (ATP-dependent Clp protease proteolytic subunit 2 OS=Synechococcus elongatus (strain PCC 7942 / FACHB-805) OX=1140 GN=clpP2 PE=3 SV=1)

HSP 1 Score: 208.8 bits (530), Expect = 8.6e-53
Identity = 103/191 (53.93%), Positives = 144/191 (75.39%), Query Frame = 0

Query: 68  PQTPDAARRGAET-DAMGLLLRERIVFLGNSIDDFVADAIISQLLLLDAQDSTKDIRLFI 127
           P   + + RG    D    LLRERIVFLG  +DD VAD+I++QLL L+A+D  KDI+L+I
Sbjct: 39  PTVVEQSGRGERAFDIYSRLLRERIVFLGTGVDDAVADSIVAQLLFLEAEDPEKDIQLYI 98

Query: 128 NSPGGSLSATMAIYDVVQLVRADVSTIALGISASTASIILGGGTKGKRLAMPNARIMMHQ 187
           NSPGGS++A MAIYD +Q V  DV+TI  G++AS  + +L GG +GKR+A+P+ARIM+HQ
Sbjct: 99  NSPGGSVTAGMAIYDTMQQVAPDVATICFGLAASMGAFLLSGGAQGKRMALPSARIMIHQ 158

Query: 188 PLGGASGQAIDVEIQAREIMHNKNNVTRIISESTGHPFEKVQKDIDRDRYMSPIEAVEYG 247
           PLGGA GQA+D+EIQAREI+++K+ +  ++++ TG P EK++ D DRD +MSP EA  YG
Sbjct: 159 PLGGAQGQAVDIEIQAREILYHKSTLNDLLAQHTGQPLEKIEVDTDRDFFMSPEEAKAYG 218

Query: 248 LIDGVIDKDSI 258
           LID V+ + ++
Sbjct: 219 LIDQVLTRPTM 229

BLAST of Sgr029688 vs. ExPASy TrEMBL
Match: A0A6J1CWB9 (ATP-dependent Clp protease proteolytic subunit OS=Momordica charantia OX=3673 GN=LOC111014827 PE=3 SV=1)

HSP 1 Score: 527.7 bits (1358), Expect = 3.1e-146
Identity = 276/294 (93.88%), Positives = 283/294 (96.26%), Query Frame = 0

Query: 1   MELLSRCSTLIPHASFLILPNSHHKPSNFYPKLKTNLPSPSASSVLKATALKPSRALASP 60
           MELLSRCSTLIPHAS L LPNSH KPSNFYPKLK++L  PSA SVLKATALKPSR LASP
Sbjct: 1   MELLSRCSTLIPHASSLSLPNSHPKPSNFYPKLKSSLSFPSACSVLKATALKPSRTLASP 60

Query: 61  CSILMAAPQTPDAARRGAETDAMGLLLRERIVFLGNSIDDFVADAIISQLLLLDAQDSTK 120
           CS+LM APQTPD ARRGAETDAMGLLLRERIVFLGNSIDDFVADAIISQLLLLDAQDSTK
Sbjct: 61  CSVLMTAPQTPDVARRGAETDAMGLLLRERIVFLGNSIDDFVADAIISQLLLLDAQDSTK 120

Query: 121 DIRLFINSPGGSLSATMAIYDVVQLVRADVSTIALGISASTASIILGGGTKGKRLAMPNA 180
           DIRLFINSPGGSLSATMAIYDVVQLVRADVSTI LGI+ASTASIILGGGTKGKRLAMPNA
Sbjct: 121 DIRLFINSPGGSLSATMAIYDVVQLVRADVSTIGLGIAASTASIILGGGTKGKRLAMPNA 180

Query: 181 RIMMHQPLGGASGQAIDVEIQAREIMHNKNNVTRIISESTGHPFEKVQKDIDRDRYMSPI 240
           RIM+HQPLGGASGQAIDVEIQAREIMHNKNNVTRIISE TGHPF+KVQKDIDRDRYMSPI
Sbjct: 181 RIMVHQPLGGASGQAIDVEIQAREIMHNKNNVTRIISEFTGHPFDKVQKDIDRDRYMSPI 240

Query: 241 EAVEYGLIDGVIDKDSIIPLVPVPERVKASLNYEEISKDPRKFLTPDVPDDEIY 295
           EAVEYGLIDGVIDKDSIIPL+PVPERVKASLNYEEISKDPRKFLTPDVPDDEIY
Sbjct: 241 EAVEYGLIDGVIDKDSIIPLMPVPERVKASLNYEEISKDPRKFLTPDVPDDEIY 294

BLAST of Sgr029688 vs. ExPASy TrEMBL
Match: A0A1S3BJF4 (ATP-dependent Clp protease proteolytic subunit OS=Cucumis melo OX=3656 GN=LOC103490719 PE=3 SV=1)

HSP 1 Score: 510.0 bits (1312), Expect = 6.6e-141
Identity = 270/294 (91.84%), Positives = 279/294 (94.90%), Query Frame = 0

Query: 1   MELLSRCSTLIPHASFLILPNSHHKPSNFYPKLKTNLPSPSASSVLKATALKPSRALASP 60
           MELLSRCSTLIP AS L LPNSH KPSNF+PKLK+ L  PSASSVLK TALKPSR L  P
Sbjct: 1   MELLSRCSTLIPQASSLGLPNSHGKPSNFFPKLKSTLSFPSASSVLKTTALKPSRTLPPP 60

Query: 61  CSILMAAPQTPDAARRGAETDAMGLLLRERIVFLGNSIDDFVADAIISQLLLLDAQDSTK 120
           CS+ M APQTPDA+RRGAETDAMGLLLRERIVFLGNSIDDFVADAIISQLLLLDAQDSTK
Sbjct: 61  CSV-MTAPQTPDASRRGAETDAMGLLLRERIVFLGNSIDDFVADAIISQLLLLDAQDSTK 120

Query: 121 DIRLFINSPGGSLSATMAIYDVVQLVRADVSTIALGISASTASIILGGGTKGKRLAMPNA 180
           DIRLFINS GGSLSATMAIYDVVQLVRADVSTIALGI+ASTASIILGGGTKGKRLAMPNA
Sbjct: 121 DIRLFINSAGGSLSATMAIYDVVQLVRADVSTIALGIAASTASIILGGGTKGKRLAMPNA 180

Query: 181 RIMMHQPLGGASGQAIDVEIQAREIMHNKNNVTRIISESTGHPFEKVQKDIDRDRYMSPI 240
           RIM+HQPLGGASGQAIDVEIQAREIMHNKNNVTRIISE TGHPFEKVQKDIDRDRYMSPI
Sbjct: 181 RIMVHQPLGGASGQAIDVEIQAREIMHNKNNVTRIISEFTGHPFEKVQKDIDRDRYMSPI 240

Query: 241 EAVEYGLIDGVIDKDSIIPLVPVPERVKASLNYEEISKDPRKFLTPDVPDDEIY 295
           EAVEYGLIDGVID+DSIIPLVPVPERVKA+LNYEE+SKDPRKFLTPDVPDDEIY
Sbjct: 241 EAVEYGLIDGVIDRDSIIPLVPVPERVKATLNYEEMSKDPRKFLTPDVPDDEIY 293

BLAST of Sgr029688 vs. ExPASy TrEMBL
Match: A0A6J1EC18 (ATP-dependent Clp protease proteolytic subunit OS=Cucurbita moschata OX=3662 GN=LOC111431188 PE=3 SV=1)

HSP 1 Score: 507.3 bits (1305), Expect = 4.3e-140
Identity = 268/294 (91.16%), Positives = 278/294 (94.56%), Query Frame = 0

Query: 1   MELLSRCSTLIPHASFLILPNSHHKPSNFYPKLKTNLPSPSASSVLKATALKPSRALASP 60
           MELLSRCSTLIPHAS L L NSH KPSNFYPKLK++L  PSASSVLK TA KPSR LA+ 
Sbjct: 48  MELLSRCSTLIPHASSLGLLNSHGKPSNFYPKLKSSLSFPSASSVLKTTARKPSRTLATS 107

Query: 61  CSILMAAPQTPDAARRGAETDAMGLLLRERIVFLGNSIDDFVADAIISQLLLLDAQDSTK 120
           CS+ M  PQTPDAA+RGAETDAMGLLLRERIVFLGNSIDDFVADAIISQLLLLDAQDSTK
Sbjct: 108 CSVHMTGPQTPDAAQRGAETDAMGLLLRERIVFLGNSIDDFVADAIISQLLLLDAQDSTK 167

Query: 121 DIRLFINSPGGSLSATMAIYDVVQLVRADVSTIALGISASTASIILGGGTKGKRLAMPNA 180
           DIRLFINS GGSLSATMAIYDVVQLVRADVSTIALGI+ASTASIILGGGTKGKRLAMPNA
Sbjct: 168 DIRLFINSSGGSLSATMAIYDVVQLVRADVSTIALGIAASTASIILGGGTKGKRLAMPNA 227

Query: 181 RIMMHQPLGGASGQAIDVEIQAREIMHNKNNVTRIISESTGHPFEKVQKDIDRDRYMSPI 240
           RIM+HQPLGGASGQAIDVEIQAREIMHNKNN+TRIISE TGHPFEKVQKDIDRDRYMSPI
Sbjct: 228 RIMVHQPLGGASGQAIDVEIQAREIMHNKNNITRIISEFTGHPFEKVQKDIDRDRYMSPI 287

Query: 241 EAVEYGLIDGVIDKDSIIPLVPVPERVKASLNYEEISKDPRKFLTPDVPDDEIY 295
           EAVEYGLIDGVIDKDSIIPLVPVPERVKASLNY EISKDP+KFL+PDVPDDEIY
Sbjct: 288 EAVEYGLIDGVIDKDSIIPLVPVPERVKASLNYIEISKDPKKFLSPDVPDDEIY 341

BLAST of Sgr029688 vs. ExPASy TrEMBL
Match: A0A6J1HNH6 (ATP-dependent Clp protease proteolytic subunit OS=Cucurbita maxima OX=3661 GN=LOC111465249 PE=3 SV=1)

HSP 1 Score: 501.1 bits (1289), Expect = 3.1e-138
Identity = 265/294 (90.14%), Positives = 275/294 (93.54%), Query Frame = 0

Query: 1   MELLSRCSTLIPHASFLILPNSHHKPSNFYPKLKTNLPSPSASSVLKATALKPSRALASP 60
           MELLSRCSTLIPHAS L L NSH KPSNFYPKLK+ L  PSASSVLK TA KPSR LA+ 
Sbjct: 1   MELLSRCSTLIPHASSLGLLNSHGKPSNFYPKLKSGLSFPSASSVLKTTARKPSRTLATS 60

Query: 61  CSILMAAPQTPDAARRGAETDAMGLLLRERIVFLGNSIDDFVADAIISQLLLLDAQDSTK 120
            S+ M  PQTPDAA+RGAETDAMGLLLRERIVFLGNSIDDFVADAIISQLLLLDAQDSTK
Sbjct: 61  SSVHMTGPQTPDAAQRGAETDAMGLLLRERIVFLGNSIDDFVADAIISQLLLLDAQDSTK 120

Query: 121 DIRLFINSPGGSLSATMAIYDVVQLVRADVSTIALGISASTASIILGGGTKGKRLAMPNA 180
           DIRLFINS GGSLSATMAIYDV+QLVRADVSTIALGI+ASTASIILGGGTKGKRLAMPNA
Sbjct: 121 DIRLFINSSGGSLSATMAIYDVIQLVRADVSTIALGIAASTASIILGGGTKGKRLAMPNA 180

Query: 181 RIMMHQPLGGASGQAIDVEIQAREIMHNKNNVTRIISESTGHPFEKVQKDIDRDRYMSPI 240
           RIM+HQPLGGASGQAIDVEIQAREIMHNKNN+TRIISE TGHPFEKVQKDIDRDRYMSPI
Sbjct: 181 RIMVHQPLGGASGQAIDVEIQAREIMHNKNNITRIISEFTGHPFEKVQKDIDRDRYMSPI 240

Query: 241 EAVEYGLIDGVIDKDSIIPLVPVPERVKASLNYEEISKDPRKFLTPDVPDDEIY 295
           EAVEYGLIDGVIDKDSIIPLVPVPERVKASLNY EISKDP+KF +PDVPDDEIY
Sbjct: 241 EAVEYGLIDGVIDKDSIIPLVPVPERVKASLNYIEISKDPKKFFSPDVPDDEIY 294

BLAST of Sgr029688 vs. ExPASy TrEMBL
Match: A0A6J1FAG6 (ATP-dependent Clp protease proteolytic subunit OS=Cucurbita moschata OX=3662 GN=LOC111443555 PE=3 SV=1)

HSP 1 Score: 493.8 bits (1270), Expect = 4.9e-136
Identity = 258/294 (87.76%), Positives = 273/294 (92.86%), Query Frame = 0

Query: 1   MELLSRCSTLIPHASFLILPNSHHKPSNFYPKLKTNLPSPSASSVLKATALKPSRALASP 60
           MELLSRCSTL+PHAS L LPNSH K S FYPK K++L  PS+SSVLKATALKPSR LA P
Sbjct: 1   MELLSRCSTLVPHASTLSLPNSHGKSSVFYPKRKSSLSFPSSSSVLKATALKPSRTLAPP 60

Query: 61  CSILMAAPQTPDAARRGAETDAMGLLLRERIVFLGNSIDDFVADAIISQLLLLDAQDSTK 120
           CS+LM  PQTPD+AR GAETDAMGLLLRERIVFLGN+IDDFVADAIISQLLLLDAQDSTK
Sbjct: 61  CSVLMTTPQTPDSARTGAETDAMGLLLRERIVFLGNNIDDFVADAIISQLLLLDAQDSTK 120

Query: 121 DIRLFINSPGGSLSATMAIYDVVQLVRADVSTIALGISASTASIILGGGTKGKRLAMPNA 180
           DIRLFINS GGSLSATMAI DV+QLVRADVSTIALGI+ASTASIILGGGTKGKR AMPNA
Sbjct: 121 DIRLFINSSGGSLSATMAIIDVLQLVRADVSTIALGIAASTASIILGGGTKGKRFAMPNA 180

Query: 181 RIMMHQPLGGASGQAIDVEIQAREIMHNKNNVTRIISESTGHPFEKVQKDIDRDRYMSPI 240
           RIM+HQPLGG SGQAIDVEIQAREIMHNKNN+ RIIS  TGHPFEKVQKDIDRDRYMSPI
Sbjct: 181 RIMVHQPLGGTSGQAIDVEIQAREIMHNKNNIIRIISNYTGHPFEKVQKDIDRDRYMSPI 240

Query: 241 EAVEYGLIDGVIDKDSIIPLVPVPERVKASLNYEEISKDPRKFLTPDVPDDEIY 295
           EAVEYGLIDGVIDKD+IIPL+ +PERVKA+LNYEEISKDPRKFLTPDVPDDEIY
Sbjct: 241 EAVEYGLIDGVIDKDTIIPLMQLPERVKATLNYEEISKDPRKFLTPDVPDDEIY 294

BLAST of Sgr029688 vs. TAIR 10
Match: AT5G45390.1 (CLP protease P4 )

HSP 1 Score: 369.4 bits (947), Expect = 2.7e-102
Identity = 204/274 (74.45%), Positives = 228/274 (83.21%), Query Frame = 0

Query: 21  NSHHKPSNFYPKLKTNLPSPSASSVLKATALKPSRALASPCSILMAAPQTPDAARRGAET 80
           +S  KP+N Y K     P+   S  L+ T+  P R   +  SI M+  QT ++A RGAE+
Sbjct: 28  SSFPKPNNLYLK-----PTKLISPPLRTTSPSPLR--FANASIEMS--QTQESAIRGAES 87

Query: 81  DAMGLLLRERIVFLGNSIDDFVADAIISQLLLLDAQDSTKDIRLFINSPGGSLSATMAIY 140
           D MGLLLRERIVFLG+SIDDFVADAI+SQLLLLDA+D  KDI+LFINSPGGSLSATMAIY
Sbjct: 88  DVMGLLLRERIVFLGSSIDDFVADAIMSQLLLLDAKDPKKDIKLFINSPGGSLSATMAIY 147

Query: 141 DVVQLVRADVSTIALGISASTASIILGGGTKGKRLAMPNARIMMHQPLGGASGQAIDVEI 200
           DVVQLVRADVSTIALGI+ASTASIILG GTKGKR AMPN RIM+HQPLGGASGQAIDVEI
Sbjct: 148 DVVQLVRADVSTIALGIAASTASIILGAGTKGKRFAMPNTRIMIHQPLGGASGQAIDVEI 207

Query: 201 QAREIMHNKNNVTRIISESTGHPFEKVQKDIDRDRYMSPIEAVEYGLIDGVIDKDSIIPL 260
           QA+E+MHNKNNVT II+  T   FE+V KDIDRDRYMSPIEAVEYGLIDGVID DSIIPL
Sbjct: 208 QAKEVMHNKNNVTSIIAGCTSRSFEQVLKDIDRDRYMSPIEAVEYGLIDGVIDGDSIIPL 267

Query: 261 VPVPERVKASLNYEEISKDPRKFLTPDVPDDEIY 295
            PVP+RVK  +NYEEISKDP KFLTP++PDDEIY
Sbjct: 268 EPVPDRVKPRVNYEEISKDPMKFLTPEIPDDEIY 292

BLAST of Sgr029688 vs. TAIR 10
Match: AT1G66670.1 (CLP protease proteolytic subunit 3 )

HSP 1 Score: 195.7 bits (496), Expect = 5.3e-50
Identity = 105/224 (46.88%), Positives = 152/224 (67.86%), Query Frame = 0

Query: 34  KTNLPSPSASSVLKATALKPSRALASPCSI----LMAAPQTPDAARRGAETDAMGLLLRE 93
           KT+ P    SS+  + +  P + L+S   +    + +  Q+P       E D   +LLR+
Sbjct: 36  KTSKPFCVRSSM--SLSKPPRQTLSSNWDVSSFSIDSVAQSPSRLPSFEELDTTNMLLRQ 95

Query: 94  RIVFLGNSIDDFVADAIISQLLLLDAQDSTKDIRLFINSPGGSLSATMAIYDVVQLVRAD 153
           RIVFLG+ +DD  AD +ISQLLLLDA+DS +DI LFINSPGGS++A M IYD ++  +AD
Sbjct: 96  RIVFLGSQVDDMTADLVISQLLLLDAEDSERDITLFINSPGGSITAGMGIYDAMKQCKAD 155

Query: 154 VSTIALGISASTASIILGGGTKGKRLAMPNARIMMHQPLGGASGQAIDVEIQAREIMHNK 213
           VST+ LG++AS  + +L  G+KGKR  MPN+++M+HQPLG A G+A ++ I+ RE+M++K
Sbjct: 156 VSTVCLGLAASMGAFLLASGSKGKRYCMPNSKVMIHQPLGTAGGKATEMSIRIREMMYHK 215

Query: 214 NNVTRIISESTGHPFEKVQKDIDRDRYMSPIEAVEYGLIDGVID 254
             + +I S  TG P  +++ D DRD +++P EA EYGLID VID
Sbjct: 216 IKLNKIFSRITGKPESEIESDTDRDNFLNPWEAKEYGLIDAVID 257

BLAST of Sgr029688 vs. TAIR 10
Match: AT1G02560.1 (nuclear encoded CLP protease 5 )

HSP 1 Score: 177.2 bits (448), Expect = 2.0e-44
Identity = 97/216 (44.91%), Positives = 140/216 (64.81%), Query Frame = 0

Query: 37  LPSPSASSVLKATALKPSRALASPCSILMAAPQTPDAARRGAETDAMGLLLRERIVFLGN 96
           +PSP     ++     PS    SP     A  Q P    +      +  L + RI+  G 
Sbjct: 74  IPSPQGVWSIRDDLQVPS----SPYFPAYAQGQGPPPMVQERFQSIISQLFQYRIIRCGG 133

Query: 97  SIDDFVADAIISQLLLLDAQDSTKDIRLFINSPGGSLSATMAIYDVVQLVRADVSTIALG 156
           ++DD +A+ I++QLL LDA D TKDI +++NSPGGS++A MAI+D ++ +R DVST+ +G
Sbjct: 134 AVDDDMANIIVAQLLYLDAVDPTKDIVMYVNSPGGSVTAGMAIFDTMRHIRPDVSTVCVG 193

Query: 157 ISASTASIILGGGTKGKRLAMPNARIMMHQPLGGASGQAIDVEIQAREIMHNKNNVTRII 216
           ++AS  + +L  GTKGKR ++PN+RIM+HQPLGGA G   D++IQA E++H+K N+   +
Sbjct: 194 LAASMGAFLLSAGTKGKRYSLPNSRIMIHQPLGGAQGGQTDIDIQANEMLHHKANLNGYL 253

Query: 217 SESTGHPFEKVQKDIDRDRYMSPIEAVEYGLIDGVI 253
           +  TG   EK+ +D DRD +MS  EA EYGLIDGVI
Sbjct: 254 AYHTGQSLEKINQDTDRDFFMSAKEAKEYGLIDGVI 285

BLAST of Sgr029688 vs. TAIR 10
Match: AT1G11750.1 (CLP protease proteolytic subunit 6 )

HSP 1 Score: 157.1 bits (396), Expect = 2.1e-38
Identity = 77/188 (40.96%), Positives = 115/188 (61.17%), Query Frame = 0

Query: 68  PQTPDAARRGAETDAMGLLLRERIVFLGNSIDDFVADAIISQLLLLDAQDSTKDIRLFIN 127
           P  P     G   D   +L R RI+F+G  I+  VA  +ISQL+ L + D   DI +++N
Sbjct: 83  PVMPSVMTPGGPLDLSSVLFRNRIIFIGQPINAQVAQRVISQLVTLASIDDKSDILMYLN 142

Query: 128 SPGGSLSATMAIYDVVQLVRADVSTIALGISASTASIILGGGTKGKRLAMPNARIMMHQP 187
            PGGS  + +AIYD +  ++  V T+A G++AS  +++L GG KG R AMPN R+M+HQP
Sbjct: 143 CPGGSTYSVLAIYDCMSWIKPKVGTVAFGVAASQGALLLAGGEKGMRYAMPNTRVMIHQP 202

Query: 188 LGGASGQAIDVEIQAREIMHNKNNVTRIISESTGHPFEKVQKDIDRDRYMSPIEAVEYGL 247
             G  G   DV  Q  E +  +  + R+ +  TG P EKVQ+  +RDR++S  EA+E+GL
Sbjct: 203 QTGCGGHVEDVRRQVNEAIEARQKIDRMYAAFTGQPLEKVQQYTERDRFLSASEALEFGL 262

Query: 248 IDGVIDKD 256
           IDG+++ +
Sbjct: 263 IDGLLETE 270

BLAST of Sgr029688 vs. TAIR 10
Match: AT1G11750.2 (CLP protease proteolytic subunit 6 )

HSP 1 Score: 157.1 bits (396), Expect = 2.1e-38
Identity = 77/188 (40.96%), Positives = 115/188 (61.17%), Query Frame = 0

Query: 68  PQTPDAARRGAETDAMGLLLRERIVFLGNSIDDFVADAIISQLLLLDAQDSTKDIRLFIN 127
           P  P     G   D   +L R RI+F+G  I+  VA  +ISQL+ L + D   DI +++N
Sbjct: 101 PVMPSVMTPGGPLDLSSVLFRNRIIFIGQPINAQVAQRVISQLVTLASIDDKSDILMYLN 160

Query: 128 SPGGSLSATMAIYDVVQLVRADVSTIALGISASTASIILGGGTKGKRLAMPNARIMMHQP 187
            PGGS  + +AIYD +  ++  V T+A G++AS  +++L GG KG R AMPN R+M+HQP
Sbjct: 161 CPGGSTYSVLAIYDCMSWIKPKVGTVAFGVAASQGALLLAGGEKGMRYAMPNTRVMIHQP 220

Query: 188 LGGASGQAIDVEIQAREIMHNKNNVTRIISESTGHPFEKVQKDIDRDRYMSPIEAVEYGL 247
             G  G   DV  Q  E +  +  + R+ +  TG P EKVQ+  +RDR++S  EA+E+GL
Sbjct: 221 QTGCGGHVEDVRRQVNEAIEARQKIDRMYAAFTGQPLEKVQQYTERDRFLSASEALEFGL 280

Query: 248 IDGVIDKD 256
           IDG+++ +
Sbjct: 281 IDGLLETE 288
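
Both AT1G11750 isoforms report the same counts, and the percentages follow directly from them. As a quick arithmetic check (a minimal sketch; the helper name `percent` is hypothetical), the aligned length is the sum of the four block widths, and each percentage is the count divided by that length:

```python
def percent(matches: int, aligned: int) -> float:
    """Percentage of aligned columns, rounded to two decimals as in the report."""
    return round(100 * matches / aligned, 2)


# Widths of the four alignment blocks shown above (three full rows plus the tail).
block_widths = [60, 60, 60, 8]
aligned = sum(block_widths)

print(aligned)                # prints: 188
print(percent(77, aligned))   # prints: 40.96
print(percent(115, aligned))  # prints: 61.17
```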

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
XP_022145352.1 | 6.4e-146 | 93.88 | ATP-dependent Clp protease proteolytic subunit 4, chloroplastic [Momordica chara...
XP_008448589.1 | 1.4e-140 | 91.84 | PREDICTED: ATP-dependent Clp protease proteolytic subunit 4, chloroplastic [Cucu...
XP_038876611.1 | 1.8e-140 | 91.50 | ATP-dependent Clp protease proteolytic subunit 4, chloroplastic [Benincasa hispi...
XP_022923515.1 | 8.9e-140 | 91.16 | ATP-dependent Clp protease proteolytic subunit 4, chloroplastic-like [Cucurbita ...
KAG7015794.1 | 1.5e-139 | 91.16 | ATP-dependent Clp protease proteolytic subunit 4, chloroplastic [Cucurbita argyr...

Match Name | E-value | Identity | Description
Q94B60 | 3.8e-101 | 74.45 | ATP-dependent Clp protease proteolytic subunit 4, chloroplastic OS=Arabidopsis t...
Q3ANI8 | 4.5e-54 | 54.36 | ATP-dependent Clp protease proteolytic subunit 1 OS=Synechococcus sp. (strain CC...
Q3B0U1 | 1.0e-53 | 53.85 | ATP-dependent Clp protease proteolytic subunit 2 OS=Synechococcus sp. (strain CC...
Q7V992 | 3.9e-53 | 55.08 | ATP-dependent Clp protease proteolytic subunit 1 OS=Prochlorococcus marinus (str...
O34125 | 8.6e-53 | 53.93 | ATP-dependent Clp protease proteolytic subunit 2 OS=Synechococcus elongatus (str...

Match Name | E-value | Identity | Description
A0A6J1CWB9 | 3.1e-146 | 93.88 | ATP-dependent Clp protease proteolytic subunit OS=Momordica charantia OX=3673 GN...
A0A1S3BJF4 | 6.6e-141 | 91.84 | ATP-dependent Clp protease proteolytic subunit OS=Cucumis melo OX=3656 GN=LOC103...
A0A6J1EC18 | 4.3e-140 | 91.16 | ATP-dependent Clp protease proteolytic subunit OS=Cucurbita moschata OX=3662 GN=...
A0A6J1HNH6 | 3.1e-138 | 90.14 | ATP-dependent Clp protease proteolytic subunit OS=Cucurbita maxima OX=3661 GN=LO...
A0A6J1FAG6 | 4.9e-136 | 87.76 | ATP-dependent Clp protease proteolytic subunit OS=Cucurbita moschata OX=3662 GN=...

Match Name | E-value | Identity | Description
AT5G45390.1 | 2.7e-102 | 74.45 | CLP protease P4
AT1G66670.1 | 5.3e-50 | 46.88 | CLP protease proteolytic subunit 3
AT1G02560.1 | 2.0e-44 | 44.91 | nuclear encoded CLP protease 5
AT1G11750.1 | 2.1e-38 | 40.96 | CLP protease proteolytic subunit 6
AT1G11750.2 | 2.1e-38 | 40.96 | CLP protease proteolytic subunit 6

InterPro
Analysis Name: InterPro Annotations of Monk fruit (Qingpiguo) v1
Date Performed: 2022-08-01
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR001907 | ATP-dependent Clp protease proteolytic subunit | PRINTS | PR00127 | CLPPROTEASEP | coord: 173..192, score: 57.62; coord: 121..141, score: 58.93; coord: 152..169, score: 45.14; coord: 81..96, score: 46.88; coord: 230..249, score: 56.38
IPR001907 | ATP-dependent Clp protease proteolytic subunit | HAMAP | MF_00444 | ClpP | coord: 66..255, score: 37.315231
IPR001907 | ATP-dependent Clp protease proteolytic subunit | CDD | cd07017 | S14_ClpP_2 | coord: 81..251, e-value: 9.21685E-97, score: 280.482
IPR023562 | Clp protease proteolytic subunit /Translocation-enhancing protein TepA | PFAM | PF00574 | CLP_protease | coord: 80..254, e-value: 1.4E-71, score: 240.1
IPR023562 | Clp protease proteolytic subunit /Translocation-enhancing protein TepA | PANTHER | PTHR10381 | ATP-DEPENDENT CLP PROTEASE PROTEOLYTIC SUBUNIT | coord: 62..291
None | No IPR available | GENE3D | 3.90.226.10 | | coord: 60..257, e-value: 6.0E-69, score: 233.7
None | No IPR available | PANTHER | PTHR10381:SF24 | ATP-DEPENDENT CLP PROTEASE PROTEOLYTIC SUBUNIT 4, CHLOROPLASTIC | coord: 62..291
IPR033135 | ClpP, histidine active site | PROSITE | PS00382 | CLP_PROTEASE_HIS | coord: 174..187
IPR029045 | ClpP/crotonase-like domain superfamily | SUPERFAMILY | 52096 | ClpP/crotonase | coord: 80..257
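
The InterPro coordinates above can be compared directly to see how the signatures nest, for example whether the PROSITE histidine active-site pattern falls inside the Pfam domain. The following is a minimal sketch, assuming the `coord` values are 1-based, inclusive protein positions (the page does not state this); the `Hit` type and its methods are hypothetical helpers, not part of any InterPro tool.

```python
from typing import NamedTuple


class Hit(NamedTuple):
    """One signature match, using coordinates copied from the table above."""
    source: str
    term: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumes 1-based, inclusive coordinates.
        return self.end - self.start + 1

    def contains(self, other: "Hit") -> bool:
        """True if `other` lies entirely within this hit."""
        return self.start <= other.start and other.end <= self.end


pfam = Hit("PFAM", "PF00574", 80, 254)
cdd = Hit("CDD", "cd07017", 81, 251)
prosite = Hit("PROSITE", "PS00382", 174, 187)
prints_motif = Hit("PRINTS", "PR00127", 173, 192)

print(pfam.length)                     # prints: 175
print(prosite.length)                  # prints: 14
print(pfam.contains(prosite))          # prints: True
print(prints_motif.contains(prosite))  # prints: True
print(cdd.contains(pfam))              # prints: False
```

Under that assumption, the histidine active-site pattern (174..187) sits inside both the Pfam domain and the PRINTS motif at 173..192, which is what one would expect for overlapping ClpP signatures.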

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Sgr029688.1 | Sgr029688.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0006515 | protein quality control for misfolded or incompletely synthesized proteins
biological_process | GO:0006508 | proteolysis
cellular_component | GO:0009570 | chloroplast stroma
cellular_component | GO:0009368 | endopeptidase Clp complex
cellular_component | GO:0009526 | plastid envelope
molecular_function | GO:0051117 | ATPase binding
molecular_function | GO:0004176 | ATP-dependent peptidase activity
molecular_function | GO:0004252 | serine-type endopeptidase activity