Sgr020530 (gene) Monk fruit (Qingpiguo) v1

Overview
Name: Sgr020530
Type: gene
Organism: Siraitia grosvenorii (Monk fruit (Qingpiguo) v1)
Description: Pentatricopeptide repeat-containing protein
Location: tig00153535: 137133 .. 139446 (+)
RNA-Seq Expression: Sgr020530
Synteny: Sgr020530
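The location given above can be turned into a feature length. The sketch below assumes the coordinates are 1-based and inclusive (as in GFF3), which this page does not state explicitly; `parse_location` and `span_length` are hypothetical helpers written for illustration, not part of any database API.

```python
import re

def parse_location(text):
    """Split 'contig: start .. end (strand)' into its four parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    contig, start, end, strand = m.groups()
    return contig, int(start), int(end), strand

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive.
    return end - start + 1

contig, start, end, strand = parse_location("tig00153535: 137133 .. 139446 (+)")
gene_length = span_length(start, end)
print(contig, strand, gene_length)
```

Under that assumption the gene spans 2314 bp on the forward strand, which should equal the length of the gene sequence (with intron) listed below.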
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGAGATGAACTCTCTTGCTGCAATGGCGGCACATCCACACTGCTTTCCGAGAAATCTCAAACAGAAACGCCTGATTTTCTCCCATCAGAGACGCTCCGCAGGTCGAGCTCCGGCTCGTAAGGTACGCTCAATTACTGAAAAGGATTTCAACAGCTCAAATTCAGATTCTGCTGATAAACTCTCAATTTTTTGTTCTTCTTCTTTCTTCATCGTTTGTTGGTTTGTAGTTAGACTGAATTTGGGCGTAAATGGAAAATTGTGAGAAATTCGCGAACGTTATATTGTCATTTTATTTTGAGTCTCGGCCATCTCGGGTGAATGGCTAACTTTTTTCGGCGAATTTAAGGTTATCAAGGCAATATGTTAATCGAGAATTCAATGAGAATATGATGTGCACTTACTATTGTGATGATTGGTTTTCTGTATAGATCAAGCCTACATTCTAAGAATCAGGTCTGAATGTTATGTCTTGAAACTTGGAGATGATTAATTATGTGGTGATCTGTTAATTGGTCTGATCATGTTCTTTGATGTTACTTGGGATTGTTCTTGTAAATAAATTTAACCTTAGAAGGTCCTCTTTGAAGTGGTTAACTCATTTTCTTTTTTAATTGTCTTTTCCTGTTCATATGCAAGTTTGGTGTGCAAGGGGTCAGGTCAGACAAATTGCAACAACCCCTTGCCTGTGCTTGAATGCACTGAGAAAATTAGATAACTTACCAATTGTTGGAAGATGTACAATGGATGAATTCGTGCTGATATGATTATGCTTATTGAAGATAAAATGACAAAAAGGTGTGGTGGCAGTGGGTGGATGCCATGTTAATTCACCTTTCGAACCATGTTAGGTTTCTTGTGGGCCATGGCTTAGATGTTAGTTTCGATTGACAAGGCCCAAGATGTGGACTATTAGGATAAAAGAATAGGGAGGAGTGACATTGTAAGGATTTGAACTTAGGACAACTTACTTCGGTACGATATTAAATCATCATTAAATTAAAAATTTTAATAGGTTGGGTTAGAGAAAATGTAGGATGCCTTAGTTTAGGCAATATAAATACAAAAAATCAAAATCAAGCCTTCTTTCTAAATGGTTAGAGAGATTCCCAAAGGAAATGAACTTTTTTCATTTTAAGCAAGTAAGCTCCAGATCAGTTTGGGTGTAATTCTTGATATGTCTAGAAGGAGTAGCCACAGCTCTTGGAAAGACCTTGGCTTCCATTGTTAACCTCACAATTACTTTCGAAGTTGGAGATGGTAAGTCAATGATCTTTTTCTGGAAGAACGATTGTGTGGGGACTGTGTCCCTTGCAGAGTCTTTCCCTAGAATATGCTTGTTGATATGACTTATCTTTGACCTATTGCTGAGGCTTTTACTGGGAAACTGTTGCATGGAATGCTTTATTCAGGGAGACAGGCGATTGACGAGTGGACAGATGTAACATACTCAGTAAGAGAAATACTCTGAGAAGAGCACAAGGTGCGAAAGCTTTTGAGGTCAAATTTCTTGAAATAGTTGGAATAGATTTTCTTTTGAAAAAAAAAAAAAAGGTTATAAATGAGAAAAGGGCATATTAACGAAAAAAATCTTACTTATCACAATTATTTCTTTGGGCAATCTATACAGCCAATGGATCTCTGTATGATTTGAATCCTTAAAACAATAAAGTATAAGTAGAGTGATTTCTTATCTGATGATCATGTTATAGATATTTTCATTTGCTTAATTTTTGTTTCTTTTGTACGCAATGTACTTAGGATAGATACCCTGGCAGCACAAACAAGAAGGAAGTTTTGTTGGTGATGATTTTTACATCAGCCATTCCCGAGTGATTAAAAAATTGAGCCAGAGAAGGATGCCTGTTTTAGCTCAAGAGATTTTCTGGGAGCTGAAGTCTGAAGGTTCGTTGCTAAACAACTCCACGTTCTCTGCTCTTATGGAATGCTACATAGATGGTGGTCTTCTCCACCAAGCACAGGCAATTTGGGAAGAAATGT
TAAACAGTTGTTTCGTACCTTCTGTTCATGTAATTTCAAAGTTATTTAGCACTTATGGGAAGATGGGACACTTTGATGATATAATCAAAGCTTTGGATCAGGTAAAACTAAGGTATTTACATTTACTGCCTGAGGCATACTCACTAGCCATATCATGTTTTGGGAAGCATGGACAACTGGAATTGATGGAAAATACTCTGAGGGAAATGGTTTCCAGCGGTTTCCAGTTAATTCCACTATTGGAAATTCCTTTATTGTACACTACAGCATTTTGGTTCTTTGATGGAGATGGAAACTGCCTATGGCCGCCTTAA

mRNA sequence

ATGGAGATGAACTCTCTTGCTGCAATGGCGGCACATCCACACTGCTTTCCGAGAAATCTCAAACAGAAACGCCTGATTTTCTCCCATCAGAGACGCTCCGCAGGTCGAGCTCCGGCTCAAGGAGTAGCCACAGCTCTTGGAAAGACCTTGGCTTCCATTGTTAACCTCACAATTACTTTCGAAGTTGGAGATGGGAGACAGGCGATTGACGAGTGGACAGATGTAACATACTCAATACCCTGGCAGCACAAACAAGAAGGAAGTTTTGTTGGTGATGATTTTTACATCAGCCATTCCCGAGTGATTAAAAAATTGAGCCAGAGAAGGATGCCTGTTTTAGCTCAAGAGATTTTCTGGGAGCTGAAGTCTGAAGGTTCGTTGCTAAACAACTCCACGTTCTCTGCTCTTATGGAATGCTACATAGATGGTGGTCTTCTCCACCAAGCACAGGCAATTTGGGAAGAAATGTTAAACAGTTGTTTCGTACCTTCTGTTCATGTAATTTCAAAGTTATTTAGCACTTATGGGAAGATGGGACACTTTGATGATATAATCAAAGCTTTGGATCAGGTAAAACTAAGGTATTTACATTTACTGCCTGAGGCATACTCACTAGCCATATCATGTTTTGGGAAGCATGGACAACTGGAATTGATGGAAAATACTCTGAGGGAAATGGTTTCCAGCGGTTTCCAGTTAATTCCACTATTGGAAATTCCTTTATTGTACACTACAGCATTTTGGTTCTTTGATGGAGATGGAAACTGCCTATGGCCGCCTTAA

Coding sequence (CDS)

ATGGAGATGAACTCTCTTGCTGCAATGGCGGCACATCCACACTGCTTTCCGAGAAATCTCAAACAGAAACGCCTGATTTTCTCCCATCAGAGACGCTCCGCAGGTCGAGCTCCGGCTCAAGGAGTAGCCACAGCTCTTGGAAAGACCTTGGCTTCCATTGTTAACCTCACAATTACTTTCGAAGTTGGAGATGGGAGACAGGCGATTGACGAGTGGACAGATGTAACATACTCAATACCCTGGCAGCACAAACAAGAAGGAAGTTTTGTTGGTGATGATTTTTACATCAGCCATTCCCGAGTGATTAAAAAATTGAGCCAGAGAAGGATGCCTGTTTTAGCTCAAGAGATTTTCTGGGAGCTGAAGTCTGAAGGTTCGTTGCTAAACAACTCCACGTTCTCTGCTCTTATGGAATGCTACATAGATGGTGGTCTTCTCCACCAAGCACAGGCAATTTGGGAAGAAATGTTAAACAGTTGTTTCGTACCTTCTGTTCATGTAATTTCAAAGTTATTTAGCACTTATGGGAAGATGGGACACTTTGATGATATAATCAAAGCTTTGGATCAGGTAAAACTAAGGTATTTACATTTACTGCCTGAGGCATACTCACTAGCCATATCATGTTTTGGGAAGCATGGACAACTGGAATTGATGGAAAATACTCTGAGGGAAATGGTTTCCAGCGGTTTCCAGTTAATTCCACTATTGGAAATTCCTTTATTGTACACTACAGCATTTTGGTTCTTTGATGGAGATGGAAACTGCCTATGGCCGCCTTAA
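The relationship between the coding sequence above and the protein sequence that follows can be checked by translating the CDS. This is a minimal sketch assuming the standard genetic code (NCBI translation table 1); the `translate` function and the short test fragments are illustrative only, and the full CDS can be pasted in place of `cds_start`.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 14 codons of the CDS listed above.
cds_start = "ATGGAGATGAACTCTCTTGCTGCAATGGCGGCACATCCACAC"
print(translate(cds_start))
```

The translated fragment should match the first residues of the protein sequence, and the final codons of the CDS (CCG CCT TAA) should give the terminal "PP" followed by a stop.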

Protein sequence

MEMNSLAAMAAHPHCFPRNLKQKRLIFSHQRRSAGRAPAQGVATALGKTLASIVNLTITFEVGDGRQAIDEWTDVTYSIPWQHKQEGSFVGDDFYISHSRVIKKLSQRRMPVLAQEIFWELKSEGSLLNNSTFSALMECYIDGGLLHQAQAIWEEMLNSCFVPSVHVISKLFSTYGKMGHFDDIIKALDQVKLRYLHLLPEAYSLAISCFGKHGQLELMENTLREMVSSGFQLIPLLEIPLLYTTAFWFFDGDGNCLWPP
Homology
BLAST of Sgr020530 vs. NCBI nr
Match: XP_022149915.1 (pentatricopeptide repeat-containing protein At3g42630 isoform X1 [Momordica charantia])

HSP 1 Score: 283.1 bits (723), Expect = 2.4e-72
Identity = 152/231 (65.80%), Positives = 163/231 (70.56%), Query Frame = 0

Query: 1   MEMNSLAAMAAHPHCFPRNLKQKRLIFSHQRRSAGRAPAQGVATALGKTLASIVNLTITF 60
           M+MNSLAAM   P  FP NLKQKRLIFSH RRS GR  A+                    
Sbjct: 1   MDMNSLAAMLPLPQIFPANLKQKRLIFSHHRRSVGRTSAR-------------------- 60

Query: 61  EVGDGRQAIDEWTDVTYSIPWQHKQEGSFVGDDFYISHSRVIKKLSQRRMPVLAQEIFWE 120
                                + K EGSFV DDFY  HS++IKKLS+RR+PVLAQEIFWE
Sbjct: 61  ---------------------KDKGEGSFVDDDFYADHSQMIKKLSERRLPVLAQEIFWE 120

Query: 121 LKSEGSLLNNSTFSALMECYIDGGLLHQAQAIWEEMLNSCFVPSVHVISKLFSTYGKMGH 180
           LKSEGS L NST S LME YIDGG L QAQAIWEEMLNS FVPSV++ISKLF T+GKMG 
Sbjct: 121 LKSEGSPLKNSTLSTLMESYIDGGRLLQAQAIWEEMLNSSFVPSVNLISKLFHTFGKMGR 180

Query: 181 FDDIIKALDQVKLRYLHLLPEAYSLAISCFGKHGQLELMENTLREMVSSGF 232
           FDDIIK LDQVKLRY HLLPEA+SLAISCFGKHGQLELMENTLREMVSSGF
Sbjct: 181 FDDIIKVLDQVKLRYSHLLPEAFSLAISCFGKHGQLELMENTLREMVSSGF 190
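The percentages on the HSP statistics line are the match counts divided by the alignment length. The sketch below recomputes them from the counts reported for this first hit; `parse_hsp_stats` is a hypothetical helper, and its pattern accepts both the spelling "Positives" and the variant "Postives" that some BLAST report pages emit.

```python
import re

# Matches e.g. "Identity = 152/231" or "Positives = 163/231".
STAT = re.compile(r"(Identity|Posi?tives)\s*=\s*(\d+)/(\d+)")

def percent(matches, length):
    """Percentage of aligned columns, rounded to two decimals."""
    return round(100 * matches / length, 2)

def parse_hsp_stats(line):
    """Return {'identity': pct, 'positives': pct} for one HSP statistics line."""
    stats = {}
    for name, matches, length in STAT.findall(line):
        key = "identity" if name == "Identity" else "positives"
        stats[key] = percent(int(matches), int(length))
    return stats

line = "Identity = 152/231 (65.80%), Positives = 163/231 (70.56%), Query Frame = 0"
stats = parse_hsp_stats(line)
print(stats)
```

Note that the denominator is the alignment length including gap columns, so the identity is relative to the aligned region and not to the full query protein.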

BLAST of Sgr020530 vs. NCBI nr
Match: KAG7020945.1 (Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 278.1 bits (710), Expect = 7.8e-71
Identity = 151/230 (65.65%), Positives = 164/230 (71.30%), Query Frame = 0

Query: 1   MEMNSLAAMAAHPHCFPRNLKQKRLIFSHQRRSAGRAPAQGVATALGKTLASIVNLTITF 60
           ME+NSL A  A+PHCFP NLKQKR I   QRR AGRA A+ +                  
Sbjct: 1   MELNSLVAAVANPHCFPTNLKQKRPILFQQRRPAGRASARKILR---------------- 60

Query: 61  EVGDGRQAIDEWTDVTYSIPWQHKQEGSFVGDDFYISHSRVIKKLSQRRMPVLAQEIFWE 120
                                Q+K++GSF GDDF I+HS+VI+KLSQRR PVLAQEIF E
Sbjct: 61  ---------------------QNKEKGSFAGDDFSINHSQVIEKLSQRRTPVLAQEIFLE 120

Query: 121 LKSEGSLLNNSTFSALMECYIDGGLLHQAQAIWEEMLNSCFVPSVHVISKLFSTYGKMGH 180
           LKSE   LNNST S+LM  YIDGGLL QA+AIWEEMLNSCFVPSV VISKL +TYGKM  
Sbjct: 121 LKSEDFPLNNSTLSSLMVRYIDGGLLLQAEAIWEEMLNSCFVPSVLVISKLLNTYGKMRR 180

Query: 181 FDDIIKALDQVKLRYLHLLPEAYSLAISCFGKHGQLELMENTLREMVSSG 231
           FDDIIK LDQVKLRYLHLLPEAYSLAISCFGKHGQLELMENTLREMVSSG
Sbjct: 181 FDDIIKVLDQVKLRYLHLLPEAYSLAISCFGKHGQLELMENTLREMVSSG 193

BLAST of Sgr020530 vs. NCBI nr
Match: XP_022937749.1 (pentatricopeptide repeat-containing protein At3g42630 [Cucurbita moschata])

HSP 1 Score: 275.0 bits (702), Expect = 6.6e-70
Identity = 150/230 (65.22%), Positives = 163/230 (70.87%), Query Frame = 0

Query: 1   MEMNSLAAMAAHPHCFPRNLKQKRLIFSHQRRSAGRAPAQGVATALGKTLASIVNLTITF 60
           ME+NSL A  A+PHCFP NLKQKR I   QRR AGRA A+ +                  
Sbjct: 1   MELNSLVAAVANPHCFPTNLKQKRPILFQQRRPAGRASARKILR---------------- 60

Query: 61  EVGDGRQAIDEWTDVTYSIPWQHKQEGSFVGDDFYISHSRVIKKLSQRRMPVLAQEIFWE 120
                                Q+K++GSF GDD  I+HS+VI+KLSQRR PVLAQEIF E
Sbjct: 61  ---------------------QNKEKGSFAGDDSSINHSQVIEKLSQRRTPVLAQEIFLE 120

Query: 121 LKSEGSLLNNSTFSALMECYIDGGLLHQAQAIWEEMLNSCFVPSVHVISKLFSTYGKMGH 180
           LKSE   LNNST S+LM  YIDGGLL QA+AIWEEMLNSCFVPSV VISKL +TYGKM  
Sbjct: 121 LKSEDFPLNNSTLSSLMVRYIDGGLLLQAEAIWEEMLNSCFVPSVLVISKLLNTYGKMRR 180

Query: 181 FDDIIKALDQVKLRYLHLLPEAYSLAISCFGKHGQLELMENTLREMVSSG 231
           FDDIIK LDQVKLRYLHLLPEAYSLAISCFGKHGQLELMENTLREMVSSG
Sbjct: 181 FDDIIKVLDQVKLRYLHLLPEAYSLAISCFGKHGQLELMENTLREMVSSG 193

BLAST of Sgr020530 vs. NCBI nr
Match: XP_023536679.1 (pentatricopeptide repeat-containing protein At3g42630 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 275.0 bits (702), Expect = 6.6e-70
Identity = 149/230 (64.78%), Positives = 163/230 (70.87%), Query Frame = 0

Query: 1   MEMNSLAAMAAHPHCFPRNLKQKRLIFSHQRRSAGRAPAQGVATALGKTLASIVNLTITF 60
           ME+NSL A  A+PHCFP NLKQKR I   QRR AGRA A+ +                  
Sbjct: 2   MELNSLVAAVANPHCFPTNLKQKRPILFQQRRPAGRASARKILR---------------- 61

Query: 61  EVGDGRQAIDEWTDVTYSIPWQHKQEGSFVGDDFYISHSRVIKKLSQRRMPVLAQEIFWE 120
                                Q+K++GSF GDDF I+HS+VI+KLSQRR PVLAQEIF E
Sbjct: 62  ---------------------QNKEKGSFAGDDFSINHSQVIEKLSQRRTPVLAQEIFLE 121

Query: 121 LKSEGSLLNNSTFSALMECYIDGGLLHQAQAIWEEMLNSCFVPSVHVISKLFSTYGKMGH 180
           LK E   LNNST S+LM  YIDGGLL QA+AIWEEMLNSCFVPSV VISKL +TYGKM  
Sbjct: 122 LKPEEFPLNNSTLSSLMVRYIDGGLLLQAEAIWEEMLNSCFVPSVLVISKLLNTYGKMRR 181

Query: 181 FDDIIKALDQVKLRYLHLLPEAYSLAISCFGKHGQLELMENTLREMVSSG 231
           FDDIIK LDQVK+RYLHLLPEAYSLAISCFGKHGQLELMENTLREMVSSG
Sbjct: 182 FDDIIKVLDQVKIRYLHLLPEAYSLAISCFGKHGQLELMENTLREMVSSG 194

BLAST of Sgr020530 vs. NCBI nr
Match: KAG6586124.1 (Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 274.6 bits (701), Expect = 8.6e-70
Identity = 150/230 (65.22%), Positives = 162/230 (70.43%), Query Frame = 0

Query: 1   MEMNSLAAMAAHPHCFPRNLKQKRLIFSHQRRSAGRAPAQGVATALGKTLASIVNLTITF 60
           ME+NSL A  A+PHCFP NLKQKR I   QRR AGRA A+ +                  
Sbjct: 1   MELNSLVAAVANPHCFPTNLKQKRPILFQQRRPAGRASARKILR---------------- 60

Query: 61  EVGDGRQAIDEWTDVTYSIPWQHKQEGSFVGDDFYISHSRVIKKLSQRRMPVLAQEIFWE 120
                                Q+K+ GSF GDD  I+HS+VI+KLSQRR PVLAQEIF E
Sbjct: 61  ---------------------QNKENGSFAGDDSSINHSQVIEKLSQRRTPVLAQEIFLE 120

Query: 121 LKSEGSLLNNSTFSALMECYIDGGLLHQAQAIWEEMLNSCFVPSVHVISKLFSTYGKMGH 180
           LKSE   LNNST S+LM  YIDGGLL QA+AIWEEMLNSCFVPSV VISKL +TYGKM  
Sbjct: 121 LKSEDFPLNNSTLSSLMVRYIDGGLLLQAEAIWEEMLNSCFVPSVLVISKLLNTYGKMRR 180

Query: 181 FDDIIKALDQVKLRYLHLLPEAYSLAISCFGKHGQLELMENTLREMVSSG 231
           FDDIIK LDQVKLRYLHLLPEAYSLAISCFGKHGQLELMENTLREMVSSG
Sbjct: 181 FDDIIKVLDQVKLRYLHLLPEAYSLAISCFGKHGQLELMENTLREMVSSG 193

BLAST of Sgr020530 vs. ExPASy Swiss-Prot
Match: Q9M2A1 (Pentatricopeptide repeat-containing protein At3g42630 OS=Arabidopsis thaliana OX=3702 GN=At3g42630 PE=2 SV=2)

HSP 1 Score: 137.1 bits (344), Expect = 2.8e-31
Identity = 67/138 (48.55%), Positives = 91/138 (65.94%), Query Frame = 0

Query: 96  ISHSRVIKKLSQRRMPVLAQEIFWELKSEGSLLNNSTFSALMECYIDGGLLHQAQAIWEE 155
           + ++ +++ LSQRR+P +A EIF + KS   L N  T  ALM C+ + G + +A+ IW+E
Sbjct: 48  VDYAPLVQTLSQRRLPDVAHEIFLQTKSVNLLPNYRTLCALMLCFAENGFVLRARTIWDE 107

Query: 156 MLNSCFVPSVHVISKLFSTYGKMGHFDDIIKALDQVKLRYLHLLPEAYSLAISCFGKHGQ 215
           ++NSCFVP V V+SKL S Y + G FD++ K    V  R+  LLP   SLAISCFGK+GQ
Sbjct: 108 IINSCFVPDVFVVSKLISAYEQFGCFDEVAKITKDVAARHSKLLPVVSSLAISCFGKNGQ 167

Query: 216 LELMENTLREMVSSGFQL 234
           LELME  + EM S G  L
Sbjct: 168 LELMEGVIEEMDSKGVLL 185

BLAST of Sgr020530 vs. ExPASy Swiss-Prot
Match: Q9FKC3 (Pentatricopeptide repeat-containing protein At5g48730, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=At5g48730 PE=2 SV=2)

HSP 1 Score: 57.0 bits (136), Expect = 3.7e-07
Identity = 36/130 (27.69%), Positives = 65/130 (50.00%), Query Frame = 0

Query: 100 RVIKKLSQRRMPVLAQEIFWELKSEGSLLNNSTFSALMECYIDGGLLHQAQAIWEEMLNS 159
           ++I  L + + P  A E+F E+ +EG ++N+  ++AL+  Y   G    A  + E M +S
Sbjct: 155 KLIVMLGKCKQPEKAHELFQEMINEGCVVNHEVYTALVSAYSRSGRFDAAFTLLERMKSS 214

Query: 160 --CFVPSVHVISKLFSTYGKMGHFDDIIKALDQVKLRYLHLLPEAYSLAISCFGKHGQLE 219
             C  P VH  S L  ++ ++  FD +   L  ++ + +      Y+  I  +GK     
Sbjct: 215 HNC-QPDVHTYSILIKSFLQVFAFDKVQDLLSDMRRQGIRPNTITYNTLIDAYGKAKMFV 274

Query: 220 LMENTLREMV 228
            ME+TL +M+
Sbjct: 275 EMESTLIQML 283

BLAST of Sgr020530 vs. ExPASy Swiss-Prot
Match: Q9LW84 (Pentatricopeptide repeat-containing protein At3g16010 OS=Arabidopsis thaliana OX=3702 GN=At3g16010 PE=2 SV=1)

HSP 1 Score: 55.8 bits (133), Expect = 8.2e-07
Identity = 33/134 (24.63%), Positives = 64/134 (47.76%), Query Frame = 0

Query: 99  SRVIKKLSQRRMPVLAQEIFWELKSEGSLLNNSTFSALMECYIDGGLLHQAQAIWEEMLN 158
           S ++K L + +M   A  +F++ K       +ST+++++   +  G   +   ++ EM N
Sbjct: 166 SELVKALGRAKMVSKALSVFYQAKGRKCKPTSSTYNSVILMLMQEGQHEKVHEVYTEMCN 225

Query: 159 --SCFVPSVHVISKLFSTYGKMGHFDDIIKALDQVKLRYLHLLPEAYSLAISCFGKHGQL 218
              CF P     S L S+Y K+G  D  I+  D++K   +    + Y+  +  + K G++
Sbjct: 226 EGDCF-PDTITYSALISSYEKLGRNDSAIRLFDEMKDNCMQPTEKIYTTLLGIYFKVGKV 285

Query: 219 ELMENTLREMVSSG 231
           E   +   EM  +G
Sbjct: 286 EKALDLFEEMKRAG 298

BLAST of Sgr020530 vs. ExPASy Swiss-Prot
Match: O65567 (Pentatricopeptide repeat-containing protein At4g30825, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=At4g30825 PE=2 SV=2)

HSP 1 Score: 53.5 bits (127), Expect = 4.1e-06
Identity = 31/119 (26.05%), Positives = 57/119 (47.90%), Query Frame = 0

Query: 114 AQEIFWELKSEGSLLNNSTFSALMECYIDGGLLHQAQAIWEEM-LNSCFVPSVHVISKLF 173
           A++++  LKS G +L+   FS ++  Y+  G L +A ++ E M      VP V++   + 
Sbjct: 577 AEKLYLNLKSSGVVLDRIGFSIVVRMYVKAGSLEEACSVLEIMDEQKDIVPDVYLFRDML 636

Query: 174 STYGKMGHFDDIIKALDQVKLRYLHLLPEAYSLAISCFGKHGQLELMENTLREMVSSGF 232
             Y K    D +     +++   +H   E Y+  I+C  +   L+ +  T  EM+  GF
Sbjct: 637 RIYQKCDLQDKLQHLYYRIRKSGIHWNQEMYNCVINCCARALPLDELSGTFEEMIRYGF 695

BLAST of Sgr020530 vs. ExPASy Swiss-Prot
Match: Q9SAK0 (Pentatricopeptide repeat-containing protein At1g79490, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=EMB2217 PE=2 SV=1)

HSP 1 Score: 53.1 bits (126), Expect = 5.3e-06
Identity = 35/146 (23.97%), Positives = 66/146 (45.21%), Query Frame = 0

Query: 85  QEGSFVGDDFYISHSRVIKKLSQRRMPVLAQEIFWELKSEGSLLNNSTFSALMECYIDGG 144
           Q+ S  GD  + ++++VI+ L++     +A   F + +  G  ++  T++ LM  +++ G
Sbjct: 233 QDSSSHGDLSFNAYNQVIQYLAKAEKLEVAFCCFKKAQESGCKIDTQTYNNLMMLFLNKG 292

Query: 145 LLHQAQAIWEEMLNSCFVPSVHVISKLFSTYGKMGHFDDIIKALDQVKLRYLHLLPEAYS 204
           L ++A  I+E M  +  +        +  +  K G  D   K   Q+K R L      +S
Sbjct: 293 LPYKAFEIYESMEKTDSLLDGSTYELIIPSLAKSGRLDAAFKLFQQMKERKLRPSFSVFS 352

Query: 205 LAISCFGKHGQLELMENTLREMVSSG 231
             +   GK G+L+       EM   G
Sbjct: 353 SLVDSMGKAGRLDTSMKVYMEMQGFG 378

BLAST of Sgr020530 vs. ExPASy TrEMBL
Match: A0A6J1D726 (pentatricopeptide repeat-containing protein At3g42630 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111018218 PE=4 SV=1)

HSP 1 Score: 283.1 bits (723), Expect = 1.2e-72
Identity = 152/231 (65.80%), Positives = 163/231 (70.56%), Query Frame = 0

Query: 1   MEMNSLAAMAAHPHCFPRNLKQKRLIFSHQRRSAGRAPAQGVATALGKTLASIVNLTITF 60
           M+MNSLAAM   P  FP NLKQKRLIFSH RRS GR  A+                    
Sbjct: 1   MDMNSLAAMLPLPQIFPANLKQKRLIFSHHRRSVGRTSAR-------------------- 60

Query: 61  EVGDGRQAIDEWTDVTYSIPWQHKQEGSFVGDDFYISHSRVIKKLSQRRMPVLAQEIFWE 120
                                + K EGSFV DDFY  HS++IKKLS+RR+PVLAQEIFWE
Sbjct: 61  ---------------------KDKGEGSFVDDDFYADHSQMIKKLSERRLPVLAQEIFWE 120

Query: 121 LKSEGSLLNNSTFSALMECYIDGGLLHQAQAIWEEMLNSCFVPSVHVISKLFSTYGKMGH 180
           LKSEGS L NST S LME YIDGG L QAQAIWEEMLNS FVPSV++ISKLF T+GKMG 
Sbjct: 121 LKSEGSPLKNSTLSTLMESYIDGGRLLQAQAIWEEMLNSSFVPSVNLISKLFHTFGKMGR 180

Query: 181 FDDIIKALDQVKLRYLHLLPEAYSLAISCFGKHGQLELMENTLREMVSSGF 232
           FDDIIK LDQVKLRY HLLPEA+SLAISCFGKHGQLELMENTLREMVSSGF
Sbjct: 181 FDDIIKVLDQVKLRYSHLLPEAFSLAISCFGKHGQLELMENTLREMVSSGF 190

BLAST of Sgr020530 vs. ExPASy TrEMBL
Match: A0A6J1FC40 (pentatricopeptide repeat-containing protein At3g42630 OS=Cucurbita moschata OX=3662 GN=LOC111444056 PE=4 SV=1)

HSP 1 Score: 275.0 bits (702), Expect = 3.2e-70
Identity = 150/230 (65.22%), Positives = 163/230 (70.87%), Query Frame = 0

Query: 1   MEMNSLAAMAAHPHCFPRNLKQKRLIFSHQRRSAGRAPAQGVATALGKTLASIVNLTITF 60
           ME+NSL A  A+PHCFP NLKQKR I   QRR AGRA A+ +                  
Sbjct: 1   MELNSLVAAVANPHCFPTNLKQKRPILFQQRRPAGRASARKILR---------------- 60

Query: 61  EVGDGRQAIDEWTDVTYSIPWQHKQEGSFVGDDFYISHSRVIKKLSQRRMPVLAQEIFWE 120
                                Q+K++GSF GDD  I+HS+VI+KLSQRR PVLAQEIF E
Sbjct: 61  ---------------------QNKEKGSFAGDDSSINHSQVIEKLSQRRTPVLAQEIFLE 120

Query: 121 LKSEGSLLNNSTFSALMECYIDGGLLHQAQAIWEEMLNSCFVPSVHVISKLFSTYGKMGH 180
           LKSE   LNNST S+LM  YIDGGLL QA+AIWEEMLNSCFVPSV VISKL +TYGKM  
Sbjct: 121 LKSEDFPLNNSTLSSLMVRYIDGGLLLQAEAIWEEMLNSCFVPSVLVISKLLNTYGKMRR 180

Query: 181 FDDIIKALDQVKLRYLHLLPEAYSLAISCFGKHGQLELMENTLREMVSSG 231
           FDDIIK LDQVKLRYLHLLPEAYSLAISCFGKHGQLELMENTLREMVSSG
Sbjct: 181 FDDIIKVLDQVKLRYLHLLPEAYSLAISCFGKHGQLELMENTLREMVSSG 193

BLAST of Sgr020530 vs. ExPASy TrEMBL
Match: A0A6J1HSM5 (pentatricopeptide repeat-containing protein At3g42630 OS=Cucurbita maxima OX=3661 GN=LOC111465788 PE=4 SV=1)

HSP 1 Score: 269.2 bits (687), Expect = 1.7e-68
Identity = 147/230 (63.91%), Positives = 160/230 (69.57%), Query Frame = 0

Query: 1   MEMNSLAAMAAHPHCFPRNLKQKRLIFSHQRRSAGRAPAQGVATALGKTLASIVNLTITF 60
           ME+NSL A  A+PHCFP NLKQKR      RR AGRA A+ +                  
Sbjct: 1   MELNSLVAAVANPHCFPTNLKQKRSFLFQHRRPAGRASARKILR---------------- 60

Query: 61  EVGDGRQAIDEWTDVTYSIPWQHKQEGSFVGDDFYISHSRVIKKLSQRRMPVLAQEIFWE 120
                                Q+K++GSF  DD  I+HS+VI+KLSQRR PVLAQEIF E
Sbjct: 61  ---------------------QNKEKGSFADDDSSINHSQVIEKLSQRRTPVLAQEIFLE 120

Query: 121 LKSEGSLLNNSTFSALMECYIDGGLLHQAQAIWEEMLNSCFVPSVHVISKLFSTYGKMGH 180
           LKSE   LNNST S+LM  YIDGGLL QA+AIWEEMLNSCFVPSV VISKL +TYGKM  
Sbjct: 121 LKSEDFPLNNSTLSSLMVRYIDGGLLLQAEAIWEEMLNSCFVPSVLVISKLLNTYGKMRR 180

Query: 181 FDDIIKALDQVKLRYLHLLPEAYSLAISCFGKHGQLELMENTLREMVSSG 231
           FDDIIK LDQVKLRYLHLLPEAYSLAISCFGKHGQLELMENTLREMVSSG
Sbjct: 181 FDDIIKVLDQVKLRYLHLLPEAYSLAISCFGKHGQLELMENTLREMVSSG 193

BLAST of Sgr020530 vs. ExPASy TrEMBL
Match: A0A1S3BP30 (pentatricopeptide repeat-containing protein At3g42630-like OS=Cucumis melo OX=3656 GN=LOC103492176 PE=4 SV=1)

HSP 1 Score: 239.6 bits (610), Expect = 1.5e-59
Identity = 133/231 (57.58%), Positives = 152/231 (65.80%), Query Frame = 0

Query: 1   MEMNSLAAMAAHPHCFPRNLKQKRLIFSHQRRSAGRAPAQGVATALGKTLASIVNLTITF 60
           M+M+ L    A+P+CFPRN KQ+R + SHQR +A RA +  +                  
Sbjct: 1   MKMDFLVTTVANPYCFPRNFKQERPMLSHQRHTACRASSCKI------------------ 60

Query: 61  EVGDGRQAIDEWTDVTYSIPWQHKQEGSFVGDDFYISHSRVIKKLSQRRMPVLAQEIFWE 120
                               +Q   EGS V   F I++S+VIKKLS+RRMP LA+EIF E
Sbjct: 61  --------------------FQQHNEGSSVDYGFNINNSQVIKKLSRRRMPTLAKEIFLE 120

Query: 121 LKSEGSLLNNSTFSALMECYIDGGLLHQAQAIWEEMLNSCFVPSVHVISKLFSTYGKMGH 180
           LKSEG  LNNST S +M  YID G   QAQA+WEEMLNSCF PSV VISKLF+ YGKMGH
Sbjct: 121 LKSEGFPLNNSTLSTIMVHYIDDGSPLQAQAMWEEMLNSCFEPSVQVISKLFNAYGKMGH 180

Query: 181 FDDIIKALDQVKLRYLHLLPEAYSLAISCFGKHGQLELMENTLREMVSSGF 232
           FD I K LDQVKLRY HLLPEAYSLAISCFGKH QLELME+TLREMVSSGF
Sbjct: 181 FDYITKVLDQVKLRYSHLLPEAYSLAISCFGKHKQLELMESTLREMVSSGF 193

BLAST of Sgr020530 vs. ExPASy TrEMBL
Match: A0A6J1D9V6 (pentatricopeptide repeat-containing protein At3g42630 isoform X2 OS=Momordica charantia OX=3673 GN=LOC111018218 PE=4 SV=1)

HSP 1 Score: 226.1 bits (575), Expect = 1.7e-55
Identity = 112/131 (85.50%), Positives = 119/131 (90.84%), Query Frame = 0

Query: 101 VIKKLSQRRMPVLAQEIFWELKSEGSLLNNSTFSALMECYIDGGLLHQAQAIWEEMLNSC 160
           +IKKLS+RR+PVLAQEIFWELKSEGS L NST S LME YIDGG L QAQAIWEEMLNS 
Sbjct: 1   MIKKLSERRLPVLAQEIFWELKSEGSPLKNSTLSTLMESYIDGGRLLQAQAIWEEMLNSS 60

Query: 161 FVPSVHVISKLFSTYGKMGHFDDIIKALDQVKLRYLHLLPEAYSLAISCFGKHGQLELME 220
           FVPSV++ISKLF T+GKMG FDDIIK LDQVKLRY HLLPEA+SLAISCFGKHGQLELME
Sbjct: 61  FVPSVNLISKLFHTFGKMGRFDDIIKVLDQVKLRYSHLLPEAFSLAISCFGKHGQLELME 120

Query: 221 NTLREMVSSGF 232
           NTLREMVSSGF
Sbjct: 121 NTLREMVSSGF 131

BLAST of Sgr020530 vs. TAIR 10
Match: AT3G42630.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 137.1 bits (344), Expect = 2.0e-32
Identity = 67/138 (48.55%), Positives = 91/138 (65.94%), Query Frame = 0

Query: 96  ISHSRVIKKLSQRRMPVLAQEIFWELKSEGSLLNNSTFSALMECYIDGGLLHQAQAIWEE 155
           + ++ +++ LSQRR+P +A EIF + KS   L N  T  ALM C+ + G + +A+ IW+E
Sbjct: 48  VDYAPLVQTLSQRRLPDVAHEIFLQTKSVNLLPNYRTLCALMLCFAENGFVLRARTIWDE 107

Query: 156 MLNSCFVPSVHVISKLFSTYGKMGHFDDIIKALDQVKLRYLHLLPEAYSLAISCFGKHGQ 215
           ++NSCFVP V V+SKL S Y + G FD++ K    V  R+  LLP   SLAISCFGK+GQ
Sbjct: 108 IINSCFVPDVFVVSKLISAYEQFGCFDEVAKITKDVAARHSKLLPVVSSLAISCFGKNGQ 167

Query: 216 LELMENTLREMVSSGFQL 234
           LELME  + EM S G  L
Sbjct: 168 LELMEGVIEEMDSKGVLL 185
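The At3g42630 alignment appears twice on this page with the same bit score (137.1) but different E-values: 2.8e-31 against ExPASy Swiss-Prot and 2.0e-32 against TAIR 10. This follows from the standard BLAST relation E = m * n * 2^(-S'), where S' is the bit score and m * n is the effective search space. The sketch below inverts that relation; the resulting search-space figures are only what the rounded values on this page imply, not numbers reported by the database.

```python
def effective_search_space(evalue, bit_score):
    """Invert E = m*n * 2**(-S') to recover the effective search space m*n."""
    return evalue * 2.0 ** bit_score

# Values copied from the two reports of the same HSP on this page.
swissprot_space = effective_search_space(2.8e-31, 137.1)
tair_space = effective_search_space(2.0e-32, 137.1)

# With equal bit scores, the E-value ratio equals the search-space ratio.
ratio = swissprot_space / tair_space
print(f"Swiss-Prot search space is about {ratio:.1f} times that of TAIR 10")
```

The practical consequence is that E-values from different databases are not directly comparable, whereas bit scores are.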

BLAST of Sgr020530 vs. TAIR 10
Match: AT5G48730.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 57.0 bits (136), Expect = 2.6e-08
Identity = 36/130 (27.69%), Positives = 65/130 (50.00%), Query Frame = 0

Query: 100 RVIKKLSQRRMPVLAQEIFWELKSEGSLLNNSTFSALMECYIDGGLLHQAQAIWEEMLNS 159
           ++I  L + + P  A E+F E+ +EG ++N+  ++AL+  Y   G    A  + E M +S
Sbjct: 155 KLIVMLGKCKQPEKAHELFQEMINEGCVVNHEVYTALVSAYSRSGRFDAAFTLLERMKSS 214

Query: 160 --CFVPSVHVISKLFSTYGKMGHFDDIIKALDQVKLRYLHLLPEAYSLAISCFGKHGQLE 219
             C  P VH  S L  ++ ++  FD +   L  ++ + +      Y+  I  +GK     
Sbjct: 215 HNC-QPDVHTYSILIKSFLQVFAFDKVQDLLSDMRRQGIRPNTITYNTLIDAYGKAKMFV 274

Query: 220 LMENTLREMV 228
            ME+TL +M+
Sbjct: 275 EMESTLIQML 283

BLAST of Sgr020530 vs. TAIR 10
Match: AT3G16010.1 (Pentatricopeptide repeat (PPR-like) superfamily protein )

HSP 1 Score: 55.8 bits (133), Expect = 5.8e-08
Identity = 33/134 (24.63%), Positives = 64/134 (47.76%), Query Frame = 0

Query: 99  SRVIKKLSQRRMPVLAQEIFWELKSEGSLLNNSTFSALMECYIDGGLLHQAQAIWEEMLN 158
           S ++K L + +M   A  +F++ K       +ST+++++   +  G   +   ++ EM N
Sbjct: 166 SELVKALGRAKMVSKALSVFYQAKGRKCKPTSSTYNSVILMLMQEGQHEKVHEVYTEMCN 225

Query: 159 --SCFVPSVHVISKLFSTYGKMGHFDDIIKALDQVKLRYLHLLPEAYSLAISCFGKHGQL 218
              CF P     S L S+Y K+G  D  I+  D++K   +    + Y+  +  + K G++
Sbjct: 226 EGDCF-PDTITYSALISSYEKLGRNDSAIRLFDEMKDNCMQPTEKIYTTLLGIYFKVGKV 285

Query: 219 ELMENTLREMVSSG 231
           E   +   EM  +G
Sbjct: 286 EKALDLFEEMKRAG 298

BLAST of Sgr020530 vs. TAIR 10
Match: AT4G30825.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 53.5 bits (127), Expect = 2.9e-07
Identity = 31/119 (26.05%), Positives = 57/119 (47.90%), Query Frame = 0

Query: 114 AQEIFWELKSEGSLLNNSTFSALMECYIDGGLLHQAQAIWEEM-LNSCFVPSVHVISKLF 173
           A++++  LKS G +L+   FS ++  Y+  G L +A ++ E M      VP V++   + 
Sbjct: 577 AEKLYLNLKSSGVVLDRIGFSIVVRMYVKAGSLEEACSVLEIMDEQKDIVPDVYLFRDML 636

Query: 174 STYGKMGHFDDIIKALDQVKLRYLHLLPEAYSLAISCFGKHGQLELMENTLREMVSSGF 232
             Y K    D +     +++   +H   E Y+  I+C  +   L+ +  T  EM+  GF
Sbjct: 637 RIYQKCDLQDKLQHLYYRIRKSGIHWNQEMYNCVINCCARALPLDELSGTFEEMIRYGF 695

BLAST of Sgr020530 vs. TAIR 10
Match: AT1G79490.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 53.1 bits (126), Expect = 3.8e-07
Identity = 35/146 (23.97%), Positives = 66/146 (45.21%), Query Frame = 0

Query: 85  QEGSFVGDDFYISHSRVIKKLSQRRMPVLAQEIFWELKSEGSLLNNSTFSALMECYIDGG 144
           Q+ S  GD  + ++++VI+ L++     +A   F + +  G  ++  T++ LM  +++ G
Sbjct: 233 QDSSSHGDLSFNAYNQVIQYLAKAEKLEVAFCCFKKAQESGCKIDTQTYNNLMMLFLNKG 292

Query: 145 LLHQAQAIWEEMLNSCFVPSVHVISKLFSTYGKMGHFDDIIKALDQVKLRYLHLLPEAYS 204
           L ++A  I+E M  +  +        +  +  K G  D   K   Q+K R L      +S
Sbjct: 293 LPYKAFEIYESMEKTDSLLDGSTYELIIPSLAKSGRLDAAFKLFQQMKERKLRPSFSVFS 352

Query: 205 LAISCFGKHGQLELMENTLREMVSSG 231
             +   GK G+L+       EM   G
Sbjct: 353 SLVDSMGKAGRLDTSMKVYMEMQGFG 378

The following BLAST results are available for this feature:

NCBI nr
Match Name       E-value   Identity (%)   Description
XP_022149915.1   2.4e-72   65.80          pentatricopeptide repeat-containing protein At3g42630 isoform X1 [Momordica char...
KAG7020945.1     7.8e-71   65.65          Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma sub...
XP_022937749.1   6.6e-70   65.22          pentatricopeptide repeat-containing protein At3g42630 [Cucurbita moschata]
XP_023536679.1   6.6e-70   64.78          pentatricopeptide repeat-containing protein At3g42630 [Cucurbita pepo subsp. pep...
KAG6586124.1     8.6e-70   65.22          Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma sub...

ExPASy Swiss-Prot
Match Name       E-value   Identity (%)   Description
Q9M2A1           2.8e-31   48.55          Pentatricopeptide repeat-containing protein At3g42630 OS=Arabidopsis thaliana OX...
Q9FKC3           3.7e-07   27.69          Pentatricopeptide repeat-containing protein At5g48730, chloroplastic OS=Arabidop...
Q9LW84           8.2e-07   24.63          Pentatricopeptide repeat-containing protein At3g16010 OS=Arabidopsis thaliana OX...
O65567           4.1e-06   26.05          Pentatricopeptide repeat-containing protein At4g30825, chloroplastic OS=Arabidop...
Q9SAK0           5.3e-06   23.97          Pentatricopeptide repeat-containing protein At1g79490, mitochondrial OS=Arabidop...

ExPASy TrEMBL
Match Name       E-value   Identity (%)   Description
A0A6J1D726       1.2e-72   65.80          pentatricopeptide repeat-containing protein At3g42630 isoform X1 OS=Momordica ch...
A0A6J1FC40       3.2e-70   65.22          pentatricopeptide repeat-containing protein At3g42630 OS=Cucurbita moschata OX=3...
A0A6J1HSM5       1.7e-68   63.91          pentatricopeptide repeat-containing protein At3g42630 OS=Cucurbita maxima OX=366...
A0A1S3BP30       1.5e-59   57.58          pentatricopeptide repeat-containing protein At3g42630-like OS=Cucumis melo OX=36...
A0A6J1D9V6       1.7e-55   85.50          pentatricopeptide repeat-containing protein At3g42630 isoform X2 OS=Momordica ch...

TAIR 10
Match Name       E-value   Identity (%)   Description
AT3G42630.1      2.0e-32   48.55          Pentatricopeptide repeat (PPR) superfamily protein
AT5G48730.1      2.6e-08   27.69          Pentatricopeptide repeat (PPR) superfamily protein
AT3G16010.1      5.8e-08   24.63          Pentatricopeptide repeat (PPR-like) superfamily protein
AT4G30825.1      2.9e-07   26.05          Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G79490.1      3.8e-07   23.97          Pentatricopeptide repeat (PPR) superfamily protein

InterPro
Analysis Name: InterPro Annotations of Monk fruit (Qingpiguo) v1
Date Performed: 2022-08-01
IPR Term    IPR Description                                     Source    Source Term   Source Description               Coord      E-value   Score
IPR011990   Tetratricopeptide-like helical domain superfamily   GENE3D    1.25.40.10    Tetratricopeptide repeat domain  97..235    8.4E-20   73.4
IPR002885   Pentatricopeptide repeat                            PFAM      PF01535       PPR                              202..231   7.7E-4    19.5
IPR002885   Pentatricopeptide repeat                            PFAM      PF01535       PPR                              131..159   0.031     14.5
IPR002885   Pentatricopeptide repeat                            TIGRFAM   TIGR00756     TIGR00756                        202..232   9.0E-5    20.4
IPR002885   Pentatricopeptide repeat                            TIGRFAM   TIGR00756     TIGR00756                        132..165   0.0017    16.4
IPR002885   Pentatricopeptide repeat                            PROSITE   PS51375       PPR                              129..163   -         9.96388
IPR002885   Pentatricopeptide repeat                            PROSITE   PS51375       PPR                              199..233   -         8.714292
None        No IPR available                                    PANTHER   PTHR47493     OS08G0520200 PROTEIN             48..233    -         -
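The coord values in the InterPro annotations above are residue ranges on the protein sequence. The sketch below extracts the annotated segments, assuming the ranges are 1-based and inclusive (the usual InterProScan convention, not stated on this page); `slice_1based` is a hypothetical helper and the `DOMAINS` list simply copies a few ranges from the annotations.

```python
PROTEIN = (
    "MEMNSLAAMAAHPHCFPRNLKQKRLIFSHQRRSAGRAPAQGVATALGKTLASIVNLTITF"
    "EVGDGRQAIDEWTDVTYSIPWQHKQEGSFVGDDFYISHSRVIKKLSQRRMPVLAQEIFWE"
    "LKSEGSLLNNSTFSALMECYIDGGLLHQAQAIWEEMLNSCFVPSVHVISKLFSTYGKMGH"
    "FDDIIKALDQVKLRYLHLLPEAYSLAISCFGKHGQLELMENTLREMVSSGFQLIPLLEIP"
    "LLYTTAFWFFDGDGNCLWPP"
)

def slice_1based(seq, start, end):
    """Return residues start..end, with both ends 1-based and inclusive."""
    if not 1 <= start <= end <= len(seq):
        raise ValueError(f"range {start}..{end} outside sequence of length {len(seq)}")
    return seq[start - 1:end]

# (source, source term, start, end) copied from the annotations above.
DOMAINS = [
    ("GENE3D", "1.25.40.10", 97, 235),
    ("PFAM", "PF01535", 131, 159),
    ("PFAM", "PF01535", 202, 231),
]

for source, term, start, end in DOMAINS:
    segment = slice_1based(PROTEIN, start, end)
    print(f"{source} {term} {start}..{end} ({len(segment)} aa): {segment}")
```

Both Pfam PPR matches fall inside the GENE3D helical-domain range 97..235, which is consistent with PPR motifs being a subtype of tetratricopeptide-like helical repeats.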

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name   Unique Name   Type
Sgr020530.1    Sgr020530.1   mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category             Term Accession   Term Name
molecular_function   GO:0005515       protein binding