Sgr016806 (gene) Monk fruit (Qingpiguo) v1

Overview
NameSgr016806
Typegene
OrganismSiraitia grosvenorii (Monk fruit (Qingpiguo) v1)
DescriptionPentatricopeptide repeat-containing protein
Locationtig00153010: 1052877 .. 1054995 (+)
RNA-Seq ExpressionSgr016806
SyntenySgr016806
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
GTTAATCATGCAAGGATTTTGTTTGACTATTCTGATAAGTTAGATGGTGTTTCTTGGAATTCGTTGATTGCTGGTTATGCACAAAATGGAAAATATGAGGAGCTGTTGATAGTTCTGGAGAAAATGCATCAATCTGGATTGGCTTTGAACACTTATACTCTAGGAAGTGCCCTGAAGGCTTGTAGCTCAAACTTCAATGGTTCAAAAAAGTTTGGGACAATGCTACATAGCCTCGCGTTCAAACTTGGTCTACATCTCGATGTCGTTGTTGGGACTGCATTGCTTGATATGTATGCAAAAACTGGAAGTATGGATGATGCTATTCAAATTTTTGACCAAGTGTTGGACAAAAATGTTGTGATGTATAATGCAATGATGGCTGGTTTGCTCCAACAAGAGACAATTGACGATAAATGTGCATACAAAGCCCTAAATCTTTTCTTTGAGATGAAAGGCTGTGGAATAAAACCTTCCATGTTTACATATTCAAGCTTACTTAAAGCTTGTATTAGTATTGAGGTTTTTGAATTTGCAAAGCAAATTCATGCTTTAATATGCAAGAATGGCCTTCAGTCGGACGAGTACATTGGAAGTGTCCTTATTGATTTGTACTCTTTGTTAGGTTCAATGAAGGATGCTTTATCATGTTTTAACTCAATTCATAATTTGACCATAGTTCCAATGACAGCCATGATTGTTGTTTATCTTCAAAATGGGGAATTTGAAAGTGCATTGGCTTTGTTCTATGAACTTTTGTCATCCGAAGAGAAACTTGATGAGTTCATTTTGTCCACAATCTTGAGTGCTTGTGGAAATATGGGCATGTTAAGATCTGGAGAACAAATCCAGGGGTATGCAACAAAAATAGGCATCTCAAGATTCACCATCTTTCAGAATTCACAGATTTTGATGTATGCTAAGTCTGGAGATCTATACTCGACCAATCTAACCTTTCAACAGATGGAAAATCCTGATGTTGTGTCCTGGTCAACAATGATCCACAGCAATGCGCAGCATGGGCATGCAATGGAGGCTTTGAGATTCTTTGAGCTGATGAAGAGTTGTGGAATTGAGCCTAACCACTTTGGCTTCCTTGGAGTTCTAATTGCATGTAGTCACAGAGGGCTTGTTGAAGAAGGACTAAGGTAGGTCATGAAGCTCCTCTTTCTTGTTATTCAATACCAAATGTTTAGATGTTGAAGGCTCGGTGATAAGAGGATTATAACAGGTGTTGTGGAGGCATAAGCCTAATTTTAAATTTTCACTTAGTCCTATAATTTTATGAAAAAGATTTGCTATTTGGTTTGATTTTTATTTATTTATTTATCTCTCTCTTCTGCTCCCTCAAGGCTTAAACAGGGTTTGGAGGTGTAGACTTCTGTCTCATAGTTTTCACATGCTGGTGTCTTCCTTTAGCTTTGATTCTTTCCCTGAATTCAATCCAAGAAGCTCTAATATATGGTTAAATTTTCAATCCAAAAAAATGTGAAGGGCACTTTAGCTCTCTGCTGTTTTGGGAGGGTCTCTCTTTTGCATGAATATATTTCTGTTTCTTAATCCATCCATCTGGTTTGCTGGTGACAGGTACTTTGATACCATGAAGAAAGATTACAATATGACAAGTCATGTCAAGTACTGTACCACTGCCTGTGTTGTTGATCTTCTTGGTCGAGCTGGAAGGTTGGTTGAGGCAGAAAGTTTAATTTTGTGTTCGGGTTTTGAGCACGAACCAGTGATGTGGCGAGCTTTACTAAGTGCATGCAATGTTCATAAGGACACATTCACTGCACAACGTGTTGCAGAGAAAGTAATAGAGCTTGAACCTCTGGCATCTGCATCTTATGTGCTTCTTTATAATATTTATATGGATGCTGGAAACAAGTCAGATGCCTTGAAAGTTAGGAAATTAATGGAAGCTCGGAGAATTAAGAAGGAACCTGGTCTAAGCTGGATAGAAGTAGGAGATAAAGTCTACTCATTTGTTTCTGGTGATCGATCTCACAAAAATAGTGAATTGATTTATGCGCAGTTGCAGGAAATGTTGGCAAAGACAAAGAGTACAGGGTTGGTGAAGGACATATTTGATTACAAAATTGAGCATGAATACATGGCACTATAA

mRNA sequence

GTTAATCATGCAAGGATTTTGTTTGACTATTCTGATAAGTTAGATGGTGTTTCTTGGAATTCGTTGATTGCTGGTTATGCACAAAATGGAAAATATGAGGAGCTGTTGATAGTTCTGGAGAAAATGCATCAATCTGGATTGGCTTTGAACACTTATACTCTAGGAAGTGCCCTGAAGGCTTGTAGCTCAAACTTCAATGGTTCAAAAAAGTTTGGGACAATGCTACATAGCCTCGCGTTCAAACTTGGTCTACATCTCGATGTCGTTGTTGGGACTGCATTGCTTGATATGTATGCAAAAACTGGAAGTATGGATGATGCTATTCAAATTTTTGACCAAGTGTTGGACAAAAATGTTGTGATGTATAATGCAATGATGGCTGGTTTGCTCCAACAAGAGACAATTGACGATAAATGTGCATACAAAGCCCTAAATCTTTTCTTTGAGATGAAAGGCTGTGGAATAAAACCTTCCATGTTTACATATTCAAGCTTACTTAAAGCTTGTATTAGTATTGAGGTTTTTGAATTTGCAAAGCAAATTCATGCTTTAATATGCAAGAATGGCCTTCAGTCGGACGAGTACATTGGAAGTGTCCTTATTGATTTGTACTCTTTGTTAGGTTCAATGAAGGATGCTTTATCATGTTTTAACTCAATTCATAATTTGACCATAGTTCCAATGACAGCCATGATTGTTGTTTATCTTCAAAATGGGGAATTTGAAAGTGCATTGGCTTTGTTCTATGAACTTTTGTCATCCGAAGAGAAACTTGATGAGTTCATTTTGTCCACAATCTTGAGTGCTTGTGGAAATATGGGCATGTTAAGATCTGGAGAACAAATCCAGGGGTATGCAACAAAAATAGGCATCTCAAGATTCACCATCTTTCAGAATTCACAGATTTTGATGTATGCTAAGTCTGGAGATCTATACTCGACCAATCTAACCTTTCAACAGATGGAAAATCCTGATGTTGTGTCCTGGTCAACAATGATCCACAGCAATGCGCAGCATGGGCATGCAATGGAGGCTTTGAGATTCTTTGAGCTGATGAAGAGTTGTGGAATTGAGCCTAACCACTTTGGCTTCCTTGGAGTTCTAATTGCATGTAGTCACAGAGGGCTTGTTGAAGAAGGACTAAGGTACTTTGATACCATGAAGAAAGATTACAATATGACAAGTCATGTCAAGTACTGTACCACTGCCTGTGTTGTTGATCTTCTTGGTCGAGCTGGAAGGTTGGTTGAGGCAGAAAGTTTAATTTTGTGTTCGGGTTTTGAGCACGAACCAGTGATGTGGCGAGCTTTACTAAGTGCATGCAATGTTCATAAGGACACATTCACTGCACAACGTGTTGCAGAGAAAGTAATAGAGCTTGAACCTCTGGCATCTGCATCTTATGTGCTTCTTTATAATATTTATATGGATGCTGGAAACAAGTCAGATGCCTTGAAAGTTAGGAAATTAATGGAAGCTCGGAGAATTAAGAAGGAACCTGGTCTAAGCTGGATAGAAGTAGGAGATAAAGTCTACTCATTTGTTTCTGGTGATCGATCTCACAAAAATAGTGAATTGATTTATGCGCAGTTGCAGGAAATGTTGGCAAAGACAAAGAGTACAGGGTTGGTGAAGGACATATTTGATTACAAAATTGAGCATGAATACATGGCACTATAA

Coding sequence (CDS)

GTTAATCATGCAAGGATTTTGTTTGACTATTCTGATAAGTTAGATGGTGTTTCTTGGAATTCGTTGATTGCTGGTTATGCACAAAATGGAAAATATGAGGAGCTGTTGATAGTTCTGGAGAAAATGCATCAATCTGGATTGGCTTTGAACACTTATACTCTAGGAAGTGCCCTGAAGGCTTGTAGCTCAAACTTCAATGGTTCAAAAAAGTTTGGGACAATGCTACATAGCCTCGCGTTCAAACTTGGTCTACATCTCGATGTCGTTGTTGGGACTGCATTGCTTGATATGTATGCAAAAACTGGAAGTATGGATGATGCTATTCAAATTTTTGACCAAGTGTTGGACAAAAATGTTGTGATGTATAATGCAATGATGGCTGGTTTGCTCCAACAAGAGACAATTGACGATAAATGTGCATACAAAGCCCTAAATCTTTTCTTTGAGATGAAAGGCTGTGGAATAAAACCTTCCATGTTTACATATTCAAGCTTACTTAAAGCTTGTATTAGTATTGAGGTTTTTGAATTTGCAAAGCAAATTCATGCTTTAATATGCAAGAATGGCCTTCAGTCGGACGAGTACATTGGAAGTGTCCTTATTGATTTGTACTCTTTGTTAGGTTCAATGAAGGATGCTTTATCATGTTTTAACTCAATTCATAATTTGACCATAGTTCCAATGACAGCCATGATTGTTGTTTATCTTCAAAATGGGGAATTTGAAAGTGCATTGGCTTTGTTCTATGAACTTTTGTCATCCGAAGAGAAACTTGATGAGTTCATTTTGTCCACAATCTTGAGTGCTTGTGGAAATATGGGCATGTTAAGATCTGGAGAACAAATCCAGGGGTATGCAACAAAAATAGGCATCTCAAGATTCACCATCTTTCAGAATTCACAGATTTTGATGTATGCTAAGTCTGGAGATCTATACTCGACCAATCTAACCTTTCAACAGATGGAAAATCCTGATGTTGTGTCCTGGTCAACAATGATCCACAGCAATGCGCAGCATGGGCATGCAATGGAGGCTTTGAGATTCTTTGAGCTGATGAAGAGTTGTGGAATTGAGCCTAACCACTTTGGCTTCCTTGGAGTTCTAATTGCATGTAGTCACAGAGGGCTTGTTGAAGAAGGACTAAGGTACTTTGATACCATGAAGAAAGATTACAATATGACAAGTCATGTCAAGTACTGTACCACTGCCTGTGTTGTTGATCTTCTTGGTCGAGCTGGAAGGTTGGTTGAGGCAGAAAGTTTAATTTTGTGTTCGGGTTTTGAGCACGAACCAGTGATGTGGCGAGCTTTACTAAGTGCATGCAATGTTCATAAGGACACATTCACTGCACAACGTGTTGCAGAGAAAGTAATAGAGCTTGAACCTCTGGCATCTGCATCTTATGTGCTTCTTTATAATATTTATATGGATGCTGGAAACAAGTCAGATGCCTTGAAAGTTAGGAAATTAATGGAAGCTCGGAGAATTAAGAAGGAACCTGGTCTAAGCTGGATAGAAGTAGGAGATAAAGTCTACTCATTTGTTTCTGGTGATCGATCTCACAAAAATAGTGAATTGATTTATGCGCAGTTGCAGGAAATGTTGGCAAAGACAAAGAGTACAGGGTTGGTGAAGGACATATTTGATTACAAAATTGAGCATGAATACATGGCACTATAA

Protein sequence

VNHARILFDYSDKLDGVSWNSLIAGYAQNGKYEELLIVLEKMHQSGLALNTYTLGSALKACSSNFNGSKKFGTMLHSLAFKLGLHLDVVVGTALLDMYAKTGSMDDAIQIFDQVLDKNVVMYNAMMAGLLQQETIDDKCAYKALNLFFEMKGCGIKPSMFTYSSLLKACISIEVFEFAKQIHALICKNGLQSDEYIGSVLIDLYSLLGSMKDALSCFNSIHNLTIVPMTAMIVVYLQNGEFESALALFYELLSSEEKLDEFILSTILSACGNMGMLRSGEQIQGYATKIGISRFTIFQNSQILMYAKSGDLYSTNLTFQQMENPDVVSWSTMIHSNAQHGHAMEALRFFELMKSCGIEPNHFGFLGVLIACSHRGLVEEGLRYFDTMKKDYNMTSHVKYCTTACVVDLLGRAGRLVEAESLILCSGFEHEPVMWRALLSACNVHKDTFTAQRVAEKVIELEPLASASYVLLYNIYMDAGNKSDALKVRKLMEARRIKKEPGLSWIEVGDKVYSFVSGDRSHKNSELIYAQLQEMLAKTKSTGLVKDIFDYKIEHEYMAL
Homology
BLAST of Sgr016806 vs. NCBI nr
Match: XP_022141889.1 (pentatricopeptide repeat-containing protein At3g13880 isoform X1 [Momordica charantia] >XP_022141890.1 pentatricopeptide repeat-containing protein At3g13880 isoform X1 [Momordica charantia] >XP_022141891.1 pentatricopeptide repeat-containing protein At3g13880 isoform X1 [Momordica charantia])

HSP 1 Score: 1013.1 bits (2618), Expect = 9.5e-292
Identity = 510/559 (91.23%), Postives = 529/559 (94.63%), Query Frame = 0

Query: 1   VNHARILFDYSDKLDGVSWNSLIAGYAQNGKYEELLIVLEKMHQSGLALNTYTLGSALKA 60
           V+HARILFDYSD LDGVSWNSLIAGYAQNGKYEELLI+LEKMHQSGLALNTYTLGSALKA
Sbjct: 211 VDHARILFDYSDNLDGVSWNSLIAGYAQNGKYEELLIILEKMHQSGLALNTYTLGSALKA 270

Query: 61  CSSNFNGSKKFGTMLHSLAFKLGLHLDVVVGTALLDMYAKTGSMDDAIQIFDQVLDKNVV 120
           CSSNFNGSK+FGTMLHSLA KLGLHLDVVVGTALLDMYAKTGS+DDAIQIFDQVLDKNVV
Sbjct: 271 CSSNFNGSKQFGTMLHSLAIKLGLHLDVVVGTALLDMYAKTGSLDDAIQIFDQVLDKNVV 330

Query: 121 MYNAMMAGLLQQETIDDKCAYKALNLFFEMKGCGIKPSMFTYSSLLKACISIEVFEFAKQ 180
           MYNAMMAGLLQQETI+DKCAYKAL LFFEMK CG+KPSMFTYSSLLKACI++EVFEFAKQ
Sbjct: 331 MYNAMMAGLLQQETIEDKCAYKALGLFFEMKSCGVKPSMFTYSSLLKACITVEVFEFAKQ 390

Query: 181 IHALICKNGLQSDEYIGSVLIDLYSLLGSMKDALSCFNSIHNLTIVPMTAMIVVYLQNGE 240
           IHALI KNGLQSDEYIGSVLID YSLLGSMKDALSCFNSIHNLTIVPMTAMIV YLQNGE
Sbjct: 391 IHALIFKNGLQSDEYIGSVLIDFYSLLGSMKDALSCFNSIHNLTIVPMTAMIVGYLQNGE 450

Query: 241 FESALALFYELLSSEEKLDEFILSTILSACGNMGMLRSGEQIQGYATKIGISRFTIFQNS 300
           FE ALALFYELL+SEEK DEFILSTIL AC NMGMLRSGEQIQGYATK GI +F IFQNS
Sbjct: 451 FEIALALFYELLASEEKPDEFILSTILGACANMGMLRSGEQIQGYATKTGILKFKIFQNS 510

Query: 301 QILMYAKSGDLYSTNLTFQQMENPDVVSWSTMIHSNAQHGHAMEALRFFELMKSCGIEPN 360
           QI MYAKSGDLYS NLTFQQMENPDVVSWSTMI S AQHGHAM+A RFFELMKS GIEPN
Sbjct: 511 QIFMYAKSGDLYSANLTFQQMENPDVVSWSTMICSAAQHGHAMKAFRFFELMKSSGIEPN 570

Query: 361 HFGFLGVLIACSHRGLVEEGLRYFDTMKKDYNMTSHVKYCTTACVVDLLGRAGRLVEAES 420
            F FLGVLIACSHRGLVEEGLRYFDTMKKDYNMTSHVK+C  ACVVDLLGRAGRLV+AES
Sbjct: 571 DFAFLGVLIACSHRGLVEEGLRYFDTMKKDYNMTSHVKHC--ACVVDLLGRAGRLVDAES 630

Query: 421 LILCSGFEHEPVMWRALLSACNVHKDTFTAQRVAEKVIELEPLASASYVLLYNIYMDAGN 480
           +ILCSGFEHEPVMWRALLSAC +HKDTFTAQRVAEKVIELEPL SASYVLL+NIYMDAGN
Sbjct: 631 IILCSGFEHEPVMWRALLSACLIHKDTFTAQRVAEKVIELEPLTSASYVLLHNIYMDAGN 690

Query: 481 KSDALKVRKLMEARRIKKEPGLSWIEVGDKVYSFVSGDRSHKNSELIYAQLQEMLAKTKS 540
           KS+ALKVRKLMEARRIKKEPGLSWIEVGD VYSFVSGDRSHKNSELIYAQL +MLAKTKS
Sbjct: 691 KSEALKVRKLMEARRIKKEPGLSWIEVGDNVYSFVSGDRSHKNSELIYAQLDDMLAKTKS 750

Query: 541 TGLVKDIFDYKIEHEYMAL 560
            GLV DIFDYK+EHEYMAL
Sbjct: 751 LGLVNDIFDYKMEHEYMAL 767

BLAST of Sgr016806 vs. NCBI nr
Match: XP_022141892.1 (pentatricopeptide repeat-containing protein At3g13880 isoform X2 [Momordica charantia] >XP_022141893.1 pentatricopeptide repeat-containing protein At3g13880 isoform X2 [Momordica charantia])

HSP 1 Score: 1013.1 bits (2618), Expect = 9.5e-292
Identity = 510/559 (91.23%), Postives = 529/559 (94.63%), Query Frame = 0

Query: 1   VNHARILFDYSDKLDGVSWNSLIAGYAQNGKYEELLIVLEKMHQSGLALNTYTLGSALKA 60
           V+HARILFDYSD LDGVSWNSLIAGYAQNGKYEELLI+LEKMHQSGLALNTYTLGSALKA
Sbjct: 74  VDHARILFDYSDNLDGVSWNSLIAGYAQNGKYEELLIILEKMHQSGLALNTYTLGSALKA 133

Query: 61  CSSNFNGSKKFGTMLHSLAFKLGLHLDVVVGTALLDMYAKTGSMDDAIQIFDQVLDKNVV 120
           CSSNFNGSK+FGTMLHSLA KLGLHLDVVVGTALLDMYAKTGS+DDAIQIFDQVLDKNVV
Sbjct: 134 CSSNFNGSKQFGTMLHSLAIKLGLHLDVVVGTALLDMYAKTGSLDDAIQIFDQVLDKNVV 193

Query: 121 MYNAMMAGLLQQETIDDKCAYKALNLFFEMKGCGIKPSMFTYSSLLKACISIEVFEFAKQ 180
           MYNAMMAGLLQQETI+DKCAYKAL LFFEMK CG+KPSMFTYSSLLKACI++EVFEFAKQ
Sbjct: 194 MYNAMMAGLLQQETIEDKCAYKALGLFFEMKSCGVKPSMFTYSSLLKACITVEVFEFAKQ 253

Query: 181 IHALICKNGLQSDEYIGSVLIDLYSLLGSMKDALSCFNSIHNLTIVPMTAMIVVYLQNGE 240
           IHALI KNGLQSDEYIGSVLID YSLLGSMKDALSCFNSIHNLTIVPMTAMIV YLQNGE
Sbjct: 254 IHALIFKNGLQSDEYIGSVLIDFYSLLGSMKDALSCFNSIHNLTIVPMTAMIVGYLQNGE 313

Query: 241 FESALALFYELLSSEEKLDEFILSTILSACGNMGMLRSGEQIQGYATKIGISRFTIFQNS 300
           FE ALALFYELL+SEEK DEFILSTIL AC NMGMLRSGEQIQGYATK GI +F IFQNS
Sbjct: 314 FEIALALFYELLASEEKPDEFILSTILGACANMGMLRSGEQIQGYATKTGILKFKIFQNS 373

Query: 301 QILMYAKSGDLYSTNLTFQQMENPDVVSWSTMIHSNAQHGHAMEALRFFELMKSCGIEPN 360
           QI MYAKSGDLYS NLTFQQMENPDVVSWSTMI S AQHGHAM+A RFFELMKS GIEPN
Sbjct: 374 QIFMYAKSGDLYSANLTFQQMENPDVVSWSTMICSAAQHGHAMKAFRFFELMKSSGIEPN 433

Query: 361 HFGFLGVLIACSHRGLVEEGLRYFDTMKKDYNMTSHVKYCTTACVVDLLGRAGRLVEAES 420
            F FLGVLIACSHRGLVEEGLRYFDTMKKDYNMTSHVK+C  ACVVDLLGRAGRLV+AES
Sbjct: 434 DFAFLGVLIACSHRGLVEEGLRYFDTMKKDYNMTSHVKHC--ACVVDLLGRAGRLVDAES 493

Query: 421 LILCSGFEHEPVMWRALLSACNVHKDTFTAQRVAEKVIELEPLASASYVLLYNIYMDAGN 480
           +ILCSGFEHEPVMWRALLSAC +HKDTFTAQRVAEKVIELEPL SASYVLL+NIYMDAGN
Sbjct: 494 IILCSGFEHEPVMWRALLSACLIHKDTFTAQRVAEKVIELEPLTSASYVLLHNIYMDAGN 553

Query: 481 KSDALKVRKLMEARRIKKEPGLSWIEVGDKVYSFVSGDRSHKNSELIYAQLQEMLAKTKS 540
           KS+ALKVRKLMEARRIKKEPGLSWIEVGD VYSFVSGDRSHKNSELIYAQL +MLAKTKS
Sbjct: 554 KSEALKVRKLMEARRIKKEPGLSWIEVGDNVYSFVSGDRSHKNSELIYAQLDDMLAKTKS 613

Query: 541 TGLVKDIFDYKIEHEYMAL 560
            GLV DIFDYK+EHEYMAL
Sbjct: 614 LGLVNDIFDYKMEHEYMAL 630

BLAST of Sgr016806 vs. NCBI nr
Match: XP_023523805.1 (pentatricopeptide repeat-containing protein At3g13880 isoform X2 [Cucurbita pepo subsp. pepo] >XP_023523806.1 pentatricopeptide repeat-containing protein At3g13880 isoform X2 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 975.7 bits (2521), Expect = 1.7e-280
Identity = 490/557 (87.97%), Postives = 522/557 (93.72%), Query Frame = 0

Query: 1   VNHARILFDYSDKLDGVSWNSLIAGYAQNGKYEELLIVLEKMHQSGLALNTYTLGSALKA 60
           V+HAR+LF++++ LDGVSWNSLIAGYAQNGKYEELL +L KMHQSGL L+TYTLGSALKA
Sbjct: 67  VDHARMLFNHANNLDGVSWNSLIAGYAQNGKYEELLTILMKMHQSGLTLSTYTLGSALKA 126

Query: 61  CSSNFNGSKKFGTMLHSLAFKLGLHLDVVVGTALLDMYAKTGSMDDAIQIFDQVLDKNVV 120
           CSSNFNGSK FGTMLH L  KLGLHLDVVVGTALLDMYAKTGS+DDAIQIFDQ++DKNVV
Sbjct: 127 CSSNFNGSKIFGTMLHGLTIKLGLHLDVVVGTALLDMYAKTGSLDDAIQIFDQMMDKNVV 186

Query: 121 MYNAMMAGLLQQETIDDKCAYKALNLFFEMKGCGIKPSMFTYSSLLKACISIEVFEFAKQ 180
           MYNAMMAGLLQQE I+DKCAYKALNLFFEMK CGIKPSMFTYSSLLKACI++E FEFAKQ
Sbjct: 187 MYNAMMAGLLQQEKIEDKCAYKALNLFFEMKSCGIKPSMFTYSSLLKACIAVEDFEFAKQ 246

Query: 181 IHALICKNGLQSDEYIGSVLIDLYSLLGSMKDALSCFNSIHNLTIVPMTAMIVVYLQNGE 240
           IHALICKNGLQSDEYIGSVLIDLY LLGS+KDA SCFNSIHNLTIVP+TAMIV YLQ GE
Sbjct: 247 IHALICKNGLQSDEYIGSVLIDLYFLLGSIKDAFSCFNSIHNLTIVPITAMIVGYLQKGE 306

Query: 241 FESALALFYELLSSEEKLDEFILSTILSACGNMGMLRSGEQIQGYATKIGISRFTIFQNS 300
           FE ALALFYELL+S+EK DEFILSTILSAC NMGMLRSGEQIQGYA+KIGISR+TIFQNS
Sbjct: 307 FERALALFYELLASKEKPDEFILSTILSACANMGMLRSGEQIQGYASKIGISRYTIFQNS 366

Query: 301 QILMYAKSGDLYSTNLTFQQMENPDVVSWSTMIHSNAQHGHAMEALRFFELMKSCGIEPN 360
           QI MYAKSGDLYS NLTFQQMENPDVVSWST+I SNAQHGHA+EALRFF+LMKSCGIEPN
Sbjct: 367 QIWMYAKSGDLYSANLTFQQMENPDVVSWSTIICSNAQHGHAIEALRFFDLMKSCGIEPN 426

Query: 361 HFGFLGVLIACSHRGLVEEGLRYFDTMKKDYNMTSHVKYCTTACVVDLLGRAGRLVEAES 420
           HF FLGVLIACSHRGLVEEGLRYFDTMKKD+ MTSHVK+C  ACVVDLLGRAGRLV+AES
Sbjct: 427 HFAFLGVLIACSHRGLVEEGLRYFDTMKKDHIMTSHVKHC--ACVVDLLGRAGRLVDAES 486

Query: 421 LILCSGFEHEPVMWRALLSACNVHKDTFTAQRVAEKVIELEPLASASYVLLYNIYMDAGN 480
           LIL  GFEHEPVMWRALLSAC +HKDTFTA+RVAEKVIELEPLASASYVLLYNIYMDAGN
Sbjct: 487 LILDLGFEHEPVMWRALLSACRIHKDTFTAKRVAEKVIELEPLASASYVLLYNIYMDAGN 546

Query: 481 KSDALKVRKLMEARRIKKEPGLSWIEVGDKVYSFVSGDRSHKNSELIYAQLQEMLAKTKS 540
           K DALKVRKLME RRIKKEPGLSWIEVGDK+YSFVSGDRSHKNSELIYA+L EMLAKTKS
Sbjct: 547 KQDALKVRKLMEDRRIKKEPGLSWIEVGDKMYSFVSGDRSHKNSELIYAKLDEMLAKTKS 606

Query: 541 TGLVKDIFDYKIEHEYM 558
             L+KD FDYKIE+E M
Sbjct: 607 LDLMKDEFDYKIEYESM 621

BLAST of Sgr016806 vs. NCBI nr
Match: XP_023523804.1 (pentatricopeptide repeat-containing protein At3g13880 isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 975.7 bits (2521), Expect = 1.7e-280
Identity = 490/557 (87.97%), Postives = 522/557 (93.72%), Query Frame = 0

Query: 1   VNHARILFDYSDKLDGVSWNSLIAGYAQNGKYEELLIVLEKMHQSGLALNTYTLGSALKA 60
           V+HAR+LF++++ LDGVSWNSLIAGYAQNGKYEELL +L KMHQSGL L+TYTLGSALKA
Sbjct: 211 VDHARMLFNHANNLDGVSWNSLIAGYAQNGKYEELLTILMKMHQSGLTLSTYTLGSALKA 270

Query: 61  CSSNFNGSKKFGTMLHSLAFKLGLHLDVVVGTALLDMYAKTGSMDDAIQIFDQVLDKNVV 120
           CSSNFNGSK FGTMLH L  KLGLHLDVVVGTALLDMYAKTGS+DDAIQIFDQ++DKNVV
Sbjct: 271 CSSNFNGSKIFGTMLHGLTIKLGLHLDVVVGTALLDMYAKTGSLDDAIQIFDQMMDKNVV 330

Query: 121 MYNAMMAGLLQQETIDDKCAYKALNLFFEMKGCGIKPSMFTYSSLLKACISIEVFEFAKQ 180
           MYNAMMAGLLQQE I+DKCAYKALNLFFEMK CGIKPSMFTYSSLLKACI++E FEFAKQ
Sbjct: 331 MYNAMMAGLLQQEKIEDKCAYKALNLFFEMKSCGIKPSMFTYSSLLKACIAVEDFEFAKQ 390

Query: 181 IHALICKNGLQSDEYIGSVLIDLYSLLGSMKDALSCFNSIHNLTIVPMTAMIVVYLQNGE 240
           IHALICKNGLQSDEYIGSVLIDLY LLGS+KDA SCFNSIHNLTIVP+TAMIV YLQ GE
Sbjct: 391 IHALICKNGLQSDEYIGSVLIDLYFLLGSIKDAFSCFNSIHNLTIVPITAMIVGYLQKGE 450

Query: 241 FESALALFYELLSSEEKLDEFILSTILSACGNMGMLRSGEQIQGYATKIGISRFTIFQNS 300
           FE ALALFYELL+S+EK DEFILSTILSAC NMGMLRSGEQIQGYA+KIGISR+TIFQNS
Sbjct: 451 FERALALFYELLASKEKPDEFILSTILSACANMGMLRSGEQIQGYASKIGISRYTIFQNS 510

Query: 301 QILMYAKSGDLYSTNLTFQQMENPDVVSWSTMIHSNAQHGHAMEALRFFELMKSCGIEPN 360
           QI MYAKSGDLYS NLTFQQMENPDVVSWST+I SNAQHGHA+EALRFF+LMKSCGIEPN
Sbjct: 511 QIWMYAKSGDLYSANLTFQQMENPDVVSWSTIICSNAQHGHAIEALRFFDLMKSCGIEPN 570

Query: 361 HFGFLGVLIACSHRGLVEEGLRYFDTMKKDYNMTSHVKYCTTACVVDLLGRAGRLVEAES 420
           HF FLGVLIACSHRGLVEEGLRYFDTMKKD+ MTSHVK+C  ACVVDLLGRAGRLV+AES
Sbjct: 571 HFAFLGVLIACSHRGLVEEGLRYFDTMKKDHIMTSHVKHC--ACVVDLLGRAGRLVDAES 630

Query: 421 LILCSGFEHEPVMWRALLSACNVHKDTFTAQRVAEKVIELEPLASASYVLLYNIYMDAGN 480
           LIL  GFEHEPVMWRALLSAC +HKDTFTA+RVAEKVIELEPLASASYVLLYNIYMDAGN
Sbjct: 631 LILDLGFEHEPVMWRALLSACRIHKDTFTAKRVAEKVIELEPLASASYVLLYNIYMDAGN 690

Query: 481 KSDALKVRKLMEARRIKKEPGLSWIEVGDKVYSFVSGDRSHKNSELIYAQLQEMLAKTKS 540
           K DALKVRKLME RRIKKEPGLSWIEVGDK+YSFVSGDRSHKNSELIYA+L EMLAKTKS
Sbjct: 691 KQDALKVRKLMEDRRIKKEPGLSWIEVGDKMYSFVSGDRSHKNSELIYAKLDEMLAKTKS 750

Query: 541 TGLVKDIFDYKIEHEYM 558
             L+KD FDYKIE+E M
Sbjct: 751 LDLMKDEFDYKIEYESM 765

BLAST of Sgr016806 vs. NCBI nr
Match: XP_022932499.1 (pentatricopeptide repeat-containing protein At3g13880 isoform X2 [Cucurbita moschata])

HSP 1 Score: 975.3 bits (2520), Expect = 2.2e-280
Identity = 489/557 (87.79%), Postives = 521/557 (93.54%), Query Frame = 0

Query: 1   VNHARILFDYSDKLDGVSWNSLIAGYAQNGKYEELLIVLEKMHQSGLALNTYTLGSALKA 60
           V+HAR+LFD+++ LDGVSWNSLIAGYAQNGKYEELL ++ KMHQSGL LNTYTLGSALKA
Sbjct: 67  VDHARMLFDHANNLDGVSWNSLIAGYAQNGKYEELLTIMMKMHQSGLTLNTYTLGSALKA 126

Query: 61  CSSNFNGSKKFGTMLHSLAFKLGLHLDVVVGTALLDMYAKTGSMDDAIQIFDQVLDKNVV 120
           CSSNFNGSK FGTMLH L  KLGLHLDVVVGTALLDMYAKTGS+DDAIQIFDQ++DKNVV
Sbjct: 127 CSSNFNGSKIFGTMLHGLTIKLGLHLDVVVGTALLDMYAKTGSLDDAIQIFDQMVDKNVV 186

Query: 121 MYNAMMAGLLQQETIDDKCAYKALNLFFEMKGCGIKPSMFTYSSLLKACISIEVFEFAKQ 180
           MYNAMMAGLLQQE I+DKCAYKALNLFFEMK CGIKPSMFTYSSLLKACI++E FEFAKQ
Sbjct: 187 MYNAMMAGLLQQEKIEDKCAYKALNLFFEMKSCGIKPSMFTYSSLLKACIAVEDFEFAKQ 246

Query: 181 IHALICKNGLQSDEYIGSVLIDLYSLLGSMKDALSCFNSIHNLTIVPMTAMIVVYLQNGE 240
           IHALICKNGLQSDEYIGSVLIDLY LLGS+KDA SCFNSIHNLTIVP+TAMIV YLQ GE
Sbjct: 247 IHALICKNGLQSDEYIGSVLIDLYFLLGSIKDAFSCFNSIHNLTIVPITAMIVGYLQKGE 306

Query: 241 FESALALFYELLSSEEKLDEFILSTILSACGNMGMLRSGEQIQGYATKIGISRFTIFQNS 300
           FE ALALFYELL+S+EK DEFILSTILSAC NMGMLRSGEQIQGYA KIGISR+TIFQNS
Sbjct: 307 FERALALFYELLASKEKPDEFILSTILSACANMGMLRSGEQIQGYANKIGISRYTIFQNS 366

Query: 301 QILMYAKSGDLYSTNLTFQQMENPDVVSWSTMIHSNAQHGHAMEALRFFELMKSCGIEPN 360
           QI MYAKSGDLYS NLTFQQMENPDVVSWST+I SNAQHGHA+EALRFF+LMKSCGIEPN
Sbjct: 367 QIWMYAKSGDLYSANLTFQQMENPDVVSWSTIICSNAQHGHAIEALRFFDLMKSCGIEPN 426

Query: 361 HFGFLGVLIACSHRGLVEEGLRYFDTMKKDYNMTSHVKYCTTACVVDLLGRAGRLVEAES 420
           HF FLGVLIACSHRGLVEEGLRYFDTMKKD+ MTSHVK+C  ACVVDLLGRAGRLV+AES
Sbjct: 427 HFAFLGVLIACSHRGLVEEGLRYFDTMKKDHIMTSHVKHC--ACVVDLLGRAGRLVDAES 486

Query: 421 LILCSGFEHEPVMWRALLSACNVHKDTFTAQRVAEKVIELEPLASASYVLLYNIYMDAGN 480
           LIL  GFEHEPVMWRALLSAC +HKDTFTA+RVAEKVIELEPLASASYVLLYNIYMDAGN
Sbjct: 487 LILDLGFEHEPVMWRALLSACRIHKDTFTAKRVAEKVIELEPLASASYVLLYNIYMDAGN 546

Query: 481 KSDALKVRKLMEARRIKKEPGLSWIEVGDKVYSFVSGDRSHKNSELIYAQLQEMLAKTKS 540
           K DALKVRKLME RRIKKEPGLSWI+VGD++YSFVSGDRSHKNSELIYA+L EMLAKTKS
Sbjct: 547 KQDALKVRKLMEDRRIKKEPGLSWIQVGDQMYSFVSGDRSHKNSELIYAKLDEMLAKTKS 606

Query: 541 TGLVKDIFDYKIEHEYM 558
             L+KD FDYKIE+E M
Sbjct: 607 LDLMKDEFDYKIEYESM 621

BLAST of Sgr016806 vs. ExPASy Swiss-Prot
Match: Q9LRV9 (Pentatricopeptide repeat-containing protein At3g13880 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E89 PE=2 SV=1)

HSP 1 Score: 538.5 bits (1386), Expect = 9.0e-152
Identity = 280/535 (52.34%), Postives = 369/535 (68.97%), Query Frame = 0

Query: 1   VNHARILFDYSDKLDGVSWNSLIAGYAQNGKYEELLIVLEKMHQSGLALNTYTLGSALKA 60
           ++ A  LFD  D+ D VSWNSLI+GY + G  EE L +L KMH+ GL L TY LGS LKA
Sbjct: 199 LDQAMSLFDRCDERDQVSWNSLISGYVRVGAAEEPLNLLAKMHRDGLNLTTYALGSVLKA 258

Query: 61  CSSNFN-GSKKFGTMLHSLAFKLGLHLDVVVGTALLDMYAKTGSMDDAIQIFDQVLDKNV 120
           C  N N G  + G  +H    KLG+  D+VV TALLDMYAK GS+ +AI++F  +  KNV
Sbjct: 259 CCINLNEGFIEKGMAIHCYTAKLGMEFDIVVRTALLDMYAKNGSLKEAIKLFSLMPSKNV 318

Query: 121 VMYNAMMAGLLQQETIDDKCAYKALNLFFEMKGCGIKPSMFTYSSLLKACISIEVFEFAK 180
           V YNAM++G LQ + I D+ + +A  LF +M+  G++PS  T+S +LKAC + +  E+ +
Sbjct: 319 VTYNAMISGFLQMDEITDEASSEAFKLFMDMQRRGLEPSPSTFSVVLKACSAAKTLEYGR 378

Query: 181 QIHALICKNGLQSDEYIGSVLIDLYSLLGSMKDALSCFNSIHNLTIVPMTAMIVVYLQNG 240
           QIHALICKN  QSDE+IGS LI+LY+L+GS +D + CF S     I   T+MI  ++QN 
Sbjct: 379 QIHALICKNNFQSDEFIGSALIELYALMGSTEDGMQCFASTSKQDIASWTSMIDCHVQNE 438

Query: 241 EFESALALFYELLSSEEKLDEFILSTILSACGNMGMLRSGEQIQGYATKIGISRFTIFQN 300
           + ESA  LF +L SS  + +E+ +S ++SAC +   L SGEQIQGYA K GI  FT  + 
Sbjct: 439 QLESAFDLFRQLFSSHIRPEEYTVSLMMSACADFAALSSGEQIQGYAIKSGIDAFTSVKT 498

Query: 301 SQILMYAKSGDLYSTNLTFQQMENPDVVSWSTMIHSNAQHGHAMEALRFFELMKSCGIEP 360
           S I MYAKSG++   N  F +++NPDV ++S MI S AQHG A EAL  FE MK+ GI+P
Sbjct: 499 SSISMYAKSGNMPLANQVFIEVQNPDVATYSAMISSLAQHGSANEALNIFESMKTHGIKP 558

Query: 361 NHFGFLGVLIACSHRGLVEEGLRYFDTMKKDYNMTSHVKYCTTACVVDLLGRAGRLVEAE 420
           N   FLGVLIAC H GLV +GL+YF  MK DY +  + K+ T  C+VDLLGR GRL +AE
Sbjct: 559 NQQAFLGVLIACCHGGLVTQGLKYFQCMKNDYRINPNEKHFT--CLVDLLGRTGRLSDAE 618

Query: 421 SLILCSGFEHEPVMWRALLSACNVHKDTFTAQRVAEKVIELEPLASASYVLLYNIYMDAG 480
           +LIL SGF+  PV WRALLS+C V+KD+   +RVAE+++ELEP AS SYVLL+NIY D+G
Sbjct: 619 NLILSSGFQDHPVTWRALLSSCRVYKDSVIGKRVAERLMELEPEASGSYVLLHNIYNDSG 678

Query: 481 NKSDALKVRKLMEARRIKKEPGLSWIEVGDKVYSFVSGDRSHKNSELIYAQLQEM 535
             S A +VR+LM  R +KKEP LSWI +G++ +SF   D SH +S++IY  L+ M
Sbjct: 679 VNSSAEEVRELMRDRGVKKEPALSWIVIGNQTHSFAVADLSHPSSQMIYTMLETM 731

BLAST of Sgr016806 vs. ExPASy Swiss-Prot
Match: Q9ZUW3 (Pentatricopeptide repeat-containing protein At2g27610 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H60 PE=2 SV=1)

HSP 1 Score: 354.4 bits (908), Expect = 2.4e-96
Identity = 205/548 (37.41%), Postives = 314/548 (57.30%), Query Frame = 0

Query: 1   VNHARILFDYSDKLDGVSWNSLIAGYAQNGKYEELLIVLEKMHQSGLALNTYTLGSALKA 60
           V  ARILFD ++    V+WNS+I+GYA NG   E L +   M  + + L+  +  S +K 
Sbjct: 245 VRKARILFDKTEVKSVVTWNSMISGYAANGLDLEALGMFYSMRLNYVRLSESSFASVIKL 304

Query: 61  CSSNFNGSKKFGTMLHSLAFKLGLHLDVVVGTALLDMYAKTGSMDDAIQIFDQV-LDKNV 120
           C++      +F   LH    K G   D  + TAL+  Y+K  +M DA+++F ++    NV
Sbjct: 305 CAN--LKELRFTEQLHCSVVKYGFLFDQNIRTALMVAYSKCTAMLDALRLFKEIGCVGNV 364

Query: 121 VMYNAMMAGLLQQETIDDKCAYKALNLFFEMKGCGIKPSMFTYSSLLKACISIEVFEFAK 180
           V + AM++G LQ +  ++     A++LF EMK  G++P+ FTYS +L A   I       
Sbjct: 365 VSWTAMISGFLQNDGKEE-----AVDLFSEMKRKGVRPNEFTYSVILTALPVIS----PS 424

Query: 181 QIHALICKNGLQSDEYIGSVLIDLYSLLGSMKDALSCFNSIHNLTIVPMTAMIVVYLQNG 240
           ++HA + K   +    +G+ L+D Y  LG +++A   F+ I +  IV  +AM+  Y Q G
Sbjct: 425 EVHAQVVKTNYERSSTVGTALLDAYVKLGKVEEAAKVFSGIDDKDIVAWSAMLAGYAQTG 484

Query: 241 EFESALALFYELLSSEEKLDEFILSTILSAC-GNMGMLRSGEQIQGYATKIGISRFTIFQ 300
           E E+A+ +F EL     K +EF  S+IL+ C      +  G+Q  G+A K  +       
Sbjct: 485 ETEAAIKMFGELTKGGIKPNEFTFSSILNVCAATNASMGQGKQFHGFAIKSRLDSSLCVS 544

Query: 301 NSQILMYAKSGDLYSTNLTFQQMENPDVVSWSTMIHSNAQHGHAMEALRFFELMKSCGIE 360
           ++ + MYAK G++ S    F++    D+VSW++MI   AQHG AM+AL  F+ MK   ++
Sbjct: 545 SALLTMYAKKGNIESAEEVFKRQREKDLVSWNSMISGYAQHGQAMKALDVFKEMKKRKVK 604

Query: 361 PNHFGFLGVLIACSHRGLVEEGLRYFDTMKKDYNMTSHVKYCTTACVVDLLGRAGRLVEA 420
            +   F+GV  AC+H GLVEEG +YFD M +D  +    ++   +C+VDL  RAG+L +A
Sbjct: 605 MDGVTFIGVFAACTHAGLVEEGEKYFDIMVRDCKIAPTKEH--NSCMVDLYSRAGQLEKA 664

Query: 421 ESLILCSGFEHEPVMWRALLSACNVHKDTFTAQRVAEKVIELEPLASASYVLLYNIYMDA 480
             +I          +WR +L+AC VHK T   +  AEK+I ++P  SA+YVLL N+Y ++
Sbjct: 665 MKVIENMPNPAGSTIWRTILAACRVHKKTELGRLAAEKIIAMKPEDSAAYVLLSNMYAES 724

Query: 481 GNKSDALKVRKLMEARRIKKEPGLSWIEVGDKVYSFVSGDRSHKNSELIYAQLQEMLAKT 540
           G+  +  KVRKLM  R +KKEPG SWIEV +K YSF++GDRSH   + IY +L+++  + 
Sbjct: 725 GDWQERAKVRKLMNERNVKKEPGYSWIEVKNKTYSFLAGDRSHPLKDQIYMKLEDLSTRL 779

Query: 541 KSTGLVKD 547
           K  G   D
Sbjct: 785 KDLGYEPD 779

BLAST of Sgr016806 vs. ExPASy Swiss-Prot
Match: Q9SMZ2 (Pentatricopeptide repeat-containing protein At4g33170 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H53 PE=3 SV=1)

HSP 1 Score: 332.4 bits (851), Expect = 9.8e-90
Identity = 198/543 (36.46%), Postives = 300/543 (55.25%), Query Frame = 0

Query: 4   ARILFDYSDKLDGVSWNSLIAGYAQNGKYEELLIVLEKMHQSGLALNTYTLGSALKACSS 63
           AR +FD   + D +SWNS+IAG AQNG   E + +  ++ + GL  + YT+ S LKA SS
Sbjct: 369 ARTVFDNMSERDLISWNSVIAGIAQNGLEVEAVCLFMQLLRCGLKPDQYTMTSVLKAASS 428

Query: 64  NFNGSKKFGTMLHSLAFKLGLHLDVVVGTALLDMYAKTGSMDDAIQIFDQVLDKNVVMYN 123
              G       +H  A K+    D  V TAL+D Y++   M +A  +F++  + ++V +N
Sbjct: 429 LPEG-LSLSKQVHVHAIKINNVSDSFVSTALIDAYSRNRCMKEAEILFER-HNFDLVAWN 488

Query: 124 AMMAGLLQQETIDDKCAYKALNLFFEMKGCGIKPSMFTYSSLLKACISIEVFEFAKQIHA 183
           AMMAG  Q         +K L LF  M   G +   FT +++ K C  +      KQ+HA
Sbjct: 489 AMMAGYTQSHD-----GHKTLKLFALMHKQGERSDDFTLATVFKTCGFLFAINQGKQVHA 548

Query: 184 LICKNGLQSDEYIGSVLIDLYSLLGSMKDALSCFNSIHNLTIVPMTAMIVVYLQNGEFES 243
              K+G   D ++ S ++D+Y   G M  A   F+SI     V  T MI   ++NGE E 
Sbjct: 549 YAIKSGYDLDLWVSSGILDMYVKCGDMSAAQFAFDSIPVPDDVAWTTMISGCIENGEEER 608

Query: 244 ALALFYELLSSEEKLDEFILSTILSACGNMGMLRSGEQIQGYATKIGISRFTIFQNSQIL 303
           A  +F ++       DEF ++T+  A   +  L  G QI   A K+  +       S + 
Sbjct: 609 AFHVFSQMRLMGVLPDEFTIATLAKASSCLTALEQGRQIHANALKLNCTNDPFVGTSLVD 668

Query: 304 MYAKSGDLYSTNLTFQQMENPDVVSWSTMIHSNAQHGHAMEALRFFELMKSCGIEPNHFG 363
           MYAK G +      F+++E  ++ +W+ M+   AQHG   E L+ F+ MKS GI+P+   
Sbjct: 669 MYAKCGSIDDAYCLFKRIEMMNITAWNAMLVGLAQHGEGKETLQLFKQMKSLGIKPDKVT 728

Query: 364 FLGVLIACSHRGLVEEGLRYFDTMKKDYNMTSHVKYCTTACVVDLLGRAGRLVEAESLIL 423
           F+GVL ACSH GLV E  ++  +M  DY +   +++   +C+ D LGRAG + +AE+LI 
Sbjct: 729 FIGVLSACSHSGLVSEAYKHMRSMHGDYGIKPEIEH--YSCLADALGRAGLVKQAENLIE 788

Query: 424 CSGFEHEPVMWRALLSACNVHKDTFTAQRVAEKVIELEPLASASYVLLYNIYMDAGNKSD 483
               E    M+R LL+AC V  DT T +RVA K++ELEPL S++YVLL N+Y  A    +
Sbjct: 789 SMSMEASASMYRTLLAACRVQGDTETGKRVATKLLELEPLDSSAYVLLSNMYAAASKWDE 848

Query: 484 ALKVRKLMEARRIKKEPGLSWIEVGDKVYSFVSGDRSHKNSELIYAQLQEMLAKTKSTGL 543
               R +M+  ++KK+PG SWIEV +K++ FV  DRS++ +ELIY ++++M+   K  G 
Sbjct: 849 MKLARTMMKGHKVKKDPGFSWIEVKNKIHIFVVDDRSNRQTELIYRKVKDMIRDIKQEGY 902

Query: 544 VKD 547
           V +
Sbjct: 909 VPE 902

BLAST of Sgr016806 vs. ExPASy Swiss-Prot
Match: Q9SVA5 (Pentatricopeptide repeat-containing protein At4g39530 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E52 PE=3 SV=1)

HSP 1 Score: 326.2 bits (835), Expect = 7.0e-88
Identity = 195/536 (36.38%), Postives = 300/536 (55.97%), Query Frame = 0

Query: 4   ARILFDYSDKLDGVSWNSLIAGYAQNGKYEELLIVLEKMHQSGLALNTYTLGSALKACSS 63
           A  LF+     + +SW +L++GY QN  ++E + +   M + GL  + Y   S L +C+S
Sbjct: 303 AHKLFNGMPNKNIISWTTLLSGYKQNALHKEAMELFTSMSKFGLKPDMYACSSILTSCAS 362

Query: 64  NFNGSKKFGTMLHSLAFKLGLHLDVVVGTALLDMYAKTGSMDDAIQIFDQVLDKNVVMYN 123
               +  FGT +H+   K  L  D  V  +L+DMYAK   + DA ++FD     +VV++N
Sbjct: 363 LH--ALGFGTQVHAYTIKANLGNDSYVTNSLIDMYAKCDCLTDARKVFDIFAAADVVLFN 422

Query: 124 AMMAGLLQQETIDDKCAYKALNLFFEMKGCGIKPSMFTYSSLLKACISIEVFEFAKQIHA 183
           AM+ G  +  T  +   ++ALN+F +M+   I+PS+ T+ SLL+A  S+     +KQIH 
Sbjct: 423 AMIEGYSRLGTQWE--LHEALNIFRDMRFRLIRPSLLTFVSLLRASASLTSLGLSKQIHG 482

Query: 184 LICKNGLQSDEYIGSVLIDLYSLLGSMKDALSCFNSIHNLTIVPMTAMIVVYLQNGEFES 243
           L+ K GL  D + GS LID+YS    +KD+   F+ +    +V   +M   Y+Q  E E 
Sbjct: 483 LMFKYGLNLDIFAGSALIDVYSNCYCLKDSRLVFDEMKVKDLVIWNSMFAGYVQQSENEE 542

Query: 244 ALALFYELLSSEEKLDEFILSTILSACGNMGMLRSGEQIQGYATKIGISRFTIFQNSQIL 303
           AL LF EL  S E+ DEF  + +++A GN+  ++ G++      K G+       N+ + 
Sbjct: 543 ALNLFLELQLSRERPDEFTFANMVTAAGNLASVQLGQEFHCQLLKRGLECNPYITNALLD 602

Query: 304 MYAKSGDLYSTNLTFQQMENPDVVSWSTMIHSNAQHGHAMEALRFFELMKSCGIEPNHFG 363
           MYAK G     +  F    + DVV W+++I S A HG   +AL+  E M S GIEPN+  
Sbjct: 603 MYAKCGSPEDAHKAFDSAASRDVVCWNSVISSYANHGEGKKALQMLEKMMSEGIEPNYIT 662

Query: 364 FLGVLIACSHRGLVEEGLRYFDTMKKDYNMTSHVKYCTTACVVDLLGRAGRLVEAESLIL 423
           F+GVL ACSH GLVE+GL+ F+ M + + +    ++    C+V LLGRAGRL +A  LI 
Sbjct: 663 FVGVLSACSHAGLVEDGLKQFELMLR-FGIEPETEH--YVCMVSLLGRAGRLNKARELIE 722

Query: 424 CSGFEHEPVMWRALLSACNVHKDTFTAQRVAEKVIELEPLASASYVLLYNIYMDAGNKSD 483
               +   ++WR+LLS C    +   A+  AE  I  +P  S S+ +L NIY   G  ++
Sbjct: 723 KMPTKPAAIVWRSLLSGCAKAGNVELAEHAAEMAILSDPKDSGSFTMLSNIYASKGMWTE 782

Query: 484 ALKVRKLMEARRIKKEPGLSWIEVGDKVYSFVSGDRSHKNSELIYAQLQEMLAKTK 540
           A KVR+ M+   + KEPG SWI +  +V+ F+S D+SH  +  IY  L ++L + +
Sbjct: 783 AKKVRERMKVEGVVKEPGRSWIGINKEVHIFLSKDKSHCKANQIYEVLDDLLVQIR 831

BLAST of Sgr016806 vs. ExPASy Swiss-Prot
Match: Q5G1T1 (Pentatricopeptide repeat-containing protein At3g49170, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=EMB2261 PE=2 SV=1)

HSP 1 Score: 322.4 bits (825), Expect = 1.0e-86
Identity = 195/555 (35.14%), Postives = 305/555 (54.95%), Query Frame = 0

Query: 7   LFDYSDKLDGVSWNSLIAGYAQNGKYEELLIVLEKMHQSGLALNTYTLGSALKACSSNFN 66
           +FD   +L+ V+W  +I    Q G   E +     M  SG   + +TL S   AC+   N
Sbjct: 225 VFDKMSELNVVTWTLMITRCMQMGFPREAIRFFLDMVLSGFESDKFTLSSVFSACAELEN 284

Query: 67  GSKKFGTMLHSLAFKLGLHLDVVVGTALLDMYAK---TGSMDDAIQIFDQVLDKNVVMYN 126
            S   G  LHS A + GL  D  V  +L+DMYAK    GS+DD  ++FD++ D +V+ + 
Sbjct: 285 LS--LGKQLHSWAIRSGLVDD--VECSLVDMYAKCSADGSVDDCRKVFDRMEDHSVMSWT 344

Query: 127 AMMAGLLQQETIDDKCAYKALNLFFEMKGCG-IKPSMFTYSSLLKACISIEVFEFAKQIH 186
           A++ G ++   +    A +A+NLF EM   G ++P+ FT+SS  KAC ++      KQ+ 
Sbjct: 345 ALITGYMKNCNL----ATEAINLFSEMITQGHVEPNHFTFSSAFKACGNLSDPRVGKQVL 404

Query: 187 ALICKNGLQSDEYIGSVLIDLYSLLGSMKDALSCFNSIHNLTIVPMTAMIVVYLQNGEFE 246
               K GL S+  + + +I ++     M+DA   F S+    +V     +    +N  FE
Sbjct: 405 GQAFKRGLASNSSVANSVISMFVKSDRMEDAQRAFESLSEKNLVSYNTFLDGTCRNLNFE 464

Query: 247 SALALFYELLSSEEKLDEFILSTILSACGNMGMLRSGEQIQGYATKIGISRFTIFQNSQI 306
            A  L  E+   E  +  F  +++LS   N+G +R GEQI     K+G+S      N+ I
Sbjct: 465 QAFKLLSEITERELGVSAFTFASLLSGVANVGSIRKGEQIHSQVVKLGLSCNQPVCNALI 524

Query: 307 LMYAKSGDLYSTNLTFQQMENPDVVSWSTMIHSNAQHGHAMEALRFFELMKSCGIEPNHF 366
            MY+K G + + +  F  MEN +V+SW++MI   A+HG A+  L  F  M   G++PN  
Sbjct: 525 SMYSKCGSIDTASRVFNFMENRNVISWTSMITGFAKHGFAIRVLETFNQMIEEGVKPNEV 584

Query: 367 GFLGVLIACSHRGLVEEGLRYFDTMKKDYNMTSHVKYCTTACVVDLLGRAGRLVEAESLI 426
            ++ +L ACSH GLV EG R+F++M +D+ +   +++   AC+VDLL RAG L +A   I
Sbjct: 585 TYVAILSACSHVGLVSEGWRHFNSMYEDHKIKPKMEH--YACMVDLLCRAGLLTDAFEFI 644

Query: 427 LCSGFEHEPVMWRALLSACNVHKDTFTAQRVAEKVIELEPLASASYVLLYNIYMDAGNKS 486
               F+ + ++WR  L AC VH +T   +  A K++EL+P   A+Y+ L NIY  AG   
Sbjct: 645 NTMPFQADVLVWRTFLGACRVHSNTELGKLAARKILELDPNEPAAYIQLSNIYACAGKWE 704

Query: 487 DALKVRKLMEARRIKKEPGLSWIEVGDKVYSFVSGDRSHKNSELIYAQLQEMLAKTKSTG 546
           ++ ++R+ M+ R + KE G SWIEVGDK++ F  GD +H N+  IY +L  ++ + K  G
Sbjct: 705 ESTEMRRKMKERNLVKEGGCSWIEVGDKIHKFYVGDTAHPNAHQIYDELDRLITEIKRCG 764

Query: 547 LV--KDIFDYKIEHE 556
            V   D+  +K+E E
Sbjct: 765 YVPDTDLVLHKLEEE 769

BLAST of Sgr016806 vs. ExPASy TrEMBL
Match: A0A6J1CL46 (pentatricopeptide repeat-containing protein At3g13880 isoform X2 OS=Momordica charantia OX=3673 GN=LOC111012143 PE=4 SV=1)

HSP 1 Score: 1013.1 bits (2618), Expect = 4.6e-292
Identity = 510/559 (91.23%), Postives = 529/559 (94.63%), Query Frame = 0

Query: 1   VNHARILFDYSDKLDGVSWNSLIAGYAQNGKYEELLIVLEKMHQSGLALNTYTLGSALKA 60
           V+HARILFDYSD LDGVSWNSLIAGYAQNGKYEELLI+LEKMHQSGLALNTYTLGSALKA
Sbjct: 74  VDHARILFDYSDNLDGVSWNSLIAGYAQNGKYEELLIILEKMHQSGLALNTYTLGSALKA 133

Query: 61  CSSNFNGSKKFGTMLHSLAFKLGLHLDVVVGTALLDMYAKTGSMDDAIQIFDQVLDKNVV 120
           CSSNFNGSK+FGTMLHSLA KLGLHLDVVVGTALLDMYAKTGS+DDAIQIFDQVLDKNVV
Sbjct: 134 CSSNFNGSKQFGTMLHSLAIKLGLHLDVVVGTALLDMYAKTGSLDDAIQIFDQVLDKNVV 193

Query: 121 MYNAMMAGLLQQETIDDKCAYKALNLFFEMKGCGIKPSMFTYSSLLKACISIEVFEFAKQ 180
           MYNAMMAGLLQQETI+DKCAYKAL LFFEMK CG+KPSMFTYSSLLKACI++EVFEFAKQ
Sbjct: 194 MYNAMMAGLLQQETIEDKCAYKALGLFFEMKSCGVKPSMFTYSSLLKACITVEVFEFAKQ 253

Query: 181 IHALICKNGLQSDEYIGSVLIDLYSLLGSMKDALSCFNSIHNLTIVPMTAMIVVYLQNGE 240
           IHALI KNGLQSDEYIGSVLID YSLLGSMKDALSCFNSIHNLTIVPMTAMIV YLQNGE
Sbjct: 254 IHALIFKNGLQSDEYIGSVLIDFYSLLGSMKDALSCFNSIHNLTIVPMTAMIVGYLQNGE 313

Query: 241 FESALALFYELLSSEEKLDEFILSTILSACGNMGMLRSGEQIQGYATKIGISRFTIFQNS 300
           FE ALALFYELL+SEEK DEFILSTIL AC NMGMLRSGEQIQGYATK GI +F IFQNS
Sbjct: 314 FEIALALFYELLASEEKPDEFILSTILGACANMGMLRSGEQIQGYATKTGILKFKIFQNS 373

Query: 301 QILMYAKSGDLYSTNLTFQQMENPDVVSWSTMIHSNAQHGHAMEALRFFELMKSCGIEPN 360
           QI MYAKSGDLYS NLTFQQMENPDVVSWSTMI S AQHGHAM+A RFFELMKS GIEPN
Sbjct: 374 QIFMYAKSGDLYSANLTFQQMENPDVVSWSTMICSAAQHGHAMKAFRFFELMKSSGIEPN 433

Query: 361 HFGFLGVLIACSHRGLVEEGLRYFDTMKKDYNMTSHVKYCTTACVVDLLGRAGRLVEAES 420
            F FLGVLIACSHRGLVEEGLRYFDTMKKDYNMTSHVK+C  ACVVDLLGRAGRLV+AES
Sbjct: 434 DFAFLGVLIACSHRGLVEEGLRYFDTMKKDYNMTSHVKHC--ACVVDLLGRAGRLVDAES 493

Query: 421 LILCSGFEHEPVMWRALLSACNVHKDTFTAQRVAEKVIELEPLASASYVLLYNIYMDAGN 480
           +ILCSGFEHEPVMWRALLSAC +HKDTFTAQRVAEKVIELEPL SASYVLL+NIYMDAGN
Sbjct: 494 IILCSGFEHEPVMWRALLSACLIHKDTFTAQRVAEKVIELEPLTSASYVLLHNIYMDAGN 553

Query: 481 KSDALKVRKLMEARRIKKEPGLSWIEVGDKVYSFVSGDRSHKNSELIYAQLQEMLAKTKS 540
           KS+ALKVRKLMEARRIKKEPGLSWIEVGD VYSFVSGDRSHKNSELIYAQL +MLAKTKS
Sbjct: 554 KSEALKVRKLMEARRIKKEPGLSWIEVGDNVYSFVSGDRSHKNSELIYAQLDDMLAKTKS 613

Query: 541 TGLVKDIFDYKIEHEYMAL 560
            GLV DIFDYK+EHEYMAL
Sbjct: 614 LGLVNDIFDYKMEHEYMAL 630

BLAST of Sgr016806 vs. ExPASy TrEMBL
Match: A0A6J1CJD1 (pentatricopeptide repeat-containing protein At3g13880 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111012143 PE=4 SV=1)

HSP 1 Score: 1013.1 bits (2618), Expect = 4.6e-292
Identity = 510/559 (91.23%), Postives = 529/559 (94.63%), Query Frame = 0

Query: 1   VNHARILFDYSDKLDGVSWNSLIAGYAQNGKYEELLIVLEKMHQSGLALNTYTLGSALKA 60
           V+HARILFDYSD LDGVSWNSLIAGYAQNGKYEELLI+LEKMHQSGLALNTYTLGSALKA
Sbjct: 211 VDHARILFDYSDNLDGVSWNSLIAGYAQNGKYEELLIILEKMHQSGLALNTYTLGSALKA 270

Query: 61  CSSNFNGSKKFGTMLHSLAFKLGLHLDVVVGTALLDMYAKTGSMDDAIQIFDQVLDKNVV 120
           CSSNFNGSK+FGTMLHSLA KLGLHLDVVVGTALLDMYAKTGS+DDAIQIFDQVLDKNVV
Sbjct: 271 CSSNFNGSKQFGTMLHSLAIKLGLHLDVVVGTALLDMYAKTGSLDDAIQIFDQVLDKNVV 330

Query: 121 MYNAMMAGLLQQETIDDKCAYKALNLFFEMKGCGIKPSMFTYSSLLKACISIEVFEFAKQ 180
           MYNAMMAGLLQQETI+DKCAYKAL LFFEMK CG+KPSMFTYSSLLKACI++EVFEFAKQ
Sbjct: 331 MYNAMMAGLLQQETIEDKCAYKALGLFFEMKSCGVKPSMFTYSSLLKACITVEVFEFAKQ 390

Query: 181 IHALICKNGLQSDEYIGSVLIDLYSLLGSMKDALSCFNSIHNLTIVPMTAMIVVYLQNGE 240
           IHALI KNGLQSDEYIGSVLID YSLLGSMKDALSCFNSIHNLTIVPMTAMIV YLQNGE
Sbjct: 391 IHALIFKNGLQSDEYIGSVLIDFYSLLGSMKDALSCFNSIHNLTIVPMTAMIVGYLQNGE 450

Query: 241 FESALALFYELLSSEEKLDEFILSTILSACGNMGMLRSGEQIQGYATKIGISRFTIFQNS 300
           FE ALALFYELL+SEEK DEFILSTIL AC NMGMLRSGEQIQGYATK GI +F IFQNS
Sbjct: 451 FEIALALFYELLASEEKPDEFILSTILGACANMGMLRSGEQIQGYATKTGILKFKIFQNS 510

Query: 301 QILMYAKSGDLYSTNLTFQQMENPDVVSWSTMIHSNAQHGHAMEALRFFELMKSCGIEPN 360
           QI MYAKSGDLYS NLTFQQMENPDVVSWSTMI S AQHGHAM+A RFFELMKS GIEPN
Sbjct: 511 QIFMYAKSGDLYSANLTFQQMENPDVVSWSTMICSAAQHGHAMKAFRFFELMKSSGIEPN 570

Query: 361 HFGFLGVLIACSHRGLVEEGLRYFDTMKKDYNMTSHVKYCTTACVVDLLGRAGRLVEAES 420
            F FLGVLIACSHRGLVEEGLRYFDTMKKDYNMTSHVK+C  ACVVDLLGRAGRLV+AES
Sbjct: 571 DFAFLGVLIACSHRGLVEEGLRYFDTMKKDYNMTSHVKHC--ACVVDLLGRAGRLVDAES 630

Query: 421 LILCSGFEHEPVMWRALLSACNVHKDTFTAQRVAEKVIELEPLASASYVLLYNIYMDAGN 480
           +ILCSGFEHEPVMWRALLSAC +HKDTFTAQRVAEKVIELEPL SASYVLL+NIYMDAGN
Sbjct: 631 IILCSGFEHEPVMWRALLSACLIHKDTFTAQRVAEKVIELEPLTSASYVLLHNIYMDAGN 690

Query: 481 KSDALKVRKLMEARRIKKEPGLSWIEVGDKVYSFVSGDRSHKNSELIYAQLQEMLAKTKS 540
           KS+ALKVRKLMEARRIKKEPGLSWIEVGD VYSFVSGDRSHKNSELIYAQL +MLAKTKS
Sbjct: 691 KSEALKVRKLMEARRIKKEPGLSWIEVGDNVYSFVSGDRSHKNSELIYAQLDDMLAKTKS 750

Query: 541 TGLVKDIFDYKIEHEYMAL 560
            GLV DIFDYK+EHEYMAL
Sbjct: 751 LGLVNDIFDYKMEHEYMAL 767

BLAST of Sgr016806 vs. ExPASy TrEMBL
Match: A0A6J1EX55 (pentatricopeptide repeat-containing protein At3g13880 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111438902 PE=3 SV=1)

HSP 1 Score: 975.3 bits (2520), Expect = 1.1e-280
Identity = 489/557 (87.79%), Postives = 521/557 (93.54%), Query Frame = 0

Query: 1   VNHARILFDYSDKLDGVSWNSLIAGYAQNGKYEELLIVLEKMHQSGLALNTYTLGSALKA 60
           V+HAR+LFD+++ LDGVSWNSLIAGYAQNGKYEELL ++ KMHQSGL LNTYTLGSALKA
Sbjct: 67  VDHARMLFDHANNLDGVSWNSLIAGYAQNGKYEELLTIMMKMHQSGLTLNTYTLGSALKA 126

Query: 61  CSSNFNGSKKFGTMLHSLAFKLGLHLDVVVGTALLDMYAKTGSMDDAIQIFDQVLDKNVV 120
           CSSNFNGSK FGTMLH L  KLGLHLDVVVGTALLDMYAKTGS+DDAIQIFDQ++DKNVV
Sbjct: 127 CSSNFNGSKIFGTMLHGLTIKLGLHLDVVVGTALLDMYAKTGSLDDAIQIFDQMVDKNVV 186

Query: 121 MYNAMMAGLLQQETIDDKCAYKALNLFFEMKGCGIKPSMFTYSSLLKACISIEVFEFAKQ 180
           MYNAMMAGLLQQE I+DKCAYKALNLFFEMK CGIKPSMFTYSSLLKACI++E FEFAKQ
Sbjct: 187 MYNAMMAGLLQQEKIEDKCAYKALNLFFEMKSCGIKPSMFTYSSLLKACIAVEDFEFAKQ 246

Query: 181 IHALICKNGLQSDEYIGSVLIDLYSLLGSMKDALSCFNSIHNLTIVPMTAMIVVYLQNGE 240
           IHALICKNGLQSDEYIGSVLIDLY LLGS+KDA SCFNSIHNLTIVP+TAMIV YLQ GE
Sbjct: 247 IHALICKNGLQSDEYIGSVLIDLYFLLGSIKDAFSCFNSIHNLTIVPITAMIVGYLQKGE 306

Query: 241 FESALALFYELLSSEEKLDEFILSTILSACGNMGMLRSGEQIQGYATKIGISRFTIFQNS 300
           FE ALALFYELL+S+EK DEFILSTILSAC NMGMLRSGEQIQGYA KIGISR+TIFQNS
Sbjct: 307 FERALALFYELLASKEKPDEFILSTILSACANMGMLRSGEQIQGYANKIGISRYTIFQNS 366

Query: 301 QILMYAKSGDLYSTNLTFQQMENPDVVSWSTMIHSNAQHGHAMEALRFFELMKSCGIEPN 360
           QI MYAKSGDLYS NLTFQQMENPDVVSWST+I SNAQHGHA+EALRFF+LMKSCGIEPN
Sbjct: 367 QIWMYAKSGDLYSANLTFQQMENPDVVSWSTIICSNAQHGHAIEALRFFDLMKSCGIEPN 426

Query: 361 HFGFLGVLIACSHRGLVEEGLRYFDTMKKDYNMTSHVKYCTTACVVDLLGRAGRLVEAES 420
           HF FLGVLIACSHRGLVEEGLRYFDTMKKD+ MTSHVK+C  ACVVDLLGRAGRLV+AES
Sbjct: 427 HFAFLGVLIACSHRGLVEEGLRYFDTMKKDHIMTSHVKHC--ACVVDLLGRAGRLVDAES 486

Query: 421 LILCSGFEHEPVMWRALLSACNVHKDTFTAQRVAEKVIELEPLASASYVLLYNIYMDAGN 480
           LIL  GFEHEPVMWRALLSAC +HKDTFTA+RVAEKVIELEPLASASYVLLYNIYMDAGN
Sbjct: 487 LILDLGFEHEPVMWRALLSACRIHKDTFTAKRVAEKVIELEPLASASYVLLYNIYMDAGN 546

Query: 481 KSDALKVRKLMEARRIKKEPGLSWIEVGDKVYSFVSGDRSHKNSELIYAQLQEMLAKTKS 540
           K DALKVRKLME RRIKKEPGLSWI+VGD++YSFVSGDRSHKNSELIYA+L EMLAKTKS
Sbjct: 547 KQDALKVRKLMEDRRIKKEPGLSWIQVGDQMYSFVSGDRSHKNSELIYAKLDEMLAKTKS 606

Query: 541 TGLVKDIFDYKIEHEYM 558
             L+KD FDYKIE+E M
Sbjct: 607 LDLMKDEFDYKIEYESM 621

BLAST of Sgr016806 vs. ExPASy TrEMBL
Match: A0A6J1F1V7 (pentatricopeptide repeat-containing protein At3g13880 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111438902 PE=3 SV=1)

HSP 1 Score: 975.3 bits (2520), Expect = 1.1e-280
Identity = 489/557 (87.79%), Postives = 521/557 (93.54%), Query Frame = 0

Query: 1   VNHARILFDYSDKLDGVSWNSLIAGYAQNGKYEELLIVLEKMHQSGLALNTYTLGSALKA 60
           V+HAR+LFD+++ LDGVSWNSLIAGYAQNGKYEELL ++ KMHQSGL LNTYTLGSALKA
Sbjct: 211 VDHARMLFDHANNLDGVSWNSLIAGYAQNGKYEELLTIMMKMHQSGLTLNTYTLGSALKA 270

Query: 61  CSSNFNGSKKFGTMLHSLAFKLGLHLDVVVGTALLDMYAKTGSMDDAIQIFDQVLDKNVV 120
           CSSNFNGSK FGTMLH L  KLGLHLDVVVGTALLDMYAKTGS+DDAIQIFDQ++DKNVV
Sbjct: 271 CSSNFNGSKIFGTMLHGLTIKLGLHLDVVVGTALLDMYAKTGSLDDAIQIFDQMVDKNVV 330

Query: 121 MYNAMMAGLLQQETIDDKCAYKALNLFFEMKGCGIKPSMFTYSSLLKACISIEVFEFAKQ 180
           MYNAMMAGLLQQE I+DKCAYKALNLFFEMK CGIKPSMFTYSSLLKACI++E FEFAKQ
Sbjct: 331 MYNAMMAGLLQQEKIEDKCAYKALNLFFEMKSCGIKPSMFTYSSLLKACIAVEDFEFAKQ 390

Query: 181 IHALICKNGLQSDEYIGSVLIDLYSLLGSMKDALSCFNSIHNLTIVPMTAMIVVYLQNGE 240
           IHALICKNGLQSDEYIGSVLIDLY LLGS+KDA SCFNSIHNLTIVP+TAMIV YLQ GE
Sbjct: 391 IHALICKNGLQSDEYIGSVLIDLYFLLGSIKDAFSCFNSIHNLTIVPITAMIVGYLQKGE 450

Query: 241 FESALALFYELLSSEEKLDEFILSTILSACGNMGMLRSGEQIQGYATKIGISRFTIFQNS 300
           FE ALALFYELL+S+EK DEFILSTILSAC NMGMLRSGEQIQGYA KIGISR+TIFQNS
Sbjct: 451 FERALALFYELLASKEKPDEFILSTILSACANMGMLRSGEQIQGYANKIGISRYTIFQNS 510

Query: 301 QILMYAKSGDLYSTNLTFQQMENPDVVSWSTMIHSNAQHGHAMEALRFFELMKSCGIEPN 360
           QI MYAKSGDLYS NLTFQQMENPDVVSWST+I SNAQHGHA+EALRFF+LMKSCGIEPN
Sbjct: 511 QIWMYAKSGDLYSANLTFQQMENPDVVSWSTIICSNAQHGHAIEALRFFDLMKSCGIEPN 570

Query: 361 HFGFLGVLIACSHRGLVEEGLRYFDTMKKDYNMTSHVKYCTTACVVDLLGRAGRLVEAES 420
           HF FLGVLIACSHRGLVEEGLRYFDTMKKD+ MTSHVK+C  ACVVDLLGRAGRLV+AES
Sbjct: 571 HFAFLGVLIACSHRGLVEEGLRYFDTMKKDHIMTSHVKHC--ACVVDLLGRAGRLVDAES 630

Query: 421 LILCSGFEHEPVMWRALLSACNVHKDTFTAQRVAEKVIELEPLASASYVLLYNIYMDAGN 480
           LIL  GFEHEPVMWRALLSAC +HKDTFTA+RVAEKVIELEPLASASYVLLYNIYMDAGN
Sbjct: 631 LILDLGFEHEPVMWRALLSACRIHKDTFTAKRVAEKVIELEPLASASYVLLYNIYMDAGN 690

Query: 481 KSDALKVRKLMEARRIKKEPGLSWIEVGDKVYSFVSGDRSHKNSELIYAQLQEMLAKTKS 540
           K DALKVRKLME RRIKKEPGLSWI+VGD++YSFVSGDRSHKNSELIYA+L EMLAKTKS
Sbjct: 691 KQDALKVRKLMEDRRIKKEPGLSWIQVGDQMYSFVSGDRSHKNSELIYAKLDEMLAKTKS 750

Query: 541 TGLVKDIFDYKIEHEYM 558
             L+KD FDYKIE+E M
Sbjct: 751 LDLMKDEFDYKIEYESM 765

BLAST of Sgr016806 vs. ExPASy TrEMBL
Match: A0A6J1I7D1 (pentatricopeptide repeat-containing protein At3g13880 OS=Cucurbita maxima OX=3661 GN=LOC111471935 PE=3 SV=1)

HSP 1 Score: 970.3 bits (2507), Expect = 3.4e-279
Identity = 489/557 (87.79%), Postives = 518/557 (93.00%), Query Frame = 0

Query: 1   VNHARILFDYSDKLDGVSWNSLIAGYAQNGKYEELLIVLEKMHQSGLALNTYTLGSALKA 60
           V+HAR+LFD+++ LDGVSWNSLIAGYAQNGKYEELL +L KMHQSGL LNTYTLGSALKA
Sbjct: 211 VDHARMLFDHANNLDGVSWNSLIAGYAQNGKYEELLTILMKMHQSGLTLNTYTLGSALKA 270

Query: 61  CSSNFNGSKKFGTMLHSLAFKLGLHLDVVVGTALLDMYAKTGSMDDAIQIFDQVLDKNVV 120
           CSSNFNGSK FGTMLH L  KLGLHLDVVVGTALLDMYAKTGS+DDAIQIFDQ++DKNVV
Sbjct: 271 CSSNFNGSKIFGTMLHGLTIKLGLHLDVVVGTALLDMYAKTGSLDDAIQIFDQMVDKNVV 330

Query: 121 MYNAMMAGLLQQETIDDKCAYKALNLFFEMKGCGIKPSMFTYSSLLKACISIEVFEFAKQ 180
           MYNAMMAGLLQQE I+DKCAYKALNLFFEMK CGIKPSMFTYSSLLKACI++E FEFAKQ
Sbjct: 331 MYNAMMAGLLQQEKIEDKCAYKALNLFFEMKSCGIKPSMFTYSSLLKACIAVEDFEFAKQ 390

Query: 181 IHALICKNGLQSDEYIGSVLIDLYSLLGSMKDALSCFNSIHNLTIVPMTAMIVVYLQNGE 240
            HALICKNGLQSDEYIGSVLIDLY LLGS+KDA SCFNSIHNLTIVP+TAMIV YLQ GE
Sbjct: 391 THALICKNGLQSDEYIGSVLIDLYFLLGSIKDAFSCFNSIHNLTIVPITAMIVGYLQKGE 450

Query: 241 FESALALFYELLSSEEKLDEFILSTILSACGNMGMLRSGEQIQGYATKIGISRFTIFQNS 300
           FE ALALFYELLSS+EK DEFILSTILSAC NMGMLRSGEQIQGYA KIGISR+TIFQNS
Sbjct: 451 FERALALFYELLSSKEKPDEFILSTILSACANMGMLRSGEQIQGYANKIGISRYTIFQNS 510

Query: 301 QILMYAKSGDLYSTNLTFQQMENPDVVSWSTMIHSNAQHGHAMEALRFFELMKSCGIEPN 360
           QI MYAKSGDLYS NLTFQQMENPDVVSWST+I SNAQHGHA+EALRFF+LMKSCGIEPN
Sbjct: 511 QIWMYAKSGDLYSANLTFQQMENPDVVSWSTIICSNAQHGHAIEALRFFDLMKSCGIEPN 570

Query: 361 HFGFLGVLIACSHRGLVEEGLRYFDTMKKDYNMTSHVKYCTTACVVDLLGRAGRLVEAES 420
           HF FLGVLIACSHRGLVEEG+RYFDTMKKD+ MTSHVK+C  ACVVDLLGRAGRLV+AES
Sbjct: 571 HFAFLGVLIACSHRGLVEEGIRYFDTMKKDHIMTSHVKHC--ACVVDLLGRAGRLVDAES 630

Query: 421 LILCSGFEHEPVMWRALLSACNVHKDTFTAQRVAEKVIELEPLASASYVLLYNIYMDAGN 480
           LIL  GFEHEPVMWRALLSAC +HKDT TA+RVAEKVIELEPLASASYVLLYNIYMDAGN
Sbjct: 631 LILDLGFEHEPVMWRALLSACRIHKDTSTAKRVAEKVIELEPLASASYVLLYNIYMDAGN 690

Query: 481 KSDALKVRKLMEARRIKKEPGLSWIEVGDKVYSFVSGDRSHKNSELIYAQLQEMLAKTKS 540
           K DALKVRKLME RRIKKEPGLSWIEVGD+VYSFVSGDRSHKNSELIYA+L EMLAKTK+
Sbjct: 691 KQDALKVRKLMEDRRIKKEPGLSWIEVGDQVYSFVSGDRSHKNSELIYAKLDEMLAKTKN 750

Query: 541 TGLVKDIFDYKIEHEYM 558
             L+KD FDYKIE E M
Sbjct: 751 LHLMKDEFDYKIELESM 765

BLAST of Sgr016806 vs. TAIR 10
Match: AT3G13880.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 538.5 bits (1386), Expect = 6.4e-153
Identity = 280/535 (52.34%), Postives = 369/535 (68.97%), Query Frame = 0

Query: 1   VNHARILFDYSDKLDGVSWNSLIAGYAQNGKYEELLIVLEKMHQSGLALNTYTLGSALKA 60
           ++ A  LFD  D+ D VSWNSLI+GY + G  EE L +L KMH+ GL L TY LGS LKA
Sbjct: 199 LDQAMSLFDRCDERDQVSWNSLISGYVRVGAAEEPLNLLAKMHRDGLNLTTYALGSVLKA 258

Query: 61  CSSNFN-GSKKFGTMLHSLAFKLGLHLDVVVGTALLDMYAKTGSMDDAIQIFDQVLDKNV 120
           C  N N G  + G  +H    KLG+  D+VV TALLDMYAK GS+ +AI++F  +  KNV
Sbjct: 259 CCINLNEGFIEKGMAIHCYTAKLGMEFDIVVRTALLDMYAKNGSLKEAIKLFSLMPSKNV 318

Query: 121 VMYNAMMAGLLQQETIDDKCAYKALNLFFEMKGCGIKPSMFTYSSLLKACISIEVFEFAK 180
           V YNAM++G LQ + I D+ + +A  LF +M+  G++PS  T+S +LKAC + +  E+ +
Sbjct: 319 VTYNAMISGFLQMDEITDEASSEAFKLFMDMQRRGLEPSPSTFSVVLKACSAAKTLEYGR 378

Query: 181 QIHALICKNGLQSDEYIGSVLIDLYSLLGSMKDALSCFNSIHNLTIVPMTAMIVVYLQNG 240
           QIHALICKN  QSDE+IGS LI+LY+L+GS +D + CF S     I   T+MI  ++QN 
Sbjct: 379 QIHALICKNNFQSDEFIGSALIELYALMGSTEDGMQCFASTSKQDIASWTSMIDCHVQNE 438

Query: 241 EFESALALFYELLSSEEKLDEFILSTILSACGNMGMLRSGEQIQGYATKIGISRFTIFQN 300
           + ESA  LF +L SS  + +E+ +S ++SAC +   L SGEQIQGYA K GI  FT  + 
Sbjct: 439 QLESAFDLFRQLFSSHIRPEEYTVSLMMSACADFAALSSGEQIQGYAIKSGIDAFTSVKT 498

Query: 301 SQILMYAKSGDLYSTNLTFQQMENPDVVSWSTMIHSNAQHGHAMEALRFFELMKSCGIEP 360
           S I MYAKSG++   N  F +++NPDV ++S MI S AQHG A EAL  FE MK+ GI+P
Sbjct: 499 SSISMYAKSGNMPLANQVFIEVQNPDVATYSAMISSLAQHGSANEALNIFESMKTHGIKP 558

Query: 361 NHFGFLGVLIACSHRGLVEEGLRYFDTMKKDYNMTSHVKYCTTACVVDLLGRAGRLVEAE 420
           N   FLGVLIAC H GLV +GL+YF  MK DY +  + K+ T  C+VDLLGR GRL +AE
Sbjct: 559 NQQAFLGVLIACCHGGLVTQGLKYFQCMKNDYRINPNEKHFT--CLVDLLGRTGRLSDAE 618

Query: 421 SLILCSGFEHEPVMWRALLSACNVHKDTFTAQRVAEKVIELEPLASASYVLLYNIYMDAG 480
           +LIL SGF+  PV WRALLS+C V+KD+   +RVAE+++ELEP AS SYVLL+NIY D+G
Sbjct: 619 NLILSSGFQDHPVTWRALLSSCRVYKDSVIGKRVAERLMELEPEASGSYVLLHNIYNDSG 678

Query: 481 NKSDALKVRKLMEARRIKKEPGLSWIEVGDKVYSFVSGDRSHKNSELIYAQLQEM 535
             S A +VR+LM  R +KKEP LSWI +G++ +SF   D SH +S++IY  L+ M
Sbjct: 679 VNSSAEEVRELMRDRGVKKEPALSWIVIGNQTHSFAVADLSHPSSQMIYTMLETM 731

BLAST of Sgr016806 vs. TAIR 10
Match: AT2G27610.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 354.4 bits (908), Expect = 1.7e-97
Identity = 205/548 (37.41%), Postives = 314/548 (57.30%), Query Frame = 0

Query: 1   VNHARILFDYSDKLDGVSWNSLIAGYAQNGKYEELLIVLEKMHQSGLALNTYTLGSALKA 60
           V  ARILFD ++    V+WNS+I+GYA NG   E L +   M  + + L+  +  S +K 
Sbjct: 245 VRKARILFDKTEVKSVVTWNSMISGYAANGLDLEALGMFYSMRLNYVRLSESSFASVIKL 304

Query: 61  CSSNFNGSKKFGTMLHSLAFKLGLHLDVVVGTALLDMYAKTGSMDDAIQIFDQV-LDKNV 120
           C++      +F   LH    K G   D  + TAL+  Y+K  +M DA+++F ++    NV
Sbjct: 305 CAN--LKELRFTEQLHCSVVKYGFLFDQNIRTALMVAYSKCTAMLDALRLFKEIGCVGNV 364

Query: 121 VMYNAMMAGLLQQETIDDKCAYKALNLFFEMKGCGIKPSMFTYSSLLKACISIEVFEFAK 180
           V + AM++G LQ +  ++     A++LF EMK  G++P+ FTYS +L A   I       
Sbjct: 365 VSWTAMISGFLQNDGKEE-----AVDLFSEMKRKGVRPNEFTYSVILTALPVIS----PS 424

Query: 181 QIHALICKNGLQSDEYIGSVLIDLYSLLGSMKDALSCFNSIHNLTIVPMTAMIVVYLQNG 240
           ++HA + K   +    +G+ L+D Y  LG +++A   F+ I +  IV  +AM+  Y Q G
Sbjct: 425 EVHAQVVKTNYERSSTVGTALLDAYVKLGKVEEAAKVFSGIDDKDIVAWSAMLAGYAQTG 484

Query: 241 EFESALALFYELLSSEEKLDEFILSTILSAC-GNMGMLRSGEQIQGYATKIGISRFTIFQ 300
           E E+A+ +F EL     K +EF  S+IL+ C      +  G+Q  G+A K  +       
Sbjct: 485 ETEAAIKMFGELTKGGIKPNEFTFSSILNVCAATNASMGQGKQFHGFAIKSRLDSSLCVS 544

Query: 301 NSQILMYAKSGDLYSTNLTFQQMENPDVVSWSTMIHSNAQHGHAMEALRFFELMKSCGIE 360
           ++ + MYAK G++ S    F++    D+VSW++MI   AQHG AM+AL  F+ MK   ++
Sbjct: 545 SALLTMYAKKGNIESAEEVFKRQREKDLVSWNSMISGYAQHGQAMKALDVFKEMKKRKVK 604

Query: 361 PNHFGFLGVLIACSHRGLVEEGLRYFDTMKKDYNMTSHVKYCTTACVVDLLGRAGRLVEA 420
            +   F+GV  AC+H GLVEEG +YFD M +D  +    ++   +C+VDL  RAG+L +A
Sbjct: 605 MDGVTFIGVFAACTHAGLVEEGEKYFDIMVRDCKIAPTKEH--NSCMVDLYSRAGQLEKA 664

Query: 421 ESLILCSGFEHEPVMWRALLSACNVHKDTFTAQRVAEKVIELEPLASASYVLLYNIYMDA 480
             +I          +WR +L+AC VHK T   +  AEK+I ++P  SA+YVLL N+Y ++
Sbjct: 665 MKVIENMPNPAGSTIWRTILAACRVHKKTELGRLAAEKIIAMKPEDSAAYVLLSNMYAES 724

Query: 481 GNKSDALKVRKLMEARRIKKEPGLSWIEVGDKVYSFVSGDRSHKNSELIYAQLQEMLAKT 540
           G+  +  KVRKLM  R +KKEPG SWIEV +K YSF++GDRSH   + IY +L+++  + 
Sbjct: 725 GDWQERAKVRKLMNERNVKKEPGYSWIEVKNKTYSFLAGDRSHPLKDQIYMKLEDLSTRL 779

Query: 541 KSTGLVKD 547
           K  G   D
Sbjct: 785 KDLGYEPD 779

BLAST of Sgr016806 vs. TAIR 10
Match: AT4G33170.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 332.4 bits (851), Expect = 6.9e-91
Identity = 198/543 (36.46%), Postives = 300/543 (55.25%), Query Frame = 0

Query: 4   ARILFDYSDKLDGVSWNSLIAGYAQNGKYEELLIVLEKMHQSGLALNTYTLGSALKACSS 63
           AR +FD   + D +SWNS+IAG AQNG   E + +  ++ + GL  + YT+ S LKA SS
Sbjct: 369 ARTVFDNMSERDLISWNSVIAGIAQNGLEVEAVCLFMQLLRCGLKPDQYTMTSVLKAASS 428

Query: 64  NFNGSKKFGTMLHSLAFKLGLHLDVVVGTALLDMYAKTGSMDDAIQIFDQVLDKNVVMYN 123
              G       +H  A K+    D  V TAL+D Y++   M +A  +F++  + ++V +N
Sbjct: 429 LPEG-LSLSKQVHVHAIKINNVSDSFVSTALIDAYSRNRCMKEAEILFER-HNFDLVAWN 488

Query: 124 AMMAGLLQQETIDDKCAYKALNLFFEMKGCGIKPSMFTYSSLLKACISIEVFEFAKQIHA 183
           AMMAG  Q         +K L LF  M   G +   FT +++ K C  +      KQ+HA
Sbjct: 489 AMMAGYTQSHD-----GHKTLKLFALMHKQGERSDDFTLATVFKTCGFLFAINQGKQVHA 548

Query: 184 LICKNGLQSDEYIGSVLIDLYSLLGSMKDALSCFNSIHNLTIVPMTAMIVVYLQNGEFES 243
              K+G   D ++ S ++D+Y   G M  A   F+SI     V  T MI   ++NGE E 
Sbjct: 549 YAIKSGYDLDLWVSSGILDMYVKCGDMSAAQFAFDSIPVPDDVAWTTMISGCIENGEEER 608

Query: 244 ALALFYELLSSEEKLDEFILSTILSACGNMGMLRSGEQIQGYATKIGISRFTIFQNSQIL 303
           A  +F ++       DEF ++T+  A   +  L  G QI   A K+  +       S + 
Sbjct: 609 AFHVFSQMRLMGVLPDEFTIATLAKASSCLTALEQGRQIHANALKLNCTNDPFVGTSLVD 668

Query: 304 MYAKSGDLYSTNLTFQQMENPDVVSWSTMIHSNAQHGHAMEALRFFELMKSCGIEPNHFG 363
           MYAK G +      F+++E  ++ +W+ M+   AQHG   E L+ F+ MKS GI+P+   
Sbjct: 669 MYAKCGSIDDAYCLFKRIEMMNITAWNAMLVGLAQHGEGKETLQLFKQMKSLGIKPDKVT 728

Query: 364 FLGVLIACSHRGLVEEGLRYFDTMKKDYNMTSHVKYCTTACVVDLLGRAGRLVEAESLIL 423
           F+GVL ACSH GLV E  ++  +M  DY +   +++   +C+ D LGRAG + +AE+LI 
Sbjct: 729 FIGVLSACSHSGLVSEAYKHMRSMHGDYGIKPEIEH--YSCLADALGRAGLVKQAENLIE 788

Query: 424 CSGFEHEPVMWRALLSACNVHKDTFTAQRVAEKVIELEPLASASYVLLYNIYMDAGNKSD 483
               E    M+R LL+AC V  DT T +RVA K++ELEPL S++YVLL N+Y  A    +
Sbjct: 789 SMSMEASASMYRTLLAACRVQGDTETGKRVATKLLELEPLDSSAYVLLSNMYAAASKWDE 848

Query: 484 ALKVRKLMEARRIKKEPGLSWIEVGDKVYSFVSGDRSHKNSELIYAQLQEMLAKTKSTGL 543
               R +M+  ++KK+PG SWIEV +K++ FV  DRS++ +ELIY ++++M+   K  G 
Sbjct: 849 MKLARTMMKGHKVKKDPGFSWIEVKNKIHIFVVDDRSNRQTELIYRKVKDMIRDIKQEGY 902

Query: 544 VKD 547
           V +
Sbjct: 909 VPE 902

BLAST of Sgr016806 vs. TAIR 10
Match: AT4G39530.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 326.2 bits (835), Expect = 5.0e-89
Identity = 195/536 (36.38%), Postives = 300/536 (55.97%), Query Frame = 0

Query: 4   ARILFDYSDKLDGVSWNSLIAGYAQNGKYEELLIVLEKMHQSGLALNTYTLGSALKACSS 63
           A  LF+     + +SW +L++GY QN  ++E + +   M + GL  + Y   S L +C+S
Sbjct: 303 AHKLFNGMPNKNIISWTTLLSGYKQNALHKEAMELFTSMSKFGLKPDMYACSSILTSCAS 362

Query: 64  NFNGSKKFGTMLHSLAFKLGLHLDVVVGTALLDMYAKTGSMDDAIQIFDQVLDKNVVMYN 123
               +  FGT +H+   K  L  D  V  +L+DMYAK   + DA ++FD     +VV++N
Sbjct: 363 LH--ALGFGTQVHAYTIKANLGNDSYVTNSLIDMYAKCDCLTDARKVFDIFAAADVVLFN 422

Query: 124 AMMAGLLQQETIDDKCAYKALNLFFEMKGCGIKPSMFTYSSLLKACISIEVFEFAKQIHA 183
           AM+ G  +  T  +   ++ALN+F +M+   I+PS+ T+ SLL+A  S+     +KQIH 
Sbjct: 423 AMIEGYSRLGTQWE--LHEALNIFRDMRFRLIRPSLLTFVSLLRASASLTSLGLSKQIHG 482

Query: 184 LICKNGLQSDEYIGSVLIDLYSLLGSMKDALSCFNSIHNLTIVPMTAMIVVYLQNGEFES 243
           L+ K GL  D + GS LID+YS    +KD+   F+ +    +V   +M   Y+Q  E E 
Sbjct: 483 LMFKYGLNLDIFAGSALIDVYSNCYCLKDSRLVFDEMKVKDLVIWNSMFAGYVQQSENEE 542

Query: 244 ALALFYELLSSEEKLDEFILSTILSACGNMGMLRSGEQIQGYATKIGISRFTIFQNSQIL 303
           AL LF EL  S E+ DEF  + +++A GN+  ++ G++      K G+       N+ + 
Sbjct: 543 ALNLFLELQLSRERPDEFTFANMVTAAGNLASVQLGQEFHCQLLKRGLECNPYITNALLD 602

Query: 304 MYAKSGDLYSTNLTFQQMENPDVVSWSTMIHSNAQHGHAMEALRFFELMKSCGIEPNHFG 363
           MYAK G     +  F    + DVV W+++I S A HG   +AL+  E M S GIEPN+  
Sbjct: 603 MYAKCGSPEDAHKAFDSAASRDVVCWNSVISSYANHGEGKKALQMLEKMMSEGIEPNYIT 662

Query: 364 FLGVLIACSHRGLVEEGLRYFDTMKKDYNMTSHVKYCTTACVVDLLGRAGRLVEAESLIL 423
           F+GVL ACSH GLVE+GL+ F+ M + + +    ++    C+V LLGRAGRL +A  LI 
Sbjct: 663 FVGVLSACSHAGLVEDGLKQFELMLR-FGIEPETEH--YVCMVSLLGRAGRLNKARELIE 722

Query: 424 CSGFEHEPVMWRALLSACNVHKDTFTAQRVAEKVIELEPLASASYVLLYNIYMDAGNKSD 483
               +   ++WR+LLS C    +   A+  AE  I  +P  S S+ +L NIY   G  ++
Sbjct: 723 KMPTKPAAIVWRSLLSGCAKAGNVELAEHAAEMAILSDPKDSGSFTMLSNIYASKGMWTE 782

Query: 484 ALKVRKLMEARRIKKEPGLSWIEVGDKVYSFVSGDRSHKNSELIYAQLQEMLAKTK 540
           A KVR+ M+   + KEPG SWI +  +V+ F+S D+SH  +  IY  L ++L + +
Sbjct: 783 AKKVRERMKVEGVVKEPGRSWIGINKEVHIFLSKDKSHCKANQIYEVLDDLLVQIR 831

BLAST of Sgr016806 vs. TAIR 10
Match: AT3G49170.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 322.4 bits (825), Expect = 7.2e-88
Identity = 195/555 (35.14%), Postives = 305/555 (54.95%), Query Frame = 0

Query: 7   LFDYSDKLDGVSWNSLIAGYAQNGKYEELLIVLEKMHQSGLALNTYTLGSALKACSSNFN 66
           +FD   +L+ V+W  +I    Q G   E +     M  SG   + +TL S   AC+   N
Sbjct: 225 VFDKMSELNVVTWTLMITRCMQMGFPREAIRFFLDMVLSGFESDKFTLSSVFSACAELEN 284

Query: 67  GSKKFGTMLHSLAFKLGLHLDVVVGTALLDMYAK---TGSMDDAIQIFDQVLDKNVVMYN 126
            S   G  LHS A + GL  D  V  +L+DMYAK    GS+DD  ++FD++ D +V+ + 
Sbjct: 285 LS--LGKQLHSWAIRSGLVDD--VECSLVDMYAKCSADGSVDDCRKVFDRMEDHSVMSWT 344

Query: 127 AMMAGLLQQETIDDKCAYKALNLFFEMKGCG-IKPSMFTYSSLLKACISIEVFEFAKQIH 186
           A++ G ++   +    A +A+NLF EM   G ++P+ FT+SS  KAC ++      KQ+ 
Sbjct: 345 ALITGYMKNCNL----ATEAINLFSEMITQGHVEPNHFTFSSAFKACGNLSDPRVGKQVL 404

Query: 187 ALICKNGLQSDEYIGSVLIDLYSLLGSMKDALSCFNSIHNLTIVPMTAMIVVYLQNGEFE 246
               K GL S+  + + +I ++     M+DA   F S+    +V     +    +N  FE
Sbjct: 405 GQAFKRGLASNSSVANSVISMFVKSDRMEDAQRAFESLSEKNLVSYNTFLDGTCRNLNFE 464

Query: 247 SALALFYELLSSEEKLDEFILSTILSACGNMGMLRSGEQIQGYATKIGISRFTIFQNSQI 306
            A  L  E+   E  +  F  +++LS   N+G +R GEQI     K+G+S      N+ I
Sbjct: 465 QAFKLLSEITERELGVSAFTFASLLSGVANVGSIRKGEQIHSQVVKLGLSCNQPVCNALI 524

Query: 307 LMYAKSGDLYSTNLTFQQMENPDVVSWSTMIHSNAQHGHAMEALRFFELMKSCGIEPNHF 366
            MY+K G + + +  F  MEN +V+SW++MI   A+HG A+  L  F  M   G++PN  
Sbjct: 525 SMYSKCGSIDTASRVFNFMENRNVISWTSMITGFAKHGFAIRVLETFNQMIEEGVKPNEV 584

Query: 367 GFLGVLIACSHRGLVEEGLRYFDTMKKDYNMTSHVKYCTTACVVDLLGRAGRLVEAESLI 426
            ++ +L ACSH GLV EG R+F++M +D+ +   +++   AC+VDLL RAG L +A   I
Sbjct: 585 TYVAILSACSHVGLVSEGWRHFNSMYEDHKIKPKMEH--YACMVDLLCRAGLLTDAFEFI 644

Query: 427 LCSGFEHEPVMWRALLSACNVHKDTFTAQRVAEKVIELEPLASASYVLLYNIYMDAGNKS 486
               F+ + ++WR  L AC VH +T   +  A K++EL+P   A+Y+ L NIY  AG   
Sbjct: 645 NTMPFQADVLVWRTFLGACRVHSNTELGKLAARKILELDPNEPAAYIQLSNIYACAGKWE 704

Query: 487 DALKVRKLMEARRIKKEPGLSWIEVGDKVYSFVSGDRSHKNSELIYAQLQEMLAKTKSTG 546
           ++ ++R+ M+ R + KE G SWIEVGDK++ F  GD +H N+  IY +L  ++ + K  G
Sbjct: 705 ESTEMRRKMKERNLVKEGGCSWIEVGDKIHKFYVGDTAHPNAHQIYDELDRLITEIKRCG 764

Query: 547 LV--KDIFDYKIEHE 556
            V   D+  +K+E E
Sbjct: 765 YVPDTDLVLHKLEEE 769

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_022141889.19.5e-29291.23pentatricopeptide repeat-containing protein At3g13880 isoform X1 [Momordica char... [more]
XP_022141892.19.5e-29291.23pentatricopeptide repeat-containing protein At3g13880 isoform X2 [Momordica char... [more]
XP_023523805.11.7e-28087.97pentatricopeptide repeat-containing protein At3g13880 isoform X2 [Cucurbita pepo... [more]
XP_023523804.11.7e-28087.97pentatricopeptide repeat-containing protein At3g13880 isoform X1 [Cucurbita pepo... [more]
XP_022932499.12.2e-28087.79pentatricopeptide repeat-containing protein At3g13880 isoform X2 [Cucurbita mosc... [more]
Match NameE-valueIdentityDescription
Q9LRV99.0e-15252.34Pentatricopeptide repeat-containing protein At3g13880 OS=Arabidopsis thaliana OX... [more]
Q9ZUW32.4e-9637.41Pentatricopeptide repeat-containing protein At2g27610 OS=Arabidopsis thaliana OX... [more]
Q9SMZ29.8e-9036.46Pentatricopeptide repeat-containing protein At4g33170 OS=Arabidopsis thaliana OX... [more]
Q9SVA57.0e-8836.38Pentatricopeptide repeat-containing protein At4g39530 OS=Arabidopsis thaliana OX... [more]
Q5G1T11.0e-8635.14Pentatricopeptide repeat-containing protein At3g49170, chloroplastic OS=Arabidop... [more]
Match NameE-valueIdentityDescription
A0A6J1CL464.6e-29291.23pentatricopeptide repeat-containing protein At3g13880 isoform X2 OS=Momordica ch... [more]
A0A6J1CJD14.6e-29291.23pentatricopeptide repeat-containing protein At3g13880 isoform X1 OS=Momordica ch... [more]
A0A6J1EX551.1e-28087.79pentatricopeptide repeat-containing protein At3g13880 isoform X2 OS=Cucurbita mo... [more]
A0A6J1F1V71.1e-28087.79pentatricopeptide repeat-containing protein At3g13880 isoform X1 OS=Cucurbita mo... [more]
A0A6J1I7D13.4e-27987.79pentatricopeptide repeat-containing protein At3g13880 OS=Cucurbita maxima OX=366... [more]
Match NameE-valueIdentityDescription
AT3G13880.16.4e-15352.34Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT2G27610.11.7e-9737.41Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT4G33170.16.9e-9136.46Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT4G39530.15.0e-8936.38Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT3G49170.17.2e-8835.14Tetratricopeptide repeat (TPR)-like superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Monk fruit (Qingpiguo) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 92..120
e-value: 0.0018
score: 16.3
coord: 327..360
e-value: 5.5E-6
score: 24.2
coord: 17..50
e-value: 1.5E-5
score: 22.9
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 117..169
e-value: 1.2E-10
score: 41.4
coord: 324..368
e-value: 6.1E-8
score: 32.7
coord: 17..62
e-value: 6.5E-8
score: 32.7
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 228..253
e-value: 0.0089
score: 16.2
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 87..121
score: 9.536388
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 325..359
score: 11.717688
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 15..49
score: 11.531345
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 1..70
e-value: 4.5E-11
score: 44.4
coord: 71..173
e-value: 3.5E-21
score: 77.4
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 174..292
e-value: 1.3E-14
score: 56.4
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 295..534
e-value: 1.6E-27
score: 98.7
NoneNo IPR availablePANTHERPTHR24015OS07G0578800 PROTEIN-RELATEDcoord: 115..537
NoneNo IPR availablePANTHERPTHR24015:SF811OS01G0959600 PROTEINcoord: 3..114
NoneNo IPR availablePANTHERPTHR24015OS07G0578800 PROTEIN-RELATEDcoord: 3..114
NoneNo IPR availablePANTHERPTHR24015:SF811OS01G0959600 PROTEINcoord: 115..537

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Sgr016806.1Sgr016806.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0005515 protein binding
molecular_function GO:0008270 zinc ion binding