Cla97C06G112990 (gene) Watermelon (97103) v2.5

Overview
NameCla97C06G112990
Typegene
OrganismCitrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
DescriptionPentatricopeptide repeat-containing protein
LocationCla97Chr06: 4040995 .. 4042857 (-)
RNA-Seq ExpressionCla97C06G112990
SyntenyCla97C06G112990
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGCTGATTTGGCCGTCCATCCACTTTGGTCGTTCTCGTCTGGTCCATTCCTTTTCCGTCAACGCCCTCAAAGCCGCCGCCCCAGTGAATCCCATTCCCCGAGGTACCCAATCGCACAGCCTCATTATAAAGTTGGGATTGGCTAATGAACTTTCTGTTCAGAACAAGCTATTGAAGATTTATGTTAAGTGCAGGGATTTAGAAAGTGCAGGGAACGTGTTTGATGAAATGTCTAGGAGAAATGTTGTGTCGTGGAACACGGTGATATGTGGGCTTGTCGATTGCGGGTATGGAGCTGATTTTAAGCTGAGGCAGCACTCAATTTTTTTATATTTTAAGAAGATGTTGATGGGTATGGTGCGCCCAGATGGTGTCACATTTAATGGGTTGTTTCGATCTTGTGTTGTGTTGAACGATGTTGAAAGTGGCAGGCAATTGCATGGTTTTGTAATGAAAATTGGGTTTGATTTGGATTGTTTTGTGGGGAGTGCAGTGGTTGATTTTTATGGGAAATGTGGGTTATATGAAGATGCGAGATTAGCTTTTAGCTGCATTCTGTATAGGGATCTGGTTTTGTGGAATGTGATGTTGTACTGTTATGTGTTTAATTGTTTGGGGCGAGAAGCAATTGAAGTCTTTTGTTTGATGCAGTTGGAAGGTTTTAAAGGTGATGATTTTACATTCAGCAGCCTGCTAAGTTCGTGCAAGTATAAAGGATCAGGAGAATTGGGTAAGCAGCTCCATGGTCTTCTTATAAAACAGTCATTCGATTTAGATATTCTTGTGGCAAGTTCACTTGTCAATGTGTATGCTAAAAACGATAATTTATATGATGCTCGCAAGGTTTTTGATGAAATGCCATCTAGAAATTCTGTGTCTTGGACCACTATGATTGTGGGGTATGGGCAGCAAGAAGATGGAAAAGAGGCAGTGAAACTGTTCAGAAGAATGTTTGGGGAAGATTATTGCCCAGATGAATTAACTTTTGCTAGTGTGCTGAGTTCGTGTGGCTTTACGTGCGGGGCTTGTGAACTGATGCAAGTTCATTCCTGCTTGATAAAACTTGGTTTGGAAGCATTTCTGTCTATTAATAACGGGTTGATAAATGCATATTCGAAGTGTGGTATCATCTCCACAGCGTTACAATGCTTTAGATTAATTGCAGAACCAGATTTGGTAACATGGACATCAATTATATGTGGACTTGCACTTTGTGGCCTTGAGAAGGATGCTGTTGAGTTATTTGATAAGATGTTATCTTATGGCATTAAACCAGATAAAATTGCATTTCTTGGAGTTCTTTCTGCCTGTAGTCATGGGGGATTTGTAAGCATGGGGCTTCACTACTTCAACTTAATGACGATCCAATACCAAATTGTTCCTAATTCAGAGCATTTAACATGCTTGATCGACCTTCTCAGTAGAGCGGGTAGTCTAGACCAGGCTTTTGACCTTTTGAAATCAACGGCGAAGGAAGCTGGACCAGATGCTTTCAGGGCTTTCATTCGAGCATGTAGAACTCATGGGGACTTGAGATTAGCAGAATGGGCAATGGAATTTGCATCAGAGCCAAATGAACAAGTGAATTATTCTCTAGTGTCGAATATGTATGCTTCTGAAGGAAGATGGTCAGATGTGGCTAGAATGCGCAAACTGATGAAGGATAGTTGTGAAAGGAAAGCCCCAGGCCTTAGTTGGGTAGAAATTGCTGGTATAACCATTTGTTATAACCATTTGTTTGTATCGGGTGATAGATCCCATCCACAGTCTTCAGATCTCTATACAATGTTAGGATTATTACTAAACACGATGAAGAAGGATGACAACTCCGCAGCCCTCTGGGTAGATATTGTGCCCGATTGA

mRNA sequence

ATGCTGATTTGGCCGTCCATCCACTTTGGTCGTTCTCGTCTGGTCCATTCCTTTTCCGTCAACGCCCTCAAAGCCGCCGCCCCAGTGAATCCCATTCCCCGAGGTACCCAATCGCACAGCCTCATTATAAAGTTGGGATTGGCTAATGAACTTTCTGTTCAGAACAAGCTATTGAAGATTTATGTTAAGTGCAGGGATTTAGAAAGTGCAGGGAACGTGTTTGATGAAATGTCTAGGAGAAATGTTGTGTCGTGGAACACGGTGATATGTGGGCTTGTCGATTGCGGGTATGGAGCTGATTTTAAGCTGAGGCAGCACTCAATTTTTTTATATTTTAAGAAGATGTTGATGGGTATGGTGCGCCCAGATGGTGTCACATTTAATGGGTTGTTTCGATCTTGTGTTGTGTTGAACGATGTTGAAAGTGGCAGGCAATTGCATGGTTTTGTAATGAAAATTGGGTTTGATTTGGATTGTTTTGTGGGGAGTGCAGTGGTTGATTTTTATGGGAAATGTGGGTTATATGAAGATGCGAGATTAGCTTTTAGCTGCATTCTGTATAGGGATCTGGTTTTGTGGAATGTGATGTTGTACTGTTATGTGTTTAATTGTTTGGGGCGAGAAGCAATTGAAGTCTTTTGTTTGATGCAGTTGGAAGGTTTTAAAGGTGATGATTTTACATTCAGCAGCCTGCTAAGTTCGTGCAAGTATAAAGGATCAGGAGAATTGGGTAAGCAGCTCCATGGTCTTCTTATAAAACAGTCATTCGATTTAGATATTCTTGTGGCAAGTTCACTTGTCAATGTGTATGCTAAAAACGATAATTTATATGATGCTCGCAAGGTTTTTGATGAAATGCCATCTAGAAATTCTGTGTCTTGGACCACTATGATTGTGGGGTATGGGCAGCAAGAAGATGGAAAAGAGGCAGTGAAACTGTTCAGAAGAATGTTTGGGGAAGATTATTGCCCAGATGAATTAACTTTTGCTAGTGTGCTGAGTTCGTGTGGCTTTACGTGCGGGGCTTGTGAACTGATGCAAGTTCATTCCTGCTTGATAAAACTTGGTTTGGAAGCATTTCTGTCTATTAATAACGGGTTGATAAATGCATATTCGAAGTGTGGTATCATCTCCACAGCGTTACAATGCTTTAGATTAATTGCAGAACCAGATTTGGTAACATGGACATCAATTATATGTGGACTTGCACTTTGTGGCCTTGAGAAGGATGCTGTTGAGTTATTTGATAAGATGTTATCTTATGGCATTAAACCAGATAAAATTGCATTTCTTGGAGTTCTTTCTGCCTGTAGTCATGGGGGATTTGTAAGCATGGGGCTTCACTACTTCAACTTAATGACGATCCAATACCAAATTGTTCCTAATTCAGAGCATTTAACATGCTTGATCGACCTTCTCAGTAGAGCGGGTAGTCTAGACCAGGCTTTTGACCTTTTGAAATCAACGGCGAAGGAAGCTGGACCAGATGCTTTCAGGGCTTTCATTCGAGCATGTAGAACTCATGGGGACTTGAGATTAGCAGAATGGGCAATGGAATTTGCATCAGAGCCAAATGAACAAGTGAATTATTCTCTAGTGTCGAATATGTATGCTTCTGAAGGAAGATGGTCAGATGTGGCTAGAATGCGCAAACTGATGAAGGATAGTTGTGAAAGGAAAGCCCCAGGCCTTAGTTGGGTAGAAATTGCTGGTATAACCATTTGTTATAACCATTTGTTTGTATCGGGTGATAGATCCCATCCACAGTCTTCAGATCTCTATACAATGTTAGGATTATTACTAAACACGATGAAGAAGGATGACAACTCCGCAGCCCTCTGGGTAGATATTGTGCCCGATTGA

Coding sequence (CDS)

ATGCTGATTTGGCCGTCCATCCACTTTGGTCGTTCTCGTCTGGTCCATTCCTTTTCCGTCAACGCCCTCAAAGCCGCCGCCCCAGTGAATCCCATTCCCCGAGGTACCCAATCGCACAGCCTCATTATAAAGTTGGGATTGGCTAATGAACTTTCTGTTCAGAACAAGCTATTGAAGATTTATGTTAAGTGCAGGGATTTAGAAAGTGCAGGGAACGTGTTTGATGAAATGTCTAGGAGAAATGTTGTGTCGTGGAACACGGTGATATGTGGGCTTGTCGATTGCGGGTATGGAGCTGATTTTAAGCTGAGGCAGCACTCAATTTTTTTATATTTTAAGAAGATGTTGATGGGTATGGTGCGCCCAGATGGTGTCACATTTAATGGGTTGTTTCGATCTTGTGTTGTGTTGAACGATGTTGAAAGTGGCAGGCAATTGCATGGTTTTGTAATGAAAATTGGGTTTGATTTGGATTGTTTTGTGGGGAGTGCAGTGGTTGATTTTTATGGGAAATGTGGGTTATATGAAGATGCGAGATTAGCTTTTAGCTGCATTCTGTATAGGGATCTGGTTTTGTGGAATGTGATGTTGTACTGTTATGTGTTTAATTGTTTGGGGCGAGAAGCAATTGAAGTCTTTTGTTTGATGCAGTTGGAAGGTTTTAAAGGTGATGATTTTACATTCAGCAGCCTGCTAAGTTCGTGCAAGTATAAAGGATCAGGAGAATTGGGTAAGCAGCTCCATGGTCTTCTTATAAAACAGTCATTCGATTTAGATATTCTTGTGGCAAGTTCACTTGTCAATGTGTATGCTAAAAACGATAATTTATATGATGCTCGCAAGGTTTTTGATGAAATGCCATCTAGAAATTCTGTGTCTTGGACCACTATGATTGTGGGGTATGGGCAGCAAGAAGATGGAAAAGAGGCAGTGAAACTGTTCAGAAGAATGTTTGGGGAAGATTATTGCCCAGATGAATTAACTTTTGCTAGTGTGCTGAGTTCGTGTGGCTTTACGTGCGGGGCTTGTGAACTGATGCAAGTTCATTCCTGCTTGATAAAACTTGGTTTGGAAGCATTTCTGTCTATTAATAACGGGTTGATAAATGCATATTCGAAGTGTGGTATCATCTCCACAGCGTTACAATGCTTTAGATTAATTGCAGAACCAGATTTGGTAACATGGACATCAATTATATGTGGACTTGCACTTTGTGGCCTTGAGAAGGATGCTGTTGAGTTATTTGATAAGATGTTATCTTATGGCATTAAACCAGATAAAATTGCATTTCTTGGAGTTCTTTCTGCCTGTAGTCATGGGGGATTTGTAAGCATGGGGCTTCACTACTTCAACTTAATGACGATCCAATACCAAATTGTTCCTAATTCAGAGCATTTAACATGCTTGATCGACCTTCTCAGTAGAGCGGGTAGTCTAGACCAGGCTTTTGACCTTTTGAAATCAACGGCGAAGGAAGCTGGACCAGATGCTTTCAGGGCTTTCATTCGAGCATGTAGAACTCATGGGGACTTGAGATTAGCAGAATGGGCAATGGAATTTGCATCAGAGCCAAATGAACAAGTGAATTATTCTCTAGTGTCGAATATGTATGCTTCTGAAGGAAGATGGTCAGATGTGGCTAGAATGCGCAAACTGATGAAGGATAGTTGTGAAAGGAAAGCCCCAGGCCTTAGTTGGGTAGAAATTGCTGGTATAACCATTTGTTATAACCATTTGTTTGTATCGGGTGATAGATCCCATCCACAGTCTTCAGATCTCTATACAATGTTAGGATTATTACTAAACACGATGAAGAAGGATGACAACTCCGCAGCCCTCTGGGTAGATATTGTGCCCGATTGA

Protein sequence

MLIWPSIHFGRSRLVHSFSVNALKAAAPVNPIPRGTQSHSLIIKLGLANELSVQNKLLKIYVKCRDLESAGNVFDEMSRRNVVSWNTVICGLVDCGYGADFKLRQHSIFLYFKKMLMGMVRPDGVTFNGLFRSCVVLNDVESGRQLHGFVMKIGFDLDCFVGSAVVDFYGKCGLYEDARLAFSCILYRDLVLWNVMLYCYVFNCLGREAIEVFCLMQLEGFKGDDFTFSSLLSSCKYKGSGELGKQLHGLLIKQSFDLDILVASSLVNVYAKNDNLYDARKVFDEMPSRNSVSWTTMIVGYGQQEDGKEAVKLFRRMFGEDYCPDELTFASVLSSCGFTCGACELMQVHSCLIKLGLEAFLSINNGLINAYSKCGIISTALQCFRLIAEPDLVTWTSIICGLALCGLEKDAVELFDKMLSYGIKPDKIAFLGVLSACSHGGFVSMGLHYFNLMTIQYQIVPNSEHLTCLIDLLSRAGSLDQAFDLLKSTAKEAGPDAFRAFIRACRTHGDLRLAEWAMEFASEPNEQVNYSLVSNMYASEGRWSDVARMRKLMKDSCERKAPGLSWVEIAGITICYNHLFVSGDRSHPQSSDLYTMLGLLLNTMKKDDNSAALWVDIVPD
Homology
BLAST of Cla97C06G112990 vs. NCBI nr
Match: XP_038874466.1 (pentatricopeptide repeat-containing protein At2g46050, mitochondrial isoform X1 [Benincasa hispida])

HSP 1 Score: 1125.9 bits (2911), Expect = 0.0e+00
Identity = 556/613 (90.70%), Postives = 573/613 (93.47%), Query Frame = 0

Query: 1   MLIWPSIHFGRSRLVHSFSVNALKAAAPVNPIPRGTQSHSLIIKLGLANELSVQNKLLKI 60
           MLIWPS HFGRSRLVHSFS N LKAAA VN IPRGTQ HS  IKLGLANELSVQNKLLKI
Sbjct: 1   MLIWPSTHFGRSRLVHSFSFNVLKAAAAVNSIPRGTQLHSHFIKLGLANELSVQNKLLKI 60

Query: 61  YVKCRDLESAGNVFDEMSRRNVVSWNTVICGLVDCGYGADFKLRQHSIFLYFKKMLMGMV 120
           YVKCR+LESA N+FDEM RRNVVSWNTVICGLV+CGYG +FK+RQHSIF YFKKMLMG+V
Sbjct: 61  YVKCRNLESARNLFDEMPRRNVVSWNTVICGLVNCGYGGEFKVRQHSIFSYFKKMLMGLV 120

Query: 121 RPDGVTFNGLFRSCVVLNDVESGRQLHGFVMKIGFDLDCFVGSAVVDFYGKCGLYEDARL 180
            PDGVTFNGLFRSCVVLNDVESGRQLHGFVMKIGFDLDCFVGSAVV FY KCGLYEDARL
Sbjct: 121 DPDGVTFNGLFRSCVVLNDVESGRQLHGFVMKIGFDLDCFVGSAVVAFYAKCGLYEDARL 180

Query: 181 AFSCILYRDLVLWNVMLYCYVFNCLGREAIEVFCLMQLEGFKGDDFTFSSLLSSCKYKGS 240
           AFSCILYRDLVLWNVMLYCYVFNCLGREAIEVFCLMQLEGFKGDDFTFSSLLSSCKYKGS
Sbjct: 181 AFSCILYRDLVLWNVMLYCYVFNCLGREAIEVFCLMQLEGFKGDDFTFSSLLSSCKYKGS 240

Query: 241 GELGKQLHGLLIKQSFDLDILVASSLVNVYAKNDNLYDARKVFDEMPSRNSVSWTTMIVG 300
           GELGKQLHGLLIKQSFDLDILVASSLVNVYAKNDNLYDARKVFDEMPSRNSVSWTTMIVG
Sbjct: 241 GELGKQLHGLLIKQSFDLDILVASSLVNVYAKNDNLYDARKVFDEMPSRNSVSWTTMIVG 300

Query: 301 YGQQEDGKEAVKLFRRMFGEDYCPDELTFASVLSSCGFTCGACELMQVHSCLIKLGLEAF 360
           +GQQEDGKEAVKLFRRMFGEDY PDELTFASVLSSCG T GACEL QVHSCLIKLG EAF
Sbjct: 301 HGQQEDGKEAVKLFRRMFGEDYYPDELTFASVLSSCGLTSGACELKQVHSCLIKLGFEAF 360

Query: 361 LSINNGLINAYSKCGIISTALQCFRLIAEPDLVTWTSIICGLALCGLEKDAVELFDKMLS 420
            SINNGLINAYSKCGIIS ALQCFRLIAEPDLVTWTS ICGLALCGLEK+A+ELFDKMLS
Sbjct: 361 SSINNGLINAYSKCGIISAALQCFRLIAEPDLVTWTSTICGLALCGLEKNAIELFDKMLS 420

Query: 421 YGIKPDKIAFLGVLSACSHGGFVSMGLHYFNLMTIQYQIVPNSEHLTCLIDLLSRAGSLD 480
           Y I+PDKIAFLGVLSACSHGGFVSMGLHYFNLMT QYQIVP+SEHLTCLIDLL RAGSLD
Sbjct: 421 YAIRPDKIAFLGVLSACSHGGFVSMGLHYFNLMTNQYQIVPDSEHLTCLIDLLGRAGSLD 480

Query: 481 QAFDLLKSTAKEAGPDAFRAFIRACRTHGDLRLAEWAMEFASEPNEQVNYSLVSNMYASE 540
           +AFDLLKS    AGPDAFRAFIRACRTHG+LRLA+WAMEFASEPNEQVNYSLVSNMYASE
Sbjct: 481 EAFDLLKS--MPAGPDAFRAFIRACRTHGNLRLAKWAMEFASEPNEQVNYSLVSNMYASE 540

Query: 541 GRWSDVARMRKLMKDSCERKAPGLSWVEIAGITICYNHLFVSGDRSHPQSSDLYTMLGLL 600
           GRWSDVARMRKLMKDSC+RKAPG SWVEIAG    YNHLFVSGDRSHP+S DLY MLGLL
Sbjct: 541 GRWSDVARMRKLMKDSCDRKAPGFSWVEIAG----YNHLFVSGDRSHPESLDLYAMLGLL 600

Query: 601 LNTMKKDDNSAAL 614
           LNTMK D+ S AL
Sbjct: 601 LNTMKMDNKSTAL 607

BLAST of Cla97C06G112990 vs. NCBI nr
Match: XP_008458191.1 (PREDICTED: pentatricopeptide repeat-containing protein At2g46050, mitochondrial isoform X1 [Cucumis melo] >TYK03026.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 1076.6 bits (2783), Expect = 0.0e+00
Identity = 528/612 (86.27%), Postives = 558/612 (91.18%), Query Frame = 0

Query: 1   MLIWPSIHFGRSRLVHSFSVNALKAAAPVNPIPRGTQSHSLIIKLGLANELSVQNKLLKI 60
           MLIW S HFGRSRLVHSFS N LKAAAPVN IPR T  HS+++KLGLANELSVQNKLLK+
Sbjct: 11  MLIWTSTHFGRSRLVHSFSFNVLKAAAPVNSIPRDTLLHSVVVKLGLANELSVQNKLLKV 70

Query: 61  YVKCRDLESAGNVFDEMSRRNVVSWNTVICGLVDCGYGADFKLRQHSIFLYFKKMLMGMV 120
           YVKCRDL+SA ++FDEM RRN VSWNTVICGLVD GYG +FK RQ  IFLYFKKMLMG+V
Sbjct: 71  YVKCRDLDSARSLFDEMPRRNAVSWNTVICGLVDGGYGGEFKTRQRLIFLYFKKMLMGLV 130

Query: 121 RPDGVTFNGLFRSCVVLNDVESGRQLHGFVMKIGFDLDCFVGSAVVDFYGKCGLYEDARL 180
            PDG+TFNGLFRSCVVLNDVESGRQLH FVMKIGFDLDCFVGSA+VDFY KCGLYEDARL
Sbjct: 131 DPDGITFNGLFRSCVVLNDVESGRQLHSFVMKIGFDLDCFVGSALVDFYAKCGLYEDARL 190

Query: 181 AFSCILYRDLVLWNVMLYCYVFNCLGREAIEVFCLMQLEGFKGDDFTFSSLLSSCKYKGS 240
           AFSC LY+DLVLWNVMLYCYVFN L REAIE F LMQLEGFKGD+FTFSSLLSSCKYKGS
Sbjct: 191 AFSCTLYKDLVLWNVMLYCYVFNSLSREAIEGFRLMQLEGFKGDEFTFSSLLSSCKYKGS 250

Query: 241 GELGKQLHGLLIKQSFDLDILVASSLVNVYAKNDNLYDARKVFDEMPSRNSVSWTTMIVG 300
           GELGKQLHGLLIKQSFDLDILVASSL++VYAKNDNLYDARKVFDEMP+RNSVSWTTMIVG
Sbjct: 251 GELGKQLHGLLIKQSFDLDILVASSLIDVYAKNDNLYDARKVFDEMPTRNSVSWTTMIVG 310

Query: 301 YGQQEDGKEAVKLFRRMFGEDYCPDELTFASVLSSCGFTCGACELMQVHSCLIKLGLEAF 360
           YGQQE GKEAVKLFRRMFG+DYC DELTFASVLSSCGFT GA ELMQVHSCLIKLG EAF
Sbjct: 311 YGQQEYGKEAVKLFRRMFGKDYCLDELTFASVLSSCGFTSGASELMQVHSCLIKLGFEAF 370

Query: 361 LSINNGLINAYSKCGIISTALQCFRLIAEPDLVTWTSIICGLALCGLEKDAVELFDKMLS 420
           LSINNGLI AYSKCGI++ ALQCFRLIAEPDLVTWTSIICGLA CGLEKDAV+LFDKMLS
Sbjct: 371 LSINNGLIYAYSKCGIVAAALQCFRLIAEPDLVTWTSIICGLAFCGLEKDAVKLFDKMLS 430

Query: 421 YGIKPDKIAFLGVLSACSHGGFVSMGLHYFNLMTIQYQIVPNSEHLTCLIDLLSRAGSLD 480
           YGI+PDKIAFLGVLSACSHGGFVSMGLHYFNLMT QYQ+VP+ EHLTCLIDLL RAGSLD
Sbjct: 431 YGIRPDKIAFLGVLSACSHGGFVSMGLHYFNLMTNQYQLVPDPEHLTCLIDLLGRAGSLD 490

Query: 481 QAFDLLKSTAKEAGPDAFRAFIRACRTHGDLRLAEWAMEFASEPNEQVNYSLVSNMYASE 540
           QAFDLLKS  KEAGPDA  AFIRACRTHG+L+LA+WAMEF SEP+E VNYSLVSNMYASE
Sbjct: 491 QAFDLLKSMRKEAGPDALTAFIRACRTHGNLKLAKWAMEFISEPDEPVNYSLVSNMYASE 550

Query: 541 GRWSDVARMRKLMKDSCERKAPGLSWVEIAGITICYNHLFVSGDRSHPQSSDLYTMLGLL 600
           GRWSDVARM KL+ D CE+K PGLSWVEIAG    YNHLF SGDRSHPQSSDLY MLGLL
Sbjct: 551 GRWSDVARMHKLINDRCEQKTPGLSWVEIAG----YNHLFKSGDRSHPQSSDLYAMLGLL 610

Query: 601 LNTMKKDDNSAA 613
           LNTMK+D  S A
Sbjct: 611 LNTMKEDYKSTA 618

BLAST of Cla97C06G112990 vs. NCBI nr
Match: KAG6575187.1 (Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1044.6 bits (2700), Expect = 3.3e-301
Identity = 512/620 (82.58%), Postives = 555/620 (89.52%), Query Frame = 0

Query: 1   MLIWPSIHFGRSRLVHSFSVNALKAAAPVNPIPRGTQSHSLIIKLGLANELSVQNKLLKI 60
           MLIWPS HFG  RLVHSFS N LKAAA +N IPRGT+ HSL+IKLGLANELSVQNKLLKI
Sbjct: 1   MLIWPSTHFGCCRLVHSFSFNVLKAAADLNSIPRGTKLHSLVIKLGLANELSVQNKLLKI 60

Query: 61  YVKCRDLESAGNVFDEMSRRNVVSWNTVICGLVDCGYGADFKLRQHSIFLYFKKMLMGMV 120
           YVKCRDL  A N+FDEM RRNVVSWNTVICG+V+CGYG +FK+R+ SI   FK MLM MV
Sbjct: 61  YVKCRDLGRARNLFDEMRRRNVVSWNTVICGVVNCGYGGEFKMRERSILSCFKNMLMDMV 120

Query: 121 RPDGVTFNGLFRSCVVLNDVESGRQLHGFVMKIGFDLDCFVGSAVVDFYGKCGLYEDARL 180
            PDGVTFNGLFRSC V+NDV SG+QLHGFV+KIGFDLDCFVGSAVVDFY KCGLYEDARL
Sbjct: 121 DPDGVTFNGLFRSCDVMNDVGSGKQLHGFVIKIGFDLDCFVGSAVVDFYAKCGLYEDARL 180

Query: 181 AFSCILYRDLVLWNVMLYCYVFNCLGREAIEVFCLMQLEGFKGDDFTFSSLLSSCKYKGS 240
           AF+ +LY+DLVLWNVMLYCYVFNCL +EAI++F LMQLEGF GDDFTFSSLLSSCKYKGS
Sbjct: 181 AFTSVLYKDLVLWNVMLYCYVFNCLAKEAIDIFLLMQLEGFTGDDFTFSSLLSSCKYKGS 240

Query: 241 GELGKQLHGLLIKQSFDLDILVASSLVNVYAKNDNLYDARKVFDEMPSRNSVSWTTMIVG 300
           GELGKQLH  LIK SFDLDILVASSLVN+YAKN++LYDARKVFDEMP RNSVSWTTMIVG
Sbjct: 241 GELGKQLHVHLIKHSFDLDILVASSLVNMYAKNNHLYDARKVFDEMPIRNSVSWTTMIVG 300

Query: 301 YGQQEDGKEAVKLFRRMFGEDYCPDELTFASVLSSCGFTCGACELMQVHSCLIKLGLEAF 360
           YGQQE GKEAVKL RRMF EDY PDELTFASVLSSCGFT GA EL+QVHSCLIKLG EAF
Sbjct: 301 YGQQEHGKEAVKLLRRMFEEDYYPDELTFASVLSSCGFTSGASELIQVHSCLIKLGFEAF 360

Query: 361 LSINNGLINAYSKCGIISTALQCFRLIAEPDLVTWTSIICGLALCGLEKDAVELFDKMLS 420
           LS+NNGLINAYSKCG IS ALQCFRLIAEPDLV+WTSIICGLA CG+EKDAVELFDKMLS
Sbjct: 361 LSVNNGLINAYSKCGAISPALQCFRLIAEPDLVSWTSIICGLAFCGVEKDAVELFDKMLS 420

Query: 421 YGIKPDKIAFLGVLSACSHGGFVSMGLHYFNLMTIQYQIVPNSEHLTCLIDLLSRAGSLD 480
            GI+PDKIAFLGVLSAC+HGGFV+MGLHYFNLMT +YQIVP+SEHLTCLIDL+ RAGSLD
Sbjct: 421 QGIRPDKIAFLGVLSACNHGGFVNMGLHYFNLMTNEYQIVPDSEHLTCLIDLIGRAGSLD 480

Query: 481 QAFDLLKSTAKEAGPDAFRAFIRACRTHGDLRLAEWAMEFASEPNEQVNYSLVSNMYASE 540
           +AF LLKS ++EAGPDAFR+FIRACRTHG LRLA+WAMEFAS+P + VN SL+SNMYASE
Sbjct: 481 EAFKLLKSVSEEAGPDAFRSFIRACRTHGRLRLAKWAMEFASDPCKPVNSSLMSNMYASE 540

Query: 541 GRWSDVARMRKLMKDSCERKAPGLSWVEIAGITICYNHLFVSGDRSHPQSSDLYTMLGLL 600
           GRWSDVARMRKL+KDSCE K PG SW+EIAG    YNHLFVS DRSHPQSSDLY MLGLL
Sbjct: 541 GRWSDVARMRKLLKDSCEPKVPGFSWIEIAG----YNHLFVSSDRSHPQSSDLYEMLGLL 600

Query: 601 LNTMKKDDNSAALWVDIVPD 621
           LNTMKKD  S A  +DI P+
Sbjct: 601 LNTMKKDYKSIASNIDIEPE 616

BLAST of Cla97C06G112990 vs. NCBI nr
Match: KAG7013750.1 (Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1043.1 bits (2696), Expect = 9.5e-301
Identity = 514/620 (82.90%), Postives = 553/620 (89.19%), Query Frame = 0

Query: 1   MLIWPSIHFGRSRLVHSFSVNALKAAAPVNPIPRGTQSHSLIIKLGLANELSVQNKLLKI 60
           MLIWPS HFG  RLVHSFS N LKAAA VN IPRGTQ HSL+IKLGLANEL VQNKLLKI
Sbjct: 1   MLIWPSTHFGCCRLVHSFSFNVLKAAADVNSIPRGTQLHSLVIKLGLANELFVQNKLLKI 60

Query: 61  YVKCRDLESAGNVFDEMSRRNVVSWNTVICGLVDCGYGADFKLRQHSIFLYFKKMLMGMV 120
           YVKCRDL  A N+FDEM RRNVVSWNTVICG+VDCGYG +FK+R+ SI   FK MLM MV
Sbjct: 61  YVKCRDLGRARNLFDEMRRRNVVSWNTVICGVVDCGYGDEFKMRERSILSCFKNMLMDMV 120

Query: 121 RPDGVTFNGLFRSCVVLNDVESGRQLHGFVMKIGFDLDCFVGSAVVDFYGKCGLYEDARL 180
            PDGVTFNGLFRSC V+NDV SG+QLHGFV+KIGFDLDCFVGSAVVDFY KCGLYEDARL
Sbjct: 121 DPDGVTFNGLFRSCDVMNDVGSGKQLHGFVIKIGFDLDCFVGSAVVDFYAKCGLYEDARL 180

Query: 181 AFSCILYRDLVLWNVMLYCYVFNCLGREAIEVFCLMQLEGFKGDDFTFSSLLSSCKYKGS 240
           AF+ +LY+DLVLWNVMLYCYVFN L +EAI++F LMQLEGF GDDFTFSSLLSSCKYKGS
Sbjct: 181 AFTSVLYKDLVLWNVMLYCYVFNFLAKEAIDIFLLMQLEGFTGDDFTFSSLLSSCKYKGS 240

Query: 241 GELGKQLHGLLIKQSFDLDILVASSLVNVYAKNDNLYDARKVFDEMPSRNSVSWTTMIVG 300
           GELGKQLH  LIK SFDLDILVASSLVN+YAKN++LYDARKVFDEMP RNSVSWTTMIVG
Sbjct: 241 GELGKQLHVHLIKHSFDLDILVASSLVNMYAKNNHLYDARKVFDEMPIRNSVSWTTMIVG 300

Query: 301 YGQQEDGKEAVKLFRRMFGEDYCPDELTFASVLSSCGFTCGACELMQVHSCLIKLGLEAF 360
           YGQQE GKEAVKL RRMF EDY PDELTFASVLSSCGFT GA EL+QVHSCLIKLG EAF
Sbjct: 301 YGQQEHGKEAVKLLRRMFEEDYYPDELTFASVLSSCGFTSGASELIQVHSCLIKLGFEAF 360

Query: 361 LSINNGLINAYSKCGIISTALQCFRLIAEPDLVTWTSIICGLALCGLEKDAVELFDKMLS 420
           LS+NNGLINAYSKCG IS ALQCFRLIAEPDLV+WTSIICGLA CG+EKDAVELFDKMLS
Sbjct: 361 LSVNNGLINAYSKCGAISPALQCFRLIAEPDLVSWTSIICGLAFCGVEKDAVELFDKMLS 420

Query: 421 YGIKPDKIAFLGVLSACSHGGFVSMGLHYFNLMTIQYQIVPNSEHLTCLIDLLSRAGSLD 480
            GI+PDKIAFLGVLSACSHGGFV+MGLHYFNLMT +YQIVP+SEHLTCLIDL+ RAGSLD
Sbjct: 421 QGIRPDKIAFLGVLSACSHGGFVNMGLHYFNLMTNEYQIVPDSEHLTCLIDLIGRAGSLD 480

Query: 481 QAFDLLKSTAKEAGPDAFRAFIRACRTHGDLRLAEWAMEFASEPNEQVNYSLVSNMYASE 540
           +AF LLKS ++EAGPDAFR+FIRACRTHG LRLA+WAMEFAS+P + VN SL+SNMYASE
Sbjct: 481 EAFKLLKSVSEEAGPDAFRSFIRACRTHGRLRLAKWAMEFASDPCKPVNSSLMSNMYASE 540

Query: 541 GRWSDVARMRKLMKDSCERKAPGLSWVEIAGITICYNHLFVSGDRSHPQSSDLYTMLGLL 600
           GRWSDVARMRKL+KDSCE K PG SW+EIAG    YNHLFVS DRSHPQSSDLY MLGLL
Sbjct: 541 GRWSDVARMRKLLKDSCEPKVPGFSWIEIAG----YNHLFVSSDRSHPQSSDLYEMLGLL 600

Query: 601 LNTMKKDDNSAALWVDIVPD 621
           LNTMKKD  S A  +DI P+
Sbjct: 601 LNTMKKDYKSIASNIDIEPE 616

BLAST of Cla97C06G112990 vs. NCBI nr
Match: XP_022958961.1 (pentatricopeptide repeat-containing protein At2g46050, mitochondrial isoform X1 [Cucurbita moschata] >XP_022958962.1 pentatricopeptide repeat-containing protein At2g46050, mitochondrial isoform X1 [Cucurbita moschata] >XP_022958963.1 pentatricopeptide repeat-containing protein At2g46050, mitochondrial isoform X1 [Cucurbita moschata])

HSP 1 Score: 1040.0 bits (2688), Expect = 8.0e-300
Identity = 511/620 (82.42%), Postives = 552/620 (89.03%), Query Frame = 0

Query: 1   MLIWPSIHFGRSRLVHSFSVNALKAAAPVNPIPRGTQSHSLIIKLGLANELSVQNKLLKI 60
           MLIWPS HFG  RLVHSFS N LKAAA +N IPRGT+ HSL+IKLGLANELSVQNKLLKI
Sbjct: 1   MLIWPSTHFGCCRLVHSFSFNVLKAAADLNSIPRGTKLHSLVIKLGLANELSVQNKLLKI 60

Query: 61  YVKCRDLESAGNVFDEMSRRNVVSWNTVICGLVDCGYGADFKLRQHSIFLYFKKMLMGMV 120
           YVKCRDL  A N+FDEM RRNVVSWNTVICG+V+CGYG +FK+R+ SI   FK MLM MV
Sbjct: 61  YVKCRDLGRARNLFDEMRRRNVVSWNTVICGVVNCGYGGEFKMRERSILSCFKNMLMDMV 120

Query: 121 RPDGVTFNGLFRSCVVLNDVESGRQLHGFVMKIGFDLDCFVGSAVVDFYGKCGLYEDARL 180
            PDGVTFNGLFRSC V+NDV SG+QLHGFV+KIGFDLDCFVGSAVVDFY KCGLYEDARL
Sbjct: 121 DPDGVTFNGLFRSCDVMNDVGSGKQLHGFVIKIGFDLDCFVGSAVVDFYAKCGLYEDARL 180

Query: 181 AFSCILYRDLVLWNVMLYCYVFNCLGREAIEVFCLMQLEGFKGDDFTFSSLLSSCKYKGS 240
           AFS +LY+DLVLWNVMLYCYVFNCL +EAIE+F LMQLEGF GDDFTFSSLLSSCKYKGS
Sbjct: 181 AFSSVLYKDLVLWNVMLYCYVFNCLAKEAIEIFFLMQLEGFTGDDFTFSSLLSSCKYKGS 240

Query: 241 GELGKQLHGLLIKQSFDLDILVASSLVNVYAKNDNLYDARKVFDEMPSRNSVSWTTMIVG 300
           GELGKQLH  LIK SFDLDILVASSLVN+YAKN++LYDARK FDEMP RNSVSWTTMIVG
Sbjct: 241 GELGKQLHVHLIKHSFDLDILVASSLVNMYAKNNHLYDARKAFDEMPIRNSVSWTTMIVG 300

Query: 301 YGQQEDGKEAVKLFRRMFGEDYCPDELTFASVLSSCGFTCGACELMQVHSCLIKLGLEAF 360
           YGQQE GKEAVKL RRMF EDY PDELTFASVLSSCGFT GA EL+QVHSCLIKLG EAF
Sbjct: 301 YGQQEHGKEAVKLLRRMFEEDYYPDELTFASVLSSCGFTSGASELIQVHSCLIKLGFEAF 360

Query: 361 LSINNGLINAYSKCGIISTALQCFRLIAEPDLVTWTSIICGLALCGLEKDAVELFDKMLS 420
           LS+NNGLINAYSKCG IS AL+CFRLIAEPDLV+WTSIICG A CGLEK AVELFDKMLS
Sbjct: 361 LSVNNGLINAYSKCGTISPALRCFRLIAEPDLVSWTSIICGFAFCGLEKAAVELFDKMLS 420

Query: 421 YGIKPDKIAFLGVLSACSHGGFVSMGLHYFNLMTIQYQIVPNSEHLTCLIDLLSRAGSLD 480
            GI+PDKIAFLGVLSACSHGGFV+MGLHYFNLMT +YQIVP+SEHLTCLIDL+ RAGSLD
Sbjct: 421 QGIRPDKIAFLGVLSACSHGGFVNMGLHYFNLMTNEYQIVPDSEHLTCLIDLIGRAGSLD 480

Query: 481 QAFDLLKSTAKEAGPDAFRAFIRACRTHGDLRLAEWAMEFASEPNEQVNYSLVSNMYASE 540
           +AF LLKS ++EAGPDAFR+FIRACRTHG LRLA+WAMEFAS+P + VN SL+SNMYASE
Sbjct: 481 EAFKLLKSVSEEAGPDAFRSFIRACRTHGRLRLAKWAMEFASDPCKPVNSSLMSNMYASE 540

Query: 541 GRWSDVARMRKLMKDSCERKAPGLSWVEIAGITICYNHLFVSGDRSHPQSSDLYTMLGLL 600
           GRWSDVARMRKL+KDSCE K PG SW+EIAG    YNHLFVS DRSHPQSSDLY MLGLL
Sbjct: 541 GRWSDVARMRKLLKDSCEPKVPGFSWIEIAG----YNHLFVSSDRSHPQSSDLYEMLGLL 600

Query: 601 LNTMKKDDNSAALWVDIVPD 621
           LNT+KKD  S A  +DI P+
Sbjct: 601 LNTVKKDYKSTASNIDIEPE 616

BLAST of Cla97C06G112990 vs. ExPASy Swiss-Prot
Match: O82363 (Pentatricopeptide repeat-containing protein At2g46050, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-E39 PE=3 SV=1)

HSP 1 Score: 449.9 bits (1156), Expect = 4.7e-125
Identity = 245/549 (44.63%), Postives = 336/549 (61.20%), Query Frame = 0

Query: 24  KAAAPVNPIPRGTQSHSLIIKLGLANELSVQNKLLKIYVKCRDLESAGNVFDEMSRRNVV 83
           K +A ++ +    Q H  ++K G+ N L +QNKLL+ Y K R+ + A  +FDEM  RN+V
Sbjct: 44  KLSASLDHLSDVKQEHGFMVKQGIYNSLFLQNKLLQAYTKIREFDDADKLFDEMPLRNIV 103

Query: 84  SWNTVICGLVDCGYGADFKLRQHSIFLYFKKMLMGMVRPDGVTFNGLFRSCVVLNDVESG 143
           +WN +I G++      D   R H  F Y  ++L   V  D V+F GL R C    ++++G
Sbjct: 104 TWNILIHGVIQ--RDGDTNHRAHLGFCYLSRILFTDVSLDHVSFMGLIRLCTDSTNMKAG 163

Query: 144 RQLHGFVMKIGFDLDCFVGSAVVDFYGKCGLYEDARLAFSCILYRDLVLWNVMLYCYVFN 203
            QLH  ++K G +  CF  +++V FYGKCGL  +AR  F  +L RDLVLWN ++  YV N
Sbjct: 164 IQLHCLMVKQGLESSCFPSTSLVHFYGKCGLIVEARRVFEAVLDRDLVLWNALVSSYVLN 223

Query: 204 CLGREAIEVFCLM--QLEGFKGDDFTFSSLLSSCKYKGSGELGKQLHGLLIKQSFDLDIL 263
            +  EA  +  LM      F+GD FTFSSLLS+C+     E GKQ+H +L K S+  DI 
Sbjct: 224 GMIDEAFGLLKLMGSDKNRFRGDYFTFSSLLSACRI----EQGKQIHAILFKVSYQFDIP 283

Query: 264 VASSLVNVYAKNDNLYDARKVFDEMPSRNSVSWTTMIVGYGQQEDGKEAVKLFRRMFGED 323
           VA++L+N+YAK+++L DAR+ F+ M  RN VSW  MIVG+ Q  +G+EA++LF +M  E+
Sbjct: 284 VATALLNMYAKSNHLSDARECFESMVVRNVVSWNAMIVGFAQNGEGREAMRLFGQMLLEN 343

Query: 324 YCPDELTFASVLSSCGFTCGACELMQVHSCLIKLGLEAFLSINNGLINAYSKCGIISTAL 383
             PDELTFASVLSSC       E+ QV + + K G   FLS+ N LI++YS+ G +S AL
Sbjct: 344 LQPDELTFASVLSSCAKFSAIWEIKQVQAMVTKKGSADFLSVANSLISSYSRNGNLSEAL 403

Query: 384 QCFRLIAEPDLVTWTSIICGLALCGLEKDAVELFDKMLSYGIKPDKIAFLGVLSACSHGG 443
            CF  I EPDLV+WTS+I  LA  G  ++++++F+ ML   ++PDKI FL VLSACSHGG
Sbjct: 404 LCFHSIREPDLVSWTSVIGALASHGFAEESLQMFESMLQ-KLQPDKITFLEVLSACSHGG 463

Query: 444 FVSMGLHYFNLMTIQYQIVPNSEHLTCLIDLLSRAGSLDQAFDLLKSTAKEAGPDAFRAF 503
            V  GL  F  MT  Y+I    EH TCLIDLL RAG +D+A D+L S   E    A  AF
Sbjct: 464 LVQEGLRCFKRMTEFYKIEAEDEHYTCLIDLLGRAGFIDEASDVLNSMPTEPSTHALAAF 523

Query: 504 IRACRTHGDLRLAEWAME--FASEPNEQVNYSLVSNMYASEGRWSDVARMRKLMKDSC-E 563
              C  H      +W  +     EP + VNYS++SN Y SEG W+  A +RK  + +C  
Sbjct: 524 TGGCNIHEKRESMKWGAKKLLEIEPTKPVNYSILSNAYVSEGHWNQAALLRKRERRNCYN 583

Query: 564 RKAPGLSWV 568
            K PG SW+
Sbjct: 584 PKTPGCSWL 585

BLAST of Cla97C06G112990 vs. ExPASy Swiss-Prot
Match: Q9SIT7 (Pentatricopeptide repeat-containing protein At2g13600 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E76 PE=3 SV=1)

HSP 1 Score: 368.6 bits (945), Expect = 1.4e-100
Identity = 208/635 (32.76%), Postives = 331/635 (52.13%), Query Frame = 0

Query: 39  HSLIIKLGLANELSVQNKLLKIYVKCRDLESAGNVFDEMSRRNVVSWNTVICGLVDCGY- 98
           H+ +IK G +NE+ +QN+L+  Y KC  LE    VFD+M +RN+ +WN+V+ GL   G+ 
Sbjct: 43  HASVIKSGFSNEIFIQNRLIDAYSKCGSLEDGRQVFDKMPQRNIYTWNSVVTGLTKLGFL 102

Query: 99  -GADFKLR------------------QH----SIFLYFKKMLMGMVRPDGVTFNGLFRSC 158
             AD   R                  QH        YF  M       +  +F  +  +C
Sbjct: 103 DEADSLFRSMPERDQCTWNSMVSGFAQHDRCEEALCYFAMMHKEGFVLNEYSFASVLSAC 162

Query: 159 VVLNDVESGRQLHGFVMKIGFDLDCFVGSAVVDFYGKCGLYEDARLAFSCILYRDLVLWN 218
             LND+  G Q+H  + K  F  D ++GSA+VD Y KCG   DA+  F  +  R++V WN
Sbjct: 163 SGLNDMNKGVQVHSLIAKSPFLSDVYIGSALVDMYSKCGNVNDAQRVFDEMGDRNVVSWN 222

Query: 219 VMLYCYVFNCLGREAIEVFCLMQLEGFKGDDFTFSSLLSSCKYKGSGELGKQLHGLLIK- 278
            ++ C+  N    EA++VF +M     + D+ T +S++S+C    + ++G+++HG ++K 
Sbjct: 223 SLITCFEQNGPAVEALDVFQMMLESRVEPDEVTLASVISACASLSAIKVGQEVHGRVVKN 282

Query: 279 QSFDLDILVASSLVNVYAKNDNLYDARKVFDEMP-------------------------- 338
                DI+++++ V++YAK   + +AR +FD MP                          
Sbjct: 283 DKLRNDIILSNAFVDMYAKCSRIKEARFIFDSMPIRNVIAETSMISGYAMAASTKAARLM 342

Query: 339 -----SRNSVSWTTMIVGYGQQEDGKEAVKLFRRMFGEDYCPDELTFASVLSSCGFTCGA 398
                 RN VSW  +I GY Q  + +EA+ LF  +  E  CP   +FA++L +C      
Sbjct: 343 FTKMAERNVVSWNALIAGYTQNGENEEALSLFCLLKRESVCPTHYSFANILKACADLAEL 402

Query: 399 CELMQVHSCLIKLGL------EAFLSINNGLINAYSKCGIISTALQCFRLIAEPDLVTWT 458
              MQ H  ++K G       E  + + N LI+ Y KCG +      FR + E D V+W 
Sbjct: 403 HLGMQAHVHVLKHGFKFQSGEEDDIFVGNSLIDMYVKCGCVEEGYLVFRKMMERDCVSWN 462

Query: 459 SIICGLALCGLEKDAVELFDKMLSYGIKPDKIAFLGVLSACSHGGFVSMGLHYFNLMTIQ 518
           ++I G A  G   +A+ELF +ML  G KPD I  +GVLSAC H GFV  G HYF+ MT  
Sbjct: 463 AMIIGFAQNGYGNEALELFREMLESGEKPDHITMIGVLSACGHAGFVEEGRHYFSSMTRD 522

Query: 519 YQIVPNSEHLTCLIDLLSRAGSLDQAFDLLKSTAKEAGPDAFRAFIRACRTHGDLRLAEW 578
           + + P  +H TC++DLL RAG L++A  +++    +     + + + AC+ H ++ L ++
Sbjct: 523 FGVAPLRDHYTCMVDLLGRAGFLEEAKSMIEEMPMQPDSVIWGSLLAACKVHRNITLGKY 582

Query: 579 AME--FASEPNEQVNYSLVSNMYASEGRWSDVARMRKLMKDSCERKAPGLSWVEIAGITI 610
             E     EP+    Y L+SNMYA  G+W DV  +RK M+     K PG SW++I G   
Sbjct: 583 VAEKLLEVEPSNSGPYVLLSNMYAELGKWEDVMNVRKSMRKEGVTKQPGCSWIKIQG--- 642

BLAST of Cla97C06G112990 vs. ExPASy Swiss-Prot
Match: Q9LUJ2 (Pentatricopeptide repeat-containing protein At3g22690 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H56 PE=3 SV=1)

HSP 1 Score: 344.7 bits (883), Expect = 2.1e-93
Identity = 206/610 (33.77%), Postives = 320/610 (52.46%), Query Frame = 0

Query: 23  LKAAAPVNPIPRGTQSHSLIIKLGLANELSVQNKLLKIYVKCRDLESAGNVFDEMSRRNV 82
           L A A       G Q H LI+K+G A +L VQN L+  Y +C +L+SA  VFDEMS RNV
Sbjct: 141 LSACAKSRAKGNGIQIHGLIVKMGYAKDLFVQNSLVHFYAECGELDSARKVFDEMSERNV 200

Query: 83  VSWNTVICGLVDCGYGADFKLRQHSIFLYFKKMLMGMVRPDGVTFNGLFRSCVVLNDVES 142
           VSW ++ICG     +  D      ++ L+F+ +    V P+ VT   +  +C  L D+E+
Sbjct: 201 VSWTSMICGYARRDFAKD------AVDLFFRMVRDEEVTPNSVTMVCVISACAKLEDLET 260

Query: 143 GRQLHGFVMKIGFDLDCFVGSAVVDFYGKCGLYEDARLAFSCILYRDLVLWNVMLYCYVF 202
           G +++ F+   G +++  + SA+VD Y KC   + A+  F      +L L N M   YV 
Sbjct: 261 GEKVYAFIRNSGIEVNDLMVSALVDMYMKCNAIDVAKRLFDEYGASNLDLCNAMASNYVR 320

Query: 203 NCLGREAIEVFCLMQLEGFKGDDFTFSSLLSSCKYKGSGELGKQLHGLLIKQSFDLDILV 262
             L REA+ VF LM   G + D  +  S +SSC    +   GK  HG +++  F+    +
Sbjct: 321 QGLTREALGVFNLMMDSGVRPDRISMLSAISSCSQLRNILWGKSCHGYVLRNGFESWDNI 380

Query: 263 ASSLVNVYAK-------------------------------NDNLYDARKVFDEMPSRNS 322
            ++L+++Y K                               N  +  A + F+ MP +N 
Sbjct: 381 CNALIDMYMKCHRQDTAFRIFDRMSNKTVVTWNSIVAGYVENGEVDAAWETFETMPEKNI 440

Query: 323 VSWTTMIVGYGQQEDGKEAVKLFRRMFG-EDYCPDELTFASVLSSCGFTCGACELMQ-VH 382
           VSW T+I G  Q    +EA+++F  M   E    D +T  S+ S+CG   GA +L + ++
Sbjct: 441 VSWNTIISGLVQGSLFEEAIEVFCSMQSQEGVNADGVTMMSIASACGH-LGALDLAKWIY 500

Query: 383 SCLIKLGLEAFLSINNGLINAYSKCGIISTALQCFRLIAEPDLVTWTSIICGLALCGLEK 442
             + K G++  + +   L++ +S+CG   +A+  F  +   D+  WT+ I  +A+ G  +
Sbjct: 501 YYIEKNGIQLDVRLGTTLVDMFSRCGDPESAMSIFNSLTNRDVSAWTAAIGAMAMAGNAE 560

Query: 443 DAVELFDKMLSYGIKPDKIAFLGVLSACSHGGFVSMGLHYFNLMTIQYQIVPNSEHLTCL 502
            A+ELFD M+  G+KPD +AF+G L+ACSHGG V  G   F  M   + + P   H  C+
Sbjct: 561 RAIELFDDMIEQGLKPDGVAFVGALTACSHGGLVQQGKEIFYSMLKLHGVSPEDVHYGCM 620

Query: 503 IDLLSRAGSLDQAFDLLKSTAKEAGPDAFRAFIRACRTHGDLRLAEWAMEFAS--EPNEQ 562
           +DLL RAG L++A  L++    E     + + + ACR  G++ +A +A E      P   
Sbjct: 621 VDLLGRAGLLEEAVQLIEDMPMEPNDVIWNSLLAACRVQGNVEMAAYAAEKIQVLAPERT 680

Query: 563 VNYSLVSNMYASEGRWSDVARMRKLMKDSCERKAPGLSWVEIAGITICYNHLFVSGDRSH 598
            +Y L+SN+YAS GRW+D+A++R  MK+   RK PG S ++I G T    H F SGD SH
Sbjct: 681 GSYVLLSNVYASAGRWNDMAKVRLSMKEKGLRKPPGTSSIQIRGKT----HEFTSGDESH 739

BLAST of Cla97C06G112990 vs. ExPASy Swiss-Prot
Match: Q9SVA5 (Pentatricopeptide repeat-containing protein At4g39530 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E52 PE=3 SV=1)

HSP 1 Score: 342.0 bits (876), Expect = 1.4e-92
Identity = 199/574 (34.67%), Postives = 317/574 (55.23%), Query Frame = 0

Query: 35  GTQSHSLIIKLGLANELSVQNKLLKIYVKCRDLESAGNVFDEMSRRNVVSWNTVICGLVD 94
           G Q H+ I++ GL  + S+ N L+  YVKC  + +A  +F+ M  +N++SW T++ G   
Sbjct: 268 GKQIHAHILRYGLEMDASLMNVLIDSYVKCGRVIAAHKLFNGMPNKNIISWTTLLSGY-- 327

Query: 95  CGYGADFKLRQHSIFLYFKKMLMGMVRPDGVTFNGLFRSCVVLNDVESGRQLHGFVMKIG 154
                   L + ++ L+      G+ +PD    + +  SC  L+ +  G Q+H + +K  
Sbjct: 328 ----KQNALHKEAMELFTSMSKFGL-KPDMYACSSILTSCASLHALGFGTQVHAYTIKAN 387

Query: 155 FDLDCFVGSAVVDFYGKCGLYEDARLAFSCILYRDLVLWNVMLYCYVFNCLG-----REA 214
              D +V ++++D Y KC    DAR  F      D+VL+N M+  Y  + LG      EA
Sbjct: 388 LGNDSYVTNSLIDMYAKCDCLTDARKVFDIFAAADVVLFNAMIEGY--SRLGTQWELHEA 447

Query: 215 IEVFCLMQLEGFKGDDFTFSSLLSSCKYKGSGELGKQLHGLLIKQSFDLDILVASSLVNV 274
           + +F  M+    +    TF SLL +     S  L KQ+HGL+ K   +LDI   S+L++V
Sbjct: 448 LNIFRDMRFRLIRPSLLTFVSLLRASASLTSLGLSKQIHGLMFKYGLNLDIFAGSALIDV 507

Query: 275 YAKNDNLYDARKVFDEMPSRNSVSWTTMIVGYGQQEDGKEAVKLFRRMFGEDYCPDELTF 334
           Y+    L D+R VFDEM  ++ V W +M  GY QQ + +EA+ LF  +      PDE TF
Sbjct: 508 YSNCYCLKDSRLVFDEMKVKDLVIWNSMFAGYVQQSENEEALNLFLELQLSRERPDEFTF 567

Query: 335 ASVLSSCGFTCGACELMQVHSCLIKLGLEAFLSINNGLINAYSKCGIISTALQCFRLIAE 394
           A+++++ G         + H  L+K GLE    I N L++ Y+KCG    A + F   A 
Sbjct: 568 ANMVTAAGNLASVQLGQEFHCQLLKRGLECNPYITNALLDMYAKCGSPEDAHKAFDSAAS 627

Query: 395 PDLVTWTSIICGLALCGLEKDAVELFDKMLSYGIKPDKIAFLGVLSACSHGGFVSMGLHY 454
            D+V W S+I   A  G  K A+++ +KM+S GI+P+ I F+GVLSACSH G V  GL  
Sbjct: 628 RDVVCWNSVISSYANHGEGKKALQMLEKMMSEGIEPNYITFVGVLSACSHAGLVEDGLKQ 687

Query: 455 FNLMTIQYQIVPNSEHLTCLIDLLSRAGSLDQAFDLLKSTAKEAGPDAFRAFIRACRTHG 514
           F LM +++ I P +EH  C++ LL RAG L++A +L++    +     +R+ +  C   G
Sbjct: 688 FELM-LRFGIEPETEHYVCMVSLLGRAGRLNKARELIEKMPTKPAAIVWRSLLSGCAKAG 747

Query: 515 DLRLAEWAMEFA--SEPNEQVNYSLVSNMYASEGRWSDVARMRKLMKDSCERKAPGLSWV 574
           ++ LAE A E A  S+P +  +++++SN+YAS+G W++  ++R+ MK     K PG SW+
Sbjct: 748 NVELAEHAAEMAILSDPKDSGSFTMLSNIYASKGMWTEAKKVRERMKVEGVVKEPGRSWI 807

Query: 575 EIAGITICYNHLFVSGDRSHPQSSDLYTMLGLLL 602
            I        H+F+S D+SH +++ +Y +L  LL
Sbjct: 808 GINKEV----HIFLSKDKSHCKANQIYEVLDDLL 827

BLAST of Cla97C06G112990 vs. ExPASy Swiss-Prot
Match: Q9SMZ2 (Pentatricopeptide repeat-containing protein At4g33170 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H53 PE=3 SV=1)

HSP 1 Score: 341.3 bits (874), Expect = 2.3e-92
Identity = 196/590 (33.22%), Postives = 318/590 (53.90%), Query Frame = 0

Query: 23  LKAAAPVNPIPRGTQSHSLIIKLGLANELSVQNKLLKIYVKCRDLESAGNVFDEMSRRNV 82
           L  A  V+ +  G Q H + +KLGL   L+V N L+ +Y K R    A  VFD MS R++
Sbjct: 322 LATAVKVDSLALGQQVHCMALKLGLDLMLTVSNSLINMYCKLRKFGFARTVFDNMSERDL 381

Query: 83  VSWNTVICGLVDCGYGADFKLRQHSIFLYFKKMLMGMVRPDGVTFNGLFRSCVVLND-VE 142
           +SWN+VI G+   G      L   ++ L+ + +  G+ +PD  T   + ++   L + + 
Sbjct: 382 ISWNSVIAGIAQNG------LEVEAVCLFMQLLRCGL-KPDQYTMTSVLKAASSLPEGLS 441

Query: 143 SGRQLHGFVMKIGFDLDCFVGSAVVDFYGKCGLYEDARLAFSCILYRDLVLWNVMLYCYV 202
             +Q+H   +KI    D FV +A++D Y +    ++A + F    + DLV WN M+  Y 
Sbjct: 442 LSKQVHVHAIKINNVSDSFVSTALIDAYSRNRCMKEAEILFERHNF-DLVAWNAMMAGYT 501

Query: 203 FNCLGREAIEVFCLMQLEGFKGDDFTFSSLLSSCKYKGSGELGKQLHGLLIKQSFDLDIL 262
            +  G + +++F LM  +G + DDFT +++  +C +  +   GKQ+H   IK  +DLD+ 
Sbjct: 502 QSHDGHKTLKLFALMHKQGERSDDFTLATVFKTCGFLFAINQGKQVHAYAIKSGYDLDLW 561

Query: 263 VASSLVNVYAKNDNLYDARKVFDEMPSRNSVSWTTMIVGYGQQEDGKEAVKLFRRMFGED 322
           V+S ++++Y K  ++  A+  FD +P  + V+WTTMI G  +  + + A  +F +M    
Sbjct: 562 VSSGILDMYVKCGDMSAAQFAFDSIPVPDDVAWTTMISGCIENGEEERAFHVFSQMRLMG 621

Query: 323 YCPDELTFASVLSSCGFTCGACELMQVHSCLIKLGLEAFLSINNGLINAYSKCGIISTAL 382
             PDE T A++  +        +  Q+H+  +KL       +   L++ Y+KCG I  A 
Sbjct: 622 VLPDEFTIATLAKASSCLTALEQGRQIHANALKLNCTNDPFVGTSLVDMYAKCGSIDDAY 681

Query: 383 QCFRLIAEPDLVTWTSIICGLALCGLEKDAVELFDKMLSYGIKPDKIAFLGVLSACSHGG 442
             F+ I   ++  W +++ GLA  G  K+ ++LF +M S GIKPDK+ F+GVLSACSH G
Sbjct: 682 CLFKRIEMMNITAWNAMLVGLAQHGEGKETLQLFKQMKSLGIKPDKVTFIGVLSACSHSG 741

Query: 443 FVSMGLHYFNLMTIQYQIVPNSEHLTCLIDLLSRAGSLDQAFDLLKSTAKEAGPDAFRAF 502
            VS    +   M   Y I P  EH +CL D L RAG + QA +L++S + EA    +R  
Sbjct: 742 LVSEAYKHMRSMHGDYGIKPEIEHYSCLADALGRAGLVKQAENLIESMSMEASASMYRTL 801

Query: 503 IRACRTHGDL----RLAEWAMEFASEPNEQVNYSLVSNMYASEGRWSDVARMRKLMKDSC 562
           + ACR  GD     R+A   +E   EP +   Y L+SNMYA+  +W ++   R +MK   
Sbjct: 802 LAACRVQGDTETGKRVATKLLEL--EPLDSSAYVLLSNMYAAASKWDEMKLARTMMKGHK 861

Query: 563 ERKAPGLSWVEIAGITICYNHLFVSGDRSHPQSSDLYTMLGLLLNTMKKD 608
            +K PG SW+E+        H+FV  DRS+ Q+  +Y  +  ++  +K++
Sbjct: 862 VKKDPGFSWIEVKNKI----HIFVVDDRSNRQTELIYRKVKDMIRDIKQE 897

BLAST of Cla97C06G112990 vs. ExPASy TrEMBL
Match: A0A0A0K863 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G005130 PE=4 SV=1)

HSP 1 Score: 1098.6 bits (2840), Expect = 0.0e+00
Identity = 543/620 (87.58%), Postives = 568/620 (91.61%), Query Frame = 0

Query: 1   MLIWPSIHFGRSRLVHSFSVNALKAAAPVNPIPRGTQSHSLIIKLGLANELSVQNKLLKI 60
           MLIW S HFGRSRLVHSFS N LKAAAPVN IP  T  HSL++KLGL NELSVQNKLL++
Sbjct: 1   MLIWTSTHFGRSRLVHSFSFNVLKAAAPVNSIPHDTLLHSLVVKLGLVNELSVQNKLLRV 60

Query: 61  YVKCRDLESAGNVFDEMSRRNVVSWNTVICGLVDCGYGADFKLRQHSIFLYFKKMLMGMV 120
           YVKCRDL+SA N+FDEM+RRNVVSWNTVICGLVD GYG +FK+RQHSIFLYFKKMLMG+V
Sbjct: 61  YVKCRDLDSARNLFDEMARRNVVSWNTVICGLVDGGYGGEFKMRQHSIFLYFKKMLMGLV 120

Query: 121 RPDGVTFNGLFRSCVVLNDVESGRQLHGFVMKIGFDLDCFVGSAVVDFYGKCGLYEDARL 180
            PDG+TFNGLFRSCVVLNDVESGRQLH FVMKIGFDLDCFVGSAVVDFY KCGLYEDARL
Sbjct: 121 DPDGITFNGLFRSCVVLNDVESGRQLHSFVMKIGFDLDCFVGSAVVDFYAKCGLYEDARL 180

Query: 181 AFSCILYRDLVLWNVMLYCYVFNCLGREAIEVFCLMQLEGFKGDDFTFSSLLSSCKYKGS 240
           AFSCILYRDLVLWNVMLYC VFN L REAIEVF LMQLEGFKGDDFTFSSLLSSCKYKGS
Sbjct: 181 AFSCILYRDLVLWNVMLYCCVFNSLSREAIEVFRLMQLEGFKGDDFTFSSLLSSCKYKGS 240

Query: 241 GELGKQLHGLLIKQSFDLDILVASSLVNVYAKNDNLYDARKVFDEMPSRNSVSWTTMIVG 300
           GELGKQLH LLIKQSFDLDILVASSLVNVY KNDNLYDARKVFDEMP+RNSVSWTTMIVG
Sbjct: 241 GELGKQLHCLLIKQSFDLDILVASSLVNVYTKNDNLYDARKVFDEMPTRNSVSWTTMIVG 300

Query: 301 YGQQEDGKEAVKLFRRMFGEDYCPDELTFASVLSSCGFTCGACELMQVHSCLIKLGLEAF 360
           YGQ E GKEAVKLFRRMF +DYCPDELTFASVLSSCGFT GA ELMQVHSCLIKLG EAF
Sbjct: 301 YGQHEYGKEAVKLFRRMFRKDYCPDELTFASVLSSCGFTSGASELMQVHSCLIKLGFEAF 360

Query: 361 LSINNGLINAYSKCGIISTALQCFRLIAEPDLVTWTSIICGLALCGLEKDAVELFDKMLS 420
           LSINNGLI AYSKCGII+ ALQCFRLIAEPDLVTWTSIICGLALCGLEKDAV+LFDKMLS
Sbjct: 361 LSINNGLIYAYSKCGIIAAALQCFRLIAEPDLVTWTSIICGLALCGLEKDAVKLFDKMLS 420

Query: 421 YGIKPDKIAFLGVLSACSHGGFVSMGLHYFNLMTIQYQIVPNSEHLTCLIDLLSRAGSLD 480
           YGI+PDKIAFLGVLSACSHGGFVSMGLHYFNLMT QYQ+VP+SEHLTCLIDLL RAGSLD
Sbjct: 421 YGIRPDKIAFLGVLSACSHGGFVSMGLHYFNLMTNQYQLVPDSEHLTCLIDLLGRAGSLD 480

Query: 481 QAFDLLKSTAKEAGPDAFRAFIRACRTHGDLRLAEWAMEFASEPNEQVNYSLVSNMYASE 540
           QAFDLLKS  KEAGPDA RAFIRACRTHG+LRLA+ AMEFASEP+E VNYSLVSNMYASE
Sbjct: 481 QAFDLLKSMPKEAGPDALRAFIRACRTHGNLRLAKRAMEFASEPDEPVNYSLVSNMYASE 540

Query: 541 GRWSDVARMRKLMKDSCERKAPGLSWVEIAGITICYNHLFVSGDRSHPQSSDLYTMLGLL 600
           GRWSDVARMRKL+ D CE+K PGLSWVEIAG    YNHLF+SGDRSHPQS DLY MLGLL
Sbjct: 541 GRWSDVARMRKLINDRCEQKTPGLSWVEIAG----YNHLFISGDRSHPQSLDLYAMLGLL 600

Query: 601 LNTMKKDDNSAALWVDIVPD 621
           LNTMKKD    A  VDIVP+
Sbjct: 601 LNTMKKDYKFTASQVDIVPE 616

BLAST of Cla97C06G112990 vs. ExPASy TrEMBL
Match: A0A5D3BXR6 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold46G001210 PE=4 SV=1)

HSP 1 Score: 1076.6 bits (2783), Expect = 0.0e+00
Identity = 528/612 (86.27%), Postives = 558/612 (91.18%), Query Frame = 0

Query: 1   MLIWPSIHFGRSRLVHSFSVNALKAAAPVNPIPRGTQSHSLIIKLGLANELSVQNKLLKI 60
           MLIW S HFGRSRLVHSFS N LKAAAPVN IPR T  HS+++KLGLANELSVQNKLLK+
Sbjct: 11  MLIWTSTHFGRSRLVHSFSFNVLKAAAPVNSIPRDTLLHSVVVKLGLANELSVQNKLLKV 70

Query: 61  YVKCRDLESAGNVFDEMSRRNVVSWNTVICGLVDCGYGADFKLRQHSIFLYFKKMLMGMV 120
           YVKCRDL+SA ++FDEM RRN VSWNTVICGLVD GYG +FK RQ  IFLYFKKMLMG+V
Sbjct: 71  YVKCRDLDSARSLFDEMPRRNAVSWNTVICGLVDGGYGGEFKTRQRLIFLYFKKMLMGLV 130

Query: 121 RPDGVTFNGLFRSCVVLNDVESGRQLHGFVMKIGFDLDCFVGSAVVDFYGKCGLYEDARL 180
            PDG+TFNGLFRSCVVLNDVESGRQLH FVMKIGFDLDCFVGSA+VDFY KCGLYEDARL
Sbjct: 131 DPDGITFNGLFRSCVVLNDVESGRQLHSFVMKIGFDLDCFVGSALVDFYAKCGLYEDARL 190

Query: 181 AFSCILYRDLVLWNVMLYCYVFNCLGREAIEVFCLMQLEGFKGDDFTFSSLLSSCKYKGS 240
           AFSC LY+DLVLWNVMLYCYVFN L REAIE F LMQLEGFKGD+FTFSSLLSSCKYKGS
Sbjct: 191 AFSCTLYKDLVLWNVMLYCYVFNSLSREAIEGFRLMQLEGFKGDEFTFSSLLSSCKYKGS 250

Query: 241 GELGKQLHGLLIKQSFDLDILVASSLVNVYAKNDNLYDARKVFDEMPSRNSVSWTTMIVG 300
           GELGKQLHGLLIKQSFDLDILVASSL++VYAKNDNLYDARKVFDEMP+RNSVSWTTMIVG
Sbjct: 251 GELGKQLHGLLIKQSFDLDILVASSLIDVYAKNDNLYDARKVFDEMPTRNSVSWTTMIVG 310

Query: 301 YGQQEDGKEAVKLFRRMFGEDYCPDELTFASVLSSCGFTCGACELMQVHSCLIKLGLEAF 360
           YGQQE GKEAVKLFRRMFG+DYC DELTFASVLSSCGFT GA ELMQVHSCLIKLG EAF
Sbjct: 311 YGQQEYGKEAVKLFRRMFGKDYCLDELTFASVLSSCGFTSGASELMQVHSCLIKLGFEAF 370

Query: 361 LSINNGLINAYSKCGIISTALQCFRLIAEPDLVTWTSIICGLALCGLEKDAVELFDKMLS 420
           LSINNGLI AYSKCGI++ ALQCFRLIAEPDLVTWTSIICGLA CGLEKDAV+LFDKMLS
Sbjct: 371 LSINNGLIYAYSKCGIVAAALQCFRLIAEPDLVTWTSIICGLAFCGLEKDAVKLFDKMLS 430

Query: 421 YGIKPDKIAFLGVLSACSHGGFVSMGLHYFNLMTIQYQIVPNSEHLTCLIDLLSRAGSLD 480
           YGI+PDKIAFLGVLSACSHGGFVSMGLHYFNLMT QYQ+VP+ EHLTCLIDLL RAGSLD
Sbjct: 431 YGIRPDKIAFLGVLSACSHGGFVSMGLHYFNLMTNQYQLVPDPEHLTCLIDLLGRAGSLD 490

Query: 481 QAFDLLKSTAKEAGPDAFRAFIRACRTHGDLRLAEWAMEFASEPNEQVNYSLVSNMYASE 540
           QAFDLLKS  KEAGPDA  AFIRACRTHG+L+LA+WAMEF SEP+E VNYSLVSNMYASE
Sbjct: 491 QAFDLLKSMRKEAGPDALTAFIRACRTHGNLKLAKWAMEFISEPDEPVNYSLVSNMYASE 550

Query: 541 GRWSDVARMRKLMKDSCERKAPGLSWVEIAGITICYNHLFVSGDRSHPQSSDLYTMLGLL 600
           GRWSDVARM KL+ D CE+K PGLSWVEIAG    YNHLF SGDRSHPQSSDLY MLGLL
Sbjct: 551 GRWSDVARMHKLINDRCEQKTPGLSWVEIAG----YNHLFKSGDRSHPQSSDLYAMLGLL 610

Query: 601 LNTMKKDDNSAA 613
           LNTMK+D  S A
Sbjct: 611 LNTMKEDYKSTA 618

BLAST of Cla97C06G112990 vs. ExPASy TrEMBL
Match: A0A1S3C6T7 (pentatricopeptide repeat-containing protein At2g46050, mitochondrial isoform X1 OS=Cucumis melo OX=3656 GN=LOC103497699 PE=4 SV=1)

HSP 1 Score: 1076.6 bits (2783), Expect = 0.0e+00
Identity = 528/612 (86.27%), Postives = 558/612 (91.18%), Query Frame = 0

Query: 1   MLIWPSIHFGRSRLVHSFSVNALKAAAPVNPIPRGTQSHSLIIKLGLANELSVQNKLLKI 60
           MLIW S HFGRSRLVHSFS N LKAAAPVN IPR T  HS+++KLGLANELSVQNKLLK+
Sbjct: 11  MLIWTSTHFGRSRLVHSFSFNVLKAAAPVNSIPRDTLLHSVVVKLGLANELSVQNKLLKV 70

Query: 61  YVKCRDLESAGNVFDEMSRRNVVSWNTVICGLVDCGYGADFKLRQHSIFLYFKKMLMGMV 120
           YVKCRDL+SA ++FDEM RRN VSWNTVICGLVD GYG +FK RQ  IFLYFKKMLMG+V
Sbjct: 71  YVKCRDLDSARSLFDEMPRRNAVSWNTVICGLVDGGYGGEFKTRQRLIFLYFKKMLMGLV 130

Query: 121 RPDGVTFNGLFRSCVVLNDVESGRQLHGFVMKIGFDLDCFVGSAVVDFYGKCGLYEDARL 180
            PDG+TFNGLFRSCVVLNDVESGRQLH FVMKIGFDLDCFVGSA+VDFY KCGLYEDARL
Sbjct: 131 DPDGITFNGLFRSCVVLNDVESGRQLHSFVMKIGFDLDCFVGSALVDFYAKCGLYEDARL 190

Query: 181 AFSCILYRDLVLWNVMLYCYVFNCLGREAIEVFCLMQLEGFKGDDFTFSSLLSSCKYKGS 240
           AFSC LY+DLVLWNVMLYCYVFN L REAIE F LMQLEGFKGD+FTFSSLLSSCKYKGS
Sbjct: 191 AFSCTLYKDLVLWNVMLYCYVFNSLSREAIEGFRLMQLEGFKGDEFTFSSLLSSCKYKGS 250

Query: 241 GELGKQLHGLLIKQSFDLDILVASSLVNVYAKNDNLYDARKVFDEMPSRNSVSWTTMIVG 300
           GELGKQLHGLLIKQSFDLDILVASSL++VYAKNDNLYDARKVFDEMP+RNSVSWTTMIVG
Sbjct: 251 GELGKQLHGLLIKQSFDLDILVASSLIDVYAKNDNLYDARKVFDEMPTRNSVSWTTMIVG 310

Query: 301 YGQQEDGKEAVKLFRRMFGEDYCPDELTFASVLSSCGFTCGACELMQVHSCLIKLGLEAF 360
           YGQQE GKEAVKLFRRMFG+DYC DELTFASVLSSCGFT GA ELMQVHSCLIKLG EAF
Sbjct: 311 YGQQEYGKEAVKLFRRMFGKDYCLDELTFASVLSSCGFTSGASELMQVHSCLIKLGFEAF 370

Query: 361 LSINNGLINAYSKCGIISTALQCFRLIAEPDLVTWTSIICGLALCGLEKDAVELFDKMLS 420
           LSINNGLI AYSKCGI++ ALQCFRLIAEPDLVTWTSIICGLA CGLEKDAV+LFDKMLS
Sbjct: 371 LSINNGLIYAYSKCGIVAAALQCFRLIAEPDLVTWTSIICGLAFCGLEKDAVKLFDKMLS 430

Query: 421 YGIKPDKIAFLGVLSACSHGGFVSMGLHYFNLMTIQYQIVPNSEHLTCLIDLLSRAGSLD 480
           YGI+PDKIAFLGVLSACSHGGFVSMGLHYFNLMT QYQ+VP+ EHLTCLIDLL RAGSLD
Sbjct: 431 YGIRPDKIAFLGVLSACSHGGFVSMGLHYFNLMTNQYQLVPDPEHLTCLIDLLGRAGSLD 490

Query: 481 QAFDLLKSTAKEAGPDAFRAFIRACRTHGDLRLAEWAMEFASEPNEQVNYSLVSNMYASE 540
           QAFDLLKS  KEAGPDA  AFIRACRTHG+L+LA+WAMEF SEP+E VNYSLVSNMYASE
Sbjct: 491 QAFDLLKSMRKEAGPDALTAFIRACRTHGNLKLAKWAMEFISEPDEPVNYSLVSNMYASE 550

Query: 541 GRWSDVARMRKLMKDSCERKAPGLSWVEIAGITICYNHLFVSGDRSHPQSSDLYTMLGLL 600
           GRWSDVARM KL+ D CE+K PGLSWVEIAG    YNHLF SGDRSHPQSSDLY MLGLL
Sbjct: 551 GRWSDVARMHKLINDRCEQKTPGLSWVEIAG----YNHLFKSGDRSHPQSSDLYAMLGLL 610

Query: 601 LNTMKKDDNSAA 613
           LNTMK+D  S A
Sbjct: 611 LNTMKEDYKSTA 618

BLAST of Cla97C06G112990 vs. ExPASy TrEMBL
Match: A0A6J1H3L2 (pentatricopeptide repeat-containing protein At2g46050, mitochondrial isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111460096 PE=4 SV=1)

HSP 1 Score: 1040.0 bits (2688), Expect = 3.9e-300
Identity = 511/620 (82.42%), Postives = 552/620 (89.03%), Query Frame = 0

Query: 1   MLIWPSIHFGRSRLVHSFSVNALKAAAPVNPIPRGTQSHSLIIKLGLANELSVQNKLLKI 60
           MLIWPS HFG  RLVHSFS N LKAAA +N IPRGT+ HSL+IKLGLANELSVQNKLLKI
Sbjct: 1   MLIWPSTHFGCCRLVHSFSFNVLKAAADLNSIPRGTKLHSLVIKLGLANELSVQNKLLKI 60

Query: 61  YVKCRDLESAGNVFDEMSRRNVVSWNTVICGLVDCGYGADFKLRQHSIFLYFKKMLMGMV 120
           YVKCRDL  A N+FDEM RRNVVSWNTVICG+V+CGYG +FK+R+ SI   FK MLM MV
Sbjct: 61  YVKCRDLGRARNLFDEMRRRNVVSWNTVICGVVNCGYGGEFKMRERSILSCFKNMLMDMV 120

Query: 121 RPDGVTFNGLFRSCVVLNDVESGRQLHGFVMKIGFDLDCFVGSAVVDFYGKCGLYEDARL 180
            PDGVTFNGLFRSC V+NDV SG+QLHGFV+KIGFDLDCFVGSAVVDFY KCGLYEDARL
Sbjct: 121 DPDGVTFNGLFRSCDVMNDVGSGKQLHGFVIKIGFDLDCFVGSAVVDFYAKCGLYEDARL 180

Query: 181 AFSCILYRDLVLWNVMLYCYVFNCLGREAIEVFCLMQLEGFKGDDFTFSSLLSSCKYKGS 240
           AFS +LY+DLVLWNVMLYCYVFNCL +EAIE+F LMQLEGF GDDFTFSSLLSSCKYKGS
Sbjct: 181 AFSSVLYKDLVLWNVMLYCYVFNCLAKEAIEIFFLMQLEGFTGDDFTFSSLLSSCKYKGS 240

Query: 241 GELGKQLHGLLIKQSFDLDILVASSLVNVYAKNDNLYDARKVFDEMPSRNSVSWTTMIVG 300
           GELGKQLH  LIK SFDLDILVASSLVN+YAKN++LYDARK FDEMP RNSVSWTTMIVG
Sbjct: 241 GELGKQLHVHLIKHSFDLDILVASSLVNMYAKNNHLYDARKAFDEMPIRNSVSWTTMIVG 300

Query: 301 YGQQEDGKEAVKLFRRMFGEDYCPDELTFASVLSSCGFTCGACELMQVHSCLIKLGLEAF 360
           YGQQE GKEAVKL RRMF EDY PDELTFASVLSSCGFT GA EL+QVHSCLIKLG EAF
Sbjct: 301 YGQQEHGKEAVKLLRRMFEEDYYPDELTFASVLSSCGFTSGASELIQVHSCLIKLGFEAF 360

Query: 361 LSINNGLINAYSKCGIISTALQCFRLIAEPDLVTWTSIICGLALCGLEKDAVELFDKMLS 420
           LS+NNGLINAYSKCG IS AL+CFRLIAEPDLV+WTSIICG A CGLEK AVELFDKMLS
Sbjct: 361 LSVNNGLINAYSKCGTISPALRCFRLIAEPDLVSWTSIICGFAFCGLEKAAVELFDKMLS 420

Query: 421 YGIKPDKIAFLGVLSACSHGGFVSMGLHYFNLMTIQYQIVPNSEHLTCLIDLLSRAGSLD 480
            GI+PDKIAFLGVLSACSHGGFV+MGLHYFNLMT +YQIVP+SEHLTCLIDL+ RAGSLD
Sbjct: 421 QGIRPDKIAFLGVLSACSHGGFVNMGLHYFNLMTNEYQIVPDSEHLTCLIDLIGRAGSLD 480

Query: 481 QAFDLLKSTAKEAGPDAFRAFIRACRTHGDLRLAEWAMEFASEPNEQVNYSLVSNMYASE 540
           +AF LLKS ++EAGPDAFR+FIRACRTHG LRLA+WAMEFAS+P + VN SL+SNMYASE
Sbjct: 481 EAFKLLKSVSEEAGPDAFRSFIRACRTHGRLRLAKWAMEFASDPCKPVNSSLMSNMYASE 540

Query: 541 GRWSDVARMRKLMKDSCERKAPGLSWVEIAGITICYNHLFVSGDRSHPQSSDLYTMLGLL 600
           GRWSDVARMRKL+KDSCE K PG SW+EIAG    YNHLFVS DRSHPQSSDLY MLGLL
Sbjct: 541 GRWSDVARMRKLLKDSCEPKVPGFSWIEIAG----YNHLFVSSDRSHPQSSDLYEMLGLL 600

Query: 601 LNTMKKDDNSAALWVDIVPD 621
           LNT+KKD  S A  +DI P+
Sbjct: 601 LNTVKKDYKSTASNIDIEPE 616

BLAST of Cla97C06G112990 vs. ExPASy TrEMBL
Match: A0A6J1L572 (pentatricopeptide repeat-containing protein At2g46050, mitochondrial OS=Cucurbita maxima OX=3661 GN=LOC111499233 PE=4 SV=1)

HSP 1 Score: 1035.4 bits (2676), Expect = 9.6e-299
Identity = 511/620 (82.42%), Postives = 550/620 (88.71%), Query Frame = 0

Query: 1   MLIWPSIHFGRSRLVHSFSVNALKAAAPVNPIPRGTQSHSLIIKLGLANELSVQNKLLKI 60
           MLIWPS HFG SRLVHSFS N LKAAA VN IPRGTQ HSL+IKLGLANELSVQNKLLKI
Sbjct: 1   MLIWPSTHFGCSRLVHSFSFNVLKAAADVNSIPRGTQLHSLVIKLGLANELSVQNKLLKI 60

Query: 61  YVKCRDLESAGNVFDEMSRRNVVSWNTVICGLVDCGYGADFKLRQHSIFLYFKKMLMGMV 120
           YVKCRDL  A N+FDEM RRNVVSWNTVICG+VDCGYG +FK+R+ S    FK MLM MV
Sbjct: 61  YVKCRDLGRAWNLFDEMRRRNVVSWNTVICGVVDCGYGGEFKMRERSNLSCFKNMLMEMV 120

Query: 121 RPDGVTFNGLFRSCVVLNDVESGRQLHGFVMKIGFDLDCFVGSAVVDFYGKCGLYEDARL 180
            PDGVTFNGLFRSC V+NDV SG+QLHGFV+K GFDLDCFVGSAVVDFY KCGLYEDARL
Sbjct: 121 DPDGVTFNGLFRSCDVMNDVGSGKQLHGFVIKFGFDLDCFVGSAVVDFYAKCGLYEDARL 180

Query: 181 AFSCILYRDLVLWNVMLYCYVFNCLGREAIEVFCLMQLEGFKGDDFTFSSLLSSCKYKGS 240
           AFS +LY+DLVLWNVMLYCYVFNCL  EAIE+F LMQLEGF GDDFTFSSLLSSCKYKGS
Sbjct: 181 AFSSVLYKDLVLWNVMLYCYVFNCLAEEAIEIFFLMQLEGFTGDDFTFSSLLSSCKYKGS 240

Query: 241 GELGKQLHGLLIKQSFDLDILVASSLVNVYAKNDNLYDARKVFDEMPSRNSVSWTTMIVG 300
           GELG QLH  LIK SFDLDILVASSLVN+YAKN++LYDARKVFDEMP RNSVSWTTMIVG
Sbjct: 241 GELGMQLHVHLIKHSFDLDILVASSLVNMYAKNNHLYDARKVFDEMPIRNSVSWTTMIVG 300

Query: 301 YGQQEDGKEAVKLFRRMFGEDYCPDELTFASVLSSCGFTCGACELMQVHSCLIKLGLEAF 360
           YGQQE GKEAVKL RRM  EDY PDELTFASVLSSCGFT GA EL+QVHSCLIKLG EAF
Sbjct: 301 YGQQEHGKEAVKLLRRMLEEDYYPDELTFASVLSSCGFTSGASELIQVHSCLIKLGFEAF 360

Query: 361 LSINNGLINAYSKCGIISTALQCFRLIAEPDLVTWTSIICGLALCGLEKDAVELFDKMLS 420
           LS+NNGLINAYSKCG IS+AL+CFRLIAEPDLV+ TSIICGLA CG+EKDAVELFDKMLS
Sbjct: 361 LSVNNGLINAYSKCGAISSALRCFRLIAEPDLVSRTSIICGLAFCGVEKDAVELFDKMLS 420

Query: 421 YGIKPDKIAFLGVLSACSHGGFVSMGLHYFNLMTIQYQIVPNSEHLTCLIDLLSRAGSLD 480
            GI+PDKIAFLGVLSACSHGG+ +MGLHYFNLMT +YQIVP+SEHLTCLIDLL RAGSLD
Sbjct: 421 QGIRPDKIAFLGVLSACSHGGYANMGLHYFNLMTNEYQIVPDSEHLTCLIDLLGRAGSLD 480

Query: 481 QAFDLLKSTAKEAGPDAFRAFIRACRTHGDLRLAEWAMEFASEPNEQVNYSLVSNMYASE 540
           +AF LLKS +++AGPDAFR+FIRACRTHG LRLA+WAMEFAS+P + VN SL+SN+YASE
Sbjct: 481 EAFKLLKSVSEKAGPDAFRSFIRACRTHGHLRLAKWAMEFASDPYKPVNCSLMSNIYASE 540

Query: 541 GRWSDVARMRKLMKDSCERKAPGLSWVEIAGITICYNHLFVSGDRSHPQSSDLYTMLGLL 600
           GRWSDVARMRKLMKDSCE K PG SW+EIAG    YNHLFVS DRSHPQSSDLY MLGLL
Sbjct: 541 GRWSDVARMRKLMKDSCEPKVPGFSWIEIAG----YNHLFVSSDRSHPQSSDLYAMLGLL 600

Query: 601 LNTMKKDDNSAALWVDIVPD 621
           LNTMKKD  S A  +DI P+
Sbjct: 601 LNTMKKDYKSIASNIDIEPE 616

BLAST of Cla97C06G112990 vs. TAIR 10
Match: AT2G46050.1 (Pentatricopeptide repeat (PPR-like) superfamily protein )

HSP 1 Score: 449.9 bits (1156), Expect = 3.3e-126
Identity = 245/549 (44.63%), Postives = 336/549 (61.20%), Query Frame = 0

Query: 24  KAAAPVNPIPRGTQSHSLIIKLGLANELSVQNKLLKIYVKCRDLESAGNVFDEMSRRNVV 83
           K +A ++ +    Q H  ++K G+ N L +QNKLL+ Y K R+ + A  +FDEM  RN+V
Sbjct: 44  KLSASLDHLSDVKQEHGFMVKQGIYNSLFLQNKLLQAYTKIREFDDADKLFDEMPLRNIV 103

Query: 84  SWNTVICGLVDCGYGADFKLRQHSIFLYFKKMLMGMVRPDGVTFNGLFRSCVVLNDVESG 143
           +WN +I G++      D   R H  F Y  ++L   V  D V+F GL R C    ++++G
Sbjct: 104 TWNILIHGVIQ--RDGDTNHRAHLGFCYLSRILFTDVSLDHVSFMGLIRLCTDSTNMKAG 163

Query: 144 RQLHGFVMKIGFDLDCFVGSAVVDFYGKCGLYEDARLAFSCILYRDLVLWNVMLYCYVFN 203
            QLH  ++K G +  CF  +++V FYGKCGL  +AR  F  +L RDLVLWN ++  YV N
Sbjct: 164 IQLHCLMVKQGLESSCFPSTSLVHFYGKCGLIVEARRVFEAVLDRDLVLWNALVSSYVLN 223

Query: 204 CLGREAIEVFCLM--QLEGFKGDDFTFSSLLSSCKYKGSGELGKQLHGLLIKQSFDLDIL 263
            +  EA  +  LM      F+GD FTFSSLLS+C+     E GKQ+H +L K S+  DI 
Sbjct: 224 GMIDEAFGLLKLMGSDKNRFRGDYFTFSSLLSACRI----EQGKQIHAILFKVSYQFDIP 283

Query: 264 VASSLVNVYAKNDNLYDARKVFDEMPSRNSVSWTTMIVGYGQQEDGKEAVKLFRRMFGED 323
           VA++L+N+YAK+++L DAR+ F+ M  RN VSW  MIVG+ Q  +G+EA++LF +M  E+
Sbjct: 284 VATALLNMYAKSNHLSDARECFESMVVRNVVSWNAMIVGFAQNGEGREAMRLFGQMLLEN 343

Query: 324 YCPDELTFASVLSSCGFTCGACELMQVHSCLIKLGLEAFLSINNGLINAYSKCGIISTAL 383
             PDELTFASVLSSC       E+ QV + + K G   FLS+ N LI++YS+ G +S AL
Sbjct: 344 LQPDELTFASVLSSCAKFSAIWEIKQVQAMVTKKGSADFLSVANSLISSYSRNGNLSEAL 403

Query: 384 QCFRLIAEPDLVTWTSIICGLALCGLEKDAVELFDKMLSYGIKPDKIAFLGVLSACSHGG 443
            CF  I EPDLV+WTS+I  LA  G  ++++++F+ ML   ++PDKI FL VLSACSHGG
Sbjct: 404 LCFHSIREPDLVSWTSVIGALASHGFAEESLQMFESMLQ-KLQPDKITFLEVLSACSHGG 463

Query: 444 FVSMGLHYFNLMTIQYQIVPNSEHLTCLIDLLSRAGSLDQAFDLLKSTAKEAGPDAFRAF 503
            V  GL  F  MT  Y+I    EH TCLIDLL RAG +D+A D+L S   E    A  AF
Sbjct: 464 LVQEGLRCFKRMTEFYKIEAEDEHYTCLIDLLGRAGFIDEASDVLNSMPTEPSTHALAAF 523

Query: 504 IRACRTHGDLRLAEWAME--FASEPNEQVNYSLVSNMYASEGRWSDVARMRKLMKDSC-E 563
              C  H      +W  +     EP + VNYS++SN Y SEG W+  A +RK  + +C  
Sbjct: 524 TGGCNIHEKRESMKWGAKKLLEIEPTKPVNYSILSNAYVSEGHWNQAALLRKRERRNCYN 583

Query: 564 RKAPGLSWV 568
            K PG SW+
Sbjct: 584 PKTPGCSWL 585

BLAST of Cla97C06G112990 vs. TAIR 10
Match: AT2G13600.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 368.6 bits (945), Expect = 9.7e-102
Identity = 208/635 (32.76%), Postives = 331/635 (52.13%), Query Frame = 0

Query: 39  HSLIIKLGLANELSVQNKLLKIYVKCRDLESAGNVFDEMSRRNVVSWNTVICGLVDCGY- 98
           H+ +IK G +NE+ +QN+L+  Y KC  LE    VFD+M +RN+ +WN+V+ GL   G+ 
Sbjct: 43  HASVIKSGFSNEIFIQNRLIDAYSKCGSLEDGRQVFDKMPQRNIYTWNSVVTGLTKLGFL 102

Query: 99  -GADFKLR------------------QH----SIFLYFKKMLMGMVRPDGVTFNGLFRSC 158
             AD   R                  QH        YF  M       +  +F  +  +C
Sbjct: 103 DEADSLFRSMPERDQCTWNSMVSGFAQHDRCEEALCYFAMMHKEGFVLNEYSFASVLSAC 162

Query: 159 VVLNDVESGRQLHGFVMKIGFDLDCFVGSAVVDFYGKCGLYEDARLAFSCILYRDLVLWN 218
             LND+  G Q+H  + K  F  D ++GSA+VD Y KCG   DA+  F  +  R++V WN
Sbjct: 163 SGLNDMNKGVQVHSLIAKSPFLSDVYIGSALVDMYSKCGNVNDAQRVFDEMGDRNVVSWN 222

Query: 219 VMLYCYVFNCLGREAIEVFCLMQLEGFKGDDFTFSSLLSSCKYKGSGELGKQLHGLLIK- 278
            ++ C+  N    EA++VF +M     + D+ T +S++S+C    + ++G+++HG ++K 
Sbjct: 223 SLITCFEQNGPAVEALDVFQMMLESRVEPDEVTLASVISACASLSAIKVGQEVHGRVVKN 282

Query: 279 QSFDLDILVASSLVNVYAKNDNLYDARKVFDEMP-------------------------- 338
                DI+++++ V++YAK   + +AR +FD MP                          
Sbjct: 283 DKLRNDIILSNAFVDMYAKCSRIKEARFIFDSMPIRNVIAETSMISGYAMAASTKAARLM 342

Query: 339 -----SRNSVSWTTMIVGYGQQEDGKEAVKLFRRMFGEDYCPDELTFASVLSSCGFTCGA 398
                 RN VSW  +I GY Q  + +EA+ LF  +  E  CP   +FA++L +C      
Sbjct: 343 FTKMAERNVVSWNALIAGYTQNGENEEALSLFCLLKRESVCPTHYSFANILKACADLAEL 402

Query: 399 CELMQVHSCLIKLGL------EAFLSINNGLINAYSKCGIISTALQCFRLIAEPDLVTWT 458
              MQ H  ++K G       E  + + N LI+ Y KCG +      FR + E D V+W 
Sbjct: 403 HLGMQAHVHVLKHGFKFQSGEEDDIFVGNSLIDMYVKCGCVEEGYLVFRKMMERDCVSWN 462

Query: 459 SIICGLALCGLEKDAVELFDKMLSYGIKPDKIAFLGVLSACSHGGFVSMGLHYFNLMTIQ 518
           ++I G A  G   +A+ELF +ML  G KPD I  +GVLSAC H GFV  G HYF+ MT  
Sbjct: 463 AMIIGFAQNGYGNEALELFREMLESGEKPDHITMIGVLSACGHAGFVEEGRHYFSSMTRD 522

Query: 519 YQIVPNSEHLTCLIDLLSRAGSLDQAFDLLKSTAKEAGPDAFRAFIRACRTHGDLRLAEW 578
           + + P  +H TC++DLL RAG L++A  +++    +     + + + AC+ H ++ L ++
Sbjct: 523 FGVAPLRDHYTCMVDLLGRAGFLEEAKSMIEEMPMQPDSVIWGSLLAACKVHRNITLGKY 582

Query: 579 AME--FASEPNEQVNYSLVSNMYASEGRWSDVARMRKLMKDSCERKAPGLSWVEIAGITI 610
             E     EP+    Y L+SNMYA  G+W DV  +RK M+     K PG SW++I G   
Sbjct: 583 VAEKLLEVEPSNSGPYVLLSNMYAELGKWEDVMNVRKSMRKEGVTKQPGCSWIKIQG--- 642

BLAST of Cla97C06G112990 vs. TAIR 10
Match: AT3G22690.1 (CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF1685 (InterPro:IPR012881), Pentatricopeptide repeat (InterPro:IPR002885); BEST Arabidopsis thaliana protein match is: Tetratricopeptide repeat (TPR)-like superfamily protein (TAIR:AT2G29760.1); Has 49784 Blast hits to 14716 proteins in 280 species: Archae - 2; Bacteria - 10; Metazoa - 107; Fungi - 167; Plants - 48594; Viruses - 0; Other Eukaryotes - 904 (source: NCBI BLink). )

HSP 1 Score: 344.7 bits (883), Expect = 1.5e-94
Identity = 206/610 (33.77%), Postives = 320/610 (52.46%), Query Frame = 0

Query: 23  LKAAAPVNPIPRGTQSHSLIIKLGLANELSVQNKLLKIYVKCRDLESAGNVFDEMSRRNV 82
           L A A       G Q H LI+K+G A +L VQN L+  Y +C +L+SA  VFDEMS RNV
Sbjct: 141 LSACAKSRAKGNGIQIHGLIVKMGYAKDLFVQNSLVHFYAECGELDSARKVFDEMSERNV 200

Query: 83  VSWNTVICGLVDCGYGADFKLRQHSIFLYFKKMLMGMVRPDGVTFNGLFRSCVVLNDVES 142
           VSW ++ICG     +  D      ++ L+F+ +    V P+ VT   +  +C  L D+E+
Sbjct: 201 VSWTSMICGYARRDFAKD------AVDLFFRMVRDEEVTPNSVTMVCVISACAKLEDLET 260

Query: 143 GRQLHGFVMKIGFDLDCFVGSAVVDFYGKCGLYEDARLAFSCILYRDLVLWNVMLYCYVF 202
           G +++ F+   G +++  + SA+VD Y KC   + A+  F      +L L N M   YV 
Sbjct: 261 GEKVYAFIRNSGIEVNDLMVSALVDMYMKCNAIDVAKRLFDEYGASNLDLCNAMASNYVR 320

Query: 203 NCLGREAIEVFCLMQLEGFKGDDFTFSSLLSSCKYKGSGELGKQLHGLLIKQSFDLDILV 262
             L REA+ VF LM   G + D  +  S +SSC    +   GK  HG +++  F+    +
Sbjct: 321 QGLTREALGVFNLMMDSGVRPDRISMLSAISSCSQLRNILWGKSCHGYVLRNGFESWDNI 380

Query: 263 ASSLVNVYAK-------------------------------NDNLYDARKVFDEMPSRNS 322
            ++L+++Y K                               N  +  A + F+ MP +N 
Sbjct: 381 CNALIDMYMKCHRQDTAFRIFDRMSNKTVVTWNSIVAGYVENGEVDAAWETFETMPEKNI 440

Query: 323 VSWTTMIVGYGQQEDGKEAVKLFRRMFG-EDYCPDELTFASVLSSCGFTCGACELMQ-VH 382
           VSW T+I G  Q    +EA+++F  M   E    D +T  S+ S+CG   GA +L + ++
Sbjct: 441 VSWNTIISGLVQGSLFEEAIEVFCSMQSQEGVNADGVTMMSIASACGH-LGALDLAKWIY 500

Query: 383 SCLIKLGLEAFLSINNGLINAYSKCGIISTALQCFRLIAEPDLVTWTSIICGLALCGLEK 442
             + K G++  + +   L++ +S+CG   +A+  F  +   D+  WT+ I  +A+ G  +
Sbjct: 501 YYIEKNGIQLDVRLGTTLVDMFSRCGDPESAMSIFNSLTNRDVSAWTAAIGAMAMAGNAE 560

Query: 443 DAVELFDKMLSYGIKPDKIAFLGVLSACSHGGFVSMGLHYFNLMTIQYQIVPNSEHLTCL 502
            A+ELFD M+  G+KPD +AF+G L+ACSHGG V  G   F  M   + + P   H  C+
Sbjct: 561 RAIELFDDMIEQGLKPDGVAFVGALTACSHGGLVQQGKEIFYSMLKLHGVSPEDVHYGCM 620

Query: 503 IDLLSRAGSLDQAFDLLKSTAKEAGPDAFRAFIRACRTHGDLRLAEWAMEFAS--EPNEQ 562
           +DLL RAG L++A  L++    E     + + + ACR  G++ +A +A E      P   
Sbjct: 621 VDLLGRAGLLEEAVQLIEDMPMEPNDVIWNSLLAACRVQGNVEMAAYAAEKIQVLAPERT 680

Query: 563 VNYSLVSNMYASEGRWSDVARMRKLMKDSCERKAPGLSWVEIAGITICYNHLFVSGDRSH 598
            +Y L+SN+YAS GRW+D+A++R  MK+   RK PG S ++I G T    H F SGD SH
Sbjct: 681 GSYVLLSNVYASAGRWNDMAKVRLSMKEKGLRKPPGTSSIQIRGKT----HEFTSGDESH 739

BLAST of Cla97C06G112990 vs. TAIR 10
Match: AT3G22690.2 (INVOLVED IN: photosystem II assembly, regulation of chlorophyll biosynthetic process, photosystem I assembly, thylakoid membrane organization, RNA modification; LOCATED IN: chloroplast; EXPRESSED IN: 13 plant structures; EXPRESSED DURING: LP.04 four leaves visible, 4 anthesis, petal differentiation and expansion stage, E expanded cotyledon stage, D bilateral stage; CONTAINS InterPro DOMAIN/s: Pentatricopeptide repeat (InterPro:IPR002885); BEST Arabidopsis thaliana protein match is: Tetratricopeptide repeat (TPR)-like superfamily protein (TAIR:AT2G29760.1). )

HSP 1 Score: 344.7 bits (883), Expect = 1.5e-94
Identity = 206/610 (33.77%), Postives = 320/610 (52.46%), Query Frame = 0

Query: 23  LKAAAPVNPIPRGTQSHSLIIKLGLANELSVQNKLLKIYVKCRDLESAGNVFDEMSRRNV 82
           L A A       G Q H LI+K+G A +L VQN L+  Y +C +L+SA  VFDEMS RNV
Sbjct: 141 LSACAKSRAKGNGIQIHGLIVKMGYAKDLFVQNSLVHFYAECGELDSARKVFDEMSERNV 200

Query: 83  VSWNTVICGLVDCGYGADFKLRQHSIFLYFKKMLMGMVRPDGVTFNGLFRSCVVLNDVES 142
           VSW ++ICG     +  D      ++ L+F+ +    V P+ VT   +  +C  L D+E+
Sbjct: 201 VSWTSMICGYARRDFAKD------AVDLFFRMVRDEEVTPNSVTMVCVISACAKLEDLET 260

Query: 143 GRQLHGFVMKIGFDLDCFVGSAVVDFYGKCGLYEDARLAFSCILYRDLVLWNVMLYCYVF 202
           G +++ F+   G +++  + SA+VD Y KC   + A+  F      +L L N M   YV 
Sbjct: 261 GEKVYAFIRNSGIEVNDLMVSALVDMYMKCNAIDVAKRLFDEYGASNLDLCNAMASNYVR 320

Query: 203 NCLGREAIEVFCLMQLEGFKGDDFTFSSLLSSCKYKGSGELGKQLHGLLIKQSFDLDILV 262
             L REA+ VF LM   G + D  +  S +SSC    +   GK  HG +++  F+    +
Sbjct: 321 QGLTREALGVFNLMMDSGVRPDRISMLSAISSCSQLRNILWGKSCHGYVLRNGFESWDNI 380

Query: 263 ASSLVNVYAK-------------------------------NDNLYDARKVFDEMPSRNS 322
            ++L+++Y K                               N  +  A + F+ MP +N 
Sbjct: 381 CNALIDMYMKCHRQDTAFRIFDRMSNKTVVTWNSIVAGYVENGEVDAAWETFETMPEKNI 440

Query: 323 VSWTTMIVGYGQQEDGKEAVKLFRRMFG-EDYCPDELTFASVLSSCGFTCGACELMQ-VH 382
           VSW T+I G  Q    +EA+++F  M   E    D +T  S+ S+CG   GA +L + ++
Sbjct: 441 VSWNTIISGLVQGSLFEEAIEVFCSMQSQEGVNADGVTMMSIASACGH-LGALDLAKWIY 500

Query: 383 SCLIKLGLEAFLSINNGLINAYSKCGIISTALQCFRLIAEPDLVTWTSIICGLALCGLEK 442
             + K G++  + +   L++ +S+CG   +A+  F  +   D+  WT+ I  +A+ G  +
Sbjct: 501 YYIEKNGIQLDVRLGTTLVDMFSRCGDPESAMSIFNSLTNRDVSAWTAAIGAMAMAGNAE 560

Query: 443 DAVELFDKMLSYGIKPDKIAFLGVLSACSHGGFVSMGLHYFNLMTIQYQIVPNSEHLTCL 502
            A+ELFD M+  G+KPD +AF+G L+ACSHGG V  G   F  M   + + P   H  C+
Sbjct: 561 RAIELFDDMIEQGLKPDGVAFVGALTACSHGGLVQQGKEIFYSMLKLHGVSPEDVHYGCM 620

Query: 503 IDLLSRAGSLDQAFDLLKSTAKEAGPDAFRAFIRACRTHGDLRLAEWAMEFAS--EPNEQ 562
           +DLL RAG L++A  L++    E     + + + ACR  G++ +A +A E      P   
Sbjct: 621 VDLLGRAGLLEEAVQLIEDMPMEPNDVIWNSLLAACRVQGNVEMAAYAAEKIQVLAPERT 680

Query: 563 VNYSLVSNMYASEGRWSDVARMRKLMKDSCERKAPGLSWVEIAGITICYNHLFVSGDRSH 598
            +Y L+SN+YAS GRW+D+A++R  MK+   RK PG S ++I G T    H F SGD SH
Sbjct: 681 GSYVLLSNVYASAGRWNDMAKVRLSMKEKGLRKPPGTSSIQIRGKT----HEFTSGDESH 739

BLAST of Cla97C06G112990 vs. TAIR 10
Match: AT4G39530.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 342.0 bits (876), Expect = 9.7e-94
Identity = 199/574 (34.67%), Postives = 317/574 (55.23%), Query Frame = 0

Query: 35  GTQSHSLIIKLGLANELSVQNKLLKIYVKCRDLESAGNVFDEMSRRNVVSWNTVICGLVD 94
           G Q H+ I++ GL  + S+ N L+  YVKC  + +A  +F+ M  +N++SW T++ G   
Sbjct: 268 GKQIHAHILRYGLEMDASLMNVLIDSYVKCGRVIAAHKLFNGMPNKNIISWTTLLSGY-- 327

Query: 95  CGYGADFKLRQHSIFLYFKKMLMGMVRPDGVTFNGLFRSCVVLNDVESGRQLHGFVMKIG 154
                   L + ++ L+      G+ +PD    + +  SC  L+ +  G Q+H + +K  
Sbjct: 328 ----KQNALHKEAMELFTSMSKFGL-KPDMYACSSILTSCASLHALGFGTQVHAYTIKAN 387

Query: 155 FDLDCFVGSAVVDFYGKCGLYEDARLAFSCILYRDLVLWNVMLYCYVFNCLG-----REA 214
              D +V ++++D Y KC    DAR  F      D+VL+N M+  Y  + LG      EA
Sbjct: 388 LGNDSYVTNSLIDMYAKCDCLTDARKVFDIFAAADVVLFNAMIEGY--SRLGTQWELHEA 447

Query: 215 IEVFCLMQLEGFKGDDFTFSSLLSSCKYKGSGELGKQLHGLLIKQSFDLDILVASSLVNV 274
           + +F  M+    +    TF SLL +     S  L KQ+HGL+ K   +LDI   S+L++V
Sbjct: 448 LNIFRDMRFRLIRPSLLTFVSLLRASASLTSLGLSKQIHGLMFKYGLNLDIFAGSALIDV 507

Query: 275 YAKNDNLYDARKVFDEMPSRNSVSWTTMIVGYGQQEDGKEAVKLFRRMFGEDYCPDELTF 334
           Y+    L D+R VFDEM  ++ V W +M  GY QQ + +EA+ LF  +      PDE TF
Sbjct: 508 YSNCYCLKDSRLVFDEMKVKDLVIWNSMFAGYVQQSENEEALNLFLELQLSRERPDEFTF 567

Query: 335 ASVLSSCGFTCGACELMQVHSCLIKLGLEAFLSINNGLINAYSKCGIISTALQCFRLIAE 394
           A+++++ G         + H  L+K GLE    I N L++ Y+KCG    A + F   A 
Sbjct: 568 ANMVTAAGNLASVQLGQEFHCQLLKRGLECNPYITNALLDMYAKCGSPEDAHKAFDSAAS 627

Query: 395 PDLVTWTSIICGLALCGLEKDAVELFDKMLSYGIKPDKIAFLGVLSACSHGGFVSMGLHY 454
            D+V W S+I   A  G  K A+++ +KM+S GI+P+ I F+GVLSACSH G V  GL  
Sbjct: 628 RDVVCWNSVISSYANHGEGKKALQMLEKMMSEGIEPNYITFVGVLSACSHAGLVEDGLKQ 687

Query: 455 FNLMTIQYQIVPNSEHLTCLIDLLSRAGSLDQAFDLLKSTAKEAGPDAFRAFIRACRTHG 514
           F LM +++ I P +EH  C++ LL RAG L++A +L++    +     +R+ +  C   G
Sbjct: 688 FELM-LRFGIEPETEHYVCMVSLLGRAGRLNKARELIEKMPTKPAAIVWRSLLSGCAKAG 747

Query: 515 DLRLAEWAMEFA--SEPNEQVNYSLVSNMYASEGRWSDVARMRKLMKDSCERKAPGLSWV 574
           ++ LAE A E A  S+P +  +++++SN+YAS+G W++  ++R+ MK     K PG SW+
Sbjct: 748 NVELAEHAAEMAILSDPKDSGSFTMLSNIYASKGMWTEAKKVRERMKVEGVVKEPGRSWI 807

Query: 575 EIAGITICYNHLFVSGDRSHPQSSDLYTMLGLLL 602
            I        H+F+S D+SH +++ +Y +L  LL
Sbjct: 808 GINKEV----HIFLSKDKSHCKANQIYEVLDDLL 827

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038874466.10.0e+0090.70pentatricopeptide repeat-containing protein At2g46050, mitochondrial isoform X1 ... [more]
XP_008458191.10.0e+0086.27PREDICTED: pentatricopeptide repeat-containing protein At2g46050, mitochondrial ... [more]
KAG6575187.13.3e-30182.58Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita a... [more]
KAG7013750.19.5e-30182.90Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita a... [more]
XP_022958961.18.0e-30082.42pentatricopeptide repeat-containing protein At2g46050, mitochondrial isoform X1 ... [more]
Match NameE-valueIdentityDescription
O823634.7e-12544.63Pentatricopeptide repeat-containing protein At2g46050, mitochondrial OS=Arabidop... [more]
Q9SIT71.4e-10032.76Pentatricopeptide repeat-containing protein At2g13600 OS=Arabidopsis thaliana OX... [more]
Q9LUJ22.1e-9333.77Pentatricopeptide repeat-containing protein At3g22690 OS=Arabidopsis thaliana OX... [more]
Q9SVA51.4e-9234.67Pentatricopeptide repeat-containing protein At4g39530 OS=Arabidopsis thaliana OX... [more]
Q9SMZ22.3e-9233.22Pentatricopeptide repeat-containing protein At4g33170 OS=Arabidopsis thaliana OX... [more]
Match NameE-valueIdentityDescription
A0A0A0K8630.0e+0087.58Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G005130 PE=4 SV=1[more]
A0A5D3BXR60.0e+0086.27Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946... [more]
A0A1S3C6T70.0e+0086.27pentatricopeptide repeat-containing protein At2g46050, mitochondrial isoform X1 ... [more]
A0A6J1H3L23.9e-30082.42pentatricopeptide repeat-containing protein At2g46050, mitochondrial isoform X1 ... [more]
A0A6J1L5729.6e-29982.42pentatricopeptide repeat-containing protein At2g46050, mitochondrial OS=Cucurbit... [more]
Match NameE-valueIdentityDescription
AT2G46050.13.3e-12644.63Pentatricopeptide repeat (PPR-like) superfamily protein [more]
AT2G13600.19.7e-10232.76Pentatricopeptide repeat (PPR) superfamily protein [more]
AT3G22690.11.5e-9433.77CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF1685 (InterPro:IPR012... [more]
AT3G22690.21.5e-9433.77INVOLVED IN: photosystem II assembly, regulation of chlorophyll biosynthetic pro... [more]
AT4G39530.19.7e-9434.67Tetratricopeptide repeat (TPR)-like superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Watermelon (97103) v2.5
Date Performed: 2022-01-31
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 290..336
e-value: 5.2E-8
score: 33.0
coord: 390..437
e-value: 3.3E-7
score: 30.4
coord: 189..235
e-value: 2.5E-7
score: 30.8
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 467..487
e-value: 0.67
score: 10.3
coord: 56..82
e-value: 0.038
score: 14.2
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 393..426
e-value: 1.7E-7
score: 29.0
coord: 191..224
e-value: 8.2E-4
score: 17.4
coord: 292..326
e-value: 4.0E-6
score: 24.7
coord: 264..290
e-value: 0.0015
score: 16.5
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 391..425
score: 11.838262
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 290..324
score: 10.511944
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 340..586
e-value: 3.3E-34
score: 120.7
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 36..142
e-value: 1.1E-13
score: 52.9
coord: 236..339
e-value: 3.3E-20
score: 74.2
NoneNo IPR availablePANTHERPTHR24015OS07G0578800 PROTEIN-RELATEDcoord: 7..592
NoneNo IPR availablePANTHERPTHR24015:SF398OS07G0259400 PROTEINcoord: 7..592

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cla97C06G112990.1Cla97C06G112990.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0005515 protein binding