CSPI01G23820 (gene) Cucumber (PI 183967) v1

Overview
Name: CSPI01G23820
Type: gene
Organism: Cucumis sativus var. hardwickii cv. PI 183967 (Cucumber (PI 183967) v1)
Description: Pentatricopeptide repeat-containing protein
Location: Chr1: 19296858 .. 19299521 (+)
RNA-Seq Expression: CSPI01G23820
Synteny: CSPI01G23820
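
The span covered by the gene can be derived from the two coordinates in the Location field. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (a common annotation convention that this page does not state explicitly). The function name `span_length` is hypothetical.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of an interval, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates copied from the Location field (Chr1, + strand).
gene_bp = span_length(19296858, 19299521)
print(gene_bp)  # 2664
```

Under a 0-based, half-open convention the same coordinates would give one base fewer, so the assumption matters when comparing against other tools.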
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

TGTTCAACAAGAAATGAGATTATTATTATTATTATTATTACGAATTCAAAGAAATCACAAATCACTCACCTACTCTTCGATAGAATTTCAAGAACCTTCGATCGTTCATCTTCCATTCGCCAGACGGAGACATCACGGGAGTTGATTATTAGAAATTGTGAAGTGAAACCACTTTAAATAGAATTAGAAGAAAAGACAAGTTTTGTTGGAATAGCGAGGAAGAGAGTACCTCAAAAGGCGAGTCAATGGCGGAGAAACGAAACTTATTAGGAGGCCATTTGTAAGGGGCTATGGAAGCTCTAAGTGTTCCATCGATTTCTCTCCAGAATTTCTCAACCCTAAACAACAATCTTCTTTTCAGAAACCATCAAATTCTCTCTACAATAGATAAATGTTCAAGTTCAAAGCAATTGAAGGAAGTTCACGCTCGCATGCTCCGTACCGGTCTCTTCTTCGACCCCTTTTCGGCTAGCAAACTCTTCACAGCCTCCGCTCTTTCGTCCTTCTCCACTCTCGACTATGCCCGCAACTTGTTCGACCAAATTCCCCAACCAAATCTCTACACTTGGAACACCCTCATTCGAGCTTACGCTTCCAGCTCCGACCCTTTTCAGAGTTTCGTAATATTTCTGGATTTGCTTCATAAATGTGAGGATTTGCCTAATAAGTTCACTTTCCCGTTTGTTATTAAGGCCGCTTCGGAGCTTAAAGCTTCACGGGTTGGGACAGCTGTTCATGGAATGGCGATTAAGTTGTCGTTTGGTATGGATCTTTATATCCTTAATTCTCTTGTGCGATTCTATGGGGCATGTGGCGATTTGAGTATGGCTGAGCGATTGTTTAAGGGTATTTCTTGCAAAGATGTAGTGTCTTGGAATTCCATGATTTCGGCTTTTGCTCAGGGTAACTGTCCCGAAGATGCATTGGAGTTGTTTTTGAAAATGGAGAGGGAAAATGTGATGCCTAATTCTGTAACAATGGTGGGTGTTTTATCTGCTTGTGCGAAGAAGTTGGATTTGGAATTTGGGAGGTGGGTTTGTTCGTACATTGAAAGGAAAGGGATCAAAGTGGATTTAACTTTATGTAACGCCATGCTTGACATGTATACAAAGTGTGGAAGTGTTGATGATGCACAGAAGCTGTTTGACGAAATGCCTGAAAGAGATGTCTTCTCTTGGACCATCATGCTTGATGGGTATGCGAAAATGGGCGACTACGATGCTGCTCGGCTAGTGTTCAATGCAATGCCTGTGAAAGAAATTGCTGCTTGGAATGTTCTCATATCTGCTTATGAACAAAATGGTAAACCTAAGGAAGCTTTGGCCATTTTTAATGAGTTGCAGCTCAGTAAGATTGCAAAGCCTGATGAAGTCACTTTAGTTAGTACTCTGTCAGCTTGTGCTCAATTGGGAGCAATTGATTTGGGTGGATGGATTCATGTGTACACAAAAAGGGAAGGGATAGTTCTAAATTGCCATTTAATTTCTTCTCTTGTGGACATGTATGCTAAATGTGGTTCTTTAGAGAAAGCTCTCGAGGTGTTCTATTCAGTGGAGGAGAGAGATGTGTATGTTTGGAGTGCCATGATTGCTGGTTTGGGAATGCACGGCTGTGGGAAGGCGGCAATTGATCTATTCTTCAAAATGCAGGAAGCTAAGGTGAAGCCAAATAGTGTGACATTTACAAATGTATTATGTGCCTGTAGCCATGCTGGCTTAGTTGATGAGGGACGGGTGTTTTTCCATGAAATGGAGCCAGTTTATGGGGTTGTTCCTGAGATGAAGCACTATGCTTGTATGGTTGATATTCTCGGTCGTGCAGGGTTTCTTGAAGAAGCTATGGAGTTGATCAATGAAATGTCTACAACTCCAAGCGCATCCGTTTGGGGTGCTTTGCTTGGTGCTTGCAGCCTTCATATGAATGTTGAGCTTGGAGAATTAGCGAGTGACCAATTGCTAAAGTTGGAGCCTAGAAATCATGGTGCTATTGTACTTTTATCA
AACATATATGCTAAAACAGGAAGATGGGAAAAGGTTTCTGAGTTGAGGAAACTAATGAGAGACACTGAATTGAAAAAGGAACCTGGTTGTAGCTCCATTGAAGCCAACGGCAACGTCCACGAGTTTCTAGTGGGAGATAATACGCACCCGTTATCCAGTAACATCTATTCAAAGTTGGAGGAAATTGCAACAAAACTAAAATCAGTCGGTTACGAACCAAACAAATCCCATCTTCTCCAGCTCATCGAAGAGGACGACCTCAAGGAACAGGCCTTAAGCCTTCACAGCGAGAAGTTAGCCATCGCTTTTGGGCTTGTTACTTTGGCTCCATCTCAACCAATTCGAGTTGTGAAGAATCTTCGGATTTGTGGAGACTGCCATGCATTTGCTAAGCTTGTATCTAGAGTTTACGACAGAGATATATTACTTCGAGATCGATATCGATTCCATCATTTCCGAGATGGGCATTGTTCGTGTATGGATTACTGGTAAAGCTGCAAAATAATGGATTCTACGTTCACTTGCTCTGTTTGGTGCATGAGTTGAACTTCACAACAATAGCCACAACCTTTATGGACTTTTGTAACTACATCAATTTGAAATTTGGGAAAGCCTCGTGTAAATTTTGATTTAAATTTTCAGTTAGAATTTCCTTTAGAAAATT

mRNA sequence

TGTTCAACAAGAAATGAGATTATTATTATTATTATTATTACGAATTCAAAGAAATCACAAATCACTCACCTACTCTTCGATAGAATTTCAAGAACCTTCGATCGTTCATCTTCCATTCGCCAGACGGAGACATCACGGGAGTTGATTATTAGAAATTGTGAAGTGAAACCACTTTAAATAGAATTAGAAGAAAAGACAAGTTTTGTTGGAATAGCGAGGAAGAGAGTACCTCAAAAGGCGAGTCAATGGCGGAGAAACGAAACTTATTAGGAGGCCATTTGTAAGGGGCTATGGAAGCTCTAAGTGTTCCATCGATTTCTCTCCAGAATTTCTCAACCCTAAACAACAATCTTCTTTTCAGAAACCATCAAATTCTCTCTACAATAGATAAATGTTCAAGTTCAAAGCAATTGAAGGAAGTTCACGCTCGCATGCTCCGTACCGGTCTCTTCTTCGACCCCTTTTCGGCTAGCAAACTCTTCACAGCCTCCGCTCTTTCGTCCTTCTCCACTCTCGACTATGCCCGCAACTTGTTCGACCAAATTCCCCAACCAAATCTCTACACTTGGAACACCCTCATTCGAGCTTACGCTTCCAGCTCCGACCCTTTTCAGAGTTTCGTAATATTTCTGGATTTGCTTCATAAATGTGAGGATTTGCCTAATAAGTTCACTTTCCCGTTTGTTATTAAGGCCGCTTCGGAGCTTAAAGCTTCACGGGTTGGGACAGCTGTTCATGGAATGGCGATTAAGTTGTCGTTTGGTATGGATCTTTATATCCTTAATTCTCTTGTGCGATTCTATGGGGCATGTGGCGATTTGAGTATGGCTGAGCGATTGTTTAAGGGTATTTCTTGCAAAGATGTAGTGTCTTGGAATTCCATGATTTCGGCTTTTGCTCAGGGTAACTGTCCCGAAGATGCATTGGAGTTGTTTTTGAAAATGGAGAGGGAAAATGTGATGCCTAATTCTGTAACAATGGTGGGTGTTTTATCTGCTTGTGCGAAGAAGTTGGATTTGGAATTTGGGAGGTGGGTTTGTTCGTACATTGAAAGGAAAGGGATCAAAGTGGATTTAACTTTATGTAACGCCATGCTTGACATGTATACAAAGTGTGGAAGTGTTGATGATGCACAGAAGCTGTTTGACGAAATGCCTGAAAGAGATGTCTTCTCTTGGACCATCATGCTTGATGGGTATGCGAAAATGGGCGACTACGATGCTGCTCGGCTAGTGTTCAATGCAATGCCTGTGAAAGAAATTGCTGCTTGGAATGTTCTCATATCTGCTTATGAACAAAATGGTAAACCTAAGGAAGCTTTGGCCATTTTTAATGAGTTGCAGCTCAGTAAGATTGCAAAGCCTGATGAAGTCACTTTAGTTAGTACTCTGTCAGCTTGTGCTCAATTGGGAGCAATTGATTTGGGTGGATGGATTCATGTGTACACAAAAAGGGAAGGGATAGTTCTAAATTGCCATTTAATTTCTTCTCTTGTGGACATGTATGCTAAATGTGGTTCTTTAGAGAAAGCTCTCGAGGTGTTCTATTCAGTGGAGGAGAGAGATGTGTATGTTTGGAGTGCCATGATTGCTGGTTTGGGAATGCACGGCTGTGGGAAGGCGGCAATTGATCTATTCTTCAAAATGCAGGAAGCTAAGGTGAAGCCAAATAGTGTGACATTTACAAATGTATTATGTGCCTGTAGCCATGCTGGCTTAGTTGATGAGGGACGGGTGTTTTTCCATGAAATGGAGCCAGTTTATGGGGTTGTTCCTGAGATGAAGCACTATGCTTGTATGGTTGATATTCTCGGTCGTGCAGGGTTTCTTGAAGAAGCTATGGAGTTGATCAATGAAATGTCTACAACTCCAAGCGCATCCGTTTGGGGTGCTTTGCTTGGTGCTTGCAGCCTTCATATGAATGTTGAGCTTGGAGAATTAGCGAGTGACCAATTGCTAAAGTTGGAGCCTAGAAATCATGGTGCTATTGTACTTTTATCA
AACATATATGCTAAAACAGGAAGATGGGAAAAGGTTTCTGAGTTGAGGAAACTAATGAGAGACACTGAATTGAAAAAGGAACCTGGTTGTAGCTCCATTGAAGCCAACGGCAACGTCCACGAGTTTCTAGTGGGAGATAATACGCACCCGTTATCCAGTAACATCTATTCAAAGTTGGAGGAAATTGCAACAAAACTAAAATCAGTCGGTTACGAACCAAACAAATCCCATCTTCTCCAGCTCATCGAAGAGGACGACCTCAAGGAACAGGCCTTAAGCCTTCACAGCGAGAAGTTAGCCATCGCTTTTGGGCTTGTTACTTTGGCTCCATCTCAACCAATTCGAGTTGTGAAGAATCTTCGGATTTGTGGAGACTGCCATGCATTTGCTAAGCTTGTATCTAGAGTTTACGACAGAGATATATTACTTCGAGATCGATATCGATTCCATCATTTCCGAGATGGGCATTGTTCGTGTATGGATTACTGGTAAAGCTGCAAAATAATGGATTCTACGTTCACTTGCTCTGTTTGGTGCATGAGTTGAACTTCACAACAATAGCCACAACCTTTATGGACTTTTGTAACTACATCAATTTGAAATTTGGGAAAGCCTCGTGTAAATTTTGATTTAAATTTTCAGTTAGAATTTCCTTTAGAAAATT

Coding sequence (CDS)

ATGGAAGCTCTAAGTGTTCCATCGATTTCTCTCCAGAATTTCTCAACCCTAAACAACAATCTTCTTTTCAGAAACCATCAAATTCTCTCTACAATAGATAAATGTTCAAGTTCAAAGCAATTGAAGGAAGTTCACGCTCGCATGCTCCGTACCGGTCTCTTCTTCGACCCCTTTTCGGCTAGCAAACTCTTCACAGCCTCCGCTCTTTCGTCCTTCTCCACTCTCGACTATGCCCGCAACTTGTTCGACCAAATTCCCCAACCAAATCTCTACACTTGGAACACCCTCATTCGAGCTTACGCTTCCAGCTCCGACCCTTTTCAGAGTTTCGTAATATTTCTGGATTTGCTTCATAAATGTGAGGATTTGCCTAATAAGTTCACTTTCCCGTTTGTTATTAAGGCCGCTTCGGAGCTTAAAGCTTCACGGGTTGGGACAGCTGTTCATGGAATGGCGATTAAGTTGTCGTTTGGTATGGATCTTTATATCCTTAATTCTCTTGTGCGATTCTATGGGGCATGTGGCGATTTGAGTATGGCTGAGCGATTGTTTAAGGGTATTTCTTGCAAAGATGTAGTGTCTTGGAATTCCATGATTTCGGCTTTTGCTCAGGGTAACTGTCCCGAAGATGCATTGGAGTTGTTTTTGAAAATGGAGAGGGAAAATGTGATGCCTAATTCTGTAACAATGGTGGGTGTTTTATCTGCTTGTGCGAAGAAGTTGGATTTGGAATTTGGGAGGTGGGTTTGTTCGTACATTGAAAGGAAAGGGATCAAAGTGGATTTAACTTTATGTAACGCCATGCTTGACATGTATACAAAGTGTGGAAGTGTTGATGATGCACAGAAGCTGTTTGACGAAATGCCTGAAAGAGATGTCTTCTCTTGGACCATCATGCTTGATGGGTATGCGAAAATGGGCGACTACGATGCTGCTCGGCTAGTGTTCAATGCAATGCCTGTGAAAGAAATTGCTGCTTGGAATGTTCTCATATCTGCTTATGAACAAAATGGTAAACCTAAGGAAGCTTTGGCCATTTTTAATGAGTTGCAGCTCAGTAAGATTGCAAAGCCTGATGAAGTCACTTTAGTTAGTACTCTGTCAGCTTGTGCTCAATTGGGAGCAATTGATTTGGGTGGATGGATTCATGTGTACACAAAAAGGGAAGGGATAGTTCTAAATTGCCATTTAATTTCTTCTCTTGTGGACATGTATGCTAAATGTGGTTCTTTAGAGAAAGCTCTCGAGGTGTTCTATTCAGTGGAGGAGAGAGATGTGTATGTTTGGAGTGCCATGATTGCTGGTTTGGGAATGCACGGCTGTGGGAAGGCGGCAATTGATCTATTCTTCAAAATGCAGGAAGCTAAGGTGAAGCCAAATAGTGTGACATTTACAAATGTATTATGTGCCTGTAGCCATGCTGGCTTAGTTGATGAGGGACGGGTGTTTTTCCATGAAATGGAGCCAGTTTATGGGGTTGTTCCTGAGATGAAGCACTATGCTTGTATGGTTGATATTCTCGGTCGTGCAGGGTTTCTTGAAGAAGCTATGGAGTTGATCAATGAAATGTCTACAACTCCAAGCGCATCCGTTTGGGGTGCTTTGCTTGGTGCTTGCAGCCTTCATATGAATGTTGAGCTTGGAGAATTAGCGAGTGACCAATTGCTAAAGTTGGAGCCTAGAAATCATGGTGCTATTGTACTTTTATCAAACATATATGCTAAAACAGGAAGATGGGAAAAGGTTTCTGAGTTGAGGAAACTAATGAGAGACACTGAATTGAAAAAGGAACCTGGTTGTAGCTCCATTGAAGCCAACGGCAACGTCCACGAGTTTCTAGTGGGAGATAATACGCACCCGTTATCCAGTAACATCTATTCAAAGTTGGAGGAAATTGCAACAAAACTAAAATCAGTCGGTTACGAACCAAACAAATCCCATCTTCTCCAGCTCATCGAAGAGGACGACCTCAAGGAACAGGCCTTAAGCCTTCACAGCGA
GAAGTTAGCCATCGCTTTTGGGCTTGTTACTTTGGCTCCATCTCAACCAATTCGAGTTGTGAAGAATCTTCGGATTTGTGGAGACTGCCATGCATTTGCTAAGCTTGTATCTAGAGTTTACGACAGAGATATATTACTTCGAGATCGATATCGATTCCATCATTTCCGAGATGGGCATTGTTCGTGTATGGATTACTGGTAA

Protein sequence

MEALSVPSISLQNFSTLNNNLLFRNHQILSTIDKCSSSKQLKEVHARMLRTGLFFDPFSASKLFTASALSSFSTLDYARNLFDQIPQPNLYTWNTLIRAYASSSDPFQSFVIFLDLLHKCEDLPNKFTFPFVIKAASELKASRVGTAVHGMAIKLSFGMDLYILNSLVRFYGACGDLSMAERLFKGISCKDVVSWNSMISAFAQGNCPEDALELFLKMERENVMPNSVTMVGVLSACAKKLDLEFGRWVCSYIERKGIKVDLTLCNAMLDMYTKCGSVDDAQKLFDEMPERDVFSWTIMLDGYAKMGDYDAARLVFNAMPVKEIAAWNVLISAYEQNGKPKEALAIFNELQLSKIAKPDEVTLVSTLSACAQLGAIDLGGWIHVYTKREGIVLNCHLISSLVDMYAKCGSLEKALEVFYSVEERDVYVWSAMIAGLGMHGCGKAAIDLFFKMQEAKVKPNSVTFTNVLCACSHAGLVDEGRVFFHEMEPVYGVVPEMKHYACMVDILGRAGFLEEAMELINEMSTTPSASVWGALLGACSLHMNVELGELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDTELKKEPGCSSIEANGNVHEFLVGDNTHPLSSNIYSKLEEIATKLKSVGYEPNKSHLLQLIEEDDLKEQALSLHSEKLAIAFGLVTLAPSQPIRVVKNLRICGDCHAFAKLVSRVYDRDILLRDRYRFHHFRDGHCSCMDYW*
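
The relationship between the coding sequence and the protein sequence above can be spot-checked by translating a few codons. The sketch below assumes the standard genetic code (NCBI translation table 1), which is not stated on this page; `translate` and `CODON_TABLE` are hypothetical names, and only short excerpts from the start and end of the CDS are used.

```python
from itertools import product

# Standard genetic code (assumed: NCBI table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length must be a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 24 and last 12 nucleotides of the CDS shown above.
cds_start = "ATGGAAGCTCTAAGTGTTCCATCG"
cds_end = "GATTACTGGTAA"
print(translate(cds_start))  # MEALSVPS
print(translate(cds_end))    # DYW*
```

Both excerpts agree with the corresponding ends of the protein sequence, including the terminal `*` for the stop codon.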
Homology
BLAST of CSPI01G23820 vs. ExPASy Swiss-Prot
Match: O82380 (Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H33 PE=2 SV=1)

HSP 1 Score: 943.3 bits (2437), Expect = 1.6e-273
Identity = 448/727 (61.62%), Positives = 564/727 (77.58%), Query Frame = 0

Query: 7   PSISLQNFSTLNNNLLFRNHQILSTIDKCSSSKQLKEVHARMLRTGLFFDPFSASKLFTA 66
           P+ S  N  T NN       + +S I++C S +QLK+ H  M+RTG F DP+SASKLF  
Sbjct: 16  PNFSNPNQPTTNN----ERSRHISLIERCVSLRQLKQTHGHMIRTGTFSDPYSASKLFAM 75

Query: 67  SALSSFSTLDYARNLFDQIPQPNLYTWNTLIRAYASSSDPFQSFVIFLDLLHKCEDLPNK 126
           +ALSSF++L+YAR +FD+IP+PN + WNTLIRAYAS  DP  S   FLD++ + +  PNK
Sbjct: 76  AALSSFASLEYARKVFDEIPKPNSFAWNTLIRAYASGPDPVLSIWAFLDMVSESQCYPNK 135

Query: 127 FTFPFVIKAASELKASRVGTAVHGMAIKLSFGMDLYILNSLVRFYGACGDLSMAERLFKG 186
           +TFPF+IKAA+E+ +  +G ++HGMA+K + G D+++ NSL+  Y +CGDL  A ++F  
Sbjct: 136 YTFPFLIKAAAEVSSLSLGQSLHGMAVKSAVGSDVFVANSLIHCYFSCGDLDSACKVFTT 195

Query: 187 ISCKDVVSWNSMISAFAQGNCPEDALELFLKMERENVMPNSVTMVGVLSACAKKLDLEFG 246
           I  KDVVSWNSMI+ F Q   P+ ALELF KME E+V  + VTMVGVLSACAK  +LEFG
Sbjct: 196 IKEKDVVSWNSMINGFVQKGSPDKALELFKKMESEDVKASHVTMVGVLSACAKIRNLEFG 255

Query: 247 RWVCSYIERKGIKVDLTLCNAMLDMYTKCGSVDDAQKLFDEMPERDVFSWTIMLDGYAKM 306
           R VCSYIE   + V+LTL NAMLDMYTKCGS++DA++LFD M E+D  +WT MLDGYA  
Sbjct: 256 RQVCSYIEENRVNVNLTLANAMLDMYTKCGSIEDAKRLFDAMEEKDNVTWTTMLDGYAIS 315

Query: 307 GDYDAARLVFNAMPVKEIAAWNVLISAYEQNGKPKEALAIFNELQLSKIAKPDEVTLVST 366
            DY+AAR V N+MP K+I AWN LISAYEQNGKP EAL +F+ELQL K  K +++TLVST
Sbjct: 316 EDYEAAREVLNSMPQKDIVAWNALISAYEQNGKPNEALIVFHELQLQKNMKLNQITLVST 375

Query: 367 LSACAQLGAIDLGGWIHVYTKREGIVLNCHLISSLVDMYAKCGSLEKALEVFYSVEERDV 426
           LSACAQ+GA++LG WIH Y K+ GI +N H+ S+L+ MY+KCG LEK+ EVF SVE+RDV
Sbjct: 376 LSACAQVGALELGRWIHSYIKKHGIRMNFHVTSALIHMYSKCGDLEKSREVFNSVEKRDV 435

Query: 427 YVWSAMIAGLGMHGCGKAAIDLFFKMQEAKVKPNSVTFTNVLCACSHAGLVDEGRVFFHE 486
           +VWSAMI GL MHGCG  A+D+F+KMQEA VKPN VTFTNV CACSH GLVDE    FH+
Sbjct: 436 FVWSAMIGGLAMHGCGNEAVDMFYKMQEANVKPNGVTFTNVFCACSHTGLVDEAESLFHQ 495

Query: 487 MEPVYGVVPEMKHYACMVDILGRAGFLEEAMELINEMSTTPSASVWGALLGACSLHMNVE 546
           ME  YG+VPE KHYAC+VD+LGR+G+LE+A++ I  M   PS SVWGALLGAC +H N+ 
Sbjct: 496 MESNYGIVPEEKHYACIVDVLGRSGYLEKAVKFIEAMPIPPSTSVWGALLGACKIHANLN 555

Query: 547 LGELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDTELKKEPGCSSIEAN 606
           L E+A  +LL+LEPRN GA VLLSNIYAK G+WE VSELRK MR T LKKEPGCSSIE +
Sbjct: 556 LAEMACTRLLELEPRNDGAHVLLSNIYAKLGKWENVSELRKHMRVTGLKKEPGCSSIEID 615

Query: 607 GNVHEFLVGDNTHPLSSNIYSKLEEIATKLKSVGYEPNKSHLLQLIEEDDLKEQALSLHS 666
           G +HEFL GDN HP+S  +Y KL E+  KLKS GYEP  S +LQ+IEE+++KEQ+L+LHS
Sbjct: 616 GMIHEFLSGDNAHPMSEKVYGKLHEVMEKLKSNGYEPEISQVLQIIEEEEMKEQSLNLHS 675

Query: 667 EKLAIAFGLVTLAPSQPIRVVKNLRICGDCHAFAKLVSRVYDRDILLRDRYRFHHFRDGH 726
           EKLAI +GL++    + IRV+KNLR+CGDCH+ AKL+S++YDR+I++RDRYRFHHFR+G 
Sbjct: 676 EKLAICYGLISTEAPKVIRVIKNLRVCGDCHSVAKLISQLYDREIIVRDRYRFHHFRNGQ 735

Query: 727 CSCMDYW 734
           CSC D+W
Sbjct: 736 CSCNDFW 738
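
The Identity and Positives percentages in an HSP header are the ratio of matching (or conservatively substituted) columns to the alignment length. A minimal sketch that recomputes them from the counts reported for the O82380 hit follows; the helper name `percent` is hypothetical, and rounding to two decimals is an assumption about how the report formats the values.

```python
def percent(count: int, aligned: int) -> float:
    """Share of alignment columns, as a percentage rounded to two decimals."""
    if aligned <= 0:
        raise ValueError("alignment length must be positive")
    return round(100.0 * count / aligned, 2)


# Counts taken from the HSP header: 448 identical and 564 positive columns out of 727.
identity = percent(448, 727)
positives = percent(564, 727)
print(identity, positives)  # 61.62 77.58
```

The denominator is the alignment length (727 columns, gaps included), not the full query length of 733 residues, so the figures describe the aligned region only.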

BLAST of CSPI01G23820 vs. ExPASy Swiss-Prot
Match: Q9LN01 (Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H12 PE=2 SV=1)

HSP 1 Score: 602.1 bits (1551), Expect = 8.7e-171
Identity = 312/744 (41.94%), Positives = 448/744 (60.22%), Query Frame = 0

Query: 24  RNHQILSTIDKCSSSKQLKEVHARMLRTGLFFDPFSASKLFTASALS-SFSTLDYARNLF 83
           RNH  LS +  C + + L+ +HA+M++ GL    ++ SKL     LS  F  L YA ++F
Sbjct: 32  RNHPSLSLLHNCKTLQSLRIIHAQMIKIGLHNTNYALSKLIEFCILSPHFEGLPYAISVF 91

Query: 84  DQIPQPNLYTWNTLIRAYASSSDPFQSFVIFLDLLHKCEDLPNKFTFPFVIKAASELKAS 143
             I +PNL  WNT+ R +A SSDP  +  +++ ++     LPN +TFPFV+K+ ++ KA 
Sbjct: 92  KTIQEPNLLIWNTMFRGHALSSDPVSALKLYVCMI-SLGLLPNSYTFPFVLKSCAKSKAF 151

Query: 144 RVGTAVHGMAIKLSFGMDLYILNS-------------------------------LVRFY 203
           + G  +HG  +KL   +DLY+  S                               L++ Y
Sbjct: 152 KEGQQIHGHVLKLGCDLDLYVHTSLISMYVQNGRLEDAHKVFDKSPHRDVVSYTALIKGY 211

Query: 204 GACGDLSMAERLFKGISCKDVVSWNSMISAFAQGNCPEDALELFLKMERENVMPNSVTMV 263
            + G +  A++LF  I  KDVVSWN+MIS +A+    ++ALELF  M + NV P+  TMV
Sbjct: 212 ASRGYIENAQKLFDEIPVKDVVSWNAMISGYAETGNYKEALELFKDMMKTNVRPDESTMV 271

Query: 264 GVLSACAKKLDLEFGRWVCSYIERKGIKVDLTLCNAMLDMYTKCGSVDDAQKLFDEMPER 323
            V+SACA+   +E GR V  +I+  G   +L + NA++D+Y+KCG ++ A  LF+ +P +
Sbjct: 272 TVVSACAQSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSKCGELETACGLFERLPYK 331

Query: 324 DVFSWTIMLDGYAKMGDYDAARLVFNAMPVKEIAAWNVLISAYEQNGKPKEALAIFNELQ 383
           DV SW  ++ GY  M  Y                               KEAL +F E+ 
Sbjct: 332 DVISWNTLIGGYTHMNLY-------------------------------KEALLLFQEM- 391

Query: 384 LSKIAKPDEVTLVSTLSACAQLGAIDLGGWIHVYTKR--EGIVLNCHLISSLVDMYAKCG 443
           L     P++VT++S L ACA LGAID+G WIHVY  +  +G+     L +SL+DMYAKCG
Sbjct: 392 LRSGETPNDVTMLSILPACAHLGAIDIGRWIHVYIDKRLKGVTNASSLRTSLIDMYAKCG 451

Query: 444 SLEKALEVFYSVEERDVYVWSAMIAGLGMHGCGKAAIDLFFKMQEAKVKPNSVTFTNVLC 503
            +E A +VF S+  + +  W+AMI G  MHG   A+ DLF +M++  ++P+ +TF  +L 
Sbjct: 452 DIEAAHQVFNSILHKSLSSWNAMIFGFAMHGRADASFDLFSRMRKIGIQPDDITFVGLLS 511

Query: 504 ACSHAGLVDEGRVFFHEMEPVYGVVPEMKHYACMVDILGRAGFLEEAMELINEMSTTPSA 563
           ACSH+G++D GR  F  M   Y + P+++HY CM+D+LG +G  +EA E+IN M   P  
Sbjct: 512 ACSHSGMLDLGRHIFRTMTQDYKMTPKLEHYGCMIDLLGHSGLFKEAEEMINMMEMEPDG 571

Query: 564 SVWGALLGACSLHMNVELGELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLM 623
            +W +LL AC +H NVELGE  ++ L+K+EP N G+ VLLSNIYA  GRW +V++ R L+
Sbjct: 572 VIWCSLLKACKMHGNVELGESFAENLIKIEPENPGSYVLLSNIYASAGRWNEVAKTRALL 631

Query: 624 RDTELKKEPGCSSIEANGNVHEFLVGDNTHPLSSNIYSKLEEIATKLKSVGYEPNKSHLL 683
            D  +KK PGCSSIE +  VHEF++GD  HP +  IY  LEE+   L+  G+ P+ S +L
Sbjct: 632 NDKGMKKVPGCSSIEIDSVVHEFIIGDKFHPRNREIYGMLEEMEVLLEKAGFVPDTSEVL 691

Query: 684 QLIEEDDLKEQALSLHSEKLAIAFGLVTLAPSQPIRVVKNLRICGDCHAFAKLVSRVYDR 734
           Q +EE + KE AL  HSEKLAIAFGL++  P   + +VKNLR+C +CH   KL+S++Y R
Sbjct: 692 QEMEE-EWKEGALRHHSEKLAIAFGLISTKPGTKLTIVKNLRVCRNCHEATKLISKIYKR 741

BLAST of CSPI01G23820 vs. ExPASy Swiss-Prot
Match: O23337 (Pentatricopeptide repeat-containing protein At4g14820 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H3 PE=2 SV=1)

HSP 1 Score: 557.8 bits (1436), Expect = 1.9e-157
Identity = 278/713 (38.99%), Positives = 437/713 (61.29%), Query Frame = 0

Query: 28  ILSTIDKCSSSKQLKEVHARMLRTGLFFDPFSASKLFTASALSSFSTLDYARNLFDQIPQ 87
           IL  +  C S   +K++HA +LRT    +    S LF  S  SS   L YA N+F  IP 
Sbjct: 15  ILEKLSFCKSLNHIKQLHAHILRT--VINHKLNSFLFNLSVSSSSINLSYALNVFSSIPS 74

Query: 88  -PNLYTWNTLIRAYASSSDPFQSFVIFLDLLHKCEDLPNKFTFPFVIKAASELKASRVGT 147
            P    +N  +R  + SS+P ++ ++F   +       ++F+F  ++KA S++ A   G 
Sbjct: 75  PPESIVFNPFLRDLSRSSEP-RATILFYQRIRHVGGRLDQFSFLPILKAVSKVSALFEGM 134

Query: 148 AVHGMAIKLSFGMDLYILNSLVRFYGACGDLSMAERLFKGISCKDVVSWNSMISAFAQGN 207
            +HG+A K++   D ++    +  Y +CG ++ A  +F  +S +DVV+WN+MI  + +  
Sbjct: 135 ELHGVAFKIATLCDPFVETGFMDMYASCGRINYARNVFDEMSHRDVVTWNTMIERYCRFG 194

Query: 208 CPEDALELFLKMERENVMPNSVTMVGVLSACAKKLDLEFGRWVCSYIERKGIKVDLTLCN 267
             ++A +LF +M+  NVMP+ + +  ++SAC +  ++ + R +  ++    +++D  L  
Sbjct: 195 LVDEAFKLFEEMKDSNVMPDEMILCNIVSACGRTGNMRYNRAIYEFLIENDVRMDTHLLT 254

Query: 268 AMLDMYTKCGSVDDAQKLFDEMPERDVFSWTIMLDGYAKMGDYDAARLVFNAMPVKEIAA 327
           A++ MY   G +D A++ F +M  R++F  T M+ GY+K G  D A+++F+    K++  
Sbjct: 255 ALVTMYAGAGCMDMAREFFRKMSVRNLFVSTAMVSGYSKCGRLDDAQVIFDQTEKKDLVC 314

Query: 328 WNVLISAYEQNGKPKEALAIFNELQLSKIAKPDEVTLVSTLSACAQLGAIDLGGWIHVYT 387
           W  +ISAY ++  P+EAL +F E+  S I KPD V++ S +SACA LG +D   W+H   
Sbjct: 315 WTTMISAYVESDYPQEALRVFEEMCCSGI-KPDVVSMFSVISACANLGILDKAKWVHSCI 374

Query: 388 KREGIVLNCHLISSLVDMYAKCGSLEKALEVFYSVEERDVYVWSAMIAGLGMHGCGKAAI 447
              G+     + ++L++MYAKCG L+   +VF  +  R+V  WS+MI  L MHG    A+
Sbjct: 375 HVNGLESELSINNALINMYAKCGGLDATRDVFEKMPRRNVVSWSSMINALSMHGEASDAL 434

Query: 448 DLFFKMQEAKVKPNSVTFTNVLCACSHAGLVDEGRVFFHEMEPVYGVVPEMKHYACMVDI 507
            LF +M++  V+PN VTF  VL  CSH+GLV+EG+  F  M   Y + P+++HY CMVD+
Sbjct: 435 SLFARMKQENVEPNEVTFVGVLYGCSHSGLVEEGKKIFASMTDEYNITPKLEHYGCMVDL 494

Query: 508 LGRAGFLEEAMELINEMSTTPSASVWGALLGACSLHMNVELGELASDQLLKLEPRNHGAI 567
            GRA  L EA+E+I  M    +  +WG+L+ AC +H  +ELG+ A+ ++L+LEP + GA+
Sbjct: 495 FGRANLLREALEVIESMPVASNVVIWGSLMSACRIHGELELGKFAAKRILELEPDHDGAL 554

Query: 568 VLLSNIYAKTGRWEKVSELRKLMRDTELKKEPGCSSIEANGNVHEFLVGDNTHPLSSNIY 627
           VL+SNIYA+  RWE V  +R++M +  + KE G S I+ NG  HEFL+GD  H  S+ IY
Sbjct: 555 VLMSNIYAREQRWEDVRNIRRVMEEKNVFKEKGLSRIDQNGKSHEFLIGDKRHKQSNEIY 614

Query: 628 SKLEEIATKLKSVGYEPNKSHLLQLIEEDDLKEQALSLHSEKLAIAFGLVTLAPSQP--- 687
           +KL+E+ +KLK  GY P+   +L  +EE++ K+  L  HSEKLA+ FGL+     +    
Sbjct: 615 AKLDEVVSKLKLAGYVPDCGSVLVDVEEEEKKDLVL-WHSEKLALCFGLMNEEKEEEKDS 674

Query: 688 ---IRVVKNLRICGDCHAFAKLVSRVYDRDILLRDRYRFHHFRDGHCSCMDYW 734
              IR+VKNLR+C DCH F KLVS+VY+R+I++RDR RFH +++G CSC DYW
Sbjct: 675 CGVIRIVKNLRVCEDCHLFFKLVSKVYEREIIVRDRTRFHCYKNGLCSCRDYW 722

BLAST of CSPI01G23820 vs. ExPASy Swiss-Prot
Match: Q9LTV8 (Pentatricopeptide repeat-containing protein At3g12770 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H43 PE=2 SV=1)

HSP 1 Score: 538.1 bits (1385), Expect = 1.5e-151
Identity = 283/727 (38.93%), Positives = 431/727 (59.28%), Query Frame = 0

Query: 15  STLNNNLLFRNHQI------LSTIDKCSSSKQLKEVHARMLRTGLFFDPFSASKLFTASA 74
           S L + LL+ N  I       S ID  +   QLK++HAR+L  GL F  F  +KL  AS 
Sbjct: 5   SCLASPLLYTNSGIHSDSFYASLIDSATHKAQLKQIHARLLVLGLQFSGFLITKLIHAS- 64

Query: 75  LSSFSTLDYARNLFDQIPQPNLYTWNTLIRAYASSSDPFQSFVIFLDLLHKCEDLPNKFT 134
            SSF  + +AR +FD +P+P ++ WN +IR Y S ++ FQ  ++    +      P+ FT
Sbjct: 65  -SSFGDITFARQVFDDLPRPQIFPWNAIIRGY-SRNNHFQDALLMYSNMQLARVSPDSFT 124

Query: 135 FPFVIKAASELKASRVGTAVHGMAIKLSFGMDLYILNSLVRFYGACGDLSMAERLFKGIS 194
           FP ++KA S L   ++G  VH    +L F  D+++ N L+  Y  C  L  A  +F+G+ 
Sbjct: 125 FPHLLKACSGLSHLQMGRFVHAQVFRLGFDADVFVQNGLIALYAKCRRLGSARTVFEGLP 184

Query: 195 C--KDVVSWNSMISAFAQGNCPEDALELFLKMERENVMPNSVTMVGVLSACAKKLDLEFG 254
              + +VSW +++SA+AQ   P +ALE+F +M + +V P+ V +V VL+A     DL+ G
Sbjct: 185 LPERTIVSWTAIVSAYAQNGEPMEALEIFSQMRKMDVKPDWVALVSVLNAFTCLQDLKQG 244

Query: 255 RWVCSYIERKGIKVDLTLCNAMLDMYTKCGSVDDAQKLFDEMPERDVFSWTIMLDGYAKM 314
           R + + + + G++++  L  ++  MY KCG V  A+ LFD+M   ++  W  M+ GYAK 
Sbjct: 245 RSIHASVVKMGLEIEPDLLISLNTMYAKCGQVATAKILFDKMKSPNLILWNAMISGYAK- 304

Query: 315 GDYDAARLVFNAMPVKEIAAWNVLISAYEQNGKPKEALAIFNELQLSKIAKPDEVTLVST 374
                                         NG  +EA+ +F+E+ ++K  +PD +++ S 
Sbjct: 305 ------------------------------NGYAREAIDMFHEM-INKDVRPDTISITSA 364

Query: 375 LSACAQLGAIDLGGWIHVYTKREGIVLNCHLISSLVDMYAKCGSLEKALEVFYSVEERDV 434
           +SACAQ+G+++    ++ Y  R     +  + S+L+DM+AKCGS+E A  VF    +RDV
Sbjct: 365 ISACAQVGSLEQARSMYEYVGRSDYRDDVFISSALIDMFAKCGSVEGARLVFDRTLDRDV 424

Query: 435 YVWSAMIAGLGMHGCGKAAIDLFFKMQEAKVKPNSVTFTNVLCACSHAGLVDEGRVFFHE 494
            VWSAMI G G+HG  + AI L+  M+   V PN VTF  +L AC+H+G+V EG  FF+ 
Sbjct: 425 VVWSAMIVGYGLHGRAREAISLYRAMERGGVHPNDVTFLGLLMACNHSGMVREGWWFFNR 484

Query: 495 MEPVYGVVPEMKHYACMVDILGRAGFLEEAMELINEMSTTPSASVWGALLGACSLHMNVE 554
           M   + + P+ +HYAC++D+LGRAG L++A E+I  M   P  +VWGALL AC  H +VE
Sbjct: 485 MAD-HKINPQQQHYACVIDLLGRAGHLDQAYEVIKCMPVQPGVTVWGALLSACKKHRHVE 544

Query: 555 LGELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDTELKKEPGCSSIEAN 614
           LGE A+ QL  ++P N G  V LSN+YA    W++V+E+R  M++  L K+ GCS +E  
Sbjct: 545 LGEYAAQQLFSIDPSNTGHYVQLSNLYAAARLWDRVAEVRVRMKEKGLNKDVGCSWVEVR 604

Query: 615 GNVHEFLVGDNTHPLSSNIYSKLEEIATKLKSVGYEPNKSHLLQLIEEDDLKEQALSLHS 674
           G +  F VGD +HP    I  ++E I ++LK  G+  NK   L  + +++  E+ L  HS
Sbjct: 605 GRLEAFRVGDKSHPRYEEIERQVEWIESRLKEGGFVANKDASLHDLNDEE-AEETLCSHS 664

Query: 675 EKLAIAFGLVTLAPSQPIRVVKNLRICGDCHAFAKLVSRVYDRDILLRDRYRFHHFRDGH 734
           E++AIA+GL++     P+R+ KNLR C +CHA  KL+S++ DR+I++RD  RFHHF+DG 
Sbjct: 665 ERIAIAYGLISTPQGTPLRITKNLRACVNCHAATKLISKLVDREIVVRDTNRFHHFKDGV 694

BLAST of CSPI01G23820 vs. ExPASy Swiss-Prot
Match: Q9FI80 (Pentatricopeptide repeat-containing protein At5g48910 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H38 PE=2 SV=1)

HSP 1 Score: 523.5 bits (1347), Expect = 3.9e-147
Identity = 274/707 (38.76%), Positives = 413/707 (58.42%), Query Frame = 0

Query: 32  IDKCSSSKQLKEVHARMLRTGLFFDPFSASKL--FTASALSSFSTLDYARNLFDQIPQPN 91
           I+ C + + L ++HA  +++G   D  +A+++  F A++      LDYA  +F+Q+PQ N
Sbjct: 30  INNCRTIRDLSQIHAVFIKSGQMRDTLAAAEILRFCATSDLHHRDLDYAHKIFNQMPQRN 89

Query: 92  LYTWNTLIRAYASSSD--PFQSFVIFLDLLHKCEDLPNKFTFPFVIKAASELKASRVGTA 151
            ++WNT+IR ++ S +     +  +F +++      PN+FTFP V+KA ++    + G  
Sbjct: 90  CFSWNTIIRGFSESDEDKALIAITLFYEMMSDEFVEPNRFTFPSVLKACAKTGKIQEGKQ 149

Query: 152 VHGMAIKLSFGMDLYILNSLVRFYGACGDLSMAERLF-KGISCKDVVSWNSMISAFAQGN 211
           +HG+A+K  FG D +++++LVR Y  CG +  A  LF K I  KD+V             
Sbjct: 150 IHGLALKYGFGGDEFVMSNLVRMYVMCGFMKDARVLFYKNIIEKDMV------------- 209

Query: 212 CPEDALELFLKMERENVMPNSVTMVGVLSACAKKLDLEFGRWVCSYIERKGIKVDLTLCN 271
                                                                       
Sbjct: 210 ------------------------------------------------------------ 269

Query: 272 AMLDMYTKCGSVDDAQKLFDEMPERDVFSWTIMLDGYAKMGDYDAARLVFNAMPVKEIAA 331
            M D   + G               ++  W +M+DGY ++GD  AAR++F+ M  + + +
Sbjct: 270 VMTDRRKRDG---------------EIVLWNVMIDGYMRLGDCKAARMLFDKMRQRSVVS 329

Query: 332 WNVLISAYEQNGKPKEALAIFNELQLSKIAKPDEVTLVSTLSACAQLGAIDLGGWIHVYT 391
           WN +IS Y  NG  K+A+ +F E++   I +P+ VTLVS L A ++LG+++LG W+H+Y 
Sbjct: 330 WNTMISGYSLNGFFKDAVEVFREMKKGDI-RPNYVTLVSVLPAISRLGSLELGEWLHLYA 389

Query: 392 KREGIVLNCHLISSLVDMYAKCGSLEKALEVFYSVEERDVYVWSAMIAGLGMHGCGKAAI 451
           +  GI ++  L S+L+DMY+KCG +EKA+ VF  +   +V  WSAMI G  +HG    AI
Sbjct: 390 EDSGIRIDDVLGSALIDMYSKCGIIEKAIHVFERLPRENVITWSAMINGFAIHGQAGDAI 449

Query: 452 DLFFKMQEAKVKPNSVTFTNVLCACSHAGLVDEGRVFFHEMEPVYGVVPEMKHYACMVDI 511
           D F KM++A V+P+ V + N+L ACSH GLV+EGR +F +M  V G+ P ++HY CMVD+
Sbjct: 450 DCFCKMRQAGVRPSDVAYINLLTACSHGGLVEEGRRYFSQMVSVDGLEPRIEHYGCMVDL 509

Query: 512 LGRAGFLEEAMELINEMSTTPSASVWGALLGACSLHMNVELGELASDQLLKLEPRNHGAI 571
           LGR+G L+EA E I  M   P   +W ALLGAC +  NVE+G+  ++ L+ + P + GA 
Sbjct: 510 LGRSGLLDEAEEFILNMPIKPDDVIWKALLGACRMQGNVEMGKRVANILMDMVPHDSGAY 569

Query: 572 VLLSNIYAKTGRWEKVSELRKLMRDTELKKEPGCSSIEANGNVHEFLVGDNTHPLSSNIY 631
           V LSN+YA  G W +VSE+R  M++ +++K+PGCS I+ +G +HEF+V D++HP +  I 
Sbjct: 570 VALSNMYASQGNWSEVSEMRLRMKEKDIRKDPGCSLIDIDGVLHEFVVEDDSHPKAKEIN 629

Query: 632 SKLEEIATKLKSVGYEPNKSHLLQLIEEDDLKEQALSLHSEKLAIAFGLVTLAPSQPIRV 691
           S L EI+ KL+  GY P  + +L  +EE+D KE  L  HSEK+A AFGL++ +P +PIR+
Sbjct: 630 SMLVEISDKLRLAGYRPITTQVLLNLEEED-KENVLHYHSEKIATAFGLISTSPGKPIRI 646

Query: 692 VKNLRICGDCHAFAKLVSRVYDRDILLRDRYRFHHFRDGHCSCMDYW 734
           VKNLRIC DCH+  KL+S+VY R I +RDR RFHHF+DG CSCMDYW
Sbjct: 690 VKNLRICEDCHSSIKLISKVYKRKITVRDRKRFHHFQDGSCSCMDYW 646

BLAST of CSPI01G23820 vs. ExPASy TrEMBL
Match: A0A0A0M0R9 (DYW_deaminase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_1G530130 PE=3 SV=1)

HSP 1 Score: 1468.4 bits (3800), Expect = 0.0e+00
Identity = 729/733 (99.45%), Positives = 730/733 (99.59%), Query Frame = 0

Query: 1   MEALSVPSISLQNFSTLNNNLLFRNHQILSTIDKCSSSKQLKEVHARMLRTGLFFDPFSA 60
           MEALSVPSISLQNFSTLNNNLLFRNHQILSTIDKCSSSKQLKEVHARMLRTGLFFDPFSA
Sbjct: 1   MEALSVPSISLQNFSTLNNNLLFRNHQILSTIDKCSSSKQLKEVHARMLRTGLFFDPFSA 60

Query: 61  SKLFTASALSSFSTLDYARNLFDQIPQPNLYTWNTLIRAYASSSDPFQSFVIFLDLLHKC 120
           SKLFTASALSSFSTLDYARNLFDQIPQPNLYTWNTLIRAYASSSDPFQSFVIFLDLL KC
Sbjct: 61  SKLFTASALSSFSTLDYARNLFDQIPQPNLYTWNTLIRAYASSSDPFQSFVIFLDLLDKC 120

Query: 121 EDLPNKFTFPFVIKAASELKASRVGTAVHGMAIKLSFGMDLYILNSLVRFYGACGDLSMA 180
           EDLPNKFTFPFVIKAASELKASRVGTAVHGMAIKLSFGMDLYILNSLVRFYGACGDLSMA
Sbjct: 121 EDLPNKFTFPFVIKAASELKASRVGTAVHGMAIKLSFGMDLYILNSLVRFYGACGDLSMA 180

Query: 181 ERLFKGISCKDVVSWNSMISAFAQGNCPEDALELFLKMERENVMPNSVTMVGVLSACAKK 240
           ERLFKGISCKDVVSWNSMISAFAQGNCPEDALELFLKMERENVMPNSVTMVGVLSACAKK
Sbjct: 181 ERLFKGISCKDVVSWNSMISAFAQGNCPEDALELFLKMERENVMPNSVTMVGVLSACAKK 240

Query: 241 LDLEFGRWVCSYIERKGIKVDLTLCNAMLDMYTKCGSVDDAQKLFDEMPERDVFSWTIML 300
           LDLEFGRWVCSYIERKGIKVDLTLCNAMLDMYTKCGSVDDAQKLFDEMPERDVFSWTIML
Sbjct: 241 LDLEFGRWVCSYIERKGIKVDLTLCNAMLDMYTKCGSVDDAQKLFDEMPERDVFSWTIML 300

Query: 301 DGYAKMGDYDAARLVFNAMPVKEIAAWNVLISAYEQNGKPKEALAIFNELQLSKIAKPDE 360
           DGYAKMGDYDAARLVFNAMPVKEIAAWNVLISAYEQNGKPKEALAIFNELQLSKIAKPDE
Sbjct: 301 DGYAKMGDYDAARLVFNAMPVKEIAAWNVLISAYEQNGKPKEALAIFNELQLSKIAKPDE 360

Query: 361 VTLVSTLSACAQLGAIDLGGWIHVYTKREGIVLNCHLISSLVDMYAKCGSLEKALEVFYS 420
           VTLVSTLSACAQLGAIDLGGWIHVY KREGIVLNCHLISSLVDMYAKCGSLEKALEVFYS
Sbjct: 361 VTLVSTLSACAQLGAIDLGGWIHVYIKREGIVLNCHLISSLVDMYAKCGSLEKALEVFYS 420

Query: 421 VEERDVYVWSAMIAGLGMHGCGKAAIDLFFKMQEAKVKPNSVTFTNVLCACSHAGLVDEG 480
           VEERDVYVWSAMIAGLGMHG GKAAIDLFF+MQEAKVKPNSVTFTNVLCACSHAGLVDEG
Sbjct: 421 VEERDVYVWSAMIAGLGMHGRGKAAIDLFFEMQEAKVKPNSVTFTNVLCACSHAGLVDEG 480

Query: 481 RVFFHEMEPVYGVVPEMKHYACMVDILGRAGFLEEAMELINEMSTTPSASVWGALLGACS 540
           RVFFHEMEPVYGVVPEMKHYACMVDILGRAGFLEEAMELINEMSTTPSASVWGALLGACS
Sbjct: 481 RVFFHEMEPVYGVVPEMKHYACMVDILGRAGFLEEAMELINEMSTTPSASVWGALLGACS 540

Query: 541 LHMNVELGELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDTELKKEPGC 600
           LHMNVELGELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDTELKKEPGC
Sbjct: 541 LHMNVELGELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDTELKKEPGC 600

Query: 601 SSIEANGNVHEFLVGDNTHPLSSNIYSKLEEIATKLKSVGYEPNKSHLLQLIEEDDLKEQ 660
           SSIEANGNVHEFLVGDNTHPLSSNIYSKLEEIATKLKSVGYEPNKSHLLQLIEEDDLKEQ
Sbjct: 601 SSIEANGNVHEFLVGDNTHPLSSNIYSKLEEIATKLKSVGYEPNKSHLLQLIEEDDLKEQ 660

Query: 661 ALSLHSEKLAIAFGLVTLAPSQPIRVVKNLRICGDCHAFAKLVSRVYDRDILLRDRYRFH 720
           ALSLHSEKLAIAFGLVTLAPSQPIRVVKNLRICGDCHAFAKLVSRVYDRDILLRDRYRFH
Sbjct: 661 ALSLHSEKLAIAFGLVTLAPSQPIRVVKNLRICGDCHAFAKLVSRVYDRDILLRDRYRFH 720

Query: 721 HFRDGHCSCMDYW 734
           HFRDGHCSCMDYW
Sbjct: 721 HFRDGHCSCMDYW 733

BLAST of CSPI01G23820 vs. ExPASy TrEMBL
Match: A0A5D3BBW6 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold194G001260 PE=3 SV=1)

HSP 1 Score: 1422.1 bits (3680), Expect = 0.0e+00
Identity = 707/733 (96.45%), Positives = 714/733 (97.41%), Query Frame = 0

Query: 1   MEALSVPSISLQNFSTLNNNLLFRNHQILSTIDKCSSSKQLKEVHARMLRTGLFFDPFSA 60
           MEALSVP ISLQNFSTLNNNL FRNHQILS IDKCSSSKQLKEVHARMLRTGLFFDPFSA
Sbjct: 1   MEALSVPLISLQNFSTLNNNLPFRNHQILSAIDKCSSSKQLKEVHARMLRTGLFFDPFSA 60

Query: 61  SKLFTASALSSFSTLDYARNLFDQIPQPNLYTWNTLIRAYASSSDPFQSFVIFLDLLHKC 120
           SKLFTASALSSFSTLDYARN+FDQIPQPNLYTWN LIRAYASSSDPFQSFVIFLDLL KC
Sbjct: 61  SKLFTASALSSFSTLDYARNVFDQIPQPNLYTWNILIRAYASSSDPFQSFVIFLDLLDKC 120

Query: 121 EDLPNKFTFPFVIKAASELKASRVGTAVHGMAIKLSFGMDLYILNSLVRFYGACGDLSMA 180
           EDLPN FTFPFVIKAASELKASRVGTAVHGMAIKLSFGMDLYILNSLVRFYGACGDLSMA
Sbjct: 121 EDLPNNFTFPFVIKAASELKASRVGTAVHGMAIKLSFGMDLYILNSLVRFYGACGDLSMA 180

Query: 181 ERLFKGISCKDVVSWNSMISAFAQGNCPEDALELFLKMERENVMPNSVTMVGVLSACAKK 240
           ERLFKGISCKDVVSWNSMISAFAQGNCPEDALELFLKMERENVMPNSVTMV VLSACAKK
Sbjct: 181 ERLFKGISCKDVVSWNSMISAFAQGNCPEDALELFLKMERENVMPNSVTMVCVLSACAKK 240

Query: 241 LDLEFGRWVCSYIERKGIKVDLTLCNAMLDMYTKCGSVDDAQKLFDEMPERDVFSWTIML 300
           LDLEFGRWVCSYIERKGIK+DLTL NAMLDMYTKCGSVDDAQKLFDEMPERDVFSWTIML
Sbjct: 241 LDLEFGRWVCSYIERKGIKMDLTLRNAMLDMYTKCGSVDDAQKLFDEMPERDVFSWTIML 300

Query: 301 DGYAKMGDYDAARLVFNAMPVKEIAAWNVLISAYEQNGKPKEALAIFNELQLSKIAKPDE 360
           DGYAKMGDYDAAR VFNAMPVKEIAAWNVLISAYEQNGKPKEALAIFNELQLSKIAKPDE
Sbjct: 301 DGYAKMGDYDAARGVFNAMPVKEIAAWNVLISAYEQNGKPKEALAIFNELQLSKIAKPDE 360

Query: 361 VTLVSTLSACAQLGAIDLGGWIHVYTKREGIVLNCHLISSLVDMYAKCGSLEKALEVFYS 420
           VTLVSTLSACAQLGAIDLGGWIHVY KREGI LNCHLISSLVDMYAKCG+LEKALEVFYS
Sbjct: 361 VTLVSTLSACAQLGAIDLGGWIHVYIKREGIDLNCHLISSLVDMYAKCGALEKALEVFYS 420

Query: 421 VEERDVYVWSAMIAGLGMHGCGKAAIDLFFKMQEAKVKPNSVTFTNVLCACSHAGLVDEG 480
           VEERDVYVWSAMIAGLGMHG GKAAIDLFF+MQEAKVKPNSVTFTNVLCACSH GLVDEG
Sbjct: 421 VEERDVYVWSAMIAGLGMHGRGKAAIDLFFEMQEAKVKPNSVTFTNVLCACSHTGLVDEG 480

Query: 481 RVFFHEMEPVYGVVPEMKHYACMVDILGRAGFLEEAMELINEMSTTPSASVWGALLGACS 540
           RVFFHEMEPVYGVVPE KHYACMVDILGRAGFLEEAMELINEMS TPSASVWGALLGACS
Sbjct: 481 RVFFHEMEPVYGVVPETKHYACMVDILGRAGFLEEAMELINEMSITPSASVWGALLGACS 540

Query: 541 LHMNVELGELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDTELKKEPGC 600
           LHMNVELGELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDTELKKEPGC
Sbjct: 541 LHMNVELGELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDTELKKEPGC 600

Query: 601 SSIEANGNVHEFLVGDNTHPLSSNIYSKLEEIATKLKSVGYEPNKSHLLQLIEEDDLKEQ 660
           SSIE NGNVHEFLVGDN HPLSSNIYSKL++IATKLK VGYEPNKSHLLQLIEEDDLKEQ
Sbjct: 601 SSIEVNGNVHEFLVGDNLHPLSSNIYSKLDDIATKLKPVGYEPNKSHLLQLIEEDDLKEQ 660

Query: 661 ALSLHSEKLAIAFGLVTLAPSQPIRVVKNLRICGDCHAFAKLVSRVYDRDILLRDRYRFH 720
           ALSLHSEKLAIAFGLV+LAPSQPIRVVKNLRICGDCH FAKLVSRVYDRDILLRDRYRFH
Sbjct: 661 ALSLHSEKLAIAFGLVSLAPSQPIRVVKNLRICGDCHEFAKLVSRVYDRDILLRDRYRFH 720

Query: 721 HFRDGHCSCMDYW 734
           HFRDGHCSCMDYW
Sbjct: 721 HFRDGHCSCMDYW 733

BLAST of CSPI01G23820 vs. ExPASy TrEMBL
Match: A0A1S3C623 (pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Cucumis melo OX=3656 GN=LOC103497087 PE=3 SV=1)

HSP 1 Score: 1422.1 bits (3680), Expect = 0.0e+00
Identity = 707/733 (96.45%), Positives = 714/733 (97.41%), Query Frame = 0

Query: 1   MEALSVPSISLQNFSTLNNNLLFRNHQILSTIDKCSSSKQLKEVHARMLRTGLFFDPFSA 60
           MEALSVP ISLQNFSTLNNNL FRNHQILS IDKCSSSKQLKEVHARMLRTGLFFDPFSA
Sbjct: 1   MEALSVPLISLQNFSTLNNNLPFRNHQILSAIDKCSSSKQLKEVHARMLRTGLFFDPFSA 60

Query: 61  SKLFTASALSSFSTLDYARNLFDQIPQPNLYTWNTLIRAYASSSDPFQSFVIFLDLLHKC 120
           SKLFTASALSSFSTLDYARN+FDQIPQPNLYTWN LIRAYASSSDPFQSFVIFLDLL KC
Sbjct: 61  SKLFTASALSSFSTLDYARNVFDQIPQPNLYTWNILIRAYASSSDPFQSFVIFLDLLDKC 120

Query: 121 EDLPNKFTFPFVIKAASELKASRVGTAVHGMAIKLSFGMDLYILNSLVRFYGACGDLSMA 180
           EDLPN FTFPFVIKAASELKASRVGTAVHGMAIKLSFGMDLYILNSLVRFYGACGDLSMA
Sbjct: 121 EDLPNNFTFPFVIKAASELKASRVGTAVHGMAIKLSFGMDLYILNSLVRFYGACGDLSMA 180

Query: 181 ERLFKGISCKDVVSWNSMISAFAQGNCPEDALELFLKMERENVMPNSVTMVGVLSACAKK 240
           ERLFKGISCKDVVSWNSMISAFAQGNCPEDALELFLKMERENVMPNSVTMV VLSACAKK
Sbjct: 181 ERLFKGISCKDVVSWNSMISAFAQGNCPEDALELFLKMERENVMPNSVTMVCVLSACAKK 240

Query: 241 LDLEFGRWVCSYIERKGIKVDLTLCNAMLDMYTKCGSVDDAQKLFDEMPERDVFSWTIML 300
           LDLEFGRWVCSYIERKGIK+DLTL NAMLDMYTKCGSVDDAQKLFDEMPERDVFSWTIML
Sbjct: 241 LDLEFGRWVCSYIERKGIKMDLTLRNAMLDMYTKCGSVDDAQKLFDEMPERDVFSWTIML 300

Query: 301 DGYAKMGDYDAARLVFNAMPVKEIAAWNVLISAYEQNGKPKEALAIFNELQLSKIAKPDE 360
           DGYAKMGDYDAAR VFNAMPVKEIAAWNVLISAYEQNGKPKEALAIFNELQLSKIAKPDE
Sbjct: 301 DGYAKMGDYDAARGVFNAMPVKEIAAWNVLISAYEQNGKPKEALAIFNELQLSKIAKPDE 360

Query: 361 VTLVSTLSACAQLGAIDLGGWIHVYTKREGIVLNCHLISSLVDMYAKCGSLEKALEVFYS 420
           VTLVSTLSACAQLGAIDLGGWIHVY KREGI LNCHLISSLVDMYAKCG+LEKALEVFYS
Sbjct: 361 VTLVSTLSACAQLGAIDLGGWIHVYIKREGIDLNCHLISSLVDMYAKCGALEKALEVFYS 420

Query: 421 VEERDVYVWSAMIAGLGMHGCGKAAIDLFFKMQEAKVKPNSVTFTNVLCACSHAGLVDEG 480
           VEERDVYVWSAMIAGLGMHG GKAAIDLFF+MQEAKVKPNSVTFTNVLCACSH GLVDEG
Sbjct: 421 VEERDVYVWSAMIAGLGMHGRGKAAIDLFFEMQEAKVKPNSVTFTNVLCACSHTGLVDEG 480

Query: 481 RVFFHEMEPVYGVVPEMKHYACMVDILGRAGFLEEAMELINEMSTTPSASVWGALLGACS 540
           RVFFHEMEPVYGVVPE KHYACMVDILGRAGFLEEAMELINEMS TPSASVWGALLGACS
Sbjct: 481 RVFFHEMEPVYGVVPETKHYACMVDILGRAGFLEEAMELINEMSITPSASVWGALLGACS 540

Query: 541 LHMNVELGELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDTELKKEPGC 600
           LHMNVELGELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDTELKKEPGC
Sbjct: 541 LHMNVELGELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDTELKKEPGC 600

Query: 601 SSIEANGNVHEFLVGDNTHPLSSNIYSKLEEIATKLKSVGYEPNKSHLLQLIEEDDLKEQ 660
           SSIE NGNVHEFLVGDN HPLSSNIYSKL++IATKLK VGYEPNKSHLLQLIEEDDLKEQ
Sbjct: 601 SSIEVNGNVHEFLVGDNLHPLSSNIYSKLDDIATKLKPVGYEPNKSHLLQLIEEDDLKEQ 660

Query: 661 ALSLHSEKLAIAFGLVTLAPSQPIRVVKNLRICGDCHAFAKLVSRVYDRDILLRDRYRFH 720
           ALSLHSEKLAIAFGLV+LAPSQPIRVVKNLRICGDCH FAKLVSRVYDRDILLRDRYRFH
Sbjct: 661 ALSLHSEKLAIAFGLVSLAPSQPIRVVKNLRICGDCHEFAKLVSRVYDRDILLRDRYRFH 720

Query: 721 HFRDGHCSCMDYW 734
           HFRDGHCSCMDYW
Sbjct: 721 HFRDGHCSCMDYW 733
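
The percentages on each HSP summary line follow directly from the residue counts (for example, 707 identical positions out of 733 aligned gives 96.45%). The snippet below is a minimal sketch of how such a line could be parsed and the percentages recomputed as a sanity check. The function name `parse_hsp_stats` and the returned dictionary keys are illustrative assumptions, not part of any BLAST tool or of this database; the label for the positives field is matched loosely so either spelling is accepted.

```python
import re

# Pattern for the per-HSP summary line as it appears in this report.
# "Posi?tives" accepts both "Positives" and the variant "Postives".
HSP_STATS = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\),\s*"
    r"Posi?tives\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Parse one HSP summary line and recompute its percentages.

    Hypothetical helper for illustration only.
    """
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "aligned_length": int(ident_len),
        # Recomputed from the raw counts, rounded like the report.
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positives_pct": round(100 * int(pos) / int(pos_len), 2),
        # Values as printed in the report, kept for comparison.
        "reported_identity_pct": float(ident_pct),
        "reported_positives_pct": float(pos_pct),
    }


stats = parse_hsp_stats(
    "Identity = 707/733 (96.45%), Postives = 714/733 (97.41%), Query Frame = 0"
)
print(stats["identity_pct"], stats["positives_pct"])
```

Comparing the recomputed values with the reported ones is a cheap way to catch lines that were damaged during extraction.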

BLAST of CSPI01G23820 vs. ExPASy TrEMBL
Match: A0A5A7SKX2 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold848G00570 PE=3 SV=1)

HSP 1 Score: 1420.2 bits (3675), Expect = 0.0e+00
Identity = 706/733 (96.32%), Positives = 713/733 (97.27%), Query Frame = 0

Query: 1   MEALSVPSISLQNFSTLNNNLLFRNHQILSTIDKCSSSKQLKEVHARMLRTGLFFDPFSA 60
           MEALSVP ISLQNFSTLNNNL FRNHQILS IDKCSSSKQLKEVHARMLRTGLFFDPFSA
Sbjct: 1   MEALSVPLISLQNFSTLNNNLPFRNHQILSAIDKCSSSKQLKEVHARMLRTGLFFDPFSA 60

Query: 61  SKLFTASALSSFSTLDYARNLFDQIPQPNLYTWNTLIRAYASSSDPFQSFVIFLDLLHKC 120
           SKLFTASALSSFSTLDYARN+FDQIPQPNLYTWN LIRAYASSSDPFQSFVIFLDLL KC
Sbjct: 61  SKLFTASALSSFSTLDYARNVFDQIPQPNLYTWNILIRAYASSSDPFQSFVIFLDLLDKC 120

Query: 121 EDLPNKFTFPFVIKAASELKASRVGTAVHGMAIKLSFGMDLYILNSLVRFYGACGDLSMA 180
           EDLPN FTFPFVIKAASELKASRVGTAVHGMAIKLSFGMDLYILNSLVRFYGACGDLSMA
Sbjct: 121 EDLPNNFTFPFVIKAASELKASRVGTAVHGMAIKLSFGMDLYILNSLVRFYGACGDLSMA 180

Query: 181 ERLFKGISCKDVVSWNSMISAFAQGNCPEDALELFLKMERENVMPNSVTMVGVLSACAKK 240
           ERLFKGISCKDVVSWNSMISAFAQGNCPEDALELFLKMERENVMPNSVTMV VLSACAKK
Sbjct: 181 ERLFKGISCKDVVSWNSMISAFAQGNCPEDALELFLKMERENVMPNSVTMVCVLSACAKK 240

Query: 241 LDLEFGRWVCSYIERKGIKVDLTLCNAMLDMYTKCGSVDDAQKLFDEMPERDVFSWTIML 300
           LDLEFGRWVCSYIERKGIK+DLTL NAMLDMYTKCGSVDDAQKLFDEMPERDVFSWTIML
Sbjct: 241 LDLEFGRWVCSYIERKGIKMDLTLRNAMLDMYTKCGSVDDAQKLFDEMPERDVFSWTIML 300

Query: 301 DGYAKMGDYDAARLVFNAMPVKEIAAWNVLISAYEQNGKPKEALAIFNELQLSKIAKPDE 360
           DGYAKMGDYDAAR VFNAMPVKEIAAWNVLISAYEQNGKPKEALA FNELQLSKIAKPDE
Sbjct: 301 DGYAKMGDYDAARGVFNAMPVKEIAAWNVLISAYEQNGKPKEALATFNELQLSKIAKPDE 360

Query: 361 VTLVSTLSACAQLGAIDLGGWIHVYTKREGIVLNCHLISSLVDMYAKCGSLEKALEVFYS 420
           VTLVSTLSACAQLGAIDLGGWIHVY KREGI LNCHLISSLVDMYAKCG+LEKALEVFYS
Sbjct: 361 VTLVSTLSACAQLGAIDLGGWIHVYIKREGIDLNCHLISSLVDMYAKCGALEKALEVFYS 420

Query: 421 VEERDVYVWSAMIAGLGMHGCGKAAIDLFFKMQEAKVKPNSVTFTNVLCACSHAGLVDEG 480
           VEERDVYVWSAMIAGLGMHG GKAAIDLFF+MQEAKVKPNSVTFTNVLCACSH GLVDEG
Sbjct: 421 VEERDVYVWSAMIAGLGMHGRGKAAIDLFFEMQEAKVKPNSVTFTNVLCACSHTGLVDEG 480

Query: 481 RVFFHEMEPVYGVVPEMKHYACMVDILGRAGFLEEAMELINEMSTTPSASVWGALLGACS 540
           RVFFHEMEPVYGVVPE KHYACMVDILGRAGFLEEAMELINEMS TPSASVWGALLGACS
Sbjct: 481 RVFFHEMEPVYGVVPETKHYACMVDILGRAGFLEEAMELINEMSITPSASVWGALLGACS 540

Query: 541 LHMNVELGELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDTELKKEPGC 600
           LHMNVELGELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDTELKKEPGC
Sbjct: 541 LHMNVELGELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDTELKKEPGC 600

Query: 601 SSIEANGNVHEFLVGDNTHPLSSNIYSKLEEIATKLKSVGYEPNKSHLLQLIEEDDLKEQ 660
           SSIE NGNVHEFLVGDN HPLSSNIYSKL++IATKLK VGYEPNKSHLLQLIEEDDLKEQ
Sbjct: 601 SSIEVNGNVHEFLVGDNLHPLSSNIYSKLDDIATKLKPVGYEPNKSHLLQLIEEDDLKEQ 660

Query: 661 ALSLHSEKLAIAFGLVTLAPSQPIRVVKNLRICGDCHAFAKLVSRVYDRDILLRDRYRFH 720
           ALSLHSEKLAIAFGLV+LAPSQPIRVVKNLRICGDCH FAKLVSRVYDRDILLRDRYRFH
Sbjct: 661 ALSLHSEKLAIAFGLVSLAPSQPIRVVKNLRICGDCHEFAKLVSRVYDRDILLRDRYRFH 720

Query: 721 HFRDGHCSCMDYW 734
           HFRDGHCSCMDYW
Sbjct: 721 HFRDGHCSCMDYW 733

BLAST of CSPI01G23820 vs. ExPASy TrEMBL
Match: A0A6J1HLG4 (pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Cucurbita moschata OX=3662 GN=LOC111464676 PE=3 SV=1)

HSP 1 Score: 1315.8 bits (3404), Expect = 0.0e+00
Identity = 643/733 (87.72%), Positives = 685/733 (93.45%), Query Frame = 0

Query: 1   MEALSVPSISLQNFSTLNNNLLFRNHQILSTIDKCSSSKQLKEVHARMLRTGLFFDPFSA 60
           ME LS P +SL N S  +NNL FRNHQILSTID+CSS KQLK+VHA+MLRTGLFFDPFSA
Sbjct: 1   METLSAPLVSLPNRSIADNNLHFRNHQILSTIDQCSSGKQLKQVHAQMLRTGLFFDPFSA 60

Query: 61  SKLFTASALSSFSTLDYARNLFDQIPQPNLYTWNTLIRAYASSSDPFQSFVIFLDLLHKC 120
           SKL  ASAL S STL+YAR++FDQIP PNLYTWNTLIRAYASS+DPFQSFVIFL LL +C
Sbjct: 61  SKLIAASALKSSSTLEYARDVFDQIPHPNLYTWNTLIRAYASSADPFQSFVIFLALLDEC 120

Query: 121 EDLPNKFTFPFVIKAASELKASRVGTAVHGMAIKLSFGMDLYILNSLVRFYGACGDLSMA 180
           +DLPN FTFPFVIKAASELKASRVG AVHGMAIKLS GMD YILNSLVRFYGACGDL+MA
Sbjct: 121 DDLPNNFTFPFVIKAASELKASRVGRAVHGMAIKLSLGMDQYILNSLVRFYGACGDLNMA 180

Query: 181 ERLFKGISCKDVVSWNSMISAFAQGNCPEDALELFLKMERENVMPNSVTMVGVLSACAKK 240
           ERLF+GISCKDVVSWNSMISAFAQGNCPEDALELFLKME  NVMPNSVTMVGVLSACAKK
Sbjct: 181 ERLFEGISCKDVVSWNSMISAFAQGNCPEDALELFLKMEGANVMPNSVTMVGVLSACAKK 240

Query: 241 LDLEFGRWVCSYIERKGIKVDLTLCNAMLDMYTKCGSVDDAQKLFDEMPERDVFSWTIML 300
           LDLEFGRWVCSYIERK I VDLTLCNAMLDMYTKCGS+ DA+KLFDEMPERDVFSWT ML
Sbjct: 241 LDLEFGRWVCSYIERKEISVDLTLCNAMLDMYTKCGSIGDAEKLFDEMPERDVFSWTTML 300

Query: 301 DGYAKMGDYDAARLVFNAMPVKEIAAWNVLISAYEQNGKPKEALAIFNELQLSKIAKPDE 360
           DGYAKMGD++AAR VF+ MPVKEIAAWN LISAYE+NGKPKEALA FNELQLSKIAKPDE
Sbjct: 301 DGYAKMGDFNAARKVFDEMPVKEIAAWNALISAYERNGKPKEALATFNELQLSKIAKPDE 360

Query: 361 VTLVSTLSACAQLGAIDLGGWIHVYTKREGIVLNCHLISSLVDMYAKCGSLEKALEVFYS 420
           VTLVS+LSACAQLGAIDLGGWIHVY KREGI LN HLI+SL+DMYAKCG+LEKALEVFY+
Sbjct: 361 VTLVSSLSACAQLGAIDLGGWIHVYIKREGINLNGHLITSLIDMYAKCGALEKALEVFYA 420

Query: 421 VEERDVYVWSAMIAGLGMHGCGKAAIDLFFKMQEAKVKPNSVTFTNVLCACSHAGLVDEG 480
           VEE+DVYVWSAMIAGLGMHG GKAAI+LFFKMQEAKVKPN VTFTN+LCACSHAGLVDEG
Sbjct: 421 VEEKDVYVWSAMIAGLGMHGRGKAAIELFFKMQEAKVKPNDVTFTNLLCACSHAGLVDEG 480

Query: 481 RVFFHEMEPVYGVVPEMKHYACMVDILGRAGFLEEAMELINEMSTTPSASVWGALLGACS 540
           R  FHEMEPVYGVVP  KHYACMVDILGRAGFLEEAMELINEM TTPSASVWGALLGACS
Sbjct: 481 RALFHEMEPVYGVVPGTKHYACMVDILGRAGFLEEAMELINEMPTTPSASVWGALLGACS 540

Query: 541 LHMNVELGELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDTELKKEPGC 600
           LHMNVEL ELASDQLLKLEPRNHGAI+LLSN+YAKTGRW+KVSELRKLMRD+ELKKEPGC
Sbjct: 541 LHMNVELAELASDQLLKLEPRNHGAIILLSNVYAKTGRWDKVSELRKLMRDSELKKEPGC 600

Query: 601 SSIEANGNVHEFLVGDNTHPLSSNIYSKLEEIATKLKSVGYEPNKSHLLQLIEEDDLKEQ 660
           SS+E NG VHEFLVGDN+HPLS +IYSKL+EIA KLKSVGYEPNKSHLLQLIEEDD+KE 
Sbjct: 601 SSVEVNGIVHEFLVGDNSHPLSRDIYSKLDEIAAKLKSVGYEPNKSHLLQLIEEDDVKEH 660

Query: 661 ALSLHSEKLAIAFGLVTLAPSQPIRVVKNLRICGDCHAFAKLVSRVYDRDILLRDRYRFH 720
           ALSLHSEKLAIAFGL++LAPSQPIRVVKNLRICGDCH  AKL+SRVY+RDIL++DRYRFH
Sbjct: 661 ALSLHSEKLAIAFGLISLAPSQPIRVVKNLRICGDCHEVAKLISRVYNRDILVQDRYRFH 720

Query: 721 HFRDGHCSCMDYW 734
           HFRDGHCSCMDYW
Sbjct: 721 HFRDGHCSCMDYW 733

BLAST of CSPI01G23820 vs. NCBI nr
Match: XP_004145320.1 (pentatricopeptide repeat-containing protein At2g29760, chloroplastic [Cucumis sativus] >KGN65801.1 hypothetical protein Csa_023315 [Cucumis sativus])

HSP 1 Score: 1468.4 bits (3800), Expect = 0.0e+00
Identity = 729/733 (99.45%), Positives = 730/733 (99.59%), Query Frame = 0

Query: 1   MEALSVPSISLQNFSTLNNNLLFRNHQILSTIDKCSSSKQLKEVHARMLRTGLFFDPFSA 60
           MEALSVPSISLQNFSTLNNNLLFRNHQILSTIDKCSSSKQLKEVHARMLRTGLFFDPFSA
Sbjct: 1   MEALSVPSISLQNFSTLNNNLLFRNHQILSTIDKCSSSKQLKEVHARMLRTGLFFDPFSA 60

Query: 61  SKLFTASALSSFSTLDYARNLFDQIPQPNLYTWNTLIRAYASSSDPFQSFVIFLDLLHKC 120
           SKLFTASALSSFSTLDYARNLFDQIPQPNLYTWNTLIRAYASSSDPFQSFVIFLDLL KC
Sbjct: 61  SKLFTASALSSFSTLDYARNLFDQIPQPNLYTWNTLIRAYASSSDPFQSFVIFLDLLDKC 120

Query: 121 EDLPNKFTFPFVIKAASELKASRVGTAVHGMAIKLSFGMDLYILNSLVRFYGACGDLSMA 180
           EDLPNKFTFPFVIKAASELKASRVGTAVHGMAIKLSFGMDLYILNSLVRFYGACGDLSMA
Sbjct: 121 EDLPNKFTFPFVIKAASELKASRVGTAVHGMAIKLSFGMDLYILNSLVRFYGACGDLSMA 180

Query: 181 ERLFKGISCKDVVSWNSMISAFAQGNCPEDALELFLKMERENVMPNSVTMVGVLSACAKK 240
           ERLFKGISCKDVVSWNSMISAFAQGNCPEDALELFLKMERENVMPNSVTMVGVLSACAKK
Sbjct: 181 ERLFKGISCKDVVSWNSMISAFAQGNCPEDALELFLKMERENVMPNSVTMVGVLSACAKK 240

Query: 241 LDLEFGRWVCSYIERKGIKVDLTLCNAMLDMYTKCGSVDDAQKLFDEMPERDVFSWTIML 300
           LDLEFGRWVCSYIERKGIKVDLTLCNAMLDMYTKCGSVDDAQKLFDEMPERDVFSWTIML
Sbjct: 241 LDLEFGRWVCSYIERKGIKVDLTLCNAMLDMYTKCGSVDDAQKLFDEMPERDVFSWTIML 300

Query: 301 DGYAKMGDYDAARLVFNAMPVKEIAAWNVLISAYEQNGKPKEALAIFNELQLSKIAKPDE 360
           DGYAKMGDYDAARLVFNAMPVKEIAAWNVLISAYEQNGKPKEALAIFNELQLSKIAKPDE
Sbjct: 301 DGYAKMGDYDAARLVFNAMPVKEIAAWNVLISAYEQNGKPKEALAIFNELQLSKIAKPDE 360

Query: 361 VTLVSTLSACAQLGAIDLGGWIHVYTKREGIVLNCHLISSLVDMYAKCGSLEKALEVFYS 420
           VTLVSTLSACAQLGAIDLGGWIHVY KREGIVLNCHLISSLVDMYAKCGSLEKALEVFYS
Sbjct: 361 VTLVSTLSACAQLGAIDLGGWIHVYIKREGIVLNCHLISSLVDMYAKCGSLEKALEVFYS 420

Query: 421 VEERDVYVWSAMIAGLGMHGCGKAAIDLFFKMQEAKVKPNSVTFTNVLCACSHAGLVDEG 480
           VEERDVYVWSAMIAGLGMHG GKAAIDLFF+MQEAKVKPNSVTFTNVLCACSHAGLVDEG
Sbjct: 421 VEERDVYVWSAMIAGLGMHGRGKAAIDLFFEMQEAKVKPNSVTFTNVLCACSHAGLVDEG 480

Query: 481 RVFFHEMEPVYGVVPEMKHYACMVDILGRAGFLEEAMELINEMSTTPSASVWGALLGACS 540
           RVFFHEMEPVYGVVPEMKHYACMVDILGRAGFLEEAMELINEMSTTPSASVWGALLGACS
Sbjct: 481 RVFFHEMEPVYGVVPEMKHYACMVDILGRAGFLEEAMELINEMSTTPSASVWGALLGACS 540

Query: 541 LHMNVELGELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDTELKKEPGC 600
           LHMNVELGELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDTELKKEPGC
Sbjct: 541 LHMNVELGELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDTELKKEPGC 600

Query: 601 SSIEANGNVHEFLVGDNTHPLSSNIYSKLEEIATKLKSVGYEPNKSHLLQLIEEDDLKEQ 660
           SSIEANGNVHEFLVGDNTHPLSSNIYSKLEEIATKLKSVGYEPNKSHLLQLIEEDDLKEQ
Sbjct: 601 SSIEANGNVHEFLVGDNTHPLSSNIYSKLEEIATKLKSVGYEPNKSHLLQLIEEDDLKEQ 660

Query: 661 ALSLHSEKLAIAFGLVTLAPSQPIRVVKNLRICGDCHAFAKLVSRVYDRDILLRDRYRFH 720
           ALSLHSEKLAIAFGLVTLAPSQPIRVVKNLRICGDCHAFAKLVSRVYDRDILLRDRYRFH
Sbjct: 661 ALSLHSEKLAIAFGLVTLAPSQPIRVVKNLRICGDCHAFAKLVSRVYDRDILLRDRYRFH 720

Query: 721 HFRDGHCSCMDYW 734
           HFRDGHCSCMDYW
Sbjct: 721 HFRDGHCSCMDYW 733

BLAST of CSPI01G23820 vs. NCBI nr
Match: XP_008457379.1 (PREDICTED: pentatricopeptide repeat-containing protein At2g29760, chloroplastic [Cucumis melo] >TYJ97320.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 1422.1 bits (3680), Expect = 0.0e+00
Identity = 707/733 (96.45%), Positives = 714/733 (97.41%), Query Frame = 0

Query: 1   MEALSVPSISLQNFSTLNNNLLFRNHQILSTIDKCSSSKQLKEVHARMLRTGLFFDPFSA 60
           MEALSVP ISLQNFSTLNNNL FRNHQILS IDKCSSSKQLKEVHARMLRTGLFFDPFSA
Sbjct: 1   MEALSVPLISLQNFSTLNNNLPFRNHQILSAIDKCSSSKQLKEVHARMLRTGLFFDPFSA 60

Query: 61  SKLFTASALSSFSTLDYARNLFDQIPQPNLYTWNTLIRAYASSSDPFQSFVIFLDLLHKC 120
           SKLFTASALSSFSTLDYARN+FDQIPQPNLYTWN LIRAYASSSDPFQSFVIFLDLL KC
Sbjct: 61  SKLFTASALSSFSTLDYARNVFDQIPQPNLYTWNILIRAYASSSDPFQSFVIFLDLLDKC 120

Query: 121 EDLPNKFTFPFVIKAASELKASRVGTAVHGMAIKLSFGMDLYILNSLVRFYGACGDLSMA 180
           EDLPN FTFPFVIKAASELKASRVGTAVHGMAIKLSFGMDLYILNSLVRFYGACGDLSMA
Sbjct: 121 EDLPNNFTFPFVIKAASELKASRVGTAVHGMAIKLSFGMDLYILNSLVRFYGACGDLSMA 180

Query: 181 ERLFKGISCKDVVSWNSMISAFAQGNCPEDALELFLKMERENVMPNSVTMVGVLSACAKK 240
           ERLFKGISCKDVVSWNSMISAFAQGNCPEDALELFLKMERENVMPNSVTMV VLSACAKK
Sbjct: 181 ERLFKGISCKDVVSWNSMISAFAQGNCPEDALELFLKMERENVMPNSVTMVCVLSACAKK 240

Query: 241 LDLEFGRWVCSYIERKGIKVDLTLCNAMLDMYTKCGSVDDAQKLFDEMPERDVFSWTIML 300
           LDLEFGRWVCSYIERKGIK+DLTL NAMLDMYTKCGSVDDAQKLFDEMPERDVFSWTIML
Sbjct: 241 LDLEFGRWVCSYIERKGIKMDLTLRNAMLDMYTKCGSVDDAQKLFDEMPERDVFSWTIML 300

Query: 301 DGYAKMGDYDAARLVFNAMPVKEIAAWNVLISAYEQNGKPKEALAIFNELQLSKIAKPDE 360
           DGYAKMGDYDAAR VFNAMPVKEIAAWNVLISAYEQNGKPKEALAIFNELQLSKIAKPDE
Sbjct: 301 DGYAKMGDYDAARGVFNAMPVKEIAAWNVLISAYEQNGKPKEALAIFNELQLSKIAKPDE 360

Query: 361 VTLVSTLSACAQLGAIDLGGWIHVYTKREGIVLNCHLISSLVDMYAKCGSLEKALEVFYS 420
           VTLVSTLSACAQLGAIDLGGWIHVY KREGI LNCHLISSLVDMYAKCG+LEKALEVFYS
Sbjct: 361 VTLVSTLSACAQLGAIDLGGWIHVYIKREGIDLNCHLISSLVDMYAKCGALEKALEVFYS 420

Query: 421 VEERDVYVWSAMIAGLGMHGCGKAAIDLFFKMQEAKVKPNSVTFTNVLCACSHAGLVDEG 480
           VEERDVYVWSAMIAGLGMHG GKAAIDLFF+MQEAKVKPNSVTFTNVLCACSH GLVDEG
Sbjct: 421 VEERDVYVWSAMIAGLGMHGRGKAAIDLFFEMQEAKVKPNSVTFTNVLCACSHTGLVDEG 480

Query: 481 RVFFHEMEPVYGVVPEMKHYACMVDILGRAGFLEEAMELINEMSTTPSASVWGALLGACS 540
           RVFFHEMEPVYGVVPE KHYACMVDILGRAGFLEEAMELINEMS TPSASVWGALLGACS
Sbjct: 481 RVFFHEMEPVYGVVPETKHYACMVDILGRAGFLEEAMELINEMSITPSASVWGALLGACS 540

Query: 541 LHMNVELGELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDTELKKEPGC 600
           LHMNVELGELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDTELKKEPGC
Sbjct: 541 LHMNVELGELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDTELKKEPGC 600

Query: 601 SSIEANGNVHEFLVGDNTHPLSSNIYSKLEEIATKLKSVGYEPNKSHLLQLIEEDDLKEQ 660
           SSIE NGNVHEFLVGDN HPLSSNIYSKL++IATKLK VGYEPNKSHLLQLIEEDDLKEQ
Sbjct: 601 SSIEVNGNVHEFLVGDNLHPLSSNIYSKLDDIATKLKPVGYEPNKSHLLQLIEEDDLKEQ 660

Query: 661 ALSLHSEKLAIAFGLVTLAPSQPIRVVKNLRICGDCHAFAKLVSRVYDRDILLRDRYRFH 720
           ALSLHSEKLAIAFGLV+LAPSQPIRVVKNLRICGDCH FAKLVSRVYDRDILLRDRYRFH
Sbjct: 661 ALSLHSEKLAIAFGLVSLAPSQPIRVVKNLRICGDCHEFAKLVSRVYDRDILLRDRYRFH 720

Query: 721 HFRDGHCSCMDYW 734
           HFRDGHCSCMDYW
Sbjct: 721 HFRDGHCSCMDYW 733

BLAST of CSPI01G23820 vs. NCBI nr
Match: KAA0031814.1 (pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 1420.2 bits (3675), Expect = 0.0e+00
Identity = 706/733 (96.32%), Positives = 713/733 (97.27%), Query Frame = 0

Query: 1   MEALSVPSISLQNFSTLNNNLLFRNHQILSTIDKCSSSKQLKEVHARMLRTGLFFDPFSA 60
           MEALSVP ISLQNFSTLNNNL FRNHQILS IDKCSSSKQLKEVHARMLRTGLFFDPFSA
Sbjct: 1   MEALSVPLISLQNFSTLNNNLPFRNHQILSAIDKCSSSKQLKEVHARMLRTGLFFDPFSA 60

Query: 61  SKLFTASALSSFSTLDYARNLFDQIPQPNLYTWNTLIRAYASSSDPFQSFVIFLDLLHKC 120
           SKLFTASALSSFSTLDYARN+FDQIPQPNLYTWN LIRAYASSSDPFQSFVIFLDLL KC
Sbjct: 61  SKLFTASALSSFSTLDYARNVFDQIPQPNLYTWNILIRAYASSSDPFQSFVIFLDLLDKC 120

Query: 121 EDLPNKFTFPFVIKAASELKASRVGTAVHGMAIKLSFGMDLYILNSLVRFYGACGDLSMA 180
           EDLPN FTFPFVIKAASELKASRVGTAVHGMAIKLSFGMDLYILNSLVRFYGACGDLSMA
Sbjct: 121 EDLPNNFTFPFVIKAASELKASRVGTAVHGMAIKLSFGMDLYILNSLVRFYGACGDLSMA 180

Query: 181 ERLFKGISCKDVVSWNSMISAFAQGNCPEDALELFLKMERENVMPNSVTMVGVLSACAKK 240
           ERLFKGISCKDVVSWNSMISAFAQGNCPEDALELFLKMERENVMPNSVTMV VLSACAKK
Sbjct: 181 ERLFKGISCKDVVSWNSMISAFAQGNCPEDALELFLKMERENVMPNSVTMVCVLSACAKK 240

Query: 241 LDLEFGRWVCSYIERKGIKVDLTLCNAMLDMYTKCGSVDDAQKLFDEMPERDVFSWTIML 300
           LDLEFGRWVCSYIERKGIK+DLTL NAMLDMYTKCGSVDDAQKLFDEMPERDVFSWTIML
Sbjct: 241 LDLEFGRWVCSYIERKGIKMDLTLRNAMLDMYTKCGSVDDAQKLFDEMPERDVFSWTIML 300

Query: 301 DGYAKMGDYDAARLVFNAMPVKEIAAWNVLISAYEQNGKPKEALAIFNELQLSKIAKPDE 360
           DGYAKMGDYDAAR VFNAMPVKEIAAWNVLISAYEQNGKPKEALA FNELQLSKIAKPDE
Sbjct: 301 DGYAKMGDYDAARGVFNAMPVKEIAAWNVLISAYEQNGKPKEALATFNELQLSKIAKPDE 360

Query: 361 VTLVSTLSACAQLGAIDLGGWIHVYTKREGIVLNCHLISSLVDMYAKCGSLEKALEVFYS 420
           VTLVSTLSACAQLGAIDLGGWIHVY KREGI LNCHLISSLVDMYAKCG+LEKALEVFYS
Sbjct: 361 VTLVSTLSACAQLGAIDLGGWIHVYIKREGIDLNCHLISSLVDMYAKCGALEKALEVFYS 420

Query: 421 VEERDVYVWSAMIAGLGMHGCGKAAIDLFFKMQEAKVKPNSVTFTNVLCACSHAGLVDEG 480
           VEERDVYVWSAMIAGLGMHG GKAAIDLFF+MQEAKVKPNSVTFTNVLCACSH GLVDEG
Sbjct: 421 VEERDVYVWSAMIAGLGMHGRGKAAIDLFFEMQEAKVKPNSVTFTNVLCACSHTGLVDEG 480

Query: 481 RVFFHEMEPVYGVVPEMKHYACMVDILGRAGFLEEAMELINEMSTTPSASVWGALLGACS 540
           RVFFHEMEPVYGVVPE KHYACMVDILGRAGFLEEAMELINEMS TPSASVWGALLGACS
Sbjct: 481 RVFFHEMEPVYGVVPETKHYACMVDILGRAGFLEEAMELINEMSITPSASVWGALLGACS 540

Query: 541 LHMNVELGELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDTELKKEPGC 600
           LHMNVELGELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDTELKKEPGC
Sbjct: 541 LHMNVELGELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDTELKKEPGC 600

Query: 601 SSIEANGNVHEFLVGDNTHPLSSNIYSKLEEIATKLKSVGYEPNKSHLLQLIEEDDLKEQ 660
           SSIE NGNVHEFLVGDN HPLSSNIYSKL++IATKLK VGYEPNKSHLLQLIEEDDLKEQ
Sbjct: 601 SSIEVNGNVHEFLVGDNLHPLSSNIYSKLDDIATKLKPVGYEPNKSHLLQLIEEDDLKEQ 660

Query: 661 ALSLHSEKLAIAFGLVTLAPSQPIRVVKNLRICGDCHAFAKLVSRVYDRDILLRDRYRFH 720
           ALSLHSEKLAIAFGLV+LAPSQPIRVVKNLRICGDCH FAKLVSRVYDRDILLRDRYRFH
Sbjct: 661 ALSLHSEKLAIAFGLVSLAPSQPIRVVKNLRICGDCHEFAKLVSRVYDRDILLRDRYRFH 720

Query: 721 HFRDGHCSCMDYW 734
           HFRDGHCSCMDYW
Sbjct: 721 HFRDGHCSCMDYW 733

BLAST of CSPI01G23820 vs. NCBI nr
Match: XP_038893523.1 (pentatricopeptide repeat-containing protein At2g29760, chloroplastic [Benincasa hispida])

HSP 1 Score: 1363.2 bits (3527), Expect = 0.0e+00
Identity = 672/733 (91.68%), Positives = 696/733 (94.95%), Query Frame = 0

Query: 1   MEALSVPSISLQNFSTLNNNLLFRNHQILSTIDKCSSSKQLKEVHARMLRTGLFFDPFSA 60
           MEALSVP ISLQNF T N+NL FRNHQILSTID+CSS KQLK+VHA MLRTGLFFDPFSA
Sbjct: 1   MEALSVPLISLQNFPTPNDNLPFRNHQILSTIDQCSSPKQLKQVHAHMLRTGLFFDPFSA 60

Query: 61  SKLFTASALSSFSTLDYARNLFDQIPQPNLYTWNTLIRAYASSSDPFQSFVIFLDLLHKC 120
           SKLFTASALSSFSTLDYA N+FDQI  PNLYTWNTLIRAYASSSDPFQSFVIFLDLL KC
Sbjct: 61  SKLFTASALSSFSTLDYALNVFDQISHPNLYTWNTLIRAYASSSDPFQSFVIFLDLLDKC 120

Query: 121 EDLPNKFTFPFVIKAASELKASRVGTAVHGMAIKLSFGMDLYILNSLVRFYGACGDLSMA 180
           +DLPN FTFPFVIKAASELKASRVG AVHGMAIKLSFGMDLYILNSLVRFYG CGDL+MA
Sbjct: 121 DDLPNNFTFPFVIKAASELKASRVGRAVHGMAIKLSFGMDLYILNSLVRFYGTCGDLNMA 180

Query: 181 ERLFKGISCKDVVSWNSMISAFAQGNCPEDALELFLKMERENVMPNSVTMVGVLSACAKK 240
           ERLF+GISCKDVVSWNSMISAFAQGNCPEDAL+LFLKMERENVMPNSVTMVGVLSACAKK
Sbjct: 181 ERLFEGISCKDVVSWNSMISAFAQGNCPEDALDLFLKMERENVMPNSVTMVGVLSACAKK 240

Query: 241 LDLEFGRWVCSYIERKGIKVDLTLCNAMLDMYTKCGSVDDAQKLFDEMPERDVFSWTIML 300
           LDLEFGRWVCSYIERK IKVDLTLCNAMLDMYTKCGS+DDAQKLFDEMPERDVFSWT ML
Sbjct: 241 LDLEFGRWVCSYIERKEIKVDLTLCNAMLDMYTKCGSIDDAQKLFDEMPERDVFSWTTML 300

Query: 301 DGYAKMGDYDAARLVFNAMPVKEIAAWNVLISAYEQNGKPKEALAIFNELQLSKIAKPDE 360
           DGYAKMGD+DAAR VF+AMPVKEIAAWNVLISAYEQNG PKEALA FNELQLSKIAKPDE
Sbjct: 301 DGYAKMGDFDAARRVFDAMPVKEIAAWNVLISAYEQNGNPKEALATFNELQLSKIAKPDE 360

Query: 361 VTLVSTLSACAQLGAIDLGGWIHVYTKREGIVLNCHLISSLVDMYAKCGSLEKALEVFYS 420
           VTLVSTLSAC+QLGAIDLGGWIHVY KREGI LNCHLISSLVDMYAKCG+LEKALEVFYS
Sbjct: 361 VTLVSTLSACSQLGAIDLGGWIHVYIKREGIDLNCHLISSLVDMYAKCGALEKALEVFYS 420

Query: 421 VEERDVYVWSAMIAGLGMHGCGKAAIDLFFKMQEAKVKPNSVTFTNVLCACSHAGLVDEG 480
           VE RDVYVWSAMIAGLGMHG GKAAI+LFF+MQEAKVKPNSVTF NVLCACSHAGLVDEG
Sbjct: 421 VEVRDVYVWSAMIAGLGMHGRGKAAINLFFEMQEAKVKPNSVTFMNVLCACSHAGLVDEG 480

Query: 481 RVFFHEMEPVYGVVPEMKHYACMVDILGRAGFLEEAMELINEMSTTPSASVWGALLGACS 540
           R F HEMEP+YGVVP  KHYACMVDILGRAGFLEEAMELINEM  TPSAS+WGALLGACS
Sbjct: 481 RAFLHEMEPIYGVVPGTKHYACMVDILGRAGFLEEAMELINEMPITPSASIWGALLGACS 540

Query: 541 LHMNVELGELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDTELKKEPGC 600
           LHMNVEL ELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRD++LKKEPGC
Sbjct: 541 LHMNVELAELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDSKLKKEPGC 600

Query: 601 SSIEANGNVHEFLVGDNTHPLSSNIYSKLEEIATKLKSVGYEPNKSHLLQLIEEDDLKEQ 660
           SSIE +GNVHEFLVGDN+HPLSS IY KL+EIATKLKSVGYEPNKSHLLQ IEEDDLKEQ
Sbjct: 601 SSIEVDGNVHEFLVGDNSHPLSSKIYLKLDEIATKLKSVGYEPNKSHLLQFIEEDDLKEQ 660

Query: 661 ALSLHSEKLAIAFGLVTLAPSQPIRVVKNLRICGDCHAFAKLVSRVYDRDILLRDRYRFH 720
           ALSLHSEKLAIAFGL++LAPSQPIRVVKNLRICGDCH  AKLVSRVYDRDILLRDRYRFH
Sbjct: 661 ALSLHSEKLAIAFGLISLAPSQPIRVVKNLRICGDCHEVAKLVSRVYDRDILLRDRYRFH 720

Query: 721 HFRDGHCSCMDYW 734
           HFRDGHCSC DYW
Sbjct: 721 HFRDGHCSCRDYW 733

BLAST of CSPI01G23820 vs. NCBI nr
Match: XP_022964665.1 (pentatricopeptide repeat-containing protein At2g29760, chloroplastic [Cucurbita moschata])

HSP 1 Score: 1315.8 bits (3404), Expect = 0.0e+00
Identity = 643/733 (87.72%), Positives = 685/733 (93.45%), Query Frame = 0

Query: 1   MEALSVPSISLQNFSTLNNNLLFRNHQILSTIDKCSSSKQLKEVHARMLRTGLFFDPFSA 60
           ME LS P +SL N S  +NNL FRNHQILSTID+CSS KQLK+VHA+MLRTGLFFDPFSA
Sbjct: 1   METLSAPLVSLPNRSIADNNLHFRNHQILSTIDQCSSGKQLKQVHAQMLRTGLFFDPFSA 60

Query: 61  SKLFTASALSSFSTLDYARNLFDQIPQPNLYTWNTLIRAYASSSDPFQSFVIFLDLLHKC 120
           SKL  ASAL S STL+YAR++FDQIP PNLYTWNTLIRAYASS+DPFQSFVIFL LL +C
Sbjct: 61  SKLIAASALKSSSTLEYARDVFDQIPHPNLYTWNTLIRAYASSADPFQSFVIFLALLDEC 120

Query: 121 EDLPNKFTFPFVIKAASELKASRVGTAVHGMAIKLSFGMDLYILNSLVRFYGACGDLSMA 180
           +DLPN FTFPFVIKAASELKASRVG AVHGMAIKLS GMD YILNSLVRFYGACGDL+MA
Sbjct: 121 DDLPNNFTFPFVIKAASELKASRVGRAVHGMAIKLSLGMDQYILNSLVRFYGACGDLNMA 180

Query: 181 ERLFKGISCKDVVSWNSMISAFAQGNCPEDALELFLKMERENVMPNSVTMVGVLSACAKK 240
           ERLF+GISCKDVVSWNSMISAFAQGNCPEDALELFLKME  NVMPNSVTMVGVLSACAKK
Sbjct: 181 ERLFEGISCKDVVSWNSMISAFAQGNCPEDALELFLKMEGANVMPNSVTMVGVLSACAKK 240

Query: 241 LDLEFGRWVCSYIERKGIKVDLTLCNAMLDMYTKCGSVDDAQKLFDEMPERDVFSWTIML 300
           LDLEFGRWVCSYIERK I VDLTLCNAMLDMYTKCGS+ DA+KLFDEMPERDVFSWT ML
Sbjct: 241 LDLEFGRWVCSYIERKEISVDLTLCNAMLDMYTKCGSIGDAEKLFDEMPERDVFSWTTML 300

Query: 301 DGYAKMGDYDAARLVFNAMPVKEIAAWNVLISAYEQNGKPKEALAIFNELQLSKIAKPDE 360
           DGYAKMGD++AAR VF+ MPVKEIAAWN LISAYE+NGKPKEALA FNELQLSKIAKPDE
Sbjct: 301 DGYAKMGDFNAARKVFDEMPVKEIAAWNALISAYERNGKPKEALATFNELQLSKIAKPDE 360

Query: 361 VTLVSTLSACAQLGAIDLGGWIHVYTKREGIVLNCHLISSLVDMYAKCGSLEKALEVFYS 420
           VTLVS+LSACAQLGAIDLGGWIHVY KREGI LN HLI+SL+DMYAKCG+LEKALEVFY+
Sbjct: 361 VTLVSSLSACAQLGAIDLGGWIHVYIKREGINLNGHLITSLIDMYAKCGALEKALEVFYA 420

Query: 421 VEERDVYVWSAMIAGLGMHGCGKAAIDLFFKMQEAKVKPNSVTFTNVLCACSHAGLVDEG 480
           VEE+DVYVWSAMIAGLGMHG GKAAI+LFFKMQEAKVKPN VTFTN+LCACSHAGLVDEG
Sbjct: 421 VEEKDVYVWSAMIAGLGMHGRGKAAIELFFKMQEAKVKPNDVTFTNLLCACSHAGLVDEG 480

Query: 481 RVFFHEMEPVYGVVPEMKHYACMVDILGRAGFLEEAMELINEMSTTPSASVWGALLGACS 540
           R  FHEMEPVYGVVP  KHYACMVDILGRAGFLEEAMELINEM TTPSASVWGALLGACS
Sbjct: 481 RALFHEMEPVYGVVPGTKHYACMVDILGRAGFLEEAMELINEMPTTPSASVWGALLGACS 540

Query: 541 LHMNVELGELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDTELKKEPGC 600
           LHMNVEL ELASDQLLKLEPRNHGAI+LLSN+YAKTGRW+KVSELRKLMRD+ELKKEPGC
Sbjct: 541 LHMNVELAELASDQLLKLEPRNHGAIILLSNVYAKTGRWDKVSELRKLMRDSELKKEPGC 600

Query: 601 SSIEANGNVHEFLVGDNTHPLSSNIYSKLEEIATKLKSVGYEPNKSHLLQLIEEDDLKEQ 660
           SS+E NG VHEFLVGDN+HPLS +IYSKL+EIA KLKSVGYEPNKSHLLQLIEEDD+KE 
Sbjct: 601 SSVEVNGIVHEFLVGDNSHPLSRDIYSKLDEIAAKLKSVGYEPNKSHLLQLIEEDDVKEH 660

Query: 661 ALSLHSEKLAIAFGLVTLAPSQPIRVVKNLRICGDCHAFAKLVSRVYDRDILLRDRYRFH 720
           ALSLHSEKLAIAFGL++LAPSQPIRVVKNLRICGDCH  AKL+SRVY+RDIL++DRYRFH
Sbjct: 661 ALSLHSEKLAIAFGLISLAPSQPIRVVKNLRICGDCHEVAKLISRVYNRDILVQDRYRFH 720

Query: 721 HFRDGHCSCMDYW 734
           HFRDGHCSCMDYW
Sbjct: 721 HFRDGHCSCMDYW 733

BLAST of CSPI01G23820 vs. TAIR 10
Match: AT2G29760.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 943.3 bits (2437), Expect = 1.1e-274
Identity = 448/727 (61.62%), Positives = 564/727 (77.58%), Query Frame = 0

Query: 7   PSISLQNFSTLNNNLLFRNHQILSTIDKCSSSKQLKEVHARMLRTGLFFDPFSASKLFTA 66
           P+ S  N  T NN       + +S I++C S +QLK+ H  M+RTG F DP+SASKLF  
Sbjct: 16  PNFSNPNQPTTNN----ERSRHISLIERCVSLRQLKQTHGHMIRTGTFSDPYSASKLFAM 75

Query: 67  SALSSFSTLDYARNLFDQIPQPNLYTWNTLIRAYASSSDPFQSFVIFLDLLHKCEDLPNK 126
           +ALSSF++L+YAR +FD+IP+PN + WNTLIRAYAS  DP  S   FLD++ + +  PNK
Sbjct: 76  AALSSFASLEYARKVFDEIPKPNSFAWNTLIRAYASGPDPVLSIWAFLDMVSESQCYPNK 135

Query: 127 FTFPFVIKAASELKASRVGTAVHGMAIKLSFGMDLYILNSLVRFYGACGDLSMAERLFKG 186
           +TFPF+IKAA+E+ +  +G ++HGMA+K + G D+++ NSL+  Y +CGDL  A ++F  
Sbjct: 136 YTFPFLIKAAAEVSSLSLGQSLHGMAVKSAVGSDVFVANSLIHCYFSCGDLDSACKVFTT 195

Query: 187 ISCKDVVSWNSMISAFAQGNCPEDALELFLKMERENVMPNSVTMVGVLSACAKKLDLEFG 246
           I  KDVVSWNSMI+ F Q   P+ ALELF KME E+V  + VTMVGVLSACAK  +LEFG
Sbjct: 196 IKEKDVVSWNSMINGFVQKGSPDKALELFKKMESEDVKASHVTMVGVLSACAKIRNLEFG 255

Query: 247 RWVCSYIERKGIKVDLTLCNAMLDMYTKCGSVDDAQKLFDEMPERDVFSWTIMLDGYAKM 306
           R VCSYIE   + V+LTL NAMLDMYTKCGS++DA++LFD M E+D  +WT MLDGYA  
Sbjct: 256 RQVCSYIEENRVNVNLTLANAMLDMYTKCGSIEDAKRLFDAMEEKDNVTWTTMLDGYAIS 315

Query: 307 GDYDAARLVFNAMPVKEIAAWNVLISAYEQNGKPKEALAIFNELQLSKIAKPDEVTLVST 366
            DY+AAR V N+MP K+I AWN LISAYEQNGKP EAL +F+ELQL K  K +++TLVST
Sbjct: 316 EDYEAAREVLNSMPQKDIVAWNALISAYEQNGKPNEALIVFHELQLQKNMKLNQITLVST 375

Query: 367 LSACAQLGAIDLGGWIHVYTKREGIVLNCHLISSLVDMYAKCGSLEKALEVFYSVEERDV 426
           LSACAQ+GA++LG WIH Y K+ GI +N H+ S+L+ MY+KCG LEK+ EVF SVE+RDV
Sbjct: 376 LSACAQVGALELGRWIHSYIKKHGIRMNFHVTSALIHMYSKCGDLEKSREVFNSVEKRDV 435

Query: 427 YVWSAMIAGLGMHGCGKAAIDLFFKMQEAKVKPNSVTFTNVLCACSHAGLVDEGRVFFHE 486
           +VWSAMI GL MHGCG  A+D+F+KMQEA VKPN VTFTNV CACSH GLVDE    FH+
Sbjct: 436 FVWSAMIGGLAMHGCGNEAVDMFYKMQEANVKPNGVTFTNVFCACSHTGLVDEAESLFHQ 495

Query: 487 MEPVYGVVPEMKHYACMVDILGRAGFLEEAMELINEMSTTPSASVWGALLGACSLHMNVE 546
           ME  YG+VPE KHYAC+VD+LGR+G+LE+A++ I  M   PS SVWGALLGAC +H N+ 
Sbjct: 496 MESNYGIVPEEKHYACIVDVLGRSGYLEKAVKFIEAMPIPPSTSVWGALLGACKIHANLN 555

Query: 547 LGELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDTELKKEPGCSSIEAN 606
           L E+A  +LL+LEPRN GA VLLSNIYAK G+WE VSELRK MR T LKKEPGCSSIE +
Sbjct: 556 LAEMACTRLLELEPRNDGAHVLLSNIYAKLGKWENVSELRKHMRVTGLKKEPGCSSIEID 615

Query: 607 GNVHEFLVGDNTHPLSSNIYSKLEEIATKLKSVGYEPNKSHLLQLIEEDDLKEQALSLHS 666
           G +HEFL GDN HP+S  +Y KL E+  KLKS GYEP  S +LQ+IEE+++KEQ+L+LHS
Sbjct: 616 GMIHEFLSGDNAHPMSEKVYGKLHEVMEKLKSNGYEPEISQVLQIIEEEEMKEQSLNLHS 675

Query: 667 EKLAIAFGLVTLAPSQPIRVVKNLRICGDCHAFAKLVSRVYDRDILLRDRYRFHHFRDGH 726
           EKLAI +GL++    + IRV+KNLR+CGDCH+ AKL+S++YDR+I++RDRYRFHHFR+G 
Sbjct: 676 EKLAICYGLISTEAPKVIRVIKNLRVCGDCHSVAKLISQLYDREIIVRDRYRFHHFRNGQ 735

Query: 727 CSCMDYW 734
           CSC D+W
Sbjct: 736 CSCNDFW 738

BLAST of CSPI01G23820 vs. TAIR 10
Match: AT1G08070.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 602.1 bits (1551), Expect = 6.2e-172
Identity = 312/744 (41.94%), Positives = 448/744 (60.22%), Query Frame = 0

Query: 24  RNHQILSTIDKCSSSKQLKEVHARMLRTGLFFDPFSASKLFTASALS-SFSTLDYARNLF 83
           RNH  LS +  C + + L+ +HA+M++ GL    ++ SKL     LS  F  L YA ++F
Sbjct: 32  RNHPSLSLLHNCKTLQSLRIIHAQMIKIGLHNTNYALSKLIEFCILSPHFEGLPYAISVF 91

Query: 84  DQIPQPNLYTWNTLIRAYASSSDPFQSFVIFLDLLHKCEDLPNKFTFPFVIKAASELKAS 143
             I +PNL  WNT+ R +A SSDP  +  +++ ++     LPN +TFPFV+K+ ++ KA 
Sbjct: 92  KTIQEPNLLIWNTMFRGHALSSDPVSALKLYVCMI-SLGLLPNSYTFPFVLKSCAKSKAF 151

Query: 144 RVGTAVHGMAIKLSFGMDLYILNS-------------------------------LVRFY 203
           + G  +HG  +KL   +DLY+  S                               L++ Y
Sbjct: 152 KEGQQIHGHVLKLGCDLDLYVHTSLISMYVQNGRLEDAHKVFDKSPHRDVVSYTALIKGY 211

Query: 204 GACGDLSMAERLFKGISCKDVVSWNSMISAFAQGNCPEDALELFLKMERENVMPNSVTMV 263
            + G +  A++LF  I  KDVVSWN+MIS +A+    ++ALELF  M + NV P+  TMV
Sbjct: 212 ASRGYIENAQKLFDEIPVKDVVSWNAMISGYAETGNYKEALELFKDMMKTNVRPDESTMV 271

Query: 264 GVLSACAKKLDLEFGRWVCSYIERKGIKVDLTLCNAMLDMYTKCGSVDDAQKLFDEMPER 323
            V+SACA+   +E GR V  +I+  G   +L + NA++D+Y+KCG ++ A  LF+ +P +
Sbjct: 272 TVVSACAQSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSKCGELETACGLFERLPYK 331

Query: 324 DVFSWTIMLDGYAKMGDYDAARLVFNAMPVKEIAAWNVLISAYEQNGKPKEALAIFNELQ 383
           DV SW  ++ GY  M  Y                               KEAL +F E+ 
Sbjct: 332 DVISWNTLIGGYTHMNLY-------------------------------KEALLLFQEM- 391

Query: 384 LSKIAKPDEVTLVSTLSACAQLGAIDLGGWIHVYTKR--EGIVLNCHLISSLVDMYAKCG 443
           L     P++VT++S L ACA LGAID+G WIHVY  +  +G+     L +SL+DMYAKCG
Sbjct: 392 LRSGETPNDVTMLSILPACAHLGAIDIGRWIHVYIDKRLKGVTNASSLRTSLIDMYAKCG 451

Query: 444 SLEKALEVFYSVEERDVYVWSAMIAGLGMHGCGKAAIDLFFKMQEAKVKPNSVTFTNVLC 503
            +E A +VF S+  + +  W+AMI G  MHG   A+ DLF +M++  ++P+ +TF  +L 
Sbjct: 452 DIEAAHQVFNSILHKSLSSWNAMIFGFAMHGRADASFDLFSRMRKIGIQPDDITFVGLLS 511

Query: 504 ACSHAGLVDEGRVFFHEMEPVYGVVPEMKHYACMVDILGRAGFLEEAMELINEMSTTPSA 563
           ACSH+G++D GR  F  M   Y + P+++HY CM+D+LG +G  +EA E+IN M   P  
Sbjct: 512 ACSHSGMLDLGRHIFRTMTQDYKMTPKLEHYGCMIDLLGHSGLFKEAEEMINMMEMEPDG 571

Query: 564 SVWGALLGACSLHMNVELGELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLM 623
            +W +LL AC +H NVELGE  ++ L+K+EP N G+ VLLSNIYA  GRW +V++ R L+
Sbjct: 572 VIWCSLLKACKMHGNVELGESFAENLIKIEPENPGSYVLLSNIYASAGRWNEVAKTRALL 631

Query: 624 RDTELKKEPGCSSIEANGNVHEFLVGDNTHPLSSNIYSKLEEIATKLKSVGYEPNKSHLL 683
            D  +KK PGCSSIE +  VHEF++GD  HP +  IY  LEE+   L+  G+ P+ S +L
Sbjct: 632 NDKGMKKVPGCSSIEIDSVVHEFIIGDKFHPRNREIYGMLEEMEVLLEKAGFVPDTSEVL 691

Query: 684 QLIEEDDLKEQALSLHSEKLAIAFGLVTLAPSQPIRVVKNLRICGDCHAFAKLVSRVYDR 734
           Q +EE + KE AL  HSEKLAIAFGL++  P   + +VKNLR+C +CH   KL+S++Y R
Sbjct: 692 QEMEE-EWKEGALRHHSEKLAIAFGLISTKPGTKLTIVKNLRVCRNCHEATKLISKIYKR 741

BLAST of CSPI01G23820 vs. TAIR 10
Match: AT4G14820.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 557.8 bits (1436), Expect = 1.3e-158
Identity = 278/713 (38.99%), Positives = 437/713 (61.29%), Query Frame = 0

Query: 28  ILSTIDKCSSSKQLKEVHARMLRTGLFFDPFSASKLFTASALSSFSTLDYARNLFDQIPQ 87
           IL  +  C S   +K++HA +LRT    +    S LF  S  SS   L YA N+F  IP 
Sbjct: 15  ILEKLSFCKSLNHIKQLHAHILRT--VINHKLNSFLFNLSVSSSSINLSYALNVFSSIPS 74

Query: 88  -PNLYTWNTLIRAYASSSDPFQSFVIFLDLLHKCEDLPNKFTFPFVIKAASELKASRVGT 147
            P    +N  +R  + SS+P ++ ++F   +       ++F+F  ++KA S++ A   G 
Sbjct: 75  PPESIVFNPFLRDLSRSSEP-RATILFYQRIRHVGGRLDQFSFLPILKAVSKVSALFEGM 134

Query: 148 AVHGMAIKLSFGMDLYILNSLVRFYGACGDLSMAERLFKGISCKDVVSWNSMISAFAQGN 207
            +HG+A K++   D ++    +  Y +CG ++ A  +F  +S +DVV+WN+MI  + +  
Sbjct: 135 ELHGVAFKIATLCDPFVETGFMDMYASCGRINYARNVFDEMSHRDVVTWNTMIERYCRFG 194

Query: 208 CPEDALELFLKMERENVMPNSVTMVGVLSACAKKLDLEFGRWVCSYIERKGIKVDLTLCN 267
             ++A +LF +M+  NVMP+ + +  ++SAC +  ++ + R +  ++    +++D  L  
Sbjct: 195 LVDEAFKLFEEMKDSNVMPDEMILCNIVSACGRTGNMRYNRAIYEFLIENDVRMDTHLLT 254

Query: 268 AMLDMYTKCGSVDDAQKLFDEMPERDVFSWTIMLDGYAKMGDYDAARLVFNAMPVKEIAA 327
           A++ MY   G +D A++ F +M  R++F  T M+ GY+K G  D A+++F+    K++  
Sbjct: 255 ALVTMYAGAGCMDMAREFFRKMSVRNLFVSTAMVSGYSKCGRLDDAQVIFDQTEKKDLVC 314

Query: 328 WNVLISAYEQNGKPKEALAIFNELQLSKIAKPDEVTLVSTLSACAQLGAIDLGGWIHVYT 387
           W  +ISAY ++  P+EAL +F E+  S I KPD V++ S +SACA LG +D   W+H   
Sbjct: 315 WTTMISAYVESDYPQEALRVFEEMCCSGI-KPDVVSMFSVISACANLGILDKAKWVHSCI 374

Query: 388 KREGIVLNCHLISSLVDMYAKCGSLEKALEVFYSVEERDVYVWSAMIAGLGMHGCGKAAI 447
              G+     + ++L++MYAKCG L+   +VF  +  R+V  WS+MI  L MHG    A+
Sbjct: 375 HVNGLESELSINNALINMYAKCGGLDATRDVFEKMPRRNVVSWSSMINALSMHGEASDAL 434

Query: 448 DLFFKMQEAKVKPNSVTFTNVLCACSHAGLVDEGRVFFHEMEPVYGVVPEMKHYACMVDI 507
            LF +M++  V+PN VTF  VL  CSH+GLV+EG+  F  M   Y + P+++HY CMVD+
Sbjct: 435 SLFARMKQENVEPNEVTFVGVLYGCSHSGLVEEGKKIFASMTDEYNITPKLEHYGCMVDL 494

Query: 508 LGRAGFLEEAMELINEMSTTPSASVWGALLGACSLHMNVELGELASDQLLKLEPRNHGAI 567
            GRA  L EA+E+I  M    +  +WG+L+ AC +H  +ELG+ A+ ++L+LEP + GA+
Sbjct: 495 FGRANLLREALEVIESMPVASNVVIWGSLMSACRIHGELELGKFAAKRILELEPDHDGAL 554

Query: 568 VLLSNIYAKTGRWEKVSELRKLMRDTELKKEPGCSSIEANGNVHEFLVGDNTHPLSSNIY 627
           VL+SNIYA+  RWE V  +R++M +  + KE G S I+ NG  HEFL+GD  H  S+ IY
Sbjct: 555 VLMSNIYAREQRWEDVRNIRRVMEEKNVFKEKGLSRIDQNGKSHEFLIGDKRHKQSNEIY 614

Query: 628 SKLEEIATKLKSVGYEPNKSHLLQLIEEDDLKEQALSLHSEKLAIAFGLVTLAPSQP--- 687
           +KL+E+ +KLK  GY P+   +L  +EE++ K+  L  HSEKLA+ FGL+     +    
Sbjct: 615 AKLDEVVSKLKLAGYVPDCGSVLVDVEEEEKKDLVL-WHSEKLALCFGLMNEEKEEEKDS 674

Query: 688 ---IRVVKNLRICGDCHAFAKLVSRVYDRDILLRDRYRFHHFRDGHCSCMDYW 734
              IR+VKNLR+C DCH F KLVS+VY+R+I++RDR RFH +++G CSC DYW
Sbjct: 675 CGVIRIVKNLRVCEDCHLFFKLVSKVYEREIIVRDRTRFHCYKNGLCSCRDYW 722

BLAST of CSPI01G23820 vs. TAIR 10
Match: AT3G12770.1 (mitochondrial editing factor 22)

HSP 1 Score: 538.1 bits (1385), Expect = 1.1e-152
Identity = 283/727 (38.93%), Positives = 431/727 (59.28%), Query Frame = 0

Query: 15  STLNNNLLFRNHQI------LSTIDKCSSSKQLKEVHARMLRTGLFFDPFSASKLFTASA 74
           S L + LL+ N  I       S ID  +   QLK++HAR+L  GL F  F  +KL  AS 
Sbjct: 5   SCLASPLLYTNSGIHSDSFYASLIDSATHKAQLKQIHARLLVLGLQFSGFLITKLIHAS- 64

Query: 75  LSSFSTLDYARNLFDQIPQPNLYTWNTLIRAYASSSDPFQSFVIFLDLLHKCEDLPNKFT 134
            SSF  + +AR +FD +P+P ++ WN +IR Y S ++ FQ  ++    +      P+ FT
Sbjct: 65  -SSFGDITFARQVFDDLPRPQIFPWNAIIRGY-SRNNHFQDALLMYSNMQLARVSPDSFT 124

Query: 135 FPFVIKAASELKASRVGTAVHGMAIKLSFGMDLYILNSLVRFYGACGDLSMAERLFKGIS 194
           FP ++KA S L   ++G  VH    +L F  D+++ N L+  Y  C  L  A  +F+G+ 
Sbjct: 125 FPHLLKACSGLSHLQMGRFVHAQVFRLGFDADVFVQNGLIALYAKCRRLGSARTVFEGLP 184

Query: 195 C--KDVVSWNSMISAFAQGNCPEDALELFLKMERENVMPNSVTMVGVLSACAKKLDLEFG 254
              + +VSW +++SA+AQ   P +ALE+F +M + +V P+ V +V VL+A     DL+ G
Sbjct: 185 LPERTIVSWTAIVSAYAQNGEPMEALEIFSQMRKMDVKPDWVALVSVLNAFTCLQDLKQG 244

Query: 255 RWVCSYIERKGIKVDLTLCNAMLDMYTKCGSVDDAQKLFDEMPERDVFSWTIMLDGYAKM 314
           R + + + + G++++  L  ++  MY KCG V  A+ LFD+M   ++  W  M+ GYAK 
Sbjct: 245 RSIHASVVKMGLEIEPDLLISLNTMYAKCGQVATAKILFDKMKSPNLILWNAMISGYAK- 304

Query: 315 GDYDAARLVFNAMPVKEIAAWNVLISAYEQNGKPKEALAIFNELQLSKIAKPDEVTLVST 374
                                         NG  +EA+ +F+E+ ++K  +PD +++ S 
Sbjct: 305 ------------------------------NGYAREAIDMFHEM-INKDVRPDTISITSA 364

Query: 375 LSACAQLGAIDLGGWIHVYTKREGIVLNCHLISSLVDMYAKCGSLEKALEVFYSVEERDV 434
           +SACAQ+G+++    ++ Y  R     +  + S+L+DM+AKCGS+E A  VF    +RDV
Sbjct: 365 ISACAQVGSLEQARSMYEYVGRSDYRDDVFISSALIDMFAKCGSVEGARLVFDRTLDRDV 424

Query: 435 YVWSAMIAGLGMHGCGKAAIDLFFKMQEAKVKPNSVTFTNVLCACSHAGLVDEGRVFFHE 494
            VWSAMI G G+HG  + AI L+  M+   V PN VTF  +L AC+H+G+V EG  FF+ 
Sbjct: 425 VVWSAMIVGYGLHGRAREAISLYRAMERGGVHPNDVTFLGLLMACNHSGMVREGWWFFNR 484

Query: 495 MEPVYGVVPEMKHYACMVDILGRAGFLEEAMELINEMSTTPSASVWGALLGACSLHMNVE 554
           M   + + P+ +HYAC++D+LGRAG L++A E+I  M   P  +VWGALL AC  H +VE
Sbjct: 485 MAD-HKINPQQQHYACVIDLLGRAGHLDQAYEVIKCMPVQPGVTVWGALLSACKKHRHVE 544

Query: 555 LGELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDTELKKEPGCSSIEAN 614
           LGE A+ QL  ++P N G  V LSN+YA    W++V+E+R  M++  L K+ GCS +E  
Sbjct: 545 LGEYAAQQLFSIDPSNTGHYVQLSNLYAAARLWDRVAEVRVRMKEKGLNKDVGCSWVEVR 604

Query: 615 GNVHEFLVGDNTHPLSSNIYSKLEEIATKLKSVGYEPNKSHLLQLIEEDDLKEQALSLHS 674
           G +  F VGD +HP    I  ++E I ++LK  G+  NK   L  + +++  E+ L  HS
Sbjct: 605 GRLEAFRVGDKSHPRYEEIERQVEWIESRLKEGGFVANKDASLHDLNDEE-AEETLCSHS 664

Query: 675 EKLAIAFGLVTLAPSQPIRVVKNLRICGDCHAFAKLVSRVYDRDILLRDRYRFHHFRDGH 734
           E++AIA+GL++     P+R+ KNLR C +CHA  KL+S++ DR+I++RD  RFHHF+DG 
Sbjct: 665 ERIAIAYGLISTPQGTPLRITKNLRACVNCHAATKLISKLVDREIVVRDTNRFHHFKDGV 694

BLAST of CSPI01G23820 vs. TAIR 10
Match: AT5G48910.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 523.5 bits (1347), Expect = 2.8e-148
Identity = 274/707 (38.76%), Positives = 413/707 (58.42%), Query Frame = 0

Query: 32  IDKCSSSKQLKEVHARMLRTGLFFDPFSASKL--FTASALSSFSTLDYARNLFDQIPQPN 91
           I+ C + + L ++HA  +++G   D  +A+++  F A++      LDYA  +F+Q+PQ N
Sbjct: 30  INNCRTIRDLSQIHAVFIKSGQMRDTLAAAEILRFCATSDLHHRDLDYAHKIFNQMPQRN 89

Query: 92  LYTWNTLIRAYASSSD--PFQSFVIFLDLLHKCEDLPNKFTFPFVIKAASELKASRVGTA 151
            ++WNT+IR ++ S +     +  +F +++      PN+FTFP V+KA ++    + G  
Sbjct: 90  CFSWNTIIRGFSESDEDKALIAITLFYEMMSDEFVEPNRFTFPSVLKACAKTGKIQEGKQ 149

Query: 152 VHGMAIKLSFGMDLYILNSLVRFYGACGDLSMAERLF-KGISCKDVVSWNSMISAFAQGN 211
           +HG+A+K  FG D +++++LVR Y  CG +  A  LF K I  KD+V             
Sbjct: 150 IHGLALKYGFGGDEFVMSNLVRMYVMCGFMKDARVLFYKNIIEKDMV------------- 209

Query: 212 CPEDALELFLKMERENVMPNSVTMVGVLSACAKKLDLEFGRWVCSYIERKGIKVDLTLCN 271
                                                                       
Sbjct: 210 ------------------------------------------------------------ 269

Query: 272 AMLDMYTKCGSVDDAQKLFDEMPERDVFSWTIMLDGYAKMGDYDAARLVFNAMPVKEIAA 331
            M D   + G               ++  W +M+DGY ++GD  AAR++F+ M  + + +
Sbjct: 270 VMTDRRKRDG---------------EIVLWNVMIDGYMRLGDCKAARMLFDKMRQRSVVS 329

Query: 332 WNVLISAYEQNGKPKEALAIFNELQLSKIAKPDEVTLVSTLSACAQLGAIDLGGWIHVYT 391
           WN +IS Y  NG  K+A+ +F E++   I +P+ VTLVS L A ++LG+++LG W+H+Y 
Sbjct: 330 WNTMISGYSLNGFFKDAVEVFREMKKGDI-RPNYVTLVSVLPAISRLGSLELGEWLHLYA 389

Query: 392 KREGIVLNCHLISSLVDMYAKCGSLEKALEVFYSVEERDVYVWSAMIAGLGMHGCGKAAI 451
           +  GI ++  L S+L+DMY+KCG +EKA+ VF  +   +V  WSAMI G  +HG    AI
Sbjct: 390 EDSGIRIDDVLGSALIDMYSKCGIIEKAIHVFERLPRENVITWSAMINGFAIHGQAGDAI 449

Query: 452 DLFFKMQEAKVKPNSVTFTNVLCACSHAGLVDEGRVFFHEMEPVYGVVPEMKHYACMVDI 511
           D F KM++A V+P+ V + N+L ACSH GLV+EGR +F +M  V G+ P ++HY CMVD+
Sbjct: 450 DCFCKMRQAGVRPSDVAYINLLTACSHGGLVEEGRRYFSQMVSVDGLEPRIEHYGCMVDL 509

Query: 512 LGRAGFLEEAMELINEMSTTPSASVWGALLGACSLHMNVELGELASDQLLKLEPRNHGAI 571
           LGR+G L+EA E I  M   P   +W ALLGAC +  NVE+G+  ++ L+ + P + GA 
Sbjct: 510 LGRSGLLDEAEEFILNMPIKPDDVIWKALLGACRMQGNVEMGKRVANILMDMVPHDSGAY 569

Query: 572 VLLSNIYAKTGRWEKVSELRKLMRDTELKKEPGCSSIEANGNVHEFLVGDNTHPLSSNIY 631
           V LSN+YA  G W +VSE+R  M++ +++K+PGCS I+ +G +HEF+V D++HP +  I 
Sbjct: 570 VALSNMYASQGNWSEVSEMRLRMKEKDIRKDPGCSLIDIDGVLHEFVVEDDSHPKAKEIN 629

Query: 632 SKLEEIATKLKSVGYEPNKSHLLQLIEEDDLKEQALSLHSEKLAIAFGLVTLAPSQPIRV 691
           S L EI+ KL+  GY P  + +L  +EE+D KE  L  HSEK+A AFGL++ +P +PIR+
Sbjct: 630 SMLVEISDKLRLAGYRPITTQVLLNLEEED-KENVLHYHSEKIATAFGLISTSPGKPIRI 646

Query: 692 VKNLRICGDCHAFAKLVSRVYDRDILLRDRYRFHHFRDGHCSCMDYW 734
           VKNLRIC DCH+  KL+S+VY R I +RDR RFHHF+DG CSCMDYW
Sbjct: 690 VKNLRICEDCHSSIKLISKVYKRKITVRDRKRFHHFQDGSCSCMDYW 646
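
The percentages in each HSP statistics line above are the match counts divided by the alignment length. The sketch below recomputes them from the printed fractions as a consistency check. The helper name `parse_hsp_stats` and the regular expression are illustrative assumptions, not part of the database or of any BLAST tool; the pattern accepts both the "Positives" spelling and the "Postives" variant that some report exports use.

```python
import re

# Hypothetical pattern for the statistics line of an HSP block, e.g.
# "Identity = 283/727 (38.93%), Positives = 431/727 (59.28%), Query Frame = 0"
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return counts plus recomputed and reported percentages for one HSP line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identical": int(ident),
        "positives": int(pos),
        "length": int(ident_len),
        # Recomputed from the fractions, rounded to two decimals.
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positives_pct": round(100 * int(pos) / int(pos_len), 2),
        # Values as printed in the report, for comparison.
        "reported_identity_pct": float(ident_pct),
        "reported_positives_pct": float(pos_pct),
    }


if __name__ == "__main__":
    stats = parse_hsp_stats(
        "Identity = 283/727 (38.93%), Positives = 431/727 (59.28%), Query Frame = 0"
    )
    print(stats["identity_pct"], stats["positives_pct"])
```

Comparing the recomputed and reported fields is a quick way to catch a mis-parsed or truncated report line before the values are used downstream.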

The following BLAST results are available for this feature:
Match Name  E-value   Identity  Description
O82380      1.6e-273  61.62     Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidop...
Q9LN01      8.7e-171  41.94     Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidop...
O23337      1.9e-157  38.99     Pentatricopeptide repeat-containing protein At4g14820 OS=Arabidopsis thaliana OX...
Q9LTV8      1.5e-151  38.93     Pentatricopeptide repeat-containing protein At3g12770 OS=Arabidopsis thaliana OX...
Q9FI80      3.9e-147  38.76     Pentatricopeptide repeat-containing protein At5g48910 OS=Arabidopsis thaliana OX...
Match Name  E-value  Identity  Description
A0A0A0M0R9  0.0e+00  99.45     DYW_deaminase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_1G5301...
A0A5D3BBW6  0.0e+00  96.45     Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946...
A0A1S3C623  0.0e+00  96.45     pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Cucumis ...
A0A5A7SKX2  0.0e+00  96.32     Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946...
A0A6J1HLG4  0.0e+00  87.72     pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Cucurbit...
Match Name      E-value  Identity  Description
XP_004145320.1  0.0e+00  99.45     pentatricopeptide repeat-containing protein At2g29760, chloroplastic [Cucumis sa...
XP_008457379.1  0.0e+00  96.45     PREDICTED: pentatricopeptide repeat-containing protein At2g29760, chloroplastic ...
KAA0031814.1    0.0e+00  96.32     pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa]
XP_038893523.1  0.0e+00  91.68     pentatricopeptide repeat-containing protein At2g29760, chloroplastic [Benincasa ...
XP_022964665.1  0.0e+00  87.72     pentatricopeptide repeat-containing protein At2g29760, chloroplastic [Cucurbita ...
Match Name   E-value   Identity  Description
AT2G29760.1  1.1e-274  61.62     Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G08070.1  6.2e-172  41.94     Tetratricopeptide repeat (TPR)-like superfamily protein
AT4G14820.1  1.3e-158  38.99     Pentatricopeptide repeat (PPR) superfamily protein
AT3G12770.1  1.1e-152  38.93     mitochondrial editing factor 22
AT5G48910.1  2.8e-148  38.76     Pentatricopeptide repeat (PPR) superfamily protein
InterPro
Analysis Name: InterPro Annotations of Cucumber (PI 183967) v1
Date Performed: 2021-10-25
IPR Term   IPR Description   Source   Source Term   Source Description   Alignment
IPR002885  Pentatricopeptide repeat  TIGRFAM  TIGR00756  TIGR00756
    coord: 399..427, e-value: 0.0032, score: 15.5
    coord: 193..226, e-value: 1.9E-7, score: 28.8
    coord: 500..524, e-value: 0.0014, score: 16.7
    coord: 462..495, e-value: 0.0015, score: 16.6
    coord: 427..460, e-value: 3.2E-6, score: 25.0
    coord: 265..293, e-value: 1.0E-4, score: 20.2
    coord: 294..321, e-value: 1.5E-5, score: 22.8
    coord: 326..359, e-value: 9.3E-5, score: 20.4
IPR002885  Pentatricopeptide repeat  PFAM  PF01535  PPR
    coord: 500..523, e-value: 0.0019, score: 18.3
    coord: 294..322, e-value: 3.2E-5, score: 23.9
    coord: 265..292, e-value: 6.8E-6, score: 26.0
    coord: 164..186, e-value: 0.5, score: 10.7
    coord: 91..115, e-value: 0.088, score: 13.1
IPR002885  Pentatricopeptide repeat  PFAM  PF13041  PPR_2
    coord: 190..239, e-value: 2.9E-11, score: 43.4
    coord: 425..472, e-value: 4.2E-9, score: 36.5
    coord: 324..371, e-value: 4.0E-7, score: 30.1
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 191..225, score: 12.353442
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 323..357, score: 9.689847
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 425..459, score: 9.996763
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 261..295, score: 11.070971
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D  1.25.40.10  Tetratricopeptide repeat domain
    coord: 144..241, e-value: 8.5E-21, score: 76.1
    coord: 9..143, e-value: 5.6E-9, score: 37.6
    coord: 242..326, e-value: 4.9E-20, score: 73.6
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D  1.25.40.10  Tetratricopeptide repeat domain
    coord: 327..468, e-value: 3.7E-30, score: 107.4
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D  1.25.40.10  Tetratricopeptide repeat domain
    coord: 469..606, e-value: 3.9E-16, score: 60.9
IPR011990  Tetratricopeptide-like helical domain superfamily  SUPERFAMILY  48452  TPR-like
    coord: 303..582
IPR032867  DYW domain  PFAM  PF14432  DYW_deaminase
    coord: 598..723, e-value: 9.0E-36, score: 122.6
None  No IPR available  PANTHER  PTHR47926:SF99  BNAANNG32650D PROTEIN
    coord: 190..305
None  No IPR available  PANTHER  PTHR47926  PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN
    coord: 171..238
    coord: 304..719
None  No IPR available  PANTHER  PTHR47926  PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN
    coord: 14..184
None  No IPR available  PANTHER  PTHR47926:SF99  BNAANNG32650D PROTEIN
    coord: 171..238
    coord: 304..719
    coord: 14..184
None  No IPR available  PANTHER  PTHR47926  PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN
    coord: 190..305
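
Each InterPro alignment above is reported as a `coord: start..end` range. As a minimal sketch, the ranges can be extracted and converted to span lengths as shown below. The function names are hypothetical, and the length calculation assumes the coordinates are 1-based, inclusive residue positions on the protein; the page itself does not state the convention.

```python
import re

# Matches entries such as "coord: 598..723" anywhere in a line of text.
COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def coord_spans(text):
    """Return every (start, end) pair found in the given text, in order."""
    return [(int(start), int(end)) for start, end in COORD.findall(text)]


def span_length(span):
    """Length of a span, ASSUMING 1-based inclusive coordinates."""
    start, end = span
    return end - start + 1


if __name__ == "__main__":
    # Example input copied from the DYW_deaminase (PF14432) entry.
    for span in coord_spans("coord: 598..723, e-value: 9.0E-36, score: 122.6"):
        print(span, span_length(span))
```

Under that assumption the DYW_deaminase hit at 598..723 spans 126 residues.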

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
CSPI01G23820.1  CSPI01G23820.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0005515 protein binding
molecular_function GO:0008270 zinc ion binding