CSPI01G31740 (gene) Cucumber (PI 183967) v1

Overview
NameCSPI01G31740
Typegene
OrganismCucumis sativus var. hardwickii cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionPentatricopeptide repeat-containing protein
LocationChr1: 26433147 .. 26435597 (+)
RNA-Seq ExpressionCSPI01G31740
SyntenyCSPI01G31740
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
TCTCAATCGGTCTTTAGTTATGTAATAGTTGGTGTCGGGATTGGCGAACTTGGCATACAAGGTTGCATTTCGTTGTTGGACTGTTATGAGTGGATTGATACATGAGTCAATAGCCCATGGCTGCACCAAATCATTTAACTCTCTCGTAAGTCGTCTTTCTTATCAAGGTGCTCACCATCAAGTTTTGCAAACATATATTTCTATGCAAAAAACCCACACCCAATTAGATGCTTACACTTTTCCCAGCCTCTTCAAAGCTTGTACCAATTTGAACTTATTTTCACATGGCCTCTCACTTCATCAATCTGTCGTCGTTAATGGGCTCTCTCATGATTCCTATATTGGGTCTTCGCTTATCAGTTTTTATGCGAAATTCGGGTGCATTCATCTTGGTCGCAAGGTGTTTGATACAATGCTCAAAAGAAATGTTGTTCCTTGGACTACCATAATTGGGTCTTATTCACGGGAAGGGGACATCGATATTGCTTTCTCAATGTTCAAACAAATGCGGGAGAGTGGTATTCAGCCCACTTCTGTTACCTTGTTGAGTCTGCTTCCTGGTATTTCAAAGCTTCCCCTTCTTCTTTGTTTGCATTGTTTGATTATTTTACATGGTTTTGAGTCAGACTTAGCTTTATCGAACTCCATGGTGAATATGTATGGTAAATGTGGCAGAATTGCTGATGCAAGAAGGTTGTTTGAGTCAATTGATTGCAGAGACATAGTTTCTTGGAATTCACTATTGTCTGCCTATTCGAAAATTGGAGCCACAGAAGAAATATTGCAGCTTCTACAAGCAATGAAGATTGAAGATATCAAACCTGACAAGCAGACTTTTTGCTCTGCTTTGTCTGCTTCTGCTATAAAGGGTGATCTTCGACTTGGTAAGTTAGTGCATGGTCTGATGCTCAAAGATGGGTTAAATATAGATCAACATGTAGAGTCGGCACTCGTAGTTTTATACTTGAGATGTAGATGTTTGGATCCCGCCTATAAAGTTTTCAAATCAACTACTGAAAAGGACGTGGTCATGTGGACAGCAATGATATCAGGACTTGTTCAGAACGATTGTGCTGACAAGGCATTGGGGGTCTTTTATCAAATGATCGAATCAAATGTCAAGCCGAGTACTGCTACCTTAGCTAGTGGTCTTGCAGCCTGTGCTCAACTTGGTTGTTGTGATATTGGTGCCTCAATTCATGGTTACGTATTAAGGCAAGGAATAATGCTAGACATCCCTGCTCAAAACTCTCTTGTCACCATGTATGCAAAGTGTAATAAGCTGCAGCAAAGTTGTTCAATTTTTAATAAGATGGTTGAAAAGGACTTAGTTTCTTGGAATGCAATTGTGGCTGGACATGCTAAAAATGGTTATTTAAGCAAAGGTATCTTTTTCTTCAATGAAATGAGAAAGAGCTTTCTAAGGCCCGACTCGATAACAGTGACCTCACTTCTTCAGGCTTGTGGTTCTGCTGGCGCACTTTGCCAGGGAAAGTGGATTCACAACTTTGTTCTTAGAAGTTCTCTTATTCCATGCATTATGACTGAAACAGCTCTAGTTGACATGTACTTCAAGTGCGGAAATTTAGAGAATGCTCAGAAGTGTTTTGATTGTATGTTACAACGAGATCTTGTAGCATGGAGCACCCTTATTGTTGGATATGGTTTTAATGGAAAAGGTGAAATCGCTTTGAGAAAATATTCAGAGTTTCTTGGCACAGGGATGGAACCAAATCATGTTATTTTCATTTCAGTTCTTTCTGCTTGTAGTCATGGTGGGCTTATTAGCAAAGGTTTGAGTATATATGAGTCAATGACTAAAGATTTTAGAATGTCACCAAATCTTGAGCATCGAGCTTGCGTCGTCGACCTTCTAAGTCGAGCTGGAAAGGTTGATGAGGCATATAGCTTCTATAAAACGATGTTTAAAGAACCCTCAATAGTTGTTTTAGGCATGCTCCTTGATGCTTGTCGGGTGAATGGTAGGGTCGAACTTGGAAAGGTTATTGCTAGAGATATGTTTGAATTAAAGCCTGTGGATCCTGGAAACTTTGTGCAACTTGCCAATAGTTATGCATCCATGAGTAGATGGGATGGAGTGGAGAAGGCATGGACTCAAATGAGATCTCTTGGTCTGAAAAAGTATCCAGGATGGAGTTCTATTGAAGTTCATGGAACCACTTTTACATTTTTTGCATCTCACAATTCACATCCTAAGATTGAAAAAATAATCTTGACAGTGAAAGCATTGAGCAAGAATATCAGAAATTTGTATGTTAAAAATGAGATTTGTGAGGATTTTGTTGAAAATTCATGAGATGCAGTTTCTCCTCCCTTTTGTTAAAATAAAATACATAGAATTGCATGAGCTACCAGAAGAATGTGTTTGGAATGGTATTGGTTGTGTACTACTTACCATGGTTTTCTAGTGCATTTTATGTTCATCCAACA

mRNA sequence

TCTCAATCGGTCTTTAGTTATGTAATAGTTGGTGTCGGGATTGGCGAACTTGGCATACAAGGTTGCATTTCGTTGTTGGACTGTTATGAGTGGATTGATACATGAGTCAATAGCCCATGGCTGCACCAAATCATTTAACTCTCTCGTAAGTCGTCTTTCTTATCAAGGTGCTCACCATCAAGTTTTGCAAACATATATTTCTATGCAAAAAACCCACACCCAATTAGATGCTTACACTTTTCCCAGCCTCTTCAAAGCTTGTACCAATTTGAACTTATTTTCACATGGCCTCTCACTTCATCAATCTGTCGTCGTTAATGGGCTCTCTCATGATTCCTATATTGGGTCTTCGCTTATCAGTTTTTATGCGAAATTCGGGTGCATTCATCTTGGTCGCAAGGTGTTTGATACAATGCTCAAAAGAAATGTTGTTCCTTGGACTACCATAATTGGGTCTTATTCACGGGAAGGGGACATCGATATTGCTTTCTCAATGTTCAAACAAATGCGGGAGAGTGGTATTCAGCCCACTTCTGTTACCTTGTTGAGTCTGCTTCCTGGTATTTCAAAGCTTCCCCTTCTTCTTTGTTTGCATTGTTTGATTATTTTACATGGTTTTGAGTCAGACTTAGCTTTATCGAACTCCATGGTGAATATGTATGGTAAATGTGGCAGAATTGCTGATGCAAGAAGGTTGTTTGAGTCAATTGATTGCAGAGACATAGTTTCTTGGAATTCACTATTGTCTGCCTATTCGAAAATTGGAGCCACAGAAGAAATATTGCAGCTTCTACAAGCAATGAAGATTGAAGATATCAAACCTGACAAGCAGACTTTTTGCTCTGCTTTGTCTGCTTCTGCTATAAAGGGTGATCTTCGACTTGGTAAGTTAGTGCATGGTCTGATGCTCAAAGATGGGTTAAATATAGATCAACATGTAGAGTCGGCACTCGTAGTTTTATACTTGAGATGTAGATGTTTGGATCCCGCCTATAAAGTTTTCAAATCAACTACTGAAAAGGACGTGGTCATGTGGACAGCAATGATATCAGGACTTGTTCAGAACGATTGTGCTGACAAGGCATTGGGGGTCTTTTATCAAATGATCGAATCAAATGTCAAGCCGAGTACTGCTACCTTAGCTAGTGGTCTTGCAGCCTGTGCTCAACTTGGTTGTTGTGATATTGGTGCCTCAATTCATGGTTACGTATTAAGGCAAGGAATAATGCTAGACATCCCTGCTCAAAACTCTCTTGTCACCATGTATGCAAAGTGTAATAAGCTGCAGCAAAGTTGTTCAATTTTTAATAAGATGGTTGAAAAGGACTTAGTTTCTTGGAATGCAATTGTGGCTGGACATGCTAAAAATGGTTATTTAAGCAAAGGTATCTTTTTCTTCAATGAAATGAGAAAGAGCTTTCTAAGGCCCGACTCGATAACAGTGACCTCACTTCTTCAGGCTTGTGGTTCTGCTGGCGCACTTTGCCAGGGAAAGTGGATTCACAACTTTGTTCTTAGAAGTTCTCTTATTCCATGCATTATGACTGAAACAGCTCTAGTTGACATGTACTTCAAGTGCGGAAATTTAGAGAATGCTCAGAAGTGTTTTGATTGTATGTTACAACGAGATCTTGTAGCATGGAGCACCCTTATTGTTGGATATGGTTTTAATGGAAAAGGTGAAATCGCTTTGAGAAAATATTCAGAGTTTCTTGGCACAGGGATGGAACCAAATCATGTTATTTTCATTTCAGTTCTTTCTGCTTGTAGTCATGGTGGGCTTATTAGCAAAGGTTTGAGTATATATGAGTCAATGACTAAAGATTTTAGAATGTCACCAAATCTTGAGCATCGAGCTTGCGTCGTCGACCTTCTAAGTCGAGCTGGAAAGGTTGATGAGGCATATAGCTTCTATAAAACGATGTTTAAAGAACCCTCAATAGTTGTTTTAGGCATGCTCCTTGATGCTTGTCGGGTGAATGGTAGGGTCGAACTTGGAAAGGTTATTGCTAGAGATATGTTTGAATTAAAGCCTGTGGATCCTGGAAACTTTGTGCAACTTGCCAATAGTTATGCATCCATGAGTAGATGGGATGGAGTGGAGAAGGCATGGACTCAAATGAGATCTCTTGGTCTGAAAAAGTATCCAGGATGGAGTTCTATTGAAGTTCATGGAACCACTTTTACATTTTTTGCATCTCACAATTCACATCCTAAGATTGAAAAAATAATCTTGACAGTGAAAGCATTGAGCAAGAATATCAGAAATTTGTATGTTAAAAATGAGATTTGTGAGGATTTTGTTGAAAATTCATGAGATGCAGTTTCTCCTCCCTTTTGTTAAAATAAAATACATAGAATTGCATGAGCTACCAGAAGAATGTGTTTGGAATGGTATTGGTTGTGTACTACTTACCATGGTTTTCTAGTGCATTTTATGTTCATCCAACA

Coding sequence (CDS)

ATGAGTGGATTGATACATGAGTCAATAGCCCATGGCTGCACCAAATCATTTAACTCTCTCGTAAGTCGTCTTTCTTATCAAGGTGCTCACCATCAAGTTTTGCAAACATATATTTCTATGCAAAAAACCCACACCCAATTAGATGCTTACACTTTTCCCAGCCTCTTCAAAGCTTGTACCAATTTGAACTTATTTTCACATGGCCTCTCACTTCATCAATCTGTCGTCGTTAATGGGCTCTCTCATGATTCCTATATTGGGTCTTCGCTTATCAGTTTTTATGCGAAATTCGGGTGCATTCATCTTGGTCGCAAGGTGTTTGATACAATGCTCAAAAGAAATGTTGTTCCTTGGACTACCATAATTGGGTCTTATTCACGGGAAGGGGACATCGATATTGCTTTCTCAATGTTCAAACAAATGCGGGAGAGTGGTATTCAGCCCACTTCTGTTACCTTGTTGAGTCTGCTTCCTGGTATTTCAAAGCTTCCCCTTCTTCTTTGTTTGCATTGTTTGATTATTTTACATGGTTTTGAGTCAGACTTAGCTTTATCGAACTCCATGGTGAATATGTATGGTAAATGTGGCAGAATTGCTGATGCAAGAAGGTTGTTTGAGTCAATTGATTGCAGAGACATAGTTTCTTGGAATTCACTATTGTCTGCCTATTCGAAAATTGGAGCCACAGAAGAAATATTGCAGCTTCTACAAGCAATGAAGATTGAAGATATCAAACCTGACAAGCAGACTTTTTGCTCTGCTTTGTCTGCTTCTGCTATAAAGGGTGATCTTCGACTTGGTAAGTTAGTGCATGGTCTGATGCTCAAAGATGGGTTAAATATAGATCAACATGTAGAGTCGGCACTCGTAGTTTTATACTTGAGATGTAGATGTTTGGATCCCGCCTATAAAGTTTTCAAATCAACTACTGAAAAGGACGTGGTCATGTGGACAGCAATGATATCAGGACTTGTTCAGAACGATTGTGCTGACAAGGCATTGGGGGTCTTTTATCAAATGATCGAATCAAATGTCAAGCCGAGTACTGCTACCTTAGCTAGTGGTCTTGCAGCCTGTGCTCAACTTGGTTGTTGTGATATTGGTGCCTCAATTCATGGTTACGTATTAAGGCAAGGAATAATGCTAGACATCCCTGCTCAAAACTCTCTTGTCACCATGTATGCAAAGTGTAATAAGCTGCAGCAAAGTTGTTCAATTTTTAATAAGATGGTTGAAAAGGACTTAGTTTCTTGGAATGCAATTGTGGCTGGACATGCTAAAAATGGTTATTTAAGCAAAGGTATCTTTTTCTTCAATGAAATGAGAAAGAGCTTTCTAAGGCCCGACTCGATAACAGTGACCTCACTTCTTCAGGCTTGTGGTTCTGCTGGCGCACTTTGCCAGGGAAAGTGGATTCACAACTTTGTTCTTAGAAGTTCTCTTATTCCATGCATTATGACTGAAACAGCTCTAGTTGACATGTACTTCAAGTGCGGAAATTTAGAGAATGCTCAGAAGTGTTTTGATTGTATGTTACAACGAGATCTTGTAGCATGGAGCACCCTTATTGTTGGATATGGTTTTAATGGAAAAGGTGAAATCGCTTTGAGAAAATATTCAGAGTTTCTTGGCACAGGGATGGAACCAAATCATGTTATTTTCATTTCAGTTCTTTCTGCTTGTAGTCATGGTGGGCTTATTAGCAAAGGTTTGAGTATATATGAGTCAATGACTAAAGATTTTAGAATGTCACCAAATCTTGAGCATCGAGCTTGCGTCGTCGACCTTCTAAGTCGAGCTGGAAAGGTTGATGAGGCATATAGCTTCTATAAAACGATGTTTAAAGAACCCTCAATAGTTGTTTTAGGCATGCTCCTTGATGCTTGTCGGGTGAATGGTAGGGTCGAACTTGGAAAGGTTATTGCTAGAGATATGTTTGAATTAAAGCCTGTGGATCCTGGAAACTTTGTGCAACTTGCCAATAGTTATGCATCCATGAGTAGATGGGATGGAGTGGAGAAGGCATGGACTCAAATGAGATCTCTTGGTCTGAAAAAGTATCCAGGATGGAGTTCTATTGAAGTTCATGGAACCACTTTTACATTTTTTGCATCTCACAATTCACATCCTAAGATTGAAAAAATAATCTTGACAGTGAAAGCATTGAGCAAGAATATCAGAAATTTGTATGTTAAAAATGAGATTTGTGAGGATTTTGTTGAAAATTCATGA

Protein sequence

MSGLIHESIAHGCTKSFNSLVSRLSYQGAHHQVLQTYISMQKTHTQLDAYTFPSLFKACTNLNLFSHGLSLHQSVVVNGLSHDSYIGSSLISFYAKFGCIHLGRKVFDTMLKRNVVPWTTIIGSYSREGDIDIAFSMFKQMRESGIQPTSVTLLSLLPGISKLPLLLCLHCLIILHGFESDLALSNSMVNMYGKCGRIADARRLFESIDCRDIVSWNSLLSAYSKIGATEEILQLLQAMKIEDIKPDKQTFCSALSASAIKGDLRLGKLVHGLMLKDGLNIDQHVESALVVLYLRCRCLDPAYKVFKSTTEKDVVMWTAMISGLVQNDCADKALGVFYQMIESNVKPSTATLASGLAACAQLGCCDIGASIHGYVLRQGIMLDIPAQNSLVTMYAKCNKLQQSCSIFNKMVEKDLVSWNAIVAGHAKNGYLSKGIFFFNEMRKSFLRPDSITVTSLLQACGSAGALCQGKWIHNFVLRSSLIPCIMTETALVDMYFKCGNLENAQKCFDCMLQRDLVAWSTLIVGYGFNGKGEIALRKYSEFLGTGMEPNHVIFISVLSACSHGGLISKGLSIYESMTKDFRMSPNLEHRACVVDLLSRAGKVDEAYSFYKTMFKEPSIVVLGMLLDACRVNGRVELGKVIARDMFELKPVDPGNFVQLANSYASMSRWDGVEKAWTQMRSLGLKKYPGWSSIEVHGTTFTFFASHNSHPKIEKIILTVKALSKNIRNLYVKNEICEDFVENS*
Homology
BLAST of CSPI01G31740 vs. ExPASy Swiss-Prot
Match: Q9XE98 (Pentatricopeptide repeat-containing protein At4g04370 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E99 PE=3 SV=1)

HSP 1 Score: 757.7 bits (1955), Expect = 1.3e-217
Identity = 371/724 (51.24%), Postives = 511/724 (70.58%), Query Frame = 0

Query: 4   LIHESIAHGCTKSFNSLVSRLSYQGAHHQVLQTYISMQKTHTQLDAYTFPSLFKACTNLN 63
           +I  S     TK FNS ++ LS  G H QVL T+ SM       D +TFPSL KAC +L 
Sbjct: 1   MIRTSSVLNSTKYFNSHINHLSSHGDHKQVLSTFSSMLANKLLPDTFTFPSLLKACASLQ 60

Query: 64  LFSHGLSLHQSVVVNGLSHDSYIGSSLISFYAKFGCIHLGRKVFDTMLKRNVVPWTTIIG 123
             S GLS+HQ V+VNG S D YI SSL++ YAKFG +   RKVF+ M +R+VV WT +IG
Sbjct: 61  RLSFGLSIHQQVLVNGFSSDFYISSSLVNLYAKFGLLAHARKVFEEMRERDVVHWTAMIG 120

Query: 124 SYSREGDIDIAFSMFKQMRESGIQPTSVTLLSLLPGISKLPLLLCLHCLIILHGFESDLA 183
            YSR G +  A S+  +MR  GI+P  VTLL +L G+ ++  L CLH   +++GF+ D+A
Sbjct: 121 CYSRAGIVGEACSLVNEMRFQGIKPGPVTLLEMLSGVLEITQLQCLHDFAVIYGFDCDIA 180

Query: 184 LSNSMVNMYGKCGRIADARRLFESIDCRDIVSWNSLLSAYSKIGATEEILQLLQAMKIED 243
           + NSM+N+Y KC  + DA+ LF+ ++ RD+VSWN+++S Y+ +G   EIL+LL  M+ + 
Sbjct: 181 VMNSMLNLYCKCDHVGDAKDLFDQMEQRDMVSWNTMISGYASVGNMSEILKLLYRMRGDG 240

Query: 244 IKPDKQTFCSALSASAIKGDLRLGKLVHGLMLKDGLNIDQHVESALVVLYLRCRCLDPAY 303
           ++PD+QTF ++LS S    DL +G+++H  ++K G ++D H+++AL+ +YL+C   + +Y
Sbjct: 241 LRPDQQTFGASLSVSGTMCDLEMGRMLHCQIVKTGFDVDMHLKTALITMYLKCGKEEASY 300

Query: 304 KVFKSTTEKDVVMWTAMISGLVQNDCADKALGVFYQMIESNVKPSTATLASGLAACAQLG 363
           +V ++   KDVV WT MISGL++   A+KAL VF +M++S    S+  +AS +A+CAQLG
Sbjct: 301 RVLETIPNKDVVCWTVMISGLMRLGRAEKALIVFSEMLQSGSDLSSEAIASVVASCAQLG 360

Query: 364 CCDIGASIHGYVLRQGIMLDIPAQNSLVTMYAKCNKLQQSCSIFNKMVEKDLVSWNAIVA 423
             D+GAS+HGYVLR G  LD PA NSL+TMYAKC  L +S  IF +M E+DLVSWNAI++
Sbjct: 361 SFDLGASVHGYVLRHGYTLDTPALNSLITMYAKCGHLDKSLVIFERMNERDLVSWNAIIS 420

Query: 424 GHAKNGYLSKGIFFFNEMR-KSFLRPDSITVTSLLQACGSAGALCQGKWIHNFVLRSSLI 483
           G+A+N  L K +  F EM+ K+  + DS TV SLLQAC SAGAL  GK IH  V+RS + 
Sbjct: 421 GYAQNVDLCKALLLFEEMKFKTVQQVDSFTVVSLLQACSSAGALPVGKLIHCIVIRSFIR 480

Query: 484 PCIMTETALVDMYFKCGNLENAQKCFDCMLQRDLVAWSTLIVGYGFNGKGEIALRKYSEF 543
           PC + +TALVDMY KCG LE AQ+CFD +  +D+V+W  LI GYGF+GKG+IAL  YSEF
Sbjct: 481 PCSLVDTALVDMYSKCGYLEAAQRCFDSISWKDVVSWGILIAGYGFHGKGDIALEIYSEF 540

Query: 544 LGTGMEPNHVIFISVLSACSHGGLISKGLSIYESMTKDFRMSPNLEHRACVVDLLSRAGK 603
           L +GMEPNHVIF++VLS+CSH G++ +GL I+ SM +DF + PN EH ACVVDLL RA +
Sbjct: 541 LHSGMEPNHVIFLAVLSSCSHNGMVQQGLKIFSSMVRDFGVEPNHEHLACVVDLLCRAKR 600

Query: 604 VDEAYSFYKTMFKEPSIVVLGMLLDACRVNGRVELGKVIARDMFELKPVDPGNFVQLANS 663
           +++A+ FYK  F  PSI VLG++LDACR NG+ E+  +I  DM ELKP D G++V+L +S
Sbjct: 601 IEDAFKFYKENFTRPSIDVLGIILDACRANGKTEVEDIICEDMIELKPGDAGHYVKLGHS 660

Query: 664 YASMSRWDGVEKAWTQMRSLGLKKYPGWSSIEVHGTTFTFFASHNSHPKIEKIILTVKAL 723
           +A+M RWD V ++W QMRSLGLKK PGWS IE++G T TFF +H SH   +  +  +K L
Sbjct: 661 FAAMKRWDDVSESWNQMRSLGLKKLPGWSKIEMNGKTTTFFMNHTSHS--DDTVSLLKLL 720

Query: 724 SKNI 727
           S+ +
Sbjct: 721 SREM 722

BLAST of CSPI01G31740 vs. ExPASy Swiss-Prot
Match: Q9C507 (Putative pentatricopeptide repeat-containing protein At1g69350, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-E66 PE=3 SV=1)

HSP 1 Score: 442.6 bits (1137), Expect = 8.9e-123
Identity = 234/690 (33.91%), Postives = 397/690 (57.54%), Query Frame = 0

Query: 34  LQTYISMQKTHTQLDAYTFPSLFKACT-NLNLFSHGLSLHQSVVVNGLSHDSYIGSSLIS 93
           +  Y  +    TQ+  + FPS+ +AC  +    S G  +H  ++  G+  D+ I +SL+ 
Sbjct: 85  IDLYHRLVSETTQISKFVFPSVLRACAGSREHLSVGGKVHGRIIKGGVDDDAVIETSLLC 144

Query: 94  FYAKFGCIHLGRKVFDTMLKRNVVPWTTIIGSYSREGDIDIAFSMFKQMRESGIQPTSVT 153
            Y + G +    KVFD M  R++V W+T++ S    G++  A  MFK M + G++P +VT
Sbjct: 145 MYGQTGNLSDAEKVFDGMPVRDLVAWSTLVSSCLENGEVVKALRMFKCMVDDGVEPDAVT 204

Query: 154 LLSLLPGISKLPLLLCLHCLIILHG------FESDLALSNSMVNMYGKCGRIADARRLFE 213
           ++S++ G ++L    CL     +HG      F+ D  L NS++ MY KCG +  + R+FE
Sbjct: 205 MISVVEGCAELG---CLRIARSVHGQITRKMFDLDETLCNSLLTMYSKCGDLLSSERIFE 264

Query: 214 SIDCRDIVSWNSLLSAYSKIGATEEILQLLQAMKIEDIKPDKQTFCSALSASAIKGDLRL 273
            I  ++ VSW +++S+Y++   +E+ L+    M    I+P+  T  S LS+  + G +R 
Sbjct: 265 KIAKKNAVSWTAMISSYNRGEFSEKALRSFSEMIKSGIEPNLVTLYSVLSSCGLIGLIRE 324

Query: 274 GKLVHGLMLKDGLNID-QHVESALVVLYLRCRCLDPAYKVFKSTTEKDVVMWTAMISGLV 333
           GK VHG  ++  L+ + + +  ALV LY  C  L     V +  +++++V W ++IS   
Sbjct: 325 GKSVHGFAVRRELDPNYESLSLALVELYAECGKLSDCETVLRVVSDRNIVAWNSLISLYA 384

Query: 334 QNDCADKALGVFYQMIESNVKPSTATLASGLAACAQLGCCDIGASIHGYVLRQGIMLDIP 393
                 +ALG+F QM+   +KP   TLAS ++AC   G   +G  IHG+V+R  +  D  
Sbjct: 385 HRGMVIQALGLFRQMVTQRIKPDAFTLASSISACENAGLVPLGKQIHGHVIRTDVS-DEF 444

Query: 394 AQNSLVTMYAKCNKLQQSCSIFNKMVEKDLVSWNAIVAGHAKNGYLSKGIFFFNEMRKSF 453
            QNSL+ MY+K   +  + ++FN++  + +V+WN+++ G ++NG   + I  F+ M  S+
Sbjct: 445 VQNSLIDMYSKSGSVDSASTVFNQIKHRSVVTWNSMLCGFSQNGNSVEAISLFDYMYHSY 504

Query: 454 LRPDSITVTSLLQACGSAGALCQGKWIHNFVLRSSLIPCIMTETALVDMYFKCGNLENAQ 513
           L  + +T  +++QAC S G+L +GKW+H+ ++ S L   + T+TAL+DMY KCG+L  A+
Sbjct: 505 LEMNEVTFLAVIQACSSIGSLEKGKWVHHKLIISGL-KDLFTDTALIDMYAKCGDLNAAE 564

Query: 514 KCFDCMLQRDLVAWSTLIVGYGFNGKGEIALRKYSEFLGTGMEPNHVIFISVLSACSHGG 573
             F  M  R +V+WS++I  YG +G+   A+  +++ + +G +PN V+F++VLSAC H G
Sbjct: 565 TVFRAMSSRSIVSWSSMINAYGMHGRIGSAISTFNQMVESGTKPNEVVFMNVLSACGHSG 624

Query: 574 LISKGLSIYESMTKDFRMSPNLEHRACVVDLLSRAGKVDEAYSFYKTMFKEPSIVVLGML 633
            + +G   Y ++ K F +SPN EH AC +DLLSR+G + EAY   K M       V G L
Sbjct: 625 SVEEG-KYYFNLMKSFGVSPNSEHFACFIDLLSRSGDLKEAYRTIKEMPFLADASVWGSL 684

Query: 634 LDACRVNGRVELGKVIARDMFELKPVDPGNFVQLANSYASMSRWDGVEKAWTQMRSLGLK 693
           ++ CR++ ++++ K I  D+ ++   D G +  L+N YA    W+   +  + M+S  LK
Sbjct: 685 VNGCRIHQKMDIIKAIKNDLSDIVTDDTGYYTLLSNIYAEEGEWEEFRRLRSAMKSSNLK 744

Query: 694 KYPGWSSIEVHGTTFTFFASHNSHPKIEKI 716
           K PG+S+IE+    F F A   +  + ++I
Sbjct: 745 KVPGYSAIEIDQKVFRFGAGEENRIQTDEI 768

BLAST of CSPI01G31740 vs. ExPASy Swiss-Prot
Match: Q9SS60 (Pentatricopeptide repeat-containing protein At3g03580 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H23 PE=2 SV=1)

HSP 1 Score: 431.4 bits (1108), Expect = 2.1e-119
Identity = 233/709 (32.86%), Postives = 388/709 (54.72%), Query Frame = 0

Query: 17  FNSLVSRLSYQGAHHQVLQTYISMQKTHTQLDAYTFPSLFKACTNLNLFSHGLSLHQSVV 76
           +NS++   S  G   + L+ Y  ++++    D YTFPS+ KAC  L     G  +++ ++
Sbjct: 74  WNSIIRAFSKNGLFPEALEFYGKLRESKVSPDKYTFPSVIKACAGLFDAEMGDLVYEQIL 133

Query: 77  VNGLSHDSYIGSSLISFYAKFGCIHLGRKVFDTMLKRNVVPWTTIIGSYSREGDIDIAFS 136
             G   D ++G++L+  Y++ G +   R+VFD M  R++V W ++I  YS  G  + A  
Sbjct: 134 DMGFESDLFVGNALVDMYSRMGLLTRARQVFDEMPVRDLVSWNSLISGYSSHGYYEEALE 193

Query: 137 MFKQMRESGIQPTSVTLLSLLPGISKLPLL---LCLHCLIILHGFESDLALSNSMVNMYG 196
           ++ +++ S I P S T+ S+LP    L ++     LH   +  G  S + ++N +V MY 
Sbjct: 194 IYHELKNSWIVPDSFTVSSVLPAFGNLLVVKQGQGLHGFALKSGVNSVVVVNNGLVAMYL 253

Query: 197 KCGRIADARRLFESIDCRDIVSWNSLLSAYSKIGATEEILQLLQAMKIEDIKPDKQTFCS 256
           K  R  DARR+F+ +D RD VS+N+++  Y K+   EE +++     ++  KPD  T  S
Sbjct: 254 KFRRPTDARRVFDEMDVRDSVSYNTMICGYLKLEMVEESVRMF-LENLDQFKPDLLTVSS 313

Query: 257 ALSASAIKGDLRLGKLVHGLMLKDGLNIDQHVESALVVLYLRCRCLDPAYKVFKSTTEKD 316
            L A     DL L K ++  MLK G  ++  V + L+ +Y +C  +  A  VF S   KD
Sbjct: 314 VLRACGHLRDLSLAKYIYNYMLKAGFVLESTVRNILIDVYAKCGDMITARDVFNSMECKD 373

Query: 317 VVMWTAMISGLVQNDCADKALGVFYQMIESNVKPSTATLASGLAACAQLGCCDIGASIHG 376
            V W ++ISG +Q+    +A+ +F  M+    +    T    ++   +L     G  +H 
Sbjct: 374 TVSWNSIISGYIQSGDLMEAMKLFKMMMIMEEQADHITYLMLISVSTRLADLKFGKGLHS 433

Query: 377 YVLRQGIMLDIPAQNSLVTMYAKCNKLQQSCSIFNKMVEKDLVSWNAIVAGHAKNGYLSK 436
             ++ GI +D+   N+L+ MYAKC ++  S  IF+ M   D V+WN +++   + G  + 
Sbjct: 434 NGIKSGICIDLSVSNALIDMYAKCGEVGDSLKIFSSMGTGDTVTWNTVISACVRFGDFAT 493

Query: 437 GIFFFNEMRKSFLRPDSITVTSLLQACGSAGALCQGKWIHNFVLRSSLIPCIMTETALVD 496
           G+    +MRKS + PD  T    L  C S  A   GK IH  +LR      +    AL++
Sbjct: 494 GLQVTTQMRKSEVVPDMATFLVTLPMCASLAAKRLGKEIHCCLLRFGYESELQIGNALIE 553

Query: 497 MYFKCGNLENAQKCFDCMLQRDLVAWSTLIVGYGFNGKGEIALRKYSEFLGTGMEPNHVI 556
           MY KCG LEN+ + F+ M +RD+V W+ +I  YG  G+GE AL  +++   +G+ P+ V+
Sbjct: 554 MYSKCGCLENSSRVFERMSRRDVVTWTGMIYAYGMYGEGEKALETFADMEKSGIVPDSVV 613

Query: 557 FISVLSACSHGGLISKGLSIYESMTKDFRMSPNLEHRACVVDLLSRAGKVDEAYSFYKTM 616
           FI+++ ACSH GL+ +GL+ +E M   +++ P +EH ACVVDLLSR+ K+ +A  F + M
Sbjct: 614 FIAIIYACSHSGLVDEGLACFEKMKTHYKIDPMIEHYACVVDLLSRSQKISKAEEFIQAM 673

Query: 617 FKEPSIVVLGMLLDACRVNGRVELGKVIARDMFELKPVDPGNFVQLANSYASMSRWDGVE 676
             +P   +   +L ACR +G +E  + ++R + EL P DPG  +  +N+YA++ +WD V 
Sbjct: 674 PIKPDASIWASVLRACRTSGDMETAERVSRRIIELNPDDPGYSILASNAYAALRKWDKVS 733

Query: 677 KAWTQMRSLGLKKYPGWSSIEVHGTTFTFFASHNSHPKIEKIILTVKAL 723
                ++   + K PG+S IEV      F +  +S P+ E I  +++ L
Sbjct: 734 LIRKSLKDKHITKNPGYSWIEVGKNVHVFSSGDDSAPQSEAIYKSLEIL 781

BLAST of CSPI01G31740 vs. ExPASy Swiss-Prot
Match: Q3E6Q1 (Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H40 PE=2 SV=1)

HSP 1 Score: 428.7 bits (1101), Expect = 1.3e-118
Identity = 225/692 (32.51%), Postives = 388/692 (56.07%), Query Frame = 0

Query: 41  QKTHTQLDAYTFPS--LFKACTNLNLFSHGLSLHQSVVVNGLSHDSYIGSSLISFYAKFG 100
           ++ +   + Y  P+  L + C++L      L L   V  NGL  + +  + L+S + ++G
Sbjct: 27  ERNYIPANVYEHPAALLLERCSSLKELRQILPL---VFKNGLYQEHFFQTKLVSLFCRYG 86

Query: 101 CIHLGRKVFDTMLKRNVVPWTTIIGSYSREGDIDIAFSMFKQMRESGIQPTSVT---LLS 160
            +    +VF+ +  +  V + T++  +++  D+D A   F +MR   ++P       LL 
Sbjct: 87  SVDEAARVFEPIDSKLNVLYHTMLKGFAKVSDLDKALQFFVRMRYDDVEPVVYNFTYLLK 146

Query: 161 LLPGISKLPLLLCLHCLIILHGFESDLALSNSMVNMYGKCGRIADARRLFESIDCRDIVS 220
           +    ++L +   +H L++  GF  DL     + NMY KC ++ +AR++F+ +  RD+VS
Sbjct: 147 VCGDEAELRVGKEIHGLLVKSGFSLDLFAMTGLENMYAKCRQVNEARKVFDRMPERDLVS 206

Query: 221 WNSLLSAYSKIGATEEILQLLQAMKIEDIKPDKQTFCSALSASAIKGDLRLGKLVHGLML 280
           WN++++ YS+ G     L+++++M  E++KP   T  S L A +    + +GK +HG  +
Sbjct: 207 WNTIVAGYSQNGMARMALEMVKSMCEENLKPSFITIVSVLPAVSALRLISVGKEIHGYAM 266

Query: 281 KDGLNIDQHVESALVVLYLRCRCLDPAYKVFKSTTEKDVVMWTAMISGLVQNDCADKALG 340
           + G +   ++ +ALV +Y +C  L+ A ++F    E++VV W +MI   VQN+   +A+ 
Sbjct: 267 RSGFDSLVNISTALVDMYAKCGSLETARQLFDGMLERNVVSWNSMIDAYVQNENPKEAML 326

Query: 341 VFYQMIESNVKPSTATLASGLAACAQLGCCDIGASIHGYVLRQGIMLDIPAQNSLVTMYA 400
           +F +M++  VKP+  ++   L ACA LG  + G  IH   +  G+  ++   NSL++MY 
Sbjct: 327 IFQKMLDEGVKPTDVSVMGALHACADLGDLERGRFIHKLSVELGLDRNVSVVNSLISMYC 386

Query: 401 KCNKLQQSCSIFNKMVEKDLVSWNAIVAGHAKNGYLSKGIFFFNEMRKSFLRPDSITVTS 460
           KC ++  + S+F K+  + LVSWNA++ G A+NG     + +F++MR   ++PD+ T  S
Sbjct: 387 KCKEVDTAASMFGKLQSRTLVSWNAMILGFAQNGRPIDALNYFSQMRSRTVKPDTFTYVS 446

Query: 461 LLQACGSAGALCQGKWIHNFVLRSSLIPCIMTETALVDMYFKCGNLENAQKCFDCMLQRD 520
           ++ A          KWIH  V+RS L   +   TALVDMY KCG +  A+  FD M +R 
Sbjct: 447 VITAIAELSITHHAKWIHGVVMRSCLDKNVFVTTALVDMYAKCGAIMIARLIFDMMSERH 506

Query: 521 LVAWSTLIVGYGFNGKGEIALRKYSEFLGTGMEPNHVIFISVLSACSHGGLISKGLSIYE 580
           +  W+ +I GYG +G G+ AL  + E     ++PN V F+SV+SACSH GL+  GL  + 
Sbjct: 507 VTTWNAMIDGYGTHGFGKAALELFEEMQKGTIKPNGVTFLSVISACSHSGLVEAGLKCFY 566

Query: 581 SMTKDFRMSPNLEHRACVVDLLSRAGKVDEAYSFYKTMFKEPSIVVLGMLLDACRVNGRV 640
            M +++ +  +++H   +VDLL RAG+++EA+ F   M  +P++ V G +L AC+++  V
Sbjct: 567 MMKENYSIELSMDHYGAMVDLLGRAGRLNEAWDFIMQMPVKPAVNVYGAMLGACQIHKNV 626

Query: 641 ELGKVIARDMFELKPVDPGNFVQLANSYASMSRWDGVEKAWTQMRSLGLKKYPGWSSIEV 700
              +  A  +FEL P D G  V LAN Y + S W+ V +    M   GL+K PG S +E+
Sbjct: 627 NFAEKAAERLFELNPDDGGYHVLLANIYRAASMWEKVGQVRVSMLRQGLRKTPGCSMVEI 686

Query: 701 HGTTFTFFASHNSHPKIEKIILTVKALSKNIR 728
                +FF+   +HP  +KI   ++ L  +I+
Sbjct: 687 KNEVHSFFSGSTAHPDSKKIYAFLEKLICHIK 715

BLAST of CSPI01G31740 vs. ExPASy Swiss-Prot
Match: Q9ZUW3 (Pentatricopeptide repeat-containing protein At2g27610 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H60 PE=2 SV=1)

HSP 1 Score: 421.0 bits (1081), Expect = 2.8e-116
Identity = 226/720 (31.39%), Postives = 397/720 (55.14%), Query Frame = 0

Query: 15  KSFNSLVSRLSYQGAHHQVLQTYISMQKTHTQLDAYTFPSLFKACTNLNLFSHGLSLHQS 74
           +S+ SL+   S  G   +  + ++++ +   ++D   F S+ K    L     G  LH  
Sbjct: 59  ESYISLLFGFSRDGRTQEAKRLFLNIHRLGMEMDCSIFSSVLKVSATLCDELFGRQLHCQ 118

Query: 75  VVVNGLSHDSYIGSSLISFYAKFGCIHLGRKVFDTMLKRNVVPWTTIIGSYSREGDIDIA 134
            +  G   D  +G+SL+  Y K      GRKVFD M +RNVV WTT+I  Y+R    D  
Sbjct: 119 CIKFGFLDDVSVGTSLVDTYMKGSNFKDGRKVFDEMKERNVVTWTTLISGYARNSMNDEV 178

Query: 135 FSMFKQMRESGIQPTSVTLLSLLPGISKLPL---LLCLHCLIILHGFESDLALSNSMVNM 194
            ++F +M+  G QP S T  + L  +++  +    L +H +++ +G +  + +SNS++N+
Sbjct: 179 LTLFMRMQNEGTQPNSFTFAAALGVLAEEGVGGRGLQVHTVVVKNGLDKTIPVSNSLINL 238

Query: 195 YGKCGRIADARRLFESIDCRDIVSWNSLLSAYSKIGATEEILQLLQAMKIEDIKPDKQTF 254
           Y KCG +  AR LF+  + + +V+WNS++S Y+  G   E L +  +M++  ++  + +F
Sbjct: 239 YLKCGNVRKARILFDKTEVKSVVTWNSMISGYAANGLDLEALGMFYSMRLNYVRLSESSF 298

Query: 255 CSALSASAIKGDLRLGKLVHGLMLKDGLNIDQHVESALVVLYLRCRCLDPAYKVFKST-T 314
            S +   A   +LR  + +H  ++K G   DQ++ +AL+V Y +C  +  A ++FK    
Sbjct: 299 ASVIKLCANLKELRFTEQLHCSVVKYGFLFDQNIRTALMVAYSKCTAMLDALRLFKEIGC 358

Query: 315 EKDVVMWTAMISGLVQNDCADKALGVFYQMIESNVKPSTATLASGLAACAQLGCCDIGAS 374
             +VV WTAMISG +QND  ++A+ +F +M    V+P+  T +  L A   +      + 
Sbjct: 359 VGNVVSWTAMISGFLQNDGKEEAVDLFSEMKRKGVRPNEFTYSVILTALPVIS----PSE 418

Query: 375 IHGYVLRQGIMLDIPAQNSLVTMYAKCNKLQQSCSIFNKMVEKDLVSWNAIVAGHAKNGY 434
           +H  V++           +L+  Y K  K++++  +F+ + +KD+V+W+A++AG+A+ G 
Sbjct: 419 VHAQVVKTNYERSSTVGTALLDAYVKLGKVEEAAKVFSGIDDKDIVAWSAMLAGYAQTGE 478

Query: 435 LSKGIFFFNEMRKSFLRPDSITVTSLLQACGSAGA-LCQGKWIHNFVLRSSLIPCIMTET 494
               I  F E+ K  ++P+  T +S+L  C +  A + QGK  H F ++S L   +   +
Sbjct: 479 TEAAIKMFGELTKGGIKPNEFTFSSILNVCAATNASMGQGKQFHGFAIKSRLDSSLCVSS 538

Query: 495 ALVDMYFKCGNLENAQKCFDCMLQRDLVAWSTLIVGYGFNGKGEIALRKYSEFLGTGMEP 554
           AL+ MY K GN+E+A++ F    ++DLV+W+++I GY  +G+   AL  + E     ++ 
Sbjct: 539 ALLTMYAKKGNIESAEEVFKRQREKDLVSWNSMISGYAQHGQAMKALDVFKEMKKRKVKM 598

Query: 555 NHVIFISVLSACSHGGLISKGLSIYESMTKDFRMSPNLEHRACVVDLLSRAGKVDEAYSF 614
           + V FI V +AC+H GL+ +G   ++ M +D +++P  EH +C+VDL SRAG++++A   
Sbjct: 599 DGVTFIGVFAACTHAGLVEEGEKYFDIMVRDCKIAPTKEHNSCMVDLYSRAGQLEKAMKV 658

Query: 615 YKTMFKEPSIVVLGMLLDACRVNGRVELGKVIARDMFELKPVDPGNFVQLANSYASMSRW 674
            + M       +   +L ACRV+ + ELG++ A  +  +KP D   +V L+N YA    W
Sbjct: 659 IENMPNPAGSTIWRTILAACRVHKKTELGRLAAEKIIAMKPEDSAAYVLLSNMYAESGDW 718

Query: 675 DGVEKAWTQMRSLGLKKYPGWSSIEVHGTTFTFFASHNSHPKIEKIILTVKALSKNIRNL 730
               K    M    +KK PG+S IEV   T++F A   SHP  ++I + ++ LS  +++L
Sbjct: 719 QERAKVRKLMNERNVKKEPGYSWIEVKNKTYSFLAGDRSHPLKDQIYMKLEDLSTRLKDL 774

BLAST of CSPI01G31740 vs. ExPASy TrEMBL
Match: A0A0A0M0F8 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G650050 PE=4 SV=1)

HSP 1 Score: 1491.1 bits (3859), Expect = 0.0e+00
Identity = 739/743 (99.46%), Postives = 740/743 (99.60%), Query Frame = 0

Query: 1   MSGLIHESIAHGCTKSFNSLVSRLSYQGAHHQVLQTYISMQKTHTQLDAYTFPSLFKACT 60
           MSG IHESIAHGCTKSFNSLVSRLSYQGAHHQVLQTYISMQKTHTQLDAYTFPSLFKACT
Sbjct: 1   MSGFIHESIAHGCTKSFNSLVSRLSYQGAHHQVLQTYISMQKTHTQLDAYTFPSLFKACT 60

Query: 61  NLNLFSHGLSLHQSVVVNGLSHDSYIGSSLISFYAKFGCIHLGRKVFDTMLKRNVVPWTT 120
           NLNLFSHGLSLHQSVVVNGLSHDSYIGSSLISFYAKFGCIHLGRKVFDTMLKRNVVPWTT
Sbjct: 61  NLNLFSHGLSLHQSVVVNGLSHDSYIGSSLISFYAKFGCIHLGRKVFDTMLKRNVVPWTT 120

Query: 121 IIGSYSREGDIDIAFSMFKQMRESGIQPTSVTLLSLLPGISKLPLLLCLHCLIILHGFES 180
           IIGSYSREGDIDIAFSMFKQMRESGIQPTSVTLLSLLPGISKLPLLLCLHCLIILHGFES
Sbjct: 121 IIGSYSREGDIDIAFSMFKQMRESGIQPTSVTLLSLLPGISKLPLLLCLHCLIILHGFES 180

Query: 181 DLALSNSMVNMYGKCGRIADARRLFESIDCRDIVSWNSLLSAYSKIGATEEILQLLQAMK 240
           DLALSNSMVNMYGKCGRIADARRLF+SIDCRDIVSWNSLLSAYSKIGATEEILQLLQAMK
Sbjct: 181 DLALSNSMVNMYGKCGRIADARRLFQSIDCRDIVSWNSLLSAYSKIGATEEILQLLQAMK 240

Query: 241 IEDIKPDKQTFCSALSASAIKGDLRLGKLVHGLMLKDGLNIDQHVESALVVLYLRCRCLD 300
           IEDIKPDKQTFCSALSASAIKGDLRLGKLVHGLMLKDGLNIDQHVESALVVLYLRCRCLD
Sbjct: 241 IEDIKPDKQTFCSALSASAIKGDLRLGKLVHGLMLKDGLNIDQHVESALVVLYLRCRCLD 300

Query: 301 PAYKVFKSTTEKDVVMWTAMISGLVQNDCADKALGVFYQMIESNVKPSTATLASGLAACA 360
           PAYKVFKSTTEKDVVMWTAMISGLVQNDCADKALGVFYQMIESNVKPSTATLASGLAACA
Sbjct: 301 PAYKVFKSTTEKDVVMWTAMISGLVQNDCADKALGVFYQMIESNVKPSTATLASGLAACA 360

Query: 361 QLGCCDIGASIHGYVLRQGIMLDIPAQNSLVTMYAKCNKLQQSCSIFNKMVEKDLVSWNA 420
           QLGCCDIGASIHGYVLRQGIMLDIPAQNSLVTMYAKCNKLQQSCSIFNKMVEKDLVSWNA
Sbjct: 361 QLGCCDIGASIHGYVLRQGIMLDIPAQNSLVTMYAKCNKLQQSCSIFNKMVEKDLVSWNA 420

Query: 421 IVAGHAKNGYLSKGIFFFNEMRKSFLRPDSITVTSLLQACGSAGALCQGKWIHNFVLRSS 480
           IVAGHAKNGYLSKGIFFFNEMRKSFLRPDSITVTSLLQACGSAGALCQGKWIHNFVLRSS
Sbjct: 421 IVAGHAKNGYLSKGIFFFNEMRKSFLRPDSITVTSLLQACGSAGALCQGKWIHNFVLRSS 480

Query: 481 LIPCIMTETALVDMYFKCGNLENAQKCFDCMLQRDLVAWSTLIVGYGFNGKGEIALRKYS 540
           LIPCIMTETALVDMYFKCGNLENAQKCFDCMLQRDLVAWSTLIVGYGFNGKGEIALRKYS
Sbjct: 481 LIPCIMTETALVDMYFKCGNLENAQKCFDCMLQRDLVAWSTLIVGYGFNGKGEIALRKYS 540

Query: 541 EFLGTGMEPNHVIFISVLSACSHGGLISKGLSIYESMTKDFRMSPNLEHRACVVDLLSRA 600
           EFLGTGMEPNHVIFISVLSACSHGGLISKGLSIYESMTKDFRMSPNLEHRACVVDLLSRA
Sbjct: 541 EFLGTGMEPNHVIFISVLSACSHGGLISKGLSIYESMTKDFRMSPNLEHRACVVDLLSRA 600

Query: 601 GKVDEAYSFYKTMFKEPSIVVLGMLLDACRVNGRVELGKVIARDMFELKPVDPGNFVQLA 660
           GKVDEAYSFYK MFKEPSIVVLGMLLDACRVNGRVELGKVIARDMFELKPVDPGNFVQLA
Sbjct: 601 GKVDEAYSFYKMMFKEPSIVVLGMLLDACRVNGRVELGKVIARDMFELKPVDPGNFVQLA 660

Query: 661 NSYASMSRWDGVEKAWTQMRSLGLKKYPGWSSIEVHGTTFTFFASHNSHPKIEKIILTVK 720
           NSYASMSRWDGVEKAWTQMRSLGLKKYPGWSSIEVHGTTFTFFASHNSHPKIEKIILTVK
Sbjct: 661 NSYASMSRWDGVEKAWTQMRSLGLKKYPGWSSIEVHGTTFTFFASHNSHPKIEKIILTVK 720

Query: 721 ALSKNIRNLYVKNEICEDFVENS 744
           ALSKNIRNLYVKNEICEDFVE S
Sbjct: 721 ALSKNIRNLYVKNEICEDFVEYS 743

BLAST of CSPI01G31740 vs. ExPASy TrEMBL
Match: A0A1S4DUX5 (pentatricopeptide repeat-containing protein At4g04370 OS=Cucumis melo OX=3656 GN=LOC103487188 PE=4 SV=1)

HSP 1 Score: 1426.4 bits (3691), Expect = 0.0e+00
Identity = 708/742 (95.42%), Postives = 723/742 (97.44%), Query Frame = 0

Query: 1   MSGLIHESIAHGCTKSFNSLVSRLSYQGAHHQVLQTYISMQKTHTQLDAYTFPSLFKACT 60
           MS LIHESIAHG TKSFNSLVSRLS QGAHHQVLQTYISMQKTHT  DAYTFPSLFKACT
Sbjct: 1   MSRLIHESIAHGSTKSFNSLVSRLSSQGAHHQVLQTYISMQKTHTPSDAYTFPSLFKACT 60

Query: 61  NLNLFSHGLSLHQSVVVNGLSHDSYIGSSLISFYAKFGCIHLGRKVFDTMLKRNVVPWTT 120
           NLNLFSHGLSLHQSVVVNGLSHDSYIGSSLISFYAKFGCIHLGRKVFDTMLKRNVVPWTT
Sbjct: 61  NLNLFSHGLSLHQSVVVNGLSHDSYIGSSLISFYAKFGCIHLGRKVFDTMLKRNVVPWTT 120

Query: 121 IIGSYSREGDIDIAFSMFKQMRESGIQPTSVTLLSLLPGISKLPLLLCLHCLIILHGFES 180
           IIGSYSREGDIDIAFSMFKQMRESGIQPTSVTLLSLLPGISKLPLLLCLHCLI L+GFES
Sbjct: 121 IIGSYSREGDIDIAFSMFKQMRESGIQPTSVTLLSLLPGISKLPLLLCLHCLIFLYGFES 180

Query: 181 DLALSNSMVNMYGKCGRIADARRLFESIDCRDIVSWNSLLSAYSKIGATEEILQLLQAMK 240
           DLALSNSMVNMYGKCGRIADAR LFESID RDIVSWNSLLSAYSKIGATEEILQL+QAMK
Sbjct: 181 DLALSNSMVNMYGKCGRIADARSLFESIDYRDIVSWNSLLSAYSKIGATEEILQLVQAMK 240

Query: 241 IEDIKPDKQTFCSALSASAIKGDLRLGKLVHGLMLKDGLNIDQHVESALVVLYLRCRCLD 300
           IEDIKPDKQTFCSALSASAIKGDLRLGKLVHGLMLKDGLNIDQHVESALVVLYLRCRCLD
Sbjct: 241 IEDIKPDKQTFCSALSASAIKGDLRLGKLVHGLMLKDGLNIDQHVESALVVLYLRCRCLD 300

Query: 301 PAYKVFKSTTEKDVVMWTAMISGLVQNDCADKALGVFYQMIESNVKPSTATLASGLAACA 360
            A+KVFKSTTEKDVVMWTAMISGLVQNDCADKALGVFYQMIESNV+PSTATLAS LAACA
Sbjct: 301 LAHKVFKSTTEKDVVMWTAMISGLVQNDCADKALGVFYQMIESNVQPSTATLASALAACA 360

Query: 361 QLGCCDIGASIHGYVLRQGIMLDIPAQNSLVTMYAKCNKLQQSCSIFNKMVEKDLVSWNA 420
           QLGCCDIGASIHGYVLRQGIMLDIPAQNSLVTMYAKCNKLQQSCSIFNKMVEKD+VSWNA
Sbjct: 361 QLGCCDIGASIHGYVLRQGIMLDIPAQNSLVTMYAKCNKLQQSCSIFNKMVEKDVVSWNA 420

Query: 421 IVAGHAKNGYLSKGIFFFNEMRKSFLRPDSITVTSLLQACGSAGALCQGKWIHNFVLRSS 480
           IVAG+AKNGYLSK IFFFNEMR SF RPDSITVTSLLQACGSAGALCQGKWIHNFVLRSS
Sbjct: 421 IVAGNAKNGYLSKAIFFFNEMRTSFQRPDSITVTSLLQACGSAGALCQGKWIHNFVLRSS 480

Query: 481 LIPCIMTETALVDMYFKCGNLENAQKCFDCMLQRDLVAWSTLIVGYGFNGKGEIALRKYS 540
           LIPCIMTETALVDMYFKCGNLENAQKCFDCM QRDLVAWSTLIVGYGFNGKGEIALRKYS
Sbjct: 481 LIPCIMTETALVDMYFKCGNLENAQKCFDCMSQRDLVAWSTLIVGYGFNGKGEIALRKYS 540

Query: 541 EFLGTGMEPNHVIFISVLSACSHGGLISKGLSIYESMTKDFRMSPNLEHRACVVDLLSRA 600
           EFLGTGMEPNHVIFISVLSACSH GLIS+GLSIYESMTKDFRM PNLEHRAC+VDLLSRA
Sbjct: 541 EFLGTGMEPNHVIFISVLSACSHSGLISQGLSIYESMTKDFRMPPNLEHRACIVDLLSRA 600

Query: 601 GKVDEAYSFYKTMFKEPSIVVLGMLLDACRVNGRVELGKVIARDMFELKPVDPGNFVQLA 660
           GKVDEAYSFYK MFKEPS+VVLG LLDACRVNG VELGKVIARDMFELKPVDPGNFVQLA
Sbjct: 601 GKVDEAYSFYKMMFKEPSMVVLGTLLDACRVNGSVELGKVIARDMFELKPVDPGNFVQLA 660

Query: 661 NSYASMSRWDGVEKAWTQMRSLGLKKYPGWSSIEVHGTTFTFFASHNSHPKIEKIILTVK 720
           NSYASM+RWDGVEKAWTQMRSLGLKK+PGWSSIE+HGTTFTFFA+HNSHPKIEKIILTVK
Sbjct: 661 NSYASMNRWDGVEKAWTQMRSLGLKKFPGWSSIELHGTTFTFFAAHNSHPKIEKIILTVK 720

Query: 721 ALSKNIRNLYVKNEICEDFVEN 743
           ALSK+IRNLY+KNEICEDFVEN
Sbjct: 721 ALSKDIRNLYIKNEICEDFVEN 742

BLAST of CSPI01G31740 vs. ExPASy TrEMBL
Match: A0A5D3CX33 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold1610G00070 PE=4 SV=1)

HSP 1 Score: 1360.5 bits (3520), Expect = 0.0e+00
Identity = 672/703 (95.59%), Postives = 687/703 (97.72%), Query Frame = 0

Query: 40  MQKTHTQLDAYTFPSLFKACTNLNLFSHGLSLHQSVVVNGLSHDSYIGSSLISFYAKFGC 99
           MQKTHT  DAYTFPSLFKACTNLNLFSHGLSLHQSVVVNGLSHDSYIGSSLISFYAKFGC
Sbjct: 1   MQKTHTPSDAYTFPSLFKACTNLNLFSHGLSLHQSVVVNGLSHDSYIGSSLISFYAKFGC 60

Query: 100 IHLGRKVFDTMLKRNVVPWTTIIGSYSREGDIDIAFSMFKQMRESGIQPTSVTLLSLLPG 159
           IHLGRKVFDTMLKRNVVPWTTIIGSYSREGDIDIAFSMFKQMRESGIQPTSVTLLSLLPG
Sbjct: 61  IHLGRKVFDTMLKRNVVPWTTIIGSYSREGDIDIAFSMFKQMRESGIQPTSVTLLSLLPG 120

Query: 160 ISKLPLLLCLHCLIILHGFESDLALSNSMVNMYGKCGRIADARRLFESIDCRDIVSWNSL 219
           ISKLPLLLCLHCLI L+GFESDLALSNSMVNMYGKCGRIADAR LFESID RDIVSWNSL
Sbjct: 121 ISKLPLLLCLHCLIFLYGFESDLALSNSMVNMYGKCGRIADARSLFESIDYRDIVSWNSL 180

Query: 220 LSAYSKIGATEEILQLLQAMKIEDIKPDKQTFCSALSASAIKGDLRLGKLVHGLMLKDGL 279
           LSAYSKIGATEEILQL+QAMKIEDIKPDKQTFCSALSASAIKGDLRLGKLVHGLMLKDGL
Sbjct: 181 LSAYSKIGATEEILQLVQAMKIEDIKPDKQTFCSALSASAIKGDLRLGKLVHGLMLKDGL 240

Query: 280 NIDQHVESALVVLYLRCRCLDPAYKVFKSTTEKDVVMWTAMISGLVQNDCADKALGVFYQ 339
           NIDQHVESALVVLYLRCRCLD A+KVFKSTTEKDVVMWTAMISGLVQNDCADKALGVFYQ
Sbjct: 241 NIDQHVESALVVLYLRCRCLDLAHKVFKSTTEKDVVMWTAMISGLVQNDCADKALGVFYQ 300

Query: 340 MIESNVKPSTATLASGLAACAQLGCCDIGASIHGYVLRQGIMLDIPAQNSLVTMYAKCNK 399
           MIESNV+PSTATLAS LAACAQLGCCDIGASIHGYVLRQGIMLDIPAQNSLVTMYAKCNK
Sbjct: 301 MIESNVQPSTATLASALAACAQLGCCDIGASIHGYVLRQGIMLDIPAQNSLVTMYAKCNK 360

Query: 400 LQQSCSIFNKMVEKDLVSWNAIVAGHAKNGYLSKGIFFFNEMRKSFLRPDSITVTSLLQA 459
           LQQSCSIFNKMVEKD+VSWNAIVAG+AKNGYLSK IFFFNEMR SF RPDSITVTSLLQA
Sbjct: 361 LQQSCSIFNKMVEKDVVSWNAIVAGNAKNGYLSKAIFFFNEMRTSFQRPDSITVTSLLQA 420

Query: 460 CGSAGALCQGKWIHNFVLRSSLIPCIMTETALVDMYFKCGNLENAQKCFDCMLQRDLVAW 519
           CGSAGALCQGKWIHNFVLRSSLIPCIMTETALVDMYFKCGNLENAQKCFDCM QRDLVAW
Sbjct: 421 CGSAGALCQGKWIHNFVLRSSLIPCIMTETALVDMYFKCGNLENAQKCFDCMSQRDLVAW 480

Query: 520 STLIVGYGFNGKGEIALRKYSEFLGTGMEPNHVIFISVLSACSHGGLISKGLSIYESMTK 579
           STLIVGYGFNGKGEIALRKYSEFLGTGMEPNHVIFISVLSACSH GLIS+GLSIYESMTK
Sbjct: 481 STLIVGYGFNGKGEIALRKYSEFLGTGMEPNHVIFISVLSACSHSGLISQGLSIYESMTK 540

Query: 580 DFRMSPNLEHRACVVDLLSRAGKVDEAYSFYKTMFKEPSIVVLGMLLDACRVNGRVELGK 639
           DFRM PNLEHRAC+VDLLSRAGKVDEAYSFYK MFKEPS+VVLG LLDACRVNG VELGK
Sbjct: 541 DFRMPPNLEHRACIVDLLSRAGKVDEAYSFYKMMFKEPSMVVLGTLLDACRVNGSVELGK 600

Query: 640 VIARDMFELKPVDPGNFVQLANSYASMSRWDGVEKAWTQMRSLGLKKYPGWSSIEVHGTT 699
           VIARDMFELKPVDPGNFVQLANSYASM+RWDGVEKAWTQMRSLGLKK+PGWSSIE+HGTT
Sbjct: 601 VIARDMFELKPVDPGNFVQLANSYASMNRWDGVEKAWTQMRSLGLKKFPGWSSIELHGTT 660

Query: 700 FTFFASHNSHPKIEKIILTVKALSKNIRNLYVKNEICEDFVEN 743
           FTFFA+HNSHPKIEKIILTVKALSK+IRNLY+KNEICEDFVEN
Sbjct: 661 FTFFAAHNSHPKIEKIILTVKALSKDIRNLYIKNEICEDFVEN 703

BLAST of CSPI01G31740 vs. ExPASy TrEMBL
Match: A0A5A7V033 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold542G00680 PE=4 SV=1)

HSP 1 Score: 1357.8 bits (3513), Expect = 0.0e+00
Identity = 671/703 (95.45%), Postives = 685/703 (97.44%), Query Frame = 0

Query: 40  MQKTHTQLDAYTFPSLFKACTNLNLFSHGLSLHQSVVVNGLSHDSYIGSSLISFYAKFGC 99
           MQKTHT  DAYTFPSLFKACTNLNLFSHGLSLHQSVVVNGLSHDSYIGSSLI+FYAKFGC
Sbjct: 1   MQKTHTPSDAYTFPSLFKACTNLNLFSHGLSLHQSVVVNGLSHDSYIGSSLITFYAKFGC 60

Query: 100 IHLGRKVFDTMLKRNVVPWTTIIGSYSREGDIDIAFSMFKQMRESGIQPTSVTLLSLLPG 159
           IHLGRKVFDTMLKRNVVPWTTIIGSYSREGDIDIAFSMFKQMRESGIQPTSVTLLSLLPG
Sbjct: 61  IHLGRKVFDTMLKRNVVPWTTIIGSYSREGDIDIAFSMFKQMRESGIQPTSVTLLSLLPG 120

Query: 160 ISKLPLLLCLHCLIILHGFESDLALSNSMVNMYGKCGRIADARRLFESIDCRDIVSWNSL 219
           ISKLPLLLCLHCLI L+GFESDLALSNSMVNMYGKCGRIADAR LFESID RDIVSWNSL
Sbjct: 121 ISKLPLLLCLHCLIFLYGFESDLALSNSMVNMYGKCGRIADARSLFESIDYRDIVSWNSL 180

Query: 220 LSAYSKIGATEEILQLLQAMKIEDIKPDKQTFCSALSASAIKGDLRLGKLVHGLMLKDGL 279
           LSAYSKIGATEEILQL+QAMKIEDIKPDKQTFCSALSASAIKGDLRLGKLVHGLMLKDGL
Sbjct: 181 LSAYSKIGATEEILQLVQAMKIEDIKPDKQTFCSALSASAIKGDLRLGKLVHGLMLKDGL 240

Query: 280 NIDQHVESALVVLYLRCRCLDPAYKVFKSTTEKDVVMWTAMISGLVQNDCADKALGVFYQ 339
           NIDQHVESALVVLYLRCRCLD A+KVFKSTTEKDVVMWTAMISGLVQNDCADKALGVFYQ
Sbjct: 241 NIDQHVESALVVLYLRCRCLDLAHKVFKSTTEKDVVMWTAMISGLVQNDCADKALGVFYQ 300

Query: 340 MIESNVKPSTATLASGLAACAQLGCCDIGASIHGYVLRQGIMLDIPAQNSLVTMYAKCNK 399
           MIESNV+PSTATLAS LAACAQLGCCDIGASIHGYVLRQGIMLDIPAQNSLVTMYAKCNK
Sbjct: 301 MIESNVQPSTATLASALAACAQLGCCDIGASIHGYVLRQGIMLDIPAQNSLVTMYAKCNK 360

Query: 400 LQQSCSIFNKMVEKDLVSWNAIVAGHAKNGYLSKGIFFFNEMRKSFLRPDSITVTSLLQA 459
           LQQSCSIFNKMVEKD+VSWNAIVAG+AKNGYLSK IFFFNEMR SF RPDSITVTSLLQA
Sbjct: 361 LQQSCSIFNKMVEKDVVSWNAIVAGNAKNGYLSKAIFFFNEMRTSFQRPDSITVTSLLQA 420

Query: 460 CGSAGALCQGKWIHNFVLRSSLIPCIMTETALVDMYFKCGNLENAQKCFDCMLQRDLVAW 519
           CGSAGALCQGKWIHNFVLRSSLIPCIMTETALVDMYFKCGNLENAQKCFDCM QRDLVAW
Sbjct: 421 CGSAGALCQGKWIHNFVLRSSLIPCIMTETALVDMYFKCGNLENAQKCFDCMSQRDLVAW 480

Query: 520 STLIVGYGFNGKGEIALRKYSEFLGTGMEPNHVIFISVLSACSHGGLISKGLSIYESMTK 579
           STLIVGYGFNGKGEIALRKYSEFLG GMEPNHVIFISVLSACSH GLIS+GLSIYESMTK
Sbjct: 481 STLIVGYGFNGKGEIALRKYSEFLGAGMEPNHVIFISVLSACSHSGLISQGLSIYESMTK 540

Query: 580 DFRMSPNLEHRACVVDLLSRAGKVDEAYSFYKTMFKEPSIVVLGMLLDACRVNGRVELGK 639
           DFRM PNLEHRACVVDLLSRAGKVDEAYSFYK MFKEPS+VVLG LLDACRVNG VELGK
Sbjct: 541 DFRMPPNLEHRACVVDLLSRAGKVDEAYSFYKMMFKEPSMVVLGTLLDACRVNGSVELGK 600

Query: 640 VIARDMFELKPVDPGNFVQLANSYASMSRWDGVEKAWTQMRSLGLKKYPGWSSIEVHGTT 699
           VIARDMFELKPVDPGNFVQLANSYASM+RWDGVEKAWTQMRSLGLKKYPGWSSIE+HGTT
Sbjct: 601 VIARDMFELKPVDPGNFVQLANSYASMNRWDGVEKAWTQMRSLGLKKYPGWSSIELHGTT 660

Query: 700 FTFFASHNSHPKIEKIILTVKALSKNIRNLYVKNEICEDFVEN 743
           FTFFA+HNSHPKIEKIILTVKALSK+IRN Y+KNEICEDFVEN
Sbjct: 661 FTFFAAHNSHPKIEKIILTVKALSKDIRNFYIKNEICEDFVEN 703

BLAST of CSPI01G31740 vs. ExPASy TrEMBL
Match: A0A6J1E522 (pentatricopeptide repeat-containing protein At4g04370 OS=Momordica charantia OX=3673 GN=LOC111026115 PE=4 SV=1)

HSP 1 Score: 1213.7 bits (3139), Expect = 0.0e+00
Identity = 593/731 (81.12%), Postives = 660/731 (90.29%), Query Frame = 0

Query: 9   IAHGCTKSFNSLVSRLSYQGAHHQVLQTYISMQKTHTQLDAYTFPSLFKACTNLNLFSHG 68
           IA+G TKSFNS+++RLS QGAHHQVLQTY SMQKT T  DAYTFPSL KACT LNLF  G
Sbjct: 16  IANGSTKSFNSIINRLSSQGAHHQVLQTYASMQKTSTPPDAYTFPSLLKACTILNLFLDG 75

Query: 69  LSLHQSVVVNGLSHDSYIGSSLISFYAKFGCIHLGRKVFDTMLKRNVVPWTTIIGSYSRE 128
           LSLHQS++VNG S DSYIGSSLISFYAKFGCI +GRKVFD M +RNVVPWTTIIG YSRE
Sbjct: 76  LSLHQSIIVNGFSLDSYIGSSLISFYAKFGCIDIGRKVFDIMPERNVVPWTTIIGCYSRE 135

Query: 129 GDIDIAFSMFKQMRESGIQPTSVTLLSLLPGISKLPLLLCLHCLIILHGFESDLALSNSM 188
           G+ID+AFSMFKQMR +GIQPTSVTLLSLLP IS+LPLL CLHC IIL+GFES+L+LSNSM
Sbjct: 136 GEIDVAFSMFKQMRATGIQPTSVTLLSLLPSISELPLLQCLHCWIILYGFESNLSLSNSM 195

Query: 189 VNMYGKCGRIADARRLFESIDCRDIVSWNSLLSAYSKIGATEEILQLLQAMKIEDIKPDK 248
           VN+YG+CG I DAR LFES+D RDIVSWNSLLSAYSKIG  EEILQL+  M+ EDIKPDK
Sbjct: 196 VNVYGRCGSIEDARSLFESMDYRDIVSWNSLLSAYSKIGVIEEILQLVLGMRTEDIKPDK 255

Query: 249 QTFCSALSASAIKGDLRLGKLVHGLMLKDGLNIDQHVESALVVLYLRCRCLDPAYKVFKS 308
           QTFCSALSASAIKGD+RLGKLVHGL++KDGL IDQ VE+AL+VLYLRC+ LD A KVFKS
Sbjct: 256 QTFCSALSASAIKGDIRLGKLVHGLIIKDGLGIDQQVETALMVLYLRCKSLDLALKVFKS 315

Query: 309 TTEKDVVMWTAMISGLVQNDCADKALGVFYQMIESNVKPSTATLASGLAACAQLGCCDIG 368
           TTEKD+V+WTAMISGLVQNDCADKAL VFYQM+ESN++P TATLAS LAACAQLGC DIG
Sbjct: 316 TTEKDMVLWTAMISGLVQNDCADKALRVFYQMLESNMEPGTATLASALAACAQLGCYDIG 375

Query: 369 ASIHGYVLRQGIMLDIPAQNSLVTMYAKCNKLQQSCSIFNKMVEKDLVSWNAIVAGHAKN 428
             IHGY+LRQGIMLDIPAQN+LVTMYAKCN+L+QSC IFNKMVE+DLVSWNAIVAGHAKN
Sbjct: 376 TLIHGYILRQGIMLDIPAQNALVTMYAKCNRLEQSCGIFNKMVERDLVSWNAIVAGHAKN 435

Query: 429 GYLSKGIFFFNEMRKSFLRPDSITVTSLLQACGSAGALCQGKWIHNFVLRSSLIPCIMTE 488
           GYLSK I FFNEMR S  RPDSITVTSLLQACGSAGAL QGKWIHNFV RSSL+PCIM E
Sbjct: 436 GYLSKAILFFNEMRTSLQRPDSITVTSLLQACGSAGALWQGKWIHNFVFRSSLMPCIMIE 495

Query: 489 TALVDMYFKCGNLENAQKCFDCMLQRDLVAWSTLIVGYGFNGKGEIALRKYSEFLGTGME 548
           TAL+DMYFKCGNLE AQKCFD M  +DLV WSTLI GYGFNG GEIALRKYSEFLGTG+E
Sbjct: 496 TALIDMYFKCGNLEIAQKCFDYMPHQDLVTWSTLISGYGFNGNGEIALRKYSEFLGTGLE 555

Query: 549 PNHVIFISVLSACSHGGLISKGLSIYESMTKDFRMSPNLEHRACVVDLLSRAGKVDEAYS 608
           PNHVIF+SVLSACSH GL+++GL IYESMT+DF M PNLEHRAC+VDLLSRAGKV+EAYS
Sbjct: 556 PNHVIFLSVLSACSHSGLVNQGLRIYESMTRDFLMPPNLEHRACIVDLLSRAGKVEEAYS 615

Query: 609 FYKTMFKEPSIVVLGMLLDACRVNGRVELGKVIARDMFELKPVDPGNFVQLANSYASMSR 668
           FYK MF+EPSI VLG+LLDACRVNG VELG+ IARD+F LKPVDPGN+VQLA+SYASM R
Sbjct: 616 FYKMMFQEPSIDVLGILLDACRVNGSVELGEAIARDIFALKPVDPGNYVQLAHSYASMGR 675

Query: 669 WDGVEKAWTQMRSLGLKKYPGWSSIEVHGTTFTFFASHNSHPKIEKIILTVKALSKNIRN 728
           WDGVE+AWTQMRSLGLKK PGWSSIEVHGT+F+F++ HNSHPKIE+I+LTVK+LS +IR 
Sbjct: 676 WDGVEEAWTQMRSLGLKKLPGWSSIEVHGTSFSFYSVHNSHPKIEEIMLTVKSLSNDIRK 735

Query: 729 LYVKNEICEDF 740
           ++++NEI +DF
Sbjct: 736 MHIENEINKDF 746

BLAST of CSPI01G31740 vs. NCBI nr
Match: XP_004139152.1 (pentatricopeptide repeat-containing protein At4g04370 [Cucumis sativus])

HSP 1 Score: 1491.1 bits (3859), Expect = 0.0e+00
Identity = 739/743 (99.46%), Postives = 740/743 (99.60%), Query Frame = 0

Query: 1   MSGLIHESIAHGCTKSFNSLVSRLSYQGAHHQVLQTYISMQKTHTQLDAYTFPSLFKACT 60
           MSG IHESIAHGCTKSFNSLVSRLSYQGAHHQVLQTYISMQKTHTQLDAYTFPSLFKACT
Sbjct: 1   MSGFIHESIAHGCTKSFNSLVSRLSYQGAHHQVLQTYISMQKTHTQLDAYTFPSLFKACT 60

Query: 61  NLNLFSHGLSLHQSVVVNGLSHDSYIGSSLISFYAKFGCIHLGRKVFDTMLKRNVVPWTT 120
           NLNLFSHGLSLHQSVVVNGLSHDSYIGSSLISFYAKFGCIHLGRKVFDTMLKRNVVPWTT
Sbjct: 61  NLNLFSHGLSLHQSVVVNGLSHDSYIGSSLISFYAKFGCIHLGRKVFDTMLKRNVVPWTT 120

Query: 121 IIGSYSREGDIDIAFSMFKQMRESGIQPTSVTLLSLLPGISKLPLLLCLHCLIILHGFES 180
           IIGSYSREGDIDIAFSMFKQMRESGIQPTSVTLLSLLPGISKLPLLLCLHCLIILHGFES
Sbjct: 121 IIGSYSREGDIDIAFSMFKQMRESGIQPTSVTLLSLLPGISKLPLLLCLHCLIILHGFES 180

Query: 181 DLALSNSMVNMYGKCGRIADARRLFESIDCRDIVSWNSLLSAYSKIGATEEILQLLQAMK 240
           DLALSNSMVNMYGKCGRIADARRLF+SIDCRDIVSWNSLLSAYSKIGATEEILQLLQAMK
Sbjct: 181 DLALSNSMVNMYGKCGRIADARRLFQSIDCRDIVSWNSLLSAYSKIGATEEILQLLQAMK 240

Query: 241 IEDIKPDKQTFCSALSASAIKGDLRLGKLVHGLMLKDGLNIDQHVESALVVLYLRCRCLD 300
           IEDIKPDKQTFCSALSASAIKGDLRLGKLVHGLMLKDGLNIDQHVESALVVLYLRCRCLD
Sbjct: 241 IEDIKPDKQTFCSALSASAIKGDLRLGKLVHGLMLKDGLNIDQHVESALVVLYLRCRCLD 300

Query: 301 PAYKVFKSTTEKDVVMWTAMISGLVQNDCADKALGVFYQMIESNVKPSTATLASGLAACA 360
           PAYKVFKSTTEKDVVMWTAMISGLVQNDCADKALGVFYQMIESNVKPSTATLASGLAACA
Sbjct: 301 PAYKVFKSTTEKDVVMWTAMISGLVQNDCADKALGVFYQMIESNVKPSTATLASGLAACA 360

Query: 361 QLGCCDIGASIHGYVLRQGIMLDIPAQNSLVTMYAKCNKLQQSCSIFNKMVEKDLVSWNA 420
           QLGCCDIGASIHGYVLRQGIMLDIPAQNSLVTMYAKCNKLQQSCSIFNKMVEKDLVSWNA
Sbjct: 361 QLGCCDIGASIHGYVLRQGIMLDIPAQNSLVTMYAKCNKLQQSCSIFNKMVEKDLVSWNA 420

Query: 421 IVAGHAKNGYLSKGIFFFNEMRKSFLRPDSITVTSLLQACGSAGALCQGKWIHNFVLRSS 480
           IVAGHAKNGYLSKGIFFFNEMRKSFLRPDSITVTSLLQACGSAGALCQGKWIHNFVLRSS
Sbjct: 421 IVAGHAKNGYLSKGIFFFNEMRKSFLRPDSITVTSLLQACGSAGALCQGKWIHNFVLRSS 480

Query: 481 LIPCIMTETALVDMYFKCGNLENAQKCFDCMLQRDLVAWSTLIVGYGFNGKGEIALRKYS 540
           LIPCIMTETALVDMYFKCGNLENAQKCFDCMLQRDLVAWSTLIVGYGFNGKGEIALRKYS
Sbjct: 481 LIPCIMTETALVDMYFKCGNLENAQKCFDCMLQRDLVAWSTLIVGYGFNGKGEIALRKYS 540

Query: 541 EFLGTGMEPNHVIFISVLSACSHGGLISKGLSIYESMTKDFRMSPNLEHRACVVDLLSRA 600
           EFLGTGMEPNHVIFISVLSACSHGGLISKGLSIYESMTKDFRMSPNLEHRACVVDLLSRA
Sbjct: 541 EFLGTGMEPNHVIFISVLSACSHGGLISKGLSIYESMTKDFRMSPNLEHRACVVDLLSRA 600

Query: 601 GKVDEAYSFYKTMFKEPSIVVLGMLLDACRVNGRVELGKVIARDMFELKPVDPGNFVQLA 660
           GKVDEAYSFYK MFKEPSIVVLGMLLDACRVNGRVELGKVIARDMFELKPVDPGNFVQLA
Sbjct: 601 GKVDEAYSFYKMMFKEPSIVVLGMLLDACRVNGRVELGKVIARDMFELKPVDPGNFVQLA 660

Query: 661 NSYASMSRWDGVEKAWTQMRSLGLKKYPGWSSIEVHGTTFTFFASHNSHPKIEKIILTVK 720
           NSYASMSRWDGVEKAWTQMRSLGLKKYPGWSSIEVHGTTFTFFASHNSHPKIEKIILTVK
Sbjct: 661 NSYASMSRWDGVEKAWTQMRSLGLKKYPGWSSIEVHGTTFTFFASHNSHPKIEKIILTVK 720

Query: 721 ALSKNIRNLYVKNEICEDFVENS 744
           ALSKNIRNLYVKNEICEDFVE S
Sbjct: 721 ALSKNIRNLYVKNEICEDFVEYS 743

BLAST of CSPI01G31740 vs. NCBI nr
Match: XP_016899786.1 (PREDICTED: pentatricopeptide repeat-containing protein At4g04370 [Cucumis melo])

HSP 1 Score: 1426.4 bits (3691), Expect = 0.0e+00
Identity = 708/742 (95.42%), Postives = 723/742 (97.44%), Query Frame = 0

Query: 1   MSGLIHESIAHGCTKSFNSLVSRLSYQGAHHQVLQTYISMQKTHTQLDAYTFPSLFKACT 60
           MS LIHESIAHG TKSFNSLVSRLS QGAHHQVLQTYISMQKTHT  DAYTFPSLFKACT
Sbjct: 1   MSRLIHESIAHGSTKSFNSLVSRLSSQGAHHQVLQTYISMQKTHTPSDAYTFPSLFKACT 60

Query: 61  NLNLFSHGLSLHQSVVVNGLSHDSYIGSSLISFYAKFGCIHLGRKVFDTMLKRNVVPWTT 120
           NLNLFSHGLSLHQSVVVNGLSHDSYIGSSLISFYAKFGCIHLGRKVFDTMLKRNVVPWTT
Sbjct: 61  NLNLFSHGLSLHQSVVVNGLSHDSYIGSSLISFYAKFGCIHLGRKVFDTMLKRNVVPWTT 120

Query: 121 IIGSYSREGDIDIAFSMFKQMRESGIQPTSVTLLSLLPGISKLPLLLCLHCLIILHGFES 180
           IIGSYSREGDIDIAFSMFKQMRESGIQPTSVTLLSLLPGISKLPLLLCLHCLI L+GFES
Sbjct: 121 IIGSYSREGDIDIAFSMFKQMRESGIQPTSVTLLSLLPGISKLPLLLCLHCLIFLYGFES 180

Query: 181 DLALSNSMVNMYGKCGRIADARRLFESIDCRDIVSWNSLLSAYSKIGATEEILQLLQAMK 240
           DLALSNSMVNMYGKCGRIADAR LFESID RDIVSWNSLLSAYSKIGATEEILQL+QAMK
Sbjct: 181 DLALSNSMVNMYGKCGRIADARSLFESIDYRDIVSWNSLLSAYSKIGATEEILQLVQAMK 240

Query: 241 IEDIKPDKQTFCSALSASAIKGDLRLGKLVHGLMLKDGLNIDQHVESALVVLYLRCRCLD 300
           IEDIKPDKQTFCSALSASAIKGDLRLGKLVHGLMLKDGLNIDQHVESALVVLYLRCRCLD
Sbjct: 241 IEDIKPDKQTFCSALSASAIKGDLRLGKLVHGLMLKDGLNIDQHVESALVVLYLRCRCLD 300

Query: 301 PAYKVFKSTTEKDVVMWTAMISGLVQNDCADKALGVFYQMIESNVKPSTATLASGLAACA 360
            A+KVFKSTTEKDVVMWTAMISGLVQNDCADKALGVFYQMIESNV+PSTATLAS LAACA
Sbjct: 301 LAHKVFKSTTEKDVVMWTAMISGLVQNDCADKALGVFYQMIESNVQPSTATLASALAACA 360

Query: 361 QLGCCDIGASIHGYVLRQGIMLDIPAQNSLVTMYAKCNKLQQSCSIFNKMVEKDLVSWNA 420
           QLGCCDIGASIHGYVLRQGIMLDIPAQNSLVTMYAKCNKLQQSCSIFNKMVEKD+VSWNA
Sbjct: 361 QLGCCDIGASIHGYVLRQGIMLDIPAQNSLVTMYAKCNKLQQSCSIFNKMVEKDVVSWNA 420

Query: 421 IVAGHAKNGYLSKGIFFFNEMRKSFLRPDSITVTSLLQACGSAGALCQGKWIHNFVLRSS 480
           IVAG+AKNGYLSK IFFFNEMR SF RPDSITVTSLLQACGSAGALCQGKWIHNFVLRSS
Sbjct: 421 IVAGNAKNGYLSKAIFFFNEMRTSFQRPDSITVTSLLQACGSAGALCQGKWIHNFVLRSS 480

Query: 481 LIPCIMTETALVDMYFKCGNLENAQKCFDCMLQRDLVAWSTLIVGYGFNGKGEIALRKYS 540
           LIPCIMTETALVDMYFKCGNLENAQKCFDCM QRDLVAWSTLIVGYGFNGKGEIALRKYS
Sbjct: 481 LIPCIMTETALVDMYFKCGNLENAQKCFDCMSQRDLVAWSTLIVGYGFNGKGEIALRKYS 540

Query: 541 EFLGTGMEPNHVIFISVLSACSHGGLISKGLSIYESMTKDFRMSPNLEHRACVVDLLSRA 600
           EFLGTGMEPNHVIFISVLSACSH GLIS+GLSIYESMTKDFRM PNLEHRAC+VDLLSRA
Sbjct: 541 EFLGTGMEPNHVIFISVLSACSHSGLISQGLSIYESMTKDFRMPPNLEHRACIVDLLSRA 600

Query: 601 GKVDEAYSFYKTMFKEPSIVVLGMLLDACRVNGRVELGKVIARDMFELKPVDPGNFVQLA 660
           GKVDEAYSFYK MFKEPS+VVLG LLDACRVNG VELGKVIARDMFELKPVDPGNFVQLA
Sbjct: 601 GKVDEAYSFYKMMFKEPSMVVLGTLLDACRVNGSVELGKVIARDMFELKPVDPGNFVQLA 660

Query: 661 NSYASMSRWDGVEKAWTQMRSLGLKKYPGWSSIEVHGTTFTFFASHNSHPKIEKIILTVK 720
           NSYASM+RWDGVEKAWTQMRSLGLKK+PGWSSIE+HGTTFTFFA+HNSHPKIEKIILTVK
Sbjct: 661 NSYASMNRWDGVEKAWTQMRSLGLKKFPGWSSIELHGTTFTFFAAHNSHPKIEKIILTVK 720

Query: 721 ALSKNIRNLYVKNEICEDFVEN 743
           ALSK+IRNLY+KNEICEDFVEN
Sbjct: 721 ALSKDIRNLYIKNEICEDFVEN 742

BLAST of CSPI01G31740 vs. NCBI nr
Match: TYK14769.1 (pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 1360.5 bits (3520), Expect = 0.0e+00
Identity = 672/703 (95.59%), Postives = 687/703 (97.72%), Query Frame = 0

Query: 40  MQKTHTQLDAYTFPSLFKACTNLNLFSHGLSLHQSVVVNGLSHDSYIGSSLISFYAKFGC 99
           MQKTHT  DAYTFPSLFKACTNLNLFSHGLSLHQSVVVNGLSHDSYIGSSLISFYAKFGC
Sbjct: 1   MQKTHTPSDAYTFPSLFKACTNLNLFSHGLSLHQSVVVNGLSHDSYIGSSLISFYAKFGC 60

Query: 100 IHLGRKVFDTMLKRNVVPWTTIIGSYSREGDIDIAFSMFKQMRESGIQPTSVTLLSLLPG 159
           IHLGRKVFDTMLKRNVVPWTTIIGSYSREGDIDIAFSMFKQMRESGIQPTSVTLLSLLPG
Sbjct: 61  IHLGRKVFDTMLKRNVVPWTTIIGSYSREGDIDIAFSMFKQMRESGIQPTSVTLLSLLPG 120

Query: 160 ISKLPLLLCLHCLIILHGFESDLALSNSMVNMYGKCGRIADARRLFESIDCRDIVSWNSL 219
           ISKLPLLLCLHCLI L+GFESDLALSNSMVNMYGKCGRIADAR LFESID RDIVSWNSL
Sbjct: 121 ISKLPLLLCLHCLIFLYGFESDLALSNSMVNMYGKCGRIADARSLFESIDYRDIVSWNSL 180

Query: 220 LSAYSKIGATEEILQLLQAMKIEDIKPDKQTFCSALSASAIKGDLRLGKLVHGLMLKDGL 279
           LSAYSKIGATEEILQL+QAMKIEDIKPDKQTFCSALSASAIKGDLRLGKLVHGLMLKDGL
Sbjct: 181 LSAYSKIGATEEILQLVQAMKIEDIKPDKQTFCSALSASAIKGDLRLGKLVHGLMLKDGL 240

Query: 280 NIDQHVESALVVLYLRCRCLDPAYKVFKSTTEKDVVMWTAMISGLVQNDCADKALGVFYQ 339
           NIDQHVESALVVLYLRCRCLD A+KVFKSTTEKDVVMWTAMISGLVQNDCADKALGVFYQ
Sbjct: 241 NIDQHVESALVVLYLRCRCLDLAHKVFKSTTEKDVVMWTAMISGLVQNDCADKALGVFYQ 300

Query: 340 MIESNVKPSTATLASGLAACAQLGCCDIGASIHGYVLRQGIMLDIPAQNSLVTMYAKCNK 399
           MIESNV+PSTATLAS LAACAQLGCCDIGASIHGYVLRQGIMLDIPAQNSLVTMYAKCNK
Sbjct: 301 MIESNVQPSTATLASALAACAQLGCCDIGASIHGYVLRQGIMLDIPAQNSLVTMYAKCNK 360

Query: 400 LQQSCSIFNKMVEKDLVSWNAIVAGHAKNGYLSKGIFFFNEMRKSFLRPDSITVTSLLQA 459
           LQQSCSIFNKMVEKD+VSWNAIVAG+AKNGYLSK IFFFNEMR SF RPDSITVTSLLQA
Sbjct: 361 LQQSCSIFNKMVEKDVVSWNAIVAGNAKNGYLSKAIFFFNEMRTSFQRPDSITVTSLLQA 420

Query: 460 CGSAGALCQGKWIHNFVLRSSLIPCIMTETALVDMYFKCGNLENAQKCFDCMLQRDLVAW 519
           CGSAGALCQGKWIHNFVLRSSLIPCIMTETALVDMYFKCGNLENAQKCFDCM QRDLVAW
Sbjct: 421 CGSAGALCQGKWIHNFVLRSSLIPCIMTETALVDMYFKCGNLENAQKCFDCMSQRDLVAW 480

Query: 520 STLIVGYGFNGKGEIALRKYSEFLGTGMEPNHVIFISVLSACSHGGLISKGLSIYESMTK 579
           STLIVGYGFNGKGEIALRKYSEFLGTGMEPNHVIFISVLSACSH GLIS+GLSIYESMTK
Sbjct: 481 STLIVGYGFNGKGEIALRKYSEFLGTGMEPNHVIFISVLSACSHSGLISQGLSIYESMTK 540

Query: 580 DFRMSPNLEHRACVVDLLSRAGKVDEAYSFYKTMFKEPSIVVLGMLLDACRVNGRVELGK 639
           DFRM PNLEHRAC+VDLLSRAGKVDEAYSFYK MFKEPS+VVLG LLDACRVNG VELGK
Sbjct: 541 DFRMPPNLEHRACIVDLLSRAGKVDEAYSFYKMMFKEPSMVVLGTLLDACRVNGSVELGK 600

Query: 640 VIARDMFELKPVDPGNFVQLANSYASMSRWDGVEKAWTQMRSLGLKKYPGWSSIEVHGTT 699
           VIARDMFELKPVDPGNFVQLANSYASM+RWDGVEKAWTQMRSLGLKK+PGWSSIE+HGTT
Sbjct: 601 VIARDMFELKPVDPGNFVQLANSYASMNRWDGVEKAWTQMRSLGLKKFPGWSSIELHGTT 660

Query: 700 FTFFASHNSHPKIEKIILTVKALSKNIRNLYVKNEICEDFVEN 743
           FTFFA+HNSHPKIEKIILTVKALSK+IRNLY+KNEICEDFVEN
Sbjct: 661 FTFFAAHNSHPKIEKIILTVKALSKDIRNLYIKNEICEDFVEN 703

BLAST of CSPI01G31740 vs. NCBI nr
Match: KAA0060187.1 (pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 1357.8 bits (3513), Expect = 0.0e+00
Identity = 671/703 (95.45%), Postives = 685/703 (97.44%), Query Frame = 0

Query: 40  MQKTHTQLDAYTFPSLFKACTNLNLFSHGLSLHQSVVVNGLSHDSYIGSSLISFYAKFGC 99
           MQKTHT  DAYTFPSLFKACTNLNLFSHGLSLHQSVVVNGLSHDSYIGSSLI+FYAKFGC
Sbjct: 1   MQKTHTPSDAYTFPSLFKACTNLNLFSHGLSLHQSVVVNGLSHDSYIGSSLITFYAKFGC 60

Query: 100 IHLGRKVFDTMLKRNVVPWTTIIGSYSREGDIDIAFSMFKQMRESGIQPTSVTLLSLLPG 159
           IHLGRKVFDTMLKRNVVPWTTIIGSYSREGDIDIAFSMFKQMRESGIQPTSVTLLSLLPG
Sbjct: 61  IHLGRKVFDTMLKRNVVPWTTIIGSYSREGDIDIAFSMFKQMRESGIQPTSVTLLSLLPG 120

Query: 160 ISKLPLLLCLHCLIILHGFESDLALSNSMVNMYGKCGRIADARRLFESIDCRDIVSWNSL 219
           ISKLPLLLCLHCLI L+GFESDLALSNSMVNMYGKCGRIADAR LFESID RDIVSWNSL
Sbjct: 121 ISKLPLLLCLHCLIFLYGFESDLALSNSMVNMYGKCGRIADARSLFESIDYRDIVSWNSL 180

Query: 220 LSAYSKIGATEEILQLLQAMKIEDIKPDKQTFCSALSASAIKGDLRLGKLVHGLMLKDGL 279
           LSAYSKIGATEEILQL+QAMKIEDIKPDKQTFCSALSASAIKGDLRLGKLVHGLMLKDGL
Sbjct: 181 LSAYSKIGATEEILQLVQAMKIEDIKPDKQTFCSALSASAIKGDLRLGKLVHGLMLKDGL 240

Query: 280 NIDQHVESALVVLYLRCRCLDPAYKVFKSTTEKDVVMWTAMISGLVQNDCADKALGVFYQ 339
           NIDQHVESALVVLYLRCRCLD A+KVFKSTTEKDVVMWTAMISGLVQNDCADKALGVFYQ
Sbjct: 241 NIDQHVESALVVLYLRCRCLDLAHKVFKSTTEKDVVMWTAMISGLVQNDCADKALGVFYQ 300

Query: 340 MIESNVKPSTATLASGLAACAQLGCCDIGASIHGYVLRQGIMLDIPAQNSLVTMYAKCNK 399
           MIESNV+PSTATLAS LAACAQLGCCDIGASIHGYVLRQGIMLDIPAQNSLVTMYAKCNK
Sbjct: 301 MIESNVQPSTATLASALAACAQLGCCDIGASIHGYVLRQGIMLDIPAQNSLVTMYAKCNK 360

Query: 400 LQQSCSIFNKMVEKDLVSWNAIVAGHAKNGYLSKGIFFFNEMRKSFLRPDSITVTSLLQA 459
           LQQSCSIFNKMVEKD+VSWNAIVAG+AKNGYLSK IFFFNEMR SF RPDSITVTSLLQA
Sbjct: 361 LQQSCSIFNKMVEKDVVSWNAIVAGNAKNGYLSKAIFFFNEMRTSFQRPDSITVTSLLQA 420

Query: 460 CGSAGALCQGKWIHNFVLRSSLIPCIMTETALVDMYFKCGNLENAQKCFDCMLQRDLVAW 519
           CGSAGALCQGKWIHNFVLRSSLIPCIMTETALVDMYFKCGNLENAQKCFDCM QRDLVAW
Sbjct: 421 CGSAGALCQGKWIHNFVLRSSLIPCIMTETALVDMYFKCGNLENAQKCFDCMSQRDLVAW 480

Query: 520 STLIVGYGFNGKGEIALRKYSEFLGTGMEPNHVIFISVLSACSHGGLISKGLSIYESMTK 579
           STLIVGYGFNGKGEIALRKYSEFLG GMEPNHVIFISVLSACSH GLIS+GLSIYESMTK
Sbjct: 481 STLIVGYGFNGKGEIALRKYSEFLGAGMEPNHVIFISVLSACSHSGLISQGLSIYESMTK 540

Query: 580 DFRMSPNLEHRACVVDLLSRAGKVDEAYSFYKTMFKEPSIVVLGMLLDACRVNGRVELGK 639
           DFRM PNLEHRACVVDLLSRAGKVDEAYSFYK MFKEPS+VVLG LLDACRVNG VELGK
Sbjct: 541 DFRMPPNLEHRACVVDLLSRAGKVDEAYSFYKMMFKEPSMVVLGTLLDACRVNGSVELGK 600

Query: 640 VIARDMFELKPVDPGNFVQLANSYASMSRWDGVEKAWTQMRSLGLKKYPGWSSIEVHGTT 699
           VIARDMFELKPVDPGNFVQLANSYASM+RWDGVEKAWTQMRSLGLKKYPGWSSIE+HGTT
Sbjct: 601 VIARDMFELKPVDPGNFVQLANSYASMNRWDGVEKAWTQMRSLGLKKYPGWSSIELHGTT 660

Query: 700 FTFFASHNSHPKIEKIILTVKALSKNIRNLYVKNEICEDFVEN 743
           FTFFA+HNSHPKIEKIILTVKALSK+IRN Y+KNEICEDFVEN
Sbjct: 661 FTFFAAHNSHPKIEKIILTVKALSKDIRNFYIKNEICEDFVEN 703

BLAST of CSPI01G31740 vs. NCBI nr
Match: XP_038878475.1 (pentatricopeptide repeat-containing protein At4g04370 [Benincasa hispida] >XP_038878476.1 pentatricopeptide repeat-containing protein At4g04370 [Benincasa hispida])

HSP 1 Score: 1302.3 bits (3369), Expect = 0.0e+00
Identity = 640/740 (86.49%), Postives = 688/740 (92.97%), Query Frame = 0

Query: 4   LIHESIAHGCTKSFNSLVSRLSYQGAHHQVLQTYISMQKTHTQLDAYTFPSLFKACTNLN 63
           LIHE IAHG TKSFNSL++RLS QGAHHQVLQTYIS Q T+T  DAYTFPSL KACTNLN
Sbjct: 14  LIHEPIAHGSTKSFNSLLNRLSSQGAHHQVLQTYISFQNTNTPPDAYTFPSLLKACTNLN 73

Query: 64  LFSHGLSLHQSVVVNGLSHDSYIGSSLISFYAKFGCIHLGRKVFDTMLKRNVVPWTTIIG 123
           +FS+GLS+HQSV+VNGLSHDSYIGSSLISFYAKFGCIH GRKVFDTM +RNVVPWTT+IG
Sbjct: 74  MFSNGLSIHQSVIVNGLSHDSYIGSSLISFYAKFGCIHFGRKVFDTMPERNVVPWTTLIG 133

Query: 124 SYSREGDIDIAFSMFKQMRESGIQPTSVTLLSLLPGISKLPLLLCLHCLIILHGFESDLA 183
            YSR+GDIDIAFSMFKQMRESGIQPTSVT LSLLPGIS+LPLLLCLHCLI+L+GFESDLA
Sbjct: 134 CYSRQGDIDIAFSMFKQMRESGIQPTSVTFLSLLPGISELPLLLCLHCLIVLYGFESDLA 193

Query: 184 LSNSMVNMYGKCGRIADARRLFESIDCRDIVSWNSLLSAYSKIGATEEILQLLQAMKIED 243
           L NSMVNMYGKCG+I DAR LFES+D RD+VSWNSLLSAYSKIG  EEIL+ +Q M+IED
Sbjct: 194 LLNSMVNMYGKCGKIGDARSLFESMDYRDLVSWNSLLSAYSKIGGIEEILKFIQGMRIED 253

Query: 244 IKPDKQTFCSALSASAIKGDLRLGKLVHGLMLKDGLNIDQHVESALVVLYLRCRCLDPAY 303
           IKPDKQTFCSALSASAIKGDLR GKLVH L+LKDG +IDQ VE+AL+VLYLRCRCLD A+
Sbjct: 254 IKPDKQTFCSALSASAIKGDLRFGKLVHCLILKDGSDIDQQVETALIVLYLRCRCLDLAH 313

Query: 304 KVFKSTTEKDVVMWTAMISGLVQNDCADKALGVFYQMIESNVKPSTATLASGLAACAQLG 363
           +VFKSTTEKD V+WTAMISGLVQNDCADKALGVFYQMIESNV+PSTATLAS L+ACAQL 
Sbjct: 314 EVFKSTTEKDAVLWTAMISGLVQNDCADKALGVFYQMIESNVEPSTATLASALSACAQLV 373

Query: 364 CCDIGASIHGYVLRQGIMLDIPAQNSLVTMYAKCNKLQQSCSIFNKMVEKDLVSWNAIVA 423
           CCDIG SIHGYVLRQGI+LDIPAQNSLVTMYAKCNKL+QSCSIFN+MVEKDLVSWNAIVA
Sbjct: 374 CCDIGTSIHGYVLRQGILLDIPAQNSLVTMYAKCNKLEQSCSIFNEMVEKDLVSWNAIVA 433

Query: 424 GHAKNGYLSKGIFFFNEMRKSFLRPDSITVTSLLQACGSAGALCQGKWIHNFVLRSSLIP 483
           GHAKNGYLSK IFFFNEMR SF RPDSITVTSLLQACGSAGAL QGKWIHNFVLRSSLIP
Sbjct: 434 GHAKNGYLSKAIFFFNEMRTSFQRPDSITVTSLLQACGSAGALFQGKWIHNFVLRSSLIP 493

Query: 484 CIMTETALVDMYFKCGNLENAQKCFDCMLQRDLVAWSTLIVGYGFNGKGEIALRKYSEFL 543
           CIMTETALVDMYFKCGNL  AQKCFD MLQ+DLV WS LI GYGFNGKGEIALRKYSEFL
Sbjct: 494 CIMTETALVDMYFKCGNLGTAQKCFDYMLQKDLVTWSILIAGYGFNGKGEIALRKYSEFL 553

Query: 544 GTGMEPNHVIFISVLSACSHGGLISKGLSIYESMTKDFRMSPNLEHRACVVDLLSRAGKV 603
           GTGMEPNHVIF+SVLSACSH GLI +GL+IYESMTKDFRMSPNLEHRAC++DLLSRAGKV
Sbjct: 554 GTGMEPNHVIFLSVLSACSHSGLIIQGLNIYESMTKDFRMSPNLEHRACIIDLLSRAGKV 613

Query: 604 DEAYSFYKTMFKEPSIVVLGMLLDACRVNGRVELGKVIARDMFELKPVDPGNFVQLANSY 663
           DEAYSFYK MFKEP+I VLG+LLDACRVNG V+LGKVIARDMFELKPVD GNFVQLA+SY
Sbjct: 614 DEAYSFYKMMFKEPAIDVLGILLDACRVNGSVQLGKVIARDMFELKPVDAGNFVQLAHSY 673

Query: 664 ASMSRWDGVEKAWTQMRSLGLKKYPGWSSIEVHGTTFTFFASHNSHPKIEKIILTVKALS 723
           ASMSRWDGVE AWTQMRSLGLKK PGWSSIEVHGT+FTFF+ HNSHPKIE IILTVK+LS
Sbjct: 674 ASMSRWDGVEAAWTQMRSLGLKKLPGWSSIEVHGTSFTFFSVHNSHPKIEDIILTVKSLS 733

Query: 724 KNIRNLYVKNEICEDFVENS 744
           K+IR ++V+NEI EDFVE S
Sbjct: 734 KDIRKMHVENEIREDFVEIS 753

BLAST of CSPI01G31740 vs. TAIR 10
Match: AT4G04370.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 757.7 bits (1955), Expect = 8.9e-219
Identity = 371/724 (51.24%), Postives = 511/724 (70.58%), Query Frame = 0

Query: 4   LIHESIAHGCTKSFNSLVSRLSYQGAHHQVLQTYISMQKTHTQLDAYTFPSLFKACTNLN 63
           +I  S     TK FNS ++ LS  G H QVL T+ SM       D +TFPSL KAC +L 
Sbjct: 1   MIRTSSVLNSTKYFNSHINHLSSHGDHKQVLSTFSSMLANKLLPDTFTFPSLLKACASLQ 60

Query: 64  LFSHGLSLHQSVVVNGLSHDSYIGSSLISFYAKFGCIHLGRKVFDTMLKRNVVPWTTIIG 123
             S GLS+HQ V+VNG S D YI SSL++ YAKFG +   RKVF+ M +R+VV WT +IG
Sbjct: 61  RLSFGLSIHQQVLVNGFSSDFYISSSLVNLYAKFGLLAHARKVFEEMRERDVVHWTAMIG 120

Query: 124 SYSREGDIDIAFSMFKQMRESGIQPTSVTLLSLLPGISKLPLLLCLHCLIILHGFESDLA 183
            YSR G +  A S+  +MR  GI+P  VTLL +L G+ ++  L CLH   +++GF+ D+A
Sbjct: 121 CYSRAGIVGEACSLVNEMRFQGIKPGPVTLLEMLSGVLEITQLQCLHDFAVIYGFDCDIA 180

Query: 184 LSNSMVNMYGKCGRIADARRLFESIDCRDIVSWNSLLSAYSKIGATEEILQLLQAMKIED 243
           + NSM+N+Y KC  + DA+ LF+ ++ RD+VSWN+++S Y+ +G   EIL+LL  M+ + 
Sbjct: 181 VMNSMLNLYCKCDHVGDAKDLFDQMEQRDMVSWNTMISGYASVGNMSEILKLLYRMRGDG 240

Query: 244 IKPDKQTFCSALSASAIKGDLRLGKLVHGLMLKDGLNIDQHVESALVVLYLRCRCLDPAY 303
           ++PD+QTF ++LS S    DL +G+++H  ++K G ++D H+++AL+ +YL+C   + +Y
Sbjct: 241 LRPDQQTFGASLSVSGTMCDLEMGRMLHCQIVKTGFDVDMHLKTALITMYLKCGKEEASY 300

Query: 304 KVFKSTTEKDVVMWTAMISGLVQNDCADKALGVFYQMIESNVKPSTATLASGLAACAQLG 363
           +V ++   KDVV WT MISGL++   A+KAL VF +M++S    S+  +AS +A+CAQLG
Sbjct: 301 RVLETIPNKDVVCWTVMISGLMRLGRAEKALIVFSEMLQSGSDLSSEAIASVVASCAQLG 360

Query: 364 CCDIGASIHGYVLRQGIMLDIPAQNSLVTMYAKCNKLQQSCSIFNKMVEKDLVSWNAIVA 423
             D+GAS+HGYVLR G  LD PA NSL+TMYAKC  L +S  IF +M E+DLVSWNAI++
Sbjct: 361 SFDLGASVHGYVLRHGYTLDTPALNSLITMYAKCGHLDKSLVIFERMNERDLVSWNAIIS 420

Query: 424 GHAKNGYLSKGIFFFNEMR-KSFLRPDSITVTSLLQACGSAGALCQGKWIHNFVLRSSLI 483
           G+A+N  L K +  F EM+ K+  + DS TV SLLQAC SAGAL  GK IH  V+RS + 
Sbjct: 421 GYAQNVDLCKALLLFEEMKFKTVQQVDSFTVVSLLQACSSAGALPVGKLIHCIVIRSFIR 480

Query: 484 PCIMTETALVDMYFKCGNLENAQKCFDCMLQRDLVAWSTLIVGYGFNGKGEIALRKYSEF 543
           PC + +TALVDMY KCG LE AQ+CFD +  +D+V+W  LI GYGF+GKG+IAL  YSEF
Sbjct: 481 PCSLVDTALVDMYSKCGYLEAAQRCFDSISWKDVVSWGILIAGYGFHGKGDIALEIYSEF 540

Query: 544 LGTGMEPNHVIFISVLSACSHGGLISKGLSIYESMTKDFRMSPNLEHRACVVDLLSRAGK 603
           L +GMEPNHVIF++VLS+CSH G++ +GL I+ SM +DF + PN EH ACVVDLL RA +
Sbjct: 541 LHSGMEPNHVIFLAVLSSCSHNGMVQQGLKIFSSMVRDFGVEPNHEHLACVVDLLCRAKR 600

Query: 604 VDEAYSFYKTMFKEPSIVVLGMLLDACRVNGRVELGKVIARDMFELKPVDPGNFVQLANS 663
           +++A+ FYK  F  PSI VLG++LDACR NG+ E+  +I  DM ELKP D G++V+L +S
Sbjct: 601 IEDAFKFYKENFTRPSIDVLGIILDACRANGKTEVEDIICEDMIELKPGDAGHYVKLGHS 660

Query: 664 YASMSRWDGVEKAWTQMRSLGLKKYPGWSSIEVHGTTFTFFASHNSHPKIEKIILTVKAL 723
           +A+M RWD V ++W QMRSLGLKK PGWS IE++G T TFF +H SH   +  +  +K L
Sbjct: 661 FAAMKRWDDVSESWNQMRSLGLKKLPGWSKIEMNGKTTTFFMNHTSHS--DDTVSLLKLL 720

Query: 724 SKNI 727
           S+ +
Sbjct: 721 SREM 722

BLAST of CSPI01G31740 vs. TAIR 10
Match: AT1G69350.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 442.6 bits (1137), Expect = 6.3e-124
Identity = 234/690 (33.91%), Postives = 397/690 (57.54%), Query Frame = 0

Query: 34  LQTYISMQKTHTQLDAYTFPSLFKACT-NLNLFSHGLSLHQSVVVNGLSHDSYIGSSLIS 93
           +  Y  +    TQ+  + FPS+ +AC  +    S G  +H  ++  G+  D+ I +SL+ 
Sbjct: 85  IDLYHRLVSETTQISKFVFPSVLRACAGSREHLSVGGKVHGRIIKGGVDDDAVIETSLLC 144

Query: 94  FYAKFGCIHLGRKVFDTMLKRNVVPWTTIIGSYSREGDIDIAFSMFKQMRESGIQPTSVT 153
            Y + G +    KVFD M  R++V W+T++ S    G++  A  MFK M + G++P +VT
Sbjct: 145 MYGQTGNLSDAEKVFDGMPVRDLVAWSTLVSSCLENGEVVKALRMFKCMVDDGVEPDAVT 204

Query: 154 LLSLLPGISKLPLLLCLHCLIILHG------FESDLALSNSMVNMYGKCGRIADARRLFE 213
           ++S++ G ++L    CL     +HG      F+ D  L NS++ MY KCG +  + R+FE
Sbjct: 205 MISVVEGCAELG---CLRIARSVHGQITRKMFDLDETLCNSLLTMYSKCGDLLSSERIFE 264

Query: 214 SIDCRDIVSWNSLLSAYSKIGATEEILQLLQAMKIEDIKPDKQTFCSALSASAIKGDLRL 273
            I  ++ VSW +++S+Y++   +E+ L+    M    I+P+  T  S LS+  + G +R 
Sbjct: 265 KIAKKNAVSWTAMISSYNRGEFSEKALRSFSEMIKSGIEPNLVTLYSVLSSCGLIGLIRE 324

Query: 274 GKLVHGLMLKDGLNID-QHVESALVVLYLRCRCLDPAYKVFKSTTEKDVVMWTAMISGLV 333
           GK VHG  ++  L+ + + +  ALV LY  C  L     V +  +++++V W ++IS   
Sbjct: 325 GKSVHGFAVRRELDPNYESLSLALVELYAECGKLSDCETVLRVVSDRNIVAWNSLISLYA 384

Query: 334 QNDCADKALGVFYQMIESNVKPSTATLASGLAACAQLGCCDIGASIHGYVLRQGIMLDIP 393
                 +ALG+F QM+   +KP   TLAS ++AC   G   +G  IHG+V+R  +  D  
Sbjct: 385 HRGMVIQALGLFRQMVTQRIKPDAFTLASSISACENAGLVPLGKQIHGHVIRTDVS-DEF 444

Query: 394 AQNSLVTMYAKCNKLQQSCSIFNKMVEKDLVSWNAIVAGHAKNGYLSKGIFFFNEMRKSF 453
            QNSL+ MY+K   +  + ++FN++  + +V+WN+++ G ++NG   + I  F+ M  S+
Sbjct: 445 VQNSLIDMYSKSGSVDSASTVFNQIKHRSVVTWNSMLCGFSQNGNSVEAISLFDYMYHSY 504

Query: 454 LRPDSITVTSLLQACGSAGALCQGKWIHNFVLRSSLIPCIMTETALVDMYFKCGNLENAQ 513
           L  + +T  +++QAC S G+L +GKW+H+ ++ S L   + T+TAL+DMY KCG+L  A+
Sbjct: 505 LEMNEVTFLAVIQACSSIGSLEKGKWVHHKLIISGL-KDLFTDTALIDMYAKCGDLNAAE 564

Query: 514 KCFDCMLQRDLVAWSTLIVGYGFNGKGEIALRKYSEFLGTGMEPNHVIFISVLSACSHGG 573
             F  M  R +V+WS++I  YG +G+   A+  +++ + +G +PN V+F++VLSAC H G
Sbjct: 565 TVFRAMSSRSIVSWSSMINAYGMHGRIGSAISTFNQMVESGTKPNEVVFMNVLSACGHSG 624

Query: 574 LISKGLSIYESMTKDFRMSPNLEHRACVVDLLSRAGKVDEAYSFYKTMFKEPSIVVLGML 633
            + +G   Y ++ K F +SPN EH AC +DLLSR+G + EAY   K M       V G L
Sbjct: 625 SVEEG-KYYFNLMKSFGVSPNSEHFACFIDLLSRSGDLKEAYRTIKEMPFLADASVWGSL 684

Query: 634 LDACRVNGRVELGKVIARDMFELKPVDPGNFVQLANSYASMSRWDGVEKAWTQMRSLGLK 693
           ++ CR++ ++++ K I  D+ ++   D G +  L+N YA    W+   +  + M+S  LK
Sbjct: 685 VNGCRIHQKMDIIKAIKNDLSDIVTDDTGYYTLLSNIYAEEGEWEEFRRLRSAMKSSNLK 744

Query: 694 KYPGWSSIEVHGTTFTFFASHNSHPKIEKI 716
           K PG+S+IE+    F F A   +  + ++I
Sbjct: 745 KVPGYSAIEIDQKVFRFGAGEENRIQTDEI 768

BLAST of CSPI01G31740 vs. TAIR 10
Match: AT3G03580.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 431.4 bits (1108), Expect = 1.5e-120
Identity = 233/709 (32.86%), Postives = 388/709 (54.72%), Query Frame = 0

Query: 17  FNSLVSRLSYQGAHHQVLQTYISMQKTHTQLDAYTFPSLFKACTNLNLFSHGLSLHQSVV 76
           +NS++   S  G   + L+ Y  ++++    D YTFPS+ KAC  L     G  +++ ++
Sbjct: 74  WNSIIRAFSKNGLFPEALEFYGKLRESKVSPDKYTFPSVIKACAGLFDAEMGDLVYEQIL 133

Query: 77  VNGLSHDSYIGSSLISFYAKFGCIHLGRKVFDTMLKRNVVPWTTIIGSYSREGDIDIAFS 136
             G   D ++G++L+  Y++ G +   R+VFD M  R++V W ++I  YS  G  + A  
Sbjct: 134 DMGFESDLFVGNALVDMYSRMGLLTRARQVFDEMPVRDLVSWNSLISGYSSHGYYEEALE 193

Query: 137 MFKQMRESGIQPTSVTLLSLLPGISKLPLL---LCLHCLIILHGFESDLALSNSMVNMYG 196
           ++ +++ S I P S T+ S+LP    L ++     LH   +  G  S + ++N +V MY 
Sbjct: 194 IYHELKNSWIVPDSFTVSSVLPAFGNLLVVKQGQGLHGFALKSGVNSVVVVNNGLVAMYL 253

Query: 197 KCGRIADARRLFESIDCRDIVSWNSLLSAYSKIGATEEILQLLQAMKIEDIKPDKQTFCS 256
           K  R  DARR+F+ +D RD VS+N+++  Y K+   EE +++     ++  KPD  T  S
Sbjct: 254 KFRRPTDARRVFDEMDVRDSVSYNTMICGYLKLEMVEESVRMF-LENLDQFKPDLLTVSS 313

Query: 257 ALSASAIKGDLRLGKLVHGLMLKDGLNIDQHVESALVVLYLRCRCLDPAYKVFKSTTEKD 316
            L A     DL L K ++  MLK G  ++  V + L+ +Y +C  +  A  VF S   KD
Sbjct: 314 VLRACGHLRDLSLAKYIYNYMLKAGFVLESTVRNILIDVYAKCGDMITARDVFNSMECKD 373

Query: 317 VVMWTAMISGLVQNDCADKALGVFYQMIESNVKPSTATLASGLAACAQLGCCDIGASIHG 376
            V W ++ISG +Q+    +A+ +F  M+    +    T    ++   +L     G  +H 
Sbjct: 374 TVSWNSIISGYIQSGDLMEAMKLFKMMMIMEEQADHITYLMLISVSTRLADLKFGKGLHS 433

Query: 377 YVLRQGIMLDIPAQNSLVTMYAKCNKLQQSCSIFNKMVEKDLVSWNAIVAGHAKNGYLSK 436
             ++ GI +D+   N+L+ MYAKC ++  S  IF+ M   D V+WN +++   + G  + 
Sbjct: 434 NGIKSGICIDLSVSNALIDMYAKCGEVGDSLKIFSSMGTGDTVTWNTVISACVRFGDFAT 493

Query: 437 GIFFFNEMRKSFLRPDSITVTSLLQACGSAGALCQGKWIHNFVLRSSLIPCIMTETALVD 496
           G+    +MRKS + PD  T    L  C S  A   GK IH  +LR      +    AL++
Sbjct: 494 GLQVTTQMRKSEVVPDMATFLVTLPMCASLAAKRLGKEIHCCLLRFGYESELQIGNALIE 553

Query: 497 MYFKCGNLENAQKCFDCMLQRDLVAWSTLIVGYGFNGKGEIALRKYSEFLGTGMEPNHVI 556
           MY KCG LEN+ + F+ M +RD+V W+ +I  YG  G+GE AL  +++   +G+ P+ V+
Sbjct: 554 MYSKCGCLENSSRVFERMSRRDVVTWTGMIYAYGMYGEGEKALETFADMEKSGIVPDSVV 613

Query: 557 FISVLSACSHGGLISKGLSIYESMTKDFRMSPNLEHRACVVDLLSRAGKVDEAYSFYKTM 616
           FI+++ ACSH GL+ +GL+ +E M   +++ P +EH ACVVDLLSR+ K+ +A  F + M
Sbjct: 614 FIAIIYACSHSGLVDEGLACFEKMKTHYKIDPMIEHYACVVDLLSRSQKISKAEEFIQAM 673

Query: 617 FKEPSIVVLGMLLDACRVNGRVELGKVIARDMFELKPVDPGNFVQLANSYASMSRWDGVE 676
             +P   +   +L ACR +G +E  + ++R + EL P DPG  +  +N+YA++ +WD V 
Sbjct: 674 PIKPDASIWASVLRACRTSGDMETAERVSRRIIELNPDDPGYSILASNAYAALRKWDKVS 733

Query: 677 KAWTQMRSLGLKKYPGWSSIEVHGTTFTFFASHNSHPKIEKIILTVKAL 723
                ++   + K PG+S IEV      F +  +S P+ E I  +++ L
Sbjct: 734 LIRKSLKDKHITKNPGYSWIEVGKNVHVFSSGDDSAPQSEAIYKSLEIL 781

BLAST of CSPI01G31740 vs. TAIR 10
Match: AT1G11290.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 428.7 bits (1101), Expect = 9.5e-120
Identity = 225/692 (32.51%), Postives = 388/692 (56.07%), Query Frame = 0

Query: 41  QKTHTQLDAYTFPS--LFKACTNLNLFSHGLSLHQSVVVNGLSHDSYIGSSLISFYAKFG 100
           ++ +   + Y  P+  L + C++L      L L   V  NGL  + +  + L+S + ++G
Sbjct: 27  ERNYIPANVYEHPAALLLERCSSLKELRQILPL---VFKNGLYQEHFFQTKLVSLFCRYG 86

Query: 101 CIHLGRKVFDTMLKRNVVPWTTIIGSYSREGDIDIAFSMFKQMRESGIQPTSVT---LLS 160
            +    +VF+ +  +  V + T++  +++  D+D A   F +MR   ++P       LL 
Sbjct: 87  SVDEAARVFEPIDSKLNVLYHTMLKGFAKVSDLDKALQFFVRMRYDDVEPVVYNFTYLLK 146

Query: 161 LLPGISKLPLLLCLHCLIILHGFESDLALSNSMVNMYGKCGRIADARRLFESIDCRDIVS 220
           +    ++L +   +H L++  GF  DL     + NMY KC ++ +AR++F+ +  RD+VS
Sbjct: 147 VCGDEAELRVGKEIHGLLVKSGFSLDLFAMTGLENMYAKCRQVNEARKVFDRMPERDLVS 206

Query: 221 WNSLLSAYSKIGATEEILQLLQAMKIEDIKPDKQTFCSALSASAIKGDLRLGKLVHGLML 280
           WN++++ YS+ G     L+++++M  E++KP   T  S L A +    + +GK +HG  +
Sbjct: 207 WNTIVAGYSQNGMARMALEMVKSMCEENLKPSFITIVSVLPAVSALRLISVGKEIHGYAM 266

Query: 281 KDGLNIDQHVESALVVLYLRCRCLDPAYKVFKSTTEKDVVMWTAMISGLVQNDCADKALG 340
           + G +   ++ +ALV +Y +C  L+ A ++F    E++VV W +MI   VQN+   +A+ 
Sbjct: 267 RSGFDSLVNISTALVDMYAKCGSLETARQLFDGMLERNVVSWNSMIDAYVQNENPKEAML 326

Query: 341 VFYQMIESNVKPSTATLASGLAACAQLGCCDIGASIHGYVLRQGIMLDIPAQNSLVTMYA 400
           +F +M++  VKP+  ++   L ACA LG  + G  IH   +  G+  ++   NSL++MY 
Sbjct: 327 IFQKMLDEGVKPTDVSVMGALHACADLGDLERGRFIHKLSVELGLDRNVSVVNSLISMYC 386

Query: 401 KCNKLQQSCSIFNKMVEKDLVSWNAIVAGHAKNGYLSKGIFFFNEMRKSFLRPDSITVTS 460
           KC ++  + S+F K+  + LVSWNA++ G A+NG     + +F++MR   ++PD+ T  S
Sbjct: 387 KCKEVDTAASMFGKLQSRTLVSWNAMILGFAQNGRPIDALNYFSQMRSRTVKPDTFTYVS 446

Query: 461 LLQACGSAGALCQGKWIHNFVLRSSLIPCIMTETALVDMYFKCGNLENAQKCFDCMLQRD 520
           ++ A          KWIH  V+RS L   +   TALVDMY KCG +  A+  FD M +R 
Sbjct: 447 VITAIAELSITHHAKWIHGVVMRSCLDKNVFVTTALVDMYAKCGAIMIARLIFDMMSERH 506

Query: 521 LVAWSTLIVGYGFNGKGEIALRKYSEFLGTGMEPNHVIFISVLSACSHGGLISKGLSIYE 580
           +  W+ +I GYG +G G+ AL  + E     ++PN V F+SV+SACSH GL+  GL  + 
Sbjct: 507 VTTWNAMIDGYGTHGFGKAALELFEEMQKGTIKPNGVTFLSVISACSHSGLVEAGLKCFY 566

Query: 581 SMTKDFRMSPNLEHRACVVDLLSRAGKVDEAYSFYKTMFKEPSIVVLGMLLDACRVNGRV 640
            M +++ +  +++H   +VDLL RAG+++EA+ F   M  +P++ V G +L AC+++  V
Sbjct: 567 MMKENYSIELSMDHYGAMVDLLGRAGRLNEAWDFIMQMPVKPAVNVYGAMLGACQIHKNV 626

Query: 641 ELGKVIARDMFELKPVDPGNFVQLANSYASMSRWDGVEKAWTQMRSLGLKKYPGWSSIEV 700
              +  A  +FEL P D G  V LAN Y + S W+ V +    M   GL+K PG S +E+
Sbjct: 627 NFAEKAAERLFELNPDDGGYHVLLANIYRAASMWEKVGQVRVSMLRQGLRKTPGCSMVEI 686

Query: 701 HGTTFTFFASHNSHPKIEKIILTVKALSKNIR 728
                +FF+   +HP  +KI   ++ L  +I+
Sbjct: 687 KNEVHSFFSGSTAHPDSKKIYAFLEKLICHIK 715

BLAST of CSPI01G31740 vs. TAIR 10
Match: AT2G27610.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 421.0 bits (1081), Expect = 2.0e-117
Identity = 226/720 (31.39%), Postives = 397/720 (55.14%), Query Frame = 0

Query: 15  KSFNSLVSRLSYQGAHHQVLQTYISMQKTHTQLDAYTFPSLFKACTNLNLFSHGLSLHQS 74
           +S+ SL+   S  G   +  + ++++ +   ++D   F S+ K    L     G  LH  
Sbjct: 59  ESYISLLFGFSRDGRTQEAKRLFLNIHRLGMEMDCSIFSSVLKVSATLCDELFGRQLHCQ 118

Query: 75  VVVNGLSHDSYIGSSLISFYAKFGCIHLGRKVFDTMLKRNVVPWTTIIGSYSREGDIDIA 134
            +  G   D  +G+SL+  Y K      GRKVFD M +RNVV WTT+I  Y+R    D  
Sbjct: 119 CIKFGFLDDVSVGTSLVDTYMKGSNFKDGRKVFDEMKERNVVTWTTLISGYARNSMNDEV 178

Query: 135 FSMFKQMRESGIQPTSVTLLSLLPGISKLPL---LLCLHCLIILHGFESDLALSNSMVNM 194
            ++F +M+  G QP S T  + L  +++  +    L +H +++ +G +  + +SNS++N+
Sbjct: 179 LTLFMRMQNEGTQPNSFTFAAALGVLAEEGVGGRGLQVHTVVVKNGLDKTIPVSNSLINL 238

Query: 195 YGKCGRIADARRLFESIDCRDIVSWNSLLSAYSKIGATEEILQLLQAMKIEDIKPDKQTF 254
           Y KCG +  AR LF+  + + +V+WNS++S Y+  G   E L +  +M++  ++  + +F
Sbjct: 239 YLKCGNVRKARILFDKTEVKSVVTWNSMISGYAANGLDLEALGMFYSMRLNYVRLSESSF 298

Query: 255 CSALSASAIKGDLRLGKLVHGLMLKDGLNIDQHVESALVVLYLRCRCLDPAYKVFKST-T 314
            S +   A   +LR  + +H  ++K G   DQ++ +AL+V Y +C  +  A ++FK    
Sbjct: 299 ASVIKLCANLKELRFTEQLHCSVVKYGFLFDQNIRTALMVAYSKCTAMLDALRLFKEIGC 358

Query: 315 EKDVVMWTAMISGLVQNDCADKALGVFYQMIESNVKPSTATLASGLAACAQLGCCDIGAS 374
             +VV WTAMISG +QND  ++A+ +F +M    V+P+  T +  L A   +      + 
Sbjct: 359 VGNVVSWTAMISGFLQNDGKEEAVDLFSEMKRKGVRPNEFTYSVILTALPVIS----PSE 418

Query: 375 IHGYVLRQGIMLDIPAQNSLVTMYAKCNKLQQSCSIFNKMVEKDLVSWNAIVAGHAKNGY 434
           +H  V++           +L+  Y K  K++++  +F+ + +KD+V+W+A++AG+A+ G 
Sbjct: 419 VHAQVVKTNYERSSTVGTALLDAYVKLGKVEEAAKVFSGIDDKDIVAWSAMLAGYAQTGE 478

Query: 435 LSKGIFFFNEMRKSFLRPDSITVTSLLQACGSAGA-LCQGKWIHNFVLRSSLIPCIMTET 494
               I  F E+ K  ++P+  T +S+L  C +  A + QGK  H F ++S L   +   +
Sbjct: 479 TEAAIKMFGELTKGGIKPNEFTFSSILNVCAATNASMGQGKQFHGFAIKSRLDSSLCVSS 538

Query: 495 ALVDMYFKCGNLENAQKCFDCMLQRDLVAWSTLIVGYGFNGKGEIALRKYSEFLGTGMEP 554
           AL+ MY K GN+E+A++ F    ++DLV+W+++I GY  +G+   AL  + E     ++ 
Sbjct: 539 ALLTMYAKKGNIESAEEVFKRQREKDLVSWNSMISGYAQHGQAMKALDVFKEMKKRKVKM 598

Query: 555 NHVIFISVLSACSHGGLISKGLSIYESMTKDFRMSPNLEHRACVVDLLSRAGKVDEAYSF 614
           + V FI V +AC+H GL+ +G   ++ M +D +++P  EH +C+VDL SRAG++++A   
Sbjct: 599 DGVTFIGVFAACTHAGLVEEGEKYFDIMVRDCKIAPTKEHNSCMVDLYSRAGQLEKAMKV 658

Query: 615 YKTMFKEPSIVVLGMLLDACRVNGRVELGKVIARDMFELKPVDPGNFVQLANSYASMSRW 674
            + M       +   +L ACRV+ + ELG++ A  +  +KP D   +V L+N YA    W
Sbjct: 659 IENMPNPAGSTIWRTILAACRVHKKTELGRLAAEKIIAMKPEDSAAYVLLSNMYAESGDW 718

Query: 675 DGVEKAWTQMRSLGLKKYPGWSSIEVHGTTFTFFASHNSHPKIEKIILTVKALSKNIRNL 730
               K    M    +KK PG+S IEV   T++F A   SHP  ++I + ++ LS  +++L
Sbjct: 719 QERAKVRKLMNERNVKKEPGYSWIEVKNKTYSFLAGDRSHPLKDQIYMKLEDLSTRLKDL 774

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q9XE981.3e-21751.24Pentatricopeptide repeat-containing protein At4g04370 OS=Arabidopsis thaliana OX... [more]
Q9C5078.9e-12333.91Putative pentatricopeptide repeat-containing protein At1g69350, mitochondrial OS... [more]
Q9SS602.1e-11932.86Pentatricopeptide repeat-containing protein At3g03580 OS=Arabidopsis thaliana OX... [more]
Q3E6Q11.3e-11832.51Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidop... [more]
Q9ZUW32.8e-11631.39Pentatricopeptide repeat-containing protein At2g27610 OS=Arabidopsis thaliana OX... [more]
Match NameE-valueIdentityDescription
A0A0A0M0F80.0e+0099.46Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G650050 PE=4 SV=1[more]
A0A1S4DUX50.0e+0095.42pentatricopeptide repeat-containing protein At4g04370 OS=Cucumis melo OX=3656 GN... [more]
A0A5D3CX330.0e+0095.59Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946... [more]
A0A5A7V0330.0e+0095.45Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946... [more]
A0A6J1E5220.0e+0081.12pentatricopeptide repeat-containing protein At4g04370 OS=Momordica charantia OX=... [more]
Match NameE-valueIdentityDescription
XP_004139152.10.0e+0099.46pentatricopeptide repeat-containing protein At4g04370 [Cucumis sativus][more]
XP_016899786.10.0e+0095.42PREDICTED: pentatricopeptide repeat-containing protein At4g04370 [Cucumis melo][more]
TYK14769.10.0e+0095.59pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa][more]
KAA0060187.10.0e+0095.45pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa][more]
XP_038878475.10.0e+0086.49pentatricopeptide repeat-containing protein At4g04370 [Benincasa hispida] >XP_03... [more]
Match NameE-valueIdentityDescription
AT4G04370.18.9e-21951.24Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT1G69350.16.3e-12433.91Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT3G03580.11.5e-12032.86Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT1G11290.19.5e-12032.51Pentatricopeptide repeat (PPR) superfamily protein [more]
AT2G27610.12.0e-11731.39Tetratricopeptide repeat (TPR)-like superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Cucumber (PI 183967) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 593..615
e-value: 0.31
score: 11.4
coord: 489..515
e-value: 0.0072
score: 16.5
coord: 315..345
e-value: 1.1E-6
score: 28.5
coord: 186..209
e-value: 0.023
score: 14.9
coord: 552..579
e-value: 0.45
score: 10.9
coord: 517..541
e-value: 0.5
score: 10.7
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 214..247
e-value: 1.6E-4
score: 19.6
coord: 416..449
e-value: 2.8E-5
score: 22.0
coord: 315..348
e-value: 1.8E-7
score: 28.9
coord: 89..118
e-value: 0.0021
score: 16.1
coord: 118..149
e-value: 3.1E-8
score: 31.3
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 113..159
e-value: 1.0E-9
score: 38.5
coord: 413..460
e-value: 3.5E-9
score: 36.7
coord: 212..256
e-value: 1.5E-8
score: 34.7
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 515..549
score: 8.988323
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 414..448
score: 10.577712
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 313..347
score: 11.191545
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 212..246
score: 11.509422
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 114..148
score: 12.75901
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 568..714
e-value: 1.9E-11
score: 46.0
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 470..567
e-value: 4.8E-17
score: 63.9
coord: 167..263
e-value: 7.9E-20
score: 73.0
coord: 366..469
e-value: 9.0E-20
score: 72.8
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 14..166
e-value: 1.5E-27
score: 98.8
coord: 264..365
e-value: 1.4E-16
score: 62.8
NoneNo IPR availablePANTHERPTHR47924PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEINcoord: 5..367
NoneNo IPR availablePANTHERPTHR47924PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEINcoord: 169..443
NoneNo IPR availablePANTHERPTHR47924:SF46PENTATRICOPEPTIDE REPEAT PROTEIN-RELATEDcoord: 5..367
NoneNo IPR availablePANTHERPTHR47924:SF46PENTATRICOPEPTIDE REPEAT PROTEIN-RELATEDcoord: 169..443
NoneNo IPR availablePANTHERPTHR47924PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEINcoord: 276..729
NoneNo IPR availablePANTHERPTHR47924:SF46PENTATRICOPEPTIDE REPEAT PROTEIN-RELATEDcoord: 276..729

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CSPI01G31740.1CSPI01G31740.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0005515 protein binding