CSPI04G18560 (gene) Cucumber (PI 183967) v1

Overview
Name: CSPI04G18560
Type: gene
Organism: Cucumis sativus var. hardwickii cv. PI 183967 (Cucumber (PI 183967) v1)
Description: Pentatricopeptide repeat-containing protein
Location: Chr4: 16056873 .. 16058783 (+)
RNA-Seq Expression: CSPI04G18560
Synteny: CSPI04G18560
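The location field in the overview can be turned into a feature length with a few lines of code. The sketch below is only an illustration: the function names `parse_location` and `span_length` are hypothetical, and it assumes the coordinates are 1-based and inclusive, which is the usual convention for records of this kind but is not stated on the page.

```python
import re


def parse_location(text):
    """Parse a string like 'Chr4: 16056873 .. 16058783 (+)'.

    Returns (chromosome, start, end, strand). The layout is assumed
    from the single location shown in this record.
    """
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand


def span_length(start, end):
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end, strand = parse_location("Chr4: 16056873 .. 16058783 (+)")
print(chrom, strand, span_length(start, end))  # Chr4 + 1911
```

Under the inclusive-coordinate assumption the gene spans 1911 bp on the plus strand of Chr4.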
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTR, CDS, polypeptide
TGCTTAACAAACAGAAAGTTTATCTTCTCCAAAATTGTCTCCAAGAAGCACTACTTTTTTTCTTCTTCCTTTCTTAGTTTCTCTACTTCACCACCTTCCATGCTTTCAGCAATTCAAATGCTGAATAATGTAACCAAATGCGTTGCTTTTCTACAATCGTGTGCTGACCACCAGAATCTTAACAAAGGAAAACAGCTTCACTCCCTAATGATCACCTATGGTTTTTCTCCCTCACCTCCATCCATCACTAGCTTAATCAACATGTACTCCAAATGTGGTCAAATGGGGGAGGCCATTTTGGTTTTCTATGATCCGTGCCATGAGCGTAATGTGTTTGCATATAATGCTATAATTTCTGGGTTTGTCTCCAACGGTTTGGCTTCAAAAGGGTTTCAATTTTATAAGAAAATGAGGTTAGAGGGTGTAATGCCTGATAAATACACTTTTCCATGTGTAGTTAGAACTTGTTGTGAGGTTATGGAGGTGAAGAAGATTCATGGATGTTTGCTTAAAATGGGGTTGGAGTTGGATGTGTTTGTTGGTAGTGCTTTGGTTAATACTTATTTGAAGAATGGCTCGATGGAGGATGCACAAAAAGTGTTTGGAGAACTATCAATAAGAGATGTTGTACTTTGGAATGCAATGATCAACGGGTATGCCAAAATTGGTTGCCTTGACGAGGCACTGGAGGTTTTCAGAAGAATGCATGTAAAAGGGGTTGCACCTAGTAGGTTTACAATTACTGGCATTTTATCTGTTTTTGCTTCAAGGGGAGACTTAGACAATGGGAAAACAGTTCATGGGATTGTCATGAAAATGGGTTATGATTCAGGAGTTTCAGTTTCAAATGCATTAATTGATATGTATGGGAAATGCAAACATATTGGAGATGCGTTAATAATTTTTGAGATGATAAATGAGAAGGATATTTTCTCGTGGAACTCGATTATATCAGTTCATGAACAATGTGGTGATCATGATGGTACCTTGAGGCTTTTTGATAAGATGTTAGGTTCTGGGATTCTACCAGATTTGGTAACCATCACAACTGTGCTTCCAGCTTGCTCTCATTTGGCTGCGCTCATGCACGGTAGAGAAATTCATGGATACATGATCATTAATGGACTTGGAAAGGATGATGAAAATGGAGCTGTAGATAATTTACTTGTAAGTAATGCTGTTATGGATATGTATGCAAAATGCGGTAGTATGAACAATGCCCTCAAGATTTTTGATTCAATGAGCAAAAAGGACGTGGCATCATGGAATATCATGATTATGGGTTATGGTATGCATGGTTATGCTTTGGAGGCATTGGGTATGTTTTCTCAAATGTGTGAGGCTGAATTTAAGCCAAATGAAGTTACGCTTGTTGGAGTTCTATCAGCATGCAATCATGCAGGCTTTGTGTCTCATGGGCGTTTGTTTTTAGCTCAGATGGAATCTACATTTGGTGTTATTCCAACTATTGAGCATTATACGTGTGTAATTGATATGCTTGGTCGAGCTGGGCATTTAGAGGACGCGTATGAGATCGTGCAGAAAATGCCTATTCAAGCCAATCCCGTTGTTTGGAGGGCTTTATTGGGAGCATGTCGACTTCATGGGAATGCAGAGTTGGCAGAAATTGCAGCACGACAAGTACTGCAACTAGAACCAGAGCATTGTGGGAGTTATGTATTGATGTCCAATGTTTATGGAGTTATAGGTCGATATGAAGAGGTGTTGGAGGTTAGAAAAACGATGAAGGAACAAAATGTCAAGAAGACACCTGGTTGTAGTTGGATTGAACTCAAGGATGGGGTGCATGTTTTCCGTACTGGAGATAGGACCCATTCAGAATTGAATGCATTGACTAATCAACTGTGTGATATTGGATTCATTTTAGATGAAGTTTTGAATTTATATTGA

mRNA sequence

TGCTTAACAAACAGAAAGTTTATCTTCTCCAAAATTGTCTCCAAGAAGCACTACTTTTTTTCTTCTTCCTTTCTTAGTTTCTCTACTTCACCACCTTCCATGCTTTCAGCAATTCAAATGCTGAATAATGTAACCAAATGCGTTGCTTTTCTACAATCGTGTGCTGACCACCAGAATCTTAACAAAGGAAAACAGCTTCACTCCCTAATGATCACCTATGGTTTTTCTCCCTCACCTCCATCCATCACTAGCTTAATCAACATGTACTCCAAATGTGGTCAAATGGGGGAGGCCATTTTGGTTTTCTATGATCCGTGCCATGAGCGTAATGTGTTTGCATATAATGCTATAATTTCTGGGTTTGTCTCCAACGGTTTGGCTTCAAAAGGGTTTCAATTTTATAAGAAAATGAGGTTAGAGGGTGTAATGCCTGATAAATACACTTTTCCATGTGTAGTTAGAACTTGTTGTGAGGTTATGGAGGTGAAGAAGATTCATGGATGTTTGCTTAAAATGGGGTTGGAGTTGGATGTGTTTGTTGGTAGTGCTTTGGTTAATACTTATTTGAAGAATGGCTCGATGGAGGATGCACAAAAAGTGTTTGGAGAACTATCAATAAGAGATGTTGTACTTTGGAATGCAATGATCAACGGGTATGCCAAAATTGGTTGCCTTGACGAGGCACTGGAGGTTTTCAGAAGAATGCATGTAAAAGGGGTTGCACCTAGTAGGTTTACAATTACTGGCATTTTATCTGTTTTTGCTTCAAGGGGAGACTTAGACAATGGGAAAACAGTTCATGGGATTGTCATGAAAATGGGTTATGATTCAGGAGTTTCAGTTTCAAATGCATTAATTGATATGTATGGGAAATGCAAACATATTGGAGATGCGTTAATAATTTTTGAGATGATAAATGAGAAGGATATTTTCTCGTGGAACTCGATTATATCAGTTCATGAACAATGTGGTGATCATGATGGTACCTTGAGGCTTTTTGATAAGATGTTAGGTTCTGGGATTCTACCAGATTTGGTAACCATCACAACTGTGCTTCCAGCTTGCTCTCATTTGGCTGCGCTCATGCACGGTAGAGAAATTCATGGATACATGATCATTAATGGACTTGGAAAGGATGATGAAAATGGAGCTGTAGATAATTTACTTGTAAGTAATGCTGTTATGGATATGTATGCAAAATGCGGTAGTATGAACAATGCCCTCAAGATTTTTGATTCAATGAGCAAAAAGGACGTGGCATCATGGAATATCATGATTATGGGTTATGGTATGCATGGTTATGCTTTGGAGGCATTGGGTATGTTTTCTCAAATGTGTGAGGCTGAATTTAAGCCAAATGAAGTTACGCTTGTTGGAGTTCTATCAGCATGCAATCATGCAGGCTTTGTGTCTCATGGGCGTTTGTTTTTAGCTCAGATGGAATCTACATTTGGTGTTATTCCAACTATTGAGCATTATACGTGTGTAATTGATATGCTTGGTCGAGCTGGGCATTTAGAGGACGCGTATGAGATCGTGCAGAAAATGCCTATTCAAGCCAATCCCGTTGTTTGGAGGGCTTTATTGGGAGCATGTCGACTTCATGGGAATGCAGAGTTGGCAGAAATTGCAGCACGACAAGTACTGCAACTAGAACCAGAGCATTGTGGGAGTTATGTATTGATGTCCAATGTTTATGGAGTTATAGGTCGATATGAAGAGGTGTTGGAGGTTAGAAAAACGATGAAGGAACAAAATGTCAAGAAGACACCTGGTTGTAGTTGGATTGAACTCAAGGATGGGGTGCATGTTTTCCGTACTGGAGATAGGACCCATTCAGAATTGAATGCATTGACTAATCAACTGTGTGATATTGGATTCATTTTAGATGAAGTTTTGAATTTATATTGA

Coding sequence (CDS)

ATGCTTTCAGCAATTCAAATGCTGAATAATGTAACCAAATGCGTTGCTTTTCTACAATCGTGTGCTGACCACCAGAATCTTAACAAAGGAAAACAGCTTCACTCCCTAATGATCACCTATGGTTTTTCTCCCTCACCTCCATCCATCACTAGCTTAATCAACATGTACTCCAAATGTGGTCAAATGGGGGAGGCCATTTTGGTTTTCTATGATCCGTGCCATGAGCGTAATGTGTTTGCATATAATGCTATAATTTCTGGGTTTGTCTCCAACGGTTTGGCTTCAAAAGGGTTTCAATTTTATAAGAAAATGAGGTTAGAGGGTGTAATGCCTGATAAATACACTTTTCCATGTGTAGTTAGAACTTGTTGTGAGGTTATGGAGGTGAAGAAGATTCATGGATGTTTGCTTAAAATGGGGTTGGAGTTGGATGTGTTTGTTGGTAGTGCTTTGGTTAATACTTATTTGAAGAATGGCTCGATGGAGGATGCACAAAAAGTGTTTGGAGAACTATCAATAAGAGATGTTGTACTTTGGAATGCAATGATCAACGGGTATGCCAAAATTGGTTGCCTTGACGAGGCACTGGAGGTTTTCAGAAGAATGCATGTAAAAGGGGTTGCACCTAGTAGGTTTACAATTACTGGCATTTTATCTGTTTTTGCTTCAAGGGGAGACTTAGACAATGGGAAAACAGTTCATGGGATTGTCATGAAAATGGGTTATGATTCAGGAGTTTCAGTTTCAAATGCATTAATTGATATGTATGGGAAATGCAAACATATTGGAGATGCGTTAATAATTTTTGAGATGATAAATGAGAAGGATATTTTCTCGTGGAACTCGATTATATCAGTTCATGAACAATGTGGTGATCATGATGGTACCTTGAGGCTTTTTGATAAGATGTTAGGTTCTGGGATTCTACCAGATTTGGTAACCATCACAACTGTGCTTCCAGCTTGCTCTCATTTGGCTGCGCTCATGCACGGTAGAGAAATTCATGGATACATGATCATTAATGGACTTGGAAAGGATGATGAAAATGGAGCTGTAGATAATTTACTTGTAAGTAATGCTGTTATGGATATGTATGCAAAATGCGGTAGTATGAACAATGCCCTCAAGATTTTTGATTCAATGAGCAAAAAGGACGTGGCATCATGGAATATCATGATTATGGGTTATGGTATGCATGGTTATGCTTTGGAGGCATTGGGTATGTTTTCTCAAATGTGTGAGGCTGAATTTAAGCCAAATGAAGTTACGCTTGTTGGAGTTCTATCAGCATGCAATCATGCAGGCTTTGTGTCTCATGGGCGTTTGTTTTTAGCTCAGATGGAATCTACATTTGGTGTTATTCCAACTATTGAGCATTATACGTGTGTAATTGATATGCTTGGTCGAGCTGGGCATTTAGAGGACGCGTATGAGATCGTGCAGAAAATGCCTATTCAAGCCAATCCCGTTGTTTGGAGGGCTTTATTGGGAGCATGTCGACTTCATGGGAATGCAGAGTTGGCAGAAATTGCAGCACGACAAGTACTGCAACTAGAACCAGAGCATTGTGGGAGTTATGTATTGATGTCCAATGTTTATGGAGTTATAGGTCGATATGAAGAGGTGTTGGAGGTTAGAAAAACGATGAAGGAACAAAATGTCAAGAAGACACCTGGTTGTAGTTGGATTGAACTCAAGGATGGGGTGCATGTTTTCCGTACTGGAGATAGGACCCATTCAGAATTGAATGCATTGACTAATCAACTGTGTGATATTGGATTCATTTTAGATGAAGTTTTGAATTTATATTGA

Protein sequence

MLSAIQMLNNVTKCVAFLQSCADHQNLNKGKQLHSLMITYGFSPSPPSITSLINMYSKCGQMGEAILVFYDPCHERNVFAYNAIISGFVSNGLASKGFQFYKKMRLEGVMPDKYTFPCVVRTCCEVMEVKKIHGCLLKMGLELDVFVGSALVNTYLKNGSMEDAQKVFGELSIRDVVLWNAMINGYAKIGCLDEALEVFRRMHVKGVAPSRFTITGILSVFASRGDLDNGKTVHGIVMKMGYDSGVSVSNALIDMYGKCKHIGDALIIFEMINEKDIFSWNSIISVHEQCGDHDGTLRLFDKMLGSGILPDLVTITTVLPACSHLAALMHGREIHGYMIINGLGKDDENGAVDNLLVSNAVMDMYAKCGSMNNALKIFDSMSKKDVASWNIMIMGYGMHGYALEALGMFSQMCEAEFKPNEVTLVGVLSACNHAGFVSHGRLFLAQMESTFGVIPTIEHYTCVIDMLGRAGHLEDAYEIVQKMPIQANPVVWRALLGACRLHGNAELAEIAARQVLQLEPEHCGSYVLMSNVYGVIGRYEEVLEVRKTMKEQNVKKTPGCSWIELKDGVHVFRTGDRTHSELNALTNQLCDIGFILDEVLNLY*
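The relationship between the coding sequence and the protein sequence can be checked by translating codons with the standard genetic code. The following is a minimal sketch, not the database's own procedure: `translate` and `CODON_TABLE` are names introduced here for illustration, and only the first 30 and last 21 nucleotides of the CDS shown above are used as input to keep the example short.

```python
from itertools import product

# Standard genetic code, laid out in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds):
    """Translate an in-frame DNA coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 30 nt and last 21 nt of the CDS listed in this record.
cds_head = "ATGCTTTCAGCAATTCAAATGCTGAATAAT"
cds_tail = "GAAGTTTTGAATTTATATTGA"

print(translate(cds_head))  # MLSAIQMLNN
print(translate(cds_tail))  # EVLNLY*
```

Both fragments agree with the start (`MLSAIQMLNN`) and end (`EVLNLY*`) of the protein sequence above, with the trailing `*` corresponding to the TGA stop codon.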
Homology
BLAST of CSPI04G18560 vs. ExPASy Swiss-Prot
Match: Q9LUC2 (Pentatricopeptide repeat-containing protein At3g14730 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E31 PE=2 SV=1)

HSP 1 Score: 659.4 bits (1700), Expect = 3.8e-188
Identity = 319/585 (54.53%), Positives = 421/585 (71.97%), Query Frame = 0

Query: 9   NNVTKCVAFLQSCADHQNLNKGKQLHSLMITYGF-SPSPPSITSLINMYSKCGQMGEAIL 68
           +NV  C+A LQ CA  ++   G+Q+H  M+  GF   SP + TSL+NMY+KCG M  A+L
Sbjct: 58  HNVATCIATLQRCAQRKDYVSGQQIHGFMVRKGFLDDSPRAGTSLVNMYAKCGLMRRAVL 117

Query: 69  VFYDPCHERNVFAYNAIISGFVSNGLASKGFQFYKKMRLEGVMPDKYTFPCVVR--TCCE 128
           VF     ER+VF YNA+ISGFV NG      + Y++MR  G++PDKYTFP +++     E
Sbjct: 118 VFGG--SERDVFGYNALISGFVVNGSPLDAMETYREMRANGILPDKYTFPSLLKGSDAME 177

Query: 129 VMEVKKIHGCLLKMGLELDVFVGSALVNTYLKNGSMEDAQKVFGELSIR-DVVLWNAMIN 188
           + +VKK+HG   K+G + D +VGS LV +Y K  S+EDAQKVF EL  R D VLWNA++N
Sbjct: 178 LSDVKKVHGLAFKLGFDSDCYVGSGLVTSYSKFMSVEDAQKVFDELPDRDDSVLWNALVN 237

Query: 189 GYAKIGCLDEALEVFRRMHVKGVAPSRFTITGILSVFASRGDLDNGKTVHGIVMKMGYDS 248
           GY++I   ++AL VF +M  +GV  SR TIT +LS F   GD+DNG+++HG+ +K G  S
Sbjct: 238 GYSQIFRFEDALLVFSKMREEGVGVSRHTITSVLSAFTVSGDIDNGRSIHGLAVKTGSGS 297

Query: 249 GVSVSNALIDMYGKCKHIGDALIIFEMINEKDIFSWNSIISVHEQCGDHDGTLRLFDKML 308
            + VSNALIDMYGK K + +A  IFE ++E+D+F+WNS++ VH+ CGDHDGTL LF++ML
Sbjct: 298 DIVVSNALIDMYGKSKWLEEANSIFEAMDERDLFTWNSVLCVHDYCGDHDGTLALFERML 357

Query: 309 GSGILPDLVTITTVLPACSHLAALMHGREIHGYMIINGLGKDDENGAVDNLLVSNAVMDM 368
            SGI PD+VT+TTVLP C  LA+L  GREIHGYMI++GL     N    N  + N++MDM
Sbjct: 358 CSGIRPDIVTLTTVLPTCGRLASLRQGREIHGYMIVSGL----LNRKSSNEFIHNSLMDM 417

Query: 369 YAKCGSMNNALKIFDSMSKKDVASWNIMIMGYGMHGYALEALGMFSQMCEAEFKPNEVTL 428
           Y KCG + +A  +FDSM  KD ASWNIMI GYG+      AL MFS MC A  KP+E+T 
Sbjct: 418 YVKCGDLRDARMVFDSMRVKDSASWNIMINGYGVQSCGELALDMFSCMCRAGVKPDEITF 477

Query: 429 VGVLSACNHAGFVSHGRLFLAQMESTFGVIPTIEHYTCVIDMLGRAGHLEDAYEIVQKMP 488
           VG+L AC+H+GF++ GR FLAQME+ + ++PT +HY CVIDMLGRA  LE+AYE+    P
Sbjct: 478 VGLLQACSHSGFLNEGRNFLAQMETVYNILPTSDHYACVIDMLGRADKLEEAYELAISKP 537

Query: 489 IQANPVVWRALLGACRLHGNAELAEIAARQVLQLEPEHCGSYVLMSNVYGVIGRYEEVLE 548
           I  NPVVWR++L +CRLHGN +LA +A +++ +LEPEHCG YVLMSNVY   G+YEEVL+
Sbjct: 538 ICDNPVVWRSILSSCRLHGNKDLALVAGKRLHELEPEHCGGYVLMSNVYVEAGKYEEVLD 597

Query: 549 VRKTMKEQNVKKTPGCSWIELKDGVHVFRTGDRTHSELNALTNQL 590
           VR  M++QNVKKTPGCSWI LK+GVH F TG++TH E  ++ + L
Sbjct: 598 VRDAMRQQNVKKTPGCSWIVLKNGVHTFFTGNQTHPEFKSIHDWL 636
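The percentages in an HSP summary line follow directly from the reported counts: matches divided by alignment length. The sketch below recomputes them for the hit above. The helper names `parse_hsp_counts` and `percent` are hypothetical, and the regular expression is an assumption based only on the summary-line layout shown in this record, not on any BLAST parser library.

```python
import re


def parse_hsp_counts(line):
    """Extract 'Label = matches/length' pairs from an HSP summary line."""
    counts = {}
    for label, num, den in re.findall(r"(\w+) = (\d+)/(\d+)", line):
        counts[label] = (int(num), int(den))
    return counts


def percent(matches, length):
    """Percentage of aligned positions, rounded to two decimals."""
    return round(100.0 * matches / length, 2)


summary = "Identity = 319/585 (54.53%), Positives = 421/585 (71.97%), Query Frame = 0"
for label, (matches, length) in parse_hsp_counts(summary).items():
    print(label, matches, length, percent(matches, length))
```

For this hit, 319 of 585 aligned positions are identical (54.53%) and 421 of 585 are scored as positive substitutions (71.97%), which matches the values printed in the summary line.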

BLAST of CSPI04G18560 vs. ExPASy Swiss-Prot
Match: Q9M9E2 (Pentatricopeptide repeat-containing protein At1g15510, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H73 PE=1 SV=1)

HSP 1 Score: 419.5 bits (1077), Expect = 6.6e-116
Identity = 208/579 (35.92%), Positives = 341/579 (58.89%), Query Frame = 0

Query: 18  LQSCADHQNLNKGKQLHSLMITYGFSPSPPSITSLINMYSKCGQMGEAILVFYDPCHERN 77
           L++C    +L +GK++H  ++ YG+      + +LI MY KCG +  A L+F D    R+
Sbjct: 203 LRTCGGIPDLARGKEVHVHVVRYGYELDIDVVNALITMYVKCGDVKSARLLF-DRMPRRD 262

Query: 78  VFAYNAIISGFVSNGLASKGFQFYKKMRLEGVMPDKYTFPCVVRTCCEVMEVKK----IH 137
           + ++NA+ISG+  NG+  +G + +  MR   V PD  T   V+ + CE++  ++    IH
Sbjct: 263 IISWNAMISGYFENGMCHEGLELFFAMRGLSVDPDLMTLTSVI-SACELLGDRRLGRDIH 322

Query: 138 GCLLKMGLELDVFVGSALVNTYLKNGSMEDAQKVFGELSIRDVVLWNAMINGYAKIGCLD 197
             ++  G  +D+ V ++L   YL  GS  +A+K+F  +  +D+V W  MI+GY      D
Sbjct: 323 AYVITTGFAVDISVCNSLTQMYLNAGSWREAEKLFSRMERKDIVSWTTMISGYEYNFLPD 382

Query: 198 EALEVFRRMHVKGVAPSRFTITGILSVFASRGDLDNGKTVHGIVMKMGYDSGVSVSNALI 257
           +A++ +R M    V P   T+  +LS  A+ GDLD G  +H + +K    S V V+N LI
Sbjct: 383 KAIDTYRMMDQDSVKPDEITVAAVLSACATLGDLDTGVELHKLAIKARLISYVIVANNLI 442

Query: 258 DMYGKCKHIGDALIIFEMINEKDIFSWNSIIS---VHEQCGDHDGTLRLFDKMLGSGILP 317
           +MY KCK I  AL IF  I  K++ SW SII+   ++ +C +      +F + +   + P
Sbjct: 443 NMYSKCKCIDKALDIFHNIPRKNVISWTSIIAGLRLNNRCFE----ALIFLRQMKMTLQP 502

Query: 318 DLVTITTVLPACSHLAALMHGREIHGYMIINGLGKDDENGAVDNLLVSNAVMDMYAKCGS 377
           + +T+T  L AC+ + ALM G+EIH +++  G+G DD         + NA++DMY +CG 
Sbjct: 503 NAITLTAALAACARIGALMCGKEIHAHVLRTGVGLDD--------FLPNALLDMYVRCGR 562

Query: 378 MNNALKIFDSMSKKDVASWNIMIMGYGMHGYALEALGMFSQMCEAEFKPNEVTLVGVLSA 437
           MN A   F+S  KKDV SWNI++ GY   G     + +F +M ++  +P+E+T + +L  
Sbjct: 563 MNTAWSQFNS-QKKDVTSWNILLTGYSERGQGSMVVELFDRMVKSRVRPDEITFISLLCG 622

Query: 438 CNHAGFVSHGRLFLAQMESTFGVIPTIEHYTCVIDMLGRAGHLEDAYEIVQKMPIQANPV 497
           C+ +  V  G ++ ++ME  +GV P ++HY CV+D+LGRAG L++A++ +QKMP+  +P 
Sbjct: 623 CSKSQMVRQGLMYFSKMED-YGVTPNLKHYACVVDLLGRAGELQEAHKFIQKMPVTPDPA 682

Query: 498 VWRALLGACRLHGNAELAEIAARQVLQLEPEHCGSYVLMSNVYGVIGRYEEVLEVRKTMK 557
           VW ALL ACR+H   +L E++A+ + +L+ +  G Y+L+ N+Y   G++ EV +VR+ MK
Sbjct: 683 VWGALLNACRIHHKIDLGELSAQHIFELDKKSVGYYILLCNLYADCGKWREVAKVRRMMK 742

Query: 558 EQNVKKTPGCSWIELKDGVHVFRTGDRTHSELNALTNQL 590
           E  +    GCSW+E+K  VH F + D+ H +   +   L
Sbjct: 743 ENGLTVDAGCSWVEVKGKVHAFLSDDKYHPQTKEINTVL 765

BLAST of CSPI04G18560 vs. ExPASy Swiss-Prot
Match: Q9SN39 (Pentatricopeptide repeat-containing protein DOT4, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=DOT4 PE=2 SV=1)

HSP 1 Score: 416.0 bits (1068), Expect = 7.3e-115
Identity = 220/585 (37.61%), Positives = 345/585 (58.97%), Query Frame = 0

Query: 1   MLSAIQMLNNVTKCVAFLQSCADHQNLNKGKQLHSLMITYGFSPSPPSITSLINMYSKCG 60
           M S ++M +    CV+  +S +  ++++ G+QLH  ++  GF        SL+  Y K  
Sbjct: 187 MSSGVEMDSYTFSCVS--KSFSSLRSVHGGEQLHGFILKSGFGERNSVGNSLVAFYLKNQ 246

Query: 61  QMGEAILVFYDPCHERNVFAYNAIISGFVSNGLASKGFQFYKKMRLEGVMPDKYTFPCVV 120
           ++  A  VF D   ER+V ++N+II+G+VSNGLA KG   + +M + G+  D  T   V 
Sbjct: 247 RVDSARKVF-DEMTERDVISWNSIINGYVSNGLAEKGLSVFVQMLVSGIEIDLATIVSVF 306

Query: 121 RTCCEVMEV---KKIHGCLLKMGLELDVFVGSALVNTYLKNGSMEDAQKVFGELSIRDVV 180
             C +   +   + +H   +K     +    + L++ Y K G ++ A+ VF E+S R VV
Sbjct: 307 AGCADSRLISLGRAVHSIGVKACFSREDRFCNTLLDMYSKCGDLDSAKAVFREMSDRSVV 366

Query: 181 LWNAMINGYAKIGCLDEALEVFRRMHVKGVAPSRFTITGILSVFASRGDLDNGKTVHGIV 240
            + +MI GYA+ G   EA+++F  M  +G++P  +T+T +L+  A    LD GK VH  +
Sbjct: 367 SYTSMIAGYAREGLAGEAVKLFEEMEEEGISPDVYTVTAVLNCCARYRLLDEGKRVHEWI 426

Query: 241 MKMGYDSGVSVSNALIDMYGKCKHIGDALIIFEMINEKDIFSWNSIISVHEQCGDHDGTL 300
            +      + VSNAL+DMY KC  + +A ++F  +  KDI SWN+II  + +    +  L
Sbjct: 427 KENDLGFDIFVSNALMDMYAKCGSMQEAELVFSEMRVKDIISWNTIIGGYSKNCYANEAL 486

Query: 301 RLFDKML-GSGILPDLVTITTVLPACSHLAALMHGREIHGYMIINGLGKDDENGAVDNLL 360
            LF+ +L      PD  T+  VLPAC+ L+A   GREIHGY++ NG   D          
Sbjct: 487 SLFNLLLEEKRFSPDERTVACVLPACASLSAFDKGREIHGYIMRNGYFSDRH-------- 546

Query: 361 VSNAVMDMYAKCGSMNNALKIFDSMSKKDVASWNIMIMGYGMHGYALEALGMFSQMCEAE 420
           V+N+++DMYAKCG++  A  +FD ++ KD+ SW +MI GYGMHG+  EA+ +F+QM +A 
Sbjct: 547 VANSLVDMYAKCGALLLAHMLFDDIASKDLVSWTVMIAGYGMHGFGKEAIALFNQMRQAG 606

Query: 421 FKPNEVTLVGVLSACNHAGFVSHGRLFLAQMESTFGVIPTIEHYTCVIDMLGRAGHLEDA 480
            + +E++ V +L AC+H+G V  G  F   M     + PT+EHY C++DML R G L  A
Sbjct: 607 IEADEISFVSLLYACSHSGLVDEGWRFFNIMRHECKIEPTVEHYACIVDMLARTGDLIKA 666

Query: 481 YEIVQKMPIQANPVVWRALLGACRLHGNAELAEIAARQVLQLEPEHCGSYVLMSNVYGVI 540
           Y  ++ MPI  +  +W ALL  CR+H + +LAE  A +V +LEPE+ G YVLM+N+Y   
Sbjct: 667 YRFIENMPIPPDATIWGALLCGCRIHHDVKLAEKVAEKVFELEPENTGYYVLMANIYAEA 726

Query: 541 GRYEEVLEVRKTMKEQNVKKTPGCSWIELKDGVHVFRTGDRTHSE 582
            ++E+V  +RK + ++ ++K PGCSWIE+K  V++F  GD ++ E
Sbjct: 727 EKWEQVKRLRKRIGQRGLRKNPGCSWIEIKGRVNIFVAGDSSNPE 760

BLAST of CSPI04G18560 vs. ExPASy Swiss-Prot
Match: Q9STE1 (Pentatricopeptide repeat-containing protein At4g21300 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E36 PE=3 SV=1)

HSP 1 Score: 409.1 bits (1050), Expect = 8.9e-113
Identity = 218/609 (35.80%), Positives = 347/609 (56.98%), Query Frame = 0

Query: 6   QMLNNVTKCVAFLQSCADHQNLNKGKQLHSLMITYGFSPSPPSITSLINMYSKCGQMGEA 65
           Q+  N       L  CA    ++ G QLH L++  G         SL++MYSKCG+  +A
Sbjct: 234 QISPNAVTFDCVLSVCASKLLIDLGVQLHGLVVVSGVDFEGSIKNSLLSMYSKCGRFDDA 293

Query: 66  ILVFYDPCHERNVFAYNAIISGFVSNGLASKGFQFYKKMRLEGVMPDKYTFPCVVRTCCE 125
             +F       +   +N +ISG+V +GL  +   F+ +M   GV+PD  TF  ++ +  +
Sbjct: 294 SKLF-RMMSRADTVTWNCMISGYVQSGLMEESLTFFYEMISSGVLPDAITFSSLLPSVSK 353

Query: 126 VMEV---KKIHGCLLKMGLELDVFVGSALVNTYLKNGSMEDAQKVFGELSIRDVVLWNAM 185
              +   K+IH  +++  + LD+F+ SAL++ Y K   +  AQ +F + +  DVV++ AM
Sbjct: 354 FENLEYCKQIHCYIMRHSISLDIFLTSALIDAYFKCRGVSMAQNIFSQCNSVDVVVFTAM 413

Query: 186 INGYAKIGCLDEALEVFRRMHVKGVAPSRFTITGILSVFASRGDLDNGKTVHGIVMKMGY 245
           I+GY   G   ++LE+FR +    ++P+  T+  IL V      L  G+ +HG ++K G+
Sbjct: 414 ISGYLHNGLYIDSLEMFRWLVKVKISPNEITLVSILPVIGILLALKLGRELHGFIIKKGF 473

Query: 246 DSGVSVSNALIDMYGKCKHIGDALIIFEMINEKDIFSWNSIISVHEQCGDHDGTLRLFDK 305
           D+  ++  A+IDMY KC  +  A  IFE ++++DI SWNS+I+   Q  +    + +F +
Sbjct: 474 DNRCNIGCAVIDMYAKCGRMNLAYEIFERLSKRDIVSWNSMITRCAQSDNPSAAIDIFRQ 533

Query: 306 MLGSGILPDLVTITTVLPACSHLAALMHGREIHGYMIINGLGKDDENGAVDNLLVSNAVM 365
           M  SGI  D V+I+  L AC++L +   G+ IHG+MI + L  D        +   + ++
Sbjct: 534 MGVSGICYDCVSISAALSACANLPSESFGKAIHGFMIKHSLASD--------VYSESTLI 593

Query: 366 DMYAKCGSMNNALKIFDSMSKKDVASWNIMIMGYGMHGYALEALGMFSQMCE-AEFKPNE 425
           DMYAKCG++  A+ +F +M +K++ SWN +I   G HG   ++L +F +M E +  +P++
Sbjct: 594 DMYAKCGNLKAAMNVFKTMKEKNIVSWNSIIAACGNHGKLKDSLCLFHEMVEKSGIRPDQ 653

Query: 426 VTLVGVLSACNHAGFVSHGRLFLAQMESTFGVIPTIEHYTCVIDMLGRAGHLEDAYEIVQ 485
           +T + ++S+C H G V  G  F   M   +G+ P  EHY CV+D+ GRAG L +AYE V+
Sbjct: 654 ITFLEIISSCCHVGDVDEGVRFFRSMTEDYGIQPQQEHYACVVDLFGRAGRLTEAYETVK 713

Query: 486 KMPIQANPVVWRALLGACRLHGNAELAEIAARQVLQLEPEHCGSYVLMSNVYGVIGRYEE 545
            MP   +  VW  LLGACRLH N ELAE+A+ +++ L+P + G YVL+SN +     +E 
Sbjct: 714 SMPFPPDAGVWGTLLGACRLHKNVELAEVASSKLMDLDPSNSGYYVLISNAHANAREWES 773

Query: 546 VLEVRKTMKEQNVKKTPGCSWIELKDGVHVFRTGDRTHSE-------LNALTNQLCDIGF 604
           V +VR  MKE+ V+K PG SWIE+    H+F +GD  H E       LN+L  +L   G+
Sbjct: 774 VTKVRSLMKEREVQKIPGYSWIEINKRTHLFVSGDVNHPESSHIYSLLNSLLGELRLEGY 833

BLAST of CSPI04G18560 vs. ExPASy Swiss-Prot
Match: Q3E6Q1 (Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H40 PE=2 SV=1)

HSP 1 Score: 404.8 bits (1039), Expect = 1.7e-111
Identity = 213/595 (35.80%), Positives = 344/595 (57.82%), Query Frame = 0

Query: 18  LQSCADHQNLNKGKQLHSLMITYGFSPSPPSITSLINMYSKCGQMGEAILVFYDPCHERN 77
           L+ C D   L  GK++H L++  GFS    ++T L NMY+KC Q+ EA  VF D   ER+
Sbjct: 142 LKVCGDEAELRVGKEIHGLLVKSGFSLDLFAMTGLENMYAKCRQVNEARKVF-DRMPERD 201

Query: 78  VFAYNAIISGFVSNGLASKGFQFYKKMRLEGVMPDKYTFPCVVRTCCEVMEV---KKIHG 137
           + ++N I++G+  NG+A    +  K M  E + P   T   V+     +  +   K+IHG
Sbjct: 202 LVSWNTIVAGYSQNGMARMALEMVKSMCEENLKPSFITIVSVLPAVSALRLISVGKEIHG 261

Query: 138 CLLKMGLELDVFVGSALVNTYLKNGSMEDAQKVFGELSIRDVVLWNAMINGYAKIGCLDE 197
             ++ G +  V + +ALV+ Y K GS+E A+++F  +  R+VV WN+MI+ Y +     E
Sbjct: 262 YAMRSGFDSLVNISTALVDMYAKCGSLETARQLFDGMLERNVVSWNSMIDAYVQNENPKE 321

Query: 198 ALEVFRRMHVKGVAPSRFTITGILSVFASRGDLDNGKTVHGIVMKMGYDSGVSVSNALID 257
           A+ +F++M  +GV P+  ++ G L   A  GDL+ G+ +H + +++G D  VSV N+LI 
Sbjct: 322 AMLIFQKMLDEGVKPTDVSVMGALHACADLGDLERGRFIHKLSVELGLDRNVSVVNSLIS 381

Query: 258 MYGKCKHIGDALIIFEMINEKDIFSWNSIISVHEQCGDHDGTLRLFDKMLGSGILPDLVT 317
           MY KCK +  A  +F  +  + + SWN++I    Q G     L  F +M    + PD  T
Sbjct: 382 MYCKCKEVDTAASMFGKLQSRTLVSWNAMILGFAQNGRPIDALNYFSQMRSRTVKPDTFT 441

Query: 318 ITTVLPACSHLAALMHGREIHGYMIINGLGKDDENGAVDNLLVSNAVMDMYAKCGSMNNA 377
             +V+ A + L+   H + IHG ++ + L K        N+ V+ A++DMYAKCG++  A
Sbjct: 442 YVSVITAIAELSITHHAKWIHGVVMRSCLDK--------NVFVTTALVDMYAKCGAIMIA 501

Query: 378 LKIFDSMSKKDVASWNIMIMGYGMHGYALEALGMFSQMCEAEFKPNEVTLVGVLSACNHA 437
             IFD MS++ V +WN MI GYG HG+   AL +F +M +   KPN VT + V+SAC+H+
Sbjct: 502 RLIFDMMSERHVTTWNAMIDGYGTHGFGKAALELFEEMQKGTIKPNGVTFLSVISACSHS 561

Query: 438 GFVSHGRLFLAQMESTFGVIPTIEHYTCVIDMLGRAGHLEDAYEIVQKMPIQANPVVWRA 497
           G V  G      M+  + +  +++HY  ++D+LGRAG L +A++ + +MP++    V+ A
Sbjct: 562 GLVEAGLKCFYMMKENYSIELSMDHYGAMVDLLGRAGRLNEAWDFIMQMPVKPAVNVYGA 621

Query: 498 LLGACRLHGNAELAEIAARQVLQLEPEHCGSYVLMSNVYGVIGRYEEVLEVRKTMKEQNV 557
           +LGAC++H N   AE AA ++ +L P+  G +VL++N+Y     +E+V +VR +M  Q +
Sbjct: 622 MLGACQIHKNVNFAEKAAERLFELNPDDGGYHVLLANIYRAASMWEKVGQVRVSMLRQGL 681

Query: 558 KKTPGCSWIELKDGVHVFRTGDRTHSE-------LNALTNQLCDIGFILDEVLNL 603
           +KTPGCS +E+K+ VH F +G   H +       L  L   + + G++ D  L L
Sbjct: 682 RKTPGCSMVEIKNEVHSFFSGSTAHPDSKKIYAFLEKLICHIKEAGYVPDTNLVL 727

BLAST of CSPI04G18560 vs. ExPASy TrEMBL
Match: A0A0A0L3D2 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G414410 PE=4 SV=1)

HSP 1 Score: 1235.7 bits (3196), Expect = 0.0e+00
Identity = 603/603 (100.00%), Positives = 603/603 (100.00%), Query Frame = 0

Query: 1   MLSAIQMLNNVTKCVAFLQSCADHQNLNKGKQLHSLMITYGFSPSPPSITSLINMYSKCG 60
           MLSAIQMLNNVTKCVAFLQSCADHQNLNKGKQLHSLMITYGFSPSPPSITSLINMYSKCG
Sbjct: 1   MLSAIQMLNNVTKCVAFLQSCADHQNLNKGKQLHSLMITYGFSPSPPSITSLINMYSKCG 60

Query: 61  QMGEAILVFYDPCHERNVFAYNAIISGFVSNGLASKGFQFYKKMRLEGVMPDKYTFPCVV 120
           QMGEAILVFYDPCHERNVFAYNAIISGFVSNGLASKGFQFYKKMRLEGVMPDKYTFPCVV
Sbjct: 61  QMGEAILVFYDPCHERNVFAYNAIISGFVSNGLASKGFQFYKKMRLEGVMPDKYTFPCVV 120

Query: 121 RTCCEVMEVKKIHGCLLKMGLELDVFVGSALVNTYLKNGSMEDAQKVFGELSIRDVVLWN 180
           RTCCEVMEVKKIHGCLLKMGLELDVFVGSALVNTYLKNGSMEDAQKVFGELSIRDVVLWN
Sbjct: 121 RTCCEVMEVKKIHGCLLKMGLELDVFVGSALVNTYLKNGSMEDAQKVFGELSIRDVVLWN 180

Query: 181 AMINGYAKIGCLDEALEVFRRMHVKGVAPSRFTITGILSVFASRGDLDNGKTVHGIVMKM 240
           AMINGYAKIGCLDEALEVFRRMHVKGVAPSRFTITGILSVFASRGDLDNGKTVHGIVMKM
Sbjct: 181 AMINGYAKIGCLDEALEVFRRMHVKGVAPSRFTITGILSVFASRGDLDNGKTVHGIVMKM 240

Query: 241 GYDSGVSVSNALIDMYGKCKHIGDALIIFEMINEKDIFSWNSIISVHEQCGDHDGTLRLF 300
           GYDSGVSVSNALIDMYGKCKHIGDALIIFEMINEKDIFSWNSIISVHEQCGDHDGTLRLF
Sbjct: 241 GYDSGVSVSNALIDMYGKCKHIGDALIIFEMINEKDIFSWNSIISVHEQCGDHDGTLRLF 300

Query: 301 DKMLGSGILPDLVTITTVLPACSHLAALMHGREIHGYMIINGLGKDDENGAVDNLLVSNA 360
           DKMLGSGILPDLVTITTVLPACSHLAALMHGREIHGYMIINGLGKDDENGAVDNLLVSNA
Sbjct: 301 DKMLGSGILPDLVTITTVLPACSHLAALMHGREIHGYMIINGLGKDDENGAVDNLLVSNA 360

Query: 361 VMDMYAKCGSMNNALKIFDSMSKKDVASWNIMIMGYGMHGYALEALGMFSQMCEAEFKPN 420
           VMDMYAKCGSMNNALKIFDSMSKKDVASWNIMIMGYGMHGYALEALGMFSQMCEAEFKPN
Sbjct: 361 VMDMYAKCGSMNNALKIFDSMSKKDVASWNIMIMGYGMHGYALEALGMFSQMCEAEFKPN 420

Query: 421 EVTLVGVLSACNHAGFVSHGRLFLAQMESTFGVIPTIEHYTCVIDMLGRAGHLEDAYEIV 480
           EVTLVGVLSACNHAGFVSHGRLFLAQMESTFGVIPTIEHYTCVIDMLGRAGHLEDAYEIV
Sbjct: 421 EVTLVGVLSACNHAGFVSHGRLFLAQMESTFGVIPTIEHYTCVIDMLGRAGHLEDAYEIV 480

Query: 481 QKMPIQANPVVWRALLGACRLHGNAELAEIAARQVLQLEPEHCGSYVLMSNVYGVIGRYE 540
           QKMPIQANPVVWRALLGACRLHGNAELAEIAARQVLQLEPEHCGSYVLMSNVYGVIGRYE
Sbjct: 481 QKMPIQANPVVWRALLGACRLHGNAELAEIAARQVLQLEPEHCGSYVLMSNVYGVIGRYE 540

Query: 541 EVLEVRKTMKEQNVKKTPGCSWIELKDGVHVFRTGDRTHSELNALTNQLCDIGFILDEVL 600
           EVLEVRKTMKEQNVKKTPGCSWIELKDGVHVFRTGDRTHSELNALTNQLCDIGFILDEVL
Sbjct: 541 EVLEVRKTMKEQNVKKTPGCSWIELKDGVHVFRTGDRTHSELNALTNQLCDIGFILDEVL 600

Query: 601 NLY 604
           NLY
Sbjct: 601 NLY 603

BLAST of CSPI04G18560 vs. ExPASy TrEMBL
Match: A0A1S4E0R7 (pentatricopeptide repeat-containing protein At3g14730-like OS=Cucumis melo OX=3656 GN=LOC103495881 PE=4 SV=1)

HSP 1 Score: 1178.7 bits (3048), Expect = 0.0e+00
Identity = 568/602 (94.35%), Positives = 587/602 (97.51%), Query Frame = 0

Query: 2   LSAIQMLNNVTKCVAFLQSCADHQNLNKGKQLHSLMITYGFSPSPPSITSLINMYSKCGQ 61
           LSAIQML+NVTKC+AFLQSCADH+NLNKGKQ HSLMITYGFS SPPSITSLINMYSKCGQ
Sbjct: 56  LSAIQMLDNVTKCIAFLQSCADHKNLNKGKQFHSLMITYGFSLSPPSITSLINMYSKCGQ 115

Query: 62  MGEAILVFYDPCHERNVFAYNAIISGFVSNGLASKGFQFYKKMRLEGVMPDKYTFPCVVR 121
           MGEAILVFYDPCHERNVFAYNAIISGFV+NGLASKGFQFY+KMRLEGVMPDKYTFPCVVR
Sbjct: 116 MGEAILVFYDPCHERNVFAYNAIISGFVANGLASKGFQFYEKMRLEGVMPDKYTFPCVVR 175

Query: 122 TCCEVMEVKKIHGCLLKMGLELDVFVGSALVNTYLKNGSMEDAQKVFGELSIRDVVLWNA 181
           TCCEV EVKKIHGC LKMGLELDVFVGSALVNTYLKNGSMEDAQKVFGEL +RDVVLWNA
Sbjct: 176 TCCEVKEVKKIHGCSLKMGLELDVFVGSALVNTYLKNGSMEDAQKVFGELPMRDVVLWNA 235

Query: 182 MINGYAKIGCLDEALEVFRRMHVKGVAPSRFTITGILSVFASRGDLDNGKTVHGIVMKMG 241
           MINGYAKIGCLDEALEVFRRMHV+G+AP RFTITGILS+FASRGDLDNGKTVHGIV+KMG
Sbjct: 236 MINGYAKIGCLDEALEVFRRMHVEGIAPGRFTITGILSIFASRGDLDNGKTVHGIVVKMG 295

Query: 242 YDSGVSVSNALIDMYGKCKHIGDALIIFEMINEKDIFSWNSIISVHEQCGDHDGTLRLFD 301
           YDSGV+VSNALIDMYGKCKHIGDALIIFEMINEKDIFSWNSIISVHEQCGDH GTLRLFD
Sbjct: 296 YDSGVAVSNALIDMYGKCKHIGDALIIFEMINEKDIFSWNSIISVHEQCGDHHGTLRLFD 355

Query: 302 KMLGSGILPDLVTITTVLPACSHLAALMHGREIHGYMIINGLGKDDENGAVDNLLVSNAV 361
           KMLGSGILPDLVTITTVLPACSHLAALM GREIHGYMIING GKDDENGA+D+L VSNAV
Sbjct: 356 KMLGSGILPDLVTITTVLPACSHLAALMRGREIHGYMIINGFGKDDENGAIDDLHVSNAV 415

Query: 362 MDMYAKCGSMNNALKIFDSMSKKDVASWNIMIMGYGMHGYALEALGMFSQMCEAEFKPNE 421
           MDMYAKCGSMNNALKIFDSMS KDVASWNIMIMGYGMHGYALEAL MFS+MCEAEFKP+E
Sbjct: 416 MDMYAKCGSMNNALKIFDSMSNKDVASWNIMIMGYGMHGYALEALDMFSRMCEAEFKPDE 475

Query: 422 VTLVGVLSACNHAGFVSHGRLFLAQMESTFGVIPTIEHYTCVIDMLGRAGHLEDAYEIVQ 481
           VTLVGVLSACNHAGFVS GRLF AQMESTFGVIPTIEHYTCVIDMLGRAGHLEDAYE+ Q
Sbjct: 476 VTLVGVLSACNHAGFVSQGRLFFAQMESTFGVIPTIEHYTCVIDMLGRAGHLEDAYEVAQ 535

Query: 482 KMPIQANPVVWRALLGACRLHGNAELAEIAARQVLQLEPEHCGSYVLMSNVYGVIGRYEE 541
           KMPIQANPVVWRALLGACRLHGNAELAE+AARQVLQLEPEHCGSYVLMSNVYGVIGRYEE
Sbjct: 536 KMPIQANPVVWRALLGACRLHGNAELAEVAARQVLQLEPEHCGSYVLMSNVYGVIGRYEE 595

Query: 542 VLEVRKTMKEQNVKKTPGCSWIELKDGVHVFRTGDRTHSELNALTNQLCDIGFILDEVLN 601
           VLEVRKTMKEQNVKKTPGCSWIELKDG+HVFRTGDRTHSELNALTNQLCDIGF+LDEVLN
Sbjct: 596 VLEVRKTMKEQNVKKTPGCSWIELKDGMHVFRTGDRTHSELNALTNQLCDIGFLLDEVLN 655

Query: 602 LY 604
           LY
Sbjct: 656 LY 657

BLAST of CSPI04G18560 vs. ExPASy TrEMBL
Match: A0A5A7TK17 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold2208G00170 PE=4 SV=1)

HSP 1 Score: 1171.0 bits (3028), Expect = 0.0e+00
Identity = 563/597 (94.30%), Positives = 582/597 (97.49%), Query Frame = 0

Query: 7   MLNNVTKCVAFLQSCADHQNLNKGKQLHSLMITYGFSPSPPSITSLINMYSKCGQMGEAI 66
           ML+NVTKC+AFLQSCADH+NLNKGKQ HSLMITYGFS SPPSITSLINMYSKCGQMGEAI
Sbjct: 1   MLDNVTKCIAFLQSCADHKNLNKGKQFHSLMITYGFSLSPPSITSLINMYSKCGQMGEAI 60

Query: 67  LVFYDPCHERNVFAYNAIISGFVSNGLASKGFQFYKKMRLEGVMPDKYTFPCVVRTCCEV 126
           LVFYDPCHERNVFAYNAIISGFV+NGLASKGFQFY+KMRLEGVMPDKYTFPCVVRTCCEV
Sbjct: 61  LVFYDPCHERNVFAYNAIISGFVANGLASKGFQFYEKMRLEGVMPDKYTFPCVVRTCCEV 120

Query: 127 MEVKKIHGCLLKMGLELDVFVGSALVNTYLKNGSMEDAQKVFGELSIRDVVLWNAMINGY 186
            EVKKIHGC LKMGLELDVFVGSALVNTYLKNGSMEDAQKVFGEL +RDVVLWNAMINGY
Sbjct: 121 KEVKKIHGCSLKMGLELDVFVGSALVNTYLKNGSMEDAQKVFGELPMRDVVLWNAMINGY 180

Query: 187 AKIGCLDEALEVFRRMHVKGVAPSRFTITGILSVFASRGDLDNGKTVHGIVMKMGYDSGV 246
           AKIGCLDEALEVFRRMHV+G+AP RFTITGILS+FASRGDLDNGKTVHGIV+KMGYDSGV
Sbjct: 181 AKIGCLDEALEVFRRMHVEGIAPGRFTITGILSIFASRGDLDNGKTVHGIVVKMGYDSGV 240

Query: 247 SVSNALIDMYGKCKHIGDALIIFEMINEKDIFSWNSIISVHEQCGDHDGTLRLFDKMLGS 306
           +VSNALIDMYGKCKHIGDALIIFEMINEKDIFSWNSIISVHEQCGDH GTLRLFDKMLGS
Sbjct: 241 AVSNALIDMYGKCKHIGDALIIFEMINEKDIFSWNSIISVHEQCGDHHGTLRLFDKMLGS 300

Query: 307 GILPDLVTITTVLPACSHLAALMHGREIHGYMIINGLGKDDENGAVDNLLVSNAVMDMYA 366
           GILPDLVTITTVLPACSHLAALM GREIHGYMIING GKDDENGA+D+L VSNAVMDMYA
Sbjct: 301 GILPDLVTITTVLPACSHLAALMRGREIHGYMIINGFGKDDENGAIDDLHVSNAVMDMYA 360

Query: 367 KCGSMNNALKIFDSMSKKDVASWNIMIMGYGMHGYALEALGMFSQMCEAEFKPNEVTLVG 426
           KCGSMNNALKIFDSMS KDVASWNIMIMGYGMHGYALEAL MFS+MCEAEFKP+EVTLVG
Sbjct: 361 KCGSMNNALKIFDSMSNKDVASWNIMIMGYGMHGYALEALDMFSRMCEAEFKPDEVTLVG 420

Query: 427 VLSACNHAGFVSHGRLFLAQMESTFGVIPTIEHYTCVIDMLGRAGHLEDAYEIVQKMPIQ 486
           VLSACNHAGFVS GRLF AQMESTFGVIPTIEHYTCVIDMLGRAGHLEDAYE+ QKMPIQ
Sbjct: 421 VLSACNHAGFVSQGRLFFAQMESTFGVIPTIEHYTCVIDMLGRAGHLEDAYEVAQKMPIQ 480

Query: 487 ANPVVWRALLGACRLHGNAELAEIAARQVLQLEPEHCGSYVLMSNVYGVIGRYEEVLEVR 546
           ANPVVWRALLGACRLHGNAELAE+AARQVLQLEPEHCGSYVLMSNVYGVIGRYEEVLEVR
Sbjct: 481 ANPVVWRALLGACRLHGNAELAEVAARQVLQLEPEHCGSYVLMSNVYGVIGRYEEVLEVR 540

Query: 547 KTMKEQNVKKTPGCSWIELKDGVHVFRTGDRTHSELNALTNQLCDIGFILDEVLNLY 604
           KTMKEQNVKKTPGCSWIELKDG+HVFRTGDRTHSELNALTNQLCDIGF+LDEVLNLY
Sbjct: 541 KTMKEQNVKKTPGCSWIELKDGMHVFRTGDRTHSELNALTNQLCDIGFLLDEVLNLY 597

BLAST of CSPI04G18560 vs. ExPASy TrEMBL
Match: A0A6J1HME0 (pentatricopeptide repeat-containing protein At3g14730-like isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111465952 PE=4 SV=1)

HSP 1 Score: 1086.6 bits (2809), Expect = 0.0e+00
Identity = 511/601 (85.02%), Positives = 565/601 (94.01%), Query Frame = 0

Query: 3   SAIQMLNNVTKCVAFLQSCADHQNLNKGKQLHSLMITYGFSPSPPSITSLINMYSKCGQM 62
           S  ++LNNVT C+AFLQSCA+ +NLNKGKQLHS+MITYGFS SP SITSLINMYSKCG+M
Sbjct: 55  SDFRLLNNVTTCIAFLQSCAESKNLNKGKQLHSVMITYGFSHSPSSITSLINMYSKCGRM 114

Query: 63  GEAILVFYDPCHERNVFAYNAIISGFVSNGLASKGFQFYKKMRLEGVMPDKYTFPCVVRT 122
            EA+LVF+DPC+E NVFAYNAIISGFV+NGLAS GFQFYK+MRLEGVMPDKYTFPCVVR+
Sbjct: 115 EEAVLVFHDPCYEPNVFAYNAIISGFVANGLASIGFQFYKQMRLEGVMPDKYTFPCVVRS 174

Query: 123 CCEVMEVKKIHGCLLKMGLELDVFVGSALVNTYLKNGSMEDAQKVFGELSIRDVVLWNAM 182
           CCEVMEVKKIHGCL KMGLELD+FVGSALVNTYLK GSMEDAQ+VF EL IRDVVLWNAM
Sbjct: 175 CCEVMEVKKIHGCLFKMGLELDLFVGSALVNTYLKLGSMEDAQEVFEELPIRDVVLWNAM 234

Query: 183 INGYAKIGCLDEALEVFRRMHVKGVAPSRFTITGILSVFASRGDLDNGKTVHGIVMKMGY 242
           INGYA+IGCLDEALE+F+RMH++GV+PSRFTITGILS+FA +G LDNG+TVHGIVMKMGY
Sbjct: 235 INGYAQIGCLDEALEIFKRMHIEGVSPSRFTITGILSIFALKGHLDNGRTVHGIVMKMGY 294

Query: 243 DSGVSVSNALIDMYGKCKHIGDALIIFEMINEKDIFSWNSIISVHEQCGDHDGTLRLFDK 302
           DSGV+VSNALIDMYGKCKHIGDAL++FE +NEKDIFSWNSIISVHEQCGDHDG LRLFDK
Sbjct: 295 DSGVAVSNALIDMYGKCKHIGDALMVFETMNEKDIFSWNSIISVHEQCGDHDGALRLFDK 354

Query: 303 MLGSGILPDLVTITTVLPACSHLAALMHGREIHGYMIINGLGKDDENGAVDNLLVSNAVM 362
           MLGSG LPDLVT+TT+LPACSHLAALMHGREIHGYMI+NGLG+D +NG +D+LLV+NAVM
Sbjct: 355 MLGSGFLPDLVTVTTILPACSHLAALMHGREIHGYMIVNGLGRDGDNGVIDDLLVNNAVM 414

Query: 363 DMYAKCGSMNNALKIFDSMSKKDVASWNIMIMGYGMHGYALEALGMFSQMCEAEFKPNEV 422
           DMYAKCGSMNNA K+F+SM+ KDVASWNIMIMGYGMHGY ++AL MFS MCEA+ KP+EV
Sbjct: 415 DMYAKCGSMNNAQKVFNSMTNKDVASWNIMIMGYGMHGYGMDALDMFSHMCEAKIKPDEV 474

Query: 423 TLVGVLSACNHAGFVSHGRLFLAQMESTFGVIPTIEHYTCVIDMLGRAGHLEDAYEIVQK 482
           T VGVLSACNHAGFV  GR+FLAQME  FGVIPTIEHYTCVIDMLGRAGHLEDAY++ Q 
Sbjct: 475 TFVGVLSACNHAGFVCQGRMFLAQMEHDFGVIPTIEHYTCVIDMLGRAGHLEDAYDLAQT 534

Query: 483 MPIQANPVVWRALLGACRLHGNAELAEIAARQVLQLEPEHCGSYVLMSNVYGVIGRYEEV 542
           MPIQANPVVWRALLGACRLHGNAELAEIAA++V+QL+PEHCGSYVLMSNVYGV+GRYEEV
Sbjct: 535 MPIQANPVVWRALLGACRLHGNAELAEIAAQKVMQLDPEHCGSYVLMSNVYGVVGRYEEV 594

Query: 543 LEVRKTMKEQNVKKTPGCSWIELKDGVHVFRTGDRTHSELNALTNQLCDIGFILDEVLNL 602
           LEVR TMKEQ+V+KTPGCSWIELKDGVHVF TGDRTH ELNALT+QLCDIGFILDEVLNL
Sbjct: 595 LEVRNTMKEQHVRKTPGCSWIELKDGVHVFLTGDRTHLELNALTSQLCDIGFILDEVLNL 654

Query: 603 Y 604
           Y
Sbjct: 655 Y 655

BLAST of CSPI04G18560 vs. ExPASy TrEMBL
Match: A0A6J1HNR1 (pentatricopeptide repeat-containing protein At3g14730-like isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111465952 PE=4 SV=1)

HSP 1 Score: 1086.6 bits (2809), Expect = 0.0e+00
Identity = 511/601 (85.02%), Positives = 565/601 (94.01%), Query Frame = 0

Query: 3   SAIQMLNNVTKCVAFLQSCADHQNLNKGKQLHSLMITYGFSPSPPSITSLINMYSKCGQM 62
           S  ++LNNVT C+AFLQSCA+ +NLNKGKQLHS+MITYGFS SP SITSLINMYSKCG+M
Sbjct: 61  SDFRLLNNVTTCIAFLQSCAESKNLNKGKQLHSVMITYGFSHSPSSITSLINMYSKCGRM 120

Query: 63  GEAILVFYDPCHERNVFAYNAIISGFVSNGLASKGFQFYKKMRLEGVMPDKYTFPCVVRT 122
            EA+LVF+DPC+E NVFAYNAIISGFV+NGLAS GFQFYK+MRLEGVMPDKYTFPCVVR+
Sbjct: 121 EEAVLVFHDPCYEPNVFAYNAIISGFVANGLASIGFQFYKQMRLEGVMPDKYTFPCVVRS 180

Query: 123 CCEVMEVKKIHGCLLKMGLELDVFVGSALVNTYLKNGSMEDAQKVFGELSIRDVVLWNAM 182
           CCEVMEVKKIHGCL KMGLELD+FVGSALVNTYLK GSMEDAQ+VF EL IRDVVLWNAM
Sbjct: 181 CCEVMEVKKIHGCLFKMGLELDLFVGSALVNTYLKLGSMEDAQEVFEELPIRDVVLWNAM 240

Query: 183 INGYAKIGCLDEALEVFRRMHVKGVAPSRFTITGILSVFASRGDLDNGKTVHGIVMKMGY 242
           INGYA+IGCLDEALE+F+RMH++GV+PSRFTITGILS+FA +G LDNG+TVHGIVMKMGY
Sbjct: 241 INGYAQIGCLDEALEIFKRMHIEGVSPSRFTITGILSIFALKGHLDNGRTVHGIVMKMGY 300

Query: 243 DSGVSVSNALIDMYGKCKHIGDALIIFEMINEKDIFSWNSIISVHEQCGDHDGTLRLFDK 302
           DSGV+VSNALIDMYGKCKHIGDAL++FE +NEKDIFSWNSIISVHEQCGDHDG LRLFDK
Sbjct: 301 DSGVAVSNALIDMYGKCKHIGDALMVFETMNEKDIFSWNSIISVHEQCGDHDGALRLFDK 360

Query: 303 MLGSGILPDLVTITTVLPACSHLAALMHGREIHGYMIINGLGKDDENGAVDNLLVSNAVM 362
           MLGSG LPDLVT+TT+LPACSHLAALMHGREIHGYMI+NGLG+D +NG +D+LLV+NAVM
Sbjct: 361 MLGSGFLPDLVTVTTILPACSHLAALMHGREIHGYMIVNGLGRDGDNGVIDDLLVNNAVM 420

Query: 363 DMYAKCGSMNNALKIFDSMSKKDVASWNIMIMGYGMHGYALEALGMFSQMCEAEFKPNEV 422
           DMYAKCGSMNNA K+F+SM+ KDVASWNIMIMGYGMHGY ++AL MFS MCEA+ KP+EV
Sbjct: 421 DMYAKCGSMNNAQKVFNSMTNKDVASWNIMIMGYGMHGYGMDALDMFSHMCEAKIKPDEV 480

Query: 423 TLVGVLSACNHAGFVSHGRLFLAQMESTFGVIPTIEHYTCVIDMLGRAGHLEDAYEIVQK 482
           T VGVLSACNHAGFV  GR+FLAQME  FGVIPTIEHYTCVIDMLGRAGHLEDAY++ Q 
Sbjct: 481 TFVGVLSACNHAGFVCQGRMFLAQMEHDFGVIPTIEHYTCVIDMLGRAGHLEDAYDLAQT 540

Query: 483 MPIQANPVVWRALLGACRLHGNAELAEIAARQVLQLEPEHCGSYVLMSNVYGVIGRYEEV 542
           MPIQANPVVWRALLGACRLHGNAELAEIAA++V+QL+PEHCGSYVLMSNVYGV+GRYEEV
Sbjct: 541 MPIQANPVVWRALLGACRLHGNAELAEIAAQKVMQLDPEHCGSYVLMSNVYGVVGRYEEV 600

Query: 543 LEVRKTMKEQNVKKTPGCSWIELKDGVHVFRTGDRTHSELNALTNQLCDIGFILDEVLNL 602
           LEVR TMKEQ+V+KTPGCSWIELKDGVHVF TGDRTH ELNALT+QLCDIGFILDEVLNL
Sbjct: 601 LEVRNTMKEQHVRKTPGCSWIELKDGVHVFLTGDRTHLELNALTSQLCDIGFILDEVLNL 660

Query: 603 Y 604
           Y
Sbjct: 661 Y 661

BLAST of CSPI04G18560 vs. NCBI nr
Match: XP_004150613.2 (pentatricopeptide repeat-containing protein At3g14730 [Cucumis sativus] >XP_031740376.1 pentatricopeptide repeat-containing protein At3g14730 [Cucumis sativus] >XP_031740377.1 pentatricopeptide repeat-containing protein At3g14730 [Cucumis sativus] >XP_031740378.1 pentatricopeptide repeat-containing protein At3g14730 [Cucumis sativus] >XP_031740379.1 pentatricopeptide repeat-containing protein At3g14730 [Cucumis sativus] >XP_031740380.1 pentatricopeptide repeat-containing protein At3g14730 [Cucumis sativus] >KAE8649651.1 hypothetical protein Csa_012609 [Cucumis sativus])

HSP 1 Score: 1235.7 bits (3196), Expect = 0.0e+00
Identity = 603/603 (100.00%), Positives = 603/603 (100.00%), Query Frame = 0

Query: 1   MLSAIQMLNNVTKCVAFLQSCADHQNLNKGKQLHSLMITYGFSPSPPSITSLINMYSKCG 60
           MLSAIQMLNNVTKCVAFLQSCADHQNLNKGKQLHSLMITYGFSPSPPSITSLINMYSKCG
Sbjct: 55  MLSAIQMLNNVTKCVAFLQSCADHQNLNKGKQLHSLMITYGFSPSPPSITSLINMYSKCG 114

Query: 61  QMGEAILVFYDPCHERNVFAYNAIISGFVSNGLASKGFQFYKKMRLEGVMPDKYTFPCVV 120
           QMGEAILVFYDPCHERNVFAYNAIISGFVSNGLASKGFQFYKKMRLEGVMPDKYTFPCVV
Sbjct: 115 QMGEAILVFYDPCHERNVFAYNAIISGFVSNGLASKGFQFYKKMRLEGVMPDKYTFPCVV 174

Query: 121 RTCCEVMEVKKIHGCLLKMGLELDVFVGSALVNTYLKNGSMEDAQKVFGELSIRDVVLWN 180
           RTCCEVMEVKKIHGCLLKMGLELDVFVGSALVNTYLKNGSMEDAQKVFGELSIRDVVLWN
Sbjct: 175 RTCCEVMEVKKIHGCLLKMGLELDVFVGSALVNTYLKNGSMEDAQKVFGELSIRDVVLWN 234

Query: 181 AMINGYAKIGCLDEALEVFRRMHVKGVAPSRFTITGILSVFASRGDLDNGKTVHGIVMKM 240
           AMINGYAKIGCLDEALEVFRRMHVKGVAPSRFTITGILSVFASRGDLDNGKTVHGIVMKM
Sbjct: 235 AMINGYAKIGCLDEALEVFRRMHVKGVAPSRFTITGILSVFASRGDLDNGKTVHGIVMKM 294

Query: 241 GYDSGVSVSNALIDMYGKCKHIGDALIIFEMINEKDIFSWNSIISVHEQCGDHDGTLRLF 300
           GYDSGVSVSNALIDMYGKCKHIGDALIIFEMINEKDIFSWNSIISVHEQCGDHDGTLRLF
Sbjct: 295 GYDSGVSVSNALIDMYGKCKHIGDALIIFEMINEKDIFSWNSIISVHEQCGDHDGTLRLF 354

Query: 301 DKMLGSGILPDLVTITTVLPACSHLAALMHGREIHGYMIINGLGKDDENGAVDNLLVSNA 360
           DKMLGSGILPDLVTITTVLPACSHLAALMHGREIHGYMIINGLGKDDENGAVDNLLVSNA
Sbjct: 355 DKMLGSGILPDLVTITTVLPACSHLAALMHGREIHGYMIINGLGKDDENGAVDNLLVSNA 414

Query: 361 VMDMYAKCGSMNNALKIFDSMSKKDVASWNIMIMGYGMHGYALEALGMFSQMCEAEFKPN 420
           VMDMYAKCGSMNNALKIFDSMSKKDVASWNIMIMGYGMHGYALEALGMFSQMCEAEFKPN
Sbjct: 415 VMDMYAKCGSMNNALKIFDSMSKKDVASWNIMIMGYGMHGYALEALGMFSQMCEAEFKPN 474

Query: 421 EVTLVGVLSACNHAGFVSHGRLFLAQMESTFGVIPTIEHYTCVIDMLGRAGHLEDAYEIV 480
           EVTLVGVLSACNHAGFVSHGRLFLAQMESTFGVIPTIEHYTCVIDMLGRAGHLEDAYEIV
Sbjct: 475 EVTLVGVLSACNHAGFVSHGRLFLAQMESTFGVIPTIEHYTCVIDMLGRAGHLEDAYEIV 534

Query: 481 QKMPIQANPVVWRALLGACRLHGNAELAEIAARQVLQLEPEHCGSYVLMSNVYGVIGRYE 540
           QKMPIQANPVVWRALLGACRLHGNAELAEIAARQVLQLEPEHCGSYVLMSNVYGVIGRYE
Sbjct: 535 QKMPIQANPVVWRALLGACRLHGNAELAEIAARQVLQLEPEHCGSYVLMSNVYGVIGRYE 594

Query: 541 EVLEVRKTMKEQNVKKTPGCSWIELKDGVHVFRTGDRTHSELNALTNQLCDIGFILDEVL 600
           EVLEVRKTMKEQNVKKTPGCSWIELKDGVHVFRTGDRTHSELNALTNQLCDIGFILDEVL
Sbjct: 595 EVLEVRKTMKEQNVKKTPGCSWIELKDGVHVFRTGDRTHSELNALTNQLCDIGFILDEVL 654

Query: 601 NLY 604
           NLY
Sbjct: 655 NLY 657
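
Each HSP statistics line reports identity and positives as match counts over the aligned length, with the percentage in parentheses. The following is a minimal Python sketch, not part of the original record, that re-derives those percentages from one such line. The function name `parse_hsp_stats` and the regular expression are illustrative assumptions; the pattern accepts both the "Postives" and "Positives" spellings of the label.

```python
import re

# Matches e.g. "Identity = 319/585 (54.53%)". The label is accepted as
# either "Postives" or "Positives" (assumption: both spellings may occur).
_STAT = re.compile(r"(Identity|Posi?tives)\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)")


def parse_hsp_stats(line):
    """Return {name: (matches, aligned_length, reported_pct, recomputed_pct)}."""
    stats = {}
    for label, hits, length, pct in _STAT.findall(line):
        hits, length = int(hits), int(length)
        name = "identity" if label == "Identity" else "positives"
        # Percentage = matching columns / aligned columns, rounded to 2 places.
        stats[name] = (hits, length, float(pct), round(100.0 * hits / length, 2))
    return stats


if __name__ == "__main__":
    # Statistics line of the AT3G14730.1 hit listed in this record.
    line = "Identity = 319/585 (54.53%), Postives = 421/585 (71.97%), Query Frame = 0"
    for name, values in parse_hsp_stats(line).items():
        print(name, values)
```

Comparing the reported and recomputed values is a quick consistency check when these lines are copied or re-parsed.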

BLAST of CSPI04G18560 vs. NCBI nr
Match: XP_008455782.1 (PREDICTED: pentatricopeptide repeat-containing protein At3g14730-like [Cucumis melo] >XP_008455783.1 PREDICTED: pentatricopeptide repeat-containing protein At3g14730-like [Cucumis melo] >XP_008455784.1 PREDICTED: pentatricopeptide repeat-containing protein At3g14730-like [Cucumis melo] >XP_016901821.1 PREDICTED: pentatricopeptide repeat-containing protein At3g14730-like [Cucumis melo])

HSP 1 Score: 1178.7 bits (3048), Expect = 0.0e+00
Identity = 568/602 (94.35%), Positives = 587/602 (97.51%), Query Frame = 0

Query: 2   LSAIQMLNNVTKCVAFLQSCADHQNLNKGKQLHSLMITYGFSPSPPSITSLINMYSKCGQ 61
           LSAIQML+NVTKC+AFLQSCADH+NLNKGKQ HSLMITYGFS SPPSITSLINMYSKCGQ
Sbjct: 56  LSAIQMLDNVTKCIAFLQSCADHKNLNKGKQFHSLMITYGFSLSPPSITSLINMYSKCGQ 115

Query: 62  MGEAILVFYDPCHERNVFAYNAIISGFVSNGLASKGFQFYKKMRLEGVMPDKYTFPCVVR 121
           MGEAILVFYDPCHERNVFAYNAIISGFV+NGLASKGFQFY+KMRLEGVMPDKYTFPCVVR
Sbjct: 116 MGEAILVFYDPCHERNVFAYNAIISGFVANGLASKGFQFYEKMRLEGVMPDKYTFPCVVR 175

Query: 122 TCCEVMEVKKIHGCLLKMGLELDVFVGSALVNTYLKNGSMEDAQKVFGELSIRDVVLWNA 181
           TCCEV EVKKIHGC LKMGLELDVFVGSALVNTYLKNGSMEDAQKVFGEL +RDVVLWNA
Sbjct: 176 TCCEVKEVKKIHGCSLKMGLELDVFVGSALVNTYLKNGSMEDAQKVFGELPMRDVVLWNA 235

Query: 182 MINGYAKIGCLDEALEVFRRMHVKGVAPSRFTITGILSVFASRGDLDNGKTVHGIVMKMG 241
           MINGYAKIGCLDEALEVFRRMHV+G+AP RFTITGILS+FASRGDLDNGKTVHGIV+KMG
Sbjct: 236 MINGYAKIGCLDEALEVFRRMHVEGIAPGRFTITGILSIFASRGDLDNGKTVHGIVVKMG 295

Query: 242 YDSGVSVSNALIDMYGKCKHIGDALIIFEMINEKDIFSWNSIISVHEQCGDHDGTLRLFD 301
           YDSGV+VSNALIDMYGKCKHIGDALIIFEMINEKDIFSWNSIISVHEQCGDH GTLRLFD
Sbjct: 296 YDSGVAVSNALIDMYGKCKHIGDALIIFEMINEKDIFSWNSIISVHEQCGDHHGTLRLFD 355

Query: 302 KMLGSGILPDLVTITTVLPACSHLAALMHGREIHGYMIINGLGKDDENGAVDNLLVSNAV 361
           KMLGSGILPDLVTITTVLPACSHLAALM GREIHGYMIING GKDDENGA+D+L VSNAV
Sbjct: 356 KMLGSGILPDLVTITTVLPACSHLAALMRGREIHGYMIINGFGKDDENGAIDDLHVSNAV 415

Query: 362 MDMYAKCGSMNNALKIFDSMSKKDVASWNIMIMGYGMHGYALEALGMFSQMCEAEFKPNE 421
           MDMYAKCGSMNNALKIFDSMS KDVASWNIMIMGYGMHGYALEAL MFS+MCEAEFKP+E
Sbjct: 416 MDMYAKCGSMNNALKIFDSMSNKDVASWNIMIMGYGMHGYALEALDMFSRMCEAEFKPDE 475

Query: 422 VTLVGVLSACNHAGFVSHGRLFLAQMESTFGVIPTIEHYTCVIDMLGRAGHLEDAYEIVQ 481
           VTLVGVLSACNHAGFVS GRLF AQMESTFGVIPTIEHYTCVIDMLGRAGHLEDAYE+ Q
Sbjct: 476 VTLVGVLSACNHAGFVSQGRLFFAQMESTFGVIPTIEHYTCVIDMLGRAGHLEDAYEVAQ 535

Query: 482 KMPIQANPVVWRALLGACRLHGNAELAEIAARQVLQLEPEHCGSYVLMSNVYGVIGRYEE 541
           KMPIQANPVVWRALLGACRLHGNAELAE+AARQVLQLEPEHCGSYVLMSNVYGVIGRYEE
Sbjct: 536 KMPIQANPVVWRALLGACRLHGNAELAEVAARQVLQLEPEHCGSYVLMSNVYGVIGRYEE 595

Query: 542 VLEVRKTMKEQNVKKTPGCSWIELKDGVHVFRTGDRTHSELNALTNQLCDIGFILDEVLN 601
           VLEVRKTMKEQNVKKTPGCSWIELKDG+HVFRTGDRTHSELNALTNQLCDIGF+LDEVLN
Sbjct: 596 VLEVRKTMKEQNVKKTPGCSWIELKDGMHVFRTGDRTHSELNALTNQLCDIGFLLDEVLN 655

Query: 602 LY 604
           LY
Sbjct: 656 LY 657

BLAST of CSPI04G18560 vs. NCBI nr
Match: KAA0043600.1 (pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] >TYK29760.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 1171.0 bits (3028), Expect = 0.0e+00
Identity = 563/597 (94.30%), Positives = 582/597 (97.49%), Query Frame = 0

Query: 7   MLNNVTKCVAFLQSCADHQNLNKGKQLHSLMITYGFSPSPPSITSLINMYSKCGQMGEAI 66
           ML+NVTKC+AFLQSCADH+NLNKGKQ HSLMITYGFS SPPSITSLINMYSKCGQMGEAI
Sbjct: 1   MLDNVTKCIAFLQSCADHKNLNKGKQFHSLMITYGFSLSPPSITSLINMYSKCGQMGEAI 60

Query: 67  LVFYDPCHERNVFAYNAIISGFVSNGLASKGFQFYKKMRLEGVMPDKYTFPCVVRTCCEV 126
           LVFYDPCHERNVFAYNAIISGFV+NGLASKGFQFY+KMRLEGVMPDKYTFPCVVRTCCEV
Sbjct: 61  LVFYDPCHERNVFAYNAIISGFVANGLASKGFQFYEKMRLEGVMPDKYTFPCVVRTCCEV 120

Query: 127 MEVKKIHGCLLKMGLELDVFVGSALVNTYLKNGSMEDAQKVFGELSIRDVVLWNAMINGY 186
            EVKKIHGC LKMGLELDVFVGSALVNTYLKNGSMEDAQKVFGEL +RDVVLWNAMINGY
Sbjct: 121 KEVKKIHGCSLKMGLELDVFVGSALVNTYLKNGSMEDAQKVFGELPMRDVVLWNAMINGY 180

Query: 187 AKIGCLDEALEVFRRMHVKGVAPSRFTITGILSVFASRGDLDNGKTVHGIVMKMGYDSGV 246
           AKIGCLDEALEVFRRMHV+G+AP RFTITGILS+FASRGDLDNGKTVHGIV+KMGYDSGV
Sbjct: 181 AKIGCLDEALEVFRRMHVEGIAPGRFTITGILSIFASRGDLDNGKTVHGIVVKMGYDSGV 240

Query: 247 SVSNALIDMYGKCKHIGDALIIFEMINEKDIFSWNSIISVHEQCGDHDGTLRLFDKMLGS 306
           +VSNALIDMYGKCKHIGDALIIFEMINEKDIFSWNSIISVHEQCGDH GTLRLFDKMLGS
Sbjct: 241 AVSNALIDMYGKCKHIGDALIIFEMINEKDIFSWNSIISVHEQCGDHHGTLRLFDKMLGS 300

Query: 307 GILPDLVTITTVLPACSHLAALMHGREIHGYMIINGLGKDDENGAVDNLLVSNAVMDMYA 366
           GILPDLVTITTVLPACSHLAALM GREIHGYMIING GKDDENGA+D+L VSNAVMDMYA
Sbjct: 301 GILPDLVTITTVLPACSHLAALMRGREIHGYMIINGFGKDDENGAIDDLHVSNAVMDMYA 360

Query: 367 KCGSMNNALKIFDSMSKKDVASWNIMIMGYGMHGYALEALGMFSQMCEAEFKPNEVTLVG 426
           KCGSMNNALKIFDSMS KDVASWNIMIMGYGMHGYALEAL MFS+MCEAEFKP+EVTLVG
Sbjct: 361 KCGSMNNALKIFDSMSNKDVASWNIMIMGYGMHGYALEALDMFSRMCEAEFKPDEVTLVG 420

Query: 427 VLSACNHAGFVSHGRLFLAQMESTFGVIPTIEHYTCVIDMLGRAGHLEDAYEIVQKMPIQ 486
           VLSACNHAGFVS GRLF AQMESTFGVIPTIEHYTCVIDMLGRAGHLEDAYE+ QKMPIQ
Sbjct: 421 VLSACNHAGFVSQGRLFFAQMESTFGVIPTIEHYTCVIDMLGRAGHLEDAYEVAQKMPIQ 480

Query: 487 ANPVVWRALLGACRLHGNAELAEIAARQVLQLEPEHCGSYVLMSNVYGVIGRYEEVLEVR 546
           ANPVVWRALLGACRLHGNAELAE+AARQVLQLEPEHCGSYVLMSNVYGVIGRYEEVLEVR
Sbjct: 481 ANPVVWRALLGACRLHGNAELAEVAARQVLQLEPEHCGSYVLMSNVYGVIGRYEEVLEVR 540

Query: 547 KTMKEQNVKKTPGCSWIELKDGVHVFRTGDRTHSELNALTNQLCDIGFILDEVLNLY 604
           KTMKEQNVKKTPGCSWIELKDG+HVFRTGDRTHSELNALTNQLCDIGF+LDEVLNLY
Sbjct: 541 KTMKEQNVKKTPGCSWIELKDGMHVFRTGDRTHSELNALTNQLCDIGFLLDEVLNLY 597

BLAST of CSPI04G18560 vs. NCBI nr
Match: XP_038881250.1 (pentatricopeptide repeat-containing protein At3g14730-like [Benincasa hispida])

HSP 1 Score: 1142.9 bits (2955), Expect = 0.0e+00
Identity = 548/602 (91.03%), Positives = 577/602 (95.85%), Query Frame = 0

Query: 2   LSAIQMLNNVTKCVAFLQSCADHQNLNKGKQLHSLMITYGFSPSPPSITSLINMYSKCGQ 61
           LS  Q+L+NVT C+AFLQSCADH+NLNKGKQLHSLMITYGFSPSPPSITSLINMYSKCGQ
Sbjct: 55  LSVFQLLDNVTTCIAFLQSCADHKNLNKGKQLHSLMITYGFSPSPPSITSLINMYSKCGQ 114

Query: 62  MGEAILVFYDPCHERNVFAYNAIISGFVSNGLASKGFQFYKKMRLEGVMPDKYTFPCVVR 121
           M EAILVF+DPCHERNVFAYNAIISGFV+NGLASKGFQFY++MRLEGVMPDKYTFPCVVR
Sbjct: 115 MREAILVFHDPCHERNVFAYNAIISGFVANGLASKGFQFYEQMRLEGVMPDKYTFPCVVR 174

Query: 122 TCCEVMEVKKIHGCLLKMGLELDVFVGSALVNTYLKNGSMEDAQKVFGELSIRDVVLWNA 181
           TCCEVMEVKKIHGCL KMGLELDVFVGSALVNTYLK GSME+AQKVF E+SIRDVVLWNA
Sbjct: 175 TCCEVMEVKKIHGCLFKMGLELDVFVGSALVNTYLKIGSMENAQKVFEEMSIRDVVLWNA 234

Query: 182 MINGYAKIGCLDEALEVFRRMHVKGVAPSRFTITGILSVFASRGDLDNGKTVHGIVMKMG 241
           MINGYA+IGCLDEALEVFRRMH++G+APSRFTITGIL +FASRGDLDNG+TVHGIVMKMG
Sbjct: 235 MINGYAQIGCLDEALEVFRRMHIEGIAPSRFTITGILPIFASRGDLDNGQTVHGIVMKMG 294

Query: 242 YDSGVSVSNALIDMYGKCKHIGDALIIFEMINEKDIFSWNSIISVHEQCGDHDGTLRLFD 301
           YDSGV+VSNALIDMYGKCKHI DALIIFEMINEKDIFSWNSIISVHEQCGDHDGTLRLFD
Sbjct: 295 YDSGVAVSNALIDMYGKCKHIRDALIIFEMINEKDIFSWNSIISVHEQCGDHDGTLRLFD 354

Query: 302 KMLGSGILPDLVTITTVLPACSHLAALMHGREIHGYMIINGLGKDDENGAVDNLLVSNAV 361
           KMLGS ILPDLVTITTVLPACSHLAA MHGREIHGYMI+NGLGKDDENG VD+LLV+NAV
Sbjct: 355 KMLGSEILPDLVTITTVLPACSHLAAFMHGREIHGYMIVNGLGKDDENGVVDDLLVNNAV 414

Query: 362 MDMYAKCGSMNNALKIFDSMSKKDVASWNIMIMGYGMHGYALEALGMFSQMCEAEFKPNE 421
           MDMYAKCGSM NALK+FD MS KDVASWNIMIMGYGMHGY ++AL MFS+MCE  FKP+E
Sbjct: 415 MDMYAKCGSMYNALKVFDQMSNKDVASWNIMIMGYGMHGYGMKALDMFSRMCEVGFKPDE 474

Query: 422 VTLVGVLSACNHAGFVSHGRLFLAQMESTFGVIPTIEHYTCVIDMLGRAGHLEDAYEIVQ 481
           VTLVGVLSACNH GFVS GRL LAQMES FGVIPTIEHYTCVIDMLGRAGHLEDAY+I Q
Sbjct: 475 VTLVGVLSACNHTGFVSQGRLLLAQMESKFGVIPTIEHYTCVIDMLGRAGHLEDAYDITQ 534

Query: 482 KMPIQANPVVWRALLGACRLHGNAELAEIAARQVLQLEPEHCGSYVLMSNVYGVIGRYEE 541
           KMPIQANPVVWRALLGACRLHGNAELAEIAARQV+QLEPEHCGSYVLMSNVYGVIGR+EE
Sbjct: 535 KMPIQANPVVWRALLGACRLHGNAELAEIAARQVMQLEPEHCGSYVLMSNVYGVIGRHEE 594

Query: 542 VLEVRKTMKEQNVKKTPGCSWIELKDGVHVFRTGDRTHSELNALTNQLCDIGFILDEVLN 601
           VLEVRKTMKEQNVKKTPGCSWIELKDGVHVF TGDRTHSELNALTNQ+CDIGFILDEVLN
Sbjct: 595 VLEVRKTMKEQNVKKTPGCSWIELKDGVHVFLTGDRTHSELNALTNQMCDIGFILDEVLN 654

Query: 602 LY 604
           LY
Sbjct: 655 LY 656

BLAST of CSPI04G18560 vs. NCBI nr
Match: XP_022966216.1 (pentatricopeptide repeat-containing protein At3g14730-like isoform X1 [Cucurbita maxima])

HSP 1 Score: 1086.6 bits (2809), Expect = 0.0e+00
Identity = 511/601 (85.02%), Positives = 565/601 (94.01%), Query Frame = 0

Query: 3   SAIQMLNNVTKCVAFLQSCADHQNLNKGKQLHSLMITYGFSPSPPSITSLINMYSKCGQM 62
           S  ++LNNVT C+AFLQSCA+ +NLNKGKQLHS+MITYGFS SP SITSLINMYSKCG+M
Sbjct: 61  SDFRLLNNVTTCIAFLQSCAESKNLNKGKQLHSVMITYGFSHSPSSITSLINMYSKCGRM 120

Query: 63  GEAILVFYDPCHERNVFAYNAIISGFVSNGLASKGFQFYKKMRLEGVMPDKYTFPCVVRT 122
            EA+LVF+DPC+E NVFAYNAIISGFV+NGLAS GFQFYK+MRLEGVMPDKYTFPCVVR+
Sbjct: 121 EEAVLVFHDPCYEPNVFAYNAIISGFVANGLASIGFQFYKQMRLEGVMPDKYTFPCVVRS 180

Query: 123 CCEVMEVKKIHGCLLKMGLELDVFVGSALVNTYLKNGSMEDAQKVFGELSIRDVVLWNAM 182
           CCEVMEVKKIHGCL KMGLELD+FVGSALVNTYLK GSMEDAQ+VF EL IRDVVLWNAM
Sbjct: 181 CCEVMEVKKIHGCLFKMGLELDLFVGSALVNTYLKLGSMEDAQEVFEELPIRDVVLWNAM 240

Query: 183 INGYAKIGCLDEALEVFRRMHVKGVAPSRFTITGILSVFASRGDLDNGKTVHGIVMKMGY 242
           INGYA+IGCLDEALE+F+RMH++GV+PSRFTITGILS+FA +G LDNG+TVHGIVMKMGY
Sbjct: 241 INGYAQIGCLDEALEIFKRMHIEGVSPSRFTITGILSIFALKGHLDNGRTVHGIVMKMGY 300

Query: 243 DSGVSVSNALIDMYGKCKHIGDALIIFEMINEKDIFSWNSIISVHEQCGDHDGTLRLFDK 302
           DSGV+VSNALIDMYGKCKHIGDAL++FE +NEKDIFSWNSIISVHEQCGDHDG LRLFDK
Sbjct: 301 DSGVAVSNALIDMYGKCKHIGDALMVFETMNEKDIFSWNSIISVHEQCGDHDGALRLFDK 360

Query: 303 MLGSGILPDLVTITTVLPACSHLAALMHGREIHGYMIINGLGKDDENGAVDNLLVSNAVM 362
           MLGSG LPDLVT+TT+LPACSHLAALMHGREIHGYMI+NGLG+D +NG +D+LLV+NAVM
Sbjct: 361 MLGSGFLPDLVTVTTILPACSHLAALMHGREIHGYMIVNGLGRDGDNGVIDDLLVNNAVM 420

Query: 363 DMYAKCGSMNNALKIFDSMSKKDVASWNIMIMGYGMHGYALEALGMFSQMCEAEFKPNEV 422
           DMYAKCGSMNNA K+F+SM+ KDVASWNIMIMGYGMHGY ++AL MFS MCEA+ KP+EV
Sbjct: 421 DMYAKCGSMNNAQKVFNSMTNKDVASWNIMIMGYGMHGYGMDALDMFSHMCEAKIKPDEV 480

Query: 423 TLVGVLSACNHAGFVSHGRLFLAQMESTFGVIPTIEHYTCVIDMLGRAGHLEDAYEIVQK 482
           T VGVLSACNHAGFV  GR+FLAQME  FGVIPTIEHYTCVIDMLGRAGHLEDAY++ Q 
Sbjct: 481 TFVGVLSACNHAGFVCQGRMFLAQMEHDFGVIPTIEHYTCVIDMLGRAGHLEDAYDLAQT 540

Query: 483 MPIQANPVVWRALLGACRLHGNAELAEIAARQVLQLEPEHCGSYVLMSNVYGVIGRYEEV 542
           MPIQANPVVWRALLGACRLHGNAELAEIAA++V+QL+PEHCGSYVLMSNVYGV+GRYEEV
Sbjct: 541 MPIQANPVVWRALLGACRLHGNAELAEIAAQKVMQLDPEHCGSYVLMSNVYGVVGRYEEV 600

Query: 543 LEVRKTMKEQNVKKTPGCSWIELKDGVHVFRTGDRTHSELNALTNQLCDIGFILDEVLNL 602
           LEVR TMKEQ+V+KTPGCSWIELKDGVHVF TGDRTH ELNALT+QLCDIGFILDEVLNL
Sbjct: 601 LEVRNTMKEQHVRKTPGCSWIELKDGVHVFLTGDRTHLELNALTSQLCDIGFILDEVLNL 660

Query: 603 Y 604
           Y
Sbjct: 661 Y 661

BLAST of CSPI04G18560 vs. TAIR 10
Match: AT3G14730.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 659.4 bits (1700), Expect = 2.7e-189
Identity = 319/585 (54.53%), Positives = 421/585 (71.97%), Query Frame = 0

Query: 9   NNVTKCVAFLQSCADHQNLNKGKQLHSLMITYGF-SPSPPSITSLINMYSKCGQMGEAIL 68
           +NV  C+A LQ CA  ++   G+Q+H  M+  GF   SP + TSL+NMY+KCG M  A+L
Sbjct: 58  HNVATCIATLQRCAQRKDYVSGQQIHGFMVRKGFLDDSPRAGTSLVNMYAKCGLMRRAVL 117

Query: 69  VFYDPCHERNVFAYNAIISGFVSNGLASKGFQFYKKMRLEGVMPDKYTFPCVVR--TCCE 128
           VF     ER+VF YNA+ISGFV NG      + Y++MR  G++PDKYTFP +++     E
Sbjct: 118 VFGG--SERDVFGYNALISGFVVNGSPLDAMETYREMRANGILPDKYTFPSLLKGSDAME 177

Query: 129 VMEVKKIHGCLLKMGLELDVFVGSALVNTYLKNGSMEDAQKVFGELSIR-DVVLWNAMIN 188
           + +VKK+HG   K+G + D +VGS LV +Y K  S+EDAQKVF EL  R D VLWNA++N
Sbjct: 178 LSDVKKVHGLAFKLGFDSDCYVGSGLVTSYSKFMSVEDAQKVFDELPDRDDSVLWNALVN 237

Query: 189 GYAKIGCLDEALEVFRRMHVKGVAPSRFTITGILSVFASRGDLDNGKTVHGIVMKMGYDS 248
           GY++I   ++AL VF +M  +GV  SR TIT +LS F   GD+DNG+++HG+ +K G  S
Sbjct: 238 GYSQIFRFEDALLVFSKMREEGVGVSRHTITSVLSAFTVSGDIDNGRSIHGLAVKTGSGS 297

Query: 249 GVSVSNALIDMYGKCKHIGDALIIFEMINEKDIFSWNSIISVHEQCGDHDGTLRLFDKML 308
            + VSNALIDMYGK K + +A  IFE ++E+D+F+WNS++ VH+ CGDHDGTL LF++ML
Sbjct: 298 DIVVSNALIDMYGKSKWLEEANSIFEAMDERDLFTWNSVLCVHDYCGDHDGTLALFERML 357

Query: 309 GSGILPDLVTITTVLPACSHLAALMHGREIHGYMIINGLGKDDENGAVDNLLVSNAVMDM 368
            SGI PD+VT+TTVLP C  LA+L  GREIHGYMI++GL     N    N  + N++MDM
Sbjct: 358 CSGIRPDIVTLTTVLPTCGRLASLRQGREIHGYMIVSGL----LNRKSSNEFIHNSLMDM 417

Query: 369 YAKCGSMNNALKIFDSMSKKDVASWNIMIMGYGMHGYALEALGMFSQMCEAEFKPNEVTL 428
           Y KCG + +A  +FDSM  KD ASWNIMI GYG+      AL MFS MC A  KP+E+T 
Sbjct: 418 YVKCGDLRDARMVFDSMRVKDSASWNIMINGYGVQSCGELALDMFSCMCRAGVKPDEITF 477

Query: 429 VGVLSACNHAGFVSHGRLFLAQMESTFGVIPTIEHYTCVIDMLGRAGHLEDAYEIVQKMP 488
           VG+L AC+H+GF++ GR FLAQME+ + ++PT +HY CVIDMLGRA  LE+AYE+    P
Sbjct: 478 VGLLQACSHSGFLNEGRNFLAQMETVYNILPTSDHYACVIDMLGRADKLEEAYELAISKP 537

Query: 489 IQANPVVWRALLGACRLHGNAELAEIAARQVLQLEPEHCGSYVLMSNVYGVIGRYEEVLE 548
           I  NPVVWR++L +CRLHGN +LA +A +++ +LEPEHCG YVLMSNVY   G+YEEVL+
Sbjct: 538 ICDNPVVWRSILSSCRLHGNKDLALVAGKRLHELEPEHCGGYVLMSNVYVEAGKYEEVLD 597

Query: 549 VRKTMKEQNVKKTPGCSWIELKDGVHVFRTGDRTHSELNALTNQL 590
           VR  M++QNVKKTPGCSWI LK+GVH F TG++TH E  ++ + L
Sbjct: 598 VRDAMRQQNVKKTPGCSWIVLKNGVHTFFTGNQTHPEFKSIHDWL 636

BLAST of CSPI04G18560 vs. TAIR 10
Match: AT1G15510.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 419.5 bits (1077), Expect = 4.7e-117
Identity = 208/579 (35.92%), Positives = 341/579 (58.89%), Query Frame = 0

Query: 18  LQSCADHQNLNKGKQLHSLMITYGFSPSPPSITSLINMYSKCGQMGEAILVFYDPCHERN 77
           L++C    +L +GK++H  ++ YG+      + +LI MY KCG +  A L+F D    R+
Sbjct: 203 LRTCGGIPDLARGKEVHVHVVRYGYELDIDVVNALITMYVKCGDVKSARLLF-DRMPRRD 262

Query: 78  VFAYNAIISGFVSNGLASKGFQFYKKMRLEGVMPDKYTFPCVVRTCCEVMEVKK----IH 137
           + ++NA+ISG+  NG+  +G + +  MR   V PD  T   V+ + CE++  ++    IH
Sbjct: 263 IISWNAMISGYFENGMCHEGLELFFAMRGLSVDPDLMTLTSVI-SACELLGDRRLGRDIH 322

Query: 138 GCLLKMGLELDVFVGSALVNTYLKNGSMEDAQKVFGELSIRDVVLWNAMINGYAKIGCLD 197
             ++  G  +D+ V ++L   YL  GS  +A+K+F  +  +D+V W  MI+GY      D
Sbjct: 323 AYVITTGFAVDISVCNSLTQMYLNAGSWREAEKLFSRMERKDIVSWTTMISGYEYNFLPD 382

Query: 198 EALEVFRRMHVKGVAPSRFTITGILSVFASRGDLDNGKTVHGIVMKMGYDSGVSVSNALI 257
           +A++ +R M    V P   T+  +LS  A+ GDLD G  +H + +K    S V V+N LI
Sbjct: 383 KAIDTYRMMDQDSVKPDEITVAAVLSACATLGDLDTGVELHKLAIKARLISYVIVANNLI 442

Query: 258 DMYGKCKHIGDALIIFEMINEKDIFSWNSIIS---VHEQCGDHDGTLRLFDKMLGSGILP 317
           +MY KCK I  AL IF  I  K++ SW SII+   ++ +C +      +F + +   + P
Sbjct: 443 NMYSKCKCIDKALDIFHNIPRKNVISWTSIIAGLRLNNRCFE----ALIFLRQMKMTLQP 502

Query: 318 DLVTITTVLPACSHLAALMHGREIHGYMIINGLGKDDENGAVDNLLVSNAVMDMYAKCGS 377
           + +T+T  L AC+ + ALM G+EIH +++  G+G DD         + NA++DMY +CG 
Sbjct: 503 NAITLTAALAACARIGALMCGKEIHAHVLRTGVGLDD--------FLPNALLDMYVRCGR 562

Query: 378 MNNALKIFDSMSKKDVASWNIMIMGYGMHGYALEALGMFSQMCEAEFKPNEVTLVGVLSA 437
           MN A   F+S  KKDV SWNI++ GY   G     + +F +M ++  +P+E+T + +L  
Sbjct: 563 MNTAWSQFNS-QKKDVTSWNILLTGYSERGQGSMVVELFDRMVKSRVRPDEITFISLLCG 622

Query: 438 CNHAGFVSHGRLFLAQMESTFGVIPTIEHYTCVIDMLGRAGHLEDAYEIVQKMPIQANPV 497
           C+ +  V  G ++ ++ME  +GV P ++HY CV+D+LGRAG L++A++ +QKMP+  +P 
Sbjct: 623 CSKSQMVRQGLMYFSKMED-YGVTPNLKHYACVVDLLGRAGELQEAHKFIQKMPVTPDPA 682

Query: 498 VWRALLGACRLHGNAELAEIAARQVLQLEPEHCGSYVLMSNVYGVIGRYEEVLEVRKTMK 557
           VW ALL ACR+H   +L E++A+ + +L+ +  G Y+L+ N+Y   G++ EV +VR+ MK
Sbjct: 683 VWGALLNACRIHHKIDLGELSAQHIFELDKKSVGYYILLCNLYADCGKWREVAKVRRMMK 742

Query: 558 EQNVKKTPGCSWIELKDGVHVFRTGDRTHSELNALTNQL 590
           E  +    GCSW+E+K  VH F + D+ H +   +   L
Sbjct: 743 ENGLTVDAGCSWVEVKGKVHAFLSDDKYHPQTKEINTVL 765

BLAST of CSPI04G18560 vs. TAIR 10
Match: AT4G18750.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 416.0 bits (1068), Expect = 5.2e-116
Identity = 220/585 (37.61%), Positives = 345/585 (58.97%), Query Frame = 0

Query: 1   MLSAIQMLNNVTKCVAFLQSCADHQNLNKGKQLHSLMITYGFSPSPPSITSLINMYSKCG 60
           M S ++M +    CV+  +S +  ++++ G+QLH  ++  GF        SL+  Y K  
Sbjct: 187 MSSGVEMDSYTFSCVS--KSFSSLRSVHGGEQLHGFILKSGFGERNSVGNSLVAFYLKNQ 246

Query: 61  QMGEAILVFYDPCHERNVFAYNAIISGFVSNGLASKGFQFYKKMRLEGVMPDKYTFPCVV 120
           ++  A  VF D   ER+V ++N+II+G+VSNGLA KG   + +M + G+  D  T   V 
Sbjct: 247 RVDSARKVF-DEMTERDVISWNSIINGYVSNGLAEKGLSVFVQMLVSGIEIDLATIVSVF 306

Query: 121 RTCCEVMEV---KKIHGCLLKMGLELDVFVGSALVNTYLKNGSMEDAQKVFGELSIRDVV 180
             C +   +   + +H   +K     +    + L++ Y K G ++ A+ VF E+S R VV
Sbjct: 307 AGCADSRLISLGRAVHSIGVKACFSREDRFCNTLLDMYSKCGDLDSAKAVFREMSDRSVV 366

Query: 181 LWNAMINGYAKIGCLDEALEVFRRMHVKGVAPSRFTITGILSVFASRGDLDNGKTVHGIV 240
            + +MI GYA+ G   EA+++F  M  +G++P  +T+T +L+  A    LD GK VH  +
Sbjct: 367 SYTSMIAGYAREGLAGEAVKLFEEMEEEGISPDVYTVTAVLNCCARYRLLDEGKRVHEWI 426

Query: 241 MKMGYDSGVSVSNALIDMYGKCKHIGDALIIFEMINEKDIFSWNSIISVHEQCGDHDGTL 300
            +      + VSNAL+DMY KC  + +A ++F  +  KDI SWN+II  + +    +  L
Sbjct: 427 KENDLGFDIFVSNALMDMYAKCGSMQEAELVFSEMRVKDIISWNTIIGGYSKNCYANEAL 486

Query: 301 RLFDKML-GSGILPDLVTITTVLPACSHLAALMHGREIHGYMIINGLGKDDENGAVDNLL 360
            LF+ +L      PD  T+  VLPAC+ L+A   GREIHGY++ NG   D          
Sbjct: 487 SLFNLLLEEKRFSPDERTVACVLPACASLSAFDKGREIHGYIMRNGYFSDRH-------- 546

Query: 361 VSNAVMDMYAKCGSMNNALKIFDSMSKKDVASWNIMIMGYGMHGYALEALGMFSQMCEAE 420
           V+N+++DMYAKCG++  A  +FD ++ KD+ SW +MI GYGMHG+  EA+ +F+QM +A 
Sbjct: 547 VANSLVDMYAKCGALLLAHMLFDDIASKDLVSWTVMIAGYGMHGFGKEAIALFNQMRQAG 606

Query: 421 FKPNEVTLVGVLSACNHAGFVSHGRLFLAQMESTFGVIPTIEHYTCVIDMLGRAGHLEDA 480
            + +E++ V +L AC+H+G V  G  F   M     + PT+EHY C++DML R G L  A
Sbjct: 607 IEADEISFVSLLYACSHSGLVDEGWRFFNIMRHECKIEPTVEHYACIVDMLARTGDLIKA 666

Query: 481 YEIVQKMPIQANPVVWRALLGACRLHGNAELAEIAARQVLQLEPEHCGSYVLMSNVYGVI 540
           Y  ++ MPI  +  +W ALL  CR+H + +LAE  A +V +LEPE+ G YVLM+N+Y   
Sbjct: 667 YRFIENMPIPPDATIWGALLCGCRIHHDVKLAEKVAEKVFELEPENTGYYVLMANIYAEA 726

Query: 541 GRYEEVLEVRKTMKEQNVKKTPGCSWIELKDGVHVFRTGDRTHSE 582
            ++E+V  +RK + ++ ++K PGCSWIE+K  V++F  GD ++ E
Sbjct: 727 EKWEQVKRLRKRIGQRGLRKNPGCSWIEIKGRVNIFVAGDSSNPE 760

BLAST of CSPI04G18560 vs. TAIR 10
Match: AT4G21300.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 409.1 bits (1050), Expect = 6.3e-114
Identity = 218/609 (35.80%), Positives = 347/609 (56.98%), Query Frame = 0

Query: 6   QMLNNVTKCVAFLQSCADHQNLNKGKQLHSLMITYGFSPSPPSITSLINMYSKCGQMGEA 65
           Q+  N       L  CA    ++ G QLH L++  G         SL++MYSKCG+  +A
Sbjct: 234 QISPNAVTFDCVLSVCASKLLIDLGVQLHGLVVVSGVDFEGSIKNSLLSMYSKCGRFDDA 293

Query: 66  ILVFYDPCHERNVFAYNAIISGFVSNGLASKGFQFYKKMRLEGVMPDKYTFPCVVRTCCE 125
             +F       +   +N +ISG+V +GL  +   F+ +M   GV+PD  TF  ++ +  +
Sbjct: 294 SKLF-RMMSRADTVTWNCMISGYVQSGLMEESLTFFYEMISSGVLPDAITFSSLLPSVSK 353

Query: 126 VMEV---KKIHGCLLKMGLELDVFVGSALVNTYLKNGSMEDAQKVFGELSIRDVVLWNAM 185
              +   K+IH  +++  + LD+F+ SAL++ Y K   +  AQ +F + +  DVV++ AM
Sbjct: 354 FENLEYCKQIHCYIMRHSISLDIFLTSALIDAYFKCRGVSMAQNIFSQCNSVDVVVFTAM 413

Query: 186 INGYAKIGCLDEALEVFRRMHVKGVAPSRFTITGILSVFASRGDLDNGKTVHGIVMKMGY 245
           I+GY   G   ++LE+FR +    ++P+  T+  IL V      L  G+ +HG ++K G+
Sbjct: 414 ISGYLHNGLYIDSLEMFRWLVKVKISPNEITLVSILPVIGILLALKLGRELHGFIIKKGF 473

Query: 246 DSGVSVSNALIDMYGKCKHIGDALIIFEMINEKDIFSWNSIISVHEQCGDHDGTLRLFDK 305
           D+  ++  A+IDMY KC  +  A  IFE ++++DI SWNS+I+   Q  +    + +F +
Sbjct: 474 DNRCNIGCAVIDMYAKCGRMNLAYEIFERLSKRDIVSWNSMITRCAQSDNPSAAIDIFRQ 533

Query: 306 MLGSGILPDLVTITTVLPACSHLAALMHGREIHGYMIINGLGKDDENGAVDNLLVSNAVM 365
           M  SGI  D V+I+  L AC++L +   G+ IHG+MI + L  D        +   + ++
Sbjct: 534 MGVSGICYDCVSISAALSACANLPSESFGKAIHGFMIKHSLASD--------VYSESTLI 593

Query: 366 DMYAKCGSMNNALKIFDSMSKKDVASWNIMIMGYGMHGYALEALGMFSQMCE-AEFKPNE 425
           DMYAKCG++  A+ +F +M +K++ SWN +I   G HG   ++L +F +M E +  +P++
Sbjct: 594 DMYAKCGNLKAAMNVFKTMKEKNIVSWNSIIAACGNHGKLKDSLCLFHEMVEKSGIRPDQ 653

Query: 426 VTLVGVLSACNHAGFVSHGRLFLAQMESTFGVIPTIEHYTCVIDMLGRAGHLEDAYEIVQ 485
           +T + ++S+C H G V  G  F   M   +G+ P  EHY CV+D+ GRAG L +AYE V+
Sbjct: 654 ITFLEIISSCCHVGDVDEGVRFFRSMTEDYGIQPQQEHYACVVDLFGRAGRLTEAYETVK 713

Query: 486 KMPIQANPVVWRALLGACRLHGNAELAEIAARQVLQLEPEHCGSYVLMSNVYGVIGRYEE 545
            MP   +  VW  LLGACRLH N ELAE+A+ +++ L+P + G YVL+SN +     +E 
Sbjct: 714 SMPFPPDAGVWGTLLGACRLHKNVELAEVASSKLMDLDPSNSGYYVLISNAHANAREWES 773

Query: 546 VLEVRKTMKEQNVKKTPGCSWIELKDGVHVFRTGDRTHSE-------LNALTNQLCDIGF 604
           V +VR  MKE+ V+K PG SWIE+    H+F +GD  H E       LN+L  +L   G+
Sbjct: 774 VTKVRSLMKEREVQKIPGYSWIEINKRTHLFVSGDVNHPESSHIYSLLNSLLGELRLEGY 833

BLAST of CSPI04G18560 vs. TAIR 10
Match: AT1G11290.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 404.8 bits (1039), Expect = 1.2e-112
Identity = 213/595 (35.80%), Positives = 344/595 (57.82%), Query Frame = 0

Query: 18  LQSCADHQNLNKGKQLHSLMITYGFSPSPPSITSLINMYSKCGQMGEAILVFYDPCHERN 77
           L+ C D   L  GK++H L++  GFS    ++T L NMY+KC Q+ EA  VF D   ER+
Sbjct: 142 LKVCGDEAELRVGKEIHGLLVKSGFSLDLFAMTGLENMYAKCRQVNEARKVF-DRMPERD 201

Query: 78  VFAYNAIISGFVSNGLASKGFQFYKKMRLEGVMPDKYTFPCVVRTCCEVMEV---KKIHG 137
           + ++N I++G+  NG+A    +  K M  E + P   T   V+     +  +   K+IHG
Sbjct: 202 LVSWNTIVAGYSQNGMARMALEMVKSMCEENLKPSFITIVSVLPAVSALRLISVGKEIHG 261

Query: 138 CLLKMGLELDVFVGSALVNTYLKNGSMEDAQKVFGELSIRDVVLWNAMINGYAKIGCLDE 197
             ++ G +  V + +ALV+ Y K GS+E A+++F  +  R+VV WN+MI+ Y +     E
Sbjct: 262 YAMRSGFDSLVNISTALVDMYAKCGSLETARQLFDGMLERNVVSWNSMIDAYVQNENPKE 321

Query: 198 ALEVFRRMHVKGVAPSRFTITGILSVFASRGDLDNGKTVHGIVMKMGYDSGVSVSNALID 257
           A+ +F++M  +GV P+  ++ G L   A  GDL+ G+ +H + +++G D  VSV N+LI 
Sbjct: 322 AMLIFQKMLDEGVKPTDVSVMGALHACADLGDLERGRFIHKLSVELGLDRNVSVVNSLIS 381

Query: 258 MYGKCKHIGDALIIFEMINEKDIFSWNSIISVHEQCGDHDGTLRLFDKMLGSGILPDLVT 317
           MY KCK +  A  +F  +  + + SWN++I    Q G     L  F +M    + PD  T
Sbjct: 382 MYCKCKEVDTAASMFGKLQSRTLVSWNAMILGFAQNGRPIDALNYFSQMRSRTVKPDTFT 441

Query: 318 ITTVLPACSHLAALMHGREIHGYMIINGLGKDDENGAVDNLLVSNAVMDMYAKCGSMNNA 377
             +V+ A + L+   H + IHG ++ + L K        N+ V+ A++DMYAKCG++  A
Sbjct: 442 YVSVITAIAELSITHHAKWIHGVVMRSCLDK--------NVFVTTALVDMYAKCGAIMIA 501

Query: 378 LKIFDSMSKKDVASWNIMIMGYGMHGYALEALGMFSQMCEAEFKPNEVTLVGVLSACNHA 437
             IFD MS++ V +WN MI GYG HG+   AL +F +M +   KPN VT + V+SAC+H+
Sbjct: 502 RLIFDMMSERHVTTWNAMIDGYGTHGFGKAALELFEEMQKGTIKPNGVTFLSVISACSHS 561

Query: 438 GFVSHGRLFLAQMESTFGVIPTIEHYTCVIDMLGRAGHLEDAYEIVQKMPIQANPVVWRA 497
           G V  G      M+  + +  +++HY  ++D+LGRAG L +A++ + +MP++    V+ A
Sbjct: 562 GLVEAGLKCFYMMKENYSIELSMDHYGAMVDLLGRAGRLNEAWDFIMQMPVKPAVNVYGA 621

Query: 498 LLGACRLHGNAELAEIAARQVLQLEPEHCGSYVLMSNVYGVIGRYEEVLEVRKTMKEQNV 557
           +LGAC++H N   AE AA ++ +L P+  G +VL++N+Y     +E+V +VR +M  Q +
Sbjct: 622 MLGACQIHKNVNFAEKAAERLFELNPDDGGYHVLLANIYRAASMWEKVGQVRVSMLRQGL 681

Query: 558 KKTPGCSWIELKDGVHVFRTGDRTHSE-------LNALTNQLCDIGFILDEVLNL 603
           +KTPGCS +E+K+ VH F +G   H +       L  L   + + G++ D  L L
Sbjct: 682 RKTPGCSMVEIKNEVHSFFSGSTAHPDSKKIYAFLEKLICHIKEAGYVPDTNLVL 727

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
Q9LUC2 | 3.8e-188 | 54.53 | Pentatricopeptide repeat-containing protein At3g14730 OS=Arabidopsis thaliana OX...
Q9M9E2 | 6.6e-116 | 35.92 | Pentatricopeptide repeat-containing protein At1g15510, chloroplastic OS=Arabidop...
Q9SN39 | 7.3e-115 | 37.61 | Pentatricopeptide repeat-containing protein DOT4, chloroplastic OS=Arabidopsis t...
Q9STE1 | 8.9e-113 | 35.80 | Pentatricopeptide repeat-containing protein At4g21300 OS=Arabidopsis thaliana OX...
Q3E6Q1 | 1.7e-111 | 35.80 | Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidop...

Match Name | E-value | Identity | Description
A0A0A0L3D2 | 0.0e+00 | 100.00 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G414410 PE=4 SV=1
A0A1S4E0R7 | 0.0e+00 | 94.35 | pentatricopeptide repeat-containing protein At3g14730-like OS=Cucumis melo OX=36...
A0A5A7TK17 | 0.0e+00 | 94.30 | Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946...
A0A6J1HME0 | 0.0e+00 | 85.02 | pentatricopeptide repeat-containing protein At3g14730-like isoform X2 OS=Cucurbi...
A0A6J1HNR1 | 0.0e+00 | 85.02 | pentatricopeptide repeat-containing protein At3g14730-like isoform X1 OS=Cucurbi...

Match Name | E-value | Identity | Description
XP_004150613.2 | 0.0e+00 | 100.00 | pentatricopeptide repeat-containing protein At3g14730 [Cucumis sativus] >XP_0317...
XP_008455782.1 | 0.0e+00 | 94.35 | PREDICTED: pentatricopeptide repeat-containing protein At3g14730-like [Cucumis m...
KAA0043600.1 | 0.0e+00 | 94.30 | pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] >TYK29760...
XP_038881250.1 | 0.0e+00 | 91.03 | pentatricopeptide repeat-containing protein At3g14730-like [Benincasa hispida]
XP_022966216.1 | 0.0e+00 | 85.02 | pentatricopeptide repeat-containing protein At3g14730-like isoform X1 [Cucurbita...

Match Name | E-value | Identity | Description
AT3G14730.1 | 2.7e-189 | 54.53 | Pentatricopeptide repeat (PPR) superfamily protein
AT1G15510.1 | 4.7e-117 | 35.92 | Tetratricopeptide repeat (TPR)-like superfamily protein
AT4G18750.1 | 5.2e-116 | 37.61 | Pentatricopeptide repeat (PPR) superfamily protein
AT4G21300.1 | 6.3e-114 | 35.80 | Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G11290.1 | 1.2e-112 | 35.80 | Pentatricopeptide repeat (PPR) superfamily protein

InterPro
Analysis Name: InterPro Annotations of Cucumber (PI 183967) v1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR | coord: 525..554; e-value: 0.12; score: 12.6
    coord: 459..483; e-value: 0.0036; score: 17.4
    coord: 150..171; e-value: 0.27; score: 11.6
    coord: 50..70; e-value: 0.03; score: 14.6
IPR002885 | Pentatricopeptide repeat | PFAM | PF13041 | PPR_2 | coord: 76..124; e-value: 1.3E-11; score: 44.5
    coord: 275..322; e-value: 2.5E-8; score: 34.0
    coord: 384..431; e-value: 1.1E-7; score: 32.0
    coord: 174..222; e-value: 5.5E-11; score: 42.5
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 278..311; e-value: 3.7E-5; score: 21.6
    coord: 388..421; e-value: 1.9E-6; score: 25.7
    coord: 177..210; e-value: 5.2E-10; score: 36.9
    coord: 80..112; e-value: 5.2E-6; score: 24.3
    coord: 359..386; e-value: 9.2E-4; score: 17.2
    coord: 460..483; e-value: 6.2E-4; score: 17.8
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 77..111; score: 11.257313
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 175..209; score: 13.953793
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 354..384; score: 8.53891
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 276..310; score: 11.257313
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 385..419; score: 10.599635
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 449..576; e-value: 1.6E-12; score: 49.5
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 130..234; e-value: 1.6E-22; score: 82.3
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 2..129; e-value: 2.1E-23; score: 84.6
    coord: 235..332; e-value: 5.4E-20; score: 73.5
    coord: 346..439; e-value: 1.0E-20; score: 75.8
IPR011990 | Tetratricopeptide-like helical domain superfamily | SUPERFAMILY | 48452 | TPR-like | coord: 153..545
None | No IPR available | PANTHER | PTHR47928:SF61 | OS01G0818200 PROTEIN | coord: 7..581
None | No IPR available | PANTHER | PTHR47928 | REPEAT-CONTAINING PROTEIN, PUTATIVE-RELATED | coord: 7..581
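
The `coord: a..b` entries above give the span of each domain hit. The following is a minimal Python sketch, not part of the original annotation, for pulling those spans out of the text and computing their lengths. It assumes the coordinates are 1-based, inclusive residue positions on the polypeptide, and the function name `coord_lengths` is hypothetical.

```python
import re


def coord_lengths(text):
    """Return (start, end, length) for every 'coord: a..b' entry in text.

    Assumption: coordinates are inclusive, so length = end - start + 1.
    """
    spans = []
    for start, end in re.findall(r"coord:\s*(\d+)\.\.(\d+)", text):
        start, end = int(start), int(end)
        spans.append((start, end, end - start + 1))
    return spans


if __name__ == "__main__":
    # Coordinates of the five PROSITE PS51375 hits listed in this record.
    prosite_hits = """
    coord: 77..111
    coord: 175..209
    coord: 354..384
    coord: 276..310
    coord: 385..419
    """
    for start, end, length in sorted(coord_lengths(prosite_hits)):
        print(start, end, length)
```

Under the inclusive-coordinate assumption, four of the five PROSITE hits span 35 residues, which matches the canonical length of a pentatricopeptide repeat.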

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
CSPI04G18560.1 | CSPI04G18560.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
molecular_function | GO:0005515 | protein binding