CsGy2G024800 (gene) Cucumber (Gy14) v2.1

Overview
NameCsGy2G024800
Typegene
OrganismCucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
DescriptionPentatricopeptide repeat-containing protein
LocationGy14Chr2: 32464309 .. 32466624 (+)
RNA-Seq ExpressionCsGy2G024800
SyntenyCsGy2G024800
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRexonCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
TTAAAGACATACTAATTTTCAACCCGAATTTGTGGCCGTGGCGGGAAGTTCAAACCACTTTCCGGCTATGTTCTCGCCGGCTATCATTTCTCAATCACCGCCTTGTTTAACTTTTCAACCAACGTCAACGTCACTATCCACACGGCGGACCTGTTCCAAATGGAACCTCACTACTTTCAACCGTTGTAAAAGCTCCACAAGTTTCCCGTTCAATTTTGTCGAAGACCATTCCAAGGCTTTGCCGGTCGCTTGCGCAACGGGAAAATGTACTACTACTGAAGAATACGCCGATGTAGAATCTTGCAGCAATCAGTCGGTGAGTGGATGTTTGAGTCCCTATTTGATTGGGGTTTGGCTTCGCTCTAGTCGTAGCGTCAAGAAATTAAGGGCGGTACATGCTTTCATTTTGCGAAATTTTACAAGTTTTGGGATCTATGTTGGAAACAATTTGCTTAGTTCTTACTTAAGATTGGGAATGTTGGTTGATGCTAGAAAGGTGTTCGATGAAATGCCAATGAGGAGTGTTGTGACCTGGACGGCTATTATTAATGGATATATTGATTTGGATTTGACTGAAGAAGCTTTAGCGTTGTTCAGTGATTCGGTCAAGAGCGGGGTGCTAGCAAATGGGCAGATGTTTGTTTGCATCTTGAACTTGTGTGCTAAGAGGTTGGATTTTGAGCTTGGGAGGCAAATTCATGGCGTTATTGTGAAAGGTAATCGCGGGAATTTGATTGTTGACAGTGCCATTATTTACTTTTATGCACAATGTAAAGATATTTCCAGTGCTTTTGTTGCATTTGAACGTATGCGGAGGCGTGATGTTGTTTGTTGGACTTCCATGATAACTTCTTGTTCCCAACAAGGGCTTGGACGAGAAGCGATTTCGATGTTTTCAAATATGCTAAGCGATGAATTTCTACCAAACGAGTTTTCCGTATGCAGTGTTCTCAAGGCTTGTGGAGAAGAGAGGGAATTGAAGATTGGGAGACAGTTACATGGTTTGATAATTAAGAAAATAATCAAGAATGATGTTTTTGTTGGGACTTCATTGGTTGATATGTATGCCAAGTGCGGAAACTTGGCAGATTCTAGAGAAGTATTCGACGGGATGAGGAATAGGAACACGGTTACCTGGACGTCAATCATAGCTGGTTATGCACGGGAGGGGCTCGGTGAGGAGGCTCTGAACCTCTTTAGGTTGATGAAGAGGCAAAGGATTCCTGCCAATAACTTAACCATTGTAAGTATTCTCCGCGCTTGCGGTTCGATTGAGGCATCATTGACTGGGAGGGAAGTTCATGCCCAGATTGTAAAAAATTCATTTCAAACTAATATACACATAGGAAGCACTCTAGTTTGGTTCTACTGTAAATGTAGGAATCAACTTAAGGCCTCGATGGTTCTTCAGCTGATGCCGCTAAGAGATGTGGTTTCTTGGACAGCCATCATTTCTGGATGTGCTCATCTTGGGCACGAGTCCGAGGCGCTTGAGTTTCTAAAAAACATGATAGAGGAAGGTGTTGAACCAAATTCCTTTACTTATTCGTCAACTTTAAAAGCGTGTGCCAAGATGGAAGCTGTCCTCCAAGGGAAAATGATCCACTCTTCCGCAAACAAAACATCTGCGTTGTCCAATGTTTTTGTGGGAAGTGCACTGATTTACATGTATGCAAAATGTGGATATGTAACCGAAGCTTCTCAAGTTTTCGACAGTATGCCGGTGAGGAATTTGGTTTCTTGGAAGGCCATGATTTTGTGTTATGCTAGGAACGGTCTGTGCCGAGAGGCATTAAAGCTCATGTATCGGATGCAGGCGGAAGGTTTCGAAGTGGACGATTACATTCTTGGAACAGTTTATGGAGCTTGTGGAGATGTGAAATGTGATGTGGATTCATCACTTGAATATAGGTTGCAAACTCATTGACCTCCCAGTTACAATGGTTATCAAGACTCAGACATTCAACGACGGAAATCTTGCCATCTTGAGGAGGTTTGCACTTGTTCATCCTGTTTCATTTGGAATGCCAATGATCTGAATAACTTTCTTGTGCTCTTATCACTCCATAAATCAAAAGTTTTGAAGAGGGGAGAAACAAATAATCAAACTCTAACCCCACCTAGCTGAGATGTTTTATCTGCTCCATGCATCATGTATTTTACAACCCAACCCTACTTAATTGTAAAAATATTCAATGTAATCTTTGTAAATCCATACCCATCTAGTTCTATGAATAGCCCTACTTAAATTTGCTTTTCATATGATAGAACCCTATGAATCTTTAGAAATGTTATTTTTCTTACACCATGTAGACC

mRNA sequence

TTAAAGACATACTAATTTTCAACCCGAATTTGTGGCCGTGGCGGGAAGTTCAAACCACTTTCCGGCTATGTTCTCGCCGGCTATCATTTCTCAATCACCGCCTTGTTTAACTTTTCAACCAACGTCAACGTCACTATCCACACGGCGGACCTGTTCCAAATGGAACCTCACTACTTTCAACCGTTGTAAAAGCTCCACAAGTTTCCCGTTCAATTTTGTCGAAGACCATTCCAAGGCTTTGCCGGTCGCTTGCGCAACGGGAAAATGTACTACTACTGAAGAATACGCCGATGTAGAATCTTGCAGCAATCAGTCGGTGAGTGGATGTTTGAGTCCCTATTTGATTGGGGTTTGGCTTCGCTCTAGTCGTAGCGTCAAGAAATTAAGGGCGGTACATGCTTTCATTTTGCGAAATTTTACAAGTTTTGGGATCTATGTTGGAAACAATTTGCTTAGTTCTTACTTAAGATTGGGAATGTTGGTTGATGCTAGAAAGGTGTTCGATGAAATGCCAATGAGGAGTGTTGTGACCTGGACGGCTATTATTAATGGATATATTGATTTGGATTTGACTGAAGAAGCTTTAGCGTTGTTCAGTGATTCGGTCAAGAGCGGGGTGCTAGCAAATGGGCAGATGTTTGTTTGCATCTTGAACTTGTGTGCTAAGAGGTTGGATTTTGAGCTTGGGAGGCAAATTCATGGCGTTATTGTGAAAGGTAATCGCGGGAATTTGATTGTTGACAGTGCCATTATTTACTTTTATGCACAATGTAAAGATATTTCCAGTGCTTTTGTTGCATTTGAACGTATGCGGAGGCGTGATGTTGTTTGTTGGACTTCCATGATAACTTCTTGTTCCCAACAAGGGCTTGGACGAGAAGCGATTTCGATGTTTTCAAATATGCTAAGCGATGAATTTCTACCAAACGAGTTTTCCGTATGCAGTGTTCTCAAGGCTTGTGGAGAAGAGAGGGAATTGAAGATTGGGAGACAGTTACATGGTTTGATAATTAAGAAAATAATCAAGAATGATGTTTTTGTTGGGACTTCATTGGTTGATATGTATGCCAAGTGCGGAAACTTGGCAGATTCTAGAGAAGTATTCGACGGGATGAGGAATAGGAACACGGTTACCTGGACGTCAATCATAGCTGGTTATGCACGGGAGGGGCTCGGTGAGGAGGCTCTGAACCTCTTTAGGTTGATGAAGAGGCAAAGGATTCCTGCCAATAACTTAACCATTGTAAGTATTCTCCGCGCTTGCGGTTCGATTGAGGCATCATTGACTGGGAGGGAAGTTCATGCCCAGATTGTAAAAAATTCATTTCAAACTAATATACACATAGGAAGCACTCTAGTTTGGTTCTACTGTAAATGTAGGAATCAACTTAAGGCCTCGATGGTTCTTCAGCTGATGCCGCTAAGAGATGTGGTTTCTTGGACAGCCATCATTTCTGGATGTGCTCATCTTGGGCACGAGTCCGAGGCGCTTGAGTTTCTAAAAAACATGATAGAGGAAGGTGTTGAACCAAATTCCTTTACTTATTCGTCAACTTTAAAAGCGTGTGCCAAGATGGAAGCTGTCCTCCAAGGGAAAATGATCCACTCTTCCGCAAACAAAACATCTGCGTTGTCCAATGTTTTTGTGGGAAGTGCACTGATTTACATGTATGCAAAATGTGGATATGTAACCGAAGCTTCTCAAGTTTTCGACAGTATGCCGGTGAGGAATTTGGTTTCTTGGAAGGCCATGATTTTGTGTTATGCTAGGAACGGTCTGTGCCGAGAGGCATTAAAGCTCATGTATCGGATGCAGGCGGAAGGTTTCGAAGTGGACGATTACATTCTTGGAACAGTTTATGGAGCTTGTGGAGATGTGAAATGTGATGTGGATTCATCACTTGAATATAGGTTGCAAACTCATTGACCTCCCAGTTACAATGGTTATCAAGACTCAGACATTCAACGACGGAAATCTTGCCATCTTGAGGAGGTTTGCACTTGTTCATCCTGTTTCATTTGGAATGCCAATGATCTGAATAACTTTCTTGTGCTCTTATCACTCCATAAATCAAAAGTTTTGAAGAGGGGAGAAACAAATAATCAAACTCTAACCCCACCTAGCTGAGATGTTTTATCTGCTCCATGCATCATGTATTTTACAACCCAACCCTACTTAATTGTAAAAATATTCAATGTAATCTTTGTAAATCCATACCCATCTAGTTCTATGAATAGCCCTACTTAAATTTGCTTTTCATATGATAGAACCCTATGAATCTTTAGAAATGTTATTTTTCTTACACCATGTAGACC

Coding sequence (CDS)

ATGTTCTCGCCGGCTATCATTTCTCAATCACCGCCTTGTTTAACTTTTCAACCAACGTCAACGTCACTATCCACACGGCGGACCTGTTCCAAATGGAACCTCACTACTTTCAACCGTTGTAAAAGCTCCACAAGTTTCCCGTTCAATTTTGTCGAAGACCATTCCAAGGCTTTGCCGGTCGCTTGCGCAACGGGAAAATGTACTACTACTGAAGAATACGCCGATGTAGAATCTTGCAGCAATCAGTCGGTGAGTGGATGTTTGAGTCCCTATTTGATTGGGGTTTGGCTTCGCTCTAGTCGTAGCGTCAAGAAATTAAGGGCGGTACATGCTTTCATTTTGCGAAATTTTACAAGTTTTGGGATCTATGTTGGAAACAATTTGCTTAGTTCTTACTTAAGATTGGGAATGTTGGTTGATGCTAGAAAGGTGTTCGATGAAATGCCAATGAGGAGTGTTGTGACCTGGACGGCTATTATTAATGGATATATTGATTTGGATTTGACTGAAGAAGCTTTAGCGTTGTTCAGTGATTCGGTCAAGAGCGGGGTGCTAGCAAATGGGCAGATGTTTGTTTGCATCTTGAACTTGTGTGCTAAGAGGTTGGATTTTGAGCTTGGGAGGCAAATTCATGGCGTTATTGTGAAAGGTAATCGCGGGAATTTGATTGTTGACAGTGCCATTATTTACTTTTATGCACAATGTAAAGATATTTCCAGTGCTTTTGTTGCATTTGAACGTATGCGGAGGCGTGATGTTGTTTGTTGGACTTCCATGATAACTTCTTGTTCCCAACAAGGGCTTGGACGAGAAGCGATTTCGATGTTTTCAAATATGCTAAGCGATGAATTTCTACCAAACGAGTTTTCCGTATGCAGTGTTCTCAAGGCTTGTGGAGAAGAGAGGGAATTGAAGATTGGGAGACAGTTACATGGTTTGATAATTAAGAAAATAATCAAGAATGATGTTTTTGTTGGGACTTCATTGGTTGATATGTATGCCAAGTGCGGAAACTTGGCAGATTCTAGAGAAGTATTCGACGGGATGAGGAATAGGAACACGGTTACCTGGACGTCAATCATAGCTGGTTATGCACGGGAGGGGCTCGGTGAGGAGGCTCTGAACCTCTTTAGGTTGATGAAGAGGCAAAGGATTCCTGCCAATAACTTAACCATTGTAAGTATTCTCCGCGCTTGCGGTTCGATTGAGGCATCATTGACTGGGAGGGAAGTTCATGCCCAGATTGTAAAAAATTCATTTCAAACTAATATACACATAGGAAGCACTCTAGTTTGGTTCTACTGTAAATGTAGGAATCAACTTAAGGCCTCGATGGTTCTTCAGCTGATGCCGCTAAGAGATGTGGTTTCTTGGACAGCCATCATTTCTGGATGTGCTCATCTTGGGCACGAGTCCGAGGCGCTTGAGTTTCTAAAAAACATGATAGAGGAAGGTGTTGAACCAAATTCCTTTACTTATTCGTCAACTTTAAAAGCGTGTGCCAAGATGGAAGCTGTCCTCCAAGGGAAAATGATCCACTCTTCCGCAAACAAAACATCTGCGTTGTCCAATGTTTTTGTGGGAAGTGCACTGATTTACATGTATGCAAAATGTGGATATGTAACCGAAGCTTCTCAAGTTTTCGACAGTATGCCGGTGAGGAATTTGGTTTCTTGGAAGGCCATGATTTTGTGTTATGCTAGGAACGGTCTGTGCCGAGAGGCATTAAAGCTCATGTATCGGATGCAGGCGGAAGGTTTCGAAGTGGACGATTACATTCTTGGAACAGTTTATGGAGCTTGTGGAGATGTGAAATGTGATGTGGATTCATCACTTGAATATAGGTTGCAAACTCATTGA

Protein sequence

MFSPAIISQSPPCLTFQPTSTSLSTRRTCSKWNLTTFNRCKSSTSFPFNFVEDHSKALPVACATGKCTTTEEYADVESCSNQSVSGCLSPYLIGVWLRSSRSVKKLRAVHAFILRNFTSFGIYVGNNLLSSYLRLGMLVDARKVFDEMPMRSVVTWTAIINGYIDLDLTEEALALFSDSVKSGVLANGQMFVCILNLCAKRLDFELGRQIHGVIVKGNRGNLIVDSAIIYFYAQCKDISSAFVAFERMRRRDVVCWTSMITSCSQQGLGREAISMFSNMLSDEFLPNEFSVCSVLKACGEERELKIGRQLHGLIIKKIIKNDVFVGTSLVDMYAKCGNLADSREVFDGMRNRNTVTWTSIIAGYAREGLGEEALNLFRLMKRQRIPANNLTIVSILRACGSIEASLTGREVHAQIVKNSFQTNIHIGSTLVWFYCKCRNQLKASMVLQLMPLRDVVSWTAIISGCAHLGHESEALEFLKNMIEEGVEPNSFTYSSTLKACAKMEAVLQGKMIHSSANKTSALSNVFVGSALIYMYAKCGYVTEASQVFDSMPVRNLVSWKAMILCYARNGLCREALKLMYRMQAEGFEVDDYILGTVYGACGDVKCDVDSSLEYRLQTH*
Homology
BLAST of CsGy2G024800 vs. ExPASy Swiss-Prot
Match: Q0WNP3 (Pentatricopeptide repeat-containing protein At4g18520, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-A2 PE=1 SV=1)

HSP 1 Score: 697.2 bits (1798), Expect = 1.7e-199
Identity = 335/530 (63.21%), Postives = 424/530 (80.00%), Query Frame = 0

Query: 92  LIGVWLRSSRSVKKLRAVHAFILRNFTSFGIYVGNNLLSSYLRLGMLVDARKVFDEMPMR 151
           L+  WL+SS  ++ ++ +HA  L+ F    IY GNNL+SS +RLG LV ARKVFD MP +
Sbjct: 87  LLAEWLQSSNGMRLIKRIHAMALKCFDDQVIYFGNNLISSCVRLGDLVYARKVFDSMPEK 146

Query: 152 SVVTWTAIINGYIDLDLTEEALALFSDSVKSGV-LANGQMFVCILNLCAKRLDFELGRQI 211
           + VTWTA+I+GY+   L +EA ALF D VK G+   N +MFVC+LNLC++R +FELGRQ+
Sbjct: 147 NTVTWTAMIDGYLKYGLEDEAFALFEDYVKHGIRFTNERMFVCLLNLCSRRAEFELGRQV 206

Query: 212 HGVIVKGNRGNLIVDSAIIYFYAQCKDISSAFVAFERMRRRDVVCWTSMITSCSQQGLGR 271
           HG +VK   GNLIV+S+++YFYAQC +++SA  AF+ M  +DV+ WT++I++CS++G G 
Sbjct: 207 HGNMVKVGVGNLIVESSLVYFYAQCGELTSALRAFDMMEEKDVISWTAVISACSRKGHGI 266

Query: 272 EAISMFSNMLSDEFLPNEFSVCSVLKACGEERELKIGRQLHGLIIKKIIKNDVFVGTSLV 331
           +AI MF  ML+  FLPNEF+VCS+LKAC EE+ L+ GRQ+H L++K++IK DVFVGTSL+
Sbjct: 267 KAIGMFIGMLNHWFLPNEFTVCSILKACSEEKALRFGRQVHSLVVKRMIKTDVFVGTSLM 326

Query: 332 DMYAKCGNLADSREVFDGMRNRNTVTWTSIIAGYAREGLGEEALNLFRLMKRQRIPANNL 391
           DMYAKCG ++D R+VFDGM NRNTVTWTSIIA +AREG GEEA++LFR+MKR+ + ANNL
Sbjct: 327 DMYAKCGEISDCRKVFDGMSNRNTVTWTSIIAAHAREGFGEEAISLFRIMKRRHLIANNL 386

Query: 392 TIVSILRACGSIEASLTGREVHAQIVKNSFQTNIHIGSTLVWFYCKCRNQLKASMVLQLM 451
           T+VSILRACGS+ A L G+E+HAQI+KNS + N++IGSTLVW YCKC     A  VLQ +
Sbjct: 387 TVVSILRACGSVGALLLGKELHAQIIKNSIEKNVYIGSTLVWLYCKCGESRDAFNVLQQL 446

Query: 452 PLRDVVSWTAIISGCAHLGHESEALEFLKNMIEEGVEPNSFTYSSTLKACAKMEAVLQGK 511
           P RDVVSWTA+ISGC+ LGHESEAL+FLK MI+EGVEPN FTYSS LKACA  E++L G+
Sbjct: 447 PSRDVVSWTAMISGCSSLGHESEALDFLKEMIQEGVEPNPFTYSSALKACANSESLLIGR 506

Query: 512 MIHSSANKTSALSNVFVGSALIYMYAKCGYVTEASQVFDSMPVRNLVSWKAMILCYARNG 571
            IHS A K  ALSNVFVGSALI+MYAKCG+V+EA +VFDSMP +NLVSWKAMI+ YARNG
Sbjct: 507 SIHSIAKKNHALSNVFVGSALIHMYAKCGFVSEAFRVFDSMPEKNLVSWKAMIMGYARNG 566

Query: 572 LCREALKLMYRMQAEGFEVDDYILGTVYGACGDVKCD--VDSSLEYRLQT 619
            CREALKLMYRM+AEGFEVDDYI  T+   CGD++ D  V+SS    L+T
Sbjct: 567 FCREALKLMYRMEAEGFEVDDYIFATILSTCGDIELDEAVESSATCYLET 616

BLAST of CsGy2G024800 vs. ExPASy Swiss-Prot
Match: P93005 (Pentatricopeptide repeat-containing protein At2g33680 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E19 PE=3 SV=1)

HSP 1 Score: 288.1 bits (736), Expect = 2.3e-76
Identity = 166/507 (32.74%), Postives = 279/507 (55.03%), Query Frame = 0

Query: 107 RAVHAFILRNFTSFG-IYVGNNLLSSYLRLGMLVDARKVFDEMPMRSVVTWTAIINGYID 166
           R  HA +++  +SFG IYV  +L+  Y + G++ D  KVF  MP R+  TW+ +++GY  
Sbjct: 138 RQAHALVVK-MSSFGDIYVDTSLVGMYCKAGLVEDGLKVFAYMPERNTYTWSTMVSGYAT 197

Query: 167 LDLTEEALALFSDSV--KSGVLANGQMFVCILNLCAKRLDFELGRQIHGVIVK-GNRGNL 226
               EEA+ +F+  +  K     +  +F  +L+  A  +   LGRQIH + +K G  G +
Sbjct: 198 RGRVEEAIKVFNLFLREKEEGSDSDYVFTAVLSSLAATIYVGLGRQIHCITIKNGLLGFV 257

Query: 227 IVDSAIIYFYAQCKDISSAFVAFERMRRRDVVCWTSMITSCSQQGLGREAISMFSNMLSD 286
            + +A++  Y++C+ ++ A   F+    R+ + W++M+T  SQ G   EA+ +FS M S 
Sbjct: 258 ALSNALVTMYSKCESLNEACKMFDSSGDRNSITWSAMVTGYSQNGESLEAVKLFSRMFSA 317

Query: 287 EFLPNEFSVCSVLKACGEERELKIGRQLHGLIIKKIIKNDVFVGTSLVDMYAKCGNLADS 346
              P+E+++  VL AC +   L+ G+QLH  ++K   +  +F  T+LVDMYAK G LAD+
Sbjct: 318 GIKPSEYTIVGVLNACSDICYLEEGKQLHSFLLKLGFERHLFATTALVDMYAKAGCLADA 377

Query: 347 REVFDGMRNRNTVTWTSIIAGYAREGLGEEALNLFRLMKRQRIPANNLTIVSILRACGSI 406
           R+ FD ++ R+   WTS+I+GY +    EEAL L+R MK   I  N+ T+ S+L+AC S+
Sbjct: 378 RKGFDCLQERDVALWTSLISGYVQNSDNEEALILYRRMKTAGIIPNDPTMASVLKACSSL 437

Query: 407 EASLTGREVHAQIVKNSFQTNIHIGSTLVWFYCKCRNQLKASMVLQLMPLRDVVSWTAII 466
                G++VH   +K+ F   + IGS L   Y KC +    ++V +  P +DVVSW A+I
Sbjct: 438 ATLELGKQVHGHTIKHGFGLEVPIGSALSTMYSKCGSLEDGNLVFRRTPNKDVVSWNAMI 497

Query: 467 SGCAHLGHESEALEFLKNMIEEGVEPNSFTYSSTLKACAKMEAVLQGKMIHSSANKTSAL 526
           SG +H G   EALE  + M+ EG+EP+  T+ + + AC+    V +G    +  +    L
Sbjct: 498 SGLSHNGQGDEALELFEEMLAEGMEPDDVTFVNIISACSHKGFVERGWFYFNMMSDQIGL 557

Query: 527 S-NVFVGSALIYMYAKCGYVTEASQVFDSMPV-RNLVSWKAMILCYARNGLCREALKLMY 586
              V   + ++ + ++ G + EA +  +S  +   L  W+ ++     +G C   +    
Sbjct: 558 DPKVDHYACMVDLLSRAGQLKEAKEFIESANIDHGLCLWRILLSACKNHGKCELGVYAGE 617

Query: 587 RMQAEGF-EVDDYI-LGTVYGACGDVK 606
           ++ A G  E   Y+ L  +Y A G ++
Sbjct: 618 KLMALGSRESSTYVQLSGIYTALGRMR 643

BLAST of CsGy2G024800 vs. ExPASy Swiss-Prot
Match: Q9LFI1 (Pentatricopeptide repeat-containing protein At3g53360, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-E86 PE=2 SV=1)

HSP 1 Score: 285.8 bits (730), Expect = 1.2e-75
Identity = 168/509 (33.01%), Postives = 268/509 (52.65%), Query Frame = 0

Query: 99  SSRSVKKLRAVHAFILRNFTSFGIYVGNNLLSSYLRLGMLVDARKVFDEMPMRSVVTWTA 158
           SSRS+ + R +H  IL +   +   + N++LS Y + G L DAR+VFD MP R++V++T+
Sbjct: 79  SSRSLAQGRKIHDHILNSNCKYDTILNNHILSMYGKCGSLRDAREVFDFMPERNLVSYTS 138

Query: 159 IINGYIDLDLTEEALALFSDSVKSGVLANGQMFVCILNLCAKRLDFELGRQIHGVIVK-G 218
           +I GY       EA+ L+   ++  ++ +   F  I+  CA   D  LG+Q+H  ++K  
Sbjct: 139 VITGYSQNGQGAEAIRLYLKMLQEDLVPDQFAFGSIIKACASSSDVGLGKQLHAQVIKLE 198

Query: 219 NRGNLIVDSAIIYFYAQCKDISSAFVAFERMRRRDVVCWTSMITSCSQQGLGREAISMFS 278
           +  +LI  +A+I  Y +   +S A   F  +  +D++ W+S+I   SQ G   EA+S   
Sbjct: 199 SSSHLIAQNALIAMYVRFNQMSDASRVFYGIPMKDLISWSSIIAGFSQLGFEFEALSHLK 258

Query: 279 NMLS-DEFLPNEFSVCSVLKACGEERELKIGRQLHGLIIKKIIKNDVFVGTSLVDMYAKC 338
            MLS   F PNE+   S LKAC        G Q+HGL IK  +  +   G SL DMYA+C
Sbjct: 259 EMLSFGVFHPNEYIFGSSLKACSSLLRPDYGSQIHGLCIKSELAGNAIAGCSLCDMYARC 318

Query: 339 GNLADSREVFDGMRNRNTVTWTSIIAGYAREGLGEEALNLFRLMKRQRIPANNLTIVSIL 398
           G L  +R VFD +   +T +W  IIAG A  G  +EA+++F  M+      + +++ S+L
Sbjct: 319 GFLNSARRVFDQIERPDTASWNVIIAGLANNGYADEAVSVFSQMRSSGFIPDAISLRSLL 378

Query: 399 RACGSIEASLTGREVHAQIVKNSFQTNIHIGSTLVWFYCKCRNQLKA-SMVLQLMPLRDV 458
            A     A   G ++H+ I+K  F  ++ + ++L+  Y  C +     ++        D 
Sbjct: 379 CAQTKPMALSQGMQIHSYIIKWGFLADLTVCNSLLTMYTFCSDLYCCFNLFEDFRNNADS 438

Query: 459 VSWTAIISGCAHLGHESEALEFLKNMIEEGVEPNSFTYSSTLKACAKMEAVLQGKMIHSS 518
           VSW  I++ C       E L   K M+    EP+  T  + L+ C ++ ++  G  +H  
Sbjct: 439 VSWNTILTACLQHEQPVEMLRLFKLMLVSECEPDHITMGNLLRGCVEISSLKLGSQVHCY 498

Query: 519 ANKTSALSNVFVGSALIYMYAKCGYVTEASQVFDSMPVRNLVSWKAMILCYARNGLCREA 578
           + KT      F+ + LI MYAKCG + +A ++FDSM  R++VSW  +I+ YA++G   EA
Sbjct: 499 SLKTGLAPEQFIKNGLIDMYAKCGSLGQARRIFDSMDNRDVVSWSTLIVGYAQSGFGEEA 558

Query: 579 LKLMYRMQAEGFEVDDYILGTVYGACGDV 605
           L L   M++ G E +      V  AC  V
Sbjct: 559 LILFKEMKSAGIEPNHVTFVGVLTACSHV 587

BLAST of CsGy2G024800 vs. ExPASy Swiss-Prot
Match: Q3E6Q1 (Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H40 PE=2 SV=1)

HSP 1 Score: 279.6 bits (714), Expect = 8.3e-74
Identity = 153/506 (30.24%), Postives = 266/506 (52.57%), Query Frame = 0

Query: 97  LRSSRSVKKLRAVHAFILRNFTSFGIYVGNNLLSSYLRLGMLVDARKVFDEMPMRSVVTW 156
           L    S+K+LR +   + +N      +    L+S + R G + +A +VF+ +  +  V +
Sbjct: 44  LERCSSLKELRQILPLVFKNGLYQEHFFQTKLVSLFCRYGSVDEAARVFEPIDSKLNVLY 103

Query: 157 TAIINGYIDLDLTEEALALFSDSVKSGVLANGQMFVCILNLCAKRLDFELGRQIHGVIVK 216
             ++ G+  +   ++AL  F       V      F  +L +C    +  +G++IHG++VK
Sbjct: 104 HTMLKGFAKVSDLDKALQFFVRMRYDDVEPVVYNFTYLLKVCGDEAELRVGKEIHGLLVK 163

Query: 217 -GNRGNLIVDSAIIYFYAQCKDISSAFVAFERMRRRDVVCWTSMITSCSQQGLGREAISM 276
            G   +L   + +   YA+C+ ++ A   F+RM  RD+V W +++   SQ G+ R A+ M
Sbjct: 164 SGFSLDLFAMTGLENMYAKCRQVNEARKVFDRMPERDLVSWNTIVAGYSQNGMARMALEM 223

Query: 277 FSNMLSDEFLPNEFSVCSVLKACGEERELKIGRQLHGLIIKKIIKNDVFVGTSLVDMYAK 336
             +M  +   P+  ++ SVL A    R + +G+++HG  ++    + V + T+LVDMYAK
Sbjct: 224 VKSMCEENLKPSFITIVSVLPAVSALRLISVGKEIHGYAMRSGFDSLVNISTALVDMYAK 283

Query: 337 CGNLADSREVFDGMRNRNTVTWTSIIAGYAREGLGEEALNLFRLMKRQRIPANNLTIVSI 396
           CG+L  +R++FDGM  RN V+W S+I  Y +    +EA+ +F+ M  + +   +++++  
Sbjct: 284 CGSLETARQLFDGMLERNVVSWNSMIDAYVQNENPKEAMLIFQKMLDEGVKPTDVSVMGA 343

Query: 397 LRACGSIEASLTGREVHAQIVKNSFQTNIHIGSTLVWFYCKCRNQLKASMVLQLMPLRDV 456
           L AC  +     GR +H   V+     N+ + ++L+  YCKC+    A+ +   +  R +
Sbjct: 344 LHACADLGDLERGRFIHKLSVELGLDRNVSVVNSLISMYCKCKEVDTAASMFGKLQSRTL 403

Query: 457 VSWTAIISGCAHLGHESEALEFLKNMIEEGVEPNSFTYSSTLKACAKMEAVLQGKMIHSS 516
           VSW A+I G A  G   +AL +   M    V+P++FTY S + A A++      K IH  
Sbjct: 404 VSWNAMILGFAQNGRPIDALNYFSQMRSRTVKPDTFTYVSVITAIAELSITHHAKWIHGV 463

Query: 517 ANKTSALSNVFVGSALIYMYAKCGYVTEASQVFDSMPVRNLVSWKAMILCYARNGLCREA 576
             ++    NVFV +AL+ MYAKCG +  A  +FD M  R++ +W AMI  Y  +G  + A
Sbjct: 464 VMRSCLDKNVFVTTALVDMYAKCGAIMIARLIFDMMSERHVTTWNAMIDGYGTHGFGKAA 523

Query: 577 LKLMYRMQAEGFEVDDYILGTVYGAC 602
           L+L   MQ    + +     +V  AC
Sbjct: 524 LELFEEMQKGTIKPNGVTFLSVISAC 549

BLAST of CsGy2G024800 vs. ExPASy Swiss-Prot
Match: Q9SS83 (Pentatricopeptide repeat-containing protein At3g09040, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-E88 PE=2 SV=1)

HSP 1 Score: 277.3 bits (708), Expect = 4.1e-73
Identity = 167/497 (33.60%), Postives = 267/497 (53.72%), Query Frame = 0

Query: 109 VHAFILRNFTSFGIYVGNNLLSSYLRLGMLVDARKVFDEMPMRSVVTWTAIINGYIDLDL 168
           VHA  ++   +  IYVG++L+S Y +   +  A KVF+ +  ++ V W A+I GY     
Sbjct: 349 VHAEAIKLGLASNIYVGSSLVSMYSKCEKMEAAAKVFEALEEKNDVFWNAMIRGYAHNGE 408

Query: 169 TEEALALFSDSVKSGVLANGQMFVCILNLCAKRLDFELGRQIHGVIVKGNRG-NLIVDSA 228
           + + + LF D   SG   +   F  +L+ CA   D E+G Q H +I+K     NL V +A
Sbjct: 409 SHKVMELFMDMKSSGYNIDDFTFTSLLSTCAASHDLEMGSQFHSIIIKKKLAKNLFVGNA 468

Query: 229 IIYFYAQCKDISSAFVAFERMRRRDVVCWTSMITSCSQQGLGREAISMFSNMLSDEFLPN 288
           ++  YA+C  +  A   FERM  RD V W ++I S  Q     EA  +F  M     + +
Sbjct: 469 LVDMYAKCGALEDARQIFERMCDRDNVTWNTIIGSYVQDENESEAFDLFKRMNLCGIVSD 528

Query: 289 EFSVCSVLKACGEERELKIGRQLHGLIIKKIIKNDVFVGTSLVDMYAKCGNLADSREVFD 348
              + S LKAC     L  G+Q+H L +K  +  D+  G+SL+DMY+KCG + D+R+VF 
Sbjct: 529 GACLASTLKACTHVHGLYQGKQVHCLSVKCGLDRDLHTGSSLIDMYSKCGIIKDARKVFS 588

Query: 349 GMRNRNTVTWTSIIAGYAREGLGEEALNLFRLMKRQRIPANNLTIVSILRACGSIEASLT 408
            +   + V+  ++IAGY++  L EEA+ LF+ M  + +  + +T  +I+ AC   E+   
Sbjct: 589 SLPEWSVVSMNALIAGYSQNNL-EEAVVLFQEMLTRGVNPSEITFATIVEACHKPESLTL 648

Query: 409 GREVHAQIVKNSFQT-NIHIGSTLVWFYCKCRNQLKA-SMVLQLMPLRDVVSWTAIISGC 468
           G + H QI K  F +   ++G +L+  Y   R   +A ++  +L   + +V WT ++SG 
Sbjct: 649 GTQFHGQITKRGFSSEGEYLGISLLGMYMNSRGMTEACALFSELSSPKSIVLWTGMMSGH 708

Query: 469 AHLGHESEALEFLKNMIEEGVEPNSFTYSSTLKACAKMEAVLQGKMIHSSANKTSALSNV 528
           +  G   EAL+F K M  +GV P+  T+ + L+ C+ + ++ +G+ IHS     +   + 
Sbjct: 709 SQNGFYEEALKFYKEMRHDGVLPDQATFVTVLRVCSVLSSLREGRAIHSLIFHLAHDLDE 768

Query: 529 FVGSALIYMYAKCGYVTEASQVFDSMPVR-NLVSWKAMILCYARNGLCREALKLMYRMQA 588
              + LI MYAKCG +  +SQVFD M  R N+VSW ++I  YA+NG   +ALK+   M+ 
Sbjct: 769 LTSNTLIDMYAKCGDMKGSSQVFDEMRRRSNVVSWNSLINGYAKNGYAEDALKIFDSMRQ 828

Query: 589 EGFEVDDYILGTVYGAC 602
                D+     V  AC
Sbjct: 829 SHIMPDEITFLGVLTAC 844

BLAST of CsGy2G024800 vs. NCBI nr
Match: XP_004138810.1 (pentatricopeptide repeat-containing protein At4g18520, chloroplastic [Cucumis sativus] >KGN63116.1 hypothetical protein Csa_022382 [Cucumis sativus])

HSP 1 Score: 1240 bits (3208), Expect = 0.0
Identity = 619/619 (100.00%), Postives = 619/619 (100.00%), Query Frame = 0

Query: 1   MFSPAIISQSPPCLTFQPTSTSLSTRRTCSKWNLTTFNRCKSSTSFPFNFVEDHSKALPV 60
           MFSPAIISQSPPCLTFQPTSTSLSTRRTCSKWNLTTFNRCKSSTSFPFNFVEDHSKALPV
Sbjct: 1   MFSPAIISQSPPCLTFQPTSTSLSTRRTCSKWNLTTFNRCKSSTSFPFNFVEDHSKALPV 60

Query: 61  ACATGKCTTTEEYADVESCSNQSVSGCLSPYLIGVWLRSSRSVKKLRAVHAFILRNFTSF 120
           ACATGKCTTTEEYADVESCSNQSVSGCLSPYLIGVWLRSSRSVKKLRAVHAFILRNFTSF
Sbjct: 61  ACATGKCTTTEEYADVESCSNQSVSGCLSPYLIGVWLRSSRSVKKLRAVHAFILRNFTSF 120

Query: 121 GIYVGNNLLSSYLRLGMLVDARKVFDEMPMRSVVTWTAIINGYIDLDLTEEALALFSDSV 180
           GIYVGNNLLSSYLRLGMLVDARKVFDEMPMRSVVTWTAIINGYIDLDLTEEALALFSDSV
Sbjct: 121 GIYVGNNLLSSYLRLGMLVDARKVFDEMPMRSVVTWTAIINGYIDLDLTEEALALFSDSV 180

Query: 181 KSGVLANGQMFVCILNLCAKRLDFELGRQIHGVIVKGNRGNLIVDSAIIYFYAQCKDISS 240
           KSGVLANGQMFVCILNLCAKRLDFELGRQIHGVIVKGNRGNLIVDSAIIYFYAQCKDISS
Sbjct: 181 KSGVLANGQMFVCILNLCAKRLDFELGRQIHGVIVKGNRGNLIVDSAIIYFYAQCKDISS 240

Query: 241 AFVAFERMRRRDVVCWTSMITSCSQQGLGREAISMFSNMLSDEFLPNEFSVCSVLKACGE 300
           AFVAFERMRRRDVVCWTSMITSCSQQGLGREAISMFSNMLSDEFLPNEFSVCSVLKACGE
Sbjct: 241 AFVAFERMRRRDVVCWTSMITSCSQQGLGREAISMFSNMLSDEFLPNEFSVCSVLKACGE 300

Query: 301 ERELKIGRQLHGLIIKKIIKNDVFVGTSLVDMYAKCGNLADSREVFDGMRNRNTVTWTSI 360
           ERELKIGRQLHGLIIKKIIKNDVFVGTSLVDMYAKCGNLADSREVFDGMRNRNTVTWTSI
Sbjct: 301 ERELKIGRQLHGLIIKKIIKNDVFVGTSLVDMYAKCGNLADSREVFDGMRNRNTVTWTSI 360

Query: 361 IAGYAREGLGEEALNLFRLMKRQRIPANNLTIVSILRACGSIEASLTGREVHAQIVKNSF 420
           IAGYAREGLGEEALNLFRLMKRQRIPANNLTIVSILRACGSIEASLTGREVHAQIVKNSF
Sbjct: 361 IAGYAREGLGEEALNLFRLMKRQRIPANNLTIVSILRACGSIEASLTGREVHAQIVKNSF 420

Query: 421 QTNIHIGSTLVWFYCKCRNQLKASMVLQLMPLRDVVSWTAIISGCAHLGHESEALEFLKN 480
           QTNIHIGSTLVWFYCKCRNQLKASMVLQLMPLRDVVSWTAIISGCAHLGHESEALEFLKN
Sbjct: 421 QTNIHIGSTLVWFYCKCRNQLKASMVLQLMPLRDVVSWTAIISGCAHLGHESEALEFLKN 480

Query: 481 MIEEGVEPNSFTYSSTLKACAKMEAVLQGKMIHSSANKTSALSNVFVGSALIYMYAKCGY 540
           MIEEGVEPNSFTYSSTLKACAKMEAVLQGKMIHSSANKTSALSNVFVGSALIYMYAKCGY
Sbjct: 481 MIEEGVEPNSFTYSSTLKACAKMEAVLQGKMIHSSANKTSALSNVFVGSALIYMYAKCGY 540

Query: 541 VTEASQVFDSMPVRNLVSWKAMILCYARNGLCREALKLMYRMQAEGFEVDDYILGTVYGA 600
           VTEASQVFDSMPVRNLVSWKAMILCYARNGLCREALKLMYRMQAEGFEVDDYILGTVYGA
Sbjct: 541 VTEASQVFDSMPVRNLVSWKAMILCYARNGLCREALKLMYRMQAEGFEVDDYILGTVYGA 600

Query: 601 CGDVKCDVDSSLEYRLQTH 619
           CGDVKCDVDSSLEYRLQTH
Sbjct: 601 CGDVKCDVDSSLEYRLQTH 619

BLAST of CsGy2G024800 vs. NCBI nr
Match: TYK06655.1 (pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 1179 bits (3049), Expect = 0.0
Identity = 594/624 (95.19%), Postives = 601/624 (96.31%), Query Frame = 0

Query: 1   MFSPAIIS-----QSPPCLTFQPTSTSLSTRRTCSKWNLTTFNRCKSSTSFPFNFVEDHS 60
           MFSPA IS     QSPPCLTFQ TSTS S RRTCSK NLTTFNR KSST+FPF FVED S
Sbjct: 1   MFSPAFISTAITSQSPPCLTFQRTSTSQSARRTCSKRNLTTFNRYKSSTNFPFKFVEDQS 60

Query: 61  KALPVACATGKCTTTEEYADVESCSNQSVSGCLSPYLIGVWLRSSRSVKKLRAVHAFILR 120
           KA  +AC T KCTTTEEYADVESCSNQSVSGCLS YLIGVWLRSSRSVKKLRAVHAFILR
Sbjct: 61  KAFSIACTTAKCTTTEEYADVESCSNQSVSGCLSHYLIGVWLRSSRSVKKLRAVHAFILR 120

Query: 121 NFTSFGIYVGNNLLSSYLRLGMLVDARKVFDEMPMRSVVTWTAIINGYIDLDLTEEALAL 180
           +FTSF IYVGNNLLSSYLR+GMLVDARKVFDEMPMRSVVTWTAIINGYIDLDLTEEALAL
Sbjct: 121 HFTSFSIYVGNNLLSSYLRVGMLVDARKVFDEMPMRSVVTWTAIINGYIDLDLTEEALAL 180

Query: 181 FSDSVKSGVLANGQMFVCILNLCAKRLDFELGRQIHGVIVKGNRGNLIVDSAIIYFYAQC 240
           FSDSVKSGVLANGQMFVCILNLCAKRLDFELGRQIHGVIVKGNRGNLIVDSAIIYFYAQC
Sbjct: 181 FSDSVKSGVLANGQMFVCILNLCAKRLDFELGRQIHGVIVKGNRGNLIVDSAIIYFYAQC 240

Query: 241 KDISSAFVAFERMRRRDVVCWTSMITSCSQQGLGREAISMFSNMLSDEFLPNEFSVCSVL 300
           KDISSAFVAFERMRRRDVVCWTSMITSCSQQGLG+EAISMFSNMLSD FLPNEFSVCSVL
Sbjct: 241 KDISSAFVAFERMRRRDVVCWTSMITSCSQQGLGQEAISMFSNMLSDGFLPNEFSVCSVL 300

Query: 301 KACGEERELKIGRQLHGLIIKKIIKNDVFVGTSLVDMYAKCGNLADSREVFDGMRNRNTV 360
           KACGEERELKIGRQLHGLIIKKIIKNDVFVGTSLVDMYAKCGNL DSREVFDGMRNRNTV
Sbjct: 301 KACGEERELKIGRQLHGLIIKKIIKNDVFVGTSLVDMYAKCGNLVDSREVFDGMRNRNTV 360

Query: 361 TWTSIIAGYAREGLGEEALNLFRLMKRQRIPANNLTIVSILRACGSIEASLTGREVHAQI 420
           TWTSIIAGYAREGLGEEALNLFRLMKRQRIPANNLTIVSILRACGSIEASLTGREVHAQI
Sbjct: 361 TWTSIIAGYAREGLGEEALNLFRLMKRQRIPANNLTIVSILRACGSIEASLTGREVHAQI 420

Query: 421 VKNSFQTNIHIGSTLVWFYCKCRNQLKASMVLQLMPLRDVVSWTAIISGCAHLGHESEAL 480
           VKNSFQTNIHIGSTLVWFYCKCRNQLKASMVLQLMPLRDVVSWTAIISGCAHLGHESEAL
Sbjct: 421 VKNSFQTNIHIGSTLVWFYCKCRNQLKASMVLQLMPLRDVVSWTAIISGCAHLGHESEAL 480

Query: 481 EFLKNMIEEGVEPNSFTYSSTLKACAKMEAVLQGKMIHSSANKTSALSNVFVGSALIYMY 540
           EFLKNMIEEGVEPNSFTYSSTLKACAKMEA+LQGKMIHSSANKTSALSNVFVGSALIYMY
Sbjct: 481 EFLKNMIEEGVEPNSFTYSSTLKACAKMEAILQGKMIHSSANKTSALSNVFVGSALIYMY 540

Query: 541 AKCGYVTEASQVFDSMPVRNLVSWKAMILCYARNGLCREALKLMYRMQAEGFEVDDYILG 600
           AKCGYVTEASQVFDSMPVRNLVSWKAMILCYARNGLCREALKLMYRMQAEGFEVDDYILG
Sbjct: 541 AKCGYVTEASQVFDSMPVRNLVSWKAMILCYARNGLCREALKLMYRMQAEGFEVDDYILG 600

Query: 601 TVYGACGDVKCDVDSSLEYRLQTH 619
           TVYGACGDVKCDVDSS E+ LQTH
Sbjct: 601 TVYGACGDVKCDVDSSFEHSLQTH 624

BLAST of CsGy2G024800 vs. NCBI nr
Match: XP_008441245.1 (PREDICTED: pentatricopeptide repeat-containing protein At4g18520 [Cucumis melo])

HSP 1 Score: 1176 bits (3042), Expect = 0.0
Identity = 593/624 (95.03%), Postives = 600/624 (96.15%), Query Frame = 0

Query: 1   MFSPAIIS-----QSPPCLTFQPTSTSLSTRRTCSKWNLTTFNRCKSSTSFPFNFVEDHS 60
           MFSPA IS     QSPPCLTFQ TSTS S RRTCSK NLTTFNR KSST+FPF FVED S
Sbjct: 1   MFSPAFISTAITSQSPPCLTFQRTSTSQSARRTCSKRNLTTFNRYKSSTNFPFKFVEDQS 60

Query: 61  KALPVACATGKCTTTEEYADVESCSNQSVSGCLSPYLIGVWLRSSRSVKKLRAVHAFILR 120
           KA  +AC T KCTTTEEYADVESCSNQSVSGCLS YLIGVWLRSSRSVKKLRAVHAFILR
Sbjct: 61  KAFSIACTTAKCTTTEEYADVESCSNQSVSGCLSHYLIGVWLRSSRSVKKLRAVHAFILR 120

Query: 121 NFTSFGIYVGNNLLSSYLRLGMLVDARKVFDEMPMRSVVTWTAIINGYIDLDLTEEALAL 180
           +FTSF IYVGNNLLSSYLR+GMLVDARKVFDEMPMRSVVTWTAIINGYIDLDLTEEALAL
Sbjct: 121 HFTSFSIYVGNNLLSSYLRVGMLVDARKVFDEMPMRSVVTWTAIINGYIDLDLTEEALAL 180

Query: 181 FSDSVKSGVLANGQMFVCILNLCAKRLDFELGRQIHGVIVKGNRGNLIVDSAIIYFYAQC 240
           FSDSVKSGVLANGQMFVCILNLCAKRLDFELGRQIHGVIVKGNRGNLIVDSAIIYFYAQC
Sbjct: 181 FSDSVKSGVLANGQMFVCILNLCAKRLDFELGRQIHGVIVKGNRGNLIVDSAIIYFYAQC 240

Query: 241 KDISSAFVAFERMRRRDVVCWTSMITSCSQQGLGREAISMFSNMLSDEFLPNEFSVCSVL 300
           KDISSAFVAFERM RRDVVCWTSMITSCSQQGLG+EAISMFSNMLSD FLPNEFSVCSVL
Sbjct: 241 KDISSAFVAFERMGRRDVVCWTSMITSCSQQGLGQEAISMFSNMLSDGFLPNEFSVCSVL 300

Query: 301 KACGEERELKIGRQLHGLIIKKIIKNDVFVGTSLVDMYAKCGNLADSREVFDGMRNRNTV 360
           KACGEERELKIGRQLHGLIIKKIIKNDVFVGTSLVDMYAKCGNL DSREVFDGMRNRNTV
Sbjct: 301 KACGEERELKIGRQLHGLIIKKIIKNDVFVGTSLVDMYAKCGNLVDSREVFDGMRNRNTV 360

Query: 361 TWTSIIAGYAREGLGEEALNLFRLMKRQRIPANNLTIVSILRACGSIEASLTGREVHAQI 420
           TWTSIIAGYAREGLGEEALNLFRLMKRQRIPANNLTIVSILRACGSIEASLTGREVHAQI
Sbjct: 361 TWTSIIAGYAREGLGEEALNLFRLMKRQRIPANNLTIVSILRACGSIEASLTGREVHAQI 420

Query: 421 VKNSFQTNIHIGSTLVWFYCKCRNQLKASMVLQLMPLRDVVSWTAIISGCAHLGHESEAL 480
           VKNSFQTNIHIGSTLVWFYCKCRNQLKASMVLQLMPLRDVVSWTAIISGCAHLGHESEAL
Sbjct: 421 VKNSFQTNIHIGSTLVWFYCKCRNQLKASMVLQLMPLRDVVSWTAIISGCAHLGHESEAL 480

Query: 481 EFLKNMIEEGVEPNSFTYSSTLKACAKMEAVLQGKMIHSSANKTSALSNVFVGSALIYMY 540
           EFLKNMIEEGVEPNSFTYSSTLKACAKMEA+LQGKMIHSSANKTSALSNVFVGSALIYMY
Sbjct: 481 EFLKNMIEEGVEPNSFTYSSTLKACAKMEAILQGKMIHSSANKTSALSNVFVGSALIYMY 540

Query: 541 AKCGYVTEASQVFDSMPVRNLVSWKAMILCYARNGLCREALKLMYRMQAEGFEVDDYILG 600
           AKCGYVTEASQVFDSMPVRNLVSWKAMILCYARNGLCREALKLMYRMQAEGFEVDDYILG
Sbjct: 541 AKCGYVTEASQVFDSMPVRNLVSWKAMILCYARNGLCREALKLMYRMQAEGFEVDDYILG 600

Query: 601 TVYGACGDVKCDVDSSLEYRLQTH 619
           TVYGACGDVKCDVDSS E+ LQTH
Sbjct: 601 TVYGACGDVKCDVDSSFEHSLQTH 624

BLAST of CsGy2G024800 vs. NCBI nr
Match: XP_038884364.1 (pentatricopeptide repeat-containing protein At4g18520, chloroplastic-like [Benincasa hispida])

HSP 1 Score: 1095 bits (2832), Expect = 0.0
Identity = 544/617 (88.17%), Postives = 576/617 (93.35%), Query Frame = 0

Query: 3   SPAIISQSPPCLTFQPTSTSLSTRRTCSKWNLTTFNRCKSSTSFPFNFVEDHSKALPVAC 62
           S A IS  PPC + QP  T L+ RRTCSKWN T+ + CKSST+F  NFV+D S+A PVAC
Sbjct: 8   STATISHPPPCFSVQPPPTLLTARRTCSKWNFTSIDHCKSSTNFRLNFVKDLSRASPVAC 67

Query: 63  ATGKCTTTEEYADVESCSNQSVSGCLSPYLIGVWLRSSRSVKKLRAVHAFILRNFTSFGI 122
              +  TT+EYA+VESCS+QSVSGCLSPYLI VWLRSSRS K+LRA+HAFILR+ TSF I
Sbjct: 68  IA-ETFTTQEYANVESCSSQSVSGCLSPYLIAVWLRSSRSAKELRAIHAFILRHITSFEI 127

Query: 123 YVGNNLLSSYLRLGMLVDARKVFDEMPMRSVVTWTAIINGYIDLDLTEEALALFSDSVKS 182
           YVGNNL+SSYLR GMLVDARK FDEMP+R+VVTWT IINGYI LD TEEAL LFSDSVK+
Sbjct: 128 YVGNNLVSSYLRFGMLVDARKAFDEMPVRNVVTWTTIINGYIHLDFTEEALGLFSDSVKN 187

Query: 183 GVLANGQMFVCILNLCAKRLDFELGRQIHGVIVKGNRGNLIVDSAIIYFYAQCKDISSAF 242
           GV ANG+MFVCILNLCAKRLDFELGRQIHGVIVKGN GNLI+DSAI+YFYAQCKDISSAF
Sbjct: 188 GVQANGKMFVCILNLCAKRLDFELGRQIHGVIVKGNWGNLIIDSAIVYFYAQCKDISSAF 247

Query: 243 VAFERMRRRDVVCWTSMITSCSQQGLGREAISMFSNMLSDEFLPNEFSVCSVLKACGEER 302
           VAFE M +RDVVCWTSMITSCSQQGLGREAIS+FSNML+D FLPNEFSVCSVLKACGEER
Sbjct: 248 VAFEHMPKRDVVCWTSMITSCSQQGLGREAISLFSNMLNDGFLPNEFSVCSVLKACGEER 307

Query: 303 ELKIGRQLHGLIIKKIIKNDVFVGTSLVDMYAKCGNLADSREVFDGMRNRNTVTWTSIIA 362
           ELKIGRQLHGLIIKKIIKNDVFVGTSLVDMYAKCGNLADSREVF+GMRNRNTVTWTSIIA
Sbjct: 308 ELKIGRQLHGLIIKKIIKNDVFVGTSLVDMYAKCGNLADSREVFNGMRNRNTVTWTSIIA 367

Query: 363 GYAREGLGEEALNLFRLMKRQRIPANNLTIVSILRACGSIEASLTGREVHAQIVKNSFQT 422
           GYAREG GEEA+NLFRLMKRQRIPANNLTIVSILRACGSI ASLTGREVHAQIVK+SFQT
Sbjct: 368 GYAREGRGEEAVNLFRLMKRQRIPANNLTIVSILRACGSIGASLTGREVHAQIVKSSFQT 427

Query: 423 NIHIGSTLVWFYCKCRNQLKASMVLQLMPLRDVVSWTAIISGCAHLGHESEALEFLKNMI 482
           NIHIGSTLVWFYCKCRN+LKASMVLQLMPLRDVVSWTAIISGCAHLGHESEALEFLKNM+
Sbjct: 428 NIHIGSTLVWFYCKCRNRLKASMVLQLMPLRDVVSWTAIISGCAHLGHESEALEFLKNMV 487

Query: 483 EEGVEPNSFTYSSTLKACAKMEAVLQGKMIHSSANKTSALSNVFVGSALIYMYAKCGYVT 542
           EEGVEPNSFTYSS LKACAKMEA+LQGK+IHSSANKTSALSNVFVGSALIYMYAKCGYVT
Sbjct: 488 EEGVEPNSFTYSSALKACAKMEAILQGKLIHSSANKTSALSNVFVGSALIYMYAKCGYVT 547

Query: 543 EASQVFDSMPVRNLVSWKAMILCYARNGLCREALKLMYRMQAEGFEVDDYILGTVYGACG 602
           EASQVFDSMPVRNLVSWKAMILCYARNGLCREALKLMYRMQAEG EVDDYILGTVYGACG
Sbjct: 548 EASQVFDSMPVRNLVSWKAMILCYARNGLCREALKLMYRMQAEGIEVDDYILGTVYGACG 607

Query: 603 DVKCDVDSSLEYRLQTH 619
           DVKCDVDSSLEYRLQT+
Sbjct: 608 DVKCDVDSSLEYRLQTY 623

BLAST of CsGy2G024800 vs. NCBI nr
Match: KAG7015915.1 (Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1063 bits (2748), Expect = 0.0
Identity = 525/616 (85.23%), Postives = 564/616 (91.56%), Query Frame = 0

Query: 3   SPAIISQSPPCLTFQPTSTSLSTRRTCSKWNLTTFNRCKSSTSFPFNFVEDHSKALPVAC 62
           S A ISQSPPC + QP   SLSTRR CSKWNLT+ +RCKS T+  FNF++D  +A  VAC
Sbjct: 53  STATISQSPPCFSVQPPPISLSTRRACSKWNLTSIDRCKSPTNLRFNFIKDPFRASLVAC 112

Query: 63  ATGKCTTTEEYADVESCSNQSVSGCLSPYLIGVWLRSSRSVKKLRAVHAFILRNFTSFGI 122
            T  CT  +E  DVESCSN+ V+GCLSPYLI  WLRSSR VK+LRA+HAFILR+F+S   
Sbjct: 113 TTETCTA-QECTDVESCSNEPVNGCLSPYLIAAWLRSSRGVKELRAIHAFILRHFSSLVT 172

Query: 123 YVGNNLLSSYLRLGMLVDARKVFDEMPMRSVVTWTAIINGYIDLDLTEEALALFSDSVKS 182
           YVGNNL+SSYLR GML+DAR+VFDEMPMRSVVTWTAIINGYID DLT+EAL LF DSV+S
Sbjct: 173 YVGNNLISSYLRFGMLIDARRVFDEMPMRSVVTWTAIINGYIDFDLTDEALGLFGDSVRS 232

Query: 183 GVLANGQMFVCILNLCAKRLDFELGRQIHGVIVKGNRGNLIVDSAIIYFYAQCKDISSAF 242
           GV ANG+MFVCILNLCAKRLDFELGRQIHG IVKGN GNLIVDSAI+YFYAQCKDISSAF
Sbjct: 233 GVRANGKMFVCILNLCAKRLDFELGRQIHGAIVKGNWGNLIVDSAIVYFYAQCKDISSAF 292

Query: 243 VAFERMRRRDVVCWTSMITSCSQQGLGREAISMFSNMLSDEFLPNEFSVCSVLKACGEER 302
           VAFE M +RD+VCWTSMITSCSQQGLGREAISMFSNMLSD FLPNEFSVCSVLKACGEER
Sbjct: 293 VAFEHMPKRDIVCWTSMITSCSQQGLGREAISMFSNMLSDGFLPNEFSVCSVLKACGEER 352

Query: 303 ELKIGRQLHGLIIKKIIKNDVFVGTSLVDMYAKCGNLADSREVFDGMRNRNTVTWTSIIA 362
           ELKIG+QLHGL+IKKIIKNDVFVG+SLVDMYAK G+LADSREVFD MRNRNTVTWTSIIA
Sbjct: 353 ELKIGKQLHGLVIKKIIKNDVFVGSSLVDMYAKSGSLADSREVFDEMRNRNTVTWTSIIA 412

Query: 363 GYAREGLGEEALNLFRLMKRQRIPANNLTIVSILRACGSIEASLTGREVHAQIVKNSFQT 422
           GYAREGLGEEA+NLFRLM +QR+PAN+LTIVSILRACGSI ASLTGREVHAQIVKNSFQT
Sbjct: 413 GYAREGLGEEAVNLFRLMNKQRVPANDLTIVSILRACGSIGASLTGREVHAQIVKNSFQT 472

Query: 423 NIHIGSTLVWFYCKCRNQLKASMVLQLMPLRDVVSWTAIISGCAHLGHESEALEFLKNMI 482
           N+HIGSTLVWFYCKCRNQ KASMVLQ MPLRDVVSWTAIISGCAHLGHESEALEFL+NMI
Sbjct: 473 NLHIGSTLVWFYCKCRNQPKASMVLQQMPLRDVVSWTAIISGCAHLGHESEALEFLENMI 532

Query: 483 EEGVEPNSFTYSSTLKACAKMEAVLQGKMIHSSANKTSALSNVFVGSALIYMYAKCGYVT 542
           EEGVEPNSFTYSS LKACAKMEA+LQG++IHSSANKTSA+SNVFVGSALIYMYAKCGYVT
Sbjct: 533 EEGVEPNSFTYSSALKACAKMEAILQGRLIHSSANKTSAVSNVFVGSALIYMYAKCGYVT 592

Query: 543 EASQVFDSMPVRNLVSWKAMILCYARNGLCREALKLMYRMQAEGFEVDDYILGTVYGACG 602
           EASQVFDSMP RNLVSWKAMILCYARNGLCREALKLMYRMQAEG EVDD+I GTVYG CG
Sbjct: 593 EASQVFDSMPERNLVSWKAMILCYARNGLCREALKLMYRMQAEGIEVDDHIFGTVYGVCG 652

Query: 603 DVKCDVDSSLEYRLQT 618
           DVKC+VDSSLEY LQT
Sbjct: 653 DVKCEVDSSLEYSLQT 667

BLAST of CsGy2G024800 vs. ExPASy TrEMBL
Match: A0A0A0LSX2 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G403710 PE=4 SV=1)

HSP 1 Score: 1240 bits (3208), Expect = 0.0
Identity = 619/619 (100.00%), Postives = 619/619 (100.00%), Query Frame = 0

Query: 1   MFSPAIISQSPPCLTFQPTSTSLSTRRTCSKWNLTTFNRCKSSTSFPFNFVEDHSKALPV 60
           MFSPAIISQSPPCLTFQPTSTSLSTRRTCSKWNLTTFNRCKSSTSFPFNFVEDHSKALPV
Sbjct: 1   MFSPAIISQSPPCLTFQPTSTSLSTRRTCSKWNLTTFNRCKSSTSFPFNFVEDHSKALPV 60

Query: 61  ACATGKCTTTEEYADVESCSNQSVSGCLSPYLIGVWLRSSRSVKKLRAVHAFILRNFTSF 120
           ACATGKCTTTEEYADVESCSNQSVSGCLSPYLIGVWLRSSRSVKKLRAVHAFILRNFTSF
Sbjct: 61  ACATGKCTTTEEYADVESCSNQSVSGCLSPYLIGVWLRSSRSVKKLRAVHAFILRNFTSF 120

Query: 121 GIYVGNNLLSSYLRLGMLVDARKVFDEMPMRSVVTWTAIINGYIDLDLTEEALALFSDSV 180
           GIYVGNNLLSSYLRLGMLVDARKVFDEMPMRSVVTWTAIINGYIDLDLTEEALALFSDSV
Sbjct: 121 GIYVGNNLLSSYLRLGMLVDARKVFDEMPMRSVVTWTAIINGYIDLDLTEEALALFSDSV 180

Query: 181 KSGVLANGQMFVCILNLCAKRLDFELGRQIHGVIVKGNRGNLIVDSAIIYFYAQCKDISS 240
           KSGVLANGQMFVCILNLCAKRLDFELGRQIHGVIVKGNRGNLIVDSAIIYFYAQCKDISS
Sbjct: 181 KSGVLANGQMFVCILNLCAKRLDFELGRQIHGVIVKGNRGNLIVDSAIIYFYAQCKDISS 240

Query: 241 AFVAFERMRRRDVVCWTSMITSCSQQGLGREAISMFSNMLSDEFLPNEFSVCSVLKACGE 300
           AFVAFERMRRRDVVCWTSMITSCSQQGLGREAISMFSNMLSDEFLPNEFSVCSVLKACGE
Sbjct: 241 AFVAFERMRRRDVVCWTSMITSCSQQGLGREAISMFSNMLSDEFLPNEFSVCSVLKACGE 300

Query: 301 ERELKIGRQLHGLIIKKIIKNDVFVGTSLVDMYAKCGNLADSREVFDGMRNRNTVTWTSI 360
           ERELKIGRQLHGLIIKKIIKNDVFVGTSLVDMYAKCGNLADSREVFDGMRNRNTVTWTSI
Sbjct: 301 ERELKIGRQLHGLIIKKIIKNDVFVGTSLVDMYAKCGNLADSREVFDGMRNRNTVTWTSI 360

Query: 361 IAGYAREGLGEEALNLFRLMKRQRIPANNLTIVSILRACGSIEASLTGREVHAQIVKNSF 420
           IAGYAREGLGEEALNLFRLMKRQRIPANNLTIVSILRACGSIEASLTGREVHAQIVKNSF
Sbjct: 361 IAGYAREGLGEEALNLFRLMKRQRIPANNLTIVSILRACGSIEASLTGREVHAQIVKNSF 420

Query: 421 QTNIHIGSTLVWFYCKCRNQLKASMVLQLMPLRDVVSWTAIISGCAHLGHESEALEFLKN 480
           QTNIHIGSTLVWFYCKCRNQLKASMVLQLMPLRDVVSWTAIISGCAHLGHESEALEFLKN
Sbjct: 421 QTNIHIGSTLVWFYCKCRNQLKASMVLQLMPLRDVVSWTAIISGCAHLGHESEALEFLKN 480

Query: 481 MIEEGVEPNSFTYSSTLKACAKMEAVLQGKMIHSSANKTSALSNVFVGSALIYMYAKCGY 540
           MIEEGVEPNSFTYSSTLKACAKMEAVLQGKMIHSSANKTSALSNVFVGSALIYMYAKCGY
Sbjct: 481 MIEEGVEPNSFTYSSTLKACAKMEAVLQGKMIHSSANKTSALSNVFVGSALIYMYAKCGY 540

Query: 541 VTEASQVFDSMPVRNLVSWKAMILCYARNGLCREALKLMYRMQAEGFEVDDYILGTVYGA 600
           VTEASQVFDSMPVRNLVSWKAMILCYARNGLCREALKLMYRMQAEGFEVDDYILGTVYGA
Sbjct: 541 VTEASQVFDSMPVRNLVSWKAMILCYARNGLCREALKLMYRMQAEGFEVDDYILGTVYGA 600

Query: 601 CGDVKCDVDSSLEYRLQTH 619
           CGDVKCDVDSSLEYRLQTH
Sbjct: 601 CGDVKCDVDSSLEYRLQTH 619

BLAST of CsGy2G024800 vs. ExPASy TrEMBL
Match: A0A5D3C5K1 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold453G001360 PE=4 SV=1)

HSP 1 Score: 1179 bits (3049), Expect = 0.0
Identity = 594/624 (95.19%), Postives = 601/624 (96.31%), Query Frame = 0

Query: 1   MFSPAIIS-----QSPPCLTFQPTSTSLSTRRTCSKWNLTTFNRCKSSTSFPFNFVEDHS 60
           MFSPA IS     QSPPCLTFQ TSTS S RRTCSK NLTTFNR KSST+FPF FVED S
Sbjct: 1   MFSPAFISTAITSQSPPCLTFQRTSTSQSARRTCSKRNLTTFNRYKSSTNFPFKFVEDQS 60

Query: 61  KALPVACATGKCTTTEEYADVESCSNQSVSGCLSPYLIGVWLRSSRSVKKLRAVHAFILR 120
           KA  +AC T KCTTTEEYADVESCSNQSVSGCLS YLIGVWLRSSRSVKKLRAVHAFILR
Sbjct: 61  KAFSIACTTAKCTTTEEYADVESCSNQSVSGCLSHYLIGVWLRSSRSVKKLRAVHAFILR 120

Query: 121 NFTSFGIYVGNNLLSSYLRLGMLVDARKVFDEMPMRSVVTWTAIINGYIDLDLTEEALAL 180
           +FTSF IYVGNNLLSSYLR+GMLVDARKVFDEMPMRSVVTWTAIINGYIDLDLTEEALAL
Sbjct: 121 HFTSFSIYVGNNLLSSYLRVGMLVDARKVFDEMPMRSVVTWTAIINGYIDLDLTEEALAL 180

Query: 181 FSDSVKSGVLANGQMFVCILNLCAKRLDFELGRQIHGVIVKGNRGNLIVDSAIIYFYAQC 240
           FSDSVKSGVLANGQMFVCILNLCAKRLDFELGRQIHGVIVKGNRGNLIVDSAIIYFYAQC
Sbjct: 181 FSDSVKSGVLANGQMFVCILNLCAKRLDFELGRQIHGVIVKGNRGNLIVDSAIIYFYAQC 240

Query: 241 KDISSAFVAFERMRRRDVVCWTSMITSCSQQGLGREAISMFSNMLSDEFLPNEFSVCSVL 300
           KDISSAFVAFERMRRRDVVCWTSMITSCSQQGLG+EAISMFSNMLSD FLPNEFSVCSVL
Sbjct: 241 KDISSAFVAFERMRRRDVVCWTSMITSCSQQGLGQEAISMFSNMLSDGFLPNEFSVCSVL 300

Query: 301 KACGEERELKIGRQLHGLIIKKIIKNDVFVGTSLVDMYAKCGNLADSREVFDGMRNRNTV 360
           KACGEERELKIGRQLHGLIIKKIIKNDVFVGTSLVDMYAKCGNL DSREVFDGMRNRNTV
Sbjct: 301 KACGEERELKIGRQLHGLIIKKIIKNDVFVGTSLVDMYAKCGNLVDSREVFDGMRNRNTV 360

Query: 361 TWTSIIAGYAREGLGEEALNLFRLMKRQRIPANNLTIVSILRACGSIEASLTGREVHAQI 420
           TWTSIIAGYAREGLGEEALNLFRLMKRQRIPANNLTIVSILRACGSIEASLTGREVHAQI
Sbjct: 361 TWTSIIAGYAREGLGEEALNLFRLMKRQRIPANNLTIVSILRACGSIEASLTGREVHAQI 420

Query: 421 VKNSFQTNIHIGSTLVWFYCKCRNQLKASMVLQLMPLRDVVSWTAIISGCAHLGHESEAL 480
           VKNSFQTNIHIGSTLVWFYCKCRNQLKASMVLQLMPLRDVVSWTAIISGCAHLGHESEAL
Sbjct: 421 VKNSFQTNIHIGSTLVWFYCKCRNQLKASMVLQLMPLRDVVSWTAIISGCAHLGHESEAL 480

Query: 481 EFLKNMIEEGVEPNSFTYSSTLKACAKMEAVLQGKMIHSSANKTSALSNVFVGSALIYMY 540
           EFLKNMIEEGVEPNSFTYSSTLKACAKMEA+LQGKMIHSSANKTSALSNVFVGSALIYMY
Sbjct: 481 EFLKNMIEEGVEPNSFTYSSTLKACAKMEAILQGKMIHSSANKTSALSNVFVGSALIYMY 540

Query: 541 AKCGYVTEASQVFDSMPVRNLVSWKAMILCYARNGLCREALKLMYRMQAEGFEVDDYILG 600
           AKCGYVTEASQVFDSMPVRNLVSWKAMILCYARNGLCREALKLMYRMQAEGFEVDDYILG
Sbjct: 541 AKCGYVTEASQVFDSMPVRNLVSWKAMILCYARNGLCREALKLMYRMQAEGFEVDDYILG 600

Query: 601 TVYGACGDVKCDVDSSLEYRLQTH 619
           TVYGACGDVKCDVDSS E+ LQTH
Sbjct: 601 TVYGACGDVKCDVDSSFEHSLQTH 624

BLAST of CsGy2G024800 vs. ExPASy TrEMBL
Match: A0A1S3B3M7 (pentatricopeptide repeat-containing protein At4g18520 OS=Cucumis melo OX=3656 GN=LOC103485432 PE=4 SV=1)

HSP 1 Score: 1176 bits (3042), Expect = 0.0
Identity = 593/624 (95.03%), Postives = 600/624 (96.15%), Query Frame = 0

Query: 1   MFSPAIIS-----QSPPCLTFQPTSTSLSTRRTCSKWNLTTFNRCKSSTSFPFNFVEDHS 60
           MFSPA IS     QSPPCLTFQ TSTS S RRTCSK NLTTFNR KSST+FPF FVED S
Sbjct: 1   MFSPAFISTAITSQSPPCLTFQRTSTSQSARRTCSKRNLTTFNRYKSSTNFPFKFVEDQS 60

Query: 61  KALPVACATGKCTTTEEYADVESCSNQSVSGCLSPYLIGVWLRSSRSVKKLRAVHAFILR 120
           KA  +AC T KCTTTEEYADVESCSNQSVSGCLS YLIGVWLRSSRSVKKLRAVHAFILR
Sbjct: 61  KAFSIACTTAKCTTTEEYADVESCSNQSVSGCLSHYLIGVWLRSSRSVKKLRAVHAFILR 120

Query: 121 NFTSFGIYVGNNLLSSYLRLGMLVDARKVFDEMPMRSVVTWTAIINGYIDLDLTEEALAL 180
           +FTSF IYVGNNLLSSYLR+GMLVDARKVFDEMPMRSVVTWTAIINGYIDLDLTEEALAL
Sbjct: 121 HFTSFSIYVGNNLLSSYLRVGMLVDARKVFDEMPMRSVVTWTAIINGYIDLDLTEEALAL 180

Query: 181 FSDSVKSGVLANGQMFVCILNLCAKRLDFELGRQIHGVIVKGNRGNLIVDSAIIYFYAQC 240
           FSDSVKSGVLANGQMFVCILNLCAKRLDFELGRQIHGVIVKGNRGNLIVDSAIIYFYAQC
Sbjct: 181 FSDSVKSGVLANGQMFVCILNLCAKRLDFELGRQIHGVIVKGNRGNLIVDSAIIYFYAQC 240

Query: 241 KDISSAFVAFERMRRRDVVCWTSMITSCSQQGLGREAISMFSNMLSDEFLPNEFSVCSVL 300
           KDISSAFVAFERM RRDVVCWTSMITSCSQQGLG+EAISMFSNMLSD FLPNEFSVCSVL
Sbjct: 241 KDISSAFVAFERMGRRDVVCWTSMITSCSQQGLGQEAISMFSNMLSDGFLPNEFSVCSVL 300

Query: 301 KACGEERELKIGRQLHGLIIKKIIKNDVFVGTSLVDMYAKCGNLADSREVFDGMRNRNTV 360
           KACGEERELKIGRQLHGLIIKKIIKNDVFVGTSLVDMYAKCGNL DSREVFDGMRNRNTV
Sbjct: 301 KACGEERELKIGRQLHGLIIKKIIKNDVFVGTSLVDMYAKCGNLVDSREVFDGMRNRNTV 360

Query: 361 TWTSIIAGYAREGLGEEALNLFRLMKRQRIPANNLTIVSILRACGSIEASLTGREVHAQI 420
           TWTSIIAGYAREGLGEEALNLFRLMKRQRIPANNLTIVSILRACGSIEASLTGREVHAQI
Sbjct: 361 TWTSIIAGYAREGLGEEALNLFRLMKRQRIPANNLTIVSILRACGSIEASLTGREVHAQI 420

Query: 421 VKNSFQTNIHIGSTLVWFYCKCRNQLKASMVLQLMPLRDVVSWTAIISGCAHLGHESEAL 480
           VKNSFQTNIHIGSTLVWFYCKCRNQLKASMVLQLMPLRDVVSWTAIISGCAHLGHESEAL
Sbjct: 421 VKNSFQTNIHIGSTLVWFYCKCRNQLKASMVLQLMPLRDVVSWTAIISGCAHLGHESEAL 480

Query: 481 EFLKNMIEEGVEPNSFTYSSTLKACAKMEAVLQGKMIHSSANKTSALSNVFVGSALIYMY 540
           EFLKNMIEEGVEPNSFTYSSTLKACAKMEA+LQGKMIHSSANKTSALSNVFVGSALIYMY
Sbjct: 481 EFLKNMIEEGVEPNSFTYSSTLKACAKMEAILQGKMIHSSANKTSALSNVFVGSALIYMY 540

Query: 541 AKCGYVTEASQVFDSMPVRNLVSWKAMILCYARNGLCREALKLMYRMQAEGFEVDDYILG 600
           AKCGYVTEASQVFDSMPVRNLVSWKAMILCYARNGLCREALKLMYRMQAEGFEVDDYILG
Sbjct: 541 AKCGYVTEASQVFDSMPVRNLVSWKAMILCYARNGLCREALKLMYRMQAEGFEVDDYILG 600

Query: 601 TVYGACGDVKCDVDSSLEYRLQTH 619
           TVYGACGDVKCDVDSS E+ LQTH
Sbjct: 601 TVYGACGDVKCDVDSSFEHSLQTH 624

BLAST of CsGy2G024800 vs. ExPASy TrEMBL
Match: A0A6J1FJ52 (pentatricopeptide repeat-containing protein At4g18520, chloroplastic-like OS=Cucurbita moschata OX=3662 GN=LOC111444741 PE=4 SV=1)

HSP 1 Score: 1063 bits (2748), Expect = 0.0
Identity = 525/616 (85.23%), Postives = 564/616 (91.56%), Query Frame = 0

Query: 3   SPAIISQSPPCLTFQPTSTSLSTRRTCSKWNLTTFNRCKSSTSFPFNFVEDHSKALPVAC 62
           S A ISQSPPC + QP   SLSTRR CSKWNLT+ +RCKS T+  FNF++D  +A  VAC
Sbjct: 8   STATISQSPPCFSVQPPPISLSTRRACSKWNLTSIDRCKSPTNLRFNFIKDPFRASLVAC 67

Query: 63  ATGKCTTTEEYADVESCSNQSVSGCLSPYLIGVWLRSSRSVKKLRAVHAFILRNFTSFGI 122
            T  CT  +E  DVESCSN+ V+GCLSPYLI  WLRSSR VK+LRA+HAFILR+F+S   
Sbjct: 68  TTETCTA-QECTDVESCSNEPVNGCLSPYLIAAWLRSSRGVKELRAIHAFILRHFSSLVT 127

Query: 123 YVGNNLLSSYLRLGMLVDARKVFDEMPMRSVVTWTAIINGYIDLDLTEEALALFSDSVKS 182
           YVGNNL+SSYLR GML+DAR+VFDEMPMRSVVTWTAIINGYID DLT+EAL LF DSV+S
Sbjct: 128 YVGNNLISSYLRFGMLIDARRVFDEMPMRSVVTWTAIINGYIDFDLTDEALGLFGDSVRS 187

Query: 183 GVLANGQMFVCILNLCAKRLDFELGRQIHGVIVKGNRGNLIVDSAIIYFYAQCKDISSAF 242
           GV ANG+MFVCILNLCAKRLDFELGRQIHG IVKGN GNLIVDSAI+YFYAQCKDISSAF
Sbjct: 188 GVRANGKMFVCILNLCAKRLDFELGRQIHGAIVKGNWGNLIVDSAIVYFYAQCKDISSAF 247

Query: 243 VAFERMRRRDVVCWTSMITSCSQQGLGREAISMFSNMLSDEFLPNEFSVCSVLKACGEER 302
           VAFE M +RD+VCWTSMITSCSQQGLGREAISMFSNMLSD FLPNEFSVCSVLKACGEER
Sbjct: 248 VAFEHMPKRDIVCWTSMITSCSQQGLGREAISMFSNMLSDGFLPNEFSVCSVLKACGEER 307

Query: 303 ELKIGRQLHGLIIKKIIKNDVFVGTSLVDMYAKCGNLADSREVFDGMRNRNTVTWTSIIA 362
           ELKIG+QLHGL+IKKIIKNDVFVG+SLVDMYAK G+LADSREVFD MRNRNTVTWTSIIA
Sbjct: 308 ELKIGKQLHGLVIKKIIKNDVFVGSSLVDMYAKSGSLADSREVFDEMRNRNTVTWTSIIA 367

Query: 363 GYAREGLGEEALNLFRLMKRQRIPANNLTIVSILRACGSIEASLTGREVHAQIVKNSFQT 422
           GYAREGLGEEA+NLFRLM +QR+PAN+LTIVSILRACGSI ASLTGREVHAQIVKNSFQT
Sbjct: 368 GYAREGLGEEAVNLFRLMNKQRVPANDLTIVSILRACGSIGASLTGREVHAQIVKNSFQT 427

Query: 423 NIHIGSTLVWFYCKCRNQLKASMVLQLMPLRDVVSWTAIISGCAHLGHESEALEFLKNMI 482
           N+HIGSTLVWFYCKCRNQ KASMVLQ MPLRDVVSWTAIISGCAHLGHESEALEFL+NMI
Sbjct: 428 NLHIGSTLVWFYCKCRNQPKASMVLQQMPLRDVVSWTAIISGCAHLGHESEALEFLENMI 487

Query: 483 EEGVEPNSFTYSSTLKACAKMEAVLQGKMIHSSANKTSALSNVFVGSALIYMYAKCGYVT 542
           EEGVEPNSFTYSS LKACAKMEA+LQG++IHSSANKTSA+SNVFVGSALIYMYAKCGYVT
Sbjct: 488 EEGVEPNSFTYSSALKACAKMEAILQGRLIHSSANKTSAVSNVFVGSALIYMYAKCGYVT 547

Query: 543 EASQVFDSMPVRNLVSWKAMILCYARNGLCREALKLMYRMQAEGFEVDDYILGTVYGACG 602
           EASQVFDSMP RNLVSWKAMILCYARNGLCREALKLMYRMQAEG EVDD+I GTVYG CG
Sbjct: 548 EASQVFDSMPERNLVSWKAMILCYARNGLCREALKLMYRMQAEGIEVDDHIFGTVYGVCG 607

Query: 603 DVKCDVDSSLEYRLQT 618
           DVKC+VDSSLEY LQT
Sbjct: 608 DVKCEVDSSLEYSLQT 622

BLAST of CsGy2G024800 vs. ExPASy TrEMBL
Match: A0A6J1JUB8 (pentatricopeptide repeat-containing protein At4g18520, chloroplastic-like OS=Cucurbita maxima OX=3661 GN=LOC111489800 PE=4 SV=1)

HSP 1 Score: 1059 bits (2738), Expect = 0.0
Identity = 523/616 (84.90%), Postives = 562/616 (91.23%), Query Frame = 0

Query: 3   SPAIISQSPPCLTFQPTSTSLSTRRTCSKWNLTTFNRCKSSTSFPFNFVEDHSKALPVAC 62
           S A ISQSPPC + QP   SLSTRR CSKWNLT+ +RCKS T+  FNF++D  +A  VAC
Sbjct: 8   STATISQSPPCFSVQPPPISLSTRRACSKWNLTSIDRCKSPTNLRFNFIKDPFRASLVAC 67

Query: 63  ATGKCTTTEEYADVESCSNQSVSGCLSPYLIGVWLRSSRSVKKLRAVHAFILRNFTSFGI 122
            T  CT  +E  DVESCSN+ V+GCLSPYLI  WLRSSR VK+LRA+HAFI R+FTS   
Sbjct: 68  TTETCTA-QECTDVESCSNEPVNGCLSPYLIAAWLRSSRGVKELRAIHAFIWRHFTSLVT 127

Query: 123 YVGNNLLSSYLRLGMLVDARKVFDEMPMRSVVTWTAIINGYIDLDLTEEALALFSDSVKS 182
           YVGNNL+SSYLR GML+DAR+VFDEMPMRSVVTWTAIINGYID DLT+EAL LF DSV+S
Sbjct: 128 YVGNNLISSYLRFGMLIDARRVFDEMPMRSVVTWTAIINGYIDFDLTDEALGLFGDSVRS 187

Query: 183 GVLANGQMFVCILNLCAKRLDFELGRQIHGVIVKGNRGNLIVDSAIIYFYAQCKDISSAF 242
           GV  NG+MFVCILNLCAKRLDFELGRQIHG IVKGN GNLIVDSAI+YFYAQCKDISSAF
Sbjct: 188 GVRVNGKMFVCILNLCAKRLDFELGRQIHGAIVKGNWGNLIVDSAIVYFYAQCKDISSAF 247

Query: 243 VAFERMRRRDVVCWTSMITSCSQQGLGREAISMFSNMLSDEFLPNEFSVCSVLKACGEER 302
           VAFE M +RD+VCWTSMITSCSQQGLGREAISMFSNMLSD FLPNEFSVCSVLKACGEER
Sbjct: 248 VAFEHMPKRDIVCWTSMITSCSQQGLGREAISMFSNMLSDGFLPNEFSVCSVLKACGEER 307

Query: 303 ELKIGRQLHGLIIKKIIKNDVFVGTSLVDMYAKCGNLADSREVFDGMRNRNTVTWTSIIA 362
           ELKIG+QLHGL+IKKIIKNDVFVG+SLVDMYAK G+LADSREVFD MRNRNTVTWTSIIA
Sbjct: 308 ELKIGKQLHGLVIKKIIKNDVFVGSSLVDMYAKSGSLADSREVFDEMRNRNTVTWTSIIA 367

Query: 363 GYAREGLGEEALNLFRLMKRQRIPANNLTIVSILRACGSIEASLTGREVHAQIVKNSFQT 422
           GYAREGLGEEA+NLFRLM +QR+PAN+LTIVSILRACGSI ASLTGREVHAQIVKNSFQT
Sbjct: 368 GYAREGLGEEAVNLFRLMNKQRVPANDLTIVSILRACGSIGASLTGREVHAQIVKNSFQT 427

Query: 423 NIHIGSTLVWFYCKCRNQLKASMVLQLMPLRDVVSWTAIISGCAHLGHESEALEFLKNMI 482
           N+HIGSTLVWFYCKCRNQ KASMVLQ MPLRDVVSWTAIISGCAHLGHESEALEFL+NMI
Sbjct: 428 NLHIGSTLVWFYCKCRNQPKASMVLQQMPLRDVVSWTAIISGCAHLGHESEALEFLENMI 487

Query: 483 EEGVEPNSFTYSSTLKACAKMEAVLQGKMIHSSANKTSALSNVFVGSALIYMYAKCGYVT 542
           EEGVEPNSFTYSS LKACAKMEA+LQG++IHSSANK+SA+SNVFVGSALIYMYAKCGYVT
Sbjct: 488 EEGVEPNSFTYSSALKACAKMEAILQGRLIHSSANKSSAVSNVFVGSALIYMYAKCGYVT 547

Query: 543 EASQVFDSMPVRNLVSWKAMILCYARNGLCREALKLMYRMQAEGFEVDDYILGTVYGACG 602
           EASQVFDSMP RNLVSWKAMILCYARNGLCREALKLMYRMQAEG EVDD+I GTVYG CG
Sbjct: 548 EASQVFDSMPERNLVSWKAMILCYARNGLCREALKLMYRMQAEGIEVDDHIFGTVYGVCG 607

Query: 603 DVKCDVDSSLEYRLQT 618
           DVKC+VDSSLEY LQT
Sbjct: 608 DVKCEVDSSLEYSLQT 622

BLAST of CsGy2G024800 vs. TAIR 10
Match: AT4G18520.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 697.2 bits (1798), Expect = 1.2e-200
Identity = 335/530 (63.21%), Postives = 424/530 (80.00%), Query Frame = 0

Query: 92  LIGVWLRSSRSVKKLRAVHAFILRNFTSFGIYVGNNLLSSYLRLGMLVDARKVFDEMPMR 151
           L+  WL+SS  ++ ++ +HA  L+ F    IY GNNL+SS +RLG LV ARKVFD MP +
Sbjct: 87  LLAEWLQSSNGMRLIKRIHAMALKCFDDQVIYFGNNLISSCVRLGDLVYARKVFDSMPEK 146

Query: 152 SVVTWTAIINGYIDLDLTEEALALFSDSVKSGV-LANGQMFVCILNLCAKRLDFELGRQI 211
           + VTWTA+I+GY+   L +EA ALF D VK G+   N +MFVC+LNLC++R +FELGRQ+
Sbjct: 147 NTVTWTAMIDGYLKYGLEDEAFALFEDYVKHGIRFTNERMFVCLLNLCSRRAEFELGRQV 206

Query: 212 HGVIVKGNRGNLIVDSAIIYFYAQCKDISSAFVAFERMRRRDVVCWTSMITSCSQQGLGR 271
           HG +VK   GNLIV+S+++YFYAQC +++SA  AF+ M  +DV+ WT++I++CS++G G 
Sbjct: 207 HGNMVKVGVGNLIVESSLVYFYAQCGELTSALRAFDMMEEKDVISWTAVISACSRKGHGI 266

Query: 272 EAISMFSNMLSDEFLPNEFSVCSVLKACGEERELKIGRQLHGLIIKKIIKNDVFVGTSLV 331
           +AI MF  ML+  FLPNEF+VCS+LKAC EE+ L+ GRQ+H L++K++IK DVFVGTSL+
Sbjct: 267 KAIGMFIGMLNHWFLPNEFTVCSILKACSEEKALRFGRQVHSLVVKRMIKTDVFVGTSLM 326

Query: 332 DMYAKCGNLADSREVFDGMRNRNTVTWTSIIAGYAREGLGEEALNLFRLMKRQRIPANNL 391
           DMYAKCG ++D R+VFDGM NRNTVTWTSIIA +AREG GEEA++LFR+MKR+ + ANNL
Sbjct: 327 DMYAKCGEISDCRKVFDGMSNRNTVTWTSIIAAHAREGFGEEAISLFRIMKRRHLIANNL 386

Query: 392 TIVSILRACGSIEASLTGREVHAQIVKNSFQTNIHIGSTLVWFYCKCRNQLKASMVLQLM 451
           T+VSILRACGS+ A L G+E+HAQI+KNS + N++IGSTLVW YCKC     A  VLQ +
Sbjct: 387 TVVSILRACGSVGALLLGKELHAQIIKNSIEKNVYIGSTLVWLYCKCGESRDAFNVLQQL 446

Query: 452 PLRDVVSWTAIISGCAHLGHESEALEFLKNMIEEGVEPNSFTYSSTLKACAKMEAVLQGK 511
           P RDVVSWTA+ISGC+ LGHESEAL+FLK MI+EGVEPN FTYSS LKACA  E++L G+
Sbjct: 447 PSRDVVSWTAMISGCSSLGHESEALDFLKEMIQEGVEPNPFTYSSALKACANSESLLIGR 506

Query: 512 MIHSSANKTSALSNVFVGSALIYMYAKCGYVTEASQVFDSMPVRNLVSWKAMILCYARNG 571
            IHS A K  ALSNVFVGSALI+MYAKCG+V+EA +VFDSMP +NLVSWKAMI+ YARNG
Sbjct: 507 SIHSIAKKNHALSNVFVGSALIHMYAKCGFVSEAFRVFDSMPEKNLVSWKAMIMGYARNG 566

Query: 572 LCREALKLMYRMQAEGFEVDDYILGTVYGACGDVKCD--VDSSLEYRLQT 619
            CREALKLMYRM+AEGFEVDDYI  T+   CGD++ D  V+SS    L+T
Sbjct: 567 FCREALKLMYRMEAEGFEVDDYIFATILSTCGDIELDEAVESSATCYLET 616

BLAST of CsGy2G024800 vs. TAIR 10
Match: AT2G33680.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 288.1 bits (736), Expect = 1.7e-77
Identity = 166/507 (32.74%), Postives = 279/507 (55.03%), Query Frame = 0

Query: 107 RAVHAFILRNFTSFG-IYVGNNLLSSYLRLGMLVDARKVFDEMPMRSVVTWTAIINGYID 166
           R  HA +++  +SFG IYV  +L+  Y + G++ D  KVF  MP R+  TW+ +++GY  
Sbjct: 138 RQAHALVVK-MSSFGDIYVDTSLVGMYCKAGLVEDGLKVFAYMPERNTYTWSTMVSGYAT 197

Query: 167 LDLTEEALALFSDSV--KSGVLANGQMFVCILNLCAKRLDFELGRQIHGVIVK-GNRGNL 226
               EEA+ +F+  +  K     +  +F  +L+  A  +   LGRQIH + +K G  G +
Sbjct: 198 RGRVEEAIKVFNLFLREKEEGSDSDYVFTAVLSSLAATIYVGLGRQIHCITIKNGLLGFV 257

Query: 227 IVDSAIIYFYAQCKDISSAFVAFERMRRRDVVCWTSMITSCSQQGLGREAISMFSNMLSD 286
            + +A++  Y++C+ ++ A   F+    R+ + W++M+T  SQ G   EA+ +FS M S 
Sbjct: 258 ALSNALVTMYSKCESLNEACKMFDSSGDRNSITWSAMVTGYSQNGESLEAVKLFSRMFSA 317

Query: 287 EFLPNEFSVCSVLKACGEERELKIGRQLHGLIIKKIIKNDVFVGTSLVDMYAKCGNLADS 346
              P+E+++  VL AC +   L+ G+QLH  ++K   +  +F  T+LVDMYAK G LAD+
Sbjct: 318 GIKPSEYTIVGVLNACSDICYLEEGKQLHSFLLKLGFERHLFATTALVDMYAKAGCLADA 377

Query: 347 REVFDGMRNRNTVTWTSIIAGYAREGLGEEALNLFRLMKRQRIPANNLTIVSILRACGSI 406
           R+ FD ++ R+   WTS+I+GY +    EEAL L+R MK   I  N+ T+ S+L+AC S+
Sbjct: 378 RKGFDCLQERDVALWTSLISGYVQNSDNEEALILYRRMKTAGIIPNDPTMASVLKACSSL 437

Query: 407 EASLTGREVHAQIVKNSFQTNIHIGSTLVWFYCKCRNQLKASMVLQLMPLRDVVSWTAII 466
                G++VH   +K+ F   + IGS L   Y KC +    ++V +  P +DVVSW A+I
Sbjct: 438 ATLELGKQVHGHTIKHGFGLEVPIGSALSTMYSKCGSLEDGNLVFRRTPNKDVVSWNAMI 497

Query: 467 SGCAHLGHESEALEFLKNMIEEGVEPNSFTYSSTLKACAKMEAVLQGKMIHSSANKTSAL 526
           SG +H G   EALE  + M+ EG+EP+  T+ + + AC+    V +G    +  +    L
Sbjct: 498 SGLSHNGQGDEALELFEEMLAEGMEPDDVTFVNIISACSHKGFVERGWFYFNMMSDQIGL 557

Query: 527 S-NVFVGSALIYMYAKCGYVTEASQVFDSMPV-RNLVSWKAMILCYARNGLCREALKLMY 586
              V   + ++ + ++ G + EA +  +S  +   L  W+ ++     +G C   +    
Sbjct: 558 DPKVDHYACMVDLLSRAGQLKEAKEFIESANIDHGLCLWRILLSACKNHGKCELGVYAGE 617

Query: 587 RMQAEGF-EVDDYI-LGTVYGACGDVK 606
           ++ A G  E   Y+ L  +Y A G ++
Sbjct: 618 KLMALGSRESSTYVQLSGIYTALGRMR 643

BLAST of CsGy2G024800 vs. TAIR 10
Match: AT3G53360.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 285.8 bits (730), Expect = 8.3e-77
Identity = 168/509 (33.01%), Postives = 268/509 (52.65%), Query Frame = 0

Query: 99  SSRSVKKLRAVHAFILRNFTSFGIYVGNNLLSSYLRLGMLVDARKVFDEMPMRSVVTWTA 158
           SSRS+ + R +H  IL +   +   + N++LS Y + G L DAR+VFD MP R++V++T+
Sbjct: 79  SSRSLAQGRKIHDHILNSNCKYDTILNNHILSMYGKCGSLRDAREVFDFMPERNLVSYTS 138

Query: 159 IINGYIDLDLTEEALALFSDSVKSGVLANGQMFVCILNLCAKRLDFELGRQIHGVIVK-G 218
           +I GY       EA+ L+   ++  ++ +   F  I+  CA   D  LG+Q+H  ++K  
Sbjct: 139 VITGYSQNGQGAEAIRLYLKMLQEDLVPDQFAFGSIIKACASSSDVGLGKQLHAQVIKLE 198

Query: 219 NRGNLIVDSAIIYFYAQCKDISSAFVAFERMRRRDVVCWTSMITSCSQQGLGREAISMFS 278
           +  +LI  +A+I  Y +   +S A   F  +  +D++ W+S+I   SQ G   EA+S   
Sbjct: 199 SSSHLIAQNALIAMYVRFNQMSDASRVFYGIPMKDLISWSSIIAGFSQLGFEFEALSHLK 258

Query: 279 NMLS-DEFLPNEFSVCSVLKACGEERELKIGRQLHGLIIKKIIKNDVFVGTSLVDMYAKC 338
            MLS   F PNE+   S LKAC        G Q+HGL IK  +  +   G SL DMYA+C
Sbjct: 259 EMLSFGVFHPNEYIFGSSLKACSSLLRPDYGSQIHGLCIKSELAGNAIAGCSLCDMYARC 318

Query: 339 GNLADSREVFDGMRNRNTVTWTSIIAGYAREGLGEEALNLFRLMKRQRIPANNLTIVSIL 398
           G L  +R VFD +   +T +W  IIAG A  G  +EA+++F  M+      + +++ S+L
Sbjct: 319 GFLNSARRVFDQIERPDTASWNVIIAGLANNGYADEAVSVFSQMRSSGFIPDAISLRSLL 378

Query: 399 RACGSIEASLTGREVHAQIVKNSFQTNIHIGSTLVWFYCKCRNQLKA-SMVLQLMPLRDV 458
            A     A   G ++H+ I+K  F  ++ + ++L+  Y  C +     ++        D 
Sbjct: 379 CAQTKPMALSQGMQIHSYIIKWGFLADLTVCNSLLTMYTFCSDLYCCFNLFEDFRNNADS 438

Query: 459 VSWTAIISGCAHLGHESEALEFLKNMIEEGVEPNSFTYSSTLKACAKMEAVLQGKMIHSS 518
           VSW  I++ C       E L   K M+    EP+  T  + L+ C ++ ++  G  +H  
Sbjct: 439 VSWNTILTACLQHEQPVEMLRLFKLMLVSECEPDHITMGNLLRGCVEISSLKLGSQVHCY 498

Query: 519 ANKTSALSNVFVGSALIYMYAKCGYVTEASQVFDSMPVRNLVSWKAMILCYARNGLCREA 578
           + KT      F+ + LI MYAKCG + +A ++FDSM  R++VSW  +I+ YA++G   EA
Sbjct: 499 SLKTGLAPEQFIKNGLIDMYAKCGSLGQARRIFDSMDNRDVVSWSTLIVGYAQSGFGEEA 558

Query: 579 LKLMYRMQAEGFEVDDYILGTVYGACGDV 605
           L L   M++ G E +      V  AC  V
Sbjct: 559 LILFKEMKSAGIEPNHVTFVGVLTACSHV 587

BLAST of CsGy2G024800 vs. TAIR 10
Match: AT1G11290.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 279.6 bits (714), Expect = 5.9e-75
Identity = 153/506 (30.24%), Postives = 266/506 (52.57%), Query Frame = 0

Query: 97  LRSSRSVKKLRAVHAFILRNFTSFGIYVGNNLLSSYLRLGMLVDARKVFDEMPMRSVVTW 156
           L    S+K+LR +   + +N      +    L+S + R G + +A +VF+ +  +  V +
Sbjct: 44  LERCSSLKELRQILPLVFKNGLYQEHFFQTKLVSLFCRYGSVDEAARVFEPIDSKLNVLY 103

Query: 157 TAIINGYIDLDLTEEALALFSDSVKSGVLANGQMFVCILNLCAKRLDFELGRQIHGVIVK 216
             ++ G+  +   ++AL  F       V      F  +L +C    +  +G++IHG++VK
Sbjct: 104 HTMLKGFAKVSDLDKALQFFVRMRYDDVEPVVYNFTYLLKVCGDEAELRVGKEIHGLLVK 163

Query: 217 -GNRGNLIVDSAIIYFYAQCKDISSAFVAFERMRRRDVVCWTSMITSCSQQGLGREAISM 276
            G   +L   + +   YA+C+ ++ A   F+RM  RD+V W +++   SQ G+ R A+ M
Sbjct: 164 SGFSLDLFAMTGLENMYAKCRQVNEARKVFDRMPERDLVSWNTIVAGYSQNGMARMALEM 223

Query: 277 FSNMLSDEFLPNEFSVCSVLKACGEERELKIGRQLHGLIIKKIIKNDVFVGTSLVDMYAK 336
             +M  +   P+  ++ SVL A    R + +G+++HG  ++    + V + T+LVDMYAK
Sbjct: 224 VKSMCEENLKPSFITIVSVLPAVSALRLISVGKEIHGYAMRSGFDSLVNISTALVDMYAK 283

Query: 337 CGNLADSREVFDGMRNRNTVTWTSIIAGYAREGLGEEALNLFRLMKRQRIPANNLTIVSI 396
           CG+L  +R++FDGM  RN V+W S+I  Y +    +EA+ +F+ M  + +   +++++  
Sbjct: 284 CGSLETARQLFDGMLERNVVSWNSMIDAYVQNENPKEAMLIFQKMLDEGVKPTDVSVMGA 343

Query: 397 LRACGSIEASLTGREVHAQIVKNSFQTNIHIGSTLVWFYCKCRNQLKASMVLQLMPLRDV 456
           L AC  +     GR +H   V+     N+ + ++L+  YCKC+    A+ +   +  R +
Sbjct: 344 LHACADLGDLERGRFIHKLSVELGLDRNVSVVNSLISMYCKCKEVDTAASMFGKLQSRTL 403

Query: 457 VSWTAIISGCAHLGHESEALEFLKNMIEEGVEPNSFTYSSTLKACAKMEAVLQGKMIHSS 516
           VSW A+I G A  G   +AL +   M    V+P++FTY S + A A++      K IH  
Sbjct: 404 VSWNAMILGFAQNGRPIDALNYFSQMRSRTVKPDTFTYVSVITAIAELSITHHAKWIHGV 463

Query: 517 ANKTSALSNVFVGSALIYMYAKCGYVTEASQVFDSMPVRNLVSWKAMILCYARNGLCREA 576
             ++    NVFV +AL+ MYAKCG +  A  +FD M  R++ +W AMI  Y  +G  + A
Sbjct: 464 VMRSCLDKNVFVTTALVDMYAKCGAIMIARLIFDMMSERHVTTWNAMIDGYGTHGFGKAA 523

Query: 577 LKLMYRMQAEGFEVDDYILGTVYGAC 602
           L+L   MQ    + +     +V  AC
Sbjct: 524 LELFEEMQKGTIKPNGVTFLSVISAC 549

BLAST of CsGy2G024800 vs. TAIR 10
Match: AT3G09040.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 277.3 bits (708), Expect = 2.9e-74
Identity = 167/497 (33.60%), Postives = 267/497 (53.72%), Query Frame = 0

Query: 109 VHAFILRNFTSFGIYVGNNLLSSYLRLGMLVDARKVFDEMPMRSVVTWTAIINGYIDLDL 168
           VHA  ++   +  IYVG++L+S Y +   +  A KVF+ +  ++ V W A+I GY     
Sbjct: 349 VHAEAIKLGLASNIYVGSSLVSMYSKCEKMEAAAKVFEALEEKNDVFWNAMIRGYAHNGE 408

Query: 169 TEEALALFSDSVKSGVLANGQMFVCILNLCAKRLDFELGRQIHGVIVKGNRG-NLIVDSA 228
           + + + LF D   SG   +   F  +L+ CA   D E+G Q H +I+K     NL V +A
Sbjct: 409 SHKVMELFMDMKSSGYNIDDFTFTSLLSTCAASHDLEMGSQFHSIIIKKKLAKNLFVGNA 468

Query: 229 IIYFYAQCKDISSAFVAFERMRRRDVVCWTSMITSCSQQGLGREAISMFSNMLSDEFLPN 288
           ++  YA+C  +  A   FERM  RD V W ++I S  Q     EA  +F  M     + +
Sbjct: 469 LVDMYAKCGALEDARQIFERMCDRDNVTWNTIIGSYVQDENESEAFDLFKRMNLCGIVSD 528

Query: 289 EFSVCSVLKACGEERELKIGRQLHGLIIKKIIKNDVFVGTSLVDMYAKCGNLADSREVFD 348
              + S LKAC     L  G+Q+H L +K  +  D+  G+SL+DMY+KCG + D+R+VF 
Sbjct: 529 GACLASTLKACTHVHGLYQGKQVHCLSVKCGLDRDLHTGSSLIDMYSKCGIIKDARKVFS 588

Query: 349 GMRNRNTVTWTSIIAGYAREGLGEEALNLFRLMKRQRIPANNLTIVSILRACGSIEASLT 408
            +   + V+  ++IAGY++  L EEA+ LF+ M  + +  + +T  +I+ AC   E+   
Sbjct: 589 SLPEWSVVSMNALIAGYSQNNL-EEAVVLFQEMLTRGVNPSEITFATIVEACHKPESLTL 648

Query: 409 GREVHAQIVKNSFQT-NIHIGSTLVWFYCKCRNQLKA-SMVLQLMPLRDVVSWTAIISGC 468
           G + H QI K  F +   ++G +L+  Y   R   +A ++  +L   + +V WT ++SG 
Sbjct: 649 GTQFHGQITKRGFSSEGEYLGISLLGMYMNSRGMTEACALFSELSSPKSIVLWTGMMSGH 708

Query: 469 AHLGHESEALEFLKNMIEEGVEPNSFTYSSTLKACAKMEAVLQGKMIHSSANKTSALSNV 528
           +  G   EAL+F K M  +GV P+  T+ + L+ C+ + ++ +G+ IHS     +   + 
Sbjct: 709 SQNGFYEEALKFYKEMRHDGVLPDQATFVTVLRVCSVLSSLREGRAIHSLIFHLAHDLDE 768

Query: 529 FVGSALIYMYAKCGYVTEASQVFDSMPVR-NLVSWKAMILCYARNGLCREALKLMYRMQA 588
              + LI MYAKCG +  +SQVFD M  R N+VSW ++I  YA+NG   +ALK+   M+ 
Sbjct: 769 LTSNTLIDMYAKCGDMKGSSQVFDEMRRRSNVVSWNSLINGYAKNGYAEDALKIFDSMRQ 828

Query: 589 EGFEVDDYILGTVYGAC 602
                D+     V  AC
Sbjct: 829 SHIMPDEITFLGVLTAC 844

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q0WNP31.7e-19963.21Pentatricopeptide repeat-containing protein At4g18520, chloroplastic OS=Arabidop... [more]
P930052.3e-7632.74Pentatricopeptide repeat-containing protein At2g33680 OS=Arabidopsis thaliana OX... [more]
Q9LFI11.2e-7533.01Pentatricopeptide repeat-containing protein At3g53360, mitochondrial OS=Arabidop... [more]
Q3E6Q18.3e-7430.24Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidop... [more]
Q9SS834.1e-7333.60Pentatricopeptide repeat-containing protein At3g09040, mitochondrial OS=Arabidop... [more]
Match NameE-valueIdentityDescription
XP_004138810.10.0100.00pentatricopeptide repeat-containing protein At4g18520, chloroplastic [Cucumis sa... [more]
TYK06655.10.095.19pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa][more]
XP_008441245.10.095.03PREDICTED: pentatricopeptide repeat-containing protein At4g18520 [Cucumis melo][more]
XP_038884364.10.088.17pentatricopeptide repeat-containing protein At4g18520, chloroplastic-like [Benin... [more]
KAG7015915.10.085.23Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita a... [more]
Match NameE-valueIdentityDescription
A0A0A0LSX20.0100.00Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G403710 PE=4 SV=1[more]
A0A5D3C5K10.095.19Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946... [more]
A0A1S3B3M70.095.03pentatricopeptide repeat-containing protein At4g18520 OS=Cucumis melo OX=3656 GN... [more]
A0A6J1FJ520.085.23pentatricopeptide repeat-containing protein At4g18520, chloroplastic-like OS=Cuc... [more]
A0A6J1JUB80.084.90pentatricopeptide repeat-containing protein At4g18520, chloroplastic-like OS=Cuc... [more]
Match NameE-valueIdentityDescription
AT4G18520.11.2e-20063.21Pentatricopeptide repeat (PPR) superfamily protein [more]
AT2G33680.11.7e-7732.74Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT3G53360.18.3e-7733.01Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT1G11290.15.9e-7530.24Pentatricopeptide repeat (PPR) superfamily protein [more]
AT3G09040.12.9e-7433.60Pentatricopeptide repeat (PPR) superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Cucumber (Gy14) v2.1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 254..288
e-value: 4.9E-6
score: 24.4
coord: 456..490
e-value: 1.7E-7
score: 29.0
coord: 154..186
e-value: 0.0013
score: 16.7
coord: 557..590
e-value: 1.4E-5
score: 22.9
coord: 126..152
e-value: 6.7E-4
score: 17.7
coord: 355..388
e-value: 3.0E-7
score: 28.2
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 251..298
e-value: 2.9E-8
score: 33.8
coord: 453..502
e-value: 5.5E-13
score: 48.9
coord: 352..399
e-value: 3.0E-10
score: 40.1
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 154..184
e-value: 3.9E-4
score: 20.5
coord: 126..151
e-value: 0.0014
score: 18.7
coord: 557..587
e-value: 1.3E-6
score: 28.3
coord: 529..555
e-value: 0.0087
score: 16.3
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 152..186
score: 9.415814
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 252..286
score: 10.435215
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 555..589
score: 10.566751
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 353..387
score: 11.684803
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 454..488
score: 12.287675
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 513..615
e-value: 1.7E-18
score: 68.6
coord: 306..407
e-value: 2.9E-23
score: 84.1
coord: 96..202
e-value: 5.9E-16
score: 60.3
coord: 203..305
e-value: 1.2E-17
score: 65.8
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 408..512
e-value: 2.0E-11
score: 45.7
NoneNo IPR availablePANTHERPTHR47924PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEINcoord: 284..500
coord: 395..602
coord: 169..400
NoneNo IPR availablePANTHERPTHR47924PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEINcoord: 100..298
NoneNo IPR availablePANTHERPTHR47924:SF2PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEINcoord: 395..602
coord: 169..400
NoneNo IPR availablePANTHERPTHR47924:SF2PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEINcoord: 284..500
coord: 100..298

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CsGy2G024800.1CsGy2G024800.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:1900865 chloroplast RNA modification
biological_process GO:0008380 RNA splicing
cellular_component GO:0009507 chloroplast
molecular_function GO:0003729 mRNA binding
molecular_function GO:0005515 protein binding