Csor.00g113810 (gene) Silver-seed gourd (wild; sororia) v1

Overview
NameCsor.00g113810
Typegene
OrganismCucurbita argyrosperma subsp. sororia (Silver-seed gourd (wild; sororia) v1)
DescriptionPentatricopeptide repeat-containing protein
LocationCsor_Chr07: 1573537 .. 1575165 (-)
RNA-Seq ExpressionCsor.00g113810
SyntenyCsor.00g113810
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideCDSinitialstart_codonintronterminalstop_codon
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGCGCCTCTGTGGTCGGCCTTCTTCTTCCGGCGTCATTCATCTGTTCACCAAGTCGATTGTAGCCGGCGCAACCGCGACCATTCGTCGTCGGCATAAATCTGAATACGCCAACGACAGGTTTTCAGCTTTCCTTGTTCATTTCCTCTTAATTCTTTCTTATACTTGCCTGTTATGTATAATTTTATATACTGCCCTAAGAATTCTTCTGCTCGACGCTTAAAGTATCTCCTGCTTGATCAAATTTAATCGCGAACCTTCTTTTCATTTTTAGGGTTTAGGGTTTTTGGGTTTTTCGCGTTTTAATCTATCCTTAGAAACTTCCAGGTCTCAGGTGAAGCCACATCAGAAAGATTCCTCCTCCTGGGATAGAACCTTTAGGAGCCTATGTATAACGGGGAGATTGAGCGAGGCGGTTGCACTTTTGTGCTGTATGCCCTTCCGATTTCACTCCAAAACTTACTGCCTTCTGTTACAAGAATGCATTTTCAGGAAAGAGTATATGAAAGGAAAAAGAATCCATGCTCAAATGGTTGTTGTTGGACATTTACCCAATGAGTATCTCAAAACCAAACTGCTGATATTATATGCCAAATTAGGTGACTTAGAAACTGCAAATGTTCTTCATGAGAAATTGCTGGAGAACAGTCTGGTTTCATGGAATGCATTGATTGCTGGATATGTACAGAAAGGGTTTGGAGAAGTTGGATTGGAGCTTTACTTTAAGATGAGACGAACTGGTTTAATACCTGATCAATATACCTTTGCATCAGTTTTCAGAGCCTGTGCTAGCTTAGCTTCTTTGGAACATGGAAAGAGAGCTCATGGAGTTCTGATTAAGTGTCGAATCGGCGACAATGTTGTCGTGTCTAGTGCCCTTGTTGATATGTACTTCAAATGCAGTAGCATATCAGATGGTCATAAGGTATTTAACAAATCTACAACTAGAAATGTGATTACATGGACTGCTTTAATATCAGGGTATGGCCACCATGGAAGAGTTTCTGAAGTTTTGGAATCCTTCAATAGTATGATAAATGAAGGTTACCGACCAAATTACGTTACTTTCCTTGCGGTTCTTACTGCTTGTGGACATGTTGGTTTTGTATCGGAAGCATGGCGATACTTATCGTTAATGAAGACGACGTATGAAATAGAACCAAGAGGGCAACATTATGCTGCCATGGCGGATCTTCTCGCGCGGGCAGGGAGGTTGCAAGAGGCATATAATTTTGTCGTCGATGCACCATGCAAGGAGCACGCTGTTATATGGGGTGCTTTGGTCGGGGGTTGTAAGGTTCACGAAGACATAGATTTGATGAAACATGCAGCAGCAAATTACTTGGCATTGGATGCTGGCAACGCCGGGAAGTATGTGGTTTTAGCAAATGGGTTTGCGGCGTCTGGCTTGTGGGATAATGTTGCGGAGATTAGAGGCATGATGAAGAAATCAGGAATGAATAAGGAACCTGGTTACAGCAGAATTGAGATACAACGTGAGTTTCACTTCTTTGTTAAAAGTGATAAATCTCACGAACAAGCCGAGGAGATTTATAGAACCATTCACAGCATCACTCCGATTTTAAAGGATGCAGGTTCTATTCATGAACTAAGTGAAAACTCATTGTAG

mRNA sequence

ATGCGCCTCTGTGGTCGGCCTTCTTCTTCCGGCGTCATTCATCTGTTCACCAAGTCGATTGTAGCCGGCGCAACCGCGACCATTCGTCGTCGGCATAAATCTGAATACGCCAACGACAGGTCTCAGGTGAAGCCACATCAGAAAGATTCCTCCTCCTGGGATAGAACCTTTAGGAGCCTATGTATAACGGGGAGATTGAGCGAGGCGGTTGCACTTTTGTGCTGTATGCCCTTCCGATTTCACTCCAAAACTTACTGCCTTCTGTTACAAGAATGCATTTTCAGGAAAGAGTATATGAAAGGAAAAAGAATCCATGCTCAAATGGTTGTTGTTGGACATTTACCCAATGAGTATCTCAAAACCAAACTGCTGATATTATATGCCAAATTAGGTGACTTAGAAACTGCAAATGTTCTTCATGAGAAATTGCTGGAGAACAGTCTGGTTTCATGGAATGCATTGATTGCTGGATATGTACAGAAAGGGTTTGGAGAAGTTGGATTGGAGCTTTACTTTAAGATGAGACGAACTGGTTTAATACCTGATCAATATACCTTTGCATCAGTTTTCAGAGCCTGTGCTAGCTTAGCTTCTTTGGAACATGGAAAGAGAGCTCATGGAGTTCTGATTAAGTGTCGAATCGGCGACAATGTTGTCGTGTCTAGTGCCCTTGTTGATATGTACTTCAAATGCAGTAGCATATCAGATGGTCATAAGGTATTTAACAAATCTACAACTAGAAATGTGATTACATGGACTGCTTTAATATCAGGGTATGGCCACCATGGAAGAGTTTCTGAAGTTTTGGAATCCTTCAATAGTATGATAAATGAAGGTTACCGACCAAATTACGTTACTTTCCTTGCGGTTCTTACTGCTTGTGGACATGTTGGTTTTGTATCGGAAGCATGGCGATACTTATCGTTAATGAAGACGACGTATGAAATAGAACCAAGAGGGCAACATTATGCTGCCATGGCGGATCTTCTCGCGCGGGCAGGGAGGTTGCAAGAGGCATATAATTTTGTCGTCGATGCACCATGCAAGGAGCACGCTGTTATATGGGGTGCTTTGGTCGGGGGTTGTAAGGTTCACGAAGACATAGATTTGATGAAACATGCAGCAGCAAATTACTTGGCATTGGATGCTGGCAACGCCGGGAAGTATGTGGTTTTAGCAAATGGGTTTGCGGCGTCTGGCTTGTGGGATAATGTTGCGGAGATTAGAGGCATGATGAAGAAATCAGGAATGAATAAGGAACCTGGTTACAGCAGAATTGAGATACAACGTGAGTTTCACTTCTTTGTTAAAAGTGATAAATCTCACGAACAAGCCGAGGAGATTTATAGAACCATTCACAGCATCACTCCGATTTTAAAGGATGCAGGTTCTATTCATGAACTAAGTGAAAACTCATTGTAG

Coding sequence (CDS)

ATGCGCCTCTGTGGTCGGCCTTCTTCTTCCGGCGTCATTCATCTGTTCACCAAGTCGATTGTAGCCGGCGCAACCGCGACCATTCGTCGTCGGCATAAATCTGAATACGCCAACGACAGGTCTCAGGTGAAGCCACATCAGAAAGATTCCTCCTCCTGGGATAGAACCTTTAGGAGCCTATGTATAACGGGGAGATTGAGCGAGGCGGTTGCACTTTTGTGCTGTATGCCCTTCCGATTTCACTCCAAAACTTACTGCCTTCTGTTACAAGAATGCATTTTCAGGAAAGAGTATATGAAAGGAAAAAGAATCCATGCTCAAATGGTTGTTGTTGGACATTTACCCAATGAGTATCTCAAAACCAAACTGCTGATATTATATGCCAAATTAGGTGACTTAGAAACTGCAAATGTTCTTCATGAGAAATTGCTGGAGAACAGTCTGGTTTCATGGAATGCATTGATTGCTGGATATGTACAGAAAGGGTTTGGAGAAGTTGGATTGGAGCTTTACTTTAAGATGAGACGAACTGGTTTAATACCTGATCAATATACCTTTGCATCAGTTTTCAGAGCCTGTGCTAGCTTAGCTTCTTTGGAACATGGAAAGAGAGCTCATGGAGTTCTGATTAAGTGTCGAATCGGCGACAATGTTGTCGTGTCTAGTGCCCTTGTTGATATGTACTTCAAATGCAGTAGCATATCAGATGGTCATAAGGTATTTAACAAATCTACAACTAGAAATGTGATTACATGGACTGCTTTAATATCAGGGTATGGCCACCATGGAAGAGTTTCTGAAGTTTTGGAATCCTTCAATAGTATGATAAATGAAGGTTACCGACCAAATTACGTTACTTTCCTTGCGGTTCTTACTGCTTGTGGACATGTTGGTTTTGTATCGGAAGCATGGCGATACTTATCGTTAATGAAGACGACGTATGAAATAGAACCAAGAGGGCAACATTATGCTGCCATGGCGGATCTTCTCGCGCGGGCAGGGAGGTTGCAAGAGGCATATAATTTTGTCGTCGATGCACCATGCAAGGAGCACGCTGTTATATGGGGTGCTTTGGTCGGGGGTTGTAAGGTTCACGAAGACATAGATTTGATGAAACATGCAGCAGCAAATTACTTGGCATTGGATGCTGGCAACGCCGGGAAGTATGTGGTTTTAGCAAATGGGTTTGCGGCGTCTGGCTTGTGGGATAATGTTGCGGAGATTAGAGGCATGATGAAGAAATCAGGAATGAATAAGGAACCTGGTTACAGCAGAATTGAGATACAACGTGAGTTTCACTTCTTTGTTAAAAGTGATAAATCTCACGAACAAGCCGAGGAGATTTATAGAACCATTCACAGCATCACTCCGATTTTAAAGGATGCAGGTTCTATTCATGAACTAAGTGAAAACTCATTGTAG

Protein sequence

MRLCGRPSSSGVIHLFTKSIVAGATATIRRRHKSEYANDRSQVKPHQKDSSSWDRTFRSLCITGRLSEAVALLCCMPFRFHSKTYCLLLQECIFRKEYMKGKRIHAQMVVVGHLPNEYLKTKLLILYAKLGDLETANVLHEKLLENSLVSWNALIAGYVQKGFGEVGLELYFKMRRTGLIPDQYTFASVFRACASLASLEHGKRAHGVLIKCRIGDNVVVSSALVDMYFKCSSISDGHKVFNKSTTRNVITWTALISGYGHHGRVSEVLESFNSMINEGYRPNYVTFLAVLTACGHVGFVSEAWRYLSLMKTTYEIEPRGQHYAAMADLLARAGRLQEAYNFVVDAPCKEHAVIWGALVGGCKVHEDIDLMKHAAANYLALDAGNAGKYVVLANGFAASGLWDNVAEIRGMMKKSGMNKEPGYSRIEIQREFHFFVKSDKSHEQAEEIYRTIHSITPILKDAGSIHELSENSL
Homology
BLAST of Csor.00g113810 vs. ExPASy Swiss-Prot
Match: O23491 (Pentatricopeptide repeat-containing protein At4g16470 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E12 PE=2 SV=2)

HSP 1 Score: 502.7 bits (1293), Expect = 4.6e-141
Identity = 247/443 (55.76%), Postives = 316/443 (71.33%), Query Frame = 0

Query: 19  SIVAGATATIRRRHKSEYANDRSQVKPHQKDSSSWDRTFRSLCITGRLSEAVALLCCMPF 78
           S+ +G   TI RR  +E    R QV+ +Q+ +   D+T + LC+TGRL EAV LL     
Sbjct: 46  SMFSGNATTILRRMLAEKRIGRFQVE-NQRKTEKLDKTLKGLCVTGRLKEAVGLLWSSGL 105

Query: 79  RFHSKTYCLLLQECIFRKEYMKGKRIHAQMVVVGHLPNEYLKTKLLILYAKLGDLETANV 138
           +   +TY +LLQEC  RKEY KGKRIHAQM VVG   NEYLK KLLILYA  GDL+TA +
Sbjct: 106 QVEPETYAVLLQECKQRKEYTKGKRIHAQMFVVGFALNEYLKVKLLILYALSGDLQTAGI 165

Query: 139 LHEKLLENSLVSWNALIAGYVQKGFGEVGLELYFKMRRTGLIPDQYTFASVFRACASLAS 198
           L   L    L+ WNA+I+GYVQKG  + GL +Y+ MR+  ++PDQYTFASVFRAC++L  
Sbjct: 166 LFRSLKIRDLIPWNAMISGYVQKGLEQEGLFIYYDMRQNRIVPDQYTFASVFRACSALDR 225

Query: 199 LEHGKRAHGVLIKCRIGDNVVVSSALVDMYFKCSSISDGHKVFNKSTTRNVITWTALISG 258
           LEHGKRAH V+IK  I  N++V SALVDMYFKCSS SDGH+VF++ +TRNVITWT+LISG
Sbjct: 226 LEHGKRAHAVMIKRCIKSNIIVDSALVDMYFKCSSFSDGHRVFDQLSTRNVITWTSLISG 285

Query: 259 YGHHGRVSEVLESFNSMINEGYRPNYVTFLAVLTACGHVGFVSEAWRYLSLMKTTYEIEP 318
           YG+HG+VSEVL+ F  M  EG RPN VTFL VLTAC H G V + W +   MK  Y IEP
Sbjct: 286 YGYHGKVSEVLKCFEKMKEEGCRPNPVTFLVVLTACNHGGLVDKGWEHFYSMKRDYGIEP 345

Query: 319 RGQHYAAMADLLARAGRLQEAYNFVVDAPCKEHAVIWGALVGGCKVHEDIDLMKHAAANY 378
            GQHYAAM D L RAGRLQEAY FV+ +PCKEH  +WG+L+G C++H ++ L++ AA  +
Sbjct: 346 EGQHYAAMVDTLGRAGRLQEAYEFVMKSPCKEHPPVWGSLLGACRIHGNVKLLELAATKF 405

Query: 379 LALDAGNAGKYVVLANGFAASGLWDNVAEIRGMMKKSGMNKEPGYSRIEIQREFHFFVKS 438
           L LD  N G YVV ANG+A+ GL +  +++R  M+ +G+ K+PGYS+IE+Q E H F+K 
Sbjct: 406 LELDPTNGGNYVVFANGYASCGLREAASKVRRKMENAGVKKDPGYSQIELQGEVHRFMKD 465

Query: 439 DKSHEQAEEIYRTIHSITPILKD 462
           D SH  +E+IY+ +H +T    D
Sbjct: 466 DTSHRLSEKIYKKVHEMTSFFMD 487

BLAST of Csor.00g113810 vs. ExPASy Swiss-Prot
Match: Q9LTV8 (Pentatricopeptide repeat-containing protein At3g12770 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H43 PE=2 SV=1)

HSP 1 Score: 283.9 bits (725), Expect = 3.4e-75
Identity = 142/382 (37.17%), Postives = 221/382 (57.85%), Query Frame = 0

Query: 96  KEYMKGKRIHAQMVVVGHLPNEYLKTKLLILYAKLGDLETANVLHEKLLENSLVSWNALI 155
           ++  +G+ IHA +V +G      L   L  +YAK G + TA +L +K+   +L+ WNA+I
Sbjct: 236 QDLKQGRSIHASVVKMGLEIEPDLLISLNTMYAKCGQVATAKILFDKMKSPNLILWNAMI 295

Query: 156 AGYVQKGFGEVGLELYFKMRRTGLIPDQYTFASVFRACASLASLEHGKRAHGVLIKCRIG 215
           +GY + G+    ++++ +M    + PD  +  S   ACA + SLE  +  +  + +    
Sbjct: 296 SGYAKNGYAREAIDMFHEMINKDVRPDTISITSAISACAQVGSLEQARSMYEYVGRSDYR 355

Query: 216 DNVVVSSALVDMYFKCSSISDGHKVFNKSTTRNVITWTALISGYGHHGRVSEVLESFNSM 275
           D+V +SSAL+DM+ KC S+     VF+++  R+V+ W+A+I GYG HGR  E +  + +M
Sbjct: 356 DDVFISSALIDMFAKCGSVEGARLVFDRTLDRDVVVWSAMIVGYGLHGRAREAISLYRAM 415

Query: 276 INEGYRPNYVTFLAVLTACGHVGFVSEAWRYLSLMKTTYEIEPRGQHYAAMADLLARAGR 335
              G  PN VTFL +L AC H G V E W + + M   ++I P+ QHYA + DLL RAG 
Sbjct: 416 ERGGVHPNDVTFLGLLMACNHSGMVREGWWFFNRM-ADHKINPQQQHYACVIDLLGRAGH 475

Query: 336 LQEAYNFVVDAPCKEHAVIWGALVGGCKVHEDIDLMKHAAANYLALDAGNAGKYVVLANG 395
           L +AY  +   P +    +WGAL+  CK H  ++L ++AA    ++D  N G YV L+N 
Sbjct: 476 LDQAYEVIKCMPVQPGVTVWGALLSACKKHRHVELGEYAAQQLFSIDPSNTGHYVQLSNL 535

Query: 396 FAASGLWDNVAEIRGMMKKSGMNKEPGYSRIEIQREFHFFVKSDKSHEQAEEIYRTIHSI 455
           +AA+ LWD VAE+R  MK+ G+NK+ G S +E++     F   DKSH + EEI R +  I
Sbjct: 536 YAAARLWDRVAEVRVRMKEKGLNKDVGCSWVEVRGRLEAFRVGDKSHPRYEEIERQVEWI 595

Query: 456 TPILKDAG-------SIHELSE 471
              LK+ G       S+H+L++
Sbjct: 596 ESRLKEGGFVANKDASLHDLND 616

BLAST of Csor.00g113810 vs. ExPASy Swiss-Prot
Match: Q9LW63 (Putative pentatricopeptide repeat-containing protein At3g23330 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H32 PE=3 SV=1)

HSP 1 Score: 283.1 bits (723), Expect = 5.8e-75
Identity = 137/373 (36.73%), Postives = 219/373 (58.71%), Query Frame = 0

Query: 97  EYMKGKRIHAQMVVVGHLPNEYLKTKLLILYAKLGDLETANVLHEKLLENSLVSWNALIA 156
           + +KGK IH  ++  G   + Y+ + L+ +YAK   +E +  +  +L     +SWN+L+A
Sbjct: 257 DVIKGKEIHGYVIRKGIDSDVYIGSSLVDMYAKSARIEDSERVFSRLYCRDGISWNSLVA 316

Query: 157 GYVQKGFGEVGLELYFKMRRTGLIPDQYTFASVFRACASLASLEHGKRAHGVLIKCRIGD 216
           GYVQ G     L L+ +M    + P    F+SV  ACA LA+L  GK+ HG +++   G 
Sbjct: 317 GYVQNGRYNEALRLFRQMVTAKVKPGAVAFSSVIPACAHLATLHLGKQLHGYVLRGGFGS 376

Query: 217 NVVVSSALVDMYFKCSSISDGHKVFNKSTTRNVITWTALISGYGHHGRVSEVLESFNSMI 276
           N+ ++SALVDMY KC +I    K+F++    + ++WTA+I G+  HG   E +  F  M 
Sbjct: 377 NIFIASALVDMYSKCGNIKAARKIFDRMNVLDEVSWTAIIMGHALHGHGHEAVSLFEEMK 436

Query: 277 NEGYRPNYVTFLAVLTACGHVGFVSEAWRYLSLMKTTYEIEPRGQHYAAMADLLARAGRL 336
            +G +PN V F+AVLTAC HVG V EAW Y + M   Y +    +HYAA+ADLL RAG+L
Sbjct: 437 RQGVKPNQVAFVAVLTACSHVGLVDEAWGYFNSMTKVYGLNQELEHYAAVADLLGRAGKL 496

Query: 337 QEAYNFVVDAPCKEHAVIWGALVGGCKVHEDIDLMKHAAANYLALDAGNAGKYVVLANGF 396
           +EAYNF+     +    +W  L+  C VH++++L +  A     +D+ N G YV++ N +
Sbjct: 497 EEAYNFISKMCVEPTGSVWSTLLSSCSVHKNLELAEKVAEKIFTVDSENMGAYVLMCNMY 556

Query: 397 AASGLWDNVAEIRGMMKKSGMNKEPGYSRIEIQREFHFFVKSDKSHEQAEEIYRTIHSIT 456
           A++G W  +A++R  M+K G+ K+P  S IE++ + H FV  D+SH   ++I   + ++ 
Sbjct: 557 ASNGRWKEMAKLRLRMRKKGLRKKPACSWIEMKNKTHGFVSGDRSHPSMDKINEFLKAVM 616

Query: 457 PILKDAGSIHELS 470
             ++  G + + S
Sbjct: 617 EQMEKEGYVADTS 629

BLAST of Csor.00g113810 vs. ExPASy Swiss-Prot
Match: Q9SI53 (Pentatricopeptide repeat-containing protein At2g03880, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-H44 PE=2 SV=1)

HSP 1 Score: 282.0 bits (720), Expect = 1.3e-74
Identity = 141/384 (36.72%), Postives = 226/384 (58.85%), Query Frame = 0

Query: 84  TYCLLLQECIFRKEYMKGKRIHAQMVVVGHLPNEYLKTKLLILYAKLGDLETANVLHEKL 143
           TY  +L+ C    +    + +H  ++  G   + ++++ L+ ++AKLG+ E A  + +++
Sbjct: 164 TYSSVLRSCNGMSDV---RMLHCGIIKEGLESDVFVRSALIDVFAKLGEPEDALSVFDEM 223

Query: 144 LENSLVSWNALIAGYVQKGFGEVGLELYFKMRRTGLIPDQYTFASVFRACASLASLEHGK 203
           +    + WN++I G+ Q    +V LEL+ +M+R G I +Q T  SV RAC  LA LE G 
Sbjct: 224 VTGDAIVWNSIIGGFAQNSRSDVALELFKRMKRAGFIAEQATLTSVLRACTGLALLELGM 283

Query: 204 RAHGVLIKCRIGDNVVVSSALVDMYFKCSSISDGHKVFNKSTTRNVITWTALISGYGHHG 263
           +AH  ++K     ++++++ALVDMY KC S+ D  +VFN+   R+VITW+ +ISG   +G
Sbjct: 284 QAHVHIVK--YDQDLILNNALVDMYCKCGSLEDALRVFNQMKERDVITWSTMISGLAQNG 343

Query: 264 RVSEVLESFNSMINEGYRPNYVTFLAVLTACGHVGFVSEAWRYLSLMKTTYEIEPRGQHY 323
              E L+ F  M + G +PNY+T + VL AC H G + + W Y   MK  Y I+P  +HY
Sbjct: 344 YSQEALKLFERMKSSGTKPNYITIVGVLFACSHAGLLEDGWYYFRSMKKLYGIDPVREHY 403

Query: 324 AAMADLLARAGRLQEAYNFVVDAPCKEHAVIWGALVGGCKVHEDIDLMKHAAANYLALDA 383
             M DLL +AG+L +A   + +  C+  AV W  L+G C+V  ++ L ++AA   +ALD 
Sbjct: 404 GCMIDLLGKAGKLDDAVKLLNEMECEPDAVTWRTLLGACRVQRNMVLAEYAAKKVIALDP 463

Query: 384 GNAGKYVVLANGFAASGLWDNVAEIRGMMKKSGMNKEPGYSRIEIQREFHFFVKSDKSHE 443
            +AG Y +L+N +A S  WD+V EIR  M+  G+ KEPG S IE+ ++ H F+  D SH 
Sbjct: 464 EDAGTYTLLSNIYANSQKWDSVEEIRTRMRDRGIKKEPGCSWIEVNKQIHAFIIGDNSHP 523

Query: 444 QAEEIYRTIHSITPILKDAGSIHE 468
           Q  E+ + ++ +   L   G + E
Sbjct: 524 QIVEVSKKLNQLIHRLTGIGYVPE 542

BLAST of Csor.00g113810 vs. ExPASy Swiss-Prot
Match: Q9SIT7 (Pentatricopeptide repeat-containing protein At2g13600 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E76 PE=3 SV=1)

HSP 1 Score: 280.0 bits (715), Expect = 4.9e-74
Identity = 141/370 (38.11%), Postives = 214/370 (57.84%), Query Frame = 0

Query: 98  YMKGKRIHAQMVVVGHLP--NEYLKTKLLILYAKLGDLETANVLHEKLLENSLVSWNALI 157
           Y K  RI     +   +P  N   +T ++  YA     + A ++  K+ E ++VSWNALI
Sbjct: 299 YAKCSRIKEARFIFDSMPIRNVIAETSMISGYAMAASTKAARLMFTKMAERNVVSWNALI 358

Query: 158 AGYVQKGFGEVGLELYFKMRRTGLIPDQYTFASVFRACASLASLEHGKRA------HGVL 217
           AGY Q G  E  L L+  ++R  + P  Y+FA++ +ACA LA L  G +A      HG  
Sbjct: 359 AGYTQNGENEEALSLFCLLKRESVCPTHYSFANILKACADLAELHLGMQAHVHVLKHGFK 418

Query: 218 IKCRIGDNVVVSSALVDMYFKCSSISDGHKVFNKSTTRNVITWTALISGYGHHGRVSEVL 277
            +    D++ V ++L+DMY KC  + +G+ VF K   R+ ++W A+I G+  +G  +E L
Sbjct: 419 FQSGEEDDIFVGNSLIDMYVKCGCVEEGYLVFRKMMERDCVSWNAMIIGFAQNGYGNEAL 478

Query: 278 ESFNSMINEGYRPNYVTFLAVLTACGHVGFVSEAWRYLSLMKTTYEIEPRGQHYAAMADL 337
           E F  M+  G +P+++T + VL+ACGH GFV E   Y S M   + + P   HY  M DL
Sbjct: 479 ELFREMLESGEKPDHITMIGVLSACGHAGFVEEGRHYFSSMTRDFGVAPLRDHYTCMVDL 538

Query: 338 LARAGRLQEAYNFVVDAPCKEHAVIWGALVGGCKVHEDIDLMKHAAANYLALDAGNAGKY 397
           L RAG L+EA + + + P +  +VIWG+L+  CKVH +I L K+ A   L ++  N+G Y
Sbjct: 539 LGRAGFLEEAKSMIEEMPMQPDSVIWGSLLAACKVHRNITLGKYVAEKLLEVEPSNSGPY 598

Query: 398 VVLANGFAASGLWDNVAEIRGMMKKSGMNKEPGYSRIEIQREFHFFVKSDKSHEQAEEIY 457
           V+L+N +A  G W++V  +R  M+K G+ K+PG S I+IQ   H F+  DKSH + ++  
Sbjct: 599 VLLSNMYAELGKWEDVMNVRKSMRKEGVTKQPGCSWIKIQGHDHVFMVKDKSHPRKKQ-- 658

Query: 458 RTIHSITPIL 460
             IHS+  IL
Sbjct: 659 --IHSLLDIL 664

BLAST of Csor.00g113810 vs. NCBI nr
Match: KAG6594700.1 (Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 956 bits (2472), Expect = 0.0
Identity = 473/473 (100.00%), Postives = 473/473 (100.00%), Query Frame = 0

Query: 1   MRLCGRPSSSGVIHLFTKSIVAGATATIRRRHKSEYANDRSQVKPHQKDSSSWDRTFRSL 60
           MRLCGRPSSSGVIHLFTKSIVAGATATIRRRHKSEYANDRSQVKPHQKDSSSWDRTFRSL
Sbjct: 1   MRLCGRPSSSGVIHLFTKSIVAGATATIRRRHKSEYANDRSQVKPHQKDSSSWDRTFRSL 60

Query: 61  CITGRLSEAVALLCCMPFRFHSKTYCLLLQECIFRKEYMKGKRIHAQMVVVGHLPNEYLK 120
           CITGRLSEAVALLCCMPFRFHSKTYCLLLQECIFRKEYMKGKRIHAQMVVVGHLPNEYLK
Sbjct: 61  CITGRLSEAVALLCCMPFRFHSKTYCLLLQECIFRKEYMKGKRIHAQMVVVGHLPNEYLK 120

Query: 121 TKLLILYAKLGDLETANVLHEKLLENSLVSWNALIAGYVQKGFGEVGLELYFKMRRTGLI 180
           TKLLILYAKLGDLETANVLHEKLLENSLVSWNALIAGYVQKGFGEVGLELYFKMRRTGLI
Sbjct: 121 TKLLILYAKLGDLETANVLHEKLLENSLVSWNALIAGYVQKGFGEVGLELYFKMRRTGLI 180

Query: 181 PDQYTFASVFRACASLASLEHGKRAHGVLIKCRIGDNVVVSSALVDMYFKCSSISDGHKV 240
           PDQYTFASVFRACASLASLEHGKRAHGVLIKCRIGDNVVVSSALVDMYFKCSSISDGHKV
Sbjct: 181 PDQYTFASVFRACASLASLEHGKRAHGVLIKCRIGDNVVVSSALVDMYFKCSSISDGHKV 240

Query: 241 FNKSTTRNVITWTALISGYGHHGRVSEVLESFNSMINEGYRPNYVTFLAVLTACGHVGFV 300
           FNKSTTRNVITWTALISGYGHHGRVSEVLESFNSMINEGYRPNYVTFLAVLTACGHVGFV
Sbjct: 241 FNKSTTRNVITWTALISGYGHHGRVSEVLESFNSMINEGYRPNYVTFLAVLTACGHVGFV 300

Query: 301 SEAWRYLSLMKTTYEIEPRGQHYAAMADLLARAGRLQEAYNFVVDAPCKEHAVIWGALVG 360
           SEAWRYLSLMKTTYEIEPRGQHYAAMADLLARAGRLQEAYNFVVDAPCKEHAVIWGALVG
Sbjct: 301 SEAWRYLSLMKTTYEIEPRGQHYAAMADLLARAGRLQEAYNFVVDAPCKEHAVIWGALVG 360

Query: 361 GCKVHEDIDLMKHAAANYLALDAGNAGKYVVLANGFAASGLWDNVAEIRGMMKKSGMNKE 420
           GCKVHEDIDLMKHAAANYLALDAGNAGKYVVLANGFAASGLWDNVAEIRGMMKKSGMNKE
Sbjct: 361 GCKVHEDIDLMKHAAANYLALDAGNAGKYVVLANGFAASGLWDNVAEIRGMMKKSGMNKE 420

Query: 421 PGYSRIEIQREFHFFVKSDKSHEQAEEIYRTIHSITPILKDAGSIHELSENSL 473
           PGYSRIEIQREFHFFVKSDKSHEQAEEIYRTIHSITPILKDAGSIHELSENSL
Sbjct: 421 PGYSRIEIQREFHFFVKSDKSHEQAEEIYRTIHSITPILKDAGSIHELSENSL 473

BLAST of Csor.00g113810 vs. NCBI nr
Match: KAG7026668.1 (Scarecrow-like protein 13, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 954 bits (2467), Expect = 0.0
Identity = 472/473 (99.79%), Postives = 472/473 (99.79%), Query Frame = 0

Query: 1   MRLCGRPSSSGVIHLFTKSIVAGATATIRRRHKSEYANDRSQVKPHQKDSSSWDRTFRSL 60
           MRLCGRPSSSGVIHLFTKSIVAGATATIRRRHKSEYANDRSQVKPHQKDSSSWDRTFRSL
Sbjct: 506 MRLCGRPSSSGVIHLFTKSIVAGATATIRRRHKSEYANDRSQVKPHQKDSSSWDRTFRSL 565

Query: 61  CITGRLSEAVALLCCMPFRFHSKTYCLLLQECIFRKEYMKGKRIHAQMVVVGHLPNEYLK 120
           CITGRLSEAVALLCCMPFRFHSKTYCLLLQECIFRKEYMKGKRIHAQMVVVGHLPNEYLK
Sbjct: 566 CITGRLSEAVALLCCMPFRFHSKTYCLLLQECIFRKEYMKGKRIHAQMVVVGHLPNEYLK 625

Query: 121 TKLLILYAKLGDLETANVLHEKLLENSLVSWNALIAGYVQKGFGEVGLELYFKMRRTGLI 180
           TKLLILYAKLGDLETANVLHEKLLENSLVSWNALIAGYVQKGFGEVGLELYFKMRRTGLI
Sbjct: 626 TKLLILYAKLGDLETANVLHEKLLENSLVSWNALIAGYVQKGFGEVGLELYFKMRRTGLI 685

Query: 181 PDQYTFASVFRACASLASLEHGKRAHGVLIKCRIGDNVVVSSALVDMYFKCSSISDGHKV 240
           PDQYTFASVFRACASLASLEHGKRAHGVLIKCRIGDNVVVSSALVDMYFKCSSISDGHKV
Sbjct: 686 PDQYTFASVFRACASLASLEHGKRAHGVLIKCRIGDNVVVSSALVDMYFKCSSISDGHKV 745

Query: 241 FNKSTTRNVITWTALISGYGHHGRVSEVLESFNSMINEGYRPNYVTFLAVLTACGHVGFV 300
           FNKSTTRNVITWTALISGYGHHGRVSEVLESFN MINEGYRPNYVTFLAVLTACGHVGFV
Sbjct: 746 FNKSTTRNVITWTALISGYGHHGRVSEVLESFNRMINEGYRPNYVTFLAVLTACGHVGFV 805

Query: 301 SEAWRYLSLMKTTYEIEPRGQHYAAMADLLARAGRLQEAYNFVVDAPCKEHAVIWGALVG 360
           SEAWRYLSLMKTTYEIEPRGQHYAAMADLLARAGRLQEAYNFVVDAPCKEHAVIWGALVG
Sbjct: 806 SEAWRYLSLMKTTYEIEPRGQHYAAMADLLARAGRLQEAYNFVVDAPCKEHAVIWGALVG 865

Query: 361 GCKVHEDIDLMKHAAANYLALDAGNAGKYVVLANGFAASGLWDNVAEIRGMMKKSGMNKE 420
           GCKVHEDIDLMKHAAANYLALDAGNAGKYVVLANGFAASGLWDNVAEIRGMMKKSGMNKE
Sbjct: 866 GCKVHEDIDLMKHAAANYLALDAGNAGKYVVLANGFAASGLWDNVAEIRGMMKKSGMNKE 925

Query: 421 PGYSRIEIQREFHFFVKSDKSHEQAEEIYRTIHSITPILKDAGSIHELSENSL 473
           PGYSRIEIQREFHFFVKSDKSHEQAEEIYRTIHSITPILKDAGSIHELSENSL
Sbjct: 926 PGYSRIEIQREFHFFVKSDKSHEQAEEIYRTIHSITPILKDAGSIHELSENSL 978

BLAST of Csor.00g113810 vs. NCBI nr
Match: XP_022926503.1 (pentatricopeptide repeat-containing protein At4g16470 [Cucurbita moschata])

HSP 1 Score: 934 bits (2415), Expect = 0.0
Identity = 462/473 (97.67%), Postives = 467/473 (98.73%), Query Frame = 0

Query: 1   MRLCGRPSSSGVIHLFTKSIVAGATATIRRRHKSEYANDRSQVKPHQKDSSSWDRTFRSL 60
           MRLCGRPSSSGVIHLFTKSIVAGATATIRRRHKSEYANDR QVKPHQKDSSSWDRTFRSL
Sbjct: 1   MRLCGRPSSSGVIHLFTKSIVAGATATIRRRHKSEYANDRFQVKPHQKDSSSWDRTFRSL 60

Query: 61  CITGRLSEAVALLCCMPFRFHSKTYCLLLQECIFRKEYMKGKRIHAQMVVVGHLPNEYLK 120
           CITGRL+EAVALLCCMPF+FHSKTYCLLLQECIFRKEYMKGKRIHAQMVVVGHLPNEYLK
Sbjct: 61  CITGRLTEAVALLCCMPFQFHSKTYCLLLQECIFRKEYMKGKRIHAQMVVVGHLPNEYLK 120

Query: 121 TKLLILYAKLGDLETANVLHEKLLENSLVSWNALIAGYVQKGFGEVGLELYFKMRRTGLI 180
           TKLLILYAKLGDLETAN+LHEKLLENSLVSWNALIAGYVQKGFGEVGLELYFKMRRTGLI
Sbjct: 121 TKLLILYAKLGDLETANILHEKLLENSLVSWNALIAGYVQKGFGEVGLELYFKMRRTGLI 180

Query: 181 PDQYTFASVFRACASLASLEHGKRAHGVLIKCRIGDNVVVSSALVDMYFKCSSISDGHKV 240
           PDQYTFASVFRACASLASLEHGKRAHGVLIKCRIGDNVVVSSALVDMYFKCSSI DGHKV
Sbjct: 181 PDQYTFASVFRACASLASLEHGKRAHGVLIKCRIGDNVVVSSALVDMYFKCSSILDGHKV 240

Query: 241 FNKSTTRNVITWTALISGYGHHGRVSEVLESFNSMINEGYRPNYVTFLAVLTACGHVGFV 300
           FNKSTTRNVITWTALISGYGHHGRVSEVLESFN MINEGYRPNYVTFLAVLTACGH GFV
Sbjct: 241 FNKSTTRNVITWTALISGYGHHGRVSEVLESFNRMINEGYRPNYVTFLAVLTACGHGGFV 300

Query: 301 SEAWRYLSLMKTTYEIEPRGQHYAAMADLLARAGRLQEAYNFVVDAPCKEHAVIWGALVG 360
           SEAWRYLSLMKTTYEIEPRGQHYAAMADLLARAGRLQEAY+FVVDAPCKEHAVIWGALVG
Sbjct: 301 SEAWRYLSLMKTTYEIEPRGQHYAAMADLLARAGRLQEAYDFVVDAPCKEHAVIWGALVG 360

Query: 361 GCKVHEDIDLMKHAAANYLALDAGNAGKYVVLANGFAASGLWDNVAEIRGMMKKSGMNKE 420
           GCKVHEDIDLMKHAAANYLALDAGNAGKYVVLANGFAASGLWDNVAEIRGMMKKSGMNKE
Sbjct: 361 GCKVHEDIDLMKHAAANYLALDAGNAGKYVVLANGFAASGLWDNVAEIRGMMKKSGMNKE 420

Query: 421 PGYSRIEIQREFHFFVKSDKSHEQAEEIYRTIHSITPILKDAGSIHELSENSL 473
           PGYSRIEIQREFHFFVKSDKSH+QAEEIYRTIHSIT ILKDAGSI ELSENSL
Sbjct: 421 PGYSRIEIQREFHFFVKSDKSHKQAEEIYRTIHSITAILKDAGSIRELSENSL 473

BLAST of Csor.00g113810 vs. NCBI nr
Match: XP_023518629.1 (pentatricopeptide repeat-containing protein At4g16470 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 932 bits (2410), Expect = 0.0
Identity = 458/471 (97.24%), Postives = 466/471 (98.94%), Query Frame = 0

Query: 1   MRLCGRPSSSGVIHLFTKSIVAGATATIRRRHKSEYANDRSQVKPHQKDSSSWDRTFRSL 60
           MRLCGRPSSSGV+HLFTKSIVAGATATIRRRHKSEY NDRSQVKPHQKDSSSWDRTFRSL
Sbjct: 1   MRLCGRPSSSGVVHLFTKSIVAGATATIRRRHKSEYVNDRSQVKPHQKDSSSWDRTFRSL 60

Query: 61  CITGRLSEAVALLCCMPFRFHSKTYCLLLQECIFRKEYMKGKRIHAQMVVVGHLPNEYLK 120
           CITGRLSEAVALLCCMPFRFHSKTYCLLLQECIFRKEYMKGKRIHAQMVVVGHLPNEYLK
Sbjct: 61  CITGRLSEAVALLCCMPFRFHSKTYCLLLQECIFRKEYMKGKRIHAQMVVVGHLPNEYLK 120

Query: 121 TKLLILYAKLGDLETANVLHEKLLENSLVSWNALIAGYVQKGFGEVGLELYFKMRRTGLI 180
           TKLLILYAKLGDLETAN+LHEKLLENSLVSWNALIAGYVQKGFGEVGLE+YFKMRRTGL+
Sbjct: 121 TKLLILYAKLGDLETANILHEKLLENSLVSWNALIAGYVQKGFGEVGLEIYFKMRRTGLM 180

Query: 181 PDQYTFASVFRACASLASLEHGKRAHGVLIKCRIGDNVVVSSALVDMYFKCSSISDGHKV 240
           PDQYTFASVFRACASLASLEHGKRAHGVLIKCRIGDNVVVSSALVDMYFKCSSISDG KV
Sbjct: 181 PDQYTFASVFRACASLASLEHGKRAHGVLIKCRIGDNVVVSSALVDMYFKCSSISDGRKV 240

Query: 241 FNKSTTRNVITWTALISGYGHHGRVSEVLESFNSMINEGYRPNYVTFLAVLTACGHVGFV 300
           FNKS+TRNVITWTALISGYGHHGRVSEVLESFN+MINEGYRPNYVTFLAVLTACGH GFV
Sbjct: 241 FNKSSTRNVITWTALISGYGHHGRVSEVLESFNNMINEGYRPNYVTFLAVLTACGHGGFV 300

Query: 301 SEAWRYLSLMKTTYEIEPRGQHYAAMADLLARAGRLQEAYNFVVDAPCKEHAVIWGALVG 360
           SEAWRYLSLMKTTYEIEPRGQHYAAMADLLARAGRLQEAY+FVVDAPCKEHAVIWGALVG
Sbjct: 301 SEAWRYLSLMKTTYEIEPRGQHYAAMADLLARAGRLQEAYDFVVDAPCKEHAVIWGALVG 360

Query: 361 GCKVHEDIDLMKHAAANYLALDAGNAGKYVVLANGFAASGLWDNVAEIRGMMKKSGMNKE 420
           GCKVHEDIDLMKHAAANYLALDAGNAGKYVVLANGFAASGLWDNVAEIRGMMKKSGMNKE
Sbjct: 361 GCKVHEDIDLMKHAAANYLALDAGNAGKYVVLANGFAASGLWDNVAEIRGMMKKSGMNKE 420

Query: 421 PGYSRIEIQREFHFFVKSDKSHEQAEEIYRTIHSITPILKDAGSIHELSEN 471
           PGYSRIEIQREFHFFVKSDKSHEQAEEIYRTIHSITPI+KDAGS  ELSEN
Sbjct: 421 PGYSRIEIQREFHFFVKSDKSHEQAEEIYRTIHSITPIIKDAGSFPELSEN 471

BLAST of Csor.00g113810 vs. NCBI nr
Match: XP_023003064.1 (pentatricopeptide repeat-containing protein At4g16470 [Cucurbita maxima])

HSP 1 Score: 905 bits (2340), Expect = 0.0
Identity = 450/471 (95.54%), Postives = 460/471 (97.66%), Query Frame = 0

Query: 1   MRLCGRPSSSGVIHLFTKSIVAGATATIRRRHKSEYANDRSQVKPHQKDSSSWDRTFRSL 60
           MRLCGR SSSGVIHLFTKSIVAGATATIRRRHKSEYANDRSQVKPHQKDSSSWDRTFRSL
Sbjct: 1   MRLCGRSSSSGVIHLFTKSIVAGATATIRRRHKSEYANDRSQVKPHQKDSSSWDRTFRSL 60

Query: 61  CITGRLSEAVALLCCMPFRFHSKTYCLLLQECIFRKEYMKGKRIHAQMVVVGHLPNEYLK 120
           CITGRLSEAVALLC MPF+FHSKTYCLLLQECIFRKEYMKGKRIHAQMVVVGHLPNEYLK
Sbjct: 61  CITGRLSEAVALLCSMPFQFHSKTYCLLLQECIFRKEYMKGKRIHAQMVVVGHLPNEYLK 120

Query: 121 TKLLILYAKLGDLETANVLHEKLLENSLVSWNALIAGYVQKGFGEVGLELYFKMRRTGLI 180
           TKLLILYAKLGDLETA +LHEKLL+NSLVSWNALIAG VQKG GEVGLELYFKMRRTGLI
Sbjct: 121 TKLLILYAKLGDLETAYILHEKLLDNSLVSWNALIAGCVQKGLGEVGLELYFKMRRTGLI 180

Query: 181 PDQYTFASVFRACASLASLEHGKRAHGVLIKCRIGDNVVVSSALVDMYFKCSSISDGHKV 240
           PDQYTFASV RACASLASLEHGKRAHGVLIKC+IGDNVVVSSALVDMYFKCSSISDGHKV
Sbjct: 181 PDQYTFASVIRACASLASLEHGKRAHGVLIKCQIGDNVVVSSALVDMYFKCSSISDGHKV 240

Query: 241 FNKSTTRNVITWTALISGYGHHGRVSEVLESFNSMINEGYRPNYVTFLAVLTACGHVGFV 300
           F+KS+TRNVITWTALISGYGHHGRVSEVLESFNSMINEGYRPNYVTFLAVLTACGH GFV
Sbjct: 241 FDKSSTRNVITWTALISGYGHHGRVSEVLESFNSMINEGYRPNYVTFLAVLTACGHGGFV 300

Query: 301 SEAWRYLSLMKTTYEIEPRGQHYAAMADLLARAGRLQEAYNFVVDAPCKEHAVIWGALVG 360
           SEAWRYLSLMKTTYEIEPRGQHYAAMADLLARAGRLQEAY+FVVDAPCKEH+VIWGALVG
Sbjct: 301 SEAWRYLSLMKTTYEIEPRGQHYAAMADLLARAGRLQEAYDFVVDAPCKEHSVIWGALVG 360

Query: 361 GCKVHEDIDLMKHAAANYLALDAGNAGKYVVLANGFAASGLWDNVAEIRGMMKKSGMNKE 420
           GCKVHEDIDLMKHAAA+YLALDA NAGKYVVLANGFAASGLWDNVAEIR MMKKSGMNKE
Sbjct: 361 GCKVHEDIDLMKHAAAHYLALDASNAGKYVVLANGFAASGLWDNVAEIRCMMKKSGMNKE 420

Query: 421 PGYSRIEIQREFHFFVKSDKSHEQAEEIYRTIHSITPILKDAGSIHELSEN 471
           PGYSRIEIQREFHFFVKSDKSH+QA EIYRTIHSITPILKDAGSI ELSEN
Sbjct: 421 PGYSRIEIQREFHFFVKSDKSHKQAVEIYRTIHSITPILKDAGSIPELSEN 471

BLAST of Csor.00g113810 vs. ExPASy TrEMBL
Match: A0A6J1EI86 (pentatricopeptide repeat-containing protein At4g16470 OS=Cucurbita moschata OX=3662 GN=LOC111433634 PE=4 SV=1)

HSP 1 Score: 934 bits (2415), Expect = 0.0
Identity = 462/473 (97.67%), Postives = 467/473 (98.73%), Query Frame = 0

Query: 1   MRLCGRPSSSGVIHLFTKSIVAGATATIRRRHKSEYANDRSQVKPHQKDSSSWDRTFRSL 60
           MRLCGRPSSSGVIHLFTKSIVAGATATIRRRHKSEYANDR QVKPHQKDSSSWDRTFRSL
Sbjct: 1   MRLCGRPSSSGVIHLFTKSIVAGATATIRRRHKSEYANDRFQVKPHQKDSSSWDRTFRSL 60

Query: 61  CITGRLSEAVALLCCMPFRFHSKTYCLLLQECIFRKEYMKGKRIHAQMVVVGHLPNEYLK 120
           CITGRL+EAVALLCCMPF+FHSKTYCLLLQECIFRKEYMKGKRIHAQMVVVGHLPNEYLK
Sbjct: 61  CITGRLTEAVALLCCMPFQFHSKTYCLLLQECIFRKEYMKGKRIHAQMVVVGHLPNEYLK 120

Query: 121 TKLLILYAKLGDLETANVLHEKLLENSLVSWNALIAGYVQKGFGEVGLELYFKMRRTGLI 180
           TKLLILYAKLGDLETAN+LHEKLLENSLVSWNALIAGYVQKGFGEVGLELYFKMRRTGLI
Sbjct: 121 TKLLILYAKLGDLETANILHEKLLENSLVSWNALIAGYVQKGFGEVGLELYFKMRRTGLI 180

Query: 181 PDQYTFASVFRACASLASLEHGKRAHGVLIKCRIGDNVVVSSALVDMYFKCSSISDGHKV 240
           PDQYTFASVFRACASLASLEHGKRAHGVLIKCRIGDNVVVSSALVDMYFKCSSI DGHKV
Sbjct: 181 PDQYTFASVFRACASLASLEHGKRAHGVLIKCRIGDNVVVSSALVDMYFKCSSILDGHKV 240

Query: 241 FNKSTTRNVITWTALISGYGHHGRVSEVLESFNSMINEGYRPNYVTFLAVLTACGHVGFV 300
           FNKSTTRNVITWTALISGYGHHGRVSEVLESFN MINEGYRPNYVTFLAVLTACGH GFV
Sbjct: 241 FNKSTTRNVITWTALISGYGHHGRVSEVLESFNRMINEGYRPNYVTFLAVLTACGHGGFV 300

Query: 301 SEAWRYLSLMKTTYEIEPRGQHYAAMADLLARAGRLQEAYNFVVDAPCKEHAVIWGALVG 360
           SEAWRYLSLMKTTYEIEPRGQHYAAMADLLARAGRLQEAY+FVVDAPCKEHAVIWGALVG
Sbjct: 301 SEAWRYLSLMKTTYEIEPRGQHYAAMADLLARAGRLQEAYDFVVDAPCKEHAVIWGALVG 360

Query: 361 GCKVHEDIDLMKHAAANYLALDAGNAGKYVVLANGFAASGLWDNVAEIRGMMKKSGMNKE 420
           GCKVHEDIDLMKHAAANYLALDAGNAGKYVVLANGFAASGLWDNVAEIRGMMKKSGMNKE
Sbjct: 361 GCKVHEDIDLMKHAAANYLALDAGNAGKYVVLANGFAASGLWDNVAEIRGMMKKSGMNKE 420

Query: 421 PGYSRIEIQREFHFFVKSDKSHEQAEEIYRTIHSITPILKDAGSIHELSENSL 473
           PGYSRIEIQREFHFFVKSDKSH+QAEEIYRTIHSIT ILKDAGSI ELSENSL
Sbjct: 421 PGYSRIEIQREFHFFVKSDKSHKQAEEIYRTIHSITAILKDAGSIRELSENSL 473

BLAST of Csor.00g113810 vs. ExPASy TrEMBL
Match: A0A6J1KS92 (pentatricopeptide repeat-containing protein At4g16470 OS=Cucurbita maxima OX=3661 GN=LOC111496780 PE=4 SV=1)

HSP 1 Score: 905 bits (2340), Expect = 0.0
Identity = 450/471 (95.54%), Postives = 460/471 (97.66%), Query Frame = 0

Query: 1   MRLCGRPSSSGVIHLFTKSIVAGATATIRRRHKSEYANDRSQVKPHQKDSSSWDRTFRSL 60
           MRLCGR SSSGVIHLFTKSIVAGATATIRRRHKSEYANDRSQVKPHQKDSSSWDRTFRSL
Sbjct: 1   MRLCGRSSSSGVIHLFTKSIVAGATATIRRRHKSEYANDRSQVKPHQKDSSSWDRTFRSL 60

Query: 61  CITGRLSEAVALLCCMPFRFHSKTYCLLLQECIFRKEYMKGKRIHAQMVVVGHLPNEYLK 120
           CITGRLSEAVALLC MPF+FHSKTYCLLLQECIFRKEYMKGKRIHAQMVVVGHLPNEYLK
Sbjct: 61  CITGRLSEAVALLCSMPFQFHSKTYCLLLQECIFRKEYMKGKRIHAQMVVVGHLPNEYLK 120

Query: 121 TKLLILYAKLGDLETANVLHEKLLENSLVSWNALIAGYVQKGFGEVGLELYFKMRRTGLI 180
           TKLLILYAKLGDLETA +LHEKLL+NSLVSWNALIAG VQKG GEVGLELYFKMRRTGLI
Sbjct: 121 TKLLILYAKLGDLETAYILHEKLLDNSLVSWNALIAGCVQKGLGEVGLELYFKMRRTGLI 180

Query: 181 PDQYTFASVFRACASLASLEHGKRAHGVLIKCRIGDNVVVSSALVDMYFKCSSISDGHKV 240
           PDQYTFASV RACASLASLEHGKRAHGVLIKC+IGDNVVVSSALVDMYFKCSSISDGHKV
Sbjct: 181 PDQYTFASVIRACASLASLEHGKRAHGVLIKCQIGDNVVVSSALVDMYFKCSSISDGHKV 240

Query: 241 FNKSTTRNVITWTALISGYGHHGRVSEVLESFNSMINEGYRPNYVTFLAVLTACGHVGFV 300
           F+KS+TRNVITWTALISGYGHHGRVSEVLESFNSMINEGYRPNYVTFLAVLTACGH GFV
Sbjct: 241 FDKSSTRNVITWTALISGYGHHGRVSEVLESFNSMINEGYRPNYVTFLAVLTACGHGGFV 300

Query: 301 SEAWRYLSLMKTTYEIEPRGQHYAAMADLLARAGRLQEAYNFVVDAPCKEHAVIWGALVG 360
           SEAWRYLSLMKTTYEIEPRGQHYAAMADLLARAGRLQEAY+FVVDAPCKEH+VIWGALVG
Sbjct: 301 SEAWRYLSLMKTTYEIEPRGQHYAAMADLLARAGRLQEAYDFVVDAPCKEHSVIWGALVG 360

Query: 361 GCKVHEDIDLMKHAAANYLALDAGNAGKYVVLANGFAASGLWDNVAEIRGMMKKSGMNKE 420
           GCKVHEDIDLMKHAAA+YLALDA NAGKYVVLANGFAASGLWDNVAEIR MMKKSGMNKE
Sbjct: 361 GCKVHEDIDLMKHAAAHYLALDASNAGKYVVLANGFAASGLWDNVAEIRCMMKKSGMNKE 420

Query: 421 PGYSRIEIQREFHFFVKSDKSHEQAEEIYRTIHSITPILKDAGSIHELSEN 471
           PGYSRIEIQREFHFFVKSDKSH+QA EIYRTIHSITPILKDAGSI ELSEN
Sbjct: 421 PGYSRIEIQREFHFFVKSDKSHKQAVEIYRTIHSITPILKDAGSIPELSEN 471

BLAST of Csor.00g113810 vs. ExPASy TrEMBL
Match: A0A6J1BXR4 (pentatricopeptide repeat-containing protein At4g16470 isoform X2 OS=Momordica charantia OX=3673 GN=LOC111005684 PE=4 SV=1)

HSP 1 Score: 765 bits (1976), Expect = 2.60e-276
Identity = 377/471 (80.04%), Postives = 414/471 (87.90%), Query Frame = 0

Query: 1   MRLCGRPSSSGVIHLFTKSIVAGATATIRRRHKSEYANDRSQVKPHQKDSSSWDRTFRSL 60
           MRLC RPSS G IHLFTKSI+A  T TI RR  SEY     QV PHQKD+S WD+TFR L
Sbjct: 1   MRLCCRPSS-GAIHLFTKSILAATTLTIHRRKISEYCTASFQVNPHQKDASFWDKTFRGL 60

Query: 61  CITGRLSEAVALLCCMPFRFHSKTYCLLLQECIFRKEYMKGKRIHAQMVVVGHLPNEYLK 120
           C+TGRLSEAVALLCCM  +FHS+TYCLLLQECIFRKEYMKGKRIHAQ+VVVGHLP+EYL 
Sbjct: 61  CLTGRLSEAVALLCCMALQFHSRTYCLLLQECIFRKEYMKGKRIHAQIVVVGHLPSEYLT 120

Query: 121 TKLLILYAKLGDLETANVLHEKLLENSLVSWNALIAGYVQKGFGEVGLELYFKMRRTGLI 180
           TKLLILYAK GDL TA +LHEKLL  SLVSWNA+IAGYVQKG GEVGLE Y KMR++G++
Sbjct: 121 TKLLILYAKSGDLGTAYILHEKLLGKSLVSWNAMIAGYVQKGLGEVGLEFYLKMRQSGMV 180

Query: 181 PDQYTFASVFRACASLASLEHGKRAHGVLIKCRIGDNVVVSSALVDMYFKCSSISDGHKV 240
           PDQYTFASVF+ACA+LASLEHGKRAHGVLIKC+IGDNVVVSSALVDMYFKCSS+SDG +V
Sbjct: 181 PDQYTFASVFKACATLASLEHGKRAHGVLIKCQIGDNVVVSSALVDMYFKCSSLSDGRRV 240

Query: 241 FNKSTTRNVITWTALISGYGHHGRVSEVLESFNSMINEGYRPNYVTFLAVLTACGHVGFV 300
           FN S+ RNVITWTALI GY  HGRV EVLESFNSMINEGYRPN+VTFLAVL ACG  GFV
Sbjct: 241 FNISSNRNVITWTALIFGYAQHGRVFEVLESFNSMINEGYRPNHVTFLAVLAACGRGGFV 300

Query: 301 SEAWRYLSLMKTTYEIEPRGQHYAAMADLLARAGRLQEAYNFVVDAPCKEHAVIWGALVG 360
           SEA RY SLM T Y IEPRGQHYAAMADLLARAGRL+EAYNFV++APCKEH+VIWGALVG
Sbjct: 301 SEAQRYFSLMMTDYRIEPRGQHYAAMADLLARAGRLEEAYNFVLNAPCKEHSVIWGALVG 360

Query: 361 GCKVHEDIDLMKHAAANYLALDAGNAGKYVVLANGFAASGLWDNVAEIRGMMKKSGMNKE 420
           GCKVHEDIDLMK+AAANY ALDA N+GKYVVL+N +A SGLWDNVAE+R MMKK+GM K+
Sbjct: 361 GCKVHEDIDLMKYAAANYFALDAENSGKYVVLSNAYATSGLWDNVAEVRDMMKKTGMTKD 420

Query: 421 PGYSRIEIQREFHFFVKSDKSHEQAEEIYRTIHSITPILKDAGSIHELSEN 471
           PGYSRIEIQREF FF KSD+SH++ EEIYRTI+ +TPILKDAG I EL  N
Sbjct: 421 PGYSRIEIQREFRFFFKSDESHKETEEIYRTINRLTPILKDAGYIAELRGN 470

BLAST of Csor.00g113810 vs. ExPASy TrEMBL
Match: A0A6J1BVB9 (pentatricopeptide repeat-containing protein At4g16470 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111005684 PE=4 SV=1)

HSP 1 Score: 763 bits (1971), Expect = 1.68e-275
Identity = 378/474 (79.75%), Postives = 416/474 (87.76%), Query Frame = 0

Query: 1   MRLCGRPSSSGVIHLFTKSIVAGATATIRRRHKSEYANDRS---QVKPHQKDSSSWDRTF 60
           MRLC RPSS G IHLFTKSI+A  T TI RR  SEY   R+   QV PHQKD+S WD+TF
Sbjct: 1   MRLCCRPSS-GAIHLFTKSILAATTLTIHRRKISEYCTARNSSFQVNPHQKDASFWDKTF 60

Query: 61  RSLCITGRLSEAVALLCCMPFRFHSKTYCLLLQECIFRKEYMKGKRIHAQMVVVGHLPNE 120
           R LC+TGRLSEAVALLCCM  +FHS+TYCLLLQECIFRKEYMKGKRIHAQ+VVVGHLP+E
Sbjct: 61  RGLCLTGRLSEAVALLCCMALQFHSRTYCLLLQECIFRKEYMKGKRIHAQIVVVGHLPSE 120

Query: 121 YLKTKLLILYAKLGDLETANVLHEKLLENSLVSWNALIAGYVQKGFGEVGLELYFKMRRT 180
           YL TKLLILYAK GDL TA +LHEKLL  SLVSWNA+IAGYVQKG GEVGLE Y KMR++
Sbjct: 121 YLTTKLLILYAKSGDLGTAYILHEKLLGKSLVSWNAMIAGYVQKGLGEVGLEFYLKMRQS 180

Query: 181 GLIPDQYTFASVFRACASLASLEHGKRAHGVLIKCRIGDNVVVSSALVDMYFKCSSISDG 240
           G++PDQYTFASVF+ACA+LASLEHGKRAHGVLIKC+IGDNVVVSSALVDMYFKCSS+SDG
Sbjct: 181 GMVPDQYTFASVFKACATLASLEHGKRAHGVLIKCQIGDNVVVSSALVDMYFKCSSLSDG 240

Query: 241 HKVFNKSTTRNVITWTALISGYGHHGRVSEVLESFNSMINEGYRPNYVTFLAVLTACGHV 300
            +VFN S+ RNVITWTALI GY  HGRV EVLESFNSMINEGYRPN+VTFLAVL ACG  
Sbjct: 241 RRVFNISSNRNVITWTALIFGYAQHGRVFEVLESFNSMINEGYRPNHVTFLAVLAACGRG 300

Query: 301 GFVSEAWRYLSLMKTTYEIEPRGQHYAAMADLLARAGRLQEAYNFVVDAPCKEHAVIWGA 360
           GFVSEA RY SLM T Y IEPRGQHYAAMADLLARAGRL+EAYNFV++APCKEH+VIWGA
Sbjct: 301 GFVSEAQRYFSLMMTDYRIEPRGQHYAAMADLLARAGRLEEAYNFVLNAPCKEHSVIWGA 360

Query: 361 LVGGCKVHEDIDLMKHAAANYLALDAGNAGKYVVLANGFAASGLWDNVAEIRGMMKKSGM 420
           LVGGCKVHEDIDLMK+AAANY ALDA N+GKYVVL+N +A SGLWDNVAE+R MMKK+GM
Sbjct: 361 LVGGCKVHEDIDLMKYAAANYFALDAENSGKYVVLSNAYATSGLWDNVAEVRDMMKKTGM 420

Query: 421 NKEPGYSRIEIQREFHFFVKSDKSHEQAEEIYRTIHSITPILKDAGSIHELSEN 471
            K+PGYSRIEIQREF FF KSD+SH++ EEIYRTI+ +TPILKDAG I EL  N
Sbjct: 421 TKDPGYSRIEIQREFRFFFKSDESHKETEEIYRTINRLTPILKDAGYIAELRGN 473

BLAST of Csor.00g113810 vs. ExPASy TrEMBL
Match: A0A0A0KJS7 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G511640 PE=4 SV=1)

HSP 1 Score: 749 bits (1934), Expect = 1.17e-269
Identity = 369/470 (78.51%), Postives = 404/470 (85.96%), Query Frame = 0

Query: 6   RPSSSGVIHLFTKSIVAGATATIR--RRHKSEYANDRSQVKPHQKDSSSWDRTFRSLCIT 65
           R S S VIHL + SIV     T+   RRHKSE+A+   QVKPH KD+SSWD+T R LC+T
Sbjct: 10  RSSFSAVIHLISTSIVPTQLPTVHLPRRHKSEFASPSFQVKPHHKDTSSWDKTLRGLCLT 69

Query: 66  GRLSEAVALLCCMPFRFHSKTYCLLLQECIFRKEYMKGKRIHAQMVVVGHLPNEYLKTKL 125
           G+L+EAVALLCCM  +FHSKTYCLLLQECIFRKEYMKGKRIHAQMVVVG++PNEYL TKL
Sbjct: 70  GKLAEAVALLCCMALQFHSKTYCLLLQECIFRKEYMKGKRIHAQMVVVGYVPNEYLNTKL 129

Query: 126 LILYAKLGDLETANVLHEKLLENSLVSWNALIAGYVQKGFGEVGLELYFKMRRTGLIPDQ 185
           LILYAK GDLETA VLHE LLE SLVSWN+LIAGYVQKG  EVGLE Y KMR++GL+PDQ
Sbjct: 130 LILYAKSGDLETAYVLHEHLLEKSLVSWNSLIAGYVQKGLAEVGLEFYLKMRQSGLMPDQ 189

Query: 186 YTFASVFRACASLASLEHGKRAHGVLIKCRIGDNVVVSSALVDMYFKCSSISDGHKVFNK 245
           YTFASV RACASLASLEHGKRAHGVLIKC+IGDNVVVSSALVDMYFKCSS+SDGHK FNK
Sbjct: 190 YTFASVLRACASLASLEHGKRAHGVLIKCQIGDNVVVSSALVDMYFKCSSLSDGHKAFNK 249

Query: 246 STTRNVITWTALISGYGHHGRVSEVLESFNSMINEGYRPNYVTFLAVLTACGHVGFVSEA 305
           S+ RNVITWTALISGYG HGR+SEVLESF+SMIN+GYRPNYVTFLAVL AC   GFVSEA
Sbjct: 250 SSNRNVITWTALISGYGQHGRISEVLESFHSMINKGYRPNYVTFLAVLAACSRGGFVSEA 309

Query: 306 WRYLSLMKTTYEIEPRGQHYAAMADLLARAGRLQEAYNFVVDAPCKEHAVIWGALVGGCK 365
           W Y SLM  TY IEPRGQHYAAMADLLARAGRLQEAY+FV+DAPCKEH+V+WGALVG CK
Sbjct: 310 WNYFSLMTKTYGIEPRGQHYAAMADLLARAGRLQEAYDFVLDAPCKEHSVMWGALVGACK 369

Query: 366 VHEDIDLMKHAAANYLALDAGNAGKYVVLANGFAASGLWDNVAEIRGMMKKSGMNKEPGY 425
           VHED+DLMKH AA+Y  LD  N+GK VV +N FA SGLWDNV EIR MMKKSGM+K+PG 
Sbjct: 370 VHEDVDLMKHVAASYFELDPKNSGKLVVFSNAFATSGLWDNVEEIRAMMKKSGMSKDPGC 429

Query: 426 SRIEIQREFHFFVKSDKSHEQAEEIYRTIHSITPILKDAGSIHELSENSL 473
           SRIEIQREFH FVK DKSH + EEIYRTI  ITPILKDAG I EL E ++
Sbjct: 430 SRIEIQREFHIFVKGDKSHRETEEIYRTIDRITPILKDAGYIPELCEKTV 479

BLAST of Csor.00g113810 vs. TAIR 10
Match: AT4G16470.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 502.7 bits (1293), Expect = 3.3e-142
Identity = 247/443 (55.76%), Postives = 316/443 (71.33%), Query Frame = 0

Query: 19  SIVAGATATIRRRHKSEYANDRSQVKPHQKDSSSWDRTFRSLCITGRLSEAVALLCCMPF 78
           S+ +G   TI RR  +E    R QV+ +Q+ +   D+T + LC+TGRL EAV LL     
Sbjct: 46  SMFSGNATTILRRMLAEKRIGRFQVE-NQRKTEKLDKTLKGLCVTGRLKEAVGLLWSSGL 105

Query: 79  RFHSKTYCLLLQECIFRKEYMKGKRIHAQMVVVGHLPNEYLKTKLLILYAKLGDLETANV 138
           +   +TY +LLQEC  RKEY KGKRIHAQM VVG   NEYLK KLLILYA  GDL+TA +
Sbjct: 106 QVEPETYAVLLQECKQRKEYTKGKRIHAQMFVVGFALNEYLKVKLLILYALSGDLQTAGI 165

Query: 139 LHEKLLENSLVSWNALIAGYVQKGFGEVGLELYFKMRRTGLIPDQYTFASVFRACASLAS 198
           L   L    L+ WNA+I+GYVQKG  + GL +Y+ MR+  ++PDQYTFASVFRAC++L  
Sbjct: 166 LFRSLKIRDLIPWNAMISGYVQKGLEQEGLFIYYDMRQNRIVPDQYTFASVFRACSALDR 225

Query: 199 LEHGKRAHGVLIKCRIGDNVVVSSALVDMYFKCSSISDGHKVFNKSTTRNVITWTALISG 258
           LEHGKRAH V+IK  I  N++V SALVDMYFKCSS SDGH+VF++ +TRNVITWT+LISG
Sbjct: 226 LEHGKRAHAVMIKRCIKSNIIVDSALVDMYFKCSSFSDGHRVFDQLSTRNVITWTSLISG 285

Query: 259 YGHHGRVSEVLESFNSMINEGYRPNYVTFLAVLTACGHVGFVSEAWRYLSLMKTTYEIEP 318
           YG+HG+VSEVL+ F  M  EG RPN VTFL VLTAC H G V + W +   MK  Y IEP
Sbjct: 286 YGYHGKVSEVLKCFEKMKEEGCRPNPVTFLVVLTACNHGGLVDKGWEHFYSMKRDYGIEP 345

Query: 319 RGQHYAAMADLLARAGRLQEAYNFVVDAPCKEHAVIWGALVGGCKVHEDIDLMKHAAANY 378
            GQHYAAM D L RAGRLQEAY FV+ +PCKEH  +WG+L+G C++H ++ L++ AA  +
Sbjct: 346 EGQHYAAMVDTLGRAGRLQEAYEFVMKSPCKEHPPVWGSLLGACRIHGNVKLLELAATKF 405

Query: 379 LALDAGNAGKYVVLANGFAASGLWDNVAEIRGMMKKSGMNKEPGYSRIEIQREFHFFVKS 438
           L LD  N G YVV ANG+A+ GL +  +++R  M+ +G+ K+PGYS+IE+Q E H F+K 
Sbjct: 406 LELDPTNGGNYVVFANGYASCGLREAASKVRRKMENAGVKKDPGYSQIELQGEVHRFMKD 465

Query: 439 DKSHEQAEEIYRTIHSITPILKD 462
           D SH  +E+IY+ +H +T    D
Sbjct: 466 DTSHRLSEKIYKKVHEMTSFFMD 487

BLAST of Csor.00g113810 vs. TAIR 10
Match: AT3G12770.1 (mitochondrial editing factor 22 )

HSP 1 Score: 283.9 bits (725), Expect = 2.4e-76
Identity = 142/382 (37.17%), Postives = 221/382 (57.85%), Query Frame = 0

Query: 96  KEYMKGKRIHAQMVVVGHLPNEYLKTKLLILYAKLGDLETANVLHEKLLENSLVSWNALI 155
           ++  +G+ IHA +V +G      L   L  +YAK G + TA +L +K+   +L+ WNA+I
Sbjct: 236 QDLKQGRSIHASVVKMGLEIEPDLLISLNTMYAKCGQVATAKILFDKMKSPNLILWNAMI 295

Query: 156 AGYVQKGFGEVGLELYFKMRRTGLIPDQYTFASVFRACASLASLEHGKRAHGVLIKCRIG 215
           +GY + G+    ++++ +M    + PD  +  S   ACA + SLE  +  +  + +    
Sbjct: 296 SGYAKNGYAREAIDMFHEMINKDVRPDTISITSAISACAQVGSLEQARSMYEYVGRSDYR 355

Query: 216 DNVVVSSALVDMYFKCSSISDGHKVFNKSTTRNVITWTALISGYGHHGRVSEVLESFNSM 275
           D+V +SSAL+DM+ KC S+     VF+++  R+V+ W+A+I GYG HGR  E +  + +M
Sbjct: 356 DDVFISSALIDMFAKCGSVEGARLVFDRTLDRDVVVWSAMIVGYGLHGRAREAISLYRAM 415

Query: 276 INEGYRPNYVTFLAVLTACGHVGFVSEAWRYLSLMKTTYEIEPRGQHYAAMADLLARAGR 335
              G  PN VTFL +L AC H G V E W + + M   ++I P+ QHYA + DLL RAG 
Sbjct: 416 ERGGVHPNDVTFLGLLMACNHSGMVREGWWFFNRM-ADHKINPQQQHYACVIDLLGRAGH 475

Query: 336 LQEAYNFVVDAPCKEHAVIWGALVGGCKVHEDIDLMKHAAANYLALDAGNAGKYVVLANG 395
           L +AY  +   P +    +WGAL+  CK H  ++L ++AA    ++D  N G YV L+N 
Sbjct: 476 LDQAYEVIKCMPVQPGVTVWGALLSACKKHRHVELGEYAAQQLFSIDPSNTGHYVQLSNL 535

Query: 396 FAASGLWDNVAEIRGMMKKSGMNKEPGYSRIEIQREFHFFVKSDKSHEQAEEIYRTIHSI 455
           +AA+ LWD VAE+R  MK+ G+NK+ G S +E++     F   DKSH + EEI R +  I
Sbjct: 536 YAAARLWDRVAEVRVRMKEKGLNKDVGCSWVEVRGRLEAFRVGDKSHPRYEEIERQVEWI 595

Query: 456 TPILKDAG-------SIHELSE 471
              LK+ G       S+H+L++
Sbjct: 596 ESRLKEGGFVANKDASLHDLND 616

BLAST of Csor.00g113810 vs. TAIR 10
Match: AT3G23330.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 283.1 bits (723), Expect = 4.1e-76
Identity = 137/373 (36.73%), Postives = 219/373 (58.71%), Query Frame = 0

Query: 97  EYMKGKRIHAQMVVVGHLPNEYLKTKLLILYAKLGDLETANVLHEKLLENSLVSWNALIA 156
           + +KGK IH  ++  G   + Y+ + L+ +YAK   +E +  +  +L     +SWN+L+A
Sbjct: 257 DVIKGKEIHGYVIRKGIDSDVYIGSSLVDMYAKSARIEDSERVFSRLYCRDGISWNSLVA 316

Query: 157 GYVQKGFGEVGLELYFKMRRTGLIPDQYTFASVFRACASLASLEHGKRAHGVLIKCRIGD 216
           GYVQ G     L L+ +M    + P    F+SV  ACA LA+L  GK+ HG +++   G 
Sbjct: 317 GYVQNGRYNEALRLFRQMVTAKVKPGAVAFSSVIPACAHLATLHLGKQLHGYVLRGGFGS 376

Query: 217 NVVVSSALVDMYFKCSSISDGHKVFNKSTTRNVITWTALISGYGHHGRVSEVLESFNSMI 276
           N+ ++SALVDMY KC +I    K+F++    + ++WTA+I G+  HG   E +  F  M 
Sbjct: 377 NIFIASALVDMYSKCGNIKAARKIFDRMNVLDEVSWTAIIMGHALHGHGHEAVSLFEEMK 436

Query: 277 NEGYRPNYVTFLAVLTACGHVGFVSEAWRYLSLMKTTYEIEPRGQHYAAMADLLARAGRL 336
            +G +PN V F+AVLTAC HVG V EAW Y + M   Y +    +HYAA+ADLL RAG+L
Sbjct: 437 RQGVKPNQVAFVAVLTACSHVGLVDEAWGYFNSMTKVYGLNQELEHYAAVADLLGRAGKL 496

Query: 337 QEAYNFVVDAPCKEHAVIWGALVGGCKVHEDIDLMKHAAANYLALDAGNAGKYVVLANGF 396
           +EAYNF+     +    +W  L+  C VH++++L +  A     +D+ N G YV++ N +
Sbjct: 497 EEAYNFISKMCVEPTGSVWSTLLSSCSVHKNLELAEKVAEKIFTVDSENMGAYVLMCNMY 556

Query: 397 AASGLWDNVAEIRGMMKKSGMNKEPGYSRIEIQREFHFFVKSDKSHEQAEEIYRTIHSIT 456
           A++G W  +A++R  M+K G+ K+P  S IE++ + H FV  D+SH   ++I   + ++ 
Sbjct: 557 ASNGRWKEMAKLRLRMRKKGLRKKPACSWIEMKNKTHGFVSGDRSHPSMDKINEFLKAVM 616

Query: 457 PILKDAGSIHELS 470
             ++  G + + S
Sbjct: 617 EQMEKEGYVADTS 629

BLAST of Csor.00g113810 vs. TAIR 10
Match: AT2G03880.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 282.0 bits (720), Expect = 9.1e-76
Identity = 141/384 (36.72%), Postives = 226/384 (58.85%), Query Frame = 0

Query: 84  TYCLLLQECIFRKEYMKGKRIHAQMVVVGHLPNEYLKTKLLILYAKLGDLETANVLHEKL 143
           TY  +L+ C    +    + +H  ++  G   + ++++ L+ ++AKLG+ E A  + +++
Sbjct: 164 TYSSVLRSCNGMSDV---RMLHCGIIKEGLESDVFVRSALIDVFAKLGEPEDALSVFDEM 223

Query: 144 LENSLVSWNALIAGYVQKGFGEVGLELYFKMRRTGLIPDQYTFASVFRACASLASLEHGK 203
           +    + WN++I G+ Q    +V LEL+ +M+R G I +Q T  SV RAC  LA LE G 
Sbjct: 224 VTGDAIVWNSIIGGFAQNSRSDVALELFKRMKRAGFIAEQATLTSVLRACTGLALLELGM 283

Query: 204 RAHGVLIKCRIGDNVVVSSALVDMYFKCSSISDGHKVFNKSTTRNVITWTALISGYGHHG 263
           +AH  ++K     ++++++ALVDMY KC S+ D  +VFN+   R+VITW+ +ISG   +G
Sbjct: 284 QAHVHIVK--YDQDLILNNALVDMYCKCGSLEDALRVFNQMKERDVITWSTMISGLAQNG 343

Query: 264 RVSEVLESFNSMINEGYRPNYVTFLAVLTACGHVGFVSEAWRYLSLMKTTYEIEPRGQHY 323
              E L+ F  M + G +PNY+T + VL AC H G + + W Y   MK  Y I+P  +HY
Sbjct: 344 YSQEALKLFERMKSSGTKPNYITIVGVLFACSHAGLLEDGWYYFRSMKKLYGIDPVREHY 403

Query: 324 AAMADLLARAGRLQEAYNFVVDAPCKEHAVIWGALVGGCKVHEDIDLMKHAAANYLALDA 383
             M DLL +AG+L +A   + +  C+  AV W  L+G C+V  ++ L ++AA   +ALD 
Sbjct: 404 GCMIDLLGKAGKLDDAVKLLNEMECEPDAVTWRTLLGACRVQRNMVLAEYAAKKVIALDP 463

Query: 384 GNAGKYVVLANGFAASGLWDNVAEIRGMMKKSGMNKEPGYSRIEIQREFHFFVKSDKSHE 443
            +AG Y +L+N +A S  WD+V EIR  M+  G+ KEPG S IE+ ++ H F+  D SH 
Sbjct: 464 EDAGTYTLLSNIYANSQKWDSVEEIRTRMRDRGIKKEPGCSWIEVNKQIHAFIIGDNSHP 523

Query: 444 QAEEIYRTIHSITPILKDAGSIHE 468
           Q  E+ + ++ +   L   G + E
Sbjct: 524 QIVEVSKKLNQLIHRLTGIGYVPE 542

BLAST of Csor.00g113810 vs. TAIR 10
Match: AT2G13600.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 280.0 bits (715), Expect = 3.5e-75
Identity = 141/370 (38.11%), Postives = 214/370 (57.84%), Query Frame = 0

Query: 98  YMKGKRIHAQMVVVGHLP--NEYLKTKLLILYAKLGDLETANVLHEKLLENSLVSWNALI 157
           Y K  RI     +   +P  N   +T ++  YA     + A ++  K+ E ++VSWNALI
Sbjct: 299 YAKCSRIKEARFIFDSMPIRNVIAETSMISGYAMAASTKAARLMFTKMAERNVVSWNALI 358

Query: 158 AGYVQKGFGEVGLELYFKMRRTGLIPDQYTFASVFRACASLASLEHGKRA------HGVL 217
           AGY Q G  E  L L+  ++R  + P  Y+FA++ +ACA LA L  G +A      HG  
Sbjct: 359 AGYTQNGENEEALSLFCLLKRESVCPTHYSFANILKACADLAELHLGMQAHVHVLKHGFK 418

Query: 218 IKCRIGDNVVVSSALVDMYFKCSSISDGHKVFNKSTTRNVITWTALISGYGHHGRVSEVL 277
            +    D++ V ++L+DMY KC  + +G+ VF K   R+ ++W A+I G+  +G  +E L
Sbjct: 419 FQSGEEDDIFVGNSLIDMYVKCGCVEEGYLVFRKMMERDCVSWNAMIIGFAQNGYGNEAL 478

Query: 278 ESFNSMINEGYRPNYVTFLAVLTACGHVGFVSEAWRYLSLMKTTYEIEPRGQHYAAMADL 337
           E F  M+  G +P+++T + VL+ACGH GFV E   Y S M   + + P   HY  M DL
Sbjct: 479 ELFREMLESGEKPDHITMIGVLSACGHAGFVEEGRHYFSSMTRDFGVAPLRDHYTCMVDL 538

Query: 338 LARAGRLQEAYNFVVDAPCKEHAVIWGALVGGCKVHEDIDLMKHAAANYLALDAGNAGKY 397
           L RAG L+EA + + + P +  +VIWG+L+  CKVH +I L K+ A   L ++  N+G Y
Sbjct: 539 LGRAGFLEEAKSMIEEMPMQPDSVIWGSLLAACKVHRNITLGKYVAEKLLEVEPSNSGPY 598

Query: 398 VVLANGFAASGLWDNVAEIRGMMKKSGMNKEPGYSRIEIQREFHFFVKSDKSHEQAEEIY 457
           V+L+N +A  G W++V  +R  M+K G+ K+PG S I+IQ   H F+  DKSH + ++  
Sbjct: 599 VLLSNMYAELGKWEDVMNVRKSMRKEGVTKQPGCSWIKIQGHDHVFMVKDKSHPRKKQ-- 658

Query: 458 RTIHSITPIL 460
             IHS+  IL
Sbjct: 659 --IHSLLDIL 664

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
O234914.6e-14155.76Pentatricopeptide repeat-containing protein At4g16470 OS=Arabidopsis thaliana OX... [more]
Q9LTV83.4e-7537.17Pentatricopeptide repeat-containing protein At3g12770 OS=Arabidopsis thaliana OX... [more]
Q9LW635.8e-7536.73Putative pentatricopeptide repeat-containing protein At3g23330 OS=Arabidopsis th... [more]
Q9SI531.3e-7436.72Pentatricopeptide repeat-containing protein At2g03880, mitochondrial OS=Arabidop... [more]
Q9SIT74.9e-7438.11Pentatricopeptide repeat-containing protein At2g13600 OS=Arabidopsis thaliana OX... [more]
Match NameE-valueIdentityDescription
KAG6594700.10.0100.00Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma sub... [more]
KAG7026668.10.099.79Scarecrow-like protein 13, partial [Cucurbita argyrosperma subsp. argyrosperma][more]
XP_022926503.10.097.67pentatricopeptide repeat-containing protein At4g16470 [Cucurbita moschata][more]
XP_023518629.10.097.24pentatricopeptide repeat-containing protein At4g16470 [Cucurbita pepo subsp. pep... [more]
XP_023003064.10.095.54pentatricopeptide repeat-containing protein At4g16470 [Cucurbita maxima][more]
Match NameE-valueIdentityDescription
A0A6J1EI860.097.67pentatricopeptide repeat-containing protein At4g16470 OS=Cucurbita moschata OX=3... [more]
A0A6J1KS920.095.54pentatricopeptide repeat-containing protein At4g16470 OS=Cucurbita maxima OX=366... [more]
A0A6J1BXR42.60e-27680.04pentatricopeptide repeat-containing protein At4g16470 isoform X2 OS=Momordica ch... [more]
A0A6J1BVB91.68e-27579.75pentatricopeptide repeat-containing protein At4g16470 isoform X1 OS=Momordica ch... [more]
A0A0A0KJS71.17e-26978.51Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G511640 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT4G16470.13.3e-14255.76Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT3G12770.12.4e-7637.17mitochondrial editing factor 22 [more]
AT3G23330.14.1e-7636.73Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT2G03880.19.1e-7636.72Pentatricopeptide repeat (PPR) superfamily protein [more]
AT2G13600.13.5e-7538.11Pentatricopeptide repeat (PPR) superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Silver-seed gourd (sororia) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 250..283
e-value: 3.8E-7
score: 27.9
coord: 149..182
e-value: 4.8E-6
score: 24.4
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 247..294
e-value: 3.5E-10
score: 39.9
coord: 148..194
e-value: 2.3E-9
score: 37.3
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 389..416
e-value: 1.2
score: 9.5
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 147..181
score: 10.873667
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 248..282
score: 12.353442
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 203..276
e-value: 6.7E-6
score: 27.7
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 287..451
e-value: 1.3E-10
score: 42.9
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 83..202
e-value: 3.7E-17
score: 64.2
NoneNo IPR availablePANTHERPTHR47929:SF11BNACNNG15330D PROTEINcoord: 81..196
coord: 178..460
NoneNo IPR availablePANTHERPTHR47929FAMILY NOT NAMEDcoord: 81..196
NoneNo IPR availablePANTHERPTHR47929FAMILY NOT NAMEDcoord: 178..460

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Csor.00g113810.m01Csor.00g113810.m01mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0009157 deoxyribonucleoside monophosphate biosynthetic process
biological_process GO:0071897 DNA biosynthetic process
biological_process GO:0016310 phosphorylation
molecular_function GO:0005524 ATP binding
molecular_function GO:0005515 protein binding
molecular_function GO:0004797 thymidine kinase activity