CsGy6G002410.1 (mRNA) Cucumber (Gy14) v2.1

Overview
Name: CsGy6G002410.1
Type: mRNA
Organism: Cucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
Description: Pentatricopeptide repeat-containing protein
Location: Gy14Chr6: 1759087 .. 1761445 (+)
Sequence length: 2359
RNA-Seq Expression: CsGy6G002410.1
Synteny: CsGy6G002410.1
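
The sequence length can be cross-checked against the location, assuming the coordinates are 1-based and inclusive at both ends (the common GFF3-style convention; the record itself does not state this). A minimal sketch, where `feature_length` is a hypothetical helper name:

```python
def feature_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates copied from the Location field (Gy14Chr6, + strand).
length = feature_length(1759087, 1761445)
print(length)  # 2359
```

Under that assumption the result agrees with the listed sequence length of 2359.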
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ACTCAAACCAGTTCAACCCCAAACACACCCCCGCGCCGCTCATCTCTTGCCCAAACCACCAGATCACCAGACCAGCCCCTCTTCTCGCTGCCAACCAACATCCGTCGCGCCGCTCATCTTCCCGACTCCGGTACCCACGTATTCTTCTTAAGATATACATTAACATGATCGATAGATAGCAACTACTTGATTTTTCATTTCCTTGTGCAATAGGTCAAATTCCCACTTTTTCCCTAAATTCTTCCCAACTGTAACTAACAATTCACTCTTGGAATTTCCAAAACCTAACTTGTGAAAATCTACCAAGTCACTGGCATCACTCTTAATCTTATGAAAATTTAAATCAATTCTTAAAATTCTTGATATGGTTTTGATTTTGAGTGCTTGTGATATCTCTGCCTGAACTTCCTTTCGCTGATAATTATTCCATACCTTGCCATTGAATCTTGTCTTTCCATTGGGGCAGTTAAATTTGGAGGTTGTAGGGAAAAAAAGAATTCAGCCTCGAGTATAGTTAAATTCGAAAGTTTGTGAAATTATGTGGCAACACCTTCTTCGGCCCTGTAATTATAGGACTATTGAGACTGTTGCTGCTGCTCATGTTGCCCGCAAATCCCCATTGCTTCGTAATTTAATCTCCTCCTCATCCTCTCTTTACCAACCACACTTAAATGTGCACAATGAATCAAAATTTTTGATAACCAATGTAAAACATGAGCAGTGTGAAGATCAACCAGATTTCTCAATTGGGTCTCCATGTAGGGTACAGAAACTCATTGCATCCCAATCTGATCCTCTTCTTGCCAAGGAAATTTTCGACTATGCTTGTCGTCAACCCCATTTTCGACCATCATCTTCCTCTCTCCTCGTTCTTATCCTCAAGCTAGGCCGTTCCAAATACTTCTCTCTGATTGATGATCTTCTTCTTAGCTTCAAGTCTAGACGTTACCCTGTCACTCCAACAGCCTTCTCCTACATAATCAAAATCTATGGTGAAGCTGATTTACCAGATAAAGCTCTTAAAGTCTTTTATACTATGATCGACTTTGGGTGTACGCCTTCTTCCAAACAATTGAACCGCATACTGGAAATTTTGGTTTCTCATCGTAACTTCATTCGACCAGCTTTTGATCTTTTCAAGAATGCCCGTCATCATGGAGTGTTGCCCAACACCAAATCTTATAACATTCTTATTCGTGCATTCTGTTGGAATGGAAATATTAGTATTGCCTACACACTGTTCAACAAGATGTTCGAACGAAATGTCATTCCAGATGTCGAGACGTATCGGACATTAATGCAGGGTCTTTGCAGGAAGAATCAAGTGAATGGTGCTGTTGACTTGCTTGAAGATATGTTGAACAAAGGATACATTCCAGACACATTGAGCTATGCCACTTTGTTAAATAGTTTATGTAGGAAGAAAAAGCTAAGGGAAGCTTATAAACTTCTCTGTAGAATGAAGGTTAAAGGTTGCAATCCTGATATTGCTCATTACAATACAGTTATAATGGGATTTTGCAGAGAAGGGCGTGCTCTTGATGCTTGTAAGATTCTGGAGGATATGCAGTCAAATGGTTGTTTGCCTAATTTAGTATCCTACGAGAGTTTGACTAATGGATTATGTGATCAAGGAATGTTTGAATTGGCAAAGGGTTATGTTGAGGAGATGACATTAAAGGGTTTTTACCCACATTTCTCTGTCATTCATGCTTTGGTTAAGGGTTTCCATAGCATTGGCAGAATCCACGAGTCGTGTAGTGTTCTTGAAGACATGCTAAAGCGTGGGAAAGCCCCTCATTCCGATACTTGGGAGATTATTATATCTGGGATTTGTGAAGTTGAGGACACTGCCAAATTTTGTGAAGTTTGGGAGAAGATTTTGAAGAAAGATGTAAGAAGAGACACTAGAATAGTTGAAGCAGGCACTGGTTTGGGTGAGTATTTAATTAGGAAGCTACAAGCTTCCATATCACGAAGGATTTGAATATTTCTTAGTT
AAACTAGTTGTAAGAGTATTCAAATTTTTAGTCTCTTGGTTTGAGATATACGCCATAATCAATTCATTTGTAGCTCAAGTTGCACAAACGGCTTGAATTTTTATTGTACATGCTTGATTGAGTTTTTGGATATCTTCCATCCTATTTACAGCACAAACATGAACATCATTTTGTTTTTGAAATTTATGCTTTTTTCCCCTCCACATTTGGATTAGTCAAAATCTAAAAACAAAAACAAACTTTGAGAAGCTACTTTCTTCAAAATTTTGCTTGGGTTCTTAAAACATTGATAAAAGGTGGACTAAATAAAGTTAAAAAAATGAGATGTGAAAATAAGGTAGACTTCAATCTAATAAGAG

mRNA sequence

ACTCAAACCAGTTCAACCCCAAACACACCCCCGCGCCGCTCATCTCTTGCCCAAACCACCAGATCACCAGACCAGCCCCTCTTCTCGCTGCCAACCAACATCCGTCGCGCCGCTCATCTTCCCGACTCCGGTACCCACGTATTCTTCTTAAGATATACATTAACATGATCGATAGATAGCAACTACTTGATTTTTCATTTCCTTGTGCAATAGGTCAAATTCCCACTTTTTCCCTAAATTCTTCCCAACTGTAACTAACAATTCACTCTTGGAATTTCCAAAACCTAACTTGTGAAAATCTACCAAGTCACTGGCATCACTCTTAATCTTATGAAAATTTAAATCAATTCTTAAAATTCTTGATATGGTTTTGATTTTGAGTGCTTGTGATATCTCTGCCTGAACTTCCTTTCGCTGATAATTATTCCATACCTTGCCATTGAATCTTGTCTTTCCATTGGGGCAGTTAAATTTGGAGGTTGTAGGGAAAAAAAGAATTCAGCCTCGAGTATAGTTAAATTCGAAAGTTTGTGAAATTATGTGGCAACACCTTCTTCGGCCCTGTAATTATAGGACTATTGAGACTGTTGCTGCTGCTCATGTTGCCCGCAAATCCCCATTGCTTCGTAATTTAATCTCCTCCTCATCCTCTCTTTACCAACCACACTTAAATGTGCACAATGAATCAAAATTTTTGATAACCAATGTAAAACATGAGCAGTGTGAAGATCAACCAGATTTCTCAATTGGGTCTCCATGTAGGGTACAGAAACTCATTGCATCCCAATCTGATCCTCTTCTTGCCAAGGAAATTTTCGACTATGCTTGTCGTCAACCCCATTTTCGACCATCATCTTCCTCTCTCCTCGTTCTTATCCTCAAGCTAGGCCGTTCCAAATACTTCTCTCTGATTGATGATCTTCTTCTTAGCTTCAAGTCTAGACGTTACCCTGTCACTCCAACAGCCTTCTCCTACATAATCAAAATCTATGGTGAAGCTGATTTACCAGATAAAGCTCTTAAAGTCTTTTATACTATGATCGACTTTGGGTGTACGCCTTCTTCCAAACAATTGAACCGCATACTGGAAATTTTGGTTTCTCATCGTAACTTCATTCGACCAGCTTTTGATCTTTTCAAGAATGCCCGTCATCATGGAGTGTTGCCCAACACCAAATCTTATAACATTCTTATTCGTGCATTCTGTTGGAATGGAAATATTAGTATTGCCTACACACTGTTCAACAAGATGTTCGAACGAAATGTCATTCCAGATGTCGAGACGTATCGGACATTAATGCAGGGTCTTTGCAGGAAGAATCAAGTGAATGGTGCTGTTGACTTGCTTGAAGATATGTTGAACAAAGGATACATTCCAGACACATTGAGCTATGCCACTTTGTTAAATAGTTTATGTAGGAAGAAAAAGCTAAGGGAAGCTTATAAACTTCTCTGTAGAATGAAGGTTAAAGGTTGCAATCCTGATATTGCTCATTACAATACAGTTATAATGGGATTTTGCAGAGAAGGGCGTGCTCTTGATGCTTGTAAGATTCTGGAGGATATGCAGTCAAATGGTTGTTTGCCTAATTTAGTATCCTACGAGAGTTTGACTAATGGATTATGTGATCAAGGAATGTTTGAATTGGCAAAGGGTTATGTTGAGGAGATGACATTAAAGGGTTTTTACCCACATTTCTCTGTCATTCATGCTTTGGTTAAGGGTTTCCATAGCATTGGCAGAATCCACGAGTCGTGTAGTGTTCTTGAAGACATGCTAAAGCGTGGGAAAGCCCCTCATTCCGATACTTGGGAGATTATTATATCTGGGATTTGTGAAGTTGAGGACACTGCCAAATTTTGTGAAGTTTGGGAGAAGATTTTGAAGAAAGATGTAAGAAGAGACACTAGAATAGTTGAAGCAGGCACTGGTTTGGGTGAGTATTTAATTAGGAAGCTACAAGCTTCCATATCACGAAGGATTTGAATATTTCTTAGTT
AAACTAGTTGTAAGAGTATTCAAATTTTTAGTCTCTTGGTTTGAGATATACGCCATAATCAATTCATTTGTAGCTCAAGTTGCACAAACGGCTTGAATTTTTATTGTACATGCTTGATTGAGTTTTTGGATATCTTCCATCCTATTTACAGCACAAACATGAACATCATTTTGTTTTTGAAATTTATGCTTTTTTCCCCTCCACATTTGGATTAGTCAAAATCTAAAAACAAAAACAAACTTTGAGAAGCTACTTTCTTCAAAATTTTGCTTGGGTTCTTAAAACATTGATAAAAGGTGGACTAAATAAAGTTAAAAAAATGAGATGTGAAAATAAGGTAGACTTCAATCTAATAAGAG

Coding sequence (CDS)

ATGTGGCAACACCTTCTTCGGCCCTGTAATTATAGGACTATTGAGACTGTTGCTGCTGCTCATGTTGCCCGCAAATCCCCATTGCTTCGTAATTTAATCTCCTCCTCATCCTCTCTTTACCAACCACACTTAAATGTGCACAATGAATCAAAATTTTTGATAACCAATGTAAAACATGAGCAGTGTGAAGATCAACCAGATTTCTCAATTGGGTCTCCATGTAGGGTACAGAAACTCATTGCATCCCAATCTGATCCTCTTCTTGCCAAGGAAATTTTCGACTATGCTTGTCGTCAACCCCATTTTCGACCATCATCTTCCTCTCTCCTCGTTCTTATCCTCAAGCTAGGCCGTTCCAAATACTTCTCTCTGATTGATGATCTTCTTCTTAGCTTCAAGTCTAGACGTTACCCTGTCACTCCAACAGCCTTCTCCTACATAATCAAAATCTATGGTGAAGCTGATTTACCAGATAAAGCTCTTAAAGTCTTTTATACTATGATCGACTTTGGGTGTACGCCTTCTTCCAAACAATTGAACCGCATACTGGAAATTTTGGTTTCTCATCGTAACTTCATTCGACCAGCTTTTGATCTTTTCAAGAATGCCCGTCATCATGGAGTGTTGCCCAACACCAAATCTTATAACATTCTTATTCGTGCATTCTGTTGGAATGGAAATATTAGTATTGCCTACACACTGTTCAACAAGATGTTCGAACGAAATGTCATTCCAGATGTCGAGACGTATCGGACATTAATGCAGGGTCTTTGCAGGAAGAATCAAGTGAATGGTGCTGTTGACTTGCTTGAAGATATGTTGAACAAAGGATACATTCCAGACACATTGAGCTATGCCACTTTGTTAAATAGTTTATGTAGGAAGAAAAAGCTAAGGGAAGCTTATAAACTTCTCTGTAGAATGAAGGTTAAAGGTTGCAATCCTGATATTGCTCATTACAATACAGTTATAATGGGATTTTGCAGAGAAGGGCGTGCTCTTGATGCTTGTAAGATTCTGGAGGATATGCAGTCAAATGGTTGTTTGCCTAATTTAGTATCCTACGAGAGTTTGACTAATGGATTATGTGATCAAGGAATGTTTGAATTGGCAAAGGGTTATGTTGAGGAGATGACATTAAAGGGTTTTTACCCACATTTCTCTGTCATTCATGCTTTGGTTAAGGGTTTCCATAGCATTGGCAGAATCCACGAGTCGTGTAGTGTTCTTGAAGACATGCTAAAGCGTGGGAAAGCCCCTCATTCCGATACTTGGGAGATTATTATATCTGGGATTTGTGAAGTTGAGGACACTGCCAAATTTTGTGAAGTTTGGGAGAAGATTTTGAAGAAAGATGTAAGAAGAGACACTAGAATAGTTGAAGCAGGCACTGGTTTGGGTGAGTATTTAATTAGGAAGCTACAAGCTTCCATATCACGAAGGATTTGA

Protein sequence

MWQHLLRPCNYRTIETVAAAHVARKSPLLRNLISSSSSLYQPHLNVHNESKFLITNVKHEQCEDQPDFSIGSPCRVQKLIASQSDPLLAKEIFDYACRQPHFRPSSSSLLVLILKLGRSKYFSLIDDLLLSFKSRRYPVTPTAFSYIIKIYGEADLPDKALKVFYTMIDFGCTPSSKQLNRILEILVSHRNFIRPAFDLFKNARHHGVLPNTKSYNILIRAFCWNGNISIAYTLFNKMFERNVIPDVETYRTLMQGLCRKNQVNGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKLREAYKLLCRMKVKGCNPDIAHYNTVIMGFCREGRALDACKILEDMQSNGCLPNLVSYESLTNGLCDQGMFELAKGYVEEMTLKGFYPHFSVIHALVKGFHSIGRIHESCSVLEDMLKRGKAPHSDTWEIIISGICEVEDTAKFCEVWEKILKKDVRRDTRIVEAGTGLGEYLIRKLQASISRRI*
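
The relation between the coding sequence and the protein sequence can be spot-checked by translating the first and last codons of the CDS. The sketch below assumes the standard nuclear genetic code (the record does not name a translation table); `translate`, `cds_start`, and `cds_end` are illustrative names, and the two fragments are copied from the ends of the CDS above.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumed here).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


cds_start = "ATGTGGCAACACCTTCTTCGGCCCTGTAAT"  # first 30 nt of the CDS
cds_end = "TCACGAAGGATTTGA"                   # last 15 nt of the CDS

print(translate(cds_start))  # MWQHLLRPCN
print(translate(cds_end))    # SRRI*
```

Both fragments reproduce the corresponding ends of the protein sequence, including the terminal `*` for the TGA stop codon.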
Homology
BLAST of CsGy6G002410.1 vs. ExPASy Swiss-Prot
Match: Q8LDU5 (Pentatricopeptide repeat-containing protein At4g01400, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At4g01400 PE=2 SV=2)

HSP 1 Score: 547.7 bits (1410), Expect = 1.3e-154
Identity = 262/447 (58.61%), Positives = 338/447 (75.62%), Query Frame = 0

Query: 29  LRNLISSSSSLYQPHLNVHNESKFLITNVKHEQCEDQPDFSIGSPCRVQKLIASQSDPLL 88
           L + +S+SS       + H   K +++N         P   IGSP RVQKLIASQSDPLL
Sbjct: 16  LTSPLSTSSRFLFYSSSEHEARKPIVSN---------PKSPIGSPTRVQKLIASQSDPLL 75

Query: 89  AKEIFDYACRQPHFRPSSSSLLVLILKLGRSKYFSLIDDLLLSFKSRRYPVTPTAFSYII 148
           AKEIFDYA +QP+FR S SS L+LILKLGR +YF+LIDD+L   +S  YP+T   F+Y+I
Sbjct: 76  AKEIFDYASQQPNFRHSRSSHLILILKLGRGRYFNLIDDVLAKHRSSGYPLTGEIFTYLI 135

Query: 149 KIYGEADLPDKALKVFYTMIDFGCTPSSKQLNRILEILVSHRNFIRPAFDLFKNARHHGV 208
           K+Y EA LP+K L  FY M++F  TP  K LNRIL++LVSHR +++ AF+LFK++R HGV
Sbjct: 136 KVYAEAKLPEKVLSTFYKMLEFNFTPQPKHLNRILDVLVSHRGYLQKAFELFKSSRLHGV 195

Query: 209 LPNTKSYNILIRAFCWNGNISIAYTLFNKMFERNVIPDVETYRTLMQGLCRKNQVNGAVD 268
           +PNT+SYN+L++AFC N ++SIAY LF KM ER+V+PDV++Y+ L+QG CRK QVNGA++
Sbjct: 196 MPNTRSYNLLMQAFCLNDDLSIAYQLFGKMLERDVVPDVDSYKILIQGFCRKGQVNGAME 255

Query: 269 LLEDMLNKGYIPDTLSYATLLNSLCRKKKLREAYKLLCRMKVKGCNPDIAHYNTVIMGFC 328
           LL+DMLNKG++PD LSY TLLNSLCRK +LREAYKLLCRMK+KGCNPD+ HYNT+I+GFC
Sbjct: 256 LLDDMLNKGFVPDRLSYTTLLNSLCRKTQLREAYKLLCRMKLKGCNPDLVHYNTMILGFC 315

Query: 329 REGRALDACKILEDMQSNGCLPNLVSYESLTNGLCDQGMFELAKGYVEEMTLKGFYPHFS 388
           RE RA+DA K+L+DM SNGC PN VSY +L  GLCDQGMF+  K Y+EEM  KGF PHFS
Sbjct: 316 REDRAMDARKVLDDMLSNGCSPNSVSYRTLIGGLCDQGMFDEGKKYLEEMISKGFSPHFS 375

Query: 389 VIHALVKGFHSIGRIHESCSVLEDMLKRGKAPHSDTWEIIISGICEVEDTAKFCEVWEKI 448
           V + LVKGF S G++ E+C V+E ++K G+  HSDTWE++I  IC  +++ K     E  
Sbjct: 376 VSNCLVKGFCSFGKVEEACDVVEVVMKNGETLHSDTWEMVIPLICNEDESEKIKLFLEDA 435

Query: 449 LKKDVRRDTRIVEAGTGLGEYLIRKLQ 476
           +K+++  DTRIV+ G GLG YL  KLQ
Sbjct: 436 VKEEITGDTRIVDVGIGLGSYLSSKLQ 453
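
The percentages in each HSP header are the count of identical (or positively scoring) aligned positions divided by the alignment length. A minimal sketch using the figures from the first Swiss-Prot hit; `percent` is a hypothetical helper, and rounding to two decimals is assumed from the values shown:

```python
def percent(matches: int, alignment_length: int) -> str:
    """Format matches / alignment_length as a percentage with two decimals."""
    if alignment_length <= 0:
        raise ValueError("alignment length must be positive")
    return f"{100 * matches / alignment_length:.2f}%"


# Values from the HSP header of the Q8LDU5 hit: 262 identical and
# 338 positive positions over an alignment of 447 columns.
print(percent(262, 447))  # 58.61%
print(percent(338, 447))  # 75.62%
```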

BLAST of CsGy6G002410.1 vs. ExPASy Swiss-Prot
Match: Q9FNL2 (Pentatricopeptide repeat-containing protein At5g46100 OS=Arabidopsis thaliana OX=3702 GN=At5g46100 PE=2 SV=1)

HSP 1 Score: 217.2 bits (552), Expect = 4.0e-55
Identity = 120/372 (32.26%), Positives = 197/372 (52.96%), Query Frame = 0

Query: 72  SPCRVQKLIASQSDPLLAKEIFDYACRQ--PHFRPSSSSLLVLILKLGRSKYFSLIDDLL 131
           +P +V KL+ ++ D   +  +FD A  +    +    SS   ++L+L  +  F   +DL+
Sbjct: 15  TPSQVIKLMRAEKDVEKSMAVFDSATAEYANGYVHDQSSFGYMVLRLVSANKFKAAEDLI 74

Query: 132 LSFKSRRYPVTPTAFSYIIKIYGEADLPDKALKVFYTMIDFGCTPSSKQLNRILEILVSH 191
           +  K     V+      I + YG    P  +L+VF+ M DF C PS K    +L ILV  
Sbjct: 75  VRMKIENCVVSEDILLSICRGYGRVHRPFDSLRVFHKMKDFDCDPSQKAYVTVLAILV-E 134

Query: 192 RNFIRPAFDLFKNARHHGVLPNTKSYNILIRAFCWN-GNISIAYTLFNKMFERNVIPDVE 251
            N +  AF  +KN R  G+ P   S N+LI+A C N G +     +F +M +R   PD  
Sbjct: 135 ENQLNLAFKFYKNMREIGLPPTVASLNVLIKALCRNDGTVDAGLKIFLEMPKRGCDPDSY 194

Query: 252 TYRTLMQGLCRKNQVNGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKLREAYKLLCRM 311
           TY TL+ GLCR  +++ A  L  +M+ K   P  ++Y +L+N LC  K + EA + L  M
Sbjct: 195 TYGTLISGLCRFGRIDEAKKLFTEMVEKDCAPTVVTYTSLINGLCGSKNVDEAMRYLEEM 254

Query: 312 KVKGCNPDIAHYNTVIMGFCREGRALDACKILEDMQSNGCLPNLVSYESLTNGLCDQGMF 371
           K KG  P++  Y++++ G C++GR+L A ++ E M + GC PN+V+Y +L  GLC +   
Sbjct: 255 KSKGIEPNVFTYSSLMDGLCKDGRSLQAMELFEMMMARGCRPNMVTYTTLITGLCKEQKI 314

Query: 372 ELAKGYVEEMTLKGFYPHFSVIHALVKGFHSIGRIHESCSVLEDMLKRGKAPHSDTWEI- 431
           + A   ++ M L+G  P   +   ++ GF +I +  E+ + L++M+  G  P+  TW I 
Sbjct: 315 QEAVELLDRMNLQGLKPDAGLYGKVISGFCAISKFREAANFLDEMILGGITPNRLTWNIH 374

Query: 432 ------IISGIC 434
                 ++ G+C
Sbjct: 375 VKTSNEVVRGLC 385

BLAST of CsGy6G002410.1 vs. ExPASy Swiss-Prot
Match: O49436 (Pentatricopeptide repeat-containing protein At4g20090 OS=Arabidopsis thaliana OX=3702 GN=EMB1025 PE=3 SV=1)

HSP 1 Score: 197.2 bits (500), Expect = 4.2e-49
Identity = 110/366 (30.05%), Positives = 195/366 (53.28%), Query Frame = 0

Query: 72  SPCRVQKLIASQSDPLLAKEIFDYACRQPHFRPSSSSLLVLILKLGRSKYFSLIDDLLLS 131
           SP    +++ +  +  +++++F  A +   F+   S+L  +I     S  F  ++ LL  
Sbjct: 43  SPNPSMEVVENPLEAPISEKMFKSAPKMGSFKLGDSTLSSMIESYANSGDFDSVEKLLSR 102

Query: 132 FKSRRYPVTPTAFSYIIKIYGEADLPDKALKVFYTMID-FGCTPSSKQLNRILEILVSHR 191
            +     +   +F  + + YG+A LPDKA+ +F+ M+D F C  S K  N +L ++++  
Sbjct: 103 IRLENRVIIERSFIVVFRAYGKAHLPDKAVDLFHRMVDEFRCKRSVKSFNSVLNVIINEG 162

Query: 192 NFIR--PAFDLFKNAR-HHGVLPNTKSYNILIRAFCWNGNISIAYTLFNKMFERNVIPDV 251
            + R    +D   N+  +  + PN  S+N++I+A C    +  A  +F  M ER  +PD 
Sbjct: 163 LYHRGLEFYDYVVNSNMNMNISPNGLSFNLVIKALCKLRFVDRAIEVFRGMPERKCLPDG 222

Query: 252 ETYRTLMQGLCRKNQVNGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKLREAYKLLCR 311
            TY TLM GLC++ +++ AV LL++M ++G  P  + Y  L++ LC+K  L    KL+  
Sbjct: 223 YTYCTLMDGLCKEERIDEAVLLLDEMQSEGCSPSPVIYNVLIDGLCKKGDLTRVTKLVDN 282

Query: 312 MKVKGCNPDIAHYNTVIMGFCREGRALDACKILEDMQSNGCLPNLVSYESLTNGLCDQGM 371
           M +KGC P+   YNT+I G C +G+   A  +LE M S+ C+PN V+Y +L NGL  Q  
Sbjct: 283 MFLKGCVPNEVTYNTLIHGLCLKGKLDKAVSLLERMVSSKCIPNDVTYGTLINGLVKQRR 342

Query: 372 FELAKGYVEEMTLKGFYPHFSVIHALVKGFHSIGRIHESCSVLEDMLKRGKAPHSDTWEI 431
              A   +  M  +G++ +  +   L+ G    G+  E+ S+   M ++G  P+   + +
Sbjct: 343 ATDAVRLLSSMEERGYHLNQHIYSVLISGLFKEGKAEEAMSLWRKMAEKGCKPNIVVYSV 402

Query: 432 IISGIC 434
           ++ G+C
Sbjct: 403 LVDGLC 408

BLAST of CsGy6G002410.1 vs. ExPASy Swiss-Prot
Match: Q9FIX3 (Pentatricopeptide repeat-containing protein At5g39710 OS=Arabidopsis thaliana OX=3702 GN=EMB2745 PE=2 SV=1)

HSP 1 Score: 196.1 bits (497), Expect = 9.5e-49
Identity = 106/318 (33.33%), Positives = 170/318 (53.46%), Query Frame = 0

Query: 140 TPTAFSYIIKIYGEADLPDKALKVFYTMIDFGCTPSSKQLNRILEILVSHRNFIRPAFDL 199
           T + F  ++K Y    L DKAL + +     G  P     N +L+  +  +  I  A ++
Sbjct: 133 TSSVFDLVVKSYSRLSLIDKALSIVHLAQAHGFMPGVLSYNAVLDATIRSKRNISFAENV 192

Query: 200 FKNARHHGVLPNTKSYNILIRAFCWNGNISIAYTLFNKMFERNVIPDVETYRTLMQGLCR 259
           FK      V PN  +YNILIR FC+ GNI +A TLF+KM  +  +P+V TY TL+ G C+
Sbjct: 193 FKEMLESQVSPNVFTYNILIRGFCFAGNIDVALTLFDKMETKGCLPNVVTYNTLIDGYCK 252

Query: 260 KNQVNGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKLREAYKLLCRMKVKGCNPDIAH 319
             +++    LL  M  KG  P+ +SY  ++N LCR+ +++E   +L  M  +G + D   
Sbjct: 253 LRKIDDGFKLLRSMALKGLEPNLISYNVVINGLCREGRMKEVSFVLTEMNRRGYSLDEVT 312

Query: 320 YNTVIMGFCREGRALDACKILEDMQSNGCLPNLVSYESLTNGLCDQGMFELAKGYVEEMT 379
           YNT+I G+C+EG    A  +  +M  +G  P++++Y SL + +C  G    A  ++++M 
Sbjct: 313 YNTLIKGYCKEGNFHQALVMHAEMLRHGLTPSVITYTSLIHSMCKAGNMNRAMEFLDQMR 372

Query: 380 LKGFYPHFSVIHALVKGFHSIGRIHESCSVLEDMLKRGKAPHSDTWEIIISGIC---EVE 439
           ++G  P+      LV GF   G ++E+  VL +M   G +P   T+  +I+G C   ++E
Sbjct: 373 VRGLCPNERTYTTLVDGFSQKGYMNEAYRVLREMNDNGFSPSVVTYNALINGHCVTGKME 432

Query: 440 DTAKFCE-VWEKILKKDV 454
           D     E + EK L  DV
Sbjct: 433 DAIAVLEDMKEKGLSPDV 450

BLAST of CsGy6G002410.1 vs. ExPASy Swiss-Prot
Match: Q9FMF6 (Pentatricopeptide repeat-containing protein At5g64320, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At5g64320 PE=2 SV=1)

HSP 1 Score: 180.3 bits (456), Expect = 5.4e-44
Identity = 120/421 (28.50%), Positives = 200/421 (47.51%), Query Frame = 0

Query: 72  SPCRVQKLIASQSDPLLAKEIFDYACRQPHFRPSSSSLLVLILKLGRSKYFSLIDDLLLS 131
           +P ++ KL+    +   + E+F +   Q  +R S     VLI KLG +  F  ID LL+ 
Sbjct: 77  TPFQLYKLLELPLNVSTSMELFSWTGSQNGYRHSFDVYQVLIGKLGANGEFKTIDRLLIQ 136

Query: 132 FKSRRYPVTPTAFSYIIKIYGEADLPDKALKVFYTMID-FGCTPSSKQLNRILEILVS-- 191
            K        + F  I++ Y +A  P +  ++   M + + C P+ K  N +LEILVS  
Sbjct: 137 MKDEGIVFKESLFISIMRDYDKAGFPGQTTRLMLEMRNVYSCEPTFKSYNVVLEILVSGN 196

Query: 192 -HR-------------------------------NFIRPAFDLFKNARHHGVLPNTKSYN 251
            H+                               N I  A  L ++   HG +PN+  Y 
Sbjct: 197 CHKVAANVFYDMLSRKIPPTLFTFGVVMKAFCAVNEIDSALSLLRDMTKHGCVPNSVIYQ 256

Query: 252 ILIRAFCWNGNISIAYTLFNKMFERNVIPDVETYRTLMQGLCRKNQVNGAVDLLEDMLNK 311
            LI +      ++ A  L  +MF    +PD ET+  ++ GLC+ +++N A  ++  ML +
Sbjct: 257 TLIHSLSKCNRVNEALQLLEEMFLMGCVPDAETFNDVILGLCKFDRINEAAKMVNRMLIR 316

Query: 312 GYIPDTLSYATLLNSLCRKKKLREAYKLLCRMKVKGCNPDIAHYNTVIMGFCREGRALDA 371
           G+ PD ++Y  L+N LC+  ++  A  L  R+      P+I  +NT+I GF   GR  DA
Sbjct: 317 GFAPDDITYGYLMNGLCKIGRVDAAKDLFYRIP----KPEIVIFNTLIHGFVTHGRLDDA 376

Query: 372 CKILEDM-QSNGCLPNLVSYESLTNGLCDQGMFELAKGYVEEMTLKGFYPHFSVIHALVK 431
             +L DM  S G +P++ +Y SL  G   +G+  LA   + +M  KG  P+      LV 
Sbjct: 377 KAVLSDMVTSYGIVPDVCTYNSLIYGYWKEGLVGLALEVLHDMRNKGCKPNVYSYTILVD 436

Query: 432 GFHSIGRIHESCSVLEDMLKRGKAPHSDTWEIIISGICEVEDTAKFCEVWEKILKKDVRR 457
           GF  +G+I E+ +VL +M   G  P++  +  +IS  C+     +  E++ ++ +K  + 
Sbjct: 437 GFCKLGKIDEAYNVLNEMSADGLKPNTVGFNCLISAFCKEHRIPEAVEIFREMPRKGCKP 493

BLAST of CsGy6G002410.1 vs. NCBI nr
Match: XP_004138384.1 (pentatricopeptide repeat-containing protein At4g01400, mitochondrial [Cucumis sativus] >KGN45858.1 hypothetical protein Csa_004855 [Cucumis sativus])

HSP 1 Score: 984 bits (2543), Expect = 0.0
Identity = 482/482 (100.00%), Positives = 482/482 (100.00%), Query Frame = 0

Query: 1   MWQHLLRPCNYRTIETVAAAHVARKSPLLRNLISSSSSLYQPHLNVHNESKFLITNVKHE 60
           MWQHLLRPCNYRTIETVAAAHVARKSPLLRNLISSSSSLYQPHLNVHNESKFLITNVKHE
Sbjct: 1   MWQHLLRPCNYRTIETVAAAHVARKSPLLRNLISSSSSLYQPHLNVHNESKFLITNVKHE 60

Query: 61  QCEDQPDFSIGSPCRVQKLIASQSDPLLAKEIFDYACRQPHFRPSSSSLLVLILKLGRSK 120
           QCEDQPDFSIGSPCRVQKLIASQSDPLLAKEIFDYACRQPHFRPSSSSLLVLILKLGRSK
Sbjct: 61  QCEDQPDFSIGSPCRVQKLIASQSDPLLAKEIFDYACRQPHFRPSSSSLLVLILKLGRSK 120

Query: 121 YFSLIDDLLLSFKSRRYPVTPTAFSYIIKIYGEADLPDKALKVFYTMIDFGCTPSSKQLN 180
           YFSLIDDLLLSFKSRRYPVTPTAFSYIIKIYGEADLPDKALKVFYTMIDFGCTPSSKQLN
Sbjct: 121 YFSLIDDLLLSFKSRRYPVTPTAFSYIIKIYGEADLPDKALKVFYTMIDFGCTPSSKQLN 180

Query: 181 RILEILVSHRNFIRPAFDLFKNARHHGVLPNTKSYNILIRAFCWNGNISIAYTLFNKMFE 240
           RILEILVSHRNFIRPAFDLFKNARHHGVLPNTKSYNILIRAFCWNGNISIAYTLFNKMFE
Sbjct: 181 RILEILVSHRNFIRPAFDLFKNARHHGVLPNTKSYNILIRAFCWNGNISIAYTLFNKMFE 240

Query: 241 RNVIPDVETYRTLMQGLCRKNQVNGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKLRE 300
           RNVIPDVETYRTLMQGLCRKNQVNGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKLRE
Sbjct: 241 RNVIPDVETYRTLMQGLCRKNQVNGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKLRE 300

Query: 301 AYKLLCRMKVKGCNPDIAHYNTVIMGFCREGRALDACKILEDMQSNGCLPNLVSYESLTN 360
           AYKLLCRMKVKGCNPDIAHYNTVIMGFCREGRALDACKILEDMQSNGCLPNLVSYESLTN
Sbjct: 301 AYKLLCRMKVKGCNPDIAHYNTVIMGFCREGRALDACKILEDMQSNGCLPNLVSYESLTN 360

Query: 361 GLCDQGMFELAKGYVEEMTLKGFYPHFSVIHALVKGFHSIGRIHESCSVLEDMLKRGKAP 420
           GLCDQGMFELAKGYVEEMTLKGFYPHFSVIHALVKGFHSIGRIHESCSVLEDMLKRGKAP
Sbjct: 361 GLCDQGMFELAKGYVEEMTLKGFYPHFSVIHALVKGFHSIGRIHESCSVLEDMLKRGKAP 420

Query: 421 HSDTWEIIISGICEVEDTAKFCEVWEKILKKDVRRDTRIVEAGTGLGEYLIRKLQASISR 480
           HSDTWEIIISGICEVEDTAKFCEVWEKILKKDVRRDTRIVEAGTGLGEYLIRKLQASISR
Sbjct: 421 HSDTWEIIISGICEVEDTAKFCEVWEKILKKDVRRDTRIVEAGTGLGEYLIRKLQASISR 480

Query: 481 RI 482
           RI
Sbjct: 481 RI 482

BLAST of CsGy6G002410.1 vs. NCBI nr
Match: KAA0035033.1 (pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] >TYK02366.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 926 bits (2392), Expect = 0.0
Identity = 456/482 (94.61%), Positives = 466/482 (96.68%), Query Frame = 0

Query: 1   MWQHLLRPCNYRTIETVAAAHVARKSPLLRNLISSSSSLYQPHLNVHNESKFLITNVKHE 60
           MW HLLRP NYRTIETVAA HVAR +PLL NLISSSSSLYQPHLNVHNESK LITN+ H+
Sbjct: 83  MWLHLLRPGNYRTIETVAA-HVARNAPLLHNLISSSSSLYQPHLNVHNESKTLITNINHK 142

Query: 61  QCEDQPDFSIGSPCRVQKLIASQSDPLLAKEIFDYACRQPHFRPSSSSLLVLILKLGRSK 120
           QCEDQPDFSIGSPCRVQKLIASQSDPLLAKEIFDYACRQPHFRPSSSSLLVLILKLGRSK
Sbjct: 143 QCEDQPDFSIGSPCRVQKLIASQSDPLLAKEIFDYACRQPHFRPSSSSLLVLILKLGRSK 202

Query: 121 YFSLIDDLLLSFKSRRYPVTPTAFSYIIKIYGEADLPDKALKVFYTMIDFGCTPSSKQLN 180
           YFSLIDDLLLSFKSR YPVTPTAFSYIIKIYGEADLPDKALKVFYTMI+FGCTPSSKQLN
Sbjct: 203 YFSLIDDLLLSFKSRGYPVTPTAFSYIIKIYGEADLPDKALKVFYTMIEFGCTPSSKQLN 262

Query: 181 RILEILVSHRNFIRPAFDLFKNARHHGVLPNTKSYNILIRAFCWNGNISIAYTLFNKMFE 240
           RILEILVSHRNFIRPAFDLFKNARHHGVLPNTKSYNILIRAFCWNGNISIAY LFNKMFE
Sbjct: 263 RILEILVSHRNFIRPAFDLFKNARHHGVLPNTKSYNILIRAFCWNGNISIAYKLFNKMFE 322

Query: 241 RNVIPDVETYRTLMQGLCRKNQVNGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKLRE 300
            +VIPDVETYRTLMQGLCRKNQVNGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKL+E
Sbjct: 323 GDVIPDVETYRTLMQGLCRKNQVNGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKLKE 382

Query: 301 AYKLLCRMKVKGCNPDIAHYNTVIMGFCREGRALDACKILEDMQSNGCLPNLVSYESLTN 360
           AYKLLCRMKVKGCNPDIAHYNTVIMGFCREGRALDACKILEDMQSNGCLPNLVSYESLTN
Sbjct: 383 AYKLLCRMKVKGCNPDIAHYNTVIMGFCREGRALDACKILEDMQSNGCLPNLVSYESLTN 442

Query: 361 GLCDQGMFELAKGYVEEMTLKGFYPHFSVIHALVKGFHSIGRIHESCSVLEDMLKRGKAP 420
           GLCDQGMFELAKGYVEEMTLKGFYPHFSVIHALVKGFH++GR+ ESCSVLE MLK GKAP
Sbjct: 443 GLCDQGMFELAKGYVEEMTLKGFYPHFSVIHALVKGFHNVGRMDESCSVLEGMLKHGKAP 502

Query: 421 HSDTWEIIISGICEVEDTAKFCEVWEKILKKDVRRDTRIVEAGTGLGEYLIRKLQASISR 480
           HSDTWEIIISGICEVEDT KFCE+ EKILKKDVRRDTRIVEAGTGLGEYLIRKLQAS SR
Sbjct: 503 HSDTWEIIISGICEVEDTVKFCEILEKILKKDVRRDTRIVEAGTGLGEYLIRKLQASKSR 562

Query: 481 RI 482
           RI
Sbjct: 563 RI 563

BLAST of CsGy6G002410.1 vs. NCBI nr
Match: XP_008463091.1 (PREDICTED: pentatricopeptide repeat-containing protein At4g01400, mitochondrial-like [Cucumis melo])

HSP 1 Score: 926 bits (2392), Expect = 0.0
Identity = 456/482 (94.61%), Positives = 466/482 (96.68%), Query Frame = 0

Query: 1   MWQHLLRPCNYRTIETVAAAHVARKSPLLRNLISSSSSLYQPHLNVHNESKFLITNVKHE 60
           MW HLLRP NYRTIETVAA HVAR +PLL NLISSSSSLYQPHLNVHNESK LITN+ H+
Sbjct: 1   MWLHLLRPGNYRTIETVAA-HVARNAPLLHNLISSSSSLYQPHLNVHNESKTLITNINHK 60

Query: 61  QCEDQPDFSIGSPCRVQKLIASQSDPLLAKEIFDYACRQPHFRPSSSSLLVLILKLGRSK 120
           QCEDQPDFSIGSPCRVQKLIASQSDPLLAKEIFDYACRQPHFRPSSSSLLVLILKLGRSK
Sbjct: 61  QCEDQPDFSIGSPCRVQKLIASQSDPLLAKEIFDYACRQPHFRPSSSSLLVLILKLGRSK 120

Query: 121 YFSLIDDLLLSFKSRRYPVTPTAFSYIIKIYGEADLPDKALKVFYTMIDFGCTPSSKQLN 180
           YFSLIDDLLLSFKSR YPVTPTAFSYIIKIYGEADLPDKALKVFYTMI+FGCTPSSKQLN
Sbjct: 121 YFSLIDDLLLSFKSRGYPVTPTAFSYIIKIYGEADLPDKALKVFYTMIEFGCTPSSKQLN 180

Query: 181 RILEILVSHRNFIRPAFDLFKNARHHGVLPNTKSYNILIRAFCWNGNISIAYTLFNKMFE 240
           RILEILVSHRNFIRPAFDLFKNARHHGVLPNTKSYNILIRAFCWNGNISIAY LFNKMFE
Sbjct: 181 RILEILVSHRNFIRPAFDLFKNARHHGVLPNTKSYNILIRAFCWNGNISIAYKLFNKMFE 240

Query: 241 RNVIPDVETYRTLMQGLCRKNQVNGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKLRE 300
            +VIPDVETYRTLMQGLCRKNQVNGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKL+E
Sbjct: 241 GDVIPDVETYRTLMQGLCRKNQVNGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKLKE 300

Query: 301 AYKLLCRMKVKGCNPDIAHYNTVIMGFCREGRALDACKILEDMQSNGCLPNLVSYESLTN 360
           AYKLLCRMKVKGCNPDIAHYNTVIMGFCREGRALDACKILEDMQSNGCLPNLVSYESLTN
Sbjct: 301 AYKLLCRMKVKGCNPDIAHYNTVIMGFCREGRALDACKILEDMQSNGCLPNLVSYESLTN 360

Query: 361 GLCDQGMFELAKGYVEEMTLKGFYPHFSVIHALVKGFHSIGRIHESCSVLEDMLKRGKAP 420
           GLCDQGMFELAKGYVEEMTLKGFYPHFSVIHALVKGFH++GR+ ESCSVLE MLK GKAP
Sbjct: 361 GLCDQGMFELAKGYVEEMTLKGFYPHFSVIHALVKGFHNVGRMDESCSVLEGMLKHGKAP 420

Query: 421 HSDTWEIIISGICEVEDTAKFCEVWEKILKKDVRRDTRIVEAGTGLGEYLIRKLQASISR 480
           HSDTWEIIISGICEVEDT KFCE+ EKILKKDVRRDTRIVEAGTGLGEYLIRKLQAS SR
Sbjct: 421 HSDTWEIIISGICEVEDTVKFCEILEKILKKDVRRDTRIVEAGTGLGEYLIRKLQASKSR 480

Query: 481 RI 482
           RI
Sbjct: 481 RI 481

BLAST of CsGy6G002410.1 vs. NCBI nr
Match: XP_038886671.1 (pentatricopeptide repeat-containing protein At4g01400, mitochondrial [Benincasa hispida] >XP_038886672.1 pentatricopeptide repeat-containing protein At4g01400, mitochondrial [Benincasa hispida] >XP_038886673.1 pentatricopeptide repeat-containing protein At4g01400, mitochondrial [Benincasa hispida])

HSP 1 Score: 865 bits (2234), Expect = 6.37e-315
Identity = 425/483 (87.99%), Positives = 450/483 (93.17%), Query Frame = 0

Query: 1   MWQHLLRPCNYRTIETVAAAHVARKSPLLRNLISSSSSLYQPHLNVHNESKFLIT-NVKH 60
           M +HLLRPCNY TIET+AA HV  K+PLL   ISSSSSLYQ  LNVH+ESK LIT N+ H
Sbjct: 1   MRRHLLRPCNYNTIETIAA-HVVPKTPLLHKSISSSSSLYQRDLNVHDESKTLITININH 60

Query: 61  EQCEDQPDFSIGSPCRVQKLIASQSDPLLAKEIFDYACRQPHFRPSSSSLLVLILKLGRS 120
           +QC DQP FSIGSPCRVQKLIASQSDPLLAKEIF YACRQPHFRPSSSSL +LILKLGRS
Sbjct: 61  KQCGDQPHFSIGSPCRVQKLIASQSDPLLAKEIFYYACRQPHFRPSSSSLPILILKLGRS 120

Query: 121 KYFSLIDDLLLSFKSRRYPVTPTAFSYIIKIYGEADLPDKALKVFYTMIDFGCTPSSKQL 180
           KYFSLIDDLLLSFKSR YPVTPT FSY+IKIYGEADLPDKALK FYTMI+FGCTPSSKQL
Sbjct: 121 KYFSLIDDLLLSFKSRGYPVTPTVFSYMIKIYGEADLPDKALKAFYTMIEFGCTPSSKQL 180

Query: 181 NRILEILVSHRNFIRPAFDLFKNARHHGVLPNTKSYNILIRAFCWNGNISIAYTLFNKMF 240
           NRILEILVSHRNFIRPAFDLFKNARHHGVLPNTKSYNIL+RAFCWNGN+SIAYTLFNKMF
Sbjct: 181 NRILEILVSHRNFIRPAFDLFKNARHHGVLPNTKSYNILMRAFCWNGNLSIAYTLFNKMF 240

Query: 241 ERNVIPDVETYRTLMQGLCRKNQVNGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKLR 300
           +R+VIPDVE+YR LMQGLCRKNQVNGAVDLLEDMLNKGYIPD+LSYATLLNSLCRKKKLR
Sbjct: 241 KRDVIPDVESYRILMQGLCRKNQVNGAVDLLEDMLNKGYIPDSLSYATLLNSLCRKKKLR 300

Query: 301 EAYKLLCRMKVKGCNPDIAHYNTVIMGFCREGRALDACKILEDMQSNGCLPNLVSYESLT 360
           EAYKLLCRMKVKGCNPD+AHYNT I+GFCREGRALDACKILEDMQSNGCLPNLVSY+SLT
Sbjct: 301 EAYKLLCRMKVKGCNPDVAHYNTAIIGFCREGRALDACKILEDMQSNGCLPNLVSYQSLT 360

Query: 361 NGLCDQGMFELAKGYVEEMTLKGFYPHFSVIHALVKGFHSIGRIHESCSVLEDMLKRGKA 420
           NGLCDQGMFELAK YVEEMTLKGF PHFS+IHALVKGF ++GRI ESCS+LEDML  GKA
Sbjct: 361 NGLCDQGMFELAKDYVEEMTLKGFCPHFSIIHALVKGFRNVGRIDESCSILEDMLNHGKA 420

Query: 421 PHSDTWEIIISGICEVEDTAKFCEVWEKILKKDVRRDTRIVEAGTGLGEYLIRKLQASIS 480
           PHSDTWEIIISGICEVEDT K CE+  KILKKDVRRDTRIVEAG+GLGEYLIRKLQAS S
Sbjct: 421 PHSDTWEIIISGICEVEDTVKLCEILGKILKKDVRRDTRIVEAGSGLGEYLIRKLQASKS 480

Query: 481 RRI 482
           RR+
Sbjct: 481 RRV 482

BLAST of CsGy6G002410.1 vs. NCBI nr
Match: XP_023515125.1 (pentatricopeptide repeat-containing protein At4g01400, mitochondrial-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 862 bits (2226), Expect = 1.05e-313
Identity = 424/482 (87.97%), Positives = 450/482 (93.36%), Query Frame = 0

Query: 1   MWQHLLRPCNYRTIETVAAAHVARKSPLLRNLISSSSSLYQPHLNVHNESKFL-ITNVKH 60
           M QHLLRPCNY+TIETVA  H+A K+PLL N ISSSSSLYQP LNVHNE K L  TN+ H
Sbjct: 1   MRQHLLRPCNYKTIETVAV-HLAPKTPLLHNSISSSSSLYQPDLNVHNELKTLNATNINH 60

Query: 61  EQCEDQPDFSIGSPCRVQKLIASQSDPLLAKEIFDYACRQPHFRPSSSSLLVLILKLGRS 120
           ++ E+QPDFSIGSPCRVQKLIASQSDPLLAKEIFDYACRQPHFRPSSSS LVLILKLGRS
Sbjct: 61  KRLEEQPDFSIGSPCRVQKLIASQSDPLLAKEIFDYACRQPHFRPSSSSFLVLILKLGRS 120

Query: 121 KYFSLIDDLLLSFKSRRYPVTPTAFSYIIKIYGEADLPDKALKVFYTMIDFGCTPSSKQL 180
           KYFSLIDDLLLSFKSR YP +PT FSYIIKIYGEADLPDKALK FYTMI+FGCTPSSKQL
Sbjct: 121 KYFSLIDDLLLSFKSRGYPFSPTVFSYIIKIYGEADLPDKALKTFYTMIEFGCTPSSKQL 180

Query: 181 NRILEILVSHRNFIRPAFDLFKNARHHGVLPNTKSYNILIRAFCWNGNISIAYTLFNKMF 240
           NRILEILVSHRNFIRPAFDLFKNARHHGVLPNTKSYNIL+RAFCWNG++SIAYTLFNKMF
Sbjct: 181 NRILEILVSHRNFIRPAFDLFKNARHHGVLPNTKSYNILMRAFCWNGDLSIAYTLFNKMF 240

Query: 241 ERNVIPDVETYRTLMQGLCRKNQVNGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKLR 300
           +R+V+PDVE+YR LMQGLCRKNQV GAVDLLEDMLNKGY+PDTLSYATLLNSLCRKKKLR
Sbjct: 241 KRDVVPDVESYRILMQGLCRKNQVIGAVDLLEDMLNKGYVPDTLSYATLLNSLCRKKKLR 300

Query: 301 EAYKLLCRMKVKGCNPDIAHYNTVIMGFCREGRALDACKILEDMQSNGCLPNLVSYESLT 360
           EAYKLLCRMKVKGCNPD+AHYNTVI GFCREGRALDACKILEDMQSN CLPNLVSY+SLT
Sbjct: 301 EAYKLLCRMKVKGCNPDVAHYNTVITGFCREGRALDACKILEDMQSNRCLPNLVSYQSLT 360

Query: 361 NGLCDQGMFELAKGYVEEMTLKGFYPHFSVIHALVKGFHSIGRIHESCSVLEDMLKRGKA 420
           NGLCDQGMFELAK YVEEMTLKGF PHFSVIH LVKGF ++GRI +SCSVLEDMLK GKA
Sbjct: 361 NGLCDQGMFELAKDYVEEMTLKGFCPHFSVIHTLVKGFINVGRIDDSCSVLEDMLKDGKA 420

Query: 421 PHSDTWEIIISGICEVEDTAKFCEVWEKILKKDVRRDTRIVEAGTGLGEYLIRKLQASIS 480
           PHS+TWEI+ISGICEVEDT K CE+ EKILKKDVRRDTRIVEAG+GLGEYLIRKLQAS S
Sbjct: 421 PHSETWEIVISGICEVEDTVKLCEILEKILKKDVRRDTRIVEAGSGLGEYLIRKLQASKS 480

BLAST of CsGy6G002410.1 vs. ExPASy TrEMBL
Match: A0A0A0K8U0 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G014780 PE=4 SV=1)

HSP 1 Score: 984 bits (2543), Expect = 0.0
Identity = 482/482 (100.00%), Positives = 482/482 (100.00%), Query Frame = 0

Query: 1   MWQHLLRPCNYRTIETVAAAHVARKSPLLRNLISSSSSLYQPHLNVHNESKFLITNVKHE 60
           MWQHLLRPCNYRTIETVAAAHVARKSPLLRNLISSSSSLYQPHLNVHNESKFLITNVKHE
Sbjct: 1   MWQHLLRPCNYRTIETVAAAHVARKSPLLRNLISSSSSLYQPHLNVHNESKFLITNVKHE 60

Query: 61  QCEDQPDFSIGSPCRVQKLIASQSDPLLAKEIFDYACRQPHFRPSSSSLLVLILKLGRSK 120
           QCEDQPDFSIGSPCRVQKLIASQSDPLLAKEIFDYACRQPHFRPSSSSLLVLILKLGRSK
Sbjct: 61  QCEDQPDFSIGSPCRVQKLIASQSDPLLAKEIFDYACRQPHFRPSSSSLLVLILKLGRSK 120

Query: 121 YFSLIDDLLLSFKSRRYPVTPTAFSYIIKIYGEADLPDKALKVFYTMIDFGCTPSSKQLN 180
           YFSLIDDLLLSFKSRRYPVTPTAFSYIIKIYGEADLPDKALKVFYTMIDFGCTPSSKQLN
Sbjct: 121 YFSLIDDLLLSFKSRRYPVTPTAFSYIIKIYGEADLPDKALKVFYTMIDFGCTPSSKQLN 180

Query: 181 RILEILVSHRNFIRPAFDLFKNARHHGVLPNTKSYNILIRAFCWNGNISIAYTLFNKMFE 240
           RILEILVSHRNFIRPAFDLFKNARHHGVLPNTKSYNILIRAFCWNGNISIAYTLFNKMFE
Sbjct: 181 RILEILVSHRNFIRPAFDLFKNARHHGVLPNTKSYNILIRAFCWNGNISIAYTLFNKMFE 240

Query: 241 RNVIPDVETYRTLMQGLCRKNQVNGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKLRE 300
           RNVIPDVETYRTLMQGLCRKNQVNGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKLRE
Sbjct: 241 RNVIPDVETYRTLMQGLCRKNQVNGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKLRE 300

Query: 301 AYKLLCRMKVKGCNPDIAHYNTVIMGFCREGRALDACKILEDMQSNGCLPNLVSYESLTN 360
           AYKLLCRMKVKGCNPDIAHYNTVIMGFCREGRALDACKILEDMQSNGCLPNLVSYESLTN
Sbjct: 301 AYKLLCRMKVKGCNPDIAHYNTVIMGFCREGRALDACKILEDMQSNGCLPNLVSYESLTN 360

Query: 361 GLCDQGMFELAKGYVEEMTLKGFYPHFSVIHALVKGFHSIGRIHESCSVLEDMLKRGKAP 420
           GLCDQGMFELAKGYVEEMTLKGFYPHFSVIHALVKGFHSIGRIHESCSVLEDMLKRGKAP
Sbjct: 361 GLCDQGMFELAKGYVEEMTLKGFYPHFSVIHALVKGFHSIGRIHESCSVLEDMLKRGKAP 420

Query: 421 HSDTWEIIISGICEVEDTAKFCEVWEKILKKDVRRDTRIVEAGTGLGEYLIRKLQASISR 480
           HSDTWEIIISGICEVEDTAKFCEVWEKILKKDVRRDTRIVEAGTGLGEYLIRKLQASISR
Sbjct: 421 HSDTWEIIISGICEVEDTAKFCEVWEKILKKDVRRDTRIVEAGTGLGEYLIRKLQASISR 480

Query: 481 RI 482
           RI
Sbjct: 481 RI 482

BLAST of CsGy6G002410.1 vs. ExPASy TrEMBL
Match: A0A5A7SWW3 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold155G00670 PE=4 SV=1)

HSP 1 Score: 926 bits (2392), Expect = 0.0
Identity = 456/482 (94.61%), Positives = 466/482 (96.68%), Query Frame = 0

Query: 1   MWQHLLRPCNYRTIETVAAAHVARKSPLLRNLISSSSSLYQPHLNVHNESKFLITNVKHE 60
           MW HLLRP NYRTIETVAA HVAR +PLL NLISSSSSLYQPHLNVHNESK LITN+ H+
Sbjct: 83  MWLHLLRPGNYRTIETVAA-HVARNAPLLHNLISSSSSLYQPHLNVHNESKTLITNINHK 142

Query: 61  QCEDQPDFSIGSPCRVQKLIASQSDPLLAKEIFDYACRQPHFRPSSSSLLVLILKLGRSK 120
           QCEDQPDFSIGSPCRVQKLIASQSDPLLAKEIFDYACRQPHFRPSSSSLLVLILKLGRSK
Sbjct: 143 QCEDQPDFSIGSPCRVQKLIASQSDPLLAKEIFDYACRQPHFRPSSSSLLVLILKLGRSK 202

Query: 121 YFSLIDDLLLSFKSRRYPVTPTAFSYIIKIYGEADLPDKALKVFYTMIDFGCTPSSKQLN 180
           YFSLIDDLLLSFKSR YPVTPTAFSYIIKIYGEADLPDKALKVFYTMI+FGCTPSSKQLN
Sbjct: 203 YFSLIDDLLLSFKSRGYPVTPTAFSYIIKIYGEADLPDKALKVFYTMIEFGCTPSSKQLN 262

Query: 181 RILEILVSHRNFIRPAFDLFKNARHHGVLPNTKSYNILIRAFCWNGNISIAYTLFNKMFE 240
           RILEILVSHRNFIRPAFDLFKNARHHGVLPNTKSYNILIRAFCWNGNISIAY LFNKMFE
Sbjct: 263 RILEILVSHRNFIRPAFDLFKNARHHGVLPNTKSYNILIRAFCWNGNISIAYKLFNKMFE 322

Query: 241 RNVIPDVETYRTLMQGLCRKNQVNGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKLRE 300
            +VIPDVETYRTLMQGLCRKNQVNGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKL+E
Sbjct: 323 GDVIPDVETYRTLMQGLCRKNQVNGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKLKE 382

Query: 301 AYKLLCRMKVKGCNPDIAHYNTVIMGFCREGRALDACKILEDMQSNGCLPNLVSYESLTN 360
           AYKLLCRMKVKGCNPDIAHYNTVIMGFCREGRALDACKILEDMQSNGCLPNLVSYESLTN
Sbjct: 383 AYKLLCRMKVKGCNPDIAHYNTVIMGFCREGRALDACKILEDMQSNGCLPNLVSYESLTN 442

Query: 361 GLCDQGMFELAKGYVEEMTLKGFYPHFSVIHALVKGFHSIGRIHESCSVLEDMLKRGKAP 420
           GLCDQGMFELAKGYVEEMTLKGFYPHFSVIHALVKGFH++GR+ ESCSVLE MLK GKAP
Sbjct: 443 GLCDQGMFELAKGYVEEMTLKGFYPHFSVIHALVKGFHNVGRMDESCSVLEGMLKHGKAP 502

Query: 421 HSDTWEIIISGICEVEDTAKFCEVWEKILKKDVRRDTRIVEAGTGLGEYLIRKLQASISR 480
           HSDTWEIIISGICEVEDT KFCE+ EKILKKDVRRDTRIVEAGTGLGEYLIRKLQAS SR
Sbjct: 503 HSDTWEIIISGICEVEDTVKFCEILEKILKKDVRRDTRIVEAGTGLGEYLIRKLQASKSR 562

Query: 481 RI 482
           RI
Sbjct: 563 RI 563

BLAST of CsGy6G002410.1 vs. ExPASy TrEMBL
Match: A0A1S3CIG1 (pentatricopeptide repeat-containing protein At4g01400, mitochondrial-like OS=Cucumis melo OX=3656 GN=LOC103501324 PE=4 SV=1)

HSP 1 Score: 926 bits (2392), Expect = 0.0
Identity = 456/482 (94.61%), Positives = 466/482 (96.68%), Query Frame = 0

Query: 1   MWQHLLRPCNYRTIETVAAAHVARKSPLLRNLISSSSSLYQPHLNVHNESKFLITNVKHE 60
           MW HLLRP NYRTIETVAA HVAR +PLL NLISSSSSLYQPHLNVHNESK LITN+ H+
Sbjct: 1   MWLHLLRPGNYRTIETVAA-HVARNAPLLHNLISSSSSLYQPHLNVHNESKTLITNINHK 60

Query: 61  QCEDQPDFSIGSPCRVQKLIASQSDPLLAKEIFDYACRQPHFRPSSSSLLVLILKLGRSK 120
           QCEDQPDFSIGSPCRVQKLIASQSDPLLAKEIFDYACRQPHFRPSSSSLLVLILKLGRSK
Sbjct: 61  QCEDQPDFSIGSPCRVQKLIASQSDPLLAKEIFDYACRQPHFRPSSSSLLVLILKLGRSK 120

Query: 121 YFSLIDDLLLSFKSRRYPVTPTAFSYIIKIYGEADLPDKALKVFYTMIDFGCTPSSKQLN 180
           YFSLIDDLLLSFKSR YPVTPTAFSYIIKIYGEADLPDKALKVFYTMI+FGCTPSSKQLN
Sbjct: 121 YFSLIDDLLLSFKSRGYPVTPTAFSYIIKIYGEADLPDKALKVFYTMIEFGCTPSSKQLN 180

Query: 181 RILEILVSHRNFIRPAFDLFKNARHHGVLPNTKSYNILIRAFCWNGNISIAYTLFNKMFE 240
           RILEILVSHRNFIRPAFDLFKNARHHGVLPNTKSYNILIRAFCWNGNISIAY LFNKMFE
Sbjct: 181 RILEILVSHRNFIRPAFDLFKNARHHGVLPNTKSYNILIRAFCWNGNISIAYKLFNKMFE 240

Query: 241 RNVIPDVETYRTLMQGLCRKNQVNGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKLRE 300
            +VIPDVETYRTLMQGLCRKNQVNGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKL+E
Sbjct: 241 GDVIPDVETYRTLMQGLCRKNQVNGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKLKE 300

Query: 301 AYKLLCRMKVKGCNPDIAHYNTVIMGFCREGRALDACKILEDMQSNGCLPNLVSYESLTN 360
           AYKLLCRMKVKGCNPDIAHYNTVIMGFCREGRALDACKILEDMQSNGCLPNLVSYESLTN
Sbjct: 301 AYKLLCRMKVKGCNPDIAHYNTVIMGFCREGRALDACKILEDMQSNGCLPNLVSYESLTN 360

Query: 361 GLCDQGMFELAKGYVEEMTLKGFYPHFSVIHALVKGFHSIGRIHESCSVLEDMLKRGKAP 420
           GLCDQGMFELAKGYVEEMTLKGFYPHFSVIHALVKGFH++GR+ ESCSVLE MLK GKAP
Sbjct: 361 GLCDQGMFELAKGYVEEMTLKGFYPHFSVIHALVKGFHNVGRMDESCSVLEGMLKHGKAP 420

Query: 421 HSDTWEIIISGICEVEDTAKFCEVWEKILKKDVRRDTRIVEAGTGLGEYLIRKLQASISR 480
           HSDTWEIIISGICEVEDT KFCE+ EKILKKDVRRDTRIVEAGTGLGEYLIRKLQAS SR
Sbjct: 421 HSDTWEIIISGICEVEDTVKFCEILEKILKKDVRRDTRIVEAGTGLGEYLIRKLQASKSR 480

Query: 481 RI 482
           RI
Sbjct: 481 RI 481
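
The percentages on an HSP statistics line such as the one for this match can be recomputed from the raw counts as a sanity check. The following is a minimal Python sketch and is not part of the database output; the function name `parse_hsp_stats` and the regular expression are illustrative assumptions about the line layout shown on this page (the pattern tolerates a missing "i" in the "Positives" label).

```python
import re

# Assumed layout of the HSP statistics line as it appears on this page.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return recomputed and reported identity/positive percentages."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        # Recomputed from the raw counts (matches / alignment length).
        "identity": round(int(ident) / int(ident_len) * 100, 2),
        "positives": round(int(pos) / int(pos_len) * 100, 2),
        # Values as printed in the report, for comparison.
        "reported_identity": float(ident_pct),
        "reported_positives": float(pos_pct),
    }


stats = parse_hsp_stats(
    "Identity = 456/482 (94.61%), Positives = 466/482 (96.68%), Query Frame = 0"
)
print(stats)
```

If the recomputed and reported values disagree, the alignment length or the counts were probably altered during extraction.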

BLAST of CsGy6G002410.1 vs. ExPASy TrEMBL
Match: A0A6J1EXV3 (pentatricopeptide repeat-containing protein At4g01400, mitochondrial-like OS=Cucurbita moschata OX=3662 GN=LOC111439286 PE=4 SV=1)

HSP 1 Score: 858 bits (2218), Expect = 8.44e-313
Identity = 421/482 (87.34%), Positives = 448/482 (92.95%), Query Frame = 0

Query: 1   MWQHLLRPCNYRTIETVAAAHVARKSPLLRNLISSSSSLYQPHLNVHNESKFL-ITNVKH 60
           M QHLLRPCNY+T+ETVA  H+A K+PLL N ISSSSSLYQP LNVHNE K L  TN+ H
Sbjct: 1   MRQHLLRPCNYKTLETVAV-HLAPKTPLLHNSISSSSSLYQPDLNVHNELKTLSATNINH 60

Query: 61  EQCEDQPDFSIGSPCRVQKLIASQSDPLLAKEIFDYACRQPHFRPSSSSLLVLILKLGRS 120
           +  E QPDFSIGSPCRVQKLIASQSDPLLAKEIFDYACRQPHFRPSSSS LVLILKLGRS
Sbjct: 61  KHLEQQPDFSIGSPCRVQKLIASQSDPLLAKEIFDYACRQPHFRPSSSSFLVLILKLGRS 120

Query: 121 KYFSLIDDLLLSFKSRRYPVTPTAFSYIIKIYGEADLPDKALKVFYTMIDFGCTPSSKQL 180
           KYFSLIDDLLLSFKSR YP++PT FSYIIKIYGEADLPDKALK FYTMI+FGCTPSSKQL
Sbjct: 121 KYFSLIDDLLLSFKSRGYPLSPTVFSYIIKIYGEADLPDKALKTFYTMIEFGCTPSSKQL 180

Query: 181 NRILEILVSHRNFIRPAFDLFKNARHHGVLPNTKSYNILIRAFCWNGNISIAYTLFNKMF 240
           NRILEILVSHRNFIRPAFDLFKNARHHGVLPNTKSYNIL+R FCWNG++SIAYTLFNKMF
Sbjct: 181 NRILEILVSHRNFIRPAFDLFKNARHHGVLPNTKSYNILMRVFCWNGDLSIAYTLFNKMF 240

Query: 241 ERNVIPDVETYRTLMQGLCRKNQVNGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKLR 300
           +R+V+PDVE+YR LMQGLCRKNQV GAVDLLEDMLNKGY+PDTLSYATLLNSLCRKKKLR
Sbjct: 241 KRDVVPDVESYRILMQGLCRKNQVIGAVDLLEDMLNKGYVPDTLSYATLLNSLCRKKKLR 300

Query: 301 EAYKLLCRMKVKGCNPDIAHYNTVIMGFCREGRALDACKILEDMQSNGCLPNLVSYESLT 360
           EAYKLLCRMKVKGCNPD+AHYNTVI GFCREGRALDACKILEDMQSN CLPNLVSY+SLT
Sbjct: 301 EAYKLLCRMKVKGCNPDVAHYNTVITGFCREGRALDACKILEDMQSNRCLPNLVSYQSLT 360

Query: 361 NGLCDQGMFELAKGYVEEMTLKGFYPHFSVIHALVKGFHSIGRIHESCSVLEDMLKRGKA 420
           NGLCDQGMFELAK YVEEMTLKGF PHFSVIH LVKGF ++GRI +SCSVLEDMLK GKA
Sbjct: 361 NGLCDQGMFELAKDYVEEMTLKGFCPHFSVIHTLVKGFINVGRIDDSCSVLEDMLKHGKA 420

Query: 421 PHSDTWEIIISGICEVEDTAKFCEVWEKILKKDVRRDTRIVEAGTGLGEYLIRKLQASIS 480
           PHS+TWE+IISG+CEVEDT K CE+ EKILKKDVRRDTRIVEAG+GLGEYLIRKLQAS S
Sbjct: 421 PHSETWEMIISGVCEVEDTVKLCEILEKILKKDVRRDTRIVEAGSGLGEYLIRKLQASKS 480

BLAST of CsGy6G002410.1 vs. ExPASy TrEMBL
Match: A0A6J1ICW5 (pentatricopeptide repeat-containing protein At4g01400, mitochondrial-like OS=Cucurbita maxima OX=3661 GN=LOC111472648 PE=4 SV=1)

HSP 1 Score: 858 bits (2217), Expect = 1.20e-312
Identity = 424/482 (87.97%), Positives = 449/482 (93.15%), Query Frame = 0

Query: 1   MWQHLLRPCNYRTIETVAAAHVARKSPLLRNLISSSSSLYQPHLNVHNESKFLI-TNVKH 60
           M QHLLRPCNY+TIETVA  H+A K+PLL N ISSSSSLYQP LNVHNE K L  TN+ H
Sbjct: 1   MRQHLLRPCNYKTIETVAV-HLAPKTPLLHNSISSSSSLYQPDLNVHNELKTLNDTNINH 60

Query: 61  EQCEDQPDFSIGSPCRVQKLIASQSDPLLAKEIFDYACRQPHFRPSSSSLLVLILKLGRS 120
           +  E+QPDFSIGSPCRVQKLIASQSDPLLAKEIFDYACRQPHFRPSSSS LVLILKLGRS
Sbjct: 61  KHLEEQPDFSIGSPCRVQKLIASQSDPLLAKEIFDYACRQPHFRPSSSSFLVLILKLGRS 120

Query: 121 KYFSLIDDLLLSFKSRRYPVTPTAFSYIIKIYGEADLPDKALKVFYTMIDFGCTPSSKQL 180
           KYFSLI+DLLLSFKSR YP++PT FSYIIKIYGEADLPDKALK FYTMI+FGCTPSSKQL
Sbjct: 121 KYFSLINDLLLSFKSRGYPLSPTVFSYIIKIYGEADLPDKALKTFYTMIEFGCTPSSKQL 180

Query: 181 NRILEILVSHRNFIRPAFDLFKNARHHGVLPNTKSYNILIRAFCWNGNISIAYTLFNKMF 240
           NRILEILVSHR+FIRPAFDLFKNARHHGVLPNTKSYNIL+RAFCWNG++SIAYTLFNKMF
Sbjct: 181 NRILEILVSHRDFIRPAFDLFKNARHHGVLPNTKSYNILMRAFCWNGDLSIAYTLFNKMF 240

Query: 241 ERNVIPDVETYRTLMQGLCRKNQVNGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKLR 300
           +R+VIPDVE+YR LMQGLCRKNQV GAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKLR
Sbjct: 241 KRDVIPDVESYRILMQGLCRKNQVIGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKLR 300

Query: 301 EAYKLLCRMKVKGCNPDIAHYNTVIMGFCREGRALDACKILEDMQSNGCLPNLVSYESLT 360
           EAYKLLCRMKVKGCNPD+AHYNTVI GFCREGRALDACKILEDMQ NGCLPNLVSY+SLT
Sbjct: 301 EAYKLLCRMKVKGCNPDVAHYNTVITGFCREGRALDACKILEDMQLNGCLPNLVSYQSLT 360

Query: 361 NGLCDQGMFELAKGYVEEMTLKGFYPHFSVIHALVKGFHSIGRIHESCSVLEDMLKRGKA 420
           NGLCDQGMFELAK YVEEMTL GF PHFSVIH LVKGF ++GRI +SCSVLEDMLK GKA
Sbjct: 361 NGLCDQGMFELAKDYVEEMTLNGFCPHFSVIHTLVKGFINVGRIDDSCSVLEDMLKHGKA 420

Query: 421 PHSDTWEIIISGICEVEDTAKFCEVWEKILKKDVRRDTRIVEAGTGLGEYLIRKLQASIS 480
           PHS+TWEIIISGICEVEDT K CE+ EKILKKDVRRDTRIVEAG+GLGEYLIRKLQAS S
Sbjct: 421 PHSETWEIIISGICEVEDTVKLCEILEKILKKDVRRDTRIVEAGSGLGEYLIRKLQASKS 480

BLAST of CsGy6G002410.1 vs. TAIR 10
Match: AT4G01400.3 (FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 15 growth stages; CONTAINS InterPro DOMAIN/s: Pentatricopeptide repeat (InterPro:IPR002885); BEST Arabidopsis thaliana protein match is: Pentatricopeptide repeat (PPR) superfamily protein (TAIR:AT5G46100.1); Has 40053 Blast hits to 12380 proteins in 263 species: Archae - 4; Bacteria - 27; Metazoa - 366; Fungi - 374; Plants - 38347; Viruses - 0; Other Eukaryotes - 935 (source: NCBI BLink). )

HSP 1 Score: 547.7 bits (1410), Expect = 9.1e-156
Identity = 262/447 (58.61%), Positives = 338/447 (75.62%), Query Frame = 0

Query: 29  LRNLISSSSSLYQPHLNVHNESKFLITNVKHEQCEDQPDFSIGSPCRVQKLIASQSDPLL 88
           L + +S+SS       + H   K +++N         P   IGSP RVQKLIASQSDPLL
Sbjct: 16  LTSPLSTSSRFLFYSSSEHEARKPIVSN---------PKSPIGSPTRVQKLIASQSDPLL 75

Query: 89  AKEIFDYACRQPHFRPSSSSLLVLILKLGRSKYFSLIDDLLLSFKSRRYPVTPTAFSYII 148
           AKEIFDYA +QP+FR S SS L+LILKLGR +YF+LIDD+L   +S  YP+T   F+Y+I
Sbjct: 76  AKEIFDYASQQPNFRHSRSSHLILILKLGRGRYFNLIDDVLAKHRSSGYPLTGEIFTYLI 135

Query: 149 KIYGEADLPDKALKVFYTMIDFGCTPSSKQLNRILEILVSHRNFIRPAFDLFKNARHHGV 208
           K+Y EA LP+K L  FY M++F  TP  K LNRIL++LVSHR +++ AF+LFK++R HGV
Sbjct: 136 KVYAEAKLPEKVLSTFYKMLEFNFTPQPKHLNRILDVLVSHRGYLQKAFELFKSSRLHGV 195

Query: 209 LPNTKSYNILIRAFCWNGNISIAYTLFNKMFERNVIPDVETYRTLMQGLCRKNQVNGAVD 268
           +PNT+SYN+L++AFC N ++SIAY LF KM ER+V+PDV++Y+ L+QG CRK QVNGA++
Sbjct: 196 MPNTRSYNLLMQAFCLNDDLSIAYQLFGKMLERDVVPDVDSYKILIQGFCRKGQVNGAME 255

Query: 269 LLEDMLNKGYIPDTLSYATLLNSLCRKKKLREAYKLLCRMKVKGCNPDIAHYNTVIMGFC 328
           LL+DMLNKG++PD LSY TLLNSLCRK +LREAYKLLCRMK+KGCNPD+ HYNT+I+GFC
Sbjct: 256 LLDDMLNKGFVPDRLSYTTLLNSLCRKTQLREAYKLLCRMKLKGCNPDLVHYNTMILGFC 315

Query: 329 REGRALDACKILEDMQSNGCLPNLVSYESLTNGLCDQGMFELAKGYVEEMTLKGFYPHFS 388
           RE RA+DA K+L+DM SNGC PN VSY +L  GLCDQGMF+  K Y+EEM  KGF PHFS
Sbjct: 316 REDRAMDARKVLDDMLSNGCSPNSVSYRTLIGGLCDQGMFDEGKKYLEEMISKGFSPHFS 375

Query: 389 VIHALVKGFHSIGRIHESCSVLEDMLKRGKAPHSDTWEIIISGICEVEDTAKFCEVWEKI 448
           V + LVKGF S G++ E+C V+E ++K G+  HSDTWE++I  IC  +++ K     E  
Sbjct: 376 VSNCLVKGFCSFGKVEEACDVVEVVMKNGETLHSDTWEMVIPLICNEDESEKIKLFLEDA 435

Query: 449 LKKDVRRDTRIVEAGTGLGEYLIRKLQ 476
           +K+++  DTRIV+ G GLG YL  KLQ
Sbjct: 436 VKEEITGDTRIVDVGIGLGSYLSSKLQ 453

BLAST of CsGy6G002410.1 vs. TAIR 10
Match: AT4G01400.1 (FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 15 growth stages; CONTAINS InterPro DOMAIN/s: COG4 transport (InterPro:IPR013167), Pentatricopeptide repeat (InterPro:IPR002885); BEST Arabidopsis thaliana protein match is: Pentatricopeptide repeat (PPR) superfamily protein (TAIR:AT5G46100.1); Has 26268 Blast hits to 8959 proteins in 289 species: Archae - 0; Bacteria - 3; Metazoa - 247; Fungi - 222; Plants - 25350; Viruses - 0; Other Eukaryotes - 446 (source: NCBI BLink). )

HSP 1 Score: 376.7 bits (966), Expect = 2.8e-104
Identity = 197/435 (45.29%), Positives = 265/435 (60.92%), Query Frame = 0

Query: 29  LRNLISSSSSLYQPHLNVHNESKFLITNVKHEQCEDQPDFSIGSPCRVQKLIASQSDPLL 88
           L + +S+SS       + H   K +++N         P   IGSP RVQKLIASQSDPLL
Sbjct: 16  LTSPLSTSSRFLFYSSSEHEARKPIVSN---------PKSPIGSPTRVQKLIASQSDPLL 75

Query: 89  AKEIFDYACRQPHFRPSSSSLLVLILKLGRSKYFSLIDDLLLSFKSRRYPVTPTAFSYII 148
           AKEIFDYA +QP+FR S SS L+LILKLGR +YF+LIDD+L   +S  YP+T   F+Y+I
Sbjct: 76  AKEIFDYASQQPNFRHSRSSHLILILKLGRGRYFNLIDDVLAKHRSSGYPLTGEIFTYLI 135

Query: 149 KIYGEADLPDKALKVFYTMIDFGCTPSSKQLNRILEILVSHRNFIRPAFDLFKNARHHGV 208
           K+Y EA LP+K L  FY M++F  TP  K LNRIL++LVSHR +++ AF+LFK++R HGV
Sbjct: 136 KVYAEAKLPEKVLSTFYKMLEFNFTPQPKHLNRILDVLVSHRGYLQKAFELFKSSRLHGV 195

Query: 209 LPNTKSYNILIRAFCWNGNISIAYTLFNKMFERNVIPDVETYRTLMQGLCRKNQVNGAVD 268
           +PNT+SYN+L++AFC N ++SIAY LF KM ER+V+PDV++Y+ L+QG CRK QVNGA++
Sbjct: 196 MPNTRSYNLLMQAFCLNDDLSIAYQLFGKMLERDVVPDVDSYKILIQGFCRKGQVNGAME 255

Query: 269 LLEDMLNKGYIPDTLSYATLLNSLCRKKKLREAYKLLCRMKVKGCNPDIAHYNTVIMGFC 328
           LL+DMLNKG++PD                                               
Sbjct: 256 LLDDMLNKGFVPD----------------------------------------------- 315

Query: 329 REGRALDACKILEDMQSNGCLPNLVSYESLTNGLCDQGMFELAKGYVEEMTLKGFYPHFS 388
                                       +L  GLCDQGMF+  K Y+EEM  KGF PHFS
Sbjct: 316 ---------------------------RTLIGGLCDQGMFDEGKKYLEEMISKGFSPHFS 367

Query: 389 VIHALVKGFHSIGRIHESCSVLEDMLKRGKAPHSDTWEIIISGICEVEDTAKFCEVWEKI 448
           V + LVKGF S G++ E+C V+E ++K G+  HSDTWE++I  IC  +++ K     E  
Sbjct: 376 VSNCLVKGFCSFGKVEEACDVVEVVMKNGETLHSDTWEMVIPLICNEDESEKIKLFLEDA 367

Query: 449 LKKDVRRDTRIVEAG 464
           +K+++  DTRIV+ G
Sbjct: 436 VKEEITGDTRIVDVG 367

BLAST of CsGy6G002410.1 vs. TAIR 10
Match: AT5G46100.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 217.2 bits (552), Expect = 2.8e-56
Identity = 120/372 (32.26%), Positives = 197/372 (52.96%), Query Frame = 0

Query: 72  SPCRVQKLIASQSDPLLAKEIFDYACRQ--PHFRPSSSSLLVLILKLGRSKYFSLIDDLL 131
           +P +V KL+ ++ D   +  +FD A  +    +    SS   ++L+L  +  F   +DL+
Sbjct: 15  TPSQVIKLMRAEKDVEKSMAVFDSATAEYANGYVHDQSSFGYMVLRLVSANKFKAAEDLI 74

Query: 132 LSFKSRRYPVTPTAFSYIIKIYGEADLPDKALKVFYTMIDFGCTPSSKQLNRILEILVSH 191
           +  K     V+      I + YG    P  +L+VF+ M DF C PS K    +L ILV  
Sbjct: 75  VRMKIENCVVSEDILLSICRGYGRVHRPFDSLRVFHKMKDFDCDPSQKAYVTVLAILV-E 134

Query: 192 RNFIRPAFDLFKNARHHGVLPNTKSYNILIRAFCWN-GNISIAYTLFNKMFERNVIPDVE 251
            N +  AF  +KN R  G+ P   S N+LI+A C N G +     +F +M +R   PD  
Sbjct: 135 ENQLNLAFKFYKNMREIGLPPTVASLNVLIKALCRNDGTVDAGLKIFLEMPKRGCDPDSY 194

Query: 252 TYRTLMQGLCRKNQVNGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKLREAYKLLCRM 311
           TY TL+ GLCR  +++ A  L  +M+ K   P  ++Y +L+N LC  K + EA + L  M
Sbjct: 195 TYGTLISGLCRFGRIDEAKKLFTEMVEKDCAPTVVTYTSLINGLCGSKNVDEAMRYLEEM 254

Query: 312 KVKGCNPDIAHYNTVIMGFCREGRALDACKILEDMQSNGCLPNLVSYESLTNGLCDQGMF 371
           K KG  P++  Y++++ G C++GR+L A ++ E M + GC PN+V+Y +L  GLC +   
Sbjct: 255 KSKGIEPNVFTYSSLMDGLCKDGRSLQAMELFEMMMARGCRPNMVTYTTLITGLCKEQKI 314

Query: 372 ELAKGYVEEMTLKGFYPHFSVIHALVKGFHSIGRIHESCSVLEDMLKRGKAPHSDTWEI- 431
           + A   ++ M L+G  P   +   ++ GF +I +  E+ + L++M+  G  P+  TW I 
Sbjct: 315 QEAVELLDRMNLQGLKPDAGLYGKVISGFCAISKFREAANFLDEMILGGITPNRLTWNIH 374

Query: 432 ------IISGIC 434
                 ++ G+C
Sbjct: 375 VKTSNEVVRGLC 385

BLAST of CsGy6G002410.1 vs. TAIR 10
Match: AT4G20090.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 197.2 bits (500), Expect = 3.0e-50
Identity = 110/366 (30.05%), Positives = 195/366 (53.28%), Query Frame = 0

Query: 72  SPCRVQKLIASQSDPLLAKEIFDYACRQPHFRPSSSSLLVLILKLGRSKYFSLIDDLLLS 131
           SP    +++ +  +  +++++F  A +   F+   S+L  +I     S  F  ++ LL  
Sbjct: 43  SPNPSMEVVENPLEAPISEKMFKSAPKMGSFKLGDSTLSSMIESYANSGDFDSVEKLLSR 102

Query: 132 FKSRRYPVTPTAFSYIIKIYGEADLPDKALKVFYTMID-FGCTPSSKQLNRILEILVSHR 191
            +     +   +F  + + YG+A LPDKA+ +F+ M+D F C  S K  N +L ++++  
Sbjct: 103 IRLENRVIIERSFIVVFRAYGKAHLPDKAVDLFHRMVDEFRCKRSVKSFNSVLNVIINEG 162

Query: 192 NFIR--PAFDLFKNAR-HHGVLPNTKSYNILIRAFCWNGNISIAYTLFNKMFERNVIPDV 251
            + R    +D   N+  +  + PN  S+N++I+A C    +  A  +F  M ER  +PD 
Sbjct: 163 LYHRGLEFYDYVVNSNMNMNISPNGLSFNLVIKALCKLRFVDRAIEVFRGMPERKCLPDG 222

Query: 252 ETYRTLMQGLCRKNQVNGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKLREAYKLLCR 311
            TY TLM GLC++ +++ AV LL++M ++G  P  + Y  L++ LC+K  L    KL+  
Sbjct: 223 YTYCTLMDGLCKEERIDEAVLLLDEMQSEGCSPSPVIYNVLIDGLCKKGDLTRVTKLVDN 282

Query: 312 MKVKGCNPDIAHYNTVIMGFCREGRALDACKILEDMQSNGCLPNLVSYESLTNGLCDQGM 371
           M +KGC P+   YNT+I G C +G+   A  +LE M S+ C+PN V+Y +L NGL  Q  
Sbjct: 283 MFLKGCVPNEVTYNTLIHGLCLKGKLDKAVSLLERMVSSKCIPNDVTYGTLINGLVKQRR 342

Query: 372 FELAKGYVEEMTLKGFYPHFSVIHALVKGFHSIGRIHESCSVLEDMLKRGKAPHSDTWEI 431
              A   +  M  +G++ +  +   L+ G    G+  E+ S+   M ++G  P+   + +
Sbjct: 343 ATDAVRLLSSMEERGYHLNQHIYSVLISGLFKEGKAEEAMSLWRKMAEKGCKPNIVVYSV 402

Query: 432 IISGIC 434
           ++ G+C
Sbjct: 403 LVDGLC 408

BLAST of CsGy6G002410.1 vs. TAIR 10
Match: AT5G39710.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 196.1 bits (497), Expect = 6.7e-50
Identity = 106/318 (33.33%), Positives = 170/318 (53.46%), Query Frame = 0

Query: 140 TPTAFSYIIKIYGEADLPDKALKVFYTMIDFGCTPSSKQLNRILEILVSHRNFIRPAFDL 199
           T + F  ++K Y    L DKAL + +     G  P     N +L+  +  +  I  A ++
Sbjct: 133 TSSVFDLVVKSYSRLSLIDKALSIVHLAQAHGFMPGVLSYNAVLDATIRSKRNISFAENV 192

Query: 200 FKNARHHGVLPNTKSYNILIRAFCWNGNISIAYTLFNKMFERNVIPDVETYRTLMQGLCR 259
           FK      V PN  +YNILIR FC+ GNI +A TLF+KM  +  +P+V TY TL+ G C+
Sbjct: 193 FKEMLESQVSPNVFTYNILIRGFCFAGNIDVALTLFDKMETKGCLPNVVTYNTLIDGYCK 252

Query: 260 KNQVNGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKLREAYKLLCRMKVKGCNPDIAH 319
             +++    LL  M  KG  P+ +SY  ++N LCR+ +++E   +L  M  +G + D   
Sbjct: 253 LRKIDDGFKLLRSMALKGLEPNLISYNVVINGLCREGRMKEVSFVLTEMNRRGYSLDEVT 312

Query: 320 YNTVIMGFCREGRALDACKILEDMQSNGCLPNLVSYESLTNGLCDQGMFELAKGYVEEMT 379
           YNT+I G+C+EG    A  +  +M  +G  P++++Y SL + +C  G    A  ++++M 
Sbjct: 313 YNTLIKGYCKEGNFHQALVMHAEMLRHGLTPSVITYTSLIHSMCKAGNMNRAMEFLDQMR 372

Query: 380 LKGFYPHFSVIHALVKGFHSIGRIHESCSVLEDMLKRGKAPHSDTWEIIISGIC---EVE 439
           ++G  P+      LV GF   G ++E+  VL +M   G +P   T+  +I+G C   ++E
Sbjct: 373 VRGLCPNERTYTTLVDGFSQKGYMNEAYRVLREMNDNGFSPSVVTYNALINGHCVTGKME 432

Query: 440 DTAKFCE-VWEKILKKDV 454
           D     E + EK L  DV
Sbjct: 433 DAIAVLEDMKEKGLSPDV 450

The following BLAST results are available for this feature:
Match Name       E-value     Identity  Description
Q8LDU5           1.3e-154    58.61     Pentatricopeptide repeat-containing protein At4g01400, mitochondrial OS=Arabidop...
Q9FNL2           4.0e-55     32.26     Pentatricopeptide repeat-containing protein At5g46100 OS=Arabidopsis thaliana OX...
O49436           4.2e-49     30.05     Pentatricopeptide repeat-containing protein At4g20090 OS=Arabidopsis thaliana OX...
Q9FIX3           9.5e-49     33.33     Pentatricopeptide repeat-containing protein At5g39710 OS=Arabidopsis thaliana OX...
Q9FMF6           5.4e-44     28.50     Pentatricopeptide repeat-containing protein At5g64320, mitochondrial OS=Arabidop...

Match Name       E-value     Identity  Description
XP_004138384.1   0.0         100.00    pentatricopeptide repeat-containing protein At4g01400, mitochondrial [Cucumis sa...
KAA0035033.1     0.0         94.61     pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] >TYK02366...
XP_008463091.1   0.0         94.61     PREDICTED: pentatricopeptide repeat-containing protein At4g01400, mitochondrial-...
XP_038886671.1   6.37e-315   87.99     pentatricopeptide repeat-containing protein At4g01400, mitochondrial [Benincasa ...
XP_023515125.1   1.05e-313   87.97     pentatricopeptide repeat-containing protein At4g01400, mitochondrial-like [Cucur...

Match Name       E-value     Identity  Description
A0A0A0K8U0       0.0         100.00    Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G014780 PE=4 SV=1
A0A5A7SWW3       0.0         94.61     Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946...
A0A1S3CIG1       0.0         94.61     pentatricopeptide repeat-containing protein At4g01400, mitochondrial-like OS=Cuc...
A0A6J1EXV3       8.44e-313   87.34     pentatricopeptide repeat-containing protein At4g01400, mitochondrial-like OS=Cuc...
A0A6J1ICW5       1.20e-312   87.97     pentatricopeptide repeat-containing protein At4g01400, mitochondrial-like OS=Cuc...

Match Name       E-value     Identity  Description
AT4G01400.3      9.1e-156    58.61     FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknow...
AT4G01400.1      2.8e-104    45.29     FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknow...
AT5G46100.1      2.8e-56     32.26     Pentatricopeptide repeat (PPR) superfamily protein
AT4G20090.1      3.0e-50     30.05     Pentatricopeptide repeat (PPR) superfamily protein
AT5G39710.1      6.7e-50     33.33     Tetratricopeptide repeat (TPR)-like superfamily protein

InterPro
Analysis Name: InterPro Annotations of Cucumber (Gy14) v2.1
Date Performed: 2021-10-25
IPR Term   IPR Description   Source   Source Term   Source Description   Alignment
IPR002885  Pentatricopeptide repeat  PFAM  PF13041  PPR_2
    coord: 210..259  e-value: 9.2E-16  score: 57.8
    coord: 315..364  e-value: 1.5E-13  score: 50.8
IPR002885  Pentatricopeptide repeat  TIGRFAM  TIGR00756  TIGR00756
    coord: 284..317  e-value: 2.0E-8  score: 31.9
    coord: 144..175  e-value: 7.2E-5  score: 20.7
    coord: 214..247  e-value: 5.5E-6  score: 24.2
    coord: 353..385  e-value: 1.9E-4  score: 19.4
    coord: 320..351  e-value: 4.4E-9  score: 34.0
    coord: 249..282  e-value: 1.4E-7  score: 29.2
IPR002885  Pentatricopeptide repeat  PFAM  PF01535  PPR
    coord: 144..172  e-value: 0.0054  score: 16.9
IPR002885  Pentatricopeptide repeat  PFAM  PF12854  PPR_1
    coord: 276..309  e-value: 2.2E-8  score: 33.7
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 140..174  score: 9.591195
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 246..280  score: 12.24383
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 281..315  score: 12.232868
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 211..245  score: 11.586152
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 316..350  score: 12.298636
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 351..385  score: 10.369448
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D  1.25.40.10  Tetratricopeptide repeat domain
    coord: 67..190  e-value: 4.5E-11  score: 44.4
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D  1.25.40.10  Tetratricopeptide repeat domain
    coord: 191..307  e-value: 1.8E-32  score: 115.1
    coord: 379..466  e-value: 2.7E-10  score: 42.2
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D  1.25.40.10  Tetratricopeptide repeat domain
    coord: 308..378  e-value: 8.0E-20  score: 73.2
None  No IPR available  PANTHER  PTHR47942  TETRATRICOPEPTIDE REPEAT (TPR)-LIKE SUPERFAMILY PROTEIN-RELATED
    coord: 55..473
None  No IPR available  PANTHER  PTHR47942:SF2  OS09G0532800 PROTEIN
    coord: 55..473

Relationships

This mRNA is a part of the following gene feature(s):

Feature Name    Unique Name     Type
CsGy6G002410    CsGy6G002410    gene


The following five_prime_UTR feature(s) are a part of this mRNA:

Feature Name             Unique Name              Type
CsGy6G002410.1.utr5p1    CsGy6G002410.1.utr5p1    five_prime_UTR


The following exon feature(s) are a part of this mRNA:

Feature Name            Unique Name             Type
CsGy6G002410.1.exon1    CsGy6G002410.1.exon1    exon


The following CDS feature(s) are a part of this mRNA:

Feature Name          Unique Name           Type
cds.CsGy6G002410.1    cds.CsGy6G002410.1    CDS


The following three_prime_UTR feature(s) are a part of this mRNA:

Feature Name             Unique Name              Type
CsGy6G002410.1.utr3p1    CsGy6G002410.1.utr3p1    three_prime_UTR


The following polypeptide feature(s) derives from this mRNA:

Feature Name      Unique Name               Type
CsGy6G002410.1    CsGy6G002410.1-protein    polypeptide


GO Annotation
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0032981 mitochondrial respiratory chain complex I assembly
biological_process GO:0000373 Group II intron splicing
biological_process GO:0000963 mitochondrial RNA processing
cellular_component GO:0005739 mitochondrion
molecular_function GO:0005515 protein binding