Cla97C02G028040 (gene) Watermelon (97103) v2.5

Overview
NameCla97C02G028040
Typegene
OrganismCitrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
DescriptionPentatricopeptide repeat-containing protein family
LocationCla97Chr02: 1572292 .. 1573359 (+)
RNA-Seq ExpressionCla97C02G028040
SyntenyCla97C02G028040
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCTACTGGAAGAAGCTGGGGTATTATGATATATCATTTTCTGTTTCACTTGATCTCCTCTATCACTCTTGATAACCAAGCCTTGGTTAATGACCAAATTGAGGAATGGGCTAACCTAATGTCTTCCTTACTGAATCCTTACTGGAGGCTTGATAGCAGAAGAAACCTAAAGTGCATTGGCTTTGAAAGTGAATTGAATATTCTATACAAACTCCTTCGCACAAGGCCGGAAACTCCAGGCCTATATGCATACATGTGGGAATCTTCAAAACAATACCTATCTCAGCTCTAAACTTGCTGCATGACAGAATCCCAAGTGATTTTCAATGGGGTTGCTTTAAAAATTTATTTCCTGTGGAATTTTATGATAAGAGGTTATGCTTCTGATGGTACTAATGCTTACAAGGCCCTTGATATGTTTCGTAAAATGCTAGCATTAGGATTTACGCTGCTATTAAACAATGGACTGTTGCGGAAAAGGTGAGGCCTACGATGAGAAAACATGGATTGAAAAAACCTGCGGGCTATAGTTTAATTGAGCTTGATAGCCAGGTTCATACGTTCTTTGTTGGAGACAAATCACACCAACAGACAAAAGATATATATGCCAAATTGAAGGATATAAATTCGCTACTCAAGGAGGATGGGTACAAGCCCGATACAAGTCTGGTGTTATATGATGTCCATGAAGAACAAAAAGAGAAGATTCTTTGGGATCATAGTGAGCGGTTGGCCATTGCTTTTGCTCTTCTAAATACCTCTCCAGGCACTAAAATTAGGATAACTAAGAATCTTCGAGTATGTGTAGATTGCCACACGGTAGCAAAATTAATATCAAAACTCATGGATCGAGAAATCTTCAGGTTTCACCATTTTATTGATGGCCTTTGTTCTTGTGGTGATTACATAACTTTGTTTCATATCAAATTTATTCAAAAACAAGAGGTATGGTATTTCTTATATATGAACTTTGTAGCAATCCTTTTATAGTTAGCAATTAAATATGTTCATGGTTCTTTCTAAGTATCATGCTTGGTAACCATAGGTTTCACCATTTTATAGATAG

mRNA sequence

ATGGCTACTGGAAGAAGCTGGGGTATTATGATATATCATTTTCTGTTTCACTTGATCTCCTCTATCACTCTTGATAACCAAGCCTTGGTTAATGACCAAATTGAGGAATGGGCTAACCTAATGTCTTCCTTACTGAATCCTTACTGGAGGCTTGATAGCAGAAGAAACCTAAAGTGCATTGGCTTTGAAAGTGAATTGAATATTCTATACAAACTCCTTCGCACAAGGCCGGAAACTCCAGGCCTATATGCATACATCATTAGGATTTACGCTGCTATTAAACAATGGACTGTTGCGGAAAAGGTGAGGCCTACGATGAGAAAACATGGATTGAAAAAACCTGCGGGCTATAGTTTAATTGAGCTTGATAGCCAGGTTCATACGTTCTTTGTTGGAGACAAATCACACCAACAGACAAAAGATATATATGCCAAATTGAAGGATATAAATTCGCTACTCAAGGAGGATGGGTACAAGCCCGATACAAGTCTGGTGTTATATGATGTCCATGAAGAACAAAAAGAGAAGATTCTTTGGGATCATAGTGAGCGGTTGGCCATTGCTTTTGCTCTTCTAAATACCTCTCCAGGCACTAAAATTAGGATAACTAAGAATCTTCGAGTATGTGTAGATTGCCACACGGTAGCAAAATTAATATCAAAACTCATGGATCGAGAAATCTTCAGGTTTCACCATTTTATTGATGGCCTTTGTTCTTGTGGTGATTACATAACTTTGTTTCATATCAAATTTATTCAAAAACAAGAGGTTTCACCATTTTATAGATAG

Coding sequence (CDS)

ATGGCTACTGGAAGAAGCTGGGGTATTATGATATATCATTTTCTGTTTCACTTGATCTCCTCTATCACTCTTGATAACCAAGCCTTGGTTAATGACCAAATTGAGGAATGGGCTAACCTAATGTCTTCCTTACTGAATCCTTACTGGAGGCTTGATAGCAGAAGAAACCTAAAGTGCATTGGCTTTGAAAGTGAATTGAATATTCTATACAAACTCCTTCGCACAAGGCCGGAAACTCCAGGCCTATATGCATACATCATTAGGATTTACGCTGCTATTAAACAATGGACTGTTGCGGAAAAGGTGAGGCCTACGATGAGAAAACATGGATTGAAAAAACCTGCGGGCTATAGTTTAATTGAGCTTGATAGCCAGGTTCATACGTTCTTTGTTGGAGACAAATCACACCAACAGACAAAAGATATATATGCCAAATTGAAGGATATAAATTCGCTACTCAAGGAGGATGGGTACAAGCCCGATACAAGTCTGGTGTTATATGATGTCCATGAAGAACAAAAAGAGAAGATTCTTTGGGATCATAGTGAGCGGTTGGCCATTGCTTTTGCTCTTCTAAATACCTCTCCAGGCACTAAAATTAGGATAACTAAGAATCTTCGAGTATGTGTAGATTGCCACACGGTAGCAAAATTAATATCAAAACTCATGGATCGAGAAATCTTCAGGTTTCACCATTTTATTGATGGCCTTTGTTCTTGTGGTGATTACATAACTTTGTTTCATATCAAATTTATTCAAAAACAAGAGGTTTCACCATTTTATAGATAG

Protein sequence

MATGRSWGIMIYHFLFHLISSITLDNQALVNDQIEEWANLMSSLLNPYWRLDSRRNLKCIGFESELNILYKLLRTRPETPGLYAYIIRIYAAIKQWTVAEKVRPTMRKHGLKKPAGYSLIELDSQVHTFFVGDKSHQQTKDIYAKLKDINSLLKEDGYKPDTSLVLYDVHEEQKEKILWDHSERLAIAFALLNTSPGTKIRITKNLRVCVDCHTVAKLISKLMDREIFRFHHFIDGLCSCGDYITLFHIKFIQKQEVSPFYR
Homology
BLAST of Cla97C02G028040 vs. NCBI nr
Match: XP_038887372.1 (pentatricopeptide repeat-containing protein At3g26782, mitochondrial-like [Benincasa hispida])

HSP 1 Score: 284.3 bits (726), Expect = 1.1e-72
Identity = 146/219 (66.67%), Postives = 164/219 (74.89%), Query Frame = 0

Query: 30  VNDQIEEWANLMSSLLNPYWRLDSRRNLKCIGFESELNILYKLLRTRPETPGLYAYIIRI 89
           V    + WA L+S+      RL   RN++     ++      + RT P     Y  +  I
Sbjct: 514 VEPSADIWATLLSAC-----RL--HRNVELAEISAQ-----NIFRTNPSKVSSYICLSNI 573

Query: 90  YAAIKQWTVAEKVRPTMRKHGLKKPAGYSLIELDSQVHTFFVGDKSHQQTKDIYAKLKDI 149
           YAA+KQWT  EKVR TMRK GLKKPAGYSLIELDSQ+H FFVGDKSHQ+TKDIYAKLKDI
Sbjct: 574 YAAMKQWTAVEKVRATMRKRGLKKPAGYSLIELDSQIHMFFVGDKSHQETKDIYAKLKDI 633

Query: 150 NSLLKEDGYKPDTSLVLYDVHEEQKEKILWDHSERLAIAFALLNTSPGTKIRITKNLRVC 209
           +  LKE+GYKPDTS VLYDV+EEQKEK+LWDHSERLAIAFALLNT PGTKIRITKNLRVC
Sbjct: 634 SWRLKEEGYKPDTSPVLYDVNEEQKEKMLWDHSERLAIAFALLNTPPGTKIRITKNLRVC 693

Query: 210 VDCHTVAKLISKLMDREI-----FRFHHFIDGLCSCGDY 244
            DCHTV K ISKLMDREI      RFHHFIDG+CSCGD+
Sbjct: 694 ADCHTVTKFISKLMDREIVMRDNHRFHHFIDGICSCGDF 720

BLAST of Cla97C02G028040 vs. NCBI nr
Match: XP_022135919.1 (pentatricopeptide repeat-containing protein At3g12770-like isoform X1 [Momordica charantia])

HSP 1 Score: 283.1 bits (723), Expect = 2.4e-72
Identity = 148/219 (67.58%), Postives = 164/219 (74.89%), Query Frame = 0

Query: 30  VNDQIEEWANLMSSLLNPYWRLDSRRNLKCIGFESELNILYKLLRTRPETPGLYAYIIRI 89
           V    E WA L+S+      RL   RN++     ++      + RT P     Y  +  I
Sbjct: 512 VEPPAEIWAALLSAC-----RL--HRNVELAEISAQ-----NIFRTNPTRVSSYICLSNI 571

Query: 90  YAAIKQWTVAEKVRPTMRKHGLKKPAGYSLIELDSQVHTFFVGDKSHQQTKDIYAKLKDI 149
           YAA KQWT  EKVR TMR+HGLKKPAGYSLIELDSQVH FFVGD+SHQQTKDIYAKLKDI
Sbjct: 572 YAAKKQWTDVEKVRATMRRHGLKKPAGYSLIELDSQVHMFFVGDRSHQQTKDIYAKLKDI 631

Query: 150 NSLLKEDGYKPDTSLVLYDVHEEQKEKILWDHSERLAIAFALLNTSPGTKIRITKNLRVC 209
           +  LKE+G+KPDTS VLYDV EEQKEK+LWDHSERLAIAFALLNT PGTKIRITKNLRVC
Sbjct: 632 SWRLKEEGHKPDTSSVLYDVDEEQKEKMLWDHSERLAIAFALLNTPPGTKIRITKNLRVC 691

Query: 210 VDCHTVAKLISKLMDREIF-----RFHHFIDGLCSCGDY 244
            DCHTV KLISKLMDREI      RFHHFIDG+CSCGD+
Sbjct: 692 ADCHTVTKLISKLMDREIIMRDNHRFHHFIDGICSCGDF 718

BLAST of Cla97C02G028040 vs. NCBI nr
Match: XP_022135920.1 (pentatricopeptide repeat-containing protein At3g12770-like isoform X2 [Momordica charantia])

HSP 1 Score: 283.1 bits (723), Expect = 2.4e-72
Identity = 148/219 (67.58%), Postives = 164/219 (74.89%), Query Frame = 0

Query: 30  VNDQIEEWANLMSSLLNPYWRLDSRRNLKCIGFESELNILYKLLRTRPETPGLYAYIIRI 89
           V    E WA L+S+      RL   RN++     ++      + RT P     Y  +  I
Sbjct: 476 VEPPAEIWAALLSAC-----RL--HRNVELAEISAQ-----NIFRTNPTRVSSYICLSNI 535

Query: 90  YAAIKQWTVAEKVRPTMRKHGLKKPAGYSLIELDSQVHTFFVGDKSHQQTKDIYAKLKDI 149
           YAA KQWT  EKVR TMR+HGLKKPAGYSLIELDSQVH FFVGD+SHQQTKDIYAKLKDI
Sbjct: 536 YAAKKQWTDVEKVRATMRRHGLKKPAGYSLIELDSQVHMFFVGDRSHQQTKDIYAKLKDI 595

Query: 150 NSLLKEDGYKPDTSLVLYDVHEEQKEKILWDHSERLAIAFALLNTSPGTKIRITKNLRVC 209
           +  LKE+G+KPDTS VLYDV EEQKEK+LWDHSERLAIAFALLNT PGTKIRITKNLRVC
Sbjct: 596 SWRLKEEGHKPDTSSVLYDVDEEQKEKMLWDHSERLAIAFALLNTPPGTKIRITKNLRVC 655

Query: 210 VDCHTVAKLISKLMDREIF-----RFHHFIDGLCSCGDY 244
            DCHTV KLISKLMDREI      RFHHFIDG+CSCGD+
Sbjct: 656 ADCHTVTKLISKLMDREIIMRDNHRFHHFIDGICSCGDF 682

BLAST of Cla97C02G028040 vs. NCBI nr
Match: XP_022968999.1 (pentatricopeptide repeat-containing protein At3g26782, mitochondrial-like [Cucurbita maxima])

HSP 1 Score: 280.8 bits (717), Expect = 1.2e-71
Identity = 137/177 (77.40%), Postives = 148/177 (83.62%), Query Frame = 0

Query: 72  LLRTRPETPGLYAYIIRIYAAIKQWTVAEKVRPTMRKHGLKKPAGYSLIELDSQVHTFFV 131
           + RT P     Y  +  IYAA+KQWT  EKVR TMR +GLKKPAGYSLIELDSQVH FFV
Sbjct: 543 IFRTNPTRVSSYICLSNIYAAMKQWTAVEKVRTTMRNYGLKKPAGYSLIELDSQVHMFFV 602

Query: 132 GDKSHQQTKDIYAKLKDINSLLKEDGYKPDTSLVLYDVHEEQKEKILWDHSERLAIAFAL 191
           GDKSHQQTKDIYAKLKDI+  LKE G+KPDTS +LYDV+EEQKEK+LWDHSERLAIAFAL
Sbjct: 603 GDKSHQQTKDIYAKLKDISWRLKEKGHKPDTSSILYDVNEEQKEKMLWDHSERLAIAFAL 662

Query: 192 LNTSPGTKIRITKNLRVCVDCHTVAKLISKLMDREI-----FRFHHFIDGLCSCGDY 244
           LNT PGTKIRITKNLRVC DCHTV KLISKLMDREI      RFHHFIDG+CSCGD+
Sbjct: 663 LNTPPGTKIRITKNLRVCADCHTVTKLISKLMDREIVMRDNHRFHHFIDGICSCGDF 719

BLAST of Cla97C02G028040 vs. NCBI nr
Match: XP_023554404.1 (putative pentatricopeptide repeat-containing protein At3g11460, mitochondrial [Cucurbita pepo subsp. pepo])

HSP 1 Score: 280.0 bits (715), Expect = 2.1e-71
Identity = 137/177 (77.40%), Postives = 147/177 (83.05%), Query Frame = 0

Query: 72  LLRTRPETPGLYAYIIRIYAAIKQWTVAEKVRPTMRKHGLKKPAGYSLIELDSQVHTFFV 131
           + RT P     Y  +  IYAA+KQWT  EKVR TMR +GLKKPAGYSLIELDSQVH FFV
Sbjct: 543 IFRTNPTRVSSYICLSNIYAAMKQWTAVEKVRTTMRNYGLKKPAGYSLIELDSQVHMFFV 602

Query: 132 GDKSHQQTKDIYAKLKDINSLLKEDGYKPDTSLVLYDVHEEQKEKILWDHSERLAIAFAL 191
           GDKSHQQTKDIYAKLKDI+  LKE G+KPDTS +LYDV EEQKEK+LWDHSERLAIAFAL
Sbjct: 603 GDKSHQQTKDIYAKLKDISWRLKEKGHKPDTSSILYDVSEEQKEKMLWDHSERLAIAFAL 662

Query: 192 LNTSPGTKIRITKNLRVCVDCHTVAKLISKLMDREI-----FRFHHFIDGLCSCGDY 244
           LNT PGTKIRITKNLRVC DCHTV KLISKLMDREI      RFHHFIDG+CSCGD+
Sbjct: 663 LNTPPGTKIRITKNLRVCADCHTVTKLISKLMDREIVMRDNHRFHHFIDGICSCGDF 719

BLAST of Cla97C02G028040 vs. ExPASy Swiss-Prot
Match: Q9SUH6 (Pentatricopeptide repeat-containing protein At4g30700 OS=Arabidopsis thaliana OX=3702 GN=DYW9 PE=2 SV=1)

HSP 1 Score: 183.7 bits (465), Expect = 2.6e-45
Identity = 90/178 (50.56%), Postives = 119/178 (66.85%), Query Frame = 0

Query: 71  KLLRTRPETPGLYAYIIRIYAAIKQWTVAEKVRPTMRKHGLKKPAGYSLIELDSQVHTFF 130
           KL    P+  G +  +  I++A + +  A  VR T +K  L K  GY+LIE+    H F 
Sbjct: 614 KLFELDPDNVGYHVLLSNIHSADRNYPQAATVRQTAKKRKLAKAPGYTLIEIGETPHVFT 673

Query: 131 VGDKSHQQTKDIYAKLKDINSLLKEDGYKPDTSLVLYDVHEEQKEKILWDHSERLAIAFA 190
            GD+SH Q K+IY KL+ +   ++E GY+P+T L L+DV EE++E ++  HSERLAIAF 
Sbjct: 674 SGDQSHPQVKEIYEKLEKLEGKMREAGYQPETELALHDVEEEERELMVKVHSERLAIAFG 733

Query: 191 LLNTSPGTKIRITKNLRVCVDCHTVAKLISKLMDREIF-----RFHHFIDGLCSCGDY 244
           L+ T PGT+IRI KNLRVC+DCHTV KLISK+ +R I      RFHHF DG+CSCGDY
Sbjct: 734 LIATEPGTEIRIIKNLRVCLDCHTVTKLISKITERVIVVRDANRFHHFKDGVCSCGDY 791

BLAST of Cla97C02G028040 vs. ExPASy Swiss-Prot
Match: Q9LN01 (Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H12 PE=2 SV=1)

HSP 1 Score: 181.4 bits (459), Expect = 1.3e-44
Identity = 88/177 (49.72%), Postives = 115/177 (64.97%), Query Frame = 0

Query: 72  LLRTRPETPGLYAYIIRIYAAIKQWTVAEKVRPTMRKHGLKKPAGYSLIELDSQVHTFFV 131
           L++  PE PG Y  +  IYA+  +W    K R  +   G+KK  G S IE+DS VH F +
Sbjct: 564 LIKIEPENPGSYVLLSNIYASAGRWNEVAKTRALLNDKGMKKVPGCSSIEIDSVVHEFII 623

Query: 132 GDKSHQQTKDIYAKLKDINSLLKEDGYKPDTSLVLYDVHEEQKEKILWDHSERLAIAFAL 191
           GDK H + ++IY  L+++  LL++ G+ PDTS VL ++ EE KE  L  HSE+LAIAF L
Sbjct: 624 GDKFHPRNREIYGMLEEMEVLLEKAGFVPDTSEVLQEMEEEWKEGALRHHSEKLAIAFGL 683

Query: 192 LNTSPGTKIRITKNLRVCVDCHTVAKLISKLMDREIF-----RFHHFIDGLCSCGDY 244
           ++T PGTK+ I KNLRVC +CH   KLISK+  REI      RFHHF DG+CSC DY
Sbjct: 684 ISTKPGTKLTIVKNLRVCRNCHEATKLISKIYKREIIARDRTRFHHFRDGVCSCNDY 740

BLAST of Cla97C02G028040 vs. ExPASy Swiss-Prot
Match: Q9LW32 (Pentatricopeptide repeat-containing protein At3g26782, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-H34 PE=2 SV=1)

HSP 1 Score: 178.3 bits (451), Expect = 1.1e-43
Identity = 87/168 (51.79%), Postives = 111/168 (66.07%), Query Frame = 0

Query: 81  GLYAYIIRIYAAIKQWTVAEKVRPTMRKHGLKKPAGYSLIELDSQVHTFFVGDKSHQQTK 140
           G Y  +  IYA   +W   E+VR  M+  GL KP G+SL+EL+ +VH F +GD+ H Q +
Sbjct: 491 GYYMLLSHIYADAGRWKDVERVRMIMKNRGLVKPPGFSLLELNGEVHVFLIGDEEHPQRE 550

Query: 141 DIYAKLKDINSLLKEDGYKPDTSLVLYDVHEEQKEKILWDHSERLAIAFALLNTSPGTKI 200
            IY  L ++N  L E GY  +TS V +DV EE+KE  L  HSE+LAIAF ++NT PG+ +
Sbjct: 551 KIYEFLAELNRKLLEAGYVSNTSSVCHDVDEEEKEMTLRVHSEKLAIAFGIMNTVPGSTV 610

Query: 201 RITKNLRVCVDCHTVAKLISKLMDREIF-----RFHHFIDGLCSCGDY 244
            + KNLRVC DCH V KLISK++DRE       RFHHF DG CSCGDY
Sbjct: 611 NVVKNLRVCSDCHNVIKLISKIVDREFVVRDAKRFHHFKDGGCSCGDY 658

BLAST of Cla97C02G028040 vs. ExPASy Swiss-Prot
Match: Q9LNU6 (Pentatricopeptide repeat-containing protein At1g20230 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H21 PE=2 SV=2)

HSP 1 Score: 178.3 bits (451), Expect = 1.1e-43
Identity = 82/178 (46.07%), Postives = 113/178 (63.48%), Query Frame = 0

Query: 71  KLLRTRPETPGLYAYIIRIYAAIKQWTVAEKVRPTMRKHGLKKPAGYSLIELDSQVHTFF 130
           KL    PE PG Y  +  IYAA   WT  + +R  M   GLKK  G S I++ ++V+T  
Sbjct: 582 KLFHLEPENPGTYVLLSNIYAAKGMWTEVDSIRNKMESLGLKKNPGCSWIQVKNRVYTLL 641

Query: 131 VGDKSHQQTKDIYAKLKDINSLLKEDGYKPDTSLVLYDVHEEQKEKILWDHSERLAIAFA 190
            GDKSH Q   I  K+ +I+  +++ G++P+    L+DV E+++E++LW HSE+LA+ F 
Sbjct: 642 AGDKSHPQIDQITEKMDEISKEMRKSGHRPNLDFALHDVEEQEQEQMLWGHSEKLAVVFG 701

Query: 191 LLNTSPGTKIRITKNLRVCVDCHTVAKLISKLMDREIF-----RFHHFIDGLCSCGDY 244
           LLNT  GT +++ KNLR+C DCH V K IS    REIF     RFHHF DG+CSCGD+
Sbjct: 702 LLNTPDGTPLQVIKNLRICGDCHAVIKFISSYAGREIFIRDTNRFHHFKDGICSCGDF 759

BLAST of Cla97C02G028040 vs. ExPASy Swiss-Prot
Match: Q9SMZ2 (Pentatricopeptide repeat-containing protein At4g33170 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H53 PE=3 SV=1)

HSP 1 Score: 175.6 bits (444), Expect = 7.2e-43
Identity = 85/186 (45.70%), Postives = 119/186 (63.98%), Query Frame = 0

Query: 63  ESELNILYKLLRTRPETPGLYAYIIRIYAAIKQWTVAEKVRPTMRKHGLKKPAGYSLIEL 122
           E+   +  KLL   P     Y  +  +YAA  +W   +  R  M+ H +KK  G+S IE+
Sbjct: 804 ETGKRVATKLLELEPLDSSAYVLLSNMYAAASKWDEMKLARTMMKGHKVKKDPGFSWIEV 863

Query: 123 DSQVHTFFVGDKSHQQTKDIYAKLKDINSLLKEDGYKPDTSLVLYDVHEEQKEKILWDHS 182
            +++H F V D+S++QT+ IY K+KD+   +K++GY P+T   L DV EE+KE+ L+ HS
Sbjct: 864 KNKIHIFVVDDRSNRQTELIYRKVKDMIRDIKQEGYVPETDFTLVDVEEEEKERALYYHS 923

Query: 183 ERLAIAFALLNTSPGTKIRITKNLRVCVDCHTVAKLISKLMDREIF-----RFHHFIDGL 242
           E+LA+AF LL+T P T IR+ KNLRVC DCH   K I+K+ +REI      RFH F DG+
Sbjct: 924 EKLAVAFGLLSTPPSTPIRVIKNLRVCGDCHNAMKYIAKVYNREIVLRDANRFHRFKDGI 983

Query: 243 CSCGDY 244
           CSCGDY
Sbjct: 984 CSCGDY 989

BLAST of Cla97C02G028040 vs. ExPASy TrEMBL
Match: A0A6J1C2E2 (pentatricopeptide repeat-containing protein At3g12770-like isoform X2 OS=Momordica charantia OX=3673 GN=LOC111007757 PE=3 SV=1)

HSP 1 Score: 283.1 bits (723), Expect = 1.2e-72
Identity = 148/219 (67.58%), Postives = 164/219 (74.89%), Query Frame = 0

Query: 30  VNDQIEEWANLMSSLLNPYWRLDSRRNLKCIGFESELNILYKLLRTRPETPGLYAYIIRI 89
           V    E WA L+S+      RL   RN++     ++      + RT P     Y  +  I
Sbjct: 476 VEPPAEIWAALLSAC-----RL--HRNVELAEISAQ-----NIFRTNPTRVSSYICLSNI 535

Query: 90  YAAIKQWTVAEKVRPTMRKHGLKKPAGYSLIELDSQVHTFFVGDKSHQQTKDIYAKLKDI 149
           YAA KQWT  EKVR TMR+HGLKKPAGYSLIELDSQVH FFVGD+SHQQTKDIYAKLKDI
Sbjct: 536 YAAKKQWTDVEKVRATMRRHGLKKPAGYSLIELDSQVHMFFVGDRSHQQTKDIYAKLKDI 595

Query: 150 NSLLKEDGYKPDTSLVLYDVHEEQKEKILWDHSERLAIAFALLNTSPGTKIRITKNLRVC 209
           +  LKE+G+KPDTS VLYDV EEQKEK+LWDHSERLAIAFALLNT PGTKIRITKNLRVC
Sbjct: 596 SWRLKEEGHKPDTSSVLYDVDEEQKEKMLWDHSERLAIAFALLNTPPGTKIRITKNLRVC 655

Query: 210 VDCHTVAKLISKLMDREIF-----RFHHFIDGLCSCGDY 244
            DCHTV KLISKLMDREI      RFHHFIDG+CSCGD+
Sbjct: 656 ADCHTVTKLISKLMDREIIMRDNHRFHHFIDGICSCGDF 682

BLAST of Cla97C02G028040 vs. ExPASy TrEMBL
Match: A0A6J1C2U5 (pentatricopeptide repeat-containing protein At3g12770-like isoform X1 OS=Momordica charantia OX=3673 GN=LOC111007757 PE=3 SV=1)

HSP 1 Score: 283.1 bits (723), Expect = 1.2e-72
Identity = 148/219 (67.58%), Postives = 164/219 (74.89%), Query Frame = 0

Query: 30  VNDQIEEWANLMSSLLNPYWRLDSRRNLKCIGFESELNILYKLLRTRPETPGLYAYIIRI 89
           V    E WA L+S+      RL   RN++     ++      + RT P     Y  +  I
Sbjct: 512 VEPPAEIWAALLSAC-----RL--HRNVELAEISAQ-----NIFRTNPTRVSSYICLSNI 571

Query: 90  YAAIKQWTVAEKVRPTMRKHGLKKPAGYSLIELDSQVHTFFVGDKSHQQTKDIYAKLKDI 149
           YAA KQWT  EKVR TMR+HGLKKPAGYSLIELDSQVH FFVGD+SHQQTKDIYAKLKDI
Sbjct: 572 YAAKKQWTDVEKVRATMRRHGLKKPAGYSLIELDSQVHMFFVGDRSHQQTKDIYAKLKDI 631

Query: 150 NSLLKEDGYKPDTSLVLYDVHEEQKEKILWDHSERLAIAFALLNTSPGTKIRITKNLRVC 209
           +  LKE+G+KPDTS VLYDV EEQKEK+LWDHSERLAIAFALLNT PGTKIRITKNLRVC
Sbjct: 632 SWRLKEEGHKPDTSSVLYDVDEEQKEKMLWDHSERLAIAFALLNTPPGTKIRITKNLRVC 691

Query: 210 VDCHTVAKLISKLMDREIF-----RFHHFIDGLCSCGDY 244
            DCHTV KLISKLMDREI      RFHHFIDG+CSCGD+
Sbjct: 692 ADCHTVTKLISKLMDREIIMRDNHRFHHFIDGICSCGDF 718

BLAST of Cla97C02G028040 vs. ExPASy TrEMBL
Match: A0A6J1I1A3 (pentatricopeptide repeat-containing protein At3g26782, mitochondrial-like OS=Cucurbita maxima OX=3661 GN=LOC111468131 PE=3 SV=1)

HSP 1 Score: 280.8 bits (717), Expect = 5.8e-72
Identity = 137/177 (77.40%), Postives = 148/177 (83.62%), Query Frame = 0

Query: 72  LLRTRPETPGLYAYIIRIYAAIKQWTVAEKVRPTMRKHGLKKPAGYSLIELDSQVHTFFV 131
           + RT P     Y  +  IYAA+KQWT  EKVR TMR +GLKKPAGYSLIELDSQVH FFV
Sbjct: 543 IFRTNPTRVSSYICLSNIYAAMKQWTAVEKVRTTMRNYGLKKPAGYSLIELDSQVHMFFV 602

Query: 132 GDKSHQQTKDIYAKLKDINSLLKEDGYKPDTSLVLYDVHEEQKEKILWDHSERLAIAFAL 191
           GDKSHQQTKDIYAKLKDI+  LKE G+KPDTS +LYDV+EEQKEK+LWDHSERLAIAFAL
Sbjct: 603 GDKSHQQTKDIYAKLKDISWRLKEKGHKPDTSSILYDVNEEQKEKMLWDHSERLAIAFAL 662

Query: 192 LNTSPGTKIRITKNLRVCVDCHTVAKLISKLMDREI-----FRFHHFIDGLCSCGDY 244
           LNT PGTKIRITKNLRVC DCHTV KLISKLMDREI      RFHHFIDG+CSCGD+
Sbjct: 663 LNTPPGTKIRITKNLRVCADCHTVTKLISKLMDREIVMRDNHRFHHFIDGICSCGDF 719

BLAST of Cla97C02G028040 vs. ExPASy TrEMBL
Match: A0A6J1GMH0 (putative pentatricopeptide repeat-containing protein At3g11460, mitochondrial OS=Cucurbita moschata OX=3662 GN=LOC111455614 PE=3 SV=1)

HSP 1 Score: 276.2 bits (705), Expect = 1.4e-70
Identity = 135/177 (76.27%), Postives = 146/177 (82.49%), Query Frame = 0

Query: 72  LLRTRPETPGLYAYIIRIYAAIKQWTVAEKVRPTMRKHGLKKPAGYSLIELDSQVHTFFV 131
           + RT P     Y  +  IYAA+KQWT  EKVR TMR +GLKKPAGYS IELDSQVH FFV
Sbjct: 543 IFRTNPTRVSSYICLSNIYAAMKQWTAVEKVRTTMRNYGLKKPAGYSSIELDSQVHMFFV 602

Query: 132 GDKSHQQTKDIYAKLKDINSLLKEDGYKPDTSLVLYDVHEEQKEKILWDHSERLAIAFAL 191
           GDKSHQQTKDIYAKLKDI+  LKE G+KPDTS +LYDV+EEQKEK+LWDHSERLAIAFAL
Sbjct: 603 GDKSHQQTKDIYAKLKDISWRLKEKGHKPDTSSILYDVNEEQKEKMLWDHSERLAIAFAL 662

Query: 192 LNTSPGTKIRITKNLRVCVDCHTVAKLISKLMDREI-----FRFHHFIDGLCSCGDY 244
           LNT PGTKIRITKNLRVC DCHTV KLISKL DREI      RFHHFIDG+CSCGD+
Sbjct: 663 LNTPPGTKIRITKNLRVCADCHTVTKLISKLTDREIVMRDNHRFHHFIDGICSCGDF 719

BLAST of Cla97C02G028040 vs. ExPASy TrEMBL
Match: A0A2P6S2G8 (Putative tetratricopeptide-like helical domain, DYW domain-containing protein OS=Rosa chinensis OX=74649 GN=RchiOBHm_Chr2g0160301 PE=3 SV=1)

HSP 1 Score: 241.1 bits (614), Expect = 5.1e-60
Identity = 117/178 (65.73%), Postives = 132/178 (74.16%), Query Frame = 0

Query: 71  KLLRTRPETPGLYAYIIRIYAAIKQWTVAEKVRPTMRKHGLKKPAGYSLIELDSQVHTFF 130
           K+    PE  G Y  +  IYAA K+W   E+VR  +RK GLKKP G + +ELD  VH F 
Sbjct: 531 KIFEMNPEGVGSYICLSNIYAAEKRWDDVERVRAMVRKKGLKKPPGCTFVELDKMVHRFL 590

Query: 131 VGDKSHQQTKDIYAKLKDINSLLKEDGYKPDTSLVLYDVHEEQKEKILWDHSERLAIAFA 190
           VGDKSH QT+DIYAKLKD+N  L+E GYKPDT+ VLYDV EE KEK+LWDHSERLAIAFA
Sbjct: 591 VGDKSHPQTEDIYAKLKDLNLRLREVGYKPDTTSVLYDVEEEMKEKMLWDHSERLAIAFA 650

Query: 191 LLNTSPGTKIRITKNLRVCVDCHTVAKLISKLMDREIF-----RFHHFIDGLCSCGDY 244
           L+NT PGT IRITKNLR+CVDCHTV K+ISK   REI      RFHHF DG CSCGDY
Sbjct: 651 LINTRPGTTIRITKNLRICVDCHTVTKMISKFTAREIIMRDNHRFHHFRDGFCSCGDY 708

BLAST of Cla97C02G028040 vs. TAIR 10
Match: AT4G30700.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 183.7 bits (465), Expect = 1.9e-46
Identity = 90/178 (50.56%), Postives = 119/178 (66.85%), Query Frame = 0

Query: 71  KLLRTRPETPGLYAYIIRIYAAIKQWTVAEKVRPTMRKHGLKKPAGYSLIELDSQVHTFF 130
           KL    P+  G +  +  I++A + +  A  VR T +K  L K  GY+LIE+    H F 
Sbjct: 614 KLFELDPDNVGYHVLLSNIHSADRNYPQAATVRQTAKKRKLAKAPGYTLIEIGETPHVFT 673

Query: 131 VGDKSHQQTKDIYAKLKDINSLLKEDGYKPDTSLVLYDVHEEQKEKILWDHSERLAIAFA 190
            GD+SH Q K+IY KL+ +   ++E GY+P+T L L+DV EE++E ++  HSERLAIAF 
Sbjct: 674 SGDQSHPQVKEIYEKLEKLEGKMREAGYQPETELALHDVEEEERELMVKVHSERLAIAFG 733

Query: 191 LLNTSPGTKIRITKNLRVCVDCHTVAKLISKLMDREIF-----RFHHFIDGLCSCGDY 244
           L+ T PGT+IRI KNLRVC+DCHTV KLISK+ +R I      RFHHF DG+CSCGDY
Sbjct: 734 LIATEPGTEIRIIKNLRVCLDCHTVTKLISKITERVIVVRDANRFHHFKDGVCSCGDY 791

BLAST of Cla97C02G028040 vs. TAIR 10
Match: AT1G08070.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 181.4 bits (459), Expect = 9.3e-46
Identity = 88/177 (49.72%), Postives = 115/177 (64.97%), Query Frame = 0

Query: 72  LLRTRPETPGLYAYIIRIYAAIKQWTVAEKVRPTMRKHGLKKPAGYSLIELDSQVHTFFV 131
           L++  PE PG Y  +  IYA+  +W    K R  +   G+KK  G S IE+DS VH F +
Sbjct: 564 LIKIEPENPGSYVLLSNIYASAGRWNEVAKTRALLNDKGMKKVPGCSSIEIDSVVHEFII 623

Query: 132 GDKSHQQTKDIYAKLKDINSLLKEDGYKPDTSLVLYDVHEEQKEKILWDHSERLAIAFAL 191
           GDK H + ++IY  L+++  LL++ G+ PDTS VL ++ EE KE  L  HSE+LAIAF L
Sbjct: 624 GDKFHPRNREIYGMLEEMEVLLEKAGFVPDTSEVLQEMEEEWKEGALRHHSEKLAIAFGL 683

Query: 192 LNTSPGTKIRITKNLRVCVDCHTVAKLISKLMDREIF-----RFHHFIDGLCSCGDY 244
           ++T PGTK+ I KNLRVC +CH   KLISK+  REI      RFHHF DG+CSC DY
Sbjct: 684 ISTKPGTKLTIVKNLRVCRNCHEATKLISKIYKREIIARDRTRFHHFRDGVCSCNDY 740

BLAST of Cla97C02G028040 vs. TAIR 10
Match: AT1G20230.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 178.3 bits (451), Expect = 7.9e-45
Identity = 82/178 (46.07%), Postives = 113/178 (63.48%), Query Frame = 0

Query: 71  KLLRTRPETPGLYAYIIRIYAAIKQWTVAEKVRPTMRKHGLKKPAGYSLIELDSQVHTFF 130
           KL    PE PG Y  +  IYAA   WT  + +R  M   GLKK  G S I++ ++V+T  
Sbjct: 582 KLFHLEPENPGTYVLLSNIYAAKGMWTEVDSIRNKMESLGLKKNPGCSWIQVKNRVYTLL 641

Query: 131 VGDKSHQQTKDIYAKLKDINSLLKEDGYKPDTSLVLYDVHEEQKEKILWDHSERLAIAFA 190
            GDKSH Q   I  K+ +I+  +++ G++P+    L+DV E+++E++LW HSE+LA+ F 
Sbjct: 642 AGDKSHPQIDQITEKMDEISKEMRKSGHRPNLDFALHDVEEQEQEQMLWGHSEKLAVVFG 701

Query: 191 LLNTSPGTKIRITKNLRVCVDCHTVAKLISKLMDREIF-----RFHHFIDGLCSCGDY 244
           LLNT  GT +++ KNLR+C DCH V K IS    REIF     RFHHF DG+CSCGD+
Sbjct: 702 LLNTPDGTPLQVIKNLRICGDCHAVIKFISSYAGREIFIRDTNRFHHFKDGICSCGDF 759

BLAST of Cla97C02G028040 vs. TAIR 10
Match: AT3G26782.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 178.3 bits (451), Expect = 7.9e-45
Identity = 87/168 (51.79%), Postives = 111/168 (66.07%), Query Frame = 0

Query: 81  GLYAYIIRIYAAIKQWTVAEKVRPTMRKHGLKKPAGYSLIELDSQVHTFFVGDKSHQQTK 140
           G Y  +  IYA   +W   E+VR  M+  GL KP G+SL+EL+ +VH F +GD+ H Q +
Sbjct: 491 GYYMLLSHIYADAGRWKDVERVRMIMKNRGLVKPPGFSLLELNGEVHVFLIGDEEHPQRE 550

Query: 141 DIYAKLKDINSLLKEDGYKPDTSLVLYDVHEEQKEKILWDHSERLAIAFALLNTSPGTKI 200
            IY  L ++N  L E GY  +TS V +DV EE+KE  L  HSE+LAIAF ++NT PG+ +
Sbjct: 551 KIYEFLAELNRKLLEAGYVSNTSSVCHDVDEEEKEMTLRVHSEKLAIAFGIMNTVPGSTV 610

Query: 201 RITKNLRVCVDCHTVAKLISKLMDREIF-----RFHHFIDGLCSCGDY 244
            + KNLRVC DCH V KLISK++DRE       RFHHF DG CSCGDY
Sbjct: 611 NVVKNLRVCSDCHNVIKLISKIVDREFVVRDAKRFHHFKDGGCSCGDY 658

BLAST of Cla97C02G028040 vs. TAIR 10
Match: AT4G33170.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 175.6 bits (444), Expect = 5.1e-44
Identity = 85/186 (45.70%), Postives = 119/186 (63.98%), Query Frame = 0

Query: 63  ESELNILYKLLRTRPETPGLYAYIIRIYAAIKQWTVAEKVRPTMRKHGLKKPAGYSLIEL 122
           E+   +  KLL   P     Y  +  +YAA  +W   +  R  M+ H +KK  G+S IE+
Sbjct: 804 ETGKRVATKLLELEPLDSSAYVLLSNMYAAASKWDEMKLARTMMKGHKVKKDPGFSWIEV 863

Query: 123 DSQVHTFFVGDKSHQQTKDIYAKLKDINSLLKEDGYKPDTSLVLYDVHEEQKEKILWDHS 182
            +++H F V D+S++QT+ IY K+KD+   +K++GY P+T   L DV EE+KE+ L+ HS
Sbjct: 864 KNKIHIFVVDDRSNRQTELIYRKVKDMIRDIKQEGYVPETDFTLVDVEEEEKERALYYHS 923

Query: 183 ERLAIAFALLNTSPGTKIRITKNLRVCVDCHTVAKLISKLMDREIF-----RFHHFIDGL 242
           E+LA+AF LL+T P T IR+ KNLRVC DCH   K I+K+ +REI      RFH F DG+
Sbjct: 924 EKLAVAFGLLSTPPSTPIRVIKNLRVCGDCHNAMKYIAKVYNREIVLRDANRFHRFKDGI 983

Query: 243 CSCGDY 244
           CSCGDY
Sbjct: 984 CSCGDY 989

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038887372.11.1e-7266.67pentatricopeptide repeat-containing protein At3g26782, mitochondrial-like [Benin... [more]
XP_022135919.12.4e-7267.58pentatricopeptide repeat-containing protein At3g12770-like isoform X1 [Momordica... [more]
XP_022135920.12.4e-7267.58pentatricopeptide repeat-containing protein At3g12770-like isoform X2 [Momordica... [more]
XP_022968999.11.2e-7177.40pentatricopeptide repeat-containing protein At3g26782, mitochondrial-like [Cucur... [more]
XP_023554404.12.1e-7177.40putative pentatricopeptide repeat-containing protein At3g11460, mitochondrial [C... [more]
Match NameE-valueIdentityDescription
Q9SUH62.6e-4550.56Pentatricopeptide repeat-containing protein At4g30700 OS=Arabidopsis thaliana OX... [more]
Q9LN011.3e-4449.72Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidop... [more]
Q9LW321.1e-4351.79Pentatricopeptide repeat-containing protein At3g26782, mitochondrial OS=Arabidop... [more]
Q9LNU61.1e-4346.07Pentatricopeptide repeat-containing protein At1g20230 OS=Arabidopsis thaliana OX... [more]
Q9SMZ27.2e-4345.70Pentatricopeptide repeat-containing protein At4g33170 OS=Arabidopsis thaliana OX... [more]
Match NameE-valueIdentityDescription
A0A6J1C2E21.2e-7267.58pentatricopeptide repeat-containing protein At3g12770-like isoform X2 OS=Momordi... [more]
A0A6J1C2U51.2e-7267.58pentatricopeptide repeat-containing protein At3g12770-like isoform X1 OS=Momordi... [more]
A0A6J1I1A35.8e-7277.40pentatricopeptide repeat-containing protein At3g26782, mitochondrial-like OS=Cuc... [more]
A0A6J1GMH01.4e-7076.27putative pentatricopeptide repeat-containing protein At3g11460, mitochondrial OS... [more]
A0A2P6S2G85.1e-6065.73Putative tetratricopeptide-like helical domain, DYW domain-containing protein OS... [more]
Match NameE-valueIdentityDescription
AT4G30700.11.9e-4650.56Pentatricopeptide repeat (PPR) superfamily protein [more]
AT1G08070.19.3e-4649.72Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT1G20230.17.9e-4546.07Pentatricopeptide repeat (PPR) superfamily protein [more]
AT3G26782.17.9e-4551.79Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT4G33170.15.1e-4445.70Tetratricopeptide repeat (TPR)-like superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Watermelon (97103) v2.5
Date Performed: 2022-01-31
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR032867DYW domainPFAMPF14432DYW_deaminasecoord: 116..233
e-value: 2.7E-35
score: 121.0
NoneNo IPR availablePANTHERPTHR47924:SF6PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEINcoord: 70..238
NoneNo IPR availablePANTHERPTHR47924PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEINcoord: 70..238

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cla97C02G028040.2Cla97C02G028040.2mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0008270 zinc ion binding