Cp4.1LG17g02740 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
Name: Cp4.1LG17g02740
Type: gene
Organism: Cucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
Description: Pentatricopeptide repeat-containing protein
Location: Cp4.1LG17: 2077328 .. 2080796 (+)
RNA-Seq Expression: Cp4.1LG17g02740
Synteny: Cp4.1LG17g02740
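
The Location field packs the sequence ID, start, end, and strand into one string. The sketch below shows one way to split it and compute the gene span. The function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive (as in GFF-style annotation), which this record does not state explicitly.

```python
import re

def parse_location(text):
    """Split a 'seqid: start .. end (strand)' string into its parts."""
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand

def span_length(start, end):
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return end - start + 1

seqid, start, end, strand = parse_location("Cp4.1LG17: 2077328 .. 2080796 (+)")
print(seqid, strand, span_length(start, end))  # Cp4.1LG17 + 3469
```

Under that assumption the gene spans 3469 bp on the plus strand of Cp4.1LG17.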
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCTTCTTCTTCGATTGAATGGAACTGGAAGTAACAGAATGAAAAATCTTCATGTTCTTTTCAAGCCAAGGATTGCCTTCTTCAATTCAACGTCTTCTTCATCATCACCTCAGATTTCATCTCAGGAAACCCATTTCATCGATCTAATACATGCTTCCGATTCGACCCACAAGCTTCGTCAGATCCATGGTCAACTCTACCGCTGTAACATCTTCTCCAGCAGCCGGGTCGTGACCCAGTTCATCTCTTCGTGTTCTTCGTTAAATTCTGTCGACTATGCGGTTTTGATCTTCCAGCGGTTCGAGTTGAAGAATAGTTTCCTCTTCAATGCATTGATTCGAGGACTCGCTGAAAATTCCAGGTTCGAGAGCTCAATTGCTTACTTTGTTTGCATGCTGAGGTGGGAAATTAGCCCTGATAGGCTTACTTTTCCGTTTGTGCTCAAATCAGCGGCGGCTCTTTCCAATGGTGGCGTTGGGAGTGCTTTACATTCTGGGATTGTGAAATTTGGACTTGAATTTGATTCTTTTGTGAGGGTTTCGTTGGTGGACATGTACGTGAAAGTTGACGATTTGGGTTCTGCCTTGAAGGTGTTTGATGAAAGTCCTGACAGAATTAAGAAGGAAAATGTGTTGATTTGGAATGTTCTTATTCATGGGTATTGTAGAGTGGGGAATTTGGTAAAAGCTACGGAGCTATTCGAGACAATGCCTAAGAAGGATACAGGTTCTTGGAATAGTTTGATTAATGGCTTCATGAGAAAGGGGCAGTTGGGTCCAGCAAACGAACTGTTTGAGAAAATGCCTGAAAAGAATGTGGTTTCTTGGACTACAATGGTGAATGGATTTTCACAGAATGGAGACCCTGAAAAGGCACTGCAATTTTTCTTTTGTATGCTCGAAGAAGGCGCACGGCCGAACGATTACACAATTGTCTCTGCACTTTCAGCTTGTGCAAAACTTGGTGCCTTAGATGCTGGTTTAAGGATCCATAGATACCTTTCAAGCCATGGTTTCAAATTGAATCAAACGATTGGAACTGCACTTGTGGATATGTATGCAAAATGTGGAAACATTGAGTCTGCAGGAGAGGTGTTTCGTGAAATAAAACAAAAGGGTCTTCTTACTTGGAGTGTTATGATCTGGGGCTGGGCTATCCATGGACATTTTAAGAAATCTATACAATACTTTGAATGGATGAAGTCTACAGGTTTAACTTCATTTTGCGTTTGTTATTCTCTGAAGTTTATTCTTTGTTGTCCATAACTCAAAACTCGGTATGTTTGCAGGAACAAAGCCAGATGGGGTGGTGTTTCTAGCTGTTCTTACTGCTTGCTCACATTCTGGACAAGTAGACGATGGACTCGAGTTTTTCGACAGTATGAGGCGAGACTACTTGATTGAGCCTTCTATGAAGCATTACACTCTGATTGTAGACATGCTAGGAAGGGCTGGTAGACTAGATGAAGCTCTAAAGTTCCTAAGAGACATGCCTATCAATCCGGATTTTGTGGTCTGGGGTGCTCTATTTTGTGCTTGCAGGGCTCATAAGAACATTAAAATGGCCGAATTAGCATCCGAAAAGCTTCTTGAACTTGAACCGAAGCATCCGGGGAGTTACGTATTTTTGTCGAATGCATATGCTGCTGTAGGAAGATGGGAAGATGCAGAGAGAGTGAGAGTTTCAATGCGAGATCGAGGTGCACAAAAGGATCCAGGATGGAGCTTCATGGAAGTGGATGATAAGTTACATAGATTTGTGGCTGGTGATAATACTCATAACCGTGCCCAAGAGATATACTCGAAATTAGATGAGATAAATGCAGGTGCCAGGGAAAAAGGATACACAAAAGGAATTGAGTGTGTTCTTCATAACATTGAAGAGGAAGAGAAGGAAGAAGCACTGGGACATCACAGCGAGAAGTTGGCGCTTGCTTTCGGGCTCGTTAGTACAGCCCCGGAAACGACGATTAGGATAGTGAAAAACCTTAGAGTTTGTG
TGGATTGTCATTCATTCATGAAATATGCAAGTAAAATGAGCCAGAGAGAGATCATTCTGCGGGATATGAAACGATTTCATCATTTTCATGATGGGGTTTGTTCATGTGGAGATTATTGGTAAAAGATTGTTGATCAGAGAAGGTGGAACTACTGATGCTTTTGTTACTGAGGTTGCCACTTATAGCTAAGCCTACTCCCATACTTCTCGAATCTCCGTTCTCGCAAAATTCTCGACATCAAACCATTTTCGAGATATTCGGCTCCGTTTTTAACTCTCACCAGGCTTTGTTGGATGAAAAGAAAAGTTGAAAAGTCCCACATCGGCTAATTCCTTAGGGAATGTTCATGGGTTTTCAAGGAATACTCCCTCCATTGGTATGAGACCTCTTGGGGAAAAGTCCCGCATCTGCCAATTTAGGGAATGTTCATGGGTTTTCAAGGAATACTCTCTCCATTGGTATGAAGCATTTTGGGGAAGCCCAAAGCAAAGCCACAAGAGCTTATGCTCAAAGTAGACAATATCATACCATTGAGCAAAGCCACGAGAGCTTATTCTCAAAGTCATACCATTGTGGAGAGTCGTGTTTGTCTAACACACTCATCGATTAATTTTGATCAACTCCCTTGTAGGTTTGTTCTTTTCCTCTTGATGATCGTTGGTTATATTCGTTAGTTCTTCTCTTAATTCCATGGTCTACTTGCCAATCTGCTAGAGTTTCTTTTTCATTAATTCGTTGATTGAACGTTTGAATTCGAGTCGTTGAGTTTAATTTCGTGTTCTTGGTTTCTTTTTCTTGTTGTTTGTTGATCGGGAGAGTGGTAAAAGTTCGTTAACTTAAATTGTTAAGGCTATTTTGTTTGTCTCGATTATGTTTATGGCATGAACAGTGGGAGTTTTGCCTTAGATTTTCATAATTCTGTGGTTGATTGTTTTAAAATTAAAGGAGTCTGACTTTCGTGGATTGTAGTTTCAGGACTGATGGCTAAGAGCTCTACTTCAAGCAAGAGCATGATGATCTTGGTAAGGAAGCAATCAGTTATTGATGATGTTCTTTGATTTCTTCCCCCAAAACCTAGAAAAATCCTAACCCCGTCGCGAATTGGTGGTTTCCAGCCATGTGAAACGCCGTCGGAGGCGGACATTGATGATGGTTTCCGTCGAGGTTAAATGGCGACGTCGGCGGCAACAGAGCCTTACCGGTTCGGACCGCCGTCGATTTCTCGAATTTCTCCGCTACTTCGGCTCTCAAACCCATCAATGGCAATCGAAACCCCCCTCGATTCTTCCAATCTCATCCCTTCCATAGACTCCTCTCATCCGGATGACCGCAACAAAAGAGCTTACCGGAGGGAGAGAAGAAGAATGGGATATTAGGATACCAGGTGTTAGCTCTGATACCAATTGTTAGGATCGCTCAACAACGCTTACACTCAATCAAGATGAATCCAACAAATCGGAGAGAGAAAA

mRNA sequence

ATGCTTCTTCTTCGATTGAATGGAACTGGAAGTAACAGAATGAAAAATCTTCATGTTCTTTTCAAGCCAAGGATTGCCTTCTTCAATTCAACGTCTTCTTCATCATCACCTCAGATTTCATCTCAGGAAACCCATTTCATCGATCTAATACATGCTTCCGATTCGACCCACAAGCTTCGTCAGATCCATGGTCAACTCTACCGCTGTAACATCTTCTCCAGCAGCCGGGTCGTGACCCAGTTCATCTCTTCGTGTTCTTCGTTAAATTCTGTCGACTATGCGGTTTTGATCTTCCAGCGGTTCGAGTTGAAGAATAGTTTCCTCTTCAATGCATTGATTCGAGGACTCGCTGAAAATTCCAGGTTCGAGAGCTCAATTGCTTACTTTGTTTGCATGCTGAGGTGGGAAATTAGCCCTGATAGGCTTACTTTTCCGTTTGTGCTCAAATCAGCGGCGGCTCTTTCCAATGGTGGCGTTGGGAGTGCTTTACATTCTGGGATTGTGAAATTTGGACTTGAATTTGATTCTTTTGTGAGGGTTTCGTTGGTGGACATGTACGTGAAAGTTGACGATTTGGGTTCTGCCTTGAAGGTGTTTGATGAAAGTCCTGACAGAATTAAGAAGGAAAATGTGTTGATTTGGAATGTTCTTATTCATGGGTATTGTAGAGTGGGGAATTTGGTAAAAGCTACGGAGCTATTCGAGACAATGCCTAAGAAGGATACAGGTTCTTGGAATAGTTTGATTAATGGCTTCATGAGAAAGGGGCAGTTGGGTCCAGCAAACGAACTGTTTGAGAAAATGCCTGAAAAGAATGTGGTTTCTTGGACTACAATGGTGAATGGATTTTCACAGAATGGAGACCCTGAAAAGGCACTGCAATTTTTCTTTTGTATGCTCGAAGAAGGCGCACGGCCGAACGATTACACAATTGTCTCTGCACTTTCAGCTTGTGCAAAACTTGGACTGATGGCTAAGAGCTCTACTTCAAGCAAGAGCATGATGATCTTGGTAAGGAAGCAATCAGTTATTGATGATGTTCTTTGATTTCTTCCCCCAAAACCTAGAAAAATCCTAACCCCGTCGCGAATTGGTGGTTTCCAGCCATGTGAAACGCCGTCGGAGGCGGACATTGATGATGGTTTCCGTCGAGGTTAAATGGCGACGTCGGCGGCAACAGAGCCTTACCGGTTCGGACCGCCGTCGATTTCTCGAATTTCTCCGCTACTTCGGCTCTCAAACCCATCAATGGCAATCGAAACCCCCCTCGATTCTTCCAATCTCATCCCTTCCATAGACTCCTCTCATCCGGATGACCGCAACAAAAGAGCTTACCGGAGGGAGAGAAGAAGAATGGGATATTAGGATACCAGGTGTTAGCTCTGATACCAATTGTTAGGATCGCTCAACAACGCTTACACTCAATCAAGATGAATCCAACAAATCGGAGAGAGAAAA

Coding sequence (CDS)

ATGCTTCTTCTTCGATTGAATGGAACTGGAAGTAACAGAATGAAAAATCTTCATGTTCTTTTCAAGCCAAGGATTGCCTTCTTCAATTCAACGTCTTCTTCATCATCACCTCAGATTTCATCTCAGGAAACCCATTTCATCGATCTAATACATGCTTCCGATTCGACCCACAAGCTTCGTCAGATCCATGGTCAACTCTACCGCTGTAACATCTTCTCCAGCAGCCGGGTCGTGACCCAGTTCATCTCTTCGTGTTCTTCGTTAAATTCTGTCGACTATGCGGTTTTGATCTTCCAGCGGTTCGAGTTGAAGAATAGTTTCCTCTTCAATGCATTGATTCGAGGACTCGCTGAAAATTCCAGGTTCGAGAGCTCAATTGCTTACTTTGTTTGCATGCTGAGGTGGGAAATTAGCCCTGATAGGCTTACTTTTCCGTTTGTGCTCAAATCAGCGGCGGCTCTTTCCAATGGTGGCGTTGGGAGTGCTTTACATTCTGGGATTGTGAAATTTGGACTTGAATTTGATTCTTTTGTGAGGGTTTCGTTGGTGGACATGTACGTGAAAGTTGACGATTTGGGTTCTGCCTTGAAGGTGTTTGATGAAAGTCCTGACAGAATTAAGAAGGAAAATGTGTTGATTTGGAATGTTCTTATTCATGGGTATTGTAGAGTGGGGAATTTGGTAAAAGCTACGGAGCTATTCGAGACAATGCCTAAGAAGGATACAGGTTCTTGGAATAGTTTGATTAATGGCTTCATGAGAAAGGGGCAGTTGGGTCCAGCAAACGAACTGTTTGAGAAAATGCCTGAAAAGAATGTGGTTTCTTGGACTACAATGGTGAATGGATTTTCACAGAATGGAGACCCTGAAAAGGCACTGCAATTTTTCTTTTGTATGCTCGAAGAAGGCGCACGGCCGAACGATTACACAATTGTCTCTGCACTTTCAGCTTGTGCAAAACTTGGACTGATGGCTAAGAGCTCTACTTCAAGCAAGAGCATGATGATCTTGGTAAGGAAGCAATCAGTTATTGATGATGTTCTTTGA

Protein sequence

MLLLRLNGTGSNRMKNLHVLFKPRIAFFNSTSSSSSPQISSQETHFIDLIHASDSTHKLRQIHGQLYRCNIFSSSRVVTQFISSCSSLNSVDYAVLIFQRFELKNSFLFNALIRGLAENSRFESSIAYFVCMLRWEISPDRLTFPFVLKSAAALSNGGVGSALHSGIVKFGLEFDSFVRVSLVDMYVKVDDLGSALKVFDESPDRIKKENVLIWNVLIHGYCRVGNLVKATELFETMPKKDTGSWNSLINGFMRKGQLGPANELFEKMPEKNVVSWTTMVNGFSQNGDPEKALQFFFCMLEEGARPNDYTIVSALSACAKLGLMAKSSTSSKSMMILVRKQSVIDDVL
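
The protein sequence above corresponds to the CDS read three bases at a time. As a minimal sketch, assuming the standard nuclear genetic code applies, the following translates a coding sequence and stops at the first stop codon. The names `CODON_TABLE` and `translate` are illustrative, and only the first and last few codons of the CDS are used as input here.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon ends the polypeptide
            break
        protein.append(amino_acid)
    return "".join(protein)

# First ten codons and last five codons of the CDS listed above.
print(translate("ATGCTTCTTCTTCGATTGAATGGAACTGGA"))  # MLLLRLNGTG
print(translate("GATGATGTTCTTTGA"))                 # DDVL
```

The two outputs match the start (MLLLRLNGTG) and end (DDVL) of the protein sequence in this record.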
Homology
BLAST of Cp4.1LG17g02740 vs. ExPASy Swiss-Prot
Match: Q9MAT2 (Pentatricopeptide repeat-containing protein At1g04840 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H64 PE=2 SV=1)

HSP 1 Score: 323.2 bits (827), Expect = 3.7e-87
Identity = 167/314 (53.18%), Positives = 222/314 (70.70%), Query Frame = 0

Query: 14  MKNLHVLFKPRIAFFNSTSSSSSP---QISSQETHFIDLIHASDSTHKLRQIHGQLYRCN 73
           MK+L V+FKP+    +S +    P   Q S  E+HFI LIHA   T  LR +H Q+ R  
Sbjct: 1   MKSLSVIFKPK----SSPAKIYFPADRQASPDESHFISLIHACKDTASLRHVHAQILRRG 60

Query: 74  IFSSSRVVTQFISSCSSLNSVDYAVLIFQRFELKNSFLFNALIRGLAENSRFESSIAYFV 133
           +  SSRV  Q +S  S L S DY++ IF+  E +N F+ NALIRGL EN+RFESS+ +F+
Sbjct: 61  VL-SSRVAAQLVSCSSLLKSPDYSLSIFRNSEERNPFVLNALIRGLTENARFESSVRHFI 120

Query: 134 CMLRWEISPDRLTFPFVLKSAAALSNGGVGSALHSGIVKFGLEFDSFVRVSLVDMYVKVD 193
            MLR  + PDRLTFPFVLKS + L    +G ALH+  +K  ++ DSFVR+SLVDMY K  
Sbjct: 121 LMLRLGVKPDRLTFPFVLKSNSKLGFRWLGRALHAATLKNFVDCDSFVRLSLVDMYAKTG 180

Query: 194 DLGSALKVFDESPDRIKKENVLIWNVLIHGYCRVGNLVKATELFETMPKKDTGSWNSLIN 253
            L  A +VF+ESPDRIKKE++LIWNVLI+GYCR  ++  AT LF +MP++++GSW++LI 
Sbjct: 181 QLKHAFQVFEESPDRIKKESILIWNVLINGYCRAKDMHMATTLFRSMPERNSGSWSTLIK 240

Query: 254 GFMRKGQLGPANELFEKMPEKNVVSWTTMVNGFSQNGDPEKALQFFFCMLEEGARPNDYT 313
           G++  G+L  A +LFE MPEKNVVSWTT++NGFSQ GD E A+  +F MLE+G +PN+YT
Sbjct: 241 GYVDSGELNRAKQLFELMPEKNVVSWTTLINGFSQTGDYETAISTYFEMLEKGLKPNEYT 300

Query: 314 IVSALSACAKLGLM 325
           I + LSAC+K G +
Sbjct: 301 IAAVLSACSKSGAL 309
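
The percentages in each HSP summary line are the matched count divided by the alignment length. The sketch below re-derives them from the counts as a consistency check. The helper name `check_hsp_stats` is hypothetical, and the regular expression is an assumption about this page's line layout rather than a general BLAST parser.

```python
import re

# Matches "Identity = 167/314 (53.18%)" and the equivalent positives field.
HSP_STATS = re.compile(r"(Identity|Pos\w*tives) = (\d+)/(\d+) \((\d+\.\d+)%\)")

def check_hsp_stats(line):
    """Return {field: (recomputed_percent, reported_percent)} for one line."""
    results = {}
    for label, matched, length, reported in HSP_STATS.findall(line):
        key = "identity" if label == "Identity" else "positives"
        recomputed = round(100 * int(matched) / int(length), 2)
        results[key] = (recomputed, float(reported))
    return results

line = "Identity = 167/314 (53.18%), Positives = 222/314 (70.70%), Query Frame = 0"
for field, (recomputed, reported) in check_hsp_stats(line).items():
    print(field, recomputed, reported)
```

For the first Swiss-Prot hit, 167/314 and 222/314 reproduce the reported 53.18% identity and 70.70% positives.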

BLAST of Cp4.1LG17g02740 vs. ExPASy Swiss-Prot
Match: Q9FHR3 (Putative pentatricopeptide repeat-containing protein At5g37570 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E37 PE=3 SV=1)

HSP 1 Score: 170.6 bits (431), Expect = 3.1e-41
Identity = 99/321 (30.84%), Positives = 172/321 (53.58%), Query Frame = 0

Query: 35  SSPQISSQETHFIDLIHASDSTHKLRQIHGQLYRCNIFSSSRVVTQFI-SSCSSLNSVDY 94
           S P + S ET    L     S   L QIH ++ R  +     +++ FI SS SS +S+ Y
Sbjct: 6   SHPSLLSLET----LFKLCKSEIHLNQIHARIIRKGLEQDQNLISIFISSSSSSSSSLSY 65

Query: 95  AVLIFQRFELKNSFLFNALIRGLAENSRFESSIAYFVCMLRWEIS-PDRLTFPFVLKSAA 154
           +  +F+R     ++L+N LI+G +    F  +++  + M+R  ++ PD  TFP V+K  +
Sbjct: 66  SSSVFERVPSPGTYLWNHLIKGYSNKFLFFETVSILMRMMRTGLARPDEYTFPLVMKVCS 125

Query: 155 ALSNGGVGSALHSGIVKFGLEFDSFVRVSLVDMYVKVDDLGSALKVFDESPDRIKKENVL 214
                 VGS++H  +++ G + D  V  S VD Y K  DL SA KVF E P+R    N +
Sbjct: 126 NNGQVRVGSSVHGLVLRIGFDKDVVVGTSFVDFYGKCKDLFSARKVFGEMPER----NAV 185

Query: 215 IWNVLIHGYCRVGNLVKATELFETMPKKDTGSWNSLINGFMRKGQLGPANELFEKMPEKN 274
            W  L+  Y + G L +A  +F+ MP+++ GSWN+L++G ++ G L  A +LF++MP+++
Sbjct: 186 SWTALVVAYVKSGELEEAKSMFDLMPERNLGSWNALVDGLVKSGDLVNAKKLFDEMPKRD 245

Query: 275 VVSWTTMVN-------------------------------GFSQNGDPEKALQFFFCMLE 323
           ++S+T+M++                               G++QNG P +A + F  M  
Sbjct: 246 IISYTSMIDGYAKGGDMVSARDLFEEARGVDVRAWSALILGYAQNGQPNEAFKVFSEMCA 305

BLAST of Cp4.1LG17g02740 vs. ExPASy Swiss-Prot
Match: Q56X05 (Pentatricopeptide repeat-containing protein At1g06143 OS=Arabidopsis thaliana OX=3702 GN=EMB1444 PE=2 SV=2)

HSP 1 Score: 166.4 bits (420), Expect = 5.8e-40
Identity = 99/315 (31.43%), Positives = 167/315 (53.02%), Query Frame = 0

Query: 12  NRMKNLHVLFKP--RIAFFNSTSSSSSPQISSQETHFIDLIHASDSTHKLRQIHGQLYRC 71
           N   N+H L  P   +  F+++ S + P +         +I    +   L      + + 
Sbjct: 2   NAFANVHSLRVPSHHLRDFSASLSLAPPNLKK-------IIKQCSTPKLLESALAAMIKT 61

Query: 72  NIFSSSRVVTQFISSCSSLNSVDYAVLIFQRFELKNSFLFNALIRGLAENSRFESSIAYF 131
           ++    R++ QFI++C+S   +D AV    + +  N F++NAL +G    S    S+  +
Sbjct: 62  SLNQDCRLMNQFITACTSFKRLDLAVSTMTQMQEPNVFVYNALFKGFVTCSHPIRSLELY 121

Query: 132 VCMLRWEISPDRLTFPFVLKSAAALSNGGVGSALHSGIVKFGLEFDSFVRVSLVDMYVKV 191
           V MLR  +SP   T+  ++K+++  S    G +L + I KFG  F   ++ +L+D Y   
Sbjct: 122 VRMLRDSVSPSSYTYSSLVKASSFASR--FGESLQAHIWKFGFGFHVKIQTTLIDFYSAT 181

Query: 192 DDLGSALKVFDESPDRIKKENVLIWNVLIHGYCRVGNLVKATELFETMPKKDTGSWNSLI 251
             +  A KVFDE P+R    + + W  ++  Y RV ++  A  L   M +K+  + N LI
Sbjct: 182 GRIREARKVFDEMPER----DDIAWTTMVSAYRRVLDMDSANSLANQMSEKNEATSNCLI 241

Query: 252 NGFMRKGQLGPANELFEKMPEKNVVSWTTMVNGFSQNGDPEKALQFFFCMLEEGARPNDY 311
           NG+M  G L  A  LF +MP K+++SWTTM+ G+SQN    +A+  F+ M+EEG  P++ 
Sbjct: 242 NGYMGLGNLEQAESLFNQMPVKDIISWTTMIKGYSQNKRYREAIAVFYKMMEEGIIPDEV 301

Query: 312 TIVSALSACAKLGLM 325
           T+ + +SACA LG++
Sbjct: 302 TMSTVISACAHLGVL 303

BLAST of Cp4.1LG17g02740 vs. ExPASy Swiss-Prot
Match: Q9LN01 (Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H12 PE=2 SV=1)

HSP 1 Score: 160.6 bits (405), Expect = 3.2e-38
Identity = 110/365 (30.14%), Positives = 171/365 (46.85%), Query Frame = 0

Query: 28  FNSTSSSSSPQISSQETH-FIDLIHASDSTHKLRQIHGQLYRCNIFSSSRVVTQFISSC- 87
           F+   SSS P   S   H  + L+H   +   LR IH Q+ +  + +++  +++ I  C 
Sbjct: 17  FHFLPSSSDPPYDSIRNHPSLSLLHNCKTLQSLRIIHAQMIKIGLHNTNYALSKLIEFCI 76

Query: 88  --SSLNSVDYAVLIFQRFELKNSFLFNALIRGLAENSRFESSIAYFVCMLRWEISPDRLT 147
                  + YA+ +F+  +  N  ++N + RG A +S   S++  +VCM+   + P+  T
Sbjct: 77  LSPHFEGLPYAISVFKTIQEPNLLIWNTMFRGHALSSDPVSALKLYVCMISLGLLPNSYT 136

Query: 148 FPFVLKSAAALSNGGVGSALHSGIVKFGLEFDSFVRVSLVDMYVKVDDLGSALKVFDESP 207
           FPFVLKS A       G  +H  ++K G + D +V  SL+ MYV+   L  A KVFD+SP
Sbjct: 137 FPFVLKSCAKSKAFKEGQQIHGHVLKLGCDLDLYVHTSLISMYVQNGRLEDAHKVFDKSP 196

Query: 208 ---------------------------DRIKKENVLIWNVLIHGYCRVGNLVKATELFET 267
                                      D I  ++V+ WN +I GY   GN  +A ELF+ 
Sbjct: 197 HRDVVSYTALIKGYASRGYIENAQKLFDEIPVKDVVSWNAMISGYAETGNYKEALELFKD 256

Query: 268 MPKKD-----------------TGS---------W-------------NSLINGFMRKGQ 323
           M K +                 +GS         W             N+LI+ + + G+
Sbjct: 257 MMKTNVRPDESTMVTVVSACAQSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSKCGE 316

BLAST of Cp4.1LG17g02740 vs. ExPASy Swiss-Prot
Match: Q9M9R6 (Pentatricopeptide repeat-containing protein At1g14470 OS=Arabidopsis thaliana OX=3702 GN=PCMP-A4 PE=2 SV=2)

HSP 1 Score: 160.6 bits (405), Expect = 3.2e-38
Identity = 98/264 (37.12%), Positives = 143/264 (54.17%), Query Frame = 0

Query: 58  KLRQIHGQLYRCN-IFSSSRVVTQFISSCSSLNSVDYAV-LIFQRFELKNSFLFNALIRG 117
           +L QIH QL   N +   S   ++ IS C+ L +  Y   LIF      N F+ N++ + 
Sbjct: 21  QLNQIHAQLIVFNSLPRQSYWASRIISCCTRLRAPSYYTRLIFDSVTFPNVFVVNSMFKY 80

Query: 118 LAENSRFESSIAYFVCMLRWEISPDRLTFPFVLKSAAALSNGGVGSALHSGIVKFGLEFD 177
            ++       +  +    R  I PD  +FP V+KSA     G  G    + + K G   D
Sbjct: 81  FSKMDMANDVLRLYEQRSRCGIMPDAFSFPVVIKSA-----GRFGILFQALVEKLGFFKD 140

Query: 178 SFVRVSLVDMYVKVDDLGSALKVFDESPDRIKKENVLIWNVLIHGYCRVGNLVKATELFE 237
            +VR  ++DMYVK + + SA KVFD+   R   +    WNV+I GY + GN  +A +LF+
Sbjct: 141 PYVRNVIMDMYVKHESVESARKVFDQISQRKGSD----WNVMISGYWKWGNKEEACKLFD 200

Query: 238 TMPKKDTGSWNSLINGFMRKGQLGPANELFEKMPEKNVVSWTTMVNGFSQNGDPEKALQF 297
            MP+ D  SW  +I GF +   L  A + F++MPEK+VVSW  M++G++QNG  E AL+ 
Sbjct: 201 MMPENDVVSWTVMITGFAKVKDLENARKYFDRMPEKSVVSWNAMLSGYAQNGFTEDALRL 260

Query: 298 FFCMLEEGARPNDYTIVSALSACA 320
           F  ML  G RPN+ T V  +SAC+
Sbjct: 261 FNDMLRLGVRPNETTWVIVISACS 275

BLAST of Cp4.1LG17g02740 vs. NCBI nr
Match: XP_023513771.1 (pentatricopeptide repeat-containing protein At1g04840 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 640 bits (1652), Expect = 1.27e-225
Identity = 322/324 (99.38%), Positives = 323/324 (99.69%), Query Frame = 0

Query: 1   MLLLRLNGTGSNRMKNLHVLFKPRIAFFNSTSSSSSPQISSQETHFIDLIHASDSTHKLR 60
           MLLLRLNGTGSNRMKNLHVLFKPRIAFFNSTSSSSSPQISSQETHFIDLIHASDSTHKLR
Sbjct: 1   MLLLRLNGTGSNRMKNLHVLFKPRIAFFNSTSSSSSPQISSQETHFIDLIHASDSTHKLR 60

Query: 61  QIHGQLYRCNIFSSSRVVTQFISSCSSLNSVDYAVLIFQRFELKNSFLFNALIRGLAENS 120
           QIHGQLYRCNIFSSSRVVTQFISSCSSLNSVDYAVLIFQRFELKNSFLFNALIRGLAENS
Sbjct: 61  QIHGQLYRCNIFSSSRVVTQFISSCSSLNSVDYAVLIFQRFELKNSFLFNALIRGLAENS 120

Query: 121 RFESSIAYFVCMLRWEISPDRLTFPFVLKSAAALSNGGVGSALHSGIVKFGLEFDSFVRV 180
           RFESSIAYFVCMLRWEISPDRLTFPFVLKSAAALSNGGVGSALHSGIVKFGLEFDSFVRV
Sbjct: 121 RFESSIAYFVCMLRWEISPDRLTFPFVLKSAAALSNGGVGSALHSGIVKFGLEFDSFVRV 180

Query: 181 SLVDMYVKVDDLGSALKVFDESPDRIKKENVLIWNVLIHGYCRVGNLVKATELFETMPKK 240
           SLVDMYVKVDDLGSALKVFDESPDRIKKENVLIWNVLIHGYCRVGNLVKATELFETMPKK
Sbjct: 181 SLVDMYVKVDDLGSALKVFDESPDRIKKENVLIWNVLIHGYCRVGNLVKATELFETMPKK 240

Query: 241 DTGSWNSLINGFMRKGQLGPANELFEKMPEKNVVSWTTMVNGFSQNGDPEKALQFFFCML 300
           DTGSWNSLINGFMRKGQLGPANELFEKMPEKNVVSWTTMVNGFSQNGDPEKALQFFFCML
Sbjct: 241 DTGSWNSLINGFMRKGQLGPANELFEKMPEKNVVSWTTMVNGFSQNGDPEKALQFFFCML 300

Query: 301 EEGARPNDYTIVSALSACAKLGLM 324
           EEGARPNDYTIVSALSACAKLG +
Sbjct: 301 EEGARPNDYTIVSALSACAKLGAL 324

BLAST of Cp4.1LG17g02740 vs. NCBI nr
Match: XP_023000600.1 (pentatricopeptide repeat-containing protein At1g04840 [Cucurbita maxima])

HSP 1 Score: 622 bits (1604), Expect = 2.37e-218
Identity = 312/323 (96.59%), Positives = 319/323 (98.76%), Query Frame = 0

Query: 2   LLLRLNGTGSNRMKNLHVLFKPRIAFFNSTSSSSSPQISSQETHFIDLIHASDSTHKLRQ 61
           +LL LNGTGSNRMKNLHVLFKPRIAFFNSTSSSSSPQISS ET+FIDLIHASDSTHKLRQ
Sbjct: 1   MLLLLNGTGSNRMKNLHVLFKPRIAFFNSTSSSSSPQISSLETYFIDLIHASDSTHKLRQ 60

Query: 62  IHGQLYRCNIFSSSRVVTQFISSCSSLNSVDYAVLIFQRFELKNSFLFNALIRGLAENSR 121
           IHGQLYRCNIFSSSRVVTQFISSCSSLNSVDYAVLIFQRFELKNSFLFNALIRGLAENSR
Sbjct: 61  IHGQLYRCNIFSSSRVVTQFISSCSSLNSVDYAVLIFQRFELKNSFLFNALIRGLAENSR 120

Query: 122 FESSIAYFVCMLRWEISPDRLTFPFVLKSAAALSNGGVGSALHSGIVKFGLEFDSFVRVS 181
           FESSI+YFVCMLRW+ISPDRLTFPFVLKSAAALSNGGVGSALHSGIVKFGLEFDSFVRVS
Sbjct: 121 FESSISYFVCMLRWKISPDRLTFPFVLKSAAALSNGGVGSALHSGIVKFGLEFDSFVRVS 180

Query: 182 LVDMYVKVDDLGSALKVFDESPDRIKKENVLIWNVLIHGYCRVGNLVKATELFETMPKKD 241
           LVDMYVKVDDLGSALKVFDESPDRIK+ NVLIWNVLIHGYCRVGNLVKATELFETMPKKD
Sbjct: 181 LVDMYVKVDDLGSALKVFDESPDRIKQGNVLIWNVLIHGYCRVGNLVKATELFETMPKKD 240

Query: 242 TGSWNSLINGFMRKGQLGPANELFEKMPEKNVVSWTTMVNGFSQNGDPEKALQFFFCMLE 301
           TGSWNSLINGFMRKGQLGPANELFEKMPEKNVVSWTTMVNGFSQNGDPEKALQFFFCMLE
Sbjct: 241 TGSWNSLINGFMRKGQLGPANELFEKMPEKNVVSWTTMVNGFSQNGDPEKALQFFFCMLE 300

Query: 302 EGARPNDYTIVSALSACAKLGLM 324
           EGA+PNDYTIVSALSACAKLG +
Sbjct: 301 EGAQPNDYTIVSALSACAKLGAL 323

BLAST of Cp4.1LG17g02740 vs. NCBI nr
Match: KAG7026055.1 (Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 619 bits (1595), Expect = 3.43e-217
Identity = 314/326 (96.32%), Positives = 320/326 (98.16%), Query Frame = 0

Query: 1   MLLLRLNG--TGSNRMKNLHVLFKPRIAFFNSTSSSSSPQISSQETHFIDLIHASDSTHK 60
           MLLLRLNG  TGSNRMKNL VLFKPRIAFFNSTSSSSSPQISS ETHFIDLIHASDSTHK
Sbjct: 1   MLLLRLNGYGTGSNRMKNLLVLFKPRIAFFNSTSSSSSPQISSLETHFIDLIHASDSTHK 60

Query: 61  LRQIHGQLYRCNIFSSSRVVTQFISSCSSLNSVDYAVLIFQRFELKNSFLFNALIRGLAE 120
           LRQIHGQLYRCNIFSSSRVVTQFISSCSSLNSVDYAVLIFQRFELKNSFLFNALIRGLAE
Sbjct: 61  LRQIHGQLYRCNIFSSSRVVTQFISSCSSLNSVDYAVLIFQRFELKNSFLFNALIRGLAE 120

Query: 121 NSRFESSIAYFVCMLRWEISPDRLTFPFVLKSAAALSNGGVGSALHSGIVKFGLEFDSFV 180
           NSRFESSI+YFVCMLRW+ISPDRLTFPFVLKSAAALSNGGVGSALHSGI+KFGLEFDSFV
Sbjct: 121 NSRFESSISYFVCMLRWKISPDRLTFPFVLKSAAALSNGGVGSALHSGILKFGLEFDSFV 180

Query: 181 RVSLVDMYVKVDDLGSALKVFDESPDRIKKENVLIWNVLIHGYCRVGNLVKATELFETMP 240
           RVSLVDMYVKVDDLGSALKVFDESPDRIKK NVLIWNVLIHGYCRVGNLVKATELFETMP
Sbjct: 181 RVSLVDMYVKVDDLGSALKVFDESPDRIKKGNVLIWNVLIHGYCRVGNLVKATELFETMP 240

Query: 241 KKDTGSWNSLINGFMRKGQLGPANELFEKMPEKNVVSWTTMVNGFSQNGDPEKALQFFFC 300
           +KDTGSWNSLINGFMRKGQLGPA+ELFEKMPEKNVVSWTTMVNGFSQNGDPEKALQFFFC
Sbjct: 241 EKDTGSWNSLINGFMRKGQLGPAHELFEKMPEKNVVSWTTMVNGFSQNGDPEKALQFFFC 300

Query: 301 MLEEGARPNDYTIVSALSACAKLGLM 324
           MLEEGARPNDYTIVSALSACAKLG +
Sbjct: 301 MLEEGARPNDYTIVSALSACAKLGAL 326

BLAST of Cp4.1LG17g02740 vs. NCBI nr
Match: XP_022964045.1 (pentatricopeptide repeat-containing protein At1g04840 [Cucurbita moschata])

HSP 1 Score: 619 bits (1595), Expect = 6.09e-217
Identity = 314/326 (96.32%), Positives = 320/326 (98.16%), Query Frame = 0

Query: 1   MLLLRLNG--TGSNRMKNLHVLFKPRIAFFNSTSSSSSPQISSQETHFIDLIHASDSTHK 60
           MLLLRLNG  TGSNRMKNL VLFKPRIAFFNSTSSSSSPQISS ETHFIDLIHASDSTHK
Sbjct: 1   MLLLRLNGYGTGSNRMKNLLVLFKPRIAFFNSTSSSSSPQISSLETHFIDLIHASDSTHK 60

Query: 61  LRQIHGQLYRCNIFSSSRVVTQFISSCSSLNSVDYAVLIFQRFELKNSFLFNALIRGLAE 120
           LRQIHGQLYRCNIFSSSRVVTQFISSCSSLNSVDYAVLIFQRFELKNSFLFNALIRGLAE
Sbjct: 61  LRQIHGQLYRCNIFSSSRVVTQFISSCSSLNSVDYAVLIFQRFELKNSFLFNALIRGLAE 120

Query: 121 NSRFESSIAYFVCMLRWEISPDRLTFPFVLKSAAALSNGGVGSALHSGIVKFGLEFDSFV 180
           NSRFESSI+YFVCMLRW+ISPDRLTFPFVLKSAAALSNGGVGSALHSGI+KFGLEFDSFV
Sbjct: 121 NSRFESSISYFVCMLRWKISPDRLTFPFVLKSAAALSNGGVGSALHSGILKFGLEFDSFV 180

Query: 181 RVSLVDMYVKVDDLGSALKVFDESPDRIKKENVLIWNVLIHGYCRVGNLVKATELFETMP 240
           RVSLVDMYVKVDDLGSALKVFDESPDRIKK NVLIWNVLIHGYCRVGNLVKATELFETMP
Sbjct: 181 RVSLVDMYVKVDDLGSALKVFDESPDRIKKGNVLIWNVLIHGYCRVGNLVKATELFETMP 240

Query: 241 KKDTGSWNSLINGFMRKGQLGPANELFEKMPEKNVVSWTTMVNGFSQNGDPEKALQFFFC 300
           +KDTGSWNSLINGFMRKGQLGPA+ELFEKMPEKNVVSWTTMVNGFSQNGDPEKALQFFFC
Sbjct: 241 EKDTGSWNSLINGFMRKGQLGPAHELFEKMPEKNVVSWTTMVNGFSQNGDPEKALQFFFC 300

Query: 301 MLEEGARPNDYTIVSALSACAKLGLM 324
           MLEEGARPNDYTIVSALSACAKLG +
Sbjct: 301 MLEEGARPNDYTIVSALSACAKLGAL 326

BLAST of Cp4.1LG17g02740 vs. NCBI nr
Match: KAG6593716.1 (Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 619 bits (1595), Expect = 2.48e-211
Identity = 314/326 (96.32%), Positives = 320/326 (98.16%), Query Frame = 0

Query: 1   MLLLRLNG--TGSNRMKNLHVLFKPRIAFFNSTSSSSSPQISSQETHFIDLIHASDSTHK 60
           MLLLRLNG  TGSNRMKNL VLFKPRIAFFNSTSSSSSPQISS ETHFIDLIHASDSTHK
Sbjct: 1   MLLLRLNGYGTGSNRMKNLLVLFKPRIAFFNSTSSSSSPQISSLETHFIDLIHASDSTHK 60

Query: 61  LRQIHGQLYRCNIFSSSRVVTQFISSCSSLNSVDYAVLIFQRFELKNSFLFNALIRGLAE 120
           LRQIHGQLYRCNIFSSSRVVTQFISSCSSLNSVDYAVLIFQRFELKNSFLFNALIRGLAE
Sbjct: 61  LRQIHGQLYRCNIFSSSRVVTQFISSCSSLNSVDYAVLIFQRFELKNSFLFNALIRGLAE 120

Query: 121 NSRFESSIAYFVCMLRWEISPDRLTFPFVLKSAAALSNGGVGSALHSGIVKFGLEFDSFV 180
           NSRFESSI+YFVCMLRW+ISPDRLTFPFVLKSAAALSNGGVGSALHSGI+KFGLEFDSFV
Sbjct: 121 NSRFESSISYFVCMLRWKISPDRLTFPFVLKSAAALSNGGVGSALHSGILKFGLEFDSFV 180

Query: 181 RVSLVDMYVKVDDLGSALKVFDESPDRIKKENVLIWNVLIHGYCRVGNLVKATELFETMP 240
           RVSLVDMYVKVDDLGSALKVFDESPDRIKK NVLIWNVLIHGYCRVGNLVKATELFETMP
Sbjct: 181 RVSLVDMYVKVDDLGSALKVFDESPDRIKKGNVLIWNVLIHGYCRVGNLVKATELFETMP 240

Query: 241 KKDTGSWNSLINGFMRKGQLGPANELFEKMPEKNVVSWTTMVNGFSQNGDPEKALQFFFC 300
           +KDTGSWNSLINGFMRKGQLGPA+ELFEKMPEKNVVSWTTMVNGFSQNGDPEKALQFFFC
Sbjct: 241 EKDTGSWNSLINGFMRKGQLGPAHELFEKMPEKNVVSWTTMVNGFSQNGDPEKALQFFFC 300

Query: 301 MLEEGARPNDYTIVSALSACAKLGLM 324
           MLEEGARPNDYTIVSALSACAKLG +
Sbjct: 301 MLEEGARPNDYTIVSALSACAKLGAL 326

BLAST of Cp4.1LG17g02740 vs. ExPASy TrEMBL
Match: A0A6J1KIT8 (pentatricopeptide repeat-containing protein At1g04840 OS=Cucurbita maxima OX=3661 GN=LOC111494840 PE=3 SV=1)

HSP 1 Score: 622 bits (1604), Expect = 1.15e-218
Identity = 312/323 (96.59%), Positives = 319/323 (98.76%), Query Frame = 0

Query: 2   LLLRLNGTGSNRMKNLHVLFKPRIAFFNSTSSSSSPQISSQETHFIDLIHASDSTHKLRQ 61
           +LL LNGTGSNRMKNLHVLFKPRIAFFNSTSSSSSPQISS ET+FIDLIHASDSTHKLRQ
Sbjct: 1   MLLLLNGTGSNRMKNLHVLFKPRIAFFNSTSSSSSPQISSLETYFIDLIHASDSTHKLRQ 60

Query: 62  IHGQLYRCNIFSSSRVVTQFISSCSSLNSVDYAVLIFQRFELKNSFLFNALIRGLAENSR 121
           IHGQLYRCNIFSSSRVVTQFISSCSSLNSVDYAVLIFQRFELKNSFLFNALIRGLAENSR
Sbjct: 61  IHGQLYRCNIFSSSRVVTQFISSCSSLNSVDYAVLIFQRFELKNSFLFNALIRGLAENSR 120

Query: 122 FESSIAYFVCMLRWEISPDRLTFPFVLKSAAALSNGGVGSALHSGIVKFGLEFDSFVRVS 181
           FESSI+YFVCMLRW+ISPDRLTFPFVLKSAAALSNGGVGSALHSGIVKFGLEFDSFVRVS
Sbjct: 121 FESSISYFVCMLRWKISPDRLTFPFVLKSAAALSNGGVGSALHSGIVKFGLEFDSFVRVS 180

Query: 182 LVDMYVKVDDLGSALKVFDESPDRIKKENVLIWNVLIHGYCRVGNLVKATELFETMPKKD 241
           LVDMYVKVDDLGSALKVFDESPDRIK+ NVLIWNVLIHGYCRVGNLVKATELFETMPKKD
Sbjct: 181 LVDMYVKVDDLGSALKVFDESPDRIKQGNVLIWNVLIHGYCRVGNLVKATELFETMPKKD 240

Query: 242 TGSWNSLINGFMRKGQLGPANELFEKMPEKNVVSWTTMVNGFSQNGDPEKALQFFFCMLE 301
           TGSWNSLINGFMRKGQLGPANELFEKMPEKNVVSWTTMVNGFSQNGDPEKALQFFFCMLE
Sbjct: 241 TGSWNSLINGFMRKGQLGPANELFEKMPEKNVVSWTTMVNGFSQNGDPEKALQFFFCMLE 300

Query: 302 EGARPNDYTIVSALSACAKLGLM 324
           EGA+PNDYTIVSALSACAKLG +
Sbjct: 301 EGAQPNDYTIVSALSACAKLGAL 323

BLAST of Cp4.1LG17g02740 vs. ExPASy TrEMBL
Match: A0A6J1HJP9 (pentatricopeptide repeat-containing protein At1g04840 OS=Cucurbita moschata OX=3662 GN=LOC111464188 PE=3 SV=1)

HSP 1 Score: 619 bits (1595), Expect = 2.95e-217
Identity = 314/326 (96.32%), Positives = 320/326 (98.16%), Query Frame = 0

Query: 1   MLLLRLNG--TGSNRMKNLHVLFKPRIAFFNSTSSSSSPQISSQETHFIDLIHASDSTHK 60
           MLLLRLNG  TGSNRMKNL VLFKPRIAFFNSTSSSSSPQISS ETHFIDLIHASDSTHK
Sbjct: 1   MLLLRLNGYGTGSNRMKNLLVLFKPRIAFFNSTSSSSSPQISSLETHFIDLIHASDSTHK 60

Query: 61  LRQIHGQLYRCNIFSSSRVVTQFISSCSSLNSVDYAVLIFQRFELKNSFLFNALIRGLAE 120
           LRQIHGQLYRCNIFSSSRVVTQFISSCSSLNSVDYAVLIFQRFELKNSFLFNALIRGLAE
Sbjct: 61  LRQIHGQLYRCNIFSSSRVVTQFISSCSSLNSVDYAVLIFQRFELKNSFLFNALIRGLAE 120

Query: 121 NSRFESSIAYFVCMLRWEISPDRLTFPFVLKSAAALSNGGVGSALHSGIVKFGLEFDSFV 180
           NSRFESSI+YFVCMLRW+ISPDRLTFPFVLKSAAALSNGGVGSALHSGI+KFGLEFDSFV
Sbjct: 121 NSRFESSISYFVCMLRWKISPDRLTFPFVLKSAAALSNGGVGSALHSGILKFGLEFDSFV 180

Query: 181 RVSLVDMYVKVDDLGSALKVFDESPDRIKKENVLIWNVLIHGYCRVGNLVKATELFETMP 240
           RVSLVDMYVKVDDLGSALKVFDESPDRIKK NVLIWNVLIHGYCRVGNLVKATELFETMP
Sbjct: 181 RVSLVDMYVKVDDLGSALKVFDESPDRIKKGNVLIWNVLIHGYCRVGNLVKATELFETMP 240

Query: 241 KKDTGSWNSLINGFMRKGQLGPANELFEKMPEKNVVSWTTMVNGFSQNGDPEKALQFFFC 300
           +KDTGSWNSLINGFMRKGQLGPA+ELFEKMPEKNVVSWTTMVNGFSQNGDPEKALQFFFC
Sbjct: 241 EKDTGSWNSLINGFMRKGQLGPAHELFEKMPEKNVVSWTTMVNGFSQNGDPEKALQFFFC 300

Query: 301 MLEEGARPNDYTIVSALSACAKLGLM 324
           MLEEGARPNDYTIVSALSACAKLG +
Sbjct: 301 MLEEGARPNDYTIVSALSACAKLGAL 326

BLAST of Cp4.1LG17g02740 vs. ExPASy TrEMBL
Match: A0A0A0LI86 (DYW_deaminase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_2G139850 PE=3 SV=1)

HSP 1 Score: 554 bits (1427), Expect = 7.85e-192
Identity = 274/323 (84.83%), Positives = 301/323 (93.19%), Query Frame = 0

Query: 2   LLLRLNGTGSNRMKNLHVLFKPRIAFFNSTSSSSSPQISSQETHFIDLIHASDSTHKLRQ 61
           +LLR NG+GSN MK+LHVLF PRIAFF+S  SSSSP IS  ETHFIDLIHAS+STHKLRQ
Sbjct: 1   MLLRRNGSGSNIMKDLHVLFNPRIAFFSSMFSSSSPPISFLETHFIDLIHASNSTHKLRQ 60

Query: 62  IHGQLYRCNIFSSSRVVTQFISSCSSLNSVDYAVLIFQRFELKNSFLFNALIRGLAENSR 121
           IHGQLYRCN+FSSSRVVTQFISSCSSLNSVDYA+ IFQRFELKNS+LFNALIRGLAENSR
Sbjct: 61  IHGQLYRCNVFSSSRVVTQFISSCSSLNSVDYAISIFQRFELKNSYLFNALIRGLAENSR 120

Query: 122 FESSIAYFVCMLRWEISPDRLTFPFVLKSAAALSNGGVGSALHSGIVKFGLEFDSFVRVS 181
           FESSI++FV ML+W+ISPDRLTFPFVLKSAAALSNGGVG ALH GI+KFGLEFDSFVRVS
Sbjct: 121 FESSISFFVLMLKWKISPDRLTFPFVLKSAAALSNGGVGRALHCGILKFGLEFDSFVRVS 180

Query: 182 LVDMYVKVDDLGSALKVFDESPDRIKKENVLIWNVLIHGYCRVGNLVKATELFETMPKKD 241
           LVDMYVKV++LGSALKVFDESP+ +K  +VLIWNVLIHGYCR+G+LVKATELF++MPKKD
Sbjct: 181 LVDMYVKVEELGSALKVFDESPESVKNGSVLIWNVLIHGYCRMGDLVKATELFDSMPKKD 240

Query: 242 TGSWNSLINGFMRKGQLGPANELFEKMPEKNVVSWTTMVNGFSQNGDPEKALQFFFCMLE 301
           TGSWNSLINGFM+ G +G A ELF KMPEKNVVSWTTMVNGFSQNGDPEKAL+ FFCMLE
Sbjct: 241 TGSWNSLINGFMKMGDMGRAKELFVKMPEKNVVSWTTMVNGFSQNGDPEKALETFFCMLE 300

Query: 302 EGARPNDYTIVSALSACAKLGLM 324
           EGARPNDYTIVSALSACAK+G +
Sbjct: 301 EGARPNDYTIVSALSACAKIGAL 323

BLAST of Cp4.1LG17g02740 vs. ExPASy TrEMBL
Match: A0A5A7SRY4 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold275G00910 PE=3 SV=1)

HSP 1 Score: 544 bits (1402), Expect = 4.80e-188
Identity = 271/323 (83.90%), Positives = 298/323 (92.26%), Query Frame = 0

Query: 2   LLLRLNGTGSNRMKNLHVLFKPRIAFFNSTSSSSSPQISSQETHFIDLIHASDSTHKLRQ 61
           +LL  NGTGSN MK+LHVLF PRIAF +S  SSSS +ISS ETHFIDLIHAS+STHKLRQ
Sbjct: 1   MLLPRNGTGSNIMKDLHVLFNPRIAFLSSMFSSSSLRISSLETHFIDLIHASNSTHKLRQ 60

Query: 62  IHGQLYRCNIFSSSRVVTQFISSCSSLNSVDYAVLIFQRFELKNSFLFNALIRGLAENSR 121
           IHGQLYRCN+FSSSRVVTQFISSCS LN+VDYAV IFQRFELKNS+LFNALIRGLAENSR
Sbjct: 61  IHGQLYRCNVFSSSRVVTQFISSCSLLNAVDYAVSIFQRFELKNSYLFNALIRGLAENSR 120

Query: 122 FESSIAYFVCMLRWEISPDRLTFPFVLKSAAALSNGGVGSALHSGIVKFGLEFDSFVRVS 181
           FESSI++FV ML+W+ISPDRLTFPFVLKSAAALSNGGVG ALH GI+KFGL FDSFVRVS
Sbjct: 121 FESSISFFVLMLKWKISPDRLTFPFVLKSAAALSNGGVGRALHCGILKFGLVFDSFVRVS 180

Query: 182 LVDMYVKVDDLGSALKVFDESPDRIKKENVLIWNVLIHGYCRVGNLVKATELFETMPKKD 241
           LVDMYVKV +LGSALKVFDESP+ +K  +VLIWNVLIHGYCR+G+LVKATELF++MPKKD
Sbjct: 181 LVDMYVKVGELGSALKVFDESPESVKNGSVLIWNVLIHGYCRMGDLVKATELFDSMPKKD 240

Query: 242 TGSWNSLINGFMRKGQLGPANELFEKMPEKNVVSWTTMVNGFSQNGDPEKALQFFFCMLE 301
           TGSWNSLINGFM+ G +G A ELFEKMPEKNVVSWTTMVNGFSQNGDP+KAL+ FFCMLE
Sbjct: 241 TGSWNSLINGFMKMGDMGRAKELFEKMPEKNVVSWTTMVNGFSQNGDPQKALETFFCMLE 300

Query: 302 EGARPNDYTIVSALSACAKLGLM 324
           EGARPNDYTIVSALSACAK+G +
Sbjct: 301 EGARPNDYTIVSALSACAKIGAL 323

BLAST of Cp4.1LG17g02740 vs. ExPASy TrEMBL
Match: A0A1S3C6B0 (pentatricopeptide repeat-containing protein At1g04840 OS=Cucumis melo OX=3656 GN=LOC103496955 PE=3 SV=1)

HSP 1 Score: 544 bits (1402), Expect = 4.80e-188
Identity = 271/323 (83.90%), Positives = 298/323 (92.26%), Query Frame = 0

Query: 2   LLLRLNGTGSNRMKNLHVLFKPRIAFFNSTSSSSSPQISSQETHFIDLIHASDSTHKLRQ 61
           +LL  NGTGSN MK+LHVLF PRIAF +S  SSSS +ISS ETHFIDLIHAS+STHKLRQ
Sbjct: 1   MLLPRNGTGSNIMKDLHVLFNPRIAFLSSMFSSSSLRISSLETHFIDLIHASNSTHKLRQ 60

Query: 62  IHGQLYRCNIFSSSRVVTQFISSCSSLNSVDYAVLIFQRFELKNSFLFNALIRGLAENSR 121
           IHGQLYRCN+FSSSRVVTQFISSCS LN+VDYAV IFQRFELKNS+LFNALIRGLAENSR
Sbjct: 61  IHGQLYRCNVFSSSRVVTQFISSCSLLNAVDYAVSIFQRFELKNSYLFNALIRGLAENSR 120

Query: 122 FESSIAYFVCMLRWEISPDRLTFPFVLKSAAALSNGGVGSALHSGIVKFGLEFDSFVRVS 181
           FESSI++FV ML+W+ISPDRLTFPFVLKSAAALSNGGVG ALH GI+KFGL FDSFVRVS
Sbjct: 121 FESSISFFVLMLKWKISPDRLTFPFVLKSAAALSNGGVGRALHCGILKFGLVFDSFVRVS 180

Query: 182 LVDMYVKVDDLGSALKVFDESPDRIKKENVLIWNVLIHGYCRVGNLVKATELFETMPKKD 241
           LVDMYVKV +LGSALKVFDESP+ +K  +VLIWNVLIHGYCR+G+LVKATELF++MPKKD
Sbjct: 181 LVDMYVKVGELGSALKVFDESPESVKNGSVLIWNVLIHGYCRMGDLVKATELFDSMPKKD 240

Query: 242 TGSWNSLINGFMRKGQLGPANELFEKMPEKNVVSWTTMVNGFSQNGDPEKALQFFFCMLE 301
           TGSWNSLINGFM+ G +G A ELFEKMPEKNVVSWTTMVNGFSQNGDP+KAL+ FFCMLE
Sbjct: 241 TGSWNSLINGFMKMGDMGRAKELFEKMPEKNVVSWTTMVNGFSQNGDPQKALETFFCMLE 300

Query: 302 EGARPNDYTIVSALSACAKLGLM 324
           EGARPNDYTIVSALSACAK+G +
Sbjct: 301 EGARPNDYTIVSALSACAKIGAL 323

BLAST of Cp4.1LG17g02740 vs. TAIR 10
Match: AT1G04840.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 323.2 bits (827), Expect = 2.6e-88
Identity = 167/314 (53.18%), Positives = 222/314 (70.70%), Query Frame = 0

Query: 14  MKNLHVLFKPRIAFFNSTSSSSSP---QISSQETHFIDLIHASDSTHKLRQIHGQLYRCN 73
           MK+L V+FKP+    +S +    P   Q S  E+HFI LIHA   T  LR +H Q+ R  
Sbjct: 1   MKSLSVIFKPK----SSPAKIYFPADRQASPDESHFISLIHACKDTASLRHVHAQILRRG 60

Query: 74  IFSSSRVVTQFISSCSSLNSVDYAVLIFQRFELKNSFLFNALIRGLAENSRFESSIAYFV 133
           +  SSRV  Q +S  S L S DY++ IF+  E +N F+ NALIRGL EN+RFESS+ +F+
Sbjct: 61  VL-SSRVAAQLVSCSSLLKSPDYSLSIFRNSEERNPFVLNALIRGLTENARFESSVRHFI 120

Query: 134 CMLRWEISPDRLTFPFVLKSAAALSNGGVGSALHSGIVKFGLEFDSFVRVSLVDMYVKVD 193
            MLR  + PDRLTFPFVLKS + L    +G ALH+  +K  ++ DSFVR+SLVDMY K  
Sbjct: 121 LMLRLGVKPDRLTFPFVLKSNSKLGFRWLGRALHAATLKNFVDCDSFVRLSLVDMYAKTG 180

Query: 194 DLGSALKVFDESPDRIKKENVLIWNVLIHGYCRVGNLVKATELFETMPKKDTGSWNSLIN 253
            L  A +VF+ESPDRIKKE++LIWNVLI+GYCR  ++  AT LF +MP++++GSW++LI 
Sbjct: 181 QLKHAFQVFEESPDRIKKESILIWNVLINGYCRAKDMHMATTLFRSMPERNSGSWSTLIK 240

Query: 254 GFMRKGQLGPANELFEKMPEKNVVSWTTMVNGFSQNGDPEKALQFFFCMLEEGARPNDYT 313
           G++  G+L  A +LFE MPEKNVVSWTT++NGFSQ GD E A+  +F MLE+G +PN+YT
Sbjct: 241 GYVDSGELNRAKQLFELMPEKNVVSWTTLINGFSQTGDYETAISTYFEMLEKGLKPNEYT 300

Query: 314 IVSALSACAKLGLM 325
           I + LSAC+K G +
Sbjct: 301 IAAVLSACSKSGAL 309

BLAST of Cp4.1LG17g02740 vs. TAIR 10
Match: AT5G37570.1 (Pentatricopeptide repeat (PPR-like) superfamily protein )

HSP 1 Score: 170.6 bits (431), Expect = 2.2e-42
Identity = 99/321 (30.84%), Positives = 172/321 (53.58%), Query Frame = 0

Query: 35  SSPQISSQETHFIDLIHASDSTHKLRQIHGQLYRCNIFSSSRVVTQFI-SSCSSLNSVDY 94
           S P + S ET    L     S   L QIH ++ R  +     +++ FI SS SS +S+ Y
Sbjct: 6   SHPSLLSLET----LFKLCKSEIHLNQIHARIIRKGLEQDQNLISIFISSSSSSSSSLSY 65

Query: 95  AVLIFQRFELKNSFLFNALIRGLAENSRFESSIAYFVCMLRWEIS-PDRLTFPFVLKSAA 154
           +  +F+R     ++L+N LI+G +    F  +++  + M+R  ++ PD  TFP V+K  +
Sbjct: 66  SSSVFERVPSPGTYLWNHLIKGYSNKFLFFETVSILMRMMRTGLARPDEYTFPLVMKVCS 125

Query: 155 ALSNGGVGSALHSGIVKFGLEFDSFVRVSLVDMYVKVDDLGSALKVFDESPDRIKKENVL 214
                 VGS++H  +++ G + D  V  S VD Y K  DL SA KVF E P+R    N +
Sbjct: 126 NNGQVRVGSSVHGLVLRIGFDKDVVVGTSFVDFYGKCKDLFSARKVFGEMPER----NAV 185

Query: 215 IWNVLIHGYCRVGNLVKATELFETMPKKDTGSWNSLINGFMRKGQLGPANELFEKMPEKN 274
            W  L+  Y + G L +A  +F+ MP+++ GSWN+L++G ++ G L  A +LF++MP+++
Sbjct: 186 SWTALVVAYVKSGELEEAKSMFDLMPERNLGSWNALVDGLVKSGDLVNAKKLFDEMPKRD 245

Query: 275 VVSWTTMVN-------------------------------GFSQNGDPEKALQFFFCMLE 323
           ++S+T+M++                               G++QNG P +A + F  M  
Sbjct: 246 IISYTSMIDGYAKGGDMVSARDLFEEARGVDVRAWSALILGYAQNGQPNEAFKVFSEMCA 305

BLAST of Cp4.1LG17g02740 vs. TAIR 10
Match: AT1G06150.1 (basic helix-loop-helix (bHLH) DNA-binding superfamily protein )

HSP 1 Score: 166.4 bits (420), Expect = 4.1e-41
Identity = 99/315 (31.43%), Positives = 167/315 (53.02%), Query Frame = 0

Query: 12   NRMKNLHVLFKP--RIAFFNSTSSSSSPQISSQETHFIDLIHASDSTHKLRQIHGQLYRC 71
            N   N+H L  P   +  F+++ S + P +         +I    +   L      + + 
Sbjct: 747  NAFANVHSLRVPSHHLRDFSASLSLAPPNLKK-------IIKQCSTPKLLESALAAMIKT 806

Query: 72   NIFSSSRVVTQFISSCSSLNSVDYAVLIFQRFELKNSFLFNALIRGLAENSRFESSIAYF 131
            ++    R++ QFI++C+S   +D AV    + +  N F++NAL +G    S    S+  +
Sbjct: 807  SLNQDCRLMNQFITACTSFKRLDLAVSTMTQMQEPNVFVYNALFKGFVTCSHPIRSLELY 866

Query: 132  VCMLRWEISPDRLTFPFVLKSAAALSNGGVGSALHSGIVKFGLEFDSFVRVSLVDMYVKV 191
            V MLR  +SP   T+  ++K+++  S    G +L + I KFG  F   ++ +L+D Y   
Sbjct: 867  VRMLRDSVSPSSYTYSSLVKASSFASR--FGESLQAHIWKFGFGFHVKIQTTLIDFYSAT 926

Query: 192  DDLGSALKVFDESPDRIKKENVLIWNVLIHGYCRVGNLVKATELFETMPKKDTGSWNSLI 251
              +  A KVFDE P+R    + + W  ++  Y RV ++  A  L   M +K+  + N LI
Sbjct: 927  GRIREARKVFDEMPER----DDIAWTTMVSAYRRVLDMDSANSLANQMSEKNEATSNCLI 986

Query: 252  NGFMRKGQLGPANELFEKMPEKNVVSWTTMVNGFSQNGDPEKALQFFFCMLEEGARPNDY 311
            NG+M  G L  A  LF +MP K+++SWTTM+ G+SQN    +A+  F+ M+EEG  P++ 
Sbjct: 987  NGYMGLGNLEQAESLFNQMPVKDIISWTTMIKGYSQNKRYREAIAVFYKMMEEGIIPDEV 1046

Query: 312  TIVSALSACAKLGLM 325
            T+ + +SACA LG++
Sbjct: 1047 TMSTVISACAHLGVL 1048

BLAST of Cp4.1LG17g02740 vs. TAIR 10
Match: AT1G14470.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 160.6 bits (405), Expect = 2.3e-39
Identity = 98/264 (37.12%), Positives = 143/264 (54.17%), Query Frame = 0

Query: 58  KLRQIHGQLYRCN-IFSSSRVVTQFISSCSSLNSVDYAV-LIFQRFELKNSFLFNALIRG 117
           +L QIH QL   N +   S   ++ IS C+ L +  Y   LIF      N F+ N++ + 
Sbjct: 21  QLNQIHAQLIVFNSLPRQSYWASRIISCCTRLRAPSYYTRLIFDSVTFPNVFVVNSMFKY 80

Query: 118 LAENSRFESSIAYFVCMLRWEISPDRLTFPFVLKSAAALSNGGVGSALHSGIVKFGLEFD 177
            ++       +  +    R  I PD  +FP V+KSA     G  G    + + K G   D
Sbjct: 81  FSKMDMANDVLRLYEQRSRCGIMPDAFSFPVVIKSA-----GRFGILFQALVEKLGFFKD 140

Query: 178 SFVRVSLVDMYVKVDDLGSALKVFDESPDRIKKENVLIWNVLIHGYCRVGNLVKATELFE 237
            +VR  ++DMYVK + + SA KVFD+   R   +    WNV+I GY + GN  +A +LF+
Sbjct: 141 PYVRNVIMDMYVKHESVESARKVFDQISQRKGSD----WNVMISGYWKWGNKEEACKLFD 200

Query: 238 TMPKKDTGSWNSLINGFMRKGQLGPANELFEKMPEKNVVSWTTMVNGFSQNGDPEKALQF 297
            MP+ D  SW  +I GF +   L  A + F++MPEK+VVSW  M++G++QNG  E AL+ 
Sbjct: 201 MMPENDVVSWTVMITGFAKVKDLENARKYFDRMPEKSVVSWNAMLSGYAQNGFTEDALRL 260

Query: 298 FFCMLEEGARPNDYTIVSALSACA 320
           F  ML  G RPN+ T V  +SAC+
Sbjct: 261 FNDMLRLGVRPNETTWVIVISACS 275

BLAST of Cp4.1LG17g02740 vs. TAIR 10
Match: AT1G08070.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 160.6 bits (405), Expect = 2.3e-39
Identity = 110/365 (30.14%), Positives = 171/365 (46.85%), Query Frame = 0

Query: 28  FNSTSSSSSPQISSQETH-FIDLIHASDSTHKLRQIHGQLYRCNIFSSSRVVTQFISSC- 87
           F+   SSS P   S   H  + L+H   +   LR IH Q+ +  + +++  +++ I  C 
Sbjct: 17  FHFLPSSSDPPYDSIRNHPSLSLLHNCKTLQSLRIIHAQMIKIGLHNTNYALSKLIEFCI 76

Query: 88  --SSLNSVDYAVLIFQRFELKNSFLFNALIRGLAENSRFESSIAYFVCMLRWEISPDRLT 147
                  + YA+ +F+  +  N  ++N + RG A +S   S++  +VCM+   + P+  T
Sbjct: 77  LSPHFEGLPYAISVFKTIQEPNLLIWNTMFRGHALSSDPVSALKLYVCMISLGLLPNSYT 136

Query: 148 FPFVLKSAAALSNGGVGSALHSGIVKFGLEFDSFVRVSLVDMYVKVDDLGSALKVFDESP 207
           FPFVLKS A       G  +H  ++K G + D +V  SL+ MYV+   L  A KVFD+SP
Sbjct: 137 FPFVLKSCAKSKAFKEGQQIHGHVLKLGCDLDLYVHTSLISMYVQNGRLEDAHKVFDKSP 196

Query: 208 ---------------------------DRIKKENVLIWNVLIHGYCRVGNLVKATELFET 267
                                      D I  ++V+ WN +I GY   GN  +A ELF+ 
Sbjct: 197 HRDVVSYTALIKGYASRGYIENAQKLFDEIPVKDVVSWNAMISGYAETGNYKEALELFKD 256

Query: 268 MPKKD-----------------TGS---------W-------------NSLINGFMRKGQ 323
           M K +                 +GS         W             N+LI+ + + G+
Sbjct: 257 MMKTNVRPDESTMVTVVSACAQSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSKCGE 316

The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
Q9MAT2 | 3.7e-87 | 53.18 | Pentatricopeptide repeat-containing protein At1g04840 OS=Arabidopsis thaliana OX...
Q9FHR3 | 3.1e-41 | 30.84 | Putative pentatricopeptide repeat-containing protein At5g37570 OS=Arabidopsis th...
Q56X05 | 5.8e-40 | 31.43 | Pentatricopeptide repeat-containing protein At1g06143 OS=Arabidopsis thaliana OX...
Q9LN01 | 3.2e-38 | 30.14 | Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidop...
Q9M9R6 | 3.2e-38 | 37.12 | Pentatricopeptide repeat-containing protein At1g14470 OS=Arabidopsis thaliana OX...

Match Name | E-value | Identity (%) | Description
XP_023513771.1 | 1.27e-225 | 99.38 | pentatricopeptide repeat-containing protein At1g04840 [Cucurbita pepo subsp. pep...
XP_023000600.1 | 2.37e-218 | 96.59 | pentatricopeptide repeat-containing protein At1g04840 [Cucurbita maxima]
KAG7026055.1 | 3.43e-217 | 96.32 | Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma sub...
XP_022964045.1 | 6.09e-217 | 96.32 | pentatricopeptide repeat-containing protein At1g04840 [Cucurbita moschata]
KAG6593716.1 | 2.48e-211 | 96.32 | Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma sub...

Match Name | E-value | Identity (%) | Description
A0A6J1KIT8 | 1.15e-218 | 96.59 | pentatricopeptide repeat-containing protein At1g04840 OS=Cucurbita maxima OX=366...
A0A6J1HJP9 | 2.95e-217 | 96.32 | pentatricopeptide repeat-containing protein At1g04840 OS=Cucurbita moschata OX=3...
A0A0A0LI86 | 7.85e-192 | 84.83 | DYW_deaminase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_2G1398...
A0A5A7SRY4 | 4.80e-188 | 83.90 | Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946...
A0A1S3C6B0 | 4.80e-188 | 83.90 | pentatricopeptide repeat-containing protein At1g04840 OS=Cucumis melo OX=3656 GN...

Match Name | E-value | Identity (%) | Description
AT1G04840.1 | 2.6e-88 | 53.18 | Tetratricopeptide repeat (TPR)-like superfamily protein
AT5G37570.1 | 2.2e-42 | 30.84 | Pentatricopeptide repeat (PPR-like) superfamily protein
AT1G06150.1 | 4.1e-41 | 31.43 | basic helix-loop-helix (bHLH) DNA-binding superfamily protein
AT1G14470.1 | 2.3e-39 | 37.12 | Pentatricopeptide repeat (PPR) superfamily protein
AT1G08070.1 | 2.3e-39 | 30.14 | Tetratricopeptide repeat (TPR)-like superfamily protein

InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR | coord: 108..135; e-value: 0.06; score: 13.6
IPR002885 | Pentatricopeptide repeat | PFAM | PF13041 | PPR_2 | coord: 271..320; e-value: 3.0E-10; score: 40.1
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 244..274; e-value: 7.2E-4; score: 17.6
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 274..307; e-value: 7.5E-7; score: 26.9
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 213..240; e-value: 2.3E-6; score: 25.4
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 108..140; e-value: 0.002; score: 16.2
IPR002885 | Pentatricopeptide repeat | PFAM | PF12854 | PPR_1 | coord: 210..237; e-value: 5.7E-8; score: 32.3
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 210..244; score: 10.89559
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 272..306; score: 12.309597
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 105..139; score: 9.448698
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 162..240; e-value: 2.4E-13; score: 52.2
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 241..343; e-value: 9.7E-23; score: 83.0
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 44..161; e-value: 1.7E-9; score: 39.3
None | No IPR available | PANTHER | PTHR24015 | OS07G0578800 PROTEIN-RELATED | coord: 30..324
None | No IPR available | PANTHER | PTHR24015:SF1865 | TETRATRICOPEPTIDE REPEAT-LIKE SUPERFAMILY PROTEIN ISOFORM 1 | coord: 30..324

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Cp4.1LG17g02740.1 | Cp4.1LG17g02740.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0009451 | RNA modification
cellular_component | GO:0043231 | intracellular membrane-bounded organelle
molecular_function | GO:0005515 | protein binding
molecular_function | GO:0003723 | RNA binding
molecular_function | GO:0008270 | zinc ion binding