Cp4.1LG03g17120 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
Name: Cp4.1LG03g17120
Type: gene
Organism: Cucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
Description: Pentatricopeptide repeat-containing protein
Location: Cp4.1LG03: 12824901 .. 12828990 (-)
RNA-Seq Expression: Cp4.1LG03g17120
Synteny: Cp4.1LG03g17120
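
The Location field can be split into its parts with a short script. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual GFF convention, not stated on this page); the function name `parse_location` is hypothetical.

```python
import re

def parse_location(text):
    """Split a 'seqid: start .. end (strand)' string into its parts.

    Assumes 1-based, inclusive coordinates, so the span is end - start + 1.
    """
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    seqid, start, end, strand = m.groups()
    start, end = int(start), int(end)
    return seqid, start, end, strand, end - start + 1

seqid, start, end, strand, span = parse_location("Cp4.1LG03: 12824901 .. 12828990 (-)")
print(seqid, strand, span)
```

Under that assumption the feature spans 4090 bases on the minus strand of Cp4.1LG03.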
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

TATAGTCCCGAGAATTTGATCCATTCTATGGATTTCTCTGTTCAGTCGTACGGATTTCTTGCAGAGCTCCACGATTCGACGCGCGCCGGATTCCCCAGATAAGGATTAACGTCTCCAATGTCGGATTTCTTCTTCCCTATCCTCTTTGCTTGAACACAGCTTTCCGATTTTTTGTTCAAACGTTGGATTATCGTTCTGCTGCCGCTTAATCGTATCGTGGAATTTGTTCATACTGAGCGAACAACATTGTTTCTTTTTTGTGCGATTTTGAGTTTTCTCGTTGATTATGGGAACCTCCGTCTGTAACATTCTCTATCAAATTCATCCAAAACAGCCGCTGGTTAATGGAACTGCAAGGAGTTCGTATTCCTGTTACTGCAGAGGCTTAACTGGGCGAAGACTCAGAGTTTTAAGTCCTCGCAGAAGGTGTTATCAATTGGCTACTGTTGCCGCCATTGTTGAGGAAGTTCACACGTTAGAGAGTGGAAGAGAGAAACCGAGGTTTCGGTGGGTGGAGGTAGGCTCTGATATTACTGAAATGCAAAAGCAGGCTATATCTCAGCTTCCTCCCAAGATGACTAAAAGATGTAAGGCTGTGATGAAGCAAATTATCTGTTTTTCGCCTCAAAATGGTAATTTATCAGATATGTTGGCGGCGTGGGTGAGGATTATGAAGCCTAAAAGAGCTGATTGGCTTTCAGTTCTTAAGCATTTGAGAATTTTGGATCATCCACTTTACATCGAGGTACGTTAAATCTCGTATAGTTTCATAAATCTACTGCATAATATTCTGAAGGTTAAAGTACAAGTTCTTAATCCTCAAAACTTTCAATTTTTTGCAAATTTTTGTTCAATAAATCTATAAGATTTTACAGTTAGATTGACAAATTCGAAAATTTTAAAAATTAATGGTTCTATTAAACAATTTGAGTTTTATGTTTGACTGAATCCTAAACTTGCAATTTCTGTCTAAATTGAGGACTAAATTTATAATTTTGAAAGTTTTAAGAACCAAATGAACACAAACACGAACACGAGATTCATCTTATTCTTGATAAACATTAGGAACCTATTCTGATTGTTAAGTTCCTTCCCCTTCTTTGAGTTTGACAGAAGCAATCCAAATTTCTCTTGAAGTTGTATAATATAGTTGCAAATATCATCTAGATTCTTTGCTTATGTTTAAGAGACTATTTTGCATATTCGAACTGGCTCGGTTGATACGATGAAGTTTCCTATTTATTAAGATTCCGTTTGGTAACCATTTAATTTTGTGTTTTCGAAAACTAAATTTATAACCAATTTTTAAGGATTAAAAAAAAAAAGTTTTTCAAAAACATATTTTTGTTTTAAGAATTCAAAGATGAAGATTATGGTAAGTAAATTATGCAAAAACAAACACAATATTCAAAATTCGAATGGTTATCAAATGGCGTGACCTTAGCGATCGATGATATCTCGTAATCTAAGTCGATGAGTTTGTTTGTTGCTCTTAGGTGGCAGAAGCTGCTCTTGTAGAGAGAACATTTGAAGCCAGTACTCGGGACTACACAAAGATAATTCATTACTATGGGAAGCGAAACCAACTCGAGGATGCTGAAAGAATTCTCTTAAGCATGAGAGAAAGGGGTTTTGCTTGTGATCAAATAACATTAACCACAATGATCCACATTTATAGCAAGGCTGACGAACTTAGTCTGGCCAAACAAACTTTTGAAGAGCTCAAACTGCTCGAGGAACCGTTGGATCGAAGATCGTACGATGCGATGATTATGGCATTTATCCGAGCTGGGATGCCCGAGGAAGGTGAGAACATTCTCAAAGAAATGGATGAGAAAGACATATATGCAGGAAGTGAAGTTTACAAGGCTTTGTTAAGAGCGTACTCGATGGCTAGCAATGCCGATGGAGCTCAAAGGGTGTTCGATGCCATTCAGTTGGCTGCTATTCCTCCTGATGATAAGTTATGTGGTCTACTGATCAATGCGTATTTGATGGCA
GGCCAAAGCCAAAAGGCACAAATTGCTTTTGACAATATGAGGAGGGCTGGTCTTGAACCTAGTGACAAATGCATAGCGTTGGTATTAAGTGCATATGAAAAGGAGAACAGGCTGAACGCTGCGTTGGAACTTCTCATAGATTTGGAGAAGGAGAAGCTCATGGTTGGGAAGGAAGCTTCAGAAGTACTGGCAGCCTGGCTTAAAAGACTAGGGGTGGTAGAAGAGGTAGAACTTGTCTTGAGAGAATACGCTGTGAAAGAAGCCAGCGGATAAGGTACGAACAGCCCCCACCGCTAGCAGAAATTGTTTGCTTTGGCTTGTTACGTATCGCCTTCAGCCTCACAGTTTTAGAACGCGTCTAGTAGGGAGATGTTTCCGCACAGCTGTAAGGAGTGCTTCATTCTCCTCTCCAACCCATGTGGGATCTCACAATCCACCCCCTTGGGGCTCAGCGTCCTCGCTGGCACACCGCCCGATGTCTGGCTCTAATACCTTTGTAACAGCCCAAGCCCACCGCTAGTAGAAATTGTCCGTTTTGGCATGTTACGTATCACCGTCAGCCTCACAGTTTTAAAACGCGTCTACTAGAGATGTTTCCACAACCTTATAAGGAGTGTTTCGTTTCCCTTCTCCAACCGATGTGAGATGTCACATGACAAGGTACGAACACTTCCTTGGATGCAAAGTTCACTTTTGCCATGTGAATTGAAGTATAAAGAACACAAATTTCTGCTTCATTTCTGCTTTTAAGGGTAGTGGTTGCATAATTTTCAAATAGAAATAGACGAGAAAAGAGTTTTACAAGCTTGTTATTGAAACCTTCATGATGATCAAAACATGATATAAGTAAGCTGAAGTTTAACTTATTGATCAGCAGGAAGAACCCTGAGATTATTAGCAGTCCAATCACCAGTTAGGCTCCTTTAGGGACGTACGGAAATAGCGAGGCACCGATGCGTTGCATCGTGGAGAGTTGTTTTTGGACAATTTGTTGATCAATGCTGCCCATGGATGGGAAATTGTTGCTTCTAATTTGACAGAATTCCAATGGAGACTGCAACTTTGCCGGCGAAGCTTGCTCCACAGGATCGATAAGTAGTGAAAGATCTGATGCAGCAGAAGGCAAAGAAGAGCAATCTAATGGAAGGCTTACAATTGAGGGTATTGTTGGCTTGACCAGATCTTGTGGCTGTGCCTGTGGATGCGGTGCTGTTCTTGAAAGCTCAGATGCTGCTAGGTACGCTTGCAAGTATGCAAGCTCTACTTGCAACCTCGCAGCCTAAAAGATAATGTATAGAACTCGAATTAGTTTAAATTTTCGAAAGCCAAATAGTTATCGAATAGAGCTTCAGCTAATTTTGTTTTACCTGTTGTTGAAGGGCAAAGATATGAGCAACACAACCAAAAACTGGCTCTCTAACACGAGCTTGTGCCTCGTAGCATATAGTTATAGCTGCGTCGAGGCGCTTGTGTTCAGGAATATGCAGAAGCAGCTTCGACACATTACTAGCTCCGAACACTTTGTGCACGGCTGCAAAATGAGTCGTGCCTTGTTCAGAGTCAAAGTAAGGTGCAAATATACACTCCGGTGTACACTTCCTCCGCAGAAACTTGCACGCCCCACACGGTCCACCGCTGCCATTGCTGCTACTACCACCATTGCCCTCCTTGCAACCTCCACCGCCGTGCCTCGAGCTCATCTCGGCCTTGCTTTCTTGCCGTTCTTTCATCAAAAGCTAAGCAAGGCTTACTGGAAGAACAAGACAAGTATGGATTTATATATATATATATGGAGCTGACTTTGAGGTTTGTGGATCACAAGGTATATAACACTTTCTTTTACTCTTATCTCTTAGTTGTTTGATGGGATTTGCAAATTAGAAGGAAATGTAGGTTTTGGAAGATAGAAACATAGGTCCATTTTTGTAATAATATTAATGAATTAATATTATTAAAATTCCGCCACCCTCCATGTTTTCAAAGCTACGTGGAGTGGAGGA
TACGAGGACCGCCAATAAATCTTTTTTAAAATGTTTTTAAAGCTAATTAATGATAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA

mRNA sequence

TATAGTCCCGAGAATTTGATCCATTCTATGGATTTCTCTGTTCAGTCGTACGGATTTCTTGCAGAGCTCCACGATTCGACGCGCGCCGGATTCCCCAGATAAGGATTAACGTCTCCAATGTCGGATTTCTTCTTCCCTATCCTCTTTGCTTGAACACAGCTTTCCGATTTTTTGTTCAAACGTTGGATTATCGTTCTGCTGCCGCTTAATCGTATCGTGGAATTTGTTCATACTGAGCGAACAACATTGTTTCTTTTTTGTGCGATTTTGAGTTTTCTCGTTGATTATGGGAACCTCCGTCTGTAACATTCTCTATCAAATTCATCCAAAACAGCCGCTGGTTAATGGAACTGCAAGGAGTTCGTATTCCTGTTACTGCAGAGGCTTAACTGGGCGAAGACTCAGAGTTTTAAGTCCTCGCAGAAGGTGTTATCAATTGGCTACTGTTGCCGCCATTGTTGAGGAAGTTCACACGTTAGAGAGTGGAAGAGAGAAACCGAGGTTTCGGTGGGTGGAGGTAGGCTCTGATATTACTGAAATGCAAAAGCAGGCTATATCTCAGCTTCCTCCCAAGATGACTAAAAGATGTAAGGCTGTGATGAAGCAAATTATCTGTTTTTCGCCTCAAAATGGTAATTTATCAGATATGTTGGCGGCGTGGGTGAGGATTATGAAGCCTAAAAGAGCTGATTGGCTTTCAGTTCTTAAGCATTTGAGAATTTTGGATCATCCACTTTACATCGAGGTGGCAGAAGCTGCTCTTGTAGAGAGAACATTTGAAGCCAGTACTCGGGACTACACAAAGATAATTCATTACTATGGGAAGCGAAACCAACTCGAGGATGCTGAAAGAATTCTCTTAAGCATGAGAGAAAGGGGTTTTGCTTGTGATCAAATAACATTAACCACAATGATCCACATTTATAGCAAGGCTGACGAACTTAGTCTGGCCAAACAAACTTTTGAAGAGCTCAAACTGCTCGAGGAACCGTTGGATCGAAGATCGTACGATGCGATGATTATGGCATTTATCCGAGCTGGGATGCCCGAGGAAGGTGAGAACATTCTCAAAGAAATGGATGAGAAAGACATATATGCAGGAAGTGAAGTTTACAAGGCTTTGTTAAGAGCGTACTCGATGGCTAGCAATGCCGATGGAGCTCAAAGGGTGTTCGATGCCATTCAGTTGGCTGCTATTCCTCCTGATGATAAGTTATGTGGTCTACTGATCAATGCGTATTTGATGGCAGGCCAAAGCCAAAAGGCACAAATTGCTTTTGACAATATGAGGAGGGCTGGTCTTGAACCTAGTGACAAATGCATAGCGTTGGTATTAAGTGCATATGAAAAGGAGAACAGGCTGAACGCTGCGTTGGAACTTCTCATAGATTTGGAGAAGGAGAAGCTCATGGTTGGGAAGGAAGCTTCAGAAGAAGAACCCTGAGATTATTAGCAGTCCAATCACCAGTTAGGCTCCTTTAGGGACGTACGGAAATAGCGAGGCACCGATGCGTTGCATCGTGGAGAGTTGTTTTTGGACAATTTGTTGATCAATGCTGCCCATGGATGGGAAATTGTTGCTTCTAATTTGACAGAATTCCAATGGAGACTGCAACTTTGCCGGCGAAGCTTGCTCCACAGGATCGATAAGTAGTGAAAGATCTGATGCAGCAGAAGGCAAAGAAGAGCAATCTAATGGAAGGCTTACAATTGAGGGTATTGTTGGCTTGACCAGATCTTGTGGCTGTGCCTGTGGATGCGGTGCTGTTCTTGAAAGCTCAGATGCTGCTAGGTACGCTTGCAAGTATGCAAGCTCTACTTGCAACCTCGCAGCCTAAAAGATAATGGCAAAGATATGAGCAACACAACCAAAAACTGGCTCTCTAACACGAGCTTGTGCCTCGTAGCATATAGTTATAGCTGCGTCGAGGCGCTTGTGTTCAGGAATATGCAGAAGCAGCTTCGACACATTACTAGCTCCGAACACTTTGTGCACGGCT
GCAAAATGAGTCGTGCCTTGTTCAGAGTCAAAGTAAGGTGCAAATATACACTCCGGTGTACACTTCCTCCGCAGAAACTTGCACGCCCCACACGGTCCACCGCTGCCATTGCTGCTACTACCACCATTGCCCTCCTTGCAACCTCCACCGCCGTGCCTCGAGCTCATCTCGGCCTTGCTTTCTTGCCGTTCTTTCATCAAAAGCTAAGCAAGGCTTACTGGAAGAACAAGACAAGTATGGATTTATATATATATATATGGAGCTGACTTTGAGGTTTGTGGATCACAAGGTATATAACACTTTCTTTTACTCTTATCTCTTAGTTGTTTGATGGGATTTGCAAATTAGAAGGAAATGTAGGTTTTGGAAGATAGAAACATAGGTCCATTTTTGTAATAATATTAATGAATTAATATTATTAAAATTCCGCCACCCTCCATGTTTTCAAAGCTACGTGGAGTGGAGGATACGAGGACCGCCAATAAATCTTTTTTAAAATGTTTTTAAAGCTAATTAATGATAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA

Coding sequence (CDS)

ATGGGAACCTCCGTCTGTAACATTCTCTATCAAATTCATCCAAAACAGCCGCTGGTTAATGGAACTGCAAGGAGTTCGTATTCCTGTTACTGCAGAGGCTTAACTGGGCGAAGACTCAGAGTTTTAAGTCCTCGCAGAAGGTGTTATCAATTGGCTACTGTTGCCGCCATTGTTGAGGAAGTTCACACGTTAGAGAGTGGAAGAGAGAAACCGAGGTTTCGGTGGGTGGAGGTAGGCTCTGATATTACTGAAATGCAAAAGCAGGCTATATCTCAGCTTCCTCCCAAGATGACTAAAAGATGTAAGGCTGTGATGAAGCAAATTATCTGTTTTTCGCCTCAAAATGGTAATTTATCAGATATGTTGGCGGCGTGGGTGAGGATTATGAAGCCTAAAAGAGCTGATTGGCTTTCAGTTCTTAAGCATTTGAGAATTTTGGATCATCCACTTTACATCGAGGTGGCAGAAGCTGCTCTTGTAGAGAGAACATTTGAAGCCAGTACTCGGGACTACACAAAGATAATTCATTACTATGGGAAGCGAAACCAACTCGAGGATGCTGAAAGAATTCTCTTAAGCATGAGAGAAAGGGGTTTTGCTTGTGATCAAATAACATTAACCACAATGATCCACATTTATAGCAAGGCTGACGAACTTAGTCTGGCCAAACAAACTTTTGAAGAGCTCAAACTGCTCGAGGAACCGTTGGATCGAAGATCGTACGATGCGATGATTATGGCATTTATCCGAGCTGGGATGCCCGAGGAAGGTGAGAACATTCTCAAAGAAATGGATGAGAAAGACATATATGCAGGAAGTGAAGTTTACAAGGCTTTGTTAAGAGCGTACTCGATGGCTAGCAATGCCGATGGAGCTCAAAGGGTGTTCGATGCCATTCAGTTGGCTGCTATTCCTCCTGATGATAAGTTATGTGGTCTACTGATCAATGCGTATTTGATGGCAGGCCAAAGCCAAAAGGCACAAATTGCTTTTGACAATATGAGGAGGGCTGGTCTTGAACCTAGTGACAAATGCATAGCGTTGGTATTAAGTGCATATGAAAAGGAGAACAGGCTGAACGCTGCGTTGGAACTTCTCATAGATTTGGAGAAGGAGAAGCTCATGGTTGGGAAGGAAGCTTCAGAAGAAGAACCCTGA

Protein sequence

MGTSVCNILYQIHPKQPLVNGTARSSYSCYCRGLTGRRLRVLSPRRRCYQLATVAAIVEEVHTLESGREKPRFRWVEVGSDITEMQKQAISQLPPKMTKRCKAVMKQIICFSPQNGNLSDMLAAWVRIMKPKRADWLSVLKHLRILDHPLYIEVAEAALVERTFEASTRDYTKIIHYYGKRNQLEDAERILLSMRERGFACDQITLTTMIHIYSKADELSLAKQTFEELKLLEEPLDRRSYDAMIMAFIRAGMPEEGENILKEMDEKDIYAGSEVYKALLRAYSMASNADGAQRVFDAIQLAAIPPDDKLCGLLINAYLMAGQSQKAQIAFDNMRRAGLEPSDKCIALVLSAYEKENRLNAALELLIDLEKEKLMVGKEASEEEP
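
The relationship between the coding sequence and the protein sequence above can be checked by translation. The following is a minimal sketch, assuming the standard nuclear genetic code applies to this gene; the names `CODON_TABLE` and `translate` are hypothetical, and only the first 27 bases of the CDS are used as input.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First nine codons of the coding sequence listed above.
print(translate("ATGGGAACCTCCGTCTGTAACATTCTC"))  # MGTSVCNIL
```

The result matches the first nine residues of the protein sequence; passing the full CDS string to `translate` would allow the whole protein to be compared in the same way.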
Homology
BLAST of Cp4.1LG03g17120 vs. ExPASy Swiss-Prot
Match: Q9LPC4 (Pentatricopeptide repeat-containing protein At1g01970 OS=Arabidopsis thaliana OX=3702 GN=At1g01970 PE=2 SV=1)

HSP 1 Score: 413.3 bits (1061), Expect = 3.0e-114
Identity = 213/382 (55.76%), Positives = 280/382 (73.30%), Query Frame = 0

Query: 1   MGTSVCNILYQIHPKQPLVNGTARSSYSCYCRGLTGRRLRVLSP-RRRCYQLATVAAIVE 60
           MG   C+ +     K PLV    R     Y R      L V S   R C      +  + 
Sbjct: 1   MGIYSCSAVLSFGLKCPLVIARHR----LYHRMFRRNPLLVESHLNRLCSCKCNASLAIG 60

Query: 61  EVHTLESGREKPRFRWVEVGSDITEMQKQAISQLPPKMTKRCKAVMKQIICFSPQNGNLS 120
           EV   E   +   F W +VG ++TE Q +AI+++P KM+KRC+A+M+QIICFSP+ G+  
Sbjct: 61  EVVEKEDAEQSRSFNWADVGLNLTEEQDEAITRIPIKMSKRCQALMRQIICFSPEKGSFC 120

Query: 121 DMLAAWVRIMKPKRADWLSVLKHLRILDHPLYIEVAEAALVERTFEASTRDYTKIIHYYG 180
           D+L AW+R M P RADWLS+LK L+ LD P YI+VAE +L++ +FEA+ RDYTKIIHYYG
Sbjct: 121 DLLGAWLRRMNPIRADWLSILKELKNLDSPFYIKVAEFSLLQDSFEANARDYTKIIHYYG 180

Query: 181 KRNQLEDAERILLSMRERGFACDQITLTTMIHIYSKADELSLAKQTFEELKLLEEPLDRR 240
           K NQ+EDAER LLSM+ RGF  DQ+TLT M+ +YSKA    LA++TF E+KLL EPLD R
Sbjct: 181 KLNQVEDAERTLLSMKNRGFLIDQVTLTAMVQLYSKAGCHKLAEETFNEIKLLGEPLDYR 240

Query: 241 SYDAMIMAFIRAGMPEEGENILKEMDEKDIYAGSEVYKALLRAYSMASNADGAQRVFDAI 300
           SY +MIMA+IRAG+PE+GE++L+EMD ++I AG EVYKALLR YSM  +A+GA+RVFDA+
Sbjct: 241 SYGSMIMAYIRAGVPEKGESLLREMDSQEICAGREVYKALLRDYSMGGDAEGAKRVFDAV 300

Query: 301 QLAAIPPDDKLCGLLINAYLMAGQSQKAQIAFDNMRRAGLEPSDKCIALVLSAYEKENRL 360
           Q+A I PD KLCGLLINAY ++GQSQ A++AF+NMR+AG++ +DKC+ALVL+AYEKE +L
Sbjct: 301 QIAGITPDVKLCGLLINAYSVSGQSQNARLAFENMRKAGIKATDKCVALVLAAYEKEEKL 360

Query: 361 NAALELLIDLEKEKLMVGKEAS 382
           N AL  L++LEK+ +M+GKEAS
Sbjct: 361 NEALGFLVELEKDSIMLGKEAS 378

BLAST of Cp4.1LG03g17120 vs. ExPASy Swiss-Prot
Match: Q940Z1 (Pentatricopeptide repeat-containing protein At1g19525 OS=Arabidopsis thaliana OX=3702 GN=At1g19525 PE=2 SV=2)

HSP 1 Score: 140.6 bits (353), Expect = 3.8e-32
Identity = 76/185 (41.08%), Positives = 113/185 (61.08%), Query Frame = 0

Query: 194 MRERGFACDQITLTTMIHIYSKADELSLAKQTFEELKLLEEPLDRRSYDAMIMAFIRAGM 253
           M + G   D +T T ++H+YSK+     A + FE LK      D + Y+AMI+ ++ AG 
Sbjct: 1   MSQNGIFPDILTATALVHMYSKSGNFERATEAFENLKSYGLRPDEKIYEAMILGYVNAGK 60

Query: 254 PEEGENILKEMDEKDIYAGSEVYKALLRAYSMASNADGAQRVFDAIQLAAIPP-DDKLCG 313
           P+ GE ++KEM  K++ A  EVY ALLRAY+   +A+GA  +  ++Q A+  P   +   
Sbjct: 61  PKLGERLMKEMQAKELKASEEVYMALLRAYAQMGDANGAAGISSSMQYASDGPLSFEAYS 120

Query: 314 LLINAYLMAGQSQKAQIAFDNMRRAGLEPSDKCIALVLSAYEKENRLNAALELLIDLEKE 373
           L + AY  AGQ  KA+  FD MR+ G +P DKCIA ++ AY+ EN L+ AL LL+ LEK+
Sbjct: 121 LFVEAYGKAGQVDKAKSNFDEMRKLGHKPDDKCIANLVRAYKGENSLDKALRLLLQLEKD 180

Query: 374 KLMVG 378
            + +G
Sbjct: 181 GIEIG 185

BLAST of Cp4.1LG03g17120 vs. ExPASy Swiss-Prot
Match: Q0WMY5 (Pentatricopeptide repeat-containing protein At5g04810, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PPR4 PE=1 SV=1)

HSP 1 Score: 88.6 bits (218), Expect = 1.7e-16
Identity = 73/306 (23.86%), Positives = 141/306 (46.08%), Query Frame = 0

Query: 68  REKPRFRWVEVGSDITEMQKQAISQLPPKMTKRCKAVMKQIICFSPQNGNLSDMLAAWVR 127
           + + R RWVE G + T+M  ++      + +++    +++I+       N   +++A+ +
Sbjct: 249 KAEQRVRWVEEGEEDTKMSNKSSWHQEREGSRKS---LQRIL--DTNGDNWQAVISAFEK 308

Query: 128 IMKPKRADW-LSVLKHLRILDHPLYIEVAEAALVERTFEASTRDYTKIIHYYGKRNQLED 187
           I KP R ++ L V  + R  D     E  E  +  R    ++R YT +IH Y     +++
Sbjct: 309 ISKPSRTEFGLMVKFYGRRGDMHRARETFE-RMRARGITPTSRIYTSLIHAYAVGRDMDE 368

Query: 188 AERILLSMRERGFACDQITLTTMIHIYSKADELSLAKQTFEELKLLEEPLDRRSYDAMIM 247
           A   +  M+E G     +T + ++  +SKA     A   F+E K + + L+   Y  +I 
Sbjct: 369 ALSCVRKMKEEGIEMSLVTYSVIVGGFSKAGHAEAADYWFDEAKRIHKTLNASIYGKIIY 428

Query: 248 AFIRAGMPEEGENILKEMDEKDIYAGSEVYKALLRAYSMASNADGAQRVFDAIQLAAIPP 307
           A  +    E  E +++EM+E+ I A   +Y  ++  Y+M ++      VF  ++     P
Sbjct: 429 AHCQTCNMERAEALVREMEEEGIDAPIAIYHTMMDGYTMVADEKKGLVVFKRLKECGFTP 488

Query: 308 DDKLCGLLINAYLMAGQSQKAQIAFDNMRRAGLEPSDKCIALVLSAYEKENRLNAALELL 367
                G LIN Y   G+  KA      M+  G++ + K  +++++ + K      A  + 
Sbjct: 489 TVVTYGCLINLYTKVGKISKALEVSRVMKEEGVKHNLKTYSMMINGFVKLKDWANAFAVF 548

Query: 368 IDLEKE 373
            D+ KE
Sbjct: 549 EDMVKE 548

BLAST of Cp4.1LG03g17120 vs. ExPASy Swiss-Prot
Match: O82178 (Pentatricopeptide repeat-containing protein At2g35130 OS=Arabidopsis thaliana OX=3702 GN=At2g35130 PE=3 SV=1)

HSP 1 Score: 82.8 bits (203), Expect = 9.3e-15
Identity = 50/176 (28.41%), Positives = 89/176 (50.57%), Query Frame = 0

Query: 180 KRNQLEDAERILLSMRERGFACDQITLTTMIHIYSKADELSLAKQTFEELKLLEEPLDRR 239
           ++   E+A  +   M+         T   MI++Y KA +  ++ + + E++  +   +  
Sbjct: 241 RKGNTEEAIDVFQRMKRDRCKPTTETYNLMINLYGKASKSYMSWKLYCEMRSHQCKPNIC 300

Query: 240 SYDAMIMAFIRAGMPEEGENILKEMDEKDIYAGSEVYKALLRAYSMASNADGAQRVFDAI 299
           +Y A++ AF R G+ E+ E I +++ E  +     VY AL+ +YS A    GA  +F  +
Sbjct: 301 TYTALVNAFAREGLCEKAEEIFEQLQEDGLEPDVYVYNALMESYSRAGYPYGAAEIFSLM 360

Query: 300 QLAAIPPDDKLCGLLINAYLMAGQSQKAQIAFDNMRRAGLEPSDKCIALVLSAYEK 356
           Q     PD     ++++AY  AG    A+  F+ M+R G+ P+ K   L+LSAY K
Sbjct: 361 QHMGCEPDRASYNIMVDAYGRAGLHSDAEAVFEEMKRLGIAPTMKSHMLLLSAYSK 416

BLAST of Cp4.1LG03g17120 vs. ExPASy Swiss-Prot
Match: O64624 (Pentatricopeptide repeat-containing protein At2g18940, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=At2g18940 PE=2 SV=1)

HSP 1 Score: 80.1 bits (196), Expect = 6.0e-14
Identity = 58/201 (28.86%), Positives = 99/201 (49.25%), Query Frame = 0

Query: 171 YTKIIHYYGKRNQ-LEDAERILLSMRERGFACDQITLTTMIHIYSKADELSLAKQTFEEL 230
           Y  I+  +GK  +       +L  MR +G   D+ T +T++   ++   L  AK+ F EL
Sbjct: 248 YNVILDVFGKMGRSWRKILGVLDEMRSKGLKFDEFTCSTVLSACAREGLLREAKEFFAEL 307

Query: 231 KLLEEPLDRRSYDAMIMAFIRAGMPEEGENILKEMDEKDIYAGSEVYKALLRAYSMASNA 290
           K         +Y+A++  F +AG+  E  ++LKEM+E    A S  Y  L+ AY  A  +
Sbjct: 308 KSCGYEPGTVTYNALLQVFGKAGVYTEALSVLKEMEENSCPADSVTYNELVAAYVRAGFS 367

Query: 291 DGAQRVFDAIQLAAIPPDDKLCGLLINAYLMAGQSQKAQIAFDNMRRAGLEPSDKCIALV 350
             A  V + +    + P+      +I+AY  AG+  +A   F +M+ AG  P+      V
Sbjct: 368 KEAAGVIEMMTKKGVMPNAITYTTVIDAYGKAGKEDEALKLFYSMKEAGCVPNTCTYNAV 427

Query: 351 LSAYEKENRLNAALELLIDLE 371
           LS   K++R N  +++L D++
Sbjct: 428 LSLLGKKSRSNEMIKMLCDMK 448

BLAST of Cp4.1LG03g17120 vs. NCBI nr
Match: XP_023526280.1 (pentatricopeptide repeat-containing protein At1g01970 isoform X1 [Cucurbita pepo subsp. pepo] >XP_023526281.1 pentatricopeptide repeat-containing protein At1g01970 isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 753 bits (1943), Expect = 1.88e-273
Identity = 382/382 (100.00%), Positives = 382/382 (100.00%), Query Frame = 0

Query: 1   MGTSVCNILYQIHPKQPLVNGTARSSYSCYCRGLTGRRLRVLSPRRRCYQLATVAAIVEE 60
           MGTSVCNILYQIHPKQPLVNGTARSSYSCYCRGLTGRRLRVLSPRRRCYQLATVAAIVEE
Sbjct: 1   MGTSVCNILYQIHPKQPLVNGTARSSYSCYCRGLTGRRLRVLSPRRRCYQLATVAAIVEE 60

Query: 61  VHTLESGREKPRFRWVEVGSDITEMQKQAISQLPPKMTKRCKAVMKQIICFSPQNGNLSD 120
           VHTLESGREKPRFRWVEVGSDITEMQKQAISQLPPKMTKRCKAVMKQIICFSPQNGNLSD
Sbjct: 61  VHTLESGREKPRFRWVEVGSDITEMQKQAISQLPPKMTKRCKAVMKQIICFSPQNGNLSD 120

Query: 121 MLAAWVRIMKPKRADWLSVLKHLRILDHPLYIEVAEAALVERTFEASTRDYTKIIHYYGK 180
           MLAAWVRIMKPKRADWLSVLKHLRILDHPLYIEVAEAALVERTFEASTRDYTKIIHYYGK
Sbjct: 121 MLAAWVRIMKPKRADWLSVLKHLRILDHPLYIEVAEAALVERTFEASTRDYTKIIHYYGK 180

Query: 181 RNQLEDAERILLSMRERGFACDQITLTTMIHIYSKADELSLAKQTFEELKLLEEPLDRRS 240
           RNQLEDAERILLSMRERGFACDQITLTTMIHIYSKADELSLAKQTFEELKLLEEPLDRRS
Sbjct: 181 RNQLEDAERILLSMRERGFACDQITLTTMIHIYSKADELSLAKQTFEELKLLEEPLDRRS 240

Query: 241 YDAMIMAFIRAGMPEEGENILKEMDEKDIYAGSEVYKALLRAYSMASNADGAQRVFDAIQ 300
           YDAMIMAFIRAGMPEEGENILKEMDEKDIYAGSEVYKALLRAYSMASNADGAQRVFDAIQ
Sbjct: 241 YDAMIMAFIRAGMPEEGENILKEMDEKDIYAGSEVYKALLRAYSMASNADGAQRVFDAIQ 300

Query: 301 LAAIPPDDKLCGLLINAYLMAGQSQKAQIAFDNMRRAGLEPSDKCIALVLSAYEKENRLN 360
           LAAIPPDDKLCGLLINAYLMAGQSQKAQIAFDNMRRAGLEPSDKCIALVLSAYEKENRLN
Sbjct: 301 LAAIPPDDKLCGLLINAYLMAGQSQKAQIAFDNMRRAGLEPSDKCIALVLSAYEKENRLN 360

Query: 361 AALELLIDLEKEKLMVGKEASE 382
           AALELLIDLEKEKLMVGKEASE
Sbjct: 361 AALELLIDLEKEKLMVGKEASE 382

BLAST of Cp4.1LG03g17120 vs. NCBI nr
Match: KAG6582566.1 (Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia] >KAG7018949.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 739 bits (1909), Expect = 2.85e-268
Identity = 374/382 (97.91%), Positives = 379/382 (99.21%), Query Frame = 0

Query: 1   MGTSVCNILYQIHPKQPLVNGTARSSYSCYCRGLTGRRLRVLSPRRRCYQLATVAAIVEE 60
           M TSVCNILYQIHPKQPLVNGTARSSYSCYCRGLTGRRLRVLSPRRRCYQLA VAAIVEE
Sbjct: 1   METSVCNILYQIHPKQPLVNGTARSSYSCYCRGLTGRRLRVLSPRRRCYQLAIVAAIVEE 60

Query: 61  VHTLESGREKPRFRWVEVGSDITEMQKQAISQLPPKMTKRCKAVMKQIICFSPQNGNLSD 120
           VHTLESGREKPRFRWVEVGSDITEMQKQAISQLPPKMTKRCKAVMKQIICFSPQNGNLSD
Sbjct: 61  VHTLESGREKPRFRWVEVGSDITEMQKQAISQLPPKMTKRCKAVMKQIICFSPQNGNLSD 120

Query: 121 MLAAWVRIMKPKRADWLSVLKHLRILDHPLYIEVAEAALVERTFEASTRDYTKIIHYYGK 180
           MLAAWVRIMKPKRADWLSVLKHLRILDHPLYIEVAEAALVERTFEASTRDYTKIIHYYGK
Sbjct: 121 MLAAWVRIMKPKRADWLSVLKHLRILDHPLYIEVAEAALVERTFEASTRDYTKIIHYYGK 180

Query: 181 RNQLEDAERILLSMRERGFACDQITLTTMIHIYSKADELSLAKQTFEELKLLEEPLDRRS 240
           RNQLEDAERILL M+ERGFACDQITLTTMIHIYSKADEL+LAKQTFE++KLLEEPLDRRS
Sbjct: 181 RNQLEDAERILLCMKERGFACDQITLTTMIHIYSKADELNLAKQTFEDIKLLEEPLDRRS 240

Query: 241 YDAMIMAFIRAGMPEEGENILKEMDEKDIYAGSEVYKALLRAYSMASNADGAQRVFDAIQ 300
           YDAMIMAFIRAGMPEEGENILKEMDEKDIYAGSEVYKALLRAYSMASNA+GAQRVFDAIQ
Sbjct: 241 YDAMIMAFIRAGMPEEGENILKEMDEKDIYAGSEVYKALLRAYSMASNAEGAQRVFDAIQ 300

Query: 301 LAAIPPDDKLCGLLINAYLMAGQSQKAQIAFDNMRRAGLEPSDKCIALVLSAYEKENRLN 360
           LAAIPPDDKLCGLLINAYLMAGQSQKAQIAFDNMRRAGLEPSDKCIALVLSAYEKENRLN
Sbjct: 301 LAAIPPDDKLCGLLINAYLMAGQSQKAQIAFDNMRRAGLEPSDKCIALVLSAYEKENRLN 360

Query: 361 AALELLIDLEKEKLMVGKEASE 382
           AALELLIDLEKEKLMVGKEASE
Sbjct: 361 AALELLIDLEKEKLMVGKEASE 382

BLAST of Cp4.1LG03g17120 vs. NCBI nr
Match: XP_022980308.1 (pentatricopeptide repeat-containing protein At1g01970 [Cucurbita maxima] >XP_022980309.1 pentatricopeptide repeat-containing protein At1g01970 [Cucurbita maxima] >XP_022980310.1 pentatricopeptide repeat-containing protein At1g01970 [Cucurbita maxima] >XP_022980311.1 pentatricopeptide repeat-containing protein At1g01970 [Cucurbita maxima] >XP_022980312.1 pentatricopeptide repeat-containing protein At1g01970 [Cucurbita maxima] >XP_022980313.1 pentatricopeptide repeat-containing protein At1g01970 [Cucurbita maxima])

HSP 1 Score: 739 bits (1908), Expect = 4.05e-268
Identity = 374/382 (97.91%), Positives = 379/382 (99.21%), Query Frame = 0

Query: 1   MGTSVCNILYQIHPKQPLVNGTARSSYSCYCRGLTGRRLRVLSPRRRCYQLATVAAIVEE 60
           MGTSVCNILYQIHPKQPLVNGTARSSYSCYCRGLTGRRLRVLSPRR CYQLATVAAIVEE
Sbjct: 1   MGTSVCNILYQIHPKQPLVNGTARSSYSCYCRGLTGRRLRVLSPRRTCYQLATVAAIVEE 60

Query: 61  VHTLESGREKPRFRWVEVGSDITEMQKQAISQLPPKMTKRCKAVMKQIICFSPQNGNLSD 120
           VH LESGREKPRFRWVEVGSDITEMQKQAISQLPPKMTKRCKAVMKQIICF PQNG+LSD
Sbjct: 61  VHKLESGREKPRFRWVEVGSDITEMQKQAISQLPPKMTKRCKAVMKQIICFLPQNGHLSD 120

Query: 121 MLAAWVRIMKPKRADWLSVLKHLRILDHPLYIEVAEAALVERTFEASTRDYTKIIHYYGK 180
           MLAAWVRIMKPKRADWLSVLKHLRILDHPLYIEVAEAALVERTFEASTRDYTKIIHYYGK
Sbjct: 121 MLAAWVRIMKPKRADWLSVLKHLRILDHPLYIEVAEAALVERTFEASTRDYTKIIHYYGK 180

Query: 181 RNQLEDAERILLSMRERGFACDQITLTTMIHIYSKADELSLAKQTFEELKLLEEPLDRRS 240
           +NQLEDAERILLSMRERGFACDQITLTTMIHIYSKADEL+LAKQTFEELKLLEEPLDRRS
Sbjct: 181 QNQLEDAERILLSMRERGFACDQITLTTMIHIYSKADELNLAKQTFEELKLLEEPLDRRS 240

Query: 241 YDAMIMAFIRAGMPEEGENILKEMDEKDIYAGSEVYKALLRAYSMASNADGAQRVFDAIQ 300
           YDAMIMAF+RAGMPEEGENILKEMDEKDIYAGSEVYKALLRAYSMASNA+GAQRVFDAIQ
Sbjct: 241 YDAMIMAFVRAGMPEEGENILKEMDEKDIYAGSEVYKALLRAYSMASNAEGAQRVFDAIQ 300

Query: 301 LAAIPPDDKLCGLLINAYLMAGQSQKAQIAFDNMRRAGLEPSDKCIALVLSAYEKENRLN 360
           LAAIPPDDKLCGLLINAYLMAGQSQKAQIAFDNMRRAGLEPSDKCIALVLSAYEKENRLN
Sbjct: 301 LAAIPPDDKLCGLLINAYLMAGQSQKAQIAFDNMRRAGLEPSDKCIALVLSAYEKENRLN 360

Query: 361 AALELLIDLEKEKLMVGKEASE 382
           AALELLIDLEKEKLMVGKEASE
Sbjct: 361 AALELLIDLEKEKLMVGKEASE 382

BLAST of Cp4.1LG03g17120 vs. NCBI nr
Match: XP_022924339.1 (pentatricopeptide repeat-containing protein At1g01970 [Cucurbita moschata] >XP_022924340.1 pentatricopeptide repeat-containing protein At1g01970 [Cucurbita moschata] >XP_022924341.1 pentatricopeptide repeat-containing protein At1g01970 [Cucurbita moschata])

HSP 1 Score: 737 bits (1902), Expect = 3.32e-267
Identity = 374/382 (97.91%), Positives = 377/382 (98.69%), Query Frame = 0

Query: 1   MGTSVCNILYQIHPKQPLVNGTARSSYSCYCRGLTGRRLRVLSPRRRCYQLATVAAIVEE 60
           M TSVCNILYQIHPKQPLVNGTARSSYSCYCRGLTGRRLRVLSPRRRCYQL  VAAIVEE
Sbjct: 1   METSVCNILYQIHPKQPLVNGTARSSYSCYCRGLTGRRLRVLSPRRRCYQLCAVAAIVEE 60

Query: 61  VHTLESGREKPRFRWVEVGSDITEMQKQAISQLPPKMTKRCKAVMKQIICFSPQNGNLSD 120
           VH LESGREKPRFRW+EVGSDITEMQKQAISQLPPKMTKRCKAVMKQIICFSPQNGNLSD
Sbjct: 61  VHKLESGREKPRFRWLEVGSDITEMQKQAISQLPPKMTKRCKAVMKQIICFSPQNGNLSD 120

Query: 121 MLAAWVRIMKPKRADWLSVLKHLRILDHPLYIEVAEAALVERTFEASTRDYTKIIHYYGK 180
           MLAAWVRIMKPKRADWLSVLKHLRILDHPLYIEVAEAALVERTFEASTRDYTKIIHYYGK
Sbjct: 121 MLAAWVRIMKPKRADWLSVLKHLRILDHPLYIEVAEAALVERTFEASTRDYTKIIHYYGK 180

Query: 181 RNQLEDAERILLSMRERGFACDQITLTTMIHIYSKADELSLAKQTFEELKLLEEPLDRRS 240
           RNQLEDAERILLSMRERGFACDQITLTTMIHIYSKADELSLAKQTFEELKLLEEPLDRRS
Sbjct: 181 RNQLEDAERILLSMRERGFACDQITLTTMIHIYSKADELSLAKQTFEELKLLEEPLDRRS 240

Query: 241 YDAMIMAFIRAGMPEEGENILKEMDEKDIYAGSEVYKALLRAYSMASNADGAQRVFDAIQ 300
           YDAMIMAFIRAGMPEEGENILKEMDEKDIYAGSEVYKALLRAYSMASNA+GAQRVFDAIQ
Sbjct: 241 YDAMIMAFIRAGMPEEGENILKEMDEKDIYAGSEVYKALLRAYSMASNAEGAQRVFDAIQ 300

Query: 301 LAAIPPDDKLCGLLINAYLMAGQSQKAQIAFDNMRRAGLEPSDKCIALVLSAYEKENRLN 360
           LAAIPPDDKLCGLLINAYLMA QSQKAQIAFDNMRRAGLEPSDKCIALVLSAYEKENRLN
Sbjct: 301 LAAIPPDDKLCGLLINAYLMADQSQKAQIAFDNMRRAGLEPSDKCIALVLSAYEKENRLN 360

Query: 361 AALELLIDLEKEKLMVGKEASE 382
           AALELLIDLEKEKL+VGKEASE
Sbjct: 361 AALELLIDLEKEKLVVGKEASE 382

BLAST of Cp4.1LG03g17120 vs. NCBI nr
Match: XP_038903030.1 (pentatricopeptide repeat-containing protein At1g01970 [Benincasa hispida] >XP_038903038.1 pentatricopeptide repeat-containing protein At1g01970 [Benincasa hispida])

HSP 1 Score: 636 bits (1640), Expect = 2.80e-227
Identity = 323/379 (85.22%), Positives = 349/379 (92.08%), Query Frame = 0

Query: 4   SVCNILYQIHPKQPLVNGTARSSYSCYCRGLTGRRLRVLSPRRRCYQLATVAAIVEEVHT 63
           S  +I YQ+HPKQPLVNGT RSSY+ Y RG   + L VLS RRRC +LATVAAIVEEVH 
Sbjct: 5   STSSIFYQLHPKQPLVNGTPRSSYTRYWRGSIEQTLSVLSSRRRCSRLATVAAIVEEVHK 64

Query: 64  LESGREKPRFRWVEVGSDITEMQKQAISQLPPKMTKRCKAVMKQIICFSPQNGNLSDMLA 123
           LE+ REKPRFRWVEVGSDITE+QKQAISQLPPKMTKRCKA+MKQIICFSPQ G+LSDML 
Sbjct: 65  LENEREKPRFRWVEVGSDITEIQKQAISQLPPKMTKRCKALMKQIICFSPQKGSLSDMLT 124

Query: 124 AWVRIMKPKRADWLSVLKHLRILDHPLYIEVAEAALVERTFEASTRDYTKIIHYYGKRNQ 183
           AWVRIMKP+RADWLSVLKHLRI +HPLYIEVAEAALVE TFEA+TRDYTKIIH+YGKRNQ
Sbjct: 125 AWVRIMKPERADWLSVLKHLRISNHPLYIEVAEAALVEITFEANTRDYTKIIHHYGKRNQ 184

Query: 184 LEDAERILLSMRERGFACDQITLTTMIHIYSKADELSLAKQTFEELKLLEEPLDRRSYDA 243
           LEDAE++LLSMRERG ACDQITLTTMIHIYSKAD L+LAKQTFEELKLLE+ LDRRSY A
Sbjct: 185 LEDAEKVLLSMRERGLACDQITLTTMIHIYSKADRLNLAKQTFEELKLLEKSLDRRSYGA 244

Query: 244 MIMAFIRAGMPEEGENILKEMDEKDIYAGSEVYKALLRAYSMASNADGAQRVFDAIQLAA 303
           MIMA++RAGMPEEGENILKEMD K IYAGSEVYKALLRAYSMA NA+GAQRVFDAIQLA 
Sbjct: 245 MIMAYVRAGMPEEGENILKEMDAKQIYAGSEVYKALLRAYSMAGNAEGAQRVFDAIQLAV 304

Query: 304 IPPDDKLCGLLINAYLMAGQSQKAQIAFDNMRRAGLEPSDKCIALVLSAYEKENRLNAAL 363
           IPPD+KLCGLLINAYLMAG+S+KAQIAFDNMRRAG+EPSDKCIALVLSAYE ENRLNAAL
Sbjct: 305 IPPDEKLCGLLINAYLMAGESRKAQIAFDNMRRAGIEPSDKCIALVLSAYETENRLNAAL 364

Query: 364 ELLIDLEKEKLMVGKEASE 382
           ELLIDLEK+ L+V KEASE
Sbjct: 365 ELLIDLEKDNLVVRKEASE 383

BLAST of Cp4.1LG03g17120 vs. ExPASy TrEMBL
Match: A0A6J1IR15 (pentatricopeptide repeat-containing protein At1g01970 OS=Cucurbita maxima OX=3661 GN=LOC111479721 PE=4 SV=1)

HSP 1 Score: 739 bits (1908), Expect = 1.96e-268
Identity = 374/382 (97.91%), Positives = 379/382 (99.21%), Query Frame = 0

Query: 1   MGTSVCNILYQIHPKQPLVNGTARSSYSCYCRGLTGRRLRVLSPRRRCYQLATVAAIVEE 60
           MGTSVCNILYQIHPKQPLVNGTARSSYSCYCRGLTGRRLRVLSPRR CYQLATVAAIVEE
Sbjct: 1   MGTSVCNILYQIHPKQPLVNGTARSSYSCYCRGLTGRRLRVLSPRRTCYQLATVAAIVEE 60

Query: 61  VHTLESGREKPRFRWVEVGSDITEMQKQAISQLPPKMTKRCKAVMKQIICFSPQNGNLSD 120
           VH LESGREKPRFRWVEVGSDITEMQKQAISQLPPKMTKRCKAVMKQIICF PQNG+LSD
Sbjct: 61  VHKLESGREKPRFRWVEVGSDITEMQKQAISQLPPKMTKRCKAVMKQIICFLPQNGHLSD 120

Query: 121 MLAAWVRIMKPKRADWLSVLKHLRILDHPLYIEVAEAALVERTFEASTRDYTKIIHYYGK 180
           MLAAWVRIMKPKRADWLSVLKHLRILDHPLYIEVAEAALVERTFEASTRDYTKIIHYYGK
Sbjct: 121 MLAAWVRIMKPKRADWLSVLKHLRILDHPLYIEVAEAALVERTFEASTRDYTKIIHYYGK 180

Query: 181 RNQLEDAERILLSMRERGFACDQITLTTMIHIYSKADELSLAKQTFEELKLLEEPLDRRS 240
           +NQLEDAERILLSMRERGFACDQITLTTMIHIYSKADEL+LAKQTFEELKLLEEPLDRRS
Sbjct: 181 QNQLEDAERILLSMRERGFACDQITLTTMIHIYSKADELNLAKQTFEELKLLEEPLDRRS 240

Query: 241 YDAMIMAFIRAGMPEEGENILKEMDEKDIYAGSEVYKALLRAYSMASNADGAQRVFDAIQ 300
           YDAMIMAF+RAGMPEEGENILKEMDEKDIYAGSEVYKALLRAYSMASNA+GAQRVFDAIQ
Sbjct: 241 YDAMIMAFVRAGMPEEGENILKEMDEKDIYAGSEVYKALLRAYSMASNAEGAQRVFDAIQ 300

Query: 301 LAAIPPDDKLCGLLINAYLMAGQSQKAQIAFDNMRRAGLEPSDKCIALVLSAYEKENRLN 360
           LAAIPPDDKLCGLLINAYLMAGQSQKAQIAFDNMRRAGLEPSDKCIALVLSAYEKENRLN
Sbjct: 301 LAAIPPDDKLCGLLINAYLMAGQSQKAQIAFDNMRRAGLEPSDKCIALVLSAYEKENRLN 360

Query: 361 AALELLIDLEKEKLMVGKEASE 382
           AALELLIDLEKEKLMVGKEASE
Sbjct: 361 AALELLIDLEKEKLMVGKEASE 382

BLAST of Cp4.1LG03g17120 vs. ExPASy TrEMBL
Match: A0A6J1EER0 (pentatricopeptide repeat-containing protein At1g01970 OS=Cucurbita moschata OX=3662 GN=LOC111431862 PE=4 SV=1)

HSP 1 Score: 737 bits (1902), Expect = 1.61e-267
Identity = 374/382 (97.91%), Positives = 377/382 (98.69%), Query Frame = 0

Query: 1   MGTSVCNILYQIHPKQPLVNGTARSSYSCYCRGLTGRRLRVLSPRRRCYQLATVAAIVEE 60
           M TSVCNILYQIHPKQPLVNGTARSSYSCYCRGLTGRRLRVLSPRRRCYQL  VAAIVEE
Sbjct: 1   METSVCNILYQIHPKQPLVNGTARSSYSCYCRGLTGRRLRVLSPRRRCYQLCAVAAIVEE 60

Query: 61  VHTLESGREKPRFRWVEVGSDITEMQKQAISQLPPKMTKRCKAVMKQIICFSPQNGNLSD 120
           VH LESGREKPRFRW+EVGSDITEMQKQAISQLPPKMTKRCKAVMKQIICFSPQNGNLSD
Sbjct: 61  VHKLESGREKPRFRWLEVGSDITEMQKQAISQLPPKMTKRCKAVMKQIICFSPQNGNLSD 120

Query: 121 MLAAWVRIMKPKRADWLSVLKHLRILDHPLYIEVAEAALVERTFEASTRDYTKIIHYYGK 180
           MLAAWVRIMKPKRADWLSVLKHLRILDHPLYIEVAEAALVERTFEASTRDYTKIIHYYGK
Sbjct: 121 MLAAWVRIMKPKRADWLSVLKHLRILDHPLYIEVAEAALVERTFEASTRDYTKIIHYYGK 180

Query: 181 RNQLEDAERILLSMRERGFACDQITLTTMIHIYSKADELSLAKQTFEELKLLEEPLDRRS 240
           RNQLEDAERILLSMRERGFACDQITLTTMIHIYSKADELSLAKQTFEELKLLEEPLDRRS
Sbjct: 181 RNQLEDAERILLSMRERGFACDQITLTTMIHIYSKADELSLAKQTFEELKLLEEPLDRRS 240

Query: 241 YDAMIMAFIRAGMPEEGENILKEMDEKDIYAGSEVYKALLRAYSMASNADGAQRVFDAIQ 300
           YDAMIMAFIRAGMPEEGENILKEMDEKDIYAGSEVYKALLRAYSMASNA+GAQRVFDAIQ
Sbjct: 241 YDAMIMAFIRAGMPEEGENILKEMDEKDIYAGSEVYKALLRAYSMASNAEGAQRVFDAIQ 300

Query: 301 LAAIPPDDKLCGLLINAYLMAGQSQKAQIAFDNMRRAGLEPSDKCIALVLSAYEKENRLN 360
           LAAIPPDDKLCGLLINAYLMA QSQKAQIAFDNMRRAGLEPSDKCIALVLSAYEKENRLN
Sbjct: 301 LAAIPPDDKLCGLLINAYLMADQSQKAQIAFDNMRRAGLEPSDKCIALVLSAYEKENRLN 360

Query: 361 AALELLIDLEKEKLMVGKEASE 382
           AALELLIDLEKEKL+VGKEASE
Sbjct: 361 AALELLIDLEKEKLVVGKEASE 382

BLAST of Cp4.1LG03g17120 vs. ExPASy TrEMBL
Match: A0A6J1D001 (pentatricopeptide repeat-containing protein At1g01970 OS=Momordica charantia OX=3673 GN=LOC111016111 PE=4 SV=1)

HSP 1 Score: 613 bits (1581), Expect = 1.25e-218
Identity = 306/382 (80.10%), Positives = 341/382 (89.27%), Query Frame = 0

Query: 1   MGTSVCNILYQIHPKQPLVNGTARSSYSCYCRGLTGRRLRVLSPRRRCYQLATVAAIVEE 60
           MGT  CNI+YQ+HP+QPLVNG A  S  CY  G   +R RV S RR C QLA  AAIVE 
Sbjct: 1   MGTFPCNIIYQLHPRQPLVNGIAEGSCYCYLGGSVEQRRRVFSSRRSCSQLAVAAAIVER 60

Query: 61  VHTLESGREKPRFRWVEVGSDITEMQKQAISQLPPKMTKRCKAVMKQIICFSPQNGNLSD 120
           V    S  EK RFRWVEVGSDITE QKQAIS+LPPKM KRCKA+M+QIICFSPQ GNLSD
Sbjct: 61  VRETGSEIEKLRFRWVEVGSDITETQKQAISRLPPKMAKRCKALMRQIICFSPQKGNLSD 120

Query: 121 MLAAWVRIMKPKRADWLSVLKHLRILDHPLYIEVAEAALVERTFEASTRDYTKIIHYYGK 180
           +L AWVRIMKPKRADWL VLKHLR+ +HP YIEVAEAAL+E+TFEASTRD+TKIIHYYGK
Sbjct: 121 LLTAWVRIMKPKRADWLLVLKHLRLFNHPFYIEVAEAALLEKTFEASTRDFTKIIHYYGK 180

Query: 181 RNQLEDAERILLSMRERGFACDQITLTTMIHIYSKADELSLAKQTFEELKLLEEPLDRRS 240
           +N+LEDAE+ILLSM+E+GFACDQITLTTM+HIYSKAD+L+LAKQTFEELKLLE+PLD+RS
Sbjct: 181 QNRLEDAEKILLSMKEKGFACDQITLTTMVHIYSKADKLNLAKQTFEELKLLEQPLDKRS 240

Query: 241 YDAMIMAFIRAGMPEEGENILKEMDEKDIYAGSEVYKALLRAYSMASNADGAQRVFDAIQ 300
           Y AMIMA+IRAGMP+EGE+IL+EMD K+IYAGSEVYKALLRAYSM  NA+GAQRVFDAIQ
Sbjct: 241 YGAMIMAYIRAGMPQEGESILREMDAKEIYAGSEVYKALLRAYSMTGNAEGAQRVFDAIQ 300

Query: 301 LAAIPPDDKLCGLLINAYLMAGQSQKAQIAFDNMRRAGLEPSDKCIALVLSAYEKENRLN 360
           LAAIPPD+KLCGLLINAYLMAGQ+QK +I+FDNMR+AGLEP DKCIALVL+AYEKENRLN
Sbjct: 301 LAAIPPDEKLCGLLINAYLMAGQTQKVRISFDNMRKAGLEPVDKCIALVLAAYEKENRLN 360

Query: 361 AALELLIDLEKEKLMVGKEASE 382
           AALELLIDLEKE LMVGKEASE
Sbjct: 361 AALELLIDLEKENLMVGKEASE 382

BLAST of Cp4.1LG03g17120 vs. ExPASy TrEMBL
Match: A0A1S3AWA7 (pentatricopeptide repeat-containing protein At1g01970 OS=Cucumis melo OX=3656 GN=LOC103483346 PE=4 SV=1)

HSP 1 Score: 608 bits (1567), Expect = 1.31e-216
Identity = 311/382 (81.41%), Positives = 345/382 (90.31%), Query Frame = 0

Query: 1   MGTSVCNILYQIHPKQPLVNGTARSSYSCYCRGLTGRRLRVLSPRRRCYQLATVAAIVEE 60
           M  S  NILYQ+H   PLVNGT+ +S S Y +        VL+ RRRC Q+ATV AIV+E
Sbjct: 1   MHISTSNILYQLH--LPLVNGTSNTSSSRYWKDSI-----VLNSRRRCSQMATVTAIVDE 60

Query: 61  VHTLESGREKPRFRWVEVGSDITEMQKQAISQLPPKMTKRCKAVMKQIICFSPQNGNLSD 120
           +H LES REKPRFRWVEVG +ITE QKQAISQLPPKMTK+CKAVMKQIICFSPQ G LSD
Sbjct: 61  LHKLESEREKPRFRWVEVGYNITETQKQAISQLPPKMTKKCKAVMKQIICFSPQKGELSD 120

Query: 121 MLAAWVRIMKPKRADWLSVLKHLRILDHPLYIEVAEAALVERTFEASTRDYTKIIHYYGK 180
           MLAAWVRIMKP+RADWLSVLKHLRIL+HPLYI+VAEAALVE TFEA+TRDYTKIIH+YGK
Sbjct: 121 MLAAWVRIMKPERADWLSVLKHLRILNHPLYIQVAEAALVEITFEANTRDYTKIIHHYGK 180

Query: 181 RNQLEDAERILLSMRERGFACDQITLTTMIHIYSKADELSLAKQTFEELKLLEEPLDRRS 240
           +NQLEDAE++LL+MRERGFACDQITLTTMIHIYSKAD+L LAKQTFEELKLLE+ LD+RS
Sbjct: 181 QNQLEDAEKVLLTMRERGFACDQITLTTMIHIYSKADKLKLAKQTFEELKLLEQSLDKRS 240

Query: 241 YDAMIMAFIRAGMPEEGENILKEMDEKDIYAGSEVYKALLRAYSMASNADGAQRVFDAIQ 300
           Y AMIMA++RAG+PEEGE ILKEMD KDIYAGSEVYKALLRAYSMA +A+GAQRVFDAIQ
Sbjct: 241 YGAMIMAYVRAGLPEEGEKILKEMDAKDIYAGSEVYKALLRAYSMAGDAEGAQRVFDAIQ 300

Query: 301 LAAIPPDDKLCGLLINAYLMAGQSQKAQIAFDNMRRAGLEPSDKCIALVLSAYEKENRLN 360
           LAAIPPD+KLCGLL+NAYLMAGQS+KAQIAFDNMRRAG+EPSDKCIAL LSAYEKENRLN
Sbjct: 301 LAAIPPDEKLCGLLMNAYLMAGQSRKAQIAFDNMRRAGIEPSDKCIALALSAYEKENRLN 360

Query: 361 AALELLIDLEKEKLMVGKEASE 382
           AALELLIDLEK+ +MVGKEAS+
Sbjct: 361 AALELLIDLEKDNVMVGKEASQ 375

BLAST of Cp4.1LG03g17120 vs. ExPASy TrEMBL
Match: A0A5A7U612 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold171G002570 PE=4 SV=1)

HSP 1 Score: 608 bits (1567), Expect = 1.31e-216
Identity = 311/382 (81.41%), Positives = 345/382 (90.31%), Query Frame = 0

Query: 1   MGTSVCNILYQIHPKQPLVNGTARSSYSCYCRGLTGRRLRVLSPRRRCYQLATVAAIVEE 60
           M  S  NILYQ+H   PLVNGT+ +S S Y +        VL+ RRRC Q+ATV AIV+E
Sbjct: 1   MHISTSNILYQLH--LPLVNGTSNTSSSRYWKDSI-----VLNSRRRCSQMATVTAIVDE 60

Query: 61  VHTLESGREKPRFRWVEVGSDITEMQKQAISQLPPKMTKRCKAVMKQIICFSPQNGNLSD 120
           +H LES REKPRFRWVEVG +ITE QKQAISQLPPKMTK+CKAVMKQIICFSPQ G LSD
Sbjct: 61  LHKLESEREKPRFRWVEVGYNITETQKQAISQLPPKMTKKCKAVMKQIICFSPQKGELSD 120

Query: 121 MLAAWVRIMKPKRADWLSVLKHLRILDHPLYIEVAEAALVERTFEASTRDYTKIIHYYGK 180
           MLAAWVRIMKP+RADWLSVLKHLRIL+HPLYI+VAEAALVE TFEA+TRDYTKIIH+YGK
Sbjct: 121 MLAAWVRIMKPERADWLSVLKHLRILNHPLYIQVAEAALVEITFEANTRDYTKIIHHYGK 180

Query: 181 RNQLEDAERILLSMRERGFACDQITLTTMIHIYSKADELSLAKQTFEELKLLEEPLDRRS 240
           +NQLEDAE++LL+MRERGFACDQITLTTMIHIYSKAD+L LAKQTFEELKLLE+ LD+RS
Sbjct: 181 QNQLEDAEKVLLTMRERGFACDQITLTTMIHIYSKADKLKLAKQTFEELKLLEQSLDKRS 240

Query: 241 YDAMIMAFIRAGMPEEGENILKEMDEKDIYAGSEVYKALLRAYSMASNADGAQRVFDAIQ 300
           Y AMIMA++RAG+PEEGE ILKEMD KDIYAGSEVYKALLRAYSMA +A+GAQRVFDAIQ
Sbjct: 241 YGAMIMAYVRAGLPEEGEKILKEMDAKDIYAGSEVYKALLRAYSMAGDAEGAQRVFDAIQ 300

Query: 301 LAAIPPDDKLCGLLINAYLMAGQSQKAQIAFDNMRRAGLEPSDKCIALVLSAYEKENRLN 360
           LAAIPPD+KLCGLL+NAYLMAGQS+KAQIAFDNMRRAG+EPSDKCIAL LSAYEKENRLN
Sbjct: 301 LAAIPPDEKLCGLLMNAYLMAGQSRKAQIAFDNMRRAGIEPSDKCIALALSAYEKENRLN 360

Query: 361 AALELLIDLEKEKLMVGKEASE 382
           AALELLIDLEK+ +MVGKEAS+
Sbjct: 361 AALELLIDLEKDNVMVGKEASQ 375

BLAST of Cp4.1LG03g17120 vs. TAIR 10
Match: AT1G01970.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 413.3 bits (1061), Expect = 2.1e-115
Identity = 213/382 (55.76%), Positives = 280/382 (73.30%), Query Frame = 0

Query: 1   MGTSVCNILYQIHPKQPLVNGTARSSYSCYCRGLTGRRLRVLSP-RRRCYQLATVAAIVE 60
           MG   C+ +     K PLV    R     Y R      L V S   R C      +  + 
Sbjct: 1   MGIYSCSAVLSFGLKCPLVIARHR----LYHRMFRRNPLLVESHLNRLCSCKCNASLAIG 60

Query: 61  EVHTLESGREKPRFRWVEVGSDITEMQKQAISQLPPKMTKRCKAVMKQIICFSPQNGNLS 120
           EV   E   +   F W +VG ++TE Q +AI+++P KM+KRC+A+M+QIICFSP+ G+  
Sbjct: 61  EVVEKEDAEQSRSFNWADVGLNLTEEQDEAITRIPIKMSKRCQALMRQIICFSPEKGSFC 120

Query: 121 DMLAAWVRIMKPKRADWLSVLKHLRILDHPLYIEVAEAALVERTFEASTRDYTKIIHYYG 180
           D+L AW+R M P RADWLS+LK L+ LD P YI+VAE +L++ +FEA+ RDYTKIIHYYG
Sbjct: 121 DLLGAWLRRMNPIRADWLSILKELKNLDSPFYIKVAEFSLLQDSFEANARDYTKIIHYYG 180

Query: 181 KRNQLEDAERILLSMRERGFACDQITLTTMIHIYSKADELSLAKQTFEELKLLEEPLDRR 240
           K NQ+EDAER LLSM+ RGF  DQ+TLT M+ +YSKA    LA++TF E+KLL EPLD R
Sbjct: 181 KLNQVEDAERTLLSMKNRGFLIDQVTLTAMVQLYSKAGCHKLAEETFNEIKLLGEPLDYR 240

Query: 241 SYDAMIMAFIRAGMPEEGENILKEMDEKDIYAGSEVYKALLRAYSMASNADGAQRVFDAI 300
           SY +MIMA+IRAG+PE+GE++L+EMD ++I AG EVYKALLR YSM  +A+GA+RVFDA+
Sbjct: 241 SYGSMIMAYIRAGVPEKGESLLREMDSQEICAGREVYKALLRDYSMGGDAEGAKRVFDAV 300

Query: 301 QLAAIPPDDKLCGLLINAYLMAGQSQKAQIAFDNMRRAGLEPSDKCIALVLSAYEKENRL 360
           Q+A I PD KLCGLLINAY ++GQSQ A++AF+NMR+AG++ +DKC+ALVL+AYEKE +L
Sbjct: 301 QIAGITPDVKLCGLLINAYSVSGQSQNARLAFENMRKAGIKATDKCVALVLAAYEKEEKL 360

Query: 361 NAALELLIDLEKEKLMVGKEAS 382
           N AL  L++LEK+ +M+GKEAS
Sbjct: 361 NEALGFLVELEKDSIMLGKEAS 378

BLAST of Cp4.1LG03g17120 vs. TAIR 10
Match: AT1G19520.1 (pentatricopeptide (PPR) repeat-containing protein )

HSP 1 Score: 228.0 bits (580), Expect = 1.3e-59
Identity = 120/305 (39.34%), Positives = 187/305 (61.31%), Query Frame = 0

Query: 74  RWVEVGSDITEMQKQAISQLPPKMTKRCKAVMKQIICFSPQNGNLSDMLAAWVRIMKPKR 133
           +WVE+   I E +++A  + P  +T +CK VM+++     +  + S +LA W  +++P R
Sbjct: 291 KWVEMADKIHEAEEEADWREPKPVTGKCKLVMEKLESLQ-EGDDPSGLLAEWAELLEPNR 350

Query: 134 ADWLSVLKHLRILDHPLYIEVAEAALVERTFEASTRDYTKIIHYYGKRNQLEDAERILLS 193
            DW++++  LR  +   Y++VAE  L E++F AS  DY+K+IH + K N +ED ERIL  
Sbjct: 351 VDWIALINQLREGNTHAYLKVAEGVLDEKSFNASISDYSKLIHIHAKENHIEDVERILKK 410

Query: 194 MRERGFACDQITLTTMIHIYSKADELSLAKQTFEELKLLEEPLDRRSYDAMIMAFIRAGM 253
           M + G   D +T T ++H+YSK+     A + FE LK      D + Y+AMI+ ++ AG 
Sbjct: 411 MSQNGIFPDILTATALVHMYSKSGNFERATEAFENLKSYGLRPDEKIYEAMILGYVNAGK 470

Query: 254 PEEGENILKEMDEKDIYAGSEVYKALLRAYSMASNADGAQRVFDAIQLAAIPP-DDKLCG 313
           P+ GE ++KEM  K++ A  EVY ALLRAY+   +A+GA  +  ++Q A+  P   +   
Sbjct: 471 PKLGERLMKEMQAKELKASEEVYMALLRAYAQMGDANGAAGISSSMQYASDGPLSFEAYS 530

Query: 314 LLINAYLMAGQSQKAQIAFDNMRRAGLEPSDKCIALVLSAYEKENRLNAALELLIDLEKE 373
           L + AY  AGQ  KA+  FD MR+ G +P DKCIA ++ AY+ EN L+ AL LL+ LEK+
Sbjct: 531 LFVEAYGKAGQVDKAKSNFDEMRKLGHKPDDKCIANLVRAYKGENSLDKALRLLLQLEKD 590

Query: 374 KLMVG 378
            + +G
Sbjct: 591 GIEIG 594

BLAST of Cp4.1LG03g17120 vs. TAIR 10
Match: AT5G04810.1 (pentatricopeptide (PPR) repeat-containing protein )

HSP 1 Score: 88.6 bits (218), Expect = 1.2e-17
Identity = 73/306 (23.86%), Positives = 141/306 (46.08%), Query Frame = 0

Query: 68  REKPRFRWVEVGSDITEMQKQAISQLPPKMTKRCKAVMKQIICFSPQNGNLSDMLAAWVR 127
           + + R RWVE G + T+M  ++      + +++    +++I+       N   +++A+ +
Sbjct: 249 KAEQRVRWVEEGEEDTKMSNKSSWHQEREGSRKS---LQRIL--DTNGDNWQAVISAFEK 308

Query: 128 IMKPKRADW-LSVLKHLRILDHPLYIEVAEAALVERTFEASTRDYTKIIHYYGKRNQLED 187
           I KP R ++ L V  + R  D     E  E  +  R    ++R YT +IH Y     +++
Sbjct: 309 ISKPSRTEFGLMVKFYGRRGDMHRARETFE-RMRARGITPTSRIYTSLIHAYAVGRDMDE 368

Query: 188 AERILLSMRERGFACDQITLTTMIHIYSKADELSLAKQTFEELKLLEEPLDRRSYDAMIM 247
           A   +  M+E G     +T + ++  +SKA     A   F+E K + + L+   Y  +I 
Sbjct: 369 ALSCVRKMKEEGIEMSLVTYSVIVGGFSKAGHAEAADYWFDEAKRIHKTLNASIYGKIIY 428

Query: 248 AFIRAGMPEEGENILKEMDEKDIYAGSEVYKALLRAYSMASNADGAQRVFDAIQLAAIPP 307
           A  +    E  E +++EM+E+ I A   +Y  ++  Y+M ++      VF  ++     P
Sbjct: 429 AHCQTCNMERAEALVREMEEEGIDAPIAIYHTMMDGYTMVADEKKGLVVFKRLKECGFTP 488

Query: 308 DDKLCGLLINAYLMAGQSQKAQIAFDNMRRAGLEPSDKCIALVLSAYEKENRLNAALELL 367
                G LIN Y   G+  KA      M+  G++ + K  +++++ + K      A  + 
Sbjct: 489 TVVTYGCLINLYTKVGKISKALEVSRVMKEEGVKHNLKTYSMMINGFVKLKDWANAFAVF 548

Query: 368 IDLEKE 373
            D+ KE
Sbjct: 549 EDMVKE 548

BLAST of Cp4.1LG03g17120 vs. TAIR 10
Match: AT2G35130.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 82.8 bits (203), Expect = 6.6e-16
Identity = 50/176 (28.41%), Positives = 89/176 (50.57%), Query Frame = 0

Query: 180 KRNQLEDAERILLSMRERGFACDQITLTTMIHIYSKADELSLAKQTFEELKLLEEPLDRR 239
           ++   E+A  +   M+         T   MI++Y KA +  ++ + + E++  +   +  
Sbjct: 241 RKGNTEEAIDVFQRMKRDRCKPTTETYNLMINLYGKASKSYMSWKLYCEMRSHQCKPNIC 300

Query: 240 SYDAMIMAFIRAGMPEEGENILKEMDEKDIYAGSEVYKALLRAYSMASNADGAQRVFDAI 299
           +Y A++ AF R G+ E+ E I +++ E  +     VY AL+ +YS A    GA  +F  +
Sbjct: 301 TYTALVNAFAREGLCEKAEEIFEQLQEDGLEPDVYVYNALMESYSRAGYPYGAAEIFSLM 360

Query: 300 QLAAIPPDDKLCGLLINAYLMAGQSQKAQIAFDNMRRAGLEPSDKCIALVLSAYEK 356
           Q     PD     ++++AY  AG    A+  F+ M+R G+ P+ K   L+LSAY K
Sbjct: 361 QHMGCEPDRASYNIMVDAYGRAGLHSDAEAVFEEMKRLGIAPTMKSHMLLLSAYSK 416

BLAST of Cp4.1LG03g17120 vs. TAIR 10
Match: AT2G35130.2 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 82.8 bits (203), Expect = 6.6e-16
Identity = 50/176 (28.41%), Positives = 89/176 (50.57%), Query Frame = 0

Query: 180 KRNQLEDAERILLSMRERGFACDQITLTTMIHIYSKADELSLAKQTFEELKLLEEPLDRR 239
           ++   E+A  +   M+         T   MI++Y KA +  ++ + + E++  +   +  
Sbjct: 263 RKGNTEEAIDVFQRMKRDRCKPTTETYNLMINLYGKASKSYMSWKLYCEMRSHQCKPNIC 322

Query: 240 SYDAMIMAFIRAGMPEEGENILKEMDEKDIYAGSEVYKALLRAYSMASNADGAQRVFDAI 299
           +Y A++ AF R G+ E+ E I +++ E  +     VY AL+ +YS A    GA  +F  +
Sbjct: 323 TYTALVNAFAREGLCEKAEEIFEQLQEDGLEPDVYVYNALMESYSRAGYPYGAAEIFSLM 382

Query: 300 QLAAIPPDDKLCGLLINAYLMAGQSQKAQIAFDNMRRAGLEPSDKCIALVLSAYEK 356
           Q     PD     ++++AY  AG    A+  F+ M+R G+ P+ K   L+LSAY K
Sbjct: 383 QHMGCEPDRASYNIMVDAYGRAGLHSDAEAVFEEMKRLGIAPTMKSHMLLLSAYSK 438

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
Q9LPC4 | 3.0e-114 | 55.76 | Pentatricopeptide repeat-containing protein At1g01970 OS=Arabidopsis thaliana OX...
Q940Z1 | 3.8e-32 | 41.08 | Pentatricopeptide repeat-containing protein At1g19525 OS=Arabidopsis thaliana OX...
Q0WMY5 | 1.7e-16 | 23.86 | Pentatricopeptide repeat-containing protein At5g04810, chloroplastic OS=Arabidop...
O82178 | 9.3e-15 | 28.41 | Pentatricopeptide repeat-containing protein At2g35130 OS=Arabidopsis thaliana OX...
O64624 | 6.0e-14 | 28.86 | Pentatricopeptide repeat-containing protein At2g18940, chloroplastic OS=Arabidop...

Match Name | E-value | Identity | Description
XP_023526280.1 | 1.88e-273 | 100.00 | pentatricopeptide repeat-containing protein At1g01970 isoform X1 [Cucurbita pepo...
KAG6582566.1 | 2.85e-268 | 97.91 | Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma sub...
XP_022980308.1 | 4.05e-268 | 97.91 | pentatricopeptide repeat-containing protein At1g01970 [Cucurbita maxima] >XP_022...
XP_022924339.1 | 3.32e-267 | 97.91 | pentatricopeptide repeat-containing protein At1g01970 [Cucurbita moschata] >XP_0...
XP_038903030.1 | 2.80e-227 | 85.22 | pentatricopeptide repeat-containing protein At1g01970 [Benincasa hispida] >XP_03...

Match Name | E-value | Identity | Description
A0A6J1IR15 | 1.96e-268 | 97.91 | pentatricopeptide repeat-containing protein At1g01970 OS=Cucurbita maxima OX=366...
A0A6J1EER0 | 1.61e-267 | 97.91 | pentatricopeptide repeat-containing protein At1g01970 OS=Cucurbita moschata OX=3...
A0A6J1D001 | 1.25e-218 | 80.10 | pentatricopeptide repeat-containing protein At1g01970 OS=Momordica charantia OX=...
A0A1S3AWA7 | 1.31e-216 | 81.41 | pentatricopeptide repeat-containing protein At1g01970 OS=Cucumis melo OX=3656 GN...
A0A5A7U612 | 1.31e-216 | 81.41 | Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946...

Match Name | E-value | Identity | Description
AT1G01970.1 | 2.1e-115 | 55.76 | Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G19520.1 | 1.3e-59 | 39.34 | pentatricopeptide (PPR) repeat-containing protein
AT5G04810.1 | 1.2e-17 | 23.86 | pentatricopeptide (PPR) repeat-containing protein
AT2G35130.1 | 6.6e-16 | 28.41 | Tetratricopeptide repeat (TPR)-like superfamily protein
AT2G35130.2 | 6.6e-16 | 28.41 | Tetratricopeptide repeat (TPR)-like superfamily protein

InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
None | No IPR available | COILS | Coil | Coil | coord: 352..372
None | No IPR available | PANTHER | PTHR46862:SF3 | OS07G0661900 PROTEIN | coord: 54..382
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 152..302, e-value: 4.3E-28, score: 100.6
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 303..374, e-value: 1.5E-7, score: 33.1
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR | coord: 313..339, e-value: 0.12, score: 12.6; coord: 240..269, e-value: 1.8E-5, score: 24.7
IPR002885 | Pentatricopeptide repeat | PFAM | PF13041 | PPR_2 | coord: 171..212, e-value: 3.9E-7, score: 30.2
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 171..202, e-value: 4.1E-6, score: 24.6; coord: 240..270, e-value: 4.4E-5, score: 21.4
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 167..201, score: 9.262356
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 237..271, score: 10.270796
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 307..341, score: 9.45966
IPR044657 | Pentatricopeptide repeat-containing protein NFD5-like | PANTHER | PTHR46862 | OS07G0661900 PROTEIN | coord: 54..382
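
The three PROSITE PS51375 hits listed above each span 35 residues, which matches the usual length of a pentatricopeptide repeat motif. A minimal sketch of that arithmetic follows; it assumes the `coord: start..end` values are 1-based and inclusive, which the annotation itself does not state, and the helper name `region_lengths` is hypothetical.

```python
import re

# Matches alignment coordinates written as "coord: START..END".
COORD = re.compile(r"coord: (\d+)\.\.(\d+)")


def region_lengths(text):
    """Return the length of every 'coord: a..b' region found in text.

    Assumes 1-based inclusive coordinates, so length = end - start + 1.
    """
    return [int(end) - int(start) + 1 for start, end in COORD.findall(text)]


if __name__ == "__main__":
    # Coordinates of the three PROSITE PS51375 (PPR) hits from the table.
    prosite_ppr = "coord: 167..201 coord: 237..271 coord: 307..341"
    print(region_lengths(prosite_ppr))  # [35, 35, 35]
```

The same helper applies to the other sources in the table, for example to compare the extent of the two GENE3D regions.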

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Cp4.1LG03g17120.1 | Cp4.1LG03g17120.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
molecular_function | GO:0005515 | protein binding