Cp4.1LG20g00350 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
NameCp4.1LG20g00350
Typegene
OrganismCucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
DescriptionPentatricopeptide repeat-containing protein
LocationCp4.1LG20: 117207 .. 120420 (-)
RNA-Seq ExpressionCp4.1LG20g00350
SyntenyCp4.1LG20g00350
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGAAAGTTGCTTTAGAAATGAGTAAAATTTGCCTACTAGCTGGGTTCAGATAATGCCACGGGGCTTCTCCGGGAACAGAGCCGGCTTCCGGCGAGCAGAAGACGGCGACTGATGAAACGCTGTGTGAGACAGAGACTTGATGGACGACACGAGGAGAGGATGGAGAGAAATGAGTGGAGGAGATTTTTTTTTTAAGATTTTTAAAATAAATAAATAAATAAAAATAAATAAAATAAAATAGTTAAGAGGAGTATGGAATTTTAAATAAAATAAAATGAACAAATGACATGTTGCTAAATTCCGGGTTAGAATGAATAGCAGAGAATTTCACTGCCTTGCGTTATTCAGCAAATGCAGATCTCTTAGAACCTTGAAGCAAATCCACGCTTTTACGTTCAAAACAGGCTTAAATTCCGACCCATTAGTCGCCGGCAAGCTTCTTCTTCATTGTGCAGTTACACTTCCTGATTCTGTTCGCTATGCTGGACGCCTCTTCCTTGACATTCGAAATCCAGATGTGTTCATGTACAACACACTCATCCGTGGACTTTCCGATTCTGACACCCCCTCTCATGCCCTTCTACTGTTTGTTGAAATGCGTCGCAAATCCATGGCTTTACCCGATAGTTTCTCTTTTGCTTTTCTGCTCAAAGCTGCCGCTAATTGCAGGGCTCTCAGAAATGGGTTGCAATTGCATCGCCAAGCTATTGGTTATGGTCTGGATACCCATCTTTTTGTTGGGACGACACTGATCAGCATGTATGCTGAATGTGCAAGTTTGGCCTTTGCACGGCAGGTGTTTGATGAAATGATTGAACCAAATATTGTTGCTTGGAATGCCATTGTTGCTGCGTGTTTTAGGTGCGAGGACGTTAAGGATGCAGAGCAAGTGTTCCATCGGATGCCCATAAAAAACTTAACCTCGTGGAACATCATGCTTGCAGGGTACACAAAAGCAGGTGAGCTTCGGCTAGCCAGGGAGGTGTTTATGAAAATGCCTTTGAAAGATGAAGTTTCGTGGAGTAGTATGATTGTTGGGTTCGCTCATAATGGCAGCTTTAACAACGCTTTCGCATTTTTCAGGGAGTTGCGGCAGGAAGGGATGAGACCAAATGAGGTAAGCCTCACAGGTGCGCTTTCTGCATGTGCACAAGCTGGGGCATTCGAGTTTGGAAGAATCCTACATGGGTTTGTTGAAAAATCTGGCTTTCTGCAGATTGTTTCAGTGAGTAATGCATTGATCGATACTTACTCTAAATGTGGGAATTTGGATATGGCTCGTTTGATCTTTGATAATATGCTGGAAAGGAGTGTTGTCTCTTGGACAGCCATGATTACGGGGCTCGCAATGCATGGCTACGGGGAGGAAGCAATCAGATTATTTAATGAGATGGAAGAGTCTAATATTAAGCCCGACGGTATCACCTTTATATCCATCTTGTATGCTTGTAGCCATGCTGGATTGGTTGATTTGGGATGTTCTTATTTTTCAAGGATGGTAAATATTTACGGTATTGAACCCGTAATTGAACATTATGGTTGCATAGTTGATCTTTATGGTCGAGCTGGTAAGCTGCAGCAAGCTTTTGACTTTGTGTCTCAAATGCCAGTTTCACCGAATGATATTGTCTGGAGGACACTTCTTGGAGCTTGTGGCATTCATGGTAACTTAGAACTGGCAGGGCAAGTAAAGAGGCGACTCTTCGAACTCGACCCCGAAAACTCTGGAGATCATGTTCTTTTGTCAAACATTTATGCAGTTGCGGGGAAATGGAAGGATGTTGCCACTTTAAGAAGATCAATGACTCACCAGAAACTTAAGAAAACTCCTGGTTGGAGCATGATTGAAGTCGATAGAATCATGTATAGTTTTGTTGCAGGAGAAAAGCAAAATGACATAGCAGAAGAGGCACATCAAAAACTAAGGGAGATAATGTCGAGGCTGAGGATAGAAGGAGGTTATGTTCCAGAAGTTGGAAGTGTATTGCATGATATAGAAGTGGAAGAAAAGGAAGACTCTGTTTCACAGCACAGTGAGAAGCTAGCAGTTGCTTTTGGGATGGTGAGGCTGCCAAGAGGAAGAAGTATAAGAGTGGTGAAGAATTTAAGAATTTGCAGGGACTGTCACACTGTGATGAAGCTGATTTCTAAGGTGTATGAAGTAGAAATTGTGGTGAGAGATAGAAGTCGGTTCCACTCTTTCACACATGGTTCATGTTCGTGCAGAGACTACTGGTAATTAGTTGAGTAAACTAGAGTTGAAATTAGCATCCTATGCTAGAAGGGAAAACCTGCAAATAAGCTGATATCAGAATCAAACCAGAAAACTGATAACCTGCTACATCAATGGAGTTGCAACACATCACTGGTCCTTGCAAAAGTTAGCAAGCCCAAAGAATATTGTTGAGTAATATATTCTTTTTTGGAAGGTTACAATCCTAGAAAAGGGGAATTCAATTCTATAGTGCAGAAGGGGAGGTACTAGGTATAAATTCTGACCTCTAACTTTGGGAGAGTCGGTGGTCCATTAGAGATTGTCATATTGTGTTGTAGGTTTACGGATGTTGATCATTGAAATTGAAGTGCAATAGGTTAAGATATTAGTCTTATATGCATGCAACCATGTGGTTCCACTTAGCGATAGGTTTTGTTGGTATTGCTGCTCGTTTCTACTTTTCAACATTTTTCCGCTTCCATATTTTGTTTATGTTCTAGCCTCTCTCACCTTGTCAGTTCTTGTGGTAACTTCCCACCTTCTATAATCTGCTTGTGCCTGGCCTCTCGTATCTCATATACATCCTCTCAAAATAGTTCTTCTTTTAGATTTTGCATTTCAACAATATTTTCAGTTCAAAACTAACATTTTCTAGTATCAAAGCAAAAAGAGGTACTACTTGCATCAGGAGTTGGTTTGTGCTAGGGTTTCATCTTATGCTGGTCGAATGCAAATGAAGGCATAAGTTCGTGATCCACAATTCACACCTCTTTCCAAACATCAAACCATCCATTATAGAGAGAATTGAGAAGGCCCTAATATTTTCCTTATTGTTCCTCTTATGAAGTGCTGAGTGTGGCAGATGGGCTGTTGGATTAAAGCTTTGGTTAAAAGACCTCTGGTTGTGAGTAGAGTGTGTCACGGTCATGCTTGTCCAAAACGTGTTGTCCATGGCCACTTTTCGTTAGAGGCCAAAACTCTTGTCTCATACCACTTT

mRNA sequence

ATGGAAAATAATGCCACGGGGCTTCTCCGGGAACAGAGCCGGCTTCCGGCGAGCAGAAGACGGCGACTGATGAAACGCTGTGTGAGACAGAGACTTGATGGACGACACGAGGAGAGGATGGAGAGAAATGAAATGAATAGCAGAGAATTTCACTGCCTTGCGTTATTCAGCAAATGCAGATCTCTTAGAACCTTGAAGCAAATCCACGCTTTTACGTTCAAAACAGGCTTAAATTCCGACCCATTAGTCGCCGGCAAGCTTCTTCTTCATTGTGCAGTTACACTTCCTGATTCTGTTCGCTATGCTGGACGCCTCTTCCTTGACATTCGAAATCCAGATGTGTTCATGTACAACACACTCATCCGTGGACTTTCCGATTCTGACACCCCCTCTCATGCCCTTCTACTGTTTGTTGAAATGCGTCGCAAATCCATGGCTTTACCCGATAGTTTCTCTTTTGCTTTTCTGCTCAAAGCTGCCGCTAATTGCAGGGCTCTCAGAAATGGGGAGTTGCGGCAGGAAGGGATGAGACCAAATGAGGTAAGCCTCACAGGTGCGCTTTCTGCATGTGCACAAGCTGGGGCATTCGAGTTTGGAAGAATCCTACATGGGTTTGTTGAAAAATCTGGCTTTCTGCAGATTGTTTCAGTGAGTAATGCATTGATCGATACTTACTCTAAATGTGGGAATTTGGATATGGCTCGTTTGATCTTTGATAATATGCTGGAAAGGAGTGTTGTCTCTTGGACAGCCATGATTACGGGGCTCGCAATGCATGGCTACGGGGAGGAAGCAATCAGATTATTTAATGAGATGGAAGAGTCTAATATTAAGCCCGACGGTATCACCTTTATATCCATCTTGTATGCTTGTAGCCATGCTGGATTGGTTGATTTGGGATGTTCTTATTTTTCAAGGATGGTAAATATTTACGGTATTGAACCCGTAATTGAACATTATGGTTGCATAGTTGATCTTTATGGTCGAGCTGGTAAGCTGCAGCAAGCTTTTGACTTTGTGTCTCAAATGCCAGTTTCACCGAATGATATTGTCTGGAGGACACTTCTTGGAGCTTGTGGCATTCATGGTAACTTAGAACTGGCAGGGCAAGTAAAGAGGCGACTCTTCGAACTCGACCCCGAAAACTCTGGAGATCATGTTCTTTTGTCAAACATTTATGCAGTTGCGGGGAAATGGAAGGATGTTGCCACTTTAAGAAGATCAATGACTCACCAGAAACTTAAGAAAACTCCTGGTTGGAGCATGATTGAAGTCGATAGAATCATGTATAGTTTTGTTGCAGGAGAAAAGCAAAATGACATAGCAGAAGAGGCACATCAAAAACTAAGGGAGATAATGTCGAGGCTGAGGATAGAAGGAGGTTATGTTCCAGAAGTTGGAAGTGTATTGCATGATATAGAAGTGGAAGAAAAGGAAGACTCTGTTTCACAGCACAGTGAGAAGCTAGCAGTTGCTTTTGGGATGGTGAGGCTGCCAAGAGGAAGAAGTATAAGAGTGGTGAAGAATTTAAGAATTTGCAGGGACTGTCACACTGTGATGAAGCTGATTTCTAAGGTGTATGAAGTAGAAATTGTGGTGAGAGATAGAAGTCGGTTCCACTCTTTCACACATGGTTCATGTTCGTGCAGAGACTACTGGTAATTAGTTGAGTAAACTAGAGTTGAAATTAGCATCCTATGCTAGAAGGGAAAACCTGCAAATAAGCTGATATCAGAATCAAACCAGAAAACTGATAACCTGCTACATCAATGGAGTTGCAACACATCACTGGTCCTTGCAAAAGTTAGCAAGCCCAAAGAATATTGTTGAGTAATATATTCTTTTTTGGAAGGTTACAATCCTAGAAAAGGGGAATTCAATTCTATAGTGCAGAAGGGGAGGTACTAGGTATAAATTCTGACCTCTAACTTTGGGAGAGTCGGTGGTCCATTAGAGATTGTCATATTGTGTTGTAGGTTTACGGATGTTGATCATTGAAATTGAAGTGCAATAGGTTAAGATATTAGTCTTATATGCATGCAACCATGTGGTTCCACTTAGCGATAGGTTTTGTTGGTATTGCTGCTCGTTTCTACTTTTCAACATTTTTCCGCTTCCATATTTTGTTTATGTTCTAGCCTCTCTCACCTTGTCAGTTCTTGTGGTAACTTCCCACCTTCTATAATCTGCTTGTGCCTGGCCTCTCGTATCTCATATACATCCTCTCAAAATAGTTCTTCTTTTAGATTTTGCATTTCAACAATATTTTCAGTTCAAAACTAACATTTTCTAGTATCAAAGCAAAAAGAGGTACTACTTGCATCAGGAGTTGGTTTGTGCTAGGGTTTCATCTTATGCTGGTCGAATGCAAATGAAGGCATAAGTTCGTGATCCACAATTCACACCTCTTTCCAAACATCAAACCATCCATTATAGAGAGAATTGAGAAGGCCCTAATATTTTCCTTATTGTTCCTCTTATGAAGTGCTGAGTGTGGCAGATGGGCTGTTGGATTAAAGCTTTGGTTAAAAGACCTCTGGTTGTGAGTAGAGTGTGTCACGGTCATGCTTGTCCAAAACGTGTTGTCCATGGCCACTTTTCGTTAGAGGCCAAAACTCTTGTCTCATACCACTTT

Coding sequence (CDS)

ATGGAAAATAATGCCACGGGGCTTCTCCGGGAACAGAGCCGGCTTCCGGCGAGCAGAAGACGGCGACTGATGAAACGCTGTGTGAGACAGAGACTTGATGGACGACACGAGGAGAGGATGGAGAGAAATGAAATGAATAGCAGAGAATTTCACTGCCTTGCGTTATTCAGCAAATGCAGATCTCTTAGAACCTTGAAGCAAATCCACGCTTTTACGTTCAAAACAGGCTTAAATTCCGACCCATTAGTCGCCGGCAAGCTTCTTCTTCATTGTGCAGTTACACTTCCTGATTCTGTTCGCTATGCTGGACGCCTCTTCCTTGACATTCGAAATCCAGATGTGTTCATGTACAACACACTCATCCGTGGACTTTCCGATTCTGACACCCCCTCTCATGCCCTTCTACTGTTTGTTGAAATGCGTCGCAAATCCATGGCTTTACCCGATAGTTTCTCTTTTGCTTTTCTGCTCAAAGCTGCCGCTAATTGCAGGGCTCTCAGAAATGGGGAGTTGCGGCAGGAAGGGATGAGACCAAATGAGGTAAGCCTCACAGGTGCGCTTTCTGCATGTGCACAAGCTGGGGCATTCGAGTTTGGAAGAATCCTACATGGGTTTGTTGAAAAATCTGGCTTTCTGCAGATTGTTTCAGTGAGTAATGCATTGATCGATACTTACTCTAAATGTGGGAATTTGGATATGGCTCGTTTGATCTTTGATAATATGCTGGAAAGGAGTGTTGTCTCTTGGACAGCCATGATTACGGGGCTCGCAATGCATGGCTACGGGGAGGAAGCAATCAGATTATTTAATGAGATGGAAGAGTCTAATATTAAGCCCGACGGTATCACCTTTATATCCATCTTGTATGCTTGTAGCCATGCTGGATTGGTTGATTTGGGATGTTCTTATTTTTCAAGGATGGTAAATATTTACGGTATTGAACCCGTAATTGAACATTATGGTTGCATAGTTGATCTTTATGGTCGAGCTGGTAAGCTGCAGCAAGCTTTTGACTTTGTGTCTCAAATGCCAGTTTCACCGAATGATATTGTCTGGAGGACACTTCTTGGAGCTTGTGGCATTCATGGTAACTTAGAACTGGCAGGGCAAGTAAAGAGGCGACTCTTCGAACTCGACCCCGAAAACTCTGGAGATCATGTTCTTTTGTCAAACATTTATGCAGTTGCGGGGAAATGGAAGGATGTTGCCACTTTAAGAAGATCAATGACTCACCAGAAACTTAAGAAAACTCCTGGTTGGAGCATGATTGAAGTCGATAGAATCATGTATAGTTTTGTTGCAGGAGAAAAGCAAAATGACATAGCAGAAGAGGCACATCAAAAACTAAGGGAGATAATGTCGAGGCTGAGGATAGAAGGAGGTTATGTTCCAGAAGTTGGAAGTGTATTGCATGATATAGAAGTGGAAGAAAAGGAAGACTCTGTTTCACAGCACAGTGAGAAGCTAGCAGTTGCTTTTGGGATGGTGAGGCTGCCAAGAGGAAGAAGTATAAGAGTGGTGAAGAATTTAAGAATTTGCAGGGACTGTCACACTGTGATGAAGCTGATTTCTAAGGTGTATGAAGTAGAAATTGTGGTGAGAGATAGAAGTCGGTTCCACTCTTTCACACATGGTTCATGTTCGTGCAGAGACTACTGGTAA

Protein sequence

MENNATGLLREQSRLPASRRRRLMKRCVRQRLDGRHEERMERNEMNSREFHCLALFSKCRSLRTLKQIHAFTFKTGLNSDPLVAGKLLLHCAVTLPDSVRYAGRLFLDIRNPDVFMYNTLIRGLSDSDTPSHALLLFVEMRRKSMALPDSFSFAFLLKAAANCRALRNGELRQEGMRPNEVSLTGALSACAQAGAFEFGRILHGFVEKSGFLQIVSVSNALIDTYSKCGNLDMARLIFDNMLERSVVSWTAMITGLAMHGYGEEAIRLFNEMEESNIKPDGITFISILYACSHAGLVDLGCSYFSRMVNIYGIEPVIEHYGCIVDLYGRAGKLQQAFDFVSQMPVSPNDIVWRTLLGACGIHGNLELAGQVKRRLFELDPENSGDHVLLSNIYAVAGKWKDVATLRRSMTHQKLKKTPGWSMIEVDRIMYSFVAGEKQNDIAEEAHQKLREIMSRLRIEGGYVPEVGSVLHDIEVEEKEDSVSQHSEKLAVAFGMVRLPRGRSIRVVKNLRICRDCHTVMKLISKVYEVEIVVRDRSRFHSFTHGSCSCRDYW
Homology
BLAST of Cp4.1LG20g00350 vs. ExPASy Swiss-Prot
Match: Q9CA54 (Pentatricopeptide repeat-containing protein At1g74630 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H71 PE=2 SV=1)

HSP 1 Score: 637.1 bits (1642), Expect = 1.8e-181
Identity = 325/637 (51.02%), Postives = 402/637 (63.11%), Query Frame = 0

Query: 51  HCLALFSKCRSLRTLKQIHAFTFKTGLNSDPLVAGKLLLHCAVTLPDSVRYAGRLFLDIR 110
           HCL+L + C++LR L QIH    K G+++D    GKL+LHCA+++ D++ YA RL L   
Sbjct: 7   HCLSLLNSCKNLRALTQIHGLFIKYGVDTDSYFTGKLILHCAISISDALPYARRLLLCFP 66

Query: 111 NPDVFMYNTLIRGLSDSDTPSHALLLFVEMRRKSMALPDSFSFAFLLKAAANCRALRNG- 170
            PD FM+NTL+RG S+SD P +++ +FVEM RK    PDSFSFAF++KA  N R+LR G 
Sbjct: 67  EPDAFMFNTLVRGYSESDEPHNSVAVFVEMMRKGFVFPDSFSFAFVIKAVENFRSLRTGF 126

Query: 171 ------------------------------------------------------------ 230
                                                                       
Sbjct: 127 QMHCQALKHGLESHLFVGTTLIGMYGGCGCVEFARKVFDEMHQPNLVAWNAVITACFRGN 186

Query: 231 ------------------------------------------------------------ 290
                                                                       
Sbjct: 187 DVAGAREIFDKMLVRNHTSWNVMLAGYIKAGELESAKRIFSEMPHRDDVSWSTMIVGIAH 246

Query: 291 ------------ELRQEGMRPNEVSLTGALSACAQAGAFEFGRILHGFVEKSGFLQIVSV 350
                       EL++ GM PNEVSLTG LSAC+Q+G+FEFG+ILHGFVEK+G+  IVSV
Sbjct: 247 NGSFNESFLYFRELQRAGMSPNEVSLTGVLSACSQSGSFEFGKILHGFVEKAGYSWIVSV 306

Query: 351 SNALIDTYSKCGNLDMARLIFDNMLE-RSVVSWTAMITGLAMHGYGEEAIRLFNEMEESN 410
           +NALID YS+CGN+ MARL+F+ M E R +VSWT+MI GLAMHG GEEA+RLFNEM    
Sbjct: 307 NNALIDMYSRCGNVPMARLVFEGMQEKRCIVSWTSMIAGLAMHGQGEEAVRLFNEMTAYG 366

Query: 411 IKPDGITFISILYACSHAGLVDLGCSYFSRMVNIYGIEPVIEHYGCIVDLYGRAGKLQQA 470
           + PDGI+FIS+L+ACSHAGL++ G  YFS M  +Y IEP IEHYGC+VDLYGR+GKLQ+A
Sbjct: 367 VTPDGISFISLLHACSHAGLIEEGEDYFSEMKRVYHIEPEIEHYGCMVDLYGRSGKLQKA 426

Query: 471 FDFVSQMPVSPNDIVWRTLLGACGIHGNLELAGQVKRRLFELDPENSGDHVLLSNIYAVA 530
           +DF+ QMP+ P  IVWRTLLGAC  HGN+ELA QVK+RL ELDP NSGD VLLSN YA A
Sbjct: 427 YDFICQMPIPPTAIVWRTLLGACSSHGNIELAEQVKQRLNELDPNNSGDLVLLSNAYATA 486

Query: 531 GKWKDVATLRRSMTHQKLKKTPGWSMIEVDRIMYSFVAGEKQNDIAEEAHQKLREIMSRL 554
           GKWKDVA++R+SM  Q++KKT  WS++EV + MY F AGEK+  I  EAH+KL+EI+ RL
Sbjct: 487 GKWKDVASIRKSMIVQRIKKTTAWSLVEVGKTMYKFTAGEKKKGIDIEAHEKLKEIILRL 546

BLAST of Cp4.1LG20g00350 vs. ExPASy Swiss-Prot
Match: A8MQA3 (Pentatricopeptide repeat-containing protein At4g21065 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H28 PE=2 SV=2)

HSP 1 Score: 436.4 bits (1121), Expect = 4.8e-121
Identity = 226/568 (39.79%), Postives = 340/568 (59.86%), Query Frame = 0

Query: 61  SLRTLKQIHAFTFKTGLNSDPLVAGKLLLHCAVTLPD--SVRYAGRLFLDIRNP-DVFMY 120
           S+  L+QIHAF+ + G++      GK L+   V+LP    + YA ++F  I  P +VF++
Sbjct: 29  SITKLRQIHAFSIRHGVSISDAELGKHLIFYLVSLPSPPPMSYAHKVFSKIEKPINVFIW 88

Query: 121 NTLIRGLSDSDTPSHALLLFVEMRRKSMALPDSFSFAFLLKAA----------------- 180
           NTLIRG ++      A  L+ EMR   +  PD+ ++ FL+KA                  
Sbjct: 89  NTLIRGYAEIGNSISAFSLYREMRVSGLVEPDTHTYPFLIKAVTTMADVRLGETIHSVVI 148

Query: 181 ------------------ANC---------------------RALRNG------------ 240
                             ANC                      ++ NG            
Sbjct: 149 RSGFGSLIYVQNSLLHLYANCGDVASAYKVFDKMPEKDLVAWNSVINGFAENGKPEEALA 208

Query: 241 ---ELRQEGMRPNEVSLTGALSACAQAGAFEFGRILHGFVEKSGFLQIVSVSNALIDTYS 300
              E+  +G++P+  ++   LSACA+ GA   G+ +H ++ K G  + +  SN L+D Y+
Sbjct: 209 LYTEMNSKGIKPDGFTIVSLLSACAKIGALTLGKRVHVYMIKVGLTRNLHSSNVLLDLYA 268

Query: 301 KCGNLDMARLIFDNMLERSVVSWTAMITGLAMHGYGEEAIRLFNEMEES-NIKPDGITFI 360
           +CG ++ A+ +FD M++++ VSWT++I GLA++G+G+EAI LF  ME +  + P  ITF+
Sbjct: 269 RCGRVEEAKTLFDEMVDKNSVSWTSLIVGLAVNGFGKEAIELFKYMESTEGLLPCEITFV 328

Query: 361 SILYACSHAGLVDLGCSYFSRMVNIYGIEPVIEHYGCIVDLYGRAGKLQQAFDFVSQMPV 420
            ILYACSH G+V  G  YF RM   Y IEP IEH+GC+VDL  RAG++++A++++  MP+
Sbjct: 329 GILYACSHCGMVKEGFEYFRRMREEYKIEPRIEHFGCMVDLLARAGQVKKAYEYIKSMPM 388

Query: 421 SPNDIVWRTLLGACGIHGNLELAGQVKRRLFELDPENSGDHVLLSNIYAVAGKWKDVATL 480
            PN ++WRTLLGAC +HG+ +LA   + ++ +L+P +SGD+VLLSN+YA   +W DV  +
Sbjct: 389 QPNVVIWRTLLGACTVHGDSDLAEFARIQILQLEPNHSGDYVLLSNMYASEQRWSDVQKI 448

Query: 481 RRSMTHQKLKKTPGWSMIEVDRIMYSFVAGEKQNDIAEEAHQKLREIMSRLRIEGGYVPE 540
           R+ M    +KK PG S++EV   ++ F+ G+K +  ++  + KL+E+  RLR E GYVP+
Sbjct: 449 RKQMLRDGVKKVPGHSLVEVGNRVHEFLMGDKSHPQSDAIYAKLKEMTGRLRSE-GYVPQ 508

Query: 541 VGSVLHDIEVEEKEDSVSQHSEKLAVAFGMVRLPRGRSIRVVKNLRICRDCHTVMKLISK 554
           + +V  D+E EEKE++V  HSEK+A+AF ++  P    I VVKNLR+C DCH  +KL+SK
Sbjct: 509 ISNVYVDVEEEEKENAVVYHSEKIAIAFMLISTPERSPITVVKNLRVCADCHLAIKLVSK 568

BLAST of Cp4.1LG20g00350 vs. ExPASy Swiss-Prot
Match: O82380 (Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H33 PE=2 SV=1)

HSP 1 Score: 418.7 bits (1075), Expect = 1.0e-115
Identity = 217/521 (41.65%), Postives = 326/521 (62.57%), Query Frame = 0

Query: 38  ERMERNEMNSREFHCLALFSKCRSLRTL---KQIHAFTFKTGLNSDPLVAGKLLLHCAVT 97
           ++ME  ++ +     + + S C  +R L   +Q+ ++  +  +N +  +A  +L     T
Sbjct: 221 KKMESEDVKASHVTMVGVLSACAKIRNLEFGRQVCSYIEENRVNVNLTLANAML--DMYT 280

Query: 98  LPDSVRYAGRLFLDIRNPDVFMYNTLIRGLSDSDTPSHALLLFVEMRRKSMALPDSFSFA 157
              S+  A RLF  +   D   + T++ G + S+    A  +   M +K +   ++   A
Sbjct: 281 KCGSIEDAKRLFDAMEEKDNVTWTTMLDGYAISEDYEAAREVLNSMPQKDIVAWNALISA 340

Query: 158 FLLKAAANCRALRNGELR-QEGMRPNEVSLTGALSACAQAGAFEFGRILHGFVEKSGFLQ 217
           +      N   +   EL+ Q+ M+ N+++L   LSACAQ GA E GR +H +++K G   
Sbjct: 341 YEQNGKPNEALIVFHELQLQKNMKLNQITLVSTLSACAQVGALELGRWIHSYIKKHGIRM 400

Query: 218 IVSVSNALIDTYSKCGNLDMARLIFDNMLERSVVSWTAMITGLAMHGYGEEAIRLFNEME 277
              V++ALI  YSKCG+L+ +R +F+++ +R V  W+AMI GLAMHG G EA+ +F +M+
Sbjct: 401 NFHVTSALIHMYSKCGDLEKSREVFNSVEKRDVFVWSAMIGGLAMHGCGNEAVDMFYKMQ 460

Query: 278 ESNIKPDGITFISILYACSHAGLVDLGCSYFSRMVNIYGIEPVIEHYGCIVDLYGRAGKL 337
           E+N+KP+G+TF ++  ACSH GLVD   S F +M + YGI P  +HY CIVD+ GR+G L
Sbjct: 461 EANVKPNGVTFTNVFCACSHTGLVDEAESLFHQMESNYGIVPEEKHYACIVDVLGRSGYL 520

Query: 338 QQAFDFVSQMPVSPNDIVWRTLLGACGIHGNLELAGQVKRRLFELDPENSGDHVLLSNIY 397
           ++A  F+  MP+ P+  VW  LLGAC IH NL LA     RL EL+P N G HVLLSNIY
Sbjct: 521 EKAVKFIEAMPIPPSTSVWGALLGACKIHANLNLAEMACTRLLELEPRNDGAHVLLSNIY 580

Query: 398 AVAGKWKDVATLRRSMTHQKLKKTPGWSMIEVDRIMYSFVAGEKQNDIAEEAHQKLREIM 457
           A  GKW++V+ LR+ M    LKK PG S IE+D +++ F++G+  + ++E+ + KL E+M
Sbjct: 581 AKLGKWENVSELRKHMRVTGLKKEPGCSSIEIDGMIHEFLSGDNAHPMSEKVYGKLHEVM 640

Query: 458 SRLRIEGGYVPEVGSVLHDIEVEE-KEDSVSQHSEKLAVAFGMVRLPRGRSIRVVKNLRI 517
            +L+   GY PE+  VL  IE EE KE S++ HSEKLA+ +G++     + IRV+KNLR+
Sbjct: 641 EKLK-SNGYEPEISQVLQIIEEEEMKEQSLNLHSEKLAICYGLISTEAPKVIRVIKNLRV 700

Query: 518 CRDCHTVMKLISKVYEVEIVVRDRSRFHSFTHGSCSCRDYW 554
           C DCH+V KLIS++Y+ EI+VRDR RFH F +G CSC D+W
Sbjct: 701 CGDCHSVAKLISQLYDREIIVRDRYRFHHFRNGQCSCNDFW 738

BLAST of Cp4.1LG20g00350 vs. ExPASy Swiss-Prot
Match: Q8LK93 (Pentatricopeptide repeat-containing protein At2g02980, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H26 PE=2 SV=2)

HSP 1 Score: 410.2 bits (1053), Expect = 3.7e-113
Identity = 216/574 (37.63%), Postives = 322/574 (56.10%), Query Frame = 0

Query: 53  LALFSKCRSLRTLKQIHAFTFKTGLNSDPLVAGKLLLHCAVT-LPDSVRYAGRLFLDIRN 112
           + L SKC SLR L QI A+  K+ +     VA KL+  C  +    S+ YA  LF  +  
Sbjct: 33  ILLISKCNSLRELMQIQAYAIKSHIEDVSFVA-KLINFCTESPTESSMSYARHLFEAMSE 92

Query: 113 PDVFMYNTLIRGLSDSDTPSHALLLFVEMRRKSMALPDSFSFAFLLKAAANCRALRNG-- 172
           PD+ ++N++ RG S    P     LFVE+    + LPD+++F  LLKA A  +AL  G  
Sbjct: 93  PDIVIFNSMARGYSRFTNPLEVFSLFVEILEDGI-LPDNYTFPSLLKACAVAKALEEGRQ 152

Query: 173 ------------------------------------------------------------ 232
                                                                       
Sbjct: 153 LHCLSMKLGLDDNVYVCPTLINMYTECEDVDSARCVFDRIVEPCVVCYNAMITGYARRNR 212

Query: 233 ---------ELRQEGMRPNEVSLTGALSACAQAGAFEFGRILHGFVEKSGFLQIVSVSNA 292
                    E++ + ++PNE++L   LS+CA  G+ + G+ +H + +K  F + V V+ A
Sbjct: 213 PNEALSLFREMQGKYLKPNEITLLSVLSSCALLGSLDLGKWIHKYAKKHSFCKYVKVNTA 272

Query: 293 LIDTYSKCGNLDMARLIFDNMLERSVVSWTAMITGLAMHGYGEEAIRLFNEMEESNIKPD 352
           LID ++KCG+LD A  IF+ M  +   +W+AMI   A HG  E+++ +F  M   N++PD
Sbjct: 273 LIDMFAKCGSLDDAVSIFEKMRYKDTQAWSAMIVAYANHGKAEKSMLMFERMRSENVQPD 332

Query: 353 GITFISILYACSHAGLVDLGCSYFSRMVNIYGIEPVIEHYGCIVDLYGRAGKLQQAFDFV 412
            ITF+ +L ACSH G V+ G  YFS+MV+ +GI P I+HYG +VDL  RAG L+ A++F+
Sbjct: 333 EITFLGLLNACSHTGRVEEGRKYFSQMVSKFGIVPSIKHYGSMVDLLSRAGNLEDAYEFI 392

Query: 413 SQMPVSPNDIVWRTLLGACGIHGNLELAGQVKRRLFELDPENSGDHVLLSNIYAVAGKWK 472
            ++P+SP  ++WR LL AC  H NL+LA +V  R+FELD  + GD+V+LSN+YA   KW+
Sbjct: 393 DKLPISPTPMLWRILLAACSSHNNLDLAEKVSERIFELDDSHGGDYVILSNLYARNKKWE 452

Query: 473 DVATLRRSMTHQKLKKTPGWSMIEVDRIMYSFVAGEKQNDIAEEAHQKLREIMSRLRIEG 532
            V +LR+ M  +K  K PG S IEV+ +++ F +G+       + H+ L E++  L++  
Sbjct: 453 YVDSLRKVMKDRKAVKVPGCSSIEVNNVVHEFFSGDGVKSATTKLHRALDEMVKELKL-S 512

Query: 533 GYVPEVGSVLH-DIEVEEKEDSVSQHSEKLAVAFGMVRLPRGRSIRVVKNLRICRDCHTV 554
           GYVP+   V+H ++  +EKE ++  HSEKLA+ FG++  P G +IRVVKNLR+CRDCH  
Sbjct: 513 GYVPDTSMVVHANMNDQEKEITLRYHSEKLAITFGLLNTPPGTTIRVVKNLRVCRDCHNA 572

BLAST of Cp4.1LG20g00350 vs. ExPASy Swiss-Prot
Match: Q9LW63 (Putative pentatricopeptide repeat-containing protein At3g23330 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H32 PE=3 SV=1)

HSP 1 Score: 393.3 bits (1009), Expect = 4.6e-108
Identity = 196/503 (38.97%), Postives = 293/503 (58.25%), Query Frame = 0

Query: 53  LALFSKCRSLRTLKQIHAFTFKTGLNSDPLVAGKL--LLHCAVTLPDSVRYAGRLFLDIR 112
           L +FS+   +   K+IH +  + G++SD  +   L  +   +  + DS R   RL+    
Sbjct: 249 LPIFSEYVDVIKGKEIHGYVIRKGIDSDVYIGSSLVDMYAKSARIEDSERVFSRLYC--- 308

Query: 113 NPDVFMYNTLIRGLSDSDTPSHALLLFVEMRRKSMALPDSFSFAFLLKAAANCRALRNGE 172
             D   +N+L+ G   +   + AL LF +M                              
Sbjct: 309 -RDGISWNSLVAGYVQNGRYNEALRLFRQMVTAK-------------------------- 368

Query: 173 LRQEGMRPNEVSLTGALSACAQAGAFEFGRILHGFVEKSGFLQIVSVSNALIDTYSKCGN 232
                ++P  V+ +  + ACA       G+ LHG+V + GF   + +++AL+D YSKCGN
Sbjct: 369 -----VKPGAVAFSSVIPACAHLATLHLGKQLHGYVLRGGFGSNIFIASALVDMYSKCGN 428

Query: 233 LDMARLIFDNMLERSVVSWTAMITGLAMHGYGEEAIRLFNEMEESNIKPDGITFISILYA 292
           +  AR IFD M     VSWTA+I G A+HG+G EA+ LF EM+   +KP+ + F+++L A
Sbjct: 429 IKAARKIFDRMNVLDEVSWTAIIMGHALHGHGHEAVSLFEEMKRQGVKPNQVAFVAVLTA 488

Query: 293 CSHAGLVDLGCSYFSRMVNIYGIEPVIEHYGCIVDLYGRAGKLQQAFDFVSQMPVSPNDI 352
           CSH GLVD    YF+ M  +YG+   +EHY  + DL GRAGKL++A++F+S+M V P   
Sbjct: 489 CSHVGLVDEAWGYFNSMTKVYGLNQELEHYAAVADLLGRAGKLEEAYNFISKMCVEPTGS 548

Query: 353 VWRTLLGACGIHGNLELAGQVKRRLFELDPENSGDHVLLSNIYAVAGKWKDVATLRRSMT 412
           VW TLL +C +H NLELA +V  ++F +D EN G +VL+ N+YA  G+WK++A LR  M 
Sbjct: 549 VWSTLLSSCSVHKNLELAEKVAEKIFTVDSENMGAYVLMCNMYASNGRWKEMAKLRLRMR 608

Query: 413 HQKLKKTPGWSMIEVDRIMYSFVAGEKQNDIAEEAHQKLREIMSRLRIEGGYVPEVGSVL 472
            + L+K P  S IE+    + FV+G++ +   ++ ++ L+ +M ++  E GYV +   VL
Sbjct: 609 KKGLRKKPACSWIEMKNKTHGFVSGDRSHPSMDKINEFLKAVMEQMEKE-GYVADTSGVL 668

Query: 473 HDIEVEEKEDSVSQHSEKLAVAFGMVRLPRGRSIRVVKNLRICRDCHTVMKLISKVYEVE 532
           HD++ E K + +  HSE+LAVAFG++    G +IRV KN+RIC DCH  +K ISK+ E E
Sbjct: 669 HDVDEEHKRELLFGHSERLAVAFGIINTEPGTTIRVTKNIRICTDCHVAIKFISKITERE 715

Query: 533 IVVRDRSRFHSFTHGSCSCRDYW 554
           I+VRD SRFH F  G+CSC DYW
Sbjct: 729 IIVRDNSRFHHFNRGNCSCGDYW 715

BLAST of Cp4.1LG20g00350 vs. NCBI nr
Match: XP_023519288.1 (pentatricopeptide repeat-containing protein At1g74630 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 970 bits (2508), Expect = 0.0
Identity = 509/642 (79.28%), Postives = 509/642 (79.28%), Query Frame = 0

Query: 45  MNSREFHCLALFSKCRSLRTLKQIHAFTFKTGLNSDPLVAGKLLLHCAVTLPDSVRYAGR 104
           MNSREFHCLALFSKCRSLRTLKQIHAFTFKTGLNSDPLVAGKLLLHCAVTLPDSVRYAGR
Sbjct: 1   MNSREFHCLALFSKCRSLRTLKQIHAFTFKTGLNSDPLVAGKLLLHCAVTLPDSVRYAGR 60

Query: 105 LFLDIRNPDVFMYNTLIRGLSDSDTPSHALLLFVEMRRKSMALPDSFSFAFLLKAAANCR 164
           LFLDIRNPDVFMYNTLIRGLSDSDTPSHALLLFVEMRRKSMALPDSFSFAFLLKAAANCR
Sbjct: 61  LFLDIRNPDVFMYNTLIRGLSDSDTPSHALLLFVEMRRKSMALPDSFSFAFLLKAAANCR 120

Query: 165 ALRNG------------------------------------------------------- 224
           ALRNG                                                       
Sbjct: 121 ALRNGLQLHRQAIGYGLDTHLFVGTTLISMYAECASLAFARQVFDEMIEPNIVAWNAIVA 180

Query: 225 ------------------------------------------------------------ 284
                                                                       
Sbjct: 181 ACFRCEDVKDAEQVFHRMPIKNLTSWNIMLAGYTKAGELRLAREVFMKMPLKDEVSWSSM 240

Query: 285 ------------------ELRQEGMRPNEVSLTGALSACAQAGAFEFGRILHGFVEKSGF 344
                             ELRQEGMRPNEVSLTGALSACAQAGAFEFGRILHGFVEKSGF
Sbjct: 241 IVGFAHNGSFNNAFAFFRELRQEGMRPNEVSLTGALSACAQAGAFEFGRILHGFVEKSGF 300

Query: 345 LQIVSVSNALIDTYSKCGNLDMARLIFDNMLERSVVSWTAMITGLAMHGYGEEAIRLFNE 404
           LQIVSVSNALIDTYSKCGNLDMARLIFDNMLERSVVSWTAMITGLAMHGYGEEAIRLFNE
Sbjct: 301 LQIVSVSNALIDTYSKCGNLDMARLIFDNMLERSVVSWTAMITGLAMHGYGEEAIRLFNE 360

Query: 405 MEESNIKPDGITFISILYACSHAGLVDLGCSYFSRMVNIYGIEPVIEHYGCIVDLYGRAG 464
           MEESNIKPDGITFISILYACSHAGLVDLGCSYFSRMVNIYGIEPVIEHYGCIVDLYGRAG
Sbjct: 361 MEESNIKPDGITFISILYACSHAGLVDLGCSYFSRMVNIYGIEPVIEHYGCIVDLYGRAG 420

Query: 465 KLQQAFDFVSQMPVSPNDIVWRTLLGACGIHGNLELAGQVKRRLFELDPENSGDHVLLSN 524
           KLQQAFDFVSQMPVSPNDIVWRTLLGACGIHGNLELAGQVKRRLFELDPENSGDHVLLSN
Sbjct: 421 KLQQAFDFVSQMPVSPNDIVWRTLLGACGIHGNLELAGQVKRRLFELDPENSGDHVLLSN 480

Query: 525 IYAVAGKWKDVATLRRSMTHQKLKKTPGWSMIEVDRIMYSFVAGEKQNDIAEEAHQKLRE 553
           IYAVAGKWKDVATLRRSMTHQKLKKTPGWSMIEVDRIMYSFVAGEKQNDIAEEAHQKLRE
Sbjct: 481 IYAVAGKWKDVATLRRSMTHQKLKKTPGWSMIEVDRIMYSFVAGEKQNDIAEEAHQKLRE 540

BLAST of Cp4.1LG20g00350 vs. NCBI nr
Match: XP_022923786.1 (uncharacterized protein LOC111431396 [Cucurbita moschata])

HSP 1 Score: 961 bits (2485), Expect = 0.0
Identity = 505/645 (78.29%), Postives = 506/645 (78.45%), Query Frame = 0

Query: 42   RNEMNSREFHCLALFSKCRSLRTLKQIHAFTFKTGLNSDPLVAGKLLLHCAVTLPDSVRY 101
            R  MNSRE HCLALFSKCRSLRTLKQIHAFTFKTGLNSDPLVAGKLLLHCAVTLPDSVRY
Sbjct: 1269 RVRMNSRELHCLALFSKCRSLRTLKQIHAFTFKTGLNSDPLVAGKLLLHCAVTLPDSVRY 1328

Query: 102  AGRLFLDIRNPDVFMYNTLIRGLSDSDTPSHALLLFVEMRRKSMALPDSFSFAFLLKAAA 161
            AGRLFLDIRNPDVFMYNTLIRGLSDSDTPSHALLLFVEMRRKSMALPDSFSFAFLLKAAA
Sbjct: 1329 AGRLFLDIRNPDVFMYNTLIRGLSDSDTPSHALLLFVEMRRKSMALPDSFSFAFLLKAAA 1388

Query: 162  NCRALRNG---------------------------------------------------- 221
            NCRALRNG                                                    
Sbjct: 1389 NCRALRNGLQLHRQAIGFGLDTHLFVGTTLISMYAECASLAFARQVFDEMIEPNIVAWNA 1448

Query: 222  ------------------------------------------------------------ 281
                                                                        
Sbjct: 1449 IVAACFRCEDVKDAEQVFHRMSIKNLTSWNIMLAGYTKAGELRLAREVFMKMPLKDEVSW 1508

Query: 282  ---------------------ELRQEGMRPNEVSLTGALSACAQAGAFEFGRILHGFVEK 341
                                 ELRQEGMRPNEVSLTGALSACAQAGAFEFGRILHGFVEK
Sbjct: 1509 SSMIVGFAHNGSFNNAFAFFRELRQEGMRPNEVSLTGALSACAQAGAFEFGRILHGFVEK 1568

Query: 342  SGFLQIVSVSNALIDTYSKCGNLDMARLIFDNMLERSVVSWTAMITGLAMHGYGEEAIRL 401
            SGFL IVSVSNALIDTYSKCGNLDMARLIFDNMLERSVVSWTAMITGLAMHGYGEEAIRL
Sbjct: 1569 SGFLHIVSVSNALIDTYSKCGNLDMARLIFDNMLERSVVSWTAMITGLAMHGYGEEAIRL 1628

Query: 402  FNEMEESNIKPDGITFISILYACSHAGLVDLGCSYFSRMVNIYGIEPVIEHYGCIVDLYG 461
            FNEMEESNIKPDGITFISILYACSHAGLVDLGCSYFSRMVNIYGIEPVIEHYGCIVDLYG
Sbjct: 1629 FNEMEESNIKPDGITFISILYACSHAGLVDLGCSYFSRMVNIYGIEPVIEHYGCIVDLYG 1688

Query: 462  RAGKLQQAFDFVSQMPVSPNDIVWRTLLGACGIHGNLELAGQVKRRLFELDPENSGDHVL 521
            RAGKLQQAFDFVSQMPVSPNDIVWRTLLGAC IHGNLELAGQVKRRLFELDPENSGDHVL
Sbjct: 1689 RAGKLQQAFDFVSQMPVSPNDIVWRTLLGACSIHGNLELAGQVKRRLFELDPENSGDHVL 1748

Query: 522  LSNIYAVAGKWKDVATLRRSMTHQKLKKTPGWSMIEVDRIMYSFVAGEKQNDIAEEAHQK 553
            LSNIYAVAGKWKDVATLRRSMTHQKLKKTPGWSMIEVDRIMYSFVAGEKQNDIAEEAHQK
Sbjct: 1749 LSNIYAVAGKWKDVATLRRSMTHQKLKKTPGWSMIEVDRIMYSFVAGEKQNDIAEEAHQK 1808

BLAST of Cp4.1LG20g00350 vs. NCBI nr
Match: KAG6584460.1 (Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 961 bits (2484), Expect = 0.0
Identity = 504/642 (78.50%), Postives = 505/642 (78.66%), Query Frame = 0

Query: 45  MNSREFHCLALFSKCRSLRTLKQIHAFTFKTGLNSDPLVAGKLLLHCAVTLPDSVRYAGR 104
           MNSRE HCLALFSKCRSLRTLKQIHAFTFKTGLNSDPLVAGKLLLHCAVTLPDSVRYAGR
Sbjct: 1   MNSRELHCLALFSKCRSLRTLKQIHAFTFKTGLNSDPLVAGKLLLHCAVTLPDSVRYAGR 60

Query: 105 LFLDIRNPDVFMYNTLIRGLSDSDTPSHALLLFVEMRRKSMALPDSFSFAFLLKAAANCR 164
           LFLDIRNPDVFMYNTLIRGLSDSDTPSHALLLFVEMRRKSMALPDSFSFAFLLKAAANCR
Sbjct: 61  LFLDIRNPDVFMYNTLIRGLSDSDTPSHALLLFVEMRRKSMALPDSFSFAFLLKAAANCR 120

Query: 165 ALRNG------------------------------------------------------- 224
           ALRNG                                                       
Sbjct: 121 ALRNGLQLHRQAIGFGLDTHLFVGTTLISMYAECASLAFARQVFDEMIEPNIVAWNAIVA 180

Query: 225 ------------------------------------------------------------ 284
                                                                       
Sbjct: 181 ACFRCEDVKDAEQVFHRMSIKNLTSWNIMLAGYTKAGELRLAREVFMKMPLKDEVSWSSM 240

Query: 285 ------------------ELRQEGMRPNEVSLTGALSACAQAGAFEFGRILHGFVEKSGF 344
                             ELRQEGMRPNEVSLTGALSACAQAGAFEFGRILHGFVEKSGF
Sbjct: 241 IVGFAHNGSFNNAFAFFRELRQEGMRPNEVSLTGALSACAQAGAFEFGRILHGFVEKSGF 300

Query: 345 LQIVSVSNALIDTYSKCGNLDMARLIFDNMLERSVVSWTAMITGLAMHGYGEEAIRLFNE 404
           L IVSVSNALIDTYSKCGNLDMARLIFDNMLERSVVSWTAMITGLAMHGYGEEAIRLFNE
Sbjct: 301 LHIVSVSNALIDTYSKCGNLDMARLIFDNMLERSVVSWTAMITGLAMHGYGEEAIRLFNE 360

Query: 405 MEESNIKPDGITFISILYACSHAGLVDLGCSYFSRMVNIYGIEPVIEHYGCIVDLYGRAG 464
           MEESNIKPDGITFISILYACSHAGLVDLGCSYFSRMVNIYGIEPVIEHYGCIVDLYGRAG
Sbjct: 361 MEESNIKPDGITFISILYACSHAGLVDLGCSYFSRMVNIYGIEPVIEHYGCIVDLYGRAG 420

Query: 465 KLQQAFDFVSQMPVSPNDIVWRTLLGACGIHGNLELAGQVKRRLFELDPENSGDHVLLSN 524
           KLQQAFDFVSQMPVSPNDIVWRTLLGAC IHGNLELAGQVKRRLFELDPENSGDHVLLSN
Sbjct: 421 KLQQAFDFVSQMPVSPNDIVWRTLLGACSIHGNLELAGQVKRRLFELDPENSGDHVLLSN 480

Query: 525 IYAVAGKWKDVATLRRSMTHQKLKKTPGWSMIEVDRIMYSFVAGEKQNDIAEEAHQKLRE 553
           IYAVAGKWKDVATLRRSMTHQKLKKTPGWSMIEVDRIMYSFVAGEKQNDIAEEAHQKLRE
Sbjct: 481 IYAVAGKWKDVATLRRSMTHQKLKKTPGWSMIEVDRIMYSFVAGEKQNDIAEEAHQKLRE 540

BLAST of Cp4.1LG20g00350 vs. NCBI nr
Match: XP_023001664.1 (uncharacterized protein LOC111495738 [Cucurbita maxima])

HSP 1 Score: 954 bits (2466), Expect = 0.0
Identity = 501/645 (77.67%), Postives = 505/645 (78.29%), Query Frame = 0

Query: 42   RNEMNSREFHCLALFSKCRSLRTLKQIHAFTFKTGLNSDPLVAGKLLLHCAVTLPDSVRY 101
            R  MNSRE HCLAL SKCRSLRTLKQIHAFTFKTGLNSDPLVAGKLLLHCAVTLPDSVRY
Sbjct: 1254 RVRMNSREVHCLALLSKCRSLRTLKQIHAFTFKTGLNSDPLVAGKLLLHCAVTLPDSVRY 1313

Query: 102  AGRLFLDIRNPDVFMYNTLIRGLSDSDTPSHALLLFVEMRRKSMALPDSFSFAFLLKAAA 161
            AGRLFLDIRNPDVFMYNTLIRGLSDSDTPSHALLLFVEMRRKSMALPDSF+FAF+LKAAA
Sbjct: 1314 AGRLFLDIRNPDVFMYNTLIRGLSDSDTPSHALLLFVEMRRKSMALPDSFTFAFVLKAAA 1373

Query: 162  NCRALRNG---------------------------------------------------- 221
            NCRALRNG                                                    
Sbjct: 1374 NCRALRNGLQLHRQAIGYGLDTHLFVGTTLISMYAECASLAFARQVFDEMIEPNIVAWNA 1433

Query: 222  ------------------------------------------------------------ 281
                                                                        
Sbjct: 1434 IVAACFRCEDVKNAEQVFHRMPIKNLTSWNIMLAGYTKAGELRLAREVFMKMPLKDEVSW 1493

Query: 282  ---------------------ELRQEGMRPNEVSLTGALSACAQAGAFEFGRILHGFVEK 341
                                 ELRQ+GMRPNEVSLTGALSACAQAGAFEFGRILHGFVEK
Sbjct: 1494 SSMIVGFAHNGSFNNAFAFFRELRQDGMRPNEVSLTGALSACAQAGAFEFGRILHGFVEK 1553

Query: 342  SGFLQIVSVSNALIDTYSKCGNLDMARLIFDNMLERSVVSWTAMITGLAMHGYGEEAIRL 401
            SGFLQIVSVSNALIDTYSKCGNLDMARLIFDNMLERSVVSWTAMITGLAMHGYGEEAIRL
Sbjct: 1554 SGFLQIVSVSNALIDTYSKCGNLDMARLIFDNMLERSVVSWTAMITGLAMHGYGEEAIRL 1613

Query: 402  FNEMEESNIKPDGITFISILYACSHAGLVDLGCSYFSRMVNIYGIEPVIEHYGCIVDLYG 461
            FNEMEESNIKPDGITFISILYACSHAGLVDLGCSYFSRMVNIYGIEPVIEHYGCIVDLYG
Sbjct: 1614 FNEMEESNIKPDGITFISILYACSHAGLVDLGCSYFSRMVNIYGIEPVIEHYGCIVDLYG 1673

Query: 462  RAGKLQQAFDFVSQMPVSPNDIVWRTLLGACGIHGNLELAGQVKRRLFELDPENSGDHVL 521
            RAGKLQQAFDFVSQMPVSPNDIVWRTLLGAC IHGNLELAGQVKRRLFELDPENSGDHVL
Sbjct: 1674 RAGKLQQAFDFVSQMPVSPNDIVWRTLLGACSIHGNLELAGQVKRRLFELDPENSGDHVL 1733

Query: 522  LSNIYAVAGKWKDVATLRRSMTHQKLKKTPGWSMIEVDRIMYSFVAGEKQNDIAEEAHQK 553
            LSNIYAVAGKWKDVATLRRSMTHQKLKKTPGWSMIEVDRIMYSFVAGEKQNDIAEEAHQK
Sbjct: 1734 LSNIYAVAGKWKDVATLRRSMTHQKLKKTPGWSMIEVDRIMYSFVAGEKQNDIAEEAHQK 1793

BLAST of Cp4.1LG20g00350 vs. NCBI nr
Match: KAG7020047.1 (Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 949 bits (2454), Expect = 0.0
Identity = 499/637 (78.34%), Postives = 500/637 (78.49%), Query Frame = 0

Query: 49   EFHCLALFSKCRSLRTLKQIHAFTFKTGLNSDPLVAGKLLLHCAVTLPDSVRYAGRLFLD 108
            E HCLALFSKCRSLRTLKQIHAFTFKTGLNSDPLVAGKLLLHCAVTLPDSVRYAGRLFLD
Sbjct: 954  ELHCLALFSKCRSLRTLKQIHAFTFKTGLNSDPLVAGKLLLHCAVTLPDSVRYAGRLFLD 1013

Query: 109  IRNPDVFMYNTLIRGLSDSDTPSHALLLFVEMRRKSMALPDSFSFAFLLKAAANCRALRN 168
            IRNPDVFMYNTLIRGLSDSDTPSHALLLFVEMRRKSMALPDSFSFAFLLKAAANCRALRN
Sbjct: 1014 IRNPDVFMYNTLIRGLSDSDTPSHALLLFVEMRRKSMALPDSFSFAFLLKAAANCRALRN 1073

Query: 169  G----------------------------------------------------------- 228
            G                                                           
Sbjct: 1074 GLQLHRQAIGFGLDTHLFVGTTLISMYAECASLAFARQVFDEMIEPNIVAWNAIVAACFR 1133

Query: 229  ------------------------------------------------------------ 288
                                                                        
Sbjct: 1134 CEDVKDAEQVFHRMSIKNLTSWNIMLAGYTKAGELRLAREVFMKMPLKDEVSWSSMIVGF 1193

Query: 289  --------------ELRQEGMRPNEVSLTGALSACAQAGAFEFGRILHGFVEKSGFLQIV 348
                          ELRQEGMRPNEVSLTGALSACAQAGAFEFGRILHGFVEKSGFL IV
Sbjct: 1194 AHNGSFNNAFAFFRELRQEGMRPNEVSLTGALSACAQAGAFEFGRILHGFVEKSGFLHIV 1253

Query: 349  SVSNALIDTYSKCGNLDMARLIFDNMLERSVVSWTAMITGLAMHGYGEEAIRLFNEMEES 408
            SVSNALIDTYSKCGNLDMARLIFDNMLERSVVSWTAMITGLAMHGYGEEAIRLFNEMEES
Sbjct: 1254 SVSNALIDTYSKCGNLDMARLIFDNMLERSVVSWTAMITGLAMHGYGEEAIRLFNEMEES 1313

Query: 409  NIKPDGITFISILYACSHAGLVDLGCSYFSRMVNIYGIEPVIEHYGCIVDLYGRAGKLQQ 468
            NIKPDGITFISILYACSHAGLVDLGCSYFSRMVNIYGIEPVIEHYGCIVDLYGRAGKLQQ
Sbjct: 1314 NIKPDGITFISILYACSHAGLVDLGCSYFSRMVNIYGIEPVIEHYGCIVDLYGRAGKLQQ 1373

Query: 469  AFDFVSQMPVSPNDIVWRTLLGACGIHGNLELAGQVKRRLFELDPENSGDHVLLSNIYAV 528
            AFDFVSQMPVSPNDIVWRTLLGAC IHGNLELAGQVKRRLFELDPENSGDHVLLSNIYAV
Sbjct: 1374 AFDFVSQMPVSPNDIVWRTLLGACSIHGNLELAGQVKRRLFELDPENSGDHVLLSNIYAV 1433

Query: 529  AGKWKDVATLRRSMTHQKLKKTPGWSMIEVDRIMYSFVAGEKQNDIAEEAHQKLREIMSR 552
            AGKWKDVATLRRSMTHQKLKKTPGWSMIEVDRIMYSFVAGEKQNDIAEEAHQKLREIMSR
Sbjct: 1434 AGKWKDVATLRRSMTHQKLKKTPGWSMIEVDRIMYSFVAGEKQNDIAEEAHQKLREIMSR 1493

BLAST of Cp4.1LG20g00350 vs. ExPASy TrEMBL
Match: A0A6J1E7P0 (uncharacterized protein LOC111431396 OS=Cucurbita moschata OX=3662 GN=LOC111431396 PE=3 SV=1)

HSP 1 Score: 961 bits (2485), Expect = 0.0
Identity = 505/645 (78.29%), Postives = 506/645 (78.45%), Query Frame = 0

Query: 42   RNEMNSREFHCLALFSKCRSLRTLKQIHAFTFKTGLNSDPLVAGKLLLHCAVTLPDSVRY 101
            R  MNSRE HCLALFSKCRSLRTLKQIHAFTFKTGLNSDPLVAGKLLLHCAVTLPDSVRY
Sbjct: 1269 RVRMNSRELHCLALFSKCRSLRTLKQIHAFTFKTGLNSDPLVAGKLLLHCAVTLPDSVRY 1328

Query: 102  AGRLFLDIRNPDVFMYNTLIRGLSDSDTPSHALLLFVEMRRKSMALPDSFSFAFLLKAAA 161
            AGRLFLDIRNPDVFMYNTLIRGLSDSDTPSHALLLFVEMRRKSMALPDSFSFAFLLKAAA
Sbjct: 1329 AGRLFLDIRNPDVFMYNTLIRGLSDSDTPSHALLLFVEMRRKSMALPDSFSFAFLLKAAA 1388

Query: 162  NCRALRNG---------------------------------------------------- 221
            NCRALRNG                                                    
Sbjct: 1389 NCRALRNGLQLHRQAIGFGLDTHLFVGTTLISMYAECASLAFARQVFDEMIEPNIVAWNA 1448

Query: 222  ------------------------------------------------------------ 281
                                                                        
Sbjct: 1449 IVAACFRCEDVKDAEQVFHRMSIKNLTSWNIMLAGYTKAGELRLAREVFMKMPLKDEVSW 1508

Query: 282  ---------------------ELRQEGMRPNEVSLTGALSACAQAGAFEFGRILHGFVEK 341
                                 ELRQEGMRPNEVSLTGALSACAQAGAFEFGRILHGFVEK
Sbjct: 1509 SSMIVGFAHNGSFNNAFAFFRELRQEGMRPNEVSLTGALSACAQAGAFEFGRILHGFVEK 1568

Query: 342  SGFLQIVSVSNALIDTYSKCGNLDMARLIFDNMLERSVVSWTAMITGLAMHGYGEEAIRL 401
            SGFL IVSVSNALIDTYSKCGNLDMARLIFDNMLERSVVSWTAMITGLAMHGYGEEAIRL
Sbjct: 1569 SGFLHIVSVSNALIDTYSKCGNLDMARLIFDNMLERSVVSWTAMITGLAMHGYGEEAIRL 1628

Query: 402  FNEMEESNIKPDGITFISILYACSHAGLVDLGCSYFSRMVNIYGIEPVIEHYGCIVDLYG 461
            FNEMEESNIKPDGITFISILYACSHAGLVDLGCSYFSRMVNIYGIEPVIEHYGCIVDLYG
Sbjct: 1629 FNEMEESNIKPDGITFISILYACSHAGLVDLGCSYFSRMVNIYGIEPVIEHYGCIVDLYG 1688

Query: 462  RAGKLQQAFDFVSQMPVSPNDIVWRTLLGACGIHGNLELAGQVKRRLFELDPENSGDHVL 521
            RAGKLQQAFDFVSQMPVSPNDIVWRTLLGAC IHGNLELAGQVKRRLFELDPENSGDHVL
Sbjct: 1689 RAGKLQQAFDFVSQMPVSPNDIVWRTLLGACSIHGNLELAGQVKRRLFELDPENSGDHVL 1748

Query: 522  LSNIYAVAGKWKDVATLRRSMTHQKLKKTPGWSMIEVDRIMYSFVAGEKQNDIAEEAHQK 553
            LSNIYAVAGKWKDVATLRRSMTHQKLKKTPGWSMIEVDRIMYSFVAGEKQNDIAEEAHQK
Sbjct: 1749 LSNIYAVAGKWKDVATLRRSMTHQKLKKTPGWSMIEVDRIMYSFVAGEKQNDIAEEAHQK 1808

BLAST of Cp4.1LG20g00350 vs. ExPASy TrEMBL
Match: A0A6J1KND0 (uncharacterized protein LOC111495738 OS=Cucurbita maxima OX=3661 GN=LOC111495738 PE=3 SV=1)

HSP 1 Score: 954 bits (2466), Expect = 0.0
Identity = 501/645 (77.67%), Postives = 505/645 (78.29%), Query Frame = 0

Query: 42   RNEMNSREFHCLALFSKCRSLRTLKQIHAFTFKTGLNSDPLVAGKLLLHCAVTLPDSVRY 101
            R  MNSRE HCLAL SKCRSLRTLKQIHAFTFKTGLNSDPLVAGKLLLHCAVTLPDSVRY
Sbjct: 1254 RVRMNSREVHCLALLSKCRSLRTLKQIHAFTFKTGLNSDPLVAGKLLLHCAVTLPDSVRY 1313

Query: 102  AGRLFLDIRNPDVFMYNTLIRGLSDSDTPSHALLLFVEMRRKSMALPDSFSFAFLLKAAA 161
            AGRLFLDIRNPDVFMYNTLIRGLSDSDTPSHALLLFVEMRRKSMALPDSF+FAF+LKAAA
Sbjct: 1314 AGRLFLDIRNPDVFMYNTLIRGLSDSDTPSHALLLFVEMRRKSMALPDSFTFAFVLKAAA 1373

Query: 162  NCRALRNG---------------------------------------------------- 221
            NCRALRNG                                                    
Sbjct: 1374 NCRALRNGLQLHRQAIGYGLDTHLFVGTTLISMYAECASLAFARQVFDEMIEPNIVAWNA 1433

Query: 222  ------------------------------------------------------------ 281
                                                                        
Sbjct: 1434 IVAACFRCEDVKNAEQVFHRMPIKNLTSWNIMLAGYTKAGELRLAREVFMKMPLKDEVSW 1493

Query: 282  ---------------------ELRQEGMRPNEVSLTGALSACAQAGAFEFGRILHGFVEK 341
                                 ELRQ+GMRPNEVSLTGALSACAQAGAFEFGRILHGFVEK
Sbjct: 1494 SSMIVGFAHNGSFNNAFAFFRELRQDGMRPNEVSLTGALSACAQAGAFEFGRILHGFVEK 1553

Query: 342  SGFLQIVSVSNALIDTYSKCGNLDMARLIFDNMLERSVVSWTAMITGLAMHGYGEEAIRL 401
            SGFLQIVSVSNALIDTYSKCGNLDMARLIFDNMLERSVVSWTAMITGLAMHGYGEEAIRL
Sbjct: 1554 SGFLQIVSVSNALIDTYSKCGNLDMARLIFDNMLERSVVSWTAMITGLAMHGYGEEAIRL 1613

Query: 402  FNEMEESNIKPDGITFISILYACSHAGLVDLGCSYFSRMVNIYGIEPVIEHYGCIVDLYG 461
            FNEMEESNIKPDGITFISILYACSHAGLVDLGCSYFSRMVNIYGIEPVIEHYGCIVDLYG
Sbjct: 1614 FNEMEESNIKPDGITFISILYACSHAGLVDLGCSYFSRMVNIYGIEPVIEHYGCIVDLYG 1673

Query: 462  RAGKLQQAFDFVSQMPVSPNDIVWRTLLGACGIHGNLELAGQVKRRLFELDPENSGDHVL 521
            RAGKLQQAFDFVSQMPVSPNDIVWRTLLGAC IHGNLELAGQVKRRLFELDPENSGDHVL
Sbjct: 1674 RAGKLQQAFDFVSQMPVSPNDIVWRTLLGACSIHGNLELAGQVKRRLFELDPENSGDHVL 1733

Query: 522  LSNIYAVAGKWKDVATLRRSMTHQKLKKTPGWSMIEVDRIMYSFVAGEKQNDIAEEAHQK 553
            LSNIYAVAGKWKDVATLRRSMTHQKLKKTPGWSMIEVDRIMYSFVAGEKQNDIAEEAHQK
Sbjct: 1734 LSNIYAVAGKWKDVATLRRSMTHQKLKKTPGWSMIEVDRIMYSFVAGEKQNDIAEEAHQK 1793

BLAST of Cp4.1LG20g00350 vs. ExPASy TrEMBL
Match: A0A5A7T383 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold112G00750 PE=3 SV=1)

HSP 1 Score: 904 bits (2336), Expect = 0.0
Identity = 470/642 (73.21%), Postives = 490/642 (76.32%), Query Frame = 0

Query: 45  MNSREFHCLALFSKCRSLRTLKQIHAFTFKTGLNSDPLVAGKLLLHCAVTLPDSVRYAGR 104
           MNSREFHCLALFSKC+SLRT+KQI AFTFKTGLNSDPLV+GKLLLHCAVTLPDS+ YA R
Sbjct: 1   MNSREFHCLALFSKCKSLRTVKQIQAFTFKTGLNSDPLVSGKLLLHCAVTLPDSLHYARR 60

Query: 105 LFLDIRNPDVFMYNTLIRGLSDSDTPSHALLLFVEMRRKSMALPDSFSFAFLLKAAANCR 164
           +FLDIRNPDVFMYNTLIRGLSDSDTPS+AL LFVEMRRKSMALPDSFSFAFLLKAAANCR
Sbjct: 61  IFLDIRNPDVFMYNTLIRGLSDSDTPSNALQLFVEMRRKSMALPDSFSFAFLLKAAANCR 120

Query: 165 ALRNG------------------------------------------------------- 224
           AL NG                                                       
Sbjct: 121 ALTNGLQLHCQAVGYGLDSHLFVGTTLISMYAECASLTFARKVFDEMIEPNIVAWNAIVA 180

Query: 225 ------------------------------------------------------------ 284
                                                                       
Sbjct: 181 ACFRCEDVKDAEQVFRCMPIRNLTSWNILLAGYAKAGELQLAREVFMKMPLKDDVSWSSM 240

Query: 285 ------------------ELRQEGMRPNEVSLTGALSACAQAGAFEFGRILHGFVEKSGF 344
                             ELR+EGMRPNEVSLTGALSACAQAGAFEFGRILH FVEKSGF
Sbjct: 241 IVGFAHNGNFNDAFAFFRELRREGMRPNEVSLTGALSACAQAGAFEFGRILHAFVEKSGF 300

Query: 345 LQIVSVSNALIDTYSKCGNLDMARLIFDNMLERSVVSWTAMITGLAMHGYGEEAIRLFNE 404
           LQI+SV+NALIDTYSKCGNLDMARL+FDNML R+ VSWTAMI G+AMHGYGEEAIRLFNE
Sbjct: 301 LQIISVNNALIDTYSKCGNLDMARLVFDNMLGRNAVSWTAMIAGMAMHGYGEEAIRLFNE 360

Query: 405 MEESNIKPDGITFISILYACSHAGLVDLGCSYFSRMVNIYGIEPVIEHYGCIVDLYGRAG 464
           MEESNIKPD I FISILYACSHAGLVDLGCSYFSRMVN YGIEPVIEHYGC+VDLYGRAG
Sbjct: 361 MEESNIKPDSIAFISILYACSHAGLVDLGCSYFSRMVNTYGIEPVIEHYGCMVDLYGRAG 420

Query: 465 KLQQAFDFVSQMPVSPNDIVWRTLLGACGIHGNLELAGQVKRRLFELDPENSGDHVLLSN 524
           KLQQA+DFV QMP+SPNDIVWRTLLGAC IHGNL+LAGQVKR+L ELDPENSGDHVLLSN
Sbjct: 421 KLQQAYDFVCQMPISPNDIVWRTLLGACSIHGNLDLAGQVKRQLSELDPENSGDHVLLSN 480

Query: 525 IYAVAGKWKDVATLRRSMTHQKLKKTPGWSMIEVDRIMYSFVAGEKQNDIAEEAHQKLRE 553
           IYAVAGKWKDVA LRRSMTHQ+LKKTPGWSMIEV+RIMYSFVAGEKQNDIA EAHQKLRE
Sbjct: 481 IYAVAGKWKDVAALRRSMTHQRLKKTPGWSMIEVNRIMYSFVAGEKQNDIAVEAHQKLRE 540

BLAST of Cp4.1LG20g00350 vs. ExPASy TrEMBL
Match: A0A5D3BIR8 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold248G00850 PE=3 SV=1)

HSP 1 Score: 902 bits (2330), Expect = 0.0
Identity = 469/642 (73.05%), Postives = 489/642 (76.17%), Query Frame = 0

Query: 45  MNSREFHCLALFSKCRSLRTLKQIHAFTFKTGLNSDPLVAGKLLLHCAVTLPDSVRYAGR 104
           MNSREFHCLALFSKC+SLRT+KQI AFTFKTGLNSDPLV+GKLLLHCAVTLPDS+ YA R
Sbjct: 1   MNSREFHCLALFSKCKSLRTVKQIQAFTFKTGLNSDPLVSGKLLLHCAVTLPDSLHYARR 60

Query: 105 LFLDIRNPDVFMYNTLIRGLSDSDTPSHALLLFVEMRRKSMALPDSFSFAFLLKAAANCR 164
           +FLDIRNPDVFMYNTLIR LSDSDTPS+AL LFVEMRRKSMALPDSFSFAFLLKAAANCR
Sbjct: 61  IFLDIRNPDVFMYNTLIRSLSDSDTPSNALQLFVEMRRKSMALPDSFSFAFLLKAAANCR 120

Query: 165 ALRNG------------------------------------------------------- 224
           AL NG                                                       
Sbjct: 121 ALTNGLQLHCQAVGYGLDSHLFVGTTLISMYAECASLTSARKVFDEMIEPNIVAWNAIVA 180

Query: 225 ------------------------------------------------------------ 284
                                                                       
Sbjct: 181 ACFRCEDVKDAERVFRCMPIRNLTSWNIMLAGYTKAGELQLAREVFMKMPLKDDVSWSSM 240

Query: 285 ------------------ELRQEGMRPNEVSLTGALSACAQAGAFEFGRILHGFVEKSGF 344
                             ELR+EGMRPNEVSLTGALSACAQAGAFEFGRILH FVEKSGF
Sbjct: 241 IVGFAHNGNFNDAFAFFRELRREGMRPNEVSLTGALSACAQAGAFEFGRILHAFVEKSGF 300

Query: 345 LQIVSVSNALIDTYSKCGNLDMARLIFDNMLERSVVSWTAMITGLAMHGYGEEAIRLFNE 404
           LQI+SV+NALIDTYSKCGNLDMARL+FDNML R+ VSWTAMI G+AMHGYGEEAIRLFNE
Sbjct: 301 LQIISVNNALIDTYSKCGNLDMARLVFDNMLGRNAVSWTAMIAGMAMHGYGEEAIRLFNE 360

Query: 405 MEESNIKPDGITFISILYACSHAGLVDLGCSYFSRMVNIYGIEPVIEHYGCIVDLYGRAG 464
           MEESNIKPD I FISILYACSHAGLVDLGCSYFSRMVN YGIEPVIEHYGC+VDLYGRAG
Sbjct: 361 MEESNIKPDSIAFISILYACSHAGLVDLGCSYFSRMVNTYGIEPVIEHYGCMVDLYGRAG 420

Query: 465 KLQQAFDFVSQMPVSPNDIVWRTLLGACGIHGNLELAGQVKRRLFELDPENSGDHVLLSN 524
           KLQQA+DFV QMP+SPNDIVWRTLLGAC IHGNL+LAGQVKR+L ELDPENSGDHVLLSN
Sbjct: 421 KLQQAYDFVCQMPISPNDIVWRTLLGACSIHGNLDLAGQVKRQLSELDPENSGDHVLLSN 480

Query: 525 IYAVAGKWKDVATLRRSMTHQKLKKTPGWSMIEVDRIMYSFVAGEKQNDIAEEAHQKLRE 553
           IYAVAGKWKDVA LRRSMTHQ+LKKTPGWSMIEV+RIMYSFVAGEKQNDIA EAHQKLRE
Sbjct: 481 IYAVAGKWKDVAALRRSMTHQRLKKTPGWSMIEVNRIMYSFVAGEKQNDIAVEAHQKLRE 540

BLAST of Cp4.1LG20g00350 vs. ExPASy TrEMBL
Match: A0A1S4DUF7 (pentatricopeptide repeat-containing protein At1g74630 OS=Cucumis melo OX=3656 GN=LOC107990467 PE=3 SV=1)

HSP 1 Score: 902 bits (2330), Expect = 0.0
Identity = 469/642 (73.05%), Postives = 489/642 (76.17%), Query Frame = 0

Query: 45  MNSREFHCLALFSKCRSLRTLKQIHAFTFKTGLNSDPLVAGKLLLHCAVTLPDSVRYAGR 104
           MNSREFHCLALFSKC+SLRT+KQI AFTFKTGLNSDPLV+GKLLLHCAVTLPDS+ YA R
Sbjct: 1   MNSREFHCLALFSKCKSLRTVKQIQAFTFKTGLNSDPLVSGKLLLHCAVTLPDSLHYARR 60

Query: 105 LFLDIRNPDVFMYNTLIRGLSDSDTPSHALLLFVEMRRKSMALPDSFSFAFLLKAAANCR 164
           +FLDIRNPDVFMYNTLIR LSDSDTPS+AL LFVEMRRKSMALPDSFSFAFLLKAAANCR
Sbjct: 61  IFLDIRNPDVFMYNTLIRSLSDSDTPSNALQLFVEMRRKSMALPDSFSFAFLLKAAANCR 120

Query: 165 ALRNG------------------------------------------------------- 224
           AL NG                                                       
Sbjct: 121 ALTNGLQLHCQAVGYGLDSHLFVGTTLISMYAECASLTSARKVFDEMIEPNIVAWNAIVA 180

Query: 225 ------------------------------------------------------------ 284
                                                                       
Sbjct: 181 ACFRCEDVKDAERVFRCMPIRNLTSWNIMLAGYTKAGELQLAREVFMKMPLKDDVSWSSM 240

Query: 285 ------------------ELRQEGMRPNEVSLTGALSACAQAGAFEFGRILHGFVEKSGF 344
                             ELR+EGMRPNEVSLTGALSACAQAGAFEFGRILH FVEKSGF
Sbjct: 241 IVGFAHNGNFNDAFAFFRELRREGMRPNEVSLTGALSACAQAGAFEFGRILHAFVEKSGF 300

Query: 345 LQIVSVSNALIDTYSKCGNLDMARLIFDNMLERSVVSWTAMITGLAMHGYGEEAIRLFNE 404
           LQI+SV+NALIDTYSKCGNLDMARL+FDNML R+ VSWTAMI G+AMHGYGEEAIRLFNE
Sbjct: 301 LQIISVNNALIDTYSKCGNLDMARLVFDNMLGRNAVSWTAMIAGMAMHGYGEEAIRLFNE 360

Query: 405 MEESNIKPDGITFISILYACSHAGLVDLGCSYFSRMVNIYGIEPVIEHYGCIVDLYGRAG 464
           MEESNIKPD I FISILYACSHAGLVDLGCSYFSRMVN YGIEPVIEHYGC+VDLYGRAG
Sbjct: 361 MEESNIKPDSIAFISILYACSHAGLVDLGCSYFSRMVNTYGIEPVIEHYGCMVDLYGRAG 420

Query: 465 KLQQAFDFVSQMPVSPNDIVWRTLLGACGIHGNLELAGQVKRRLFELDPENSGDHVLLSN 524
           KLQQA+DFV QMP+SPNDIVWRTLLGAC IHGNL+LAGQVKR+L ELDPENSGDHVLLSN
Sbjct: 421 KLQQAYDFVCQMPISPNDIVWRTLLGACSIHGNLDLAGQVKRQLSELDPENSGDHVLLSN 480

Query: 525 IYAVAGKWKDVATLRRSMTHQKLKKTPGWSMIEVDRIMYSFVAGEKQNDIAEEAHQKLRE 553
           IYAVAGKWKDVA LRRSMTHQ+LKKTPGWSMIEV+RIMYSFVAGEKQNDIA EAHQKLRE
Sbjct: 481 IYAVAGKWKDVAALRRSMTHQRLKKTPGWSMIEVNRIMYSFVAGEKQNDIAVEAHQKLRE 540

BLAST of Cp4.1LG20g00350 vs. TAIR 10
Match: AT1G74630.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 637.1 bits (1642), Expect = 1.3e-182
Identity = 325/637 (51.02%), Postives = 402/637 (63.11%), Query Frame = 0

Query: 51  HCLALFSKCRSLRTLKQIHAFTFKTGLNSDPLVAGKLLLHCAVTLPDSVRYAGRLFLDIR 110
           HCL+L + C++LR L QIH    K G+++D    GKL+LHCA+++ D++ YA RL L   
Sbjct: 7   HCLSLLNSCKNLRALTQIHGLFIKYGVDTDSYFTGKLILHCAISISDALPYARRLLLCFP 66

Query: 111 NPDVFMYNTLIRGLSDSDTPSHALLLFVEMRRKSMALPDSFSFAFLLKAAANCRALRNG- 170
            PD FM+NTL+RG S+SD P +++ +FVEM RK    PDSFSFAF++KA  N R+LR G 
Sbjct: 67  EPDAFMFNTLVRGYSESDEPHNSVAVFVEMMRKGFVFPDSFSFAFVIKAVENFRSLRTGF 126

Query: 171 ------------------------------------------------------------ 230
                                                                       
Sbjct: 127 QMHCQALKHGLESHLFVGTTLIGMYGGCGCVEFARKVFDEMHQPNLVAWNAVITACFRGN 186

Query: 231 ------------------------------------------------------------ 290
                                                                       
Sbjct: 187 DVAGAREIFDKMLVRNHTSWNVMLAGYIKAGELESAKRIFSEMPHRDDVSWSTMIVGIAH 246

Query: 291 ------------ELRQEGMRPNEVSLTGALSACAQAGAFEFGRILHGFVEKSGFLQIVSV 350
                       EL++ GM PNEVSLTG LSAC+Q+G+FEFG+ILHGFVEK+G+  IVSV
Sbjct: 247 NGSFNESFLYFRELQRAGMSPNEVSLTGVLSACSQSGSFEFGKILHGFVEKAGYSWIVSV 306

Query: 351 SNALIDTYSKCGNLDMARLIFDNMLE-RSVVSWTAMITGLAMHGYGEEAIRLFNEMEESN 410
           +NALID YS+CGN+ MARL+F+ M E R +VSWT+MI GLAMHG GEEA+RLFNEM    
Sbjct: 307 NNALIDMYSRCGNVPMARLVFEGMQEKRCIVSWTSMIAGLAMHGQGEEAVRLFNEMTAYG 366

Query: 411 IKPDGITFISILYACSHAGLVDLGCSYFSRMVNIYGIEPVIEHYGCIVDLYGRAGKLQQA 470
           + PDGI+FIS+L+ACSHAGL++ G  YFS M  +Y IEP IEHYGC+VDLYGR+GKLQ+A
Sbjct: 367 VTPDGISFISLLHACSHAGLIEEGEDYFSEMKRVYHIEPEIEHYGCMVDLYGRSGKLQKA 426

Query: 471 FDFVSQMPVSPNDIVWRTLLGACGIHGNLELAGQVKRRLFELDPENSGDHVLLSNIYAVA 530
           +DF+ QMP+ P  IVWRTLLGAC  HGN+ELA QVK+RL ELDP NSGD VLLSN YA A
Sbjct: 427 YDFICQMPIPPTAIVWRTLLGACSSHGNIELAEQVKQRLNELDPNNSGDLVLLSNAYATA 486

Query: 531 GKWKDVATLRRSMTHQKLKKTPGWSMIEVDRIMYSFVAGEKQNDIAEEAHQKLREIMSRL 554
           GKWKDVA++R+SM  Q++KKT  WS++EV + MY F AGEK+  I  EAH+KL+EI+ RL
Sbjct: 487 GKWKDVASIRKSMIVQRIKKTTAWSLVEVGKTMYKFTAGEKKKGIDIEAHEKLKEIILRL 546

BLAST of Cp4.1LG20g00350 vs. TAIR 10
Match: AT4G21065.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 436.4 bits (1121), Expect = 3.4e-122
Identity = 226/568 (39.79%), Postives = 340/568 (59.86%), Query Frame = 0

Query: 61  SLRTLKQIHAFTFKTGLNSDPLVAGKLLLHCAVTLPD--SVRYAGRLFLDIRNP-DVFMY 120
           S+  L+QIHAF+ + G++      GK L+   V+LP    + YA ++F  I  P +VF++
Sbjct: 29  SITKLRQIHAFSIRHGVSISDAELGKHLIFYLVSLPSPPPMSYAHKVFSKIEKPINVFIW 88

Query: 121 NTLIRGLSDSDTPSHALLLFVEMRRKSMALPDSFSFAFLLKAA----------------- 180
           NTLIRG ++      A  L+ EMR   +  PD+ ++ FL+KA                  
Sbjct: 89  NTLIRGYAEIGNSISAFSLYREMRVSGLVEPDTHTYPFLIKAVTTMADVRLGETIHSVVI 148

Query: 181 ------------------ANC---------------------RALRNG------------ 240
                             ANC                      ++ NG            
Sbjct: 149 RSGFGSLIYVQNSLLHLYANCGDVASAYKVFDKMPEKDLVAWNSVINGFAENGKPEEALA 208

Query: 241 ---ELRQEGMRPNEVSLTGALSACAQAGAFEFGRILHGFVEKSGFLQIVSVSNALIDTYS 300
              E+  +G++P+  ++   LSACA+ GA   G+ +H ++ K G  + +  SN L+D Y+
Sbjct: 209 LYTEMNSKGIKPDGFTIVSLLSACAKIGALTLGKRVHVYMIKVGLTRNLHSSNVLLDLYA 268

Query: 301 KCGNLDMARLIFDNMLERSVVSWTAMITGLAMHGYGEEAIRLFNEMEES-NIKPDGITFI 360
           +CG ++ A+ +FD M++++ VSWT++I GLA++G+G+EAI LF  ME +  + P  ITF+
Sbjct: 269 RCGRVEEAKTLFDEMVDKNSVSWTSLIVGLAVNGFGKEAIELFKYMESTEGLLPCEITFV 328

Query: 361 SILYACSHAGLVDLGCSYFSRMVNIYGIEPVIEHYGCIVDLYGRAGKLQQAFDFVSQMPV 420
            ILYACSH G+V  G  YF RM   Y IEP IEH+GC+VDL  RAG++++A++++  MP+
Sbjct: 329 GILYACSHCGMVKEGFEYFRRMREEYKIEPRIEHFGCMVDLLARAGQVKKAYEYIKSMPM 388

Query: 421 SPNDIVWRTLLGACGIHGNLELAGQVKRRLFELDPENSGDHVLLSNIYAVAGKWKDVATL 480
            PN ++WRTLLGAC +HG+ +LA   + ++ +L+P +SGD+VLLSN+YA   +W DV  +
Sbjct: 389 QPNVVIWRTLLGACTVHGDSDLAEFARIQILQLEPNHSGDYVLLSNMYASEQRWSDVQKI 448

Query: 481 RRSMTHQKLKKTPGWSMIEVDRIMYSFVAGEKQNDIAEEAHQKLREIMSRLRIEGGYVPE 540
           R+ M    +KK PG S++EV   ++ F+ G+K +  ++  + KL+E+  RLR E GYVP+
Sbjct: 449 RKQMLRDGVKKVPGHSLVEVGNRVHEFLMGDKSHPQSDAIYAKLKEMTGRLRSE-GYVPQ 508

Query: 541 VGSVLHDIEVEEKEDSVSQHSEKLAVAFGMVRLPRGRSIRVVKNLRICRDCHTVMKLISK 554
           + +V  D+E EEKE++V  HSEK+A+AF ++  P    I VVKNLR+C DCH  +KL+SK
Sbjct: 509 ISNVYVDVEEEEKENAVVYHSEKIAIAFMLISTPERSPITVVKNLRVCADCHLAIKLVSK 568

BLAST of Cp4.1LG20g00350 vs. TAIR 10
Match: AT2G29760.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 418.7 bits (1075), Expect = 7.3e-117
Identity = 217/521 (41.65%), Postives = 326/521 (62.57%), Query Frame = 0

Query: 38  ERMERNEMNSREFHCLALFSKCRSLRTL---KQIHAFTFKTGLNSDPLVAGKLLLHCAVT 97
           ++ME  ++ +     + + S C  +R L   +Q+ ++  +  +N +  +A  +L     T
Sbjct: 221 KKMESEDVKASHVTMVGVLSACAKIRNLEFGRQVCSYIEENRVNVNLTLANAML--DMYT 280

Query: 98  LPDSVRYAGRLFLDIRNPDVFMYNTLIRGLSDSDTPSHALLLFVEMRRKSMALPDSFSFA 157
              S+  A RLF  +   D   + T++ G + S+    A  +   M +K +   ++   A
Sbjct: 281 KCGSIEDAKRLFDAMEEKDNVTWTTMLDGYAISEDYEAAREVLNSMPQKDIVAWNALISA 340

Query: 158 FLLKAAANCRALRNGELR-QEGMRPNEVSLTGALSACAQAGAFEFGRILHGFVEKSGFLQ 217
           +      N   +   EL+ Q+ M+ N+++L   LSACAQ GA E GR +H +++K G   
Sbjct: 341 YEQNGKPNEALIVFHELQLQKNMKLNQITLVSTLSACAQVGALELGRWIHSYIKKHGIRM 400

Query: 218 IVSVSNALIDTYSKCGNLDMARLIFDNMLERSVVSWTAMITGLAMHGYGEEAIRLFNEME 277
              V++ALI  YSKCG+L+ +R +F+++ +R V  W+AMI GLAMHG G EA+ +F +M+
Sbjct: 401 NFHVTSALIHMYSKCGDLEKSREVFNSVEKRDVFVWSAMIGGLAMHGCGNEAVDMFYKMQ 460

Query: 278 ESNIKPDGITFISILYACSHAGLVDLGCSYFSRMVNIYGIEPVIEHYGCIVDLYGRAGKL 337
           E+N+KP+G+TF ++  ACSH GLVD   S F +M + YGI P  +HY CIVD+ GR+G L
Sbjct: 461 EANVKPNGVTFTNVFCACSHTGLVDEAESLFHQMESNYGIVPEEKHYACIVDVLGRSGYL 520

Query: 338 QQAFDFVSQMPVSPNDIVWRTLLGACGIHGNLELAGQVKRRLFELDPENSGDHVLLSNIY 397
           ++A  F+  MP+ P+  VW  LLGAC IH NL LA     RL EL+P N G HVLLSNIY
Sbjct: 521 EKAVKFIEAMPIPPSTSVWGALLGACKIHANLNLAEMACTRLLELEPRNDGAHVLLSNIY 580

Query: 398 AVAGKWKDVATLRRSMTHQKLKKTPGWSMIEVDRIMYSFVAGEKQNDIAEEAHQKLREIM 457
           A  GKW++V+ LR+ M    LKK PG S IE+D +++ F++G+  + ++E+ + KL E+M
Sbjct: 581 AKLGKWENVSELRKHMRVTGLKKEPGCSSIEIDGMIHEFLSGDNAHPMSEKVYGKLHEVM 640

Query: 458 SRLRIEGGYVPEVGSVLHDIEVEE-KEDSVSQHSEKLAVAFGMVRLPRGRSIRVVKNLRI 517
            +L+   GY PE+  VL  IE EE KE S++ HSEKLA+ +G++     + IRV+KNLR+
Sbjct: 641 EKLK-SNGYEPEISQVLQIIEEEEMKEQSLNLHSEKLAICYGLISTEAPKVIRVIKNLRV 700

Query: 518 CRDCHTVMKLISKVYEVEIVVRDRSRFHSFTHGSCSCRDYW 554
           C DCH+V KLIS++Y+ EI+VRDR RFH F +G CSC D+W
Sbjct: 701 CGDCHSVAKLISQLYDREIIVRDRYRFHHFRNGQCSCNDFW 738

BLAST of Cp4.1LG20g00350 vs. TAIR 10
Match: AT4G21065.2 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 416.0 bits (1068), Expect = 4.7e-116
Identity = 206/493 (41.78%), Postives = 310/493 (62.88%), Query Frame = 0

Query: 62  LRTLKQIHAFTFKTGLNSDPLVAGKLLLHCAVTLPDSVRYAGRLFLDIRNPDVFMYNTLI 121
           +R  + IH+   ++G  S   V    LLH      D V  A ++F  +   D+  +N++I
Sbjct: 4   VRLGETIHSVVIRSGFGSLIYVQNS-LLHLYANCGD-VASAYKVFDKMPEKDLVAWNSVI 63

Query: 122 RGLSDSDTPSHALLLFVEMRRKSMALPDSFSFAFLLKAAANCRALRNGELRQEGMRPNEV 181
            G +++  P  AL L+ EM  K                               G++P+  
Sbjct: 64  NGFAENGKPEEALALYTEMNSK-------------------------------GIKPDGF 123

Query: 182 SLTGALSACAQAGAFEFGRILHGFVEKSGFLQIVSVSNALIDTYSKCGNLDMARLIFDNM 241
           ++   LSACA+ GA   G+ +H ++ K G  + +  SN L+D Y++CG ++ A+ +FD M
Sbjct: 124 TIVSLLSACAKIGALTLGKRVHVYMIKVGLTRNLHSSNVLLDLYARCGRVEEAKTLFDEM 183

Query: 242 LERSVVSWTAMITGLAMHGYGEEAIRLFNEMEES-NIKPDGITFISILYACSHAGLVDLG 301
           ++++ VSWT++I GLA++G+G+EAI LF  ME +  + P  ITF+ ILYACSH G+V  G
Sbjct: 184 VDKNSVSWTSLIVGLAVNGFGKEAIELFKYMESTEGLLPCEITFVGILYACSHCGMVKEG 243

Query: 302 CSYFSRMVNIYGIEPVIEHYGCIVDLYGRAGKLQQAFDFVSQMPVSPNDIVWRTLLGACG 361
             YF RM   Y IEP IEH+GC+VDL  RAG++++A++++  MP+ PN ++WRTLLGAC 
Sbjct: 244 FEYFRRMREEYKIEPRIEHFGCMVDLLARAGQVKKAYEYIKSMPMQPNVVIWRTLLGACT 303

Query: 362 IHGNLELAGQVKRRLFELDPENSGDHVLLSNIYAVAGKWKDVATLRRSMTHQKLKKTPGW 421
           +HG+ +LA   + ++ +L+P +SGD+VLLSN+YA   +W DV  +R+ M    +KK PG 
Sbjct: 304 VHGDSDLAEFARIQILQLEPNHSGDYVLLSNMYASEQRWSDVQKIRKQMLRDGVKKVPGH 363

Query: 422 SMIEVDRIMYSFVAGEKQNDIAEEAHQKLREIMSRLRIEGGYVPEVGSVLHDIEVEEKED 481
           S++EV   ++ F+ G+K +  ++  + KL+E+  RLR E GYVP++ +V  D+E EEKE+
Sbjct: 364 SLVEVGNRVHEFLMGDKSHPQSDAIYAKLKEMTGRLRSE-GYVPQISNVYVDVEEEEKEN 423

Query: 482 SVSQHSEKLAVAFGMVRLPRGRSIRVVKNLRICRDCHTVMKLISKVYEVEIVVRDRSRFH 541
           +V  HSEK+A+AF ++  P    I VVKNLR+C DCH  +KL+SKVY  EIVVRDRSRFH
Sbjct: 424 AVVYHSEKIAIAFMLISTPERSPITVVKNLRVCADCHLAIKLVSKVYNREIVVRDRSRFH 462

Query: 542 SFTHGSCSCRDYW 554
            F +GSCSC+DYW
Sbjct: 484 HFKNGSCSCQDYW 462

BLAST of Cp4.1LG20g00350 vs. TAIR 10
Match: AT2G02980.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 410.2 bits (1053), Expect = 2.6e-114
Identity = 216/574 (37.63%), Postives = 322/574 (56.10%), Query Frame = 0

Query: 53  LALFSKCRSLRTLKQIHAFTFKTGLNSDPLVAGKLLLHCAVT-LPDSVRYAGRLFLDIRN 112
           + L SKC SLR L QI A+  K+ +     VA KL+  C  +    S+ YA  LF  +  
Sbjct: 33  ILLISKCNSLRELMQIQAYAIKSHIEDVSFVA-KLINFCTESPTESSMSYARHLFEAMSE 92

Query: 113 PDVFMYNTLIRGLSDSDTPSHALLLFVEMRRKSMALPDSFSFAFLLKAAANCRALRNG-- 172
           PD+ ++N++ RG S    P     LFVE+    + LPD+++F  LLKA A  +AL  G  
Sbjct: 93  PDIVIFNSMARGYSRFTNPLEVFSLFVEILEDGI-LPDNYTFPSLLKACAVAKALEEGRQ 152

Query: 173 ------------------------------------------------------------ 232
                                                                       
Sbjct: 153 LHCLSMKLGLDDNVYVCPTLINMYTECEDVDSARCVFDRIVEPCVVCYNAMITGYARRNR 212

Query: 233 ---------ELRQEGMRPNEVSLTGALSACAQAGAFEFGRILHGFVEKSGFLQIVSVSNA 292
                    E++ + ++PNE++L   LS+CA  G+ + G+ +H + +K  F + V V+ A
Sbjct: 213 PNEALSLFREMQGKYLKPNEITLLSVLSSCALLGSLDLGKWIHKYAKKHSFCKYVKVNTA 272

Query: 293 LIDTYSKCGNLDMARLIFDNMLERSVVSWTAMITGLAMHGYGEEAIRLFNEMEESNIKPD 352
           LID ++KCG+LD A  IF+ M  +   +W+AMI   A HG  E+++ +F  M   N++PD
Sbjct: 273 LIDMFAKCGSLDDAVSIFEKMRYKDTQAWSAMIVAYANHGKAEKSMLMFERMRSENVQPD 332

Query: 353 GITFISILYACSHAGLVDLGCSYFSRMVNIYGIEPVIEHYGCIVDLYGRAGKLQQAFDFV 412
            ITF+ +L ACSH G V+ G  YFS+MV+ +GI P I+HYG +VDL  RAG L+ A++F+
Sbjct: 333 EITFLGLLNACSHTGRVEEGRKYFSQMVSKFGIVPSIKHYGSMVDLLSRAGNLEDAYEFI 392

Query: 413 SQMPVSPNDIVWRTLLGACGIHGNLELAGQVKRRLFELDPENSGDHVLLSNIYAVAGKWK 472
            ++P+SP  ++WR LL AC  H NL+LA +V  R+FELD  + GD+V+LSN+YA   KW+
Sbjct: 393 DKLPISPTPMLWRILLAACSSHNNLDLAEKVSERIFELDDSHGGDYVILSNLYARNKKWE 452

Query: 473 DVATLRRSMTHQKLKKTPGWSMIEVDRIMYSFVAGEKQNDIAEEAHQKLREIMSRLRIEG 532
            V +LR+ M  +K  K PG S IEV+ +++ F +G+       + H+ L E++  L++  
Sbjct: 453 YVDSLRKVMKDRKAVKVPGCSSIEVNNVVHEFFSGDGVKSATTKLHRALDEMVKELKL-S 512

Query: 533 GYVPEVGSVLH-DIEVEEKEDSVSQHSEKLAVAFGMVRLPRGRSIRVVKNLRICRDCHTV 554
           GYVP+   V+H ++  +EKE ++  HSEKLA+ FG++  P G +IRVVKNLR+CRDCH  
Sbjct: 513 GYVPDTSMVVHANMNDQEKEITLRYHSEKLAITFGLLNTPPGTTIRVVKNLRVCRDCHNA 572

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q9CA541.8e-18151.02Pentatricopeptide repeat-containing protein At1g74630 OS=Arabidopsis thaliana OX... [more]
A8MQA34.8e-12139.79Pentatricopeptide repeat-containing protein At4g21065 OS=Arabidopsis thaliana OX... [more]
O823801.0e-11541.65Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidop... [more]
Q8LK933.7e-11337.63Pentatricopeptide repeat-containing protein At2g02980, chloroplastic OS=Arabidop... [more]
Q9LW634.6e-10838.97Putative pentatricopeptide repeat-containing protein At3g23330 OS=Arabidopsis th... [more]
Match NameE-valueIdentityDescription
XP_023519288.10.079.28pentatricopeptide repeat-containing protein At1g74630 [Cucurbita pepo subsp. pep... [more]
XP_022923786.10.078.29uncharacterized protein LOC111431396 [Cucurbita moschata][more]
KAG6584460.10.078.50Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma sub... [more]
XP_023001664.10.077.67uncharacterized protein LOC111495738 [Cucurbita maxima][more]
KAG7020047.10.078.34Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma sub... [more]
Match NameE-valueIdentityDescription
A0A6J1E7P00.078.29uncharacterized protein LOC111431396 OS=Cucurbita moschata OX=3662 GN=LOC1114313... [more]
A0A6J1KND00.077.67uncharacterized protein LOC111495738 OS=Cucurbita maxima OX=3661 GN=LOC111495738... [more]
A0A5A7T3830.073.21Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946... [more]
A0A5D3BIR80.073.05Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946... [more]
A0A1S4DUF70.073.05pentatricopeptide repeat-containing protein At1g74630 OS=Cucumis melo OX=3656 GN... [more]
Match NameE-valueIdentityDescription
AT1G74630.11.3e-18251.02Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT4G21065.13.4e-12239.79Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT2G29760.17.3e-11741.65Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT4G21065.24.7e-11641.78Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT2G02980.12.6e-11437.63Pentatricopeptide repeat (PPR) superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableCOILSCoilCoilcoord: 439..459
NoneNo IPR availablePANTHERPTHR47925OS01G0913400 PROTEIN-RELATEDcoord: 170..493
NoneNo IPR availablePANTHERPTHR47925OS01G0913400 PROTEIN-RELATEDcoord: 55..169
NoneNo IPR availablePANTHERPTHR47925:SF111SUBFAMILY NOT NAMEDcoord: 55..169
NoneNo IPR availablePANTHERPTHR47925:SF111SUBFAMILY NOT NAMEDcoord: 170..493
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 219..247
e-value: 2.1E-4
score: 19.2
coord: 116..146
e-value: 0.0025
score: 15.9
coord: 247..280
e-value: 1.1E-8
score: 32.7
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 112..161
e-value: 7.9E-8
score: 32.4
coord: 245..291
e-value: 3.3E-12
score: 46.4
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 319..344
e-value: 0.27
score: 11.6
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 245..279
score: 12.167101
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 113..147
score: 9.788499
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 38..196
e-value: 3.3E-13
score: 51.7
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 197..309
e-value: 1.5E-25
score: 92.2
coord: 310..495
e-value: 7.2E-12
score: 47.4
IPR032867DYW domainPFAMPF14432DYW_deaminasecoord: 418..542
e-value: 3.3E-34
score: 117.5

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG20g00350.1Cp4.1LG20g00350.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0005515 protein binding
molecular_function GO:0008270 zinc ion binding