Cp4.1LG07g05790 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
Name: Cp4.1LG07g05790
Type: gene
Organism: Cucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
Description: Pentatricopeptide repeat-containing protein
Location: Cp4.1LG07: 3974834 .. 3977620 (-)
RNA-Seq Expression: Cp4.1LG07g05790
Synteny: Cp4.1LG07g05790
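
As a quick consistency check on the Location field above, the following minimal Python sketch parses the coordinate string and computes the span length. It assumes the coordinates are 1-based and inclusive (a common convention for such records, but not stated on this page); the function names `parse_location` and `span_length` are hypothetical helpers introduced only for illustration. The protein length of 928 residues is taken from the BLAST results further down.

```python
import re


def parse_location(text):
    """Split a location string such as 'Cp4.1LG07: 3974834 .. 3977620 (-)'
    into (sequence id, start, end, strand)."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand


def span_length(start, end):
    """Length of a span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end, strand = parse_location("Cp4.1LG07: 3974834 .. 3977620 (-)")
length = span_length(start, end)

# 928 codons for the protein plus one stop codon
expected_cds_length = 3 * (928 + 1)

print(seqid, strand, length, length == expected_cds_length)
```

Under that assumption the span is 2787 bp, which equals three times (928 + 1) and so matches a coding sequence of 928 codons plus a stop codon.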
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTATGGAGTCTTCACCGCTATTCGATGCCCTACGATGATTAGAAATTCTTCCGCCATTATCAACTCAGGTCAGCTTCTTATCGTTCTTGGATTCAGGCTCAGATTCACATTTACACTCGCGTTTAAGTTCTTCACCTCAACTACTGCTTCTCTTCCTCAAAGCCTTCCTGTAGAACATGATGTACCGGCGCAGCTTTTTTCCATTCTTTCTCGCCCCGATTGGCAAAAGCATCCTTCTCTGAAAATTTTGATCCCTTCTATTGCGCCATCCCATGTATCTTCCCTTTTTGCCCTCAATCTCGATCCCAAAACTGCTCTTGCGTTTTTTAATTGGATCGAACAGAAGCATGGATTCAAACACAATGTTCAATCCTATGTTTCTATTTTAAATATCCTTGTTCCCAATGGGTACCACCGCATTGCTGAAAAACTGCGAATTTTAATGATTAAGTCGACGAATTCCGCAGAGAATGCGCTGTTCGTGTTGGAAATGCTGCGGAGCATGAACCGCCGGGGGGATGATTTAAGATTTAAGCTTACTCTTAAGAGCTATAACATGCTCTTGATGTTGTTGTCGAGGTTTCTCATGATTGATGAAATGAAAAATGTGTATTTGGAGATGTTGGATGACATGGTTTCGCCGAATATGTACACGCTCAATACAATGGTTAATGGATATTGTAAGTTGGGTAATGTAGTTGAAGCAGAGTTGTACGTCAGTAAGATAGTGCAAGCCGGTTTAAGTTTGGATACATTTACTTATACGTCTTTGATATTAGGATATTGTAGGAATAAAAATGTAGATGGTGCAAATAAAATTTTTCTGTCAATGCCAAGTAAAGGTTGCCGCAGAAATGAGGTTTCTTATACTAATTTGATTCATGGGTTTTGTGAAGCCGGGAGGATTGATGAAGCTCTGAAATTGCTCTCACAAATGCATGAGGATAATTGTTGGCCGACTGTTCGTACATATACAGTTATCATATGTGCATTGTGTCAAATGGGCAGGAAATCAGAAGCATTTAATGTGTTCAAGGAGATGACTGAGAAAGGATGTGAGCCAAATGTACATACCTATACAGTCCTTATTCGCAGTTTATGCGAGGACAATAAGTTTGATGATGCCAAGAAATTGCTAGATGGGATGCTTGAGAAAGGATTGGTTCCAAGTGTGGTCACTTACAATGCCTTTATTGATGGTTATTGCAAGAAAGGAATGAGCACGAGTGCCTTGGAAATTTTGAGCCTGATGGAATCGAATAATTGTAGTCCAAATACTCGCACTTATAATGAATTGATATTGGGATTTTGCAGGGCAAAGAATGTCCACAAGGCCATGTTACTTCTTCATAAAATGCTTGAGCTGAAGCTTCAACCAGATGTAGTTACCTACAACCTATTAATCCATGGACAGTGCAAAGAAGGGCATCTGGGTAGTGCTTATAAGCTGCTCAGTTTGATGAATGAAAATGGTTTGGTTCCTGATGAATGGACTTACAGTGTCTTCATAGCTGTACTCTGCAAAAGGGGGCGGGTTGAAGATGCTCGTTTTCTCTTTGACTCCCTAAAGGAGAAAGGCATAAAGGCAAATGAAGTAATCTATAGTGCATTGATTGATGGCTATTGCAAGGTCGGAAAAGTCAGTGACGGTCATTCATTGCTTGATAAAATGCTTAGTGATGGATGCGTTCCAAATTCAATTACTTATAATTCCTTGATAGATGGACATTGCAAAGAGAAAAATTTTCAAGAAGCTCTTTTACTTGTGGAAATAATGATAAAGAGGGACATTAAGCCTACTGCTGATACTTACACCATTCTTATAAAAAATTTATTAAAAGATGGTGAGTTTGACCGTGCCCATCAGATGTTTGATCAAATGCTTTCCGCAGGTTCTCATCCTGATGTAGTTATCTATACTGTATTTATTCATGCATATTGTAGCCTGGGTAGATTACAAGACGCGGAGTTATTTCTTCATAAAATGAATGAAAAAGG
AATATTGCCAGACACTCTGCTTTATTCTTTATTGATTGATGCATATGGATGGTCTGGATCAATTGACATTGCTTTTGACATTTTGAAGCGTATGCATGATATCGGTTGTGAGCCGTCTTTCTACACATATTCTTATTTAATTAAACATCTATTAAGTGCAAAGCTGATAGAAGTAAATAGCAGTACAGAGTTGGGTGACTTGTCATCAGGGGTGGTTTCCAATGATTTTGCCAACTTATGGAGGAGAGTAGATTATGAATTTGCTTTGGAGTTGTTTGAGGAAATGGTCAAGCAAGGCTGTGCACCTAATGCTAATACTTATGGCAAGTTTATTTCAGGGCTTTGCAAGGTGGGATGCTTGGAAGTAGGCCGCAGGTTGTTTGATCATATGAAAGAAAAAGGACTATCGCCTAATGAAGACATTTATAACTCTCTTCTTGGTTGTTCGTGTCAATTAGGATTGTATGAAAAAGCAATAAGGTGGTTAGATATCATGGTAGAGCATGGATATTTACCACATTTAGATTCTTGCAAGCTGCTGCTCTGTGGCTTGTTTGACGAAGGAAATAACGAGAAAGCAAAAACAGTGTTTCATAGTTTGCTTCAGTGTGGGTATAATTATGATGAAATTGCTTGGAAATTACTTATTGATGGCTTACTTCAGAAGGGCCTTGTTGATAAATGCTCTGAGCTATTTGGCGTCATGGAGAGACAAGGTTGCCAAATTCATCCCAAGACATATAGTATGTTGATTGAGGGATTTGATGATATTCAGGATATAGATTAA

mRNA sequence

ATGTATGGAGTCTTCACCGCTATTCGATGCCCTACGATGATTAGAAATTCTTCCGCCATTATCAACTCAGGTCAGCTTCTTATCGTTCTTGGATTCAGGCTCAGATTCACATTTACACTCGCGTTTAAGTTCTTCACCTCAACTACTGCTTCTCTTCCTCAAAGCCTTCCTGTAGAACATGATGTACCGGCGCAGCTTTTTTCCATTCTTTCTCGCCCCGATTGGCAAAAGCATCCTTCTCTGAAAATTTTGATCCCTTCTATTGCGCCATCCCATGTATCTTCCCTTTTTGCCCTCAATCTCGATCCCAAAACTGCTCTTGCGTTTTTTAATTGGATCGAACAGAAGCATGGATTCAAACACAATGTTCAATCCTATGTTTCTATTTTAAATATCCTTGTTCCCAATGGGTACCACCGCATTGCTGAAAAACTGCGAATTTTAATGATTAAGTCGACGAATTCCGCAGAGAATGCGCTGTTCGTGTTGGAAATGCTGCGGAGCATGAACCGCCGGGGGGATGATTTAAGATTTAAGCTTACTCTTAAGAGCTATAACATGCTCTTGATGTTGTTGTCGAGGTTTCTCATGATTGATGAAATGAAAAATGTGTATTTGGAGATGTTGGATGACATGGTTTCGCCGAATATGTACACGCTCAATACAATGGTTAATGGATATTGTAAGTTGGGTAATGTAGTTGAAGCAGAGTTGTACGTCAGTAAGATAGTGCAAGCCGGTTTAAGTTTGGATACATTTACTTATACGTCTTTGATATTAGGATATTGTAGGAATAAAAATGTAGATGGTGCAAATAAAATTTTTCTGTCAATGCCAAGTAAAGGTTGCCGCAGAAATGAGGTTTCTTATACTAATTTGATTCATGGGTTTTGTGAAGCCGGGAGGATTGATGAAGCTCTGAAATTGCTCTCACAAATGCATGAGGATAATTGTTGGCCGACTGTTCGTACATATACAGTTATCATATGTGCATTGTGTCAAATGGGCAGGAAATCAGAAGCATTTAATGTGTTCAAGGAGATGACTGAGAAAGGATGTGAGCCAAATGTACATACCTATACAGTCCTTATTCGCAGTTTATGCGAGGACAATAAGTTTGATGATGCCAAGAAATTGCTAGATGGGATGCTTGAGAAAGGATTGGTTCCAAGTGTGGTCACTTACAATGCCTTTATTGATGGTTATTGCAAGAAAGGAATGAGCACGAGTGCCTTGGAAATTTTGAGCCTGATGGAATCGAATAATTGTAGTCCAAATACTCGCACTTATAATGAATTGATATTGGGATTTTGCAGGGCAAAGAATGTCCACAAGGCCATGTTACTTCTTCATAAAATGCTTGAGCTGAAGCTTCAACCAGATGTAGTTACCTACAACCTATTAATCCATGGACAGTGCAAAGAAGGGCATCTGGGTAGTGCTTATAAGCTGCTCAGTTTGATGAATGAAAATGGTTTGGTTCCTGATGAATGGACTTACAGTGTCTTCATAGCTGTACTCTGCAAAAGGGGGCGGGTTGAAGATGCTCGTTTTCTCTTTGACTCCCTAAAGGAGAAAGGCATAAAGGCAAATGAAGTAATCTATAGTGCATTGATTGATGGCTATTGCAAGGTCGGAAAAGTCAGTGACGGTCATTCATTGCTTGATAAAATGCTTAGTGATGGATGCGTTCCAAATTCAATTACTTATAATTCCTTGATAGATGGACATTGCAAAGAGAAAAATTTTCAAGAAGCTCTTTTACTTGTGGAAATAATGATAAAGAGGGACATTAAGCCTACTGCTGATACTTACACCATTCTTATAAAAAATTTATTAAAAGATGGTGAGTTTGACCGTGCCCATCAGATGTTTGATCAAATGCTTTCCGCAGGTTCTCATCCTGATGTAGTTATCTATACTGTATTTATTCATGCATATTGTAGCCTGGGTAGATTACAAGACGCGGAGTTATTTCTTCATAAAATGAATGAAAAAGG
AATATTGCCAGACACTCTGCTTTATTCTTTATTGATTGATGCATATGGATGGTCTGGATCAATTGACATTGCTTTTGACATTTTGAAGCGTATGCATGATATCGGTTGTGAGCCGTCTTTCTACACATATTCTTATTTAATTAAACATCTATTAAGTGCAAAGCTGATAGAAGTAAATAGCAGTACAGAGTTGGGTGACTTGTCATCAGGGGTGGTTTCCAATGATTTTGCCAACTTATGGAGGAGAGTAGATTATGAATTTGCTTTGGAGTTGTTTGAGGAAATGGTCAAGCAAGGCTGTGCACCTAATGCTAATACTTATGGCAAGTTTATTTCAGGGCTTTGCAAGGTGGGATGCTTGGAAGTAGGCCGCAGGTTGTTTGATCATATGAAAGAAAAAGGACTATCGCCTAATGAAGACATTTATAACTCTCTTCTTGGTTGTTCGTGTCAATTAGGATTGTATGAAAAAGCAATAAGGTGGTTAGATATCATGGTAGAGCATGGATATTTACCACATTTAGATTCTTGCAAGCTGCTGCTCTGTGGCTTGTTTGACGAAGGAAATAACGAGAAAGCAAAAACAGTGTTTCATAGTTTGCTTCAGTGTGGGTATAATTATGATGAAATTGCTTGGAAATTACTTATTGATGGCTTACTTCAGAAGGGCCTTGTTGATAAATGCTCTGAGCTATTTGGCGTCATGGAGAGACAAGGTTGCCAAATTCATCCCAAGACATATAGTATGTTGATTGAGGGATTTGATGATATTCAGGATATAGATTAA

Coding sequence (CDS)

ATGTATGGAGTCTTCACCGCTATTCGATGCCCTACGATGATTAGAAATTCTTCCGCCATTATCAACTCAGGTCAGCTTCTTATCGTTCTTGGATTCAGGCTCAGATTCACATTTACACTCGCGTTTAAGTTCTTCACCTCAACTACTGCTTCTCTTCCTCAAAGCCTTCCTGTAGAACATGATGTACCGGCGCAGCTTTTTTCCATTCTTTCTCGCCCCGATTGGCAAAAGCATCCTTCTCTGAAAATTTTGATCCCTTCTATTGCGCCATCCCATGTATCTTCCCTTTTTGCCCTCAATCTCGATCCCAAAACTGCTCTTGCGTTTTTTAATTGGATCGAACAGAAGCATGGATTCAAACACAATGTTCAATCCTATGTTTCTATTTTAAATATCCTTGTTCCCAATGGGTACCACCGCATTGCTGAAAAACTGCGAATTTTAATGATTAAGTCGACGAATTCCGCAGAGAATGCGCTGTTCGTGTTGGAAATGCTGCGGAGCATGAACCGCCGGGGGGATGATTTAAGATTTAAGCTTACTCTTAAGAGCTATAACATGCTCTTGATGTTGTTGTCGAGGTTTCTCATGATTGATGAAATGAAAAATGTGTATTTGGAGATGTTGGATGACATGGTTTCGCCGAATATGTACACGCTCAATACAATGGTTAATGGATATTGTAAGTTGGGTAATGTAGTTGAAGCAGAGTTGTACGTCAGTAAGATAGTGCAAGCCGGTTTAAGTTTGGATACATTTACTTATACGTCTTTGATATTAGGATATTGTAGGAATAAAAATGTAGATGGTGCAAATAAAATTTTTCTGTCAATGCCAAGTAAAGGTTGCCGCAGAAATGAGGTTTCTTATACTAATTTGATTCATGGGTTTTGTGAAGCCGGGAGGATTGATGAAGCTCTGAAATTGCTCTCACAAATGCATGAGGATAATTGTTGGCCGACTGTTCGTACATATACAGTTATCATATGTGCATTGTGTCAAATGGGCAGGAAATCAGAAGCATTTAATGTGTTCAAGGAGATGACTGAGAAAGGATGTGAGCCAAATGTACATACCTATACAGTCCTTATTCGCAGTTTATGCGAGGACAATAAGTTTGATGATGCCAAGAAATTGCTAGATGGGATGCTTGAGAAAGGATTGGTTCCAAGTGTGGTCACTTACAATGCCTTTATTGATGGTTATTGCAAGAAAGGAATGAGCACGAGTGCCTTGGAAATTTTGAGCCTGATGGAATCGAATAATTGTAGTCCAAATACTCGCACTTATAATGAATTGATATTGGGATTTTGCAGGGCAAAGAATGTCCACAAGGCCATGTTACTTCTTCATAAAATGCTTGAGCTGAAGCTTCAACCAGATGTAGTTACCTACAACCTATTAATCCATGGACAGTGCAAAGAAGGGCATCTGGGTAGTGCTTATAAGCTGCTCAGTTTGATGAATGAAAATGGTTTGGTTCCTGATGAATGGACTTACAGTGTCTTCATAGCTGTACTCTGCAAAAGGGGGCGGGTTGAAGATGCTCGTTTTCTCTTTGACTCCCTAAAGGAGAAAGGCATAAAGGCAAATGAAGTAATCTATAGTGCATTGATTGATGGCTATTGCAAGGTCGGAAAAGTCAGTGACGGTCATTCATTGCTTGATAAAATGCTTAGTGATGGATGCGTTCCAAATTCAATTACTTATAATTCCTTGATAGATGGACATTGCAAAGAGAAAAATTTTCAAGAAGCTCTTTTACTTGTGGAAATAATGATAAAGAGGGACATTAAGCCTACTGCTGATACTTACACCATTCTTATAAAAAATTTATTAAAAGATGGTGAGTTTGACCGTGCCCATCAGATGTTTGATCAAATGCTTTCCGCAGGTTCTCATCCTGATGTAGTTATCTATACTGTATTTATTCATGCATATTGTAGCCTGGGTAGATTACAAGACGCGGAGTTATTTCTTCATAAAATGAATGAAAAAGG
AATATTGCCAGACACTCTGCTTTATTCTTTATTGATTGATGCATATGGATGGTCTGGATCAATTGACATTGCTTTTGACATTTTGAAGCGTATGCATGATATCGGTTGTGAGCCGTCTTTCTACACATATTCTTATTTAATTAAACATCTATTAAGTGCAAAGCTGATAGAAGTAAATAGCAGTACAGAGTTGGGTGACTTGTCATCAGGGGTGGTTTCCAATGATTTTGCCAACTTATGGAGGAGAGTAGATTATGAATTTGCTTTGGAGTTGTTTGAGGAAATGGTCAAGCAAGGCTGTGCACCTAATGCTAATACTTATGGCAAGTTTATTTCAGGGCTTTGCAAGGTGGGATGCTTGGAAGTAGGCCGCAGGTTGTTTGATCATATGAAAGAAAAAGGACTATCGCCTAATGAAGACATTTATAACTCTCTTCTTGGTTGTTCGTGTCAATTAGGATTGTATGAAAAAGCAATAAGGTGGTTAGATATCATGGTAGAGCATGGATATTTACCACATTTAGATTCTTGCAAGCTGCTGCTCTGTGGCTTGTTTGACGAAGGAAATAACGAGAAAGCAAAAACAGTGTTTCATAGTTTGCTTCAGTGTGGGTATAATTATGATGAAATTGCTTGGAAATTACTTATTGATGGCTTACTTCAGAAGGGCCTTGTTGATAAATGCTCTGAGCTATTTGGCGTCATGGAGAGACAAGGTTGCCAAATTCATCCCAAGACATATAGTATGTTGATTGAGGGATTTGATGATATTCAGGATATAGATTAA

Protein sequence

MYGVFTAIRCPTMIRNSSAIINSGQLLIVLGFRLRFTFTLAFKFFTSTTASLPQSLPVEHDVPAQLFSILSRPDWQKHPSLKILIPSIAPSHVSSLFALNLDPKTALAFFNWIEQKHGFKHNVQSYVSILNILVPNGYHRIAEKLRILMIKSTNSAENALFVLEMLRSMNRRGDDLRFKLTLKSYNMLLMLLSRFLMIDEMKNVYLEMLDDMVSPNMYTLNTMVNGYCKLGNVVEAELYVSKIVQAGLSLDTFTYTSLILGYCRNKNVDGANKIFLSMPSKGCRRNEVSYTNLIHGFCEAGRIDEALKLLSQMHEDNCWPTVRTYTVIICALCQMGRKSEAFNVFKEMTEKGCEPNVHTYTVLIRSLCEDNKFDDAKKLLDGMLEKGLVPSVVTYNAFIDGYCKKGMSTSALEILSLMESNNCSPNTRTYNELILGFCRAKNVHKAMLLLHKMLELKLQPDVVTYNLLIHGQCKEGHLGSAYKLLSLMNENGLVPDEWTYSVFIAVLCKRGRVEDARFLFDSLKEKGIKANEVIYSALIDGYCKVGKVSDGHSLLDKMLSDGCVPNSITYNSLIDGHCKEKNFQEALLLVEIMIKRDIKPTADTYTILIKNLLKDGEFDRAHQMFDQMLSAGSHPDVVIYTVFIHAYCSLGRLQDAELFLHKMNEKGILPDTLLYSLLIDAYGWSGSIDIAFDILKRMHDIGCEPSFYTYSYLIKHLLSAKLIEVNSSTELGDLSSGVVSNDFANLWRRVDYEFALELFEEMVKQGCAPNANTYGKFISGLCKVGCLEVGRRLFDHMKEKGLSPNEDIYNSLLGCSCQLGLYEKAIRWLDIMVEHGYLPHLDSCKLLLCGLFDEGNNEKAKTVFHSLLQCGYNYDEIAWKLLIDGLLQKGLVDKCSELFGVMERQGCQIHPKTYSMLIEGFDDIQDID
Homology
BLAST of Cp4.1LG07g05790 vs. ExPASy Swiss-Prot
Match: Q9LSL9 (Pentatricopeptide repeat-containing protein At5g65560 OS=Arabidopsis thaliana OX=3702 GN=At5g65560 PE=2 SV=1)

HSP 1 Score: 965.3 bits (2494), Expect = 4.9e-280
Identity = 494/931 (53.06%), Positives = 648/931 (69.60%), Query Frame = 0

Query: 13  MIRNSSAIINSGQLLIVLGFRLRFTFTLAFKFFTSTTASLP-------------QSLPVE 72
           MIR      NSG    V  F +     L  KF T  T   P             ++LP E
Sbjct: 1   MIRRIQPRCNSGLTGSVSAFEV-----LKKKFSTDVTVPSPVTRRQFCSVSPLLRNLPEE 60

Query: 73  H----DVPAQLFSILSRPDWQKHPSLKILIPSIAPSHVSSLFALNLDPKTALAFFNWIEQ 132
                 VP +L SILS+P+W K PSLK ++ +I+PSHVSSLF+L+LDPKTAL F +WI Q
Sbjct: 61  ESDSMSVPHRLLSILSKPNWHKSPSLKSMVSAISPSHVSSLFSLDLDPKTALNFSHWISQ 120

Query: 133 KHGFKHNVQSYVSILNILVPNGYHRIAEKLRILMIKSTNSAENALFVLEMLRSMNR-RGD 192
              +KH+V SY S+L +L+ NGY  +  K+R+LMIKS +S  +AL+VL++ R MN+    
Sbjct: 121 NPRYKHSVYSYASLLTLLINNGYVGVVFKIRLLMIKSCDSVGDALYVLDLCRKMNKDERF 180

Query: 193 DLRFKLTLKSYNMLLMLLSRFLMIDEMKNVYLEMLDDMVSPNMYTLNTMVNGYCKLGNVV 252
           +L++KL +  YN LL  L+RF ++DEMK VY+EML+D V PN+YT N MVNGYCKLGNV 
Sbjct: 181 ELKYKLIIGCYNTLLNSLARFGLVDEMKQVYMEMLEDKVCPNIYTYNKMVNGYCKLGNVE 240

Query: 253 EAELYVSKIVQAGLSLDTFTYTSLILGYCRNKNVDGANKIFLSMPSKGCRRNEVSYTNLI 312
           EA  YVSKIV+AGL  D FTYTSLI+GYC+ K++D A K+F  MP KGCRRNEV+YT+LI
Sbjct: 241 EANQYVSKIVEAGLDPDFFTYTSLIMGYCQRKDLDSAFKVFNEMPLKGCRRNEVAYTHLI 300

Query: 313 HGFCEAGRIDEALKLLSQMHEDNCWPTVRTYTVIICALCQMGRKSEAFNVFKEMTEKGCE 372
           HG C A RIDEA+ L  +M +D C+PTVRTYTV+I +LC   RKSEA N+ KEM E G +
Sbjct: 301 HGLCVARRIDEAMDLFVKMKDDECFPTVRTYTVLIKSLCGSERKSEALNLVKEMEETGIK 360

Query: 373 PNVHTYTVLIRSLCEDNKFDDAKKLLDGMLEKGLVPSVVTYNAFIDGYCKKGMSTSALEI 432
           PN+HTYTVLI SLC   KF+ A++LL  MLEKGL+P+V+TYNA I+GYCK+GM   A+++
Sbjct: 361 PNIHTYTVLIDSLCSQCKFEKARELLGQMLEKGLMPNVITYNALINGYCKRGMIEDAVDV 420

Query: 433 LSLMESNNCSPNTRTYNELILGFCRAKNVHKAMLLLHKMLELKLQPDVVTYNLLIHGQCK 492
           + LMES   SPNTRTYNELI G+C++ NVHKAM +L+KMLE K+ PDVVTYN LI GQC+
Sbjct: 421 VELMESRKLSPNTRTYNELIKGYCKS-NVHKAMGVLNKMLERKVLPDVVTYNSLIDGQCR 480

Query: 493 EGHLGSAYKLLSLMNENGLVPDEWTYSVFIAVLCKRGRVEDARFLFDSLKEKGIKANEVI 552
            G+  SAY+LLSLMN+ GLVPD+WTY+  I  LCK  RVE+A  LFDSL++KG+  N V+
Sbjct: 481 SGNFDSAYRLLSLMNDRGLVPDQWTYTSMIDSLCKSKRVEEACDLFDSLEQKGVNPNVVM 540

Query: 553 YSALIDGYCKVGKVSDGHSLLDKMLSDGCVPNSITYNSLIDGHCKEKNFQEALLLVEIMI 612
           Y+ALIDGYCK GKV + H +L+KMLS  C+PNS+T+N+LI G C +   +EA LL E M+
Sbjct: 541 YTALIDGYCKAGKVDEAHLMLEKMLSKNCLPNSLTFNALIHGLCADGKLKEATLLEEKMV 600

Query: 613 KRDIKPTADTYTILIKNLLKDGEFDRAHQMFDQMLSAGSHPDVVIYTVFIHAYCSLGRLQ 672
           K  ++PT  T TILI  LLKDG+FD A+  F QMLS+G+ PD   YT FI  YC  GRL 
Sbjct: 601 KIGLQPTVSTDTILIHRLLKDGDFDHAYSRFQQMLSSGTKPDAHTYTTFIQTYCREGRLL 660

Query: 673 DAELFLHKMNEKGILPDTLLYSLLIDAYGWSGSIDIAFDILKRMHDIGCEPSFYTYSYLI 732
           DAE  + KM E G+ PD   YS LI  YG  G  + AFD+LKRM D GCEPS +T+  LI
Sbjct: 661 DAEDMMAKMRENGVSPDLFTYSSLIKGYGDLGQTNFAFDVLKRMRDTGCEPSQHTFLSLI 720

Query: 733 KHLLSAKL-IEVNSSTELGDLSSGVVSNDFANLWRRVDYEFALELFEEMVKQGCAPNANT 792
           KHLL  K   +  S  EL  +S+             ++++  +EL E+MV+    PNA +
Sbjct: 721 KHLLEMKYGKQKGSEPELCAMSN------------MMEFDTVVELLEKMVEHSVTPNAKS 780

Query: 793 YGKFISGLCKVGCLEVGRRLFDHM-KEKGLSPNEDIYNSLLGCSCQLGLYEKAIRWLDIM 852
           Y K I G+C+VG L V  ++FDHM + +G+SP+E ++N+LL C C+L  + +A + +D M
Sbjct: 781 YEKLILGICEVGNLRVAEKVFDHMQRNEGISPSELVFNALLSCCCKLKKHNEAAKVVDDM 840

Query: 853 VEHGYLPHLDSCKLLLCGLFDEGNNEKAKTVFHSLLQCGYNYDEIAWKLLIDGLLQKGLV 912
           +  G+LP L+SCK+L+CGL+ +G  E+  +VF +LLQCGY  DE+AWK++IDG+ ++GLV
Sbjct: 841 ICVGHLPQLESCKVLICGLYKKGEKERGTSVFQNLLQCGYYEDELAWKIIIDGVGKQGLV 900

Query: 913 DKCSELFGVMERQGCQIHPKTYSMLIEGFDD 924
           +   ELF VME+ GC+   +TYS+LIEG  D
Sbjct: 901 EAFYELFNVMEKNGCKFSSQTYSLLIEGPPD 913

BLAST of Cp4.1LG07g05790 vs. ExPASy Swiss-Prot
Match: Q9SFV9 (Pentatricopeptide repeat-containing protein At3g07290, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At3g07290 PE=2 SV=1)

HSP 1 Score: 481.1 bits (1237), Expect = 2.8e-134
Identity = 294/897 (32.78%), Positives = 479/897 (53.40%), Query Frame = 0

Query: 21  INSGQLLIVLGFRLRFTFTLAFKFFTSTTASLPQSLPV-EHDVPAQLFSILSRPDWQKHP 80
           I S + ++ LG   R  F     F  S+  SL  S  V  HDV     S+L  P+W+K+ 
Sbjct: 6   IRSTRKILALG---RHVFPSNAFFSVSSRPSLSSSDEVAAHDVA----SLLKTPNWEKNS 65

Query: 81  SLKILIPSIAPSHVSSLFAL-NLDPKTALAFFNWIEQKHGFKHNVQSYVSILNILVPNGY 140
           SLK L+  + P+  S + +L   D    + FF W+ +   +  +      +L ++V +G 
Sbjct: 66  SLKSLVSHMNPNVASQVISLQRSDNDICVRFFMWVCKHSSYCFDPTQKNQLLKLIVSSGL 125

Query: 141 HRIAEKLRILMIKSTNSAENALFVLEMLRSMNRRGDDLRFKLTLKSYNMLLMLLSRFLMI 200
           +R+A  + + +IK  +  E  +  L+++   +   +   F+L    Y+ LLM L++  + 
Sbjct: 126 YRVAHAVIVALIKECSRCEKEM--LKLMYCFDELREVFGFRLNYPCYSSLLMSLAKLDLG 185

Query: 201 DEMKNVYLEMLDDMVSPNMYTLNTMVNGYCKLGNVVEAELYVSKIVQAGLSLDTFTYTSL 260
                 Y  M  D     M    T+VN  CK G    AE+++SKI++ G  LD+   TSL
Sbjct: 186 FLAYVTYRRMEADGFVVGMIDYRTIVNALCKNGYTEAAEMFMSKILKIGFVLDSHIGTSL 245

Query: 261 ILGYCRNKNVDGANKIFLSMPSK-GCRRNEVSYTNLIHGFCEAGRIDEALKLLSQMHEDN 320
           +LG+CR  N+  A K+F  M  +  C  N VSY+ LIHG CE GR++EA  L  QM E  
Sbjct: 246 LLGFCRGLNLRDALKVFDVMSKEVTCAPNSVSYSILIHGLCEVGRLEEAFGLKDQMGEKG 305

Query: 321 CWPTVRTYTVIICALCQMGRKSEAFNVFKEMTEKGCEPNVHTYTVLIRSLCEDNKFDDAK 380
           C P+ RTYTV+I ALC  G   +AFN+F EM  +GC+PNVHTYTVLI  LC D K ++A 
Sbjct: 306 CQPSTRTYTVLIKALCDRGLIDKAFNLFDEMIPRGCKPNVHTYTVLIDGLCRDGKIEEAN 365

Query: 381 KLLDGMLEKGLVPSVVTYNAFIDGYCKKGMSTSALEILSLMESNNCSPNTRTYNELILGF 440
            +   M++  + PSV+TYNA I+GYCK G    A E+L++ME   C PN RT+NEL+ G 
Sbjct: 366 GVCRKMVKDRIFPSVITYNALINGYCKDGRVVPAFELLTVMEKRACKPNVRTFNELMEGL 425

Query: 441 CRAKNVHKAMLLLHKMLELKLQPDVVTYNLLIHGQCKEGHLGSAYKLLSLMNENGLVPDE 500
           CR    +KA+ LL +ML+  L PD+V+YN+LI G C+EGH+ +AYKLLS MN   + PD 
Sbjct: 426 CRVGKPYKAVHLLKRMLDNGLSPDIVSYNVLIDGLCREGHMNTAYKLLSSMNCFDIEPDC 485

Query: 501 WTYSVFIAVLCKRGRVEDARFLFDSLKEKGIKANEVIYSALIDGYCKVGKVSDGHSLLDK 560
            T++  I   CK+G+ + A      +  KGI  +EV  + LIDG CKVGK  D   +L+ 
Sbjct: 486 LTFTAIINAFCKQGKADVASAFLGLMLRKGISLDEVTGTTLIDGVCKVGKTRDALFILET 545

Query: 561 MLSDGCVPNSITYNSLIDGHCKEKNFQEALLLVEIMIKRDIKPTADTYTILIKNLLKDGE 620
           ++    +    + N ++D   K    +E L ++  + K  + P+  TYT L+  L++ G+
Sbjct: 546 LVKMRILTTPHSLNVILDMLSKGCKVKEELAMLGKINKLGLVPSVVTYTTLVDGLIRSGD 605

Query: 621 FDRAHQMFDQMLSAGSHPDVVIYTVFIHAYCSLGRLQDAELFLHKMNEKGILPDTLLYSL 680
              + ++ + M  +G  P+V  YT+ I+  C  GR+++AE  L  M + G+ P+ + Y++
Sbjct: 606 ITGSFRILELMKLSGCLPNVYPYTIIINGLCQFGRVEEAEKLLSAMQDSGVSPNHVTYTV 665

Query: 681 LIDAYGWSGSIDIAFDILKRMHDIGCEPSFYTYSYLIK-HLLSAKLIEVNSSTELGDLSS 740
           ++  Y  +G +D A + ++ M + G E +   YS L++  +LS K I+ +  + + D++ 
Sbjct: 666 MVKGYVNNGKLDRALETVRAMVERGYELNDRIYSSLLQGFVLSQKGIDNSEESTVSDIA- 725

Query: 741 GVVSNDFANLWRRVDYEFALELFEEMVKQ--GCAPNANTYGKFISGLCKVGCLEVGRRLF 800
                      R  D E   EL   +V+Q  GC      +   ++ LCK G  +    L 
Sbjct: 726 ----------LRETDPECINELI-SVVEQLGGCISGLCIF--LVTRLCKEGRTDESNDLV 785

Query: 801 DHMKEKGLSPNEDIYNSLLGCSCQLGLYEKAIRWLDIMVEHGYLPHLDSCKLLLCGLFDE 860
            ++ E+G+   E   + ++   C    + K +  + ++++ G++P   S  L++ GL  E
Sbjct: 786 QNVLERGVF-LEKAMDIIMESYCSKKKHTKCMELITLVLKSGFVPSFKSFCLVIQGLKKE 845

Query: 861 GNNEKAKTVFHSLLQCGYNYDEIAWKLLIDGLLQKGLVDKCSELFGVMERQGCQIHP 912
           G+ E+A+ +   LL      ++      ++ L++      CSE+  ++++  C+  P
Sbjct: 846 GDAERARELVMELLTSNGVVEKSGVLTYVECLMEGDETGDCSEVIDLVDQLHCRERP 878

BLAST of Cp4.1LG07g05790 vs. ExPASy Swiss-Prot
Match: Q76C99 (Protein Rf1, mitochondrial OS=Oryza sativa subsp. indica OX=39946 GN=Rf1 PE=2 SV=1)

HSP 1 Score: 330.5 bits (846), Expect = 6.2e-89
Identity = 187/713 (26.23%), Positives = 335/713 (46.98%), Query Frame = 0

Query: 211 DMVSPNMYTLNTMVNGYCKLGNVVEAELYVSKIVQAGLSLDTFTYTSLILGYCRNKNVDG 270
           D V+P++ T   ++   C+ G +      +  +++ G  +D   +T L+ G C +K    
Sbjct: 81  DEVTPDLCTYGILIGCCCRAGRLDLGFAALGNVIKKGFRVDAIAFTPLLKGLCADKRTSD 140

Query: 271 ANKIFL-SMPSKGCRRNEVSYTNLIHGFCEAGRIDEALKLLSQMHED---NCWPTVRTYT 330
           A  I L  M   GC  N  SY  L+ G C+  R  EAL+LL  M +D      P V +YT
Sbjct: 141 AMDIVLRRMTELGCIPNVFSYNILLKGLCDENRSQEALELLHMMADDRGGGSPPDVVSYT 200

Query: 331 VIICALCQMGRKSEAFNVFKEMTEKGCEPNVHTYTVLIRSLCEDNKFDDAKKLLDGMLEK 390
            +I    + G   +A++ + EM ++G  P+V TY  +I +LC+    D A ++L+ M++ 
Sbjct: 201 TVINGFFKEGDSDKAYSTYHEMLDRGILPDVVTYNSIIAALCKAQAMDKAMEVLNTMVKN 260

Query: 391 GLVPSVVTYNAFIDGYCKKGMSTSALEILSLMESNNCSPNTRTYNELILGFCRAKNVHKA 450
           G++P  +TYN+ + GYC  G    A+  L  M S+   P+  TY+ L+   C+     +A
Sbjct: 261 GVMPDCMTYNSILHGYCSSGQPKEAIGFLKKMRSDGVEPDVVTYSLLMDYLCKNGRCMEA 320

Query: 451 MLLLHKMLELKLQPDVVTYNLLIHGQCKEGHLGSAYKLLSLMNENGLVPDEWTYSVFIAV 510
             +   M +  L+P++ TY  L+ G   +G L   + LL LM  NG+ PD + +S+ I  
Sbjct: 321 RKIFDSMTKRGLKPEITTYGTLLQGYATKGALVEMHGLLDLMVRNGIHPDHYVFSILICA 380

Query: 511 LCKRGRVEDARFLFDSLKEKGIKANEVIYSALIDGYCKVGKVSDGHSLLDKMLSDGCVPN 570
             K+G+V+ A  +F  ++++G+  N V Y A+I   CK G+V D     ++M+ +G  P 
Sbjct: 381 YAKQGKVDQAMLVFSKMRQQGLNPNAVTYGAVIGILCKSGRVEDAMLYFEQMIDEGLSPG 440

Query: 571 SITYNSLIDGHCKEKNFQEALLLVEIMIKRDIKPTADTYTILIKNLLKDGEFDRAHQMFD 630
           +I YNSLI G C    ++ A  L+  M+ R I      +  +I +  K+G    + ++F+
Sbjct: 441 NIVYNSLIHGLCTCNKWERAEELILEMLDRGICLNTIFFNSIIDSHCKEGRVIESEKLFE 500

Query: 631 QMLSAGSHPDVVIYTVFIHAYCSLGRLQDAELFLHKMNEKGILPDTLLYSLLIDAYGWSG 690
            M+  G  P+V+ Y   I+ YC  G++ +A   L  M   G+ P+T+ YS LI+ Y    
Sbjct: 501 LMVRIGVKPNVITYNTLINGYCLAGKMDEAMKLLSGMVSVGLKPNTVTYSTLINGYCKIS 560

Query: 691 SIDIAFDILKRMHDIGCEPSFYTYSYLIKHLLSAKLIEVNSSTELGDLSSGVVSNDFANL 750
            ++ A  + K M   G  P   TY+ +++ L   +                         
Sbjct: 561 RMEDALVLFKEMESSGVSPDIITYNIILQGLFQTRRTAA--------------------- 620

Query: 751 WRRVDYEFALELFEEMVKQGCAPNANTYGKFISGLCKVGCLEVGRRLFDHMKEKGLSPNE 810
                   A EL+  + + G     +TY   + GLCK    +   ++F ++    L    
Sbjct: 621 --------AKELYVRITESGTQIELSTYNIILHGLCKNKLTDDALQMFQNLCLMDLKLEA 680

Query: 811 DIYNSLLGCSCQLGLYEKAIRWLDIMVEHGYLPHLDSCKLLLCGLFDEGNNEKAKTVFHS 870
             +N ++    ++G  ++A         +G +P+  + +L+   +  +G  E+   +F S
Sbjct: 681 RTFNIMIDALLKVGRNDEAKDLFVAFSSNGLVPNYWTYRLMAENIIGQGLLEELDQLFLS 740

Query: 871 LLQCGYNYDEIAWKLLIDGLLQKGLVDKCSELFGVMERQGCQIHPKTYSMLIE 920
           +   G   D      ++  LLQ+G + +      +++ +   +   T S+ I+
Sbjct: 741 MEDNGCTVDSGMLNFIVRELLQRGEITRAGTYLSMIDEKHFSLEASTASLFID 764

BLAST of Cp4.1LG07g05790 vs. ExPASy Swiss-Prot
Match: Q9FJE6 (Putative pentatricopeptide repeat-containing protein At5g59900 OS=Arabidopsis thaliana OX=3702 GN=At5g59900 PE=3 SV=1)

HSP 1 Score: 327.8 bits (839), Expect = 4.0e-88
Identity = 233/877 (26.57%), Positives = 419/877 (47.78%), Query Frame = 0

Query: 38  FTLAFKFFTSTTASLPQSLPVEHD---VPAQLFSILSRPDWQKHPSLKILIPSIAPSHVS 97
           F  +F+  +S   S  +   +  D   V A    +  +  W+   S +++   +   HV 
Sbjct: 15  FRNSFRNVSSVIDSAQEECRIAEDKQFVDAVKRIVRGKRSWEIALSSELVSRRLKTVHVE 74

Query: 98  SLFALNL-DPKTALAFFNWIEQKHGFKHNVQSYVSILNILV-PNGYHRIAEKLRILMIKS 157
            +    + DPK  L FFN++    GF H+  S+  +++ LV  N +   +  L+ L++++
Sbjct: 75  EILIGTIDDPKLGLRFFNFLGLHRGFDHSTASFCILIHALVKANLFWPASSLLQTLLLRA 134

Query: 158 TNSAE--NALF-VLEMLRSMNRRGDDLRFKLTLKSYNML------LMLLSRFLMIDEMK- 217
              ++  N LF   E  +  +    DL  +  ++S  +L       M++++  ++ E++ 
Sbjct: 135 LKPSDVFNVLFSCYEKCKLSSSSSFDLLIQHYVRSRRVLDGVLVFKMMITKVSLLPEVRT 194

Query: 218 --------------NVYLEMLDDMVS----PNMYTLNTMVNGYCKLGNVVEAELYVSKIV 277
                          + +E+ +DMVS    P++Y    ++   C+L ++  A+  ++ + 
Sbjct: 195 LSALLHGLVKFRHFGLAMELFNDMVSVGIRPDVYIYTGVIRSLCELKDLSRAKEMIAHME 254

Query: 278 QAGLSLDTFTYTSLILGYCRNKNVDGANKIFLSMPSKGCRRNEVSYTNLIHGFCEAGRID 337
             G  ++   Y  LI G C+ + V  A  I   +  K  + + V+Y  L++G C+    +
Sbjct: 255 ATGCDVNIVPYNVLIDGLCKKQKVWEAVGIKKDLAGKDLKPDVVTYCTLVYGLCKVQEFE 314

Query: 338 EALKLLSQMHEDNCWPTVRTYTVIICALCQMGRKSEAFNVFKEMTEKGCEPNVHTYTVLI 397
             L+++ +M      P+    + ++  L + G+  EA N+ K + + G  PN+  Y  LI
Sbjct: 315 IGLEMMDEMLCLRFSPSEAAVSSLVEGLRKRGKIEEALNLVKRVVDFGVSPNLFVYNALI 374

Query: 398 RSLCEDNKFDDAKKLLDGMLEKGLVPSVVTYNAFIDGYCKKGMSTSALEILSLMESNNCS 457
            SLC+  KF +A+ L D M + GL P+ VTY+  ID +C++G   +AL  L  M      
Sbjct: 375 DSLCKGRKFHEAELLFDRMGKIGLRPNDVTYSILIDMFCRRGKLDTALSFLGEMVDTGLK 434

Query: 458 PNTRTYNELILGFCRAKNVHKAMLLLHKMLELKLQPDVVTYNLLIHGQCKEGHLGSAYKL 517
            +   YN LI G C+  ++  A   + +M+  KL+P VVTY  L+ G C +G +  A +L
Sbjct: 435 LSVYPYNSLINGHCKFGDISAAEGFMAEMINKKLEPTVVTYTSLMGGYCSKGKINKALRL 494

Query: 518 LSLMNENGLVPDEWTYSVFIAVLCKRGRVEDARFLFDSLKEKGIKANEVIYSALIDGYCK 577
              M   G+ P  +T++  ++ L + G + DA  LF+ + E  +K N V Y+ +I+GYC+
Sbjct: 495 YHEMTGKGIAPSIYTFTTLLSGLFRAGLIRDAVKLFNEMAEWNVKPNRVTYNVMIEGYCE 554

Query: 578 VGKVSDGHSLLDKMLSDGCVPNSITYNSLIDGHCKEKNFQEALLLVEIMIKRDIKPTADT 637
            G +S     L +M   G VP++ +Y  LI G C      EA + V+ + K + +     
Sbjct: 555 EGDMSKAFEFLKEMTEKGIVPDTYSYRPLIHGLCLTGQASEAKVFVDGLHKGNCELNEIC 614

Query: 638 YTILIKNLLKDGEFDRAHQMFDQMLSAGSHPDVVIYTVFIHAYCSLGRLQDAELF---LH 697
           YT L+    ++G+ + A  +  +M+  G   D+V Y V I    SL + +D +LF   L 
Sbjct: 615 YTGLLHGFCREGKLEEALSVCQEMVQRGVDLDLVCYGVLIDG--SL-KHKDRKLFFGLLK 674

Query: 698 KMNEKGILPDTLLYSLLIDAYGWSGSIDIAFDILKRMHDIGCEPSFYTYSYLIKHLLSAK 757
           +M+++G+ PD ++Y+ +IDA   +G    AF I   M + GC P+  TY+ +I  L  A 
Sbjct: 675 EMHDRGLKPDDVIYTSMIDAKSKTGDFKEAFGIWDLMINEGCVPNEVTYTAVINGLCKAG 734

Query: 758 LIE-----VNSSTELGDLSSGVVSNDFANLWRR--VDYEFALELFEEMVKQGCAPNANTY 817
            +       +    +  + + V    F ++  +  VD + A+EL   ++K G   N  TY
Sbjct: 735 FVNEAEVLCSKMQPVSSVPNQVTYGCFLDILTKGEVDMQKAVELHNAILK-GLLANTATY 794

Query: 818 GKFISGLCKVGCLEVGRRLFDHMKEKGLSPNEDIYNSLLGCSCQLGLYEKAIRWLDIMVE 872
              I G C+ G +E    L   M   G+SP+   Y +++   C+    +KAI   + M E
Sbjct: 795 NMLIRGFCRQGRIEEASELITRMIGDGVSPDCITYTTMINELCRRNDVKKAIELWNSMTE 854

BLAST of Cp4.1LG07g05790 vs. ExPASy Swiss-Prot
Match: Q9SZ52 (Pentatricopeptide repeat-containing protein At4g31850, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PGR3 PE=1 SV=1)

HSP 1 Score: 322.4 bits (825), Expect = 1.7e-86
Identity = 248/942 (26.33%), Positives = 399/942 (42.36%), Query Frame = 0

Query: 102  DPKTALAFFNWIEQKHGFK------HNVQSYVSILNILVPNGYHRIAEKLRILMIKSTNS 161
            D  T L  F  +  K G K        ++ +  +LN    NG       L  L++KS   
Sbjct: 152  DTNTYLTIFKSLSVKGGLKQAPYALRKMREFGFVLNAYSYNG-------LIHLLLKSRFC 211

Query: 162  AENALFVLEMLRSMNRRGDDLRFKLTLKSYNMLLMLLSRFLMIDEMKNVYLEMLDDMVSP 221
             E     +E+ R M   G    F+ +L++Y+ L++ L +   ID +  +  EM    + P
Sbjct: 212  TE----AMEVYRRMILEG----FRPSLQTYSSLMVGLGKRRDIDSVMGLLKEMETLGLKP 271

Query: 222  NMYTLNTMVNGYCKLGNVVEAELYVSKIVQAGLSLDTFTYTSLILGYCRNKNVDGANKIF 281
            N+YT    +    + G + EA   + ++   G   D  TYT LI   C  + +D A ++F
Sbjct: 272  NVYTFTICIRVLGRAGKINEAYEILKRMDDEGCGPDVVTYTVLIDALCTARKLDCAKEVF 331

Query: 282  LSMPSKGCRRNEVSYTNLIHGFCEAGRIDEALKLLSQMHEDNCWPTVRTYTVIICALCQM 341
              M +   + + V+Y  L+  F +   +D   +  S+M +D   P V T+T+++ ALC+ 
Sbjct: 332  EKMKTGRHKPDRVTYITLLDRFSDNRDLDSVKQFWSEMEKDGHVPDVVTFTILVDALCKA 391

Query: 342  GRKSEAFNVFKEMTEKGCEPNVHTYTVLIRSLCEDNKFDDAKKLLDGMLEKGLVPSVVTY 401
            G   EAF+    M ++G  PN+HTY  LI  L   ++ DDA +L   M   G+ P+  TY
Sbjct: 392  GNFGEAFDTLDVMRDQGILPNLHTYNTLICGLLRVHRLDDALELFGNMESLGVKPTAYTY 451

Query: 402  NAFIDGYCKKGMSTSALEILSLMESNNCSPNTRTYNELILGFCRAKNVHKAMLLLHKMLE 461
              FID Y K G S SALE    M++   +PN    N  +    +A    +A  + + + +
Sbjct: 452  IVFIDYYGKSGDSVSALETFEKMKTKGIAPNIVACNASLYSLAKAGRDREAKQIFYGLKD 511

Query: 462  LKLQPDVVTYNLLIHGQCKEGHLGSAYKLLSLMNENGLVPDEWTYSVFIAVLCKRGRVED 521
            + L PD VTYN+++    K G +  A KLLS M ENG  PD    +  I  L K  RV++
Sbjct: 512  IGLVPDSVTYNMMMKCYSKVGEIDEAIKLLSEMMENGCEPDVIVVNSLINTLYKADRVDE 571

Query: 522  ARFLFDSLKEKGIKANEVIYSALIDGYCKVGKVSDGHSLLDKMLSDGCVPNSITYNSLID 581
            A  +F  +KE  +K   V Y+ L+ G  K GK+ +   L + M+  GC PN+IT+N+L D
Sbjct: 572  AWKMFMRMKEMKLKPTVVTYNTLLAGLGKNGKIQEAIELFEGMVQKGCPPNTITFNTLFD 631

Query: 582  GHCKEKNFQEALLLVEIMIKRDIKPTADTYTILIKNLLKDGEFDRA----HQM------- 641
              CK      AL ++  M+     P   TY  +I  L+K+G+   A    HQM       
Sbjct: 632  CLCKNDEVTLALKMLFKMMDMGCVPDVFTYNTIIFGLVKNGQVKEAMCFFHQMKKLVYPD 691

Query: 642  ----------------------------------------------------------FD 701
                                                                      F 
Sbjct: 692  FVTLCTLLPGVVKASLIEDAYKIITNFLYNCADQPANLFWEDLIGSILAEAGIDNAVSFS 751

Query: 702  QMLSA--------------------------------------GSHPDVVIYTVFIHAYC 761
            + L A                                      G  P +  Y + I    
Sbjct: 752  ERLVANGICRDGDSILVPIIRYSCKHNNVSGARTLFEKFTKDLGVQPKLPTYNLLIGGLL 811

Query: 762  SLGRLQDAELFLHKMNEKGILPDTLLYSLLIDAYGWSGSIDIAFDILKRMHDIGCEPSFY 821
                ++ A+    ++   G +PD   Y+ L+DAYG SG ID  F++ K M    CE +  
Sbjct: 812  EADMIEIAQDVFLQVKSTGCIPDVATYNFLLDAYGKSGKIDELFELYKEMSTHECEANTI 871

Query: 822  TYSYLIKHLLSAKLIEVNSSTELGDLSSGVVSNDFANLWRRVD--------YEFALELFE 881
            T++ +I  L+ A  ++         +S    S         +D        YE A +LFE
Sbjct: 872  THNIVISGLVKAGNVDDALDLYYDLMSDRDFSPTACTYGPLIDGLSKSGRLYE-AKQLFE 931

Query: 882  EMVKQGCAPNANTYGKFISGLCKVGCLEVGRRLFDHMKEKGLSPNEDIYNSLLGCSCQLG 922
             M+  GC PN   Y   I+G  K G  +    LF  M ++G+ P+   Y+ L+ C C +G
Sbjct: 932  GMLDYGCRPNCAIYNILINGFGKAGEADAACALFKRMVKEGVRPDLKTYSVLVDCLCMVG 991

BLAST of Cp4.1LG07g05790 vs. NCBI nr
Match: XP_023536697.1 (pentatricopeptide repeat-containing protein At5g65560-like isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1875 bits (4858), Expect = 0.0
Identity = 928/928 (100.00%), Positives = 928/928 (100.00%), Query Frame = 0

Query: 1   MYGVFTAIRCPTMIRNSSAIINSGQLLIVLGFRLRFTFTLAFKFFTSTTASLPQSLPVEH 60
           MYGVFTAIRCPTMIRNSSAIINSGQLLIVLGFRLRFTFTLAFKFFTSTTASLPQSLPVEH
Sbjct: 13  MYGVFTAIRCPTMIRNSSAIINSGQLLIVLGFRLRFTFTLAFKFFTSTTASLPQSLPVEH 72

Query: 61  DVPAQLFSILSRPDWQKHPSLKILIPSIAPSHVSSLFALNLDPKTALAFFNWIEQKHGFK 120
           DVPAQLFSILSRPDWQKHPSLKILIPSIAPSHVSSLFALNLDPKTALAFFNWIEQKHGFK
Sbjct: 73  DVPAQLFSILSRPDWQKHPSLKILIPSIAPSHVSSLFALNLDPKTALAFFNWIEQKHGFK 132

Query: 121 HNVQSYVSILNILVPNGYHRIAEKLRILMIKSTNSAENALFVLEMLRSMNRRGDDLRFKL 180
           HNVQSYVSILNILVPNGYHRIAEKLRILMIKSTNSAENALFVLEMLRSMNRRGDDLRFKL
Sbjct: 133 HNVQSYVSILNILVPNGYHRIAEKLRILMIKSTNSAENALFVLEMLRSMNRRGDDLRFKL 192

Query: 181 TLKSYNMLLMLLSRFLMIDEMKNVYLEMLDDMVSPNMYTLNTMVNGYCKLGNVVEAELYV 240
           TLKSYNMLLMLLSRFLMIDEMKNVYLEMLDDMVSPNMYTLNTMVNGYCKLGNVVEAELYV
Sbjct: 193 TLKSYNMLLMLLSRFLMIDEMKNVYLEMLDDMVSPNMYTLNTMVNGYCKLGNVVEAELYV 252

Query: 241 SKIVQAGLSLDTFTYTSLILGYCRNKNVDGANKIFLSMPSKGCRRNEVSYTNLIHGFCEA 300
           SKIVQAGLSLDTFTYTSLILGYCRNKNVDGANKIFLSMPSKGCRRNEVSYTNLIHGFCEA
Sbjct: 253 SKIVQAGLSLDTFTYTSLILGYCRNKNVDGANKIFLSMPSKGCRRNEVSYTNLIHGFCEA 312

Query: 301 GRIDEALKLLSQMHEDNCWPTVRTYTVIICALCQMGRKSEAFNVFKEMTEKGCEPNVHTY 360
           GRIDEALKLLSQMHEDNCWPTVRTYTVIICALCQMGRKSEAFNVFKEMTEKGCEPNVHTY
Sbjct: 313 GRIDEALKLLSQMHEDNCWPTVRTYTVIICALCQMGRKSEAFNVFKEMTEKGCEPNVHTY 372

Query: 361 TVLIRSLCEDNKFDDAKKLLDGMLEKGLVPSVVTYNAFIDGYCKKGMSTSALEILSLMES 420
           TVLIRSLCEDNKFDDAKKLLDGMLEKGLVPSVVTYNAFIDGYCKKGMSTSALEILSLMES
Sbjct: 373 TVLIRSLCEDNKFDDAKKLLDGMLEKGLVPSVVTYNAFIDGYCKKGMSTSALEILSLMES 432

Query: 421 NNCSPNTRTYNELILGFCRAKNVHKAMLLLHKMLELKLQPDVVTYNLLIHGQCKEGHLGS 480
           NNCSPNTRTYNELILGFCRAKNVHKAMLLLHKMLELKLQPDVVTYNLLIHGQCKEGHLGS
Sbjct: 433 NNCSPNTRTYNELILGFCRAKNVHKAMLLLHKMLELKLQPDVVTYNLLIHGQCKEGHLGS 492

Query: 481 AYKLLSLMNENGLVPDEWTYSVFIAVLCKRGRVEDARFLFDSLKEKGIKANEVIYSALID 540
           AYKLLSLMNENGLVPDEWTYSVFIAVLCKRGRVEDARFLFDSLKEKGIKANEVIYSALID
Sbjct: 493 AYKLLSLMNENGLVPDEWTYSVFIAVLCKRGRVEDARFLFDSLKEKGIKANEVIYSALID 552

Query: 541 GYCKVGKVSDGHSLLDKMLSDGCVPNSITYNSLIDGHCKEKNFQEALLLVEIMIKRDIKP 600
           GYCKVGKVSDGHSLLDKMLSDGCVPNSITYNSLIDGHCKEKNFQEALLLVEIMIKRDIKP
Sbjct: 553 GYCKVGKVSDGHSLLDKMLSDGCVPNSITYNSLIDGHCKEKNFQEALLLVEIMIKRDIKP 612

Query: 601 TADTYTILIKNLLKDGEFDRAHQMFDQMLSAGSHPDVVIYTVFIHAYCSLGRLQDAELFL 660
           TADTYTILIKNLLKDGEFDRAHQMFDQMLSAGSHPDVVIYTVFIHAYCSLGRLQDAELFL
Sbjct: 613 TADTYTILIKNLLKDGEFDRAHQMFDQMLSAGSHPDVVIYTVFIHAYCSLGRLQDAELFL 672

Query: 661 HKMNEKGILPDTLLYSLLIDAYGWSGSIDIAFDILKRMHDIGCEPSFYTYSYLIKHLLSA 720
           HKMNEKGILPDTLLYSLLIDAYGWSGSIDIAFDILKRMHDIGCEPSFYTYSYLIKHLLSA
Sbjct: 673 HKMNEKGILPDTLLYSLLIDAYGWSGSIDIAFDILKRMHDIGCEPSFYTYSYLIKHLLSA 732

Query: 721 KLIEVNSSTELGDLSSGVVSNDFANLWRRVDYEFALELFEEMVKQGCAPNANTYGKFISG 780
           KLIEVNSSTELGDLSSGVVSNDFANLWRRVDYEFALELFEEMVKQGCAPNANTYGKFISG
Sbjct: 733 KLIEVNSSTELGDLSSGVVSNDFANLWRRVDYEFALELFEEMVKQGCAPNANTYGKFISG 792

Query: 781 LCKVGCLEVGRRLFDHMKEKGLSPNEDIYNSLLGCSCQLGLYEKAIRWLDIMVEHGYLPH 840
           LCKVGCLEVGRRLFDHMKEKGLSPNEDIYNSLLGCSCQLGLYEKAIRWLDIMVEHGYLPH
Sbjct: 793 LCKVGCLEVGRRLFDHMKEKGLSPNEDIYNSLLGCSCQLGLYEKAIRWLDIMVEHGYLPH 852

Query: 841 LDSCKLLLCGLFDEGNNEKAKTVFHSLLQCGYNYDEIAWKLLIDGLLQKGLVDKCSELFG 900
           LDSCKLLLCGLFDEGNNEKAKTVFHSLLQCGYNYDEIAWKLLIDGLLQKGLVDKCSELFG
Sbjct: 853 LDSCKLLLCGLFDEGNNEKAKTVFHSLLQCGYNYDEIAWKLLIDGLLQKGLVDKCSELFG 912

Query: 901 VMERQGCQIHPKTYSMLIEGFDDIQDID 928
           VMERQGCQIHPKTYSMLIEGFDDIQDID
Sbjct: 913 VMERQGCQIHPKTYSMLIEGFDDIQDID 940

BLAST of Cp4.1LG07g05790 vs. NCBI nr
Match: XP_022951246.1 (pentatricopeptide repeat-containing protein At5g65560-like [Cucurbita moschata])

HSP 1 Score: 1838 bits (4760), Expect = 0.0
Identity = 908/928 (97.84%), Positives = 920/928 (99.14%), Query Frame = 0

Query: 1   MYGVFTAIRCPTMIRNSSAIINSGQLLIVLGFRLRFTFTLAFKFFTSTTASLPQSLPVEH 60
           MYGVFTAIRCPTMIRNSSAIINSGQLLIVLGFRLRFTFTLAFKFFTS+TASLPQSLPVEH
Sbjct: 13  MYGVFTAIRCPTMIRNSSAIINSGQLLIVLGFRLRFTFTLAFKFFTSSTASLPQSLPVEH 72

Query: 61  DVPAQLFSILSRPDWQKHPSLKILIPSIAPSHVSSLFALNLDPKTALAFFNWIEQKHGFK 120
           DVPAQLFSILSRPDWQKHPSLKILIPSIAPSHVSSLFALNLDPKTALAFFNWIEQKHGFK
Sbjct: 73  DVPAQLFSILSRPDWQKHPSLKILIPSIAPSHVSSLFALNLDPKTALAFFNWIEQKHGFK 132

Query: 121 HNVQSYVSILNILVPNGYHRIAEKLRILMIKSTNSAENALFVLEMLRSMNRRGDDLRFKL 180
           HNVQSYVS+LNILVPNGY RIAEKLRILMIKSTNSAENALFVLEMLRSMNRRGDDLRFKL
Sbjct: 133 HNVQSYVSMLNILVPNGYLRIAEKLRILMIKSTNSAENALFVLEMLRSMNRRGDDLRFKL 192

Query: 181 TLKSYNMLLMLLSRFLMIDEMKNVYLEMLDDMVSPNMYTLNTMVNGYCKLGNVVEAELYV 240
           TLKSYNMLLMLLSRFLMIDEMKNVYLEMLDDMVSPNMYTLNT+VNGYCKLGNVVEAELYV
Sbjct: 193 TLKSYNMLLMLLSRFLMIDEMKNVYLEMLDDMVSPNMYTLNTLVNGYCKLGNVVEAELYV 252

Query: 241 SKIVQAGLSLDTFTYTSLILGYCRNKNVDGANKIFLSMPSKGCRRNEVSYTNLIHGFCEA 300
           SKIVQAGLSLDTFTYTSLILGYCRNKNVDGANKIFLSMPSKGCRRNEVSYTNLIHGFCEA
Sbjct: 253 SKIVQAGLSLDTFTYTSLILGYCRNKNVDGANKIFLSMPSKGCRRNEVSYTNLIHGFCEA 312

Query: 301 GRIDEALKLLSQMHEDNCWPTVRTYTVIICALCQMGRKSEAFNVFKEMTEKGCEPNVHTY 360
            RIDEALKLLSQMHEDNCWPTVRTYTVIICALCQMGRKSEAF+VFKEMTEKGCEPNVHTY
Sbjct: 313 RRIDEALKLLSQMHEDNCWPTVRTYTVIICALCQMGRKSEAFDVFKEMTEKGCEPNVHTY 372

Query: 361 TVLIRSLCEDNKFDDAKKLLDGMLEKGLVPSVVTYNAFIDGYCKKGMSTSALEILSLMES 420
           TVLIRSLCED+KFDDAKKLLDGMLEKGLVPSVVTYNAFIDGYCKKGMSTSALEILSLMES
Sbjct: 373 TVLIRSLCEDSKFDDAKKLLDGMLEKGLVPSVVTYNAFIDGYCKKGMSTSALEILSLMES 432

Query: 421 NNCSPNTRTYNELILGFCRAKNVHKAMLLLHKMLELKLQPDVVTYNLLIHGQCKEGHLGS 480
           NNC+PNTRTYNELILGFCRAKNVHKAMLLLHKMLELKLQPDVVTYNLLIHGQCKEG LGS
Sbjct: 433 NNCNPNTRTYNELILGFCRAKNVHKAMLLLHKMLELKLQPDVVTYNLLIHGQCKEGQLGS 492

Query: 481 AYKLLSLMNENGLVPDEWTYSVFIAVLCKRGRVEDARFLFDSLKEKGIKANEVIYSALID 540
           AYKLLSLMNENGLVPDEWTYSVFI VLCKRGRVEDARFLFDSLKEKG+KANEVIYSALID
Sbjct: 493 AYKLLSLMNENGLVPDEWTYSVFIVVLCKRGRVEDARFLFDSLKEKGVKANEVIYSALID 552

Query: 541 GYCKVGKVSDGHSLLDKMLSDGCVPNSITYNSLIDGHCKEKNFQEALLLVEIMIKRDIKP 600
           GYCKVGKVSDGHSLLDKMLSDGCVPNSITYNSLIDGHCKEKNFQEALLLVEIMIKRDIK 
Sbjct: 553 GYCKVGKVSDGHSLLDKMLSDGCVPNSITYNSLIDGHCKEKNFQEALLLVEIMIKRDIKL 612

Query: 601 TADTYTILIKNLLKDGEFDRAHQMFDQMLSAGSHPDVVIYTVFIHAYCSLGRLQDAELFL 660
           TADTYTILIKNLLKDGEFDRAHQMFDQMLSAGSHPDVVIYTVFIHAYCSLGRL+DAELFL
Sbjct: 613 TADTYTILIKNLLKDGEFDRAHQMFDQMLSAGSHPDVVIYTVFIHAYCSLGRLRDAELFL 672

Query: 661 HKMNEKGILPDTLLYSLLIDAYGWSGSIDIAFDILKRMHDIGCEPSFYTYSYLIKHLLSA 720
           HKMN+KGILPDTLLYSLLIDAYGWSGSI IAFDILKRMHD+GCEPSFYTYSYLIKHLLSA
Sbjct: 673 HKMNDKGILPDTLLYSLLIDAYGWSGSIGIAFDILKRMHDVGCEPSFYTYSYLIKHLLSA 732

Query: 721 KLIEVNSSTELGDLSSGVVSNDFANLWRRVDYEFALELFEEMVKQGCAPNANTYGKFISG 780
           KLIEVNSSTELGDLSSGVVSNDFANLWRRVD+EFALELFEEMVKQGCAPNANTY KFISG
Sbjct: 733 KLIEVNSSTELGDLSSGVVSNDFANLWRRVDFEFALELFEEMVKQGCAPNANTYSKFISG 792

Query: 781 LCKVGCLEVGRRLFDHMKEKGLSPNEDIYNSLLGCSCQLGLYEKAIRWLDIMVEHGYLPH 840
           LCKVGCLEVGRRLFDHMKEKGLSPNEDIYNSLLGCSCQLGLYEKAIRWLDIMVEHGYLPH
Sbjct: 793 LCKVGCLEVGRRLFDHMKEKGLSPNEDIYNSLLGCSCQLGLYEKAIRWLDIMVEHGYLPH 852

Query: 841 LDSCKLLLCGLFDEGNNEKAKTVFHSLLQCGYNYDEIAWKLLIDGLLQKGLVDKCSELFG 900
           LDSCKLLLCGLFDEGNNEKAKTVFHSLLQCGYNYDEIAWKLLIDGLLQKGLVDKCSELFG
Sbjct: 853 LDSCKLLLCGLFDEGNNEKAKTVFHSLLQCGYNYDEIAWKLLIDGLLQKGLVDKCSELFG 912

Query: 901 VMERQGCQIHPKTYSMLIEGFDDIQDID 928
           +MERQGCQIHPKTYSMLIEGFD IQDID
Sbjct: 913 IMERQGCQIHPKTYSMLIEGFDGIQDID 940

BLAST of Cp4.1LG07g05790 vs. NCBI nr
Match: KAG6585789.1 (Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1837 bits (4757), Expect = 0.0
Identity = 908/928 (97.84%), Positives = 919/928 (99.03%), Query Frame = 0

Query: 1   MYGVFTAIRCPTMIRNSSAIINSGQLLIVLGFRLRFTFTLAFKFFTSTTASLPQSLPVEH 60
           MYGVFTAIRCPTMIRNSSAIINSGQLLIVLGFRLRFTFTLAFKFFTS+TASLPQSLPVEH
Sbjct: 1   MYGVFTAIRCPTMIRNSSAIINSGQLLIVLGFRLRFTFTLAFKFFTSSTASLPQSLPVEH 60

Query: 61  DVPAQLFSILSRPDWQKHPSLKILIPSIAPSHVSSLFALNLDPKTALAFFNWIEQKHGFK 120
           DVPAQLFSILSRPDWQKHPSLKILIPSIAPSHVSSLFALNLDPKTALAFFNWIEQKHGFK
Sbjct: 61  DVPAQLFSILSRPDWQKHPSLKILIPSIAPSHVSSLFALNLDPKTALAFFNWIEQKHGFK 120

Query: 121 HNVQSYVSILNILVPNGYHRIAEKLRILMIKSTNSAENALFVLEMLRSMNRRGDDLRFKL 180
           HNVQSYVS+LNILVPNGY RIAEKLRILMIKSTNSAENALFVLEMLRSMNRRGDDLRFKL
Sbjct: 121 HNVQSYVSMLNILVPNGYLRIAEKLRILMIKSTNSAENALFVLEMLRSMNRRGDDLRFKL 180

Query: 181 TLKSYNMLLMLLSRFLMIDEMKNVYLEMLDDMVSPNMYTLNTMVNGYCKLGNVVEAELYV 240
           TLKSYNMLLMLLSRFLMIDEMKNVYLEMLDDMVSPNMYTLNT+VNGYCKLGNVVEAELYV
Sbjct: 181 TLKSYNMLLMLLSRFLMIDEMKNVYLEMLDDMVSPNMYTLNTLVNGYCKLGNVVEAELYV 240

Query: 241 SKIVQAGLSLDTFTYTSLILGYCRNKNVDGANKIFLSMPSKGCRRNEVSYTNLIHGFCEA 300
           SKIVQAGLSLDTFTYTSLILGYCRNKNVDGANKIFLSMPSKGCRRNEVSYTNLIHGFCEA
Sbjct: 241 SKIVQAGLSLDTFTYTSLILGYCRNKNVDGANKIFLSMPSKGCRRNEVSYTNLIHGFCEA 300

Query: 301 GRIDEALKLLSQMHEDNCWPTVRTYTVIICALCQMGRKSEAFNVFKEMTEKGCEPNVHTY 360
            RIDEALKLLSQMHEDNCWPTVRTYTVIICALCQMGRKSEAF+VFKEMTEKGCEPNVHTY
Sbjct: 301 RRIDEALKLLSQMHEDNCWPTVRTYTVIICALCQMGRKSEAFDVFKEMTEKGCEPNVHTY 360

Query: 361 TVLIRSLCEDNKFDDAKKLLDGMLEKGLVPSVVTYNAFIDGYCKKGMSTSALEILSLMES 420
           TVLIRSLCED+KFDDAKKLLDGMLEKGLVPSVVTYNAFIDGYCKKGMSTSALEILSLMES
Sbjct: 361 TVLIRSLCEDSKFDDAKKLLDGMLEKGLVPSVVTYNAFIDGYCKKGMSTSALEILSLMES 420

Query: 421 NNCSPNTRTYNELILGFCRAKNVHKAMLLLHKMLELKLQPDVVTYNLLIHGQCKEGHLGS 480
           NNC+PNTRTYNELILGFCRAKNVHKAMLLLHKMLELKLQPDVVTYNLLIHGQCKEG LGS
Sbjct: 421 NNCNPNTRTYNELILGFCRAKNVHKAMLLLHKMLELKLQPDVVTYNLLIHGQCKEGQLGS 480

Query: 481 AYKLLSLMNENGLVPDEWTYSVFIAVLCKRGRVEDARFLFDSLKEKGIKANEVIYSALID 540
           AYKLLSLMNENGLVPDEWTYSVFI VLCKRGRVEDARFLFDSLKEKG+KANEVIYSALID
Sbjct: 481 AYKLLSLMNENGLVPDEWTYSVFIVVLCKRGRVEDARFLFDSLKEKGVKANEVIYSALID 540

Query: 541 GYCKVGKVSDGHSLLDKMLSDGCVPNSITYNSLIDGHCKEKNFQEALLLVEIMIKRDIKP 600
           GYCKVGKVSDGHSLLDKMLSDGCVPNSITYNSLIDGHCKEKNFQEALLLVEIMIKRDIK 
Sbjct: 541 GYCKVGKVSDGHSLLDKMLSDGCVPNSITYNSLIDGHCKEKNFQEALLLVEIMIKRDIKL 600

Query: 601 TADTYTILIKNLLKDGEFDRAHQMFDQMLSAGSHPDVVIYTVFIHAYCSLGRLQDAELFL 660
           TADTYTILIKNLLKDGEFDRAHQMFDQMLSAGSHPDVVIYTVFIHAYCSLGRL+DAELFL
Sbjct: 601 TADTYTILIKNLLKDGEFDRAHQMFDQMLSAGSHPDVVIYTVFIHAYCSLGRLRDAELFL 660

Query: 661 HKMNEKGILPDTLLYSLLIDAYGWSGSIDIAFDILKRMHDIGCEPSFYTYSYLIKHLLSA 720
           HKMNEKGILPDTLLYSLLIDAYGWSGSI IAFDILKRMHD+GCEPSFYTYSYLIKHLLSA
Sbjct: 661 HKMNEKGILPDTLLYSLLIDAYGWSGSIGIAFDILKRMHDVGCEPSFYTYSYLIKHLLSA 720

Query: 721 KLIEVNSSTELGDLSSGVVSNDFANLWRRVDYEFALELFEEMVKQGCAPNANTYGKFISG 780
           KLIEVNSSTELGDLSSGVVSNDFANLWRRVDYEFALELFEEMVK+GCAPNANTY KFISG
Sbjct: 721 KLIEVNSSTELGDLSSGVVSNDFANLWRRVDYEFALELFEEMVKKGCAPNANTYSKFISG 780

Query: 781 LCKVGCLEVGRRLFDHMKEKGLSPNEDIYNSLLGCSCQLGLYEKAIRWLDIMVEHGYLPH 840
           LCKVGCLEVGRRLFDHMKEKGL PNEDIYNSLLGCSCQLGLYEKAIRWLDIMVEHGYLPH
Sbjct: 781 LCKVGCLEVGRRLFDHMKEKGLLPNEDIYNSLLGCSCQLGLYEKAIRWLDIMVEHGYLPH 840

Query: 841 LDSCKLLLCGLFDEGNNEKAKTVFHSLLQCGYNYDEIAWKLLIDGLLQKGLVDKCSELFG 900
           LDSCKLLLCGLFDEGNNEKAKTVFHSLLQCGYNYDEIAWKLLIDGLLQKGLVDKCSELFG
Sbjct: 841 LDSCKLLLCGLFDEGNNEKAKTVFHSLLQCGYNYDEIAWKLLIDGLLQKGLVDKCSELFG 900

Query: 901 VMERQGCQIHPKTYSMLIEGFDDIQDID 928
           +MERQGCQIHPKTYSMLIEGFD IQDID
Sbjct: 901 IMERQGCQIHPKTYSMLIEGFDGIQDID 928

BLAST of Cp4.1LG07g05790 vs. NCBI nr
Match: XP_023002847.1 (pentatricopeptide repeat-containing protein At5g65560-like [Cucurbita maxima])

HSP 1 Score: 1821 bits (4718), Expect = 0.0
Identity = 903/928 (97.31%), Positives = 912/928 (98.28%), Query Frame = 0

Query: 1   MYGVFTAIRCPTMIRNSSAIINSGQLLIVLGFRLRFTFTLAFKFFTSTTASLPQSLPVEH 60
           ++GVFTAIRCPTMIRNSSAIINSGQLLIVLGFRLRFTFTLA KFFTSTTASLPQSLPVEH
Sbjct: 13  VHGVFTAIRCPTMIRNSSAIINSGQLLIVLGFRLRFTFTLALKFFTSTTASLPQSLPVEH 72

Query: 61  DVPAQLFSILSRPDWQKHPSLKILIPSIAPSHVSSLFALNLDPKTALAFFNWIEQKHGFK 120
           DVPAQLFSILSR DWQKHPSLKILIPSIAPSHVSSLFALNLDPKTALAFFNWIEQKHGFK
Sbjct: 73  DVPAQLFSILSRLDWQKHPSLKILIPSIAPSHVSSLFALNLDPKTALAFFNWIEQKHGFK 132

Query: 121 HNVQSYVSILNILVPNGYHRIAEKLRILMIKSTNSAENALFVLEMLRSMNRRGDDLRFKL 180
           HNVQSYVSILNILVPNGY RIAEKLRI MIKSTNSAENALFVLEMLRSMNRRGDDLRFKL
Sbjct: 133 HNVQSYVSILNILVPNGYLRIAEKLRISMIKSTNSAENALFVLEMLRSMNRRGDDLRFKL 192

Query: 181 TLKSYNMLLMLLSRFLMIDEMKNVYLEMLDDMVSPNMYTLNTMVNGYCKLGNVVEAELYV 240
           TLKSYNMLLMLLSRFLMIDEMKNVYLEMLDDMVSPNMYTLNTMVNGYCKLGNVVEAELYV
Sbjct: 193 TLKSYNMLLMLLSRFLMIDEMKNVYLEMLDDMVSPNMYTLNTMVNGYCKLGNVVEAELYV 252

Query: 241 SKIVQAGLSLDTFTYTSLILGYCRNKNVDGANKIFLSMPSKGCRRNEVSYTNLIHGFCEA 300
           SKIVQ GL LDTFTYTSLILGYCRNKNVDGANKIFLSMPSKGCRRNEVSYTNLIHGFCEA
Sbjct: 253 SKIVQTGLCLDTFTYTSLILGYCRNKNVDGANKIFLSMPSKGCRRNEVSYTNLIHGFCEA 312

Query: 301 GRIDEALKLLSQMHEDNCWPTVRTYTVIICALCQMGRKSEAFNVFKEMTEKGCEPNVHTY 360
            RIDEALKLLSQMHEDNCWPTVRTYTVIICALCQMGRKSEAF+VFKEMTEKGCEPNVHTY
Sbjct: 313 RRIDEALKLLSQMHEDNCWPTVRTYTVIICALCQMGRKSEAFDVFKEMTEKGCEPNVHTY 372

Query: 361 TVLIRSLCEDNKFDDAKKLLDGMLEKGLVPSVVTYNAFIDGYCKKGMSTSALEILSLMES 420
           TVLI SLCEDNKFDDAKKLLDGMLEKGLVPSVVTYNAFIDGYCKKGMSTSALEILSLME 
Sbjct: 373 TVLIHSLCEDNKFDDAKKLLDGMLEKGLVPSVVTYNAFIDGYCKKGMSTSALEILSLMEL 432

Query: 421 NNCSPNTRTYNELILGFCRAKNVHKAMLLLHKMLELKLQPDVVTYNLLIHGQCKEGHLGS 480
           NNCSPNTRTYNELI+GFCRAKNVHKAMLLLHKMLELKLQPDVVTYNLLIHGQCKEGHLGS
Sbjct: 433 NNCSPNTRTYNELIMGFCRAKNVHKAMLLLHKMLELKLQPDVVTYNLLIHGQCKEGHLGS 492

Query: 481 AYKLLSLMNENGLVPDEWTYSVFIAVLCKRGRVEDARFLFDSLKEKGIKANEVIYSALID 540
           AYKLLSLMNENGLVPDEWTYSVFI VLCKRGRVE+ARFLFDSLKEKGIKANEVIYSALID
Sbjct: 493 AYKLLSLMNENGLVPDEWTYSVFIVVLCKRGRVEEARFLFDSLKEKGIKANEVIYSALID 552

Query: 541 GYCKVGKVSDGHSLLDKMLSDGCVPNSITYNSLIDGHCKEKNFQEALLLVEIMIKRDIKP 600
           GYCKV KVSDGHSLLDKMLSDGCVPNSITYNSLIDGHCKEKNFQEALLLVEIMIKRDIKP
Sbjct: 553 GYCKVEKVSDGHSLLDKMLSDGCVPNSITYNSLIDGHCKEKNFQEALLLVEIMIKRDIKP 612

Query: 601 TADTYTILIKNLLKDGEFDRAHQMFDQMLSAGSHPDVVIYTVFIHAYCSLGRLQDAELFL 660
           TADTYTILIKNLLKDGEFDRAHQMFDQMLSAGSHPDVVIYTVFIHAYCSLGRLQDAELFL
Sbjct: 613 TADTYTILIKNLLKDGEFDRAHQMFDQMLSAGSHPDVVIYTVFIHAYCSLGRLQDAELFL 672

Query: 661 HKMNEKGILPDTLLYSLLIDAYGWSGSIDIAFDILKRMHDIGCEPSFYTYSYLIKHLLSA 720
           HKMNEKGILPD LLYSLLIDAYGWSGSI+IAFDILKRMHD+GCEPSFYTYSYLIKHLLSA
Sbjct: 673 HKMNEKGILPDALLYSLLIDAYGWSGSIEIAFDILKRMHDVGCEPSFYTYSYLIKHLLSA 732

Query: 721 KLIEVNSSTELGDLSSGVVSNDFANLWRRVDYEFALELFEEMVKQGCAPNANTYGKFISG 780
           KLIEVNSSTELGDLSSGVVSNDFANLWRRVDYEFALELFE MVKQGCAPNANTYGKFISG
Sbjct: 733 KLIEVNSSTELGDLSSGVVSNDFANLWRRVDYEFALELFEGMVKQGCAPNANTYGKFISG 792

Query: 781 LCKVGCLEVGRRLFDHMKEKGLSPNEDIYNSLLGCSCQLGLYEKAIRWLDIMVEHGYLPH 840
           LCKVGCLEVGRRLFDHMKEKGLSPNEDIYNSLL CSCQLGLYEKAIRWLD MVEHGYLPH
Sbjct: 793 LCKVGCLEVGRRLFDHMKEKGLSPNEDIYNSLLCCSCQLGLYEKAIRWLDGMVEHGYLPH 852

Query: 841 LDSCKLLLCGLFDEGNNEKAKTVFHSLLQCGYNYDEIAWKLLIDGLLQKGLVDKCSELFG 900
           LDSCKLLLCGLFDEG+NEKAKTVFHSLLQCGYNYDEIAWKLLIDGLLQKGLVDKCSELFG
Sbjct: 853 LDSCKLLLCGLFDEGSNEKAKTVFHSLLQCGYNYDEIAWKLLIDGLLQKGLVDKCSELFG 912

Query: 901 VMERQGCQIHPKTYSMLIEGFDDIQDID 928
           +MERQGCQIHPKTYSMLIEGFD IQDID
Sbjct: 913 IMERQGCQIHPKTYSMLIEGFDGIQDID 940

BLAST of Cp4.1LG07g05790 vs. NCBI nr
Match: XP_022153102.1 (pentatricopeptide repeat-containing protein At5g65560 isoform X1 [Momordica charantia])

HSP 1 Score: 1636 bits (4237), Expect = 0.0
Identity = 796/928 (85.78%), Positives = 863/928 (93.00%), Query Frame = 0

Query: 1   MYGVFTAIRCPTMIRNSSAIINSGQLLIVLGFRLRFTFTLAFKFFTSTTASLPQSLPVEH 60
           M+GV TA+RC TMIR  +AIINSGQL IVLGFRLR TFTL  KFFTST ASLPQSLPVEH
Sbjct: 13  MHGVLTAVRCRTMIRYPTAIINSGQLFIVLGFRLRLTFTLNLKFFTST-ASLPQSLPVEH 72

Query: 61  DVPAQLFSILSRPDWQKHPSLKILIPSIAPSHVSSLFALNLDPKTALAFFNWIEQKHGFK 120
           D+ AQLFSILSRP+WQKHPSLK LIPSIAPSH+S+LFALNLDP+TALAFFNWI QKHGFK
Sbjct: 73  DISAQLFSILSRPNWQKHPSLKNLIPSIAPSHISALFALNLDPQTALAFFNWIGQKHGFK 132

Query: 121 HNVQSYVSILNILVPNGYHRIAEKLRILMIKSTNSAENALFVLEMLRSMNRRGDDLRFKL 180
           HNVQSY S+LNILVPNGY RIAEK+RILMIKST+S+ENALFVLEMLRSMNRRGDD +FKL
Sbjct: 133 HNVQSYTSMLNILVPNGYLRIAEKMRILMIKSTDSSENALFVLEMLRSMNRRGDDFKFKL 192

Query: 181 TLKSYNMLLMLLSRFLMIDEMKNVYLEMLDDMVSPNMYTLNTMVNGYCKLGNVVEAELYV 240
           TL+ YNMLLMLLSRFL++DEM++VYLEMLDDMV+PN+YTLNTMVNGYCKLGNVVEAELYV
Sbjct: 193 TLRCYNMLLMLLSRFLLVDEMRSVYLEMLDDMVTPNIYTLNTMVNGYCKLGNVVEAELYV 252

Query: 241 SKIVQAGLSLDTFTYTSLILGYCRNKNVDGANKIFLSMPSKGCRRNEVSYTNLIHGFCEA 300
           SKIVQAGLSLDTFTYTSLILGYCRNKNVDGA +IFLSMP+KGCRRNEVSYTNLIHGFC+A
Sbjct: 253 SKIVQAGLSLDTFTYTSLILGYCRNKNVDGAYRIFLSMPNKGCRRNEVSYTNLIHGFCDA 312

Query: 301 GRIDEALKLLSQMHEDNCWPTVRTYTVIICALCQMGRKSEAFNVFKEMTEKGCEPNVHTY 360
            R DEALKL SQMHEDNCWPTVRTYTVIICALCQ+GRKSEAFN FKEMTEKGCEPNVHTY
Sbjct: 313 KRTDEALKLFSQMHEDNCWPTVRTYTVIICALCQLGRKSEAFNTFKEMTEKGCEPNVHTY 372

Query: 361 TVLIRSLCEDNKFDDAKKLLDGMLEKGLVPSVVTYNAFIDGYCKKGMSTSALEILSLMES 420
           TVLI SLCEDN FDDAK +L+GML+KGLVPSVVTYNA IDGYCKKGMS SALEILSLMES
Sbjct: 373 TVLIHSLCEDNNFDDAKNMLNGMLQKGLVPSVVTYNALIDGYCKKGMSLSALEILSLMES 432

Query: 421 NNCSPNTRTYNELILGFCRAKNVHKAMLLLHKMLELKLQPDVVTYNLLIHGQCKEGHLGS 480
           NNCSPN RTYNELILGFC+AKNVHKAM LLHKMLE KLQPDVVTYNLLIHGQCK+GHLGS
Sbjct: 433 NNCSPNARTYNELILGFCKAKNVHKAMSLLHKMLERKLQPDVVTYNLLIHGQCKDGHLGS 492

Query: 481 AYKLLSLMNENGLVPDEWTYSVFIAVLCKRGRVEDARFLFDSLKEKGIKANEVIYSALID 540
           AYKLL LMNE+GLVPDEWTYSVF+  LCKRG+VE+ARFLFDSLKEKGI+ANEVIYSALID
Sbjct: 493 AYKLLGLMNESGLVPDEWTYSVFVDTLCKRGQVEEARFLFDSLKEKGIRANEVIYSALID 552

Query: 541 GYCKVGKVSDGHSLLDKMLSDGCVPNSITYNSLIDGHCKEKNFQEALLLVEIMIKRDIKP 600
           GYCKVGKV+DGHSL DKM  DGCVPNSITYNSLIDG+C+EKNFQEALLL+EIMIKRDIKP
Sbjct: 553 GYCKVGKVTDGHSLFDKMHGDGCVPNSITYNSLIDGYCREKNFQEALLLLEIMIKRDIKP 612

Query: 601 TADTYTILIKNLLKDGEFDRAHQMFDQMLSAGSHPDVVIYTVFIHAYCSLGRLQDAELFL 660
           TADTYTILI++LLKDGEFDRAH MFDQMLS GS PDV  YT FIHAYCS GRL+DAELF+
Sbjct: 613 TADTYTILIESLLKDGEFDRAHNMFDQMLSTGSRPDVFTYTAFIHAYCSQGRLKDAELFI 672

Query: 661 HKMNEKGILPDTLLYSLLIDAYGWSGSIDIAFDILKRMHDIGCEPSFYTYSYLIKHLLSA 720
           +KMNEKGI+PDTLLY+LLIDAYG  GSI  AFDILKRM+D+GCEPSF+TYSYLIKHL ++
Sbjct: 673 YKMNEKGIMPDTLLYTLLIDAYGQFGSIGRAFDILKRMYDVGCEPSFHTYSYLIKHLSNS 732

Query: 721 KLIEVNSSTELGDLSSGVVSNDFANLWRRVDYEFALELFEEMVKQGCAPNANTYGKFISG 780
           K I+V+SS EL DLSSGV SNDFA+LWR+VDYEFAL+LFE+MVK GC PNANTY KFI+G
Sbjct: 733 KSIKVDSSLELNDLSSGVTSNDFASLWRKVDYEFALDLFEKMVKHGCEPNANTYSKFITG 792

Query: 781 LCKVGCLEVGRRLFDHMKEKGLSPNEDIYNSLLGCSCQLGLYEKAIRWLDIMVEHGYLPH 840
           LCKVGCLEV  RL+DHMK KGLSPNED YNSLLGCSCQLG Y KAI+WLDIM+EHG LPH
Sbjct: 793 LCKVGCLEVAHRLYDHMKAKGLSPNEDSYNSLLGCSCQLGSYGKAIKWLDIMIEHGLLPH 852

Query: 841 LDSCKLLLCGLFDEGNNEKAKTVFHSLLQCGYNYDEIAWKLLIDGLLQKGLVDKCSELFG 900
           LDSCKLL+CGL+DEGNNEKAKTV +SLLQCGYN DE+AWK+LIDGLL+KGLVDKCSELFG
Sbjct: 853 LDSCKLLVCGLYDEGNNEKAKTVLYSLLQCGYNNDELAWKVLIDGLLKKGLVDKCSELFG 912

Query: 901 VMERQGCQIHPKTYSMLIEGFDDIQDID 928
           +MERQGCQIHPKTYSMLIEGFD I DID
Sbjct: 913 IMERQGCQIHPKTYSMLIEGFDGIHDID 939

BLAST of Cp4.1LG07g05790 vs. ExPASy TrEMBL
Match: A0A6J1GH11 (pentatricopeptide repeat-containing protein At5g65560-like OS=Cucurbita moschata OX=3662 GN=LOC111454140 PE=4 SV=1)

HSP 1 Score: 1838 bits (4760), Expect = 0.0
Identity = 908/928 (97.84%), Positives = 920/928 (99.14%), Query Frame = 0

Query: 1   MYGVFTAIRCPTMIRNSSAIINSGQLLIVLGFRLRFTFTLAFKFFTSTTASLPQSLPVEH 60
           MYGVFTAIRCPTMIRNSSAIINSGQLLIVLGFRLRFTFTLAFKFFTS+TASLPQSLPVEH
Sbjct: 13  MYGVFTAIRCPTMIRNSSAIINSGQLLIVLGFRLRFTFTLAFKFFTSSTASLPQSLPVEH 72

Query: 61  DVPAQLFSILSRPDWQKHPSLKILIPSIAPSHVSSLFALNLDPKTALAFFNWIEQKHGFK 120
           DVPAQLFSILSRPDWQKHPSLKILIPSIAPSHVSSLFALNLDPKTALAFFNWIEQKHGFK
Sbjct: 73  DVPAQLFSILSRPDWQKHPSLKILIPSIAPSHVSSLFALNLDPKTALAFFNWIEQKHGFK 132

Query: 121 HNVQSYVSILNILVPNGYHRIAEKLRILMIKSTNSAENALFVLEMLRSMNRRGDDLRFKL 180
           HNVQSYVS+LNILVPNGY RIAEKLRILMIKSTNSAENALFVLEMLRSMNRRGDDLRFKL
Sbjct: 133 HNVQSYVSMLNILVPNGYLRIAEKLRILMIKSTNSAENALFVLEMLRSMNRRGDDLRFKL 192

Query: 181 TLKSYNMLLMLLSRFLMIDEMKNVYLEMLDDMVSPNMYTLNTMVNGYCKLGNVVEAELYV 240
           TLKSYNMLLMLLSRFLMIDEMKNVYLEMLDDMVSPNMYTLNT+VNGYCKLGNVVEAELYV
Sbjct: 193 TLKSYNMLLMLLSRFLMIDEMKNVYLEMLDDMVSPNMYTLNTLVNGYCKLGNVVEAELYV 252

Query: 241 SKIVQAGLSLDTFTYTSLILGYCRNKNVDGANKIFLSMPSKGCRRNEVSYTNLIHGFCEA 300
           SKIVQAGLSLDTFTYTSLILGYCRNKNVDGANKIFLSMPSKGCRRNEVSYTNLIHGFCEA
Sbjct: 253 SKIVQAGLSLDTFTYTSLILGYCRNKNVDGANKIFLSMPSKGCRRNEVSYTNLIHGFCEA 312

Query: 301 GRIDEALKLLSQMHEDNCWPTVRTYTVIICALCQMGRKSEAFNVFKEMTEKGCEPNVHTY 360
            RIDEALKLLSQMHEDNCWPTVRTYTVIICALCQMGRKSEAF+VFKEMTEKGCEPNVHTY
Sbjct: 313 RRIDEALKLLSQMHEDNCWPTVRTYTVIICALCQMGRKSEAFDVFKEMTEKGCEPNVHTY 372

Query: 361 TVLIRSLCEDNKFDDAKKLLDGMLEKGLVPSVVTYNAFIDGYCKKGMSTSALEILSLMES 420
           TVLIRSLCED+KFDDAKKLLDGMLEKGLVPSVVTYNAFIDGYCKKGMSTSALEILSLMES
Sbjct: 373 TVLIRSLCEDSKFDDAKKLLDGMLEKGLVPSVVTYNAFIDGYCKKGMSTSALEILSLMES 432

Query: 421 NNCSPNTRTYNELILGFCRAKNVHKAMLLLHKMLELKLQPDVVTYNLLIHGQCKEGHLGS 480
           NNC+PNTRTYNELILGFCRAKNVHKAMLLLHKMLELKLQPDVVTYNLLIHGQCKEG LGS
Sbjct: 433 NNCNPNTRTYNELILGFCRAKNVHKAMLLLHKMLELKLQPDVVTYNLLIHGQCKEGQLGS 492

Query: 481 AYKLLSLMNENGLVPDEWTYSVFIAVLCKRGRVEDARFLFDSLKEKGIKANEVIYSALID 540
           AYKLLSLMNENGLVPDEWTYSVFI VLCKRGRVEDARFLFDSLKEKG+KANEVIYSALID
Sbjct: 493 AYKLLSLMNENGLVPDEWTYSVFIVVLCKRGRVEDARFLFDSLKEKGVKANEVIYSALID 552

Query: 541 GYCKVGKVSDGHSLLDKMLSDGCVPNSITYNSLIDGHCKEKNFQEALLLVEIMIKRDIKP 600
           GYCKVGKVSDGHSLLDKMLSDGCVPNSITYNSLIDGHCKEKNFQEALLLVEIMIKRDIK 
Sbjct: 553 GYCKVGKVSDGHSLLDKMLSDGCVPNSITYNSLIDGHCKEKNFQEALLLVEIMIKRDIKL 612

Query: 601 TADTYTILIKNLLKDGEFDRAHQMFDQMLSAGSHPDVVIYTVFIHAYCSLGRLQDAELFL 660
           TADTYTILIKNLLKDGEFDRAHQMFDQMLSAGSHPDVVIYTVFIHAYCSLGRL+DAELFL
Sbjct: 613 TADTYTILIKNLLKDGEFDRAHQMFDQMLSAGSHPDVVIYTVFIHAYCSLGRLRDAELFL 672

Query: 661 HKMNEKGILPDTLLYSLLIDAYGWSGSIDIAFDILKRMHDIGCEPSFYTYSYLIKHLLSA 720
           HKMN+KGILPDTLLYSLLIDAYGWSGSI IAFDILKRMHD+GCEPSFYTYSYLIKHLLSA
Sbjct: 673 HKMNDKGILPDTLLYSLLIDAYGWSGSIGIAFDILKRMHDVGCEPSFYTYSYLIKHLLSA 732

Query: 721 KLIEVNSSTELGDLSSGVVSNDFANLWRRVDYEFALELFEEMVKQGCAPNANTYGKFISG 780
           KLIEVNSSTELGDLSSGVVSNDFANLWRRVD+EFALELFEEMVKQGCAPNANTY KFISG
Sbjct: 733 KLIEVNSSTELGDLSSGVVSNDFANLWRRVDFEFALELFEEMVKQGCAPNANTYSKFISG 792

Query: 781 LCKVGCLEVGRRLFDHMKEKGLSPNEDIYNSLLGCSCQLGLYEKAIRWLDIMVEHGYLPH 840
           LCKVGCLEVGRRLFDHMKEKGLSPNEDIYNSLLGCSCQLGLYEKAIRWLDIMVEHGYLPH
Sbjct: 793 LCKVGCLEVGRRLFDHMKEKGLSPNEDIYNSLLGCSCQLGLYEKAIRWLDIMVEHGYLPH 852

Query: 841 LDSCKLLLCGLFDEGNNEKAKTVFHSLLQCGYNYDEIAWKLLIDGLLQKGLVDKCSELFG 900
           LDSCKLLLCGLFDEGNNEKAKTVFHSLLQCGYNYDEIAWKLLIDGLLQKGLVDKCSELFG
Sbjct: 853 LDSCKLLLCGLFDEGNNEKAKTVFHSLLQCGYNYDEIAWKLLIDGLLQKGLVDKCSELFG 912

Query: 901 VMERQGCQIHPKTYSMLIEGFDDIQDID 928
           +MERQGCQIHPKTYSMLIEGFD IQDID
Sbjct: 913 IMERQGCQIHPKTYSMLIEGFDGIQDID 940

BLAST of Cp4.1LG07g05790 vs. ExPASy TrEMBL
Match: A0A6J1KKQ2 (pentatricopeptide repeat-containing protein At5g65560-like OS=Cucurbita maxima OX=3661 GN=LOC111496590 PE=3 SV=1)

HSP 1 Score: 1821 bits (4718), Expect = 0.0
Identity = 903/928 (97.31%), Positives = 912/928 (98.28%), Query Frame = 0

Query: 1   MYGVFTAIRCPTMIRNSSAIINSGQLLIVLGFRLRFTFTLAFKFFTSTTASLPQSLPVEH 60
           ++GVFTAIRCPTMIRNSSAIINSGQLLIVLGFRLRFTFTLA KFFTSTTASLPQSLPVEH
Sbjct: 13  VHGVFTAIRCPTMIRNSSAIINSGQLLIVLGFRLRFTFTLALKFFTSTTASLPQSLPVEH 72

Query: 61  DVPAQLFSILSRPDWQKHPSLKILIPSIAPSHVSSLFALNLDPKTALAFFNWIEQKHGFK 120
           DVPAQLFSILSR DWQKHPSLKILIPSIAPSHVSSLFALNLDPKTALAFFNWIEQKHGFK
Sbjct: 73  DVPAQLFSILSRLDWQKHPSLKILIPSIAPSHVSSLFALNLDPKTALAFFNWIEQKHGFK 132

Query: 121 HNVQSYVSILNILVPNGYHRIAEKLRILMIKSTNSAENALFVLEMLRSMNRRGDDLRFKL 180
           HNVQSYVSILNILVPNGY RIAEKLRI MIKSTNSAENALFVLEMLRSMNRRGDDLRFKL
Sbjct: 133 HNVQSYVSILNILVPNGYLRIAEKLRISMIKSTNSAENALFVLEMLRSMNRRGDDLRFKL 192

Query: 181 TLKSYNMLLMLLSRFLMIDEMKNVYLEMLDDMVSPNMYTLNTMVNGYCKLGNVVEAELYV 240
           TLKSYNMLLMLLSRFLMIDEMKNVYLEMLDDMVSPNMYTLNTMVNGYCKLGNVVEAELYV
Sbjct: 193 TLKSYNMLLMLLSRFLMIDEMKNVYLEMLDDMVSPNMYTLNTMVNGYCKLGNVVEAELYV 252

Query: 241 SKIVQAGLSLDTFTYTSLILGYCRNKNVDGANKIFLSMPSKGCRRNEVSYTNLIHGFCEA 300
           SKIVQ GL LDTFTYTSLILGYCRNKNVDGANKIFLSMPSKGCRRNEVSYTNLIHGFCEA
Sbjct: 253 SKIVQTGLCLDTFTYTSLILGYCRNKNVDGANKIFLSMPSKGCRRNEVSYTNLIHGFCEA 312

Query: 301 GRIDEALKLLSQMHEDNCWPTVRTYTVIICALCQMGRKSEAFNVFKEMTEKGCEPNVHTY 360
            RIDEALKLLSQMHEDNCWPTVRTYTVIICALCQMGRKSEAF+VFKEMTEKGCEPNVHTY
Sbjct: 313 RRIDEALKLLSQMHEDNCWPTVRTYTVIICALCQMGRKSEAFDVFKEMTEKGCEPNVHTY 372

Query: 361 TVLIRSLCEDNKFDDAKKLLDGMLEKGLVPSVVTYNAFIDGYCKKGMSTSALEILSLMES 420
           TVLI SLCEDNKFDDAKKLLDGMLEKGLVPSVVTYNAFIDGYCKKGMSTSALEILSLME 
Sbjct: 373 TVLIHSLCEDNKFDDAKKLLDGMLEKGLVPSVVTYNAFIDGYCKKGMSTSALEILSLMEL 432

Query: 421 NNCSPNTRTYNELILGFCRAKNVHKAMLLLHKMLELKLQPDVVTYNLLIHGQCKEGHLGS 480
           NNCSPNTRTYNELI+GFCRAKNVHKAMLLLHKMLELKLQPDVVTYNLLIHGQCKEGHLGS
Sbjct: 433 NNCSPNTRTYNELIMGFCRAKNVHKAMLLLHKMLELKLQPDVVTYNLLIHGQCKEGHLGS 492

Query: 481 AYKLLSLMNENGLVPDEWTYSVFIAVLCKRGRVEDARFLFDSLKEKGIKANEVIYSALID 540
           AYKLLSLMNENGLVPDEWTYSVFI VLCKRGRVE+ARFLFDSLKEKGIKANEVIYSALID
Sbjct: 493 AYKLLSLMNENGLVPDEWTYSVFIVVLCKRGRVEEARFLFDSLKEKGIKANEVIYSALID 552

Query: 541 GYCKVGKVSDGHSLLDKMLSDGCVPNSITYNSLIDGHCKEKNFQEALLLVEIMIKRDIKP 600
           GYCKV KVSDGHSLLDKMLSDGCVPNSITYNSLIDGHCKEKNFQEALLLVEIMIKRDIKP
Sbjct: 553 GYCKVEKVSDGHSLLDKMLSDGCVPNSITYNSLIDGHCKEKNFQEALLLVEIMIKRDIKP 612

Query: 601 TADTYTILIKNLLKDGEFDRAHQMFDQMLSAGSHPDVVIYTVFIHAYCSLGRLQDAELFL 660
           TADTYTILIKNLLKDGEFDRAHQMFDQMLSAGSHPDVVIYTVFIHAYCSLGRLQDAELFL
Sbjct: 613 TADTYTILIKNLLKDGEFDRAHQMFDQMLSAGSHPDVVIYTVFIHAYCSLGRLQDAELFL 672

Query: 661 HKMNEKGILPDTLLYSLLIDAYGWSGSIDIAFDILKRMHDIGCEPSFYTYSYLIKHLLSA 720
           HKMNEKGILPD LLYSLLIDAYGWSGSI+IAFDILKRMHD+GCEPSFYTYSYLIKHLLSA
Sbjct: 673 HKMNEKGILPDALLYSLLIDAYGWSGSIEIAFDILKRMHDVGCEPSFYTYSYLIKHLLSA 732

Query: 721 KLIEVNSSTELGDLSSGVVSNDFANLWRRVDYEFALELFEEMVKQGCAPNANTYGKFISG 780
           KLIEVNSSTELGDLSSGVVSNDFANLWRRVDYEFALELFE MVKQGCAPNANTYGKFISG
Sbjct: 733 KLIEVNSSTELGDLSSGVVSNDFANLWRRVDYEFALELFEGMVKQGCAPNANTYGKFISG 792

Query: 781 LCKVGCLEVGRRLFDHMKEKGLSPNEDIYNSLLGCSCQLGLYEKAIRWLDIMVEHGYLPH 840
           LCKVGCLEVGRRLFDHMKEKGLSPNEDIYNSLL CSCQLGLYEKAIRWLD MVEHGYLPH
Sbjct: 793 LCKVGCLEVGRRLFDHMKEKGLSPNEDIYNSLLCCSCQLGLYEKAIRWLDGMVEHGYLPH 852

Query: 841 LDSCKLLLCGLFDEGNNEKAKTVFHSLLQCGYNYDEIAWKLLIDGLLQKGLVDKCSELFG 900
           LDSCKLLLCGLFDEG+NEKAKTVFHSLLQCGYNYDEIAWKLLIDGLLQKGLVDKCSELFG
Sbjct: 853 LDSCKLLLCGLFDEGSNEKAKTVFHSLLQCGYNYDEIAWKLLIDGLLQKGLVDKCSELFG 912

Query: 901 VMERQGCQIHPKTYSMLIEGFDDIQDID 928
           +MERQGCQIHPKTYSMLIEGFD IQDID
Sbjct: 913 IMERQGCQIHPKTYSMLIEGFDGIQDID 940

BLAST of Cp4.1LG07g05790 vs. ExPASy TrEMBL
Match: A0A6J1DI13 (pentatricopeptide repeat-containing protein At5g65560 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111020685 PE=4 SV=1)

HSP 1 Score: 1636 bits (4237), Expect = 0.0
Identity = 796/928 (85.78%), Positives = 863/928 (93.00%), Query Frame = 0

Query: 1   MYGVFTAIRCPTMIRNSSAIINSGQLLIVLGFRLRFTFTLAFKFFTSTTASLPQSLPVEH 60
           M+GV TA+RC TMIR  +AIINSGQL IVLGFRLR TFTL  KFFTST ASLPQSLPVEH
Sbjct: 13  MHGVLTAVRCRTMIRYPTAIINSGQLFIVLGFRLRLTFTLNLKFFTST-ASLPQSLPVEH 72

Query: 61  DVPAQLFSILSRPDWQKHPSLKILIPSIAPSHVSSLFALNLDPKTALAFFNWIEQKHGFK 120
           D+ AQLFSILSRP+WQKHPSLK LIPSIAPSH+S+LFALNLDP+TALAFFNWI QKHGFK
Sbjct: 73  DISAQLFSILSRPNWQKHPSLKNLIPSIAPSHISALFALNLDPQTALAFFNWIGQKHGFK 132

Query: 121 HNVQSYVSILNILVPNGYHRIAEKLRILMIKSTNSAENALFVLEMLRSMNRRGDDLRFKL 180
           HNVQSY S+LNILVPNGY RIAEK+RILMIKST+S+ENALFVLEMLRSMNRRGDD +FKL
Sbjct: 133 HNVQSYTSMLNILVPNGYLRIAEKMRILMIKSTDSSENALFVLEMLRSMNRRGDDFKFKL 192

Query: 181 TLKSYNMLLMLLSRFLMIDEMKNVYLEMLDDMVSPNMYTLNTMVNGYCKLGNVVEAELYV 240
           TL+ YNMLLMLLSRFL++DEM++VYLEMLDDMV+PN+YTLNTMVNGYCKLGNVVEAELYV
Sbjct: 193 TLRCYNMLLMLLSRFLLVDEMRSVYLEMLDDMVTPNIYTLNTMVNGYCKLGNVVEAELYV 252

Query: 241 SKIVQAGLSLDTFTYTSLILGYCRNKNVDGANKIFLSMPSKGCRRNEVSYTNLIHGFCEA 300
           SKIVQAGLSLDTFTYTSLILGYCRNKNVDGA +IFLSMP+KGCRRNEVSYTNLIHGFC+A
Sbjct: 253 SKIVQAGLSLDTFTYTSLILGYCRNKNVDGAYRIFLSMPNKGCRRNEVSYTNLIHGFCDA 312

Query: 301 GRIDEALKLLSQMHEDNCWPTVRTYTVIICALCQMGRKSEAFNVFKEMTEKGCEPNVHTY 360
            R DEALKL SQMHEDNCWPTVRTYTVIICALCQ+GRKSEAFN FKEMTEKGCEPNVHTY
Sbjct: 313 KRTDEALKLFSQMHEDNCWPTVRTYTVIICALCQLGRKSEAFNTFKEMTEKGCEPNVHTY 372

Query: 361 TVLIRSLCEDNKFDDAKKLLDGMLEKGLVPSVVTYNAFIDGYCKKGMSTSALEILSLMES 420
           TVLI SLCEDN FDDAK +L+GML+KGLVPSVVTYNA IDGYCKKGMS SALEILSLMES
Sbjct: 373 TVLIHSLCEDNNFDDAKNMLNGMLQKGLVPSVVTYNALIDGYCKKGMSLSALEILSLMES 432

Query: 421 NNCSPNTRTYNELILGFCRAKNVHKAMLLLHKMLELKLQPDVVTYNLLIHGQCKEGHLGS 480
           NNCSPN RTYNELILGFC+AKNVHKAM LLHKMLE KLQPDVVTYNLLIHGQCK+GHLGS
Sbjct: 433 NNCSPNARTYNELILGFCKAKNVHKAMSLLHKMLERKLQPDVVTYNLLIHGQCKDGHLGS 492

Query: 481 AYKLLSLMNENGLVPDEWTYSVFIAVLCKRGRVEDARFLFDSLKEKGIKANEVIYSALID 540
           AYKLL LMNE+GLVPDEWTYSVF+  LCKRG+VE+ARFLFDSLKEKGI+ANEVIYSALID
Sbjct: 493 AYKLLGLMNESGLVPDEWTYSVFVDTLCKRGQVEEARFLFDSLKEKGIRANEVIYSALID 552

Query: 541 GYCKVGKVSDGHSLLDKMLSDGCVPNSITYNSLIDGHCKEKNFQEALLLVEIMIKRDIKP 600
           GYCKVGKV+DGHSL DKM  DGCVPNSITYNSLIDG+C+EKNFQEALLL+EIMIKRDIKP
Sbjct: 553 GYCKVGKVTDGHSLFDKMHGDGCVPNSITYNSLIDGYCREKNFQEALLLLEIMIKRDIKP 612

Query: 601 TADTYTILIKNLLKDGEFDRAHQMFDQMLSAGSHPDVVIYTVFIHAYCSLGRLQDAELFL 660
           TADTYTILI++LLKDGEFDRAH MFDQMLS GS PDV  YT FIHAYCS GRL+DAELF+
Sbjct: 613 TADTYTILIESLLKDGEFDRAHNMFDQMLSTGSRPDVFTYTAFIHAYCSQGRLKDAELFI 672

Query: 661 HKMNEKGILPDTLLYSLLIDAYGWSGSIDIAFDILKRMHDIGCEPSFYTYSYLIKHLLSA 720
           +KMNEKGI+PDTLLY+LLIDAYG  GSI  AFDILKRM+D+GCEPSF+TYSYLIKHL ++
Sbjct: 673 YKMNEKGIMPDTLLYTLLIDAYGQFGSIGRAFDILKRMYDVGCEPSFHTYSYLIKHLSNS 732

Query: 721 KLIEVNSSTELGDLSSGVVSNDFANLWRRVDYEFALELFEEMVKQGCAPNANTYGKFISG 780
           K I+V+SS EL DLSSGV SNDFA+LWR+VDYEFAL+LFE+MVK GC PNANTY KFI+G
Sbjct: 733 KSIKVDSSLELNDLSSGVTSNDFASLWRKVDYEFALDLFEKMVKHGCEPNANTYSKFITG 792

Query: 781 LCKVGCLEVGRRLFDHMKEKGLSPNEDIYNSLLGCSCQLGLYEKAIRWLDIMVEHGYLPH 840
           LCKVGCLEV  RL+DHMK KGLSPNED YNSLLGCSCQLG Y KAI+WLDIM+EHG LPH
Sbjct: 793 LCKVGCLEVAHRLYDHMKAKGLSPNEDSYNSLLGCSCQLGSYGKAIKWLDIMIEHGLLPH 852

Query: 841 LDSCKLLLCGLFDEGNNEKAKTVFHSLLQCGYNYDEIAWKLLIDGLLQKGLVDKCSELFG 900
           LDSCKLL+CGL+DEGNNEKAKTV +SLLQCGYN DE+AWK+LIDGLL+KGLVDKCSELFG
Sbjct: 853 LDSCKLLVCGLYDEGNNEKAKTVLYSLLQCGYNNDELAWKVLIDGLLKKGLVDKCSELFG 912

Query: 901 VMERQGCQIHPKTYSMLIEGFDDIQDID 928
           +MERQGCQIHPKTYSMLIEGFD I DID
Sbjct: 913 IMERQGCQIHPKTYSMLIEGFDGIHDID 939

BLAST of Cp4.1LG07g05790 vs. ExPASy TrEMBL
Match: A0A0A0KFF8 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G355970 PE=4 SV=1)

HSP 1 Score: 1574 bits (4076), Expect = 0.0
Identity = 771/928 (83.08%), Positives = 845/928 (91.06%), Query Frame = 0

Query: 1   MYGVFTAIRCPTMIRNSSAIINSGQLLIVLGFRLRFTFTLAFKFFTSTTASLPQSLPVEH 60
           M+GVFT +RCPTMIRNS+AII SGQLL+VLGFRLR TF++  +FFTS  ASLPQS  VEH
Sbjct: 13  MHGVFTPVRCPTMIRNSTAIIKSGQLLVVLGFRLRLTFSITHRFFTSP-ASLPQSFSVEH 72

Query: 61  DVPAQLFSILSRPDWQKHPSLKILIPSIAPSHVSSLFALNLDPKTALAFFNWIEQKHGFK 120
           D+PAQLFSILSRP+WQKHPSLK LIPSIAPSH+S+LFALNLDP+TALAFFNWI QKHGFK
Sbjct: 73  DIPAQLFSILSRPNWQKHPSLKNLIPSIAPSHISALFALNLDPQTALAFFNWIGQKHGFK 132

Query: 121 HNVQSYVSILNILVPNGYHRIAEKLRILMIKSTNSAENALFVLEMLRSMNRRGDDLRFKL 180
           HNVQS+VS+LNILVPNGY RIAE +RILMIKST+S+ENALFVLEMLRSMNRR D  +FKL
Sbjct: 133 HNVQSHVSMLNILVPNGYLRIAENMRILMIKSTDSSENALFVLEMLRSMNRRVDAFKFKL 192

Query: 181 TLKSYNMLLMLLSRFLMIDEMKNVYLEMLDDMVSPNMYTLNTMVNGYCKLGNVVEAELYV 240
           TL+ YNMLLMLLSRFLMIDEMK+VYLEMLDDMV+PN++TLNTMVNGYCKLGNVVEAELYV
Sbjct: 193 TLRCYNMLLMLLSRFLMIDEMKSVYLEMLDDMVTPNIFTLNTMVNGYCKLGNVVEAELYV 252

Query: 241 SKIVQAGLSLDTFTYTSLILGYCRNKNVDGANKIFLSMPSKGCRRNEVSYTNLIHGFCEA 300
           SKIVQAGLSLDTFTYTSLILGYCRNKNVD AN IFLSMP+KGC RNEVSYTNLIHGFCEA
Sbjct: 253 SKIVQAGLSLDTFTYTSLILGYCRNKNVDAANAIFLSMPNKGCLRNEVSYTNLIHGFCEA 312

Query: 301 GRIDEALKLLSQMHEDNCWPTVRTYTVIICALCQMGRKSEAFNVFKEMTEKGCEPNVHTY 360
            R+DEALKL SQMHEDNCWPTVRTYTVII ALCQ+GRK+EA N+FKEMTEK C+PNVHTY
Sbjct: 313 RRVDEALKLFSQMHEDNCWPTVRTYTVIIFALCQLGRKTEALNMFKEMTEKHCQPNVHTY 372

Query: 361 TVLIRSLCEDNKFDDAKKLLDGMLEKGLVPSVVTYNAFIDGYCKKGMSTSALEILSLMES 420
           TVLI SLCED+ FDDAKK+L+GMLEKGL+PSVVTYNA IDGYCKKG+S SALEILSLMES
Sbjct: 373 TVLICSLCEDSNFDDAKKILNGMLEKGLIPSVVTYNALIDGYCKKGLSASALEILSLMES 432

Query: 421 NNCSPNTRTYNELILGFCRAKNVHKAMLLLHKMLELKLQPDVVTYNLLIHGQCKEGHLGS 480
           NNCSPN RTYNELILGFCR KN+HKAM LLHKMLE KLQP+VVTYN+LIHGQCKEG LGS
Sbjct: 433 NNCSPNARTYNELILGFCRGKNIHKAMSLLHKMLERKLQPNVVTYNILIHGQCKEGDLGS 492

Query: 481 AYKLLSLMNENGLVPDEWTYSVFIAVLCKRGRVEDARFLFDSLKEKGIKANEVIYSALID 540
           AYKLLSLMNE+GLVPDEWTYSVFI  LCKRG VE+AR LF+SLKEKGIKANEVIYS LID
Sbjct: 493 AYKLLSLMNESGLVPDEWTYSVFIDTLCKRGLVEEARSLFESLKEKGIKANEVIYSTLID 552

Query: 541 GYCKVGKVSDGHSLLDKMLSDGCVPNSITYNSLIDGHCKEKNFQEALLLVEIMIKRDIKP 600
           GYCKVGKVSDG  LLDKMLS GCVPNSITYNSLIDG+CKEKNF+EA LLV+IMIKRDI+P
Sbjct: 553 GYCKVGKVSDGRFLLDKMLSAGCVPNSITYNSLIDGYCKEKNFKEARLLVDIMIKRDIEP 612

Query: 601 TADTYTILIKNLLKDGEFDRAHQMFDQMLSAGSHPDVVIYTVFIHAYCSLGRLQDAELFL 660
            ADTYTILI NLLKD EFD+AH MFDQMLS GSHPDV IYT FIHAYCS GRL+DAE+ +
Sbjct: 613 AADTYTILIDNLLKDDEFDQAHDMFDQMLSTGSHPDVFIYTAFIHAYCSHGRLKDAEVLI 672

Query: 661 HKMNEKGILPDTLLYSLLIDAYGWSGSIDIAFDILKRMHDIGCEPSFYTYSYLIKHLLSA 720
            KMN KGI+PDT+LY+L IDAYG  GSID AF ILKRMH++GCEPS+YTYS LIKHL +A
Sbjct: 673 CKMNAKGIMPDTMLYTLFIDAYGRFGSIDGAFGILKRMHEVGCEPSYYTYSCLIKHLSNA 732

Query: 721 KLIEVNSSTELGDLSSGVVSNDFANLWRRVDYEFALELFEEMVKQGCAPNANTYGKFISG 780
           K  EV+SS+EL DLSSGV SNDF+N WRRVDYEF L+LF +M + GCAPNANTYGKFI+G
Sbjct: 733 KPKEVSSSSELSDLSSGVASNDFSNCWRRVDYEFTLDLFGKMAEHGCAPNANTYGKFITG 792

Query: 781 LCKVGCLEVGRRLFDHMKEKGLSPNEDIYNSLLGCSCQLGLYEKAIRWLDIMVEHGYLPH 840
           LCKVGCLEV  RLFDHMKEKG SPNEDIYNSLLGCSCQLGLY +AIRWLDIM+E+ +LPH
Sbjct: 793 LCKVGCLEVAHRLFDHMKEKGQSPNEDIYNSLLGCSCQLGLYGEAIRWLDIMIENRHLPH 852

Query: 841 LDSCKLLLCGLFDEGNNEKAKTVFHSLLQCGYNYDEIAWKLLIDGLLQKGLVDKCSELFG 900
           LDSCKLLLCGL+DEGN+EKAK VF S LQC YNYDE+ WK+LIDGLL+KGL DKCS+LFG
Sbjct: 853 LDSCKLLLCGLYDEGNDEKAKRVFCSFLQCEYNYDEMVWKVLIDGLLKKGLSDKCSDLFG 912

Query: 901 VMERQGCQIHPKTYSMLIEGFDDIQDID 928
           +ME QGCQIHPKTYSMLIEGFD IQ+ID
Sbjct: 913 IMETQGCQIHPKTYSMLIEGFDGIQEID 939

BLAST of Cp4.1LG07g05790 vs. ExPASy TrEMBL
Match: A0A5A7T899 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold119G00120 PE=4 SV=1)

HSP 1 Score: 1571 bits (4069), Expect = 0.0
Identity = 768/928 (82.76%), Positives = 845/928 (91.06%), Query Frame = 0

Query: 1   MYGVFTAIRCPTMIRNSSAIINSGQLLIVLGFRLRFTFTLAFKFFTSTTASLPQSLPVEH 60
           M+GVFT +RCPTMIRNS+AI  SGQLL+VLGFRLR TF L  +FFTST AS PQSL VEH
Sbjct: 1   MHGVFTPVRCPTMIRNSTAIFKSGQLLVVLGFRLRLTFPLTHRFFTST-ASFPQSLSVEH 60

Query: 61  DVPAQLFSILSRPDWQKHPSLKILIPSIAPSHVSSLFALNLDPKTALAFFNWIEQKHGFK 120
           D+PAQLF+ILSRP+WQKHPSLK LIPSI+PSH+S+LFALNLDP+TALAFFNWI QKHGFK
Sbjct: 61  DIPAQLFTILSRPNWQKHPSLKNLIPSISPSHISALFALNLDPQTALAFFNWIGQKHGFK 120

Query: 121 HNVQSYVSILNILVPNGYHRIAEKLRILMIKSTNSAENALFVLEMLRSMNRRGDDLRFKL 180
           HNVQSYVS+LNILVPNGY RIAE +RILMIKST+S+ENA+FVLEMLRSMNRR D  +FKL
Sbjct: 121 HNVQSYVSMLNILVPNGYLRIAENMRILMIKSTDSSENAVFVLEMLRSMNRRVDAFKFKL 180

Query: 181 TLKSYNMLLMLLSRFLMIDEMKNVYLEMLDDMVSPNMYTLNTMVNGYCKLGNVVEAELYV 240
           +L+ YNMLLMLLSRFLMIDEMK+VYLEMLDDMV+PN++TLNTMVNGYCKLGNVVEAELYV
Sbjct: 181 SLRCYNMLLMLLSRFLMIDEMKSVYLEMLDDMVTPNIFTLNTMVNGYCKLGNVVEAELYV 240

Query: 241 SKIVQAGLSLDTFTYTSLILGYCRNKNVDGANKIFLSMPSKGCRRNEVSYTNLIHGFCEA 300
           SKIVQAGLSLDTFTYTSLILGYCRNKNVD AN IFLSMP+KGCRRNEVSYTNLIHGFCEA
Sbjct: 241 SKIVQAGLSLDTFTYTSLILGYCRNKNVDAANAIFLSMPNKGCRRNEVSYTNLIHGFCEA 300

Query: 301 GRIDEALKLLSQMHEDNCWPTVRTYTVIICALCQMGRKSEAFNVFKEMTEKGCEPNVHTY 360
            R+ EALKL SQMHEDNCWPTVRTYTV+I ALCQ+GRK+EA N+FKEMTEK C+PNVHTY
Sbjct: 301 RRVGEALKLFSQMHEDNCWPTVRTYTVLIFALCQLGRKTEALNMFKEMTEKRCQPNVHTY 360

Query: 361 TVLIRSLCEDNKFDDAKKLLDGMLEKGLVPSVVTYNAFIDGYCKKGMSTSALEILSLMES 420
           TVLI SLCED  FDDAKK+L+GMLEKGL+PSVVTYNA IDGYCKKG+S SALEILSLMES
Sbjct: 361 TVLICSLCEDGNFDDAKKILNGMLEKGLIPSVVTYNALIDGYCKKGLSASALEILSLMES 420

Query: 421 NNCSPNTRTYNELILGFCRAKNVHKAMLLLHKMLELKLQPDVVTYNLLIHGQCKEGHLGS 480
           NNCSPN RTYNELILGFCRAKN+HKAM LLHKMLE KLQP+VVTYN+LIHGQCKEG LGS
Sbjct: 421 NNCSPNARTYNELILGFCRAKNIHKAMSLLHKMLERKLQPNVVTYNILIHGQCKEGDLGS 480

Query: 481 AYKLLSLMNENGLVPDEWTYSVFIAVLCKRGRVEDARFLFDSLKEKGIKANEVIYSALID 540
           AYKLLSLMNE+GLVPDEWTY VFI  LCKRG VE+AR LF+SLKEKGIKANEV+YS LID
Sbjct: 481 AYKLLSLMNESGLVPDEWTYGVFIDTLCKRGLVEEARSLFESLKEKGIKANEVMYSTLID 540

Query: 541 GYCKVGKVSDGHSLLDKMLSDGCVPNSITYNSLIDGHCKEKNFQEALLLVEIMIKRDIKP 600
           GYCKVGKVSDG  LLDKMLS GCVPNSITYNSLIDG+CKEKNF+EA LLVE+MIKRDI+P
Sbjct: 541 GYCKVGKVSDGRFLLDKMLSAGCVPNSITYNSLIDGYCKEKNFKEARLLVEVMIKRDIQP 600

Query: 601 TADTYTILIKNLLKDGEFDRAHQMFDQMLSAGSHPDVVIYTVFIHAYCSLGRLQDAELFL 660
            ADTYTILI NLLKDGE D AH +FDQMLS GSHPDV IYT FIHAYCS GRL+DAE+ +
Sbjct: 601 AADTYTILIDNLLKDGEIDHAHDVFDQMLSTGSHPDVFIYTAFIHAYCSQGRLKDAEVLI 660

Query: 661 HKMNEKGILPDTLLYSLLIDAYGWSGSIDIAFDILKRMHDIGCEPSFYTYSYLIKHLLSA 720
            KMN KGI+PDT+LY+L IDAYG  GSID AF ILKRMHD+GCEPS++TYSYLIKHL +A
Sbjct: 661 CKMNAKGIMPDTILYTLFIDAYGRFGSIDGAFGILKRMHDVGCEPSYHTYSYLIKHLSNA 720

Query: 721 KLIEVNSSTELGDLSSGVVSNDFANLWRRVDYEFALELFEEMVKQGCAPNANTYGKFISG 780
           K  EV+SS+EL DLSSGV SNDF+N WRRVDYEF LELF +MV+ GCAPNANTYGKFI+G
Sbjct: 721 KPKEVSSSSELSDLSSGVASNDFSNCWRRVDYEFTLELFGKMVEHGCAPNANTYGKFITG 780

Query: 781 LCKVGCLEVGRRLFDHMKEKGLSPNEDIYNSLLGCSCQLGLYEKAIRWLDIMVEHGYLPH 840
           LCKVG LEV  RLFDHMKEKGLSPNEDIYNSLLGCSCQLGLY +AIRWLDI++E+G+LP 
Sbjct: 781 LCKVGYLEVADRLFDHMKEKGLSPNEDIYNSLLGCSCQLGLYGEAIRWLDILIENGHLPR 840

Query: 841 LDSCKLLLCGLFDEGNNEKAKTVFHSLLQCGYNYDEIAWKLLIDGLLQKGLVDKCSELFG 900
           LDSCKLLLCGL+DEGN+EKAK VF SLLQCGYN DE+AWK+LIDGLL+KGL DKCS+LFG
Sbjct: 841 LDSCKLLLCGLYDEGNDEKAKRVFCSLLQCGYNCDEMAWKVLIDGLLKKGLSDKCSDLFG 900

Query: 901 VMERQGCQIHPKTYSMLIEGFDDIQDID 928
           +ME QGC IHPKTYSMLIEGFD +Q+ID
Sbjct: 901 IMETQGCHIHPKTYSMLIEGFDGVQEID 927

BLAST of Cp4.1LG07g05790 vs. TAIR 10
Match: AT5G65560.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 965.3 bits (2494), Expect = 3.5e-281
Identity = 494/931 (53.06%), Positives = 648/931 (69.60%), Query Frame = 0

Query: 13  MIRNSSAIINSGQLLIVLGFRLRFTFTLAFKFFTSTTASLP-------------QSLPVE 72
           MIR      NSG    V  F +     L  KF T  T   P             ++LP E
Sbjct: 1   MIRRIQPRCNSGLTGSVSAFEV-----LKKKFSTDVTVPSPVTRRQFCSVSPLLRNLPEE 60

Query: 73  H----DVPAQLFSILSRPDWQKHPSLKILIPSIAPSHVSSLFALNLDPKTALAFFNWIEQ 132
                 VP +L SILS+P+W K PSLK ++ +I+PSHVSSLF+L+LDPKTAL F +WI Q
Sbjct: 61  ESDSMSVPHRLLSILSKPNWHKSPSLKSMVSAISPSHVSSLFSLDLDPKTALNFSHWISQ 120

Query: 133 KHGFKHNVQSYVSILNILVPNGYHRIAEKLRILMIKSTNSAENALFVLEMLRSMNR-RGD 192
              +KH+V SY S+L +L+ NGY  +  K+R+LMIKS +S  +AL+VL++ R MN+    
Sbjct: 121 NPRYKHSVYSYASLLTLLINNGYVGVVFKIRLLMIKSCDSVGDALYVLDLCRKMNKDERF 180

Query: 193 DLRFKLTLKSYNMLLMLLSRFLMIDEMKNVYLEMLDDMVSPNMYTLNTMVNGYCKLGNVV 252
           +L++KL +  YN LL  L+RF ++DEMK VY+EML+D V PN+YT N MVNGYCKLGNV 
Sbjct: 181 ELKYKLIIGCYNTLLNSLARFGLVDEMKQVYMEMLEDKVCPNIYTYNKMVNGYCKLGNVE 240

Query: 253 EAELYVSKIVQAGLSLDTFTYTSLILGYCRNKNVDGANKIFLSMPSKGCRRNEVSYTNLI 312
           EA  YVSKIV+AGL  D FTYTSLI+GYC+ K++D A K+F  MP KGCRRNEV+YT+LI
Sbjct: 241 EANQYVSKIVEAGLDPDFFTYTSLIMGYCQRKDLDSAFKVFNEMPLKGCRRNEVAYTHLI 300

Query: 313 HGFCEAGRIDEALKLLSQMHEDNCWPTVRTYTVIICALCQMGRKSEAFNVFKEMTEKGCE 372
           HG C A RIDEA+ L  +M +D C+PTVRTYTV+I +LC   RKSEA N+ KEM E G +
Sbjct: 301 HGLCVARRIDEAMDLFVKMKDDECFPTVRTYTVLIKSLCGSERKSEALNLVKEMEETGIK 360

Query: 373 PNVHTYTVLIRSLCEDNKFDDAKKLLDGMLEKGLVPSVVTYNAFIDGYCKKGMSTSALEI 432
           PN+HTYTVLI SLC   KF+ A++LL  MLEKGL+P+V+TYNA I+GYCK+GM   A+++
Sbjct: 361 PNIHTYTVLIDSLCSQCKFEKARELLGQMLEKGLMPNVITYNALINGYCKRGMIEDAVDV 420

Query: 433 LSLMESNNCSPNTRTYNELILGFCRAKNVHKAMLLLHKMLELKLQPDVVTYNLLIHGQCK 492
           + LMES   SPNTRTYNELI G+C++ NVHKAM +L+KMLE K+ PDVVTYN LI GQC+
Sbjct: 421 VELMESRKLSPNTRTYNELIKGYCKS-NVHKAMGVLNKMLERKVLPDVVTYNSLIDGQCR 480

Query: 493 EGHLGSAYKLLSLMNENGLVPDEWTYSVFIAVLCKRGRVEDARFLFDSLKEKGIKANEVI 552
            G+  SAY+LLSLMN+ GLVPD+WTY+  I  LCK  RVE+A  LFDSL++KG+  N V+
Sbjct: 481 SGNFDSAYRLLSLMNDRGLVPDQWTYTSMIDSLCKSKRVEEACDLFDSLEQKGVNPNVVM 540

Query: 553 YSALIDGYCKVGKVSDGHSLLDKMLSDGCVPNSITYNSLIDGHCKEKNFQEALLLVEIMI 612
           Y+ALIDGYCK GKV + H +L+KMLS  C+PNS+T+N+LI G C +   +EA LL E M+
Sbjct: 541 YTALIDGYCKAGKVDEAHLMLEKMLSKNCLPNSLTFNALIHGLCADGKLKEATLLEEKMV 600

Query: 613 KRDIKPTADTYTILIKNLLKDGEFDRAHQMFDQMLSAGSHPDVVIYTVFIHAYCSLGRLQ 672
           K  ++PT  T TILI  LLKDG+FD A+  F QMLS+G+ PD   YT FI  YC  GRL 
Sbjct: 601 KIGLQPTVSTDTILIHRLLKDGDFDHAYSRFQQMLSSGTKPDAHTYTTFIQTYCREGRLL 660

Query: 673 DAELFLHKMNEKGILPDTLLYSLLIDAYGWSGSIDIAFDILKRMHDIGCEPSFYTYSYLI 732
           DAE  + KM E G+ PD   YS LI  YG  G  + AFD+LKRM D GCEPS +T+  LI
Sbjct: 661 DAEDMMAKMRENGVSPDLFTYSSLIKGYGDLGQTNFAFDVLKRMRDTGCEPSQHTFLSLI 720

Query: 733 KHLLSAKL-IEVNSSTELGDLSSGVVSNDFANLWRRVDYEFALELFEEMVKQGCAPNANT 792
           KHLL  K   +  S  EL  +S+             ++++  +EL E+MV+    PNA +
Sbjct: 721 KHLLEMKYGKQKGSEPELCAMSN------------MMEFDTVVELLEKMVEHSVTPNAKS 780

Query: 793 YGKFISGLCKVGCLEVGRRLFDHM-KEKGLSPNEDIYNSLLGCSCQLGLYEKAIRWLDIM 852
           Y K I G+C+VG L V  ++FDHM + +G+SP+E ++N+LL C C+L  + +A + +D M
Sbjct: 781 YEKLILGICEVGNLRVAEKVFDHMQRNEGISPSELVFNALLSCCCKLKKHNEAAKVVDDM 840

Query: 853 VEHGYLPHLDSCKLLLCGLFDEGNNEKAKTVFHSLLQCGYNYDEIAWKLLIDGLLQKGLV 912
           +  G+LP L+SCK+L+CGL+ +G  E+  +VF +LLQCGY  DE+AWK++IDG+ ++GLV
Sbjct: 841 ICVGHLPQLESCKVLICGLYKKGEKERGTSVFQNLLQCGYYEDELAWKIIIDGVGKQGLV 900

Query: 913 DKCSELFGVMERQGCQIHPKTYSMLIEGFDD 924
           +   ELF VME+ GC+   +TYS+LIEG  D
Sbjct: 901 EAFYELFNVMEKNGCKFSSQTYSLLIEGPPD 913

BLAST of Cp4.1LG07g05790 vs. TAIR 10
Match: AT3G07290.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 481.1 bits (1237), Expect = 2.0e-135
Identity = 294/897 (32.78%), Positives = 479/897 (53.40%), Query Frame = 0

Query: 21  INSGQLLIVLGFRLRFTFTLAFKFFTSTTASLPQSLPV-EHDVPAQLFSILSRPDWQKHP 80
           I S + ++ LG   R  F     F  S+  SL  S  V  HDV     S+L  P+W+K+ 
Sbjct: 6   IRSTRKILALG---RHVFPSNAFFSVSSRPSLSSSDEVAAHDVA----SLLKTPNWEKNS 65

Query: 81  SLKILIPSIAPSHVSSLFAL-NLDPKTALAFFNWIEQKHGFKHNVQSYVSILNILVPNGY 140
           SLK L+  + P+  S + +L   D    + FF W+ +   +  +      +L ++V +G 
Sbjct: 66  SLKSLVSHMNPNVASQVISLQRSDNDICVRFFMWVCKHSSYCFDPTQKNQLLKLIVSSGL 125

Query: 141 HRIAEKLRILMIKSTNSAENALFVLEMLRSMNRRGDDLRFKLTLKSYNMLLMLLSRFLMI 200
           +R+A  + + +IK  +  E  +  L+++   +   +   F+L    Y+ LLM L++  + 
Sbjct: 126 YRVAHAVIVALIKECSRCEKEM--LKLMYCFDELREVFGFRLNYPCYSSLLMSLAKLDLG 185

Query: 201 DEMKNVYLEMLDDMVSPNMYTLNTMVNGYCKLGNVVEAELYVSKIVQAGLSLDTFTYTSL 260
                 Y  M  D     M    T+VN  CK G    AE+++SKI++ G  LD+   TSL
Sbjct: 186 FLAYVTYRRMEADGFVVGMIDYRTIVNALCKNGYTEAAEMFMSKILKIGFVLDSHIGTSL 245

Query: 261 ILGYCRNKNVDGANKIFLSMPSK-GCRRNEVSYTNLIHGFCEAGRIDEALKLLSQMHEDN 320
           +LG+CR  N+  A K+F  M  +  C  N VSY+ LIHG CE GR++EA  L  QM E  
Sbjct: 246 LLGFCRGLNLRDALKVFDVMSKEVTCAPNSVSYSILIHGLCEVGRLEEAFGLKDQMGEKG 305

Query: 321 CWPTVRTYTVIICALCQMGRKSEAFNVFKEMTEKGCEPNVHTYTVLIRSLCEDNKFDDAK 380
           C P+ RTYTV+I ALC  G   +AFN+F EM  +GC+PNVHTYTVLI  LC D K ++A 
Sbjct: 306 CQPSTRTYTVLIKALCDRGLIDKAFNLFDEMIPRGCKPNVHTYTVLIDGLCRDGKIEEAN 365

Query: 381 KLLDGMLEKGLVPSVVTYNAFIDGYCKKGMSTSALEILSLMESNNCSPNTRTYNELILGF 440
            +   M++  + PSV+TYNA I+GYCK G    A E+L++ME   C PN RT+NEL+ G 
Sbjct: 366 GVCRKMVKDRIFPSVITYNALINGYCKDGRVVPAFELLTVMEKRACKPNVRTFNELMEGL 425

Query: 441 CRAKNVHKAMLLLHKMLELKLQPDVVTYNLLIHGQCKEGHLGSAYKLLSLMNENGLVPDE 500
           CR    +KA+ LL +ML+  L PD+V+YN+LI G C+EGH+ +AYKLLS MN   + PD 
Sbjct: 426 CRVGKPYKAVHLLKRMLDNGLSPDIVSYNVLIDGLCREGHMNTAYKLLSSMNCFDIEPDC 485

Query: 501 WTYSVFIAVLCKRGRVEDARFLFDSLKEKGIKANEVIYSALIDGYCKVGKVSDGHSLLDK 560
            T++  I   CK+G+ + A      +  KGI  +EV  + LIDG CKVGK  D   +L+ 
Sbjct: 486 LTFTAIINAFCKQGKADVASAFLGLMLRKGISLDEVTGTTLIDGVCKVGKTRDALFILET 545

Query: 561 MLSDGCVPNSITYNSLIDGHCKEKNFQEALLLVEIMIKRDIKPTADTYTILIKNLLKDGE 620
           ++    +    + N ++D   K    +E L ++  + K  + P+  TYT L+  L++ G+
Sbjct: 546 LVKMRILTTPHSLNVILDMLSKGCKVKEELAMLGKINKLGLVPSVVTYTTLVDGLIRSGD 605

Query: 621 FDRAHQMFDQMLSAGSHPDVVIYTVFIHAYCSLGRLQDAELFLHKMNEKGILPDTLLYSL 680
              + ++ + M  +G  P+V  YT+ I+  C  GR+++AE  L  M + G+ P+ + Y++
Sbjct: 606 ITGSFRILELMKLSGCLPNVYPYTIIINGLCQFGRVEEAEKLLSAMQDSGVSPNHVTYTV 665

Query: 681 LIDAYGWSGSIDIAFDILKRMHDIGCEPSFYTYSYLIK-HLLSAKLIEVNSSTELGDLSS 740
           ++  Y  +G +D A + ++ M + G E +   YS L++  +LS K I+ +  + + D++ 
Sbjct: 666 MVKGYVNNGKLDRALETVRAMVERGYELNDRIYSSLLQGFVLSQKGIDNSEESTVSDIA- 725

Query: 741 GVVSNDFANLWRRVDYEFALELFEEMVKQ--GCAPNANTYGKFISGLCKVGCLEVGRRLF 800
                      R  D E   EL   +V+Q  GC      +   ++ LCK G  +    L 
Sbjct: 726 ----------LRETDPECINELI-SVVEQLGGCISGLCIF--LVTRLCKEGRTDESNDLV 785

Query: 801 DHMKEKGLSPNEDIYNSLLGCSCQLGLYEKAIRWLDIMVEHGYLPHLDSCKLLLCGLFDE 860
            ++ E+G+   E   + ++   C    + K +  + ++++ G++P   S  L++ GL  E
Sbjct: 786 QNVLERGVF-LEKAMDIIMESYCSKKKHTKCMELITLVLKSGFVPSFKSFCLVIQGLKKE 845

Query: 861 GNNEKAKTVFHSLLQCGYNYDEIAWKLLIDGLLQKGLVDKCSELFGVMERQGCQIHP 912
           G+ E+A+ +   LL      ++      ++ L++      CSE+  ++++  C+  P
Sbjct: 846 GDAERARELVMELLTSNGVVEKSGVLTYVECLMEGDETGDCSEVIDLVDQLHCRERP 878

BLAST of Cp4.1LG07g05790 vs. TAIR 10
Match: AT1G77340.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 418.7 bits (1075), Expect = 1.2e-116
Identity = 223/420 (53.10%), Positives = 283/420 (67.38%), Query Frame = 0

Query: 82  KILIPSIAPSHVSSLFALNLDPKTALAFFNWIEQKHGFKHNVQSYVSILNILVPNGYHRI 141
           KI  P   PSHVSSLF+LNLDP+TAL+F +WI +   FKHNV SY S++ +L        
Sbjct: 19  KISYPFYTPSHVSSLFSLNLDPQTALSFSDWISRIPNFKHNVTSYASLVTLLCSQEIPYE 78

Query: 142 AEKLRILMIKSTNSAENALFVLEMLRSMNRRGD--DLRFKLTLKSYNMLLMLLSRFLMID 201
             K+ ILMIKS NS  +ALFV++  R+M R+GD  ++++KLT K YN LL  L+RF +++
Sbjct: 79  VPKITILMIKSCNSVRDALFVVDFCRTM-RKGDSFEIKYKLTPKCYNNLLSSLARFGLVE 138

Query: 202 EMKNVYLEMLDDMVSPNMYTLNTMVNGYCKLGNVVEAELYVSKIVQAGLSLDTFTYTSLI 261
           EMK +Y EML+D+VSP++YT NT+VNGYCKLG VVEA+ YV+ ++QAG   D FTYTS I
Sbjct: 139 EMKRLYTEMLEDLVSPDIYTFNTLVNGYCKLGYVVEAKQYVTWLIQAGCDPDYFTYTSFI 198

Query: 262 LGYCRNKNVDGANKIFLSMPSKGCRRNEVSYTNLIHGFCEAGRIDEALKLLSQMHEDNCW 321
            G+CR K VD A K+F  M   GC RNEVSYT LI+G  EA +IDEAL LL +M +DNC 
Sbjct: 199 TGHCRRKEVDAAFKVFKEMTQNGCHRNEVSYTQLIYGLFEAKKIDEALSLLVKMKDDNCC 258

Query: 322 PTVRTYTVIICALCQMGRKSEAFNVFKEMTEKGCEPNVHTYTVLIRSLCEDNKFDDAKKL 381
           P VRTYTV+I ALC  G+KSEA N+FK+M+E G +P+   YTVLI+S C  +  D+A  L
Sbjct: 259 PNVRTYTVLIDALCGSGQKSEAMNLFKQMSESGIKPDDCMYTVLIQSFCSGDTLDEASGL 318

Query: 382 LDGMLEKGLVPSVVTYNAFIDGYCKKGMSTSALEILSLMESNNCSPNTRTYNELILGFCR 441
           L+ MLE GL+P+V+TYNA I G+CK                                   
Sbjct: 319 LEHMLENGLMPNVITYNALIKGFCK----------------------------------- 378

Query: 442 AKNVHKAMLLLHKMLELKLQPDVVTYNLLIHGQCKEGHLGSAYKLLSLMNENGLVPDEWT 500
            KNVHKAM LL KMLE  L PD++TYN LI GQC  G+L SAY+LLSLM E+GLVP++ T
Sbjct: 379 -KNVHKAMGLLSKMLEQNLVPDLITYNTLIAGQCSSGNLDSAYRLLSLMEESGLVPNQRT 401

BLAST of Cp4.1LG07g05790 vs. TAIR 10
Match: AT5G59900.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 327.8 bits (839), Expect = 2.8e-89
Identity = 233/877 (26.57%), Positives = 419/877 (47.78%), Query Frame = 0

Query: 38  FTLAFKFFTSTTASLPQSLPVEHD---VPAQLFSILSRPDWQKHPSLKILIPSIAPSHVS 97
           F  +F+  +S   S  +   +  D   V A    +  +  W+   S +++   +   HV 
Sbjct: 15  FRNSFRNVSSVIDSAQEECRIAEDKQFVDAVKRIVRGKRSWEIALSSELVSRRLKTVHVE 74

Query: 98  SLFALNL-DPKTALAFFNWIEQKHGFKHNVQSYVSILNILV-PNGYHRIAEKLRILMIKS 157
            +    + DPK  L FFN++    GF H+  S+  +++ LV  N +   +  L+ L++++
Sbjct: 75  EILIGTIDDPKLGLRFFNFLGLHRGFDHSTASFCILIHALVKANLFWPASSLLQTLLLRA 134

Query: 158 TNSAE--NALF-VLEMLRSMNRRGDDLRFKLTLKSYNML------LMLLSRFLMIDEMK- 217
              ++  N LF   E  +  +    DL  +  ++S  +L       M++++  ++ E++ 
Sbjct: 135 LKPSDVFNVLFSCYEKCKLSSSSSFDLLIQHYVRSRRVLDGVLVFKMMITKVSLLPEVRT 194

Query: 218 --------------NVYLEMLDDMVS----PNMYTLNTMVNGYCKLGNVVEAELYVSKIV 277
                          + +E+ +DMVS    P++Y    ++   C+L ++  A+  ++ + 
Sbjct: 195 LSALLHGLVKFRHFGLAMELFNDMVSVGIRPDVYIYTGVIRSLCELKDLSRAKEMIAHME 254

Query: 278 QAGLSLDTFTYTSLILGYCRNKNVDGANKIFLSMPSKGCRRNEVSYTNLIHGFCEAGRID 337
             G  ++   Y  LI G C+ + V  A  I   +  K  + + V+Y  L++G C+    +
Sbjct: 255 ATGCDVNIVPYNVLIDGLCKKQKVWEAVGIKKDLAGKDLKPDVVTYCTLVYGLCKVQEFE 314

Query: 338 EALKLLSQMHEDNCWPTVRTYTVIICALCQMGRKSEAFNVFKEMTEKGCEPNVHTYTVLI 397
             L+++ +M      P+    + ++  L + G+  EA N+ K + + G  PN+  Y  LI
Sbjct: 315 IGLEMMDEMLCLRFSPSEAAVSSLVEGLRKRGKIEEALNLVKRVVDFGVSPNLFVYNALI 374

Query: 398 RSLCEDNKFDDAKKLLDGMLEKGLVPSVVTYNAFIDGYCKKGMSTSALEILSLMESNNCS 457
            SLC+  KF +A+ L D M + GL P+ VTY+  ID +C++G   +AL  L  M      
Sbjct: 375 DSLCKGRKFHEAELLFDRMGKIGLRPNDVTYSILIDMFCRRGKLDTALSFLGEMVDTGLK 434

Query: 458 PNTRTYNELILGFCRAKNVHKAMLLLHKMLELKLQPDVVTYNLLIHGQCKEGHLGSAYKL 517
            +   YN LI G C+  ++  A   + +M+  KL+P VVTY  L+ G C +G +  A +L
Sbjct: 435 LSVYPYNSLINGHCKFGDISAAEGFMAEMINKKLEPTVVTYTSLMGGYCSKGKINKALRL 494

Query: 518 LSLMNENGLVPDEWTYSVFIAVLCKRGRVEDARFLFDSLKEKGIKANEVIYSALIDGYCK 577
              M   G+ P  +T++  ++ L + G + DA  LF+ + E  +K N V Y+ +I+GYC+
Sbjct: 495 YHEMTGKGIAPSIYTFTTLLSGLFRAGLIRDAVKLFNEMAEWNVKPNRVTYNVMIEGYCE 554

Query: 578 VGKVSDGHSLLDKMLSDGCVPNSITYNSLIDGHCKEKNFQEALLLVEIMIKRDIKPTADT 637
            G +S     L +M   G VP++ +Y  LI G C      EA + V+ + K + +     
Sbjct: 555 EGDMSKAFEFLKEMTEKGIVPDTYSYRPLIHGLCLTGQASEAKVFVDGLHKGNCELNEIC 614

Query: 638 YTILIKNLLKDGEFDRAHQMFDQMLSAGSHPDVVIYTVFIHAYCSLGRLQDAELF---LH 697
           YT L+    ++G+ + A  +  +M+  G   D+V Y V I    SL + +D +LF   L 
Sbjct: 615 YTGLLHGFCREGKLEEALSVCQEMVQRGVDLDLVCYGVLIDG--SL-KHKDRKLFFGLLK 674

Query: 698 KMNEKGILPDTLLYSLLIDAYGWSGSIDIAFDILKRMHDIGCEPSFYTYSYLIKHLLSAK 757
           +M+++G+ PD ++Y+ +IDA   +G    AF I   M + GC P+  TY+ +I  L  A 
Sbjct: 675 EMHDRGLKPDDVIYTSMIDAKSKTGDFKEAFGIWDLMINEGCVPNEVTYTAVINGLCKAG 734

Query: 758 LIE-----VNSSTELGDLSSGVVSNDFANLWRR--VDYEFALELFEEMVKQGCAPNANTY 817
            +       +    +  + + V    F ++  +  VD + A+EL   ++K G   N  TY
Sbjct: 735 FVNEAEVLCSKMQPVSSVPNQVTYGCFLDILTKGEVDMQKAVELHNAILK-GLLANTATY 794

Query: 818 GKFISGLCKVGCLEVGRRLFDHMKEKGLSPNEDIYNSLLGCSCQLGLYEKAIRWLDIMVE 872
              I G C+ G +E    L   M   G+SP+   Y +++   C+    +KAI   + M E
Sbjct: 795 NMLIRGFCRQGRIEEASELITRMIGDGVSPDCITYTTMINELCRRNDVKKAIELWNSMTE 854

BLAST of Cp4.1LG07g05790 vs. TAIR 10
Match: AT4G31850.1 (proton gradient regulation 3)

HSP 1 Score: 322.4 bits (825), Expect = 1.2e-87
Identity = 248/942 (26.33%), Positives = 399/942 (42.36%), Query Frame = 0

Query: 102  DPKTALAFFNWIEQKHGFK------HNVQSYVSILNILVPNGYHRIAEKLRILMIKSTNS 161
            D  T L  F  +  K G K        ++ +  +LN    NG       L  L++KS   
Sbjct: 152  DTNTYLTIFKSLSVKGGLKQAPYALRKMREFGFVLNAYSYNG-------LIHLLLKSRFC 211

Query: 162  AENALFVLEMLRSMNRRGDDLRFKLTLKSYNMLLMLLSRFLMIDEMKNVYLEMLDDMVSP 221
             E     +E+ R M   G    F+ +L++Y+ L++ L +   ID +  +  EM    + P
Sbjct: 212  TE----AMEVYRRMILEG----FRPSLQTYSSLMVGLGKRRDIDSVMGLLKEMETLGLKP 271

Query: 222  NMYTLNTMVNGYCKLGNVVEAELYVSKIVQAGLSLDTFTYTSLILGYCRNKNVDGANKIF 281
            N+YT    +    + G + EA   + ++   G   D  TYT LI   C  + +D A ++F
Sbjct: 272  NVYTFTICIRVLGRAGKINEAYEILKRMDDEGCGPDVVTYTVLIDALCTARKLDCAKEVF 331

Query: 282  LSMPSKGCRRNEVSYTNLIHGFCEAGRIDEALKLLSQMHEDNCWPTVRTYTVIICALCQM 341
              M +   + + V+Y  L+  F +   +D   +  S+M +D   P V T+T+++ ALC+ 
Sbjct: 332  EKMKTGRHKPDRVTYITLLDRFSDNRDLDSVKQFWSEMEKDGHVPDVVTFTILVDALCKA 391

Query: 342  GRKSEAFNVFKEMTEKGCEPNVHTYTVLIRSLCEDNKFDDAKKLLDGMLEKGLVPSVVTY 401
            G   EAF+    M ++G  PN+HTY  LI  L   ++ DDA +L   M   G+ P+  TY
Sbjct: 392  GNFGEAFDTLDVMRDQGILPNLHTYNTLICGLLRVHRLDDALELFGNMESLGVKPTAYTY 451

Query: 402  NAFIDGYCKKGMSTSALEILSLMESNNCSPNTRTYNELILGFCRAKNVHKAMLLLHKMLE 461
              FID Y K G S SALE    M++   +PN    N  +    +A    +A  + + + +
Sbjct: 452  IVFIDYYGKSGDSVSALETFEKMKTKGIAPNIVACNASLYSLAKAGRDREAKQIFYGLKD 511

Query: 462  LKLQPDVVTYNLLIHGQCKEGHLGSAYKLLSLMNENGLVPDEWTYSVFIAVLCKRGRVED 521
            + L PD VTYN+++    K G +  A KLLS M ENG  PD    +  I  L K  RV++
Sbjct: 512  IGLVPDSVTYNMMMKCYSKVGEIDEAIKLLSEMMENGCEPDVIVVNSLINTLYKADRVDE 571

Query: 522  ARFLFDSLKEKGIKANEVIYSALIDGYCKVGKVSDGHSLLDKMLSDGCVPNSITYNSLID 581
            A  +F  +KE  +K   V Y+ L+ G  K GK+ +   L + M+  GC PN+IT+N+L D
Sbjct: 572  AWKMFMRMKEMKLKPTVVTYNTLLAGLGKNGKIQEAIELFEGMVQKGCPPNTITFNTLFD 631

Query: 582  GHCKEKNFQEALLLVEIMIKRDIKPTADTYTILIKNLLKDGEFDRA----HQM------- 641
              CK      AL ++  M+     P   TY  +I  L+K+G+   A    HQM       
Sbjct: 632  CLCKNDEVTLALKMLFKMMDMGCVPDVFTYNTIIFGLVKNGQVKEAMCFFHQMKKLVYPD 691

Query: 642  ----------------------------------------------------------FD 701
                                                                      F 
Sbjct: 692  FVTLCTLLPGVVKASLIEDAYKIITNFLYNCADQPANLFWEDLIGSILAEAGIDNAVSFS 751

Query: 702  QMLSA--------------------------------------GSHPDVVIYTVFIHAYC 761
            + L A                                      G  P +  Y + I    
Sbjct: 752  ERLVANGICRDGDSILVPIIRYSCKHNNVSGARTLFEKFTKDLGVQPKLPTYNLLIGGLL 811

Query: 762  SLGRLQDAELFLHKMNEKGILPDTLLYSLLIDAYGWSGSIDIAFDILKRMHDIGCEPSFY 821
                ++ A+    ++   G +PD   Y+ L+DAYG SG ID  F++ K M    CE +  
Sbjct: 812  EADMIEIAQDVFLQVKSTGCIPDVATYNFLLDAYGKSGKIDELFELYKEMSTHECEANTI 871

Query: 822  TYSYLIKHLLSAKLIEVNSSTELGDLSSGVVSNDFANLWRRVD--------YEFALELFE 881
            T++ +I  L+ A  ++         +S    S         +D        YE A +LFE
Sbjct: 872  THNIVISGLVKAGNVDDALDLYYDLMSDRDFSPTACTYGPLIDGLSKSGRLYE-AKQLFE 931

Query: 882  EMVKQGCAPNANTYGKFISGLCKVGCLEVGRRLFDHMKEKGLSPNEDIYNSLLGCSCQLG 922
             M+  GC PN   Y   I+G  K G  +    LF  M ++G+ P+   Y+ L+ C C +G
Sbjct: 932  GMLDYGCRPNCAIYNILINGFGKAGEADAACALFKRMVKEGVRPDLKTYSVLVDCLCMVG 991
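
The percentages in each HSP header above are simple ratios of matching columns to alignment length. The following is a minimal sketch, not part of the database output, that parses such a header line and recomputes the percentages as a consistency check. The function names `parse_hsp_header` and `recomputed_percent` are hypothetical, and the regular expression assumes the `label = n/m (p%)` layout seen in the headers above.

```python
import re

# Matches fields such as "Identity = 494/931 (53.06%)" in a BLAST HSP header.
# The label is captured as any word, so the pattern does not depend on spelling.
FIELD = re.compile(r"(\w+) = (\d+)/(\d+) \(([\d.]+)%\)")


def parse_hsp_header(line):
    """Return {label: (matches, alignment_length, reported_percent)}."""
    return {
        label: (int(num), int(den), float(pct))
        for label, num, den, pct in FIELD.findall(line)
    }


def recomputed_percent(matches, length):
    """Percentage of aligned columns, rounded to two decimals like the report."""
    return round(100.0 * matches / length, 2)


# Values copied from the AT5G65560.1 hit; "Query Frame = 0" has no n/m part
# and is therefore ignored by the pattern.
header = "Identity = 494/931 (53.06%), Positives = 648/931 (69.60%), Query Frame = 0"
fields = parse_hsp_header(header)
for label, (matches, length, reported) in fields.items():
    print(label, recomputed_percent(matches, length), reported)
```

The denominator is the alignment length including gap columns (931 for the first TAIR hit), not the length of either protein, which is why hits with long gaps can show a low identity over a long alignment.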

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
Q9LSL9 | 4.9e-280 | 53.06 | Pentatricopeptide repeat-containing protein At5g65560 OS=Arabidopsis thaliana OX...
Q9SFV9 | 2.8e-134 | 32.78 | Pentatricopeptide repeat-containing protein At3g07290, mitochondrial OS=Arabidop...
Q76C99 | 6.2e-89 | 26.23 | Protein Rf1, mitochondrial OS=Oryza sativa subsp. indica OX=39946 GN=Rf1 PE=2 SV...
Q9FJE6 | 4.0e-88 | 26.57 | Putative pentatricopeptide repeat-containing protein At5g59900 OS=Arabidopsis th...
Q9SZ52 | 1.7e-86 | 26.33 | Pentatricopeptide repeat-containing protein At4g31850, chloroplastic OS=Arabidop...

Match Name | E-value | Identity | Description
XP_023536697.1 | 0.0 | 100.00 | pentatricopeptide repeat-containing protein At5g65560-like isoform X1 [Cucurbita...
XP_022951246.1 | 0.0 | 97.84 | pentatricopeptide repeat-containing protein At5g65560-like [Cucurbita moschata]
KAG6585789.1 | 0.0 | 97.84 | Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma sub...
XP_023002847.1 | 0.0 | 97.31 | pentatricopeptide repeat-containing protein At5g65560-like [Cucurbita maxima]
XP_022153102.1 | 0.0 | 85.78 | pentatricopeptide repeat-containing protein At5g65560 isoform X1 [Momordica char...

Match Name | E-value | Identity | Description
A0A6J1GH11 | 0.0 | 97.84 | pentatricopeptide repeat-containing protein At5g65560-like OS=Cucurbita moschata...
A0A6J1KKQ2 | 0.0 | 97.31 | pentatricopeptide repeat-containing protein At5g65560-like OS=Cucurbita maxima O...
A0A6J1DI13 | 0.0 | 85.78 | pentatricopeptide repeat-containing protein At5g65560 isoform X1 OS=Momordica ch...
A0A0A0KFF8 | 0.0 | 83.08 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G355970 PE=4 SV=1
A0A5A7T899 | 0.0 | 82.76 | Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946...

Match Name | E-value | Identity | Description
AT5G65560.1 | 3.5e-281 | 53.06 | Pentatricopeptide repeat (PPR) superfamily protein
AT3G07290.1 | 2.0e-135 | 32.78 | Pentatricopeptide repeat (PPR) superfamily protein
AT1G77340.1 | 1.2e-116 | 53.10 | Pentatricopeptide repeat (PPR) superfamily protein
AT5G59900.1 | 2.8e-89 | 26.57 | Pentatricopeptide repeat (PPR) superfamily protein
AT4G31850.1 | 1.2e-87 | 26.33 | proton gradient regulation 3
InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
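IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR002885 | Pentatricopeptide repeat | PFAM | PF13812 | PPR_3 | coord: 758..815, e-value: 2.9E-9, score: 36.8
IPR002885 | Pentatricopeptide repeat | PFAM | PF13812 | PPR_3 | coord: 660..715, e-value: 5.9E-9, score: 35.9
IPR002885 | Pentatricopeptide repeat | PFAM | PF13041 | PPR_2 | coord: 876..921, e-value: 1.9E-7, score: 31.2
IPR002885 | Pentatricopeptide repeat | PFAM | PF13041 | PPR_2 | coord: 251..298, e-value: 1.6E-12, score: 47.4
IPR002885 | Pentatricopeptide repeat | PFAM | PF13041 | PPR_2 | coord: 390..439, e-value: 2.4E-15, score: 56.5
IPR002885 | Pentatricopeptide repeat | PFAM | PF13041 | PPR_2 | coord: 460..509, e-value: 3.4E-15, score: 56.0
IPR002885 | Pentatricopeptide repeat | PFAM | PF13041 | PPR_2 | coord: 531..579, e-value: 9.4E-15, score: 54.6
IPR002885 | Pentatricopeptide repeat | PFAM | PF13041 | PPR_2 | coord: 320..368, e-value: 1.6E-17, score: 63.4
IPR002885 | Pentatricopeptide repeat | PFAM | PF12854 | PPR_1 | coord: 212..237, e-value: 4.3E-7, score: 29.5
IPR002885 | Pentatricopeptide repeat | PFAM | PF12854 | PPR_1 | coord: 597..628, e-value: 1.7E-5, score: 24.5
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 499..531, e-value: 1.3E-6, score: 26.2
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 324..357, e-value: 1.2E-9, score: 35.7
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 604..637, e-value: 1.9E-8, score: 31.9
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 533..567, e-value: 3.4E-10, score: 37.4
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 773..806, e-value: 7.5E-8, score: 30.1
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 751..771, e-value: 0.0027, score: 15.8
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 358..392, e-value: 7.7E-9, score: 33.2
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 463..497, e-value: 5.8E-7, score: 27.3
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 288..318, e-value: 1.8E-8, score: 32.0
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 429..462, e-value: 5.1E-7, score: 27.5
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 808..839, e-value: 2.2E-4, score: 19.2
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 393..427, e-value: 1.1E-9, score: 35.8
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 638..672, e-value: 5.2E-7, score: 27.5
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 218..251, e-value: 1.1E-5, score: 23.2
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 253..286, e-value: 2.8E-7, score: 28.3
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 568..601, e-value: 8.2E-9, score: 33.1
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 878..908, e-value: 0.0013, score: 16.8
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 321..355, score: 12.978237
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 216..250, score: 10.062531
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 671..705, score: 10.840783
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 356..390, score: 12.802855
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 426..460, score: 11.334042
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 875..909, score: 9.88715
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 286..320, score: 13.044004
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 805..839, score: 10.884628
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 461..495, score: 12.364404
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 566..600, score: 12.561707
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 391..425, score: 12.353442
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 601..635, score: 11.268274
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 531..565, score: 12.978237
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 251..285, score: 11.717688
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 770..804, score: 11.893068
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 496..530, score: 11.509422
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 636..670, score: 12.134216
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 595..733, e-value: 4.5E-32, score: 113.6
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 91..242, e-value: 3.1E-20, score: 74.5
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 456..525, e-value: 3.3E-19, score: 71.1
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 315..383, e-value: 3.3E-23, score: 84.2
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 243..314, e-value: 1.6E-20, score: 75.4
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 872..928, e-value: 2.3E-7, score: 32.5
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 526..594, e-value: 2.1E-20, score: 75.0
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 384..455, e-value: 1.1E-19, score: 72.7
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 734..871, e-value: 6.6E-30, score: 106.6
IPR011990 | Tetratricopeptide-like helical domain superfamily | SUPERFAMILY | 48452 | TPR-like | coord: 317..839
None | No IPR available | PANTHER | PTHR47938:SF4 | REPEAT-CONTAINING PROTEIN, PUTATIVE-RELATED | coord: 31..875
None | No IPR available | PANTHER | PTHR47938 | RESPIRATORY COMPLEX I CHAPERONE (CIA84), PUTATIVE (AFU_ORTHOLOGUE AFUA_2G06020)-RELATED | coord: 31..875
None | No IPR available | SUPERFAMILY | 81901 | HCP-like | coord: 750..875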
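
The PROSITE PS51375 coordinates in the InterPro table are listed in no particular order, which makes the tandem arrangement of the PPR motifs hard to see. The following is a minimal sketch, not part of the InterPro output, that sorts those coordinates and groups directly adjacent repeats into runs. The helper name `tandem_runs` is hypothetical, and adjacency is assumed to mean that one repeat starts at the residue immediately after the previous one ends.

```python
# PROSITE PS51375 (PPR) coordinates copied from the InterPro table,
# as (start, end) residue positions on the predicted protein.
PPR_COORDS = [
    (321, 355), (216, 250), (671, 705), (356, 390), (426, 460), (875, 909),
    (286, 320), (805, 839), (461, 495), (566, 600), (391, 425), (601, 635),
    (531, 565), (251, 285), (770, 804), (496, 530), (636, 670),
]


def tandem_runs(coords):
    """Group intervals into runs in which each repeat starts right after
    the previous repeat ends (no gap, no overlap)."""
    runs = []
    for start, end in sorted(coords):
        if runs and start == runs[-1][-1][1] + 1:
            runs[-1].append((start, end))
        else:
            runs.append([(start, end)])
    return runs


runs = tandem_runs(PPR_COORDS)
for run in runs:
    print(f"{run[0][0]}..{run[-1][1]}: {len(run)} repeat(s)")
```

With these coordinates the sketch reports one run of 14 adjacent 35-residue repeats spanning 216..705, a pair at 770..839, and a single repeat at 875..909. This reflects only the PROSITE profile boundaries; the PFAM and TIGRFAM hits in the same table use slightly different coordinates.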

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Cp4.1LG07g05790.1 | Cp4.1LG07g05790.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
molecular_function | GO:0005515 | protein binding