CmoCh10G010680 (gene) Cucurbita moschata (Rifu) v1

Overview
Name: CmoCh10G010680
Type: gene
Organism: Cucurbita moschata (Cucurbita moschata (Rifu) v1)
Description: Pentatricopeptide repeat-containing protein
Location: Cmo_Chr10: 5685444 .. 5687738 (+)
RNA-Seq Expression: CmoCh10G010680
Synteny: CmoCh10G010680
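
The Location field can be sanity-checked against the rest of the record. The sketch below assumes the coordinates are 1-based and inclusive (the usual convention for this kind of annotation, but not stated on the page); the variable names are illustrative only.

```python
# Coordinates copied from the Location field; assumed 1-based, inclusive.
start, end = 5685444, 5687738

span = end - start + 1      # length of the gene feature in base pairs
codons, remainder = divmod(span, 3)

# If the whole span is one uninterrupted reading frame, the span is a
# multiple of 3 and encodes (codons - 1) residues plus one stop codon.
print(span, codons, remainder)
```

Under that assumption the span is 2295 bp, i.e. 765 codons, which is consistent with a 764-residue protein plus a stop codon and with the gene, mRNA, and CDS sequences listed below being the same length.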
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGACCAACTGATTCTCTCCGCCGCCTCTCCGTCAGGCTCCGGCCACTCCCATTTGAATCTCCAACAGACCCACCAAATCCATGCCCATTGCATCAAAACCCAATTCCGTAACCCTCACAGTTTCTTCTCTCGATCCCACTTCACCCCTGAAGCCAATTTCAATCTCCTCATTTCATCTTACACCGACAACCACCTCCCACAAGCGGCCTTCATCTTGTATCACCATATGCGCACAACTGATGCTGCTGCAGTTGACAACTTCATTGTTCCTTCACTTCTCAAAGCCTGTGCTCAAGCTTCCTCTACAAATTTCGGCAGGGAAGTGCACGGTTTCGCGGTTAAGAACGGGTTCGTATCGGACGTTTTTGTGTGCAATGCTTTGATGAACATGTATGAGAAATGTGGGAGTTTGGTTTCTGCTTGCTTGGTGTTTGATAAAATGCCTGACAGAGATGTTGTCTCTTGGAGTACTATGCTTGGGTGCTACGTGCGGAGCAAATCGTTCGGTGAAGCATATAGGCTCGTTCGAGAGATGCATTTTGTGGGAGTGAAGCTTAGTGATGTTGCTTTGATTAGCATGATTGGTGTATTTGGGGAGCTCTCGGATATGAAGTCGGGGAGGGCGATTCATGGTTACGTTGTGAGAAATGTTGGTAATGAGAGAATCGAACTTCCTTTAACAACTGCGTTGATTGATATGTATTGCAAGGGCGACAAATTGGCATCGGCAATGAGGCTTTTCGATGGGTTATCTCAGAGAAACGTCGTTTCTTGGACGGCGTTGATAGCGGGTTGTATTCGCAGTTGCAGGTTCGTTGAAGGGGCAAAGAATTTTAGTAGAATGCTTGAAGAAAACATAGCTCCTAATGAGATCACTTTACTAAGTTTGATAACAGAGTGTGGCTTTGTGGGAGCCTTGGATTTGGGCAAGTGGTTGCATGCCTATCTGTTAAGGAATGGGTTTGGGATGTCTCTGGCTTTGGCCACTGCTCTCATAGACATGTATGGAAAGTGTGGGCAAGTTGCATATGCTAGAGCTCTTTTCAATGGCGTCGAGGAAAAAGATGTCAAGATTTGGAGCGCTTTGATATCGGCTTACGCTCATGCGAGTTGCATCGATCAAGCGTTCAGCCTCTTCCTTAAGATGTTAGACAGTGAGGTGAAACCAAATAAGGTGACAATGGTTAGCCTGCTTTCTTTATGTGCAGAGGTTGGAGCCCTTGACCTTGGTAGATGGACTCATGCTTACATAAACCGTCATGGTGTCGAAGTAGACGTCGTTTTAGAAACAGCGCTCATCAACATGTATGCGAAATGTGGAGATCTAAAAACTGCTCGTTGCCTGTTCGATGAAGCCACACGACGAGATATTCACATGTGGAATGCAATGATGGCTGGATTCTCAATCCATGGTTGTGGAAAAGAAGCTTTAGAACTCTTTTCAGATATGGTGTGTCATGGTGTTGAACCTAATGACATCACATTCATTTCTGTTTTTCATGCTTGTAGTCATTCTGGATTGGTAGGGGAGGGAATGAAGCATTTCGACAGAATGGTTCATGAATTTGGAATAGTTCCAAAGATCGAACACTATGGATGCTTGGTAGATCTTCTTGGTCGAGCTAAACGTCTCGACGCAGCTCACAGCATCATCGAAAACATGCCCATGAGGCCCAACACAATTGTATGGGGTGCGCTGCTAGCTGCATGTAAGCTACATAAGAATCTGGCCTTGGGGGAGGTGGCTGCAAGAAAGATTCTTGAATTGGACCCAGAAAACTGTGGGTATAGAGTTCTTAAGTCAAACATCTATGCATCCGAAAAGAGATGGACCGATGTAACGAGCGTTAGAGAAACAATGAGCCATTTAGGGATGAAGAAAGAACCAGGACTCAGCTGGATTGAGGTAAATGGCTCAGTTCATCACTTCAGATCTGGAGATAAGACATGCACACAAACAAGAAAAGTACATGAAATGGTGACCGAAATGTGCATCAA
ATTGAGAGAGGCGGGATACGCACCGAACACATCTGCAGTTTTGTTAAACGTAGAAGACGAAGAGAAGGAATCTGCACTCAGTTACCATAGTGAGAAACTGGCCATGGCATTTGGACTCATAAGTACAGCCCCGGGTACGCCCATCCGAATCATTAAGAATCTGAGGATTTGCGATGACTGTCATGCTGCAACAAAACTATTATCGAAAATCTATGGACGAACAATAATAGTCAGAGATCGAAACCGATTTCATCACTTTAGTGAAGGATATTGTTCTTGTCTAGGCTACTGGTAA

mRNA sequence

ATGGACCAACTGATTCTCTCCGCCGCCTCTCCGTCAGGCTCCGGCCACTCCCATTTGAATCTCCAACAGACCCACCAAATCCATGCCCATTGCATCAAAACCCAATTCCGTAACCCTCACAGTTTCTTCTCTCGATCCCACTTCACCCCTGAAGCCAATTTCAATCTCCTCATTTCATCTTACACCGACAACCACCTCCCACAAGCGGCCTTCATCTTGTATCACCATATGCGCACAACTGATGCTGCTGCAGTTGACAACTTCATTGTTCCTTCACTTCTCAAAGCCTGTGCTCAAGCTTCCTCTACAAATTTCGGCAGGGAAGTGCACGGTTTCGCGGTTAAGAACGGGTTCGTATCGGACGTTTTTGTGTGCAATGCTTTGATGAACATGTATGAGAAATGTGGGAGTTTGGTTTCTGCTTGCTTGGTGTTTGATAAAATGCCTGACAGAGATGTTGTCTCTTGGAGTACTATGCTTGGGTGCTACGTGCGGAGCAAATCGTTCGGTGAAGCATATAGGCTCGTTCGAGAGATGCATTTTGTGGGAGTGAAGCTTAGTGATGTTGCTTTGATTAGCATGATTGGTGTATTTGGGGAGCTCTCGGATATGAAGTCGGGGAGGGCGATTCATGGTTACGTTGTGAGAAATGTTGGTAATGAGAGAATCGAACTTCCTTTAACAACTGCGTTGATTGATATGTATTGCAAGGGCGACAAATTGGCATCGGCAATGAGGCTTTTCGATGGGTTATCTCAGAGAAACGTCGTTTCTTGGACGGCGTTGATAGCGGGTTGTATTCGCAGTTGCAGGTTCGTTGAAGGGGCAAAGAATTTTAGTAGAATGCTTGAAGAAAACATAGCTCCTAATGAGATCACTTTACTAAGTTTGATAACAGAGTGTGGCTTTGTGGGAGCCTTGGATTTGGGCAAGTGGTTGCATGCCTATCTGTTAAGGAATGGGTTTGGGATGTCTCTGGCTTTGGCCACTGCTCTCATAGACATGTATGGAAAGTGTGGGCAAGTTGCATATGCTAGAGCTCTTTTCAATGGCGTCGAGGAAAAAGATGTCAAGATTTGGAGCGCTTTGATATCGGCTTACGCTCATGCGAGTTGCATCGATCAAGCGTTCAGCCTCTTCCTTAAGATGTTAGACAGTGAGGTGAAACCAAATAAGGTGACAATGGTTAGCCTGCTTTCTTTATGTGCAGAGGTTGGAGCCCTTGACCTTGGTAGATGGACTCATGCTTACATAAACCGTCATGGTGTCGAAGTAGACGTCGTTTTAGAAACAGCGCTCATCAACATGTATGCGAAATGTGGAGATCTAAAAACTGCTCGTTGCCTGTTCGATGAAGCCACACGACGAGATATTCACATGTGGAATGCAATGATGGCTGGATTCTCAATCCATGGTTGTGGAAAAGAAGCTTTAGAACTCTTTTCAGATATGGTGTGTCATGGTGTTGAACCTAATGACATCACATTCATTTCTGTTTTTCATGCTTGTAGTCATTCTGGATTGGTAGGGGAGGGAATGAAGCATTTCGACAGAATGGTTCATGAATTTGGAATAGTTCCAAAGATCGAACACTATGGATGCTTGGTAGATCTTCTTGGTCGAGCTAAACGTCTCGACGCAGCTCACAGCATCATCGAAAACATGCCCATGAGGCCCAACACAATTGTATGGGGTGCGCTGCTAGCTGCATGTAAGCTACATAAGAATCTGGCCTTGGGGGAGGTGGCTGCAAGAAAGATTCTTGAATTGGACCCAGAAAACTGTGGGTATAGAGTTCTTAAGTCAAACATCTATGCATCCGAAAAGAGATGGACCGATGTAACGAGCGTTAGAGAAACAATGAGCCATTTAGGGATGAAGAAAGAACCAGGACTCAGCTGGATTGAGGTAAATGGCTCAGTTCATCACTTCAGATCTGGAGATAAGACATGCACACAAACAAGAAAAGTACATGAAATGGTGACCGAAATGTGCATCAA
ATTGAGAGAGGCGGGATACGCACCGAACACATCTGCAGTTTTGTTAAACGTAGAAGACGAAGAGAAGGAATCTGCACTCAGTTACCATAGTGAGAAACTGGCCATGGCATTTGGACTCATAAGTACAGCCCCGGGTACGCCCATCCGAATCATTAAGAATCTGAGGATTTGCGATGACTGTCATGCTGCAACAAAACTATTATCGAAAATCTATGGACGAACAATAATAGTCAGAGATCGAAACCGATTTCATCACTTTAGTGAAGGATATTGTTCTTGTCTAGGCTACTGGTAA

Coding sequence (CDS)

ATGGACCAACTGATTCTCTCCGCCGCCTCTCCGTCAGGCTCCGGCCACTCCCATTTGAATCTCCAACAGACCCACCAAATCCATGCCCATTGCATCAAAACCCAATTCCGTAACCCTCACAGTTTCTTCTCTCGATCCCACTTCACCCCTGAAGCCAATTTCAATCTCCTCATTTCATCTTACACCGACAACCACCTCCCACAAGCGGCCTTCATCTTGTATCACCATATGCGCACAACTGATGCTGCTGCAGTTGACAACTTCATTGTTCCTTCACTTCTCAAAGCCTGTGCTCAAGCTTCCTCTACAAATTTCGGCAGGGAAGTGCACGGTTTCGCGGTTAAGAACGGGTTCGTATCGGACGTTTTTGTGTGCAATGCTTTGATGAACATGTATGAGAAATGTGGGAGTTTGGTTTCTGCTTGCTTGGTGTTTGATAAAATGCCTGACAGAGATGTTGTCTCTTGGAGTACTATGCTTGGGTGCTACGTGCGGAGCAAATCGTTCGGTGAAGCATATAGGCTCGTTCGAGAGATGCATTTTGTGGGAGTGAAGCTTAGTGATGTTGCTTTGATTAGCATGATTGGTGTATTTGGGGAGCTCTCGGATATGAAGTCGGGGAGGGCGATTCATGGTTACGTTGTGAGAAATGTTGGTAATGAGAGAATCGAACTTCCTTTAACAACTGCGTTGATTGATATGTATTGCAAGGGCGACAAATTGGCATCGGCAATGAGGCTTTTCGATGGGTTATCTCAGAGAAACGTCGTTTCTTGGACGGCGTTGATAGCGGGTTGTATTCGCAGTTGCAGGTTCGTTGAAGGGGCAAAGAATTTTAGTAGAATGCTTGAAGAAAACATAGCTCCTAATGAGATCACTTTACTAAGTTTGATAACAGAGTGTGGCTTTGTGGGAGCCTTGGATTTGGGCAAGTGGTTGCATGCCTATCTGTTAAGGAATGGGTTTGGGATGTCTCTGGCTTTGGCCACTGCTCTCATAGACATGTATGGAAAGTGTGGGCAAGTTGCATATGCTAGAGCTCTTTTCAATGGCGTCGAGGAAAAAGATGTCAAGATTTGGAGCGCTTTGATATCGGCTTACGCTCATGCGAGTTGCATCGATCAAGCGTTCAGCCTCTTCCTTAAGATGTTAGACAGTGAGGTGAAACCAAATAAGGTGACAATGGTTAGCCTGCTTTCTTTATGTGCAGAGGTTGGAGCCCTTGACCTTGGTAGATGGACTCATGCTTACATAAACCGTCATGGTGTCGAAGTAGACGTCGTTTTAGAAACAGCGCTCATCAACATGTATGCGAAATGTGGAGATCTAAAAACTGCTCGTTGCCTGTTCGATGAAGCCACACGACGAGATATTCACATGTGGAATGCAATGATGGCTGGATTCTCAATCCATGGTTGTGGAAAAGAAGCTTTAGAACTCTTTTCAGATATGGTGTGTCATGGTGTTGAACCTAATGACATCACATTCATTTCTGTTTTTCATGCTTGTAGTCATTCTGGATTGGTAGGGGAGGGAATGAAGCATTTCGACAGAATGGTTCATGAATTTGGAATAGTTCCAAAGATCGAACACTATGGATGCTTGGTAGATCTTCTTGGTCGAGCTAAACGTCTCGACGCAGCTCACAGCATCATCGAAAACATGCCCATGAGGCCCAACACAATTGTATGGGGTGCGCTGCTAGCTGCATGTAAGCTACATAAGAATCTGGCCTTGGGGGAGGTGGCTGCAAGAAAGATTCTTGAATTGGACCCAGAAAACTGTGGGTATAGAGTTCTTAAGTCAAACATCTATGCATCCGAAAAGAGATGGACCGATGTAACGAGCGTTAGAGAAACAATGAGCCATTTAGGGATGAAGAAAGAACCAGGACTCAGCTGGATTGAGGTAAATGGCTCAGTTCATCACTTCAGATCTGGAGATAAGACATGCACACAAACAAGAAAAGTACATGAAATGGTGACCGAAATGTGCATCAA
ATTGAGAGAGGCGGGATACGCACCGAACACATCTGCAGTTTTGTTAAACGTAGAAGACGAAGAGAAGGAATCTGCACTCAGTTACCATAGTGAGAAACTGGCCATGGCATTTGGACTCATAAGTACAGCCCCGGGTACGCCCATCCGAATCATTAAGAATCTGAGGATTTGCGATGACTGTCATGCTGCAACAAAACTATTATCGAAAATCTATGGACGAACAATAATAGTCAGAGATCGAAACCGATTTCATCACTTTAGTGAAGGATATTGTTCTTGTCTAGGCTACTGGTAA

Protein sequence

MDQLILSAASPSGSGHSHLNLQQTHQIHAHCIKTQFRNPHSFFSRSHFTPEANFNLLISSYTDNHLPQAAFILYHHMRTTDAAAVDNFIVPSLLKACAQASSTNFGREVHGFAVKNGFVSDVFVCNALMNMYEKCGSLVSACLVFDKMPDRDVVSWSTMLGCYVRSKSFGEAYRLVREMHFVGVKLSDVALISMIGVFGELSDMKSGRAIHGYVVRNVGNERIELPLTTALIDMYCKGDKLASAMRLFDGLSQRNVVSWTALIAGCIRSCRFVEGAKNFSRMLEENIAPNEITLLSLITECGFVGALDLGKWLHAYLLRNGFGMSLALATALIDMYGKCGQVAYARALFNGVEEKDVKIWSALISAYAHASCIDQAFSLFLKMLDSEVKPNKVTMVSLLSLCAEVGALDLGRWTHAYINRHGVEVDVVLETALINMYAKCGDLKTARCLFDEATRRDIHMWNAMMAGFSIHGCGKEALELFSDMVCHGVEPNDITFISVFHACSHSGLVGEGMKHFDRMVHEFGIVPKIEHYGCLVDLLGRAKRLDAAHSIIENMPMRPNTIVWGALLAACKLHKNLALGEVAARKILELDPENCGYRVLKSNIYASEKRWTDVTSVRETMSHLGMKKEPGLSWIEVNGSVHHFRSGDKTCTQTRKVHEMVTEMCIKLREAGYAPNTSAVLLNVEDEEKESALSYHSEKLAMAFGLISTAPGTPIRIIKNLRICDDCHAATKLLSKIYGRTIIVRDRNRFHHFSEGYCSCLGYW
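
The relationship between the coding sequence and the protein sequence above can be checked with a small translation routine. This is a minimal sketch assuming the standard nuclear genetic code; `translate`, `cds_start`, and `cds_end` are hypothetical names, and the two fragments are copied from the first and last bases of the CDS shown above.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna):
    """Translate a DNA string in frame 0; '*' marks a stop codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3   # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

cds_start = "ATGGACCAACTGATTCTCTCCGCCGCC"   # first 27 bases of the CDS
cds_end = "TGTCTAGGCTACTGGTAA"              # last 18 bases of the CDS

print(translate(cds_start))
print(translate(cds_end))
```

The first fragment should reproduce the start of the protein (MDQLILSAA) and the second its C-terminus (CLGYW) followed by the stop codon.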
Homology
BLAST of CmoCh10G010680 vs. ExPASy Swiss-Prot
Match: Q3E6Q1 (Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H40 PE=2 SV=1)

HSP 1 Score: 544.7 bits (1402), Expect = 1.7e-153
Identity = 266/672 (39.58%), Positives = 409/672 (60.86%), Query Frame = 0

Query: 93  LLKACAQASSTNFGREVHGFAVKNGFVSDVFVCNALMNMYEKCGSLVSACLVFDKMPDRD 152
           LLK C   +    G+E+HG  VK+GF  D+F    L NMY KC  +  A  VFD+MP+RD
Sbjct: 141 LLKVCGDEAELRVGKEIHGLLVKSGFSLDLFAMTGLENMYAKCRQVNEARKVFDRMPERD 200

Query: 153 VVSWSTMLGCYVRSKSFGEAYRLVREMHFVGVKLSDVALISMIGVFGELSDMKSGRAIHG 212
           +VSW+T++  Y ++     A  +V+ M    +K S + ++S++     L  +  G+ IHG
Sbjct: 201 LVSWNTIVAGYSQNGMARMALEMVKSMCEENLKPSFITIVSVLPAVSALRLISVGKEIHG 260

Query: 213 YVVRNVGNERIELPLTTALIDMYCKGDKLASAMRLFDGLSQRNVVSWTALIAGCIRSCRF 272
           Y +R+  +  +   ++TAL+DMY K   L +A +LFDG+ +RNVVSW ++I   +++   
Sbjct: 261 YAMRSGFDSLVN--ISTALVDMYAKCGSLETARQLFDGMLERNVVSWNSMIDAYVQNENP 320

Query: 273 VEGAKNFSRMLEENIAPNEITLLSLITECGFVGALDLGKWLHAYLLRNGFGMSLALATAL 332
            E    F +ML+E + P +++++  +  C  +G L+ G+++H   +  G   ++++  +L
Sbjct: 321 KEAMLIFQKMLDEGVKPTDVSVMGALHACADLGDLERGRFIHKLSVELGLDRNVSVVNSL 380

Query: 333 IDMYGKCGQVAYARALFNGVEEKDVKIWSALISAYAHASCIDQAFSLFLKMLDSEVKPNK 392
           I MY KC +V  A ++F  ++ + +  W+A+I  +A       A + F +M    VKP+ 
Sbjct: 381 ISMYCKCKEVDTAASMFGKLQSRTLVSWNAMILGFAQNGRPIDALNYFSQMRSRTVKPDT 440

Query: 393 VTMVSLLSLCAEVGALDLGRWTHAYINRHGVEVDVVLETALINMYAKCGDLKTARCLFDE 452
            T VS+++  AE+      +W H  + R  ++ +V + TAL++MYAKCG +  AR +FD 
Sbjct: 441 FTYVSVITAIAELSITHHAKWIHGVVMRSCLDKNVFVTTALVDMYAKCGAIMIARLIFDM 500

Query: 453 ATRRDIHMWNAMMAGFSIHGCGKEALELFSDMVCHGVEPNDITFISVFHACSHSGLVGEG 512
            + R +  WNAM+ G+  HG GK ALELF +M    ++PN +TF+SV  ACSHSGLV  G
Sbjct: 501 MSERHVTTWNAMIDGYGTHGFGKAALELFEEMQKGTIKPNGVTFLSVISACSHSGLVEAG 560

Query: 513 MKHFDRMVHEFGIVPKIEHYGCLVDLLGRAKRLDAAHSIIENMPMRPNTIVWGALLAACK 572
           +K F  M   + I   ++HYG +VDLLGRA RL+ A   I  MP++P   V+GA+L AC+
Sbjct: 561 LKCFYMMKENYSIELSMDHYGAMVDLLGRAGRLNEAWDFIMQMPVKPAVNVYGAMLGACQ 620

Query: 573 LHKNLALGEVAARKILELDPENCGYRVLKSNIYASEKRWTDVTSVRETMSHLGMKKEPGL 632
           +HKN+   E AA ++ EL+P++ GY VL +NIY +   W  V  VR +M   G++K PG 
Sbjct: 621 IHKNVNFAEKAAERLFELNPDDGGYHVLLANIYRAASMWEKVGQVRVSMLRQGLRKTPGC 680

Query: 633 SWIEVNGSVHHFRSGDKTCTQTRKVHEMVTEMCIKLREAGYAPNTSAVLLNVEDEEKESA 692
           S +E+   VH F SG      ++K++  + ++   ++EAGY P+T+ V L VE++ KE  
Sbjct: 681 SMVEIKNEVHSFFSGSTAHPDSKKIYAFLEKLICHIKEAGYVPDTNLV-LGVENDVKEQL 740

Query: 693 LSYHSEKLAMAFGLISTAPGTPIRIIKNLRICDDCHAATKLLSKIYGRTIIVRDRNRFHH 752
           LS HSEKLA++FGL++T  GT I + KNLR+C DCH ATK +S + GR I+VRD  RFHH
Sbjct: 741 LSTHSEKLAISFGLLNTTAGTTIHVRKNLRVCADCHNATKYISLVTGREIVVRDMQRFHH 800

Query: 753 FSEGYCSCLGYW 765
           F  G CSC  YW
Sbjct: 801 FKNGACSCGDYW 809
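
The percentages in each hit's summary line are simply the match counts divided by the alignment length. The sketch below parses one such line and recomputes them; `parse_hsp_stats` is a hypothetical helper, and the regular expression assumes the "label = matches/length" layout used on this page.

```python
import re

# Summary line of the first Swiss-Prot hit (Q3E6Q1).
STATS = "Identity = 266/672 (39.58%), Positives = 409/672 (60.86%), Query Frame = 0"

def parse_hsp_stats(line):
    """Return {label: (matches, aligned_length, percent)} recomputed from the counts."""
    stats = {}
    # Only "label = N/M" pairs match; "Query Frame = 0" has no slash and is skipped.
    for label, hits, total in re.findall(r"(\w+) = (\d+)/(\d+)", line):
        hits, total = int(hits), int(total)
        stats[label] = (hits, total, round(100 * hits / total, 2))
    return stats

stats = parse_hsp_stats(STATS)
for label, (hits, total, pct) in stats.items():
    print(f"{label}: {hits}/{total} = {pct}%")
```

The same function can be applied to the summary line of any other hit in this section to compare identity across the Arabidopsis and cucurbit matches.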

BLAST of CmoCh10G010680 vs. ExPASy Swiss-Prot
Match: O82380 (Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H33 PE=2 SV=1)

HSP 1 Score: 540.0 bits (1390), Expect = 4.2e-152
Identity = 293/770 (38.05%), Positives = 436/770 (56.62%), Query Frame = 0

Query: 19  LNLQQTHQIHAHCIKT-QFRNPHSF----------------FSRSHF--TPEAN---FNL 78
           ++L+Q  Q H H I+T  F +P+S                 ++R  F   P+ N   +N 
Sbjct: 41  VSLRQLKQTHGHMIRTGTFSDPYSASKLFAMAALSSFASLEYARKVFDEIPKPNSFAWNT 100

Query: 79  LISSYTDNHLPQAAFILYHHMRTTDAAAVDNFIVPSLLKACAQASSTNFGREVHGFAVKN 138
           LI +Y     P  +   +  M +      + +  P L+KA A+ SS + G+ +HG AVK+
Sbjct: 101 LIRAYASGPDPVLSIWAFLDMVSESQCYPNKYTFPFLIKAAAEVSSLSLGQSLHGMAVKS 160

Query: 139 GFVSDVFVCNALMNMYEKCGSLVSACLVFDKMPDRDVVSWSTMLGCYVRSKSFGEAYRLV 198
              SDVFV N+L++ Y  CG L SAC VF  + ++DVVSW++M+  +V+  S  +A  L 
Sbjct: 161 AVGSDVFVANSLIHCYFSCGDLDSACKVFTTIKEKDVVSWNSMINGFVQKGSPDKALELF 220

Query: 199 REMHFVGVKLSDVALISMIGVFGELSDMKSGRAIHGYVVRNVGNERIELPLTTALIDMYC 258
           ++M    VK S V ++ ++    ++ +++ GR +  Y+  N  N  + L L  A++DMY 
Sbjct: 221 KKMESEDVKASHVTMVGVLSACAKIRNLEFGRQVCSYIEENRVN--VNLTLANAMLDMYT 280

Query: 259 KGDKLASAMRLFDGLSQRNVVSWTALIAGCIRSCRFVEGAKNFSRMLEENIAPNEITLLS 318
           K   +  A RLFD + +++ V+WT ++ G                               
Sbjct: 281 KCGSIEDAKRLFDAMEEKDNVTWTTMLDG------------------------------- 340

Query: 319 LITECGFVGALDLGKWLHAYLLRNGFGMSLALATALIDMYGKCGQVAYARALFNGVEEKD 378
                                                  Y        AR + N + +KD
Sbjct: 341 ---------------------------------------YAISEDYEAAREVLNSMPQKD 400

Query: 379 VKIWSALISAYAHASCIDQAFSLFLKM-LDSEVKPNKVTMVSLLSLCAEVGALDLGRWTH 438
           +  W+ALISAY      ++A  +F ++ L   +K N++T+VS LS CA+VGAL+LGRW H
Sbjct: 401 IVAWNALISAYEQNGKPNEALIVFHELQLQKNMKLNQITLVSTLSACAQVGALELGRWIH 460

Query: 439 AYINRHGVEVDVVLETALINMYAKCGDLKTARCLFDEATRRDIHMWNAMMAGFSIHGCGK 498
           +YI +HG+ ++  + +ALI+MY+KCGDL+ +R +F+   +RD+ +W+AM+ G ++HGCG 
Sbjct: 461 SYIKKHGIRMNFHVTSALIHMYSKCGDLEKSREVFNSVEKRDVFVWSAMIGGLAMHGCGN 520

Query: 499 EALELFSDMVCHGVEPNDITFISVFHACSHSGLVGEGMKHFDRMVHEFGIVPKIEHYGCL 558
           EA+++F  M    V+PN +TF +VF ACSH+GLV E    F +M   +GIVP+ +HY C+
Sbjct: 521 EAVDMFYKMQEANVKPNGVTFTNVFCACSHTGLVDEAESLFHQMESNYGIVPEEKHYACI 580

Query: 559 VDLLGRAKRLDAAHSIIENMPMRPNTIVWGALLAACKLHKNLALGEVAARKILELDPENC 618
           VD+LGR+  L+ A   IE MP+ P+T VWGALL ACK+H NL L E+A  ++LEL+P N 
Sbjct: 581 VDVLGRSGYLEKAVKFIEAMPIPPSTSVWGALLGACKIHANLNLAEMACTRLLELEPRND 640

Query: 619 GYRVLKSNIYASEKRWTDVTSVRETMSHLGMKKEPGLSWIEVNGSVHHFRSGDKTCTQTR 678
           G  VL SNIYA   +W +V+ +R+ M   G+KKEPG S IE++G +H F SGD     + 
Sbjct: 641 GAHVLLSNIYAKLGKWENVSELRKHMRVTGLKKEPGCSSIEIDGMIHEFLSGDNAHPMSE 700

Query: 679 KVHEMVTEMCIKLREAGYAPNTSAVLLNVEDEE-KESALSYHSEKLAMAFGLISTAPGTP 738
           KV+  + E+  KL+  GY P  S VL  +E+EE KE +L+ HSEKLA+ +GLIST     
Sbjct: 701 KVYGKLHEVMEKLKSNGYEPEISQVLQIIEEEEMKEQSLNLHSEKLAICYGLISTEAPKV 738

Query: 739 IRIIKNLRICDDCHAATKLLSKIYGRTIIVRDRNRFHHFSEGYCSCLGYW 765
           IR+IKNLR+C DCH+  KL+S++Y R IIVRDR RFHHF  G CSC  +W
Sbjct: 761 IRVIKNLRVCGDCHSVAKLISQLYDREIIVRDRYRFHHFRNGQCSCNDFW 738

BLAST of CmoCh10G010680 vs. ExPASy Swiss-Prot
Match: Q9LUJ2 (Pentatricopeptide repeat-containing protein At3g22690 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H56 PE=3 SV=1)

HSP 1 Score: 533.9 bits (1374), Expect = 3.0e-150
Identity = 281/747 (37.62%), Positives = 431/747 (57.70%), Query Frame = 0

Query: 54  FNLLISSYTDNHLPQAAFILYHHMRTTDAAAVDNFIVPSLLKACAQASSTNFGREVHGFA 113
           +N LI  Y  + L   A +L+  M  +   + D +  P  L ACA++ +   G ++HG  
Sbjct: 102 YNSLIRGYASSGLCNEAILLFLRMMNS-GISPDKYTFPFGLSACAKSRAKGNGIQIHGLI 161

Query: 114 VKNGFVSDVFVCNALMNMYEKCGSLVSACLVFDKMPDRDVVSWSTMLGCYVRSKSFGEA- 173
           VK G+  D+FV N+L++ Y +CG L SA  VFD+M +R+VVSW++M+  Y R     +A 
Sbjct: 162 VKMGYAKDLFVQNSLVHFYAECGELDSARKVFDEMSERNVVSWTSMICGYARRDFAKDAV 221

Query: 174 ---YRLVREMHFVGVKLSDVALISMIGVFGELSDMKSGRAIHGYVVRNVGNERIELPLTT 233
              +R+VR+     V  + V ++ +I    +L D+++G  ++ + +RN G E  +L + +
Sbjct: 222 DLFFRMVRDEE---VTPNSVTMVCVISACAKLEDLETGEKVYAF-IRNSGIEVNDL-MVS 281

Query: 234 ALIDMYCKGDKLASAMRLFDGLSQRNVVSWTALIAGCIRSCRFVEGAKNFSRMLEENIAP 293
           AL+DMY K + +  A RLFD     N+    A+ +  +R     E    F+ M++  + P
Sbjct: 282 ALVDMYMKCNAIDVAKRLFDEYGASNLDLCNAMASNYVRQGLTREALGVFNLMMDSGVRP 341

Query: 294 NEITLLSLITECGFVGALDLGKWLHAYLLRNGFGMSLALATALIDMYGKC---------- 353
           + I++LS I+ C  +  +  GK  H Y+LRNGF     +  ALIDMY KC          
Sbjct: 342 DRISMLSAISSCSQLRNILWGKSCHGYVLRNGFESWDNICNALIDMYMKCHRQDTAFRIF 401

Query: 354 ---------------------GQVAYARALFNGVEEKDVKIWSALISAYAHASCIDQAFS 413
                                G+V  A   F  + EK++  W+ +IS     S  ++A  
Sbjct: 402 DRMSNKTVVTWNSIVAGYVENGEVDAAWETFETMPEKNIVSWNTIISGLVQGSLFEEAIE 461

Query: 414 LFLKMLDSE-VKPNKVTMVSLLSLCAEVGALDLGRWTHAYINRHGVEVDVVLETALINMY 473
           +F  M   E V  + VTM+S+ S C  +GALDL +W + YI ++G+++DV L T L++M+
Sbjct: 462 VFCSMQSQEGVNADGVTMMSIASACGHLGALDLAKWIYYYIEKNGIQLDVRLGTTLVDMF 521

Query: 474 AKCGDLKTARCLFDEATRRDIHMWNAMMAGFSIHGCGKEALELFSDMVCHGVEPNDITFI 533
           ++CGD ++A  +F+  T RD+  W A +   ++ G  + A+ELF DM+  G++P+ + F+
Sbjct: 522 SRCGDPESAMSIFNSLTNRDVSAWTAAIGAMAMAGNAERAIELFDDMIEQGLKPDGVAFV 581

Query: 534 SVFHACSHSGLVGEGMKHFDRMVHEFGIVPKIEHYGCLVDLLGRAKRLDAAHSIIENMPM 593
               ACSH GLV +G + F  M+   G+ P+  HYGC+VDLLGRA  L+ A  +IE+MPM
Sbjct: 582 GALTACSHGGLVQQGKEIFYSMLKLHGVSPEDVHYGCMVDLLGRAGLLEEAVQLIEDMPM 641

Query: 594 RPNTIVWGALLAACKLHKNLALGEVAARKILELDPENCGYRVLKSNIYASEKRWTDVTSV 653
            PN ++W +LLAAC++  N+ +   AA KI  L PE  G  VL SN+YAS  RW D+  V
Sbjct: 642 EPNDVIWNSLLAACRVQGNVEMAAYAAEKIQVLAPERTGSYVLLSNVYASAGRWNDMAKV 701

Query: 654 RETMSHLGMKKEPGLSWIEVNGSVHHFRSGDKTCTQTRKVHEMVTEMCIKLREAGYAPNT 713
           R +M   G++K PG S I++ G  H F SGD++  +   +  M+ E+  +    G+ P+ 
Sbjct: 702 RLSMKEKGLRKPPGTSSIQIRGKTHEFTSGDESHPEMPNIEAMLDEVSQRASHLGHVPDL 761

Query: 714 SAVLLNVEDEEKESALSYHSEKLAMAFGLISTAPGTPIRIIKNLRICDDCHAATKLLSKI 765
           S VL++V+++EK   LS HSEKLAMA+GLIS+  GT IRI+KNLR+C DCH+  K  SK+
Sbjct: 762 SNVLMDVDEKEKIFMLSRHSEKLAMAYGLISSNKGTTIRIVKNLRVCSDCHSFAKFASKV 821

BLAST of CmoCh10G010680 vs. ExPASy Swiss-Prot
Match: Q9LN01 (Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H12 PE=2 SV=1)

HSP 1 Score: 531.2 bits (1367), Expect = 2.0e-149
Identity = 291/774 (37.60%), Positives = 428/774 (55.30%), Query Frame = 0

Query: 16  HSHLNLQQTHQIHAHCIKTQFRNPHSFFSR--------SHF------------TPEAN-- 75
           H+   LQ    IHA  IK    N +   S+         HF              E N  
Sbjct: 41  HNCKTLQSLRIIHAQMIKIGLHNTNYALSKLIEFCILSPHFEGLPYAISVFKTIQEPNLL 100

Query: 76  -FNLLISSYTDNHLPQAAFILYHHMRTTDAAAVDNFIVPSLLKACAQASSTNFGREVHGF 135
            +N +   +  +  P +A  LY  M +      +++  P +LK+CA++ +   G+++HG 
Sbjct: 101 IWNTMFRGHALSSDPVSALKLYVCMISL-GLLPNSYTFPFVLKSCAKSKAFKEGQQIHGH 160

Query: 136 AVKNGFVSDVFVCNALMNMYEKCGSLVSACLVFDKMPDRDVVSWSTMLGCYVRSKSFGEA 195
            +K G   D++V  +L++MY + G L  A  VFDK P RDVVS+                
Sbjct: 161 VLKLGCDLDLYVHTSLISMYVQNGRLEDAHKVFDKSPHRDVVSY---------------- 220

Query: 196 YRLVREMHFVGVKLSDVALISMIGVFGELSDMKSGRAIHGYVVRNVGNERIELPLTTALI 255
                                                                   TALI
Sbjct: 221 --------------------------------------------------------TALI 280

Query: 256 DMYCKGDKLASAMRLFDGLSQRNVVSWTALIAGCIRSCRFVEGAKNFSRMLEENIAPNEI 315
             Y     + +A +LFD +  ++VVSW A+I+G   +  + E  + F  M++ N+ P+E 
Sbjct: 281 KGYASRGYIENAQKLFDEIPVKDVVSWNAMISGYAETGNYKEALELFKDMMKTNVRPDES 340

Query: 316 TLLSLITECGFVGALDLGKWLHAYLLRNGFGMSLALATALIDMYGKCGQVAYARALFNGV 375
           T++++++ C   G+++LG+ +H ++  +GFG +L +  ALID+Y KCG++  A  LF  +
Sbjct: 341 TMVTVVSACAQSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSKCGELETACGLFERL 400

Query: 376 EEKDVKIWSALISAYAHASCIDQAFSLFLKMLDSEVKPNKVTMVSLLSLCAEVGALDLGR 435
             KDV  W+ LI  Y H +   +A  LF +ML S   PN VTM+S+L  CA +GA+D+GR
Sbjct: 401 PYKDVISWNTLIGGYTHMNLYKEALLLFQEMLRSGETPNDVTMLSILPACAHLGAIDIGR 460

Query: 436 WTHAYINRH--GVEVDVVLETALINMYAKCGDLKTARCLFDEATRRDIHMWNAMMAGFSI 495
           W H YI++   GV     L T+LI+MYAKCGD++ A  +F+    + +  WNAM+ GF++
Sbjct: 461 WIHVYIDKRLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSILHKSLSSWNAMIFGFAM 520

Query: 496 HGCGKEALELFSDMVCHGVEPNDITFISVFHACSHSGLVGEGMKHFDRMVHEFGIVPKIE 555
           HG    + +LFS M   G++P+DITF+ +  ACSHSG++  G   F  M  ++ + PK+E
Sbjct: 521 HGRADASFDLFSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRHIFRTMTQDYKMTPKLE 580

Query: 556 HYGCLVDLLGRAKRLDAAHSIIENMPMRPNTIVWGALLAACKLHKNLALGEVAARKILEL 615
           HYGC++DLLG +     A  +I  M M P+ ++W +LL ACK+H N+ LGE  A  ++++
Sbjct: 581 HYGCMIDLLGHSGLFKEAEEMINMMEMEPDGVIWCSLLKACKMHGNVELGESFAENLIKI 640

Query: 616 DPENCGYRVLKSNIYASEKRWTDVTSVRETMSHLGMKKEPGLSWIEVNGSVHHFRSGDKT 675
           +PEN G  VL SNIYAS  RW +V   R  ++  GMKK PG S IE++  VH F  GDK 
Sbjct: 641 EPENPGSYVLLSNIYASAGRWNEVAKTRALLNDKGMKKVPGCSSIEIDSVVHEFIIGDKF 700

Query: 676 CTQTRKVHEMVTEMCIKLREAGYAPNTSAVLLNVEDEEKESALSYHSEKLAMAFGLISTA 735
             + R+++ M+ EM + L +AG+ P+TS VL  +E+E KE AL +HSEKLA+AFGLIST 
Sbjct: 701 HPRNREIYGMLEEMEVLLEKAGFVPDTSEVLQEMEEEWKEGALRHHSEKLAIAFGLISTK 741

Query: 736 PGTPIRIIKNLRICDDCHAATKLLSKIYGRTIIVRDRNRFHHFSEGYCSCLGYW 765
           PGT + I+KNLR+C +CH ATKL+SKIY R II RDR RFHHF +G CSC  YW
Sbjct: 761 PGTKLTIVKNLRVCRNCHEATKLISKIYKREIIARDRTRFHHFRDGVCSCNDYW 741

BLAST of CmoCh10G010680 vs. ExPASy Swiss-Prot
Match: Q9SN39 (Pentatricopeptide repeat-containing protein DOT4, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=DOT4 PE=2 SV=1)

HSP 1 Score: 527.7 bits (1358), Expect = 2.2e-148
Identity = 264/681 (38.77%), Positives = 407/681 (59.77%), Query Frame = 0

Query: 85  VDNFIVPSLLKACAQASSTNFGREVHGFAVKNGFVSDVFVCNALMNMYEKCGSLVSACLV 144
           +D++    + K+ +   S + G ++HGF +K+GF     V N+L+  Y K   + SA  V
Sbjct: 193 MDSYTFSCVSKSFSSLRSVHGGEQLHGFILKSGFGERNSVGNSLVAFYLKNQRVDSARKV 252

Query: 145 FDKMPDRDVVSWSTMLGCYVRSKSFGEAYRLVREMHFVGVKLSDVALISMIGVFGELSDM 204
           FD+M +RDV+SW++++  YV +    +   +  +M   G+++    ++S+     +   +
Sbjct: 253 FDEMTERDVISWNSIINGYVSNGLAEKGLSVFVQMLVSGIEIDLATIVSVFAGCADSRLI 312

Query: 205 KSGRAIHGYVVRNVGNERIELPLTTALIDMYCKGDKLASAMRLFDGLSQRNVVSWTALIA 264
             GRA+H   V+   +   E      L+DMY K   L SA  +F  +S R+VVS+T++IA
Sbjct: 313 SLGRAVHSIGVKACFSR--EDRFCNTLLDMYSKCGDLDSAKAVFREMSDRSVVSYTSMIA 372

Query: 265 GCIRSCRFVEGAKNFSRMLEENIAPNEITLLSLITECGFVGALDLGKWLHAYLLRNGFGM 324
           G  R     E  K F  M EE I+P+  T+ +++  C     LD GK +H ++  N  G 
Sbjct: 373 GYAREGLAGEAVKLFEEMEEEGISPDVYTVTAVLNCCARYRLLDEGKRVHEWIKENDLGF 432

Query: 325 SLALATALIDMYGKCGQVAYARALFNGVEEKDVKIWSALISAYAHASCIDQAFSLFLKML 384
            + ++ AL+DMY KCG +  A  +F+ +  KD+  W+ +I  Y+     ++A SLF  +L
Sbjct: 433 DIFVSNALMDMYAKCGSMQEAELVFSEMRVKDIISWNTIIGGYSKNCYANEALSLFNLLL 492

Query: 385 DSE-VKPNKVTMVSLLSLCAEVGALDLGRWTHAYINRHGVEVDVVLETALINMYAKCGDL 444
           + +   P++ T+  +L  CA + A D GR  H YI R+G   D  +  +L++MYAKCG L
Sbjct: 493 EEKRFSPDERTVACVLPACASLSAFDKGREIHGYIMRNGYFSDRHVANSLVDMYAKCGAL 552

Query: 445 KTARCLFDEATRRDIHMWNAMMAGFSIHGCGKEALELFSDMVCHGVEPNDITFISVFHAC 504
             A  LFD+   +D+  W  M+AG+ +HG GKEA+ LF+ M   G+E ++I+F+S+ +AC
Sbjct: 553 LLAHMLFDDIASKDLVSWTVMIAGYGMHGFGKEAIALFNQMRQAGIEADEISFVSLLYAC 612

Query: 505 SHSGLVGEGMKHFDRMVHEFGIVPKIEHYGCLVDLLGRAKRLDAAHSIIENMPMRPNTIV 564
           SHSGLV EG + F+ M HE  I P +EHY C+VD+L R   L  A+  IENMP+ P+  +
Sbjct: 613 SHSGLVDEGWRFFNIMRHECKIEPTVEHYACIVDMLARTGDLIKAYRFIENMPIPPDATI 672

Query: 565 WGALLAACKLHKNLALGEVAARKILELDPENCGYRVLKSNIYASEKRWTDVTSVRETMSH 624
           WGALL  C++H ++ L E  A K+ EL+PEN GY VL +NIYA  ++W  V  +R+ +  
Sbjct: 673 WGALLCGCRIHHDVKLAEKVAEKVFELEPENTGYYVLMANIYAEAEKWEQVKRLRKRIGQ 732

Query: 625 LGMKKEPGLSWIEVNGSVHHFRSGDKTCTQTRKVHEMVTEMCIKLREAGYAPNTSAVLLN 684
            G++K PG SWIE+ G V+ F +GD +  +T  +   + ++  ++ E GY+P T   L++
Sbjct: 733 RGLRKNPGCSWIEIKGRVNIFVAGDSSNPETENIEAFLRKVRARMIEEGYSPLTKYALID 792

Query: 685 VEDEEKESALSYHSEKLAMAFGLISTAPGTPIRIIKNLRICDDCHAATKLLSKIYGRTII 744
            E+ EKE AL  HSEKLAMA G+IS+  G  IR+ KNLR+C DCH   K +SK+  R I+
Sbjct: 793 AEEMEKEEALCGHSEKLAMALGIISSGHGKIIRVTKNLRVCGDCHEMAKFMSKLTRREIV 852

Query: 745 VRDRNRFHHFSEGYCSCLGYW 765
           +RD NRFH F +G+CSC G+W
Sbjct: 853 LRDSNRFHQFKDGHCSCRGFW 871

BLAST of CmoCh10G010680 vs. ExPASy TrEMBL
Match: A0A6J1HA74 (pentatricopeptide repeat-containing protein At3g26782, mitochondrial-like OS=Cucurbita moschata OX=3662 GN=LOC111461539 PE=3 SV=1)

HSP 1 Score: 1563.9 bits (4048), Expect = 0.0e+00
Identity = 764/764 (100.00%), Positives = 764/764 (100.00%), Query Frame = 0

Query: 1   MDQLILSAASPSGSGHSHLNLQQTHQIHAHCIKTQFRNPHSFFSRSHFTPEANFNLLISS 60
           MDQLILSAASPSGSGHSHLNLQQTHQIHAHCIKTQFRNPHSFFSRSHFTPEANFNLLISS
Sbjct: 1   MDQLILSAASPSGSGHSHLNLQQTHQIHAHCIKTQFRNPHSFFSRSHFTPEANFNLLISS 60

Query: 61  YTDNHLPQAAFILYHHMRTTDAAAVDNFIVPSLLKACAQASSTNFGREVHGFAVKNGFVS 120
           YTDNHLPQAAFILYHHMRTTDAAAVDNFIVPSLLKACAQASSTNFGREVHGFAVKNGFVS
Sbjct: 61  YTDNHLPQAAFILYHHMRTTDAAAVDNFIVPSLLKACAQASSTNFGREVHGFAVKNGFVS 120

Query: 121 DVFVCNALMNMYEKCGSLVSACLVFDKMPDRDVVSWSTMLGCYVRSKSFGEAYRLVREMH 180
           DVFVCNALMNMYEKCGSLVSACLVFDKMPDRDVVSWSTMLGCYVRSKSFGEAYRLVREMH
Sbjct: 121 DVFVCNALMNMYEKCGSLVSACLVFDKMPDRDVVSWSTMLGCYVRSKSFGEAYRLVREMH 180

Query: 181 FVGVKLSDVALISMIGVFGELSDMKSGRAIHGYVVRNVGNERIELPLTTALIDMYCKGDK 240
           FVGVKLSDVALISMIGVFGELSDMKSGRAIHGYVVRNVGNERIELPLTTALIDMYCKGDK
Sbjct: 181 FVGVKLSDVALISMIGVFGELSDMKSGRAIHGYVVRNVGNERIELPLTTALIDMYCKGDK 240

Query: 241 LASAMRLFDGLSQRNVVSWTALIAGCIRSCRFVEGAKNFSRMLEENIAPNEITLLSLITE 300
           LASAMRLFDGLSQRNVVSWTALIAGCIRSCRFVEGAKNFSRMLEENIAPNEITLLSLITE
Sbjct: 241 LASAMRLFDGLSQRNVVSWTALIAGCIRSCRFVEGAKNFSRMLEENIAPNEITLLSLITE 300

Query: 301 CGFVGALDLGKWLHAYLLRNGFGMSLALATALIDMYGKCGQVAYARALFNGVEEKDVKIW 360
           CGFVGALDLGKWLHAYLLRNGFGMSLALATALIDMYGKCGQVAYARALFNGVEEKDVKIW
Sbjct: 301 CGFVGALDLGKWLHAYLLRNGFGMSLALATALIDMYGKCGQVAYARALFNGVEEKDVKIW 360

Query: 361 SALISAYAHASCIDQAFSLFLKMLDSEVKPNKVTMVSLLSLCAEVGALDLGRWTHAYINR 420
           SALISAYAHASCIDQAFSLFLKMLDSEVKPNKVTMVSLLSLCAEVGALDLGRWTHAYINR
Sbjct: 361 SALISAYAHASCIDQAFSLFLKMLDSEVKPNKVTMVSLLSLCAEVGALDLGRWTHAYINR 420

Query: 421 HGVEVDVVLETALINMYAKCGDLKTARCLFDEATRRDIHMWNAMMAGFSIHGCGKEALEL 480
           HGVEVDVVLETALINMYAKCGDLKTARCLFDEATRRDIHMWNAMMAGFSIHGCGKEALEL
Sbjct: 421 HGVEVDVVLETALINMYAKCGDLKTARCLFDEATRRDIHMWNAMMAGFSIHGCGKEALEL 480

Query: 481 FSDMVCHGVEPNDITFISVFHACSHSGLVGEGMKHFDRMVHEFGIVPKIEHYGCLVDLLG 540
           FSDMVCHGVEPNDITFISVFHACSHSGLVGEGMKHFDRMVHEFGIVPKIEHYGCLVDLLG
Sbjct: 481 FSDMVCHGVEPNDITFISVFHACSHSGLVGEGMKHFDRMVHEFGIVPKIEHYGCLVDLLG 540

Query: 541 RAKRLDAAHSIIENMPMRPNTIVWGALLAACKLHKNLALGEVAARKILELDPENCGYRVL 600
           RAKRLDAAHSIIENMPMRPNTIVWGALLAACKLHKNLALGEVAARKILELDPENCGYRVL
Sbjct: 541 RAKRLDAAHSIIENMPMRPNTIVWGALLAACKLHKNLALGEVAARKILELDPENCGYRVL 600

Query: 601 KSNIYASEKRWTDVTSVRETMSHLGMKKEPGLSWIEVNGSVHHFRSGDKTCTQTRKVHEM 660
           KSNIYASEKRWTDVTSVRETMSHLGMKKEPGLSWIEVNGSVHHFRSGDKTCTQTRKVHEM
Sbjct: 601 KSNIYASEKRWTDVTSVRETMSHLGMKKEPGLSWIEVNGSVHHFRSGDKTCTQTRKVHEM 660

Query: 661 VTEMCIKLREAGYAPNTSAVLLNVEDEEKESALSYHSEKLAMAFGLISTAPGTPIRIIKN 720
           VTEMCIKLREAGYAPNTSAVLLNVEDEEKESALSYHSEKLAMAFGLISTAPGTPIRIIKN
Sbjct: 661 VTEMCIKLREAGYAPNTSAVLLNVEDEEKESALSYHSEKLAMAFGLISTAPGTPIRIIKN 720

Query: 721 LRICDDCHAATKLLSKIYGRTIIVRDRNRFHHFSEGYCSCLGYW 765
           LRICDDCHAATKLLSKIYGRTIIVRDRNRFHHFSEGYCSCLGYW
Sbjct: 721 LRICDDCHAATKLLSKIYGRTIIVRDRNRFHHFSEGYCSCLGYW 764

BLAST of CmoCh10G010680 vs. ExPASy TrEMBL
Match: A0A6J1JKG9 (pentatricopeptide repeat-containing protein At4g21065-like OS=Cucurbita maxima OX=3661 GN=LOC111485395 PE=3 SV=1)

HSP 1 Score: 1505.3 bits (3896), Expect = 0.0e+00
Identity = 734/764 (96.07%), Positives = 746/764 (97.64%), Query Frame = 0

Query: 1   MDQLILSAASPSGSGHSHLNLQQTHQIHAHCIKTQFRNPHSFFSRSHFTPEANFNLLISS 60
           MDQLILSAASPSGSGHSHLNLQQTHQIHAH IKTQFRNPH+FFSRS+FTPEANFNLLISS
Sbjct: 1   MDQLILSAASPSGSGHSHLNLQQTHQIHAHFIKTQFRNPHNFFSRSNFTPEANFNLLISS 60

Query: 61  YTDNHLPQAAFILYHHMRTTDAAAVDNFIVPSLLKACAQASSTNFGREVHGFAVKNGFVS 120
           YTDNH PQAAF LYHHMRTTDAAAVDNFIVPSLLKACAQASSTN GREVHGFAVKNGFVS
Sbjct: 61  YTDNHRPQAAFNLYHHMRTTDAAAVDNFIVPSLLKACAQASSTNLGREVHGFAVKNGFVS 120

Query: 121 DVFVCNALMNMYEKCGSLVSACLVFDKMPDRDVVSWSTMLGCYVRSKSFGEAYRLVREMH 180
           DVFVCNALMNMYEKCGSLVSACLVFDKMPDRDVVSWSTMLGCYVRSKSFGEAYRLVREMH
Sbjct: 121 DVFVCNALMNMYEKCGSLVSACLVFDKMPDRDVVSWSTMLGCYVRSKSFGEAYRLVREMH 180

Query: 181 FVGVKLSDVALISMIGVFGELSDMKSGRAIHGYVVRNVGNERIELPLTTALIDMYCKGDK 240
           FVGVKLSDVALISMIGVFGELSDMKSGRAIHGYVVRNVG ERIELPLTTALIDMYCKGD 
Sbjct: 181 FVGVKLSDVALISMIGVFGELSDMKSGRAIHGYVVRNVGKERIELPLTTALIDMYCKGDN 240

Query: 241 LASAMRLFDGLSQRNVVSWTALIAGCIRSCRFVEGAKNFSRMLEENIAPNEITLLSLITE 300
           LASAMRLF+GLSQRNVVSWTALIAGCIRSCRFVEGAKNFSRMLEENI PNEITLLSLITE
Sbjct: 241 LASAMRLFNGLSQRNVVSWTALIAGCIRSCRFVEGAKNFSRMLEENIVPNEITLLSLITE 300

Query: 301 CGFVGALDLGKWLHAYLLRNGFGMSLALATALIDMYGKCGQVAYARALFNGVEEKDVKIW 360
           CGFVGALDLGKWLH+YLLRNGFGMSL L TALIDMYGKCGQVAYARALFN V+EKDVKIW
Sbjct: 301 CGFVGALDLGKWLHSYLLRNGFGMSLTLTTALIDMYGKCGQVAYARALFNVVDEKDVKIW 360

Query: 361 SALISAYAHASCIDQAFSLFLKMLDSEVKPNKVTMVSLLSLCAEVGALDLGRWTHAYINR 420
           SALISAYAH SCIDQAFSLFLKMLDSEVKPNKVTMVSLLSLCAEVGALDLGRWTHAYI  
Sbjct: 361 SALISAYAHTSCIDQAFSLFLKMLDSEVKPNKVTMVSLLSLCAEVGALDLGRWTHAYIIH 420

Query: 421 HGVEVDVVLETALINMYAKCGDLKTARCLFDEATRRDIHMWNAMMAGFSIHGCGKEALEL 480
           HGVEVD+VLETALINMYAKCGDLKTAR LFDEAT+RDIHMWNAMMAGFSIHGCGKEALEL
Sbjct: 421 HGVEVDIVLETALINMYAKCGDLKTARSLFDEATQRDIHMWNAMMAGFSIHGCGKEALEL 480

Query: 481 FSDMVCHGVEPNDITFISVFHACSHSGLVGEGMKHFDRMVHEFGIVPKIEHYGCLVDLLG 540
           FSDM CHGVEPNDITFISVFHACSHSGLVGEGMKHFDRMVHEFGIVPKIEHYGCLVDLLG
Sbjct: 481 FSDMECHGVEPNDITFISVFHACSHSGLVGEGMKHFDRMVHEFGIVPKIEHYGCLVDLLG 540

Query: 541 RAKRLDAAHSIIENMPMRPNTIVWGALLAACKLHKNLALGEVAARKILELDPENCGYRVL 600
           RAKRLDAAHSIIENMPMRPNTI+WGALLAACKLHKNL LG+VAARKILELDPENCGYRVL
Sbjct: 541 RAKRLDAAHSIIENMPMRPNTIIWGALLAACKLHKNLPLGKVAARKILELDPENCGYRVL 600

Query: 601 KSNIYASEKRWTDVTSVRETMSHLGMKKEPGLSWIEVNGSVHHFRSGDKTCTQTRKVHEM 660
           KSNIYASEKRWT+VTS+RE+MSHLGMKKEPGLSW EVNGSVHHFRSGDKTCTQ RKVHEM
Sbjct: 601 KSNIYASEKRWTNVTSIRESMSHLGMKKEPGLSWTEVNGSVHHFRSGDKTCTQARKVHEM 660

Query: 661 VTEMCIKLREAGYAPNTSAVLLNVEDEEKESALSYHSEKLAMAFGLISTAPGTPIRIIKN 720
           VTEMCIKLREAGYAPNTSAVLLNVEDEEKESALSYHSEKLAMAFGLISTAPGTPIRIIKN
Sbjct: 661 VTEMCIKLREAGYAPNTSAVLLNVEDEEKESALSYHSEKLAMAFGLISTAPGTPIRIIKN 720

Query: 721 LRICDDCHAATKLLSKIYGRTIIVRDRNRFHHFSEGYCSCLGYW 765
           LRICDDCHAATKLLSKIYGRTIIVRDRNRFHHFSEGYCSCLGYW
Sbjct: 721 LRICDDCHAATKLLSKIYGRTIIVRDRNRFHHFSEGYCSCLGYW 764

BLAST of CmoCh10G010680 vs. ExPASy TrEMBL
Match: A0A5A7V2V9 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold403G001350 PE=3 SV=1)

HSP 1 Score: 1346.6 bits (3484), Expect = 0.0e+00
Identity = 651/766 (84.99%), Positives = 703/766 (91.78%), Query Frame = 0

Query: 1   MDQLILSAASPSG-SGHSHLNLQQTHQIHAHCIKTQFRNPHSFFSRSHFTPEANFNLLIS 60
           M+QLILS+ S SG SG+SHLNLQQTHQ+HAH IKTQF NPH FFS+SHFTPEAN+NLLIS
Sbjct: 1   MNQLILSSPSSSGCSGYSHLNLQQTHQLHAHFIKTQFHNPHPFFSQSHFTPEANYNLLIS 60

Query: 61  SYTDNHLPQAAFILYHHMRTTD-AAAVDNFIVPSLLKACAQASSTNFGREVHGFAVKNGF 120
           SYT+NHLPQA+   Y HMRT D AAA+DNFI+PSLLKACAQASS + GRE+HGFA KNGF
Sbjct: 61  SYTNNHLPQASLNCYLHMRTNDAAAALDNFILPSLLKACAQASSADLGRELHGFAQKNGF 120

Query: 121 VSDVFVCNALMNMYEKCGSLVSACLVFDKMPDRDVVSWSTMLGCYVRSKSFGEAYRLVRE 180
            SDVFVCNALMNMYEKCG LVSA LVFDKMP+RDVVSWSTMLGCYVRSK+FGEA RLVRE
Sbjct: 121 ASDVFVCNALMNMYEKCGCLVSASLVFDKMPERDVVSWSTMLGCYVRSKAFGEALRLVRE 180

Query: 181 MHFVGVKLSDVALISMIGVFGELSDMKSGRAIHGYVVRNVGNERIELPLTTALIDMYCKG 240
           M FVGVKLS VALIS+IGVFG L DMKSGRA+HGY+VRNVG+E++E+ LTTALIDMYCK 
Sbjct: 181 MQFVGVKLSGVALISLIGVFGNLLDMKSGRAVHGYIVRNVGDEKMEVSLTTALIDMYCKC 240

Query: 241 DKLASAMRLFDGLSQRNVVSWTALIAGCIRSCRFVEGAKNFSRMLEENIAPNEITLLSLI 300
           + LASA RLFD LS+R+VVSWT +I GCIRSCR VEGAKNF+RMLEE + PNEITLLSLI
Sbjct: 241 ECLASAQRLFDRLSKRSVVSWTVMIKGCIRSCRLVEGAKNFNRMLEEKLFPNEITLLSLI 300

Query: 301 TECGFVGALDLGKWLHAYLLRNGFGMSLALATALIDMYGKCGQVAYARALFNGVEEKDVK 360
           TECGFV  LDLGKW HAYLLRNGFGMSLAL TALIDMYGKCGQV YARALFNGVE+KDVK
Sbjct: 301 TECGFVKTLDLGKWFHAYLLRNGFGMSLALVTALIDMYGKCGQVGYARALFNGVEKKDVK 360

Query: 361 IWSALISAYAHASCIDQAFSLFLKMLDSEVKPNKVTMVSLLSLCAEVGALDLGRWTHAYI 420
           IWSALISAYAH SC+DQ F+LFL+MLD+EVKPNKVTMVSLLSLCAE G LDLG+WTHAYI
Sbjct: 361 IWSALISAYAHVSCMDQVFNLFLEMLDNEVKPNKVTMVSLLSLCAEAGTLDLGKWTHAYI 420

Query: 421 NRHGVEVDVVLETALINMYAKCGDLKTARCLFDEATRRDIHMWNAMMAGFSIHGCGKEAL 480
           NRHG+EVDV+LETALINMY KCGD+  AR LFDEAT+RDIHMWNAMMAGFS+HGCGKEAL
Sbjct: 421 NRHGLEVDVILETALINMYVKCGDVTIARSLFDEATQRDIHMWNAMMAGFSMHGCGKEAL 480

Query: 481 ELFSDMVCHGVEPNDITFISVFHACSHSGLVGEGMKHFDRMVHEFGIVPKIEHYGCLVDL 540
           ELFS+M  HGVEPNDITFIS+FHACSHSGLV EG KHF+RMVH FGIVPK+EHYGCLVDL
Sbjct: 481 ELFSEMESHGVEPNDITFISIFHACSHSGLVVEGKKHFNRMVHSFGIVPKMEHYGCLVDL 540

Query: 541 LGRAKRLDAAHSIIENMPMRPNTIVWGALLAACKLHKNLALGEVAARKILELDPENCGYR 600
           LGRA  L+ AH+IIENMPMRPNTI+WGALLAACKLHKNLALGEVAARKILELDP+NCGY 
Sbjct: 541 LGRAGHLEEAHNIIENMPMRPNTIIWGALLAACKLHKNLALGEVAARKILELDPQNCGYS 600

Query: 601 VLKSNIYASEKRWTDVTSVRETMSHLGMKKEPGLSWIEVNGSVHHFRSGDKTCTQTRKVH 660
           VLKSNIYAS KRW DVTSVRETMSHLGMKKEPGLSWIEVNGSVHHF+SGDKTCTQT KV+
Sbjct: 601 VLKSNIYASAKRWNDVTSVRETMSHLGMKKEPGLSWIEVNGSVHHFKSGDKTCTQTTKVY 660

Query: 661 EMVTEMCIKLREAGYAPNTSAVLLNVEDEEKESALSYHSEKLAMAFGLISTAPGTPIRII 720
           EMV EMCIKLREAGY PNT+ VLLN+++EEKESALSYHSEKLAMAFGLISTAPGTPIRII
Sbjct: 661 EMVAEMCIKLREAGYTPNTAEVLLNIDEEEKESALSYHSEKLAMAFGLISTAPGTPIRII 720

Query: 721 KNLRICDDCHAATKLLSKIYGRTIIVRDRNRFHHFSEGYCSCLGYW 765
           KNLRICDDCHAA KLLSKIY RTIIVRDRNRFHHFSEGYCSCLGYW
Sbjct: 721 KNLRICDDCHAAMKLLSKIYARTIIVRDRNRFHHFSEGYCSCLGYW 766
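
The percentages on each HSP statistics line are the identical or positive-scoring columns divided by the alignment length (for this hit, 651/766 and 703/766). A minimal sketch of how such a line could be parsed and rechecked follows; the function name `parse_hsp_stats` and the dictionary keys are illustrative assumptions, not part of the database or of any BLAST tool.

```python
import re

# Matches the statistics line of one HSP. "Posi?tives" also accepts the
# "Postives" spelling that appears in some exported reports.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return the counts and recomputed percentages from an HSP statistics line."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    ident, ident_len, pos, pos_len = map(int, (ident, ident_len, pos, pos_len))
    return {
        "identities": ident,
        "positives": pos,
        "alignment_length": ident_len,
        # Recomputed from the raw counts, rounded like the report.
        "identity_pct": round(100 * ident / ident_len, 2),
        "positives_pct": round(100 * pos / pos_len, 2),
        # Values as printed in the report, kept for comparison.
        "reported_identity_pct": float(ident_pct),
        "reported_positives_pct": float(pos_pct),
    }


stats = parse_hsp_stats(
    "Identity = 651/766 (84.99%), Positives = 703/766 (91.78%), Query Frame = 0"
)
print(stats["identity_pct"], stats["positives_pct"])
```

Comparing the recomputed and reported values is a cheap consistency check when these reports are scraped in bulk.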

BLAST of CmoCh10G010680 vs. ExPASy TrEMBL
Match: A0A1S3CJ58 (pentatricopeptide repeat-containing protein At3g26782, mitochondrial-like OS=Cucumis melo OX=3656 GN=LOC103501009 PE=3 SV=1)

HSP 1 Score: 1345.5 bits (3481), Expect = 0.0e+00
Identity = 650/766 (84.86%), Positives = 703/766 (91.78%), Query Frame = 0

Query: 1   MDQLILSAASPSG-SGHSHLNLQQTHQIHAHCIKTQFRNPHSFFSRSHFTPEANFNLLIS 60
           M+QLILS+ S SG SG+SHLNLQQTHQ+HAH IKTQF NPH FFS+SHFTPEAN+NLLIS
Sbjct: 1   MNQLILSSPSSSGCSGYSHLNLQQTHQLHAHFIKTQFHNPHPFFSQSHFTPEANYNLLIS 60

Query: 61  SYTDNHLPQAAFILYHHMRTTD-AAAVDNFIVPSLLKACAQASSTNFGREVHGFAVKNGF 120
           SYT+NHLPQA+   Y HMRT D AAA+DNFI+PSLLKACAQASS + GRE+HGFA KNGF
Sbjct: 61  SYTNNHLPQASLNCYLHMRTNDAAAALDNFILPSLLKACAQASSADLGRELHGFAQKNGF 120

Query: 121 VSDVFVCNALMNMYEKCGSLVSACLVFDKMPDRDVVSWSTMLGCYVRSKSFGEAYRLVRE 180
            SDVFVCNALMNMYEKCG LVSA LVFDKMP+RDVVSWSTMLGCYVRSK+FGEA RLVRE
Sbjct: 121 ASDVFVCNALMNMYEKCGCLVSASLVFDKMPERDVVSWSTMLGCYVRSKAFGEALRLVRE 180

Query: 181 MHFVGVKLSDVALISMIGVFGELSDMKSGRAIHGYVVRNVGNERIELPLTTALIDMYCKG 240
           M FVGVKLS VALIS+IGVFG L DMKSGRA+HGY++RNVG+E++E+ LTTALIDMYCK 
Sbjct: 181 MQFVGVKLSGVALISLIGVFGNLLDMKSGRAVHGYIMRNVGDEKMEVSLTTALIDMYCKC 240

Query: 241 DKLASAMRLFDGLSQRNVVSWTALIAGCIRSCRFVEGAKNFSRMLEENIAPNEITLLSLI 300
           + LASA RLFD LS+R+VVSWT +I GCIRSCR VEGAKNF+RMLEE + PNEITLLSLI
Sbjct: 241 ECLASAQRLFDRLSKRSVVSWTVMIKGCIRSCRLVEGAKNFNRMLEEKLFPNEITLLSLI 300

Query: 301 TECGFVGALDLGKWLHAYLLRNGFGMSLALATALIDMYGKCGQVAYARALFNGVEEKDVK 360
           TECGFV  LDLGKW HAYLLRNGFGMSLAL TALIDMYGKCGQV YARALFNGVE+KDVK
Sbjct: 301 TECGFVKTLDLGKWFHAYLLRNGFGMSLALVTALIDMYGKCGQVGYARALFNGVEKKDVK 360

Query: 361 IWSALISAYAHASCIDQAFSLFLKMLDSEVKPNKVTMVSLLSLCAEVGALDLGRWTHAYI 420
           IWSALISAYAH SC+DQ F+LFL+MLD+EVKPNKVTMVSLLSLCAE G LDLG+WTHAYI
Sbjct: 361 IWSALISAYAHVSCMDQVFNLFLEMLDNEVKPNKVTMVSLLSLCAEAGTLDLGKWTHAYI 420

Query: 421 NRHGVEVDVVLETALINMYAKCGDLKTARCLFDEATRRDIHMWNAMMAGFSIHGCGKEAL 480
           NRHG+EVDV+LETALINMY KCGD+  AR LFDEAT+RDIHMWNAMMAGFS+HGCGKEAL
Sbjct: 421 NRHGLEVDVILETALINMYVKCGDVTIARSLFDEATQRDIHMWNAMMAGFSMHGCGKEAL 480

Query: 481 ELFSDMVCHGVEPNDITFISVFHACSHSGLVGEGMKHFDRMVHEFGIVPKIEHYGCLVDL 540
           ELFS+M  HGVEPNDITFIS+FHACSHSGLV EG KHF+RMVH FGIVPK+EHYGCLVDL
Sbjct: 481 ELFSEMESHGVEPNDITFISIFHACSHSGLVVEGKKHFNRMVHNFGIVPKMEHYGCLVDL 540

Query: 541 LGRAKRLDAAHSIIENMPMRPNTIVWGALLAACKLHKNLALGEVAARKILELDPENCGYR 600
           LGRA  L+ AH+IIENMPMRPNTI+WGALLAACKLHKNLALGEVAARKILELDP+NCGY 
Sbjct: 541 LGRAGHLEEAHNIIENMPMRPNTIIWGALLAACKLHKNLALGEVAARKILELDPQNCGYS 600

Query: 601 VLKSNIYASEKRWTDVTSVRETMSHLGMKKEPGLSWIEVNGSVHHFRSGDKTCTQTRKVH 660
           VLKSNIYAS KRW DVTSVRETMSHLGMKKEPGLSWIEVNGSVHHF+SGDKTCTQT KV+
Sbjct: 601 VLKSNIYASAKRWNDVTSVRETMSHLGMKKEPGLSWIEVNGSVHHFKSGDKTCTQTTKVY 660

Query: 661 EMVTEMCIKLREAGYAPNTSAVLLNVEDEEKESALSYHSEKLAMAFGLISTAPGTPIRII 720
           EMV EMCIKLREAGY PNT+ VLLN+++EEKESALSYHSEKLAMAFGLISTAPGTPIRII
Sbjct: 661 EMVAEMCIKLREAGYTPNTAEVLLNIDEEEKESALSYHSEKLAMAFGLISTAPGTPIRII 720

Query: 721 KNLRICDDCHAATKLLSKIYGRTIIVRDRNRFHHFSEGYCSCLGYW 765
           KNLRICDDCHAA KLLSKIY RTIIVRDRNRFHHFSEGYCSCLGYW
Sbjct: 721 KNLRICDDCHAAMKLLSKIYARTIIVRDRNRFHHFSEGYCSCLGYW 766

BLAST of CmoCh10G010680 vs. ExPASy TrEMBL
Match: A0A0A0LYC2 (DYW_deaminase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_1G690260 PE=3 SV=1)

HSP 1 Score: 1327.8 bits (3435), Expect = 0.0e+00
Identity = 635/765 (83.01%), Positives = 701/765 (91.63%), Query Frame = 0

Query: 1   MDQLILSAASPSG-SGHSHLNLQQTHQIHAHCIKTQFRNPHSFFSRSHFTPEANFNLLIS 60
           M+QLIL + S SG SGHSHLNLQQTHQ+HAH IKTQF NPH FFS+SHFTPEAN+NLLIS
Sbjct: 1   MNQLILCSPSFSGCSGHSHLNLQQTHQLHAHFIKTQFHNPHPFFSQSHFTPEANYNLLIS 60

Query: 61  SYTDNHLPQAAFILYHHMRTTDAAAVDNFIVPSLLKACAQASSTNFGREVHGFAVKNGFV 120
           SYT+NHLPQA+F  Y HMR+ DAAA+DNFI+PSLLKACAQASS + GRE+HGFA KNGF 
Sbjct: 61  SYTNNHLPQASFNCYLHMRSNDAAALDNFILPSLLKACAQASSGDLGRELHGFAQKNGFA 120

Query: 121 SDVFVCNALMNMYEKCGSLVSACLVFDKMPDRDVVSWSTMLGCYVRSKSFGEAYRLVREM 180
           SDVFVCNALMNMYEKCG LVSA LVFD+MP+RDVVSW+TMLGCYVRSK+FGEA RLVREM
Sbjct: 121 SDVFVCNALMNMYEKCGCLVSARLVFDQMPERDVVSWTTMLGCYVRSKAFGEALRLVREM 180

Query: 181 HFVGVKLSDVALISMIGVFGELSDMKSGRAIHGYVVRNVGNERIELPLTTALIDMYCKGD 240
            FVGVKLS VALIS+I VFG L DMKSGRA+HGY+VRNVG+E++E+ +TTALIDMYCKG 
Sbjct: 181 QFVGVKLSGVALISLIAVFGNLLDMKSGRAVHGYIVRNVGDEKMEVSMTTALIDMYCKGG 240

Query: 241 KLASAMRLFDGLSQRNVVSWTALIAGCIRSCRFVEGAKNFSRMLEENIAPNEITLLSLIT 300
            LASA RLFD LS+R+VVSWT +IAGCIRSCR  EGAKNF+RMLEE + PNEITLLSLIT
Sbjct: 241 CLASAQRLFDRLSKRSVVSWTVMIAGCIRSCRLDEGAKNFNRMLEEKLFPNEITLLSLIT 300

Query: 301 ECGFVGALDLGKWLHAYLLRNGFGMSLALATALIDMYGKCGQVAYARALFNGVEEKDVKI 360
           ECGFVG LDLGKW HAYLLRNGFGMSLAL TALIDMYGKCGQV YARALFNGV++KDVKI
Sbjct: 301 ECGFVGTLDLGKWFHAYLLRNGFGMSLALVTALIDMYGKCGQVGYARALFNGVKKKDVKI 360

Query: 361 WSALISAYAHASCIDQAFSLFLKMLDSEVKPNKVTMVSLLSLCAEVGALDLGRWTHAYIN 420
           WS LISAYAH SC+DQ F+LF++ML+++VKPN VTMVSLLSLCAE GALDLG+WTHAYIN
Sbjct: 361 WSVLISAYAHVSCMDQVFNLFVEMLNNDVKPNNVTMVSLLSLCAEAGALDLGKWTHAYIN 420

Query: 421 RHGVEVDVVLETALINMYAKCGDLKTARCLFDEATRRDIHMWNAMMAGFSIHGCGKEALE 480
           RHG+EVDV+LETALINMYAKCGD+  AR LF+EA +RDI MWN MMAGFS+HGCGKEALE
Sbjct: 421 RHGLEVDVILETALINMYAKCGDVTIARSLFNEAMQRDIRMWNTMMAGFSMHGCGKEALE 480

Query: 481 LFSDMVCHGVEPNDITFISVFHACSHSGLVGEGMKHFDRMVHEFGIVPKIEHYGCLVDLL 540
           LFS+M  HGVEPNDITF+S+FHACSHSGLV EG K+F++MVH+FGIVPK+EHYGCLVDLL
Sbjct: 481 LFSEMESHGVEPNDITFVSIFHACSHSGLVVEGKKYFNKMVHDFGIVPKMEHYGCLVDLL 540

Query: 541 GRAKRLDAAHSIIENMPMRPNTIVWGALLAACKLHKNLALGEVAARKILELDPENCGYRV 600
           GRA  LD AH+IIENMPMRPNTI+WGALLAACKLHKNLALGEVAARKILELDP+NCGY V
Sbjct: 541 GRAGHLDEAHNIIENMPMRPNTIIWGALLAACKLHKNLALGEVAARKILELDPQNCGYSV 600

Query: 601 LKSNIYASEKRWTDVTSVRETMSHLGMKKEPGLSWIEVNGSVHHFRSGDKTCTQTRKVHE 660
           LKSNIYAS KRW DVTSVRE MSH GMKKEPGLSWIEV+GSVHHF+SGDK CTQT KV+E
Sbjct: 601 LKSNIYASAKRWNDVTSVREAMSHSGMKKEPGLSWIEVSGSVHHFKSGDKACTQTTKVYE 660

Query: 661 MVTEMCIKLREAGYAPNTSAVLLNVEDEEKESALSYHSEKLAMAFGLISTAPGTPIRIIK 720
           MVTEMCIKLRE+GY PNT+AVLLN+++EEKESALSYHSEKLA AFGLISTAPGTPIRI+K
Sbjct: 661 MVTEMCIKLRESGYTPNTAAVLLNIDEEEKESALSYHSEKLATAFGLISTAPGTPIRIVK 720

Query: 721 NLRICDDCHAATKLLSKIYGRTIIVRDRNRFHHFSEGYCSCLGYW 765
           NLRICDDCHAATKLLSKIYGRTIIVRDRNRFHHFSEGYCSC+GYW
Sbjct: 721 NLRICDDCHAATKLLSKIYGRTIIVRDRNRFHHFSEGYCSCMGYW 765

BLAST of CmoCh10G010680 vs. TAIR 10
Match: AT1G11290.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 544.7 bits (1402), Expect = 1.2e-154
Identity = 266/672 (39.58%), Positives = 409/672 (60.86%), Query Frame = 0

Query: 93  LLKACAQASSTNFGREVHGFAVKNGFVSDVFVCNALMNMYEKCGSLVSACLVFDKMPDRD 152
           LLK C   +    G+E+HG  VK+GF  D+F    L NMY KC  +  A  VFD+MP+RD
Sbjct: 141 LLKVCGDEAELRVGKEIHGLLVKSGFSLDLFAMTGLENMYAKCRQVNEARKVFDRMPERD 200

Query: 153 VVSWSTMLGCYVRSKSFGEAYRLVREMHFVGVKLSDVALISMIGVFGELSDMKSGRAIHG 212
           +VSW+T++  Y ++     A  +V+ M    +K S + ++S++     L  +  G+ IHG
Sbjct: 201 LVSWNTIVAGYSQNGMARMALEMVKSMCEENLKPSFITIVSVLPAVSALRLISVGKEIHG 260

Query: 213 YVVRNVGNERIELPLTTALIDMYCKGDKLASAMRLFDGLSQRNVVSWTALIAGCIRSCRF 272
           Y +R+  +  +   ++TAL+DMY K   L +A +LFDG+ +RNVVSW ++I   +++   
Sbjct: 261 YAMRSGFDSLVN--ISTALVDMYAKCGSLETARQLFDGMLERNVVSWNSMIDAYVQNENP 320

Query: 273 VEGAKNFSRMLEENIAPNEITLLSLITECGFVGALDLGKWLHAYLLRNGFGMSLALATAL 332
            E    F +ML+E + P +++++  +  C  +G L+ G+++H   +  G   ++++  +L
Sbjct: 321 KEAMLIFQKMLDEGVKPTDVSVMGALHACADLGDLERGRFIHKLSVELGLDRNVSVVNSL 380

Query: 333 IDMYGKCGQVAYARALFNGVEEKDVKIWSALISAYAHASCIDQAFSLFLKMLDSEVKPNK 392
           I MY KC +V  A ++F  ++ + +  W+A+I  +A       A + F +M    VKP+ 
Sbjct: 381 ISMYCKCKEVDTAASMFGKLQSRTLVSWNAMILGFAQNGRPIDALNYFSQMRSRTVKPDT 440

Query: 393 VTMVSLLSLCAEVGALDLGRWTHAYINRHGVEVDVVLETALINMYAKCGDLKTARCLFDE 452
            T VS+++  AE+      +W H  + R  ++ +V + TAL++MYAKCG +  AR +FD 
Sbjct: 441 FTYVSVITAIAELSITHHAKWIHGVVMRSCLDKNVFVTTALVDMYAKCGAIMIARLIFDM 500

Query: 453 ATRRDIHMWNAMMAGFSIHGCGKEALELFSDMVCHGVEPNDITFISVFHACSHSGLVGEG 512
            + R +  WNAM+ G+  HG GK ALELF +M    ++PN +TF+SV  ACSHSGLV  G
Sbjct: 501 MSERHVTTWNAMIDGYGTHGFGKAALELFEEMQKGTIKPNGVTFLSVISACSHSGLVEAG 560

Query: 513 MKHFDRMVHEFGIVPKIEHYGCLVDLLGRAKRLDAAHSIIENMPMRPNTIVWGALLAACK 572
           +K F  M   + I   ++HYG +VDLLGRA RL+ A   I  MP++P   V+GA+L AC+
Sbjct: 561 LKCFYMMKENYSIELSMDHYGAMVDLLGRAGRLNEAWDFIMQMPVKPAVNVYGAMLGACQ 620

Query: 573 LHKNLALGEVAARKILELDPENCGYRVLKSNIYASEKRWTDVTSVRETMSHLGMKKEPGL 632
           +HKN+   E AA ++ EL+P++ GY VL +NIY +   W  V  VR +M   G++K PG 
Sbjct: 621 IHKNVNFAEKAAERLFELNPDDGGYHVLLANIYRAASMWEKVGQVRVSMLRQGLRKTPGC 680

Query: 633 SWIEVNGSVHHFRSGDKTCTQTRKVHEMVTEMCIKLREAGYAPNTSAVLLNVEDEEKESA 692
           S +E+   VH F SG      ++K++  + ++   ++EAGY P+T+ V L VE++ KE  
Sbjct: 681 SMVEIKNEVHSFFSGSTAHPDSKKIYAFLEKLICHIKEAGYVPDTNLV-LGVENDVKEQL 740

Query: 693 LSYHSEKLAMAFGLISTAPGTPIRIIKNLRICDDCHAATKLLSKIYGRTIIVRDRNRFHH 752
           LS HSEKLA++FGL++T  GT I + KNLR+C DCH ATK +S + GR I+VRD  RFHH
Sbjct: 741 LSTHSEKLAISFGLLNTTAGTTIHVRKNLRVCADCHNATKYISLVTGREIVVRDMQRFHH 800

Query: 753 FSEGYCSCLGYW 765
           F  G CSC  YW
Sbjct: 801 FKNGACSCGDYW 809

BLAST of CmoCh10G010680 vs. TAIR 10
Match: AT2G29760.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 540.0 bits (1390), Expect = 3.0e-153
Identity = 293/770 (38.05%), Positives = 436/770 (56.62%), Query Frame = 0

Query: 19  LNLQQTHQIHAHCIKT-QFRNPHSF----------------FSRSHF--TPEAN---FNL 78
           ++L+Q  Q H H I+T  F +P+S                 ++R  F   P+ N   +N 
Sbjct: 41  VSLRQLKQTHGHMIRTGTFSDPYSASKLFAMAALSSFASLEYARKVFDEIPKPNSFAWNT 100

Query: 79  LISSYTDNHLPQAAFILYHHMRTTDAAAVDNFIVPSLLKACAQASSTNFGREVHGFAVKN 138
           LI +Y     P  +   +  M +      + +  P L+KA A+ SS + G+ +HG AVK+
Sbjct: 101 LIRAYASGPDPVLSIWAFLDMVSESQCYPNKYTFPFLIKAAAEVSSLSLGQSLHGMAVKS 160

Query: 139 GFVSDVFVCNALMNMYEKCGSLVSACLVFDKMPDRDVVSWSTMLGCYVRSKSFGEAYRLV 198
              SDVFV N+L++ Y  CG L SAC VF  + ++DVVSW++M+  +V+  S  +A  L 
Sbjct: 161 AVGSDVFVANSLIHCYFSCGDLDSACKVFTTIKEKDVVSWNSMINGFVQKGSPDKALELF 220

Query: 199 REMHFVGVKLSDVALISMIGVFGELSDMKSGRAIHGYVVRNVGNERIELPLTTALIDMYC 258
           ++M    VK S V ++ ++    ++ +++ GR +  Y+  N  N  + L L  A++DMY 
Sbjct: 221 KKMESEDVKASHVTMVGVLSACAKIRNLEFGRQVCSYIEENRVN--VNLTLANAMLDMYT 280

Query: 259 KGDKLASAMRLFDGLSQRNVVSWTALIAGCIRSCRFVEGAKNFSRMLEENIAPNEITLLS 318
           K   +  A RLFD + +++ V+WT ++ G                               
Sbjct: 281 KCGSIEDAKRLFDAMEEKDNVTWTTMLDG------------------------------- 340

Query: 319 LITECGFVGALDLGKWLHAYLLRNGFGMSLALATALIDMYGKCGQVAYARALFNGVEEKD 378
                                                  Y        AR + N + +KD
Sbjct: 341 ---------------------------------------YAISEDYEAAREVLNSMPQKD 400

Query: 379 VKIWSALISAYAHASCIDQAFSLFLKM-LDSEVKPNKVTMVSLLSLCAEVGALDLGRWTH 438
           +  W+ALISAY      ++A  +F ++ L   +K N++T+VS LS CA+VGAL+LGRW H
Sbjct: 401 IVAWNALISAYEQNGKPNEALIVFHELQLQKNMKLNQITLVSTLSACAQVGALELGRWIH 460

Query: 439 AYINRHGVEVDVVLETALINMYAKCGDLKTARCLFDEATRRDIHMWNAMMAGFSIHGCGK 498
           +YI +HG+ ++  + +ALI+MY+KCGDL+ +R +F+   +RD+ +W+AM+ G ++HGCG 
Sbjct: 461 SYIKKHGIRMNFHVTSALIHMYSKCGDLEKSREVFNSVEKRDVFVWSAMIGGLAMHGCGN 520

Query: 499 EALELFSDMVCHGVEPNDITFISVFHACSHSGLVGEGMKHFDRMVHEFGIVPKIEHYGCL 558
           EA+++F  M    V+PN +TF +VF ACSH+GLV E    F +M   +GIVP+ +HY C+
Sbjct: 521 EAVDMFYKMQEANVKPNGVTFTNVFCACSHTGLVDEAESLFHQMESNYGIVPEEKHYACI 580

Query: 559 VDLLGRAKRLDAAHSIIENMPMRPNTIVWGALLAACKLHKNLALGEVAARKILELDPENC 618
           VD+LGR+  L+ A   IE MP+ P+T VWGALL ACK+H NL L E+A  ++LEL+P N 
Sbjct: 581 VDVLGRSGYLEKAVKFIEAMPIPPSTSVWGALLGACKIHANLNLAEMACTRLLELEPRND 640

Query: 619 GYRVLKSNIYASEKRWTDVTSVRETMSHLGMKKEPGLSWIEVNGSVHHFRSGDKTCTQTR 678
           G  VL SNIYA   +W +V+ +R+ M   G+KKEPG S IE++G +H F SGD     + 
Sbjct: 641 GAHVLLSNIYAKLGKWENVSELRKHMRVTGLKKEPGCSSIEIDGMIHEFLSGDNAHPMSE 700

Query: 679 KVHEMVTEMCIKLREAGYAPNTSAVLLNVEDEE-KESALSYHSEKLAMAFGLISTAPGTP 738
           KV+  + E+  KL+  GY P  S VL  +E+EE KE +L+ HSEKLA+ +GLIST     
Sbjct: 701 KVYGKLHEVMEKLKSNGYEPEISQVLQIIEEEEMKEQSLNLHSEKLAICYGLISTEAPKV 738

Query: 739 IRIIKNLRICDDCHAATKLLSKIYGRTIIVRDRNRFHHFSEGYCSCLGYW 765
           IR+IKNLR+C DCH+  KL+S++Y R IIVRDR RFHHF  G CSC  +W
Sbjct: 761 IRVIKNLRVCGDCHSVAKLISQLYDREIIVRDRYRFHHFRNGQCSCNDFW 738

BLAST of CmoCh10G010680 vs. TAIR 10
Match: AT3G22690.2 (INVOLVED IN: photosystem II assembly, regulation of chlorophyll biosynthetic process, photosystem I assembly, thylakoid membrane organization, RNA modification; LOCATED IN: chloroplast; EXPRESSED IN: 13 plant structures; EXPRESSED DURING: LP.04 four leaves visible, 4 anthesis, petal differentiation and expansion stage, E expanded cotyledon stage, D bilateral stage; CONTAINS InterPro DOMAIN/s: Pentatricopeptide repeat (InterPro:IPR002885); BEST Arabidopsis thaliana protein match is: Tetratricopeptide repeat (TPR)-like superfamily protein (TAIR:AT2G29760.1). )

HSP 1 Score: 533.9 bits (1374), Expect = 2.1e-151
Identity = 281/747 (37.62%), Positives = 431/747 (57.70%), Query Frame = 0

Query: 54  FNLLISSYTDNHLPQAAFILYHHMRTTDAAAVDNFIVPSLLKACAQASSTNFGREVHGFA 113
           +N LI  Y  + L   A +L+  M  +   + D +  P  L ACA++ +   G ++HG  
Sbjct: 102 YNSLIRGYASSGLCNEAILLFLRMMNS-GISPDKYTFPFGLSACAKSRAKGNGIQIHGLI 161

Query: 114 VKNGFVSDVFVCNALMNMYEKCGSLVSACLVFDKMPDRDVVSWSTMLGCYVRSKSFGEA- 173
           VK G+  D+FV N+L++ Y +CG L SA  VFD+M +R+VVSW++M+  Y R     +A 
Sbjct: 162 VKMGYAKDLFVQNSLVHFYAECGELDSARKVFDEMSERNVVSWTSMICGYARRDFAKDAV 221

Query: 174 ---YRLVREMHFVGVKLSDVALISMIGVFGELSDMKSGRAIHGYVVRNVGNERIELPLTT 233
              +R+VR+     V  + V ++ +I    +L D+++G  ++ + +RN G E  +L + +
Sbjct: 222 DLFFRMVRDEE---VTPNSVTMVCVISACAKLEDLETGEKVYAF-IRNSGIEVNDL-MVS 281

Query: 234 ALIDMYCKGDKLASAMRLFDGLSQRNVVSWTALIAGCIRSCRFVEGAKNFSRMLEENIAP 293
           AL+DMY K + +  A RLFD     N+    A+ +  +R     E    F+ M++  + P
Sbjct: 282 ALVDMYMKCNAIDVAKRLFDEYGASNLDLCNAMASNYVRQGLTREALGVFNLMMDSGVRP 341

Query: 294 NEITLLSLITECGFVGALDLGKWLHAYLLRNGFGMSLALATALIDMYGKC---------- 353
           + I++LS I+ C  +  +  GK  H Y+LRNGF     +  ALIDMY KC          
Sbjct: 342 DRISMLSAISSCSQLRNILWGKSCHGYVLRNGFESWDNICNALIDMYMKCHRQDTAFRIF 401

Query: 354 ---------------------GQVAYARALFNGVEEKDVKIWSALISAYAHASCIDQAFS 413
                                G+V  A   F  + EK++  W+ +IS     S  ++A  
Sbjct: 402 DRMSNKTVVTWNSIVAGYVENGEVDAAWETFETMPEKNIVSWNTIISGLVQGSLFEEAIE 461

Query: 414 LFLKMLDSE-VKPNKVTMVSLLSLCAEVGALDLGRWTHAYINRHGVEVDVVLETALINMY 473
           +F  M   E V  + VTM+S+ S C  +GALDL +W + YI ++G+++DV L T L++M+
Sbjct: 462 VFCSMQSQEGVNADGVTMMSIASACGHLGALDLAKWIYYYIEKNGIQLDVRLGTTLVDMF 521

Query: 474 AKCGDLKTARCLFDEATRRDIHMWNAMMAGFSIHGCGKEALELFSDMVCHGVEPNDITFI 533
           ++CGD ++A  +F+  T RD+  W A +   ++ G  + A+ELF DM+  G++P+ + F+
Sbjct: 522 SRCGDPESAMSIFNSLTNRDVSAWTAAIGAMAMAGNAERAIELFDDMIEQGLKPDGVAFV 581

Query: 534 SVFHACSHSGLVGEGMKHFDRMVHEFGIVPKIEHYGCLVDLLGRAKRLDAAHSIIENMPM 593
               ACSH GLV +G + F  M+   G+ P+  HYGC+VDLLGRA  L+ A  +IE+MPM
Sbjct: 582 GALTACSHGGLVQQGKEIFYSMLKLHGVSPEDVHYGCMVDLLGRAGLLEEAVQLIEDMPM 641

Query: 594 RPNTIVWGALLAACKLHKNLALGEVAARKILELDPENCGYRVLKSNIYASEKRWTDVTSV 653
            PN ++W +LLAAC++  N+ +   AA KI  L PE  G  VL SN+YAS  RW D+  V
Sbjct: 642 EPNDVIWNSLLAACRVQGNVEMAAYAAEKIQVLAPERTGSYVLLSNVYASAGRWNDMAKV 701

Query: 654 RETMSHLGMKKEPGLSWIEVNGSVHHFRSGDKTCTQTRKVHEMVTEMCIKLREAGYAPNT 713
           R +M   G++K PG S I++ G  H F SGD++  +   +  M+ E+  +    G+ P+ 
Sbjct: 702 RLSMKEKGLRKPPGTSSIQIRGKTHEFTSGDESHPEMPNIEAMLDEVSQRASHLGHVPDL 761

Query: 714 SAVLLNVEDEEKESALSYHSEKLAMAFGLISTAPGTPIRIIKNLRICDDCHAATKLLSKI 765
           S VL++V+++EK   LS HSEKLAMA+GLIS+  GT IRI+KNLR+C DCH+  K  SK+
Sbjct: 762 SNVLMDVDEKEKIFMLSRHSEKLAMAYGLISSNKGTTIRIVKNLRVCSDCHSFAKFASKV 821

BLAST of CmoCh10G010680 vs. TAIR 10
Match: AT1G08070.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 531.2 bits (1367), Expect = 1.4e-150
Identity = 291/774 (37.60%), Positives = 428/774 (55.30%), Query Frame = 0

Query: 16  HSHLNLQQTHQIHAHCIKTQFRNPHSFFSR--------SHF------------TPEAN-- 75
           H+   LQ    IHA  IK    N +   S+         HF              E N  
Sbjct: 41  HNCKTLQSLRIIHAQMIKIGLHNTNYALSKLIEFCILSPHFEGLPYAISVFKTIQEPNLL 100

Query: 76  -FNLLISSYTDNHLPQAAFILYHHMRTTDAAAVDNFIVPSLLKACAQASSTNFGREVHGF 135
            +N +   +  +  P +A  LY  M +      +++  P +LK+CA++ +   G+++HG 
Sbjct: 101 IWNTMFRGHALSSDPVSALKLYVCMISL-GLLPNSYTFPFVLKSCAKSKAFKEGQQIHGH 160

Query: 136 AVKNGFVSDVFVCNALMNMYEKCGSLVSACLVFDKMPDRDVVSWSTMLGCYVRSKSFGEA 195
            +K G   D++V  +L++MY + G L  A  VFDK P RDVVS+                
Sbjct: 161 VLKLGCDLDLYVHTSLISMYVQNGRLEDAHKVFDKSPHRDVVSY---------------- 220

Query: 196 YRLVREMHFVGVKLSDVALISMIGVFGELSDMKSGRAIHGYVVRNVGNERIELPLTTALI 255
                                                                   TALI
Sbjct: 221 --------------------------------------------------------TALI 280

Query: 256 DMYCKGDKLASAMRLFDGLSQRNVVSWTALIAGCIRSCRFVEGAKNFSRMLEENIAPNEI 315
             Y     + +A +LFD +  ++VVSW A+I+G   +  + E  + F  M++ N+ P+E 
Sbjct: 281 KGYASRGYIENAQKLFDEIPVKDVVSWNAMISGYAETGNYKEALELFKDMMKTNVRPDES 340

Query: 316 TLLSLITECGFVGALDLGKWLHAYLLRNGFGMSLALATALIDMYGKCGQVAYARALFNGV 375
           T++++++ C   G+++LG+ +H ++  +GFG +L +  ALID+Y KCG++  A  LF  +
Sbjct: 341 TMVTVVSACAQSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSKCGELETACGLFERL 400

Query: 376 EEKDVKIWSALISAYAHASCIDQAFSLFLKMLDSEVKPNKVTMVSLLSLCAEVGALDLGR 435
             KDV  W+ LI  Y H +   +A  LF +ML S   PN VTM+S+L  CA +GA+D+GR
Sbjct: 401 PYKDVISWNTLIGGYTHMNLYKEALLLFQEMLRSGETPNDVTMLSILPACAHLGAIDIGR 460

Query: 436 WTHAYINRH--GVEVDVVLETALINMYAKCGDLKTARCLFDEATRRDIHMWNAMMAGFSI 495
           W H YI++   GV     L T+LI+MYAKCGD++ A  +F+    + +  WNAM+ GF++
Sbjct: 461 WIHVYIDKRLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSILHKSLSSWNAMIFGFAM 520

Query: 496 HGCGKEALELFSDMVCHGVEPNDITFISVFHACSHSGLVGEGMKHFDRMVHEFGIVPKIE 555
           HG    + +LFS M   G++P+DITF+ +  ACSHSG++  G   F  M  ++ + PK+E
Sbjct: 521 HGRADASFDLFSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRHIFRTMTQDYKMTPKLE 580

Query: 556 HYGCLVDLLGRAKRLDAAHSIIENMPMRPNTIVWGALLAACKLHKNLALGEVAARKILEL 615
           HYGC++DLLG +     A  +I  M M P+ ++W +LL ACK+H N+ LGE  A  ++++
Sbjct: 581 HYGCMIDLLGHSGLFKEAEEMINMMEMEPDGVIWCSLLKACKMHGNVELGESFAENLIKI 640

Query: 616 DPENCGYRVLKSNIYASEKRWTDVTSVRETMSHLGMKKEPGLSWIEVNGSVHHFRSGDKT 675
           +PEN G  VL SNIYAS  RW +V   R  ++  GMKK PG S IE++  VH F  GDK 
Sbjct: 641 EPENPGSYVLLSNIYASAGRWNEVAKTRALLNDKGMKKVPGCSSIEIDSVVHEFIIGDKF 700

Query: 676 CTQTRKVHEMVTEMCIKLREAGYAPNTSAVLLNVEDEEKESALSYHSEKLAMAFGLISTA 735
             + R+++ M+ EM + L +AG+ P+TS VL  +E+E KE AL +HSEKLA+AFGLIST 
Sbjct: 701 HPRNREIYGMLEEMEVLLEKAGFVPDTSEVLQEMEEEWKEGALRHHSEKLAIAFGLISTK 741

Query: 736 PGTPIRIIKNLRICDDCHAATKLLSKIYGRTIIVRDRNRFHHFSEGYCSCLGYW 765
           PGT + I+KNLR+C +CH ATKL+SKIY R II RDR RFHHF +G CSC  YW
Sbjct: 761 PGTKLTIVKNLRVCRNCHEATKLISKIYKREIIARDRTRFHHFRDGVCSCNDYW 741

BLAST of CmoCh10G010680 vs. TAIR 10
Match: AT3G22690.1 (CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF1685 (InterPro:IPR012881), Pentatricopeptide repeat (InterPro:IPR002885); BEST Arabidopsis thaliana protein match is: Tetratricopeptide repeat (TPR)-like superfamily protein (TAIR:AT2G29760.1); Has 49784 Blast hits to 14716 proteins in 280 species: Archae - 2; Bacteria - 10; Metazoa - 107; Fungi - 167; Plants - 48594; Viruses - 0; Other Eukaryotes - 904 (source: NCBI BLink). )

HSP 1 Score: 530.8 bits (1366), Expect = 1.8e-150
Identity = 280/743 (37.69%), Positives = 429/743 (57.74%), Query Frame = 0

Query: 54  FNLLISSYTDNHLPQAAFILYHHMRTTDAAAVDNFIVPSLLKACAQASSTNFGREVHGFA 113
           +N LI  Y  + L   A +L+  M  +   + D +  P  L ACA++ +   G ++HG  
Sbjct: 102 YNSLIRGYASSGLCNEAILLFLRMMNS-GISPDKYTFPFGLSACAKSRAKGNGIQIHGLI 161

Query: 114 VKNGFVSDVFVCNALMNMYEKCGSLVSACLVFDKMPDRDVVSWSTMLGCYVRSKSFGEA- 173
           VK G+  D+FV N+L++ Y +CG L SA  VFD+M +R+VVSW++M+  Y R     +A 
Sbjct: 162 VKMGYAKDLFVQNSLVHFYAECGELDSARKVFDEMSERNVVSWTSMICGYARRDFAKDAV 221

Query: 174 ---YRLVREMHFVGVKLSDVALISMIGVFGELSDMKSGRAIHGYVVRNVGNERIELPLTT 233
              +R+VR+     V  + V ++ +I    +L D+++G  ++ + +RN G E  +L + +
Sbjct: 222 DLFFRMVRDEE---VTPNSVTMVCVISACAKLEDLETGEKVYAF-IRNSGIEVNDL-MVS 281

Query: 234 ALIDMYCKGDKLASAMRLFDGLSQRNVVSWTALIAGCIRSCRFVEGAKNFSRMLEENIAP 293
           AL+DMY K + +  A RLFD     N+    A+ +  +R     E    F+ M++  + P
Sbjct: 282 ALVDMYMKCNAIDVAKRLFDEYGASNLDLCNAMASNYVRQGLTREALGVFNLMMDSGVRP 341

Query: 294 NEITLLSLITECGFVGALDLGKWLHAYLLRNGFGMSLALATALIDMYGKC---------- 353
           + I++LS I+ C  +  +  GK  H Y+LRNGF     +  ALIDMY KC          
Sbjct: 342 DRISMLSAISSCSQLRNILWGKSCHGYVLRNGFESWDNICNALIDMYMKCHRQDTAFRIF 401

Query: 354 ---------------------GQVAYARALFNGVEEKDVKIWSALISAYAHASCIDQAFS 413
                                G+V  A   F  + EK++  W+ +IS     S  ++A  
Sbjct: 402 DRMSNKTVVTWNSIVAGYVENGEVDAAWETFETMPEKNIVSWNTIISGLVQGSLFEEAIE 461

Query: 414 LFLKMLDSE-VKPNKVTMVSLLSLCAEVGALDLGRWTHAYINRHGVEVDVVLETALINMY 473
           +F  M   E V  + VTM+S+ S C  +GALDL +W + YI ++G+++DV L T L++M+
Sbjct: 462 VFCSMQSQEGVNADGVTMMSIASACGHLGALDLAKWIYYYIEKNGIQLDVRLGTTLVDMF 521

Query: 474 AKCGDLKTARCLFDEATRRDIHMWNAMMAGFSIHGCGKEALELFSDMVCHGVEPNDITFI 533
           ++CGD ++A  +F+  T RD+  W A +   ++ G  + A+ELF DM+  G++P+ + F+
Sbjct: 522 SRCGDPESAMSIFNSLTNRDVSAWTAAIGAMAMAGNAERAIELFDDMIEQGLKPDGVAFV 581

Query: 534 SVFHACSHSGLVGEGMKHFDRMVHEFGIVPKIEHYGCLVDLLGRAKRLDAAHSIIENMPM 593
               ACSH GLV +G + F  M+   G+ P+  HYGC+VDLLGRA  L+ A  +IE+MPM
Sbjct: 582 GALTACSHGGLVQQGKEIFYSMLKLHGVSPEDVHYGCMVDLLGRAGLLEEAVQLIEDMPM 641

Query: 594 RPNTIVWGALLAACKLHKNLALGEVAARKILELDPENCGYRVLKSNIYASEKRWTDVTSV 653
            PN ++W +LLAAC++  N+ +   AA KI  L PE  G  VL SN+YAS  RW D+  V
Sbjct: 642 EPNDVIWNSLLAACRVQGNVEMAAYAAEKIQVLAPERTGSYVLLSNVYASAGRWNDMAKV 701

Query: 654 RETMSHLGMKKEPGLSWIEVNGSVHHFRSGDKTCTQTRKVHEMVTEMCIKLREAGYAPNT 713
           R +M   G++K PG S I++ G  H F SGD++  +   +  M+ E+  +    G+ P+ 
Sbjct: 702 RLSMKEKGLRKPPGTSSIQIRGKTHEFTSGDESHPEMPNIEAMLDEVSQRASHLGHVPDL 761

Query: 714 SAVLLNVEDEEKESALSYHSEKLAMAFGLISTAPGTPIRIIKNLRICDDCHAATKLLSKI 761
           S VL++V+++EK   LS HSEKLAMA+GLIS+  GT IRI+KNLR+C DCH+  K  SK+
Sbjct: 762 SNVLMDVDEKEKIFMLSRHSEKLAMAYGLISSNKGTTIRIVKNLRVCSDCHSFAKFASKV 821

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
Q3E6Q1	1.7e-153	39.58	Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidop...
O82380	4.2e-152	38.05	Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidop...
Q9LUJ2	3.0e-150	37.62	Pentatricopeptide repeat-containing protein At3g22690 OS=Arabidopsis thaliana OX...
Q9LN01	2.0e-149	37.60	Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidop...
Q9SN39	2.2e-148	38.77	Pentatricopeptide repeat-containing protein DOT4, chloroplastic OS=Arabidopsis t...

Match Name	E-value	Identity	Description
A0A6J1HA74	0.0e+00	100.00	pentatricopeptide repeat-containing protein At3g26782, mitochondrial-like OS=Cuc...
A0A6J1JKG9	0.0e+00	96.07	pentatricopeptide repeat-containing protein At4g21065-like OS=Cucurbita maxima O...
A0A5A7V2V9	0.0e+00	84.99	Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946...
A0A1S3CJ58	0.0e+00	84.86	pentatricopeptide repeat-containing protein At3g26782, mitochondrial-like OS=Cuc...
A0A0A0LYC2	0.0e+00	83.01	DYW_deaminase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_1G6902...

Match Name	E-value	Identity	Description
AT1G11290.1	1.2e-154	39.58	Pentatricopeptide repeat (PPR) superfamily protein
AT2G29760.1	3.0e-153	38.05	Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G22690.2	2.1e-151	37.62	INVOLVED IN: photosystem II assembly, regulation of chlorophyll biosynthetic pro...
AT1G08070.1	1.4e-150	37.60	Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G22690.1	1.8e-150	37.69	CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF1685 (InterPro:IPR012...
InterPro
Analysis Name: InterPro Annotations of Cucurbita moschata (Rifu) v1
Date Performed: 2021-10-25
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	1.25.40.10	Tetratricopeptide repeat domain	coord: 49..223, e-value: 2.2E-28, score: 101.6; coord: 431..672, e-value: 3.9E-40, score: 140.1
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	1.25.40.10	Tetratricopeptide repeat domain	coord: 306..421, e-value: 1.7E-21, score: 78.9
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	1.25.40.10	Tetratricopeptide repeat domain	coord: 224..305, e-value: 1.7E-15, score: 58.8
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR	coord: 154..180, e-value: 8.7E-4, score: 19.4; coord: 257..287, e-value: 0.39, score: 11.1; coord: 124..152, e-value: 0.016, score: 15.4; coord: 229..255, e-value: 0.13, score: 12.6
IPR002885	Pentatricopeptide repeat	PFAM	PF13041	PPR_2	coord: 457..504, e-value: 5.2E-12, score: 45.8; coord: 355..402, e-value: 7.0E-8, score: 32.6
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 154..187, e-value: 8.5E-6, score: 23.6; coord: 359..391, e-value: 7.8E-6, score: 23.7; coord: 460..492, e-value: 7.2E-7, score: 27.0; coord: 257..291, e-value: 3.9E-4, score: 18.4; coord: 124..154, e-value: 0.0029, score: 15.7
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 152..186, score: 10.500983
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 356..390, score: 10.775016
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 457..491, score: 11.39981
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 255..289, score: 9.996763
IPR032867	DYW domain	PFAM	PF14432	DYW_deaminase	coord: 630..753, e-value: 2.4E-39, score: 134.1
None	No IPR available	PANTHER	PTHR47924:SF1	SUBFAMILY NOT NAMED	coord: 28..181; coord: 241..368
None	No IPR available	PANTHER	PTHR47924	PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN	coord: 159..305
None	No IPR available	PANTHER	PTHR47924	PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN	coord: 241..368
None	No IPR available	PANTHER	PTHR47924	PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN	coord: 28..181; coord: 342..758
None	No IPR available	PANTHER	PTHR47924:SF1	SUBFAMILY NOT NAMED	coord: 342..758; coord: 159..305
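
The `coord: start..end` entries above give the span of each domain hit on the protein. A minimal sketch for pulling out those spans and their lengths follows; it assumes the coordinates are 1-based, inclusive residue positions, and the helper name `domain_spans` is hypothetical.

```python
import re

# One "coord: start..end" entry from the Alignment column.
COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def domain_spans(alignment_text):
    """Return (start, end, length) for every coord entry in the text.

    Assumes 1-based inclusive coordinates, so length = end - start + 1.
    """
    spans = []
    for start, end in COORD.findall(alignment_text):
        start, end = int(start), int(end)
        spans.append((start, end, end - start + 1))
    return spans


# DYW domain hit (PF14432) and the two PPR_2 hits (PF13041) from the table.
dyw = domain_spans("coord: 630..753, e-value: 2.4E-39, score: 134.1")
ppr2 = domain_spans("coord: 457..504, e-value: 5.2E-12; coord: 355..402, e-value: 7.0E-8")
print(dyw, ppr2)
```

Under that assumption the DYW hit covers 124 residues and each PPR_2 hit covers 48.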

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
CmoCh10G010680.1	CmoCh10G010680.1	mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
molecular_function	GO:0005515	protein binding
molecular_function	GO:0008270	zinc ion binding