MC04g0907 (gene) Bitter gourd (Dali-11) v1

Overview
NameMC04g0907
Typegene
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionPentatricopeptide repeat-containing protein
LocationMC04: 16499848 .. 16501428 (+)
RNA-Seq ExpressionMC04g0907
SyntenyMC04g0907
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
GCCCGCCATCTGTTCGATCATTTTCCTGACCCAAAAGTGGCGCTCTGGAATTCAATCCTCAGAGGGTATGCCCACAACGAGTTTTACAGGGAAGTGGTTCTTTTGTTTGGGAAAATGAAGAGCACGGACGTGAGACCCAACTGCTTCACGTTCCCTCTTGTTCTCAAATCTTGTGCCAAAATCAATGCATTCGTGGAAGGGGAGGAGATTCATTGTGAGGTCATTAAGGGTGGGTTTAGAGGGAACCAATTCGTGGCTACTACTCTGATCGATGTGTACTCTGGTGGGAGGGCAATTGGGTCTGCCTACAAGGTGTTTGTTGGAATGCTTGAGAGAAATATAGTAGCCTGGACTTCCATGATCAGTGGCTACATTTTGTGCAATGATGTGACATTTGCACGGCGGCTGTTCGACTTGGCACCCGAACGAGATGTTGTGCTGTGGAGCATTATGGTTTCTGGTTATATTGAAATAGGGCATATGGAGGCAGCAAGAAAGCTGTTTGATGCAATGCCGTACCGTGATGTGATGTCTTGGAATACAATGTTGAGTGGTTATGCAAATAATGGGGACGTTGAGGCTTGCGAGCATCTGTTTGAAGAGATGCCCGAGCGGAATGTTTTCTCCTGGAATGGATTGATTGGAGGGTATGCTCATAATGGGCGTTTCTTTGATGTATTGAGTTGTTTCAAACGGATGCTACTCGATGGCCAGGTAGTTCCCAATGATGCCACCCTTGTGACTGTGTTGTCTGCTTGTGCGAGATTGGGAGCTCTTGATTTGGGAAAGTGGGTGCATGTATACGCTGCGACCATCGGGTTTAAAAGAAACATTTATGTTGGAAATGCATTGATAGACATGTATGCAAAATGCGGTGTGATAGAGAATGCAATGGAAGTATTTGGGAGCATGGATTCAAAAGACCTTATTTCATGGAATTCTGTGATCTGTGGCTTGGCAACTCATGGATGTGGGGCGGATGCGTTAAATTTGTTTCACCGGATGAAGAACTCTGGCAAAAAACCGGATGGAATCACCTTCATTGGGGTATTGTGCTCCTGTACTCATCTTGGATTAGTTGAAGAAGGCATCTCATATTTCAACTCAATGGTCCATGACTACTCCATTGATCCTCAGATTGAGCATTATGGTTGTATGGTCGATCTATTTGGCCGAGCTGGTCTTTTGGACCGGGCAATTGAGTTCGTGAAGAGGATGCCGATCAAAGCAGATGCTGTCATCTGGGCTGCCCTGCTAGGTGCGTGCAGGACTTACAAAAACATTGATTTGGCAGAATTAGCTCTTCAGAAACTCATTCAGCTCGAACCCAAAAACCCTGCAAATTACGTCATGCTATCGAATATTTATGGAGATCTTAGTAGATGGAAAGACGTTGCACGGTTGAAGATTTTAATGAGGGACACCGGGTTCAAAAAGTTGCCAGGATGTAGCTTGATTGAGGTAAATGATAGCGTGGTTGAGTTTTATTCCTTAGATGAGAGGCATTCTCAGAGCAAGGAAATATATGGAGTCTTAAAGGGATTGATGAAATTGTTAAGATCGTATGGATATGAACCAAAT

mRNA sequence

GCCCGCCATCTGTTCGATCATTTTCCTGACCCAAAAGTGGCGCTCTGGAATTCAATCCTCAGAGGGTATGCCCACAACGAGTTTTACAGGGAAGTGGTTCTTTTGTTTGGGAAAATGAAGAGCACGGACGTGAGACCCAACTGCTTCACGTTCCCTCTTGTTCTCAAATCTTGTGCCAAAATCAATGCATTCGTGGAAGGGGAGGAGATTCATTGTGAGGTCATTAAGGGTGGGTTTAGAGGGAACCAATTCGTGGCTACTACTCTGATCGATGTGTACTCTGGGCATATGGAGGCAGCAAGAAAGCTGTTTGATGCAATGCCGTACCGTGATGTGATGTCTTGGAATACAATGTTGAGTGGTTATGCAAATAATGGGGACGTTGAGGCTTGCGAGCATCTGTTTGAAGAGATGCCCGAGCGGAATGTTTTCTCCTGGAATGGATTGATTGGAGGGTATGCTCATAATGGGCGTTTCTTTGATGTATTGAGTTGTTTCAAACGGATGCTACTCGATGGCCAGGTAGTTCCCAATGATGCCACCCTTGTGACTGTGTTGTCTGCTTGTGCGAGATTGGGAGCTCTTGATTTGGGAAAGTGGGTGCATGTATACGCTGCGACCATCGGGTTTAAAAGAAACATTTATGTTGGAAATGCATTGATAGACATGTATGCAAAATGCGGTGTGATAGAGAATGCAATGGAAGTATTTGGGAGCATGGATTCAAAAGACCTTATTTCATGGAATTCTGTGATCTGTGGCTTGGCAACTCATGGATGTGGGGCGGATGCGTTAAATTTGTTTCACCGGATGAAGAACTCTGGCAAAAAACCGGATGGAATCACCTTCATTGGGGTATTGTGCTCCTGTACTCATCTTGGATTAGTTGAAGAAGGCATCTCATATTTCAACTCAATGGTCCATGACTACTCCATTGATCCTCAGATTGAGCATTATGGTTGTATGGTCGATCTATTTGGCCGAGCTGGTCTTTTGGACCGGGCAATTGAGTTCGTGAAGAGGATGCCGATCAAAGCAGATGCTGTCATCTGGGCTGCCCTGCTAGGTGCGTGCAGGACTTACAAAAACATTGATTTGGCAGAATTAGCTCTTCAGAAACTCATTCAGCTCGAACCCAAAAACCCTGCAAATTACGTCATGCTATCGAATATTTATGGAGATCTTAGTAGATGGAAAGACGTTGCACGGTTGAAGATTTTAATGAGGGACACCGGGTTCAAAAAGTTGCCAGGATGTAGCTTGATTGAGGTAAATGATAGCGTGGTTGAGTTTTATTCCTTAGATGAGAGGCATTCTCAGAGCAAGGAAATATATGGAGTCTTAAAGGGATTGATGAAATTGTTAAGATCGTATGGATATGAACCAAAT

Coding sequence (CDS)

GCCCGCCATCTGTTCGATCATTTTCCTGACCCAAAAGTGGCGCTCTGGAATTCAATCCTCAGAGGGTATGCCCACAACGAGTTTTACAGGGAAGTGGTTCTTTTGTTTGGGAAAATGAAGAGCACGGACGTGAGACCCAACTGCTTCACGTTCCCTCTTGTTCTCAAATCTTGTGCCAAAATCAATGCATTCGTGGAAGGGGAGGAGATTCATTGTGAGGTCATTAAGGGTGGGTTTAGAGGGAACCAATTCGTGGCTACTACTCTGATCGATGTGTACTCTGGGCATATGGAGGCAGCAAGAAAGCTGTTTGATGCAATGCCGTACCGTGATGTGATGTCTTGGAATACAATGTTGAGTGGTTATGCAAATAATGGGGACGTTGAGGCTTGCGAGCATCTGTTTGAAGAGATGCCCGAGCGGAATGTTTTCTCCTGGAATGGATTGATTGGAGGGTATGCTCATAATGGGCGTTTCTTTGATGTATTGAGTTGTTTCAAACGGATGCTACTCGATGGCCAGGTAGTTCCCAATGATGCCACCCTTGTGACTGTGTTGTCTGCTTGTGCGAGATTGGGAGCTCTTGATTTGGGAAAGTGGGTGCATGTATACGCTGCGACCATCGGGTTTAAAAGAAACATTTATGTTGGAAATGCATTGATAGACATGTATGCAAAATGCGGTGTGATAGAGAATGCAATGGAAGTATTTGGGAGCATGGATTCAAAAGACCTTATTTCATGGAATTCTGTGATCTGTGGCTTGGCAACTCATGGATGTGGGGCGGATGCGTTAAATTTGTTTCACCGGATGAAGAACTCTGGCAAAAAACCGGATGGAATCACCTTCATTGGGGTATTGTGCTCCTGTACTCATCTTGGATTAGTTGAAGAAGGCATCTCATATTTCAACTCAATGGTCCATGACTACTCCATTGATCCTCAGATTGAGCATTATGGTTGTATGGTCGATCTATTTGGCCGAGCTGGTCTTTTGGACCGGGCAATTGAGTTCGTGAAGAGGATGCCGATCAAAGCAGATGCTGTCATCTGGGCTGCCCTGCTAGGTGCGTGCAGGACTTACAAAAACATTGATTTGGCAGAATTAGCTCTTCAGAAACTCATTCAGCTCGAACCCAAAAACCCTGCAAATTACGTCATGCTATCGAATATTTATGGAGATCTTAGTAGATGGAAAGACGTTGCACGGTTGAAGATTTTAATGAGGGACACCGGGTTCAAAAAGTTGCCAGGATGTAGCTTGATTGAGGTAAATGATAGCGTGGTTGAGTTTTATTCCTTAGATGAGAGGCATTCTCAGAGCAAGGAAATATATGGAGTCTTAAAGGGATTGATGAAATTGTTAAGATCGTATGGATATGAACCAAAT

Protein sequence

ARHLFDHFPDPKVALWNSILRGYAHNEFYREVVLLFGKMKSTDVRPNCFTFPLVLKSCAKINAFVEGEEIHCEVIKGGFRGNQFVATTLIDVYSGHMEAARKLFDAMPYRDVMSWNTMLSGYANNGDVEACEHLFEEMPERNVFSWNGLIGGYAHNGRFFDVLSCFKRMLLDGQVVPNDATLVTVLSACARLGALDLGKWVHVYAATIGFKRNIYVGNALIDMYAKCGVIENAMEVFGSMDSKDLISWNSVICGLATHGCGADALNLFHRMKNSGKKPDGITFIGVLCSCTHLGLVEEGISYFNSMVHDYSIDPQIEHYGCMVDLFGRAGLLDRAIEFVKRMPIKADAVIWAALLGACRTYKNIDLAELALQKLIQLEPKNPANYVMLSNIYGDLSRWKDVARLKILMRDTGFKKLPGCSLIEVNDSVVEFYSLDERHSQSKEIYGVLKGLMKLLRSYGYEPN
Homology
BLAST of MC04g0907 vs. ExPASy Swiss-Prot
Match: Q9SIL5 (Pentatricopeptide repeat-containing protein At2g20540 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E78 PE=2 SV=1)

HSP 1 Score: 421.0 bits (1081), Expect = 1.7e-116
Identity = 200/453 (44.15%), Postives = 305/453 (67.33%), Query Frame = 0

Query: 1   ARHLFDHFPDPKVALWNSILRGYAHNEFYREVVLLFGKM--KSTDVRPNCFTFPLVLKSC 60
           A  LF+   +P V L+NSI+R Y HN  Y +V+ ++ ++  KS ++ P+ FTFP + KSC
Sbjct: 61  ATRLFNQVSNPNVFLYNSIIRAYTHNSLYCDVIRIYKQLLRKSFEL-PDRFTFPFMFKSC 120

Query: 61  AKINAFVEGEEIHCEVIKGGFRGNQFVATTLIDVYS--GHMEAARKLFDAMPYRDVMSWN 120
           A + +   G+++H  + K G R +      LID+Y     +  A K+FD M  RDV+SWN
Sbjct: 121 ASLGSCYLGKQVHGHLCKFGPRFHVVTENALIDMYMKFDDLVDAHKVFDEMYERDVISWN 180

Query: 121 TMLSGYANNGDVEACEHLFEEMPERNVFSWNGLIGGYAHNGRFFDVLSCFKRMLLDGQVV 180
           ++LSGYA  G ++  + LF  M ++ + SW  +I GY   G + + +  F+ M L G + 
Sbjct: 181 SLLSGYARLGQMKKAKGLFHLMLDKTIVSWTAMISGYTGIGCYVEAMDFFREMQLAG-IE 240

Query: 181 PNDATLVTVLSACARLGALDLGKWVHVYAATIGFKRNIYVGNALIDMYAKCGVIENAMEV 240
           P++ +L++VL +CA+LG+L+LGKW+H+YA   GF +   V NALI+MY+KCGVI  A+++
Sbjct: 241 PDEISLISVLPSCAQLGSLELGKWIHLYAERRGFLKQTGVCNALIEMYSKCGVISQAIQL 300

Query: 241 FGSMDSKDLISWNSVICGLATHGCGADALNLFHRMKNSGKKPDGITFIGVLCSCTHLGLV 300
           FG M+ KD+ISW+++I G A HG    A+  F+ M+ +  KP+GITF+G+L +C+H+G+ 
Sbjct: 301 FGQMEGKDVISWSTMISGYAYHGNAHGAIETFNEMQRAKVKPNGITFLGLLSACSHVGMW 360

Query: 301 EEGISYFNSMVHDYSIDPQIEHYGCMVDLFGRAGLLDRAIEFVKRMPIKADAVIWAALLG 360
           +EG+ YF+ M  DY I+P+IEHYGC++D+  RAG L+RA+E  K MP+K D+ IW +LL 
Sbjct: 361 QEGLRYFDMMRQDYQIEPKIEHYGCLIDVLARAGKLERAVEITKTMPMKPDSKIWGSLLS 420

Query: 361 ACRTYKNIDLAELALQKLIQLEPKNPANYVMLSNIYGDLSRWKDVARLKILMRDTGFKKL 420
           +CRT  N+D+A +A+  L++LEP++  NYV+L+NIY DL +W+DV+RL+ ++R+   KK 
Sbjct: 421 SCRTPGNLDVALVAMDHLVELEPEDMGNYVLLANIYADLGKWEDVSRLRKMIRNENMKKT 480

Query: 421 PGCSLIEVNDSVVEFYSLDERHSQSKEIYGVLK 450
           PG SLIEVN+ V EF S D       EI  VL+
Sbjct: 481 PGGSLIEVNNIVQEFVSGDNSKPFWTEISIVLQ 511

BLAST of MC04g0907 vs. ExPASy Swiss-Prot
Match: O82380 (Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H33 PE=2 SV=1)

HSP 1 Score: 416.4 bits (1069), Expect = 4.3e-115
Identity = 190/461 (41.21%), Postives = 298/461 (64.64%), Query Frame = 0

Query: 4   LFDHFPDPKVALWNSILRGYAHNEFYREVVLLFGKMKSTDVRPNCFTFPLVLKSCAKINA 63
           +F    +  V  WNS++ G+       + + LF KM+S DV+ +  T   VL +CAKI  
Sbjct: 188 VFTTIKEKDVVSWNSMINGFVQKGSPDKALELFKKMESEDVKASHVTMVGVLSACAKIRN 247

Query: 64  FVEGEEIHCEVIKGGFRGNQFVATTLIDVYS--GHMEAARKLFDAMPYRDVMSWNTMLSG 123
              G ++   + +     N  +A  ++D+Y+  G +E A++LFDAM  +D ++W TML G
Sbjct: 248 LEFGRQVCSYIEENRVNVNLTLANAMLDMYTKCGSIEDAKRLFDAMEEKDNVTWTTMLDG 307

Query: 124 YANNGDVEACEHLFEEMPERNVFSWNGLIGGYAHNGRFFDVLSCFKRMLLDGQVVPNDAT 183
           YA + D EA   +   MP++++ +WN LI  Y  NG+  + L  F  + L   +  N  T
Sbjct: 308 YAISEDYEAAREVLNSMPQKDIVAWNALISAYEQNGKPNEALIVFHELQLQKNMKLNQIT 367

Query: 184 LVTVLSACARLGALDLGKWVHVYAATIGFKRNIYVGNALIDMYAKCGVIENAMEVFGSMD 243
           LV+ LSACA++GAL+LG+W+H Y    G + N +V +ALI MY+KCG +E + EVF S++
Sbjct: 368 LVSTLSACAQVGALELGRWIHSYIKKHGIRMNFHVTSALIHMYSKCGDLEKSREVFNSVE 427

Query: 244 SKDLISWNSVICGLATHGCGADALNLFHRMKNSGKKPDGITFIGVLCSCTHLGLVEEGIS 303
            +D+  W+++I GLA HGCG +A+++F++M+ +  KP+G+TF  V C+C+H GLV+E  S
Sbjct: 428 KRDVFVWSAMIGGLAMHGCGNEAVDMFYKMQEANVKPNGVTFTNVFCACSHTGLVDEAES 487

Query: 304 YFNSMVHDYSIDPQIEHYGCMVDLFGRAGLLDRAIEFVKRMPIKADAVIWAALLGACRTY 363
            F+ M  +Y I P+ +HY C+VD+ GR+G L++A++F++ MPI     +W ALLGAC+ +
Sbjct: 488 LFHQMESNYGIVPEEKHYACIVDVLGRSGYLEKAVKFIEAMPIPPSTSVWGALLGACKIH 547

Query: 364 KNIDLAELALQKLIQLEPKNPANYVMLSNIYGDLSRWKDVARLKILMRDTGFKKLPGCSL 423
            N++LAE+A  +L++LEP+N   +V+LSNIY  L +W++V+ L+  MR TG KK PGCS 
Sbjct: 548 ANLNLAEMACTRLLELEPRNDGAHVLLSNIYAKLGKWENVSELRKHMRVTGLKKEPGCSS 607

Query: 424 IEVNDSVVEFYSLDERHSQSKEIYGVLKGLMKLLRSYGYEP 463
           IE++  + EF S D  H  S+++YG L  +M+ L+S GYEP
Sbjct: 608 IEIDGMIHEFLSGDNAHPMSEKVYGKLHEVMEKLKSNGYEP 648

BLAST of MC04g0907 vs. ExPASy Swiss-Prot
Match: Q9FI80 (Pentatricopeptide repeat-containing protein At5g48910 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H38 PE=2 SV=1)

HSP 1 Score: 400.2 bits (1027), Expect = 3.2e-110
Identity = 197/481 (40.96%), Postives = 297/481 (61.75%), Query Frame = 0

Query: 1   ARHLFDHFPDPKVALWNSILRGYAHNEFYREVV---LLFGKMKSTDVRPNCFTFPLVLKS 60
           A  +F+  P      WN+I+RG++ ++  + ++   L +  M    V PN FTFP VLK+
Sbjct: 78  AHKIFNQMPQRNCFSWNTIIRGFSESDEDKALIAITLFYEMMSDEFVEPNRFTFPSVLKA 137

Query: 61  CAKINAFVEGEEIHCEVIKGGFRGNQFVATTLIDVY--SGHMEAARKLF-------DAMP 120
           CAK     EG++IH   +K GF G++FV + L+ +Y   G M+ AR LF       D + 
Sbjct: 138 CAKTGKIQEGKQIHGLALKYGFGGDEFVMSNLVRMYVMCGFMKDARVLFYKNIIEKDMVV 197

Query: 121 YRD-------VMSWNTMLSGYANNGDVEACEHLFEEMPERNVFSWNGLIGGYAHNGRFFD 180
             D       ++ WN M+ GY   GD +A   LF++M +R+V SWN +I GY+ NG F D
Sbjct: 198 MTDRRKRDGEIVLWNVMIDGYMRLGDCKAARMLFDKMRQRSVVSWNTMISGYSLNGFFKD 257

Query: 181 VLSCFKRMLLDGQVVPNDATLVTVLSACARLGALDLGKWVHVYAATIGFKRNIYVGNALI 240
            +  F+ M   G + PN  TLV+VL A +RLG+L+LG+W+H+YA   G + +  +G+ALI
Sbjct: 258 AVEVFREM-KKGDIRPNYVTLVSVLPAISRLGSLELGEWLHLYAEDSGIRIDDVLGSALI 317

Query: 241 DMYAKCGVIENAMEVFGSMDSKDLISWNSVICGLATHGCGADALNLFHRMKNSGKKPDGI 300
           DMY+KCG+IE A+ VF  +  +++I+W+++I G A HG   DA++ F +M+ +G +P  +
Sbjct: 318 DMYSKCGIIEKAIHVFERLPRENVITWSAMINGFAIHGQAGDAIDCFCKMRQAGVRPSDV 377

Query: 301 TFIGVLCSCTHLGLVEEGISYFNSMVHDYSIDPQIEHYGCMVDLFGRAGLLDRAIEFVKR 360
            +I +L +C+H GLVEEG  YF+ MV    ++P+IEHYGCMVDL GR+GLLD A EF+  
Sbjct: 378 AYINLLTACSHGGLVEEGRRYFSQMVSVDGLEPRIEHYGCMVDLLGRSGLLDEAEEFILN 437

Query: 361 MPIKADAVIWAALLGACRTYKNIDLAELALQKLIQLEPKNPANYVMLSNIYGDLSRWKDV 420
           MPIK D VIW ALLGACR   N+++ +     L+ + P +   YV LSN+Y     W +V
Sbjct: 438 MPIKPDDVIWKALLGACRMQGNVEMGKRVANILMDMVPHDSGAYVALSNMYASQGNWSEV 497

Query: 421 ARLKILMRDTGFKKLPGCSLIEVNDSVVEFYSLDERHSQSKEIYGVLKGLMKLLRSYGYE 463
           + +++ M++   +K PGCSLI+++  + EF   D+ H ++KEI  +L  +   LR  GY 
Sbjct: 498 SEMRLRMKEKDIRKDPGCSLIDIDGVLHEFVVEDDSHPKAKEINSMLVEISDKLRLAGYR 557

BLAST of MC04g0907 vs. ExPASy Swiss-Prot
Match: Q9SZT8 (Pentatricopeptide repeat-containing protein ELI1, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=ELI1 PE=3 SV=1)

HSP 1 Score: 396.7 bits (1018), Expect = 3.5e-109
Identity = 196/463 (42.33%), Postives = 297/463 (64.15%), Query Frame = 0

Query: 4   LFDHFPDPKVALWNSILRGYAHNEFYREVVLLFGKMKSTDVRPNCFTFPLVLKSCAKINA 63
           LF    DP + L+ + +   + N    +  LL+ ++ S+++ PN FTF  +LKSC+    
Sbjct: 86  LFHQTIDPDLFLFTAAINTASINGLKDQAFLLYVQLLSSEINPNEFTFSSLLKSCST--- 145

Query: 64  FVEGEEIHCEVIKGGFRGNQFVATTLIDVYS--GHMEAARKLFDAMPYRDVMSWNTMLSG 123
              G+ IH  V+K G   + +VAT L+DVY+  G + +A+K+FD MP R ++S   M++ 
Sbjct: 146 -KSGKLIHTHVLKFGLGIDPYVATGLVDVYAKGGDVVSAQKVFDRMPERSLVSSTAMITC 205

Query: 124 YANNGDVEACEHLFEEMPERNVFSWNGLIGGYAHNGRFFDVLSCFKRMLLDGQVVPNDAT 183
           YA  G+VEA   LF+ M ER++ SWN +I GYA +G   D L  F+++L +G+  P++ T
Sbjct: 206 YAKQGNVEAARALFDSMCERDIVSWNVMIDGYAQHGFPNDALMLFQKLLAEGKPKPDEIT 265

Query: 184 LVTVLSACARLGALDLGKWVHVYAATIGFKRNIYVGNALIDMYAKCGVIENAMEVFGSMD 243
           +V  LSAC+++GAL+ G+W+HV+  +   + N+ V   LIDMY+KCG +E A+ VF    
Sbjct: 266 VVAALSACSQIGALETGRWIHVFVKSSRIRLNVKVCTGLIDMYSKCGSLEEAVLVFNDTP 325

Query: 244 SKDLISWNSVICGLATHGCGADALNLFHRMKN-SGKKPDGITFIGVLCSCTHLGLVEEGI 303
            KD+++WN++I G A HG   DAL LF+ M+  +G +P  ITFIG L +C H GLV EGI
Sbjct: 326 RKDIVAWNAMIAGYAMHGYSQDALRLFNEMQGITGLQPTDITFIGTLQACAHAGLVNEGI 385

Query: 304 SYFNSMVHDYSIDPQIEHYGCMVDLFGRAGLLDRAIEFVKRMPIKADAVIWAALLGACRT 363
             F SM  +Y I P+IEHYGC+V L GRAG L RA E +K M + AD+V+W+++LG+C+ 
Sbjct: 386 RIFESMGQEYGIKPKIEHYGCLVSLLGRAGQLKRAYETIKNMNMDADSVLWSSVLGSCKL 445

Query: 364 YKNIDLAELALQKLIQLEPKNPANYVMLSNIYGDLSRWKDVARLKILMRDTGFKKLPGCS 423
           + +  L +   + LI L  KN   YV+LSNIY  +  ++ VA+++ LM++ G  K PG S
Sbjct: 446 HGDFVLGKEIAEYLIGLNIKNSGIYVLLSNIYASVGDYEGVAKVRNLMKEKGIVKEPGIS 505

Query: 424 LIEVNDSVVEFYSLDERHSQSKEIYGVLKGLMKLLRSYGYEPN 464
            IE+ + V EF + D  HS+SKEIY +L+ + + ++S+GY PN
Sbjct: 506 TIEIENKVHEFRAGDREHSKSKEIYTMLRKISERIKSHGYVPN 544

BLAST of MC04g0907 vs. ExPASy Swiss-Prot
Match: Q9LN01 (Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H12 PE=2 SV=1)

HSP 1 Score: 394.0 bits (1011), Expect = 2.3e-108
Identity = 209/565 (36.99%), Postives = 305/565 (53.98%), Query Frame = 0

Query: 4   LFDHFPDPKVALWNSILRGYAHNEFYREVVLLFGKMKSTDVRPNCFTFPLVLKSCAKINA 63
           +F    +P + +WN++ RG+A +      + L+  M S  + PN +TFP VLKSCAK  A
Sbjct: 90  VFKTIQEPNLLIWNTMFRGHALSSDPVSALKLYVCMISLGLLPNSYTFPFVLKSCAKSKA 149

Query: 64  FVEGEEIHCEVIKGGFRGNQFVATTLIDVY------------------------------ 123
           F EG++IH  V+K G   + +V T+LI +Y                              
Sbjct: 150 FKEGQQIHGHVLKLGCDLDLYVHTSLISMYVQNGRLEDAHKVFDKSPHRDVVSYTALIKG 209

Query: 124 ---SGHMEAARKLFDAMPYRDVMSWNTMLSGYANNGD----------------------- 183
               G++E A+KLFD +P +DV+SWN M+SGYA  G+                       
Sbjct: 210 YASRGYIENAQKLFDEIPVKDVVSWNAMISGYAETGNYKEALELFKDMMKTNVRPDESTM 269

Query: 184 ---VEAC-----------------EH---------------------------LFEEMPE 243
              V AC                 +H                           LFE +P 
Sbjct: 270 VTVVSACAQSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSKCGELETACGLFERLPY 329

Query: 244 RNVFSWNGLIGGYAHNGRFFDVLSCFKRMLLDGQVVPNDATLVTVLSACARLGALDLGKW 303
           ++V SWN LIGGY H   + + L  F+ ML  G+  PND T++++L ACA LGA+D+G+W
Sbjct: 330 KDVISWNTLIGGYTHMNLYKEALLLFQEMLRSGE-TPNDVTMLSILPACAHLGAIDIGRW 389

Query: 304 VHVY--AATIGFKRNIYVGNALIDMYAKCGVIENAMEVFGSMDSKDLISWNSVICGLATH 363
           +HVY      G      +  +LIDMYAKCG IE A +VF S+  K L SWN++I G A H
Sbjct: 390 IHVYIDKRLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSILHKSLSSWNAMIFGFAMH 449

Query: 364 GCGADALNLFHRMKNSGKKPDGITFIGVLCSCTHLGLVEEGISYFNSMVHDYSIDPQIEH 423
           G    + +LF RM+  G +PD ITF+G+L +C+H G+++ G   F +M  DY + P++EH
Sbjct: 450 GRADASFDLFSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRHIFRTMTQDYKMTPKLEH 509

Query: 424 YGCMVDLFGRAGLLDRAIEFVKRMPIKADAVIWAALLGACRTYKNIDLAELALQKLIQLE 464
           YGCM+DL G +GL   A E +  M ++ D VIW +LL AC+ + N++L E   + LI++E
Sbjct: 510 YGCMIDLLGHSGLFKEAEEMINMMEMEPDGVIWCSLLKACKMHGNVELGESFAENLIKIE 569

BLAST of MC04g0907 vs. NCBI nr
Match: XP_022139698.1 (pentatricopeptide repeat-containing protein At3g29230-like [Momordica charantia])

HSP 1 Score: 939 bits (2426), Expect = 0.0
Identity = 463/527 (87.86%), Postives = 463/527 (87.86%), Query Frame = 0

Query: 1   ARHLFDHFPDPKVALWNSILRGYAHNEFYREVVLLFGKMKSTDVRPNCFTFPLVLKSCAK 60
           ARHLFDHFPDPKVALWNSILRGYAHNEFYREVVLLFGKMKSTDVRPNCFTFPLVLKSCAK
Sbjct: 99  ARHLFDHFPDPKVALWNSILRGYAHNEFYREVVLLFGKMKSTDVRPNCFTFPLVLKSCAK 158

Query: 61  INAFVEGEEIHCEVIKGGFRGNQFVATTLIDVYSG------------------------- 120
           INAFVEGEEIHCEVIKGGFRGNQFVATTLIDVYSG                         
Sbjct: 159 INAFVEGEEIHCEVIKGGFRGNQFVATTLIDVYSGGRAIGSAYKVFVGMLERNIVAWTSM 218

Query: 121 ---------------------------------------HMEAARKLFDAMPYRDVMSWN 180
                                                  HMEAARKLFDAMPYRDVMSWN
Sbjct: 219 ISGYILCNDVTFARRLFDLAPERDVVLWSIMVSGYIEIGHMEAARKLFDAMPYRDVMSWN 278

Query: 181 TMLSGYANNGDVEACEHLFEEMPERNVFSWNGLIGGYAHNGRFFDVLSCFKRMLLDGQVV 240
           TMLSGYANNGDVEACEHLFEEMPERNVFSWNGLIGGYAHNGRFFDVLSCFKRMLLDGQVV
Sbjct: 279 TMLSGYANNGDVEACEHLFEEMPERNVFSWNGLIGGYAHNGRFFDVLSCFKRMLLDGQVV 338

Query: 241 PNDATLVTVLSACARLGALDLGKWVHVYAATIGFKRNIYVGNALIDMYAKCGVIENAMEV 300
           PNDATLVTVLSACARLGALDLGKWVHVYAATIGFKRNIYVGNALIDMYAKCGVIENAMEV
Sbjct: 339 PNDATLVTVLSACARLGALDLGKWVHVYAATIGFKRNIYVGNALIDMYAKCGVIENAMEV 398

Query: 301 FGSMDSKDLISWNSVICGLATHGCGADALNLFHRMKNSGKKPDGITFIGVLCSCTHLGLV 360
           FGSMDSKDLISWNSVICGLATHGCGADALNLFHRMKNSGKKPDGITFIGVLCSCTHLGLV
Sbjct: 399 FGSMDSKDLISWNSVICGLATHGCGADALNLFHRMKNSGKKPDGITFIGVLCSCTHLGLV 458

Query: 361 EEGISYFNSMVHDYSIDPQIEHYGCMVDLFGRAGLLDRAIEFVKRMPIKADAVIWAALLG 420
           EEGISYFNSMVHDYSIDPQIEHYGCMVDLFGRAGLLDRAIEFVKRMPIKADAVIWAALLG
Sbjct: 459 EEGISYFNSMVHDYSIDPQIEHYGCMVDLFGRAGLLDRAIEFVKRMPIKADAVIWAALLG 518

Query: 421 ACRTYKNIDLAELALQKLIQLEPKNPANYVMLSNIYGDLSRWKDVARLKILMRDTGFKKL 463
           ACRTYKNIDLAELALQKLIQLEPKNPANYVMLSNIYGDLSRWKDVARLKILMRDTGFKKL
Sbjct: 519 ACRTYKNIDLAELALQKLIQLEPKNPANYVMLSNIYGDLSRWKDVARLKILMRDTGFKKL 578

BLAST of MC04g0907 vs. NCBI nr
Match: XP_038886719.1 (pentatricopeptide repeat-containing protein At2g29760, chloroplastic-like [Benincasa hispida])

HSP 1 Score: 866 bits (2238), Expect = 5.71e-315
Identity = 425/527 (80.65%), Postives = 439/527 (83.30%), Query Frame = 0

Query: 1   ARHLFDHFPDPKVALWNSILRGYAHNEFYREVVLLFGKMKSTDVRPNCFTFPLVLKSCAK 60
           ARHLFDHFPDPKVALWN+  RGY HNEFYREV+ LF KMKS DVRPNCFTFPLVLKSCAK
Sbjct: 4   ARHLFDHFPDPKVALWNATFRGYFHNEFYREVIFLFAKMKSMDVRPNCFTFPLVLKSCAK 63

Query: 61  INAFVEGEEIHCEVIKGGFRGNQFVATTLIDVYSG------------------------- 120
           I AFVEGEEIHCEVIKGGF GNQFVATTLIDVYSG                         
Sbjct: 64  IRAFVEGEEIHCEVIKGGFEGNQFVATTLIDVYSGGRVIGSAYKVFVGMLERNIVAWTSM 123

Query: 121 ---------------------------------------HMEAARKLFDAMPYRDVMSWN 180
                                                   MEAARKLFD MPYRD MSWN
Sbjct: 124 ISGYILCNDVALARRLFDLAPERDVVLWNIMVSGYIEIGDMEAARKLFDTMPYRDTMSWN 183

Query: 181 TMLSGYANNGDVEACEHLFEEMPERNVFSWNGLIGGYAHNGRFFDVLSCFKRMLLDGQVV 240
           TMLSGYANNGDVEACE LFEEMPERNVFSWNGLIGGYAHNGRFF+VL CFKRML D  VV
Sbjct: 184 TMLSGYANNGDVEACEKLFEEMPERNVFSWNGLIGGYAHNGRFFEVLRCFKRMLTDAVVV 243

Query: 241 PNDATLVTVLSACARLGALDLGKWVHVYAATIGFKRNIYVGNALIDMYAKCGVIENAMEV 300
           PNDATLVTVLSACARLGALDLGKWVH+YAATIGFK NIYVGNALIDMY+KCGVIENAMEV
Sbjct: 244 PNDATLVTVLSACARLGALDLGKWVHIYAATIGFKGNIYVGNALIDMYSKCGVIENAMEV 303

Query: 301 FGSMDSKDLISWNSVICGLATHGCGADALNLFHRMKNSGKKPDGITFIGVLCSCTHLGLV 360
           FGSMD KDLI+WNSVICGLAT+GCG DALNLFH+MK +G+KPDGITFIGVLCSCTHLGLV
Sbjct: 304 FGSMDLKDLITWNSVICGLATNGCGVDALNLFHQMKITGEKPDGITFIGVLCSCTHLGLV 363

Query: 361 EEGISYFNSMVHDYSIDPQIEHYGCMVDLFGRAGLLDRAIEFVKRMPIKADAVIWAALLG 420
           EEGISYFNSMVHDYSI PQIEHYGCMVDLFGRAGLLDRAIEFVKRMP++ADAVIWAALLG
Sbjct: 364 EEGISYFNSMVHDYSITPQIEHYGCMVDLFGRAGLLDRAIEFVKRMPMEADAVIWAALLG 423

Query: 421 ACRTYKNIDLAELALQKLIQLEPKNPANYVMLSNIYGDLSRWKDVARLKILMRDTGFKKL 463
           ACR YKNIDLAELALQKLIQLEPKNPANYVMLSNIYGDL RWKDVARLKILMRDTG KK+
Sbjct: 424 ACRIYKNIDLAELALQKLIQLEPKNPANYVMLSNIYGDLGRWKDVARLKILMRDTGSKKM 483

BLAST of MC04g0907 vs. NCBI nr
Match: XP_023549948.1 (pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 855 bits (2209), Expect = 1.46e-310
Identity = 418/527 (79.32%), Postives = 440/527 (83.49%), Query Frame = 0

Query: 1   ARHLFDHFPDPKVALWNSILRGYAHNEFYREVVLLFGKMKSTDVRPNCFTFPLVLKSCAK 60
           ARHLFDHFPDPK+ALWN+  RGY  NEFYREV+ L GKMKS DVRPNCFTFPLVLKSCAK
Sbjct: 4   ARHLFDHFPDPKLALWNATFRGYVRNEFYREVIFLSGKMKSMDVRPNCFTFPLVLKSCAK 63

Query: 61  INAFVEGEEIHCEVIKGGFRGNQFVATTLIDVYS-------------------------- 120
           I+AFVEGE+IHCEVIKGGF GNQFVATTLIDVYS                          
Sbjct: 64  ISAFVEGEQIHCEVIKGGFEGNQFVATTLIDVYSVGRAIGSAYKVFVGMLERNVVAWTSM 123

Query: 121 --------------------------------------GHMEAARKLFDAMPYRDVMSWN 180
                                                 G M AARKLFDAMPYRDVMSWN
Sbjct: 124 ISGYILCNDVVFARRLFDLAPERDVVLWNIMVSGYIEIGDMAAARKLFDAMPYRDVMSWN 183

Query: 181 TMLSGYANNGDVEACEHLFEEMPERNVFSWNGLIGGYAHNGRFFDVLSCFKRMLLDGQVV 240
           TML+GYANNGD+EACE LFEEMPERNVFSWNGLIG YAH G FF+VL CFKRML+DG VV
Sbjct: 184 TMLNGYANNGDIEACEQLFEEMPERNVFSWNGLIGAYAHKGCFFEVLGCFKRMLIDGLVV 243

Query: 241 PNDATLVTVLSACARLGALDLGKWVHVYAATIGFKRNIYVGNALIDMYAKCGVIENAMEV 300
           PNDATLVTVLSACARLGALDLGKWVHVYAATIGFK +IYVGNALIDMY+KCGVIENAMEV
Sbjct: 244 PNDATLVTVLSACARLGALDLGKWVHVYAATIGFKGSIYVGNALIDMYSKCGVIENAMEV 303

Query: 301 FGSMDSKDLISWNSVICGLATHGCGADALNLFHRMKNSGKKPDGITFIGVLCSCTHLGLV 360
           FGSMDSKD+I+WNSVICGLATHGCGADALNLFH+MK +G+KPDGITFIGVLCSCTHLGLV
Sbjct: 304 FGSMDSKDVITWNSVICGLATHGCGADALNLFHQMKMTGEKPDGITFIGVLCSCTHLGLV 363

Query: 361 EEGISYFNSMVHDYSIDPQIEHYGCMVDLFGRAGLLDRAIEFVKRMPIKADAVIWAALLG 420
           EEGISYFNSMVHDYSI PQIEHYGCMVDLFGRAGLLDRAIEFVKRMP++ADAVIWAALLG
Sbjct: 364 EEGISYFNSMVHDYSIVPQIEHYGCMVDLFGRAGLLDRAIEFVKRMPMEADAVIWAALLG 423

Query: 421 ACRTYKNIDLAELALQKLIQLEPKNPANYVMLSNIYGDLSRWKDVARLKILMRDTGFKKL 463
           ACR +KNIDL ELALQKLIQLEPKNPANYVMLSNIYGD+ RWKDVARLKILMRDTG KKL
Sbjct: 424 ACRIHKNIDLGELALQKLIQLEPKNPANYVMLSNIYGDVGRWKDVARLKILMRDTGSKKL 483

BLAST of MC04g0907 vs. NCBI nr
Match: XP_022938516.1 (pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like [Cucurbita moschata])

HSP 1 Score: 852 bits (2201), Expect = 2.41e-309
Identity = 417/527 (79.13%), Postives = 439/527 (83.30%), Query Frame = 0

Query: 1   ARHLFDHFPDPKVALWNSILRGYAHNEFYREVVLLFGKMKSTDVRPNCFTFPLVLKSCAK 60
           ARHLFDHFPDPK+ALWN+  RGY  NEFYREV+ L GKMKS DVRPNCFTFPLVLKSCAK
Sbjct: 4   ARHLFDHFPDPKLALWNATFRGYVRNEFYREVIFLSGKMKSMDVRPNCFTFPLVLKSCAK 63

Query: 61  INAFVEGEEIHCEVIKGGFRGNQFVATTLIDVYS-------------------------- 120
           I+AFVEGE+IHCEVIKGGF GNQFVATTLIDVYS                          
Sbjct: 64  ISAFVEGEQIHCEVIKGGFEGNQFVATTLIDVYSVGRAIGSAYKVFVGMLERNVVAWTSM 123

Query: 121 --------------------------------------GHMEAARKLFDAMPYRDVMSWN 180
                                                 G M AARKLFDAMPYRDVMSWN
Sbjct: 124 ISGYILCNDVVFARRLFDLAPERDVVLWNIMVSGYIEIGDMGAARKLFDAMPYRDVMSWN 183

Query: 181 TMLSGYANNGDVEACEHLFEEMPERNVFSWNGLIGGYAHNGRFFDVLSCFKRMLLDGQVV 240
           TML+GYANNGD+EACE LFEEMPERNVFSWNGLIG YAH G FF+VL CFKRML+DG VV
Sbjct: 184 TMLNGYANNGDIEACEQLFEEMPERNVFSWNGLIGAYAHKGCFFEVLGCFKRMLIDGLVV 243

Query: 241 PNDATLVTVLSACARLGALDLGKWVHVYAATIGFKRNIYVGNALIDMYAKCGVIENAMEV 300
           PNDATLVTVLSACARLGALDLGKWVHVYAATIGFK +IYVGNALIDMY+KCGVIENAMEV
Sbjct: 244 PNDATLVTVLSACARLGALDLGKWVHVYAATIGFKGSIYVGNALIDMYSKCGVIENAMEV 303

Query: 301 FGSMDSKDLISWNSVICGLATHGCGADALNLFHRMKNSGKKPDGITFIGVLCSCTHLGLV 360
           FGSMDSKD+I+WNSVICGLATHGCGADALNLFH+MK +G KPDGITFIGVLCSCTHLGLV
Sbjct: 304 FGSMDSKDVITWNSVICGLATHGCGADALNLFHQMKITGVKPDGITFIGVLCSCTHLGLV 363

Query: 361 EEGISYFNSMVHDYSIDPQIEHYGCMVDLFGRAGLLDRAIEFVKRMPIKADAVIWAALLG 420
           EEGISYFNSMVHDYSI PQIEHYGCMVDLFGRAGLLDRAIEFVKRMP++ADAVIWAALLG
Sbjct: 364 EEGISYFNSMVHDYSIVPQIEHYGCMVDLFGRAGLLDRAIEFVKRMPMEADAVIWAALLG 423

Query: 421 ACRTYKNIDLAELALQKLIQLEPKNPANYVMLSNIYGDLSRWKDVARLKILMRDTGFKKL 463
           ACR +KNIDL ELALQKLIQLEPKNPANYVMLSNIYGD+ RWKDVARLKILMRDTG KKL
Sbjct: 424 ACRIHKNIDLGELALQKLIQLEPKNPANYVMLSNIYGDVGRWKDVARLKILMRDTGSKKL 483

BLAST of MC04g0907 vs. NCBI nr
Match: XP_011656468.1 (pentatricopeptide repeat-containing protein At1g08070, chloroplastic [Cucumis sativus] >KGN45892.1 hypothetical protein Csa_005743 [Cucumis sativus])

HSP 1 Score: 847 bits (2188), Expect = 2.39e-307
Identity = 417/527 (79.13%), Postives = 436/527 (82.73%), Query Frame = 0

Query: 1   ARHLFDHFPDPKVALWNSILRGYAHNEFYREVVLLFGKMKSTDVRPNCFTFPLVLKSCAK 60
           ARHLFDHFPDPKV LWN+I RGY HN FYREVV LFGKMKS DVRPNCFTFPLVLKSCAK
Sbjct: 4   ARHLFDHFPDPKVELWNAISRGYFHNAFYREVVFLFGKMKSMDVRPNCFTFPLVLKSCAK 63

Query: 61  INAFVEGEEIHCEVIKGGFRGNQFVATTLIDVYSG------------------------- 120
           I AFVEGEEIHCEVIKGG  GNQFVATTLIDVYSG                         
Sbjct: 64  IGAFVEGEEIHCEVIKGGLEGNQFVATTLIDVYSGGRAIGSAYKLFVGMLERNIVAWTSM 123

Query: 121 ---------------------------------------HMEAARKLFDAMPYRDVMSWN 180
                                                   M+AARKLFD MPYRD MSWN
Sbjct: 124 ISGYILCNRVALARRLFDLAPERDVVLWNIMVSGYIEIGDMKAARKLFDTMPYRDTMSWN 183

Query: 181 TMLSGYANNGDVEACEHLFEEMPERNVFSWNGLIGGYAHNGRFFDVLSCFKRMLLDGQVV 240
           TML+GYANNGDVEACE LFEEMPERNVFSWNGLIGGYAHNG FF+VL CFKRML+DG VV
Sbjct: 184 TMLNGYANNGDVEACEQLFEEMPERNVFSWNGLIGGYAHNGCFFEVLRCFKRMLIDGLVV 243

Query: 241 PNDATLVTVLSACARLGALDLGKWVHVYAATIGFKRNIYVGNALIDMYAKCGVIENAMEV 300
           PNDATLVTVLSACARLGALDLGKWVHVYAATIGFK +IYVGNALIDMY+KCG+IENAMEV
Sbjct: 244 PNDATLVTVLSACARLGALDLGKWVHVYAATIGFKGSIYVGNALIDMYSKCGLIENAMEV 303

Query: 301 FGSMDSKDLISWNSVICGLATHGCGADALNLFHRMKNSGKKPDGITFIGVLCSCTHLGLV 360
           F SMD KDLI+WNS+ICGLATHGCGADAL LFH+MK +G+KPDGITFIGVLCSCTHLGLV
Sbjct: 304 FESMDLKDLITWNSMICGLATHGCGADALTLFHQMKINGEKPDGITFIGVLCSCTHLGLV 363

Query: 361 EEGISYFNSMVHDYSIDPQIEHYGCMVDLFGRAGLLDRAIEFVKRMPIKADAVIWAALLG 420
           EEG SYFNSMV++YSI PQIEHYGCMVDLFGRAGLLDRAIEFVKRMP++ADAVIWAALLG
Sbjct: 364 EEGTSYFNSMVNEYSIAPQIEHYGCMVDLFGRAGLLDRAIEFVKRMPMEADAVIWAALLG 423

Query: 421 ACRTYKNIDLAELALQKLIQLEPKNPANYVMLSNIYGDLSRWKDVARLKILMRDTGFKKL 463
           ACR YKNIDLAELALQKLI LEPKNPANYV+LSNIYGDL RWKDVARLKILMRDTG KKL
Sbjct: 424 ACRIYKNIDLAELALQKLIVLEPKNPANYVLLSNIYGDLGRWKDVARLKILMRDTGSKKL 483

BLAST of MC04g0907 vs. ExPASy TrEMBL
Match: A0A6J1CGA7 (pentatricopeptide repeat-containing protein At3g29230-like OS=Momordica charantia OX=3673 GN=LOC111010545 PE=4 SV=1)

HSP 1 Score: 939 bits (2426), Expect = 0.0
Identity = 463/527 (87.86%), Postives = 463/527 (87.86%), Query Frame = 0

Query: 1   ARHLFDHFPDPKVALWNSILRGYAHNEFYREVVLLFGKMKSTDVRPNCFTFPLVLKSCAK 60
           ARHLFDHFPDPKVALWNSILRGYAHNEFYREVVLLFGKMKSTDVRPNCFTFPLVLKSCAK
Sbjct: 99  ARHLFDHFPDPKVALWNSILRGYAHNEFYREVVLLFGKMKSTDVRPNCFTFPLVLKSCAK 158

Query: 61  INAFVEGEEIHCEVIKGGFRGNQFVATTLIDVYSG------------------------- 120
           INAFVEGEEIHCEVIKGGFRGNQFVATTLIDVYSG                         
Sbjct: 159 INAFVEGEEIHCEVIKGGFRGNQFVATTLIDVYSGGRAIGSAYKVFVGMLERNIVAWTSM 218

Query: 121 ---------------------------------------HMEAARKLFDAMPYRDVMSWN 180
                                                  HMEAARKLFDAMPYRDVMSWN
Sbjct: 219 ISGYILCNDVTFARRLFDLAPERDVVLWSIMVSGYIEIGHMEAARKLFDAMPYRDVMSWN 278

Query: 181 TMLSGYANNGDVEACEHLFEEMPERNVFSWNGLIGGYAHNGRFFDVLSCFKRMLLDGQVV 240
           TMLSGYANNGDVEACEHLFEEMPERNVFSWNGLIGGYAHNGRFFDVLSCFKRMLLDGQVV
Sbjct: 279 TMLSGYANNGDVEACEHLFEEMPERNVFSWNGLIGGYAHNGRFFDVLSCFKRMLLDGQVV 338

Query: 241 PNDATLVTVLSACARLGALDLGKWVHVYAATIGFKRNIYVGNALIDMYAKCGVIENAMEV 300
           PNDATLVTVLSACARLGALDLGKWVHVYAATIGFKRNIYVGNALIDMYAKCGVIENAMEV
Sbjct: 339 PNDATLVTVLSACARLGALDLGKWVHVYAATIGFKRNIYVGNALIDMYAKCGVIENAMEV 398

Query: 301 FGSMDSKDLISWNSVICGLATHGCGADALNLFHRMKNSGKKPDGITFIGVLCSCTHLGLV 360
           FGSMDSKDLISWNSVICGLATHGCGADALNLFHRMKNSGKKPDGITFIGVLCSCTHLGLV
Sbjct: 399 FGSMDSKDLISWNSVICGLATHGCGADALNLFHRMKNSGKKPDGITFIGVLCSCTHLGLV 458

Query: 361 EEGISYFNSMVHDYSIDPQIEHYGCMVDLFGRAGLLDRAIEFVKRMPIKADAVIWAALLG 420
           EEGISYFNSMVHDYSIDPQIEHYGCMVDLFGRAGLLDRAIEFVKRMPIKADAVIWAALLG
Sbjct: 459 EEGISYFNSMVHDYSIDPQIEHYGCMVDLFGRAGLLDRAIEFVKRMPIKADAVIWAALLG 518

Query: 421 ACRTYKNIDLAELALQKLIQLEPKNPANYVMLSNIYGDLSRWKDVARLKILMRDTGFKKL 463
           ACRTYKNIDLAELALQKLIQLEPKNPANYVMLSNIYGDLSRWKDVARLKILMRDTGFKKL
Sbjct: 519 ACRTYKNIDLAELALQKLIQLEPKNPANYVMLSNIYGDLSRWKDVARLKILMRDTGFKKL 578

BLAST of MC04g0907 vs. ExPASy TrEMBL
Match: A0A6J1FDD3 (pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like OS=Cucurbita moschata OX=3662 GN=LOC111444727 PE=4 SV=1)

HSP 1 Score: 852 bits (2201), Expect = 1.17e-309
Identity = 417/527 (79.13%), Postives = 439/527 (83.30%), Query Frame = 0

Query: 1   ARHLFDHFPDPKVALWNSILRGYAHNEFYREVVLLFGKMKSTDVRPNCFTFPLVLKSCAK 60
           ARHLFDHFPDPK+ALWN+  RGY  NEFYREV+ L GKMKS DVRPNCFTFPLVLKSCAK
Sbjct: 4   ARHLFDHFPDPKLALWNATFRGYVRNEFYREVIFLSGKMKSMDVRPNCFTFPLVLKSCAK 63

Query: 61  INAFVEGEEIHCEVIKGGFRGNQFVATTLIDVYS-------------------------- 120
           I+AFVEGE+IHCEVIKGGF GNQFVATTLIDVYS                          
Sbjct: 64  ISAFVEGEQIHCEVIKGGFEGNQFVATTLIDVYSVGRAIGSAYKVFVGMLERNVVAWTSM 123

Query: 121 --------------------------------------GHMEAARKLFDAMPYRDVMSWN 180
                                                 G M AARKLFDAMPYRDVMSWN
Sbjct: 124 ISGYILCNDVVFARRLFDLAPERDVVLWNIMVSGYIEIGDMGAARKLFDAMPYRDVMSWN 183

Query: 181 TMLSGYANNGDVEACEHLFEEMPERNVFSWNGLIGGYAHNGRFFDVLSCFKRMLLDGQVV 240
           TML+GYANNGD+EACE LFEEMPERNVFSWNGLIG YAH G FF+VL CFKRML+DG VV
Sbjct: 184 TMLNGYANNGDIEACEQLFEEMPERNVFSWNGLIGAYAHKGCFFEVLGCFKRMLIDGLVV 243

Query: 241 PNDATLVTVLSACARLGALDLGKWVHVYAATIGFKRNIYVGNALIDMYAKCGVIENAMEV 300
           PNDATLVTVLSACARLGALDLGKWVHVYAATIGFK +IYVGNALIDMY+KCGVIENAMEV
Sbjct: 244 PNDATLVTVLSACARLGALDLGKWVHVYAATIGFKGSIYVGNALIDMYSKCGVIENAMEV 303

Query: 301 FGSMDSKDLISWNSVICGLATHGCGADALNLFHRMKNSGKKPDGITFIGVLCSCTHLGLV 360
           FGSMDSKD+I+WNSVICGLATHGCGADALNLFH+MK +G KPDGITFIGVLCSCTHLGLV
Sbjct: 304 FGSMDSKDVITWNSVICGLATHGCGADALNLFHQMKITGVKPDGITFIGVLCSCTHLGLV 363

Query: 361 EEGISYFNSMVHDYSIDPQIEHYGCMVDLFGRAGLLDRAIEFVKRMPIKADAVIWAALLG 420
           EEGISYFNSMVHDYSI PQIEHYGCMVDLFGRAGLLDRAIEFVKRMP++ADAVIWAALLG
Sbjct: 364 EEGISYFNSMVHDYSIVPQIEHYGCMVDLFGRAGLLDRAIEFVKRMPMEADAVIWAALLG 423

Query: 421 ACRTYKNIDLAELALQKLIQLEPKNPANYVMLSNIYGDLSRWKDVARLKILMRDTGFKKL 463
           ACR +KNIDL ELALQKLIQLEPKNPANYVMLSNIYGD+ RWKDVARLKILMRDTG KKL
Sbjct: 424 ACRIHKNIDLGELALQKLIQLEPKNPANYVMLSNIYGDVGRWKDVARLKILMRDTGSKKL 483

BLAST of MC04g0907 vs. ExPASy TrEMBL
Match: A0A0A0KBY4 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G017080 PE=4 SV=1)

HSP 1 Score: 847 bits (2188), Expect = 1.16e-307
Identity = 417/527 (79.13%), Postives = 436/527 (82.73%), Query Frame = 0

Query: 1   ARHLFDHFPDPKVALWNSILRGYAHNEFYREVVLLFGKMKSTDVRPNCFTFPLVLKSCAK 60
           ARHLFDHFPDPKV LWN+I RGY HN FYREVV LFGKMKS DVRPNCFTFPLVLKSCAK
Sbjct: 4   ARHLFDHFPDPKVELWNAISRGYFHNAFYREVVFLFGKMKSMDVRPNCFTFPLVLKSCAK 63

Query: 61  INAFVEGEEIHCEVIKGGFRGNQFVATTLIDVYSG------------------------- 120
           I AFVEGEEIHCEVIKGG  GNQFVATTLIDVYSG                         
Sbjct: 64  IGAFVEGEEIHCEVIKGGLEGNQFVATTLIDVYSGGRAIGSAYKLFVGMLERNIVAWTSM 123

Query: 121 ---------------------------------------HMEAARKLFDAMPYRDVMSWN 180
                                                   M+AARKLFD MPYRD MSWN
Sbjct: 124 ISGYILCNRVALARRLFDLAPERDVVLWNIMVSGYIEIGDMKAARKLFDTMPYRDTMSWN 183

Query: 181 TMLSGYANNGDVEACEHLFEEMPERNVFSWNGLIGGYAHNGRFFDVLSCFKRMLLDGQVV 240
           TML+GYANNGDVEACE LFEEMPERNVFSWNGLIGGYAHNG FF+VL CFKRML+DG VV
Sbjct: 184 TMLNGYANNGDVEACEQLFEEMPERNVFSWNGLIGGYAHNGCFFEVLRCFKRMLIDGLVV 243

Query: 241 PNDATLVTVLSACARLGALDLGKWVHVYAATIGFKRNIYVGNALIDMYAKCGVIENAMEV 300
           PNDATLVTVLSACARLGALDLGKWVHVYAATIGFK +IYVGNALIDMY+KCG+IENAMEV
Sbjct: 244 PNDATLVTVLSACARLGALDLGKWVHVYAATIGFKGSIYVGNALIDMYSKCGLIENAMEV 303

Query: 301 FGSMDSKDLISWNSVICGLATHGCGADALNLFHRMKNSGKKPDGITFIGVLCSCTHLGLV 360
           F SMD KDLI+WNS+ICGLATHGCGADAL LFH+MK +G+KPDGITFIGVLCSCTHLGLV
Sbjct: 304 FESMDLKDLITWNSMICGLATHGCGADALTLFHQMKINGEKPDGITFIGVLCSCTHLGLV 363

Query: 361 EEGISYFNSMVHDYSIDPQIEHYGCMVDLFGRAGLLDRAIEFVKRMPIKADAVIWAALLG 420
           EEG SYFNSMV++YSI PQIEHYGCMVDLFGRAGLLDRAIEFVKRMP++ADAVIWAALLG
Sbjct: 364 EEGTSYFNSMVNEYSIAPQIEHYGCMVDLFGRAGLLDRAIEFVKRMPMEADAVIWAALLG 423

Query: 421 ACRTYKNIDLAELALQKLIQLEPKNPANYVMLSNIYGDLSRWKDVARLKILMRDTGFKKL 463
           ACR YKNIDLAELALQKLI LEPKNPANYV+LSNIYGDL RWKDVARLKILMRDTG KKL
Sbjct: 424 ACRIYKNIDLAELALQKLIVLEPKNPANYVLLSNIYGDLGRWKDVARLKILMRDTGSKKL 483

BLAST of MC04g0907 vs. ExPASy TrEMBL
Match: A0A6J1K037 (pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like OS=Cucurbita maxima OX=3661 GN=LOC111489834 PE=4 SV=1)

HSP 1 Score: 847 bits (2187), Expect = 1.58e-307
Identity = 414/527 (78.56%), Postives = 437/527 (82.92%), Query Frame = 0

Query: 1   ARHLFDHFPDPKVALWNSILRGYAHNEFYREVVLLFGKMKSTDVRPNCFTFPLVLKSCAK 60
           ARHLFDHFPDPK+ALWN+  RGY  NEFYREV+ L GKMKS DV+PNCFTFPLVLKSCAK
Sbjct: 4   ARHLFDHFPDPKLALWNATFRGYVRNEFYREVIFLSGKMKSMDVKPNCFTFPLVLKSCAK 63

Query: 61  INAFVEGEEIHCEVIKGGFRGNQFVATTLIDVYS-------------------------- 120
           I+AFVEGE+IHCEVIKGGF GNQFVATTLIDVYS                          
Sbjct: 64  ISAFVEGEQIHCEVIKGGFEGNQFVATTLIDVYSVGRAIGSAYKVFVGMLERNVVAWTSM 123

Query: 121 --------------------------------------GHMEAARKLFDAMPYRDVMSWN 180
                                                 G M AARKLFDAMPYRDVMSWN
Sbjct: 124 ISGYILCNDVVFAHRLFDLAPERDVVLWNIMVSGYIEIGDMAAARKLFDAMPYRDVMSWN 183

Query: 181 TMLSGYANNGDVEACEHLFEEMPERNVFSWNGLIGGYAHNGRFFDVLSCFKRMLLDGQVV 240
           TML+GYANNGD+EACE +FEEMPERNVFSWNGLIG YAH   FF+VL CFKRML+DG VV
Sbjct: 184 TMLNGYANNGDIEACEQMFEEMPERNVFSWNGLIGAYAHKRCFFEVLGCFKRMLIDGLVV 243

Query: 241 PNDATLVTVLSACARLGALDLGKWVHVYAATIGFKRNIYVGNALIDMYAKCGVIENAMEV 300
           PNDATLVTVLSACARLGALDLGKWVHVYAATIGFK +IYVGNALIDMY+KCGVIENAMEV
Sbjct: 244 PNDATLVTVLSACARLGALDLGKWVHVYAATIGFKGSIYVGNALIDMYSKCGVIENAMEV 303

Query: 301 FGSMDSKDLISWNSVICGLATHGCGADALNLFHRMKNSGKKPDGITFIGVLCSCTHLGLV 360
           FGSMDSKD+I+WNSVICGLATHGCGADALNLFH+MK +G KPDGITFIGVLCSCTHLGLV
Sbjct: 304 FGSMDSKDVITWNSVICGLATHGCGADALNLFHQMKITGVKPDGITFIGVLCSCTHLGLV 363

Query: 361 EEGISYFNSMVHDYSIDPQIEHYGCMVDLFGRAGLLDRAIEFVKRMPIKADAVIWAALLG 420
           EEG SYFNSMVHDYSI PQIEHYGCMVDLFGRAGLLDRAIEFVKRMP++ADAVIWAALLG
Sbjct: 364 EEGTSYFNSMVHDYSIVPQIEHYGCMVDLFGRAGLLDRAIEFVKRMPMEADAVIWAALLG 423

Query: 421 ACRTYKNIDLAELALQKLIQLEPKNPANYVMLSNIYGDLSRWKDVARLKILMRDTGFKKL 463
           ACR +KNIDL ELALQKLIQLEPKNPANYVMLSNIYGD+ RWKDVARLKILMRDTG KKL
Sbjct: 424 ACRIHKNIDLGELALQKLIQLEPKNPANYVMLSNIYGDVGRWKDVARLKILMRDTGSKKL 483

BLAST of MC04g0907 vs. ExPASy TrEMBL
Match: A0A5A7V6Y9 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold41G001800 PE=4 SV=1)

HSP 1 Score: 837 bits (2161), Expect = 1.15e-302
Identity = 411/527 (77.99%), Postives = 434/527 (82.35%), Query Frame = 0

Query: 1   ARHLFDHFPDPKVALWNSILRGYAHNEFYREVVLLFGKMKSTDVRPNCFTFPLVLKSCAK 60
           ARHLFDHF DPKV LWN+I RGY H  FY+EVV LFGKMKS  VRPNCFTFPLVLKSCAK
Sbjct: 59  ARHLFDHFLDPKVELWNAIFRGYFHKAFYKEVVFLFGKMKSMGVRPNCFTFPLVLKSCAK 118

Query: 61  INAFVEGEEIHCEVIKGGFRGNQFVATTLIDVYS-------------------------- 120
           I AFVEGEEIH EVIKGGF GNQFVATTLIDVYS                          
Sbjct: 119 IGAFVEGEEIHSEVIKGGFEGNQFVATTLIDVYSAGRAIGSAYKVFVVMLERNIVAWTSM 178

Query: 121 --------------------------------------GHMEAARKLFDAMPYRDVMSWN 180
                                                 G M+AARKLFD MPYRD MSWN
Sbjct: 179 ISGYILCNGVALARRLFDLAPERDVVLWNIMVSGYIQIGDMKAARKLFDTMPYRDTMSWN 238

Query: 181 TMLSGYANNGDVEACEHLFEEMPERNVFSWNGLIGGYAHNGRFFDVLSCFKRMLLDGQVV 240
           TML+GYANNGDVEACE +FEEMPERNVFSWNGLIGGYAHNGRFF+VL CFKRML+DG VV
Sbjct: 239 TMLNGYANNGDVEACEQMFEEMPERNVFSWNGLIGGYAHNGRFFEVLRCFKRMLIDGLVV 298

Query: 241 PNDATLVTVLSACARLGALDLGKWVHVYAATIGFKRNIYVGNALIDMYAKCGVIENAMEV 300
           PNDATLVTVLSACARLGALDLGKWVHVYAATIGFK +IYVGNALIDMY+KCGVIENAMEV
Sbjct: 299 PNDATLVTVLSACARLGALDLGKWVHVYAATIGFKGSIYVGNALIDMYSKCGVIENAMEV 358

Query: 301 FGSMDSKDLISWNSVICGLATHGCGADALNLFHRMKNSGKKPDGITFIGVLCSCTHLGLV 360
           F SMD KDLI+WNS+ICGLATHGCGADAL LFH+MK +G+KPDGITFIGVLCSCTHLGLV
Sbjct: 359 FESMDLKDLITWNSMICGLATHGCGADALTLFHQMKINGEKPDGITFIGVLCSCTHLGLV 418

Query: 361 EEGISYFNSMVHDYSIDPQIEHYGCMVDLFGRAGLLDRAIEFVKRMPIKADAVIWAALLG 420
           EEG SYFNSMV++YSI PQIEHYGCMVDLFGRAGLLDRAI+FVKRMP++ADAVIWAALLG
Sbjct: 419 EEGTSYFNSMVNEYSITPQIEHYGCMVDLFGRAGLLDRAIDFVKRMPMEADAVIWAALLG 478

Query: 421 ACRTYKNIDLAELALQKLIQLEPKNPANYVMLSNIYGDLSRWKDVARLKILMRDTGFKKL 463
           ACR YKN+DLAELALQKLI LEPKNPANYV+LSNIYGDL RWKDVARLKILMRDTG KKL
Sbjct: 479 ACRIYKNLDLAELALQKLILLEPKNPANYVLLSNIYGDLGRWKDVARLKILMRDTGCKKL 538

BLAST of MC04g0907 vs. TAIR 10
Match: AT1G13410.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 493.8 bits (1270), Expect = 1.5e-139
Identity = 231/367 (62.94%), Postives = 288/367 (78.47%), Query Frame = 0

Query: 88  TLIDVY--SGHMEAARKLFDAMPYRDVMSWNTMLSGYANNGDVEACEHLFEEMPERNVFS 147
           T+I  Y   G+M  AR LFD MP RDVMSWNT+L GYAN GD+EACE +F++MPERNVFS
Sbjct: 95  TMISGYIEMGNMLEARSLFDQMPCRDVMSWNTVLEGYANIGDMEACERVFDDMPERNVFS 154

Query: 148 WNGLIGGYAHNGRFFDVLSCFKRMLLDGQVVPNDATLVTVLSACARLGALDLGKWVHVYA 207
           WNGLI GYA NGR  +VL  FKRM+ +G VVPNDAT+  VLSACA+LGA D GKWVH Y 
Sbjct: 155 WNGLIKGYAQNGRVSEVLGSFKRMVDEGSVVPNDATMTLVLSACAKLGAFDFGKWVHKYG 214

Query: 208 ATIGF-KRNIYVGNALIDMYAKCGVIENAMEVFGSMDSKDLISWNSVICGLATHGCGADA 267
            T+G+ K ++ V NALIDMY KCG IE AMEVF  +  +DLISWN++I GLA HG G +A
Sbjct: 215 ETLGYNKVDVNVKNALIDMYGKCGAIEIAMEVFKGIKRRDLISWNTMINGLAAHGHGTEA 274

Query: 268 LNLFHRMKNSGKKPDGITFIGVLCSCTHLGLVEEGISYFNSMVHDYSIDPQIEHYGCMVD 327
           LNLFH MKNSG  PD +TF+GVLC+C H+GLVE+G++YFNSM  D+SI P+IEH GC+VD
Sbjct: 275 LNLFHEMKNSGISPDKVTFVGVLCACKHMGLVEDGLAYFNSMFTDFSIMPEIEHCGCVVD 334

Query: 328 LFGRAGLLDRAIEFVKRMPIKADAVIWAALLGACRTYKNIDLAELALQKLIQLEPKNPAN 387
           L  RAG L +A+EF+ +MP+KADAVIWA LLGA + YK +D+ E+AL++LI+LEP+NPAN
Sbjct: 335 LLSRAGFLTQAVEFINKMPVKADAVIWATLLGASKVYKKVDIGEVALEELIKLEPRNPAN 394

Query: 388 YVMLSNIYGDLSRWKDVARLKILMRDTGFKKLPGCSLIEVNDSVVEFYSLDERHSQSKEI 447
           +VMLSNIYGD  R+ D ARLK+ MRDTGFKK  G S IE +D +V+FYS  E+H +++E+
Sbjct: 395 FVMLSNIYGDAGRFDDAARLKVAMRDTGFKKEAGVSWIETDDGLVKFYSSGEKHPRTEEL 454

Query: 448 YGVLKGL 452
             +L+ L
Sbjct: 455 QRILREL 461

BLAST of MC04g0907 vs. TAIR 10
Match: AT2G20540.1 (mitochondrial editing factor 21 )

HSP 1 Score: 421.0 bits (1081), Expect = 1.2e-117
Identity = 200/453 (44.15%), Postives = 305/453 (67.33%), Query Frame = 0

Query: 1   ARHLFDHFPDPKVALWNSILRGYAHNEFYREVVLLFGKM--KSTDVRPNCFTFPLVLKSC 60
           A  LF+   +P V L+NSI+R Y HN  Y +V+ ++ ++  KS ++ P+ FTFP + KSC
Sbjct: 61  ATRLFNQVSNPNVFLYNSIIRAYTHNSLYCDVIRIYKQLLRKSFEL-PDRFTFPFMFKSC 120

Query: 61  AKINAFVEGEEIHCEVIKGGFRGNQFVATTLIDVYS--GHMEAARKLFDAMPYRDVMSWN 120
           A + +   G+++H  + K G R +      LID+Y     +  A K+FD M  RDV+SWN
Sbjct: 121 ASLGSCYLGKQVHGHLCKFGPRFHVVTENALIDMYMKFDDLVDAHKVFDEMYERDVISWN 180

Query: 121 TMLSGYANNGDVEACEHLFEEMPERNVFSWNGLIGGYAHNGRFFDVLSCFKRMLLDGQVV 180
           ++LSGYA  G ++  + LF  M ++ + SW  +I GY   G + + +  F+ M L G + 
Sbjct: 181 SLLSGYARLGQMKKAKGLFHLMLDKTIVSWTAMISGYTGIGCYVEAMDFFREMQLAG-IE 240

Query: 181 PNDATLVTVLSACARLGALDLGKWVHVYAATIGFKRNIYVGNALIDMYAKCGVIENAMEV 240
           P++ +L++VL +CA+LG+L+LGKW+H+YA   GF +   V NALI+MY+KCGVI  A+++
Sbjct: 241 PDEISLISVLPSCAQLGSLELGKWIHLYAERRGFLKQTGVCNALIEMYSKCGVISQAIQL 300

Query: 241 FGSMDSKDLISWNSVICGLATHGCGADALNLFHRMKNSGKKPDGITFIGVLCSCTHLGLV 300
           FG M+ KD+ISW+++I G A HG    A+  F+ M+ +  KP+GITF+G+L +C+H+G+ 
Sbjct: 301 FGQMEGKDVISWSTMISGYAYHGNAHGAIETFNEMQRAKVKPNGITFLGLLSACSHVGMW 360

Query: 301 EEGISYFNSMVHDYSIDPQIEHYGCMVDLFGRAGLLDRAIEFVKRMPIKADAVIWAALLG 360
           +EG+ YF+ M  DY I+P+IEHYGC++D+  RAG L+RA+E  K MP+K D+ IW +LL 
Sbjct: 361 QEGLRYFDMMRQDYQIEPKIEHYGCLIDVLARAGKLERAVEITKTMPMKPDSKIWGSLLS 420

Query: 361 ACRTYKNIDLAELALQKLIQLEPKNPANYVMLSNIYGDLSRWKDVARLKILMRDTGFKKL 420
           +CRT  N+D+A +A+  L++LEP++  NYV+L+NIY DL +W+DV+RL+ ++R+   KK 
Sbjct: 421 SCRTPGNLDVALVAMDHLVELEPEDMGNYVLLANIYADLGKWEDVSRLRKMIRNENMKKT 480

Query: 421 PGCSLIEVNDSVVEFYSLDERHSQSKEIYGVLK 450
           PG SLIEVN+ V EF S D       EI  VL+
Sbjct: 481 PGGSLIEVNNIVQEFVSGDNSKPFWTEISIVLQ 511

BLAST of MC04g0907 vs. TAIR 10
Match: AT2G29760.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 416.4 bits (1069), Expect = 3.0e-116
Identity = 190/461 (41.21%), Postives = 298/461 (64.64%), Query Frame = 0

Query: 4   LFDHFPDPKVALWNSILRGYAHNEFYREVVLLFGKMKSTDVRPNCFTFPLVLKSCAKINA 63
           +F    +  V  WNS++ G+       + + LF KM+S DV+ +  T   VL +CAKI  
Sbjct: 188 VFTTIKEKDVVSWNSMINGFVQKGSPDKALELFKKMESEDVKASHVTMVGVLSACAKIRN 247

Query: 64  FVEGEEIHCEVIKGGFRGNQFVATTLIDVYS--GHMEAARKLFDAMPYRDVMSWNTMLSG 123
              G ++   + +     N  +A  ++D+Y+  G +E A++LFDAM  +D ++W TML G
Sbjct: 248 LEFGRQVCSYIEENRVNVNLTLANAMLDMYTKCGSIEDAKRLFDAMEEKDNVTWTTMLDG 307

Query: 124 YANNGDVEACEHLFEEMPERNVFSWNGLIGGYAHNGRFFDVLSCFKRMLLDGQVVPNDAT 183
           YA + D EA   +   MP++++ +WN LI  Y  NG+  + L  F  + L   +  N  T
Sbjct: 308 YAISEDYEAAREVLNSMPQKDIVAWNALISAYEQNGKPNEALIVFHELQLQKNMKLNQIT 367

Query: 184 LVTVLSACARLGALDLGKWVHVYAATIGFKRNIYVGNALIDMYAKCGVIENAMEVFGSMD 243
           LV+ LSACA++GAL+LG+W+H Y    G + N +V +ALI MY+KCG +E + EVF S++
Sbjct: 368 LVSTLSACAQVGALELGRWIHSYIKKHGIRMNFHVTSALIHMYSKCGDLEKSREVFNSVE 427

Query: 244 SKDLISWNSVICGLATHGCGADALNLFHRMKNSGKKPDGITFIGVLCSCTHLGLVEEGIS 303
            +D+  W+++I GLA HGCG +A+++F++M+ +  KP+G+TF  V C+C+H GLV+E  S
Sbjct: 428 KRDVFVWSAMIGGLAMHGCGNEAVDMFYKMQEANVKPNGVTFTNVFCACSHTGLVDEAES 487

Query: 304 YFNSMVHDYSIDPQIEHYGCMVDLFGRAGLLDRAIEFVKRMPIKADAVIWAALLGACRTY 363
            F+ M  +Y I P+ +HY C+VD+ GR+G L++A++F++ MPI     +W ALLGAC+ +
Sbjct: 488 LFHQMESNYGIVPEEKHYACIVDVLGRSGYLEKAVKFIEAMPIPPSTSVWGALLGACKIH 547

Query: 364 KNIDLAELALQKLIQLEPKNPANYVMLSNIYGDLSRWKDVARLKILMRDTGFKKLPGCSL 423
            N++LAE+A  +L++LEP+N   +V+LSNIY  L +W++V+ L+  MR TG KK PGCS 
Sbjct: 548 ANLNLAEMACTRLLELEPRNDGAHVLLSNIYAKLGKWENVSELRKHMRVTGLKKEPGCSS 607

Query: 424 IEVNDSVVEFYSLDERHSQSKEIYGVLKGLMKLLRSYGYEP 463
           IE++  + EF S D  H  S+++YG L  +M+ L+S GYEP
Sbjct: 608 IEIDGMIHEFLSGDNAHPMSEKVYGKLHEVMEKLKSNGYEP 648

BLAST of MC04g0907 vs. TAIR 10
Match: AT5G48910.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 400.2 bits (1027), Expect = 2.2e-111
Identity = 197/481 (40.96%), Postives = 297/481 (61.75%), Query Frame = 0

Query: 1   ARHLFDHFPDPKVALWNSILRGYAHNEFYREVV---LLFGKMKSTDVRPNCFTFPLVLKS 60
           A  +F+  P      WN+I+RG++ ++  + ++   L +  M    V PN FTFP VLK+
Sbjct: 78  AHKIFNQMPQRNCFSWNTIIRGFSESDEDKALIAITLFYEMMSDEFVEPNRFTFPSVLKA 137

Query: 61  CAKINAFVEGEEIHCEVIKGGFRGNQFVATTLIDVY--SGHMEAARKLF-------DAMP 120
           CAK     EG++IH   +K GF G++FV + L+ +Y   G M+ AR LF       D + 
Sbjct: 138 CAKTGKIQEGKQIHGLALKYGFGGDEFVMSNLVRMYVMCGFMKDARVLFYKNIIEKDMVV 197

Query: 121 YRD-------VMSWNTMLSGYANNGDVEACEHLFEEMPERNVFSWNGLIGGYAHNGRFFD 180
             D       ++ WN M+ GY   GD +A   LF++M +R+V SWN +I GY+ NG F D
Sbjct: 198 MTDRRKRDGEIVLWNVMIDGYMRLGDCKAARMLFDKMRQRSVVSWNTMISGYSLNGFFKD 257

Query: 181 VLSCFKRMLLDGQVVPNDATLVTVLSACARLGALDLGKWVHVYAATIGFKRNIYVGNALI 240
            +  F+ M   G + PN  TLV+VL A +RLG+L+LG+W+H+YA   G + +  +G+ALI
Sbjct: 258 AVEVFREM-KKGDIRPNYVTLVSVLPAISRLGSLELGEWLHLYAEDSGIRIDDVLGSALI 317

Query: 241 DMYAKCGVIENAMEVFGSMDSKDLISWNSVICGLATHGCGADALNLFHRMKNSGKKPDGI 300
           DMY+KCG+IE A+ VF  +  +++I+W+++I G A HG   DA++ F +M+ +G +P  +
Sbjct: 318 DMYSKCGIIEKAIHVFERLPRENVITWSAMINGFAIHGQAGDAIDCFCKMRQAGVRPSDV 377

Query: 301 TFIGVLCSCTHLGLVEEGISYFNSMVHDYSIDPQIEHYGCMVDLFGRAGLLDRAIEFVKR 360
            +I +L +C+H GLVEEG  YF+ MV    ++P+IEHYGCMVDL GR+GLLD A EF+  
Sbjct: 378 AYINLLTACSHGGLVEEGRRYFSQMVSVDGLEPRIEHYGCMVDLLGRSGLLDEAEEFILN 437

Query: 361 MPIKADAVIWAALLGACRTYKNIDLAELALQKLIQLEPKNPANYVMLSNIYGDLSRWKDV 420
           MPIK D VIW ALLGACR   N+++ +     L+ + P +   YV LSN+Y     W +V
Sbjct: 438 MPIKPDDVIWKALLGACRMQGNVEMGKRVANILMDMVPHDSGAYVALSNMYASQGNWSEV 497

Query: 421 ARLKILMRDTGFKKLPGCSLIEVNDSVVEFYSLDERHSQSKEIYGVLKGLMKLLRSYGYE 463
           + +++ M++   +K PGCSLI+++  + EF   D+ H ++KEI  +L  +   LR  GY 
Sbjct: 498 SEMRLRMKEKDIRKDPGCSLIDIDGVLHEFVVEDDSHPKAKEINSMLVEISDKLRLAGYR 557

BLAST of MC04g0907 vs. TAIR 10
Match: AT4G37380.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 396.7 bits (1018), Expect = 2.5e-110
Identity = 196/463 (42.33%), Postives = 297/463 (64.15%), Query Frame = 0

Query: 4   LFDHFPDPKVALWNSILRGYAHNEFYREVVLLFGKMKSTDVRPNCFTFPLVLKSCAKINA 63
           LF    DP + L+ + +   + N    +  LL+ ++ S+++ PN FTF  +LKSC+    
Sbjct: 86  LFHQTIDPDLFLFTAAINTASINGLKDQAFLLYVQLLSSEINPNEFTFSSLLKSCST--- 145

Query: 64  FVEGEEIHCEVIKGGFRGNQFVATTLIDVYS--GHMEAARKLFDAMPYRDVMSWNTMLSG 123
              G+ IH  V+K G   + +VAT L+DVY+  G + +A+K+FD MP R ++S   M++ 
Sbjct: 146 -KSGKLIHTHVLKFGLGIDPYVATGLVDVYAKGGDVVSAQKVFDRMPERSLVSSTAMITC 205

Query: 124 YANNGDVEACEHLFEEMPERNVFSWNGLIGGYAHNGRFFDVLSCFKRMLLDGQVVPNDAT 183
           YA  G+VEA   LF+ M ER++ SWN +I GYA +G   D L  F+++L +G+  P++ T
Sbjct: 206 YAKQGNVEAARALFDSMCERDIVSWNVMIDGYAQHGFPNDALMLFQKLLAEGKPKPDEIT 265

Query: 184 LVTVLSACARLGALDLGKWVHVYAATIGFKRNIYVGNALIDMYAKCGVIENAMEVFGSMD 243
           +V  LSAC+++GAL+ G+W+HV+  +   + N+ V   LIDMY+KCG +E A+ VF    
Sbjct: 266 VVAALSACSQIGALETGRWIHVFVKSSRIRLNVKVCTGLIDMYSKCGSLEEAVLVFNDTP 325

Query: 244 SKDLISWNSVICGLATHGCGADALNLFHRMKN-SGKKPDGITFIGVLCSCTHLGLVEEGI 303
            KD+++WN++I G A HG   DAL LF+ M+  +G +P  ITFIG L +C H GLV EGI
Sbjct: 326 RKDIVAWNAMIAGYAMHGYSQDALRLFNEMQGITGLQPTDITFIGTLQACAHAGLVNEGI 385

Query: 304 SYFNSMVHDYSIDPQIEHYGCMVDLFGRAGLLDRAIEFVKRMPIKADAVIWAALLGACRT 363
             F SM  +Y I P+IEHYGC+V L GRAG L RA E +K M + AD+V+W+++LG+C+ 
Sbjct: 386 RIFESMGQEYGIKPKIEHYGCLVSLLGRAGQLKRAYETIKNMNMDADSVLWSSVLGSCKL 445

Query: 364 YKNIDLAELALQKLIQLEPKNPANYVMLSNIYGDLSRWKDVARLKILMRDTGFKKLPGCS 423
           + +  L +   + LI L  KN   YV+LSNIY  +  ++ VA+++ LM++ G  K PG S
Sbjct: 446 HGDFVLGKEIAEYLIGLNIKNSGIYVLLSNIYASVGDYEGVAKVRNLMKEKGIVKEPGIS 505

Query: 424 LIEVNDSVVEFYSLDERHSQSKEIYGVLKGLMKLLRSYGYEPN 464
            IE+ + V EF + D  HS+SKEIY +L+ + + ++S+GY PN
Sbjct: 506 TIEIENKVHEFRAGDREHSKSKEIYTMLRKISERIKSHGYVPN 544

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q9SIL51.7e-11644.15Pentatricopeptide repeat-containing protein At2g20540 OS=Arabidopsis thaliana OX... [more]
O823804.3e-11541.21Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidop... [more]
Q9FI803.2e-11040.96Pentatricopeptide repeat-containing protein At5g48910 OS=Arabidopsis thaliana OX... [more]
Q9SZT83.5e-10942.33Pentatricopeptide repeat-containing protein ELI1, chloroplastic OS=Arabidopsis t... [more]
Q9LN012.3e-10836.99Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidop... [more]
Match NameE-valueIdentityDescription
XP_022139698.10.087.86pentatricopeptide repeat-containing protein At3g29230-like [Momordica charantia][more]
XP_038886719.15.71e-31580.65pentatricopeptide repeat-containing protein At2g29760, chloroplastic-like [Benin... [more]
XP_023549948.11.46e-31079.32pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like [Cucur... [more]
XP_022938516.12.41e-30979.13pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like [Cucur... [more]
XP_011656468.12.39e-30779.13pentatricopeptide repeat-containing protein At1g08070, chloroplastic [Cucumis sa... [more]
Match NameE-valueIdentityDescription
A0A6J1CGA70.087.86pentatricopeptide repeat-containing protein At3g29230-like OS=Momordica charanti... [more]
A0A6J1FDD31.17e-30979.13pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like OS=Cuc... [more]
A0A0A0KBY41.16e-30779.13Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G017080 PE=4 SV=1[more]
A0A6J1K0371.58e-30778.56pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like OS=Cuc... [more]
A0A5A7V6Y91.15e-30277.99Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946... [more]
Match NameE-valueIdentityDescription
AT1G13410.11.5e-13962.94Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT2G20540.11.2e-11744.15mitochondrial editing factor 21 [more]
AT2G29760.13.0e-11641.21Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT5G48910.12.2e-11140.96Pentatricopeptide repeat (PPR) superfamily protein [more]
AT4G37380.12.5e-11042.33Tetratricopeptide repeat (TPR)-like superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Bitter gourd (Dali-11) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 246..279
e-value: 1.8E-6
score: 25.7
coord: 114..143
e-value: 5.7E-8
score: 30.5
coord: 15..48
e-value: 3.4E-5
score: 21.7
coord: 218..244
e-value: 0.0028
score: 15.7
coord: 144..173
e-value: 8.4E-6
score: 23.7
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 144..173
e-value: 5.7E-6
score: 26.2
coord: 318..342
e-value: 0.011
score: 16.0
coord: 114..143
e-value: 2.1E-8
score: 33.8
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 11..60
e-value: 4.6E-8
score: 33.1
coord: 243..290
e-value: 7.1E-8
score: 32.5
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 12..46
score: 9.613118
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 244..278
score: 11.081932
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 111..145
score: 11.871145
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 218..431
e-value: 5.7E-38
score: 133.0
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 95..197
e-value: 1.4E-27
score: 98.2
coord: 1..94
e-value: 3.1E-14
score: 54.7
IPR011990Tetratricopeptide-like helical domain superfamilySUPERFAMILY48452TPR-likecoord: 129..399
NoneNo IPR availablePANTHERPTHR47926PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEINcoord: 95..139
NoneNo IPR availablePANTHERPTHR47926PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEINcoord: 10..95
NoneNo IPR availablePANTHERPTHR47926PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEINcoord: 140..461
NoneNo IPR availablePANTHERPTHR47926:SF81TETRATRICOPEPTIDE REPEAT (TPR)-LIKE SUPERFAMILY PROTEINcoord: 95..139
NoneNo IPR availablePANTHERPTHR47926:SF81TETRATRICOPEPTIDE REPEAT (TPR)-LIKE SUPERFAMILY PROTEINcoord: 140..461
coord: 10..95

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
MC04g0907.1MC04g0907.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0005515 protein binding