MC01g_new0482 (gene) Bitter gourd (Dali-11) v1

Overview
NameMC01g_new0482
Typegene
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionPentatricopeptide repeat-containing protein
LocationMC01: 20081144 .. 20084080 (+)
RNA-Seq ExpressionMC01g_new0482
SyntenyMC01g_new0482
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
CTTTTTCTGTTAGTATAATGTGTTGAGTTGTGTTGTTTTGTGAGACTTTTTTTTCTTTTTCAACTTTCTTTCTCTCTTCTTTTTTATCTAACAAATATATTTCAAGTTTTTTTGTTTTCTAATTTTTTTCTCACTATAATGTTTTTCTATTTTTAATTTTTCTTTATCTATCTAACATGTATTTCGCTCTAACTTTATTATTTTTCTCTAAATTTTTATTCTTTTATCTCACAACTGGTCTTCAATTTTTCGAGAGAAGGGGAAGAACATACCCCATTTTATTGTCATCCATACTTTACATAAGATTTTCTATTTCTAACATATCTTTTCAACCTCGTAACTTATTCACCGTAAATAAAACAAAAAAAAACAACAACAACATATATCAGAAAAGATAAAGGTTAAAAACAGAACCCTCATAAATATTGATTTTAACATTTTATAAATTCCCAGATCTAAGCGACTGAGTGGGTTAGTTAAGTTATGATGTTTTAGAGGATATTGGTTGGTTTGACTCCTCTCCCAAGCATGTTCTGTTCTGATGCGCTCAAACCAACCGTCATTCTTCTACTCCACCGCCCGCTCCGTCGCTTCAACTTTCCCGGAAAATCACCTCACTTTCCTTCTCAGAAAATGCATCTCTCTCATACAACTCTGCGCCTCCTCCCACTTCAAGCTCAAGCAAATCCACGGCTTCTCCATCAGACATGGCGTCCCACCCCACAATCCAGACATGGGCAAGCACCTCATCTTCGCCCTCGTCTCCCTCTCGGCCCCCATGCCCTACGCGACCCGAATTTTCCGTCTGATTCGAGCCCCCAATATCTTCACGTGGAACACCATGATTAGAGGGTTTGCCGAGAGCGAGAATCCGAGGCCGGCCGTGGAGTTGTACTGCCAAATGCACGCGTCTTCGGTTCTGCCTGATACGCATACTTTCCCTTTTCTTTTGAAGGCTGCTGCTAAGTTAATGGATGTTAGAGTAGGCGAGGAGATTCACTCGATTGTTGTTAGAAATGGGTTCGGTTCGTTGCTTTTTGTTCAGAATTCGTTGGTCCATATGTACTCTGTTTTTGGGTTTGCCGAGAGTGCGTACCAGGTGTTTGAGTTTATGCTTGAGAGAGATCTCGTGGCCTGGAACTCTGTTATTAATGGCTTTGCTCTTAATGGAATGGCTAACGAAGCTCTGACCCTTTTTAGGGAAATGGGTTTGGATGGCGTGGAGCCTGATGGGTTCACCATGGTTAGTCTGTTATCTGCTTGTGTTGAGCTTGGGGCCATGGCCTTGGGGGAGAGGGTTCATGTGTATATGTTGAAGGTTGGTTTAGTACACAATCCACATGCTAGCAATGCCCTCCTTGATCTCTACTCCAAATGTGGGAACATTAGAGATGCACTGAAGGTGTTTGATGAAATGGAAGAGAGGAGTGTGGTTTCTTGGACTTCTCTGATTGTTGGGTTGGCTGTTAATGGATTAGGAAATGAAGCTCTTGAGCTGTTTGGGGAGTTGGAAAGGAAGGGGTTGAAGCCTAGTGAGATCACATTTGTTGGAGTTTTGTATGCTTGTAGCCATTGTGGGATGGTTGAGGAAGGCTTCGATTACTTTAGAAGGATGAAAGATGAATATGGCATCTTGCCAAGGATAGAGCACCATGGCTGTATTGTTGATTTGCTGTGCAGGGCCGGCAAGGTTGGAGATGCCTATGAGTATATCCGAAACATTTCGATCCCGCCCAATGCAGTCATTTGGCGGACCTTACTGGGAGCTTGCACAATCCATGGGCATCTAGAATTGGGTGAGGTTGCAAGAGCTGAAGTCCTACGCTTGGAACCGAAGCATAGCGGGGACTATGTCCTTCTCTCGAACCTTTATGCATCGGAGCGACGTTGGCTGGATGTGCAAAACGTAAGGAGGACGATGCTTATGAAAGGAGTGAGGAAAACTCCCGGGTATAGCCTCGTTGAGTTGAAAAACCGTGTTTATGAGTTTATCATGGGTGATAGATCTCATCCCCAAAGTGAGGAGACATACGCAATGCTGGGGAAGATCACAGAGTTGTTGAAAATCGAAGGCTACGTTCCTCGCACGGTTAATGTTCTTGCTGATATAGAAGAGGAAGAAAAGGAGACGGCTCTGTCTCATCACACGGAGAAAGTTGCAATTGCTTTTATGTTGGTTAACACCCCACAAAGAACTCCAATTAGAATCATGAAGAATTTGAGAGTCTGTGCAGATTGTCATCTGGCGATCAAACTCATATCCAAGGTTTTCGAACGTGAGATCATCGTAAGGGATCGTAGTAGGTTTCATCATTTTAAAGACAGTTCGTGCTCTTGTAGAGATTATTGGTAATCTTAATAGTTTCCTTGACATATACCCTCATTGATGAATGTCTGCATTTTTTTCTTTCAAGGTGAAAGATGATTCTACATTCATATGATTGAACATAGAATAGAAGGCTGAGTTAGAGGTTTATAATCTGTGTTTTGGCTAATTTTAATAATCCTTATGACACTATATATAATGGTGAGGGTGTTTGGTGGCTGGATCCAGAACCAGAAACTTTCCACAAACAATGCTGTTGAGAACATTTTAAGATGACATGATAGATATCTATTTATTTACTTATTTTTGTGTATTCTCTTTATTTTTCGAACTTCTGTATCATTAAATCAAATTATATATATACTTCTATATCCAGGGACATTCCCGTCCGCCCCAAGGTCTAGATCCGGATACAACAAAGGTTTTTGGTATTAGGACCTCTAAGTCAATATGACTAATTATCAAAAGACTTTAAGCTCTTGTTAATCGGAATCTCCTTTTGTAGTAATAAATCAAATTATACTATTAAAGCTTGTCTTATGTTGAGATGTGTTCCGTTACCTCACAAAACATCAAAGACATTCCTCATGTCACAATAAACAATATGGTTT

mRNA sequence

ATGCGCTCAAACCAACCGTCATTCTTCTACTCCACCGCCCGCTCCGTCGCTTCAACTTTCCCGGAAAATCACCTCACTTTCCTTCTCAGAAAATGCATCTCTCTCATACAACTCTGCGCCTCCTCCCACTTCAAGCTCAAGCAAATCCACGGCTTCTCCATCAGACATGGCGTCCCACCCCACAATCCAGACATGGGCAAGCACCTCATCTTCGCCCTCGTCTCCCTCTCGGCCCCCATGCCCTACGCGACCCGAATTTTCCGTCTGATTCGAGCCCCCAATATCTTCACGTGGAACACCATGATTAGAGGGTTTGCCGAGAGCGAGAATCCGAGGCCGGCCGTGGAGTTGTACTGCCAAATGCACGCGTCTTCGGTTCTGCCTGATACGCATACTTTCCCTTTTCTTTTGAAGGCTGCTGCTAAGTTAATGGATGTTAGAGTAGGCGAGGAGATTCACTCGATTGTTGTTAGAAATGGGTTCGGTTCGTTGCTTTTTGTTCAGAATTCGTTGGTCCATATGTACTCTGTTTTTGGGTTTGCCGAGAGTGCGTACCAGGTGTTTGAGTTTATGCTTGAGAGAGATCTCGTGGCCTGGAACTCTGTTATTAATGGCTTTGCTCTTAATGGAATGGCTAACGAAGCTCTGACCCTTTTTAGGGAAATGGGTTTGGATGGCGTGGAGCCTGATGGGTTCACCATGGTTAGTCTGTTATCTGCTTGTGTTGAGCTTGGGGCCATGGCCTTGGGGGAGAGGGTTCATGTGTATATGTTGAAGGTTGGTTTAGTACACAATCCACATGCTAGCAATGCCCTCCTTGATCTCTACTCCAAATGTGGGAACATTAGAGATGCACTGAAGGTGTTTGATGAAATGGAAGAGAGGAGTGTGGTTTCTTGGACTTCTCTGATTGTTGGGTTGGCTGTTAATGGATTAGGAAATGAAGCTCTTGAGCTGTTTGGGGAGTTGGAAAGGAAGGGGTTGAAGCCTAGTGAGATCACATTTGTTGGAGTTTTGTATGCTTGTAGCCATTGTGGGATGGTTGAGGAAGGCTTCGATTACTTTAGAAGGATGAAAGATGAATATGGCATCTTGCCAAGGATAGAGCACCATGGCTGTATTGTTGATTTGCTGTGCAGGGCCGGCAAGGTTGGAGATGCCTATGAGTATATCCGAAACATTTCGATCCCGCCCAATGCAGTCATTTGGCGGACCTTACTGGGAGCTTGCACAATCCATGGGCATCTAGAATTGGGTGAGGTTGCAAGAGCTGAAGTCCTACGCTTGGAACCGAAGCATAGCGGGGACTATGTCCTTCTCTCGAACCTTTATGCATCGGAGCGACGTTGGCTGGATGTGCAAAACGTAAGGAGGACGATGCTTATGAAAGGAGTGAGGAAAACTCCCGGGTATAGCCTCGTTGAGTTGAAAAACCGTGTTTATGAGTTTATCATGGGTGATAGATCTCATCCCCAAAGTGAGGAGACATACGCAATGCTGGGGAAGATCACAGAGTTGTTGAAAATCGAAGGCTACGTTCCTCGCACGGTTAATGTTCTTGCTGATATAGAAGAGGAAGAAAAGGAGACGGCTCTGTCTCATCACACGGAGAAAGTTGCAATTGCTTTTATGTTGGTTAACACCCCACAAAGAACTCCAATTAGAATCATGAAGAATTTGAGAGTCTGTGCAGATTGTCATCTGGCGATCAAACTCATATCCAAGGTTTTCGAACGTGAGATCATCGTAAGGGATCGTAGTAGGTTTCATCATTTTAAAGACAGTTCGTGCTCTTGTAGAGATTATTGGTAA

Coding sequence (CDS)

ATGCGCTCAAACCAACCGTCATTCTTCTACTCCACCGCCCGCTCCGTCGCTTCAACTTTCCCGGAAAATCACCTCACTTTCCTTCTCAGAAAATGCATCTCTCTCATACAACTCTGCGCCTCCTCCCACTTCAAGCTCAAGCAAATCCACGGCTTCTCCATCAGACATGGCGTCCCACCCCACAATCCAGACATGGGCAAGCACCTCATCTTCGCCCTCGTCTCCCTCTCGGCCCCCATGCCCTACGCGACCCGAATTTTCCGTCTGATTCGAGCCCCCAATATCTTCACGTGGAACACCATGATTAGAGGGTTTGCCGAGAGCGAGAATCCGAGGCCGGCCGTGGAGTTGTACTGCCAAATGCACGCGTCTTCGGTTCTGCCTGATACGCATACTTTCCCTTTTCTTTTGAAGGCTGCTGCTAAGTTAATGGATGTTAGAGTAGGCGAGGAGATTCACTCGATTGTTGTTAGAAATGGGTTCGGTTCGTTGCTTTTTGTTCAGAATTCGTTGGTCCATATGTACTCTGTTTTTGGGTTTGCCGAGAGTGCGTACCAGGTGTTTGAGTTTATGCTTGAGAGAGATCTCGTGGCCTGGAACTCTGTTATTAATGGCTTTGCTCTTAATGGAATGGCTAACGAAGCTCTGACCCTTTTTAGGGAAATGGGTTTGGATGGCGTGGAGCCTGATGGGTTCACCATGGTTAGTCTGTTATCTGCTTGTGTTGAGCTTGGGGCCATGGCCTTGGGGGAGAGGGTTCATGTGTATATGTTGAAGGTTGGTTTAGTACACAATCCACATGCTAGCAATGCCCTCCTTGATCTCTACTCCAAATGTGGGAACATTAGAGATGCACTGAAGGTGTTTGATGAAATGGAAGAGAGGAGTGTGGTTTCTTGGACTTCTCTGATTGTTGGGTTGGCTGTTAATGGATTAGGAAATGAAGCTCTTGAGCTGTTTGGGGAGTTGGAAAGGAAGGGGTTGAAGCCTAGTGAGATCACATTTGTTGGAGTTTTGTATGCTTGTAGCCATTGTGGGATGGTTGAGGAAGGCTTCGATTACTTTAGAAGGATGAAAGATGAATATGGCATCTTGCCAAGGATAGAGCACCATGGCTGTATTGTTGATTTGCTGTGCAGGGCCGGCAAGGTTGGAGATGCCTATGAGTATATCCGAAACATTTCGATCCCGCCCAATGCAGTCATTTGGCGGACCTTACTGGGAGCTTGCACAATCCATGGGCATCTAGAATTGGGTGAGGTTGCAAGAGCTGAAGTCCTACGCTTGGAACCGAAGCATAGCGGGGACTATGTCCTTCTCTCGAACCTTTATGCATCGGAGCGACGTTGGCTGGATGTGCAAAACGTAAGGAGGACGATGCTTATGAAAGGAGTGAGGAAAACTCCCGGGTATAGCCTCGTTGAGTTGAAAAACCGTGTTTATGAGTTTATCATGGGTGATAGATCTCATCCCCAAAGTGAGGAGACATACGCAATGCTGGGGAAGATCACAGAGTTGTTGAAAATCGAAGGCTACGTTCCTCGCACGGTTAATGTTCTTGCTGATATAGAAGAGGAAGAAAAGGAGACGGCTCTGTCTCATCACACGGAGAAAGTTGCAATTGCTTTTATGTTGGTTAACACCCCACAAAGAACTCCAATTAGAATCATGAAGAATTTGAGAGTCTGTGCAGATTGTCATCTGGCGATCAAACTCATATCCAAGGTTTTCGAACGTGAGATCATCGTAAGGGATCGTAGTAGGTTTCATCATTTTAAAGACAGTTCGTGCTCTTGTAGAGATTATTGGTAA

Protein sequence

MRSNQPSFFYSTARSVASTFPENHLTFLLRKCISLIQLCASSHFKLKQIHGFSIRHGVPPHNPDMGKHLIFALVSLSAPMPYATRIFRLIRAPNIFTWNTMIRGFAESENPRPAVELYCQMHASSVLPDTHTFPFLLKAAAKLMDVRVGEEIHSIVVRNGFGSLLFVQNSLVHMYSVFGFAESAYQVFEFMLERDLVAWNSVINGFALNGMANEALTLFREMGLDGVEPDGFTMVSLLSACVELGAMALGERVHVYMLKVGLVHNPHASNALLDLYSKCGNIRDALKVFDEMEERSVVSWTSLIVGLAVNGLGNEALELFGELERKGLKPSEITFVGVLYACSHCGMVEEGFDYFRRMKDEYGILPRIEHHGCIVDLLCRAGKVGDAYEYIRNISIPPNAVIWRTLLGACTIHGHLELGEVARAEVLRLEPKHSGDYVLLSNLYASERRWLDVQNVRRTMLMKGVRKTPGYSLVELKNRVYEFIMGDRSHPQSEETYAMLGKITELLKIEGYVPRTVNVLADIEEEEKETALSHHTEKVAIAFMLVNTPQRTPIRIMKNLRVCADCHLAIKLISKVFEREIIVRDRSRFHHFKDSSCSCRDYW
Homology
BLAST of MC01g_new0482 vs. ExPASy Swiss-Prot
Match: A8MQA3 (Pentatricopeptide repeat-containing protein At4g21065 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H28 PE=2 SV=2)

HSP 1 Score: 770.0 bits (1987), Expect = 2.0e-221
Identity = 371/594 (62.46%), Postives = 471/594 (79.29%), Query Frame = 0

Query: 18  STFPENHLTFL--LRKCISLIQLC-ASSHFKLKQIHGFSIRHGVPPHNPDMGKHLIFALV 77
           S F E  +  L  + KCI+L+Q    SS  KL+QIH FSIRHGV   + ++GKHLIF LV
Sbjct: 2   SPFSETSVLLLPMVEKCINLLQTYGVSSITKLRQIHAFSIRHGVSISDAELGKHLIFYLV 61

Query: 78  SLSAPMP--YATRIFRLIRAP-NIFTWNTMIRGFAESENPRPAVELYCQMHASSVL-PDT 137
           SL +P P  YA ++F  I  P N+F WNT+IRG+AE  N   A  LY +M  S ++ PDT
Sbjct: 62  SLPSPPPMSYAHKVFSKIEKPINVFIWNTLIRGYAEIGNSISAFSLYREMRVSGLVEPDT 121

Query: 138 HTFPFLLKAAAKLMDVRVGEEIHSIVVRNGFGSLLFVQNSLVHMYSVFGFAESAYQVFEF 197
           HT+PFL+KA   + DVR+GE IHS+V+R+GFGSL++VQNSL+H+Y+  G   SAY+VF+ 
Sbjct: 122 HTYPFLIKAVTTMADVRLGETIHSVVIRSGFGSLIYVQNSLLHLYANCGDVASAYKVFDK 181

Query: 198 MLERDLVAWNSVINGFALNGMANEALTLFREMGLDGVEPDGFTMVSLLSACVELGAMALG 257
           M E+DLVAWNSVINGFA NG   EAL L+ EM   G++PDGFT+VSLLSAC ++GA+ LG
Sbjct: 182 MPEKDLVAWNSVINGFAENGKPEEALALYTEMNSKGIKPDGFTIVSLLSACAKIGALTLG 241

Query: 258 ERVHVYMLKVGLVHNPHASNALLDLYSKCGNIRDALKVFDEMEERSVVSWTSLIVGLAVN 317
           +RVHVYM+KVGL  N H+SN LLDLY++CG + +A  +FDEM +++ VSWTSLIVGLAVN
Sbjct: 242 KRVHVYMIKVGLTRNLHSSNVLLDLYARCGRVEEAKTLFDEMVDKNSVSWTSLIVGLAVN 301

Query: 318 GLGNEALELFGELE-RKGLKPSEITFVGVLYACSHCGMVEEGFDYFRRMKDEYGILPRIE 377
           G G EA+ELF  +E  +GL P EITFVG+LYACSHCGMV+EGF+YFRRM++EY I PRIE
Sbjct: 302 GFGKEAIELFKYMESTEGLLPCEITFVGILYACSHCGMVKEGFEYFRRMREEYKIEPRIE 361

Query: 378 HHGCIVDLLCRAGKVGDAYEYIRNISIPPNAVIWRTLLGACTIHGHLELGEVARAEVLRL 437
           H GC+VDLL RAG+V  AYEYI+++ + PN VIWRTLLGACT+HG  +L E AR ++L+L
Sbjct: 362 HFGCMVDLLARAGQVKKAYEYIKSMPMQPNVVIWRTLLGACTVHGDSDLAEFARIQILQL 421

Query: 438 EPKHSGDYVLLSNLYASERRWLDVQNVRRTMLMKGVRKTPGYSLVELKNRVYEFIMGDRS 497
           EP HSGDYVLLSN+YASE+RW DVQ +R+ ML  GV+K PG+SLVE+ NRV+EF+MGD+S
Sbjct: 422 EPNHSGDYVLLSNMYASEQRWSDVQKIRKQMLRDGVKKVPGHSLVEVGNRVHEFLMGDKS 481

Query: 498 HPQSEETYAMLGKITELLKIEGYVPRTVNVLADIEEEEKETALSHHTEKVAIAFMLVNTP 557
           HPQS+  YA L ++T  L+ EGYVP+  NV  D+EEEEKE A+ +H+EK+AIAFML++TP
Sbjct: 482 HPQSDAIYAKLKEMTGRLRSEGYVPQISNVYVDVEEEEKENAVVYHSEKIAIAFMLISTP 541

Query: 558 QRTPIRIMKNLRVCADCHLAIKLISKVFEREIIVRDRSRFHHFKDSSCSCRDYW 604
           +R+PI ++KNLRVCADCHLAIKL+SKV+ REI+VRDRSRFHHFK+ SCSC+DYW
Sbjct: 542 ERSPITVVKNLRVCADCHLAIKLVSKVYNREIVVRDRSRFHHFKNGSCSCQDYW 595

BLAST of MC01g_new0482 vs. ExPASy Swiss-Prot
Match: Q9CA54 (Pentatricopeptide repeat-containing protein At1g74630 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H71 PE=2 SV=1)

HSP 1 Score: 486.9 bits (1252), Expect = 3.3e-136
Identity = 253/644 (39.29%), Postives = 376/644 (58.39%), Query Frame = 0

Query: 25  LTFLLRKCISLIQLCASSHFKLKQIHGFSIRHGVPPHNPDMGKHLIFALVSLSAPMPYAT 84
           +T  +  C+SL+  C +    L QIHG  I++GV   +   GK ++   +S+S  +PYA 
Sbjct: 1   MTIAIHHCLSLLNSCKNLR-ALTQIHGLFIKYGVDTDSYFTGKLILHCAISISDALPYAR 60

Query: 85  RIFRLIRAPNIFTWNTMIRGFAESENPRPAVELYCQ-MHASSVLPDTHTFPFLLKAAAKL 144
           R+      P+ F +NT++RG++ES+ P  +V ++ + M    V PD+ +F F++KA    
Sbjct: 61  RLLLCFPEPDAFMFNTLVRGYSESDEPHNSVAVFVEMMRKGFVFPDSFSFAFVIKAVENF 120

Query: 145 MDVRVGEEIHSIVVRNGFGSLLFVQNSLVHMYSVFGFAESAYQVFEFMLERDLVAWNSVI 204
             +R G ++H   +++G  S LFV  +L+ MY   G  E A +VF+ M + +LVAWN+VI
Sbjct: 121 RSLRTGFQMHCQALKHGLESHLFVGTTLIGMYGGCGCVEFARKVFDEMHQPNLVAWNAVI 180

Query: 205 N----------------------------------------------------------- 264
                                                                       
Sbjct: 181 TACFRGNDVAGAREIFDKMLVRNHTSWNVMLAGYIKAGELESAKRIFSEMPHRDDVSWST 240

Query: 265 ---GFALNGMANEALTLFREMGLDGVEPDGFTMVSLLSACVELGAMALGERVHVYMLKVG 324
              G A NG  NE+   FRE+   G+ P+  ++  +LSAC + G+   G+ +H ++ K G
Sbjct: 241 MIVGIAHNGSFNESFLYFRELQRAGMSPNEVSLTGVLSACSQSGSFEFGKILHGFVEKAG 300

Query: 325 LVHNPHASNALLDLYSKCGNIRDALKVFDEMEE-RSVVSWTSLIVGLAVNGLGNEALELF 384
                  +NAL+D+YS+CGN+  A  VF+ M+E R +VSWTS+I GLA++G G EA+ LF
Sbjct: 301 YSWIVSVNNALIDMYSRCGNVPMARLVFEGMQEKRCIVSWTSMIAGLAMHGQGEEAVRLF 360

Query: 385 GELERKGLKPSEITFVGVLYACSHCGMVEEGFDYFRRMKDEYGILPRIEHHGCIVDLLCR 444
            E+   G+ P  I+F+ +L+ACSH G++EEG DYF  MK  Y I P IEH+GC+VDL  R
Sbjct: 361 NEMTAYGVTPDGISFISLLHACSHAGLIEEGEDYFSEMKRVYHIEPEIEHYGCMVDLYGR 420

Query: 445 AGKVGDAYEYIRNISIPPNAVIWRTLLGACTIHGHLELGEVARAEVLRLEPKHSGDYVLL 504
           +GK+  AY++I  + IPP A++WRTLLGAC+ HG++EL E  +  +  L+P +SGD VLL
Sbjct: 421 SGKLQKAYDFICQMPIPPTAIVWRTLLGACSSHGNIELAEQVKQRLNELDPNNSGDLVLL 480

Query: 505 SNLYASERRWLDVQNVRRTMLMKGVRKTPGYSLVELKNRVYEFIMGDRSHPQSEETYAML 564
           SN YA+  +W DV ++R++M+++ ++KT  +SLVE+   +Y+F  G++      E +  L
Sbjct: 481 SNAYATAGKWKDVASIRKSMIVQRIKKTTAWSLVEVGKTMYKFTAGEKKKGIDIEAHEKL 540

Query: 565 GKITELLKIE-GYVPRTVNVLADIEEEEKETALSHHTEKVAIAFMLVNTPQRTPIRIMKN 604
            +I   LK E GY P   + L D+EEEEKE  +S H+EK+A+AF L    +   IRI+KN
Sbjct: 541 KEIILRLKDEAGYTPEVASALYDVEEEEKEDQVSKHSEKLALAFALARLSKGANIRIVKN 600

BLAST of MC01g_new0482 vs. ExPASy Swiss-Prot
Match: Q9FI80 (Pentatricopeptide repeat-containing protein At5g48910 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H38 PE=2 SV=1)

HSP 1 Score: 471.1 bits (1211), Expect = 1.9e-131
Identity = 239/608 (39.31%), Postives = 371/608 (61.02%), Query Frame = 0

Query: 46  LKQIHGFSIRHGVPPHNPDMGKHLIFALVS--LSAPMPYATRIFRLIRAPNIFTWNTMIR 105
           L QIH   I+ G         + L F   S      + YA +IF  +   N F+WNT+IR
Sbjct: 39  LSQIHAVFIKSGQMRDTLAAAEILRFCATSDLHHRDLDYAHKIFNQMPQRNCFSWNTIIR 98

Query: 106 GFAESENPRPAVEL---YCQMHASSVLPDTHTFPFLLKAAAKLMDVRVGEEIHSIVVRNG 165
           GF+ES+  +  + +   Y  M    V P+  TFP +LKA AK   ++ G++IH + ++ G
Sbjct: 99  GFSESDEDKALIAITLFYEMMSDEFVEPNRFTFPSVLKACAKTGKIQEGKQIHGLALKYG 158

Query: 166 FGSLLFVQNSLVHMYSVFGF---------------------------------------- 225
           FG   FV ++LV MY + GF                                        
Sbjct: 159 FGGDEFVMSNLVRMYVMCGFMKDARVLFYKNIIEKDMVVMTDRRKRDGEIVLWNVMIDGY 218

Query: 226 -----AESAYQVFEFMLERDLVAWNSVINGFALNGMANEALTLFREMGLDGVEPDGFTMV 285
                 ++A  +F+ M +R +V+WN++I+G++LNG   +A+ +FREM    + P+  T+V
Sbjct: 219 MRLGDCKAARMLFDKMRQRSVVSWNTMISGYSLNGFFKDAVEVFREMKKGDIRPNYVTLV 278

Query: 286 SLLSACVELGAMALGERVHVYMLKVGLVHNPHASNALLDLYSKCGNIRDALKVFDEMEER 345
           S+L A   LG++ LGE +H+Y    G+  +    +AL+D+YSKCG I  A+ VF+ +   
Sbjct: 279 SVLPAISRLGSLELGEWLHLYAEDSGIRIDDVLGSALIDMYSKCGIIEKAIHVFERLPRE 338

Query: 346 SVVSWTSLIVGLAVNGLGNEALELFGELERKGLKPSEITFVGVLYACSHCGMVEEGFDYF 405
           +V++W+++I G A++G   +A++ F ++ + G++PS++ ++ +L ACSH G+VEEG  YF
Sbjct: 339 NVITWSAMINGFAIHGQAGDAIDCFCKMRQAGVRPSDVAYINLLTACSHGGLVEEGRRYF 398

Query: 406 RRMKDEYGILPRIEHHGCIVDLLCRAGKVGDAYEYIRNISIPPNAVIWRTLLGACTIHGH 465
            +M    G+ PRIEH+GC+VDLL R+G + +A E+I N+ I P+ VIW+ LLGAC + G+
Sbjct: 399 SQMVSVDGLEPRIEHYGCMVDLLGRSGLLDEAEEFILNMPIKPDDVIWKALLGACRMQGN 458

Query: 466 LELGEVARAEVLRLEPKHSGDYVLLSNLYASERRWLDVQNVRRTMLMKGVRKTPGYSLVE 525
           +E+G+     ++ + P  SG YV LSN+YAS+  W +V  +R  M  K +RK PG SL++
Sbjct: 459 VEMGKRVANILMDMVPHDSGAYVALSNMYASQGNWSEVSEMRLRMKEKDIRKDPGCSLID 518

Query: 526 LKNRVYEFIMGDRSHPQSEETYAMLGKITELLKIEGYVPRTVNVLADIEEEEKETALSHH 585
           +   ++EF++ D SHP+++E  +ML +I++ L++ GY P T  VL ++EEE+KE  L +H
Sbjct: 519 IDGVLHEFVVEDDSHPKAKEINSMLVEISDKLRLAGYRPITTQVLLNLEEEDKENVLHYH 578

Query: 586 TEKVAIAFMLVNTPQRTPIRIMKNLRVCADCHLAIKLISKVFEREIIVRDRSRFHHFKDS 604
           +EK+A AF L++T    PIRI+KNLR+C DCH +IKLISKV++R+I VRDR RFHHF+D 
Sbjct: 579 SEKIATAFGLISTSPGKPIRIVKNLRICEDCHSSIKLISKVYKRKITVRDRKRFHHFQDG 638

BLAST of MC01g_new0482 vs. ExPASy Swiss-Prot
Match: Q9LN01 (Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H12 PE=2 SV=1)

HSP 1 Score: 470.3 bits (1209), Expect = 3.2e-131
Identity = 225/523 (43.02%), Postives = 349/523 (66.73%), Query Frame = 0

Query: 83  ATRIFRLIRAPNIFTWNTMIRGFAESENPRPAVELYCQMHASSVLPDTHTFPFLLKAAAK 142
           A ++F  I   ++ +WN MI G+AE+ N + A+EL+  M  ++V PD  T   ++ A A+
Sbjct: 219 AQKLFDEIPVKDVVSWNAMISGYAETGNYKEALELFKDMMKTNVRPDESTMVTVVSACAQ 278

Query: 143 LMDVRVGEEIHSIVVRNGFGSLLFVQNSLVHMYSVFGFAESAYQVFEFMLERDLVAWNSV 202
              + +G ++H  +  +GFGS L + N+L+ +YS  G  E+A  +FE +  +D+++WN++
Sbjct: 279 SGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSKCGELETACGLFERLPYKDVISWNTL 338

Query: 203 INGFALNGMANEALTLFREMGLDGVEPDGFTMVSLLSACVELGAMALGERVHVYMLK--V 262
           I G+    +  EAL LF+EM   G  P+  TM+S+L AC  LGA+ +G  +HVY+ K   
Sbjct: 339 IGGYTHMNLYKEALLLFQEMLRSGETPNDVTMLSILPACAHLGAIDIGRWIHVYIDKRLK 398

Query: 263 GLVHNPHASNALLDLYSKCGNIRDALKVFDEMEERSVVSWTSLIVGLAVNGLGNEALELF 322
           G+ +      +L+D+Y+KCG+I  A +VF+ +  +S+ SW ++I G A++G  + + +LF
Sbjct: 399 GVTNASSLRTSLIDMYAKCGDIEAAHQVFNSILHKSLSSWNAMIFGFAMHGRADASFDLF 458

Query: 323 GELERKGLKPSEITFVGVLYACSHCGMVEEGFDYFRRMKDEYGILPRIEHHGCIVDLLCR 382
             + + G++P +ITFVG+L ACSH GM++ G   FR M  +Y + P++EH+GC++DLL  
Sbjct: 459 SRMRKIGIQPDDITFVGLLSACSHSGMLDLGRHIFRTMTQDYKMTPKLEHYGCMIDLLGH 518

Query: 383 AGKVGDAYEYIRNISIPPNAVIWRTLLGACTIHGHLELGEVARAEVLRLEPKHSGDYVLL 442
           +G   +A E I  + + P+ VIW +LL AC +HG++ELGE     ++++EP++ G YVLL
Sbjct: 519 SGLFKEAEEMINMMEMEPDGVIWCSLLKACKMHGNVELGESFAENLIKIEPENPGSYVLL 578

Query: 443 SNLYASERRWLDVQNVRRTMLMKGVRKTPGYSLVELKNRVYEFIMGDRSHPQSEETYAML 502
           SN+YAS  RW +V   R  +  KG++K PG S +E+ + V+EFI+GD+ HP++ E Y ML
Sbjct: 579 SNIYASAGRWNEVAKTRALLNDKGMKKVPGCSSIEIDSVVHEFIIGDKFHPRNREIYGML 638

Query: 503 GKITELLKIEGYVPRTVNVLADIEEEEKETALSHHTEKVAIAFMLVNTPQRTPIRIMKNL 562
            ++  LL+  G+VP T  VL ++EEE KE AL HH+EK+AIAF L++T   T + I+KNL
Sbjct: 639 EEMEVLLEKAGFVPDTSEVLQEMEEEWKEGALRHHSEKLAIAFGLISTKPGTKLTIVKNL 698

Query: 563 RVCADCHLAIKLISKVFEREIIVRDRSRFHHFKDSSCSCRDYW 604
           RVC +CH A KLISK+++REII RDR+RFHHF+D  CSC DYW
Sbjct: 699 RVCRNCHEATKLISKIYKREIIARDRTRFHHFRDGVCSCNDYW 741

BLAST of MC01g_new0482 vs. ExPASy Swiss-Prot
Match: Q9LW63 (Putative pentatricopeptide repeat-containing protein At3g23330 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H32 PE=3 SV=1)

HSP 1 Score: 466.8 bits (1200), Expect = 3.6e-130
Identity = 218/519 (42.00%), Postives = 336/519 (64.74%), Query Frame = 0

Query: 85  RIFRLIRAPNIFTWNTMIRGFAESENPRPAVELYCQMHASSVLPDTHTFPFLLKAAAKLM 144
           R+F ++   ++ ++NT+I G+A+S     A+ +  +M  + + PD+ T   +L   ++ +
Sbjct: 197 RVFEVMPRKDVVSYNTIIAGYAQSGMYEDALRMVREMGTTDLKPDSFTLSSVLPIFSEYV 256

Query: 145 DVRVGEEIHSIVVRNGFGSLLFVQNSLVHMYSVFGFAESAYQVFEFMLERDLVAWNSVIN 204
           DV  G+EIH  V+R G  S +++ +SLV MY+     E + +VF  +  RD ++WNS++ 
Sbjct: 257 DVIKGKEIHGYVIRKGIDSDVYIGSSLVDMYAKSARIEDSERVFSRLYCRDGISWNSLVA 316

Query: 205 GFALNGMANEALTLFREMGLDGVEPDGFTMVSLLSACVELGAMALGERVHVYMLKVGLVH 264
           G+  NG  NEAL LFR+M    V+P      S++ AC  L  + LG+++H Y+L+ G   
Sbjct: 317 GYVQNGRYNEALRLFRQMVTAKVKPGAVAFSSVIPACAHLATLHLGKQLHGYVLRGGFGS 376

Query: 265 NPHASNALLDLYSKCGNIRDALKVFDEMEERSVVSWTSLIVGLAVNGLGNEALELFGELE 324
           N   ++AL+D+YSKCGNI+ A K+FD M     VSWT++I+G A++G G+EA+ LF E++
Sbjct: 377 NIFIASALVDMYSKCGNIKAARKIFDRMNVLDEVSWTAIIMGHALHGHGHEAVSLFEEMK 436

Query: 325 RKGLKPSEITFVGVLYACSHCGMVEEGFDYFRRMKDEYGILPRIEHHGCIVDLLCRAGKV 384
           R+G+KP+++ FV VL ACSH G+V+E + YF  M   YG+   +EH+  + DLL RAGK+
Sbjct: 437 RQGVKPNQVAFVAVLTACSHVGLVDEAWGYFNSMTKVYGLNQELEHYAAVADLLGRAGKL 496

Query: 385 GDAYEYIRNISIPPNAVIWRTLLGACTIHGHLELGEVARAEVLRLEPKHSGDYVLLSNLY 444
            +AY +I  + + P   +W TLL +C++H +LEL E    ++  ++ ++ G YVL+ N+Y
Sbjct: 497 EEAYNFISKMCVEPTGSVWSTLLSSCSVHKNLELAEKVAEKIFTVDSENMGAYVLMCNMY 556

Query: 445 ASERRWLDVQNVRRTMLMKGVRKTPGYSLVELKNRVYEFIMGDRSHPQSEETYAMLGKIT 504
           AS  RW ++  +R  M  KG+RK P  S +E+KN+ + F+ GDRSHP  ++    L  + 
Sbjct: 557 ASNGRWKEMAKLRLRMRKKGLRKKPACSWIEMKNKTHGFVSGDRSHPSMDKINEFLKAVM 616

Query: 505 ELLKIEGYVPRTVNVLADIEEEEKETALSHHTEKVAIAFMLVNTPQRTPIRIMKNLRVCA 564
           E ++ EGYV  T  VL D++EE K   L  H+E++A+AF ++NT   T IR+ KN+R+C 
Sbjct: 617 EQMEKEGYVADTSGVLHDVDEEHKRELLFGHSERLAVAFGIINTEPGTTIRVTKNIRICT 676

Query: 565 DCHLAIKLISKVFEREIIVRDRSRFHHFKDSSCSCRDYW 604
           DCH+AIK ISK+ EREIIVRD SRFHHF   +CSC DYW
Sbjct: 677 DCHVAIKFISKITEREIIVRDNSRFHHFNRGNCSCGDYW 715

BLAST of MC01g_new0482 vs. NCBI nr
Match: XP_022146486.1 (pentatricopeptide repeat-containing protein At4g21065 [Momordica charantia])

HSP 1 Score: 1217 bits (3150), Expect = 0.0
Identity = 603/603 (100.00%), Postives = 603/603 (100.00%), Query Frame = 0

Query: 1   MRSNQPSFFYSTARSVASTFPENHLTFLLRKCISLIQLCASSHFKLKQIHGFSIRHGVPP 60
           MRSNQPSFFYSTARSVASTFPENHLTFLLRKCISLIQLCASSHFKLKQIHGFSIRHGVPP
Sbjct: 1   MRSNQPSFFYSTARSVASTFPENHLTFLLRKCISLIQLCASSHFKLKQIHGFSIRHGVPP 60

Query: 61  HNPDMGKHLIFALVSLSAPMPYATRIFRLIRAPNIFTWNTMIRGFAESENPRPAVELYCQ 120
           HNPDMGKHLIFALVSLSAPMPYATRIFRLIRAPNIFTWNTMIRGFAESENPRPAVELYCQ
Sbjct: 61  HNPDMGKHLIFALVSLSAPMPYATRIFRLIRAPNIFTWNTMIRGFAESENPRPAVELYCQ 120

Query: 121 MHASSVLPDTHTFPFLLKAAAKLMDVRVGEEIHSIVVRNGFGSLLFVQNSLVHMYSVFGF 180
           MHASSVLPDTHTFPFLLKAAAKLMDVRVGEEIHSIVVRNGFGSLLFVQNSLVHMYSVFGF
Sbjct: 121 MHASSVLPDTHTFPFLLKAAAKLMDVRVGEEIHSIVVRNGFGSLLFVQNSLVHMYSVFGF 180

Query: 181 AESAYQVFEFMLERDLVAWNSVINGFALNGMANEALTLFREMGLDGVEPDGFTMVSLLSA 240
           AESAYQVFEFMLERDLVAWNSVINGFALNGMANEALTLFREMGLDGVEPDGFTMVSLLSA
Sbjct: 181 AESAYQVFEFMLERDLVAWNSVINGFALNGMANEALTLFREMGLDGVEPDGFTMVSLLSA 240

Query: 241 CVELGAMALGERVHVYMLKVGLVHNPHASNALLDLYSKCGNIRDALKVFDEMEERSVVSW 300
           CVELGAMALGERVHVYMLKVGLVHNPHASNALLDLYSKCGNIRDALKVFDEMEERSVVSW
Sbjct: 241 CVELGAMALGERVHVYMLKVGLVHNPHASNALLDLYSKCGNIRDALKVFDEMEERSVVSW 300

Query: 301 TSLIVGLAVNGLGNEALELFGELERKGLKPSEITFVGVLYACSHCGMVEEGFDYFRRMKD 360
           TSLIVGLAVNGLGNEALELFGELERKGLKPSEITFVGVLYACSHCGMVEEGFDYFRRMKD
Sbjct: 301 TSLIVGLAVNGLGNEALELFGELERKGLKPSEITFVGVLYACSHCGMVEEGFDYFRRMKD 360

Query: 361 EYGILPRIEHHGCIVDLLCRAGKVGDAYEYIRNISIPPNAVIWRTLLGACTIHGHLELGE 420
           EYGILPRIEHHGCIVDLLCRAGKVGDAYEYIRNISIPPNAVIWRTLLGACTIHGHLELGE
Sbjct: 361 EYGILPRIEHHGCIVDLLCRAGKVGDAYEYIRNISIPPNAVIWRTLLGACTIHGHLELGE 420

Query: 421 VARAEVLRLEPKHSGDYVLLSNLYASERRWLDVQNVRRTMLMKGVRKTPGYSLVELKNRV 480
           VARAEVLRLEPKHSGDYVLLSNLYASERRWLDVQNVRRTMLMKGVRKTPGYSLVELKNRV
Sbjct: 421 VARAEVLRLEPKHSGDYVLLSNLYASERRWLDVQNVRRTMLMKGVRKTPGYSLVELKNRV 480

Query: 481 YEFIMGDRSHPQSEETYAMLGKITELLKIEGYVPRTVNVLADIEEEEKETALSHHTEKVA 540
           YEFIMGDRSHPQSEETYAMLGKITELLKIEGYVPRTVNVLADIEEEEKETALSHHTEKVA
Sbjct: 481 YEFIMGDRSHPQSEETYAMLGKITELLKIEGYVPRTVNVLADIEEEEKETALSHHTEKVA 540

Query: 541 IAFMLVNTPQRTPIRIMKNLRVCADCHLAIKLISKVFEREIIVRDRSRFHHFKDSSCSCR 600
           IAFMLVNTPQRTPIRIMKNLRVCADCHLAIKLISKVFEREIIVRDRSRFHHFKDSSCSCR
Sbjct: 541 IAFMLVNTPQRTPIRIMKNLRVCADCHLAIKLISKVFEREIIVRDRSRFHHFKDSSCSCR 600

Query: 601 DYW 603
           DYW
Sbjct: 601 DYW 603

BLAST of MC01g_new0482 vs. NCBI nr
Match: XP_038882791.1 (pentatricopeptide repeat-containing protein At4g21065 [Benincasa hispida])

HSP 1 Score: 1093 bits (2827), Expect = 0.0
Identity = 538/609 (88.34%), Postives = 571/609 (93.76%), Query Frame = 0

Query: 1   MRSNQPSFFYSTARSVAST-----FPENHLTFLLRKCISLIQLCASSHFKLKQIHGFSIR 60
           MRS Q S  YSTARS+AST     FPENHL+F+LRKCISL+QLC SS  +LKQIH FSIR
Sbjct: 1   MRSTQRSLLYSTARSIASTSSSQNFPENHLSFILRKCISLLQLCGSSQSQLKQIHAFSIR 60

Query: 61  HGVPPHNPDMGKHLIFALVSLSAPMPYATRIFRLIRAPNIFTWNTMIRGFAESENPRPAV 120
           HGVPP NPD  KHLIFALVSLSAPM YAT IF  I+APNIFTWNTMIRGFAESENP PA+
Sbjct: 61  HGVPPPNPDFNKHLIFALVSLSAPMSYATLIFNQIQAPNIFTWNTMIRGFAESENPSPAI 120

Query: 121 ELYCQMHA-SSVLPDTHTFPFLLKAAAKLMDVRVGEEIHSIVVRNGFGSLLFVQNSLVHM 180
           ELY QM A SS+LPDTHTFPFL KA AKLMDVR+GE IHS+VVRNGF SLLFVQNSLVHM
Sbjct: 121 ELYSQMRAASSILPDTHTFPFLFKAVAKLMDVRLGEGIHSVVVRNGFDSLLFVQNSLVHM 180

Query: 181 YSVFGFAESAYQVFEFMLERDLVAWNSVINGFALNGMANEALTLFREMGLDGVEPDGFTM 240
           YSVFGFAESAYQVFEFM +RDLVAWNSVINGFALNGMANEALTLFREMG +GVEPDGFTM
Sbjct: 181 YSVFGFAESAYQVFEFMSDRDLVAWNSVINGFALNGMANEALTLFREMGFEGVEPDGFTM 240

Query: 241 VSLLSACVELGAMALGERVHVYMLKVGLVHNPHASNALLDLYSKCGNIRDALKVFDEMEE 300
           VSLLSACVELGA+ALGERVHVYMLKVGL+ N HASNALLDLYSKCGNIRDALK+FDEMEE
Sbjct: 241 VSLLSACVELGALALGERVHVYMLKVGLIQNLHASNALLDLYSKCGNIRDALKMFDEMEE 300

Query: 301 RSVVSWTSLIVGLAVNGLGNEALELFGELERKGLKPSEITFVGVLYACSHCGMVEEGFDY 360
           RSVVSWTSLIVGLAVNGLGN ALELFGELERKGLKPSEITFVGVLYACSHCGMV+EGF+Y
Sbjct: 301 RSVVSWTSLIVGLAVNGLGNRALELFGELERKGLKPSEITFVGVLYACSHCGMVDEGFNY 360

Query: 361 FRRMKDEYGILPRIEHHGCIVDLLCRAGKVGDAYEYIRNISIPPNAVIWRTLLGACTIHG 420
           FRRMK+EYGILPRIEHHGC+VDLLCRAGKVGDAY+YIRN+S+PPNAVIWRTLLGACTIHG
Sbjct: 361 FRRMKEEYGILPRIEHHGCMVDLLCRAGKVGDAYDYIRNMSVPPNAVIWRTLLGACTIHG 420

Query: 421 HLELGEVARAEVLRLEPKHSGDYVLLSNLYASERRWLDVQNVRRTMLMKGVRKTPGYSLV 480
           HLELGEVARAE+L LEPKH+GD+VLLSNLYASERRWLDVQNVRR MLMKGV+KTPGYSLV
Sbjct: 421 HLELGEVARAEILLLEPKHTGDFVLLSNLYASERRWLDVQNVRRMMLMKGVKKTPGYSLV 480

Query: 481 ELKNRVYEFIMGDRSHPQSEETYAMLGKITELLKIEGYVPRTVNVLADIEEEEKETALSH 540
           ELKNRVY+FIMGDRSHPQSEETYAML KITELLKIEGYVPRTVNVLADIEEEEKETALSH
Sbjct: 481 ELKNRVYKFIMGDRSHPQSEETYAMLTKITELLKIEGYVPRTVNVLADIEEEEKETALSH 540

Query: 541 HTEKVAIAFMLVNTPQRTPIRIMKNLRVCADCHLAIKLISKVFEREIIVRDRSRFHHFKD 600
           HTEKVAIAFMLVNTP +TPIRIMKNLR+CADCHLAIK+ISKVFEREI++RDRSRFHHFKD
Sbjct: 541 HTEKVAIAFMLVNTPPKTPIRIMKNLRICADCHLAIKIISKVFEREIVIRDRSRFHHFKD 600

Query: 601 SSCSCRDYW 603
            SCSC+DYW
Sbjct: 601 GSCSCKDYW 609

BLAST of MC01g_new0482 vs. NCBI nr
Match: XP_023002974.1 (pentatricopeptide repeat-containing protein At4g21065 [Cucurbita maxima])

HSP 1 Score: 1088 bits (2815), Expect = 0.0
Identity = 538/604 (89.07%), Postives = 566/604 (93.71%), Query Frame = 0

Query: 1   MRSNQPSFFYSTARSVASTFPENHLTFLLRKCISLIQLCASSHFKLKQIHGFSIRHGVPP 60
           MRSNQ S+  ST  +   TFPENHL+F+LR CISL+QLC SS  KLKQIH FSIRHGVPP
Sbjct: 1   MRSNQASYLASTFST--QTFPENHLSFILRNCISLLQLCGSSQSKLKQIHAFSIRHGVPP 60

Query: 61  HNPDMGKHLIFALVSLSAPMPYATRIFRLIRAPNIFTWNTMIRGFAESENPRPAVELYCQ 120
            NPD  KHLIF+LVS+SAPM YATRIF+ I+APNIFTWNTM+RGFAESENPRPAVELY Q
Sbjct: 61  PNPDFNKHLIFSLVSISAPMSYATRIFQQIQAPNIFTWNTMVRGFAESENPRPAVELYSQ 120

Query: 121 MHA-SSVLPDTHTFPFLLKAAAKLMDVRVGEEIHSIVVRNGFGSLLFVQNSLVHMYSVFG 180
           MHA SS+ PDTHTFPFL KA AKLMD R+GE IHSIVVRNGF SLLFVQNSLVHMYSVFG
Sbjct: 121 MHAASSIQPDTHTFPFLFKAVAKLMDARLGEGIHSIVVRNGFDSLLFVQNSLVHMYSVFG 180

Query: 181 FAESAYQVFEFMLERDLVAWNSVINGFALNGMANEALTLFREMGLDGVEPDGFTMVSLLS 240
           FAESAY+VFEFM ERDLVAWNSVINGFALNGMANEALTLF+EMG  GV+PDGFTMVSLLS
Sbjct: 181 FAESAYKVFEFMSERDLVAWNSVINGFALNGMANEALTLFKEMGSVGVKPDGFTMVSLLS 240

Query: 241 ACVELGAMALGERVHVYMLKVGLVHNPHASNALLDLYSKCGNIRDALKVFDEMEERSVVS 300
           ACVELGA+ALGERVHVYMLKVGLV NPHASNALLDLYSKCGNI DALKVFDEM ERSVVS
Sbjct: 241 ACVELGALALGERVHVYMLKVGLVQNPHASNALLDLYSKCGNIIDALKVFDEMHERSVVS 300

Query: 301 WTSLIVGLAVNGLGNEALELFGELERKGLKPSEITFVGVLYACSHCGMVEEGFDYFRRMK 360
           WTSLIVGLAVNGLGNEAL+LFGELERKGLKPSEITFVGVLYACSHCGMV+EGFDYFRRMK
Sbjct: 301 WTSLIVGLAVNGLGNEALKLFGELERKGLKPSEITFVGVLYACSHCGMVDEGFDYFRRMK 360

Query: 361 DEYGILPRIEHHGCIVDLLCRAGKVGDAYEYIRNISIPPNAVIWRTLLGACTIHGHLELG 420
           +EYGILPRIEHHGCIVDLLCRAGKV DAYEYIRN+S+PPNAVIWRTLLGACTIHGHLELG
Sbjct: 361 EEYGILPRIEHHGCIVDLLCRAGKVRDAYEYIRNMSVPPNAVIWRTLLGACTIHGHLELG 420

Query: 421 EVARAEVLRLEPKHSGDYVLLSNLYASERRWLDVQNVRRTMLMKGVRKTPGYSLVELKNR 480
           E+ARAE+L+LEPKH GD+VLLSNLYASERRWLDVQNVRRTMLMKGV+KTPGYSLVELKNR
Sbjct: 421 EIARAEILQLEPKHCGDFVLLSNLYASERRWLDVQNVRRTMLMKGVKKTPGYSLVELKNR 480

Query: 481 VYEFIMGDRSHPQSEETYAMLGKITELLKIEGYVPRTVNVLADIEEEEKETALSHHTEKV 540
           VYEFIMGDRSHPQSEETYAMLGKITE LKIEGYVPRTVNVLADIEEEEKETALSHHTEKV
Sbjct: 481 VYEFIMGDRSHPQSEETYAMLGKITESLKIEGYVPRTVNVLADIEEEEKETALSHHTEKV 540

Query: 541 AIAFMLVNTPQRTPIRIMKNLRVCADCHLAIKLISKVFEREIIVRDRSRFHHFKDSSCSC 600
           AIAFMLVNTP RTPIRIMKNLRVCADCHLAIKLISKVFEREI+VRDRSRFHHFKD  CSC
Sbjct: 541 AIAFMLVNTPPRTPIRIMKNLRVCADCHLAIKLISKVFEREIVVRDRSRFHHFKDGLCSC 600

Query: 601 RDYW 603
           +DYW
Sbjct: 601 KDYW 602

BLAST of MC01g_new0482 vs. NCBI nr
Match: XP_022926338.1 (pentatricopeptide repeat-containing protein At4g21065 [Cucurbita moschata])

HSP 1 Score: 1086 bits (2809), Expect = 0.0
Identity = 537/604 (88.91%), Postives = 565/604 (93.54%), Query Frame = 0

Query: 1   MRSNQPSFFYSTARSVASTFPENHLTFLLRKCISLIQLCASSHFKLKQIHGFSIRHGVPP 60
           MRSN+ S+  ST  +   TFPENHL+F+LRKCISL+QLC SS  KLKQIH FSIRHGVPP
Sbjct: 1   MRSNRASYLASTFST--QTFPENHLSFILRKCISLLQLCGSSRSKLKQIHAFSIRHGVPP 60

Query: 61  HNPDMGKHLIFALVSLSAPMPYATRIFRLIRAPNIFTWNTMIRGFAESENPRPAVELYCQ 120
            NPD  KHLIF+LVS+SAPM YATRIF+ I+APNIFTWNTM+RGFAESENPRPAVELY Q
Sbjct: 61  PNPDFNKHLIFSLVSISAPMTYATRIFQQIQAPNIFTWNTMVRGFAESENPRPAVELYSQ 120

Query: 121 MHA-SSVLPDTHTFPFLLKAAAKLMDVRVGEEIHSIVVRNGFGSLLFVQNSLVHMYSVFG 180
           MHA SS+ PDTHTFPFL KA AKLMD R+GE IHSIVVRNGF SLLFVQNSLVHMYSVFG
Sbjct: 121 MHAASSIQPDTHTFPFLFKAVAKLMDARLGEGIHSIVVRNGFDSLLFVQNSLVHMYSVFG 180

Query: 181 FAESAYQVFEFMLERDLVAWNSVINGFALNGMANEALTLFREMGLDGVEPDGFTMVSLLS 240
           FAESAY+VFEFM ERDLVAWNSVINGFALNGMANEALTLF+EMG  GVEPDGFTMVSLLS
Sbjct: 181 FAESAYKVFEFMSERDLVAWNSVINGFALNGMANEALTLFKEMGSVGVEPDGFTMVSLLS 240

Query: 241 ACVELGAMALGERVHVYMLKVGLVHNPHASNALLDLYSKCGNIRDALKVFDEMEERSVVS 300
           ACVELGA+ALGERVHVYMLKVGLV NPHASNALLDLYSKCGNI  ALKVFDEM ERSVVS
Sbjct: 241 ACVELGALALGERVHVYMLKVGLVQNPHASNALLDLYSKCGNITHALKVFDEMHERSVVS 300

Query: 301 WTSLIVGLAVNGLGNEALELFGELERKGLKPSEITFVGVLYACSHCGMVEEGFDYFRRMK 360
           WTSLIVGLAVNGLGNEAL+LFGELERKGLKPSEITFVGVLYACSHCGMV+EGFDYFRRMK
Sbjct: 301 WTSLIVGLAVNGLGNEALKLFGELERKGLKPSEITFVGVLYACSHCGMVDEGFDYFRRMK 360

Query: 361 DEYGILPRIEHHGCIVDLLCRAGKVGDAYEYIRNISIPPNAVIWRTLLGACTIHGHLELG 420
           +EYGILPRIEHHGCIVDLLCRAGKV DAY+YIRN+S+PPNAVIWRTLLGACTIHGHLELG
Sbjct: 361 EEYGILPRIEHHGCIVDLLCRAGKVRDAYQYIRNMSVPPNAVIWRTLLGACTIHGHLELG 420

Query: 421 EVARAEVLRLEPKHSGDYVLLSNLYASERRWLDVQNVRRTMLMKGVRKTPGYSLVELKNR 480
           E+ARAE+L+LEPKH GDYVLLSNLYASERRWLDVQNVRRTMLMKGV+KTPGYSLVELKNR
Sbjct: 421 EIARAEILQLEPKHCGDYVLLSNLYASERRWLDVQNVRRTMLMKGVKKTPGYSLVELKNR 480

Query: 481 VYEFIMGDRSHPQSEETYAMLGKITELLKIEGYVPRTVNVLADIEEEEKETALSHHTEKV 540
           VYEFIMGDRSHPQSEETYAMLGKITE LKIEGYVPRTVNVLADIEEEEKETALSHHTEKV
Sbjct: 481 VYEFIMGDRSHPQSEETYAMLGKITESLKIEGYVPRTVNVLADIEEEEKETALSHHTEKV 540

Query: 541 AIAFMLVNTPQRTPIRIMKNLRVCADCHLAIKLISKVFEREIIVRDRSRFHHFKDSSCSC 600
           AIAFMLVNTP  TPIRIMKNLRVCADCHLAIKLISKVFEREI+VRDRSRFHHFKD  CSC
Sbjct: 541 AIAFMLVNTPPGTPIRIMKNLRVCADCHLAIKLISKVFEREIVVRDRSRFHHFKDGLCSC 600

Query: 601 RDYW 603
           +DYW
Sbjct: 601 KDYW 602

BLAST of MC01g_new0482 vs. NCBI nr
Match: KAG6594433.1 (Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1078 bits (2788), Expect = 0.0
Identity = 534/603 (88.56%), Postives = 562/603 (93.20%), Query Frame = 0

Query: 1   MRSNQPSFFYSTARSVASTFPENHLTFLLRKCISLIQLCASSHFKLKQIHGFSIRHGVPP 60
           MRSNQ  +  ST  +   TFPENHL+F+LRKCISL+QLC SS  KLKQIH FSIRHGVPP
Sbjct: 1   MRSNQAPYLASTFST--QTFPENHLSFILRKCISLLQLCGSSQSKLKQIHAFSIRHGVPP 60

Query: 61  HNPDMGKHLIFALVSLSAPMPYATRIFRLIRAPNIFTWNTMIRGFAESENPRPAVELYCQ 120
            NPD  KHLIF+LVS+SAPM YATRIF+ I+APNIFTWNTM+RGF+ESENPRPAVELY Q
Sbjct: 61  PNPDFNKHLIFSLVSISAPMSYATRIFQQIQAPNIFTWNTMVRGFSESENPRPAVELYSQ 120

Query: 121 MHA-SSVLPDTHTFPFLLKAAAKLMDVRVGEEIHSIVVRNGFGSLLFVQNSLVHMYSVFG 180
           MHA SS+ PDTHTFPFL KA AKLMD R+GE IHSIVVRNGF SLLFVQNSLVHMYSVFG
Sbjct: 121 MHAASSIQPDTHTFPFLFKAVAKLMDARLGEGIHSIVVRNGFDSLLFVQNSLVHMYSVFG 180

Query: 181 FAESAYQVFEFMLERDLVAWNSVINGFALNGMANEALTLFREMGLDGVEPDGFTMVSLLS 240
           FAESAY+VFEFM ERDLVAWNSVINGFALNGMANEALTLF+EMG  GVEPDGFTMVSLLS
Sbjct: 181 FAESAYKVFEFMSERDLVAWNSVINGFALNGMANEALTLFKEMGSVGVEPDGFTMVSLLS 240

Query: 241 ACVELGAMALGERVHVYMLKVGLVHNPHASNALLDLYSKCGNIRDALKVFDEMEERSVVS 300
           ACVELGA+ALGERVHVYMLKVGLV NPHASNALLDLYSKCGNI  ALKVFDEM ERSVVS
Sbjct: 241 ACVELGALALGERVHVYMLKVGLVQNPHASNALLDLYSKCGNITHALKVFDEMHERSVVS 300

Query: 301 WTSLIVGLAVNGLGNEALELFGELERKGLKPSEITFVGVLYACSHCGMVEEGFDYFRRMK 360
           WTSLIVGLAVNGLGNEAL+ FGELERKGLKPSEITFVGVLYACSHCGMV+EGFDYFRRMK
Sbjct: 301 WTSLIVGLAVNGLGNEALKQFGELERKGLKPSEITFVGVLYACSHCGMVDEGFDYFRRMK 360

Query: 361 DEYGILPRIEHHGCIVDLLCRAGKVGDAYEYIRNISIPPNAVIWRTLLGACTIHGHLELG 420
           +EYGILPRIEHHGCIVDLLCRAGKV DAY+YIRN+S+PPNAVIWRTLLGACTIHGHLELG
Sbjct: 361 EEYGILPRIEHHGCIVDLLCRAGKVRDAYQYIRNMSVPPNAVIWRTLLGACTIHGHLELG 420

Query: 421 EVARAEVLRLEPKHSGDYVLLSNLYASERRWLDVQNVRRTMLMKGVRKTPGYSLVELKNR 480
           E+ARAE+L+LEPKH GDYVLLSNLYASERRWLDVQNVRRTMLMKGV+KTPGYSLVELKNR
Sbjct: 421 EIARAEILQLEPKHCGDYVLLSNLYASERRWLDVQNVRRTMLMKGVKKTPGYSLVELKNR 480

Query: 481 VYEFIMGDRSHPQSEETYAMLGKITELLKIEGYVPRTVNVLADIEEEEKETALSHHTEKV 540
           VYEFIMGDRSHPQSEETYAMLGKITE LKIEGYVPRTVNVLADIEEEEKETALSHHTEKV
Sbjct: 481 VYEFIMGDRSHPQSEETYAMLGKITESLKIEGYVPRTVNVLADIEEEEKETALSHHTEKV 540

Query: 541 AIAFMLVNTPQRTPIRIMKNLRVCADCHLAIKLISKVFEREIIVRDRSRFHHFKDSSCSC 600
           AIAFMLVNTP  TPIRIMKNLRVCADCHLAIKLISKVFEREI+VRDRSRFHHFKD  CSC
Sbjct: 541 AIAFMLVNTPPGTPIRIMKNLRVCADCHLAIKLISKVFEREIVVRDRSRFHHFKDGLCSC 600

Query: 601 RDY 602
           +DY
Sbjct: 601 KDY 601

BLAST of MC01g_new0482 vs. ExPASy TrEMBL
Match: A0A6J1CZH8 (pentatricopeptide repeat-containing protein At4g21065 OS=Momordica charantia OX=3673 GN=LOC111015693 PE=3 SV=1)

HSP 1 Score: 1217 bits (3150), Expect = 0.0
Identity = 603/603 (100.00%), Postives = 603/603 (100.00%), Query Frame = 0

Query: 1   MRSNQPSFFYSTARSVASTFPENHLTFLLRKCISLIQLCASSHFKLKQIHGFSIRHGVPP 60
           MRSNQPSFFYSTARSVASTFPENHLTFLLRKCISLIQLCASSHFKLKQIHGFSIRHGVPP
Sbjct: 1   MRSNQPSFFYSTARSVASTFPENHLTFLLRKCISLIQLCASSHFKLKQIHGFSIRHGVPP 60

Query: 61  HNPDMGKHLIFALVSLSAPMPYATRIFRLIRAPNIFTWNTMIRGFAESENPRPAVELYCQ 120
           HNPDMGKHLIFALVSLSAPMPYATRIFRLIRAPNIFTWNTMIRGFAESENPRPAVELYCQ
Sbjct: 61  HNPDMGKHLIFALVSLSAPMPYATRIFRLIRAPNIFTWNTMIRGFAESENPRPAVELYCQ 120

Query: 121 MHASSVLPDTHTFPFLLKAAAKLMDVRVGEEIHSIVVRNGFGSLLFVQNSLVHMYSVFGF 180
           MHASSVLPDTHTFPFLLKAAAKLMDVRVGEEIHSIVVRNGFGSLLFVQNSLVHMYSVFGF
Sbjct: 121 MHASSVLPDTHTFPFLLKAAAKLMDVRVGEEIHSIVVRNGFGSLLFVQNSLVHMYSVFGF 180

Query: 181 AESAYQVFEFMLERDLVAWNSVINGFALNGMANEALTLFREMGLDGVEPDGFTMVSLLSA 240
           AESAYQVFEFMLERDLVAWNSVINGFALNGMANEALTLFREMGLDGVEPDGFTMVSLLSA
Sbjct: 181 AESAYQVFEFMLERDLVAWNSVINGFALNGMANEALTLFREMGLDGVEPDGFTMVSLLSA 240

Query: 241 CVELGAMALGERVHVYMLKVGLVHNPHASNALLDLYSKCGNIRDALKVFDEMEERSVVSW 300
           CVELGAMALGERVHVYMLKVGLVHNPHASNALLDLYSKCGNIRDALKVFDEMEERSVVSW
Sbjct: 241 CVELGAMALGERVHVYMLKVGLVHNPHASNALLDLYSKCGNIRDALKVFDEMEERSVVSW 300

Query: 301 TSLIVGLAVNGLGNEALELFGELERKGLKPSEITFVGVLYACSHCGMVEEGFDYFRRMKD 360
           TSLIVGLAVNGLGNEALELFGELERKGLKPSEITFVGVLYACSHCGMVEEGFDYFRRMKD
Sbjct: 301 TSLIVGLAVNGLGNEALELFGELERKGLKPSEITFVGVLYACSHCGMVEEGFDYFRRMKD 360

Query: 361 EYGILPRIEHHGCIVDLLCRAGKVGDAYEYIRNISIPPNAVIWRTLLGACTIHGHLELGE 420
           EYGILPRIEHHGCIVDLLCRAGKVGDAYEYIRNISIPPNAVIWRTLLGACTIHGHLELGE
Sbjct: 361 EYGILPRIEHHGCIVDLLCRAGKVGDAYEYIRNISIPPNAVIWRTLLGACTIHGHLELGE 420

Query: 421 VARAEVLRLEPKHSGDYVLLSNLYASERRWLDVQNVRRTMLMKGVRKTPGYSLVELKNRV 480
           VARAEVLRLEPKHSGDYVLLSNLYASERRWLDVQNVRRTMLMKGVRKTPGYSLVELKNRV
Sbjct: 421 VARAEVLRLEPKHSGDYVLLSNLYASERRWLDVQNVRRTMLMKGVRKTPGYSLVELKNRV 480

Query: 481 YEFIMGDRSHPQSEETYAMLGKITELLKIEGYVPRTVNVLADIEEEEKETALSHHTEKVA 540
           YEFIMGDRSHPQSEETYAMLGKITELLKIEGYVPRTVNVLADIEEEEKETALSHHTEKVA
Sbjct: 481 YEFIMGDRSHPQSEETYAMLGKITELLKIEGYVPRTVNVLADIEEEEKETALSHHTEKVA 540

Query: 541 IAFMLVNTPQRTPIRIMKNLRVCADCHLAIKLISKVFEREIIVRDRSRFHHFKDSSCSCR 600
           IAFMLVNTPQRTPIRIMKNLRVCADCHLAIKLISKVFEREIIVRDRSRFHHFKDSSCSCR
Sbjct: 541 IAFMLVNTPQRTPIRIMKNLRVCADCHLAIKLISKVFEREIIVRDRSRFHHFKDSSCSCR 600

Query: 601 DYW 603
           DYW
Sbjct: 601 DYW 603

BLAST of MC01g_new0482 vs. ExPASy TrEMBL
Match: A0A6J1KRZ9 (pentatricopeptide repeat-containing protein At4g21065 OS=Cucurbita maxima OX=3661 GN=LOC111496722 PE=3 SV=1)

HSP 1 Score: 1088 bits (2815), Expect = 0.0
Identity = 538/604 (89.07%), Postives = 566/604 (93.71%), Query Frame = 0

Query: 1   MRSNQPSFFYSTARSVASTFPENHLTFLLRKCISLIQLCASSHFKLKQIHGFSIRHGVPP 60
           MRSNQ S+  ST  +   TFPENHL+F+LR CISL+QLC SS  KLKQIH FSIRHGVPP
Sbjct: 1   MRSNQASYLASTFST--QTFPENHLSFILRNCISLLQLCGSSQSKLKQIHAFSIRHGVPP 60

Query: 61  HNPDMGKHLIFALVSLSAPMPYATRIFRLIRAPNIFTWNTMIRGFAESENPRPAVELYCQ 120
            NPD  KHLIF+LVS+SAPM YATRIF+ I+APNIFTWNTM+RGFAESENPRPAVELY Q
Sbjct: 61  PNPDFNKHLIFSLVSISAPMSYATRIFQQIQAPNIFTWNTMVRGFAESENPRPAVELYSQ 120

Query: 121 MHA-SSVLPDTHTFPFLLKAAAKLMDVRVGEEIHSIVVRNGFGSLLFVQNSLVHMYSVFG 180
           MHA SS+ PDTHTFPFL KA AKLMD R+GE IHSIVVRNGF SLLFVQNSLVHMYSVFG
Sbjct: 121 MHAASSIQPDTHTFPFLFKAVAKLMDARLGEGIHSIVVRNGFDSLLFVQNSLVHMYSVFG 180

Query: 181 FAESAYQVFEFMLERDLVAWNSVINGFALNGMANEALTLFREMGLDGVEPDGFTMVSLLS 240
           FAESAY+VFEFM ERDLVAWNSVINGFALNGMANEALTLF+EMG  GV+PDGFTMVSLLS
Sbjct: 181 FAESAYKVFEFMSERDLVAWNSVINGFALNGMANEALTLFKEMGSVGVKPDGFTMVSLLS 240

Query: 241 ACVELGAMALGERVHVYMLKVGLVHNPHASNALLDLYSKCGNIRDALKVFDEMEERSVVS 300
           ACVELGA+ALGERVHVYMLKVGLV NPHASNALLDLYSKCGNI DALKVFDEM ERSVVS
Sbjct: 241 ACVELGALALGERVHVYMLKVGLVQNPHASNALLDLYSKCGNIIDALKVFDEMHERSVVS 300

Query: 301 WTSLIVGLAVNGLGNEALELFGELERKGLKPSEITFVGVLYACSHCGMVEEGFDYFRRMK 360
           WTSLIVGLAVNGLGNEAL+LFGELERKGLKPSEITFVGVLYACSHCGMV+EGFDYFRRMK
Sbjct: 301 WTSLIVGLAVNGLGNEALKLFGELERKGLKPSEITFVGVLYACSHCGMVDEGFDYFRRMK 360

Query: 361 DEYGILPRIEHHGCIVDLLCRAGKVGDAYEYIRNISIPPNAVIWRTLLGACTIHGHLELG 420
           +EYGILPRIEHHGCIVDLLCRAGKV DAYEYIRN+S+PPNAVIWRTLLGACTIHGHLELG
Sbjct: 361 EEYGILPRIEHHGCIVDLLCRAGKVRDAYEYIRNMSVPPNAVIWRTLLGACTIHGHLELG 420

Query: 421 EVARAEVLRLEPKHSGDYVLLSNLYASERRWLDVQNVRRTMLMKGVRKTPGYSLVELKNR 480
           E+ARAE+L+LEPKH GD+VLLSNLYASERRWLDVQNVRRTMLMKGV+KTPGYSLVELKNR
Sbjct: 421 EIARAEILQLEPKHCGDFVLLSNLYASERRWLDVQNVRRTMLMKGVKKTPGYSLVELKNR 480

Query: 481 VYEFIMGDRSHPQSEETYAMLGKITELLKIEGYVPRTVNVLADIEEEEKETALSHHTEKV 540
           VYEFIMGDRSHPQSEETYAMLGKITE LKIEGYVPRTVNVLADIEEEEKETALSHHTEKV
Sbjct: 481 VYEFIMGDRSHPQSEETYAMLGKITESLKIEGYVPRTVNVLADIEEEEKETALSHHTEKV 540

Query: 541 AIAFMLVNTPQRTPIRIMKNLRVCADCHLAIKLISKVFEREIIVRDRSRFHHFKDSSCSC 600
           AIAFMLVNTP RTPIRIMKNLRVCADCHLAIKLISKVFEREI+VRDRSRFHHFKD  CSC
Sbjct: 541 AIAFMLVNTPPRTPIRIMKNLRVCADCHLAIKLISKVFEREIVVRDRSRFHHFKDGLCSC 600

Query: 601 RDYW 603
           +DYW
Sbjct: 601 KDYW 602

BLAST of MC01g_new0482 vs. ExPASy TrEMBL
Match: A0A6J1EHS2 (pentatricopeptide repeat-containing protein At4g21065 OS=Cucurbita moschata OX=3662 GN=LOC111433518 PE=3 SV=1)

HSP 1 Score: 1086 bits (2809), Expect = 0.0
Identity = 537/604 (88.91%), Postives = 565/604 (93.54%), Query Frame = 0

Query: 1   MRSNQPSFFYSTARSVASTFPENHLTFLLRKCISLIQLCASSHFKLKQIHGFSIRHGVPP 60
           MRSN+ S+  ST  +   TFPENHL+F+LRKCISL+QLC SS  KLKQIH FSIRHGVPP
Sbjct: 1   MRSNRASYLASTFST--QTFPENHLSFILRKCISLLQLCGSSRSKLKQIHAFSIRHGVPP 60

Query: 61  HNPDMGKHLIFALVSLSAPMPYATRIFRLIRAPNIFTWNTMIRGFAESENPRPAVELYCQ 120
            NPD  KHLIF+LVS+SAPM YATRIF+ I+APNIFTWNTM+RGFAESENPRPAVELY Q
Sbjct: 61  PNPDFNKHLIFSLVSISAPMTYATRIFQQIQAPNIFTWNTMVRGFAESENPRPAVELYSQ 120

Query: 121 MHA-SSVLPDTHTFPFLLKAAAKLMDVRVGEEIHSIVVRNGFGSLLFVQNSLVHMYSVFG 180
           MHA SS+ PDTHTFPFL KA AKLMD R+GE IHSIVVRNGF SLLFVQNSLVHMYSVFG
Sbjct: 121 MHAASSIQPDTHTFPFLFKAVAKLMDARLGEGIHSIVVRNGFDSLLFVQNSLVHMYSVFG 180

Query: 181 FAESAYQVFEFMLERDLVAWNSVINGFALNGMANEALTLFREMGLDGVEPDGFTMVSLLS 240
           FAESAY+VFEFM ERDLVAWNSVINGFALNGMANEALTLF+EMG  GVEPDGFTMVSLLS
Sbjct: 181 FAESAYKVFEFMSERDLVAWNSVINGFALNGMANEALTLFKEMGSVGVEPDGFTMVSLLS 240

Query: 241 ACVELGAMALGERVHVYMLKVGLVHNPHASNALLDLYSKCGNIRDALKVFDEMEERSVVS 300
           ACVELGA+ALGERVHVYMLKVGLV NPHASNALLDLYSKCGNI  ALKVFDEM ERSVVS
Sbjct: 241 ACVELGALALGERVHVYMLKVGLVQNPHASNALLDLYSKCGNITHALKVFDEMHERSVVS 300

Query: 301 WTSLIVGLAVNGLGNEALELFGELERKGLKPSEITFVGVLYACSHCGMVEEGFDYFRRMK 360
           WTSLIVGLAVNGLGNEAL+LFGELERKGLKPSEITFVGVLYACSHCGMV+EGFDYFRRMK
Sbjct: 301 WTSLIVGLAVNGLGNEALKLFGELERKGLKPSEITFVGVLYACSHCGMVDEGFDYFRRMK 360

Query: 361 DEYGILPRIEHHGCIVDLLCRAGKVGDAYEYIRNISIPPNAVIWRTLLGACTIHGHLELG 420
           +EYGILPRIEHHGCIVDLLCRAGKV DAY+YIRN+S+PPNAVIWRTLLGACTIHGHLELG
Sbjct: 361 EEYGILPRIEHHGCIVDLLCRAGKVRDAYQYIRNMSVPPNAVIWRTLLGACTIHGHLELG 420

Query: 421 EVARAEVLRLEPKHSGDYVLLSNLYASERRWLDVQNVRRTMLMKGVRKTPGYSLVELKNR 480
           E+ARAE+L+LEPKH GDYVLLSNLYASERRWLDVQNVRRTMLMKGV+KTPGYSLVELKNR
Sbjct: 421 EIARAEILQLEPKHCGDYVLLSNLYASERRWLDVQNVRRTMLMKGVKKTPGYSLVELKNR 480

Query: 481 VYEFIMGDRSHPQSEETYAMLGKITELLKIEGYVPRTVNVLADIEEEEKETALSHHTEKV 540
           VYEFIMGDRSHPQSEETYAMLGKITE LKIEGYVPRTVNVLADIEEEEKETALSHHTEKV
Sbjct: 481 VYEFIMGDRSHPQSEETYAMLGKITESLKIEGYVPRTVNVLADIEEEEKETALSHHTEKV 540

Query: 541 AIAFMLVNTPQRTPIRIMKNLRVCADCHLAIKLISKVFEREIIVRDRSRFHHFKDSSCSC 600
           AIAFMLVNTP  TPIRIMKNLRVCADCHLAIKLISKVFEREI+VRDRSRFHHFKD  CSC
Sbjct: 541 AIAFMLVNTPPGTPIRIMKNLRVCADCHLAIKLISKVFEREIVVRDRSRFHHFKDGLCSC 600

Query: 601 RDYW 603
           +DYW
Sbjct: 601 KDYW 602

BLAST of MC01g_new0482 vs. ExPASy TrEMBL
Match: A0A5A7UEB3 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold255G009420 PE=3 SV=1)

HSP 1 Score: 1063 bits (2748), Expect = 0.0
Identity = 523/608 (86.02%), Postives = 561/608 (92.27%), Query Frame = 0

Query: 1   MRSNQPSFFYSTARSVAS----TFPENHLTFLLRKCISLIQLCASSHFKLKQIHGFSIRH 60
           MRS Q SF   T+R++ S    +FP++ L+F+LRKCISL+QLC SS  KLKQ+H FSIRH
Sbjct: 1   MRSTQLSFLCFTSRTITSNSSQSFPDSPLSFILRKCISLVQLCGSSQSKLKQVHAFSIRH 60

Query: 61  GVPPHNPDMGKHLIFALVSLSAPMPYATRIFRLIRAPNIFTWNTMIRGFAESENPRPAVE 120
           GVPP NPD  KHLIFALVSLSAPM +A RIF  I+APNIFTWNTMIRGFAESENP PAVE
Sbjct: 61  GVPPQNPDFNKHLIFALVSLSAPMSFAARIFNQIQAPNIFTWNTMIRGFAESENPSPAVE 120

Query: 121 LYCQMHA-SSVLPDTHTFPFLLKAAAKLMDVRVGEEIHSIVVRNGFGSLLFVQNSLVHMY 180
           L+ QMHA SS+LPDTHTFPFL KA AKLMDVR+GE IHS+VVRNGF SLLFVQNSLVHMY
Sbjct: 121 LFSQMHAASSILPDTHTFPFLFKAVAKLMDVRLGEAIHSVVVRNGFDSLLFVQNSLVHMY 180

Query: 181 SVFGFAESAYQVFEFMLERDLVAWNSVINGFALNGMANEALTLFREMGLDGVEPDGFTMV 240
           SVFGFAESAYQVFE M +RDLVAWNSVINGFALNGM NEALTLFREMG +GVEPDGFTMV
Sbjct: 181 SVFGFAESAYQVFEIMSDRDLVAWNSVINGFALNGMPNEALTLFREMGSEGVEPDGFTMV 240

Query: 241 SLLSACVELGAMALGERVHVYMLKVGLVHNPHASNALLDLYSKCGNIRDALKVFDEMEER 300
           SLLSACVEL A+ALGER H+YM+KVGLV N HASNALLDLYSKCGN +DA KVFDEMEER
Sbjct: 241 SLLSACVELRALALGERAHMYMVKVGLVRNQHASNALLDLYSKCGNFKDAQKVFDEMEER 300

Query: 301 SVVSWTSLIVGLAVNGLGNEALELFGELERKGLKPSEITFVGVLYACSHCGMVEEGFDYF 360
           SVVSWTSLIVG AVNGLGNEAL+LFGELER+GLKPSEITFVGVLYACSHCGM++EGFDYF
Sbjct: 301 SVVSWTSLIVGSAVNGLGNEALKLFGELERQGLKPSEITFVGVLYACSHCGMLDEGFDYF 360

Query: 361 RRMKDEYGILPRIEHHGCIVDLLCRAGKVGDAYEYIRNISIPPNAVIWRTLLGACTIHGH 420
           RRMK+EYGILPRIEHHGC+VDLLCRAGKVGDAY YIRN+ +PPNAVIWRTLLGACTIHGH
Sbjct: 361 RRMKEEYGILPRIEHHGCMVDLLCRAGKVGDAYNYIRNMPVPPNAVIWRTLLGACTIHGH 420

Query: 421 LELGEVARAEVLRLEPKHSGDYVLLSNLYASERRWLDVQNVRRTMLMKGVRKTPGYSLVE 480
           LELGEVARAE+ RLEP+HSGD+VLLSNLYASE RWLDVQN+R+TML+KGV+KTPGYSLVE
Sbjct: 421 LELGEVARAEIQRLEPRHSGDFVLLSNLYASEGRWLDVQNLRKTMLVKGVKKTPGYSLVE 480

Query: 481 LKNRVYEFIMGDRSHPQSEETYAMLGKITELLKIEGYVPRTVNVLADIEEEEKETALSHH 540
           LKNRVYEFIMGDRSHPQSEETYAML KITELLKIEGYVPRTVNVLADIEEEEKETALSHH
Sbjct: 481 LKNRVYEFIMGDRSHPQSEETYAMLAKITELLKIEGYVPRTVNVLADIEEEEKETALSHH 540

Query: 541 TEKVAIAFMLVNTPQRTPIRIMKNLRVCADCHLAIKLISKVFEREIIVRDRSRFHHFKDS 600
           TEKVAIAFMLVNTP RTPIRIMKNLRVCADCHLAIKLISKVFEREIIVRDRSRFHHFKD 
Sbjct: 541 TEKVAIAFMLVNTPPRTPIRIMKNLRVCADCHLAIKLISKVFEREIIVRDRSRFHHFKDG 600

Query: 601 SCSCRDYW 603
           SCSC+DYW
Sbjct: 601 SCSCKDYW 608

BLAST of MC01g_new0482 vs. ExPASy TrEMBL
Match: A0A1S3AZ16 (pentatricopeptide repeat-containing protein At4g21065 OS=Cucumis melo OX=3656 GN=LOC103484330 PE=3 SV=1)

HSP 1 Score: 1063 bits (2748), Expect = 0.0
Identity = 523/608 (86.02%), Postives = 561/608 (92.27%), Query Frame = 0

Query: 1   MRSNQPSFFYSTARSVAS----TFPENHLTFLLRKCISLIQLCASSHFKLKQIHGFSIRH 60
           MRS Q SF   T+R++ S    +FP++ L+F+LRKCISL+QLC SS  KLKQ+H FSIRH
Sbjct: 1   MRSTQLSFLCFTSRTITSNSSQSFPDSPLSFILRKCISLVQLCGSSQSKLKQVHAFSIRH 60

Query: 61  GVPPHNPDMGKHLIFALVSLSAPMPYATRIFRLIRAPNIFTWNTMIRGFAESENPRPAVE 120
           GVPP NPD  KHLIFALVSLSAPM +A RIF  I+APNIFTWNTMIRGFAESENP PAVE
Sbjct: 61  GVPPQNPDFNKHLIFALVSLSAPMSFAARIFNQIQAPNIFTWNTMIRGFAESENPSPAVE 120

Query: 121 LYCQMHA-SSVLPDTHTFPFLLKAAAKLMDVRVGEEIHSIVVRNGFGSLLFVQNSLVHMY 180
           L+ QMHA SS+LPDTHTFPFL KA AKLMDVR+GE IHS+VVRNGF SLLFVQNSLVHMY
Sbjct: 121 LFSQMHAASSILPDTHTFPFLFKAVAKLMDVRLGEAIHSVVVRNGFDSLLFVQNSLVHMY 180

Query: 181 SVFGFAESAYQVFEFMLERDLVAWNSVINGFALNGMANEALTLFREMGLDGVEPDGFTMV 240
           SVFGFAESAYQVFE M +RDLVAWNSVINGFALNGM NEALTLFREMG +GVEPDGFTMV
Sbjct: 181 SVFGFAESAYQVFEIMSDRDLVAWNSVINGFALNGMPNEALTLFREMGSEGVEPDGFTMV 240

Query: 241 SLLSACVELGAMALGERVHVYMLKVGLVHNPHASNALLDLYSKCGNIRDALKVFDEMEER 300
           SLLSACVEL A+ALGER H+YM+KVGLV N HASNALLDLYSKCGN +DA KVFDEMEER
Sbjct: 241 SLLSACVELRALALGERAHMYMVKVGLVRNQHASNALLDLYSKCGNFKDAQKVFDEMEER 300

Query: 301 SVVSWTSLIVGLAVNGLGNEALELFGELERKGLKPSEITFVGVLYACSHCGMVEEGFDYF 360
           SVVSWTSLIVG AVNGLGNEAL+LFGELER+GLKPSEITFVGVLYACSHCGM++EGFDYF
Sbjct: 301 SVVSWTSLIVGSAVNGLGNEALKLFGELERQGLKPSEITFVGVLYACSHCGMLDEGFDYF 360

Query: 361 RRMKDEYGILPRIEHHGCIVDLLCRAGKVGDAYEYIRNISIPPNAVIWRTLLGACTIHGH 420
           RRMK+EYGILPRIEHHGC+VDLLCRAGKVGDAY YIRN+ +PPNAVIWRTLLGACTIHGH
Sbjct: 361 RRMKEEYGILPRIEHHGCMVDLLCRAGKVGDAYNYIRNMPVPPNAVIWRTLLGACTIHGH 420

Query: 421 LELGEVARAEVLRLEPKHSGDYVLLSNLYASERRWLDVQNVRRTMLMKGVRKTPGYSLVE 480
           LELGEVARAE+ RLEP+HSGD+VLLSNLYASE RWLDVQN+R+TML+KGV+KTPGYSLVE
Sbjct: 421 LELGEVARAEIQRLEPRHSGDFVLLSNLYASEGRWLDVQNLRKTMLVKGVKKTPGYSLVE 480

Query: 481 LKNRVYEFIMGDRSHPQSEETYAMLGKITELLKIEGYVPRTVNVLADIEEEEKETALSHH 540
           LKNRVYEFIMGDRSHPQSEETYAML KITELLKIEGYVPRTVNVLADIEEEEKETALSHH
Sbjct: 481 LKNRVYEFIMGDRSHPQSEETYAMLAKITELLKIEGYVPRTVNVLADIEEEEKETALSHH 540

Query: 541 TEKVAIAFMLVNTPQRTPIRIMKNLRVCADCHLAIKLISKVFEREIIVRDRSRFHHFKDS 600
           TEKVAIAFMLVNTP RTPIRIMKNLRVCADCHLAIKLISKVFEREIIVRDRSRFHHFKD 
Sbjct: 541 TEKVAIAFMLVNTPPRTPIRIMKNLRVCADCHLAIKLISKVFEREIIVRDRSRFHHFKDG 600

Query: 601 SCSCRDYW 603
           SCSC+DYW
Sbjct: 601 SCSCKDYW 608

BLAST of MC01g_new0482 vs. TAIR 10
Match: AT4G21065.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 770.0 bits (1987), Expect = 1.4e-222
Identity = 371/594 (62.46%), Postives = 471/594 (79.29%), Query Frame = 0

Query: 18  STFPENHLTFL--LRKCISLIQLC-ASSHFKLKQIHGFSIRHGVPPHNPDMGKHLIFALV 77
           S F E  +  L  + KCI+L+Q    SS  KL+QIH FSIRHGV   + ++GKHLIF LV
Sbjct: 2   SPFSETSVLLLPMVEKCINLLQTYGVSSITKLRQIHAFSIRHGVSISDAELGKHLIFYLV 61

Query: 78  SLSAPMP--YATRIFRLIRAP-NIFTWNTMIRGFAESENPRPAVELYCQMHASSVL-PDT 137
           SL +P P  YA ++F  I  P N+F WNT+IRG+AE  N   A  LY +M  S ++ PDT
Sbjct: 62  SLPSPPPMSYAHKVFSKIEKPINVFIWNTLIRGYAEIGNSISAFSLYREMRVSGLVEPDT 121

Query: 138 HTFPFLLKAAAKLMDVRVGEEIHSIVVRNGFGSLLFVQNSLVHMYSVFGFAESAYQVFEF 197
           HT+PFL+KA   + DVR+GE IHS+V+R+GFGSL++VQNSL+H+Y+  G   SAY+VF+ 
Sbjct: 122 HTYPFLIKAVTTMADVRLGETIHSVVIRSGFGSLIYVQNSLLHLYANCGDVASAYKVFDK 181

Query: 198 MLERDLVAWNSVINGFALNGMANEALTLFREMGLDGVEPDGFTMVSLLSACVELGAMALG 257
           M E+DLVAWNSVINGFA NG   EAL L+ EM   G++PDGFT+VSLLSAC ++GA+ LG
Sbjct: 182 MPEKDLVAWNSVINGFAENGKPEEALALYTEMNSKGIKPDGFTIVSLLSACAKIGALTLG 241

Query: 258 ERVHVYMLKVGLVHNPHASNALLDLYSKCGNIRDALKVFDEMEERSVVSWTSLIVGLAVN 317
           +RVHVYM+KVGL  N H+SN LLDLY++CG + +A  +FDEM +++ VSWTSLIVGLAVN
Sbjct: 242 KRVHVYMIKVGLTRNLHSSNVLLDLYARCGRVEEAKTLFDEMVDKNSVSWTSLIVGLAVN 301

Query: 318 GLGNEALELFGELE-RKGLKPSEITFVGVLYACSHCGMVEEGFDYFRRMKDEYGILPRIE 377
           G G EA+ELF  +E  +GL P EITFVG+LYACSHCGMV+EGF+YFRRM++EY I PRIE
Sbjct: 302 GFGKEAIELFKYMESTEGLLPCEITFVGILYACSHCGMVKEGFEYFRRMREEYKIEPRIE 361

Query: 378 HHGCIVDLLCRAGKVGDAYEYIRNISIPPNAVIWRTLLGACTIHGHLELGEVARAEVLRL 437
           H GC+VDLL RAG+V  AYEYI+++ + PN VIWRTLLGACT+HG  +L E AR ++L+L
Sbjct: 362 HFGCMVDLLARAGQVKKAYEYIKSMPMQPNVVIWRTLLGACTVHGDSDLAEFARIQILQL 421

Query: 438 EPKHSGDYVLLSNLYASERRWLDVQNVRRTMLMKGVRKTPGYSLVELKNRVYEFIMGDRS 497
           EP HSGDYVLLSN+YASE+RW DVQ +R+ ML  GV+K PG+SLVE+ NRV+EF+MGD+S
Sbjct: 422 EPNHSGDYVLLSNMYASEQRWSDVQKIRKQMLRDGVKKVPGHSLVEVGNRVHEFLMGDKS 481

Query: 498 HPQSEETYAMLGKITELLKIEGYVPRTVNVLADIEEEEKETALSHHTEKVAIAFMLVNTP 557
           HPQS+  YA L ++T  L+ EGYVP+  NV  D+EEEEKE A+ +H+EK+AIAFML++TP
Sbjct: 482 HPQSDAIYAKLKEMTGRLRSEGYVPQISNVYVDVEEEEKENAVVYHSEKIAIAFMLISTP 541

Query: 558 QRTPIRIMKNLRVCADCHLAIKLISKVFEREIIVRDRSRFHHFKDSSCSCRDYW 604
           +R+PI ++KNLRVCADCHLAIKL+SKV+ REI+VRDRSRFHHFK+ SCSC+DYW
Sbjct: 542 ERSPITVVKNLRVCADCHLAIKLVSKVYNREIVVRDRSRFHHFKNGSCSCQDYW 595

BLAST of MC01g_new0482 vs. TAIR 10
Match: AT4G21065.2 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 660.2 bits (1702), Expect = 1.6e-189
Identity = 305/462 (66.02%), Postives = 386/462 (83.55%), Query Frame = 0

Query: 143 LMDVRVGEEIHSIVVRNGFGSLLFVQNSLVHMYSVFGFAESAYQVFEFMLERDLVAWNSV 202
           + DVR+GE IHS+V+R+GFGSL++VQNSL+H+Y+  G   SAY+VF+ M E+DLVAWNSV
Sbjct: 1   MADVRLGETIHSVVIRSGFGSLIYVQNSLLHLYANCGDVASAYKVFDKMPEKDLVAWNSV 60

Query: 203 INGFALNGMANEALTLFREMGLDGVEPDGFTMVSLLSACVELGAMALGERVHVYMLKVGL 262
           INGFA NG   EAL L+ EM   G++PDGFT+VSLLSAC ++GA+ LG+RVHVYM+KVGL
Sbjct: 61  INGFAENGKPEEALALYTEMNSKGIKPDGFTIVSLLSACAKIGALTLGKRVHVYMIKVGL 120

Query: 263 VHNPHASNALLDLYSKCGNIRDALKVFDEMEERSVVSWTSLIVGLAVNGLGNEALELFGE 322
             N H+SN LLDLY++CG + +A  +FDEM +++ VSWTSLIVGLAVNG G EA+ELF  
Sbjct: 121 TRNLHSSNVLLDLYARCGRVEEAKTLFDEMVDKNSVSWTSLIVGLAVNGFGKEAIELFKY 180

Query: 323 LE-RKGLKPSEITFVGVLYACSHCGMVEEGFDYFRRMKDEYGILPRIEHHGCIVDLLCRA 382
           +E  +GL P EITFVG+LYACSHCGMV+EGF+YFRRM++EY I PRIEH GC+VDLL RA
Sbjct: 181 MESTEGLLPCEITFVGILYACSHCGMVKEGFEYFRRMREEYKIEPRIEHFGCMVDLLARA 240

Query: 383 GKVGDAYEYIRNISIPPNAVIWRTLLGACTIHGHLELGEVARAEVLRLEPKHSGDYVLLS 442
           G+V  AYEYI+++ + PN VIWRTLLGACT+HG  +L E AR ++L+LEP HSGDYVLLS
Sbjct: 241 GQVKKAYEYIKSMPMQPNVVIWRTLLGACTVHGDSDLAEFARIQILQLEPNHSGDYVLLS 300

Query: 443 NLYASERRWLDVQNVRRTMLMKGVRKTPGYSLVELKNRVYEFIMGDRSHPQSEETYAMLG 502
           N+YASE+RW DVQ +R+ ML  GV+K PG+SLVE+ NRV+EF+MGD+SHPQS+  YA L 
Sbjct: 301 NMYASEQRWSDVQKIRKQMLRDGVKKVPGHSLVEVGNRVHEFLMGDKSHPQSDAIYAKLK 360

Query: 503 KITELLKIEGYVPRTVNVLADIEEEEKETALSHHTEKVAIAFMLVNTPQRTPIRIMKNLR 562
           ++T  L+ EGYVP+  NV  D+EEEEKE A+ +H+EK+AIAFML++TP+R+PI ++KNLR
Sbjct: 361 EMTGRLRSEGYVPQISNVYVDVEEEEKENAVVYHSEKIAIAFMLISTPERSPITVVKNLR 420

Query: 563 VCADCHLAIKLISKVFEREIIVRDRSRFHHFKDSSCSCRDYW 604
           VCADCHLAIKL+SKV+ REI+VRDRSRFHHFK+ SCSC+DYW
Sbjct: 421 VCADCHLAIKLVSKVYNREIVVRDRSRFHHFKNGSCSCQDYW 462

BLAST of MC01g_new0482 vs. TAIR 10
Match: AT1G74630.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 486.9 bits (1252), Expect = 2.4e-137
Identity = 253/644 (39.29%), Postives = 376/644 (58.39%), Query Frame = 0

Query: 25  LTFLLRKCISLIQLCASSHFKLKQIHGFSIRHGVPPHNPDMGKHLIFALVSLSAPMPYAT 84
           +T  +  C+SL+  C +    L QIHG  I++GV   +   GK ++   +S+S  +PYA 
Sbjct: 1   MTIAIHHCLSLLNSCKNLR-ALTQIHGLFIKYGVDTDSYFTGKLILHCAISISDALPYAR 60

Query: 85  RIFRLIRAPNIFTWNTMIRGFAESENPRPAVELYCQ-MHASSVLPDTHTFPFLLKAAAKL 144
           R+      P+ F +NT++RG++ES+ P  +V ++ + M    V PD+ +F F++KA    
Sbjct: 61  RLLLCFPEPDAFMFNTLVRGYSESDEPHNSVAVFVEMMRKGFVFPDSFSFAFVIKAVENF 120

Query: 145 MDVRVGEEIHSIVVRNGFGSLLFVQNSLVHMYSVFGFAESAYQVFEFMLERDLVAWNSVI 204
             +R G ++H   +++G  S LFV  +L+ MY   G  E A +VF+ M + +LVAWN+VI
Sbjct: 121 RSLRTGFQMHCQALKHGLESHLFVGTTLIGMYGGCGCVEFARKVFDEMHQPNLVAWNAVI 180

Query: 205 N----------------------------------------------------------- 264
                                                                       
Sbjct: 181 TACFRGNDVAGAREIFDKMLVRNHTSWNVMLAGYIKAGELESAKRIFSEMPHRDDVSWST 240

Query: 265 ---GFALNGMANEALTLFREMGLDGVEPDGFTMVSLLSACVELGAMALGERVHVYMLKVG 324
              G A NG  NE+   FRE+   G+ P+  ++  +LSAC + G+   G+ +H ++ K G
Sbjct: 241 MIVGIAHNGSFNESFLYFRELQRAGMSPNEVSLTGVLSACSQSGSFEFGKILHGFVEKAG 300

Query: 325 LVHNPHASNALLDLYSKCGNIRDALKVFDEMEE-RSVVSWTSLIVGLAVNGLGNEALELF 384
                  +NAL+D+YS+CGN+  A  VF+ M+E R +VSWTS+I GLA++G G EA+ LF
Sbjct: 301 YSWIVSVNNALIDMYSRCGNVPMARLVFEGMQEKRCIVSWTSMIAGLAMHGQGEEAVRLF 360

Query: 385 GELERKGLKPSEITFVGVLYACSHCGMVEEGFDYFRRMKDEYGILPRIEHHGCIVDLLCR 444
            E+   G+ P  I+F+ +L+ACSH G++EEG DYF  MK  Y I P IEH+GC+VDL  R
Sbjct: 361 NEMTAYGVTPDGISFISLLHACSHAGLIEEGEDYFSEMKRVYHIEPEIEHYGCMVDLYGR 420

Query: 445 AGKVGDAYEYIRNISIPPNAVIWRTLLGACTIHGHLELGEVARAEVLRLEPKHSGDYVLL 504
           +GK+  AY++I  + IPP A++WRTLLGAC+ HG++EL E  +  +  L+P +SGD VLL
Sbjct: 421 SGKLQKAYDFICQMPIPPTAIVWRTLLGACSSHGNIELAEQVKQRLNELDPNNSGDLVLL 480

Query: 505 SNLYASERRWLDVQNVRRTMLMKGVRKTPGYSLVELKNRVYEFIMGDRSHPQSEETYAML 564
           SN YA+  +W DV ++R++M+++ ++KT  +SLVE+   +Y+F  G++      E +  L
Sbjct: 481 SNAYATAGKWKDVASIRKSMIVQRIKKTTAWSLVEVGKTMYKFTAGEKKKGIDIEAHEKL 540

Query: 565 GKITELLKIE-GYVPRTVNVLADIEEEEKETALSHHTEKVAIAFMLVNTPQRTPIRIMKN 604
            +I   LK E GY P   + L D+EEEEKE  +S H+EK+A+AF L    +   IRI+KN
Sbjct: 541 KEIILRLKDEAGYTPEVASALYDVEEEEKEDQVSKHSEKLALAFALARLSKGANIRIVKN 600

BLAST of MC01g_new0482 vs. TAIR 10
Match: AT5G48910.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 471.1 bits (1211), Expect = 1.4e-132
Identity = 239/608 (39.31%), Postives = 371/608 (61.02%), Query Frame = 0

Query: 46  LKQIHGFSIRHGVPPHNPDMGKHLIFALVS--LSAPMPYATRIFRLIRAPNIFTWNTMIR 105
           L QIH   I+ G         + L F   S      + YA +IF  +   N F+WNT+IR
Sbjct: 39  LSQIHAVFIKSGQMRDTLAAAEILRFCATSDLHHRDLDYAHKIFNQMPQRNCFSWNTIIR 98

Query: 106 GFAESENPRPAVEL---YCQMHASSVLPDTHTFPFLLKAAAKLMDVRVGEEIHSIVVRNG 165
           GF+ES+  +  + +   Y  M    V P+  TFP +LKA AK   ++ G++IH + ++ G
Sbjct: 99  GFSESDEDKALIAITLFYEMMSDEFVEPNRFTFPSVLKACAKTGKIQEGKQIHGLALKYG 158

Query: 166 FGSLLFVQNSLVHMYSVFGF---------------------------------------- 225
           FG   FV ++LV MY + GF                                        
Sbjct: 159 FGGDEFVMSNLVRMYVMCGFMKDARVLFYKNIIEKDMVVMTDRRKRDGEIVLWNVMIDGY 218

Query: 226 -----AESAYQVFEFMLERDLVAWNSVINGFALNGMANEALTLFREMGLDGVEPDGFTMV 285
                 ++A  +F+ M +R +V+WN++I+G++LNG   +A+ +FREM    + P+  T+V
Sbjct: 219 MRLGDCKAARMLFDKMRQRSVVSWNTMISGYSLNGFFKDAVEVFREMKKGDIRPNYVTLV 278

Query: 286 SLLSACVELGAMALGERVHVYMLKVGLVHNPHASNALLDLYSKCGNIRDALKVFDEMEER 345
           S+L A   LG++ LGE +H+Y    G+  +    +AL+D+YSKCG I  A+ VF+ +   
Sbjct: 279 SVLPAISRLGSLELGEWLHLYAEDSGIRIDDVLGSALIDMYSKCGIIEKAIHVFERLPRE 338

Query: 346 SVVSWTSLIVGLAVNGLGNEALELFGELERKGLKPSEITFVGVLYACSHCGMVEEGFDYF 405
           +V++W+++I G A++G   +A++ F ++ + G++PS++ ++ +L ACSH G+VEEG  YF
Sbjct: 339 NVITWSAMINGFAIHGQAGDAIDCFCKMRQAGVRPSDVAYINLLTACSHGGLVEEGRRYF 398

Query: 406 RRMKDEYGILPRIEHHGCIVDLLCRAGKVGDAYEYIRNISIPPNAVIWRTLLGACTIHGH 465
            +M    G+ PRIEH+GC+VDLL R+G + +A E+I N+ I P+ VIW+ LLGAC + G+
Sbjct: 399 SQMVSVDGLEPRIEHYGCMVDLLGRSGLLDEAEEFILNMPIKPDDVIWKALLGACRMQGN 458

Query: 466 LELGEVARAEVLRLEPKHSGDYVLLSNLYASERRWLDVQNVRRTMLMKGVRKTPGYSLVE 525
           +E+G+     ++ + P  SG YV LSN+YAS+  W +V  +R  M  K +RK PG SL++
Sbjct: 459 VEMGKRVANILMDMVPHDSGAYVALSNMYASQGNWSEVSEMRLRMKEKDIRKDPGCSLID 518

Query: 526 LKNRVYEFIMGDRSHPQSEETYAMLGKITELLKIEGYVPRTVNVLADIEEEEKETALSHH 585
           +   ++EF++ D SHP+++E  +ML +I++ L++ GY P T  VL ++EEE+KE  L +H
Sbjct: 519 IDGVLHEFVVEDDSHPKAKEINSMLVEISDKLRLAGYRPITTQVLLNLEEEDKENVLHYH 578

Query: 586 TEKVAIAFMLVNTPQRTPIRIMKNLRVCADCHLAIKLISKVFEREIIVRDRSRFHHFKDS 604
           +EK+A AF L++T    PIRI+KNLR+C DCH +IKLISKV++R+I VRDR RFHHF+D 
Sbjct: 579 SEKIATAFGLISTSPGKPIRIVKNLRICEDCHSSIKLISKVYKRKITVRDRKRFHHFQDG 638

BLAST of MC01g_new0482 vs. TAIR 10
Match: AT1G08070.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 470.3 bits (1209), Expect = 2.3e-132
Identity = 225/523 (43.02%), Postives = 349/523 (66.73%), Query Frame = 0

Query: 83  ATRIFRLIRAPNIFTWNTMIRGFAESENPRPAVELYCQMHASSVLPDTHTFPFLLKAAAK 142
           A ++F  I   ++ +WN MI G+AE+ N + A+EL+  M  ++V PD  T   ++ A A+
Sbjct: 219 AQKLFDEIPVKDVVSWNAMISGYAETGNYKEALELFKDMMKTNVRPDESTMVTVVSACAQ 278

Query: 143 LMDVRVGEEIHSIVVRNGFGSLLFVQNSLVHMYSVFGFAESAYQVFEFMLERDLVAWNSV 202
              + +G ++H  +  +GFGS L + N+L+ +YS  G  E+A  +FE +  +D+++WN++
Sbjct: 279 SGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSKCGELETACGLFERLPYKDVISWNTL 338

Query: 203 INGFALNGMANEALTLFREMGLDGVEPDGFTMVSLLSACVELGAMALGERVHVYMLK--V 262
           I G+    +  EAL LF+EM   G  P+  TM+S+L AC  LGA+ +G  +HVY+ K   
Sbjct: 339 IGGYTHMNLYKEALLLFQEMLRSGETPNDVTMLSILPACAHLGAIDIGRWIHVYIDKRLK 398

Query: 263 GLVHNPHASNALLDLYSKCGNIRDALKVFDEMEERSVVSWTSLIVGLAVNGLGNEALELF 322
           G+ +      +L+D+Y+KCG+I  A +VF+ +  +S+ SW ++I G A++G  + + +LF
Sbjct: 399 GVTNASSLRTSLIDMYAKCGDIEAAHQVFNSILHKSLSSWNAMIFGFAMHGRADASFDLF 458

Query: 323 GELERKGLKPSEITFVGVLYACSHCGMVEEGFDYFRRMKDEYGILPRIEHHGCIVDLLCR 382
             + + G++P +ITFVG+L ACSH GM++ G   FR M  +Y + P++EH+GC++DLL  
Sbjct: 459 SRMRKIGIQPDDITFVGLLSACSHSGMLDLGRHIFRTMTQDYKMTPKLEHYGCMIDLLGH 518

Query: 383 AGKVGDAYEYIRNISIPPNAVIWRTLLGACTIHGHLELGEVARAEVLRLEPKHSGDYVLL 442
           +G   +A E I  + + P+ VIW +LL AC +HG++ELGE     ++++EP++ G YVLL
Sbjct: 519 SGLFKEAEEMINMMEMEPDGVIWCSLLKACKMHGNVELGESFAENLIKIEPENPGSYVLL 578

Query: 443 SNLYASERRWLDVQNVRRTMLMKGVRKTPGYSLVELKNRVYEFIMGDRSHPQSEETYAML 502
           SN+YAS  RW +V   R  +  KG++K PG S +E+ + V+EFI+GD+ HP++ E Y ML
Sbjct: 579 SNIYASAGRWNEVAKTRALLNDKGMKKVPGCSSIEIDSVVHEFIIGDKFHPRNREIYGML 638

Query: 503 GKITELLKIEGYVPRTVNVLADIEEEEKETALSHHTEKVAIAFMLVNTPQRTPIRIMKNL 562
            ++  LL+  G+VP T  VL ++EEE KE AL HH+EK+AIAF L++T   T + I+KNL
Sbjct: 639 EEMEVLLEKAGFVPDTSEVLQEMEEEWKEGALRHHSEKLAIAFGLISTKPGTKLTIVKNL 698

Query: 563 RVCADCHLAIKLISKVFEREIIVRDRSRFHHFKDSSCSCRDYW 604
           RVC +CH A KLISK+++REII RDR+RFHHF+D  CSC DYW
Sbjct: 699 RVCRNCHEATKLISKIYKREIIARDRTRFHHFRDGVCSCNDYW 741

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
A8MQA32.0e-22162.46Pentatricopeptide repeat-containing protein At4g21065 OS=Arabidopsis thaliana OX... [more]
Q9CA543.3e-13639.29Pentatricopeptide repeat-containing protein At1g74630 OS=Arabidopsis thaliana OX... [more]
Q9FI801.9e-13139.31Pentatricopeptide repeat-containing protein At5g48910 OS=Arabidopsis thaliana OX... [more]
Q9LN013.2e-13143.02Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidop... [more]
Q9LW633.6e-13042.00Putative pentatricopeptide repeat-containing protein At3g23330 OS=Arabidopsis th... [more]
Match NameE-valueIdentityDescription
XP_022146486.10.0100.00pentatricopeptide repeat-containing protein At4g21065 [Momordica charantia][more]
XP_038882791.10.088.34pentatricopeptide repeat-containing protein At4g21065 [Benincasa hispida][more]
XP_023002974.10.089.07pentatricopeptide repeat-containing protein At4g21065 [Cucurbita maxima][more]
XP_022926338.10.088.91pentatricopeptide repeat-containing protein At4g21065 [Cucurbita moschata][more]
KAG6594433.10.088.56Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma sub... [more]
Match NameE-valueIdentityDescription
A0A6J1CZH80.0100.00pentatricopeptide repeat-containing protein At4g21065 OS=Momordica charantia OX=... [more]
A0A6J1KRZ90.089.07pentatricopeptide repeat-containing protein At4g21065 OS=Cucurbita maxima OX=366... [more]
A0A6J1EHS20.088.91pentatricopeptide repeat-containing protein At4g21065 OS=Cucurbita moschata OX=3... [more]
A0A5A7UEB30.086.02Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946... [more]
A0A1S3AZ160.086.02pentatricopeptide repeat-containing protein At4g21065 OS=Cucumis melo OX=3656 GN... [more]
Match NameE-valueIdentityDescription
AT4G21065.11.4e-22262.46Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT4G21065.21.6e-18966.02Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT1G74630.12.4e-13739.29Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT5G48910.11.4e-13239.31Pentatricopeptide repeat (PPR) superfamily protein [more]
AT1G08070.12.3e-13243.02Tetratricopeptide repeat (TPR)-like superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Bitter gourd (Dali-11) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 341..433
e-value: 8.5E-14
score: 53.3
coord: 158..247
e-value: 3.6E-19
score: 70.8
coord: 248..340
e-value: 1.7E-22
score: 81.7
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 9..157
e-value: 1.5E-11
score: 46.2
IPR032867DYW domainPFAMPF14432DYW_deaminasecoord: 469..593
e-value: 6.6E-37
score: 126.2
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 298..332
e-value: 2.6E-5
score: 22.1
coord: 96..130
e-value: 3.4E-6
score: 24.9
coord: 334..366
e-value: 2.3E-4
score: 19.1
coord: 197..230
e-value: 2.2E-7
score: 28.6
coord: 270..298
e-value: 4.0E-6
score: 24.6
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 93..141
e-value: 6.0E-11
score: 42.4
coord: 195..241
e-value: 7.8E-10
score: 38.8
coord: 296..342
e-value: 1.7E-8
score: 34.6
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 195..229
score: 12.079411
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 94..128
score: 11.070971
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 265..295
score: 10.183105
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 296..330
score: 11.279235
NoneNo IPR availablePANTHERPTHR47926PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEINcoord: 40..589
NoneNo IPR availablePANTHERPTHR47926:SF70PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEINcoord: 40..589

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
MC01g_new0482.1MC01g_new0482.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0005515 protein binding
molecular_function GO:0008270 zinc ion binding